BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 048503
(374 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 439
Score = 311 bits (798), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 187/413 (45%), Positives = 246/413 (59%), Gaps = 53/413 (12%)
Query: 2 QNSQKLPFYNDNETPKSPI-------------------SIIY----QAEIISVDDIYLMH 38
++S K PFYN ETP I S I+ Q+E+IS YLM
Sbjct: 36 RDSPKSPFYNPRETPTQRIVSAVRRSMSRVHHFSPTKNSDIFTDTAQSEMISNQGEYLMK 95
Query: 39 LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA 98
S+GTP DI DTGSD WTQC+PC + C++Q+ PLFDPK SSTY ISCS+ QC
Sbjct: 96 FSLGTPAFDILAIADTGSDLIWTQCKPCDQ--CYEQDAPLFDPKSSSTYRDISCSTKQCD 153
Query: 99 VVTSNCS---EGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
++ S EG+ C YS+ YG SF+SGN+A +T+T STSG PV +P I GCG
Sbjct: 154 LLKEGASCSGEGNKTCHYSYSYGD---RSFTSGNVAADTITLGSTSGRPVLLPKAIIGCG 210
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----PDQGSSKINFG-- 206
H N S T +G GP SLISQ+G++I GKFSYCL SSK+NFG
Sbjct: 211 HNNGGSFTEKGSGIVGLGGGP--ISLISQLGSTIDGKFSYCLVPLSSNATNSSKLNFGSN 268
Query: 207 GIVAGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSSS----TGNIFVDTGVLRTL 259
GIV+G GV STPLI +D Y+L+LEA+SVG++R++F SS GNI +D+G TL
Sbjct: 269 GIVSGGGVQSTPLISKDPDTFYFLTLEAVSVGSERIKFPGSSFGTSEGNIIIDSGTTLTL 328
Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLS 319
P ++ S L S + + + PV+ +P LCY+I + KFP +T HF GADVKL+
Sbjct: 329 FPEDFFSELSSAVQDAVAGTPVE----DPSGILSLCYSIDADLKFPSITAHFDGADVKLN 384
Query: 320 PSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
P N F +SD ++C AF N+ ++G + Q+NFL+GYD+E VSFKP+ CT
Sbjct: 385 PLNTFVQVSDTVLCFAFNPINSGAIFGNLAQMNFLVGYDLEGKTVSFKPTDCT 437
>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 298 bits (762), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 173/407 (42%), Positives = 241/407 (59%), Gaps = 53/407 (13%)
Query: 8 PFYNDNETP--------KSPISIIYQAEIISVDDI---------------YLMHLSIGTP 44
PFYN ET + IS ++ + I+ + YLM LS+GTP
Sbjct: 45 PFYNSEETDLQRINNALRRSISRVHHFDPIAAASVSPKAAESDVTSNRGEYLMSLSLGTP 104
Query: 45 PVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV-TSN 103
P I G DTGSD WTQC+PC C+KQ PLFDPK S TY SC + QC+++ S
Sbjct: 105 PFKIMGIADTGSDLIWTQCKPCER--CYKQVDPLFDPKSSKTYRDFSCDARQCSLLDQST 162
Query: 104 CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSD 163
CS C Y + YG +Y + GN+A++T+T +ST+G PV P + GCGH+N T
Sbjct: 163 CSGNICQYQYSYGDRSY---TMGNVASDTITLDSTTGSPVSFPKTVIGCGHEN--DGTFS 217
Query: 164 SKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----PDQGSSKINFG--GIVAGAGVVS 216
K +GI+GLG G SLISQMG+S+ GKFSYCL SSK+NFG +V+G GV S
Sbjct: 218 DKGSGIVGLGAGPLSLISQMGSSVGGKFSYCLVPLSSRAGNSSKLNFGSNAVVSGPGVQS 277
Query: 217 TPLI----IRDHYYLSLEAISVGNQRLEFVSSS----TGNIFVDTGVLRTLLPLEYHSNL 268
TPL+ + Y+L+LEA+SVGN+R++F SS GNI +D+G T++P ++ SNL
Sbjct: 278 TPLLSSETMSSFYFLTLEAMSVGNERIKFGDSSLGTGEGNIIIDSGTTLTIVPDDFFSNL 337
Query: 269 KSVMSNMIKAQPVKGVGAEP-GFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNLFRNI 327
+ + N ++ + + +P GF V CY+ +S K P +T HF GADVKL P N F +
Sbjct: 338 STAVGNQVEGRRAE----DPSGFLSV-CYSATSDLKVPAITAHFTGADVKLKPINTFVQV 392
Query: 328 SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
SD+++C AF + I +YG + Q+NFL+ Y+I+ +SFKP+ CT
Sbjct: 393 SDDVVCLAFASTTSGISIYGNVAQMNFLVEYNIQGKSLSFKPTDCTK 439
>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 440
Score = 294 bits (752), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 182/411 (44%), Positives = 246/411 (59%), Gaps = 52/411 (12%)
Query: 2 QNSQKLPFYNDNETPKS--------PISIIYQAEIISVDDI---------------YLMH 38
++S K PFYN ET +S ++ IS D YLM+
Sbjct: 38 RDSPKSPFYNPTETSSQRLRNAIHRSVSRVFHFTDISQKDASDNAPQIDLTSNSGEYLMN 97
Query: 39 LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA 98
+S+GTPP I DTGSD WTQC+PC DC+ Q PLFDPK SSTY +SCSSSQC
Sbjct: 98 ISLGTPPFPIMAIADTGSDLLWTQCKPCD--DCYTQVDPLFDPKASSTYKDVSCSSSQCT 155
Query: 99 VV--TSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
+ ++CS D CSYS YG +Y + GN+A +TLT ST PV++ N+I GCGH
Sbjct: 156 ALENQASCSTEDNTCSYSTSYGDRSY---TKGNIAVDTLTLGSTDTRPVQLKNIIIGCGH 212
Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----PDQGSSKINFG--G 207
N T + K +GI+GLG G SLI+Q+G SI GKFSYCL + +SKINFG
Sbjct: 213 NNAG--TFNKKGSGIVGLGGGAVSLITQLGDSIDGKFSYCLVPLTSENDRTSKINFGTNA 270
Query: 208 IVAGAGVVSTPLIIRDH---YYLSLEAISVGNQRLEFVSSSTG----NIFVDTGVLRTLL 260
+V+G GVVSTPLI + YYL+L++ISVG++ +++ S +G NI +D+G TLL
Sbjct: 271 VVSGTGVVSTPLIAKSQETFYYLTLKSISVGSKEVQYPGSDSGSGEGNIIIDSGTTLTLL 330
Query: 261 PLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSP 320
P E++S L+ +++ I A+ K + G S LCY+ + K P +T+HF GADV L P
Sbjct: 331 PTEFYSELEDAVASSIDAE--KKQDPQTGLS--LCYSATGDLKVPAITMHFDGADVNLKP 386
Query: 321 SNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
SN F IS++++C AFRG + +YG + Q+NFL+GYD VSFKP+ C
Sbjct: 387 SNCFVQISEDLVCFAFRGSPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDC 437
>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 431
Score = 291 bits (745), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 177/383 (46%), Positives = 246/383 (64%), Gaps = 33/383 (8%)
Query: 7 LPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPC 66
L F ND+ +P SP Q+ I S YLM++SIGTPPV I DTGSD WTQC PC
Sbjct: 63 LQFSNDDASPNSP-----QSFITSNRGEYLMNISIGTPPVPILAIADTGSDLIWTQCNPC 117
Query: 67 PELDCFKQEPPLFDPKKSSTYNSISCSSSQC-AVVTSNCS--EGDCSYSFLYGRGAYASF 123
DC++Q PLFDPK+SSTY +SCSSSQC A+ ++CS E CSY+ YG +Y
Sbjct: 118 E--DCYQQTSPLFDPKESSTYRKVSCSSSQCRALEDASCSTDENTCSYTITYGDNSY--- 172
Query: 124 SSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQM 183
+ G++A +T+T S+ PV + N+I GCGH+N T D +GIIGLG G++SL+SQ+
Sbjct: 173 TKGDVAVDTVTMGSSGRRPVSLRNMIIGCGHENTG--TFDPAGSGIIGLGGGSTSLVSQL 230
Query: 184 GTSIAGKFSYCL----PDQG-SSKINFG--GIVAGAGVVSTPLIIRD---HYYLSLEAIS 233
SI GKFSYCL + G +SKINFG GIV+G GVVST ++ +D +Y+L+LEAIS
Sbjct: 231 RKSINGKFSYCLVPFTSETGLTSKINFGTNGIVSGDGVVSTSMVKKDPATYYFLNLEAIS 290
Query: 234 VGNQRLEFVSS----STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPG 289
VG+++++F S+ GNI +D+G TLLP ++ L+SV+++ IKA+ V+ +P
Sbjct: 291 VGSKKIQFTSTIFGTGEGNIVIDSGTTLTLLPSNFYYELESVVASTIKAERVQ----DPD 346
Query: 290 FSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIM 349
LCY SS K P++T+HF+G DVKL N F +S+++ C AF ++G +
Sbjct: 347 GILSLCYRDSSSFKVPDITVHFKGGDVKLGNLNTFVAVSEDVSCFAFAANEQLTIFGNLA 406
Query: 350 QINFLIGYDIEQAMVSFKPSRCT 372
Q+NFL+GYD VSFK + C+
Sbjct: 407 QMNFLVGYDTVSGTVSFKKTDCS 429
>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 437
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 172/367 (46%), Positives = 233/367 (63%), Gaps = 32/367 (8%)
Query: 25 QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
Q ++ S YLM++SIGTPP I DTGSD WTQC PC DC+ Q PLFDPK S
Sbjct: 80 QIDLTSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCD--DCYTQVDPLFDPKTS 137
Query: 85 STYNSISCSSSQCAVV--TSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSG 140
STY +SCSSSQC + ++CS D CSYS YG +Y + GN+A +TLT S+
Sbjct: 138 STYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSY---TKGNIAVDTLTLGSSDT 194
Query: 141 LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL----- 195
P+++ N+I GCGH N T + K +GI+GLG G SLI Q+G SI GKFSYCL
Sbjct: 195 RPMQLKNIIIGCGHNNAG--TFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTS 252
Query: 196 -PDQGSSKINFG--GIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSSTG- 247
DQ +SKINFG IV+G+GVVSTPLI + YYL+L++ISVG++++++ S +
Sbjct: 253 KKDQ-TSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSES 311
Query: 248 ---NIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKF 304
NI +D+G TLLP E++S L+ +++ I A+ K + G S LCY+ + K
Sbjct: 312 SEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAE--KKQDPQSGLS--LCYSATGDLKV 367
Query: 305 PEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMV 364
P +T+HF GADVKL SN F +S++++C AFRG + +YG + Q+NFL+GYD V
Sbjct: 368 PVITMHFDGADVKLDSSNAFVQVSEDLVCFAFRGSPSFSIYGNVAQMNFLVGYDTVSKTV 427
Query: 365 SFKPSRC 371
SFKP+ C
Sbjct: 428 SFKPTDC 434
>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 285 bits (728), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 156/353 (44%), Positives = 222/353 (62%), Gaps = 27/353 (7%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YLM S+GTPP +++G VDTGSD W QC+PC + C+KQ P+F+P KSS+Y +I CSS
Sbjct: 87 YLMTYSVGTPPFNVYGVVDTGSDIVWLQCKPCEQ--CYKQTTPIFNPSKSSSYKNIPCSS 144
Query: 95 SQCAVV--TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
+ C V TS + C Y+ + +Y S G L+ ETLT +ST+G V P + GC
Sbjct: 145 NLCQSVRYTSCNKQNSCEYTINFSDQSY---SQGELSVETLTLDSTTGHSVSFPKTVIGC 201
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----PDQGSSKINFG- 206
GH N ++ +GI+GLG G SL +Q+ +SI GKFSYCL +SK+NFG
Sbjct: 202 GHNNRGMFQGET--SGIVGLGIGPVSLTTQLKSSIGGKFSYCLLPLLVDSNKTSKLNFGD 259
Query: 207 -GIVAGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFV---SSSTGNIFVDTGVLRTL 259
+V+G GVVSTP + +D YYL+LEA SVGN+R+EF S GNI +D+G TL
Sbjct: 260 AAVVSGDGVVSTPFVKKDPQAFYYLTLEAFSVGNKRIEFEVLDDSEEGNIILDSGTTLTL 319
Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS-QPKFPEVTIHFRGADVKL 318
LP ++NL+S ++ ++K V +P LCY+I+S Q FP +T HF+GAD+KL
Sbjct: 320 LPSHVYTNLESAVAQLVKLDRVD----DPNQLLNLCYSITSDQYDFPIITAHFKGADIKL 375
Query: 319 SPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+P + F +++D ++C AF ++G + Q+N L+GYD++Q +VSFKPS C
Sbjct: 376 NPISTFAHVADGVVCLAFTSSQTGPIFGNLAQLNLLVGYDLQQNIVSFKPSDC 428
>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
Length = 438
Score = 284 bits (727), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 172/367 (46%), Positives = 233/367 (63%), Gaps = 32/367 (8%)
Query: 25 QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
Q ++ S YLM++SIGTPP I DTGSD WTQC PC DC+ Q PLFDPK S
Sbjct: 80 QIDLTSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCD--DCYTQVDPLFDPKTS 137
Query: 85 STYNSISCSSSQCAVV--TSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSG 140
STY +SCSSSQC + ++CS D CSYS YG +Y + GN+A +TLT S+
Sbjct: 138 STYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSY---TKGNIAVDTLTLGSSDT 194
Query: 141 LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL----- 195
P+++ N+I GCGH N T + K +GI+GLG G SLI Q+G SI GKFSYCL
Sbjct: 195 RPMQLKNIIIGCGHNNAG--TFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTS 252
Query: 196 -PDQGSSKINFG--GIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSSTG- 247
DQ +SKINFG IV+G+GVVSTPLI + YYL+L++ISVG++++++ S +
Sbjct: 253 KKDQ-TSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSES 311
Query: 248 ---NIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKF 304
NI +D+G TLLP E++S L+ +++ I A+ K + G S LCY+ + K
Sbjct: 312 SEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAE--KKQDPQSGLS--LCYSATGDLKV 367
Query: 305 PEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMV 364
P +T+HF GADVKL SN F +S++++C AFRG + +YG + Q+NFL+GYD V
Sbjct: 368 PVITMHFDGADVKLDSSNAFVQVSEDLVCFAFRGSPSFSIYGNVAQMNFLVGYDTVSKTV 427
Query: 365 SFKPSRC 371
SFKP+ C
Sbjct: 428 SFKPTDC 434
>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 167/370 (45%), Positives = 227/370 (61%), Gaps = 36/370 (9%)
Query: 25 QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
++EII+ YLM LS+GTPP +I DTGSD WTQC PC + C+KQ PLFDPK S
Sbjct: 83 ESEIIANGGEYLMSLSLGTPPFEILAIADTGSDLIWTQCTPCDK--CYKQIAPLFDPKSS 140
Query: 85 STYNSISCSSSQCAVV--TSNCS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGL 141
TY +SC + QC + +S+CS E C YS+ YG SF++GNLA +T+T ST+G
Sbjct: 141 KTYRDLSCDTRQCQNLGESSSCSSEQLCQYSYYYGD---RSFTNGNLAVDTVTLPSTNGG 197
Query: 142 PVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL------ 195
PV P + GCG +N + T D K +GIIGLG G SLISQMG+S+ GKFSYCL
Sbjct: 198 PVYFPKTVIGCGRRN--NGTFDKKDSGIIGLGGGPMSLISQMGSSVGGKFSYCLVPFSSE 255
Query: 196 PDQGSSKINFG--GIVAGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSSSTG--- 247
SSK++FG +V+G+GV STPLI ++ YYL+LEA+SVG++++EF SS G
Sbjct: 256 SAGNSSKLHFGRNAVVSGSGVQSTPLISKNPDTFYYLTLEAMSVGDKKIEFGGSSFGGSE 315
Query: 248 -NIFVDTGVLRTLLPLEYHSNLKSVMSNMI----KAQPVKGVGAEPGFSDVLCYNISSQP 302
NI +D+G TL P+ + + + + N + + Q G+ + CY +
Sbjct: 316 GNIIIDSGTSLTLFPVNFFTEFATAVENAVINGERTQDASGLLSH-------CYRPTPDL 368
Query: 303 KFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQA 362
K P +T HF GADV L N F ISD+++C AF + ++G + Q+NFLIGYDI+
Sbjct: 369 KVPVITAHFNGADVVLQTLNTFILISDDVLCLAFNSTQSGAIFGNVAQMNFLIGYDIQGK 428
Query: 363 MVSFKPSRCT 372
VSFKP+ CT
Sbjct: 429 SVSFKPTDCT 438
>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 439
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 161/369 (43%), Positives = 219/369 (59%), Gaps = 30/369 (8%)
Query: 25 QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
Q+ I+ YLM+L IGTPPV + VDTGSD TWTQC PC C+KQ PLFDPK S
Sbjct: 82 QSRIVPSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTH--CYKQVVPLFDPKNS 139
Query: 85 STYNSISCSSSQCAVVTSN--CS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGL 141
STY SC +S C + + CS E C++ + Y G SF+ GNLA+ETLT +ST+G
Sbjct: 140 STYRDSSCGTSFCLALGKDRSCSKEKKCTFRYSYADG---SFTGGNLASETLTVDSTAGK 196
Query: 142 PVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----P 196
PV P FGCGH + D +GI+GLG G SLISQ+ ++I G FSYCL
Sbjct: 197 PVSFPGFAFGCGHS--SGGIFDKSSSGIVGLGGGELSLISQLKSTINGLFSYCLLPVSTD 254
Query: 197 DQGSSKINFG--GIVAGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSSST----- 246
SS+INFG G V+G G VSTPL+ + YYL+LE ISVG +RL + S
Sbjct: 255 SSISSRINFGASGRVSGYGTVSTPLVQKSPDTFYYLTLEGISVGKKRLPYKGYSKKTEVE 314
Query: 247 -GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFP 305
GNI VD+G T LP E++S L+ ++N IK + V+ +P LCYN +++ P
Sbjct: 315 EGNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVR----DPNGIFSLCYNTTAEINAP 370
Query: 306 EVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVS 365
+T HF+ A+V+L P N F + ++++C + V G + Q+NFL+G+D+ + VS
Sbjct: 371 IITAHFKDANVELQPLNTFMRMQEDLVCFTVAPTSDIGVLGNLAQVNFLVGFDLRKKRVS 430
Query: 366 FKPSRCTNY 374
FK + CT +
Sbjct: 431 FKAADCTQH 439
>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 158/358 (44%), Positives = 224/358 (62%), Gaps = 30/358 (8%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YLM S+GTPP I+G DTGSD W QCEPC + C+ Q P+F+P KSS+Y +I CSS
Sbjct: 87 YLMTYSVGTPPTKIYGIADTGSDIVWLQCEPCEQ--CYNQTTPIFNPSKSSSYKNIPCSS 144
Query: 95 SQC-AVVTSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
C +V ++CS+ + C Y YG +S S G+L+ +TL+ STSG PV P ++ GC
Sbjct: 145 KLCHSVRDTSCSDQNSCQYKISYGD---SSHSQGDLSVDTLSLESTSGSPVSFPKIVIGC 201
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL------PDQGSSKINFG 206
G N T +GI+GLG G SLI+Q+G+SI GKFSYCL SS ++FG
Sbjct: 202 GTDNAG--TFGGASSGIVGLGGGPVSLITQLGSSIGGKFSYCLVPLLNKESNASSILSFG 259
Query: 207 --GIVAGAGVVSTPLIIRD--HYYLSLEAISVGNQRLEFVSSS-----TGNIFVDTGVLR 257
+V+G GVVSTPLI +D Y+L+L+A SVGN+R+EF SS GNI +D+G
Sbjct: 260 DAAVVSGDGVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGGSSEGGDDEGNIIIDSGTTL 319
Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI-SSQPKFPEVTIHFRGADV 316
TL+P + ++NL+S + +++K V +P LCY++ S++ FP +T+HF+GADV
Sbjct: 320 TLIPSDVYTNLESAVVDLVKLDRVD----DPNQQFSLCYSLKSNEYDFPIITVHFKGADV 375
Query: 317 KLSPSNLFRNISDEIMCSAFR-GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
+L + F I+D I+C AF+ ++G + Q N L+GYD++Q VSFKP+ CT
Sbjct: 376 ELHSISTFVPITDGIVCFAFQPSPQLGSIFGNLAQQNLLVGYDLQQKTVSFKPTDCTK 433
>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 438
Score = 271 bits (692), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 156/382 (40%), Positives = 228/382 (59%), Gaps = 33/382 (8%)
Query: 13 NETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCF 72
N + K+ ++ ++ +IS + Y+M S+GTPP+ +G VDTGSD W QCEPC + C+
Sbjct: 65 NHSNKNSLASTPESTVISYEGDYIMSYSVGTPPIKSYGIVDTGSDIVWLQCEPCEQ--CY 122
Query: 73 KQEPPLFDPKKSSTYNSISCSSSQCAVV--TSNCSEGDCSYSFLYGRGAYASFSSGNLAT 130
Q P F+P KSS+Y +ISCSS C V TS + +C YS YG ++ S G+L+
Sbjct: 123 NQTTPKFNPSKSSSYKNISCSSKLCQSVRDTSCNDKKNCEYSINYGNQSH---SQGDLSL 179
Query: 131 ETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK 190
ETLT ST+G PV P + GCG N+ S S +G GP +SLI+Q+G SI GK
Sbjct: 180 ETLTLESTTGRPVSFPKTVIGCGTNNIGSFKRVSSGVVGLGGGP--ASLITQLGPSIGGK 237
Query: 191 FSYCLP---------DQGSSKINFG--GIVAGAGVVSTPLIIRDH---YYLSLEAISVGN 236
FSYCL GSSK+NFG IV+G V+STP++ +DH YYL++EA SVG+
Sbjct: 238 FSYCLVRMSITLKNMSMGSSKLNFGDVAIVSGHNVLSTPIVKKDHSFFYYLTIEAFSVGD 297
Query: 237 QRLEFVSSSTG----NIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSD 292
+R+EF SS G NI +D+ + T +P + ++ L S + +++ + V +P
Sbjct: 298 KRVEFAGSSKGVEEGNIIIDSSTIVTFVPSDVYTKLNSAIVDLVTLERVD----DPNQQF 353
Query: 293 VLCYNISSQPK--FPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQ 350
LCYN+SS + FP +T HF+GAD+ L +N F ++ +++C AF N ++G Q
Sbjct: 354 SLCYNVSSDEEYDFPYMTAHFKGADILLYATNTFVEVARDVLCFAFAPSNGGAIFGSFSQ 413
Query: 351 INFLIGYDIEQAMVSFKPSRCT 372
+F++GYD++Q VSFK CT
Sbjct: 414 QDFMVGYDLQQKTVSFKSVDCT 435
>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 436
Score = 270 bits (691), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 153/381 (40%), Positives = 232/381 (60%), Gaps = 32/381 (8%)
Query: 13 NETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCF 72
N K ++ I Q+ +I YLM S+GTPP ++G VDTGSD W QCEPC E C+
Sbjct: 65 NHFYKYSLANIPQSTVIPDIGEYLMTYSVGTPPFKLYGIVDTGSDIVWLQCEPCQE--CY 122
Query: 73 KQEPPLFDPKKSSTYNSISCSSSQC-AVVTSNCSEGD-CSYSFLYGRGAYASFSSGNLAT 130
Q P+F+P KSS+Y +I C S C ++ ++C++ + C YS YG ++ S G+L+
Sbjct: 123 NQTTPMFNPSKSSSYKNIPCPSKLCQSMEDTSCNDKNYCEYSTYYGDNSH---SGGDLSV 179
Query: 131 ETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK 190
+TLT ST+GL V PN++ GCG N+ S + +GI+G G G +S I+Q+G+S GK
Sbjct: 180 DTLTLESTNGLTVSFPNIVIGCGTNNILS--YEGASSGIVGFGSGPASFITQLGSSTGGK 237
Query: 191 FSYCLP---------DQGSSKINFG--GIVAGAGVVSTPLIIRD---HYYLSLEAISVGN 236
FSYCL +SK+NFG V+G GVV+TP++ +D YYL+LEA SVGN
Sbjct: 238 FSYCLTPLFSVTNIQSNATSKLNFGDAATVSGDGVVTTPILKKDPETFYYLTLEAFSVGN 297
Query: 237 QRLEF----VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSD 292
+R+E + GNI +D+G T L + +S L+S + +++K + V +P +
Sbjct: 298 RRVEIGGVPNGDNEGNIIIDSGTTLTSLTKDDYSFLESAVVDLVKLERVD----DPTQTL 353
Query: 293 VLCYNISSQP-KFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQI 351
LCY++ ++ FP +T+HF+GADV L P + F +++D + C AF + ++G + Q
Sbjct: 354 NLCYSVKAEGYDFPIITMHFKGADVDLHPISTFVSVADGVFCLAFESSQDHAIFGNLAQQ 413
Query: 352 NFLIGYDIEQAMVSFKPSRCT 372
N ++GYD++Q +VSFKPS CT
Sbjct: 414 NLMVGYDLQQKIVSFKPSDCT 434
>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 435
Score = 267 bits (683), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 168/406 (41%), Positives = 237/406 (58%), Gaps = 47/406 (11%)
Query: 2 QNSQKLPFYNDNETPKSPI-SIIY----------------------QAEIISVDDIYLMH 38
++S K PFYN ETP I + I+ Q +I YLM+
Sbjct: 38 RDSPKSPFYNPAETPSQRIRNAIHRSFNRVSHFTDLSEMDASLNSPQTDITPCGGEYLMN 97
Query: 39 LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA 98
LS+GTPP I DTGS+ WTQC+PC DC+ Q PLFDPK SSTY +SCSSSQC
Sbjct: 98 LSLGTPPSPIMAVADTGSNLIWTQCKPCD--DCYTQVDPLFDPKASSTYKDVSCSSSQCT 155
Query: 99 VV--TSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
+ ++CS D CSY Y G+Y + G A +TLT ST PV++ N+I GCG
Sbjct: 156 ALENQASCSTEDKTCSYLVSYADGSY---TMGKFAVDTLTLGSTDNRPVQLKNIIIGCGQ 212
Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL--PDQGSSKINFG--GIVA 210
N T +K +G++GLG G SLI Q+G SI GKFSYCL + +SKINFG +V+
Sbjct: 213 NNAV--TFRNKSSGVVGLGGGAVSLIKQLGDSIDGKFSYCLVPENDQTSKINFGTNAVVS 270
Query: 211 GAGVVSTPLIIRDH---YYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLLPLEYHS 266
G G VSTPL+++ YYL+L++ISVG++ ++ S+ GN+ +D+G TLLP++Y+
Sbjct: 271 GPGTVSTPLVVKSRDTFYYLTLKSISVGSKNMQTPDSNIKGNMVIDSGTTLTLLPVKYYI 330
Query: 267 NLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNLFRN 326
+++ ++++I A K + LCYN ++ P +T+HF GADVKL P N F
Sbjct: 331 EIENAVASLINADKSK----DERIGSSLCYNATADLNIPVITMHFEGADVKLYPYNSFFK 386
Query: 327 ISDEIMCSAF-RGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
++++++C AF N +YG + Q NFL+GYD +SFKP+ C
Sbjct: 387 VTEDLVCLAFGMSFYRNGIYGNVAQKNFLVGYDTASKTMSFKPTDC 432
>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 267 bits (682), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 156/358 (43%), Positives = 221/358 (61%), Gaps = 30/358 (8%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YLM S+GTPP I+G DTGSD W QCEPC + C+ Q P+F+P KSS+Y +I C S
Sbjct: 87 YLMTYSVGTPPTKIYGIADTGSDIVWLQCEPCEQ--CYNQTTPIFNPSKSSSYKNIPCLS 144
Query: 95 SQC-AVVTSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
C +V ++CS+ + C Y YG +S S G+L+ +TL+ STSG PV P + GC
Sbjct: 145 KLCHSVRDTSCSDQNSCQYKISYGD---SSHSQGDLSVDTLSLESTSGSPVSFPKTVIGC 201
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL------PDQGSSKINFG 206
G N T +GI+GLG G SLI+Q+G+SI GKFSYCL SS ++FG
Sbjct: 202 GTDNAG--TFGGASSGIVGLGGGPVSLITQLGSSIGGKFSYCLVPLLNKESNASSILSFG 259
Query: 207 --GIVAGAGVVSTPLIIRD--HYYLSLEAISVGNQRLEFVSSS-----TGNIFVDTGVLR 257
+V+G GVVSTPLI +D Y+L+L+A SVGN+R+EF SS GNI +D+G
Sbjct: 260 DAAVVSGDGVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGGSSEGGDDEGNIIIDSGTTL 319
Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI-SSQPKFPEVTIHFRGADV 316
TL+P + ++NL+S + +++K V +P LCY++ S++ FP +T HF+GAD+
Sbjct: 320 TLIPSDVYTNLESAVVDLVKLDRVD----DPNQQFSLCYSLKSNEYDFPIITAHFKGADI 375
Query: 317 KLSPSNLFRNISDEIMCSAFR-GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
+L + F I+D I+C AF+ ++G + Q N L+GYD++Q VSFKP+ CT
Sbjct: 376 ELHSISTFVPITDGIVCFAFQPSPQLGSIFGNLAQQNLLVGYDLQQKTVSFKPTDCTK 433
>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 156/391 (39%), Positives = 224/391 (57%), Gaps = 52/391 (13%)
Query: 3 NSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQ 62
N+ + N K+ ++ Q+ +I YLM S+GTPP ++G DTGSD W Q
Sbjct: 55 NAARRSINRANHFYKTALTNTPQSTVIPDHGEYLMTYSVGTPPFKLYGIADTGSDIVWLQ 114
Query: 63 CEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYAS 122
CEPC E C+ Q P F P KSSTY +I CSS C S
Sbjct: 115 CEPCKE--CYNQTTPKFKPSKSSTYKNIPCSSDLC-----------------------KS 149
Query: 123 FSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQ 182
GNL+ +TLT S++G P+ P + GCG N S + +GI+GLG G +SLI+Q
Sbjct: 150 GQQGNLSVDTLTLESSTGHPISFPKTVIGCGTDNTVS--FEGASSGIVGLGGGPASLITQ 207
Query: 183 MGTSIAGKFSYCL-----PDQGSSKINFG--GIVAGAGVVSTPLIIRD---HYYLSLEAI 232
+G+SI KFSYCL +SK+NFG +V+G GVVSTP++ +D YYL+LEA
Sbjct: 208 LGSSIDAKFSYCLLPNPVESNTTSKLNFGDTAVVSGDGVVSTPIVKKDPIVFYYLTLEAF 267
Query: 233 SVGNQRLEFVSSST----GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEP 288
SVGN+R+EF SS GNI +D+G T++P + ++NL+S + ++K + V +P
Sbjct: 268 SVGNKRIEFEGSSNGGHEGNIIIDSGTTLTVIPTDVYNNLESAVLELVKLKRVN----DP 323
Query: 289 GFSDVLCYNISSQP-KFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANI---- 343
LCY+++S FP +T HF+GADVKL P + F +++D I+C AF +A I
Sbjct: 324 TRLFNLCYSVTSDGYDFPIITTHFKGADVKLHPISTFVDVADGIVCLAFATTSAFIPSDV 383
Query: 344 --VYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
++G + Q N L+GYD++Q +VSFKP+ C+
Sbjct: 384 VSIFGNLAQQNLLVGYDLQQKIVSFKPTDCS 414
>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 441
Score = 263 bits (673), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 155/369 (42%), Positives = 218/369 (59%), Gaps = 32/369 (8%)
Query: 25 QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
Q+ ++ Y+M+LSIGTPPV + VDTGSD TWTQC PC C+KQ P FDPK S
Sbjct: 82 QSRLVPSAGEYIMNLSIGTPPVPVIAIVDTGSDLTWTQCRPCTH--CYKQVVPFFDPKNS 139
Query: 85 STYNSISCSSSQCAVVTSN--CSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGL 141
STY SC +S C + ++ C G C++ + Y G SF+ GNLA ETLT ST+G
Sbjct: 140 STYRDSSCGTSFCLALGNDRSCRNGKKCTFMYSYADG---SFTGGNLAVETLTVASTAGK 196
Query: 142 PVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----P 196
PV P FGC H+ + D +GI+GLG S+ISQ+ ++I G+FSYCL
Sbjct: 197 PVSFPGFAFGCVHR--SGGIFDEHSSGIVGLGVAELSMISQLKSTINGRFSYCLLPVFTD 254
Query: 197 DQGSSKINFG--GIVAGAGVVSTPLIIR--DHYY--LSLEAISVGNQRLEFVSSST---- 246
SS+INFG GIV+GAG VSTPL+++ D YY ++LE SVG +RL + S
Sbjct: 255 SSMSSRINFGRSGIVSGAGTVSTPLVMKGPDTYYYLITLEGFSVGKKRLSYKGFSKKAEV 314
Query: 247 --GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS-SQPK 303
GNI VD+G T LPLE++ L+ +++ IK + V+ +P LCYN + Q
Sbjct: 315 EEGNIIVDSGTTYTYLPLEFYVKLEESVAHSIKGKRVR----DPNGISSLCYNTTVDQID 370
Query: 304 FPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAM 363
P +T HF+ A+V+L P N F + ++++C + + G + Q+NFL+G+D+ +
Sbjct: 371 APIITAHFKDANVELQPWNTFLRMQEDLVCFTVLPTSDIGILGNLAQVNFLVGFDLRKKR 430
Query: 364 VSFKPSRCT 372
VSFK + CT
Sbjct: 431 VSFKAADCT 439
>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 151/359 (42%), Positives = 211/359 (58%), Gaps = 33/359 (9%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YLM LS+GTPP I DTGSD WTQCEPC +C++Q+ P+F+P KS+TY +SCSS
Sbjct: 85 YLMKLSVGTPPFPIIAVADTGSDIIWTQCEPC--TNCYQQDLPMFNPSKSTTYRKVSCSS 142
Query: 95 SQCAVV--TSNCS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
C+ ++CS + DC+YS YG ++ S G+ A +TLT STSG V P G
Sbjct: 143 PVCSFTGEDNSCSFKPDCTYSISYGDNSH---SQGDFAVDTLTMGSTSGRVVAFPRTAIG 199
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP-----DQGSSKINFG 206
CGH N S D+ +GI+GLG G +SLI QMG+++ GKFSYCL D GS+K+NFG
Sbjct: 200 CGHDNAGS--FDANVSGIVGLGLGPASLIKQMGSAVGGKFSYCLTPIGNDDGGSNKLNFG 257
Query: 207 --GIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSST-----GNIFVDTGV 255
V+G+G VSTP+ I D Y L L+A+SVG + ++++ NI +D+G
Sbjct: 258 SNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFYSTANSILGGKANIIIDSGT 317
Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP-KFPEVTIHFRGA 314
TLLP++ + N +SN I Q +P C+ ++ K P + +HF GA
Sbjct: 318 TLTLLPVDLYHNFAKAISNSINLQRTD----DPNQFLEYCFETTTDDYKVPFIAMHFEGA 373
Query: 315 DVKLSPSNLFRNISDEIMCSAFRGGNAN--IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+++L N+ +SD ++C AF G N +YG I QINFL+GYD+ +SFKP C
Sbjct: 374 NLRLQRENVLIRVSDNVICLAFAGAQDNDISIYGNIAQINFLVGYDVTNMSLSFKPMNC 432
>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 150/359 (41%), Positives = 210/359 (58%), Gaps = 33/359 (9%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YLM LS+GTPP I DTGSD WTQC PC +C++Q+ P+F+P KS+TY +SCSS
Sbjct: 85 YLMKLSVGTPPFPIIAVADTGSDIIWTQCVPC--TNCYQQDLPMFNPSKSTTYRKVSCSS 142
Query: 95 SQCAVV--TSNCS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
C+ ++CS + DC+YS YG ++ S G+ A +TLT STSG V P G
Sbjct: 143 PVCSFTGEDNSCSFKPDCTYSISYGDNSH---SQGDFAVDTLTMGSTSGRVVAFPRTAIG 199
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP-----DQGSSKINFG 206
CGH N S D+ +GI+GLG G +SLI QMG+++ GKFSYCL D GS+K+NFG
Sbjct: 200 CGHDNAGS--FDANVSGIVGLGLGPASLIKQMGSAVGGKFSYCLTPIGNDDGGSNKLNFG 257
Query: 207 --GIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSST-----GNIFVDTGV 255
V+G+G VSTP+ I D Y L L+A+SVG + ++++ NI +D+G
Sbjct: 258 SNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFYSTANSILGGKANIIIDSGT 317
Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP-KFPEVTIHFRGA 314
TLLP++ + N +SN I Q +P C+ ++ K P + +HF GA
Sbjct: 318 TLTLLPVDLYHNFAKAISNSINLQRTD----DPNQFLEYCFETTTDDYKVPFIAMHFEGA 373
Query: 315 DVKLSPSNLFRNISDEIMCSAFRGGNAN--IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+++L N+ +SD ++C AF G N +YG I QINFL+GYD+ +SFKP C
Sbjct: 374 NLRLQRENVLIRVSDNVICLAFAGAQDNDISIYGNIAQINFLVGYDVTNMSLSFKPMNC 432
>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 447
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 155/372 (41%), Positives = 227/372 (61%), Gaps = 34/372 (9%)
Query: 25 QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
Q+ +IS + YLM++S+GTPPV + G DTGSD W QC+PC C++Q P+FDP KS
Sbjct: 85 QSPVISNNGEYLMNISLGTPPVSMHGIADTGSDLLWRQCKPCDS--CYEQIEPIFDPAKS 142
Query: 85 STYNSISCSSSQCAVV--TSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGL 141
TY +SC C+ + CS+ + C YS+ YG G++ +SG+LA +TLT ST+G
Sbjct: 143 KTYQILSCEGKSCSNLGGQGGCSDDNTCIYSYSYGDGSH---TSGDLAVDTLTIGSTTGR 199
Query: 142 PVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG-- 199
PV +P V+FGCGH N T + +G++GLG G S+ISQ+ I G+FSYCL G
Sbjct: 200 PVSVPKVVFGCGHNN--GGTFELHGSGLVGLGGGPLSMISQLRPLIGGRFSYCLVPLGND 257
Query: 200 ---SSKINFG--GIVAGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEF---------- 241
SSK++FG GIV+GAG VSTPL R YYL+LE++SVG+++L +
Sbjct: 258 PSVSSKMHFGSRGIVSGAGAVSTPLASRQPDTFYYLTLESMSVGSKKLAYKGFSKVGSPL 317
Query: 242 VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ 301
+ GNI +D+G TLLP +++ L+S + + I +PV+ +P LCY+ S
Sbjct: 318 ADADEGNIIIDSGTTLTLLPQDFYGTLESNVVSAIGGKPVR----DPNNVFSLCYSNLSG 373
Query: 302 PKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQ 361
+ P +T HF GAD++L P N F + +++ C A + ++G + Q+NFL+GYD++
Sbjct: 374 LRIPTITAHFVGADLELKPLNTFVQVQEDLFCFAMIPVSDLAIFGNLAQMNFLVGYDLKS 433
Query: 362 AMVSFKPSRCTN 373
VSFKP+ CT
Sbjct: 434 RTVSFKPTDCTK 445
>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 434
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 152/368 (41%), Positives = 217/368 (58%), Gaps = 31/368 (8%)
Query: 25 QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
+A I D YL+ S+G PP ++G +DTGSD W QC+PC + C+ Q +FDP KS
Sbjct: 76 KATITQNDGEYLISYSVGIPPFQLYGIIDTGSDMIWLQCKPCEK--CYNQTTRIFDPSKS 133
Query: 85 STYNSISCSSSQC-AVVTSNCSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSG 140
+TY + SS+ C +V ++CS + C Y+ YG G+Y S G+L+ ETLT ST+G
Sbjct: 134 NTYKILPFSSTTCQSVEDTSCSSDNRKMCEYTIYYGDGSY---SQGDLSVETLTLGSTNG 190
Query: 141 LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQM---GTSIAGKFSYCLPD 197
V+ + GCG N S + K +GI+GLG G SLI+Q+ +SI KFSYCL
Sbjct: 191 SSVKFRRTVIGCGRNNTVS--FEGKSSGIVGLGNGPVSLINQLRRRSSSIGRKFSYCLAS 248
Query: 198 QG--SSKINFG--GIVAGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSSS----- 245
SSK+NFG +V+G G VSTP++ D YYL+LEA SVGN R+EF SSS
Sbjct: 249 MSNISSKLNFGDAAVVSGDGTVSTPIVTHDPKVFYYLTLEAFSVGNNRIEFTSSSFRFGE 308
Query: 246 TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS-SQPKF 304
GNI +D+G TLLP + +S L+S ++++++ VK +P LCY + +
Sbjct: 309 KGNIIIDSGTTLTLLPNDIYSKLESAVADLVELDRVK----DPLKQLSLCYRSTFDELNA 364
Query: 305 PEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMV 364
P + HF GADVKL+ N F + + C AF ++G + Q NFL+GYD+++ +V
Sbjct: 365 PVIMAHFSGADVKLNAVNTFIEVEQGVTCLAFISSKIGPIFGNMAQQNFLVGYDLQKKIV 424
Query: 365 SFKPSRCT 372
SFKP+ C+
Sbjct: 425 SFKPTDCS 432
>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 439
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 167/414 (40%), Positives = 230/414 (55%), Gaps = 55/414 (13%)
Query: 3 NSQKLPFYNDNETPKSPISIIYQAEI-----------ISVDDI------------YLMHL 39
+S + PFYN ET IS + I +S +D+ Y+M
Sbjct: 35 DSSRSPFYNIRETQLQRISNVVTHSIKRAHYLNHVFSLSHNDLPKPTIIPYAGSYYVMSY 94
Query: 40 SIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAV 99
SIGTPP ++G VDTGSD W QC+PC C Q P+F+P KSSTY +I CSS C
Sbjct: 95 SIGTPPFQLYGVVDTGSDGIWFQCKPCKP--CLNQTSPIFNPSKSSTYKNIRCSSPICKR 152
Query: 100 -VTSNCS---EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHK 155
+ CS + C Y Y S S G+++ +TLT NS G P+ P ++ GCGHK
Sbjct: 153 GEKTRCSSNRKRKCEYEITY---LDRSGSQGDISKDTLTLNSNDGSPISFPKIVIGCGHK 209
Query: 156 NLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD-----QGSSKINFG--GI 208
N S T++ +GIIG G GN S++SQ+G+SI GKFSYCL SSK+ FG +
Sbjct: 210 N--SLTTEGLASGIIGFGRGNFSIVSQLGSSIGGKFSYCLASLFSKANISSKLYFGDMAV 267
Query: 209 VAGAGVVSTPLI---IRDHYYLSLEAISVGNQRLEFVSSS-----TGNIFVDTGVLRTLL 260
V+G GVVSTPLI +Y+ +LEA SVG+ ++ SS GN +D+G T L
Sbjct: 268 VSGHGVVSTPLIQSFYVGNYFTNLEAFSVGDHIIKLKDSSLIPDNEGNAVIDSGSTITQL 327
Query: 261 PLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS-SQPKFPEVTIHFRGADVKLS 319
P + +S L++ + +M+K + VK +P LCY + + + P +T HFRGADVKL+
Sbjct: 328 PNDVYSQLETAVISMVKLKRVK----DPTQQLSLCYKTTLKKYEVPIITAHFRGADVKLN 383
Query: 320 PSNLFRNISDEIMCSAFRGGN-ANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
N F ++ E+MC AF +VYG I Q NFL+GYD + ++SFKP+ CT
Sbjct: 384 AFNTFIQMNHEVMCFAFNSSAFPWVVYGNIAQQNFLVGYDTLKNIISFKPTNCT 437
>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 420
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 160/392 (40%), Positives = 226/392 (57%), Gaps = 28/392 (7%)
Query: 1 AQNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTW 60
+ ++ LP E ++ Q+ I + YLM LSIGTPP I+G DTGSD TW
Sbjct: 38 SSHAHVLPLRRLMELSAMEKTLTPQSPIYAYLGHYLMELSIGTPPFKIYGIADTGSDLTW 97
Query: 61 TQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV-TSNCS-EGDCSYSFLYGRG 118
T C PC +C+KQ P+FDP+KS+TY +ISC S C + T CS + C+Y++ Y
Sbjct: 98 TSCVPCN--NCYKQRNPMFDPQKSTTYRNISCDSKLCHKLDTGVCSPQKRCNYTYAY--- 152
Query: 119 AYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSS 178
A A+ + G LA ET+T +ST G V + ++FGCGH N + + GIIGLG G S
Sbjct: 153 ASAAITRGVLAQETITLSSTKGKSVPLKGIVFGCGHNNTGG--FNDHEMGIIGLGGGPVS 210
Query: 179 LISQMGTSIAGK-FSYCL-PDQG----SSKINF--GGIVAGAGVVSTPLIIRDH---YYL 227
LISQMG+S GK FS CL P SSK++F G V+G GVVSTPL+ + Y++
Sbjct: 211 LISQMGSSFGGKRFSQCLVPFHTDVSVSSKMSFGKGSKVSGKGVVSTPLVAKQDKTPYFV 270
Query: 228 SLEAISVGNQRLEFVSSS----TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKG 283
+L ISV N L F SS GN+F+D+G T+LP + + + + + + + +PV
Sbjct: 271 TLLGISVENTYLHFNGSSQNVEKGNMFLDSGTPPTILPTQLYDQVVAQVRSEVAMKPVTD 330
Query: 284 VGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNAN- 342
+P LCY + + P +T HF GADVKLSP+ F + D + C F +++
Sbjct: 331 ---DPDLGPQLCYRTKNNLRGPVLTAHFEGADVKLSPTQTFISPKDGVFCLGFTNTSSDG 387
Query: 343 IVYGRIMQINFLIGYDIEQAMVSFKPSRCTNY 374
VYG Q N+LIG+D+++ +VSFKP CT +
Sbjct: 388 GVYGNFAQSNYLIGFDLDRQVVSFKPKDCTKH 419
>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 440
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 158/408 (38%), Positives = 234/408 (57%), Gaps = 47/408 (11%)
Query: 2 QNSQKLPFYNDNETPKSPISIIY----------------QAEIISVDDIYLMHLSIGTPP 45
++S K P YN +ETP + + + + S + YLM +SIGTPP
Sbjct: 42 RDSPKSPLYNPSETPAERLDRFFRRFMSFSEASISPNTPEPPVSSNNGEYLMKISIGTPP 101
Query: 46 VDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV-TSNC 104
D++G DTGSD WTQC PC L C+KQ+ P+FDP KS+++ +SC S QC ++ T +C
Sbjct: 102 FDVYGIYDTGSDLMWTQCLPC--LSCYKQKNPMFDPSKSTSFKEVSCESQQCRLLDTVSC 159
Query: 105 SEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTS 162
S+ C +S+ YG G S + G +ATETLT NS SG P + N++FGCGH N S T
Sbjct: 160 SQPQKLCDFSYGYGDG---SLAQGVIATETLTLNSNSGQPTSILNIVFGCGHNN--SGTF 214
Query: 163 DSKQTGIIGLGPGNSSLISQMGTSIAG--KFSYCL-PDQG----SSKINFG--GIVAGAG 213
+ + G+ G G SL SQ+ +++ KFS CL P + +SKI FG V+G+
Sbjct: 215 NENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSD 274
Query: 214 VVSTPLIIRD---HYYLSLEAISVGNQRLEFVSSS----TGNIFVDTGVLRTLLPLEYHS 266
VVSTPL+ +D +Y+++L+ ISVG++ F SSS GN+F+D G TLLP ++++
Sbjct: 275 VVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSSSPMATKGNVFIDAGTPPTLLPRDFYN 334
Query: 267 NLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNLFRN 326
L + I +PV+ +P LCY ++ P +T HF GADV+L P N F +
Sbjct: 335 RLVQGVKEAIPMEPVQ----DPDLQPQLCYRSATLIDGPILTAHFDGADVQLKPLNTFIS 390
Query: 327 ISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
+ + C A + + + ++G +Q+NFLIG+D++ VSFK CT
Sbjct: 391 PKEGVYCFAMQPIDGDTGIFGNFVQMNFLIGFDLDGKKVSFKAVDCTK 438
>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
Length = 440
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 158/408 (38%), Positives = 234/408 (57%), Gaps = 47/408 (11%)
Query: 2 QNSQKLPFYNDNETPKSPISIIY----------------QAEIISVDDIYLMHLSIGTPP 45
++S K P YN +ETP + + + + S + YLM +SIGTPP
Sbjct: 42 RDSPKSPLYNPSETPAERLDRFFRRFMSFSEASISPNTPEPPVSSNNGEYLMKISIGTPP 101
Query: 46 VDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV-TSNC 104
D++G DTGSD WTQC PC L C+KQ+ P+FDP KS+++ +SC S QC ++ T +C
Sbjct: 102 FDVYGIYDTGSDLMWTQCLPC--LSCYKQKNPMFDPSKSTSFKEVSCESQQCRLLDTVSC 159
Query: 105 SEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTS 162
S+ C +S+ YG G S + G +ATETLT NS SG P + N++FGCGH N S T
Sbjct: 160 SQPQKLCDFSYGYGDG---SLAQGVIATETLTLNSNSGQPXSIXNIVFGCGHNN--SGTF 214
Query: 163 DSKQTGIIGLGPGNSSLISQMGTSIAG--KFSYCL-PDQG----SSKINFG--GIVAGAG 213
+ + G+ G G SL SQ+ +++ KFS CL P + +SKI FG V+G+
Sbjct: 215 NENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSX 274
Query: 214 VVSTPLIIRD---HYYLSLEAISVGNQRLEFVSSS----TGNIFVDTGVLRTLLPLEYHS 266
VVSTPL+ +D +Y+++L+ ISVG++ F SSS GN+F+D G TLLP ++++
Sbjct: 275 VVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSSSPMATKGNVFIDAGTPPTLLPRDFYN 334
Query: 267 NLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNLFRN 326
L + I +PV+ +P LCY ++ P +T HF GADV+L P N F +
Sbjct: 335 RLVQGVKEAIPMEPVQ----DPDLQPQLCYRSATLIDGPILTAHFDGADVQLKPLNTFIS 390
Query: 327 ISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
+ + C A + + + ++G +Q+NFLIG+D++ VSFK CT
Sbjct: 391 PKEGVYCFAMQPIDGDTGIFGNFVQMNFLIGFDLDGKKVSFKAVDCTK 438
>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 433
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 151/374 (40%), Positives = 212/374 (56%), Gaps = 32/374 (8%)
Query: 15 TPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQ 74
+P SP + +IS YL+ S+GTP + +FG +DTGSD W QC+PC + C++Q
Sbjct: 74 SPNSP-----ETTVISALGEYLISYSVGTPSLQVFGILDTGSDIIWLQCQPCKK--CYEQ 126
Query: 75 EPPLFDPKKSSTYNSISCSSSQCAVV--TSNCSEGDCSYSFLYGRGAYASFSSGNLATET 132
P+FD KS TY ++ C S+ C V T S C YS Y G S S G+L+ ET
Sbjct: 127 TTPIFDSSKSQTYKTLPCPSNTCQSVQGTFCSSRKHCLYSIHYVDG---SQSLGDLSVET 183
Query: 133 LTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFS 192
LT ST+G PV+ P + GCG N + K +GI+GLG G SLI+Q+ S GKFS
Sbjct: 184 LTLGSTNGSPVQFPGTVIGCGRYNAIG--IEEKNSGIVGLGRGPMSLITQLSPSTGGKFS 241
Query: 193 YCLP---DQGSSKINFG--GIVAGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSS 244
YCL SSK+NFG +V+G G VSTPL ++ Y+L+LEA SVG R+EF S
Sbjct: 242 YCLVPGLSTASSKLNFGNAAVVSGRGTVSTPLFSKNGLVFYFLTLEAFSVGRNRIEFGSP 301
Query: 245 STG---NIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ 301
+G NI +D+G T LP +S L++ ++ + Q V+ +P LCY ++
Sbjct: 302 GSGGKGNIIIDSGTTLTALPNGVYSKLEAAVAKTVILQRVR----DPNQVLGLCYKVTPD 357
Query: 302 ---PKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYD 358
P +T HF GADV L+ N F ++D+++C AF+ V+G + Q N L+GYD
Sbjct: 358 KLDASVPVITAHFSGADVTLNAINTFVQVADDVVCFAFQPTETGAVFGNLAQQNLLVGYD 417
Query: 359 IEQAMVSFKPSRCT 372
++ VSFK + CT
Sbjct: 418 LQMNTVSFKHTDCT 431
>gi|356546376|ref|XP_003541602.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 450
Score = 247 bits (630), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 159/363 (43%), Positives = 222/363 (61%), Gaps = 36/363 (9%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YLM S+GTPP +I G VDTGS TW QC+ C DC++Q P+FDP KS TY ++ CSS
Sbjct: 97 YLMSYSVGTPPFEILGVVDTGSGITWMQCQRCE--DCYEQTTPIFDPSKSKTYKTLPCSS 154
Query: 95 SQCAVV--TSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
+ C V T +CS C Y+ YG G++ S G+L+ ETLT ST+G V+ PN +
Sbjct: 155 NMCQSVISTPSCSSDKIGCKYTIKYGDGSH---SQGDLSVETLTLGSTNGSSVQFPNTVI 211
Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP-----DQGSSKINF 205
GCGH N T + +G++GLG G SLISQ+ +SI GKFSYCL SSK+NF
Sbjct: 212 GCGHNNKG--TFQGEGSGVVGLGGGPVSLISQLSSSIGGKFSYCLAPMFSQSNSSSKLNF 269
Query: 206 G--GIVAGAGVVSTPLIIRD----HYYLSLEAISVGNQRLEFVSSSTGN--------IFV 251
G +V+G G VSTPL+ + YYL+LEA SVG++R+EFV S+ + I +
Sbjct: 270 GDAAVVSGLGAVSTPLVSKTGSEVFYYLTLEAFSVGDKRIEFVGGSSSSGSSNGEGNIII 329
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI--SSQPKFPEVTI 309
D+G TLLP E +SNL+S +++ I+A V ++P LCY S Q P +T
Sbjct: 330 DSGTTLTLLPQEDYSNLESAVADAIQANRV----SDPSNFLSLCYQTTPSGQLDVPVITA 385
Query: 310 HFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPS 369
HF+GADV+L+P + F +++ ++C AF ++G + Q+N L+GYD+ + VSFKP+
Sbjct: 386 HFKGADVELNPISTFVQVAEGVVCFAFHSSEVVSIFGNLAQLNLLVGYDLMEQTVSFKPT 445
Query: 370 RCT 372
CT
Sbjct: 446 DCT 448
>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 247 bits (630), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 159/409 (38%), Positives = 225/409 (55%), Gaps = 50/409 (12%)
Query: 2 QNSQKLPFYNDNETPKSPI----------------SIIYQAEIISVDDIYLMHLSIGTPP 45
++S K P YN +ET I S +A I + YL+ +S+GTPP
Sbjct: 34 RDSPKSPMYNSSETHFDRIVNALRRSSHRNTVVLESDTAEAPIFNNGGEYLVEISVGTPP 93
Query: 46 VDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV--TSN 103
I DTGSD WTQC+PC +C++Q P+FDP KS+TY +++CSS C+ S+
Sbjct: 94 FSIVAVADTGSDVIWTQCKPCS--NCYQQNAPMFDPSKSTTYKNVACSSPVCSYSGDGSS 151
Query: 104 CS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTS 162
CS + +C YS YG ++ S GNLA +T+T STSG PV P + GCGH N T
Sbjct: 152 CSDDSECLYSIAYGDDSH---SQGNLAVDTVTMQSTSGRPVAFPRTVIGCGHDNAG--TF 206
Query: 163 DSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG------SSKINFG--GIVAGAGV 214
++ +GI+GLG G +SL++Q+G + GKFSYCL G S+K+NFG V+G+G
Sbjct: 207 NANVSGIVGLGRGPASLVTQLGPATGGKFSYCLIPIGTGSTNDSTKLNFGSNANVSGSGT 266
Query: 215 VSTPLI----IRDHYYLSLEAISVGNQRLEFVSSST-----GNIFVDTGVLRTLLPLEYH 265
VSTP+ + Y L LEA+SVG+ + F ++ NI +D+G T LP
Sbjct: 267 VSTPIYSSAQYKTFYSLKLEAVSVGDTKFNFPEGASKLGGESNIIIDSGTTLTYLPSALL 326
Query: 266 SNLKSVMSNMIKAQPVKGVGAEPG-FSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNLF 324
++ S +S + + +P F D + + P VT+HF GADV L NLF
Sbjct: 327 NSFGSAISQSMSLPHAQ----DPSEFLDYCFATTTDDYEMPPVTMHFEGADVPLQRENLF 382
Query: 325 RNISDEIMCSAFRG-GNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+SD+ +C AF + NI +YG I Q NFL+GYDI+ VSF+P+ C
Sbjct: 383 VRLSDDTICLAFGSFPDDNIFIYGNIAQSNFLVGYDIKNLAVSFQPAHC 431
>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 444
Score = 247 bits (630), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 156/415 (37%), Positives = 233/415 (56%), Gaps = 54/415 (13%)
Query: 1 AQNSQKLPFYNDNETPKSPISIIY--------------------QAEIISVDDIYLMHLS 40
+++S PFYN +ET + + Q+++IS YLM++S
Sbjct: 40 SRDSPHSPFYNPSETKYQRLQKAFRRSILRGNHFRAMRASPNDIQSDVISGGGAYLMNIS 99
Query: 41 IGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV 100
+GTPPV + G DTGSD W QC PCP +C++Q PLFDPK+S TY ++ C + C +
Sbjct: 100 LGTPPVPMLGIADTGSDLIWRQCLPCP--NCYEQVEPLFDPKESETYKTLDCDNEFCQDL 157
Query: 101 TSNCSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNL 157
S D C+YS+ YG +Y + G+L+++TLT ST G P P + FGCGH N
Sbjct: 158 GQQGSCDDDNTCTYSYSYGDRSY---TRGDLSSDTLTIGSTEGDPASFPGIAFGCGHDN- 213
Query: 158 ASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----PDQGSSKINFG--GIVA 210
T + K G+IGLG G SL+ Q+ + + G+FSYCL SSKINFG G+V+
Sbjct: 214 -GGTFNEKDGGLIGLGGGPLSLVMQLSSEVGGQFSYCLVPLSSDSTVSSKINFGKSGVVS 272
Query: 211 GAGVVSTPLII---RDHYYLSLEAISVGNQRLEF----------VSSSTGNIFVDTGVLR 257
G+G VSTPLI YYL+LE +SVG++ + F + GNI +D+G
Sbjct: 273 GSGTVSTPLIKGTPDTFYYLTLEGLSVGSETVAFKGFSENKSSPAAVEEGNIIIDSGTTL 332
Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVK 317
TLLP +++++++S ++N I Q +P LCY+ + + P +T HF GADV+
Sbjct: 333 TLLPQDFYTDVESALTNAIGGQTT----TDPNGIFSLCYSSVNNLEIPTITAHFTGADVQ 388
Query: 318 LSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
L P N F + ++++C + + ++G + QINFL+GYD++ VSFK + CT
Sbjct: 389 LPPLNTFVQVQEDLVCFSMIPSSNLAIFGNLAQINFLVGYDLKNNKVSFKQTDCT 443
>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 244 bits (623), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 148/356 (41%), Positives = 211/356 (59%), Gaps = 32/356 (8%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YL+ S+GTPP ++G +DTGS+ W QC+PC CF Q P+F+P KSS+Y +I C+S
Sbjct: 89 YLISYSVGTPPFKVYGFMDTGSNIVWLQCQPCNT--CFNQTSPIFNPSKSSSYKNIPCTS 146
Query: 95 SQCAVVTS---NCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
S C +CS G C YS YG A S G+L+ ++LT +STSG V PN++
Sbjct: 147 STCKDTNDTHISCSNGGDVCEYSITYGGDAK---SQGDLSNDSLTLDSTSGSSVLFPNIV 203
Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG-KFSYCL-----PDQGSSKI 203
GCGH N+ +S+ +G++G+G G SLI Q+G+S G KFSYCL SSK+
Sbjct: 204 IGCGHINVLQ--DNSQSSGVVGMGRGPMSLIKQVGSSSVGSKFSYCLIPYNSDSNSSSKL 261
Query: 204 NFGG--IVAGAGVVSTPLII----RDHYYLSLEAISVGNQRLEF---VSSSTGNIFVDTG 254
FG +V+G VVSTP++ ++Y+L+LEA SVGN R+E+ ++ST NI +D+G
Sbjct: 262 IFGEDVVVSGEIVVSTPMVKVNGQENYYFLTLEAFSVGNNRIEYGERSNASTQNILIDSG 321
Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS-SQPKFPEVTIHFRG 313
T+LP + S L S ++ +K ++ P LCYN + Q P++T HF G
Sbjct: 322 TPLTMLPNLFLSKLVSYVAQEVKLPRIE----PPDHHLSLCYNTTGKQLNVPDITAHFNG 377
Query: 314 ADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPS 369
ADVKL+ + F D IMC F N ++G I Q N LI YD+E+ ++SFKP+
Sbjct: 378 ADVKLNSNGTFFPFEDGIMCFGFISSNGLEIFGNIAQNNLLIDYDLEKEIISFKPT 433
>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 374
Score = 244 bits (622), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 150/369 (40%), Positives = 215/369 (58%), Gaps = 29/369 (7%)
Query: 25 QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
Q+ I + YLM +SIGTPP I+G DTGSD TWT C PC + C+KQ P+FDP+KS
Sbjct: 15 QSPIYAYLGHYLMEVSIGTPPFKIYGIADTGSDLTWTSCVPCNK--CYKQRNPIFDPQKS 72
Query: 85 STYNSISCSSSQCAVV-TSNCS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
++Y +ISC S C + T CS + C+Y++ Y A A+ + G LA ET+T +ST G
Sbjct: 73 TSYRNISCDSKLCHKLDTGVCSPQKHCNYTYAY---ASAAITQGVLAQETITLSSTKGES 129
Query: 143 VEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK-FSYCLPD---- 197
V + ++FGCGH N + ++ GIIGLG G S ISQ+G+S GK FS CL
Sbjct: 130 VPLKGIVFGCGHNNTGG--FNDREMGIIGLGGGPVSFISQIGSSFGGKRFSQCLVPFHTD 187
Query: 198 -QGSSKINF--GGIVAGAGVVSTPLIIRDH---YYLSLEAISVGNQRLEFVSSST----- 246
SSK++ G V+G GVVSTPL+ + Y+++L ISVGN L F SS+
Sbjct: 188 VSVSSKMSLGKGSEVSGKGVVSTPLVAKQDKTPYFVTLLGISVGNTYLHFNGSSSQSVEK 247
Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPE 306
GN+F+D+G T+LP + + L + + + + +PV + LCY + + P
Sbjct: 248 GNVFLDSGTPPTILPTQLYDRLVAQVRSEVAMKPVTN---DLDLGPQLCYRTKNNLRGPV 304
Query: 307 VTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNAN-IVYGRIMQINFLIGYDIEQAMVS 365
+T HF G DVKL P+ F + D + C F +++ VYG Q N+LIG+D+++ +VS
Sbjct: 305 LTAHFEGGDVKLLPTQTFVSPKDGVFCLGFTNTSSDGGVYGNFAQSNYLIGFDLDRQVVS 364
Query: 366 FKPSRCTNY 374
FKP CT +
Sbjct: 365 FKPMDCTKH 373
>gi|225427556|ref|XP_002266575.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 445
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 154/416 (37%), Positives = 235/416 (56%), Gaps = 54/416 (12%)
Query: 1 AQNSQKLPFYNDNETPKSPISIIYQAEII----------SVDDI----------YLMHLS 40
+++S + PFYN +ET + ++ I+ S +DI YLM++S
Sbjct: 40 SRDSPRSPFYNPSETKYQRLQKAFRRSILRGNHFRAIRASPNDIQSNVISGGGSYLMNIS 99
Query: 41 IGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV 100
+GTPPV + G DTGSD W QC PC DC+KQ PLFDPKKS TY ++ C++ C +
Sbjct: 100 LGTPPVSMLGIADTGSDLIWRQCLPCD--DCYKQVEPLFDPKKSKTYKTLGCNNDFCQDL 157
Query: 101 TSNCSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNL 157
S GD C+ S+ YG +Y + +L++ET T ST G P P + FGCGH N
Sbjct: 158 GQQGSCGDDNTCTSSYSYGDQSY---TRRDLSSETFTIGSTEGDPASFPGLAFGCGHSN- 213
Query: 158 ASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----PDQGSSKINFG--GIVA 210
T + K +G+IGLG G SL+ Q+ + + G+FSYCL SSKINFG +V+
Sbjct: 214 -GGTFNEKDSGLIGLGGGPLSLVMQLSSKVGGQFSYCLVPLSSDSTASSKINFGKSAVVS 272
Query: 211 GAGVVSTPLII---RDHYYLSLEAISVGNQRLEF----------VSSSTGNIFVDTGVLR 257
G+G VSTPLI YYL+LE +S+G++++ F ++ NI +D+G
Sbjct: 273 GSGTVSTPLIKGTPDTFYYLTLEGMSLGSEKVAFKGFSKNKSSPAAAEESNIIIDSGTTL 332
Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVK 317
TLLP +++++++S ++ +I Q +P + LCY+ + + P +T HF GADV+
Sbjct: 333 TLLPRDFYTDMESALTKVIGGQTT----TDPRGTFSLCYSGVKKLEIPTITAHFIGADVQ 388
Query: 318 LSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
L P N F ++++C + + ++G + Q+NFL+GYD++ VSFKP+ CT
Sbjct: 389 LPPLNTFVQAQEDLVCFSMIPSSNLAIFGNLSQMNFLVGYDLKNNKVSFKPTDCTK 444
>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 439
Score = 243 bits (619), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 161/412 (39%), Positives = 225/412 (54%), Gaps = 51/412 (12%)
Query: 2 QNSQKLPFYNDNETPKSPISII----------YQAEIISVDDI----------YLMHLSI 41
++S + P Y ETP ++ ++ +S D YLM S+
Sbjct: 38 RDSSRSPLYRPTETPFQRVANAVRRSINRGNHFKKAFVSTDSAESTVVASQGEYLMRYSV 97
Query: 42 GTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVT 101
G+PP + G VDTGSD W QCEPC DC+KQ P+FDP KS TY ++ CSS+ C +
Sbjct: 98 GSPPFQVLGIVDTGSDILWLQCEPCE--DCYKQTTPIFDPSKSKTYKTLPCSSNTCESLR 155
Query: 102 SNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLAS 159
+ D C YS YG G++ S G+L+ ETLT ST G V P + GCGH N +
Sbjct: 156 NTACSSDNVCEYSIDYGDGSH---SDGDLSVETLTLGSTDGSSVHFPKTVIGCGHNNGGT 212
Query: 160 PTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP-----DQGSSKINFG--GIVAGA 212
+ +G GP SLISQ+ +SI GKFSYCL SSK+NFG +V+G
Sbjct: 213 FQEEGSGIVGLGGGP--VSLISQLSSSIGGKFSYCLAPIFSESNSSSKLNFGDAAVVSGR 270
Query: 213 GVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSSST-------GNIFVDTGVLRTLLPL 262
G VSTPL + Y+L+LEA SVG+ R+EF SS+ GNI +D+G TLLP
Sbjct: 271 GTVSTPLDPLNGQVFYFLTLEAFSVGDNRIEFSGSSSSGSGSGDGNIIIDSGTTLTLLPQ 330
Query: 263 EYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS-QPKFPEVTIHFRGADVKLSPS 321
E + NL+S +S++IK + + +P LCY +S + P +T HF+GADV+L+P
Sbjct: 331 EDYLNLESAVSDVIKLERAR----DPSKLLSLCYKTTSDELDLPVITAHFKGADVELNPI 386
Query: 322 NLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
+ F + ++C AF ++G + Q N L+GYD+ + VSFKP+ CT
Sbjct: 387 STFVPVEKGVVCFAFISSKIGAIFGNLAQQNLLVGYDLVKKTVSFKPTDCTK 438
>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags:
Precursor
gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 447
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 151/379 (39%), Positives = 217/379 (57%), Gaps = 40/379 (10%)
Query: 25 QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
Q+ +I D + M ++IGTPP+ +F DTGSD TW QC+PC + C+K+ P+FD KKS
Sbjct: 75 QSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQ--CYKENGPIFDKKKS 132
Query: 85 STYNSISCSSSQCAVVTSN---CSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTS 139
STY S C S C ++S C E + C Y + YG SFS G++ATET++ +S S
Sbjct: 133 STYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGD---QSFSKGDVATETVSIDSAS 189
Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ- 198
G PV P +FGCG+ N T D +GIIGLG G+ SLISQ+G+SI+ KFSYCL +
Sbjct: 190 GSPVSFPGTVFGCGYNN--GGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCLSHKS 247
Query: 199 ----GSSKINFG------GIVAGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSSS 245
G+S IN G + +GVVSTPL+ ++ +YYL+LEAISVG +++ + SS
Sbjct: 248 ATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGKKKIPYTGSS 307
Query: 246 ------------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV 293
+GNI +D+G TLL + S + + K V G
Sbjct: 308 YNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTG--AKRVSDPQGLLSH 365
Query: 294 LCYNISSQPKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINF 353
+ S++ PE+T+HF GADV+LSP N F +S++++C + +YG Q++F
Sbjct: 366 CFKSGSAEIGLPEITVHFTGADVRLSPINAFVKLSEDMVCLSMVPTTEVAIYGNFAQMDF 425
Query: 354 LIGYDIEQAMVSFKPSRCT 372
L+GYD+E VSF+ C+
Sbjct: 426 LVGYDLETRTVSFQHMDCS 444
>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 447
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 150/379 (39%), Positives = 217/379 (57%), Gaps = 40/379 (10%)
Query: 25 QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
Q+ +I D + M ++IGTPP+ +F DTGSD TW QC+PC + C+K+ P+FD KKS
Sbjct: 75 QSGLIGADGEFFMSITIGTPPMKVFAIADTGSDLTWVQCKPCQQ--CYKENGPIFDKKKS 132
Query: 85 STYNSISCSSSQCAVVTSN---CSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTS 139
STY S C S C ++S+ C E C Y + YG SFS G++ATET++ +S S
Sbjct: 133 STYKSEPCDSRNCHALSSSERGCDESKNVCKYRYSYGD---QSFSKGDVATETISIDSAS 189
Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ- 198
G PV P +FGCG+ N T D +GIIGLG G+ SLISQ+G+SI+ KFSYCL +
Sbjct: 190 GSPVSFPGTVFGCGYNN--GGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCLSHKS 247
Query: 199 ----GSSKINFG------GIVAGAGVVSTPLII---RDHYYLSLEAISVGNQRLEFVSSS 245
G+S IN G + +GV+STPL+ R +YYL+LEAISVG +++ + SS
Sbjct: 248 ATTNGTSVINLGTNSIPSSLSKDSGVISTPLVDKEPRTYYYLTLEAISVGKKKIPYTGSS 307
Query: 246 ------------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV 293
+GNI +D+G TLL + + + ++ K V G
Sbjct: 308 YNPNDGGIFSETSGNIIIDSGTTLTLLDSGFFDKFGAAVEELVTG--AKRVSDPQGLLSH 365
Query: 294 LCYNISSQPKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINF 353
+ S++ PE+T+HF GADV+LSP N F +S++++C + +YG Q++F
Sbjct: 366 CFKSGSAEIGLPEITVHFTGADVRLSPINAFVKVSEDMVCLSMVPTTEVAIYGNFAQMDF 425
Query: 354 LIGYDIEQAMVSFKPSRCT 372
L+GYD+E VSF+ C+
Sbjct: 426 LVGYDLETRTVSFQRMDCS 444
>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 415
Score = 241 bits (614), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 148/364 (40%), Positives = 206/364 (56%), Gaps = 43/364 (11%)
Query: 25 QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
Q+ + S YLM SIGTPP +FG VDTGSD W QCEPC + C+ Q P+FDP S
Sbjct: 78 QSTVNSDKGEYLMSYSIGTPPFKVFGFVDTGSDLVWLQCEPCKQ--CYPQITPIFDPSLS 135
Query: 85 STYNSISCSSSQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
S+Y +I C S C ++ T++C G L+ ETLT +ST+G V
Sbjct: 136 SSYQNIPCLSDTCHSMRTTSCD------------------VRGYLSVETLTLDSTTGYSV 177
Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYC----LPDQG 199
P + GCG++N T +GI+GLG G SL SQ+GTSI GKFSYC LP+
Sbjct: 178 SFPKTMIGCGYRNTG--TFHGPSSGIVGLGSGPMSLPSQLGTSIGGKFSYCLGPWLPNS- 234
Query: 200 SSKINFG--GIVAGAGVVSTPLIIRDH---YYLSLEAISVGNQRLEF----VSSSTGNIF 250
+SK+NFG IV G G ++TP++ +D YYL+LEA SVGN+ +EF + GNI
Sbjct: 235 TSKLNFGDAAIVYGDGAMTTPIVKKDAQSGYYLTLEAFSVGNKLIEFGGPTYGGNEGNIL 294
Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP-KFPEVTI 309
+D+G T LP + + +S ++ I + V+ +P + LCYN++ + P +T
Sbjct: 295 IDSGTTFTFLPYDVYYRFESAVAEYINLEHVE----DPNGTFKLCYNVAYHGFEAPLITA 350
Query: 310 HFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPS 369
HF+GAD+KL + F +SD I C AF I +G + Q N L+GY++ Q V+FKP
Sbjct: 351 HFKGADIKLYYISTFIKVSDGIACLAFIPSQTAI-FGNVAQQNLLVGYNLVQNTVTFKPV 409
Query: 370 RCTN 373
CT
Sbjct: 410 DCTK 413
>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 407
Score = 240 bits (613), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 154/393 (39%), Positives = 217/393 (55%), Gaps = 38/393 (9%)
Query: 4 SQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQC 63
S KL N + P +I Q+ + + D YLM LSIGTPP+ I+ DTGSD W QC
Sbjct: 31 SVKLIRRNSSHDSYKPSTI--QSPVSAYDCEYLMELSIGTPPIKIYAEADTGSDLVWFQC 88
Query: 64 EPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEGD---CSYSFLYGRGAY 120
PC + C+KQ+ P+FDP+ SS+Y +I+C + C + S+ D C+Y++ Y A
Sbjct: 89 IPCTK--CYKQQNPMFDPRSSSSYTNITCGTESCNKLDSSLCSTDQKTCNYTYSY---AD 143
Query: 121 ASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLI 180
S + G LA ETLT ST+G PV +IFGCGH N + ++ G+IGLG G SLI
Sbjct: 144 NSITQGVLAQETLTLTSTTGEPVAFQGIIFGCGHNNSG---FNDREMGLIGLGRGPLSLI 200
Query: 181 SQMGTSIAG---KFSYCL-----PDQGSSKINF--GGIVAGAGVVSTPLIIRD--HYYLS 228
SQ+G+S+ FS CL +S++NF G V G G VSTPLI +D Y+ +
Sbjct: 201 SQIGSSLGAGGNMFSQCLVPFNTDPSITSQMNFGKGSEVLGNGTVSTPLISKDGTGYFAT 260
Query: 229 LEAISVGNQRLEFVSSST------GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVK 282
L ISV + L F + S+ GNI +D+G T LP E++ L + N + +P +
Sbjct: 261 LLGISVEDINLPFSNGSSLGTITKGNILIDSGTTITYLPEEFYHRLIEQVRNKVALEPFR 320
Query: 283 GVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSA-FRGGNA 341
G E LCY + P +TIHF G DV L+P+ +F + D+ C A F
Sbjct: 321 IDGYE------LCYQTPTNLNGPTLTIHFEGGDVLLTPAQMFIPVQDDNFCFAVFDTNEE 374
Query: 342 NIVYGRIMQINFLIGYDIEQAMVSFKPSRCTNY 374
+ YG Q N+LIG+D+E+ +VSFK + CT +
Sbjct: 375 YVTYGNYAQSNYLIGFDLERQVVSFKATDCTKF 407
>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 437
Score = 240 bits (612), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 146/360 (40%), Positives = 210/360 (58%), Gaps = 31/360 (8%)
Query: 33 DIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISC 92
D Y++ IGTPP ++G +DT +D W QC PC CF P+FDP KSSTY +I C
Sbjct: 87 DGYIISFLIGTPPFQLYGVMDTANDNIWFQCNPCKP--CFNTTSPMFDPSKSSTYKTIPC 144
Query: 93 SSSQCA-VVTSNCSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
SS +C V ++CS D C YSF YG AY S G+L+ +TLT NS + P+ N+
Sbjct: 145 SSPKCKNVENTHCSSDDKKVCEYSFTYGGEAY---SQGDLSIDTLTLNSNNDTPISFKNI 201
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL----PDQG-SSKI 203
+ GCGH+N + +G IGLG G S ISQ+ +SI GKFSYCL ++G S K+
Sbjct: 202 VIGCGHRNKGP--LEGYVSGNIGLGRGPLSFISQLNSSIGGKFSYCLVPLFSNEGISGKL 259
Query: 204 NFG--GIVAGAGVVSTPLIIRD-HYYLSLEAISVGNQRLEFVSSST-----GNIFVDTGV 255
+FG +V+G G VSTP+ + Y +L A+SVG+ ++F +S++ GN +D+G
Sbjct: 260 HFGDKSVVSGVGTVSTPITAGEIGYSTTLNALSVGDHIIKFENSTSKNDNLGNTIIDSGT 319
Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS-SQPKFPEVTIHFRGA 314
T+LP +S L+S++++M+K + K P LCY + P +T HF GA
Sbjct: 320 TLTILPENVYSRLESIVTSMVKLERAK----SPNQQFKLCYKATLKNLDVPIITAHFNGA 375
Query: 315 DVKLSPSNLFRNISDEIMCSAF--RGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
DV L+ N F I E++C AF G + G I Q NFL+G+D+++ ++SFKP+ CT
Sbjct: 376 DVHLNSLNTFYPIDHEVVCFAFVSVGNFPGTIIGNIAQQNFLVGFDLQKNIISFKPTDCT 435
>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
Length = 542
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 145/347 (41%), Positives = 200/347 (57%), Gaps = 33/347 (9%)
Query: 25 QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
Q+ I+ YLM+L IGTPPV + VDTGSD TWTQC PC C+KQ PLFDPK S
Sbjct: 82 QSRIVPSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTH--CYKQVVPLFDPKNS 139
Query: 85 STYNSISCSSSQCAVVTSN--CS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGL 141
STY SC +S C + + CS E C++ + Y G SF+ GNLA+ETLT +ST+G
Sbjct: 140 STYRDSSCGTSFCLALGKDRSCSKEKKCTFRYSYADG---SFTGGNLASETLTVDSTAGK 196
Query: 142 PVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----P 196
PV P FGCGH + D +GI+GLG G SLISQ+ ++I G FSYCL
Sbjct: 197 PVSFPGFAFGCGHS--SGGIFDKSSSGIVGLGGGELSLISQLKSTINGLFSYCLLPVSTD 254
Query: 197 DQGSSKINFG--GIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTG 254
SS+INFG G V+G G VSTPL + Y +++ E GNI VD+G
Sbjct: 255 SSISSRINFGASGRVSGYGTVSTPLRLPYKGY---------SKKTEV---EEGNIIVDSG 302
Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGA 314
T LP E++S L+ ++N IK + V+ +P LCYN +++ P +T HF+ A
Sbjct: 303 TTYTFLPQEFYSKLEKSVANSIKGKRVR----DPNGIFSLCYNTTAEINAPIITAHFKDA 358
Query: 315 DVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQ 361
+V+L P N F + ++++C + V G + Q+NFL+G+D+ +
Sbjct: 359 NVELQPLNTFMRMQEDLVCFTVAPTSDIGVLGNLAQVNFLVGFDLRK 405
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 45/127 (35%), Positives = 74/127 (58%), Gaps = 5/127 (3%)
Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS-SQPKFP 305
GNI VD+G T LPLE++ L+ +++ IK + V+ +P LCYN + Q P
Sbjct: 418 GNIIVDSGTTYTYLPLEFYVKLEESVAHSIKGKRVR----DPNGISSLCYNTTVDQIDAP 473
Query: 306 EVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVS 365
+T HF+ A+V+L P N F + ++++C + + G + Q+NFL+G+D+ + VS
Sbjct: 474 IITAHFKDANVELQPWNTFLRMQEDLVCFTVLPTSDIGILGNLAQVNFLVGFDLRKKRVS 533
Query: 366 FKPSRCT 372
FK + CT
Sbjct: 534 FKAADCT 540
>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 443
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 157/395 (39%), Positives = 228/395 (57%), Gaps = 43/395 (10%)
Query: 2 QNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWT 61
+NS ++ F + T ++P+S+ + YLM LSIGTPPV + VDTGSD W
Sbjct: 36 RNSSQVLF--NRITAQTPVSVHHYD--------YLMELSIGTPPVKTYAQVDTGSDLIWL 85
Query: 62 QCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA-VVTSNCS--EGDCSYSFLYGRG 118
QC PC +C+KQ P+FDP+ SSTY++I+ S C+ + +++CS + +C+Y++ Y
Sbjct: 86 QCIPC--TNCYKQLNPMFDPQSSSTYSNIAYGSESCSKLYSTSCSPDQNNCNYTYSYEDD 143
Query: 119 AYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSS 178
S + G LA ETLT ST+G PV + VIFGCGH N + + K+ GIIGLG G S
Sbjct: 144 ---SITEGVLAQETLTLTSTTGKPVALKGVIFGCGHNN--NGVFNDKEMGIIGLGRGPLS 198
Query: 179 LISQMGTSIAGK-FSYCL-PDQG----SSKINF--GGIVAGAGVVSTPLIIRD----HYY 226
L+SQ+G+S GK FS CL P +S ++F G V G GVVSTPL+ ++ Y+
Sbjct: 199 LVSQIGSSFGGKMFSQCLVPFHTNPSITSPMSFGKGSEVLGNGVVSTPLVSKNTHQAFYF 258
Query: 227 LSLEAISVGNQRLEFVSSST------GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQP 280
++L ISV + L F S+ GN+ +D+G TLLP +++ L + N + P
Sbjct: 259 VTLLGISVEDINLPFNDGSSLEPITKGNMVIDSGTPTTLLPEDFYHRLVEEVRNKVALDP 318
Query: 281 VKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGN 340
+ +P LCY + K +T HF GADV L+P+ +F + D I C AF
Sbjct: 319 IP---IDPTLGYQLCYRTPTNLKGTTLTAHFEGADVLLTPTQIFIPVQDGIFCFAFTSTF 375
Query: 341 ANI--VYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
+N +YG Q N+LIG+D+E+ +VSFK + CTN
Sbjct: 376 SNEYGIYGNHAQSNYLIGFDLEKQLVSFKATDCTN 410
>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 238 bits (606), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 146/357 (40%), Positives = 208/357 (58%), Gaps = 39/357 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YLM +SIGTPPVD G DTGSD TW QC PC L C++Q P+F+P KS++++ + C++
Sbjct: 92 YLMSVSIGTPPVDYLGIADTGSDLTWAQCLPC--LKCYQQLRPIFNPLKSTSFSHVPCNT 149
Query: 95 SQC-AVVTSNCS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
C AV +C +G C YS+ YG Y S G+L E +T S+S + GC
Sbjct: 150 QTCHAVDDGHCGVQGVCDYSYTYGDRTY---SKGDLGFEKITIGSSS------VKSVIGC 200
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMG--TSIAGKFSYCLP---DQGSSKINFG- 206
GH AS +G+IGLG G SL+SQM + I+ +FSYCLP + KINFG
Sbjct: 201 GH---ASSGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFGE 257
Query: 207 -GIVAGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLPL 262
+V+G GVVSTPLI ++ +YY++LEAIS+GN+R + GN+ +D+G T+LP
Sbjct: 258 NAVVSGPGVVSTPLISKNTVTYYYITLEAISIGNER-HMAFAKQGNVIIDSGTTLTILPK 316
Query: 263 EYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY----NISSQPKFPEVTIHFR-GADVK 317
E + + S + ++KA+ VK +P S LC+ N ++ P +T HF GA+V
Sbjct: 317 ELYDGVVSSLLKVVKAKRVK----DPHGSLDLCFDDGINAAASLGIPVITAHFSGGANVN 372
Query: 318 LSPSNLFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
L P N FR ++D + C + + + G + Q NFLIGYD+E +SFKP+ C
Sbjct: 373 LLPINTFRKVADNVNCLTLKAASPTTEFGIIGNLAQANFLIGYDLEAKRLSFKPTVC 429
>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 444
Score = 234 bits (596), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 151/370 (40%), Positives = 219/370 (59%), Gaps = 33/370 (8%)
Query: 25 QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
++ +I+ YLM S+GTPP I G VDTGSD W QC+PC DC+ Q P+FDP +S
Sbjct: 84 ESTVIASQGEYLMSYSVGTPPFQILGIVDTGSDIIWLQCQPCE--DCYNQTTPIFDPSQS 141
Query: 85 STYNSISCSSSQCAVVTS--NCSEG--DCSYSFLYGRGAYASFSSGNLATETLTFNSTSG 140
TY ++ CSS+ C V S +CS +C Y+ YG ++ S G+L+ ETLT ST G
Sbjct: 142 KTYKTLPCSSNICQSVQSAASCSSNNDECEYTITYGDNSH---SQGDLSVETLTLGSTDG 198
Query: 141 LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---- 196
V+ P + GCGH N + + +G GP SLISQ+ +SI GKFSYCL
Sbjct: 199 SSVQFPKTVIGCGHNNKGTFQREGSGIVGLGGGP--VSLISQLSSSIGGKFSYCLAPLFS 256
Query: 197 -DQGSSKINFG--GIVAGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSSSTG--- 247
SSK+NFG +V+G G VSTP++ ++ Y+L+LEA SVG+ R+EF SSS
Sbjct: 257 QSNSSSKLNFGDEAVVSGRGTVSTPIVPKNGLGFYFLTLEAFSVGDNRIEFGSSSFESSG 316
Query: 248 ---NIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKF 304
NI +D+G T+LP + + NL+S +++ I+ + V+ +P LCY +S +
Sbjct: 317 GEGNIIIDSGTTLTILPEDDYLNLESAVADAIELERVE----DPSKFLRLCYRTTSSDEL 372
Query: 305 --PEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQA 362
P +T HF+GADV+L+P + F + + ++C AFR ++G + Q N L+GYD+ +
Sbjct: 373 NVPVITAHFKGADVELNPISTFIEVDEGVVCFAFRSSKIGPIFGNLAQQNLLVGYDLVKQ 432
Query: 363 MVSFKPSRCT 372
VSFKP+ CT
Sbjct: 433 TVSFKPTDCT 442
>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
Length = 443
Score = 233 bits (594), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 143/372 (38%), Positives = 211/372 (56%), Gaps = 32/372 (8%)
Query: 24 YQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKK 83
+Q +++ Y M +SIGTP V++ DTGSD TW QC PC C++Q+ PLFDP +
Sbjct: 83 FQNDLVPNGGEYFMKMSIGTPLVEVIVIADTGSDLTWVQCLPCDP--CYRQKSPLFDPSR 140
Query: 84 SSTYNSISCSSSQCAV--VTSNCSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNST 138
SS+Y + C S C V+ D C Y + YG +Y ++GNLATE T ST
Sbjct: 141 SSSYRHMLCGSRFCNALDVSEQACTMDTNICEYHYSYGDKSY---TNGNLATEKFTIGST 197
Query: 139 SGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL--- 195
S PV + ++FGCG N T D +GI+GLG G SL+SQ+ + I GKFSYCL
Sbjct: 198 SSRPVHLSPIVFGCGTGN--GGTFDELGSGIVGLGGGALSLVSQLSSIIKGKFSYCLVPL 255
Query: 196 --PDQGSSKINFG--GIVAGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSS---- 244
+SKI FG +++G VVSTPL+ + +YY++LEAISVGN+RL + +
Sbjct: 256 SEQSNVTSKIKFGTDSVISGPQVVSTPLVSKQPDTYYYVTLEAISVGNKRLPYTNGLLNG 315
Query: 245 --STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP 302
GN+ +D+G T L E+ + L+ V+ +KA+ V ++P +C+ +
Sbjct: 316 NVEKGNVIIDSGTTLTFLDSEFFTELERVLEETVKAERV----SDPRGLFSVCFRSAGDI 371
Query: 303 KFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQA 362
P + +HF ADVKL P N F ++++C N ++G + Q++FL+GYD+E+
Sbjct: 372 DLPVIAVHFNDADVKLQPLNTFVKADEDLLCFTMISSNQIGIFGNLAQMDFLVGYDLEKR 431
Query: 363 MVSFKPSRCTNY 374
VSFKP+ CT +
Sbjct: 432 TVSFKPTDCTKH 443
>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 427
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 151/400 (37%), Positives = 216/400 (54%), Gaps = 38/400 (9%)
Query: 2 QNSQKLPFYNDNETPKSPISIIYQ------------AEIISVDDIYLMHLSIGTPPVDIF 49
+NS PFY N K+ + YQ + S + YLM L++G+PPVDI+
Sbjct: 37 KNSPNSPFYKSNNFHKNKLRSFYQVPKKSFVQKSPYTRVTSNNGDYLMKLTLGSPPVDIY 96
Query: 50 GSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCS-EGD 108
G VDTGSD W QC PC C++Q+ P+F+P +S TY+ I C S QC+ +CS +
Sbjct: 97 GLVDTGSDLVWAQCTPCG--GCYRQKSPMFEPLRSKTYSPIPCESEQCSFFGYSCSPQKM 154
Query: 109 CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTG 168
C+YS+ Y A +S + G LA E +TF+ST G PV + ++IFGCGH N + +
Sbjct: 155 CAYSYSY---ADSSVTKGVLAREAITFSSTDGDPVVVGDIIFGCGHSNSGTFNENDMGII 211
Query: 169 IIGLGPGNSSLISQMGTSIAGK-FSYCL-----PDQGSSKINFG--GIVAGAGVVSTPLI 220
+G GP SL+SQ+GT K FS CL S INFG V+G GVV+TPL
Sbjct: 212 GMGGGP--LSLVSQIGTLYGSKRFSQCLVPFHTDAHTSGTINFGEESDVSGEGVVTTPLA 269
Query: 221 IRD---HYYLSLEAISVGNQRLEFVSSST---GNIFVDTGVLRTLLPLEYHSNLKSVMSN 274
+ Y ++LE ISVG+ + F SS T GNI +D+G T +P E++ L +
Sbjct: 270 SEEGQTSYLVTLEGISVGDTFVRFNSSETLSKGNIMIDSGTPATYIPQEFYERLVEELKV 329
Query: 275 MIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNLFRNISDEIMCS 334
P++ +P LCY + + P +T HF GADV+L P F D + C
Sbjct: 330 QSSLLPIED---DPDLGTQLCYRSETNLEGPILTAHFEGADVQLLPIQTFIPPKDGVFCF 386
Query: 335 AFRGG-NANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
A G + + ++G Q N L+G+D+++ +SFKP+ CTN
Sbjct: 387 AMAGSTDGDYIFGNFAQSNILMGFDLDRKTISFKPTDCTN 426
>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 431
Score = 231 bits (589), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 142/359 (39%), Positives = 206/359 (57%), Gaps = 30/359 (8%)
Query: 32 DDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSIS 91
D YLM S+GTPP ++G VDT SD W QC+ C C+ P+FDP S TY ++
Sbjct: 85 DGDYLMSYSLGTPPFPVYGIVDTASDIIWVQCQLCET--CYNDTSPMFDPSYSKTYKNLP 142
Query: 92 CSSSQCAVVT-SNCSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
CSS+ C V ++CS + C ++ Y G++ S G+L ET+T S + V P
Sbjct: 143 CSSTTCKSVQGTSCSSDERKICEHTVNYKDGSH---SQGDLIVETVTLGSYNDPFVHFPR 199
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKINF 205
+ GC S S GI+GLG G SL+ Q+ +SI+ KFSYCL SSK+ F
Sbjct: 200 TVIGCIRNTNVSFDS----IGIVGLGGGPVSLVPQLSSSISKKFSYCLAPISDRSSKLKF 255
Query: 206 G--GIVAGAGVVSTPLIIRDH---YYLSLEAISVGNQRLEFVSSST-----GNIFVDTGV 255
G +V+G G VST ++ +D YYL+LEA SVGN R+EF SSS+ GNI +D+G
Sbjct: 256 GDAAMVSGDGTVSTRIVFKDWKKFYYLTLEAFSVGNNRIEFRSSSSRSSGKGNIIIDSGT 315
Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS-SQPKFPEVTIHFRGA 314
T+LP + +S L+S +++++K + + +P LCY + + P +T HF GA
Sbjct: 316 TFTVLPDDVYSKLESAVADVVKLERAE----DPLKQFSLCYKSTYDKVDVPVITAHFSGA 371
Query: 315 DVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
DVKL+ N F S ++C AF + ++G + Q NFL+GYD+++ +VSFKP+ CT
Sbjct: 372 DVKLNALNTFIVASHRVVCLAFLSSQSGAIFGNLAQQNFLVGYDLQRKIVSFKPTDCTK 430
>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 229 bits (584), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 143/358 (39%), Positives = 208/358 (58%), Gaps = 39/358 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YLM +SIGTPPVD G DTGSD W QC PC L C+KQ P+FDP KS++++ + C+S
Sbjct: 92 YLMSVSIGTPPVDYIGMADTGSDLMWAQCLPC--LKCYKQSRPIFDPLKSTSFSHVPCNS 149
Query: 95 SQC-AVVTSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
C A+ S+C ++G C YS+ YG Y + G+L E +T S+S + GC
Sbjct: 150 QNCKAIDDSHCGAQGVCDYSYTYGDQTY---TKGDLGFEKITIGSSS------VKSVIGC 200
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLP---DQGSSKINFG- 206
GH++ S +IGLG G SL+SQM + I+ +FSYCLP + KINFG
Sbjct: 201 GHESGGGFGFASG---VIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFGQ 257
Query: 207 -GIVAGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLPL 262
+V+G GVVSTPLI ++ +YY++LEAIS+GN+R S+ GN+ +D+G + LP
Sbjct: 258 NAVVSGPGVVSTPLISKNPVTYYYVTLEAISIGNER-HMASAKQGNVIIDSGTTLSFLPK 316
Query: 263 EYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY----NISSQPKFPEVTIHFR-GADVK 317
E + + S + ++KA+ VK +PG LC+ N+++ P +T F GA+V
Sbjct: 317 ELYDGVVSSLLKVVKAKRVK----DPGNFWDLCFDDGINVATSSGIPIITAQFSGGANVN 372
Query: 318 LSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQI---NFLIGYDIEQAMVSFKPSRCT 372
L P N F+ +++ + C + +G I + NFLIGYD+E +SFKP+ CT
Sbjct: 373 LLPVNTFQKVANNVNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAKRLSFKPTVCT 430
>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 396
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 141/361 (39%), Positives = 197/361 (54%), Gaps = 38/361 (10%)
Query: 26 AEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSS 85
A + + +YLM L +GTPP +I +DTGS+ TWTQC PC + C++Q P+FDP KSS
Sbjct: 56 ANTVFDNSVYLMKLQVGTPPFEIQAIIDTGSEITWTQCLPC--VHCYEQNAPIFDPSKSS 113
Query: 86 TYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
T+ C C Y Y Y + G LATET+T +STSG P M
Sbjct: 114 TFKEKRCDGHSCP------------YEVDYFDHTY---TMGTLATETITLHSTSGEPFVM 158
Query: 146 PNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINF 205
P I GCGH N S S G++GL G SSLI+QMG G SYC QG+SKINF
Sbjct: 159 PETIIGCGHNNSWFKPSFS---GMVGLNWGPSSLITQMGGEYPGLMSYCFSGQGTSKINF 215
Query: 206 G--GIVAGAGVVSTPLIIRD----HYYLSLEAISVGNQRLEFVSSS----TGNIFVDTGV 255
G IVAG GVVST + + YYL+L+A+SVGN R+E + ++ GNI +D+G
Sbjct: 216 GANAIVAGDGVVSTTMFMTTAKPGFYYLNLDAVSVGNTRIETMGTTFHALEGNIVIDSGT 275
Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFR-GA 314
T P+ Y + ++ + +++ A A+P +D+LCYN + FP +T+HF G
Sbjct: 276 TLTYFPVSYCNLVRQAVEHVVTAVR----AADPTGNDMLCYNSDTIDIFPVITMHFSGGV 331
Query: 315 DVKLSPSNLFRNISD-EIMCSAF--RGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
D+ L N++ ++ + C A ++G Q NFL+GYD +VSF P+ C
Sbjct: 332 DLVLDKYNMYMESNNGGVFCLAIICNSPTQEAIFGNRAQNNFLVGYDSSSLLVSFSPTNC 391
Query: 372 T 372
+
Sbjct: 392 S 392
>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 412
Score = 228 bits (580), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 150/408 (36%), Positives = 217/408 (53%), Gaps = 68/408 (16%)
Query: 4 SQKLPFYNDNETPKSPISII---------YQAEIIS-----VDDI---------YLMHLS 40
S + PFYN ET IS I Y + S + D+ Y+M S
Sbjct: 36 SSRSPFYNPKETQIQRISSILNYSINRVRYLNHVFSFSPNKIQDVPLSSFMGAGYVMSYS 95
Query: 41 IGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV 100
IGTPP ++ +DTG+D W QC+PC C Q P+F P KSSTY +I C+S C
Sbjct: 96 IGTPPFQLYSLIDTGNDNIWFQCKPCKP--CLNQTSPMFHPSKSSTYKTIPCTSPICK-- 151
Query: 101 TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASP 160
D Y L +TLT NS +G P+ N++ GCGH+N
Sbjct: 152 -----NADGHY----------------LGVDTLTLNSNNGTPISFKNIVIGCGHRNQGP- 189
Query: 161 TSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----PDQGSSKINFG--GIVAGAG 213
+ +G IGL G S ISQ+ +SI GKFSYCL + SSK++FG V+G G
Sbjct: 190 -LEGYVSGNIGLARGPLSFISQLNSSIGGKFSYCLVPLFSKENVSSKLHFGDKSTVSGLG 248
Query: 214 VVSTPLIIRDHYYLSLEAISVGNQRLEFVSS-STGNIFVDTGVLRTLLPLEYHSNLKSVM 272
VSTP+ + Y++SLEA SVG+ ++ +S + GN +D+G T+LP + +S L+SV+
Sbjct: 249 TVSTPIKEENGYFVSLEAFSVGDHIIKLENSDNRGNSIIDSGTTMTILPKDVYSRLESVV 308
Query: 273 SNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTI---HFRGADVKLSPSNLFRNISD 329
+M+K + VK +P LCY +S +V I HF G++V L+ N F I+D
Sbjct: 309 LDMVKLKRVK----DPSQQFNLCYQTTSTTLLTKVLIITAHFSGSEVHLNALNTFYPITD 364
Query: 330 EIMCSAF-RGGNAN--IVYGRIMQINFLIGYDIEQAMVSFKPSRCTNY 374
E++C AF GGN + ++G ++Q NFL+G+D+ + +SFKP+ CT +
Sbjct: 365 EVICFAFVSGGNFSSLAIFGNVVQQNFLVGFDLNKKTISFKPTDCTKH 412
>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 392
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 139/355 (39%), Positives = 195/355 (54%), Gaps = 40/355 (11%)
Query: 33 DIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISC 92
+IYLM L +GTPP +I +DTGSD WTQC PC +C+ Q P+FDP SST+
Sbjct: 59 NIYLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCT--NCYSQYAPIFDPSNSSTFKE--- 113
Query: 93 SSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
C+ C Y +Y A ++S G LATET+T +STSG P MP GC
Sbjct: 114 ---------KRCNGNSCHYKIIY---ADTTYSKGTLATETVTIHSTSGEPFVMPETTIGC 161
Query: 153 GHK-NLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFG--GIV 209
GH + PT +G++GL G SSLI+QMG G SYC QG+SKINFG IV
Sbjct: 162 GHNSSWFKPTF----SGMVGLSWGPSSLITQMGGEYPGLMSYCFASQGTSKINFGTNAIV 217
Query: 210 AGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSS----TGNIFVDTGVLRTLLP 261
AG GVVST + + YYL+L+A+SVG+ +E + ++ GNI +D+G T P
Sbjct: 218 AGDGVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSGTTLTYFP 277
Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFR-GADVKLSP 320
+ Y + ++ + + + A A+P +D+LCY + FP +T+HF GAD+ L
Sbjct: 278 VSYCNLVREAVDHYVTAVRT----ADPTGNDMLCYYTDTIDIFPVITMHFSGGADLVLDK 333
Query: 321 SNLF-RNISDEIMCSAFRGGN--ANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
N++ I+ C A N + ++G Q NFL+GYD +VSF P+ C+
Sbjct: 334 YNMYIETITRGTFCLAIICNNPPQDAIFGNRAQNNFLVGYDSSSLLVSFSPTNCS 388
>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 226 bits (575), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 150/373 (40%), Positives = 208/373 (55%), Gaps = 29/373 (7%)
Query: 20 ISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLF 79
I I QA I + +LM + IGTPP+ I G VDTGSD W QC PC L C+KQ P+F
Sbjct: 53 IQNIVQAPINAYIGQHLMEIYIGTPPIKITGLVDTGSDLIWIQCAPC--LGCYKQIKPMF 110
Query: 80 DPKKSSTYNSISCSSSQCAVV-TSNCS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNS 137
DP KSSTYN+ISC S C + T CS E C+Y++ YG S + G LA +T TF S
Sbjct: 111 DPLKSSTYNNISCDSPLCHKLDTGVCSPEKRCNYTYGYGDN---SLTKGVLAQDTATFTS 167
Query: 138 TSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG-KFSYCLP 196
+G PV + +FGCGH N + + G+IGLG G +SLISQ+G G KFS CL
Sbjct: 168 NTGKPVSLSRFLFGCGHNNTGG--FNDHEMGLIGLGGGPTSLISQIGPLFGGKKFSQCLV 225
Query: 197 D-----QGSSKINF--GGIVAGAGVVSTPLIIRDH---YYLSLEAISVGNQRLEFVSS-S 245
+ SS+++F G V G GVV+TPL+ R+ Y+++L ISV + S+
Sbjct: 226 PFLTDIKISSRMSFGKGSQVLGNGVVTTPLVPREKDTSYFVTLLGISVEDTYFPMNSTIG 285
Query: 246 TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFP 305
N+ VD+G LLP + + + + + N + +P+ +P LCY + K P
Sbjct: 286 KANMLVDSGTPPILLPQQLYDKVFAEVRNKVALKPIT---DDPSLGTQLCYRTQTNLKGP 342
Query: 306 EVTIHFRGADVKLSPSNLFRNISDE---IMCSAF--RGGNANIVYGRIMQINFLIGYDIE 360
+T HF GA+V L+P F + + I C A R + VYG Q N+LIG+D++
Sbjct: 343 TLTFHFVGANVLLTPIQTFIPPTPQTKGIFCLAIYNRTNSDPGVYGNFAQSNYLIGFDLD 402
Query: 361 QAMVSFKPSRCTN 373
+ +VSFKP+ CT
Sbjct: 403 RQVVSFKPTDCTK 415
>gi|297846526|ref|XP_002891144.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297336986|gb|EFH67403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 445
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 145/380 (38%), Positives = 206/380 (54%), Gaps = 44/380 (11%)
Query: 25 QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
Q+ +IS Y M +SIGTPP DTGSD TW QC+PC + C+KQ PLFD KKS
Sbjct: 75 QSGLISNGGEYFMSISIGTPPSKFLAIADTGSDLTWVQCKPCQQ--CYKQNTPLFDKKKS 132
Query: 85 STYNSISCSSSQCAVVTSN---CSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTS 139
STY + SC S C ++ + C E C Y + YG SF+ G +ATET++ +S+S
Sbjct: 133 STYKTESCDSITCNALSEHEEGCDESRNACKYRYSYGD---ESFTKGEVATETISIDSSS 189
Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD-- 197
G PV P FGCG+ N + +G GP SL+SQ+G+SI KFSYCL
Sbjct: 190 GSPVSFPGTAFGCGYNNGGTFEETGSGIIGLGGGP--LSLVSQLGSSIGKKFSYCLSHTS 247
Query: 198 ---QGSSKINFG------GIVAGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFV--- 242
G+S IN G + +++TPLI +D +Y+L+LEAI+VG +L +
Sbjct: 248 ATTNGTSVINLGTNSMTSKPSKDSAILTTPLIQKDPETYYFLTLEAITVGKTKLPYTGGG 307
Query: 243 -------SSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLC 295
S TGNI +D+G TLL ++ + +V+ + K V G +L
Sbjct: 308 GYSLNRKSKKTGNIIIDSGTTLTLLDSGFYDDFGAVVEESVTG--AKRVSDPQG---ILT 362
Query: 296 YNISSQPK---FPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQIN 352
+ S K P +T+HF GADVKLSP N F +S++I+C + +YG ++Q++
Sbjct: 363 HCFKSGDKEIGLPTITMHFTGADVKLSPINSFVKLSEDIVCLSMIPTTEVAIYGNMVQMD 422
Query: 353 FLIGYDIEQAMVSFKPSRCT 372
FL+GYD+E VSF+ C+
Sbjct: 423 FLVGYDLETKTVSFQRMDCS 442
>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 413
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 147/373 (39%), Positives = 208/373 (55%), Gaps = 30/373 (8%)
Query: 20 ISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLF 79
I I QA I + YLM L IGTPP+ I G+VDTGSD W QC PC L C+ Q P+F
Sbjct: 49 IQDIVQAPINAYIGQYLMELYIGTPPIKISGTVDTGSDLIWVQCVPC--LGCYNQINPMF 106
Query: 80 DPKKSSTYNSISCSSSQC-AVVTSNCS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNS 137
DP KSSTY +ISC S C CS E C Y++ Y A +S + G LA ET+T S
Sbjct: 107 DPLKSSTYTNISCDSPLCYKPYIGECSPEKRCDYTYGY---ADSSLTKGVLAQETVTLTS 163
Query: 138 TSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG-KFSYCLP 196
+G P+ + ++FGCGH N + + + G+IGLG G +SL+SQ+G G KFS CL
Sbjct: 164 NTGKPISLQGILFGCGHNNTGN--FNDHEMGLIGLGGGPTSLVSQIGPLFGGKKFSQCLV 221
Query: 197 D-----QGSSKINF--GGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSS- 244
SS+++F G V G GVV+TPL+ R+ YY++L ISV + L S+
Sbjct: 222 PFLTDITISSQMSFGKGSEVLGEGVVTTPLVQREQDMTSYYVTLLGISVEDTYLPMNSTI 281
Query: 245 STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKF 304
GN+ VD+G +LP + + + + N + +P+ +P LCY + K
Sbjct: 282 EKGNMLVDSGTPPNILPQQLYDRVYVEVKNKVPLEPIT---DDPSLGPQLCYRTQTNLKG 338
Query: 305 PEVTIHFRGADVKLSPSNLFRNISDE---IMCSAFRG-GNANI-VYGRIMQINFLIGYDI 359
P +T HF GA++ L+P F + E + C A N++ +YG Q N+LIG+D+
Sbjct: 339 PTLTYHFEGANLLLTPIQTFIPPTPETKGVFCLAITNCANSDPGIYGNFAQTNYLIGFDL 398
Query: 360 EQAMVSFKPSRCT 372
++ +VSFKP+ CT
Sbjct: 399 DRQIVSFKPTDCT 411
>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
Length = 437
Score = 224 bits (571), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 145/380 (38%), Positives = 205/380 (53%), Gaps = 34/380 (8%)
Query: 11 NDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELD 70
+N+ P+S + I++ E YLM IGTPPV+ + DTGSD W QC PC
Sbjct: 74 QNNKLPQS-VLILHNGE-------YLMRFYIGTPPVERLATADTGSDLIWVQCSPCA--S 123
Query: 71 CFKQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNCSE-GDCSYSFLYGRGAYASFSSG 126
CF Q PLF P KSST+ +C S C ++ C + G+C Y++ YG SFS G
Sbjct: 124 CFPQSTPLFQPLKSSTFMPTTCRSQPCTLLLPEQKGCGKSGECIYTYKYGDQY--SFSEG 181
Query: 127 NLATETLTFNSTSGL-PVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGT 185
L+TETL F+S G+ V PN FGCG N + K TGI+GLG G SL+SQ+G
Sbjct: 182 LLSTETLRFDSQGGVQTVAFPNSFFGCGLYNNITVFPSYKLTGIMGLGAGPLSLVSQIGD 241
Query: 186 SIAGKFSYCLPDQGS---SKINFGG--IVAGAGVVSTPLIIR----DHYYLSLEAISVGN 236
I KFSYCL GS SK+ FG I+ G GVVSTP+II+ +Y+L+LEA++V
Sbjct: 242 QIGHKFSYCLLPLGSTSTSKLKFGNESIITGEGVVSTPMIIKPWLPTYYFLNLEAVTVAQ 301
Query: 237 QRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY 296
+ + S+ GN+ +D+G L T L ++ N + + + + V+ V + F C+
Sbjct: 302 KTVP-TGSTDGNVIIDSGTLLTYLGESFYYNFAASLQESLAVELVQDVLSPLPF----CF 356
Query: 297 NISSQPKFPEVTIHFRGADVKLSPSNLFRNISDE-IMCSAFRGGNAN--IVYGRIMQINF 353
FPE+ F GA V L P+NLF D +C + + ++G QI+F
Sbjct: 357 PYRDNFVFPEIAFQFTGARVSLKPANLFVMTEDRNTVCLMIAPSSVSGISIFGSFSQIDF 416
Query: 354 LIGYDIEQAMVSFKPSRCTN 373
+ YD+E VSF+P+ C+
Sbjct: 417 QVEYDLEGKKVSFQPTDCSK 436
>gi|255553149|ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223543249|gb|EEF44781.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 449
Score = 224 bits (570), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 145/378 (38%), Positives = 209/378 (55%), Gaps = 39/378 (10%)
Query: 23 IYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPK 82
+ Q++I+ YLM +SIG P V+I DTGSD W QC+PC C+KQ P+FDP+
Sbjct: 81 LVQSDIVPGGGEYLMRISIGNPQVEILAIADTGSDLIWVQCQPCEM--CYKQNSPIFDPR 138
Query: 83 KSSTYNSISCSSSQCAVVTSNCSEGD-------CSYSFLYGRGAYASFSSGNLATETL-- 133
+SS+Y ++ C + C + D C Y++ YG SFS G+LA E
Sbjct: 139 RSSSYRNVLCGNEFCNKLDGEARSCDARGFVKTCGYTYSYGD---QSFSDGHLAIERFGI 195
Query: 134 --TFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKF 191
T ++TS V FGCG KN T D +GIIGLG G+ SL+SQ+G ++GKF
Sbjct: 196 GSTNSNTSAAIAYFQEVAFGCGTKN--GGTFDELGSGIIGLGGGSMSLVSQLGPKLSGKF 253
Query: 192 SYCL-----PDQGSSKINFGGIVAGAG----VVSTPLIIRD---HYYLSLEAISVGNQRL 239
SYCL +SKINFG + +G VVSTPL+ + +YYL+LEAISV N+RL
Sbjct: 254 SYCLVPTSEQSNYTSKINFGNDINISGSNYNVVSTPLLPKKPETYYYLTLEAISVENKRL 313
Query: 240 EFVS-----SSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL 294
+ + GNI +D+G T L E+ +NL S + +K + V ++P +
Sbjct: 314 PYTNLWNGEVEKGNIIIDSGTTLTFLDSEFFNNLDSAVEEAVKGERV----SDPHGLFNI 369
Query: 295 CYNISSQPKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFL 354
C+ + P +T HF GADV+L P N F + ++++C N ++G + Q+NFL
Sbjct: 370 CFKDEKAIELPIITAHFTGADVELQPVNTFAKVEEDLLCFTMIPSNDIAIFGNLAQMNFL 429
Query: 355 IGYDIEQAMVSFKPSRCT 372
+GYD+E+ VSF P+ CT
Sbjct: 430 VGYDLEKKAVSFLPTDCT 447
>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 392
Score = 224 bits (570), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 138/355 (38%), Positives = 194/355 (54%), Gaps = 40/355 (11%)
Query: 33 DIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISC 92
+IYLM L +GTPP +I +DTGSD WTQC PC +C+ Q P+FDP SST+
Sbjct: 59 NIYLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCT--NCYSQYAPIFDPSNSSTFKE--- 113
Query: 93 SSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
C+ C Y +Y A ++S G LATET+T +STSG P MP GC
Sbjct: 114 ---------KRCNGNSCHYKIIY---ADTTYSKGTLATETVTIHSTSGEPFVMPETTIGC 161
Query: 153 GHK-NLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFG--GIV 209
GH + PT +G++GL G SSLI+QMG G SYC QG+SKINFG IV
Sbjct: 162 GHNSSWFKPTF----SGMVGLSWGPSSLITQMGGEYPGLMSYCFASQGTSKINFGTNAIV 217
Query: 210 AGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSS----TGNIFVDTGVLRTLLP 261
AG GVVST + + YYL+L+A+SVG+ +E + ++ GNI +D+G T P
Sbjct: 218 AGDGVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSGTTLTYFP 277
Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFR-GADVKLSP 320
+ Y + ++ + + + A A+P +D+LCY + FP +T+HF GAD+ L
Sbjct: 278 VSYCNLVREAVDHYVTAVRT----ADPTGNDMLCYYTDTIDIFPVITMHFSGGADLVLDK 333
Query: 321 SNLF-RNISDEIMCSAFRGGN--ANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
N++ I+ C A N + ++G Q NFL+GYD +V F P+ C+
Sbjct: 334 YNMYIETITRGTFCLAIICNNPPQDAIFGNRAQNNFLVGYDSSSLLVFFSPTNCS 388
>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 711
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 139/362 (38%), Positives = 196/362 (54%), Gaps = 40/362 (11%)
Query: 26 AEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSS 85
A+ + + +YLM L +GTPP +I +DTGS+ TWTQC PC + C+KQ P+FDP KSS
Sbjct: 371 ADTVFDNSVYLMKLQVGTPPFEIEAVIDTGSEITWTQCLPC--VHCYKQNAPIFDPSKSS 428
Query: 86 TYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
T+ C + C Y Y Y + G LAT+T+T +STSG P M
Sbjct: 429 TFKE------------KRCHDHSCPYEVDYFDKTY---TKGTLATDTVTIHSTSGEPFVM 473
Query: 146 PNVIFGCGHKN-LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKIN 204
I GCG N P+ + G +GL G SLI+QMG G SYC G+SKIN
Sbjct: 474 AETIIGCGRNNSWFRPSFE----GFVGLNWGPLSLITQMGGEYPGLMSYCFAGNGTSKIN 529
Query: 205 FG--GIVAGAGVVSTPLIIRD----HYYLSLEAISVGNQRLEFVSSS----TGNIFVDTG 254
FG IV G GVVST + + YYL+L+A+SVG+ R+E + + GNI +D+G
Sbjct: 530 FGTNAIVGGGGVVSTTMFVTTARPGFYYLNLDAVSVGDTRIETLGTPFHALEGNIVIDSG 589
Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFR-G 313
T P Y + ++ + +++ A P A+P +D+LCY ++ FP +T+HF G
Sbjct: 590 TTLTYFPESYCNLVRQAVEHVVPAVP----AADPTGNDLLCYYSNTTEIFPVITMHFSGG 645
Query: 314 ADVKLSPSNLF-RNISDEIMCSAFRGGN--ANIVYGRIMQINFLIGYDIEQAMVSFKPSR 370
AD+ L N+F + S + C A N ++G Q NFL+GYD +VSFKP+
Sbjct: 646 ADLVLDKYNMFMESYSGGLFCLAIICNNPTQEAIFGNRAQNNFLVGYDSSSLLVSFKPTN 705
Query: 371 CT 372
C+
Sbjct: 706 CS 707
Score = 187 bits (476), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 123/336 (36%), Positives = 173/336 (51%), Gaps = 54/336 (16%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YLM L IGTPP ++ +DTGS+ WTQC PC L C+ Q+ P+FDP KSST+ C
Sbjct: 65 YLMKLQIGTPPFEVEAVLDTGSELIWTQCLPC--LHCYDQKAPIFDPSKSSTFKETRC-- 120
Query: 95 SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
N + C Y +Y +Y + G LATET+T +STSG+P MP I GC
Sbjct: 121 --------NTPDHSCPYKLVYDDKSY---TQGTLATETVTIHSTSGVPFVMPETIIGCSR 169
Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGV 214
N S S +GI+GL G+ SLISQM GG G GV
Sbjct: 170 NNSGSGFRPSS-SGIVGLSRGSLSLISQM----------------------GGAYPGDGV 206
Query: 215 VSTPLII----RDHYYLSLEAISVGNQRLEFVSSS----TGNIFVDTGVLRTLLPLEYHS 266
VST + R YYL+L+A+SVG+ R+E V + GNI +D+G T P+ Y +
Sbjct: 207 VSTTMFAKTAKRGQYYLNLDAVSVGDTRIETVGTPFHALNGNIVIDSGTPLTYFPVSYCN 266
Query: 267 NLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFR-GADVKLSPSNLFR 325
++ + ++ A V +P +D+LCY ++ FP +T+HF GAD+ L N++
Sbjct: 267 LVRKAVERVVTADRV----VDPSRNDMLCYYSNTIEIFPVITVHFSGGADLVLDKYNMYM 322
Query: 326 NISD-EIMCSAFRGGNAN--IVYGRIMQINFLIGYD 358
++ + C A N ++G Q NFL+GYD
Sbjct: 323 ELNRGGVFCLAIICNNPTQVAIFGNRAQNNFLVGYD 358
>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 397
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 139/354 (39%), Positives = 190/354 (53%), Gaps = 35/354 (9%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCS 93
IYLM L +GTPP +I +DTGSD WTQC PCP +C+ Q P+FDP KSST+
Sbjct: 60 IYLMRLQLGTPPFEIVAEIDTGSDLIWTQCMPCP--NCYTQFAPIFDPSKSSTFKE---- 113
Query: 94 SSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
C C Y +Y A S+S+G LATET+T STSG P M GCG
Sbjct: 114 --------KRCHGNSCPYEIIY---ADESYSTGILATETVTIQSTSGEPFVMAETSIGCG 162
Query: 154 --HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFG--GIV 209
+ NL +P + +GI+GL G SSLISQM I G SYC QG+SKINFG +V
Sbjct: 163 LNNSNLMTPGYAASSSGIVGLNMGPSSLISQMDLPIPGLISYCFSSQGTSKINFGTNAVV 222
Query: 210 AGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSS----STGNIFVDTGVLRTLLPL 262
AG G V+ + I+ YYL+L+A+SVG++R+E + + GNIF+D+G T LP
Sbjct: 223 AGDGTVAADMFIKKDQPFYYLNLDAVSVGDKRIETLGTPFHAQDGNIFIDSGTTYTYLPT 282
Query: 263 EYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFR-GADVKLSPS 321
Y NL +P ++LCYN + FP +T+HF GAD+ L
Sbjct: 283 SY-CNLVREAVAASVVA--ANQVPDPSSENLLCYNWDTMEIFPVITLHFAGGADLVLDKY 339
Query: 322 NLF-RNISDEIMCSAFRGGNANI--VYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
N++ I+ C A + ++ ++G N L+GYD ++SF P+ C+
Sbjct: 340 NMYVETITGGTFCLAIGCVDPSMPAIFGNRAHNNLLVGYDSSTLVISFSPTNCS 393
>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 395
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 137/352 (38%), Positives = 190/352 (53%), Gaps = 39/352 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YLM L IGTPP +I +DTGS+ WTQC PC + C+ Q P+FDP KSST+ I C +
Sbjct: 65 YLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPC--VHCYNQTAPIFDPSKSSTFKEIRCDT 122
Query: 95 SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
+ C Y +YG +Y + G L TET+T +STSG P MP I GCG
Sbjct: 123 H----------DHSCPYELVYGGKSY---TKGTLVTETVTIHSTSGQPFVMPETIIGCGR 169
Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFG--GIVAGA 212
N + G++GL G SLI+QMG G SYC +G+SKINFG IVAG
Sbjct: 170 NNSGFKPGFA---GVVGLDRGPKSLITQMGGEYPGLMSYCFAGKGTSKINFGANAIVAGD 226
Query: 213 GVVSTPLIIRD----HYYLSLEAISVGNQRLEFVSSS----TGNIFVDTGVLRTLLPLEY 264
GVVST + ++ YYL+L+A+SVGN R+E V + GNI +D+G T P Y
Sbjct: 227 GVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFHALKGNIVIDSGSTLTYFPESY 286
Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFR-GADVKLSPSNL 323
+ ++ + ++ A SD+LCY + FP +T+HF GAD+ L N+
Sbjct: 287 CNLVRKAVEQVVTAVRFPR-------SDILCYYSKTIDIFPVITMHFSGGADLVLDKYNM 339
Query: 324 F-RNISDEIMCSAFRGGN--ANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
+ + + + C A + ++G Q NFL+GYD +VSFKP+ C+
Sbjct: 340 YVASNTGGVFCLAIICNSPIEEAIFGNRAQNNFLVGYDSSSLLVSFKPTNCS 391
>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 389
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 137/352 (38%), Positives = 190/352 (53%), Gaps = 39/352 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YLM L IGTPP +I +DTGS+ WTQC PC + C+ Q P+FDP KSST+ I C +
Sbjct: 59 YLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPC--VHCYNQTAPIFDPSKSSTFKEIRCDT 116
Query: 95 SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
+ C Y +YG +Y + G L TET+T +STSG P MP I GCG
Sbjct: 117 H----------DHSCPYELVYGGKSY---TKGTLVTETVTIHSTSGQPFVMPETIIGCGR 163
Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFG--GIVAGA 212
N + G++GL G SLI+QMG G SYC +G+SKINFG IVAG
Sbjct: 164 NNSGFKPGFA---GVVGLDRGPKSLITQMGGEYPGLMSYCFAGKGTSKINFGANAIVAGD 220
Query: 213 GVVSTPLIIRD----HYYLSLEAISVGNQRLEFVSSS----TGNIFVDTGVLRTLLPLEY 264
GVVST + ++ YYL+L+A+SVGN R+E V + GNI +D+G T P Y
Sbjct: 221 GVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFHALKGNIVIDSGSTLTYFPESY 280
Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFR-GADVKLSPSNL 323
+ ++ + ++ A SD+LCY + FP +T+HF GAD+ L N+
Sbjct: 281 CNLVRKAVEQVVTAVRFPR-------SDILCYYSKTIDIFPVITMHFSGGADLVLDKYNM 333
Query: 324 F-RNISDEIMCSAFRGGN--ANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
+ + + + C A + ++G Q NFL+GYD +VSFKP+ C+
Sbjct: 334 YVASNTGGVFCLAIICNSPIEEAIFGNRAQNNFLVGYDSSSLLVSFKPTNCS 385
>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 418
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 139/352 (39%), Positives = 202/352 (57%), Gaps = 39/352 (11%)
Query: 41 IGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQC-AV 99
IGTPPVD G DTGSD TW QC PC L C++Q P+F+P KS++++ + C++ C AV
Sbjct: 86 IGTPPVDYLGIADTGSDLTWAQCLPC--LKCYQQLRPIFNPLKSTSFSHVPCNTQTCHAV 143
Query: 100 VTSNCS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLA 158
+C +G C YS+ YG Y S G+L E +T S+S + GCGH A
Sbjct: 144 DDGHCGVQGVCDYSYTYGDRTY---SKGDLGFEKITIGSSS------VKSVIGCGH---A 191
Query: 159 SPTSDSKQTGIIGLGPGNSSLISQMG--TSIAGKFSYCLP---DQGSSKINFG--GIVAG 211
S +G+IGLG G SL+SQM + I+ +FSYCLP + KINFG +V+G
Sbjct: 192 SSGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFGQNAVVSG 251
Query: 212 AGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNL 268
GVVSTPLI ++ +YY++LEAIS+GN+R + GN+ +D+G + LP E + +
Sbjct: 252 PGVVSTPLISKNTVTYYYITLEAISIGNER-HMAFAKQGNVIIDSGTTLSFLPKELYDGV 310
Query: 269 KSVMSNMIKAQPVKGVGAEPGFSDVLCY----NISSQPKFPEVTIHFR-GADVKLSPSNL 323
S + ++KA+ VK +PG LC+ N+++ P +T F GA+V L P N
Sbjct: 311 VSSLLKVVKAKRVK----DPGNFWDLCFDDGINVATSSGIPIITAQFSGGANVNLLPVNT 366
Query: 324 FRNISDEIMCSAFRGGNANIVYGRIMQI---NFLIGYDIEQAMVSFKPSRCT 372
F+ +++ + C + +G I + NFLIGYD+E +SFKP+ CT
Sbjct: 367 FQKVANNVNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAKRLSFKPTVCT 418
>gi|15226358|ref|NP_180389.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4803959|gb|AAD29831.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252998|gb|AEC08092.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 756
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 136/354 (38%), Positives = 190/354 (53%), Gaps = 36/354 (10%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCS 93
IYLM L +GTPP +I +DTGSD WTQC PCP +C+ Q P+FDP KSST+
Sbjct: 420 IYLMKLQVGTPPFEIVAEIDTGSDIIWTQCMPCP--NCYSQFAPIFDPSKSSTFRE---- 473
Query: 94 SSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
C+ C Y +Y Y S G LATET+T STSG P M GCG
Sbjct: 474 --------QRCNGNSCHYEIIYADKTY---SKGILATETVTIPSTSGEPFVMAETKIGCG 522
Query: 154 --HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFG--GIV 209
+ NL S +GI+GL G SLISQM G SYC QG+SKINFG IV
Sbjct: 523 LDNTNLQYSGFASSSSGIVGLNMGPLSLISQMDLPYPGLISYCFSGQGTSKINFGTNAIV 582
Query: 210 AGAGVVSTPLIIRDH---YYLSLEAISVGNQRLEFV----SSSTGNIFVDTGVLRTLLPL 262
AG G V+ + I+ YYL+L+A+SV + + + + GNIF+D+G T P+
Sbjct: 583 AGDGTVAADMFIKKDNPFYYLNLDAVSVEDNLIATLGTPFHAEDGNIFIDSGTTLTYFPM 642
Query: 263 EYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFR-GADVKLSPS 321
Y + ++ + ++ A V +G++ ++LCY + FP +T+HF GAD+ L
Sbjct: 643 SYCNLVREAVEQVVTAVKVPDMGSD----NLLCYYSDTIDIFPVITMHFSGGADLVLDKY 698
Query: 322 NLF-RNISDEIMCSAFRGGNANI--VYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
N++ I+ I C A + ++ V+G Q NFL+GYD ++SF P+ C+
Sbjct: 699 NMYLETITGGIFCLAIGCNDPSMPAVFGNRAQNNFLVGYDPSSNVISFSPTNCS 752
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 134/341 (39%), Positives = 181/341 (53%), Gaps = 36/341 (10%)
Query: 33 DIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISC 92
+IYLM L +GTPP +I +DTGSD WTQC PCP DC+ Q P+FDP KSST+N
Sbjct: 80 NIYLMKLQVGTPPFEIAAEIDTGSDLIWTQCMPCP--DCYSQFDPIFDPSKSSTFNE--- 134
Query: 93 SSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
C C Y +Y Y S G LATET+T +STSG P M GC
Sbjct: 135 ---------QRCHGKSCHYEIIYEDNTY---SKGILATETVTIHSTSGEPFVMAETTIGC 182
Query: 153 GHKN--LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFG--GI 208
G N L + S +GI+GL G SLISQM G SYC QG+SKINFG I
Sbjct: 183 GLHNTDLDNSGFASSSSGIVGLNMGPRSLISQMDLPYPGLISYCFSGQGTSKINFGTNAI 242
Query: 209 VAGAGVVSTPLIIRDH---YYLSLEAISVGNQRLEFV----SSSTGNIFVDTGVLRTLLP 261
VAG G V+ + I+ YYL+L+A+SV + R+E + + GNI +D+G T P
Sbjct: 243 VAGDGTVAADMFIKKDNPFYYLNLDAVSVEDNRIETLGTPFHAEDGNIVIDSGSTVTYFP 302
Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFR-GADVKLSP 320
+ Y + ++ + ++ A V +P +D+LCY + FP +T+HF GAD+ L
Sbjct: 303 VSYCNLVRKAVEQVVTAVRVP----DPSGNDMLCYFSETIDIFPVITMHFSGGADLVLDK 358
Query: 321 SNLF-RNISDEIMCSAF--RGGNANIVYGRIMQINFLIGYD 358
N++ + S + C A ++G Q NFL+GYD
Sbjct: 359 YNMYMESNSGGLFCLAIICNSPTQEAIFGNRAQNNFLVGYD 399
>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
Length = 445
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 139/365 (38%), Positives = 203/365 (55%), Gaps = 39/365 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y M +SIGTPP+++ DTGSD W QC+PC E C+KQ+ P+F+PK+SSTY + C +
Sbjct: 94 YFMRISIGTPPIEVLVIADTGSDLIWVQCQPCQE--CYKQKSPIFNPKQSSTYRRVLCET 151
Query: 95 SQCAVVTSN---CSE----GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
C + S+ CS C YS+ YG SF+ G LATE ST+ E+
Sbjct: 152 RYCNALNSDMRACSAHGFFKACGYSYSYGD---HSFTMGYLATERFIIGSTNNSIQELA- 207
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-PDQGSSKINFG 206
FGCG+ N D +GI+GLG G+ SLISQ+GT I KFSYCL P S + G
Sbjct: 208 --FGCGNSN--GGNFDEVGSGIVGLGGGSLSLISQLGTKIDNKFSYCLVPILEKSNFSLG 263
Query: 207 GIVAGAG--------VVSTPLIIRD---HYYLSLEAISVGNQRLEFVSS------STGNI 249
IV G VSTPL+ ++ YYL+LEAISVGN+RL + +S GNI
Sbjct: 264 KIVFGDNSFISGSDTYVSTPLVSKEPETFYYLTLEAISVGNERLAYENSRNDGNVEKGNI 323
Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTI 309
+D+G T L + ++ L+ V+ ++ + V ++P +C+ + P +T+
Sbjct: 324 IIDSGTTLTFLDSKLYNKLELVLEKAVEGERV----SDPNGIFSICFRDKIGIELPIITV 379
Query: 310 HFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPS 369
HF ADV+L P N F ++++C N ++G + Q+NFL+GYD+++ VSF P+
Sbjct: 380 HFTDADVELKPINTFAKAEEDLLCFTMIPSNGIAIFGNLAQMNFLVGYDLDKNCVSFMPT 439
Query: 370 RCTNY 374
C+ +
Sbjct: 440 DCSGH 444
>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 435
Score = 213 bits (543), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 141/382 (36%), Positives = 202/382 (52%), Gaps = 35/382 (9%)
Query: 9 FYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPE 68
F ++N+ P+S +I YLM IG+PPV+ VDTGS W QC PC
Sbjct: 71 FLDENKLPES--------LLIPDKGEYLMRFYIGSPPVERLAMVDTGSSLIWLQCSPCH- 121
Query: 69 LDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVT---SNCSE-GDCSYSFLYGRGAYASFS 124
+CF QE PLF+P KSSTY +C S C ++ +C + G C Y +YG SFS
Sbjct: 122 -NCFPQETPLFEPLKSSTYKYATCDSQPCTLLQPSQRDCGKLGQCIYGIMYGD---KSFS 177
Query: 125 SGNLATETLTFNSTSGL-PVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQM 183
G L TETL+F ST G V PN IFGCG N + + +K GI GLG G SL+SQ+
Sbjct: 178 VGILGTETLSFGSTGGAQTVSFPNTIFGCGVDNNFTIYTSNKVMGIAGLGAGPLSLVSQL 237
Query: 184 GTSIAGKFSYCL---PDQGSSKINFG--GIVAGAGVVSTPLIIR----DHYYLSLEAISV 234
G I KFSYCL +SK+ FG I+ GVVSTPLII+ +Y+L+LEA+++
Sbjct: 238 GAQIGHKFSYCLLPYDSTSTSKLKFGSEAIITTNGVVSTPLIIKPSLPTYYFLNLEAVTI 297
Query: 235 GNQRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL 294
G Q++ + GNI +D+G T L +++N + + + + ++ + +
Sbjct: 298 G-QKVVSTGQTDGNIVIDSGTPLTYLENTFYNNFVASLQETLGVKLLQDLPSPLK----T 352
Query: 295 CYNISSQPKFPEVTIHFRGADVKLSPSNLFRNISD-EIMCSAF--RGGNANIVYGRIMQI 351
C+ + P++ F GA V L P N+ ++D I+C A G ++G I Q
Sbjct: 353 CFPNRANLAIPDIAFQFTGASVALRPKNVLIPLTDSNILCLAVVPSSGIGISLFGSIAQY 412
Query: 352 NFLIGYDIEQAMVSFKPSRCTN 373
+F + YD+E VSF P+ C
Sbjct: 413 DFQVEYDLEGKKVSFAPTDCAK 434
>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 396
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 150/387 (38%), Positives = 212/387 (54%), Gaps = 34/387 (8%)
Query: 8 PFYNDNETPKSPI-SIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPC 66
PFY +E + S + S + YLM L++GTPPVD++G VDTGSD W QC PC
Sbjct: 22 PFYKSDELHMHRLGSNGVFTRVTSNNGDYLMKLTLGTPPVDVYGLVDTGSDLVWAQCTPC 81
Query: 67 PELDCFKQEPPLFDPKKSSTYNSISCSSSQC-AVVTSNCS-EGDCSYSFLYGRGAYASFS 124
C++Q+ P+F+P +S+TY I C S +C ++ +CS + C+YS+ Y A +S +
Sbjct: 82 Q--GCYRQKSPMFEPLRSNTYTPIPCDSEECNSLFGHSCSPQKLCAYSYAY---ADSSVT 136
Query: 125 SGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMG 184
G LA ET+TF+ST G PV + +++FGCGH N S T + GIIGLG G SL+SQ G
Sbjct: 137 KGVLARETVTFSSTDGEPVVVGDIVFGCGHSN--SGTFNENDMGIIGLGGGPLSLVSQFG 194
Query: 185 TSIAGK-FSYCLPDQGSSKINFGGI-------VAGAGVVSTPLIIRDH---YYLSLEAIS 233
K FS CL + G I V+G GV +TPL+ + Y ++LE IS
Sbjct: 195 NLYGSKRFSQCLVPFHADPHTLGTISFGDASDVSGEGVAATPLVSEEGQTPYLVTLEGIS 254
Query: 234 VGNQRLEFVSS---STGNIFVDTGVLRTLLPLEYHSNL---KSVMSNMIKAQPVKGVGAE 287
VG+ + F SS S GNI +D+G T LP E++ L V SNM+ + +
Sbjct: 255 VGDTFVSFNSSEMLSKGNIMIDSGTPATYLPQEFYDRLVKELKVQSNMLP------IDDD 308
Query: 288 PGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRG-GNANIVYG 346
P LCY + + P + HF GADV+L P F D + C A G + ++G
Sbjct: 309 PDLGTQLCYRSETNLEGPILIAHFEGADVQLMPIQTFIPPKDGVFCFAMAGTTDGEYIFG 368
Query: 347 RIMQINFLIGYDIEQAMVSFKPSRCTN 373
Q N LIG+D+++ VSFK + C+N
Sbjct: 369 NFAQSNVLIGFDLDRKTVSFKATDCSN 395
>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 440
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 134/392 (34%), Positives = 206/392 (52%), Gaps = 34/392 (8%)
Query: 1 AQNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTW 60
A++ ++L +++ +SP +I E I+ YLM IGTPPV+ F DTGSD W
Sbjct: 63 ARSKRRLRLSQNDD--RSPGTITIPDEPITE---YLMRFYIGTPPVERFAIADTGSDLIW 117
Query: 61 TQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNC--SEGDCSYSFLY 115
QC PC + C Q PLFDP+KSST+ ++ C S C ++ C G C Y ++Y
Sbjct: 118 VQCAPCEK--CVPQNAPLFDPRKSSTFKTVPCDSQPCTLLPPSQRACVGKSGQCYYQYIY 175
Query: 116 GRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPG 175
G + SG L E++ F S + ++ P + FGC N + + G++GLG G
Sbjct: 176 GD---HTLVSGILGFESINFGSKNN-AIKFPKLTFGCTFSNNDTVDESKRNMGLVGLGVG 231
Query: 176 NSSLISQMGTSIAGKFSYCLP---DQGSSKINFGG---IVAGAGVVSTPLIIR----DHY 225
SLISQ+G I KFSYC P +SK+ FG + GVVSTPLII+ +Y
Sbjct: 232 PLSLISQLGYQIGRKFSYCFPPLSSNSTSKMRFGNDAIVKQIKGVVSTPLIIKSIGPSYY 291
Query: 226 YLSLEAISVGNQRLEFVSSST-GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGV 284
YL+LE +S+GN++++ S T GNI +D+G T+L +++ +++ + + VK
Sbjct: 292 YLNLEGVSIGNKKVKTSESQTDGNILIDSGTSFTILKQSFYNKFVALVKEVYGVEAVK-- 349
Query: 285 GAEPGFSDVLCY-NISSQPKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAF--RGGNA 341
P C+ N + +FP+V F GA V++ SNLF + ++C
Sbjct: 350 --IPPLVYNFCFENKGKRKRFPDVVFLFTGAKVRVDASNLFEAEDNNLLCMVALPTSDED 407
Query: 342 NIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
+ ++G QI + + YD++ MVSF P+ C
Sbjct: 408 DSIFGNHAQIGYQVEYDLQGGMVSFAPADCAK 439
>gi|15222357|ref|NP_174430.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322538|gb|AAG51267.1|AC027135_8 chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|67633408|gb|AAY78629.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332193236|gb|AEE31357.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 445
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 137/377 (36%), Positives = 202/377 (53%), Gaps = 38/377 (10%)
Query: 25 QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
Q+ +IS Y M +SIGTPP +F DTGSD TW QC+PC + C+KQ PLFD KKS
Sbjct: 75 QSGLISNGGEYFMSISIGTPPSKVFAIADTGSDLTWVQCKPCQQ--CYKQNSPLFDKKKS 132
Query: 85 STYNSISCSSSQCAVVTSN---CSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTS 139
STY + SC S C ++ + C E C Y + YG SF+ G++ATET++ +S+S
Sbjct: 133 STYKTESCDSKTCQALSEHEEGCDESKDICKYRYSYGDN---SFTKGDVATETISIDSSS 189
Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD-- 197
G V P +FGCG+ N + +G GP SL+SQ+G+SI KFSYCL
Sbjct: 190 GSSVSFPGTVFGCGYNNGGTFEETGSGIIGLGGGP--LSLVSQLGSSIGKKFSYCLSHTA 247
Query: 198 ---QGSSKINFG------GIVAGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFV--- 242
G+S IN G + ++TPLI +D +Y+L+LEA++VG +L +
Sbjct: 248 ATTNGTSVINLGTNSIPSNPSKDSATLTTPLIQKDPETYYFLTLEAVTVGKTKLPYTGGG 307
Query: 243 -------SSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLC 295
S TGNI +D+G TLL ++ + + + + K V G
Sbjct: 308 YGLNGKSSKRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTG--AKRVSDPQGLLTHCF 365
Query: 296 YNISSQPKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLI 355
+ + P +T+HF ADVKLSP N F ++++ +C + +YG ++Q++FL+
Sbjct: 366 KSGDKEIGLPAITMHFTNADVKLSPINAFVKLNEDTVCLSMIPTTEVAIYGNMVQMDFLV 425
Query: 356 GYDIEQAMVSFKPSRCT 372
GYD+E VSF+ C+
Sbjct: 426 GYDLETKTVSFQRMDCS 442
>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
Length = 440
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 135/363 (37%), Positives = 190/363 (52%), Gaps = 31/363 (8%)
Query: 28 IISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTY 87
II + YLM + IGTP V+ DTGSD TW QC PC CF Q PL+DP SST+
Sbjct: 89 IIPNNGNYLMRIYIGTPSVERLAIADTGSDLTWVQCSPCDNTKCFAQNTPLYDPLNSSTF 148
Query: 88 NSISCSSSQCAVVTSN---CSE-GDCSYSFLYGRGAYA----SFSSGNLATETLTFNSTS 139
+ C S C + + CS+ GDC Y++ YG +Y+ S S L L +NS
Sbjct: 149 TLLPCDSQPCTQLPYSQYVCSDYGDCIYAYTYGDNSYSYGGLSSDSIRLMLLQLHYNS-- 206
Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL---P 196
+ FGCG +N + K TGI+GLG G SL+SQ+G I KFSYCL
Sbjct: 207 -------KICFGCGFQNKFTADKSGKTTGIVGLGAGPLSLVSQLGDEIGHKFSYCLLPFS 259
Query: 197 DQGSSKINFG--GIVAGAGVVSTPLIIRDH---YYLSLEAISVGNQRLEFVSSSTGNIFV 251
+SK+ FG IV G GVVSTPLII+ YYL+LE I+VG + ++ + GNI +
Sbjct: 260 SNSNSKLKFGEAAIVQGNGVVSTPLIIKPDLPFYYLNLEGITVGAKTVK-TGQTDGNIII 318
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHF 311
D+G T L +++ S++ + + + + F +S+ P+V HF
Sbjct: 319 DSGSTLTYLEESFYNEFVSLVKETVAVEEDQYIPYPFDFCFTYKEGMSTP---PDVVFHF 375
Query: 312 RGADVKLSPSNLFRNISDEIMCSAFRGGNAN--IVYGRIMQINFLIGYDIEQAMVSFKPS 369
G DV L P N I D ++CS + + ++G + QI+F +GYDI+ VSF P+
Sbjct: 376 TGGDVVLKPMNTLVLIEDNLICSTVVPSHFDGIAIFGNLGQIDFHVGYDIQGGKVSFAPT 435
Query: 370 RCT 372
C+
Sbjct: 436 DCS 438
>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 207 bits (526), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 148/371 (39%), Positives = 209/371 (56%), Gaps = 41/371 (11%)
Query: 25 QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
++ II +LM + IGTPPV++ DTGSD TWTQC PC E CF Q P+F+P++S
Sbjct: 80 RSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRE--CFNQSQPIFNPRRS 137
Query: 85 STYNSISCSSSQCAVVTSNCSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSGL 141
S+Y +SC+S C + S D CSY + YG SF+ G+LA++ +T S
Sbjct: 138 SSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGD---RSFTYGDLASDQITIGS---- 190
Query: 142 PVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG---KFSYCLPDQ 198
++P + GCGH+N T +GIIGLG G+ SL+SQM T IAG +FSYCLP
Sbjct: 191 -FKLPKTVIGCGHQN--GGTFGGVTSGIIGLGGGSLSLVSQMRT-IAGVKPRFSYCLPTF 246
Query: 199 GSSK-----INFG--GIVAGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVS----- 243
S+ I+FG +V+G VVSTPL+ R Y+L+LEAISVG +R + +
Sbjct: 247 FSNANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAANGISAM 306
Query: 244 SSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--Q 301
++ GNI +D+G TLLP + + S ++ +IKA+ V +P LCY+
Sbjct: 307 TNHGNIIIDSGTTLTLLPRSLYYGVFSTLARVIKAKRVD----DPSGILELCYSAGQVDD 362
Query: 302 PKFPEVTIHFRG-ADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIE 360
P +T HF G ADVKL P N F ++D + C F ++G + QINF +GYD+
Sbjct: 363 LNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPATQVAIFGNLAQINFEVGYDLG 422
Query: 361 QAMVSFKPSRC 371
+SF+P C
Sbjct: 423 NKRLSFEPKLC 433
>gi|296090179|emb|CBI39998.3| unnamed protein product [Vitis vinifera]
Length = 334
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 134/376 (35%), Positives = 200/376 (53%), Gaps = 72/376 (19%)
Query: 15 TPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQ 74
TP+ P+S S + YLM +SIGTPP D++G DTGSD WTQC PC L C+KQ
Sbjct: 12 TPEPPVS--------SNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPC--LSCYKQ 61
Query: 75 EPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLT 134
+ P+FDP KS+++ +SC S QC ++ +
Sbjct: 62 KNPMFDPSKSTSFKEVSCESQQCRLLDT-------------------------------- 89
Query: 135 FNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG--KFS 192
P + N++FGCGH N S T + + G+ G G SL SQ+ +++ KFS
Sbjct: 90 -------PTSILNIVFGCGHNN--SGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFS 140
Query: 193 YCL-PDQG----SSKINFG--GIVAGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFV 242
CL P + +SKI FG V+G+ VVSTPL+ +D +Y+++L+ ISVG++ F
Sbjct: 141 QCLVPFRTDPSITSKIIFGPEAEVSGSDVVSTPLVTKDDPTYYFVTLDGISVGDKLFPFS 200
Query: 243 SSS----TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI 298
SSS GN+F+D G TLLP ++++ L + I +PV+ +P LCY
Sbjct: 201 SSSPMATKGNVFIDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQ----DPDLQPQLCYRS 256
Query: 299 SSQPKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGY 357
++ P +T HF GADV+L P N F + + + C A + + + ++G +Q+NFLIG+
Sbjct: 257 ATLIDGPILTAHFDGADVQLKPLNTFISPKEGVYCFAMQPIDGDTGIFGNFVQMNFLIGF 316
Query: 358 DIEQAMVSFKPSRCTN 373
D++ VSFK CT
Sbjct: 317 DLDGKKVSFKAVDCTK 332
>gi|255566006|ref|XP_002523991.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536718|gb|EEF38359.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 455
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 138/377 (36%), Positives = 194/377 (51%), Gaps = 40/377 (10%)
Query: 25 QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
A I S D YLM L IGTPP +I ++DTGS+ W C C DCF Q +F+P S
Sbjct: 88 HASIFSGDGNYLMKLLIGTPPTEIHAAIDTGSNVIWIPCINCK--DCFNQSSSIFNPLAS 145
Query: 85 STYNSISCSSSQCAVVTSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
STY C S QC +S+C S+ C YS + +G +A +T+T S+ G P
Sbjct: 146 STYQDAPCDSYQCETTSSSCQSDNVCLYSC--DEKHQLNCPNGRIAVDTMTLTSSDGRPF 203
Query: 144 EMPNVIFGCG---HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS 200
+P F CG +K A G+IGLG G SL S++ GKFSYCL D S
Sbjct: 204 PLPYSDFVCGNSIYKTFAG-------VGVIGLGRGALSLTSKLYHLSDGKFSYCLADYYS 256
Query: 201 ---SKINFG--GIVAGAG--VVSTPLIIRDH---YYLSLEAISVGNQRLEF------VSS 244
SKINFG ++ VVST L H YY++LE ISVG +R + +
Sbjct: 257 KQPSKINFGLQSFISDDDLEVVSTTLGHHRHSGNYYVTLEGISVGEKRQDLYYVDDPFAP 316
Query: 245 STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKG-------VGAEPGFSDVLCYN 297
GN+ +D+G + TLLP +++ L S +S I P + C+
Sbjct: 317 PVGNMLIDSGTMFTLLPKDFYDYLWSTVSYAIPENPQNHPHNSRFPFSMDNTLKLSPCFW 376
Query: 298 ISSQPKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGN--ANIVYGRIMQINFLI 355
+ KFP++TIHF ADV+LS N F ++++++C AF + VYG Q+NF++
Sbjct: 377 YYPELKFPKITIHFTDADVELSDDNSFIRVAEDVVCFAFAATQPGQSTVYGSWQQMNFIL 436
Query: 356 GYDIEQAMVSFKPSRCT 372
GYD+++ VSFK + C+
Sbjct: 437 GYDLKRGTVSFKRTDCS 453
>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 436
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 143/405 (35%), Positives = 197/405 (48%), Gaps = 50/405 (12%)
Query: 8 PFYNDNETP-----KSPISIIYQAEIISVDDI----------------YLMHLSIGTPPV 46
PFY + TP + + IYQ S D+ YLM IGTPPV
Sbjct: 42 PFYKPSLTPSDRIINTALRSIYQLNRASHSDLNEKKTLERVRIPNHGEYLMRFYIGTPPV 101
Query: 47 DIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA---VVTSN 103
+ DT SD W QC PC CF Q+ PLF+P KSST+ ++SC S C +
Sbjct: 102 ERLAIADTASDLIWVQCSPCET--CFPQDTPLFEPHKSSTFANLSCDSQPCTSSNIYYCP 159
Query: 104 CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSD 163
C Y+ YG G S + G L TE++ F S + V P IFGCG N
Sbjct: 160 LVGNLCLYTNTYGDG---SSTKGVLCTESIHFGSQT---VTFPKTIFGCGSNNDFMHQIS 213
Query: 164 SKQTGIIGLGPGNSSLISQMGTSIAGKFSYC-LPDQGSS--KINFGG--IVAGAGVVSTP 218
+K TGI+GLG G SL+SQ+G I KFSYC LP +S K+ FG + G GVVSTP
Sbjct: 214 NKVTGIVGLGAGPLSLVSQLGDQIGHKFSYCLLPFTSTSTIKLKFGNDTTITGNGVVSTP 273
Query: 219 LIIRDH----YYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEYHSNLKSVM 272
LII H Y+L L I++G + L+ ++ + GNI +D G + T L + ++ N +++
Sbjct: 274 LIIDPHYPSYYFLHLVGITIGQKMLQVRTTDHTNGNIIIDLGTVLTYLEVNFYHNFVTLL 333
Query: 273 SNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNLFRNISD-EI 331
+ K P F C+ + FP++ F GA V LSP NLF D +
Sbjct: 334 REALGISETKDDIPYP-FD--FCFPNQANITFPKIVFQFTGAKVFLSPKNLFFRFDDLNM 390
Query: 332 MCSAFRG---GNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
+C A V+G + Q++F + YD + VSF P+ C+
Sbjct: 391 ICLAVLPDFYAKGFSVFGNLAQVDFQVEYDRKGKKVSFAPADCSK 435
>gi|255566002|ref|XP_002523989.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536716|gb|EEF38357.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 139/373 (37%), Positives = 193/373 (51%), Gaps = 34/373 (9%)
Query: 25 QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
Q+E+ YL+ +S+GTPP +I D D TW C+ C DC K F P +S
Sbjct: 87 QSELNFSKGNYLIKISVGTPPAEILALADITGDLTWLPCKTCQ--DCTKDGFTFF-PSES 143
Query: 85 STYNSISCSSSQCAVVT-SNCSEGDCSYSFLYG---RGAYASFSSGNLATETLTFNSTSG 140
STY S +C S QC + + C C Y L G + + + G +A +T++F+S+SG
Sbjct: 144 STYTSAACESYQCQITNGAVCQTKMCIY--LCGPLPQQRSSCTNKGLVAMDTISFHSSSG 201
Query: 141 LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL---PD 197
+ PN F CG GI+GLG G S+ SQM I G FS CL
Sbjct: 202 QALSYPNTNFICG---TFIDNWHYIGAGIVGLGRGLFSMTSQMKHLINGTFSQCLVPYSS 258
Query: 198 QGSSKINFG--GIVAGAGVVSTPLIIRDH---YYLSLEAISVGNQRL--EFVSSSTGNIF 250
+ SSKINFG G+V+G GVVSTP+ Y+L LEA+SVG R+ F S+ NI+
Sbjct: 259 KQSSKINFGLKGVVSGEGVVSTPIADDGESGAYFLFLEAMSVGGNRVANNFYSAPKSNIY 318
Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKF--PEVT 308
+D T LP +++ N+++ + I P+ E S LCY S F P +T
Sbjct: 319 IDWRTTFTSLPHDFYENVEAEVRKAINLTPIN-YNNERKLS--LCYKSESDHDFDAPPIT 375
Query: 309 IHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANI-------VYGRIMQINFLIGYDIEQ 361
+HF ADV+LSP N F + ++C AF G N VYG Q+NF++GYD++
Sbjct: 376 MHFTNADVQLSPLNTFVRMDWNVVCFAFLDGTFNATKRITHAVYGSWQQMNFIVGYDLKS 435
Query: 362 AMVSFKPSRCTNY 374
+ VSFK + CT Y
Sbjct: 436 STVSFKQADCTLY 448
>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
Length = 449
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 132/395 (33%), Positives = 213/395 (53%), Gaps = 56/395 (14%)
Query: 22 IIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDP 81
+ +Q +++ Y+M+LSIGTPP I DTGSD TW Q +PC + C+ Q+ P+FDP
Sbjct: 67 VDFQTDLLPSGGEYMMNLSIGTPPFPILAIADTGSDLTWLQSKPCDQ--CYPQKGPIFDP 124
Query: 82 KKSSTYNSISCSSSQCAVV---TSNCSE-GDCSYSFLYGRGAYASFSSGNLATETLTFNS 137
S+T++ + C+++ C + +C++ C Y++ YG +Y ++G LA++T+T +
Sbjct: 125 SNSTTFHKLPCTTAPCNALDESARSCTDPTTCGYTYSYGDHSY---TTGYLASDTVTVGN 181
Query: 138 TSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-- 195
S V++ NV FGCG +N + D + +GI+GLG GN S +SQ+G +I KFSYCL
Sbjct: 182 AS---VQIRNVAFGCGTRNGGN--FDEQGSGIVGLGGGNLSFVSQLGDTIGKKFSYCLLP 236
Query: 196 ----------PDQGSSKINFG-------GIVAGAGVVSTPLIIRD---HYYLSLEAISVG 235
+S+I FG G +TPL+ ++ +YYL++EAI+VG
Sbjct: 237 LENEISSQPSDSPATSRIVFGDNPVFSSSSTNGVVFATTPLVNKEPSTYYYLTIEAITVG 296
Query: 236 NQRLEFV---------------SSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQP 280
++L + S GNI +D+G T L E++ L++ + IK +
Sbjct: 297 RKKLLYSSSSSKTASYDSGSKSSVEEGNIIIDSGTTLTFLEEEFYGALEAALVEEIKMER 356
Query: 281 VKGVGAEPGFSDVLCYNISSQP-KFPEVTIHFRG-ADVKLSPSNLFRNISDEIMCSAFRG 338
V V FS LC+ + + P + +HFRG ADV+L P N F + ++C
Sbjct: 357 VNDV-KNSMFS--LCFKSGKEEVELPLMKVHFRGGADVELKPVNTFVRAEEGLVCFTMLP 413
Query: 339 GNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
N +YG + Q+NF++GYD+ + VSF P+ C+
Sbjct: 414 TNDVGIYGNLAQMNFVVGYDLGKRTVSFLPADCSK 448
>gi|356557010|ref|XP_003546811.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 437
Score = 191 bits (486), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 143/383 (37%), Positives = 204/383 (53%), Gaps = 38/383 (9%)
Query: 9 FYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPE 68
F ++N P+S +I + YLM L IGTPPV+ DTGSD W QC PC
Sbjct: 74 FLDENNLPES--------LLIPENGEYLMTLYIGTPPVERLAIADTGSDLIWVQCSPCQ- 124
Query: 69 LDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNCSE-GDCSYSFLYGRGAYASFS 124
+CF Q+ PLF+P KSST+ + +C S C V C + G C YS+ YG SF+
Sbjct: 125 -NCFPQDTPLFEPLKSSTFKAATCDSQPCTSVPPSQRQCGKVGQCIYSYSYGD---KSFT 180
Query: 125 SGNLATETLTFNST-SGLPVEMPNVIFGCG-HKNLASPTSDSKQTGIIGLGPGNSSLISQ 182
G + TETL+F ST V P+ IFGCG + N TSD K TG++GLG G SL+SQ
Sbjct: 181 VGVVGTETLSFGSTGDAQTVSFPSSIFGCGVYNNFTFHTSD-KVTGLVGLGGGPLSLVSQ 239
Query: 183 MGTSIAGKFSYCL---PDQGSSKINFG--GIVAGAGVVSTPLIIR----DHYYLSLEAIS 233
+G I KFSYCL +SK+ FG IV GVVSTPLII+ Y+L+LEA++
Sbjct: 240 LGPQIGYKFSYCLLPFSSNSTSKLKFGSEAIVTTNGVVSTPLIIKPLFPSFYFLNLEAVT 299
Query: 234 VGNQRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV 293
+G Q++ + GNI +D+G + T L +++N + + ++ + + + F
Sbjct: 300 IG-QKVVPTGRTDGNIIIDSGTVLTYLEQTFYNNFVASLQEVLSVESAQDL----PFPFK 354
Query: 294 LCYNISSQPKFPEVTIHFRGADVKLSPSNLFRNISDE-IMCSAFRGGNAN--IVYGRIMQ 350
C+ P + F GA V L P NL + D ++C A + + ++G + Q
Sbjct: 355 FCFPYRDM-TIPVIAFQFTGASVALQPKNLLIKLQDRNMLCLAVVPSSLSGISIFGNVAQ 413
Query: 351 INFLIGYDIEQAMVSFKPSRCTN 373
+F + YD+E VSF P+ CT
Sbjct: 414 FDFQVVYDLEGKKVSFAPTDCTK 436
>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 430
Score = 191 bits (484), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 130/386 (33%), Positives = 187/386 (48%), Gaps = 34/386 (8%)
Query: 3 NSQKLPFYNDNETPKSPISIIYQAEIISVDDI--YLMHLSIGTPPVDIFGSVDTGSDCTW 60
S+++ F P SPI I + D YLM S+GTP V+ DTGSD +W
Sbjct: 61 RSKRVNFIGQISPPLSPI-------ITPIPDHGEYLMRFSLGTPSVERLAIFDTGSDLSW 113
Query: 61 TQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSE----GDCSYSFLYG 116
QC PC C+ QE PLFDP +SSTY + C S C + N E C Y YG
Sbjct: 114 LQCTPCKT--CYPQEAPLFDPTQSSTYVDVPCESQPCTLFPQNQRECGSSKQCIYLHQYG 171
Query: 117 RGAYASFSSGNLATETLTFNSTSGLP---VEMPNVIFGCGHKNLASPTSDSKQTGIIGLG 173
SF+ G L +T++F+ST G+ P +FGC + + +K G +GLG
Sbjct: 172 TD---SFTIGRLGYDTISFSST-GMGQGGATFPKSVFGCAFYSNFTFKISTKANGFVGLG 227
Query: 174 PGNSSLISQMGTSIAGKFSYCLPDQGSS---KINFGGIVAGAGVVSTPLIIR----DHYY 226
PG SL SQ+G I KFSYC+ S+ K+ FG + VVSTP +I +Y
Sbjct: 228 PGPLSLASQLGDQIGHKFSYCMVPFSSTSTGKLKFGSMAPTNEVVSTPFMINPSYPSYYV 287
Query: 227 LSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGA 286
L+LE I+VG +++ GNI +D+ + T L +++ S + I + V
Sbjct: 288 LNLEGITVGQKKV-LTGQIGGNIIIDSVPILTHLEQGIYTDFISSVKEAINVE----VAE 342
Query: 287 EPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYG 346
+ C + FPE HF GADV L P N+F + + ++C ++G
Sbjct: 343 DAPTPFEYCVRNPTNLNFPEFVFHFTGADVVLGPKNMFIALDNNLVCMTVVPSKGISIFG 402
Query: 347 RIMQINFLIGYDIEQAMVSFKPSRCT 372
Q+NF + YD+ + VSF P+ C+
Sbjct: 403 NWAQVNFQVEYDLGEKKVSFAPTNCS 428
>gi|296085499|emb|CBI29231.3| unnamed protein product [Vitis vinifera]
Length = 308
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 125/355 (35%), Positives = 184/355 (51%), Gaps = 74/355 (20%)
Query: 25 QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
Q+ +IS YLM++S+GTPPV + G DTGSD W QC PC DC+KQ PLFDPKKS
Sbjct: 19 QSNVISGGGSYLMNISLGTPPVSMLGIADTGSDLIWRQCLPCD--DCYKQVEPLFDPKKS 76
Query: 85 STYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
TY ++ G L++ET T ST G P
Sbjct: 77 KTYKTL-----------------------------------GYLSSETFTIGSTEGDPAS 101
Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----PDQG 199
P + FGCGH N T + K +G+IGLG G SL+ Q+ + + G+FSYCL
Sbjct: 102 FPGLAFGCGHSN--GGTFNEKDSGLIGLGGGPLSLVMQLSSKVGGQFSYCLVPLSSDSTA 159
Query: 200 SSKINFG--GIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLR 257
SSKINFG +V+G+G S+P ++ NI +D+G
Sbjct: 160 SSKINFGKSAVVSGSGT-SSP-----------------------AAAEESNIIIDSGTTL 195
Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVK 317
TLLP +++++++S ++ +I Q +P + LCY+ + + P +T HF GADV+
Sbjct: 196 TLLPRDFYTDMESALTKVIGGQTT----TDPRGTFSLCYSGVKKLEIPTITAHFIGADVQ 251
Query: 318 LSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
L P N F ++++C + + ++G + Q+NFL+GYD++ VSFKP+ CT
Sbjct: 252 LPPLNTFVQAQEDLVCFSMIPSSNLAIFGNLSQMNFLVGYDLKNNKVSFKPTDCT 306
>gi|356540369|ref|XP_003538662.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Glycine max]
Length = 364
Score = 187 bits (474), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 138/355 (38%), Positives = 188/355 (52%), Gaps = 38/355 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDP-KKSSTYNSISCS 93
YLM L++GTPPVD++G VDT SD W QC PC C+KQ+ P+FDP K+ +++ SCS
Sbjct: 31 YLMKLTLGTPPVDVYGLVDTDSDLVWAQCTPC--QGCYKQKNPMFDPLKECNSFFDHSCS 88
Query: 94 SSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
E C Y + Y A S + G LA E TF+ST G P+ + ++IFGCG
Sbjct: 89 -----------PEKACDYVYAY---ADDSATKGMLAKEIATFSSTDGKPI-VESIIFGCG 133
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK-FSYCL-----PDQGSSKINFGG 207
H N + G+IGLG G SL+SQMG K FS CL S I+ G
Sbjct: 134 HNNTG--VFNENDMGLIGLGGGPLSLVSQMGNLYGSKRFSQCLVPFHADPHTSGTISLGE 191
Query: 208 I--VAGAGVVSTPLIIRDH---YYLSLEAISVGNQRLEFVSS---STGNIFVDTGVLRTL 259
V+G GVV+TPL+ + Y ++LE ISVG+ + F SS S GNI +D+G T
Sbjct: 192 ASDVSGEGVVTTPLVSEEGQTPYLVTLEGISVGDTFVPFNSSEMLSKGNIMIDSGTPETY 251
Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLS 319
LP E++ L + I P+ +P LCY + + P +T HF GADVKL
Sbjct: 252 LPQEFYDRLVEELKVQINLPPIH---VDPDLGTQLCYKSETNLEGPILTAHFEGADVKLL 308
Query: 320 PSNLFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
P F D + C A G + ++G Q N LIG+D+++ +V FKP+ T
Sbjct: 309 PLQTFIPPKDGVFCFAMTGTTDGLYIFGNFAQSNVLIGFDLDKRIVFFKPTDFTK 363
>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 461
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 135/371 (36%), Positives = 193/371 (52%), Gaps = 53/371 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
+LM LSIG P V VDTGSD WTQC+PC E CF Q P+FDP+KSS+Y+ + CSS
Sbjct: 107 FLMELSIGNPAVKYSAIVDTGSDLIWTQCKPCTE--CFDQPTPIFDPEKSSSYSKVGCSS 164
Query: 95 SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTF---NSTSGLPVEMPNVIF 150
C A+ SNC+E + +LY G Y+S + G LATET TF NS SG+ F
Sbjct: 165 GLCNALPRSNCNEDKDACEYLYTYGDYSS-TRGLLATETFTFEDENSISGIG-------F 216
Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP----DQGSSKINFG 206
GCG +N S+ +G++GLG G SLISQ+ + KFSYCL + SS + G
Sbjct: 217 GCGVENEGDGF--SQGSGLVGLGRGPLSLISQLKET---KFSYCLTSIEDSEASSSLFIG 271
Query: 207 GIVAG----------AGVVSTPLIIRD-----HYYLSLEAISVGNQRL-------EFVSS 244
+ +G V T ++R+ YYL L+ I+VG +RL E
Sbjct: 272 SLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAED 331
Query: 245 STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK- 303
TG + +D+G T L LK ++ + + PV G+ G LC+ + K
Sbjct: 332 GTGGMIIDSGTTITYLEETAFKVLKEEFTSRM-SLPVDDSGS-TGLD--LCFKLPDAAKN 387
Query: 304 --FPEVTIHFRGADVKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIE 360
P++ HF+GAD++L N + + S ++C A N ++G + Q NF + +D+E
Sbjct: 388 IAVPKMIFHFKGADLELPGENYMVADSSTGVLCLAMGSSNGMSIFGNVQQQNFNVLHDLE 447
Query: 361 QAMVSFKPSRC 371
+ VSF P+ C
Sbjct: 448 KETVSFVPTEC 458
>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 384
Score = 185 bits (470), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 124/392 (31%), Positives = 196/392 (50%), Gaps = 35/392 (8%)
Query: 2 QNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWT 61
++ +++ FY +P + S +Q+ + + + YLM L++G+PP VDTGSD W
Sbjct: 6 RSHERVAFYTLKLSPDAFGSQEFQSPVKAGNGEYLMTLTLGSPPQSFDVIVDTGSDLNWV 65
Query: 62 QCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQC---AVVTSNCSEGDCSYSFLYGRG 118
QC PC C++Q P FDP KS ++ +C+ + C A+ C+ C Y + YG
Sbjct: 66 QCLPCRV--CYQQPGPKFDPSKSRSFRKAACTDNLCNVSALPLKACAANVCQYQYTYGD- 122
Query: 119 AYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSS 178
S ++G+LA ET++ N+ +G +PN FGCG +NL + + G++GLG G S
Sbjct: 123 --QSNTNGDLAFETISLNNGAGTQ-SVPNFAFGCGTQNLG---TFAGAAGLVGLGQGPLS 176
Query: 179 LISQMGTSIAGKFSYCLPDQGS---SKINFGGIVAGAGVVSTPLIIR----DHYYLSLEA 231
L SQ+ + A KFSYCL S S + FG I A A + T +++ +YY+ L +
Sbjct: 177 LNSQLSHTFANKFSYCLVSLNSLSASPLTFGSIAAAANIQYTSIVVNARHPTYYYVQLNS 236
Query: 232 ISVGNQRLEFV--------SSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKG 283
I VG Q L S+ G +D+G T+L L +S + + + + G
Sbjct: 237 IEVGGQPLNLAPSVFAIDQSTGRGGTIIDSGTTITMLTLPAYSAVLRAYESFVNYPRLDG 296
Query: 284 VGAEPGFSDVLCYNIS--SQPKFPEVTIHFRGADVKLSPSNLF--RNISDEIMCSAFRGG 339
+ LC+NI+ S P P++ F+GAD ++ NLF + S +C A G
Sbjct: 297 ----SAYGLDLCFNIAGVSNPSVPDMVFKFQGADFQMRGENLFVLVDTSATTLCLAMGGS 352
Query: 340 NANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ G I Q N L+ YD+E + F + C
Sbjct: 353 QGFSIIGNIQQQNHLVVYDLEAKKIGFATADC 384
>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
Length = 398
Score = 185 bits (470), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 133/392 (33%), Positives = 197/392 (50%), Gaps = 47/392 (11%)
Query: 13 NETPKSP-ISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSV-DTGSDCTWTQCEPCPELD 70
+E P P +S Y++ + S Y+ +S+GTP +F + DTGSD W QC+PC
Sbjct: 17 SEVPYPPSVSTDYESPVASGGGDYVTTISLGTP-AKVFSVIADTGSDLIWIQCKPCQA-- 73
Query: 71 CFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLAT 130
CF Q+ P+FDP+ SS+Y ++SC + C + DC YS+ YG G S + G L++
Sbjct: 74 CFNQKDPIFDPEGSSSYTTMSCGDTLCDSLPRKSCSPDCDYSYGYGDG---SGTRGTLSS 130
Query: 131 ETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK 190
ET+T ST G + N+ FGCGH N S S G++GLG GN S +SQ+G K
Sbjct: 131 ETVTLTSTQGEKLAAKNIAFGCGHLNRGSFNDAS---GLVGLGRGNLSFVSQLGDLFGHK 187
Query: 191 FSYCL------PDQGSSKINFGGIVAGAG------VVSTPLI----IRDHYYLSLEAISV 234
FSYCL P + +S + FG + TP+I + YY+ L+ IS+
Sbjct: 188 FSYCLVPWRDAPSK-TSPMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISI 246
Query: 235 GNQRLEFVSSS-------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAE 287
+ L + S +G + D+G TLLP + + + + I + G A
Sbjct: 247 AGRALRIPAGSFDIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKISFPKIDGSSA- 305
Query: 288 PGFSDVLCYNISS-----QPKFPEVTIHFRGADVKLSPSNLF--RNISDEIMCSAFRGGN 340
G LCY++S + K P + HF GAD +L N F N + I+C A N
Sbjct: 306 -GLD--LCYDVSGSKASYKMKIPAMVFHFEGADYQLPVENYFIAANDAGTIVCLAMVSSN 362
Query: 341 ANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+I +YG +MQ NF + YDI + + + PS+C
Sbjct: 363 MDIGIYGNMMQQNFRVMYDIGSSKIGWAPSQC 394
>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 462
Score = 185 bits (470), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 135/371 (36%), Positives = 194/371 (52%), Gaps = 53/371 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
+LM LSIG P V VDTGSD WTQC+PC E CF Q P+FDP+KSS+Y+ + CSS
Sbjct: 108 FLMELSIGNPAVKYAAIVDTGSDLIWTQCKPCTE--CFDQPTPIFDPEKSSSYSKVGCSS 165
Query: 95 SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTF---NSTSGLPVEMPNVIF 150
C A+ SNC+E S +LY G Y+S + G LATET TF NS SG+ F
Sbjct: 166 GLCNALPRSNCNEDKDSCEYLYTYGDYSS-TRGLLATETFTFEDENSISGIG-------F 217
Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP----DQGSSKINFG 206
GCG +N S+ +G++GLG G SLISQ+ + KFSYCL + SS + G
Sbjct: 218 GCGVENEGDGF--SQGSGLVGLGRGPLSLISQLKET---KFSYCLTSIEDSEASSSLFIG 272
Query: 207 GIVAG----------AGVVSTPLIIRD-----HYYLSLEAISVGNQRL-------EFVSS 244
+ +G V T ++R+ YYL L+ I+VG +RL E
Sbjct: 273 SLASGIVNKTGANLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELSED 332
Query: 245 STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK- 303
TG + +D+G T L LK ++ + + PV G+ G LC+ + + K
Sbjct: 333 GTGGMIIDSGTTITYLEETAFKVLKEEFTSRM-SLPVDDSGS-TGLD--LCFKLPNAAKN 388
Query: 304 --FPEVTIHFRGADVKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIE 360
P++ HF+GAD++L N + + S ++C A N ++G + Q NF + +D+E
Sbjct: 389 IAVPKLIFHFKGADLELPGENYMVADSSTGVLCLAMGSSNGMSIFGNVQQQNFNVLHDLE 448
Query: 361 QAMVSFKPSRC 371
+ V+F P+ C
Sbjct: 449 KETVTFVPTEC 459
>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 461
Score = 184 bits (468), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 130/375 (34%), Positives = 200/375 (53%), Gaps = 45/375 (12%)
Query: 25 QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
+A +++ + +LM L+IG+PP +DTGSD WTQC+PC + CF Q P+FDPK+S
Sbjct: 101 KAPVVAGNGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCKPCQQ--CFDQSTPIFDPKQS 158
Query: 85 STYNSISCSSSQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
S++ ISCSS C A+ TS CS C Y + YG +S + G LA ET TF ++ +
Sbjct: 159 SSFYKISCSSELCGALPTSTCSSDGCEYLYTYGD---SSSTQGVLAFETFTFGDSTEDQI 215
Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGS 200
+P + FGCG+ N + S+ G++GLG G SL+SQ+ KF+YCL D
Sbjct: 216 SIPGLGFGCGNDN--NGDGFSQGAGLVGLGRGPLSLVSQLKEQ---KFAYCLTAIDDSKP 270
Query: 201 SKINFGGIV------AGAGVVSTPLIIR----DHYYLSLEAISVGNQRL-------EFVS 243
S + G + + + +TPLI YYLSL+ ISVG +L E
Sbjct: 271 SSLLLGSLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHD 330
Query: 244 SSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQ---PVKGVGAEPGFSDVLCYNI-- 298
+G + +D+G T + ++LK N AQ PV G G D LC+N+
Sbjct: 331 DGSGGVIIDSGTTITYVENSAFTSLK----NEFIAQMNLPVDDSGT--GGLD-LCFNLPA 383
Query: 299 -SSQPKFPEVTIHFRGADVKLSPSNLFRNISDE-IMCSAFRGGNANIVYGRIMQINFLIG 356
++Q + P++T HF+GAD++L N S ++C A ++G + Q NF++
Sbjct: 384 GTNQVEVPKLTFHFKGADLELPGENYMIGDSKAGLLCLAIGSSRGMSIFGNLQQQNFMVV 443
Query: 357 YDIEQAMVSFKPSRC 371
+D+++ +SF P++C
Sbjct: 444 HDLQEETLSFLPTQC 458
>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like, partial [Cucumis sativus]
Length = 716
Score = 184 bits (468), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 130/375 (34%), Positives = 200/375 (53%), Gaps = 45/375 (12%)
Query: 25 QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
+A +++ + +LM L+IG+PP +DTGSD WTQC+PC + CF Q P+FDPK+S
Sbjct: 356 KAPVVAGNGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCKPCQQ--CFDQSTPIFDPKQS 413
Query: 85 STYNSISCSSSQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
S++ ISCSS C A+ TS CS C Y + YG +S + G LA ET TF ++ +
Sbjct: 414 SSFYKISCSSELCGALPTSTCSSDGCEYLYTYGD---SSSTQGVLAFETFTFGDSTEDQI 470
Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGS 200
+P + FGCG+ N + S+ G++GLG G SL+SQ+ KF+YCL D
Sbjct: 471 SIPGLGFGCGNDN--NGDGFSQGAGLVGLGRGPLSLVSQLKEQ---KFAYCLTAIDDSKP 525
Query: 201 SKINFGGIV------AGAGVVSTPLIIR----DHYYLSLEAISVGNQRL-------EFVS 243
S + G + + + +TPLI YYLSL+ ISVG +L E
Sbjct: 526 SSLLLGSLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHD 585
Query: 244 SSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQ---PVKGVGAEPGFSDVLCYNI-- 298
+G + +D+G T + ++LK N AQ PV G G D LC+N+
Sbjct: 586 DGSGGVIIDSGTTITYVENSAFTSLK----NEFIAQMNLPVDDSGT--GGLD-LCFNLPA 638
Query: 299 -SSQPKFPEVTIHFRGADVKLSPSNLFRNISDE-IMCSAFRGGNANIVYGRIMQINFLIG 356
++Q + P++T HF+GAD++L N S ++C A ++G + Q NF++
Sbjct: 639 GTNQVEVPKLTFHFKGADLELPGENYMIGDSKAGLLCLAIGSSRGMSIFGNLQQQNFMVV 698
Query: 357 YDIEQAMVSFKPSRC 371
+D+++ +SF P++C
Sbjct: 699 HDLQEETLSFLPTQC 713
>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 353
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 134/369 (36%), Positives = 191/369 (51%), Gaps = 53/369 (14%)
Query: 37 MHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQ 96
M LSIG P V VDTGSD WTQC+PC E CF Q P+FDP+KSS+Y+ + CSS
Sbjct: 1 MELSIGNPAVKYSAIVDTGSDLIWTQCKPCTE--CFDQPTPIFDPEKSSSYSKVGCSSGL 58
Query: 97 C-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTF---NSTSGLPVEMPNVIFGC 152
C A+ SNC+E + +LY G Y+S + G LATET TF NS SG+ FGC
Sbjct: 59 CNALPRSNCNEDKDACEYLYTYGDYSS-TRGLLATETFTFEDENSISGIG-------FGC 110
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP----DQGSSKINFGGI 208
G +N S+ +G++GLG G SLISQ+ + KFSYCL + SS + G +
Sbjct: 111 GVENEGDGF--SQGSGLVGLGRGPLSLISQLKET---KFSYCLTSIEDSEASSSLFIGSL 165
Query: 209 VAG----------AGVVSTPLIIRD-----HYYLSLEAISVGNQRL-------EFVSSST 246
+G V T ++R+ YYL L+ I+VG +RL E T
Sbjct: 166 ASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGT 225
Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--- 303
G + +D+G T L LK ++ + + PV G+ G LC+ + K
Sbjct: 226 GGMIIDSGTTITYLEETAFKVLKEEFTSRM-SLPVDDSGS-TGLD--LCFKLPDAAKNIA 281
Query: 304 FPEVTIHFRGADVKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQA 362
P++ HF+GAD++L N + + S ++C A N ++G + Q NF + +D+E+
Sbjct: 282 VPKMIFHFKGADLELPGENYMVADSSTGVLCLAMGSSNGMSIFGNVQQQNFNVLHDLEKE 341
Query: 363 MVSFKPSRC 371
VSF P+ C
Sbjct: 342 TVSFVPTEC 350
>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
Length = 398
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 131/392 (33%), Positives = 197/392 (50%), Gaps = 47/392 (11%)
Query: 13 NETPKSP-ISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSV-DTGSDCTWTQCEPCPELD 70
+E P P +S Y++ + S Y+ +S+GTP +F + DTGSD W QC+PC
Sbjct: 17 SEVPYPPSVSTDYESPVASGGGDYVTTISLGTP-AKVFSVIADTGSDLIWIQCKPCQA-- 73
Query: 71 CFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLAT 130
CF Q+ P+FDP+ SS+Y ++SC + C + +C YS+ YG G S + G L++
Sbjct: 74 CFNQKDPIFDPEGSSSYTTMSCGDTLCDSLPRKSCSPNCDYSYGYGDG---SGTRGTLSS 130
Query: 131 ETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK 190
ET+T ST G + N+ FGCGH N S S G++GLG GN S +SQ+G K
Sbjct: 131 ETVTLTSTQGEKLAAKNIAFGCGHLNRGSFNDAS---GLVGLGRGNLSFVSQLGDLFGHK 187
Query: 191 FSYCL------PDQGSSKINFGGIVAGAG------VVSTPLI----IRDHYYLSLEAISV 234
FSYCL P + +S + FG + TP+I + YY+ L+ IS+
Sbjct: 188 FSYCLVPWRDAPSK-TSPMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISI 246
Query: 235 GNQRLEFVSSS-------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAE 287
+ L + S +G + D+G TLLP + + + + + + G A
Sbjct: 247 AGRALRIPAGSFDIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKVSFPEIDGSSA- 305
Query: 288 PGFSDVLCYNISS-----QPKFPEVTIHFRGADVKLSPSNLF--RNISDEIMCSAFRGGN 340
G LCY++S + K P + HF GAD +L N F N + I+C A N
Sbjct: 306 -GLD--LCYDVSGSKASYKKKIPAMVFHFEGADHQLPVENYFIAANDAGTIVCLAMVSSN 362
Query: 341 ANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+I +YG +MQ NF + YDI + + + PS+C
Sbjct: 363 MDIGIYGNMMQQNFRVMYDIGSSKIGWAPSQC 394
>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 372
Score = 182 bits (461), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 134/394 (34%), Positives = 196/394 (49%), Gaps = 50/394 (12%)
Query: 5 QKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCE 64
+ LP DNE+ + P S Y +L+ + +GTPP +DTGSD TW Q E
Sbjct: 3 ETLPGQTDNESYEFPESAGYGE--------FLVPIYLGTPPQKAVVIIDTGSDLTWIQSE 54
Query: 65 PCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV--TSNCS-EGDCSYSFLYGRGAYA 121
PC CF+Q P+FDP KSSTYN I+CSSS CA + T CS +C Y++ YG G
Sbjct: 55 PCRA--CFEQADPIFDPSKSSTYNKIACSSSACADLLGTQTCSAAANCIYAYGYGDG--- 109
Query: 122 SFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLIS 181
S + G + ET+T T+G V+ ++ G D+ GI+GLG G S+ S
Sbjct: 110 SVTRGYFSKETITATDTAGEEVKFGASVYNTGTFG------DTGGEGILGLGQGPVSMPS 163
Query: 182 QMGTSIAGKFSYCLPDQGS-----SKINFGGIVAGAGVVS-TPLIIR-DH---YYLSLEA 231
Q+G+ + KFSYCL D S S + FG +G V TP++ DH YY++++
Sbjct: 164 QLGSVLGNKFSYCLVDWLSAGSETSTMYFGDAAVPSGEVQYTPIVPNADHPTYYYIAVQG 223
Query: 232 ISVGNQRL-------EFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIK-AQPVKG 283
ISVG L E S +G +D+G T L E + L + ++ ++
Sbjct: 224 ISVGGSLLDIDQSVYEIDSGGSGGTIIDSGTTITYLQQEVFNALVAAYTSQVRYPTTTSA 283
Query: 284 VGAEPGFSDVLCYNI--SSQPKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGG-- 339
G + LC+N + P FP +TIH G ++L +N F ++ I+C AF
Sbjct: 284 TGLD------LCFNTRGTGSPVFPAMTIHLDGVHLELPTANTFISLETNIICLAFASALD 337
Query: 340 NANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
++G I Q NF I YD++ + F P+ C +
Sbjct: 338 FPIAIFGNIQQQNFDIVYDLDNMRIGFAPADCAS 371
>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 468
Score = 181 bits (459), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 130/369 (35%), Positives = 188/369 (50%), Gaps = 49/369 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
+LM +SIGTP + VDTGSD WTQC+PC E CF Q P+FDP SSTY+++ CSS
Sbjct: 118 FLMDMSIGTPALAYAAIVDTGSDLVWTQCKPCVE--CFNQSTPVFDPSSSSTYSTLPCSS 175
Query: 95 SQCA-VVTSNCSEG--DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
S C+ + TS C+ DC Y++ YG AS + G LA ET T T ++P V FG
Sbjct: 176 SLCSDLPTSTCTSAAKDCGYTYTYGD---ASSTQGVLAAETFTLAKT-----KLPGVAFG 227
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSKINFGGI 208
CG N ++ G++GLG G SL+SQ+G GKFSYCL D S + G +
Sbjct: 228 CGDTNEGD--GFTQGAGLVGLGRGPLSLVSQLGL---GKFSYCLTSLDDTSKSPLLLGSL 282
Query: 209 VA-------GAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSS-------TGNIF 250
A A + +TPLI YY++L+A++VG+ R+ S+ TG +
Sbjct: 283 AAISTDTASAAAIQTTPLIKNPSQPSFYYVTLKALTVGSTRIPLPGSAFAVQDDGTGGVI 342
Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS----QPKFPE 306
VD+G T L L+ + LK + +K PV G+ G LC+ + + P+
Sbjct: 343 VDSGTSITYLELQGYRPLKKAFAAQMKL-PVAD-GSAVGLD--LCFKAPASGVDDVEVPK 398
Query: 307 VTIHFR-GADVKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMV 364
+ +HF GAD+ L N + + + +C G + G Q N YD+++ +
Sbjct: 399 LVLHFDGGADLDLPAENYMVLDSASGALCLTVMGSRGLSIIGNFQQQNIQFVYDVDKDTL 458
Query: 365 SFKPSRCTN 373
SF P +C
Sbjct: 459 SFAPVQCAK 467
>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 180 bits (457), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 132/369 (35%), Positives = 196/369 (53%), Gaps = 38/369 (10%)
Query: 25 QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
+A I + + YLM L+IGTPPV +DTGSD WTQC+PC + C+KQ P+FDPKKS
Sbjct: 98 EAPIHAGNGEYLMELAIGTPPVSYPAVLDTGSDLIWTQCKPCTQ--CYKQPTPIFDPKKS 155
Query: 85 STYNSISCSSSQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
S+++ +SC SS C AV +S CS+G C Y + YG S + G LATET TF + V
Sbjct: 156 SSFSKVSCGSSLCSAVPSSTCSDG-CEYVYSYGD---YSMTQGVLATETFTFGKSKN-KV 210
Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKI 203
+ N+ FGCG N + +G++GLG G SL+SQ+ +FSYCL +K
Sbjct: 211 SVHNIGFGCGEDNEGD--GFEQASGLVGLGRGPLSLVSQLKEP---RFSYCLTPMDDTKE 265
Query: 204 ------NFGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSS-------T 246
+ G + VV+TPL+ YYLSLE ISVG+ RL S+
Sbjct: 266 SILLLGSLGKVKDAKEVVTTPLLKNPLQPSFYYLSLEGISVGDTRLSIEKSTFEVGDDGN 325
Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI---SSQPK 303
G + +D+G T + + LK + K P+ + G LC+++ S+Q +
Sbjct: 326 GGVIIDSGTTITYIEQKAFEALKKEFISQTKL-PLDKTSST-GLD--LCFSLPSGSTQVE 381
Query: 304 FPEVTIHFRGADVKLSPSNLFRNISD-EIMCSAFRGGNANIVYGRIMQINFLIGYDIEQA 362
P++ HF+G D++L N S+ + C A + ++G + Q N L+ +D+E+
Sbjct: 382 IPKIVFHFKGGDLELPAENYMIGDSNLGVACLAMGASSGMSIFGNVQQQNILVNHDLEKE 441
Query: 363 MVSFKPSRC 371
+SF P+ C
Sbjct: 442 TISFVPTSC 450
>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
Length = 460
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 133/371 (35%), Positives = 185/371 (49%), Gaps = 42/371 (11%)
Query: 25 QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
+A + + + +LM ++IGTP + +DTGSD TWTQC+PC DC+ Q P++DP +S
Sbjct: 105 EAPVYAGNGEFLMKMAIGTPSLSFSAILDTGSDLTWTQCKPC--TDCYPQPTPIYDPSQS 162
Query: 85 STYNSISCSSSQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
STY+ + CSSS C A+ +CS +C Y + YG S + G L+ E+ T S S
Sbjct: 163 STYSKVPCSSSMCQALPMYSCSGANCEYLYSYGD---QSSTQGILSYESFTLTSQS---- 215
Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL------PD 197
+P++ FGCG +N S G GP SLISQ+G S+ KFSYCL P
Sbjct: 216 -LPHIAFGCGQENEGGGFSQGGGLVGFGRGP--LSLISQLGQSLGNKFSYCLVSITDSPS 272
Query: 198 QGSSK-INFGGIVAGAGVVSTPLIIRD----HYYLSLEAISVGNQRLEFVSSS------- 245
+ S I + V STPL+ YYLSLE ISVG Q L+ +
Sbjct: 273 KTSPLFIGKTASLNAKTVSSTPLVQSRSRPTFYYLSLEGISVGGQLLDIADGTFDLQLDG 332
Query: 246 TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKG--VGAEPGFSDVLCYNI---SS 300
TG + +D+G T L + +K + + I V G +G + LC+ SS
Sbjct: 333 TGGVIIDSGTTVTYLEQSGYDVVKKAVISSINLPQVDGSNIGLD------LCFEPQSGSS 386
Query: 301 QPKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIE 360
FP +T HF GAD L N S I C A N ++G I Q N+ I YD E
Sbjct: 387 TSHFPTITFHFEGADFNLPKENYIYTDSSGIACLAMLPSNGMSIFGNIQQQNYQILYDNE 446
Query: 361 QAMVSFKPSRC 371
+ ++SF P+ C
Sbjct: 447 RNVLSFAPTVC 457
>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 436
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 131/360 (36%), Positives = 186/360 (51%), Gaps = 46/360 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
+LM L+IGTP +DTGSD WTQC+PC DCF Q P+FDPKKSS+++ + CSS
Sbjct: 97 FLMKLAIGTPAETYSAIMDTGSDLIWTQCKPCK--DCFDQPTPIFDPKKSSSFSKLPCSS 154
Query: 95 SQCAVV-TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
CA + S+CS+G C Y LY G Y+S + G LATET F S + + FGCG
Sbjct: 155 DLCAALPISSCSDG-CEY--LYSYGDYSS-TQGVLATETFAFGDAS-----VSKIGFGCG 205
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP----DQGSSKINFGGIV 209
N S S+ G++GLG G SLISQ+G KFSYCL +G S + G
Sbjct: 206 EDNDGSGF--SQGAGLVGLGRGPLSLISQLGEP---KFSYCLTSMDDSKGISSLLVGSEA 260
Query: 210 AGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSS-------TGNIFVDTGVLRT 258
++TPLI YYLSLE ISVG+ L S+ +G + +D+G T
Sbjct: 261 TMKNAITTPLIQNPSQPSFYYLSLEGISVGDTLLPIEKSTFSIQNDGSGGLIIDSGTTIT 320
Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNI---SSQPKFPEVTIHFRGA 314
L + LK + +K E G + + LC+ + +S P++ HF GA
Sbjct: 321 YLEDSAFAALKKEFISQLKLDV-----DESGSTGLDLCFTLPPDASTVDVPQLVFHFEGA 375
Query: 315 DVKLSPSNLFRNISDE---IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
D+KL N I+D ++C + ++G Q N ++ +D+E+ +SF P++C
Sbjct: 376 DLKLPAENYI--IADSGLGVICLTMGSSSGMSIFGNFQQQNIVVLHDLEKETISFAPAQC 433
>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
Full=Nepenthesin-I; Flags: Precursor
gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
Length = 437
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 127/369 (34%), Positives = 191/369 (51%), Gaps = 41/369 (11%)
Query: 25 QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
+ + + D YLM+LSIGTP +DTGSD WTQC+PC + CF Q P+F+P+ S
Sbjct: 85 ETSVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQ--CFNQSTPIFNPQGS 142
Query: 85 STYNSISCSSSQCAVVTS-NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
S+++++ CSS C ++S CS C Y++ YG G S + G++ TETLTF S V
Sbjct: 143 SSFSTLPCSSQLCQALSSPTCSNNFCQYTYGYGDG---SETQGSMGTETLTFGS-----V 194
Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS--- 200
+PN+ FGCG N G++G+G G SL SQ+ + KFSYC+ GS
Sbjct: 195 SIPNITFGCGENNQG--FGQGNGAGLVGMGRGPLSLPSQLDVT---KFSYCMTPIGSSTP 249
Query: 201 SKINFGGIV--AGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS--------T 246
S + G + AG +T LI I YY++L +SVG+ RL S+ T
Sbjct: 250 SNLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGT 309
Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP---K 303
G I +D+G T + +++ + I V G+ GF LC+ S P +
Sbjct: 310 GGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVN--GSSSGFD--LCFQTPSDPSNLQ 365
Query: 304 FPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQA 362
P +HF G D++L N F + S+ ++C A + + ++G I Q N L+ YD +
Sbjct: 366 IPTFVMHFDGGDLELPSENYFISPSNGLICLAMGSSSQGMSIFGNIQQQNMLVVYDTGNS 425
Query: 363 MVSFKPSRC 371
+VSF ++C
Sbjct: 426 VVSFASAQC 434
>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 179 bits (453), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 131/369 (35%), Positives = 196/369 (53%), Gaps = 38/369 (10%)
Query: 25 QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
+A I + + YL+ L+IGTPPV +DTGSD WTQC+PC C+KQ P+FDPKKS
Sbjct: 98 EAPIHAGNGEYLIELAIGTPPVSYPAVLDTGSDLIWTQCKPCTR--CYKQPTPIFDPKKS 155
Query: 85 STYNSISCSSSQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
S+++ +SC SS C A+ +S CS+G C Y + YG S + G LATET TF + V
Sbjct: 156 SSFSKVSCGSSLCSALPSSTCSDG-CEYVYSYGD---YSMTQGVLATETFTFGKSKN-KV 210
Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKI 203
+ N+ FGCG N + +G++GLG G SL+SQ+ +FSYCL +K
Sbjct: 211 SVHNIGFGCGEDNEGD--GFEQASGLVGLGRGPLSLVSQLKEQ---RFSYCLTPIDDTKE 265
Query: 204 ------NFGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSS-------T 246
+ G + VV+TPL+ YYLSLEAISVG+ RL S+
Sbjct: 266 SVLLLGSLGKVKDAKEVVTTPLLKNPLQPSFYYLSLEAISVGDTRLSIEKSTFEVGDDGN 325
Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI---SSQPK 303
G + +D+G T + + + LK + K K + G LC+++ S+Q +
Sbjct: 326 GGVIIDSGTTITYVQQKAYEALKKEFISQTKLALDK--TSSTGLD--LCFSLPSGSTQVE 381
Query: 304 FPEVTIHFRGADVKLSPSNLFRNISD-EIMCSAFRGGNANIVYGRIMQINFLIGYDIEQA 362
P++ HF+G D++L N S+ + C A + ++G + Q N L+ +D+E+
Sbjct: 382 IPKLVFHFKGGDLELPAENYMIGDSNLGVACLAMGASSGMSIFGNVQQQNILVNHDLEKE 441
Query: 363 MVSFKPSRC 371
+SF P+ C
Sbjct: 442 TISFVPTSC 450
>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 454
Score = 178 bits (451), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 131/368 (35%), Positives = 186/368 (50%), Gaps = 52/368 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
+LM +SIGTP + VDTGSD WTQC+PC +DCFKQ P+FDP SSTY ++ CSS
Sbjct: 105 FLMDVSIGTPALAYSAIVDTGSDLVWTQCKPC--VDCFKQSTPVFDPSSSSTYATVPCSS 162
Query: 95 SQCA-VVTSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
+ C+ + TS C S C Y++ YG +S + G LATET T + ++P V+FGC
Sbjct: 163 ASCSDLPTSKCTSASKCGYTYTYGD---SSSTQGVLATETFTLAKS-----KLPGVVFGC 214
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSKINFGGIV 209
G N S+ G++GLG G SL+SQ+G KFSYCL D +S + G +
Sbjct: 215 GDTNEGD--GFSQGAGLVGLGRGPLSLVSQLGLD---KFSYCLTSLDDTNNSPLLLGSLA 269
Query: 210 A-------GAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSS-------TGNIFV 251
+ V +TPLI YY+SL+AI+VG+ R+ SS+ TG + V
Sbjct: 270 GISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIV 329
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIK--AQPVKGVGAEPGFSDVLCYNISS----QPKFP 305
D+G T L ++ + LK + + A GVG + LC+ + Q + P
Sbjct: 330 DSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLD------LCFRAPAKGVDQVEVP 383
Query: 306 EVTIHFR-GADVKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAM 363
+ HF GAD+ L N + + +C G + G Q NF YD+
Sbjct: 384 RLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMGSRGLSIIGNFQQQNFQFVYDVGHDT 443
Query: 364 VSFKPSRC 371
+SF P +C
Sbjct: 444 LSFAPVQC 451
>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
Length = 444
Score = 177 bits (450), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 131/368 (35%), Positives = 186/368 (50%), Gaps = 52/368 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
+LM +SIGTP + VDTGSD WTQC+PC +DCFKQ P+FDP SSTY ++ CSS
Sbjct: 95 FLMDVSIGTPALAYSAIVDTGSDLVWTQCKPC--VDCFKQSTPVFDPSSSSTYATVPCSS 152
Query: 95 SQCA-VVTSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
+ C+ + TS C S C Y++ YG +S + G LATET T + ++P V+FGC
Sbjct: 153 ASCSDLPTSKCTSASKCGYTYTYGD---SSSTQGVLATETFTLAKS-----KLPGVVFGC 204
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSKINFGGIV 209
G N S+ G++GLG G SL+SQ+G KFSYCL D +S + G +
Sbjct: 205 GDTNEGD--GFSQGAGLVGLGRGPLSLVSQLGLD---KFSYCLTSLDDTNNSPLLLGSLA 259
Query: 210 A-------GAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSS-------TGNIFV 251
+ V +TPLI YY+SL+AI+VG+ R+ SS+ TG + V
Sbjct: 260 GISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIV 319
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIK--AQPVKGVGAEPGFSDVLCYNISS----QPKFP 305
D+G T L ++ + LK + + A GVG + LC+ + Q + P
Sbjct: 320 DSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLD------LCFRAPAKGVDQVEVP 373
Query: 306 EVTIHFR-GADVKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAM 363
+ HF GAD+ L N + + +C G + G Q NF YD+
Sbjct: 374 RLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMGSRGLSIIGNFQQQNFQFVYDVGHDT 433
Query: 364 VSFKPSRC 371
+SF P +C
Sbjct: 434 LSFAPVQC 441
>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
Length = 423
Score = 177 bits (449), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 131/368 (35%), Positives = 186/368 (50%), Gaps = 52/368 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
+LM +SIGTP + VDTGSD WTQC+PC +DCFKQ P+FDP SSTY ++ CSS
Sbjct: 74 FLMDVSIGTPALAYSAIVDTGSDLVWTQCKPC--VDCFKQSTPVFDPSSSSTYATVPCSS 131
Query: 95 SQCA-VVTSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
+ C+ + TS C S C Y++ YG +S + G LATET T + ++P V+FGC
Sbjct: 132 ASCSDLPTSKCTSASKCGYTYTYGD---SSSTQGVLATETFTLAKS-----KLPGVVFGC 183
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSKINFGGIV 209
G N S+ G++GLG G SL+SQ+G KFSYCL D +S + G +
Sbjct: 184 GDTNEGD--GFSQGAGLVGLGRGPLSLVSQLGLD---KFSYCLTSLDDTNNSPLLLGSLA 238
Query: 210 A-------GAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSS-------TGNIFV 251
+ V +TPLI YY+SL+AI+VG+ R+ SS+ TG + V
Sbjct: 239 GISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIV 298
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIK--AQPVKGVGAEPGFSDVLCYNISS----QPKFP 305
D+G T L ++ + LK + + A GVG + LC+ + Q + P
Sbjct: 299 DSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLD------LCFRAPAKGVDQVEVP 352
Query: 306 EVTIHFR-GADVKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAM 363
+ HF GAD+ L N + + +C G + G Q NF YD+
Sbjct: 353 RLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMGSRGLSIIGNFQQQNFQFVYDVGHDT 412
Query: 364 VSFKPSRC 371
+SF P +C
Sbjct: 413 LSFAPVQC 420
>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
Length = 383
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 131/363 (36%), Positives = 189/363 (52%), Gaps = 50/363 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YL+ ++IGTP + + +DTGSD WT+C PC DC ++DP SSTY+ + C S
Sbjct: 42 YLIQMAIGTPALSLSAIMDTGSDLVWTKCNPC--TDC--STSSIYDPSSSSTYSKVLCQS 97
Query: 95 SQC---AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
S C ++ + N ++GDC Y + YG S +SG L+ ET + +S S +PN+ FG
Sbjct: 98 SLCQPPSIFSCN-NDGDCEYVYPYGD---RSSTSGILSDETFSISSQS-----LPNITFG 148
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK------INF 205
CGH N K G++G G G+ SL+SQ+G S+ KFSYCL + S I
Sbjct: 149 CGHDNQGF----DKVGGLVGFGRGSLSLVSQLGPSMGNKFSYCLVSRTDSSKTSPLFIGN 204
Query: 206 GGIVAGAGVVSTPLIIR---DHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGV 255
+ V STPL+ +HYYLSLE ISVG Q L + S +G + +D+G
Sbjct: 205 TASLEATTVGSTPLVQSSSTNHYYLSLEGISVGGQSLAIPTGTFDIQSDGSGGLIIDSGT 264
Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI--SSQPKFPEVTIHFRG 313
T L + +K M + I G LC+N SS P FP +T HF+G
Sbjct: 265 TLTFLQQTAYDAVKEAMVSSINLPQADG-------QLDLCFNQQGSSNPGFPSMTFHFKG 317
Query: 314 ADVKLSPSN-LFRNISDEIMCSAFRGGNANI----VYGRIMQINFLIGYDIEQAMVSFKP 368
AD + N LF + + +I+C A N+N+ ++G + Q N+ I YD E ++SF P
Sbjct: 318 ADYDVPKENYLFPDSTSDIVCLAMMPTNSNLGNMAIFGNVQQQNYQILYDNENNVLSFAP 377
Query: 369 SRC 371
+ C
Sbjct: 378 TAC 380
>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
Length = 524
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 116/367 (31%), Positives = 179/367 (48%), Gaps = 44/367 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YL+ +S+G+PP + + VD+GSD W QC+PC L+C+ Q PLFDP S+T++ +SC S
Sbjct: 171 YLVRVSVGSPPTEQYLVVDSGSDVMWVQCKPC--LECYVQADPLFDPATSATFSGVSCGS 228
Query: 95 SQCAVV-TSNCSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
+ C ++ TS C +G+ C Y Y G+Y + G LA ETLT T+ + V+
Sbjct: 229 AICRILPTSACGDGELGGCEYEVSYADGSY---TKGALALETLTLGGTA-----VEGVVI 280
Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ-----GSSKINF 205
GCGH+N G++GLG G SL+ Q+G + G FSYCL + G++ +
Sbjct: 281 GCGHRNRGLFVG---AAGLMGLGWGPMSLVGQLGGEVGGAFSYCLASRGGYGSGAADDDA 337
Query: 206 GGIVAG------AGVVSTPLIIRDH----YYLSLEAISVGNQRL-------EFVSSSTGN 248
G +V G G V PL+ YY+ L I VG++RL + G+
Sbjct: 338 GWLVLGRSEAVPEGAVWVPLVRNPRAPSFYYVGLSGIEVGDERLPLQAGLFQLTEDGAGD 397
Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPE 306
+ +DTG T LP E ++ L+ + + G D CY++S + + P
Sbjct: 398 VVMDTGTTVTRLPQEAYAALRDAFVGALAGAVPRAQGVSSSVLDT-CYDLSGYASVRVPT 456
Query: 307 VTIHFRG-ADVKLSPSNLFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMV 364
V+ F G A + L+ N+ + I C AF ++ + + G Q I D +
Sbjct: 457 VSFCFDGDARLILAARNVLLEVDMGIYCLAFAPSSSGLSIMGNTQQAGIQITVDSANGYI 516
Query: 365 SFKPSRC 371
F P+ C
Sbjct: 517 GFGPANC 523
>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
Length = 448
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 126/373 (33%), Positives = 189/373 (50%), Gaps = 51/373 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YLM LSIGTPP+ DTGSD WTQC PC CF Q PL++P S+T+ + C+S
Sbjct: 92 YLMTLSIGTPPLSYPAIADTGSDLIWTQCAPCSGDQCFAQPAPLYNPASSTTFGVLPCNS 151
Query: 95 --SQCAVVTS------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
S CA V + C+ C Y+ YG G +++G +ET TF S + +P
Sbjct: 152 SLSMCAGVLAGKAPPPGCA---CMYNQTYGTG----WTAGVQGSETFTFGSAAADQARVP 204
Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-PDQ---GSSK 202
+ FGC + AS + + G++GLG G+ SL+SQ+G AG+FSYCL P Q +S
Sbjct: 205 GIAFGCSN---ASSSDWNGSAGLVGLGRGSLSLVSQLG---AGRFSYCLTPFQDTNSTST 258
Query: 203 INFGGIVA--GAGVVSTPLI-------IRDHYYLSLEAISVGNQRL-------EFVSSST 246
+ G A G GV STP + + +YYL+L IS+G + L + T
Sbjct: 259 LLLGPSAALNGTGVRSTPFVASPAKAPMSTYYYLNLTGISLGAKALSISPDAFSLKADGT 318
Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI----SSQP 302
G + +D+G T L + +++ + +++ + G + G LCY + S+ P
Sbjct: 319 GGLIIDSGTTITSLVNAAYQQVRAAVQSLVTLPAIDGSDST-GLD--LCYALPTPTSAPP 375
Query: 303 KFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRG--GNANIVYGRIMQINFLIGYDIE 360
P +T+HF GAD+ L P++ + + C A R A +G Q N I YD+
Sbjct: 376 AMPSMTLHFDGADMVL-PADSYMISGSGVWCLAMRNQTDGAMSTFGNYQQQNMHILYDVR 434
Query: 361 QAMVSFKPSRCTN 373
M+SF P++C+
Sbjct: 435 NEMLSFAPAKCST 447
>gi|255566008|ref|XP_002523992.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536719|gb|EEF38360.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 128/375 (34%), Positives = 197/375 (52%), Gaps = 39/375 (10%)
Query: 23 IYQAEIISVDD--IYLMHLSIGTPPVDIFGSVDTGSDCTWTQC---EPCPELDCFKQEPP 77
I AE S+ D +LM +SIG PP ++ +V TGSD W C +PC +C +
Sbjct: 84 ITAAEFPSILDNGDFLMKISIGIPPTELLVNVATGSDLVWIPCLSFKPCTH-NC---DLR 139
Query: 78 LFDPKKSSTYNSISCSSSQCAVV-TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFN 136
FDP +SSTY ++ C S +C + + C DC YS S G+LA +TLT N
Sbjct: 140 FFDPMESSTYKNVPCDSYRCQITNAATCQFSDCFYSC--DPRHQDSCPDGDLAMDTLTLN 197
Query: 137 STSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL- 195
ST+G +PN F CG++ D GI+GLG G+ SL++++ I GKFS+C+
Sbjct: 198 STTGKSFMLPNTGFICGNR----IGGDYPGVGILGLGHGSLSLLNRISHLIDGKFSHCIV 253
Query: 196 --PDQGSSKINFG--GIVAGAGVVSTPLIIRD---HYYLSLEAISVGNQRL-------EF 241
+SK++FG +V+G+ + ST L + Y LS ISVGN+ + ++
Sbjct: 254 PYSSNQTSKLSFGDKAVVSGSAMFSTRLDMTGGPYSYTLSFYGISVGNKSISAGGIGSDY 313
Query: 242 VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ 301
+ G +D+G + T P ++S L+ + I+ +P+ +P LCY S
Sbjct: 314 YMNGLG---MDSGTMFTYFPEYFYSQLEYDVRYAIQQEPLY---PDPTRRLRLCYRYSPD 367
Query: 302 PKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNA--NIVYGRIMQINFLIGYDI 359
P +T+HF G V+LS SN F ++++I+C AF ++ + V+G Q N LIGYD+
Sbjct: 368 FSPPTITMHFEGGSVELSSSNSFIRMTEDIVCLAFATSSSEQDAVFGYWQQTNLLIGYDL 427
Query: 360 EQAMVSFKPSRCTNY 374
+ +SF + CT Y
Sbjct: 428 DAGFLSFLKTDCTKY 442
>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 175 bits (443), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 136/367 (37%), Positives = 201/367 (54%), Gaps = 43/367 (11%)
Query: 28 IISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTY 87
++S + +LM+L+IGTPP +DTGSD WTQC+PC + CF Q P+FDPKKSS++
Sbjct: 93 VLSGNGEFLMNLAIGTPPETYSAIMDTGSDLIWTQCKPCTQ--CFDQPSPIFDPKKSSSF 150
Query: 88 NSISCSSSQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
+ +SCSS C A+ S+CS+ C Y LY G Y+S + G +ATET TF V +P
Sbjct: 151 SKLSCSSQLCKALPQSSCSD-SCEY--LYTYGDYSS-TQGTMATETFTFGK-----VSIP 201
Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSKI 203
NV FGCG N ++ +G++GLG G SL+SQ+ + KFSYCL D +S +
Sbjct: 202 NVGFGCGEDNEGD--GFTQGSGLVGLGRGPLSLVSQLKEA---KFSYCLTSIDDTKTSTL 256
Query: 204 NFGGIVA----GAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSS-------TGN 248
G + + A + +TPLI YYLSLE ISVG RL S+ TG
Sbjct: 257 LMGSLASVNGTSAAIRTTPLIQNPLQPSFYYLSLEGISVGGTRLPIKESTFQLQDDGTGG 316
Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI---SSQPKFP 305
+ +D+G T L +K ++ + PV GA G LCYN+ +S+ + P
Sbjct: 317 LIIDSGTTITYLEESAFDLVKKEFTSQM-GLPVDNSGAT-GLE--LCYNLPSDTSELEVP 372
Query: 306 EVTIHFRGADVKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMV 364
++ +HF GAD++L N + + S ++C A ++G + Q N + +D+E+ +
Sbjct: 373 KLVLHFTGADLELPGENYMIADSSMGVICLAMGSSGGMSIFGNVQQQNMFVSHDLEKETL 432
Query: 365 SFKPSRC 371
SF P+ C
Sbjct: 433 SFLPTNC 439
>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
Length = 461
Score = 174 bits (440), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 126/382 (32%), Positives = 182/382 (47%), Gaps = 56/382 (14%)
Query: 31 VDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
V + YL+HL++GTPP + ++DTGSD WTQC PC DCF Q PL DP SSTY ++
Sbjct: 88 VTNEYLVHLAVGTPPRPVALTLDTGSDLVWTQCAPC--RDCFHQGLPLLDPAASSTYAAL 145
Query: 91 SCSSSQC-AVVTSNCSEG----------DCSYSFLYGRGAYASFSSGNLATETLTFNSTS 139
C + +C A+ ++C G C+Y + YG S + G +AT+ TF +
Sbjct: 146 PCGAPRCRALPFTSCGGGGRSSWGNGNRSCAYIYHYGD---KSVTVGEIATDRFTFGGDN 202
Query: 140 G-----LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYC 194
G LP + FGCGH N S +TGI G G G SL SQ+ + FSYC
Sbjct: 203 GDGDSRLPTR--RLTFGCGHFNKG--VFQSNETGIAGFGRGRWSLPSQLNVT---TFSYC 255
Query: 195 LP---DQGSSKINFGGIVAGA-----------GVVSTPLIIRDH----YYLSLEAISVGN 236
+ SS + GG A A V +TPL+ Y+LSL+ ISVG
Sbjct: 256 FTSMFESKSSLVTLGGAPAAALLYSHAAHISGEVRTTPLLKNPSQPSLYFLSLKGISVGK 315
Query: 237 QRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY 296
RL + + +D+G T LP + +K+ + + P G G + LC+
Sbjct: 316 TRLAVPEAKLRSTIIDSGASITTLPEAVYEAVKAEFAAQVGLPP---TGVVEGSALDLCF 372
Query: 297 NIS-----SQPKFPEVTIHFRGADVKLSPSN-LFRNISDEIMCSAFRGGNAN-IVYGRIM 349
+ +P P +T+H GAD +L N +F +++ +MC + V G
Sbjct: 373 ALPVTALWRRPPVPSLTLHLDGADWELPRGNYVFEDLAARVMCVVLDAAPGDQTVIGNFQ 432
Query: 350 QINFLIGYDIEQAMVSFKPSRC 371
Q N + YD+E +SF P+RC
Sbjct: 433 QQNTHVVYDLENDWLSFAPARC 454
>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
Length = 436
Score = 174 bits (440), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 124/357 (34%), Positives = 181/357 (50%), Gaps = 40/357 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
+LM+L+IGTP +DTGSD WTQC+PC CF Q P+FDP+KSS+++ + CSS
Sbjct: 97 FLMNLAIGTPAETYSAIMDTGSDLIWTQCKPCKV--CFDQPTPIFDPEKSSSFSKLPCSS 154
Query: 95 SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
C A+ S+CS+G C Y + YG S + G LATET TF S + + FGCG
Sbjct: 155 DLCVALPISSCSDG-CEYRYSYGD---HSSTQGVLATETFTFGDAS-----VSKIGFGCG 205
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP----DQGSSKINFGGIV 209
N + S+ G++GLG G SLISQ+G KFSYCL +G S + G
Sbjct: 206 EDNRGR--AYSQGAGLVGLGRGPLSLISQLGVP---KFSYCLTSIDDSKGISTLLVGSEA 260
Query: 210 AGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSS-------TGNIFVDTGVLRT 258
+ TPLI YYLSLE ISVG+ L S+ +G + +D+G T
Sbjct: 261 TVKSAIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTIT 320
Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI---SSQPKFPEVTIHFRGAD 315
L + LK + +K V A LC+ + S + P++ HF G D
Sbjct: 321 YLKDNAFAALKKEFISQMKLD----VDASGSTELELCFTLPPDGSPVEVPQLVFHFEGVD 376
Query: 316 VKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+KL N + + + ++C + ++G Q N ++ +D+E+ +SF P++C
Sbjct: 377 LKLPKENYIIEDSALRVICLTMGSSSGMSIFGNFQQQNIVVLHDLEKETISFAPAQC 433
>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
Length = 436
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 124/357 (34%), Positives = 180/357 (50%), Gaps = 40/357 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
+LM+L+IGTP +DTGSD WTQC+PC CF Q P+FDP+KSS+++ + CSS
Sbjct: 97 FLMNLAIGTPAETYSAIMDTGSDLIWTQCKPCKV--CFDQPTPIFDPEKSSSFSKLPCSS 154
Query: 95 SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
C A+ S+CS+G C Y + YG S + G LATET TF S + + FGCG
Sbjct: 155 DLCVALPISSCSDG-CEYRYSYGD---HSSTQGVLATETFTFGDAS-----VSKIGFGCG 205
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP----DQGSSKINFGGIV 209
N + S+ G++GLG G SLISQ+G KFSYCL +G S + G
Sbjct: 206 EDNRGR--AYSQGAGLVGLGRGPLSLISQLGVP---KFSYCLTSIDDSKGISTLLVGSEA 260
Query: 210 AGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSS-------TGNIFVDTGVLRT 258
+ TPLI YYLSLE ISVG+ L S+ +G + +D+G T
Sbjct: 261 TVKSAIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTIT 320
Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI---SSQPKFPEVTIHFRGAD 315
L + LK + +K V A LC+ + S P++ HF G D
Sbjct: 321 YLKDSAFAALKKEFISQMKLD----VDASGSTELELCFTLPPDGSPVDVPQLVFHFEGVD 376
Query: 316 VKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+KL N + + + ++C + ++G Q N ++ +D+E+ +SF P++C
Sbjct: 377 LKLPKENYIIEDSALRVICLTMGSSSGMSIFGNFQQQNIVVLHDLEKETISFAPAQC 433
>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
Length = 447
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 134/381 (35%), Positives = 190/381 (49%), Gaps = 55/381 (14%)
Query: 26 AEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSS 85
A + S YLM L+IGTPPV DTGSD TWTQC+PC CF Q+ P++D SS
Sbjct: 84 ARLRSGQAEYLMELAIGTPPVPFVALADTGSDLTWTQCQPCKL--CFPQDTPIYDTAVSS 141
Query: 86 TYNSISCSSSQCAVVTS--NC--SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGL 141
+++ + C+S+ C + S NC S C Y + YG GAY S+G L TETLTF G
Sbjct: 142 SFSPVPCASATCLPIWSSRNCTASSSPCRYRYAYGDGAY---SAGVLGTETLTFPGAPG- 197
Query: 142 PVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD---- 197
V + + FGCG N TG +GLG G+ SL++Q+G GKFSYCL D
Sbjct: 198 -VSVGGIAFGCGVDNGG---LSYNSTGTVGLGRGSLSLVAQLGV---GKFSYCLTDFFNT 250
Query: 198 QGSSKINFGGIV------AGAGVVSTPLI----IRDHYYLSLEAISVGNQRL-------E 240
S + FG + GA V STPL+ + YY+SLE IS+G+ RL +
Sbjct: 251 SLGSPVLFGALAELAAPSTGAAVQSTPLVQSPYVPTWYYVSLEGISLGDARLPIPNGTFD 310
Query: 241 FVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNM--IKAQPVKGVGAEPGFSDVLCYNI 298
+G + VD+G T L S + V+ ++ + QPV + D C+
Sbjct: 311 LRDDGSGGMIVDSGTTFTFL---VESAFRVVVDHVAGVLRQPVVNASSL----DSPCFPA 363
Query: 299 SSQ----PKFPEVTIHFR-GADVKLSPSNLFR-NISDEIMCSAFRGG-NANI-VYGRIMQ 350
++ P P++ +HF GAD++L N N + C G +A++ + G Q
Sbjct: 364 ATGEQQLPAMPDMVLHFAGGADMRLHRDNYMSFNQEESSFCLNIAGSPSADVSILGNFQQ 423
Query: 351 INFLIGYDIEQAMVSFKPSRC 371
N + +DI +SF P+ C
Sbjct: 424 QNIQMLFDITVGQLSFMPTDC 444
>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
Length = 473
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 119/364 (32%), Positives = 173/364 (47%), Gaps = 47/364 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + + +G+PP D + VD+GSD W QC PC + C+ Q PLFDP SS+++ +SC S
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQ--CYAQTDPLFDPAASSSFSGVSCGS 187
Query: 95 SQCAVVT-----SNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
+ C ++ G C YS YG G+Y + G LA ETLT T+ + V
Sbjct: 188 AICRTLSGTGCGGGGDAGKCDYSVTYGDGSY---TKGELALETLTLGGTA-----VQGVA 239
Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIV 209
GCGH+N + G++GLG G SLI Q+G + G FSYCL +G+ G +V
Sbjct: 240 IGCGHRNSGLFVGAA---GLLGLGWGAMSLIGQLGGAAGGVFSYCLASRGAGGA--GSLV 294
Query: 210 AG------AGVVSTPLIIRDH----YYLSLEAISVGNQRL-------EFVSSSTGNIFVD 252
G G V PL+ + YY+ L I VG +RL + G + +D
Sbjct: 295 LGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDGLFQLTEDGAGGVVMD 354
Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTI 309
TG T LP E ++ L+ + A P P S + CY++S + + P V+
Sbjct: 355 TGTAVTRLPREAYAALRGAFDGAMGALP-----RSPAVSLLDTCYDLSGYASVRVPTVSF 409
Query: 310 HF-RGADVKLSPSNLFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFK 367
+F +GA + L NL + + C AF ++ I + G I Q I D V F
Sbjct: 410 YFDQGAVLTLPARNLLVEVGGAVFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFG 469
Query: 368 PSRC 371
P+ C
Sbjct: 470 PNTC 473
>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 120/352 (34%), Positives = 181/352 (51%), Gaps = 30/352 (8%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP DTGSD TW QC PC + C+KQ+ PLFDP KSSTY ++SC+
Sbjct: 163 YVVTVGLGTPASKYTVVFDTGSDTTWVQCRPC-VVKCYKQKEPLFDPAKSSTYANVSCTD 221
Query: 95 SQCAVVTSN-CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
S CA + +N C+ G C Y+ YG G+Y + G A +TLT + + FGCG
Sbjct: 222 SACADLDTNGCTGGHCLYAVQYGDGSY---TVGFFAQDTLTIAHDA-----IKGFRFGCG 273
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKINFGGIVAG 211
KN K G++GLG G +SL Q G F+YCLP G+ ++FG AG
Sbjct: 274 EKNNG---LFGKTAGLMGLGRGKTSLTVQAYNKYGGAFAYCLPALTTGTGYLDFGPGSAG 330
Query: 212 AGVVSTPLII---RDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEYHS 266
TP++ + YY+ + I VG Q++ S ST VD+G + T LP ++
Sbjct: 331 NNARLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVFSTAGTLVDSGTVITRLPATAYT 390
Query: 267 NLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR-GADVKLSPSN 322
L S ++ A +G PG+S + CY+ + S + P V++ F+ GA + + S
Sbjct: 391 ALSSAFDKVMLA---RGYKKAPGYSILDTCYDFTGLSDVELPTVSLVFQGGACLDVDVSG 447
Query: 323 LFRNISDEIMCSAF--RGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ IS+ +C AF G + ++ + G Q + + YD+ + V F P C
Sbjct: 448 IVYAISEAQVCLAFASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499
>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
Length = 408
Score = 172 bits (435), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 125/371 (33%), Positives = 185/371 (49%), Gaps = 38/371 (10%)
Query: 25 QAEIISVD--DIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPK 82
QA +++ D +L++ S+G PPV +DTGSD W QC PC DCF+Q P+FDP
Sbjct: 47 QANMVADDRGQAFLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCA--DCFRQSTPIFDPS 104
Query: 83 KSSTYNSISCSSSQC--AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG 140
KSSTY +S S C + C Y+ Y G S SSGNLATE + F ++
Sbjct: 105 KSSTYVDLSYDSPICPNSPQKKYNHLNQCIYNASYADG---STSSGNLATEDIVFETSDQ 161
Query: 141 LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS 200
V + +V+FGCGH N D +Q+GI+GL G+ S++S++G+ +FSYC+ D
Sbjct: 162 GTVTVSSVVFGCGHSNRGR--FDGQQSGILGLSAGDQSIVSRLGS----RFSYCIGDLFD 215
Query: 201 SKINFGGIVAGAGVV----STPL-IIRDHYYLSLEAISVGNQRLEF-------VSSSTGN 248
+V G GV STP YY++LE ISVG RL+ S G
Sbjct: 216 PHYTHNQLVLGDGVKMEGSSTPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGG 275
Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAE-PGFSDVLCYNISSQPK---F 304
+ +D+G T L + L + + +++ + + PG+ LCY F
Sbjct: 276 VVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGW---LCYKGRVNEDLRGF 332
Query: 305 PEVTIHF-RGADVKLSPSNLFRNISDEIMCSAFRGGN-ANI--VYGRIMQINFLIGYDIE 360
PE+ HF GAD+ L ++LF + ++ C A N NI V G + Q ++ + YD+
Sbjct: 333 PELAFHFAEGADLVLDANSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLI 392
Query: 361 QAMVSFKPSRC 371
V F+ + C
Sbjct: 393 GKRVYFQRTDC 403
>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 172 bits (435), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 120/366 (32%), Positives = 178/366 (48%), Gaps = 46/366 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
+LM +SIGTP V +DTGSD WTQC+PC E CF Q P+FDP SSTY ++ CSS
Sbjct: 102 FLMDMSIGTPAVAYAAIIDTGSDLVWTQCKPCVE--CFNQSTPVFDPSSSSTYAALPCSS 159
Query: 95 SQCA-VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
+ C+ + +S C+ C Y++ YG +S + G LA ET T T ++P+V FGCG
Sbjct: 160 TLCSDLPSSKCTSAKCGYTYTYGD---SSSTQGVLAAETFTLAKT-----KLPDVAFGCG 211
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSKINFGGIVA 210
N ++ G++GLG G SL+SQ+G + KFSYCL D S + G +
Sbjct: 212 DTNEGD--GFTQGAGLVGLGRGPLSLVSQLGLN---KFSYCLTSLDDTSKSPLLLGSLAT 266
Query: 211 -------GAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSS-------TGNIFVD 252
+ V +TPLI YY++L+ ++VG+ + SS+ TG + VD
Sbjct: 267 ISESAAAASSVQTTPLIRNPSQPSFYYVNLKGLTVGSTHITLPSSAFAVQDDGTGGVIVD 326
Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS----QPKFPEVT 308
+G T L L+ + LK + +K G G C+ + Q + P++
Sbjct: 327 SGTSITYLELQGYRALKKAFAAQMKLPAADG----SGIGLDTCFEAPASGVDQVEVPKLV 382
Query: 309 IHFRGADVKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFK 367
H GAD+ L N + + +C G + G Q N YD+ + +SF
Sbjct: 383 FHLDGADLDLPAENYMVLDSGSGALCLTVMGSRGLSIIGNFQQQNIQFVYDVGENTLSFA 442
Query: 368 PSRCTN 373
P +C
Sbjct: 443 PVQCAK 448
>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
Length = 408
Score = 172 bits (435), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 125/371 (33%), Positives = 185/371 (49%), Gaps = 38/371 (10%)
Query: 25 QAEIISVD--DIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPK 82
QA +++ D +L++ S+G PPV +DTGSD W QC PC DCF+Q P+FDP
Sbjct: 47 QANMVADDRGQAFLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCA--DCFRQSTPIFDPS 104
Query: 83 KSSTYNSISCSSSQC--AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG 140
KSSTY +S S C + C Y+ Y G S SSGNLATE + F ++
Sbjct: 105 KSSTYVDLSYDSPICPNSPQKKYNHLNQCIYNASYADG---STSSGNLATEDIVFETSDQ 161
Query: 141 LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS 200
V + +V+FGCGH N D +Q+GI+GL G+ S++S++G+ +FSYC+ D
Sbjct: 162 GTVTVSSVVFGCGHSNRGR--FDGQQSGILGLSAGDQSIVSRLGS----RFSYCIGDLFD 215
Query: 201 SKINFGGIVAGAGVV----STPL-IIRDHYYLSLEAISVGNQRLEF-------VSSSTGN 248
+V G GV STP YY++LE ISVG RL+ S G
Sbjct: 216 PHYTHNQLVLGDGVKMEGSSTPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGG 275
Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAE-PGFSDVLCYNISSQPK---F 304
+ +D+G T L + L + + +++ + + PG+ LCY F
Sbjct: 276 VVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGW---LCYKGRVNEDLRGF 332
Query: 305 PEVTIHF-RGADVKLSPSNLFRNISDEIMCSAFRGGN-ANI--VYGRIMQINFLIGYDIE 360
PE+ HF GAD+ L ++LF + ++ C A N NI V G + Q ++ + YD+
Sbjct: 333 PELAFHFAEGADLVLDANSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLI 392
Query: 361 QAMVSFKPSRC 371
V F+ + C
Sbjct: 393 GKRVYFQRTDC 403
>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
Length = 473
Score = 172 bits (435), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 118/364 (32%), Positives = 173/364 (47%), Gaps = 47/364 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + + +G+PP D + VD+GSD W QC PC + C+ Q PLFDP SS+++ +SC S
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQ--CYAQTDPLFDPAASSSFSGVSCGS 187
Query: 95 SQCAVVT-----SNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
+ C ++ G C YS YG G+Y + G LA ETLT T+ + V
Sbjct: 188 AICRTLSGTGCGGGGDAGKCDYSVTYGDGSY---TKGELALETLTLGGTA-----VQGVA 239
Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIV 209
GCGH+N + G++GLG G SL+ Q+G + G FSYCL +G+ G +V
Sbjct: 240 IGCGHRNSGLFVGAA---GLLGLGWGAMSLVGQLGGAAGGVFSYCLASRGAG--GAGSLV 294
Query: 210 AG------AGVVSTPLIIRDH----YYLSLEAISVGNQRL-------EFVSSSTGNIFVD 252
G G V PL+ + YY+ L I VG +RL + G + +D
Sbjct: 295 LGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMD 354
Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTI 309
TG T LP E ++ L+ + A P P S + CY++S + + P V+
Sbjct: 355 TGTAVTRLPREAYAALRGAFDGAMGALP-----RSPAVSLLDTCYDLSGYASVRVPTVSF 409
Query: 310 HF-RGADVKLSPSNLFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFK 367
+F +GA + L NL + + C AF ++ I + G I Q I D V F
Sbjct: 410 YFDQGAVLTLPARNLLVEVGGAVFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFG 469
Query: 368 PSRC 371
P+ C
Sbjct: 470 PNTC 473
>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 440
Score = 172 bits (435), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 125/371 (33%), Positives = 188/371 (50%), Gaps = 38/371 (10%)
Query: 25 QAEIISVD--DIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPK 82
QA +++ D +L++ S+G PPV +DTGSD W QC PC DCF+Q P+FDP
Sbjct: 79 QANMVADDRGQAFLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCA--DCFRQSTPIFDPS 136
Query: 83 KSSTYNSISCSSSQC--AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG 140
KSSTY +S S C + C Y+ Y G S SSGNLATE + F ++
Sbjct: 137 KSSTYVDLSYDSPICPNSPQKKYNHLNQCIYNASYADG---STSSGNLATEDIVFETSDQ 193
Query: 141 LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS 200
V + +V+FGCGH N D +Q+GI+GL G+ S++S++G+ +FSYC+ D
Sbjct: 194 GTVTVSSVVFGCGHSNRGR--FDGQQSGILGLSAGDQSIVSRLGS----RFSYCIGDLFD 247
Query: 201 SKINFGGIVAGAGVV----STPL-IIRDHYYLSLEAISVGNQRLEF-------VSSSTGN 248
+V G GV STP YY++LE ISVG RL+ S G
Sbjct: 248 PHYTHNQLVLGDGVKMEGSSTPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGG 307
Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAE-PGFSDVLCY--NISSQPK-F 304
+ +D+G T L + L + + +++ + + PG+ LCY ++ + F
Sbjct: 308 VVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGW---LCYKGRVNEDLRGF 364
Query: 305 PEVTIHF-RGADVKLSPSNLFRNISDEIMCSAFRGGN-ANI--VYGRIMQINFLIGYDIE 360
PE+ HF GAD+ L ++LF + ++ C A N NI V G + Q ++ + YD+
Sbjct: 365 PELAFHFAEGADLVLDANSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLI 424
Query: 361 QAMVSFKPSRC 371
V F+ + C
Sbjct: 425 GKRVYFQRTDC 435
>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
Length = 464
Score = 171 bits (434), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 119/356 (33%), Positives = 175/356 (49%), Gaps = 37/356 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + + IG+PP + + VD+GSD W QC+PC L+C+ Q PLFDP S+T++++ C S
Sbjct: 127 YFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPC--LECYAQADPLFDPATSATFSAVPCGS 184
Query: 95 SQCAVV-TSNCSE-GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
+ C + TS C + G C Y YG G+Y + G LA ETLT T+ + V GC
Sbjct: 185 AVCRTLRTSGCGDSGGCDYEVSYGDGSY---TKGALALETLTLGGTA-----VEGVAIGC 236
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVA-G 211
GH+N + G++GLG G SL+ Q+G + G FSYCL +G+ + G A
Sbjct: 237 GHRNRGLFVGAA---GLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGAGSLVLGRSEAVP 293
Query: 212 AGVVSTPLIIRDH----YYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTLL 260
G V PL+ YY+ L I VG++RL + G + +DTG T L
Sbjct: 294 EGAVWVPLVRNPQAPSFYYVGLSGIGVGDERLPLQEDLFQLTEDGAGGVVMDTGTAVTRL 353
Query: 261 PLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFRG-ADV 316
P E ++ L+ + A P PG S + CY++S + + P V+ +F G A +
Sbjct: 354 PQEAYAALRDAFVAAVGALP-----RAPGVSLLDTCYDLSGYTSVRVPTVSFYFDGAATL 408
Query: 317 KLSPSNLFRNISDEIMCSAFRGGNAN-IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
L NL + I C AF ++ + G I Q I D + F P+ C
Sbjct: 409 TLPARNLLLEVDGGIYCLAFAPSSSGPSILGNIQQEGIQITVDSANGYIGFGPTTC 464
>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 171 bits (434), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 120/352 (34%), Positives = 181/352 (51%), Gaps = 30/352 (8%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP DTGSD TW QC PC + C+KQ+ PLFDP KSSTY ++SC+
Sbjct: 163 YVVTVGLGTPASKYTVVFDTGSDTTWVQCRPC-VVKCYKQKGPLFDPAKSSTYANVSCTD 221
Query: 95 SQCAVVTSN-CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
S CA + +N C+ G C Y+ YG G+Y + G A +TLT + + FGCG
Sbjct: 222 SACADLDTNGCTGGHCLYAVQYGDGSY---TVGFFAQDTLTIAHDA-----IKGFRFGCG 273
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKINFGGIVAG 211
KN K G++GLG G +SL Q G F+YCLP G+ ++FG AG
Sbjct: 274 EKNNG---LFGKTAGLMGLGRGKTSLTVQAYNKYGGAFAYCLPALTTGTGYLDFGPGSAG 330
Query: 212 AGVVSTPLII---RDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEYHS 266
TP++ + YY+ + I VG Q++ S ST VD+G + T LP ++
Sbjct: 331 NNARLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVFSTAGTLVDSGTVITRLPATAYT 390
Query: 267 NLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR-GADVKLSPSN 322
L S ++ A +G PG+S + CY+ + S + P V++ F+ GA + + S
Sbjct: 391 ALSSAFDKVMLA---RGYKKAPGYSILDTCYDFTGLSDVELPTVSLVFQGGACLDVDVSG 447
Query: 323 LFRNISDEIMCSAF--RGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ IS+ +C AF G + ++ + G Q + + YD+ + V F P C
Sbjct: 448 IVYAISEAQVCLAFASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499
>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
Length = 437
Score = 171 bits (433), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 124/362 (34%), Positives = 183/362 (50%), Gaps = 41/362 (11%)
Query: 32 DDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSIS 91
D YLM+LSIGTP +DTGSD WTQC+PC + CF Q P+F+P+ SS+++++
Sbjct: 92 DGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQ--CFNQSTPIFNPQGSSSFSTLP 149
Query: 92 CSSSQCAVVTS-NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
CSS C + S CS C Y++ YG G S + G++ TETLTF S V +PN+ F
Sbjct: 150 CSSQLCQALQSPTCSNNSCQYTYGYGDG---SETQGSMGTETLTFGS-----VSIPNITF 201
Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---INFGG 207
GCG N G++G+G G SL SQ+ + KFSYC+ GSS + G
Sbjct: 202 GCGENNQG--FGQGNGAGLVGMGRGPLSLPSQLDVT---KFSYCMTPIGSSNSSTLLLGS 256
Query: 208 IV--AGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSS--------STGNIFVDT 253
+ AG +T LI I YY++L +SVG+ L S TG I +D+
Sbjct: 257 LANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGIIIDS 316
Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI---SSQPKFPEVTIH 310
G T + ++ + + V G+ GF LC+ + S + P +H
Sbjct: 317 GTTLTYFVDNAYQAVRQAFISQMNLSVVN--GSSSGFD--LCFQMPSDQSNLQIPTFVMH 372
Query: 311 FRGADVKLSPSNLFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPS 369
F G D+ L N F + S+ ++C A + + ++G I Q N L+ YD ++VSF +
Sbjct: 373 FDGGDLVLPSENYFISPSNGLICLAMGSSSQGMSIFGNIQQQNLLVVYDTGNSVVSFLSA 432
Query: 370 RC 371
+C
Sbjct: 433 QC 434
>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
Length = 471
Score = 171 bits (433), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 121/365 (33%), Positives = 177/365 (48%), Gaps = 46/365 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + + IG+PP + + VD+GSD W QC+PC L+C+ Q PLFDP S+T++++SC S
Sbjct: 125 YFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPC--LECYAQADPLFDPASSATFSAVSCGS 182
Query: 95 SQCAVV-TSNCSE-GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
+ C + TS C + G C Y YG G+Y + G LA ETLT T+ + V GC
Sbjct: 183 AICRTLRTSGCGDSGGCEYEVSYGDGSY---TKGTLALETLTLGGTA-----VEGVAIGC 234
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKIN----FGGI 208
GH+N G++GLG G SL+ Q+G + G FSYCL +G S G +
Sbjct: 235 GHRNRGLFVG---AAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGGSGSGAADAAGSL 291
Query: 209 VAG------AGVVSTPLIIRDH----YYLSLEAISVGNQRL-------EFVSSSTGNIFV 251
V G G V PL+ YY+ + I VG++RL + G + +
Sbjct: 292 VLGRSEAVPEGAVWVPLVRNPQAPSFYYVGVSGIGVGDERLPLQDGLFQLTEDGGGGVVM 351
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVT 308
DTG T LP E ++ L+ + A P PG S + CY++S + + P V+
Sbjct: 352 DTGTAVTRLPQEAYAALRDAFVGAVGALP-----RAPGVSLLDTCYDLSGYTSVRVPTVS 406
Query: 309 IHFRG-ADVKLSPSNLFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSF 366
+F G A + L NL + I C AF ++ + + G I Q I D + F
Sbjct: 407 FYFDGAATLTLPARNLLLEVDGGIYCLAFAPSSSGLSILGNIQQEGIQITVDSANGYIGF 466
Query: 367 KPSRC 371
P+ C
Sbjct: 467 GPATC 471
>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
Length = 437
Score = 171 bits (433), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 117/358 (32%), Positives = 184/358 (51%), Gaps = 40/358 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YLM+++IGTP + +DTGSD WTQCEPC + CF Q P+F+P+ SS+++++ C S
Sbjct: 96 YLMNVAIGTPASSLSAIMDTGSDLIWTQCEPCTQ--CFSQPTPIFNPQDSSSFSTLPCES 153
Query: 95 SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
C + S DC Y++ YG G S + G +ATET TF ++S +PN+ FGCG
Sbjct: 154 QYCQDLPSESCYNDCQYTYGYGDG---SSTQGYMATETFTFETSS-----VPNIAFGCGE 205
Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSKINFGGIVAG 211
N G+IG+G G SL SQ+G G+FSYC+ S + G +G
Sbjct: 206 DNQG--FGQGNGAGLIGMGWGPLSLPSQLG---VGQFSYCMTSSGSSSPSTLALGSAASG 260
Query: 212 A--GVVSTPLIIRD----HYYLSLEAISVGNQRLEFVSSS-------TGNIFVDTGVLRT 258
G ST LI +YY++L+ I+VG L SS+ TG + +D+G T
Sbjct: 261 VPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLT 320
Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI---SSQPKFPEVTIHFRGAD 315
LP + ++ + ++ I PV + G S C+ + S + PE+++ F G
Sbjct: 321 YLPQDAYNAVAQAFTDQINLSPVD--ESSSGLST--CFQLPSDGSTVQVPEISMQFDGGV 376
Query: 316 VKLSPSNLFRNISDEIMCSAFRGGNAN--IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ L N+ + ++ ++C A + ++G I Q + YD++ VSF P++C
Sbjct: 377 LNLGEENVLISPAEGVICLAMGSSSQQGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 434
>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
Length = 443
Score = 171 bits (433), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 122/367 (33%), Positives = 181/367 (49%), Gaps = 44/367 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YL+ L++GTP + ++DTGSD WTQC PC DCF Q+ P+ DP SSTY ++ C +
Sbjct: 84 YLVRLAVGTPRRPVALTLDTGSDLVWTQCAPC--RDCFDQDLPVLDPAASSTYAALPCGA 141
Query: 95 SQC-AVVTSNC------SEGDCSYSFLYGRGAYASFSSGNLATETLTFNST--SGLPVEM 145
++C A+ ++C + C Y++ YG S + G +AT+ TF + SG +
Sbjct: 142 ARCRALPFTSCGVRTLGNHRSCIYAYHYGD---KSLTVGEIATDRFTFGDSGGSGESLHT 198
Query: 146 PNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSK 202
+ FGCGH N S +TGI G G G SL SQ+ + FSYC + SS
Sbjct: 199 RRLTFGCGHLNKG--VFQSNETGIAGFGRGRWSLPSQLNVT---SFSYCFTSMFESKSSL 253
Query: 203 INFGGIVAG------AGVVSTPLIIRD-----HYYLSLEAISVGNQRLEFVSSSTGNIFV 251
+ GG A +G V T I+++ Y+LSL+ ISVG RL + + +
Sbjct: 254 VTLGGSPAALYSHAHSGEVRTTPILKNPSQPSLYFLSLKGISVGKTRLPVPETKFRSTII 313
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS-----SQPKFPE 306
D+G T LP E + +K+ + + P G E D LC+ + +P P
Sbjct: 314 DSGASITTLPEEVYEAVKAEFAAQVGLPP---SGVEGSALD-LCFALPVTALWRRPAVPS 369
Query: 307 VTIHFRGADVKLSPSN-LFRNISDEIMCSAFRGG-NANIVYGRIMQINFLIGYDIEQAMV 364
+T+H GAD +L SN +F ++ +MC V G Q N + YD+E +
Sbjct: 370 LTLHLEGADWELPRSNYVFEDLGARVMCIVLDAAPGEQTVIGNFQQQNTHVVYDLENDRL 429
Query: 365 SFKPSRC 371
SF P+RC
Sbjct: 430 SFAPARC 436
>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 420
Score = 171 bits (433), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 131/375 (34%), Positives = 183/375 (48%), Gaps = 46/375 (12%)
Query: 27 EIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSST 86
+ SV YLM L+IG PPV DTGSD TWTQC+PC CF Q+ P++DP SST
Sbjct: 63 RLHSVQVEYLMELAIGKPPVPFVALADTGSDLTWTQCQPCKL--CFPQDTPVYDPSASST 120
Query: 87 YNSISCSSSQCAVVTS-NCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
++ + CSS+ C + S NC+ C Y + YG GAY S+G L TETLT +S PV
Sbjct: 121 FSPLPCSSATCLPIWSRNCTPSSLCRYRYAYGDGAY---SAGILGTETLTLGPSSA-PVS 176
Query: 145 MPNVIFGCGHKNLASPTSDS-KQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKI 203
+ V FGCG N DS TG +GLG G SL++Q+G GKFSYCL D +S +
Sbjct: 177 VGGVAFGCGTDN----GGDSLNSTGTVGLGRGTLSLLAQLG---VGKFSYCLTDFFNSAL 229
Query: 204 N-------FGGIVAGAGVV-STPLIIR----DHYYLSLEAISVGNQRL-------EFVSS 244
+ + G V STPL+ Y++SL+ IS+G+ RL +
Sbjct: 230 DSPFLLGTLAELAPGPSTVQSTPLLQSPQNPSRYFVSLQGISLGDVRLPIPNGTFDLRGD 289
Query: 245 STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QP 302
TG + VD+G T+L + ++ ++ PV D C+ + P
Sbjct: 290 GTGGMIVDSGTTFTILAESGFREVVGRVARVLGQPPVNASSL-----DAPCFPAPAGEPP 344
Query: 303 KFPEVTIHFR-GADVKLSPSNLFR-NISDEIMCSAFRGG--NANIVYGRIMQINFLIGYD 358
P++ +HF GAD++L N N D C G + V G Q N + +D
Sbjct: 345 YMPDLVLHFAGGADMRLYRDNYMSYNEEDSSFCLNIAGTTPESTSVLGNFQQQNIQMLFD 404
Query: 359 IEQAMVSFKPSRCTN 373
+SF P+ C+
Sbjct: 405 TTVGQLSFLPTDCSK 419
>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
Full=Nepenthesin-II; Flags: Precursor
gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
Length = 438
Score = 171 bits (432), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 120/363 (33%), Positives = 189/363 (52%), Gaps = 43/363 (11%)
Query: 32 DDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSIS 91
D YLM+++IGTP +DTGSD WTQCEPC + CF Q P+F+P+ SS+++++
Sbjct: 93 DGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQ--CFSQPTPIFNPQDSSSFSTLP 150
Query: 92 CSSSQCAVVTSN-CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
C S C + S C+ +C Y++ YG G S + G +ATET TF ++S +PN+ F
Sbjct: 151 CESQYCQDLPSETCNNNECQYTYGYGDG---STTQGYMATETFTFETSS-----VPNIAF 202
Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---INFGG 207
GCG N G+IG+G G SL SQ+G G+FSYC+ GSS + G
Sbjct: 203 GCGEDNQG--FGQGNGAGLIGMGWGPLSLPSQLG---VGQFSYCMTSYGSSSPSTLALGS 257
Query: 208 IVAGA--GVVSTPLIIRD----HYYLSLEAISVGNQRLEFVSSS-------TGNIFVDTG 254
+G G ST LI +YY++L+ I+VG L SS+ TG + +D+G
Sbjct: 258 AASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSG 317
Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI---SSQPKFPEVTIHF 311
T LP + ++ + ++ I V + G S C+ S + PE+++ F
Sbjct: 318 TTLTYLPQDAYNAVAQAFTDQINLPTVD--ESSSGLS--TCFQQPSDGSTVQVPEISMQF 373
Query: 312 RGADVKLSPSNLFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKP 368
G + L N+ + ++ ++C A G ++ + ++G I Q + YD++ VSF P
Sbjct: 374 DGGVLNLGEQNILISPAEGVICLAM-GSSSQLGISIFGNIQQQETQVLYDLQNLAVSFVP 432
Query: 369 SRC 371
++C
Sbjct: 433 TQC 435
>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
Length = 437
Score = 171 bits (432), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 124/369 (33%), Positives = 186/369 (50%), Gaps = 41/369 (11%)
Query: 25 QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
+ + + D YLM+LSIGTP +DTGSD WTQC+PC + CF Q P+F+P+ S
Sbjct: 85 ETPVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQ--CFNQSTPIFNPQGS 142
Query: 85 STYNSISCSSSQCAVVTS-NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
S+++++ CSS C + S CS C Y++ YG G S + G++ TETLTF S V
Sbjct: 143 SSFSTLPCSSQLCQALQSPTCSNNSCQYTYGYGDG---SETQGSMGTETLTFGS-----V 194
Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG---S 200
+PN+ FGCG N G++G+G G SL SQ+ + KFSYC+ G S
Sbjct: 195 SIPNITFGCGENNQG--FGQGNGAGLVGMGRGPLSLPSQLDVT---KFSYCMTPIGSSTS 249
Query: 201 SKINFGGIV--AGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSS--------ST 246
S + G + AG +T LI I YY++L +SVG+ L S T
Sbjct: 250 STLLLGSLANSVTAGSPNTTLIESSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGT 309
Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI---SSQPK 303
G I +D+G T + ++ + + V G+ GF LC+ + S +
Sbjct: 310 GGIIIDSGTTLTYFADNAYQAVRQAFISQMNLSVVN--GSSSGFD--LCFQMPSDQSNLQ 365
Query: 304 FPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQA 362
P +HF G D+ L N F + S+ ++C A + + ++G I Q N L+ YD +
Sbjct: 366 IPTFVMHFDGGDLVLPSENYFISPSNGLICLAMGSSSQGMSIFGNIQQQNLLVVYDTGNS 425
Query: 363 MVSFKPSRC 371
+VSF ++C
Sbjct: 426 VVSFLFAQC 434
>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
Length = 516
Score = 170 bits (431), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 128/362 (35%), Positives = 181/362 (50%), Gaps = 52/362 (14%)
Query: 41 IGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA-V 99
IGTP + VDTGSD WTQC+PC +DCFKQ P+FDP SSTY ++ CSS+ C+ +
Sbjct: 173 IGTPALAYSAIVDTGSDLVWTQCKPC--VDCFKQSTPVFDPSSSSTYATVPCSSASCSDL 230
Query: 100 VTSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLA 158
TS C S C Y++ YG +S + G LATET T + ++P V+FGCG N
Sbjct: 231 PTSKCTSASKCGYTYTYGD---SSSTQGVLATETFTLAKS-----KLPGVVFGCGDTNEG 282
Query: 159 SPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSKINFGGIVA----- 210
S+ G++GLG G SL+SQ+G KFSYCL D +S + G +
Sbjct: 283 D--GFSQGAGLVGLGRGPLSLVSQLGLD---KFSYCLTSLDDTNNSPLLLGSLAGISEAS 337
Query: 211 --GAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSS-------TGNIFVDTGVLR 257
+ V +TPLI YY+SL+AI+VG+ R+ SS+ TG + VD+G
Sbjct: 338 AAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSI 397
Query: 258 TLLPLEYHSNLKSVMSNMIK--AQPVKGVGAEPGFSDVLCYNISS----QPKFPEVTIHF 311
T L ++ + LK + + A GVG + LC+ + Q + P + HF
Sbjct: 398 TYLEVQGYRALKKAFAAQMALPAADGSGVGLD------LCFRAPAKGVDQVEVPRLVFHF 451
Query: 312 R-GADVKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPS 369
GAD+ L N + + +C G + G Q NF YD+ +SF P
Sbjct: 452 DGGADLDLPAENYMVLDGGSGALCLTVMGSRGLSIIGNFQQQNFQFVYDVGHDTLSFAPV 511
Query: 370 RC 371
+C
Sbjct: 512 QC 513
>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 431
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 132/379 (34%), Positives = 189/379 (49%), Gaps = 49/379 (12%)
Query: 27 EIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSST 86
+ SV YLM L+IGTPPV DTGSD TWTQC+PC CF Q+ P++DP SST
Sbjct: 69 RLHSVQVEYLMELAIGTPPVPFVALADTGSDLTWTQCQPCKL--CFPQDTPVYDPSASST 126
Query: 87 YNSISCSSSQCAVV--TSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNST-SGL 141
++ + CSS+ C V + NCS C Y + Y GAY S+G L TETLT S+ G
Sbjct: 127 FSPVPCSSATCLPVLRSRNCSTPSSLCRYGYSYSDGAY---SAGILGTETLTLGSSVPGQ 183
Query: 142 PVEMPNVIFGCGHKNLASPTSDS-KQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS 200
V + +V FGCG N DS TG +GLG G SL++Q+G GKFSYCL D +
Sbjct: 184 AVSVSDVAFGCGTDN----GGDSLNSTGTVGLGRGTLSLLAQLG---VGKFSYCLTDFFN 236
Query: 201 SKIN-------FGGIVAGAGVV-STPLIIR----DHYYLSLEAISVGNQRL-------EF 241
S ++ + G G V STPL+ Y +SL+ I++G+ RL +
Sbjct: 237 STLDSPFLLGTLAELAPGPGAVQSTPLLQSPLNPSRYVVSLQGITLGDVRLPIPNKTFDL 296
Query: 242 VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS- 300
++STG + VD+G ++LP + ++ ++ PV D C+ +
Sbjct: 297 HANSTGGMVVDSGTTFSILPESGFRVVVDHVAQVLGQPPVNASSL-----DSPCFPAPAG 351
Query: 301 ---QPKFPEVTIHFR-GADVKLSPSNLFR-NISDEIMCSAFRGGNANI-VYGRIMQINFL 354
P P++ +HF GAD++L N N D C G + + G Q N
Sbjct: 352 ERQLPFMPDLVLHFAGGADMRLHRDNYMSYNQEDSSFCLNIVGTTSTWSMLGNFQQQNIQ 411
Query: 355 IGYDIEQAMVSFKPSRCTN 373
+ +D+ +SF P+ C+
Sbjct: 412 MLFDMTVGQLSFLPTDCSK 430
>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
Length = 451
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 114/354 (32%), Positives = 169/354 (47%), Gaps = 49/354 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + + +G+PP D + VD+GSD W QC PC + C+ Q PLFDP SS+++ +SC S
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQ--CYAQTDPLFDPAASSSFSGVSCGS 187
Query: 95 SQCAVVT-----SNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
+ C ++ G C YS YG G+Y + G LA ETLT T+ + V
Sbjct: 188 AICRTLSGTGCGGGGDAGKCDYSVTYGDGSY---TKGELALETLTLGGTA-----VQGVA 239
Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIV 209
GCGH+N + G++GLG G SL+ Q+G + G FSYCL +G+
Sbjct: 240 IGCGHRNSGLFVGAA---GLLGLGWGAMSLVGQLGGAAGGVFSYCLASRGA--------- 287
Query: 210 AGAGVVSTPLIIRDHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTLLPL 262
GAG +++ YY+ L I VG +RL + G + +DTG T LP
Sbjct: 288 GGAGSLASSF-----YYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAVTRLPR 342
Query: 263 EYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHF-RGADVKL 318
E ++ L+ + A P P S + CY++S + + P V+ +F +GA + L
Sbjct: 343 EAYAALRGAFDGAMGALP-----RSPAVSLLDTCYDLSGYASVRVPTVSFYFDQGAVLTL 397
Query: 319 SPSNLFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
NL + + C AF ++ I + G I Q I D V F P+ C
Sbjct: 398 PARNLLVEVGGAVFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPNTC 451
>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
Length = 441
Score = 167 bits (424), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 122/370 (32%), Positives = 179/370 (48%), Gaps = 53/370 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YL+ L+IGTPP+ +DTGSD WTQC PC L C Q P FD K+S+TY ++ C S
Sbjct: 89 YLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPC--LLCAAQPTPYFDVKRSATYRALPCRS 146
Query: 95 SQCAVVTS-NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
S+CA ++S +C + C Y + YG A ++G LA ET TF + S V N+ FGCG
Sbjct: 147 SRCAALSSPSCFKKMCVYQYYYGDTAS---TAGVLANETFTFGAASSTKVRAANISFGCG 203
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFGGIV- 209
N + S G++G G G SL+SQ+G S +FSYCL S S++ FG
Sbjct: 204 SLNAGELANSS---GMVGFGRGPLSLVSQLGPS---RFSYCLTSYLSPTPSRLYFGVFAN 257
Query: 210 -------AGAGVVSTPLIIR----DHYYLSLEAISVGNQRL-------EFVSSSTGNIFV 251
+G+ V STP +I + Y+LS++ IS+G +RL TG + +
Sbjct: 258 LNSTNTSSGSPVQSTPFVINPALPNMYFLSVKGISLGTKRLPIDPLVFAINDDGTGGVII 317
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-----LCYNISSQPK--- 303
D+G T L + + ++ +++ I P +D C+ P
Sbjct: 318 DSGTSITWLQQDAYEAVRRGLASTIPL---------PAMNDTDIGLDTCFQWPPPPNVTV 368
Query: 304 -FPEVTIHFRGADVKLSPSNLFRNISDE-IMCSAFRGGNANIVYGRIMQINFLIGYDIEQ 361
P+ HF GA++ L P N S +C A + + G Q N + YDI
Sbjct: 369 TVPDFVFHFDGANMTLPPENYMLIASTTGYLCLAMAPTSVGTIIGNYQQQNLHLLYDIAN 428
Query: 362 AMVSFKPSRC 371
+ +SF P+ C
Sbjct: 429 SFLSFVPAPC 438
>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
Length = 464
Score = 167 bits (423), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 116/359 (32%), Positives = 170/359 (47%), Gaps = 46/359 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + + +G+PP D + VD+GSD W QC PC + C+ Q PLFDP SS+++ +SC S
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQ--CYAQTDPLFDPAASSSFSGVSCGS 187
Query: 95 SQCAVVT-----SNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
+ C ++ G C YS YG G+Y + G LA ETLT T+ + V
Sbjct: 188 AICRTLSGTGCGGGGDAGKCDYSVTYGDGSY---TKGELALETLTLGGTA-----VQGVA 239
Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIV 209
GCGH+N + G++GLG G SL+ Q+G + G FSYCL +G+ G +V
Sbjct: 240 IGCGHRNSGLFVGAA---GLLGLGWGAMSLVGQLGGAAGGVFSYCLASRGAG--GAGSLV 294
Query: 210 AGAGVVSTPLIIRDH-----YYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLR 257
G T + R YY+ L I VG +RL + G + +DTG
Sbjct: 295 LG----RTEAVPRGRRASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAV 350
Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHF-RG 313
T LP E ++ L+ + A P P S + CY++S + + P V+ +F +G
Sbjct: 351 TRLPREAYAALRGAFDGAMGALP-----RSPAVSLLDTCYDLSGYASVRVPTVSFYFDQG 405
Query: 314 ADVKLSPSNLFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
A + L NL + + C AF ++ I + G I Q I D V F P+ C
Sbjct: 406 AVLTLPARNLLVEVGGAVFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPNTC 464
>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
Length = 452
Score = 167 bits (423), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 126/367 (34%), Positives = 182/367 (49%), Gaps = 47/367 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
+LM ++IGTP + VDTGSD WTQC+PC +DCFKQ P+FDP SSTY ++ CSS
Sbjct: 100 FLMDVAIGTPALSYAAIVDTGSDLVWTQCKPC--VDCFKQSTPVFDPSSSSTYATVPCSS 157
Query: 95 SQCA-VVTSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
+ C+ + TS C S C Y++ YG AS + G LA+ET T ++P V FGC
Sbjct: 158 ALCSDLPTSTCTSASKCGYTYTYGD---ASSTQGVLASETFTLGKEK---KKLPGVAFGC 211
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD----QGSSKINFGGI 208
G N ++ G++GLG G SL+SQ+G KFSYCL G S + GG
Sbjct: 212 GDTNEGD--GFTQGAGLVGLGRGPLSLVSQLGLD---KFSYCLTSLDDGDGKSPLLLGGS 266
Query: 209 VAGAG-------VVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSS-------TGNIF 250
A V +TPL+ YY+SL ++VG+ R+ +S+ TG +
Sbjct: 267 AAAISESAATAPVQTTPLVKNPSQPSFYYVSLTGLTVGSTRITLPASAFAIQDDGTGGVI 326
Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS----QPKFPE 306
VD+G T L L+ + LK + V G+E G LC+ + + + P+
Sbjct: 327 VDSGTSITYLELQGYRALKKAFVAQMALPTVD--GSEIGLD--LCFQGPAKGVDEVQVPK 382
Query: 307 VTIHFR-GADVKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMV 364
+ +HF GAD+ L N + + + +C + G Q NF YD+ +
Sbjct: 383 LVLHFDGGADLDLPAENYMVLDSASGALCLTVAPSRGLSIIGNFQQQNFQFVYDVAGDTL 442
Query: 365 SFKPSRC 371
SF P +C
Sbjct: 443 SFAPVQC 449
>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 134/360 (37%), Positives = 195/360 (54%), Gaps = 43/360 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
+LM L+IGTPP +DTGSD WTQC+PC + CF Q P+FDPKKSS+++ +SCSS
Sbjct: 97 FLMKLAIGTPPETYSAIMDTGSDLIWTQCKPCTQ--CFDQPTPIFDPKKSSSFSKLSCSS 154
Query: 95 SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
C A+ S CS+G C Y LYG G Y+S + G LA+ETLTF V +P V FGCG
Sbjct: 155 KLCEALPQSTCSDG-CEY--LYGYGDYSS-TQGMLASETLTFGK-----VSVPEVAFGCG 205
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSKINFGGI-- 208
N S S+ +G++GLG G SL+SQ+ KFSYCL D +S + G +
Sbjct: 206 EDNEGS--GFSQGSGLVGLGRGPLSLVSQLKEP---KFSYCLTSVDDTKASTLLMGSLAS 260
Query: 209 --VAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSS-------TGNIFVDTGV 255
+ + + +TPLI YYLSLE ISVG+ L S+ +G + +D+G
Sbjct: 261 VKASDSEIKTTPLIQNSAQPSFYYLSLEGISVGDTSLPIKKSTFSLQEDGSGGLIIDSGT 320
Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI---SSQPKFPEVTIHFR 312
T L + ++ I PV G+ G +C+ + S+ + P++ HF
Sbjct: 321 TITYLEQSAFDLVAKEFTSQINL-PVDNSGST-GLE--VCFTLPSGSTDIEVPKLVFHFD 376
Query: 313 GADVKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
GAD++L N + + S + C A + ++G I Q N L+ +D+E+ +SF P++C
Sbjct: 377 GADLELPAENYMIADASMGVACLAMGSSSGMSIFGNIQQQNMLVLHDLEKETLSFLPTQC 436
>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 136/370 (36%), Positives = 200/370 (54%), Gaps = 43/370 (11%)
Query: 25 QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
+A ++ + +LM L+IGTPP +DTGSD WTQC+PC + CF Q P+FDPKKS
Sbjct: 87 EAPVLPGNGEFLMKLAIGTPPETYSAILDTGSDLIWTQCKPCTQ--CFHQSTPIFDPKKS 144
Query: 85 STYNSISCSSSQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
S+++ +SCSS C A+ S+C+ G C Y LY G Y+S + G LA+ETLTF S
Sbjct: 145 SSFSKLSCSSQLCEALPQSSCNNG-CEY--LYSYGDYSS-TQGILASETLTFGKAS---- 196
Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGS 200
+PNV FGCG N S S+ G++GLG G SL+SQ+ KFSYCL D +
Sbjct: 197 -VPNVAFGCGADNEGS--GFSQGAGLVGLGRGPLSLVSQLKEP---KFSYCLTTVDDTKT 250
Query: 201 SKINFGGI----VAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSS------- 245
S + G + + + + +TPLI YYLSLE ISVG+ RL S+
Sbjct: 251 STLLMGSLASVNASSSAIKTTPLIHSPAHPSFYYLSLEGISVGDTRLPIKKSTFSLQDDG 310
Query: 246 TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI---SSQP 302
+G + +D+G T L E NL + PV G+ G +C+ + S+
Sbjct: 311 SGGLIIDSGTTITYLE-ESAFNLVAKEFTAKINLPVDSSGST-GLD--VCFTLPSGSTNI 366
Query: 303 KFPEVTIHFRGADVKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQ 361
+ P++ HF GAD++L N + + S + C A + ++G + Q N L+ +D+E+
Sbjct: 367 EVPKLVFHFDGADLELPAENYMIGDSSMGVACLAMGSSSGMSIFGNVQQQNMLVLHDLEK 426
Query: 362 AMVSFKPSRC 371
+SF P++C
Sbjct: 427 ETLSFLPTQC 436
>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 417
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 132/376 (35%), Positives = 187/376 (49%), Gaps = 46/376 (12%)
Query: 27 EIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSST 86
+ SV YLM L+IGTPPV DTGSD TWTQC+PC CF Q+ P++DP SST
Sbjct: 58 RLHSVQVEYLMELAIGTPPVPFVALADTGSDLTWTQCQPCKL--CFPQDTPVYDPSASST 115
Query: 87 YNSISCSSSQC--AVVTSNCSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNST-SGL 141
++ + CSS+ C + NCS C Y + Y GAY S G L TETLT S+ G
Sbjct: 116 FSPVPCSSATCLPTWRSRNCSNPSSPCRYIYSYSDGAY---SVGILGTETLTIGSSVPGQ 172
Query: 142 PVEMPNVIFGCGHKNLASPTSDS-KQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS 200
V + +V FGCG N DS TG +GLG G SL++Q+G GKFSYCL D +
Sbjct: 173 TVSVGSVAFGCGTDN----GGDSLNSTGTVGLGRGTLSLLAQLG---VGKFSYCLTDFFN 225
Query: 201 SKIN---FGGIVA----GAGVV-STPLIIR----DHYYLSLEAISVGNQRL-------EF 241
S ++ F G +A G G V STPL+ Y+++L+ IS+G+ RL +
Sbjct: 226 STMDSPFFLGTLAELAPGPGTVQSTPLLQSPLNPSRYFVNLQGISLGDVRLPIPNGTFDL 285
Query: 242 VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY-NISS 300
+ G + VD+G T+L + ++ ++ PV D C+ +
Sbjct: 286 RADGNGGMMVDSGTTFTILAKSGFREVVDRVAQLLGQPPVNASSL-----DSPCFPSPDG 340
Query: 301 QPKFPEVTIHFR-GADVKLSPSNLFR-NISDEIMCSAFRGGNANI-VYGRIMQINFLIGY 357
+P P++ +HF GAD++L N N D C G + G Q N + +
Sbjct: 341 EPFMPDLVLHFAGGADMRLHRDNYMSYNEDDSSFCLNIVGSPSTWSRLGNFQQQNIQMLF 400
Query: 358 DIEQAMVSFKPSRCTN 373
D+ +SF P+ C+
Sbjct: 401 DMTVGQLSFLPTDCSK 416
>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
Length = 459
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 133/392 (33%), Positives = 196/392 (50%), Gaps = 63/392 (16%)
Query: 26 AEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSS 85
A + S YLM L+IGTPPV DTGSD TWTQC+PC CF Q+ P++D S+
Sbjct: 86 ARLRSGQAEYLMELAIGTPPVPFVALADTGSDLTWTQCKPCKL--CFPQDTPIYDTAASA 143
Query: 86 TYNSISCSSSQCAVV---TSNCSE---GDCSYSFLYGRGAYASFSSGNLATETLTF-NST 138
+++ + C+S+ C + + NC+ C Y + Y GAY S+G L TETLTF S+
Sbjct: 144 SFSPVPCASATCLPIWRSSRNCTATTTSPCRYRYAYDDGAY---SAGVLGTETLTFAGSS 200
Query: 139 SGLP---VEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL 195
G P V + V FGCG N TG +GLG G+ SL++Q+G GKFSYCL
Sbjct: 201 PGAPGPGVSVGGVAFGCGVDNGG---LSYNSTGTVGLGRGSLSLVAQLGV---GKFSYCL 254
Query: 196 PD----QGSSKINFGGI--------VAGAGVVSTPLIIRDH----YYLSLEAISVGNQRL 239
D S + FG + + GA V STPL+ + YY+SLE IS+G+ RL
Sbjct: 255 TDFFNTSLGSPVLFGSLAELAAPSTIGGAAVQSTPLVQGPYNPSRYYVSLEGISLGDARL 314
Query: 240 -------EFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNM--IKAQPVKGVGAEPGF 290
+ +G + VD+G + T+L S + V++++ + QPV +
Sbjct: 315 PIPNGTFDLRDDGSGGMIVDSGTIFTVL---VESAFRVVVNHVAGVLNQPVVNASSL--- 368
Query: 291 SDVLCYNISSQ----PKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNANIVY 345
D C+ ++ P P++ +HF GAD++L N + + + E A Y
Sbjct: 369 -DSPCFPATAGEQQLPDMPDMLLHFAGGADMRLHRDN-YMSFNQESSSFCLNIAGAPSAY 426
Query: 346 GRIM----QINFLIGYDIEQAMVSFKPSRCTN 373
G I+ Q N + +DI +SF P+ C+
Sbjct: 427 GSILGNFQQQNIQMLFDITVGQLSFVPTDCSK 458
>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
Length = 445
Score = 164 bits (416), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 122/373 (32%), Positives = 186/373 (49%), Gaps = 48/373 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YLM L+IGTPPV DTGSD WTQC PC CF+Q PL++P S+T+ + C+S
Sbjct: 86 YLMTLAIGTPPVSYQAIADTGSDLIWTQCAPCSS-QCFQQPTPLYNPSSSTTFAVLPCNS 144
Query: 95 --SQCAVVTSNCSEG---DCSYSFLYGRGAYASFSSGNLATETLTF-NSTSGLPVEMPNV 148
S CA + + C Y+ YG G ++S +ET TF +ST +P +
Sbjct: 145 SLSMCAAALAGTTPPPGCTCMYNMTYGSG----WTSVYQGSETFTFGSSTPANQTGVPGI 200
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSKINF 205
FGC N + + S +G++GLG G+ SL+SQ+G KFSYCL D S+
Sbjct: 201 AFGC--SNASGGFNTSSASGLVGLGRGSLSLVSQLGVP---KFSYCLTPYQDTNSTSTLL 255
Query: 206 GGIVAG----AGVVSTPLI-------IRDHYYLSLEAISVGNQRLEFVSSS-------TG 247
G A GV STP + + +YYL+L IS+G L +++ TG
Sbjct: 256 LGPSASLNDTGGVSSTPFVASPSDAPMSTYYYLNLTGISLGTTALSIPTTALSLKADGTG 315
Query: 248 NIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI----SSQPK 303
+D+G TLL + +++ + +++ G A G LC+ + S+ P
Sbjct: 316 GFIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDGGSAATGLD--LCFELPSSTSAPPT 373
Query: 304 FPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRG---GNANIVYGRIMQINFLIGYDIE 360
P +T+HF GAD+ L P++ + + + C A + G +I+ G Q N I YD+
Sbjct: 374 MPSMTLHFDGADMVL-PADSYMMLDSNLWCLAMQNQTDGGVSIL-GNYQQQNMHILYDVG 431
Query: 361 QAMVSFKPSRCTN 373
Q ++F P++C+
Sbjct: 432 QETLTFAPAKCST 444
>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 516
Score = 164 bits (416), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 119/353 (33%), Positives = 180/353 (50%), Gaps = 31/353 (8%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP DTGSD TW QC+PC + C++Q LFDP +SSTY ++SC++
Sbjct: 179 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVV-CYEQREKLFDPARSSTYANVSCAA 237
Query: 95 SQCA-VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
C+ + T CS G C Y YG G+Y S G A +TLT +S + FGCG
Sbjct: 238 PACSDLDTRGCSGGHCLYGVQYGDGSY---SIGFFAMDTLTLSSYDAV----KGFRFGCG 290
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--GSSKINFGGIVAG 211
+N + G++GLG G +SL Q G F++CLP + G+ ++FG
Sbjct: 291 ERNEG---LFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTGTGYLDFGAGSPA 347
Query: 212 AGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEYHS 266
A + +TP+++ + YY+ L I VG + L S +T VD+G + T LP +S
Sbjct: 348 ARLTTTPMLVDNGPTFYYVGLTGIRVGGRLLYIPQSVFATAGTIVDSGTVITRLPPAAYS 407
Query: 267 NLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR-GADVKLSPSN 322
+L+S + + A +G P S + CY+ + SQ P V++ F+ GA + + S
Sbjct: 408 SLRSAFAAAMSA---RGYKKAPAVSLLDTCYDFAGMSQVAIPTVSLLFQGGARLDVDASG 464
Query: 323 LFRNISDEIMCSAFR----GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ S +C AF GG+ IV G F + YDI + +VSF P C
Sbjct: 465 IMYAASASQVCLAFAANEDGGDVGIV-GNTQLKTFGVAYDIGKKVVSFSPGAC 516
>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 469
Score = 164 bits (416), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 122/371 (32%), Positives = 187/371 (50%), Gaps = 46/371 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YLM L+IGTPP+ DTGSD WTQC PC CF+Q PL++P S+T++ + C+S
Sbjct: 112 YLMTLAIGTPPLPYAAVADTGSDLIWTQCAPC-GTQCFEQPAPLYNPASSTTFSVLPCNS 170
Query: 95 --SQCAVVTSNCSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
S CA + + C Y+ YG G +++G +ET TF S++ +P V
Sbjct: 171 SLSMCAGALAGAAPPPGCACMYNQTYGTG----WTAGVQGSETFTFGSSAADQARVPGVA 226
Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-PDQ---GSSKINF 205
FGC + AS + + G++GLG G+ SL+SQ+G AG+FSYCL P Q +S +
Sbjct: 227 FGCSN---ASSSDWNGSAGLVGLGRGSLSLVSQLG---AGRFSYCLTPFQDTNSTSTLLL 280
Query: 206 GGIVA--GAGVVSTPLI-------IRDHYYLSLEAISVGNQRL-------EFVSSSTGNI 249
G A G GV STP + + +YYL+L IS+G + L TG +
Sbjct: 281 GPSAALNGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTGGL 340
Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI---SSQPK--F 304
+D+G T L + +++ + +++ P G LC+ + +S P
Sbjct: 341 IIDSGTTITSLANAAYQQVRAAVKSLVTTLPTVDGSDSTGLD--LCFALPAPTSAPPAVL 398
Query: 305 PEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRG--GNANIVYGRIMQINFLIGYDIEQA 362
P +T+HF GAD+ L P++ + + C A R A +G Q N I YD+ +
Sbjct: 399 PSMTLHFDGADMVL-PADSYMISGSGVWCLAMRNQTDGAMSTFGNYQQQNMHILYDVREE 457
Query: 363 MVSFKPSRCTN 373
+SF P++C+
Sbjct: 458 TLSFAPAKCST 468
>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
Length = 448
Score = 164 bits (414), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 125/369 (33%), Positives = 188/369 (50%), Gaps = 47/369 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YLM L+IGTPP+ VDTGSD WTQC PC + C Q P F P +S+TY + C S
Sbjct: 92 YLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPC--VLCADQPTPYFRPARSATYRLVPCRS 149
Query: 95 SQCAVVT-SNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
CA + C + C Y + YG A ++G LA+ET TF + + V + +V FGC
Sbjct: 150 PLCAALPYPACFQRSVCVYQYYYGDEAS---TAGVLASETFTFGAANSSKVMVSDVAFGC 206
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFGGIV 209
G+ N + S G++GLG G SL+SQ+G S +FSYCL S S++NFG
Sbjct: 207 GNINSGQLANSS---GMVGLGRGPLSLVSQLGPS---RFSYCLTSFLSPEPSRLNFGVFA 260
Query: 210 ---------AGAGVVSTPLIIRDH----YYLSLEAISVGNQRL-------EFVSSSTGNI 249
+G+ V STPL++ Y++SL+ IS+G +RL TG +
Sbjct: 261 TLNGTNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGV 320
Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK----FP 305
F+D+G T L + + ++ + ++++ P E G C+ P P
Sbjct: 321 FIDSGTSLTWLQQDAYDAVRRELVSVLRPLPPTN-DTEIGLET--CFPWPPPPSVAVTVP 377
Query: 306 EVTIHFR-GADVKLSPSN-LFRNISDEIMCSAF-RGGNANIVYGRIMQINFLIGYDIEQA 362
++ +HF GA++ + P N + + + +C A R G+A I+ G Q N I YDI +
Sbjct: 378 DMELHFDGGANMTVPPENYMLIDGATGFLCLAMIRSGDATII-GNYQQQNMHILYDIANS 436
Query: 363 MVSFKPSRC 371
++SF P+ C
Sbjct: 437 LLSFVPAPC 445
>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
Length = 448
Score = 164 bits (414), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 124/369 (33%), Positives = 187/369 (50%), Gaps = 47/369 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YLM L+IGTPP+ VDTGSD WTQC PC + C Q P F P +S+TY + C S
Sbjct: 92 YLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPC--VLCADQPTPYFRPARSATYRLVPCRS 149
Query: 95 SQCAVV--TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
CA + + C Y + YG A ++G LA+ET TF + + V + +V FGC
Sbjct: 150 PLCAALPYPACFQRSVCVYQYYYGDEAS---TAGVLASETFTFGAANSSKVMVSDVAFGC 206
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFGGIV 209
G+ N + S G++GLG G SL+SQ+G S +FSYCL S S++NFG
Sbjct: 207 GNINSGQLANSS---GMVGLGRGPLSLVSQLGPS---RFSYCLTSFLSPEPSRLNFGVFA 260
Query: 210 ---------AGAGVVSTPLIIRDH----YYLSLEAISVGNQRL-------EFVSSSTGNI 249
+G+ V STPL++ Y++SL+ IS+G +RL TG +
Sbjct: 261 TLNGTNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGV 320
Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK----FP 305
F+D+G T L + + ++ + ++++ P E G C+ P P
Sbjct: 321 FIDSGTSLTWLQQDAYDAVRHELVSVLRPLPPTN-DTEIGLET--CFPWPPPPSVAVTVP 377
Query: 306 EVTIHFR-GADVKLSPSN-LFRNISDEIMCSAF-RGGNANIVYGRIMQINFLIGYDIEQA 362
++ +HF GA++ + P N + + + +C A R G+A I+ G Q N I YDI +
Sbjct: 378 DMELHFDGGANMTVPPENYMLIDGATGFLCLAMIRSGDATII-GNYQQQNMHILYDIANS 436
Query: 363 MVSFKPSRC 371
++SF P+ C
Sbjct: 437 LLSFVPAPC 445
>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 117/355 (32%), Positives = 172/355 (48%), Gaps = 34/355 (9%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+ L +GTP VDTGS TW QC PC + C +Q PLFDP+ SSTY S+ CS+
Sbjct: 134 YVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPC-VVSCHRQVGPLFDPRASSTYASVRCSA 192
Query: 95 SQC------AVVTSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
SQC + S CS + C Y YG +SFS G+L+T+T++F ST P+
Sbjct: 193 SQCDELQAATLNPSACSASNVCIYQASYGD---SSFSVGSLSTDTVSFGST-----RYPS 244
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK-INFG 206
+GCG N + G+IGL SL+ Q+ S+ FSYCLP S+ ++ G
Sbjct: 245 FYYGCGQDNEG---LFGRSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPTAASTGYLSIG 301
Query: 207 GIVAGAGVVSTPL----IIRDHYYLSLEAISVGNQRLEFVSSSTGNI--FVDTGVLRTLL 260
G TP+ + Y+++L +SVG L S ++ +D+G + T L
Sbjct: 302 PYNTGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSSLPTIIDSGTVITRL 361
Query: 261 PLEYHSNLKSVMSNMIKAQPVKGVGAEPGFS--DVLCYNISSQPKFPEVTIHFR-GADVK 317
P H+ L + AQ + G P FS D +SQ + P V + F GA +K
Sbjct: 362 PTAVHTALSKAV-----AQAMAGAQRAPAFSILDTCFEGQASQLRVPTVAMAFAGGASMK 416
Query: 318 LSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
L+ N+ ++ D C AF ++ + G Q F + YD+ Q+ + F C+
Sbjct: 417 LTTRNVLIDVDDSTTCLAFAPTDSTAIIGNTQQQTFSVIYDVAQSRIGFSAGGCS 471
>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 441
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 122/365 (33%), Positives = 178/365 (48%), Gaps = 43/365 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YL+ L+IGTPP+ +DTGSD WTQC PC L C Q P FD KKS+TY ++ C S
Sbjct: 89 YLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPC--LLCADQPTPYFDVKKSATYRALPCRS 146
Query: 95 SQCAVVTS-NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
S+CA ++S +C + C Y + YG A ++G LA ET TF + + V N+ FGCG
Sbjct: 147 SRCASLSSPSCFKKMCVYQYYYGDTAS---TAGVLANETFTFGAANSTKVRATNIAFGCG 203
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFG---- 206
N + S G++G G G SL+SQ+G S +FSYCL S S++ FG
Sbjct: 204 SLNAGDLANSS---GMVGFGRGPLSLVSQLGPS---RFSYCLTSYLSATPSRLYFGVYAN 257
Query: 207 ----GIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRL-------EFVSSSTGNIFV 251
+G+ V STP +I + Y+LSL+AIS+G + L TG + +
Sbjct: 258 LSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVII 317
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK----FPEV 307
D+G T L + + ++V ++ A P+ + D C+ P P++
Sbjct: 318 DSGTSITWLQQDAY---EAVRRGLVSAIPLPAMNDTDIGLDT-CFQWPPPPNVTVTVPDL 373
Query: 308 TIHFRGADVKLSPSNLFRNISDE-IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSF 366
HF A++ L P N S +C + G Q N + YDI + +SF
Sbjct: 374 VFHFDSANMTLLPENYMLIASTTGYLCLVMAPTGVGTIIGNYQQQNLHLLYDIGNSFLSF 433
Query: 367 KPSRC 371
P+ C
Sbjct: 434 VPAPC 438
>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 125/366 (34%), Positives = 178/366 (48%), Gaps = 34/366 (9%)
Query: 23 IYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPK 82
++ + S + YL+ +S G+PP VDTGSD WTQC PC C +FDP
Sbjct: 68 LFSTPVASGNGEYLIDISFGSPPQKASVIVDTGSDLIWTQCLPCET--CNAAASVIFDPV 125
Query: 83 KSSTYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
KSSTY+++SC+S+ C+ + C Y ++YG G S +SG L+TET+T + +
Sbjct: 126 KSSTYDTVSCASNFCSSLPFQSCTTSCKYDYMYGDG---SSTSGALSTETVTVGTGT--- 179
Query: 143 VEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK 202
+PNV FGCGH NL S + GI+GLG G SLISQ + + KFSYCL GS+K
Sbjct: 180 --IPNVAFGCGHTNLG---SFAGAAGIVGLGQGPLSLISQASSITSKKFSYCLVPLGSTK 234
Query: 203 IN---FGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFV-------SSSTGN 248
+ G A GV T L+ YY L ISV + + + +S G
Sbjct: 235 TSPMLIGDSAAAGGVAYTALLTNTANPTFYYADLTGISVSGKAVTYPVGTFSIDASGQGG 294
Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKG--VGAEPGFSDVLCYNISSQPKFPE 306
+D+G T L + L + + + G G + FS N P +P
Sbjct: 295 FILDSGTTLTYLETGAFNALVAALKAEVPFPEADGSLYGLDYCFSTAGVAN----PTYPT 350
Query: 307 VTIHFRGADVKLSPSNLFRNI-SDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVS 365
+T HF+GAD +L P N+F + + +C A + G I Q N LI +D+ V
Sbjct: 351 MTFHFKGADYELPPENVFVALDTGGSICLAMAASTGFSIMGNIQQQNHLIVHDLVNQRVG 410
Query: 366 FKPSRC 371
FK + C
Sbjct: 411 FKEANC 416
>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
gi|194704078|gb|ACF86123.1| unknown [Zea mays]
gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 471
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 118/355 (33%), Positives = 172/355 (48%), Gaps = 34/355 (9%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+ L +GTP VDTGS TW QC PC + C +Q PLFDP+ SSTY S+ CS+
Sbjct: 134 YVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPC-VVSCHRQVGPLFDPRASSTYTSVRCSA 192
Query: 95 SQC------AVVTSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
SQC + S CS + C Y YG +SFS G L+T+T++F STS P+
Sbjct: 193 SQCDELQAATLNPSACSASNVCIYQASYGD---SSFSVGYLSTDTVSFGSTS-----YPS 244
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK-INFG 206
+GCG N + G+IGL SL+ Q+ S+ FSYCLP S+ ++ G
Sbjct: 245 FYYGCGQDNEG---LFGRSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPTAASTGYLSIG 301
Query: 207 GIVAGAGVVSTPL----IIRDHYYLSLEAISVGNQRLEFVSSSTGNI--FVDTGVLRTLL 260
G TP+ + Y+++L +SVG L S ++ +D+G + T L
Sbjct: 302 PYNTGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSSLPTIIDSGTVITRL 361
Query: 261 PLEYHSNLKSVMSNMIKAQPVKGVGAEPGFS--DVLCYNISSQPKFPEVTIHFR-GADVK 317
P H+ L + AQ + G P FS D +SQ + P V + F GA +K
Sbjct: 362 PTAVHTALSKAV-----AQAMAGAQRAPAFSILDTCFEGQASQLRVPTVVMAFAGGASMK 416
Query: 318 LSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
L+ N+ ++ D C AF ++ + G Q F + YD+ Q+ + F C+
Sbjct: 417 LTTRNVLIDVDDSTTCLAFAPTDSTAIIGNTQQQTFSVIYDVAQSRIGFSAGGCS 471
>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 452
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 123/373 (32%), Positives = 179/373 (47%), Gaps = 49/373 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YLM L+IGTPP+ DTGSD WTQC PC CF+Q PL++P S+T+ + C+S
Sbjct: 92 YLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTS-QCFRQPTPLYNPSSSTTFAVLPCNS 150
Query: 95 SQCAVVTSNCSEGD-------CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
S + G C+Y+ YG G + F +ET TF ST +P
Sbjct: 151 SLSVCAAALAGTGTAPPPGCACTYNVTYGSGWTSVFQ----GSETFTFGSTPAGHARVPG 206
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSKIN 204
+ FGC +S + S +G++GLG G SL+SQ+G KFSYCL D S+
Sbjct: 207 IAFGC--STASSGFNASSASGLVGLGRGRLSLVSQLGVP---KFSYCLTPYQDTNSTSTL 261
Query: 205 FGGIVAG----AGVVSTPLI-------IRDHYYLSLEAISVGNQRL-------EFVSSST 246
G A AGV STP + + YYL+L IS+G L + T
Sbjct: 262 LLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFSLNADGT 321
Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI----SSQP 302
G + +D+G TLL + +++ + +++ G A+ G LC+ + S+ P
Sbjct: 322 GGLIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDG-SADTGLD--LCFMLPSSTSAPP 378
Query: 303 KFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRG---GNANIVYGRIMQINFLIGYDI 359
P +T+HF GAD+ L + + + C A + G NI+ G Q N I YDI
Sbjct: 379 AMPSMTLHFNGADMVLPADSYMMSDDSGLWCLAMQNQTDGEVNIL-GNYQQQNMHILYDI 437
Query: 360 EQAMVSFKPSRCT 372
Q +SF P++C+
Sbjct: 438 GQETLSFAPAKCS 450
>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 122/366 (33%), Positives = 185/366 (50%), Gaps = 48/366 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCE-PCPELDCFKQEPPLFDPKKSSTYNSISCS 93
YL+ ++IGTPP+ + +DTGSD WTQC+ PC CF Q PL+ P +S+TY ++SC
Sbjct: 92 YLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRR--CFPQPAPLYAPARSATYANVSCR 149
Query: 94 SSQCAVVT---SNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
S C + S CS D C+Y F YG G + + G LATET T S + + V
Sbjct: 150 SPMCQALQSPWSRCSPPDTGCAYYFSYGDG---TSTDGVLATETFTLGSDTA----VRGV 202
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKIN--FG 206
FGCG +NL S + S G++G+G G SL+SQ+G + +FSYC ++ + F
Sbjct: 203 AFGCGTENLGSTDNSS---GLVGMGRGPLSLVSQLGVT---RFSYCFTPFNATAASPLFL 256
Query: 207 GIVA--GAGVVSTPLI---------IRDHYYLSLEAISVGNQRL-------EFVSSSTGN 248
G A + +TP + +YYLSLE I+VG+ L G
Sbjct: 257 GSSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRLTPMGDGG 316
Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPE 306
+ +D+G T L L +++ ++ P+ GA G S LC+ +S + P
Sbjct: 317 VIIDSGTTFTALEERAFVALARALASRVR-LPLAS-GAHLGLS--LCFAAASPEAVEVPR 372
Query: 307 VTIHFRGADVKL-SPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVS 365
+ +HF GAD++L S + + S + C V G + Q N I YD+E+ ++S
Sbjct: 373 LVLHFDGADMELRRESYVVEDRSAGVACLGMVSARGMSVLGSMQQQNTHILYDLERGILS 432
Query: 366 FKPSRC 371
F+P++C
Sbjct: 433 FEPAKC 438
>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
gi|194702684|gb|ACF85426.1| unknown [Zea mays]
gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
Length = 439
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 128/406 (31%), Positives = 199/406 (49%), Gaps = 59/406 (14%)
Query: 2 QNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWT 61
N++KL + + T +P+S +V +LM L+IGTPP+ DTGSD WT
Sbjct: 58 HNARKLAASSSDGTVSAPVSPT------TVPGEFLMTLAIGTPPLPFLAIADTGSDLIWT 111
Query: 62 QCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYA 121
QC PC CF+Q PL++P S+T++++ C+SS + C+ C Y+ YG G
Sbjct: 112 QCAPCSR-QCFQQPTPLYNPSSSTTFSALPCNSS-LGLCAPACA---CMYNMTYGSGWTY 166
Query: 122 SFSSGNLATETLTF-NSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLI 180
F TET TF +ST V +P + FGC N +S + S +G++GLG G+ SL+
Sbjct: 167 VFQ----GTETFTFGSSTPADQVRVPGIAFGC--SNASSGFNASSASGLVGLGRGSLSLV 220
Query: 181 SQMGTSIAGKFSYCL-PDQG-----------SSKINFGGIVAGAGVVSTPLIIRDHYYLS 228
SQ+G A KFSYCL P Q S+ +N G+V+ V++P I +YYL+
Sbjct: 221 SQLG---APKFSYCLTPYQDTNSTSTLLLGPSASLNDTGVVSSTPFVASPSSI--YYYLN 275
Query: 229 LEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPV 281
L IS+G L + TG + +D+G T+L + +++ + +++
Sbjct: 276 LTGISLGTTALPIPPNAFSLKADGTGGLIIDSGTTITMLGNTAYQQVRAAVLSLVTLPTT 335
Query: 282 KGVGAEPGFSDVLCYNI----SSQPKFPEVTIHFRGADVKLSPSNLFR-----NISDEIM 332
G A G LC+ + S+ P P +T+HF GAD+ L N + +
Sbjct: 336 DGSAAT-GLD--LCFELPSSTSAPPSMPSMTLHFDGADMVLPADNYMMSLSDPDSDSSLW 392
Query: 333 CSAFRG-----GNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
C A + G + G Q N I YD+ + +SF P++C+
Sbjct: 393 CLAMQNQTDTDGVVVSILGNYQQQNMHILYDVGKETLSFAPAKCST 438
>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
Length = 441
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 122/366 (33%), Positives = 185/366 (50%), Gaps = 48/366 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCE-PCPELDCFKQEPPLFDPKKSSTYNSISCS 93
YL+ ++IGTPP+ + +DTGSD WTQC+ PC CF Q PL+ P +S+TY ++SC
Sbjct: 92 YLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRR--CFPQPAPLYAPARSATYANVSCR 149
Query: 94 SSQCAVVT---SNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
S C + S CS D C+Y F YG G + + G LATET T S + + V
Sbjct: 150 SPMCQALQSPWSRCSPPDTGCAYYFSYGDG---TSTDGVLATETFTLGSDTA----VRGV 202
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKIN--FG 206
FGCG +NL S + S G++G+G G SL+SQ+G + +FSYC ++ + F
Sbjct: 203 AFGCGTENLGSTDNSS---GLVGMGRGPLSLVSQLGVT---RFSYCFTPFNATAASPLFL 256
Query: 207 GIVA--GAGVVSTPLI---------IRDHYYLSLEAISVGNQRL-------EFVSSSTGN 248
G A + +TP + +YYLSLE I+VG+ L G
Sbjct: 257 GSSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRLTPMGDGG 316
Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPE 306
+ +D+G T L L +++ ++ P+ GA G S LC+ +S + P
Sbjct: 317 VIIDSGTTFTALEESAFVALARALASRVR-LPLAS-GAHLGLS--LCFAAASPEAVEVPR 372
Query: 307 VTIHFRGADVKL-SPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVS 365
+ +HF GAD++L S + + S + C V G + Q N I YD+E+ ++S
Sbjct: 373 LVLHFDGADMELRRESYVVEDRSAGVACLGMVSARGMSVLGSMQQQNTHILYDLERGILS 432
Query: 366 FKPSRC 371
F+P++C
Sbjct: 433 FEPAKC 438
>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
gi|194708650|gb|ACF88409.1| unknown [Zea mays]
Length = 392
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 123/373 (32%), Positives = 179/373 (47%), Gaps = 49/373 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YLM L+IGTPP+ DTGSD WTQC PC CF+Q PL++P S+T+ + C+S
Sbjct: 32 YLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTS-QCFRQPTPLYNPSSSTTFAVLPCNS 90
Query: 95 SQCAVVTSNCSEGD-------CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
S + G C+Y+ YG G + F +ET TF ST +P
Sbjct: 91 SLSVCAAALAGTGTAPPPGCACTYNVTYGSGWTSVFQ----GSETFTFGSTPAGHARVPG 146
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSKIN 204
+ FGC +S + S +G++GLG G SL+SQ+G KFSYCL D S+
Sbjct: 147 IAFGC--STASSGFNASSASGLVGLGRGRLSLVSQLGVP---KFSYCLTPYQDTNSTSTL 201
Query: 205 FGGIVAG----AGVVSTPLI-------IRDHYYLSLEAISVGNQRL-------EFVSSST 246
G A AGV STP + + YYL+L IS+G L + T
Sbjct: 202 LLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFSLNADGT 261
Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI----SSQP 302
G + +D+G TLL + +++ + +++ G A+ G LC+ + S+ P
Sbjct: 262 GGLIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDG-SADTGLD--LCFMLPSSTSAPP 318
Query: 303 KFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRG---GNANIVYGRIMQINFLIGYDI 359
P +T+HF GAD+ L + + + C A + G NI+ G Q N I YDI
Sbjct: 319 AMPSMTLHFNGADMVLPADSYMMSDDSGLWCLAMQNQTDGEVNIL-GNYQQQNMHILYDI 377
Query: 360 EQAMVSFKPSRCT 372
Q +SF P++C+
Sbjct: 378 GQETLSFAPAKCS 390
>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 121/369 (32%), Positives = 183/369 (49%), Gaps = 45/369 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+M L+IGTPP+ DTGSD WTQC PC CFKQ ++P S+T+ + C+S
Sbjct: 88 YIMTLAIGTPPLSYPAIADTGSDLIWTQCAPCGS-QCFKQAGQPYNPSSSTTFGVLPCNS 146
Query: 95 --SQCAVVTSNCSEGDCS--YSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
S CA + CS Y+ YG G +++G + ET TF ST +P + F
Sbjct: 147 SVSMCAALAGPSPPPGCSCMYNQTYGTG----WTAGIQSVETFTFGSTPADQTRVPGIAF 202
Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-PDQ---GSSKINFG 206
GC + AS + G++GLG G+ SL+SQ+G AG FSYCL P Q +S + G
Sbjct: 203 GCSN---ASSDDWNGSAGLVGLGRGSMSLVSQLG---AGMFSYCLTPFQDANSTSTLLLG 256
Query: 207 GIVA--GAGVVSTPLI-------IRDHYYLSLEAISVGNQRLE-------FVSSSTGNIF 250
A G GV++TP + + +YYL+L IS+G L + TG +
Sbjct: 257 PSAALNGTGVLTTPFVASPSKAPMSTYYYLNLTGISIGTTALSIPPNAFALRTDGTGGLI 316
Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ----PKFPE 306
+D+G T L + +++ + +++ PV G LC+ ++S+ P P
Sbjct: 317 IDSGTTITSLVDAAYQQVRAAIESLVTL-PVADGSDSTGLD--LCFALTSETSTPPSMPS 373
Query: 307 VTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGN--ANIVYGRIMQINFLIGYDIEQAMV 364
+T HF GAD+ L P + + + + C A R A +G Q N + YDI + +
Sbjct: 374 MTFHFDGADMVL-PVDNYMILGSGVWCLAMRNQTVGAMSTFGNYQQQNVHLLYDIHEETL 432
Query: 365 SFKPSRCTN 373
SF P++C+
Sbjct: 433 SFAPAKCST 441
>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
Length = 457
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 124/374 (33%), Positives = 186/374 (49%), Gaps = 39/374 (10%)
Query: 25 QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPE--LDCFKQEPPLFDPK 82
+++II+ YLM++++GTPP + DTGSD W C D +F P
Sbjct: 93 ESKIITRSFEYLMYVNVGTPPTQLLAIADTGSDLVWVNCSSSGGGLADADAGGNVVFQPT 152
Query: 83 KSSTYNSISCSSSQC-AVVTSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG 140
+SSTY+ +SC S+ C A+ ++C ++ +C Y + YG G S + G L+TET +F G
Sbjct: 153 RSSTYSQLSCQSNACQALSQASCDADSECQYQYSYGDG---SRTIGVLSTETFSFVDGGG 209
Query: 141 L-PVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMG--TSIAGKFSYCL-- 195
V +P V FGC + + SD G++GLG G SL+SQ+G T I K SYCL
Sbjct: 210 KGQVRVPRVNFGCSTASAGTFRSD----GLVGLGAGAFSLVSQLGATTHIDRKLSYCLIP 265
Query: 196 --PDQGSSKINFG--GIVAGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSSSTGN 248
SS +NFG +V+ G STPL+ D +Y ++LE+++VG Q V++
Sbjct: 266 SYDANSSSTLNFGSRAVVSEPGAASTPLVPSDVDSYYTVALESVAVGGQE---VATHDSR 322
Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK----- 303
I VD+G T L L + + IK Q V+ P LCY++ + +
Sbjct: 323 IIVDSGTTLTFLDPALLGPLVTELERRIKLQRVQ----PPEQLLQLCYDVQGKSETDNFG 378
Query: 304 FPEVTIHF-RGADVKLSPSNLFRNISDEIMCSAF---RGGNANIVYGRIMQINFLIGYDI 359
P+VT+ F GA V L P N F + + +C + G I Q NF +GYD+
Sbjct: 379 IPDVTLRFGGGAAVTLRPENTFSLLQEGTLCLVLVPVSESQPVSILGNIAQQNFHVGYDL 438
Query: 360 EQAMVSFKPSRCTN 373
+ V+F + C
Sbjct: 439 DARTVTFAAADCAR 452
>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 472
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 124/372 (33%), Positives = 185/372 (49%), Gaps = 47/372 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YLM L+IGTPP+ DTGSD WTQC PC CF+Q PL++P S+T++ + C+S
Sbjct: 114 YLMTLAIGTPPLPYAAVADTGSDLIWTQCAPC-GTQCFEQPAPLYNPASSTTFSVLPCNS 172
Query: 95 --SQCAVVTSNCSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
S CA + + C Y YG G +++G +ET TF S++ +P V
Sbjct: 173 SLSMCAGALAGAAPPPGCACMYYQTYGTG----WTAGVQGSETFTFGSSAADQARVPGVA 228
Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-PDQ---GSSKINF 205
FGC + AS + + G++GLG G+ SL+SQ+G AG+FSYCL P Q +S +
Sbjct: 229 FGCSN---ASSSDWNGSAGLVGLGRGSLSLVSQLG---AGRFSYCLTPFQDTNSTSTLLL 282
Query: 206 GGIVA--GAGVVSTPLI-------IRDHYYLSLEAISVGNQRL-------EFVSSSTGNI 249
G A G GV STP + + +YYL+L IS+G + L TG +
Sbjct: 283 GPSAALNGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTGGL 342
Query: 250 FVDTG-VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI---SSQPK-- 303
+D+G + +L Y +V S ++ P G LC+ + +S P
Sbjct: 343 IIDSGTTITSLANAAYQQVRAAVKSQLVTTLPTVDGSDSTGLD--LCFALPAPTSAPPAV 400
Query: 304 FPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRG--GNANIVYGRIMQINFLIGYDIEQ 361
P +T+HF GAD+ L P++ + + C A R A +G Q N I YD+ +
Sbjct: 401 LPSMTLHFDGADMVL-PADSYMISGSGVWCLAMRNQTDGAMSTFGNYQQQNMHILYDVRE 459
Query: 362 AMVSFKPSRCTN 373
+SF P++C+
Sbjct: 460 ETLSFAPAKCST 471
>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 123/373 (32%), Positives = 178/373 (47%), Gaps = 49/373 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YLM L+IGTPP+ DTGSD WTQC PC CF+Q PL++P S+T+ + C+S
Sbjct: 90 YLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTS-QCFRQPTPLYNPSSSTTFAVLPCNS 148
Query: 95 SQCAVVTSNCSEGD-------CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
S + G C+Y+ YG G + F +ET TF ST +P
Sbjct: 149 SLSVCAAALAGTGTAPPPGCACTYNVTYGSGWTSVFQ----GSETFTFGSTPAGQSRVPG 204
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSKIN 204
+ FGC +S + S +G++GLG G SL+SQ+G KFSYCL D S+
Sbjct: 205 IAFGC--STASSGFNASSASGLVGLGRGRLSLVSQLGVP---KFSYCLTPYQDTNSTSTL 259
Query: 205 FGGIVAG----AGVVSTPLI-------IRDHYYLSLEAISVGNQRLE-------FVSSST 246
G A AGV STP + + YYL+L IS+G L + T
Sbjct: 260 LLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFLLNADGT 319
Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI----SSQP 302
G + +D+G TLL + +++ + +++ G A G LC+ + S+ P
Sbjct: 320 GGLIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDGSAAT-GLD--LCFMLPSSTSAPP 376
Query: 303 KFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRG---GNANIVYGRIMQINFLIGYDI 359
P +T+HF GAD+ L + + + C A + G NI+ G Q N I YDI
Sbjct: 377 AMPSMTLHFNGADMVLPADSYMMSDDSGLWCLAMQNQTDGEVNIL-GNYQQQNMHILYDI 435
Query: 360 EQAMVSFKPSRCT 372
Q +SF P++C+
Sbjct: 436 GQETLSFAPAKCS 448
>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
Length = 444
Score = 161 bits (408), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 124/371 (33%), Positives = 181/371 (48%), Gaps = 41/371 (11%)
Query: 28 IISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTY 87
+++ D YLM + IGTP +DTGSD WTQC PC L C Q P FDP SSTY
Sbjct: 85 VLASDGEYLMEMGIGTPARFYSAILDTGSDLIWTQCAPC--LLCVDQPTPYFDPANSSTY 142
Query: 88 NSISCSSSQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
S+ CS+ C A+ C + C Y + YG A ++G LA ET TF T+ V +P
Sbjct: 143 RSLGCSAPACNALYYPLCYQKTCVYQYFYGDSAS---TAGVLANETFTFG-TNDTRVTLP 198
Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKI 203
+ FGCG+ N S + S G++G G G+ SL+SQ+G+ +FSYCL S S++
Sbjct: 199 RISFGCGNLNAGSLANGS---GMVGFGRGSLSLVSQLGSP---RFSYCLTSFLSPVRSRL 252
Query: 204 NFGGIVA-----GAGVVSTPLIIR----DHYYLSLEAISVGNQRLEF--------VSSST 246
FG + V STP II Y+L++ ISVG RL + T
Sbjct: 253 YFGAYATLNSTNASTVQSTPFIINPALPTMYFLNMTGISVGGNRLPIDPAVLAINDTDGT 312
Query: 247 GNIFVDTGVLRTLLPL-EYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK-- 303
G +D+G T L Y++ ++ + + P+ V E D C+ P+
Sbjct: 313 GGTIIDSGTTITYLAEPAYYAVREAFVLYLNSTLPLLDV-TETSVLDT-CFQWPPPPRQS 370
Query: 304 --FPEVTIHFRGADVKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIE 360
P++ +HF GAD +L N + + S +C A + + G NF + YD+E
Sbjct: 371 VTLPQLVLHFDGADWELPLQNYMLVDPSTGGLCLAMATSSDGSIIGSYQHQNFNVLYDLE 430
Query: 361 QAMVSFKPSRC 371
+++SF P+ C
Sbjct: 431 NSLLSFVPAPC 441
>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
Length = 456
Score = 161 bits (408), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 120/378 (31%), Positives = 187/378 (49%), Gaps = 45/378 (11%)
Query: 25 QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
+++II+ YLM++++GTPP + DTGSD W C +F P +S
Sbjct: 90 ESKIITRSFEYLMYVNVGTPPAQMLAIADTGSDLVWVNCSSNGGGGGASDGAVVFHPSRS 149
Query: 85 STYNSISCSSSQC-AVVTSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTF---NSTS 139
+TY+ +SC S+ C A+ ++C ++ +C Y + YG G S + G L+TET +F
Sbjct: 150 TTYSLLSCQSAACQALSQASCDADSECQYQYAYGDG---SRTIGVLSTETFSFAAAGGGG 206
Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCL-- 195
V +P V FGC + S SD G++GLG G SL+SQ+G + IA +FSYCL
Sbjct: 207 EGQVRVPRVSFGCSTGSAGSFRSD----GLVGLGAGALSLVSQLGAAARIARRFSYCLVP 262
Query: 196 ---PDQGSSKINFG--GIVAGAGVVSTPLI---IRDHYYLSLEAISVGNQRLEFVSSSTG 247
SS ++FG +V+ G STPL+ + +Y ++LE+++V Q + S+++
Sbjct: 263 PYAAANSSSTLSFGARAVVSDPGAASTPLVPSEVDSYYTVALESVAVAGQ--DVASANSS 320
Query: 248 NIFVDTGVLRTLLPLEYHSNLKSVMSNMI---KAQPVKGVGAEPGFSDVLCYNISSQPK- 303
I VD+G T L L + + I +AQP P LCY++ + +
Sbjct: 321 RIIVDSGTTLTFLDPALLRPLVAELERRIRLPRAQP-------PEQLLQLCYDVQGKSQA 373
Query: 304 ----FPEVTIHF-RGADVKLSPSNLFRNISDEIMCSAF---RGGNANIVYGRIMQINFLI 355
P+VT+ F GA V L P N F + + +C + G I Q NF +
Sbjct: 374 EDFGIPDVTLRFGGGASVTLRPENTFSLLEEGTLCLVLVPVSESQPVSILGNIAQQNFHV 433
Query: 356 GYDIEQAMVSFKPSRCTN 373
GYD++ V+F CT
Sbjct: 434 GYDLDARTVTFAAVDCTR 451
>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 481
Score = 161 bits (408), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 119/358 (33%), Positives = 173/358 (48%), Gaps = 36/358 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP D+ DTGSD TWTQCEPC C+ Q+ P+F+P KS++Y +ISCSS
Sbjct: 138 YVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARY-CYHQQEPIFNPSKSTSYTNISCSS 196
Query: 95 SQCAVVTS------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
C + S +CS C Y YG +Y S G A + L ST N
Sbjct: 197 PTCDELKSGTGNSPSCSASTCVYGIQYGDQSY---SVGFFAQDKLALTSTD----VFNNF 249
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFG 206
+FGCG N + G+IGLG SL+SQ FSYCLP SS + FG
Sbjct: 250 LFGCGQNNRGLFVGVA---GLIGLGRNALSLVSQTAQKYGKLFSYCLPSTSSSTGYLTFG 306
Query: 207 -GIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTL 259
G V TP ++ Y+L+L AISVG ++L +S ST +D+G + +
Sbjct: 307 SGGGTSKAVKFTPSLVNSQGPSFYFLNLIAISVGGRKLSTSASVFSTAGTIIDSGTVISR 366
Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFR-GADV 316
LP +S+L++ + P A P CY+ S P++ ++F GA++
Sbjct: 367 LPPTAYSDLRASFQQQMSKYP----KAAPASILDTCYDFSQYDTVDVPKINLYFSDGAEM 422
Query: 317 KLSPSNLFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
L PS +F ++ +C AF G + + G + Q F + YD+ + F P C
Sbjct: 423 DLDPSGIFYILNISQVCLAFAGNSDATDIAILGNVQQKTFDVVYDVAGGRIGFAPGGC 480
>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 351
Score = 161 bits (408), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 119/364 (32%), Positives = 180/364 (49%), Gaps = 46/364 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ +S+GTPP VDTGSD W QC PC CF+Q PLF P SS+Y++ SC+
Sbjct: 8 YVLQISLGTPPQQFSAIVDTGSDLCWVQCAPCAR--CFEQPDPLFIPLASSSYSNASCTD 65
Query: 95 SQC-AVVTSNCS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
S C A+ CS C+YS+ YG G S + G+ A ET+T N ++ + + FGC
Sbjct: 66 SLCDALPRPTCSMRNTCTYSYSYGDG---SNTRGDFAFETVTLNGST-----LARIGFGC 117
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS----SKINFGGI 208
GH + + G+IGLG G SL SQ+ +S FSYCL DQ + S I FG
Sbjct: 118 GHNQEG---TFAGADGLIGLGQGPLSLPSQLNSSFTHIFSYCLVDQSTTGTFSPITFGNA 174
Query: 209 VAGAGVVSTPLIIRD----HYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLR 257
+ TPL+ + +YY+ +E+ISVGN+R+ ++ G + +D+G
Sbjct: 175 AENSRASFTPLLQNEDNPSYYYVGVESISVGNRRVPTPPSAFRIDANGVGGVILDSGTTI 234
Query: 258 TLLPLEYHSNLKSVMSNMI---KAQPVKGVGAEPGFSDVLCYNIS----SQPKFPEVTIH 310
T L + + + I +A P + LCY+IS S P +T+H
Sbjct: 235 TYWRLAAFIPILAELRRQISYPEADPTP-------YGLNLCYDISSVSASSLTLPSMTVH 287
Query: 311 FRGADVKLSPSNLFRNISD--EIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKP 368
D ++ SNL+ + + E +C+A + + G + Q N LI D+ + V F
Sbjct: 288 LTNVDFEIPVSNLWVLVDNFGETVCTAMSTSDQFSIIGNVQQQNNLIVTDVANSRVGFLA 347
Query: 369 SRCT 372
+ C+
Sbjct: 348 TDCS 351
>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
Length = 489
Score = 161 bits (408), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 120/360 (33%), Positives = 172/360 (47%), Gaps = 41/360 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y L +GTPP ++ +DTGSD W QC PC + C+ Q P+FDPKKS +++SISC S
Sbjct: 147 YFTRLGVGTPPKYVYMVLDTGSDVVWIQCAPCRK--CYSQTDPVFDPKKSGSFSSISCRS 204
Query: 95 SQCAVVTS-NC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
C + S C S C Y YG G SF+ G +TETLTF T +P V GC
Sbjct: 205 PLCLRLDSPGCNSRQSCLYQVAYGDG---SFTFGEFSTETLTFRGT-----RVPKVALGC 256
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS----SKINFGGI 208
GH N + G+ S +Q G KFSYCL D+ + S + FG
Sbjct: 257 GHDNEGLFVGAAGLLGLGRG---RLSFPTQTGLRFGRKFSYCLVDRSASSKPSSVVFGQS 313
Query: 209 VAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS--------TGNIFVDTGVL 256
V TPLI + YYL L ISVG R+ +++S G + +D+G
Sbjct: 314 AVSRTAVFTPLITNPKLDTFYYLELTGISVGGARVAGITASLFKLDTAGNGGVIIDSGTS 373
Query: 257 RTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFRG 313
T L + +L+ A +K P +S C+++S ++ K P V +HFRG
Sbjct: 374 VTRLTRRAYVSLRDAFR--AGAADLK---RAPDYSLFDTCFDLSGKTEVKVPTVVMHFRG 428
Query: 314 ADVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
ADV L +N + ++ + C AF G + + + G I Q F + +D+ + + F C
Sbjct: 429 ADVSLPATNYLIPVDTNGVFCFAFAGTMSGLSIIGNIQQQGFRVVFDVAASRIGFAARGC 488
>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
Length = 519
Score = 160 bits (406), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 121/355 (34%), Positives = 182/355 (51%), Gaps = 33/355 (9%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP DTGSD TW QC+PC + C++Q LFDP +SSTY +ISC++
Sbjct: 180 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVV-CYEQREKLFDPARSSTYANISCAA 238
Query: 95 SQCA-VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
C+ + T CS G+C Y YG G+Y S G A +TLT +S + FGCG
Sbjct: 239 PACSDLDTRGCSGGNCLYGVQYGDGSY---SIGFFAMDTLTLSSYD----AVKGFRFGCG 291
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--GSSKINFG-GIVA 210
+N + G++GLG G +SL Q G F++CLP + G+ ++FG G A
Sbjct: 292 ERNEG---LFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSSGTGYLDFGPGSPA 348
Query: 211 GAGV-VSTPLIIRD---HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEY 264
AG ++TP++ + YY+ + I VG Q L S +T VD+G + T LP
Sbjct: 349 AAGARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFTTAGTIVDSGTVITRLPPAA 408
Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR-GADVKLSP 320
+S+L+S ++ + A +G P S + CY+ + SQ P V++ F+ GA + +
Sbjct: 409 YSSLRSAFASAMAA---RGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDA 465
Query: 321 SNLFRNISDEIMCSAFR----GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
S + S +C F GG+ IV G F + YDI + +V F P C
Sbjct: 466 SGIMYAASVSQVCLGFAANEDGGDVGIV-GNTQLKTFGVAYDIGKKVVGFSPGAC 519
>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
gi|223948487|gb|ACN28327.1| unknown [Zea mays]
Length = 434
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 119/354 (33%), Positives = 171/354 (48%), Gaps = 32/354 (9%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP DTGSD TW QC+PC C++Q+ PLFDP KS+TY +ISCSS
Sbjct: 96 YVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAY-CYRQKEPLFDPTKSATYANISCSS 154
Query: 95 SQCA-VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
S C+ + S CS G C Y YG G+Y + G A +TLT + + N FGCG
Sbjct: 155 SYCSDLYVSGCSGGHCLYGIQYGDGSY---TIGFYAQDTLTLAYDT-----IKNFRFGCG 206
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKINFGGIVAG 211
KN + G++GLG G +SL Q G F+YCLP G+ ++ G
Sbjct: 207 EKNRG---LFGRAAGLLGLGRGKTSLPVQAYDKYGGVFAYCLPATSAGTGFLDLGPGAPA 263
Query: 212 AGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEYHS 266
A TP+++ YY+ + I VG L S ST VD+G + T LP ++
Sbjct: 264 ANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFSTAGTLVDSGTVITRLPPSAYA 323
Query: 267 NLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP----KFPEVTIHFRGA---DVKL 318
L+S S KA G A P FS + CY+++ P V++ F+G DV
Sbjct: 324 PLRSAFS---KAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLVFQGGACLDVDA 380
Query: 319 SPSNLFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
S ++S + A + ++ + G Q + YDI + +V F P C
Sbjct: 381 SGILYVADVSQACLAFAPNADDTDVAIVGNTQQKTHGVLYDIGKKIVGFAPGAC 434
>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
Length = 499
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 119/354 (33%), Positives = 171/354 (48%), Gaps = 32/354 (9%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP DTGSD TW QC+PC C++Q+ PLFDP KS+TY +ISCSS
Sbjct: 161 YVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAY-CYRQKEPLFDPTKSATYANISCSS 219
Query: 95 SQCA-VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
S C+ + S CS G C Y YG G S++ G A +TLT + + N FGCG
Sbjct: 220 SYCSDLYVSGCSGGHCLYGIQYGDG---SYTIGFYAQDTLTLAYDT-----IKNFRFGCG 271
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKINFGGIVAG 211
KN + G++GLG G +SL Q G F+YCLP G+ ++ G
Sbjct: 272 EKNRG---LFGRAAGLLGLGRGKTSLPVQAYDKYGGVFAYCLPATSAGTGFLDLGPGAPA 328
Query: 212 AGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEYHS 266
A TP+++ YY+ + I VG L S ST VD+G + T LP ++
Sbjct: 329 ANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFSTAGTLVDSGTVITRLPPSAYA 388
Query: 267 NLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP----KFPEVTIHFRGA---DVKL 318
L+S S KA G A P FS + CY+++ P V++ F+G DV
Sbjct: 389 PLRSAFS---KAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLVFQGGACLDVDA 445
Query: 319 SPSNLFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
S ++S + A + ++ + G Q + YDI + +V F P C
Sbjct: 446 SGILYVADVSQACLAFAPNADDTDVAIVGNTQQKTHGVLYDIGKKIVGFAPGAC 499
>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 120/363 (33%), Positives = 184/363 (50%), Gaps = 42/363 (11%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCS 93
++L++ S+G P +DTGS+ W +C PC C +Q PL DP KSSTY S+ C+
Sbjct: 98 LFLVNFSMGQPATPQLAIMDTGSNILWVRCAPCKR--CTQQNGPLLDPSKSSTYASLPCT 155
Query: 94 SSQCAVV-TSNCSE-GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
++ C ++ C+ C Y+ Y G S+G LATE L F+S+ +P+V+FG
Sbjct: 156 NTMCHYAPSAYCNRLNQCGYNLSYATGLS---SAGVLATEQLIFHSSDEGVNAVPSVVFG 212
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----PDQGSSKINFG 206
C H+N D + TG+ GLG G +S +++MG+ KFSYCL P G +++ FG
Sbjct: 213 CSHEN--GDYKDRRFTGVFGLGKGITSFVTRMGS----KFSYCLGNIADPHYGYNQLVFG 266
Query: 207 GIVAGAGVVSTPL-IIRDHYYLSLEAISVGNQRLEFVSSS---TGN---IFVDTGVLRTL 259
G STPL ++ HYY++LE ISVG +RL+ S++ GN +D+G T
Sbjct: 267 EKANFEG-YSTPLKVVNGHYYVTLEGISVGEKRLDIDSTAFSMKGNEKSALIDSGTALTW 325
Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK---FPEVTIHFR-GAD 315
L S ++ + N ++ Q + GV CY + FP VT HF GAD
Sbjct: 326 LA---ESAFRA-LDNEVR-QLLDGVLMPFWRGSFACYKGTVSQDLIGFPVVTFHFSGGAD 380
Query: 316 VKLSPSNLFRNISDEIMCSAFRGGNAN-------IVYGRIMQINFLIGYDIEQAMVSFKP 368
+ L ++F + +I+C A R +A V G + Q + + YD+ + F+
Sbjct: 381 LDLDTESMFYQATPDILCIAVRQASAYGNDFKSFSVIGLMAQQYYNMAYDLNSNKLFFQR 440
Query: 369 SRC 371
C
Sbjct: 441 IDC 443
>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
Length = 418
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 119/356 (33%), Positives = 175/356 (49%), Gaps = 47/356 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YL+HL+ GTPP ++ ++DTGSD TWTQC+ CP CF Q PLFDP SS++ S+ CSS
Sbjct: 88 YLVHLAAGTPPQEVQLTLDTGSDITWTQCKRCPASACFNQTLPLFDPSASSSFASLPCSS 147
Query: 95 SQCAVVTSNCSEGD------CSYSFLYGRGAYASFSSGNLATETLTFNSTSGL--PVEMP 146
C T C G+ C+YS YG G S S G + E TF S +G +P
Sbjct: 148 PACE-TTPPCGGGNDATSRPCNYSISYGDG---SVSRGEIGREVFTFASGTGEGSSAAVP 203
Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFG 206
++FGCGH N TS+ +TGI G G G+ SL SQ+ G FS+C SK +
Sbjct: 204 GLVFGCGHANRGVFTSN--ETGIAGFGRGSLSLPSQLKV---GNFSHCFTTITGSKTS-- 256
Query: 207 GIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLPLEYHS 266
++ G V+ P A +G +R + ST ++G T LP +
Sbjct: 257 AVLLGLPGVAPP-----------SASPLGRRRGSYRCRSTPRS-SNSGTSITSLPPRTYR 304
Query: 267 NLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI---SSQPKFPEVTIHFRGADVKLSPSN- 322
++ + +K V G +P C++ +P P + +HF GA ++L N
Sbjct: 305 AVREEFAAQVKLPVVPGNATDP----FTCFSAPLRGPKPDVPTMALHFEGATMRLPQENY 360
Query: 323 LFRNISDE-------IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+F + D+ I+C A G I+ G I Q N + YD++ + +SF P++C
Sbjct: 361 VFEVVDDDDAGNSSRIICLAVIEG-GEIILGNIQQQNMHVLYDLQNSKLSFVPAQC 415
>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
Length = 473
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 125/362 (34%), Positives = 179/362 (49%), Gaps = 47/362 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + + +GTP + DTGSD TWTQCEPC + C+KQ+ P DP KS++Y +ISCSS
Sbjct: 133 YAVTVGLGTPKKEFTLIFDTGSDLTWTQCEPCAK-TCYKQKEPRLDPTKSTSYKNISCSS 191
Query: 95 SQCAVVTS----NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
+ C ++ + +CS C Y YG G+Y S G ATETLT +S++ N +F
Sbjct: 192 AFCKLLDTEGGESCSSPTCLYQVQYGDGSY---SIGFFATETLTLSSSN----VFKNFLF 244
Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFGGI 208
GCG +N + G++GLG SL SQ FSYCLP SSK ++FGG
Sbjct: 245 GCGQQNSGLFRGAA---GLLGLGRTKLSLPSQTAQKYKKLFSYCLPASSSSKGYLSFGGQ 301
Query: 209 VAGAGVVSTPL----IIRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPL 262
V+ V TPL Y L + +SVG +L +S ST +D+G + T LP
Sbjct: 302 VSKT-VKFTPLSEDFKSTPFYGLDITELSVGGNKLSIDASIFSTSGTVIDSGTVITRLPS 360
Query: 263 EYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP--KFPEVTIHFRGA---DV 316
+S L S ++ P + G+S CY+ S K P+V + F+G D+
Sbjct: 361 TAYSALSSAFQKLMTDYP-----STDGYSIFDTCYDFSKNETIKIPKVGVSFKGGVEMDI 415
Query: 317 KLS----PSNLFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPS 369
+S P N + + C AF G ++ ++G Q + + YD + V F PS
Sbjct: 416 DVSGILYPVNGLKKV-----CLAFAGNGDDVKAAIFGNTQQKTYQVVYDDAKGRVGFAPS 470
Query: 370 RC 371
C
Sbjct: 471 GC 472
>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
Length = 479
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 120/359 (33%), Positives = 173/359 (48%), Gaps = 41/359 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ GTP + +DTGSD TW QC+PC DC+ Q P+F+P++SS+Y +SC S
Sbjct: 138 YIVTAGFGTPAKNSLLIIDTGSDVTWIQCKPCS--DCYSQVDPIFEPQQSSSYKHLSCLS 195
Query: 95 SQCAVVTS--NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
S C +T+ +C G C Y YG G S S G+ + ETLT S S P+ FGC
Sbjct: 196 SACTELTTMNHCRLGGCVYEINYGDG---SRSQGDFSQETLTLGSDS-----FPSFAFGC 247
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD------QGSSKINFG 206
GH N + G++GLG S SQ + G+FSYCLPD GS + G
Sbjct: 248 GHTNTGLFKGSA---GLLGLGRTALSFPSQTKSKYGGQFSYCLPDFVSSTSTGSFSVGQG 304
Query: 207 GIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLL 260
I A A V PL+ + Y++ L ISVG +RL + G VD+G + T L
Sbjct: 305 SIPATATFV--PLVSNSNYPSFYFVGLNGISVGGERLSIPPAVLGRGGTIVDSGTVITRL 362
Query: 261 PLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFR-GADVK 317
+ + LK+ + + P A+P CY++S SQ + P +T HF+ ADV
Sbjct: 363 VPQAYDALKTSFRSKTRNLP----SAKPFSILDTCYDLSSYSQVRIPTITFHFQNNADVA 418
Query: 318 LSPSNLFRNISDE--IMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+S + I + +C AF + +I + G Q + +D + F P C
Sbjct: 419 VSAVGILFTIQSDGSQVCLAFASASQSISTNIIGNFQQQRMRVAFDTGAGRIGFAPGSC 477
>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 158 bits (400), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 121/371 (32%), Positives = 178/371 (47%), Gaps = 54/371 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YLM + IG+PP +DTGSD WTQC PC L C +Q P F+P KS++Y S+ CSS
Sbjct: 88 YLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPC--LLCVEQPTPYFEPAKSTSYASLPCSS 145
Query: 95 SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
+ C A+ + C + C Y YG A S+G LA ET TF + S V +P V FGCG
Sbjct: 146 AMCNALYSPLCFQNACVYQAFYGDSAS---SAGVLANETFTFGTNS-TRVAVPRVSFGCG 201
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSKINFGGIV- 209
+ N + + S G++G G G SL+SQ+G+ +FSYCL +S++ FG
Sbjct: 202 NMNAGTLFNGS---GMVGFGRGALSLVSQLGSP---RFSYCLTSFMSPATSRLYFGAYAT 255
Query: 210 -------AGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSS--------STGNIF 250
+ V STP I+ Y+L++ ISV L S TG +
Sbjct: 256 LNSTNTSSSGPVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTGGVI 315
Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVG-----AEPGFSDVLCYNISSQPK-- 303
+D+G T L ++ M++ V VG A P + C+ P+
Sbjct: 316 IDSGTTVTFLAQPAYA--------MVQGAFVAWVGLPRANATPSDTFDTCFKWPPPPRRM 367
Query: 304 --FPEVTIHFRGADVKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIE 360
PE+ +HF GAD++L N + + +C A + + G NF + YD+E
Sbjct: 368 VTLPEMVLHFDGADMELPLENYMVMDGGTGNLCLAMLPSDDGSIIGSFQHQNFHMLYDLE 427
Query: 361 QAMVSFKPSRC 371
+++SF P+ C
Sbjct: 428 NSLLSFVPAPC 438
>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
Length = 438
Score = 158 bits (399), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 121/371 (32%), Positives = 178/371 (47%), Gaps = 54/371 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YLM + IG+PP +DTGSD WTQC PC L C +Q P F+P KS++Y S+ CSS
Sbjct: 85 YLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPC--LLCVEQPTPYFEPAKSTSYASLPCSS 142
Query: 95 SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
+ C A+ + C + C Y YG A S+G LA ET TF + S V +P V FGCG
Sbjct: 143 AMCNALYSPLCFQNACVYQAFYGDSAS---SAGVLANETFTFGTNS-TRVAVPRVSFGCG 198
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSKINFGGIV- 209
+ N + + S G++G G G SL+SQ+G+ +FSYCL +S++ FG
Sbjct: 199 NMNAGTLFNGS---GMVGFGRGALSLVSQLGSP---RFSYCLTSFMSPATSRLYFGAYAT 252
Query: 210 -------AGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSS--------STGNIF 250
+ V STP I+ Y+L++ ISV L S TG +
Sbjct: 253 LNSTNTSSSGPVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTGGVI 312
Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVG-----AEPGFSDVLCYNISSQPK-- 303
+D+G T L ++ M++ V VG A P + C+ P+
Sbjct: 313 IDSGTTVTFLAQPAYA--------MVQGAFVAWVGLPRANATPSDTFDTCFKWPPPPRRM 364
Query: 304 --FPEVTIHFRGADVKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIE 360
PE+ +HF GAD++L N + + +C A + + G NF + YD+E
Sbjct: 365 VTLPEMVLHFDGADMELPLENYMVMDGGTGNLCLAMLPSDDGSIIGSFQHQNFHMLYDLE 424
Query: 361 QAMVSFKPSRC 371
+++SF P+ C
Sbjct: 425 NSLLSFVPAPC 435
>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
Length = 434
Score = 157 bits (398), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 124/365 (33%), Positives = 176/365 (48%), Gaps = 43/365 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YL+HL+IGTPP + ++DTGSD WTQC+PCP CF Q P FDP SST + SC S
Sbjct: 82 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPA--CFDQALPYFDPSTSSTLSLTSCDS 139
Query: 95 SQC-AVVTSNCSEGD------CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
+ C + ++C C Y++ YG S ++G L + TF G +P
Sbjct: 140 TLCQGLPVASCGSPKFWPNQTCVYTYSYGD---KSVTTGFLEVDKFTF---VGAGASVPG 193
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-------PDQGS 200
V FGCG N + S +TGI G G G SL SQ+ G FS+C P
Sbjct: 194 VAFGCGLFN--NGVFKSNETGIAGFGRGPLSLPSQLKV---GNFSHCFTAVNGLKPSTVL 248
Query: 201 SKINFGGIVAGAGVV-STPLIIR----DHYYLSLEAISVGNQRL-----EF-VSSSTGNI 249
+ +G G V STPLI YYLSL+ I+VG+ RL EF + + TG
Sbjct: 249 LDLPADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFTLKNGTGGT 308
Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTI 309
+D+G T LP + ++ + +K V G +P F L + ++P P++ +
Sbjct: 309 IIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYF--CLSAPLRAKPYVPKLVL 366
Query: 310 HFRGADVKLSPSNLFRNISD---EIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSF 366
HF GA + L N + D I+C A G G Q N + YD++ + +SF
Sbjct: 367 HFEGATMDLPRENYVFEVEDAGSSILCLAIIEGGEVTTIGNFQQQNMHVLYDLQNSKLSF 426
Query: 367 KPSRC 371
P++C
Sbjct: 427 VPAQC 431
>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
Length = 367
Score = 157 bits (398), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 129/377 (34%), Positives = 188/377 (49%), Gaps = 53/377 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y M + +G+PP VDTGSD W QC+PC + C+ Q P++DP SST+ SCS+
Sbjct: 4 YTMEIELGSPPKKFNAIVDTGSDLVWIQCKPCSQ--CYSQSDPIYDPSASSTFAKTSCST 61
Query: 95 SQCAVV-TSNCSEG--DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
S C + S CS C Y + YG +S + G+ A ETLT S+ G PN FG
Sbjct: 62 SSCQSLPASGCSSSAKTCIYGYQYGD---SSSTQGDFALETLTLRSSGGSSKAFPNFQFG 118
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD-----QGSSKINFG 206
CG N S GI+GLG G SL +Q+G++I KFSYCL D +S + FG
Sbjct: 119 CGRLNSGSF---GGAAGIVGLGQGKISLSTQLGSAINNKFSYCLVDFDDDSSKTSPLIFG 175
Query: 207 GIVA-GAGVVSTPLI----IRDHYYLSLEAISVGNQRL-------EFVS----------- 243
+ G+G +STP+I +Y++ LE ISVG ++L +F+S
Sbjct: 176 SSASTGSGAISTPIIPNSGRSTYYFVGLEGISVGGKQLSLATRAIDFLSVRSKKKLRVRA 235
Query: 244 ---SSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS 300
+S G IF D+G TLL +S +KS ++ + V + GF LCY++S
Sbjct: 236 LEVNSGGTIF-DSGTTLTLLDDAVYSKVKSAFASSVSLPTVD--ASSSGFD--LCYDVSK 290
Query: 301 QP--KFPEVTIHFRGADVKLSPSNLF--RNISDEIMCSAF--RGGNANIVYGRIMQINFL 354
KFP +T+ F+G N F + ++ + C A G + G +MQ N+
Sbjct: 291 SKNFKFPALTLAFKGTKFSPPQKNYFVIVDTAETVACLAMGGSGSLGLGIIGNLMQQNYH 350
Query: 355 IGYDIEQAMVSFKPSRC 371
+ YD + +S P++C
Sbjct: 351 VVYDRGTSTISMSPAQC 367
>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
Length = 390
Score = 157 bits (398), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 122/371 (32%), Positives = 176/371 (47%), Gaps = 52/371 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YL+HL+IGTPP + ++DTGSD WTQC+PC + CF Q P FD +SST + C S
Sbjct: 35 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCKPC--VSCFDQPLPYFDTSRSSTNALLPCES 92
Query: 95 SQCAV-------VTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
+QC + V N + C+Y YG S + G LA + TF + + L P
Sbjct: 93 TQCKLDPTVTVCVKLNQTVQTCAYYTSYGDN---SVTIGLLAADKFTFVAGTSL----PG 145
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYC-------LPDQGS 200
V FGCG N +S +TGI G G G SL SQ+ G FS+C +P
Sbjct: 146 VTFGCGLNNTG--VFNSNETGIAGFGRGPLSLPSQLK---VGNFSHCFTTITGAIPSTVL 200
Query: 201 SKINFGGIVAGAGVV-STPLIIRDH-------YYLSLEAISVGNQRLEF------VSSST 246
+ G G V +TPLI YYLSL+ I+VG+ RL +++ T
Sbjct: 201 LDLPADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALTNGT 260
Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ--PKF 304
G +D+G T LP + + ++ + IK V G C++ SQ P
Sbjct: 261 GGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNAT----GHYTCFSAPSQAKPDV 316
Query: 305 PEVTIHFRGADVKLSPSNLFRNISDE----IMCSAFRGGNANIVYGRIMQINFLIGYDIE 360
P++ +HF GA + L N + D+ I+C A G+ + G Q N + YD++
Sbjct: 317 PKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKGDETTIIGNFQQQNMHVLYDLQ 376
Query: 361 QAMVSFKPSRC 371
M+SF ++C
Sbjct: 377 NNMLSFVAAQC 387
>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 518
Score = 157 bits (398), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 120/355 (33%), Positives = 179/355 (50%), Gaps = 33/355 (9%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP DTGSD TW QC+PC + C++Q+ LFDP +SSTY ++SC++
Sbjct: 179 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVV-CYEQQEKLFDPARSSTYANVSCAA 237
Query: 95 SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
C + T CS G C Y YG G+Y S G A +TLT +S + FGCG
Sbjct: 238 PACFDLDTRGCSGGHCLYGVQYGDGSY---SIGFFAMDTLTLSSYD----AVKGFRFGCG 290
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--GSSKINFG-GIVA 210
+N + G++GLG G +SL Q G F++CLP + G+ ++FG G A
Sbjct: 291 ERNEG---LFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSSGTGYLDFGPGSPA 347
Query: 211 GAGV-VSTPLIIRD---HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEY 264
AG ++TP++ + YY+ + I VG Q L S +T VD+G + T LP
Sbjct: 348 AAGARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFATAGTIVDSGTVITRLPPPA 407
Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR-GADVKLSP 320
+S+L+S + A +G P S + CY+ + SQ P V++ F+ GA + +
Sbjct: 408 YSSLRSA---FVSAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAILDVDA 464
Query: 321 SNLFRNISDEIMCSAFR----GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
S + S +C F GG+ IV G F + YDI + +V F P C
Sbjct: 465 SGIMYAASVSQVCLGFAANEDGGDVGIV-GNTQLKTFGVAYDIGKKVVGFSPGAC 518
>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
Length = 774
Score = 157 bits (398), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 125/375 (33%), Positives = 182/375 (48%), Gaps = 50/375 (13%)
Query: 32 DDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSIS 91
D YL+HL+IGTPP + +DTGSD WTQC PCP CF + DP SST++ +
Sbjct: 412 DTEYLVHLAIGTPPQPVQLILDTGSDLVWTQCRPCPV--CFSRALGPLDPSNSSTFDVLP 469
Query: 92 CSSSQCAVVT-SNCSEGD-----CSYSFLYGRGAYASFSSGNLATETLTFNSTSGL-PVE 144
CSS C +T S+C + + C Y + Y G S ++G+L ET TF + G
Sbjct: 470 CSSPVCDNLTWSSCGKHNWGNQTCVYVYAYADG---SITTGHLDAETFTFAAADGTGQAT 526
Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK-- 202
+P++ FGCG N TS+ +TGI G G G SL SQ+ FS+C S+
Sbjct: 527 VPDLAFGCGLFNNGIFTSN--ETGIAGFGRGALSLPSQLKVD---NFSHCFTAITGSEPS 581
Query: 203 -------INFGGIVAGAGVVSTPLI-----IRDHYYLSLEAISVGNQRLEFVSSS----- 245
N GA V STPL+ +R YYLSL+ I+VG+ RL S+
Sbjct: 582 SVLLGLPANLYSDADGA-VQSTPLVQNFSSLR-AYYLSLKGITVGSTRLPIPESTFALKQ 639
Query: 246 --TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS---- 299
TG +D+G T LP + + + + ++ PV A LC++ S
Sbjct: 640 DGTGGTIIDSGTGMTTLPQDAYKLVHDAFTAQVRL-PVD--NATSSSLSRLCFSFSVPRR 696
Query: 300 SQPKFPEVTIHFRGADVKLSPSNL---FRNISDEIMCSAFRGGNANIVYGRIMQINFLIG 356
++P P++ +HF GA + L N F + + C A G+ + G Q N +
Sbjct: 697 AKPDVPKLVLHFEGATLDLPRENYMFEFEDAGGSVTCLAINAGDDLTIIGNYQQQNLHVL 756
Query: 357 YDIEQAMVSFKPSRC 371
YD+ + M+SF P++C
Sbjct: 757 YDLVRNMLSFVPAQC 771
>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
Length = 434
Score = 157 bits (398), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 124/365 (33%), Positives = 176/365 (48%), Gaps = 43/365 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YL+HL+IGTPP + ++DTGSD WTQC+PCP CF Q P FDP SST + SC S
Sbjct: 82 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPA--CFDQALPYFDPSTSSTLSLTSCDS 139
Query: 95 SQC-AVVTSNCS------EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
+ C + ++C C Y++ YG S ++G L + TF G +P
Sbjct: 140 TLCQGLPVASCGSPKFWPNQTCVYTYSYGD---KSVTTGFLEVDKFTF---VGAGASVPG 193
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-------PDQGS 200
V FGCG N + S +TGI G G G SL SQ+ G FS+C P
Sbjct: 194 VAFGCGLFN--NGVFKSNETGIAGFGRGPLSLPSQLKV---GNFSHCFTAVNGLKPSTVL 248
Query: 201 SKINFGGIVAGAGVV-STPLIIR----DHYYLSLEAISVGNQRL-----EF-VSSSTGNI 249
+ +G G V STPLI YYLSL+ I+VG+ RL EF + + TG
Sbjct: 249 LDLPADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGTGGT 308
Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTI 309
+D+G T LP + ++ + +K V G +P F L + ++P P++ +
Sbjct: 309 IIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYF--CLSAPLRAKPYVPKLVL 366
Query: 310 HFRGADVKLSPSNLFRNISD---EIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSF 366
HF GA + L N + D I+C A G G Q N + YD++ + +SF
Sbjct: 367 HFEGATMDLPRENYVFEVEDAGSSILCLAIIEGGEVTTIGNFQQQNMHVLYDLQNSKLSF 426
Query: 367 KPSRC 371
P++C
Sbjct: 427 VPAQC 431
>gi|358345197|ref|XP_003636668.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355502603|gb|AES83806.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 294
Score = 157 bits (397), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 114/276 (41%), Positives = 159/276 (57%), Gaps = 32/276 (11%)
Query: 2 QNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWT 61
+NS K F+N N T +SP+S + YLM LSIGTPPV I+ DTGSD W
Sbjct: 36 RNSSK-DFFNRN-TIQSPVSANHYD--------YLMELSIGTPPVKIYAQADTGSDLIWL 85
Query: 62 QCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA-VVTSNCS--EGDCSYSFLYGRG 118
QC PC +C+KQ P+FD + SST+++I+C S C+ + +++CS + +C Y++ Y G
Sbjct: 86 QCIPC--TNCYKQLNPMFDSQSSSTFSNIACGSESCSKLYSTSCSPDQINCKYNYSYVDG 143
Query: 119 AYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSS 178
S + G LA ETLT ST+G PV VIFGCGH N + + K+ GIIGLG G S
Sbjct: 144 ---SETQGVLAQETLTLTSTTGEPVAFKGVIFGCGHNNNGA--FNDKEMGIIGLGRGPLS 198
Query: 179 LISQMGTSIAGK-FSYCLPDQG-----SSKINF--GGIVAGAGVVSTPLI----IRDHYY 226
L+SQ+G+S+ G FS CL SS ++F G V G GVVSTPL+ + Y+
Sbjct: 199 LVSQIGSSLGGNMFSQCLVPFNTNPSISSPMSFGKGSEVLGNGVVSTPLVSKTTYQSFYF 258
Query: 227 LSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLPL 262
++L ISV + L F + S+ V+ + P+
Sbjct: 259 VTLLGISVEDINLPFNAGSSLEPAAKGNVIPQIWPV 294
>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
Length = 521
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 117/354 (33%), Positives = 178/354 (50%), Gaps = 31/354 (8%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP DTGSD TW QC+PC + C+KQ+ LFDP +SSTY ++SC++
Sbjct: 182 YVVTIGLGTPASRYTVVFDTGSDTTWVQCQPCVVV-CYKQQEKLFDPARSSTYANVSCAA 240
Query: 95 SQCA-VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
C+ + T CS G C YS YG G+Y S G A +TLT +S + FGCG
Sbjct: 241 PACSDLYTRGCSGGHCLYSVQYGDGSY---SIGFFAMDTLTLSSYDAV----KGFRFGCG 293
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--GSSKINFG-GIVA 210
+N + G++GLG G +SL Q G F++CLP + G+ ++FG G A
Sbjct: 294 ERNEG---LFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSSGTGYLDFGPGSPA 350
Query: 211 GAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEY 264
G T ++ D+ YY+ + I VG Q L S ST VD+G + T LP
Sbjct: 351 AVGARQTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFSTAGTIVDSGTVITRLPPAA 410
Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR-GADVKLSP 320
+S+L+S ++ + A +G P S + CY+ + S+ P+V++ F+ GA + ++
Sbjct: 411 YSSLRSAFASAMAA---RGYKKAPALSLLDTCYDFTGMSEVAIPKVSLLFQGGAYLDVNA 467
Query: 321 SNLFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
S + S +C F + + G F + YDI + V F P C
Sbjct: 468 SGIMYAASLSQVCLGFAANEDDDDVGIVGNTQLKTFGVVYDIGKKTVGFSPGAC 521
>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 455
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 122/379 (32%), Positives = 181/379 (47%), Gaps = 51/379 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPEL------DCFKQEPPLFDPKKSSTYN 88
Y+M LSIGTPP+ DTGSD WTQC PC + CFKQ L++P S+T+
Sbjct: 87 YIMTLSIGTPPLSYRAIADTGSDLIWTQCAPCGDTVTDTDNQCFKQSGCLYNPSSSTTFG 146
Query: 89 SISCSS--SQCAVVTSNCSEGDCS--YSFLYGRGAYASFSSGNLATETLTFNSTSGLP-V 143
+ C+S S CA + C+ Y+ YG G +++G + ET TF S+S P V
Sbjct: 147 VLPCNSPLSMCAAMAGPSPPPGCACMYNQTYGTG----WTAGVQSVETFTFGSSSTPPAV 202
Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGS 200
+PN+ FGC + AS + G++GLG G+ SL+SQ+G AG FSYCL D S
Sbjct: 203 RVPNIAFGCSN---ASSNDWNGSAGLVGLGRGSMSLVSQLG---AGAFSYCLTPFQDANS 256
Query: 201 SKINFGGIVAGAG------VVSTPLI-------IRDHYYLSLEAISVGNQRL-------E 240
+ G A A V STP + + +YYL+L ISVG L
Sbjct: 257 TSTLLLGPSAAAALKGTGPVRSTPFVAGPSKAPMSTYYYLNLTGISVGETALAIPPDAFS 316
Query: 241 FVSSSTGNIFVDTG-VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS 299
+ TG + +D+G + TL+ Y +V S ++ P+ G + LC+ +
Sbjct: 317 LRADGTGGLIIDSGTTITTLVDSAYQQVRAAVRSLLVTRLPLAH-GPDHSTGLDLCFALK 375
Query: 300 SQ---PKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGN--ANIVYGRIMQINFL 354
+ P P +T+HF G + P + + + C A R A + G Q N
Sbjct: 376 ASTPPPAMPSMTLHFEGGADMVLPVENYMILGSGVWCLAMRNQTVGAMSMVGNYQQQNIH 435
Query: 355 IGYDIEQAMVSFKPSRCTN 373
+ YD+ + +SF P+ C++
Sbjct: 436 VLYDVRKETLSFAPAVCSS 454
>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
Length = 471
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 122/359 (33%), Positives = 175/359 (48%), Gaps = 41/359 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + + +GTP D DTGSD TWTQCEPC CF Q FDP KS++Y ++SCSS
Sbjct: 132 YAVTVGLGTPKKDFSLLFDTGSDLTWTQCEPCSG-GCFPQNDEKFDPTKSTSYKNLSCSS 190
Query: 95 SQCAVVTSNCSEG-----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
C + ++G C Y YG G ++ G LATETLT + N +
Sbjct: 191 EPCKSIGKESAQGCSSSNSCLYGVKYGTG----YTVGFLATETLTITPSD----VFENFV 242
Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSS--KINFGG 207
GCG +N S G++GLG +L SQ ++ FSYCLP SS ++FGG
Sbjct: 243 IGCGERNGG---RFSGTAGLLGLGRSPVALPSQTSSTYKNLFSYCLPASSSSTGHLSFGG 299
Query: 208 IVAGAGVVSTPLI--IRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLE 263
V+ A TP+ I + Y L + ISVG ++L S T +D+G T LP
Sbjct: 300 GVSQAAKF-TPITSKIPELYGLDVSGISVGGRKLPIDPSVFRTAGTIIDSGTTLTYLPST 358
Query: 264 YHSNLKSVMSNMIKAQPV-KGV-GAEPGFSDVLCYNISSQPK----FPEVTIHFRGA-DV 316
HS L S M+ + KG G +P CY+ S P+++I F G +V
Sbjct: 359 AHSALSSAFQEMMTNYTLTKGTSGLQP------CYDFSKHANDNITIPQISIFFEGGVEV 412
Query: 317 KLSPSNLFRNISD-EIMCSAFR--GGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ S +F + E +C AF+ G + ++ ++G + Q + + YD+ + MV F P C
Sbjct: 413 DIDDSGIFIAANGLEEVCLAFKDNGNDTDVAIFGNVQQKTYEVVYDVAKGMVGFAPGGC 471
>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 429
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 110/366 (30%), Positives = 173/366 (47%), Gaps = 34/366 (9%)
Query: 23 IYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPK 82
+++ + S + YL+ +S G PP VDTGSD W QC PC C++ FDP
Sbjct: 78 LFETPVASGNGEYLIDISYGNPPQKSTAIVDTGSDLNWVQCLPCKS--CYETLSAKFDPS 135
Query: 83 KSSTYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
KS++Y ++ C S+ C + C Y ++YG G S +SG L+T+ +T +
Sbjct: 136 KSASYKTLGCGSNFCQDLPFQSCAASCQYDYMYGDG---SSTSGALSTDDVTIGTG---- 188
Query: 143 VEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK 202
++PNV FGCG+ NL + G+ SL+SQ+G + KFSYCL GS+K
Sbjct: 189 -KIPNVAFGCGNSNLGTFAGAGGLVGLGKG---PLSLVSQLGGTATKKFSYCLVPLGSTK 244
Query: 203 ---INFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEF-------VSSSTGN 248
+ G GV TP++ ++ YY L+ ISV + + + ++ G
Sbjct: 245 TSPLYIGDSTLAGGVAYTPMLTNNNYPTFYYAELQGISVEGKAVNYPANTFDIAATGRGG 304
Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKG--VGAEPGFSDVLCYNISSQPKFPE 306
+ +D+G T L ++ + + + + + G G E FS N P +P
Sbjct: 305 LILDSGTTLTYLDVDAFNPMVAALKAALPYPEADGSFYGLEYCFSTAGVAN----PTYPT 360
Query: 307 VTIHFRGADVKLSPSNLFRNISDE-IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVS 365
V HF GADV L+P N F + E C A ++G I Q+N +I +D+ +
Sbjct: 361 VVFHFNGADVALAPDNTFIALDFEGTTCLAMASSTGFSIFGNIQQLNHVIVHDLVNKRIG 420
Query: 366 FKPSRC 371
FK + C
Sbjct: 421 FKSANC 426
>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
Length = 475
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 120/354 (33%), Positives = 183/354 (51%), Gaps = 37/354 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ +S+GTP V VDTGSD +W QC+PCP C+ Q PLFDP +SS+Y+++ C++
Sbjct: 142 YVVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSSYSAVPCAA 201
Query: 95 SQC---AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
+ C A+ ++ CS G C Y YG G S ++G +++TLT ++ L +FG
Sbjct: 202 ASCSQLALYSNGCSGGQCGYVVSYGDG---STTTGVYSSDTLTLTGSNAL----KGFLFG 254
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSS--KINFGGIV 209
CGH A + G++GLG SL+SQ ++ G FSYCLP +S I+ GG
Sbjct: 255 CGH---AQQGLFAGVDGLLGLGRQGQSLVSQASSTYGGVFSYCLPPTQNSVGYISLGGPS 311
Query: 210 AGAGVVSTPLIIRD----HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLE 263
+ AG +TPL+ +Y + L ISVG Q L +S ++G + VDTG + T LP
Sbjct: 312 STAGFSTTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVFASGAV-VDTGTVVTRLPPT 370
Query: 264 YHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHF-RGADVKLSP 320
+S L+S + A G D CY+ + P ++I F GA + L
Sbjct: 371 AYSALRSAFRAAMAPYGYPSAPAT-GILDT-CYDFTRYGTVTLPTISIAFGGGAAMDLGT 428
Query: 321 SNLFRNISDEIMCSAFR--GGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
S + + C AF GG++ + G + Q +F + +D + V F P+ C
Sbjct: 429 SGILTS-----GCLAFAPTGGDSQASILGNVQQRSFEVRFD--GSTVGFMPASC 475
>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 120/361 (33%), Positives = 175/361 (48%), Gaps = 37/361 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
+L+++SIG+PPV +DT SD W QC PC ++C+ Q P+FDP +S T+ + SC +
Sbjct: 85 FLVNISIGSPPVTQLLHMDTASDLLWLQCRPC--INCYAQSLPIFDPSRSYTHRNESCRT 142
Query: 95 SQCAV--VTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNST--SGLPVEMPNVIF 150
SQ ++ + N C YS Y G + S G LA E L FN+ + +V+F
Sbjct: 143 SQYSMPSLRFNAKTRSCEYSMRYMDG---TGSKGILAKEMLMFNTIYDESSSAALHDVVF 199
Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVA 210
GCGH N P TGI+GLG G SL+ + GT KFSYC +V
Sbjct: 200 GCGHDNYGEPLVG---TGILGLGYGEFSLVHRFGT----KFSYCFGSLDDPSYPHNVLVL 252
Query: 211 ---GAGVV--STPL-IIRDHYYLSLEAISVGNQRLEF--------VSSSTGNIFVDTGVL 256
GA ++ +TPL I YY+++EAISV L + G +DTG
Sbjct: 253 GDDGANILGDTTPLEIYNGFYYVTIEAISVDGIILPIDPWVFNRNHQTGLGGTIIDTGNS 312
Query: 257 RTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK-----FPEVTIHF 311
T L E + LK+ + + + + + V CYN + + FP VT HF
Sbjct: 313 LTSLVEEAYKPLKNKIEDYFEGRFTAADVNQDDMFKVECYNGNLERDLVESGFPIVTFHF 372
Query: 312 R-GADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSR 370
GA++ L ++F +S + C A GN N + G Q ++ IGYD+E +SF+
Sbjct: 373 SDGAELSLDVKSVFMKLSPNVFCLAVTPGNMNSI-GATAQQSYNIGYDLEAKKISFERID 431
Query: 371 C 371
C
Sbjct: 432 C 432
>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 469
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 121/365 (33%), Positives = 172/365 (47%), Gaps = 50/365 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +GTPP ++ +DTGSD W QC PC C+ Q P+FDP+KS ++ SI+C S
Sbjct: 126 YFTRIGVGTPPRYVYMVLDTGSDIVWIQCAPCKR--CYAQSDPVFDPRKSRSFASIACRS 183
Query: 95 SQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
C + S N + C Y YG G SF+ G+ +TETLTF T + V G
Sbjct: 184 PLCHRLDSPGCNTQKQTCMYQVSYGDG---SFTFGDFSTETLTFRRT-----RVARVALG 235
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS----SKINFGG 207
CGH N + G+ S SQ G KFSYCL D+ + S + FG
Sbjct: 236 CGHDNEGLFVGAAGLLGLGRG---RLSFPSQTGRRFNHKFSYCLVDRSASSKPSSMVFGD 292
Query: 208 IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS------TGN--IFVDTG- 254
TPL+ + YY+ L ISVG R+ +++S TGN + +D+G
Sbjct: 293 SAVSRTARFTPLVSNPKLDTFYYVELLGISVGGTRVPGITASLFKLDQTGNGGVIIDSGT 352
Query: 255 -VLRTLLP--LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVT 308
V R P + + ++ SN+ +A P FS C+++S ++ K P V
Sbjct: 353 SVTRLTRPAYIAFRDAFRAGASNLKRA---------PQFSLFDTCFDLSGKTEVKVPTVV 403
Query: 309 IHFRGADVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSF 366
+HFRGADV L SN + + C AF G + + G I Q F + YD+ + V F
Sbjct: 404 LHFRGADVSLPASNYLIPVDTSGNFCLAFAGTMGGLSIIGNIQQQGFRVVYDLAGSRVGF 463
Query: 367 KPSRC 371
P C
Sbjct: 464 APHGC 468
>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
Length = 464
Score = 155 bits (393), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 118/352 (33%), Positives = 183/352 (51%), Gaps = 33/352 (9%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ +S+GTP V VDTGSD +W QC+PCP C+ Q PLFDP +SS+Y+++ C++
Sbjct: 131 YVVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSSYSAVPCAA 190
Query: 95 SQC---AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
+ C A+ ++ CS G C Y YG G S ++G +++TLT ++ L +FG
Sbjct: 191 ASCSQLALYSNGCSGGQCGYVVSYGDG---STTTGVYSSDTLTLTGSNAL----KGFLFG 243
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSS--KINFGGIV 209
CGH A + G++GLG SL+SQ ++ G FSYCLP +S I+ GG
Sbjct: 244 CGH---AQQGLFAGVDGLLGLGRQGQSLVSQASSTYGGVFSYCLPPTQNSVGYISLGGPS 300
Query: 210 AGAGVVSTPLIIRD----HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLE 263
+ AG +TPL+ +Y + L ISVG Q L +S ++G + VDTG + T LP
Sbjct: 301 STAGFSTTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVFASGAV-VDTGTVVTRLPPT 359
Query: 264 YHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHF-RGADVKLSP 320
+S L+S + A G D CY+ + P ++I F GA + L
Sbjct: 360 AYSALRSAFRAAMAPYGYPSAPAT-GILDT-CYDFTRYGTVTLPTISIAFGGGAAMDLGT 417
Query: 321 SNLFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
S + ++ + A GG++ + G + Q +F + +D + V F P+ C
Sbjct: 418 SGI---LTSGCLAFAPTGGDSQASILGNVQQRSFEVRFD--GSTVGFMPASC 464
>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
gi|194705620|gb|ACF86894.1| unknown [Zea mays]
gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
Length = 477
Score = 155 bits (392), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 120/362 (33%), Positives = 178/362 (49%), Gaps = 44/362 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y L +GTP D+ +DTGSD +W QC+PCP DC++Q LFDP KSSTY+ I+CSS
Sbjct: 134 YFTSLRLGTPATDLLVELDTGSDQSWIQCKPCP--DCYEQHEALFDPSKSSTYSDITCSS 191
Query: 95 SQCAVVTS----NC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
+C + S NC S+ C Y Y A S++ GNLA +TLT + T +P +
Sbjct: 192 RECQELGSSHKHNCSSDKKCPYEITY---ADDSYTVGNLARDTLTLSPTDA----VPGFV 244
Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFGG 207
FGCGH N S + G++GLG G +SL SQ+ FSYCLP S+ ++F G
Sbjct: 245 FGCGHNNAG---SFGEIDGLLGLGRGKASLSSQVAARYGAGFSYCLPSSPSATGYLSFSG 301
Query: 208 IVAGAGVVS--TPLIIRDH---YYLSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTL 259
A A + T ++ H YYL+L I+V + ++ V ++ +D+G +
Sbjct: 302 AAAAAPTNAQFTEMVAGQHPSFYYLNLTGITVAGRAIKVPPSVFATAAGTIIDSGTAFSC 361
Query: 260 LPLEYHSNLKSVMSNMI---KAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFR-G 313
LP ++ L+S + + + K P + F CY+++ + P V + F G
Sbjct: 362 LPPSAYAALRSSVRSAMGRYKRAPSSTI-----FD--TCYDLTGHETVRIPSVALVFADG 414
Query: 314 ADVKLSPSNLFRNISD-EIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPS 369
A V L PS + S+ C AF + V G Q + YD++ V F +
Sbjct: 415 ATVHLHPSGVLYTWSNVSQTCLAFLPNPDDTSLGVLGNTQQRTLAVIYDVDNQKVGFGAN 474
Query: 370 RC 371
C
Sbjct: 475 GC 476
>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 447
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 115/366 (31%), Positives = 170/366 (46%), Gaps = 42/366 (11%)
Query: 35 YLMHLSIGTP-PVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCS 93
YL+H IGTP P + VDTGSD WTQC PC DCF Q P FD S T + + C+
Sbjct: 92 YLIHFGIGTPRPQQVALEVDTGSDVVWTQCRPC--FDCFTQPLPRFDTSASDTVHGVLCT 149
Query: 94 SSQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
C A+ C G C+Y YG S + G LA ++ TF+ G V +P+++FGC
Sbjct: 150 DPICRALRPHACFLGGCTYQVNYGDN---SVTIGQLAKDSFTFDGKGGGKVTVPDLVFGC 206
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSKINFGGIV 209
G N + S +TGI G G G SL Q+G S FSYC + S+ + GG
Sbjct: 207 GQYNTGN--FHSNETGIAGFGRGPLSLPRQLGVS---SFSYCFTTIFESKSTPVFLGGAP 261
Query: 210 AG-------AGVVSTPLIIR--DHYYLSLEAISVGNQRLEFVSSS-------TGNIFVDT 253
A ++STP + ++YYLSL+ I+VG RL S+ +G +D+
Sbjct: 262 ADGLRAHATGPILSTPFLPNHPEYYYLSLKGITVGKTRLAVPESAFVVKADGSGGTIIDS 321
Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVG-AEPGFSDVLCYNISSQPK-----FPEV 307
G T P + +S+ + P+ + G + C++ S P P++
Sbjct: 322 GTAITAFP---RAVFRSLWEAFVAQVPLPHTSYNDTGEPTLQCFSTESVPDASKVPVPKM 378
Query: 308 TIHFRGADVKLSPSNLFRNI--SDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVS 365
T+H GAD +L N SD++ G + + G Q N I +D+ +
Sbjct: 379 TLHLEGADWELPRENYMAEYPDSDQLCVVVLAGDDDRTMIGNFQQQNMHIVHDLAGNKLV 438
Query: 366 FKPSRC 371
+P++C
Sbjct: 439 IEPAQC 444
>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 453
Score = 155 bits (391), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 122/361 (33%), Positives = 170/361 (47%), Gaps = 40/361 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y L +GTPP ++ +DTGSD W QC PC + C+ Q P+F+P KS ++ I CSS
Sbjct: 110 YFTRLGVGTPPRYLYMVLDTGSDVVWLQCSPCRK--CYSQSDPIFNPYKSKSFAGIPCSS 167
Query: 95 SQCAVV-TSNCS--EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
C + +S CS C Y YG G SF++G+ ATETLTF ++ V G
Sbjct: 168 PLCRRLDSSGCSTRRHTCLYQVSYGDG---SFTTGDFATETLTFRGN-----KIAKVALG 219
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS----SKINFGG 207
CGH N + G+ S SQ G KFSYCL D+ + S + FG
Sbjct: 220 CGHHNEGLFVGAAGLLGLGRG---RLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGD 276
Query: 208 IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS--------TGNIFVDTGV 255
TPLI + YY+ L ISVG R+ VS S G + +D+G
Sbjct: 277 AAISRLARFTPLIRNPKLDTFYYVGLIGISVGGVRVRGVSPSLFKLDSAGNGGVIIDSGT 336
Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFRG 313
T L ++ L+ + A+ +K G E D CY++S Q K P V +HFRG
Sbjct: 337 SVTRLTRPAYTALRDAFR--VGARHLK-RGPEFSLFDT-CYDLSGQSSVKVPTVVLHFRG 392
Query: 314 ADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
AD+ L +N + + C AF G + + + G I Q F + YD+ + + F P C
Sbjct: 393 ADMALPATNYLIPVDENGSFCFAFAGTISGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC 452
Query: 372 T 372
T
Sbjct: 453 T 453
>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
Length = 474
Score = 154 bits (390), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 120/378 (31%), Positives = 180/378 (47%), Gaps = 52/378 (13%)
Query: 32 DDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSIS 91
D YL+H++IGTPP + +DTGSD TWTQC PC + CF+Q P F+P +S T++ +
Sbjct: 108 DTEYLVHMAIGTPPQPVQLILDTGSDLTWTQCAPC--VSCFRQSLPRFNPSRSMTFSVLP 165
Query: 92 CSSSQCAVVT-SNCSE-----GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGL--PV 143
C C +T S+C E G C Y++ Y A S ++G+L ++T +F S
Sbjct: 166 CDLRICRDLTWSSCGEQSWGNGICVYAYAY---ADHSITTGHLDSDTFSFASADHAIGGA 222
Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKI 203
+P++ FGCG N + S +TGI G G S+ +Q+ FSYC S+
Sbjct: 223 SVPDLTFGCGLFN--NGIFVSNETGIAGFSRGALSMPAQLKVD---NFSYCFTAITGSEP 277
Query: 204 N--FGGIV---------AGAGVVSTPLIIRDH------YYLSLEAISVGNQRLEFVSS-- 244
+ F G+ G GVV + +IR H YY+SL+ ++VG RL S
Sbjct: 278 SPVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVF 337
Query: 245 -----STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI- 298
TG VD+G T+LP + +++ + AQ V LC+++
Sbjct: 338 ALKEDGTGGTIVDSGTGMTMLP----EAVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVP 393
Query: 299 -SSQPKFPEVTIHFRGADVKLSPSNLFRNISD----EIMCSAFRGGNANIVYGRIMQINF 353
++P P + +HF GA + L N I + + C A G V G Q N
Sbjct: 394 PGAKPDVPALVLHFEGATLDLPRENYMFEIEEAGGIRLTCLAINAGEDLSVIGNFQQQNM 453
Query: 354 LIGYDIEQAMVSFKPSRC 371
+ YD+ M+SF P+RC
Sbjct: 454 HVLYDLANDMLSFVPARC 471
>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 443
Score = 154 bits (390), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 119/375 (31%), Positives = 177/375 (47%), Gaps = 47/375 (12%)
Query: 28 IISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTY 87
+++ + YLM + IGTPP +DTGSD WTQC PC + C Q P FDP +S +Y
Sbjct: 82 VLASEGEYLMSMGIGTPPRYYSAILDTGSDLIWTQCAPC--MLCVDQPTPFFDPAQSPSY 139
Query: 88 NSISCSSSQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
+ C+S C A+ C C Y + YG A ++G L+ ET TF T+ V +P
Sbjct: 140 AKLPCNSPMCNALYYPLCYRNVCVYQYFYGDSAN---TAGVLSNETFTFG-TNDTRVTVP 195
Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKI 203
+ FGCG+ N S + S G++G G G SL+SQ+G+ +FSYCL S S++
Sbjct: 196 RIAFGCGNLNAGSLFNGS---GMVGFGRGPLSLVSQLGSP---RFSYCLTSFMSPVPSRL 249
Query: 204 NFGGIV--------AGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSS------- 244
FG G V STP I+ YYL++ ISVG + L S
Sbjct: 250 YFGAYATLNSTSASTGEPVQSTPFIVNPGLPTMYYLNMTGISVGGELLPIDPSVFAINDA 309
Query: 245 -STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQ 301
TG + +D+G T L + + ++ + P+ + +DVL C+
Sbjct: 310 DGTGGVIIDSGSTITYLARAAYDMVHQAFADQV-GLPLTNATS---LADVLDTCFVWPPP 365
Query: 302 PK----FPEVTIHFRGADVKLSPSNLFRNISDE-IMCSAFRGGNANIVYGRIMQINFLIG 356
P+ PE+ HF GA+++L N D +C A + + G NF +
Sbjct: 366 PRKIVTMPELAFHFEGANMELPLENYMLIDGDTGNLCLAIAASDDGSIIGSFQHQNFHVL 425
Query: 357 YDIEQAMVSFKPSRC 371
YD E +++SF P+ C
Sbjct: 426 YDNENSLLSFTPATC 440
>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
Length = 448
Score = 154 bits (390), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 120/378 (31%), Positives = 180/378 (47%), Gaps = 52/378 (13%)
Query: 32 DDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSIS 91
D YL+H++IGTPP + +DTGSD TWTQC PC + CF+Q P F+P +S T++ +
Sbjct: 82 DTEYLVHMAIGTPPQPVQLILDTGSDLTWTQCAPC--VSCFRQSLPRFNPSRSMTFSVLP 139
Query: 92 CSSSQCAVVT-SNCSE-----GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGL--PV 143
C C +T S+C E G C Y++ Y A S ++G+L ++T +F S
Sbjct: 140 CDLRICRDLTWSSCGEQSWGNGICVYAYAY---ADHSITTGHLDSDTFSFASADHAIGGA 196
Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKI 203
+P++ FGCG N + S +TGI G G S+ +Q+ FSYC S+
Sbjct: 197 SVPDLTFGCGLFN--NGIFVSNETGIAGFSRGALSMPAQLKVD---NFSYCFTAITGSEP 251
Query: 204 N--FGGIV---------AGAGVVSTPLIIRDH------YYLSLEAISVGNQRLEFVSS-- 244
+ F G+ G GVV + +IR H YY+SL+ ++VG RL S
Sbjct: 252 SPVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVF 311
Query: 245 -----STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI- 298
TG VD+G T+LP + +++ + AQ V LC+++
Sbjct: 312 ALKEDGTGGTIVDSGTGMTMLP----EAVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVP 367
Query: 299 -SSQPKFPEVTIHFRGADVKLSPSNLFRNISD----EIMCSAFRGGNANIVYGRIMQINF 353
++P P + +HF GA + L N I + + C A G V G Q N
Sbjct: 368 PGAKPDVPALVLHFEGATLDLPRENYMFEIEEAGGIRLTCLAINAGEDLSVIGNFQQQNM 427
Query: 354 LIGYDIEQAMVSFKPSRC 371
+ YD+ M+SF P+RC
Sbjct: 428 HVLYDLANDMLSFVPARC 445
>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
Length = 487
Score = 154 bits (389), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 118/373 (31%), Positives = 185/373 (49%), Gaps = 42/373 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + IGTPP + DTGSD TW QC PCP+ C+ Q+ PLFDP KSSTY + CS+
Sbjct: 122 YVVTIGIGTPPRNFTVLFDTGSDLTWVQCLPCPDSSCYPQQEPLFDPSKSSTYVDVPCSA 181
Query: 95 SQC---AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
+C V + C C YS YG S + G+LA ET T + S L V+FG
Sbjct: 182 PECHIGGVQQTRCGATSCEYSVKYGD---ESETHGSLAEETFTLSPPSPLAPAATGVVFG 238
Query: 152 CGHKNLASPTSDSKQ--TGIIGLGPGNSSLISQMGTSI---AGKFSYCLPDQGSSK--IN 204
C H+ + S +D+ G++GLG G+SS++SQ SI G FSYCLP +GSS +
Sbjct: 239 CSHEYI-SVFNDTGMGVAGLLGLGRGDSSILSQTRRSINSGGGVFSYCLPPRGSSTGYLT 297
Query: 205 FGGIVAG-----AGVVSTPLI-----IRDHYYLSLEAISVGNQRLEFVSS--STGNIFVD 252
GG A + + TPLI +R Y ++L +SV ++ +S S G + +D
Sbjct: 298 IGGGAAAPQQQYSNLSFTPLITTISQLRSAYVVNLAGVSVNGAAVDIPASAFSLGAV-ID 356
Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIH 310
+G + T +P + L+ + + + G+ CY+++ Q P V +
Sbjct: 357 SGTVVTHMPAAAYYPLRDEFRLHMGSYKMLPEGSMKLLDT--CYDVTGQDVVTAPRVALE 414
Query: 311 F-RGADVKLSPSNLFRNISDE--------IMCSAFRGGNAN--IVYGRIMQINFLIGYDI 359
F GA + + S + + E + C AF N+ ++ G + Q + + +D+
Sbjct: 415 FGGGARIDVDASGILLVLPAEDGSGQSLTLACLAFLPTNSAGLVIVGNMQQRAYNVVFDV 474
Query: 360 EQAMVSFKPSRCT 372
+ + F P+ C+
Sbjct: 475 DGGRIGFGPNGCS 487
>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
Length = 474
Score = 154 bits (389), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 120/378 (31%), Positives = 180/378 (47%), Gaps = 52/378 (13%)
Query: 32 DDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSIS 91
D YL+H++IGTPP + +DTGSD TWTQC PC + CF+Q P F+P +S T++ +
Sbjct: 108 DTEYLVHMAIGTPPQPVQLILDTGSDLTWTQCAPC--VSCFRQSLPRFNPSRSMTFSVLP 165
Query: 92 CSSSQCAVVT-SNCSE-----GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGL--PV 143
C C +T S+C E G C Y++ Y A S ++G+L ++T +F S
Sbjct: 166 CDLRICRDLTWSSCGEQSWGNGICVYAYAY---ADHSITTGHLDSDTFSFASADHAIGGA 222
Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKI 203
+P++ FGCG N + S +TGI G G S+ +Q+ FSYC S+
Sbjct: 223 SVPDLTFGCGLFN--NGIFVSNETGIAGFSRGALSMPAQLKVD---NFSYCFTAITGSEP 277
Query: 204 N--FGGIV---------AGAGVVSTPLIIRDH------YYLSLEAISVGNQRLEFVSS-- 244
+ F G+ G GVV + +IR H YY+SL+ ++VG RL S
Sbjct: 278 SPVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVF 337
Query: 245 -----STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI- 298
TG VD+G T+LP + +++ + AQ V LC+++
Sbjct: 338 ALKEDGTGGTIVDSGTGMTMLP----EAVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVP 393
Query: 299 -SSQPKFPEVTIHFRGADVKLSPSNLFRNISD----EIMCSAFRGGNANIVYGRIMQINF 353
++P P + +HF GA + L N I + + C A G V G Q N
Sbjct: 394 PGAKPDVPALVLHFEGATLDLPRENYMFEIEEAGGIRLTCLAINAGEDLSVIGNFQQQNM 453
Query: 354 LIGYDIEQAMVSFKPSRC 371
+ YD+ M+SF P+RC
Sbjct: 454 HVLYDLANDMLSFVPARC 471
>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 479
Score = 154 bits (389), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 117/364 (32%), Positives = 176/364 (48%), Gaps = 44/364 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + + +G+PP + + VD+GSD W QC PC E C++Q PLFDP S+++ ++ C S
Sbjct: 133 YFVRVGVGSPPTEQYLVVDSGSDVIWIQCRPCAE--CYQQADPLFDPAASASFTAVPCDS 190
Query: 95 SQCAVV---TSNCSE-GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
C + +S C++ G C Y YG G+Y + G LA ETLTF ++ PV+ V
Sbjct: 191 GVCRTLPGGSSGCADSGACRYQVSYGDGSY---TQGVLAMETLTFGDST--PVQ--GVAI 243
Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVA 210
GCGH+N G++GLG G SL+ Q+G + G FSYCL +G+ G +V
Sbjct: 244 GCGHRNRGLFVG---AAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGADA-GAGSLVF 299
Query: 211 G------AGVVSTPLIIRDH----YYLSLEAISVGNQRL-------EFVSSSTGNIFVDT 253
G G V PL+ YY+ L + VG +RL + G + +DT
Sbjct: 300 GRDDAMPVGAVWVPLLRNAQQPSFYYVGLTGLGVGGERLPLQDGLFDLTEDGGGGVVMDT 359
Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIH 310
G T LP + ++ L+ ++ I + PG S + CY++S + + P V ++
Sbjct: 360 GTAVTRLPPDAYAALRDAFASTIGGDLPR----APGVSLLDTCYDLSGYASVRVPTVALY 415
Query: 311 F--RGADVKLSPSNLFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFK 367
F GA + L NL + + C AF + + + G I Q I D V F
Sbjct: 416 FGRDGAALTLPARNLLVEMGGGVYCLAFAASASGLSILGNIQQQGIQITVDSANGYVGFG 475
Query: 368 PSRC 371
PS C
Sbjct: 476 PSTC 479
>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
Length = 452
Score = 154 bits (388), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 119/369 (32%), Positives = 171/369 (46%), Gaps = 43/369 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y M LS+GTPP+ +DTGSD TWTQC PC CF Q PL+DP +SST++ + C+S
Sbjct: 96 YHMILSVGTPPLAFPAIIDTGSDLTWTQCAPC-TTACFAQPTPLYDPARSSTFSKLPCAS 154
Query: 95 SQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV---EMPNV 148
C + S C+ C Y + Y G F++G LA +TL G V
Sbjct: 155 PLCQALPSAFRACNATGCVYDYRYAVG----FTAGYLAADTLAIGDGDGDGDASSSFAGV 210
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL---PDQGSSKINF 205
FGC N S GI+GLG SL+SQ+G G+FSYCL D G+S I F
Sbjct: 211 AFGCSTANGGDMDGAS---GIVGLGRSALSLLSQIGV---GRFSYCLRSDADAGASPILF 264
Query: 206 GGI--VAGAGVVST-----PLIIRD---HYYLSLEAISVGNQRLE-------FVSSSTGN 248
G + V G V ST P+ R +YY++L I+VG+ L F ++ G
Sbjct: 265 GALANVTGDKVQSTALLRNPVAARRRAPYYYVNLTGIAVGSTDLPVTSSTFGFTAAGAGG 324
Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI-SSQPKFPEV 307
+ VD+G T L ++ L+ + + GA+ F LC+ ++ P +
Sbjct: 325 VIVDSGTTFTYLAEAGYTMLRQAFLSQTAGLLTRVSGAQFDFD--LCFEAGAADTPVPRL 382
Query: 308 TIHFRGADVKLSPSNLFRNISDE---IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMV 364
F G P + + DE + C V G +MQ++ + YD++ A
Sbjct: 383 VFRFAGGAEYAVPRQSYFDAVDEGGRVACLLVLPTRGVSVIGNVMQMDLHVLYDLDGATF 442
Query: 365 SFKPSRCTN 373
SF P+ C +
Sbjct: 443 SFAPADCAS 451
>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
Length = 470
Score = 154 bits (388), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 114/355 (32%), Positives = 171/355 (48%), Gaps = 35/355 (9%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+ L +GTP VDTGS TW QC PC + C +Q PL+DP+ SSTY ++ CS+
Sbjct: 134 YVTELGLGTPATSYAMVVDTGSSLTWLQCSPC-VVSCHRQVGPLYDPRASSTYATVPCSA 192
Query: 95 SQC------AVVTSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
SQC + S CS + C Y YG +SFS G L+ +T++F S S PN
Sbjct: 193 SQCDELQAATLNPSACSVRNVCIYQASYGD---SSFSVGYLSRDTVSFGSGS-----YPN 244
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGG 207
+GCG N + G+IGL SL+ Q+ S+ FSYCLP S+ G
Sbjct: 245 FYYGCGQDNEG---LFGRSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPTPASTGYLSIG 301
Query: 208 IVAGAGVVSTPL----IIRDHYYLSLEAISVGNQRLEFVSSSTGNI--FVDTGVLRTLLP 261
TP+ + Y+++L +SVG L + ++ +D+G + T LP
Sbjct: 302 PYTSGHYSYTPMASSSLDASLYFVTLSGMSVGGSPLAVSPAEYSSLPTIIDSGTVITRLP 361
Query: 262 LEYHSNL-KSVMSNMIKAQPVKGVGAEPGFS--DVLCYNISSQPKFPEVTIHFR-GADVK 317
++ L K+V + M+ GV + P FS D +SQ + P V + F GA +K
Sbjct: 362 TAVYTALSKAVAAAMV------GVQSAPAFSILDTCFQGQASQLRVPAVAMAFAGGATLK 415
Query: 318 LSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
L+ N+ ++ D C AF ++ + G Q F + YD+ Q+ + F C+
Sbjct: 416 LATQNVLIDVDDSTTCLAFAPTDSTTIIGNTQQQTFSVVYDVAQSRIGFAAGGCS 470
>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 519
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 116/355 (32%), Positives = 177/355 (49%), Gaps = 33/355 (9%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP DTGSD TW QC+PC + C++Q LFDP +SSTY ++SC++
Sbjct: 180 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVV-CYEQREKLFDPARSSTYANVSCAA 238
Query: 95 SQCAVVT-SNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
C+ + CS G C Y YG G+Y S G A +TLT +S + FGCG
Sbjct: 239 PACSDLNIHGCSGGHCLYGVQYGDGSY---SIGFFAMDTLTLSSYDAV----KGFRFGCG 291
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--GSSKINFGG--IV 209
+N + G++GLG G +SL Q G F++CLP + G+ ++FG +
Sbjct: 292 ERNEG---LFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTGTGYLDFGAGSLA 348
Query: 210 AGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEY 264
A ++TP++ + YY+ + I VG Q L S +T VD+G + T LP
Sbjct: 349 AARARLTTPMLTENGPTFYYVGMTGIRVGGQLLSIPQSVFATAGTIVDSGTVITRLPPAA 408
Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR-GADVKLSP 320
+S+L+ + + A +G P S + CY+ + SQ P V++ F+ GA + +
Sbjct: 409 YSSLRYAFAAAMAA---RGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDA 465
Query: 321 SNLFRNISDEIMCSAFR----GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
S + S +C AF GG+ IV G F + YDI + +V F P C
Sbjct: 466 SGIMYAASASQVCLAFAANEDGGDVGIV-GNTQLKTFGVAYDIGKKVVGFYPGAC 519
>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
Length = 500
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 113/361 (31%), Positives = 176/361 (48%), Gaps = 46/361 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +GTP D++ +DTGSD W QCEPC DC++Q P+F+P SSTY S++CS+
Sbjct: 162 YFSRIGVGTPAKDMYLVLDTGSDVNWIQCEPCA--DCYQQSDPVFNPTSSSTYKSLTCSA 219
Query: 95 SQCAVV-TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
QC+++ TS C C Y YG G SF+ G LAT+T+TF ++ ++ NV GCG
Sbjct: 220 PQCSLLETSACRSNKCLYQVSYGDG---SFTVGELATDTVTFGNSG----KINNVALGCG 272
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---INFGGIVA 210
H N + TG GL ++S A FSYCL D+ S K ++F +
Sbjct: 273 HDN------EGLFTGAAGLLGLGGGVLSITNQMKATSFSYCLVDRDSGKSSSLDFNSVQL 326
Query: 211 GAGVVSTPLI----IRDHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTL 259
G G + PL+ I YY+ L SVG +++ + +S +G + +D G T
Sbjct: 327 GGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTR 386
Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNIS--SQPKFPEVTIHFRG 313
L + +++L+ + V + G S + CY+ S S K P V HF G
Sbjct: 387 LQTQAYNSLRDAFLKLT-------VNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTG 439
Query: 314 AD-VKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSR 370
+ L N + D C AF ++++ + G + Q I YD+ + ++ ++
Sbjct: 440 GKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNK 499
Query: 371 C 371
C
Sbjct: 500 C 500
>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 355
Score = 152 bits (384), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 118/359 (32%), Positives = 166/359 (46%), Gaps = 32/359 (8%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YL + +GTP VDTGSD TW QC PC C+ Q LF P S+++ ++C +
Sbjct: 3 YLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGT--CYSQNDSLFIPNTSTSFTKLACGT 60
Query: 95 SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
C + C++ C Y + YG G S S+G+ +T+T + +G ++PN FGCG
Sbjct: 61 ELCNGLPYPMCNQTTCVYWYSYGDG---SLSTGDFVYDTITMDGINGQKQQVPNFAFGCG 117
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----PDQGSSKINFG-- 206
H N S + GI+GLG G S SQ+ T GKFSYCL P +S + FG
Sbjct: 118 HDNEG---SFAGADGILGLGQGPLSFPSQLKTVFNGKFSYCLVDWLAPPTQTSPLLFGDA 174
Query: 207 GIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS--------TGNIFVDTG 254
+ GV L+ + +YY+ L ISVG + L S++ G IF D+G
Sbjct: 175 AVPTFPGVKYISLLTNPKVPTYYYVKLNGISVGGKLLNISSTAFDIDSVGRAGTIF-DSG 233
Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ-PKFPEVTIHFRG 313
T L E H + + M+ P K + G L Q P P +T HF G
Sbjct: 234 TTVTQLAGEVHQEVLAAMNASTMDYPRKSDDSS-GLDLCLGGFAEGQLPTVPSMTFHFEG 292
Query: 314 ADVKLSPSNLFRNI-SDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
D++L PSN F + S + C + + G I Q NF + YD + F P C
Sbjct: 293 GDMELPPSNYFIFLESSQSYCFSMVSSPDVTIIGSIQQQNFQVYYDTVGRKIGFVPKSC 351
>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 500
Score = 152 bits (384), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 113/357 (31%), Positives = 177/357 (49%), Gaps = 38/357 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +GTP +++ +DTGSD W QCEPC DC++Q P+F+P SSTY S++CS+
Sbjct: 162 YFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCS--DCYQQSDPVFNPTSSSTYKSLTCSA 219
Query: 95 SQCAVV-TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
QC+++ TS C C Y YG G SF+ G LAT+T+TF ++ ++ +V GCG
Sbjct: 220 PQCSLLETSACRSNKCLYQVSYGDG---SFTVGELATDTVTFGNSG----KINDVALGCG 272
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---INFGGIVA 210
H N T + G+ G S+ +QM A FSYCL D+ S K ++F +
Sbjct: 273 HDNEGLFTGAAGLLGLGGG---ALSITNQMK---ATSFSYCLVDRDSGKSSSLDFNSVQL 326
Query: 211 GAGVVSTPLI----IRDHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTL 259
G+G + PL+ I YY+ L SVG Q++ + +S +G + +D G T
Sbjct: 327 GSGDATAPLLRNQKIDTFYYVGLSGFSVGGQKVMMPDAIFDVDASGSGGVILDCGTAVTR 386
Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFRGAD-V 316
L + +++L+ + KG + F CY+ S S K P V HF G +
Sbjct: 387 LQTQAYNSLRDAFLKLT-TNLKKGTSSISLFD--TCYDFSSLSSVKVPTVAFHFTGGKSL 443
Query: 317 KLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
L N + D C AF ++++ + G + Q I YD+ ++ ++C
Sbjct: 444 DLPAKNYLIPVDDNGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLANKIIGLSGNKC 500
>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 485
Score = 152 bits (383), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 117/361 (32%), Positives = 163/361 (45%), Gaps = 42/361 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y L +GTP ++ +DTGSD W QC PC C+ Q P+FDP+KS TY +I CSS
Sbjct: 142 YFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRR--CYSQSDPIFDPRKSKTYATIPCSS 199
Query: 95 SQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
C + S N C Y YG G SF+ G+ +TETLTF + V G
Sbjct: 200 PHCRRLDSAGCNTRRKTCLYQVSYGDG---SFTVGDFSTETLTFRRN-----RVKGVALG 251
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS----SKINFGG 207
CGH N + G+ S Q G KFSYCL D+ + S + FG
Sbjct: 252 CGHDNEGLFVGAAGLLGLGKG---KLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGN 308
Query: 208 IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS--------TGNIFVDTGV 255
TPL+ + YY+ L ISVG R+ V++S G + +D+G
Sbjct: 309 AAVSRIARFTPLLSNPKLDTFYYVELLGISVGGTRVPGVAASLFKLDQIGNGGVIIDSGT 368
Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR 312
T L + ++ KA + P FS C+++S ++ K P V +HFR
Sbjct: 369 SVTRLIRPAYIAMRDAFRVGAKA-----LKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFR 423
Query: 313 GADVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSR 370
GADV L +N + ++ C AF G + + G I Q F + YD+ + V F P
Sbjct: 424 GADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGG 483
Query: 371 C 371
C
Sbjct: 484 C 484
>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Glycine max]
Length = 392
Score = 151 bits (382), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 119/361 (32%), Positives = 180/361 (49%), Gaps = 39/361 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP D+ DTGSD TWTQCEPC C+KQ+ +FDP KSS+Y +I+C+S
Sbjct: 46 YVVVVGLGTPKRDLSLVFDTGSDLTWTQCEPCAG-SCYKQQDAIFDPSKSSSYTNITCTS 104
Query: 95 SQCAVVTSNCSEGDCS--------YSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
S C +TS+ + +CS Y YG S S G L+ E LT +T +
Sbjct: 105 SLCTQLTSDGIKSECSSSTDASCIYDAKYGDN---STSVGFLSQERLTITATD----IVD 157
Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSS--KIN 204
+ +FGCG N + G++GLG S++ Q ++ FSYCLP SS +
Sbjct: 158 DFLFGCGQDNEGLFNGSA---GLMGLGRHPISIVQQTSSNYNKIFSYCLPATSSSLGHLT 214
Query: 205 FGGIVA-GAGVVSTPL--IIRDH--YYLSLEAISVGNQRLEFVSSST---GNIFVDTGVL 256
FG A A ++ TPL I D+ Y L + +ISVG +L VSSST G +D+G +
Sbjct: 215 FGASAATNASLIYTPLSTISGDNSFYGLDIVSISVGGTKLPAVSSSTFSAGGSIIDSGTV 274
Query: 257 RTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKF--PEVTIHFRGA 314
T L ++ L+S ++ PV E G D CY++S + P + F G
Sbjct: 275 ITRLAPTVYAALRSAFRRXMEKYPVAN---EAGLLDT-CYDLSGYKEISVPRIDFEFSGG 330
Query: 315 -DVKLSPSNLFRNISDEIMCSAFRGGNAN---IVYGRIMQINFLIGYDIEQAMVSFKPSR 370
V+L + S++ +C AF ++ V+G + Q + YD++ + F +
Sbjct: 331 VTVELXHRGILXVESEQQVCLAFAANGSDNDITVFGNVQQKTLEVVYDVKGGRIGFGAAG 390
Query: 371 C 371
C
Sbjct: 391 C 391
>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
Length = 350
Score = 151 bits (382), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 118/350 (33%), Positives = 169/350 (48%), Gaps = 29/350 (8%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + GTP + DTGS+ W QC+PC + C+ Q+ PLFDP SSTY +ISC+S
Sbjct: 16 YVITVGFGTPKKNQTVIFDTGSNVNWIQCKPC-VVSCYPQQEPLFDPTLSSTYRNISCTS 74
Query: 95 SQCAVVTSN-CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
+ C ++S CS C Y YG G S + G LATET T + + N IFGCG
Sbjct: 75 AACTGLSSRGCSGSTCVYGVTYGDG---SSTVGFLATETFTLAAGN----VFNNFIFGCG 127
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFGGIVAG 211
N T + G+IGLG SL SQ+ TS+ FSYCLP S+ +N G +
Sbjct: 128 QNNQGLFTGAA---GLIGLGRSPYSLNSQLATSLGNIFSYCLPSTSSATGYLNIGNPLRT 184
Query: 212 AGVVSTPLIIR--DHYYLSLEAISVGNQRLEFVSS---STGNIFVDTGVLRTLLPLEYHS 266
G + R Y++ L ISVG RL S+ S G I +D+G + T LP +
Sbjct: 185 PGYTAMLTNSRAPTLYFIDLIGISVGGTRLALSSTVFQSVGTI-IDSGTVITRLPPTAYG 243
Query: 267 NLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFRGADVKLSPSNLF 324
L++ + Q + A D CY+ S + FP + +H+ G DV + + +F
Sbjct: 244 ALRTAFRAAMT-QYTRAAAAS--ILDT-CYDFSRTTTVTFPTIKLHYTGLDVTIPGAGVF 299
Query: 325 RNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
IS +C AF G + + + G + Q + YD + F C
Sbjct: 300 YVISSSQVCLAFAGNSDSTQIGIIGNVQQRTMEVTYDNALKRIGFAAGAC 349
>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 151 bits (382), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 117/361 (32%), Positives = 165/361 (45%), Gaps = 42/361 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y L +GTP ++ +DTGSD W QC PC C+ Q P+FDP+KS TY +I CSS
Sbjct: 142 YFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRR--CYSQSDPIFDPRKSKTYATIPCSS 199
Query: 95 SQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
C + S N C Y YG G SF+ G+ +TETLTF + V G
Sbjct: 200 PHCRRLDSAGCNTRRKTCLYQVSYGDG---SFTVGDFSTETLTFRRN-----RVKGVALG 251
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS----SKINFGG 207
CGH N + G+ S Q G KFSYCL D+ + S + FG
Sbjct: 252 CGHDNEGLFVGAAGLLGLGKG---KLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGN 308
Query: 208 IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS--------TGNIFVDTGV 255
TPL+ + YY+ L ISVG R+ V++S G + +D+G
Sbjct: 309 AAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGT 368
Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR 312
T L + ++ + A+ +K P FS C+++S ++ K P V +HFR
Sbjct: 369 SVTRLIRPAYIAMRDAF--RVGAKTLK---RAPDFSLFDTCFDLSNMNEVKVPTVVLHFR 423
Query: 313 GADVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSR 370
GADV L +N + ++ C AF G + + G I Q F + YD+ + V F P
Sbjct: 424 GADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGG 483
Query: 371 C 371
C
Sbjct: 484 C 484
>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
Length = 500
Score = 151 bits (382), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 114/354 (32%), Positives = 176/354 (49%), Gaps = 31/354 (8%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP DTGSD TW QCEPC + C+KQ+ LFDP +SSTY +ISC++
Sbjct: 161 YVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVV-CYKQQEKLFDPARSSTYANISCAA 219
Query: 95 SQCA-VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
C+ + CS G C Y YG G+Y S G A +TLT +S + FGCG
Sbjct: 220 PACSDLYIKGCSGGHCLYGVQYGDGSY---SIGFFAMDTLTLSSYDAI----KGFRFGCG 272
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--GSSKINF--GGIV 209
+N + G++GLG G +SL Q G F++C P + G+ ++F G +
Sbjct: 273 ERNEG---LYGEAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFPARSSGTGYLDFGPGSLP 329
Query: 210 AGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEY 264
A + ++TP+++ + YY+ L I VG + L S +T VD+G + T LP
Sbjct: 330 AVSAKLTTPMLVDNGPTFYYVGLTGIRVGGKLLSIPQSVFTTSGTIVDSGTVITRLPPAA 389
Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR-GADVKLSP 320
+S+L+S ++ A +G P S + CY+ + S+ P V++ F+ GA + +
Sbjct: 390 YSSLRSAFAS---AMAERGYKKAPALSLLDTCYDFTGMSEVAIPTVSLLFQGGASLDVHA 446
Query: 321 SNLFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
S + S C F G + + G F + YDI + +V F P C
Sbjct: 447 SGIIYAASVSQACLGFAGNKEDDDVGIVGNTQLKTFGVVYDIGKKVVGFCPGAC 500
>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 391
Score = 151 bits (382), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 122/371 (32%), Positives = 176/371 (47%), Gaps = 51/371 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YL+HL+IGTPP + ++DTGSD WTQC+PCP CF Q P FDP SST + SC S
Sbjct: 35 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPA--CFDQALPYFDPSTSSTLSLTSCDS 92
Query: 95 SQC-AVVTSNCSEGD------CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
+ C + ++C C Y++ YG S ++G L + TF G +P
Sbjct: 93 TLCQGLPVASCGSPKFWPNQTCVYTYSYGD---KSVTTGFLEVDKFTF---VGAGASVPG 146
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYC-------LPDQGS 200
V FGCG N + S +TGI G G G SL SQ+ G FS+C +P
Sbjct: 147 VAFGCGLFN--NGVFKSNETGIAGFGRGPLSLPSQLK---VGNFSHCFTTITGAIPSTVL 201
Query: 201 SKINFGGIVAGAGVV-STPLIIRDH-------YYLSLEAISVGNQRLEF------VSSST 246
+ G G V +TPLI YYLSL+ I+VG+ RL +++ T
Sbjct: 202 LDLPADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALTNGT 261
Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ--PKF 304
G +D+G T LP + + ++ + IK V G C++ SQ P
Sbjct: 262 GGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNAT----GHYTCFSAPSQAKPDV 317
Query: 305 PEVTIHFRGADVKLSPSNLFRNISDE----IMCSAFRGGNANIVYGRIMQINFLIGYDIE 360
P++ +HF GA + L N + D+ I+C A G+ + G Q N + YD++
Sbjct: 318 PKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKGDETTIIGNFQQQNMHVLYDLQ 377
Query: 361 QAMVSFKPSRC 371
M+SF ++C
Sbjct: 378 NNMLSFVAAQC 388
>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
Length = 454
Score = 151 bits (382), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 124/382 (32%), Positives = 182/382 (47%), Gaps = 61/382 (15%)
Query: 31 VDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQ-EPPLFDPKKSSTYNS 89
V + YLMH+S+GTPP + ++DTGSD WTQC PC LDCF+Q P+ DP SST+ +
Sbjct: 86 VTNEYLMHVSVGTPPRPVALTLDTGSDLVWTQCAPC--LDCFEQGAAPVLDPAASSTHAA 143
Query: 90 ISCSSSQC-AVVTSNC---SEGD--CSYSFLYGRGAYASFSSGNLATETLTF---NSTSG 140
+ C + C A+ ++C S GD C Y + YG S + G LAT++ TF ++ G
Sbjct: 144 LPCDAPLCRALPFTSCGGRSWGDRSCVYVYHYGD---RSLTVGQLATDSFTFGGDDNAGG 200
Query: 141 LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD--- 197
L V FGCGH N + +TGI G G G SL SQ+ + FSYC
Sbjct: 201 LAAR--RVTFGCGHIN--KGIFQANETGIAGFGRGRWSLPSQLNVT---SFSYCFTSMFD 253
Query: 198 -QGSSKINFGGI---------VAGAGVVSTPLIIRD-----HYYLSLEAISVGNQRLEFV 242
+ SS + G A G V T +I++ Y++ L ISVG R+
Sbjct: 254 TKSSSVVTLGAAAAELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGISVGGARVAVP 313
Query: 243 SSS-TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYN 297
S + +D+G T LP + + +KA+ V VG + LC+
Sbjct: 314 ESRLRSSTIIDSGASITTLPEDVY--------EAVKAEFVSQVGLPAAAAGSAALDLCFA 365
Query: 298 IS-----SQPKFPEVTIHFR-GADVKLSPSN-LFRNISDEIMCSAF-RGGNANIVYGRIM 349
+ +P P +T+H GAD +L N +F + + ++C +V G
Sbjct: 366 LPVAALWRRPAVPALTLHLDGGADWELPRGNYVFEDYAARVLCVVLDAAAGEQVVIGNYQ 425
Query: 350 QINFLIGYDIEQAMVSFKPSRC 371
Q N + YD+E ++SF P+RC
Sbjct: 426 QQNTHVVYDLENDVLSFAPARC 447
>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 446
Score = 151 bits (381), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 122/368 (33%), Positives = 181/368 (49%), Gaps = 48/368 (13%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCS 93
++LM+ SIG PP+ +DTGS TW C PC C +Q P+FDP KSSTY+++SCS
Sbjct: 92 VFLMNFSIGEPPIPQLAVMDTGSSLTWVMCHPCSS--CSQQSVPIFDPSKSSTYSNLSCS 149
Query: 94 S-SQCAVVTSNCSEGDCSYSFLY-GRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
++C VV G+C YS Y G G+ S G A E LT + +++P++IFG
Sbjct: 150 ECNKCDVVN-----GECPYSVEYVGSGS----SQGIYAREQLTLETIDESIIKVPSLIFG 200
Query: 152 CGHKNLASPTSDSKQ--TGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIV 209
CG K S Q G+ GLG G SL+ G KFSYC+ + ++ F +V
Sbjct: 201 CGRKFSISSNGYPYQGINGVFGLGSGRFSLLPSFGK----KFSYCIGNLRNTNYKFNRLV 256
Query: 210 AG-----AGVVSTPLIIRDHYYLSLEAISVGNQRLE-----FVSSSTGN---IFVDTGVL 256
G G +T +I YY++LEAIS+G ++L+ F S T N + +D+G
Sbjct: 257 LGDKANMQGDSTTLNVINGLYYVNLEAISIGGRKLDIDPTLFERSITDNNSGVIIDSGAD 316
Query: 257 RTLLPLEYHSNLKSVMSNMIK-----AQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHF 311
T L L + N+++ AQ K +S V+ ++S FP VT HF
Sbjct: 317 HTWLTKYGFEVLSFEVENLLEGVLVLAQQDKHNPYTLCYSGVVSQDLSG---FPLVTFHF 373
Query: 312 -RGADVKLSPSNLFRNISDEIMCSAFRGGN-------ANIVYGRIMQINFLIGYDIEQAM 363
GA + L +++F ++ C A GN + G + Q N+ +GYD+ +
Sbjct: 374 AEGAVLDLDVTSMFIQTTENEFCMAMLPGNYFGDDYESFSSIGMLAQQNYNVGYDLNRMR 433
Query: 364 VSFKPSRC 371
V F+ C
Sbjct: 434 VYFQRIDC 441
>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 477
Score = 151 bits (381), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 129/396 (32%), Positives = 187/396 (47%), Gaps = 72/396 (18%)
Query: 31 VDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP-PLFDPKKSSTYNS 89
V + YL+HLS+GTPP + ++DTGSD WTQC PC L+CF Q P+ DP SST+ +
Sbjct: 90 VTNEYLVHLSVGTPPRPVALTLDTGSDLVWTQCAPC--LNCFDQGAIPVLDPAASSTHAA 147
Query: 90 ISCSSSQC-AVVTSNCSEG-------DCSYSFLYGRGAYASFSSGNLATETLTF---NST 138
+ C + C A+ ++C G C Y + YG S + G LA++ TF ++
Sbjct: 148 VRCDAPVCRALPFTSCGRGGSSWGERSCVYVYHYGD---KSITVGKLASDRFTFGPGDNA 204
Query: 139 SGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP-- 196
G V + FGCGH N + +TGI G G G SL SQ+G + FSYC
Sbjct: 205 DGGGVSERRLTFGCGHFNKG--IFQANETGIAGFGRGRWSLPSQLGVT---SFSYCFTSM 259
Query: 197 -DQGSSKINFGGIVAGA------GVVSTPLIIRDH-----YYLSLEAISVGNQRLEFVSS 244
+ SS + G VA A V STPL +RD Y+LSL+AI+VG R+
Sbjct: 260 FESTSSLVTLG--VAPAELHLTGQVQSTPL-LRDPSQPSLYFLSLKAITVGATRIPIPER 316
Query: 245 ST----GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS 300
+ +D+G T LP + + +K+ + PV V G + LC+ + S
Sbjct: 317 RQRLREASAIIDSGASITTLPEDVYEAVKAEFVAQV-GLPVSAV---EGSALDLCFALPS 372
Query: 301 QP-------------------KFPEVTIHF-RGADVKLSPSN-LFRNISDEIMC----SA 335
+ P + H GAD +L N +F + +MC +A
Sbjct: 373 AAAPKSAFGWRWRGRGRAMPVRVPRLVFHLGGGADWELPRENYVFEDYGARVMCLVLDAA 432
Query: 336 FRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
GG+ +V G Q N + YD+E ++SF P+RC
Sbjct: 433 TGGGDQTVVIGNYQQQNTHVVYDLENDVLSFAPARC 468
>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
Length = 393
Score = 151 bits (381), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 119/356 (33%), Positives = 171/356 (48%), Gaps = 39/356 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+M +S+GTP DTGSD W Q EPC C +FDP++SST+ + CSS
Sbjct: 55 YVMDISVGTPGKRFRAIADTGSDLVWVQSEPC--TGCSGGT--IFDPRQSSTFREMDCSS 110
Query: 95 SQCAVVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
C + +C G CSYS+ YG G + G A +T++ +TSG + P+ GC
Sbjct: 111 QLCTELPGSCEPGSSACSYSYEYGSGE----TEGEFARDTISLGTTSGGSQKFPSFAVGC 166
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD----QGSSKINFG-- 206
G N D G++GLG G SL SQ+ +I KFSYCL D SS + FG
Sbjct: 167 GMVNSGFDGVD----GLVGLGQGPVSLTSQLSAAIDSKFSYCLVDINSQSESSPLLFGPS 222
Query: 207 GIVAGAGVVSTPL-----IIRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLP 261
+ G G+ ST + +Y L++ I+V Q + S G +D+G T +P
Sbjct: 223 AALHGTGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTM----GSPGTTIIDSGTTLTYVP 278
Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFRGADVKLS 319
+ + S M +M+ V G+ G LCY+ SS KFP +TI GA +
Sbjct: 279 SGVYGRVLSRMESMVTLPRVD--GSSMGLD--LCYDRSSNRNYKFPALTIRLAGATMTPP 334
Query: 320 PSNLFRNISD--EIMCSAF--RGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
SN F + D + +C A GG + G +MQ + I YD + +SF ++C
Sbjct: 335 SSNYFLVVDDSGDTVCLAMGSAGGLPVSIIGNVMQQGYHILYDRGSSELSFVQAKC 390
>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 451
Score = 150 bits (380), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 128/394 (32%), Positives = 186/394 (47%), Gaps = 62/394 (15%)
Query: 22 IIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDP 81
+ +Q + + Y M+LSIGTPPV DTGS WTQC PC E C + P F P
Sbjct: 77 VSFQTLLDNSAGAYNMNLSIGTPPVTFSVLADTGSSLIWTQCAPCTE--CAARPAPPFQP 134
Query: 82 KKSSTYNSISCSSSQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNST 138
SST++ + C+SS C +TS C+ C Y + YG G F++G LATETL
Sbjct: 135 ASSSTFSKLPCASSLCQFLTSPYLTCNATGCVYYYPYGMG----FTAGYLATETLHVGGA 190
Query: 139 SGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL--- 195
S P V FGC +N +S +GI+GLG SL+SQ+G G+FSYCL
Sbjct: 191 S-----FPGVAFGCSTENGVGNSS----SGIVGLGRSPLSLVSQVGV---GRFSYCLRSD 238
Query: 196 PDQGSSKINFGGI--VAGAGVVSTPLI------IRDHYYLSLEAISVGNQRLEFVSSS-- 245
D G S I FG + V G V STPL+ +YY++L I+VG L S++
Sbjct: 239 ADAGDSPILFGSLAKVTGGNVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPVTSTTFG 298
Query: 246 ---------TGNIFVDTGVLRTLLPLEYHSNLK-SVMSNMIKAQPVKGV-GAEPGFSDVL 294
G VD+G T L E ++ +K + +S M A V G GF L
Sbjct: 299 FTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFD--L 356
Query: 295 CYNIS-----SQPKFPEVTIHFRG-ADVKLSPSNLFRNISDE------IMCSAFRGGNAN 342
C++ + S P + + F G A+ + + ++ + + C +
Sbjct: 357 CFDATAAGGGSGVPVPTLVLRFAGGAEYAVRRRSYVGVVAVDSQGRAAVECLLVLPASEK 416
Query: 343 I---VYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
+ + G +MQ++ + YD++ M SF P+ C N
Sbjct: 417 LSISIIGNVMQMDLHVLYDLDGGMFSFAPADCAN 450
>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
Short=AtASPG1; Flags: Precursor
gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 500
Score = 150 bits (380), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 112/361 (31%), Positives = 176/361 (48%), Gaps = 46/361 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +GTP +++ +DTGSD W QCEPC DC++Q P+F+P SSTY S++CS+
Sbjct: 162 YFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCA--DCYQQSDPVFNPTSSSTYKSLTCSA 219
Query: 95 SQCAVV-TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
QC+++ TS C C Y YG G SF+ G LAT+T+TF ++ ++ NV GCG
Sbjct: 220 PQCSLLETSACRSNKCLYQVSYGDG---SFTVGELATDTVTFGNSG----KINNVALGCG 272
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---INFGGIVA 210
H N + TG GL ++S A FSYCL D+ S K ++F +
Sbjct: 273 HDN------EGLFTGAAGLLGLGGGVLSITNQMKATSFSYCLVDRDSGKSSSLDFNSVQL 326
Query: 211 GAGVVSTPLI----IRDHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTL 259
G G + PL+ I YY+ L SVG +++ + +S +G + +D G T
Sbjct: 327 GGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTR 386
Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNIS--SQPKFPEVTIHFRG 313
L + +++L+ + V + G S + CY+ S S K P V HF G
Sbjct: 387 LQTQAYNSLRDAFLKLT-------VNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTG 439
Query: 314 AD-VKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSR 370
+ L N + D C AF ++++ + G + Q I YD+ + ++ ++
Sbjct: 440 GKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNK 499
Query: 371 C 371
C
Sbjct: 500 C 500
>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
Length = 410
Score = 150 bits (380), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 119/350 (34%), Positives = 177/350 (50%), Gaps = 31/350 (8%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y M SIGTPP ++ DTGSD W +C C C Q P + P KSS+++ + CS
Sbjct: 82 YDMTFSIGTPPQELSALADTGSDLIWAKCGACTR--CVPQGSPSYYPNKSSSFSKLPCSG 139
Query: 95 SQCA-VVTSNCSEG--DCSYSFLYGRGAY-ASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
S C+ + +S CS G +C Y + YG + ++ G L +ET T S + +P + F
Sbjct: 140 SLCSDLPSSQCSAGGAECDYKYSYGLASDPHHYTQGYLGSETFTLGSDA-----VPGIGF 194
Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS--SKINFG-G 207
GC S +G++GLG G SL+SQ+ G FSYCL + S + FG G
Sbjct: 195 GC---TTMSEGGYGSGSGLVGLGRGPLSLVSQLNV---GAFSYCLTSDAAKTSPLLFGSG 248
Query: 208 IVAGAGVVSTPLIIRDHYY--LSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLPLEYH 265
+ GAGV STPL+ YY ++LE+IS+G SS I D+G L +
Sbjct: 249 ALTGAGVQSTPLLRTSTYYYTVNLESISIGAATTAGTGSS--GIIFDSGTTVAFLAEPAY 306
Query: 266 SNLK-SVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNLF 324
+ K +V+S G G+ +C+ S FP + +HF G D+ L N F
Sbjct: 307 TLAKEAVLSQTTNLTMASG---RDGYE--VCFQTSGA-VFPSMVLHFDGGDMDLPTENYF 360
Query: 325 RNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTNY 374
+ D + C + + + G IMQ+N+ I YD+E++M+SF+P+ C N+
Sbjct: 361 GAVDDSVSCWIVQKSPSLSIVGNIMQMNYHIRYDVEKSMLSFQPANCDNF 410
>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
Length = 488
Score = 150 bits (380), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 121/361 (33%), Positives = 169/361 (46%), Gaps = 42/361 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y L +GTP ++ +DTGSD W QC PC + C+ Q P+FDP KS ++ +I C S
Sbjct: 145 YFTRLGVGTPARYVYMVLDTGSDIVWIQCAPC--IKCYSQTDPVFDPTKSRSFANIPCGS 202
Query: 95 SQCAVVT-SNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
C + CS C Y YG G SF+ G +TETLTF T + V+ G
Sbjct: 203 PLCRRLDYPGCSTKKQICLYQVSYGDG---SFTVGEFSTETLTFRGT-----RVGRVVLG 254
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS----SKINFGG 207
CGH N + G+ S SQ+G KFSYCL D+ + S I FG
Sbjct: 255 CGHDNEGLFVGAAGLLGLGRG---RLSFPSQIGRRFNSKFSYCLGDRSASSRPSSIVFGD 311
Query: 208 IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSS------STGN--IFVDTGV 255
TPL+ + YY+ L ISVG R+ +S+ STGN + +D+G
Sbjct: 312 SAISRTTRFTPLLSNPKLDTFYYVELLGISVGGTRVSGISASLFKLDSTGNGGVIIDSGT 371
Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR 312
T L + L+ ++ A +K P FS C+++S ++ K P V +HFR
Sbjct: 372 SVTRLTRAAYVALRDAF--LVGASNLK---RAPEFSLFDTCFDLSGKTEVKVPTVVLHFR 426
Query: 313 GADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSR 370
GADV L SN + + C AF G + + + G I Q F + YD+ + V F P
Sbjct: 427 GADVPLPASNYLIPVDNSGSFCFAFAGTASGLSIIGNIQQQGFRVVYDLATSRVGFAPRG 486
Query: 371 C 371
C
Sbjct: 487 C 487
>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 385
Score = 150 bits (379), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 123/385 (31%), Positives = 178/385 (46%), Gaps = 59/385 (15%)
Query: 24 YQAEIISVDDI----YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLF 79
+QA +IS + Y + +S+GTPP ++ +DTGSD W QC PC + C+ Q +F
Sbjct: 22 FQAPVISGLSLGSGEYFIRVSVGTPPRGMYLVMDTGSDILWLQCAPC--VSCYHQCDEVF 79
Query: 80 DPKKSSTYNSISCSSSQCA-VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNST 138
DP KSSTY+++ C+S QC + C C Y YG G SFS+G AT+ ++ NST
Sbjct: 80 DPYKSSTYSTLGCNSRQCLNLDVGGCVGNKCLYQVDYGDG---SFSTGEFATDAVSLNST 136
Query: 139 S-GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD 197
S G V + + GCGH N + G+ S +Q+ + G+FSYCL
Sbjct: 137 SGGGQVVLNKIPLGCGHDNEGYFVGAAGLLGLGKG---PLSFPNQINSENGGRFSYCLTG 193
Query: 198 QGSSKINFGGIVAG------AGVVSTP----LIIRDHYYLSLEAISVGNQRLEFVSSS-- 245
+ + ++ G AGV TP L + YYL + ISVG L +S+
Sbjct: 194 RDTDSTERSSLIFGDAAVPPAGVRFTPQASNLRVSTFYYLKMTGISVGGSILTIPTSAFQ 253
Query: 246 -----TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL------ 294
G + +D+G T L +++L+ G SD++
Sbjct: 254 LDSLGNGGVIIDSGTSVTRLQNAAYASLREAF--------------RAGTSDLVLTTEFS 299
Query: 295 ----CYNIS--SQPKFPEVTIHFR-GADVKLSPSNLFRNISD-EIMCSAFRGGNANIVYG 346
CYN+S S P VT+HF+ GAD+KL SN + + C AF G + G
Sbjct: 300 LFDTCYNLSDLSSVDVPTVTLHFQGGADLKLPASNYLVPVDNSSTFCLAFAGTTGPSIIG 359
Query: 347 RIMQINFLIGYDIEQAMVSFKPSRC 371
I Q F + YD V F PS+C
Sbjct: 360 NIQQQGFRVIYDNLHNQVGFVPSQC 384
>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 453
Score = 150 bits (379), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 109/368 (29%), Positives = 175/368 (47%), Gaps = 43/368 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ L++GTPP I +DTGSD WTQC+ C C +Q PLF P+ SS+Y + C+
Sbjct: 98 YVLDLAVGTPPQPITALLDTGSDLIWTQCDTCTA--CLRQPDPLFSPRMSSSYEPMRCAG 155
Query: 95 SQCA-VVTSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
C ++ +C D C+Y + YG G + + G ATE TF S+SG +P + FGC
Sbjct: 156 QLCGDILHHSCVRPDTCTYRYSYGDG---TTTLGYYATERFTFASSSGETQSVP-LGFGC 211
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---INFGGIV 209
G N+ S + S GI+G G SL+SQ+ +FSYCL SS+ + FG +
Sbjct: 212 GTMNVGSLNNAS---GIVGFGRDPLSLVSQLSIR---RFSYCLTPYASSRKSTLQFGSLA 265
Query: 210 -------AGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSS-------TGNIFV 251
A V +TP++ YY++ ++VG +RL +S+ +G + +
Sbjct: 266 DVGLYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGARRLRIPASAFALRPDGSGGVII 325
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIK------AQPVKGVGAEPGFSDVLCYNISSQPKFP 305
D+G TL P+ + + + ++ + P GV ++ Q P
Sbjct: 326 DSGTALTLFPVAVLAEVVRAFRSQLRLPFANGSSPDDGVCFAAPAVAAGGGRMARQVAVP 385
Query: 306 EVTIHFRGADVKLSPSN-LFRNISDEIMCSAF-RGGNANIVYGRIMQINFLIGYDIEQAM 363
+ HF+GAD+ L N + + +C G+ G +Q + + YD+E+
Sbjct: 386 RMVFHFQGADLDLPRENYVLEDHRRGHLCVLLGDSGDDGATIGNFVQQDMRVVYDLERET 445
Query: 364 VSFKPSRC 371
+SF P C
Sbjct: 446 LSFAPVEC 453
>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 406
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 119/376 (31%), Positives = 177/376 (47%), Gaps = 41/376 (10%)
Query: 24 YQAEIISVDDI----YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLF 79
+QA ++S + Y + +S+GTPP ++ +DTGSD W QC PC ++C+ Q +F
Sbjct: 43 FQAPVVSGLSLGSGEYFIRISVGTPPRRMYLVMDTGSDILWLQCAPC--VNCYHQSDAIF 100
Query: 80 DPKKSSTYNSISCSSSQCA-VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNST 138
DP KSSTY+++ CS+ QC + C C Y YG G SF++G T+ ++ NST
Sbjct: 101 DPYKSSTYSTLGCSTRQCLNLDIGTCQANKCLYQVDYGDG---SFTTGEFGTDDVSLNST 157
Query: 139 SGL-PVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD 197
SG+ V + + GCGH N + G+ S +Q+ G+FSYCL D
Sbjct: 158 SGVGQVVLNKIPLGCGHDNEGYFVGAAGLLGLGKG---PLSFPNQVDPQNGGRFSYCLTD 214
Query: 198 ------QGSSKINFGGIVAGAGVVSTP----LIIRDHYYLSLEAISVGNQRLEFVSSS-- 245
+GSS + V AG TP + + YYL + ISVG L +S+
Sbjct: 215 RETDSTEGSSLVFGEAAVPPAGARFTPQDSNMRVPTFYYLKMTGISVGGTILTIPTSAFQ 274
Query: 246 -----TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS 299
G + +D+G T L +++L+ + GFS CY++S
Sbjct: 275 LDSLGNGGVIIDSGTSVTRLQNAAYASLRDAFR-----AGTSDLAPTAGFSLFDTCYDLS 329
Query: 300 --SQPKFPEVTIHFRGA-DVKLSPSNLFRNISD-EIMCSAFRGGNANIVYGRIMQINFLI 355
+ P VT+HF+G D+KL SN + + C AF G + G I Q F +
Sbjct: 330 GLASVDVPTVTLHFQGGTDLKLPASNYLIPVDNSNTFCLAFAGTTGPSIIGNIQQQGFRV 389
Query: 356 GYDIEQAMVSFKPSRC 371
YD V F PS+C
Sbjct: 390 IYDNLHNQVGFVPSQC 405
>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
Length = 456
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 118/394 (29%), Positives = 183/394 (46%), Gaps = 51/394 (12%)
Query: 11 NDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELD 70
ND++ P + + S D Y++ L+IGTPP + +DTGSD WTQC PC
Sbjct: 81 NDDQRTTPPTGVSVRP---SGDLEYVVDLAIGTPPQPVSALLDTGSDLIWTQCAPCAS-- 135
Query: 71 CFKQEPPLFDPKKSSTYNSISCSSSQCA-VVTSNCSEGD-CSYSFLYGRGAYASFSSGNL 128
C Q PLF P +S++Y + C+ C+ ++ C D C+Y + YG G + + G
Sbjct: 136 CLAQPDPLFAPGESASYEPMRCAGQLCSDILHHGCEMPDTCTYRYNYGDG---TMTMGVY 192
Query: 129 ATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIA 188
ATE TF S+ G + + FGCG N+ S + S GI+G G SL+SQ+
Sbjct: 193 ATERFTFTSSGGDRLMTVPLGFGCGSMNVGSLNNGS---GIVGFGRNPLSLVSQLSIR-- 247
Query: 189 GKFSYCLPDQGS---SKINFGGIVAG------AGVVSTPLIIR----DHYYLSLEAISVG 235
+FSYCL GS S + FG + G V +TPL+ YY+ L ++VG
Sbjct: 248 -RFSYCLTSYGSGRKSTLLFGSLSGGVYGDATGPVQTTPLLQSLQNPTFYYVHLAGLTVG 306
Query: 236 NQRLEFVSSS-------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEP 288
+RL S+ +G + VD+G TLLP + + ++ G E
Sbjct: 307 ARRLRIPESAFALRPDGSGGVIVDSGTALTLLPGAVLAEVVRAFRQQLRLPFANGGNPED 366
Query: 289 GFSDVLCYNI---------SSQPKFPEVTIHFRGADVKLSPSN-LFRNISDEIMCSAFR- 337
G +C+ + +SQ P + HF+ AD+ L N + + +C
Sbjct: 367 G----VCFLVPAAWRRSSSTSQVPVPRMVFHFQDADLDLPRRNYVLDDHRKGRLCLLLAD 422
Query: 338 GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
G+ G ++Q + + YD+E +SF P++C
Sbjct: 423 SGDDGSTIGNLVQQDMRVLYDLEAETLSFAPAQC 456
>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
Length = 515
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 115/354 (32%), Positives = 176/354 (49%), Gaps = 34/354 (9%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP DTGSD TW QC+PC + C++Q LFDP SSTY ++SC++
Sbjct: 179 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPC-VVACYEQREKLFDPASSSTYANVSCAA 237
Query: 95 SQCA-VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
C+ + S CS G C Y YG G+Y S G A +TLT +S + FGCG
Sbjct: 238 PACSDLDVSGCSGGHCLYGVQYGDGSY---SIGFFAMDTLTLSSYD----AVKGFRFGCG 290
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--GSSKINFGGIVAG 211
+N + G++GLG G +SL Q G F++CLP + G+ ++FG AG
Sbjct: 291 ERNDG---LFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPARSTGTGYLDFG---AG 344
Query: 212 A--GVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEY 264
+ +TP++ + YY+ + I VG + L S + VD+G + T LP
Sbjct: 345 SPPATTTTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAAGTIVDSGTVITRLPPAA 404
Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFR-GADVKLSPS 321
+S+L+S + + A+ + A CY+ + SQ P V++ F+ GA + + S
Sbjct: 405 YSSLRSAFAAAMAARGYRKAAAVSLLD--TCYDFTGMSQVAIPTVSLLFQGGAALDVDAS 462
Query: 322 NLFRNISDEIMCSAFR----GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ +S +C AF GG+ IV G F + YDI + +V F P C
Sbjct: 463 GIMYTVSASQVCLAFAGNEDGGDVGIV-GNTQLKTFGVAYDIGKKVVGFSPGAC 515
>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 427
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 117/358 (32%), Positives = 177/358 (49%), Gaps = 41/358 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
+L+++SIG+PP+ +DT SD W QC PC ++C+ Q P+FDP +S T+ + +C +
Sbjct: 85 FLVNISIGSPPITQLLHMDTASDLLWIQCLPC--INCYAQSLPIFDPSRSYTHRNETCRT 142
Query: 95 SQCAV--VTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNST--SGLPVEMPNVIF 150
SQ ++ + N + C YS Y + S G LA E L FN+ + +V+F
Sbjct: 143 SQYSMPSLKFNANTRSCEYSMRYVDD---TGSKGILAREMLLFNTIYDESSSAALHDVVF 199
Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----PDQGSSKINF 205
GCGH N P TGI+GLG G SL+ + G KFSYC P + +
Sbjct: 200 GCGHDNYGEPLVG---TGILGLGYGEFSLVHRFGK----KFSYCFGSLDDPSYPHNVLVL 252
Query: 206 GGIVAGAGVV--STPLIIRD-HYYLSLEAISVGNQRLEF--------VSSSTGNIFVDTG 254
G GA ++ +TPL I + YY+++EAISV L + G +DTG
Sbjct: 253 GD--DGANILGDTTPLEIHNGFYYVTIEAISVDGIILPIDPRVFNRNHQTGLGGTIIDTG 310
Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK-----FPEVTI 309
T L E + LK+ + ++ + + ++ + CYN + + FP VT
Sbjct: 311 NSLTSLVEEAYKPLKNRIEDIFEGRFTAADVSQDDMIKMECYNGNFERDLVESGFPIVTF 370
Query: 310 HF-RGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSF 366
HF GA++ L +LF +S + C A GN N + G Q ++ IGYD+E VSF
Sbjct: 371 HFSEGAELSLDVKSLFMKLSPNVFCLAVTPGNLNSI-GATAQQSYNIGYDLEAMEVSF 427
>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
Length = 453
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 109/368 (29%), Positives = 174/368 (47%), Gaps = 43/368 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ L++GTPP I +DTGSD WTQC+ C C +Q PLF P+ SS+Y + C+
Sbjct: 98 YVLDLAVGTPPQPITALLDTGSDLIWTQCDTCTA--CLRQPDPLFSPRMSSSYEPMRCAG 155
Query: 95 SQCA-VVTSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
C ++ +C D C+Y + YG G + + G ATE TF S+SG +P + FGC
Sbjct: 156 QLCGDILHHSCVRPDTCTYRYSYGDG---TTTLGYYATERFTFASSSGETQSVP-LGFGC 211
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---INFGGIV 209
G N+ S + S GI+G G SL+SQ+ +FSYCL SS+ + FG +
Sbjct: 212 GTMNVGSLNNAS---GIVGFGRDPLSLVSQLSIR---RFSYCLTPYASSRKSTLQFGSLA 265
Query: 210 -------AGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSS-------TGNIFV 251
A V +TP++ YY++ ++VG +RL +S+ +G + +
Sbjct: 266 DVGLYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGARRLRIPASAFALRPDGSGGVII 325
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIK------AQPVKGVGAEPGFSDVLCYNISSQPKFP 305
D+G TL P + + + ++ + P GV ++ Q P
Sbjct: 326 DSGTALTLFPAAVLAEVVRAFRSQLRLPFANGSSPDDGVCFAAPAVAAGGGRMARQVAVP 385
Query: 306 EVTIHFRGADVKLSPSN-LFRNISDEIMCSAF-RGGNANIVYGRIMQINFLIGYDIEQAM 363
+ HF+GAD+ L N + + +C G+ G +Q + + YD+E+
Sbjct: 386 RMVFHFQGADLDLPRENYVLEDHRRGHLCVLLGDSGDDGATIGNFVQQDMRVVYDLERET 445
Query: 364 VSFKPSRC 371
+SF P C
Sbjct: 446 LSFAPVEC 453
>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 121/361 (33%), Positives = 171/361 (47%), Gaps = 42/361 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y L +GTPP ++ +DTGSD W QC+PC + C+ Q +FDP KS ++ I C S
Sbjct: 130 YFTRLGVGTPPKYLYMVLDTGSDVVWLQCKPCTK--CYSQTDQIFDPSKSKSFAGIPCYS 187
Query: 95 SQCAVVTS-NCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
C + S CS + C Y YG G SF+ G+ +TETLTF + +P V G
Sbjct: 188 PLCRRLDSPGCSLKNNLCQYQVSYGDG---SFTFGDFSTETLTFRRAA-----VPRVAIG 239
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK----INFGG 207
CGH N + G+ S +Q GT KFSYCL D+ +S I FG
Sbjct: 240 CGHDNEGLFVGAAGLLGLGRG---GLSFPTQTGTRFNNKFSYCLTDRTASAKPSSIVFGD 296
Query: 208 IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSS------STGN--IFVDTGV 255
TPL+ + YY+ L ISVG + +S+ STGN + +D+G
Sbjct: 297 SAVSRTARFTPLVKNPKLDTFYYVELLGISVGGAPVRGISASFFRLDSTGNGGVIIDSGT 356
Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR 312
T L + +L+ + A +K P FS CY++S S+ K P V +HFR
Sbjct: 357 SVTRLTRPAYVSLRDAF--RVGASHLK---RAPEFSLFDTCYDLSGLSEVKVPTVVLHFR 411
Query: 313 GADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSR 370
GADV L +N + + C AF G + + + G I Q F + +D+ + V F P
Sbjct: 412 GADVSLPAANYLVPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRVVFDLAGSRVGFAPRG 471
Query: 371 C 371
C
Sbjct: 472 C 472
>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
Length = 519
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 115/354 (32%), Positives = 176/354 (49%), Gaps = 34/354 (9%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP DTGSD TW QC+PC + C++Q LFDP SSTY ++SC++
Sbjct: 183 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPC-VVACYEQREKLFDPASSSTYANVSCAA 241
Query: 95 SQCA-VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
C+ + S CS G C Y YG G+Y S G A +TLT +S + FGCG
Sbjct: 242 PACSDLDVSGCSGGHCLYGVQYGDGSY---SIGFFAMDTLTLSSYD----AVKGFRFGCG 294
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--GSSKINFGGIVAG 211
+N + G++GLG G +SL Q G F++CLP + G+ ++FG AG
Sbjct: 295 ERNDG---LFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPARSTGTGYLDFG---AG 348
Query: 212 A--GVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEY 264
+ +TP++ + YY+ + I VG + L S + VD+G + T LP
Sbjct: 349 SPPATTTTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAAGTIVDSGTVITRLPPAA 408
Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFR-GADVKLSPS 321
+S+L+S + + A+ + A CY+ + SQ P V++ F+ GA + + S
Sbjct: 409 YSSLRSAFAAAMAARGYRKAAAVSLLD--TCYDFTGMSQVAIPTVSLLFQGGAALDVDAS 466
Query: 322 NLFRNISDEIMCSAFR----GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ +S +C AF GG+ IV G F + YDI + +V F P C
Sbjct: 467 GIMYTVSASQVCLAFAGNEDGGDVGIV-GNTQLKTFGVAYDIGKKVVGFSPGAC 519
>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
Length = 516
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 115/354 (32%), Positives = 176/354 (49%), Gaps = 34/354 (9%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP DTGSD TW QC+PC + C++Q LFDP SSTY ++SC++
Sbjct: 180 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPC-VVACYEQREKLFDPASSSTYANVSCAA 238
Query: 95 SQCA-VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
C+ + S CS G C Y YG G+Y S G A +TLT +S + FGCG
Sbjct: 239 PACSDLDVSGCSGGHCLYGVQYGDGSY---SIGFFAMDTLTLSSYD----AVKGFRFGCG 291
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--GSSKINFGGIVAG 211
+N + G++GLG G +SL Q G F++CLP + G+ ++FG AG
Sbjct: 292 ERNDG---LFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPPRSTGTGYLDFG---AG 345
Query: 212 A--GVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEY 264
+ +TP++ + YY+ + I VG + L S + VD+G + T LP
Sbjct: 346 SPPATTTTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAAGTIVDSGTVITRLPPAA 405
Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFR-GADVKLSPS 321
+S+L+S + + A+ + A CY+ + SQ P V++ F+ GA + + S
Sbjct: 406 YSSLRSAFAAAMAARGYRKAAAVSLLD--TCYDFTGMSQVAIPTVSLLFQGGAALDVDAS 463
Query: 322 NLFRNISDEIMCSAFR----GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ +S +C AF GG+ IV G F + YDI + +V F P C
Sbjct: 464 GIMYTVSASQVCLAFAGNEDGGDVGIV-GNTQLKTFGVAYDIGKKVVGFSPGAC 516
>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 448
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 118/364 (32%), Positives = 169/364 (46%), Gaps = 41/364 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +GTPP +DTGSD W QC+PC + C++Q PL+DP+ SSTY CS
Sbjct: 99 YFASVGVGTPPTPALLVIDTGSDVVWLQCKPC--VHCYRQLSPLYDPRGSSTYAQTPCSP 156
Query: 95 SQCA-VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTF-NSTSGLPVEMPNVIFGC 152
QC T + + G C Y +YG AS +SGNLAT+ L F N TS + NV GC
Sbjct: 157 PQCRNPQTCDGTTGGCGYRIVYGD---ASSTSGNLATDRLVFSNDTS-----VGNVTLGC 208
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGA 212
GH N S G++G+ GN+S +Q+ S F+YCL D+ S + +V G
Sbjct: 209 GHDNEGLFGS---AAGLLGVARGNNSFATQVADSYGRYFAYCLGDRTRSGSSSSYLVFGR 265
Query: 213 GV------VSTPLIIRDH----YYLSLEAISVGNQRLEFVSSST---------GNIFVDT 253
V TPL YY+ + SVG + + S+++ G + VD+
Sbjct: 266 TAPEPPSSVFTPLRSNPRRPSLYYVDMVGFSVGGEPVTGFSNASLSLDPATGRGGVVVDS 325
Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHF 311
G T + + L+ ++ VG D CY++ + P V +HF
Sbjct: 326 GTSITRFARDAYGALRDAFDARAAKVGMRKVGRGISVFDA-CYDLRGVAVADAPGVVLHF 384
Query: 312 R-GADVKLSPSN-LFRNISDEIMCSAFR--GGNANIVYGRIMQINFLIGYDIEQAMVSFK 367
GADV L P N L S C A G + V G ++Q F + +D+E V F+
Sbjct: 385 AGGADVALPPENYLVPEESGRYHCFALEAAGHDGLSVIGNVLQQRFRVVFDVENERVGFE 444
Query: 368 PSRC 371
P+ C
Sbjct: 445 PNGC 448
>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
Length = 485
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 116/361 (32%), Positives = 164/361 (45%), Gaps = 42/361 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y L +GTP ++ +DTGSD W QC PC C+ Q P+FDP+KS TY +I CSS
Sbjct: 142 YFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRR--CYSQSDPIFDPRKSKTYATIPCSS 199
Query: 95 SQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
C + S N C Y YG G SF+ G+ +TETLTF + V G
Sbjct: 200 PHCRRLDSAGCNTRRKTCLYQVSYGDG---SFTVGDFSTETLTFRRN-----RVKGVALG 251
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS----SKINFGG 207
CGH N + G+ S Q G KFSYCL D+ + S + FG
Sbjct: 252 CGHDNEGLFVGAAGLLGLGKG---KLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGN 308
Query: 208 IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS--------TGNIFVDTGV 255
TPL+ + YY+ L ISVG R+ V++S G + +D+G
Sbjct: 309 AAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGT 368
Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR 312
T L + ++ + A+ +K P FS C+++S ++ K P V +HFR
Sbjct: 369 SVTRLIRPAYIAMRDAFR--VGAKTLK---RAPNFSLFDTCFDLSNMNEVKVPTVVLHFR 423
Query: 313 GADVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSR 370
ADV L +N + ++ C AF G + + G I Q F + YD+ + V F P
Sbjct: 424 RADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGG 483
Query: 371 C 371
C
Sbjct: 484 C 484
>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
Length = 443
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 119/373 (31%), Positives = 180/373 (48%), Gaps = 44/373 (11%)
Query: 28 IISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTY 87
+++ D YLM + IGTP +DTGSD WTQC PC L C Q P FDP +S+TY
Sbjct: 83 VLASDGEYLMEMGIGTPTRYYSAILDTGSDLIWTQCAPC--LLCVDQPTPYFDPARSATY 140
Query: 88 NSISCSSSQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
S+ C+S C A+ C + C Y + YG A ++G LA ET TF T+ V +P
Sbjct: 141 RSLGCASPACNALYYPLCYQKVCVYQYFYGDSAS---TAGVLANETFTFG-TNETRVSLP 196
Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKI 203
+ FGCG+ N S + S G++G G G+ SL+SQ+G+ +FSYCL S S++
Sbjct: 197 GISFGCGNLNAGSLANGS---GMVGFGRGSLSLVSQLGSP---RFSYCLTSFLSPVPSRL 250
Query: 204 NFGGI-------VAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEF--------VSS 244
FG + V STP ++ Y+L++ ISVG L +
Sbjct: 251 YFGVYATLNSTNASSEPVQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTD 310
Query: 245 STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK- 303
TG +D+G T L + +++ ++ I P+ V + D C+ P+
Sbjct: 311 GTGGTIIDSGTTITYLAEPAYDAVRAAFASQITL-PLLNV-TDASVLDT-CFQWPPPPRQ 367
Query: 304 ---FPEVTIHFRGADVKLSPSN--LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYD 358
P++ +HF GAD +L N L + +C A + + G NF + YD
Sbjct: 368 SVTLPQLVLHFDGADWELPLQNYMLVDPSTGGGLCLAMASSSDGSIIGSYQHQNFNVLYD 427
Query: 359 IEQAMVSFKPSRC 371
+E +++SF P+ C
Sbjct: 428 LENSLMSFVPAPC 440
>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
Length = 336
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 114/348 (32%), Positives = 167/348 (47%), Gaps = 43/348 (12%)
Query: 52 VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTS-NCSEGDCS 110
+DTGSD WTQC PC L C Q P FD KKS+TY ++ C SS+CA ++S +C + C
Sbjct: 1 MDTGSDLIWTQCAPC--LLCADQPTPYFDVKKSATYRALPCRSSRCASLSSPSCFKKMCV 58
Query: 111 YSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGII 170
Y + YG A ++G LA ET TF + + V N+ FGCG N + S G++
Sbjct: 59 YQYYYGDTAS---TAGVLANETFTFGAANSTKVRATNIAFGCGSLNAGDLANSS---GMV 112
Query: 171 GLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFG--------GIVAGAGVVSTPL 219
G G G SL+SQ+G S +FSYCL S S++ FG +G+ V STP
Sbjct: 113 GFGRGPLSLVSQLGPS---RFSYCLTSYLSATPSRLYFGVYANLSSTNTSSGSPVQSTPF 169
Query: 220 IIR----DHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTLLPLEYHSNL 268
+I + Y+LSL+AIS+G + L TG + +D+G T L + +
Sbjct: 170 VINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAY--- 226
Query: 269 KSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK----FPEVTIHFRGADVKLSPSNLF 324
++V ++ A P+ + D C+ P P++ HF A++ L P N
Sbjct: 227 EAVRRGLVSAIPLPAMNDTDIGLDT-CFQWPPPPNVTVTVPDLVFHFDSANMTLLPENYM 285
Query: 325 RNISDE-IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
S +C + G Q N + YDI + +SF P+ C
Sbjct: 286 LIASTTGYLCLVMAPTGVGTIIGNYQQQNLHLLYDIGNSFLSFVPAPC 333
>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 456
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 113/360 (31%), Positives = 172/360 (47%), Gaps = 35/360 (9%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
+L++LSIG+PPV VDTGS W QC PC ++CF+Q FDP KS ++ ++ C
Sbjct: 104 FLVNLSIGSPPVTQLVVVDTGSSLLWVQCLPC--INCFQQSTSWFDPLKSVSFKTLGCGF 161
Query: 95 SQCAVVTS-NCSE-GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
+ C+ Y Y G S G LA E+L F + ++ N+ FGC
Sbjct: 162 PGYNYINGYKCNRFNQAEYKLRYLGG---DSSQGILAKESLLFETLDEGKIKKSNITFGC 218
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGA 212
GH N+ + +D G+ GLG M T + KFSYC+ D + +V G
Sbjct: 219 GHMNIKT-NNDDAYNGVFGLGAYPH---ITMATQLGNKFSYCIGDINNPLYTHNHLVLGQ 274
Query: 213 GVV----STPLIIR-DHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTLL 260
G STPL I HYY++L++ISVG++ L + S +G + +D+G+ T L
Sbjct: 275 GSYIEGDSTPLQIHFGHYYVTLQSISVGSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKL 334
Query: 261 P---LE-YHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFR-GAD 315
E + + +M +++ P + F V+ ++ FP VT HF GAD
Sbjct: 335 ANGGFELLYDEIVDLMKGLLERIPTQRKFEGLCFKGVVSRDLVG---FPAVTFHFAGGAD 391
Query: 316 VKLSPSNLFRNISDEIMCSAFRGGNANI----VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ L +LFR + C A N+ + V G + Q N+ +G+D+EQ V F+ C
Sbjct: 392 LVLESGSLFRQHGGDRFCLAILPSNSELLNLSVIGILAQQNYNVGFDLEQMKVFFRRIDC 451
>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 354
Score = 148 bits (373), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 107/356 (30%), Positives = 170/356 (47%), Gaps = 34/356 (9%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + + +G+P VDTGS +W QC+PC + C Q PLFDP S TY S+SC+S
Sbjct: 13 YYVKVGLGSPARYYSMIVDTGSSLSWLQCKPC-VVYCHVQADPLFDPSASKTYKSLSCTS 71
Query: 95 SQCAVVTS--------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
SQC+ + S C Y+ YG +S+S G L+ + LT + +P
Sbjct: 72 SQCSSLVDATLNNPLCETSSNVCVYTASYGD---SSYSMGYLSQDLLTLAPSQ----TLP 124
Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK-INF 205
++GCG S + GI+GLG S++ Q+ + FSYCLP +G ++
Sbjct: 125 GFVYGCGQD---SEGLFGRAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTRGGGGFLSI 181
Query: 206 GGI-VAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSSTG-NIFVDTGVLRTL 259
G +AG+ TP+ Y+L L AI+VG + L ++ +D+G + T
Sbjct: 182 GKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPTIIDSGTVITR 241
Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCY--NISSQPKFPEVTIHFRG-AD 315
LP+ ++ + ++ ++ + PGFS + C+ N+ PEV + F+G AD
Sbjct: 242 LPMSVYTPFQQAFVKIMSSKYAR----APGFSILDTCFKGNLKDMQSVPEVRLIFQGGAD 297
Query: 316 VKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ L P N+ + + + C AF G N + G Q F + +DI A + F C
Sbjct: 298 LNLRPVNVLLQVDEGLTCLAFAGNNGVAIIGNHQQQTFKVAHDISTARIGFATGGC 353
>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 490
Score = 148 bits (373), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 117/347 (33%), Positives = 168/347 (48%), Gaps = 37/347 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + + +GTP D+ DTGSD TWTQCEPC C+KQ+ +FDP KS++Y++I+C+S
Sbjct: 146 YFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCAR-SCYKQQDVIFDPSKSTSYSNITCTS 204
Query: 95 SQCAVVTS--------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
+ C +++ + S C Y YG +SFS G + E LT +T +
Sbjct: 205 ALCTQLSTATGNDPGCSASTKACIYGIQYGD---SSFSVGYFSRERLTVTATD----VVD 257
Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSS--KIN 204
N +FGCG N + G+IGLG S + Q FSYCLP SS ++
Sbjct: 258 NFLFGCGQNNQGLFGGSA---GLIGLGRHPISFVQQTAAKYRKIFSYCLPSTSSSTGHLS 314
Query: 205 FGGIVAGAGVVSTPL--IIR--DHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRT 258
FG G + TP I R Y L + AI+VG +L SS STG +D+G + T
Sbjct: 315 FGPAATGRYLKYTPFSTISRGSSFYGLDITAIAVGGVKLPVSSSTFSTGGAIIDSGTVIT 374
Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFR---GAD 315
LP + L+S + P G E D CY++S F TI F G
Sbjct: 375 RLPPTAYGALRSAFRQGMSKYPSAG---ELSILDT-CYDLSGYKVFSIPTIEFSFAGGVT 430
Query: 316 VKLSPSNLFRNISDEIMCSAF--RGGNANI-VYGRIMQINFLIGYDI 359
VKL P + S + +C AF G ++++ +YG + Q + YD+
Sbjct: 431 VKLPPQGILFVASTKQVCLAFAANGDDSDVTIYGNVQQRTIEVVYDV 477
>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
Length = 525
Score = 148 bits (373), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 114/354 (32%), Positives = 175/354 (49%), Gaps = 31/354 (8%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP DTGSD TW QCEPC + C++Q+ LFDP +SST +ISC++
Sbjct: 186 YVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVV-CYEQQEKLFDPARSSTDANISCAA 244
Query: 95 SQCA-VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
C+ + T CS G C Y YG G+Y S G A +TLT +S + FGCG
Sbjct: 245 PACSDLYTKGCSGGHCLYGVQYGDGSY---SIGFFAMDTLTLSSYDAI----KGFRFGCG 297
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--GSSKINF--GGIV 209
+N + G++GLG G +SL Q G F++C P + G+ ++F G
Sbjct: 298 ERNEG---LFGEAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFPARSSGTGYLDFGPGSSP 354
Query: 210 AGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEY 264
A + ++TP+++ + YY+ L I VG + L S +T VD+G + T LP
Sbjct: 355 AVSTKLTTPMLVDNGLTFYYVGLTGIRVGGKLLSIPPSVFTTAGTIVDSGTVITRLPPAA 414
Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR-GADVKLSP 320
+S+L+S ++ I A +G P S + CY+ + SQ P V++ F+ GA + +
Sbjct: 415 YSSLRSAFASAIAA---RGYKKAPALSLLDTCYDFTGMSQVAIPTVSLLFQGGASLDVDA 471
Query: 321 SNLFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
S + S C F + + G F + YDI + +V F P C
Sbjct: 472 SGIIYAASVSQACLGFAANEEDDDVGIVGNTQLKTFGVVYDIGKKVVGFSPGAC 525
>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
Length = 502
Score = 147 bits (372), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 114/357 (31%), Positives = 172/357 (48%), Gaps = 38/357 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +GTP +++ +DTGSD W QC PC E C++Q P+FDP SST+ S++CS
Sbjct: 164 YFSRIGVGTPAKEMYVVLDTGSDVNWIQCLPCSE--CYQQSDPIFDPTSSSTFKSLTCSD 221
Query: 95 SQCAVV-TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
+CA + S C C Y YG G SF+ GN AT+T+TF G ++ +V GCG
Sbjct: 222 PKCASLDVSACRSNKCLYQVSYGDG---SFTVGNYATDTVTF----GESGKVNDVALGCG 274
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---INFGGIVA 210
H N T + G+ G S+ +Q+ A FSYCL D+ S+K ++F +
Sbjct: 275 HDNEGLFTGAAGLLGLGGG---ALSMTNQIK---AKSFSYCLVDRDSAKSSSLDFNSVQI 328
Query: 211 GAGVVSTPLI----IRDHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTL 259
GAG + PL+ + YY+ L SVG Q++ E +S G + +D G T
Sbjct: 329 GAGDATAPLLRNSKMDTFYYVGLSGFSVGGQQVSIPSSLFEVDASGAGGVILDCGTAVTR 388
Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFRGAD-V 316
L + +++L+ +K G P CY+ S S K P VT HF G +
Sbjct: 389 LQTQAYNSLRDA---FVKLTTDFKKGTSPISLFDTCYDFSSLSTVKVPTVTFHFTGGKSL 445
Query: 317 KLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
L N I D C AF ++++ + G + Q I YD+ ++ ++C
Sbjct: 446 NLPAKNYLIPIDDAGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLANNLIGLSANKC 502
>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
Length = 393
Score = 147 bits (372), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 119/358 (33%), Positives = 173/358 (48%), Gaps = 43/358 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+M +S+GTP DTGSD W Q EPC C +FDP++SST+ + CSS
Sbjct: 55 YVMDISVGTPGKRFRAIADTGSDLVWVQSEPC--TGCSGGT--IFDPRQSSTFREMDCSS 110
Query: 95 SQCAVVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
CA + +C G CSYS+ YG G + G A +T++ +TS + P+ GC
Sbjct: 111 QLCAELPGSCEPGSSTCSYSYEYGSGE----TEGEFARDTISLGTTSDGSQKFPSFAVGC 166
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD----QGSSKINFG-- 206
G N D G++GLG G SL SQ+ +I KFSYCL D SS + FG
Sbjct: 167 GMVNSGFDGVD----GLVGLGQGPVSLTSQLSAAIDSKFSYCLVDINSQSESSPLLFGPS 222
Query: 207 GIVAGAGVVSTPL-----IIRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLP 261
+ G G+ ST + +Y L++ I+V Q + S G +D+G T +P
Sbjct: 223 AALHGTGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTM----GSPGTTIIDSGTTLTYVP 278
Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFRGADVKLS 319
+ + S M +M+ V G+ G LCY+ SS KFP +TI GA +
Sbjct: 279 SGVYGRVLSRMESMVTLPRVD--GSSMGLD--LCYDRSSNRNYKFPALTIRLAGATMTPP 334
Query: 320 PSNLFRNISD--EIMCSAFRGGNAN----IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
SN F + D + +C A G+A+ + G +MQ + I YD + +SF ++C
Sbjct: 335 SSNYFLVVDDSGDTVCLAM--GSASGLPVSIIGNVMQQGYHILYDRGSSELSFVQAKC 390
>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
Length = 506
Score = 147 bits (372), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 119/366 (32%), Positives = 175/366 (47%), Gaps = 54/366 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + IG+P ++ +DTGSD TW QC+PC DC++Q P+FDP S++Y ++SC S
Sbjct: 166 YFSRVGIGSPARQLYMVLDTGSDVTWVQCQPC--ADCYQQSDPVFDPSLSASYAAVSCDS 223
Query: 95 SQCA-VVTSNC--SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
+C + T+ C + G C Y YG G+Y + G+ ATETLT ++ PV NV G
Sbjct: 224 QRCRDLDTAACRNATGACLYEVAYGDGSY---TVGDFATETLTLGDST--PVG--NVAIG 276
Query: 152 CGHKN--LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFG 206
CGH N L + G L S SQ+ S FSYCL D+ S S + FG
Sbjct: 277 CGHDNEGLFVGAAGLLALGGGPL-----SFPSQISAS---TFSYCLVDRDSPAASTLQFG 328
Query: 207 GIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEF--------VSSSTGNIFVDTG 254
A AG V+ PL+ YY++L ISVG Q L +S +G + VD+G
Sbjct: 329 DGAAEAGTVTAPLVRSPRTSTFYYVALSGISVGGQPLSIPASAFAMDATSGSGGVIVDSG 388
Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNISSQP--KFPEVT 308
T L ++ L+ V+G + P S V CY++S + + P V+
Sbjct: 389 TAVTRLQSAAYAALRDAF--------VQGAPSLPRTSGVSLFDTCYDLSDRTSVEVPAVS 440
Query: 309 IHFRGAD-VKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVS 365
+ F G ++L N + C AF NA + + G + Q + +D + V
Sbjct: 441 LRFEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTARGAVG 500
Query: 366 FKPSRC 371
F P++C
Sbjct: 501 FTPNKC 506
>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
Length = 475
Score = 147 bits (371), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 122/376 (32%), Positives = 175/376 (46%), Gaps = 58/376 (15%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
+LM LS+GTP + VDTGSD WTQC+PC E CF Q P+FDP SSTY ++ CSS
Sbjct: 116 FLMDLSVGTPALPYAAIVDTGSDLVWTQCKPCVE--CFNQTTPVFDPAASSTYAALPCSS 173
Query: 95 SQCA---------VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
+ CA +S+ + C Y++ YG AS + G LATET T ++
Sbjct: 174 ALCADLPTSTCASSSSSSSASSPCGYTYTYGD---ASSTQGVLATETFTLARQ-----KV 225
Query: 146 PNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD----QGSS 201
P V FGCG N ++ G++GLG G SL+SQ+G +FSYCL G S
Sbjct: 226 PGVAFGCGDTNEGD--GFTQGAGLVGLGRGPLSLVSQLGID---RFSYCLTSLDDAAGRS 280
Query: 202 KINFGGIVAGAGVV------STPLIIR----DHYYLSLEAISVGNQRLEFVSSS------ 245
+ G + +TPL+ YY+SL ++VG+ RL SS+
Sbjct: 281 PLLLGSAAGISASAATAPAQTTPLVKNPSQPSFYYVSLTGLTVGSTRLALPSSAFAIQDD 340
Query: 246 -TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY-------N 297
TG + VD+G T L L + L+ + V +E G LC+ +
Sbjct: 341 GTGGVIVDSGTSITYLELRAYRALRKAFVAHMSLPTVD--ASEIGLD--LCFQGPAGAVD 396
Query: 298 ISSQPKFPEVTIHFR-GADVKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLI 355
Q + P++ +HF GAD+ L N + + + +C + G Q NF
Sbjct: 397 QDVQVQVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVMASRGLSIIGNFQQQNFQF 456
Query: 356 GYDIEQAMVSFKPSRC 371
YD+ +SF P+ C
Sbjct: 457 VYDVAGDTLSFAPAEC 472
>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
Length = 440
Score = 147 bits (371), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 118/366 (32%), Positives = 173/366 (47%), Gaps = 48/366 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YL+HL+IGTPP + ++DTGSD WTQC+PC CF Q P +D +SST+ SC S
Sbjct: 91 YLLHLAIGTPPQPVQLTLDTGSDLVWTQCQPCAV--CFNQSLPYYDASRSSTFALPSCDS 148
Query: 95 SQCAVVTS-----NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
+QC + S N + C++S+ YG S + G L ET++F + + +P V+
Sbjct: 149 TQCKLDPSVTMCVNQTVQTCAFSYSYGD---KSATIGFLDVETVSFVAGA----SVPGVV 201
Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK------- 202
FGCG N S +TGI G G G SL SQ+ G FS+C K
Sbjct: 202 FGCGLNNTG--IFRSNETGIAGFGRGPLSLPSQLKV---GNFSHCFTAVSGRKPSTVLFD 256
Query: 203 INFGGIVAGAGVVSTPLIIRD-----HYYLSLEAISVGNQRLEFVSSS------TGNIFV 251
+ G G V T +I++ YYLSL+ I+VG+ RL S+ TG +
Sbjct: 257 LPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALKNGTGGTII 316
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYN---ISSQPKFPEVT 308
D+G T LP + + + +K V P +LC++ + P P++
Sbjct: 317 DSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGP----LLCFSAPPLGKAPHVPKLV 372
Query: 309 IHFRGADVKLSPSNLFRNISDEIMCS---AFRGGNANIVYGRIMQINFLIGYDIEQAMVS 365
+HF GA + L N D CS A G I+ G Q N + YD++ + +S
Sbjct: 373 LHFEGATMHLPRENYVFEAKDGGNCSICLAIIEGEMTII-GNFQQQNMHVLYDLKNSKLS 431
Query: 366 FKPSRC 371
F ++C
Sbjct: 432 FVRAKC 437
>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
Length = 490
Score = 147 bits (371), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 120/371 (32%), Positives = 177/371 (47%), Gaps = 52/371 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+ +++GTP V+ ++DT SD TW QC+PC C+ Q P+FDP+ S++Y +S ++
Sbjct: 138 YIAKIAVGTPGVEALLALDTASDLTWLQCQPCRR--CYPQSGPVFDPRHSTSYREMSFNA 195
Query: 95 SQCAVV----TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
+ C + + G C Y+ YG G S + G+ ETLTF V +P +
Sbjct: 196 ADCQALGRSGGGDAKRGTCVYTVGYGDG---STTVGDFIEETLTFAGG----VRLPRISI 248
Query: 151 GCGHKN---LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL------PDQGSS 201
GCGH N +P + GI+GLG G S +Q+ + G FSYCL P SS
Sbjct: 249 GCGHDNKGLFGAPAA-----GILGLGRGLMSFPNQIDHN--GTFSYCLVDFLSGPGSLSS 301
Query: 202 KINFG-GIVAGAGVVS-TPLIIR----DHYYLSLEAISVGNQRLEFVS---------SST 246
+ FG G V + VS TP ++ YY+ L ISVG R+ V+ +
Sbjct: 302 TLTFGAGAVDTSPPVSFTPTVLNLNMPTFYYVRLTGISVGGVRVPGVTERDLQLDPYTGR 361
Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ--PKF 304
G + VD+G T L ++ + + +G GF D CY + + K
Sbjct: 362 GGVIVDSGTAVTRLARPAYTAFRDAFRAVAVDLGQVSIGGPSGFFDT-CYTVGGRGMKKV 420
Query: 305 PEVTIHFRGA-DVKLSPSNLFRNI-SDEIMCSAFR--GGNANIVYGRIMQINFLIGYDIE 360
P V++HF G+ +VKL P N + S +C AF G ++ + G I Q F I YDI
Sbjct: 421 PTVSMHFAGSVEVKLQPKNYLIPVDSMGTVCFAFAATGDHSVSIIGNIQQQGFRIVYDI- 479
Query: 361 QAMVSFKPSRC 371
V F P+ C
Sbjct: 480 GGRVGFAPNSC 490
>gi|326499093|dbj|BAK06037.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 471
Score = 147 bits (371), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 125/385 (32%), Positives = 188/385 (48%), Gaps = 48/385 (12%)
Query: 24 YQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCE---PCPELDCFKQ---EPP 77
+ +E+ S YLM ++IGTPP + DTGSD W C P L + +PP
Sbjct: 89 FVSELTSTPFEYLMAVNIGTPPTRMVAIADTGSDLIWLNCSYGGDGPGLAAARDADAQPP 148
Query: 78 --LFDPKKSSTYNSISCSSSQCAVV-TSNC-SEGDCSYSFLYGRGAYASFSSGNLATETL 133
FDP KS+T+ + C S C+ + ++C ++ C YS+ YG G++ +SG L+TET
Sbjct: 149 GVQFDPSKSTTFRLVDCDSVACSELPEASCGADSKCRYSYSYGDGSH---TSGVLSTETF 205
Query: 134 TFNST-----SGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMG--TS 186
TF G + NV FGC + S S G++GLG G+ SL+SQ+G TS
Sbjct: 206 TFADAPGARGDGTTTRVANVNFGCSTTFVGS----SVGDGLVGLGGGDLSLVSQLGADTS 261
Query: 187 IAGKFSYCLPD---QGSSKINFG--GIVAGAGVVSTPLI---IRDHYYLSLEAISVGNQR 238
+ +FSYCL + SS +NFG V G V+TPLI ++ +Y + L ++ VGN+
Sbjct: 262 LGRRFSYCLVPYSVKASSALNFGPRAAVTDPGAVTTPLIPSQVKAYYIVELRSVKVGNKT 321
Query: 239 LEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI 298
E S + VD+G T LP L ++ IK P + P LC+++
Sbjct: 322 FEAPDRSP--LIVDSGTTLTFLPEALVDPLVKELTGRIKLPPAQ----SPERLLPLCFDV 375
Query: 299 SS------QPKFPEVTIHF-RGADVKLSPSNLFRNISDEIMCSAFRGGNANI---VYGRI 348
S P+VT+ GA V L N F + + +C A + + G I
Sbjct: 376 SGVREGQVAAMIPDVTVGLGGGAAVTLKAENTFVEVQEGTLCLAVSAMSEQFPASIIGNI 435
Query: 349 MQINFLIGYDIEQAMVSFKPSRCTN 373
Q N +GYD+++ V+F P+ C +
Sbjct: 436 AQQNMHVGYDLDKGTVTFAPAACAS 460
>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
Length = 509
Score = 147 bits (371), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 117/364 (32%), Positives = 175/364 (48%), Gaps = 50/364 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + IG+P +++ +DTGSD TW QC+PC DC++Q P+FDP S++Y ++SC S
Sbjct: 169 YFSRVGIGSPARELYMVLDTGSDVTWVQCQPC--ADCYQQSDPVFDPSLSASYAAVSCDS 226
Query: 95 SQCA-VVTSNC--SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
+C + T+ C + G C Y YG G+Y + G+ ATETLT ++ PV NV G
Sbjct: 227 PRCRDLDTAACRNATGACLYEVAYGDGSY---TVGDFATETLTLGDST--PVT--NVAIG 279
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFGGI 208
CGH N + + G S IS A FSYCL D+ S S + FG
Sbjct: 280 CGHDNEGLFVGAAGLLALGGGPLSFPSQIS------ASTFSYCLVDRDSPAASTLQFGAD 333
Query: 209 VAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSS--------TGNIFVDTGVL 256
A A V+ PL+ YY++L ISVG Q L SS+ +G + VD+G
Sbjct: 334 GAEADTVTAPLVRSPRTGTFYYVALSGISVGGQALSIPSSAFAMDATSGSGGVIVDSGTA 393
Query: 257 RTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNISSQP--KFPEVTIH 310
T L ++ L+ V+G + P S V CY++S + + P V++
Sbjct: 394 VTRLQSSAYAALRDAF--------VRGTPSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLR 445
Query: 311 FRGAD-VKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFK 367
F G ++L N + C AF NA + + G + Q + +D + +V F
Sbjct: 446 FEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKGVVGFT 505
Query: 368 PSRC 371
P++C
Sbjct: 506 PNKC 509
>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
Length = 460
Score = 147 bits (370), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 118/375 (31%), Positives = 172/375 (45%), Gaps = 55/375 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCE-PCPELDCFKQEPPLFDPKKSSTYNSISCS 93
YL+ +IGTPP+ + +DTGSD WTQC+ PC CF Q PL+ P +S TY ++SC
Sbjct: 100 YLVDFAIGTPPLALSAVLDTGSDLIWTQCDAPCRR--CFPQPAPLYAPARSVTYANVSCG 157
Query: 94 SSQCAVVTS--------------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTS 139
S C + S G C+Y + YG G S + G LATET TF
Sbjct: 158 SRLCDALPSLRPSSRCSASASAPAPERGGCTYYYSYGDG---SSTDGVLATETFTF---- 210
Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--- 196
G + ++ FGCG NL + S G++G+G G SL+SQ+G + KFSYC
Sbjct: 211 GAGTTVHDLAFGCGTDNLGGTDNSS---GLVGMGRGPLSLVSQLGVT---KFSYCFTPFN 264
Query: 197 DQGSSKINFGGIVA--GAGVVSTPLI-------IRDHYYLSLEAISVGNQRL-------E 240
D +S F G A STP + +YYLSLE I+VG+ L
Sbjct: 265 DTTTSSPLFLGSSASLSPAAKSTPFVPSPSGPRRSSYYYLSLEGITVGDTLLPIDPAVFR 324
Query: 241 FVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS 300
+S G + +D+G T LE + + + + GA G S
Sbjct: 325 LTASGRGGLIIDSGT--TFTALEERAFVVLARAVAARVALPLASGAHLGLSVCFAAPQGR 382
Query: 301 QPK---FPEVTIHFRGADVKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIG 356
P+ P + +HF GAD++L S+ + + + C V G + Q N +
Sbjct: 383 GPEAVDVPRLVLHFDGADMELPRSSAVVEDRVAGVACLGIVSARGMSVLGSMQQQNMHVR 442
Query: 357 YDIEQAMVSFKPSRC 371
YD+ + ++SF+P+ C
Sbjct: 443 YDVGRDVLSFEPANC 457
>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 120/365 (32%), Positives = 170/365 (46%), Gaps = 50/365 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y L +GTP +F +DTGSD W QC PC + C+ Q P+F+P KS ++ +I C S
Sbjct: 147 YFTRLGVGTPARYVFMVLDTGSDVVWIQCAPCKK--CYSQTDPVFNPTKSRSFANIPCGS 204
Query: 95 SQCAVVTS-NCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
C + S CS C Y YG G SF+ G +TETLTF T + V G
Sbjct: 205 PLCRRLDSPGCSTKKHICLYQVSYGDG---SFTYGEFSTETLTFRGT-----RVGRVALG 256
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK----INFGG 207
CGH N + G+ S SQ+G + KFSYCL D+ +S + FG
Sbjct: 257 CGHDNEGLFIGAAGLLGLGRG---RLSFPSQIGRRFSRKFSYCLVDRSASSKPSYMVFGD 313
Query: 208 IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSS------STGN--IFVDTGV 255
TPL+ + YY+ L +SVG R+ +++ STGN + +D+G
Sbjct: 314 SAISRTARFTPLVSNPKLDTFYYVELLGVSVGGTRVPGITASLFKLDSTGNGGVIIDSGT 373
Query: 256 LRTLLPLEYHSNLKSVM----SNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVT 308
T L + L+ SN+ +A P FS C+++S ++ K P V
Sbjct: 374 SVTRLTRPAYVALRDAFRVGASNLKRA---------PEFSLFDTCFDLSGKTEVKVPTVV 424
Query: 309 IHFRGADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSF 366
+HFRGADV L SN + + C AF G + + + G I Q F + YD+ + V F
Sbjct: 425 LHFRGADVSLPASNYLIPVDNSGSFCFAFAGTMSGLSIVGNIQQQGFRVVYDLAASRVGF 484
Query: 367 KPSRC 371
P C
Sbjct: 485 APRGC 489
>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
Length = 452
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 116/405 (28%), Positives = 187/405 (46%), Gaps = 55/405 (13%)
Query: 2 QNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWT 61
+N + N+ +TP + + ++ Y++ L+IGTPP + +DTGSD WT
Sbjct: 68 RNRARFSGKNEQQTPAGVLPVRPSGDLE-----YVVDLAIGTPPQPVSALLDTGSDLIWT 122
Query: 62 QCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA-VVTSNCSEGD-CSYSFLYGRGA 119
QC PC C Q PLF P +S++Y + C+ + C+ ++ +C D C+Y + YG G
Sbjct: 123 QCAPCAS--CLSQPDPLFAPGQSASYEPMRCAGTLCSDILHHSCERPDTCTYRYNYGDG- 179
Query: 120 YASFSSGNLATETLTFNSTSGLPVEMPNVI--FGCGHKNLASPTSDSKQTGIIGLGPGNS 177
+ + G ATE TF S+ G + V FGCG N+ S + S GI+G G
Sbjct: 180 --TMTVGVYATERFTFASSGGGGLTTTTVPLGFGCGSVNVGSLNNGS---GIVGFGRNPL 234
Query: 178 SLISQMGTSIAGKFSYCLPDQGSSK---INFGGIVAG------AGVVSTPLIIRDH---- 224
SL+SQ+ +FSYCL S + + FG + G V +TPL+
Sbjct: 235 SLVSQLSIR---RFSYCLTSYASRRQSTLLFGSLSDGVYGDATGRVQTTPLLQSPQNPTF 291
Query: 225 YYLSLEAISVGNQRLEFVSSS-------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIK 277
YY+ ++VG +RL S+ +G + VD+G TLLP + + ++
Sbjct: 292 YYVHFTGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALTLLPAAVLAEVVRAFRQQLR 351
Query: 278 AQPVKGVGAEPGFSDVLCYNI---------SSQPKFPEVTIHFRGADVKLSPSN-LFRNI 327
G E G +C+ + +SQ P + +HF+GAD+ L N + +
Sbjct: 352 LPFANGGNPEDG----VCFLVPAAWRRSSSTSQMPVPRMVLHFQGADLDLPRRNYVLDDH 407
Query: 328 SDEIMCSAFR-GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+C G+ G ++Q + + YD+E +S P+RC
Sbjct: 408 RRGRLCLLLADSGDDGSTIGNLVQQDMRVLYDLEAETLSIAPARC 452
>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 111/361 (30%), Positives = 165/361 (45%), Gaps = 45/361 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + + +G+PP + + +D+GSD W QCEPC + C+ Q P+F+P SS+Y +SC+S
Sbjct: 134 YFVRIGVGSPPRNQYVVIDSGSDIIWVQCEPCTQ--CYHQSDPVFNPADSSSYAGVSCAS 191
Query: 95 SQCAVV-TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
+ C+ V + C EG C Y YG G+Y + G LA ETLTF T + NV GCG
Sbjct: 192 TVCSHVDNAGCHEGRCRYEVSYGDGSY---TKGTLALETLTFGRT-----LIRNVAIGCG 243
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG---SSKINFGGIVA 210
H N + G++GLG G S + Q+G G FSYCL +G S + FG
Sbjct: 244 HHNQGMFVGAA---GLLGLGSGPMSFVGQLGGQAGGTFSYCLVSRGIQSSGLLQFGREAV 300
Query: 211 GAGVVSTPLI----IRDHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTL 259
G PLI + YY+ L + VG R+ + G + +DTG T
Sbjct: 301 PVGAAWVPLIHNPRAQSFYYVGLSGLGVGGLRVPISEDVFKLSELGDGGVVMDTGTAVTR 360
Query: 260 LPLEYHSNLKSVM----SNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHFRG 313
LP + + +N+ +A V CY++ + P V+ +F G
Sbjct: 361 LPTAAYEAFRDAFIAQTTNLPRASGVSIFDT--------CYDLFGFVSVRVPTVSFYFSG 412
Query: 314 ADVKLSPSNLFRNISDEI--MCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSR 370
+ P+ F D++ C AF ++ + + G I Q I D V F P+
Sbjct: 413 GPILTLPARNFLIPVDDVGSFCFAFAPSSSGLSIIGNIQQEGIEISVDGANGFVGFGPNV 472
Query: 371 C 371
C
Sbjct: 473 C 473
>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 458
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 113/354 (31%), Positives = 169/354 (47%), Gaps = 33/354 (9%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+ + +GTP VDTGS TW QC PC + C +Q P+F+PK SSTY S+ CS+
Sbjct: 122 YVTRMGLGTPATQYVMVVDTGSSLTWLQCSPC-LVSCHRQSGPVFNPKSSSTYASVGCSA 180
Query: 95 SQCAVV------TSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
QC+ + S CS + C Y YG +SFS G L+ +T++F STS +PN
Sbjct: 181 QQCSDLPSATLNPSACSSSNVCIYQASYGD---SSFSVGYLSKDTVSFGSTS-----LPN 232
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGG 207
+GCG N + G+IGL SL+ Q+ S+ F+YCLP SS G
Sbjct: 233 FYYGCGQDNEG---LFGRSAGLIGLARNKLSLLYQLAPSLGYSFTYCLPSSSSSGYLSLG 289
Query: 208 IVAGAGVVSTPLI---IRDH-YYLSLEAISVGNQRLEFVSSSTGNI--FVDTGVLRTLLP 261
TP++ + D Y++ L ++V L SS+ ++ +D+G + T LP
Sbjct: 290 SYNPGQYSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLPTIIDSGTVITRLP 349
Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFS--DVLCYNISSQPKFPEVTIHFR-GADVKL 318
+S L + A +KG +S D +S+ P VT+ F GA +KL
Sbjct: 350 TSVYSALSKAV-----AAAMKGTSRASAYSILDTCFKGQASRVSAPAVTMSFAGGAALKL 404
Query: 319 SPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
S NL ++ D C AF + + G Q F + YD++ + + F C+
Sbjct: 405 SAQNLLVDVDDSTTCLAFAPARSAAIIGNTQQQTFSVVYDVKSSRIGFAAGGCS 458
>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 543
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 118/383 (30%), Positives = 182/383 (47%), Gaps = 59/383 (15%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + + +GTPP ++ +DTGSD +W QC+PC DCF+Q + PK SSTY +ISC
Sbjct: 171 YFLDMFVGTPPKHVWLILDTGSDLSWIQCDPC--YDCFEQNGSHYYPKDSSTYRNISCYD 228
Query: 95 SQCAVVTS-----NCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE--- 144
+C +V+S +C + C Y + Y G S ++G+ A+ET T N T E
Sbjct: 229 PRCQLVSSSDPLQHCKAENQTCPYFYDYADG---SNTTGDFASETFTVNLTWPNGKEKFK 285
Query: 145 -MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD-----Q 198
+ +V+FGCGH N S G++GLG G S SQ+ + FSYCL D
Sbjct: 286 QVVDVMFGCGHWNKGFFYGAS---GLLGLGRGPISFPSQIQSIYGHSFSYCLTDLFSNTS 342
Query: 199 GSSKINFG---GIVAGAGVVSTPLIIRDH------YYLSLEAISVGNQRLEFVSSSTGN- 248
SSK+ FG ++ + T L+ + YYL +++I VG + L+ +S T +
Sbjct: 343 VSSKLIFGEDKELLNNHNLNFTTLLAGEETPDETFYYLQIKSIMVGGEVLD-ISEQTWHW 401
Query: 249 ------------IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY 296
+D+G T P + +K IK Q + A F CY
Sbjct: 402 SSEGAAADAGGGTIIDSGSTLTFFPDSAYDIIKEAFEKKIKLQQI----AADDFVMSPCY 457
Query: 297 NISS---QPKFPEVTIHFRGADVKLSPSN--LFRNISDEIMCSAFR---GGNANIVYGRI 348
N+S Q + P+ IHF V P+ ++ DE++C A + + G +
Sbjct: 458 NVSGAMMQVELPDFGIHFADGGVWNFPAENYFYQYEPDEVICLAIMKTPNHSHLTIIGNL 517
Query: 349 MQINFLIGYDIEQAMVSFKPSRC 371
+Q NF I YD++++ + + P RC
Sbjct: 518 LQQNFHILYDVKRSRLGYSPRRC 540
>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 434
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 113/357 (31%), Positives = 176/357 (49%), Gaps = 35/357 (9%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
+L ++SIG PPV +DTGSD TW QC PC C+ Q P F P +SSTY + SC S
Sbjct: 88 FLANISIGDPPVPQLLLIDTGSDLTWIQCLPC---KCYPQTIPFFHPSRSSTYRNASCES 144
Query: 95 SQCAV--VTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
+ A+ + + G+C Y Y S + G LA E LTF ++ + PN++FGC
Sbjct: 145 APHAMPQIFRDEKTGNCRYHLRYRD---FSNTRGILAKEKLTFQTSDEGLISKPNIVFGC 201
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGA 212
G N + ++ +G++GLGPG S++++ + KFSYC ++ G
Sbjct: 202 GQDN----SGFTQYSGVLGLGPGTFSIVTR---NFGSKFSYCFGSLIDPTYPHNFLILGN 254
Query: 213 GVV----STPL-IIRDHYYLSLEAISVGNQRLEFVS------SSTGNIFVDTGVLRTLLP 261
G TPL I +D YYL L+AIS+G + L+ S G +DTG T+L
Sbjct: 255 GARIEGDPTPLQIFQDRYYLDLQAISLGEKLLDIEPGIFQRYRSKGGTVIDTGCSPTILA 314
Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK---FPEVTIHFR-GADVK 317
E + L + + + + ++ V +++ CY + + FP VT HF GA++
Sbjct: 315 REAYETLSEEI-DFLLGEVLRRVKDWEQYTNH-CYEGNLKLDLYGFPVVTFHFAGGAELA 372
Query: 318 LSPSNLF-RNISDEIMCSAFRGGNAN--IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
L +LF + S + C A + V G + Q N+ +GY++ V F+ + C
Sbjct: 373 LDVESLFVSSESGDSFCLAMTMNTFDDMSVIGAMAQQNYNVGYNLRTMKVYFQRTDC 429
>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
gi|224034427|gb|ACN36289.1| unknown [Zea mays]
Length = 443
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 118/373 (31%), Positives = 179/373 (47%), Gaps = 44/373 (11%)
Query: 28 IISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTY 87
+++ D YLM + IGTP +DTGSD WTQC PC L C Q P FDP +S+TY
Sbjct: 83 VLASDGEYLMEMGIGTPTRYYSAILDTGSDLIWTQCAPC--LLCVDQPTPYFDPARSATY 140
Query: 88 NSISCSSSQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
S+ C+S C A+ C + C Y + YG A ++G LA ET TF T+ V +P
Sbjct: 141 RSLGCASPACNALYYPLCYQKVCVYQYFYGDSAS---TAGVLANETFTFG-TNETRVSLP 196
Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKI 203
+ FGCG+ N + S G++G G G+ SL+SQ+G+ +FSYCL S S++
Sbjct: 197 GISFGCGNLNAGLLANGS---GMVGFGRGSLSLVSQLGSP---RFSYCLTSFLSPVPSRL 250
Query: 204 NFGGI-------VAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEF--------VSS 244
FG + V STP ++ Y+L++ ISVG L +
Sbjct: 251 YFGVYATLNSTNASSEPVQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTD 310
Query: 245 STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK- 303
TG +D+G T L + +++ ++ I P+ V + D C+ P+
Sbjct: 311 GTGGTIIDSGTTITYLAEPAYDAVRAAFASQITL-PLLNV-TDASVLDT-CFQWPPPPRQ 367
Query: 304 ---FPEVTIHFRGADVKLSPSN--LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYD 358
P++ +HF GAD +L N L + +C A + + G NF + YD
Sbjct: 368 SVTLPQLVLHFDGADWELPLQNYMLVDPSTGGGLCLAMASSSDGSIIGSYQHQNFNVLYD 427
Query: 359 IEQAMVSFKPSRC 371
+E +++SF P+ C
Sbjct: 428 LENSLMSFVPAPC 440
>gi|242069057|ref|XP_002449805.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
gi|241935648|gb|EES08793.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
Length = 430
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 122/387 (31%), Positives = 181/387 (46%), Gaps = 70/387 (18%)
Query: 26 AEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSS 85
A + S YLM L+IGTPPV DTGSD TWTQC+PC CF Q+ P++D SS
Sbjct: 74 ARLRSGQAEYLMELAIGTPPVPFIALADTGSDLTWTQCKPCKL--CFGQDTPIYDTTTSS 131
Query: 86 TYNSISCSSSQC-AVVTSNCS--EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
+++ + CSS+ C + +S CS C Y + Y GAY+ +G
Sbjct: 132 SFSPLPCSSATCLPIWSSRCSTPSATCRYRYAYDDGAYSPECAG---------------- 175
Query: 143 VEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD----Q 198
+ + + FGCG N TG +GLG G+ SL++Q+G GKFSYCL D
Sbjct: 176 ISVGGIAFGCGVDNGG---LSYNSTGTVGLGRGSLSLVAQLG---VGKFSYCLTDFFNTS 229
Query: 199 GSSKINFGGIVAGAG---------VVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSS 245
SS + FG + A V STPL+ + YY+SLE IS+G+ RL + +
Sbjct: 230 LSSPVFFGSLAELAASSASADAAVVQSTPLVQSPYNPSRYYVSLEGISLGDARLPIPNGT 289
Query: 246 --------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNM--IKAQPVKGVGAEPGFSDVLC 295
+G + VD+G + T+L + + V+ ++ + QPV + D C
Sbjct: 290 FDLNDDDGSGGMIVDSGTIFTIL---VETGFRVVVDHVAGVLGQPVVNASSL----DRPC 342
Query: 296 Y-----NISSQPKFPEVTIHFR-GADVKLSPSNLFR-NISDEIMCSAFRGGN--ANIVYG 346
+ + P P++ +HF GAD++L N N + C G + V G
Sbjct: 343 FPAPAAGVQELPDMPDMVLHFAGGADMRLHRDNYMSFNEEESSFCLNIVGTESASGSVLG 402
Query: 347 RIMQINFLIGYDIEQAMVSFKPSRCTN 373
Q N + +DI +SF P+ C+
Sbjct: 403 NFQQQNIQMLFDITVGQLSFMPTDCSK 429
>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
gi|223948083|gb|ACN28125.1| unknown [Zea mays]
gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
Length = 466
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 113/353 (32%), Positives = 170/353 (48%), Gaps = 32/353 (9%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ +S+GTP V S+DTGSD +W QC PC C Q+ LFDP KS+TY++ SCSS
Sbjct: 130 YVITVSLGTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKDKLFDPAKSATYSAFSCSS 189
Query: 95 SQCAVV---TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
+QCA + + C C Y Y S ++G ++TL ++ + N FG
Sbjct: 190 AQCAQLGGEGNGCLNSHCQYIVKY---VDHSNTTGTYGSDTLGLTTSDAV----KNFQFG 242
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAG 211
C H+ + + G++GLG SL+SQ + FSYCLP SS F + A
Sbjct: 243 CSHR---ANGFVGQLDGLMGLGGDTESLVSQTAATYGKAFSYCLPPSSSSAGGFLTLGAA 299
Query: 212 AGVVS------TPLI---IRDHYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLLP 261
AG S TPL+ + Y + L+AI+V +L +S +G VD+G + T LP
Sbjct: 300 AGGTSSSRYSRTPLVRFNVPTFYGVFLQAITVAGTKLNVPASVFSGASVVDSGTVITQLP 359
Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHF-RGADVKL 318
+ L++ +KA P A G D C++ S + P VT+ F RGA + L
Sbjct: 360 PTAYQALRTAFKKEMKAYP---SAAPVGILDT-CFDFSGIKTVRVPVVTLTFSRGAVMDL 415
Query: 319 SPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
S +F +A G + G + Q F + +D+ + + F+P C
Sbjct: 416 DVSGIFYAGCLAFTATAQDGDTG--ILGNVQQRTFEMLFDVGGSTLGFRPGAC 466
>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 365
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 112/374 (29%), Positives = 169/374 (45%), Gaps = 40/374 (10%)
Query: 24 YQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKK 83
+ A + + YL + +GTP VDTGSD TW QC PC + C+ Q LF P
Sbjct: 2 FTAPVAAARGEYLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGK--CYSQNDALFLPNT 59
Query: 84 SSTYNSISCSSSQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
S+++ ++C S+ C + C++ C Y + YG G S ++G+ +T+T + +G
Sbjct: 60 STSFTKLACGSALCNGLPFPMCNQTTCVYWYSYGDG---SLTTGDFVYDTITMDGINGQK 116
Query: 143 VEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK 202
++PN FGCGH N S + GI+GLG G S SQ+ + GKFSYCL D +
Sbjct: 117 QQVPNFAFGCGHDNEG---SFAGADGILGLGQGPLSFHSQLKSVYNGKFSYCLVDWLAPP 173
Query: 203 INFGGIVAGAGVVST-------PLI----IRDHYYLSLEAISVGNQRLEFVSS------- 244
++ G V P++ + +YY+ L ISVG+ L S+
Sbjct: 174 TQTSPLLFGDAAVPILPDVKYLPILANPKVPTYYYVKLNGISVGDNLLNISSTVFDIDSV 233
Query: 245 -STGNIFVDTGVLRTLLPLEYHSNLKSVM--SNMIKAQPVKGVGAEPGFSDVLC---YNI 298
G IF D+G T L + + + M S M ++ + + LC +
Sbjct: 234 GGAGTIF-DSGTTVTQLAEAAYKEVLAAMNASTMAYSRKIDDISRLD-----LCLSGFPK 287
Query: 299 SSQPKFPEVTIHFRGADVKLSPSNLFRNI-SDEIMCSAFRGGNANIVYGRIMQINFLIGY 357
P P +T HF G D+ L PSN F + S + C A + G + Q NF + Y
Sbjct: 288 DQLPTVPAMTFHFEGGDMVLPPSNYFIYLESSQSYCFAMTSSPDVNIIGSVQQQNFQVYY 347
Query: 358 DIEQAMVSFKPSRC 371
D + F P C
Sbjct: 348 DTAGRKLGFVPKDC 361
>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 455
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 109/352 (30%), Positives = 166/352 (47%), Gaps = 27/352 (7%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+ + +GTP VDTGS TW QC PC + C +Q P+FDPK SS+Y ++SCSS
Sbjct: 117 YVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPC-RVSCHRQSGPVFDPKTSSSYAAVSCSS 175
Query: 95 SQC---AVVTSN---CSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
QC + T N CS + C Y YG +SFS G L+ +T++F + S +PN
Sbjct: 176 PQCDGLSTATLNPAVCSPSNVCIYQASYGD---SSFSVGYLSKDTVSFGANS-----VPN 227
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGG 207
+GCG N + G++GL SL+ Q+ ++ FSYCLP SS G
Sbjct: 228 FYYGCGQDNEG---LFGRSAGLMGLARNKLSLLYQLAPTLGYSFSYCLPSTSSSGYLSIG 284
Query: 208 IVAGAGVVSTPLI---IRDH-YYLSLEAISVGNQRLEFVSSSTGNI--FVDTGVLRTLLP 261
G TP++ + D Y++SL ++V + L SS ++ +D+G + T LP
Sbjct: 285 SYNPGGYSYTPMVSNTLDDSLYFISLSGMTVAGKPLAVSSSEYTSLPTIIDSGTVITRLP 344
Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFR-GADVKLSP 320
++ L ++ +K K A S P V++ F GA +KLS
Sbjct: 345 TSVYTALSKAVAAAMKGS-TKRAAAYSILDTCFEGQASKLRAVPAVSMAFSGGATLKLSA 403
Query: 321 SNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
NL ++ C AF + + G Q F + YD++ + F + C+
Sbjct: 404 GNLLVDVDGATTCLAFAPARSAAIIGNTQQQTFSVVYDVKSNRIGFAAAGCS 455
>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 481
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 125/360 (34%), Positives = 173/360 (48%), Gaps = 38/360 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + + +GTP D+ DTGSD TWTQCEPC C+KQ+ +FDP KSS+Y +I+C+S
Sbjct: 136 YFVVVGLGTPKRDLSLVFDTGSDLTWTQCEPCAG-SCYKQQDAIFDPSKSSSYINITCTS 194
Query: 95 SQCAVVT-----SNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
S C +T S CS C Y YG S S G L+ E LT +T + +
Sbjct: 195 SLCTQLTSAGIKSRCSSSTTACIYGIQYGD---KSTSVGFLSQERLTITATD----IVDD 247
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSS--KINF 205
+FGCG N S G+IGLG S + Q + FSYCLP SS + F
Sbjct: 248 FLFGCGQDNEGLF---SGSAGLIGLGRHPISFVQQTSSIYNKIFSYCLPSTSSSLGHLTF 304
Query: 206 GGIVA-GAGVVSTPL--IIRDH--YYLSLEAISVGNQRLEFVSSST---GNIFVDTGVLR 257
G A A + TPL I D+ Y L + ISVG +L VSSST G +D+G +
Sbjct: 305 GASAATNANLKYTPLSTISGDNTFYGLDIVGISVGGTKLPAVSSSTFSAGGSIIDSGTVI 364
Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKF--PEVTIHFRGA- 314
T L ++ L+S ++ PV E G D CY+ S + P++ F G
Sbjct: 365 TRLAPTAYAALRSAFRQGMEKYPVAN---EDGLFDT-CYDFSGYKEISVPKIDFEFAGGV 420
Query: 315 DVKLSPSNLFRNISDEIMCSAFRG-GNAN--IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
V+L + S + +C AF GN N ++G + Q + YD+E + F + C
Sbjct: 421 TVELPLVGILIGRSAQQVCLAFAANGNDNDITIFGNVQQKTLEVVYDVEGGRIGFGAAGC 480
>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
Length = 475
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 117/359 (32%), Positives = 173/359 (48%), Gaps = 37/359 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP D+ DTGSD TWTQC+PC C+ Q+ P+F+P KS++Y ++SCSS
Sbjct: 133 YIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVR-TCYDQKEPIFNPSKSTSYYNVSCSS 191
Query: 95 SQCAVVTS------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
+ C ++S +CS +C Y YG SFS G LA + T S+ V
Sbjct: 192 AACGSLSSATGNAGSCSASNCIYGIQYGD---QSFSVGFLAKDKFTLTSSD----VFDGV 244
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS--SKINFG 206
FGCG N T + G++GLG S SQ T+ FSYCLP S + FG
Sbjct: 245 YFGCGENNQGLFTGVA---GLLGLGRDKLSFPSQTATAYNKIFSYCLPSSASYTGHLTFG 301
Query: 207 GIVAGAGVVSTPL-IIRD---HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLL 260
V TP+ I D Y L++ AI+VG Q+L S+ ST +D+G + T L
Sbjct: 302 SAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTPGALIDSGTVITRL 361
Query: 261 PLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR-GADV 316
P + ++ L+S + P G S + C+++S P+V F GA V
Sbjct: 362 PPKAYAALRSSFKAKMSKYPT-----TSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVV 416
Query: 317 KLSPSNLFRNISDEIMCSAFRGGNAN---IVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
+L +F +C AF G + + ++G + Q + YD V F P+ C+
Sbjct: 417 ELGSKGIFYAFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGCS 475
>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
Length = 446
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 117/359 (32%), Positives = 173/359 (48%), Gaps = 37/359 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP D+ DTGSD TWTQC+PC C+ Q+ P+F+P KS++Y ++SCSS
Sbjct: 104 YIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVR-TCYDQKEPIFNPSKSTSYYNVSCSS 162
Query: 95 SQCAVVTS------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
+ C ++S +CS +C Y YG SFS G LA E T ++ V
Sbjct: 163 AACGSLSSATGNAGSCSASNCIYGIQYGD---QSFSVGFLAKEKFTLTNSD----VFDGV 215
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS--SKINFG 206
FGCG N T + G++GLG S SQ T+ FSYCLP S + FG
Sbjct: 216 YFGCGENNQGLFTGVA---GLLGLGRDKLSFPSQTATAYNKIFSYCLPSSASYTGHLTFG 272
Query: 207 GIVAGAGVVSTPL-IIRD---HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLL 260
V TP+ I D Y L++ AI+VG Q+L S+ ST +D+G + T L
Sbjct: 273 SAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTPGALIDSGTVITRL 332
Query: 261 PLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR-GADV 316
P + ++ L+S + P G S + C+++S P+V F GA V
Sbjct: 333 PPKAYAALRSSFKAKMSKYPT-----TSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVV 387
Query: 317 KLSPSNLFRNISDEIMCSAFRGGNAN---IVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
+L +F +C AF G + + ++G + Q + YD V F P+ C+
Sbjct: 388 ELGSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGCS 446
>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
Length = 455
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 121/399 (30%), Positives = 189/399 (47%), Gaps = 71/399 (17%)
Query: 21 SIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQ--EPPL 78
S+ QA++ + Y M++S+GTPP+D VDTGS+ W QC PC CF + P+
Sbjct: 77 SVNVQAQLENGAGAYNMNISLGTPPLDFPVIVDTGSNLIWAQCAPCTR--CFPRPTPAPV 134
Query: 79 FDPKKSSTYNSISCSSSQCAVVTSNC------SEGDCSYSFLYGRGAYASFSSGNLATET 132
P +SST++ + C+ S C + ++ + C+Y++ YG G +++G LATET
Sbjct: 135 LQPARSSTFSRLPCNGSFCQYLPTSSRPRTCNATAACAYNYTYGSG----YTAGYLATET 190
Query: 133 LTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFS 192
LT + P V FGC +N +S GI+GLG G SL+SQ+ G+FS
Sbjct: 191 LTVGDGT-----FPKVAFGCSTENGVDNSS-----GIVGLGRGPLSLVSQLAV---GRFS 237
Query: 193 YCL----PDQGSSKINFGGIVA---GAGVVSTPLIIR------DHYYLSLEAISVGNQRL 239
YCL D G+S I FG + G+ V STPL+ HYY++L I+V + L
Sbjct: 238 YCLRSDMADGGASPILFGSLAKLTEGSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTEL 297
Query: 240 EFVSSS--------TGNIFVDTGVLRTLLPLEYHSNLK----SVMSNMIKAQPVKGVGAE 287
S+ G VD+G T L + ++ +K S M+N+ + P G
Sbjct: 298 PVTGSTFGFTQTGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGA--- 354
Query: 288 PGFSDVLCYNISS-----QPKFPEVTIHFR-GADVKLSPSNLFRNISDE------IMCSA 335
+ LCY S+ + P + + F GA + N F + + + C
Sbjct: 355 -PYDLDLCYKPSAGGGGKAVRVPRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLL 413
Query: 336 FRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
++ + G +MQ++ + YDI+ M SF P+ C
Sbjct: 414 VLPATDDLPISIIGNLMQMDMHLLYDIDGGMFSFAPADC 452
>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 384
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 118/366 (32%), Positives = 172/366 (46%), Gaps = 48/366 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YL+HL+IGTPP + ++DTGS WTQC+PC CF Q P +D +SST+ SC S
Sbjct: 35 YLLHLAIGTPPQPVQLTLDTGSVLVWTQCQPCAV--CFNQSLPYYDASRSSTFALPSCDS 92
Query: 95 SQCAVVTS-----NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
+QC + S N + C+YS+ YG S + G L ET++F + + +P V+
Sbjct: 93 TQCKLDPSVTMCVNQTVQTCAYSYSYGD---KSATIGFLDVETVSFVAGA----SVPGVV 145
Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK------- 202
FGCG N S +TGI G G G SL SQ+ G FS+C K
Sbjct: 146 FGCGLNNTG--IFRSNETGIAGFGRGPLSLPSQLK---VGNFSHCFTAVSGRKPSTVLFD 200
Query: 203 INFGGIVAGAGVVSTPLIIRD-----HYYLSLEAISVGNQRLEFVSSS------TGNIFV 251
+ G G V T +I++ YYLSL+ I+VG+ RL S+ TG +
Sbjct: 201 LPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALKNGTGGTII 260
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYN---ISSQPKFPEVT 308
D+G T LP + + + +K V P +LC++ + P P++
Sbjct: 261 DSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGP----LLCFSAPPLGKAPHVPKLV 316
Query: 309 IHFRGADVKLSPSNLFRNISDEIMCS---AFRGGNANIVYGRIMQINFLIGYDIEQAMVS 365
+HF GA + L N D CS A G I+ G Q N + YD++ + +S
Sbjct: 317 LHFEGATMHLPRENYVFEAKDGGNCSICLAIIEGEMTII-GNFQQQNMHVLYDLKNSKLS 375
Query: 366 FKPSRC 371
F ++C
Sbjct: 376 FVRAKC 381
>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 474
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 117/359 (32%), Positives = 173/359 (48%), Gaps = 37/359 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP D+ DTGSD TWTQC+PC C+ Q+ P+F+P KS++Y ++SCSS
Sbjct: 132 YIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRT-CYDQKEPIFNPSKSTSYYNVSCSS 190
Query: 95 SQCAVVTS------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
+ C ++S +CS +C Y YG SFS G LA E T ++ V
Sbjct: 191 AACGSLSSATGNAGSCSASNCIYGIQYGD---QSFSVGFLAKEKFTLTNSD----VFDGV 243
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS--SKINFG 206
FGCG N T + G++GLG S SQ T+ FSYCLP S + FG
Sbjct: 244 YFGCGENNQGLFTGVA---GLLGLGRDKLSFPSQTATAYNKIFSYCLPSSASYTGHLTFG 300
Query: 207 GIVAGAGVVSTPL-IIRD---HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLL 260
V TP+ I D Y L++ AI+VG Q+L S+ ST +D+G + T L
Sbjct: 301 SAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTPGALIDSGTVITRL 360
Query: 261 PLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR-GADV 316
P + ++ L+S + P G S + C+++S P+V F GA V
Sbjct: 361 PPKAYAALRSSFKAKMSKYPT-----TSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVV 415
Query: 317 KLSPSNLFRNISDEIMCSAFRGGNAN---IVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
+L +F +C AF G + + ++G + Q + YD V F P+ C+
Sbjct: 416 ELGSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGCS 474
>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
Length = 469
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 113/356 (31%), Positives = 172/356 (48%), Gaps = 35/356 (9%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y M SIGTPP + DTGSD WT+C+ + P SST+ + CS
Sbjct: 100 YDMEFSIGTPPQKLTALADTGSDLIWTKCDAGGGAA--WGGSSSYHPNASSTFTRLPCSD 157
Query: 95 SQCAVVTS----NCSEG--DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
CA + S C+ G +C Y + YG G F+ G L +ET T + +P V
Sbjct: 158 RLCAALRSYSLARCAAGGAECDYKYAYGLGDDPDFTQGFLGSETFTLGGDA-----VPGV 212
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS--SKINFG 206
FGC A + G++GLG G SL+SQ+ AG F YCL S S + FG
Sbjct: 213 GFGC---TTALEGDYGEGAGLVGLGRGPLSLVSQLD---AGTFMYCLTADASKASPLLFG 266
Query: 207 GIV----AGAGVVSTPLIIRDHYY-LSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLP 261
+ AGAGV ST L+ +Y ++L +I++G+ V G +F D+G T L
Sbjct: 267 ALATMTGAGAGVQSTGLLASTTFYAVNLRSITIGSATTAGVGGPGGVVF-DSGTTLTYLA 325
Query: 262 LEYHSNLKSV-MSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKF-PEVTIHFRG-ADVKL 318
++ K+ +S PV+G GF CY + P + +HF G AD+ L
Sbjct: 326 EPAYTEAKAAFLSQTTSLTPVEG---RYGFE--ACYEKPDSARLIPAMVLHFDGGADMAL 380
Query: 319 SPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTNY 374
+N + D ++C + + + G IMQ+N+L+ +D+ ++++SF+P+ C +Y
Sbjct: 381 PVANYVVEVDDGVVCWVVQRSPSLSIIGNIMQMNYLVLHDVRKSVLSFQPANCDSY 436
>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
Length = 440
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 118/366 (32%), Positives = 172/366 (46%), Gaps = 48/366 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YL+HL+IGTPP + ++DTGS WTQC+PC CF Q P +D +SST+ SC S
Sbjct: 91 YLLHLAIGTPPQPVQLTLDTGSVLVWTQCQPCAV--CFNQSLPYYDASRSSTFALPSCDS 148
Query: 95 SQCAVVTS-----NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
+QC + S N + C+YS+ YG S + G L ET++F + + +P V+
Sbjct: 149 TQCKLDPSVTMCVNQTVQTCAYSYSYGD---KSATIGFLDVETVSFVAGA----SVPGVV 201
Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK------- 202
FGCG N S +TGI G G G SL SQ+ G FS+C K
Sbjct: 202 FGCGLNNTG--IFRSNETGIAGFGRGPLSLPSQLKV---GNFSHCFTAVSGRKPSTVLFD 256
Query: 203 INFGGIVAGAGVVSTPLIIRD-----HYYLSLEAISVGNQRLEFVSSS------TGNIFV 251
+ G G V T +I++ YYLSL+ I+VG+ RL S+ TG +
Sbjct: 257 LPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALKNGTGGTII 316
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYN---ISSQPKFPEVT 308
D+G T LP + + + +K V P +LC++ + P P++
Sbjct: 317 DSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGP----LLCFSAPPLGKAPHVPKLV 372
Query: 309 IHFRGADVKLSPSNLFRNISDEIMCS---AFRGGNANIVYGRIMQINFLIGYDIEQAMVS 365
+HF GA + L N D CS A G I+ G Q N + YD++ + +S
Sbjct: 373 LHFEGATMHLPRENYVFEAKDGGNCSICLAIIEGEMTII-GNFQQQNMHVLYDLKNSKLS 431
Query: 366 FKPSRC 371
F ++C
Sbjct: 432 FVRAKC 437
>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 145 bits (365), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 116/376 (30%), Positives = 175/376 (46%), Gaps = 58/376 (15%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +GTPP VD+GSD W QC PC L C+ Q+ PL+ P SST+N + C S
Sbjct: 65 YFVDFFLGTPPQKFSLIVDSGSDLLWVQCAPC--LQCYAQDTPLYAPSNSSTFNPVPCLS 122
Query: 95 SQCAVVTSN----CS---EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
+C ++ + C G C+Y + Y A S S G A E+ T + V +
Sbjct: 123 PECLLIPATEGFPCDFHYPGACAYEYRY---ADTSLSKGVFAYESATVDD-----VRIDK 174
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----PDQGSSK 202
V FGCG N S + G++GLG G S SQ+G + KF+YCL P SS
Sbjct: 175 VAFGCGRDNQG---SFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYLDPTSVSSW 231
Query: 203 INFGG--IVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSST--------GN 248
+ FG I + TP++ YY+ +E + VG + L S+ G+
Sbjct: 232 LIFGDELISTIHDLQFTPIVSNSRNPTLYYVQIEKVMVGGESLPISHSAWSLDFLGNGGS 291
Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMI--KAQPVKGVGAEPGFSDVLCYNIS--SQPKF 304
IF + LP Y + L + N+ +A V+G+ LC +++ QP F
Sbjct: 292 IFDSGTTVTYWLPPAYRNILAAFDKNVRYPRAASVQGLD--------LCVDVTGVDQPSF 343
Query: 305 PEVTIHFRGADV-KLSPSNLFRNISDEIMCSAFRG-----GNANIVYGRIMQINFLIGYD 358
P TI G V + N F +++ + C A G G N + G ++Q NFL+ YD
Sbjct: 344 PSFTIVLGGGAVFQPQQGNYFVDVAPNVQCLAMAGLPSSVGGFNTI-GNLLQQNFLVQYD 402
Query: 359 IEQAMVSFKPSRCTNY 374
E+ + F P++C+++
Sbjct: 403 REENRIGFAPAKCSSH 418
>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 144 bits (364), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 113/371 (30%), Positives = 175/371 (47%), Gaps = 46/371 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YL+ L+IGTPP + +DTGSD WTQC PC C Q PLF P SS+Y + CS
Sbjct: 103 YLIDLAIGTPPQPVSALLDTGSDLIWTQCAPCAS--CLAQPDPLFAPAASSSYVPMRCSG 160
Query: 95 SQC-AVVTSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
C ++ +C D C+Y + YG G + + G ATE TF S+SG + +P + FGC
Sbjct: 161 QLCNDILHHSCQRPDTCTYRYNYGDG---TTTLGVYATERFTFASSSGEKLSVP-LGFGC 216
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKIN-------- 204
G N+ S + S GI+G G SL+SQ+ +FSYCL S++ +
Sbjct: 217 GTMNVGSLNNGS---GIVGFGRDPLSLVSQLSIR---RFSYCLTPYTSTRKSTLMFGSLS 270
Query: 205 ---FGGIVAGAGVVSTPLIIRDH-----YYLSLEAISVGNQRLEFVSSS-------TGNI 249
F G A G V T +++ YY+ ++VG +RL S+ +G +
Sbjct: 271 DGVFEGDDAATGQVQTTRLLQSRQNPTFYYVPFTGVTVGTRRLRIPLSAFALRPDGSGGV 330
Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIK------AQPVKGVG-AEPGFSDVLCYNISSQP 302
VD+G TL P + + ++ + P GV A P + + ++
Sbjct: 331 IVDSGTALTLFPAAVLTEVLRAFRAQLRLPFTSSSSPDDGVCFATPMAAGGRRASAATVV 390
Query: 303 KFPEVTIHFRGADVKLSPSN-LFRNISDEIMCSAFR-GGNANIVYGRIMQINFLIGYDIE 360
P + HF+GAD++L N + + +C G++ G +Q + + YD+E
Sbjct: 391 SVPRMAFHFQGADLELPRRNYVLDDPRRGSLCILLADSGDSGATIGNFVQQDMRVLYDLE 450
Query: 361 QAMVSFKPSRC 371
+SF P++C
Sbjct: 451 AETLSFAPAQC 461
>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
Length = 325
Score = 144 bits (364), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 102/340 (30%), Positives = 163/340 (47%), Gaps = 32/340 (9%)
Query: 48 IFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTS---NC 104
+F +DTGSD TW QC+PCP+ C+KQ+ LF P S+TY + C+S+ C + S +C
Sbjct: 1 MFLLIDTGSDITWIQCDPCPQ--CYKQQDSLFQPAGSATYKPLPCNSTMCQQLQSFSHSC 58
Query: 105 SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDS 164
C+Y YG S + G+ A ETLT S + V +PN FGCGH N +
Sbjct: 59 LNSSCNYMVSYGD---KSTTRGDFALETLTLRSDDTILVSVPNFAFGCGHANKG---LFN 112
Query: 165 KQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSS----KINFG-GIVAGAGVVSTPL 219
G++GLG + +Q + FSYCLP S+ ++FG + V TPL
Sbjct: 113 GAAGLMGLGKSSIGFPAQTSVAFGKVFSYCLPSVSSTIPSGILHFGEAAMLDYDVRFTPL 172
Query: 220 IIR----DHYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNM 275
+ Y++S+ I+VG++ L + + VD+G + + + L+ + +
Sbjct: 173 VDSSSGPSQYFVSMTGINVGDELLPI----SATVMVDSGTVISRFEQSAYERLRDAFTQI 228
Query: 276 IKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHFR-GADVKLSPSNLFRNISDEIM 332
+ V P + C+ +S+ P +T+HFR A+++LSP ++ + D +M
Sbjct: 229 LPGLQT-AVSVAPFDT---CFRVSTVDDINIPLITLHFRDDAELRLSPVHILYPVDDGVM 284
Query: 333 CSAFR-GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
C AF + V G Q N YDI ++ + C
Sbjct: 285 CFAFAPSSSGRSVLGNFQQQNLRFVYDIPKSRLGISAFEC 324
>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 472
Score = 144 bits (364), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 116/360 (32%), Positives = 162/360 (45%), Gaps = 40/360 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +GTP ++ +DTGSD W QC PC + C+ Q P+FDP KS TY I C +
Sbjct: 129 YFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRK--CYTQADPVFDPTKSRTYAGIPCGA 186
Query: 95 SQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
C + S N C Y YG G SF+ G+ +TETLTF T + V G
Sbjct: 187 PLCRRLDSPGCNNKNKVCQYQVSYGDG---SFTFGDFSTETLTFRRT-----RVTRVALG 238
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK----INFGG 207
CGH N + G+ S Q G KFSYCL D+ +S + FG
Sbjct: 239 CGHDNEGLFIGAAGLLGLGRG---RLSFPVQTGRRFNQKFSYCLVDRSASAKPSSVVFGD 295
Query: 208 IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS--------TGNIFVDTGV 255
TPLI + YYL L ISVG + +S+S G + +D+G
Sbjct: 296 SAVSRTARFTPLIKNPKLDTFYYLELLGISVGGSPVRGLSASLFRLDAAGNGGVIIDSGT 355
Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFRG 313
T L + L+ + A +K AE D C+++S ++ K P V +HFRG
Sbjct: 356 SVTRLTRPAYIALRDAFR--VGASHLK-RAAEFSLFDT-CFDLSGLTEVKVPTVVLHFRG 411
Query: 314 ADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
ADV L +N + + C AF G + + + G I Q F + +D+ + V F P C
Sbjct: 412 ADVSLPATNYLIPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRVSFDLAGSRVGFAPRGC 471
>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
Length = 474
Score = 144 bits (364), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 117/352 (33%), Positives = 165/352 (46%), Gaps = 32/352 (9%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ +S+GTP V VDTGSD +W QC PC C+ Q+ PLFDP +SS+Y ++ C
Sbjct: 140 YVVTVSLGTPGVAQTLEVDTGSDLSWVQCTPCAAPACYSQKDPLFDPAQSSSYAAVPCGG 199
Query: 95 SQC---AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
C + S+CS C Y YG G S ++G +++TLT + + FG
Sbjct: 200 PVCGGLGIYASSCSAAQCGYVVSYGDG---SKTTGVYSSDTLTLSPND----AVRGFFFG 252
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFGGIV 209
CGH +D G++GLG +SL+ Q + G FSYCLP + S+ + GG
Sbjct: 253 CGHAQSGFTGND----GLLGLGREEASLVEQTAGTYGGVFSYCLPTRPSTTGYLTLGGPS 308
Query: 210 AGA--GVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLLPL 262
A G +T L+ +Y + L ISVG Q+L SS G VDTG + T LP
Sbjct: 309 GAAPPGFSTTQLLSSPNAATYYVVMLTGISVGGQQLSVPSSVFAGGTVVDTGTVITRLPP 368
Query: 263 EYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFR-GADVKLS 319
++ L+S + + + A G D CYN S P V + F GA V L
Sbjct: 369 TAYAALRSAFRSGMASYGYPSAPAT-GILDT-CYNFSGYGTVTLPNVALTFSGGATVTLG 426
Query: 320 PSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ S GG A + G + Q +F + I+ V FKPS C
Sbjct: 427 ADGILSFGCLAFAPSGSDGGMA--ILGNVQQRSFEV--RIDGTSVGFKPSSC 474
>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
Length = 485
Score = 144 bits (364), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 112/373 (30%), Positives = 167/373 (44%), Gaps = 52/373 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +GTP +DTGSD W QC PC C++Q P+FDP++SS+Y ++ C +
Sbjct: 129 YFTKIGVGTPATQALMVLDTGSDVVWVQCAPCRR--CYEQSGPVFDPRRSSSYGAVGCGA 186
Query: 95 SQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
+ C + S + G C Y YG G S ++G+ TETLTF + + V G
Sbjct: 187 ALCRRLDSGGCDLRRGACMYQVAYGDG---SVTAGDFVTETLTFAGGA----RVARVALG 239
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ------------G 199
CGH N + + G+ S +Q+ FSYCL D+
Sbjct: 240 CGHDNEGLFVAAAGLLGLGRG---GLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHR 296
Query: 200 SSKINFGGIVAGAGVVS-TPLI----IRDHYYLSLEAISVGNQRLEFVSSS--------- 245
SS ++FG GA S TP++ + YY+ L ISVG R+ V+ S
Sbjct: 297 SSTVSFGAGSVGASSASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTG 356
Query: 246 TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQP- 302
G + VD+G T L +S L+ + +A G+ PG + CY++ +
Sbjct: 357 RGGVIVDSGTSVTRLARASYSALR----DAFRAAAAGGLRLSPGGFSLFDTCYDLGGRRV 412
Query: 303 -KFPEVTIHFR-GADVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYD 358
K P V++HF GA+ L P N + S C AF G + + + G I Q F + +D
Sbjct: 413 VKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFD 472
Query: 359 IEQAMVSFKPSRC 371
+ V F P C
Sbjct: 473 GDGQRVGFAPKGC 485
>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
Length = 351
Score = 144 bits (363), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 110/352 (31%), Positives = 164/352 (46%), Gaps = 32/352 (9%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + GTP DTGSD W QC+PC + C+ Q+ PLFDP SSTY ++SC+
Sbjct: 16 YVITVGFGTPTRTQTVVFDTGSDVNWLQCKPC-AVRCYAQQEPLFDPSLSSTYRNVSCTE 74
Query: 95 SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
C + T CS C Y YG G S + G LA +T + N IFGCG
Sbjct: 75 PACVGLSTRGCSSSTCLYGVFYGDG---SSTIGFLAMDTFMLTPAQ----KFKNFIFGCG 127
Query: 154 HKNLASPTSDSKQTGIIGLGPGNS-SLISQMGTSIAGKFSYCLPDQGSSK--INFGGI-- 208
N + G++GLG ++ SL SQ+ S+ FSYCLP S+ +N G
Sbjct: 128 QNNTGLFQGTA---GLVGLGRSSTYSLNSQVAPSLGNVFSYCLPSTSSATGYLNIGNPQN 184
Query: 209 VAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSS---STGNIFVDTGVLRTLLPLEYH 265
G + T + Y++ L ISVG RL S+ S G I +D+G + T LP +
Sbjct: 185 TPGYTAMLTDTRVPTLYFIDLIGISVGGTRLSLSSTVFQSVGTI-IDSGTVITRLPPTAY 243
Query: 266 SNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQPK--FPEVTIHFRGADVKLSPSN 322
S LK+ + + + P + + CY+ S +P + +HF G DV++ +
Sbjct: 244 SALKTAVRAAMTQYTL-----APAVTILDTCYDFSRTTSVVYPVIVLHFAGLDVRIPATG 298
Query: 323 LFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+F + +C AF G + + G + Q+ + YD E + F C
Sbjct: 299 VFFVFNSSQVCLAFAGNTDSTMIGIIGNVQQLTMEVTYDNELKRIGFSAGAC 350
>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
Length = 502
Score = 144 bits (363), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 112/366 (30%), Positives = 175/366 (47%), Gaps = 45/366 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++++ +GTP D+ DTGSD TWTQC+PC + C+ Q+ P+FDP S TY++ISC+S
Sbjct: 154 YIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVK-SCYAQQQPIFDPSTSKTYSNISCTS 212
Query: 95 SQCAVVTS------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
+ C+ + S CS +C Y YG +SF+ G A + LT
Sbjct: 213 AACSSLKSATGNSPGCSSSNCVYGIQYGD---SSFTIGFFAKDKLTLTQND----VFDGF 265
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP-DQGSS-KINFG 206
+FGCG N K G+IGLG S++ Q FSYCLP +GS+ + FG
Sbjct: 266 MFGCGQNNKGLF---GKTAGLIGLGRDPLSIVQQTAQKFGKYFSYCLPTSRGSNGHLTFG 322
Query: 207 ---GIVAGA----GVVSTPLIIRD---HYYLSLEAISVGNQRLE---FVSSSTGNIFVDT 253
G+ A G+ TP +Y++ + ISVG + L + + G I +D+
Sbjct: 323 NGNGVKASKAVKNGITFTPFASSQGTAYYFIDVLGISVGGKALSISPMLFQNAGTI-IDS 381
Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIH 310
G + T LP + +LKS + P P S + CY++S + P+++ +
Sbjct: 382 GTVITRLPSTAYGSLKSAFKQFMSKYPT-----APALSLLDTCYDLSNYTSISIPKISFN 436
Query: 311 FRG-ADVKLSPSNLFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSF 366
F G A+V+L P+ + +C AF G + ++G I Q + YD+ + F
Sbjct: 437 FNGNANVELDPNGILITNGASQVCLAFAGNGDDDSIGIFGNIQQQTLEVVYDVAGGQLGF 496
Query: 367 KPSRCT 372
C+
Sbjct: 497 GYKGCS 502
>gi|357127503|ref|XP_003565419.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 486
Score = 144 bits (363), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 119/390 (30%), Positives = 179/390 (45%), Gaps = 55/390 (14%)
Query: 26 AEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPP--LFDPKK 83
AE++S YLM + +GTPPV + DTGSD W +C+ + D PP F P
Sbjct: 101 AEVVSRQFEYLMAIEVGTPPVRVLAIADTGSDLVWVKCKG-KDNDNNSTAPPSVYFVPSA 159
Query: 84 SSTYNSISCSSSQCAVVTSNCS---EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG 140
SSTY + C + C ++S S +G C Y + YG G+ A SG L+TET TF++ +
Sbjct: 160 SSTYGRVGCDTKACRALSSAASCSPDGSCEYLYSYGDGSRA---SGQLSTETFTFSTIAD 216
Query: 141 -----------------LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQM 183
VE+ + FGC + +D G SL SQ+
Sbjct: 217 SSKTNSHGNNNNNSSSHGQVEIAKLDFGCSTTTTGTFRADGLVGLGG----GPVSLASQL 272
Query: 184 G--TSIAGKFSYCLP----DQGSSKINFG--GIVAGAGVVSTPLI---IRDHYYLSLEAI 232
G TS+ KFSYCL SS +NFG +V+ G STPLI + +Y ++L++I
Sbjct: 273 GATTSLGRKFSYCLAPYANTNASSALNFGSRAVVSEPGAASTPLITGEVETYYTIALDSI 332
Query: 233 SVGNQRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSD 292
+V + +++ +I VD+G T L + L ++ IK + P
Sbjct: 333 NVAGTKRP-TTAAQAHIIVDSGTTLTYLDSALLTPLVKDLTRRIKLPRAE----SPEKIL 387
Query: 293 VLCYNIS-----SQPKFPEVTIHF-RGADVKLSPSNLFRNISDEIMCSAFRGGNANI--- 343
LCY+IS P+VT+ G +V L P N F + + ++C A +
Sbjct: 388 DLCYDISGVRGEDALGIPDVTLVLGGGGEVTLKPDNTFVVVQEGVLCLALVATSERQSVS 447
Query: 344 VYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
+ G I Q N +GYD+E+ V+F + C
Sbjct: 448 ILGNIAQQNLHVGYDLEKGTVTFAAADCAK 477
>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
Length = 519
Score = 144 bits (363), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 116/355 (32%), Positives = 176/355 (49%), Gaps = 33/355 (9%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP DTGSD TW QC+PC + C++Q LFDP +SSTY ++SC++
Sbjct: 180 YVVTVGLGTPVSRYTVVFDTGSDTTWVQCQPCVVV-CYEQREKLFDPARSSTYANVSCAA 238
Query: 95 SQCAVVT-SNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
C+ + CS G C Y YG G+Y S G A +TLT +S + FGCG
Sbjct: 239 PACSDLNIHGCSGGHCLYGVQYGDGSY---SIGFFAMDTLTLSSYD----AVKGFRFGCG 291
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--GSSKINFGGIVAG 211
+N + G++GLG G +SL Q G F++CLP + G+ ++FG
Sbjct: 292 ERNEG---LFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTGTGYLDFGAGSLA 348
Query: 212 AGV--VSTPLIIRD---HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEY 264
A ++TP++ + YY+ + I VG Q L S +T VD+G + T LP
Sbjct: 349 AASARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFATAGTIVDSGTVITRLPPAA 408
Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR-GADVKLSP 320
+S+L+ + + A +G P S + CY+ + SQ P V++ F+ GA + +
Sbjct: 409 YSSLRYAFAAAMAA---RGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDA 465
Query: 321 SNLFRNISDEIMCSAFR----GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
S + S +C AF GG+ IV G F + YDI + +V F P C
Sbjct: 466 SGIMYAASASQVCLAFAANEDGGDVGIV-GNTQLKTFGVAYDIGKKVVGFYPGAC 519
>gi|255537017|ref|XP_002509575.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549474|gb|EEF50962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 459
Score = 144 bits (363), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 113/369 (30%), Positives = 178/369 (48%), Gaps = 34/369 (9%)
Query: 24 YQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKK 83
Y+A ++S +L++ SIG PPV + +DTGS TW QCEPC ++C +Q+ PL++P
Sbjct: 99 YEASLLSELCTFLVNFSIGQPPVPQYAVMDTGSSLTWIQCEPC--INCHQQKGPLYNPSS 156
Query: 84 SSTYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
SSTY S S T+ DC+YS Y A + + G A E L F +
Sbjct: 157 SSTYVSCSDFDRTDTTFTATHGS-DCNYSQTY---ADKTTTRGTYAREQLLFETPDDGIT 212
Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKI 203
M +VIFGCGH N P +G+ GLG SS+IS++G FSYC+ + G
Sbjct: 213 IMHDVIFGCGHNNTQLPGPTGYASGVFGLGDSGSSIISKLGFG----FSYCIGNIGDPLY 268
Query: 204 NFGGIVAGAGV----VSTPLIIRDHYYLSLEAISVGNQRLEF---------VSSSTGNIF 250
F + G + STPL+ R YY++L IS+G +RL+ ++ + I
Sbjct: 269 GFHRLTLGNKLKIEGYSTPLVPRGLYYITLVGISIGQERLDIDPIVFQRVDLNGISSRIV 328
Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY----NISSQPKFPE 306
+D+G + +P + ++ ++ +S+++ + S LCY N Q FP+
Sbjct: 329 IDSGATLSYIPRQAYNVVRDKVSSILSGFLSRYRYIARHLS--LCYIGKLNQDLQ-GFPD 385
Query: 307 VTIHF-RGADVKLSPSNLFRNISDEIMCSAF---RGGNANIVYGRIMQINFLIGYDIEQA 362
T H GAD+ LF +D ++C A + G + Q + + YD++Q
Sbjct: 386 ATFHLADGADLVFQVEGLFFQYTDNVLCLALVPTESDEETCLIGLLAQQYYNVAYDLKQQ 445
Query: 363 MVSFKPSRC 371
+ F+ C
Sbjct: 446 KLYFQRIEC 454
>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 144 bits (363), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 121/401 (30%), Positives = 186/401 (46%), Gaps = 62/401 (15%)
Query: 7 LPFYNDNETPKSPISIIYQAEIISVDD-----------IYLMHLSIGTPPVDIFGSVDTG 55
L N ++ PIS +Y E ++ Y + IG P +++ +DTG
Sbjct: 109 LAINNISKADLKPISTMYTTEEQDIEAPLISGTTQGSGEYFTRVGIGKPAREVYMVLDTG 168
Query: 56 SDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQC-AVVTSNCSEGDCSYSFL 114
SD W QC PC DC+ Q P+F+P SS+Y +SC + QC A+ S C C Y
Sbjct: 169 SDVNWLQCTPCA--DCYHQTEPIFEPSSSSSYEPLSCDTPQCNALEVSECRNATCLYEVS 226
Query: 115 YGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKN---LASPTSDSKQTGIIG 171
YG G+Y + G+ ATETLT ST + NV GCGH N G +
Sbjct: 227 YGDGSY---TVGDFATETLTIGST-----LVQNVAVGCGHSNEGLFVGAAGLLGLGGGLL 278
Query: 172 LGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFGGIVAGAGVVSTPLIIRDH---- 224
P SQ+ T+ FSYCL D+ S S ++FG ++ VV+ PL +R+H
Sbjct: 279 ALP------SQLNTT---SFSYCLVDRDSDSASTVDFGTSLSPDAVVA-PL-LRNHQLDT 327
Query: 225 -YYLSLEAISVGNQRLEFVSSS-------TGNIFVDTGVLRTLLPLEYHSNLK-SVMSNM 275
YYL L ISVG + L+ SS +G I +D+G T L E +++L+ S +
Sbjct: 328 FYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGT 387
Query: 276 IKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFRGADVKLSPSNLFRNISDEI-- 331
+ + GV CYN+S++ + P V HF G + P+ + D +
Sbjct: 388 LDLEKAAGVAMFD-----TCYNLSAKTTVEVPTVAFHFPGGKMLALPAKNYMIPVDSVGT 442
Query: 332 MCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
C AF +++ + G + Q + +D+ +++ F ++C
Sbjct: 443 FCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 483
>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 450
Score = 144 bits (363), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 110/367 (29%), Positives = 173/367 (47%), Gaps = 53/367 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+ L +GTP VD+GS TW QC PC + C Q PL+DP+ SSTY ++ CS+
Sbjct: 108 YITRLGLGTPTTTYVMVVDSGSSLTWLQCAPC-AVSCHPQAGPLYDPRASSTYAAVPCSA 166
Query: 95 SQCAVVT------SNCS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
QCA + S+CS G C Y YG G SFS G L+ +T++ +S+ P
Sbjct: 167 PQCAELQAATLNPSSCSGSGVCQYQASYGDG---SFSFGYLSKDTVSLSSSG----SFPG 219
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---IN 204
+GCG N+ + G+IGL SL+SQ+ S+ F+YCLP ++ ++
Sbjct: 220 FYYGCGQDNVG---LFGRAAGLIGLARNKLSLLSQLAPSVGNSFAYCLPTSAAASAGYLS 276
Query: 205 FG--------GIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSSSTGNI--FVDTG 254
FG G + +VS+ L Y++SL +SV L SS G++ +D+G
Sbjct: 277 FGSNSDNKNPGKYSYTSMVSSSL-DASLYFVSLAGMSVAGSPLAVPSSEYGSLPTIIDSG 335
Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI--------SSQPKFPE 306
+ T LP ++ L K VGA Y+I ++ P
Sbjct: 336 TVITRLPTPVYTALS------------KAVGAALAAPSAPAYSILQTCFKGQVAKLPVPA 383
Query: 307 VTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVS 365
V + F GA ++L+P N+ ++++ C AF ++ + G Q F + YD++ + +
Sbjct: 384 VNMAFAGGATLRLTPGNVLVDVNETTTCLAFAPTDSTAIIGNTQQQTFSVVYDVKGSRIG 443
Query: 366 FKPSRCT 372
F C+
Sbjct: 444 FAAGGCS 450
>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 418
Score = 144 bits (362), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 116/376 (30%), Positives = 176/376 (46%), Gaps = 58/376 (15%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +GTPP VD+GSD W QC PC + C+ Q+ PL+ P SST++ + C S
Sbjct: 64 YFVDFFLGTPPQKFSLIVDSGSDLLWVQCSPCRQ--CYAQDSPLYVPSNSSTFSPVPCLS 121
Query: 95 SQCAVVTSN----CS---EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
S C ++ + C G C+Y +LY A S S G A E+ T + V +
Sbjct: 122 SDCLLIPATEGFPCDFRYPGACAYEYLY---ADTSSSKGVFAYESATVDG-----VRIDK 173
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----PDQGSSK 202
V FGCG N S + G++GLG G S SQ+G + KF+YCL P SS
Sbjct: 174 VAFGCGSDNQG---SFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYLDPTSVSSS 230
Query: 203 INFGG--IVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSST--------GN 248
+ FG I + TP++ YY+ +E ++VG + L S+ G+
Sbjct: 231 LIFGDELISTIHDMQYTPIVSNPKSPTLYYVQIEKVTVGGKSLPISDSAWEIDLLGNGGS 290
Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMI--KAQPVKGVGAEPGFSDVLCYNIS--SQPKF 304
IF L P Y L + S + +A+ V+G+ LC ++ QP F
Sbjct: 291 IFDSGTTLTYWFPSAYSHILAAFDSGVHYPRAESVQGLD--------LCVELTGVDQPSF 342
Query: 305 PEVTIHF-RGADVKLSPSNLFRNISDEIMCSAFRG-----GNANIVYGRIMQINFLIGYD 358
P TI F GA + N F +++ + C A G G N + G ++Q NF + YD
Sbjct: 343 PSFTIEFDDGAVFQPEAENYFVDVAPNVRCLAMAGLASPLGGFNTI-GNLLQQNFFVQYD 401
Query: 359 IEQAMVSFKPSRCTNY 374
E+ ++ F P++C+++
Sbjct: 402 REENLIGFAPAKCSSH 417
>gi|357481195|ref|XP_003610883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512218|gb|AES93841.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 315
Score = 144 bits (362), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 109/302 (36%), Positives = 162/302 (53%), Gaps = 28/302 (9%)
Query: 91 SCSSSQCAVV-TSNCS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
SC S C + T CS E C+Y++ YG S + G LA +T TF S +G V +
Sbjct: 20 SCDSPLCHKLDTGVCSPEKRCNYTYGYGDN---SLTKGVLAQDTATFTSNTGKLVSLSRF 76
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK-FSYCLPD-----QGSSK 202
+FGCGH N + + G+IGLG G +SLISQ+G GK FS CL + SS+
Sbjct: 77 LFGCGHNNTGG--FNDHEMGLIGLGGGPTSLISQIGPLFGGKKFSQCLVPFLTDIKISSR 134
Query: 203 INFG--GIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSS-STGNIFVDTGV 255
++FG V G GVV+TPL+ R+ Y+++L ISV + L S+ GN+ VD+G
Sbjct: 135 MSFGKGSQVLGDGVVTTPLVQREQDMTSYFVTLLGISVEDTYLPMNSTIEKGNMLVDSGT 194
Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGAD 315
+LP + + + + N + P++ + +P LCY + K P +T HF GA+
Sbjct: 195 PPNILPQQLYDRVYVEVKNNV---PLELITNDPSLGPQLCYRTQTNLKGPTLTYHFEGAN 251
Query: 316 VKLSPSNLFRNISDE---IMCSAFRG-GNANI-VYGRIMQINFLIGYDIEQAMVSFKPSR 370
+ L+P F + E + C A N+N VYG Q N+LIG+D+++ +VSFK +
Sbjct: 252 LLLTPIQTFIPPTPETKGVFCLAINNYTNSNGGVYGNFAQSNYLIGFDLDRQVVSFKATD 311
Query: 371 CT 372
CT
Sbjct: 312 CT 313
>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 517
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 116/355 (32%), Positives = 177/355 (49%), Gaps = 33/355 (9%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP DTGSD TW QC+PC + C++Q+ LFDP +SSTY ++SC++
Sbjct: 178 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVV-CYEQQEKLFDPVRSSTYANVSCAA 236
Query: 95 SQCAVVT-SNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
C+ + CS G C Y YG G+Y S G A +TLT +S + FGCG
Sbjct: 237 PACSDLNIHGCSGGHCLYGVQYGDGSY---SIGFFAMDTLTLSSYD----AVKGFRFGCG 289
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--GSSKINFGGIVAG 211
+N + G++GLG G +SL Q G F++CLP + G+ ++FG
Sbjct: 290 ERNEG---LFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTGTGYLDFGAGSPA 346
Query: 212 AGV--VSTPLIIRD---HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEY 264
A ++TP++ + YY+ + I VG Q L S +T VD+G + T LP
Sbjct: 347 AASARLTTPMLTDNGPTFYYIGMTGIRVGGQLLSIPQSVFATAGTIVDSGTVITRLPPPA 406
Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR-GADVKLSP 320
+S+L+ + + A +G P S + CY+ + SQ P V++ F+ GA + +
Sbjct: 407 YSSLRYAFAAAMAA---RGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDA 463
Query: 321 SNLFRNISDEIMCSAFR----GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
S + S +C AF GG+ IV G F + YDI + +V F P C
Sbjct: 464 SGIMYAASASQVCLAFAANEDGGDVGIV-GNTQLKTFGVAYDIGKKVVGFYPGVC 517
>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 481
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 105/361 (29%), Positives = 161/361 (44%), Gaps = 45/361 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + + +G+PP + + +D+GSD W QC+PC + C+ Q P+FDP S+++ + CSS
Sbjct: 142 YFIRIGVGSPPREQYVVIDSGSDIVWVQCQPCTQ--CYHQTDPVFDPADSASFMGVPCSS 199
Query: 95 SQCAVV-TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
S C + + C G C Y +YG G+Y + G LA ETLTF T + NV GCG
Sbjct: 200 SVCERIENAGCHAGGCRYEVMYGDGSY---TKGTLALETLTFGRTV-----VRNVAIGCG 251
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG---SSKINFGGIVA 210
H+N + G+ G + SL+ Q+G G FSYCL +G + + FG
Sbjct: 252 HRNRGMFVGAAGLLGLGGG---SMSLVGQLGGQTGGAFSYCLVSRGTDSAGSLEFGRGAM 308
Query: 211 GAGVVSTPLIIRDH----YYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTL 259
G PLI YY+ L + VG ++ + G + +DTG T
Sbjct: 309 PVGAAWIPLIRNPRAPSFYYIRLSGVGVGGMKVPISEDVFQLNEMGNGGVVMDTGTAVTR 368
Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNISS--QPKFPEVTIHFRG 313
+P + + + G P S V CYN++ + P V+ +F G
Sbjct: 369 IPTVAYVAFRDAF--------IGQTGNLPRASGVSIFDTCYNLNGFVSVRVPTVSFYFAG 420
Query: 314 ADVKLSPSNLFRNISDEI--MCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSR 370
+ P+ F D++ C AF + + + G I Q I +D V F P+
Sbjct: 421 GPILTLPARNFLIPVDDVGTFCFAFAASPSGLSIIGNIQQEGIQISFDGANGFVGFGPNV 480
Query: 371 C 371
C
Sbjct: 481 C 481
>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
Length = 415
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 115/358 (32%), Positives = 166/358 (46%), Gaps = 55/358 (15%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YL+HL+IGTPP + ++DTGSD WTQC+PCP CF Q P FDP SST + SC S
Sbjct: 89 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPA--CFDQALPYFDPSTSSTLSLTSCDS 146
Query: 95 SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
+ C + ++F+ GA AS +P V FGCG
Sbjct: 147 TLCQGLPVASLPRSDKFTFV---GAGAS----------------------VPGVAFGCGL 181
Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYC-------LPDQGSSKINFGG 207
N + S +TGI G G G SL SQ+ G FS+C +P +
Sbjct: 182 FN--NGVFKSNETGIAGFGRGPLSLPSQLKV---GNFSHCFTTITGAIPSTVLLDLPADL 236
Query: 208 IVAGAGVVSTPLIIRD-----HYYLSLEAISVGNQRL-----EF-VSSSTGNIFVDTGVL 256
G G V T +I++ YYLSL+ I+VG+ RL EF + + TG +D+G
Sbjct: 237 FSNGQGAVQTTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGTGGTIIDSGTA 296
Query: 257 RTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADV 316
T LP + ++ + +K V G +P F L + ++P P++ +HF GA +
Sbjct: 297 MTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYF--CLSAPLRAKPYVPKLVLHFEGATM 354
Query: 317 KLSPSNLFRNISD---EIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
L N + D I+C A G G Q N + YD++ + +SF P++C
Sbjct: 355 DLPRENYVFEVEDAGSSILCLAIIEGGEVTTIGNFQQQNMHVLYDLQNSKLSFVPAQC 412
>gi|115434870|ref|NP_001042193.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|55296112|dbj|BAD67831.1| putative CDR1 [Oryza sativa Japonica Group]
gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa Japonica Group]
gi|113531724|dbj|BAF04107.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|125569253|gb|EAZ10768.1| hypothetical protein OsJ_00604 [Oryza sativa Japonica Group]
Length = 454
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 116/373 (31%), Positives = 177/373 (47%), Gaps = 41/373 (10%)
Query: 26 AEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPL-FDPKKS 84
++++S YLM +++G+PP + DTGSD W +C+ P FDP +S
Sbjct: 92 SKVVSRSFEYLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDPSRS 151
Query: 85 STYNSISCSSSQC-AVVTSNCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNS----T 138
STY +SC + C A+ + C +G +C+Y + YG G S ++G L+TET TF+
Sbjct: 152 STYGRVSCQTDACEALGRATCDDGSNCAYLYAYGDG---SNTTGVLSTETFTFDDGGSGR 208
Query: 139 SGLPVEMPNVIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMG--TSIAGKFSYCL 195
S V + V FGC S P G SL++Q+G TS+ +FSYCL
Sbjct: 209 SPRQVRVGGVKFGCSTATAGSFPADGLVGLGGG-----AVSLVTQLGGATSLGRRFSYCL 263
Query: 196 PDQ---GSSKINFGGI--VAGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSSSTG 247
SS +NFG + V G STPL+ D +Y + L+++ VGN+ + +SS
Sbjct: 264 VPHSVNASSALNFGALADVTEPGAASTPLVAGDVDTYYTVVLDSVKVGNKTVASAASS-- 321
Query: 248 NIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ-----P 302
I VD+G T L + +S I PV+ P LCYN++ +
Sbjct: 322 RIIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQ----SPDGLLQLCYNVAGREVEAGE 377
Query: 303 KFPEVTIHF-RGADVKLSPSNLFRNISDEIMCSAFRGGNAN---IVYGRIMQINFLIGYD 358
P++T+ F GA V L P N F + + +C A + G + Q N +GYD
Sbjct: 378 SIPDLTLEFGGGAAVALKPENAFVAVQEGTLCLAIVATTEQQPVSILGNLAQQNIHVGYD 437
Query: 359 IEQAMVSFKPSRC 371
++ V+F + C
Sbjct: 438 LDAGTVTFAGADC 450
>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 461
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 119/361 (32%), Positives = 163/361 (45%), Gaps = 42/361 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +GTP ++ +DTGSD W QC PC + C+ Q +FDP KS TY I C +
Sbjct: 118 YFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRK--CYTQTDHVFDPTKSRTYAGIPCGA 175
Query: 95 SQCAVVTS-NCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
C + S CS + C Y YG G SF+ G+ +TETLTF + V G
Sbjct: 176 PLCRRLDSPGCSNKNKVCQYQVSYGDG---SFTFGDFSTETLTFRRN-----RVTRVALG 227
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK----INFGG 207
CGH N T + G+ S Q G KFSYCL D+ +S + FG
Sbjct: 228 CGHDNEGLFTGAAGLLGLGRG---RLSFPVQTGRRFNHKFSYCLVDRSASAKPSSVIFGD 284
Query: 208 IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS--------TGNIFVDTGV 255
TPLI + YYL L ISVG + +S+S G + +D+G
Sbjct: 285 SAVSRTAHFTPLIKNPKLDTFYYLELLGISVGGAPVRGLSASLFRLDAAGNGGVIIDSGT 344
Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR 312
T L + L+ I A +K P FS C+++S ++ K P V +HFR
Sbjct: 345 SVTRLTRPAYIALRDAFR--IGASHLK---RAPEFSLFDTCFDLSGLTEVKVPTVVLHFR 399
Query: 313 GADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSR 370
GADV L +N + + C AF G + + + G I Q F I YD+ + V F P
Sbjct: 400 GADVSLPATNYLIPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRISYDLTGSRVGFAPRG 459
Query: 371 C 371
C
Sbjct: 460 C 460
>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 111/354 (31%), Positives = 166/354 (46%), Gaps = 33/354 (9%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+ + +GTP VDTGS TW QC PC + C +Q P+FDPK SS+Y ++SCS+
Sbjct: 137 YVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPC-RVSCHRQSGPVFDPKTSSSYAAVSCST 195
Query: 95 SQC---AVVTSN---CSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
QC + T N CS D C Y YG +SFS G L+ +T++F S S +PN
Sbjct: 196 PQCNDLSTATLNPAACSSSDVCIYQASYGD---SSFSVGYLSKDTVSFGSNS-----VPN 247
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGG 207
+GCG N + G++GL SL+ Q+ ++ FSYCLP SS G
Sbjct: 248 FYYGCGQDNEG---LFGRSAGLMGLARNKLSLLYQLAPTLGYSFSYCLPSSSSSGYLSIG 304
Query: 208 IVAGAGVVSTPLI---IRDH-YYLSLEAISVGNQRLEFVSSSTGNI--FVDTGVLRTLLP 261
TP++ + D Y++ L ++V + L SS ++ +D+G + T LP
Sbjct: 305 SYNPGQYSYTPMVSSTLDDSLYFIKLSGMTVAGKPLAVSSSEYSSLPTIIDSGTVITRLP 364
Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFS--DVLCYNISSQPKFPEVTIHFR-GADVKL 318
+ L + A +KG +S D +S + P V++ F GA +KL
Sbjct: 365 TTVYDALSKAV-----AGAMKGTKRADAYSILDTCFVGQASSLRVPAVSMAFSGGAALKL 419
Query: 319 SPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
S NL ++ C AF + + G Q F + YD++ + F CT
Sbjct: 420 SAQNLLVDVDSSTTCLAFAPARSAAIIGNTQQQTFSVVYDVKSNRIGFAAGGCT 473
>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
Length = 412
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 113/353 (32%), Positives = 172/353 (48%), Gaps = 51/353 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCE-PCPELDCFKQEPPLFDPKKSSTYNSISCS 93
YL+ ++IGTPP+ + +DTGSD WTQC+ PC CF Q PL+ P +S+TY ++SC
Sbjct: 92 YLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRR--CFPQPAPLYAPARSATYANVSCR 149
Query: 94 SSQCAVVT---SNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
S C + S CS D C+Y F YG G + + G LATET T S + + V
Sbjct: 150 SPMCQALQSPWSRCSPPDTGCAYYFSYGDG---TSTDGVLATETFTLGSDTA----VRGV 202
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGI 208
FGCG +NL S + S G++G+G G SL+SQ+G + + S +
Sbjct: 203 AFGCGTENLGSTDNSS---GLVGMGRGPLSLVSQLGVTRPRR---------SCRARAAAR 250
Query: 209 VAGAGVVSTPLIIRDHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTLLP 261
GA ++P LE I+VG+ L G + +D+G T L
Sbjct: 251 GGGAPTTTSP----------LEGITVGDTLLPIDPAVFRLTPMGDGGVIIDSGTTFTALE 300
Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFRGADVKL- 318
L +++ ++ P+ GA G S LC+ +S + P + +HF GAD++L
Sbjct: 301 ERAFVALARALASRVR-LPLAS-GAHLGLS--LCFAAASPEAVEVPRLVLHFDGADMELR 356
Query: 319 SPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
S + + S + C V G + Q N I YD+E+ ++SF+P++C
Sbjct: 357 RESYVVEDRSAGVACLGMVSARGMSVLGSMQQQNTHILYDLERGILSFEPAKC 409
>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 490
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 115/361 (31%), Positives = 171/361 (47%), Gaps = 42/361 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +G+P D+ DTGSD TWTQCEPC C++Q +FDP S +Y+++SC S
Sbjct: 147 YVVTVGLGSPKRDLTFIFDTGSDLTWTQCEPCVGY-CYQQREHIFDPSTSLSYSNVSCDS 205
Query: 95 SQCAVVTS------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
C + S CS C Y YG G+Y S G A E L+ ST N
Sbjct: 206 PSCEKLESATGNSPGCSSSTCLYGIRYGDGSY---SIGFFAREKLSLTSTD----VFNNF 258
Query: 149 IFGCGHKN--LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKIN 204
FGCG N L T+ G++GL SL+SQ FSYCLP + ++
Sbjct: 259 QFGCGQNNRGLFGGTA-----GLLGLARNPLSLVSQTAQKYGKVFSYCLPSSSSSTGYLS 313
Query: 205 FG-GIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLR 257
FG G V TP + Y+L + ISVG ++L S ST +D+G +
Sbjct: 314 FGSGDGDSKAVKFTPSEVNSDYPSFYFLDMVGISVGERKLPIPKSVFSTAGTIIDSGTVI 373
Query: 258 TLLPLEYHSNLKSVMSNMIKAQP-VKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFR-G 313
+ LP +S+++ V ++ P VKGV CY++S K P++ ++F G
Sbjct: 374 SRLPPTVYSSVQKVFRELMSDYPRVKGVSILD-----TCYDLSKYKTVKVPKIILYFSGG 428
Query: 314 ADVKLSPSNLFRNISDEIMCSAFRGGNAN---IVYGRIMQINFLIGYDIEQAMVSFKPSR 370
A++ L+P + + +C AF G + + + G + Q + YD + V F PS
Sbjct: 429 AEMDLAPEGIIYVLKVSQVCLAFAGNSDDDEVAIIGNVQQKTIHVVYDDAEGRVGFAPSG 488
Query: 371 C 371
C
Sbjct: 489 C 489
>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
Length = 333
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 112/350 (32%), Positives = 167/350 (47%), Gaps = 33/350 (9%)
Query: 39 LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA 98
+ +GTP VDTGS TW QC PC + C +Q P+F+PK SSTY S+ CS+ QC+
Sbjct: 1 MGLGTPATQYVMVVDTGSSLTWLQCSPC-LVSCHRQSGPVFNPKSSSTYASVGCSAQQCS 59
Query: 99 VV------TSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
+ S CS + C Y YG +SFS G L+ +T++F STS +PN +G
Sbjct: 60 DLPSATLNPSACSSSNVCIYQASYGD---SSFSVGYLSKDTVSFGSTS-----LPNFYYG 111
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAG 211
CG N + G+IGL SL+ Q+ S+ F+YCLP SS G
Sbjct: 112 CGQDNEG---LFGRSAGLIGLARNKLSLLYQLAPSLGYSFTYCLPSSSSSGYLSLGSYNP 168
Query: 212 AGVVSTPLI---IRDH-YYLSLEAISVGNQRLEFVSSSTGNI--FVDTGVLRTLLPLEYH 265
TP++ + D Y++ L ++V L SS+ ++ +D+G + T LP +
Sbjct: 169 GQYSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLPTIIDSGTVITRLPTSVY 228
Query: 266 SNLKSVMSNMIKAQPVKGVGAEPGFS--DVLCYNISSQPKFPEVTIHFR-GADVKLSPSN 322
S L + A +KG +S D +S+ P VT+ F GA +KLS N
Sbjct: 229 SALSKAV-----AAAMKGTSRASAYSILDTCFKGQASRVSAPAVTMSFAGGAALKLSAQN 283
Query: 323 LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
L ++ D C AF + + G Q F + YD++ + + F C+
Sbjct: 284 LLVDVDDSTTCLAFAPARSAAIIGNTQQQTFSVVYDVKSSRIGFAAGGCS 333
>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
sylvestris]
Length = 502
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 112/366 (30%), Positives = 173/366 (47%), Gaps = 45/366 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++++ +GTP D+ DTGSD TWTQC+PC + C+ Q+ P+FDP S TY++ISC+S
Sbjct: 154 YIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVK-SCYAQQQPIFDPSASKTYSNISCTS 212
Query: 95 SQCAVVTS------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
+ C+ + S CS +C Y YG +SF+ G A +TLT
Sbjct: 213 TACSGLKSATGNSPGCSSSNCVYGIQYGD---SSFTVGFFAKDTLTLTQND----VFDGF 265
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP-DQGSS-KINFG 206
+FGCG N K G+IGLG S++ Q FSYCLP +GS+ + FG
Sbjct: 266 MFGCGQNNRG---LFGKTAGLIGLGRDPLSIVQQTAQKFGKYFSYCLPTSRGSNGHLTFG 322
Query: 207 ---GI----VAGAGVVSTPLIIRD---HYYLSLEAISVGNQRLE---FVSSSTGNIFVDT 253
G+ G+ TP Y++ + ISVG + L + + G I +D+
Sbjct: 323 NGNGVKTSKAVKNGITFTPFASSQGATFYFIDVLGISVGGKALSISPMLFQNAGTI-IDS 381
Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIH 310
G + T LP + +LKS + P P S + CY++S + P+++ +
Sbjct: 382 GTVITRLPSTVYGSLKSTFKQFMSKYPT-----APALSLLDTCYDLSNYTSISIPKISFN 436
Query: 311 FRG-ADVKLSPSNLFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSF 366
F G A+V L P+ + +C AF G + ++G I Q + YD+ + F
Sbjct: 437 FNGNANVDLEPNGILITNGASQVCLAFAGNGDDDTIGIFGNIQQQTLEVVYDVAGGQLGF 496
Query: 367 KPSRCT 372
C+
Sbjct: 497 GYKGCS 502
>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
Length = 503
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 109/354 (30%), Positives = 172/354 (48%), Gaps = 32/354 (9%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP DTGSD TW QC+PC C++Q+ PLF P KS+TY +ISC+S
Sbjct: 165 YVVPIRLGTPAARFTVVFDTGSDTTWVQCQPCVAY-CYQQKEPLFTPTKSATYANISCTS 223
Query: 95 SQCA-VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
S C+ + T CS G C Y+ YG G+Y + G A +TLT + + + FGCG
Sbjct: 224 SYCSDLDTRGCSGGHCLYAVQYGDGSY---TVGFYAQDTLTLGYDT-----VKDFRFGCG 275
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKINFGGIVAG 211
KN K G++GLG G +S+ Q +G F+YC+P G+ ++FG
Sbjct: 276 EKNRG---LFGKAAGLMGLGRGKTSVPVQAYDKYSGVFAYCIPATSSGTGFLDFGPGAPA 332
Query: 212 AGVVS-TPLIIRD---HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEYH 265
A TP+++ + YY+ + I VG L ++ S VD+G + T LP +
Sbjct: 333 AANARLTPMLVDNGPTFYYVGMTGIKVGGHLLSIPATVFSDAGALVDSGTVITRLPPSAY 392
Query: 266 SNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP---KFPEVTIHFR-GADVKLSP 320
L+S + ++ G P FS + CY+++ P V++ F+ GA + +
Sbjct: 393 EPLRSAFAKGMEGL---GYKTAPAFSILDTCYDLTGYQGSIALPAVSLVFQGGACLDVDA 449
Query: 321 SNLFRNISDEIMCSAFRGGNAN---IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
S + C AF + + + G Q + + YD+ + +V F P C
Sbjct: 450 SGILYVADVSQACLAFAANDDDTDMTIVGNTQQKTYSVLYDLGKKVVGFAPGAC 503
>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
Length = 379
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 100/273 (36%), Positives = 145/273 (53%), Gaps = 37/273 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YL+ L+IGTPP+ +DTGSD WTQC PC L C Q P FD KKS+TY ++ C S
Sbjct: 89 YLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPC--LLCADQPTPYFDVKKSATYRALPCRS 146
Query: 95 SQCAVVTS-NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
S+CA ++S +C + C Y + YG A ++G LA ET TF + + V N+ FGCG
Sbjct: 147 SRCASLSSPSCFKKMCVYQYYYGDTAS---TAGVLANETFTFGAANSTKVRATNIAFGCG 203
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFG---- 206
N + S G++G G G SL+SQ+G S +FSYCL S S++ FG
Sbjct: 204 SLNAGDLANSS---GMVGFGRGPLSLVSQLGPS---RFSYCLTSYLSATPSRLYFGVYAN 257
Query: 207 ----GIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRL-------EFVSSSTGNIFV 251
+G+ V STP +I + Y+LSL+AIS+G + L TG + +
Sbjct: 258 LSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVII 317
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGV 284
D+G T L + + ++V ++ A P+ +
Sbjct: 318 DSGTSITWLQQDAY---EAVRRGLVSAIPLTAM 347
>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
Length = 455
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 120/399 (30%), Positives = 188/399 (47%), Gaps = 71/399 (17%)
Query: 21 SIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQ--EPPL 78
S+ QA++ + Y M++S+GTPP+D VDTGS+ W QC PC CF + P+
Sbjct: 77 SVNVQAQLENGAGAYNMNISLGTPPLDFPVIVDTGSNLIWAQCAPCTR--CFPRPTPAPV 134
Query: 79 FDPKKSSTYNSISCSSSQCAVVTSNC------SEGDCSYSFLYGRGAYASFSSGNLATET 132
P +SST++ + C+ S C + ++ + C+Y++ YG G +++G LATET
Sbjct: 135 LQPARSSTFSRLPCNGSFCQYLPTSSRPRTCNATAACAYNYTYGSG----YTAGYLATET 190
Query: 133 LTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFS 192
LT + P V FGC +N +S GI+GLG G SL+SQ+ G+FS
Sbjct: 191 LTVGDGT-----FPKVAFGCSTENGVDNSS-----GIVGLGRGPLSLVSQLAV---GRFS 237
Query: 193 YCL----PDQGSSKINFGGIVA---GAGVVSTPLIIR------DHYYLSLEAISVGNQRL 239
YCL D G+S I FG + + V STPL+ HYY++L I+V + L
Sbjct: 238 YCLRSDMADGGASPILFGSLAKLTERSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTEL 297
Query: 240 EFVSSS--------TGNIFVDTGVLRTLLPLEYHSNLK----SVMSNMIKAQPVKGVGAE 287
S+ G VD+G T L + ++ +K S M+N+ + P G
Sbjct: 298 PVTGSTFGFTQTGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGA--- 354
Query: 288 PGFSDVLCYNISS-----QPKFPEVTIHFR-GADVKLSPSNLFRNISDE------IMCSA 335
+ LCY S+ + P + + F GA + N F + + + C
Sbjct: 355 -PYDLDLCYKPSAGGGGKAVRVPRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLL 413
Query: 336 FRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
++ + G +MQ++ + YDI+ M SF P+ C
Sbjct: 414 VLPATDDLPISIIGNLMQMDMHLLYDIDGGMFSFAPADC 452
>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
Length = 523
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 104/354 (29%), Positives = 167/354 (47%), Gaps = 34/354 (9%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP D+ DTGSD +W QC+PC +C+KQ PLFDP +S+TY+++ C +
Sbjct: 188 YIVSVGLGTPRRDLLVVFDTGSDLSWVQCKPCN--NCYKQHDPLFDPSQSTTYSAVPCGA 245
Query: 95 SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
+C + + CS G C Y +YG S + GNLA +TLT +S ++ +FGCG
Sbjct: 246 QEC-LDSGTCSSGKCRYEVVYGD---MSQTDGNLARDTLTLGPSSD---QLQGFVFGCGD 298
Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFGGIVAGA 212
+ + G+ GLG SL SQ FSYCLP ++ ++ G A
Sbjct: 299 DDTG---LFGRADGLFGLGRDRVSLASQAAARYGAGFSYCLPSSWRAEGYLSLGSAAAPP 355
Query: 213 GVVSTPLIIRDH----YYLSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTLLPLEYH 265
T ++ R YYL L I V + + V + G + +D+G + T LP +
Sbjct: 356 HAQFTAMVTRSDTPSFYYLDLVGIKVAGRTVRVAPAVFKAPGTV-IDSGTVITRLPSRAY 414
Query: 266 SNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQPK--FPEVTIHFR-GADVKLSPS 321
S L+S + ++ P S + CY+ + + K P V + F GA + L
Sbjct: 415 SALRSSFAGFMRRYK-----RAPALSILDTCYDFTGRTKVQIPSVALLFDGGATLNLGFG 469
Query: 322 NLFRNISDEIMCSAF--RGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
+ + C AF G + ++ + G + Q F + YD+ + F C+
Sbjct: 470 GVLYVANRSQACLAFASNGDDTSVGILGNMQQKTFAVVYDLANQKIGFGAKGCS 523
>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 492
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 113/370 (30%), Positives = 167/370 (45%), Gaps = 50/370 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +GTP +DTGSD W QC PC C++Q +FDP++S +YN++ C++
Sbjct: 140 YFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRR--CYEQSGQVFDPRRSRSYNAVGCAA 197
Query: 95 SQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
C + S + C Y YG G S ++G+ ATETLTF + + V G
Sbjct: 198 PLCRRLDSGGCDLRRSACLYQVAYGDG---SVTAGDFATETLTFAGGA----RVARVALG 250
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--------I 203
CGH N + + G+ G+ S +Q+ FSYCL D+ SS +
Sbjct: 251 CGHDNEGLFVAAAGLLGLG---RGSLSFPTQISRRYGRSFSYCLVDRTSSANTASRSSTV 307
Query: 204 NFGGIVAGAGVVS--TPLI----IRDHYYLSLEAISVGNQRLEFV---------SSSTGN 248
FG G+ V S TP++ + YY+ L ISVG R+ V SS G
Sbjct: 308 TFGSGAVGSTVASSFTPMVKNPRMETFYYVQLIGISVGGARVPGVANSDLRLDPSSGRGG 367
Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQP--KF 304
+ VD+G T L +S L+ G+ PG + CY++S + K
Sbjct: 368 VIVDSGTSVTRLARPAYSALRDAFRGA-----AAGLRLSPGGFSLFDTCYDLSGRKVVKV 422
Query: 305 PEVTIHFR-GADVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQ 361
P V++HF GA+ L P N + S C AF G + + + G I Q F + +D +
Sbjct: 423 PTVSMHFAGGAEAALPPENYLIPVDSKGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDG 482
Query: 362 AMVSFKPSRC 371
V+F P C
Sbjct: 483 QRVAFTPKGC 492
>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 116/380 (30%), Positives = 175/380 (46%), Gaps = 55/380 (14%)
Query: 35 YLMHLSIGTP-PVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCS 93
YL+HL IGTP P + +DTGSD WTQC C CF Q P+F S T++ + CS
Sbjct: 94 YLIHLGIGTPRPQRVVLHLDTGSDLVWTQCA-CTV--CFDQPVPVFRASVSHTFSRVPCS 150
Query: 94 SSQCA----VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNS--TSGLPVEMPN 147
C + S C+ D S + YG + S ++G +A +T TF + + +PN
Sbjct: 151 DPLCGHAVYLPLSGCAARDRSCFYAYGYMDH-SITTGKMAEDTFTFKAPDRADTAAAVPN 209
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKIN--- 204
+ FGCG N T + Q+GI G G G SL SQ+ +FSYC S+++
Sbjct: 210 IRFGCGMMNYGLFTPN--QSGIAGFGTGPLSLPSQLKVR---RFSYCFTAMEESRVSPVI 264
Query: 205 FGG----IVAGA-----------GVVSTPLIIRDHYYLSLEAISVGNQRLEFVSSS---- 245
GG I A A G P+ + Y+LSL ++VG RL F +S+
Sbjct: 265 LGGEPENIEAHATGPIQSTPFAPGPAGAPVGSQPFYFLSLRGVTVGETRLPFNASTFALK 324
Query: 246 ---TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ- 301
+G F+D+G T P +L+ + KG +P ++LC+++ ++
Sbjct: 325 GDGSGGTFIDSGTAITFFPQAVFRSLREAFVAQVPLPVAKGY-TDP--DNLLCFSVPAKK 381
Query: 302 --PKFPEVTIHFRGADVKLSPSNLFRNISDE-------IMCSAFRGGNAN-IVYGRIMQI 351
P P++ +H GAD +L N + D+ + GN+N + G Q
Sbjct: 382 KAPAVPKLILHLEGADWELPRENYVLDNDDDGSGAGRKLCVVILSAGNSNGTIIGNFQQQ 441
Query: 352 NFLIGYDIEQAMVSFKPSRC 371
N I YD+E + F P+RC
Sbjct: 442 NMHIVYDLESNKMVFAPARC 461
>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
Length = 451
Score = 142 bits (357), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 121/380 (31%), Positives = 168/380 (44%), Gaps = 61/380 (16%)
Query: 35 YLMHLSIGTP-PVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISC- 92
YL+H +IGTP P + ++DTGSD WTQC PCP CF Q PLFDP SST+ +++C
Sbjct: 87 YLIHFNIGTPRPQRVALTMDTGSDLVWTQCTPCPV--CFDQPFPLFDPSVSSTFRAVACP 144
Query: 93 ----------SSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGL- 141
S S CA+ T C CSY S ++G + +T TF S +G
Sbjct: 145 DPICRPSSGLSVSACALKTFRCFY-LCSY-------GDKSITAGYIFKDTFTFMSPNGEG 196
Query: 142 --PVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG 199
PV + + FGCG N S ++GI G G G SL SQ+ G+FSYCL
Sbjct: 197 APPVAVSGLAFGCGDYNTG--VFASNESGIAGFGRGPLSLPSQLRV---GRFSYCLTSHD 251
Query: 200 SSKINFGGIV------------AGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVS 243
++ N V + STP+I YYLSLE I+VG RL S
Sbjct: 252 ETESNKTSAVFLGTPPNGLRAHSSGPFRSTPIIHSPSFPTFYYLSLEGITVGKTRLPVDS 311
Query: 244 S-------STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY 296
S +G +D+G T P LK + + P+ ++LC+
Sbjct: 312 SVFALKKDGSGGTVIDSGTGVTTFPAAVFEQLK---NEFVAQLPLPRYDNTSEVGNLLCF 368
Query: 297 NI---SSQPKFPEVTIHFRGADVKLSPSNLFRNISDE-IMCSAFRGGNANIVY-GRIMQI 351
Q P++ H AD+ L N +D +MC G ++V G Q
Sbjct: 369 QRPKGGKQVPVPKLIFHLASADMDLPRENYIPEDTDSGVMCLMINGAEVDMVLIGNFQQQ 428
Query: 352 NFLIGYDIEQAMVSFKPSRC 371
N I YD+E + + F ++C
Sbjct: 429 NMHIVYDVENSKLLFASAQC 448
>gi|449457263|ref|XP_004146368.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 469
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 115/373 (30%), Positives = 173/373 (46%), Gaps = 48/373 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
+L++LSIG+PPV VDTGS W QC PC ++CF+Q FDP KS ++ ++ C
Sbjct: 104 FLVNLSIGSPPVTQLVVVDTGSSLLWVQCLPC--INCFQQSTSWFDPLKSVSFKTLGCGF 161
Query: 95 SQCAVVTS-NCSE-GDCSYSFLYGRGAYASFSSGNLATETLTFN-------------STS 139
+ C+ Y Y G S G LA E+L F ST
Sbjct: 162 PGYNYINGYKCNRFNQAEYKLRYLGG---DSSQGILAKESLLFETLDEGRVFQYNAISTQ 218
Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG 199
++ N+ FGCGH N+ + +D G+ GLG M T + KFSYC+ D
Sbjct: 219 ISKIKKSNITFGCGHMNIKT-NNDDAYNGVFGLGAYPH---ITMATQLGNKFSYCIGDIN 274
Query: 200 SSKINFGGIVAGAGVV----STPLIIR-DHYYLSLEAISVGNQRL-------EFVSSSTG 247
+ +V G G STPL I HYY++L++ISVG++ L + S +G
Sbjct: 275 NPLYTHNHLVLGQGSYIEGDSTPLQIHFGHYYVTLQSISVGSKTLKIDPNAFKISSDGSG 334
Query: 248 NIFVDTGVLRTLLP---LE-YHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK 303
+ +D+G+ T L E + + +M +++ P + F V+ ++
Sbjct: 335 GVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQRKFEGLCFKGVVSRDLVG--- 391
Query: 304 FPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNANI----VYGRIMQINFLIGYD 358
FP VT HF GAD+ L +LFR + C A N+ + V G + Q N+ +G+D
Sbjct: 392 FPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSELLNLSVIGILAQQNYNVGFD 451
Query: 359 IEQAMVSFKPSRC 371
+EQ V F+ C
Sbjct: 452 LEQMKVFFRRIDC 464
>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
Length = 448
Score = 141 bits (356), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 106/374 (28%), Positives = 172/374 (45%), Gaps = 49/374 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y +++G PP +DTGSD W QC PC C++Q PL+DP+ SST+ I C+S
Sbjct: 88 YFAVINVGDPPTRALVVIDTGSDLIWLQCVPCRH--CYRQVTPLYDPRSSSTHRRIPCAS 145
Query: 95 SQCAVVTS----NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
+C V + G C Y +YG G S SSG+LAT+ L F + + NV
Sbjct: 146 PRCRDVLRYPGCDARTGGCVYMVVYGDG---SASSGDLATDRLVFPDDT----HVHNVTL 198
Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ------GSSKIN 204
GCGH N+ S G++G+G G S +Q+ + FSYCL D+ GSS +
Sbjct: 199 GCGHDNVGLLES---AAGLLGVGRGQLSFPTQLAPAYGHVFSYCLGDRLSRAQNGSSYLV 255
Query: 205 FGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSST---------GNIFV 251
FG TPL YY+ + SVG +R+ S+++ G I V
Sbjct: 256 FGRTPEPPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVTGFSNASLALNPATGRGGIVV 315
Query: 252 DTGVLRTLLPLEYHSNLKSVM-SNMIKAQPVKGVGAEPGFSDVLCYNI------SSQPKF 304
D+G + + ++ ++ S+ A ++ + + D CY++ ++ +
Sbjct: 316 DSGTAISRFARDAYAAVRDAFDSHAAAAGTMRKLATKFSVFDA-CYDLRGNGAPAAAVRV 374
Query: 305 PEVTIHFR-GADVKLSPSNLFRNIS----DEIMCSAFRGGNANI-VYGRIMQINFLIGYD 358
P + +HF GAD+ L +N + C + + + V G + Q F + +D
Sbjct: 375 PSIVLHFAGGADMALPQANYLIPVQGGDRRTYFCLGLQAADDGLNVLGNVQQQGFGLVFD 434
Query: 359 IEQAMVSFKPSRCT 372
+E+ + F P+ C+
Sbjct: 435 VERGRIGFTPNGCS 448
>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
Length = 481
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 106/355 (29%), Positives = 167/355 (47%), Gaps = 28/355 (7%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP D+ DTGSD +W QC+PC C++Q PLFDP +S+TY+++ C +
Sbjct: 138 YIVSVGLGTPKRDLLVVFDTGSDLSWVQCKPCD--GCYQQHDPLFDPSQSTTYSAVPCGA 195
Query: 95 SQCAVVTS-NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV--EMPNVIFG 151
+C + S +CS G C Y +YG S + GNLA +TLT +S ++ +FG
Sbjct: 196 QECRRLDSGSCSSGKCRYEVVYGD---MSQTDGNLARDTLTLGPSSSSSSSDQLQEFVFG 252
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFG-GIVA 210
CG + K G+ GLG SL SQ FSYCLP +++ G A
Sbjct: 253 CGDDDTG---LFGKADGLFGLGRDRVSLASQAAAKYGAGFSYCLPSSSTAEGYLSLGSAA 309
Query: 211 GAGVVSTPLIIRDH----YYLSLEAISVGNQ--RLEFVSSSTGNIFVDTGVLRTLLPLEY 264
T ++ R YYL+L I V + R+ T +D+G + T LP
Sbjct: 310 PPNARFTAMVTRSDTPSFYYLNLVGIKVAGRTVRVSPAVFRTPGTVIDSGTVITRLPSRA 369
Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQPK--FPEVTIHFR-GADVKLSP 320
++ L+S + +++ K P S + CY+ + + K P V + F GA + L
Sbjct: 370 YAALRSSFAGLMRRYSYK---RAPALSILDTCYDFTGRNKVQIPSVALLFDGGATLNLGF 426
Query: 321 SNLFRNISDEIMCSAF--RGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
+ + C AF G + +I + G + Q F + YD+ + F C+
Sbjct: 427 GEVLYVANKSQACLAFASNGDDTSIAILGNMQQKTFAVVYDVANQKIGFGAKGCS 481
>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
Length = 498
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 110/363 (30%), Positives = 168/363 (46%), Gaps = 47/363 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +GTP + + +DTGSD W QCEPC E C+ Q P+F+P S++++++ C S
Sbjct: 157 YFTRIGVGTPTREQYMVLDTGSDVAWIQCEPCRE--CYSQADPIFNPSYSASFSTVGCDS 214
Query: 95 SQCAVVTS-NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
+ C+ + + +C G C Y YG G+Y S+G+ ATETLTF +TS + NV GCG
Sbjct: 215 AVCSQLDAYDCHSGGCLYEASYGDGSY---STGSFATETLTFGTTS-----VANVAIGCG 266
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ---GSSKINFGGIVA 210
HKN+ + G+ S +Q+GT FSYCL D+ S + FG
Sbjct: 267 HKNVGLFIGAAGLLGLGAG---ALSFPNQIGTQTGHTFSYCLVDRESDSSGPLQFGPKSV 323
Query: 211 GAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFV---------SSSTGNIFVDTGVLR 257
G + TPL H YYLS+ AISVG L+ + +S G +D+G +
Sbjct: 324 PVGSIFTPLEKNPHLPTFYYLSVTAISVGGALLDSIPPEVFRIDETSGHGGFIIDSGTVV 383
Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNISSQP--KFPEVTIHF 311
T L + ++ V G G P V CY++S P V HF
Sbjct: 384 TRLVTSAYDAVRDAF--------VAGTGQLPRTDAVSIFDTCYDLSGLQFVSVPTVGFHF 435
Query: 312 RGADVKLSPSNLFRNISDEI--MCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKP 368
+ P+ + D + C AF +++ + G Q + + +D ++V F
Sbjct: 436 SNGASLILPAKNYLIPMDTVGTFCFAFAPAASSVSIMGNTQQQHIRVSFDSANSLVGFAF 495
Query: 369 SRC 371
+C
Sbjct: 496 DQC 498
>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 424
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 116/389 (29%), Positives = 185/389 (47%), Gaps = 38/389 (9%)
Query: 6 KLPFYNDNETPKSPISIIYQAEIISV---DDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQ 62
K+ + + TP S + ++ ++ +L ++SIG PPV +DTGSD TW
Sbjct: 46 KIGYLHSKSTPASRLDNLWTVSHVTPIPNPAAFLANISIGNPPVPQLLLIDTGSDLTWIH 105
Query: 63 CEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAV--VTSNCSEGDCSYSFLYGRGAY 120
C PC C+ Q P F P +SSTY + SC S+ A+ + + G+C Y Y
Sbjct: 106 CLPC---KCYPQTIPFFHPSRSSTYRNASCVSAPHAMPQIFRDEKTGNCQYHLRYRD--- 159
Query: 121 ASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLI 180
S + G LA E LTF ++ + N++FGCG N + +K +G++GLGPG S++
Sbjct: 160 FSNTRGILAEEKLTFETSDDGLISKQNIVFGCGQDN----SGFTKYSGVLGLGPGTFSIV 215
Query: 181 SQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGVV----STPL-IIRDHYYLSLEAISVG 235
++ + KFSYC + ++ G G TPL I +D YYL L+AIS G
Sbjct: 216 TR---NFGSKFSYCFGSLTNPTYPHNILILGNGAKIEGDPTPLQIFQDRYYLDLQAISFG 272
Query: 236 NQRLEF------VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPG 289
+ L+ S G +DTG T+L E + L + + + + ++ V
Sbjct: 273 EKLLDIEPGTFQRYRSQGGTVIDTGCSPTILAREAYETLSEEI-DFLLGEVLRRVKDWDQ 331
Query: 290 FSDVLCYNISSQPK---FPEVTIHFR-GADVKLSPSNLF-RNISDEIMCSAFRGGNAN-- 342
++ CY + + FP VT HF GA++ L +LF + S + C A +
Sbjct: 332 YT-TPCYEGNLKLDLYGFPVVTFHFAGGAELALDVESLFVSSESGDSFCLAMTMNTFDDM 390
Query: 343 IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
V G + Q N+ +GY++ V F+ + C
Sbjct: 391 SVIGAMAQQNYNVGYNLRTMKVYFQRTDC 419
>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 141 bits (355), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 106/356 (29%), Positives = 158/356 (44%), Gaps = 35/356 (9%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + + +G+PP + +D+GSD W QC+PC + C+ Q P+FDP S+++ +SCSS
Sbjct: 140 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQ--CYHQSDPVFDPADSASFTGVSCSS 197
Query: 95 SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
S C + + C G C Y YG G+Y + G LA ETLTF T + +V GCG
Sbjct: 198 SVCDRLENAGCHAGRCRYEVSYGDGSY---TKGTLALETLTFGRTM-----VRSVAIGCG 249
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG---SSKINFGGIVA 210
H+N + G+ G + S + Q+G G FSYCL +G S + FG
Sbjct: 250 HRNRGMFVGAAGLLGLGGG---SMSFVGQLGGQTGGAFSYCLVSRGTDSSGSLVFGREAL 306
Query: 211 GAGVVSTPLIIRDH----YYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTL 259
AG PL+ YY+ L + VG R+ G + +DTG T
Sbjct: 307 PAGAAWVPLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMDTGTAVTR 366
Query: 260 LP-LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKL 318
LP L Y + + ++ GV D+L + + P V+ +F G +
Sbjct: 367 LPTLAYQAFRDAFLAQTANLPRATGVAIFDTCYDLLGF---VSVRVPTVSFYFSGGPILT 423
Query: 319 SPSNLFRNISDE--IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
P+ F D+ C AF + + + G I Q I +D V F P+ C
Sbjct: 424 LPARNFLIPMDDAGTFCFAFAPSTSGLSILGNIQQEGIQISFDGANGYVGFGPNIC 479
>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
Length = 430
Score = 141 bits (355), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 113/373 (30%), Positives = 175/373 (46%), Gaps = 54/373 (14%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCS 93
++ ++ S+G PPV F +DTGS W QC PC P+F+P SST+ SC
Sbjct: 67 LFFVNFSVGQPPVPQFTIMDTGSSLLWIQCHPCKHCSSNHMIHPVFNPALSSTFVECSCD 126
Query: 94 SSQCAVV-TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
C +CS C Y +Y G + S G LA E LTF + +G V + FGC
Sbjct: 127 DRFCRYAPNGHCSSNKCVYEQVYISG---TGSKGVLAKERLTFTTPNGNTVVTQPIAFGC 183
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAG- 211
GH+N +S+ TGI+GLG +SL Q+G+ KFSYC+ D + + +V G
Sbjct: 184 GHEN--GEQLESEFTGILGLGAKPTSLAVQLGS----KFSYCIGDLANKNYGYNQLVLGE 237
Query: 212 -AGVVSTPLIIRDH-----YYLSLEAISVGNQRLEF-------VSSSTGNIFVDTGVLRT 258
A ++ P I YY++LE ISVG+++L S TG + +DTG L T
Sbjct: 238 DADILGDPTPIEFETENGIYYMNLEGISVGDKQLNIEPVVFKRRGSRTG-VILDTGTLYT 296
Query: 259 LLP----LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK---FPEVTIHF 311
L E ++ +KS++ ++ F D LCY+ + FP VT HF
Sbjct: 297 WLADIAYRELYNEIKSILDPKLE---------RFWFRDFLCYHGRVNEELIGFPVVTFHF 347
Query: 312 R-GADVKLSPSNLFRNISD-----EIMCSAFR-----GGNAN--IVYGRIMQINFLIGYD 358
GA++ + +++F +++ + C + R GG G + Q + I YD
Sbjct: 348 AGGAELAMEATSMFYPMTESDTYHNVFCMSVRPTTEHGGEYKDFTAIGLMAQQYYNIAYD 407
Query: 359 IEQAMVSFKPSRC 371
+++ + + C
Sbjct: 408 LKERNIYLQRIDC 420
>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 471
Score = 141 bits (355), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 115/360 (31%), Positives = 166/360 (46%), Gaps = 41/360 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +GTPP ++ +DTGSD W QC PC +C+ Q P+F+P KS ++ + C +
Sbjct: 129 YFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCK--NCYSQTDPVFNPVKSGSFAKVLCRT 186
Query: 95 SQCAVVTS-NCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
C + S C++ C Y YG G+Y ++G TETLTF T ++ V GC
Sbjct: 187 PLCRRLESPGCNQRQTCLYQVSYGDGSY---TTGEFVTETLTFRRT-----KVEQVALGC 238
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS----SKINFGGI 208
GH N + G+ S SQ G + KFSYCL D+ + S + FG
Sbjct: 239 GHDNEGLFVGAAGLLGLGRG---GLSFPSQAGRTFNQKFSYCLVDRSASSKPSSVVFGNS 295
Query: 209 VAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS------TGN--IFVDTGVL 256
TPL+ + YY+ L ISVG + +++S TGN + +D G
Sbjct: 296 AVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTS 355
Query: 257 RTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP--KFPEVTIHFRG 313
T L + L+ A +K + P FS CY++S + K P V +HFRG
Sbjct: 356 VTRLNKPAYIALRDAF--RAGASSLK---SAPEFSLFDTCYDLSGKTTVKVPTVVLHFRG 410
Query: 314 ADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
ADV L SN + C AF G + + + G I Q F + YD+ + V F P C
Sbjct: 411 ADVSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGC 470
>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 510
Score = 141 bits (355), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 117/375 (31%), Positives = 171/375 (45%), Gaps = 51/375 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YL+ + +GTPP +DTGSD W QC PC LDCF Q P+FDP S++Y +++C
Sbjct: 150 YLVEVYVGTPPRRFQMIMDTGSDLNWLQCAPC--LDCFDQRGPVFDPMASTSYRNVTCGD 207
Query: 95 SQCAVV-------TSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
++C +V T S D C Y + YG S ++G+LA E T N T+ +
Sbjct: 208 TRCGLVSPPAAPRTCRSSRSDPCPYYYWYGD---QSNTTGDLALEAFTVNLTASSSRRVD 264
Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKI 203
V+ GCGH+N + G+ S SQ+ FSYCL D GS SKI
Sbjct: 265 GVVLGCGHRNRGLFHGAAGLLGLGRG---PLSFASQLRAVYGHAFSYCLVDHGSAVGSKI 321
Query: 204 NFGGIVAGAGVVSTPLI----------IRDHYYLSLEAISVGNQRLEFVSSS-------- 245
FG ++S P + YY+ L+ I VG + L+ S++
Sbjct: 322 VFGDDNV---LLSHPQLNYTAFAPSAAENTFYYVQLKGILVGGEMLDIPSNTWGVSKEDG 378
Query: 246 TGNIFVDTGVLRTLLPL-EYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQP 302
+G +D+G + P Y + ++ + M KA P+ + P S CYN+S +
Sbjct: 379 SGGTIIDSGTTLSYFPEPAYKAIRQAFVDRMDKAYPL--IADFPVLSP--CYNVSGVERV 434
Query: 303 KFPEVTIHFR-GADVKLSPSNLFRNISDE-IMCSAFRG--GNANIVYGRIMQINFLIGYD 358
+ PE ++ F GA N F + E IMC A G +A + G Q NF + YD
Sbjct: 435 EVPEFSLLFADGAVWDFPAENYFIRLDTEGIMCLAVLGTPRSAMSIIGNYQQQNFHVLYD 494
Query: 359 IEQAMVSFKPSRCTN 373
+ + F P RC
Sbjct: 495 LHHNRLGFAPRRCAE 509
>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
Length = 496
Score = 141 bits (355), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 116/360 (32%), Positives = 167/360 (46%), Gaps = 41/360 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + IGTP + + +DTGSD W QCEPC E C+ Q P+F+P S +++++ C S
Sbjct: 154 YFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRE--CYSQADPIFNPSSSVSFSTVGCDS 211
Query: 95 SQCAVVTSN-CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
+ C+ + +N C G C Y YG G+Y + G+ ATETLTF +TS + NV GCG
Sbjct: 212 AVCSQLDANDCHGGGCLYEVSYGDGSY---TVGSYATETLTFGTTS-----IQNVAIGCG 263
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD---QGSSKINFGGIVA 210
H N+ + G+ + S +Q+GT FSYCL D + S + FG
Sbjct: 264 HDNVGLFVGAAGLLGLGAG---SLSFPAQLGTQTGRAFSYCLVDRDSESSGTLEFGPESV 320
Query: 211 GAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSST---------GNIFVDTGVLR 257
G + TPL+ + YYLS+ AISVG L+ V S G I +D+G
Sbjct: 321 PIGSIFTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGIIIDSGTAV 380
Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISS--QPKFPEVTIHF-RG 313
T L + L+ + P G S CY++S+ P V HF G
Sbjct: 381 TRLQTSAYDALRDAFIAGTQHLP-----RADGISIFDTCYDLSALQSVSIPAVGFHFSNG 435
Query: 314 ADVKLSPSN-LFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
A L N L S C AF ++N+ + G I Q + +D ++V F +C
Sbjct: 436 AGFILPAKNCLIPMDSMGTFCFAFAPADSNLSIMGNIQQQGIRVSFDSANSLVGFAIDQC 495
>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 488
Score = 141 bits (355), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 114/347 (32%), Positives = 170/347 (48%), Gaps = 38/347 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + + +GTP D+ DTGSD TWTQCEPC C+KQ+ +FDP KS++Y++I+C+S
Sbjct: 145 YFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCAR-SCYKQQDAIFDPSKSTSYSNITCTS 203
Query: 95 SQCAVVTS--------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
+ C +++ + S C Y YG +SFS G + E L+ +T +
Sbjct: 204 TLCTQLSTATGNEPGCSASTKACIYGIQYGD---SSFSVGYFSRERLSVTATD----IVD 256
Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSS--KIN 204
N +FGCG N + G+IGLG S + Q FSYCLP SS +++
Sbjct: 257 NFLFGCGQNNQGLFGGSA---GLIGLGRHPISFVQQTAAVYRKIFSYCLPATSSSTGRLS 313
Query: 205 FGGIVAGAGVVSTPL--IIR--DHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRT 258
F G + V TP I R Y L + ISVG +L SS STG +D+G + T
Sbjct: 314 F-GTTTTSYVKYTPFSTISRGSSFYGLDITGISVGGAKLPVSSSTFSTGGAIIDSGTVIT 372
Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKF--PEVTIHFRGA-D 315
LP ++ L+S + P G E D CY++S F P++ F G
Sbjct: 373 RLPPTAYTALRSAFRQGMSKYPSAG---ELSILDT-CYDLSGYEVFSIPKIDFSFAGGVT 428
Query: 316 VKLSPSNLFRNISDEIMCSAF--RGGNANI-VYGRIMQINFLIGYDI 359
V+L P + S + +C AF G ++++ +YG + Q + YD+
Sbjct: 429 VQLPPQGILYVASAKQVCLAFAANGDDSDVTIYGNVQQKTIEVVYDV 475
>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
[Cucumis sativus]
Length = 384
Score = 140 bits (354), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 115/360 (31%), Positives = 166/360 (46%), Gaps = 41/360 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +GTPP ++ +DTGSD W QC PC +C+ Q P+F+P KS ++ + C +
Sbjct: 42 YFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCK--NCYSQTDPVFNPVKSGSFAKVLCRT 99
Query: 95 SQCAVVTS-NCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
C + S C++ C Y YG G+Y ++G TETLTF T ++ V GC
Sbjct: 100 PLCRRLESPGCNQRQTCLYQVSYGDGSY---TTGEFVTETLTFRRT-----KVEQVALGC 151
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS----SKINFGGI 208
GH N + G+ S SQ G + KFSYCL D+ + S + FG
Sbjct: 152 GHDNEGLFVGAAGLLGLGRG---GLSFPSQAGRTFNQKFSYCLVDRSASSKPSSVVFGNS 208
Query: 209 VAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS------TGN--IFVDTGVL 256
TPL+ + YY+ L ISVG + +++S TGN + +D G
Sbjct: 209 AVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTS 268
Query: 257 RTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP--KFPEVTIHFRG 313
T L + L+ A +K + P FS CY++S + K P V +HFRG
Sbjct: 269 VTRLNKPAYIALRDAF--RAGASSLK---SAPEFSLFDTCYDLSGKTTVKVPTVVLHFRG 323
Query: 314 ADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
ADV L SN + C AF G + + + G I Q F + YD+ + V F P C
Sbjct: 324 ADVSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGC 383
>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
Length = 350
Score = 140 bits (354), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 115/360 (31%), Positives = 167/360 (46%), Gaps = 41/360 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + IGTP + + +DTGSD W QCEPC E C+ Q P+F+P S +++++ C S
Sbjct: 8 YFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRE--CYSQADPIFNPSSSVSFSTVGCDS 65
Query: 95 SQCAVVTSN-CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
+ C+ + +N C G C Y YG G+Y + G+ ATETLTF +TS + NV GCG
Sbjct: 66 AVCSQLDANDCHGGGCLYEVSYGDGSY---TVGSYATETLTFGTTS-----IQNVAIGCG 117
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD---QGSSKINFGGIVA 210
H N+ + G+ + S +Q+GT FSYCL D + S + FG
Sbjct: 118 HDNVGLFVGAAGLLGLGAG---SLSFPAQLGTQTGRAFSYCLVDRDSESSGTLEFGPESV 174
Query: 211 GAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSST---------GNIFVDTGVLR 257
G + TPL+ + YYLS+ AISVG L+ V S G I +D+G
Sbjct: 175 PIGSIFTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGIIIDSGTAV 234
Query: 258 TLLPLEYHSNLKSVMSNMIKAQP-VKGVGAEPGFSDVLCYNISS--QPKFPEVTIHF-RG 313
T L + L+ + P G+ CY++S+ P V HF G
Sbjct: 235 TRLQTSAYDALRDAFIAGTQHLPRADGISIFD-----TCYDLSALQSVSIPAVGFHFSNG 289
Query: 314 ADVKLSPSN-LFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
A L N L S C AF ++N+ + G I Q + +D ++V F +C
Sbjct: 290 AGFILPAKNCLIPMDSMGTFCFAFAPADSNLSIMGNIQQQGIRVSFDSANSLVGFAIDQC 349
>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
Length = 499
Score = 140 bits (354), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 112/359 (31%), Positives = 168/359 (46%), Gaps = 43/359 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +G P + +DTGSD W QC+PC DC++Q P+FDP SSTY ++C S
Sbjct: 161 YFTRVGVGNPARQFYMVLDTGSDINWLQCQPCT--DCYQQTDPIFDPTASSTYAPVTCQS 218
Query: 95 SQCAVV-TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
QC+ + S+C G C Y YG G+Y + G+ ATE+++F ++ + NV GCG
Sbjct: 219 QQCSSLEMSSCRSGQCLYQVNYGDGSY---TFGDFATESVSFGNSG----SVKNVALGCG 271
Query: 154 HKN--LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ---GSSKINFGGI 208
H N L + G L SL +Q+ A FSYCL ++ GSS ++F
Sbjct: 272 HDNEGLFVGAAGLLGLGGGPL-----SLTNQLK---ATSFSYCLVNRDSAGSSTLDFNSA 323
Query: 209 VAGAGVVSTPLI----IRDHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLR 257
G V+ PL+ I YY+ L +SVG Q + S G I VD G
Sbjct: 324 QLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIVDCGTAI 383
Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFR-GA 314
T L + ++ L+ M Q +K A F CY++S Q + P V+ HF G
Sbjct: 384 TRLQTQAYNPLRDAFVRM--TQNLKLTSAVALFD--TCYDLSGQASVRVPTVSFHFADGK 439
Query: 315 DVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
L +N + S C AF +++ + G + Q + +D+ + F P++C
Sbjct: 440 SWNLPAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVTFDLANNRMGFSPNKC 498
>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
Length = 510
Score = 140 bits (354), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 116/375 (30%), Positives = 171/375 (45%), Gaps = 50/375 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YL+ + +GTPP +DTGSD W QC PC LDCF+Q P+FDP SS+Y +++C
Sbjct: 149 YLIDVYVGTPPRRFRMIMDTGSDLNWLQCAPC--LDCFEQRGPVFDPAASSSYRNVTCGD 206
Query: 95 SQCAVVT--------SNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTS-GLPVEM 145
+C +V +E C Y + YG S ++G+LA E+ T N T+ G +
Sbjct: 207 QRCGLVAPPEAPRACRRPAEDSCPYYYWYGD---QSNTTGDLALESFTVNLTAPGASRRV 263
Query: 146 PNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SK 202
V+FGCGH+N + G+ S SQ+ FSYCL + GS SK
Sbjct: 264 DGVVFGCGHRNRGLFHGAAGLLGLGRG---PLSFASQLRAVYGHTFSYCLVEHGSDAGSK 320
Query: 203 INFGG---IVAGAGVVSTPLI-----IRDHYYLSLEAISVGNQRLEFVSSS-------TG 247
+ FG ++A + T YY+ L+ + VG L S + +G
Sbjct: 321 VVFGEDYLVLAHPQLKYTAFAPTSSPADTFYYVKLKGVLVGGDLLNISSDTWDVGKDGSG 380
Query: 248 NIFVDTG-VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNIS--SQP 302
+D+G L + Y ++ + M + P+ P F VL CYN+S +P
Sbjct: 381 GTIIDSGTTLSYFVEPAYQVIRQAFVDLMSRLYPLI-----PDFP-VLNPCYNVSGVERP 434
Query: 303 KFPEVTIHFR-GADVKLSPSNLFRNIS-DEIMCSAFRG--GNANIVYGRIMQINFLIGYD 358
+ PE+++ F GA N F + D IMC A RG + G Q NF + YD
Sbjct: 435 EVPELSLLFADGAVWDFPAENYFVRLDPDGIMCLAVRGTPRTGMSIIGNFQQQNFHVVYD 494
Query: 359 IEQAMVSFKPSRCTN 373
++ + F P RC
Sbjct: 495 LQNNRLGFAPRRCAE 509
>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
Length = 482
Score = 140 bits (354), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 122/365 (33%), Positives = 174/365 (47%), Gaps = 49/365 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ GTP + +DTGSD TW QC+PC DC+ Q +F+PK+SS+Y ++ C S
Sbjct: 137 YIVTAGFGTPAKNSLLIIDTGSDLTWIQCKPCA--DCYSQVDAIFEPKQSSSYKTLPCLS 194
Query: 95 SQCA-VVTSN-----CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
+ C ++TS C G C Y YG G S S G+ + ETLT S S N
Sbjct: 195 ATCTELITSESNPTPCLLGGCVYEINYGDG---SSSQGDFSQETLTLGSDS-----FQNF 246
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD------QGSSK 202
FGCGH N S G++GLG + S SQ + G+F+YCLPD GS
Sbjct: 247 AFGCGHTNTGLFKGSS---GLLGLGQNSLSFPSQSKSKYGGQFAYCLPDFGSSTSTGSFS 303
Query: 203 INFGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSS--STGNIFVDTG-V 255
+ G I A A V TPL+ Y++ L ISVG RL + G+ VD+G V
Sbjct: 304 VGKGSIPASA--VFTPLVSNFMYPTFYFVGLNGISVGGDRLSIPPAVLGRGSTIVDSGTV 361
Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFR- 312
+ LLP Y++ LK+ + + P A+P CY++S SQ + P +T HF+
Sbjct: 362 ITRLLPQAYNA-LKTSFRSKTRDLP----SAKPFSILDTCYDLSRHSQVRIPTITFHFQN 416
Query: 313 GADVKLSPSNLFRNISD--EIMCSAFRGGNA----NIVYGRIMQINFLIGYDIEQAMVSF 366
ADV +S + + + +C AF + NI+ G Q + +D + F
Sbjct: 417 NADVAVSDVGILVPVQNGGSQVCLAFASASQMDGFNII-GNFQQQRMRVAFDTGAGRIGF 475
Query: 367 KPSRC 371
C
Sbjct: 476 ASGSC 480
>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
Length = 466
Score = 140 bits (354), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 122/360 (33%), Positives = 181/360 (50%), Gaps = 38/360 (10%)
Query: 30 SVDDI-YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYN 88
S+D + YL+ + +G+P +DTGSD +W QC+PC + C Q PLFDP SSTY+
Sbjct: 127 SLDTLEYLITVRLGSPGKSQTMLIDTGSDVSWVQCKPCSQ--CHSQADPLFDPSSSSTYS 184
Query: 89 SISCSSSQCAVV---TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
SCSS+ CA + + CS C Y+ YG G S ++G +++TL S + +
Sbjct: 185 PFSCSSAACAQLGQEGNGCSSSQCQYTVTYGDG---SSTTGTYSSDTLALGSNA-----V 236
Query: 146 PNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINF 205
FGC N+ S +D + G++GLG G SL+SQ + FSYCLP SS F
Sbjct: 237 RKFQFGC--SNVESGFND-QTDGLMGLGGGAQSLVSQTAGTFGAAFSYCLPATSSSS-GF 292
Query: 206 GGIVAG-AGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRT 258
+ AG +G V TP++ + Y + ++AI VG ++L +S S G I +D+G + T
Sbjct: 293 LTLGAGTSGFVKTPMLRSSQVPTFYGVRIQAIRVGGRQLSIPTSVFSAGTI-MDSGTVLT 351
Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEP-GFSDVLCYNISSQP--KFPEVTIHFR-GA 314
LP +S L S +K P A P G D C++ S Q P V + F GA
Sbjct: 352 RLPPTAYSALSSAFKAGMKQYP----SAPPSGILDT-CFDFSGQSSVSIPTVALVFSGGA 406
Query: 315 DVKLSPSNLFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
V ++ + S+ I+C AF + + + G + Q F + YD+ V FK C
Sbjct: 407 VVDIASDGIMLQTSNSILCLAFAANSDDSSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 466
>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
Japonica Group]
Length = 446
Score = 140 bits (354), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 107/374 (28%), Positives = 168/374 (44%), Gaps = 48/374 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +GTP +DTGSD W QC PC C+ Q +FDP++SSTY + CSS
Sbjct: 86 YFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRR--CYAQRGQVFDPRRSSTYRRVPCSS 143
Query: 95 SQCAVV------TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
QC + + + G C Y YG G S S+G+LAT+ L F + + + NV
Sbjct: 144 PQCRALRFPGCDSGGAAGGGCRYMVAYGDG---SSSTGDLATDKLAFANDT----YVNNV 196
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ-----GSSKI 203
GCG N S G++G+G G S+ +Q+ + F YCL D+ SS +
Sbjct: 197 TLGCGRDNEGLFDS---AAGLLGVGRGKISISTQVAPAYGSVFEYCLGDRTSRSTRSSYL 253
Query: 204 NFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSST---------GNIF 250
FG T L+ YY+ + SVG +R+ S+++ G +
Sbjct: 254 VFGRTPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLALDTATGRGGVV 313
Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVT 308
VD+G + + ++ L+ +A ++ + E D CY++ +P P +
Sbjct: 314 VDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDA-CYDLRGRPAASAPLIV 372
Query: 309 IHFR-GADVKLSPSNLF-------RNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDI 359
+HF GAD+ L P N F R + C F + + V G + Q F + +D+
Sbjct: 373 LHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDGLSVIGNVQQQGFRVVFDV 432
Query: 360 EQAMVSFKPSRCTN 373
E+ + F P CT+
Sbjct: 433 EKERIGFAPKGCTS 446
>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 140 bits (354), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 119/366 (32%), Positives = 164/366 (44%), Gaps = 46/366 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y M L +GTP +++ +DTGSD W QC PC C+ Q +FDPKKS T+ ++ C S
Sbjct: 135 YFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKA--CYNQTDAIFDPKKSKTFATVPCGS 192
Query: 95 SQCAVV--TSNC---SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
C + +S C C Y YG G SF+ G+ +TETLTF+ + +V
Sbjct: 193 RLCRRLDDSSECVTRRSKTCLYQVSYGDG---SFTEGDFSTETLTFHG-----ARVDHVP 244
Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--------GSS 201
GCGH N + G+ S SQ GKFSYCL D+ S
Sbjct: 245 LGCGHDNEGLFVGAAGLLGLGRG---GLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPS 301
Query: 202 KINFGGIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS------TGN--I 249
I FG V TPL+ + YYL L ISVG R+ VS S TGN +
Sbjct: 302 TIVFGNAAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGV 361
Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEV 307
+D+G T L + L+ + A +K + F C+++S + K P V
Sbjct: 362 IIDSGTSVTRLTQPAYVALRDAF--RLGATKLKRAPSYSLFDT--CFDLSGMTTVKVPTV 417
Query: 308 TIHFRGADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVS 365
HF G +V L SN ++ E C AF G ++ + G I Q F + YD+ + V
Sbjct: 418 VFHFGGGEVSLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGSRVG 477
Query: 366 FKPSRC 371
F C
Sbjct: 478 FLSRAC 483
>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 140 bits (353), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 112/359 (31%), Positives = 168/359 (46%), Gaps = 43/359 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +G P + +DTGSD W QC+PC DC++Q P+FDP SSTY ++C S
Sbjct: 20 YFTRVGVGNPARQFYMVLDTGSDINWLQCQPC--TDCYQQTDPIFDPTASSTYAPVTCQS 77
Query: 95 SQCAVV-TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
QC+ + S+C G C Y YG G+Y + G+ ATE+++F ++ + NV GCG
Sbjct: 78 QQCSSLEMSSCRSGQCLYQVNYGDGSY---TFGDFATESVSFGNSG----SVKNVALGCG 130
Query: 154 HKN--LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ---GSSKINFGGI 208
H N L + G L SL +Q+ A FSYCL ++ GSS ++F
Sbjct: 131 HDNEGLFVGAAGLLGLGGGPL-----SLTNQLK---ATSFSYCLVNRDSAGSSTLDFNSA 182
Query: 209 VAGAGVVSTPLI----IRDHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLR 257
G V+ PL+ I YY+ L +SVG Q + S G I VD G
Sbjct: 183 QLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIVDCGTAI 242
Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFR-GA 314
T L + ++ L+ M Q +K A F CY++S Q + P V+ HF G
Sbjct: 243 TRLQTQAYNPLRDAFVRM--TQNLKLTSAVALFD--TCYDLSGQASVRVPTVSFHFADGK 298
Query: 315 DVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
L +N + S C AF +++ + G + Q + +D+ + F P++C
Sbjct: 299 SWNLPAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVTFDLANNRMGFSPNKC 357
>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 119/366 (32%), Positives = 165/366 (45%), Gaps = 46/366 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y M L +GTP +++ +DTGSD W QC PC C+ Q +FDPKKS T+ ++ C S
Sbjct: 138 YFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKA--CYNQSDVIFDPKKSKTFATVPCGS 195
Query: 95 SQCAVV--TSNC---SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
C + +S C C Y YG G SF+ G+ +TETLTF+ + +V
Sbjct: 196 RLCRRLDDSSECVTRRSKTCLYQVSYGDG---SFTEGDFSTETLTFHG-----ARVDHVP 247
Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--------GSS 201
GCGH N + G+ S SQ + GKFSYCL D+ S
Sbjct: 248 LGCGHDNEGLFVGAAGLLGLGRG---GLSFPSQTKSRYNGKFSYCLVDRTSSGSSSKPPS 304
Query: 202 KINFGGIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS------TGN--I 249
I FG V TPL+ + YYL L ISVG R+ VS S TGN +
Sbjct: 305 TIVFGNDAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGV 364
Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEV 307
+D+G T L + L+ + A +K + F C+++S + K P V
Sbjct: 365 IIDSGTSVTRLTQSAYVALRDAF--RLGATKLKRAPSYSLFDT--CFDLSGMTTVKVPTV 420
Query: 308 TIHFRGADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVS 365
HF G +V L SN ++ E C AF G ++ + G I Q F + YD+ + V
Sbjct: 421 VFHFGGGEVSLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGSRVG 480
Query: 366 FKPSRC 371
F C
Sbjct: 481 FLSRAC 486
>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
Length = 359
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 118/369 (31%), Positives = 179/369 (48%), Gaps = 51/369 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+M LSIGTPP I +DTGSD W +C+ C D +F SS+Y + C+S
Sbjct: 5 YMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKLPCNS 64
Query: 95 SQCAVVTS-----NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE---MP 146
+ C+ ++S C E C Y + YG G S +SG++ ++ ++F S
Sbjct: 65 THCSGMSSAGIGPRCEE-TCKYKYEYGDG---SRTSGDVGSDRISFRSHGAGEDHRSFFD 120
Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----PDQGSS 201
+FGCG K L + ++ G+IGLG + SLI Q+G + KFSYCL P S
Sbjct: 121 GFLFGCGRK-LKGDWNFTQ--GLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKS 177
Query: 202 KINFGGIVA--GAGVVSTPLIIRDH-----YYLSLEAISVGNQRLEFVSSSTGN------ 248
+ G A G VVSTP++ DH YY+ L++I+VG + +G+
Sbjct: 178 FLFLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITVGGVPVVVYDKESGHNTSVGP 237
Query: 249 -----IFVDTGVLRTLL-PLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP 302
+D+G TLL P Y + KS+ +I + +G G LC+N S
Sbjct: 238 FLANKTVIDSGTTYTLLTPPVYEAMRKSIEEQVI----LPTLGNSAGLD--LCFNSSGDT 291
Query: 303 K--FPEVTIHFRG-ADVKLSPSNLFRNISDEIMCSAF--RGGNANIVYGRIMQINFLIGY 357
FP VT +F + L N+F+ S +++C + GG+ +I+ G + Q NF I Y
Sbjct: 292 SYGFPSVTFYFANQVQLVLPFENIFQVTSRDVVCLSMDSSGGDLSII-GNMQQQNFHILY 350
Query: 358 DIEQAMVSF 366
D+ + +SF
Sbjct: 351 DLVASQISF 359
>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 116/364 (31%), Positives = 177/364 (48%), Gaps = 48/364 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP + DTGSD TWTQCEPC + C+KQ+ P +P S++Y +ISCSS
Sbjct: 119 YVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKT-CYKQKEPRLNPSTSTSYKNISCSS 177
Query: 95 SQCAVVTS------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
+ C +V S +CS C Y YG G+Y S G ATETLT +S++ N
Sbjct: 178 ALCKLVASGKKFSQSCSSSTCLYQVQYGDGSY---SIGFFATETLTLSSSN----VFKNF 230
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFG 206
+FGCG +N + G+ +L SQ + FSYCLP SSK ++ G
Sbjct: 231 LFGCGQQNNGLFGGAAGLLGLGRT---KLALPSQTAKTYKKLFSYCLPASSSSKGYLSLG 287
Query: 207 GIVAGAGVVSTPLII----RDHYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLLP 261
G V+ + V TPL Y L + +SVG ++L S+ + +D+G + T L
Sbjct: 288 GQVSKS-VKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAFSAGTVIDSGTVITRLS 346
Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP--KFPEVTIHFRGA---D 315
+S L S N++ P + G+S CY+ S + P+V + F+G D
Sbjct: 347 PTAYSELSSAFQNLMTDYP-----STSGYSIFDTCYDFSKYDTVRIPKVGVTFKGGVEMD 401
Query: 316 VKLS----PSNLFRNISDEIMCSAFRGGNAN---IVYGRIMQINFLIGYDIEQAMVSFKP 368
+ +S P N + + C AF G + + ++G + Q + + YD + V F P
Sbjct: 402 IDVSGILYPVNGLKKV-----CLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGFAP 456
Query: 369 SRCT 372
C+
Sbjct: 457 GGCS 460
>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 367
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 110/364 (30%), Positives = 162/364 (44%), Gaps = 39/364 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +GTP D++ VDTGSD TW QC PC +C+KQ+ LF+P SS++ + CSS
Sbjct: 16 YFAVVGVGTPRRDMYLVVDTGSDITWLQCAPC--TNCYKQKDALFNPSSSSSFKVLDCSS 73
Query: 95 SQCA-VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGL-PVEMPNVIFGC 152
S C + C C Y YG G SF+ G L T+ + + G V + N+ GC
Sbjct: 74 SLCLNLDVMGCLSNKCLYQADYGDG---SFTMGELVTDNVVLDDAFGPGQVVLTNIPLGC 130
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGA 212
GH N + + GI+GLG G S + + S FSYCLPD+ S + +V G
Sbjct: 131 GHDNEGTFGT---AAGILGLGRGPLSFPNNLDASTRNIFSYCLPDRESDPNHKSTLVFGD 187
Query: 213 GVV------STPLI-------IRDHYYLSLEAISVGNQRL--------EFVSSSTGNIFV 251
+ S I + +YY+ + ISVG L + S G
Sbjct: 188 AAIPHTATGSVKFIPQLRNPRVATYYYVQITGISVGGNLLTNIPASVFQLDSHGNGGTIF 247
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKF--PEVTI 309
D+G T L ++ ++ + +A + A CY+ + P VT
Sbjct: 248 DSGTTITRLEARAYTAVR----DAFRAATMHLTSAADFKIFDTCYDFTGMNSISVPTVTF 303
Query: 310 HFRG-ADVKLSPSNLFRNIS-DEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFK 367
HF+G D++L PSN +S + I C AF V G + Q +F + YD +
Sbjct: 304 HFQGDVDMRLPPSNYIVPVSNNNIFCFAFAASMGPSVIGNVQQQSFRVIYDNVHKQIGLL 363
Query: 368 PSRC 371
P +C
Sbjct: 364 PDQC 367
>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
Length = 436
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 119/407 (29%), Positives = 182/407 (44%), Gaps = 59/407 (14%)
Query: 2 QNSQKLPFYNDNETPKSPI----SIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSD 57
++S ++ F +D S+ +QA + + Y M++S+GTP + DTGSD
Sbjct: 49 RDSHRIAFLSDATAAGKATTTNSSVSFQALLENGVGGYNMNISVGTPLLTFSVVADTGSD 108
Query: 58 CTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNCSEGDCSYSFL 114
WTQC PC + CF+Q P F P SST++ + C+SS C + C+ C Y++
Sbjct: 109 LIWTQCAPCTK--CFQQPAPPFQPASSSTFSKLPCTSSFCQFLPNSIRTCNATGCVYNYK 166
Query: 115 YGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGP 174
YG G +++G LATETL S P+V FGC +N + +GI GLG
Sbjct: 167 YGSG----YTAGYLATETLKVGDAS-----FPSVAFGCSTENGVG----NSTSGIAGLGR 213
Query: 175 GNSSLISQMGTSIAGKFSYCLPD---QGSSKINFG-------GIVAGAGVVSTPLIIRDH 224
G SLI Q+G G+FSYCL G+S I FG G V V+ P + +
Sbjct: 214 GALSLIPQLG---VGRFSYCLRSGSAAGASPILFGSLANLTDGNVQSTPFVNNPAVHPSY 270
Query: 225 YYLSLEAISVGNQRLEFVSSS--------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMI 276
YY++L I+VG L +S+ G VD+G T L + + +K +
Sbjct: 271 YYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAF--LS 328
Query: 277 KAQPVKGVGAEPGFSDVLCYNISSQP----KFPEVTIHFRGADVKLSPSNLFRNISDE-- 330
+ V V G LC+ + P + + F G P+ +D
Sbjct: 329 QTADVTTVNGTRGLD--LCFKSTGGGGGGIAVPSLVLRFDGGAEYAVPTYFAGVETDSQG 386
Query: 331 ------IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+M +G V G +MQ++ + YD++ + SF P+ C
Sbjct: 387 SVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFAPADC 433
>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
Length = 472
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 116/364 (31%), Positives = 177/364 (48%), Gaps = 48/364 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP + DTGSD TWTQCEPC + C+KQ+ P +P S++Y +ISCSS
Sbjct: 131 YVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKT-CYKQKEPRLNPSTSTSYKNISCSS 189
Query: 95 SQCAVVTS------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
+ C +V S +CS C Y YG G+Y S G ATETLT +S++ N
Sbjct: 190 ALCKLVASGKKFSQSCSSSTCLYQVQYGDGSY---SIGFFATETLTLSSSN----VFKNF 242
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFG 206
+FGCG +N + G+ +L SQ + FSYCLP SSK ++ G
Sbjct: 243 LFGCGQQNNGLFGGAAGLLGLGRT---KLALPSQTAKTYKKLFSYCLPASSSSKGYLSLG 299
Query: 207 GIVAGAGVVSTPLII----RDHYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLLP 261
G V+ + V TPL Y L + +SVG ++L S+ + +D+G + T L
Sbjct: 300 GQVSKS-VKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAFSAGTVIDSGTVITRLS 358
Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP--KFPEVTIHFRGA---D 315
+S L S N++ P + G+S CY+ S + P+V + F+G D
Sbjct: 359 PTAYSELSSAFQNLMTDYP-----STSGYSIFDTCYDFSKYDTVRIPKVGVTFKGGVEMD 413
Query: 316 VKLS----PSNLFRNISDEIMCSAFRGGNAN---IVYGRIMQINFLIGYDIEQAMVSFKP 368
+ +S P N + + C AF G + + ++G + Q + + YD + V F P
Sbjct: 414 IDVSGILYPVNGLKKV-----CLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGFAP 468
Query: 369 SRCT 372
C+
Sbjct: 469 GGCS 472
>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
Length = 464
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 108/352 (30%), Positives = 165/352 (46%), Gaps = 31/352 (8%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ ++IGTP V S+DTGSD +W QC PC C Q+ LFDP S+TY++ SC S
Sbjct: 129 YVITVTIGTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKDKLFDPAMSATYSAFSCGS 188
Query: 95 SQCAVV---TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
+QCA + + C + C Y YG G S ++G ++TL+ S+ + + FG
Sbjct: 189 AQCAQLGDEGNGCLKSQCQYIVKYGDG---SNTAGTYGSDTLSLTSSDAV----KSFQFG 241
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAG 211
C H+ + G++GLG SL+SQ + FSYCLP SS F + A
Sbjct: 242 CSHRAAGFV---GELDGLMGLGGDTESLVSQTAATYGKAFSYCLPPPSSSGGGFLTLGAA 298
Query: 212 AGVVS-----TPLI---IRDHYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLLPL 262
G S TP++ + Y + L+ I+V L +S +G VD+G + T LP
Sbjct: 299 GGASSSRYSHTPMVRFSVPTFYGVFLQGITVAGTMLNVPASVFSGASVVDSGTVITQLPP 358
Query: 263 EYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHF-RGADVKLS 319
+ L++ +KA P A P S C++ S + P VT+ F RGA + L
Sbjct: 359 TAYQALRTAFKKEMKAYP----SAAPVGSLDTCFDFSGFNTITVPTVTLTFSRGAAMDLD 414
Query: 320 PSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
S + +A G + G + Q F + +D+ + F+ C
Sbjct: 415 ISGILYAGCLAFTATAHDGDTG--ILGNVQQRTFEMLFDVGGRTIGFRSGAC 464
>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 116/364 (31%), Positives = 177/364 (48%), Gaps = 48/364 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP + DTGSD TWTQCEPC + C+KQ+ P +P S++Y +ISCSS
Sbjct: 71 YVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKT-CYKQKEPRLNPSTSTSYKNISCSS 129
Query: 95 SQCAVVTS------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
+ C +V S +CS C Y YG G+Y S G ATETLT +S++ N
Sbjct: 130 ALCKLVASGKKFSQSCSSSTCLYQVQYGDGSY---SIGFFATETLTLSSSN----VFKNF 182
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFG 206
+FGCG +N + G+ +L SQ + FSYCLP SSK ++ G
Sbjct: 183 LFGCGQQNNGLFGGAAGLLGLGRT---KLALPSQTAKTYKKLFSYCLPASSSSKGYLSLG 239
Query: 207 GIVAGAGVVSTPLII----RDHYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLLP 261
G V+ + V TPL Y L + +SVG ++L S+ + +D+G + T L
Sbjct: 240 GQVSKS-VKFTPLSADFDSTPFYGLDITGLSVGGRQLSIDESAFSAGTVIDSGTVITRLS 298
Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP--KFPEVTIHFRGA---D 315
+S L S N++ P + G+S CY+ S + P+V + F+G D
Sbjct: 299 PTAYSELSSAFQNLMTDYP-----STSGYSIFDTCYDFSKYDTVRIPKVGVTFKGGVEMD 353
Query: 316 VKLS----PSNLFRNISDEIMCSAFRGGNAN---IVYGRIMQINFLIGYDIEQAMVSFKP 368
+ +S P N + + C AF G + + ++G + Q + + YD + V F P
Sbjct: 354 IDVSGILYPVNGLKKV-----CLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGFAP 408
Query: 369 SRCT 372
C+
Sbjct: 409 GGCS 412
>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
gi|224033441|gb|ACN35796.1| unknown [Zea mays]
gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
Length = 456
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 98/267 (36%), Positives = 131/267 (49%), Gaps = 32/267 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YL+HL++GTPP + ++DTGSD WTQC PC DCF Q PL DP SSTY ++ C +
Sbjct: 86 YLVHLAVGTPPRPVALTLDTGSDLVWTQCAPC--RDCFDQGIPLLDPAASSTYAALPCGA 143
Query: 95 SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTF------NSTSGLPVEMPN 147
+C A+ ++C C Y + YG S + G +AT+ TF N LP
Sbjct: 144 PRCRALPFTSCGGRSCVYVYHYGD---KSVTVGKIATDRFTFGDNGRRNGDGSLPATR-R 199
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSKIN 204
+ FGCGH N S +TGI G G G SL SQ+ A FSYC D SS +
Sbjct: 200 LTFGCGHFNKG--VFQSNETGIAGFGRGRWSLPSQLN---ATSFSYCFTSMFDSKSSIVT 254
Query: 205 FGGIVAG-------AGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSSTGNIFVDT 253
GG A V +TPL Y+LSL+ ISVG RL + + +D+
Sbjct: 255 LGGAPAALYSHAHSGEVRTTPLFKNPSQPSLYFLSLKGISVGKTRLPVPETKFRSTIIDS 314
Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQP 280
G T LP E + +K+ + + P
Sbjct: 315 GASITTLPEEVYEAVKAEFAAQVGLPP 341
>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 536
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 116/376 (30%), Positives = 178/376 (47%), Gaps = 51/376 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + + +GTPP ++ +DTGSD +W QC+PC DCF+Q P ++P +SS+Y +ISC
Sbjct: 170 YFIDMFVGTPPKHVWLILDTGSDLSWIQCDPC--YDCFEQNGPHYNPNESSSYRNISCYD 227
Query: 95 SQCAVVTS-----NCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE--- 144
+C +V+S +C + C Y + Y G S ++G+ A ET T N T E
Sbjct: 228 PRCQLVSSPDPLQHCKTENQTCPYFYDYADG---SNTTGDFALETFTVNLTWPNGKEKFK 284
Query: 145 -MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD-----Q 198
+ +V+FGCGH N G+ G S SQ+ + FSYCL D
Sbjct: 285 HVVDVMFGCGHWNKGFFHGAGGLLGLGR---GPLSFPSQLQSIYGHSFSYCLTDLFSNTS 341
Query: 199 GSSKINFG---GIVAGAGVVSTPLIIRDH------YYLSLEAISVGNQRLE-------FV 242
SSK+ FG ++ + T L+ + YYL +++I VG + L+ +
Sbjct: 342 VSSKLIFGEDKELLNHHNLNFTKLLAGEETPDDTFYYLQIKSIVVGGEVLDIPEKTWHWS 401
Query: 243 SSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS-- 300
S G +D+G T P + +K IK Q + A F CYN+S
Sbjct: 402 SEGVGGTIIDSGSTLTFFPDSAYDVIKEAFEKKIKLQQI----AADDFIMSPCYNVSGAM 457
Query: 301 QPKFPEVTIHF-RGADVKLSPSNLFRNIS-DEIMCSA-FRGGNAN--IVYGRIMQINFLI 355
Q + P+ IHF GA N F DE++C A + N + + G ++Q NF I
Sbjct: 458 QVELPDYGIHFADGAVWNFPAENYFYQYEPDEVICLAILKTPNHSHLTIIGNLLQQNFHI 517
Query: 356 GYDIEQAMVSFKPSRC 371
YD++++ + + P RC
Sbjct: 518 LYDVKRSRLGYSPRRC 533
>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
Length = 435
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 119/406 (29%), Positives = 184/406 (45%), Gaps = 58/406 (14%)
Query: 2 QNSQKLPFYNDNETPKSPI----SIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSD 57
++S ++ F +D S+ +QA + + Y M++S+GTP + DTGSD
Sbjct: 49 RDSHRIAFLSDATAAGKATTTNSSVSFQALLENGVGGYNMNISVGTPLLTFPVVADTGSD 108
Query: 58 CTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSN---CSEGDCSYSFL 114
WTQC PC + CF+Q P F P SST++ + C+SS C + ++ C+ C Y++
Sbjct: 109 LIWTQCAPCTK--CFQQPAPPFQPASSSTFSKLPCTSSFCQFLPNSIRTCNATGCVYNYK 166
Query: 115 YGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGP 174
YG G +++G LATETL S P+V FGC +N + +GI GLG
Sbjct: 167 YGSG----YTAGYLATETLKVGDAS-----FPSVAFGCSTENGVG----NSTSGIAGLGR 213
Query: 175 GNSSLISQMGTSIAGKFSYCLPD---QGSSKINFG-------GIVAGAGVVSTPLIIRDH 224
G SLI Q+G G+FSYCL G+S I FG G V V+ P + +
Sbjct: 214 GALSLIPQLG---VGRFSYCLRSGSAAGASPILFGSLANLTDGNVQSTPFVNNPAVHPSY 270
Query: 225 YYLSLEAISVGNQRLEFVSSS--------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMI 276
YY++L I+VG L +S+ G VD+G T L + + +K +
Sbjct: 271 YYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAF--LS 328
Query: 277 KAQPVKGVGAEPGFSDVLCYNISSQP---KFPEVTIHFRGADVKLSPSNLFRNISDE--- 330
+ V V G LC+ + P + + F G P+ +D
Sbjct: 329 QTANVTTVNGTRGLD--LCFKSTGGGGGIAVPSLVLRFDGGAEYAVPTYFAGVETDSQGS 386
Query: 331 -----IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+M +G V G +MQ++ + YD++ + SF P+ C
Sbjct: 387 VTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFSPADC 432
>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
Length = 459
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 112/368 (30%), Positives = 171/368 (46%), Gaps = 48/368 (13%)
Query: 30 SVDDI-YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYN 88
SVD + Y++ + +GTP V +DTGSD +W QC PC C+ Q+ PLFDP +SSTY
Sbjct: 114 SVDSLEYVVTVGLGTPAVSQVLLIDTGSDLSWVQCAPCNSTTCYPQKDPLFDPSRSSTYA 173
Query: 89 SISCSSSQCAVVT-----SNCSEG-----DCSYSFLYGRGAYASFSSGNLATETLTFNST 138
I C++ C +T S+C+ G C Y+ YG G S ++G + ETLT
Sbjct: 174 PIPCNTDACRDLTRDGYGSDCTSGSGGGAQCGYAITYGDG---SQTTGVYSNETLTMAPG 230
Query: 139 SGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP-- 196
V + + FGCGH + K G++GLG SL+ Q + G FSYCLP
Sbjct: 231 ----VTVKDFHFGCGHDQDG---PNDKYDGLLGLGGAPESLVVQTSSVYGGAFSYCLPAA 283
Query: 197 -DQGSSKINFGGIVAGAGVVSTPLIIRDH--YYLSLEAISVGNQRLEFVSSS-TGNIFVD 252
DQ + +G V TP++ Y +++ I+VG + ++ S+ +G + +D
Sbjct: 284 NDQAGFLALGAPVNDASGFVFTPMVREQQTFYVVNMTGITVGGEPIDVPPSAFSGGMIID 343
Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIH 310
+G + T L ++ L++ + A P+ P CYN + S P V +
Sbjct: 344 SGTVVTELQHTAYAALQAAFRKAMAAYPLL-----PNGELDTCYNFTGHSNVTVPRVALT 398
Query: 311 FR-GADVKLSPSNLFRNISDEIM---CSAFRGG---NANIVYGRIMQINFLIGYDIEQAM 363
F GA V L ++ D I+ C AF+ N + G + Q + YD+
Sbjct: 399 FSGGATVDL-------DVPDGILLDNCLAFQEAGPDNQPGILGNVNQRTLEVLYDVGHGR 451
Query: 364 VSFKPSRC 371
V F C
Sbjct: 452 VGFGADAC 459
>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
gi|223949441|gb|ACN28804.1| unknown [Zea mays]
Length = 326
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 113/347 (32%), Positives = 165/347 (47%), Gaps = 50/347 (14%)
Query: 52 VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA-VVTSNC--SEGD 108
+DTGSD TW QC+PC DC++Q P+FDP S++Y ++SC S +C + T+ C + G
Sbjct: 3 LDTGSDVTWVQCQPC--ADCYQQSDPVFDPSLSASYAAVSCDSQRCRDLDTAACRNATGA 60
Query: 109 CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTG 168
C Y YG G+Y + G+ ATETLT ++ PV NV GCGH N +
Sbjct: 61 CLYEVAYGDGSY---TVGDFATETLTLGDST--PVG--NVAIGCGHDNEGLFVGAAGLLA 113
Query: 169 IIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFGGIVAGAGVVSTPLI----I 221
+ G S IS A FSYCL D+ S S + FG A AG V+ PL+
Sbjct: 114 LGGGPLSFPSQIS------ASTFSYCLVDRDSPAASTLQFGDGAAEAGTVTAPLVRSPRT 167
Query: 222 RDHYYLSLEAISVGNQRLEF--------VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMS 273
YY++L ISVG Q L +S +G + VD+G T L ++ L+
Sbjct: 168 STFYYVALSGISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQSAAYAALRDAF- 226
Query: 274 NMIKAQPVKGVGAEPGFSDV----LCYNISSQP--KFPEVTIHFRGAD-VKLSPSNLFRN 326
V+G + P S V CY++S + + P V++ F G ++L N
Sbjct: 227 -------VQGAPSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRFEGGGALRLPAKNYLIP 279
Query: 327 ISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ C AF NA + + G + Q + +D + V F P++C
Sbjct: 280 VDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTARGAVGFTPNKC 326
>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
Length = 525
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 116/386 (30%), Positives = 168/386 (43%), Gaps = 59/386 (15%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YLM + +GTPP +DTGSD W QC PC LDCF+Q P+FDP SS+Y +++C
Sbjct: 151 YLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPC--LDCFEQRGPVFDPAASSSYRNVTCGD 208
Query: 95 SQCAVV----------TSNCS---EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTS-G 140
+C V C E C Y + YG S ++G+LA E+ T N T+ G
Sbjct: 209 HRCGHVAPPPEPEASSPRTCRRPGEDPCPYYYWYGD---QSNTTGDLALESFTVNLTAPG 265
Query: 141 LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS 200
+ V+FGCGH+N + G+ S SQ+ FSYCL D GS
Sbjct: 266 ASRRVDGVVFGCGHRNRGLFHGAAGLLGLGRG---PLSFASQLRAVYGHTFSYCLVDHGS 322
Query: 201 ---SKINFGGIVAGAGVVSTPLI--------------IRDHYYLSLEAISVGNQRLEFVS 243
SK+ FG + + P + YY+ L+ + VG + L S
Sbjct: 323 DVGSKVVFGEDDDALALAAHPQLKYTAFAPASSSSSPADTFYYVKLKGVLVGGELLNISS 382
Query: 244 SS-------TGNIFVDTG-VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLC 295
+ +G +D+G L + Y + M M ++ P+ V P S C
Sbjct: 383 DTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRHAFMDRMSRSYPL--VPEFPVLSP--C 438
Query: 296 YNIS--SQPKFPEVTIHFR-GADVKLSPSNLFRNISDE---IMCSAFRG--GNANIVYGR 347
YN+S +P+ PE+++ F GA N F + + IMC A G + G
Sbjct: 439 YNVSGVERPEVPELSLLFADGAVWDFPAENYFIRLDPDGGSIMCLAVLGTPRTGMSIIGN 498
Query: 348 IMQINFLIGYDIEQAMVSFKPSRCTN 373
Q NF + YD++ + F P RC
Sbjct: 499 FQQQNFHVVYDLQNNRLGFAPRRCAE 524
>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
Length = 484
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 117/366 (31%), Positives = 165/366 (45%), Gaps = 46/366 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y M L +GTP +++ +DTGSD W QC PC C+ Q P+F+P KS T+ ++ C S
Sbjct: 136 YFMRLGVGTPATNMYMVLDTGSDVVWLQCSPCKV--CYNQSDPVFNPAKSKTFATVPCGS 193
Query: 95 SQCAVV--TSNC---SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
C + +S C C Y YG G SF+ G+ +TETLTF+ + +V
Sbjct: 194 RLCRRLDDSSECVSRRSKACLYQVSYGDG---SFTVGDFSTETLTFHGA-----RVDHVA 245
Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--------GSS 201
GCGH N + G+ S SQ GKFSYCL D+ S
Sbjct: 246 LGCGHDNEGLFVGAAGLLGLGRG---GLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPS 302
Query: 202 KINFGGIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS------TGN--I 249
I FG V TPL+ + YYL L ISVG R+ VS S TGN +
Sbjct: 303 TIVFGNGAVPKTAVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGV 362
Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEV 307
+D+G T L + L+ + A +K + F C+++S + K P V
Sbjct: 363 IIDSGTSVTRLTQSAYVALRDAFR--LGATRLKRAPSYSLFDT--CFDLSGMTTVKVPTV 418
Query: 308 TIHFRGADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVS 365
HF G +V L SN ++++ C AF G ++ + G I Q F + YD+ + V
Sbjct: 419 VFHFTGGEVSLPASNYLIPVNNQGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGSRVG 478
Query: 366 FKPSRC 371
F C
Sbjct: 479 FLSRAC 484
>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 119/369 (32%), Positives = 163/369 (44%), Gaps = 45/369 (12%)
Query: 31 VDDIYLMHLSIGTP---PVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTY 87
V+ YL+HLSIG P PV + ++DTGSD WTQCEPC E CF Q P FD S+T
Sbjct: 88 VNSEYLIHLSIGAPRSQPVVL--TLDTGSDVVWTQCEPCAE--CFTQPLPRFDTAASNTV 143
Query: 88 NSISCSSSQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNS-TSGLPVEM 145
S++CS C A C C+Y YG G S S G+ ++ TF+ G V +
Sbjct: 144 RSVACSDPLCNAHSEHGCFLHGCTYVSGYGDG---SLSFGHFLRDSFTFDDGKGGGKVTV 200
Query: 146 PNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSK 202
P++ FGCG N +TGI G G G SL SQ+ +FSYC + SS
Sbjct: 201 PDIGFGCGMYNAGRFL--QTETGIAGFGRGPLSLPSQLKVR---QFSYCFTTRFEAKSSP 255
Query: 203 INFGG-----IVAGAGVVSTPLII-------RDHYYLSLEAISVGNQRL---EFVSSSTG 247
+ GG A ++STP + HY LS + ++VG RL E + +G
Sbjct: 256 VFLGGAGDLKAHATGPILSTPFVRSLPPGTDNSHYVLSFKGVTVGKTRLPVPEIKADGSG 315
Query: 248 NIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ--PKFP 305
F+D+G T P LKS A PV E D +C++ + P
Sbjct: 316 ATFIDSGTDITTFPDAVFRQLKSAFIAQ-AALPVNKTADE----DDICFSWDGKKTAAMP 370
Query: 306 EVTIHFRGADVKLSPSNLF---RNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQA 362
++ H GAD L N R + + G + G Q N I YD+
Sbjct: 371 KLVFHLEGADWDLPRENYVTEDRESGQVCVAVSTSGQMDRTLIGNFQQQNTHIVYDLAAG 430
Query: 363 MVSFKPSRC 371
+ P++C
Sbjct: 431 KLLLVPAQC 439
>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 484
Score = 138 bits (347), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 109/356 (30%), Positives = 169/356 (47%), Gaps = 35/356 (9%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP D+ DTGSD +W QC PC DC++Q+ PLFDP +SSTY+++ C+S
Sbjct: 146 YVVSMGLGTPARDMTVVFDTGSDLSWVQCTPCS--DCYEQKDPLFDPARSSTYSAVPCAS 203
Query: 95 SQCAVVTSN-CS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
+C + S CS + C Y +YG S + G LA +TLT + L P +FGC
Sbjct: 204 PECQGLDSRSCSRDKKCRYEVVYGD---QSQTDGALARDTLTLTQSDVL----PGFVFGC 256
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFGGIVA 210
G ++ + G++GLG SL SQ + FSYCLP S+ ++ GG A
Sbjct: 257 GEQDTG---LFGRADGLVGLGREKVSLSSQAASKYGAGFSYCLPSSPSAAGYLSLGG-PA 312
Query: 211 GAGVVSTPLIIRDH----YYLSLEAISVGNQ--RLEFVSSSTGNIFVDTGVLRTLLPLEY 264
A T + R YY+ L + V + R+ + S +D+G + T LP
Sbjct: 313 PANARFTAMETRHDSPSFYYVRLVGVKVAGRTVRVSPIVFSAAGTVIDSGTVITRLPPRV 372
Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP--KFPEVTIHFR-GADVKLSP 320
++ L+S + ++ G P S + CY+ + + P V + F GA V L
Sbjct: 373 YAALRSAFA---RSMGRYGYKRAPALSILDTCYDFTGHTTVRIPSVALVFAGGAAVGLDF 429
Query: 321 SNLFRNISDEIMCSAFR----GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
S + C AF G +A I+ G Q + YD+ + + F + C+
Sbjct: 430 SGVLYVAKVSQACLAFAPNGDGADAGII-GNTQQKTLAVVYDVARQKIGFGANGCS 484
>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 503
Score = 138 bits (347), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 115/356 (32%), Positives = 174/356 (48%), Gaps = 34/356 (9%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTPP DTGSD TW QC PC + C+KQ+ LFDP KSSTY ++SC+
Sbjct: 163 YVVPIGLGTPPSRFTVVFDTGSDTTWVQCRPC-VVSCYKQKDRLFDPAKSSTYANVSCAD 221
Query: 95 SQCA-VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
CA + S C+ G C Y YG G+Y + G A +TL + + FGCG
Sbjct: 222 PACADLDASGCNAGHCLYGIQYGDGSY---TVGFFAKDTLAVAQDA-----IKGFKFGCG 273
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INF---GGI 208
KN + G++GLG G +S+ Q G FSYCLP ++ + F
Sbjct: 274 EKNRG---LFGQTAGLLGLGRGPTSITVQAYEKYGGSFSYCLPASSAATGYLEFGPLSPS 330
Query: 209 VAGAGVVSTPLIIRD---HYYLSLEAISVGNQRL----EFVSSSTGNIFVDTGVLRTLLP 261
+G+ +TP++ YY+ L I VG ++L E V S++G + VD+G + T LP
Sbjct: 331 SSGSNAKTTPMLTDKGPTFYYVGLTGIRVGGKQLGAIPESVFSNSGTL-VDSGTVITRLP 389
Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFR-GADVKL 318
++ L S + + A K A CY+ + SQ P V++ F+ GA + L
Sbjct: 390 DTAYAALSSAFAAAMAASGYKKAAAYSILD--TCYDFTGLSQVSLPTVSLVFQGGACLDL 447
Query: 319 SPSNLFRNISDEIMCSAF--RGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
S + IS +C F G + ++ + G Q + + YD+ + +V F P C
Sbjct: 448 DASGIVYAISQSQVCLGFASNGDDESVGIVGNTQQRTYGVLYDVSKKVVGFAPGAC 503
>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 494
Score = 137 bits (346), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 111/370 (30%), Positives = 164/370 (44%), Gaps = 50/370 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +GTP +DTGSD W QC PC C+ Q +FDP++S +Y ++ CS+
Sbjct: 142 YFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRR--CYDQSGQVFDPRRSRSYGAVGCSA 199
Query: 95 SQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
C + S + C Y YG G S ++G+ ATETLTF + + + G
Sbjct: 200 PLCRRLDSGGCDLRRKACLYQVAYGDG---SVTAGDFATETLTFAGGA----RVARIALG 252
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--------I 203
CGH N + + G+ G+ S +Q+ FSYCL D+ SS +
Sbjct: 253 CGHDNEGLFVAAAGLLGLG---RGSLSFPAQISRRYGRSFSYCLVDRTSSANPASHSSTV 309
Query: 204 NFGGIVAGAGVVS--TPLI----IRDHYYLSLEAISVGNQRLEFV---------SSSTGN 248
FG G+ V + TP++ + YY+ L ISVG R+ V SS G
Sbjct: 310 TFGSGAVGSTVAASFTPMVKNPRMETFYYVQLVGISVGGARVSGVADSDLRLDPSSGRGG 369
Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQP--KF 304
+ VD+G T L +S L+ G+ PG + CY++S + K
Sbjct: 370 VIVDSGTSVTRLARPAYSALRDAFRAA-----AAGLRLSPGGFSLFDTCYDLSGRKVVKV 424
Query: 305 PEVTIHFR-GADVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQ 361
P V++HF GA+ L P N + S C AF G + + + G I Q F + +D +
Sbjct: 425 PTVSMHFAGGAEAALPPENYLIPVDSKGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDG 484
Query: 362 AMVSFKPSRC 371
V F P C
Sbjct: 485 QRVGFVPKGC 494
>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
Length = 483
Score = 137 bits (346), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 112/368 (30%), Positives = 174/368 (47%), Gaps = 45/368 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + L +GTP +F VDTGSD W QC+PC C+KQ P+FDP+ SS++ I C S
Sbjct: 129 YFVRLGVGTPARSLFMVVDTGSDLPWLQCQPCKS--CYKQADPIFDPRNSSSFQRIPCLS 186
Query: 95 SQC-AVVTSNCS-----EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
C A+ +CS CSY YG G SFS G+ +++ T + S + +V
Sbjct: 187 PLCKALEIHSCSGSRGATSRCSYQVAYGDG---SFSVGDFSSDLFTLGTGS----KAMSV 239
Query: 149 IFGCGHKN--LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD------QGS 200
FGCG N L + + G L + S +S A FSYCL D + S
Sbjct: 240 AFGCGFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDRSNPMTRSS 299
Query: 201 SKINFGGIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQR-------LEFVSSSTGNI 249
S + FG + +PL+ + YY ++ +SVG + L+ S +G +
Sbjct: 300 SSLIFGAAAIPSTAALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQLSQSGSGGV 359
Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP--KFPE 306
+D+G T P ++ ++ N P + P +S CYN S + P
Sbjct: 360 IIDSGTSVTRFPTSVYATIRDAFRNATTNLP-----SAPRYSLFDTCYNFSGKASVDVPA 414
Query: 307 VTIHFR-GADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAM 363
+ +HF GAD++L P+N I+ C AF + + + G I Q +F IG+D++++
Sbjct: 415 LVLHFENGADLQLPPTNYLIPINTAGSFCLAFAPTSMELGIIGNIQQQSFRIGFDLQKSH 474
Query: 364 VSFKPSRC 371
++F P +C
Sbjct: 475 LAFAPQQC 482
>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 137 bits (346), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 113/362 (31%), Positives = 170/362 (46%), Gaps = 51/362 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + IG P +++ +DTGSD W QC PC DC+ Q P+F+P SS+Y +SC +
Sbjct: 151 YFTRVGIGNPAREVYMVLDTGSDVNWLQCTPCA--DCYHQTEPIFEPSSSSSYEPLSCDT 208
Query: 95 SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
QC A+ S C C Y YG G+Y + G+ ATETLT ST + NV GCG
Sbjct: 209 PQCNALEVSECRNATCLYEVSYGDGSY---TVGDFATETLTIGST-----LVQNVAVGCG 260
Query: 154 HKN---LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFGG 207
H N G + P SQ+ T+ FSYCL D+ S S + FG
Sbjct: 261 HSNEGLFVGAAGLLGLGGGLLALP------SQLNTT---SFSYCLVDRDSDSASTVEFGT 311
Query: 208 IVAGAGVVSTPLIIRDH-----YYLSLEAISVGNQRLEFVSSS-------TGNIFVDTGV 255
+ VV+ PL +R+H YYL L ISVG + L+ SS +G I +D+G
Sbjct: 312 SLPPDAVVA-PL-LRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGT 369
Query: 256 LRTLLPLEYHSNLK-SVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFR 312
T L +++L+ S + + GV CYN+S++ + P V HF
Sbjct: 370 AVTRLQTGIYNSLRDSFLKGTSDLEKAAGVAMFD-----TCYNLSAKTTIEVPTVAFHFP 424
Query: 313 GADVKLSPSNLFRNISDEI--MCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPS 369
G + P+ + D + C AF +++ + G + Q + +D+ +++ F +
Sbjct: 425 GGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSSN 484
Query: 370 RC 371
+C
Sbjct: 485 KC 486
>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
Length = 359
Score = 137 bits (346), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 116/369 (31%), Positives = 178/369 (48%), Gaps = 51/369 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+M LSIGTPP I +DTGSD W +C+ C D +F SS+Y + C+S
Sbjct: 5 YMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKLPCNS 64
Query: 95 SQCAVVTS-----NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE---MP 146
+ C+ ++S C E C Y + YG G S +SG++ ++ ++F S
Sbjct: 65 THCSGMSSAGIGPRCEE-TCKYKYEYGDG---SRTSGDVGSDRISFRSHGAGEDHRSFFD 120
Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----PDQGSS 201
+FGC K L + ++ G+IGLG + SLI Q+G + KFSYCL P S
Sbjct: 121 GFLFGCARK-LKGDWNFTQ--GLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKS 177
Query: 202 KINFGGIVA--GAGVVSTPLIIRDH-----YYLSLEAISVGNQRLEFVSSSTGN------ 248
+ G A G VVSTP++ DH YY+ L++I++G + +G+
Sbjct: 178 FLFLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITIGGVPVVVYDKESGHNTSVGP 237
Query: 249 -----IFVDTGVLRTLL-PLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP 302
+D+G TLL P Y + KS+ +I + +G G LC+N S
Sbjct: 238 FLANKTVIDSGTTYTLLTPPVYEAMRKSIEEQVI----LPTLGNSAGLD--LCFNSSGDT 291
Query: 303 K--FPEVTIHFRG-ADVKLSPSNLFRNISDEIMCSAF--RGGNANIVYGRIMQINFLIGY 357
FP VT +F + L N+F+ S +++C + GG+ +I+ G + Q NF I Y
Sbjct: 292 SYGFPSVTFYFANQVQLVLPFENIFQVTSRDVVCLSMDSSGGDLSII-GNMQQQNFHILY 350
Query: 358 DIEQAMVSF 366
D+ + +SF
Sbjct: 351 DLVASQISF 359
>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 473
Score = 137 bits (346), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 115/359 (32%), Positives = 169/359 (47%), Gaps = 38/359 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP + DTGSD TWTQC+PC C+ Q+ P+F P +S+TY++ISCSS
Sbjct: 131 YIVSVGLGTPKKYLSLIFDTGSDLTWTQCQPCARY-CYNQKDPVFVPSQSTTYSNISCSS 189
Query: 95 SQCAVVTS------NCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
C+ + S CS C Y YG SFS G A ETLT ST + N
Sbjct: 190 PDCSQLESGTGNQPGCSAARACIYGIQYGD---QSFSVGYFAKETLTLTSTD----VIEN 242
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INF 205
+FGCG N S + G+IGLG S++ Q FSYCLP SS + F
Sbjct: 243 FLFGCGQNNRGLFGSAA---GLIGLGQDKISIVKQTAQKYGQVFSYCLPKTSSSTGYLTF 299
Query: 206 GGIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTL 259
GG G + TP+ + + Y + + + VG ++ SS ST +D+G + T
Sbjct: 300 GGGGGGGALKYTPITKAHGVANFYGVDIVGMKVGGTQIPISSSVFSTSGAIIDSGTVITR 359
Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFRGA-D 315
LP + +S LKS + P P S + CY++S S + P+V F+G +
Sbjct: 360 LPPDAYSALKSAFEKGMAKYP-----KAPELSILDTCYDLSKYSTIQIPKVGFVFKGGEE 414
Query: 316 VKLSPSNLFRNISDEIMCSAFRGG---NANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ L + S +C AF G + + G + Q + YD+ + F + C
Sbjct: 415 LDLDGIGIMYGASTSQVCLAFAGNQDPSTVAIIGNVQQKTLQVVYDVGGGKIGFGYNGC 473
>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
Length = 446
Score = 137 bits (346), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 106/374 (28%), Positives = 166/374 (44%), Gaps = 48/374 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +GTP +DTGSD W QC PC C+ Q +FDP++SSTY + CSS
Sbjct: 86 YFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRR--CYAQRGQVFDPRRSSTYRRVPCSS 143
Query: 95 SQCAVV------TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
QC + + + G C Y YG G S S+G LAT+ L F + + + NV
Sbjct: 144 PQCRALRFPGCDSGGAAGGGCRYMVAYGDG---SSSTGELATDKLAFANDT----YVNNV 196
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ-----GSSKI 203
GCG N S G++G+ G S+ +Q+ + F YCL D+ SS +
Sbjct: 197 TLGCGRDNEGLFDS---AAGLLGVARGKISISTQVAPAYGSVFEYCLGDRTSRSTRSSYL 253
Query: 204 NFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSST---------GNIF 250
FG T L+ YY+ + SVG +R+ S+++ G +
Sbjct: 254 VFGRTPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLALDTATGRGGVV 313
Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVT 308
VD+G + + ++ L+ +A ++ + E D CY++ +P P +
Sbjct: 314 VDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDA-CYDLRGRPAASAPLIV 372
Query: 309 IHFR-GADVKLSPSNLF-------RNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDI 359
+HF GAD+ L P N F R + C F + + V G + Q F + +D+
Sbjct: 373 LHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDGLSVIGNVQQQGFRVVFDV 432
Query: 360 EQAMVSFKPSRCTN 373
E+ + F P CT+
Sbjct: 433 EKERIGFAPKGCTS 446
>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 514
Score = 137 bits (345), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 124/400 (31%), Positives = 176/400 (44%), Gaps = 51/400 (12%)
Query: 13 NETPKSPISIIYQAEIISVDDI----YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPE 68
N TP+ ++ A + S + YL+ L +GTPP +DTGSD W QC PC
Sbjct: 126 NSTPRRALAERIVATVESGVAVGSGEYLVDLYVGTPPRRFQMIMDTGSDLNWLQCAPC-- 183
Query: 69 LDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVT-----SNCSE---GDCSYSFLYGRGAY 120
LDCF+Q P+FDP S +Y +++C +C +V C C Y + YG
Sbjct: 184 LDCFEQRGPVFDPAASLSYRNVTCGDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGD--- 240
Query: 121 ASFSSGNLATETLTFNSTS-GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSL 179
S ++G+LA E T N T+ G + +V+FGCGH N + G+ S
Sbjct: 241 QSNTTGDLALEAFTVNLTAPGASRRVDDVVFGCGHSNRGLFHGAAGLLGLGRG---ALSF 297
Query: 180 ISQMGTSIAGKFSYCLPDQGS---SKINFGGIVAGAG---------VVSTPLIIRDHYYL 227
SQ+ FSYCL D GS SKI FG A G S YY+
Sbjct: 298 ASQLRAVYGHAFSYCLVDHGSSVGSKIVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYV 357
Query: 228 SLEAISVGNQRLEFVSSS-------TGNIFVDTG-VLRTLLPLEYHSNLKSVMSNMIKAQ 279
L+ + VG ++L S+ +G +D+G L Y ++ + M KA
Sbjct: 358 QLKGVLVGGEKLNISPSTWDVGKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAY 417
Query: 280 PVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFR-GADVKLSPSNLFRNIS-DEIMCSA 335
P+ V P S CYN+S + + PE ++ F GA N F + D IMC A
Sbjct: 418 PL--VADFPVLSP--CYNVSGVERVEVPEFSLLFADGAVWDFPAENYFVRLDPDGIMCLA 473
Query: 336 FRG--GNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
G +A + G Q NF + YD++ + F P RC
Sbjct: 474 VLGTPRSAMSIIGNFQQQNFHVLYDLQNNRLGFAPRRCAE 513
>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 479
Score = 137 bits (345), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 110/358 (30%), Positives = 164/358 (45%), Gaps = 43/358 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + IG P ++ +DTGSD W QC PC DC+ Q P+F+P S++Y+ +SC +
Sbjct: 144 YFSRVGIGKPSSPVYMVLDTGSDVNWIQCAPCA--DCYHQADPIFEPASSTSYSPLSCDT 201
Query: 95 SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
QC ++ S C C Y YG G+Y + G+ TET+T S S + NV GCG
Sbjct: 202 KQCQSLDVSECRNNTCLYEVSYGDGSY---TVGDFVTETITLGSAS-----VDNVAIGCG 253
Query: 154 HKN--LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFGGI 208
H N L + G L S SQ+ A FSYCL D+ S S + F
Sbjct: 254 HNNEGLFIGAAGLLGLGGGKL-----SFPSQIN---ASSFSYCLVDRDSDSASTLEFNSA 305
Query: 209 VAGAGVVSTPLIIRD---HYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRT 258
+ + + L R+ YY+ + +SVG + L E S G I +D+G T
Sbjct: 306 LLPHAITAPLLRNRELDTFYYVGMTGLSVGGELLSIPESMFEMDESGNGGIIIDSGTAVT 365
Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFRGADV 316
L ++ L+ K PV +E D CY++S + + P VT H G V
Sbjct: 366 RLQTAAYNALRDAFVKGTKDLPVT---SEVALFDT-CYDLSRKTSVEVPTVTFHLAGGKV 421
Query: 317 KLSPSN--LFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
P+ L SD C AF ++ + + G + Q +G+D+ ++V F+P +C
Sbjct: 422 LPLPATNYLIPVDSDGTFCFAFAPTSSALSIIGNVQQQGTRVGFDLANSLVGFEPRQC 479
>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
Length = 514
Score = 137 bits (345), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 124/400 (31%), Positives = 176/400 (44%), Gaps = 51/400 (12%)
Query: 13 NETPKSPISIIYQAEIISVDDI----YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPE 68
N TP+ ++ A + S + YL+ L +GTPP +DTGSD W QC PC
Sbjct: 126 NSTPRRALAERIVATVESGVAVGSGEYLVDLYVGTPPRRFQMIMDTGSDLNWLQCAPC-- 183
Query: 69 LDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVT-----SNCSE---GDCSYSFLYGRGAY 120
LDCF+Q P+FDP S +Y +++C +C +V C C Y + YG
Sbjct: 184 LDCFEQRGPVFDPATSLSYRNVTCGDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGD--- 240
Query: 121 ASFSSGNLATETLTFNSTS-GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSL 179
S ++G+LA E T N T+ G + +V+FGCGH N + G+ S
Sbjct: 241 QSNTTGDLALEAFTVNLTAPGASRRVDDVVFGCGHSNRGLFHGAAGLLGLGRG---ALSF 297
Query: 180 ISQMGTSIAGKFSYCLPDQGS---SKINFGGIVAGAG---------VVSTPLIIRDHYYL 227
SQ+ FSYCL D GS SKI FG A G S YY+
Sbjct: 298 ASQLRAVYGHAFSYCLVDHGSSVGSKIVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYV 357
Query: 228 SLEAISVGNQRLEFVSSS-------TGNIFVDTG-VLRTLLPLEYHSNLKSVMSNMIKAQ 279
L+ + VG ++L S+ +G +D+G L Y ++ + M KA
Sbjct: 358 QLKGVLVGGEKLNISPSTWDVGKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAY 417
Query: 280 PVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFR-GADVKLSPSNLFRNIS-DEIMCSA 335
P+ V P S CYN+S + + PE ++ F GA N F + D IMC A
Sbjct: 418 PL--VADFPVLSP--CYNVSGVERVEVPEFSLLFADGAVWDFPAENYFVRLDPDGIMCLA 473
Query: 336 FRG--GNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
G +A + G Q NF + YD++ + F P RC
Sbjct: 474 VLGTPRSAMSIIGNFQQQNFHVLYDLQNNRLGFAPRRCAE 513
>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 482
Score = 137 bits (345), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 122/368 (33%), Positives = 170/368 (46%), Gaps = 40/368 (10%)
Query: 26 AEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSS 85
+I D Y++ + +GTP D+ DTGS TWTQCEPC C+KQ+ P+FDP KSS
Sbjct: 132 GRLIGSADYYVV-VGLGTPKRDLSLIFDTGSYLTWTQCEPCAG-SCYKQQDPIFDPSKSS 189
Query: 86 TYNSISCSSSQCAVVTS-NCS---EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGL 141
+Y +I C+SS C S CS + C Y YG S S G L+ E LT +T
Sbjct: 190 SYTNIKCTSSLCTQFRSAGCSSSTDASCIYDVKYGDN---SISRGFLSQERLTITATD-- 244
Query: 142 PVEMPNVIFGCGHKN--LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG 199
+ + +FGCG N L T+ G++GL S + Q + FSYCLP
Sbjct: 245 --IVHDFLFGCGQDNEGLFRGTA-----GLMGLSRHPISFVQQTSSIYNKIFSYCLPSTP 297
Query: 200 SS--KINFGGIVA-GAGVVSTPLII----RDHYYLSLEAISVGNQRLEFVSSST---GNI 249
SS + FG A A + TP Y L + ISVG +L VSSST G
Sbjct: 298 SSLGHLTFGASAATNANLKYTPFSTISGENSFYGLDIVGISVGGTKLPAVSSSTFSAGGS 357
Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKF--PEV 307
+D+G + T LP ++ L+S + PV G D CY+ S + P +
Sbjct: 358 IIDSGTVITRLPPTAYAALRSAFRQFMMKYPV-AYGTR--LLDT-CYDFSGYKEISVPRI 413
Query: 308 TIHFRGA-DVKLSPSNLFRNISDEIMCSAFRG-GNAN--IVYGRIMQINFLIGYDIEQAM 363
F G V+L + S + +C AF GN N ++G + Q + YD+E
Sbjct: 414 DFEFAGGVKVELPLVGILYGESAQQLCLAFAANGNGNDITIFGNVQQKTLEVVYDVEGGR 473
Query: 364 VSFKPSRC 371
+ F + C
Sbjct: 474 IGFGAAGC 481
>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
Length = 470
Score = 137 bits (345), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 110/355 (30%), Positives = 169/355 (47%), Gaps = 39/355 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ S+GTP + VDTGSD +W QC+PC C++Q+ PLFDP +SS+Y ++ C
Sbjct: 137 YVVTASLGTPGMAQTLEVDTGSDLSWVQCKPCAAPSCYRQKDPLFDPAQSSSYAAVPCGR 196
Query: 95 SQCA---VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
S CA + S CS C Y YG G S ++G +++TLT + + + +FG
Sbjct: 197 SACAGLGIYASACSAAQCGYVVSYGDG---SNTTGVYSSDTLTLAANA----TVQGFLFG 249
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFGGIV 209
CGH S + G++G G SL+ Q + G FSYCLP + S+ + GG
Sbjct: 250 CGHAQ--SGGLFTGIDGLLGFGREQPSLVQQTAGAYGGVFSYCLPTKSSTTGYLTLGGPS 307
Query: 210 AGAGVVSTPLIIRD-----HYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLLPLE 263
A ST ++ +Y + L ISVG Q L +S+ VDTG + T LP
Sbjct: 308 GVAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQPLSVPASAFAAGTVVDTGTVITRLPPA 367
Query: 264 YHSNLKSVMSNMIKAQPVKGVGAEP-GFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSN 322
++ L+S + + + P A P G D CY+ + T++ + S
Sbjct: 368 AYAALRSAFRSGMASYP----SAPPIGILDT-CYSFAGYG-----TVNLTSVALTFSSGA 417
Query: 323 LFRNISDEIM---CSAFRGGNAN---IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+D IM C AF ++ + G + Q +F + I+ + V F+PS C
Sbjct: 418 TMTLGADGIMSFGCLAFASSGSDGSMAILGNVQQRSFEV--RIDGSSVGFRPSSC 470
>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
Length = 465
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 109/362 (30%), Positives = 163/362 (45%), Gaps = 46/362 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ L GTP V +DTGSD +W QC PC +C+ Q+ PLFDP KSSTY I+C +
Sbjct: 125 YMVTLGFGTPSVPQVLLMDTGSDVSWVQCAPCNSTECYPQKDPLFDPSKSSTYAPIACGA 184
Query: 95 SQCAVVTSN----CSEG--DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
C + + C+ G C Y YG G S + G + ET+TF G+ V+ +
Sbjct: 185 DACNKLGDHYRNGCTSGGTQCGYRVEYGDG---SSTRGVYSNETITF--APGITVK--DF 237
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFG-G 207
FGCGH K G++GLG SL+ Q + G FSYCLP S G
Sbjct: 238 HFGCGHDQRG---PSDKFDGLLGLGGAPESLVVQTASVYGGAFSYCLPALNSEAGFLALG 294
Query: 208 IVAGAGVVSTPLIIRDHYYLSLEA---------ISVGNQRLEFVSSS-TGNIFVDTGVLR 257
+ A ++ + ++L ++A ISVG + L+ S+ G + +D+G +
Sbjct: 295 VRPSAATNTSAFVFTPMWHLPMDATSYMVNMTGISVGGKPLDIPRSAFRGGMLIDSGTIV 354
Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFRGA- 314
T LP ++ L + + A P+ A F CYN + S P V + F G
Sbjct: 355 TELPETAYNALNAALRKAFAAYPMV---ASEDFD--TCYNFTGYSNVTVPRVALTFSGGA 409
Query: 315 --DVKLSPSNLFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPS 369
D+ + L ++ C AFR ++ + G + Q + YD V F+
Sbjct: 410 TIDLDVPNGILVKD------CLAFRESGPDVGLGIIGNVNQRTLEVLYDAGHGKVGFRAG 463
Query: 370 RC 371
C
Sbjct: 464 AC 465
>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 116/365 (31%), Positives = 174/365 (47%), Gaps = 37/365 (10%)
Query: 26 AEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSS 85
A I+ Y++ + +GTP D S DTGSD TWTQCEPC CF Q P FDP S+
Sbjct: 131 ASIVPTGGAYVVTVGLGTPKKDFTLSFDTGSDLTWTQCEPCLG-GCFPQNQPKFDPTTST 189
Query: 86 TYNSISCSSSQCAVVTS------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTS 139
+Y ++SCSS C ++ +C C Y YG G ++ G LATETL S+
Sbjct: 190 SYKNVSCSSEFCKLIAEGNYPAQDCISNTCLYGIQYGSG----YTIGFLATETLAIASSD 245
Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG 199
N +FGC + S + + TG++GLG +L SQ FSYCLP
Sbjct: 246 ----VFKNFLFGCSEE---SRGTFNGTTGLLGLGRSPIALPSQTTNKYKNLFSYCLPASP 298
Query: 200 SS--KINFGGIVAGAGVVSTPL--IIRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTGV 255
SS ++FG V+ A STP+ ++ Y L+ ISV + L ++ S +D+G
Sbjct: 299 SSTGHLSFGVEVSQAA-KSTPISPKLKQLYGLNTVGISVRGRELP-INGSISRTIIDSGT 356
Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS----QPKFPEVTIHF 311
T LP +S L S M+ + F CY+ S+ P ++I F
Sbjct: 357 TFTFLPSPTYSALGSAFREMMANYTL--TNGTSSFQP--CYDFSNIGNGTLTIPGISIFF 412
Query: 312 RGA-DVKLSPSNLFRNISD-EIMCSAF--RGGNANI-VYGRIMQINFLIGYDIEQAMVSF 366
G +V++ S + ++ + +C AF G +++ ++G Q + + YD+ + MV F
Sbjct: 413 EGGVEVEIDVSGIMIPVNGLKEVCLAFADTGSDSDFAIFGNYQQKTYEVIYDVAKGMVGF 472
Query: 367 KPSRC 371
P C
Sbjct: 473 APKGC 477
>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 486
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 113/358 (31%), Positives = 174/358 (48%), Gaps = 43/358 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + IG PP ++ +DTGSD +W QC PC E C++Q P+F+P S+++ S+SC +
Sbjct: 151 YFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAE--CYEQTDPIFEPTSSASFTSLSCET 208
Query: 95 SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
QC ++ S C G C Y YG G+Y + G+ TET+T STS + N+ GCG
Sbjct: 209 EQCKSLDVSECRNGTCLYEVSYGDGSY---TVGDFVTETVTLGSTS-----LGNIAIGCG 260
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFGGIVA 210
H N + G+ G + S SQ+ A FSYCL D+ S S ++F +
Sbjct: 261 HNNEGLFIGAAGLLGLGGG---SLSFPSQLN---ASSFSYCLVDRDSDSTSTLDFNSPIT 314
Query: 211 GAGVVSTPLI----IRDHYYLSLEAISVGNQRL-----EFVSSSTGN--IFVDTGVLRTL 259
V+ PL + +YL L +SVG L F S GN I VD+G T
Sbjct: 315 -PDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGIIVDSGTAVTR 373
Query: 260 LPLEYHSNLK-SVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVTIHF-RGAD 315
L ++ L+ + + + Q +GV D CY++SS+ + P V+ HF G +
Sbjct: 374 LQTTVYNVLRDAFVKSTHDLQTARGV----ALFDT-CYDLSSKSRVEVPTVSFHFANGNE 428
Query: 316 VKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ L N + E C AF ++ + + G Q +G+D+ ++V F P++C
Sbjct: 429 LPLPAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVGFDLANSLVGFSPNKC 486
>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
Length = 494
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 115/376 (30%), Positives = 169/376 (44%), Gaps = 54/376 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+ +++GTP V ++DT SD TW QC+PC C+ Q P+FDP+ S++Y ++ +
Sbjct: 134 YMAKIAVGTPAVQALLALDTASDLTWLQCQPCRR--CYPQSGPVFDPRHSTSYGEMNYDA 191
Query: 95 SQCAVV----TSNCSEGDCSYSFLYGRG-AYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
C + + G C Y+ YG G S S G+L ETLTF V +
Sbjct: 192 PDCQALGRSGGGDAKRGTCIYTVQYGDGHGSTSTSVGDLVEETLTFAGG----VRQAYLS 247
Query: 150 FGCGHKN---LASPTSDSKQTGIIGLGPGNSSLISQMG-TSIAGKFSYCL------PDQG 199
GCGH N +P + GI+GLG G S+ Q+ FSYCL P
Sbjct: 248 IGCGHDNKGLFGAPAA-----GILGLGRGQISIPHQIAFLGYNASFSYCLVDFISGPGSP 302
Query: 200 SSKINFGGIVAGAGVVS-----TPLIIRDH----YYLSLEAISVGNQRLEFVS------- 243
SS + FG AGA S TP ++ + YY+ L +SVG R+ V+
Sbjct: 303 SSTLTFG---AGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERDLQLD 359
Query: 244 --SSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ 301
+ G + +D+G T L + + + G G D CY + +
Sbjct: 360 PYTGRGGVILDSGTTVTRLARPAYVAFRDAFRAAATSLGQVSTGGPSGLFDT-CYTVGGR 418
Query: 302 P--KFPEVTIHFRGA-DVKLSPSNLFRNI-SDEIMCSAFRG-GNANI-VYGRIMQINFLI 355
K P V++HF G +V L P N + S +C AF G G+ ++ V G I+Q F +
Sbjct: 419 AGVKVPAVSMHFAGGVEVSLQPKNYLIPVDSRGTVCFAFAGTGDRSVSVIGNILQQGFRV 478
Query: 356 GYDIEQAMVSFKPSRC 371
YD+ V F P+ C
Sbjct: 479 VYDLAGQRVGFAPNNC 494
>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
Length = 448
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 117/375 (31%), Positives = 178/375 (47%), Gaps = 68/375 (18%)
Query: 2 QNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWT 61
++ ++L Y K+P++ + Y+M SIG PP+ I+ VDTGSD W
Sbjct: 60 RSRRRLSVYTSGTGTKAPVTKSQKG------GKYIMQFSIGEPPLLIWAEVDTGSDLMWV 113
Query: 62 QCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA------VVTSNCSEGD--CSYSF 113
+C PC C PL+DP +S + + CSS C +++ CS+ C Y +
Sbjct: 114 KCSPCN--GCNPPPSPLYDPARSRSSGKLPCSSQLCQALGRGRIISDQCSDDPPLCGYHY 171
Query: 114 LYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQ----TGI 169
YG S + G L TET TF G NV FG S T D Q G+
Sbjct: 172 AYGHSGDHS-TQGVLGTETFTF----GDGYVANNVSFG------RSDTIDGSQFGGTAGL 220
Query: 170 IGLGPGNSSLISQMGTSIAGKFSYCLPDQGS--SKINFGGIVA----GAGVVSTPLII-- 221
+GLG G+ SL+SQ+G AG+F+YCL + S I FG + A V STPL+
Sbjct: 221 VGLGRGHLSLVSQLG---AGRFAYCLAADPNVYSTILFGSLAALDTSAGDVSSTPLVTNP 277
Query: 222 ---RD-HYYLSLEAISVGNQRLEFV-------SSSTGNIFVDTGVLRTLLPLEYHSNLKS 270
RD HYY++L+ ISVG RL S +G +F D+G + T L + ++
Sbjct: 278 KPDRDTHYYVNLQGISVGGSRLPIKDGTFAINSDGSGGVFFDSGAIDTSLKDAAYQVVRQ 337
Query: 271 VMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ---PKFPEVTIHF-RGADVKLSPSNLFRN 326
+++ I+ +G + G D C+ ++Q + P + +HF GAD+ L+ N +
Sbjct: 338 AITSEIQR-----LGYDAG--DDTCFVAANQQAVAQMPPLVLHFDDGADMSLNGRNYLKT 390
Query: 327 I----SDEIMCSAFR 337
S+ ++C A +
Sbjct: 391 STKGPSEVLVCMAIK 405
>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
Short=AtASPG2; Flags: Precursor
gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
thaliana]
gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 470
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 109/361 (30%), Positives = 160/361 (44%), Gaps = 45/361 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + + +G+PP D + +D+GSD W QC+PC C+KQ P+FDP KS +Y +SC S
Sbjct: 131 YFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKL--CYKQSDPVFDPAKSGSYTGVSCGS 188
Query: 95 SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
S C + S C G C Y +YG G+Y + G LA ETLTF T + NV GCG
Sbjct: 189 SVCDRIENSGCHSGGCRYEVMYGDGSY---TKGTLALETLTFAKTV-----VRNVAMGCG 240
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSS---KINFGGIVA 210
H+N + GI G + S + Q+ G F YCL +G+ + FG
Sbjct: 241 HRNRGMFIGAAGLLGIGGG---SMSFVGQLSGQTGGAFGYCLVSRGTDSTGSLVFGREAL 297
Query: 211 GAGVVSTPLIIRDH----YYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTL 259
G PL+ YY+ L+ + VG R+ + + G + +DTG T
Sbjct: 298 PVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTR 357
Query: 260 LP----LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHF-R 312
LP + + KS +N+ +A V CY++S + P V+ +F
Sbjct: 358 LPTAAYVAFRDGFKSQTANLPRASGVSIFDT--------CYDLSGFVSVRVPTVSFYFTE 409
Query: 313 GADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSR 370
G + L N + D C AF + + G I Q + +D V F P+
Sbjct: 410 GPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFGPNV 469
Query: 371 C 371
C
Sbjct: 470 C 470
>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
gi|224030447|gb|ACN34299.1| unknown [Zea mays]
gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
Length = 512
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 121/381 (31%), Positives = 169/381 (44%), Gaps = 57/381 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YLM + +GTPP +DTGSD W QC PC LDCF+Q P+FDP SS+Y +++C
Sbjct: 146 YLMDVYVGTPPRRFQMIMDTGSDLNWLQCAPC--LDCFEQRGPVFDPAASSSYRNLTCGD 203
Query: 95 SQCAVVTSNCS----------EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTS-GLPV 143
+C V + E C Y + YG S S+G+LA E+ T N T+ G
Sbjct: 204 PRCGHVAPPEAPAPRACRRPGEDPCPYYYWYGD---QSNSTGDLALESFTVNLTAPGASS 260
Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK-FSYCLPDQGS-- 200
+ V+FGCGH+N + G+ S SQ+ G FSYCL D GS
Sbjct: 261 RVDGVVFGCGHRNRGLFHGAAGLLGLGRG---PLSFASQLRAVYGGHTFSYCLVDHGSDV 317
Query: 201 -SKINFGGIVAGAGVVSTPLI-----------IRDHYYLSLEAISVGNQRLEFVSSS--- 245
SK+ FG A A + + P + YY+ L + VG + L S +
Sbjct: 318 ASKVVFGEDDALA-LAAHPRLKYTAFAPASSPADTFYYVRLTGVLVGGELLNISSDTWDA 376
Query: 246 ----TGNIFVDTG-VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNI 298
+G +D+G L + Y ++ + M + P P F VL CYN+
Sbjct: 377 SEGGSGGTIIDSGTTLSYFVEPAYQVIRRAFIDRMSGSYP-----PVPDFP-VLSPCYNV 430
Query: 299 S--SQPKFPEVTIHFR-GADVKLSPSNLF-RNISDEIMCSAFRG--GNANIVYGRIMQIN 352
S +P+ PE+++ F GA N F R D IMC A G + G Q N
Sbjct: 431 SGVERPEVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTGMSIIGNFQQQN 490
Query: 353 FLIGYDIEQAMVSFKPSRCTN 373
F + YD+ + F P RC
Sbjct: 491 FHVAYDLHNNRLGFAPRRCAE 511
>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 496
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 115/369 (31%), Positives = 171/369 (46%), Gaps = 48/369 (13%)
Query: 27 EIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSST 86
++ D +L+ ++ GTPP +DTGS TWTQC+PC + C K FDP S T
Sbjct: 154 KLFDEDGNFLVDVAFGTPPQKFTLILDTGSSITWTQCKPC--VRCLKASRRHFDPSASLT 211
Query: 87 YNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
Y+ SC S +Y+ YG S S GN +T+T + P
Sbjct: 212 YSLGSCIPSTVGN----------TYNMTYGD---KSTSVGNYGCDTMTLEHSD----VFP 254
Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG------- 199
FGCG N S + G++GLG G S +SQ + FSYCLP++
Sbjct: 255 KFQFGCGRNNEGDFGSGAD--GMLGLGQGQLSTVSQTASKFKKVFSYCLPEEDSIGSLLF 312
Query: 200 -------SSKINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSS---STGNI 249
SS + F +V G G ++ L +Y++ L ISVGN+RL SS S G I
Sbjct: 313 GEKATSQSSSLKFTSLVNGPG--TSGLEESGYYFVKLLDISVGNKRLNIPSSVFASPGTI 370
Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQPK--FP 305
+D+G + T LP +S LK+ + P+ + G D+L CYN+S + P
Sbjct: 371 -IDSGTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKG--DILDTCYNLSGRKDVLLP 427
Query: 306 EVTIHF-RGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMV 364
E+ +HF GADV+L+ + +C AF G + + G Q++ + YDI+ +
Sbjct: 428 EIVLHFGEGADVRLNGKRVIWGNDASRLCLAFAGNSELTIIGNRQQVSLTVLYDIQGGRI 487
Query: 365 SFKPSRCTN 373
F + C+
Sbjct: 488 GFGGNGCSK 496
>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
Length = 466
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 112/353 (31%), Positives = 171/353 (48%), Gaps = 33/353 (9%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + IG+P V S+DTGSD +W QC+PC + C + LFDP SSTY+ SCSS
Sbjct: 131 YVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQ--CHSEVDSLFDPSASSTYSPFSCSS 188
Query: 95 SQCAVVTSN-----CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
+ C ++ + CS C Y Y G S ++G +++TLT S + +
Sbjct: 189 AACVQLSQSQQGNGCSSSQCQYIVSYVDG---SSTTGTYSSDTLTLGSNA-----IKGFQ 240
Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ-GSSKINFGGI 208
FGC S + G++GLG SL+SQ + FSYCLP GSS G
Sbjct: 241 FGCSQSE--SGGFSDQTDGLMGLGGDAQSLVSQTAGTFGKAFSYCLPPTPGSSGFLTLGA 298
Query: 209 VAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPL 262
+ +G V TP++ I +Y + LEAI VG Q+L +S S G++ +D+G + T LP
Sbjct: 299 ASRSGFVKTPMLRSTQIPTYYGVLLEAIRVGGQQLNIPTSVFSAGSV-MDSGTVITRLPP 357
Query: 263 EYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFR-GADVKLS 319
+S L S +K P A+P C++ S Q P V + F GA V L
Sbjct: 358 TAYSALSSAFKAGMKKYPP----AQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVNLD 413
Query: 320 PSNLFRNISDEIMCSAFRGGNANIVY-GRIMQINFLIGYDIEQAMVSFKPSRC 371
+ + + + + A ++++ + G + Q F + YD+ V F+ C
Sbjct: 414 FNGIMLELDNWCLAFAANSDDSSLGFIGNVQQRTFEVLYDVGGGAVGFRAGAC 466
>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
Length = 401
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 110/313 (35%), Positives = 153/313 (48%), Gaps = 40/313 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YL+HL+IGTPP + ++DTGSD WTQC+PCP CF Q P FDP SST + SC S
Sbjct: 82 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPA--CFDQALPYFDPSTSSTLSLTSCDS 139
Query: 95 SQC-AVVTSNCSEGD------CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
+ C + ++C C Y++ YG S ++G L + TF G +P
Sbjct: 140 TLCQGLPVASCGSPKFWPNQTCVYTYSYGD---KSVTTGFLEVDKFTF---VGAGASVPG 193
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-------PDQGS 200
V FGCG N + S +TGI G G G SL SQ+ G FS+C P
Sbjct: 194 VAFGCGLFN--NGVFKSNETGIAGFGRGPLSLPSQLKV---GNFSHCFTAVNGLKPSTVL 248
Query: 201 SKINFGGIVAGAGVV-STPLIIR----DHYYLSLEAISVGNQRL-----EF-VSSSTGNI 249
+ +G G V STPLI YYLSL+ I+VG+ RL EF + + TG
Sbjct: 249 LDLPADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGTGGT 308
Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTI 309
+D+G T LP + ++ + +K V G +P F L + ++P P++ +
Sbjct: 309 IIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYF--CLSAPLRAKPYVPKLVL 366
Query: 310 HFRGADVKLSPSN 322
HF GA + L N
Sbjct: 367 HFEGATMDLPREN 379
>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 460
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 103/355 (29%), Positives = 167/355 (47%), Gaps = 33/355 (9%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + L +G+PP +DTGS +W QC+PC + C Q PLF+P S+TY + CSS
Sbjct: 120 YYLKLGLGSPPKYYTMILDTGSSLSWLQCKPC-VVYCHSQVDPLFEPSASNTYRPLYCSS 178
Query: 95 SQCAVVTSNC-------SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
S+C+++ + + G C Y+ YG AS+S G L+ + LT + LP +
Sbjct: 179 SECSLLKAATLNDPLCTASGVCVYTASYGD---ASYSMGYLSRDLLTLTPSQTLP----S 231
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGG 207
+GCG N K GI+GL S+++Q+ FSYCLP SS F
Sbjct: 232 FTYGCGQDN---EGLFGKAAGIVGLARDKLSMLAQLSPKYGYAFSYCLPTSTSSGGGFLS 288
Query: 208 I--VAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSSTG-NIFVDTGVLRTLL 260
I ++ + TP+I Y+L L AI+V + + ++ +D+G + T L
Sbjct: 289 IGKISPSSYKFTPMIRNSQNPSLYFLRLAAITVAGRPVGVAAAGYQVPTIIDSGTVVTRL 348
Query: 261 PLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCY--NISSQPKFPEVTIHFRG-ADV 316
P+ ++ L+ ++ + + P +S + C+ ++ S PE+ + F+G AD+
Sbjct: 349 PISIYAALREAFVKIMSRRYEQA----PAYSILDTCFKGSLKSMSGAPEIRMIFQGGADL 404
Query: 317 KLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
L N+ I C AF N + G Q + I YD+ + + F P C
Sbjct: 405 SLRAPNILIEADKGIACLAFASSNQIAIIGNHQQQTYNIAYDVSASKIGFAPGGC 459
>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 486
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 113/358 (31%), Positives = 173/358 (48%), Gaps = 43/358 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + IG PP ++ +DTGSD +W QC PC E C++Q P F+P S+++ S+SC +
Sbjct: 151 YFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAE--CYEQTDPXFEPTSSASFTSLSCET 208
Query: 95 SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
QC ++ S C G C Y YG G+Y + G+ TET+T STS + N+ GCG
Sbjct: 209 EQCKSLDVSECRNGTCLYEVSYGDGSY---TVGDFVTETVTLGSTS-----LGNIAIGCG 260
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFGGIVA 210
H N + G+ G + S SQ+ A FSYCL D+ S S ++F +
Sbjct: 261 HNNEGLFIGAAGLLGLGGG---SLSFPSQLN---ASSFSYCLVDRDSDSTSTLDFNSPIT 314
Query: 211 GAGVVSTPLI----IRDHYYLSLEAISVGNQRL-----EFVSSSTGN--IFVDTGVLRTL 259
V+ PL + +YL L +SVG L F S GN I VD+G T
Sbjct: 315 -PDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGIIVDSGTAVTR 373
Query: 260 LPLEYHSNLK-SVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVTIHF-RGAD 315
L ++ L+ + + + Q +GV D CY++SS+ + P V+ HF G +
Sbjct: 374 LQTTVYNVLRDAFVKSTHDLQTARGV----ALFDT-CYDLSSKSRVEVPTVSFHFANGNE 428
Query: 316 VKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ L N + E C AF ++ + + G Q +G+D+ ++V F P++C
Sbjct: 429 LPLPAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVGFDLANSLVGFSPNKC 486
>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 506
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 118/369 (31%), Positives = 170/369 (46%), Gaps = 42/369 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YL+ + +GTPP +DTGSD W QC PC LDCF+Q P+FDP S +Y +++C
Sbjct: 149 YLVDVYLGTPPRRFRMIMDTGSDLNWLQCAPC--LDCFEQSGPIFDPAASISYRNVTCGD 206
Query: 95 SQCAVVT-------SNCSE---GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
+C +V+ C C Y + YG S ++G+LA E T N T
Sbjct: 207 DRCRLVSPPAESAPRECRRPRSDPCPYYYWYGD---QSNTTGDLALEAFTVNLTQSGTRR 263
Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK-FSYCLPDQGS--- 200
+ V FGCGH+N + G+ S SQ+ G FSYCL + GS
Sbjct: 264 VDGVAFGCGHRNRGLFHGAAGLLGLGRG---PLSFASQLRGVYGGHAFSYCLVEHGSAAG 320
Query: 201 SKINFG---GIVAGAGVVST---PLIIRD-HYYLSLEAISVGNQRLEFVSS--STGNIFV 251
SKI FG ++A + T P D YYL L++I VG + + S S G +
Sbjct: 321 SKIIFGHDDALLAHPQLNYTAFAPTTDADTFYYLQLKSILVGGEAVNISSDTLSAGGTII 380
Query: 252 DTGVLRTLLPL-EYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVT 308
D+G + P Y + ++ + M + P+ + P S CYN+S K PE++
Sbjct: 381 DSGTTLSYFPEPAYQAIRQAFIDRMSPSYPL--ILGFPVLSP--CYNVSGAEKVEVPELS 436
Query: 309 IHFR-GADVKLSPSNLFRNISDE-IMCSAFRG--GNANIVYGRIMQINFLIGYDIEQAMV 364
+ F GA + N F + E IMC A G + + G Q NF + YD+E +
Sbjct: 437 LVFADGAAWEFPAENYFIRLEPEGIMCLAVLGTPRSGMSIIGNYQQQNFHVLYDLEHNRL 496
Query: 365 SFKPSRCTN 373
F P RC +
Sbjct: 497 GFAPRRCAD 505
>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
Length = 471
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 109/361 (30%), Positives = 159/361 (44%), Gaps = 45/361 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + + +G+PP D + +D+GSD W QC+PC C+KQ P+FDP KS +Y +SC S
Sbjct: 132 YFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKL--CYKQSDPVFDPAKSGSYTGVSCGS 189
Query: 95 SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
S C + S C G C Y +YG G+Y + G LA ETLTF T + NV GCG
Sbjct: 190 SVCDRIENSGCHSGGCRYEVMYGDGSY---TKGTLALETLTFAKTV-----VRNVAMGCG 241
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSS---KINFGGIVA 210
H+N + GI G + S + Q+ G F YCL +G+ + FG
Sbjct: 242 HRNRGMFIGAAGLLGIGGG---SMSFVGQLSGQTGGAFGYCLVSRGTDSTGSLVFGREAL 298
Query: 211 GAGVVSTPLIIRDH----YYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTL 259
G PL+ YY+ L+ + VG R+ + + G + +DTG T
Sbjct: 299 PVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTR 358
Query: 260 LPL----EYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHF-R 312
LP + KS +N+ +A V CY++S + P V+ +F
Sbjct: 359 LPTGAYAAFRDGFKSQTANLPRASGVSIFDT--------CYDLSGFVSVRVPTVSFYFTE 410
Query: 313 GADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSR 370
G + L N + D C AF + + G I Q + +D V F P+
Sbjct: 411 GPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFGPNV 470
Query: 371 C 371
C
Sbjct: 471 C 471
>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
Length = 507
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 114/384 (29%), Positives = 169/384 (44%), Gaps = 64/384 (16%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+ +++GTP V+ ++DT SD TW QC+PC C+ Q P+FDP+ S++Y ++ +
Sbjct: 141 YIAKIAVGTPAVEALLALDTASDLTWLQCQPCRR--CYPQSGPVFDPRHSTSYGEMNYDA 198
Query: 95 SQCAVV----TSNCSEGDCSYSFLYGRG---AYASFSSGNLATETLTFNSTSGLPVEMPN 147
C + + G C Y+ LYG G S S G+L ETLTF V
Sbjct: 199 PDCQALGRSGGGDAKRGTCIYTVLYGDGDGHGSTSTSVGDLVEETLTFAGG----VRQAY 254
Query: 148 VIFGCGHKN---LASPTSDSKQTGIIGLGPGNSSLISQMG-TSIAGKFSYCL------PD 197
+ GCGH N +P + GI+GL G S+ Q+ FSYCL P
Sbjct: 255 LSIGCGHDNKGLFGAPAA-----GILGLSRGQISIPHQIAFLGYNASFSYCLVDFISGPG 309
Query: 198 QGSSKINFGGIVAGAGVVS-------TPLIIRDH----YYLSLEAISVGNQRLEFVS--- 243
SS + F GAG V TP ++ + YY+ L +SVG R+ V+
Sbjct: 310 SPSSTLTF-----GAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERD 364
Query: 244 ------SSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYN 297
+ G + +D+G T L ++ + G G D CY
Sbjct: 365 LQLDPYTGHGGVILDSGTTVTRLARPAYTAFRDAFRAAATGLGQVSTGGPSGLFDT-CYT 423
Query: 298 ISSQP------KFPEVTIHFRGA-DVKLSPSNLFRNI-SDEIMCSAFRG-GNANI-VYGR 347
+ + K P V++HF G ++ L P N + S +C AF G G+ ++ V G
Sbjct: 424 VGGRAGLRHCVKVPAVSMHFAGGVELSLQPKNYLITVDSRGTVCFAFAGTGDRSVSVIGN 483
Query: 348 IMQINFLIGYDIEQAMVSFKPSRC 371
I+Q F + YDI V F P+ C
Sbjct: 484 ILQQGFRVVYDIGGQRVGFAPNSC 507
>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
Length = 407
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 112/368 (30%), Positives = 174/368 (47%), Gaps = 45/368 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + L +GTP +F VDTGSD W QC+PC C+KQ P+FDP+ SS++ I C S
Sbjct: 54 YFVRLGLGTPARSLFMVVDTGSDLPWLQCQPCKS--CYKQADPIFDPRNSSSFQRIPCLS 111
Query: 95 SQC-AVVTSNCS-----EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
C A+ +CS CSY YG G SFS G+ +++ T + S + +V
Sbjct: 112 PLCKALEVHSCSGSRGATSRCSYQVAYGDG---SFSVGDFSSDLFTLGTGS----KAMSV 164
Query: 149 IFGCGHKN--LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD------QGS 200
FGCG N L + + G L + S +S A FSYCL D + S
Sbjct: 165 AFGCGFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDRSNPMTRSS 224
Query: 201 SKINFGGIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQR-------LEFVSSSTGNI 249
S + FG + +PL+ + YY ++ +SVG + L+ S +G +
Sbjct: 225 SSLIFGVAAIPSTAALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQLSQSGSGGV 284
Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP--KFPE 306
+D+G T P ++ ++ N P + P +S CYN S + P
Sbjct: 285 IIDSGTSVTRFPTSVYATIRDAFRNATINLP-----SAPRYSLFDTCYNFSGKASVDVPA 339
Query: 307 VTIHFR-GADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAM 363
+ +HF GAD++L P+N I+ C AF + + + G I Q +F IG+D++++
Sbjct: 340 LVLHFENGADLQLPPTNYLIPINTAGSFCLAFAPTSMELGIIGNIQQQSFRIGFDLQKSH 399
Query: 364 VSFKPSRC 371
++F P +C
Sbjct: 400 LAFAPQQC 407
>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
gi|238011188|gb|ACR36629.1| unknown [Zea mays]
Length = 342
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 108/356 (30%), Positives = 161/356 (45%), Gaps = 52/356 (14%)
Query: 52 VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTS---NCSEGD 108
+DTGSD W QC PC C++Q P+FDP++SS+Y ++ C ++ C + S + G
Sbjct: 3 LDTGSDVVWVQCAPCRR--CYEQSGPVFDPRRSSSYGAVGCGAALCRRLDSGGCDLRRGA 60
Query: 109 CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTG 168
C Y YG G S ++G+ TETLTF + + V GCGH N + + G
Sbjct: 61 CMYQVAYGDG---SVTAGDFVTETLTFAGGA----RVARVALGCGHDNEGLFVAAAGLLG 113
Query: 169 IIGLGPGNSSLISQMGTSIAGKFSYCLPDQ------------GSSKINFGGIVAGAGVVS 216
+ S +Q+ FSYCL D+ SS ++FG GA S
Sbjct: 114 LGRG---GLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFGAGSVGASSAS 170
Query: 217 -TPLI----IRDHYYLSLEAISVGNQRLEFV---------SSSTGNIFVDTGVLRTLLPL 262
TP++ + YY+ L ISVG R+ V S+ G + VD+G T L
Sbjct: 171 FTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLAR 230
Query: 263 EYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQP--KFPEVTIHFR-GADVK 317
+S L+ + +A G+ PG + CY++ + K P V++HF GA+
Sbjct: 231 ASYSALR----DAFRAAAAGGLRLSPGGFSLFDTCYDLGGRRVVKVPTVSMHFAGGAEAA 286
Query: 318 LSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
L P N + S C AF G + + + G I Q F + +D + V F P C
Sbjct: 287 LPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 342
>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 110/351 (31%), Positives = 165/351 (47%), Gaps = 31/351 (8%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ +S+GTP V VDTGSD +W QC+PC C Q LFDP KSSTY+++ C +
Sbjct: 143 YVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQRDQLFDPAKSSTYSAVPCGA 202
Query: 95 SQCA---VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
C+ + + CS C Y YG G S ++G ++TL + + +FG
Sbjct: 203 DACSELRIYEAGCSGSQCGYVVSYGDG---SNTTGVYGSDTLALAPGN----TVGTFLFG 255
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFGGIV 209
CGH A + G++ LG + SL SQ + G FSYCLP + S+ + GG
Sbjct: 256 CGH---AQAGMFAGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPSKQSAAGYLTLGGPT 312
Query: 210 AGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLLPLEY 264
+ +G +T L+ Y + L ISVG Q++ +S+ G VDTG + T LP
Sbjct: 313 SASGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAFAGGTVVDTGTVITRLPPTA 372
Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFR-GADVKLSPS 321
++ L+S I A G D CY+ S P V + F GA + L
Sbjct: 373 YAALRSAFRGAIAPYGYPSAPAN-GILDT-CYDFSRYGVVTLPTVALTFSGGATLALEAP 430
Query: 322 NLFRNISDEIMCSAFRGGNAN-IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ +S + A GG+ + + G + Q +F + +D + V F P C
Sbjct: 431 GI---LSSGCLAFAPNGGDGDAAILGNVQQRSFAVRFD--GSTVGFMPGAC 476
>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
Length = 517
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 119/380 (31%), Positives = 170/380 (44%), Gaps = 55/380 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YLM + +GTPP +DTGSD W QC PC LDCF Q P+FDP SS+Y +++C
Sbjct: 151 YLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPC--LDCFDQVGPVFDPAASSSYRNVTCGD 208
Query: 95 SQCAVVT--------SNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTS-GLPVEM 145
+C +V E C Y + YG S ++G+LA E+ T N T+ G +
Sbjct: 209 QRCGLVAPPEPPRACRRPGEDSCPYYYWYGD---QSNTTGDLALESFTVNLTAPGASRRV 265
Query: 146 PNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SK 202
+V+FGCGH N + G+ S SQ+ FSYCL D GS SK
Sbjct: 266 DDVVFGCGHWNRGLFHGAAGLLGLGRG---PLSFASQLRAVYGHTFSYCLVDHGSDVASK 322
Query: 203 INFGGIVAGAGVVSTPLI-----------IRDHYYLSLEAISVGNQRLEFVSSS------ 245
+ FG A A + P + YY+ L+ + VG + L S +
Sbjct: 323 VVFGEDDALALAAAHPQLNYTAFAPASSPADTFYYVKLKGVLVGGELLNISSDTWGVGEG 382
Query: 246 ---TGNIFVDTG-VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNIS 299
+G +D+G L + Y ++ + M ++ P+ P F VL CYN+S
Sbjct: 383 EGGSGGTIIDSGTTLSYFVEPAYQVIRQAFIDRMGRSYPLI-----PDFP-VLSPCYNVS 436
Query: 300 --SQPKFPEVTIHFR-GADVKLSPSNLF-RNISDEIMCSAFRG--GNANIVYGRIMQINF 353
+P+ PE+++ F GA N F R D IMC A G + G Q NF
Sbjct: 437 GVDRPEVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTGMSIIGNFQQQNF 496
Query: 354 LIGYDIEQAMVSFKPSRCTN 373
+ YD++ + F P RC
Sbjct: 497 HVVYDLKNNRLGFAPRRCAE 516
>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
Length = 463
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 115/358 (32%), Positives = 171/358 (47%), Gaps = 42/358 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP V ++DTGSD +W QC PCP C Q LFDP KSSTY ++SC++
Sbjct: 127 YVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPCHAQTGALFDPAKSSTYRAVSCAA 186
Query: 95 SQCAVVTSN-----CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
++CA + + +C Y YG G S ++G + +TLT SG +
Sbjct: 187 AECAQLEQQGNGCGATNYECQYGVQYGDG---STTNGTYSRDTLTL---SGASDAVKGFQ 240
Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-PDQGSSKINFGGI 208
FGC H L S SD + G++GLG G SL+SQ + FSYCL P GSS G
Sbjct: 241 FGCSH--LESGFSD-QTDGLMGLGGGAQSLVSQTAAAYGNSFSYCLPPTSGSSGFLTLGG 297
Query: 209 VAGAGVVSTPLIIRDH-----YYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLP 261
GA T ++R Y L+ I+VG ++L S + G++ VD+G + T LP
Sbjct: 298 GGGASGFVTTRMLRSKQIPTFYGARLQDIAVGGKQLGLSPSVFAAGSV-VDSGTIITRLP 356
Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQPK--FPEVTIHFR-GADVK 317
+S L S +K + P S + C++ + Q + P V + F GA +
Sbjct: 357 PTAYSALSSAFKAGMKQYR-----SAPARSILDTCFDFAGQTQISIPTVALVFSGGAAID 411
Query: 318 LSPSNLFRNISDEIMCSAFRG----GNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
L P+ + C AF G I+ G + Q F + YD+ + + F+ C
Sbjct: 412 LDPNGIMYG-----NCLAFAATGDDGTTGII-GNVQQRTFEVLYDVGSSTLGFRSGAC 463
>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
Length = 538
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 112/365 (30%), Positives = 169/365 (46%), Gaps = 51/365 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +GTP + + +DTGSD W QCEPC + C+ Q P+F+P S++++++ C+S
Sbjct: 197 YFTRIGVGTPMREQYMVLDTGSDVVWIQCEPCSK--CYSQVDPIFNPSLSASFSTLGCNS 254
Query: 95 SQCAVVTS-NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
+ C+ + + NC G C Y YG G+Y + G+ ATE LTF +TS + NV GCG
Sbjct: 255 AVCSYLDAYNCHGGGCLYKVSYGDGSY---TIGSFATEMLTFGTTS-----VRNVAIGCG 306
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD---QGSSKINFGGIVA 210
H N + G+ S SQ+GT FSYCL D + S + FG
Sbjct: 307 HDNAGLFVGAAGLLGLGAG---LLSFPSQLGTQTGRAFSYCLVDRFSESSGTLEFGPESV 363
Query: 211 GAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFV---------SSSTGNIFVDTGVLR 257
G + TPL+ + YY+ L +ISVG L+ V +S G VD+G
Sbjct: 364 PLGSILTPLLTNPSLPTFYYVPLISISVGGALLDSVPPDVFRIDETSGRGGFIVDSGTAV 423
Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNISSQP--KFPEVTIHF 311
T L++ + + ++ V G P V CY++S P P V HF
Sbjct: 424 T--------RLQTPVYDAVRDAFVAGTRQLPKAEGVSIFDTCYDLSGLPLVNVPTVVFHF 475
Query: 312 -RGADVKLSPSNLFRNISDEIM---CSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSF 366
GA + L N I + M C AF +++ + G I Q + +D ++V F
Sbjct: 476 SNGASLILPAKNYM--IPMDFMGTFCFAFAPATSDLSIMGNIQQQGIRVSFDTANSLVGF 533
Query: 367 KPSRC 371
+C
Sbjct: 534 ALRQC 538
>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
Length = 468
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 113/370 (30%), Positives = 177/370 (47%), Gaps = 47/370 (12%)
Query: 30 SVDDI-YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYN 88
SVD + Y++ + +GTP V +DTGSD +W QC+PC C+ Q+ PLFDP KSSTY
Sbjct: 118 SVDSLEYVVTVGLGTPSVSQVLLIDTGSDLSWVQCQPCNSTTCYPQKDPLFDPSKSSTYA 177
Query: 89 SISCSSSQCAVVTSN-----CSEGD----CSYSFLYGRGAYASFSSGNLATETLTFNSTS 139
I C++ C +T + C+ GD C ++ YG G S + G + ETL
Sbjct: 178 PIPCNTDACRDLTDDGYGGGCASGDGAAQCGFAITYGDG---SQTRGVYSNETLALAPG- 233
Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--- 196
V + + FGCGH ++ K G++GLG SL+ Q + G FSYCLP
Sbjct: 234 ---VAVKDFRFGCGHDQDG---ANDKYDGLLGLGGAPESLVVQTASVYGGAFSYCLPALN 287
Query: 197 ------DQGSSKINFGGIVAGAGVVSTPLIIRDH--YYLSLEAISVGNQRLEFVSSS-TG 247
G GG+V +G V TP+I + Y +++ I+VG + ++ S+ +G
Sbjct: 288 NQVGFLALGGGGAPSGGVVNTSGFVFTPMIREEETFYVVNMTGITVGGEPIDVPPSAFSG 347
Query: 248 NIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFP 305
+ +D+G + T L ++ L++ + A P+ G D CY+ S S P
Sbjct: 348 GMIIDSGTVVTELQHTAYNALQAAFRKAMAAYPL----VRNGELDT-CYDFSGYSNVTLP 402
Query: 306 EVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQ 361
+V + F GA + L N + D+ C AF+ + + G + Q + YD +
Sbjct: 403 KVALTFSGGATIDLDVPNGI--LLDD--CLAFQESGPDDQPGILGNVNQRTLEVLYDAGR 458
Query: 362 AMVSFKPSRC 371
V F+ + C
Sbjct: 459 GRVGFRAAVC 468
>gi|302802500|ref|XP_002983004.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
gi|300149157|gb|EFJ15813.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
Length = 332
Score = 134 bits (338), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 112/368 (30%), Positives = 173/368 (47%), Gaps = 67/368 (18%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCS 93
+Y +++G+PP D +DTGSD TW +C+PC DC FD S+TY +++C+
Sbjct: 2 VYYSTITLGSPPKDFSLVMDTGSDLTWVRCDPCSP-DC----SSTFDRLASNTYKALTCA 56
Query: 94 SSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTF-NSTSGLPVEMPNVIFGC 152
YS+ YG G SF+ G+L+ +TL + S E P +FGC
Sbjct: 57 D---------------DYSYGYGDG---SFTQGDLSVDTLKMAGAASDELEEFPGFVFGC 98
Query: 153 GH--KNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS------SKIN 204
G K L S + GI+ L PG+ S SQ+G KFSYCL Q + S +
Sbjct: 99 GSLLKGLI-----SGEVGILALSPGSLSFPSQIGEKYGNKFSYCLLRQTAQNSLKKSPMV 153
Query: 205 FGGIVA-----GAGVVS----TPLIIRDHYY-LSLEAISVGNQRLE-----FVSSSTGNI 249
FG G+G + TP+ YY + L+ ISVGNQRL+ F++
Sbjct: 154 FGEAAVELKEPGSGKLQELQYTPIGESSIYYTVRLDGISVGNQRLDLSPSAFLNGQDKPT 213
Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQ---PVKGVGAEPGFSDVLCYNI--SSQPKF 304
D+G T+LP ++K +++M+ +KG+ A C+ + SS
Sbjct: 214 IFDSGTTLTMLPPGVCDSIKQSLASMVSGAEFVAIKGLDA--------CFRVPPSSGQGL 265
Query: 305 PEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAM 363
P++T HF GAD PSN ++ + C F N ++G + Q +F + +D++
Sbjct: 266 PDITFHFNGGADFVTRPSNYVIDLG-SLQCLIFVPTNEVSIFGNLQQQDFFVLHDMDNRR 324
Query: 364 VSFKPSRC 371
+ FK + C
Sbjct: 325 IGFKETDC 332
>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 134 bits (338), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 119/360 (33%), Positives = 176/360 (48%), Gaps = 43/360 (11%)
Query: 35 YLMHLSIGTPPVDIFGSV-DTGSDCTWTQCEPCP-ELDCFKQEPPLFDPKKSSTYNSISC 92
YL + +G P V +F V DTGSD TW QC+PC E C+KQ P+FDPK SS+Y+ +SC
Sbjct: 148 YLAQIGVGQP-VKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSC 206
Query: 93 SSSQCAVV-TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
+S QC ++ +NC+ C Y YG G SF++G LATETL+F +++ +PN+ G
Sbjct: 207 NSQQCKLLDKANCNSDTCIYQVHYGDG---SFTTGELATETLSFGNSN----SIPNLPIG 259
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD---QGSSKINFGGI 208
CGH N + G+ G G SL SQ+ A FSYCL + SS + F
Sbjct: 260 CGHDNEGLFAGGAGLIGLGG---GAISLSSQLK---ASSFSYCLVNLDSDSSSTLEFNSN 313
Query: 209 VAGAGVVSTPLIIRDHY----YLSLEAISVGNQ-------RLEFVSSSTGNIFVDTGVLR 257
+ + S PL+ D + Y+ + ISVG + R E S G I VD+G +
Sbjct: 314 MPSDSLTS-PLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDSGTII 372
Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQPKFPEVTIHF---RG 313
+ LP + + +L+ + + + PG S CYN S Q TI F G
Sbjct: 373 SRLPSDVYESLREAFVKLTSS-----LSPAPGISVFDTCYNFSGQSNVEVPTIAFVLSEG 427
Query: 314 ADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
++L N + C AF +++ + G Q + YD+ ++V F ++C
Sbjct: 428 TSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLVGFSTNKC 487
>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 463
Score = 134 bits (338), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 113/358 (31%), Positives = 171/358 (47%), Gaps = 42/358 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP V ++DTGSD +W QC PCP C+ Q LFDP KSSTY ++SC++
Sbjct: 127 YVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPCYAQTGALFDPAKSSTYRAVSCAA 186
Query: 95 SQCAVVTSN-----CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
++CA + + +C Y YG G S ++G + +TLT SG +
Sbjct: 187 AECAQLEQQGNGCGATNYECQYGVQYGDG---STTNGTYSRDTLTL---SGASDAVKGFQ 240
Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-PDQGSSKINFGGI 208
FGC H + S SD + G++GLG G SL+SQ + FSYCL P GSS G
Sbjct: 241 FGCSH--VESGFSD-QTDGLMGLGGGAQSLVSQTAAAYGNSFSYCLPPTSGSSGFLTLGG 297
Query: 209 VAGAGVVSTPLIIRDH-----YYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLP 261
G T ++R Y L+ I+VG ++L S + G++ VD+G + T LP
Sbjct: 298 GGGVSGFVTTRMLRSRQIPTFYGARLQDIAVGGKQLGLSPSVFAAGSV-VDSGTIITRLP 356
Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQPK--FPEVTIHFR-GADVK 317
+S L S +K + P S + C++ + Q + P V + F GA +
Sbjct: 357 PTAYSALSSAFKAGMKQYR-----SAPARSILDTCFDFAGQTQISIPTVALVFSGGAAID 411
Query: 318 LSPSNLFRNISDEIMCSAFRG----GNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
L P+ + C AF G I+ G + Q F + YD+ + + F+ C
Sbjct: 412 LDPNGIMYG-----NCLAFAATGDDGTTGII-GNVQQRTFEVLYDVGSSTLGFRSGAC 463
>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 134 bits (338), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 116/359 (32%), Positives = 173/359 (48%), Gaps = 41/359 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCP-ELDCFKQEPPLFDPKKSSTYNSISCS 93
YL + +G P + DTGSD TW QC+PC E C+KQ P+FDPK SS+Y+ +SC+
Sbjct: 148 YLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCN 207
Query: 94 SSQCAVV-TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
S QC ++ +NC+ C Y YG G SF++G LATETL+F +++ +PN+ GC
Sbjct: 208 SQQCKLLDKANCNSDTCIYQVHYGDG---SFTTGELATETLSFGNSN----SIPNLPIGC 260
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD---QGSSKINFGGIV 209
GH N + G+ G G SL SQ+ A FSYCL + SS + F +
Sbjct: 261 GHDNEGLFAGGAGLIGLGG---GAISLSSQLK---ASSFSYCLVNLDSDSSSTLEFNSYM 314
Query: 210 AGAGVVSTPLIIRDHY----YLSLEAISVGNQ-------RLEFVSSSTGNIFVDTGVLRT 258
+ S PL+ D + Y+ + ISVG + R E S G I VD+G + +
Sbjct: 315 PSDSLTS-PLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDSGTIIS 373
Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQPKFPEVTIHF---RGA 314
LP + + +L+ + + + PG S CYN S Q TI F G
Sbjct: 374 RLPSDVYESLREAFVKLTSS-----LSPAPGISVFDTCYNFSGQSNVEVPTIAFVLSEGT 428
Query: 315 DVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
++L N + C AF +++ + G Q + YD+ ++V F ++C
Sbjct: 429 SLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSIVGFSTNKC 487
>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
Length = 452
Score = 134 bits (337), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 104/375 (27%), Positives = 165/375 (44%), Gaps = 53/375 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +G PP +DTGSD W QC PC C++Q PL+DP+ S T+ I C+S
Sbjct: 92 YFAVIGVGDPPTHALVVIDTGSDLIWLQCLPCRR--CYRQVTPLYDPRNSKTHRRIPCAS 149
Query: 95 SQCAVVTS----NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
QC V + G C Y +YG G S SSG+LAT+TL + + NV
Sbjct: 150 PQCRGVLRYPGCDARTGGCVYMVVYGDG---SASSGDLATDTLVLPDDT----RVHNVTL 202
Query: 151 GCGHKN---LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKIN--- 204
GCGH N LAS G++G G G S +Q+ + FSYCL D+ S N
Sbjct: 203 GCGHDNEGLLAS------AAGLLGAGRGQLSFPTQLAPAYGHVFSYCLGDRMSRARNSSS 256
Query: 205 ---FGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSST---------GN 248
FG TPL YY+ + SVG +R+ S+++ G
Sbjct: 257 YLVFGRTPELPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVAGFSNASLALNPATGRGG 316
Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI-----SSQPK 303
+ VD+G + + ++ ++ + A ++ + + D CY++ + +
Sbjct: 317 VVVDSGTAISRFTRDAYAAVRDAFVSHAAAAGMRRLRNKFSVFDT-CYDVHGNGPGTGVR 375
Query: 304 FPEVTIHF-RGADVKLSPSNLFRNI----SDEIMCSAFRGGNANI-VYGRIMQINFLIGY 357
P + +HF AD+ L +N + C + + + V G + Q F + +
Sbjct: 376 VPSIVLHFAAAADMALPQANYLIPVVGGDRRTYFCLGLQAADDGLNVLGNVQQQGFGVVF 435
Query: 358 DIEQAMVSFKPSRCT 372
D+E+ + F P+ C+
Sbjct: 436 DVERGRIGFTPNGCS 450
>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 134 bits (337), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 117/357 (32%), Positives = 171/357 (47%), Gaps = 40/357 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +G P + +DTGSD W QC+PC DC++Q P+FDP+ SS++ S+ C S
Sbjct: 155 YFSRVGVGQPAKPFYMVLDTGSDINWLQCQPC--TDCYQQTDPIFDPRSSSSFASLPCES 212
Query: 95 SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
QC A+ TS C C Y YG G SF+ G TETLTF + SG+ + +V GCG
Sbjct: 213 QQCQALETSGCRASKCLYQVSYGDG---SFTVGEFVTETLTFGN-SGM---INDVAVGCG 265
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ---GSSKINFGGIVA 210
H N + G+ G SL SQM A FSYCL D+ SS + F A
Sbjct: 266 HDNEGLFVGSAGLLGLGGG---PLSLTSQMK---ASSFSYCLVDRDSSSSSDLEFNS-AA 318
Query: 211 GAGVVSTPLI----IRDHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTL 259
+ V+ PL+ + YY+ L +SVG Q L + S G I VD+G T
Sbjct: 319 PSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYGGIIVDSGTAITR 378
Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVTIHFRGAD-V 316
L + ++ L+ + + +K F CY++SSQ + P V+ F G +
Sbjct: 379 LQTQAYNTLRDAF--VSRTPYLKKTNGFALFD--TCYDLSSQSRVTIPTVSFEFAGGKSL 434
Query: 317 KLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+L P N + S C AF +++ + G + Q + YD+ ++V F P +C
Sbjct: 435 QLPPKNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVHYDLANSVVGFSPHKC 491
>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 134 bits (336), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 110/356 (30%), Positives = 167/356 (46%), Gaps = 41/356 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ +S+GTP V VDTGSD +W QC+PC C Q LFDP KSSTY+++ C +
Sbjct: 143 YVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQRDQLFDPAKSSTYSAVPCGA 202
Query: 95 SQCA---VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
C+ + + CS C Y YG G S ++G ++TL + + +FG
Sbjct: 203 DACSELRIYEAGCSGSQCGYVVSYGDG---SNTTGVYGSDTLALAPGN----TVGTFLFG 255
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFGGIV 209
CGH A + G++ LG + SL SQ + G FSYCLP + S+ + GG
Sbjct: 256 CGH---AQAGMFAGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPSKQSAAGYLTLGGPS 312
Query: 210 AGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLLPLEY 264
+ +G +T L+ Y + L ISVG Q++ +S+ G VDTG + T LP
Sbjct: 313 SASGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAFAGGTVVDTGTVITRLPPTA 372
Query: 265 HSNLKSVMSNMIK-----AQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFR-GADV 316
++ L+S I + P G+ D CY+ S P V + F GA +
Sbjct: 373 YAALRSAFRGAIAPCGYPSAPANGI------LDT-CYDFSRYGVVTLPTVALTFSGGATL 425
Query: 317 KLSPSNLFRNISDEIMCSAFRGGNAN-IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
L + +S + A GG+ + + G + Q +F + +D + V F P C
Sbjct: 426 ALEAPGI---LSSGCLAFAPNGGDGDAAILGNVQQRSFAVRFD--GSTVGFMPGAC 476
>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 134 bits (336), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 117/357 (32%), Positives = 171/357 (47%), Gaps = 40/357 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +G P + +DTGSD W QC+PC DC++Q P+FDP+ SS++ S+ C S
Sbjct: 155 YFSRVGVGQPAKPFYMVLDTGSDINWLQCQPC--TDCYQQTDPIFDPRSSSSFASLPCES 212
Query: 95 SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
QC A+ TS C C Y YG G SF+ G ETLTF + SG+ + NV GCG
Sbjct: 213 QQCQALETSGCRASKCLYQVSYGDG---SFTVGEFVIETLTFGN-SGM---INNVAVGCG 265
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ---GSSKINFGGIVA 210
H N + G+ G + SL SQM A FSYCL D+ SS + F A
Sbjct: 266 HDNEGLFVGSAGLLGLGGG---SLSLTSQMK---ASSFSYCLVDRDSSSSSDLEFNS-AA 318
Query: 211 GAGVVSTPLI----IRDHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTL 259
+ V+ PL+ + YY+ L +SVG Q L + S G I VD+G T
Sbjct: 319 PSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYGGIIVDSGTAITR 378
Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVTIHFRGAD-V 316
L + ++ L+ + + +K F CY++SSQ + P V+ F G +
Sbjct: 379 LQTQAYNTLRDAF--VSRTPYLKKTNGFALFD--TCYDLSSQSRVTIPTVSFEFAGGKSL 434
Query: 317 KLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+L P N + S C AF +++ + G + Q + YD+ ++V F P +C
Sbjct: 435 QLPPKNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVHYDLANSVVGFSPHKC 491
>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
Length = 485
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 113/359 (31%), Positives = 162/359 (45%), Gaps = 40/359 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +G P D +DTGSD TW QCEPC DC++Q P+++P SS+Y + C +
Sbjct: 145 YFSRIGVGAPRRDQLMVLDTGSDVTWIQCEPCS--DCYQQSDPIYNPALSSSYKLVGCQA 202
Query: 95 SQCAVV-TSNCSE-GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
+ C + S CS G C Y YG G+Y + GN ATETLT G P++ NV GC
Sbjct: 203 NLCQQLDVSGCSRNGSCLYQVSYGDGSY---TQGNFATETLTLG---GAPLQ--NVAIGC 254
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD---QGSSKINFGGIV 209
GH N + G+ G + S SQ+ FSYCL D + SS + FG
Sbjct: 255 GHDNEGLFVGAAGLLGLGGG---SLSFPSQLTDENGKIFSYCLVDRDSESSSTLQFGRAA 311
Query: 210 AGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFV-------SSSTGNIFVDTGVLRT 258
G V P++ + YY+SL ISVG + L +S G + VD+G T
Sbjct: 312 VPNGAVLAPMLKNSRLDTFYYVSLSGISVGGKMLSISDSVFGIDASGNGGVIVDSGTAVT 371
Query: 259 LLPLEYHSNLKSVMSNMIKAQP-VKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFRGAD 315
L + +L+ K P GV CY++SS+ P V HF G
Sbjct: 372 RLQTAAYDSLRDAFRAGTKNLPSTDGVSLFD-----TCYDLSSKESVDVPTVVFHFSGGG 426
Query: 316 VKLSPSNLFRNISDEI--MCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
P+ + D + C AF ++++ + G I Q + +D V F ++C
Sbjct: 427 SMSLPAKNYLVPVDSMGTFCFAFAPTSSSLSIVGNIQQQGIRVSFDRANNQVGFAVNKC 485
>gi|449433371|ref|XP_004134471.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449495479|ref|XP_004159853.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 424
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 117/385 (30%), Positives = 178/385 (46%), Gaps = 43/385 (11%)
Query: 12 DNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDC 71
DN+ SP + E YLM +IG P + G +DT + W QC C C
Sbjct: 59 DNDVSLSPTLVNEGGE-------YLMSFNIGNPSSQVMGFLDTSNGLIWVQCSNCNS-QC 110
Query: 72 FKQEPPL---FDPKKSSTYNSISCSSSQCAVVTS----NCSEGDCSYSFLYGRGAYASFS 124
++ L F KS TY C S+ C +T N S+ C Y +YG +
Sbjct: 111 EPEKRGLTTKFLSSKSFTYEMEPCGSNFCNSLTGFQTCNSSDKWCKYRLVYGDNKA---T 167
Query: 125 SGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMG 184
SG L++++ F+++ G+ V++ + FGC L + TG +GL SLISQ+G
Sbjct: 168 SGILSSDSFGFDTSDGMLVDVGFLNFGCSEAPLTG--DEQSYTGNVGLNQTPLSLISQLG 225
Query: 185 TSIAGKFSYCL---PDQGS-SKINFGGIVAGAGVVSTPLII--RDHYYLSLEAISVGNQR 238
KFSYCL + GS SK+ FG + +G TPL+ D YY+ + IS+GN
Sbjct: 226 IK---KFSYCLVPFNNLGSTSKMYFGSLPVTSG-GQTPLLYPNSDAYYVKVLGISIGNDE 281
Query: 239 LEF-----VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV 293
F V +DTG+ + L + S+++ + + +P
Sbjct: 282 PHFDGVFDVYEVRDGWIIDTGITYSSLETD---AFDSLLAKFLTLKDFPQRKDDPKERFE 338
Query: 294 LCY---NISSQPKFPEVTIHFRGADVKLSPSNLFRNISDE-IMCSA-FRGGNANIVYGRI 348
LC+ N + FP+VT+HF GAD+ L+ + F I D+ I C A R G+ + G
Sbjct: 339 LCFELQNANDLESFPDVTVHFDGADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNF 398
Query: 349 MQINFLIGYDIEQAMVSFKPSRCTN 373
N+ +GYD+E ++SF P C +
Sbjct: 399 QLQNYHVGYDLEAQVISFAPVDCAD 423
>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 491
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 112/361 (31%), Positives = 167/361 (46%), Gaps = 48/361 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + IG+PP ++ VDTGSD W QC PC DC++Q P+F+P SS+Y ++C +
Sbjct: 155 YFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPCA--DCYQQADPIFEPSFSSSYAPLTCET 212
Query: 95 SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
QC ++ S C C Y YG G+Y + G+ ATET+T + ++ L NV GCG
Sbjct: 213 HQCKSLDVSECRNDSCLYEVSYGDGSY---TVGDFATETITLDGSASL----NNVAIGCG 265
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ---GSSKINFGGIVA 210
H N + G+ G S I+ A FSYCL ++ +S + F +
Sbjct: 266 HDNEGLFVGAAGLLGLGGGSLSFPSQIN------ASSFSYCLVNRDTDSASTLEFNSPIP 319
Query: 211 GAGVVSTPLIIRDH----YYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTL 259
V + PL+ + YYL + I VG Q L E S G I VD+G T
Sbjct: 320 SHSVTA-PLLRNNQLDTFYYLGMTGIGVGGQMLSIPRSSFEVDESGNGGIIVDSGTAVT- 377
Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNISSQP--KFPEVTIHF-R 312
L+S + N ++ V+G P S V CY++SS+ + P V+ HF
Sbjct: 378 -------RLQSDVYNSLRDSFVRGTQHLPSTSGVALFDTCYDLSSRSSVEVPTVSFHFPD 430
Query: 313 GADVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSR 370
G + L N + S C AF + + + G + Q + YD+ ++V F P+
Sbjct: 431 GKYLALPAKNYLIPVDSAGTFCFAFAPTTSALSIIGNVQQQGTRVSYDLSNSLVGFSPNG 490
Query: 371 C 371
C
Sbjct: 491 C 491
>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 459
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 111/377 (29%), Positives = 171/377 (45%), Gaps = 51/377 (13%)
Query: 30 SVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNS 89
S D YL+ L++GTPP + +DTGSD WTQC PC C Q P+F P SS+Y
Sbjct: 99 SGDLEYLVDLAVGTPPQPVSALLDTGSDLIWTQCAPCAS--CLPQPDPIFSPGASSSYEP 156
Query: 90 ISCSSSQC-AVVTSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTF------NSTSGL 141
+ C+ C ++ +C D C+Y + YG G + + G ATE TF T+ L
Sbjct: 157 MRCAGELCNDILHHSCQRPDTCTYRYSYGDG---TTTRGVYATERFTFSSSSSGGETTKL 213
Query: 142 PVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS- 200
+ FGCG N S + S GI+G G SL+SQ+ +FSYCL S
Sbjct: 214 SAPLG---FGCGTMNKGSLNNGS---GIVGFGRAPLSLVSQLAIR---RFSYCLTPYASG 264
Query: 201 --SKINFGGIVAG-----AGVVSTPLIIRDH-----YYLSLEAISVGNQRLEFVSSS--- 245
S + FG + G V T ++R YY+ ++VG +RL S+
Sbjct: 265 RKSTLLFGSLRGGVYDAATATVQTTRLLRSRQNPTFYYVPFTGVTVGARRLRIPISAFAL 324
Query: 246 ----TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ 301
+G VD+G TL P + + + ++ P G+ G D +C+ ++
Sbjct: 325 RPDGSGGAIVDSGTALTLFPAPVLAEVVRAFRSQLRL-PFAANGSS-GPDDGVCFAAAAS 382
Query: 302 --PK---FPEVTIHFRGADVKLSPSN-LFRNISDEIMCSAFR-GGNANIVYGRIMQINFL 354
P+ P + H +GAD+ L N + + +C G++ G +Q +
Sbjct: 383 RVPRPAVVPRMVFHLQGADLDLPRRNYVLDDQRKGNLCLLLADSGDSGTTIGNFVQQDMR 442
Query: 355 IGYDIEQAMVSFKPSRC 371
+ YD+E +SF P++C
Sbjct: 443 VLYDLEADTLSFAPAQC 459
>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
[Brachypodium distachyon]
Length = 540
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 113/368 (30%), Positives = 165/368 (44%), Gaps = 54/368 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + IG+P ++ +DTGSD TW QC PC DC+ Q PLFDP SS+Y ++ C S
Sbjct: 196 YFSRIGIGSPARQLYMVLDTGSDVTWLQCAPC--ADCYAQSDPLFDPALSSSYATVPCDS 253
Query: 95 SQCAVVTS-----NCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
C + + N + G+ C Y YG G+Y + G+ ATETLT V +
Sbjct: 254 PHCRALDASACHNNAANGNSSCVYEVAYGDGSY---TVGDFATETLTLGGDGSAAVH--D 308
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKIN 204
V GCGH N + + G S IS A +FSYCL D+ S S +
Sbjct: 309 VAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQIS------ATEFSYCLVDRDSPSASTLQ 362
Query: 205 FGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFV--------SSSTGNIFVD 252
FG + + V+ PL+ YY++L ISVG + L + +G + VD
Sbjct: 363 FG--ASDSSTVTAPLMRSPRSNTFYYVALNGISVGGETLSDIPPAAFAMDEQGSGGVIVD 420
Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNIS--SQPKFPE 306
+G T L +S L+ V+G A P S V CY+++ S + P
Sbjct: 421 SGTAVTRLQSSAYSALRDAF--------VRGTQALPRASGVSLFDTCYDLAGRSSVQVPA 472
Query: 307 VTIHFR-GADVKLSPSNLFRNISDE-IMCSAFRG-GNANIVYGRIMQINFLIGYDIEQAM 363
V++ F G ++KL N + C AF G A + G + Q + +D +
Sbjct: 473 VSLRFEGGGELKLPAKNYLIPVDGAGTYCLAFAATGGAVSIVGNVQQQGIRVSFDTAKNT 532
Query: 364 VSFKPSRC 371
V F P++C
Sbjct: 533 VGFSPNKC 540
>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
Length = 458
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 110/355 (30%), Positives = 172/355 (48%), Gaps = 34/355 (9%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+ + +GTP VDTGS TW QC PC + C +Q P+F+P+ SS+Y S+SCS+
Sbjct: 121 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPC-LVSCHRQSGPVFNPRSSSSYASVSCSA 179
Query: 95 SQCAVVT------SNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
QC +T S CS + C Y YG +SFS G L+ +T++F STS +PN
Sbjct: 180 PQCDALTTATLNPSTCSTSNVCIYQASYGD---SSFSVGYLSKDTVSFGSTS-----VPN 231
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGG 207
+GCG N + G+IGL SL+ Q+ S+ FSYCLP SS
Sbjct: 232 FYYGCGQDNEG---LFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSGYLSI 288
Query: 208 IVAGAGVVS-TPL----IIRDHYYLSLEAISVGNQRLEFVSSSTGNI--FVDTGVLRTLL 260
G S TP+ + Y++ + I+V + L +S+ ++ +D+G + T L
Sbjct: 289 GSYNPGQYSYTPMAKSSLDDSLYFIKMTGITVAGKPLSVSASAYSSLPTIIDSGTVITRL 348
Query: 261 PLEYHSNLKSVMSNMIKAQPVKGVGAEPGFS--DVLCYNISSQPKFPEVTIHFR-GADVK 317
P + +S L ++ +K P FS D +S+ + P+V++ F GA +K
Sbjct: 349 PTDVYSALSKAVAGAMKGTPRAS-----AFSILDTCFQGQASRLRVPQVSMAFAGGAALK 403
Query: 318 LSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
L +NL ++ C AF + + G Q F + YD++ + + F C+
Sbjct: 404 LKATNLLVDVDSATTCLAFAPARSAAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 458
>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
Length = 476
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 107/359 (29%), Positives = 160/359 (44%), Gaps = 39/359 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
+++ + GTP DTGSD +W QC PC C+KQ P+FDP KS+TY+ + C
Sbjct: 135 FVVTVGFGTPAQTYTVIFDTGSDVSWIQCLPC-SGHCYKQHDPIFDPTKSATYSVVPCGH 193
Query: 95 SQCAVVT-SNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
QCA S CS G C Y YG G S S+G L+ ETL+ ST L P FGCG
Sbjct: 194 PQCAAADGSKCSNGTCLYKVEYGDG---SSSAGVLSHETLSLTSTRAL----PGFAFGCG 246
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFGGIVAG 211
NL G+IGLG G SL SQ S G FSYCLP ++ + G
Sbjct: 247 QTNLG---DFGDVDGLIGLGRGQLSLSSQAAASFGGTFSYCLPSDNTTHGYLTIGPTTPA 303
Query: 212 AG--VVSTPLIIRDH----YYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLE 263
+ V T ++ + Y++ L +I +G L + + F+D+G + T LP E
Sbjct: 304 SNDDVQYTAMVQKQDYPSFYFVELVSIDIGGYILPVPPTLFTDDGTFLDSGTILTYLPPE 363
Query: 264 YHSNLKSVMS-NMIKAQPVKGVGAEPGFSDV-LCYNISSQPKFPEVTIHFRGADVKLSPS 321
++ L+ M + +P P + CY+ + Q + F+ +D +
Sbjct: 364 AYTALRDRFKFTMTQYKPA------PAYDPFDTCYDFTGQSAIFIPAVSFKFSDGSVFDL 417
Query: 322 NLF------RNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ F + + I C F + + + G + Q N + YD+ + F + C
Sbjct: 418 SFFGILIFPDDTAPAIGCLGFVARPSAMPFTIVGNMQQRNTEVIYDVAAEKIGFASASC 476
>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
Length = 445
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 112/371 (30%), Positives = 163/371 (43%), Gaps = 51/371 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
+ + +SIGTPP +DTGSD WTQC+ +E PL+DP KSS++ + C
Sbjct: 89 HTLTVSIGTPPQPRTLILDTGSDLIWTQCKLFDTRQ--HREKPLYDPAKSSSFAAAPCDG 146
Query: 95 SQC---AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
C + T NCS C Y++ YG ++ + G LA+ET TF + V + FG
Sbjct: 147 RLCETGSFNTKNCSRNKCIYTYNYG----SATTKGELASETFTFGEHRRVSVSLD---FG 199
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP----DQGSSKINFG- 206
CG S S GI+G+ P SL+SQ+ +FSYCL +S I FG
Sbjct: 200 CGKLTSGSLPGAS---GILGISPDRLSLVSQLQIP---RFSYCLTPFLDRNTTSHIFFGA 253
Query: 207 ----------GIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSSS-------TGNI 249
G + +V+ P +YY+ L ISVG +RL SS +G
Sbjct: 254 MADLSKYRTTGPIQTTSLVTNPDGSNYYYYVPLIGISVGTKRLNVPVSSFAIGRDGSGGT 313
Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYN--------ISSQ 301
FVD+G +LP LK M +K V + G+ LC+ + +
Sbjct: 314 FVDSGDTTGMLPSVVMEALKEAMVEAVKLPVVNAT--DHGYEYELCFQLPRNGGGAVETA 371
Query: 302 PKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIE 360
+ P + HF GA + L + +S MC G + G Q N + +D+E
Sbjct: 372 VQVPPLVYHFDGGAAMLLRRDSYMVEVSAGRMCLVISSGARGAIIGNYQQQNMHVLFDVE 431
Query: 361 QAMVSFKPSRC 371
SF P++C
Sbjct: 432 NHEFSFAPTQC 442
>gi|356528675|ref|XP_003532925.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Glycine max]
Length = 342
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 116/390 (29%), Positives = 156/390 (40%), Gaps = 114/390 (29%)
Query: 8 PFYNDNETPKSPIS---------IIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDC 58
PFYN + TP I+ + ++ +I + YLM L IGTPPV+ DTGSD
Sbjct: 42 PFYNPSLTPSERITDAALSSNENKLPESILIPNNGEYLMRLYIGTPPVERLVIADTGSDF 101
Query: 59 TWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRG 118
W QC PC NC C Y +Y
Sbjct: 102 IWVQCSPC-----------------------------------QNC---QCVYLNIY--- 120
Query: 119 AYASFSSGNLATETLTFNSTSGL-PVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNS 177
A SF+ + TETL+F+ST G V PN IFGCG N + S K TG++GL G
Sbjct: 121 ANKSFTIEVVGTETLSFDSTGGAQTVSFPNSIFGCGANNNLTFRSSDKATGLVGLVAGQL 180
Query: 178 SLISQMGTSIAGKFSYCLPDQGSSKINFG--GIVAGAGVVSTPLIIRDH---YYLSLEAI 232
SL+SQ+G I KFSY + FG I+ GVVSTPLII+ Y+L+LE +
Sbjct: 181 SLVSQLGAQIGYKFSY---------LKFGSEAIITTNGVVSTPLIIKPSLPLYFLNLEVV 231
Query: 233 SVGNQRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEP---- 288
++G K P + +G E
Sbjct: 232 TIGQ----------------------------------------KVVPTETLGVESVQDL 251
Query: 289 GFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNAN-----I 343
F C+ P + F GA V L P NL + D M +A+
Sbjct: 252 PFPFKFCFPYRDNMTVPAIAFQFTGASVALRPKNLLIKLQDRNMLXLAVVPSASSLSVIS 311
Query: 344 VYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
++G I Q +F + YD++ VS P+ CT
Sbjct: 312 IFGIIAQFDFQVLYDLDGKKVSVAPTDCTK 341
>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
Length = 497
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 113/360 (31%), Positives = 161/360 (44%), Gaps = 40/360 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y L +GTPP + +DTGSD W QC PC + C+ Q PLF+P SSTY + C++
Sbjct: 153 YFTRLGVGTPPRYTYMVLDTGSDIMWIQCLPCAK--CYGQTDPLFNPAASSTYRKVPCAT 210
Query: 95 SQCAVV-TSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
C + S C C Y YG G SF+ G+ +TETLTF + V GC
Sbjct: 211 PLCKKLDISGCRNKRYCEYQVSYGDG---SFTVGDFSTETLTFRGQV-----IRRVALGC 262
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG----SSKINFGGI 208
GH N + G+ G+ S SQ G + +FSYCL D+ +S + FG
Sbjct: 263 GHDNEGLFIGAAGLLGLG---RGSLSFPSQTGAQFSKRFSYCLVDRSASGTASSLIFGKA 319
Query: 209 VAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLE------FVSSSTGN--IFVDTGVL 256
+ TPL+ + YY+ L ISVG +RL F +TGN + +D+G
Sbjct: 320 AIPKSAIFTPLLSNPKLDTFYYVELVGISVGGRRLTSIPASVFRMDATGNGGVIIDSGTS 379
Query: 257 RTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFR-G 313
T L +S ++ + +K G F CY++S K P + HF+ G
Sbjct: 380 VTRLVDSAYSTMRDAFR--VGTGNLKSAGGFSLFDT--CYDLSGLKTVKVPTLVFHFQGG 435
Query: 314 ADVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
A + L +N + S C AF G + + G I Q + + +D V FK C
Sbjct: 436 AHISLPATNYLIPVDSSATFCFAFAGNTGGLSIIGNIQQQGYRVVFDSLANRVGFKAGSC 495
>gi|147858841|emb|CAN78694.1| hypothetical protein VITISV_037475 [Vitis vinifera]
Length = 442
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 109/362 (30%), Positives = 180/362 (49%), Gaps = 34/362 (9%)
Query: 28 IISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTY 87
+I +L +LSIG PP +++ +DTGSD W QCEPC C+KQ+ P+++ KS +Y
Sbjct: 86 LIRDKSAFLANLSIGNPPTNVYVVLDTGSDLFWIQCEPCDV--CYKQKDPIYNRTKSDSY 143
Query: 88 NSISCSSSQCAVV--TSNCSE-GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
+ C+ C + CS+ G C Y Y GA +SG L+ E + F S +
Sbjct: 144 TEMLCNEPPCVSLGREGQCSDSGSCLYQTAYADGAR---TSGLLSYEKVAFTSHYSDEDK 200
Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGT--SIAGKFSYCL-----PD 197
V FGCG +NL TS+ + G++GLGPG SL+SQ+ ++ F+YC P+
Sbjct: 201 TAQVGFGCGLQNLNFITSN-RDGGVLGLGPGLVSLVSQLSAIGKVSKSFAYCFGNISNPN 259
Query: 198 QGSSKINFGGIVAGAGVVSTPLIIRDHYYLSLEAI--SVGNQRLEFVSSS-------TGN 248
G + FG G + TP++I + YY++L I VG RL+ SSS +G
Sbjct: 260 AGGFLV-FGDATYLNGDM-TPMVIAEFYYVNLLGIGLGVGEPRLDINSSSFERKPDGSGG 317
Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ---PKFP 305
+ +D+G ++ P E + +++ + + +K KG P S C+ + P FP
Sbjct: 318 VIIDSGSTLSVFPPEVYEVVRNAVVDKLK----KGYNISPLTSSPDCFEGKIERDLPLFP 373
Query: 306 EVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVS 365
+ ++ + ++F DE+ C F G + G + Q ++ GY++E + +S
Sbjct: 374 TLVLYLESTGILNDRWSIFLQRYDELFCLGFTSGEGLSIIGTLAQQSYKFGYNLELSTLS 433
Query: 366 FK 367
+
Sbjct: 434 IE 435
>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
Length = 510
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 119/386 (30%), Positives = 180/386 (46%), Gaps = 61/386 (15%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + L +GTP V++ +DTGSD +W QC PC DC P F+P+ SS++ + C+S
Sbjct: 138 YYVPLQLGTPAVEVVLIMDTGSDVSWIQCVPCK--DCVPALRPPFNPRHSSSFFKLPCAS 195
Query: 95 SQCAVVTSN----CSEG--DCSYSFLYGRGAYASFSSGNLATETLTFNSTS---GLPVEM 145
S C V CS C +S YG G S SSG LA ET+ N+ + G PV++
Sbjct: 196 STCTNVYQGVKPFCSPSGRTCLFSIQYGDG---SLSSGLLAMETIAGNTPNFGDGEPVKL 252
Query: 146 PNVIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKIN 204
N+ GC + PT S G++G+ S SQ+ + A KFS+C PD+ + +N
Sbjct: 253 SNITLGCADIDREGLPTGAS---GLLGMDRRPISFPSQLSSRYARKFSHCFPDK-IAHLN 308
Query: 205 FGGIV--AGAGVVS-----TPLIIR--------DHYYLSLEAISVGNQRLEF-------- 241
G+V + ++S TPL+ D+YY+ L ISV RL
Sbjct: 309 SSGLVFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKNFDID 368
Query: 242 -VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS 300
V+ S G I +D+G T L ++ + + + V GF+ CYNI+S
Sbjct: 369 KVTGSGGTI-IDSGTAFTYLKKPAFQAMRREF--LARTSHLAKVDDNSGFTP--CYNITS 423
Query: 301 QPK------FPEVTIHFRGA-DVKLSPSNLFRNIS----DEIMCSAFR--GGNANIVYGR 347
P +T+HFRG DV L +++ +S +C AF+ G + G
Sbjct: 424 GTAALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFQMSGDIPFNIIGN 483
Query: 348 IMQINFLIGYDIEQAMVSFKPSRCTN 373
Q N + YD+E+ + P++C
Sbjct: 484 YQQQNLWVEYDLEKLRLGIAPAQCAT 509
>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 494
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 107/338 (31%), Positives = 165/338 (48%), Gaps = 34/338 (10%)
Query: 52 VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEG---- 107
VDT SD W QC PCP C Q+ PL+DP KSST+ I C S C + S+ G
Sbjct: 173 VDTSSDIPWVQCLPCPIPQCHLQKDPLYDPAKSSTFAPIPCGSPACKELGSSYGNGCSPT 232
Query: 108 --DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSK 165
+C Y YG G ++G T+TLT + T + + + FGC H S ++
Sbjct: 233 TDECKYIVNYGDG---KATTGTYVTDTLTMSPT----IVVKDFRFGCSHAVRGS--FSNQ 283
Query: 166 QTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK-INFGGIVAGAGVVS-TPLIIRD 223
GI+ LG G SL+ Q + FSYC+P S+ ++ GG V + S TPLI
Sbjct: 284 NAGILALGGGRGSLLEQTADAYGNAFSYCIPKPSSAGFLSLGGPVEASLKFSYTPLIKNK 343
Query: 224 H----YYLSLEAISVGNQRLEF--VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIK 277
H Y + LEAI V ++L + +TG + +D+G + T LP + ++ L++ + +
Sbjct: 344 HAPTFYIVHLEAIIVAGKQLAVPPTAFATGAV-MDSGAVVTQLPPQVYAALRAAFRSAMA 402
Query: 278 AQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFR-GADVKLSPSNLFRNISDEIMCS 334
A G A P + CY+ + P K P+V++ F GA + L P+++ I D +
Sbjct: 403 AY---GPLAAPVRNLDTCYDFTRFPDVKVPKVSLVFAGGATLDLEPASI---ILDGCLAF 456
Query: 335 AFRGGNANIVY-GRIMQINFLIGYDIEQAMVSFKPSRC 371
A G ++ + G + Q + + YD+ V F+ C
Sbjct: 457 AATPGEESVGFIGNVQQQTYEVLYDVGGGKVGFRRGAC 494
>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 104/358 (29%), Positives = 157/358 (43%), Gaps = 39/358 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + + +G+PP + +D+GSD W QC+PC + C+ Q PLFDP S+++ +SCSS
Sbjct: 43 YFVRIGLGSPPRSQYMVIDSGSDIVWVQCKPCTQ--CYHQTDPLFDPADSASFMGVSCSS 100
Query: 95 SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
+ C V + C+ G C Y YG G+Y + G LA ETLTF T + NV GCG
Sbjct: 101 AVCDRVENAGCNSGRCRYEVSYGDGSY---TKGTLALETLTFGRTV-----VRNVAIGCG 152
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---INFGGIVA 210
H N + G+ G + S + Q+ FSYCL +G++ + FG
Sbjct: 153 HSNRGMFVGAAGLLGLGGG---SMSFMGQLSGQTGNAFSYCLVSRGTNTNGFLEFGSEAM 209
Query: 211 GAGVVSTPLIIRDH----YYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTL 259
G PL+ YY+ L + VG+ R+ + +G + +DTG T
Sbjct: 210 PVGAAWIPLVRNPRAPSFYYIRLLGLGVGDTRVPVSEDVFQLNELGSGGVVMDTGTAVTR 269
Query: 260 LPLEYHSNLKSVMSNMIKAQP-VKGVGAEPGFSDVLCYNISS--QPKFPEVTIHFRGADV 316
P + ++ + P GV CYN+ + P V+ +F G +
Sbjct: 270 FPTVAYEAFRNAFIEQTQNLPRASGVSIFD-----TCYNLFGFLSVRVPTVSFYFSGGPI 324
Query: 317 KLSPSNLFRNISDE--IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
P+N F D+ C AF + + + G I Q I D V F P+ C
Sbjct: 325 LTIPANNFLIPVDDAGTFCFAFAPSPSGLSILGNIQQEGIQISVDEANEFVGFGPNIC 382
>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 476
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 106/361 (29%), Positives = 156/361 (43%), Gaps = 45/361 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + + +G+PP + +D+GSD W QC+PC E C++Q P+FDP S+TY ISC S
Sbjct: 137 YFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSE--CYQQSDPVFDPAGSATYAGISCDS 194
Query: 95 SQCAVV-TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
S C + + C++G C Y YG G+Y + G LA ETLTF V + N+ GCG
Sbjct: 195 SVCDRLDNAGCNDGRCRYEVSYGDGSY---TRGTLALETLTFGR-----VLIRNIAIGCG 246
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---INFGGIVA 210
H N + G+ G S + Q+G G FSYCL +G+ + FG
Sbjct: 247 HMNRGMFIGAAGLLGLGGG---AMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFGRGAM 303
Query: 211 GAGVVSTPLIIRDH----YYLSLEA-------ISVGNQRLEFVSSSTGNIFVDTGVLRTL 259
G PLI YY+ L + + Q E G + +DTG T
Sbjct: 304 PVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGTAVTR 363
Query: 260 LPLEYHSNLKSVM----SNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHFRG 313
LP + + +N+ ++ V CYN++ + P V+ +F G
Sbjct: 364 LPAPAYEAFRDTFIGQTANLPRSDRVSIFDT--------CYNLNGFVSVRVPTVSFYFSG 415
Query: 314 ADVKLSPS-NLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSR 370
+ P+ N + E C AF + + + G I Q I D V F P+
Sbjct: 416 GPILTLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTI 475
Query: 371 C 371
C
Sbjct: 476 C 476
>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 449
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 108/357 (30%), Positives = 166/357 (46%), Gaps = 36/357 (10%)
Query: 36 LMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSS 95
+ ++SIG PP+ +DTGSD W C PC +C LFDP SST++ + +
Sbjct: 102 MANISIGQPPIPQLVVMDTGSDILWVMCTPC--TNCDNHLGLLFDPSMSSTFSPL--CKT 157
Query: 96 QCAVVTSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
C CS D ++ Y A S +SG +T+ F +T +P+V+FGCGH
Sbjct: 158 PCDF--KGCSRCDPIPFTVTY---ADNSTASGMFGRDTVVFETTDEGTSRIPDVLFGCGH 212
Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGV 214
N+ T D GI+GL G SL +++G KFSYC+ D N+ ++ G G
Sbjct: 213 -NIGQDT-DPGHNGILGLNNGPDSLATKIGQ----KFSYCIGDLADPYYNYHQLILGEGA 266
Query: 215 ----VSTPLIIRD-HYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTLLPL 262
STP + + YY+++E ISVG +RL E + TG + +DTG T L
Sbjct: 267 DLEGYSTPFEVHNGFYYVTMEGISVGEKRLDIAPETFEMKKNRTGGVIIDTGSTITFLVD 326
Query: 263 EYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ-PKFPEVTIHFR-GADVKLSP 320
H L + N++ + + + +IS FP VT HF GAD+ L
Sbjct: 327 SVHRLLSKEVRNLLGWSFRQTTIEKSPWMQCFYGSISRDLVGFPVVTFHFADGADLALDS 386
Query: 321 SNLFRNISDEIMC------SAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ F ++D + C S+ + + G + Q ++ +GYD+ V F+ C
Sbjct: 387 GSFFNQLNDNVFCMTVGPVSSLNLKSKPSLIGLLAQQSYSVGYDLVNQFVYFQRIDC 443
>gi|147801191|emb|CAN68822.1| hypothetical protein VITISV_007106 [Vitis vinifera]
Length = 443
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 107/357 (29%), Positives = 153/357 (42%), Gaps = 92/357 (25%)
Query: 41 IGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQC-AV 99
+G P ++G DTGS+ W QC PC C+ Q PP+FDP +S TY ++S S C AV
Sbjct: 63 LGVPSTLVYGIADTGSELIWLQCLPCTH--CYNQTPPIFDPAESYTYETVSSDSPICNAV 120
Query: 100 VTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNL 157
+C EGD C Y YG G + + G L+T+ F + VE+ + FGC H
Sbjct: 121 RRISCREGDKSCCYQHTYGDG---TTTKGTLSTDVFAFEDPTRTIVEVGYLTFGCSHDTK 177
Query: 158 ASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL---PDQGS-SKINFG--GIVAG 211
A Q G++GL +SL+SQ+ KFSYC+ D GS S++ FG ++ G
Sbjct: 178 ARL--KGHQAGVVGLNRHPNSLVSQLKVK---KFSYCMVIPDDHGSGSRMYFGSRAVILG 232
Query: 212 AGVVSTPLIIRD--HYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLK 269
TPL+ D HY+++L+ IS
Sbjct: 233 G---KTPLLKGDYSHYFVTLKGIS------------------------------------ 253
Query: 270 SVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNLFRNISD 329
VG E G SD L P++T HF GAD L+ + +
Sbjct: 254 --------------VGEEKGRSDEL------ASAGPDITFHFYGADFILTKXTTYVEVEK 293
Query: 330 EIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVS---------FKPSRCTNY 374
+ C A N+ + G I Q N+ +GYD+E V+ F PS+ + Y
Sbjct: 294 GLWCLAMLSSNSTRKLSILGNIQQQNYHVGYDLEAQEVAQCFNQTPPIFDPSKSSTY 350
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 58/114 (50%), Gaps = 7/114 (6%)
Query: 71 CFKQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNCSEGDCSYSFLYGRGAYASFSSGN 127
CF Q PP+FDP KSSTY+++ + C + E DC Y YG G+ + + G
Sbjct: 334 CFNQTPPIFDPSKSSTYSTVPWDAPTCYQAGGYACHIDEEDCCYRISYGSGSTS--TEGT 391
Query: 128 LATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLIS 181
++ + F V++ +++FGC + + T + GI+GL + SL+S
Sbjct: 392 ISIDAFAFEDNRQNMVDVXHLVFGC--SDYTTGTFKGYEVGIVGLNQDSLSLVS 443
>gi|226492150|ref|NP_001146362.1| hypothetical protein precursor [Zea mays]
gi|219886805|gb|ACL53777.1| unknown [Zea mays]
gi|414878074|tpg|DAA55205.1| TPA: hypothetical protein ZEAMMB73_415404 [Zea mays]
Length = 440
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 115/387 (29%), Positives = 175/387 (45%), Gaps = 66/387 (17%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+ IG PP +DTGS+ WTQC C CF+Q P +DP +S ++ C+
Sbjct: 71 YIAEYLIGDPPQRAEAIIDTGSNLIWTQCSRC-RPTCFRQNLPYYDPSRSRAARAVGCND 129
Query: 95 SQCAVVTSNCSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
+ CA+ + D C+ YG G A G LATE LTF S E +++FG
Sbjct: 130 AACALGSETQCLSDNKTCAVVTGYGAGNIA----GTLATENLTFQS------ETVSLVFG 179
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAG 211
C SP S + +GIIGLG G SL SQ+G + +FSYCL I +V G
Sbjct: 180 CIVVTKLSPGSLNGASGIIGLGRGKLSLPSQLGDT---RFSYCLTPYFEDTIEPSHMVVG 236
Query: 212 --AGVV-----STPL----IIRD--------HYYLSLEAISVGNQRLEFVSSS------- 245
AG++ STP+ +R YYL L I+ G +L S++
Sbjct: 237 ASAGLINGSASSTPVTTVPFVRSPSDDPFSTFYYLPLTGITAGKVKLAVPSAAFDLRQVA 296
Query: 246 ----TGNIFVDTGV-LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS 300
TG F+D+G L +L+ + Y + L++ ++ + A V+ + GF LC +
Sbjct: 297 PGMWTGT-FIDSGAPLTSLVDVAYQA-LRAELARQLGAALVQPLAGTTGFD--LCVALKD 352
Query: 301 QPKF-PEVTIHF-----RGADVKLSPSNLFRNISDEIMCSAFRGG--------NANIVYG 346
+ P + +HF G D+ + P+N + + C N V G
Sbjct: 353 AERLVPPLVLHFGGGSGTGTDLVVPPANYWAPVDSATACMVVFSSVDRKSLPMNETTVIG 412
Query: 347 RIMQINFLIGYDIEQAMVSFKPSRCTN 373
MQ N + YD+ ++SF+P+ C++
Sbjct: 413 NYMQQNMHVLYDLAGGVLSFQPADCSS 439
>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
Length = 471
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 105/361 (29%), Positives = 162/361 (44%), Gaps = 39/361 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + L +GTPP +DTGS +W QC+PC + C Q PL+DP S TY +SC+S
Sbjct: 125 YYVKLGLGTPPKYYAMILDTGSSLSWLQCQPC-AVYCHAQADPLYDPSVSKTYKKLSCAS 183
Query: 95 SQC-----AVVTSNCSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
+C A + E D C Y+ YG SFS G L+ + LT S+ L P
Sbjct: 184 VECSRLKAATLNDPLCETDSNACLYTASYGD---TSFSIGYLSQDLLTLTSSQTL----P 236
Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG----SSK 202
+GCG N + GIIGL S+++Q+ T FSYCLP
Sbjct: 237 QFTYGCGQDNQG---LFGRAAGIIGLARDKLSMLAQLSTKYGHAFSYCLPTANSGSSGGG 293
Query: 203 INFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSSTG-NIFVDTGVLR 257
G ++ TP++ Y+L L AI+V + L+ ++ +D+G +
Sbjct: 294 FLSIGSISPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAMYRVPTLIDSGTVI 353
Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCY--NISSQPKFPEVTIHFR-G 313
T LP+ ++ L+ ++ + K P +S + C+ ++ S PE+ + F+ G
Sbjct: 354 TRLPMSMYAALRQAFVKIMSTKYAKA----PAYSILDTCFKGSLKSISAVPEIKMIFQGG 409
Query: 314 ADVKLSPSNLFRNISDEIMCSAFRGG---NANIVYGRIMQINFLIGYDIEQAMVSFKPSR 370
AD+ L ++ I C AF G N + G Q + I YD+ + + F P
Sbjct: 410 ADLTLRAPSILIEADKGITCLAFAGSSGTNQIAIIGNRQQQTYNIAYDVSTSRIGFAPGS 469
Query: 371 C 371
C
Sbjct: 470 C 470
>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 463
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 110/354 (31%), Positives = 154/354 (43%), Gaps = 40/354 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +G+P D+ DTGSD TW +C FDP KS++Y ++SCS+
Sbjct: 134 YIVSIGLGSPKKDLMLIFDTGSDLTWARCSAAET----------FDPTKSTSYANVSCST 183
Query: 95 SQCAVVT------SNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
C+ V S C+ C Y YG G+Y S G L E LT ST N
Sbjct: 184 PLCSSVISATGNPSRCAASTCVYGIQYGDGSY---SIGFLGKERLTIGSTD----IFNNF 236
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGI 208
FGCG K G++GLG S++SQ FSYCLP S+ G
Sbjct: 237 YFGCGQD---VDGLFGKAAGLLGLGRDKLSVVSQTAPKYNQLFSYCLPSSSSTGFLSFGS 293
Query: 209 VAGAGVVSTPLII--RDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEY 264
TPL Y L L I+VG Q+L S ST +D+G + T LP
Sbjct: 294 SQSKSAKFTPLSSGPSSFYNLDLTGITVGGQKLAIPLSVFSTAGTIIDSGTVVTRLPPAA 353
Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFRGA-DVKLSPS 321
+S L+S + + P +G D CY+ S K P++ I F G DV + +
Sbjct: 354 YSALRSAFRKAMASYP---MGKPLSILDT-CYDFSKYKTIKVPKIVISFSGGVDVDVDQA 409
Query: 322 NLFRNISDEIMCSAFRGGNA---NIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
+F + +C AF G ++G Q NF + YD+ V F P+ C+
Sbjct: 410 GIFVANGLKQVCLAFAGNTGARDTAIFGNTQQRNFEVVYDVSGGKVGFAPASCS 463
>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 108/358 (30%), Positives = 168/358 (46%), Gaps = 43/358 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + IG PP + +DTGSD W QC PC DC++Q P+F+P S++++++SC++
Sbjct: 149 YFSRVGIGKPPSQAYLILDTGSDVNWVQCAPCA--DCYQQADPIFEPASSASFSTLSCNT 206
Query: 95 SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
QC ++ S C C Y YG G+Y + G+ TET+T S PV+ NV GCG
Sbjct: 207 RQCRSLDVSECRNDTCLYEVSYGDGSY---TVGDFVTETITLGSA---PVD--NVAIGCG 258
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFGGIVA 210
H N + G+ G S I+ A FSYCL D+ S S + F +
Sbjct: 259 HNNEGLFVGAAGLLGLGGGSLSFPSQIN------ATSFSYCLVDRDSESASTLEFNSTLP 312
Query: 211 GAGVVSTPLIIRDH----YYLSLEAISVGNQ-------RLEFVSSSTGNIFVDTGVLRTL 259
VS PL+ H YY+ L +SVG + + S G + VD+G T
Sbjct: 313 -PNAVSAPLLRNHHLDTFYYVGLTGLSVGGELVSIPESAFQIDESGNGGVIVDSGTAITR 371
Query: 260 LPLEYHSNLKSVMSNMIKAQP-VKGVGAEPGFSDVLCYNISSQ--PKFPEVTIHF-RGAD 315
L + +++L+ + P G+ D CY++SS+ + P V+ HF G +
Sbjct: 372 LQTDVYNSLRDAFVKRTRDLPSTNGI----ALFDT-CYDLSSKGNVEVPTVSFHFPDGKE 426
Query: 316 VKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ L N + E C AF +++ + G + Q + YD+ +V F P++C
Sbjct: 427 LPLPAKNYLVPLDSEGTFCFAFAPTASSLSIIGNVQQQGTRVVYDLVNHLVGFVPNKC 484
>gi|125558632|gb|EAZ04168.1| hypothetical protein OsI_26310 [Oryza sativa Indica Group]
gi|125600539|gb|EAZ40115.1| hypothetical protein OsJ_24558 [Oryza sativa Japonica Group]
Length = 453
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 123/378 (32%), Positives = 179/378 (47%), Gaps = 56/378 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+M L+IGTPP DTGSD WTQC PC E CFKQ PL++P S T+ + CSS
Sbjct: 92 YIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGE-RCFKQPSPLYNPSSSPTFRVLPCSS 150
Query: 95 S--QCAVVTS--------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
+ CA C+ C Y+ YG G ++SG +ET TF S+ V
Sbjct: 151 ALNLCAAEARLAGATPPPGCA---CRYNQTYGTG----WTSGLQGSETFTFGSSPADQVR 203
Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-PDQGS--- 200
+P + FGC + AS + G++GLG G SL+SQ+ AG FSYCL P Q +
Sbjct: 204 VPGIAFGCSN---ASSDDWNGSAGLVGLGRGGLSLVSQLA---AGMFSYCLTPFQDTKSK 257
Query: 201 SKINFG-----GIVAGAGVVSTPLI-------IRDHYYLSLEAISVGNQRL-------EF 241
S + G + G GV STP + + +YYL+L ISVG L
Sbjct: 258 STLLLGPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGAAALPIPPGAFAL 317
Query: 242 VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI--S 299
+ TG + +D+G T L + +++ + +++K G A G LC+ + S
Sbjct: 318 RADGTGGLIIDSGTTITSLVDAAYKRVRAAVRSLVKLPVTDGSNAT-GLD--LCFALPSS 374
Query: 300 SQP--KFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIV--YGRIMQINFLI 355
S P P +T+HF G + P + + + C A R + G Q N I
Sbjct: 375 SAPPATLPSMTLHFGGGADMVLPVENYMILDGGMWCLAMRSQTDGELSTLGNYQQQNLHI 434
Query: 356 GYDIEQAMVSFKPSRCTN 373
YD+++ +SF P++C+
Sbjct: 435 LYDVQKETLSFAPAKCST 452
>gi|414878073|tpg|DAA55204.1| TPA: hypothetical protein ZEAMMB73_344109 [Zea mays]
Length = 440
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 115/391 (29%), Positives = 168/391 (42%), Gaps = 56/391 (14%)
Query: 26 AEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSS 85
A + + Y+ IG PP +DTGS+ WTQC C CF Q +DP +S
Sbjct: 62 APVHWAESQYIAEYLIGDPPQQAEAIIDTGSNLIWTQCSTCQPAGCFSQNLSFYDPSRSR 121
Query: 86 TYNSISCSSSQCAVVT-SNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
T ++C+ + CA+ + + C+ + C+ YG G G L TE TF S
Sbjct: 122 TARPVACNDTACALGSETRCARDNKACAVLTAYGAGVIG----GVLGTEAFTFQPQS--- 174
Query: 143 VEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK 202
E ++ FGC +P S +GIIGLG GN SL+SQ+G + KFSYCL S
Sbjct: 175 -ENVSLAFGCIAATRLTPGSLDGASGIIGLGRGNLSLVSQLGDN---KFSYCLTPYFSQS 230
Query: 203 IN----FGGIVA-----GAGVVSTPLI-------IRDHYYLSLEAISVGNQRL------- 239
N F G A GA S P + YYL L I+VG+ +L
Sbjct: 231 TNTSRLFVGASAGLSSGGAPATSVPFLKNPDVDPFSTFYYLPLTGITVGDAKLAVPEAAF 290
Query: 240 EFVSSSTG---NIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY 296
+ +TG +D+G T L + L+ + + A V G LC
Sbjct: 291 DLRQVATGLWAGTLIDSGSPFTSLVDVAYQALRDELVQQLGASIVPPPAGAEGLD--LCA 348
Query: 297 NISS---QPKFPEVTIHF--RGADVKLSPSNLFRNISDEIMCS-AFRGG--------NAN 342
++ P + +HF G DV + P N + + D C F G N
Sbjct: 349 AVAHGDVGKLVPPLVLHFGSGGGDVAVPPENYWGPVDDSTACMVVFSSGGPNSTLPMNET 408
Query: 343 IVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
+ G MQ + + YD+E+ M+SF+P+ C++
Sbjct: 409 TIIGNYMQQDMHLLYDLEKGMLSFQPADCSS 439
>gi|413944596|gb|AFW77245.1| hypothetical protein ZEAMMB73_545774 [Zea mays]
gi|414876929|tpg|DAA54060.1| TPA: hypothetical protein ZEAMMB73_875469 [Zea mays]
Length = 459
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 114/380 (30%), Positives = 181/380 (47%), Gaps = 38/380 (10%)
Query: 11 NDNETPKSPISIIYQAEIISVDD---IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCP 67
+DN T + P+ + DD Y M S+GTPP + DTGSD W +C
Sbjct: 73 SDNNTQRIPLRM---------DDSGGAYDMEFSMGTPPQKLTALADTGSDLIWAKCGGAC 123
Query: 68 ELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSN------CSEGDCSYSFLYGRGAY- 120
C Q P + P SST+ + CS C+++ S+ + +C Y + YG G
Sbjct: 124 TTSCEPQGSPSYLPNASSTFAKLPCSDRLCSLLRSDSVAWCAAAGAECDYRYSYGLGDDD 183
Query: 121 ASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLI 180
++ G LA ET T + + +P+V FGC AS +G++GLG G SL+
Sbjct: 184 HHYTQGFLARETFTLGADA-----VPSVRFGC---TTASEGGYGSGSGLVGLGRGPLSLV 235
Query: 181 SQMGTSIAGKFSYCLPDQGS--SKINFGGI--VAGAGVVSTPLIIRDHYY-LSLEAISVG 235
SQ+ A F YCL S S + FG + + GA V ST L+ +Y ++L +IS+G
Sbjct: 236 SQLN---ASTFMYCLTSDASKASPLLFGSLASLTGAQVQSTGLLASTTFYAVNLRSISIG 292
Query: 236 NQRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGV-GAEPGFSDVL 294
+ V G +F D+G T L +S K+ + V+ G E F
Sbjct: 293 SATTPGVGEPEGVVF-DSGTTLTYLAEPAYSEAKAAFLSQTSLDQVEDTDGFEACFQKPA 351
Query: 295 CYNISSQPKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFL 354
+S+ P + +HF GAD+ L +N + D ++C + + + G IMQ+N+L
Sbjct: 352 NGRLSNA-AVPTMVLHFDGADMALPVANYVVEVEDGVVCWIVQRSPSLSIIGNIMQVNYL 410
Query: 355 IGYDIEQAMVSFKPSRCTNY 374
+ +D+ ++++SF+P+ C Y
Sbjct: 411 VLHDVHRSVLSFQPANCDTY 430
>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
Length = 493
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 111/372 (29%), Positives = 163/372 (43%), Gaps = 53/372 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +GTP +DTGSD W QC PC C+ Q P+FDP++SS+Y ++ C++
Sbjct: 140 YFTKIGVGTPSTPALMVLDTGSDVVWLQCAPCRR--CYDQSGPVFDPRRSSSYGAVDCAA 197
Query: 95 SQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
C + S + C Y YG G S ++G+ ATETLTF + + V G
Sbjct: 198 PLCRRLDSGGCDLRRRACLYQVAYGDG---SVTAGDFATETLTFAGGA----RVARVALG 250
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ------------G 199
CGH N + + G+ G+ S +Q+ FSYCL D+
Sbjct: 251 CGHDNEGLFVAAAGLLGLG---RGSLSFPTQISRRYGKSFSYCLVDRTSSSSSGAASRSR 307
Query: 200 SSKINFGGIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS---------T 246
SS + FG A A TP++ + YY+ L ISVG R+ V+ S
Sbjct: 308 SSTVTFGPPSASAASF-TPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGR 366
Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQP-- 302
G + VD+G T L +S L+ G+ PG + CY++ +
Sbjct: 367 GGVIVDSGTSVTRLARPSYSALRDAFRAA-----AAGLRLSPGGFSLFDTCYDLGGRKVV 421
Query: 303 KFPEVTIHFR-GADVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDI 359
K P V++HF GA+ L P N + S C AF G + + + G I Q F + +D
Sbjct: 422 KVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDG 481
Query: 360 EQAMVSFKPSRC 371
+ V F P C
Sbjct: 482 DGQRVGFAPKGC 493
>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
Length = 511
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 119/386 (30%), Positives = 179/386 (46%), Gaps = 61/386 (15%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + L +GTP V++ +DTGSD +W QC PC DC P F+P+ SS++ + C+S
Sbjct: 139 YYVPLQVGTPAVEVVLIMDTGSDVSWIQCVPCK--DCVPALRPPFNPRHSSSFFKLPCAS 196
Query: 95 SQCAVVTSN----CSEG--DCSYSFLYGRGAYASFSSGNLATETLTFNSTS---GLPVEM 145
S C V CS C +S YG G S SSG LA ET+ N+ + G PV++
Sbjct: 197 STCTNVYQGVKPFCSPSGRTCLFSIQYGDG---SLSSGLLAMETIAGNTPNFGDGEPVKL 253
Query: 146 PNVIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKIN 204
N+ GC + PT S G++G+ S SQ+ + A KFS+C PD+ + +N
Sbjct: 254 SNITLGCADIDREGLPTGAS---GLLGMDRRPISFPSQLSSRYARKFSHCFPDK-IAHLN 309
Query: 205 FGGIV--AGAGVVS-----TPLIIR--------DHYYLSLEAISVGNQRLEF-------- 241
G+V + ++S TPL+ D+YY+ L ISV RL
Sbjct: 310 SSGLVFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKNFDID 369
Query: 242 -VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS 300
V+ S G I +D+G T L ++ + + + V GF+ CYNI+S
Sbjct: 370 KVTGSGGTI-IDSGTAFTYLKKPAFQAMRREF--LARTSHLAKVDDNSGFTP--CYNITS 424
Query: 301 QPK------FPEVTIHFRGA-DVKLSPSNLFRNIS----DEIMCSAF--RGGNANIVYGR 347
P +T+HFRG DV L +++ +S +C AF G + G
Sbjct: 425 GTAALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFLMSGDIPFNIIGN 484
Query: 348 IMQINFLIGYDIEQAMVSFKPSRCTN 373
Q N + YD+E+ + P++C
Sbjct: 485 YQQQNLWVEYDLEKLRLGIAPAQCAT 510
>gi|22831049|dbj|BAC15912.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508281|dbj|BAD32130.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 453
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 115/375 (30%), Positives = 170/375 (45%), Gaps = 50/375 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+M L+IGTPP DTGSD WTQC PC E CFKQ PL++P S T+ + CSS
Sbjct: 92 YIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGE-RCFKQPSPLYNPSSSPTFRVLPCSS 150
Query: 95 ------SQCAVVTSNCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
++ + + G C Y+ YG G ++SG +ET TF S+ V +P
Sbjct: 151 ALNLCAAEARLAGATPPPGCACRYNQTYGTG----WTSGLQGSETFTFGSSPADQVRVPG 206
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-PDQGS---SKI 203
+ FGC + S G GL +S + AG FSYCL P Q + S +
Sbjct: 207 IAFGCSNA------SSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCLTPFQDTKSKSTL 260
Query: 204 NFG-----GIVAGAGVVSTPLI-------IRDHYYLSLEAISVGNQRL-------EFVSS 244
G + G GV STP + + +YYL+L ISVG L +
Sbjct: 261 LLGPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRAD 320
Query: 245 STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI--SSQP 302
TG + +D+G T L + +++ + +++K G A G LC+ + SS P
Sbjct: 321 GTGGLIIDSGTTITSLVDAAYKRVRAAVRSLVKLPVTDGSNAT-GLD--LCFALPSSSAP 377
Query: 303 --KFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIV--YGRIMQINFLIGYD 358
P +T+HF G + P + + + C A R + G Q N I YD
Sbjct: 378 PATLPSMTLHFGGGADMVLPVENYMILDGGMWCLAMRSQTDGELSTLGNYQQQNLHILYD 437
Query: 359 IEQAMVSFKPSRCTN 373
+++ +SF P++C+
Sbjct: 438 VQKETLSFAPAKCST 452
>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
Length = 469
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 115/362 (31%), Positives = 167/362 (46%), Gaps = 48/362 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ L GTP V +DTGSD +W QC PC C+ Q+ PLFDP KSSTY I+C++
Sbjct: 131 YVVTLGFGTPSVPQVLLMDTGSDVSWVQCTPCNSTKCYPQKDPLFDPSKSSTYAPIACNT 190
Query: 95 SQCAVVTSN----CSEG--DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
C + + C+ G C YS Y G S S G + ETLT G+ VE +
Sbjct: 191 DACRKLGDHYHNGCTSGGTQCGYSVEYADG---SHSRGVYSNETLTL--APGITVE--DF 243
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGI 208
FGCG ++ P+ K G++GLG SL+ Q + G FSYCLP S G +
Sbjct: 244 HFGCG-RDQRGPS--DKYDGLLGLGGAPVSLVVQTSSVYGGAFSYCLPALNSEA---GFL 297
Query: 209 VAG-------AGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVL 256
V G + V TP+ Y +++ ISVG + L S+ G + +D+G +
Sbjct: 298 VLGSPPSGNKSAFVFTPMRHLPGYATFYMVTMTGISVGGKPLHIPQSAFRGGMIIDSGTV 357
Query: 257 RTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFR-G 313
T LP ++ L++ + +KA P+ P CYN + S P V F G
Sbjct: 358 DTELPETAYNALEAALRKALKAYPLV-----PSDDFDTCYNFTGYSNITVPRVAFTFSGG 412
Query: 314 ADVKLS-PSNLFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPS 369
A + L P+ + N C AF+ + + G + Q + YD + V F+
Sbjct: 413 ATIDLDVPNGILVND-----CLAFQESGPDDGLGIIGNVNQRTLEVLYDAGRGNVGFRAG 467
Query: 370 RC 371
C
Sbjct: 468 AC 469
>gi|115472519|ref|NP_001059858.1| Os07g0533800 [Oryza sativa Japonica Group]
gi|113611394|dbj|BAF21772.1| Os07g0533800 [Oryza sativa Japonica Group]
Length = 458
Score = 131 bits (330), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 115/375 (30%), Positives = 170/375 (45%), Gaps = 50/375 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+M L+IGTPP DTGSD WTQC PC E CFKQ PL++P S T+ + CSS
Sbjct: 97 YIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGE-RCFKQPSPLYNPSSSPTFRVLPCSS 155
Query: 95 ------SQCAVVTSNCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
++ + + G C Y+ YG G ++SG +ET TF S+ V +P
Sbjct: 156 ALNLCAAEARLAGATPPPGCACRYNQTYGTG----WTSGLQGSETFTFGSSPADQVRVPG 211
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-PDQGS---SKI 203
+ FGC + S G GL +S + AG FSYCL P Q + S +
Sbjct: 212 IAFGCSNA------SSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCLTPFQDTKSKSTL 265
Query: 204 NFG-----GIVAGAGVVSTPLI-------IRDHYYLSLEAISVGNQRL-------EFVSS 244
G + G GV STP + + +YYL+L ISVG L +
Sbjct: 266 LLGPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRAD 325
Query: 245 STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI--SSQP 302
TG + +D+G T L + +++ + +++K G A G LC+ + SS P
Sbjct: 326 GTGGLIIDSGTTITSLVDAAYKRVRAAVRSLVKLPVTDGSNAT-GLD--LCFALPSSSAP 382
Query: 303 --KFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIV--YGRIMQINFLIGYD 358
P +T+HF G + P + + + C A R + G Q N I YD
Sbjct: 383 PATLPSMTLHFGGGADMVLPVENYMILDGGMWCLAMRSQTDGELSTLGNYQQQNLHILYD 442
Query: 359 IEQAMVSFKPSRCTN 373
+++ +SF P++C+
Sbjct: 443 VQKETLSFAPAKCST 457
>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 458
Score = 131 bits (330), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 111/374 (29%), Positives = 174/374 (46%), Gaps = 52/374 (13%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCS 93
++L++ S+G PPV +DTGS W QC+PC P+F+P SST+ SC
Sbjct: 95 LFLVNFSVGQPPVPQLTIMDTGSSLLWIQCQPCKHCSSDHMIHPVFNPALSSTFVECSCD 154
Query: 94 SSQCAVV-TSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
C +C S C Y +Y G + S G LA E LTF + +G V + FG
Sbjct: 155 DRFCRYAPNGHCGSSNKCVYEQVYISG---TGSKGVLAKERLTFTTPNGNTVVTQPIAFG 211
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAG 211
CG++N +S TGI+GLG +SL Q+G+ KFSYC+ D + + +V G
Sbjct: 212 CGYEN--GEQLESHFTGILGLGAKPTSLAVQLGS----KFSYCIGDLANKNYGYNQLVLG 265
Query: 212 --AGVVSTPLIIRDH-----YYLSLEAISVGNQRLEF---VSSSTG---NIFVDTGVLRT 258
A ++ P I YY++LE ISVG+ +L V G + +D+G L T
Sbjct: 266 EDADILGDPTPIEFETENSIYYMNLEGISVGDTQLNIEPVVFKRRGPRTGVILDSGTLYT 325
Query: 259 LLP----LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK---FPEVTIHF 311
L E ++ +KS++ ++ F D LCY+ + FP VT HF
Sbjct: 326 WLADIAYRELYNEIKSILDPKLE---------RFWFRDFLCYHGRVSEELIGFPVVTFHF 376
Query: 312 R-GADVKLSPSNLFRNISD----EIMCSAFR-----GGNAN--IVYGRIMQINFLIGYDI 359
GA++ + +++F +S+ + C + + GG G + Q + IGYD+
Sbjct: 377 AGGAELAMEATSMFYPLSEPNTFNVFCMSVKPTKEHGGEYKEFTAIGLMAQQYYNIGYDL 436
Query: 360 EQAMVSFKPSRCTN 373
++ + + C
Sbjct: 437 KEKNIYLQRIDCVQ 450
>gi|224091849|ref|XP_002309371.1| predicted protein [Populus trichocarpa]
gi|222855347|gb|EEE92894.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 131 bits (330), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 114/359 (31%), Positives = 168/359 (46%), Gaps = 47/359 (13%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQ-EPPLFDPKKSSTYNSISC 92
++L++ S+G PPV +DTGS W QC PC C +Q P+FDP SSTY+S+SC
Sbjct: 101 LFLVNFSMGQPPVPQLAIMDTGSSLLWIQCAPCKS--CSQQIIGPMFDPSISSTYDSLSC 158
Query: 93 SSSQCAVVTS-NC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
+ C S C S C Y+ Y G S G +ATE L F S+ + NV+F
Sbjct: 159 KNIICRYAPSGECDSSSQCVYNQTYVEGLP---SVGVIATEQLIFGSSDEGRNAVNNVLF 215
Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVA 210
GC H+N D + TG+ GLG G +S+++QMG+ KFSYC+ + ++ +V
Sbjct: 216 GCSHRN--GNYKDRRFTGVFGLGSGITSVVNQMGS----KFSYCIGNIADPDYSYNQLVL 269
Query: 211 GAGV----VSTPLIIRD-HYYLSLEAISVGNQRLEFVSSS------TGNIFVDTGVLRTL 259
GV STPL + D HY + LE ISVG RL S+ + +D+G T
Sbjct: 270 SEGVNMEGYSTPLDVVDGHYQVILEGISVGETRLVIDPSAFKRTEKQRRVIIDSGTAPTW 329
Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSD-VLCYNISSQPK---FPEVTIHF-RGA 314
L + L+ + N++ P + LCY FP VT HF GA
Sbjct: 330 LAENEYRALEREVRNLLDR------FLTPFMRESFLCYKGKVGQDLVGFPAVTFHFAEGA 383
Query: 315 DVKLSPSNLFRNISDEIMCSAFRGGNAN--IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
D+ + E+ ++ G + V G + Q + + YD+ + + F+ C
Sbjct: 384 DLV---------VDTEMRQASVYGKDFKDFSVIGLMAQQYYNVAYDLNKHKLFFQRIDC 433
>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 489
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 110/369 (29%), Positives = 179/369 (48%), Gaps = 53/369 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +G + + VDTGSD TW QC+PC C+ Q+ PL+DP SS+Y ++ C+S
Sbjct: 138 YIVTVELGGKNMSLI--VDTGSDLTWVQCQPCR--SCYNQQGPLYDPSVSSSYKTVFCNS 193
Query: 95 SQCA-VVTSNCSEG-----------DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
S C +V + + G C Y YG G+Y + G+LA+E++ T
Sbjct: 194 STCQDLVAATGNSGPCGGFNGVVKTTCEYVVSYGDGSY---TRGDLASESIVLGDT---- 246
Query: 143 VEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQG 199
++ N++FGCG N S G++GLG + SL+SQ + G FSYCLP D
Sbjct: 247 -KLENLVFGCGRNNKGLFGGAS---GLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGA 302
Query: 200 SSKINFGGIVA----GAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSSTGNIFV 251
S ++FG + V TPL+ +R Y L+L S+G L+ +S G I +
Sbjct: 303 SGTLSFGNDFSVYKNSTSVFYTPLVQNPQLRSFYILNLTGASIGGVELKTLSFGRG-ILI 361
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP--KFPEVT 308
D+G + T LP + K+V + +K G + PG+S + C+N++S P +
Sbjct: 362 DSGTVITRLPPSIY---KAVKTEFLKQ--FSGFPSAPGYSILDTCFNLTSYEDISIPTIK 416
Query: 309 IHFRG-ADVKLSPSNLFRNISDE--IMCSAFRG---GNANIVYGRIMQINFLIGYDIEQA 362
+ F G A++++ + +F + + ++C A N + G Q N + YD Q
Sbjct: 417 MIFEGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQE 476
Query: 363 MVSFKPSRC 371
+ C
Sbjct: 477 RLGIAGENC 485
>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
thaliana]
gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 464
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 111/354 (31%), Positives = 160/354 (45%), Gaps = 38/354 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + IGTP D+ DTGSD TWTQCEPC C+ Q+ P F+P SSTY ++SCSS
Sbjct: 132 YIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLG-SCYSQKEPKFNPSSSSTYQNVSCSS 190
Query: 95 SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
C S CS +C YS +YG SF+ G LA E T ++ L +V FGCG
Sbjct: 191 PMCEDAES-CSASNCVYSIVYGD---KSFTQGFLAKEKFTLTNSDVL----EDVYFGCGE 242
Query: 155 KN---LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSKINFGGI 208
N P +Q T+ FSYCLP + + FG
Sbjct: 243 NNQGLFDGVAGLLGLGPGKLSLP------AQTTTTYNNIFSYCLPSFTSNSTGHLTFGSA 296
Query: 209 VAGAGVVSTPLI---IRDHYYLSLEAISVGNQRLEFV--SSSTGNIFVDTGVLRTLLPLE 263
V TP+ +Y + + ISVG++ L S ST +D+G + T LP +
Sbjct: 297 GISESVKFTPISSFPSAFNYGIDIIGISVGDKELAITPNSFSTEGAIIDSGTVFTRLPTK 356
Query: 264 YHSNLKSVM-SNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFRGAD-VKLS 319
++ L+SV M + G G CY+ + +P + F G+ V+L
Sbjct: 357 VYAELRSVFKEKMSSYKSTSGYGLFD-----TCYDFTGLDTVTYPTIAFSFAGSTVVELD 411
Query: 320 PSNLFRNISDEIMCSAFRGGNANI--VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
S + I +C AF GN ++ ++G + Q + YD+ V F P+ C
Sbjct: 412 GSGISLPIKISQVCLAF-AGNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464
>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 492
Score = 131 bits (329), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 106/358 (29%), Positives = 163/358 (45%), Gaps = 43/358 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +G P + +DTGSD W QC+PC DC++Q P+FDP SS+YN ++C +
Sbjct: 157 YFSRVGVGQPSKPFYMVLDTGSDVNWLQCKPCS--DCYQQSDPIFDPTASSSYNPLTCDA 214
Query: 95 SQCA-VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
QC + S C G C Y YG G SF+ G TET++F + S + V GCG
Sbjct: 215 QQCQDLEMSACRNGKCLYQVSYGDG---SFTVGEYVTETVSFGAGS-----VNRVAIGCG 266
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---INFGGIVA 210
H N + G+ G +S I A FSYCL D+ S K + F
Sbjct: 267 HDNEGLFVGSAGLLGLGGGPLSLTSQIK------ATSFSYCLVDRDSGKSSTLEFNSPRP 320
Query: 211 GAGVVSTPLI----IRDHYYLSLEAISVGN-------QRLEFVSSSTGNIFVDTGVLRTL 259
G VV+ PL+ + YY+ L +SVG + S G + VD+G T
Sbjct: 321 GDSVVA-PLLKNQKVNTFYYVELTGVSVGGEIVTVPPETFAVDQSGAGGVIVDSGTAITR 379
Query: 260 LPLEYHSNLKSVMSNMI-KAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHFRGADV 316
L + +++++ +P +GV D CY++SS + P V+ HF G
Sbjct: 380 LRTQAYNSVRDAFKRKTSNLRPAEGV----ALFDT-CYDLSSLQSVRVPTVSFHFSGDRA 434
Query: 317 KLSPSNLFRNISD--EIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
P+ + D C AF +++ + G + Q + +D+ ++V F P++C
Sbjct: 435 WALPAKNYLIPVDGAGTYCFAFAPTTSSMSIIGNVQQQGTRVSFDLANSLVGFSPNKC 492
>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 469
Score = 131 bits (329), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 108/356 (30%), Positives = 165/356 (46%), Gaps = 35/356 (9%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+ L +GTP VDTGS TW QC PC + C +Q P+FDP+ S TY ++ CSS
Sbjct: 131 YVTRLGLGTPATSYVMVVDTGSSLTWLQCSPC-SVSCHRQAGPVFDPRASGTYAAVQCSS 189
Query: 95 SQCAVVT------SNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
S+C + S CS + C Y YG +S+S G L+ +T++F S S P
Sbjct: 190 SECGELQAATLNPSACSVSNVCIYQASYGD---SSYSVGYLSKDTVSFGSGS-----FPG 241
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INF 205
+GCG N + G+IGL SL+ Q+ S+ FSYCLP ++ ++
Sbjct: 242 FYYGCGQDNEGL---FGRSAGLIGLAKNKLSLLYQLAPSLGYAFSYCLPTSSAAAGYLSI 298
Query: 206 GGIVAGAGVVSTPL----IIRDHYYLSLEAISVGNQRLEFVSSSTGNI--FVDTGVLRTL 259
G G TP+ + Y+++L ISV L S ++ +D+G + T
Sbjct: 299 GSYNPGQ-YSYTPMASSSLDASLYFVTLSGISVAGAPLAVPPSEYRSLPTIIDSGTVITR 357
Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP-KFPEVTIHFRG-ADV 316
LP N+ + +S + A P +S + C+ S+ + P V + F G A +
Sbjct: 358 LP----PNVYTALSRAVAAAMASAAPRAPTYSILDTCFRGSAAGLRVPRVDMAFAGGATL 413
Query: 317 KLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
LSP N+ ++ D C AF + G Q F + YD+ Q+ + F C+
Sbjct: 414 ALSPGNVLIDVDDSTTCLAFAPTGGTAIIGNTQQQTFSVVYDVAQSRIGFAAGGCS 469
>gi|242063796|ref|XP_002453187.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
gi|241933018|gb|EES06163.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
Length = 493
Score = 131 bits (329), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 125/376 (33%), Positives = 181/376 (48%), Gaps = 50/376 (13%)
Query: 30 SVDDI-YLMHLSIGTPP-VDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTY 87
S+D + Y++ + +G+PP +DTGSD +W +C+PC + C Q PLFDP SSTY
Sbjct: 134 SLDTLEYVITVRLGSPPGKSQTMLIDTGSDISWVRCKPCWQ-QCRPQVDPLFDPSLSSTY 192
Query: 88 NSISCSSSQCAVV-----TSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGL 141
+ SCSS+ CA + + C S G C Y +YG G+ ++G +++TL S S
Sbjct: 193 SPFSCSSAACAQLFQEGNANGCSSSGQCQYIAMYGDGSVG--TTGTYSSDTLALGSNSNT 250
Query: 142 PVEMPNVIFGCGHKNLASPTSDSKQTGII-------GLGPGNSSLISQ-MGTSIAGKFSY 193
V + FGC H +TGI GLG G SL+SQ GT FSY
Sbjct: 251 -VVVSKFRFGCSH----------AETGITGLTAGLMGLGGGAQSLVSQTAGTFGTTAFSY 299
Query: 194 CLPDQGSSK--INFGGI-VAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS- 245
CLP SS + G + AG V TP++ + Y + LEAI VG ++L ++
Sbjct: 300 CLPPTPSSSGFLTLGAAGTSSAGFVKTPMLRSSQVPAFYGVRLEAIRVGGRQLSIPTTVF 359
Query: 246 TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--K 303
+ + +D+G + T LP +S+L S +K P A GF D C+++S Q
Sbjct: 360 SAGMIMDSGTVVTRLPPTAYSSLSSAFKAGMKQYPPAPSSAGGGFLDT-CFDMSGQSSVS 418
Query: 304 FPEVTIHFRGAD---VKLSPSNLFRNI-SDEIMCSAFRG----GNANIVYGRIMQINFLI 355
P V + F GA V L S + + + I C AF G+ I+ G + Q F +
Sbjct: 419 MPTVALVFSGAGGAVVNLDASGILLQMETSSIFCLAFVATSDDGSTGII-GNVQQRTFQV 477
Query: 356 GYDIEQAMVSFKPSRC 371
YD+ V FK C
Sbjct: 478 LYDVAGGAVGFKAGAC 493
>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 546
Score = 131 bits (329), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 113/382 (29%), Positives = 170/382 (44%), Gaps = 60/382 (15%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + + +GTPP +DTGSD W QC PC E CF+Q P +DP +SS+Y +I C
Sbjct: 181 YFIDVFVGTPPKHFSLILDTGSDLNWIQCVPCYE--CFEQNGPHYDPGQSSSYRNIGCHD 238
Query: 95 SQCAVVTS-------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNST--SGLP--V 143
S+C +V+S C Y + YG +S ++G+ A ET T N T SG P
Sbjct: 239 SRCHLVSSPDPPQPCKAENQTCPYYYWYGD---SSNTTGDFALETFTVNLTMSSGKPELR 295
Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSS-- 201
+ NV+FGCGH N + G+ S SQ+ + FSYCL D+ S
Sbjct: 296 RVENVMFGCGHWNRGLFHGAAGLLGLGRG---PLSFSSQLQSLYGHSFSYCLVDRNSDAN 352
Query: 202 -----------------KINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVG-------NQ 237
++NF +VAG P + YY+ +++I VG +
Sbjct: 353 VSSKLIFGEDKDLLSHPELNFTTLVAGK---ENP--VDTFYYVQIKSIVVGGEVVNIPEE 407
Query: 238 RLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYN 297
+ + + +G +D+G + + +K +K PV V P CYN
Sbjct: 408 KWQIATDGSGGTIIDSGTTLSYFAEPAYQVIKEAFMAKVKGYPV--VKDFPVLEP--CYN 463
Query: 298 IS--SQPKFPEVTIHFR-GADVKLSPSNLFRNIS-DEIMCSAFRGG--NANIVYGRIMQI 351
++ QP P+ I F GA N F I E++C A G +A + G Q
Sbjct: 464 VTGVEQPDLPDFGIVFSDGAVWNFPVENYFIEIEPREVVCLAILGTPPSALSIIGNYQQQ 523
Query: 352 NFLIGYDIEQAMVSFKPSRCTN 373
NF I YD +++ + F P++C +
Sbjct: 524 NFHILYDTKKSRLGFAPTKCAD 545
>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 504
Score = 131 bits (329), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 114/364 (31%), Positives = 169/364 (46%), Gaps = 53/364 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +G+P ++ +DTGSD TW QC+PC DC++Q P+FDP S++Y S++C +
Sbjct: 167 YFSRVGVGSPARQLYMVLDTGSDVTWVQCQPC--ADCYQQSDPVFDPSLSTSYASVACDN 224
Query: 95 SQC----AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
+C A N S G C Y YG G+Y + G+ ATETLT ++ PV +V
Sbjct: 225 PRCHDLDAAACRN-STGACLYEVAYGDGSY---TVGDFATETLTLGDSA--PVS--SVAI 276
Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ---GSSKINFGG 207
GCGH N + + G S IS A FSYCL D+ SS + FG
Sbjct: 277 GCGHDNEGLFVGAAGLLALGGGPLSFPSQIS------ATTFSYCLVDRDSPSSSTLQFGD 330
Query: 208 IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLE-----FVSSST--GNIFVDTGVL 256
A V+ PLI YY+ L +SVG Q L F ST G + VD+G
Sbjct: 331 --AADAEVTAPLIRSPRTSTFYYVGLSGLSVGGQILSIPPSAFAMDSTGAGGVIVDSGTA 388
Query: 257 RTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNISSQP--KFPEVTIH 310
T L ++ L+ V+G + P S V CY++S + + P V++
Sbjct: 389 VTRLQSSAYAALRDAF--------VRGTQSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLR 440
Query: 311 FR-GADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFK 367
F G +++L N + C AF NA + + G + Q + +D ++ V F
Sbjct: 441 FAGGGELRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKSTVGFT 500
Query: 368 PSRC 371
++C
Sbjct: 501 TNKC 504
>gi|297741705|emb|CBI32837.3| unnamed protein product [Vitis vinifera]
Length = 455
Score = 131 bits (329), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 108/362 (29%), Positives = 178/362 (49%), Gaps = 34/362 (9%)
Query: 28 IISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTY 87
+I +L +LSIG PP +++ +DTGSD W QCEPC C+KQ+ P+++ KS +Y
Sbjct: 99 LIRDKSAFLANLSIGNPPTNVYVVLDTGSDLFWIQCEPCDV--CYKQKDPIYNRTKSDSY 156
Query: 88 NSISCSSSQCAVV--TSNCSE-GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
+ C+ C + CS+ G C Y Y G S +SG L+ E + F S +
Sbjct: 157 TEMLCNEPPCLSLGREGQCSDSGSCLYQTSYADG---SRTSGLLSYEKVAFTSHYSDEDK 213
Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGT--SIAGKFSYCLPDQGSSK 202
V FGCG +NL TS S+ G++GLGPG SL+SQ+ ++ F+YC + S
Sbjct: 214 TAQVGFGCGLQNLNFVTS-SRDGGVLGLGPGLVSLVSQLSAIGKVSKSFAYCFGNL--SN 270
Query: 203 INFGGIVAGAGVV-----STPLIIRDHYYLSLEAISVGNQ--RLEFVSSS-------TGN 248
N GG + TP++I + YY++L I +G + RL+ SSS +G
Sbjct: 271 PNAGGFLVFGDATYLNGDMTPMVIAEFYYVNLLGIGLGVEEPRLDINSSSFERKPDGSGG 330
Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS---SQPKFP 305
+ +D+G ++ P E + +++ + + +K KG P S C+ P FP
Sbjct: 331 VIIDSGSTLSIFPPEVYEVVRNAVVDKLK----KGYNISPLTSSPDCFEGKIGRDLPLFP 386
Query: 306 EVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVS 365
+ ++ + ++F DE+ C F G + G + Q ++ GY++E + +S
Sbjct: 387 TLVLYLESTGILNDRWSIFLQRYDELFCLGFTSGEGLSIIGTLAQQSYKFGYNLELSTLS 446
Query: 366 FK 367
+
Sbjct: 447 IE 448
>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
Length = 500
Score = 131 bits (329), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 111/371 (29%), Positives = 161/371 (43%), Gaps = 51/371 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +GTP +DTGSD W QC PC C+ Q +FDP+ S +Y ++ C++
Sbjct: 147 YFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRR--CYDQSGQMFDPRASHSYGAVDCAA 204
Query: 95 SQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
C + S + C Y YG G S ++G+ ATETLTF S + +P V G
Sbjct: 205 PLCRRLDSGGCDLRRKACLYQVAYGDG---SVTAGDFATETLTFASGA----RVPRVALG 257
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD---------QGSSK 202
CGH N + + G+ G+ S SQ+ FSYCL D SS
Sbjct: 258 CGHDNEGLFVAAAGLLGLG---RGSLSFPSQISRRFGRSFSYCLVDRTSSSASATSRSST 314
Query: 203 INFGGIVAG--AGVVSTPLI----IRDHYYLSLEAISVGNQRLEFV---------SSSTG 247
+ FG G A TP++ + YY+ L ISVG R+ V S+ G
Sbjct: 315 VTFGSGAVGPSAAASFTPMVKNPRMETFYYVQLMGISVGGARVPGVAVSDLRLDPSTGRG 374
Query: 248 NIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQP--K 303
+ VD+G T L ++ L+ G+ PG + CY++S K
Sbjct: 375 GVIVDSGTSVTRLARPAYAALRDAFRAA-----AAGLRLSPGGFSLFDTCYDLSGLKVVK 429
Query: 304 FPEVTIHFR-GADVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIE 360
P V++HF GA+ L P N + S C AF G + + + G I Q F + +D +
Sbjct: 430 VPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGD 489
Query: 361 QAMVSFKPSRC 371
+ F P C
Sbjct: 490 GQRLGFVPKGC 500
>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
Length = 430
Score = 130 bits (328), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 119/387 (30%), Positives = 177/387 (45%), Gaps = 57/387 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP-------CPELDCFKQEPPLFDPKKSSTY 87
YL+ ++ GTPP ++ DTGSD W QC CP+ C ++ P F KS+T
Sbjct: 54 YLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRR--PAFVASKSATL 111
Query: 88 NSISCSSSQCAVVTS------NCSEGD---CSYSFLYGRGAYASFSSGNLATETLTF-NS 137
+ + CS++QC +V + +CS C Y++ Y G S ++G LA +T T N
Sbjct: 112 SVVPCSAAQCLLVPAPRGHGPSCSPAAPVPCGYAYDYADG---SSTTGFLARDTATISNG 168
Query: 138 TSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD 197
TSG + V FGCG +N S S G+IGLG G S +Q G+ A FSYCL D
Sbjct: 169 TSG-GAAVRGVAFGCGTRNQGG--SFSGTGGVIGLGQGQLSFPAQSGSLFAQTFSYCLLD 225
Query: 198 -------QGSSKINFGGIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS- 245
+ SS + G A TPL+ YY+ + AI VGN+ L S
Sbjct: 226 LEGGRRGRSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEW 285
Query: 246 ------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNI 298
G +D+G T L L + +L S + + + + F + LCYN+
Sbjct: 286 AIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIP--SSATFFQGLELCYNV 343
Query: 299 SSQPK-------FPEVTIHF-RGADVKLSPSNLFRNISDEIMCSAFR---GGNANIVYGR 347
SS FP +TI F +G ++L N +++D++ C A R A V G
Sbjct: 344 SSSSSLAPANGGFPRLTIDFAQGLSLELPTGNYLVDVADDVKCLAIRPTLSPFAFNVLGN 403
Query: 348 IMQINFLIGYDIEQAMVSFKPSRCTNY 374
+MQ + + +D A + F + C +
Sbjct: 404 LMQQGYHVEFDRASARIGFARTECVAH 430
>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
Length = 500
Score = 130 bits (328), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 113/364 (31%), Positives = 168/364 (46%), Gaps = 53/364 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +G+P ++ +DTGSD TW QC+PC DC++Q P+FDP S++Y S++C +
Sbjct: 163 YFSRVGVGSPARQLYMVLDTGSDVTWVQCQPC--ADCYQQSDPVFDPSLSTSYASVACDN 220
Query: 95 SQC----AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
+C A N S G C Y YG G+Y + G+ ATETLT ++ PV +V
Sbjct: 221 PRCHDLDAAACRN-STGACLYEVAYGDGSY---TVGDFATETLTLGDSA--PVS--SVAI 272
Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ---GSSKINFGG 207
GCGH N + + G S IS A FSYCL D+ SS + FG
Sbjct: 273 GCGHDNEGLFVGAAGLLALGGGPLSFPSQIS------ATTFSYCLVDRDSPSSSTLQFGD 326
Query: 208 IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS-------TGNIFVDTGVL 256
A V+ PLI YY+ L ISVG Q L S+ G + VD+G
Sbjct: 327 --AADAEVTAPLIRSPRTSTFYYVGLSGISVGGQILSIPPSAFAMDGTGAGGVIVDSGTA 384
Query: 257 RTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNISSQP--KFPEVTIH 310
T L ++ L+ V+G + P S V CY++S + + P V++
Sbjct: 385 VTRLQSSAYAALRDAF--------VRGTQSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLR 436
Query: 311 FR-GADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFK 367
F G +++L N + C AF NA + + G + Q + +D ++ V F
Sbjct: 437 FAGGGELRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKSTVGFT 496
Query: 368 PSRC 371
++C
Sbjct: 497 SNKC 500
>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 436
Score = 130 bits (328), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 114/381 (29%), Positives = 175/381 (45%), Gaps = 70/381 (18%)
Query: 39 LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA 98
L++G+PP + +DTGS+ +W C+ P L +FDP +SS+Y+ I C+S C
Sbjct: 67 LTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHS------VFDPLRSSSYSPIPCTSPTCR 120
Query: 99 VVTSN------CSEGDCSYSFLYGRGAYASFSS--GNLATETLTFNSTSGLPVEMPNVIF 150
T + C + ++ + +YA SS GNLA++T +++ +P IF
Sbjct: 121 TRTRDFSIPVSCDKKKLCHAII----SYADASSIEGNLASDTFHIGNSA-----IPATIF 171
Query: 151 GCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIV 209
GC +S + DSK TG+IG+ G+ S ++QMG KFSYC+ Q SS I G
Sbjct: 172 GCMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQMGLQ---KFSYCISGQDSSGILLFGES 228
Query: 210 AGAGV----------VSTPLIIRDH--YYLSLEAISVGNQRLEFVSS-------STGNIF 250
+ + + +STPL D Y + LE I V N L+ S G
Sbjct: 229 SFSWLKALKYTPLVQISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGAGQTM 288
Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGF----SDVLCYNI----SSQP 302
VD+G T L ++ LK+ KA V +P F + LCY + + P
Sbjct: 289 VDSGTQFTFLLGPVYTALKNEFVRQTKAS--LKVLEDPNFVFQGAMDLCYRVPLTRRTLP 346
Query: 303 KFPEVTIHFRGADVKLSPSNLFRNI------SDEIMCSAFRGGNANI------VYGRIMQ 350
P VT+ FRGA++ +S L + SD + C F GN+ + + G Q
Sbjct: 347 PLPTVTLMFRGAEMSVSAERLMYRVPGVIRGSDSVYCFTF--GNSELLGVESYIIGHHHQ 404
Query: 351 INFLIGYDIEQAMVSFKPSRC 371
N + +D+ ++ V F RC
Sbjct: 405 QNVWMEFDLAKSRVGFAEVRC 425
>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
Length = 357
Score = 130 bits (328), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 111/361 (30%), Positives = 158/361 (43%), Gaps = 39/361 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + IG P + +DTGSD TW QC PC C+ Q P++DP SS+Y + C S
Sbjct: 12 YFARMGIGNPQRSYYLELDTGSDVTWIQCAPCSS--CYSQVDPIYDPSNSSSYRRVYCGS 69
Query: 95 SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
+ C A+ S C CSY +YG +S SSG+L E+ S M N+ FGCG
Sbjct: 70 ALCQALDYSACQGMGCSYRVVYGD---SSASSGDLGIESFYLGPNSS--TAMRNIAFGCG 124
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ------GSSKINFGG 207
H N ++ G+ G S SQ+ SI FSYCL D+ SS + FG
Sbjct: 125 HSNSGLFRGEAGLLGMGGG---TLSFFSQIAASIGPAFSYCLVDRYSQLQSRSSPLIFGR 181
Query: 208 IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVL 256
TPL+ I YY L ISVG L + TG +D+G
Sbjct: 182 TAIPFAARFTPLLKNPRINTFYYAVLTGISVGGTPLPIPPAQFALTGNGTGGAILDSGTS 241
Query: 257 RT-LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHF-R 312
T ++P Y + + P GV + C+N P + P + +HF
Sbjct: 242 VTRVVPPAYAVLRDAYRAASRNLPPAPGV-----YLLDTCFNFQGLPTVQIPSLVLHFDN 296
Query: 313 GADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSR 370
G D+ L N+ + C AF + I V G + Q F IG+D+++++++ P
Sbjct: 297 GVDMVLPGGNILIPVDRSGTFCLAFAPSSMPISVIGNVQQQTFRIGFDLQRSLIAIAPRE 356
Query: 371 C 371
C
Sbjct: 357 C 357
>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
Length = 429
Score = 130 bits (328), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 114/381 (29%), Positives = 175/381 (45%), Gaps = 70/381 (18%)
Query: 39 LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA 98
L++G+PP + +DTGS+ +W C+ P L +FDP +SS+Y+ I C+S C
Sbjct: 60 LTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHS------VFDPLRSSSYSPIPCTSPTCR 113
Query: 99 VVTSN------CSEGDCSYSFLYGRGAYASFSS--GNLATETLTFNSTSGLPVEMPNVIF 150
T + C + ++ + +YA SS GNLA++T +++ +P IF
Sbjct: 114 TRTRDFSIPVSCDKKKLCHAII----SYADASSIEGNLASDTFHIGNSA-----IPATIF 164
Query: 151 GCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIV 209
GC +S + DSK TG+IG+ G+ S ++QMG KFSYC+ Q SS I G
Sbjct: 165 GCMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQMGLQ---KFSYCISGQDSSGILLFGES 221
Query: 210 AGAGV----------VSTPLIIRDH--YYLSLEAISVGNQRLEFVSS-------STGNIF 250
+ + + +STPL D Y + LE I V N L+ S G
Sbjct: 222 SFSWLKALKYTPLVQISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGAGQTM 281
Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGF----SDVLCYNI----SSQP 302
VD+G T L ++ LK+ KA V +P F + LCY + + P
Sbjct: 282 VDSGTQFTFLLGPVYTALKNEFVRQTKAS--LKVLEDPNFVFQGAMDLCYRVPLTRRTLP 339
Query: 303 KFPEVTIHFRGADVKLSPSNLFRNI------SDEIMCSAFRGGNANI------VYGRIMQ 350
P VT+ FRGA++ +S L + SD + C F GN+ + + G Q
Sbjct: 340 PLPTVTLMFRGAEMSVSAERLMYRVPGVIRGSDSVYCFTF--GNSELLGVESYIIGHHHQ 397
Query: 351 INFLIGYDIEQAMVSFKPSRC 371
N + +D+ ++ V F RC
Sbjct: 398 QNVWMEFDLAKSRVGFAEVRC 418
>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 459
Score = 130 bits (328), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 122/408 (29%), Positives = 177/408 (43%), Gaps = 58/408 (14%)
Query: 7 LPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPC 66
L F N T KSP+ I A S Y + + +GTPP + DTGSD W +C C
Sbjct: 64 LLFSRPNPTLKSPL--ISGASTGSGQ--YFVDIRLGTPPQSLLLVADTGSDLVWVKCSAC 119
Query: 67 PELDCFKQEPP--LFDPKKSSTYNSISCSSSQCAVVTSNCSEGDCSYS-------FLYGR 117
+C PP F P+ SS+++ C C ++ + C+++ FLY
Sbjct: 120 --RNC-SHHPPSSAFLPRHSSSFSPFHCFDPHCRLL-PHAPHHLCNHTRLHSPCRFLYSY 175
Query: 118 GAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQ----TGIIGLG 173
A S SSG + ET T S SG + + + FGCG + ++ P+ Q G++GLG
Sbjct: 176 -ADGSLSSGFFSKETTTLKSLSGSEIHLKGLSFGCGFR-ISGPSVSGAQFNGARGVMGLG 233
Query: 174 PGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGVVSTPL-------------- 219
G+ S SQ+G KFSYCL D S ++ G G+ S PL
Sbjct: 234 RGSISFSSQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQIN 293
Query: 220 -IIRDHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTLL-PLEYHSNLKS 270
+ YY+++ +I++ +L E G VD+G T L Y LKS
Sbjct: 294 PLSPTFYYITIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKS 353
Query: 271 VMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEV-TIHFR---GADVKLSPSNLFRN 326
V + + PGF LC N S + + P + + FR GA P N F
Sbjct: 354 VRR---RVKLPNAAELTPGFD--LCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLE 408
Query: 327 ISDEIMCSAFRG---GNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ +MC A R GN V G +MQ FL+ +D E++ + F C
Sbjct: 409 TEEGVMCLAIRAVESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGC 456
>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
Length = 460
Score = 130 bits (327), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 116/353 (32%), Positives = 167/353 (47%), Gaps = 39/353 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YL+ + +G+P +D+GSD +W QC+PC L C Q PLFDP SSTY+ SCSS
Sbjct: 131 YLITVRLGSPAKTQTVLIDSGSDVSWVQCKPC--LQCHSQVDPLFDPSLSSTYSPFSCSS 188
Query: 95 SQCAVVTSN----CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
+ CA + + S C Y Y G S ++G +++TL S + + N F
Sbjct: 189 AACAQLGQDGNGCSSSSQCQYIVRYADG---SSTTGTYSSDTLALGSNT-----ISNFQF 240
Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVA 210
GC H + S +D G++GLG G SL SQ + FSYCLP SS F + A
Sbjct: 241 GCSH--VESGFNDLTD-GLMGLGGGAPSLASQTAGTFGTAFSYCLPPTPSSS-GFLTLGA 296
Query: 211 G-AGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLLPLEY 264
G +G V TP++ + Y + LEAI VG +L +S + + +D+G + T LP
Sbjct: 297 GTSGFVKTPMLRSSPVPTFYGVRLEAIRVGGTQLSIPTSVFSAGMVMDSGTIITRLPRTA 356
Query: 265 HSNLKSVM-SNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFRGADVKLSPS 321
+S L S + M + +P A P C++ S Q + P V + F G V
Sbjct: 357 YSALSSAFKAGMKQYRP-----APPRSIMDTCFDFSGQSSVRLPSVALVFSGGAVV---- 407
Query: 322 NLFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
NL N C AF + + + G + Q F + YD+ V FK C
Sbjct: 408 NLDANGIILGNCLAFAANSDDSSPGIVGNVQQRTFEVLYDVGGGAVGFKAGAC 460
>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 475
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 104/358 (29%), Positives = 159/358 (44%), Gaps = 39/358 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + + +G+PP + + +D+GSD W QCEPC + C+ Q P+F+P SS+++ +SC+S
Sbjct: 136 YFVRIGVGSPPRNQYVVMDSGSDIIWVQCEPCTQ--CYHQSDPVFNPADSSSFSGVSCAS 193
Query: 95 SQCAVV-TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
+ C+ V + C EG C Y YG G+Y + G LA ET+TF T + NV GCG
Sbjct: 194 TVCSHVDNAACHEGRCRYEVSYGDGSY---TKGTLALETITFGRT-----LIRNVAIGCG 245
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG---SSKINFGGIVA 210
H N + G+ G S + Q+G G FSYCL +G S + FG
Sbjct: 246 HHNQGMFVGAAGLLGLGGG---PMSFVGQLGGQTGGAFSYCLVSRGIESSGLLEFGREAM 302
Query: 211 GAGVVSTPLI----IRDHYYLSLEA-------ISVGNQRLEFVSSSTGNIFVDTGVLRTL 259
G PLI + YY+ L +S+ + G + +DTG T
Sbjct: 303 PVGAAWVPLIHNPRAQSFYYIGLSGLGVGGLRVSISEDVFKLSELGDGGVVMDTGTAVTR 362
Query: 260 LP-LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHFRGADV 316
LP + Y + ++ GV CY++ + P V+ +F G +
Sbjct: 363 LPTVAYEAFRDGFIAQTTNLPRASGVSIFD-----TCYDLFGFVSVRVPTVSFYFSGGPI 417
Query: 317 KLSPSNLFRNISDEI--MCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
P+ F D++ C AF ++ + + G I Q I D V F P+ C
Sbjct: 418 LTLPARNFLIPVDDVGTFCFAFAPSSSGLSIIGNIQQEGIQISVDGANGFVGFGPNVC 475
>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 103/353 (29%), Positives = 154/353 (43%), Gaps = 48/353 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + + +G+PP + +D+GSD W QC+PC + C+ Q P+FDP S+++ +SCSS
Sbjct: 201 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQ--CYHQSDPVFDPADSASFTGVSCSS 258
Query: 95 SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
S C + + C G C Y YG G+Y + G LA ETLTF T + +V GCG
Sbjct: 259 SVCDRLENAGCHAGRCRYEVSYGDGSY---TKGTLALETLTFGRTM-----VRSVAIGCG 310
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAG 213
H+N + G+ G + S + Q+G G FSYCL V+ A
Sbjct: 311 HRNRGMFVGAAGLLGLGGG---SMSFVGQLGGQTGGAFSYCL-------------VSAAW 354
Query: 214 VVSTPLIIRDH----YYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTLLP- 261
V PL+ YY+ L + VG R+ G + +DTG T LP
Sbjct: 355 V---PLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMDTGTAVTRLPT 411
Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPS 321
L Y + + ++ GV D+L + + P V+ +F G + P+
Sbjct: 412 LAYQAFRDAFLAQTANLPRATGVAIFDTCYDLLGF---VSVRVPTVSFYFSGGPILTLPA 468
Query: 322 NLFRNISDE--IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
F D+ C AF + + + G I Q I +D V F P+ C
Sbjct: 469 RNFLIPMDDAGTFCFAFAPSTSGLSILGNIQQEGIQISFDGANGYVGFGPNIC 521
>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 472
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 112/366 (30%), Positives = 173/366 (47%), Gaps = 47/366 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ L IGTP V +DTGSD +W QC+PC DC+ Q+ PLFDP KSST+ +I C+S
Sbjct: 125 YVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNASDCYPQKDPLFDPSKSSTFATIPCAS 184
Query: 95 SQCAVV--------TSNCSEG---DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
C + +N + G C Y+ YG GA + G +TETL S++
Sbjct: 185 DACKQLPVDGYDNGCTNNTSGMPPQCGYAIEYGNGA---ITEGVYSTETLALGSSA---- 237
Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSS 201
+ + FGCG K G++GLG SL+SQ + G FSYCLP + G+
Sbjct: 238 VVKSFRFGCGSDQHG---PYDKFDGLLGLGGAPESLVSQTASVYGGAFSYCLPPLNSGAG 294
Query: 202 KINFGGI----VAGAGVVSTPL-----IIRDHYYLSLEAISVGNQRLEFVSS--STGNIF 250
+ G + +G V TP+ I Y ++L ISVG + L+ + + GNI
Sbjct: 295 FLTLGAPNSTNNSNSGFVFTPMHAFSPKIATFYVVTLTGISVGGKALDIPPAVFAKGNI- 353
Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP--KFPEV 307
VD+G + T +P + L++ + + P+ P S + CYN + P+V
Sbjct: 354 VDSGTVITGIPTTAYKALRTAFRSAMAEYPL----LPPADSALDTCYNFTGHGTVTVPKV 409
Query: 308 TIHF-RGADVKLS-PSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVS 365
+ F GA V L PS + + ++ + A G + + G + + YD + +
Sbjct: 410 ALTFVGGATVDLDVPSGV---LVEDCLAFADAGDGSFGIIGNVNTRTIEVLYDSGKGHLG 466
Query: 366 FKPSRC 371
F+ C
Sbjct: 467 FRAGAC 472
>gi|224114179|ref|XP_002332420.1| predicted protein [Populus trichocarpa]
gi|222832373|gb|EEE70850.1| predicted protein [Populus trichocarpa]
Length = 449
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 104/348 (29%), Positives = 167/348 (47%), Gaps = 35/348 (10%)
Query: 52 VDTGSDCTWTQCEPCPELD--CFKQEPPLFDPKKSSTYNSISCSS-SQCAVVTSNCSEGD 108
+DTG++ +W QCE C CF + P + +S +Y +SC+ S C + C EG
Sbjct: 105 IDTGNELSWIQCEGCQNKGNMCFPHKDPPYTSSQSKSYKPVSCNQHSFCE--PNQCKEGL 162
Query: 109 CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG--HKNL--ASPTSDS 164
C+Y+ YG G+Y +SGNLA ET TF S G + ++ FGC +N+ A +
Sbjct: 163 CAYNVTYGPGSY---TSGNLANETFTFYSNHGKHTALKSISFGCSTDSRNMIYAFLLDKN 219
Query: 165 KQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKINFGGIVAGAGVVSTPLIIR 222
+G++G+G G S ++Q+G+ GKFSYC+ + ++ + FG V + + T I++
Sbjct: 220 PVSGVLGMGWGPRSFLAQLGSISHGKFSYCITANNTHNTYLRFGKHVVKSKNLQTTKIMQ 279
Query: 223 ----DHYYLSLEAISVGNQRLEFVSS--------STGNIFVDTGVLRTLLPLEYHSNLKS 270
Y+++L ISV +L + S G I +D G L TLL L +
Sbjct: 280 VKPSAAYHVNLLGISVNGVKLNITKTDLAVRKDGSRGCI-IDAGTLATLLVKPIFDTLHT 338
Query: 271 VMSNMIKA-QPVKGVGAEPGFSDVLCYNISS---QPKFPEVTIHFRGADVKLSPSN--LF 324
+SN + + Q +K D LCY S + P VT H AD+++ P LF
Sbjct: 339 ALSNHLSSNQNLKRWVIHKLHKD-LCYEQLSDAGRKNLPVVTFHLENADLEVKPEAIFLF 397
Query: 325 RNISDE-IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
R + + C + ++ + G Q+ YD + ++SF P C
Sbjct: 398 REFEGKNVFCLSMLSDDSKTIIGAYQQMKQKFVYDTKARVLSFGPEDC 445
>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 447
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 103/356 (28%), Positives = 163/356 (45%), Gaps = 35/356 (9%)
Query: 36 LMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSS 95
+ ++SIG PP+ +DTGSD W C PC +C LFDP KSST++ + +
Sbjct: 102 MANISIGQPPIPQLVVMDTGSDILWVMCTPC--TNCDNDLGLLFDPSKSSTFSPL--CKT 157
Query: 96 QCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHK 155
C C + ++ A S +SG +T+ F +T + +V+FGCGH
Sbjct: 158 PCDFEGCRCDPIPFTVTY-----ADNSTASGTFGRDTVVFETTDEGTSRISDVLFGCGH- 211
Query: 156 NLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGV- 214
N+ T D GI+GL G SL++++G KFSYC+ + N+ ++ G G
Sbjct: 212 NIGHDT-DPGHNGILGLNNGPDSLVTKLGQ----KFSYCIGNLADPYYNYHQLILGEGAD 266
Query: 215 ---VSTPL-IIRDHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTLLPLE 263
STP + YY+++E ISVG +RL E + G + +DTG T L
Sbjct: 267 LEGYSTPFEVYNGFYYVTMEGISVGEKRLDIAPETFEMKENRAGGVIIDTGSTITFLVDS 326
Query: 264 YHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ-PKFPEVTIHFR-GADVKLSPS 321
H L + N++ + + + +IS FP VT HF GAD+ L
Sbjct: 327 VHKLLSKEVRNLLGWSFRQATIEKSPWMQCFYGSISRDLVGFPVVTFHFSDGADLALDSG 386
Query: 322 NLFRNISDEIMC------SAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ F ++D + C S+ + + G + Q ++ +GYD+ V F+ C
Sbjct: 387 SFFNQLNDNVFCMTVGPVSSLNIKSKPSLIGLLAQQSYNVGYDLVNQFVYFQRIDC 442
>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
Length = 463
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 110/381 (28%), Positives = 180/381 (47%), Gaps = 30/381 (7%)
Query: 2 QNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWT 61
Q + + + E KS + ++I + D Y++++ IGTP ++ DTGS WT
Sbjct: 101 QARRSMNLTSSVEHMKSSVPFYGLSKITASD--YIVNVGIGTPKKEMPLIFDTGSGLIWT 158
Query: 62 QCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYA 121
QC+PC C+ + P +FDP KS+++ + CSS C + CS C+Y Y
Sbjct: 159 QCKPCKA--CYPKVP-VFDPTKSASFKGLPCSSKLCQSIRQGCSSPKCTYLTAY---VDN 212
Query: 122 SFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLIS 181
S S+G LATET++F S L + N++ GC + +S GI+GL SL S
Sbjct: 213 SSSTGTLATETISF---SHLKYDFKNILIGCSDQVSGESLGES---GIMGLNRSPISLAS 266
Query: 182 QMGTSIAGKFSYCLPDQGSS--KINFGGIVAGAGVVS--TPLIIRDHYYLSLEAISVGNQ 237
Q FSYC+P S + FGG V S + Y + + ISVG +
Sbjct: 267 QTANIYDKLFSYCIPSTPGSTGHLTFGGKVPNDVRFSPVSKTAPSSDYDIKMTGISVGGR 326
Query: 238 RLEFVSSSTGNIF--VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLC 295
+L + +S I +D+G + T LP + +S L+SV M+K P+ + F D C
Sbjct: 327 KL-LIDASAFKIASTIDSGAVLTRLPPKAYSALRSVFREMMKGYPLLD---QDDFLDT-C 381
Query: 296 YNIS--SQPKFPEVTIHFRGA-DVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQ 350
Y+ S S P +++ F G ++ + S + + ++ C AF + + ++G Q
Sbjct: 382 YDFSNYSTVAIPSISVFFEGGVEMDIDVSGIMWQVPGSKVYCLAFAELDDEVSIFGNFQQ 441
Query: 351 INFLIGYDIEQAMVSFKPSRC 371
+ + +D + + F P C
Sbjct: 442 KTYTVVFDGAKERIGFAPGGC 462
>gi|359476204|ref|XP_002262813.2| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 460
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 113/371 (30%), Positives = 172/371 (46%), Gaps = 50/371 (13%)
Query: 24 YQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKK 83
+ + D +L+ ++ GTP +I +DTGS TWTQC+ C ++C + FD
Sbjct: 117 HNNNLFDEDGNFLVDVAFGTPXTEIXLILDTGSSITWTQCKAC--VNCLQDSNRYFDSSA 174
Query: 84 SSTYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
SSTY+ SC S V +N Y+ YG S S GN +T+T +
Sbjct: 175 SSTYSFGSCIPS---TVENN-------YNMTYGD---DSTSVGNYGCDTMTLEPSD---- 217
Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG---- 199
FGCG N S G++GLG G S +SQ + FSYCLP++
Sbjct: 218 VFQKFQFGCGRNNKGD--FGSGVDGMLGLGQGQLSTVSQTASKFNKVFSYCLPEEDSIGS 275
Query: 200 ----------SSKINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSS---ST 246
SS + F +V G G + +Y+++L ISVGN+RL SS S
Sbjct: 276 LLFGEKATSQSSSLKFTSLVNGPGTLQES----GYYFVNLSDISVGNERLNIPSSVFASP 331
Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQPK- 303
G I +D+ + T LP +S LK+ + P+ + G D+L CYN+S +
Sbjct: 332 GTI-IDSRTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKG--DILDTCYNLSGRKDV 388
Query: 304 -FPEVTIHF-RGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQ 361
PE+ +HF GADV+L+ +N+ +C AF G + + G Q++ + YDI+
Sbjct: 389 LLPEIVLHFGGGADVRLNGTNIVWGSDASRLCLAFAGTSELTIIGNRQQLSLTVLYDIQG 448
Query: 362 AMVSFKPSRCT 372
+ F + C+
Sbjct: 449 RRIGFGGNGCS 459
>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 470
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 113/370 (30%), Positives = 170/370 (45%), Gaps = 55/370 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +G+ + + VDTGSD TW QCEPC C+ Q PLF P S +Y I C+S
Sbjct: 122 YIVTMGLGSQNMSVI--VDTGSDLTWVQCEPCR--SCYNQNGPLFKPSTSPSYQPILCNS 177
Query: 95 SQCAVVTSNCSEGD------CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
+ C + D C Y YG G+Y +SG L E L F S + N
Sbjct: 178 TTCQSLELGACGSDPSTSATCDYVVNYGDGSY---TSGELGIEKLGFGGIS-----VSNF 229
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKINFG 206
+FGCG N S G++GLG S+ISQ + G FSYCLP DQ + G
Sbjct: 230 VFGCGRNNKGLFGGAS---GLMGLGRSELSMISQTNATFGGVFSYCLPSTDQAGAS---G 283
Query: 207 GIVAG--AGVVS--TP---------LIIRDHYYLSLEAISVGNQRLEFVSSSTGN--IFV 251
+V G +GV TP L + + Y L+L I VG L +SS GN + +
Sbjct: 284 SLVMGNQSGVFKNVTPIAYTRMLPNLQLSNFYILNLTGIDVGGVSLHVQASSFGNGGVIL 343
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVT 308
D+G + + L + LK+ P + PGFS + C+N++ Q P ++
Sbjct: 344 DSGTVISRLAPSVYKALKAKFLEQFSGFP-----SAPGFSILDTCFNLTGYDQVNIPTIS 398
Query: 309 IHFRG-ADVKLSPSNLFRNISDEI--MCSAFRGGNANI---VYGRIMQINFLIGYDIEQA 362
++F G A++ + + +F + ++ +C A + + G Q N + YD + +
Sbjct: 399 MYFEGNAELNVDATGIFYLVKEDASRVCLALASLSDEYEMGIIGNYQQRNQRVLYDAKLS 458
Query: 363 MVSFKPSRCT 372
V F CT
Sbjct: 459 QVGFAKEPCT 468
>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
Length = 464
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 111/354 (31%), Positives = 158/354 (44%), Gaps = 38/354 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + IGTP D+ DTGSD TWTQCEPC C+ Q+ P F+P SSTY ++SCSS
Sbjct: 132 YIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLG-SCYSQKEPKFNPSSSSTYQNVSCSS 190
Query: 95 SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
C S CS +C YS YG SF+ G LA E T ++ L +V FGCG
Sbjct: 191 PMCEDAES-CSASNCVYSIGYGD---KSFTQGFLAKEKFTLTNSDVL----EDVYFGCGE 242
Query: 155 KN---LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSKINFGGI 208
N P +Q T+ FSYCLP + + FG
Sbjct: 243 NNQGLFDGVAGLLGLGPGKLSLP------AQTTTTYNNIFSYCLPSFTSNSTGHLTFGSA 296
Query: 209 VAGAGVVSTPLI---IRDHYYLSLEAISVGNQRLEFV--SSSTGNIFVDTGVLRTLLPLE 263
V TP+ +Y + + ISVG++ L S ST +D+G + T LP +
Sbjct: 297 GISESVKFTPISSFPSAFNYGIDIIGISVGDKELAITPNSFSTEGAIIDSGTVFTRLPTK 356
Query: 264 YHSNLKSVM-SNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFRGAD-VKLS 319
++ L+SV M + G G CY+ + +P + F G V+L
Sbjct: 357 VYAELRSVFKEKMSSYKSTSGYGLFD-----TCYDFTGLDTVTYPTIAFSFAGGTVVELD 411
Query: 320 PSNLFRNISDEIMCSAFRGGNANI--VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
S + I +C AF GN ++ ++G + Q + YD+ V F P+ C
Sbjct: 412 GSGISLPIKISQVCLAF-AGNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464
>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
Length = 390
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 110/362 (30%), Positives = 163/362 (45%), Gaps = 41/362 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + IG+P + +DTGSD TW QC PC C+ Q P++DP SS+Y + C S
Sbjct: 45 YFARMGIGSPQRSYYLELDTGSDVTWIQCAPCSS--CYSQVDPIYDPSNSSSYRRVYCGS 102
Query: 95 SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
+ C A+ S C CSY +YG +S SSG+L E+ S M N+ FGCG
Sbjct: 103 ALCQALDYSACQGMGCSYRVVYGD---SSASSGDLGIESFYLGPNSS--TAMRNIAFGCG 157
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ------GSSKINFGG 207
H N ++ G+ G S SQ+ SI FSYCL D+ SS + FG
Sbjct: 158 HSNSGLFRGEAGLLGMGGG---TLSFFSQIAASIGPAFSYCLVDRYSQLQSRSSPLIFGR 214
Query: 208 IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGV- 255
TPL+ I YY L ISVG L + TG +D+G
Sbjct: 215 TAIPFAARFTPLLKNPRIDTFYYAILTGISVGGTALPIPPAQFALTGNGTGGAILDSGTS 274
Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP--KFPEVTIHF- 311
+ ++P Y +V+ + +A + + PG + C+N P + P + +HF
Sbjct: 275 VTRVVPAAY-----AVLRDAYRAAS-RNLPPAPGVYLLDTCFNFQGLPTVQIPSLVLHFD 328
Query: 312 RGADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPS 369
D+ L N+ + C AF + I V G + Q F IG+D+++++++ P
Sbjct: 329 NDVDMVLPGGNILIPVDRSGTFCLAFAPSSMPISVIGNVQQQTFRIGFDLQRSLIAIAPR 388
Query: 370 RC 371
C
Sbjct: 389 EC 390
>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
gi|223948009|gb|ACN28088.1| unknown [Zea mays]
gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
Length = 507
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 109/374 (29%), Positives = 171/374 (45%), Gaps = 59/374 (15%)
Query: 40 SIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA- 98
S G+P ++ VDTGSD TW QC+PC C+ Q PLFDP S+TY ++ C++S CA
Sbjct: 153 SSGSPAANLTVIVDTGSDLTWVQCKPCSA--CYAQRDPLFDPAGSATYAAVRCNASACAD 210
Query: 99 -----------VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
++ C Y+ YG G SFS G LAT+T+ S +
Sbjct: 211 SLRAATGTPGSCGSTGAGSEKCYYALAYGDG---SFSRGVLATDTVALGGAS-----LGG 262
Query: 148 VIFGCG--HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP------DQG 199
+FGCG ++ L T+ G++GLG SL+SQ + G FSYCLP G
Sbjct: 263 FVFGCGLSNRGLFGGTA-----GLMGLGRTELSLVSQTASRYGGVFSYCLPAATSGDASG 317
Query: 200 SSKINFGGIVAGAGVVSTPL----IIRD-----HYYLSLEAISVGNQRLEFVSSSTGNIF 250
S + G A + +TP+ +I D Y+L++ +VG L N+
Sbjct: 318 SLSLGGGDDAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQGLGASNVL 377
Query: 251 VDTG-VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISS--QPKFPE 306
+D+G V+ L P Y ++V + ++ G A PGFS + CY+++ + K P
Sbjct: 378 IDSGTVITRLAPSVY----RAVRAEFMRQFGAAGYPAAPGFSILDTCYDLTGHDEVKVPL 433
Query: 307 VTIHFR-GADVKLSPSNLFRNISDE-----IMCSAFRGGNANIVYGRIMQINFLIGYDIE 360
+T+ GADV + + + + + + ++ + + G Q N + YD
Sbjct: 434 LTLRLEGGADVTVDAAGMLFVVRKDGSQVCLAMASLSYEDETPIIGNYQQKNKRVVYDTL 493
Query: 361 QAMVSFKPSRCTNY 374
+ + F C NY
Sbjct: 494 GSRLGFADEDC-NY 506
>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 392
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 114/373 (30%), Positives = 171/373 (45%), Gaps = 54/373 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + S+GTP VDTGSD + QC PC +L C++Q+ PL+ P SST+ + C S
Sbjct: 34 YFVDFSLGTPEQKFHLIVDTGSDLAFVQCAPC-DL-CYEQDGPLYQPSNSSTFTPVPCDS 91
Query: 95 SQC----AVVTSNCS--------EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
++C A V + CS +G CSY + YG S + G A ET T
Sbjct: 92 AECLLIPAPVGAPCSSSYPESPPQGACSYEYRYGDN---SSTVGVFAYETATVGG----- 143
Query: 143 VEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK 202
+ + +V FGCG++N S S G++GLG G S SQ G + KF+YCL S
Sbjct: 144 IRVNHVAFGCGNRNQGSFVSAG---GVLGLGQGALSFTSQAGYAFENKFAYCLTSYLSPT 200
Query: 203 INFGGIVAGAGVVS-------TPLIIR----DHYYLSLEAISVGNQRLEFVSSS------ 245
F ++ G ++S TPL+ YY+ + I G + L S+
Sbjct: 201 SVFSSLIFGDDMMSTIHDLQFTPLVSNPLNPSVYYVQIVRICFGGETLLIPDSAWKIDSV 260
Query: 246 -TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEP-GFSDVLCYNIS--SQ 301
G D+G T + ++ +++ K+ P P G LC N+S
Sbjct: 261 GNGGTIFDSGTTVTYWSPQAYAR---IIAAFEKSVPYPRAPPSPQGLP--LCVNVSGIDH 315
Query: 302 PKFPEVTIHF-RGADVKLSPSNLFRNISDEIMCSAFRGGNAN--IVYGRIMQINFLIGYD 358
P +P TI F +GA + + N F +S I C A +++ V G I+Q N+L+ YD
Sbjct: 316 PIYPSFTIEFDQGATYRPNQGNYFIEVSPNIDCLAMLESSSDGFNVIGNIIQQNYLVQYD 375
Query: 359 IEQAMVSFKPSRC 371
E+ + F + C
Sbjct: 376 REEHRIGFAHANC 388
>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 105/361 (29%), Positives = 154/361 (42%), Gaps = 45/361 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + + +G+PP + +D+GSD W QC+PC + C+ Q PLFDP S+++ +SCSS
Sbjct: 43 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCKPCTQ--CYHQTDPLFDPADSASFMGVSCSS 100
Query: 95 SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
+ C V + C+ G C Y YG G S + G LA ETLT T + NV GCG
Sbjct: 101 AVCDQVDNAGCNSGRCRYEVSYGDG---SSTKGTLALETLTLGRTV-----VQNVAIGCG 152
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---INFGGIVA 210
H N + G+ G + S + Q+ FSYCL + ++ + FG
Sbjct: 153 HMNQGMFVGAAGLLGLGGG---SMSFVGQLSRERGNAFSYCLVSRVTNSNGFLEFGSEAM 209
Query: 211 GAGVVSTPLIIRDH----YYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTL 259
G PLI H YY+ L + VG+ ++ E G + +DTG T
Sbjct: 210 PVGAAWIPLIRNPHSPSYYYIGLSGLGVGDMKVPISEDIFELTELGNGGVVMDTGTAVTR 269
Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNISS--QPKFPEVTIHFRG 313
P + + + G P S V CYN+ + P V+ +F G
Sbjct: 270 FPTVAYEAFRDAF--------IDQTGNLPRASGVSIFDTCYNLFGFLSVRVPTVSFYFSG 321
Query: 314 ADVKLSPSNLFRNISDE--IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSR 370
+ P+N F D+ C AF + + + G I Q I D V F P+
Sbjct: 322 GPILTLPANNFLIPVDDAGTFCFAFAPSPSGLSILGNIQQEGIQISVDGANEFVGFGPNV 381
Query: 371 C 371
C
Sbjct: 382 C 382
>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
Length = 543
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 118/414 (28%), Positives = 184/414 (44%), Gaps = 62/414 (14%)
Query: 3 NSQKLPFYNDNETPKSPISIIYQAEIIS---------VDDIYLMHLSIGTPPVDIFGSVD 53
NS +L ND S S + + S V I L S G+P ++ VD
Sbjct: 149 NSFQLRIRNDRAAAASTQSGSAEVPLTSGIRFQTLNYVTTIALGGGSSGSPAANLTVIVD 208
Query: 54 TGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAV-------VTSNCSE 106
TGSD TW QC+PC C+ Q PLFDP S+TY ++ C++S CA +C
Sbjct: 209 TGSDLTWVQCKPCSA--CYAQRDPLFDPAGSATYAAVRCNASACAASLKAATGTPGSCGG 266
Query: 107 GD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG--HKNLASPTS 162
G+ C Y+ YG G SFS G LAT+T+ S + +FGCG ++ L T+
Sbjct: 267 GNERCYYALAYGDG---SFSRGVLATDTVALGGAS-----LDGFVFGCGLSNRGLFGGTA 318
Query: 163 DSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP----DQGSSKINFGGIVAGAGVVSTP 218
G++GLG SL+SQ G FSYCLP S ++ GG A + +TP
Sbjct: 319 -----GLMGLGRTELSLVSQTALRYGGVFSYCLPATTSGDASGSLSLGGD-ASSYRNTTP 372
Query: 219 L----IIRD-----HYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLK 269
+ +I D Y+L++ +VG L N+ +D+G + T L + ++
Sbjct: 373 VAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQGLGASNVLIDSGTVITRLAPSVYRGVR 432
Query: 270 SVMSNMIKAQPVKGVGAEPGFSDV-LCYNISS--QPKFPEVTIHFR-GADVKLSPSNLFR 325
+ + A G PGFS + CY+++ + K P +T+ GA+V + + +
Sbjct: 433 AEFTRQFAA---AGYPTAPGFSILDTCYDLTGHDEVKVPLLTLRLEGGAEVTVDAAGMLF 489
Query: 326 NISDE-----IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTNY 374
+ + + ++ + + G Q N + YD + + F C NY
Sbjct: 490 VVRKDGSQVCLAMASLSYEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDC-NY 542
>gi|125558627|gb|EAZ04163.1| hypothetical protein OsI_26305 [Oryza sativa Indica Group]
Length = 404
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 92/232 (39%), Positives = 121/232 (52%), Gaps = 32/232 (13%)
Query: 22 IIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDP 81
+ +Q + + Y M+LSIGTPPV DTGS WTQC PC E C + P F P
Sbjct: 77 VSFQTLLDNSAGAYNMNLSIGTPPVTFSVLADTGSSLIWTQCAPCTE--CAARPAPPFQP 134
Query: 82 KKSSTYNSISCSSSQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNST 138
SST++ + C+SS C +TS C+ C Y + YG G F++G LATETL
Sbjct: 135 ASSSTFSKLPCASSLCQFLTSPYRTCNATGCVYYYPYGMG----FTAGYLATETLHVGGA 190
Query: 139 SGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL--- 195
S P V FGC +N +S +GI+GLG SL+SQ+G + +FSYCL
Sbjct: 191 S-----FPGVTFGCSTENGVGNSS----SGIVGLGRSPLSLVSQVGVA---RFSYCLRSN 238
Query: 196 PDQGSSKINFGGI--VAGAGVVSTPLIIR------DHYYLSLEAISVGNQRL 239
D G S I FG + V G V STPL+ +YY++L I+VG L
Sbjct: 239 ADAGDSPILFGSLAKVTGGNVQSTPLLENPEMPSSSYYYVNLTGITVGATDL 290
>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 412
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 109/370 (29%), Positives = 177/370 (47%), Gaps = 56/370 (15%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +G+ + + +DTGSD TW QCEPC + C+ Q+ P+F P SS+Y S+SC+S
Sbjct: 65 YIVTMGLGSTNMTVI--IDTGSDLTWVQCEPC--MSCYNQQGPIFKPSTSSSYQSVSCNS 120
Query: 95 SQCAVV------TSNCSEG--DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
S C + T C C+Y YG G+Y ++G L E L+F V +
Sbjct: 121 STCQSLQFATGNTGACGSNPSTCNYVVNYGDGSY---TNGELGVEQLSFGG-----VSVS 172
Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKIN 204
+ +FGCG N S G++GLG SL+SQ + G FSYCLP + G+S
Sbjct: 173 DFVFGCGRNNKGLFGGVS---GLMGLGRSYLSLVSQTNATFGGVFSYCLPTTESGAS--- 226
Query: 205 FGGIVAG--AGVVS--TPLI---------IRDHYYLSLEAISVGNQRLEFVSSSTGNIFV 251
G +V G + V TP+ + + Y L+L I V L+ S G + +
Sbjct: 227 -GSLVMGNESSVFKNVTPITYTRMLPNPQLSNFYILNLTGIDVDGVALQVPSFGNGGVLI 285
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVT 308
D+G + T LP + LK++ P + PGFS + C+N++ + P ++
Sbjct: 286 DSGTVITRLPSSVYKALKALFLKQFTGFP-----SAPGFSILDTCFNLTGYDEVSIPTIS 340
Query: 309 IHFRG-ADVKLSPSNLFRNISDE-----IMCSAFRGGNANIVYGRIMQINFLIGYDIEQA 362
+HF G A++K+ + F + ++ + ++ + G Q N + YD +Q+
Sbjct: 341 MHFEGNAELKVDATGTFYVVKEDASQVCLALASLSDAYDTAIIGNYQQRNQRVIYDTKQS 400
Query: 363 MVSFKPSRCT 372
V F C+
Sbjct: 401 KVGFAEESCS 410
>gi|22165126|gb|AAM93742.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|31433307|gb|AAP54836.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125575547|gb|EAZ16831.1| hypothetical protein OsJ_32302 [Oryza sativa Japonica Group]
Length = 405
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 104/389 (26%), Positives = 169/389 (43%), Gaps = 48/389 (12%)
Query: 13 NETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCF 72
+ TP + + +S +Y+ + +IGTPP + VD + WTQC PC CF
Sbjct: 35 DATPPAAGGAVAVPIYLSSQGLYVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQP--CF 92
Query: 73 KQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNCSEGDCSYSFLYGRGAYASFSSGNLA 129
+Q+ PLFDP KSST+ + C S C + + NC+ C +Y A + G
Sbjct: 93 EQDLPLFDPTKSSTFRGLPCGSHLCESIPESSRNCTSDVC----IYEAPTKAGDTGGKAG 148
Query: 130 TETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG 189
T+T + + FGC + +GI+GLG SL++QM +
Sbjct: 149 TDTFAIGAAK------ETLGFGCVVMTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVT--- 199
Query: 190 KFSYCLPDQGSSKINFGGI---VAGAGVVSTPLIIRD-----------HYYLSLEAISVG 235
FSYCL + S + G +AG STP +I+ +Y + L I G
Sbjct: 200 AFSYCLAGKSSGALFLGATAKQLAGGKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKTG 259
Query: 236 NQRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLC 295
L+ SSS + +DT + L + LK ++ + QPV A P LC
Sbjct: 260 GAPLQAASSSGSTVLLDTVSRASYLADGAYKALKKALTAAVGVQPV----ASPPKPYDLC 315
Query: 296 YNISSQPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNANI----------V 344
+ + PE+ F GA + + P+N + +C G +A++ +
Sbjct: 316 FPKAVAGDAPELVFTFDGGAALTVPPANYLLASGNGTVCLTI-GSSASLNLTGELEGASI 374
Query: 345 YGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
G + Q N + +D+++ +SFKP+ C++
Sbjct: 375 LGSLQQENVHVLFDLKEETLSFKPADCSS 403
>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
Length = 465
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 114/373 (30%), Positives = 170/373 (45%), Gaps = 50/373 (13%)
Query: 30 SVDDI-YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYN 88
SVD + Y++ L IGTP V +DTGSD +W QC+PC +C+ Q+ PLFDP SS+Y
Sbjct: 112 SVDSLEYVVTLGIGTPAVQQIVLIDTGSDLSWVQCKPCGAGECYAQKDPLFDPSSSSSYA 171
Query: 89 SISCSSSQCAVVTS-----NCSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSG 140
S+ C S C + + C+ G C Y YG A ++G +TETLT
Sbjct: 172 SVPCDSDACRKLAAGAYGHGCTSGAAALCEYGIEYGNRAT---TTGVYSTETLTLKPG-- 226
Query: 141 LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQ 198
V + + FGCG K G++GLG SL+SQ + G FSYCLP
Sbjct: 227 --VVVADFGFGCGDHQHG---PYEKFDGLLGLGGAPESLVSQTSSQFGGPFSYCLPPTSG 281
Query: 199 GSSKINFGG------IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS-TG 247
G+ + G A AG + TP+ + Y ++L ISVG L S+ +
Sbjct: 282 GAGFLALGAPNSSSSSTAAAGFLFTPMRRIPSVPTFYVVTLTGISVGGAPLAVPPSAFSS 341
Query: 248 NIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQPK-- 303
+ +D+G + T LP ++ L+S + + + P VL CY+ +
Sbjct: 342 GMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLL----PPSNGAVLDTCYDFTGHTNVT 397
Query: 304 FPEVTIHFR-GADVKLS-PSNLFRNISDEIMCSAFRGGNANIVYGRIMQIN---FLIGYD 358
P + + F GA + L+ P+ + + C AF G + G I +N F + YD
Sbjct: 398 VPTIALTFSGGATIDLATPAGVLVD-----GCLAFAGAGTDDTIGIIGNVNQRTFEVLYD 452
Query: 359 IEQAMVSFKPSRC 371
+ V F+ C
Sbjct: 453 SGKGTVGFRAGAC 465
>gi|125532796|gb|EAY79361.1| hypothetical protein OsI_34489 [Oryza sativa Indica Group]
Length = 405
Score = 128 bits (322), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 104/389 (26%), Positives = 170/389 (43%), Gaps = 48/389 (12%)
Query: 13 NETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCF 72
+ TP + + +S +Y+ + +IGTPP + VD + WTQC PC CF
Sbjct: 35 DATPPAAGGAVAVPIYLSSQGLYVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQP--CF 92
Query: 73 KQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNCSEGDCSYSFLYGRGAYASFSSGNLA 129
+Q+ PLFDP KSST+ + C S C + + NC+ C +Y A + G
Sbjct: 93 EQDLPLFDPTKSSTFRGLPCGSHLCESIPESSRNCTSDVC----IYEAPTKAGDTGGMAG 148
Query: 130 TETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG 189
T+T + + FGC + +GI+GLG SL++QM +
Sbjct: 149 TDTFAIGAAK------ETLGFGCVVMTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVT--- 199
Query: 190 KFSYCLPDQGSSKINFGGI---VAGAGVVSTPLIIRD-----------HYYLSLEAISVG 235
FSYCL + S + G +AG STP +I+ +Y + L I G
Sbjct: 200 AFSYCLAGKSSGALFLGATAKQLAGGKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKAG 259
Query: 236 NQRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLC 295
L+ SSS + +DT + L + LK ++ + QPV A P LC
Sbjct: 260 GAPLQAASSSGSTVLLDTVSRASYLADGAYKALKKALTAAVGVQPV----ASPPKPYDLC 315
Query: 296 YNISSQPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNANI----------V 344
++ + PE+ F GA + + P+N + +C G +A++ +
Sbjct: 316 FSKAVAGDAPELVFTFDGGAALTVPPANYLLASGNGTVCLTI-GSSASLNLTGELEGASI 374
Query: 345 YGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
G + Q N + +D+++ +SFKP+ C++
Sbjct: 375 LGSLQQENVHVLFDLKEETLSFKPADCSS 403
>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 476
Score = 128 bits (322), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 108/371 (29%), Positives = 171/371 (46%), Gaps = 40/371 (10%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCE---PCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +G+P + + VDTGSD W C CP+ + L+DP S T N++
Sbjct: 71 LYYTKVGLGSPAKEFYVQVDTGSDILWVNCAGCTACPKKSGLGMDLTLYDPNGSKTSNAV 130
Query: 91 SCSSSQCAVV----TSNCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
C C S C + C YS YG G S +SG+ ++LTF+ SG
Sbjct: 131 PCGDGFCTDTYSGPISGCKQDMSCPYSITYGDG---STTSGSFVNDSLTFDEVSGNLHTK 187
Query: 146 PN---VIFGCGHKNLASPTSDSKQT--GIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQ 198
P+ VIFGCG K S +S+S + GIIG G NSS++SQ+ S + FS+CL
Sbjct: 188 PDNSSVIFGCGAKQSGSLSSNSDEALDGIIGFGQANSSVLSQLAASGKVKRIFSHCLDSH 247
Query: 199 GSSKINFGGIVAGAGVVSTPLIIR-DHYYLSLEAISVGNQRL-----EFVSSSTGNIFVD 252
I G V +TPL+ R HY + L+ + V + + F S S +D
Sbjct: 248 HGGGIFSIGQVMEPKFNTTPLVPRMAHYNVILKDMDVDGEPILLPLYLFDSGSGRGTIID 307
Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQP-VKGVGAEPGFSDVLCYNISSQ--PKFPEVTI 309
+G LPL ++ L + ++ QP +K + E F+ C++ S + FP V
Sbjct: 308 SGTTLAYLPLSIYNQL---LPKVLGRQPGLKLMIVEDQFT---CFHYSDKLDEGFPVVKF 361
Query: 310 HFRGADVKLSPSNLFRNISDEIMCSAF-------RGGNANIVYGRIMQINFLIGYDIEQA 362
HF G + + P + ++I C + + G I+ G ++ N L+ YD+E
Sbjct: 362 HFEGLSLTVHPHDYLFLYKEDIYCIGWQKSSTQTKEGRDLILIGDLVLSNKLVVYDLENM 421
Query: 363 MVSFKPSRCTN 373
++ + C++
Sbjct: 422 VIGWTNFNCSS 432
>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
Length = 474
Score = 128 bits (322), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 113/362 (31%), Positives = 171/362 (47%), Gaps = 46/362 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP D DTGS TWTQC+PC C+ Q+ FDP KS++YN++SCSS
Sbjct: 135 YVVTVGLGTPKEDFTLVFDTGSGITWTQCQPCLG-SCYPQKEQKFDPTKSTSYNNVSCSS 193
Query: 95 SQCAVVTSN---CSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
+ C ++ ++ CS + C Y +YG +Y S G ATETLT +S+ N +
Sbjct: 194 ASCNLLPTSERGCSASNSTCLYQIIYGDQSY---SQGFFATETLTISSSD----VFTNFL 246
Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPG-------NSSLISQMGTSIAGKFSYCLPDQGSSK 202
FGCG N G+ G G + SL SQ +FSYCLP SS
Sbjct: 247 FGCGQSN----------NGLFGQAAGLLGLSSSSVSLPSQTAEKYQKQFSYCLPSTPSST 296
Query: 203 --INFGGIVA-GAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLR 257
+NFGG V+ AG Y + + ISV +L S +T +D+G +
Sbjct: 297 GYLNFGGKVSQTAGFTPISPAFSSFYGIDIVGISVAGSQLPIDPSIFTTSGAIIDSGTVI 356
Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFRGA- 314
T LP + LK + P K G E D CY+ S + FP+V++ F+G
Sbjct: 357 TRLPPTAYKALKEAFDEKMSNYP-KTNGDE--LLDT-CYDFSNYTTVSFPKVSVSFKGGV 412
Query: 315 DVKLSPSNLFRNISD-EIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSR 370
+V + S + ++ +++C AF + ++G Q + + YD + M+ F
Sbjct: 413 EVDIDASGILYLVNGVKMVCLAFAANKDDSEFGIFGNHQQKTYEVVYDGAKGMIGFAAGA 472
Query: 371 CT 372
C+
Sbjct: 473 CS 474
>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 455
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 112/383 (29%), Positives = 178/383 (46%), Gaps = 62/383 (16%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP-PLFDPKKSSTYNSISCS 93
Y + L IGTPP + DTGSD W +C PC +C + P F + S+TY++I C
Sbjct: 86 YFVSLRIGTPPQTLLLVADTGSDLIWVKCSPC--RNCSHRSPGSAFFARHSTTYSAIHCY 143
Query: 94 SSQCAVV----TSNCSE----GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
S QC +V + C+ C Y + Y A +S ++G + E LT N+++G ++
Sbjct: 144 SPQCQLVPHPHPNPCNRTRLHSPCRYQYTY---ADSSTTTGFFSKEALTLNTSTGKVKKL 200
Query: 146 PNVIFGCGHK----NLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG-- 199
+ FGCG + +L + + Q G++GLG S SQ+G KFSYCL D
Sbjct: 201 NGLSFGCGFRISGPSLTGASFEGAQ-GVMGLGRAPISFSSQLGRRFGSKFSYCLMDYTLS 259
Query: 200 ---SSKINFGG----IVAGAGVVS-TPLIIR----DHYYLSLEAISVGNQRLEFVSS--- 244
+S + GG V+ G++S TPL+I YY++++ + V +L S
Sbjct: 260 PPPTSFLTIGGAQNVAVSKKGIMSFTPLLINPLSPTFYYIAIKGVYVNGVKLPINPSVWS 319
Query: 245 ----STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIK----AQPVKGVGAEPGFSDVLCY 296
G +D+G T + ++ + +K A+P PGF LC
Sbjct: 320 IDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKRVKLPSPAEPT------PGFD--LCM 371
Query: 297 NIS--SQPKFPEVTIHFRGADV-KLSPSNLFRNISDEIMCSAFR-----GGNANIVYGRI 348
N+S ++P P ++ + G V P N F D+I C A + GG + V G +
Sbjct: 372 NVSGVTRPALPRMSFNLAGGSVFSPPPRNYFIETGDQIKCLAVQPVSQDGGFS--VLGNL 429
Query: 349 MQINFLIGYDIEQAMVSFKPSRC 371
MQ FL+ +D +++ + F C
Sbjct: 430 MQQGFLLEFDRDKSRLGFTRRGC 452
>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 128 bits (321), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 114/363 (31%), Positives = 168/363 (46%), Gaps = 53/363 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + + IG PP + +DTGSD +W QC PC E C++Q P+FDP S++Y+ I C +
Sbjct: 149 YFLRVGIGKPPSQAYVVLDTGSDVSWIQCAPCSE--CYQQSDPIFDPVSSNSYSPIRCDA 206
Query: 95 SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
QC ++ S C G C Y YG G+Y + G ATET+T + + + NV GCG
Sbjct: 207 PQCKSLDLSECRNGTCLYEVSYGDGSY---TVGEFATETVTLGTAA-----VENVAIGCG 258
Query: 154 HKN--LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFGGI 208
H N L + G L S +Q+ A FSYCL ++ S S + F
Sbjct: 259 HNNEGLFVGAAGLLGLGGGKL-----SFPAQVN---ATSFSYCLVNRDSDAVSTLEFNSP 310
Query: 209 VAGAGVVSTPLI----IRDHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLR 257
+ VV+ PL + YYL L+ ISVG + L E + G I +D+G
Sbjct: 311 LP-RNVVTAPLRRNPELDTFYYLGLKGISVGGEALPIPESIFEVDAIGGGGIIIDSGTAV 369
Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNISSQP--KFPEVTIHF 311
T L E + L+ VKG P + V CY++SS+ + P V+ HF
Sbjct: 370 TRLRSEVYDALRDAF--------VKGAKGIPKANGVSLFDTCYDLSSRESVQVPTVSFHF 421
Query: 312 -RGADVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKP 368
G ++ L N + S C AF +++ + G + Q +G+DI ++V F
Sbjct: 422 PEGRELPLPARNYLIPVDSVGTFCFAFAPTTSSLSIMGNVQQQGTRVGFDIANSLVGFSA 481
Query: 369 SRC 371
C
Sbjct: 482 DSC 484
>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 482
Score = 128 bits (321), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 106/365 (29%), Positives = 162/365 (44%), Gaps = 53/365 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + + +G+PP + + +D+GSD W QC+PC C++Q P+FDP SS++ +SC S
Sbjct: 143 YFVRIGVGSPPRNQYMVIDSGSDIVWVQCKPCSR--CYQQSDPVFDPADSSSFAGVSCGS 200
Query: 95 SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
C + + C+ G C Y YG G+Y + G LA ETLT V + +V GCG
Sbjct: 201 DVCDRLENTGCNAGRCRYEVSYGDGSY---TKGTLALETLTVGQ-----VMIRDVAIGCG 252
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSS-----KINFGGI 208
H N + G+ G + S I Q+G G FSYCL +G+ + G +
Sbjct: 253 HTNQGMFIGAAGLLGLGGG---SMSFIGQLGGQTGGAFSYCLVSRGTGSTGALEFGRGAL 309
Query: 209 VAGAGVVSTPLIIRD-----HYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVL 256
GA +S +IR+ YY+ L I VG R+ + T + +DTG
Sbjct: 310 PVGATWIS---LIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEYGTNGVVMDTGTA 366
Query: 257 RTLLP----LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTI 309
T P + + + + SN+ +A PG S CY+++ + P V+
Sbjct: 367 VTRFPTAAYVAFRDSFTAQTSNLPRA---------PGVSIFDTCYDLNGFESVRVPTVSF 417
Query: 310 HFRGADVKLSPSNLFRNISD--EIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSF 366
+F V P+ F D C AF + + + G I Q I +D V F
Sbjct: 418 YFSDGPVLTLPARNFLIPVDGGGTFCLAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGF 477
Query: 367 KPSRC 371
P+ C
Sbjct: 478 GPNIC 482
>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
Length = 429
Score = 127 bits (320), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 117/387 (30%), Positives = 175/387 (45%), Gaps = 57/387 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP-------CPELDCFKQEPPLFDPKKSSTY 87
YL+ ++ GTPP ++ DTGSD W QC CP+ C ++ P F KS+T
Sbjct: 53 YLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRR--PAFVASKSATL 110
Query: 88 NSISCSSSQCAVVTSNCSEG---------DCSYSFLYGRGAYASFSSGNLATETLTF-NS 137
+ + CS++QC +V + G C Y++ Y G S ++G LA +T T N
Sbjct: 111 SVVPCSAAQCLLVPAPRGHGPACSPAAPVPCGYAYDYADG---SSTTGFLARDTATISNG 167
Query: 138 TSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD 197
TSG + V FGCG +N S S G+IGLG G S +Q G+ A FSYCL D
Sbjct: 168 TSG-GAAVRGVAFGCGTRNQGG--SFSGTGGVIGLGQGQLSFPAQSGSLFAQTFSYCLLD 224
Query: 198 -------QGSSKINFGGIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS- 245
+ SS + G A TPL+ YY+ + AI VGN+ L S
Sbjct: 225 LEGGRRGRSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEW 284
Query: 246 ------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNI 298
G +D+G T L L + +L S + + + + F + LCYN+
Sbjct: 285 AIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIP--SSATFFQGLELCYNV 342
Query: 299 S-------SQPKFPEVTIHF-RGADVKLSPSNLFRNISDEIMCSAFR---GGNANIVYGR 347
S + FP +TI F +G ++L N +++D++ C A R A V G
Sbjct: 343 SSSSSSAPANGGFPRLTIDFAQGLSLELPTGNYLVDVADDVKCLAIRPTLSPFAFNVLGN 402
Query: 348 IMQINFLIGYDIEQAMVSFKPSRCTNY 374
+MQ + + +D A + F + C +
Sbjct: 403 LMQQGYHVEFDRASARIGFARTECVAH 429
>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
Length = 488
Score = 127 bits (320), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 106/359 (29%), Positives = 168/359 (46%), Gaps = 32/359 (8%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+ L +GTP ++ +DTGSD +W QC+PC DC++Q P+FDP SSTY+++ C +
Sbjct: 139 YVASLRLGTPATELVVELDTGSDQSWVQCKPC--ADCYEQRDPVFDPTASSTYSAVPCGA 196
Query: 95 SQCAVVTSNCSEG--------DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE-- 144
+C + S+ S +C Y Y S + G+LA +TLT + +
Sbjct: 197 RECQELASSSSSRNCSSDNNKNCPYEVSYDDD---SHTVGDLARDTLTLSPSPSPSPADT 253
Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK-- 202
+P +FGCGH N + + G++GLG G +SL SQ+ FSYCLP S+
Sbjct: 254 VPGFVFGCGHSNAG---TFGEVDGLLGLGLGKASLPSQVAARYGAAFSYCLPSSPSAAGY 310
Query: 203 INFGGIVAGAGVVSTPLIIRDH---YYLSLEAISVGNQRLEFVSSS---TGNIFVDTGVL 256
++FGG A A T ++ YYL+L I V + ++ +S+ +D+G
Sbjct: 311 LSFGGAAARANAQFTEMVTGQDPTSYYLNLTGIVVAGRAIKVPASAFATAAGTIIDSGTA 370
Query: 257 RTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFR-G 313
+ LP ++ L+S + + K + P F CY+ + + P V + F G
Sbjct: 371 FSRLPPSAYAALRSSFRSAMGRYRYKRAPSSPIFD--TCYDFTGHETVRIPAVELVFADG 428
Query: 314 ADVKLSPSNLFRNISDEIM-CSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
A V L PS + +D C AF + + G Q + YD+ + F C
Sbjct: 429 ATVHLHPSGVLYTWNDVAQTCLAFVPNHDLGILGNTQQRTLAVIYDVGSQRIGFGRKGC 487
>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 500
Score = 127 bits (320), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 111/363 (30%), Positives = 164/363 (45%), Gaps = 51/363 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +G P ++ +DTGSD TW QC+PC DC+ Q P++DP S++Y ++ C S
Sbjct: 163 YFSRVGVGRPARQLYMVLDTGSDVTWLQCQPC--ADCYAQSDPVYDPSVSTSYATVGCDS 220
Query: 95 SQCAVVTSNC---SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
+C + + S G C Y YG G+Y + G+ ATETLT ++ PV NV G
Sbjct: 221 PRCRDLDAAACRNSTGSCLYEVAYGDGSY---TVGDFATETLTLGDSA--PVS--NVAIG 273
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ---GSSKINFGGI 208
CGH N + + G S IS A FSYCL D+ SS + FG
Sbjct: 274 CGHDNEGLFVGAAGLLALGGGPLSFPSQIS------ATTFSYCLVDRDSPSSSTLQFGDS 327
Query: 209 VAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS-------TGNIFVDTGVLR 257
A V+ PLI YY++L ISVG + L SS+ +G + VD+G
Sbjct: 328 EQPA--VTAPLIRSPRTNTFYYVALSGISVGGEALSIPSSAFAMDDAGSGGVIVDSGTAV 385
Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNIS--SQPKFPEVTIHF 311
T L + L+ V+G + P S V CY+++ S + P V + F
Sbjct: 386 TRLQSGAYGALREAF--------VQGTQSLPRASGVSLFDTCYDLAGRSSVQVPAVALWF 437
Query: 312 R-GADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKP 368
G ++KL N + C AF G + + + G + Q + +D + V F
Sbjct: 438 EGGGELKLPAKNYLIPVDAAGTYCLAFAGTSGPVSIIGNVQQQGVRVSFDTAKNTVGFTA 497
Query: 369 SRC 371
+C
Sbjct: 498 DKC 500
>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 557
Score = 127 bits (320), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 114/384 (29%), Positives = 166/384 (43%), Gaps = 64/384 (16%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y M + IGTPP +DTGSD W QC PC DCF Q P +DPK+SS++ +I C
Sbjct: 192 YFMDVFIGTPPRHFSLILDTGSDLNWIQCVPC--YDCFVQNGPYYDPKESSSFKNIGCHD 249
Query: 95 SQCAVVTS-------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV---- 143
+C +V+S C Y + YG +S ++G+ A ET T N TS P
Sbjct: 250 PRCHLVSSPDPPQPCKAENQTCPYFYWYGD---SSNTTGDFALETFTVNLTS--PAGKSE 304
Query: 144 --EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSS 201
+ NV+FGCGH N + G+ S SQ+ + FSYCL D+ S
Sbjct: 305 FKRVENVMFGCGHWNRGLFHGAAGLLGLGRG---PLSFSSQLQSLYGHSFSYCLVDRNSD 361
Query: 202 -------------------KINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFV 242
++NF +VAG P + YY+ +++I VG + L+
Sbjct: 362 TNVSSKLIFGEDKDLLNHPEVNFTSLVAGK---ENP--VDTFYYVQIKSIMVGGEVLKIP 416
Query: 243 SSS-------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLC 295
+ G VD+G + + +K +K PV + P C
Sbjct: 417 EETWHLSPEGAGGTIVDSGTTLSYFAEPSYEIIKDAFVKKVKGYPV--IKDFPILDP--C 472
Query: 296 YNISSQPK--FPEVTIHFR-GADVKLSPSNLFRNIS-DEIMCSAFRG--GNANIVYGRIM 349
YN+S K PE I F GA N F + +EI+C A G +A + G
Sbjct: 473 YNVSGVEKMELPEFRILFEDGAVWNFPVENYFIKLEPEEIVCLAILGTPRSALSIIGNYQ 532
Query: 350 QINFLIGYDIEQAMVSFKPSRCTN 373
Q NF I YD +++ + + P +C +
Sbjct: 533 QQNFHILYDTKKSRLGYAPMKCAD 556
>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
Length = 458
Score = 127 bits (320), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 117/355 (32%), Positives = 175/355 (49%), Gaps = 36/355 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + IG+P V S+DTGSD +W QC+PC + C + LFDP SSTY+ SCSS
Sbjct: 122 YVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQ--CHSEVDSLFDPSSSSTYSPFSCSS 179
Query: 95 SQCAVVTSN-----CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
+ CA ++ + C C Y YG + + + +++TLT S++ M +
Sbjct: 180 APCAQLSQSQEGNGCMSSQCQYIVNYGDSSSTTGTY---SSDTLTLGSSA-----MTDFQ 231
Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIV 209
FGC S + + G++GLG G SL SQ + FSYCLP S
Sbjct: 232 FGCSQSE--SGGFNDQTDGLMGLGGGAQSLASQTAGTFGTAFSYCLPPTSGSSGFLTLGT 289
Query: 210 AGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLE 263
+G V TP++ I +Y + LE+I VG+Q+L +S S G++ +D+G + T LP
Sbjct: 290 GSSGFVKTPMLRSTQIPTYYVVLLESIKVGSQQLNLPTSVFSAGSL-MDSGTIITRLPPT 348
Query: 264 YHSNLKSVMSNMIKAQPVKGVGAEP-GFSDVLCYNISSQP--KFPEVTIHFR-GADVKLS 319
+S L S ++ P A P G D C++ S Q P VT+ F GA V L+
Sbjct: 349 AYSALSSAFKAGMQQYPP----ATPSGILDT-CFDFSGQSSISIPTVTLVFSGGAAVDLA 403
Query: 320 PSNLFRNISDEIMCSAF--RGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ IS I C AF G ++++ + G + Q F + YD+ V FK C
Sbjct: 404 FDGIMLEISSSIRCLAFTPNGDDSSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 458
>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 495
Score = 127 bits (320), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 107/359 (29%), Positives = 167/359 (46%), Gaps = 44/359 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +G P + +DTGSD W QC+PC DC++Q P+F P SS+Y+ ++C S
Sbjct: 159 YFTRVGVGNPAKSYYMVLDTGSDINWIQCQPCS--DCYQQSDPIFTPAASSSYSPLTCDS 216
Query: 95 SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
QC ++ S+C G C Y YG G SF+ G+ TET++F + + ++ GCG
Sbjct: 217 QQCNSLQMSSCRNGQCRYQVNYGDG---SFTFGDFVTETMSFGGSG----TVNSIALGCG 269
Query: 154 HKN--LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ---GSSKINFGGI 208
H N L + G L SL SQ+ A FSYCL ++ SS ++F
Sbjct: 270 HDNEGLFVGAAGLLGLGGGPL-----SLTSQLK---ATSFSYCLVNRDSAASSTLDFNSA 321
Query: 209 VAGAGVVSTPLI----IRDHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLR 257
G V++ PL+ I YY+ L +SVG + L + S G + VD G
Sbjct: 322 PVGDSVIA-PLLKSSKIDTFYYVGLSGMSVGGELLRIPQEVFKLDDSGDGGVIVDCGTAI 380
Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFRGAD 315
T L E +++L+ +M ++ ++ F CY++S Q K P V+ HF G
Sbjct: 381 TRLQSEAYNSLRDSFVSM--SRHLRSTSGVALFD--TCYDLSGQSSVKVPTVSFHFDGGK 436
Query: 316 VKLSPSN--LFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
P+ L S C AF +++ + G + Q + +D+ V F ++C
Sbjct: 437 SWDLPAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVSFDLANNRVGFSTNKC 495
>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 488
Score = 127 bits (319), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 114/377 (30%), Positives = 173/377 (45%), Gaps = 53/377 (14%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + IGTPP VDTGSD W C CP + L+DPK SS+ +++
Sbjct: 82 LYYTEIEIGTPPKQYHVQVDTGSDILWVNCISCNKCPRKSDLGIDLRLYDPKGSSSGSTV 141
Query: 91 SCSSSQCAVVTS----NCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSG---LP 142
SC CA C++ C YS +YG G S ++G +++L +N SG
Sbjct: 142 SCDQKFCAATYGGKLPGCAKNIPCEYSVMYGDG---SSTTGYFVSDSLQYNQVSGDGQTR 198
Query: 143 VEMPNVIFGCGHKNLASPTSDSKQ-TGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQG 199
+VIFGCG + S ++ GIIG G N+S++SQ+ + + FS+CL
Sbjct: 199 HANASVIFGCGAQQGGDLGSTNQALDGIIGFGQSNTSMLSQLAAAGEVKKIFSHCL---- 254
Query: 200 SSKINFGGIVAGAGVV-----STPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGN 248
I GGI A VV STPL+ HY ++LE+I+VG L+ F +
Sbjct: 255 -DTIKGGGIFAIGDVVQPKVKSTPLVPDMPHYNVNLESINVGGTTLQLPSHMFETGEKKG 313
Query: 249 IFVDTGVLRTLLP-LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLC--YNISSQPKFP 305
+D+G T LP L Y K V++ + P + D LC Y S FP
Sbjct: 314 TIIDSGTTLTYLPELVY----KDVLAAVFAKHPDTTFHS---VQDFLCIQYFQSVDDGFP 366
Query: 306 EVTIHFRGADVKLS--PSNLFRNISDEIMCSAFRGGNAN-------IVYGRIMQINFLIG 356
++T HF D+ L+ P + F D + C F+ G ++ G ++ N ++
Sbjct: 367 KITFHFE-DDLGLNVYPHDYFFQNGDNLYCFGFQNGGLQSKDGKDMVLLGDLVLSNKVVV 425
Query: 357 YDIEQAMVSFKPSRCTN 373
YD+E +V + C++
Sbjct: 426 YDLENQVVGWTDYNCSS 442
>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 496
Score = 127 bits (319), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 109/360 (30%), Positives = 165/360 (45%), Gaps = 46/360 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + + IG P + +DTGSD W QC+PC DC++Q P+FDP SS+++ + C +
Sbjct: 160 YFLRVGIGRPSKTFYMVIDTGSDVNWLQCKPCD--DCYQQVDPIFDPASSSSFSRLGCQT 217
Query: 95 SQCA-VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
QC + C C Y YG G+Y + G+ ATET++F ++ + V GCG
Sbjct: 218 PQCRNLDVFACRNDSCLYQVSYGDGSY---TVGDFATETVSFGNSG----SVDKVAIGCG 270
Query: 154 HKN--LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ---GSSKINFGGI 208
H N L + G L SL SQ+ A FSYCL ++ SS + F
Sbjct: 271 HDNEGLFVGAAGLIGLGGGPL-----SLTSQIK---ASSFSYCLVNRDSVDSSTLEFNS- 321
Query: 209 VAGAGVVSTPLI----IRDHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLR 257
+ V+ P+ + YY+ + +SVG ++L E S G I VD G
Sbjct: 322 AKPSDSVTAPIFKNSKVDTFYYVGITGMSVGGEKLAIPPSIFEVDGSGKGGIIVDCGTAV 381
Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP--KFPEVTIHFRGA 314
T L + ++ L+ + K P + GF+ CYN+SS+ + P V F G
Sbjct: 382 TRLQTQAYNALRDTFVKLTKDLP-----STSGFALFDTCYNLSSRTSVRVPTVAFLFDGG 436
Query: 315 D-VKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ L PSN + S C AF A++ + G + Q + YD+ + VSF +C
Sbjct: 437 KSLPLPPSNYLIPVDSAGTFCLAFAPTTASLSIIGNVQQQGTRVTYDLANSQVSFSSRKC 496
>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 561
Score = 127 bits (319), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 114/381 (29%), Positives = 170/381 (44%), Gaps = 59/381 (15%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y M + +GTPP +DTGSD W QC PC + CF+Q P +DPK SS++ +ISC
Sbjct: 197 YFMDVFVGTPPKHFSLILDTGSDLNWIQCVPC--IACFEQSGPYYDPKDSSSFRNISCHD 254
Query: 95 SQCAVVTS-------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTF-----NSTSGLP 142
+C +V++ C Y + YG G S ++G+ A ET T N TS L
Sbjct: 255 PRCQLVSAPDPPKPCKAENQSCPYFYWYGDG---SNTTGDFALETFTVNLTTPNGTSELK 311
Query: 143 VEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG--- 199
+ NV+FGCGH N + G+ S SQM + FSYCL D+
Sbjct: 312 -HVENVMFGCGHWNRGLFHGAAGLLGLGKG---PLSFASQMQSLYGQSFSYCLVDRNSNA 367
Query: 200 --SSKINFGGIVAGAGVVSTPLI------------IRDHYYLSLEAISVGNQRLE----- 240
SSK+ FG ++S P + + YY+ ++++ V ++ L+
Sbjct: 368 SVSSKLIFG---EDKELLSHPNLNFTSFGGGKDGSVDTFYYVQIKSVMVDDEVLKIPEET 424
Query: 241 --FVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKA-QPVKGVGAEPGFSDVLCYN 297
S G +D+G T + +K IK Q V+G+ P CYN
Sbjct: 425 WHLSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYQLVEGL---PPLKP--CYN 479
Query: 298 ISSQPK--FPEVTIHFRGADVKLSP-SNLFRNISDEIMCSAFRGG--NANIVYGRIMQIN 352
+S K P+ I F V P N F I E++C A G +A + G Q N
Sbjct: 480 VSGIEKMELPDFGILFADEAVWNFPVENYFIWIDPEVVCLAILGNPRSALSIIGNYQQQN 539
Query: 353 FLIGYDIEQAMVSFKPSRCTN 373
F I YD++++ + + P +C +
Sbjct: 540 FHILYDMKKSRLGYAPMKCAD 560
>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
Length = 472
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 119/372 (31%), Positives = 175/372 (47%), Gaps = 50/372 (13%)
Query: 30 SVDDI-YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYN 88
+VD + Y++ L IGTP V +DTGSD +W QC+PC C+ Q+ PL+DP SSTY
Sbjct: 121 AVDSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNSSSCYPQKDPLYDPTASSTYA 180
Query: 89 SISCSSSQCAVVTSNCSEGDCS---------YSFLYGRGAYASFSSGNLATETLTFNSTS 139
+ C S C + + + C+ Y YG + G +TETLT +
Sbjct: 181 PVPCDSKACKDLVPDAYDHGCTNSSGTSLCQYGIEYGN---RDTTVGVYSTETLTLSPQ- 236
Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG 199
V + + FGCG L + G++GLG SL+SQ + G FSYCLP G
Sbjct: 237 ---VSVKDFGFGCG---LVQQGTFDLFDGLLGLGGAPESLVSQTAETYGGAFSYCLP-PG 289
Query: 200 SSKINFGGIVA------GAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSS-TGN 248
+S F + A AG + TPL Y ++L +SVG + L+ + +G
Sbjct: 290 NSTTGFLALGAPTNNNDTAGFLFTPLHSLPEQATFYLVNLTGVSVGGKPLDIPPTVLSGG 349
Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNIS--SQPKF 304
+ +D+G + T LP +S L++ + A P+ P DVL CYN + +
Sbjct: 350 MIIDSGTIITGLPDTAYSALRTAFRTAMSAYPL----LPPNNDDVLDTCYNFTGIANVTV 405
Query: 305 PEVTIHFR-GADVKLS-PSNLFRNISDEIMCSAFRGGNAN---IVYGRIMQINFLIGYDI 359
P V + F GA + L PS + I D C AF GG ++ + G + Q F + YD
Sbjct: 406 PTVALTFDGGATIDLDVPSGVL--IQD---CLAFAGGASDGDVGIIGNVNQRTFEVLYDS 460
Query: 360 EQAMVSFKPSRC 371
+ V F+P C
Sbjct: 461 GRGHVGFRPGAC 472
>gi|357166728|ref|XP_003580821.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 111/373 (29%), Positives = 166/373 (44%), Gaps = 52/373 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YL+ + IG+PP++ DTGSD W QC PC DC+ Q PLFDP S++++ + C+S
Sbjct: 123 YLVRVGIGSPPLEQHLVADTGSDVIWVQCSPCS--DCYAQGDPLFDPANSASFSPVPCNS 180
Query: 95 SQCAVVTS------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
C G+C Y YG S+++G LA ETLT + + E+ V
Sbjct: 181 GVCRAAARYSSSSCGGGGGECEYKVSYGD---KSYTNGVLALETLTLDGGT----EVQGV 233
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGI 208
GCGH+N ++ G++GLG G SL+ Q+G + G FSYCL S + + G
Sbjct: 234 AMGCGHENRG---LFAEAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLAGYYSGEGSGSGS 290
Query: 209 V-------AGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEF-------VSSSTGNIF 250
+ A G V PL+ YY+ + + V +RL+ G +
Sbjct: 291 LVLGREDAAPTGAVWVPLVRNPDAPSFYYVGVNGLGVAGERLQLQDGLFDLGDDGGGGVV 350
Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEV 307
+DTG T LP E ++ L+ + + +G PG S CY++S + + P V
Sbjct: 351 MDTGTAVTRLPAEAYAALRGAFAGAFE----EGAPRAPGVSLFDTCYDLSGYASVRVPTV 406
Query: 308 TIHF-------RGADVKLSPSNLFRNISD-EIMCSAFRG-GNANIVYGRIMQINFLIGYD 358
++F A + L NL + D C AF + + G I Q I D
Sbjct: 407 ALYFGGGGQGQEAASLTLPARNLLVPVDDGGTYCLAFAAVASGPSILGNIQQQGIEITVD 466
Query: 359 IEQAMVSFKPSRC 371
V F P+ C
Sbjct: 467 SASGYVGFGPATC 479
>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
Length = 443
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 104/368 (28%), Positives = 159/368 (43%), Gaps = 39/368 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP D+ DTGSD +W QC PC C+ Q+ PLF P SST++++ C
Sbjct: 85 YVVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYHQQDPLFAPSSSSTFSAVRCGE 144
Query: 95 SQCAVVTSNCSE--GD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV------E 144
+C +CS GD C Y +YG S + G+L +TLT +T +
Sbjct: 145 PECPRARQSCSSSPGDDRCPYEVVYGD---KSRTVGHLGNDTLTLGTTPSTNASENNSNK 201
Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKIN 204
+P +FGCG N K G+ GLG G SL SQ FSYCLP S+
Sbjct: 202 LPGFVFGCGENNTG---LFGKADGLFGLGRGKVSLSSQAAGKYGEGFSYCLPSSSSNAHG 258
Query: 205 FGGI----VAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSST---GNIFVDT 253
+ + A A TP++ R + YY+ L I V + ++ S + VD+
Sbjct: 259 YLSLGTPAPAPAHARFTPMLNRSNTPSFYYVKLVGIRVAGRAIKVSSRPALWPAGLIVDS 318
Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQPK----FPEVT 308
G + T L +S L++ + A G P S + CY+ ++ P V
Sbjct: 319 GTVITRLAPRAYSALRTA---FLSAMGKYGYKRAPRLSILDTCYDFTAHANATVSIPAVA 375
Query: 309 IHFR-GADVKLSPSNLFRNISDEIMCSAFR---GGNANIVYGRIMQINFLIGYDIEQAMV 364
+ F GA + + S + C AF G + + G Q + YD+ + +
Sbjct: 376 LVFAGGATISVDFSGVLYVAKVAQACLAFAPNGNGRSAGILGNTQQRTVAVVYDVGRQKI 435
Query: 365 SFKPSRCT 372
F C+
Sbjct: 436 GFAAKGCS 443
>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 108/373 (28%), Positives = 178/373 (47%), Gaps = 59/373 (15%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTY-----NS 89
Y++ + +G + + VDTGSD +W QC+PC C+ Q+ P+F+P KS +Y NS
Sbjct: 66 YIVTVELGGRKMTVI--VDTGSDLSWVQCQPCNR--CYNQQDPVFNPSKSPSYRTVLCNS 121
Query: 90 ISCSSSQCAVVTSNCSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
++C S Q A S + C+Y YG G+Y +SG + E L +T+ +
Sbjct: 122 LTCRSLQLATGNSGVCGSNPPTCNYVVNYGDGSY---TSGEVGMEHLNLGNTT-----VN 173
Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD---QGSSKI 203
N IFGCG KN S G++GLG + SLISQ+ G FSYCLP + S +
Sbjct: 174 NFIFGCGRKNQGLFGGAS---GLVGLGRTDLSLISQISPMFGGVFSYCLPTTEAEASGSL 230
Query: 204 NFGGI---------VAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTG 254
GG ++ ++ PL+ Y+L+L I+VG ++ S + +D+G
Sbjct: 231 VMGGNSSVYKNTTPISYTRMIHNPLL--PFYFLNLTGITVGGVEVQAPSFGKDRMIIDSG 288
Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHF 311
+ + LP + LK+ P + P F + C+N+S + K P++ ++F
Sbjct: 289 TVISRLPPSIYQALKAEFVKQFSGYP-----SAPSFMILDSCFNLSGYQEVKIPDIKMYF 343
Query: 312 RG-ADVKLSPSNLFRNISDEI--MCSAFRGGNANIVY-------GRIMQINFLIGYDIEQ 361
G A++ + + +F ++ + +C A A++ Y G Q N I YD +
Sbjct: 344 EGSAELNVDVTGVFYSVKTDASQVCLAI----ASLPYEDEVGIIGNYQQKNQRIIYDTKG 399
Query: 362 AMVSFKPSRCTNY 374
+M+ F C+ Y
Sbjct: 400 SMLGFAEEACSFY 412
>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 115/359 (32%), Positives = 168/359 (46%), Gaps = 41/359 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCP-ELDCFKQEPPLFDPKKSSTYNSISCS 93
Y + +G P F DTGSD +W QC+PC E C+KQ P+FDPK SS+Y+ +SC
Sbjct: 184 YFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDPKSSSSYSPLSCD 243
Query: 94 SSQCAVV-TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
S QC ++ + C C Y YG G SF+ G LATET +F ++ + PN+ GC
Sbjct: 244 SEQCHLLDEAACDANSCIYEVEYGDG---SFTVGELATETFSFRHSNSI----PNLPIGC 296
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD---QGSSKINFGGIV 209
GH N + G+ G SL SQ+ A FSYCL D + SS ++F
Sbjct: 297 GHDNEGLFVGAAGLIGLGGG---AISLSSQLE---ATSFSYCLVDLDSESSSTLDFNADQ 350
Query: 210 AGAGVVSTPLIIRDHY----YLSLEAISVGNQRLEFVSSS-------TGNIFVDTGVLRT 258
+ S PL+ D + Y+ + +SVG + L SSS +G I VD+G T
Sbjct: 351 PSDSLTS-PLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSGTTIT 409
Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP--KFPEVTIHFRGAD 315
+P + + L+ + K P PG S CY++SSQ + P + G +
Sbjct: 410 EIPSDVYDVLRDAFVGLTKNLP-----PAPGVSPFDTCYDLSSQSNVEVPTIAFILPGEN 464
Query: 316 VKLSPSN--LFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
P+ LF+ S C AF + + G + Q + YD+ ++V F +C
Sbjct: 465 SLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANSLVGFSTDKC 523
>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 488
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 107/370 (28%), Positives = 165/370 (44%), Gaps = 40/370 (10%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +G PP D + VDTGSD W C+ CP + L+DP+ S++ I
Sbjct: 81 LYFAKIGLGNPPKDYYVQVDTGSDILWVNCANCDKCPTKSDLGVKLTLYDPQSSTSATRI 140
Query: 91 SCSSSQCAVVTSNCSEG-----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSG-LPVE 144
C CA + +G C YS +YG G S ++G + L F+ +G L
Sbjct: 141 YCDDDFCAATYNGVLQGCTKDLPCQYSVVYGDG---SSTAGFFVKDNLQFDRVTGNLQTS 197
Query: 145 MPN--VIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGTSIAGK----FSYCLPD 197
N VIFGCG K TS GI+G G NSS+ISQ+ + AGK F++CL +
Sbjct: 198 SANGSVIFGCGAKQSGELGTSSEALDGILGFGQANSSMISQL--AAAGKVKRVFAHCLDN 255
Query: 198 QGSSKINFGGIVAGAGVVSTPLIIRD-HYYLSLEAISVGNQRLE-----FVSSSTGNIFV 251
I G V V +TP++ HY + ++ I VG LE F + +
Sbjct: 256 VKGGGIFAIGEVVSPKVNTTPMVPNQPHYNVVMKEIEVGGNVLELPTDIFDTGDRRGTII 315
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQP-VKGVGAEPGFSDVLCYNISSQPKFPEVTIH 310
D+G LP + +S+M+ ++ QP +K E F+ Y + FP V H
Sbjct: 316 DSGTTLAYLPEVVY---ESMMTKIVSEQPGLKLHTVEEQFT-CFQYTGNVNEGFPVVKFH 371
Query: 311 FRGA-DVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQI-------NFLIGYDIEQA 362
F G+ + ++P + I +E+ C ++ GR M + N L+ YD+E
Sbjct: 372 FNGSLSLTVNPHDYLFQIHEEVWCFGWQNSGMQSKDGRDMTLLGDLVLSNKLVLYDLENQ 431
Query: 363 MVSFKPSRCT 372
+ + C+
Sbjct: 432 AIGWTDYNCS 441
>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 752
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 114/377 (30%), Positives = 168/377 (44%), Gaps = 53/377 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + + IG+PP +DTGSD W QC PC DCF+Q P +DPK S ++ +I+C+
Sbjct: 196 YFIDVFIGSPPKHFSLILDTGSDLNWIQCVPC--FDCFEQNGPYYDPKDSISFRNITCND 253
Query: 95 SQCAVVTS-------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP----- 142
+C +V+S C Y + YG +S ++G+ A ET T N TS
Sbjct: 254 PRCQLVSSPDPPRPCKFETQSCPYFYWYGD---SSNTTGDFALETFTVNLTSSTTGKSEF 310
Query: 143 VEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG--- 199
+ NV+FGCGH N + G+ S SQ+ + FSYCL D+
Sbjct: 311 RRVENVMFGCGHWNRGLFHGAAGLLGLGRG---PLSFSSQLQSLYGHSFSYCLVDRDSDT 367
Query: 200 --SSKINFG---GIVAGAGVVSTPLI------IRDHYYLSLEAISVGNQRLE-------F 241
SSK+ FG ++ + T LI + YYL +++I VG ++L+
Sbjct: 368 SVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQIPEENWNL 427
Query: 242 VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL-CYNIS- 299
+ G +D+G L Y S+ + + VKG F + CYN+S
Sbjct: 428 SADGAGGTIIDSGTT-----LSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSG 482
Query: 300 -SQPKFPEVTIHFR-GADVKLSPSNLFRNISD-EIMCSAFRG--GNANIVYGRIMQINFL 354
+ FPE I F GA N F I +I+C A G +A + G Q NF
Sbjct: 483 TDELNFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSALSIIGNYQQQNFH 542
Query: 355 IGYDIEQAMVSFKPSRC 371
I YD + + + + P RC
Sbjct: 543 ILYDTKNSRLGYAPMRC 559
>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 757
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 114/377 (30%), Positives = 168/377 (44%), Gaps = 53/377 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + + IG+PP +DTGSD W QC PC DCF+Q P +DPK S ++ +I+C+
Sbjct: 196 YFIDVFIGSPPKHFSLILDTGSDLNWIQCVPC--FDCFEQNGPYYDPKDSISFRNITCND 253
Query: 95 SQCAVVTS-------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP----- 142
+C +V+S C Y + YG +S ++G+ A ET T N TS
Sbjct: 254 PRCQLVSSPDPPRPCKFETQSCPYFYWYGD---SSNTTGDFALETFTVNLTSSTTGKSEF 310
Query: 143 VEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG--- 199
+ NV+FGCGH N + G+ S SQ+ + FSYCL D+
Sbjct: 311 RRVENVMFGCGHWNRGLFHGAAGLLGLGRG---PLSFSSQLQSLYGHSFSYCLVDRDSDT 367
Query: 200 --SSKINFG---GIVAGAGVVSTPLI------IRDHYYLSLEAISVGNQRLE-------F 241
SSK+ FG ++ + T LI + YYL +++I VG ++L+
Sbjct: 368 SVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQIPEENWNL 427
Query: 242 VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL-CYNIS- 299
+ G +D+G L Y S+ + + VKG F + CYN+S
Sbjct: 428 SADGAGGTIIDSGTT-----LSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSG 482
Query: 300 -SQPKFPEVTIHFR-GADVKLSPSNLFRNISD-EIMCSAFRG--GNANIVYGRIMQINFL 354
+ FPE I F GA N F I +I+C A G +A + G Q NF
Sbjct: 483 TDELNFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSALSIIGNYQQQNFH 542
Query: 355 IGYDIEQAMVSFKPSRC 371
I YD + + + + P RC
Sbjct: 543 ILYDTKNSRLGYAPMRC 559
>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 108/381 (28%), Positives = 170/381 (44%), Gaps = 58/381 (15%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP-PLFDPKKSSTYNSISCS 93
Y + L +GTPP + DTGSD W +C C +C + P F + S+T++ C
Sbjct: 89 YFVDLRLGTPPQKLLLVADTGSDLVWVKCSAC--RNCTRHTPGSAFLARHSTTFSPNHCY 146
Query: 94 SSQCAVV----TSNCSEGD----CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
S C +V C+ C Y + YG G S +SG + ET T N++SG ++
Sbjct: 147 DSACQLVPLPKHHRCNHARLHSPCRYEYSYGDG---SKTSGFFSKETTTLNTSSGREAKL 203
Query: 146 PNVIFGCGHKNLASPT----SDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSS 201
+ FGC + ++ P+ S + G++GLG G SL SQ+G KFSYCL D S
Sbjct: 204 KGIAFGCAFR-ISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHRFGNKFSYCLMDHDIS 262
Query: 202 KINFGGIVAGA----------GVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSS--- 244
++ G+ + TPL I YY+ +E++SV +L S
Sbjct: 263 PSPTSYLLIGSTQNDVAPGKRRMRFTPLHINPLSPTFYYIGIESVSVDGIKLPINPSVWA 322
Query: 245 ----STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIK----AQPVKGVGAEPGFSDVLCY 296
G VD+G T LP + + +V+ ++ A+P PGF LC
Sbjct: 323 LDELGNGGTIVDSGTTLTFLPEPAYLQILTVIKRRVRLPSPAEPT------PGFD--LCV 374
Query: 297 NIS--SQPKFPEVTIHFRGADV-KLSPSNLFRNISDEIMCSAFRG---GNANIVYGRIMQ 350
N+S P+ P+++ G V P N F + +++ C A + + V G +MQ
Sbjct: 375 NVSEIEHPRLPKLSFKLGGDSVFSPPPRNYFVDTDEDVKCLALQAVMTPSGFSVIGNLMQ 434
Query: 351 INFLIGYDIEQAMVSFKPSRC 371
FL+ +D ++ + F C
Sbjct: 435 QGFLLEFDKDRTRLGFSRHGC 455
>gi|218187618|gb|EEC70045.1| hypothetical protein OsI_00635 [Oryza sativa Indica Group]
Length = 570
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 111/366 (30%), Positives = 166/366 (45%), Gaps = 52/366 (14%)
Query: 26 AEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPL-FDPKKS 84
++++S YLM +++G+PP + DTGSD W +C+ P FDP +S
Sbjct: 92 SKVVSRSFEYLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDPSRS 151
Query: 85 STYNSISCSSSQC-AVVTSNCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNS----T 138
STY +SC + C A+ + C +G +C+Y + YG G S ++G L+TET TF+
Sbjct: 152 STYGRVSCQTDACEALGRATCDDGSNCAYLYAYGDG---SNTTGVLSTETFTFDDGGAGR 208
Query: 139 SGLPVEMPNVIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMG--TSIAGKFSYCL 195
S V + V FGC S P G SL++Q+G TS+ +FSYCL
Sbjct: 209 SPRQVRIGGVKFGCSTATAGSFPADGLVGLGGG-----AVSLVTQLGGATSLGRRFSYCL 263
Query: 196 PDQ---GSSKINFGGI--VAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSSSTGNIF 250
SS +NFG + V G STPL VGN+ + S+++ I
Sbjct: 264 VPHSVNASSALNFGALADVTEPGAASTPL--------------VGNKTV--ASAASSRII 307
Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ-----PKFP 305
VD+G T L + +S I PV+ P LCYN++ + P
Sbjct: 308 VDSGTTLTFLDPSLLGPIVDELSRRITLPPVQ----SPDGLLQLCYNVAGREVEAGESIP 363
Query: 306 EVTIHF-RGADVKLSPSNLFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQ 361
++T+ F GA V L P N F + + +C A + G + Q N +GYD++
Sbjct: 364 DLTLEFGGGAAVALKPENAFVAVQEGTLCLAIVATTEQQPVSILGNLAQQNIHVGYDLDA 423
Query: 362 AMVSFK 367
V K
Sbjct: 424 GTVGNK 429
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/157 (28%), Positives = 69/157 (43%), Gaps = 15/157 (9%)
Query: 224 HYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKG 283
H L+A +VGN+ + +SS I VD+G T L + +S I PV+
Sbjct: 416 HVGYDLDAGTVGNKTVASAASS--RIIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQ- 472
Query: 284 VGAEPGFSDVLCYNISSQP-----KFPEVTIHFRG-ADVKLSPSNLFRNISDEIMCSAFR 337
P LCYN++ + P++T+ F G A V L P N F + + +C A
Sbjct: 473 ---SPDGLLQLCYNVAGREVEAGESIPDLTLEFGGGAAVALKPENAFVAVQEGTLCLAIV 529
Query: 338 GGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ G + Q N +GYD++ V+F + C
Sbjct: 530 ATTEQQPVSILGNLAQQNIHVGYDLDAGTVTFAVADC 566
>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
Length = 373
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 109/381 (28%), Positives = 176/381 (46%), Gaps = 57/381 (14%)
Query: 37 MHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQ 96
M IGTPP ++ VDT S+ TW Q C +C + P F+P SS++ S C+SS
Sbjct: 1 MQTKIGTPPREVLLLVDTASELTWVQGTSC--TNCSPTKVPPFNPGLSSSFISEPCTSSV 58
Query: 97 CAVVTS-------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
C + N S G CS+ Y G+ A G +A E + S G + +VI
Sbjct: 59 CLGRSKLGFQSACNRSTGSCSFQVAYLDGSEA---YGVIAREIFSLQSWDGAASTLGDVI 115
Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS----IAGKFSYCLPDQGSSKINF 205
FGC K+L P S +G +GL G+ S +Q+G+ ++ +FSYC P++ +
Sbjct: 116 FGCASKDLQRPVDFS--SGTLGLNRGSFSFPAQIGSRSKSGLSDRFSYCFPNRAEHLNSS 173
Query: 206 GGIVAGAGVV-----------STPLI--IRDHYYLSLEAISVGNQRLEFVSSS------- 245
G I+ G + P I I D YY+ L+ ISVG + L S+
Sbjct: 174 GVIIFGDSGIPAHHFQYLSLEQEPPIASIVDFYYVGLQGISVGGELLHIPRSAFKIDRLG 233
Query: 246 TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ---- 301
G + D+G + L H+ L + + G++ F+ LCY++++
Sbjct: 234 NGGTYFDSGTTVSFLVEPAHTALVEAFGRRV-LHLNRTSGSD--FTKELCYDVAAGDARL 290
Query: 302 PKFPEVTIHFR-GADVKLSPSNLF----RNISDEIMCSAF------RGGNANIVYGRIMQ 350
P P VT+HF+ D++L ++++ R +C AF G N++ G Q
Sbjct: 291 PTAPLVTLHFKNNVDMELREASVWVPLARTPQVVTICLAFVNAGAVAQGGVNVI-GNYQQ 349
Query: 351 INFLIGYDIEQAMVSFKPSRC 371
++LI +D+E++ + F P+ C
Sbjct: 350 QDYLIEHDLERSRIGFAPANC 370
>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
Length = 482
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 109/370 (29%), Positives = 170/370 (45%), Gaps = 45/370 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + IGTP + DTGSD TW QC+PC + C++Q+ PLFDP KSSTY + C +
Sbjct: 126 YVVTIGIGTPARNFTVLFDTGSDLTWVQCKPCTD-SCYQQQEPLFDPSKSSTYVDVPCGT 184
Query: 95 SQCAVVTSN---CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
QC + C C YS YG S + GNLA E T S S P V+FG
Sbjct: 185 PQCKIGGGQDLTCGGTTCEYSVKYGD---QSVTRGNLAQEAFTL-SPSAPPAA--GVVFG 238
Query: 152 CGHK---NLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK-FSYCLPDQGSSKINFGG 207
C H+ + + G++GLG G+SS++SQ +G FSYCLP +GSS G
Sbjct: 239 CSHEYSSGVKGAEEEMSVAGLLGLGRGDSSILSQTRRGNSGDVFSYCLPPRGSSA---GY 295
Query: 208 IVAGAG------VVSTPLIIRDH-----YYLSLEAISVGNQRLEFVSSS--TGNIFVDTG 254
+ GA + TPL+ + Y ++L ISV L +S+ G + +D+G
Sbjct: 296 LTIGAAAPPQSNLSFTPLVTDNSQLSSVYVVNLVGISVSGAALPIDASAFYIGTV-IDSG 354
Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHF- 311
+ T +P + L+ + + G CY+++ P V + F
Sbjct: 355 TVITHMPAAAYYVLRDEFRRHMGGYTMLPEGHVESLDT--CYDVTGHDVVTAPPVALEFG 412
Query: 312 RGADVKLSPSNLFRNISDE-------IMCSAFRGGN--ANIVYGRIMQINFLIGYDIEQA 362
GA + + S + + + + C AF N ++ G + Q + + +D+E
Sbjct: 413 GGARIDVDASGILLVFAVDASGQSLTLACLAFVPTNLPGFVIIGNMQQRAYNVVFDVEGR 472
Query: 363 MVSFKPSRCT 372
+ F + C+
Sbjct: 473 RIGFGANGCS 482
>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 336
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 111/355 (31%), Positives = 163/355 (45%), Gaps = 41/355 (11%)
Query: 39 LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELD-CFKQEPPLFDPKKSSTYNSISCSSSQC 97
+ +G P F +DTGSD TW QC PC + C++Q P+FDP+ SS+YN +SC S QC
Sbjct: 1 MRVGQPQQPSFFVLDTGSDVTWLQCLPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQC 60
Query: 98 AVV-TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKN 156
++ + C+ C Y YG G SF+ G LATETLTF ++ + PN+ GCGH N
Sbjct: 61 QLLDEAGCNVNSCIYKVEYGDG---SFTIGELATETLTFVHSNSI----PNISIGCGHDN 113
Query: 157 LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFGGIVAGAG 213
G+ G S+ SQ+ A FSYCL D S S ++F
Sbjct: 114 EGLFVGADGLIGLGGG---AISISSQLK---ASSFSYCLVDIDSPSFSTLDFNTDPPSDS 167
Query: 214 VVSTPLIIRDHY----YLSLEAISVGNQ-------RLEFVSSSTGNIFVDTGVLRTLLPL 262
++S PL+ D + Y+ + +SVG + R E S G I VD+G T LP
Sbjct: 168 LIS-PLVKNDRFPSFRYVKVIGMSVGGKPLPISSSRFEIDESGLGGIIVDSGTTITQLPS 226
Query: 263 EYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP--KFPEVTIHFRGADVKLS 319
+ + L+ + P P S CY++SSQ + P + G +
Sbjct: 227 DVYEVLREAFLGLTTNLP-----PAPEISPFDTCYDLSSQSNVEVPTIAFILPGENSLQL 281
Query: 320 PSN--LFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
P+ L + S C AF + + G Q + YD+ ++V F ++C
Sbjct: 282 PAKNCLIQVDSAGTFCLAFVSATFPLSIIGNFQQQGIRVSYDLTNSLVGFSTNKC 336
>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 478
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 117/357 (32%), Positives = 168/357 (47%), Gaps = 38/357 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPEL-DCFKQEPPLFDPKKSSTYNSISCS 93
Y++ S+GTP V VDTGSD +W QC+PC C+ Q+ PLFDP +SS+Y ++ C
Sbjct: 140 YVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFDPAQSSSYAAVPCG 199
Query: 94 SSQCAVV----TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
CA + S CS C Y YG G S ++G +++TLT +++S +
Sbjct: 200 GPVCAGLGIYAASACSAAQCGYVVSYGDG---SNTTGVYSSDTLTLSASS----AVQGFF 252
Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK----INF 205
FGCGH A + G++GLG SL+ Q + G FSYCLP + S+ +
Sbjct: 253 FGCGH---AQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGL 309
Query: 206 GGIVAGAGVVSTPLIIRD-----HYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTL 259
GG A ST ++ +Y + L ISVG Q+L +S+ G VDTG + T
Sbjct: 310 GGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVITR 369
Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEP--GFSDVLCYNIS--SQPKFPEVTIHF-RGA 314
LP ++ L+S + + + G P G D CYN + P V + F GA
Sbjct: 370 LPPTAYAALRSAFRSGMASY---GYPTAPSNGILDT-CYNFAGYGTVTLPNVALTFGSGA 425
Query: 315 DVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
V L + S GG A + G + Q +F + I+ V FKPS C
Sbjct: 426 TVMLGADGILSFGCLAFAPSGSDGGMA--ILGNVQQRSFEV--RIDGTSVGFKPSSC 478
>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
protein [Arabidopsis thaliana]
gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 452
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 115/414 (27%), Positives = 175/414 (42%), Gaps = 55/414 (13%)
Query: 1 AQNSQKLPFYNDNETP----KSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGS 56
A ++++L F + P KSP+ + S Y + L IG PP + DTGS
Sbjct: 50 ALDTRRLHFLSLRRKPIPFVKSPVV----SGAASGSGQYFVDLRIGQPPQSLLLIADTGS 105
Query: 57 DCTWTQCEPCPELDCFKQEPP-LFDPKKSSTYNSISCSSSQCAVVTSN-----CSE---- 106
D W +C C +C P +F P+ SST++ C C +V C+
Sbjct: 106 DLVWVKCSAC--RNCSHHSPATVFFPRHSSTFSPAHCYDPVCRLVPKPDRAPICNHTRIH 163
Query: 107 GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHK---NLASPTSD 163
C Y + Y G S +SG A ET + ++SG + +V FGCG + S TS
Sbjct: 164 STCHYEYGYADG---SLTSGLFARETTSLKTSSGKEARLKSVAFGCGFRISGQSVSGTSF 220
Query: 164 SKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAG------VVST 217
+ G++GLG G S SQ+G KFSYCL D S ++ G G + T
Sbjct: 221 NGANGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPPTSYLIIGNGGDGISKLFFT 280
Query: 218 PLIIR----DHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTLLPLEYHS 266
PL+ YY+ L+++ V +L E S G VD+G L +
Sbjct: 281 PLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYR 340
Query: 267 NLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK----FPEVTIHFRGADVKL-SPS 321
++ + + +K + PGF LC N+S K P + F G V + P
Sbjct: 341 SVIAAVRRRVKLPIADAL--TPGFD--LCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPR 396
Query: 322 NLFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
N F ++I C A + + + V G +MQ FL +D +++ + F C
Sbjct: 397 NYFIETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCA 450
>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 559
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 111/380 (29%), Positives = 168/380 (44%), Gaps = 57/380 (15%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y M + +GTPP +DTGSD W QC PC + CF+Q P +DPK SS++ +ISC
Sbjct: 195 YFMDVFVGTPPKHFSLILDTGSDLNWIQCVPC--IACFEQSGPYYDPKDSSSFRNISCHD 252
Query: 95 SQCAVVTS-------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTS----GLPV 143
+C +V+S C Y + YG G S ++G+ A ET T N T+
Sbjct: 253 PRCQLVSSPDPPNPCKAENQSCPYFYWYGDG---SNTTGDFALETFTVNLTTPNGKSELK 309
Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG---- 199
+ NV+FGCGH N + G+ S SQM + FSYCL D+
Sbjct: 310 HVENVMFGCGHWNRGLFHGAAGLLGLGKG---PLSFASQMQSLYGQSFSYCLVDRNSNAS 366
Query: 200 -SSKINFGGIVAGAGVVSTPLI------------IRDHYYLSLEAISVGNQRLE------ 240
SSK+ FG ++S P + + YY+ + ++ V ++ L+
Sbjct: 367 VSSKLIFG---EDKELLSHPNLNFTSFGGGKDGSVDTFYYVQINSVMVDDEVLKIPEETW 423
Query: 241 -FVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKA-QPVKGVGAEPGFSDVLCYNI 298
S G +D+G T + +K IK + V+G+ P CYN+
Sbjct: 424 HLSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYELVEGL---PPLKP--CYNV 478
Query: 299 SSQPK--FPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGG--NANIVYGRIMQINF 353
S K P+ I F GA N F I +++C A G +A + G Q NF
Sbjct: 479 SGIEKMELPDFGILFADGAVWNFPVENYFIQIDPDVVCLAILGNPRSALSIIGNYQQQNF 538
Query: 354 LIGYDIEQAMVSFKPSRCTN 373
I YD++++ + + P +C +
Sbjct: 539 HILYDMKKSRLGYAPMKCAD 558
>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 509
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 109/366 (29%), Positives = 163/366 (44%), Gaps = 38/366 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP D+ DTGSD +W QC PC C+KQ+ PLF P SST++++ C +
Sbjct: 154 YVVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYKQQDPLFAPSDSSTFSAVRCGA 213
Query: 95 SQCAVVTS-NCSEGD--CSYSFLYGRGAYASFSSGNLATETLTF------NSTSGLPVEM 145
+C S S GD C Y +YG S + G+L +TLT N+++ ++
Sbjct: 214 RECRARQSCGGSPGDDRCPYEVVYGD---KSRTQGHLGNDTLTLGTMAPANASAENDNKL 270
Query: 146 PNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--- 202
P +FGCG N + G+ GLG G SL SQ FSYCLP SS
Sbjct: 271 PGFVFGCGENNTG---LFGQADGLFGLGRGKVSLSSQAAGKFGEGFSYCLPSSSSSAPGY 327
Query: 203 INFGGIV-AGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSSTG-NIFVDTG-V 255
++ G V A A TP++ R YY+ L I V + + S + VD+G V
Sbjct: 328 LSLGTPVPAPAHAQFTPMLNRTTTPSFYYVKLVGIRVAGRAIRVSSPRVALPLIVDSGTV 387
Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQPK----FPEVTIH 310
+ L P Y + + +S M K G P S + CY+ ++ P V +
Sbjct: 388 ITRLAPRAYRALRAAFLSAMGK----YGYKRAPRLSILDTCYDFTAHANATVSIPAVALV 443
Query: 311 FR-GADVKLSPSNLFRNISDEIMCSAFR---GGNANIVYGRIMQINFLIGYDIEQAMVSF 366
F GA + + S + C AF G + + G Q + YD+ + + F
Sbjct: 444 FAGGATISVDFSGVLYVAKVAQACLAFAPNGDGRSAGILGNTQQRTLAVVYDVARQKIGF 503
Query: 367 KPSRCT 372
C+
Sbjct: 504 AAKGCS 509
>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 115/379 (30%), Positives = 168/379 (44%), Gaps = 54/379 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y M + IGTPP +DTGSD W QC PC DCF+Q P +DPK+SS++ +I C
Sbjct: 90 YFMDVFIGTPPKHYSLILDTGSDLNWIQCVPCH--DCFEQNGPYYDPKESSSFRNIGCHD 147
Query: 95 SQCAVVTS-------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV---- 143
+C +V+S C Y + YG +S ++G+ ATET T N TS P
Sbjct: 148 PRCHLVSSPDPPLPCKAENQTCPYFYWYGD---SSNTTGDFATETFTVNLTS--PTGKSE 202
Query: 144 --EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG-- 199
+ NV+FGCGH N S G+ S SQ+ + FSYCL D+
Sbjct: 203 FKRVENVMFGCGHWNRGLFHGASGLLGLGRG---PLSFSSQLQSLYGHSFSYCLVDRNSD 259
Query: 200 ---SSKINFG---GIVAGAGVVSTPLI------IRDHYYLSLEAISVGNQRL-------E 240
SSK+ FG ++ + T L+ + YY+ +++I VG + L
Sbjct: 260 TNVSSKLIFGEDKDLLNHPELNFTTLVGGKENPVDTFYYVQIKSIMVGGEVLNIPESTWN 319
Query: 241 FVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS 300
S G VD+G + + +K +K P+ V P CYN+S
Sbjct: 320 MTSDGVGGTIVDSGTTLSYFTEPAYQIIKDAFVKKVKGYPI--VQDFPILDP--CYNVSG 375
Query: 301 QPK--FPEVTIHF-RGADVKLSPSNLF-RNISDEIMCSAFRG--GNANIVYGRIMQINFL 354
K P+ I F GA N F R +E++C A G +A + G Q NF
Sbjct: 376 VEKIDLPDFGILFADGAVWNFPVENYFIRLDPEEVVCLAILGTPRSALSIIGNYQQQNFH 435
Query: 355 IGYDIEQAMVSFKPSRCTN 373
+ YD +++ + + P C +
Sbjct: 436 VLYDTKKSRLGYAPMNCAD 454
>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 480
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 102/370 (27%), Positives = 180/370 (48%), Gaps = 41/370 (11%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDC---FKQEPPLFDPKKSSTYNSI 90
+Y + +G+PP + + VDTGSD W C PCP+ L+D K SST ++
Sbjct: 76 LYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKASSTSKNV 135
Query: 91 SCSSSQCAVV--TSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP- 146
C + C+ + + C ++ CSY +YG G S S G+ + +T + +G P
Sbjct: 136 GCEDAFCSFIMQSETCGAKKPCSYHVVYGDG---STSDGDFVKDNITLDQVTGNLRTAPL 192
Query: 147 --NVIFGCGHKNLASP--TSDSKQTGIIGLGPGNSSLISQM--GTSIAGKFSYCLPDQGS 200
V+FGCG KN + ++S GI+G G N+S+ISQ+ G S+ FS+CL +
Sbjct: 193 AQEVVFGCG-KNQSGQLGQTESAVDGIMGFGQSNTSVISQLAAGGSVKRIFSHCLDNMNG 251
Query: 201 SKINFGGIVAGAGVVSTPLIIRD-HYYLSLEAISVGNQRLEF---VSSSTGN--IFVDTG 254
I G V V +TPL+ HY + L+ + V + ++ ++S+ G+ +D+G
Sbjct: 252 GGIFAIGEVESPVVKTTPLVPNQVHYNVILKGMDVDGEPIDLPPSLASTNGDGGTIIDSG 311
Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHFR 312
LP ++ S++ + Q VK + F+ C++ +S FP V +HF
Sbjct: 312 TTLAYLPQNLYN---SLIEKITAKQQVKLHMVQETFA---CFSFTSNTDKAFPVVNLHFE 365
Query: 313 GADVKLS--PSNLFRNISDEIMCSAFRGGNAN-------IVYGRIMQINFLIGYDIEQAM 363
+ +KLS P + ++ +++ C ++ G I+ G ++ N L+ YD+E +
Sbjct: 366 DS-LKLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEV 424
Query: 364 VSFKPSRCTN 373
+ + C++
Sbjct: 425 IGWADHNCSS 434
>gi|357133002|ref|XP_003568117.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 497
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 113/380 (29%), Positives = 175/380 (46%), Gaps = 59/380 (15%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSIS 91
Y + IGTPP VDTGSD W C+ CP + L+DPK SS+ +++S
Sbjct: 87 YYTKIEIGTPPKPFHVQVDTGSDILWVNCVSCDKCPTKSGLGIDLALYDPKGSSSGSAVS 146
Query: 92 CSSSQCAVVTSN------CSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSG---L 141
C + CA + C+ G C Y YG G S ++G+ +++L +N SG
Sbjct: 147 CDNKFCAATYGSGEKLPGCTAGKPCEYRAEYGDG---SSTAGSFVSDSLQYNQLSGNAQT 203
Query: 142 PVEMPNVIFGCGHKNLASPTSDSKQ-TGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQ 198
NVIFGCG + S ++ GIIG G N+S +SQ+ ++ + FS+CL
Sbjct: 204 RHAKANVIFGCGAQQGGDLESTNQALDGIIGFGQSNTSTLSQLASAGEVKKIFSHCL--- 260
Query: 199 GSSKINFGGIVAGAGVV-----STPLIIR-DHYYLSLEAISVGNQRLE-----FVSSSTG 247
I GGI A VV STPL+ HY ++L++I V L+ F +S
Sbjct: 261 --DTIKGGGIFAIGEVVQPKVKSTPLLPNMSHYNVNLQSIDVAGNALQLPPHIFETSEKR 318
Query: 248 NIFVDTGVLRTLLP-LEYHSNLKSVMSNM--IKAQPVKGVGAEPGFSDVLC--YNISSQP 302
+D+G T LP L Y L +V I + ++G LC Y+ S
Sbjct: 319 GTIIDSGTTLTYLPELVYKDILAAVFQKHQDITFRTIQG---------FLCFEYSESVDD 369
Query: 303 KFPEVTIHFRGADVKLS--PSNLFRNISDEIMCSAFRGGNAN-------IVYGRIMQINF 353
FP++T HF D+ L+ P + F D + C F+ G ++ G ++ N
Sbjct: 370 GFPKITFHFE-DDLGLNVYPHDYFFQNGDNLYCLGFQNGGFQPKDAKDMVLLGDLVLSNK 428
Query: 354 LIGYDIEQAMVSFKPSRCTN 373
++ YD+E+ ++ + C++
Sbjct: 429 VVVYDLEKQVIGWTDYNCSS 448
>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 494
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 109/359 (30%), Positives = 160/359 (44%), Gaps = 38/359 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + + +GTP D DTGSD TWTQCEPC + C+ Q+ +F+P +S++Y +ISC S
Sbjct: 153 YFVTVGLGTPKKDFSLIFDTGSDLTWTQCEPCVK-SCYNQKEAIFNPSQSTSYANISCGS 211
Query: 95 SQCAVVTS------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
+ C + S NC+ C Y YG +SFS G E L+ +T +
Sbjct: 212 TLCDSLASATGNIFNCASSTCVYGIQYGD---SSFSIGFFGKEKLSLTATD----VFNDF 264
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFG 206
FGCG N + G+ SL+SQ FSYCLP SS + FG
Sbjct: 265 YFGCGQNNKGLFGGAAGLLGLG---RDKLSLVSQTAQRYNKIFSYCLPSSSSSTGFLTFG 321
Query: 207 GIVAGAGVVSTPLII----RDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLL 260
G + + TPL Y L L ISVG ++L S ST +D+G + T L
Sbjct: 322 GSTSKSASF-TPLATISGGSSFYGLDLTGISVGGRKLAISPSVFSTAGTIIDSGTVITRL 380
Query: 261 PLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQPKF--PEVTIHFRGA-DV 316
P +S L S ++ P A P S + C++ S+ P++ + F G V
Sbjct: 381 PPAAYSALSSTFRKLMSQYP-----AAPALSILDTCFDFSNHDTISVPKIGLFFSGGVVV 435
Query: 317 KLSPSNLFRNISDEIMCSAFRGGNAN---IVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
+ + +F +C AF G + ++G + Q + YD V F P+ C+
Sbjct: 436 DIDKTGIFYVNDLTQVCLAFAGNSDASDVAIFGNVQQKTLEVVYDGAAGRVGFAPAGCS 494
>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
Length = 482
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 105/389 (26%), Positives = 180/389 (46%), Gaps = 48/389 (12%)
Query: 11 NDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELD 70
N +++ +PI + + +++ Y++ + +G + + VDTGSD +W QC+PC
Sbjct: 113 NIDDSVDAPIPLTSGIRLQTLN--YIVTVELGGRKMTVI--VDTGSDLSWVQCQPCKR-- 166
Query: 71 CFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSN------CSEG--DCSYSFLYGRGAYAS 122
C+ Q+ P+F+P S +Y ++ CSS C + S C C+Y YG G+Y
Sbjct: 167 CYNQQDPVFNPSTSPSYRTVLCSSPTCQSLQSATGNLGVCGSNPPSCNYVVNYGDGSY-- 224
Query: 123 FSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQ 182
+ G L TE L +++ + N IFGCG N S G++GLG + SLISQ
Sbjct: 225 -TRGELGTEHLDLGNSTAVN----NFIFGCGRNNQGLFGGAS---GLVGLGRSSLSLISQ 276
Query: 183 MGTSIAGKFSYCLP---DQGSSKINFGG---IVAGAGVVSTPLIIRD----HYYLSLEAI 232
G FSYCLP + S + GG + +S +I + Y+L+L I
Sbjct: 277 TSAMFGGVFSYCLPITETEASGSLVMGGNSSVYKNTTPISYTRMIPNPQLPFYFLNLTGI 336
Query: 233 SVGNQRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSD 292
+VG+ ++ S + +D+G + T LP + LK P + P F
Sbjct: 337 TVGSVAVQAPSFGKDGMMIDSGTVITRLPPSIYQALKDEFVKQFSGFP-----SAPAFMI 391
Query: 293 V-LCYNIS--SQPKFPEVTIHFRG-ADVKLSPSNLFRNISDE-----IMCSAFRGGNANI 343
+ C+N+S + + P + +HF G A++ + + +F + + + ++ N
Sbjct: 392 LDTCFNLSGYQEVEIPNIKMHFEGNAELNVDVTGVFYFVKTDASQVCLAIASLSYENEVG 451
Query: 344 VYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
+ G Q N + YD + +M+ F CT
Sbjct: 452 IIGNYQQKNQRVIYDTKGSMLGFAAEACT 480
>gi|356558489|ref|XP_003547539.1| PREDICTED: uncharacterized protein LOC100817234 [Glycine max]
Length = 739
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 93/243 (38%), Positives = 135/243 (55%), Gaps = 22/243 (9%)
Query: 143 VEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL----PDQ 198
V P + GCG N + T DSK GI+GLG G SLIS +G SI K+SYCL
Sbjct: 56 VSFPKIPIGCGLNN--AGTFDSKCFGIVGLGGGVVSLISHIGLSIDSKYSYCLVPLFEFN 113
Query: 199 GSSKINFG--GIVAGAGVVSTPLI---IRDHYYLSLEAISVGNQRLEFVSSST-----GN 248
+SKINFG +V G G VSTP+I YYL LE +SVG++R++FV +ST GN
Sbjct: 114 STSKINFGENAVVEGLGTVSTPIIPGSFDTFYYLKLEGMSVGSKRIDFVDASTSNELKGN 173
Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI--SSQPKFPE 306
I +D+G T+L +++ L++ + I + V LCY ++ + P
Sbjct: 174 IIIDSGTTLTILLENFYTKLEAEVEAHINLERVNSTDQILS----LCYKSPPNNAIEVPI 229
Query: 307 VTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSF 366
+T HF G D+ L+ N F ++ D+ M AF + ++G + Q+N L+GYD+ + VSF
Sbjct: 230 ITTHFAGVDIVLNSLNTFVSVFDDAMWFAFAPVASGSIFGNLAQMNHLVGYDLLRKTVSF 289
Query: 367 KPS 369
KP+
Sbjct: 290 KPT 292
>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
Length = 485
Score = 124 bits (312), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 107/356 (30%), Positives = 164/356 (46%), Gaps = 37/356 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP DTGSD +W QC+PC DC++Q+ PLFDP SSTY +++C +
Sbjct: 149 YVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPC--ADCYEQQDPLFDPSLSSTYAAVACGA 206
Query: 95 SQCAVV-TSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
+C + S C S+ C Y YG S + GNL +TLT +++ +P +FGC
Sbjct: 207 PECQELDASGCSSDSRCRYEVQYGD---QSQTDGNLVRDTLTLSASD----TLPGFVFGC 259
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFGGIVA 210
G +N + G+ GLG SL SQ S F+YCLP S + ++ GG
Sbjct: 260 GDQNAG---LFGQVDGLFGLGREKVSLPSQGAPSYGPGFTYCLPSSSSGRGYLSLGG-AP 315
Query: 211 GAGVVSTPL---IIRDHYYLSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTLLPLEY 264
A T L YY+ L I VG + + ++ G +D+G + T LP
Sbjct: 316 PANAQFTALADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGGTVIDSGTVITRLPPRA 375
Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQ--PKFPEVTIHFR-GADVKLSP 320
++ L++ + + AQ K P S + CY+ + + P V + F GA V L
Sbjct: 376 YAPLRAAFARSM-AQYKK----APALSILDTCYDFTGHRTAQIPTVELAFAGGATVSLDF 430
Query: 321 SNLFRNISDEIMCSAFRGGNAN----IVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
+ + C AF NA+ + G Q F + YD+ + F C+
Sbjct: 431 TGVLYVSKVSQACLAF-APNADDSSIAILGNTQQKTFAVAYDVANQRIGFGAKGCS 485
>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
Length = 449
Score = 124 bits (312), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 108/378 (28%), Positives = 175/378 (46%), Gaps = 57/378 (15%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y M + +G PP +DTGSD TW QC+PC CF Q P+FDP +S+++ I C++
Sbjct: 87 YFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPCKA--CFDQSGPVFDPSQSTSFKIIPCNA 144
Query: 95 SQCAVV--------TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP--VE 144
+ C +V +S S C Y + YG +S +SG+LA E+L+ S S P +E
Sbjct: 145 AACDLVVHDECRDNSSKTSPKTCKYFYWYGD---SSRTSGDLALESLSV-SLSDHPSSLE 200
Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK-FSYCLPDQG---- 199
+ +++ GCGH N G+ G S SQ+ +S G+ FSYCL D+
Sbjct: 201 IRDMVIGCGHSNKGLFQGAGGLLGLGQ---GALSFPSQLRSSPIGQSFSYCLVDRTNNLS 257
Query: 200 -SSKINFGGIVAGA----GVVSTPLI-----IRDHYYLSLEAISVGN-------QRLEFV 242
SS I+FG A + + TP + + YYL ++ I + +R
Sbjct: 258 VSSAISFGAGFALSRHFDQMKFTPFVRTNNSVETFYYLGIQGIKIDQELLPIPAERFAIA 317
Query: 243 SSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMI---KAQPVKGVGAEPGFSDVLCYNIS 299
++ +G +D+G T L + + ++S I +A P +G +CYN +
Sbjct: 318 TNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARISYPRADPFDILG--------ICYNAT 369
Query: 300 SQPK--FPEVTIHFR-GADVKLSPSNLF--RNISDEIMCSAFRGGNANIVYGRIMQINFL 354
+ FP ++I F+ GA++ L N F + + C A + + G Q N
Sbjct: 370 GRAAVPFPALSIVFQNGAELDLPQENYFIQPDPQEAKHCLAILPTDGMSIIGNFQQQNIH 429
Query: 355 IGYDIEQAMVSFKPSRCT 372
YD++ A + F + C+
Sbjct: 430 FLYDVQHARLGFANTDCS 447
>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
Length = 749
Score = 124 bits (312), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 108/376 (28%), Positives = 173/376 (46%), Gaps = 51/376 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y M + IGTPP +DTGSD W QC PC + CF+Q P +DPK+SS++ +I+C
Sbjct: 192 YFMDVFIGTPPKHYSLILDTGSDLNWIQCVPC--IACFEQSGPYYDPKESSSFENITCHD 249
Query: 95 SQCAVVTS-----NCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTS----GLPV 143
+C +V+S C + + C Y + YG +S ++G+ A ET T N T+
Sbjct: 250 PRCKLVSSPDPPKPCKDENQTCPYFYWYGD---SSNTTGDFALETFTVNLTTPNGKSEQK 306
Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG---- 199
+ NV+FGCGH N + G+ S SQ+ + FSYCL D+
Sbjct: 307 HVENVMFGCGHWNRGLFHGAAGLLGLGRG---PLSFASQLQSIYGHSFSYCLVDRNSDTS 363
Query: 200 -SSKINFG---GIVAGAGVVSTPLI------IRDHYYLSLEAISVGNQRLEFVSSST--- 246
SSK+ FG +++ + T + + YY+ +++I V + L+ +
Sbjct: 364 VSSKLIFGEDKELLSHPNLNFTSFVGGEENSVDTFYYVGIKSIMVDGEVLKIPEETWHLS 423
Query: 247 ----GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL-CYNIS-- 299
G +D+G T + +K + +KG GF + CYN+S
Sbjct: 424 KEGGGGTIIDSGTTLTYFAEPAYEIIKEAF-----MKKIKGYELVEGFPPLKPCYNVSGI 478
Query: 300 SQPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRG--GNANIVYGRIMQINFLIG 356
+ + P+ I F GA N F I +++C A G +A + G Q NF I
Sbjct: 479 EKMELPDFGILFSDGAMWDFPVENYFIQIEPDLVCLAILGTPKSALSIIGNYQQQNFHIL 538
Query: 357 YDIEQAMVSFKPSRCT 372
YD++++ + + P +CT
Sbjct: 539 YDMKKSRLGYAPMKCT 554
>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
Length = 332
Score = 124 bits (312), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 99/344 (28%), Positives = 154/344 (44%), Gaps = 39/344 (11%)
Query: 52 VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQC-----AVVTSNCSE 106
+DTGS +W QC+PC + C Q PL+DP S TY +SC+S +C A + E
Sbjct: 3 LDTGSSLSWLQCQPC-AVYCHAQADPLYDPSVSKTYKKLSCASVECSRLKAATLNDPLCE 61
Query: 107 GD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSD 163
D C Y+ YG SFS G L+ + LT S+ L P +GCG N
Sbjct: 62 TDSNACLYTASYGD---TSFSIGYLSQDLLTLTSSQTL----PQFTYGCGQDNQG---LF 111
Query: 164 SKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG----SSKINFGGIVAGAGVVSTPL 219
+ GIIGL S+++Q+ T FSYCLP G ++ TP+
Sbjct: 112 GRAAGIIGLARDKLSMLAQLSTKYGHAFSYCLPTANSGSSGGGFLSIGSISPTSYKFTPM 171
Query: 220 IIRDH----YYLSLEAISVGNQRLEFVSSSTG-NIFVDTGVLRTLLPLEYHSNLKSVMSN 274
+ Y+L L AI+V + L+ ++ +D+G + T LP+ ++ L+
Sbjct: 172 LTDSKNPSLYFLRLTAITVSGRPLDLAAAMYRVPTLIDSGTVITRLPMSMYAALRQAFVK 231
Query: 275 MIKAQPVKGVGAEPGFSDV-LCY--NISSQPKFPEVTIHFR-GADVKLSPSNLFRNISDE 330
++ + K P +S + C+ ++ S PE+ + F+ GAD+ L ++
Sbjct: 232 IMSTKYAKA----PAYSILDTCFKGSLKSISAVPEIKMIFQGGADLTLRAPSILIEADKG 287
Query: 331 IMCSAFRGG---NANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
I C AF G N + G Q + I YD+ + + F P C
Sbjct: 288 ITCLAFAGSSGTNQIAIIGNRQQQTYNIAYDVSTSRIGFAPGSC 331
>gi|218194599|gb|EEC77026.1| hypothetical protein OsI_15382 [Oryza sativa Indica Group]
Length = 409
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 110/378 (29%), Positives = 170/378 (44%), Gaps = 55/378 (14%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + IGTP + VDTGSD W C+ CP E L+DPK SST + +
Sbjct: 3 LYYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKV 62
Query: 91 SCSSSQCAVVTSNCSEG-----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
SC CA G C YS YG G S ++G ++ L F+ SG
Sbjct: 63 SCDQGFCAATYGGLLPGCTTSLPCEYSVTYGDG---SSTTGYFVSDLLQFDQVSGDGQTR 119
Query: 146 P---NVIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGTSIAGK----FSYCLPD 197
P V FGCG + +S+ GIIG G N+S++SQ+ S AGK F++CL
Sbjct: 120 PANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQL--SAAGKVKKIFAHCL-- 175
Query: 198 QGSSKINFGGIVAGAGVV-----STPLIIR-DHYYLSLEAISVGNQRLE-----FVSSST 246
IN GGI A VV +TPL+ HY ++L++I VG L+ F +
Sbjct: 176 ---DTINGGGIFAIGNVVQPKVKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEK 232
Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ--PKF 304
+D+G T LP + + +++ K + + + + LC+ + F
Sbjct: 233 KGTIIDSGTTLTYLPEIVYKEI--MLAVFAKHKDITFHNVQ----EFLCFQYVGRVDDDF 286
Query: 305 PEVTIHFRGADVKLS--PSNLFRNISDEIMCSAF-------RGGNANIVYGRIMQINFLI 355
P++T HF D+ L+ P + F D + C F + G ++ G ++ N L+
Sbjct: 287 PKITFHFEN-DLPLNVYPHDYFFENGDNLYCVGFQNGGLQSKDGKGMVLLGDLVLSNKLV 345
Query: 356 GYDIEQAMVSFKPSRCTN 373
YD+E ++ + C++
Sbjct: 346 VYDLENQVIGWTEYNCSS 363
>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
Length = 494
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 111/378 (29%), Positives = 166/378 (43%), Gaps = 55/378 (14%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + IGTP + VDTGSD W C+ CP E L+DPK SST + +
Sbjct: 88 LYYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKV 147
Query: 91 SCSSSQCAVVTSNCSEG-----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
SC CA G C YS YG G S ++G ++ L F+ SG
Sbjct: 148 SCDQGFCAATYGGLLPGCTTSLPCEYSVTYGDG---SSTTGYFVSDLLQFDQVSGDGQTR 204
Query: 146 P---NVIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGTSIAGK----FSYCLPD 197
P V FGCG + +S+ GIIG G N+S++SQ+ S AGK F++CL
Sbjct: 205 PANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQL--SAAGKVKKIFAHCL-- 260
Query: 198 QGSSKINFGGIVAGAGVV-----STPLIIR-DHYYLSLEAISVGNQRLE-----FVSSST 246
IN GGI A VV +TPL+ HY ++L++I VG L+ F +
Sbjct: 261 ---DTINGGGIFAIGNVVQPKVKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEK 317
Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ--PKF 304
+D+G T LP + + + K V + LC+ + F
Sbjct: 318 KGTIIDSGTTLTYLPEIVYKEIMLAVFAKHKDITFHNV------QEFLCFQYVGRVDDDF 371
Query: 305 PEVTIHFRGADVKLS--PSNLFRNISDEIMCSAF-------RGGNANIVYGRIMQINFLI 355
P++T HF D+ L+ P + F D + C F + G ++ G ++ N L+
Sbjct: 372 PKITFHFEN-DLPLNVYPHDYFFENGDNLYCVGFQNGGLQSKDGKGMVLLGDLVLSNKLV 430
Query: 356 GYDIEQAMVSFKPSRCTN 373
YD+E ++ + C++
Sbjct: 431 VYDLENQVIGWTEYNCSS 448
>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
Length = 451
Score = 124 bits (311), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 104/373 (27%), Positives = 160/373 (42%), Gaps = 41/373 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPP-LFDPKKSSTYNSISCS 93
Y + L IG PP + DTGSD W +C C +C P +F P+ SST++ C
Sbjct: 83 YFVDLRIGQPPQSLLLIADTGSDLVWVKCSAC--RNCSHHSPATVFFPRHSSTFSPAHCY 140
Query: 94 SSQCAVVTSNCSEGDCSYSFLYGRGAY------ASFSSGNLATETLTFNSTSGLPVEMPN 147
C +V C+++ ++ Y S +SG A ET + ++SG ++ +
Sbjct: 141 DPVCRLVPKPGRAPRCNHTRIHSTCPYEYGYADGSLTSGLFARETTSLKTSSGKEAKLKS 200
Query: 148 VIFGCGHK---NLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKIN 204
V FGCG + S TS + G++GLG G S SQ+G KFSYCL D S
Sbjct: 201 VAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPP 260
Query: 205 FGGIVAGAG------VVSTPLIIR----DHYYLSLEAISVGNQRL-------EFVSSSTG 247
++ G G + TPL+ YY+ L+++ V +L E S G
Sbjct: 261 TSYLIIGDGGDAVSKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNG 320
Query: 248 NIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK---- 303
+D+G L + + + + IK + PGF LC N+S K
Sbjct: 321 GTVMDSGTTLAFLADPAYRLVIAAVKQRIKLPNADEL--TPGFD--LCVNVSGVTKPEKI 376
Query: 304 FPEVTIHFRGADVKL-SPSNLFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDI 359
P + F G V + P N F ++I C A + + + V G +MQ FL +D
Sbjct: 377 LPRLKFEFSGGAVFVPPPRNYFIETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDR 436
Query: 360 EQAMVSFKPSRCT 372
+++ + F C
Sbjct: 437 DRSRLGFSRRGCA 449
>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
Length = 357
Score = 124 bits (311), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 113/366 (30%), Positives = 170/366 (46%), Gaps = 51/366 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + + IG+P + +DTGSD W QC PC C+KQ +FDP+ SS++ +SCS+
Sbjct: 14 YFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKS--CYKQNDAVFDPRASSSFRRLSCST 71
Query: 95 SQCAVV-TSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
QC ++ C+ D C Y YG G SF+ G+LA+++ + + P V+FG
Sbjct: 72 PQCKLLDVKACASTDNRCLYQVSYGDG---SFTVGDLASDSFSVSRGRTSP-----VVFG 123
Query: 152 CGHKN--LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL--PDQG---SSKIN 204
CGH N L + G L S SQ+ + KFSYCL D G SS +
Sbjct: 124 CGHDNEGLFVGAAGLLGLGAGKL-----SFPSQLSSR---KFSYCLVSRDNGVRASSALL 175
Query: 205 FG--GIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEF------VSSSTGN--IF 250
FG + A T L+ + YY L IS+G L +SSSTG +
Sbjct: 176 FGDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVI 235
Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVT 308
+D+G T LP ++ ++ + + P A+ D CY+ S + P V+
Sbjct: 236 IDSGTSVTRLPTYAYTVMRDAFRSATQKLP---RAADFSLFDT-CYDFSALTSVTIPTVS 291
Query: 309 IHFR-GADVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVS 365
HF GA V+L PSN + + C AF + ++ + G I Q + D++ + V
Sbjct: 292 FHFEGGASVQLPPSNYLVPVDTSGTFCFAFSKTSLDLSIIGNIQQQTMRVAIDLDSSRVG 351
Query: 366 FKPSRC 371
F P +C
Sbjct: 352 FAPRQC 357
>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 124 bits (310), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 114/359 (31%), Positives = 166/359 (46%), Gaps = 41/359 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCP-ELDCFKQEPPLFDPKKSSTYNSISCS 93
Y + +G P F DTGSD +W QC+PC E C+KQ P+FDPK SS+Y+ +SC
Sbjct: 184 YFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDPKSSSSYSPLSCD 243
Query: 94 SSQCAVV-TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
S QC ++ + C C Y YG G SF+ G LATET +F ++ + PN+ GC
Sbjct: 244 SEQCHLLDEAACDANSCIYEVEYGDG---SFTVGELATETFSFRHSNSI----PNLPIGC 296
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD---QGSSKINFGGIV 209
GH N G+ G SL SQ+ A FSYCL D + SS ++F
Sbjct: 297 GHDNEGLFVGADGLIGLGGG---AISLSSQLE---ATSFSYCLVDLDSESSSTLDFNADQ 350
Query: 210 AGAGVVSTPLIIRDHY----YLSLEAISVGNQRLEFVSSS-------TGNIFVDTGVLRT 258
+ S PL+ D + Y+ + +SVG + L SSS +G I VD+G T
Sbjct: 351 PSDSLTS-PLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSGTTIT 409
Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP--KFPEVTIHFRGAD 315
+P + + L+ + K P PG S CY++SSQ + P + G +
Sbjct: 410 EIPSDVYDVLRDAFVGLTKNLP-----PAPGVSPFDTCYDLSSQSNVEVPTIAFILPGEN 464
Query: 316 VKLSPSN--LFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
P+ L + S C AF + + G + Q + YD+ ++V F +C
Sbjct: 465 SLQLPAKNCLIQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANSLVGFSTDKC 523
>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
Length = 988
Score = 124 bits (310), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 114/369 (30%), Positives = 165/369 (44%), Gaps = 52/369 (14%)
Query: 32 DDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSIS 91
D +L+ ++ GTPP +DTGS TWTQC+ C + C K FD SSTY+ S
Sbjct: 124 DGNFLVDVAFGTPPQKFKLILDTGSSITWTQCKAC--VHCLKDSHRHFDSLASSTYSFGS 181
Query: 92 CSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
C S +Y+ YG S S GN +T+T + FG
Sbjct: 182 CIPSTVG----------NTYNMTYGD---KSTSVGNYGCDTMTLEPSDVF----QKFQFG 224
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG------------ 199
CG N S + G++GLG G S +SQ + FSYCLP++
Sbjct: 225 CGRNNEGDFGSGAD--GMLGLGQGQLSTVSQTASKFKKVFSYCLPEENSIGSLLFGEKAT 282
Query: 200 --SSKINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSS---STGNIFVDTG 254
SS + F +V G G ++ L +Y++ L ISVGN+RL SS S G I +D+G
Sbjct: 283 SQSSSLKFTSLVNGPG--TSGLEESGYYFVKLLDISVGNKRLNIPSSVFASPGTI-IDSG 339
Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPV-KGVGAEPGFSDVLCYNISSQPK--FPEVTIHF 311
+ T LP +S LK+ + P+ G E D CYN+S + PE +HF
Sbjct: 340 TVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKENDMLDT-CYNLSGRKDVLLPEXVLHF 398
Query: 312 -RGADVKLSPSNLFRNISDEIMCSAFRGGNAN------IVYGRIMQINFLIGYDIEQAMV 364
GADV+L+ + +C AF G + + + G Q++ + YDI +
Sbjct: 399 GDGADVRLNGKRVVWGNDASRLCLAFAGNSKSTMNPELTIIGNRQQVSLTVLYDIRGRRI 458
Query: 365 SFKPSRCTN 373
F + C+N
Sbjct: 459 GFGGNGCSN 467
>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 124 bits (310), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 120/409 (29%), Positives = 180/409 (44%), Gaps = 73/409 (17%)
Query: 17 KSPISI--IYQAEIISVDDIYLMH-------LSIGTPPVDIFGSVDTGSDCTWTQCEPCP 67
++PIS ++ + D + H L+ GTP +I +DTGS+ +W C+ P
Sbjct: 40 RTPISTPRLFSTTSKTTDKLLFHHNVTLTVSLTAGTPLQNITMVLDTGSELSWLHCKKEP 99
Query: 68 ELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSN------CSEGD-CSYSFLYGRGAY 120
+ +F+P S TY I CSS C T + C C + Y A
Sbjct: 100 NFNS------IFNPLASKTYTKIPCSSPTCETRTRDLPLPVSCDPAKLCHFIISY---AD 150
Query: 121 ASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSL 179
AS GNLA ET S +G P +FGC +S + D+K TG++G+ G+ S
Sbjct: 151 ASSVEGNLAFETFRVGSVTG-----PATVFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSF 205
Query: 180 ISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGV----------VSTPLIIRDH--YYL 227
++QMG KFSYC+ D+ SS + G + + + +STPL D Y +
Sbjct: 206 VNQMGFR---KFSYCISDRDSSGVLLLGEASFSWLKPLNYTPLVEMSTPLPYFDRVAYSV 262
Query: 228 SLEAISVGNQRLE-----FVSSST--GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQP 280
LE I V ++ L FV T G VD+G T L +S LK +++ +
Sbjct: 263 QLEGIRVSDKVLSLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYSALKQEF--LLQTKG 320
Query: 281 VKGVGAEPGF----SDVLCYNI----SSQPKFPEVTIHFRGADVKLSPSNLFRNI----- 327
V V EP + + LCY I ++ P P V + FRGA++ +S L +
Sbjct: 321 VLRVLNEPRYVFQGAMDLCYLIEPTRAALPNLPVVNLMFRGAEMSVSGQRLLYRVPGEVR 380
Query: 328 -SDEIMCSAFRG----GNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
D + C F G + V G Q N + YD+E++ + F RC
Sbjct: 381 GKDSVWCFTFGNSDSLGIESFVIGHHQQQNVWMEYDLEKSRIGFAEVRC 429
>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 485
Score = 124 bits (310), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 115/379 (30%), Positives = 170/379 (44%), Gaps = 58/379 (15%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + IGTP + VDTGSD W C+ CP E L+DP SS+ +
Sbjct: 80 LYFTQIGIGTPAKSYYVQVDTGSDILWVNCVFCDTCPRKSGLGIELTLYDPSGSSSGTGV 139
Query: 91 SCSSSQC-----AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG---LP 142
+C C V+ S C YS YG G S ++G T+ L +N SG
Sbjct: 140 TCGQDFCVATHGGVIPSCVPAAPCQYSISYGDG---SSTTGFFVTDFLQYNQVSGNSQTT 196
Query: 143 VEMPNVIFGCGHKNLASPTSDSKQ-TGIIGLGPGNSSLISQMGTSIAGK----FSYCLPD 197
+ ++ FGCG K S S+ GI+G G NSS++SQ+ + AGK F++CL
Sbjct: 197 LANTSITFGCGAKIGGDLGSSSQALDGILGFGQSNSSMLSQL--AAAGKVRKVFAHCL-- 252
Query: 198 QGSSKINFGGIVAGAGVV-----STPLII-RDHYYLSLEAISVGNQRLEF------VSSS 245
IN GGI A VV +TPL+ HY ++LEAI VG +L+ + S
Sbjct: 253 ---DTINGGGIFAIGDVVQPKVSTTPLVPGMPHYNVNLEAIDVGGVKLQLPTNIFDIGES 309
Query: 246 TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPK 303
G I +D+G LP ++ + S + P+K D C+ S
Sbjct: 310 KGTI-IDSGTTLAYLPGVVYNAIMSKVFAQYGDMPLKND------QDFQCFRYSGSVDDG 362
Query: 304 FPEVTIHFRGA-DVKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQI-------NFL 354
FP +T HF G + + P + LF+N E+ C F+ G G+ M + N L
Sbjct: 363 FPIITFHFEGGLPLNIHPHDYLFQN--GELYCMGFQTGGLQTKDGKDMVLLGDLAFSNRL 420
Query: 355 IGYDIEQAMVSFKPSRCTN 373
+ YD+E ++ + C++
Sbjct: 421 VLYDLENQVIGWTDYNCSS 439
>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 113/363 (31%), Positives = 164/363 (45%), Gaps = 53/363 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + + IG PP + +DTGSD +W QC PC E C++Q P+FDP S++Y+ I C
Sbjct: 149 YFLRVGIGKPPSQAYVVLDTGSDVSWIQCAPCSE--CYQQSDPIFDPISSNSYSPIRCDE 206
Query: 95 SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
QC ++ S C G C Y YG G+Y + G ATET+T S + + NV GCG
Sbjct: 207 PQCKSLDLSECRNGTCLYEVSYGDGSY---TVGEFATETVTLGSAA-----VENVAIGCG 258
Query: 154 HKN--LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFGGI 208
H N L + G L S +Q+ A FSYCL ++ S S + F
Sbjct: 259 HNNEGLFVGAAGLLGLGGGKL-----SFPAQVN---ATSFSYCLVNRDSDAVSTLEFNSP 310
Query: 209 VAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS-------TGNIFVDTGVLR 257
+ + PL+ + YYL L+ ISVG + L SS G I +D+G
Sbjct: 311 LP-RNAATAPLMRNPELDTFYYLGLKGISVGGEALPIPESSFEVDAIGGGGIIIDSGTAV 369
Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNISSQPKFPEVTIHFR- 312
T L E + L+ VKG P + V CY++SS+ T+ FR
Sbjct: 370 TRLRSEVYDALRDAF--------VKGAKGIPKANGVSLFDTCYDLSSRESVEIPTVSFRF 421
Query: 313 --GADVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKP 368
G ++ L N + S C AF +++ + G + Q +G+DI ++V F
Sbjct: 422 PEGRELPLPARNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVGFDIANSLVGFSV 481
Query: 369 SRC 371
C
Sbjct: 482 DSC 484
>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
Length = 485
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 107/356 (30%), Positives = 164/356 (46%), Gaps = 37/356 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP DTGSD +W QC+PC DC++Q+ PLFDP SSTY +++C +
Sbjct: 149 YVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPC--ADCYEQQDPLFDPSLSSTYAAVACGA 206
Query: 95 SQCAVV-TSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
+C + S C S+ C Y YG S + GNL +TLT +++ +P +FGC
Sbjct: 207 PECQELDASGCSSDSRCRYEVQYGD---QSQTDGNLVRDTLTLSASD----TLPGFVFGC 259
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFGGIVA 210
G +N + G+ GLG SL SQ S F+YCLP S + ++ GG
Sbjct: 260 GDQNAG---LFGQVDGLFGLGREKVSLPSQGAPSYGPGFTYCLPSSSSGRGYLSLGG-AP 315
Query: 211 GAGVVSTPL---IIRDHYYLSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTLLPLEY 264
A T L YY+ L I VG + + ++ G +D+G + T LP
Sbjct: 316 PANAQFTALADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGGTVIDSGTVITRLPPRA 375
Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQ--PKFPEVTIHFR-GADVKLSP 320
++ L++ + + AQ K P S + CY+ + + P V + F GA V L
Sbjct: 376 YAPLRAAFARSM-AQYKK----APALSILDTCYDFTGHRTAQIPTVELAFAGGATVSLDF 430
Query: 321 SNLFRNISDEIMCSAFRGGNAN----IVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
+ + C AF NA+ + G Q F + YD+ + F C+
Sbjct: 431 TGVLYVSKVSQACLAF-APNADDSSIAILGNTQQKTFAVTYDVANQRIGFGAKGCS 485
>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 521
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 113/372 (30%), Positives = 166/372 (44%), Gaps = 56/372 (15%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ L IGTP V +DTGSD +W QC+PC +C+ Q+ PLFDP SS+Y S+ C S
Sbjct: 171 YVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKDPLFDPSSSSSYASVPCDS 230
Query: 95 SQCAVVT-----------SNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
C + S + C Y YG A ++G +TETLT V
Sbjct: 231 DACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRAT---TTGVYSTETLTLKPG----V 283
Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSS 201
+ + FGCG K G++GLG SL+SQ + G FSYCLP G+
Sbjct: 284 VVADFGFGCGDHQHG---PYEKFDGLLGLGGAPESLVSQTSSQFGGPFSYCLPPTSGGAG 340
Query: 202 KINFGG------IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS-TGNIF 250
+ G A +G+ TP+ + Y ++L ISVG L S+ + +
Sbjct: 341 FLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIPPSAFSSGMV 400
Query: 251 VDTGVLRTLLPLEYHSNL----KSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--F 304
+D+G + T LP ++ L +S MS P G G D CY+ +
Sbjct: 401 IDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNG-----GVLDT-CYDFTGHANVTV 454
Query: 305 PEVTIHFR-GADVKL-SPSNLFRNISDEIMCSAFRGG---NANIVYGRIMQINFLIGYDI 359
P +++ F GA + L +P+ + + C AF G NA + G + Q F + YD
Sbjct: 455 PTISLTFSGGATIDLAAPAGVLVD-----GCLAFAGAGTDNAIGIIGNVNQRTFEVLYDS 509
Query: 360 EQAMVSFKPSRC 371
+ V F+ C
Sbjct: 510 GKGTVGFRAGAC 521
>gi|359482097|ref|XP_002271077.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 114/379 (30%), Positives = 180/379 (47%), Gaps = 66/379 (17%)
Query: 39 LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA 98
L++GTPP ++ +DTGS+ +W +C + F+ FDP +SS+Y+ + CSS C
Sbjct: 89 LTVGTPPQNVSMVLDTGSELSWLRCN---KTQTFQTT---FDPNRSSSYSPVPCSSLTCT 142
Query: 99 VVT------SNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
T ++C ++ L A AS S GNLA++T ++ +MP IFGC
Sbjct: 143 DRTRDFPIPASCDSNQLCHAIL--SYADASSSEGNLASDTFYIGNS-----DMPGTIFGC 195
Query: 153 GHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKI------NF 205
+ ++ T DSK TG++G+ G+ S +SQM KFSYC+ D S + NF
Sbjct: 196 MDSSFSTNTEEDSKNTGLMGMNRGSLSFVSQMDFP---KFSYCISDSDFSGVLLLGDANF 252
Query: 206 GGIVAGAGV----VSTPLIIRDH--YYLSLEAISVGNQRLE-----FVSSST--GNIFVD 252
++ +STPL D Y + LE I V ++ L FV T G VD
Sbjct: 253 SWLMPLNYTPLIQISTPLPYFDRVAYTVQLEGIKVSSKLLPLPKSVFVPDHTGAGQTMVD 312
Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGF----SDVLCYNI----SSQPKF 304
+G T L +S L++ N + + V +P + LCY + +S P
Sbjct: 313 SGTQFTFLLGPVYSALRNEFLN--QTSQILRVLEDPNYVFQGGMDLCYRVPLSQTSLPWL 370
Query: 305 PEVTIHFRGADVKLSPSNLFRNI------SDEIMCSAFRGGNANI------VYGRIMQIN 352
P V++ FRGA++K+S L + SD + C F GN+++ V G Q N
Sbjct: 371 PTVSLMFRGAEMKVSGDRLLYRVPGEVRGSDSVYCFTF--GNSDLLAVEAYVIGHHHQQN 428
Query: 353 FLIGYDIEQAMVSFKPSRC 371
+ +D+E++ + F +C
Sbjct: 429 VWMEFDLEKSRIGFAQVQC 447
>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
Length = 385
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 116/358 (32%), Positives = 175/358 (48%), Gaps = 45/358 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YL+ + +G+P +DTGSD +W QC+PC + C Q PLFDP SSTY+ SC S
Sbjct: 52 YLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQ--CHSQADPLFDPSSSSTYSPFSCGS 109
Query: 95 SQCAVVTSN----CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
+ CA + S C Y YG G S ++G +++TL S++ + + F
Sbjct: 110 ADCAQLGQEGNGCSSSSQCQYIVTYGDG---SSTTGTYSSDTLALGSSA-----VRSFQF 161
Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK----INFG 206
GC N+ S +D + G++GLG G SL+SQ ++ FSYCLP SS +
Sbjct: 162 GC--SNVESGFND-QTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTLGAA 218
Query: 207 GIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLL 260
G +G V TP++ + Y + L+AI VG ++L +S S G + +D+G + T L
Sbjct: 219 GGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSAGTV-MDSGTVITRL 277
Query: 261 PLEYHSNLKSVMSNMIKAQPVKGVGAEP-GFSDVLCYNISSQP--KFPEVTIHFR-GADV 316
P +S L S +K P A+P G D C++ S Q P V + F GA V
Sbjct: 278 PPTAYSALSSAFKAGMKQYPP----AQPSGILDT-CFDFSGQSSVSIPSVALVFSGGAVV 332
Query: 317 KLSPSNLFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
L S + + C AF G + + + G + Q F + YD+ + +V F+ C
Sbjct: 333 SLDASGIILS-----NCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 385
>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
Length = 441
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 113/372 (30%), Positives = 166/372 (44%), Gaps = 56/372 (15%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ L IGTP V +DTGSD +W QC+PC +C+ Q+ PLFDP SS+Y S+ C S
Sbjct: 91 YVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKDPLFDPSSSSSYASVPCDS 150
Query: 95 SQCAVVT-----------SNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
C + S + C Y YG A ++G +TETLT V
Sbjct: 151 DACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRAT---TTGVYSTETLTLKPG----V 203
Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSS 201
+ + FGCG K G++GLG SL+SQ + G FSYCLP G+
Sbjct: 204 VVADFGFGCGDHQHG---PYEKFDGLLGLGGAPESLVSQTSSQFGGPFSYCLPPTSGGAG 260
Query: 202 KINFGG------IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS-TGNIF 250
+ G A +G+ TP+ + Y ++L ISVG L S+ + +
Sbjct: 261 FLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIPPSAFSSGMV 320
Query: 251 VDTGVLRTLLPLEYHSNL----KSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--F 304
+D+G + T LP ++ L +S MS P G G D CY+ +
Sbjct: 321 IDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNG-----GVLDT-CYDFTGHANVTV 374
Query: 305 PEVTIHFR-GADVKL-SPSNLFRNISDEIMCSAFRGG---NANIVYGRIMQINFLIGYDI 359
P +++ F GA + L +P+ + + C AF G NA + G + Q F + YD
Sbjct: 375 PTISLTFSGGATIDLAAPAGVLVD-----GCLAFAGAGTDNAIGIIGNVNQRTFEVLYDS 429
Query: 360 EQAMVSFKPSRC 371
+ V F+ C
Sbjct: 430 GKGTVGFRAGAC 441
>gi|224142001|ref|XP_002324349.1| predicted protein [Populus trichocarpa]
gi|222865783|gb|EEF02914.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 116/361 (32%), Positives = 167/361 (46%), Gaps = 42/361 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP D+ DTGSD TWTQC+PC C+KQ+ +FDP +S++Y +ISCSS
Sbjct: 149 YIVTVGLGTPKKDLSLIFDTGSDITWTQCQPCAR-SCYKQKEQIFDPSQSTSYTNISCSS 207
Query: 95 SQCAVVTS------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
S C +TS C+ C Y YG +SFS G TE LT ST N+
Sbjct: 208 SICNSLTSATGNTPGCASSACVYGIQYGD---SSFSVGFFGTEKLTLTSTDAF----NNI 260
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFG 206
FGCG N + G+ S++SQ FSYCLP SS + FG
Sbjct: 261 YFGCGQNNQGLFGGSAGLLGLG---RDKLSVVSQTAQKYNKIFSYCLPSSSSSTGFLTFG 317
Query: 207 GIVAGAGVVSTPLII----RDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLL 260
G A TPL Y L ISVG ++L +S ST +D+G + T L
Sbjct: 318 G-SASKNAKFTPLSTISAGPSFYGLDFTGISVGGKKLAISASVFSTAGAIIDSGTVITRL 376
Query: 261 PLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQPKF--PEVTIHF-RGAD 315
P +S L++ N++ P+ + +L CY+ SS P++ F G +
Sbjct: 377 PPAAYSALRASFRNLMSKYPMTKALS------ILDTCYDFSSYTTISVPKIGFSFSSGIE 430
Query: 316 VKLSPSNLFRNISDEIMCSAFRGGNAN----IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
V + + + S +C AF GN++ ++G + Q + YD V F P C
Sbjct: 431 VDIDATGILYASSLSQVCLAF-AGNSDATDVFIFGNVQQKTLEVFYDGSAGKVGFAPGGC 489
Query: 372 T 372
+
Sbjct: 490 S 490
>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
Length = 452
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 101/357 (28%), Positives = 162/357 (45%), Gaps = 43/357 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + GTP ++ +DTGSD W C+ C P+FDP KSS+Y +C S
Sbjct: 115 YIIQVDFGTPKQSMYTLIDTGSDVAWIPCKQCQGC---HSTAPIFDPAKSSSYKPFACDS 171
Query: 95 SQCAVVTSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
C ++ NC C + LYG G G LA++ +T S +PN FGC
Sbjct: 172 QPCQEISGNCGGNSKCQFEVLYGDGTQV---DGTLASDAITLGSQ-----YLPNFSFGCA 223
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAG-- 211
++L+ T S +G G + + G FSYCLP +S + +V G
Sbjct: 224 -ESLSEDTYSSPGLMGLGGGSLSLLTQAPTAELFGGTFSYCLPSSSTSSGS---LVLGKE 279
Query: 212 AGVVSTPL----IIRD-----HYYLSLEAISVGNQRLEFVSS---STGNIFVDTGVLRTL 259
A V S+ L +I+D Y+++L+AISVGN R+ ++ S G +D+G T
Sbjct: 280 AAVSSSSLKFTTLIKDPSFPTFYFVTLKAISVGNTRISVPATNIASGGGTIIDSGTTITY 339
Query: 260 LPLEYHSNLKSVMSNM---IKAQPVKGVGAEPGFSDVLCYNISSQP-KFPEVTIHF-RGA 314
L + +L+ ++ PV+ + CY++SS P +T+H R
Sbjct: 340 LVPSAYKDLRDAFRQQLSSLQPTPVEDMDT--------CYDLSSSSVDVPTITLHLDRNV 391
Query: 315 DVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
D+ L N+ + C AF ++ + G + Q N+ I +D+ + V F +C
Sbjct: 392 DLVLPKENILITQESGLSCLAFSSTDSRSIIGNVQQQNWRIVFDVPNSQVGFAQEQC 448
>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 446
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 110/357 (30%), Positives = 160/357 (44%), Gaps = 38/357 (10%)
Query: 36 LMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSS 95
L++LSIG P + +DTGSD W C PC +C LFDP SST++ + +
Sbjct: 102 LVNLSIGQPSIPQLVVMDTGSDILWIMCNPCT--NCDNHLGLLFDPSMSSTFSPL--CKT 157
Query: 96 QCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHK 155
C C + S++ A SG + L F +T ++ +VI GCGH
Sbjct: 158 PCGFKGCKCDPIPFTISYVDNSSA-----SGTFGRDILVFETTDEGTSQISDVIIGCGHN 212
Query: 156 NLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGV- 214
SD GI+GL G +SL +Q+G KFSYC+ + N+ + G G
Sbjct: 213 --IGFNSDPGYNGILGLNNGPNSLATQIGR----KFSYCIGNLADPYYNYNQLRLGEGAD 266
Query: 215 ---VSTPL-IIRDHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTLLPLE 263
STP + YY+++E ISVG +RL E + TG + +D+G T L
Sbjct: 267 LEGYSTPFEVYHGFYYVTMEGISVGEKRLDIALETFEMKRNGTGGVILDSGTTITYLVDS 326
Query: 264 YHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHF-RGADVKLSP 320
H L + + N++K + V E + Y I S+ FP VT HF GAD+ L
Sbjct: 327 AHKLLYNEVRNLLKWS-FRQVIFENAPWKLCYYGIISRDLVGFPVVTFHFVDGADLALDT 385
Query: 321 SNLFRNISDEIMCSAFRGG---NANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ F D+I C N I V G + Q ++ +GYD+ V F+ C
Sbjct: 386 GSFFSQ-RDDIFCMTVSPASILNTTISPSVIGLLAQQSYNVGYDLVNQFVYFQRIDC 441
>gi|116789442|gb|ABK25248.1| unknown [Picea sitchensis]
Length = 366
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 78/208 (37%), Positives = 112/208 (53%), Gaps = 21/208 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +GTP + + +DTGSD W QCEPC E C+ Q P+F+P S++++++ C S
Sbjct: 157 YFTRIGVGTPTREQYMVLDTGSDVAWIQCEPCRE--CYSQADPIFNPSYSASFSTVGCDS 214
Query: 95 SQCAVVTS-NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
+ C+ + + +C G C Y YG G+Y S+G+ ATETLTF +TS + NV GCG
Sbjct: 215 AVCSQLDAYDCHSGGCLYEASYGDGSY---STGSFATETLTFGTTS-----VANVAIGCG 266
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---INFGGIVA 210
HKN+ + G+ S +Q+GT FSYCL D+ S + FG
Sbjct: 267 HKNVGLFIGAAGLLGLGAG---ALSFPNQIGTQTGHTFSYCLVDRESDSSGPLQFGPKSV 323
Query: 211 GAGVVSTPLIIRDH----YYLSLEAISV 234
G + TPL H YYLS+ AIS+
Sbjct: 324 PVGSIFTPLEKNPHLPTFYYLSVTAISI 351
>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
Length = 357
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 113/366 (30%), Positives = 169/366 (46%), Gaps = 51/366 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + + IG+P + +DTGSD W QC PC C+KQ +FDP+ SS++ +SCS+
Sbjct: 14 YFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKS--CYKQNDAVFDPRASSSFRRLSCST 71
Query: 95 SQCAVV-TSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
QC ++ C+ D C Y YG G SF+ G+LA+++ + P V+FG
Sbjct: 72 PQCKLLDVKACASTDNRCLYQVSYGDG---SFTVGDLASDSFLVSRGRTSP-----VVFG 123
Query: 152 CGHKN--LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL--PDQG---SSKIN 204
CGH N L + G L S SQ+ + KFSYCL D G SS +
Sbjct: 124 CGHDNEGLFVGAAGLLGLGAGKL-----SFPSQLSSR---KFSYCLVSRDNGVRASSALL 175
Query: 205 FG--GIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEF------VSSSTGN--IF 250
FG + A T L+ + YY L IS+G L +SSSTG +
Sbjct: 176 FGDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVI 235
Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVT 308
+D+G T LP ++ ++ + + P A+ D CY+ S + P V+
Sbjct: 236 IDSGTSVTRLPTYAYTVMRDAFRSATQKLP---RAADFSLFDT-CYDFSALTSVTIPTVS 291
Query: 309 IHFR-GADVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVS 365
HF GA V+L PSN + + C AF + ++ + G I Q + D++ + V
Sbjct: 292 FHFEGGASVQLPPSNYLVPVDTSGTFCFAFSKTSLDLSIIGNIQQQTMRVAIDLDSSRVG 351
Query: 366 FKPSRC 371
F P +C
Sbjct: 352 FAPRQC 357
>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
Length = 533
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 107/378 (28%), Positives = 173/378 (45%), Gaps = 57/378 (15%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y M + +G PP +DTGSD TW QC+PC CF Q P+FDP +S+++ I C++
Sbjct: 171 YFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPCKA--CFDQSGPVFDPSQSTSFKIIPCNA 228
Query: 95 SQCAVV--------TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP--VE 144
+ C +V +S S C Y + YG +S +SG+LA E+L+ S S P +E
Sbjct: 229 AACDLVVHDECRDNSSKTSPKTCKYFYWYGD---SSRTSGDLALESLSV-SLSDHPSSLE 284
Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK-FSYCLPDQG---- 199
+ +++ GCGH N G+ S SQ+ +S G+ FSYCL D+
Sbjct: 285 IRDMVIGCGHSNKGLFQGAGGLLGLGQG---ALSFPSQLRSSPIGQSFSYCLVDRTNNLS 341
Query: 200 -SSKINFGGIVAGA----GVVSTPLI-----IRDHYYLSLEAISVGN-------QRLEFV 242
SS I+FG A + + TP + + YYL ++ I + +R
Sbjct: 342 VSSAISFGAGFALSRHFDQMRFTPFVRTNNSVETFYYLGIQGIKIDQELLPIPAERFAIA 401
Query: 243 SSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMI---KAQPVKGVGAEPGFSDVLCYNIS 299
+ +G +D+G T L + + ++S I +A P +G +CYN +
Sbjct: 402 PNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARISYPRADPFDILG--------ICYNAT 453
Query: 300 SQPK--FPEVTIHFR-GADVKLSPSNLF--RNISDEIMCSAFRGGNANIVYGRIMQINFL 354
+ FP ++I F+ GA++ L N F + + C A + + G Q N
Sbjct: 454 GRTAVPFPTLSIVFQNGAELDLPQENYFIQPDPQEAKHCLAILPTDGMSIIGNFQQQNIH 513
Query: 355 IGYDIEQAMVSFKPSRCT 372
YD++ A + F + C+
Sbjct: 514 FLYDVQHARLGFANTDCS 531
>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
Length = 493
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 114/379 (30%), Positives = 171/379 (45%), Gaps = 57/379 (15%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +GTPP + VDTGSD W C+ CP + L+DPK SST +++
Sbjct: 87 LYYTEVRLGTPPKRFYVQVDTGSDILWVNCITCDQCPHKSGLGLDLTLYDPKASSTGSTV 146
Query: 91 SCSSSQCAVVTS----NCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
C CA CS C YS YG G S + G+ + L F+ +G
Sbjct: 147 MCDQGFCADTFGGRLPKCSANVPCEYSVTYGDG---SSTVGSFVNDALQFDQVTGDGQTQ 203
Query: 146 P---NVIFGCGHKNLASPTSDSKQ-TGIIGLGPGNSSLISQMGTSIAGK----FSYCLPD 197
P +VIFGCG + S S+ GI+G G N+S++SQ+ T AGK F++CL
Sbjct: 204 PANASVIFGCGAQQGGDLGSSSQALDGILGFGEANTSMLSQLAT--AGKVKKIFAHCL-- 259
Query: 198 QGSSKINFGGIVAGAGVV-----STPLII-RDHYYLSLEAISVGNQRLE-----FVSSST 246
I GGI A VV +TPL+ + HY ++L+ I VG LE F
Sbjct: 260 ---DTIKGGGIFAIGDVVQPKVKTTPLVADKPHYNVNLKTIDVGGTTLELPADIFKPGEK 316
Query: 247 GNIFVDTGVLRTLLP-LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLC--YNISSQPK 303
+D+G T LP L + + +V + K Q + + D LC Y+ S
Sbjct: 317 RGTIIDSGTTLTYLPELVFKKVMLAVFN---KHQDITFHDVQ----DFLCFEYSGSVDDG 369
Query: 304 FPEVTIHFRGADVKLS--PSNLFRNISDEIMCSAFRGGNAN-------IVYGRIMQINFL 354
FP +T HF D+ L P F +++ C F+ G ++ G ++ N L
Sbjct: 370 FPTLTFHFE-DDLALHVYPHEYFFPNGNDVYCVGFQNGALQSKDGKDIVLMGDLVLSNKL 428
Query: 355 IGYDIEQAMVSFKPSRCTN 373
+ YD+E ++ + C++
Sbjct: 429 VVYDLENRVIGWTDYNCSS 447
>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 531
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 116/358 (32%), Positives = 175/358 (48%), Gaps = 45/358 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YL+ + +G+P +DTGSD +W QC+PC + C Q PLFDP SSTY+ SC S
Sbjct: 198 YLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQ--CHSQADPLFDPSSSSTYSPFSCGS 255
Query: 95 SQCAVVTSN----CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
+ CA + S C Y YG G S ++G +++TL S++ + + F
Sbjct: 256 ADCAQLGQEGNGCSSSSQCQYIVTYGDG---SSTTGTYSSDTLALGSSA-----VRSFQF 307
Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK----INFG 206
GC N+ S +D + G++GLG G SL+SQ ++ FSYCLP SS +
Sbjct: 308 GC--SNVESGFND-QTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTLGAA 364
Query: 207 GIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLL 260
G +G V TP++ + Y + L+AI VG ++L +S S G + +D+G + T L
Sbjct: 365 GGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSAGTV-MDSGTVITRL 423
Query: 261 PLEYHSNLKSVMSNMIKAQPVKGVGAEP-GFSDVLCYNISSQP--KFPEVTIHFR-GADV 316
P +S L S +K P A+P G D C++ S Q P V + F GA V
Sbjct: 424 PPTAYSALSSAFKAGMKQYPP----AQPSGILDT-CFDFSGQSSVSIPSVALVFSGGAVV 478
Query: 317 KLSPSNLFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
L S + + C AF G + + + G + Q F + YD+ + +V F+ C
Sbjct: 479 SLDASGIILS-----NCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 531
>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
Length = 461
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 114/357 (31%), Positives = 173/357 (48%), Gaps = 43/357 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YL+ + +G+P +DTGSD +W QC+PC + C Q PLFDP SSTY+ SC S
Sbjct: 128 YLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQ--CHSQADPLFDPSSSSTYSPFSCGS 185
Query: 95 SQCAVVTSN----CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
+ CA + S C Y YG G S ++G +++TL S++ + + F
Sbjct: 186 ADCAQLGQEGNGCSSSSQCQYIVTYGDG---SSTTGTYSSDTLALGSSA-----VRSFQF 237
Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK----INFG 206
GC N+ S +D + G++GLG G SL+SQ ++ FSYCLP SS +
Sbjct: 238 GC--SNVESGFND-QTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTLGAA 294
Query: 207 GIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLL 260
G +G V TP++ + Y + L+AI VG ++L +S S G + +D+G + T L
Sbjct: 295 GGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSAGTV-MDSGTVITRL 353
Query: 261 PLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFR-GADVK 317
P +S L S +K P A+P C++ S Q P V + F GA V
Sbjct: 354 PPTAYSALSSAFKAGMKQYPP----AQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVS 409
Query: 318 LSPSNLFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
L S + + C AF G + + + G + Q F + YD+ + +V F+ C
Sbjct: 410 LDASGIILS-----NCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 461
>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
Length = 420
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 105/359 (29%), Positives = 160/359 (44%), Gaps = 41/359 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +GTP ++ DTGSD +W QC PC + C++Q+ P+F+P SS++ ++C+S
Sbjct: 81 YFARIGVGTPARSVYMVADTGSDVSWLQCSPCRK--CYRQQDPIFNPSLSSSFKPLACAS 138
Query: 95 SQCAVVT-SNCS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
S C + CS + +C Y YG G SF+ G+ +TETL+F + V M GC
Sbjct: 139 SICGKLKIKGCSRKNECMYQVSYGDG---SFTVGDFSTETLSFGEHAVRSVAM-----GC 190
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFGGIV 209
G N + G+ S SQ GTS A FSYCLP + S + + FG
Sbjct: 191 GRNNQGLFHGAAGLLGLGRG---PLSFPSQTGTSYASVFSYCLPRRESAIAASLVFGPSA 247
Query: 210 AGAGVVSTPLI----IRDHYYLSLEAISVGN-------QRLEFVSSSTGNIFVDTGVLRT 258
T L+ + +YY+ L I V S TG + VD+G +
Sbjct: 248 VPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGGVIVDSGTAIS 307
Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQ--PKFPEVTIHFR-GA 314
L ++ L+ +++ + PG S CY++SS P V + F GA
Sbjct: 308 RLTTPAYTALRDAFRSLVT------FPSAPGISLFDTCYDLSSMKTATLPAVVLDFDGGA 361
Query: 315 DVKLSPSNLFRNISDE-IMCSAFR-GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ L + N+ DE C AF A + G + Q F I D ++ + P +C
Sbjct: 362 SMPLPADGILVNVDDEGTYCLAFAPEEEAFSIIGNVQQQTFRISIDNQKEQMGIAPDQC 420
>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 414
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 109/373 (29%), Positives = 175/373 (46%), Gaps = 60/373 (16%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +G+ + + +DTGSD TW QCEPC + C+ Q+ P+F P SS+Y S+SC+S
Sbjct: 65 YIVTMGLGSKNMTVI--IDTGSDLTWVQCEPC--MSCYNQQGPIFKPSTSSSYQSVSCNS 120
Query: 95 SQCAVV------TSNCSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
S C + T C + C+Y YG G+Y ++G L E L+F V +
Sbjct: 121 STCQSLQFATGNTGACGSSNPSTCNYVVNYGDGSY---TNGELGVEALSFGG-----VSV 172
Query: 146 PNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKI 203
+ +FGCG N S G++GLG SL+SQ + G FSYCLP + GSS
Sbjct: 173 SDFVFGCGRNNKGLFGGVS---GLMGLGRSYLSLVSQTNATFGGVFSYCLPTTEAGSS-- 227
Query: 204 NFGGIVAG--------------AGVVSTPLIIRDHYYLSLEAISVGNQRLEF-VSSSTGN 248
G +V G ++S P + + Y L+L I VG L+ +S G
Sbjct: 228 --GSLVMGNESSVFKNANPITYTRMLSNPQ-LSNFYILNLTGIDVGGVALKAPLSFGNGG 284
Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFP 305
I +D+G + T LP + LK+ P + PGFS + C+N++ + P
Sbjct: 285 ILIDSGTVITRLPSSVYKALKAEFLKKFTGFP-----SAPGFSILDTCFNLTGYDEVSIP 339
Query: 306 EVTIHFRG-ADVKLSPSNLFRNISDE-----IMCSAFRGGNANIVYGRIMQINFLIGYDI 359
+++ F G A + + + F + ++ + ++ + G Q N + YD
Sbjct: 340 TISLRFEGNAQLNVDATGTFYVVKEDASQVCLALASLSDAYDTAIIGNYQQRNQRVIYDT 399
Query: 360 EQAMVSFKPSRCT 372
+Q+ V F C+
Sbjct: 400 KQSKVGFAEEPCS 412
>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 494
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 116/382 (30%), Positives = 173/382 (45%), Gaps = 61/382 (15%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + IGTP + VDTGSD W C+ CP + L+DP S++ ++
Sbjct: 88 LYFTQIGIGTPSKGYYVQVDTGSDILWVNCISCDSCPRKSGLGIDLTLYDPTASASSKTV 147
Query: 91 SCSSSQCAVVT------SNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG---L 141
+C CA T S + C YS YG G S ++G + L ++ SG
Sbjct: 148 TCGQEFCATATNGGVPPSCAANSPCQYSITYGDG---SSTTGFFVADFLQYDQVSGDGQT 204
Query: 142 PVEMPNVIFGCGHK-NLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK----FSYCLP 196
+ +V FGCG K A +S+ GI+G G NSS++SQ+ + AGK FS+CL
Sbjct: 205 NLANASVTFGCGAKIGGALGSSNVALDGILGFGQANSSMLSQL--TSAGKVTKIFSHCL- 261
Query: 197 DQGSSKINFGGIVAGAGVV-----STPLII-RDHYYLSLEAISVGNQRLEF------VSS 244
+N GGI A VV +TPL+ HY + L+ I VG L+ +
Sbjct: 262 ----DTVNGGGIFAIGNVVQPKVKTTPLVPGMPHYNVVLKTIDVGGSTLQLPTNIFDIGG 317
Query: 245 STGNIFVDTGVLRTLLP-LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK 303
+ +D+G LP + Y + L +V SN +K V D LC+ S
Sbjct: 318 GSRGTIIDSGTTLAYLPEVVYKAVLSAVFSNHPDVT-LKNV------QDFLCFQYSGSVD 370
Query: 304 --FPEVTIHFRGADVKLS--PSN-LFRNISDEIMCSAFRGGNAN-------IVYGRIMQI 351
FPEVT HF G D+ L P + LF+N D + C F+ G ++ G +
Sbjct: 371 NGFPEVTFHFDG-DLPLVVYPHDYLFQNTED-VYCVGFQSGGVQSKDGKDMVLLGDLALS 428
Query: 352 NFLIGYDIEQAMVSFKPSRCTN 373
N L+ YD+E ++ + C++
Sbjct: 429 NKLVVYDLENQVIGWTNYNCSS 450
>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 474
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 105/372 (28%), Positives = 175/372 (47%), Gaps = 55/372 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+ + +G + VDT S+ TW QC+PC C Q+ PLFDP S +Y ++ C+S
Sbjct: 120 YVATVGLGAAEATVV--VDTASELTWVQCQPCES--CHDQQDPLFDPSSSPSYAAVPCNS 175
Query: 95 SQC-------AVVTSNCSEGD-----CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
S C A TS C++ + CSY+ Y G+Y S G LA + L +G
Sbjct: 176 SSCDALRVAMAAGTSPCADDNEQQPACSYALSYRDGSY---SRGVLARDKLRL---AGQD 229
Query: 143 VEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGS 200
+E +FGCG N +P + +G++GLG + SL+SQ G FSYCLP + GS
Sbjct: 230 IE--GFVFGCGTSNQGAPFGGT--SGLMGLGRSHVSLVSQTMDQFGGVFSYCLPMRESGS 285
Query: 201 SKINFGGIVAGAGVVSTPLI----IRDH-------YYLSLEAISVGNQRLEFVSSSTGNI 249
S G + A STP++ + D Y+L+L I+VG Q +E S G +
Sbjct: 286 SGSLVLGDDSSAYRNSTPIVYTAMVSDSGPLQGPFYFLNLTGITVGGQEVESPWFSAGRV 345
Query: 250 FVDTG-VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFP 305
+D+G ++ TL+P Y++ +S + + P FS + C+N++ + + P
Sbjct: 346 IIDSGTIITTLVPSVYNAVRAEFLSQLAEYPQA------PAFSILDTCFNLTGLKEVQVP 399
Query: 306 EVTIHFRGA-DVKLSPSNLFRNISDE-----IMCSAFRGGNANIVYGRIMQINFLIGYDI 359
+ F G+ +V++ + +S + + ++ + + G Q N + +D
Sbjct: 400 SLKFVFEGSVEVEVDSKGVLYFVSSDASQVCLALASLKSEYDTSIIGNYQQKNLRVIFDT 459
Query: 360 EQAMVSFKPSRC 371
+ + F C
Sbjct: 460 LGSQIGFAQETC 471
>gi|357439021|ref|XP_003589787.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355478835|gb|AES60038.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 456
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 97/356 (27%), Positives = 154/356 (43%), Gaps = 47/356 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + + IG+P + + +D+GSD W QCEPC + C+ Q P+F+P S+++ ++CSS
Sbjct: 129 YFVRIGIGSPAIYQYMVIDSGSDIVWIQCEPCDQ--CYNQTDPIFNPATSASFIGVACSS 186
Query: 95 SQCAVVTSN--CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
+ C + + C +G C Y YG G+Y + G LA ET+T T + + GC
Sbjct: 187 NVCNQLDDDVACRKGRCGYQVAYGDGSY---TKGTLALETITIGRTV-----IQDTAIGC 238
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGA 212
GH N + G+ G S + Q+G G F YCL +
Sbjct: 239 GHWNEGMFVGAAGLLGLGGG---PMSFVGQLGAQTGGAFGYCLVSRAMP----------V 285
Query: 213 GVVSTPLIIR----DHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTLLP 261
G + PLI YY+SL ++VG R+ + TG + +DTG T LP
Sbjct: 286 GAMWVPLIHNPFYPSFYYVSLSGLAVGGIRVPISEQIFQLTDIGTGGVVMDTGTAITRLP 345
Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISS--QPKFPEVTIHFRGADVKL 318
++ + P PG S CY+++ + P V+ +F G +
Sbjct: 346 TVAYNAFRDAFIAQTTNLP-----RAPGVSIFDTCYDLNGFVTVRVPTVSFYFSGGQILT 400
Query: 319 SPSNLFRNISDEI--MCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
P+ F +D++ C AF + + + G I Q + D V F P+ C
Sbjct: 401 FPARNFLIPADDVGTFCFAFAPSPSGLSIIGNIQQEGIQVSIDGTNGFVGFGPNVC 456
>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
Length = 489
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 109/379 (28%), Positives = 168/379 (44%), Gaps = 57/379 (15%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +GTPP + VDTGSD W CE CP + +DPK SS+ +++
Sbjct: 83 LYFTEIKLGTPPKRYYVQVDTGSDILWVNCISCEKCPRKSGLGLDLTFYDPKASSSGSTV 142
Query: 91 SCSSSQCAVVTSNCSEG-----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
SC CA G C YS +YG G S ++G T+ L F+ +G
Sbjct: 143 SCDQGFCAATYGGKLPGCTANVPCEYSVMYGDG---SSTTGFFVTDALQFDQVTGDGQTQ 199
Query: 146 P---NVIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGTSIAGK----FSYCLPD 197
P V FGCG + +S+ GI+G G N+S++SQ+ + AGK F++CL
Sbjct: 200 PGNATVTFGCGAQQGGDLGSSNQALDGILGFGQANTSMLSQL--AAAGKVKKIFAHCL-- 255
Query: 198 QGSSKINFGGIVAGAGVV-----STPLII-RDHYYLSLEAISVGNQRLE-----FVSSST 246
I GGI A VV +TPL+ HY ++L++I VG L+ F +
Sbjct: 256 ---DTIKGGGIFAIGNVVQPKVKTTPLVADMPHYNVNLKSIDVGGTTLQLPAHVFETGER 312
Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMI-KAQPVKGVGAEPGFSDVLCYNI--SSQPK 303
+D+G T LP K VM+ + K Q + D +C+ S
Sbjct: 313 KGTIIDSGTTLTYLP---ELVFKEVMAAIFNKHQDI----VFHNVQDFMCFQYPGSVDDG 365
Query: 304 FPEVTIHFRGADVKLS--PSNLFRNISDEIMCSAFRGGNAN-------IVYGRIMQINFL 354
FP +T HF D+ L P F +++ C F+ G ++ G ++ N L
Sbjct: 366 FPTITFHFE-DDLALHVYPHEYFFPNGNDMYCVGFQNGALQSKDGKDIVLMGDLVLSNKL 424
Query: 355 IGYDIEQAMVSFKPSRCTN 373
+ YD+E ++ + C++
Sbjct: 425 VIYDLENQVIGWTDYNCSS 443
>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
Length = 478
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 101/369 (27%), Positives = 172/369 (46%), Gaps = 41/369 (11%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDC---FKQEPPLFDPKKSSTYNSI 90
+Y + +G+PP + + VDTGSD W C PCP+ L+D K SST ++
Sbjct: 73 LYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKNV 132
Query: 91 SCSSSQCAVV--TSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP- 146
C C+ + + C ++ CSY +YG G S S G+ + +T +G P
Sbjct: 133 GCEDDFCSFIMQSETCGAKKPCSYHVVYGDG---STSDGDFIKDNITLEQVTGNLRTAPL 189
Query: 147 --NVIFGCGHKNLASP--TSDSKQTGIIGLGPGNSSLISQM--GTSIAGKFSYCLPDQGS 200
V+FGCG KN + +DS GI+G G N+S+ISQ+ G S FS+CL +
Sbjct: 190 AQEVVFGCG-KNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNG 248
Query: 201 SKINFGGIVAGAGVVSTPLIIRD-HYYLSLEAISVGNQRLEF-----VSSSTGNIFVDTG 254
I G V V +TP++ HY + L+ + V ++ ++ G +D+G
Sbjct: 249 GGIFAVGEVESPVVKTTPIVPNQVHYNVILKGMDVDGDPIDLPPSLASTNGDGGTIIDSG 308
Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHFR 312
LP ++ S++ + Q VK + F+ C++ +S FP V +HF
Sbjct: 309 TTLAYLPQNLYN---SLIEKITAKQQVKLHMVQETFA---CFSFTSNTDKAFPVVNLHFE 362
Query: 313 GADVKLS--PSNLFRNISDEIMCSAFRGGNAN-------IVYGRIMQINFLIGYDIEQAM 363
+ +KLS P + ++ +++ C ++ G I+ G ++ N L+ YD+E +
Sbjct: 363 DS-LKLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEV 421
Query: 364 VSFKPSRCT 372
+ + C+
Sbjct: 422 IGWADHNCS 430
>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 458
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 109/359 (30%), Positives = 165/359 (45%), Gaps = 47/359 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ +SIGTP + +DTGSD +W C FDP KSSTY SCSS
Sbjct: 125 YVITVSIGTPAMTQAVMIDTGSDVSWVHCH----ARAGAGSSLFFDPGKSSTYTPFSCSS 180
Query: 95 SQCAVVT---SNCS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
+ C + + CS C Y+ YG G S ++G ++TL NST ++ N F
Sbjct: 181 AACTRLEGRDNGCSLNSTCQYTVRYGDG---SNTTGTYGSDTLALNSTE----KVENFQF 233
Query: 151 GCGHKNLASPTSDSKQT-GIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKINFGG 207
GC + D QT G++GLG G SL+SQ + FSYCLP + S + G
Sbjct: 234 GCSETSDPGEGLDEDQTDGLMGLGGGAPSLVSQTAATYGSAFSYCLPATTRSSGFLTLGA 293
Query: 208 IVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLP 261
+G V+TP+ Y++ L+ I+VG + + + G+I +D+G + T LP
Sbjct: 294 STGTSGFVTTPMFRSRRAPTFYFVILQGINVGGDPVAISPTVFAAGSI-MDSGTIITRLP 352
Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQPK--FPEVTIHFR-GADVK 317
+S L + ++ P FS + C++ + Q P V + F GA V
Sbjct: 353 PRAYSALSAAFRAGMRRYP-----RARAFSILDTCFDFTGQDNVSIPAVELVFSGGAVVD 407
Query: 318 LSPSNLFRNISDEIM---CSAFRGGNANI--VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
L +D IM C AF I + G + Q F + +D+ Q+++ F+P C
Sbjct: 408 LD--------ADGIMYGSCLAFAPATGGIGSIIGNVQQRTFEVLHDVGQSVLGFRPGAC 458
>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
Length = 479
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 109/367 (29%), Positives = 168/367 (45%), Gaps = 52/367 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELD-CFKQEPPLFDPKKSSTYNSISCS 93
Y++ + +G+P + +DTGSD +W QCEPCP C LFDP SSTY + +CS
Sbjct: 135 YVISVGLGSPAMTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFNCS 194
Query: 94 SSQCAVV-----TSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
++ CA + + C ++ C Y YG G S ++G +++ LT + + +
Sbjct: 195 AAACAQLGDSGEANGCDAKSRCQYIVKYGDG---SNTTGTYSSDVLTLSGSD----VVRG 247
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK----- 202
FGC H L + D K G+IGLG SL+SQ FSYCLP +S
Sbjct: 248 FQFGCSHAELGAGM-DDKTDGLIGLGGDAQSLVSQTAARYGKSFSYCLPATPASSGFLTL 306
Query: 203 --INFGGIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTG 254
GG + +TP++ + +Y+ +LE I+VG ++L S + G++ VD+G
Sbjct: 307 GAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLSPSVFAAGSL-VDSG 365
Query: 255 VLRTLLPLEYHSNLKSV----MSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVT 308
+ T LP ++ L S M+ +A+P+ G D C+N + K P V
Sbjct: 366 TVITRLPPAAYAALSSAFRAGMTRYARAEPL-------GILDT-CFNFTGLDKVSIPTVA 417
Query: 309 IHFR-GADVKLSPSNLFRNISDEIMCSAF---RGGNANIVYGRIMQINFLIGYDIEQAMV 364
+ F GA V L + C AF R A G + Q F + YD+ +
Sbjct: 418 LVFAGGAVVDLDAHGIVSG-----GCLAFAPTRDDKAFGTIGNVQQRTFEVLYDVGGGVF 472
Query: 365 SFKPSRC 371
F+ C
Sbjct: 473 GFRAGAC 479
>gi|302764208|ref|XP_002965525.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
gi|300166339|gb|EFJ32945.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
Length = 464
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 107/368 (29%), Positives = 163/368 (44%), Gaps = 63/368 (17%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCS 93
+Y +++G+PP D +DTGSD TW +C+PC DC FD S+TY +++C+
Sbjct: 123 VYYSSITLGSPPKDFSLVMDTGSDLTWVRCDPCSP-DC----SSTFDRLASNTYKALTCA 177
Query: 94 SSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTF-NSTSGLPVEMPNVIFGC 152
D L R F SG +TL + S E P +FGC
Sbjct: 178 D-------------DLRLPVLL-RLWRRLFHSGRSLRDTLKMAGAASDELEEFPGFVFGC 223
Query: 153 GH--KNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ-GSSKINFGGIV 209
G K L S + GI+ L PG+ S SQ+G KFSYCL Q + + +V
Sbjct: 224 GSLLKGLI-----SGEVGILALSPGSLSFPSQIGEKYGNKFSYCLLRQTAQNSLKKSPMV 278
Query: 210 AGAGVVS--------------TPLIIRDHYY-LSLEAISVGNQRLE-----FVSSSTGNI 249
G V TP+ YY + L+ ISVGNQRL+ F++
Sbjct: 279 FGEAAVELKEPGSGKPQELQYTPIGESSIYYTVRLDGISVGNQRLDLSPSTFLNGQDKPT 338
Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQ---PVKGVGAEPGFSDVLCYNI--SSQPKF 304
D+G T+LP ++K +++M+ +KG+ A C+ + SS
Sbjct: 339 IFDSGTTLTMLPSGVCDSIKQSLASMVSGAEFVAIKGLDA--------CFRVPPSSGQGL 390
Query: 305 PEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAM 363
P++T HF GAD PSN ++ + C F N ++G + Q +F + +D++
Sbjct: 391 PDITFHFNGGADFVTRPSNYVIDLG-SLQCLIFVPTNEVSIFGNLQQQDFFVLHDMDNRR 449
Query: 364 VSFKPSRC 371
+ FK + C
Sbjct: 450 IGFKETDC 457
>gi|357155293|ref|XP_003577072.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Brachypodium distachyon]
Length = 429
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 113/386 (29%), Positives = 175/386 (45%), Gaps = 44/386 (11%)
Query: 16 PKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQE 75
P P ++ EI + + M +S+GTPPV +VDTGS +W C+ C ++ C
Sbjct: 58 PAEPSPVVGNHEIH--EGKFFMDISLGTPPVANLVTVDTGSTLSWVVCQRC-QISCHTTA 114
Query: 76 PP---LFDPKKSSTYNSISCSSSQCAVVTSN------CSE--GDCSYSFLYGRGAYASFS 124
P +FDP KS+TY + CSS CA V + C E C YS YG G +S
Sbjct: 115 PEAGSVFDPDKSTTYELVGCSSRDCADVQRSLVAPFGCIEETDTCLYSLRYGSGPSGQYS 174
Query: 125 SGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMG 184
+G L T+ LT S+S + + IFGC + + ++G+IG G N S +Q+
Sbjct: 175 AGRLGTDKLTLASSSSI---IDGFIFGCSGDD----SFKGYESGVIGFGGANFSFFNQVA 227
Query: 185 TSIAGK-FSYCLPDQGSSKINFGGIVAGA----GVVSTPLI--IRDHYYLSLEAIS--VG 235
+ FSYC P +++ G + GA +V T LI D SL+ I V
Sbjct: 228 RQTNYRAFSYCFPGDHTAE---GFLSIGAYPKDELVYTNLIPHFGDRSVYSLQQIDMMVD 284
Query: 236 NQRLEFVSS--STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPV--KGVGAEPGFS 291
RL+ S + + VD+G + T L M++ ++A+ VG E F
Sbjct: 285 GNRLQVDQSEYTKRMMVVDSGTVDTFLLGPVFDAFSKAMASAMQAKGFLSDTVGTETCFR 344
Query: 292 DVLCYNISSQPKFPEVTIHFRGADVKLSPSNLFRNI--SDEIMCSAFRGGNANI----VY 345
++ S P V + F G +KL P N+F ++ S + +C AF+ A + +
Sbjct: 345 PNGGDSVDSG-DLPTVEMRFIGTTLKLPPENVFHDLLPSHDKICLAFKPDVAGVRNVQIL 403
Query: 346 GRIMQINFLIGYDIEQAMVSFKPSRC 371
G +F + YD++ F+ C
Sbjct: 404 GNKATXSFRVVYDLQAMYFGFQAGAC 429
>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 560
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 110/377 (29%), Positives = 170/377 (45%), Gaps = 50/377 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y M + +GTPP +DTGSD W QC PC CF+Q P +DPK SS++ +I+C
Sbjct: 195 YFMDVFVGTPPKHFSLILDTGSDLNWIQCVPC--YACFEQNGPYYDPKDSSSFKNITCHD 252
Query: 95 SQCAVVTS-------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTS--GLPVE- 144
+C +V+S C Y + YG +S ++G+ A ET T N T+ G P
Sbjct: 253 PRCQLVSSPDPPQPCKGETQSCPYFYWYGD---SSNTTGDFALETFTVNLTTPEGKPELK 309
Query: 145 -MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG---- 199
+ NV+FGCGH N + G+ S +Q+ + FSYCL D+
Sbjct: 310 IVENVMFGCGHWNRGLFHGAAGLLGLGRG---PLSFATQLQSLYGHSFSYCLVDRNSNSS 366
Query: 200 -SSKINFG---GIVAGAGVVSTPLI------IRDHYYLSLEAISVGNQRLEFVSSST--- 246
SSK+ FG +++ + T + + YY+ +++I VG + L+ +
Sbjct: 367 VSSKLIFGEDKELLSHPNLNFTSFVGGKENPVDTFYYVLIKSIMVGGEVLKIPEETWHLS 426
Query: 247 ----GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP 302
G +D+G T + +K IK P+ V P CYN+S
Sbjct: 427 AQGGGGTIIDSGTTLTYFAEPAYEIIKEAFMRKIKGFPL--VETFPPLKP--CYNVSGVE 482
Query: 303 K--FPEVTIHFR-GADVKLSPSNLFRNIS-DEIMCSAFRG--GNANIVYGRIMQINFLIG 356
K PE I F GA N F I ++++C A G +A + G Q NF I
Sbjct: 483 KMELPEFAILFADGAMWDFPVENYFIQIEPEDVVCLAILGTPRSALSIIGNYQQQNFHIL 542
Query: 357 YDIEQAMVSFKPSRCTN 373
YD++++ + + P +C +
Sbjct: 543 YDLKKSRLGYAPMKCAD 559
>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
Length = 420
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 112/407 (27%), Positives = 174/407 (42%), Gaps = 75/407 (18%)
Query: 2 QNSQKLPFYNDNETPKSPI----SIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSD 57
++S ++ F +D S+ +QA + + Y M++S+GTP + DTGSD
Sbjct: 49 RDSHRIAFLSDATAAGKATTTNSSVSFQALLENGVGGYNMNISVGTPLLTFSVVADTGSD 108
Query: 58 CTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNCSEGDCSYSFL 114
WTQC PC + CF+Q P F P SST++ + C+SS C + C+ C Y++
Sbjct: 109 LIWTQCAPCTK--CFQQPAPPFQPASSSTFSKLPCTSSFCQFLPNSIRTCNATGCVYNYK 166
Query: 115 YGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGP 174
YG G +++G LATETL S P+V FGC +N GLG
Sbjct: 167 YGSG----YTAGYLATETLKVGDAS-----FPSVAFGCSTEN--------------GLG- 202
Query: 175 GNSSLISQMGTSIAGKFSYCLPD---QGSSKINFG-------GIVAGAGVVSTPLIIRDH 224
Q+ + G+FSYCL G+S I FG G V V+ P + +
Sbjct: 203 -------QLDLGV-GRFSYCLRSGSAAGASPILFGSLANLTDGNVQSTPFVNNPAVHPSY 254
Query: 225 YYLSLEAISVGNQRLEFVSSS--------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMI 276
YY++L I+VG L +S+ G VD+G T L + + +K +
Sbjct: 255 YYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAF--LS 312
Query: 277 KAQPVKGVGAEPGFSDVLCYNISSQP----KFPEVTIHFRGADVKLSPSNLFRNISDE-- 330
+ V V G LC+ + P + + F G P+ +D
Sbjct: 313 QTADVTTVNGTRGLD--LCFKSTGGGGGGIAVPSLVLRFDGGAEYAVPTYFAGVETDSQG 370
Query: 331 ------IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+M +G V G +MQ++ + YD++ + SF P+ C
Sbjct: 371 SVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFAPADC 417
>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 482
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 101/370 (27%), Positives = 173/370 (46%), Gaps = 41/370 (11%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDC---FKQEPPLFDPKKSSTYNSI 90
+Y + +G+PP + + VDTGSD W C PCP+ L+D K SST ++
Sbjct: 77 LYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKNV 136
Query: 91 SCSSSQCAVV--TSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP- 146
C C+ + + C ++ CSY +YG G S S G+ + +T +G P
Sbjct: 137 GCEDDFCSFIMQSETCGAKKPCSYHVVYGDG---STSDGDFIKDNITLEQVTGNLRTAPL 193
Query: 147 --NVIFGCGHKNLASP--TSDSKQTGIIGLGPGNSSLISQM--GTSIAGKFSYCLPDQGS 200
V+FGCG KN + +DS GI+G G N+S+ISQ+ G S FS+CL +
Sbjct: 194 AQEVVFGCG-KNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNG 252
Query: 201 SKINFGGIVAGAGVVSTPLIIRD-HYYLSLEAISVGNQRLEF-----VSSSTGNIFVDTG 254
I G V V +TP++ HY + L+ + V ++ ++ G +D+G
Sbjct: 253 GGIFAVGEVESPVVKTTPIVPNQVHYNVILKGMDVDGDPIDLPPSLASTNGDGGTIIDSG 312
Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHFR 312
LP +++L + + Q VK + F+ C++ +S FP V +HF
Sbjct: 313 TTLAYLPQNLYNSL---IEKITAKQQVKLHMVQETFA---CFSFTSNTDKAFPVVNLHFE 366
Query: 313 GADVKLS--PSNLFRNISDEIMCSAFRGGNAN-------IVYGRIMQINFLIGYDIEQAM 363
+ +KLS P + ++ +++ C ++ G I+ G ++ N L+ YD+E +
Sbjct: 367 DS-LKLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEV 425
Query: 364 VSFKPSRCTN 373
+ + C++
Sbjct: 426 IGWADHNCSS 435
>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 473
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 103/366 (28%), Positives = 160/366 (43%), Gaps = 36/366 (9%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDC---FKQEPPLFDPKKSSTYNSI 90
+Y + +G+PP + VDTGSD W C+PCPE LFD SST +
Sbjct: 73 LYFTKIKLGSPPKEYHVQVDTGSDILWVNCKPCPECPSKTNLNFHLSLFDVNASSTSKKV 132
Query: 91 SCSSSQCAVVTSNCS---EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP- 146
C C+ ++ + S CSY +Y A S S GN + LT +G P
Sbjct: 133 GCDDDFCSFISQSDSCQPAVGCSYHIVY---ADESTSEGNFIRDKLTLEQVTGDLQTGPL 189
Query: 147 --NVIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGTSIAGK--FSYCLPDQGSS 201
V+FGCG SDS G++G G N+S++SQ+ + K FS+CL +
Sbjct: 190 GQEVVFGCGSDQSGQLGKSDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKGG 249
Query: 202 KINFGGIVAGAGVVSTPLIIRD-HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRT 258
I G+V V +TP++ HY + L + V L+ S G VD+G
Sbjct: 250 GIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDVDGTALDLPPSIMRNGGTIVDSGTTLA 309
Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHFRGADV 316
P + +L + ++ QPVK E F C++ S FP V+ F + V
Sbjct: 310 YFPKVLYDSL---IETILARQPVKLHIVEDTFQ---CFSFSENVDVAFPPVSFEFEDS-V 362
Query: 317 KLS--PSNLFRNISDEIMCSAFRGGNAN-------IVYGRIMQINFLIGYDIEQAMVSFK 367
KL+ P + + E+ C ++ G I+ G ++ N L+ YD+E ++ +
Sbjct: 363 KLTVYPHDYLFTLEKELYCFGWQAGGLTTGERTEVILLGDLVLSNKLVVYDLENEVIGWA 422
Query: 368 PSRCTN 373
C++
Sbjct: 423 DHNCSS 428
>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
Length = 444
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 116/378 (30%), Positives = 167/378 (44%), Gaps = 64/378 (16%)
Query: 39 LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA 98
L+IGTPP +I +DTGS+ +W +C+ P +F+P S TY I CSS C
Sbjct: 71 LTIGTPPQNITMVLDTGSELSWLRCKKEPNFTS------IFNPLASKTYTKIPCSSQTCK 124
Query: 99 VVTSN------CSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
TS+ C C + Y A AS G+LA ET F S + P +FG
Sbjct: 125 TRTSDLTLPVTCDPAKLCHFIISY---ADASSVEGHLAFETFRFGS-----LTRPATVFG 176
Query: 152 CGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVA 210
C +S T D+K TG++G+ G+ S ++QMG KFSYC+ S+ G
Sbjct: 177 CMDSGSSSNTEEDAKTTGLMGMNRGSLSFVNQMGFR---KFSYCISGLDSTGFLLLGEAR 233
Query: 211 GAGV----------VSTPLIIRDH--YYLSLEAISVGNQRLE-----FVSSST--GNIFV 251
+ + +STPL D Y + LE I V N+ L FV T G V
Sbjct: 234 YSWLKPLNYTPLVQISTPLPYFDRVAYSVQLEGIKVNNKVLPLPKSVFVPDHTGAGQTMV 293
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGF----SDVLCYNI----SSQPK 303
D+G T L +S L+ +++ V V EP + + LCY I S+ P
Sbjct: 294 DSGTQFTFLLGPVYSALRKEF--LLQTAGVLRVLNEPQYVFQGAMDLCYLIDSTSSTLPN 351
Query: 304 FPEVTIHFRGADVKLSPSNLFRNI------SDEIMCSAF----RGGNANIVYGRIMQINF 353
P V + FRGA++ +S L + D + C F G ++ + G Q N
Sbjct: 352 LPVVKLMFRGAEMSVSGQRLLYRVPGEVRGKDSVWCFTFGNSDELGISSFLIGHHQQQNV 411
Query: 354 LIGYDIEQAMVSFKPSRC 371
+ YD+E + + F RC
Sbjct: 412 WMEYDLENSRIGFAELRC 429
>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 461
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 114/355 (32%), Positives = 176/355 (49%), Gaps = 39/355 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YL+ + +G+P +DTGSD +W QC+PC + C Q PLFDP SSTY+ SC S
Sbjct: 128 YLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQ--CHSQADPLFDPSSSSTYSPFSCGS 185
Query: 95 SQCAVVTSN----CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
+ CA + S C Y YG G S ++G +++TL S++ + + F
Sbjct: 186 AACAQLGQEGNGCSSSSQCQYIVTYGDG---SSTTGTYSSDTLALGSSA-----VKSFQF 237
Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK----INFG 206
GC N+ S +D + G++GLG G SL+SQ ++ FSYCLP SS +
Sbjct: 238 GC--SNVESGFND-QTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTLGAA 294
Query: 207 GIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLL 260
G +G V TP++ + Y + L+AI VG ++L +S S G + +D+G + T L
Sbjct: 295 GGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSAGTV-MDSGTVITRL 353
Query: 261 PLEYHSNLKSVMSNMIKAQPVKGVGAEP-GFSDVLCYNISSQP--KFPEVTIHFR-GADV 316
P +S L S +K P A+P G D C++ S Q P V + F GA V
Sbjct: 354 PPTAYSALSSAFKAGMKQYPP----AQPSGILDT-CFDFSGQSSVSIPSVALVFSGGAVV 408
Query: 317 KLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
L S + +S+ + +A ++ + G + Q F + YD+ + +V F+ C
Sbjct: 409 SLDASGII--LSNCLAFAANSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 461
>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
Length = 452
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 103/355 (29%), Positives = 160/355 (45%), Gaps = 39/355 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + GTP ++ +DTGSD W C+ C P+FDP KSS+Y +C S
Sbjct: 115 YIIQVDFGTPKQSMYTLIDTGSDVAWIPCKQCQGC---HSTAPIFDPAKSSSYKPFACDS 171
Query: 95 SQCAVVTSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
C ++ NC C + YG G G LA++ +T S +PN FGC
Sbjct: 172 QPCQEISGNCGGNSKCQFEVSYGDGTQV---DGTLASDAITLGSQ-----YLPNFSFGCA 223
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAG-- 211
++L+ TS S +G G + + G FSYCLP +S + +V G
Sbjct: 224 -ESLSEDTSPSPGLMGLGGGSLSLLTQAPTAELFGGTFSYCLPSSSTSSGS---LVLGKE 279
Query: 212 AGVVSTPL----IIRD-----HYYLSLEAISVGNQRLEFVSS---STGNIFVDTGVLRT- 258
A V S+ L +I+D Y+++L+AISVGN R+ + S G +D+G T
Sbjct: 280 AAVSSSSLKFTTLIKDPSIPTFYFVTLKAISVGNTRISVPGTNIASGGGTIIDSGTTITH 339
Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP-KFPEVTIHF-RGADV 316
L+P Y + + + QP P CY++SS P +T+H R D+
Sbjct: 340 LVPSAYTALRDAFRQQLSSLQPT------PVEDMDTCYDLSSSSVDVPTITLHLDRNVDL 393
Query: 317 KLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
L N+ + C AF ++ + G + Q N+ I +D+ + V F +C
Sbjct: 394 VLPKENILITQESGLACLAFSSTDSRSIIGNVQQQNWRIVFDVPNSQVGFAQEQC 448
>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 535
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 106/385 (27%), Positives = 172/385 (44%), Gaps = 66/385 (17%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y M + +G+PP +DTGSD W QC PC DCF+Q +DPK S++Y +I+C+
Sbjct: 170 YFMDVLVGSPPKHFSLILDTGSDLNWIQCLPC--YDCFQQNGAFYDPKASASYKNITCND 227
Query: 95 SQCAVVTS-------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFN-STSGLPVEMP 146
+C +V+S C Y + YG +S ++G+ A ET T N +T+G E+
Sbjct: 228 QRCNLVSSPDPPMPCKSDNQSCPYYYWYGD---SSNTTGDFAVETFTVNLTTNGGSSELY 284
Query: 147 NV---IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSS-- 201
NV +FGCGH N + G+ S SQ+ + FSYCL D+ S
Sbjct: 285 NVENMMFGCGHWNRGLFHGAAGLLGLGRG---PLSFSSQLQSLYGHSFSYCLVDRNSDTN 341
Query: 202 -----------------KINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRL----- 239
+NF VAG ++ YY+ +++I V + L
Sbjct: 342 VSSKLIFGEDKDLLSHPNLNFTSFVAGK-----ENLVDTFYYVQIKSILVAGEVLNIPEE 396
Query: 240 --EFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSD--VL- 294
S G +D+G + + +K+ ++ K G P + D +L
Sbjct: 397 TWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAE-------KAKGKYPVYRDFPILD 449
Query: 295 -CYNIS--SQPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRG--GNANIVYGRI 348
C+N+S + PE+ I F GA N F ++++++C A G +A + G
Sbjct: 450 PCFNVSGIHNVQLPELGIAFADGAVWNFPTENSFIWLNEDLVCLAMLGTPKSAFSIIGNY 509
Query: 349 MQINFLIGYDIEQAMVSFKPSRCTN 373
Q NF I YD +++ + + P++C +
Sbjct: 510 QQQNFHILYDTKRSRLGYAPTKCAD 534
>gi|363808270|ref|NP_001242239.1| uncharacterized protein LOC100801883 [Glycine max]
gi|255641727|gb|ACU21134.1| unknown [Glycine max]
Length = 475
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 109/370 (29%), Positives = 167/370 (45%), Gaps = 37/370 (10%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTW---TQCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y L +G+PP D + VDTGSD W +C CP + L+DPK S T + +
Sbjct: 69 LYFTKLGLGSPPRDYYVQVDTGSDILWVNCVECSRCPRKSDLGIDLTLYDPKGSETSDVV 128
Query: 91 SCSSSQCAVV----TSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
SC C+ C SE C YS YG G S ++G + LT+N +G
Sbjct: 129 SCDQDFCSATFDGPIPGCKSEIPCPYSITYGDG---SATTGYYVQDYLTYNRINGNLRTS 185
Query: 146 P---NVIFGCGHKNLASPTSDSKQT--GIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQ 198
P ++IFGCG + S S++ GIIG G NSS++SQ+ S + FS+CL +
Sbjct: 186 PQNSSIIFGCGAVQSGTLGSSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDNV 245
Query: 199 GSSKINFGGIVAGAGVVSTPLIIR-DHYYLSLEAISVGNQRLE-----FVSSSTGNIFVD 252
I G V V +TPL+ R HY + L++I V L+ F S + +D
Sbjct: 246 RGGGIFAIGEVVEPKVSTTPLVPRMAHYNVVLKSIEVDTDILQLPSDIFDSVNGKGTVID 305
Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQP-VKGVGAEPGFSDVLCYNISSQPKFPEVTIHF 311
+G LP + L + ++ QP +K E F L Y + FP V +HF
Sbjct: 306 SGTTLAYLPDIVYDEL---IQKVLARQPGLKLYLVEQQFRCFL-YTGNVDRGFPVVKLHF 361
Query: 312 RGA-DVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQI-------NFLIGYDIEQAM 363
+ + + + P + D I C ++ A G+ M + N L+ YD+E +
Sbjct: 362 KDSLSLTVYPHDYLFQFKDGIWCIGWQRSVAQTKNGKDMTLLGDLVLSNKLVIYDLENMV 421
Query: 364 VSFKPSRCTN 373
+ + C++
Sbjct: 422 IGWTDYNCSS 431
>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
Length = 353
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 105/359 (29%), Positives = 159/359 (44%), Gaps = 41/359 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +GTP ++ DTGSD +W QC PC + C++Q+ P+F+P SS++ ++C+S
Sbjct: 14 YFARIGVGTPARSVYMVADTGSDVSWLQCSPCRK--CYRQQDPIFNPSLSSSFKPLACAS 71
Query: 95 SQCAVVT-SNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
S C + CS + C Y YG G SF+ G+ +TETL+F + V M GC
Sbjct: 72 SICGKLKIKGCSRKNKCMYQVSYGDG---SFTVGDFSTETLSFGEHAVRSVAM-----GC 123
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFGGIV 209
G N + G+ S SQ GTS A FSYCLP + S + + FG
Sbjct: 124 GRNNQGLFHGAAGLLGLGRG---PLSFPSQTGTSYASVFSYCLPRRESAIAASLVFGPSA 180
Query: 210 AGAGVVSTPLI----IRDHYYLSLEAISVGN-------QRLEFVSSSTGNIFVDTGVLRT 258
T L+ + +YY+ L I V S TG + VD+G +
Sbjct: 181 VPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGGVIVDSGTAIS 240
Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQ--PKFPEVTIHFR-GA 314
L ++ L+ +++ + PG S CY++SS P V + F GA
Sbjct: 241 RLTTPAYTALRDAFRSLVT------FPSAPGISLFDTCYDLSSMKTATLPAVVLDFDGGA 294
Query: 315 DVKLSPSNLFRNISDE-IMCSAFR-GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ L + N+ DE C AF A + G + Q F I D ++ + P +C
Sbjct: 295 SMPLPADGILVNVDDEGTYCLAFAPEEEAFSIIGNVQQQTFRISIDNQKEQMGIAPDQC 353
>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
Length = 458
Score = 121 bits (304), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 107/377 (28%), Positives = 172/377 (45%), Gaps = 44/377 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP-PLFDPKKSSTYNSISCS 93
Y + + +G+PP + DTGSD TW +C C + +C P F + S+T++ C
Sbjct: 83 YFVSIRLGSPPQTLLLVADTGSDLTWVRCSAC-KTNCSIHPPGSTFLARHSTTFSPTHCF 141
Query: 94 SSQCAVV----TSNCS----EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
SS C +V + C+ C Y ++Y G S +SG + ET T N++SG +++
Sbjct: 142 SSLCQLVPQPNPNPCNHTRLHSTCRYEYVYSDG---SKTSGFFSKETTTLNTSSGREMKL 198
Query: 146 PNVIFGCG-HKNLASPTSDS--KQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG--- 199
++ FGCG H + S S +G++GLG G S SQ+G FSYCL D
Sbjct: 199 KSIAFGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQLGRRFGRSFSYCLLDYTLSP 258
Query: 200 --SSKINFGGIVA-----GAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSS---- 244
+S + G +V+ + + TPL+I YY+S++ + V +L S
Sbjct: 259 PPTSYLMIGDVVSTKKDNKSMMSFTPLLINPEAPTFYYISIKGVFVDGVKLHIDPSVWSL 318
Query: 245 ---STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS-- 299
G +D+G T L + + S +K GA LC N++
Sbjct: 319 DELGNGGTVIDSGTTLTFLTEPAYREILSAFKREVKLPSPTPGGASTRSGFDLCVNVTGV 378
Query: 300 SQPKFPEVTIHFRGADV-KLSPSNLFRNISDEIMCSAFRGGNAN----IVYGRIMQINFL 354
S+P+FP +++ G + P N F +IS+ I C A + A V G +MQ FL
Sbjct: 379 SRPRFPRLSLELGGESLYSPPPRNYFIDISEGIKCLAIQPVEAESGRFSVIGNLMQQGFL 438
Query: 355 IGYDIEQAMVSFKPSRC 371
+ +D ++ + F C
Sbjct: 439 LEFDRGKSRLGFSRRGC 455
>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
Length = 469
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 111/362 (30%), Positives = 164/362 (45%), Gaps = 39/362 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ L GTP V +DTGSD +W QC+PC C+ Q+ P+FDP SSTY + C S
Sbjct: 122 YVVTLGFGTPAVPQVLLIDTGSDLSWVQCQPCNSSTCYPQKDPVFDPSASSTYAPVPCGS 181
Query: 95 SQC--------AVVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
C A +N S G C Y YG G + G +TETLT + + V
Sbjct: 182 EACRDLDPDSYANGCTNSSSGASLCQYGIQYGNG---DTTVGVYSTETLTLSPEAATVVN 238
Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK-- 202
N FGCG L G++GLG SL+SQ + G FSYCLP S+
Sbjct: 239 --NFSFGCG---LVQKGVFDLFDGLLGLGGAPESLVSQTTGTYGGAFSYCLPAGNSTAGF 293
Query: 203 INFGGIVAG----AGVVSTPL-IIRDHYYL-SLEAISVGNQRLEFVSSS-TGNIFVDTGV 255
+ G G AG TPL ++ +YL L ISVG ++L+ + G + +D+G
Sbjct: 294 LALGAPATGGNNTAGFQFTPLQVVETTFYLVKLTGISVGGKQLDIEPTVFAGGMIIDSGT 353
Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGA- 314
+ T LP +S L++ + + A P+ + + ++ P V + F G
Sbjct: 354 IVTGLPETAYSALRTAFRSAMSAYPLLPPNDDEDLDTCYDFTGNTNVTVPTVALTFEGGV 413
Query: 315 --DVKLSPSNLFRNISDEIMCSAFRGGNAN---IVYGRIMQINFLIGYDIEQAMVSFKPS 369
D+ + PS + + C AF G ++ + G + Q F + YD + V F+
Sbjct: 414 TIDLDV-PSGVLLD-----GCLAFVAGASDGDTGIIGNVNQRTFEVLYDSARGHVGFRAG 467
Query: 370 RC 371
C
Sbjct: 468 AC 469
>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
Length = 520
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 108/380 (28%), Positives = 177/380 (46%), Gaps = 56/380 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y M + +G+PP +DTGSD W QC PC DCF+Q +DPK S++Y +I+C+
Sbjct: 155 YFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCH--DCFQQNGAFYDPKASASYKNITCND 212
Query: 95 SQCAVVT-----SNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFN-STSGLPVEMP 146
+C +V+ C + C Y + YG +S ++G+ A ET T N +TSG E+
Sbjct: 213 PRCNLVSPPDPPKPCKSDNQSCPYYYWYGD---SSNTTGDFAVETFTVNLTTSGGSSELY 269
Query: 147 NV---IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG---- 199
NV +FGCGH N + G+ S SQ+ + FSYCL D+
Sbjct: 270 NVENMMFGCGHWNRGLFHGAAGLLGLGRG---PLSFSSQLQSLYGHSFSYCLVDRNSDTN 326
Query: 200 -SSKINFG---GIVAGAGVVSTPLIIRDH------YYLSLEAISVGNQRL-------EFV 242
SSK+ FG +++ + T + R YY+ +++I V + L
Sbjct: 327 VSSKLIFGEDKDLLSHPNLNFTSFVARKENLVDTFYYVQIKSIIVAGEVLNIPEETWNIS 386
Query: 243 SSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSD--VL--CYNI 298
S G +D+G + + +K+ ++ K G P + D +L C+N+
Sbjct: 387 SDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAE-------KAKGKYPVYRDFPILDPCFNV 439
Query: 299 S--SQPKFPEVTIHF-RGADVKLSPSNLFRNISDEIMCSAFRG--GNANIVYGRIMQINF 353
S + PE+ I F GA N F ++++++C A G +A + G Q NF
Sbjct: 440 SGIDSIQLPELGIAFADGAVWNFPTENSFIWLNEDLVCLAILGTPKSAFSIIGNYQQQNF 499
Query: 354 LIGYDIEQAMVSFKPSRCTN 373
I YD +++ + + P++C +
Sbjct: 500 HILYDTKRSRLGYAPTKCAD 519
>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
Length = 480
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 116/405 (28%), Positives = 186/405 (45%), Gaps = 59/405 (14%)
Query: 2 QNSQKLPFYNDNETPKSPISIIYQAEIISVDDI-YLMHLSIGTPPVDIFGSVDTGSDCTW 60
QN + N + +S I A I+++ + Y++ + +G + + +DTGSD TW
Sbjct: 99 QNRIRAKVSGHNSSEQSSEIQIPLASGINLETLNYIVTIGLGNQNMTVI--IDTGSDLTW 156
Query: 61 TQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV------TSNCSEGD---CSY 111
QC+PC + C+ Q+ P+F+P SS+YNS+ C+SS C + T C + C++
Sbjct: 157 VQCDPC--MSCYSQQGPVFNPSNSSSYNSLLCNSSTCQNLQFTTGNTEACESNNPSSCNH 214
Query: 112 SFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIG 171
+ YG G SF+ G L E L+F S + N +FGCG N S GI+G
Sbjct: 215 TVSYGDG---SFTDGELGVEHLSFGGIS-----VSNFVFGCGRNNKGLFGGVS---GIMG 263
Query: 172 LGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKINFGGIVAG--------------AGVV 215
LG N S+ISQ T+ G FSYCLP D G+S G +V G +V
Sbjct: 264 LGRSNLSMISQTNTTFGGVFSYCLPTTDSGAS----GSLVIGNESSLFKNLTPIAYTSMV 319
Query: 216 STPLIIRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNM 275
S P + + Y L+L I VG ++ S G I +D+G + T L ++ LK+
Sbjct: 320 SNPQ-LSNFYVLNLTGIDVGGVAIQDTSFGNGGILIDSGTVITRLAPSLYNALKAEFLKQ 378
Query: 276 IKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR-GADVKLSPSNLFRNISDE- 330
P+ P S + C+N++ + P +++HF D+ + + D
Sbjct: 379 FSGYPIA-----PALSILDTCFNLTGIEEVSIPTLSMHFENNVDLNVDAVGILYMPKDGS 433
Query: 331 ---IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
+ ++ N + G Q N + YD +Q+ + F C+
Sbjct: 434 QVCLALASLSDENDMAIIGNYQQRNQRVIYDAKQSKIGFAREDCS 478
>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
Length = 423
Score = 120 bits (302), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 109/359 (30%), Positives = 160/359 (44%), Gaps = 39/359 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + L +GTPP + DTGSD W QC PC C+ Q PLF+P SST+ SI+C S
Sbjct: 81 YFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQS--CYGQTDPLFNPSFSSTFQSITCGS 138
Query: 95 SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
S C ++ C C Y YG G SF+ G +TETL+F S + + +V GCG
Sbjct: 139 SLCQQLLIRGCRRNQCLYQVSYGDG---SFTVGEFSTETLSFGSNA-----VNSVAIGCG 190
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ---GSSKINFGGIVA 210
H N T + G+ S SQ+G FSYCLP + GS + FG
Sbjct: 191 HNNQGLFTGAAGLLGLGKG---LLSFPSQVGQLYGSVFSYCLPTRESTGSVPLIFGNQAV 247
Query: 211 GAGVVSTPLI----IRDHYYLSLEAISVGNQRLEF------VSSSTGN--IFVDTGVLRT 258
+ T L+ + YY+ + I VG + + SSTGN + +D+G T
Sbjct: 248 ASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVNIPAGSLSLDSSTGNGGVILDSGTAVT 307
Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQPK--FPEVTIHFR-GA 314
L ++ ++ + +A GFS CY++S + P V+ F GA
Sbjct: 308 RLVTSAYNPMR----DAFRAGMPSDAKMTSGFSLFDTCYDLSGRSSIMLPAVSFVFNGGA 363
Query: 315 DVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ L N+ + + C AF + N + G I Q +F + +D V ++C
Sbjct: 364 TMALPAQNIMVPVDNSGTYCLAFAPNSENFSIIGNIQQQSFRMSFDSTGNRVGIGANQC 422
>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
Length = 449
Score = 120 bits (302), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 111/385 (28%), Positives = 175/385 (45%), Gaps = 60/385 (15%)
Query: 32 DDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPE-----LDCFKQEPPLFDPKKSST 86
D + + + IGTPP VDTGSD WTQC +Q PL++P++SS+
Sbjct: 81 DQGHSLTVGIGTPPQPRTLIVDTGSDLIWTQCSMLSRRTRTAASASRQREPLYEPRRSSS 140
Query: 87 YNSISCSSSQC---AVVTSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTF--NSTSG 140
+ + CS C NC+ + C Y LYG ++ + G LA+ET TF N+
Sbjct: 141 FAYLPCSDRLCQEGQFSYKNCARNNRCMYDELYG----SAEAGGVLASETFTFGVNAKVS 196
Query: 141 LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL---PD 197
LP+ FGCG + S G++GL PG SL+SQ+ +FSYCL +
Sbjct: 197 LPLG-----FGCGALSAGDLVGAS---GLMGLSPGIMSLVSQLSVP---RFSYCLTPFAE 245
Query: 198 QGSSKINFGGIV-----AGAGVVSTPLIIRD------HYYLSLEAISVGNQRLEFVSSST 246
+ +S + FG + G V T I+R+ +YY+ L +S+G +RL+ ++S
Sbjct: 246 RKTSPLLFGAMADLRRYRTTGTVQTTSILRNPAMETAYYYVPLVGLSLGTKRLDVPATSL 305
Query: 247 GNI--------FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYN 297
G I VD+G + L +K + ++ PV G + + D LC+
Sbjct: 306 GMIKPDGSGGTIVDSGSTMSYLEETAFRAVKKAVVEAVRL-PVAN-GTDEDYDDYELCFA 363
Query: 298 ISS-----QPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFR---GGNANIVYGRI 348
+ + K P + +HF GA + L N F+ +MC A G + G +
Sbjct: 364 LPTGVAMEAVKTPPLVLHFDGGAAMTLPRDNYFQEPRAGLMCLAVGTSPDGFGVSIIGNV 423
Query: 349 MQINFLIGYDIEQAMVSFKPSRCTN 373
Q N + +D+ SF P++C +
Sbjct: 424 QQQNMHVLFDVRNQKFSFAPTKCDD 448
>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
Length = 423
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 109/359 (30%), Positives = 160/359 (44%), Gaps = 39/359 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + L +GTPP + DTGSD W QC PC C+ Q PLF+P SST+ SI+C S
Sbjct: 81 YFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQS--CYGQTDPLFNPSFSSTFQSITCGS 138
Query: 95 SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
S C ++ C C Y YG G SF+ G +TETL+F S + + +V GCG
Sbjct: 139 SLCQQLLIRGCRRNQCLYQVSYGDG---SFTVGEFSTETLSFGSNA-----VNSVAIGCG 190
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ---GSSKINFGGIVA 210
H N T + G+ S SQ+G FSYCLP + GS + FG
Sbjct: 191 HNNQGLFTGAAGLLGLGKG---LLSFPSQVGQLYGSVFSYCLPTRESTGSVPLIFGNQAV 247
Query: 211 GAGVVSTPLI----IRDHYYLSLEAISVGNQRLEF------VSSSTGN--IFVDTGVLRT 258
+ T L+ + YY+ + I VG + + SSTGN + +D+G T
Sbjct: 248 ASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVSIPAGSLSLDSSTGNGGVILDSGTAVT 307
Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQPK--FPEVTIHFR-GA 314
L ++ ++ + +A GFS CY++S + P V+ F GA
Sbjct: 308 RLVTSAYNPMR----DAFRAGMPSDAKMTSGFSLFDTCYDLSGRSSIMLPAVSFVFNGGA 363
Query: 315 DVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ L N+ + + C AF + N + G I Q +F + +D V ++C
Sbjct: 364 TMALPAQNIMVPVDNSGTYCLAFAPNSENFSIIGNIQQQSFRMSFDSTGNRVGIGANQC 422
>gi|413925432|gb|AFW65364.1| hypothetical protein ZEAMMB73_378208 [Zea mays]
Length = 418
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 105/360 (29%), Positives = 169/360 (46%), Gaps = 48/360 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y M S+GTPP + DTGSD W +C C C + + P KSS+++ + CSS
Sbjct: 81 YDMTFSMGTPPQTLSALADTGSDLIWAKCGACKR--CAPRGSASYYPTKSSSFSKLPCSS 138
Query: 95 SQCAVVTS---------NCSEGDCSYSFLYGRGAY-ASFSSGNLATETLTFNSTSGLPVE 144
+ C + S CSY + YG + ++ G + +ET T S +
Sbjct: 139 ALCRTLESQSLATCGGTRARGAVCSYRYSYGLSSNPHHYTQGYMGSETFTLGSDA----- 193
Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--GSSK 202
+ + FGC S +G++GLG G SL+ Q+ G FSYCL SS
Sbjct: 194 VQGIGFGC---TTMSEGGYGSGSGLVGLGRGKLSLVRQLK---VGAFSYCLTSDPSTSSP 247
Query: 203 INFG-GIVAGAGVVSTPLI---IRDHYYLSLEAISVGNQRLEFVSSSTGN--IFVDTGVL 256
+ FG G + G GV STPL+ Y ++L++IS+G + + TG I D+G
Sbjct: 248 LLFGAGALTGPGVQSTPLVNLKTSTFYTVNLDSISIGAAK----TPGTGRHGIIFDSGTT 303
Query: 257 RTLLPLEYHS----NLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFR 312
T L ++ L S +N+ + G +C+ S FP + +HF
Sbjct: 304 LTFLAEPAYTLAEAGLLSQTTNLTRVPGTDGY--------EVCFQTSGGAVFPSMVLHFD 355
Query: 313 GADVKLSPSNLFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
G D+ L N F ++D + C + + + + G IMQ+++ I YD++++++SF+P+ C
Sbjct: 356 GGDMALKTENYFGAVNDSVSCWLVQKSPSEMSIVGNIMQMDYHIRYDLDKSVLSFQPTNC 415
>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
Length = 459
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 111/371 (29%), Positives = 174/371 (46%), Gaps = 59/371 (15%)
Query: 37 MHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQ 96
+ + +GTPP +D GSD WTQC KQ P+FD +SS+++ + C S
Sbjct: 109 LTVGVGTPPQPSKVILDLGSDLLWTQCSLVGP--TAKQLEPVFDAARSSSFSVLPCDSKL 166
Query: 97 CAVVT---SNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
C T C++ C+Y YG ++G LATET TF + G+ N+ FGCG
Sbjct: 167 CEAGTFTNKTCTDRKCAYENDYG----IMTATGVLATETFTFGAHHGVSA---NLTFGCG 219
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL---PDQGSSKINFG---- 206
LA+ T ++ +GI+GL PG S++ Q+ + KFSYCL D+ +S + FG
Sbjct: 220 --KLANGTI-AEASGILGLSPGPLSMLKQLAIT---KFSYCLTPFADRKTSPVMFGAMAD 273
Query: 207 -GIVAGAGVVSTPLIIRD-----HYYLSLEAISVGNQRLEFVSSS-------TGNIFVDT 253
G G V T ++++ +YY+ + +SVG++RL+ + TG +D+
Sbjct: 274 LGKYKTTGKVQTIPLLKNPVEDIYYYVPMVGMSVGSKRLDVPQETLAIKPDGTGGTVLDS 333
Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPV--KGVGAEPGFSDVLCYNIS-----SQPKFPE 306
L + LK + IK PV + V P +C+ + + P
Sbjct: 334 ATTLAYLVEPAFTELKKAVMEGIKL-PVANRSVDDYP-----VCFELPRGMSMEGVQVPP 387
Query: 307 VTIHFRG-ADVKLSPSNLFRNISDEIMCSA-----FRGGNANIVYGRIMQINFLIGYDIE 360
+ +HF G A++ L N F+ S +MC A F G A V G + Q N + YD+
Sbjct: 388 LVLHFDGDAEMSLPRDNYFQEPSPGMMCLAVMQAPFEG--APNVIGNVQQQNMHVLYDVG 445
Query: 361 QAMVSFKPSRC 371
S+ P++C
Sbjct: 446 NRKFSYAPTKC 456
>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
Length = 496
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 116/389 (29%), Positives = 174/389 (44%), Gaps = 67/389 (17%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCS 93
++ M L IG+ ++ +DTGS+ QC + P+FDP S +Y + C
Sbjct: 99 LFSMQLGIGSLQKNLSAIIDTGSEAVLVQCG--------SRSRPVFDPAASQSYRQVPCI 150
Query: 94 SSQCAVVTSNCSEGD----------CSYSFLYGRGAYASFSSGNLATETLTFNST--SGL 141
S C V S G C+YS YG + S+G+ + + + NST SG
Sbjct: 151 SQLCLAVQQQTSNGSSQPCVNSSATCTYSLSYGD---SRNSTGDFSQDVIFLNSTNSSGQ 207
Query: 142 PVEMPNVIFGCGHKNLASPTS---DSKQTGIIGLGPGNSSLISQMGTSIAG-KFSYCLPD 197
V+ +V FGC H SP D GI+G GN SL SQ+ + G KFSYC P
Sbjct: 208 AVQFRDVAFGCAH----SPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPS 263
Query: 198 QGSSKINFGGIVAGAGVVS------TPLI------IRDH-YYLSLEAISVGNQRLEFVSS 244
Q G I G +S TPL+ R YY+ L +ISV + L S
Sbjct: 264 QPWQPRATGVIFLGDSGLSKSKVGYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPES 323
Query: 245 S--------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY 296
+ G +D+G T + + ++ ++ + ++ K VGA GF D CY
Sbjct: 324 AFKLDPSTGDGGTVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKVGAAAGFDD--CY 381
Query: 297 NI---SSQPKFPEVTIHFR-GADVKLSPSNLFRNIS---DEI-----MCSAFRGGNANI- 343
NI SS P PEV + + ++L +LF +S +E+ + S+ + G I
Sbjct: 382 NISAGSSLPGVPEVRLSLQNNVRLELRFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKIN 441
Query: 344 VYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
V G Q N+L+ YD E++ V F+ + C+
Sbjct: 442 VLGNYQQSNYLVEYDNERSRVGFERADCS 470
>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 488
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 102/366 (27%), Positives = 162/366 (44%), Gaps = 36/366 (9%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCE---PCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +GTP D VDTGSD W C CP + P +D SST S+
Sbjct: 84 LYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTP-YDADASSTAKSV 142
Query: 91 SCSSSQCAVVT--SNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSG---LPVE 144
SCS + C+ V S C G C Y LYG G S ++G L + + + +G
Sbjct: 143 SCSDNFCSYVNQRSECHSGSTCQYVILYGDG---SSTNGYLVRDVVHLDLVTGNRQTGST 199
Query: 145 MPNVIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGT--SIAGKFSYCLPDQGSS 201
+IFGCG K S + GI+G G NSS ISQ+ + + F++CL +
Sbjct: 200 NGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGG 259
Query: 202 KINFGGIVAGAGVVSTPLIIRD-HYYLSLEAISVGNQRLE-----FVSSSTGNIFVDTGV 255
I G V V +TP++ + HY ++L AI VGN L+ F S + +D+G
Sbjct: 260 GIFAIGEVVSPKVKTTPMLSKSAHYSVNLNAIEVGNSVLQLSSDAFDSGDDKGVIIDSGT 319
Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYN-ISSQPKFPEVTIHF-RG 313
LP ++ L + + + Q + + F+ C++ I +FP VT F +
Sbjct: 320 TLVYLPDAVYNPLMNQI--LASHQELNLHTVQDSFT---CFHYIDRLDRFPTVTFQFDKS 374
Query: 314 ADVKLSPSNLFRNISDEIMCSAF-------RGGNANIVYGRIMQINFLIGYDIEQAMVSF 366
+ + P + ++ C + +GG + + G + N L+ YDIE ++ +
Sbjct: 375 VSLAVYPQEYLFQVREDTWCFGWQNGGLQTKGGASLTILGDMALSNKLVVYDIENQVIGW 434
Query: 367 KPSRCT 372
C+
Sbjct: 435 TNHNCS 440
>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 475
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 109/370 (29%), Positives = 164/370 (44%), Gaps = 37/370 (10%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTW---TQCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y L +G+PP D + VDTGSD W +C CP + L+DPK S T I
Sbjct: 69 LYFTKLGLGSPPKDYYVQVDTGSDILWVNCVKCSRCPRKSDLGIDLTLYDPKGSETSELI 128
Query: 91 SCSSSQCAVVTS----NC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
SC C+ C SE C YS YG G S ++G + LT+N +
Sbjct: 129 SCDQEFCSATYDGPIPGCKSEIPCPYSITYGDG---SATTGYYVQDYLTYNHVNDNLRTA 185
Query: 146 P---NVIFGCGHKNLASPTSDSKQT--GIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQ 198
P ++IFGCG + +S S++ GIIG G NSS++SQ+ S + FS+CL +
Sbjct: 186 PQNSSIIFGCGAVQSGTLSSSSEEALDGIIGFGQSNSSVLSQLAASGKVKKIFSHCLDNI 245
Query: 199 GSSKINFGGIVAGAGVVSTPLIIR-DHYYLSLEAISVGNQRLE-----FVSSSTGNIFVD 252
I G V V +TPL+ R HY + L++I V L+ F S + +D
Sbjct: 246 RGGGIFAIGEVVEPKVSTTPLVPRMAHYNVVLKSIEVDTDILQLPSDIFDSGNGKGTIID 305
Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQP-VKGVGAEPGFSDVLCYNISSQPKFPEVTIHF 311
+G LP + L + ++ QP +K E FS Y + FP V +HF
Sbjct: 306 SGTTLAYLPAIVYDEL---IPKVMARQPRLKLYLVEQQFS-CFQYTGNVDRGFPVVKLHF 361
Query: 312 RGA-DVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQI-------NFLIGYDIEQAM 363
+ + + P + D I C ++ A G+ M + N L+ YD+E
Sbjct: 362 EDSLSLTVYPHDYLFQFKDGIWCIGWQKSVAQTKNGKDMTLLGDLVLSNKLVIYDLENMA 421
Query: 364 VSFKPSRCTN 373
+ + C++
Sbjct: 422 IGWTDYNCSS 431
>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
gi|224030089|gb|ACN34120.1| unknown [Zea mays]
gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
Length = 491
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 110/377 (29%), Positives = 168/377 (44%), Gaps = 53/377 (14%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +GTPP + VDTGSD W CE CP + L+DPK SST + +
Sbjct: 85 LYYTEIKLGTPPKHYYVQVDTGSDILWVNCITCEQCPHKSGLGLDLTLYDPKASSTGSMV 144
Query: 91 SCSSSQCAVVTS----NCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
C + CA C C YS YG G S + G+ T+ L F+ +
Sbjct: 145 MCDQAFCAATFGGKLPKCGANVPCEYSVTYGDG---SSTIGSFVTDALQFDQVTRDGQTQ 201
Query: 146 P---NVIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGTSIAGK----FSYCLPD 197
P +VIFGCG + +S+ GI+G G N+S++SQ+ T AGK F++CL
Sbjct: 202 PANASVIFGCGAQQGGDLGSSNQALDGILGFGEANTSMLSQLTT--AGKVKKIFAHCLDT 259
Query: 198 QGSSKINFGGIVAGAGVVSTPLII-RDHYYLSLEAISVGNQRLE-----FVSSSTGNIFV 251
I G V V +TPL+ + HY ++L+ I VG L+ F +
Sbjct: 260 IKGGGIFSIGDVVQPKVKTTPLVADKPHYNVNLKTIDVGGTTLQLPAHIFEPGEKKGTII 319
Query: 252 DTGVLRTLLP-LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV---LCYNI--SSQPKFP 305
D+G T LP L + + +V + K Q + F DV LC+ S FP
Sbjct: 320 DSGTTLTYLPELVFKEVMLAVFN---KHQDIT-------FHDVQGFLCFQYPGSVDDGFP 369
Query: 306 EVTIHFRGADVKLS--PSNLFRNISDEIMCSAFRGGNAN-------IVYGRIMQINFLIG 356
+T HF D+ L P F +++ C F+ G + ++ G ++ N L+
Sbjct: 370 TITFHFE-DDLALHVYPHEYFFANGNDVYCVGFQNGASQSKDGKDIVLMGDLVLSNKLVI 428
Query: 357 YDIEQAMVSFKPSRCTN 373
YD+E ++ + C++
Sbjct: 429 YDLENRVIGWTDYNCSS 445
>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
Length = 462
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 106/377 (28%), Positives = 175/377 (46%), Gaps = 55/377 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +G+P + VDTGS+ TW QC PC C ++D +S++Y ++C++
Sbjct: 100 YYTSIKLGSPGQEAILIVDTGSELTWLQCLPCKV--CAPSVDTIYDAARSASYRPVTCNN 157
Query: 95 SQCAVVTSN-----CSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNST-SGLPVEMPN 147
SQ +S C+ G C ++ YG G SFS G+L+T+TL + G PV + +
Sbjct: 158 SQLCSNSSQGTYAYCARGSQCQFAAFYGDG---SFSYGSLSTDTLIMETVVGGKPVTVQD 214
Query: 148 VIFGCGHKNLA-SPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFG 206
FGC +L PT S GI+GL G +L Q+G KFS+C PD+ SS +N
Sbjct: 215 FAFGCAQGDLELVPTGAS---GILGLNAGKMALPMQLGQRFGWKFSHCFPDR-SSHLNST 270
Query: 207 GIVAGAG-------------VVSTPLIIRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDT 253
G+V ++ + R Y+++L+ +S+ + L F+ + + +D+
Sbjct: 271 GVVFFGNAELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVFLPRGS-VVILDS 329
Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQP--VKGVGAEPGFSDV-LCYNISS------QPKF 304
G + +HS L+ +K +P +K + + F D+ C+ +S+
Sbjct: 330 GSSFSSFVRPFHSQLREA---FLKHRPPSLKHLEGD-SFGDLGTCFKVSNDDIDELHRTL 385
Query: 305 PEVTIHFR-GADVK------LSPSNLFRNISDEIMCSAFRGGNANI--VYGRIMQINFLI 355
P +++ F G + L P F+N MC AF G N V G Q N +
Sbjct: 386 PSLSLVFEDGVTIGIPSIGVLLPVARFQNHVK--MCFAFEDGGPNPVNVIGNYQQQNLWV 443
Query: 356 GYDIEQAMVSFKPSRCT 372
YDI+++ V F + C
Sbjct: 444 EYDIQRSRVGFARASCV 460
>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 488
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 101/367 (27%), Positives = 165/367 (44%), Gaps = 38/367 (10%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCE---PCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +GTP D VDTGSD W C CP + P +D SST S+
Sbjct: 84 LYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTP-YDVDASSTAKSV 142
Query: 91 SCSSSQCAVVT--SNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSG---LPVE 144
SCS + C+ V S C G C Y +YG G S ++G L + + + +G
Sbjct: 143 SCSDNFCSYVNQRSECHSGSTCQYVIMYGDG---SSTNGYLVKDVVHLDLVTGNRQTGST 199
Query: 145 MPNVIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGT--SIAGKFSYCLPDQGSS 201
+IFGCG K S + GI+G G NSS ISQ+ + + F++CL +
Sbjct: 200 NGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGG 259
Query: 202 KINFGGIVAGAGVVSTPLIIRD-HYYLSLEAISVGNQRLE-----FVSSSTGNIFVDTGV 255
I G V V +TP++ + HY ++L AI VGN LE F S + +D+G
Sbjct: 260 GIFAIGEVVSPKVKTTPMLSKSAHYSVNLNAIEVGNSVLELSSNAFDSGDDKGVIIDSGT 319
Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVG-AEPGFSDVLCYNISSQ-PKFPEVTIHF-R 312
LP ++ L ++ ++ + P + + F+ C++ + + +FP VT F +
Sbjct: 320 TLVYLPDAVYNPL---LNEILASHPELTLHTVQESFT---CFHYTDKLDRFPTVTFQFDK 373
Query: 313 GADVKLSPSNLFRNISDEIMCSAF-------RGGNANIVYGRIMQINFLIGYDIEQAMVS 365
+ + P + ++ C + +GG + + G + N L+ YDIE ++
Sbjct: 374 SVSLAVYPREYLFQVREDTWCFGWQNGGLQTKGGASLTILGDMALSNKLVVYDIENQVIG 433
Query: 366 FKPSRCT 372
+ C+
Sbjct: 434 WTNHNCS 440
>gi|357118871|ref|XP_003561171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 506
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 111/344 (32%), Positives = 156/344 (45%), Gaps = 39/344 (11%)
Query: 52 VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV-------TSNC 104
VDT SD W QC PCP+ C+ Q L+DP KS CSS QC + T
Sbjct: 178 VDTASDVPWVQCAPCPQPQCYAQSDVLYDPTKSILSAPFPCSSPQCRSLGRYANGCTGAG 237
Query: 105 SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDS 164
+ G C Y LY G S +SG ++ LT N+ V FGC H L + ++
Sbjct: 238 NTGTCQYRVLYPDG---SGTSGTYVSDLLTLNADPKGAVS--KFQFGCSHALLRPGSFNN 292
Query: 165 KQTGIIGLGPGNSSLISQM-GTSIAGK-FSYCLPDQGSSK--INFG-GIVAGAGVVSTPL 219
K G + LG G SL SQ GT G FSYCLP GS K ++ G A + TP+
Sbjct: 293 KTAGFMALGRGAQSLSSQTKGTFSKGNVFSYCLPPTGSHKGFLSLGVPQHAASRYAVTPM 352
Query: 220 IIRDH----YYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLLPLEYHSNLKSVMSN 274
+ Y + L I V QRL + N +D+ + T LP + L++
Sbjct: 353 LKSKMAPMIYMVRLIGIDVAGQRLPVPPAVFAANAAMDSRTIITRLPPTAYMALRAAFRA 412
Query: 275 MIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHF-RGADVKLSPSNLFRNISDEI 331
++A + V A G D CY+ + P + P+VT+ F R A V+L PS + +
Sbjct: 413 QMRA--YRAV-APKGQLDT-CYDFTGVPMVRLPKVTLVFDRNAAVELDPSGVMLD----- 463
Query: 332 MCSAFRGGNANI----VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
C AF NAN + G + Q + Y+++ A V F+ + C
Sbjct: 464 SCLAF-APNANDFMPGIIGNVQQQTLEVLYNVDGASVGFRRAAC 506
>gi|242086034|ref|XP_002443442.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
gi|241944135|gb|EES17280.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
Length = 443
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 101/370 (27%), Positives = 152/370 (41%), Gaps = 44/370 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+ +G PP +DTGS WTQC C C +Q+ P F+ S ++ + C
Sbjct: 86 YIAEYMVGDPPQRAEALIDTGSSLIWTQCTACLRKVCVRQDLPYFNASSSGSFAPVPCQD 145
Query: 95 SQCA--VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
CA + +G C++ YG G F L T+ TF S + FGC
Sbjct: 146 KACAGNYLHFCALDGTCTFRVTYGAGGIIGF----LGTDAFTFQSGGA------TLAFGC 195
Query: 153 -GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP----DQGSSKINFGG 207
A+P +G+IGLG G SL SQ G A +FSYCL + G+S F G
Sbjct: 196 VSFTRFAAPDVLHGASGLIGLGRGRLSLASQTG---AKRFSYCLTPYFHNNGASSHLFVG 252
Query: 208 IVA----GAGVVSTPLII---RDH-----YYLSLEAISVGNQRLEFVSSS---------- 245
A G G V + + +D+ YYL L I+VG +L S++
Sbjct: 253 AAASLSGGGGAVMSMAFVESPKDYPYSTFYYLPLVGITVGETKLAIPSTAFDLQEVEEGF 312
Query: 246 -TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKF 304
G + +D+G T L + + L ++ + V G + G +
Sbjct: 313 WEGGVIIDSGSPFTSLVEDAYEPLMGELARQLNGSLVPPPGEDDGGMALCVARGDLDRVV 372
Query: 305 PEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAM 363
P + +HF GAD+ L P N + + C A G + G Q N I +D+
Sbjct: 373 PTLVLHFSGGADMALPPENYWAPLEKSTACMAIVRGYLQSIIGNFQQQNMHILFDVGGGR 432
Query: 364 VSFKPSRCTN 373
+SF+ + C+
Sbjct: 433 LSFQNADCST 442
>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
Length = 529
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 109/383 (28%), Positives = 177/383 (46%), Gaps = 60/383 (15%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y M + +GTPP +DTGSD W QC PC DCF Q +DPK S+++ +I+C+
Sbjct: 162 YFMDVLVGTPPKHFSLILDTGSDLNWLQCLPC--YDCFHQNEAFYDPKTSASFKNITCND 219
Query: 95 SQCAVVTS-----NCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFN--STSGLPVE- 144
+C++++S C + C Y + YG S ++G+ A ET T N +T G E
Sbjct: 220 PRCSLISSPEPPVQCKSDNQSCPYFYWYGD---RSNTTGDFAVETFTVNLTTTEGRSSEY 276
Query: 145 -MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG---- 199
+ N++FGCGH N + S G+ S SQ+ + FSYCL D+
Sbjct: 277 KVENMMFGCGHWNRGLFSGASGLLGLGRGPLSFS---SQLQSLYGHSFSYCLVDRNSDTN 333
Query: 200 -SSKINFG---GIVAGAGVVSTPLI------IRDHYYLSLEAISVGNQRLEFVSSSTGNI 249
SSK+ FG ++ + T + + YY+ +++I VG + L+ + T NI
Sbjct: 334 VSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGEALD-IPEETWNI 392
Query: 250 --------FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSD--VL--CYN 297
+D+G + + +K+ + +K + F D VL C+N
Sbjct: 393 SPDGAGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYLV-------FRDFPVLDPCFN 445
Query: 298 IS----SQPKFPEVTIHF-RGADVKLSPSNLFRNISDEIMCSAFRGGNANI--VYGRIMQ 350
+S + PE+ I F GA N F +S++++C A G + + G Q
Sbjct: 446 VSGIEENNIHLPELGIAFADGAVWNFPAENSFIWLSEDLVCLAILGTPKSTFSIIGNYQQ 505
Query: 351 INFLIGYDIEQAMVSFKPSRCTN 373
NF I YD + + + F P++C +
Sbjct: 506 QNFHILYDTKMSRLGFTPTKCAD 528
>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
Length = 481
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 108/371 (29%), Positives = 160/371 (43%), Gaps = 51/371 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +GTP +DTGSD W QC PC C+ Q +FDP++S +Y ++ C +
Sbjct: 128 YFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRH--CYAQSGRVFDPRRSRSYAAVDCVA 185
Query: 95 SQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
C + S + C Y YG G S ++G+ A+ETLTF + + V G
Sbjct: 186 PICRRLDSAGCDRRRNSCLYQVAYGDG---SVTAGDFASETLTFARGA----RVQRVAIG 238
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ---------GSSK 202
CGH N + S G+ S SQ+ S FSYCL D+ SS
Sbjct: 239 CGHDNEGLFIAASGLLGLGRG---RLSFPSQIARSFGRSFSYCLVDRTSSVRPSSTRSST 295
Query: 203 INFGGIVAGAGVVS--TPL----IIRDHYYLSLEAISVGNQRLEFVSSS---------TG 247
+ FG A + TP+ + YY+ L SVG R++ VS S G
Sbjct: 296 VTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRG 355
Query: 248 NIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQP--K 303
+ +D+G T L + ++ + +A V G+ PG + CYN+S + K
Sbjct: 356 GVILDSGTSVTRLARPVYEAVR----DAFRAAAV-GLRVSPGGFSLFDTCYNLSGRRVVK 410
Query: 304 FPEVTIHFR-GADVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIE 360
P V++H GA V L P N + + C A G + + + G I Q F + +D +
Sbjct: 411 VPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGD 470
Query: 361 QAMVSFKPSRC 371
V F P C
Sbjct: 471 AQRVGFVPKSC 481
>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
Length = 475
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 108/371 (29%), Positives = 160/371 (43%), Gaps = 51/371 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +GTP +DTGSD W QC PC C+ Q +FDP++S +Y ++ C +
Sbjct: 122 YFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRH--CYAQSGRVFDPRRSRSYAAVDCVA 179
Query: 95 SQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
C + S + C Y YG G S ++G+ A+ETLTF + + V G
Sbjct: 180 PICRRLDSAGCDRRRNSCLYQVAYGDG---SVTAGDFASETLTFARGA----RVQRVAIG 232
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ---------GSSK 202
CGH N + S G+ S SQ+ S FSYCL D+ SS
Sbjct: 233 CGHDNEGLFIAASGLLGLGRG---RLSFPSQIARSFGRSFSYCLVDRTSSVRPSSTRSST 289
Query: 203 INFGGIVAGAGVVS--TPL----IIRDHYYLSLEAISVGNQRLEFVSSS---------TG 247
+ FG A + TP+ + YY+ L SVG R++ VS S G
Sbjct: 290 VTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRG 349
Query: 248 NIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQP--K 303
+ +D+G T L + ++ + +A V G+ PG + CYN+S + K
Sbjct: 350 GVILDSGTSVTRLARPVYEAVR----DAFRAAAV-GLRVSPGGFSLFDTCYNLSGRRVVK 404
Query: 304 FPEVTIHFR-GADVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIE 360
P V++H GA V L P N + + C A G + + + G I Q F + +D +
Sbjct: 405 VPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGD 464
Query: 361 QAMVSFKPSRC 371
V F P C
Sbjct: 465 AQRVGFVPKSC 475
>gi|359476197|ref|XP_003631803.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 414
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 107/356 (30%), Positives = 160/356 (44%), Gaps = 54/356 (15%)
Query: 45 PVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNC 104
P +I ++ S TWTQC+PC + C K FDP S TY+ SC S
Sbjct: 86 PQEILAEMNPDS-ITWTQCKPC--VRCLKDSHRHFDPSASLTYSLGSCIPSTVGN----- 137
Query: 105 SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDS 164
+Y+ YG S S GN +T+T + P FGCG N S +
Sbjct: 138 -----TYNMTYGD---KSTSVGNYGCDTMTLEPSD----VFPKFQFGCGRNNEGDFGSGA 185
Query: 165 KQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS-------------SKINFGGIVAG 211
G++GLG G S +SQ + FSYCLP++ S S + F +V G
Sbjct: 186 D--GMLGLGQGQLSTVSQTASKFKKVFSYCLPEEDSIGSLLFGEKATSQSSLKFTSLVNG 243
Query: 212 AGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSS---STGNIFVDTGVLRTLLPLEYHSNL 268
G ++ L +Y++ L ISVGN+RL SS S G I +D+G + T LP +S L
Sbjct: 244 PG--TSGLEESGYYFVKLLDISVGNKRLNVPSSVFASPGTI-IDSGTVITCLPQRAYSAL 300
Query: 269 KSVMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQPK--FPEVTIHF-RGADVKLSPSNL 323
+ + P+ + G D+L CYN+S + PE+ +HF GADV+L+ +
Sbjct: 301 TAAFKKAMAKYPLSNGRRKKG--DILDTCYNLSGRKDVLLPEIVLHFGEGADVRLNGKRV 358
Query: 324 FRNISDEIMCSAFRGGNAN------IVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
+C AF G + + + G Q++ + YDI+ + F + C+
Sbjct: 359 IWGNDASRLCLAFAGNSKSTMNSELTIIGNRQQVSLTVLYDIQGGRIGFGGNGCSK 414
>gi|77553049|gb|ABA95845.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 372
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 106/368 (28%), Positives = 161/368 (43%), Gaps = 51/368 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPP---LFDPKKSSTYNSIS 91
Y M +S+GTPPV ++DTGS +W QC+ C ++ C+ Q +F+P SSTY+ +
Sbjct: 25 YFMGISLGTPPVFNLVTIDTGSTLSWVQCKNC-QIKCYDQAAKAGQIFNPYNSSTYSKVG 83
Query: 92 CSSSQC------AVVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
CS+ C V C E D C YS YG G Y S G L + LT S +
Sbjct: 84 CSTEACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEY---SVGYLGKDRLTLASNRSI-- 138
Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQM-GTSIAGKFSYCLP--DQGS 200
N IFGCG NL + + GIIG G + S +Q+ + FSYC P +
Sbjct: 139 --DNFIFGCGEDNLYNGVN----AGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPRDHENE 192
Query: 201 SKINFGGIVAGAGVVSTPLIIRDH---YYLSLEAISVGNQRLE-----FVSSSTGNIFVD 252
+ G ++ T LI DH Y + + V RLE ++S T VD
Sbjct: 193 GSLTIGPYARDINLMWTKLIYYDHKPAYAIQQLDMMVNGIRLEIDPYIYISKMT---IVD 249
Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSD-VLCY----NISSQPKFPEV 307
+G T + L M+ ++A+ G G+ + +C+ ++ FP V
Sbjct: 250 SGTADTYILSPVFDALDKAMTKEMQAK-----GYTRGWDERRICFISNSGSANWNDFPTV 304
Query: 308 TIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANI----VYGRIMQINFLIGYDIEQAM 363
+ + +KL N F S+ ++CS F +A + + G +F + +DI+
Sbjct: 305 EMKLIRSTLKLPVENAFYESSNNVICSTFLPDDAGVRGVQMLGNRAVRSFKLVFDIQAMN 364
Query: 364 VSFKPSRC 371
FK C
Sbjct: 365 FGFKARAC 372
>gi|218186446|gb|EEC68873.1| hypothetical protein OsI_37494 [Oryza sativa Indica Group]
Length = 353
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 106/368 (28%), Positives = 161/368 (43%), Gaps = 51/368 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPP---LFDPKKSSTYNSIS 91
Y M +S+GTPPV ++DTGS +W QC+ C ++ C+ Q +F+P SSTY+ +
Sbjct: 6 YFMGISLGTPPVFNLVTIDTGSTLSWVQCKNC-QIKCYDQAAKAGQIFNPYNSSTYSKVG 64
Query: 92 CSSSQC------AVVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
CS+ C V C E D C YS YG G Y S G L + LT S +
Sbjct: 65 CSTEACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEY---SVGYLGKDRLTLASNRSI-- 119
Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQM-GTSIAGKFSYCLP--DQGS 200
N IFGCG NL + + GIIG G + S +Q+ + FSYC P +
Sbjct: 120 --DNFIFGCGEDNLYNGVN----AGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPRDHENE 173
Query: 201 SKINFGGIVAGAGVVSTPLIIRDH---YYLSLEAISVGNQRLE-----FVSSSTGNIFVD 252
+ G ++ T LI DH Y + + V RLE ++S T VD
Sbjct: 174 GSLTIGPYARDINLMWTKLIYYDHKPAYAIQQLDMMVNGIRLEIDPYIYISKMT---IVD 230
Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSD-VLCY----NISSQPKFPEV 307
+G T + L M+ ++A+ G G+ + +C+ ++ FP V
Sbjct: 231 SGTADTYILSPVFDALDKAMTKEMQAK-----GYTRGWDERRICFISNSGSANWNDFPTV 285
Query: 308 TIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANI----VYGRIMQINFLIGYDIEQAM 363
+ + +KL N F S+ ++CS F +A + + G +F + +DI+
Sbjct: 286 EMKLIRSTLKLPVENAFYESSNNVICSTFLPDDAGVRGVQMLGNRAVRSFKLVFDIQAMN 345
Query: 364 VSFKPSRC 371
FK C
Sbjct: 346 FGFKARAC 353
>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
Length = 494
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 116/381 (30%), Positives = 174/381 (45%), Gaps = 61/381 (16%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + IGTP + VDTGSD W C+ CP E ++DP+ S + +
Sbjct: 89 LYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELV 148
Query: 91 SCSSSQC-----AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
+C C V+ S S C YS YG G S ++G T+ L +N SG
Sbjct: 149 TCDQQFCVANYGGVLPSCTSTSPCEYSISYGDG---SSTAGFFVTDFLQYNQVSGDGQTT 205
Query: 146 P---NVIFGCGHK---NLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK----FSYCL 195
P +V FGCG K +L S S+ GI+G G NSS++SQ+ + AGK F++CL
Sbjct: 206 PANASVSFGCGAKLGGDLGS--SNLALDGILGFGQSNSSMLSQL--AAAGKVRKMFAHCL 261
Query: 196 PDQGSSKINFGGIVAGAGVV-----STPLI-IRDHYYLSLEAISVGNQRLE-----FVSS 244
+N GGI A VV +TPL+ HY + L+ I VG L F S
Sbjct: 262 -----DTVNGGGIFAIGNVVQPKVKTTPLVPDMPHYNVILKGIDVGGTALGLPTNIFDSG 316
Query: 245 STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLC--YNISSQP 302
++ +D+G +P + L +++ + + V+ + D C Y+ S
Sbjct: 317 NSKGTIIDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTL------QDFSCFQYSGSVDD 370
Query: 303 KFPEVTIHFRGADVKL--SPSN-LFRNISDEIMCSAFRGGNAN-------IVYGRIMQIN 352
FPEVT HF G DV L SP + LF+N + C F+ G ++ G ++ N
Sbjct: 371 GFPEVTFHFEG-DVSLIVSPHDYLFQN-GKNLYCMGFQNGGVQTKDGKDMVLLGDLVLSN 428
Query: 353 FLIGYDIEQAMVSFKPSRCTN 373
L+ YD+E + + C++
Sbjct: 429 KLVLYDLENQAIGWADYNCSS 449
>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
Length = 494
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 116/381 (30%), Positives = 174/381 (45%), Gaps = 61/381 (16%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + IGTP + VDTGSD W C+ CP E ++DP+ S + +
Sbjct: 89 LYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELV 148
Query: 91 SCSSSQC-----AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
+C C V+ S S C YS YG G S ++G T+ L +N SG
Sbjct: 149 TCDQQFCVANYGGVLPSCTSTSPCEYSISYGDG---SSTAGFFVTDFLQYNQVSGDGQTT 205
Query: 146 P---NVIFGCGHK---NLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK----FSYCL 195
P +V FGCG K +L S S+ GI+G G NSS++SQ+ + AGK F++CL
Sbjct: 206 PANASVSFGCGAKLGGDLGS--SNLALDGILGFGQSNSSMLSQL--AAAGKVRKMFAHCL 261
Query: 196 PDQGSSKINFGGIVAGAGVV-----STPLII-RDHYYLSLEAISVGNQRLE-----FVSS 244
+N GGI A VV +TPL+ HY + L+ I VG L F S
Sbjct: 262 -----DTVNGGGIFAIGNVVQPKVKTTPLVSDMPHYNVILKGIDVGGTALGLPTNIFDSG 316
Query: 245 STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLC--YNISSQP 302
++ +D+G +P + L +++ + + V+ + D C Y+ S
Sbjct: 317 NSKGTIIDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTL------QDFSCFQYSGSVDD 370
Query: 303 KFPEVTIHFRGADVKL--SPSN-LFRNISDEIMCSAFRGGNAN-------IVYGRIMQIN 352
FPEVT HF G DV L SP + LF+N + C F+ G ++ G ++ N
Sbjct: 371 GFPEVTFHFEG-DVSLIVSPHDYLFQN-GKNLYCMGFQNGGVQTKDGKDMVLLGDLVLSN 428
Query: 353 FLIGYDIEQAMVSFKPSRCTN 373
L+ YD+E + + C++
Sbjct: 429 KLVLYDLENQAIGWADYNCSS 449
>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
Length = 482
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 112/373 (30%), Positives = 164/373 (43%), Gaps = 58/373 (15%)
Query: 35 YLMHLSIGTP-----PVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNS 89
Y+ +++GTP + S D GSD TW QC PC C+ Q P+++ KSS+ +
Sbjct: 125 YIAKITVGTPYENDSSFEALLSPDMGSDVTWLQCMPC--FRCYHQPGPVYNRLKSSSASD 182
Query: 90 ISCSSSQCAVVTSN--CSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
+ C + C + S+ C + +C Y YG G S S+G+ ETLTF V +
Sbjct: 183 VGCYAPACRALGSSGGCVQFLNECQYKVEYGDG---SSSAGDFGVETLTFPPG----VRV 235
Query: 146 PNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG----SS 201
P V GCG N + GI+GLG G+ S SQ+ FSYCL QG SS
Sbjct: 236 PGVAIGCGSDNQG--LFPAPAAGILGLGRGSLSFPSQIAGRYGRSFSYCLAGQGTGGRSS 293
Query: 202 KINFGGIVAGAGVVS---------TPLIIRDHYYLSLEAISVGNQRLEFVSSST------ 246
+ FG + + T + YY+ L ISVG R+ V+ S
Sbjct: 294 TLTFGSGASATTTTTTPPSFTPMLTNSRMYTFYYVGLVGISVGGVRVRGVTESDLRLDPS 353
Query: 247 ---GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGA-EPG----FSDVLCYNI 298
G + VD+G T L ++ + + + VK +G PG F D ++
Sbjct: 354 TGHGGVIVDSGTAVTRLSGPAYAAFR----DAFRVAAVKELGWPSPGGPFAFFDTCYSSV 409
Query: 299 SSQ--PKFPEVTIHFRGA-DVKLSPSNLFRNISDE--IMCSAFRG-GNANI-VYGRIMQI 351
+ K P V++HF G +VKL P N + MC AF G G+ + + G I
Sbjct: 410 RGRVMKKVPAVSMHFAGGVEVKLPPQNYLIPVDSNKGTMCFAFAGSGDRGVSIIGNIQLQ 469
Query: 352 NFLIGYDIEQAMV 364
F + YD++ V
Sbjct: 470 GFRVVYDVDGQRV 482
>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
gi|194696366|gb|ACF82267.1| unknown [Zea mays]
gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 411
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 109/360 (30%), Positives = 169/360 (46%), Gaps = 50/360 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ +S GTP V +DTGSD +W QC+PC CF Q+ PL+DP SSTY+++ C+S
Sbjct: 79 YVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPSHSSTYSAVPCAS 138
Query: 95 SQCAVVT-----SNCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
C + S C+ G C ++ Y G + + G + + LT G V+ N
Sbjct: 139 DVCKKLAADAYGSGCTSGKQCGFAISYADG---TSTVGAYSQDKLTL--APGAIVQ--NF 191
Query: 149 IFGCGH-KNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGG 207
FGCGH K+ D G++GLG L +G G FSYCLP SSK F
Sbjct: 192 YFGCGHGKHAVRGLFD----GVLGLG----RLRESLGARYGGVFSYCLPSV-SSKPGFLA 242
Query: 208 IVAG---AGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTL 259
+ AG +G V TP+ ++L I+VG ++L+ S+ +G + VD+G + T
Sbjct: 243 LGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFSGGMIVDSGTVITG 302
Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVTIHFR-GADV 316
L + L+S ++A + P CYN++ P++ + F GA +
Sbjct: 303 LQSTAYRALRSAFRKAMEAYRL-----LPNGDLDTCYNLTGYKNVVVPKIALTFTGGATI 357
Query: 317 KLS-PSNLFRNISDEIMCSAFR----GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
L P+ + N C AF G+A ++ G + Q F + +D + F+ C
Sbjct: 358 NLDVPNGILVN-----GCLAFAESGPDGSAGVL-GNVNQRAFEVLFDTSTSKFGFRAKAC 411
>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
Length = 506
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 104/388 (26%), Positives = 166/388 (42%), Gaps = 59/388 (15%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +GTPP + VDTGSD W C CP + +DPK SS+ +++
Sbjct: 86 LYFTEIKLGTPPKRYYVQVDTGSDILWVNCISCSKCPRKSGLGLDLTFYDPKASSSGSTV 145
Query: 91 SCSSSQCAVVTSNCSEG-----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
SC CA G C YS +YG G S ++G T+ L F+ +G
Sbjct: 146 SCDQGFCAATYGGKLPGCTANVPCEYSVMYGDG---SSTTGFFITDALQFDQVTGDGQTQ 202
Query: 146 P---NVIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGTSIAGK--FSYCLPD-Q 198
P + FGCG + S+ GI+G G N+S++SQ+ + K F++CL +
Sbjct: 203 PGNATITFGCGAQQGGDLGNSNQALDGILGFGQANTSMLSQLAAAGKAKKIFAHCLDTIK 262
Query: 199 GSSKINFGGIVA---------GAGVVSTPLII-------RDHYYLSLEAISVGNQRLE-- 240
G G +V G+++ PL + R HY ++L++I VG L+
Sbjct: 263 GGGIFAIGNVVQPKCYFVFFFAHGLLNIPLFLLVMILLSRPHYNVNLKSIDVGGTTLQLP 322
Query: 241 ---FVSSSTGNIFVDTGVLRTLLP-LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY 296
F + +D+G T LP L + + V S K + + A D LC+
Sbjct: 323 AHVFETGEKKGTIIDSGTTLTYLPELVFKQVMDVVFS---KHRDI----AFHNLQDFLCF 375
Query: 297 NISS--QPKFPEVTIHFRGADVKLS--PSNLFRNISDEIMCSAFRGGNAN-------IVY 345
S FP +T HF D+ L P F ++I C F+ G ++
Sbjct: 376 QYSGSVDDGFPTITFHFE-DDLALHVYPHEYFFPNGNDIYCVGFQNGALQSKDGKDIVLM 434
Query: 346 GRIMQINFLIGYDIEQAMVSFKPSRCTN 373
G ++ N L+ YD+E ++ + C++
Sbjct: 435 GDLVLSNKLVVYDLENQVIGWTDYNCSS 462
>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 452
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 103/370 (27%), Positives = 161/370 (43%), Gaps = 54/370 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISC-- 92
Y + + +G+P VDTGS +W QC+PC + C QE P+F+P S TY ++ C
Sbjct: 103 YYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPC-TIYCHIQEDPVFNPSASKTYKTVPCSS 161
Query: 93 ----SSSQCAVVTSNCSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
S + CS+ C Y YG +SFS G L+ + LT + L
Sbjct: 162 SQCSSLKSATLNEPTCSKQSNACVYKASYGD---SSFSLGYLSQDVLTLTPSQTL----S 214
Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL---------PD 197
+ ++GCG N + GIIGL S++SQ+ FSYCL P
Sbjct: 215 SFVYGCGQDNQG---LFGRTDGIIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSPK 271
Query: 198 QGSSKINFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSSTG-NIFVD 252
+G I + + TPL+ + Y++ LE+I+V + L +SS +D
Sbjct: 272 EGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVPTIID 331
Query: 253 TGVLRTLLPLEYHSNLKSVMSNMI--KAQPVKGV--------GAEPGFSDVLCYNISSQP 302
+G + T LP ++ LK+ ++ K Q G+ G+ G S+V
Sbjct: 332 SGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKGSLAGISEVA-------- 383
Query: 303 KFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQ 361
P++ I F+ GAD++L N + I C A G ++ + G Q + YD+
Sbjct: 384 --PDIRIIFKGGADLQLKGHNSLVELETGITCLAMAGSSSIAIIGNYQQQTVKVAYDVGN 441
Query: 362 AMVSFKPSRC 371
+ V F P C
Sbjct: 442 SRVGFAPGGC 451
>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 467
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 102/413 (24%), Positives = 179/413 (43%), Gaps = 87/413 (21%)
Query: 22 IIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDP 81
++ +A ++S YL+ L +GTP ++DT SD WTQC+PC + C+KQ P+F+P
Sbjct: 75 VVAEAPVLSAGGEYLVKLGLGTPQHCFTAAIDTASDLIWTQCQPC--VKCYKQLDPVFNP 132
Query: 82 KKSSTYNSISCSSSQC-AVVTSNCS-EGD------CSYSFLYGRGAYASFSSGNLATETL 133
S++Y + C+S C + T C+ +GD C Y++ YG A + G LA + L
Sbjct: 133 VASTSYAVVPCNSDTCDELDTHRCARDGDSDDEDACQYTYSYGGNAT---TRGILAVDRL 189
Query: 134 TFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSY 193
V+FGC ++ P + +G++GLG G SL+SQ+ +F Y
Sbjct: 190 AIGDDV-----FRGVVFGCSSSSVGGPP--PQVSGVVGLGRGALSLVSQLSVR---RFMY 239
Query: 194 CLPDQGSSKINFGGIVAGAGVVSTPLIIRD-----------------HYYLSLEAISVGN 236
CLP S + G +V GA +T +R+ +YYL+L+ IS+G+
Sbjct: 240 CLPPPVSR--SAGRLVLGADAAAT---VRNASERVVVPMSTGSRYPSYYYLNLDGISIGD 294
Query: 237 QRLEFVSSSTGN--------------------------------IFVDTGVLRTLLPLEY 264
+ + F S + N + +D T L
Sbjct: 295 RAMSFRSRNRMNATTPGTAAGAPASPVSGSGDGDGSGTGPDAYGMIIDIASTITFLEESL 354
Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS-----SQPKFPEVTIHFRGADVKLS 319
+ + + I+ +G G++ G LC+ + S+ P V++ F G ++L
Sbjct: 355 YEEMVDDLEEEIRLP--RGSGSDLGLD--LCFILPEGVPMSRVYAPPVSLAFEGVWLRLD 410
Query: 320 PSNLF-RNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+F + + +MC + + G Q N + Y++ + ++F + C
Sbjct: 411 KEQMFVEDRASGMMCLMVGKTDGVSILGNYQQQNMQVMYNLRRGRITFIKTAC 463
>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
Length = 452
Score = 117 bits (294), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 103/370 (27%), Positives = 161/370 (43%), Gaps = 54/370 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISC-- 92
Y + + +G+P VDTGS +W QC+PC + C QE P+F+P S TY ++ C
Sbjct: 103 YYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPC-TIYCHIQEDPVFNPSASKTYKTVPCSS 161
Query: 93 ----SSSQCAVVTSNCSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
S + CS+ C Y YG +SFS G L+ + LT + L
Sbjct: 162 SQCSSLKSATLNEPTCSKQSNACVYKASYGD---SSFSLGYLSQDVLTLTPSQTL----S 214
Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL---------PD 197
+ ++GCG N + GIIGL S++SQ+ FSYCL P
Sbjct: 215 SFVYGCGQDNQG---LFGRTDGIIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSPK 271
Query: 198 QGSSKINFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSSTG-NIFVD 252
+G I + + TPL+ + Y++ LE+I+V + L +SS +D
Sbjct: 272 EGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVPTIID 331
Query: 253 TGVLRTLLPLEYHSNLKSVMSNMI--KAQPVKGV--------GAEPGFSDVLCYNISSQP 302
+G + T LP ++ LK+ ++ K Q G+ G+ G S+V
Sbjct: 332 SGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKGSLAGISEVA-------- 383
Query: 303 KFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQ 361
P++ I F+ GAD++L N + I C A G ++ + G Q + YD+
Sbjct: 384 --PDIRIIFKGGADLQLKGHNSLVELETGITCLAMAGSSSIAIIGNYQQQTVKVAYDVGN 441
Query: 362 AMVSFKPSRC 371
+ V F P C
Sbjct: 442 SRVGFAPGGC 451
>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 117 bits (294), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 113/381 (29%), Positives = 183/381 (48%), Gaps = 66/381 (17%)
Query: 39 LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA 98
L++G+PP +I +DTGS+ +W C+ P L +F+P SSTY+ + CSS C
Sbjct: 65 LAVGSPPQNISMVLDTGSELSWLHCKKSPNLGS------VFNPVSSSTYSPVPCSSPICR 118
Query: 99 VVTSNC---SEGDCSYSFLYGRGAYASFSS--GNLATETLTFNSTSGLPVEMPNVIFGCG 153
T + + D F + +YA +S GNLA +T S V P +FGC
Sbjct: 119 TRTRDLPIPASCDPKTHFCHVAISYADATSIEGNLAHDTFVIGS-----VTRPGTLFGCM 173
Query: 154 HKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGA 212
L+S + D+K TG++G+ G+ S ++Q+G S KFSYC+ SS I G + +
Sbjct: 174 DSGLSSDSEEDAKSTGLMGMNRGSLSFVNQLGFS---KFSYCISGSDSSGILLLGDASYS 230
Query: 213 G---VVSTPLII---------RDHYYLSLEAISVGNQRLE-----FVSSST--GNIFVDT 253
+ TPL++ R Y + LE I VG++ L FV T G VD+
Sbjct: 231 WLGPIQYTPLVLQTTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDS 290
Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGF----SDVLCYNI--SSQPKF--- 304
G T L ++ LK+ + + + V + +P F + LCY + S++P F
Sbjct: 291 GTQFTFLMGPVYTALKNEF--IAQTKSVLRIVDDPNFVFQGTMDLCYRVGSSTRPNFTGL 348
Query: 305 PEVTIHFRGADVKLSPSNLFRNIS-------DEIMCSAFRGGNANI------VYGRIMQI 351
P +++ FRGA++ +S L ++ +E+ C F GN+++ V G Q
Sbjct: 349 PVISLMFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTF--GNSDLLGIEAFVIGHHHQQ 406
Query: 352 NFLIGYDIEQAMVSFKPS-RC 371
N + +D+ ++ V F + RC
Sbjct: 407 NVWMEFDLAKSRVGFAGNVRC 427
>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 414
Score = 117 bits (294), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 102/368 (27%), Positives = 167/368 (45%), Gaps = 52/368 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + IG + + VDTGSD TW QC+PC C+ Q+ PLF+P S +Y +I C+S
Sbjct: 67 YIVTVEIGGRNMTVI--VDTGSDLTWVQCQPCRL--CYNQQDPLFNPSGSPSYQTILCNS 122
Query: 95 SQC----------AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
S C V SN C+Y YG G+Y + G+L E L +T
Sbjct: 123 STCQSLQYATGNLGVCGSNTPT--CNYVVNYGDGSY---TRGDLGMEQLNLGTT-----H 172
Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD---QGSS 201
+ N IFGCG N S G++GLG + SL+SQ G FSYCLP S
Sbjct: 173 VSNFIFGCGRNNKGLFGGAS---GLMGLGKSDLSLVSQTSAIFEGVFSYCLPTTAADASG 229
Query: 202 KINFGG---IVAGAGVVSTPLIIRD-----HYYLSLEAISVGNQRLEFVSSSTGNIFVDT 253
+ GG + +S +I + Y+L+L IS+G L+ + I +D+
Sbjct: 230 SLILGGNSSVYKNTTPISYTRMIANPQLPTFYFLNLTGISIGGVALQAPNYRQSGILIDS 289
Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIH 310
G + T LP + +LK+ P + P FS + C+N++ + P + +
Sbjct: 290 GTVITRLPPPVYRDLKAEFLKQFSGFP-----SAPPFSILDTCFNLNGYDEVDIPTIRMQ 344
Query: 311 FRG-ADVKLSPSNLFRNISDE-----IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMV 364
F G A++ + + +F + + + ++ + + G Q N + Y+ +++ +
Sbjct: 345 FEGNAELTVDVTGIFYFVKTDASQVCLALASLSFDDEIPIIGNYQQRNQRVIYNTKESKL 404
Query: 365 SFKPSRCT 372
F C+
Sbjct: 405 GFAAEACS 412
>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 527
Score = 117 bits (294), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 107/382 (28%), Positives = 176/382 (46%), Gaps = 58/382 (15%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y M + +GTPP +DTGSD W QC PC DCF Q +DPK S+++ +I+C+
Sbjct: 160 YFMDVLVGTPPKHFSLILDTGSDLNWLQCLPC--YDCFHQNGMFYDPKTSASFKNITCND 217
Query: 95 SQCAVVTS-----NCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTS----GLPV 143
+C++++S C + C Y + YG S ++G+ A ET T N T+
Sbjct: 218 PRCSLISSPDPPVQCESDNQSCPYFYWYGD---RSNTTGDFAVETFTVNLTTTEGGSSEY 274
Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG---- 199
++ N++FGCGH N + S G+ S SQ+ + FSYCL D+
Sbjct: 275 KVGNMMFGCGHWNRGLFSGASGLLGLGRGPLSFS---SQLQSLYGHSFSYCLVDRNSNTN 331
Query: 200 -SSKINFG---GIVAGAGVVSTPLI------IRDHYYLSLEAISVGNQRLE-------FV 242
SSK+ FG ++ + T + + YY+ +++I VG + L+
Sbjct: 332 VSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGKALDIPEETWNIS 391
Query: 243 SSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSD--VL--CYNI 298
S G +D+G + + +K+ + +K P F D VL C+N+
Sbjct: 392 SDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKEN-------YPIFRDFPVLDPCFNV 444
Query: 299 S----SQPKFPEVTIHFRGADVKLSPS-NLFRNISDEIMCSAFRGGNANI--VYGRIMQI 351
S + PE+ I F V P+ N F +S++++C A G + + G Q
Sbjct: 445 SGIEENNIHLPELGIAFVDGTVWNFPAENSFIWLSEDLVCLAILGTPKSTFSIIGNYQQQ 504
Query: 352 NFLIGYDIEQAMVSFKPSRCTN 373
NF I YD +++ + F P++C +
Sbjct: 505 NFHILYDTKRSRLGFTPTKCAD 526
>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
Length = 475
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 107/371 (28%), Positives = 160/371 (43%), Gaps = 51/371 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +GTP +DTGSD W QC PC C+ Q +FDP++S +Y ++ C +
Sbjct: 122 YFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRH--CYAQSGRVFDPRRSRSYAAVDCVA 179
Query: 95 SQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
C + S + C Y YG G S ++G+ A+ETLTF + + V G
Sbjct: 180 PICRRLDSAGCDRRRNSCLYQVAYGDG---SVTAGDFASETLTFARGA----RVQRVAIG 232
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ---------GSSK 202
CGH N + S G+ S +Q+ S FSYCL D+ SS
Sbjct: 233 CGHDNEGLFIAASGLLGLGRG---RLSFPTQIARSFGRSFSYCLVDRTSSVRPSSTRSST 289
Query: 203 INFGGIVAGAGVVS--TPL----IIRDHYYLSLEAISVGNQRLEFVSSS---------TG 247
+ FG A + TP+ + YY+ L SVG R++ VS S G
Sbjct: 290 VTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRG 349
Query: 248 NIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQP--K 303
+ +D+G T L + ++ + +A V G+ PG + CYN+S + K
Sbjct: 350 GVILDSGTSVTRLARPVYEAVR----DAFRAAAV-GLRVSPGGFSLFDTCYNLSGRRVVK 404
Query: 304 FPEVTIHFR-GADVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIE 360
P V++H GA V L P N + + C A G + + + G I Q F + +D +
Sbjct: 405 VPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGD 464
Query: 361 QAMVSFKPSRC 371
V F P C
Sbjct: 465 AQRVGFVPKSC 475
>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
Length = 475
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 99/366 (27%), Positives = 160/366 (43%), Gaps = 36/366 (9%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDC---FKQEPPLFDPKKSSTYNSI 90
+Y + +G+PP + VDTGSD W C+PCP+ LFD SST +
Sbjct: 73 LYFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSLFDMNASSTSKKV 132
Query: 91 SCSSSQCAVVTSNCS---EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP- 146
C C+ ++ + S CSY +Y A S S G + LT +G P
Sbjct: 133 GCDDDFCSFISQSDSCQPALGCSYHIVY---ADESTSDGKFIRDMLTLEQVTGDLKTGPL 189
Query: 147 --NVIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGTSIAGK--FSYCLPDQGSS 201
V+FGCG DS G++G G N+S++SQ+ + K FS+CL +
Sbjct: 190 GQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKGG 249
Query: 202 KINFGGIVAGAGVVSTPLIIRD-HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRT 258
I G+V V +TP++ HY + L + V L+ S G VD+G
Sbjct: 250 GIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDVDGTSLDLPRSIVRNGGTIVDSGTTLA 309
Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHFRGADV 316
P + S++ ++ QPVK E F C++ S+ FP V+ F + V
Sbjct: 310 YFPKVLYD---SLIETILARQPVKLHIVEETFQ---CFSFSTNVDEAFPPVSFEFEDS-V 362
Query: 317 KLS--PSNLFRNISDEIMCSAFRGGNAN-------IVYGRIMQINFLIGYDIEQAMVSFK 367
KL+ P + + +E+ C ++ G I+ G ++ N L+ YD++ ++ +
Sbjct: 363 KLTVYPHDYLFTLEEELYCFGWQAGGLTTDERSEVILLGDLVLSNKLVVYDLDNEVIGWA 422
Query: 368 PSRCTN 373
C++
Sbjct: 423 DHNCSS 428
>gi|326490656|dbj|BAJ89995.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 117 bits (293), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 120/355 (33%), Positives = 173/355 (48%), Gaps = 43/355 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YL+ + +G+P V +DTGSD +W QC+PC + C Q LFDP SSTY++ SC+S
Sbjct: 127 YLITVGMGSPAVAQTMLIDTGSDVSWVQCKPCSQ--CHSQADSLFDPSSSSTYSAFSCTS 184
Query: 95 SQCAVVTSN-CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
+ CA + CS C Y+ YG G S SG +++TL S++ + N FGC
Sbjct: 185 AACAQLRQRGCSSSQCQYTVKYGDG---STGSGTYSSDTLALGSST-----VENFQFGCS 236
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ-GSSKINFGGIVAGA 212
+ D + G++GLG G SL +Q + FSYCLP GSS G
Sbjct: 237 QSESGNLLQD-QTAGLMGLGGGAESLATQTAGTFGKAFSYCLPPTPGSSGFLTLGASTSG 295
Query: 213 GVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEYHS 266
VV TP++ + +Y + L+AI VG ++L +S S G+I +D+G + T LP +S
Sbjct: 296 FVVKTPMLRSTQVPSYYGVLLQAIRVGGRQLNIPASAFSAGSI-MDSGTIITRLPRTAYS 354
Query: 267 NLKSVMSNMIKAQPVKGVGAEP-GFSDVLCYNISSQP--KFPEVTIHFR-GADVKLSPSN 322
L S +K P A+P G D C++ S Q P V + F GA V L+
Sbjct: 355 ALSSAFKAGMKQYPP----AQPMGIFDT-CFDFSGQSSVSIPTVALVFSGGAVVDLA--- 406
Query: 323 LFRNISDEIM---CSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
SD I+ C AF + + + G + Q F + YD+ V FK C
Sbjct: 407 -----SDGIILGSCLAFAANSDDTSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 456
>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 445
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 108/359 (30%), Positives = 167/359 (46%), Gaps = 48/359 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ +S GTP V +DTGSD +W QC+PC CF Q+ PL+DP SSTY+++ C+S
Sbjct: 113 YVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPSHSSTYSAVPCAS 172
Query: 95 SQCAVVT-----SNCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
C + S C+ G C ++ Y G + + G + + LT G V+ N
Sbjct: 173 DVCKKLAADAYGSGCTSGKQCGFAISYADG---TSTVGAYSQDKLTL--APGAIVQ--NF 225
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGI 208
FGCGH A G++GLG L +G G FSYCLP SSK F +
Sbjct: 226 YFGCGHGKHA---VRGLFDGVLGLG----RLRESLGARYGGVFSYCLPSV-SSKPGFLAL 277
Query: 209 VAG---AGVVSTPLII----RDHYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLL 260
AG +G V TP+ ++L I+VG ++L+ S+ +G + VD+G + T L
Sbjct: 278 GAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFSGGMIVDSGTVITGL 337
Query: 261 PLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVTIHFR-GADVK 317
+ L+S ++A + P CYN++ P++ + F GA +
Sbjct: 338 QSTAYRALRSAFRKAMEAYRLL-----PNGDLDTCYNLTGYKNVVVPKIALTFTGGATIN 392
Query: 318 LS-PSNLFRNISDEIMCSAFR----GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
L P+ + N C AF G+A ++ G + Q F + +D + F+ C
Sbjct: 393 LDVPNGILVN-----GCLAFAESGPDGSAGVL-GNVNQRAFEVLFDTSTSKFGFRAKAC 445
>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 387
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 112/364 (30%), Positives = 173/364 (47%), Gaps = 49/364 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YL+ +++GTP + + ++DTGSD TWTQCEPC C++Q FDP+KSS+Y ++SCSS
Sbjct: 45 YLVKMALGTPKLSLSLALDTGSDITWTQCEPCVG-SCYRQAQTKFDPRKSSSYKNVSCSS 103
Query: 95 SQCAVVTSN-----CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
S C ++T + C C Y YG G+Y S G ATE LT + + + N +
Sbjct: 104 SSCRIITDSGGARGCVSSTCIYKVQYGDGSY---SVGFFATEKLTISPSD----VISNFL 156
Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK----FSYCLPDQGSS---K 202
FGCG +N + + G I G + + K F+YCLP SS
Sbjct: 157 FGCGQQN-------AGRFGRIAGLLGLGRGKLSLALQTSEKYNNLFTYCLPSFSSSSTGH 209
Query: 203 INFGGIVAGAGVVSTPL--IIRD--HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVL 256
+ GG V + V TPL ++ Y + ++ +SVG L +S S +D+G +
Sbjct: 210 LTLGGQVPKS-VKFTPLSPAFKNTPFYGIDIKGLSVGGHVLPIDASVFSNAGAIIDSGTV 268
Query: 257 RTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQPKF--PEVTIHFRG 313
T L +S L S ++K P GFS + CY+ S P ++ F+G
Sbjct: 269 ITRLQPTVYSALSSKFQQLMKDYP-----KTDGFSILDTCYDFSGNESISVPRISFFFKG 323
Query: 314 A---DVKLSPSNLFRNISDEIMCSAFRGGNAN---IVYGRIMQINFLIGYDIEQAMVSFK 367
D+K N D++ C AF + + +V+G Q + + +D+ + + F
Sbjct: 324 GVEVDIKFFGILTVINAWDKV-CLAFAPNDDDGDFVVFGNSQQQTYDVVHDLAKGRIGFA 382
Query: 368 PSRC 371
PS C
Sbjct: 383 PSGC 386
>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
Length = 451
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 107/379 (28%), Positives = 170/379 (44%), Gaps = 57/379 (15%)
Query: 32 DDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQE--PPLFDPKKSSTYNS 89
D + + + IGTPP VDTGSD WTQC+ + PP++DP +SST+
Sbjct: 88 DQGHSLTVGIGTPPQPRKLIVDTGSDLIWTQCKLSSSTAVAARHGSPPVYDPGESSTFAF 147
Query: 90 ISCSSSQC---AVVTSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
+ CS C NC S+ C Y +YG A + G LA+ET TF + + + +
Sbjct: 148 LPCSDRLCQEGQFSFKNCTSKNRCVYEDVYGSAA----AVGVLASETFTFGARRAVSLRL 203
Query: 146 PNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL---PDQGSSK 202
FGCG + S TGI+GL P + SLI+Q+ +FSYCL D+ +S
Sbjct: 204 G---FGCGALSAGSLIG---ATGILGLSPESLSLITQLKIQ---RFSYCLTPFADKKTSP 254
Query: 203 INFGGI-----------VAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSSST----- 246
+ FG + + +VS P + +YY+ L IS+G++RL ++S
Sbjct: 255 LLFGAMADLSRHKTTRPIQTTAIVSNP-VKTVYYYVPLVGISLGHKRLAVPAASLAMRPD 313
Query: 247 --GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP-- 302
G VD+G L +K + ++++ PV E LC+ + +
Sbjct: 314 GGGGTIVDSGSTVAYLVEAAFEAVKEAVMDVVRL-PVANRTVE---DYELCFVLPRRTAA 369
Query: 303 ------KFPEVTIHFRGADVKLSPS-NLFRNISDEIMCSAF---RGGNANIVYGRIMQIN 352
+ P + +HF G + P N F+ +MC A G+ + G + Q N
Sbjct: 370 AAMEAVQVPPLVLHFDGGAAMVLPRDNYFQEPRAGLMCLAVGKTTDGSGVSIIGNVQQQN 429
Query: 353 FLIGYDIEQAMVSFKPSRC 371
+ +D++ SF P++C
Sbjct: 430 MHVLFDVQHHKFSFAPTQC 448
>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 463
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 115/367 (31%), Positives = 168/367 (45%), Gaps = 47/367 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + + +GTP VDTGS +W QC+PC + C Q P+F P S TY ++ CSS
Sbjct: 113 YYVKIGLGTPAKYFSMIVDTGSSLSWLQCQPC-VIYCHVQVDPIFTPSTSKTYKALPCSS 171
Query: 95 SQCAVVTS------NCSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
SQC+ + S CS G C Y YG SFS G L+ + LT P E P
Sbjct: 172 SQCSSLKSSTLNAPGCSNATGACVYKASYGD---TSFSIGYLSQDVLTLT-----PSEAP 223
Query: 147 N--VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKIN 204
+ ++GCG N + +GIIGL S++ Q+ FSYCLP S+ +
Sbjct: 224 SSGFVYGCGQDNQG---LFGRSSGIIGLANDKISMLGQLSKKYGNAFSYCLPSSFSAPNS 280
Query: 205 F---GGIVAGAGVVS------TPLI----IRDHYYLSLEAISVGNQRLEFVSSSTGNI-- 249
G + GA ++ TPL+ I Y+L L I+V + L VS+S+ N+
Sbjct: 281 SSLSGFLSIGASSLTSSPYKFTPLVKNQKIPSLYFLDLTTITVAGKPLG-VSASSYNVPT 339
Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCY--NISSQPKFPE 306
+D+G + T LP+ ++ LK ++ K PGFS + C+ ++ PE
Sbjct: 340 IIDSGTVITRLPVAVYNALKKSFVLIMS----KKYAQAPGFSILDTCFKGSVKEMSTVPE 395
Query: 307 VTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMV 364
+ I FR GA ++L N I C A + I + G Q F + YD+ +
Sbjct: 396 IQIIFRGGAGLELKAHNSLVEIEKGTTCLAIAASSNPISIIGNYQQQTFKVAYDVANFKI 455
Query: 365 SFKPSRC 371
F P C
Sbjct: 456 GFAPGGC 462
>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
vinifera]
Length = 561
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 103/375 (27%), Positives = 170/375 (45%), Gaps = 49/375 (13%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + IGTP D + VDTGSD W C+ CP + L+D K S+T +++
Sbjct: 154 LYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAV 213
Query: 91 SCSSSQCAVVTS---NCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
C + C++ C G C YS LYG G S ++G + + +N SG P
Sbjct: 214 GCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDG---SSTTGYFVQDFVQYNRISGNFQTTP 270
Query: 147 ---NVIFGCGHKNLASPTSDSKQ-TGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQGS 200
V+FGCG+K S S+ GI+G G NSS++SQ+ +S + FS+CL +
Sbjct: 271 TNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDN--- 327
Query: 201 SKINFGGIVAGAGVVS-----TPLII-RDHYYLSLEAISVGNQRLE-----FVSSSTGNI 249
++ GGI A VV TPL+ + HY + ++ I VG L+ F S
Sbjct: 328 --VDGGGIFAIGEVVEPKVNITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAFESGDRKGT 385
Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQP-VKGVGAEPGFSDVLCYNISS--QPKFPE 306
+D+G P E + L + ++ QP ++ E F+ C++ + FP
Sbjct: 386 IIDSGTTLAYFPQEVYVPL---IEKILSQQPDLRLHTVEQAFT---CFDYTGNVDDGFPT 439
Query: 307 VTIHF-RGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQI-------NFLIGYD 358
VT+HF + + + P + + C ++ A G+ + + N L+ YD
Sbjct: 440 VTLHFDKSISLTVYPHEYLFQVKEFEWCIGWQNSGAQTKDGKDLTLLGDLVLSNKLVVYD 499
Query: 359 IEQAMVSFKPSRCTN 373
+E+ + + C++
Sbjct: 500 LEKQGIGWVEYNCSS 514
>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
Length = 480
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 103/375 (27%), Positives = 170/375 (45%), Gaps = 49/375 (13%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + IGTP D + VDTGSD W C+ CP + L+D K S+T +++
Sbjct: 73 LYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAV 132
Query: 91 SCSSSQCAVVTS---NCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
C + C++ C G C YS LYG G S ++G + + +N SG P
Sbjct: 133 GCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDG---SSTTGYFVQDFVQYNRISGNFQTTP 189
Query: 147 ---NVIFGCGHKNLASPTSDSKQ-TGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQGS 200
V+FGCG+K S S+ GI+G G NSS++SQ+ +S + FS+CL +
Sbjct: 190 TNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDN--- 246
Query: 201 SKINFGGIVAGAGVVS-----TPLII-RDHYYLSLEAISVGNQRLE-----FVSSSTGNI 249
++ GGI A VV TPL+ + HY + ++ I VG L+ F S
Sbjct: 247 --VDGGGIFAIGEVVEPKVNITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAFESGDRKGT 304
Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQP-VKGVGAEPGFSDVLCYNISS--QPKFPE 306
+D+G P E + L + ++ QP ++ E F+ C++ + FP
Sbjct: 305 IIDSGTTLAYFPQEVYVPL---IEKILSQQPDLRLHTVEQAFT---CFDYTGNVDDGFPT 358
Query: 307 VTIHF-RGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQI-------NFLIGYD 358
VT+HF + + + P + + C ++ A G+ + + N L+ YD
Sbjct: 359 VTLHFDKSISLTVYPHEYLFQVKEFEWCIGWQNSGAQTKDGKDLTLLGDLVLSNKLVVYD 418
Query: 359 IEQAMVSFKPSRCTN 373
+E+ + + C++
Sbjct: 419 LEKQGIGWVEYNCSS 433
>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 478
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 123/393 (31%), Positives = 172/393 (43%), Gaps = 65/393 (16%)
Query: 31 VDDIYLMHLSIGTP-PVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNS 89
+D YL+HLSIGTP P + ++DTGSD WTQC C CF Q P FD S T +
Sbjct: 96 IDSEYLIHLSIGTPRPQRVALTLDTGSDLVWTQCA-CHV--CFAQPFPTFDALASQTTLA 152
Query: 90 ISCSSSQCA---VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG------ 140
+ CS C S C+ D + +LY A S +SG + +T TF S G
Sbjct: 153 VPCSDPICTSGKYPLSGCTFNDNTCFYLYDY-ADKSITSGRIVEDTFTFRSPQGNNGSKA 211
Query: 141 -LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYC---LP 196
V +PNV FGCG N S ++GI G G SL SQ+ + +FS+C +
Sbjct: 212 HAGVAVPNVRFGCGQYNKG--IFKSNESGIAGFSRGPMSLPSQLKVA---RFSHCFTAIA 266
Query: 197 DQGSSKINFGGI--------VAGAGVVSTPLIIRDH--YYLSLEAISVGNQRLEF----- 241
D +S + GG A V STP + YYL+L+ I+VG RL
Sbjct: 267 DARTSPVFLGGAPGPDNLGAHATGPVQSTPFANSNGSLYYLTLKGITVGKTRLPLNALAF 326
Query: 242 ----VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYN 297
S +G +D+G LP + +L++ +K PV A S LC+
Sbjct: 327 AGKGTGSGSGGTIIDSGTGIRTLPGPMYRSLRAAFVARVKL-PVANESAADAES-TLCFE 384
Query: 298 ISSQPKFP---------EVTIHFRGADVKL-SPSNLFRNISDE---------IMCSAFRG 338
+ P +V +H GAD L S + + DE +M SA G
Sbjct: 385 AARSASLPPEAPAPALPKVVLHVAGADWDLPRESYVLDLLEDEDGSGSGLCLVMNSA--G 442
Query: 339 GNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ + G Q N + YD+E+ + F P+RC
Sbjct: 443 DSDLTIIGNFQQQNMHVAYDLEKNKLVFVPARC 475
>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 108/353 (30%), Positives = 166/353 (47%), Gaps = 41/353 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + IG+P V +DTGSD +W +C L LFDP KS+TY SCSS
Sbjct: 129 YVITVGIGSPAVTQTMMIDTGSDVSWVRCNSTDGL-------TLFDPSKSTTYAPFSCSS 181
Query: 95 SQCAVVTSN---CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
+ CA + +N CS C Y YG G S ++G +++TL +++ + + FG
Sbjct: 182 AACAQLGNNGDGCSNSGCQYRVQYGDG---SNTTGTYSSDTLALSASD----TVTDFHFG 234
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKINFGGIV 209
C H K G++GLG SL+SQ + FSYCLP ++ S + FG
Sbjct: 235 CSHHE--EDFDGEKIDGLMGLGGDAQSLVSQTAATYGKSFSYCLPPTNRTSGFLTFGAPN 292
Query: 210 A-GAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPL 262
G V+TP++ Y + L+ ISVG L S S G++ +D+G + T LP
Sbjct: 293 GTSGGFVTTPMLRWPKAPTLYGVLLQDISVGGTPLGIQPSVLSNGSV-MDSGTVITWLPR 351
Query: 263 EYHSNLKSVM-SNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHFR-GADVKL 318
+S L S S+M + + + A G D CY+ + P V++ GA V L
Sbjct: 352 RAYSALSSAFRSSMTRLRHQR--AAPLGILDT-CYDFTGLVNVSIPAVSLVLDGGAVVDL 408
Query: 319 SPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ + I D C AF + + + G + Q F + +D+ Q + F+ C
Sbjct: 409 DGNGIM--IQD---CLAFAATSGDSIIGNVQQRTFEVLHDVGQGVFGFRSGAC 456
>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
Length = 509
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 98/303 (32%), Positives = 134/303 (44%), Gaps = 39/303 (12%)
Query: 44 PPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV--- 100
P V +DT SD W QC PCP C+ Q L+DP KS + S +CSS C +
Sbjct: 178 PGVRQLMLLDTASDVAWVQCFPCPASQCYAQTDVLYDPSKSRSSESFACSSPTCRQLGPY 237
Query: 101 -----TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHK 155
+S+ S G C Y Y G S +SG L + L+ + TS ++P FGC H
Sbjct: 238 ANGCSSSSNSAGQCQYRVRYPDG---STTSGTLVADQLSLSPTS----QVPKFEFGCSHA 290
Query: 156 NLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINF--------GG 207
S S SK GI+ LG G SL+SQ T FSYC P S K F
Sbjct: 291 ARGS-FSRSKTAGIMALGRGVQSLVSQTSTKYGQVFSYCFPPTASHKGFFVLGVPRRSSS 349
Query: 208 IVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEYH 265
A ++ TP++ Y + LEAI+V QRL+ + + G V+ L P Y
Sbjct: 350 RYAVTPMLKTPML----YQVRLEAIAVAGQRLDVPPTVFAAGAALDSRTVITRLPPTAYQ 405
Query: 266 SNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHF--RGADVKLSPS 321
+ + M +P A G D CY+ + S P +++ F GA V+L PS
Sbjct: 406 ALRSAFRDKMSMYRPA----AANGQLDT-CYDFTGVSSIMLPTISLVFDRTGAGVQLDPS 460
Query: 322 NLF 324
+
Sbjct: 461 GVL 463
>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
Length = 493
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 114/375 (30%), Positives = 174/375 (46%), Gaps = 53/375 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+ +++GTP V+ ++DTGSD TW QC+PC C+ Q P+FDP+ S++Y + +
Sbjct: 134 YMAKIAVGTPAVEALLAMDTGSDITWLQCQPCRR--CYPQSGPVFDPRHSTSYREMGYDA 191
Query: 95 SQCAVV----TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
C + + C Y+ YG S + G+ ETLTF V++P++
Sbjct: 192 PDCQALGRSGGGDAKRMTCVYAVGYGDD--GSTTVGDFIEETLTFAGG----VQVPHMSI 245
Query: 151 GCGHKN---LASPTSDSKQTGIIGLGPGNSSLISQ---MGTSIAGKFSYCLPD------- 197
GCGH N A+P + GI+GLG G S SQ +G ++ FSYCL D
Sbjct: 246 GCGHDNKGLFAAPAA-----GILGLGRGQISCPSQIAALGYNVT-SFSYCLADFFLSSPG 299
Query: 198 -QGSSKINFG-GIVAGAGVVS-TPLI----IRDHYYLSLEAISVGNQRLEFVS------- 243
SS + G G AG+ S TP + + YY+ L +SVG R+ V+
Sbjct: 300 RSVSSTLTIGDGAAAGSPPPSFTPTVQNLNMATFYYVRLVGVSVGGVRVPGVTEDDLKLD 359
Query: 244 --SSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ 301
+ G + +D+G T L + + +G GF D CY + +
Sbjct: 360 PYTGRGGVILDSGTAVTRLARRAYIAFRDAFRAAAVDLGQVSIGGPSGFFDT-CYTMGGR 418
Query: 302 P-KFPEVTIHFRGA-DVKLSPSNLFRNI-SDEIMCSAFRG-GNANI-VYGRIMQINFLIG 356
K P V++HF G ++ L P N + S +C AF G G+ ++ + G I Q F +
Sbjct: 419 AMKVPTVSMHFAGGVELTLPPKNYLIPVDSMGTVCFAFAGTGDRSVSIIGNIQQQGFRVV 478
Query: 357 YDIEQAMVSFKPSRC 371
Y+I V F P+ C
Sbjct: 479 YNIGGGRVGFAPNSC 493
>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
Length = 505
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 105/363 (28%), Positives = 159/363 (43%), Gaps = 44/363 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
+++ + G+P + S+DTGSD +W QC PC C+KQ P+FDP KS+TY+++ C
Sbjct: 161 FVVTVGFGSPAQNYTLSIDTGSDVSWIQCLPC-SGHCYKQHDPVFDPTKSATYSAVPCGH 219
Query: 95 SQCAVVTSNCS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
QCA CS G C Y YG G S ++G L+ ETL+ +ST ++P FGCG
Sbjct: 220 PQCAAAGGKCSNSGTCLYKVTYGDG---SSTAGVLSHETLSLSSTR----DLPGFAFGCG 272
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKINFGGIVAG 211
NL G+ G SL SQ + FSYCLP D + G
Sbjct: 273 QTNLGEFGGVDGLVGLGR---GALSLPSQAAATFGATFSYCLPSYDTTHGYLTMGSTTPA 329
Query: 212 AG-----VVSTPLIIRDH----YYLSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTL 259
A V T +I ++ Y++ + +I +G L V + G +F D+G + T
Sbjct: 330 ASNDDDDVQYTAMIQKEDYPSLYFVEVVSIDIGGYILPVPPTVFTRDGTLF-DSGTILTY 388
Query: 260 LPLEYHSNLKSVMS-NMIKAQPVKGVGAEPGFSDV-LCYNISSQPK--FPEVTIHFR-GA 314
LP E +++L+ M + +P P + CY+ + P V F GA
Sbjct: 389 LPPEAYASLRDRFKFTMTQYKPA------PAYDPFDTCYDFTGHNAIFMPAVAFKFSDGA 442
Query: 315 DVKLSPSNLF---RNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKP 368
LSP + + + C AF + + + G Q + YD+ + F
Sbjct: 443 VFDLSPVAILIYPDDTAPATGCLAFVPRPSTMPFNIIGNTQQRGTEVIYDVAAEKIGFGQ 502
Query: 369 SRC 371
C
Sbjct: 503 FTC 505
>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
Length = 462
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 110/360 (30%), Positives = 158/360 (43%), Gaps = 40/360 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
+++ + GTP DTGSD +W QC PC C+KQ P+FDP KS+TY+++ C
Sbjct: 120 FVVTVGFGTPAQTYTLMFDTGSDVSWIQCLPC-SGHCYKQHDPIFDPTKSATYSAVPCGH 178
Query: 95 SQCAVVTSNCSE-GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
QCA CS G C Y YG G S ++G L+ ETL+ S L P FGCG
Sbjct: 179 PQCAAAGGKCSSNGTCLYKVQYGDG---SSTAGVLSHETLSLTSARAL----PGFAFGCG 231
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFGGIVAG 211
NL G+IGLG G SL SQ S FSYCLP +S + G
Sbjct: 232 ETNLG---DFGDVDGLIGLGRGQLSLSSQAAASFGAAFSYCLPSYNTSHGYLTIGTTTPA 288
Query: 212 A---GVVSTPLIIRDH----YYLSLEAISVGNQRLEF--VSSSTGNIFVDTGVLRTLLPL 262
+ GV T +I + Y++ L +I VG L + + +D+G + T LP
Sbjct: 289 SGSDGVRYTAMIQKQDYPSFYFVDLVSIVVGGFVLPVPPILFTRDGTLLDSGTVLTYLPP 348
Query: 263 EYHSNLKSVMS-NMIKAQPVKGVGAEPGFSDV-LCYNISSQPK--FPEVTIHFR-GADVK 317
E ++ L+ M + +P P + CY+ + Q P V+ F G+
Sbjct: 349 EAYTALRDRFKFTMTQYKPA------PAYDPFDTCYDFAGQNAIFMPLVSFKFSDGSSFD 402
Query: 318 LSPSNLF---RNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
LSP + + + C AF + + + G Q N + YD+ + F C
Sbjct: 403 LSPFGVLIFPDDTAPATGCLAFVPRPSTMPFTIVGNTQQRNTEMIYDVAAEKIGFVSGSC 462
>gi|326529233|dbj|BAK01010.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 441
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 117/408 (28%), Positives = 171/408 (41%), Gaps = 67/408 (16%)
Query: 2 QNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWT 61
Q Q+L D P + Y AE + IG PP +DTGS+ WT
Sbjct: 62 QQQQQLRASGDVSAPVHLATRQYIAEYL-----------IGDPPQRAAALIDTGSNLIWT 110
Query: 62 QC-EPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQ--CAV--VTSNCSEGDCSYSFLYG 116
QC C C KQ+ P ++ +SST+ ++ C+ S CA V +G C+++ YG
Sbjct: 111 QCGTTCGLKACAKQDLPYYNLSRSSTFAAVPCADSAKLCAANGVHLCGLDGSCTFAASYG 170
Query: 117 RGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGN 176
G+ G+L TE TF S + + FGC + + + +G+IGLG G
Sbjct: 171 AGSV----FGSLGTEAFTFQSGA------AKLGFGCVSLTRITKGALNGASGLIGLGRGR 220
Query: 177 SSLISQMGTSIAGKFSYCLP----DQGSSKINFGGIVA-----GAGVVSTPLI------- 220
SL+SQ G A KFSYCL + G+S F G A G V S P +
Sbjct: 221 LSLVSQTG---ATKFSYCLTPYLRNHGASSHLFVGASASLSGGGGAVTSIPFVKSPEDYP 277
Query: 221 IRDHYYLSLEAISVGNQRLEFVSSS-----------TGNIFVDTGVLRTLLPLEYHSNLK 269
YYL L ISVG +L S++ +G + +DTG T L +S L
Sbjct: 278 YSTFYYLPLVGISVGETKLPIPSAAFELRRVAAGYWSGGVIIDTGSPVTSLAEAAYSALS 337
Query: 270 SVMSNMIK---AQPVKGVGAEPGFSDVLCYNISSQPK-FPEVTIHF-RGADVKLSPSNLF 324
++ + QP G + LC K P + HF GAD+ +S + +
Sbjct: 338 DEVARQLNRSLVQPPADTGLD------LCVARQDVDKVVPVLVFHFGGGADMAVSAGSYW 391
Query: 325 RNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
+ C G V G Q + + YDI + +SF+ + C+
Sbjct: 392 GPVDKSTACMLIEEGGYETVIGNFQQQDVHLLYDIGKGELSFQTADCS 439
>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 533
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 101/357 (28%), Positives = 160/357 (44%), Gaps = 57/357 (15%)
Query: 52 VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNC------- 104
VDTGSD TW QCEPCP C+ Q PLFDP S T+ ++ C S CA +
Sbjct: 198 VDTGSDLTWVQCEPCPGSSCYAQRDPLFDPAASPTFAAVPCGSPACAASLKDATGAPGSC 257
Query: 105 ------SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG--HKN 156
SE C Y+ YG G SFS G LA +TL +T+ ++ +FGCG ++
Sbjct: 258 ARSAGNSEQRCYYALSYGDG---SFSRGVLAQDTLGLGTTT----KLDGFVFGCGLSNRG 310
Query: 157 LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGVVS 216
L T+ G++GLG + SL+SQ G FSYCLP ++ + G + G G S
Sbjct: 311 LFGGTA-----GLMGLGRTDLSLVSQTAARFGGVFSYCLP---ATTTSTGSLSLGPGPSS 362
Query: 217 T------PLIIRD-----HYYLSLE-AISVGNQRLEFVSSSTGNIFVDTGVLRTLLPLEY 264
+ +I D Y++++ A G L GN+ VD+G + T L
Sbjct: 363 SFPNMAYTRMIADPTQPPFYFINITGAAVGGGAALTAPGFGAGNVLVDSGTVITRLAPSV 422
Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQPK--FPEVTIHFR-GADVKLSP 320
+ +++ + + A PGFS + CY+++ + + P +T+ GA V +
Sbjct: 423 YKAVRAEFARRFEYP------AAPGFSILDACYDLTGRDEVNVPLLTLTLEGGAQVTVDA 476
Query: 321 SNLFRNISDE-----IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
+ + + + + ++ + + G Q N + YD + + F CT
Sbjct: 477 AGMLFVVRKDGSQVCLAMASLPYEDQTPIIGNYQQRNKRVVYDTVGSRLGFADEDCT 533
>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
Length = 494
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 116/381 (30%), Positives = 173/381 (45%), Gaps = 61/381 (16%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + IGTP + VDTGSD W C+ CP E ++DP+ S + +
Sbjct: 89 LYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELV 148
Query: 91 SCSSSQC-----AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
+C C V+ S S C YS YG G S ++G T+ L +N SG
Sbjct: 149 TCDQQFCVANYGGVLPSCTSTSPCEYSISYGDG---SSTAGFFVTDFLQYNQVSGDGQTT 205
Query: 146 P---NVIFGCGHK---NLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK----FSYCL 195
P +V FGCG K +L S S+ GI+G G NSS++SQ+ + AGK F++CL
Sbjct: 206 PANASVSFGCGAKLGGDLGS--SNLALDGILGFGQSNSSMLSQL--AAAGKVRKMFAHCL 261
Query: 196 PDQGSSKINFGGIVAGAGVV-----STPLI-IRDHYYLSLEAISVGNQRLE-----FVSS 244
+N GGI A VV +TPL+ HY + L+ I VG L F S
Sbjct: 262 -----DTVNGGGIFAIGNVVQPKVKTTPLVPDMPHYNVILKGIDVGGTALGLPTNIFDSG 316
Query: 245 STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLC--YNISSQP 302
++ +D+G +P + L +++ + + V+ + D C Y+ S
Sbjct: 317 NSKGTIIDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTL------QDFSCFQYSGSVDD 370
Query: 303 KFPEVTIHFRGADVKL--SPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQI-------N 352
FPEVT HF G DV L SP + LF+N + C F+ G G+ + + N
Sbjct: 371 GFPEVTFHFEG-DVSLIVSPHDYLFQN-GKNLYCMGFQNGGGKTKDGKDLGLLGDLVLSN 428
Query: 353 FLIGYDIEQAMVSFKPSRCTN 373
L+ YD+E + + C++
Sbjct: 429 KLVLYDLENQAIGWADYNCSS 449
>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
Length = 464
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 103/412 (25%), Positives = 171/412 (41%), Gaps = 73/412 (17%)
Query: 14 ETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFK 73
E + +++ + I+ YL+ L IGTPP ++DT SD WTQC+PC C+
Sbjct: 68 EAASARKAVVAETPIMPAGGEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQPC--TGCYH 125
Query: 74 QEPPLFDPKKSSTYNSISCSSSQC-AVVTSNCSEGD---CSYSFLYGRGAYASFSSGNLA 129
Q P+F+P+ SSTY ++ CSS C + C D C Y++ Y A + G LA
Sbjct: 126 QVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHDDDESCQYTYTYSGNAT---TEGTLA 182
Query: 130 TETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG 189
+ L + V FGC + + +G++GLG G SL+SQ+
Sbjct: 183 VDKLVIGEDA-----FRGVAFGCSTSSTGG-APPPQASGVVGLGRGPLSLVSQLSVR--- 233
Query: 190 KFSYCLPDQGSS---KINFGGIVAGAGVVSTPLII---RD-----HYYLSLEAISVGNQR 238
+F+YCLP S K+ G A + + + RD +YYL+L+ + +G++
Sbjct: 234 RFAYCLPPPASRIPGKLVLGADADAARNATNRIAVPMRRDPRYPSYYYLNLDGLLIGDRA 293
Query: 239 LEF---------------------------VSSSTGN---IFVDTGVLRTLLPLEYHSNL 268
+ V+ N + +D T L + L
Sbjct: 294 MSLPPTTTTTATATATAPAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLEASLYDEL 353
Query: 269 KSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKF-----PEVTIHFRGADVKLSPSNL 323
+ + I+ +G G+ G LC+ + F P V + F G ++L + L
Sbjct: 354 VNDLEVEIRLP--RGTGSSLGLD--LCFILPDGVAFDRVYVPAVALAFDGRWLRLDKARL 409
Query: 324 FRNISDE----IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
F + +M G+ +I+ G Q N + Y++ + V+F S C
Sbjct: 410 FAEDRESGMMCLMVGRAEAGSVSIL-GNFQQQNMQVLYNLRRGRVTFVQSPC 460
>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
Length = 464
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 103/412 (25%), Positives = 171/412 (41%), Gaps = 73/412 (17%)
Query: 14 ETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFK 73
E + +++ + I+ YL+ L IGTPP ++DT SD WTQC+PC C+
Sbjct: 68 EAASARKAVVAETPIMPAGGEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQPC--TGCYH 125
Query: 74 QEPPLFDPKKSSTYNSISCSSSQC-AVVTSNCSEGD---CSYSFLYGRGAYASFSSGNLA 129
Q P+F+P+ SSTY ++ CSS C + C D C Y++ Y A + G LA
Sbjct: 126 QVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHDDDESCQYTYTYSGNAT---TEGTLA 182
Query: 130 TETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG 189
+ L + V FGC + + +G++GLG G SL+SQ+
Sbjct: 183 VDKLVIGEDA-----FRGVAFGCSTSSTGG-APPPQASGVVGLGRGPLSLVSQLSVR--- 233
Query: 190 KFSYCLPDQGSS---KINFGGIVAGAGVVSTPLII---RD-----HYYLSLEAISVGNQR 238
+F+YCLP S K+ G A + + + RD +YYL+L+ + +G++
Sbjct: 234 RFAYCLPPPASRIPGKLVLGADADAARNATNRIAVPMRRDPRYPSYYYLNLDGLLIGDRT 293
Query: 239 LEF---------------------------VSSSTGN---IFVDTGVLRTLLPLEYHSNL 268
+ V+ N + +D T L + L
Sbjct: 294 MSLPPTTTTTATATATAPAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLEASLYDEL 353
Query: 269 KSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKF-----PEVTIHFRGADVKLSPSNL 323
+ + I+ +G G+ G LC+ + F P V + F G ++L + L
Sbjct: 354 VNDLEVEIRLP--RGTGSSLGLD--LCFILPDGVAFDRVYVPAVALAFDGRWLRLDKARL 409
Query: 324 FRNISDE----IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
F + +M G+ +I+ G Q N + Y++ + V+F S C
Sbjct: 410 FAEDRESGMMCLMVGRAEAGSVSIL-GNFQQQNMQVLYNLRRGRVTFVQSPC 460
>gi|255571588|ref|XP_002526740.1| aspartic-type endopeptidase, putative [Ricinus communis]
gi|223533929|gb|EEF35654.1| aspartic-type endopeptidase, putative [Ricinus communis]
Length = 471
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 105/389 (26%), Positives = 171/389 (43%), Gaps = 49/389 (12%)
Query: 11 NDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELD 70
N + P S ISII D +Y+M +IG+PPV+ + DTGS+ W QC +
Sbjct: 92 NSRKYPVSRISII--------DKVYVMKFNIGSPPVETYAIPDTGSNIVWIQCGSPICTN 143
Query: 71 CFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSE--------GDCSYSFLYGRGAYAS 122
C+KQ+ PLF+P KSSTY C +C E C Y Y S
Sbjct: 144 CYKQKIPLFNPTKSSTYAIRLCGHRECKQALWGLGEYLGCKSSVQVCRYHISYED---HS 200
Query: 123 FSSGNLATETLTF-NSTSGLPVEMPNVIFGCGHKNLASPTSDSKQ---TGIIGLGPGNSS 178
FS G ++T+ +TF + + FGCG+ N +P D G++GLG +S
Sbjct: 201 FSEGTISTDIITFPEHIAEFGNYSLRMFFGCGYNNSETPGQDPNSFTAPGVVGLGNEMAS 260
Query: 179 LISQMGTSIAGKFSYCL--PD----QGSSKINFGGIVAGAGVVSTPLIIRDHYYL--SLE 230
L+ Q+ G+FSYC+ PD G+ +I FG + +G + + +Y+ +++
Sbjct: 261 LVGQL---TLGQFSYCISTPDVQKPNGTIEIRFGLAASISGHSTALANNLEGWYIFQNVD 317
Query: 231 AISVGNQRL--------EFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVK 282
I V + ++ +F G + +D+G T L L + I+ P
Sbjct: 318 GIYVDDTKVKGYPEWVFQFAEGGIGGLIMDSGTTYTELYFSALDALIGELKEQIELAPDT 377
Query: 283 GVGAEPGFSDVLCYNISS--QPKFPEVTIHF---RGADVKLSPSNLFRNISDEIMCSAFR 337
+ +S LCYN ++ P + + F + A + N + + ++ C A
Sbjct: 378 QDHSNSNYS--LCYNAANFLLTYVPAIELKFTDNKEAYFPFTLRNAWIDNGNDQYCLAMF 435
Query: 338 GGNANIVYGRIMQINFLIGYDIEQAMVSF 366
G + + G + IGYD++ +VSF
Sbjct: 436 GTSGISIIGIYQHRDIKIGYDLKYNLVSF 464
>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
Length = 368
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 115/385 (29%), Positives = 170/385 (44%), Gaps = 67/385 (17%)
Query: 37 MHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQ 96
M L IG+ ++ +DTGS+ QC + P+FDP S +Y + C S
Sbjct: 1 MQLGIGSLQKNLSAIIDTGSEAVLVQCG--------SRSRPVFDPAASQSYRQVPCISQL 52
Query: 97 CAVVTSNCSEGD----------CSYSFLYGRGAYASFSSGNLATETLTFNST--SGLPVE 144
C V S G C+YS YG + S+G+ + + + NST S V+
Sbjct: 53 CLAVQQQTSNGSSQPCVNSSAACTYSLSYGD---SRNSTGDFSQDVIFLNSTNSSSQAVQ 109
Query: 145 MPNVIFGCGHKNLASPTS---DSKQTGIIGLGPGNSSLISQMGTSIAG-KFSYCLPDQGS 200
+V FGC H SP D GI+G GN SL SQ+ + G KFSYC P Q
Sbjct: 110 FRDVAFGCAH----SPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQPW 165
Query: 201 SKINFGGIVAG------AGVVSTPLI------IRDH-YYLSLEAISVGNQRLEFVSSS-- 245
G I G + V TPL+ R YY+ L +ISV + L S+
Sbjct: 166 QPRATGVIFLGDSGLSKSKVSYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFK 225
Query: 246 ------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI- 298
G +D+G T + + ++ ++ + ++ K VGA GF D CYNI
Sbjct: 226 LDPSTGDGGTVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKVGAAAGFDD--CYNIS 283
Query: 299 --SSQPKFPEVTIHFR-GADVKLSPSNLFRNIS---DEI-----MCSAFRGGNANI-VYG 346
SS P PEV + + ++L +LF +S +E+ + S+ + G I V G
Sbjct: 284 AGSSLPGVPEVRLSLQNNVRLELRFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKINVLG 343
Query: 347 RIMQINFLIGYDIEQAMVSFKPSRC 371
Q N+L+ YD E++ V F+ + C
Sbjct: 344 NYQQSNYLVEYDNERSRVGFERADC 368
>gi|125532792|gb|EAY79357.1| hypothetical protein OsI_34486 [Oryza sativa Indica Group]
Length = 396
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 104/362 (28%), Positives = 168/362 (46%), Gaps = 43/362 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQC-EPCPELDCFKQEPPLFDPKKSSTYNSISCS 93
Y+++L+IGTPP + +D G + WTQC + C CFKQ+ PLFD SST+ C
Sbjct: 51 YVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHCRR--CFKQDLPLFDTNASSTFRPEPCG 108
Query: 94 SSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
++ C + + GD + Y + G + T+ + + + + FGC
Sbjct: 109 AAVCESIPTRSCAGDGGGACGYEASTSFGRTVGRIGTDAVAIGTAA-----TARLAFGCA 163
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL--PDQGSSKINFGGIV-- 209
+ S +G +GLG N SL +QM A FSYCL PD G S F G
Sbjct: 164 VASEMDTMWGS--SGSVGLGRTNLSLAAQMN---ATAFSYCLAPPDTGKSSALFLGASAK 218
Query: 210 ---AGAGVVSTPLI---------IRDHYYLSLEAISVGNQRLEFVSSSTGN-IFVDTGVL 256
AG G +TP + + Y L LEAI GN + S GN I V T
Sbjct: 219 LAGAGKGAGTTPFVKTSTPPHSGLSRSYLLRLEAIRAGNATIAMPQS--GNTIMVSTATP 276
Query: 257 RTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY-NISSQPKFPEVTIHFR-GA 314
T L + +L+ +++ + A PV P + LC+ S+ P++ + F+ GA
Sbjct: 277 VTALVDSVYRDLRKAVADAVGAAPVP----PPVQNYDLCFPKASASGGAPDLVLAFQGGA 332
Query: 315 DVKLSPSNLFRNISDEIMCSAFRG----GNANIVYGRIMQINFLIGYDIEQAMVSFKPSR 370
++ + S+ + ++ C A G G +I+ G + Q+N + +D+++ +SF+P+
Sbjct: 333 EMTVPVSSYLFDAGNDTACVAILGSPALGGVSIL-GSLQQVNIHLLFDLDKETLSFEPAD 391
Query: 371 CT 372
C+
Sbjct: 392 CS 393
>gi|219886223|gb|ACL53486.1| unknown [Zea mays]
gi|238015146|gb|ACR38608.1| unknown [Zea mays]
gi|413938611|gb|AFW73162.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938612|gb|AFW73163.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938613|gb|AFW73164.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938614|gb|AFW73165.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
Length = 467
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 110/356 (30%), Positives = 171/356 (48%), Gaps = 35/356 (9%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+ + +GTP VDTGS TW QC PC + C +Q P+F+PK SS+Y S+SCS+
Sbjct: 129 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPC-VVSCHRQSGPVFNPKASSSYTSVSCSA 187
Query: 95 SQCAVVT------SNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
QC+ +T ++CS + C Y YG +SFS G L+ +T++F STS +PN
Sbjct: 188 QQCSDLTTATLNPASCSTSNVCIYQASYGD---SSFSVGYLSKDTVSFGSTS-----VPN 239
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGG 207
+GCG N + G+IGL SL+ Q+ S+ FSYCLP SS +
Sbjct: 240 FYYGCGQDNEG---LFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSSGYLS 296
Query: 208 IVA-GAGVVS-TPL----IIRDHYYLSLEAISVGNQRLEFVSSSTGNI--FVDTGVLRTL 259
I + G S TP+ + Y++ + I V + L SS+ ++ +D+G + T
Sbjct: 297 IGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVITR 356
Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFS--DVLCYNISSQPKFPEVTIHFRGADVK 317
LP +S L ++ +K P FS D +++ + PEVT+ F G
Sbjct: 357 LPTGVYSALSKAVAGAMKGTPRAS-----AFSILDTCFQGQAARLRVPEVTMAFAGGAAL 411
Query: 318 LSPS-NLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
+ NL ++ C AF + + G Q F + YD++ + + F C+
Sbjct: 412 KLAARNLLVDVDSATTCLAFAPARSAAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 467
>gi|212275300|ref|NP_001130675.1| uncharacterized protein LOC100191778 precursor [Zea mays]
gi|194706308|gb|ACF87238.1| unknown [Zea mays]
Length = 467
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 110/356 (30%), Positives = 171/356 (48%), Gaps = 35/356 (9%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+ + +GTP VDTGS TW QC PC + C +Q P+F+PK SS+Y S+SCS+
Sbjct: 129 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPC-VVSCHRQSGPVFNPKASSSYTSVSCSA 187
Query: 95 SQCAVVT------SNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
QC+ +T ++CS + C Y YG +SFS G L+ +T++F STS +PN
Sbjct: 188 QQCSDLTTATLSPASCSTSNVCIYQASYGD---SSFSVGYLSKDTVSFGSTS-----VPN 239
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGG 207
+GCG N + G+IGL SL+ Q+ S+ FSYCLP SS +
Sbjct: 240 FYYGCGQDNEG---LFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSSGYLS 296
Query: 208 IVA-GAGVVS-TPL----IIRDHYYLSLEAISVGNQRLEFVSSSTGNI--FVDTGVLRTL 259
I + G S TP+ + Y++ + I V + L SS+ ++ +D+G + T
Sbjct: 297 IGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVITR 356
Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFS--DVLCYNISSQPKFPEVTIHFRGADVK 317
LP +S L ++ +K P FS D +++ + PEVT+ F G
Sbjct: 357 LPTGVYSALSKAVAGAMKGTPRAS-----AFSILDTCFQGQAARLRVPEVTMAFAGGAAL 411
Query: 318 LSPS-NLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
+ NL ++ C AF + + G Q F + YD++ + + F C+
Sbjct: 412 KLAARNLLVDVDSATTCLAFAPARSAAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 467
>gi|222616654|gb|EEE52786.1| hypothetical protein OsJ_35257 [Oryza sativa Japonica Group]
Length = 346
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 103/361 (28%), Positives = 155/361 (42%), Gaps = 41/361 (11%)
Query: 37 MHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPP---LFDPKKSSTYNSISCS 93
M +S+GTPPV ++DTGS +W QC+ C ++ C+ Q +F+P SSTY+ + CS
Sbjct: 1 MGISLGTPPVFNLVTIDTGSTLSWVQCKNC-QIKCYDQAAKAGQIFNPYNSSTYSKVGCS 59
Query: 94 SSQC------AVVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
+ C V C E D C YS YG G Y S G L + LT S +
Sbjct: 60 TEACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEY---SVGYLGKDRLTLASNRSI---- 112
Query: 146 PNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQM-GTSIAGKFSYCLP--DQGSSK 202
N IFGCG NL + + GIIG G + S +Q+ + FSYC P +
Sbjct: 113 DNFIFGCGEDNLYNGVN----AGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPRDHENEGS 168
Query: 203 INFGGIVAGAGVVSTPLIIRDH---YYLSLEAISVGNQRLE-----FVSSSTGNIFVDTG 254
+ G ++ T LI DH Y + + V RLE ++S T VD+G
Sbjct: 169 LTIGPYARDINLMWTKLIYYDHKPAYAIQQLDMMVNGIRLEIDPYIYISKMT---IVDSG 225
Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGA 314
T + L M+ ++A+ E + ++ FP V + +
Sbjct: 226 TADTYILSPVFDALDKAMTKEMQAKGYTRGWDERRICFISNSGSANWNDFPTVEMKLIRS 285
Query: 315 DVKLSPSNLFRNISDEIMCSAFRGGNANI----VYGRIMQINFLIGYDIEQAMVSFKPSR 370
+KL N F S+ ++CS F +A + + G +F + +DI+ FK
Sbjct: 286 TLKLPVENAFYESSNNVICSTFLPDDAGVRGVQMLGNRAVRSFKLVFDIQAMNFGFKARA 345
Query: 371 C 371
C
Sbjct: 346 C 346
>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
Length = 478
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 107/374 (28%), Positives = 165/374 (44%), Gaps = 38/374 (10%)
Query: 30 SVDDIYLMHLSIGTPPVDIFGSVDTGSDCTW---TQCEPCPELDCFKQEPPLFDPKKSST 86
+V +Y + +G+P D + VDTGSD W +C CP L+DPK+S T
Sbjct: 64 TVTGLYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKT 123
Query: 87 YNSISCSSSQCA-----VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGL 141
+SC + C+ + +E C YS YG G S ++G + LTFN +G
Sbjct: 124 SEFVSCEHNFCSSTYEGRILGCKAENPCPYSISYGDG---SATTGYYVQDYLTFNRVNGN 180
Query: 142 P---VEMPNVIFGCGHKNLASPTSDSKQT--GIIGLGPGNSSLISQMGTS--IAGKFSYC 194
P + ++IFGCG + S S++ GIIG G NSS++SQ+ S + FS+C
Sbjct: 181 PHTATQNSSIIFGCGAAQSGTFASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHC 240
Query: 195 LPDQGSSKINFGGIVAGAGVVSTPLIIR-DHYYLSLEAISVGNQRLE-----FVSSSTGN 248
L I G V V +TPL+ HY + L+ I V L+ F S +
Sbjct: 241 LDTNVGGGIFSIGEVVEPKVKTTPLVPNMAHYNVILKNIEVDGDILQLPSDTFDSENGKG 300
Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQP-VKGVGAEPGFSDVLCYNISSQPKFPEV 307
+D+G LP + L MS ++ QP +K E +S Y + FP V
Sbjct: 301 TVIDSGTTLAYLPRIVYDQL---MSKVLAKQPRLKVYLVEEQYS-CFQYTGNVDSGFPIV 356
Query: 308 TIHFRGA-DVKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQI-------NFLIGYD 358
+HF + + + P + LF D C ++ + G+ M + N L+ YD
Sbjct: 357 KLHFEDSLSLTVYPHDYLFNYKGDSYWCIGWQKSASETKNGKDMTLLGDFVLSNKLVVYD 416
Query: 359 IEQAMVSFKPSRCT 372
+E + + C+
Sbjct: 417 LENMTIGWTDYNCS 430
>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
Length = 472
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 109/371 (29%), Positives = 172/371 (46%), Gaps = 58/371 (15%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ L GTPP + +DTGS+ W C PC KQ+P F+P KSSTYN ++C+S
Sbjct: 124 YIIKLGFGTPPQSFYTVLDTGSNIAWIPCNPCSGCSS-KQQP--FEPSKSSTYNYLTCAS 180
Query: 95 SQCAVVTSNCSEGD----CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
QC ++ C++ D CS + YG S L++ETL+ S ++ N +F
Sbjct: 181 QQCQLLRV-CTKSDNSVNCSLTQRYGD---QSEVDEILSSETLSVGSQ-----QVENFVF 231
Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVA 210
GC + A+ + ++G G S +SQ T FSYCLP SS G ++
Sbjct: 232 GCSN---AARGLIQRTPSLVGFGRNPLSFVSQTATLYDSTFSYCLPSLFSSAFT-GSLLL 287
Query: 211 GAGVVS------TPLIIRDH----YYLSLEAISVGNQRLEF------VSSSTGN-IFVDT 253
G +S TPL+ YY+ L ISVG + + + STG +D+
Sbjct: 288 GKEALSAQGLKFTPLLSNSRYPSFYYVGLNGISVGEELVSIPAGTLSLDESTGRGTIIDS 347
Query: 254 G-VLRTLLPLEYHS---NLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI-SSQPKFPEVT 308
G V+ L+ Y++ + +S +SN+ A P CYN S +FP +T
Sbjct: 348 GTVITRLVEPAYNAMRDSFRSQLSNLTMASPTDLFDT--------CYNRPSGDVEFPLIT 399
Query: 309 IHF-RGADVKLSPSNLFRNISDE--IMCSAFR---GGNANIV--YGRIMQINFLIGYDIE 360
+HF D+ L N+ +D+ ++C AF GG +++ +G Q I +D+
Sbjct: 400 LHFDDNLDLTLPLDNILYPGNDDGSVLCLAFGLPPGGGDDVLSTFGNYQQQKLRIVHDVA 459
Query: 361 QAMVSFKPSRC 371
++ + C
Sbjct: 460 ESRLGIASENC 470
>gi|195638734|gb|ACG38835.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 465
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 110/356 (30%), Positives = 172/356 (48%), Gaps = 35/356 (9%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+ + +GTP VDTGS TW QC PC + C +Q P+F+PK SS+Y S+SCS+
Sbjct: 127 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPC-VVSCHRQSGPVFNPKASSSYASVSCSA 185
Query: 95 SQCAVVT------SNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
QC+ +T ++CS + C Y YG +SFS G L+ +T++F STS +PN
Sbjct: 186 QQCSDLTTATLNPASCSTSNVCIYQASYGD---SSFSVGYLSKDTVSFGSTS-----VPN 237
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGG 207
+GCG N + G+IGL SL+ Q+ S+ FSYCLP SS +
Sbjct: 238 FYYGCGQDNEG---LFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSSGYLS 294
Query: 208 IVA-GAGVVS-TPL----IIRDHYYLSLEAISVGNQRLEFVSSSTGNI--FVDTGVLRTL 259
I + G S TP+ + Y++ + I V + L SS+ ++ +D+G + T
Sbjct: 295 IGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVITR 354
Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFS--DVLCYNISSQPKFPEVTIHFRGADVK 317
LP +S L ++ +K P FS D +++ + PEVT+ F G
Sbjct: 355 LPTGVYSALSKAVAGAMKGTPRAS-----AFSILDTCFQGQAARLRVPEVTMAFAGGAAL 409
Query: 318 LSPS-NLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
+ NL ++ C AF + + G Q F + YD++ + + F + C+
Sbjct: 410 KLAARNLLVDVDSATTCLAFAPARSAAIIGNTQQQTFSVVYDVKNSKIGFAAAGCS 465
>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 481
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 109/378 (28%), Positives = 169/378 (44%), Gaps = 55/378 (14%)
Query: 36 LMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSISC 92
L + IG P D + VDTGSD W C CP+ + L+DP S T ++ C
Sbjct: 75 LYYTKIGLGPKDYYVQVDTGSDTLWVNCVGCTACPKKSGLGMDLTLYDPNLSKTSKAVPC 134
Query: 93 SSSQCAVV----TSNCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
C S C++G C YS YG G S +SG+ + LTF+ G +P+
Sbjct: 135 DDEFCTSTYDGQISGCTKGMSCPYSITYGDG---STTSGSYIKDDLTFDRVVGDLRTVPD 191
Query: 148 ---VIFGCGHKN--LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK----FSYCLPDQ 198
VIFGCG K S T+D+ GIIG G NSS++SQ+ + AGK FS+CL
Sbjct: 192 NTSVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQL--AAAGKVKRIFSHCL--- 246
Query: 199 GSSKINFGGIVAGAGVV-----STPLII-RDHYYLSLEAISVGNQRLEF----VSSSTGN 248
I+ GGI A VV +TPL+ HY + L+ I V ++ + SS+G
Sbjct: 247 --DSISGGGIFAIGEVVQPKVKTTPLLQGMAHYNVVLKDIEVAGDPIQLPSDILDSSSGR 304
Query: 249 -IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK---- 303
+D+G LP+ + L + + + +K E F+ C++ S +
Sbjct: 305 GTIIDSGTTLAYLPVSIYDQLLEKI--LAQRSGMKLYLVEDQFT---CFHYSDEESVDDL 359
Query: 304 FPEVTIHF-RGADVKLSPSNLFRNISDEIMCSAF-------RGGNANIVYGRIMQINFLI 355
FP V F G + P + +++ C + + G I+ G ++ N L+
Sbjct: 360 FPTVKFTFEEGLTLTTYPRDYLFLFKEDMWCVGWQKSMAQTKDGKELILLGDLVLANKLV 419
Query: 356 GYDIEQAMVSFKPSRCTN 373
YD++ + + C++
Sbjct: 420 VYDLDNMAIGWADYNCSS 437
>gi|21717157|gb|AAM76350.1|AC074196_8 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433294|gb|AAP54832.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 396
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 104/362 (28%), Positives = 168/362 (46%), Gaps = 43/362 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQC-EPCPELDCFKQEPPLFDPKKSSTYNSISCS 93
Y+++L+IGTPP + +D G + WTQC + C CFKQ+ PLFD SST+ C
Sbjct: 51 YVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHCRR--CFKQDLPLFDTNASSTFRPEPCG 108
Query: 94 SSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
++ C + + GD + Y + G + T+ + + + + FGC
Sbjct: 109 AAVCESIPTRSCAGDGGGACGYEASTSFGRTVGRIGTDAVAIGTAA-----TARLAFGCA 163
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL--PDQGSSKINFGGIV-- 209
+ S +G +GLG N SL +QM A FSYCL PD G S F G
Sbjct: 164 VASEMDTMWGS--SGSVGLGRTNLSLAAQMN---ATAFSYCLAPPDTGKSSALFLGASAK 218
Query: 210 ---AGAGVVSTPLI---------IRDHYYLSLEAISVGNQRLEFVSSSTGN-IFVDTGVL 256
AG G +TP + + Y L LEAI GN + S GN I V T
Sbjct: 219 LAGAGKGAGTTPFVKTSTPPNSGLSRSYLLRLEAIRAGNATIAMPQS--GNTITVSTATP 276
Query: 257 RTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY-NISSQPKFPEVTIHFR-GA 314
T L + +L+ +++ + A PV P + LC+ S+ P++ + F+ GA
Sbjct: 277 VTALVDSVYRDLRKAVADAVGAAPVP----PPVQNYDLCFPKASASGGAPDLVLAFQGGA 332
Query: 315 DVKLSPSNLFRNISDEIMCSAFRG----GNANIVYGRIMQINFLIGYDIEQAMVSFKPSR 370
++ + S+ + ++ C A G G +I+ G + Q+N + +D+++ +SF+P+
Sbjct: 333 EMTVPVSSYLFDAGNDTACVAILGSPALGGVSIL-GSLQQVNIHLLFDLDKETLSFEPAD 391
Query: 371 CT 372
C+
Sbjct: 392 CS 393
>gi|21717169|gb|AAM76362.1|AC074196_20 putative nucleoid DNA binding protein, 3'-partial [Oryza sativa
Japonica Group]
Length = 377
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 93/339 (27%), Positives = 144/339 (42%), Gaps = 37/339 (10%)
Query: 13 NETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCF 72
+ TP + + +S +Y+ + +IGTPP + VD + WTQC PC CF
Sbjct: 35 DATPPAAGGAVAVPIYLSSQGLYVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQP--CF 92
Query: 73 KQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNCSEGDCSYSFLYGRGAYASFSSGNLA 129
+Q+ PLFDP KSST+ + C S C + + NC+ C +Y A + G
Sbjct: 93 EQDLPLFDPTKSSTFRGLPCGSHLCESIPESSRNCTSDVC----IYEAPTKAGDTGGKAG 148
Query: 130 TETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG 189
T+T + + FGC + +GI+GLG SL++QM +
Sbjct: 149 TDTFAIGAAK------ETLGFGCVVMTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVT--- 199
Query: 190 KFSYCLPDQGSSKINFGGI---VAGAGVVSTPLIIRD-----------HYYLSLEAISVG 235
FSYCL + S + G +AG STP +I+ +Y + L I G
Sbjct: 200 AFSYCLAGKSSGALFLGATAKQLAGGKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKTG 259
Query: 236 NQRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLC 295
L+ SSS + +DT + L + LK ++ + QPV A P LC
Sbjct: 260 GAPLQAASSSGSTVLLDTVSRASYLADGAYKALKKALTAAVGVQPV----ASPPKPYDLC 315
Query: 296 YNISSQPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMC 333
+ + PE+ F GA + + P+N + +C
Sbjct: 316 FPKAVAGDAPELVFTFDGGAALTVPPANYLLASGNGTVC 354
>gi|297605079|ref|NP_001056639.2| Os06g0121800 [Oryza sativa Japonica Group]
gi|255676668|dbj|BAF18553.2| Os06g0121800 [Oryza sativa Japonica Group]
Length = 487
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 104/343 (30%), Positives = 146/343 (42%), Gaps = 42/343 (12%)
Query: 51 SVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNCSEG 107
S+DT D W QC PCP +C+ Q+ LFDP++S T ++ C S+ C + + CS
Sbjct: 165 SIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCSNN 224
Query: 108 DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHK---NLASPTSDS 164
C Y YG G +SG + LT N ++ V M N FGC H N ++ TS
Sbjct: 225 QCQYFVDYGDG---RATSGTYMVDALTLNPST---VVM-NFRFGCSHAVRGNFSASTS-- 275
Query: 165 KQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKI---------NFGGIVAGAGVV 215
G + LG G SL+SQ + FSYC+PD SS G A +V
Sbjct: 276 ---GTMSLGGGRQSLLSQTAATFGNAFSYCVPDPSSSGFLSLGGPADGGGAGRFARTPLV 332
Query: 216 STPLIIRDHYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLLPLEYHSNLKSVMSN 274
P II Y + L I VG +RL G +D+ V+ T LP + L+ +
Sbjct: 333 RNPSIIPTLYLVRLRGIEVGGRRLNVPPVVFAGGAVMDSSVIITQLPPTAYRALRLAFRS 392
Query: 275 MIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFR-GADVKLSPSNLFRNISDEI 331
+ A P + G G CY+ + P V++ F GA V+L +
Sbjct: 393 AMAAYP-RVAGGRAGLDT--CYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMVE----- 444
Query: 332 MCSAFR---GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
C AF G A G + Q + YD+ V F+ C
Sbjct: 445 GCLAFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 487
>gi|223975883|gb|ACN32129.1| unknown [Zea mays]
gi|223975971|gb|ACN32173.1| unknown [Zea mays]
gi|224034191|gb|ACN36171.1| unknown [Zea mays]
gi|413938623|gb|AFW73174.1| aspartic proteinase nepenthesin-1 isoform 1 [Zea mays]
gi|413938624|gb|AFW73175.1| aspartic proteinase nepenthesin-1 isoform 2 [Zea mays]
Length = 465
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 110/356 (30%), Positives = 171/356 (48%), Gaps = 35/356 (9%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+ + +GTP VDTGS TW QC PC + C +Q P+F+PK SS+Y S+SCS+
Sbjct: 127 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPC-VVSCHRQSGPVFNPKASSSYASVSCSA 185
Query: 95 SQCAVVT------SNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
QC+ +T ++CS + C Y YG +SFS G L+ +T++F STS +PN
Sbjct: 186 QQCSDLTTATLNPASCSTSNVCIYQASYGD---SSFSVGYLSKDTVSFGSTS-----VPN 237
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGG 207
+GCG N + G+IGL SL+ Q+ S+ FSYCLP SS +
Sbjct: 238 FYYGCGQDNEG---LFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSSGYLS 294
Query: 208 IVA-GAGVVS-TPL----IIRDHYYLSLEAISVGNQRLEFVSSSTGNI--FVDTGVLRTL 259
I + G S TP+ + Y++ + I V + L SS+ ++ +D+G + T
Sbjct: 295 IGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVITR 354
Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFS--DVLCYNISSQPKFPEVTIHFRGADVK 317
LP +S L ++ +K P FS D +++ + PEVT+ F G
Sbjct: 355 LPTGVYSALSKAVAGAMKGTPRAS-----AFSILDTCFQGQAARLRVPEVTMAFAGGAAL 409
Query: 318 LSPS-NLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
+ NL ++ C AF + + G Q F + YD++ + + F C+
Sbjct: 410 KLAARNLLVDVDSATTCLAFAPARSAAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 465
>gi|356531884|ref|XP_003534506.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 482
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 109/378 (28%), Positives = 165/378 (43%), Gaps = 55/378 (14%)
Query: 36 LMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSISC 92
L + IG P D + VDTGSD W C CP+ E L+DP S T + C
Sbjct: 76 LYYTKIGLGPNDYYVQVDTGSDTLWVNCVGCTTCPKKSGLGMELTLYDPNSSKTSKVVPC 135
Query: 93 SSSQCAVV----TSNCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
C S C + C YS YG G S +SG+ + LTF+ G +P+
Sbjct: 136 DDEFCTSTYDGPISGCKKDMSCPYSITYGDG---STTSGSYIKDDLTFDRVVGDLRTVPD 192
Query: 148 ---VIFGCGHKN--LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK----FSYCLPDQ 198
VIFGCG K S T+D+ GIIG G NSS++SQ+ + AGK FS+CL
Sbjct: 193 NTSVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQL--AAAGKVKRVFSHCL--- 247
Query: 199 GSSKINFGGIVAGAGVV-----STPLIIR-DHYYLSLEAISVGNQRLE-----FVSSSTG 247
+N GGI A VV +TPL+ R HY + L+ I V ++ F S+S
Sbjct: 248 --DTVNGGGIFAIGEVVQPKVKTTPLVPRMAHYNVVLKDIEVAGDPIQLPTDIFDSTSGR 305
Query: 248 NIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP----K 303
+D+G LP+ + L + + + ++ E F+ C++ S +
Sbjct: 306 GTIIDSGTTLAYLPVSIYDQL--LEKTLAQRSGMELYLVEDQFT---CFHYSDEKSLDDA 360
Query: 304 FPEVTIHF-RGADVKLSPSNLFRNISDEIMCSAFRGGNAN-------IVYGRIMQINFLI 355
FP V F G + P + +++ C ++ A I+ G ++ N L
Sbjct: 361 FPTVKFTFEEGLTLTAYPHDYLFPFKEDMWCIGWQKSTAQTKDGKDLILLGDLVLTNKLF 420
Query: 356 GYDIEQAMVSFKPSRCTN 373
YD++ + + C++
Sbjct: 421 IYDLDNMSIGWTDYNCSS 438
>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
Length = 339
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 105/359 (29%), Positives = 157/359 (43%), Gaps = 51/359 (14%)
Query: 41 IGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKK-SSTYNSISCSSSQCAV 99
+GTPP + ++ G++ W P PE CF+Q P F+P S SC S +
Sbjct: 1 MGTPPNPVKLKLENGNELIWNHSNPSPE--CFEQAFPYFEPLTFSRGLPFASCGSPKF-- 56
Query: 100 VTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLAS 159
C Y++ YG S ++G L + TF G +P V FGCG N +
Sbjct: 57 ----WPNQTCVYTYSYGD---KSVTTGFLEVDKFTF---VGAGASVPGVAFGCGLFN--N 104
Query: 160 PTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYC-------LPDQGSSKINFGGIVAGA 212
S +TGI G G G SL SQ+ G FS+C +P + G
Sbjct: 105 GVFKSNETGIAGFGRGPLSLPSQLK---VGNFSHCFTTITGAIPSTVLLDLPADLFSNGQ 161
Query: 213 GVV-STPLIIRDH-------YYLSLEAISVGNQRLEF------VSSSTGNIFVDTGVLRT 258
G V +TPLI YYLSL+ I+VG+ RL +++ TG +D+G T
Sbjct: 162 GAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALTNGTGGTIIDSGTSIT 221
Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ--PKFPEVTIHFRGADV 316
LP + + ++ + IK V G C++ SQ P P++ +HF GA +
Sbjct: 222 SLPPQVYQVVRDEFAAQIKLPVVPGNAT----GHYTCFSAPSQAKPDVPKLVLHFEGATM 277
Query: 317 KLSPSNLFRNISDE----IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
L N + D+ I+C A G+ + G Q N + YD++ M+SF ++C
Sbjct: 278 DLPRENYVFEVPDDAGNSIICLAINKGDETTIIGNFQQQNMHVLYDLQNNMLSFVAAQC 336
>gi|218197468|gb|EEC79895.1| hypothetical protein OsI_21423 [Oryza sativa Indica Group]
Length = 471
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 104/343 (30%), Positives = 146/343 (42%), Gaps = 42/343 (12%)
Query: 51 SVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNCSEG 107
S+DT D W QC PCP +C+ Q+ LFDP++S T ++ C S+ C + + CS
Sbjct: 149 SIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCSNN 208
Query: 108 DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHK---NLASPTSDS 164
C Y YG G +SG + LT N ++ V M N FGC H N ++ TS
Sbjct: 209 QCQYFVDYGDG---RATSGTYMVDALTLNPST---VVM-NFRFGCSHAVRGNFSASTS-- 259
Query: 165 KQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKI---------NFGGIVAGAGVV 215
G + LG G SL+SQ + FSYC+PD SS G A +V
Sbjct: 260 ---GTMSLGGGRQSLLSQTAATFGNAFSYCVPDPSSSGFLSLGGPADGGGAGRFARTPLV 316
Query: 216 STPLIIRDHYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLLPLEYHSNLKSVMSN 274
P II Y + L I VG +RL G +D+ V+ T LP + L+ +
Sbjct: 317 RNPSIIPTLYLVRLRGIEVGGRRLNVPPVVFAGGAVMDSSVIITQLPPTAYRALRLAFRS 376
Query: 275 MIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFR-GADVKLSPSNLFRNISDEI 331
+ A P + G G CY+ + P V++ F GA V+L +
Sbjct: 377 AMAAYP-RVAGGRAGLDT--CYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMVE----- 428
Query: 332 MCSAFR---GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
C AF G A G + Q + YD+ V F+ C
Sbjct: 429 GCLAFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 471
>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 417
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 104/355 (29%), Positives = 159/355 (44%), Gaps = 53/355 (14%)
Query: 52 VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVT---------S 102
VDTGSD TW QC PC C+ Q+ PLF+P SS++ S+ C+S C + S
Sbjct: 81 VDTGSDLTWVQCLPCRL--CYNQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCS 138
Query: 103 NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTS 162
N + C Y YG G+Y S G L E LT T E+ N IFGCG N
Sbjct: 139 NKNSTSCDYQIDYGDGSY---SRGELGFEKLTLGKT-----EIDNFIFGCGRNNKGLFGG 190
Query: 163 DSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG---SSKINFGGI----------V 209
S G++GL SL+SQ + FSYCLP G S + GG +
Sbjct: 191 AS---GLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLGGADFSNFKNISPI 247
Query: 210 AGAGVVSTPLIIRDHYYLSLEAISVGNQRLEF--VSSSTGNI-FVDTGVLRTLLPLEYHS 266
+ ++ P + + Y+L+L IS+G L +SS+ G + +D+G + T L +
Sbjct: 248 SYTRMIQNPQ-MSNFYFLNLTGISIGGVNLNVPRLSSNEGVLSLLDSGTVITRLSPSIYK 306
Query: 267 NLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFRG-ADVKLSPSN 322
K+ + G PGFS + C+N++ + P V F G A++ +
Sbjct: 307 AFKAEFE-----KQFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMIVDVEG 361
Query: 323 LFRNISDEI--MCSAFRG---GNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
+F + + +C AF + ++ G Q N + Y+ +++ V F C+
Sbjct: 362 VFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPCS 416
>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
Length = 436
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 111/370 (30%), Positives = 173/370 (46%), Gaps = 55/370 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +G + + VDTGSD TW QC+PC C+ Q+ PL+DP SS+Y ++ C+S
Sbjct: 87 YIVTVELGGKNMSLI--VDTGSDLTWVQCQPCRS--CYNQQGPLYDPSVSSSYKTVFCNS 142
Query: 95 SQC---AVVTSN---CSEGD------CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
S C TSN C + C Y YG G+Y + G+LA+E++ T
Sbjct: 143 STCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSY---TRGDLASESILLGDT---- 195
Query: 143 VEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQG 199
++ N +FGCG N S G+ + SL+SQ + G FSYCLP D
Sbjct: 196 -KLENFVFGCGRNNKGLFGGSSGLMGLG---RSSVSLVSQTLKTFNGVFSYCLPSLEDGA 251
Query: 200 SSKINFGG----IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSSTGN-IF 250
S ++FG V TPL+ +R Y L+L S+G +E SSS G I
Sbjct: 252 SGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGG--VELKSSSFGRGIL 309
Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQPK--FPEV 307
+D+G + T LP + K+V +K G PG+S + C+N++S P +
Sbjct: 310 IDSGTVITRLPPSIY---KAVKIEFLKQ--FSGFPTAPGYSILDTCFNLTSYEDISIPII 364
Query: 308 TIHFRG-ADVKLSPSNLFRNISDE--IMCSAFRG---GNANIVYGRIMQINFLIGYDIEQ 361
+ F+G A++++ + +F + + ++C A N + G Q N + YD Q
Sbjct: 365 KMIFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQ 424
Query: 362 AMVSFKPSRC 371
+ C
Sbjct: 425 ERLGIVGENC 434
>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family
[Arabidopsis thaliana]
Length = 449
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 98/359 (27%), Positives = 157/359 (43%), Gaps = 36/359 (10%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDC---FKQEPPLFDPKKSSTYNSI 90
+Y + +G+PP + VDTGSD W C+PCP+ LFD SST +
Sbjct: 73 LYFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSLFDMNASSTSKKV 132
Query: 91 SCSSSQCAVVTSNCS---EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP- 146
C C+ ++ + S CSY +Y A S S G + LT +G P
Sbjct: 133 GCDDDFCSFISQSDSCQPALGCSYHIVY---ADESTSDGKFIRDMLTLEQVTGDLKTGPL 189
Query: 147 --NVIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGTSIAGK--FSYCLPDQGSS 201
V+FGCG DS G++G G N+S++SQ+ + K FS+CL +
Sbjct: 190 GQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKGG 249
Query: 202 KINFGGIVAGAGVVSTPLIIRD-HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRT 258
I G+V V +TP++ HY + L + V L+ S G VD+G
Sbjct: 250 GIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDVDGTSLDLPRSIVRNGGTIVDSGTTLA 309
Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHFRGADV 316
P + S++ ++ QPVK E F C++ S+ FP V+ F + V
Sbjct: 310 YFPKVLYD---SLIETILARQPVKLHIVEETFQ---CFSFSTNVDEAFPPVSFEFEDS-V 362
Query: 317 KLS--PSNLFRNISDEIMCSAFRGGNAN-------IVYGRIMQINFLIGYDIEQAMVSF 366
KL+ P + + +E+ C ++ G I+ G ++ N L+ YD++ ++ +
Sbjct: 363 KLTVYPHDYLFTLEEELYCFGWQAGGLTTDERSEVILLGDLVLSNKLVVYDLDNEVIGW 421
>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 496
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 104/355 (29%), Positives = 159/355 (44%), Gaps = 53/355 (14%)
Query: 52 VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVT---------S 102
VDTGSD TW QC PC C+ Q+ PLF+P SS++ S+ C+S C + S
Sbjct: 160 VDTGSDLTWVQCLPCRL--CYNQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCS 217
Query: 103 NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTS 162
N + C Y YG G+Y S G L E LT T E+ N IFGCG N
Sbjct: 218 NKNSTSCDYQIDYGDGSY---SRGELGFEKLTLGKT-----EIDNFIFGCGRNNKGLFGG 269
Query: 163 DSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG---SSKINFGGI----------V 209
S G++GL SL+SQ + FSYCLP G S + GG +
Sbjct: 270 AS---GLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLGGADFSNFKNISPI 326
Query: 210 AGAGVVSTPLIIRDHYYLSLEAISVGNQRLEF--VSSSTGNI-FVDTGVLRTLLPLEYHS 266
+ ++ P + + Y+L+L IS+G L +SS+ G + +D+G + T L +
Sbjct: 327 SYTRMIQNPQ-MSNFYFLNLTGISIGGVNLNVPRLSSNEGVLSLLDSGTVITRLSPSIYK 385
Query: 267 NLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFRG-ADVKLSPSN 322
K+ + G PGFS + C+N++ + P V F G A++ +
Sbjct: 386 AFKAEFE-----KQFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMIVDVEG 440
Query: 323 LFRNISDEI--MCSAFRG---GNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
+F + + +C AF + ++ G Q N + Y+ +++ V F C+
Sbjct: 441 VFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPCS 495
>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 484
Score = 114 bits (285), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 111/370 (30%), Positives = 173/370 (46%), Gaps = 55/370 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +G + + VDTGSD TW QC+PC C+ Q+ PL+DP SS+Y ++ C+S
Sbjct: 135 YIVTVELGGKNMSLI--VDTGSDLTWVQCQPCR--SCYNQQGPLYDPSVSSSYKTVFCNS 190
Query: 95 SQC---AVVTSN---CSEGD------CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
S C TSN C + C Y YG G+Y + G+LA+E++ T
Sbjct: 191 STCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSY---TRGDLASESILLGDT---- 243
Query: 143 VEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQG 199
++ N +FGCG N S G+ + SL+SQ + G FSYCLP D
Sbjct: 244 -KLENFVFGCGRNNKGLFGGSSGLMGLG---RSSVSLVSQTLKTFNGVFSYCLPSLEDGA 299
Query: 200 SSKINFGG----IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSSTGN-IF 250
S ++FG V TPL+ +R Y L+L S+G +E SSS G I
Sbjct: 300 SGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGG--VELKSSSFGRGIL 357
Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP--KFPEV 307
+D+G + T LP + K+V +K G PG+S + C+N++S P +
Sbjct: 358 IDSGTVITRLPPSIY---KAVKIEFLKQ--FSGFPTAPGYSILDTCFNLTSYEDISIPII 412
Query: 308 TIHFRG-ADVKLSPSNLFRNISDE--IMCSAFRG---GNANIVYGRIMQINFLIGYDIEQ 361
+ F+G A++++ + +F + + ++C A N + G Q N + YD Q
Sbjct: 413 KMIFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQ 472
Query: 362 AMVSFKPSRC 371
+ C
Sbjct: 473 ERLGIVGENC 482
>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
Length = 427
Score = 114 bits (285), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 110/382 (28%), Positives = 183/382 (47%), Gaps = 59/382 (15%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP--PLFDPKKSSTYNSISC 92
Y + L +GTP VDTGSD TW QC P P P P +D SS+Y I C
Sbjct: 59 YFVELRVGTPAKKFPLIVDTGSDLTWIQCNP-PNTTANSSSPPAPWYDKSSSSSYREIPC 117
Query: 93 SSSQC----AVVTSNC---SEGDCSYSFLYGRGAYASFSSGNLATETLTFNS--TSG--- 140
+ +C A + S+C S C Y++ Y + S ++G LA ET++ S SG
Sbjct: 118 TDDECQFLPAPIGSSCSITSPSPCDYTYGY---SDQSRTTGILAYETISMKSRKRSGKRA 174
Query: 141 -----LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQ-MGTSIAGKFSYC 194
+ + NV GC +++ + S +G++GLG G SL +Q T++ G FSYC
Sbjct: 175 GNHKTRRIRIKNVALGCSRESVGA--SFLGASGVLGLGQGPISLATQTRHTALGGIFSYC 232
Query: 195 LPD--QGSSKINF--GGIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS- 245
L D +GS+ +F G + TP++ + YY+++ ++V + ++ ++SS
Sbjct: 233 LVDYLRGSNASSFLVMGRTHWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGIASSD 292
Query: 246 --------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMI---KAQPVKGVGAEPGFSDVL 294
G IF D+G + L +S + ++ I +AQ + GF L
Sbjct: 293 WGIDGDGNKGTIF-DSGTTLSYLREPAYSKVLGALNASIYLPRAQEI-----PEGFE--L 344
Query: 295 CYNISSQPK-FPEVTIHFRGADV-KLSPSNLFRNISDEIMCSAFRG---GNANIVYGRIM 349
CYN++ K P++ + F+G V +L +N +++ + C A + N + + G ++
Sbjct: 345 CYNVTRMEKGMPKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQKVTTTNGSNILGNLL 404
Query: 350 QINFLIGYDIEQAMVSFKPSRC 371
Q + I YD+ +A + FK S C
Sbjct: 405 QQDHHIEYDLAKARIGFKWSPC 426
>gi|296082170|emb|CBI21175.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 114 bits (285), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 103/352 (29%), Positives = 152/352 (43%), Gaps = 70/352 (19%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +G+P D+ DTGSD TWTQCEPC C++Q +FDP S +Y+++SC S
Sbjct: 89 YVVTVGLGSPKRDLTFIFDTGSDLTWTQCEPCVGY-CYQQREHIFDPSTSLSYSNVSCDS 147
Query: 95 SQCAVVTS------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
C + S CS C Y YG G+Y S G A E L+ ST N
Sbjct: 148 PSCEKLESATGNSPGCSSSTCLYGIRYGDGSY---SIGFFAREKLSLTSTD----VFNNF 200
Query: 149 IFGCGHKN--LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFG 206
FGCG N L T+ G++GL SL+SQ FSYCLP SS
Sbjct: 201 QFGCGQNNRGLFGGTA-----GLLGLARNPLSLVSQTAQKYGKVFSYCLPSSSSST---- 251
Query: 207 GIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLPLEYHS 266
YLS + ++ ++F LP +S
Sbjct: 252 ------------------GYLSFGSGDGDSKAVKFTPR---------------LPPTVYS 278
Query: 267 NLKSVMSNMIKAQP-VKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFR-GADVKLSPSN 322
+++ V ++ P VKGV CY++S K P++ ++F GA++ L+P
Sbjct: 279 SVQKVFRELMSDYPRVKGVSILD-----TCYDLSKYKTVKVPKIILYFSGGAEMDLAPEG 333
Query: 323 LFRNISDEIMCSAFRGGNAN---IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ + +C AF G + + + G + Q + YD + V F PS C
Sbjct: 334 IIYVLKVSQVCLAFAGNSDDDEVAIIGNVQQKTIHVVYDDAEGRVGFAPSGC 385
>gi|357118074|ref|XP_003560784.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial
[Brachypodium distachyon]
Length = 452
Score = 114 bits (284), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 106/363 (29%), Positives = 168/363 (46%), Gaps = 48/363 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
+++ + G+P DTGSD +W QC+PC C+KQ P+FDP KSS+Y + C +
Sbjct: 112 FVVVVGFGSPAQTSATMFDTGSDLSWIQCQPC-SGHCYKQHDPVFDPAKSSSYAVVPCGT 170
Query: 95 SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
++CA C+ C Y YG G S ++G LA ETLTF+S+S E IFGCG
Sbjct: 171 TECAAAGGECNGTTCVYGVEYGDG---SSTTGVLARETLTFSSSS----EFTGFIFGCGE 223
Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGV 214
NL + G++GLG G+ SL SQ + G FSYCLP ++ G + GA
Sbjct: 224 TNLG---DFGEVDGLLGLGRGSLSLSSQAAPAFGGIFSYCLPSYNTTP---GYLSIGATP 277
Query: 215 VSTPLIIR-----------DHYYLSLEAISVGNQRL-----EFVSSSTGNIFVDTGVLRT 258
V+ + ++ Y++ L +I++G L EF + T +D+G + T
Sbjct: 278 VTGQIPVQYTAMVNKPDYPSFYFIELVSINIGGYVLPVPPSEFTKTGT---LLDSGTILT 334
Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQPKFPEVTIHFRGADVK 317
LP ++ L+ + +G P + ++ CY+ + Q + F +D
Sbjct: 335 YLPPPAYTALRDRFKFTM-----QGSKPAPPYDELDTCYDFTGQSGILIPGVSFNFSDGA 389
Query: 318 LSPSNLF------RNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKP 368
+ N F + + C AF A++ V G Q + + YD+ + F P
Sbjct: 390 VFNLNFFGIMTFPDDTKPAVGCLAFVSRPADMPFSVVGSTTQRSAEVIYDVPAQKIGFIP 449
Query: 369 SRC 371
+ C
Sbjct: 450 ASC 452
>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 494
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 102/340 (30%), Positives = 152/340 (44%), Gaps = 38/340 (11%)
Query: 52 VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNCSEGD 108
+DT SD TW QC PCP C+ Q+ L+DP KSS+ SC+S C + + C+ +
Sbjct: 173 LDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYANGCTNNN 232
Query: 109 -CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQT 167
C Y Y G + ++G ++ LT + + + FGC H S + S
Sbjct: 233 QCQYRVRYPDG---TSTAGTYISDLLTITPATAV----RSFQFGCSHGVQGSFSFGSSAA 285
Query: 168 GIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGI--VAGAGVVSTPL-----I 220
GI+ LG G SL+SQ + FS+C P G+ VA V TP+ I
Sbjct: 286 GIMALGGGPESLVSQTAATYGRVFSHCFPPPTRRGFFTLGVPRVAAWRYVLTPMLKNPAI 345
Query: 221 IRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKA 278
Y + LEAI+V QR+ + + G + L P Y + ++ M
Sbjct: 346 PPTFYMVRLEAIAVAGQRIAVPPTVFAAGAALDSRTAITRLPPTAYQALRQAFRDRMAMY 405
Query: 279 QPVKGVGAEPGFSDVLCYNISSQPKF--PEVTIHF-RGADVKLSPSN-LFRNISDEIMCS 334
QP A P CY+++ F P +T+ F + A V+L PS LF+ C
Sbjct: 406 QP-----APPKGPLDTCYDMAGVRSFALPRITLVFDKNAAVELDPSGVLFQG------CL 454
Query: 335 AFRGGNANIVYGRI--MQINFL-IGYDIEQAMVSFKPSRC 371
AF G + V G I +Q+ L + Y+I A+V F+ + C
Sbjct: 455 AFTAGPNDQVPGIIGNIQLQTLEVLYNIPAALVGFRHAAC 494
>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
Length = 469
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 102/340 (30%), Positives = 152/340 (44%), Gaps = 38/340 (11%)
Query: 52 VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNCSEGD 108
+DT SD TW QC PCP C+ Q+ L+DP KSS+ SC+S C + + C+ +
Sbjct: 148 LDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYANGCTNNN 207
Query: 109 -CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQT 167
C Y Y G + ++G ++ LT + + + FGC H S + S
Sbjct: 208 QCQYRVRYPDG---TSTAGTYISDLLTITPATAV----RSFQFGCSHGVQGSFSFGSSAA 260
Query: 168 GIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGI--VAGAGVVSTPL-----I 220
GI+ LG G SL+SQ + FS+C P G+ VA V TP+ I
Sbjct: 261 GIMALGGGPESLVSQTAATYGRVFSHCFPPPTRRGFFTLGVPRVAAWRYVLTPMLKNPAI 320
Query: 221 IRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKA 278
Y + LEAI+V QR+ + + G + L P Y + ++ M
Sbjct: 321 PPTFYMVRLEAIAVAGQRIAVPPTVFAAGAALDSRTAITRLPPTAYQALRQAFRDRMAMY 380
Query: 279 QPVKGVGAEPGFSDVLCYNISSQPKF--PEVTIHF-RGADVKLSPSN-LFRNISDEIMCS 334
QP A P CY+++ F P +T+ F + A V+L PS LF+ C
Sbjct: 381 QP-----APPKGPLDTCYDMAGVRSFALPRITLVFDKNAAVELDPSGVLFQG------CL 429
Query: 335 AFRGGNANIVYGRI--MQINFL-IGYDIEQAMVSFKPSRC 371
AF G + V G I +Q+ L + Y+I A+V F+ + C
Sbjct: 430 AFTAGPNDQVPGIIGNIQLQTLEVLYNIPAALVGFRHAAC 469
>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 484
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 111/370 (30%), Positives = 173/370 (46%), Gaps = 55/370 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +G + + VDTGSD TW QC+PC C+ Q+ PL+DP SS+Y ++ C+S
Sbjct: 135 YIVTVELGGKNMSLI--VDTGSDLTWVQCQPCR--SCYNQQGPLYDPSVSSSYKTVFCNS 190
Query: 95 SQC---AVVTSN---CSEGD------CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
S C TSN C + C Y YG G+Y + G+LA+E++ T
Sbjct: 191 STCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSY---TRGDLASESILLGDT---- 243
Query: 143 VEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQG 199
++ N +FGCG N S G+ + SL+SQ + G FSYCLP D
Sbjct: 244 -KLENFVFGCGRNNKGLFGGSSGLMGLG---RSSVSLVSQTLKTFNGVFSYCLPSLEDGA 299
Query: 200 SSKINFGG----IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSSTGN-IF 250
S ++FG V TPL+ +R Y L+L S+G +E SSS G I
Sbjct: 300 SGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGG--VELKSSSFGRGIL 357
Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP--KFPEV 307
+D+G + T LP + K+V +K G PG+S + C+N++S P +
Sbjct: 358 IDSGTVITRLPPSIY---KAVKIEFLKQ--FSGFPTAPGYSILDTCFNLTSYEDISIPII 412
Query: 308 TIHFRG-ADVKLSPSNLFRNISDE--IMCSAFRG---GNANIVYGRIMQINFLIGYDIEQ 361
+ F+G A++++ + +F + + ++C A N + G Q N + YD Q
Sbjct: 413 KMIFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDSTQ 472
Query: 362 AMVSFKPSRC 371
+ C
Sbjct: 473 ERLGIVGENC 482
>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 457
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 117/367 (31%), Positives = 169/367 (46%), Gaps = 47/367 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + + +GTP VDTGS +W QC+PC + C Q P+F P S TY ++SCSS
Sbjct: 107 YYVKIGVGTPAKYFSMIVDTGSSLSWLQCQPC-VIYCHVQVDPIFTPSVSKTYKALSCSS 165
Query: 95 SQCAVVTS------NCSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
SQC+ + S CS G C Y YG SFS G L+ + LT P P
Sbjct: 166 SQCSSLKSSTLNAPGCSNATGACVYKASYGD---TSFSIGYLSQDVLTLT-----PSAAP 217
Query: 147 N--VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKIN 204
+ ++GCG N + GIIGL S++ Q+ FSYCLP S++ N
Sbjct: 218 SSGFVYGCGQDNQG---LFGRSAGIIGLANDKLSMLGQLSNKYGNAFSYCLPSSFSAQPN 274
Query: 205 ---FGGIVAGAGVVS------TPLI----IRDHYYLSLEAISVGNQRLEFVSSSTGNI-- 249
G + GA +S TPL+ I Y+L L I+V + L VS+S+ N+
Sbjct: 275 SSVSGFLSIGASSLSSSPYKFTPLVKNPKIPSLYFLGLTTITVAGKPLG-VSASSYNVPT 333
Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCY--NISSQPKFPE 306
+D+G + T LP+ ++ LK ++ K PGFS + C+ ++ PE
Sbjct: 334 IIDSGTVITRLPVAIYNALKKSFVMIMS----KKYAQAPGFSILDTCFKGSVKEMSTVPE 389
Query: 307 VTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMV 364
+ I FR GA ++L N I C A + I + G Q F + YD+ + +
Sbjct: 390 IRIIFRGGAGLELKVHNSLVEIEKGTTCLAIAASSNPISIIGNYQQQTFTVAYDVANSKI 449
Query: 365 SFKPSRC 371
F P C
Sbjct: 450 GFAPGGC 456
>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
Length = 395
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 109/382 (28%), Positives = 183/382 (47%), Gaps = 59/382 (15%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP--PLFDPKKSSTYNSISC 92
Y + L +GTP +DTGSD TW QC P P P P +D SS+Y I C
Sbjct: 27 YFVELRVGTPAKKFPLIIDTGSDLTWIQCNP-PNTTANSSSPPAPWYDKSSSSSYREIPC 85
Query: 93 SSSQC----AVVTSNC---SEGDCSYSFLYGRGAYASFSSGNLATETLTFNS--TSG--- 140
+ +C A + S+C S C Y++ Y + S ++G LA ET++ S SG
Sbjct: 86 TDDECLFLPAPIGSSCSIKSPSPCDYTYGY---SDQSRTTGILAYETISMKSRKRSGKRA 142
Query: 141 -----LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQ-MGTSIAGKFSYC 194
+ + NV GC +++ + S +G++GLG G SL +Q T++ G FSYC
Sbjct: 143 GNHKTRTIRIKNVALGCSRESVGA--SFLGASGVLGLGQGPISLATQTRHTALGGIFSYC 200
Query: 195 LPD--QGSSKINF--GGIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS- 245
L D +GS+ +F G + TP++ + YY+++ ++V + ++ ++SS
Sbjct: 201 LVDYLRGSNASSFLVMGRTRWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGIASSD 260
Query: 246 --------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMI---KAQPVKGVGAEPGFSDVL 294
G IF D+G + L +S + ++ I +AQ + GF L
Sbjct: 261 WGIDGDGNKGTIF-DSGTTLSYLREPAYSKVLGALNASIYLPRAQEI-----PEGFE--L 312
Query: 295 CYNISSQPK-FPEVTIHFRGADV-KLSPSNLFRNISDEIMCSAFRG---GNANIVYGRIM 349
CYN++ K P++ + F+G V +L +N +++ + C A + N + + G ++
Sbjct: 313 CYNVTRMEKGMPKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQKVTTTNGSNILGNLL 372
Query: 350 QINFLIGYDIEQAMVSFKPSRC 371
Q + I YD+ +A + FK S C
Sbjct: 373 QQDHHIEYDLAKARIGFKWSPC 394
>gi|18405138|ref|NP_565911.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
gi|13877759|gb|AAK43957.1|AF370142_1 unknown protein [Arabidopsis thaliana]
gi|15293231|gb|AAK93726.1| unknown protein [Arabidopsis thaliana]
gi|20196976|gb|AAB87120.2| expressed protein [Arabidopsis thaliana]
gi|20197046|gb|AAM14894.1| expressed protein [Arabidopsis thaliana]
gi|330254616|gb|AEC09710.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 113/381 (29%), Positives = 179/381 (46%), Gaps = 66/381 (17%)
Query: 39 LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA 98
L++G PP +I +DTGS+ +W C+ P L +F+P SSTY+ + CSS C
Sbjct: 69 LAVGDPPQNISMVLDTGSELSWLHCKKSPNLGS------VFNPVSSSTYSPVPCSSPICR 122
Query: 99 VVTSNC---SEGDCSYSFLYGRGAYASFSS--GNLATETLTFNSTSGLPVEMPNVIFGCG 153
T + + D + +YA +S GNLA ET S V P +FGC
Sbjct: 123 TRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGS-----VTRPGTLFGCM 177
Query: 154 HKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGA 212
L+S + D+K TG++G+ G+ S ++Q+G S KFSYC+ SS G + +
Sbjct: 178 DSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFS---KFSYCISGSDSSGFLLLGDASYS 234
Query: 213 G---VVSTPLII---------RDHYYLSLEAISVGNQRLE-----FVSSST--GNIFVDT 253
+ TPL++ R Y + LE I VG++ L FV T G VD+
Sbjct: 235 WLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDS 294
Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGF----SDVLCYNISS--QPKF--- 304
G T L ++ LK+ + + + V + +P F + LCY + S +P F
Sbjct: 295 GTQFTFLMGPVYTALKNEF--ITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFSGL 352
Query: 305 PEVTIHFRGADVKLSPSNLFRNIS-------DEIMCSAFRGGNANI------VYGRIMQI 351
P V++ FRGA++ +S L ++ +E+ C F GN+++ V G Q
Sbjct: 353 PMVSLMFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTF--GNSDLLGIEAFVIGHHHQQ 410
Query: 352 NFLIGYDIEQAMVSFKPS-RC 371
N + +D+ ++ V F + RC
Sbjct: 411 NVWMEFDLAKSRVGFAGNVRC 431
>gi|26451756|dbj|BAC42973.1| unknown protein [Arabidopsis thaliana]
Length = 442
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 113/381 (29%), Positives = 179/381 (46%), Gaps = 66/381 (17%)
Query: 39 LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA 98
L++G PP +I +DTGS+ +W C+ P L +F+P SSTY+ + CSS C
Sbjct: 69 LAVGDPPQNISMVLDTGSELSWLHCKKSPNLGS------VFNPVSSSTYSPVPCSSPICR 122
Query: 99 VVTSNC---SEGDCSYSFLYGRGAYASFSS--GNLATETLTFNSTSGLPVEMPNVIFGCG 153
T + + D + +YA +S GNLA ET S V P +FGC
Sbjct: 123 TRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGS-----VTRPGTLFGCM 177
Query: 154 HKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGA 212
L+S + D+K TG++G+ G+ S ++Q+G S KFSYC+ SS G + +
Sbjct: 178 DSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFS---KFSYCISGSDSSVFLLLGDASYS 234
Query: 213 G---VVSTPLII---------RDHYYLSLEAISVGNQRLE-----FVSSST--GNIFVDT 253
+ TPL++ R Y + LE I VG++ L FV T G VD+
Sbjct: 235 WLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDS 294
Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGF----SDVLCYNISS--QPKF--- 304
G T L ++ LK+ + + + V + +P F + LCY + S +P F
Sbjct: 295 GTQFTFLMGPVYTALKNEF--ITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFSGL 352
Query: 305 PEVTIHFRGADVKLSPSNLFRNIS-------DEIMCSAFRGGNANI------VYGRIMQI 351
P V++ FRGA++ +S L ++ +E+ C F GN+++ V G Q
Sbjct: 353 PMVSLMFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTF--GNSDLLGIEAFVIGHHHQQ 410
Query: 352 NFLIGYDIEQAMVSFKPS-RC 371
N + +D+ ++ V F + RC
Sbjct: 411 NVWMEFDLAKSRVGFAGNVRC 431
>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
Length = 462
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 102/375 (27%), Positives = 172/375 (45%), Gaps = 51/375 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +G+P + VDTGS+ TW +C PC C ++D +S +Y ++C++
Sbjct: 100 YYTSIKLGSPGQEAILIVDTGSELTWLKCLPCKV--CAPSVDTIYDAARSVSYKPVTCNN 157
Query: 95 SQCAVVTSN-----CSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNST-SGLPVEMPN 147
SQ +S C+ G C ++ YG G SFS G+L+T+TL + G PV + +
Sbjct: 158 SQLCSNSSQGTYAYCARGSQCQFAAFYGDG---SFSYGSLSTDTLIMETVVGGKPVTVQD 214
Query: 148 VIFGCGHKNLA-SPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFG 206
FGC +L PT S GI+GL G +L Q+G KFS+C PD+ SS +N
Sbjct: 215 FAFGCAQGDLELVPTGAS---GILGLNAGKMALPMQLGQRFGWKFSHCFPDR-SSHLNST 270
Query: 207 GIV-------------AGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDT 253
G+V + ++ + R Y+++L+ +S+ + L + + + +D+
Sbjct: 271 GVVFFGNAELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVLLPRGS-VVILDS 329
Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQP--VKGVGAEPGFSDV-LCYNISS------QPKF 304
G + +HS L+ +K +P +K + + F D+ C+ +S+
Sbjct: 330 GSSFSSFVRPFHSQLREA---FLKHRPPSLKHLEGD-SFGDLGTCFKVSNDDIDELHRTL 385
Query: 305 PEVTIHFRGADVKLSPS-----NLFRNISDEIMCSAFRGGNANI--VYGRIMQINFLIGY 357
P +++ F PS + R + MC AF G N V G Q N + Y
Sbjct: 386 PSLSLVFEDGVTIGIPSIGVLLPVARYQNHVKMCFAFEDGGPNPVNVIGNYQQQNLWVEY 445
Query: 358 DIEQAMVSFKPSRCT 372
DI+++ V F + C
Sbjct: 446 DIQRSRVGFARASCV 460
>gi|222637181|gb|EEE67313.1| hypothetical protein OsJ_24553 [Oryza sativa Japonica Group]
Length = 414
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 102/345 (29%), Positives = 155/345 (44%), Gaps = 59/345 (17%)
Query: 70 DCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSG 126
+C + P F P SST++ + C+SS C +TS C+ C Y + YG G F++G
Sbjct: 87 ECAARPAPPFQPASSSTFSKLPCASSLCQFLTSPYLTCNATGCVYYYPYGMG----FTAG 142
Query: 127 NLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS 186
LATETL S P V FGC +N +S +GI+GLG SL+SQ+G
Sbjct: 143 YLATETLHVGGAS-----FPGVAFGCSTENGVGNSS----SGIVGLGRSPLSLVSQVGV- 192
Query: 187 IAGKFSYCL---PDQGSSKINFGGIVAGAGVVSTPLIIRD-------HYYLSLEAISVGN 236
G+FSYCL D G S I FG + G S+P I+ + +YY++L I+VG
Sbjct: 193 --GRFSYCLRSDADAGDSPILFGSLAKVTGGKSSPAILENPEMPSSSYYYVNLTGITVGA 250
Query: 237 QRLEFVSSS-----------TGNIFVDTGVLRTLLPLEYHSNLK-SVMSNMIKAQPVKGV 284
L S++ G VD+G T L E ++ +K + +S M A V
Sbjct: 251 TDLPVTSTTFGFTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTV 310
Query: 285 -GAEPGFSDVLCYNIS-----SQPKFPEVTIHFRGADVKLSPSNLFRNISD-------EI 331
G GF LC++ + S P + + F G + + + +
Sbjct: 311 NGTRFGFD--LCFDANAAGGGSGVPVPTLVLRFAGGAEYAVRRRSYVGVVEVDSQGRAAV 368
Query: 332 MCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
C + + + G +MQ++ + YD++ M SF P+ C N
Sbjct: 369 ECLLVLPASEKLSISIIGNVMQMDLHVLYDLDGGMFSFAPADCAN 413
>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
Length = 452
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 106/354 (29%), Positives = 161/354 (45%), Gaps = 52/354 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELD-CFKQEPPLFDPKKSSTYNSISCS 93
Y++ + +G+P V +DTGSD +W QCEPCP C LFDP SSTY + +CS
Sbjct: 108 YVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFNCS 167
Query: 94 SSQCAVV-----TSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
++ CA + + C ++ C Y YG G S ++G +++ LT + + +
Sbjct: 168 AAACAQLGDSGEANGCDAKSRCQYIVKYGDG---SNTTGTYSSDVLTLSGSD----VVRG 220
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK----- 202
FGC H L + D K G+IGLG S +SQ F YCLP +S
Sbjct: 221 FQFGCSHAELGAGM-DDKTDGLIGLGGDAQSPVSQTAARYGKSFFYCLPATPASSGFLTL 279
Query: 203 --INFGGIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTG 254
GG + +TP++ + +Y+ +LE I+VG ++L S + G++ VD+G
Sbjct: 280 GAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLSPSVFAAGSL-VDSG 338
Query: 255 VLRTLLPLEYHSNLKSV----MSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVT 308
+ T LP ++ L S M+ +A+P+ G D C+N + K P V
Sbjct: 339 TVITRLPPAAYAALSSAFRAGMTRYARAEPL-------GILDT-CFNFTGLDKVSIPTVA 390
Query: 309 IHFR-GADVKLSPSNLFRNISDEIMCSAF---RGGNANIVYGRIMQINFLIGYD 358
+ F GA V L + C AF R A G + Q F + YD
Sbjct: 391 LVFAGGAVVDLDAHGIVSG-----GCLAFAPTRDDKAFGTIGNVQQRTFEVLYD 439
>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
Length = 469
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 103/356 (28%), Positives = 153/356 (42%), Gaps = 34/356 (9%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+ + +GTP V +DTGS TW QC+PC C+ Q PLFDP SS+Y+ + C S
Sbjct: 129 YVATVGLGTPAVPQTLILDTGSSLTWVQCKPCNSSQCYPQRLPLFDPNTSSSYSPVPCDS 188
Query: 95 SQCAVVTSN------CSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
+C + + S+GD C+Y YG GA +G +T+ LT G +
Sbjct: 189 QECRALAAGIDGDGCTSDGDWGCAYEIHYGSGAT---PAGEYSTDALTL----GPGAIVK 241
Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK-FSYCLPDQGSSK--I 203
FGCGH + G++GLG SL Q G FS+CLP G S +
Sbjct: 242 RFHFGCGHHQQRGKFDMAD--GVLGLGRLPQSLAWQASARRGGGVFSHCLPPTGVSTGFL 299
Query: 204 NFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRT 258
G + V TPL+ D Y L AISV Q L+ + + D+G + +
Sbjct: 300 ALGAPHDTSAFVFTPLLTMDDQPWFYQLMPTAISVAGQLLDIPPAVFREGVITDSGTVLS 359
Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFR-GAD 315
L ++ L++ + + P+ A P C+N + P V++ FR GA
Sbjct: 360 ALQETAYTALRTAFRSAMAEYPL----APPVGHLDTCFNFTGYDNVTVPTVSLTFRGGAT 415
Query: 316 VKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
V L S+ + D + G + G + Q + YD+ V F+ C
Sbjct: 416 VHLDASSGV--LMDGCLAFWSSGDEYTGLIGSVSQRTIEVLYDMPGRKVGFRTGAC 469
>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 449
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 112/387 (28%), Positives = 171/387 (44%), Gaps = 79/387 (20%)
Query: 39 LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA 98
L++GTPP ++ +DTGS+ +W C F+P SS+Y+ I CSSS C
Sbjct: 77 LTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSST---FNPVWSSSYSPIPCSSSTCT 133
Query: 99 VVTSNCS-EGDC-SYSFLYGRGAYASFSS--GNLATETLTFNSTSGLPVEMPNVIFGCGH 154
T + C S F + +YA SS GNLAT+T S+ +PNV+FGC
Sbjct: 134 DQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSSG-----IPNVVFGCMD 188
Query: 155 KNLASPT-SDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAG 213
+S + DSK TG++G+ G+ S +SQMG KFSYC+ S+ +F G++
Sbjct: 189 SIFSSNSEEDSKNTGLMGMNRGSLSFVSQMGFP---KFSYCI-----SEYDFSGLLLLGD 240
Query: 214 V---------------VSTPLIIRDH--YYLSLEAISVGNQRLEFVSS-------STGNI 249
+STPL D Y + LE I V ++ L S G
Sbjct: 241 ANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAGQT 300
Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV---------LCYNISS 300
VD+G T L ++ L+ N K G+ + D LCY + +
Sbjct: 301 MVDSGTQFTFLLGPAYTALRDHFLN-------KTAGSLRVYEDSNFVFQGAMDLCYRVPT 353
Query: 301 Q----PKFPEVTIHFRGADVKLSPSNLF------RNISDEIMCSAFRGGNANI------V 344
P P VT+ FRGA++ ++ + R +D I C F GN+++ V
Sbjct: 354 NQTRLPPLPSVTLVFRGAEMTVTGDRILYRVPGERRGNDSIHCFTF--GNSDLLGVEAFV 411
Query: 345 YGRIMQINFLIGYDIEQAMVSFKPSRC 371
G + Q N + +D++++ + RC
Sbjct: 412 IGHLHQQNVWMEFDLKKSRIGLAEIRC 438
>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
vinifera]
Length = 560
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 105/374 (28%), Positives = 173/374 (46%), Gaps = 48/374 (12%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + IGTP D + VDTGSD W C+ CP + L+D K S+T +++
Sbjct: 154 LYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAV 213
Query: 91 SCSSSQCAVVTS---NCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
C + C++ C G C YS LYG G S ++G + + +N SG P
Sbjct: 214 GCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDG---SSTTGYFVQDFVQYNRISGNFQTTP 270
Query: 147 ---NVIFGCGHKNLASPTSDSKQ-TGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQGS 200
V+FGCG+K S S+ GI+G G NSS++SQ+ +S + FS+CL +
Sbjct: 271 TNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDN--- 327
Query: 201 SKINFGGIVAGAGVVS-----TPLII-RDHYYLSLEAISVGNQRLE-----FVSSSTGNI 249
++ GGI A VV TPL+ + HY + ++ I VG L+ F S
Sbjct: 328 --VDGGGIFAIGEVVEPKVNITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAFESGDRKGT 385
Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQP-VKGVGAEPGFSDVLCYNISS--QPKFPE 306
+D+G P E + L + ++ QP ++ E F+ C++ + FP
Sbjct: 386 IIDSGTTLAYFPQEVYVPL---IEKILSQQPDLRLHTVEQAFT---CFDYTGNVDDGFPT 439
Query: 307 VTIHF-RGADVKLSPSN-LFRN-----ISDEIMCSAFRGGNANIVYGRIMQINFLIGYDI 359
VT+HF + + + P LF++ I + + + G + G ++ N L+ YD+
Sbjct: 440 VTLHFDKSISLTVYPHEYLFQHEFEWCIGWQNSGAQTKDGKDLTLLGDLVLSNKLVVYDL 499
Query: 360 EQAMVSFKPSRCTN 373
E+ + + C++
Sbjct: 500 EKQGIGWVEYNCSS 513
>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
gi|194690124|gb|ACF79146.1| unknown [Zea mays]
gi|194708040|gb|ACF88104.1| unknown [Zea mays]
gi|223950469|gb|ACN29318.1| unknown [Zea mays]
gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
Length = 500
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 101/359 (28%), Positives = 158/359 (44%), Gaps = 58/359 (16%)
Query: 52 VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEG---- 107
VDT S+ TW QC PC C Q+ PLFDP S +Y ++ C S C + + G
Sbjct: 158 VDTASELTWVQCAPCES--CHDQQGPLFDPSSSPSYAAVPCDSPSCDALQQQLATGAGAG 215
Query: 108 ----------DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNL 157
CSY+ Y G+Y S G LA + L+ + +FGCG N
Sbjct: 216 APPCDAGRPAACSYALSYRDGSY---SRGVLAHDRLSLAGEV-----IDGFVFGCGTSNQ 267
Query: 158 ASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--------------DQGSSKI 203
P + +G++GLG SL+SQ G FSYCLP D S+
Sbjct: 268 GPPFGGT--SGLMGLGRSQLSLVSQTVDQFGGVFSYCLPLSRESDASGSLVLGDDPSAYR 325
Query: 204 NFGGIVAGAGVV-STPLIIRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTG-VLRTLLP 261
N +V + V S PL+ Y ++L I+VG Q +E S I VD+G V+ +L+P
Sbjct: 326 NSTPVVYTSMVSNSDPLLQGPFYLVNLTGITVGGQEVESTGFSARAI-VDSGTVITSLVP 384
Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR-GADVK 317
Y++ MS + + PGFS + C+N++ + + P +T+ F GA+V+
Sbjct: 385 SVYNAVRAEFMSQLAEYPQA------PGFSILDTCFNMTGLKEVQVPSLTLVFDGGAEVE 438
Query: 318 LSPSNLFRNISDE-----IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ + +S + + ++ + + + G Q N + +D + V F C
Sbjct: 439 VDSGGVLYFVSSDSSQVCLAVASLKSEDETSIIGNYQQKNLRVVFDTSASQVGFAQETC 497
>gi|242085924|ref|XP_002443387.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
gi|241944080|gb|EES17225.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
Length = 460
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 106/386 (27%), Positives = 160/386 (41%), Gaps = 57/386 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+ IG PP +DTGS+ WTQC C CF Q+ +DP +S T ++C+
Sbjct: 84 YIAEYLIGDPPQQAAAIIDTGSNLIWTQCSTCRANGCFGQDLTFYDPSRSRTAKPVACND 143
Query: 95 SQCAVVTSNCSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
+ C + + D C+ YG GA F L TE TF + ++ FG
Sbjct: 144 TACLLGSETRCARDGKACAVLTAYGAGAIGGF----LGTEVFTFGHGQSSENNV-SLAFG 198
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAG 211
C + +P S +GIIGLG G SL SQ+G + KFSYCL S N + G
Sbjct: 199 CITASRLTPGSLDGASGIIGLGRGKLSLPSQLGDN---KFSYCLTPYFSDAANTSTLFVG 255
Query: 212 AG---------VVSTPLI-------IRDHYYLSLEAISVGNQRL----------EFVSSS 245
A S P + YYL L I+VG +L E +
Sbjct: 256 ASAGLSGGGAPATSVPFLKNPDDDPFDSFYYLPLTGITVGTAKLDVPAAAFDLREVAPAK 315
Query: 246 TGNIFVDTGV-LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS---SQ 301
G +D+G +L+ + Y + L+ + + A V G D+ ++ +
Sbjct: 316 WGGTLIDSGSPFTSLIDVAYQA-LRDELVRQLGASVVPPPAGAEGL-DLCVGGVAPGDAG 373
Query: 302 PKFPEVTIHF-----RGADVKLSPSNLFRNISDEIMCS-AFRGG--------NANIVYGR 347
P + +HF G DV + P N + + D C F G N + G
Sbjct: 374 KLVPPLVLHFGSGGGGGGDVVVPPENYWGPVDDSTACMVVFSSGGPNSTLPLNETTIIGN 433
Query: 348 IMQINFLIGYDIEQAMVSFKPSRCTN 373
MQ + + YD+ Q ++SF+P+ C++
Sbjct: 434 YMQQDMHLLYDLGQGVLSFQPADCSS 459
>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
Length = 388
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 100/376 (26%), Positives = 163/376 (43%), Gaps = 46/376 (12%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +G P VDTGSD W C P CP ++DP++SST + +
Sbjct: 1 LYFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSLV 60
Query: 91 SCSSSQCA----VVTSNCSEG--DCSYSFLYGRGAYASFSSGNLATETLTFN--STSGLP 142
SCS C + CS+ +C Y F YG G S S G + + +N S++GL
Sbjct: 61 SCSDPLCVRGRRFAEAQCSQATNNCEYIFSYGDG---STSEGYYVRDAMQYNVISSNGLA 117
Query: 143 VEMPNVIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGT--SIAGKFSYCLP-DQ 198
V+FGC + TS GIIG G S+ +Q+ +I FS+CL ++
Sbjct: 118 NTTSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLEGEK 177
Query: 199 GSSKINFGGIVAGAGVVSTPLIIRD-HYYLSLEAISVGNQRL-----EFVSSSTGNIFVD 252
I G +A G+ TPL+ HY + L ISV + RL +F S++ + +D
Sbjct: 178 RGGGILVIGGIAEPGMTYTPLVPDSVHYNVVLRGISVNSNRLPIDAEDFSSTNDTGVIMD 237
Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVTIH 310
+G P ++ + A PV+ G D C+ +S + FP VT++
Sbjct: 238 SGTTLAYFPSGAYNVFVQAIREATSATPVRVQGM-----DTQCFLVSGRLSDLFPNVTLN 292
Query: 311 FRGADVKLSPSNLFR------NISDEIMCSAF---------RGGNANIVYGRIMQINFLI 355
F G ++L P N + ++ C + + G+ + G I+ + L+
Sbjct: 293 FEGGAMELQPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDIVLKDKLV 352
Query: 356 GYDIEQAMVSFKPSRC 371
YD++ + + + C
Sbjct: 353 VYDLDNSRIGWMSYNC 368
>gi|413944032|gb|AFW76681.1| hypothetical protein ZEAMMB73_606599 [Zea mays]
Length = 315
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 94/323 (29%), Positives = 141/323 (43%), Gaps = 47/323 (14%)
Query: 84 SSTYNSISCSSSQC----AVVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNS 137
SST+ +++C C V S C+ + C Y YG S ++G++ +T TF S
Sbjct: 2 SSTFKAVACPDPICRPSSGVSVSACAMENFQCFYLCSYGD---RSITAGHIFKDTFTFMS 58
Query: 138 TSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-- 195
+G+PV + + FGCG N S+ ++GI G G G SL SQ+ G+FSYCL
Sbjct: 59 PNGVPVAVSELAFGCGDYNTGLFVSN--ESGIAGFGRGPQSLPSQLK---VGRFSYCLTL 113
Query: 196 --------------PDQGSSKINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEF 241
PD + + G ++ PL I YYLSLE I+VG RL F
Sbjct: 114 VTESKSSVVILGTPPDPDGLRAHTTGPFQSTPIIYNPL-IPTFYYLSLEGITVGKTRLPF 172
Query: 242 VSS-------STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL 294
S +G +D+G T LP + + + ++ P+ P D L
Sbjct: 173 DKSVFALKKDGSGGTVIDSGTSLTTLP---EAVFELLQEELVAQFPLPRYDNTPEVGDRL 229
Query: 295 CYNI---SSQPKFPEVTIHFRGADVKLSPSNLFRNISDE-IMCSAFRGGNAN--IVYGRI 348
C+ Q P++ +H GAD+ L N F D +MC G ++ G
Sbjct: 230 CFRRPKGGKQVPVPKLILHLAGADMDLPRDNYFVEEPDSGVMCLQINGAEDTTMVLIGNF 289
Query: 349 MQINFLIGYDIEQAMVSFKPSRC 371
Q N + YD+E + F P++C
Sbjct: 290 QQQNMHVVYDVENNKLLFAPAQC 312
>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
Length = 373
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 101/359 (28%), Positives = 161/359 (44%), Gaps = 57/359 (15%)
Query: 52 VDTGSDCTWTQCEPCPELDCFKQE--PPLFDPKKSSTYNSISCSSSQC---AVVTSNC-S 105
VDTGSD WTQC+ + PP++DP +SST+ + CS C NC S
Sbjct: 30 VDTGSDLIWTQCKLSSSTAAAARHGSPPVYDPGESSTFAFLPCSDRLCQEGQFSFKNCTS 89
Query: 106 EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSK 165
+ C Y +YG A + G LA+ET TF + + + + FGCG + S
Sbjct: 90 KNRCVYEDVYGSAA----AVGVLASETFTFGARRAVSLRLG---FGCGALSAGSLIG--- 139
Query: 166 QTGIIGLGPGNSSLISQMGTSIAGKFSYCL---PDQGSSKINFGGI-----------VAG 211
TGI+GL P + SLI+Q+ +FSYCL D+ +S + FG + +
Sbjct: 140 ATGILGLSPESLSLITQLKIQ---RFSYCLTPFADKKTSPLLFGAMADLSRHKTTRPIQT 196
Query: 212 AGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSSST-------GNIFVDTGVLRTLLPLEY 264
+VS P + +YY+ L IS+G++RL ++S G VD+G L
Sbjct: 197 TAIVSNP-VETVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAA 255
Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--------KFPEVTIHFRGADV 316
+K + ++++ PV E LC+ + + + P + +HF G
Sbjct: 256 FEAVKEAVMDVVRL-PVANRTVE---DYELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAA 311
Query: 317 KLSPS-NLFRNISDEIMCSAF---RGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ P N F+ +MC A G+ + G + Q N + +D++ SF P++C
Sbjct: 312 MVLPRDNYFQEPRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQHHKFSFAPTQC 370
>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 486
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 108/363 (29%), Positives = 162/363 (44%), Gaps = 46/363 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPC-PELDCFKQEPPLFDPKKSSTYNSISCS 93
+++ + +GTP DTGSD +W QC+PC C Q+ PLFDP KSSTY ++ C
Sbjct: 144 FVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAAVHCG 203
Query: 94 SSQCAVVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
QCA CSE + C Y YG G S ++G L+ +TL S+ L FG
Sbjct: 204 EPQCAAAGDLCSEDNTTCLYLVRYGDG---SSTTGVLSRDTLALTSSRAL----TGFPFG 256
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFGGIV 209
CG +NL + G++GLG G SL SQ S FSYCLP S+ + G
Sbjct: 257 CGTRNLG---DFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCLPSSNSTTGYLTIGATP 313
Query: 210 A-GAGVVSTPLIIRD-----HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLP 261
A G ++R Y++ L +I +G L + + G +D+G + T LP
Sbjct: 314 ATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYVLPVPPAVFTRGGTLLDSGTVLTYLP 373
Query: 262 LEYHSNLKS----VMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQPKFPEVTIHFR--- 312
+ ++ L+ M A P +DVL CY+ + + + + FR
Sbjct: 374 AQAYALLRDRFRLTMERYTPAPP----------NDVLDACYDFAGESEVVVPAVSFRFGD 423
Query: 313 GADVKLSPSNLFRNISDEIMCSAFR----GGNANIVYGRIMQINFLIGYDIEQAMVSFKP 368
GA +L + + + + C AF GG + G Q + + YD+ + F P
Sbjct: 424 GAVFELDFFGVMIFLDENVGCLAFAAMDTGGLPLSIIGNTQQRSAEVIYDVAAEKIGFVP 483
Query: 369 SRC 371
+ C
Sbjct: 484 ASC 486
>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
Length = 395
Score = 110 bits (276), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 100/376 (26%), Positives = 163/376 (43%), Gaps = 46/376 (12%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +G P VDTGSD W C P CP ++DP++SST + +
Sbjct: 28 LYFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSLV 87
Query: 91 SCSSSQCA----VVTSNCSE--GDCSYSFLYGRGAYASFSSGNLATETLTFN--STSGLP 142
SCS C + CS+ +C Y F YG G S S G + + +N S++GL
Sbjct: 88 SCSDPLCVRGRRFAEAQCSQTTNNCEYIFSYGDG---STSEGYYVRDAMQYNVISSNGLA 144
Query: 143 VEMPNVIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGT--SIAGKFSYCLP-DQ 198
V+FGC + TS GIIG G S+ +Q+ +I FS+CL ++
Sbjct: 145 NTTSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLEGEK 204
Query: 199 GSSKINFGGIVAGAGVVSTPLIIRD-HYYLSLEAISVGNQRL-----EFVSSSTGNIFVD 252
I G +A G+ TPL+ HY + L ISV + RL +F S++ + +D
Sbjct: 205 RGGGILVIGGIAEPGMTYTPLVPDSVHYNVVLRGISVNSNRLPIDAEDFSSTNDTGVIMD 264
Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVTIH 310
+G P ++ + A PV+ G D C+ +S + FP VT++
Sbjct: 265 SGTTLAYFPSGAYNVFVQAIREATSATPVRVQGM-----DTQCFLVSGRLSDLFPNVTLN 319
Query: 311 FRGADVKLSPSNLFR------NISDEIMCSAF---------RGGNANIVYGRIMQINFLI 355
F G ++L P N + ++ C + + G+ + G I+ + L+
Sbjct: 320 FEGGAMELQPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDIVLKDKLV 379
Query: 356 GYDIEQAMVSFKPSRC 371
YD++ + + + C
Sbjct: 380 VYDLDNSRIGWMSYNC 395
>gi|115475303|ref|NP_001061248.1| Os08g0207800 [Oryza sativa Japonica Group]
gi|45735815|dbj|BAD12851.1| unknown protein [Oryza sativa Japonica Group]
gi|113623217|dbj|BAF23162.1| Os08g0207800 [Oryza sativa Japonica Group]
gi|125602549|gb|EAZ41874.1| hypothetical protein OsJ_26419 [Oryza sativa Japonica Group]
Length = 449
Score = 110 bits (276), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 104/383 (27%), Positives = 161/383 (42%), Gaps = 50/383 (13%)
Query: 29 ISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYN 88
I D +YL + IG + +DTGS WTQC+ CP C + P + +S T+
Sbjct: 76 IYEDVVYLAEMEIGERQQKQYLLIDTGSSLVWTQCDECPH--CHIGDVPPYGRSQSRTFQ 133
Query: 89 SISCS-----------SSQCAV----VTSNCSEGDCSYSFLYGRGAYASFSSGNLATETL 133
+SC +S C + C G C + LY G ++ +T
Sbjct: 134 EVSCGDDDDNDKEEAIASYCPAKPPGYITLCVNGRCMFKALYNLTGQGETVQGYMSMDTF 193
Query: 134 TFNSTSGLPVEMP-NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFS 192
F + ++FGC H+ T+ + TGI+GLG G++S + Q G + KFS
Sbjct: 194 HFIDDRRFDYQAKFRMVFGCAHQENIVLTAVKECTGILGLGMGDASFLRQTGIT---KFS 250
Query: 193 YCLPD-------QGSSKINFGGIVAGAGVVSTPLIIR-DHYYLSLEAISVG-NQRLEFV- 242
YC+P + S + FG +G PL++R YYL L AI+ N+ + V
Sbjct: 251 YCVPPRMPGYSYRRHSWLRFGSHAQISG-KKVPLVMRWGKYYLPLTAITYTYNELMSPVP 309
Query: 243 ------SSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPV-KGVGAEPGFSDVLC 295
++ VDTG LP H +L M +IK++ + +G P C
Sbjct: 310 IIAYKSQEDYLHMMVDTGTSLLSLPTSLHDDLIKEMEAIIKSENIMEGATRWPKH----C 365
Query: 296 YNIS-SQPKFPEVTIHFRGA-DVKLSPSNLF---RNISDEIMCSAFR--GGNANIVYGRI 348
Y + + K VT+ F G D++L S LF +C A ++ + G
Sbjct: 366 YKRTMDEVKDITVTLSFDGGLDIELFTSALFIKTETTKGPAVCLAVNRVDDSSKAILGMF 425
Query: 349 MQINFLIGYDIEQAMVSFKPSRC 371
Q N +GYD+ ++ P RC
Sbjct: 426 AQTNINVGYDLLSREIAMDPIRC 448
>gi|255563741|ref|XP_002522872.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223537956|gb|EEF39570.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 448
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 100/379 (26%), Positives = 156/379 (41%), Gaps = 40/379 (10%)
Query: 24 YQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKK 83
++ I D YL+ + IG+P V ++ DTGS WTQCEPC F+Q PP+F+
Sbjct: 80 FRLRISQDDTCYLVKVIIGSPGVPLYLVPDTGSGLFWTQCEPCTRR--FRQLPPIFNSTA 137
Query: 84 SSTYNSISCSSSQCAVVTS--NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGL 141
S TY + C C + C + C Y Y G S ++G A + L +
Sbjct: 138 SRTYRDLPCQHQFCTNNQNVFQCRDDKCVYRIAYAGG---SATAGVAAQDILQSAENDRI 194
Query: 142 PVEMPNVIFGCGH--KNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL---- 195
P FGC +N ++ S K GIIGL SL+ QM +FSYCL
Sbjct: 195 P-----FYFGCSRDNQNFSTFESSGKGGGIIGLNMSPVSLLQQMNHITKNRFSYCLNLFD 249
Query: 196 ---PDQGSSKINFGGIVAGA--GVVSTPLII---RDHYYLSLEAISVGNQRLE------- 240
P +S + FG + + +STP + +Y+L+L +SV R++
Sbjct: 250 LSSPSHATSLLRFGNDIRKSRRKYLSTPFVSPRGMPNYFLNLIDVSVAGNRMQIPPGTFA 309
Query: 241 FVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS 300
TG +D+G T + + + + N + V + S +CY
Sbjct: 310 LKPDGTGGTIIDSGTAVTYISQTAYFPVITAFKNYFDQHGFQRVNIQ--LSGYICYKQQG 367
Query: 301 QP--KFPEVTIHFRGADVKLSPSNLFRNISDE-IMCSAFR--GGNANIVYGRIMQINFLI 355
+P + HF+GAD + P ++ + D C A + + G + Q N
Sbjct: 368 HTFHNYPSMAFHFQGADFFVEPEYVYLTVQDRGAFCVALQPISPQQRTIIGALNQANTQF 427
Query: 356 GYDIEQAMVSFKPSRCTNY 374
YD + F P C ++
Sbjct: 428 IYDAANRQLLFTPENCQDH 446
>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 427
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 117/415 (28%), Positives = 183/415 (44%), Gaps = 76/415 (18%)
Query: 2 QNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWT 61
Q LP +TP P + +Q + + L+IG+PP ++ +DTGS+ +W
Sbjct: 33 QKPLLLPLKTQTQTP--PRKLAFQHNVT-----LTISLTIGSPPQNVTMVLDTGSELSWL 85
Query: 62 QCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCS---EGDCSYSFLYGRG 118
C+ P L+ F+P SS+Y C+SS C T + + D + +
Sbjct: 86 HCKKLPNLNS------TFNPLLSSSYTPTPCNSSVCMTRTRDLTIPASCDPNNKLCHVIV 139
Query: 119 AYASFSS--GNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTS----DSKQTGIIGL 172
+YA SS G LA ET + + P +FGC + A TS D+K TG++G+
Sbjct: 140 SYADASSAEGTLAAETFSLAGAA-----QPGTLFGC--MDSAGYTSDINEDAKTTGLMGM 192
Query: 173 GPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAG---------------VVST 217
G+ SL++QM + KFSYC+ S + FG ++ G G S+
Sbjct: 193 NRGSLSLVTQM---VLPKFSYCI----SGEDAFGVLLLGDGPSAPSPLQYTPLVTATTSS 245
Query: 218 PLIIRDHYYLSLEAISVGNQRLE-----FVSSST--GNIFVDTGVLRTLLPLEYHSNLKS 270
P R Y + LE I V + L+ FV T G VD+G T L +++LK
Sbjct: 246 PYFDRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYNSLKD 305
Query: 271 VMSNMIKAQPVKGVGAEPGF----SDVLCYNI-SSQPKFPEVTIHFRGADVKLSPSNLFR 325
K V +P F + LCY+ +S P VT+ F GA++++S L
Sbjct: 306 EFLEQTKG--VLTRIEDPNFVFEGAMDLCYHAPASLAAVPAVTLVFSGAEMRVSGERLLY 363
Query: 326 NIS---DEIMCSAFRGGNANI------VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+S D + C F GN+++ V G Q N + +D+ ++ V F + C
Sbjct: 364 RVSKGRDWVYCFTF--GNSDLLGIEAYVIGHHHQQNVWMEFDLVKSRVGFTETTC 416
>gi|357507805|ref|XP_003624191.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499206|gb|AES80409.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 406
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 98/339 (28%), Positives = 155/339 (45%), Gaps = 37/339 (10%)
Query: 63 CEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV----TSNCSEG-DCSYSFLYGR 117
C CP+ + L+DP S T N++ C C S C + C YS YG
Sbjct: 33 CTACPKKSGLGMDLTLYDPNGSKTSNAVPCGDGFCTDTYSGPISGCKQDMSCPYSITYGD 92
Query: 118 GAYASFSSGNLATETLTFNSTSGLPVEMPN---VIFGCGHKNLASPTSDSKQT--GIIGL 172
G S +SG+ ++LTF+ SG P+ VIFGCG K S +S+S + GIIG
Sbjct: 93 G---STTSGSFVNDSLTFDEVSGNLHTKPDNSSVIFGCGAKQSGSLSSNSDEALDGIIGF 149
Query: 173 GPGNSSLISQMGTS--IAGKFSYCLPDQGSSKINFGGIVAGAGVVSTPLIIR-DHYYLSL 229
G NSS++SQ+ S + FS+CL I G V +TPL+ R HY + L
Sbjct: 150 GQANSSVLSQLAASGKVKRIFSHCLDSHHGGGIFSIGQVMEPKFNTTPLVPRMAHYNVIL 209
Query: 230 EAISVGNQRL-----EFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQP-VKG 283
+ + V + + F S S +D+G LPL ++ L + ++ QP +K
Sbjct: 210 KDMDVDGEPILLPLYLFDSGSGRGTIIDSGTTLAYLPLSIYNQL---LPKVLGRQPGLKL 266
Query: 284 VGAEPGFSDVLCYNISSQ--PKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAF----- 336
+ E F+ C++ S + FP V HF G + + P + ++I C +
Sbjct: 267 MIVEDQFT---CFHYSDKLDEGFPVVKFHFEGLSLTVHPHDYLFLYKEDIYCIGWQKSST 323
Query: 337 --RGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
+ G I+ G ++ N L+ YD+E ++ + C++
Sbjct: 324 QTKEGRDLILIGDLVLSNKLVVYDLENMVIGWTNFNCSS 362
>gi|224090744|ref|XP_002309070.1| predicted protein [Populus trichocarpa]
gi|222855046|gb|EEE92593.1| predicted protein [Populus trichocarpa]
Length = 404
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 110/388 (28%), Positives = 177/388 (45%), Gaps = 78/388 (20%)
Query: 36 LMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSS 95
++ L++GTPP ++ +DTGS+ +W C P FDP +S++Y +I CSS
Sbjct: 32 IVSLTVGTPPQNVSMVIDTGSELSWLHCNKT------LSYPTTFDPTRSTSYQTIPCSSP 85
Query: 96 QCAVVT------SNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
C T ++C + C + Y A AS S GNLA++ S+ ++ +
Sbjct: 86 TCTNRTQDFPIPASCDSNNLCHATLSY---ADASSSDGNLASDVFHIGSS-----DISGL 137
Query: 149 IFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGG 207
+FGC +S + DSK TG++G+ G+ S +SQ+G KFSYC+ S +F G
Sbjct: 138 VFGCMDSVFSSNSDEDSKSTGLMGMNRGSLSFVSQLGFP---KFSYCI-----SGTDFSG 189
Query: 208 IVAGAGV---------------VSTPLIIRDH--YYLSLEAISVGNQRLEFVSSS----- 245
++ +STPL D Y + LE I V ++ L S+
Sbjct: 190 LLLLGESNLTWSVPLNYTPLIQISTPLPYFDRVAYTVQLEGIKVLDKLLPIPKSTFEPDH 249
Query: 246 --TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGF----SDVLCYNIS 299
G VD+G T L ++ L+S N + V V +P F + LCY +
Sbjct: 250 TGAGQTMVDSGTQFTFLLGPVYNALRSAFLN--QTSSVLRVLEDPDFVFQGAMDLCYLVP 307
Query: 300 -SQ---PKFPEVTIHFRGADVKLSPSNLFRNI------SDEIMCSAFRGGNANI------ 343
SQ P P VT+ FRGA++ +S + + +D + C +F GN+++
Sbjct: 308 LSQRVLPLLPTVTLVFRGAEMTVSGDRVLYRVPGELRGNDSVHCLSF--GNSDLLGVEAY 365
Query: 344 VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
V G Q N + +D+E++ + RC
Sbjct: 366 VIGHHHQQNVWMEFDLEKSRIGLAQVRC 393
>gi|255549236|ref|XP_002515672.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223545215|gb|EEF46724.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 492
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 109/374 (29%), Positives = 158/374 (42%), Gaps = 46/374 (12%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + IGTP D + VDTGSD W QC CP E L++ K S + +
Sbjct: 85 LYYAKVGIGTPSKDYYVQVDTGSDIMWVNCIQCRECPRTSSLGMELTLYNIKDSVSGKLV 144
Query: 91 SCSSSQCAVVT----SNCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSG-LPVE 144
C C V S C+ C Y +YG G S ++G + + ++ SG L
Sbjct: 145 PCDEEFCYEVNGGPLSGCTANMSCPYLEIYGDG---SSTAGYFVKDVVQYDRVSGDLQTT 201
Query: 145 MPN--VIFGCGHKNLAS--PTSDSKQTGIIGLGPGNSSLISQMGTSIAGK--FSYCLPDQ 198
N VIFGCG + PTS+ GI+G G NSS+ISQ+ + K F++CL
Sbjct: 202 SSNGSVIFGCGARQSGDLGPTSEEALDGILGFGKSNSSMISQLAATRKVKKIFAHCL--- 258
Query: 199 GSSKINFGGIVAGAGVVS-----TPLIIRD-HYYLSLEAISVGNQRL-----EFVSSSTG 247
IN GGI A VV TPLI HY +++ A+ VG L EF +
Sbjct: 259 --DGINGGGIFAIGHVVQPKVNMTPLIPNQPHYNVNMTAVQVGEDFLHLPTEEFEAGDRK 316
Query: 248 NIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEV 307
+D+G LP + L +S +I QP V Y+ S FP V
Sbjct: 317 GAIIDSGTTLAYLPEIVYEPL---VSKIISQQPDLKVHIVRDEYTCFQYSGSVDDGFPNV 373
Query: 308 TIHFRGAD-VKLSPSNLFRNISDEIMCSAF-------RGGNANIVYGRIMQINFLIGYDI 359
T HF + +K+ P + + C + R + G ++ N L+ YD+
Sbjct: 374 TFHFENSVFLKVHPHEYLFPF-EGLWCIGWQNSGMQSRDRRNMTLLGDLVLSNKLVLYDL 432
Query: 360 EQAMVSFKPSRCTN 373
E + + C++
Sbjct: 433 ENQAIGWTEYNCSS 446
>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
Length = 434
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 114/378 (30%), Positives = 169/378 (44%), Gaps = 70/378 (18%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +GTPP VDTGSD W C P CP K +D K S++ + +
Sbjct: 35 LYFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPCIGCPAFSDLKIPIVPYDVKASASSSKV 94
Query: 91 SCSSSQCAVVT----SNCS-EGDCSYSFLYGRGAYASFSSGNLATETLTF--NSTSGLPV 143
CS C ++T S C+ + C YSF YG G S + G L + L + N+T+
Sbjct: 95 PCSDPSCTLITQISESGCNDQNQCGYSFQYGDG---SGTLGYLVEDVLHYMVNATA---- 147
Query: 144 EMPNVIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGTSIAGK----FSYCLPDQ 198
VIFGCG K TS+ GIIG G + S SQ+ GK F++CL D
Sbjct: 148 ---TVIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQ--GKTPNVFAHCL-DG 201
Query: 199 GSSKINFGGIVAGAGVVS-----TPLI-IRDHYYLSLEAISVGNQRL----EFVSSST-- 246
G GGI+ V+ TPL+ HY + L++ISV N L + S+
Sbjct: 202 GERG---GGILVLGNVIEPDIQYTPLVPYMSHYNVVLQSISVNNANLTIDPKLFSNDVMQ 258
Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ---PK 303
G IF D+G LP E + +S ++ +LC S+
Sbjct: 259 GTIF-DSGTTLAYLPDEAYQAFTQAVSLVVAPF-------------LLCDTRLSRFIYKL 304
Query: 304 FPEVTIHFRGADVKLSPSN-LFRNISDE---IMCSAFRG-GNAN-----IVYGRIMQINF 353
FP V ++F GA + L+P+ L R S I C ++ G+A ++G ++ N
Sbjct: 305 FPNVVLYFEGASMTLTPAEYLIRQASAANAPIWCMGWQSMGSAESELQYTIFGDLVLKNK 364
Query: 354 LIGYDIEQAMVSFKPSRC 371
L+ YD+E+ + ++P C
Sbjct: 365 LVVYDLERGRIGWRPFDC 382
>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 439
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 105/361 (29%), Positives = 164/361 (45%), Gaps = 45/361 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP +F +DT D W C DC P F P SSTY S+ CS
Sbjct: 99 YVVRVKLGTPGQLMFMVLDTSRDAAWVPCA-----DCAGCSSPTFSPNTSSTYASLQCSV 153
Query: 95 SQCAVVTS-NC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE-MPNVIFG 151
QC V +C + G + F G +SFS+ L+ ++L GL V+ +P+ FG
Sbjct: 154 PQCTQVRGLSCPTTGTAACFFNQTYGGDSSFSA-MLSQDSL------GLAVDTLPSYSFG 206
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG----SSKINFGG 207
C N S ++ Q G++GLG G SL+SQ G+ +G FSYC P S + G
Sbjct: 207 C--VNAVSGSTLPPQ-GLLGLGRGPMSLLSQSGSLYSGVFSYCFPSFKSYYFSGSLRLGP 263
Query: 208 IVAGAGVVSTPLIIRDH----YYLSLEAISVGN-------QRLEFVSSSTGNIFVDTGVL 256
+ + +TPL+ H YY++L +SVG + L F ++ +D+G +
Sbjct: 264 LGQPKNIRTTPLLRNPHRPTLYYVNLTGVSVGRVLVPVAPELLAFDPNTGAGTIIDSGTV 323
Query: 257 RTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADV 316
T + + + + + + Q VKG A G D C+ +++ P VT HF G D+
Sbjct: 324 IT----RFVEPVYAAIRDEFRKQ-VKGPFATIGAFDT-CFAATNEDIAPPVTFHFTGMDL 377
Query: 317 KLSPSN-LFRNISDEIMCSAFRGGNANI-----VYGRIMQINFLIGYDIEQAMVSFKPSR 370
KL N L + + + C A N+ V + Q N I +D+ + +
Sbjct: 378 KLPLENTLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNLRIMFDVTNSRLGIAREL 437
Query: 371 C 371
C
Sbjct: 438 C 438
>gi|226492633|ref|NP_001149953.1| LOC100283580 precursor [Zea mays]
gi|195635701|gb|ACG37319.1| pepsin A [Zea mays]
Length = 491
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 103/380 (27%), Positives = 171/380 (45%), Gaps = 59/380 (15%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + IG+PP + VDTGSD W +C+ CP E +DP S T ++
Sbjct: 83 LYYTRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPTRSGLGIELTQYDPAGSGT--TV 140
Query: 91 SCSSSQCAV-----VTSNC--SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG--- 140
C C V C + C + YG G S ++G T+ + +N SG
Sbjct: 141 GCEQEFCVANSAGGVPPTCPSTSSPCQFRITYGDG---STTTGFYVTDFVQYNQVSGNGQ 197
Query: 141 LPVEMPNVIFGCGHKNLASPTSDSKQT--GIIGLGPGNSSLISQMGTS--IAGKFSYCLP 196
++ FGCG + L S Q GI+G G +SS++SQ+ + + F++CL
Sbjct: 198 TTTSNASITFGCGAQ-LGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCL- 255
Query: 197 DQGSSKINFGGIVAGAGVV-----STPLIIR-DHYYLSLEAISVGNQRLE-----FVSSS 245
+ GGI A VV +TPL+ HY ++L+ ISVG L+ F S
Sbjct: 256 ----DTVRGGGIFAIGNVVQPKVKTTPLVPNVTHYNVNLQGISVGGATLQLPTSTFDSGD 311
Query: 246 TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPK 303
+ +D+G LP E + L + + + + P+ + D +C+ S
Sbjct: 312 SKGTIIDSGTTLAYLPREVYRTLLAAVFDKYQDLPLH------NYQDFVCFQFSGSIDDG 365
Query: 304 FPEVTIHFRGADVKLS--PSN-LFRNISDEIMCSAF-------RGGNANIVYGRIMQINF 353
FP +T F+G D+ L+ P + LF+N +D + C F + G ++ G ++ N
Sbjct: 366 FPVITFSFKG-DLTLNVYPDDYLFQNRND-LYCMGFLDGGVQTKDGKDMLLLGDLVLSNK 423
Query: 354 LIGYDIEQAMVSFKPSRCTN 373
L+ YD+E+ ++ + C++
Sbjct: 424 LVVYDLEKEVIGWTDYNCSS 443
>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
Length = 388
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 114/378 (30%), Positives = 169/378 (44%), Gaps = 70/378 (18%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +GTPP VDTGSD W C P CP K +D K S++ + +
Sbjct: 35 LYFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPCIGCPAFSDLKIPIVPYDVKASASSSKV 94
Query: 91 SCSSSQCAVVT----SNCS-EGDCSYSFLYGRGAYASFSSGNLATETLTF--NSTSGLPV 143
CS C ++T S C+ + C YSF YG G S + G L + L + N+T+
Sbjct: 95 PCSDPSCTLITQISESGCNDQNQCGYSFQYGDG---SGTLGYLVEDVLHYMVNATA---- 147
Query: 144 EMPNVIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGTSIAGK----FSYCLPDQ 198
VIFGCG K TS+ GIIG G + S SQ+ GK F++CL D
Sbjct: 148 ---TVIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQ--GKTPNVFAHCL-DG 201
Query: 199 GSSKINFGGIVAGAGVVS-----TPLI-IRDHYYLSLEAISVGNQRL----EFVSSST-- 246
G GGI+ V+ TPL+ HY + L++ISV N L + S+
Sbjct: 202 GERG---GGILVLGNVIEPDIQYTPLVPYMYHYNVVLQSISVNNANLTIDPKLFSNDVMQ 258
Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ---PK 303
G IF D+G LP E + +S ++ +LC S+
Sbjct: 259 GTIF-DSGTTLAYLPDEAYQAFTQAVSLVVAPF-------------LLCDTRLSRFIYKL 304
Query: 304 FPEVTIHFRGADVKLSPSN-LFRNISDE---IMCSAFRG-GNAN-----IVYGRIMQINF 353
FP V ++F GA + L+P+ L R S I C ++ G+A ++G ++ N
Sbjct: 305 FPNVVLYFEGASMTLTPAEYLIRQASAANAPIWCMGWQSMGSAESELQYTIFGDLVLKNK 364
Query: 354 LIGYDIEQAMVSFKPSRC 371
L+ YD+E+ + ++P C
Sbjct: 365 LVVYDLERGRIGWRPFDC 382
>gi|388498308|gb|AFK37220.1| unknown [Lotus japonicus]
Length = 363
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 70/177 (39%), Positives = 97/177 (54%), Gaps = 25/177 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +G + + +DTGSD TW QCEPC + C+ Q+ P+F P SS+Y SI C+S
Sbjct: 145 YIVTMELGGQDMTVI--IDTGSDLTWVQCEPC--MSCYNQQGPVFKPSTSSSYQSIPCNS 200
Query: 95 SQCA---VVTSNCSE-----GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
S C + T N +CSY+ YG G+Y ++G L E L+F S +
Sbjct: 201 STCQSLQLTTGNAGACESNPSNCSYAVNYGDGSY---TNGELGAEHLSFGGIS-----VS 252
Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSS 201
N +FGCG N S G++GLG N SLISQ ++ G FSYCLP D G+S
Sbjct: 253 NFVFGCGKNNKGLFGGVS---GLMGLGRSNLSLISQTNSTFGGVFSYCLPPTDAGAS 306
>gi|242051593|ref|XP_002454942.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
gi|241926917|gb|EES00062.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
Length = 431
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 106/361 (29%), Positives = 165/361 (45%), Gaps = 37/361 (10%)
Query: 32 DDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSIS 91
D+ Y + + IGTPP DT SD TWTQC D KQ PLFDP KSS++ ++
Sbjct: 88 DEGYTVTIGIGTPPQLHTLIADTASDLTWTQCNLFN--DTAKQVEPLFDPAKSSSFAFVT 145
Query: 92 CSSSQCAVV---TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
CSS C T CS C Y + Y + ++G LA E+ T S + + M +
Sbjct: 146 CSSKLCTEDNPGTKRCSNKTCRYVYPY----VSVEAAGVLAYESFTL-SDNNQHICM-SF 199
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSKINF 205
FGCG + S GI+G+ P S++SQ+ KFSYCL D+ SS + F
Sbjct: 200 GFGCGALTDGNLLGAS---GILGMSPAILSMVSQLAIP---KFSYCLTPYTDRKSSPLFF 253
Query: 206 GGIVAGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSSS----TGNIFVDTGVLRT 258
G +T I + +YY+ L +S+G +RL+ +++ G VD G
Sbjct: 254 GAWADLGRYKTTGPIQKSLTFYYYVPLVGLSLGTRRLDVPAATFALKQGGTVVDLGCTVG 313
Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS-----QPKFPEVTIHFRG 313
L + LK + + + P+ + +C+ + S + P + ++F G
Sbjct: 314 QLAEPAFTALKEAVLHTLNL-PLTNRTVK---DYKVCFALPSGVAMGAVQTPPLVLYFDG 369
Query: 314 -ADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
AD+ L N F+ + +MC A G + G + Q NF + +D+ + F P+ C
Sbjct: 370 GADMVLPRDNYFQEPTAGLMCLALVPGGGMSIIGNVQQQNFHLLFDVHDSKFLFAPTICD 429
Query: 373 N 373
+
Sbjct: 430 D 430
>gi|413938615|gb|AFW73166.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 386
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 117/357 (32%), Positives = 168/357 (47%), Gaps = 38/357 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPEL-DCFKQEPPLFDPKKSSTYNSISCS 93
Y++ S+GTP V VDTGSD +W QC+PC C+ Q+ PLFDP +SS+Y ++ C
Sbjct: 48 YVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPCG 107
Query: 94 SSQCAVV----TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
CA + S CS C Y YG G S ++G +++TLT +++S +
Sbjct: 108 GPVCAGLGIYAASACSAAQCGYVVSYGDG---SNTTGVYSSDTLTLSASS----AVQGFF 160
Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK----INF 205
FGCGH A + G++GLG SL+ Q + G FSYCLP + S+ +
Sbjct: 161 FGCGH---AQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGV 217
Query: 206 GGIVAGAGVVSTPLIIRD-----HYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTL 259
GG A ST ++ +Y + L ISVG Q+L +S+ G VDTG + T
Sbjct: 218 GGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVVTR 277
Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEP--GFSDVLCYNIS--SQPKFPEVTIHF-RGA 314
LP ++ L+S + + + G P G D CYN + P V + F GA
Sbjct: 278 LPPTAYAALRSAFRSGMAS---YGYPTAPSNGILDT-CYNFAGYGTVTLPNVALTFGSGA 333
Query: 315 DVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
V L + S GG A + G + Q +F + I+ V FKPS C
Sbjct: 334 TVTLGADGILSFGCLAFAPSGSDGGMA--ILGNVQQRSFEV--RIDGTSVGFKPSSC 386
>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
Length = 491
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 108/363 (29%), Positives = 161/363 (44%), Gaps = 46/363 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPC-PELDCFKQEPPLFDPKKSSTYNSISCS 93
+++ + +GTP DTGSD +W QC+PC C Q+ PLFDP KSSTY ++ C
Sbjct: 149 FVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAAVHCG 208
Query: 94 SSQCAVVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
QCA CSE + C Y YG G S ++G L+ +TL S+ L FG
Sbjct: 209 EPQCAAAGGLCSEDNTTCLYLVHYGDG---SSTTGVLSRDTLALTSSRAL----AGFPFG 261
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFGGIV 209
CG +NL + G++GLG G SL SQ S FSYCLP S+ + G
Sbjct: 262 CGTRNLG---DFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCLPSSNSTTGYLTIGATP 318
Query: 210 A-GAGVVSTPLIIRD-----HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLP 261
A G ++R Y++ L +I +G L + + G +D+G + T LP
Sbjct: 319 ATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYILPVPPAVFTRGGTLLDSGTVLTYLP 378
Query: 262 LEYHSNLKS----VMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQPKFPEVTIHFR--- 312
+ + L+ M A P +DVL CY+ + + + + FR
Sbjct: 379 AQAYELLRDRFRLTMERYTPAPP----------NDVLDACYDFAGESEVIVPAVSFRFGD 428
Query: 313 GADVKLSPSNLFRNISDEIMCSAF----RGGNANIVYGRIMQINFLIGYDIEQAMVSFKP 368
GA +L + + + + C AF GG + G Q + + YD+ + F P
Sbjct: 429 GAVFELDFFGVMIFLDENVGCLAFAAMDAGGLPLSIIGNTQQRSAEVIYDVAAEKIGFVP 488
Query: 369 SRC 371
+ C
Sbjct: 489 ASC 491
>gi|226509408|ref|NP_001141440.1| uncharacterized protein LOC100273550 precursor [Zea mays]
gi|194704586|gb|ACF86377.1| unknown [Zea mays]
gi|413938617|gb|AFW73168.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 478
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 117/357 (32%), Positives = 168/357 (47%), Gaps = 38/357 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPEL-DCFKQEPPLFDPKKSSTYNSISCS 93
Y++ S+GTP V VDTGSD +W QC+PC C+ Q+ PLFDP +SS+Y ++ C
Sbjct: 140 YVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPCG 199
Query: 94 SSQCAVV----TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
CA + S CS C Y YG G S ++G +++TLT +++S +
Sbjct: 200 GPVCAGLGIYAASACSAAQCGYVVSYGDG---SNTTGVYSSDTLTLSASS----AVQGFF 252
Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK----INF 205
FGCGH A + G++GLG SL+ Q + G FSYCLP + S+ +
Sbjct: 253 FGCGH---AQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGV 309
Query: 206 GGIVAGAGVVSTPLIIRD-----HYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTL 259
GG A ST ++ +Y + L ISVG Q+L +S+ G VDTG + T
Sbjct: 310 GGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVVTR 369
Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEP--GFSDVLCYNIS--SQPKFPEVTIHF-RGA 314
LP ++ L+S + + + G P G D CYN + P V + F GA
Sbjct: 370 LPPTAYAALRSAFRSGMASY---GYPTAPSNGILDT-CYNFAGYGTVTLPNVALTFGSGA 425
Query: 315 DVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
V L + S GG A + G + Q +F + I+ V FKPS C
Sbjct: 426 TVTLGADGILSFGCLAFAPSGSDGGMA--ILGNVQQRSFEV--RIDGTSVGFKPSSC 478
>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 106/382 (27%), Positives = 165/382 (43%), Gaps = 69/382 (18%)
Query: 39 LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQC- 97
L++GTPP ++ +DTGS+ +W C P + F F P+ SST+ ++ C+S+QC
Sbjct: 89 LAVGTPPQNVTMVLDTGSELSWLLCAPAGARNKFSAMS--FRPRASSTFAAVPCASAQCR 146
Query: 98 -----AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
+ + + CS S Y G S S G LAT+ F SG P+ FGC
Sbjct: 147 SRDLPSPPACDGASSRCSVSLSYADG---SSSDGALATD--VFAVGSGPPLR---AAFGC 198
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKI--------- 203
S G++G+ G S +SQ T +FSYC+ D+ + +
Sbjct: 199 MSSAFDSSPDGVASAGLLGMNRGALSFVSQASTR---RFSYCISDRDDAGVLLLGHSDLP 255
Query: 204 -----NFGGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSS-------STGNIFV 251
N+ + A + P R Y + L I VG + L +S G V
Sbjct: 256 TFLPLNYTPMYQPA--LPLPYFDRVAYSVQLLGIRVGGKHLPIPASVLAPDHTGAGQTMV 313
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFS-----DVLCYNI---SSQP- 302
D+G T L + +S LK+ + +A+P+ +P F+ D C+ + S P
Sbjct: 314 DSGTQFTFLLGDAYSALKAEFTR--QARPLLPALDDPSFAFQEAFDT-CFRVPQGRSPPT 370
Query: 303 -KFPEVTIHFRGADVKLSPSNLF------RNISDEIMCSAFRGGNANI------VYGRIM 349
+ P VT+ F GA++ ++ L R D + C F GNA++ V G
Sbjct: 371 ARLPGVTLLFNGAEMAVAGDRLLYKVPGERRGGDGVWCLTF--GNADMVPIMAYVIGHHH 428
Query: 350 QINFLIGYDIEQAMVSFKPSRC 371
Q+N + YD+E+ V P RC
Sbjct: 429 QMNVWVEYDLERGRVGLAPVRC 450
>gi|413938607|gb|AFW73158.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 478
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 117/357 (32%), Positives = 168/357 (47%), Gaps = 38/357 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPEL-DCFKQEPPLFDPKKSSTYNSISCS 93
Y++ S+GTP V VDTGSD +W QC+PC C+ Q+ PLFDP +SS+Y ++ C
Sbjct: 140 YVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFDPAQSSSYAAVPCG 199
Query: 94 SSQCAVV----TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
CA + S CS C Y YG G S ++G +++TLT +++S +
Sbjct: 200 GPVCAGLGIYAASACSAAQCGYVVSYGDG---SNTTGVYSSDTLTLSASS----AVQGFF 252
Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK----INF 205
FGCGH A + G++GLG SL+ Q + G FSYCLP + S+ +
Sbjct: 253 FGCGH---AQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGV 309
Query: 206 GGIVAGAGVVSTPLIIRD-----HYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTL 259
GG A ST ++ +Y + L ISVG Q+L +S+ G VDTG + T
Sbjct: 310 GGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVVTR 369
Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEP--GFSDVLCYNIS--SQPKFPEVTIHF-RGA 314
LP ++ L+S + + + G P G D CYN + P V + F GA
Sbjct: 370 LPPTAYAALRSAFRSGMASY---GYPTAPSNGILDT-CYNFAGYGTVTLPNVALTFGSGA 425
Query: 315 DVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
V L + S GG A + G + Q +F + I+ V FKPS C
Sbjct: 426 TVTLGADGILSFGCLAFAPSGSDGGMA--ILGNVQQRSFEV--RIDGTSVGFKPSSC 478
>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 488
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 101/375 (26%), Positives = 167/375 (44%), Gaps = 48/375 (12%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + IGTPP + + VDTGSD W QC+ CP + L+D K+SS+ +
Sbjct: 82 LYYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRSSLGMDLTLYDIKESSSGKLV 141
Query: 91 SCSSSQCAVVTSNCSEG-----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSG-LPVE 144
C C + G C Y +YG G S ++G + + ++ SG L +
Sbjct: 142 PCDQEFCKEINGGLLTGCTANISCPYLEIYGDG---SSTAGYFVKDIVLYDQVSGDLKTD 198
Query: 145 MPN--VIFGCGHKNLASPTSDSKQT--GIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQ 198
N ++FGCG + +S +++ GI+G G NSS+ISQ+ +S + F++CL
Sbjct: 199 SANGSIVFGCGARQSGDLSSSNEEALDGILGFGKANSSMISQLASSGKVKKMFAHCL--- 255
Query: 199 GSSKINFGGIVAGAGVVS-----TPLI-IRDHYYLSLEAISVGNQRLEFVSSSTGN---- 248
+ +N GGI A VV TPL+ + HY +++ A+ VG+ L + ++
Sbjct: 256 --NGVNGGGIFAIGHVVQPKVNMTPLLPDQPHYSVNMTAVQVGHTFLSLSTDTSAQGDRK 313
Query: 249 -IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEV 307
+D+G LP + L + MI P V Y+ S FP V
Sbjct: 314 GTIIDSGTTLAYLPEGIYEPL---VYKMISQHPDLKVQTLHDEYTCFQYSESVDDGFPAV 370
Query: 308 TIHFR-GADVKLSPSN-LFRNISDEIMCSAF-------RGGNANIVYGRIMQINFLIGYD 358
T F G +K+ P + LF +++ C + R + G ++ N L+ YD
Sbjct: 371 TFFFENGLSLKVYPHDYLFPSVN--FWCIGWQNSGTQSRDSKNMTLLGDLVLSNKLVFYD 428
Query: 359 IEQAMVSFKPSRCTN 373
+E + + C++
Sbjct: 429 LENQAIGWAEYNCSS 443
>gi|223942467|gb|ACN25317.1| unknown [Zea mays]
gi|413936886|gb|AFW71437.1| pepsin A [Zea mays]
Length = 491
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 103/380 (27%), Positives = 170/380 (44%), Gaps = 59/380 (15%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + IG+PP + VDTGSD W +C+ CP E +DP S T ++
Sbjct: 83 LYYTRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPTRSGLGIELTQYDPAGSGT--TV 140
Query: 91 SCSSSQCAV-----VTSNC--SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG--- 140
C C V C + C + YG G S ++G T+ + +N SG
Sbjct: 141 GCEQEFCVANSAGGVPPTCPSTSSPCQFRITYGDG---STTTGFYVTDFVQYNQVSGNGQ 197
Query: 141 LPVEMPNVIFGCGHKNLASPTSDSKQT--GIIGLGPGNSSLISQMGTS--IAGKFSYCLP 196
++ FGCG + L S Q GI+G G +SS++SQ+ + + F++CL
Sbjct: 198 TTTSNASITFGCGAQ-LGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCL- 255
Query: 197 DQGSSKINFGGIVAGAGVV-----STPLIIR-DHYYLSLEAISVGNQRLE-----FVSSS 245
+ GGI A VV +TPL+ HY ++L+ ISVG L+ F S
Sbjct: 256 ----DTVRGGGIFAIGNVVQPKVKTTPLVPNVTHYNVNLQGISVGGATLQLPTSTFDSGD 311
Query: 246 TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPK 303
+ +D+G LP E + L + + + + P+ + D +C+ S
Sbjct: 312 SKGTIIDSGTTLAYLPREVYRTLLAAVFDKYQDLPLH------NYQDFVCFQFSGSIDDG 365
Query: 304 FPEVTIHFRGADVKLS--PSN-LFRNISDEIMCSAF-------RGGNANIVYGRIMQINF 353
FP +T F G D+ L+ P + LF+N +D + C F + G ++ G ++ N
Sbjct: 366 FPVITFSFEG-DLTLNVYPDDYLFQNRND-LYCMGFLDGGVQTKDGKDMLLLGDLVLSNK 423
Query: 354 LIGYDIEQAMVSFKPSRCTN 373
L+ YD+E+ ++ + C++
Sbjct: 424 LVVYDLEKEVIGWTDYNCSS 443
>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
melo]
Length = 412
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 105/380 (27%), Positives = 174/380 (45%), Gaps = 69/380 (18%)
Query: 39 LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA 98
L++G+PP + +DTGS+ +W C+ P L +F+P SS+Y+ I CSS C
Sbjct: 44 LTVGSPPQQVTMVLDTGSELSWLHCKKSPNLTS------VFNPLSSSSYSPIPCSSPVCR 97
Query: 99 VVTSN------CSEGDCSYSFLYGRGAYASFSS--GNLATETLTFNSTSGLPVEMPNVIF 150
T + C ++ + +YA SS GNLA++ S++ +P +F
Sbjct: 98 TRTRDLPNPVTCDPKKLCHAIV----SYADASSLEGNLASDNFRIGSSA-----LPGTLF 148
Query: 151 GCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--------GSS 201
GC +S + D+K TG++G+ G+ S ++Q+G KFSYC+ + G S
Sbjct: 149 GCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLP---KFSYCISGRDSSGVLLFGDS 205
Query: 202 KINFGGIVAGAGVV--STPLIIRDH--YYLSLEAISVGNQRLEFVSS-------STGNIF 250
+++ G + +V STPL D Y + L+ I VGN+ L S G
Sbjct: 206 HLSWLGNLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTM 265
Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGF----SDVLCYNISSQPKFPE 306
VD+G T L ++ L++ K V +P F + LCY + + K PE
Sbjct: 266 VDSGTQFTFLLGPVYTALRNEFLEQTKG--VLAPLGDPNFVFQGAMDLCYRVPAGGKLPE 323
Query: 307 ---VTIHFRGADVKLSPSNLFRNI------SDEIMCSAFRGGNANI------VYGRIMQI 351
V++ FRGA++ + L + + + C F GN+++ V G Q
Sbjct: 324 LPAVSLMFRGAEMVVGGEVLLYKVPGMMKGKEWVYCLTF--GNSDLLGIEAFVIGHHHQQ 381
Query: 352 NFLIGYDIEQAMVSFKPSRC 371
N + +D+ ++ V F +RC
Sbjct: 382 NVWMEFDLVKSRVGFVETRC 401
>gi|359478045|ref|XP_002267046.2| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 502
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 106/373 (28%), Positives = 163/373 (43%), Gaps = 46/373 (12%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + IGTP D + VDTGSD W QC CP+ E L+D K+S T +
Sbjct: 97 LYYAKIGIGTPARDYYVQVDTGSDIMWVNCIQCNECPKKSSLGMELTLYDIKESLTGKLV 156
Query: 91 SCSSSQCAVVT----SNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG-LPVE 144
SC C + S C + CSY+ +Y G S S G + + ++ SG L
Sbjct: 157 SCDQDFCYAINGGPPSYCIANMSCSYTEIYADG---SSSFGYFVRDIVQYDQVSGDLETT 213
Query: 145 MPN--VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQGS 200
N VIFGC +S+ GI+G G N+S+ISQ+ +S + F++CL
Sbjct: 214 SANGSVIFGCSATQSGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCL----- 268
Query: 201 SKINFGGIVAGAGVV-----STPLI-IRDHYYLSLEAISVGNQRLEF------VSSSTGN 248
+N GGI A +V +TPL+ + HY ++++A+ VG L V G
Sbjct: 269 DGLNGGGIFAIGHIVQPKVNTTPLVPNQTHYNVNMKAVEVGGYFLNLPTDVFDVGDKKGT 328
Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVT 308
I +D+G LP + L +S + Q V Y+ S FP VT
Sbjct: 329 I-IDSGTTLAYLPEVVYDQL---LSKIFSWQSDLKVHTIHDQFTCFQYSESLDDGFPAVT 384
Query: 309 IHFRGA-DVKLSPSNLFRNISDEIMCSAFRGG------NANI-VYGRIMQINFLIGYDIE 360
HF + +K+ P + D + C ++ NI + G + N L+ YD+E
Sbjct: 385 FHFENSLYLKVHPHEYLFSY-DGLWCIGWQNSGMQSRDRRNITLLGDLALSNKLVLYDLE 443
Query: 361 QAMVSFKPSRCTN 373
++ + C++
Sbjct: 444 NQVIGWTEYNCSS 456
>gi|356499109|ref|XP_003518386.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 428
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 118/419 (28%), Positives = 184/419 (43%), Gaps = 79/419 (18%)
Query: 1 AQNSQK---LPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSD 57
+Q +QK LP +TP +S + + + L++G+PP ++ +DTGS+
Sbjct: 30 SQLTQKPLLLPLKTQTQTPSRKLSFHHNVTLT-------VSLTVGSPPQNVTMVLDTGSE 82
Query: 58 CTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCS---EGDCSYSFL 114
+W C+ P L+ F+P SS+Y C+SS C T + + D +
Sbjct: 83 LSWLHCKKLPNLNS------TFNPLLSSSYTPTPCNSSICTTRTRDLTIPASCDPNNKLC 136
Query: 115 YGRGAYASFSS--GNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTS----DSKQTG 168
+ +YA SS G LA ET + + P +FGC + A TS DSK TG
Sbjct: 137 HVIVSYADASSAEGTLAAETFSLAGAA-----QPGTLFGC--MDSAGYTSDINEDSKTTG 189
Query: 169 IIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAG--------------- 213
++G+ G+ SL++QM KFSYC+ S + G ++ G G
Sbjct: 190 LMGMNRGSLSLVTQMSLP---KFSYCI----SGEDALGVLLLGDGTDAPSPLQYTPLVTA 242
Query: 214 VVSTPLIIRDHYYLSLEAISVGNQRLE-----FVSSST--GNIFVDTGVLRTLLPLEYHS 266
S+P R Y + LE I V + L+ FV T G VD+G T L +S
Sbjct: 243 TTSSPYFNRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTGAGQTMVDSGTQFTFLLGSVYS 302
Query: 267 NLKSVMSNMIKAQPVKGVGAEPGF----SDVLCYNI-SSQPKFPEVTIHFRGADVKLSPS 321
+LK K V +P F + LCY+ +S P VT+ F GA++++S
Sbjct: 303 SLKDEFLEQTKG--VLTRIEDPNFVFEGAMDLCYHAPASFAAVPAVTLVFSGAEMRVSGE 360
Query: 322 NLFRNI---SDEIMCSAFRGGNANI------VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
L + SD + C F GN+++ V G Q N + +D+ ++ V F + C
Sbjct: 361 RLLYRVSKGSDWVYCFTF--GNSDLLGIEAYVIGHHHQQNVWMEFDLLKSRVGFTQTTC 417
>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 108/364 (29%), Positives = 163/364 (44%), Gaps = 52/364 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ +IGTP + ++DT +D W C C + C LFDP KSS+ ++ C +
Sbjct: 88 YIVRANIGTPAQAMLVALDTSNDAAWIPCSGC--VGC--SSSVLFDPSKSSSSRTLQCEA 143
Query: 95 SQCAVVTS-NCS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
QC + +C+ C ++ YG A ++ L +TLT + +PN FGC
Sbjct: 144 PQCKQAPNPSCTVSKSCGFNMTYGGSAIEAY----LTQDTLTLATD-----VIPNYTFGC 194
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGA 212
+K AS TS Q G++GLG G SLISQ FSYCLP+ SS NF G +
Sbjct: 195 INK--ASGTSLPAQ-GLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSS--NFSGSLRLG 249
Query: 213 ------GVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSS--------TGNIFVDTG 254
+ +TPL+ YY++L I VGN+ ++ +S+ G IF D+G
Sbjct: 250 PKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIF-DSG 308
Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGA 314
+ T L + +++ +K +G GF CY S FP VT F G
Sbjct: 309 TVYTRLVEPAYVAMRNEFRRRVKNANATSLG---GFDT--CY--SGSVVFPSVTFMFAGM 361
Query: 315 DVKLSPSNLF-RNISDEIMCSAFRGGNANI-----VYGRIMQINFLIGYDIEQAMVSFKP 368
+V L P NL + + + C A N+ V + Q N + D+ + +
Sbjct: 362 NVTLPPDNLLIHSSAGNLSCLAMAAAPTNVNSVLNVIASMQQQNHRVLIDVPNSRLGISR 421
Query: 369 SRCT 372
CT
Sbjct: 422 ETCT 425
>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
Length = 462
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 96/353 (27%), Positives = 148/353 (41%), Gaps = 48/353 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +GTPP +DTGSD W QC PC + C+ Q +FDP++S +Y ++ C +
Sbjct: 142 YFASVGVGTPPTPALLVLDTGSDVVWLQCAPCRQ--CYAQSGRVFDPRRSRSYAAVRCGA 199
Query: 95 SQC------AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
C + G C Y YG G S ++G+LATETL F + +P V
Sbjct: 200 PPCRGLDAGGGGGCDRRRGTCLYQVAYGDG---SVTAGDLATETLWFARGA----RVPRV 252
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGI 208
GCGH N + + G+ SL +Q +FSYC QGS
Sbjct: 253 AVGCGHDNEGLFVAAAGLLGLGRG---RLSLPTQTARRYGRRFSYCF--QGSD------- 300
Query: 209 VAGAGVVSTPLIIRD-HYYLSLEAIS-VGNQRLEF-VSSSTGNIFVDTGVLRTLLPLEYH 265
+ IIR H ++ + VG + L S+ G + +D+G T L +
Sbjct: 301 ------LDHRTIIRTVHQHVGGARVRGVGERSLRLDPSTGRGGVILDSGTSVTRLARPVY 354
Query: 266 SNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQP--KFPEVTIHFR-GADVKLSP 320
++ G+ PG + CY++ + K P V++H GA+V L P
Sbjct: 355 VAVREAFRAA-----AGGLRLAPGGFSLFDTCYDLRGRRVVKVPTVSVHLAGGAEVALPP 409
Query: 321 SNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
N + C A G + + + G I Q F + +D ++ V+ P C
Sbjct: 410 ENYLIPVDTRGTFCLALAGTDGGVSIVGNIQQQGFRVVFDGDRQRVALVPKSC 462
>gi|296089645|emb|CBI39464.3| unnamed protein product [Vitis vinifera]
Length = 477
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 106/371 (28%), Positives = 161/371 (43%), Gaps = 46/371 (12%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + IGTP D + VDTGSD W QC CP+ E L+D K+S T +
Sbjct: 97 LYYAKIGIGTPARDYYVQVDTGSDIMWVNCIQCNECPKKSSLGMELTLYDIKESLTGKLV 156
Query: 91 SCSSSQCAVVT----SNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG-LPVE 144
SC C + S C + CSY+ +Y G S S G + + ++ SG L
Sbjct: 157 SCDQDFCYAINGGPPSYCIANMSCSYTEIYADG---SSSFGYFVRDIVQYDQVSGDLETT 213
Query: 145 MPN--VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQGS 200
N VIFGC +S+ GI+G G N+S+ISQ+ +S + F++CL
Sbjct: 214 SANGSVIFGCSATQSGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCL----- 268
Query: 201 SKINFGGIVAGAGVV-----STPLI-IRDHYYLSLEAISVGNQRLEF------VSSSTGN 248
+N GGI A +V +TPL+ + HY ++++A+ VG L V G
Sbjct: 269 DGLNGGGIFAIGHIVQPKVNTTPLVPNQTHYNVNMKAVEVGGYFLNLPTDVFDVGDKKGT 328
Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVT 308
I +D+G LP + L +S + Q V Y+ S FP VT
Sbjct: 329 I-IDSGTTLAYLPEVVYDQL---LSKIFSWQSDLKVHTIHDQFTCFQYSESLDDGFPAVT 384
Query: 309 IHFRGA-DVKLSPSNLFRNISDEIMCSAFRGG------NANI-VYGRIMQINFLIGYDIE 360
HF + +K+ P + D + C ++ NI + G + N L+ YD+E
Sbjct: 385 FHFENSLYLKVHPHEYLFSY-DGLWCIGWQNSGMQSRDRRNITLLGDLALSNKLVLYDLE 443
Query: 361 QAMVSFKPSRC 371
++ + C
Sbjct: 444 NQVIGWTEYNC 454
>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 493
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 105/370 (28%), Positives = 169/370 (45%), Gaps = 39/370 (10%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +GTPPV+ +DTGSD W C CP+ + + FDP SST + I
Sbjct: 77 LYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCNGCPQTSGLQIQLNFFDPGSSSTSSMI 136
Query: 91 SCSSSQC------AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNST---SGL 141
+CS +C + T + CSY+F YG G S +SG ++ + N+ S
Sbjct: 137 ACSDQRCNNGKQSSDATCSSQNNQCSYTFQYGDG---SGTSGYYVSDMMHLNTIFEGSMT 193
Query: 142 PVEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS-IAGK-FSYCLP-D 197
V+FGC ++ T SD GI G G S+ISQ+ + IA + FS+CL D
Sbjct: 194 TNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRIFSHCLKGD 253
Query: 198 QGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNIFV 251
I G + +V T L+ + HY L+L++ISV Q L+ F +S++ V
Sbjct: 254 SSGGGILVLGEIVEPNIVYTSLVPAQPHYNLNLQSISVNGQTLQIDSSVFATSNSRGTIV 313
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVTI 309
D+G L E + S ++ I Q V+ V + CY I+S FP+V++
Sbjct: 314 DSGTTLAYLAEEAYDPFVSAITAAIP-QSVRTVVSRGN----QCYLITSSVTDVFPQVSL 368
Query: 310 HFR-GADVKLSPSNLFRNISD----EIMCSAFRG--GNANIVYGRIMQINFLIGYDIEQA 362
+F GA + L P + + + C F+ G + G ++ + ++ YD+
Sbjct: 369 NFAGGASMILRPQDYLIQQNSIGGAAVWCIGFQKIQGQGITILGDLVLKDKIVVYDLAGQ 428
Query: 363 MVSFKPSRCT 372
+ + C+
Sbjct: 429 RIGWANYDCS 438
>gi|21717162|gb|AAM76355.1|AC074196_13 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433286|gb|AAP54824.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 397
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 99/366 (27%), Positives = 171/366 (46%), Gaps = 53/366 (14%)
Query: 36 LMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSS 95
+ + +IGTPP +D + WTQC C + CFKQ+ P+F P SST+ C +
Sbjct: 55 VANFTIGTPPQAASAFIDLTGELVWTQCSQC--IHCFKQDLPVFVPNASSTFKPEPCGTD 112
Query: 96 QC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
C ++ T C+ C+Y + G G + + G +AT+T + + P + FGC
Sbjct: 113 VCKSIPTPKCASDVCAYDGVTGLGGH---TVGIVATDTFAIGTAA--PASLG---FGC-- 162
Query: 155 KNLASPTSDSK--QTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKINFGGIVA 210
+ + D+ +G IGLG SL++QM + +FSYCL D G + F G A
Sbjct: 163 --VVASDIDTMGGPSGFIGLGRTPWSLVAQMKLT---RFSYCLAPHDTGKNSRLFLGASA 217
Query: 211 --GAGVVSTPLI-------IRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLP 261
G TP + + +Y + LE I G+ + + + V T V+R L
Sbjct: 218 KLAGGGAWTPFVKTSPNDGMSQYYPIELEEIKAGDATIT-MPRGRNTVLVQTAVVRVSLL 276
Query: 262 LE--YHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFR-GADVKL 318
++ Y K+VM+++ A VGA F +C+ + P++ F+ GA + +
Sbjct: 277 VDSVYQEFKKAVMASVGAAPTATPVGAP--FE--VCFPKAGVSGAPDLVFTFQAGAALTV 332
Query: 319 SPSNLFRNISDEIMC-----------SAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFK 367
P+N ++ ++ +C +A G N + G Q N + +D+++ M+SF+
Sbjct: 333 PPANYLFDVGNDTVCLSVMSIALLNITALDGLN---ILGSFQQENVHLLFDLDKDMLSFE 389
Query: 368 PSRCTN 373
P+ C++
Sbjct: 390 PADCSS 395
>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 458
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 104/371 (28%), Positives = 157/371 (42%), Gaps = 47/371 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFK-QEPPLFDPKKSSTYNSISCS 93
Y+ +GTPP + ++D +D W C C L C P FDP +SSTY + C
Sbjct: 100 YVARARLGTPPQTLLVAIDPSNDAAWVPCSAC--LGCAPGASSPSFDPTQSSTYRPVRCG 157
Query: 94 SSQCAVV---TSNCSEG---DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
+ QCA V T +C G C+++ Y +S L + L+ + ++G V +
Sbjct: 158 APQCAQVPPATPSCPAGPGASCAFNLSYA----SSTLHAVLGQDALSLSDSNGAAVPDDH 213
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGG 207
FGC S S Q G++G G G S +SQ + FSYCLP SS NF G
Sbjct: 214 YTFGCLRVVTGSGGSVPPQ-GLVGFGRGPLSFLSQTKATYGSIFSYCLPSYKSS--NFSG 270
Query: 208 I--VAGAG----VVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSS--------TGNI 249
+ AG + +TPL+ H YY+++ + V + + +S+ G
Sbjct: 271 TLRLGPAGQPRRIKTTPLLSNPHRPSLYYVAMVGVRVNGKAVPIPASALALDAATGRGGT 330
Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTI 309
VD G + T L ++ L++ + A +G GF CY ++ P V
Sbjct: 331 IVDAGTMFTRLSPPAYAALRNAFRRGVSAPAAPALG---GFDT--CYYVNGTKSVPAVAF 385
Query: 310 HFR-GADVKLSPSN-LFRNISDEIMCSAFRGG-----NANI-VYGRIMQINFLIGYDIEQ 361
F GA V L N + + S + C A G NA + V + Q N + +D+
Sbjct: 386 VFAGGARVTLPEENVVISSTSGGVACLAMAAGPSDGVNAGLNVLASMQQQNHRVVFDVGN 445
Query: 362 AMVSFKPSRCT 372
V F CT
Sbjct: 446 GRVGFSRELCT 456
>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 103/373 (27%), Positives = 168/373 (45%), Gaps = 44/373 (11%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + IGTPP + + VDTGSD W QC+ CP + L+D K+SS+ +
Sbjct: 84 LYYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRSNLGMDLTLYDIKESSSGKFV 143
Query: 91 SCSSSQCAVVTSNCSEG-----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSG-LPVE 144
C C + G C Y +YG G S ++G + + ++ SG L +
Sbjct: 144 PCDQEFCKEINGGLLTGCTANISCPYLEIYGDG---SSTAGYFVKDIVLYDQVSGDLKTD 200
Query: 145 MPN--VIFGCGHKNLASPTSDSKQT--GIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQ 198
N ++FGCG + +S +++ GI+G G NSS+ISQ+ +S + F++CL
Sbjct: 201 SANGSIVFGCGARQSGDLSSSNEEALGGILGFGKANSSMISQLASSGKVKKMFAHCL--- 257
Query: 199 GSSKINFGGIVAGAGVVS-----TPLI-IRDHYYLSLEAISVGNQRLEFV--SSSTGN-- 248
+ +N GGI A VV TPL+ + HY +++ A+ VG+ L +S+ G+
Sbjct: 258 --NGVNGGGIFAIGHVVQPKVNMTPLLPDQPHYSVNMTAVQVGHAFLSLSTDTSTQGDRK 315
Query: 249 -IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEV 307
+D+G LP + L + +I P V Y+ S FP V
Sbjct: 316 GTIIDSGTTLAYLPEGIYEPL---VYKIISQHPDLKVRTLHDEYTCFQYSESVDDGFPAV 372
Query: 308 TIHFR-GADVK------LSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIE 360
T +F G +K L PS F I + + R + G ++ N L+ YD+E
Sbjct: 373 TFYFENGLSLKVYPHDYLFPSGDFWCIGWQNSGTQSRDSKNMTLLGDLVLSNKLVFYDLE 432
Query: 361 QAMVSFKPSRCTN 373
++ + C++
Sbjct: 433 NQVIGWTEYNCSS 445
>gi|125527370|gb|EAY75484.1| hypothetical protein OsI_03384 [Oryza sativa Indica Group]
Length = 453
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 108/374 (28%), Positives = 165/374 (44%), Gaps = 55/374 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y M IGTP + G DTGSD WT+C C C + P + P SS+ ++C
Sbjct: 92 YAMSFGIGTPATGLSGEADTGSDLIWTKCGACAR--CSPRGSPSYYPTSSSSAAFVACGD 149
Query: 95 SQCA---------VVTSNCSEGDCSYSFLYGRGA-YASFSSGNLATETLTFNSTSGLPVE 144
C V G+CSY + YG ++ G L TET TF +
Sbjct: 150 RTCGELPRPLCSNVAGGGSGSGNCSYHYAYGNARDTHHYTEGILMTETFTFGDDA---AA 206
Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS--SK 202
P + FGC L S +G++GLG G SL++Q+ F Y L S S
Sbjct: 207 FPGIAFGC---TLRSEGGFGTGSGLVGLGRGKLSLVTQLNVE---AFGYRLSSDLSAPSP 260
Query: 203 INFGGIVA-----GAGVVSTPL----IIRD--HYYLSLEAISVGNQRLEF--------VS 243
I+FG + G +STPL +++D YY+ L ISVG + ++ S
Sbjct: 261 ISFGSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQIPSGTFSFDRS 320
Query: 244 SSTGNIFVDTGVLRTLLPLEYHSNLK-SVMSNMIKAQPVKGVGAEPGFSDVLCYN-ISSQ 301
+ G + D+G T+LP ++ ++ ++S M +P + D++C+ SS
Sbjct: 321 TGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDD----DLICFTGGSST 376
Query: 302 PKFPEVTIHFR-GADVKLSPSNLF-----RNISDEIMCSAFRGGNANIVYGRIMQINFLI 355
FP + +HF GAD+ LS N +N S + A + G IMQ++F +
Sbjct: 377 TTFPSMVLHFDGGADMDLSTENYLPQMQGQNGETARCWSVVKSSQALTIIGNIMQMDFHV 436
Query: 356 GYDIE-QAMVSFKP 368
+D+ A + F+P
Sbjct: 437 VFDLSGNARMLFQP 450
>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 431
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 106/396 (26%), Positives = 171/396 (43%), Gaps = 51/396 (12%)
Query: 1 AQNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTW 60
A++ +L F + KS + I +I+ Y++ IGTP + ++DT +D W
Sbjct: 63 AKDQARLQFLSSLVARKSVVPIASGRQIVQ-SPTYIVRAKIGTPAQTMLLAMDTSNDAAW 121
Query: 61 TQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV-TSNCSEGDCSYSFLYGRGA 119
C C + C +F+ KS+T+ ++ C + QC V S C C+++ YG +
Sbjct: 122 IPCSGC--VGC---SSTVFNNVKSTTFKTVGCEAPQCKQVPNSKCGGSACAFNMTYGSSS 176
Query: 120 YASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDS-KQTGIIGLGPGNSS 178
A+ NL+ + +T + S +P+ FGC L T S G++GLG G S
Sbjct: 177 IAA----NLSQDVVTLATDS-----IPSYTFGC----LTEATGSSIPPQGLLGLGRGPMS 223
Query: 179 LISQMGTSIAGKFSYCLPD----QGSSKINFGGIVAGAGVVSTPLIIRDH----YYLSLE 230
L+SQ FSYCLP S + G + + +TPL+ YY++L
Sbjct: 224 LLSQTQNLYQSTFSYCLPSFRSLNFSGSLRLGPVGQPKRIKTTPLLKNPRRSSLYYVNLM 283
Query: 231 AISVGNQRLEFVSSS--------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVK 282
AI VG + ++ S+ G IF D+G + T L ++ ++ + V
Sbjct: 284 AIRVGRRVVDIPPSALAFNPTTGAGTIF-DSGTVFTRLVAPAYTAVRDAFRKRVGNATVT 342
Query: 283 GVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNLF-RNISDEIMCSAFRGGNA 341
+G GF CY +S P +T F G +V L P NL + + I C A
Sbjct: 343 SLG---GFDT--CY--TSPIVAPTITFMFSGMNVTLPPDNLLIHSTASSITCLAMAAAPD 395
Query: 342 NI-----VYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
N+ V + Q N I +D+ + + CT
Sbjct: 396 NVNSVLNVIANMQQQNHRILFDVPNSRLGVAREPCT 431
>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
Length = 473
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 98/352 (27%), Positives = 157/352 (44%), Gaps = 55/352 (15%)
Query: 52 VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV----------T 101
VDT S+ TW QC PC C Q+ PLFDP S +Y + C+SS C +
Sbjct: 142 VDTASELTWVQCAPCA--SCHDQQGPLFDPASSPSYAVLPCNSSSCDALQVATGSAAGAC 199
Query: 102 SNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPT 161
+ CSY+ Y G+Y S G LA + L+ + +FGCG N P
Sbjct: 200 GGGEQPSCSYTLSYRDGSY---SQGVLAHDKLSLAGEV-----IDGFVFGCGTSN-QGPF 250
Query: 162 SDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGVV----ST 217
+ +G++GLG SLISQ G FSYCLP + S + G +V G ST
Sbjct: 251 GGT--SGLMGLGRSQLSLISQTMDQFGGVFSYCLPLKESE--SSGSLVLGDDTSVYRNST 306
Query: 218 PLI----IRD-----HYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNL 268
P++ + D Y+++L I++G Q +E SS G + VD+G + T L ++ +
Sbjct: 307 PIVYTTMVSDPVQGPFYFVNLTGITIGGQEVE---SSAGKVIVDSGTIITSLVPSVYNAV 363
Query: 269 KSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFRG-ADVKLSPSNLF 324
K+ + P PGFS + C+N++ + + P + F G +V++ S +
Sbjct: 364 KAEFLSQFAEYP-----QAPGFSILDTCFNLTGFREVQIPSLKFVFEGNVEVEVDSSGVL 418
Query: 325 RNISDE-----IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+S + + ++ + + G Q N + +D + + F C
Sbjct: 419 YFVSSDSSQVCLALASLKSEYETSIIGNYQQKNLRVIFDTLGSQIGFAQETC 470
>gi|125571687|gb|EAZ13202.1| hypothetical protein OsJ_03122 [Oryza sativa Japonica Group]
Length = 453
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 108/374 (28%), Positives = 165/374 (44%), Gaps = 55/374 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y M IGTP + G DTGSD WT+C C C + P + P SS+ ++C
Sbjct: 92 YAMSFGIGTPATGLSGEADTGSDLIWTKCGACAR--CSPRGSPSYYPTSSSSAAFVACGD 149
Query: 95 SQCA---------VVTSNCSEGDCSYSFLYGRGA-YASFSSGNLATETLTFNSTSGLPVE 144
C V G+CSY + YG ++ G L TET TF +
Sbjct: 150 RTCGELPRPLCSNVAGGGSGSGNCSYHYAYGNARDTHHYTEGILMTETFTFGDDA---AA 206
Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS--SK 202
P + FGC L S +G++GLG G SL++Q+ F Y L S S
Sbjct: 207 FPGIAFGC---TLRSEGGFGTGSGLVGLGRGKLSLVTQLNVE---AFGYRLSSDLSAPSP 260
Query: 203 INFGGIVA-----GAGVVSTPL----IIRD--HYYLSLEAISVGNQRLEF--------VS 243
I+FG + G +STPL +++D YY+ L ISVG + ++ S
Sbjct: 261 ISFGSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQIPSGTFSFDRS 320
Query: 244 SSTGNIFVDTGVLRTLLPLEYHSNLK-SVMSNMIKAQPVKGVGAEPGFSDVLCYN-ISSQ 301
+ G + D+G T+LP ++ ++ ++S M +P + D++C+ SS
Sbjct: 321 TGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDD----DLICFTGGSST 376
Query: 302 PKFPEVTIHFR-GADVKLSPSNLF-----RNISDEIMCSAFRGGNANIVYGRIMQINFLI 355
FP + +HF GAD+ LS N +N S + A + G IMQ++F +
Sbjct: 377 TTFPSMVLHFDGGADMDLSTENYLPQMQGQNGETARCWSVVKSSQALTIIGNIMQMDFHV 436
Query: 356 GYDIE-QAMVSFKP 368
+D+ A + F+P
Sbjct: 437 VFDLSGNARMLFQP 450
>gi|242065058|ref|XP_002453818.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
gi|241933649|gb|EES06794.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
Length = 490
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 102/379 (26%), Positives = 171/379 (45%), Gaps = 57/379 (15%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + IG+P + VDTGSD W +C+ CP E +DP S T ++
Sbjct: 84 LYYTQIEIGSPSKGYYVQVDTGSDILWVNCIRCDGCPTTSGLGIELTQYDPAGSGT--TV 141
Query: 91 SCSSSQCAVVTSN-------CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
C C + N + C + YG G S ++G ++++ +N SG
Sbjct: 142 GCDQEFCVANSPNGLPPACPSTSSPCQFRIAYGDG---SSTTGFYVSDSVQYNQVSGNGQ 198
Query: 144 EMP---NVIFGCGHKNLASPTSDSKQT--GIIGLGPGNSSLISQMGTS--IAGKFSYCLP 196
P ++ FGCG + L S Q GI+G G +SS++SQ+ + + F++CL
Sbjct: 199 TTPSNASITFGCGAQ-LGGDLGSSSQALDGILGFGQADSSMLSQLAAARKVRKIFAHCL- 256
Query: 197 DQGSSKINFGGIVAGAGVV-----STPLIIR-DHYYLSLEAISVGNQRLE-----FVSSS 245
++ GGI A VV +TPL+ HY ++L+ ISVG L+ F S
Sbjct: 257 ----DTVHGGGIFAIGNVVQPKVKTTPLVQNVTHYNVNLQGISVGGATLQLPSSTFDSGD 312
Query: 246 TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPK 303
+ +D+G LP E + L + + + K Q + A + D +C+ S
Sbjct: 313 SKGTIIDSGTTLAYLPREVYRTLLTAVFD--KYQDL----ALHNYQDFVCFQFSGSIDDG 366
Query: 304 FPEVTIHFRGA-DVKLSPSN-LFRNISDEIMCSAF-------RGGNANIVYGRIMQINFL 354
FP VT F G + + P + LF+N +D + C F + G ++ G ++ N L
Sbjct: 367 FPVVTFSFEGEITLNVYPHDYLFQNEND-LYCMGFLDGGVQTKDGKDMVLLGDLVLSNKL 425
Query: 355 IGYDIEQAMVSFKPSRCTN 373
+ YD+E+ ++ + C++
Sbjct: 426 VVYDLEKQVIGWADYNCSS 444
>gi|357502759|ref|XP_003621668.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355496683|gb|AES77886.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 481
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 103/376 (27%), Positives = 168/376 (44%), Gaps = 50/376 (13%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + IGTP D + VDTG+D W QC+ CP + L++ K+SS+ +
Sbjct: 72 LYYAKIGIGTPSKDYYLQVDTGTDMMWVNCIQCKECPTRSNLGMDLTLYNIKESSSGKLV 131
Query: 91 SCSSSQCAVVTSNCSEG-------DCSYSFLYGRGAYASFSSGNLATETLTFNSTSG-LP 142
C C + G C Y +YG G S ++G + + F+ SG L
Sbjct: 132 PCDQELCKEINGGLLTGCTSKTNDSCPYLEIYGDG---SSTAGYFVKDVVLFDQVSGDLK 188
Query: 143 VEMPN--VIFGCGHKNLA--SPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLP 196
N VIFGCG + S +++ GI+G G N S+ISQ+ +S + F++CL
Sbjct: 189 TASANGSVIFGCGARQSGDLSYSNEEALDGILGFGKANYSMISQLSSSGKVKKMFAHCL- 247
Query: 197 DQGSSKINFGGIVAGAGVV-----STPLI-IRDHYYLSLEAISVGNQRLEFVS------S 244
+ +N GGI A VV +TPL+ + HY +++ AI VG+ L +
Sbjct: 248 ----NGVNGGGIFAIGHVVQPTVNTTPLLPDQPHYSVNMTAIQVGHTFLNLSTDASEQRD 303
Query: 245 STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKF 304
S G I +D+G LP + L + ++ QP V Y+ S F
Sbjct: 304 SKGTI-IDSGTTLAYLPDGIYQPL---VYKILSQQPNLKVQTLHDEYTCFQYSGSVDDGF 359
Query: 305 PEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNAN-------IVYGRIMQINFLIG 356
P VT +F G +K+ P + + +S+ + C ++ A + G ++ N L+
Sbjct: 360 PNVTFYFENGLSLKVYPHD-YLFLSENLWCIGWQNSGAQSRDSKNMTLLGDLVLSNKLVF 418
Query: 357 YDIEQAMVSFKPSRCT 372
YD+E ++ + C+
Sbjct: 419 YDLENQVIGWTEYNCS 434
>gi|326500408|dbj|BAK06293.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 475
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 97/344 (28%), Positives = 150/344 (43%), Gaps = 42/344 (12%)
Query: 51 SVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV-------TSN 103
++DT D W QC PCP C+ Q PLFDP SST ++ C S C + ++
Sbjct: 151 AIDTTVDVPWIQCAPCPIPQCYPQRDPLFDPTTSSTAAAVRCRSPACRSLGPYGNGCSNR 210
Query: 104 CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSD 163
+ +C Y Y ++G T+TLT + T+ + N FGC H + SD
Sbjct: 211 SANAECRYLIEYSDD---RATAGTYMTDTLTISGTTAV----RNFRFGCSHA-VRGRFSD 262
Query: 164 SKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK-INFGG--IVAGAGVVSTPLI 220
G + LG G SL++Q S+ FSYC+P +S ++ GG V +T +
Sbjct: 263 -LTAGTMSLGGGAQSLLAQTARSLGNAFSYCVPQASASGFLSIGGPATTNSTTVFATTPL 321
Query: 221 IRDH-----YYLSLEAISVGNQRLEF--VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMS 273
+R Y + L+ I V +RL V+ S G + + V+ L P Y + L+
Sbjct: 322 VRSAINPSLYLVRLQGIVVAGRRLGIPPVAFSAGAVMDSSAVITQLPPTAYRA-LRRAFR 380
Query: 274 NMIKAQPVKGVGAEPGFSDVLCYNI--SSQPKFPEVTIHF-RGADVKLSPSNLFRNISDE 330
N ++A P G G D CY+ + + P V++ F GA V L P +
Sbjct: 381 NAMRAYPRSGA---TGTLDT-CYDFLGLTNVRVPAVSLVFGGGAVVVLDPPAVMIG---- 432
Query: 331 IMCSAFRGGNANIVY---GRIMQINFLIGYDIEQAMVSFKPSRC 371
C AF ++++ G + Q + YD+ V F+ C
Sbjct: 433 -GCLAFTATSSDLALGFIGNVQQQTHEVLYDVAAGGVGFRRGAC 475
>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
Length = 472
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 98/352 (27%), Positives = 157/352 (44%), Gaps = 55/352 (15%)
Query: 52 VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV----------T 101
VDT S+ TW QC PC C Q+ PLFDP S +Y + C+SS C +
Sbjct: 141 VDTASELTWVQCAPCA--SCHDQQGPLFDPASSPSYAVLPCNSSSCDALQVATGSAAGAC 198
Query: 102 SNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPT 161
+ CSY+ Y G+Y S G LA + L+ + +FGCG N P
Sbjct: 199 GGGEQPSCSYTLSYRDGSY---SQGVLAHDKLSLAGEV-----IDGFVFGCGTSN-QGPF 249
Query: 162 SDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGVV----ST 217
+ +G++GLG SLISQ G FSYCLP + S + G +V G ST
Sbjct: 250 GGT--SGLMGLGRSQLSLISQTMDQFGGVFSYCLPLKESE--SSGSLVLGDDTSVYRNST 305
Query: 218 PLI----IRD-----HYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNL 268
P++ + D Y+++L I++G Q +E SS G + VD+G + T L ++ +
Sbjct: 306 PIVYTTMVSDPVQGPFYFVNLTGITIGGQEVE---SSAGKVIVDSGTIITSLVPSVYNAV 362
Query: 269 KSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFRG-ADVKLSPSNLF 324
K+ + P PGFS + C+N++ + + P + F G +V++ S +
Sbjct: 363 KAEFLSQFAEYP-----QAPGFSILDTCFNLTGFREVQIPSLKFVFEGNVEVEVDSSGVL 417
Query: 325 RNISDE-----IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+S + + ++ + + G Q N + +D + + F C
Sbjct: 418 YFVSSDSSQVCLALASLKSEYETSIIGNYQQKNLRVIFDTLGSQIGFAQETC 469
>gi|226532674|ref|NP_001151415.1| pepsin A precursor [Zea mays]
gi|195646632|gb|ACG42784.1| pepsin A [Zea mays]
Length = 492
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 100/377 (26%), Positives = 171/377 (45%), Gaps = 51/377 (13%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + IG+PP + VDTGSD W C+ CP E +DP S T ++
Sbjct: 84 LYYTRIEIGSPPKGYYVQVDTGSDILWVNGISCDGCPTRSGLGIELTQYDPAGSGT--TV 141
Query: 91 SCSSSQCAV------VTSNC--SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
C C V C + C + YG G S ++G T+ + +N SG
Sbjct: 142 GCEQEFCVANSAASGVPPACPSAASPCQFRITYGDG---SSTTGFYVTDFVQYNQVSGNG 198
Query: 143 VEMP---NVIFGCGHKNLASPTSDSKQT--GIIGLGPGNSSLISQMGTS--IAGKFSYCL 195
P ++ FGCG + L S Q GI+G G ++S++SQ+ + + F++CL
Sbjct: 199 QTTPSNVSITFGCGAQ-LGGDLGSSSQALDGILGFGQSDASMLSQLAAARKVRKIFAHCL 257
Query: 196 PD-QGSSKINFGGIVAGAGVVSTPLIIR-DHYYLSLEAISVGNQRLE-----FVSSSTGN 248
+G G +V V +TPL+ HY ++L+ ISVG L+ F S +
Sbjct: 258 DTVRGGGIFAIGNVVQPPIVKTTPLVPNATHYNVNLQGISVGGATLQLPTSTFDSGDSKG 317
Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPE 306
+D+G LP E + +++++ + P V + D +C+ S +FP
Sbjct: 318 TIIDSGTTLAYLPREVY---RTLLTAVFDKHPDLAV---RNYEDFICFQFSGSLDEEFPV 371
Query: 307 VTIHFRGADVKLS--PSN-LFRNISDEIMCSAF-------RGGNANIVYGRIMQINFLIG 356
+T F G D+ L+ P + LF+N +++ C F + G ++ G ++ N L+
Sbjct: 372 ITFSFEG-DLTLNVYPHDYLFQN-GNDLYCMGFLDGGVQTKDGKDMVLLGDLVLSNKLVV 429
Query: 357 YDIEQAMVSFKPSRCTN 373
YD+E+ ++ + C++
Sbjct: 430 YDLEKQVIGWTDYNCSS 446
>gi|357159298|ref|XP_003578403.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 442
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 107/379 (28%), Positives = 164/379 (43%), Gaps = 47/379 (12%)
Query: 26 AEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCE-PCPELDCFKQEPPLFDPKKS 84
A++ Y+ IG+PP +DTGSD WTQC C C KQ P ++ +S
Sbjct: 77 AQVHRATRQYIASYLIGSPPQRTEALIDTGSDLIWTQCATTCLPKSCAKQGLPYYNLSQS 136
Query: 85 STYNSISCSSSQ--CAV--VTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG 140
ST+ + C+ CA V +G C++ YG G G+L TE+ F S +
Sbjct: 137 STFVPVPCADKAGFCAANGVHLCGLDGSCTFIASYGAGRVI----GSLGTESFAFESGT- 191
Query: 141 LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---- 196
++ FGC + + + +G+IGLG G SL+SQ+G A +FSYCL
Sbjct: 192 -----TSLAFGCVSLTRITSGALNDASGLIGLGRGRLSLVSQIG---ATRFSYCLTPYFH 243
Query: 197 DQGSSKINF--GGIVAGAGVVSTPLII--RDH-----YYLSLEAISVGNQRLEFVSSST- 246
G+S F G G S P + +D+ YYL LE I+VG RL V+S+T
Sbjct: 244 SSGASSHLFVGASASLGGGGASMPFVKSPKDYPYSTFYYLPLEGITVGKTRLPAVNSTTF 303
Query: 247 -----------GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLC 295
G + +DTG T L + LK ++ + + + G LC
Sbjct: 304 QLRQLFKGYWAGGVIIDTGSPLTQLASHAYEALKEEVAAQLGNGSLVPAPEDSGLE--LC 361
Query: 296 YNISS-QPKFPEVTIHF-RGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINF 353
Q P + HF GAD+ + ++ + + C G + + G Q +
Sbjct: 362 VAREGFQKVVPALVFHFGGGADMAVPAASYWAPVDKAAACMMILEGGYDSIIGNFQQQDM 421
Query: 354 LIGYDIEQAMVSFKPSRCT 372
+ YD+ + SF+ + CT
Sbjct: 422 HLLYDLRRGRFSFQTADCT 440
>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
Length = 425
Score = 107 bits (267), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 110/376 (29%), Positives = 164/376 (43%), Gaps = 52/376 (13%)
Query: 23 IYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPK 82
I I Y++ +IGTP + ++DT +D W C C + C LFDP
Sbjct: 76 IASGRAIVQSPTYIVRANIGTPAQPMLVALDTSNDAAWIPCSGC--VGC--SSSVLFDPS 131
Query: 83 KSSTYNSISCSSSQCAVVTS-NCS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG 140
KSS+ ++ C + QC + +C+ C ++ YG ++ L +TLT S
Sbjct: 132 KSSSSRTLQCEAPQCKQAPNPSCTVSKSCGFNMTYGGSTIEAY----LTQDTLTLASD-- 185
Query: 141 LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS 200
+PN FGC +K AS TS Q G++GLG G SLISQ FSYCLP+ S
Sbjct: 186 ---VIPNYTFGCINK--ASGTSLPAQ-GLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKS 239
Query: 201 SKINFGGIVAGA------GVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSS----- 245
S NF G + + +TPL+ YY++L I VGN+ ++ +S+
Sbjct: 240 S--NFSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDP 297
Query: 246 ---TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP 302
G IF D+G + T L + +++ +K +G GF CY S
Sbjct: 298 ATGAGTIF-DSGTVYTRLVEPAYVAVRNEFRRRVKNANATSLG---GFDT--CY--SGSV 349
Query: 303 KFPEVTIHFRGADVKLSPSNLF-RNISDEIMCSAFRGGNANI-----VYGRIMQINFLIG 356
FP VT F G +V L P NL + + + C A N+ V + Q N +
Sbjct: 350 VFPSVTFMFAGMNVTLPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVL 409
Query: 357 YDIEQAMVSFKPSRCT 372
D+ + + CT
Sbjct: 410 IDVPNSRLGISRETCT 425
>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 107 bits (267), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 110/376 (29%), Positives = 164/376 (43%), Gaps = 52/376 (13%)
Query: 23 IYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPK 82
I I Y++ +IGTP + ++DT +D W C C + C LFDP
Sbjct: 76 IASGRAIVQSPTYIVRANIGTPAQPMLVALDTSNDAAWIPCSGC--VGC--SSSVLFDPS 131
Query: 83 KSSTYNSISCSSSQCAVVTS-NCS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG 140
KSS+ ++ C + QC + +C+ C ++ YG ++ L +TLT S
Sbjct: 132 KSSSSRTLQCEAPQCKQAPNPSCTVSKSCGFNMTYGGSTIEAY----LTQDTLTLASD-- 185
Query: 141 LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS 200
+PN FGC +K AS TS Q G++GLG G SLISQ FSYCLP+ S
Sbjct: 186 ---VIPNYTFGCINK--ASGTSLPAQ-GLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKS 239
Query: 201 SKINFGGIVAGA------GVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSS----- 245
S NF G + + +TPL+ YY++L I VGN+ ++ +S+
Sbjct: 240 S--NFSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDP 297
Query: 246 ---TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP 302
G IF D+G + T L + +++ +K +G GF CY S
Sbjct: 298 ATGAGTIF-DSGTVYTRLVEPAYVAVRNEFRRRVKNANATSLG---GFDT--CY--SGSV 349
Query: 303 KFPEVTIHFRGADVKLSPSNLF-RNISDEIMCSAFRGGNANI-----VYGRIMQINFLIG 356
FP VT F G +V L P NL + + + C A N+ V + Q N +
Sbjct: 350 VFPSVTFMFAGMNVTLPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVL 409
Query: 357 YDIEQAMVSFKPSRCT 372
D+ + + CT
Sbjct: 410 IDVPNSRLGISRETCT 425
>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
gi|194700872|gb|ACF84520.1| unknown [Zea mays]
Length = 351
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 101/344 (29%), Positives = 148/344 (43%), Gaps = 49/344 (14%)
Query: 52 VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNCSEGD 108
+D+ SD W QC PCP C Q +DP +S T + SCSS C + + C+
Sbjct: 33 LDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPTSAAFSCSSPTCTALGPYANGCANNQ 92
Query: 109 CSYSFLYGRGAYASFSSGNLATETLTF---NSTSGLPVEMPNVIFGCGHKNLASPTSDSK 165
C Y Y G S +SG + LT N+ SG FGC H S D++
Sbjct: 93 CQYLVRYPDG---SSTSGAYIADLLTLDAGNAVSGFK-------FGCSHAEQGS--FDAR 140
Query: 166 QTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIV---AGAGVVSTPLI-- 220
GI+ LG G SL+SQ + FSYC+P S F V A + V TP++
Sbjct: 141 AAGIMALGGGPESLLSQTASRYGNAFSYCIPATASDSGFFTLGVPRRASSRYVVTPMVRF 200
Query: 221 --IRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMI 276
Y + L I+VG QRL + + G++ + L P Y + + S+M
Sbjct: 201 RQAATFYGVLLRTITVGGQRLGVAPAVFAAGSVLDSRTAITRLPPTAYQALRAAFRSSMT 260
Query: 277 --KAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHF-RGADVKLSPSNLFRNISDEI 331
++ P K G+ D CY+ + + P++++ F R A + L PS + N
Sbjct: 261 MYRSAPPK------GYLDT-CYDFTGVVNIRLPKISLVFDRNAVLPLDPSGILFND---- 309
Query: 332 MCSAFRGGNANI----VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
C AF NA+ V G + Q + YD+ V F+ C
Sbjct: 310 -CLAFT-SNADDRMPGVLGSVQQQTIEVLYDVGGGAVGFRQGAC 351
>gi|125548492|gb|EAY94314.1| hypothetical protein OsI_16081 [Oryza sativa Indica Group]
Length = 417
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 72/243 (29%), Positives = 115/243 (47%), Gaps = 29/243 (11%)
Query: 14 ETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFK 73
E + +++ + I+ YL+ L IGTPP ++DT SD WTQC+PC C+
Sbjct: 68 EAASARKAVVAETPIMPAGGEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQPC--TGCYH 125
Query: 74 QEPPLFDPKKSSTYNSISCSSSQC-AVVTSNCSEGD---CSYSFLYGRGAYASFSSGNLA 129
Q P+F+P+ SSTY ++ CSS C + C D C Y++ Y A + G LA
Sbjct: 126 QVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHDDDESCQYTYTYSGNA---TTEGTLA 182
Query: 130 TETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG 189
+ L + V FGC + + +G++GLG G SL+SQ+
Sbjct: 183 VDKLVIGEDA-----FRGVAFGCSTSSTGG-APPPQASGVVGLGRGPLSLVSQLSVR--- 233
Query: 190 KFSYCLPDQGSS---KINFGGIVAGAGVVSTPLII---RD-----HYYLSLEAISVGNQR 238
+F+YCLP S K+ G A + + + RD +YYL+L+ + +G++
Sbjct: 234 RFAYCLPPPASRIPGKLVLGADADAARNATNRIAVPMRRDPRYPSYYYLNLDGLLIGDRT 293
Query: 239 LEF 241
+
Sbjct: 294 MSL 296
>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 442
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 107/382 (28%), Positives = 180/382 (47%), Gaps = 69/382 (18%)
Query: 39 LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP-PLFDPKKSSTYNSISCSSSQC 97
+++GTPP ++ +DTGS+ +W C + P P F+P SS+Y ISCSS C
Sbjct: 70 ITVGTPPQNMSMVIDTGSELSWLHC----NTNTTATIPYPFFNPNISSSYTPISCSSPTC 125
Query: 98 AVVT------SNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
T ++C + ++ L A AS S GNLA++T F S+ P ++FG
Sbjct: 126 TTRTRDFPIPASCDSNNLCHATL--SYADASSSEGNLASDTFGFGSSFN-----PGIVFG 178
Query: 152 CGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--------GSSK 202
C + + ++ + SDS TG++G+ G+ SL+SQ+ KFSYC+ G S
Sbjct: 179 CMNSSYSTNSESDSNTTGLMGMNLGSLSLVSQLKIP---KFSYCISGSDFSGILLLGESN 235
Query: 203 INFGGIVAGAGVV--STPL--IIRDHYYLSLEAISVGNQRLEFVSSSTGNIFV--DTGVL 256
++GG + +V STPL R Y + LE I + ++ L +GN+FV TG
Sbjct: 236 FSWGGSLNYTPLVQISTPLPYFDRSAYTVRLEGIKISDKLLNI----SGNLFVPDHTGAG 291
Query: 257 RTLLPL-EYHSNLKSVMSNMIKAQPVKGVGA------EPGF----SDVLCYNI----SSQ 301
+T+ L S L + N ++ + + +P F + LCY + S
Sbjct: 292 QTMFDLGTQFSYLLGPVYNALRDEFLNQTNGTLRALDDPNFVFQIAMDLCYRVPVNQSEL 351
Query: 302 PKFPEVTIHFRGADVKLSPSNLFRNI------SDEIMCSAFRGGNANI------VYGRIM 349
P+ P V++ F GA++++ L + +D + C F GN+++ + G
Sbjct: 352 PELPSVSLVFEGAEMRVFGDQLLYRVPGFVWGNDSVYCFTF--GNSDLLGVEAFIIGHHH 409
Query: 350 QINFLIGYDIEQAMVSFKPSRC 371
Q + + +D+ + V +RC
Sbjct: 410 QQSMWMEFDLVEHRVGLAHARC 431
>gi|357143328|ref|XP_003572882.1| PREDICTED: uncharacterized protein LOC100846829 [Brachypodium
distachyon]
Length = 836
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 113/356 (31%), Positives = 168/356 (47%), Gaps = 38/356 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ +S+GTP V VDTGSD +W QC PC C+ Q+ LFDP KSS+Y+++ C++
Sbjct: 500 YVVTVSLGTPGVAQTVEVDTGSDVSWVQCAPCAAPACYAQKDQLFDPAKSSSYSAVPCAA 559
Query: 95 SQCAVVTS---NCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
C+ +++ C+ G C Y YG G S ++G ++TLT + +F
Sbjct: 560 DACSELSTYGHGCAAGSQCGYVVSYGDG---SNTTGVYGSDTLTLTDADAV----TGFLF 612
Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQM-GTSIAGKFSYCLPDQGSSK--INFGG 207
GCGH A + G++ LG SL SQ G G FSYCLP SS + GG
Sbjct: 613 GCGH---AQAGLFAGIDGLLALGRKGMSLTSQTSGAYGGGVFSYCLPPSPSSTGFLTLGG 669
Query: 208 IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS--TGNIFVDTGVLRTLLP 261
+ +G +T L+ + Y + L I VG Q+L V +S G VDTG + T LP
Sbjct: 670 PSSASGFATTGLLTAWDVPTFYMVMLTGIGVGGQQLSGVPASAFAGGTVVDTGTVITRLP 729
Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEP--GFSDVLCYNIS--SQPKFPEVTIHFR-GADV 316
+ ++ + A G A P G D CYN + P V++ F GA +
Sbjct: 730 ---PTAYAALRAAFRAAMAPYGYPAAPATGILDT-CYNFTDYGTVTLPTVSLTFSGGATL 785
Query: 317 KLSPSNLFRNISDEIMCSAFRGGNAN-IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
KL +S + A G+ + + G + Q +F + +D + V F P C
Sbjct: 786 KLDAPGF---LSSGCLAFATNSGDGDPAILGNVQQRSFAVRFD--GSSVGFMPHSC 836
>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 477
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 107/358 (29%), Positives = 170/358 (47%), Gaps = 38/358 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
+++ + GTP +DTGSD +W QC+PC C++Q P FDP KSS+Y ++ C +
Sbjct: 137 FVVVVGFGTPAQTAAIILDTGSDLSWIQCKPC-SGHCYRQHDPDFDPAKSSSYAAVPCGT 195
Query: 95 SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
CA C+ C Y YG G S ++G L+ +TLTFNS+S + FGCG
Sbjct: 196 PVCAAAGGMCNGTTCLYGVQYGDG---SSTTGVLSRDTLTFNSSS----KFTGFTFGCGE 248
Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFGGI--VA 210
KN+ + G++GLG G SL SQ S G FSYCLP ++ +N G +
Sbjct: 249 KNIG---DFGEVDGLLGLGRGKLSLPSQAAPSFGGVFSYCLPSYNTTPGYLNIGATKPTS 305
Query: 211 GAGVVSTPLIIRDH----YYLSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTLLPLE 263
V T +I + Y++ L +I++G L V + TG + +D+G + T LP
Sbjct: 306 TVPVQYTAMIKKPQYPSFYFIELVSINIGGYILPVPPSVFTKTGTL-LDSGTILTYLPPP 364
Query: 264 YHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQPK--FPEVTIHFR-GA--DVK 317
+++L+ + +G P + + CY+ + Q P V+ +F GA D+
Sbjct: 365 AYTSLRDRFKFTM-----QGNKPAPPYEPLDTCYDFTGQGAIVIPAVSFNFSDGAVFDLD 419
Query: 318 LSPSNLFRNISDEIM-CSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+F + + ++ C AF A + + G Q + YD+ + F P C
Sbjct: 420 FYGIMIFPDDAKPLIGCLAFVSRPAAMPFSIVGNTQQRAAEVIYDVPSQKIGFIPISC 477
>gi|414587000|tpg|DAA37571.1| TPA: hypothetical protein ZEAMMB73_036171 [Zea mays]
Length = 459
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 99/404 (24%), Positives = 171/404 (42%), Gaps = 79/404 (19%)
Query: 21 SIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFD 80
++ +A ++ YL+ L GTP ++DT SD W QC+PC + C++Q P+F+
Sbjct: 78 AVASEAPLVPGGGEYLVKLGTGTPQHFFSAAIDTASDLVWMQCQPC--VSCYRQLDPVFN 135
Query: 81 PKKSSTYNSISCSSSQCAVVTSN-CSEGD---CSYSFLY-GRGAYASFSSGNLATETLTF 135
PK SS+Y + C+S CA + + C E D C Y++ Y G G + G LA + L
Sbjct: 136 PKLSSSYAVVPCTSDTCAQLDGHRCHEDDDGACQYTYKYSGHGV----TKGTLAIDKLAI 191
Query: 136 NSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL 195
V+FGC ++ P ++ +G++GLG G SL+SQ+ +F YCL
Sbjct: 192 GGDV-----FHAVVFGCSDSSVGGPA--AQASGLVGLGRGPLSLVSQLSVH---RFMYCL 241
Query: 196 PDQGSSKINFGGIVAGAGV-------------VSTPLIIRDHYYLSLEAISVGNQRLEFV 242
P S G +V GAG +S+ +YYL+L+ ++VG+Q
Sbjct: 242 PPPMSR--TSGKLVLGAGADAVRNMSDRVTVTMSSSTRYPSYYYLNLDGLAVGDQTPGTT 299
Query: 243 SSSTG--------------------------NIFVDTGVLRTLLPLEYHSNLKSVMSNMI 276
++T + VD + L + L + I
Sbjct: 300 RNATSPPSGGAGGGGGGGGGGIVGAGGANAYGMIVDVASTISFLETSLYDELADDLEEEI 359
Query: 277 ---KAQPVKGVGAEPGFSDVLCYNISS-----QPKFPEVTIHFRGADVKLSPSNLFRNIS 328
+A P +G + LC+ + + P V++ F G ++L LF ++
Sbjct: 360 RLPRATPSLRLGLD------LCFILPEGVGMDRVYVPTVSLSFDGRWLELDRDRLF--VT 411
Query: 329 D-EIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
D +MC + + G N + +++ + ++F + C
Sbjct: 412 DGRMMCLMIGRTSGVSILGNFQLQNMRVLFNLRRGKITFAKASC 455
>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 104/370 (28%), Positives = 169/370 (45%), Gaps = 39/370 (10%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +GTPPV+ +DTGSD W C CP+ + + FDP SST + I
Sbjct: 74 LYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSSMI 133
Query: 91 SCSSSQC------AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
+CS +C + T + CSY+F YG G S +SG ++ + N+ V
Sbjct: 134 ACSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDG---SGTSGYYVSDMMHLNTIFEGSVT 190
Query: 145 MPN---VIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS-IAGK-FSYCLP-D 197
+ V+FGC ++ T SD GI G G S+ISQ+ + IA + FS+CL D
Sbjct: 191 TNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCLKGD 250
Query: 198 QGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNIFV 251
I G + +V T L+ + HY L+L++I+V Q L+ F +S++ V
Sbjct: 251 SSGGGILVLGEIVEPNIVYTSLVPAQPHYNLNLQSIAVNGQTLQIDSSVFATSNSRGTIV 310
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVTI 309
D+G L E + S ++ I Q V V + CY I+S FP+V++
Sbjct: 311 DSGTTLAYLAEEAYDPFVSAITASIP-QSVHTVVSRGN----QCYLITSSVTEVFPQVSL 365
Query: 310 HFR-GADVKLSPSNLFRNISD----EIMCSAFRG--GNANIVYGRIMQINFLIGYDIEQA 362
+F GA + L P + + + C F+ G + G ++ + ++ YD+
Sbjct: 366 NFAGGASMILRPQDYLIQQNSIGGAAVWCIGFQKIQGQGITILGDLVLKDKIVVYDLAGQ 425
Query: 363 MVSFKPSRCT 372
+ + C+
Sbjct: 426 RIGWANYDCS 435
>gi|115466068|ref|NP_001056633.1| Os06g0119600 [Oryza sativa Japonica Group]
gi|55296446|dbj|BAD68569.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|55296924|dbj|BAD68375.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594673|dbj|BAF18547.1| Os06g0119600 [Oryza sativa Japonica Group]
gi|215694767|dbj|BAG89958.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737752|dbj|BAG96882.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 495
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 97/347 (27%), Positives = 148/347 (42%), Gaps = 50/347 (14%)
Query: 52 VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNCSE-G 107
+D+GSD +W QC+PCP C +Q PLFDP S+TY ++ C+S+ CA + CS
Sbjct: 172 IDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLGPYRRGCSANA 231
Query: 108 DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE-MPNVIFGCGHKNLASPTSDSKQ 166
C + YG G+ AT T +F+ + P + + FGC H + S D
Sbjct: 232 QCQFGINYG--------DGSTATGTYSFDDLTLGPYDVIRGFRFGCAHADRGS-AFDYDV 282
Query: 167 TGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGA---------GVVST 217
G + LG G+ SL+ Q T FSYCLP SS G +V G VST
Sbjct: 283 AGSLALGGGSQSLVQQTATRYGRVFSYCLPPTASS---LGFLVLGVPPERAQLIPSFVST 339
Query: 218 PLI----IRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEYH---SNL 268
PL+ Y + L AI V + L + S ++ + ++ L P Y +
Sbjct: 340 PLLSSSMAPTFYRVLLRAIIVAGRPLAVPPAVFSASSVIDSSTIISRLPPTAYQALRAAF 399
Query: 269 KSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFR-GADVKLSPSNLFRNI 327
+S M+ A PV + F+ V + P + + F GA V L + +
Sbjct: 400 RSAMTMYRAAPPVSILDTCYDFTGVRSITL------PSIALVFDGGATVNLDAAGILLG- 452
Query: 328 SDEIMCSAFRGGNANIV---YGRIMQINFLIGYDIEQAMVSFKPSRC 371
C AF ++ + G + Q + YD+ + F+ + C
Sbjct: 453 ----SCLAFAPTASDRMPGFIGNVQQKTLEVVYDVPAKAMRFRTAAC 495
>gi|115451209|ref|NP_001049205.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|49532749|dbj|BAD26705.1| Radc1 [Oryza sativa Japonica Group]
gi|108706569|gb|ABF94364.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113547676|dbj|BAF11119.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|215692805|dbj|BAG88249.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767626|dbj|BAG99854.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 438
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 103/382 (26%), Positives = 168/382 (43%), Gaps = 68/382 (17%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ +G+P + ++DT +D TW C PC C LF P SS+Y S+ CSS
Sbjct: 79 YVVRAGLGSPSQQLLLALDTSADATWAHCSPCGT--CPSSS--LFAPANSSSYASLPCSS 134
Query: 95 SQCAVVTSNCSE-----GD----------CSYSFLYGRGAYASFSSGNLATETLTFNSTS 139
S C + GD C++S + A ASF + LA++TL +
Sbjct: 135 SWCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPF---ADASFQAA-LASDTLRLGKDA 190
Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG 199
+PN FGC ++ PT++ + G++GLG G +L+SQ G+ G FSYCLP
Sbjct: 191 -----IPNYTFGC-VSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCLPSYR 244
Query: 200 SSKINFGGIVAGAG------VVSTPLIIRDH----YYLSLEAISVGNQRLE-------FV 242
S + G + GAG V TP++ H YY+++ +SVG+ ++ F
Sbjct: 245 SYYFS-GSLRLGAGGGQPRSVRYTPMLRNPHRSSLYYVNVTGLSVGHAWVKVPAGSFAFD 303
Query: 243 SSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNI 298
+++ VD+G + T ++ L+ + V A G++ + C+N
Sbjct: 304 AATGAGTVVDSGTVITRWTAPVYAALREEFR--------RQVAAPSGYTSLGAFDTCFNT 355
Query: 299 S--SQPKFPEVTIHFRGA-DVKLSPSN-LFRNISDEIMCSAFRGGNANI-----VYGRIM 349
+ P VT+H G D+ L N L + + + C A N+ V +
Sbjct: 356 DEVAAGGAPAVTVHMDGGVDLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQ 415
Query: 350 QINFLIGYDIEQAMVSFKPSRC 371
Q N + +D+ + V F C
Sbjct: 416 QQNIRVVFDVANSRVGFAKESC 437
>gi|413922067|gb|AFW61999.1| hypothetical protein ZEAMMB73_694403, partial [Zea mays]
Length = 328
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 79/243 (32%), Positives = 115/243 (47%), Gaps = 44/243 (18%)
Query: 40 SIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA- 98
S G+P ++ VDTGSD TW QC+PC C+ Q PLFDP S+TY ++ C++S CA
Sbjct: 101 SSGSPAANLTVIVDTGSDLTWVQCKPCSA--CYAQRDPLFDPAGSATYAAVRCNASACAD 158
Query: 99 -----------VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
++ C Y+ YG G SFS G LAT+T+ S +
Sbjct: 159 SLRAATGTPGSCGSTGAGSEKCYYALAYGDG---SFSRGVLATDTVALGGAS-----LGG 210
Query: 148 VIFGCG--HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP------DQG 199
+FGCG ++ L T+ G++GLG SL+SQ + G FSYCLP G
Sbjct: 211 FVFGCGLSNRGLFGGTA-----GLMGLGRTELSLVSQTASRYGGVFSYCLPAATSGDASG 265
Query: 200 SSKINFGGIVAGAGVVSTPL----IIRD-----HYYLSLEAISVGNQRLEFVSSSTGNIF 250
S + G A + +TP+ +I D Y+L++ +VG L N+
Sbjct: 266 SLSLGGGDDAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQGLGASNVL 325
Query: 251 VDT 253
+D+
Sbjct: 326 IDS 328
>gi|218184943|gb|EEC67370.1| hypothetical protein OsI_34481 [Oryza sativa Indica Group]
Length = 367
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 99/366 (27%), Positives = 172/366 (46%), Gaps = 53/366 (14%)
Query: 36 LMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSS 95
+ + +IGTPP +D + WTQC C + CFKQ+ P+F P SST+ C +
Sbjct: 25 VANFTIGTPPQAASAFIDLTGELVWTQCSQC--IHCFKQDLPVFVPNASSTFKPEPCGTD 82
Query: 96 QC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
C ++ T C+ C++ + G G + + G +AT+T + + P + FGC
Sbjct: 83 VCKSIPTPKCASDVCAFDGVTGLGGH---TVGIVATDTFAIGTAA--PASLG---FGC-- 132
Query: 155 KNLASPTSDSK--QTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKINFGGIVA 210
+ + D+ +G IGLG SL++QM + +FSYCL D G + F G A
Sbjct: 133 --VVASDIDTMGGPSGFIGLGRTPWSLVAQMKLT---RFSYCLAPHDTGKNSRLFLGASA 187
Query: 211 --GAGVVSTPLI-------IRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLP 261
G TP + + +Y + LE I G+ + + + V T V+R L
Sbjct: 188 KLAGGGAWTPFVKTSPNDGMSQYYPIELEEIKAGDATIT-MPRGRNTVLVQTAVVRVSLL 246
Query: 262 LE--YHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFR-GADVKL 318
++ Y K+VM+++ A VG EP F +C+ + P++ F+ GA + +
Sbjct: 247 VDSVYQEFKKAVMASVGAAPTATPVG-EP-FE--VCFPKAGVSGAPDLVFTFQAGAALTV 302
Query: 319 SPSNLFRNISDEIMC-----------SAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFK 367
P+N ++ ++ +C +A G N + G Q N + +D+++ M+SF+
Sbjct: 303 PPANYLFDVGNDTVCLSVMSIALLNITALDGLN---ILGSFQQENVHLLFDLDKDMLSFE 359
Query: 368 PSRCTN 373
P+ C++
Sbjct: 360 PADCSS 365
>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
Length = 525
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 106/367 (28%), Positives = 163/367 (44%), Gaps = 70/367 (19%)
Query: 52 VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAV-------VTSNC 104
VDTGSD TW QC+PC C+ Q PLFDP S++Y ++ C++S C V +C
Sbjct: 181 VDTGSDLTWVQCKPCSV--CYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSC 238
Query: 105 S----------EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG- 153
+ C YS YG G SFS G LAT+T+ S + +FGCG
Sbjct: 239 ATVGGGGGGGKSERCYYSLAYGDG---SFSRGVLATDTVALGGAS-----VDGFVFGCGL 290
Query: 154 -HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP----DQGSSKINFGGI 208
++ L T+ G++GLG SL+SQ G FSYCLP + ++ GG
Sbjct: 291 SNRGLFGGTA-----GLMGLGRTELSLVSQTAPRFGGVFSYCLPAATSGDAAGSLSLGGD 345
Query: 209 VAG---AGVVSTPLIIRD-----HYYLSLEAISVGNQRLEFVSSSTGNIFVDTG-VLRTL 259
+ A VS +I D Y++++ SVG + N+ +D+G V+ L
Sbjct: 346 TSSYRNATPVSYTRMIADPAQPPFYFMNVTGASVGGAAVAAAGLGAANVLLDSGTVITRL 405
Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAE-----PGFSDV-LCYNIS--SQPKFPEVTIHF 311
P Y + ++A+ + GAE P FS + CYN++ + K P +T+
Sbjct: 406 APSVYRA---------VRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRL 456
Query: 312 R-GADVKLSPSN-LFRNISDE----IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVS 365
GAD+ + + LF D + ++ + + G Q N + YD + +
Sbjct: 457 EGGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLG 516
Query: 366 FKPSRCT 372
F C+
Sbjct: 517 FADEDCS 523
>gi|125532793|gb|EAY79358.1| hypothetical protein OsI_34487 [Oryza sativa Indica Group]
Length = 419
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 104/373 (27%), Positives = 167/373 (44%), Gaps = 51/373 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+ + +IGTPP + G VD + WTQC C CFKQE P+FDP S+TY + C S
Sbjct: 62 YVANFTIGTPPQAVSGIVDLSGELVWTQCAACRSSGCFKQELPVFDPSASNTYRAEQCGS 121
Query: 95 SQC-AVVTSNCS-EGDCSYSF--LYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
C ++ T NCS +G+C Y ++G + G +T+ + + G + F
Sbjct: 122 PLCKSIPTRNCSGDGECGYEAPSMFGD------TFGIASTDAIAIGNAEG------RLAF 169
Query: 151 GCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK-----IN 204
GC + S + +G +GLG SL+ Q + FSYCL G K +
Sbjct: 170 GCVVASDGSIDGAMDGPSGFVGLGRTPWSLVGQSNVT---AFSYCLAPHGPGKKSALFLG 226
Query: 205 FGGIVAGAGVVS--TPLIIR----------DHYY-LSLEAISVGNQRLEFVSSSTGNIFV 251
+AGAG + TPL+ + D YY + LE I G+ + SS G I +
Sbjct: 227 ASAKLAGAGKSNPPTPLLGQHASNTSDDGSDPYYTVQLEGIKAGDVAVAAASSGGGAITI 286
Query: 252 DTGVLRTLLPLEYHSNLK-SVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIH 310
L T PL Y + + ++ A A P LC+ ++ P++
Sbjct: 287 LQ--LETFRPLSYLPDAAYQALEKVVTAALGSPSMANPPEPFDLCFQNAAVSGVPDLVFT 344
Query: 311 FRGADVKLSPSN---LFRNISDEIMC----SAFRGGNAN---IVYGRIMQINFLIGYDIE 360
F+G +P + L + +C S+ R +A+ + G ++Q N +D+E
Sbjct: 345 FQGGATLTAPPSKYLLGDGNGNGTVCLSILSSTRLDSADDGVSILGSLLQENVHFLFDLE 404
Query: 361 QAMVSFKPSRCTN 373
+ +SF+P+ C++
Sbjct: 405 KETLSFEPADCSS 417
>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
Length = 524
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 106/367 (28%), Positives = 163/367 (44%), Gaps = 70/367 (19%)
Query: 52 VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAV-------VTSNC 104
VDTGSD TW QC+PC C+ Q PLFDP S++Y ++ C++S C V +C
Sbjct: 180 VDTGSDLTWVQCKPCSV--CYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSC 237
Query: 105 S----------EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG- 153
+ C YS YG G SFS G LAT+T+ S + +FGCG
Sbjct: 238 ATVGGGGGGGKSERCYYSLAYGDG---SFSRGVLATDTVALGGAS-----VDGFVFGCGL 289
Query: 154 -HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP----DQGSSKINFGGI 208
++ L T+ G++GLG SL+SQ G FSYCLP + ++ GG
Sbjct: 290 SNRGLFGGTA-----GLMGLGRTELSLVSQTAPRFGGVFSYCLPAATSGDAAGSLSLGGD 344
Query: 209 VAG---AGVVSTPLIIRD-----HYYLSLEAISVGNQRLEFVSSSTGNIFVDTG-VLRTL 259
+ A VS +I D Y++++ SVG + N+ +D+G V+ L
Sbjct: 345 TSSYRNATPVSYTRMIADPAQPPFYFMNVTGASVGGAAVAAAGLGAANVLLDSGTVITRL 404
Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAE-----PGFSDV-LCYNIS--SQPKFPEVTIHF 311
P Y + ++A+ + GAE P FS + CYN++ + K P +T+
Sbjct: 405 APSVYRA---------VRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRL 455
Query: 312 R-GADVKLSPSN-LFRNISDE----IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVS 365
GAD+ + + LF D + ++ + + G Q N + YD + +
Sbjct: 456 EGGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLG 515
Query: 366 FKPSRCT 372
F C+
Sbjct: 516 FADEDCS 522
>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
Length = 454
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 104/381 (27%), Positives = 174/381 (45%), Gaps = 50/381 (13%)
Query: 31 VDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTY 87
V +Y + +GTPP + +DTGSD W C+P CP FDP+ SST
Sbjct: 37 VAGLYYTRIELGTPPRPFYVQIDTGSDILWVNCKPCNACPLTSGLGVALNFFDPRGSSTA 96
Query: 88 NSISCSSSQCA----VVTSNC-SEGDCSYSFLYGRGA-----YAS--FSSGNLATETLTF 135
+ +SC S+C + S C ++ C YSF YG G+ Y S F + +T
Sbjct: 97 SPLSCIDSKCVSSNQISESVCTTDRYCGYSFEYGDGSGTLGYYVSDEFDYNQYVNQYVTN 156
Query: 136 NSTSGLPVEMPNVIFGCGHKNLASPTS-DSKQTGIIGLGPGNSSLISQMGTS-IAGK-FS 192
N+++ + FGC + T D GI G G + S++SQ+ + +A K FS
Sbjct: 157 NASA-------KITFGCSYNQSGDLTKPDRAVDGIFGFGQNDLSVVSQLNSQGLAPKIFS 209
Query: 193 YCL--PDQGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSS 244
+CL D G + G I G+V TP++ + HY L+L+ I+V Q+L F ++
Sbjct: 210 HCLEGADPGGGILVLGEITE-PGMVYTPIVPSQPHYNLNLQGIAVNGQQLSIDPQVFATT 268
Query: 245 STGNIFVDTGVLRTLLPLE-YHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK 303
+T +D G L E Y + ++++ + ++ + P F V S
Sbjct: 269 NTRGTIIDCGTTLAYLAEEAYEPFVNTIIAAVSQSTQPFMLKGNPCFLTVH----SIDEI 324
Query: 304 FPEVTIHFRGADVKLSPSN-LFRNISDE---IMCSAFR--GGNAN-----IVYGRIMQIN 352
FP VT++F GA + L P + L + +S + + C ++ G A + G ++ +
Sbjct: 325 FPSVTLYFEGAPMDLKPKDYLIQQLSPDSSPVWCIGWQKSGQQATDSSKMTILGDLVLKD 384
Query: 353 FLIGYDIEQAMVSFKPSRCTN 373
+ YD+E + + C++
Sbjct: 385 KVFVYDLENQRIGWTSFDCSS 405
>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
Length = 539
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 103/337 (30%), Positives = 153/337 (45%), Gaps = 42/337 (12%)
Query: 31 VDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTY 87
V +Y L +GTPP D + VDTGSD W C CP+ + + FDP S T
Sbjct: 77 VVGLYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTA 136
Query: 88 NSISCSSSQCAV----VTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSG- 140
+ ISCS +C+ S CS + C+Y+F YG G S +SG ++ L F+ G
Sbjct: 137 SPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDG---SGTSGFYVSDVLQFDMIVGS 193
Query: 141 --LPVEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS-IAGK-FSYCL 195
+P V+FGC SD GI G G S+ISQ+ + IA + FS+CL
Sbjct: 194 SLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCL 253
Query: 196 P-DQGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGN 248
+ G I G + +V TPL+ + HY ++L +ISV Q L F +S+
Sbjct: 254 KGENGGGGILVLGEIVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQG 313
Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMI--KAQPVKGVGAEPGFSDVLCYNISSQPK--F 304
+DTG L + ++N + +PV G + CY I++ F
Sbjct: 314 TIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQ-------CYVITTSVGDIF 366
Query: 305 PEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNA 341
P V+++F G +++F N D ++ GG A
Sbjct: 367 PPVSLNFAGG------ASMFLNPQDYLIQQNNVGGTA 397
>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 493
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 102/334 (30%), Positives = 152/334 (45%), Gaps = 42/334 (12%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
+Y L +GTPP D + VDTGSD W C CP+ + + FDP S T + I
Sbjct: 80 LYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPI 139
Query: 91 SCSSSQCAV----VTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSG---L 141
SCS +C+ S CS + C+Y+F YG G S +SG ++ L F+ G +
Sbjct: 140 SCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDG---SGTSGFYVSDVLQFDMIVGSSLV 196
Query: 142 PVEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS-IAGK-FSYCLP-D 197
P V+FGC SD GI G G S+ISQ+ + IA + FS+CL +
Sbjct: 197 PNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGE 256
Query: 198 QGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNIFV 251
G I G + +V TPL+ + HY ++L +ISV Q L F +S+ +
Sbjct: 257 NGGGGILVLGEIVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQGTII 316
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMI--KAQPVKGVGAEPGFSDVLCYNISSQPK--FPEV 307
DTG L + ++N + +PV G + CY I++ FP V
Sbjct: 317 DTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQ-------CYVITTSVGDIFPPV 369
Query: 308 TIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNA 341
+++F G +++F N D ++ GG A
Sbjct: 370 SLNFAGG------ASMFLNPQDYLIQQNNVGGTA 397
>gi|21717160|gb|AAM76353.1|AC074196_11 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433304|gb|AAP54833.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125575544|gb|EAZ16828.1| hypothetical protein OsJ_32300 [Oryza sativa Japonica Group]
Length = 419
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 108/377 (28%), Positives = 172/377 (45%), Gaps = 59/377 (15%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+ + +IGTPP + G VD + WTQC C CFKQE P+FDP S+TY + C S
Sbjct: 62 YVANFTIGTPPQAVSGIVDLSGELVWTQCAACRSSGCFKQELPVFDPSASNTYRAEQCGS 121
Query: 95 SQC-AVVTSNCS-EGDCSYSF--LYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
C ++ T NCS +G+C Y ++G + G +T+ + + G + F
Sbjct: 122 PLCKSIPTRNCSGDGECGYEAPSMFGD------TFGIASTDAIAIGNAEG------RLAF 169
Query: 151 GCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK-----IN 204
GC + S + +G +GLG SL+ Q + FSYCL G K +
Sbjct: 170 GCVVASDGSIDGAMDGPSGFVGLGRTPWSLVGQSNVT---AFSYCLALHGPGKKSALFLG 226
Query: 205 FGGIVAGAGVVS--TPLIIR----------DHYY-LSLEAISVGNQRLEFVSSSTGNIFV 251
+AGAG + TPL+ + D YY + LE I G+ + SS G I V
Sbjct: 227 ASAKLAGAGKSNPPTPLLGQHASNTSDDGSDPYYTVQLEGIKAGDVAVAAASSGGGAITV 286
Query: 252 DTGVLRTLLPLEY-----HSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPE 306
L T PL Y + L+ V++ + + + A P LC+ ++ P+
Sbjct: 287 LQ--LETFRPLSYLPDAAYQALEKVVTAALGSPSM----ANPPEPFDLCFQNAAVSGVPD 340
Query: 307 VTIHFR-GADVKLSPSN--LFRNISDEIMC----SAFRGGNAN---IVYGRIMQINFLIG 356
+ F+ GA + PS L + +C S+ R +A+ + G ++Q N
Sbjct: 341 LVFTFQGGATLTAQPSKYLLGDGNGNGTVCLSILSSTRLDSADDGVSILGSLLQENVHFL 400
Query: 357 YDIEQAMVSFKPSRCTN 373
+D+E+ +SF+P+ C++
Sbjct: 401 FDLEKETLSFEPADCSS 417
>gi|125575542|gb|EAZ16826.1| hypothetical protein OsJ_32298 [Oryza sativa Japonica Group]
Length = 396
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 105/367 (28%), Positives = 166/367 (45%), Gaps = 45/367 (12%)
Query: 36 LMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSS 95
+ + +IGTPP +D + WTQC C CFKQ+ PLF P SST+ C +
Sbjct: 44 VANFTIGTPPQPASAIIDVAGELVWTQCSRCSR--CFKQDLPLFIPNASSTFRPEPCGTD 101
Query: 96 QC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
C + TSNCS C+Y + G + TET + + ++ FGC
Sbjct: 102 ACKSTPTSNCSGDVCTYESTTNIRLDRHTTLGIVGTETFAIGTATA------SLAFGC-- 153
Query: 155 KNLASPTSDSKQT-GIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG---SSKINFG--GI 208
+AS T G IGLG SL++QM + KFSYCL +G SS++ G
Sbjct: 154 -VVASDIDTMDGTSGFIGLGRTPRSLVAQMKLT---KFSYCLSPRGTGKSSRLFLGSSAK 209
Query: 209 VAGAGVVSTPLIIR-------DHYY-LSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLL 260
+AG ST I+ HYY LSL+AI GN + + S G + + T +LL
Sbjct: 210 LAGGESTSTAPFIKTSPDDDSHHYYLLSLDAIRAGNTTIA-TAQSGGILVMHTVSPFSLL 268
Query: 261 PLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS---SQPKFPEVTIHFRG-ADV 316
+ K ++ + + + P D LC+ + S+ P++ F+G A +
Sbjct: 269 VDSAYRAFKKAVTEAVGGAAEQPMATPPQPFD-LCFKKAAGFSRATAPDLVFTFQGAAAL 327
Query: 317 KLSPSNLFRNISDE--IMCSAF-------RGGNANI-VYGRIMQINFLIGYDIEQAMVSF 366
+ P+ ++ +E C+A R G + V G + Q + YD+++ +SF
Sbjct: 328 TVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKETLSF 387
Query: 367 KPSRCTN 373
+P+ C++
Sbjct: 388 EPADCSS 394
>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 507
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 108/370 (29%), Positives = 167/370 (45%), Gaps = 41/370 (11%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +G+PP + +DTGSD W C CP + FD S T S+
Sbjct: 99 LYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSV 158
Query: 91 SCSSSQCAVV----TSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
+CS C+ V + CSE + C YSF YG G S +SG T+T F++ G +
Sbjct: 159 TCSDPICSSVFQTTAAQCSENNQCGYSFRYGDG---SGTSGYYMTDTFYFDAILGESLVA 215
Query: 146 PN---VIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQG 199
+ ++FGC T SD GI G G G S++SQ+ + FS+CL G
Sbjct: 216 NSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDG 275
Query: 200 SSKINFG-GIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNIFVD 252
S F G + G+V +PL+ + HY L+L +I V Q L F +S+T VD
Sbjct: 276 SGGGVFVLGEILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNTRGTIVD 335
Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIK--AQPVKGVGAEPGFSDVLCYNISS--QPKFPEVT 308
TG T L E + + +SN + P+ G + CY +S+ FP V+
Sbjct: 336 TGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNGEQ-------CYLVSTSISDMFPSVS 388
Query: 309 IHFR-GADVKLSPSN-LFR-NISD--EIMCSAFRGG-NANIVYGRIMQINFLIGYDIEQA 362
++F GA + L P + LF I D + C F+ + G ++ + + YD+ +
Sbjct: 389 LNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQ 448
Query: 363 MVSFKPSRCT 372
+ + C+
Sbjct: 449 RIGWASYDCS 458
>gi|242073262|ref|XP_002446567.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
gi|241937750|gb|EES10895.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
Length = 453
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 96/400 (24%), Positives = 166/400 (41%), Gaps = 73/400 (18%)
Query: 21 SIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFD 80
+++ +A ++ YL+ L IGTP ++DT SD W QC+PC + C++Q P+F+
Sbjct: 74 AVVGEAPLVPRGGEYLVKLGIGTPQHYFSAAIDTASDLVWLQCQPC--VSCYRQLDPIFN 131
Query: 81 PKKSSTYNSISCSSSQCAVVTSN-CSEGD---CSYSFLYGRGAYASFSSGNLATETLTFN 136
P+ SS+Y + CSS C+ + + C E D C Y++ Y A ++G LA + L
Sbjct: 132 PRLSSSYAVVPCSSDTCSQLDGHRCDEDDDQACRYNYKYSGNA---VTNGTLAIDKLAVG 188
Query: 137 STSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP 196
V+ GC ++ P + +G++GL G SL+SQ+ +F YCLP
Sbjct: 189 GNV-----FHAVVLGCSDSSVGGPP--PQASGLVGLARGPLSLLSQLSVR---RFMYCLP 238
Query: 197 DQGSSKINFGGIVAGAGV---------------VSTPLIIRDHYYLSLEAISVGNQRLEF 241
S G +V GAG +S+ +YYL+ + ++VG+Q
Sbjct: 239 PPMSR--TPGKLVLGAGAGADAVRNVSDRVTVTMSSSTRYPSYYYLNFDGLAVGDQTPGT 296
Query: 242 VSSSTG----------------------NIFVDTGVLRTLLPLEYHSNLKSVMSNMI--- 276
+ T + VD + L + L + I
Sbjct: 297 IRRPTSPPATGGGVGGGGGDGGSGANAYGMIVDVASTISFLEASLYDELADDLEEEIRLP 356
Query: 277 KAQPVKGVGAEPGFSDVLCYNISS-----QPKFPEVTIHFRGADVKLSPSNLFRNISDEI 331
+A P +G + LC+ + + P V++ F G ++L LF +
Sbjct: 357 RATPSTRLGLD------LCFILPEGVGIDRVYVPTVSMSFDGRWLELERDRLFLE-DGRM 409
Query: 332 MCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
MC + + G Q N + Y++ + ++F + C
Sbjct: 410 MCLMIGRTSGVSILGNYQQQNMHVLYNLRRGKITFAKASC 449
>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
Length = 466
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 99/371 (26%), Positives = 159/371 (42%), Gaps = 52/371 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPP--LFDPKKSSTYNSISC 92
Y + L +GTP + DTGSD TW +C PP +F PK S ++ I C
Sbjct: 116 YFVKLRVGTPVQEFTLVADTGSDLTWVKCA--------GASPPGRVFRPKTSRSWAPIPC 167
Query: 93 SSSQCAV----VTSNCSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
SS C + +NCS C+Y + Y G ++ + G + TE+ T G ++
Sbjct: 168 SSDTCKLDVPFTLANCSSPASPCTYDYRYKEG--SAGARGIVGTESATIALPGGKVAQLK 225
Query: 147 NVIFGC--GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKIN 204
+V+ GC H + ++D G++ LG S +Q G FSYCL D + +
Sbjct: 226 DVVLGCSSSHDGQSFRSAD----GVLSLGNAKISFATQAAARFGGSFSYCLVDHLAPRNA 281
Query: 205 FGGIVAGAGVV------STPLIIRDH---YYLSLEAISVGNQRL----EFVSSSTGNIFV 251
G + G G V T L + Y + ++AI V + L E + +G + +
Sbjct: 282 TGYLAFGPGQVPRTPATQTKLFLDPEMPFYGVKVDAIHVAGKALDIPAEVWDAKSGGVIL 341
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS-QPKFPEV--- 307
D+G T+L + + + +S + P P F CYN ++ +P PE+
Sbjct: 342 DSGNTLTVLAAPAYKAVVAALSKHLDGVPKVSF---PPFEH--CYNWTARRPGAPEIIPK 396
Query: 308 -TIHFRGADVKLSP--SNLFRNISDEIMCSAFRGGN--ANIVYGRIMQINFLIGYDIEQA 362
+ F G+ +L P + ++ + C + G V G IMQ L +D++
Sbjct: 397 LAVQFAGS-ARLEPPAKSYVIDVKPGVKCIGVQEGEWPGLSVIGNIMQQEHLWEFDLKNM 455
Query: 363 MVSFKPSRCTN 373
V FK S CT
Sbjct: 456 QVRFKQSNCTR 466
>gi|147819672|emb|CAN76394.1| hypothetical protein VITISV_020864 [Vitis vinifera]
Length = 507
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 98/363 (26%), Positives = 159/363 (43%), Gaps = 53/363 (14%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + IGTP D + VDTGSD W C+ CP + L+D K S+T +++
Sbjct: 77 LYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAV 136
Query: 91 SCSSSQCAVVTS---NCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
C + C++ C G C YS LYG G S ++G + + +N SG P
Sbjct: 137 GCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDG---SSTTGYFVQDFVQYNRISGNFQTTP 193
Query: 147 ---NVIFGCGHKNLASPTSDSKQ-TGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQGS 200
V+FGCG+K S S+ GI+G G NSS++SQ+ +S + FS+CL +
Sbjct: 194 TNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDN--- 250
Query: 201 SKINFGGIVAGAGVVSTPL--------------IIRDHYYLSLEAISVGNQRLE-----F 241
++ GGI A VV + + R HY + ++ I VG L+ F
Sbjct: 251 --VDGGGIFAIGEVVEPKVRFLLMNSVMIVVLFLSRAHYNVVMKEIEVGGDPLDVPSDAF 308
Query: 242 VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQP-VKGVGAEPGFSDVLCYNISS 300
S +D+G P E + L + ++ QP ++ E F+ C++ +
Sbjct: 309 ESGDRKGTIIDSGTTLAYFPQEVYVPL---IEKILSQQPDLRLHTVEQAFT---CFDYTG 362
Query: 301 --QPKFPEVTIHF-RGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGY 357
FP VT+HF + + + P + + C ++ A G+ + L+G
Sbjct: 363 NVDDGFPTVTLHFDKSISLTVYPHEYLFQVKEFEWCIGWQNSGAQTKDGKDLT---LLGE 419
Query: 358 DIE 360
D +
Sbjct: 420 DAQ 422
>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
Length = 469
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 108/369 (29%), Positives = 167/369 (45%), Gaps = 41/369 (11%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTW---TQCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +G+PP + +DTGSD W + C CP + FD S T S+
Sbjct: 99 LYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSV 158
Query: 91 SCSSSQCAVV----TSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
+CS C+ V + CSE + C YSF YG G S +SG T+T F++ G +
Sbjct: 159 TCSDPICSSVFQTTAAQCSENNQCGYSFRYGDG---SGTSGYYMTDTFYFDAILGESLVA 215
Query: 146 PN---VIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQG 199
+ ++FGC T SD GI G G G S++SQ+ + FS+CL G
Sbjct: 216 NSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDG 275
Query: 200 SSKINFG-GIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNIFVD 252
S F G + G+V +PL+ + HY L+L +I V Q L F +S+T VD
Sbjct: 276 SGGGVFVLGEILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNTRGTIVD 335
Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIK--AQPVKGVGAEPGFSDVLCYNISS--QPKFPEVT 308
TG T L E + + +SN + P+ G + CY +S+ FP V+
Sbjct: 336 TGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNGEQ-------CYLVSTSISDMFPSVS 388
Query: 309 IHFR-GADVKLSPSN-LFR-NISD--EIMCSAFRGG-NANIVYGRIMQINFLIGYDIEQA 362
++F GA + L P + LF I D + C F+ + G ++ + + YD+ +
Sbjct: 389 LNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQ 448
Query: 363 MVSFKPSRC 371
+ + C
Sbjct: 449 RIGWASYDC 457
>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 512
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 108/370 (29%), Positives = 167/370 (45%), Gaps = 41/370 (11%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +G+PP + +DTGSD W C CP + FD S T S+
Sbjct: 104 LYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSV 163
Query: 91 SCSSSQCAVV----TSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
+CS C+ V + CSE + C YSF YG G S +SG T+T F++ G +
Sbjct: 164 TCSDPICSSVFQTTAAQCSENNQCGYSFRYGDG---SGTSGYYMTDTFYFDAILGESLVA 220
Query: 146 PN---VIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQG 199
+ ++FGC T SD GI G G G S++SQ+ + FS+CL G
Sbjct: 221 NSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDG 280
Query: 200 SSKINFG-GIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNIFVD 252
S F G + G+V +PL+ + HY L+L +I V Q L F +S+T VD
Sbjct: 281 SGGGVFVLGEILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNTRGTIVD 340
Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIK--AQPVKGVGAEPGFSDVLCYNISS--QPKFPEVT 308
TG T L E + + +SN + P+ G + CY +S+ FP V+
Sbjct: 341 TGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNGEQ-------CYLVSTSISDMFPSVS 393
Query: 309 IHFR-GADVKLSPSN-LFR-NISD--EIMCSAFRGG-NANIVYGRIMQINFLIGYDIEQA 362
++F GA + L P + LF I D + C F+ + G ++ + + YD+ +
Sbjct: 394 LNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQ 453
Query: 363 MVSFKPSRCT 372
+ + C+
Sbjct: 454 RIGWASYDCS 463
>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
Length = 468
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 110/374 (29%), Positives = 170/374 (45%), Gaps = 47/374 (12%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y L +GTPP D + +DTGSD W C CP FDP S T + I
Sbjct: 51 LYYTRLQLGTPPRDFYVQIDTGSDVLWVSCGSCNGCPVNSGLHIPLNFFDPGSSPTASLI 110
Query: 91 SCSSSQCAV----VTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
SCS +C++ S CS + C Y+F YG G S +SG ++ L F++ G V
Sbjct: 111 SCSDQRCSLGLQSSDSVCSAQNNLCGYNFQYGDG---SGTSGYYVSDLLHFDTVLGGSV- 166
Query: 145 MPN----VIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS-IAGK-FSYCLP- 196
M N ++FGC T SD GI G G + S++SQ+ + I+ + FS+CL
Sbjct: 167 MNNSSAPIVFGCSALQTGDLTKSDRAVDGIFGFGQQDMSVVSQLASQGISPRAFSHCLKG 226
Query: 197 -DQGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNI 249
D G + G IV +V TPL+ + HY L++++ISV Q L F +SS+
Sbjct: 227 DDSGGGILVLGEIVE-PNIVYTPLVPSQPHYNLNMQSISVNGQTLAIDPSVFGTSSSQGT 285
Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMI--KAQPVKGVGAEPGFSDVLCYNISSQPK--FP 305
+D+G L + S +++++ +P G CY ISS FP
Sbjct: 286 IIDSGTTLAYLAEAAYDPFISAITSIVSPSVRPYLSKGNH-------CYLISSSINDIFP 338
Query: 306 EVTIHFR-GADVKLSPSNLFRNISD----EIMCSAFRG--GNANIVYGRIMQINFLIGYD 358
+V+++F GA + L P + S + C F+ G + G ++ + + YD
Sbjct: 339 QVSLNFAGGASMILIPQDYLIQQSSIGGAALWCIGFQKIQGQGITILGDLVLKDKIFVYD 398
Query: 359 IEQAMVSFKPSRCT 372
I + + C+
Sbjct: 399 IANQRIGWANYDCS 412
>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 103/377 (27%), Positives = 164/377 (43%), Gaps = 49/377 (12%)
Query: 31 VDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPL----FDPKKSST 86
V +Y + +GTPPV + VDTGSD TW C PC Q P + +DP +SST
Sbjct: 33 VTGLYYTKIYLGTPPVGYYVQVDTGSDVTWLNCAPCTSCVTETQLPSIKLTTYDPSRSST 92
Query: 87 YNSISCSSSQC-AVVTSN----CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNST-SG 140
++SC S C A + SN S G C+YS YG G S + G + +TF +
Sbjct: 93 DGALSCRDSNCGAALGSNEVSCTSAGYCAYSTTYGDG---SSTQGYFIQDVMTFQEIHNN 149
Query: 141 LPVE-MPNVIFGCGHKNLASPTSDSKQ-TGIIGLGPGNSSLISQMGT--SIAGKFSYCLP 196
V +V FGCG + S+ G+IG G S+ SQ+ + + +F++CL
Sbjct: 150 TQVNGTASVYFGCGTTQSGNLLMSSRALDGLIGFGQAAVSIPSQLASMGKVGNRFAHCLQ 209
Query: 197 --DQGSSKINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRL------EFVSSSTGN 248
+QG I G V+ + TP++ R+HY + ++ I+V + + + S+S G
Sbjct: 210 GDNQGGGTIVIGS-VSEPNISYTPIVSRNHYAVGMQNIAVNGRNVTTPASFDTTSTSAGG 268
Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI---SSQPKFP 305
+ +D+G L ++ + +S + S C + S Q FP
Sbjct: 269 VIMDSGTTLAYLVDPAYTQFVNAVSTFESSMFS---------SHSQCLQLAWCSLQADFP 319
Query: 306 EVTIHF-RGADVKLSPSNLFRNI----SDEIMCSAFRGGNANIVY------GRIMQINFL 354
V + F GA + L+P N + C ++ Y G I+ + L
Sbjct: 320 TVKLFFDAGAVMNLTPRNYLYSQPLQNGQAAYCMGWQKSTTKAGYLSYSILGDIVLKDHL 379
Query: 355 IGYDIEQAMVSFKPSRC 371
+ YD + +V +K C
Sbjct: 380 VVYDNDNRVVGWKSFDC 396
>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 358
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 78/247 (31%), Positives = 120/247 (48%), Gaps = 26/247 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + + G+P VDTGS +W QC+PC + C Q PLFDP S TY S+SC+S
Sbjct: 118 YYVKVGFGSPARYYSMIVDTGSSLSWLQCKPC-VVYCHVQADPLFDPSASKTYKSLSCTS 176
Query: 95 SQCAVVTS--------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
SQC+ + S C Y+ YG +S+S G L+ + LT + +P
Sbjct: 177 SQCSSLVDATLNNPLCETSSNVCVYTASYGD---SSYSMGYLSQDLLTLAPSQ----TLP 229
Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK-INF 205
++GCG S + GI+GLG S++ Q+ + FSYCLP +G ++
Sbjct: 230 GFVYGCGQD---SDGLFGRAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTRGGGGFLSI 286
Query: 206 G-GIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSSTG-NIFVDTGVLRTL 259
G +AG+ TP+ Y+L L AI+VG + L ++ +D+G + T
Sbjct: 287 GKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPTIIDSGTVITR 346
Query: 260 LPLEYHS 266
LP+ ++
Sbjct: 347 LPMSVYT 353
>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
Length = 481
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 102/345 (29%), Positives = 148/345 (42%), Gaps = 51/345 (14%)
Query: 52 VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNCSEGD 108
+D+ SD W QC PCP C Q +DP +S + SCSS C + + C+
Sbjct: 163 LDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPSSAPFSCSSPTCTALGPYANGCANNQ 222
Query: 109 CSYSFLYGRGAYASFSSGNLATETLTF---NSTSGLPVEMPNVIFGCGHKNLASPTSDSK 165
C Y Y G S +SG + LT N+ SG FGC H S D++
Sbjct: 223 CQYLVRYPDG---SSTSGAYIADLLTLDAGNAVSGF-------KFGCSHAEQGS--FDAR 270
Query: 166 QTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIV---AGAGVVSTPLI-- 220
GI+ LG G SL+SQ + FSYC+P S F V A + V TP++
Sbjct: 271 AAGIMALGGGPESLLSQTASRYGNAFSYCIPATASDSGFFTLGVPRRASSRYVVTPMVRF 330
Query: 221 --IRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEYH---SNLKSVMS 273
Y + L I+VG QRL + + G++ + L P Y S +S M+
Sbjct: 331 RQAATFYGVLLRTITVGGQRLGVAPAVFAAGSVLDSRTAITRLPPTAYQALRSAFRSSMT 390
Query: 274 NMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHF-RGADVKLSPSNLFRNISDE 330
M ++ P K G+ D CY+ + + P++++ F R A + L PS + N
Sbjct: 391 -MYRSAPPK------GYLDT-CYDFTGVVNIRLPKISLVFDRNAVLPLDPSGILFND--- 439
Query: 331 IMCSAFRGGNANI----VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
C AF NA+ V G + Q + YD+ V F+ C
Sbjct: 440 --CLAFT-SNADDRMPGVLGSVQQQTIEVLYDVGGGAVGFRQGAC 481
>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
Length = 423
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 104/376 (27%), Positives = 168/376 (44%), Gaps = 48/376 (12%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +G P + F +DTGSD W C P CP + F+P SST + I
Sbjct: 4 LYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRI 63
Query: 91 SCSSSQCAV---------VTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG- 140
+CS +C TSN C Y+F YG G S +SG ++T+ F + G
Sbjct: 64 TCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDG---SGTSGYYVSDTMFFETVMGN 120
Query: 141 --LPVEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGT-SIAGK-FSYCL 195
+++FGC + T +D GI G G S+ISQ+ + ++ K FS+CL
Sbjct: 121 EQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL 180
Query: 196 P--DQGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTG 247
D G + G IV G+V TPL+ + HY L+LE+I+V Q+L F +S+T
Sbjct: 181 KGSDNGGGILVLGEIVE-PGLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNTQ 239
Query: 248 NIFVDTGVLRTLLP----LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK 303
VD+G L + S + + +S +++ KG S + S
Sbjct: 240 GTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKG-------SQCFITSSSVDSS 292
Query: 304 FPEVTIHFRGA-DVKLSPSN-LFRNISDE---IMCSAFR--GGNANIVYGRIMQINFLIG 356
FP VT++F G + + P N L + S + + C ++ G + G ++ + +
Sbjct: 293 FPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFV 352
Query: 357 YDIEQAMVSFKPSRCT 372
YD+ + + C+
Sbjct: 353 YDLANMRMGWADYDCS 368
>gi|125532788|gb|EAY79353.1| hypothetical protein OsI_34482 [Oryza sativa Indica Group]
Length = 394
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 99/371 (26%), Positives = 163/371 (43%), Gaps = 61/371 (16%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+ + +IGTPP +D + WTQC+ C CF+Q+ PLFDP S+TY + C +
Sbjct: 51 YVANFTIGTPPQPASAVIDLAGELVWTQCKQCSR--CFEQDTPLFDPTASNTYRAEPCGT 108
Query: 95 SQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
C + S NCS C+Y A + G + T+T + ++ FG
Sbjct: 109 PLCESIPSDSRNCSGNVCAYQ----ASTNAGDTGGKVGTDTFAVGTAKA------SLAFG 158
Query: 152 CGHKNLASPTSDSK--QTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKINFGG 207
C + + D+ +GI+GLG SL++Q G + FSYCL D G + F G
Sbjct: 159 C----VVASDIDTMGGPSGIVGLGRTPWSLVTQTGVA---AFSYCLAPHDAGKNSALFLG 211
Query: 208 ----IVAGAGVVSTPLI--------IRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTGV 255
+ G STP + + ++Y + LE + G+ + S + T +
Sbjct: 212 SSAKLAGGGKAASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGS------TVL 265
Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNIS-SQPKFPEVTIH 310
L T P+ + L +K VGA P + V LC+ S + P++
Sbjct: 266 LDTFSPISF---LVDGAYQAVKKAVTVAVGAPPMATPVEPFDLCFPKSGASGAAPDLVFT 322
Query: 311 FR-GADVKLSPSNLFRNISDEIMCSAFRGGNANI-------VYGRIMQINFLIGYDIEQA 362
FR GA + ++ SN + + +C A +A + + G + Q N +D+++
Sbjct: 323 FRGGAAMTVAASNYLLDYKNGTVCLAML-SSARLNSTTELSLLGSLQQENIHFLFDLDKE 381
Query: 363 MVSFKPSRCTN 373
+SF+P+ CT
Sbjct: 382 TLSFEPADCTK 392
>gi|125542690|gb|EAY88829.1| hypothetical protein OsI_10302 [Oryza sativa Indica Group]
Length = 440
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 100/376 (26%), Positives = 163/376 (43%), Gaps = 56/376 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ +G+P + ++DT +D TW C PC C LF P SS+Y S+ CSS
Sbjct: 81 YVVRAGLGSPSQQLLLALDTSADATWAHCSPCGT--CPSSS--LFAPANSSSYASLPCSS 136
Query: 95 SQCAVVTSNCSE-----GD----------CSYSFLYGRGAYASFSSGNLATETLTFNSTS 139
S C + GD C++S + A ASF + LA++TL +
Sbjct: 137 SWCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPF---ADASFQAA-LASDTLRLGKDA 192
Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG 199
+PN FGC ++ PT++ + G++GLG G +L+SQ G+ G FSYCLP
Sbjct: 193 -----IPNYTFGC-VSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCLPSYR 246
Query: 200 SSKINFGGIVAGAG------VVSTPLIIRDH----YYLSLEAISVGNQRLE-------FV 242
S + G + GAG V TP++ H YY+++ +SVG ++ F
Sbjct: 247 SYYFS-GSLRLGAGGGQPRSVRYTPMLRNPHRSSLYYVNVTGLSVGRAWVKVPAGSFAFD 305
Query: 243 SSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP 302
+++ VD+G + T ++ L+ + A G + F + +
Sbjct: 306 AATGAGTVVDSGTVITRWTAPVYAALREEFRRQVAAP--SGYTSLGAFDTCFNTDEVAAG 363
Query: 303 KFPEVTIHFRGA-DVKLSPSN-LFRNISDEIMCSAFRGGNANI-----VYGRIMQINFLI 355
P VT+H G D+ L N L + + + C A N+ V + Q N +
Sbjct: 364 GAPAVTVHMDGGVDLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNIRV 423
Query: 356 GYDIEQAMVSFKPSRC 371
+D+ + + F C
Sbjct: 424 VFDVANSRIGFAKESC 439
>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 507
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 104/376 (27%), Positives = 168/376 (44%), Gaps = 48/376 (12%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +G P + F +DTGSD W C P CP + F+P SST + I
Sbjct: 88 LYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRI 147
Query: 91 SCSSSQCAV---------VTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG- 140
+CS +C TSN C Y+F YG G S +SG ++T+ F + G
Sbjct: 148 TCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDG---SGTSGYYVSDTMFFETVMGN 204
Query: 141 --LPVEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGT-SIAGK-FSYCL 195
+++FGC + T +D GI G G S+ISQ+ + ++ K FS+CL
Sbjct: 205 EQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL 264
Query: 196 P--DQGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTG 247
D G + G IV G+V TPL+ + HY L+LE+I+V Q+L F +S+T
Sbjct: 265 KGSDNGGGILVLGEIVE-PGLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNTQ 323
Query: 248 NIFVDTGVLRTLLP----LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK 303
VD+G L + S + + +S +++ KG S + S
Sbjct: 324 GTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKG-------SQCFITSSSVDSS 376
Query: 304 FPEVTIHFRGA-DVKLSPSN-LFRNISDE---IMCSAFR--GGNANIVYGRIMQINFLIG 356
FP VT++F G + + P N L + S + + C ++ G + G ++ + +
Sbjct: 377 FPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFV 436
Query: 357 YDIEQAMVSFKPSRCT 372
YD+ + + C+
Sbjct: 437 YDLANMRMGWADYDCS 452
>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 509
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 104/376 (27%), Positives = 168/376 (44%), Gaps = 48/376 (12%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +G P + F +DTGSD W C P CP + F+P SST + I
Sbjct: 90 LYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRI 149
Query: 91 SCSSSQCAV---------VTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG- 140
+CS +C TSN C Y+F YG G S +SG ++T+ F + G
Sbjct: 150 TCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDG---SGTSGYYVSDTMFFETVMGN 206
Query: 141 --LPVEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGT-SIAGK-FSYCL 195
+++FGC + T +D GI G G S+ISQ+ + ++ K FS+CL
Sbjct: 207 EQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL 266
Query: 196 P--DQGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTG 247
D G + G IV G+V TPL+ + HY L+LE+I+V Q+L F +S+T
Sbjct: 267 KGSDNGGGILVLGEIVE-PGLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNTQ 325
Query: 248 NIFVDTGVLRTLLP----LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK 303
VD+G L + S + + +S +++ KG S + S
Sbjct: 326 GTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKG-------SQCFITSSSVDSS 378
Query: 304 FPEVTIHFRGA-DVKLSPSN-LFRNISDE---IMCSAFR--GGNANIVYGRIMQINFLIG 356
FP VT++F G + + P N L + S + + C ++ G + G ++ + +
Sbjct: 379 FPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFV 438
Query: 357 YDIEQAMVSFKPSRCT 372
YD+ + + C+
Sbjct: 439 YDLANMRMGWADYDCS 454
>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
Length = 426
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 102/339 (30%), Positives = 155/339 (45%), Gaps = 37/339 (10%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
+Y L +GTPP D + VDTGSD W C CP+ + + FDP S T + I
Sbjct: 80 LYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPI 139
Query: 91 SCSSSQCAV----VTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSG---L 141
SCS +C+ S CS + C+Y+F YG G S +SG ++ L F+ G +
Sbjct: 140 SCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDG---SGTSGFYVSDVLQFDMIVGSSLV 196
Query: 142 PVEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS-IAGK-FSYCLP-D 197
P V+FGC SD GI G G S+ISQ+ + IA + FS+CL +
Sbjct: 197 PNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGE 256
Query: 198 QGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNIFV 251
G I G + +V TPL+ + HY ++L +ISV Q L F +S+ +
Sbjct: 257 NGGGGILVLGEIVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQGTII 316
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMI--KAQPVKGVGAEPGFSDVLCYNISSQPK--FPEV 307
DTG L + ++N + +PV G + CY I++ FP V
Sbjct: 317 DTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQ-------CYVITTSVGDIFPPV 369
Query: 308 TIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNANIVY 345
+++F GA + L+P + ++ F G ++V+
Sbjct: 370 SLNFAGGASMFLNPQDYLIQQNNVASALCFLGRYCSVVH 408
>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 461
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 104/369 (28%), Positives = 161/369 (43%), Gaps = 44/369 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +GTP VDTGS+ TW C K +F +S ++ ++ C +
Sbjct: 106 YFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRAR---GKDNRRVFRADESKSFKTVGCLT 162
Query: 95 SQCAVVTSNC--------SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
C V N CSY + Y G+ A G A ET+T T+G +P
Sbjct: 163 QTCKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQ---GVFAKETITVGLTNGRMARLP 219
Query: 147 NVIFGCGHKNLASPTSDSKQ--TGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKIN 204
+ GC +S T S Q G++GL + S S + KFSYCL D S+K
Sbjct: 220 GHLIGCS----SSFTGQSFQGADGVLGLAFSDFSFTSTATSLYGAKFSYCLVDHLSNKNV 275
Query: 205 FGGIVAG-------AGVVSTPLIIRD---HYYLSLEAISVGNQRLE-----FVSSSTGNI 249
++ G A +TPL + Y +++ IS+G L+ + ++S G
Sbjct: 276 SNYLIFGSSRSTKTAFRRTTPLDLTRIPPFYAINVIGISLGYDMLDIPSQVWDATSGGGT 335
Query: 250 FVDTGVLRTLLP-LEYHSNLKSVMSNMIKAQPVK--GVGAEPGFSDVLCYNISSQPKFPE 306
+D+G TLL Y + + +++ + VK GV E FS +N+S K P+
Sbjct: 336 ILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCFSFTSGFNVS---KLPQ 392
Query: 307 VTIHFRG-ADVKLSPSNLFRNISDEIMCSAF--RGGNANIVYGRIMQINFLIGYDIEQAM 363
+T H +G A + + + + + C F G A V G IMQ N+L +D+ +
Sbjct: 393 LTFHLKGGARFEPHRKSYLVDAAPGVKCLGFVSAGTPATNVIGNIMQQNYLWEFDLMAST 452
Query: 364 VSFKPSRCT 372
+SF PS CT
Sbjct: 453 LSFAPSACT 461
>gi|357127505|ref|XP_003565420.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 466
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 99/383 (25%), Positives = 169/383 (44%), Gaps = 59/383 (15%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCF-----------------KQEPP 77
YL +++GTPPV DTGSD W +C + +
Sbjct: 82 YLAAVNVGTPPVRFLAVADTGSDLVWLKCNTTQNNNGIVSSDSGNNSNSSPPPPPPEAVV 141
Query: 78 LFDPKKSSTYNSISCSSSQCAVVTSNCS-EGD---CSYSFLYGRGAYASFSSGNLATETL 133
F+P SS+Y+ + C C + +N S GD C + + Y GA A +G LA +T
Sbjct: 142 YFNPFDSSSYSRVGCDGPSCLALATNASCNGDSHACDFRYSYRDGASA---TGLLAADTF 198
Query: 134 TF-NSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFS 192
TF + + ++ FGC + + + G++GLG G SL SQ+G KFS
Sbjct: 199 TFGGNINNDTTSTASIDFGCAT---GTAGREFQADGMVGLGAGPLSLASQLGR----KFS 251
Query: 193 YCLP----DQGSSKINFG--GIVAGAGVVSTPLIIRD-----HYYLSLEAISVGNQRLEF 241
+CL D SS +NFG +V+ G +TPLI +Y +S++++ V Q +
Sbjct: 252 FCLTAYDIDDASSILNFGARAVVSDPGAATTPLIASSSNAAAYYAISIDSLKVAGQPVPG 311
Query: 242 VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV--LCYNIS 299
++S + VDTG + T L+ + L + ++ + G+ P + LCY++S
Sbjct: 312 -TTSVSKVIVDTGTVLTF--LDRAALLAPLTESLARVMDGAGLPRAPPPDETLELCYDVS 368
Query: 300 SQPK----FPEVTIHF---RGADVKLSPSNLFRNISDEIMCSAFRGGNANI----VYGRI 348
P+VT+ G +V+L+ F + + ++C A + + V G +
Sbjct: 369 RVKDVDGVIPDVTLVLGGGGGGEVRLTGEGTFVLVKEGVLCLAVVTTSPELQPLSVLGNV 428
Query: 349 MQINFLIGYDIEQAMVSFKPSRC 371
+ +G D++ +F + C
Sbjct: 429 ALQDLHVGIDLDARTATFATANC 451
>gi|255581508|ref|XP_002531560.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223528821|gb|EEF30826.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 407
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 114/403 (28%), Positives = 182/403 (45%), Gaps = 64/403 (15%)
Query: 13 NETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCF 72
N P+SP + ++ I + L++GTPP ++ +DTGS+ +W C +
Sbjct: 14 NSFPRSPNKLPFRHNIS-----LTVSLTVGTPPQNVSMVIDTGSELSWLYCN---KTTTT 65
Query: 73 KQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCS-EGDC-SYSFLYGRGAYASFSS--GNL 128
P F+ +S +Y I CSSS C T + S C S S + +YA SS GNL
Sbjct: 66 TSYPTTFNQTRSISYRPIPCSSSTCTNQTRDFSIPASCDSNSLCHATLSYADASSSEGNL 125
Query: 129 ATETLTFNSTSGLPVEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTSI 187
A++T ++ ++P ++FGC +S + DSK TG++G+ G+ S +SQMG
Sbjct: 126 ASDTFHMGAS-----DIPGMVFGCMDSVFSSNSDEDSKNTGLMGMNRGSLSFVSQMGFP- 179
Query: 188 AGKFSYCLPDQGSSKI------NFGGIV----AGAGVVSTPLIIRDH--YYLSLEAISVG 235
KFSYC+ S + NF V +STPL D Y + LE I V
Sbjct: 180 --KFSYCISGTDFSGMLLLGESNFTWAVPLNYTPLVQISTPLPYFDRIAYTVQLEGIKVS 237
Query: 236 NQRLEFVSS-------STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEP 288
++ L S G VD+G T L ++ L+S N V +P
Sbjct: 238 DRLLPIPKSVFEPDHTGAGQTMVDSGTQFTFLLGPAYTALRSEFLNQTTG--FLRVLEDP 295
Query: 289 GF----SDVLCYNIS-SQ---PKFPEVTIHFRGADVKLSPSNLFRNI------SDEIMCS 334
F + LCY + SQ P+ P V++ F GA++ ++ + + +D + C
Sbjct: 296 DFVFQGAMDLCYRVPISQRVLPRLPTVSLVFNGAEMTVADERVLYRVPGEIRGNDSVHCL 355
Query: 335 AFRGGNANI------VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+F GN+++ V G Q N + +D+E++ + RC
Sbjct: 356 SF--GNSDLLGVEAYVIGHHHQQNVWMEFDLERSRIGLAQVRC 396
>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
2-like [Cucumis sativus]
Length = 478
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 99/374 (26%), Positives = 162/374 (43%), Gaps = 46/374 (12%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + IG+PP D VDTGSD W C CP+ + L++PK SST I
Sbjct: 72 LYYARIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSSTSTLI 131
Query: 91 SCSSSQCAVVTSNCSEG-----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSG---LP 142
+C C+ G C Y +YG G S ++G + + G
Sbjct: 132 TCDQPFCSATYDAPIPGCKPDLLCQYKVIYGDG---SATAGYFVNDYIQLQRAVGNHKTS 188
Query: 143 VEMPNVIFGCGHKNLASPTSDSKQT-GIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQG 199
+++FGCG K S S+ GI+G G NSS+ISQ+ + + F++CL
Sbjct: 189 ETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCL---- 244
Query: 200 SSKINFGGIVAGAGVV-----STPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGN 248
I+ GGI A VV +TP++ + HY + L + VG+ L+ F +S
Sbjct: 245 -DSISGGGIFAIGEVVEPKLXNTPVVPNQAHYNVVLNGVKVGDTALDLPLGLFETSYKRG 303
Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQP-VKGVGAEPGFSDVLCYNISSQPKFPEV 307
+D+G LP S +M ++ AQP +K + F+ ++ + FP V
Sbjct: 304 AIIDSGTTLAYLP---ESIYLPLMEKILGAQPDLKLRTVDDQFT-CFVFDKNVDDGFPTV 359
Query: 308 TIHFRGADV-KLSPSNLFRNISDEIMCSAF-------RGGNANIVYGRIMQINFLIGYDI 359
T F + + + P I D++ C + + GN + G ++ N L+ Y++
Sbjct: 360 TFKFEESLILTIYPHEYLFQIRDDVWCVGWQNSGAQSKDGNEVTLLGDLVLQNKLVYYNL 419
Query: 360 EQAMVSFKPSRCTN 373
E + + C++
Sbjct: 420 ENQTIGWTEYNCSS 433
>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 478
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 99/374 (26%), Positives = 162/374 (43%), Gaps = 46/374 (12%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + IG+PP D VDTGSD W C CP+ + L++PK SST I
Sbjct: 72 LYYARIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSSTSTLI 131
Query: 91 SCSSSQCAVVTSNCSEG-----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSG---LP 142
+C C+ G C Y +YG G S ++G + + G
Sbjct: 132 TCDQPFCSATYDAPIPGCKPDLLCQYKVIYGDG---SATAGYFVNDYIQLQRAVGNHKTS 188
Query: 143 VEMPNVIFGCGHKNLASPTSDSKQT-GIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQG 199
+++FGCG K S S+ GI+G G NSS+ISQ+ + + F++CL
Sbjct: 189 ETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCL---- 244
Query: 200 SSKINFGGIVAGAGVV-----STPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGN 248
I+ GGI A VV +TP++ + HY + L + VG+ L+ F +S
Sbjct: 245 -DSISGGGIFAIGEVVEPKLKTTPVVPNQAHYNVVLNGVKVGDTALDLPLGLFETSYKRG 303
Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQP-VKGVGAEPGFSDVLCYNISSQPKFPEV 307
+D+G LP S +M ++ AQP +K + F+ ++ + FP V
Sbjct: 304 AIIDSGTTLAYLP---DSIYLPLMEKILGAQPDLKLRTVDDQFT-CFVFDKNVDDGFPTV 359
Query: 308 TIHFRGADV-KLSPSNLFRNISDEIMCSAF-------RGGNANIVYGRIMQINFLIGYDI 359
T F + + + P I D++ C + + GN + G ++ N L+ Y++
Sbjct: 360 TFKFEESLILTIYPHEYLFQIRDDVWCVGWQNSGAQSKDGNEVTLLGDLVLQNKLVYYNL 419
Query: 360 EQAMVSFKPSRCTN 373
E + + C++
Sbjct: 420 ENQTIGWTEYNCSS 433
>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
Length = 439
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 104/369 (28%), Positives = 161/369 (43%), Gaps = 44/369 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +GTP VDTGS+ TW C K +F +S ++ ++ C +
Sbjct: 84 YFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRAR---GKDNRRVFRADESKSFKTVGCLT 140
Query: 95 SQCAVVTSNC--------SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
C V N CSY + Y G+ A G A ET+T T+G +P
Sbjct: 141 QTCKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQ---GVFAKETITVGLTNGRMARLP 197
Query: 147 NVIFGCGHKNLASPTSDSKQ--TGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKIN 204
+ GC +S T S Q G++GL + S S + KFSYCL D S+K
Sbjct: 198 GHLIGCS----SSFTGQSFQGADGVLGLAFSDFSFTSTATSLYGAKFSYCLVDHLSNKNV 253
Query: 205 FGGIVAG-------AGVVSTPLIIRD---HYYLSLEAISVGNQRLE-----FVSSSTGNI 249
++ G A +TPL + Y +++ IS+G L+ + ++S G
Sbjct: 254 SNYLIFGSSRSTKTAFRRTTPLDLTRIPPFYAINVIGISLGYDMLDIPSQVWDATSGGGT 313
Query: 250 FVDTGVLRTLLP-LEYHSNLKSVMSNMIKAQPVK--GVGAEPGFSDVLCYNISSQPKFPE 306
+D+G TLL Y + + +++ + VK GV E FS +N+S K P+
Sbjct: 314 ILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCFSFTSGFNVS---KLPQ 370
Query: 307 VTIHFRG-ADVKLSPSNLFRNISDEIMCSAF--RGGNANIVYGRIMQINFLIGYDIEQAM 363
+T H +G A + + + + + C F G A V G IMQ N+L +D+ +
Sbjct: 371 LTFHLKGGARFEPHRKSYLVDAAPGVKCLGFVSAGTPATNVIGNIMQQNYLWEFDLMAST 430
Query: 364 VSFKPSRCT 372
+SF PS CT
Sbjct: 431 LSFAPSACT 439
>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 491
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 99/352 (28%), Positives = 151/352 (42%), Gaps = 52/352 (14%)
Query: 52 VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNCS-EG 107
+DT SD W QC PCP C Q L+DP KSS+ + CSS C + + C+ G
Sbjct: 160 IDTASDVPWVQCAPCPAPHCHAQTDVLYDPSKSSSSAAFPCSSPACRNLGPYANGCTPAG 219
Query: 108 D-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI----FGCGHKNLASPTS 162
D C Y Y G S S+G ++ LT N P + + I FGC H L +
Sbjct: 220 DQCQYRVQYPDG---SASAGTYISDVLTLN-----PAKPASAISEFRFGCSHALLQPGSF 271
Query: 163 DSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGVVS------ 216
+K +GI+ LG G SL +Q + FSYCLP + ++ G + G V+
Sbjct: 272 SNKTSGIMALGRGAQSLPTQTKATYGDVFSYCLP---PTPVHSGFFILGVPRVAASRYAV 328
Query: 217 TPLIIRDH----YYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLLPLEYHSNLKSV 271
TP++ Y + L AI V +RL + +D+ + T LP + L++
Sbjct: 329 TPMLRSKAAPMLYLVRLIAIEVAGKRLPVPPAVFAAGAVMDSRTIVTRLPPTAYMALRAA 388
Query: 272 MSNMIKAQPVKGVGAEPGFSDVLCYNISSQP-------KFPEVTIHFRGAD--VKLSPSN 322
++A A P CY+ S K P++T+ F G + V+L PS
Sbjct: 389 FVAEMRAY----RAAAPKEHLDTCYDFSGAAPGGGGGVKLPKITLVFDGPNGAVELDPSG 444
Query: 323 LFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ + C AF + + G + Q + Y+++ A V F+ C
Sbjct: 445 VLLD-----GCLAFAPNTDDQMTGIIGNVQQQALEVLYNVDGATVGFRRGAC 491
>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
Length = 447
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 110/388 (28%), Positives = 173/388 (44%), Gaps = 70/388 (18%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCE-PCPELDCFK--------QEPPLFDPKKSS 85
Y + S+GTPP + +DTGS WT C P C + P++ KSS
Sbjct: 74 YSVIFSLGTPPQKVSLVLDTGSSLVWTPCTIPTATYTCQNCTFSGVDPTKIPIYARNKSS 133
Query: 86 TYNSISCSSSQCAVVTS---NCSEGD-CSYSFL-YGRGAYASFSSGNLATETLTFNSTSG 140
T S+ C S +C V NCS C Y L YG G+ ++G L ++ L + +
Sbjct: 134 TVQSLPCRSPKCNWVFGSDLNCSTTKRCPYYGLEYGLGS----TTGQLVSDVLGLSKLN- 188
Query: 141 LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL----- 195
+P+ +FGC S S+ + GI G G G +S+ +Q+G + KFSYCL
Sbjct: 189 ---RIPDFLFGC------SLVSNRQPEGIAGFGRGLASIPAQLGLT---KFSYCLVSHRF 236
Query: 196 ---PDQGSSKINFGGIVAGA---GVVSTPLI-------IRDHYYLSLEAISVGNQRLE-- 240
P G ++ G A A GV P ++YY+SL I VG + +
Sbjct: 237 DDTPQSGDLVLHRGRRHADAAANGVAYAPFTKSPALSPYSEYYYISLSKILVGGKDVPIP 296
Query: 241 ---FVSSSTGN--IFVDTGVLRTLLP-LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL 294
V S G+ + VD+G T + + + + + +M K + K + G
Sbjct: 297 PRYLVPSKEGDGGMIVDSGSTFTFMERIIFDPVARELEKHMTKYKRAKEIEDSSGLGP-- 354
Query: 295 CYNISSQPK--FPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAF-----RGGNAN---I 343
CYNI+ Q + P++T F+ GA++ L ++ F ++D ++C G+ I
Sbjct: 355 CYNITGQSEVDVPKLTFSFKGGANMDLPLTDYFSLVTDGVVCMTVLTDPDEPGSTTGPAI 414
Query: 344 VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ G Q NF I YD+++ FKP +C
Sbjct: 415 ILGNYQQQNFYIEYDLKKQRFGFKPQQC 442
>gi|21717154|gb|AAM76347.1|AC074196_5 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433293|gb|AAP54831.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125532791|gb|EAY79356.1| hypothetical protein OsI_34485 [Oryza sativa Indica Group]
Length = 397
Score = 104 bits (259), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 108/370 (29%), Positives = 164/370 (44%), Gaps = 50/370 (13%)
Query: 36 LMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSS 95
+ + +IGTPP +D + WTQC C CFKQ+ PLF P SST+ C +
Sbjct: 44 VANFTIGTPPQPASAIIDVAGELVWTQCSRCSR--CFKQDLPLFIPNASSTFRPEPCGTD 101
Query: 96 QC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
C + TSNCS C+Y + G + TET + + ++ FGC
Sbjct: 102 ACKSTPTSNCSGDVCTYESTTNIRLDRHTTLGIVGTETFAIGTATA------SLAFGC-- 153
Query: 155 KNLASPTSDSKQT-GIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG---SSKINFG--GI 208
+AS T G IGLG SL++QM + KFSYCL +G SS++ G
Sbjct: 154 -VVASDIDTMDGTSGFIGLGRTPRSLVAQMKLT---KFSYCLSPRGTGKSSRLFLGSSAK 209
Query: 209 VAGAGVVSTPLIIR-------DHYY-LSLEAISVGNQRLEFVSSSTGNIFVDTGV--LRT 258
+AG ST I+ HYY LSL+AI GN + ++ +G I V V
Sbjct: 210 LAGGESTSTAPFIKTSPDDDSHHYYLLSLDAIRAGNTTI--ATAQSGGILVMHTVSPFSL 267
Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS---SQPKFPEVTIHFRGAD 315
L+ Y + K+V + A F LC+ + S+ P++ F+G
Sbjct: 268 LVDSAYRAFKKAVTEAVGGAAAPPMATPPQPFD--LCFKKAAGFSRATAPDLVFTFQGGG 325
Query: 316 VKLS--PSNLFRNISDE--IMCSAF-------RGGNANI-VYGRIMQINFLIGYDIEQAM 363
L+ P+ ++ +E C+A R G + V G + Q N YD+++
Sbjct: 326 AALTVPPAKYLIDVGEEKDTACAAILSMARLNRTGLEGVSVLGSLQQENVHFLYDLKKET 385
Query: 364 VSFKPSRCTN 373
+SF+P+ C++
Sbjct: 386 LSFEPADCSS 395
>gi|115483168|ref|NP_001065177.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|21717168|gb|AAM76361.1|AC074196_19 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433289|gb|AAP54827.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639786|dbj|BAF27091.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|215686408|dbj|BAG87693.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 98/371 (26%), Positives = 162/371 (43%), Gaps = 61/371 (16%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+ + +IGTPP +D + WTQC+ C CF+Q+ PLFDP S+TY + C +
Sbjct: 51 YVANFTIGTPPQPASAVIDLAGELVWTQCKQCSR--CFEQDTPLFDPTASNTYRAEPCGT 108
Query: 95 SQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
C + S NCS C+Y A + G + T+T + ++ FG
Sbjct: 109 PLCESIPSDSRNCSGNVCAYQ----ASTNAGDTGGKVGTDTFAVGTAKA------SLAFG 158
Query: 152 CGHKNLASPTSDSK--QTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKINFGG 207
C + + D+ +GI+GLG SL++Q G + FSYCL D G + F G
Sbjct: 159 C----VVASDIDTMGGPSGIVGLGRTPWSLVTQTGVA---AFSYCLAPHDAGRNSALFLG 211
Query: 208 ----IVAGAGVVSTPLI--------IRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTGV 255
+ G STP + + ++Y + LE + G+ + S + T +
Sbjct: 212 SSAKLAGGGKAASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGS------TVL 265
Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNIS-SQPKFPEVTIH 310
L T P+ + L +K VGA P + V LC+ S + P++
Sbjct: 266 LDTFSPISF---LVDGAYQAVKKAVTAAVGAPPMATPVEPFDLCFPKSGASGAAPDLVFT 322
Query: 311 FR-GADVKLSPSNLFRNISDEIMCSAFRGGNANI-------VYGRIMQINFLIGYDIEQA 362
FR GA + + +N + + +C A +A + + G + Q N +D+++
Sbjct: 323 FRGGAAMTVPATNYLLDYKNGTVCLAML-SSARLNSTTELSLLGSLQQENIHFLFDLDKE 381
Query: 363 MVSFKPSRCTN 373
+SF+P+ CT
Sbjct: 382 TLSFEPADCTK 392
>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
Length = 504
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 104/374 (27%), Positives = 172/374 (45%), Gaps = 46/374 (12%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +G+PP + F +DTGSD W C P CP + F+P SST + I
Sbjct: 90 LYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKI 149
Query: 91 SCSSSQC--AVVTSN--CSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSG--- 140
CS +C A+ TS C D C Y+F YG G S +SG ++T+ F+S G
Sbjct: 150 PCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDG---SGTSGYYVSDTMYFDSVMGNEQ 206
Query: 141 LPVEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGT-SIAGK-FSYCLP- 196
+++FGC + T +D GI G G S++SQ+ + ++ K FS+CL
Sbjct: 207 TANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKG 266
Query: 197 -DQGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNI 249
D G + G IV G+V TPL+ + HY L+LE+I V Q+L F +S+T
Sbjct: 267 SDNGGGILVLGEIVE-PGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQGT 325
Query: 250 FVDTGVLRTLLP----LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFP 305
VD+G L + + + + +S +++ KG + + S FP
Sbjct: 326 IVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSKG-------NQCFVTSSSVDSSFP 378
Query: 306 EVTIHFRGA-DVKLSPSN-LFRNIS---DEIMCSAFR--GGNANIVYGRIMQINFLIGYD 358
V+++F G + + P N L + S + + C ++ G + G ++ + + YD
Sbjct: 379 TVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYD 438
Query: 359 IEQAMVSFKPSRCT 372
+ + + C+
Sbjct: 439 LANMRMGWTDYDCS 452
>gi|21717175|gb|AAM76368.1|AC074196_26 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433292|gb|AAP54830.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 418
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 103/376 (27%), Positives = 166/376 (44%), Gaps = 66/376 (17%)
Query: 36 LMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSS 95
+ + +IGTPP +D + WTQC C CFKQ+ PLF P SST+ C +
Sbjct: 68 VANFTIGTPPQPASAIIDVAGELVWTQCSMCSR--CFKQDLPLFVPNASSTFRPEPCGTD 125
Query: 96 QC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC-- 152
C ++ TSNCS C+Y + G +AT+T + + ++ FGC
Sbjct: 126 ACKSIPTSNCSSNMCTYEGTI-NSKLGGHTLGIVATDTFAIGTATA------SLGFGCVV 178
Query: 153 --GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFGG 207
G + P +G+IGLG SSL+SQM + KFSYCL S S++ G
Sbjct: 179 ASGIDTMGGP------SGLIGLGRAPSSLVSQMNIT---KFSYCLTPHDSGKNSRLLLGS 229
Query: 208 ---IVAGAGVVSTPLI-------IRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLR 257
+ G +TP + + +Y + L+ I G+ + S GN T +++
Sbjct: 230 SAKLAGGGNSTTTPFVKTSPGDDMSQYYPIQLDGIKAGDAAIALPPS--GN----TVLVQ 283
Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNIS--SQPKFPEVTIHF 311
TL P+ + L +K + K VGA P + + LC+ + S P++ F
Sbjct: 284 TLAPMSF---LVDSAYQALKKEVTKAVGAAPTATPLQPFDLCFPKAGLSNASAPDLVFTF 340
Query: 312 R--GADVKLSPSNLFRNISDE--IMCSAFRGG----------NANIVYGRIMQINFLIGY 357
+ A + + P ++ +E +C A N NI+ G + Q N
Sbjct: 341 QQGAAALTVPPPKYLIDVGEEKGTVCMAILSTSWLNTTALDENLNIL-GSLQQENTHFLL 399
Query: 358 DIEQAMVSFKPSRCTN 373
D+E+ +SF+P+ C++
Sbjct: 400 DLEKKTLSFEPADCSS 415
>gi|224136436|ref|XP_002322329.1| predicted protein [Populus trichocarpa]
gi|222869325|gb|EEF06456.1| predicted protein [Populus trichocarpa]
Length = 486
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 113/418 (27%), Positives = 167/418 (39%), Gaps = 98/418 (23%)
Query: 31 VDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCE----PCPELDCFKQEP---------- 76
V D YL+ L+IGTPP I +DTGSD TW C C E D ++
Sbjct: 78 VRDGYLISLNIGTPPQVIQVLMDTGSDLTWVPCGNLSFDCMECDDYRNNKLMATFSPSYS 137
Query: 77 ----------PLFDPKKSSTYNSISCSSSQCA---VVTSNCSEGDCSYSFLYGRGAYASF 123
P SS +C+ + C+ +V + CS S+++ YG G
Sbjct: 138 SSSYRASCASPFCIDIHSSDNPLDTCTVAGCSLSTLVKATCSRPCPSFAYTYGAGGVV-- 195
Query: 124 SSGNLATETLTFNSTS-GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQ 182
+G L +TL N +S G+ E+P FGC P GI G G G S++SQ
Sbjct: 196 -TGILTRDTLRVNGSSPGVAKEIPKFCFGCVGSAYREP------IGIAGFGRGTLSMVSQ 248
Query: 183 MGTSIAGKFSYCLPDQGSSKINFGGIVAGAGVVSTPLIIRD------------------- 223
+G G FS+C A +S+PL++ D
Sbjct: 249 LGFLQKG-FSHCF---------LAFKYANNPNISSPLVVGDIALTSKDDMQFTPMLNSPM 298
Query: 224 ---HYYLSLEAISVGN--------QRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVM 272
YY+ LEAI+VGN EF S G + +D+G T LP ++S + S++
Sbjct: 299 YPNFYYVGLEAITVGNVSATEVPSSLREFDSLGNGGMKIDSGTTYTHLPEPFYSQVLSIL 358
Query: 273 SNMIKAQPVKGVGAEPGFSDVLCYNI--------SSQPKFPEVTIHF-RGADVKLSPSNL 323
+ I G+ + GF LCY + +S P +T HF + L N
Sbjct: 359 QSTINYPRDTGMEMQTGFD--LCYKVPRPNNNTLTSDDLLPSITFHFLNNVSLVLPQGNH 416
Query: 324 FRNISDE-----IMCSAFRGGNANI-----VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
F +S + C F+ + V+G Q N + YD+E+ + F+P C
Sbjct: 417 FYPVSAPGNPAVVKCLMFQSTDDGDDGPAGVFGSFQQQNVEVVYDLEKERIGFQPMDC 474
>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
Length = 442
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 102/380 (26%), Positives = 155/380 (40%), Gaps = 65/380 (17%)
Query: 39 LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQC- 97
L++GTPP ++ +DTGS+ +W C P + F P+ S T+ S+ C S+QC
Sbjct: 70 LAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASVPCDSAQCR 129
Query: 98 -----AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
+ + + C S Y G S S G LATE T G P+ FGC
Sbjct: 130 SRDLPSPPACDGASKQCRVSLSYADG---SSSDGALATEVFTVG--QGPPLR---AAFGC 181
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKI--------- 203
+ G++G+ G S +SQ T +FSYC+ D+ + +
Sbjct: 182 MATAFDTSPDGVATAGLLGMNRGALSFVSQASTR---RFSYCISDRDDAGVLLLGHSDLP 238
Query: 204 ----NFGGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSS-------STGNIFVD 252
N+ + A + P R Y + L I VG + L +S G VD
Sbjct: 239 FLPLNYTPLYQPA--MPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTMVD 296
Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFS-----DVLCYNI----SSQPK 303
+G T L + +S LK+ S K P +P F+ D C+ + + +
Sbjct: 297 SGTQFTFLLGDAYSALKAEFSRQTK--PWLPALNDPNFAFQEAFDT-CFRVPQGRAPPAR 353
Query: 304 FPEVTIHFRGADVKLSPSNLF------RNISDEIMCSAFRGGNANI------VYGRIMQI 351
P VT+ F GA + ++ L R D + C F GNA++ V G Q+
Sbjct: 354 LPAVTLLFNGAQMTVAGDRLLYKVPGERRGGDGVWCLTF--GNADMVPITAYVIGHHHQM 411
Query: 352 NFLIGYDIEQAMVSFKPSRC 371
N + YD+E+ V P RC
Sbjct: 412 NVWVEYDLERGRVGLAPIRC 431
>gi|115489316|ref|NP_001067145.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|77556903|gb|ABA99699.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113649652|dbj|BAF30164.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|125537189|gb|EAY83677.1| hypothetical protein OsI_38901 [Oryza sativa Indica Group]
Length = 446
Score = 103 bits (258), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 105/372 (28%), Positives = 149/372 (40%), Gaps = 51/372 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+ IG PP +DTGSD WTQC C C +Q P ++ SST+ + C++
Sbjct: 90 YVAEYLIGDPPQRAEALIDTGSDLVWTQCSTCLRKVCARQALPYYNSSASSTFAPVPCAA 149
Query: 95 SQCAV---VTSNCS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
CA + C CS YG G A G L TE F S + + F
Sbjct: 150 RICAANDDIIHFCDLAAGCSVIAGYGAGVVA----GTLGTEAFAFQSGTA------ELAF 199
Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP----DQGSSKINFG 206
GC + +G+IGLG G SL+SQ G A KFSYCL + G++ F
Sbjct: 200 GCVTFTRIVQGALHGASGLIGLGRGRLSLVSQTG---ATKFSYCLTPYFHNNGATGHLFV 256
Query: 207 GIVA---GAGVVSTPLIIRD-----HYYLSLEAISVGNQRLEFVSSS-----------TG 247
G A G G V T ++ YYL L ++VG RL ++ +G
Sbjct: 257 GASASLGGHGDVMTTQFVKGPKGSPFYYLPLIGLTVGETRLPIPATVFDLREVAPGLFSG 316
Query: 248 NIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSD--VLCYNISSQPK-F 304
+ +D+G T L + + L S ++ + V A P +D LC +
Sbjct: 317 GVIIDSGSPFTSLVHDAYDALASELAARLNGSLV----APPPDADDGALCVARRDVGRVV 372
Query: 305 PEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGG----NANIVYGRIMQINFLIGYDIE 360
P V HFRG P+ + D+ V G Q N + YD+
Sbjct: 373 PAVVFHFRGGADMAVPAESYWAPVDKAAACMAIASAGPYRRQSVIGNYQQQNMRVLYDLA 432
Query: 361 QAMVSFKPSRCT 372
SF+P+ C+
Sbjct: 433 NGDFSFQPADCS 444
>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
Length = 441
Score = 103 bits (258), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 102/380 (26%), Positives = 155/380 (40%), Gaps = 65/380 (17%)
Query: 39 LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQC- 97
L++GTPP ++ +DTGS+ +W C P + F P+ S T+ S+ C S+QC
Sbjct: 69 LAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASVPCGSAQCR 128
Query: 98 -----AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
+ + + C S Y G S S G LATE T G P+ FGC
Sbjct: 129 SRDLPSPPACDGASKQCRVSLSYADG---SSSDGALATEVFTVG--QGPPLR---AAFGC 180
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKI--------- 203
+ G++G+ G S +SQ T +FSYC+ D+ + +
Sbjct: 181 MATAFDTSPDGVATAGLLGMNRGALSFVSQASTR---RFSYCISDRDDAGVLLLGHSDLP 237
Query: 204 ----NFGGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSS-------STGNIFVD 252
N+ + A + P R Y + L I VG + L +S G VD
Sbjct: 238 FLPLNYTPLYQPA--MPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTMVD 295
Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFS-----DVLCYNI----SSQPK 303
+G T L + +S LK+ S K P +P F+ D C+ + + +
Sbjct: 296 SGTQFTFLLGDAYSALKAEFSRQTK--PWLPALNDPNFAFQEAFDT-CFRVPQGRAPPAR 352
Query: 304 FPEVTIHFRGADVKLSPSNLF------RNISDEIMCSAFRGGNANI------VYGRIMQI 351
P VT+ F GA + ++ L R D + C F GNA++ V G Q+
Sbjct: 353 LPAVTLLFNGAQMTVAGDRLLYKVPGERRGGDGVWCLTF--GNADMVPITAYVIGHHHQM 410
Query: 352 NFLIGYDIEQAMVSFKPSRC 371
N + YD+E+ V P RC
Sbjct: 411 NVWVEYDLERGRVGLAPIRC 430
>gi|15241713|ref|NP_195839.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75181297|sp|Q9LZL3.1|PCS1L_ARATH RecName: Full=Aspartic proteinase PCS1; AltName: Full=Aspartic
protease 38; Short=AtASP38; AltName: Full=Protein EMBRYO
DEFECTIVE 24; AltName: Full=Protein PROMOTION OF CELL
SURVIVAL 1; Flags: Precursor
gi|7340693|emb|CAB82992.1| putative protein [Arabidopsis thaliana]
gi|50897174|gb|AAT85726.1| At5g02190 [Arabidopsis thaliana]
gi|53828617|gb|AAU94418.1| At5g02190 [Arabidopsis thaliana]
gi|110742159|dbj|BAE99007.1| hypothetical protein [Arabidopsis thaliana]
gi|332003059|gb|AED90442.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 453
Score = 103 bits (258), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 110/384 (28%), Positives = 166/384 (43%), Gaps = 79/384 (20%)
Query: 44 PPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSN 103
PP +I +DTGS+ +W +C + FDP +SS+Y+ I CSS C T +
Sbjct: 82 PPQNISMVIDTGSELSWLRCNRSSNPNPVNN----FDPTRSSSYSPIPCSSPTCRTRTRD 137
Query: 104 C-------SEGDCSYSFLYGRGAYASFSSGNLATETLTF-NSTSGLPVEMPNVIFGC-GH 154
S+ C + Y A AS S GNLA E F NST+ N+IFGC G
Sbjct: 138 FLIPASCDSDKLCHATLSY---ADASSSEGNLAAEIFHFGNSTND-----SNLIFGCMGS 189
Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS-------------- 200
+ + P D+K TG++G+ G+ S ISQMG KFSYC+
Sbjct: 190 VSGSDPEEDTKTTGLLGMNRGSLSFISQMGFP---KFSYCISGTDDFPGFLLLGDSNFTW 246
Query: 201 -SKINFGGIVAGAGVVSTPLIIRDH--YYLSLEAISVGNQRLEFVSS-------STGNIF 250
+ +N+ ++ +STPL D Y + L I V + L S G
Sbjct: 247 LTPLNYTPLIR----ISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQTM 302
Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGF----SDVLCYNISS------ 300
VD+G T L ++ L+S N + + V +P F + LCY IS
Sbjct: 303 VDSGTQFTFLLGPVYTALRSHFLN--RTNGILTVYEDPDFVFQGTMDLCYRISPVRIRSG 360
Query: 301 -QPKFPEVTIHFRGADVKLSPSNLFRNI------SDEIMCSAFRGGNANI------VYGR 347
+ P V++ F GA++ +S L + +D + C F GN+++ V G
Sbjct: 361 ILHRLPTVSLVFEGAEIAVSGQPLLYRVPHLTVGNDSVYCFTF--GNSDLMGMEAYVIGH 418
Query: 348 IMQINFLIGYDIEQAMVSFKPSRC 371
Q N I +D++++ + P C
Sbjct: 419 HHQQNMWIEFDLQRSRIGLAPVEC 442
>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 459
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 110/386 (28%), Positives = 175/386 (45%), Gaps = 65/386 (16%)
Query: 29 ISVDD------IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP-----P 77
IS DD +Y + +GTPP + VDTGSD W C PC +C +
Sbjct: 36 ISGDDDTFTTGLYYTRIYLGTPPQQFYVHVDTGSDVAWVNCVPC--TNCKRASNVALPIS 93
Query: 78 LFDPKKSSTYNSISCSSSQCAVVT-SNCS--EGDCSYSFLYGRGAYASFSSGNLATETLT 134
+FDP+KS++ SISC+ +C + + S CS C YS LYG G S ++G L + L+
Sbjct: 94 IFDPEKSTSKTSISCTDEECYLASNSKCSFNSMSCPYSTLYGDG---SSTAGYLINDVLS 150
Query: 135 FN--------STSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS 186
FN +TSG + FGCG + +D G++G G SL SQ+
Sbjct: 151 FNQVPSGNSTATSG----TARLTFGCGSNQTGTWLTD----GLVGFGQAEVSLPSQLSKQ 202
Query: 187 --IAGKFSYCLP--DQGSSKINFGGIVAGAGVVSTPLIIRD-HYYLSLEAISVGNQRL-- 239
F++CL ++GS + G I G+V TP++ + HY + L I V +
Sbjct: 203 NVSVNIFAHCLQGDNKGSGTLVIGHIRE-PGLVYTPIVPKQSHYNVELLNIGVSGTNVTT 261
Query: 240 --EFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYN 297
F S++G + +D+G T L + ++ + + +++ G P C
Sbjct: 262 PTAFDLSNSGGVIMDSGTTLTYLVQPAYDQFQAKVRDCMRS------GVLPVAFQFFC-- 313
Query: 298 ISSQPKFPEVTIHFR-GADVKLSPSN-LFRN-ISDEIMCSAFRGGNANIVYGRIMQINF- 353
+ + FP VT++F GA + LSPS+ L++ ++ + F + VYG + F
Sbjct: 314 -TIEGYFPNVTLYFAGGAAMLLSPSSYLYKEMLTTGLSAYCFSWLESTSVYGYLSYTIFG 372
Query: 354 -------LIGYDIEQAMVSFKPSRCT 372
L+ YD + +K CT
Sbjct: 373 DNVLKDQLVVYDNVNNRIGWKNFDCT 398
>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 639
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 105/365 (28%), Positives = 165/365 (45%), Gaps = 48/365 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y L IG+PP + VDTGS T+ C C + C + P F P+ SSTY + C++
Sbjct: 89 YTTRLWIGSPPQEFALIVDTGSTVTYVPCSNC--VQCGNHQDPRFQPELSSTYQPVKCNA 146
Query: 95 SQCAVVTSNCSEG--DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
NC E C+Y Y A S SSG LA + ++F S L + +FGC
Sbjct: 147 D------CNCDENGVQCTYERRY---AEMSTSSGVLAEDVMSFGKESELVPQ--RAVFGC 195
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQM--GTSIAGKFSYCLPDQGSSKINFGGIVA 210
+ + GI+GLG G S++ Q+ ++ FS C G + G +V
Sbjct: 196 ETME-SGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCY---GGMDVGGGAMVL 251
Query: 211 GAGVVSTPLIIRDH--------YYLSLEAISVGNQRLEFVSSSTGNIF---VDTGVLRTL 259
G G+ S P ++ H Y + L+ I V + L+ + + +D+G
Sbjct: 252 G-GISSPPGMVFSHSDPSRSPYYNIELKEIHVAGKPLKLNPRTFDGKYGAILDSGTTYAY 310
Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGV-GAEPGFSDVLCY-----NISSQPK-FPEVTIHF- 311
P + + K + M K +K + G +P F D+ C+ +++ PK FPEV + F
Sbjct: 311 FPEKAYYAFKDAI--MKKISFLKQISGPDPNFKDI-CFSGAGRDVTELPKVFPEVDMVFA 367
Query: 312 RGADVKLSPSN-LFRN--ISDEIMCSAFRGGN-ANIVYGRIMQINFLIGYDIEQAMVSFK 367
G + LSP N LFR+ +S F+ GN + G I+ N L+ Y+ E + + F
Sbjct: 368 NGQKISLSPENYLFRHTKVSGAYCLGIFKNGNDQTTLLGGIIVRNTLVTYNRENSTIGFW 427
Query: 368 PSRCT 372
+ C+
Sbjct: 428 KTNCS 432
>gi|15242307|ref|NP_199325.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|9758987|dbj|BAB09497.1| chloroplast nucleoid DNA-binding protein-like [Arabidopsis
thaliana]
gi|332007824|gb|AED95207.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 491
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 122/431 (28%), Positives = 181/431 (41%), Gaps = 88/431 (20%)
Query: 14 ETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPEL--DC 71
+ P S + ++ + + V D YL+ L+IGTPP + +DTGSD TW PC L DC
Sbjct: 63 KKPLSSVDVVMEP-LREVRDGYLITLNIGTPPQAVQVYLDTGSDLTWV---PCGNLSFDC 118
Query: 72 FK---------QEPPLFDPKKSSTYNSISCSSSQCAVVTSN------CSEGDCSYSFLYG 116
+ + P +F P SST SC+SS C + S+ C+ CS S L
Sbjct: 119 IECYDLKNNDLKSPSVFSPLHSSTSFRDSCASSFCVEIHSSDNPFDPCAVAGCSVSMLLK 178
Query: 117 RG--------AYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTG 168
AY ++ G L + LT + ++P FGC P G
Sbjct: 179 STCVRPCPSFAY-TYGEGGLISGILTRDILKARTRDVPRFSFGCVTSTYREP------IG 231
Query: 169 IIGLGPGNSSLISQMGTSIAGKFSYC-LPDQGSSKINFGG-IVAGAGVVS---------T 217
I G G G SL SQ+G G FS+C LP + + N ++ GA +S T
Sbjct: 232 IAGFGRGLLSLPSQLGFLEKG-FSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFT 290
Query: 218 PLI----IRDHYYLSLEAISVGNQ---------RLEFVSSSTGNIFVDTGVLRTLLPLEY 264
P++ + YY+ LE+I++G +F S G + VD+G T LP +
Sbjct: 291 PMLNTPMYPNSYYIGLESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPF 350
Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY-------NISSQPK-----FPEVTIHF- 311
+S L + + + I + GF LCY N++S FP +T HF
Sbjct: 351 YSQLLTTLQSTITYPRATETESRTGFD--LCYKVPCPNNNLTSLENDVMMIFPSITFHFL 408
Query: 312 RGADVKLSPSNLFRNISDE-----IMCSAFRG------GNANIVYGRIMQINFLIGYDIE 360
A + L N F +S + C F+ G A V+G Q N + YD+E
Sbjct: 409 NNATLLLPQGNSFYAMSAPSDGSVVQCLLFQNMEDGDYGPAG-VFGSFQQQNVKVVYDLE 467
Query: 361 QAMVSFKPSRC 371
+ + F+ C
Sbjct: 468 KERIGFQAMDC 478
>gi|357143657|ref|XP_003573000.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Brachypodium distachyon]
Length = 464
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 110/364 (30%), Positives = 164/364 (45%), Gaps = 57/364 (15%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ +SIG+P V +DTGSD +W +C + L+DP SSTY SCS+
Sbjct: 131 YVITVSIGSPAVAXTMFIDTGSDVSWLRC-----------KSRLYDPGTSSTYAPFSCSA 179
Query: 95 SQCAVV---TSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
CA + + CS G C YS YG G S ++G ++TLT TS P+ + F
Sbjct: 180 PACAQLGRRGTGCSSGSTCVYSVKYGDG---SNTTGTYGSDTLTLAGTS-EPL-ISGFQF 234
Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK----INFG 206
GC + + G++GLG S +SQ + FSYCLP +S +
Sbjct: 235 GC--SAVEHGFEEDNTDGLMGLGGDAQSFVSQTAATYGSAFSYCLPPTWNSSGFLTLGAP 292
Query: 207 GIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLL 260
A +TP++ Y L L ISVG + LE SS S G+I VD+G + T L
Sbjct: 293 SSSTSAAFSTTPMLRSKQAATFYGLLLRGISVGGKTLEIPSSVFSAGSI-VDSGTVITRL 351
Query: 261 PLEYHSNLKSVMSNMI---KAQPVKGVGAEPGFSDVLCYNISSQPK-----FPEVTIHFR 312
P + L + + + + QP A G D C++ + + P V +
Sbjct: 352 PPTAYGALSAAFRDGMARYQYQPA----APRGLLDT-CFDFTGHGEGNNFTVPSVALVLD 406
Query: 313 -GADVKLSPSNLFRNISDEIMCSAFRG----GNANIVYGRIMQINFLIGYDIEQAMVSFK 367
GA V L P+ + ++ C AF G I+ G + Q F + YD+ Q++ F+
Sbjct: 407 GGAVVDLHPNGIVQD-----GCLAFAATDDDGRTGII-GNVQQRTFEVLYDVGQSVFGFR 460
Query: 368 PSRC 371
P C
Sbjct: 461 PGAC 464
>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 609
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 105/365 (28%), Positives = 165/365 (45%), Gaps = 48/365 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y L IG+PP + VDTGS T+ C C + C + P F P+ SSTY + C++
Sbjct: 89 YTTRLWIGSPPQEFALIVDTGSTVTYVPCSNC--VQCGNHQDPRFQPELSSTYQPVKCNA 146
Query: 95 SQCAVVTSNCSEG--DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
NC E C+Y Y A S SSG LA + ++F S L + +FGC
Sbjct: 147 D------CNCDENGVQCTYERRY---AEMSTSSGVLAEDVMSFGKESELVPQ--RAVFGC 195
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQM--GTSIAGKFSYCLPDQGSSKINFGGIVA 210
+ + GI+GLG G S++ Q+ ++ FS C G + G +V
Sbjct: 196 ETME-SGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCY---GGMDVGGGAMVL 251
Query: 211 GAGVVSTPLIIRDH--------YYLSLEAISVGNQRLEFVSSSTGNIF---VDTGVLRTL 259
G G+ S P ++ H Y + L+ I V + L+ + + +D+G
Sbjct: 252 G-GISSPPGMVFSHSDPSRSPYYNIELKEIHVAGKPLKLNPRTFDGKYGAILDSGTTYAY 310
Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGV-GAEPGFSDVLCY-----NISSQPK-FPEVTIHF- 311
P + + K + M K +K + G +P F D+ C+ +++ PK FPEV + F
Sbjct: 311 FPEKAYYAFKDAI--MKKISFLKQISGPDPNFKDI-CFSGAGRDVTELPKVFPEVDMVFA 367
Query: 312 RGADVKLSPSN-LFRN--ISDEIMCSAFRGGN-ANIVYGRIMQINFLIGYDIEQAMVSFK 367
G + LSP N LFR+ +S F+ GN + G I+ N L+ Y+ E + + F
Sbjct: 368 NGQKISLSPENYLFRHTKVSGAYCLGIFKNGNDQTTLLGGIIVRNTLVTYNRENSTIGFW 427
Query: 368 PSRCT 372
+ C+
Sbjct: 428 KTNCS 432
>gi|297806153|ref|XP_002870960.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
gi|297316797|gb|EFH47219.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
Length = 453
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 110/384 (28%), Positives = 166/384 (43%), Gaps = 79/384 (20%)
Query: 44 PPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSN 103
PP +I +DTGS+ +W +C + FDP +SS+Y+ I CSS C T +
Sbjct: 82 PPQNISMVIDTGSELSWLRCNRSSNPNPVNN----FDPTRSSSYSPIPCSSPTCRTRTRD 137
Query: 104 C-------SEGDCSYSFLYGRGAYASFSSGNLATETLTF-NSTSGLPVEMPNVIFGC-GH 154
S+ C + Y A AS S GNLA E F NST+ N+IFGC G
Sbjct: 138 FLIPASCDSDKLCHATLSY---ADASSSEGNLAAEIFHFGNSTND-----SNLIFGCMGS 189
Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS-------------- 200
+ + P D+K TG++G+ G+ S ISQMG KFSYC+
Sbjct: 190 VSGSDPEEDTKTTGLLGMNRGSLSFISQMGFP---KFSYCISGTDDFPGFLLLGDSNFTW 246
Query: 201 -SKINFGGIVAGAGVVSTPLIIRDH--YYLSLEAISVGNQRLEFVSS-------STGNIF 250
+ +N+ ++ +STPL D Y + L I V + L S G
Sbjct: 247 LTPLNYTPLIR----ISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLLPDHTGAGQTM 302
Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGF----SDVLCYNISS------ 300
VD+G T L ++ L+S N + V +P F + LCY IS
Sbjct: 303 VDSGTQFTFLLGPVYTALRSDFLNQTNG--ILTVYEDPEFVFQGTMDLCYRISPFRIRTG 360
Query: 301 -QPKFPEVTIHFRGADVKLSPSNLFRNI------SDEIMCSAFRGGNANI------VYGR 347
+ P V++ F GA++ +S L + +D + C F GN+++ V G
Sbjct: 361 ILHRLPTVSLVFEGAEIAVSGQPLLYRVPHLTAGNDSVYCFTF--GNSDLMGMEAYVIGH 418
Query: 348 IMQINFLIGYDIEQAMVSFKPSRC 371
Q N I +D++++ + P +C
Sbjct: 419 HHQQNMWIEFDLQRSRIGLAPVQC 442
>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
Length = 2819
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 104/383 (27%), Positives = 175/383 (45%), Gaps = 79/383 (20%)
Query: 39 LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA 98
L++G+PP + +DTGS+ +W C+ P L +F+P SS+Y+ I CSS C
Sbjct: 1004 LTVGSPPQQVTMVLDTGSELSWLHCKKSPNLTS------VFNPLSSSSYSPIPCSSPICR 1057
Query: 99 VVTSN------CSEGDCSYSFLYGRGAYASFSS--GNLATETLTFNSTSGLPVEMPNVIF 150
T + C ++ + +YA SS GNLA++ S++ +P +F
Sbjct: 1058 TRTRDLPNPVTCDPKKLCHAIV----SYADASSLEGNLASDNFRIGSSA-----LPGTLF 1108
Query: 151 GCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIV 209
GC +S + D+K TG++G+ G+ S ++Q+G KFSYC+ + SS + G +
Sbjct: 1109 GCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLP---KFSYCISGRDSSGVLLFGDL 1165
Query: 210 AGAGV----------VSTPLIIRDH--YYLSLEAISVGNQRLEFVSS-------STGNIF 250
+ + +STPL D Y + L+ I VGN+ L S G
Sbjct: 1166 HLSWLGNLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTM 1225
Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGA---EPGF----SDVLCYNISS--- 300
VD+G T L ++ L++ + KGV A +P F + LCY++++
Sbjct: 1226 VDSGTQFTFLLGPVYTALRNEF-----LEQTKGVLAPLGDPNFVFQGAMDLCYSVAAGGK 1280
Query: 301 QPKFPEVTIHFRGADVKLSPSNLFRNI------SDEIMCSAFRGGNANI------VYGRI 348
P P V++ FRGA++ + L + ++ + C F GN+++ V G
Sbjct: 1281 LPTLPSVSLMFRGAEMVVGGEVLLYRVPEMMKGNEWVYCLTF--GNSDLLGIEAFVIGHH 1338
Query: 349 MQINFLIGYDIEQAMVSFKPSRC 371
Q N + +D +V+F C
Sbjct: 1339 HQQNVWMEFD----LVAFAADLC 1357
>gi|115457778|ref|NP_001052489.1| Os04g0337000 [Oryza sativa Japonica Group]
gi|113564060|dbj|BAF14403.1| Os04g0337000, partial [Oryza sativa Japonica Group]
Length = 321
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 87/255 (34%), Positives = 118/255 (46%), Gaps = 37/255 (14%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + IGTP + VDTGSD W C+ CP E L+DPK SST + +
Sbjct: 32 LYYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKV 91
Query: 91 SCSSSQCAVVTSNCSEG-----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
SC CA G C YS YG G S ++G ++ L F+ SG
Sbjct: 92 SCDQGFCAATYGGLLPGCTTSLPCEYSVTYGDG---SSTTGYFVSDLLQFDQVSGDGQTR 148
Query: 146 P---NVIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGTSIAGK----FSYCLPD 197
P V FGCG + +S+ GIIG G N+S++SQ+ S AGK F++CL
Sbjct: 149 PANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQL--SAAGKVKKIFAHCL-- 204
Query: 198 QGSSKINFGGIVAGAGVV-----STPLIIR-DHYYLSLEAISVGNQRLE-----FVSSST 246
IN GGI A VV +TPL+ HY ++L++I VG L+ F +
Sbjct: 205 ---DTINGGGIFAIGNVVQPKVKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEK 261
Query: 247 GNIFVDTGVLRTLLP 261
+D+G T LP
Sbjct: 262 KGTIIDSGTTLTYLP 276
>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 641
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 116/401 (28%), Positives = 177/401 (44%), Gaps = 53/401 (13%)
Query: 4 SQKLPFYNDNETPKSPISIIYQAEIISVDDI-----YLMHLSIGTPPVDIFGSVDTGSDC 58
S + PF ++ + S + A + DD+ Y L IGTPP + VDTGS
Sbjct: 52 SHRKPFTSNYHRRQLHNSDLPNAHMRLYDDLLSNGYYTTRLFIGTPPQEFALIVDTGSTV 111
Query: 59 TWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSE--GDCSYSFLYG 116
T+ C C + C K + P F P+ SSTY + C+ S NC + C+Y Y
Sbjct: 112 TYVPCSTCEQ--CGKHQDPRFQPESSSTYKPMQCNPS------CNCDDEGKQCTYERRY- 162
Query: 117 RGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGN 176
A S SSG LA + L+F + S L + IFGC S + GI+GLG G
Sbjct: 163 --AEMSSSSGLLAEDVLSFGNESELTPQ--RAIFGCETVETGELFS-QRADGIMGLGRGP 217
Query: 177 SSLISQM--GTSIAGKFSYCLPDQGSSKINFGGIVAGAGVVSTPLIIRDH--------YY 226
S++ Q+ + FS C G + G +V G + P ++ H Y
Sbjct: 218 LSVVDQLVIKEVVGNSFSLCY---GGMDVVGGAMVLG-NIPPPPDMVFAHSDPYRSAYYN 273
Query: 227 LSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKG 283
+ L+ + V +RL+ V +D+G LP E K + IK +K
Sbjct: 274 IELKELHVAGKRLKLNPRVFDGKHGTVLDSGTTYAYLPEEAFVAFKDAIIKEIKF--LKQ 331
Query: 284 V-GAEPGFSDVLCY-----NISSQPK-FPEVTIHF-RGADVKLSPSN-LFRN--ISDEIM 332
+ G +P ++D+ C+ ++S K FPEV + F G + LSP N LFR+ +S
Sbjct: 332 IHGPDPSYNDI-CFSGAGRDVSQLSKIFPEVNMVFGNGQKLSLSPENYLFRHTKVSGAYC 390
Query: 333 CSAFRGG-NANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
F+ G + + G I+ N L+ YD + + F + C+
Sbjct: 391 LGIFQNGKDPTTLLGGIVVRNTLVTYDRDNDKIGFWKTNCS 431
>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
Length = 354
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 95/314 (30%), Positives = 146/314 (46%), Gaps = 33/314 (10%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +GTPPV+ +DTGSD W C CP+ + + FDP SST + I
Sbjct: 24 LYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSSMI 83
Query: 91 SCSSSQC------AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
+CS +C + T + CSY+F YG G S +SG ++ + N+ V
Sbjct: 84 ACSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDG---SGTSGYYVSDMMHLNTIFEGSVT 140
Query: 145 MPN---VIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS-IAGK-FSYCLP-D 197
+ V+FGC ++ T SD GI G G S+ISQ+ + IA + FS+CL D
Sbjct: 141 TNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCLKGD 200
Query: 198 QGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNIFV 251
I G + +V T L+ + HY L+L++I+V Q L+ F +S++ V
Sbjct: 201 SSGGGILVLGEIVEPNIVYTSLVPAQPHYNLNLQSIAVNGQTLQIDSSVFATSNSRGTIV 260
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVTI 309
D+G L E + S ++ I V CY I+S FP+V++
Sbjct: 261 DSGTTLAYLAEEAYDPFVSAITASIPQSVHTAVSRGN-----QCYLITSSVTEVFPQVSL 315
Query: 310 HFR-GADVKLSPSN 322
+F GA + L P +
Sbjct: 316 NFAGGASMILRPQD 329
>gi|359476191|ref|XP_003631801.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 439
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 102/374 (27%), Positives = 160/374 (42%), Gaps = 83/374 (22%)
Query: 27 EIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSST 86
++ D +L+ ++ GTPP + +DTGS TWTQC+ C
Sbjct: 120 KLFDEDGNFLVDVAFGTPPQNFTLILDTGSSITWTQCKAC-------------------- 159
Query: 87 YNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
V +N Y+ YG S S GN +T+T +
Sbjct: 160 ------------TVENN-------YNMTYGD---DSTSVGNYGCDTMTLEPSD----VFQ 193
Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG------- 199
FG G N S G++GLG G S +SQ + FSYCLP++
Sbjct: 194 KFQFGRGRNNKGD--FGSGVDGMLGLGQGQLSTVSQTASKFNKVFSYCLPEEDSIGSLLF 251
Query: 200 -------SSKINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSS---STGNI 249
SS + F +V G G + +Y+++L ISVGN+RL SS S G I
Sbjct: 252 GEKATSQSSSLKFTSLVNGPGTLQE----SGYYFVNLSDISVGNERLNIPSSVFASPGTI 307
Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQPK--FP 305
+D+ + T LP +S LK+ + P+ + G D+L CYN+S + P
Sbjct: 308 -IDSRTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKG--DILDTCYNLSGRKDVLLP 364
Query: 306 EVTIHF-RGADVKLSPSNLFRNISDEIMCSAFRGGNAN------IVYGRIMQINFLIGYD 358
E+ +HF GADV+L+ +N+ + +C AF G + + + G Q++ + YD
Sbjct: 365 EIVLHFGGGADVRLNGTNIVWGSDESRLCLAFAGNSKSTMNPELTIIGNRQQLSLTVLYD 424
Query: 359 IEQAMVSFKPSRCT 372
I+ + F+ + C+
Sbjct: 425 IQGGRIGFRSNGCS 438
>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 504
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 103/374 (27%), Positives = 172/374 (45%), Gaps = 46/374 (12%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +G+PP + F +DTGSD W C P CP + F+P SST + I
Sbjct: 90 LYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKI 149
Query: 91 SCSSSQC--AVVTSN--CSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSG--- 140
CS +C A+ TS C D C Y+F YG G S +SG ++T+ F++ G
Sbjct: 150 PCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDG---SGTSGYYVSDTMYFDTVMGNEQ 206
Query: 141 LPVEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGT-SIAGK-FSYCLP- 196
+++FGC + T +D GI G G S++SQ+ + ++ K FS+CL
Sbjct: 207 TANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKG 266
Query: 197 -DQGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNI 249
D G + G IV G+V TPL+ + HY L+LE+I V Q+L F +S+T
Sbjct: 267 SDNGGGILVLGEIVE-PGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQGT 325
Query: 250 FVDTGVLRTLLP----LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFP 305
VD+G L + + + + +S +++ KG + + S FP
Sbjct: 326 IVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSKG-------NQCFVTSSSVDSSFP 378
Query: 306 EVTIHFRGA-DVKLSPSN-LFRNIS---DEIMCSAFR--GGNANIVYGRIMQINFLIGYD 358
V+++F G + + P N L + S + + C ++ G + G ++ + + YD
Sbjct: 379 TVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYD 438
Query: 359 IEQAMVSFKPSRCT 372
+ + + C+
Sbjct: 439 LANMRMGWTDYDCS 452
>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
Length = 428
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 108/379 (28%), Positives = 158/379 (41%), Gaps = 46/379 (12%)
Query: 17 KSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP 76
K P I I Y++ +IGTP + ++DT +D W C C + C
Sbjct: 73 KKPSVPIASGRAIVQSPTYIVRANIGTPAQPMLVALDTSNDAAWVPCSGC--VGCASSV- 129
Query: 77 PLFDPKKSSTYNSISCSSSQCAVVTS-NCSEGD-CSYSFLYGRGAYASFSSGNLATETLT 134
LFDP KSS+ ++ C + QC + C+ G C ++ YG S +L +TLT
Sbjct: 130 -LFDPSKSSSSRNLQCDAPQCKQAPNPTCTAGKSCGFNMTYG----GSTIEASLTQDTLT 184
Query: 135 FNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYC 194
+ + + FGC K A+ TS Q G++GLG G SLISQ FSYC
Sbjct: 185 LAND-----VIKSYTFGCISK--ATGTSLPAQ-GLMGLGRGPLSLISQTQNLYMSTFSYC 236
Query: 195 LPDQGSSK----INFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQ-------RL 239
LP+ SS + G + +TPL+ YY++L I VGN+ L
Sbjct: 237 LPNSKSSNFSGSLRLGPKYQPVRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSAL 296
Query: 240 EFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS 299
F +S+ D+G + T L + +++ IK +G GF CY S
Sbjct: 297 AFDASTGAGTIFDSGTVFTRLVEPAYVAVRNEFRRRIKNANATSLG---GFDT--CY--S 349
Query: 300 SQPKFPEVTIHFRGADVKLSPSNLF-RNISDEIMCSAFRGGNANI-----VYGRIMQINF 353
+P VT F G +V L P NL + S C A N+ V + Q N
Sbjct: 350 GSVVYPSVTFMFAGMNVTLPPDNLLIHSSSGSTSCLAMAAAPNNVNSVLNVIASMQQQNH 409
Query: 354 LIGYDIEQAMVSFKPSRCT 372
+ D+ + + CT
Sbjct: 410 RVLIDLPNSRLGISRETCT 428
>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 629
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 104/363 (28%), Positives = 161/363 (44%), Gaps = 42/363 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y L IGTPP + VD+GS T+ C C + C + P F P SSTY+ + CS
Sbjct: 85 YTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQ--CGNHQDPRFQPDLSSTYSPVKCS- 141
Query: 95 SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
A T + + C+Y Y A S SSG L + ++F + S L + +FGC +
Sbjct: 142 ---ADCTCDSDKSQCTYERQY---AEMSSSSGVLGEDIVSFGTESELKPQ--RAVFGCEN 193
Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQGSSKINFGGIVAGA 212
S GI+GLG G S++ Q+ I FS C G I G +V GA
Sbjct: 194 SETGDLFSQHAD-GIMGLGRGQLSIMDQLVDKGVIGDSFSMCY---GGMDIGGGAMVLGA 249
Query: 213 GVVSTPLI------IRDHYY-LSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTLLPL 262
++ +R YY + L+ I V + L + S +D+G LP
Sbjct: 250 MPAPPDMVFSRSDPVRSPYYNIELKEIHVAGKALRLDPRIFDSKHGTVLDSGTTYAYLPE 309
Query: 263 EYHSNLKSVMSNMIKAQPVKGV-GAEPGFSDVLCY-----NISSQPK-FPEVTIHF-RGA 314
+ K +++ K +P+K + G +P + D+ C+ N+S + FP+V + F G
Sbjct: 310 QAFVAFKDAVTS--KVRPLKKIRGPDPNYKDI-CFAGAGRNVSQLSQAFPDVDMVFGDGQ 366
Query: 315 DVKLSPSN-LFRN--ISDEIMCSAFRGG-NANIVYGRIMQINFLIGYDIEQAMVSFKPSR 370
+ LSP N LFR+ + F+ G + + G I+ N L+ YD + F +
Sbjct: 367 KLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTN 426
Query: 371 CTN 373
C+
Sbjct: 427 CSE 429
>gi|224128838|ref|XP_002320434.1| predicted protein [Populus trichocarpa]
gi|222861207|gb|EEE98749.1| predicted protein [Populus trichocarpa]
Length = 485
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 100/371 (26%), Positives = 156/371 (42%), Gaps = 42/371 (11%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + IGTP D + VDTGSD W QC CP+ + L++ +S T +
Sbjct: 77 LYYAKIGIGTPTKDYYVQVDTGSDIMWVNCIQCRECPKTSSLGIDLTLYNINESDTGKLV 136
Query: 91 SCSSSQCAVV--------TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG-L 141
C C + T+N S C Y +YG G S ++G + + + SG L
Sbjct: 137 PCDQEFCYEINGGQLPGCTANMS---CPYLEIYGDG---SSTAGYFVKDVVQYARVSGDL 190
Query: 142 PVEMPN--VIFGCGHKNLASPTSDSKQT--GIIGLGPGNSSLISQMGTS--IAGKFSYCL 195
N VIFGCG + S +++ GI+G G NSS+ISQ+ + + F++CL
Sbjct: 191 KTTAANGSVIFGCGARQSGDLGSSNEEALDGILGFGKSNSSMISQLAVTGKVKKIFAHCL 250
Query: 196 PDQGSSKINFGGIVAGAGVVSTPLIIRD-HYYLSLEAISVGNQRLE-----FVSSSTGNI 249
I G V V TPLI HY +++ A+ VG++ L F +
Sbjct: 251 DGTNGGGIFVIGHVVQPKVNMTPLIPNQPHYNVNMTAVQVGHEFLSLPTDVFEAGDRKGA 310
Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTI 309
+D+G LP + K ++S +I QP V Y+ S FP VT
Sbjct: 311 IIDSGTTLAYLPEMVY---KPLVSKIISQQPDLKVHTVRDEYTCFQYSDSLDDGFPNVTF 367
Query: 310 HFRGADV-KLSPSNLFRNISDEIMCSAF-------RGGNANIVYGRIMQINFLIGYDIEQ 361
HF + + K+ P + + C + R + G ++ N L+ YD+E
Sbjct: 368 HFENSVILKVYPHEYLFPF-EGLWCIGWQNSGVQSRDRRNMTLLGDLVLSNKLVLYDLEN 426
Query: 362 AMVSFKPSRCT 372
+ + C+
Sbjct: 427 QAIGWTEYNCS 437
>gi|297745479|emb|CBI40559.3| unnamed protein product [Vitis vinifera]
Length = 436
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 60/141 (42%), Positives = 77/141 (54%), Gaps = 15/141 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y L +GTPP ++ +DTGSD W QC PC + C+ Q P+FDPKKS +++SISC S
Sbjct: 174 YFTRLGVGTPPKYVYMVLDTGSDVVWIQCAPCRK--CYSQTDPVFDPKKSGSFSSISCRS 231
Query: 95 SQCAVVTS-NC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
C + S C S C Y YG G SF+ G +TETLTF T +P V GC
Sbjct: 232 PLCLRLDSPGCNSRQSCLYQVAYGDG---SFTFGEFSTETLTFRGT-----RVPKVALGC 283
Query: 153 GHKNLASPTSDSKQTGIIGLG 173
GH N + G++GLG
Sbjct: 284 GHDNEGLFVGAA---GLLGLG 301
>gi|125553832|gb|EAY99437.1| hypothetical protein OsI_21406 [Oryza sativa Indica Group]
Length = 409
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 96/347 (27%), Positives = 147/347 (42%), Gaps = 49/347 (14%)
Query: 52 VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNC-SEG 107
+D+GSD W QC+PCP L C Q PLFDP S+TY ++ CSS+ CA + C +
Sbjct: 85 IDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYAAVPCSSAACARLGPYRRGCLANS 144
Query: 108 DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE-MPNVIFGCGHKNLASPTSDSKQ 166
C + Y GA A +G +++ LT P + + +FGC H + S T
Sbjct: 145 QCQFGITYANGATA---TGTYSSDDLTLG-----PYDVVRGFLFGCAHADQGS-TFSYDV 195
Query: 167 TGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGA---------GVVST 217
G + LG G+ S + Q + + FSYC+P SS FG I+ G VST
Sbjct: 196 AGTLALGGGSQSFVQQTASQYSRVFSYCVPPSTSS---FGFIMFGVPPQRAALVPTFVST 252
Query: 218 PLIIRD-----HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEYHSNLKS 270
PL+ Y + L +I V + L + S ++ V+ + P Y + +
Sbjct: 253 PLLSSSTMSPTFYRVLLRSIIVAGRPLPVPPTVFSASSVIDSATVISRIPPTAYQALRAA 312
Query: 271 VMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFR-GADVKLSPSNLFRNI 327
S M +P A P CY+ S P + + F GA V L + +
Sbjct: 313 FRSAMTMYRP-----APPVSILDTCYDFSGVRSITLPSIALVFDGGATVNLDAAGILLQ- 366
Query: 328 SDEIMCSAFRGGNANIV---YGRIMQINFLIGYDIEQAMVSFKPSRC 371
C AF ++ + G + Q + YD+ + F+ + C
Sbjct: 367 ----GCLAFAPTASDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 409
>gi|15235526|ref|NP_193028.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|5123933|emb|CAB45491.1| putative protein [Arabidopsis thaliana]
gi|7267994|emb|CAB78334.1| putative protein [Arabidopsis thaliana]
gi|332657803|gb|AEE83203.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 389
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 95/352 (26%), Positives = 149/352 (42%), Gaps = 37/352 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP-PLFDPKKSSTYNSISCS 93
++ + G+P F +DTGS TWTQC PC DC+ Q+ P + P S TY C
Sbjct: 58 FMAEIHFGSPQKKQFLHMDTGSSLTWTQCFPCS--DCYAQKIYPKYRPAASITYRDAMCE 115
Query: 94 SSQCAV---VTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
S + C+Y Y + G LA E +T ++ G + V F
Sbjct: 116 DSHPKSNPHFAFDPLTRICTYQQHY---LDETNIKGTLAQEMITVDTHDGGFKRVHGVYF 172
Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVA 210
GC N S S TGI+GLG G S+I + G+ KFS+CL + K + I+
Sbjct: 173 GC---NTLSDGSYFTGTGILGLGVGKYSIIGEFGS----KFSFCLGEISEPKASHNLILG 225
Query: 211 -GAGVVSTPLII---RDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLPLEYHS 266
GA V P +I H LE+I VG E +FVDTG + L +
Sbjct: 226 DGANVQGHPTVINITEGHTIFQLESIIVGE---EITLDDPVQVFVDTGSTLSHLSTNLYY 282
Query: 267 NLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFR---GADVKLSPSNL 323
++I ++P+ + LCY + + ++ + F+ GA++ ++ N+
Sbjct: 283 KFVDAFDDLIGSRPLS-------YEPTLCYKADTIERLEKMDVGFKFDVGAELSVNIHNI 335
Query: 324 F-RNISDEIMCSAFRGGN---ANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
F + EI C A + ++++ G I + +GYD+ C
Sbjct: 336 FIQQGPPEIRCLAIQNNKESFSHVIIGVIAMQGYNVGYDLSAKTAYINKQDC 387
>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 99/372 (26%), Positives = 164/372 (44%), Gaps = 42/372 (11%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + IGTP + VDTGSD W QC+ CP E L++ +S + +
Sbjct: 79 LYYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLV 138
Query: 91 SCSSSQCAVVT----SNCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSG-LPVE 144
SC C ++ S C C Y +YG G S ++G + + ++S +G L +
Sbjct: 139 SCDDDFCYQISGGPLSGCKANMSCPYLEIYGDG---SSTAGYFVKDVVQYDSVAGDLKTQ 195
Query: 145 MPN--VIFGCGHKNLASPTSDSKQT--GIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQ 198
N VIFGCG + S +++ GI+G G NSS+ISQ+ +S + F++CL +
Sbjct: 196 TANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGR 255
Query: 199 GSSKINFGGIVAGAGVVSTPLIIRD-HYYLSLEAISVGNQRLE-----FVSSSTGNIFVD 252
I G V V TPL+ HY +++ A+ VG + L F +D
Sbjct: 256 NGGGIFAIGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLFQPGDRKGAIID 315
Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ--PKFPEVTIH 310
+G LP + L +++ A V V D C+ S + FP VT H
Sbjct: 316 SGTTLAYLPEIIYEPLVKKITSQEPALKVHIVD-----KDYKCFQYSGRVDEGFPNVTFH 370
Query: 311 FRGAD-VKLSPSN-LFRNISDEIMCSAF-------RGGNANIVYGRIMQINFLIGYDIEQ 361
F + +++ P + LF + + + C + R + G ++ N L+ YD+E
Sbjct: 371 FENSVFLRVYPHDYLFPH--EGMWCIGWQNSAMQSRDRRNMTLLGDLVLSNKLVLYDLEN 428
Query: 362 AMVSFKPSRCTN 373
++ + C++
Sbjct: 429 QLIGWTEYNCSS 440
>gi|125595861|gb|EAZ35641.1| hypothetical protein OsJ_19928 [Oryza sativa Japonica Group]
Length = 629
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 80/253 (31%), Positives = 115/253 (45%), Gaps = 35/253 (13%)
Query: 52 VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNCSE-G 107
+D+GSD +W QC+PCP C +Q PLFDP S+TY ++ C+S+ CA + CS
Sbjct: 81 IDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLGPYRRGCSANA 140
Query: 108 DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE-MPNVIFGCGHKNLASPTSDSKQ 166
C + YG G+ AT T +F+ + P + + FGC H + S D
Sbjct: 141 QCQFGINYG--------DGSTATGTYSFDDLTLGPYDVIRGFRFGCAHADRGS-AFDYDV 191
Query: 167 TGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGA---------GVVST 217
G + LG G+ SL+ Q T FSYCLP SS G +V G VST
Sbjct: 192 AGSLALGGGSQSLVQQTATRYGRVFSYCLPPTASS---LGFLVLGVPPERAQLIPSFVST 248
Query: 218 PLI----IRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEYH---SNL 268
PL+ Y + L AI V + L + S ++ + ++ L P Y +
Sbjct: 249 PLLSSSMAPTFYRVLLRAIIVAGRPLAVPPAVFSASSVIDSSTIISRLPPTAYQALRAAF 308
Query: 269 KSVMSNMIKAQPV 281
+S M+ A PV
Sbjct: 309 RSAMTMYRAAPPV 321
>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 494
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 101/371 (27%), Positives = 159/371 (42%), Gaps = 44/371 (11%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +GTPP + +DTGSD W C CP+ + FD SST +
Sbjct: 80 LYFTRVKLGTPPREFNVQIDTGSDVLWVTCSSCSNCPQTSGLGIQLNYFDTTSSSTARLV 139
Query: 91 SCS----SSQCAVVTSNC--SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG---L 141
CS +SQ + C CSY+F YG G S +SG ++T F++ G +
Sbjct: 140 PCSHPICTSQIQTTATQCPPQSNQCSYAFQYGDG---SGTSGYYVSDTFYFDAVLGESLI 196
Query: 142 PVEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLP-- 196
++FGC T +D GI G G G S+ISQ+ + FS+CL
Sbjct: 197 ANSSAAIVFGCSTYQSGDLTKTDKAVDGIFGFGQGELSVISQLSSHGITPRVFSHCLKGE 256
Query: 197 DQGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRL-----EFVSSSTGNIF 250
D G + G I+ G+V +PL+ + HY L L++I+V Q L F +SS
Sbjct: 257 DSGGGILVLGEILE-PGIVYSPLVPSQPHYNLDLQSIAVSGQLLPIDPAAFATSSNRGTI 315
Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIK--AQPVKGVGAEPGFSDVLCYNISSQPK--FPE 306
+DTG L E + S ++ + A P G + CY +S+ FP
Sbjct: 316 IDTGTTLAYLVEEAYDPFVSAITAAVSQLATPTINKGNQ-------CYLVSNSVSEVFPP 368
Query: 307 VTIHFR-GADVKLSPSNLFRNISD----EIMCSAFRGGNANI-VYGRIMQINFLIGYDIE 360
V+ +F GA + L P +++ + C F+ I + G ++ + + YD+
Sbjct: 369 VSFNFAGGATMLLKPEEYLMYLTNYAGAALWCIGFQKIQGGITILGDLVLKDKIFVYDLA 428
Query: 361 QAMVSFKPSRC 371
+ + C
Sbjct: 429 HQRIGWANYDC 439
>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 98/334 (29%), Positives = 151/334 (45%), Gaps = 42/334 (12%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +G+PP D + VDTGSD W C CP+ + + FDP S T +
Sbjct: 80 LYYTKIRLGSPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTATPV 139
Query: 91 SCSSSQCAV----VTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSG---L 141
SCS +C+ S CS + C+Y+F YG G S +SG ++ L F+ G +
Sbjct: 140 SCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDG---SGTSGFYVSDVLQFDMIVGSSLV 196
Query: 142 PVEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS-IAGK-FSYCLP-D 197
P V+FGC SD GI G G S+ISQ+ + +A + FS+CL +
Sbjct: 197 PNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGLAPRVFSHCLKGE 256
Query: 198 QGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNIFV 251
G I G + +V TPL+ + HY ++L +ISV Q L F +S+ +
Sbjct: 257 NGGGGILVLGEIVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQGTII 316
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMI--KAQPVKGVGAEPGFSDVLCYNISSQPK--FPEV 307
DTG L + ++N + +PV G + CY I++ FP V
Sbjct: 317 DTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQ-------CYVIATSVADIFPPV 369
Query: 308 TIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNA 341
+++F G +++F N D ++ GG A
Sbjct: 370 SLNFAGG------ASMFLNPQDYLIQQNNVGGTA 397
>gi|218197465|gb|EEC79892.1| hypothetical protein OsI_21411 [Oryza sativa Indica Group]
Length = 720
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 80/253 (31%), Positives = 115/253 (45%), Gaps = 35/253 (13%)
Query: 52 VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNCSE-G 107
+D+GSD +W QC+PCP C +Q PLFDP S+TY ++ C+S+ CA + CS
Sbjct: 172 IDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLGPYRRGCSANA 231
Query: 108 DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE-MPNVIFGCGHKNLASPTSDSKQ 166
C + YG G+ AT T +F+ + P + + FGC H + S D
Sbjct: 232 QCQFGINYG--------DGSTATGTYSFDDLTLGPYDVIRGFRFGCAHADRGS-AFDYDV 282
Query: 167 TGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGA---------GVVST 217
G + LG G+ SL+ Q T FSYCLP SS G +V G VST
Sbjct: 283 AGSLALGGGSQSLVQQTATRYGRVFSYCLPPTASS---LGFLVLGVPPERAQLIPSFVST 339
Query: 218 PLI----IRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEYH---SNL 268
PL+ Y + L AI V + L + S ++ + ++ L P Y +
Sbjct: 340 PLLSSSMAPTFYRVLLRAIIVAGRPLAVPPAVFSASSVIDSSTIISRLPPTAYQALRAAF 399
Query: 269 KSVMSNMIKAQPV 281
+S M+ A PV
Sbjct: 400 RSAMTMYRAAPPV 412
>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
gi|194692214|gb|ACF80191.1| unknown [Zea mays]
gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
Length = 441
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 93/371 (25%), Positives = 155/371 (41%), Gaps = 52/371 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPP--LFDPKKSSTYNSISC 92
Y + + +GTP + DTGS+ TW +C PP +F P+ S ++ + C
Sbjct: 91 YFVKVLVGTPAQEFTLVADTGSELTWVKC-------AGGASPPGLVFRPEASKSWAPVPC 143
Query: 93 SSSQCAV----VTSNCSEG--DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
SS C + +NCS CSY + Y G+ + G + T++ T G ++
Sbjct: 144 SSDTCKLDVPFSLANCSSSASPCSYDYRYKEGSAGAL--GVVGTDSATIALPGGKVAQLQ 201
Query: 147 NVIFGCGHKNLASPTSDSKQ----TGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK 202
+V+ GC S T D + G++ LG S S+ G FSYCL D + +
Sbjct: 202 DVVLGC------SSTHDGQSFKSVDGVLSLGNAKISFASRAAARFGGSFSYCLVDHLAPR 255
Query: 203 INFGGIVAGAGVV------STPLII---RDHYYLSLEAISVGNQRL----EFVSSSTGNI 249
G + G G V T L + Y + ++A+ V Q L E +G +
Sbjct: 256 NATGYLAFGPGQVPRTPATQTKLFLDPAMPFYGVKVDAVHVAGQALDIPAEVWDPKSGGV 315
Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS----QPKFP 305
+D+G T+L + + + ++ ++ P P F CYN ++ P+ P
Sbjct: 316 ILDSGTTLTVLATPAYKAVVAALTKLLAGVPKVDF---PPFEH--CYNWTAPRPGAPEIP 370
Query: 306 EVTIHFRGADVKLSPSNLFR-NISDEIMCSAFRGGN--ANIVYGRIMQINFLIGYDIEQA 362
++ + F G P+ + ++ + C + G V G IMQ L +D++
Sbjct: 371 KLAVQFTGCARLEPPAKSYVIDVKPGVKCIGLQEGEWPGVSVIGNIMQQEHLWEFDLKNM 430
Query: 363 MVSFKPSRCTN 373
V F PS CT
Sbjct: 431 EVRFMPSTCTR 441
>gi|449467979|ref|XP_004151699.1| PREDICTED: probable aspartic protease At2g35615-like, partial
[Cucumis sativus]
Length = 209
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 55/124 (44%), Positives = 76/124 (61%), Gaps = 13/124 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YLM +SIGTPPVD G DTGSD W QC PC L C+KQ P+FDP KS++++ + C+S
Sbjct: 92 YLMSVSIGTPPVDYIGMADTGSDLMWAQCLPC--LKCYKQSRPIFDPLKSTSFSHVPCNS 149
Query: 95 SQC-AVVTSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
C A+ S+C ++G C YS+ YG Y + G+L E +T S+S + GC
Sbjct: 150 QNCKAIDDSHCGAQGVCDYSYTYGDQTY---TKGDLGFEKITIGSSS------VKSVIGC 200
Query: 153 GHKN 156
GH++
Sbjct: 201 GHES 204
>gi|20160862|dbj|BAB89801.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 488
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 104/363 (28%), Positives = 162/363 (44%), Gaps = 58/363 (15%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCS 93
+Y+ IGTPP + G++D SD WT C F+P +S+T + C+
Sbjct: 99 MYVFSYGIGTPPQQVSGALDISSDLVWTACGATAP----------FNPVRSTTVADVPCT 148
Query: 94 SSQC---AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
C A T +C+Y+++YG G A+ ++G L TE TF T + V+F
Sbjct: 149 DDACQQFAPQTCGAGASECAYTYMYGGG--AANTTGLLGTEAFTFGDT-----RIDGVVF 201
Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-PDQG---SSKINFG 206
GCG KN+ S +G+IGLG GN SL+SQ+ +FSY PD S I FG
Sbjct: 202 GCGLKNVG---DFSGVSGVIGLGRGNLSLVSQLQVD---RFSYHFAPDDSVDTQSFILFG 255
Query: 207 --GIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSS--------TGNIFVD 252
+ +ST L+ D YY+ L I V + L S + +G +F+
Sbjct: 256 DDATPQTSHTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGGVFLS 315
Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKG--VGAEPGFSDVLCYNISS--QPKFPEVT 308
L T+L + L+ +++ I V G +G + LCY S + K P +
Sbjct: 316 ITDLVTVLEEAAYKPLRQAVASKIGLPAVNGSALGLD------LCYTGESLAKAKVPSMA 369
Query: 309 IHFRGADV-KLSPSNLF-RNISDEIMCSAFRGGNA--NIVYGRIMQINFLIGYDIEQAMV 364
+ F G V +L N F + + + C +A V G ++Q+ + YDI + +
Sbjct: 370 LVFAGGAVMELELGNYFYMDSTTGLACLTILPSSAGDGSVLGSLIQVGTHMMYDINGSKL 429
Query: 365 SFK 367
F+
Sbjct: 430 VFE 432
>gi|297720449|ref|NP_001172586.1| Os01g0776900 [Oryza sativa Japonica Group]
gi|255673740|dbj|BAH91316.1| Os01g0776900 [Oryza sativa Japonica Group]
Length = 381
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 84/251 (33%), Positives = 124/251 (49%), Gaps = 28/251 (11%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +G+PP + F +DTGSD W C P CP + F+P SST + I
Sbjct: 90 LYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKI 149
Query: 91 SCSSSQC--AVVTSN--CSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSG--- 140
CS +C A+ TS C D C Y+F YG G S +SG ++T+ F++ G
Sbjct: 150 PCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDG---SGTSGYYVSDTMYFDTVMGNEQ 206
Query: 141 LPVEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGT-SIAGK-FSYCLP- 196
+++FGC + T +D GI G G S++SQ+ + ++ K FS+CL
Sbjct: 207 TANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKG 266
Query: 197 -DQGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNI 249
D G + G IV G+V TPL+ + HY L+LE+I V Q+L F +S+T
Sbjct: 267 SDNGGGILVLGEIVE-PGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQGT 325
Query: 250 FVDTGVLRTLL 260
VD+G L
Sbjct: 326 IVDSGTTLAYL 336
>gi|255571584|ref|XP_002526738.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223533927|gb|EEF35652.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 457
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 102/378 (26%), Positives = 161/378 (42%), Gaps = 44/378 (11%)
Query: 17 KSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP 76
K PIS + D Y+M SIG+P VD + D+GS W QC +C++Q+
Sbjct: 88 KYPIS-----RMSYTDKAYVMKFSIGSPAVDTYAIPDSGSSLVWLQCGTPYCRNCYRQKI 142
Query: 77 PLFDPKKSSTYNSISCSSSQCAVVTSN----CSEGD--CSYSFLYGRGAYASFSSGNLAT 130
PLF+P KS TY C++++C V + C + + C Y Y +Y + G ++T
Sbjct: 143 PLFNPSKSVTYMKRLCNTAECRVALGDEYWRCKKPNQICKYHEDYLDDSY---TEGVIST 199
Query: 131 ETLTF-NSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG 189
+ TF SG +IFGCG+ N S G++GL +SL+ QM
Sbjct: 200 DIFTFPEHISGFGNYTLRIIFGCGYNN--SDPQHFYPPGLVGLTNNKASLVGQMDVD--- 254
Query: 190 KFSYCLPD------QGSSKINFGGIVAGAGVVSTPLIIRDHYYL--SLEAISVGNQRLE- 240
+FSYC+ +GS +I FG + +G + + D +Y+ +++ I V +E
Sbjct: 255 QFSYCVSIDTEQNLKGSMEIRFGLAASISGHSTQLVPNSDGWYIFKNVDGIYVNEFEVEG 314
Query: 241 -------FVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV 293
+ G + +DTG T L L ++ I P K + GF
Sbjct: 315 YPAWVFKYTEGGQGGLTMDTGTTYTELHNSVMDPLIKLLEEHITIVPEKDY-SNSGFE-- 371
Query: 294 LCY--NISSQPKFPEVTIHF---RGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRI 348
LCY + P++ + F + + N + MC A N + G
Sbjct: 372 LCYFSDDFLGATLPDIELRFTDNKDTYFSFNTRNAWTPNGRSQMCLAMFRTNGMSIIGMH 431
Query: 349 MQINFLIGYDIEQAMVSF 366
+ IGYD+ +VSF
Sbjct: 432 QLRDIKIGYDLHHNIVSF 449
>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
Length = 530
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 103/373 (27%), Positives = 171/373 (45%), Gaps = 46/373 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSIS 91
Y + +G+PP + F +DTGSD W C P CP + F+P SST + I
Sbjct: 117 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 176
Query: 92 CSSSQC--AVVTSN--CSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSG---L 141
CS +C A+ TS C D C Y+F YG G S +SG ++T+ F++ G
Sbjct: 177 CSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDG---SGTSGYYVSDTMYFDTVMGNEQT 233
Query: 142 PVEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGT-SIAGK-FSYCLP-- 196
+++FGC + T +D GI G G S++SQ+ + ++ K FS+CL
Sbjct: 234 ANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGS 293
Query: 197 DQGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNIF 250
D G + G IV G+V TPL+ + HY L+LE+I V Q+L F +S+T
Sbjct: 294 DNGGGILVLGEIVE-PGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQGTI 352
Query: 251 VDTGVLRTLLP----LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPE 306
VD+G L + + + + +S +++ KG + + S FP
Sbjct: 353 VDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSKG-------NQCFVTSSSVDSSFPT 405
Query: 307 VTIHFRGA-DVKLSPSN-LFRNIS---DEIMCSAFR--GGNANIVYGRIMQINFLIGYDI 359
V+++F G + + P N L + S + + C ++ G + G ++ + + YD+
Sbjct: 406 VSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYDL 465
Query: 360 EQAMVSFKPSRCT 372
+ + C+
Sbjct: 466 ANMRMGWTDYDCS 478
>gi|413938618|gb|AFW73169.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 324
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 111/340 (32%), Positives = 159/340 (46%), Gaps = 38/340 (11%)
Query: 52 VDTGSDCTWTQCEPCPEL-DCFKQEPPLFDPKKSSTYNSISCSSSQCAVV----TSNCSE 106
VDTGSD +W QC+PC C+ Q+ PLFDP +SS+Y ++ C CA + S CS
Sbjct: 3 VDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPCGGPVCAGLGIYAASACSA 62
Query: 107 GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQ 166
C Y YG G S ++G +++TLT +++S + FGCGH A +
Sbjct: 63 AQCGYVVSYGDG---SNTTGVYSSDTLTLSASS----AVQGFFFGCGH---AQSGLFNGV 112
Query: 167 TGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK----INFGGIVAGAGVVSTPLIIR 222
G++GLG SL+ Q + G FSYCLP + S+ + GG A ST ++
Sbjct: 113 DGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPSGAAPGFSTTQLLP 172
Query: 223 D-----HYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMI 276
+Y + L ISVG Q+L +S+ G VDTG + T LP ++ L+S + +
Sbjct: 173 SPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVVTRLPPTAYAALRSAFRSGM 232
Query: 277 KAQPVKGVGAEP--GFSDVLCYNIS--SQPKFPEVTIHF-RGADVKLSPSNLFRNISDEI 331
+ G P G D CYN + P V + F GA V L +
Sbjct: 233 AS---YGYPTAPSNGILDT-CYNFAGYGTVTLPNVALTFGSGATVTLGADGILSFGCLAF 288
Query: 332 MCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
S GG A + G + Q +F + I+ V FKPS C
Sbjct: 289 APSGSDGGMA--ILGNVQQRSFEV--RIDGTSVGFKPSSC 324
>gi|224144963|ref|XP_002325476.1| predicted protein [Populus trichocarpa]
gi|222862351|gb|EEE99857.1| predicted protein [Populus trichocarpa]
Length = 372
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 99/370 (26%), Positives = 157/370 (42%), Gaps = 62/370 (16%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +G P D + VDTGSD W C+ CP + L+DP S + +
Sbjct: 26 LYFAKIGLGNPSKDYYVQVDTGSDILWVNCIGCDKCPTKSDLGIKLTLYDPASSVSATRV 85
Query: 91 SCSSSQCAVVTS----NC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG-LPVE 144
SC C + +C E C Y+ +YG G S ++G ++ + F +G L
Sbjct: 86 SCDDDFCTSTYNGLLPDCKKELPCQYNVVYGDG---SSTAGYFVSDAVQFERVTGNLQTG 142
Query: 145 MPN--VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK 202
+ N V FGCG ++Q+G GLG +L I G F++CL
Sbjct: 143 LSNGTVTFGCG----------AQQSG--GLGTSGEAL-----DGILGAFAHCL-----DN 180
Query: 203 INFGGIVAGAGVVS-----TPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNIFV 251
+N GGI A +VS TP++ + HY + ++ I VG LE F S +
Sbjct: 181 VNGGGIFAIGELVSPKVNTTPMVPNQAHYNVYMKEIEVGGTVLELPTDVFDSGDRRGTII 240
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTI 309
D+G LP + S+M+ + QP G+ +C+ S FP++
Sbjct: 241 DSGTTLAYLPEVVYD---SMMNEIRSQQP--GLSLHTVEEQFICFKYSGNVDDGFPDIKF 295
Query: 310 HFRGA-DVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQI-------NFLIGYDIEQ 361
HF+ + + + P + IS++I C ++ G GR M + N L+ YDIE
Sbjct: 296 HFKDSLTLTVYPHDYLFQISEDIWCFGWQNGGMQSKDGRDMTLLGDLVLSNKLVLYDIEN 355
Query: 362 AMVSFKPSRC 371
+ + C
Sbjct: 356 QAIGWTEYNC 365
>gi|125533812|gb|EAY80360.1| hypothetical protein OsI_35532 [Oryza sativa Indica Group]
Length = 428
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 101/366 (27%), Positives = 170/366 (46%), Gaps = 51/366 (13%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCS 93
+Y++ + +GTP +DTGS +W C E D P F +S+T +SC
Sbjct: 81 LYVISVGLGTPAKTQIVEIDTGSSTSWVFC----ECDGCHTNPRTFLQSRSTTCAKVSCG 136
Query: 94 SSQCAVVTSN--CSEG----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
+S C + S+ C + DC + Y G S S G L +TLTF+ ++P+
Sbjct: 137 TSMCLLGGSDPHCQDSENYPDCPFRVSYQDG---SASYGILYQDTLTFSDVQ----KIPS 189
Query: 148 VIFGCGHKNLASPTSDS--KQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--- 202
FGC NL S ++ G++G+G G S++ Q G FSYCLP Q S +
Sbjct: 190 FTFGC---NLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDG-FSYCLPLQKSERGFF 245
Query: 203 ------INFGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEF---VSSSTGNI 249
+ G + V T ++ R + +++ L AISV +RL + S G +
Sbjct: 246 SKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVV 305
Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEV 307
F D+G + +P SV+S I+ ++ AE S+ CY++ S + P +
Sbjct: 306 F-DSGSELSYIP----DRALSVLSQRIRELLLRRGAAEEE-SERNCYDMRSVDEGDMPAI 359
Query: 308 TIHF-RGADVKLSPSNLF--RNISDE-IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAM 363
++HF GA L +F R++ ++ + C AF + + G +MQ + + YD+++ +
Sbjct: 360 SLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIGSLMQTSKEVVYDLKRQL 419
Query: 364 VSFKPS 369
+ PS
Sbjct: 420 IGIGPS 425
>gi|255581545|ref|XP_002531578.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223528808|gb|EEF30814.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 108/377 (28%), Positives = 173/377 (45%), Gaps = 62/377 (16%)
Query: 39 LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA 98
L++G+PP ++ +DTGS+ +W C+ L+ +F+P S TY+ + C S C
Sbjct: 73 LTVGSPPQNVTMVLDTGSELSWLHCKKTQFLNS------VFNPLSSKTYSKVPCLSPTCK 126
Query: 99 VVTSNCS-EGDCSYSFL-YGRGAYASFSS--GNLATETLTFNSTSGLPVEMPNVIFGCGH 154
T + + C + L + +YA +S GNLA ET S + P IFGC
Sbjct: 127 TRTRDLTIPVSCDATKLCHVIVSYADATSIEGNLAFETFRLGSLTK-----PATIFGCMD 181
Query: 155 KNLASPT-SDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAG 213
+S + DSK TG+IG+ G+ S ++QMG KFSYC+ S+ + G +
Sbjct: 182 SGFSSNSEEDSKTTGLIGMNRGSLSFVNQMGYP---KFSYCISGFDSAGVLLLGNASFPW 238
Query: 214 V----------VSTPLIIRDH--YYLSLEAISVGNQRLE-----FVSSST--GNIFVDTG 254
+ +STPL D Y + LE I V N+ L FV T G VD+G
Sbjct: 239 LKPLSYTPLVQISTPLPYFDRVAYTVQLEGIKVKNKVLSLPKSVFVPDHTGAGQTMVDSG 298
Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGF----SDVLCYNI-SSQP---KFPE 306
T L ++ LK+ + + + + V + F + LCY + SS+P P
Sbjct: 299 TQFTFLLGPVYTALKNEF--LSQTRGILKVLNDDNFVFQGAMDLCYLLDSSRPNLQNLPV 356
Query: 307 VTIHFRGADVKLSPSNLFRNI------SDEIMCSAFRGGNANI------VYGRIMQINFL 354
V++ F+GA++ +S L + D + C F GN+++ V G Q N
Sbjct: 357 VSLMFQGAEMSVSGERLLYRVPGEVRGRDSVWCFTF--GNSDLLGVEAFVIGHHHQQNVW 414
Query: 355 IGYDIEQAMVSFKPSRC 371
+ +D+E++ + RC
Sbjct: 415 MEFDLEKSRIGLADVRC 431
>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 99/369 (26%), Positives = 167/369 (45%), Gaps = 52/369 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+ + IG + VDT S+ TW QCEPC C Q+ PLFDP S +Y ++ C+S
Sbjct: 113 YVATVGIGGGEATVI--VDTASELTWVQCEPCDA--CHDQQEPLFDPSSSPSYAAVPCNS 168
Query: 95 SQCAVVT-------SNCSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
S C + C + CSY+ Y G+Y S G LA + L+ ++
Sbjct: 169 SSCDALRVATGMSGQACDDQPAACSYTLSYRDGSY---SRGVLAHDRLSLAGE-----DI 220
Query: 146 PNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKI 203
+FGCG N P + +G++GLG SLISQ G FSYCLP + GSS
Sbjct: 221 QGFVFGCGTSN-QGPFGGT--SGLMGLGRSQLSLISQTMDQFGGVFSYCLPPKESGSSGS 277
Query: 204 NFGGIVAGAGVVSTPLI--------IRDHYYLS-LEAISVGNQRLE---FVSSSTGNIFV 251
G A STP++ ++ +YL+ L I+VG + ++ F + G V
Sbjct: 278 LVLGDDASVYRNSTPIVYTAMVSDPLQGPFYLANLTGITVGGEDVQSPGFSAGGGGKAIV 337
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISS--QPKFPEVT 308
D+G + T L ++ +++ + + P A P FS + C++++ + + P +
Sbjct: 338 DSGTIITSLVPSVYAAVRAEFVSQLAEYPQ----AAP-FSILDTCFDLTGLREVQVPSLK 392
Query: 309 IHFR-GADVKLSPSNLFRNISDE-----IMCSAFRGGNANIVYGRIMQINFLIGYDIEQA 362
+ F GA+V++ + ++ + + ++ + + G Q N + +D +
Sbjct: 393 LVFDGGAEVEVDSKGVLYVVTGDASQVCLALASLKSEYDTPIIGNYQQKNLRVIFDTVGS 452
Query: 363 MVSFKPSRC 371
+ F C
Sbjct: 453 QIGFAQETC 461
>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 484
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 99/372 (26%), Positives = 163/372 (43%), Gaps = 42/372 (11%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + IGTP + VDTGSD W QC+ CP E L++ +S + +
Sbjct: 79 LYYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLV 138
Query: 91 SCSSSQCAVVT----SNCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSG-LPVE 144
SC C ++ S C C Y +YG G S ++G + + ++S +G L +
Sbjct: 139 SCDDDFCYQISGGPLSGCKANMSCPYLEIYGDG---SSTAGYFVKDVVQYDSVAGDLKTQ 195
Query: 145 MPN--VIFGCGHKNLASPTSDSKQT--GIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQ 198
N VIFGCG + S +++ GI+G G NSS+ISQ+ +S + F++CL +
Sbjct: 196 TANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGR 255
Query: 199 GSSKINFGGIVAGAGVVSTPLIIRD-HYYLSLEAISVGNQRLE-----FVSSSTGNIFVD 252
I G V V TPL+ HY +++ A+ VG + L F +D
Sbjct: 256 NGGGIFAIGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLNIPADLFQPGDRKGAIID 315
Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ--PKFPEVTIH 310
+G LP + L +++ A V V D C+ S + FP VT H
Sbjct: 316 SGTTLAYLPEIIYEPLVKKITSQEPALKVHIVD-----KDYKCFQYSGRVDEGFPNVTFH 370
Query: 311 FRGAD-VKLSPSN-LFRNISDEIMCSAF-------RGGNANIVYGRIMQINFLIGYDIEQ 361
F + +++ P + LF + + C + R + G ++ N L+ YD+E
Sbjct: 371 FENSVFLRVYPHDYLFP--YEGMWCIGWQNSAMQSRDRRNMTLLGDLVLSNKLVLYDLEN 428
Query: 362 AMVSFKPSRCTN 373
++ + C++
Sbjct: 429 QLIGWTEYNCSS 440
>gi|115483166|ref|NP_001065176.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|21717159|gb|AAM76352.1|AC074196_10 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433285|gb|AAP54823.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639785|dbj|BAF27090.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|215692411|dbj|BAG87831.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 98/371 (26%), Positives = 161/371 (43%), Gaps = 61/371 (16%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+ + +IGTPP +D + WTQC+ C CF+Q PLFDP S+TY + C +
Sbjct: 51 YVANFTIGTPPQPASAVIDLAGELVWTQCKQCGR--CFEQGTPLFDPTASNTYRAEPCGT 108
Query: 95 SQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
C + S NCS C+Y A + G + T+T + ++ FG
Sbjct: 109 PLCESIPSDVRNCSGNVCAYE----ASTNAGDTGGKVGTDTFAVGTAKA------SLAFG 158
Query: 152 CGHKNLASPTSDSK--QTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKINFGG 207
C + + D+ +GI+GLG SL++Q G + FSYCL D G + F G
Sbjct: 159 C----VVASDIDTMGGPSGIVGLGRTPWSLVTQTGVA---AFSYCLAPHDAGKNSALFLG 211
Query: 208 ----IVAGAGVVSTPLI--------IRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTGV 255
+ G STP + + ++Y + LE + G+ + S + T +
Sbjct: 212 SSAKLAGGGKAASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGS------TVL 265
Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNIS-SQPKFPEVTIH 310
L T P+ + L +K VGA P + V LC+ S + P++
Sbjct: 266 LDTFSPISF---LVDGAYQAVKKAVTVAVGAPPMATPVEPFDLCFPKSGASGAAPDLVFT 322
Query: 311 FR-GADVKLSPSNLFRNISDEIMCSAFRGGNANI-------VYGRIMQINFLIGYDIEQA 362
FR GA + + +N + + +C A +A + + G + Q N +D+++
Sbjct: 323 FRGGAAMTVPATNYLLDYKNGTVCLAML-SSARLNSTTELSLLGSLQQENIHFLFDLDKE 381
Query: 363 MVSFKPSRCTN 373
+SF+P+ CT
Sbjct: 382 TLSFEPADCTK 392
>gi|125571060|gb|EAZ12575.1| hypothetical protein OsJ_02481 [Oryza sativa Japonica Group]
Length = 501
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 103/378 (27%), Positives = 150/378 (39%), Gaps = 64/378 (16%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +GTP +DTGSD W QC PC C+ Q +FDP+ S +Y ++ C++
Sbjct: 147 YFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRR--CYDQSGQMFDPRASHSYGAVDCAA 204
Query: 95 SQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
C + S + C Y YG G S ++G+ ATETLTF S + +P V G
Sbjct: 205 PLCRRLDSGGCDLRRKACLYQVAYGDG---SVTAGDFATETLTFASGA----RVPRVALG 257
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD---------QGSSK 202
CGH N + + G+ G+ S SQ+ FSYCL D SS
Sbjct: 258 CGHDNEGLFVAAAGLLGLG---RGSLSFPSQISRRFGRSFSYCLVDRTSSSASATSRSST 314
Query: 203 INFGGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFV-------------------- 242
+ FG GA + R + E G+ L
Sbjct: 315 VTFGSGARGA-------LGRRVLHPDGEEPQDGDVLLRAAHGHQRRRRARPGRGRVRPPP 367
Query: 243 --SSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNI 298
S+ G + VD+G + + S A G+ PG + CY++
Sbjct: 368 DPSTGRGGVIVDSGRPSPAWARAGRTPPCATRSRAAAA----GLRLSPGGFSLFDTCYDL 423
Query: 299 SSQP--KFPEVTIHFR-GADVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINF 353
S K P V++HF GA+ L P N + S C AF G + + + G I Q F
Sbjct: 424 SGLKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGF 483
Query: 354 LIGYDIEQAMVSFKPSRC 371
+ +D + + F P C
Sbjct: 484 RVVFDGDGQRLGFVPKGC 501
>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
Length = 499
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 102/371 (27%), Positives = 165/371 (44%), Gaps = 42/371 (11%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +G+PP + + +DTGSD W C CP+ FDP SST + I
Sbjct: 82 LYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASLI 141
Query: 91 SCSSSQCAVVTSNCSEG------DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV- 143
SCS +C++ + G C Y+F YG G S +SG ++ L F++ G V
Sbjct: 142 SCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDG---SGTSGYYVSDLLNFDAIVGSSVT 198
Query: 144 -EMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS-IAGK-FSYCLPDQG 199
+++FGC T SD GI G G + S+ISQM + I K FS+CL G
Sbjct: 199 NSSASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDG 258
Query: 200 SSKINFGGI-VAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNIFVD 252
+ +V +PL+ + HY L+L++ISV + L F +S+ VD
Sbjct: 259 GGGGILVLGEIVEEDIVYSPLVPSQPHYNLNLQSISVNGKSLAIDPEVFATSTNRGTIVD 318
Query: 253 TGVLRTLLPLEYHSNLKSVMSNMI--KAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVT 308
+G L E + S ++ + +P+ G + CY I+S K FP V+
Sbjct: 319 SGTTLAYLAEEAYDPFVSAITEAVSQSVRPLLSKGTQ-------CYLITSSVKGIFPTVS 371
Query: 309 IHFRGA-DVKLSPSNLF---RNISD-EIMCSAFRG--GNANIVYGRIMQINFLIGYDIEQ 361
++F G + L P + +I D + C F+ G + G ++ + + YD+
Sbjct: 372 LNFAGGVSMNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGITILGDLVLKDKIFVYDLAG 431
Query: 362 AMVSFKPSRCT 372
+ + C+
Sbjct: 432 QRIGWANYDCS 442
>gi|21717166|gb|AAM76359.1|AC074196_17 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433306|gb|AAP54835.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|125575546|gb|EAZ16830.1| hypothetical protein OsJ_32301 [Oryza sativa Japonica Group]
Length = 373
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 102/373 (27%), Positives = 166/373 (44%), Gaps = 61/373 (16%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCS 93
+Y+ +L+IGTPP + + WTQC PC CFKQ+ PLF+ SSTY C
Sbjct: 27 LYMANLTIGTPPQPASAIIHLAGEFVWTQCSPCRR--CFKQDLPLFNRSASSTYRPEPCG 84
Query: 94 SSQC-AVVTSNCS-EGDCSYSF--LYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
++ C +V S CS +G CSY ++G +SG T+T + + ++
Sbjct: 85 TALCESVPASTCSGDGVCSYEVETMFGD------TSGIGGTDTFAIGTATA------SLA 132
Query: 150 FGCGHKNLASPTSDSKQ----TGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS----S 201
FGC S+ KQ +G++GLG SL+ QM A FSYCL G+ S
Sbjct: 133 FGCAMD------SNIKQLLGASGVVGLGRTPWSLVGQMN---ATAFSYCLAPHGAAGKKS 183
Query: 202 KINFGG---IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTG 254
+ G + G +TPL+ Y + LE I G+ + + + + VDT
Sbjct: 184 ALLLGASAKLAGGKSAATTPLVNTSDDSSDYMIHLEGIKFGDVIIAPPPNGS-VVLVDTI 242
Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY-------NISSQPKFPEV 307
+ L +K ++ + A P+ A P LC+ +S P+V
Sbjct: 243 FGVSFLVDAAFQAIKKAVTVAVGAAPM----ATPTKPFDLCFPKAAAAAGANSSLPLPDV 298
Query: 308 TIHFRG-ADVKLSPSNLFRNISDEIMCSAFR-GGNANI-----VYGRIMQINFLIGYDIE 360
+ F+G A + + PS + + +C A N+ + GR+ Q N +D++
Sbjct: 299 VLTFQGAAALTVPPSKYMYDAGNGTVCLAMMSSAMLNLTTELSILGRLHQENIHFLFDLD 358
Query: 361 QAMVSFKPSRCTN 373
+ +SF+P+ C++
Sbjct: 359 KETLSFEPADCSS 371
>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 102/371 (27%), Positives = 165/371 (44%), Gaps = 42/371 (11%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +G+PP + + +DTGSD W C CP+ FDP SST + I
Sbjct: 67 LYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASLI 126
Query: 91 SCSSSQCAVVTSNCSEG------DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV- 143
SCS +C++ + G C Y+F YG G S +SG ++ L F++ G V
Sbjct: 127 SCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDG---SGTSGYYVSDLLNFDAIVGSSVT 183
Query: 144 -EMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS-IAGK-FSYCLPDQG 199
+++FGC T SD GI G G + S+ISQM + I K FS+CL G
Sbjct: 184 NSSASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDG 243
Query: 200 SSKINFGGI-VAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNIFVD 252
+ +V +PL+ + HY L+L++ISV + L F +S+ VD
Sbjct: 244 GGGGILVLGEIVEEDIVYSPLVPSQPHYNLNLQSISVNGKSLAIDPEVFATSTNRGTIVD 303
Query: 253 TGVLRTLLPLEYHSNLKSVMSNMI--KAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVT 308
+G L E + S ++ + +P+ G + CY I+S K FP V+
Sbjct: 304 SGTTLAYLAEEAYDPFVSAITEAVSQSVRPLLSKGTQ-------CYLITSSVKGIFPTVS 356
Query: 309 IHFRGA-DVKLSPSNLF---RNISD-EIMCSAFRG--GNANIVYGRIMQINFLIGYDIEQ 361
++F G + L P + +I D + C F+ G + G ++ + + YD+
Sbjct: 357 LNFAGGVSMNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGITILGDLVLKDKIFVYDLAG 416
Query: 362 AMVSFKPSRCT 372
+ + C+
Sbjct: 417 QRIGWANYDCS 427
>gi|296085344|emb|CBI29076.3| unnamed protein product [Vitis vinifera]
Length = 434
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 122/446 (27%), Positives = 175/446 (39%), Gaps = 115/446 (25%)
Query: 12 DNETPKSPISIIYQAE-IISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCE----PC 66
DN K P +I E + V D YL+ L++GTPP I +DTGSD TW C C
Sbjct: 5 DNGLTKKPSGMIDMMEPLREVRDGYLISLNLGTPPKVIQVYMDTGSDLTWVPCGNLSFDC 64
Query: 67 PELDCFKQEP--------------------PLFDPKKSSTYNSISCSSSQCA---VVTSN 103
+ + ++ PL SS + C+ + C+ +V
Sbjct: 65 MDCNDYRNNKLMSTYSPSYSSSSLRDLCVSPLCSDVHSSDNSYDPCAVAGCSLSTLVKGT 124
Query: 104 CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTS-GLPVEMPNVIFGCGHKNLASPTS 162
C S+++ YG G G L +TLT + +S E+PN FGC P
Sbjct: 125 CPRPCPSFAYTYGAGGVV---IGTLTRDTLTTHGSSPSFTREVPNFCFGCVGSTYREP-- 179
Query: 163 DSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGVVSTPLIIR 222
GI G G G SL SQ+G G FS+C G A +S+PL+I
Sbjct: 180 ----IGIAGFGRGVLSLPSQLGFLQKG-FSHCF---------LGFKFANNPNISSPLVIG 225
Query: 223 D----------------------HYYLSLEAISVGNQRL--------EFVSSSTGNIFVD 252
D +YY+ LEAI+VGN EF S G + +D
Sbjct: 226 DLAISSNDHLQFTSLLKNPMYPNYYYIGLEAITVGNATAIQVPSSLREFDSHGNGGMIID 285
Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI--------SSQPKF 304
+G T LP +++ L S++ ++I + A GF LCY I
Sbjct: 286 SGTTYTHLPGPFYTQLLSMLQSIITYPRAQEQEARTGFD--LCYRIPCPNNVVTDHDHLL 343
Query: 305 PEVTIHFRGADVKL------------SPSN-------LFRNISDEIMCSAFRGGNANIVY 345
P ++ HF +V L +PSN L +N+ D G A V+
Sbjct: 344 PSISFHFSN-NVSLVLPQGNHFYAMGAPSNSTVVKCLLLQNMDDS------DSGPAG-VF 395
Query: 346 GRIMQINFLIGYDIEQAMVSFKPSRC 371
G Q N + YD+E+ + F+P C
Sbjct: 396 GSFQQQNVKVVYDLEKERIGFQPMDC 421
>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
Length = 426
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 106/382 (27%), Positives = 163/382 (42%), Gaps = 52/382 (13%)
Query: 19 PISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPL 78
P+ I +I+S+ + Y+ +GTP + ++D +D W C C C P
Sbjct: 68 PVPIAPGRQILSIPN-YIARAGLGTPAQTLLVAIDPSNDAAWVPCSAC--AGCAASSP-S 123
Query: 79 FDPKKSSTYNSISCSSSQCAVVTS-NCSEG---DCSYSFLYGRGAY-ASFSSGNLATETL 133
F P +SSTY ++ C S QCA V S +C G C ++ Y + A +LA E
Sbjct: 124 FSPTQSSTYRTVPCGSPQCAQVPSPSCPAGVGSSCGFNLTYAASTFQAVLGQDSLALENN 183
Query: 134 TFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSY 193
S + FGC + S S Q G+IG G G S +SQ + FSY
Sbjct: 184 VVVSYT----------FGC--LRVVSGNSVPPQ-GLIGFGRGPLSFLSQTKDTYGSVFSY 230
Query: 194 CLPDQGSSK----INFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSS 245
CLP+ SS + G I + +TPL+ H YY+++ I VG++ ++ S+
Sbjct: 231 CLPNYRSSNFSGTLKLGPIGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSA 290
Query: 246 ------TGN-IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI 298
TG+ +D G + T L ++ ++ ++ +G GF CYN+
Sbjct: 291 LAFNPVTGSGTIIDAGTMFTRLAAPVYAAVRDAFRGRVRTPVAPPLG---GFDT--CYNV 345
Query: 299 SSQPKFPEVTIHFRGA-DVKLSPSN-LFRNISDEIMCSAFRGG-----NANI-VYGRIMQ 350
+ P VT F GA V L N + + S + C A G NA + V + Q
Sbjct: 346 TV--SVPTVTFMFAGAVAVTLPEENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQ 403
Query: 351 INFLIGYDIEQAMVSFKPSRCT 372
N + +D+ V F CT
Sbjct: 404 QNQRVLFDVANGRVGFSRELCT 425
>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
Length = 512
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 99/363 (27%), Positives = 156/363 (42%), Gaps = 64/363 (17%)
Query: 52 VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQC-------------- 97
VDT S+ TW QC PC C Q+ PLFDP S +Y ++ C+SS C
Sbjct: 168 VDTASELTWVQCAPCES--CHDQQDPLFDPSSSPSYAAVPCNSSSCDALQLATGGTSGGA 225
Query: 98 -AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKN 156
A + S CSY+ Y G+Y S G LA + L+ + +FGCG N
Sbjct: 226 AACQGQDQSAAACSYTLSYRDGSY---SRGVLAHDRLSLAGEV-----IDGFVFGCGTSN 277
Query: 157 LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAG----- 211
P + +G++GLG SL+SQ G FSYCLP + S + G +V G
Sbjct: 278 QGPPFGGT--SGLMGLGRSQLSLVSQTMDQFGGVFSYCLPLKESD--SSGSLVIGDDSSV 333
Query: 212 ---------AGVVSTPLIIRDHYYLSLEAISVGNQRLE----FVSSSTGNIFVDTG-VLR 257
A +VS PL Y+++L I+VG Q +E G +D+G V+
Sbjct: 334 YRNSTPIVYASMVSDPL-QGPFYFVNLTGITVGGQEVESSGFSSGGGGGKAIIDSGTVIT 392
Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFRGA 314
+L+P Y++ +S + PGFS + C+N++ + + P + + F G
Sbjct: 393 SLVPSIYNAVKAEFLSQFAEYPQA------PGFSILDTCFNMTGLREVQVPSLKLVFDGG 446
Query: 315 -DVKLSPSNLFRNISDE-----IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKP 368
+V++ + +S + + + + + G Q N + +D + V F
Sbjct: 447 VEVEVDSGGVLYFVSSDSSQVCLAMAPLKSEYETNIIGNYQQKNLRVIFDTSGSQVGFAQ 506
Query: 369 SRC 371
C
Sbjct: 507 ETC 509
>gi|357127507|ref|XP_003565421.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 438
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 97/361 (26%), Positives = 159/361 (44%), Gaps = 60/361 (16%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YLM L + TPPV + DTGS W +C + P P SS+Y + C +
Sbjct: 76 YLMALDVSTPPVRMLALADTGSSLVWLKC----------KLPAAHTP-ASSSYARLPCDA 124
Query: 95 SQCAVV--TSNCSEGD-----CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
C + ++C C Y + + G S ++G + + TF++
Sbjct: 125 FACKALGDAASCRATGSGNNICVYRYAFADG---SCTAGPVTVDAFTFST---------R 172
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMG--TSIAGKFSYCL-----PDQGS 200
+ FGC + D G++GL G SL+SQ+ T A KFSYCL + S
Sbjct: 173 LDFGCATRTEGLSVPDD---GLVGLANGPISLVSQLSAKTPFAHKFSYCLVPYSSSETVS 229
Query: 201 SKINFG--GIVAGA-GVVSTPLII---RDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTG 254
S +NFG IV+ + G +TPL+ + Y ++L++I V + + +++T + VD+G
Sbjct: 230 SSLNFGSHAIVSSSPGAATTPLVAGRNKSFYTIALDSIKVAGKPVPLQTTTT-KLIVDSG 288
Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP------KFPEVT 308
+ T LP L + ++ IK VK P +CY++ + P+VT
Sbjct: 289 TMLTYLPKAVLDPLVAALTAAIKLPRVK----SPETLYAVCYDVRRRAPEDVGKSIPDVT 344
Query: 309 IHF-RGADVKLSPSNLF--RNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVS 365
+ G +V+L N F N + + + G + Q N +G+D+E+ VS
Sbjct: 345 LVLGGGGEVRLPWGNTFVVENKGTTVCLALVESHLPEFILGNVAQQNLHVGFDLERRTVS 404
Query: 366 F 366
F
Sbjct: 405 F 405
>gi|218195474|gb|EEC77901.1| hypothetical protein OsI_17222 [Oryza sativa Indica Group]
Length = 467
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 107/378 (28%), Positives = 158/378 (41%), Gaps = 56/378 (14%)
Query: 35 YLMHLSIGTPPVDI---FGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSIS 91
YL+ L IGTP I + DTGSD +WTQCEPC F PP DP KS T+ +S
Sbjct: 102 YLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPP-HDPSKSRTFRRLS 160
Query: 92 CSSSQC----AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNST---SGLPVE 144
C C AVV C + YG G SG L ++ F + G +E
Sbjct: 161 CFDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAV---SGELVSDVFHFGAAGDGGGYQLE 217
Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP-------- 196
+V FGC H S TGI+ LG G S ++Q+G +FSYC+P
Sbjct: 218 R-DVAFGCAHVE-DSKAVRGYSTGILALGIGKPSFVTQLGVD---RFSYCIPASEITDDD 272
Query: 197 -----DQGSSKINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVG-----NQRLEFVSSST 246
++ +S + FG G + Y + L+++ NQ+
Sbjct: 273 DDDDEERSASFLRFGSHARMTGKRAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPVYVA 332
Query: 247 GN-------IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY--N 297
G + VD+G LP L+ + I + P + CY N
Sbjct: 333 GEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDL-THP---SLYCYLGN 388
Query: 298 ISSQPKFPEVTIHF-RGADVKLSPSNLF---RNISDEIMCSAFRGGNANIVYGRIMQINF 353
++ VT+ F GAD++L ++LF N++++ +C A GN I+ G Q N
Sbjct: 389 MTDVEAV-SVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAGNRAIL-GVYPQRNI 446
Query: 354 LIGYDIEQAMVSFKPSRC 371
+GYD+ ++F +C
Sbjct: 447 NVGYDLSTMEIAFDRDQC 464
>gi|224102847|ref|XP_002312826.1| predicted protein [Populus trichocarpa]
gi|222849234|gb|EEE86781.1| predicted protein [Populus trichocarpa]
Length = 445
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 111/402 (27%), Positives = 160/402 (39%), Gaps = 88/402 (21%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTW---------TQCEPCPELDCFKQEPPLFDPKKSS 85
Y + LS GTPP + +DTGSD W C + +P F PK+SS
Sbjct: 67 YSVSLSFGTPPQTLSFIMDTGSDIVWFPCTSHYLCKHCSFSSSSPSSRIQP--FIPKESS 124
Query: 86 TYNSISCSSSQCAVVTS---NCSEGDCS-----------YSFLYGRGAYASFSSGNLATE 131
+ + C + +C+ + NC + DCS Y YG G + G +E
Sbjct: 125 SSKLLGCKNPKCSWIHHSNINCDQ-DCSIKSCLNQTCPPYMIFYGSGT----TGGVALSE 179
Query: 132 TLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKF 191
TL +S S PN + GC S S + GI G G G SSL SQ+G GKF
Sbjct: 180 TLHLHSLS-----KPNFLVGC------SVFSSHQPAGIAGFGRGLSSLPSQLGL---GKF 225
Query: 192 SYCL--------PDQGSSKI----NFGGIVAGAGVVSTPLI----------IRDHYYLSL 229
SYCL + SS + +V TP + +YYL L
Sbjct: 226 SYCLLSHRFDDDTKKSSSLVLDMEQLDSDKKTNALVYTPFVKNPKVDNKSSFSVYYYLGL 285
Query: 230 EAISVGNQRLE----FVS---SSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIK-AQPV 281
I+VG ++ ++S G + +D+G T + E L IK + V
Sbjct: 286 RRITVGGHHVKVPYKYLSPGEDGNGGVIIDSGTTFTFMAREAFEPLSDEFIRQIKDYRRV 345
Query: 282 KGVGAEPGFSDVLCYNISSQP--KFPEVTIHFRG-ADVKLSPSNLFRNISDEIMCSAF-- 336
K + G C+N+S FPE+ ++F+G ADV L N F + E+ C
Sbjct: 346 KEIEDAIGLRP--CFNVSDAKTVSFPELRLYFKGGADVALPVENYFAFVGGEVACLTVVT 403
Query: 337 -------RGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
R G ++ G NF + YD+ + FK +C
Sbjct: 404 DGVAGPERVGGPGMILGNFQMQNFYVEYDLRNERLGFKQEKC 445
>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 445
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 106/382 (27%), Positives = 163/382 (42%), Gaps = 52/382 (13%)
Query: 19 PISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPL 78
P+ I +I+S+ + Y+ +GTP + ++D +D W C C C P
Sbjct: 87 PVPIAPGRQILSIPN-YIARAGLGTPAQTLLVAIDPSNDAAWVPCSAC--AGCAASSP-S 142
Query: 79 FDPKKSSTYNSISCSSSQCAVVTS-NCSEG---DCSYSFLYGRGAY-ASFSSGNLATETL 133
F P +SSTY ++ C S QCA V S +C G C ++ Y + A +LA E
Sbjct: 143 FSPTQSSTYRTVPCGSPQCAQVPSPSCPAGVGSSCGFNLTYAASTFQAVLGQDSLALENN 202
Query: 134 TFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSY 193
S + FGC + S S Q G+IG G G S +SQ + FSY
Sbjct: 203 VVVSYT----------FGC--LRVVSGNSVPPQ-GLIGFGRGPLSFLSQTKDTYGSVFSY 249
Query: 194 CLPDQGSSK----INFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSS 245
CLP+ SS + G I + +TPL+ H YY+++ I VG++ ++ S+
Sbjct: 250 CLPNYRSSNFSGTLKLGPIGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSA 309
Query: 246 ------TGN-IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI 298
TG+ +D G + T L ++ ++ ++ +G GF CYN+
Sbjct: 310 LAFNPVTGSGTIIDAGTMFTRLAAPVYAAVRDAFRGRVRTPVAPPLG---GFDT--CYNV 364
Query: 299 SSQPKFPEVTIHFRGA-DVKLSPSN-LFRNISDEIMCSAFRGG-----NANI-VYGRIMQ 350
+ P VT F GA V L N + + S + C A G NA + V + Q
Sbjct: 365 TV--SVPTVTFMFAGAVAVTLPEENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQ 422
Query: 351 INFLIGYDIEQAMVSFKPSRCT 372
N + +D+ V F CT
Sbjct: 423 QNQRVLFDVANGRVGFSRELCT 444
>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
Length = 659
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 104/364 (28%), Positives = 165/364 (45%), Gaps = 47/364 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y L IGTPP + VDTGS T+ C C C K + P F P +SSTY+ + C+
Sbjct: 88 YTTRLWIGTPPQEFALIVDTGSTVTYVPCSDCEH--CGKHQDPRFQPDESSTYHPVKCN- 144
Query: 95 SQCAVVTSNCSEG--DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
+ NC +C Y Y A S SSG L + ++F + S + + +FGC
Sbjct: 145 -----MDCNCDHDGVNCVYERRY---AEMSSSSGVLGEDIISFGNQSEVVPQ--RAVFGC 194
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQM--GTSIAGKFSYCLPDQGSSKINFGGIVA 210
+ S + GI+GLG G S++ Q+ I FS C G + G +V
Sbjct: 195 ENVETGDLYSQ-RADGIMGLGRGQLSIVDQLVDKNVINDSFSLCY---GGMHVGGGAMVL 250
Query: 211 GAGVVSTPLII-------RDHYY-LSLEAISVGNQRLEFVSSS---TGNIFVDTGVLRTL 259
G G+ P ++ R YY + L+ I V + L+ S+ +D+G
Sbjct: 251 G-GIPPPPDMVFSRSDPYRSPYYNIELKEIHVAGKPLKLSPSTFDRKHGTVLDSGTTYAY 309
Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGV-GAEPGFSDVLCY-----NISSQPK-FPEVTIHF- 311
LP E + + + K+ +K + G +P ++D+ C+ ++S K FPEV + F
Sbjct: 310 LPEEAFVAFRDAI--IKKSHNLKQIHGPDPNYNDI-CFSGAGRDVSQLSKAFPEVDMVFS 366
Query: 312 RGADVKLSPSN-LFRN--ISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKP 368
G + L+P N LF++ + FR G++ + G I+ N L+ YD E + F
Sbjct: 367 NGQKLSLTPENYLFQHTKVHGAYCLGIFRNGDSTTLLGGIIVRNTLVTYDRENEKIGFWK 426
Query: 369 SRCT 372
+ C+
Sbjct: 427 TNCS 430
>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 108/361 (29%), Positives = 157/361 (43%), Gaps = 40/361 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y L IGTPP VDTGS T+ C C + C + + P FDP+ SSTY I C+
Sbjct: 83 YTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQ--CGRHQDPKFDPESSSTYKPIKCN- 139
Query: 95 SQCAVVTSNC-SEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
+ C S+G C Y Y A S SSG L + ++F + S L + +FGC
Sbjct: 140 -----IDCICDSDGVQCVYERQY---AEMSTSSGVLGEDVISFGNQSELIPQ--RAVFGC 189
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQM--GTSIAGKFSYCL--PDQGSSKINFGGI 208
+ S + GI+GLG G+ SL+ Q+ +I FS C D G + GGI
Sbjct: 190 ENMETGDLFS-QRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGGGAMVLGGI 248
Query: 209 VAGAGVV---STPLIIRDHYY-LSLEAISVGNQRLEFVSSSTGNIF---VDTGVLRTLLP 261
+ ++ S P +R YY + L+ I V ++L S + +D+G LP
Sbjct: 249 SPPSDMIFTYSDP--VRSPYYNVDLKEIHVAGKKLPLSSGIFDGRYGAVLDSGTTYAYLP 306
Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP-----KFPEVTIHFR-GAD 315
E S K + + I + K G +P F D+ S KFP V + F G
Sbjct: 307 AEAFSAFKDAIMDEIHSLK-KIDGPDPNFKDICFSGAGSDAAELSNKFPTVDMVFENGQK 365
Query: 316 VKLSPSNLF---RNISDEIMCSAFRGGN-ANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ L+P N F + F GN + G I+ N L+ YD + + F + C
Sbjct: 366 LSLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYDRANSKIGFWKTNC 425
Query: 372 T 372
+
Sbjct: 426 S 426
>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 108/362 (29%), Positives = 159/362 (43%), Gaps = 42/362 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y L IGTPP VDTGS T+ C C + C + + P FDP+ SSTY I C+
Sbjct: 83 YTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQ--CGRHQDPKFDPESSSTYKPIKCN- 139
Query: 95 SQCAVVTSNC-SEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
+ C S+G C Y Y A S SSG L + ++F + S L + +FGC
Sbjct: 140 -----IDCICDSDGVQCVYERQY---AEMSTSSGVLGEDVISFGNQSELIPQ--RAVFGC 189
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQM--GTSIAGKFSYCL--PDQGSSKINFGGI 208
+ S + GI+GLG G+ SL+ Q+ +I FS C D G + GGI
Sbjct: 190 ENMETGDLFS-QRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGGGAMVLGGI 248
Query: 209 VAGAGVV---STPLIIRDHYY-LSLEAISVGNQRLEFVSSSTGNIF---VDTGVLRTLLP 261
+ ++ S P +R YY + L+ I V ++L S + +D+G LP
Sbjct: 249 SPPSDMIFTYSDP--VRSPYYNVDLKEIHVAGKKLPLSSGIFDGRYGAVLDSGTTYAYLP 306
Query: 262 LEYHSNLKSVMSNMIKAQPVKGV-GAEPGFSDVLCYNISSQP-----KFPEVTIHFR-GA 314
E S K + + I + +K + G +P F D+ S KFP V + F G
Sbjct: 307 AEAFSAFKDAIMDEIHS--LKKIDGPDPNFKDICFSGAGSDAAELSNKFPTVDMVFENGQ 364
Query: 315 DVKLSPSNLF---RNISDEIMCSAFRGGN-ANIVYGRIMQINFLIGYDIEQAMVSFKPSR 370
+ L+P N F + F GN + G I+ N L+ YD + + F +
Sbjct: 365 KLSLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYDRANSKIGFWKTN 424
Query: 371 CT 372
C+
Sbjct: 425 CS 426
>gi|116311058|emb|CAH67989.1| OSIGBa0142I02-OSIGBa0101B20.32 [Oryza sativa Indica Group]
Length = 488
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 107/378 (28%), Positives = 158/378 (41%), Gaps = 56/378 (14%)
Query: 35 YLMHLSIGTPPVDI---FGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSIS 91
YL+ L IGTP I + DTGSD +WTQCEPC F PP DP KS T+ +S
Sbjct: 123 YLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPP-HDPSKSRTFRRLS 181
Query: 92 CSSSQC----AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNST---SGLPVE 144
C C AVV C + YG G SG L ++ F + G +E
Sbjct: 182 CFDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAV---SGELVSDVFHFGAAGDGGGYQLE 238
Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP-------- 196
+V FGC H S TGI+ LG G S ++Q+G +FSYC+P
Sbjct: 239 R-DVAFGCAHVE-DSKAVRGYSTGILALGIGKPSFVTQLGVD---RFSYCIPASEITDDD 293
Query: 197 -----DQGSSKINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVG-----NQRLEFVSSST 246
++ +S + FG G + Y + L+++ NQ+
Sbjct: 294 DDDDEERSASFLRFGSHARMTGKRAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPVYVA 353
Query: 247 GN-------IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY--N 297
G + VD+G LP L+ + I + P + CY N
Sbjct: 354 GEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDL-THP---SLYCYLGN 409
Query: 298 ISSQPKFPEVTIHF-RGADVKLSPSNLF---RNISDEIMCSAFRGGNANIVYGRIMQINF 353
++ VT+ F GAD++L ++LF N++++ +C A GN I+ G Q N
Sbjct: 410 MTDVEAV-SVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAGNRAIL-GVYPQRNI 467
Query: 354 LIGYDIEQAMVSFKPSRC 371
+GYD+ ++F +C
Sbjct: 468 NVGYDLSTMEIAFDRDQC 485
>gi|357113696|ref|XP_003558637.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 432
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 96/376 (25%), Positives = 161/376 (42%), Gaps = 60/376 (15%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ +G+P I ++DT +D TW C PC LF P S++Y + CSS
Sbjct: 77 YVVRAGLGSPAQPILLALDTSADATWAHCSPC---GTCPSSGSLFAPANSTSYAPLPCSS 133
Query: 95 SQCAVVTSN-CSEGD----------CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
+ C V+ C D C+++ + A ASF + +LA++ L +
Sbjct: 134 TMCTVLQGQPCPAQDPYDSSAPLPMCAFTKPF---ADASFQA-SLASDWLHLGKDA---- 185
Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG---- 199
+PN FGC ++ PT++ + G++GLG G +L+SQ+G G FSYCLP
Sbjct: 186 -IPNYAFGC-VSAVSGPTANLPKQGLLGLGRGPMALLSQVGNMYNGVFSYCLPSYKSYYF 243
Query: 200 SSKINFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLE-------FVSSSTGN 248
S + G GV TP++ + YY+++ +SVG ++ F ++
Sbjct: 244 SGSLRLGAAGQPRGVRYTPMLKNPNRSSLYYVNVTGLSVGRAPVKVPAGSFAFDPATGAG 303
Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNISSQPK- 303
VD+G + T ++ L+ + V A G++ + C+N
Sbjct: 304 TVVDSGTVITRWTPPVYAALREEFR--------RHVAAPSGYTSLGAFDTCFNTDEVAAG 355
Query: 304 -FPEVTIHFRGA-DVKLSPSN-LFRNISDEIMCSAFRGGNANI-----VYGRIMQINFLI 355
P VT+H G D+ L N L + + + C A N+ V + Q N +
Sbjct: 356 VAPAVTVHMDGGLDLALPMENTLIHSSATPLACLAMAEAPQNVNAVVNVLANLQQQNLRV 415
Query: 356 GYDIEQAMVSFKPSRC 371
+D+ + V F C
Sbjct: 416 VFDVANSRVGFARESC 431
>gi|388495448|gb|AFK35790.1| unknown [Medicago truncatula]
Length = 434
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 107/397 (26%), Positives = 175/397 (44%), Gaps = 51/397 (12%)
Query: 1 AQNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTW 60
A++ ++ +++ KS + I +II Y++ GTPP + ++DT SD W
Sbjct: 64 AKDQARMQYFSSLVARKSVVPIASARQIIQ-SPTYIVKAKFGTPPQTLLLALDTSSDAAW 122
Query: 61 TQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTS-NCSEGDCSYSFLYGRGA 119
C C + C +P F P KS+++ ++SC S C V + C C+++F YG +
Sbjct: 123 IPCSGC--VGCSTSKP--FAPIKSTSFRNVSCGSPHCKQVPNPTCGGSACAFNFTYGSSS 178
Query: 120 YASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSL 179
A+ ++ +TLT + +P FGC +K S + Q G++GLG G SL
Sbjct: 179 IAA----SVVQDTLTLAAD-----PIPGYTFGCVNKTTG---SSAPQQGLLGLGRGPLSL 226
Query: 180 ISQMGTSIAGKFSYCLPDQGSSKINFGG------IVAGAGVVSTPLIIRDH----YYLSL 229
+SQ FSYCLP S INF G + + TPL+ YY++L
Sbjct: 227 LSQSQNLYKSTFSYCLPSFKS--INFSGSLRLGPVYQPKRIKYTPLLRNPRRSSLYYVNL 284
Query: 230 EAISVGNQ-------RLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQ-PV 281
AI VG + L F ++ D+G + T L ++ +++ + + PV
Sbjct: 285 VAIKVGRKIVDIPPAALAFNPTTGAGTIFDSGTVFTRLAEPVYTAVRNEFRRRVGPKLPV 344
Query: 282 KGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNL-FRNISDEIMCSAFRGGN 340
+G GF CYN+ P +T F G +V L P N+ + + C A G
Sbjct: 345 TTLG---GFDT--CYNVPI--VVPTITFLFSGMNVALPPDNIVIHSTAGSTTCLAMAGAP 397
Query: 341 ANI-----VYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
N+ V + Q N + +D+ + + CT
Sbjct: 398 DNVNSVLNVIANMQQQNHRVLFDVPNSRIGIARELCT 434
>gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 96/370 (25%), Positives = 158/370 (42%), Gaps = 40/370 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +GTP VDTGS+ TW C + +F ++S ++ ++ C +
Sbjct: 88 YFTEVRVGTPAKKFRVVVDTGSELTWVNCRYRGRGKGKVKNRRVFRAEESKSFKTVGCFT 147
Query: 95 SQCAVVTSNC--------SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
C V N CSY + Y G+ A G A ET+T T+G +
Sbjct: 148 QTCKVDLMNLFSLSTCPTPSTPCSYDYRYADGSAAQ---GVFAKETITVGLTNGRKARLR 204
Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---- 202
++ GC + G++GL + S S + K SYCL D S+K
Sbjct: 205 GLLVGCSSSFSGQSFQGAD--GVLGLAFSDFSFTSTATSLFGAKLSYCLVDHLSNKNISN 262
Query: 203 -INFG-----GIVAGAGVVSTPL---IIRDHYYLSLEAISVGNQRLE-----FVSSSTGN 248
+ FG A +TPL +I Y +++ IS+G+ L+ + +++ G
Sbjct: 263 YLIFGYSSSSTSTKTAPGRTTPLDLTLIPPFYAINIIGISIGDDMLDIPTQVWDATTGGG 322
Query: 249 IFVDTGVLRTLLP-LEYHSNLKSVMSNMIKAQPVK--GVGAEPGFSDVLCYNISSQPKFP 305
+D+G TLL Y + + +++ + VK G+ E FS +N + K P
Sbjct: 323 TILDSGTSLTLLAEAAYKPVVTGLARYLVELKRVKPEGIPIEYCFSSTSGFN---ESKLP 379
Query: 306 EVTIHFRG-ADVKLSPSNLFRNISDEIMCSAFR--GGNANIVYGRIMQINFLIGYDIEQA 362
++T H +G A + + + + + C F G A V G IMQ N+L +D+ +
Sbjct: 380 QLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFMSAGTPATNVVGNIMQQNYLWEFDLMAS 439
Query: 363 MVSFKPSRCT 372
+SF PS CT
Sbjct: 440 TLSFAPSTCT 449
>gi|449446233|ref|XP_004140876.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 498
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 102/379 (26%), Positives = 163/379 (43%), Gaps = 58/379 (15%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + IGTP D + VDTGSD W QC CP E +D ++S+T +
Sbjct: 86 LYYAKIGIGTPSKDYYVQVDTGSDIVWVNCIQCRECPRTSSLGMELTPYDLEESTTGKLV 145
Query: 91 SCSSSQCAVVT----SNCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSG-LPVE 144
SC C V S C+ C Y +YG G S ++G + + +N SG L
Sbjct: 146 SCDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDG---SSTAGYFVKDYVQYNRVSGDLETT 202
Query: 145 MPN--VIFGCGHKNLASPTSDSKQT--GIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQ 198
N + FGCG + S ++ GI+G G NSS+ISQ+ ++ + F++CL
Sbjct: 203 AANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCL--- 259
Query: 199 GSSKINFGGIVAGAGVVS-----TPLIIRD-HYYLSLEAISVGNQRLE-----FVSSSTG 247
N GGI A VV TPL+ HY +++ + VG+ L F +
Sbjct: 260 --DGTNGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVGHIILNISADVFEAGDRK 317
Query: 248 NIFVDTGVLRTLLP-LEYHSNLKSVMSNM--IKAQPVKGVGAEPGFSDVLCYNISSQPK- 303
+D+G LP L Y + ++S ++ Q + G + C+ S +
Sbjct: 318 GTIIDSGTTLAYLPELIYEPLVAKILSQQHNLEVQTIHG--------EYKCFQYSERVDD 369
Query: 304 -FPEVTIHFRGA-DVKLSPSN-LFRNISDEIMCSAF-------RGGNANIVYGRIMQINF 353
FP V HF + +K+ P LF+ + + C + R ++G ++ N
Sbjct: 370 GFPPVIFHFENSLLLKVYPHEYLFQ--YENLWCIGWQNSGMQSRDRKNVTLFGDLVLSNK 427
Query: 354 LIGYDIEQAMVSFKPSRCT 372
L+ YD+E + + C+
Sbjct: 428 LVLYDLENQTIGWTEYNCS 446
>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
Length = 492
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 112/361 (31%), Positives = 157/361 (43%), Gaps = 39/361 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + IGTPP + VDTGS T+ C C C + P F P SS+Y + C
Sbjct: 35 YTSRVKIGTPPHEFSLIVDTGSTVTYVPCSSCTH--CGNHQDPRFSPALSSSYKPLECG- 91
Query: 95 SQCAVVTSNCSEGDCSYSFLYGRG-AYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
S CS G C S Y R A S SSG L + + F+++S L + ++FGC
Sbjct: 92 -------SECSTGFCDGSRKYQRQYAEKSTSSGVLGKDVIGFSNSSDLGGQ--RLVFGCE 142
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQM--GTSIAGKFSYCLP--DQGSSKINFGGIV 209
D GIIGLG G S+I Q+ ++ FS C D+G + GG
Sbjct: 143 TAETGD-LYDQTADGIIGLGRGPLSIIDQLVEKNAMEDVFSLCYGGMDEGGGAMILGGFQ 201
Query: 210 AGAGVVSTPLI-IRDHYY-LSLEAISVGNQRL----EFVSSSTGNIFVDTGVLRTLLPLE 263
+V T R YY L L+ I VG L E G + +D+G P
Sbjct: 202 PPKDMVFTASDPHRSPYYNLMLKGIRVGGSPLRLKPEVFDGKYGTV-LDSGTTYAYFPGA 260
Query: 264 YHSNLKSVMSNMIKAQPVKGV-GAEPGFSDVLCY-----NISSQPK-FPEVTIHF-RGAD 315
KS + + + +K V G + F D+ CY N+S+ + FP V F G
Sbjct: 261 AFQAFKSAVKEQVGS--LKEVPGPDEKFKDI-CYAGAGTNVSNLSQFFPSVDFVFGDGQS 317
Query: 316 VKLSPSN-LFRN--ISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
V LSP N LFR+ IS F G+ + G I+ N L+ Y+ +A + F ++C
Sbjct: 318 VTLSPENYLFRHTKISGAYCLGVFENGDPTTLLGGIIVRNMLVTYNRGKASIGFLKTKCN 377
Query: 373 N 373
+
Sbjct: 378 D 378
>gi|222629462|gb|EEE61594.1| hypothetical protein OsJ_16002 [Oryza sativa Japonica Group]
Length = 468
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 107/380 (28%), Positives = 158/380 (41%), Gaps = 58/380 (15%)
Query: 35 YLMHLSIGTPPVDI---FGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSIS 91
YL+ L IGTP I + DTGSD +WTQCEPC F PP DP KS T+ +S
Sbjct: 101 YLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPP-HDPSKSRTFRRLS 159
Query: 92 CSSSQC----AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNST---SGLPVE 144
C C AVV C + YG G SG L ++ F + G +E
Sbjct: 160 CFDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAV---SGELVSDVFHFGAAGDGGGYQLE 216
Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP-------- 196
+V FGC H S TGI+ LG G S ++Q+G +FSYC+P
Sbjct: 217 R-DVAFGCAHVE-DSKAVRGYSTGILALGIGKPSFVTQLGVD---RFSYCIPASEITDDD 271
Query: 197 -------DQGSSKINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVG-----NQRLEFVSS 244
++ +S + FG G + Y + L+++ NQ+
Sbjct: 272 DDDDDDEERSASFLRFGSHARMTGKRAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPVY 331
Query: 245 STGN-------IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY- 296
G + VD+G LP L+ + I + P + CY
Sbjct: 332 VAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDL-THP---SLYCYL 387
Query: 297 -NISSQPKFPEVTIHF-RGADVKLSPSNLF---RNISDEIMCSAFRGGNANIVYGRIMQI 351
N++ VT+ F GAD++L ++LF N++++ +C A GN I+ G Q
Sbjct: 388 GNMTDVEAV-SVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAGNRAIL-GVYPQR 445
Query: 352 NFLIGYDIEQAMVSFKPSRC 371
N +GYD+ ++F +C
Sbjct: 446 NINVGYDLSTMEIAFDRDQC 465
>gi|357515189|ref|XP_003627883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521905|gb|AET02359.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 107/397 (26%), Positives = 175/397 (44%), Gaps = 51/397 (12%)
Query: 1 AQNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTW 60
A++ ++ +++ KS + I +II Y++ GTPP + ++DT SD W
Sbjct: 64 AKDQARMQYFSSLVARKSVVPIASARQIIQ-SPTYIVKAKFGTPPQTLLLALDTSSDAAW 122
Query: 61 TQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTS-NCSEGDCSYSFLYGRGA 119
C C + C +P F P KS+++ ++SC S C V + C C+++F YG +
Sbjct: 123 IPCSGC--VGCSTSKP--FAPIKSTSFRNVSCGSPHCKQVPNPTCGGSACAFNFTYGSSS 178
Query: 120 YASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSL 179
A+ ++ +TLT + +P FGC +K S + Q G++GLG G SL
Sbjct: 179 IAA----SVVQDTLTLATD-----PIPGYTFGCVNKTTG---SSAPQQGLLGLGRGPLSL 226
Query: 180 ISQMGTSIAGKFSYCLPDQGSSKINFGG------IVAGAGVVSTPLIIRDH----YYLSL 229
+SQ FSYCLP S INF G + + TPL+ YY++L
Sbjct: 227 LSQSQNLYKSTFSYCLPSFKS--INFSGSLRLGPVYQPKRIKYTPLLRNPRRSSLYYVNL 284
Query: 230 EAISVGNQ-------RLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQ-PV 281
AI VG + L F ++ D+G + T L ++ +++ + + PV
Sbjct: 285 VAIKVGRKIVDIPPAALAFNPTTGAGTIFDSGTVFTRLAEPVYTAVRNEFRRRVGPKLPV 344
Query: 282 KGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNL-FRNISDEIMCSAFRGGN 340
+G GF CYN+ P +T F G +V L P N+ + + C A G
Sbjct: 345 TTLG---GFDT--CYNVPI--VVPTITFLFSGMNVTLPPDNIVIHSTAGSTTCLAMAGAP 397
Query: 341 ANI-----VYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
N+ V + Q N + +D+ + + CT
Sbjct: 398 DNVNSVLNVIANMQQQNHRVLFDVPNSRIGIARELCT 434
>gi|115460260|ref|NP_001053730.1| Os04g0595000 [Oryza sativa Japonica Group]
gi|113565301|dbj|BAF15644.1| Os04g0595000, partial [Oryza sativa Japonica Group]
Length = 471
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 107/380 (28%), Positives = 158/380 (41%), Gaps = 58/380 (15%)
Query: 35 YLMHLSIGTPPVDI---FGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSIS 91
YL+ L IGTP I + DTGSD +WTQCEPC F PP DP KS T+ +S
Sbjct: 104 YLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPP-HDPSKSRTFRRLS 162
Query: 92 CSSSQC----AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNST---SGLPVE 144
C C AVV C + YG G SG L ++ F + G +E
Sbjct: 163 CFDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAV---SGELVSDVFHFGAAGDGGGYQLE 219
Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP-------- 196
+V FGC H S TGI+ LG G S ++Q+G +FSYC+P
Sbjct: 220 R-DVAFGCAHVE-DSKAVRGYSTGILALGIGKPSFVTQLGVD---RFSYCIPASEITDDD 274
Query: 197 -------DQGSSKINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVG-----NQRLEFVSS 244
++ +S + FG G + Y + L+++ NQ+
Sbjct: 275 DDDDDDEERSASFLRFGSHARMTGKRAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPVY 334
Query: 245 STGN-------IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY- 296
G + VD+G LP L+ + I + P + CY
Sbjct: 335 VAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDL-THP---SLYCYL 390
Query: 297 -NISSQPKFPEVTIHF-RGADVKLSPSNLF---RNISDEIMCSAFRGGNANIVYGRIMQI 351
N++ VT+ F GAD++L ++LF N++++ +C A GN I+ G Q
Sbjct: 391 GNMTDVEAV-SVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAGNRAIL-GVYPQR 448
Query: 352 NFLIGYDIEQAMVSFKPSRC 371
N +GYD+ ++F +C
Sbjct: 449 NINVGYDLSTMEIAFDRDQC 468
>gi|32489096|emb|CAE03928.1| OSJNba0093F12.2 [Oryza sativa Japonica Group]
gi|58532027|emb|CAD41565.3| OSJNBa0006A01.20 [Oryza sativa Japonica Group]
Length = 489
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 107/380 (28%), Positives = 158/380 (41%), Gaps = 58/380 (15%)
Query: 35 YLMHLSIGTPPVDI---FGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSIS 91
YL+ L IGTP I + DTGSD +WTQCEPC F PP DP KS T+ +S
Sbjct: 122 YLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPP-HDPSKSRTFRRLS 180
Query: 92 CSSSQC----AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNST---SGLPVE 144
C C AVV C + YG G SG L ++ F + G +E
Sbjct: 181 CFDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAV---SGELVSDVFHFGAAGDGGGYQLE 237
Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP-------- 196
+V FGC H S TGI+ LG G S ++Q+G +FSYC+P
Sbjct: 238 R-DVAFGCAHVE-DSKAVRGYSTGILALGIGKPSFVTQLGVD---RFSYCIPASEITDDD 292
Query: 197 -------DQGSSKINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVG-----NQRLEFVSS 244
++ +S + FG G + Y + L+++ NQ+
Sbjct: 293 DDDDDDEERSASFLRFGSHARMTGKRAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPVY 352
Query: 245 STGN-------IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY- 296
G + VD+G LP L+ + I + P + CY
Sbjct: 353 VAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDL-THP---SLYCYL 408
Query: 297 -NISSQPKFPEVTIHF-RGADVKLSPSNLF---RNISDEIMCSAFRGGNANIVYGRIMQI 351
N++ VT+ F GAD++L ++LF N++++ +C A GN I+ G Q
Sbjct: 409 GNMTDVEAV-SVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAGNRAIL-GVYPQR 466
Query: 352 NFLIGYDIEQAMVSFKPSRC 371
N +GYD+ ++F +C
Sbjct: 467 NINVGYDLSTMEIAFDRDQC 486
>gi|357117301|ref|XP_003560410.1| PREDICTED: uncharacterized protein LOC100833752 [Brachypodium
distachyon]
Length = 473
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 84/355 (23%), Positives = 151/355 (42%), Gaps = 41/355 (11%)
Query: 51 SVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSS-QCAVVTSNCSEGDC 109
+D + +W QC PC C Q P+FDP KS T+ +S ++ C +G C
Sbjct: 119 EMDMAAGFSWMQCAPC--HPCLPQLNPVFDPAKSPTFRPVSGHNAVLCRPPYHPLQDGRC 176
Query: 110 SYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGI 169
+ Y GA A +G LA +T +F + +P ++FGC ++ +A + G+
Sbjct: 177 GFGIAYRNGASA---AGYLARDTFSFPTGDNNFQHLPGIVFGCANR-IARFDTHGALAGV 232
Query: 170 IGLGPGN-----SSLISQMGTSIAGKFSYCLPDQGSSKINF------------GGIVAGA 212
+G+G G + + Q+ + G+FSYC G++ +F G+ +
Sbjct: 233 LGMGMGAEGKPLTGFMRQLYHNGGGRFSYCPIVPGTTAYSFLRFGNDIPSQPPAGVHRQS 292
Query: 213 GVVSTPLIIRDHYYLSLEAISVGNQRLEFVS--------SSTGNIFVDTGVLRTLLPLEY 264
V P + YY+ L ISVG R+ V+ G +D G T +
Sbjct: 293 MAVLAPTTTSEAYYVKLAGISVGALRVPGVTPEMFERDQHGRGGCAIDIGTKMTAIVQTA 352
Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGAD-VKLSPSNL 323
++++++ + ++ + V PG + + + + P +T+HF G +++ P +L
Sbjct: 353 YAHVEAAVRGHLQRNRARFV-QSPGHHLCVHRTPAIEERLPSMTLHFVGGPWLRVKPQHL 411
Query: 324 FRNISD-----EIMCSAFRGGNANIVYGRIMQINFLIGYDIEQ--AMVSFKPSRC 371
F + E +C V G + QI+ +D+ +VSF P C
Sbjct: 412 FLVVGSPTGGGEYLCLGLVPDAEMTVIGAMQQIDTRFIFDLHNNIPIVSFNPEDC 466
>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 507
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 104/368 (28%), Positives = 166/368 (45%), Gaps = 37/368 (10%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +G+PP + +DTGSD W C CP + FD S T S+
Sbjct: 99 LYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSFTAGSV 158
Query: 91 SCSSSQCAVV----TSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
+CS C+ V + CSE + C YSF YG G S +SG T+T F++ G +
Sbjct: 159 TCSDPICSSVFQTTAAQCSENNQCGYSFRYGDG---SGTSGYYMTDTFYFDAILGESLVA 215
Query: 146 PN---VIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQG 199
+ ++FGC T SD GI G G G S++SQ+ + FS+CL G
Sbjct: 216 NSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDG 275
Query: 200 SSKINFG-GIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNIFVD 252
S F G + G+V +PL+ + HY L+L +I V Q L F +S+T VD
Sbjct: 276 SGGGVFVLGEILVPGMVYSPLLPSQPHYNLNLLSIGVNGQILPIDAAVFEASNTRGTIVD 335
Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIH 310
TG T L E + + +SN + +Q V + + + CY +S+ FP V+++
Sbjct: 336 TGTTLTYLVKEAYDPFLNAISNSV-SQLVTLIIS----NGEQCYLVSTSISDMFPPVSLN 390
Query: 311 FR-GADVKLSPSNLFRNI----SDEIMCSAF-RGGNANIVYGRIMQINFLIGYDIEQAMV 364
F GA + L P + + + C F + + G ++ + + YD+ + +
Sbjct: 391 FAGGASMMLRPQDYLFHYGFYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRI 450
Query: 365 SFKPSRCT 372
+ C+
Sbjct: 451 GWANYDCS 458
>gi|302783200|ref|XP_002973373.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
gi|300159126|gb|EFJ25747.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
Length = 389
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 105/374 (28%), Positives = 166/374 (44%), Gaps = 55/374 (14%)
Query: 37 MHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQ 96
M LS+GTPP + ++ S +W C ++C LF P S+++ + C S
Sbjct: 1 MDLSLGTPPQPLNFTLAVDSGFSWVACSSSCAINC--TTASLFQPGLSTSHTKLPCGSPS 58
Query: 97 C----AVVTSNCSEGDCSYSFLYGRGAYASFSS-GNLATETLTFNSTSGLPVEMPNVIFG 151
C AV TS CSY+ YG +FSS G+L ++ T +S V N+ G
Sbjct: 59 CSAFSAVSTSCGPSSSCSYNTSYG----TNFSSAGDLVSDIATMDSVRNRKVA-ANLSLG 113
Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGT-SIAGKFSYCLP-DQGSSKINFGGIV 209
CG ++ +G +G GN S + Q+ KF YCLP D K+ G
Sbjct: 114 CG-RDSGGLLELLDTSGFVGFDKGNVSFMGQLSALGYRSKFIYCLPSDTFRGKLVIGNYK 172
Query: 210 AGAGVVS-----TPLIIR----DHYYLSLEAISVGNQRLE-----FVSSSTGNIFVDTGV 255
+S TP+I + Y+++L IS+ + + F+S+ TG +DT
Sbjct: 173 LRNASISSSMAYTPMITNPQAAELYFINLSTISIDKNKFQVPIQGFLSNGTGGTVIDTTT 232
Query: 256 LRTLLPLEYHSNL----KSVMSNMIKAQP--VKGVGAEPGFSDVLCYNISSQPKFP---E 306
+ L ++++ L K+ +N+++ +G E LCYNIS+ FP
Sbjct: 233 FLSYLTSDFYTQLVQAIKNYTTNLVEVSSSVADALGVE------LCYNISANSDFPPPAT 286
Query: 307 VTIHFR-GADVKLSPSNLFRNISDEI---MCSAF-----RGGNANIVYGRIMQINFLIGY 357
+T HF GA V++S L + SD + +C A G N N++ G Q++ + Y
Sbjct: 287 LTYHFLGGAGVEVSTWFLLDD-SDSVNNTICMAIGRSESVGPNLNVI-GTYQQLDLTVEY 344
Query: 358 DIEQAMVSFKPSRC 371
D+EQ F C
Sbjct: 345 DLEQMRYGFGAQGC 358
>gi|297811181|ref|XP_002873474.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
lyrata]
gi|297319311|gb|EFH49733.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
lyrata]
Length = 293
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 67/160 (41%), Positives = 86/160 (53%), Gaps = 12/160 (7%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + IGTP DI DTGSD TWTQCEPC C+ Q+ P F+P SS+Y+++SCSS
Sbjct: 134 YIVTIGIGTPKHDISLMFDTGSDLTWTQCEPCLG-SCYSQKEPKFNPSSSSSYHNVSCSS 192
Query: 95 SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
C S CS +C Y YG G S + G LA E T ++ L ++ FGCG
Sbjct: 193 PMCGNPES-CSASNCLYGIGYGDG---SVTVGFLAKEKFTLTNSDVL----DDIYFGCGE 244
Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYC 194
N + GI+GLGPG S Q T+ FSYC
Sbjct: 245 NNKGVFIGSA---GILGLGPGKFSFPLQTTTTYNNIFSYC 281
>gi|224115494|ref|XP_002332148.1| predicted protein [Populus trichocarpa]
gi|222875198|gb|EEF12329.1| predicted protein [Populus trichocarpa]
Length = 483
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 121/418 (28%), Positives = 172/418 (41%), Gaps = 99/418 (23%)
Query: 31 VDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCE----PCPELDCFKQEPPL--FDPKKS 84
V D YL+ LSIGTPP I +DTGSD TW C C E D ++ + F P S
Sbjct: 76 VRDGYLISLSIGTPPQVIQVYMDTGSDLTWAPCGNISFDCIECDNYRNNRMMASFSPSHS 135
Query: 85 STYNSISCSSSQCAVVTSN------CSEGDCS---------------YSFLYGRGAYASF 123
S+ + SC+S C V S+ C+ CS +++ YG G
Sbjct: 136 SSSHRDSCTSPFCIDVHSSDNPLDPCTMAGCSLSTLVKATCSWPCPPFAYTYGAGGVV-- 193
Query: 124 SSGNLATETLTFNSTS-GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQ 182
+G L +TL + + G+ E+P FGC + P GI G G G SL SQ
Sbjct: 194 -TGTLTRDTLRVHGRNLGVTQEIPRFCFGCVASSYREP------IGIAGFGRGALSLPSQ 246
Query: 183 MGTSIAGKFSYCLPDQGSSKINFGGIVAGAGVVSTPLIIRD------------------- 223
+G G FS+C A +S+PLII D
Sbjct: 247 LGFLRKG-FSHCF---------LAFKYANNPNISSPLIIGDIALTSKDDMQFTPMLKSPM 296
Query: 224 ---HYYLSLEAISVGN--------QRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVM 272
+YY+ LEAI+VGN EF S G + VD+G T LP ++S + SV+
Sbjct: 297 YPNYYYVGLEAITVGNVSATEVPSSLREFDSLGNGGMLVDSGTTYTHLPEPFYSQVLSVL 356
Query: 273 SNMIKAQPVKGVGAEPGFSDVLCYNISSQPK-------FPEVTIHF-RGADVKLSPSNLF 324
++I + GF LCY + Q P +T HF A + LS + F
Sbjct: 357 QSIINYPRATDMEMRTGFD--LCYKVPCQNNSILTGDLLPSITFHFLNNASLVLSRGSHF 414
Query: 325 RNISDE-----IMCSAFRG------GNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+S + C F+ G A ++ G Q + + YD+E+ + F+P C
Sbjct: 415 YAMSAPSNSTVVKCLLFQSMDDGDYGPAGVL-GSFQQQDVEVVYDMEKERIGFRPMDC 471
>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
Length = 445
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 106/360 (29%), Positives = 168/360 (46%), Gaps = 49/360 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+ +S GTP V +DTGSD TW QC+PC C Q+ PLFDP SSTY+++ C+S
Sbjct: 112 YVATVSFGTPAVPQVVVIDTGSDLTWLQCKPCSSGQCSPQKDPLFDPSHSSTYSAVPCAS 171
Query: 95 SQCAVVT-----SNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
+C + S CS G C ++ Y G + + G + LT G V+ +
Sbjct: 172 GECKKLAADAYGSGCSNGQPCGFAISYVDG---TSTVGVYGKDKLTL--APGAIVK--DF 224
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFG 206
FGCGH + P G+ L + SL +Q G FSYCLP S + FG
Sbjct: 225 YFGCGHSKSSLPGLFDGLLGLGRL---SESLGAQYGGGGG--FSYCLPAVNSKPGFLAFG 279
Query: 207 GIVAGAGVVSTPL--IIRDHYY--LSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLLP 261
+G V TP+ + + ++L I+VG ++L+ S+ +G + VD+G + T+L
Sbjct: 280 AGRNPSGFVFTPMGRVPGQPTFSTVTLAGITVGGKKLDLRPSAFSGGMIVDSGTVVTVLQ 339
Query: 262 LEYHSNLKSVMSNMIKA-QPVKGVGAEPGFSDV-LCYNISSQPK--FPEVTIHFR-GADV 316
+ L++ +KA + V G D+ CY+++ P++ + F GA +
Sbjct: 340 STVYRALRAAFREAMKAYRLVHG--------DLDTCYDLTGYKNVVVPKIALTFSGGATI 391
Query: 317 KLS-PSNLFRNISDEIMCSAF----RGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
L P+ + N C AF + G A ++ G + Q F + +D + F+ C
Sbjct: 392 NLDVPNGILVN-----GCLAFAETGKDGTAGVL-GNVNQRTFEVLFDTSASKFGFRAKAC 445
>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
Length = 428
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 99/364 (27%), Positives = 166/364 (45%), Gaps = 47/364 (12%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCS 93
+Y++ + +GTP +DTGS +W C E D P F +S+T +SC
Sbjct: 81 LYVISVGLGTPAKTQIVEIDTGSSTSWVFC----ECDGCHTNPRTFLQSRSTTCAKVSCG 136
Query: 94 SSQCAVVTSN--CSEG----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
+S C + S+ C + DC + Y G S S G L +TLTF+ ++P
Sbjct: 137 TSMCLLGGSDPHCQDSENYPDCPFRVSYQDG---SASYGILYQDTLTFSDVQ----KIPG 189
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK----- 202
FGC + + G++G+G G S++ Q + FSYCLP Q S +
Sbjct: 190 FSFGCNMDSFGA-NEFGNVDGLLGMGAGPMSVLKQSSPTFDC-FSYCLPLQKSERGFFSK 247
Query: 203 ----INFGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEF---VSSSTGNIFV 251
+ G + V T ++ R + +++ L AISV +RL V S G +F
Sbjct: 248 TTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRKGVVF- 306
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTI 309
D+G + +P SV+S I+ +K AE S+ CY++ S + P +++
Sbjct: 307 DSGSELSYIP----DRALSVLSQRIRELLLKRGAAEEE-SERNCYDMRSVDEGDMPAISL 361
Query: 310 HF-RGADVKLSPSNLF--RNISDE-IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVS 365
HF GA L +F R++ ++ + C AF + + G +MQ + + YD+++ ++
Sbjct: 362 HFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIGSLMQTSKEVVYDLKRQLIG 421
Query: 366 FKPS 369
PS
Sbjct: 422 IGPS 425
>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
Length = 626
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 110/401 (27%), Positives = 173/401 (43%), Gaps = 53/401 (13%)
Query: 4 SQKLPFYNDNETPKSPISIIYQAEIISVDDI-----YLMHLSIGTPPVDIFGSVDTGSDC 58
+ ++PF S + A + DD+ Y L IGTPP + VDTGS
Sbjct: 41 AHRMPFDGHYSRRHLQNSELPNARMRLFDDLLSNGYYTTRLFIGTPPQEFALIVDTGSTV 100
Query: 59 TWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSE--GDCSYSFLYG 116
T+ C C + C K + P F P SSTY + C+ S NC + C+Y Y
Sbjct: 101 TYVPCSSCEQ--CGKHQDPRFQPDLSSTYRPVKCNPS------CNCDDEGKQCTYERRY- 151
Query: 117 RGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGN 176
A S SSG +A + ++F + S L + +FGC + S + GI+GLG G
Sbjct: 152 --AEMSSSSGVIAEDVVSFGNESELKPQ--RAVFGCENVETGDLYS-QRADGIMGLGRGR 206
Query: 177 SSLISQMGTS--IAGKFSYCLPDQGSSKINFGGIVAGAGVVSTPLIIRDH--------YY 226
S++ Q+ I FS C G + G +V G + P ++ H Y
Sbjct: 207 LSVVDQLVDKGVIGDSFSLCY---GGMDVGGGAMVLGQ-ISPPPNMVFSHSNPYRSPYYN 262
Query: 227 LSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTLLP-LEYHSNLKSVMSNMIKAQPVK 282
+ L+ + V + L+ V +D+G P +H+ ++M + + +
Sbjct: 263 IELKELHVAGKPLKLKPKVFDEKHGTVLDSGTTYAYFPEAAFHALKDAIMKEIRHLKQIP 322
Query: 283 GVGAEPGFSDVLCYN-----ISSQPK-FPEVTIHF-RGADVKLSPSN-LFRN--ISDEIM 332
G +P + D+ C++ +S K FPEV + F G + LSP N LFR+ +S
Sbjct: 323 --GPDPNYHDI-CFSGAGREVSHLSKVFPEVNMVFGSGQKLSLSPENYLFRHTKVSGAYC 379
Query: 333 CSAFRGGN-ANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
F+ GN + G I+ N L+ YD E + F + C+
Sbjct: 380 LGIFQNGNDLTTLLGGIVVRNTLVTYDRENDKIGFWKTNCS 420
>gi|118486912|gb|ABK95290.1| unknown [Populus trichocarpa]
Length = 438
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 98/381 (25%), Positives = 163/381 (42%), Gaps = 42/381 (11%)
Query: 12 DNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDC 71
D +T PI+ Q I+ Y++ + +GTP +F +DT +D W C C
Sbjct: 78 DQKTTAVPIAPGQQVLKIAN---YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGFSS 134
Query: 72 FKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATE 131
F P S+T S+ CS +QC+ V S + L+ + S+ + T
Sbjct: 135 TT-----FLPNASTTLGSLDCSGAQCSQVRGFSCPATGSSACLFNQ----SYGGDSSLTA 185
Query: 132 TLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKF 191
TL ++ + +P FGC N S S Q G++GLG G SLISQ G +G F
Sbjct: 186 TLVQDAITLANDVIPGFTFGC--INAVSGGSIPPQ-GLLGLGRGPISLISQAGAMYSGVF 242
Query: 192 SYCLPDQG----SSKINFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVG-------N 236
SYCLP S + G + + +TPL+ H YY++L +SVG +
Sbjct: 243 SYCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPS 302
Query: 237 QRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY 296
++L F ++ +D+G + T + ++ + P+ +GA F C+
Sbjct: 303 EQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNG-PISSLGA---FDT--CF 356
Query: 297 NISSQPKFPEVTIHFRGADVKLSPSN-LFRNISDEIMCSAFRGGNANI-----VYGRIMQ 350
+++ + P +T+HF G ++ L N L + S + C + N+ V + Q
Sbjct: 357 AATNEAEAPAITLHFEGLNLVLPMENSLIHSSSGSLACLSMAAAPNNVNSVLNVIANLQQ 416
Query: 351 INFLIGYDIEQAMVSFKPSRC 371
N I +D + + C
Sbjct: 417 QNLRIMFDTTNSRLGIARELC 437
>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 94/374 (25%), Positives = 152/374 (40%), Gaps = 45/374 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCE----PCPELDCFKQEPPLFDPKKSSTYNSI 90
Y + +GTP DTGSD TW +C P+ P +F P S ++ I
Sbjct: 110 YFVQFRVGTPAQPFVLVADTGSDLTWVKCRGRRASSPDASPLA-SPRVFRPANSKSWAPI 168
Query: 91 SCSSSQCAVVT----SNCSEGD-----CSYSFLYGRGAYASFSSGNLATETLTFNSTSGL 141
CSS C +NCS G C Y + Y + A G A S S
Sbjct: 169 PCSSDTCKSYVPFSLANCSAGTTPPAPCGYDYRYKDKSSARGVVGTDAATIALSGSGSDR 228
Query: 142 PVEMPNVIFGC--GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL---- 195
++ V+ GC + + +SD G++ LG N S S+ G+FSYCL
Sbjct: 229 KAKLQEVVLGCTTSYDGQSFQSSD----GVLSLGNSNISFASRAAARFGGRFSYCLVDHL 284
Query: 196 -PDQGSSKINFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVS-----SS 245
P +S + FG + A TPL++ Y ++++A+SV + L +
Sbjct: 285 APRNATSYLTFGPVGAAHSPSRTPLLLDAQVAPFYAVTVDAVSVAGKALNIPAEVWDVKK 344
Query: 246 TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ---P 302
G +D+G T+L + + + +S + P V +P F CYN ++ P
Sbjct: 345 NGGAILDSGTSLTILATPAYKAVVAALSKQLARVP--RVTMDP-FE--YCYNWTATRRPP 399
Query: 303 KFPEVTIHFRGADVKLSPSNLFR-NISDEIMCSAFRGG--NANIVYGRIMQINFLIGYDI 359
P + + F G+ P+ + + + + C + G V G I+Q L +D+
Sbjct: 400 AVPRLEVRFAGSARLRPPTKSYVIDAAPGVKCIGLQEGVWPGVSVIGNILQQEHLWEFDL 459
Query: 360 EQAMVSFKPSRCTN 373
+ F+ SRC +
Sbjct: 460 ANRWLRFQESRCAH 473
>gi|367066697|gb|AEX12632.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066699|gb|AEX12633.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066701|gb|AEX12634.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066703|gb|AEX12635.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066705|gb|AEX12636.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066707|gb|AEX12637.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066709|gb|AEX12638.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066711|gb|AEX12639.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066713|gb|AEX12640.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066715|gb|AEX12641.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066717|gb|AEX12642.1| hypothetical protein 2_5918_01 [Pinus taeda]
Length = 137
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 60/136 (44%), Positives = 80/136 (58%), Gaps = 11/136 (8%)
Query: 25 QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
QA + + + +LM L+IG P + +DTGSD TWTQC PC DC+KQ P++DP S
Sbjct: 11 QAPVSAGNGEFLMQLAIGKPSLAYSAILDTGSDLTWTQCMPCS--DCYKQPTPIYDPSLS 68
Query: 85 STYNSISCSSSQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
STY ++SC SS C A+ S C C Y LY G Y+S + G L+ ET T +S S
Sbjct: 69 STYGTVSCKSSLCLALPASACISATCEY--LYTYGDYSS-TQGILSYETFTLSSQS---- 121
Query: 144 EMPNVIFGCGHKNLAS 159
+P++ FGCG N S
Sbjct: 122 -IPHIAFGCGQDNEGS 136
>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 99/394 (25%), Positives = 161/394 (40%), Gaps = 65/394 (16%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPP---------LFDPKKSS 85
Y + +GTP DTGSD TW +C P F P+KS
Sbjct: 95 YFVRFRVGTPAQPFLLVADTGSDLTWVKCRPAKAAAASTNSSSSASASSPRRAFRPEKSK 154
Query: 86 TYNSISCSSSQC------AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTF---- 135
T+ I C+S C ++ T C+Y + Y G+ A G + TE+ T
Sbjct: 155 TWAPIPCASDTCSKSLPFSLSTCPTPGSPCAYDYRYKDGSAA---RGTVGTESATIALSS 211
Query: 136 ----NSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKF 191
+ ++ ++ GC + P+ ++ G++ LG N S S + G+F
Sbjct: 212 SSSSSKNKVKKAKLQGLVLGC-TGSYTGPSFEASD-GVLSLGYSNVSFASHAASRFGGRF 269
Query: 192 SYCL-----PDQGSSKINFG---------GIVAGAGVVSTPLII----RDHYYLSLEAIS 233
SYCL P +S + FG AG G TPL++ R Y +S++AIS
Sbjct: 270 SYCLVDHLSPRNATSYLTFGPNSALSGPCPAAAGPGARQTPLVLDSRMRPFYDVSIKAIS 329
Query: 234 VGNQRLE-----FVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEP 288
V + L+ + G + VD+G T+L + + + + + P V +P
Sbjct: 330 VDGELLKIPRDVWEVDGGGGVIVDSGTSLTVLAKPAYRAVVAALGKKLARFPR--VAMDP 387
Query: 289 GFSDVLCYNISSQPK------FPEVTIHFRGADVKLSPSNLFR-NISDEIMCSAFRGGNA 341
F CYN +S + P++ +HF G+ PS + + + + C + G
Sbjct: 388 -FE--YCYNWTSPSRKDEGDDLPKLAVHFAGSARLEPPSKSYVIDAAPGVKCIGVQEGPW 444
Query: 342 N--IVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
V G I+Q L +D++ + FK SRCT+
Sbjct: 445 PGISVIGNILQQEHLWEFDLKNRRLRFKRSRCTH 478
>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 634
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 110/385 (28%), Positives = 174/385 (45%), Gaps = 46/385 (11%)
Query: 13 NETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCF 72
+E+ + P + + + + ++ Y L IGTPP VDTGS T+ C C + C
Sbjct: 62 SESKRHPNARMRLHDDLLLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQ--CG 119
Query: 73 KQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEG--DCSYSFLYGRGAYASFSSGNLAT 130
+ + P F P+ SSTY + C+ + NC C Y Y A S SSG L
Sbjct: 120 RHQDPKFQPESSSTYQPVKCT------IDCNCDSDRMQCVYERQY---AEMSTSSGVLGE 170
Query: 131 ETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQT-GIIGLGPGNSSLISQM--GTSI 187
+ ++F + S L + +FGC +N+ + S+ GI+GLG G+ S++ Q+ I
Sbjct: 171 DLISFGNQSELAPQ--RAVFGC--ENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVI 226
Query: 188 AGKFSYCL--PDQGSSKINFGGIVAGAGVV---STPLIIRDHYY-LSLEAISVGNQRLEF 241
+ FS C D G + GGI + + S P +R YY + L+ I V +RL
Sbjct: 227 SDSFSLCYGGMDVGGGAMVLGGISPPSDMAFAYSDP--VRSPYYNIDLKEIHVAGKRLPL 284
Query: 242 ---VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGV-GAEPGFSDVLCY- 296
V +D+G LP K + + + Q +K + G +P ++D+ C+
Sbjct: 285 NANVFDGKHGTVLDSGTTYAYLPEAAFLAFKDAI--VKELQSLKKISGPDPNYNDI-CFS 341
Query: 297 ----NISSQPK-FPEVTIHFR-GADVKLSPSN-LFRN--ISDEIMCSAFRGGN-ANIVYG 346
++S K FP V + F G LSP N +FR+ + F+ GN + G
Sbjct: 342 GAGIDVSQLSKSFPVVDMVFENGQKYTLSPENYMFRHSKVRGAYCLGVFQNGNDQTTLLG 401
Query: 347 RIMQINFLIGYDIEQAMVSFKPSRC 371
I+ N L+ YD EQ + F + C
Sbjct: 402 GIIVRNTLVVYDREQTKIGFWKTNC 426
>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 101/370 (27%), Positives = 161/370 (43%), Gaps = 41/370 (11%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +G+PP + +DTGSD W C CP + FD SST +
Sbjct: 65 LYFTKVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGQV 124
Query: 91 SCSSSQC--AVVT--SNCSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV- 143
CS C AV T + CS CSY+F YG G S +SG ++TL F++ G +
Sbjct: 125 RCSDPICTSAVQTTATQCSSQTDQCSYTFQYGDG---SGTSGYYVSDTLYFDAILGQSLI 181
Query: 144 --EMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQ 198
++FGC T +D GI G G G S+ISQ+ T FS+CL
Sbjct: 182 DNSSALIVFGCSAYQSGDLTKTDKAVDGIFGFGQGELSVISQLSTRGITPRVFSHCLKGD 241
Query: 199 GS-SKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRL-----EFVSSSTGNIFV 251
GS I G + G+V +PL+ + HY L+L +I+V Q L F +S++ V
Sbjct: 242 GSGGGILVLGEILEPGIVYSPLVPSQPHYNLNLLSIAVNGQLLPIDPAAFATSNSQGTIV 301
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMI--KAQPVKGVGAEPGFSDVLCYNISS--QPKFPEV 307
D+G L E + S ++ ++ P+ G + CY +S+ FP
Sbjct: 302 DSGTTLAYLVAEAYDPFVSAVNAIVSPSVTPITSKGNQ-------CYLVSTSVSQMFPLA 354
Query: 308 TIHFR-GADVKLSPSNLF----RNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQA 362
+ +F GA + L P + + + C F+ + G ++ + + YD+ +
Sbjct: 355 SFNFAGGASMVLKPEDYLIPFGSSGGSAMWCIGFQKVQGVTILGDLVLKDKIFVYDLVRQ 414
Query: 363 MVSFKPSRCT 372
+ + C+
Sbjct: 415 RIGWANYDCS 424
>gi|357456413|ref|XP_003598487.1| Peptidase A1, putative [Medicago truncatula]
gi|355487535|gb|AES68738.1| Peptidase A1, putative [Medicago truncatula]
Length = 414
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 106/395 (26%), Positives = 168/395 (42%), Gaps = 48/395 (12%)
Query: 1 AQNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTW 60
A+++ +L F + KS + I +II Y++ IGTPP + ++DT +D W
Sbjct: 45 AKDTTRLQFLDSLVARKSVVPIASGRQIIQ-SPTYIVRAKIGTPPQTLLLAMDTSNDAAW 103
Query: 61 TQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTS-NCSEGDCSYSFLYGRGA 119
C C C LF P+KS+T+ ++SC++ +C V + C C+++ YG +
Sbjct: 104 IPCTACD--GCAST---LFAPEKSTTFKNVSCAAPECKQVPNPGCGVSSCNFNLTYGSSS 158
Query: 120 YASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSL 179
A+ NL +T+T + +P+ FGC K TS Q + G SL
Sbjct: 159 IAA----NLVQDTITLATD-----PVPSYTFGCVSKTTG--TSAPPQGLLGLGR-GPLSL 206
Query: 180 ISQMGTSIAGKFSYCLPD----QGSSKINFGGIVAGAGVVSTPLIIRDH----YYLSLEA 231
+SQ FSYCLP S + G + + TPL+ YY++LEA
Sbjct: 207 LSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVAQPKRIKYTPLLKNPRRSSLYYVNLEA 266
Query: 232 ISVGNQRLEF--------VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKG 283
I VG + ++ ++ G IF D+G + T L + ++ + P
Sbjct: 267 IRVGRKVVDIPPAALAFNPTTGAGTIF-DSGTVFTRLVAPVYVAVRDEFRR--RVGPKLT 323
Query: 284 VGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSN-LFRNISDEIMCSAFRGGNAN 342
V + GF CYN+ P +T F G +V L N L + + C A G N
Sbjct: 324 VTSLGGFDT--CYNVPI--VVPTITFIFTGMNVTLPQDNILIHSTAGSTTCLAMAGAPDN 379
Query: 343 I-----VYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
+ V + Q N + YD+ + V CT
Sbjct: 380 VNSVLNVIANMQQQNHRVLYDVPNSRVGVARELCT 414
>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 506
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 99/374 (26%), Positives = 160/374 (42%), Gaps = 46/374 (12%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +G P + F +DTGSD W C P CP + F+P SST + I
Sbjct: 88 LYFTRVKLGNPAKEYFVQIDTGSDILWVACSPCTGCPTSSGLNIQLEFFNPDSSSTSSRI 147
Query: 91 SCSSSQCAVVTSN----CSEGD-----CSYSFLYGRGAYASFSSGNLATETLTFNSTSG- 140
CS +C C D C Y+F YG G S +SG ++T+ F++ G
Sbjct: 148 PCSDDRCTAALQTGEAVCQSSDSPSSPCGYTFTYGDG---SGTSGFYVSDTMYFDTVMGN 204
Query: 141 --LPVEMPNVIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQ---MGTSIAGKFSYC 194
+V+FGC + +D GI G G S++SQ +G S FS+C
Sbjct: 205 EQTANSSASVVFGCSNSQSGDLMKTDRAVDGIFGFGQHQLSVVSQLYSLGVS-PKTFSHC 263
Query: 195 LP--DQGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSST 246
L D G + G IV G+V TPL+ + HY L+LE+I+V Q+L F +S+T
Sbjct: 264 LKGSDNGGGILVLGEIVE-PGLVFTPLVPSQPHYNLNLESIAVSGQKLPIDSSLFATSNT 322
Query: 247 GNIFVDTG-VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPK 303
VD+G L L+ Y + ++ A V + C+ +S
Sbjct: 323 QGTIVDSGTTLVYLVDGAYDPFINAI------AAAVSPSVRSVVSKGIQCFVTTSSVDSS 376
Query: 304 FPEVTIHFRGA-DVKLSPSNLFRNI----SDEIMCSAFRGGNANIVYGRIMQINFLIGYD 358
FP T++F+G + + P N ++ + C ++ + G ++ + + YD
Sbjct: 377 FPTATLYFKGGVSMTVKPENYLLQQGSVDNNVLWCIGWQRSQGITILGDLVLKDKIFVYD 436
Query: 359 IEQAMVSFKPSRCT 372
+ + + C+
Sbjct: 437 LANMRMGWADYDCS 450
>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 640
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 110/363 (30%), Positives = 160/363 (44%), Gaps = 44/363 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y L IGTPP VDTGS T+ C C C + P F P+ S TY + C +
Sbjct: 93 YTTRLWIGTPPQRFALIVDTGSTVTYVPCSTCKH--CGSHQDPKFRPEASETYQPVKC-T 149
Query: 95 SQCAVVTSNCSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
QC NC + C+Y Y A S SSG L + ++F + S L + IFGC
Sbjct: 150 WQC-----NCDDDRKQCTYERRY---AEMSTSSGVLGEDVVSFGNQSELSPQ--RAIFGC 199
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQM--GTSIAGKFSYCL--PDQGSSKINFGGI 208
+ + + GI+GLG G+ S++ Q+ I+ FS C G + GGI
Sbjct: 200 ENDETGD-IYNQRADGIMGLGRGDLSIMDQLVEKKVISDAFSLCYGGMGVGGGAMVLGGI 258
Query: 209 VAGAGVV---STPLIIRDHYY-LSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTLLP 261
A +V S P +R YY + L+ I V +RL V +D+G LP
Sbjct: 259 SPPADMVFTHSDP--VRSPYYNIDLKEIHVAGKRLHLNPKVFDGKHGTVLDSGTTYAYLP 316
Query: 262 LEYHSNLKSVMSNMIKAQPVKGV-GAEPGFSDVLCY-----NISSQPK-FPEVTIHF-RG 313
K + M + +K + G +P ++D+ C+ N+S K FP V + F G
Sbjct: 317 ESAFLAFKHAI--MKETHSLKRISGPDPHYNDI-CFSGAEINVSQLSKSFPVVEMVFGNG 373
Query: 314 ADVKLSPSN-LFRN--ISDEIMCSAFRGGN-ANIVYGRIMQINFLIGYDIEQAMVSFKPS 369
+ LSP N LFR+ + F GN + G I+ N L+ YD E + + F +
Sbjct: 374 HKLSLSPENYLFRHSKVRGAYCLGVFSNGNDPTTLLGGIVVRNTLVMYDREHSKIGFWKT 433
Query: 370 RCT 372
C+
Sbjct: 434 NCS 436
>gi|14532550|gb|AAK64003.1| AT3g61820/F15G16_210 [Arabidopsis thaliana]
Length = 362
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 70/199 (35%), Positives = 92/199 (46%), Gaps = 26/199 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y M L +GTP +++ +DTGSD W QC PC C+ Q +FDPKKS T+ ++ C S
Sbjct: 135 YFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKA--CYNQTDAIFDPKKSKTFATVPCGS 192
Query: 95 SQCAVV--TSNC---SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
C + +S C C Y YG G SF+ G+ +TETLTF+ + +V
Sbjct: 193 RLCRRLDDSSECVTRRSKTCLYQVSYGDG---SFTEGDFSTETLTFHG-----ARVDHVP 244
Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--------GSS 201
GCGH N + G+ S SQ GKFSYCL D+ S
Sbjct: 245 LGCGHDNEGLFVGAAGLLGLGRG---GLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPS 301
Query: 202 KINFGGIVAGAGVVSTPLI 220
I FG V TPL+
Sbjct: 302 TIVFGNAAVPKTSVFTPLL 320
>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 99/381 (25%), Positives = 164/381 (43%), Gaps = 42/381 (11%)
Query: 12 DNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDC 71
D +T PI+ Q I+ Y++ + +GTP +F +DT +D W C C C
Sbjct: 78 DQKTTAVPIAPGQQVLKIAN---YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGC--TGC 132
Query: 72 FKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATE 131
F P S+T S+ CS +QC+ V S + L+ + S+ + T
Sbjct: 133 SSTT---FLPNASTTLGSLDCSGAQCSQVRGFSCPATGSSACLFNQ----SYGGDSSLTA 185
Query: 132 TLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKF 191
TL ++ + +P FGC N S S Q G++GLG G SLISQ G +G F
Sbjct: 186 TLVQDAITLANDVIPGFTFGC--INAVSGGSIPPQ-GLLGLGRGPISLISQAGAMYSGVF 242
Query: 192 SYCLPDQG----SSKINFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVG-------N 236
SYCLP S + G + + +TPL+ H YY++L +SVG +
Sbjct: 243 SYCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPS 302
Query: 237 QRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY 296
++L F ++ +D+G + T + ++ + P+ +GA F C+
Sbjct: 303 EQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNG-PISSLGA---FDT--CF 356
Query: 297 NISSQPKFPEVTIHFRGADVKLSPSN-LFRNISDEIMCSAFRGGNANI-----VYGRIMQ 350
+++ + P +T+HF G ++ L N L + S + C + N+ V + Q
Sbjct: 357 AATNEAEAPAITLHFEGLNLVLPMENSLIHSSSGSLACLSMAAAPNNVNSVLNVIANLQQ 416
Query: 351 INFLIGYDIEQAMVSFKPSRC 371
N I +D + + C
Sbjct: 417 QNLRIMFDTTNSRLGIARELC 437
>gi|125528511|gb|EAY76625.1| hypothetical protein OsI_04577 [Oryza sativa Indica Group]
Length = 492
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 103/367 (28%), Positives = 162/367 (44%), Gaps = 62/367 (16%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCS 93
+Y+ IGTPP + G++D SD WT C F+P +S+T + C+
Sbjct: 99 MYVFSYGIGTPPQQVSGALDISSDLVWTACGATAP----------FNPVRSTTVADVPCT 148
Query: 94 SSQC-AVVTSNCSEG------DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
C C G +C+Y+++YG G A+ ++G L TE TF T +
Sbjct: 149 DDACQQFAPQTCGAGAGAGSSECAYTYMYGGG--AANTTGLLGTEAFTFGDT-----RID 201
Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-PDQG---SSK 202
V+FGCG +N+ S +G+IGLG GN SL+SQ+ +FSY PD S
Sbjct: 202 GVVFGCGLQNVG---DFSGVSGVIGLGRGNLSLVSQLQVD---RFSYHFAPDDSVDTQSF 255
Query: 203 INFG--GIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSS--------TGN 248
I FG + +ST L+ D YY+ L I V + L S + +G
Sbjct: 256 ILFGDDATPQTSHTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGG 315
Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKG--VGAEPGFSDVLCYNISS--QPKF 304
+F+ L T+L + L+ +++ I V G +G + LCY S + K
Sbjct: 316 VFLSITDLVTVLEEAAYKPLRQAVASKIGLPAVNGSALGLD------LCYTGESLAKAKV 369
Query: 305 PEVTIHFRGADV-KLSPSNLF-RNISDEIMCSAFRGGNA--NIVYGRIMQINFLIGYDIE 360
P + + F G V +L N F + + + C +A V G ++Q+ + YDI
Sbjct: 370 PSMALVFAGGAVMELELGNYFYMDSTTGLACLTILPSSAGDGSVLGSLIQVGTHMMYDIN 429
Query: 361 QAMVSFK 367
+ + F+
Sbjct: 430 GSKLVFE 436
>gi|6850312|gb|AAF29389.1|AC009999_9 Contains similarity to nucellin from Hordeum vulgare gb|U87148.
ESTs gb|T22068, gb|F14251, gb|F14237, gb|F14242 come
from this gene [Arabidopsis thaliana]
Length = 388
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 86/302 (28%), Positives = 134/302 (44%), Gaps = 35/302 (11%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + IGTP + VDTGSD W QC+ CP E L++ +S + +
Sbjct: 79 LYYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLV 138
Query: 91 SCSSSQCAVVT----SNCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSG-LPVE 144
SC C ++ S C C Y +YG G S ++G + + ++S +G L +
Sbjct: 139 SCDDDFCYQISGGPLSGCKANMSCPYLEIYGDG---SSTAGYFVKDVVQYDSVAGDLKTQ 195
Query: 145 MPN--VIFGCGHKNLASPTSDSKQT--GIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQ 198
N VIFGCG + S +++ GI+G G NSS+ISQ+ +S + F++CL +
Sbjct: 196 TANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGR 255
Query: 199 GSSKINFGGIVAGAGVVSTPLII-RDHYYLSLEAISVGNQRLE-----FVSSSTGNIFVD 252
I G V V TPL+ + HY +++ A+ VG + L F +D
Sbjct: 256 NGGGIFAIGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLFQPGDRKGAIID 315
Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ--PKFPEVTIH 310
+G LP + + ++K +P V D C+ S + FP VT H
Sbjct: 316 SGTTLAYLP-------EIIYEPLVKKEPALKVHIVD--KDYKCFQYSGRVDEGFPNVTFH 366
Query: 311 FR 312
F
Sbjct: 367 FE 368
>gi|218189440|gb|EEC71867.1| hypothetical protein OsI_04576 [Oryza sativa Indica Group]
Length = 508
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 104/366 (28%), Positives = 163/366 (44%), Gaps = 54/366 (14%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP-----PLFDPKKSSTYN 88
+Y++ S+GTPP + G +D SD W QC C C P P F SST
Sbjct: 96 MYVLSFSVGTPPQVVTGVLDITSDFVWMQCSACAT--CGADAPAATSAPPFYAFLSSTIR 153
Query: 89 SISCSSSQCA-VVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
+ C++ C +V CS D C YS++YG GA A+ ++G LA + F + V
Sbjct: 154 EVRCANRGCQRLVPQTCSADDSPCGYSYVYGGGA-ANTTAGLLAVDAFAFAT-----VRA 207
Query: 146 PNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-PDQGSSKIN 204
VIFGC + ++ G+IGLG G SL+SQ+ G+FSY L PD +
Sbjct: 208 DGVIFGC------AVATEGDIGGVIGLGRGELSLVSQLQI---GRFSYYLAPDDAVDVGS 258
Query: 205 FGGIVAGAG-----VVSTPLII----RDHYYLSLEAISVGNQRL-------EFVSSSTGN 248
F + A VSTPL+ R YY+ L I V + L + + +G
Sbjct: 259 FILFLDDAKPRTSRAVSTPLVANRASRSLYYVELAGIRVDGEDLAIPRGTFDLQADGSGG 318
Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPE 306
+ + + T L + ++ M++ I + G+E G LCY S K P
Sbjct: 319 VVLSITIPVTFLDAGAYKVVRQAMASKIGLRAAD--GSELGLD--LCYTSESLATAKVPS 374
Query: 307 VTIHFRGADV-KLSPSNLF-RNISDEIMCSAFR---GGNANIVYGRIMQINFLIGYDIEQ 361
+ + F G V +L N F + + + C G+ +++ G ++Q+ + YDI
Sbjct: 375 MALVFAGGAVMELEMGNYFYMDSTTGLECLTILPSPAGDGSLL-GSLIQVGTHMIYDISG 433
Query: 362 AMVSFK 367
+ + F+
Sbjct: 434 SRLVFE 439
>gi|367066719|gb|AEX12643.1| hypothetical protein 2_5918_01 [Pinus radiata]
Length = 137
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 60/136 (44%), Positives = 80/136 (58%), Gaps = 11/136 (8%)
Query: 25 QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
QA + + + +LM L+IG P + +DTGSD TWTQC PC DC+KQ P++DP S
Sbjct: 11 QAPVSAGNGEFLMQLAIGKPSLAYSAILDTGSDLTWTQCIPCS--DCYKQPTPIYDPSLS 68
Query: 85 STYNSISCSSSQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
STY ++SC SS C A+ S C C Y LY G Y+S + G L+ ET T +S S
Sbjct: 69 STYGTVSCKSSLCLALPASACISATCEY--LYTYGDYSS-TQGILSYETFTLSSQS---- 121
Query: 144 EMPNVIFGCGHKNLAS 159
+P++ FGCG N S
Sbjct: 122 -IPHIAFGCGQDNEGS 136
>gi|125606590|gb|EAZ45626.1| hypothetical protein OsJ_30294 [Oryza sativa Japonica Group]
Length = 431
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 97/367 (26%), Positives = 169/367 (46%), Gaps = 59/367 (16%)
Query: 37 MHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQ 96
+ L IGTP +++ DT SD WTQC+PC L C Q ++DP K+ TY +++ S
Sbjct: 90 VFLGIGTPAMNVTLVFDTTSDLLWTQCQPC--LSCVAQAGDMYDPNKTETYANLTSS--- 144
Query: 97 CAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKN 156
SY++ Y + SF+SG ATET + V + N+ FGCG +N
Sbjct: 145 -------------SYNYTYSK---QSFTSGYFATETFALGN-----VTVANITFGCGTRN 183
Query: 157 LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL---PDQGSSKINFGG------ 207
+ + G+ G G SL++Q+G +FSYC GSS + GG
Sbjct: 184 QGYYDNVAGVFGVGRGGRGGVSLLNQLGID---RFSYCFSSSGAPGSSAVFLGGSPELAT 240
Query: 208 -----IVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSSSTGN-----IFVDTGVLR 257
A +V+ P +++ Y++ L ++VG ++ +S+ + +D+
Sbjct: 241 NATTTPAASTPMVADP-VLKSGYFVKLVGVTVGATLVDVAGASSAEGGGRALVIDSTSPV 299
Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS---SQPKFPEV--TIHFR 312
T+L + ++ + + A G LC+ ++ + P P V T+HF
Sbjct: 300 TVLDEATYGPVRRALVAQLAPLKEANANASAGVGLDLCFELAAGGATPTPPNVTMTLHFD 359
Query: 313 G--ADVKLSPSN-LFRNISDEIMCSAFRGGNAN--IVYGRIMQINFLIGYDIEQAMVSFK 367
G AD+ L P++ L ++ + ++C ++N V G ++ L+ YD+ + +VSF+
Sbjct: 360 GGAADLVLPPASYLAKDSAGGLICLTMTPSSSNGVPVLGSWALLDTLVLYDLAKNVVSFQ 419
Query: 368 PSRCTNY 374
P C +
Sbjct: 420 PLDCAAF 426
>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 511
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 106/411 (25%), Positives = 165/411 (40%), Gaps = 104/411 (25%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPL---FDPKKSSTYN 88
Y + L+ GTPP ++ DTGS W C C +P F PK SS+
Sbjct: 132 YSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRCSFPYVDPATISKFVPKLSSSVK 191
Query: 89 SISCSSSQCAVV---------------TSNCSEGDCSYSFLYGRGAYASFSSGNLATETL 133
+ C + +CA + + CS+ Y YG GA A G L +ETL
Sbjct: 192 VVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYGLQYGSGATA----GILLSETL 247
Query: 134 TFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSY 193
+ +P+ + GC ++ P GI G G G SL SQM +FS+
Sbjct: 248 DLENK-----RVPDFLVGCSVMSVHQPA------GIAGFGRGPESLPSQMRLK---RFSH 293
Query: 194 CLPDQGSSKINFGGIVAGAGVVSTPLII------------------------------RD 223
CL +G VS+PL++ R+
Sbjct: 294 CLVSRGFDD----------SPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFRE 343
Query: 224 HYYLSLEAISVGNQRLEF-----VSSSTGN--IFVDTGVLRTLL--PLEYHSNLKSVMSN 274
+YYLSL I +G + ++F V STGN +D+G T L P+ + + +
Sbjct: 344 YYYLSLRRILIGGKPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPI-FEAIADELEKQ 402
Query: 275 MIKAQPVKGVGAEPGFSDVLCYNISSQ---PKFPEVTIHFRGA-DVKLSPSNLFRNISDE 330
++K K V A+ G C+NI + +FP+V + F+G + L+ N ++DE
Sbjct: 403 LVKYPRAKDVEAQSGLRP--CFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDE 460
Query: 331 -IMCSAFRGGNAN--------IVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
++C A I+ G Q N L+ YD+ + + F+ +CT
Sbjct: 461 GVVCLTMMTDEAVVGGGGGPAIILGAFQQQNVLVEYDLAKQRIGFRKQKCT 511
>gi|359476195|ref|XP_002268758.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
gi|296082174|emb|CBI21179.3| unnamed protein product [Vitis vinifera]
Length = 460
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 104/363 (28%), Positives = 157/363 (43%), Gaps = 52/363 (14%)
Query: 27 EIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSST 86
+ ++ D ++L+++ GTP +DTGSD TW QC C +C ++ F+P SS+
Sbjct: 121 DTLNEDGLFLVNVGFGTPQQKFNLIIDTGSDTTWIQCNSCSLGNCHNKK--TFNPSLSSS 178
Query: 87 YNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
Y++ SC S D +Y+ Y +Y S G + +T P P
Sbjct: 179 YSNRSCIPST-----------DTNYTMKYEDNSY---SKGVFVCDEVTLK-----PDVFP 219
Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNS-SLISQMGTSIAGKFSYCLPDQGSSKINF 205
FGCG + S G++GL G SLISQ + KFSYC P + +
Sbjct: 220 KFQFGCGDSGGGEFGTAS---GVLGLAKGEQYSLISQTASKFKKKFSYCFPPK---EHTL 273
Query: 206 GGIVAGAGVVSTPLIIRDH----------YYLSLEAISVGNQRLEFVSS---STGNIFVD 252
G ++ G +S ++ Y++ L ISV +RL SS S G I +D
Sbjct: 274 GSLLFGEKAISASPSLKFTQLLNPPSGLGYFVELIGISVAKKRLNVSSSLFASPGTI-ID 332
Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS----SQPKFPEVT 308
+G + T LP + L++ + P + D CYN+ K PE+
Sbjct: 333 SGTVITRLPTAAYEALRTAFQQEMLHCPSISPPPQEKLLDT-CYNLKGCGGRNIKLPEIV 391
Query: 309 IHFRG-ADVKLSPSN-LFRNISDEIMCSAF-RGGNAN--IVYGRIMQINFLIGYDIEQAM 363
+HF G DV L PS L+ N C AF R N + + G Q++ + YDIE
Sbjct: 392 LHFVGEVDVSLHPSGILWANGDLTQACLAFARKSNPSHVTIIGNRQQVSLKVVYDIEGGR 451
Query: 364 VSF 366
+ F
Sbjct: 452 LGF 454
>gi|326495450|dbj|BAJ85821.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 491
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 99/350 (28%), Positives = 152/350 (43%), Gaps = 49/350 (14%)
Query: 51 SVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNCSE- 106
++DT D W QC PC C+ Q FDP++SST + C S C + + CS+
Sbjct: 162 AIDTTEDVPWIQCLPCLIPQCYPQRNAFFDPRRSSTGAPVRCGSRACRTLGGYANGCSKP 221
Query: 107 ---GDCSYSFLYGRGAYASFSSGNLATETLTFN-STSGLPVEMPNVIFGCGHKNLASPTS 162
GDC Y Y + + G T+TLT + ST+ L N FGC H
Sbjct: 222 NSTGDCLYRIEY---SDHRLTLGTYMTDTLTISPSTTFL-----NFRFGCSHAVRGK--F 271
Query: 163 DSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK-INFGGIV-----AGAGVVS 216
++ +G + LG G SL+SQ + FSYC+P ++ ++ GG V G+G +
Sbjct: 272 SAQASGTMSLGGGPQSLLSQTARAYGNAFSYCVPGPSAAGFLSIGGPVNGDDGGGSGAFA 331
Query: 217 TPLIIRDH-------YYLSLEAISVGNQRLEF--VSSSTGNIFVDTGVLRTLLPLEYHSN 267
T ++R Y + L+ I V +RL V S G + + V+ L P Y +
Sbjct: 332 TTPLVRSANVINPTIYVVRLQGIEVAGRRLNVPPVVFSGGTVMDSSAVITQLPPTAYRA- 390
Query: 268 LKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI--SSQPKFPEVTIHFR-GADVKLSPSNLF 324
L+ N ++A + A G D C++ S+ P V++ F GA ++L L
Sbjct: 391 LRLAFRNAMRAYKTR---APTGNLDT-CFDFVGVSKVTVPTVSLVFDGGAVIEL---GLL 443
Query: 325 RNISDEIMCSAFRGGNANIVY---GRIMQINFLIGYDIEQAMVSFKPSRC 371
+ D C AF A+ G + Q + YD+ V F+ C
Sbjct: 444 SVLLDS--CLAFAPMAADFALGFIGNVQQQTHEVLYDVAGGAVGFRHGAC 491
>gi|297843774|ref|XP_002889768.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
gi|297335610|gb|EFH66027.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 106/364 (29%), Positives = 167/364 (45%), Gaps = 47/364 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ +GTPP +F +DT +D W C C F+ SSTY+++SCS+
Sbjct: 105 YVVRARLGTPPQLMFMVLDTSNDAVWLPCSGCSGC---SNASTSFNTNSSSTYSTVSCST 161
Query: 95 SQCAVV------TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
+QC +S CS++ YG +SFS+ NL +TLT + P +PN
Sbjct: 162 TQCTQARGLTCPSSTPQPSICSFNQSYG--GDSSFSA-NLVQDTLTLS-----PDVIPNF 213
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD----QGSSKIN 204
FGC N AS S Q G++GLG G SL+SQ + +G FSYCLP S +
Sbjct: 214 SFGC--INSASGNSLPPQ-GLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSGSLK 270
Query: 205 FGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQR-------LEFVSSSTGNIFVDT 253
G + + TPL+ YY++L +SVG+ + L F S+S +D+
Sbjct: 271 LGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDSNSGAGTIIDS 330
Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRG 313
G + T + + + + + Q V G + G D C++ ++ P++T+H
Sbjct: 331 GTVIT----RFAQPVYEAIRDEFRKQ-VNGSFSTLGAFDT-CFSADNENVTPKITLHMTS 384
Query: 314 ADVKLSPSN-LFRNISDEIMCSAFRG--GNANIVY---GRIMQINFLIGYDIEQAMVSFK 367
D+KL N L + + + C + G NAN V + Q N I +D+ + +
Sbjct: 385 LDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGIA 444
Query: 368 PSRC 371
P C
Sbjct: 445 PEPC 448
>gi|168022164|ref|XP_001763610.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685103|gb|EDQ71500.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 308
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 79/249 (31%), Positives = 117/249 (46%), Gaps = 32/249 (12%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP-PL--FDPKKSSTYNSI 90
+Y +S+GTPP + VDTGS+ W +C PC + P P+ FDP+KS+T SI
Sbjct: 40 LYYTRISLGTPPQQFYVDVDTGSNVAWVKCAPCTGCEHSGDVPVPMSTFDPRKSTTKISI 99
Query: 91 SCSSSQCAVVTS--NCS--EGDCSYSFLYGRGAYASFSSGNLATETLTFNST----SGLP 142
SC+ ++C V+ CS C YS LYG G S ++G + TFN S
Sbjct: 100 SCTDAECGVLNKKLQCSPERLSCPYSLLYGDG---SSTAGYYLNDVFTFNQVPSDNSTAK 156
Query: 143 VEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQGS 200
++FGCG S + D G++G GP SL +Q+ F++CL S
Sbjct: 157 SGTARLVFGCGGTQTGSWSVD----GLLGFGPTTVSLPNQLAQQNISVNIFAHCLQGDVS 212
Query: 201 SKINFGGIVAGA----GVVSTPLII-RDHYYLSLEAISVGNQRL----EFVSSSTGNIFV 251
+ G +V G +V TP++ DHY + L I + + + F TG + +
Sbjct: 213 GR---GSLVIGTIREPDLVYTPMVFGEDHYNVQLLNIGISGRNVTTPASFDLEYTGGVII 269
Query: 252 DTGVLRTLL 260
D+G T L
Sbjct: 270 DSGTTLTYL 278
>gi|224140036|ref|XP_002323393.1| predicted protein [Populus trichocarpa]
gi|222868023|gb|EEF05154.1| predicted protein [Populus trichocarpa]
Length = 459
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 111/420 (26%), Positives = 168/420 (40%), Gaps = 85/420 (20%)
Query: 14 ETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTW------TQCEPCP 67
++PK+ S+I Y + L+ GTPP +DTGS W C C
Sbjct: 62 KSPKTNFSLIKTPLFPRSYGGYSISLNFGTPPQTTKFVMDTGSSLVWFPCTSRYLCSECN 121
Query: 68 ELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV-----TSNCSEGDCS----------YS 112
+ K P F PK SS+ I C + +C+++ S C E D + Y
Sbjct: 122 FPNIKKTGIPTFLPKLSSSSKLIGCKNPRCSMIFGPEIQSKCQECDSTAQNCTQTCPPYV 181
Query: 113 FLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGL 172
YG G+ A G L +ETL F + +P+ + GC ++ P GI G
Sbjct: 182 IQYGSGSTA----GLLLSETLDFPNKK----TIPDFLVGCSIFSIKQPE------GIAGF 227
Query: 173 GPGNSSLISQMGTSIAGKFSYCL--------PDQGSSKINFG---GIVAGAGVVSTPLI- 220
G SL SQ+G KFSYCL P ++ G G+ AG+ TP +
Sbjct: 228 GRSPESLPSQLGLK---KFSYCLVSHAFDDTPTSSDLVLDTGSGSGVTKTAGLSHTPFLK 284
Query: 221 -----IRDHYYLSLEAISVGNQRLE-----FVSSSTGN--IFVDTGVLRTLLPLEYH--- 265
RD+YY+ L I +G+ ++ V + GN VD+G T + +
Sbjct: 285 NPTTAFRDYYYVLLRNIVIGDTHVKVPYKFLVPGTDGNGGTIVDSGTTFTFMENPVYELV 344
Query: 266 -SNLKSVMSNMIKAQPVKGV-GAEPGFSDVLCYNISSQPKF--PEVTIHFRG-ADVKLSP 320
+ M++ A ++ + G P CYNIS + P++ F+G A + L
Sbjct: 345 AKEFEKQMAHYTVATEIQNLTGLRP------CYNISGEKSLSVPDLIFQFKGGAKMALPL 398
Query: 321 SNLFRNISDEIMCSAFRGGNAN---------IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
SN F + ++C N I+ G Q NF + +D+E FK C
Sbjct: 399 SNYFSIVDSGVICLTIVSDNVAGPGLGGGPAIILGNYQQRNFYVEFDLENEKFGFKQQSC 458
>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 109/407 (26%), Positives = 167/407 (41%), Gaps = 99/407 (24%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEP---PLFDPKKSSTYN 88
Y LS GTP + DTGS W C C E K +P P F PK SS+
Sbjct: 81 YSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSSSK 140
Query: 89 SISCSSSQCAVV---------------TSNCSEGDCSYSFLYGRGAYASFSSGNLATETL 133
+ C + +C+ + T NC++ +Y YG G+ A G L +ETL
Sbjct: 141 LVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTA----GLLLSETL 196
Query: 134 TFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSY 193
F ++PN + GC ++ P+ GI G G G+ SL SQMG KF+Y
Sbjct: 197 DFPDK-----KIPNFVVGCSFLSIHQPS------GIAGFGRGSESLPSQMGLK---KFAY 242
Query: 194 CL--------PDQGSSKINFGGIVAGAGVVSTPL---------IIRDHYYLSLEAISVGN 236
CL P G ++ G V +G+ TP +++YYL++ I VGN
Sbjct: 243 CLASRKFDDSPHSGQLILDSTG-VKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGN 301
Query: 237 QRLE----------------FVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQP 280
Q ++ + S + F+D VL + + ++N +A
Sbjct: 302 QAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVA-----REFEKQLANWTRATD 356
Query: 281 VKGV-GAEPGFSDVLCYNISSQP--KFPEVTIHFRG-ADVKLSPSNLFRNISDE-IMC-- 333
V+ + G P C++IS + KFPE+ F+G A L +N F +S + C
Sbjct: 357 VETLTGLRP------CFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLT 410
Query: 334 --------SAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
GG +++ G Q NF + YD+ + F+ C+
Sbjct: 411 VVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 457
>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 499
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 97/373 (26%), Positives = 167/373 (44%), Gaps = 45/373 (12%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +G+P D + +DTGSD W C CP E FD SST +
Sbjct: 82 LYFTKVKLGSPAKDFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAALV 141
Query: 91 SCSSSQCA----VVTSNCSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNST----SG 140
SC+ C+ TS CS CSY+F YG G S ++G ++T+ F++ S
Sbjct: 142 SCADPICSYAVQTATSGCSSQANQCSYTFQYGDG---SGTTGYYVSDTMYFDTVLLGQSM 198
Query: 141 LPVEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLP- 196
+ ++FGC T +D GI G GPG S+ISQ+ + FS+CL
Sbjct: 199 VANSSSTIVFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCLKG 258
Query: 197 -DQGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNI 249
+ G + G I+ + +V +PL+ HY L+L++I+V Q L F +++
Sbjct: 259 GENGGGVLVLGEILEPS-IVYSPLVPSLPHYNLNLQSIAVNGQLLPIDSNVFATTNNQGT 317
Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIK--AQPVKGVGAEPGFSDVLCYNISSQPK--FP 305
VD+G L E ++ ++ + ++P+ G + CY +S+ FP
Sbjct: 318 IVDSGTTLAYLVQEAYNPFVDAITAAVSQFSKPIISKGNQ-------CYLVSNSVGDIFP 370
Query: 306 EVTIHFR-GADVKLSPSNLFRNI----SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDI 359
+V+++F GA + L+P + + S + C F+ + G ++ + + YD+
Sbjct: 371 QVSLNFMGGASMVLNPEHYLMHYGFLDSAAMWCIGFQKVERGFTILGDLVLKDKIFVYDL 430
Query: 360 EQAMVSFKPSRCT 372
+ + C+
Sbjct: 431 ANQRIGWADYNCS 443
>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 498
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 102/374 (27%), Positives = 162/374 (43%), Gaps = 46/374 (12%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +GTPP + +DTGSD W C CP+ E FD SST +
Sbjct: 83 LYTTKVKMGTPPREFTVQIDTGSDILWINCNTCSNCPKSSGLGIELNFFDTVGSSTAALV 142
Query: 91 SCSSSQCAV----VTSNCSE--GDCSYSFLY--GRGAYASFSSGNLATETLTFNSTSGLP 142
CS CA + CS CSY+F Y G G + S + + + ST
Sbjct: 143 PCSDPMCASAIQGAAAQCSPQVNQCSYTFQYEDGSGTSGVYVSDAMYFDMILGQSTPANV 202
Query: 143 VEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS-IAGK-FSYCLPDQG 199
++FGC T +D GI+G GPG S++SQ+ + I K FS+CL G
Sbjct: 203 ASSATIVFGCSTYQSGDLTKTDKAVDGILGFGPGELSVVSQLSSRGITPKVFSHCLKGDG 262
Query: 200 SSKINFGGI-----VAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGN 248
N GGI + +V +PL+ + HY L+L++I+V Q L F +S
Sbjct: 263 ----NGGGILVLGEILEPSIVYSPLVPSQPHYNLNLQSIAVNGQVLSINPAVFATSDKRG 318
Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIK--AQPVKGVGAEPGFSDVLCYNI--SSQPKF 304
+D+G + L E + L + + + A G++ CY + S F
Sbjct: 319 TIIDSGTTLSYLVQEAYDPLVNAVDTAVSQFATSFISKGSQ-------CYLVLTSIDDSF 371
Query: 305 PEVTIHFR-GADVKLSPSN--LFRNISD--EIMCSAFRGGNANI-VYGRIMQINFLIGYD 358
P V+ +F GA + L PS L R D ++ C F+ + + G ++ + ++ YD
Sbjct: 372 PTVSFNFEGGASMDLKPSQYLLNRGFQDGAKMWCIGFQKVQEGVTILGDLVLKDKIVVYD 431
Query: 359 IEQAMVSFKPSRCT 372
+ + + + C+
Sbjct: 432 LARQQIGWTNYDCS 445
>gi|326521034|dbj|BAJ92880.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 448
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 99/365 (27%), Positives = 150/365 (41%), Gaps = 53/365 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ +GTPP + +VDT +D W C C C P F+P S +Y ++ C S
Sbjct: 108 YVVRARLGTPPQQLLLAVDTSNDAAWIPCSGC--AGCPTTTP--FNPAASKSYRAVPCGS 163
Query: 95 SQCAVVTS-NCS--EGDCSYSFLYGRGAY-ASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
C+ + +CS C +S Y + A+ S +LA S + F
Sbjct: 164 PACSRAPNPSCSLNTKSCGFSLTYADSSLEAALSQDSLAVANDVVKSYT----------F 213
Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIV- 209
GC K + T G+ S +SQ G FSYCLP S +NF G +
Sbjct: 214 GCLQKATGTATPPQGLLGLGRG---PLSFLSQTKDMYEGTFSYCLPSFKS--LNFSGTLR 268
Query: 210 -----AGAGVVSTPLIIRDH----YYLSLEAISVGNQ-------RLEFVSSSTGNIFVDT 253
+ +TPL++ H YY+S+ I VG + L F ++ +D+
Sbjct: 269 LGRKGQPLRIKTTPLLVNPHRSSLYYVSMTGIRVGKKVVPIPPAALAFDPATGAGTVLDS 328
Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRG 313
G + T L + ++ + I+ P+ +G GF CYN + K+P VT F G
Sbjct: 329 GTMFTRLVAPAYVAVRDEVRRRIRGAPLSSLG---GFDT--CYNTTV--KWPPVTFMFTG 381
Query: 314 ADVKLSPSNL-----FRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFK 367
V L NL + S M +A G N + V + Q N I +D+ V F
Sbjct: 382 MQVTLPADNLVIHSTYGTTSCLAMAAAPDGVNTVLNVIASMQQQNHRILFDVPNGRVGFA 441
Query: 368 PSRCT 372
+CT
Sbjct: 442 REQCT 446
>gi|21717173|gb|AAM76366.1|AC074196_24 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433291|gb|AAP54829.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 413
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 106/375 (28%), Positives = 170/375 (45%), Gaps = 61/375 (16%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+ + +IGTPP VD + WTQC C CFKQ+ P+F P SST+ C +
Sbjct: 62 YVANFTIGTPPQPASAIVDVAGELVWTQCSACRR--CFKQDLPVFVPNASSTFKPEPCGT 119
Query: 95 SQC-AVVTSNCSEGDCSY----SFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
+ C ++ T +CS CSY + L G +SG AT+T + + +
Sbjct: 120 AVCESIPTRSCSGDVCSYKGPPTQLRGN------TSGFAATDTFAIGTAT------VRLA 167
Query: 150 FGCGHKNLASPTSDSKQ--TGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG---SSKIN 204
FGC + + D+ +G IGLG SL++QM + +FSYCL + SS++
Sbjct: 168 FGC----VVASDIDTMDGPSGFIGLGRTPWSLVAQMKLT---RFSYCLSPRNTGKSSRLF 220
Query: 205 FG--GIVAGAGVVSTPLIIR-------DHYY-LSLEAISVGNQRLEFVSSSTGNIFVDTG 254
G +AG ST I+ HYY LSL+AI GN + ++ +G I V
Sbjct: 221 LGSSAKLAGGESTSTAPFIKTSPDDDSHHYYLLSLDAIRAGNTTI--ATAQSGGILVMHT 278
Query: 255 V--LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS---SQPKFPEVTI 309
V L+ Y + K+V + A F LC+ + S+ P++
Sbjct: 279 VSPFSLLVDSAYRAFKKAVTEAVGGAAAPPMATPPQPFD--LCFKKAAGFSRATAPDLVF 336
Query: 310 HFRG-ADVKLSPSNLFRNISDE--IMCSAF-------RGGNANI-VYGRIMQINFLIGYD 358
F+G A + + P+ ++ +E C+A R G + V G + Q + YD
Sbjct: 337 TFQGAAALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYD 396
Query: 359 IEQAMVSFKPSRCTN 373
+++ +SF+P+ C++
Sbjct: 397 LKKETLSFEPADCSS 411
>gi|357119741|ref|XP_003561592.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 410
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 95/343 (27%), Positives = 165/343 (48%), Gaps = 41/343 (11%)
Query: 51 SVDTGSDCTWTQCEPC-PELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEGDC 109
++DTG+ +W CEPC P L Q LF P S T+ + C V + +G C
Sbjct: 86 ALDTGASTSWLMCEPCQPPL---PQVGHLFSPAASPTFQGVRGDGPVCTVPYRHTDKG-C 141
Query: 110 SYSFLYGRGAYASFSSGNLATETLTFNS-TSGLPVE-MPNVIFGCGHKNLASPTSDSKQT 167
S+ F F++G L+ +T S SG +E +P ++FGC H ++ +D +
Sbjct: 142 SFRF--------PFAAGYLSRDTFHLRSGRSGTVMESVPGIMFGCAH-SVTGFHNDGTLS 192
Query: 168 GIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS----SKINFGGIVAGA--GVVSTPLII 221
G++ L S ++ +G +G+FSYCLP + S + FG V +T L+
Sbjct: 193 GVLSLSHSPLSFLTLLGGRSSGRFSYCLPKPTTHNPDSFLRFGADVPSLPPHAHTTTLVH 252
Query: 222 RD--HYYLSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRT-LLPLEYHSNLKSVMSNM 275
Y+L++ IS+GN+RL V ++ G ++ V T ++ L Y + +++++M
Sbjct: 253 AGVPGYHLNIVGISLGNKRLHIDRHVFAAGGGCSINPAVTITRIMELAYLAVEHALVAHM 312
Query: 276 --IKAQPVKGVGAEPGFSDVLCY---NISSQPKFPEVTIHFR-GADVKLSPSNLFRNISD 329
+ + VKG+ PG S LC+ + S + + P ++ HF GA+++ + LF ++
Sbjct: 313 KELGSGRVKGM---PGRS--LCFDHMDRSVRVQLPGMSFHFEDGAELRFAAEQLF-DVRV 366
Query: 330 EIMCSAFRG-GNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
C G G+ V G Q++ +DI ++F P C
Sbjct: 367 MAACFLVVGRGHHQTVIGAAQQVDTRFTFDIAAGRLAFVPETC 409
>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
Length = 452
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 103/365 (28%), Positives = 156/365 (42%), Gaps = 51/365 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ +GTPP + +VDT +D W C C C P FDP S++Y S+ C S
Sbjct: 110 YVVRARLGTPPQQLLLAVDTSNDAAWIPCAGC--AGCPTSSAPPFDPAASTSYRSVPCGS 167
Query: 95 SQCAVV-TSNCSEGD--CSYSFLYGRGAY-ASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
CA + C G C +S Y + A+ S +LA +G V+ F
Sbjct: 168 PLCAQAPNAACPPGGKACGFSLTYADSSLQAALSQDSLA--------VAGDAVK--TYTF 217
Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIV- 209
GC K A+ T+ Q + G S +SQ G FSYCLP S +NF G +
Sbjct: 218 GCLQK--ATGTAAPPQGLLGLGR-GPLSFLSQTRDMYQGTFSYCLPSFKS--LNFSGTLR 272
Query: 210 -----AGAGVVSTPLIIRDH----YYLSLEAISVGNQ-------RLEFVSSSTGNIFVDT 253
+ +TPL+ H YY+++ I VG + L F ++ +D+
Sbjct: 273 LGRNGQPPRIKTTPLLANPHRSSLYYVNMTGIRVGRKVVPIPPPALAFDPATGAGTVLDS 332
Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRG 313
G + T L + ++ + + A PV +G GF C+N ++ +P VT+ F G
Sbjct: 333 GTMFTRLVAPAYVAVRDEVRRRVGA-PVSSLG---GFDT--CFNTTAV-AWPPVTLLFDG 385
Query: 314 ADVKLSPSNL-----FRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFK 367
V L N+ + IS M +A G N + V + Q N + +D+ V F
Sbjct: 386 MQVTLPEENVVIHSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFA 445
Query: 368 PSRCT 372
RCT
Sbjct: 446 RERCT 450
>gi|297737850|emb|CBI27051.3| unnamed protein product [Vitis vinifera]
Length = 256
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 50/123 (40%), Positives = 71/123 (57%), Gaps = 10/123 (8%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + IG+PP ++ VDTGSD W QC PC DC++Q P+F+P SS+Y ++C +
Sbjct: 53 YFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPCA--DCYQQADPIFEPSFSSSYAPLTCET 110
Query: 95 SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
QC ++ S C C Y YG G+Y + G+ ATET+T + ++ L NV GCG
Sbjct: 111 HQCKSLDVSECRNDSCLYEVSYGDGSY---TVGDFATETITLDGSASL----NNVAIGCG 163
Query: 154 HKN 156
H N
Sbjct: 164 HDN 166
>gi|225464832|ref|XP_002272243.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 467
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 109/398 (27%), Positives = 164/398 (41%), Gaps = 82/398 (20%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPP--LFDPKKSSTYNS 89
Y + LS GTPP + +DTGSD W C C P +F PK SS+
Sbjct: 90 YSIPLSFGTPPQTLPLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSSNIFIPKSSSSSKV 149
Query: 90 ISCSSSQCAVV---------------TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLT 134
+ C + +C + + NC++ Y YG G + G + +ETL
Sbjct: 150 LGCVNPKCGWIHGSKVQSRCRDCEPTSPNCTQICPPYLVFYGSG----ITGGIMLSETLD 205
Query: 135 FNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYC 194
+PN I GC S S S+ GI G G G SL SQ+G KFSYC
Sbjct: 206 LPGKG-----VPNFIVGC------SVLSTSQPAGISGFGRGPPSLPSQLGLK---KFSYC 251
Query: 195 L-------PDQGSSKINFGGIVAG---AGVVSTPLI----------IRDHYYLSLEAISV 234
L + SS + G +G AG+ TP + +YYL L I+V
Sbjct: 252 LLSRRYDDTTESSSLVLDGESDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITV 311
Query: 235 GNQRLEFV-------SSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAE 287
G + ++ + G +D+G T + E + + ++++ V
Sbjct: 312 GGKHVKIPYKYLIPGADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQSKRATEVEGI 371
Query: 288 PGFSDVLCYNIS--SQPKFPEVTIHFR-GADVKLSPSNLFRNI-SDEIMC---------- 333
G C+NIS + P FPE+T+ FR GA+++L +N + D+++C
Sbjct: 372 TGLRP--CFNISGLNTPSFPELTLKFRGGAEMELPLANYVAFLGGDDVVCLTIVTDGAAG 429
Query: 334 SAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
F GG A I+ G Q NF + YD+ + F+ C
Sbjct: 430 KEFSGGPA-IILGNFQQQNFYVEYDLRNERLGFRQQSC 466
>gi|357160409|ref|XP_003578755.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 373
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 101/371 (27%), Positives = 164/371 (44%), Gaps = 54/371 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP---PLFDPKKSSTYNSIS 91
+ M +S+GTP V ++DTGS +W QC+ C + C+ Q+ P F+ SSTY +
Sbjct: 23 FFMGISLGTPAVFNLVTIDTGSTISWVQCQYCI-VHCYTQDQRAGPTFNTSSSSTYRRVG 81
Query: 92 CSSSQC------AVVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
CS+ C + S C E + C YS Y G Y S+G L+ + LT ++
Sbjct: 82 CSAQVCHDMHVSQNIPSGCVEEEDSCIYSLRYASGEY---SAGYLSQDRLTLANS----Y 134
Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGT-SIAGKFSYCLPDQGSSK 202
+ IFGCG N + S GIIG G + S +Q+ + FSYC P S++
Sbjct: 135 SIQKFIFGCGSDNRYNGHS----AGIIGFGNKSYSFFNQIAQLTNYSAFSYCFP---SNQ 187
Query: 203 INFGGIVAGAGVVSTPLIIRDH----------YYLSLEAISVGNQRLEFV--SSSTGNIF 250
N G + G V + +I Y L + V RL+ +T
Sbjct: 188 ENEGFLSIGPYVRDSNKLILTQLFDYGAHLPVYALQQFDMMVNGMRLQVDPPVYTTRMTV 247
Query: 251 VDTGVLRT-LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ----PKFP 305
VD+G + T +L + + +++ M+ V+G ++ +C++ + K P
Sbjct: 248 VDSGTVETFVLSPVFRALDRALTKAMVAEGYVRGSDSKE-----ICFHSNGDSVDWSKLP 302
Query: 306 EVTIHFRGADVKLSPSNLF-RNISDEIMCSAFRGGNANI----VYGRIMQINFLIGYDIE 360
V I F + +KL N+F SD +CS F+ +A + + G +F + +DI+
Sbjct: 303 VVEIKFSRSILKLPAENVFYYETSDGSICSTFQPDDAGVPGVQILGNRATRSFRVVFDIQ 362
Query: 361 QAMVSFKPSRC 371
Q F+ C
Sbjct: 363 QRNFGFEAGAC 373
>gi|297794789|ref|XP_002865279.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
lyrata]
gi|297311114|gb|EFH41538.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
lyrata]
Length = 419
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 118/413 (28%), Positives = 173/413 (41%), Gaps = 85/413 (20%)
Query: 31 VDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPEL--DCFK---------QEPPLF 79
V D YL+ L+IGTPP + +DTGSD TW PC L DC + +F
Sbjct: 7 VRDGYLITLNIGTPPQAVQVYMDTGSDLTWV---PCGNLSFDCIDCNDLKSNNLKSSSIF 63
Query: 80 DPKKSSTYNSISCSSSQCAVVTSN------CSEGDCSYSFLYGRG--------AYASFSS 125
P SS+ SC+SS CA + S+ C+ CS S L AY ++
Sbjct: 64 SPLHSSSSFRASCASSFCAEIHSSDNPFDPCAIAGCSVSMLLKSTCIRPCPSFAY-TYGE 122
Query: 126 GNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGT 185
G L + LT + ++P FGC P GI G G G SL SQ+G
Sbjct: 123 GGLVSGILTRDILKARTRDVPRFSFGCVTSTYHEP------IGIAGFGRGLLSLPSQLGF 176
Query: 186 SIAGKFSYC-LPDQGSSKINFGG-IVAGAGVVS---------TPL----IIRDHYYLSLE 230
G FS+C LP + + N ++ GA +S TP+ + + YY+ LE
Sbjct: 177 LEKG-FSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNTPVYPNSYYIGLE 235
Query: 231 AISVGNQ---------RLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPV 281
+I++G +F S G + VD+G T LP ++S L +++ + I
Sbjct: 236 SITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPNPFYSQLLTILQSTITYPRA 295
Query: 282 KGVGAEPGFSDVLCY-------NISSQPK-----FPEVTIHF-RGADVKLSPSNLFRNIS 328
+ GF LCY N++S FP +T +F A + L N F +S
Sbjct: 296 TETESRTGFD--LCYKVPCPNNNLTSLENDVMMVFPSITFNFLNNATLLLPQGNSFYAMS 353
Query: 329 -----DEIMCSAFRG---GNANI--VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ C F+ GN V+G Q N + YD+E+ + F+ C
Sbjct: 354 APSDGSVVQCLLFQNMEDGNYGPAGVFGSFQQQNVKVVYDLEKERIGFQAMDC 406
>gi|242117573|dbj|BAH80056.1| hypothetical protein [Oryza sativa Indica Group]
Length = 469
Score = 97.8 bits (242), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 115/371 (30%), Positives = 176/371 (47%), Gaps = 55/371 (14%)
Query: 36 LMHLSIGTPPVD-IFGSVDTGSDCTWTQCEPCPELDCFKQEPP-LFDPKKSSTYNSISCS 93
++++++GTP + G VD S W QC PC P F P S+T++ + CS
Sbjct: 89 VINITVGTPVAQTVSGLVDITSYFVWAQCAPCAAAAGCLPPPATAFRPNGSATFSPLPCS 148
Query: 94 SSQCA-VVTSNCSEGDC-----------SYSFLYGRGAYASFSSGNLATETLTFNSTSGL 141
S C V+ C SYS YG A+ +SG LAT+T TF +T+
Sbjct: 149 SDMCLPVLRETCGRAGAAANATAGARCDSYSLTYG--GSAANTSGYLATDTFTFGATA-- 204
Query: 142 PVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----P 196
+P V+FGC AS + +G+IG+G GN SLISQ+ GKFSY L
Sbjct: 205 ---VPGVVFGCSD---ASYGDFAGASGVIGIGRGNLSLISQLQF---GKFSYQLLAPEAT 255
Query: 197 DQGS--SKINFG--GIVAGAGVVSTPL----IIRDHYYLSLEAISVGNQRLEFVSSSTGN 248
D GS S I FG + STPL + D YY++L + V RL+ + + T +
Sbjct: 256 DDGSADSVIRFGDDAVPKTKRGRSTPLLSSTLYPDFYYVNLTGVRVDGNRLDAIPAGTFD 315
Query: 249 IFVD-TG--VLRTLLPLEY-----HSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS 300
+ + TG +L + P+ Y + +++ +++ I V G A LCYN SS
Sbjct: 316 LRANGTGGVILSSTTPVTYLEQAAYDVVRAAVASRIGLPAVNGSAA---LELDLCYNASS 372
Query: 301 --QPKFPEVTIHFR-GADVKLSPSNLFRNISDE-IMCSAFRGGNANIVYGRIMQINFLIG 356
+ K P++T+ F GAD+ LS +N F +D + C V G ++Q +
Sbjct: 373 MAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGGSVLGTLLQTGTNMI 432
Query: 357 YDIEQAMVSFK 367
YD++ ++F+
Sbjct: 433 YDVDAGRLTFE 443
>gi|125536523|gb|EAY83011.1| hypothetical protein OsI_38231 [Oryza sativa Indica Group]
Length = 469
Score = 97.8 bits (242), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 115/371 (30%), Positives = 176/371 (47%), Gaps = 55/371 (14%)
Query: 36 LMHLSIGTPPVD-IFGSVDTGSDCTWTQCEPCPELDCFKQEPP-LFDPKKSSTYNSISCS 93
++++++GTP + G VD S W QC PC P F P S+T++ + CS
Sbjct: 89 VINITVGTPVAQTVSGLVDITSYFVWAQCAPCAAAAGCLPPPATAFRPNGSATFSPLPCS 148
Query: 94 SSQCA-VVTSNCSEGDC-----------SYSFLYGRGAYASFSSGNLATETLTFNSTSGL 141
S C V+ C SYS YG A+ +SG LAT+T TF +T+
Sbjct: 149 SDMCLPVLRETCGRAGAAANATAGARCDSYSLTYG--GSAANTSGYLATDTFTFGATA-- 204
Query: 142 PVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----P 196
+P V+FGC AS + +G+IG+G GN SLISQ+ GKFSY L
Sbjct: 205 ---VPGVVFGCSD---ASYGDFAGASGVIGIGRGNLSLISQLQF---GKFSYQLLAPEAT 255
Query: 197 DQGS--SKINFG--GIVAGAGVVSTPL----IIRDHYYLSLEAISVGNQRLEFVSSSTGN 248
D GS S I FG + STPL + D YY++L + V RL+ + + T +
Sbjct: 256 DDGSADSVIRFGDDAVPKTKRGQSTPLLSSTLYPDFYYVNLTGVRVDGNRLDAIPAGTFD 315
Query: 249 IFVD-TG--VLRTLLPLEY-----HSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS 300
+ + TG +L + P+ Y + +++ +++ I V G A LCYN SS
Sbjct: 316 LRANGTGGVILSSTTPVTYLEQAAYDVVRAAVASRIGLPAVNGSAA---LELDLCYNASS 372
Query: 301 --QPKFPEVTIHFR-GADVKLSPSNLFRNISDE-IMCSAFRGGNANIVYGRIMQINFLIG 356
+ K P++T+ F GAD+ LS +N F +D + C V G ++Q +
Sbjct: 373 MAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGGSVLGTLLQTGTNMI 432
Query: 357 YDIEQAMVSFK 367
YD++ ++F+
Sbjct: 433 YDVDAGRLTFE 443
>gi|56784779|dbj|BAD82000.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 486
Score = 97.8 bits (242), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 103/366 (28%), Positives = 163/366 (44%), Gaps = 54/366 (14%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP-----PLFDPKKSSTYN 88
+Y++ S+GTPP + G +D SD W QC C C P P F SST
Sbjct: 96 MYVLSFSVGTPPQVVTGVLDITSDFVWMQCSACAT--CGADAPAATSAPPFYAFLSSTIR 153
Query: 89 SISCSSSQCA-VVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
+ C++ C +V CS D C YS++YG GA A+ ++G LA + F + V
Sbjct: 154 EVRCANRGCQRLVPQTCSADDSPCGYSYVYGGGA-ANTTAGLLAVDAFAFAT-----VRA 207
Query: 146 PNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-PDQGSSKIN 204
VIFGC + ++ G+IGLG G S +SQ+ G+FSY L PD +
Sbjct: 208 DGVIFGC------AVATEGDIGGVIGLGRGELSPVSQLQI---GRFSYYLAPDDAVDVGS 258
Query: 205 FGGIVAGAG-----VVSTPLII----RDHYYLSLEAISVGNQRL-------EFVSSSTGN 248
F + A VSTPL+ R YY+ L I V + L + + +G
Sbjct: 259 FILFLDDAKPRTSRAVSTPLVASRASRSLYYVELAGIRVDGEDLAIPRGTFDLQADGSGG 318
Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPE 306
+ + + T L + ++ M++ I+ + G+E G LCY S K P
Sbjct: 319 VVLSITIPVTFLDAGAYKVVRQAMASKIELRAAD--GSELGLD--LCYTSESLATAKVPS 374
Query: 307 VTIHFRGADV-KLSPSNLF-RNISDEIMCSAFR---GGNANIVYGRIMQINFLIGYDIEQ 361
+ + F G V +L N F + + + C G+ +++ G ++Q+ + YDI
Sbjct: 375 MALVFAGGAVMELEMGNYFYMDSTTGLECLTILPSPAGDGSLL-GSLIQVGTHMIYDISG 433
Query: 362 AMVSFK 367
+ + F+
Sbjct: 434 SRLVFE 439
>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
Length = 451
Score = 97.8 bits (242), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 105/373 (28%), Positives = 160/373 (42%), Gaps = 47/373 (12%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
+Y L +G+PP D + +DTGSD W C CP FDP S T + I
Sbjct: 89 LYYTRLQLGSPPRDFYVQIDTGSDVLWVSCSSCNGCPVSSGLHIPLNFFDPGSSPTASLI 148
Query: 91 SCSSSQCAV----VTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
SCS +C++ S C+ + C Y+F YG G S +SG ++ L F++ G V
Sbjct: 149 SCSDQRCSLGLQSSDSVCAAQNNQCGYTFQYGDG---SGTSGYYVSDLLHFDTILGGSV- 204
Query: 145 MPN----VIFGCGHKNLASPTS-DSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLP- 196
M N ++FGC T D GI G G + S+ISQ+ + FS+CL
Sbjct: 205 MKNSSAPIVFGCSTLQTGDLTKPDRAVDGIFGFGQQDMSVISQLASQGITPRVFSHCLKG 264
Query: 197 -DQGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNI 249
D G + G IV +V TPL+ + HY L+L++I V Q L F +SS
Sbjct: 265 DDSGGGILVLGEIVE-PNIVYTPLVPSQPHYNLNLQSIYVNGQTLAIDPSVFATSSNQGT 323
Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMI--KAQPVKGVGAEPGFSDVLCYNISSQPK--FP 305
+D+G L + S +++ + P G + CY SS FP
Sbjct: 324 IIDSGTTLAYLTEAAYDPFISAITSTVSPSVSPYLSKGNQ-------CYLTSSSINDVFP 376
Query: 306 EVTIHFRGA-DVKLSPSNLFRNISD----EIMCSAFRG--GNANIVYGRIMQINFLIGYD 358
+V+++F G + L P + S + C F+ G + G ++ + + YD
Sbjct: 377 QVSLNFAGGTSMILIPQDYLIQQSSINGAALWCVGFQKIQGQEITILGDLVLKDKIFVYD 436
Query: 359 IEQAMVSFKPSRC 371
I + + C
Sbjct: 437 IAGQRIGWANYDC 449
>gi|218184944|gb|EEC67371.1| hypothetical protein OsI_34484 [Oryza sativa Indica Group]
Length = 396
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 102/373 (27%), Positives = 171/373 (45%), Gaps = 57/373 (15%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+ + +IGTPP VD + WTQC C CFKQ+ P+F P SST+ C +
Sbjct: 45 YVANFTIGTPPQPASAIVDVAGELVWTQCSACRR--CFKQDLPVFVPNASSTFKPEPCGT 102
Query: 95 SQC-AVVTSNCSEGDCSY----SFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
+ C ++ T +CS CSY + L G +SG AT+T + + +
Sbjct: 103 AVCESIPTRSCSGDVCSYKGPPTQLRGN------TSGFAATDTFAIGTAT------VRLA 150
Query: 150 FGCGHKNLASPTSDSKQ--TGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG---SSKIN 204
FGC + + D+ +G IGLG SL++QM + +FSYCL + SS++
Sbjct: 151 FGC----VVASDIDTMDGPSGFIGLGRTPWSLVAQMKLT---RFSYCLSPRNTGKSSRLF 203
Query: 205 FG--GIVAGAGVVSTPLIIR--------DHYYLSLEAISVGNQRLEFVSSSTGNIFVDTG 254
G +AG+ ST I+ ++Y LSL+AI GN + + S G + + T
Sbjct: 204 LGSSAKLAGSESTSTAPFIKTSPDDDGSNYYLLSLDAIRAGNTTIA-TAQSGGILVMHTV 262
Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS---SQPKFPEVTIHF 311
+LL + K ++ + + P D LC+ + S+ P++ F
Sbjct: 263 SPFSLLVDSAYKAFKKAVTEAVGGAAAPPMATPPQPFD-LCFKKAAGFSRATAPDLVFTF 321
Query: 312 RG-ADVKLSPSNLFRNISDE--IMCSAF-------RGGNANI-VYGRIMQINFLIGYDIE 360
+G A + + P+ ++ +E C+A R G + V G + Q + YD++
Sbjct: 322 QGAAALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLK 381
Query: 361 QAMVSFKPSRCTN 373
+ +SF+P+ C++
Sbjct: 382 KETLSFEPADCSS 394
>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 109/407 (26%), Positives = 166/407 (40%), Gaps = 99/407 (24%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEP---PLFDPKKSSTYN 88
Y LS GTP + DTGS W C C E K +P P F PK SS+
Sbjct: 81 YSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSSSK 140
Query: 89 SISCSSSQCAVV---------------TSNCSEGDCSYSFLYGRGAYASFSSGNLATETL 133
+ C + +C+ + T NC++ +Y YG G+ A G L +ETL
Sbjct: 141 LVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTA----GLLLSETL 196
Query: 134 TFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSY 193
F +PN + GC ++ P+ GI G G G+ SL SQMG KF+Y
Sbjct: 197 DFPDKX-----IPNFVVGCSFLSIHQPS------GIAGFGRGSESLPSQMGLK---KFAY 242
Query: 194 CL--------PDQGSSKINFGGIVAGAGVVSTPL---------IIRDHYYLSLEAISVGN 236
CL P G ++ G V +G+ TP +++YYL++ I VGN
Sbjct: 243 CLASRKFDDSPHSGQLILDSTG-VKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGN 301
Query: 237 QRLE----------------FVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQP 280
Q ++ + S + F+D VL + + ++N +A
Sbjct: 302 QAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVA-----REFEKQLANWTRATD 356
Query: 281 VKGV-GAEPGFSDVLCYNISSQP--KFPEVTIHFRG-ADVKLSPSNLFRNISDE-IMC-- 333
V+ + G P C++IS + KFPE+ F+G A L +N F +S + C
Sbjct: 357 VETLTGLRP------CFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLT 410
Query: 334 --------SAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
GG +++ G Q NF + YD+ + F+ C+
Sbjct: 411 VVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 457
>gi|358346736|ref|XP_003637421.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
gi|355503356|gb|AES84559.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
Length = 280
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 48/123 (39%), Positives = 69/123 (56%), Gaps = 11/123 (8%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + IG PP + +DTGSD +W QC PC DC++Q P+F+P S++Y +SC +
Sbjct: 132 YFSRIGIGEPPSQAYMVLDTGSDISWVQCAPCA--DCYRQADPIFEPTASASYAPLSCEA 189
Query: 95 SQCAVV-TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
+QC + S C G+C Y YG G+Y + G+ TET+T ++ NV GCG
Sbjct: 190 AQCRYLDQSQCRNGNCLYQVSYGDGSY---TVGDFVTETVTIGVN-----KVKNVALGCG 241
Query: 154 HKN 156
H N
Sbjct: 242 HNN 244
>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 100/360 (27%), Positives = 159/360 (44%), Gaps = 36/360 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y L IGTPP + VD+GS T+ C C + C + P F P SSTY+ + C+
Sbjct: 88 YTTRLHIGTPPQEFALIVDSGSTVTYVPCASCEQ--CGNHQDPRFQPDLSSTYSPVKCNV 145
Query: 95 SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
T + + C+Y Y A S SSG L + ++F + S L + +FGC +
Sbjct: 146 D----CTCDSDKNQCTYERQY---AEMSSSSGVLGEDIVSFGTESELKPQ--RAVFGCEN 196
Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLP--DQGSSKINFGGIVA 210
S GI+GLG G S++ Q+ I FS C D G + G + A
Sbjct: 197 SETGDLFSQHAD-GIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLGAMPA 255
Query: 211 GAGVVSTPL-IIRDHYY-LSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTLLPLEYH 265
G++ T +R YY + L+ + V + L + +D+G LP +
Sbjct: 256 PPGMIYTHSNAVRSPYYNIELKEMHVAGKALRVDPRIFDGKHGTVLDSGTTYAYLPEQAF 315
Query: 266 SNLKSVMSNMIKAQPVKGV-GAEPGFSDVLCY-----NISSQPK-FPEVTIHF-RGADVK 317
K +S+ + P+K + G +P + D+ C+ N+S + FP+V + F G +
Sbjct: 316 VAFKDAVSSQV--HPLKKIRGPDPNYKDI-CFAGAGRNVSQLSEVFPKVDMVFGNGQKLS 372
Query: 318 LSPSN-LFRN--ISDEIMCSAFRGG-NANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
LSP N LFR+ + F+ G + + G I+ N L+ YD + F + C+
Sbjct: 373 LSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNCSE 432
>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 105/367 (28%), Positives = 172/367 (46%), Gaps = 36/367 (9%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTW---TQCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +GTPP + +DTGSD W T C CP+ + + FDP SS+ + +
Sbjct: 83 LYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASLV 142
Query: 91 SCSSSQCA---VVTSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNS--TSGLPVE 144
SCS +C S CS + CSYSF YG G S +SG ++ ++F++ TS L +
Sbjct: 143 SCSDRRCYSNFQTESGCSPNNLCSYSFKYGDG---SGTSGFYISDFMSFDTVITSTLAIN 199
Query: 145 MPN-VIFGCGHKNLASPTSDSKQT-GIIGLGPGNSSLISQMGTS-IAGK-FSYCLP-DQG 199
+FGC + + GI GLG G+ S+ISQ+ +A + FS+CL D+
Sbjct: 200 SSAPFVFGCSNLQTGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDKS 259
Query: 200 SSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNIFVDT 253
I G + V TPL+ + HY ++L++I+V Q L F ++ +DT
Sbjct: 260 GGGIMVLGQIKRPDTVYTPLVPSQPHYNVNLQSIAVNGQILPIDPSVFTIATGDGTIIDT 319
Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ--PKFPEVTIHF 311
G LP E +S ++N + G + C+ I++ FPEV++ F
Sbjct: 320 GTTLAYLPDEAYSPFIQAIANAVSQ-----YGRPITYESYQCFEITAGDVDVFPEVSLSF 374
Query: 312 R-GADVKLSPS---NLFRNISDEIMCSAF-RGGNANI-VYGRIMQINFLIGYDIEQAMVS 365
GA + L P +F + I C F R + I + G ++ + ++ YD+ + +
Sbjct: 375 AGGASMVLRPHAYLQIFSSSGSSIWCIGFQRMSHRRITILGDLVLKDKVVVYDLVRQRIG 434
Query: 366 FKPSRCT 372
+ C+
Sbjct: 435 WAEYDCS 441
>gi|222637182|gb|EEE67314.1| hypothetical protein OsJ_24556 [Oryza sativa Japonica Group]
Length = 304
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 98/355 (27%), Positives = 154/355 (43%), Gaps = 74/355 (20%)
Query: 37 MHLSIGTPPVDI---FGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCS 93
M L++GTPPV + FG SD W +C PC C P P + Y+ + S
Sbjct: 1 MELAVGTPPVTVQALFGI----SDLCWVECTPCS--GCNNNAAP---PAGARLYDRANSS 51
Query: 94 SSQCAVVTSNCSEGDCSYSFLYG-RGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
S S ++ +C Y ++YG ++ G L TET+ F S V+ + FGC
Sbjct: 52 S------FSPLADTECGYRYVYGATDTDRNYVKGILGTETIKFGSNDAATVQ--SFTFGC 103
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG--SSKINFGGIVA 210
+ + D TG++GLG SL+ Q+G +FSYCL +S + FG +
Sbjct: 104 TNTVYRNDLFDG-NTGVVGLGRSKLSLVGQLGLD---RFSYCLASNPNVASPVLFGSTAS 159
Query: 211 --GAGVVSTPLIIRD-HYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLPLEYHSN 267
G GV STPL+ D +YY++L ISV RL N
Sbjct: 160 MDGNGVSSTPLLPDDANYYVNLLGISVDGTRLAI------------------------PN 195
Query: 268 LKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK----FPEVTIHFRGADVKLSPSNL 323
+ MS +A V G G +LC+ + K P +T+HF G D++L N
Sbjct: 196 DTARMSRTYEA--VNGSG-------LLCFLVDDASKNVVTVPTMTMHFDGMDMELLFGNY 246
Query: 324 FR-------NISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
F +++C + G +Q++F + Y+++ +++S +P+ C
Sbjct: 247 FAYTGKQSGGGGGDVLCLMIGKSSTGSRIGNYLQMDFHVLYELKNSVLSVQPADC 301
>gi|326517745|dbj|BAK03791.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 556
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 97/383 (25%), Positives = 164/383 (42%), Gaps = 48/383 (12%)
Query: 19 PISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPP- 77
PI +I +I + ++LM + +GTPPV +VDTG+ ++ QCEPC L C KQ
Sbjct: 192 PIDLIQNGDINNF--LFLMPIKLGTPPVWNLVAVDTGATLSFVQCEPC-TLRCHKQTDAG 248
Query: 78 -LFDPKKSSTYNSISCSSSQCAVV-------TSNCSEGD--CSYSFLYGRGAYASFSSGN 127
+FDP KS +++ + CS ++C V + C E + C YS + G +S+S G
Sbjct: 249 EIFDPSKSESFSRVGCSENKCRTVQRALHLQSKACMEKEDSCLYSMTF--GGTSSYSVGK 306
Query: 128 LATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSI 187
L + L + P+ +FGC + G++G S Q+ +
Sbjct: 307 LVRDRLAIGKYAK-GYSFPDFLFGCS----LDTEYHQYEAGLVGFADEPFSFFEQVAPLV 361
Query: 188 AGK-FSYCLP-DQGSSKINFGGIVAGAGVVSTPLII---RDHYYLSLEAISVGNQRLEFV 242
K FSYC P D+ + G TPL + + Y L L+ + V L
Sbjct: 362 NYKAFSYCFPSDRRKTGYLSIGDYTRVNSTYTPLFLARQQSRYALKLDEVLVNGMAL--- 418
Query: 243 SSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGF---SDVLCY--- 296
++ + VD+G T+L + + L + ++ + ++ +G + SD +C+
Sbjct: 419 VTTPSEMIVDSGSRWTILLSDTFTQLDAAIT-----EAMRPLGYNRNYYRGSDYICFEDA 473
Query: 297 ---NISSQPKFPEVTIHF-RGADVKLSPSNLFRNISDEIMCSAFRG----GNANIVYGRI 348
S P V + F G + L P + F +D +C+ F G+ + G
Sbjct: 474 HFQQFSDWAALPVVELKFDMGVKMVLQPQSSFHFNNDYGLCTYFMRDASLGSGVQLLGNT 533
Query: 349 MQINFLIGYDIEQAMVSFKPSRC 371
M + I +DI+ F+ C
Sbjct: 534 MTRSVGITFDIQGGQFGFRKGDC 556
>gi|21450872|gb|AAK44106.2|AF370291_1 unknown protein [Arabidopsis thaliana]
Length = 375
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 102/364 (28%), Positives = 164/364 (45%), Gaps = 46/364 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ +GTPP +F +DT +D W C C F+ SSTY+++SCS+
Sbjct: 30 YVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGC---SNASTSFNTNSSSTYSTVSCST 86
Query: 95 SQCAVV------TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
+QC +S+ CS++ YG +SFS+ +L +TLT P +PN
Sbjct: 87 AQCTQARGLTCPSSSPQPSVCSFNQSYG--GDSSFSA-SLVQDTLTL-----APDVIPNF 138
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD----QGSSKIN 204
FGC N AS S Q G++GLG G SL+SQ + +G FSYCLP S +
Sbjct: 139 SFGC--INSASGNSLPPQ-GLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSGSLK 195
Query: 205 FGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQR-------LEFVSSSTGNIFVDT 253
G + + TPL+ YY++L +SVG+ + L F ++S +D+
Sbjct: 196 LGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDANSGAGTIIDS 255
Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRG 313
G + T + ++ + +GA F C++ ++ P++T+H
Sbjct: 256 GTVITRFAQPVYEAIRDEFRKQVNVSSFSTLGA---FDT--CFSADNENVAPKITLHMTS 310
Query: 314 ADVKLSPSN-LFRNISDEIMCSAFRG--GNANIVY---GRIMQINFLIGYDIEQAMVSFK 367
D+KL N L + + + C + G NAN V + Q N I +D+ + +
Sbjct: 311 LDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGIA 370
Query: 368 PSRC 371
P C
Sbjct: 371 PEPC 374
>gi|359484086|ref|XP_002263357.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 417
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 116/426 (27%), Positives = 167/426 (39%), Gaps = 114/426 (26%)
Query: 31 VDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCE----PCPELDCFKQEP---------- 76
V D YL+ L++GTPP I +DTGSD TW C C + + ++
Sbjct: 8 VRDGYLISLNLGTPPKVIQVYMDTGSDLTWVPCGNLSFDCMDCNDYRNNKLMSTYSPSYS 67
Query: 77 ----------PLFDPKKSSTYNSISCSSSQCA---VVTSNCSEGDCSYSFLYGRGAYASF 123
PL SS + C+ + C+ +V C S+++ YG G
Sbjct: 68 SSSLRDLCVSPLCSDVHSSDNSYDPCAVAGCSLSTLVKGTCPRPCPSFAYTYGAGGVV-- 125
Query: 124 SSGNLATETLTFNSTS-GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQ 182
G L +TLT + +S E+PN FGC P GI G G G SL SQ
Sbjct: 126 -IGTLTRDTLTTHGSSPSFTREVPNFCFGCVGSTYREP------IGIAGFGRGVLSLPSQ 178
Query: 183 MGTSIAGKFSYCLPDQGSSKINFGGIVAGAGVVSTPLIIRD------------------- 223
+G G FS+C G A +S+PL+I D
Sbjct: 179 LGFLQKG-FSHCF---------LGFKFANNPNISSPLVIGDLAISSNDHLQFTSLLKNPM 228
Query: 224 ---HYYLSLEAISVGNQRL--------EFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVM 272
+YY+ LEAI+VGN EF S G + +D+G T LP +++ L S++
Sbjct: 229 YPNYYYIGLEAITVGNATAIQVPSSLREFDSHGNGGMIIDSGTTYTHLPGPFYTQLLSML 288
Query: 273 SNMIKAQPVKGVGAEPGFSDVLCYNI--------SSQPKFPEVTIHFRGADVKL------ 318
++I + A GF LCY I P ++ HF +V L
Sbjct: 289 QSIITYPRAQEQEARTGFD--LCYRIPCPNNVVTDHDHLLPSISFHFSN-NVSLVLPQGN 345
Query: 319 ------SPSN-------LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVS 365
+PSN L +N+ D G A V+G Q N + YD+E+ +
Sbjct: 346 HFYAMGAPSNSTVVKCLLLQNMDDS------DSGPAG-VFGSFQQQNVKVVYDLEKERIG 398
Query: 366 FKPSRC 371
F+P C
Sbjct: 399 FQPMDC 404
>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 500
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 98/373 (26%), Positives = 171/373 (45%), Gaps = 45/373 (12%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +G+P + + +DTGSD W C CP E FD SST +
Sbjct: 82 LYFTKVKLGSPAKEFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAALV 141
Query: 91 SCSSSQCA----VVTSNCSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNST----SG 140
SC C+ TS CS CSY+F YG G S ++G ++T+ F++ S
Sbjct: 142 SCGDPICSYAVQTATSECSSQANQCSYTFQYGDG---SGTTGYYVSDTMYFDTVLLGQSV 198
Query: 141 LPVEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLP- 196
+ +IFGC T +D GI G GPG S+ISQ+ + FS+CL
Sbjct: 199 VANSSSTIIFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCLKG 258
Query: 197 -DQGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNI 249
+ G + G I+ + +V +PL+ + HY L+L++I+V Q L F +++
Sbjct: 259 GENGGGVLVLGEILEPS-IVYSPLVPSQPHYNLNLQSIAVNGQLLPIDSNVFATTNNQGT 317
Query: 250 FVDTG-VLRTLLPLEYHSNLKSVMSNMIK-AQPVKGVGAEPGFSDVLCYNISSQPK--FP 305
VD+G L L+ Y+ +K++ + + + ++P+ G + CY +S+ FP
Sbjct: 318 IVDSGTTLAYLVQEAYNPFVKAITAAVSQFSKPIISKGNQ-------CYLVSNSVGDIFP 370
Query: 306 EVTIHFR-GADVKLSPSNLFRNI----SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDI 359
+V+++F GA + L+P + + + C F+ + G ++ + + YD+
Sbjct: 371 QVSLNFMGGASMVLNPEHYLMHYGFLDGAAMWCIGFQKVEQGFTILGDLVLKDKIFVYDL 430
Query: 360 EQAMVSFKPSRCT 372
+ + C+
Sbjct: 431 ANQRIGWADYDCS 443
>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 663
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 111/386 (28%), Positives = 176/386 (45%), Gaps = 48/386 (12%)
Query: 13 NETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCF 72
+E+ + P + + + + ++ Y L IGTPP VDTGS T+ C C + C
Sbjct: 90 SESKRHPNARMRLHDDLLLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQ--CG 147
Query: 73 KQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEGD---CSYSFLYGRGAYASFSSGNLA 129
+ + P F P+ SSTY + C+ + NC +GD C Y Y A S SSG L
Sbjct: 148 RHQDPKFQPESSSTYQPVKCT------IDCNC-DGDRMQCVYERQY---AEMSTSSGVLG 197
Query: 130 TETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQT-GIIGLGPGNSSLISQM--GTS 186
+ ++F + S L + +FGC +N+ + S+ GI+GLG G+ S++ Q+
Sbjct: 198 EDVISFGNQSELAPQ--RAVFGC--ENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKKV 253
Query: 187 IAGKFSYCL--PDQGSSKINFGGIVAGAGVV---STPLIIRDHYY-LSLEAISVGNQRLE 240
I+ FS C D G + GGI + + S P R YY + L+ + V +RL
Sbjct: 254 ISDSFSLCYGGMDVGGGAMVLGGISPPSDMTFAYSDP--DRSPYYNIDLKEMHVAGKRLP 311
Query: 241 F---VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGV-GAEPGFSDVLCY 296
V +D+G LP K + + + Q +K + G +P ++D+ C+
Sbjct: 312 LNANVFDGKHGTVLDSGTTYAYLPEAAFLAFKDAI--VKELQSLKQISGPDPNYNDI-CF 368
Query: 297 -----NISSQPK-FPEVTIHF-RGADVKLSPSN-LFRN--ISDEIMCSAFRGGN-ANIVY 345
++S K FP V + F G LSP N +FR+ + F+ GN +
Sbjct: 369 SGAGNDVSQLSKSFPVVDMVFGNGHKYSLSPENYMFRHSKVRGAYCLGIFQNGNDQTTLL 428
Query: 346 GRIMQINFLIGYDIEQAMVSFKPSRC 371
G I+ N L+ YD EQ + F + C
Sbjct: 429 GGIIVRNTLVMYDREQTKIGFWKTNC 454
>gi|18391062|ref|NP_563851.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|2160166|gb|AAB60729.1| F21M12.13 gene product [Arabidopsis thaliana]
gi|21593996|gb|AAM65914.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|26983826|gb|AAN86165.1| unknown protein [Arabidopsis thaliana]
gi|332190367|gb|AEE28488.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 449
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 102/364 (28%), Positives = 164/364 (45%), Gaps = 46/364 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ +GTPP +F +DT +D W C C F+ SSTY+++SCS+
Sbjct: 104 YVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGC---SNASTSFNTNSSSTYSTVSCST 160
Query: 95 SQCAVV------TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
+QC +S+ CS++ YG +SFS+ +L +TLT P +PN
Sbjct: 161 AQCTQARGLTCPSSSPQPSVCSFNQSYG--GDSSFSA-SLVQDTLTL-----APDVIPNF 212
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD----QGSSKIN 204
FGC N AS S Q G++GLG G SL+SQ + +G FSYCLP S +
Sbjct: 213 SFGC--INSASGNSLPPQ-GLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSGSLK 269
Query: 205 FGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQR-------LEFVSSSTGNIFVDT 253
G + + TPL+ YY++L +SVG+ + L F ++S +D+
Sbjct: 270 LGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDANSGAGTIIDS 329
Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRG 313
G + T + ++ + +GA F C++ ++ P++T+H
Sbjct: 330 GTVITRFAQPVYEAIRDEFRKQVNVSSFSTLGA---FDT--CFSADNENVAPKITLHMTS 384
Query: 314 ADVKLSPSN-LFRNISDEIMCSAFRG--GNANIVY---GRIMQINFLIGYDIEQAMVSFK 367
D+KL N L + + + C + G NAN V + Q N I +D+ + +
Sbjct: 385 LDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGIA 444
Query: 368 PSRC 371
P C
Sbjct: 445 PEPC 448
>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 526
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 96/352 (27%), Positives = 160/352 (45%), Gaps = 42/352 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
+L+ + +G PP + D +D TW QC+PC + C+ Q +FDP +SS+Y +SC +
Sbjct: 187 FLVQIGVGGPPQKFYMIFDLQTDFTWLQCQPC--IKCYDQPDSIFDPSQSSSYTLLSCET 244
Query: 95 SQCAVV-TSNCS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
C ++ S+CS +G C Y+ Y G + G L ET++F S+ + V GC
Sbjct: 245 KHCNLLPNSSCSDDGYCRYNITYKDGTN---TEGVLINETVSFESSGW----VDRVSLGC 297
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ----GSSKINFGGI 208
+KN P S G GLG G+ S S++ A SYCL + SS + F
Sbjct: 298 SNKN-QGPFVGSD--GTFGLGRGSLSFPSRIN---ASSMSYCLVESKDGYSSSTLEFNSP 351
Query: 209 VAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS-------TGNIFVDTGVLR 257
+G V L+ + YY+ L+ I VG ++++ +S+ G + V + L
Sbjct: 352 PC-SGSVKAKLLQNPKAENLYYVGLKGIKVGGEKIDVPNSTFTIDPYGNGGMIVSSSSLI 410
Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVK 317
T+L + ++ ++ + K Q ++ + A F CYN+SS + F D K
Sbjct: 411 TMLENDTYNVVRDAF--VAKTQHLERLKAFLQFDT--CYNLSSNNTVELPILEFEVNDGK 466
Query: 318 --LSPSNLFRNISDE--IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMV 364
L P + D+ C AF + + G + Q + +D+ + V
Sbjct: 467 SWLLPKESYLYAVDKNGTFCFAFAPSKGSFSILGTLQQYGTRVTFDLVNSFV 518
>gi|255565759|ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 447
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 113/395 (28%), Positives = 169/395 (42%), Gaps = 83/395 (21%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCE---PCPELDCFKQEPPLFDPKKSSTYNSIS 91
Y + LS GTPP + +DTGS W C C + P F PK SS+ I
Sbjct: 77 YSISLSFGTPPQTLSFVMDTGSSFVWFPCTLRYLCNNCSFTSRISP-FLPKHSSSSKIIG 135
Query: 92 CSSSQCAVV-------------TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNST 138
C + +C+ + + NCS+ Y LYG G + G +ETL +
Sbjct: 136 CKNPKCSWIHQTDLRCTDCDNNSRNCSQICPPYLILYGSGT----TGGVALSETLHLH-- 189
Query: 139 SGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL--- 195
GL V PN + GC S S + GI G G G SSL SQ+G + KFSYCL
Sbjct: 190 -GLIV--PNFLVGC------SVFSSRQPAGIAGFGRGPSSLPSQLGLT---KFSYCLLSH 237
Query: 196 ---PDQGSSKINFGGI----VAGAGVVSTPLI----IRD------HYYLSLEAISVGNQR 238
Q SS + A ++ TPL+ ++D +YY+SL IS+G +
Sbjct: 238 KFDDTQESSSLVLDSQSDSDKKTAALMYTPLVKNPKVQDKPAFSVYYYVSLRRISIGGRS 297
Query: 239 LEFV-------SSSTGNIFVDTGVLRTLLPLEYHSNLK----SVMSNMIKAQPVKGV-GA 286
++ G +D+G T + E L S + N +A V+ + G
Sbjct: 298 VKIPYKYLSPDKDGNGGTIIDSGTTFTYMSTEAFEILSNEFISQVKNYERALMVEALSGL 357
Query: 287 EPGFSDVLCYNIS--SQPKFPEVTIHFR-GADVKLSPSNLFRNI-SDEIMC------SAF 336
+P C+N+S + + P++ +HF+ GADV+L N F + S E+ C A
Sbjct: 358 KP------CFNVSGAKELELPQLRLHFKGGADVELPLENYFAFLGSREVACFTVVTDGAE 411
Query: 337 RGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ ++ G NF + YD++ + FK C
Sbjct: 412 KASGPGMILGNFQMQNFYVEYDLQNERLGFKKESC 446
>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
gi|223973065|gb|ACN30720.1| unknown [Zea mays]
Length = 631
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 105/362 (29%), Positives = 163/362 (45%), Gaps = 40/362 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y L IGTPP + VD+GS T+ C C + C + P F P SS+Y+ + C+
Sbjct: 88 YTTRLYIGTPPQEFALIVDSGSTVTYVPCSSCEQ--CGNHQDPRFQPDLSSSYSPVKCNV 145
Query: 95 SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
T + + C+Y Y A S SSG L + ++F S L + + IFGC +
Sbjct: 146 D----CTCDSDKKQCTYERQY---AEMSSSSGVLGEDIVSFGRESELKPQ--HAIFGCEN 196
Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLP--DQGSSKINFGGIVA 210
S GI+GLG G S++ Q+ I+ FS C D G + GG++A
Sbjct: 197 SETGDLFSQHAD-GIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLGGMLA 255
Query: 211 GAGVV---STPLIIRDHYY-LSLEAISVGNQRLEFVS---SSTGNIFVDTGVLRTLLPLE 263
++ S PL R YY + L+ I V + L S +S +D+G LP +
Sbjct: 256 PPDMIFSNSDPL--RSPYYNIELKEIHVAGKALRVESRIFNSKHGTVLDSGTTYAYLPEQ 313
Query: 264 YHSNLKSVMSNMIKAQPVKGV-GAEPGFSDVLCY-----NISS-QPKFPEVTIHF-RGAD 315
K +++ K +K + G +P + D+ C+ N+S FP+V + F G
Sbjct: 314 AFVAFKEAVTS--KVHSLKKIRGPDPSYKDI-CFAGAGRNVSKLHEVFPDVDMVFGNGQK 370
Query: 316 VKLSPSN-LFRNIS-DEIMC-SAFRGG-NANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ L+P N LFR+ D C F+ G + + G I+ N L+ YD + F + C
Sbjct: 371 LSLTPENYLFRHSKVDGAYCLGVFQNGKDPTTLLGGIIVRNTLVTYDRHNEKIGFWKTNC 430
Query: 372 TN 373
+
Sbjct: 431 SE 432
>gi|340810931|gb|AEK75392.1| S5 [Oryza sativa]
gi|340810983|gb|AEK75418.1| S5 [Oryza nivara]
gi|340810985|gb|AEK75419.1| S5 [Oryza nivara]
gi|340810997|gb|AEK75425.1| S5 [Oryza nivara]
gi|340811011|gb|AEK75432.1| S5 [Oryza nivara]
gi|340811013|gb|AEK75433.1| S5 [Oryza nivara]
gi|340811041|gb|AEK75447.1| S5 [Oryza nivara]
gi|340811043|gb|AEK75448.1| S5 [Oryza nivara]
Length = 474
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 108/380 (28%), Positives = 162/380 (42%), Gaps = 58/380 (15%)
Query: 32 DDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP---PLFDPKKSSTYN 88
D ++LM +S+G PPV ++DTGS +W QC+PC + C Q P+FDP +S T
Sbjct: 113 DFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCA-VHCHTQSAKAGPIFDPGRSYTSR 171
Query: 89 SISCSSSQCA-------VVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTS 139
+ CSS +C + +NC E + C+YS YG G ++S G + T+TL +
Sbjct: 172 RVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGW--AYSVGKMVTDTLRIGDS- 228
Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMG----TSIAGKFSYCL 195
+++FGC S + GI G G + S Q+ FSYCL
Sbjct: 229 -----FMDLMFGCSMDVKYSEF----EAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCL 279
Query: 196 P-DQGSSKINFGGIVAGAGVVS--TPL---IIRDHYYLSLEAISVGNQRLEFVSSSTGNI 249
P D+ G A + TPL I R Y L++E + QRL +S+ +
Sbjct: 280 PTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRL---VTSSSEM 336
Query: 250 FVDTGVLRT-LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY------------ 296
VD+G RT L P + K++ M + A +CY
Sbjct: 337 IVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQ--ESYICYLSEHDYSGWNGT 394
Query: 297 --NISSQPKFPEVTIHFRG-ADVKLSPSNLFRNISDEIMCSAFRGGNA--NIVYGRIMQI 351
S+ P + I F G A + LSP N+F N +C F A + + G +
Sbjct: 395 ITPFSNWSALPPLEIGFAGGAALALSPRNVFYNDPHRGLCMTFAQNPALRSQILGNRVTR 454
Query: 352 NFLIGYDIEQAMVSFKPSRC 371
+F +DI+ FK + C
Sbjct: 455 SFGTTFDIQGKQFGFKYAAC 474
>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 500
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 96/306 (31%), Positives = 142/306 (46%), Gaps = 34/306 (11%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +G PP D + +DTGSD W C CP + FDP S+T + +
Sbjct: 82 LYYTRVQLGNPPKDFYVQIDTGSDVLWVSCNSCNGCPATSGLQIPLNFFDPGSSTTASLV 141
Query: 91 SCSSSQCAVVT----SNC--SEGDCSYSFLYGRGAYASFSSGNLATETLTFN---STSGL 141
SCS CA+ S C C+Y F YG G S +SG + + + +S
Sbjct: 142 SCSDQICALGVQSSDSACFGQSNQCAYVFQYGDG---SGTSGYYVMDMIHLDVVIDSSVT 198
Query: 142 PVEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS-IAGK-FSYCLP-- 196
+V+FGC T SD GI G G + S+ISQ+ + IA K FS+CL
Sbjct: 199 SNSSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQDLSVISQLSSRGIAPKVFSHCLKGD 258
Query: 197 DQGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNIF 250
D G + G IV VV TPL+ + HY L+L++ISV Q L F +SS+
Sbjct: 259 DSGGGILVLGEIVE-PNVVYTPLVPSQPHYNLNLQSISVNGQVLPISPAVFATSSSQGTI 317
Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVT 308
+D+G L E ++ ++N++ +Q + V + CY SS FP+V+
Sbjct: 318 IDSGTTLAYLAEEAYNAFVVAVTNIV-SQSTQSVVLKGN----RCYVTSSSVSDIFPQVS 372
Query: 309 IHFRGA 314
++F G
Sbjct: 373 LNFAGG 378
>gi|340810907|gb|AEK75380.1| S5 [Oryza sativa]
Length = 472
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 108/380 (28%), Positives = 162/380 (42%), Gaps = 58/380 (15%)
Query: 32 DDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP---PLFDPKKSSTYN 88
D ++LM +S+G PPV ++DTGS +W QC+PC + C Q P+FDP +S T
Sbjct: 111 DFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCA-VHCHTQSAKAGPIFDPGRSYTSR 169
Query: 89 SISCSSSQCA-------VVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTS 139
+ CSS +C + +NC E + C+YS YG G ++S G + T+TL +
Sbjct: 170 RVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGW--AYSVGKMVTDTLRIGDS- 226
Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMG----TSIAGKFSYCL 195
+++FGC S + GI G G + S Q+ FSYCL
Sbjct: 227 -----FMDLMFGCSMDVKYSEF----EAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCL 277
Query: 196 P-DQGSSKINFGGIVAGAGVVS--TPL---IIRDHYYLSLEAISVGNQRLEFVSSSTGNI 249
P D+ G A + TPL I R Y L++E + QRL +S+ +
Sbjct: 278 PTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRL---VTSSSEM 334
Query: 250 FVDTGVLRT-LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY------------ 296
VD+G RT L P + K++ M + A +CY
Sbjct: 335 IVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQ--ESYICYLSEHDYSGWNGT 392
Query: 297 --NISSQPKFPEVTIHFRG-ADVKLSPSNLFRNISDEIMCSAFRGGNA--NIVYGRIMQI 351
S+ P + I F G A + LSP N+F N +C F A + + G +
Sbjct: 393 ITPFSNWSALPLLEIGFAGGAALALSPRNVFYNDPHRGLCMTFAQNPALRSQILGNRVTR 452
Query: 352 NFLIGYDIEQAMVSFKPSRC 371
+F +DI+ FK + C
Sbjct: 453 SFGTTFDIQGKQFGFKYAAC 472
>gi|413938616|gb|AFW73167.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 452
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 107/353 (30%), Positives = 151/353 (42%), Gaps = 56/353 (15%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPEL-DCFKQEPPLFDPKKSSTYNSISCS 93
Y++ S+GTP V VDTGSD +W QC+PC C+ Q+ PLFDP +SS+Y ++ C
Sbjct: 140 YVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPCG 199
Query: 94 SSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
CA G G YA+ + + FGCG
Sbjct: 200 GPVCA-----------------GLGIYAASACSAAQCGAVQ------------GFFFGCG 230
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK----INFGGIV 209
H A + G++GLG SL+ Q + G FSYCLP + S+ + GG
Sbjct: 231 H---AQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPS 287
Query: 210 AGAGVVSTPLIIRD-----HYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLLPLE 263
A ST ++ +Y + L ISVG Q+L +S+ G VDTG + T LP
Sbjct: 288 GAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVVTRLPPT 347
Query: 264 YHSNLKSVMSNMIKAQPVKGVGAEP--GFSDVLCYNIS--SQPKFPEVTIHF-RGADVKL 318
++ L+S + + + G P G D CYN + P V + F GA V L
Sbjct: 348 AYAALRSAFRSGMASY---GYPTAPSNGILDT-CYNFAGYGTVTLPNVALTFGSGATVTL 403
Query: 319 SPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ S GG A + G + Q +F + I+ V FKPS C
Sbjct: 404 GADGILSFGCLAFAPSGSDGGMA--ILGNVQQRSFEV--RIDGTSVGFKPSSC 452
>gi|449451076|ref|XP_004143288.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 488
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 99/369 (26%), Positives = 166/369 (44%), Gaps = 42/369 (11%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +G P + +DTGSD W C P CP+ E LFD KSS+ +
Sbjct: 83 LYFTKVKLGNPAREFNVQIDTGSDILWVTCSPCDGCPDSSGLGIELNLFDTTKSSSARVL 142
Query: 91 SCSSSQCAVVTSNCSE-----GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG---LP 142
C+ CA V++ + CSYSF Y S +SG T+++ F+ G +
Sbjct: 143 PCTDPICAAVSTTTDQCLTQTDHCSYSFHYRD---RSGTSGFYVTDSMHFDILLGESTIA 199
Query: 143 VEMPNVIFGCGHKNLASPTSDSKQT-GIIGLGPGNSSLISQMGTS-IAGK-FSYCLP--D 197
++FGC T +K GI G G G S+ISQ+ + I K FS+CL +
Sbjct: 200 NSSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHCLKGGE 259
Query: 198 QGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRL----EFVSSSTGNIFVD 252
G + G I+ + +V +PLI + HY L L++I++ Q F S+ G +D
Sbjct: 260 NGGGILVLGEILEPS-IVYSPLIPSQPHYTLKLQSIALSGQLFPNPTMFPISNAGETIID 318
Query: 253 TGVLRTLLPLEYHSNLKSVMSNMI--KAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVT 308
+G L E + + SV+++ + A P G++ C+ +S FP +
Sbjct: 319 SGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQ-------CFRVSMSVADIFPVLR 371
Query: 309 IHFRG-ADVKLSPSNL--FRNISDE--IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQA 362
+F G A + ++P F +I E + C F+ + + G ++ + +I YD+ +
Sbjct: 372 FNFEGIASMVVTPEEYLQFDSIVREPALWCIGFQKAEDGLNILGDLVLKDKIIVYDLARQ 431
Query: 363 MVSFKPSRC 371
+ + C
Sbjct: 432 RIGWANYDC 440
>gi|125553836|gb|EAY99441.1| hypothetical protein OsI_21409 [Oryza sativa Indica Group]
Length = 376
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 72/204 (35%), Positives = 98/204 (48%), Gaps = 25/204 (12%)
Query: 52 VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNCSEG- 107
+D+GSD W QC+PCP L C Q PLFDP S+TY+++ CSS+ CA + CS
Sbjct: 165 IDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYSAVPCSSAACARLGPYRRGCSANV 224
Query: 108 DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE-MPNVIFGCGHKNLASPTSDSKQ 166
C + F Y GA A +G +++ LT P + + +FGC H + S T
Sbjct: 225 QCQFGFTYTDGATA---TGTYSSDDLTLG-----PYDVVRGFLFGCAHADRGS-TFSFDV 275
Query: 167 TGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFGGIVAGAGV----VSTPLI 220
+G + LG G S + Q T FSYC+P SS I G A + VSTPL+
Sbjct: 276 SGTLALGGGAQSFVQQTATQYGRVFSYCIPPSPSSLGFITLGVPPQRAALVPTFVSTPLL 335
Query: 221 IRDH-----YYLSLEAISVGNQRL 239
Y + L AI V + L
Sbjct: 336 SSSSMPPTFYRVLLRAIIVAGRPL 359
>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 683
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 109/383 (28%), Positives = 172/383 (44%), Gaps = 44/383 (11%)
Query: 14 ETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFK 73
E+ + P + + + + ++ Y L IGTPP VDTGS T+ C C + C +
Sbjct: 60 ESKRHPNARMRLHDDLLLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQ--CGR 117
Query: 74 QEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEG--DCSYSFLYGRGAYASFSSGNLATE 131
+ P F P SSTY + C+ + NC C Y Y A S SSG L +
Sbjct: 118 HQDPKFQPDLSSTYQPVKCT------LDCNCDNDRMQCVYERQY---AEMSTSSGVLGED 168
Query: 132 TLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQT-GIIGLGPGNSSLISQM--GTSIA 188
++F + S L + +FGC +N+ + S+ GI+GLG G+ S++ Q+ ++
Sbjct: 169 VVSFGNQSELAPQ--RAVFGC--ENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVS 224
Query: 189 GKFSYCL--PDQGSSKINFGGIVAGAGVV---STPLIIRDHYY-LSLEAISVGNQRLEF- 241
FS C D G + GGI + +V S P +R YY + L+ I V +RL
Sbjct: 225 DSFSLCYGGMDVGGGAMVLGGISPPSDMVFAQSDP--VRSPYYNIDLKEIHVAGKRLPLN 282
Query: 242 --VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY--- 296
V +D+G LP E K + +++ + G +P ++D LC+
Sbjct: 283 PSVFDGKHGSVLDSGTTYAYLPEEAFLAFKEAIVKELQSFS-QISGPDPNYND-LCFSGA 340
Query: 297 --NISSQPK-FPEVTIHF-RGADVKLSPSN-LFRN--ISDEIMCSAFRGG-NANIVYGRI 348
++S K FP V + F G LSP N +FR+ + F+ G + + G I
Sbjct: 341 GIDVSQLSKTFPVVDMIFGNGHKYSLSPENYMFRHSKVRGAYCLGIFQNGKDPTTLLGGI 400
Query: 349 MQINFLIGYDIEQAMVSFKPSRC 371
+ N L+ YD EQ + F + C
Sbjct: 401 VVRNTLVLYDREQTKIGFWKTNC 423
>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 645
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 108/364 (29%), Positives = 158/364 (43%), Gaps = 44/364 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y L IGTPP VDTGS T+ C C C + P F P+ S TY + C +
Sbjct: 93 YTARLWIGTPPQRFALIVDTGSTVTYVPCSTCRH--CGSHQDPKFRPEDSETYQPVKC-T 149
Query: 95 SQCAVVTSNC--SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
QC NC C+Y Y A S SSG L + ++F + + L + IFGC
Sbjct: 150 WQC-----NCDNDRKQCTYERRY---AEMSTSSGALGEDVVSFGNQTELSPQ--RAIFGC 199
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQM--GTSIAGKFSYCL--PDQGSSKINFGGI 208
+ + + GI+GLG G+ S++ Q+ I+ FS C G + GGI
Sbjct: 200 ENDETGD-IYNQRADGIMGLGRGDLSIMDQLVEKKVISDSFSLCYGGMGVGGGAMVLGGI 258
Query: 209 VAGAGVV---STPLIIRDHYY-LSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTLLP 261
A +V S P +R YY + L+ I V +RL V +D+G LP
Sbjct: 259 SPPADMVFTRSDP--VRSPYYNIDLKEIHVAGKRLHLNPKVFDGKHGTVLDSGTTYAYLP 316
Query: 262 LEYHSNLKSVMSNMIKAQPVKGV-GAEPGFSDVLCY-----NISSQPK-FPEVTIHF-RG 313
K + M + +K + G +P ++D+ C+ ++S K FP V + F G
Sbjct: 317 ESAFLAFKHAI--MKETHSLKRISGPDPRYNDI-CFSGAEIDVSQISKSFPVVEMVFGNG 373
Query: 314 ADVKLSPSN-LFRN--ISDEIMCSAFRGGN-ANIVYGRIMQINFLIGYDIEQAMVSFKPS 369
+ LSP N LFR+ + F GN + G I+ N L+ YD E + F +
Sbjct: 374 HKLSLSPENYLFRHSKVRGAYCLGVFSNGNDPTTLLGGIVVRNTLVMYDREHTKIGFWKT 433
Query: 370 RCTN 373
C+
Sbjct: 434 NCSE 437
>gi|194699094|gb|ACF83631.1| unknown [Zea mays]
gi|413938606|gb|AFW73157.1| hypothetical protein ZEAMMB73_333672 [Zea mays]
Length = 452
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 107/353 (30%), Positives = 151/353 (42%), Gaps = 56/353 (15%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPEL-DCFKQEPPLFDPKKSSTYNSISCS 93
Y++ S+GTP V VDTGSD +W QC+PC C+ Q+ PLFDP +SS+Y ++ C
Sbjct: 140 YVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFDPAQSSSYAAVPCG 199
Query: 94 SSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
CA G G YA+ + + FGCG
Sbjct: 200 GPVCA-----------------GLGIYAASACSAAQCGAVQ------------GFFFGCG 230
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK----INFGGIV 209
H A + G++GLG SL+ Q + G FSYCLP + S+ + GG
Sbjct: 231 H---AQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPS 287
Query: 210 AGAGVVSTPLIIRD-----HYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLLPLE 263
A ST ++ +Y + L ISVG Q+L +S+ G VDTG + T LP
Sbjct: 288 GAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVVTRLPPT 347
Query: 264 YHSNLKSVMSNMIKAQPVKGVGAEP--GFSDVLCYNIS--SQPKFPEVTIHF-RGADVKL 318
++ L+S + + + G P G D CYN + P V + F GA V L
Sbjct: 348 AYAALRSAFRSGMASY---GYPTAPSNGILDT-CYNFAGYGTVTLPNVALTFGSGATVTL 403
Query: 319 SPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ S GG A + G + Q +F + I+ V FKPS C
Sbjct: 404 GADGILSFGCLAFAPSGSDGGMA--ILGNVQQRSFEV--RIDGTSVGFKPSSC 452
>gi|356537928|ref|XP_003537458.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 445
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 99/383 (25%), Positives = 170/383 (44%), Gaps = 72/383 (18%)
Query: 39 LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA 98
L++GTPP + +DTGS+ +W C+ ++ +F+P SS+Y I C S C
Sbjct: 74 LTVGTPPQSVTMVLDTGSELSWLHCKKQQNINS------VFNPHLSSSYTPIPCMSPICK 127
Query: 99 VVTSN------CSEGDCSYSFLYGRGAYASFSS--GNLATETLTFNSTSGLPVEMPNVIF 150
T + C + + + +YA F+S GNLA++T + + P +IF
Sbjct: 128 TRTRDFLIPVSCDSNNLCHVTV----SYADFTSLEGNLASDTFAISGSG-----QPGIIF 178
Query: 151 GCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIV 209
G +S DSK TG++G+ G+ S ++QMG KFSYC+ + +S + G
Sbjct: 179 GSMDSGFSSNANEDSKTTGLMGMNRGSLSFVTQMGFP---KFSYCISGKDASGVLLFGDA 235
Query: 210 AGAGV----------VSTPLIIRDH--YYLSLEAISVGNQRLE-----FVSSST--GNIF 250
+ ++TPL D Y + L I VG++ L+ F T G
Sbjct: 236 TFKWLGPLKYTPLVKMNTPLPYFDRVAYTVRLMGIRVGSKPLQVPKEIFAPDHTGAGQTM 295
Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGF----SDVLCYNISSQ---PK 303
VD+G T L ++ L++ + + + V + +P F + LC+ + P
Sbjct: 296 VDSGTRFTFLLGSVYTALRNEF--VAQTRGVLTLLEDPNFVFEGAMDLCFRVRRGGVVPA 353
Query: 304 FPEVTIHFRGADVKLSPSNLFRNI---------SDEIMCSAFRGGNANI------VYGRI 348
P VT+ F GA++ +S L + + ++ C F GN+++ V G
Sbjct: 354 VPAVTMVFEGAEMSVSGERLLYRVGGDGDVAKGNGDVYCLTF--GNSDLLGIEAYVIGHH 411
Query: 349 MQINFLIGYDIEQAMVSFKPSRC 371
Q N + +D+ + V F ++C
Sbjct: 412 HQQNVWMEFDLVNSRVGFADTKC 434
>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
Length = 632
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 105/362 (29%), Positives = 162/362 (44%), Gaps = 40/362 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y L IGTPP + VD+GS T+ C C + C + P F P SS+Y+ + C+
Sbjct: 89 YTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQ--CGNHQDPRFQPDLSSSYSPVKCNV 146
Query: 95 SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
T + + C+Y Y A S SSG L + ++F S L + +FGC +
Sbjct: 147 D----CTCDSDKKQCTYERQY---AEMSSSSGVLGEDIVSFGRESELKPQ--RAVFGCEN 197
Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLP--DQGSSKINFGGIVA 210
S GI+GLG G S++ Q+ I+ FS C D G + GG+ A
Sbjct: 198 SETGDLFSQHAD-GIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLGGVPA 256
Query: 211 GAGVV---STPLIIRDHYY-LSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTLLPLE 263
+ +V S PL R YY + L+ I V + L V +S +D+G LP +
Sbjct: 257 PSDMVFSHSDPL--RSPYYNIELKEIHVAGKALRVDSRVFNSKHGTVLDSGTTYAYLPEQ 314
Query: 264 YHSNLKSVMSNMIKAQPVKGV-GAEPGFSDVLCY-----NISS-QPKFPEVTIHF-RGAD 315
K +++ K +K + G +P + D+ C+ N+S FP+V + F G
Sbjct: 315 AFVAFKDAVTS--KVHSLKKIRGPDPNYKDI-CFAGAGRNVSKLHEVFPDVDMVFGNGQK 371
Query: 316 VKLSPSN-LFRNIS-DEIMC-SAFRGG-NANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ L+P N LFR+ D C F+ G + + G I+ N L+ YD + F + C
Sbjct: 372 LSLTPENYLFRHSKVDGAYCLGVFQNGKDPTTLLGGIIVRNTLVTYDRHNEKIGFWKTNC 431
Query: 372 TN 373
+
Sbjct: 432 SE 433
>gi|242041431|ref|XP_002468110.1| hypothetical protein SORBIDRAFT_01g039750 [Sorghum bicolor]
gi|241921964|gb|EER95108.1| hypothetical protein SORBIDRAFT_01g039750 [Sorghum bicolor]
Length = 467
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 114/413 (27%), Positives = 165/413 (39%), Gaps = 99/413 (23%)
Query: 39 LSIGTPPVDIFGSVDTGSDCTWTQC--EPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQ 96
+++G PP ++ +DTGS+ +W C P Q P F+ SSTY + CSSS
Sbjct: 63 VAVGAPPQNVTMVLDTGSELSWLLCNGSRVPSTPPQPQAPAAFNGSASSTYAAAHCSSSP 122
Query: 97 --------------CAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
CA SN C S Y A AS + G LA +T P
Sbjct: 123 ECQWRGRDLPVPPFCAGPPSN----SCRVSLSY---ADASSADGVLAADTFLLGGAP--P 173
Query: 143 VEMPNVIFGC-------------GHKNLASPTSDSK-QTGIIGLGPGNSSLISQMGTSIA 188
V +FGC G+ N AS T+ S+ TG++G+ G+ S ++Q GT
Sbjct: 174 VR---ALFGCITSYSSSSTADGNGNGNDASATNSSEAATGLLGMNRGSLSFVTQTGTL-- 228
Query: 189 GKFSYCL-PDQGSSKINFGGIVAGAGVVS------TPLII---------RDHYYLSLEAI 232
+F+YC+ P G + GG GA + + TPLI R Y + LE I
Sbjct: 229 -RFAYCIAPGDGPGLLVLGGDGDGAALSAAPQLNYTPLIEMSQPLPYFDRVAYSVQLEGI 287
Query: 233 SVGNQRLEFVSS-------STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVG 285
VG L S G VD+G T L + ++ LK N A +
Sbjct: 288 RVGAALLPIPKSVLAPDHTGAGQTMVDSGTQFTFLLADAYAPLKGEFLNQTSA--LLAPL 345
Query: 286 AEPGF------------SDVLCYNISSQPKFPEVTIHFRGADVKLSPSNLFRNI------ 327
EP F S+ ++ PEV + RGA+V + L +
Sbjct: 346 GEPDFVFQGAFDACFRASEARVAAATASQLLPEVGLVLRGAEVAVGGEKLLYMVPGERRG 405
Query: 328 ---SDEIMCSAFRGGNANI------VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
S+ + C F GN+++ V G Q N + YD++ + V F P+RC
Sbjct: 406 EGGSEAVWCLTF--GNSDMAGMSAYVIGHHHQQNVWVEYDLQNSRVGFAPARC 456
>gi|38605896|emb|CAD41523.2| OSJNBb0020O11.8 [Oryza sativa Japonica Group]
Length = 519
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 111/416 (26%), Positives = 164/416 (39%), Gaps = 96/416 (23%)
Query: 35 YLMHLSIGTPP----VDIFGSVDTGSDCTWTQCEPCPELDCFKQE----------PPLFD 80
Y + LS+G P V +F +DTGSD W C P + C + PP D
Sbjct: 88 YTLSLSVGPPSTASSVSLF--LDTGSDLVWFPCAPFTCMLCEGKATPGGNHSSPLPPPID 145
Query: 81 PKK------------SSTYNSISCSSSQC---AVVTSNCSEGDCS-YSFLYGRGAY-ASF 123
++ SS S C++++C A+ T +C+ C + YG G+ A+
Sbjct: 146 SRRISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLVANL 205
Query: 124 SSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQM 183
G + + + VE N F C H LA P G+ G G G SL +Q+
Sbjct: 206 RRGRVGL-------AASMAVE--NFTFACAHTALAEPV------GVAGFGRGPLSLPAQL 250
Query: 184 GTSIAGKFSYCLPDQG--------SSKINFGGIVAGAGV-------VSTPLIIRDH---- 224
S++G+FSYCL SS + G A + V TPL+
Sbjct: 251 APSLSGRFSYCLVAHSFRADRLIRSSPLILGRSTDAAAIGASETDFVYTPLLHNPKHPYF 310
Query: 225 YYLSLEAISVGNQRLE-------FVSSSTGNIFVDTGVLRTLLPLEYHSNLKS---VMSN 274
Y ++LEA+SVG +R++ G + VD+G T+LP + + +
Sbjct: 311 YSVALEAVSVGGKRIQAQPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMA 370
Query: 275 MIKAQPVKGVGAEPGFSDVLCYNIS-SQPKFPEVTIHFRG-ADVKLSPSNLFRNISDE-- 330
+ +G A+ G + CY+ S S P V +HFRG A V L N F E
Sbjct: 371 AARFTRAEGAEAQTGLAP--CYHYSPSDRAVPPVALHFRGNATVALPRRNYFMGFKSEEG 428
Query: 331 --IMCSAFR-----------GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
+ C GG G Q F + YD++ V F RCT+
Sbjct: 429 RSVGCLMLMNVGGNNDDGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCTD 484
>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 492
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 104/367 (28%), Positives = 172/367 (46%), Gaps = 36/367 (9%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTW---TQCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +GTPP + +DTGSD W T C CP+ + + FDP SS+ + +
Sbjct: 83 LYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASLV 142
Query: 91 SCSSSQCA---VVTSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNS--TSGLPVE 144
SCS +C S CS + CSYSF YG G S +SG ++ ++F++ TS L +
Sbjct: 143 SCSDRRCYSNFQTESGCSPNNLCSYSFKYGDG---SGTSGYYISDFMSFDTVITSTLAIN 199
Query: 145 MPN-VIFGCGHKNLASPTSDSKQT-GIIGLGPGNSSLISQMGTS-IAGK-FSYCLP-DQG 199
+FGC + + GI GLG G+ S+ISQ+ +A + FS+CL D+
Sbjct: 200 SSAPFVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDKS 259
Query: 200 SSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNIFVDT 253
I G + V TPL+ + HY ++L++I+V Q L F ++ +DT
Sbjct: 260 GGGIMVLGQIKRPDTVYTPLVPSQPHYNVNLQSIAVNGQILPIDPSVFTIATGDGTIIDT 319
Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ--PKFPEVTIHF 311
G LP E +S ++N + G + C+ I++ FP+V++ F
Sbjct: 320 GTTLAYLPDEAYSPFIQAVANAVSQ-----YGRPITYESYQCFEITAGDVDVFPQVSLSF 374
Query: 312 R-GADVKLSPS---NLFRNISDEIMCSAF-RGGNANI-VYGRIMQINFLIGYDIEQAMVS 365
GA + L P +F + I C F R + I + G ++ + ++ YD+ + +
Sbjct: 375 AGGASMVLGPRAYLQIFSSSGSSIWCIGFQRMSHRRITILGDLVLKDKVVVYDLVRQRIG 434
Query: 366 FKPSRCT 372
+ C+
Sbjct: 435 WAEYDCS 441
>gi|125595855|gb|EAZ35635.1| hypothetical protein OsJ_19925 [Oryza sativa Japonica Group]
Length = 335
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 69/212 (32%), Positives = 102/212 (48%), Gaps = 26/212 (12%)
Query: 52 VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNC-SEG 107
+D+GSD W QC+PCP L C Q PLFDP S+TY ++ CSS+ CA + C +
Sbjct: 85 IDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYAAVPCSSAACARLGPYRRGCLANS 144
Query: 108 DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE-MPNVIFGCGHKNLASPTSDSKQ 166
C + Y GA A +G +++ LT P + + +FGC H + S T
Sbjct: 145 QCQFGITYANGATA---TGTYSSDDLTLG-----PYDVVRGFLFGCAHADQGS-TFSYDV 195
Query: 167 TGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGA---------GVVST 217
G + LG G+ S + Q + + FSYC+P SS FG I+ G VST
Sbjct: 196 AGTLALGGGSQSFVQQTASQYSRVFSYCVPPSTSS---FGFIMFGVPPQRAALVPTFVST 252
Query: 218 PLIIRDHYYLSLEAISVGNQRLEFVSSSTGNI 249
PL+ + +I++ + L F +T N+
Sbjct: 253 PLLSSSTMSPTFYSITLPSIALVFDGGATVNL 284
>gi|115457772|ref|NP_001052486.1| Os04g0334700 [Oryza sativa Japonica Group]
gi|113564057|dbj|BAF14400.1| Os04g0334700 [Oryza sativa Japonica Group]
Length = 482
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 95/367 (25%), Positives = 156/367 (42%), Gaps = 39/367 (10%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + IGTP V + +DTGS W C+ CP ++ +DP+ S + +
Sbjct: 82 LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEV 141
Query: 91 SCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP---N 147
C + C C Y Y G + G L T+ L ++ G P +
Sbjct: 142 KCDDTICTSRPPCNMTLRCPYITGYADGG---LTMGILFTDLLHYHQLYGNGQTQPTSTS 198
Query: 148 VIFGCGHKNLASPTSDSKQT-GIIGLGPGNSSLISQMGTSIAGK----FSYCLPDQGSSK 202
V FGCG + S + + GIIG G N + +SQ+ + AGK FS+CL
Sbjct: 199 VTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQL--AAAGKTKKIFSHCLDSTNGGG 256
Query: 203 INFGGIVAGAGVVSTPLIIRDHYY--LSLEAISVGNQRLE-----FVSSSTGNIFVDTGV 255
I G V V +TP++ + Y ++L++I+V L+ F ++ T F+D+G
Sbjct: 257 IFAIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDSGS 316
Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI--SSQPKFPEVTIHFRG 313
LP +S L + + P +GA F C++ S KFP++T HF
Sbjct: 317 TLVYLPEIIYSEL---ILAVFAKHPDITMGAMYNFQ---CFHFLGSVDDKFPKITFHFEN 370
Query: 314 ADVKLS--PSNLFRNISDEIMCSAFR-----GGNANIVYGRIMQINFLIGYDIEQAMVSF 366
D+ L P + C F+ G I+ G ++ N ++ YD+E+ + +
Sbjct: 371 -DLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVVVYDMEKQAIGW 429
Query: 367 KPSRCTN 373
C++
Sbjct: 430 TEHNCSS 436
>gi|115459640|ref|NP_001053420.1| Os04g0535200 [Oryza sativa Japonica Group]
gi|113564991|dbj|BAF15334.1| Os04g0535200 [Oryza sativa Japonica Group]
gi|116310090|emb|CAH67110.1| H0502G05.1 [Oryza sativa Indica Group]
gi|116310464|emb|CAH67468.1| OSIGBa0159I10.13 [Oryza sativa Indica Group]
gi|215715343|dbj|BAG95094.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765807|dbj|BAG87504.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767550|dbj|BAG99778.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218195278|gb|EEC77705.1| hypothetical protein OsI_16781 [Oryza sativa Indica Group]
Length = 492
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 109/415 (26%), Positives = 164/415 (39%), Gaps = 94/415 (22%)
Query: 35 YLMHLSIGTPP----VDIFGSVDTGSDCTWTQCEPCPELDCFKQE----------PPLFD 80
Y + LS+G P V +F +DTGSD W C P + C + PP D
Sbjct: 88 YTLSLSVGPPSTASSVSLF--LDTGSDLVWFPCAPFTCMLCEGKATPGGNHSSPLPPPID 145
Query: 81 PKK------------SSTYNSISCSSSQC---AVVTSNCSEGDCS-YSFLYGRGAYASFS 124
++ SS S C++++C A+ T +C+ C + YG G+ +
Sbjct: 146 SRRISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLVA-- 203
Query: 125 SGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMG 184
NL + ++ + + N F C H LA P G+ G G G SL +Q+
Sbjct: 204 --NLRRGRVGLAAS----MAVENFTFACAHTALAEPV------GVAGFGRGPLSLPAQLA 251
Query: 185 TSIAGKFSYCLPDQG--------SSKINFGGIVAGAGV-------VSTPLIIRDH----Y 225
S++G+FSYCL SS + G A + V TPL+ Y
Sbjct: 252 PSLSGRFSYCLVAHSFRADRLIRSSPLILGRSTDAAAIGASETDFVYTPLLHNPKHPYFY 311
Query: 226 YLSLEAISVGNQRLE-------FVSSSTGNIFVDTGVLRTLLPLEYHSNLKS---VMSNM 275
++LEA+SVG +R++ G + VD+G T+LP + + +
Sbjct: 312 SVALEAVSVGGKRIQAQPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAA 371
Query: 276 IKAQPVKGVGAEPGFSDVLCYNIS-SQPKFPEVTIHFRG-ADVKLSPSNLFRNISDE--- 330
+ +G A+ G + CY+ S S P V +HFRG A V L N F E
Sbjct: 372 ARFTRAEGAEAQTGLAP--CYHYSPSDRAVPPVALHFRGNATVALPRRNYFMGFKSEEGR 429
Query: 331 -IMCSAFR-----------GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
+ C GG G Q F + YD++ V F RCT+
Sbjct: 430 SVGCLMLMNVGGNNDDGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCTD 484
>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
Length = 631
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 102/360 (28%), Positives = 158/360 (43%), Gaps = 38/360 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y L IGTP + VD+GS T+ C C + C + P F P SSTY+ + C+
Sbjct: 91 YTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQ--CGNHQDPRFQPDLSSTYSPVKCNV 148
Query: 95 SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
T + C+Y Y A S SSG L + ++F S L + +FGC +
Sbjct: 149 D----CTCDNERSQCTYERQY---AEMSSSSGVLGEDIMSFGKESELKPQ--RAVFGCEN 199
Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLP--DQGSSKINFGGIVA 210
S GI+GLG G S++ Q+ I+ FS C D G + GG+ A
Sbjct: 200 TETGDLFSQHAD-GIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGTMVLGGMPA 258
Query: 211 GAGVV---STPLIIRDHYY-LSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTLLPLE 263
+V S P +R YY + L+ I V + L + +S +D+G LP +
Sbjct: 259 PPDMVFSHSNP--VRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTVLDSGTTYAYLPEQ 316
Query: 264 YHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY-----NISSQPK-FPEVTIHF-RGADV 316
K ++N + + K G +P + D+ C+ N+S + FP+V + F G +
Sbjct: 317 AFVAFKDAVTNKVNSLK-KIRGPDPNYKDI-CFAGAGRNVSQLSEVFPDVDMVFGNGQKL 374
Query: 317 KLSPSN-LFRN--ISDEIMCSAFRGG-NANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
LSP N LFR+ + F+ G + + G I+ N L+ YD + F + C+
Sbjct: 375 SLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNCS 434
>gi|296082172|emb|CBI21177.3| unnamed protein product [Vitis vinifera]
Length = 376
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 60/154 (38%), Positives = 80/154 (51%), Gaps = 17/154 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP D+ DTGSD TWTQCEPC C+ Q+ P+F+P KS++Y +ISCSS
Sbjct: 138 YVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARY-CYHQQEPIFNPSKSTSYTNISCSS 196
Query: 95 SQCAVVTS------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
C + S +CS C Y YG S+S G A + L ST N
Sbjct: 197 PTCDELKSGTGNSPSCSASTCVYGIQYGD---QSYSVGFFAQDKLALTSTD----VFNNF 249
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQ 182
+FGCG N G+IGLG SL+S+
Sbjct: 250 LFGCGQNNRGLFVG---VAGLIGLGRNALSLMSK 280
Score = 40.8 bits (94), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 49/111 (44%), Gaps = 14/111 (12%)
Query: 267 NLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFR-GADVKLSPSNL 323
N S+MS KA P + CY+ S P++ ++F GA++ L PS +
Sbjct: 273 NALSLMSKYPKAAPASILDT--------CYDFSQYDTVDVPKINLYFSDGAEMDLDPSGI 324
Query: 324 FRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
F ++ +C AF G + + G + Q F + YD+ + F P C
Sbjct: 325 FYILNISQVCLAFAGNSDATDIAILGNVQQKTFDVVYDVAGGRIGFAPGGC 375
>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 507
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 105/384 (27%), Positives = 160/384 (41%), Gaps = 62/384 (16%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +G+PP D + +DTGSD W C CP + FDP S+T +
Sbjct: 83 LYFTRVQLGSPPKDFYVQIDTGSDVLWVSCSSCNGCPVTSGLQIPLTFFDPGSSTTAALV 142
Query: 91 SCSSSQCAV----VTSNCSE--GDCSYSFLYGRGAYAS--------------FSSGNLAT 130
SCS +C S CS C Y+F YG G+ S SSG L+
Sbjct: 143 SCSDQRCTAGIQSSDSLCSSRTNQCGYTFQYGDGSGTSGYYVADLMHLDTLLLSSGELSQ 202
Query: 131 ETLTFNSTSGLPVEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS--I 187
T++S+ V F C T SD GI G G S+ISQ+ +
Sbjct: 203 ICQTYDSS---------VSFMCSTLQTGDLTKSDRAVDGIFGFGQQEMSVISQLASQGIT 253
Query: 188 AGKFSYCLP--DQGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE---- 240
FS+CL D G + G IV +V TPL+ + HY L L++ISV Q L
Sbjct: 254 PRVFSHCLKGDDSGGGVLVLGEIVE-PNIVYTPLVPSQPHYNLYLQSISVAGQTLAIDPS 312
Query: 241 -FVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMI--KAQPVKGVGAEPGFSDVLCYN 297
F +SS VD+G L + S +++++ A+ G + CY
Sbjct: 313 VFGASSNQGTIVDSGTTLAYLAEGAYDPFVSAITSVVSLNARTYLSKGNQ-------CYL 365
Query: 298 ISSQPK--FPEVTIHFR-GADVKLSPSNLFRNISD----EIMCSAFRG--GNANIVYGRI 348
++S FP+V+++F GA + L+P + + + C F+ G + G +
Sbjct: 366 VTSSVNDVFPQVSLNFAGGASLILNPQDYLLQQNSVGGAAVWCVGFQKTPGQQITILGDL 425
Query: 349 MQINFLIGYDIEQAMVSFKPSRCT 372
+ + + YDI V + C+
Sbjct: 426 VLKDKIFVYDIANQRVGWTNYDCS 449
>gi|449455475|ref|XP_004145478.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449518962|ref|XP_004166504.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 449
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 117/426 (27%), Positives = 170/426 (39%), Gaps = 94/426 (22%)
Query: 31 VDDIYLMHLSIGTPPVDIFGSVDTGSDCTWT-------QCEPCPELDCFKQEPPL--FDP 81
+ D YLM LSIGTPP + +DTGSD TW C+ C E P L F P
Sbjct: 17 IRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNISGPRLAAFLP 76
Query: 82 KKSSTYNSISCSSSQCAVVTSN------CSEGDCSY-------------SFLYGRGAYAS 122
SST +C SS C + S+ C+ CS SF Y GA +
Sbjct: 77 THSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLVKGTCPRPCPSFAYTYGA-SG 135
Query: 123 FSSGNLATETLTFNSTSGLPV----EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSS 178
+G+L + L + ++P FGC P GI G G G S
Sbjct: 136 VVTGSLTRDVLFTHGNYNNNNNNNKQIPRFCFGCVGATYREP------IGIAGFGRGLLS 189
Query: 179 LISQMGTSIAGKFSYC-LPDQGSSKINFGG-IVAGAGVVS--------TPLI----IRDH 224
L Q+G S G FS+C LP + S+ NF ++ G +S TPL+ ++
Sbjct: 190 LPFQLGFSHKG-FSHCFLPFKFSNNPNFSSPLILGNLAISSKDENLQFTPLLKSPMYPNY 248
Query: 225 YYLSLEAISVGN-----------QRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMS 273
YY+ LE+I++GN + E + G + +D+G T LP +S L S +
Sbjct: 249 YYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLE 308
Query: 274 NMIKAQPVKGVGAEPGFSDVLCYNISSQ---------PKFPEVTIHF-RGADVKLSPSNL 323
+I K V GF LCY + + + P +T HF V L N
Sbjct: 309 LVIGYPRAKQVELNTGFD--LCYKVPCKNNNSSFVDDAQLPSITFHFLNNVSVVLPQGNN 366
Query: 324 FRNI-----SDEIMCSAFRGGNANI------------VYGRIMQINFLIGYDIEQAMVSF 366
F + S + C ++ + ++G Q N + YD+E+ + F
Sbjct: 367 FYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNIEVVYDLEKERLGF 426
Query: 367 KPSRCT 372
+P C
Sbjct: 427 QPMDCV 432
>gi|449482385|ref|XP_004156266.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 491
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 98/372 (26%), Positives = 164/372 (44%), Gaps = 45/372 (12%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +G P + +DTGSD W C P CP+ E LFD KSS+ +
Sbjct: 83 LYFTKVKLGNPAREFNVQIDTGSDILWVTCSPCDGCPDSSGLGIELNLFDTTKSSSARVL 142
Query: 91 SCSSSQCAVVTSNCSE-----GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG---LP 142
C+ CA V++ + CSYSF Y S +SG T+++ F+ G +
Sbjct: 143 PCTDPICAAVSTTTDQCLTQTDHCSYSFHYRD---RSGTSGFYVTDSMHFDILLGESTIA 199
Query: 143 VEMPNVIFGCGHKNLASPTSDSKQT-GIIGLGPGNSSLISQMGTS-IAGK-FSYCLP--D 197
++FGC T +K GI G G G S+ISQ+ + I K FS+CL +
Sbjct: 200 NSSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHCLKGGE 259
Query: 198 QGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRL----EFVSSSTGNIFVD 252
G + G I+ + +V +PLI + HY L L++I++ Q F S+ G +D
Sbjct: 260 NGGGILVLGEILEPS-IVYSPLIPSQPHYTLKLQSIALSGQLFPNPTMFPISNAGETIID 318
Query: 253 TGVLRTLLPLEYHSNLKSVMSNMI--KAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVT 308
+G L E + + SV+++ + A P G++ C+ +S FP +
Sbjct: 319 SGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQ-------CFRVSMSVADIFPVLR 371
Query: 309 IHFRG-ADVKLSPSNLFRNIS-------DEIMCSAFRGGNANI-VYGRIMQINFLIGYDI 359
+F G A + ++P + S + C F+ + + G ++ + +I YD+
Sbjct: 372 FNFEGIASMVVTPEEYLQFDSIVSCYKFASLWCIGFQKAEDGLNILGDLVLKDKIIVYDL 431
Query: 360 EQAMVSFKPSRC 371
Q + + C
Sbjct: 432 AQQRIGWANYDC 443
>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 492
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 101/376 (26%), Positives = 160/376 (42%), Gaps = 50/376 (13%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDP------KKS 84
+Y + +GTPP++ +DTGSD W C CP + FD
Sbjct: 78 LYFTKVKLGTPPMEFTVQIDTGSDILWVNCNSCNGCPRSSGLGIQLNFFDASSSSSSSLV 137
Query: 85 STYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG---L 141
S + I S+ Q CSY+F YG G S +SG +E++ F+ G +
Sbjct: 138 SCSDPICNSAFQTTATQCLTQSNQCSYTFQYGDG---SGTSGYYVSESMYFDMVMGQSMI 194
Query: 142 PVEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQM-GTSIAGK-FSYCLPDQ 198
+V+FGC T SD GI G GPG+ S+ISQ+ I K FS+CL +
Sbjct: 195 ANSSASVVFGCSTYQSGDLTKSDHAIDGIFGFGPGDLSVISQLSARGITPKVFSHCLKGE 254
Query: 199 GSSKINFGGI-----VAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTG 247
G N GGI V G+V +PL+ + HY L L++ISV Q L F +S
Sbjct: 255 G----NGGGILVLGEVLEPGIVYSPLVPSQPHYNLYLQSISVNGQTLPIDPSVFATSINR 310
Query: 248 NIFVDTGVLRTLLPLEYHSNLKSVMSNMI--KAQPVKGVGAEPGFSDVLCYNISSQ--PK 303
+D+G L E ++ S ++ + P G + CY +S+
Sbjct: 311 GTIIDSGTTLAYLVEEAYTPFVSAITAAVSQSVTPTISKGNQ-------CYLVSTSVGEI 363
Query: 304 FPEVTIHFRG-ADVKLSPSNLFRNI----SDEIMCSAFRGGNANI-VYGRIMQINFLIGY 357
FP V+++F G A + L P ++ + C F+ + + G ++ + + Y
Sbjct: 364 FPLVSLNFAGSASMVLKPEEYLMHLGFYDGAALWCIGFQKVQEGVTILGDLVMKDKIFVY 423
Query: 358 DIEQAMVSFKPSRCTN 373
D+ + + + C+
Sbjct: 424 DLARQRIGWASYDCSQ 439
>gi|449518783|ref|XP_004166415.1| PREDICTED: aspartic proteinase-like protein 2-like, partial
[Cucumis sativus]
Length = 420
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 89/310 (28%), Positives = 134/310 (43%), Gaps = 47/310 (15%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + IGTP D + VDTGSD W QC CP E +D ++S+T +
Sbjct: 86 LYYAKIGIGTPSKDYYVQVDTGSDIVWVNCIQCRECPRTSSLGMELTPYDLEESTTGKLV 145
Query: 91 SCSSSQCAVVT----SNCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSG-LPVE 144
SC C V S C+ C Y +YG G S ++G + + +N SG L
Sbjct: 146 SCDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDG---SSTAGYFVKDYVQYNRVSGDLETT 202
Query: 145 MPN--VIFGCGHKNLASPTSDSKQT--GIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQ 198
N + FGCG + S ++ GI+G G NSS+ISQ+ ++ + F++CL
Sbjct: 203 AANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCL--- 259
Query: 199 GSSKINFGGIVAGAGVVS-----TPLIIRD-HYYLSLEAISVGNQRLE-----FVSSSTG 247
N GGI A VV TPL+ HY +++ + VG+ L F +
Sbjct: 260 --DGTNGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVGHIILNISADVFEAGDRK 317
Query: 248 NIFVDTGVLRTLLP-LEYHSNLKSVMSNM--IKAQPVKGVGAEPGFSDVLCYNISSQ--P 302
+D+G LP L Y + ++S ++ Q + G + C+ S +
Sbjct: 318 GTIIDSGTTLAYLPELIYEPLVAKILSQQHNLEVQTIHG--------EYKCFQYSERVDD 369
Query: 303 KFPEVTIHFR 312
FP V HF
Sbjct: 370 GFPPVIFHFE 379
>gi|222613193|gb|EEE51325.1| hypothetical protein OsJ_32293 [Oryza sativa Japonica Group]
Length = 371
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 90/337 (26%), Positives = 155/337 (45%), Gaps = 48/337 (14%)
Query: 63 CEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQC-AVVTSNCSEGDCSYSFLYGRGAYA 121
C C + CFKQ+ P+F P SST+ C + C ++ T C+ C+Y + G G +
Sbjct: 55 CSQC--IHCFKQDLPVFVPNASSTFKPEPCGTDVCKSIPTPKCASDVCAYDGVTGLGGH- 111
Query: 122 SFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLIS 181
+ G +AT+T + + P P G A+ T + +G IGLG SL++
Sbjct: 112 --TVGIVATDTFAIGTAA--PARPP----ASGASWRATSTPWAGPSGFIGLGRTPWSLVA 163
Query: 182 QMGTSIAGKFSYCLP--DQGSSKINFGGIVA--GAGVVSTPLI-------IRDHYYLSLE 230
QM + +FSYCL D G + F G A G TP + + +Y + LE
Sbjct: 164 QMKLT---RFSYCLAPHDTGKNSRLFLGASAKLAGGGAWTPFVKTSPNDGMSQYYPIELE 220
Query: 231 AISVGNQRLEFVSSSTGNIFVDTGVLRTLLPLE--YHSNLKSVMSNMIKAQPVKGVGAEP 288
I G+ + + + V T V+R L ++ Y K+VM+++ A VGA
Sbjct: 221 EIKAGDATIT-MPRGRNTVLVQTAVVRVSLLVDSVYQEFKKAVMASVGAAPTATPVGAP- 278
Query: 289 GFSDVLCYNISSQPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMC-----------SAF 336
F +C+ + P++ F+ GA + + P+N ++ ++ +C +A
Sbjct: 279 -FE--VCFPKAGVSGAPDLVFTFQAGAALTVPPANYLFDVGNDTVCLSVMSIALLNITAL 335
Query: 337 RGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
G N + G Q N + +D+++ M+SF+P+ C++
Sbjct: 336 DGLN---ILGSFQQENVHLLFDLDKDMLSFEPADCSS 369
>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 665
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 108/367 (29%), Positives = 166/367 (45%), Gaps = 51/367 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y L IGTPP + VDTGS T+ C C + C K + P F P+ SS+Y ++ C+
Sbjct: 80 YTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQ--CGKHQDPKFQPELSSSYKALKCNP 137
Query: 95 SQCAVVTSNC-SEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
NC EG C Y Y A S SSG L+ + ++F + S L + +FGC
Sbjct: 138 D------CNCDDEGKLCVYERRY---AEMSSSSGVLSEDLISFGNESQLTPQ--RAVFGC 186
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQGSSKINFGGIVA 210
+ S + GI+GLG G S++ Q+ I FS C G ++ G +V
Sbjct: 187 ENVETGDLFS-QRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCY---GGMEVGGGAMVL 242
Query: 211 G-----AGVV---STPLIIRDHYY-LSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRT 258
G AG+V S P R YY + L+ + V + L+ V + +D+G
Sbjct: 243 GKISPPAGMVFSHSDPF--RSPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGTTYA 300
Query: 259 LLPLEYHSNLKSVMSNMIKAQP-VKGV-GAEPGFSDVLCYNISSQPK------FPEVTIH 310
P E +K +IK P +K + G +P + DV C++ + + FPE+ +
Sbjct: 301 YFPKEAFIAIKDA---IIKEIPSLKRIHGPDPNYDDV-CFSGAGRDVAEIHNFFPEIDME 356
Query: 311 F-RGADVKLSPSN-LFRN--ISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSF 366
F G + LSP N LFR+ + F ++ + G I+ N L+ YD E + F
Sbjct: 357 FGNGQKLILSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRENDKLGF 416
Query: 367 KPSRCTN 373
+ C++
Sbjct: 417 LKTNCSD 423
>gi|449442281|ref|XP_004138910.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449506266|ref|XP_004162699.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 482
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 103/376 (27%), Positives = 164/376 (43%), Gaps = 50/376 (13%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCE---PCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +GTP D + VDTGSD W C CP+ E L+ P SST N +
Sbjct: 73 LYFAKIGLGTPVQDYYVQVDTGSDILWVNCAGCTNCPKKSDLGIELSLYSPSSSSTSNRV 132
Query: 91 SCSSSQCAVVTS----NCS-EGDCSYSFLYGRGAYAS--FSSGNLATETLTFN----STS 139
+C+ C C+ E C Y YG G+ + F ++ + +T N ST+
Sbjct: 133 TCNQDFCTSTYDGPIPGCTPELLCEYRVAYGDGSSTAGYFVRDHVVLDRVTGNFQTTSTN 192
Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQ-TGIIGLGPGNSSLISQMGTS--IAGKFSYCLP 196
G +++FGCG + + S GI+G G NSS+ISQ+ +S + F++CL
Sbjct: 193 G------SIVFGCGAQQSGQLGATSAALDGILGFGQANSSMISQLASSGKVKRVFAHCLD 246
Query: 197 DQGSSKINFGGIVAGAGVV-----STPLIIRD-HYYLSLEAISVGNQRLE-----FVSSS 245
+ IN GGI A VV +TPL+ + HY + ++AI V N+ L F +
Sbjct: 247 N-----INGGGIFAIGEVVQPKVRTTPLVPQQAHYNVFMKAIEVDNEVLNLPTDVFDTDL 301
Query: 246 TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFP 305
+D+G P + L S + + +K E F+ Y+ + FP
Sbjct: 302 RKGTIIDSGTTLAYFPDVIYEPLISKI--FARQSTLKLHTVEEQFT-CFEYDGNVDDGFP 358
Query: 306 EVTIHFRGA-DVKLSPSNLFRNISDEIMCSAF-------RGGNANIVYGRIMQINFLIGY 357
VT HF + + + P +I C + R G I+ G ++ N L+ Y
Sbjct: 359 TVTFHFEDSLSLTVYPHEYLFDIDSNKWCVGWQNSGAQSRDGKDMILLGDLVLQNRLVMY 418
Query: 358 DIEQAMVSFKPSRCTN 373
D+E + + C++
Sbjct: 419 DLENQTIGWTEYNCSS 434
>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 99/360 (27%), Positives = 158/360 (43%), Gaps = 36/360 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y L IGTPP + VD+GS T+ C C + C + P F P SSTY+ + C+
Sbjct: 88 YTTRLHIGTPPQEFALIVDSGSTVTYVPCASCEQ--CGNHQDPRFQPDLSSTYSPVKCNV 145
Query: 95 SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
T + + C+Y Y A S SSG L + ++F + S L + +FGC +
Sbjct: 146 D----CTCDSDKNQCTYERQY---AEMSSSSGVLGEDIVSFGTESELKPQ--RAVFGCEN 196
Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLP--DQGSSKINFGGIVA 210
S GI+GLG G S++ Q+ I FS C D G + G + A
Sbjct: 197 SETGDLFSQHAD-GIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLGAMPA 255
Query: 211 GAGVVSTPL-IIRDHYY-LSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTLLPLEYH 265
G++ T +R YY + L+ + V + L + +D+G LP +
Sbjct: 256 PPGMIYTHSNAVRSPYYNIELKEMHVAGKALRVDPRIFDGKHGTVLDSGTTYAYLPEQAF 315
Query: 266 SNLKSVMSNMIKAQPVKGV-GAEPGFSDVLCY-----NISSQPK-FPEVTIHF-RGADVK 317
K +S+ + P+K + G + + D+ C+ N+S + FP+V + F G +
Sbjct: 316 VAFKDAVSSQV--HPLKKIRGPDSNYKDI-CFAGAGRNVSQLSEVFPKVDMVFGNGQKLS 372
Query: 318 LSPSN-LFRN--ISDEIMCSAFRGG-NANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
LSP N LFR+ + F+ G + + G I+ N L+ YD + F + C+
Sbjct: 373 LSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNCSE 432
>gi|242094478|ref|XP_002437729.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
gi|241915952|gb|EER89096.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
Length = 486
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 92/352 (26%), Positives = 157/352 (44%), Gaps = 48/352 (13%)
Query: 43 TPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV-- 100
+PPV + +DT D W +C PC C +DP +SSTY++ C+SS C +
Sbjct: 160 SPPVTVV--LDTAGDVPWMRCVPCTFAQCAD-----YDPTRSSTYSAFPCNSSACKQLGR 212
Query: 101 -TSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLA 158
+ C + G C Y + ++ + SG +++ LT NS G VE FGC
Sbjct: 213 YANGCDANGQCQYMVVTAGDSFTT--SGTYSSDVLTINS--GDRVE--GFRFGCSQNEQG 266
Query: 159 SPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFG-GIVAGAG--VV 215
S +++ GI+ LG G SL++Q ++ FSYCLP ++K F G+ GA V
Sbjct: 267 S--FENQADGIMALGRGVQSLMAQTSSTYGDAFSYCLPPTETTKGFFQIGVPIGASYRFV 324
Query: 216 STPLIIRDH---------YYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLLPLEYH 265
+TP++ Y L AI+V + L + +D+ + T LP+ +
Sbjct: 325 TTPMLKERGGASAAAATLYRALLLAITVDGKELNVPAEVFAAGTVMDSRTIITRLPVTAY 384
Query: 266 SNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFRG-ADVKLSPSN 322
L++ N ++ + A P CY+++ P+ P + + F G A V++ S
Sbjct: 385 GALRAAFRNRMRYRV-----APPQEELDTCYDLTGVRYPRLPRIALVFDGNAVVEMDRSG 439
Query: 323 LFRNISDEIMCSAFRGGNAN---IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ N C AF + + + G + Q + +D+ + F+ + C
Sbjct: 440 ILLN-----GCLAFASNDDDSSPSILGNVQQQTIQVLHDVGGGRIGFRSAAC 486
>gi|340810987|gb|AEK75420.1| S5 [Oryza rufipogon]
gi|340810989|gb|AEK75421.1| S5 [Oryza rufipogon]
gi|340810991|gb|AEK75422.1| S5 [Oryza rufipogon]
gi|340811001|gb|AEK75427.1| S5 [Oryza rufipogon]
gi|340811019|gb|AEK75436.1| S5 [Oryza rufipogon]
gi|340811104|gb|AEK75478.1| S5 [Oryza rufipogon]
gi|340811124|gb|AEK75488.1| S5 [Oryza rufipogon]
Length = 472
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 107/380 (28%), Positives = 161/380 (42%), Gaps = 58/380 (15%)
Query: 32 DDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP---PLFDPKKSSTYN 88
D ++LM +S+G PPV ++DTGS +W QC+PC + C Q P+FDP +S T
Sbjct: 111 DFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCA-VHCHTQSAKAGPIFDPGRSYTSR 169
Query: 89 SISCSSSQCA-------VVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTS 139
+ CSS +C + +NC E + C+YS YG G ++S G + T+TL +
Sbjct: 170 RVRCSSVKCGELRYDLRLQQANCMEKENSCTYSVTYGNG--WAYSVGKMVTDTLRIGDS- 226
Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMG----TSIAGKFSYCL 195
+++FGC S + GI G G + S Q+ FSYCL
Sbjct: 227 -----FMDLMFGCSMDVKYS----EFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCL 277
Query: 196 P-DQGSSKINFGGIVAGAGVVS--TPL---IIRDHYYLSLEAISVGNQRLEFVSSSTGNI 249
P D+ G A + TPL I R Y L++E + QRL +S+ +
Sbjct: 278 PTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRL---VTSSSEM 334
Query: 250 FVDTGVLRT-LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY------------ 296
VD+G RT L P + K++ M + A +CY
Sbjct: 335 IVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQ--ESYICYLSEHDYSGWNGT 392
Query: 297 --NISSQPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNA--NIVYGRIMQI 351
S+ P + I F GA + L P N+F N +C F A + + G +
Sbjct: 393 ITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPALRSQILGNRVTR 452
Query: 352 NFLIGYDIEQAMVSFKPSRC 371
+F +DI+ FK + C
Sbjct: 453 SFGTTFDIQGKQFGFKYAAC 472
>gi|255550723|ref|XP_002516410.1| pepsin A, putative [Ricinus communis]
gi|223544445|gb|EEF45965.1| pepsin A, putative [Ricinus communis]
Length = 416
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 116/412 (28%), Positives = 168/412 (40%), Gaps = 78/412 (18%)
Query: 27 EIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCE----PCPELDCFKQEPPLFDPK 82
++ V D YL+ L+IGTPP I +DTGSD TW C C + D ++ +
Sbjct: 4 QLREVRDGYLISLNIGTPPQVIQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNSKLMSAFS 63
Query: 83 KSSTYNSI--SCSSSQCAVVTSN------CSEGDCSY-------------SFLYGRGAYA 121
S + +S SC+S C + S+ C+ CS SF Y GA
Sbjct: 64 PSHSSSSYRDSCASPYCTDIHSSDNSFDPCTVAGCSLSTLIKATCARPCPSFAYTYGA-G 122
Query: 122 SFSSGNLATETLTFNS-TSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLI 180
+G L +TL + + + ++P FGC P GI G G S
Sbjct: 123 GVVTGTLTRDTLRVHEGPARVTKDIPKFCFGCVGSTYHEP------IGIAGFVRGTLSFP 176
Query: 181 SQMGTSIAGKFSYC-LPDQGSSKINFGG-IVAGAGVVS-------TPLI----IRDHYYL 227
SQ+G G FS+C L + ++ N +V G +S TP++ ++YY+
Sbjct: 177 SQLGLLKKG-FSHCFLAFKYANNPNISSPLVIGDTALSSKDNMQFTPMLKSPMYPNYYYI 235
Query: 228 SLEAISVGN--------QRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQ 279
LEAI+VGN EF S G + +D+G T LP ++S L S+ +I
Sbjct: 236 GLEAITVGNVSATTVPLNLREFDSQGNGGMLIDSGTTYTHLPEPFYSQLLSIFKAIITYP 295
Query: 280 PVKGVGAEPGFSDVLCYNI--------SSQPKFPEVTIHF-RGADVKLSPSNLFRNISDE 330
V GF LCY + FP +T HF L N F +S
Sbjct: 296 RATEVEMRAGFD--LCYKVPCPNNRLTDDDNLFPSITFHFLNNVSFVLPQGNHFYAMSAP 353
Query: 331 -----IMCSAFRG------GNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ C F+ G A V+G Q N I YD+E+ + F+P C
Sbjct: 354 SNSTVVKCLLFQSMADSDYGPAG-VFGSFQQQNVQIVYDLEKERIGFQPMDC 404
>gi|196212952|gb|ACG76112.1| S5 [Oryza sativa Indica Group]
gi|338809989|gb|AEJ08560.1| S5 [Oryza barthii]
gi|340810883|gb|AEK75368.1| S5 [Oryza sativa]
gi|340810885|gb|AEK75369.1| S5 [Oryza sativa]
gi|340810889|gb|AEK75371.1| S5 [Oryza sativa]
gi|340810895|gb|AEK75374.1| S5 [Oryza sativa]
gi|340810897|gb|AEK75375.1| S5 [Oryza sativa]
gi|340810905|gb|AEK75379.1| S5 [Oryza sativa]
gi|340810909|gb|AEK75381.1| S5 [Oryza sativa]
gi|340810911|gb|AEK75382.1| S5 [Oryza sativa]
gi|340810913|gb|AEK75383.1| S5 [Oryza sativa]
gi|340810923|gb|AEK75388.1| S5 [Oryza sativa]
gi|340810925|gb|AEK75389.1| S5 [Oryza sativa]
gi|340810929|gb|AEK75391.1| S5 [Oryza sativa]
gi|340810935|gb|AEK75394.1| S5 [Oryza sativa]
gi|340810937|gb|AEK75395.1| S5 [Oryza sativa]
gi|340810939|gb|AEK75396.1| S5 [Oryza sativa]
gi|340810941|gb|AEK75397.1| S5 [Oryza sativa]
gi|340810943|gb|AEK75398.1| S5 [Oryza sativa]
gi|340810951|gb|AEK75402.1| S5 [Oryza sativa]
gi|340810953|gb|AEK75403.1| S5 [Oryza sativa]
gi|340810963|gb|AEK75408.1| S5 [Oryza sativa]
gi|340810965|gb|AEK75409.1| S5 [Oryza sativa]
gi|340810973|gb|AEK75413.1| S5 [Oryza nivara]
gi|340811003|gb|AEK75428.1| S5 [Oryza rufipogon]
gi|340811005|gb|AEK75429.1| S5 [Oryza rufipogon]
gi|340811009|gb|AEK75431.1| S5 [Oryza rufipogon]
gi|340811023|gb|AEK75438.1| S5 [Oryza rufipogon]
gi|340811025|gb|AEK75439.1| S5 [Oryza nivara]
gi|340811031|gb|AEK75442.1| S5 [Oryza rufipogon]
gi|340811033|gb|AEK75443.1| S5 [Oryza rufipogon]
gi|340811035|gb|AEK75444.1| S5 [Oryza nivara]
gi|340811039|gb|AEK75446.1| S5 [Oryza rufipogon]
gi|340811049|gb|AEK75451.1| S5 [Oryza nivara]
gi|340811053|gb|AEK75453.1| S5 [Oryza rufipogon]
gi|340811055|gb|AEK75454.1| S5 [Oryza nivara]
gi|340811057|gb|AEK75455.1| S5 [Oryza rufipogon]
gi|340811059|gb|AEK75456.1| S5 [Oryza rufipogon]
gi|340811061|gb|AEK75457.1| S5 [Oryza rufipogon]
gi|340811065|gb|AEK75459.1| S5 [Oryza nivara]
gi|340811067|gb|AEK75460.1| S5 [Oryza nivara]
gi|340811069|gb|AEK75461.1| S5 [Oryza nivara]
gi|340811071|gb|AEK75462.1| S5 [Oryza rufipogon]
gi|340811081|gb|AEK75467.1| S5 [Oryza nivara]
gi|340811083|gb|AEK75468.1| S5 [Oryza nivara]
gi|340811087|gb|AEK75470.1| S5 [Oryza nivara]
gi|340811092|gb|AEK75472.1| S5 [Oryza nivara]
gi|340811102|gb|AEK75477.1| S5 [Oryza rufipogon]
gi|340811106|gb|AEK75479.1| S5 [Oryza rufipogon]
gi|340811108|gb|AEK75480.1| S5 [Oryza rufipogon]
gi|340811110|gb|AEK75481.1| S5 [Oryza rufipogon]
gi|340811112|gb|AEK75482.1| S5 [Oryza rufipogon]
gi|340811118|gb|AEK75485.1| S5 [Oryza nivara]
gi|340811120|gb|AEK75486.1| S5 [Oryza rufipogon]
Length = 472
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 107/380 (28%), Positives = 161/380 (42%), Gaps = 58/380 (15%)
Query: 32 DDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP---PLFDPKKSSTYN 88
D ++LM +S+G PPV ++DTGS +W QC+PC + C Q P+FDP +S T
Sbjct: 111 DFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCA-VHCHTQSAKAGPIFDPGRSYTSR 169
Query: 89 SISCSSSQCA-------VVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTS 139
+ CSS +C + +NC E + C+YS YG G ++S G + T+TL +
Sbjct: 170 RVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNG--WAYSVGKMVTDTLRIGDS- 226
Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMG----TSIAGKFSYCL 195
+++FGC S + GI G G + S Q+ FSYCL
Sbjct: 227 -----FMDLMFGCSMDVKYS----EFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCL 277
Query: 196 P-DQGSSKINFGGIVAGAGVVS--TPL---IIRDHYYLSLEAISVGNQRLEFVSSSTGNI 249
P D+ G A + TPL I R Y L++E + QRL +S+ +
Sbjct: 278 PTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRL---VTSSSEM 334
Query: 250 FVDTGVLRT-LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY------------ 296
VD+G RT L P + K++ M + A +CY
Sbjct: 335 IVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQ--ESYICYLSEHDYSGWNGT 392
Query: 297 --NISSQPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNA--NIVYGRIMQI 351
S+ P + I F GA + L P N+F N +C F A + + G +
Sbjct: 393 ITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPALRSQILGNRVTR 452
Query: 352 NFLIGYDIEQAMVSFKPSRC 371
+F +DI+ FK + C
Sbjct: 453 SFGTTFDIQGKQFGFKYAAC 472
>gi|125543639|gb|EAY89778.1| hypothetical protein OsI_11320 [Oryza sativa Indica Group]
Length = 488
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 96/327 (29%), Positives = 145/327 (44%), Gaps = 48/327 (14%)
Query: 77 PLFDPKKSSTYNSISCSSSQC-AVVTSNCSEGD------CSYSFLYGRGAYASFSSGNLA 129
P FD SST SC S+ C ++ ++C C Y++ Y S ++G L
Sbjct: 175 PYFDRSTSSTLLLTSCDSTLCQGLLVASCGNTKFWPNQTCVYTYYYND---KSVTTGLLE 231
Query: 130 TETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG 189
+ TF G +P V FGCG N + S +TGI G G G SL SQ+ G
Sbjct: 232 VDKFTF----GAGASVPGVAFGCGLFN--NGVFKSNETGIAGFGRGPLSLPSQLKV---G 282
Query: 190 KFSYCLP-----DQGSSKINFGGIVAGAG---VVSTPLIIRDH----YYLSLEAISVGNQ 237
FS+C Q + ++ + G V STPLI YYLSL+ I+VG+
Sbjct: 283 NFSHCFTAVNGLKQSTVLLDLLADLYKNGRGAVQSTPLIQNSANPTLYYLSLKGITVGST 342
Query: 238 RLEF------VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFS 291
RL +++ TG +D+G T LP + + ++ + IK V G P
Sbjct: 343 RLPVPESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGP--- 399
Query: 292 DVLCYNISSQ--PKFPEVTIHFRGADVKLSPSNLFRNISDE----IMCSAFRG-GNANIV 344
C++ SQ P P++ +HF GA + L N + D+ ++C A G+
Sbjct: 400 -YTCFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSMICLAINELGDERAT 458
Query: 345 YGRIMQINFLIGYDIEQAMVSFKPSRC 371
G Q N + YD++ M+SF ++C
Sbjct: 459 IGNFQQQNMHVLYDLQNNMLSFVAAQC 485
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 36/134 (26%), Positives = 59/134 (44%), Gaps = 16/134 (11%)
Query: 232 ISVGNQRLEF------VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVG 285
I+VG+ RL +++ TG +D+G T LP + + ++ + IK V G
Sbjct: 42 ITVGSTRLPVPESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNA 101
Query: 286 AEPGFSDVLCYNISSQ--PKFPEVTIHFRGADVKLSPSNLFRNISDE----IMCSAFRGG 339
P C++ SQ P P++ +HF GA + L N + D+ I+C A G
Sbjct: 102 TGP----YTCFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKG 157
Query: 340 NANIVYGRIMQINF 353
+ + G Q N
Sbjct: 158 DETTIIGNFQQQNM 171
>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 96/383 (25%), Positives = 163/383 (42%), Gaps = 67/383 (17%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPP--LFDPKKSSTYNSIS 91
I L+ L IGTPP +DTGS +W QC ++ PP +FDP SS+++ +
Sbjct: 81 ILLVSLPIGTPPQTQQMILDTGSQLSWIQCHK----KVPRKPPPSSVFDPSLSSSFSVLP 136
Query: 92 CSSSQCA------VVTSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
C+ C + ++C + C YS+ Y G A GNL E +TF+ +
Sbjct: 137 CNHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAE---GNLVREKITFSRSQ----S 189
Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ------ 198
P +I GC A +SD+K GI+G+ G S SQ + KFSYC+P +
Sbjct: 190 TPPLILGC-----AEESSDAK--GILGMNLGRLSFASQAKLT---KFSYCVPTRQVRPGF 239
Query: 199 ------------GSSKINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSSS- 245
S + ++ + P + Y ++++ I +GNQ+L S+
Sbjct: 240 TPTGSFYLGENPNSGGFRYINLLTFSQSQRMPNLDPLAYTVAMQGIRIGNQKLNIPISAF 299
Query: 246 ------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYN-- 297
G +D+G T L E ++ ++ + ++ A+ KG G SD +C+N
Sbjct: 300 RPDPSGAGQTMIDSGSEFTYLVDEAYNKVREEVVRLVGARLKKGY-VYGGVSD-MCFNGN 357
Query: 298 -ISSQPKFPEVTIHF-RGADVKLSPSNLFRNISDEIMC-----SAFRGGNANIVYGRIMQ 350
I + F +G ++ + + ++ + C S G +NI+ G Q
Sbjct: 358 AIEIGRLIGNMVFEFDKGVEIVVEKERVLADVGGGVHCVGIGRSEMLGAASNII-GNFHQ 416
Query: 351 INFLIGYDIEQAMVSFKPSRCTN 373
N + +D+ V F + C+
Sbjct: 417 QNIWVEFDLANRRVGFGKADCSR 439
>gi|224091907|ref|XP_002309394.1| predicted protein [Populus trichocarpa]
gi|222855370|gb|EEE92917.1| predicted protein [Populus trichocarpa]
Length = 469
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 110/422 (26%), Positives = 162/422 (38%), Gaps = 85/422 (20%)
Query: 14 ETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELD 70
++PK+ S++ Y + L+ GTPP +DTGS W C C D
Sbjct: 71 KSPKTKFSLLKTPLFPRSYGGYSISLNFGTPPQTTKFVMDTGSSLVWFPCTSRYLCSRCD 130
Query: 71 CFKQEP---PLFDPKKSSTYNSISCSSSQCAVV---------------TSNCSEGDCSYS 112
E P F PK+SS+ N I C + +C+ + T NC++ Y
Sbjct: 131 FPNIEVTGIPTFIPKQSSSSNLIGCKNHKCSWLFGPKVQSKCQECDPTTQNCTQSCPPYV 190
Query: 113 FLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGL 172
YG G+ A G L +ETL F +P + GC ++ P GI G
Sbjct: 191 IQYGLGSTA----GLLLSETLDFPHKK----TIPGFLVGCSLFSIRQP------EGIAGF 236
Query: 173 GPGNSSLISQMGTSIAGKFSYCL--------PDQGSSKINFGG---IVAGAGVVSTPL-- 219
G SL SQ+G KFSYCL P ++ G G+ TP
Sbjct: 237 GRSPESLPSQLGLK---KFSYCLVSHAFDDTPASSDLVLDTGSGSDDTKTPGLSYTPFQK 293
Query: 220 ----IIRDHYYLSLEAISVGNQRLE-----FVSSSTGN--IFVDTGVLRTLLPLEYHSNL 268
RD+YY+ L I +G+ ++ V S GN VD+G T + + +
Sbjct: 294 NPTAAFRDYYYVLLRNIVIGDTHVKVPYKFLVPGSDGNGGTIVDSGTTFTFMEKPVYELV 353
Query: 269 -----KSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFR-GADVKLSP 320
K V + + G P C+NIS + PE HF+ GA + L
Sbjct: 354 AKEFEKQVAHYTVATEVQNQTGLRP------CFNISGEKSVSVPEFIFHFKGGAKMALPL 407
Query: 321 SNLFRNISDEIMCSAFR---------GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+N F + ++C GG I+ G Q NF + +D++ FK C
Sbjct: 408 ANYFSFVDSGVICLTIVSDNMSGSGIGGGPAIILGNYQQRNFHVEFDLKNERFGFKQQNC 467
Query: 372 TN 373
+
Sbjct: 468 VS 469
>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 486
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 103/372 (27%), Positives = 161/372 (43%), Gaps = 43/372 (11%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +GTPP + +DTGSD W C CP+ E FD SST I
Sbjct: 77 LYYTKVKMGTPPKEFNVQIDTGSDILWVNCNTCSNCPQSSQLGIELNFFDTVGSSTAALI 136
Query: 91 SCSSSQCAV----VTSNCSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
CS C + CS CSY+F YG G S +SG ++ + F+ G P
Sbjct: 137 PCSDPICTSRVQGAAAECSPRVNQCSYTFQYGDG---SGTSGYYVSDAMYFSLIMGQPPA 193
Query: 145 M---PNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS-IAGK-FSYCLPDQ 198
+ ++FGC T +D GI G GPG S++SQ+ + I K FS+CL
Sbjct: 194 VNSSATIVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSRGITPKVFSHCLKGD 253
Query: 199 GSSKINFGGI-VAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLEF------VSSSTGNIF 250
G + +V +PL+ + HY L+L++I+V Q L +S++ G
Sbjct: 254 GDGGGVLVLGEILEPSIVYSPLVPSQPHYNLNLQSIAVNGQLLPINPAVFSISNNRGGTI 313
Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMI--KAQPVKGVGAEPGFSDVLCYNISSQ--PKFPE 306
VD G L E + L + ++ + A+ G + CY +S+ FP
Sbjct: 314 VDCGTTLAYLIQEAYDPLVTAINTAVSQSARQTNSKGNQ-------CYLVSTSIGDIFPS 366
Query: 307 VTIHFR-GADVKLSPSN-LFRN---ISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIE 360
V+++F GA + L P L N E+ C F+ + G ++ + ++ YDI
Sbjct: 367 VSLNFEGGASMVLKPEQYLMHNGYLDGAEMWCIGFQKFQEGASILGDLVLKDKIVVYDIA 426
Query: 361 QAMVSFKPSRCT 372
Q + + C+
Sbjct: 427 QQRIGWANYDCS 438
>gi|224072901|ref|XP_002303933.1| predicted protein [Populus trichocarpa]
gi|222841365|gb|EEE78912.1| predicted protein [Populus trichocarpa]
Length = 370
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 98/395 (24%), Positives = 166/395 (42%), Gaps = 49/395 (12%)
Query: 1 AQNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTW 60
A++ +L F + KS + I +I Y++ +GTPP + ++D D W
Sbjct: 2 AKDQARLQFLSSLVAKKSVVPIASGRGVIQSPS-YIVKAKVGTPPQTLLMALDNSYDAAW 60
Query: 61 TQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSN-CSEGDCSYSFLYGRGA 119
C+ C + C +F+ KS+T+ ++ C + QC V + C C+++ YG
Sbjct: 61 IPCKGC--VGCSST---VFNTVKSTTFKTLGCGAPQCKQVPNPICGGSTCTWNTTYGSST 115
Query: 120 YASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSL 179
S NL +T+ + +P FGC K A+ +S Q G++G G G S
Sbjct: 116 ILS----NLTRDTIALSMD-----PVPYYAFGCIQK--ATGSSVPPQ-GLLGFGRGPLSF 163
Query: 180 ISQMGTSIAGKFSYCLPD----QGSSKINFGGIVAGAGVVSTPLIIRDH----YYLSLEA 231
+SQ FSYCLP S + G + + +TPL+ YY+ L
Sbjct: 164 LSQTQNLYKSTFSYCLPSFRTLNFSGSLRLGPVGQPPRIKTTPLLKNPRRSSLYYVKLNG 223
Query: 232 ISVGNQRLEFVSSS--------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKG 283
I VG + ++ S+ G IF D+G + T L + +++ + V
Sbjct: 224 IRVGRKIVDIPRSALAFNPTTGAGTIF-DSGTVFTRLVAPAYIAVRNEFRKRVGNATVSS 282
Query: 284 VGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNLFRNISDEIM-CSAFRGGNAN 342
+G GF CY++ P P +T F G +V + P NL + + + C A N
Sbjct: 283 LG---GFDT--CYSVPIVP--PTITFMFSGMNVTMPPENLLIHSTAGVTSCLAMAAAPDN 335
Query: 343 I-----VYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
+ V + Q N I +D+ + + +C+
Sbjct: 336 VNSVLNVIASMQQQNHRILFDVPNSRLGVAREQCS 370
>gi|222618833|gb|EEE54965.1| hypothetical protein OsJ_02555 [Oryza sativa Japonica Group]
Length = 393
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 97/343 (28%), Positives = 143/343 (41%), Gaps = 87/343 (25%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELD-CFKQEPPLFDPKKSSTYNSISCS 93
Y++ + +G+P V +DTGSD +W QCEPCP C LFDP SSTY + +CS
Sbjct: 106 YVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFNCS 165
Query: 94 SSQCAVV-----TSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
++ CA + + C ++ C Y YG G ++T+G +
Sbjct: 166 AAACAQLGDSGEANGCDAKSRCQYIVKYGDG-----------------SNTTGTGFQ--- 205
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGG 207
FGC H L + D K G+IGLG SL+SQ S K+
Sbjct: 206 --FGCSHAELGAGM-DDKTDGLIGLGGDAQSLVSQTAAR-------------SKKVP--- 246
Query: 208 IVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEYH 265
+Y+ +LE I+VG ++L S + G++ VD+G + T LP +
Sbjct: 247 ---------------TYYFAALEDIAVGGKKLGLSPSVFAAGSL-VDSGTVITRLPPAAY 290
Query: 266 SNLKSV----MSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVTIHFR-GADVKL 318
+ L S M+ +A+P+ G D C+N + K P V + F GA V L
Sbjct: 291 AALSSAFRAGMTRYARAEPL-------GILDT-CFNFTGLDKVSIPTVALVFAGGAVVDL 342
Query: 319 SPSNLFRNISDEIMCSAF---RGGNANIVYGRIMQINFLIGYD 358
+ C AF R A G + Q F + YD
Sbjct: 343 DAHGIVSG-----GCLAFAPTRDDKAFGTIGNVQQRTFEVLYD 380
>gi|357520119|ref|XP_003630348.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355524370|gb|AET04824.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 435
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 98/369 (26%), Positives = 164/369 (44%), Gaps = 48/369 (13%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCE-PCPELDCFKQEPPLFDPKKSSTYNSISC 92
Y + L+IG PP F VDTGS+ TW QC+ PC + C + PL+ P + I C
Sbjct: 73 FYNVTLNIGQPPRPYFLDVDTGSELTWLQCDAPCSQ--CSETPHPLYKPSN----DFIPC 126
Query: 93 SSSQCAVVTS----NCSE-GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
CA + C + C Y Y Y++ G L + N T+G+ +++
Sbjct: 127 KDPLCASLQPTDDYTCEDPNQCDYEIKYAD-QYSTL--GVLLNDVYLLNFTNGVQLKV-R 182
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQGSSKINF 205
+ GCG+ + SP++ GI+GLG G +SLISQ+ + + +CL +G I F
Sbjct: 183 MALGCGYDQIFSPSTYHPLDGILGLGRGKASLISQLNSQGLVRNVMGHCLSSRGGGYIFF 242
Query: 206 GGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSSSTG----NIFVDTGVLRTLLP 261
G + + + TP+ D S + S G L F TG NI DTG T
Sbjct: 243 GNVYDSSRMSWTPISSID----SGKHYSAGPAELVFGGRKTGVGSLNIIFDTGSSYTYFN 298
Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYN-------ISSQPK-FPEVTIHF-R 312
+ + + S+++ + +P+K + +C++ I+ K F +T+ F
Sbjct: 299 SQAYQAMISLLNKELHRKPIKAAPDDQTLP--MCWHGKRPFRSINEVKKYFKPLTLSFTN 356
Query: 313 GADVKLS---PSNLFRNISD--EIMCSAFRG-----GNANIVYGRIMQINFLIGYDIEQA 362
G VK P + IS+ + G G N++ G I ++ ++ +D E+
Sbjct: 357 GGRVKPQFEIPPEAYLIISNMGNVCLGILNGPEVGLGELNLI-GDISMLDKVMVFDNEKQ 415
Query: 363 MVSFKPSRC 371
++ + P+ C
Sbjct: 416 LIGWGPADC 424
>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
Length = 586
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 102/366 (27%), Positives = 164/366 (44%), Gaps = 49/366 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y L IGTPP + VDTGS T+ C C + C K + P F P+ S++Y ++ C+
Sbjct: 76 YTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQ--CGKHQDPKFQPELSTSYQALKCNP 133
Query: 95 SQCAVVTSNC-SEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
NC EG C Y Y A S SSG L+ + ++F + S L + +FGC
Sbjct: 134 D------CNCDDEGKLCVYERRY---AEMSSSSGVLSEDLISFGNESQLSPQ--RAVFGC 182
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQGSSKINFGGIVA 210
++ S + GI+GLG G S++ Q+ I FS C G ++ G +V
Sbjct: 183 ENEETGDLFS-QRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCY---GGMEVGGGAMVL 238
Query: 211 GAGVVSTPLIIRDH--------YYLSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTL 259
G + P ++ H Y + L+ + V + L+ V + +D+G
Sbjct: 239 GK-ISPPPGMVFSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGTTYAY 297
Query: 260 LPLEYHSNLKSVMSNMIKAQP-VKGV-GAEPGFSDVLCYNISSQPK------FPEVTIHF 311
P E +K +IK P +K + G +P + DV C++ + + FPE+ + F
Sbjct: 298 FPKEAFIAIKDA---VIKEIPSLKRIHGPDPNYDDV-CFSGAGRDVAEIHNFFPEIAMEF 353
Query: 312 -RGADVKLSPSN-LFRN--ISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFK 367
G + LSP N LFR+ + F ++ + G I+ N L+ YD E + F
Sbjct: 354 GNGQKLILSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRENDKLGFL 413
Query: 368 PSRCTN 373
+ C++
Sbjct: 414 KTNCSD 419
>gi|222628951|gb|EEE61083.1| hypothetical protein OsJ_14969 [Oryza sativa Japonica Group]
Length = 367
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 56/174 (32%), Positives = 85/174 (48%), Gaps = 15/174 (8%)
Query: 14 ETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFK 73
E + +++ + I+ YL+ L IGTPP ++DT SD WTQC+PC C+
Sbjct: 68 EAASARKAVVAETPIMPAGGEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQPCT--GCYH 125
Query: 74 QEPPLFDPKKSSTYNSISCSSSQC-AVVTSNCSEGD---CSYSFLYGRGAYASFSSGNLA 129
Q P+F+P+ SSTY ++ CSS C + C D C Y++ Y A + G LA
Sbjct: 126 QVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHDDDESCQYTYTYSGNAT---TEGTLA 182
Query: 130 TETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQM 183
+ L + V FGC + + +G++GLG G SL+SQ+
Sbjct: 183 VDKLVIGEDA-----FRGVAFGCSTSSTGG-APPPQASGVVGLGRGPLSLVSQL 230
>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
Length = 564
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 105/368 (28%), Positives = 166/368 (45%), Gaps = 46/368 (12%)
Query: 31 VDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
++ Y L IGTPP VDTGS T+ C C + C + + P F P SSTY S+
Sbjct: 9 INGYYTTRLWIGTPPQRFALIVDTGSSVTYVPCSSCEQ--CGRHQDPKFQPDLSSTYQSV 66
Query: 91 SCSSSQCAVVTSNCSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
C+ + NC + C Y Y A S SSG L + ++F + S L +
Sbjct: 67 KCN------IDCNCDDEKQQCVYERQY---AEMSTSSGVLGEDIISFGNLSALAPQ--RA 115
Query: 149 IFGCGHKNLASPTSDSKQT-GIIGLGPGNSSLISQMGTS--IAGKFSYCL--PDQGSSKI 203
+FGC +N+ + S+ GI+G+G G+ S++ + I FS C G +
Sbjct: 116 VFGC--ENMETGDLYSQHADGIMGMGRGDLSIVDHLVDKGVINDSFSLCYGGMGIGGGAM 173
Query: 204 NFGGIVAGAGVV---STPLIIRDHYY-LSLEAISVGNQRLEF---VSSSTGNIFVDTGVL 256
GGI + +V S P +R YY + L+ I V + L V +D+G
Sbjct: 174 VLGGISPPSNMVFSQSDP--VRSPYYNIDLKEIHVAGKPLPLNPTVFDGKHGTILDSGTT 231
Query: 257 RTLLPLEYHSNLK-SVMSNMIKAQPVKGVGAEPGFSDVLCY-----NISS-QPKFPEVTI 309
LP + K ++M + +P++ G +P ++D+ C+ +IS FP V +
Sbjct: 232 YAYLPEAAFVSFKDAIMKELHSLKPIR--GPDPNYNDI-CFSGAGSDISQLSSSFPAVEM 288
Query: 310 HF-RGADVKLSPSN-LFRN--ISDEIMCSAFRGG-NANIVYGRIMQINFLIGYDIEQAMV 364
F G + LSP N LFR+ + F+ G + + G I+ N L+ YD E + +
Sbjct: 289 VFGNGQKLLLSPENYLFRHSKVHGAYCLGIFQNGKDPTTLLGGIVVRNTLVLYDRENSKI 348
Query: 365 SFKPSRCT 372
F + C+
Sbjct: 349 GFWKTNCS 356
>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 631
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 102/366 (27%), Positives = 164/366 (44%), Gaps = 49/366 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y L IGTPP + VDTGS T+ C C + C K + P F P+ S++Y ++ C+
Sbjct: 76 YTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQ--CGKHQDPKFQPELSTSYQALKCNP 133
Query: 95 SQCAVVTSNC-SEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
NC EG C Y Y A S SSG L+ + ++F + S L + +FGC
Sbjct: 134 D------CNCDDEGKLCVYERRY---AEMSSSSGVLSEDLISFGNESQLSPQ--RAVFGC 182
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQGSSKINFGGIVA 210
++ S + GI+GLG G S++ Q+ I FS C G ++ G +V
Sbjct: 183 ENEETGDLFS-QRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCY---GGMEVGGGAMVL 238
Query: 211 GAGVVSTPLIIRDH--------YYLSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTL 259
G + P ++ H Y + L+ + V + L+ V + +D+G
Sbjct: 239 GK-ISPPPGMVFSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGTTYAY 297
Query: 260 LPLEYHSNLKSVMSNMIKAQP-VKGV-GAEPGFSDVLCYNISSQPK------FPEVTIHF 311
P E +K +IK P +K + G +P + DV C++ + + FPE+ + F
Sbjct: 298 FPKEAFIAIKDA---VIKEIPSLKRIHGPDPNYDDV-CFSGAGRDVAEIHNFFPEIAMEF 353
Query: 312 -RGADVKLSPSN-LFRN--ISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFK 367
G + LSP N LFR+ + F ++ + G I+ N L+ YD E + F
Sbjct: 354 GNGQKLILSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRENDKLGFL 413
Query: 368 PSRCTN 373
+ C++
Sbjct: 414 KTNCSD 419
>gi|356539352|ref|XP_003538162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 105/376 (27%), Positives = 174/376 (46%), Gaps = 50/376 (13%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +GTPP +++ +DTGSD W C CP+ + + FDP SST + I
Sbjct: 76 LYYTKVKLGTPPRELYVQIDTGSDVLWVSCGSCNGCPQTSGLQIQLNYFDPGSSSTSSLI 135
Query: 91 SCSSSQC--AVVTSNCS----EGDCSYSFLYGRGAYAS-------FSSGNLATETLTFNS 137
SC +C V TS+ S C+Y+F YG G+ S ++ TLT NS
Sbjct: 136 SCLDRRCRSGVQTSDASCSGRNNQCTYTFQYGDGSGTSGYYVSDLMHFASIFEGTLTTNS 195
Query: 138 TSGLPVEMPNVIFGCGHKNLASPTSDSKQT-GIIGLGPGNSSLISQMGTS-IAGK-FSYC 194
++ +V+FGC T + GI G G S+ISQ+ + IA + FS+C
Sbjct: 196 SA-------SVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIAPRVFSHC 248
Query: 195 LP--DQGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSST 246
L + G + G IV +V +PL+ + HY L+L++ISV Q + F +S+
Sbjct: 249 LKGDNSGGGVLVLGEIVE-PNIVYSPLVPSQPHYNLNLQSISVNGQIVRIAPSVFATSNN 307
Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--- 303
VD+G L E ++ ++ +I Q V+ V + CY I++
Sbjct: 308 RGTIVDSGTTLAYLAEEAYNPFVIAIAAVIP-QSVRSVLSRGN----QCYLITTSSNVDI 362
Query: 304 FPEVTIHFR-GADVKLSPSNLF--RNISDE--IMCSAFR--GGNANIVYGRIMQINFLIG 356
FP+V+++F GA + L P + +N E + C F+ G + + G ++ + +
Sbjct: 363 FPQVSLNFAGGASLVLRPQDYLMQQNFIGEGSVWCIGFQKISGQSITILGDLVLKDKIFV 422
Query: 357 YDIEQAMVSFKPSRCT 372
YD+ + + C+
Sbjct: 423 YDLAGQRIGWANYDCS 438
>gi|356558304|ref|XP_003547447.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like [Glycine max]
Length = 336
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 100/358 (27%), Positives = 154/358 (43%), Gaps = 54/358 (15%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y LSIG PP+ +DT SD W C LFDP KSST++ +
Sbjct: 9 YWSILSIGQPPIPQLVIMDTSSDILWIMC---------NHVGLLFDPSKSSTFSPL--CK 57
Query: 95 SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
+ C C + S++ S +SG ++T+ F +T ++ +V+ CGH
Sbjct: 58 TPCGFKGCKCDPIPFNISYV-----DKSSTSGTFGSDTVVFETTDEGHSQIFDVLVRCGH 112
Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGV 214
N+ T D GI GL G +SL +++G KFSYC+ + N+ ++ G
Sbjct: 113 -NIGFNT-DPGYNGIRGLNNGPNSLATKIGQ----KFSYCVGNLADPYYNYNQLILCEGA 166
Query: 215 ----VSTPLIIRD-HYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTLLPL 262
STP + YY++L+ I VG +RL E ++TG + D+G T L
Sbjct: 167 DLEGYSTPFEVHHGFYYVTLKGIIVGEKRLDIAPITFEIKGNNTGGVIRDSGTTITYLVD 226
Query: 263 EYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFR-GADVKLS 319
H L + + N++ F + Y I S+ FP VT HF GAD+ L
Sbjct: 227 SVHKLLYNEVRNLLSWS----------FRQLCHYGIISRDLVGFPVVTFHFADGADLALD 276
Query: 320 PSNLFRNISDEIMCSAFRGG---NANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ F + + I+C N I V + Q ++ +GYD+ V F+ C
Sbjct: 277 TGSFFNQL-NSILCMTVSPASILNTTISPSVIELLAQQSYNVGYDLLTNFVYFQRIDC 333
>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 468
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 95/375 (25%), Positives = 155/375 (41%), Gaps = 46/375 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQC---EPCPELDCFKQEPPLFDPKKSSTYNSIS 91
Y + L +GTP DTGSD TW +C +F P S +++ +
Sbjct: 104 YFVRLRVGTPAQPFVLVADTGSDLTWVKCSSPSSSSSSPAASPPQRVFRPAGSKSWSPLP 163
Query: 92 CSSSQCAVVT----SNCSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGL-PVE 144
C S C +NCS CSY + Y + A G L + T++ + G +
Sbjct: 164 CDSDTCKSYVPFSLANCSSPPDPCSYDYRYKDNSSARGVVG-LDSATVSLSGNDGTRKAK 222
Query: 145 MPNVIFGC--GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----PD 197
+ V+ GC + + +SD G++ LG N S S+ + G+FSYCL P
Sbjct: 223 LQEVVLGCTTSYDGQSFKSSD----GVLSLGNSNISFASRAASRFGGRFSYCLVDHLAPR 278
Query: 198 QGSSKINFGGIVAGAGVV----STPLII------RDHYYLSLEAISVGNQRLEFVS---- 243
+S + FG + G TPL++ R Y++S++A++V +RLE +
Sbjct: 279 NATSFLTFGNGDSSPGDDSSSRRTPLVLLEDARTRPFYFVSVDAVTVAGERLEILPDVWD 338
Query: 244 -SSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS-Q 301
G +D+G T+L + + +S P V +P F CYN +
Sbjct: 339 FRKNGGAILDSGTSLTILATPAYDAVVKAISKQFAGVP--RVNMDP-FE--YCYNWTGVS 393
Query: 302 PKFPEVTIHFRGADVKLSPSNLFR-NISDEIMC-SAFRGGNANI-VYGRIMQINFLIGYD 358
+ P + + F GA P + + + + C G + V G I+Q L +D
Sbjct: 394 AEIPRMELRFAGAATLAPPGKSYVIDTAPGVKCIGVVEGAWPGVSVIGNILQQEHLWEFD 453
Query: 359 IEQAMVSFKPSRCTN 373
+ + FK SRC +
Sbjct: 454 LANRWLRFKQSRCAH 468
>gi|357510893|ref|XP_003625735.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355500750|gb|AES81953.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 535
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 80/252 (31%), Positives = 120/252 (47%), Gaps = 25/252 (9%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +G+P + + +DTGSD W C CP+ + FD SST +
Sbjct: 70 LYFTKVKMGSPAKEFYVQIDTGSDILWLNCNTCNNCPKSSGLGIDLNYFDTASSSTAALV 129
Query: 91 SCSSSQCA----VVTSNCSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV- 143
SCS C+ TS CS CSY+F YG G S +SG + + F+ G V
Sbjct: 130 SCSDPVCSYAVQTATSQCSSQANQCSYTFQYGDG---SGTSGYYVYDAMYFDVIMGQSVF 186
Query: 144 --EMPNVIFGCG-HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS-IAGK-FSYCLPDQ 198
V+FGC +++ ++ GI G GPG S++SQ+ + +A K FS+CL Q
Sbjct: 187 SNSSSTVVFGCSTYQSGDLARTEKAVDGIFGFGPGALSVVSQVSSQGMAPKVFSHCLKGQ 246
Query: 199 GS-SKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNIFV 251
GS I G + +V TPL+ ++ HY L+L++I+V Q L F + + V
Sbjct: 247 GSGGGILVLGEILEPNIVYTPLVPLQPHYNLNLQSIAVNGQILPIDQDVFATGNNRGTIV 306
Query: 252 DTGVLRTLLPLE 263
D+G L E
Sbjct: 307 DSGTTLAYLVQE 318
>gi|449461377|ref|XP_004148418.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449518059|ref|XP_004166061.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 436
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 100/371 (26%), Positives = 167/371 (45%), Gaps = 49/371 (13%)
Query: 27 EIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSST 86
++++V + Y++ + +GTP ++ +DT +D W C C + C F + SST
Sbjct: 88 QVLNVGN-YVVRVQLGTPGQTMYMVLDTSNDAAWAPCSGC--IGCSSTT--TFSAQNSST 142
Query: 87 YNSISCSSSQC----AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
+ ++ CS +C + DC ++ YG ++FS+ TL +S P
Sbjct: 143 FATLDCSKPECTQARGLSCPTTGNVDCLFNQTYG--GDSTFSA------TLVQDSLHLGP 194
Query: 143 VEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG--- 199
+PN FGC + AS +S Q G++GLG G SLISQ G+ +G FSYCLP
Sbjct: 195 NVIPNFSFGC--ISSASGSSIPPQ-GLMGLGRGPLSLISQSGSLYSGLFSYCLPSFKSYY 251
Query: 200 -SSKINFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGN-------QRLEFVSSSTG 247
S + G + + +TPL+ H YY++L ISVG + L F ++
Sbjct: 252 FSGSLKLGPVGQPKAIRTTPLLHNPHRPSLYYVNLTGISVGRVLVPISPELLAFDPNTGA 311
Query: 248 NIFVDTG-VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPE 306
+D+G V+ +P Y + + + + Q V G + G D C+ +++ P
Sbjct: 312 GTIIDSGTVITRFVPAIY-----TAVRDEFRKQ-VGGSFSPLGAFDT-CFATNNEVSAPA 364
Query: 307 VTIHFRGADVKLSPSN-LFRNISDEIMCSAFRGG-----NANIVYGRIMQINFLIGYDIE 360
+T+H G D+KL N L + + + C A + V + Q N I +DI
Sbjct: 365 ITLHLSGLDLKLPMENSLIHSSAGSLACLAMAAAPNNVNSVVNVIANLQQQNHRILFDIN 424
Query: 361 QAMVSFKPSRC 371
+ + C
Sbjct: 425 NSKLGIARELC 435
>gi|356503843|ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 474
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 108/399 (27%), Positives = 154/399 (38%), Gaps = 82/399 (20%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP--------CPELDCFKQEPPLFDPKKSST 86
Y + L++GTPP +DTGS W C P +D K P F PK SST
Sbjct: 92 YSIDLNLGTPPQTSPFVLDTGSSLVWFPCTSRYLCSHCNFPNIDTTKI--PTFIPKNSST 149
Query: 87 YNSISCSSSQCAVV---------------TSNCSEGDCSYSFLYGRGAYASFSSGNLATE 131
+ C + +C + + NCS +Y YG G+ A F L +
Sbjct: 150 AKLLGCRNPKCGYIFGSDVQFRCPQCKPESQNCSLTCPAYIIQYGLGSTAGF----LLLD 205
Query: 132 TLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKF 191
L F + +P + GC ++ P +GI G G G SL SQM +F
Sbjct: 206 NLNFPGKT-----VPQFLVGCSILSIRQP------SGIAGFGRGQESLPSQMNLK---RF 251
Query: 192 SYCL-------PDQGSS---KINFGGIVAGAGVVSTPL---------IIRDHYYLSLEAI 232
SYCL Q S +I+ G G+ TP +++YYL+L +
Sbjct: 252 SYCLVSHRFDDTPQSSDLVLQISSTGDTKTNGLSYTPFRSNPSTNNPAFKEYYYLTLRKV 311
Query: 233 SVGNQR-------LEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVG 285
VG + LE S G VD+G T + ++ + ++ +
Sbjct: 312 IVGGKDVKIPYTFLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFVKQLEKNYSRAED 371
Query: 286 AEPGFSDVLCYNIS--SQPKFPEVTIHFRGADVKLSP-SNLFRNISD-EIMC-SAFRGGN 340
AE C+NIS FPE+T F+G P N F + D E++C + G
Sbjct: 372 AETQSGLSPCFNISGVKTVTFPELTFKFKGGAKMTQPLQNYFSLVGDAEVVCLTVVSDGG 431
Query: 341 AN--------IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
A I+ G Q NF I YD+E F P C
Sbjct: 432 AGPPKTTGPAIILGNYQQQNFYIEYDLENERFGFGPRSC 470
>gi|159464048|ref|XP_001690254.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158284242|gb|EDP09992.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 485
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 94/355 (26%), Positives = 150/355 (42%), Gaps = 30/355 (8%)
Query: 39 LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS--SQ 96
L +GTP +DTGS T+ C+ C C K FDP KS+T ++C
Sbjct: 17 LKLGTPERTFSVIIDTGSTITYIPCKDCSH--CGKHTAEWFDPDKSTTAKKLACGDPLCN 74
Query: 97 CAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKN 156
C + C+ C YS Y A S S G + +T F + PV + +FGC +
Sbjct: 75 CGTPSCTCNNDRCYYSRTY---AERSSSEGWMIEDTFGFPDSDS-PVRL---VFGCENGE 127
Query: 157 LASPTSDSKQTGIIGLGPGNSSLISQM--GTSIAGKFSYCL--PDQGSSKINFGGIVAGA 212
GI+G+G +++ SQ+ I FS C P G + + GA
Sbjct: 128 TGE-IYRQMADGIMGMGNNHNAFQSQLVQRKVIEDVFSLCFGYPKDGILLLGDVTLPEGA 186
Query: 213 GVVSTPLI--IRDHYY-LSLEAISVGNQRLEFVSSSTGNIF---VDTGVLRTLLPLEYHS 266
V TPL+ + HYY + ++ I+V Q L F +S + +D+G T LP +
Sbjct: 187 NTVYTPLLTHLHLHYYNVKMDGITVNGQTLAFDASVFDRGYGTVLDSGTTFTYLPTDAFK 246
Query: 267 NLKSVMSNMIKAQPVKGV-GAEPGFSDVLCYNISSQPK-----FPEVTIHF-RGADVKLS 319
+ + + ++ + ++ GA+P ++D+ Q K FP F GA + L
Sbjct: 247 AMAKAVGDYVEKKGLQSTPGADPQYNDICWKGAPDQFKDLDKYFPPAEFVFGGGAKLTLP 306
Query: 320 PSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
P LF + E F GN+ + G + + ++ YD + V F C +
Sbjct: 307 PLRYLFLSKPAEYCLGIFDNGNSGALVGGVSVRDVVVTYDRRNSKVGFTTMACAD 361
>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 394
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 94/311 (30%), Positives = 143/311 (45%), Gaps = 37/311 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + IGTPP VDTGS T+ C C + C + + P F+P+ SSTY +SC+
Sbjct: 90 YTTRIWIGTPPQTFALIVDTGSTVTYVPCSTCEQ--CGRHQDPKFEPELSSTYQPVSCNI 147
Query: 95 SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
T + C Y Y A S SSG L + ++F + S L + IFGC +
Sbjct: 148 D----CTCDNERKQCVYERQY---AEMSSSSGVLGEDIISFGNQSELVPQ--RAIFGCEN 198
Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCL--PDQGSSKINFGGIVA 210
+ S + GI+GLG G+ S++ Q+ I+ FS C D G + GGI
Sbjct: 199 QETGDLYS-QRADGIMGLGRGDLSIVDQLVEKGVISDSFSLCYGGMDIGGGAMILGGISP 257
Query: 211 GAGVV---STPLIIRDHYY-LSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTLLPLE 263
+G+V S P +R YY + L+AI V ++L + +D+G LP
Sbjct: 258 PSGMVFAESDP--VRSQYYNIDLKAIHVAGKQLHLDPSIFDGKHGTVLDSGTTYAYLPEA 315
Query: 264 YHSNLKSVMSNMIKAQPVKGV-GAEPGFSDVLCYNISSQP------KFPEVTIHF-RGAD 315
+ K M M + +K + G +P ++D+ C++ + FP V + F G
Sbjct: 316 AFTAFKDAM--MKELTSLKQIHGPDPNYNDI-CFSGAESDVSQLSNTFPAVEMVFSNGQK 372
Query: 316 VKLSPSN-LFR 325
+ LSP N LF+
Sbjct: 373 LSLSPENYLFQ 383
>gi|356570895|ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 470
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 111/399 (27%), Positives = 157/399 (39%), Gaps = 82/399 (20%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEP---PLFDPKKSSTYN 88
Y + L++GTPP +DTGS W C C + +P P F PK SST
Sbjct: 88 YSIDLNLGTPPQTSPFVLDTGSSLVWFPCTSHYLCSHCNFPNIDPTKIPTFIPKNSSTAK 147
Query: 89 SISCSSSQCAVV----------------TSNCSEGDCSYSFLYGRGAYASFSSGNLATET 132
+ C + +C + + NCS SY YG GA A F L +
Sbjct: 148 LLGCRNPKCGYLFGPDVESRCPQCKKPGSQNCSLTCPSYIIQYGLGATAGF----LLLDN 203
Query: 133 LTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFS 192
L F + +P + GC ++ P +GI G G G SL SQM +FS
Sbjct: 204 LNFPGKT-----VPQFLVGCSILSIRQP------SGIAGFGRGQESLPSQMNLK---RFS 249
Query: 193 YCL-------PDQGSS---KINFGGIVAGAGVVSTPL--------IIRDHYYLSLEAISV 234
YCL Q S +I+ G G+ TP + R++YY++L + V
Sbjct: 250 YCLVSHRFDDTPQSSDLVLQISSTGDTKTNGLSYTPFRSNPSNNSVFREYYYVTLRKLIV 309
Query: 235 GN-------QRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMI--KAQPVKGVG 285
G + LE S G VD+G T + ++ + + K + V
Sbjct: 310 GGVDVKIPYKFLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFLRQLGKKYSREENVE 369
Query: 286 AEPGFSDVLCYNISSQP--KFPEVTIHFRGADVKLSP-SNLFRNISD-EIMC-SAFRGGN 340
A+ G S C+NIS FPE T F+G P N F + D E++C + G
Sbjct: 370 AQSGLSP--CFNISGVKTISFPEFTFQFKGGAKMSQPLLNYFSFVGDAEVLCFTVVSDGG 427
Query: 341 AN--------IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
A I+ G Q NF + YD+E F P C
Sbjct: 428 AGQPKTAGPAIILGNYQQQNFYVEYDLENERFGFGPRNC 466
>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 640
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 105/382 (27%), Positives = 167/382 (43%), Gaps = 40/382 (10%)
Query: 14 ETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFK 73
E+ + P + + + + ++ Y L IGTPP VDTGS T+ C C C +
Sbjct: 68 ESKRHPNARMRLYDDLLINGYYTTRLWIGTPPQRFALIVDTGSTVTYVPCSTCEH--CGR 125
Query: 74 QEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETL 133
+ P F P S TY + C+ NC +GD + + A S SSG L + +
Sbjct: 126 HQDPKFQPDLSETYQPVKCTPD------CNC-DGDTNQCMYDRQYAEMSSSSGVLGEDVV 178
Query: 134 TFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQM--GTSIAGKF 191
+F + S L + +FGC + S + GI+GLG G+ S++ Q+ I+ F
Sbjct: 179 SFGNLSELAPQ--RAVFGCENDETGDLYS-QRADGIMGLGRGDLSIMDQLVDKKVISDSF 235
Query: 192 SYCL--PDQGSSKINFGGIVAGAGVV---STPLIIRDHYY-LSLEAISVGNQRLEF---V 242
S C D G + GGI +V S P R YY ++L+ + V ++L+ V
Sbjct: 236 SLCYGGMDVGGGAMILGGISPPEDMVFTHSDP--DRSPYYNINLKEMHVAGKKLQLNPKV 293
Query: 243 SSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGV-GAEPGFSDVLCY----- 296
+D+G LP K + M + +K + G +P + D+ C+
Sbjct: 294 FDGKHGTVLDSGTTYAYLPETAFLAFKRAI--MKERNSLKQINGPDPNYKDI-CFTGAGI 350
Query: 297 NISSQPK-FPEVTIHFR-GADVKLSPSN-LFRN--ISDEIMCSAFRGG-NANIVYGRIMQ 350
++S K FP V + F G + LSP N LFR+ + F G + + G I
Sbjct: 351 DVSQLAKSFPVVDMVFENGHKLSLSPENYLFRHSKVRGAYCLGVFSNGRDPTTLLGGIFV 410
Query: 351 INFLIGYDIEQAMVSFKPSRCT 372
N L+ YD E + + F + C+
Sbjct: 411 RNTLVMYDRENSKIGFWKTNCS 432
>gi|218194598|gb|EEC77025.1| hypothetical protein OsI_15381 [Oryza sativa Indica Group]
Length = 422
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 94/360 (26%), Positives = 153/360 (42%), Gaps = 39/360 (10%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + IGTP V + +DTGS W C+ CP ++ +DP+ S + +
Sbjct: 58 LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEV 117
Query: 91 SCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP---N 147
C + C C Y Y G + G L T+ L ++ G P +
Sbjct: 118 KCDDTICTSRPPCNMTLRCPYITGYADGG---LTMGILFTDLLHYHQLYGNGQTQPTSTS 174
Query: 148 VIFGCGHKNLASPTSDSKQT-GIIGLGPGNSSLISQMGTSIAGK----FSYCLPDQGSSK 202
V FGCG + S + + GIIG G N + +SQ+ + AGK FS+CL
Sbjct: 175 VTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQL--AAAGKTKKIFSHCLDSTNGGG 232
Query: 203 INFGGIVAGAGVVSTPLIIRDHYY--LSLEAISVGNQRLE-----FVSSSTGNIFVDTGV 255
I G V V +TP++ + Y ++L++I+V L+ F ++ T F+D+G
Sbjct: 233 IFAIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDSGS 292
Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI--SSQPKFPEVTIHFRG 313
LP +S L + + P +GA F C++ S KFP++T HF
Sbjct: 293 TLVYLPEIIYSEL---ILAVFAKHPDITMGAMYNFQ---CFHFLGSVDDKFPKITFHFEN 346
Query: 314 ADVKLS--PSNLFRNISDEIMCSAFR-----GGNANIVYGRIMQINFLIGYDIEQAMVSF 366
D+ L P + C F+ G I+ G ++ N ++ YD+E+ + +
Sbjct: 347 -DLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVVVYDMEKQAIGW 405
>gi|215766660|dbj|BAG98888.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 433
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 94/360 (26%), Positives = 153/360 (42%), Gaps = 39/360 (10%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + IGTP V + +DTGS W C+ CP ++ +DP+ S + +
Sbjct: 82 LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEV 141
Query: 91 SCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP---N 147
C + C C Y Y G + G L T+ L ++ G P +
Sbjct: 142 KCDDTICTSRPPCNMTLRCPYITGYADGG---LTMGILFTDLLHYHQLYGNGQTQPTSTS 198
Query: 148 VIFGCGHKNLASPTSDSKQT-GIIGLGPGNSSLISQMGTSIAGK----FSYCLPDQGSSK 202
V FGCG + S + + GIIG G N + +SQ+ + AGK FS+CL
Sbjct: 199 VTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQL--AAAGKTKKIFSHCLDSTNGGG 256
Query: 203 INFGGIVAGAGVVSTPLIIRDHYY--LSLEAISVGNQRLE-----FVSSSTGNIFVDTGV 255
I G V V +TP++ + Y ++L++I+V L+ F ++ T F+D+G
Sbjct: 257 IFAIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDSGS 316
Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI--SSQPKFPEVTIHFRG 313
LP +S L + + P +GA F C++ S KFP++T HF
Sbjct: 317 TLVYLPEIIYSEL---ILAVFAKHPDITMGAMYNFQ---CFHFLGSVDDKFPKITFHFEN 370
Query: 314 ADVKLS--PSNLFRNISDEIMCSAFR-----GGNANIVYGRIMQINFLIGYDIEQAMVSF 366
D+ L P + C F+ G I+ G ++ N ++ YD+E+ + +
Sbjct: 371 -DLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVVVYDMEKQAIGW 429
>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
Length = 444
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 96/382 (25%), Positives = 159/382 (41%), Gaps = 64/382 (16%)
Query: 39 LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPL----FDPKKSSTYNSISCSS 94
L++GTPP ++ +DTGS+ +W C + F P+ S+T+ ++ C S
Sbjct: 67 LAVGTPPQNVTMVLDTGSELSWLLCATGRQGSAAAGAAAAMGESFRPRASATFAAVPCGS 126
Query: 95 SQC------AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
+QC A + + + C S Y G S S G LAT+ L
Sbjct: 127 TQCSSRDLPAPPSCDGASRQCHVSLSYADG---SASDGALATDVFAVGEAPPL-----RS 178
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--------GS 200
FGC S G++G+ G S ++Q T +FSYC+ D+ G
Sbjct: 179 AFGCMSTAYDSSPDGVATAGLLGMNRGTLSFVTQASTR---RFSYCISDRDDAGVLLLGH 235
Query: 201 SKINFGGI---VAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSS-------STGNIF 250
S + F + + P R Y + L I VG + L +S G
Sbjct: 236 SDLPFLPLNYTPLYQPTLPLPYFDRVAYSVQLLGIRVGGKALPIPASVLAPDHTGAGQTM 295
Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFS--DVL--CYNI-SSQP--- 302
VD+G T L + +S LK+ + + +P+ +P F+ + L C+ + + +P
Sbjct: 296 VDSGTQFTFLLGDAYSALKAEF--LKQTKPLLRALDDPSFAFQEALDTCFRVPAGRPPPS 353
Query: 303 -KFPEVTIHFRGADVKLSPSNLFRNI------SDEIMCSAFRGGNANI------VYGRIM 349
+ P VT+ F GA++ ++ L + +D + C F GNA++ V G
Sbjct: 354 ARLPPVTLLFNGAEMSVAGDRLLYKVPGEHRGADGVWCLTF--GNADMVPLTAYVIGHHH 411
Query: 350 QINFLIGYDIEQAMVSFKPSRC 371
Q+N + YD+E+ V P +C
Sbjct: 412 QMNLWVEYDLERGRVGLAPVKC 433
>gi|296085638|emb|CBI29432.3| unnamed protein product [Vitis vinifera]
Length = 337
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 60/173 (34%), Positives = 86/173 (49%), Gaps = 19/173 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + + G+P VDTGS +W QC+PC + C Q PLFDP S TY S+SC+S
Sbjct: 118 YYVKVGFGSPARYYSMIVDTGSSLSWLQCKPC-VVYCHVQADPLFDPSASKTYKSLSCTS 176
Query: 95 SQCAVVTS--------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
SQC+ + S C Y+ YG +S+S G L+ + LT + +P
Sbjct: 177 SQCSSLVDATLNNPLCETSSNVCVYTASYGD---SSYSMGYLSQDLLTLAPSQ----TLP 229
Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG 199
++GCG S + GI+GLG S++ Q+ + FSYCLP +G
Sbjct: 230 GFVYGCGQD---SDGLFGRAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTRG 279
>gi|190896584|gb|ACE96805.1| aspartyl protease [Populus tremula]
gi|190896586|gb|ACE96806.1| aspartyl protease [Populus tremula]
gi|190896588|gb|ACE96807.1| aspartyl protease [Populus tremula]
gi|190896590|gb|ACE96808.1| aspartyl protease [Populus tremula]
gi|190896592|gb|ACE96809.1| aspartyl protease [Populus tremula]
gi|190896594|gb|ACE96810.1| aspartyl protease [Populus tremula]
gi|190896596|gb|ACE96811.1| aspartyl protease [Populus tremula]
gi|190896598|gb|ACE96812.1| aspartyl protease [Populus tremula]
gi|190896600|gb|ACE96813.1| aspartyl protease [Populus tremula]
gi|190896602|gb|ACE96814.1| aspartyl protease [Populus tremula]
gi|190896604|gb|ACE96815.1| aspartyl protease [Populus tremula]
gi|190896606|gb|ACE96816.1| aspartyl protease [Populus tremula]
gi|190896610|gb|ACE96818.1| aspartyl protease [Populus tremula]
gi|190896612|gb|ACE96819.1| aspartyl protease [Populus tremula]
gi|190896614|gb|ACE96820.1| aspartyl protease [Populus tremula]
gi|190896616|gb|ACE96821.1| aspartyl protease [Populus tremula]
gi|190896618|gb|ACE96822.1| aspartyl protease [Populus tremula]
gi|190896620|gb|ACE96823.1| aspartyl protease [Populus tremula]
gi|190896622|gb|ACE96824.1| aspartyl protease [Populus tremula]
gi|190896624|gb|ACE96825.1| aspartyl protease [Populus tremula]
gi|190896626|gb|ACE96826.1| aspartyl protease [Populus tremula]
gi|190896628|gb|ACE96827.1| aspartyl protease [Populus tremula]
gi|190896630|gb|ACE96828.1| aspartyl protease [Populus tremula]
gi|190896632|gb|ACE96829.1| aspartyl protease [Populus tremula]
gi|190896634|gb|ACE96830.1| aspartyl protease [Populus tremula]
gi|190896636|gb|ACE96831.1| aspartyl protease [Populus tremula]
gi|190896638|gb|ACE96832.1| aspartyl protease [Populus tremula]
gi|190896640|gb|ACE96833.1| aspartyl protease [Populus tremula]
gi|190896642|gb|ACE96834.1| aspartyl protease [Populus tremula]
gi|190896644|gb|ACE96835.1| aspartyl protease [Populus tremula]
gi|190896646|gb|ACE96836.1| aspartyl protease [Populus tremula]
gi|190896648|gb|ACE96837.1| aspartyl protease [Populus tremula]
gi|190896650|gb|ACE96838.1| aspartyl protease [Populus tremula]
gi|190896652|gb|ACE96839.1| aspartyl protease [Populus tremula]
gi|190896654|gb|ACE96840.1| aspartyl protease [Populus tremula]
gi|190896656|gb|ACE96841.1| aspartyl protease [Populus tremula]
gi|190896658|gb|ACE96842.1| aspartyl protease [Populus tremula]
Length = 339
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 89/326 (27%), Positives = 145/326 (44%), Gaps = 36/326 (11%)
Query: 12 DNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDC 71
D +T PI+ Q I+ Y++ + +GTP +F +DT +D W C C C
Sbjct: 25 DQKTTAVPIAPGQQVLKIAN---YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGC--TGC 79
Query: 72 FKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATE 131
F P S+T S+ CS +QC+ V S + L+ + S+ +
Sbjct: 80 SSTT---FLPNASTTLGSLDCSEAQCSQVRGFSCPATGSSACLFNQ----SYGGDSSLAA 132
Query: 132 TLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKF 191
TL ++ + +P FGC N S S Q G++GLG G SLISQ G +G F
Sbjct: 133 TLVQDAITLANDVIPGFTFGC--INAVSGGSIPPQ-GLLGLGRGPISLISQAGAMYSGVF 189
Query: 192 SYCLPDQG----SSKINFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVG-------N 236
SYCLP S + G + + +TPL+ H YY++L +SVG +
Sbjct: 190 SYCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPS 249
Query: 237 QRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY 296
++L F ++ +D+G + T + ++ + P+ +GA F C+
Sbjct: 250 EQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNG-PISSLGA---FDT--CF 303
Query: 297 NISSQPKFPEVTIHFRGADVKLSPSN 322
+++ + P VT+HF G ++ L N
Sbjct: 304 AATNEAEAPAVTLHFEGLNLVLPMEN 329
>gi|168030587|ref|XP_001767804.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680886|gb|EDQ67318.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 105/371 (28%), Positives = 155/371 (41%), Gaps = 46/371 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQ---------EPPLFDPKKSS 85
Y + IGTPP + VDTGS T+ C C + P F P+ SS
Sbjct: 40 YTSRVFIGTPPNEFALIVDTGSTVTYVPCSSCTHCGHHQASFSTHRLFCRDPRFKPENSS 99
Query: 86 TYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
+Y I C SS C + + C Y +Y A S S G L + L F S L ++
Sbjct: 100 SYQKIGCRSSDCITGLCDSNSHQCKYERMY---AEMSTSKGVLGKDLLDFGPASRLQSQL 156
Query: 146 PNVIFGCGHK---NLASPTSDSKQTGIIGLGPGNSSLISQM--GTSIAGKFSYCLP--DQ 198
+ FGC +L +D GI+GLG G S++ Q+ +I FS C D+
Sbjct: 157 --LSFGCETAESGDLYLQVAD----GIMGLGRGPLSIVDQLVGNGAIEDSFSLCYGGMDE 210
Query: 199 GSSKINFGGIVAGAGVV---STPLIIRDHYY-LSLEAISVGNQRLEFVSSSTGNIF---V 251
G + G I A +G+V S P R +YY L L I V L+ S+ F +
Sbjct: 211 GGGSMVLGAIPAPSGMVFAKSDPR--RSNYYNLELTEIQVQGASLKLDSNVFNGKFGTIL 268
Query: 252 DTGVLRTLLP-LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK-----FP 305
D+G LP + + +V++ + Q V G +P + D+ + K FP
Sbjct: 269 DSGTTYAYLPDRAFEAFTDAVVAQLGSLQAVD--GPDPNYPDICYAGAGTDTKELGKHFP 326
Query: 306 EVTIHF-RGADVKLSPSN-LFRN--ISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQ 361
V F V L+P N LF++ + F+ +A + G I+ N L+ YD
Sbjct: 327 LVDFVFAENQKVSLAPENYLFKHTKVPGAYCLGFFKNQDATTLLGGIIVRNMLVTYDRYN 386
Query: 362 AMVSFKPSRCT 372
+ F + CT
Sbjct: 387 HQIGFLKTNCT 397
>gi|32479948|emb|CAE01594.1| OSJNBa0008A08.2 [Oryza sativa Japonica Group]
gi|38347627|emb|CAE05222.2| OSJNBa0011K22.4 [Oryza sativa Japonica Group]
gi|38567678|emb|CAE75961.1| B1159F04.24 [Oryza sativa Japonica Group]
gi|116309512|emb|CAH66578.1| OSIGBa0137O04.4 [Oryza sativa Indica Group]
Length = 431
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 94/360 (26%), Positives = 153/360 (42%), Gaps = 39/360 (10%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + IGTP V + +DTGS W C+ CP ++ +DP+ S + +
Sbjct: 58 LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEV 117
Query: 91 SCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP---N 147
C + C C Y Y G + G L T+ L ++ G P +
Sbjct: 118 KCDDTICTSRPPCNMTLRCPYITGYADGG---LTMGILFTDLLHYHQLYGNGQTQPTSTS 174
Query: 148 VIFGCGHKNLASPTSDSKQT-GIIGLGPGNSSLISQMGTSIAGK----FSYCLPDQGSSK 202
V FGCG + S + + GIIG G N + +SQ+ + AGK FS+CL
Sbjct: 175 VTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQL--AAAGKTKKIFSHCLDSTNGGG 232
Query: 203 INFGGIVAGAGVVSTPLIIRDHYY--LSLEAISVGNQRLE-----FVSSSTGNIFVDTGV 255
I G V V +TP++ + Y ++L++I+V L+ F ++ T F+D+G
Sbjct: 233 IFAIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDSGS 292
Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI--SSQPKFPEVTIHFRG 313
LP +S L + + P +GA F C++ S KFP++T HF
Sbjct: 293 TLVYLPEIIYSEL---ILAVFAKHPDITMGAMYNFQ---CFHFLGSVDDKFPKITFHFEN 346
Query: 314 ADVKLS--PSNLFRNISDEIMCSAFR-----GGNANIVYGRIMQINFLIGYDIEQAMVSF 366
D+ L P + C F+ G I+ G ++ N ++ YD+E+ + +
Sbjct: 347 -DLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVVVYDMEKQAIGW 405
>gi|190896608|gb|ACE96817.1| aspartyl protease [Populus tremula]
Length = 339
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 89/326 (27%), Positives = 145/326 (44%), Gaps = 36/326 (11%)
Query: 12 DNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDC 71
D +T PI+ Q I+ Y++ + +GTP +F +DT +D W C C C
Sbjct: 25 DQKTTAVPIAPGQQVLKIAN---YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGC--TGC 79
Query: 72 FKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATE 131
F P S+T S+ CS +QC+ V S + L+ + S+ +
Sbjct: 80 SSTT---FLPNASTTLGSLDCSEAQCSQVRGFSCPATGSSACLFNQ----SYGGDSSLAA 132
Query: 132 TLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKF 191
TL ++ + +P FGC N S S Q G++GLG G SLISQ G +G F
Sbjct: 133 TLVQDAITLANDVIPGFTFGC--INAVSGGSIPPQ-GLLGLGRGPISLISQAGAMYSGVF 189
Query: 192 SYCLPDQG----SSKINFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVG-------N 236
SYCLP S + G + + +TPL+ H YY++L +SVG +
Sbjct: 190 SYCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPS 249
Query: 237 QRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY 296
++L F ++ +D+G + T + ++ + P+ +GA F C+
Sbjct: 250 EQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNG-PISSLGA---FDT--CF 303
Query: 297 NISSQPKFPEVTIHFRGADVKLSPSN 322
+++ + P VT+HF G ++ L N
Sbjct: 304 AETNEAEAPAVTLHFEGLNLVLPMEN 329
>gi|108707839|gb|ABF95634.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 330
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 92/315 (29%), Positives = 139/315 (44%), Gaps = 47/315 (14%)
Query: 77 PLFDPKKSSTYNSISCSSSQC-AVVTSNCSEGD------CSYSFLYGRGAYASFSSGNLA 129
P FD SST SC S+ C ++ ++C C Y++ Y S ++G +
Sbjct: 23 PYFDRSTSSTLLLTSCDSTLCQGLLVASCGNTKFWPNQTCVYTYYYND---KSVTTGLIE 79
Query: 130 TETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG 189
+ TF G +P V FGCG N + S +TGI G G G SL SQ+ G
Sbjct: 80 VDKFTF----GAGASVPGVAFGCGLFN--NGVFKSNETGIAGFGRGPLSLPSQLKV---G 130
Query: 190 KFSYCLP-----DQGSSKINFGGIVAGAG---VVSTPLIIRD----HYYLSLEAISVGNQ 237
FS+C Q + ++ + G V STPLI YYLSL+ I+VG+
Sbjct: 131 NFSHCFTAVNGLKQSTVLLDLPADLYKNGRGAVQSTPLIQNSANPTFYYLSLKGITVGST 190
Query: 238 RLEF------VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFS 291
RL +++ TG +D+G T LP + + ++ + IK V G P
Sbjct: 191 RLPVPESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGP--- 247
Query: 292 DVLCYNISSQ--PKFPEVTIHFRGADVKLSPSNLFRNISDE----IMCSAFRGGNANIVY 345
C++ SQ P P++ +HF GA + L N + D+ I+C A G+ +
Sbjct: 248 -YTCFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKGDETTII 306
Query: 346 GRIMQINFLIGYDIE 360
G Q N + YD++
Sbjct: 307 GNFQQQNMHVLYDLQ 321
>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
Length = 477
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 98/392 (25%), Positives = 160/392 (40%), Gaps = 64/392 (16%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCE-------PCPELDCFKQEPPLFDPKKSSTY 87
Y + +GTP DTGSD TW +C D F P+ S T+
Sbjct: 97 YFVRFRVGTPAQPFLLVADTGSDLTWVKCRRPASANSSLSPADSGPGPGRAFRPEDSRTW 156
Query: 88 NSISCSSSQC------AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATE--TLTFNSTS 139
ISC+S C ++ T C+Y + Y G+ A G + TE T+ +
Sbjct: 157 APISCASDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAA---RGTVGTESATIALSGRE 213
Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL---- 195
++ ++ GC + P+ ++ G++ LG S S + G+FSYCL
Sbjct: 214 ERKAKLKGLVLGC-SSSYTGPSFEASD-GVLSLGYSGISFASHAASRFGGRFSYCLVDHL 271
Query: 196 -PDQGSSKINFG--------------GIVAGAGVVSTPLII----RDHYYLSLEAISVGN 236
P +S + FG A TPL++ R Y +SL+AISV
Sbjct: 272 SPRNATSYLTFGPNPAVSSPRASPSSCAAAAPRARQTPLLLDRRMRPFYDVSLKAISVAG 331
Query: 237 QRLEFVSS-----STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFS 291
+ L+ + + G + +D+G T+L + + + +S + P V +P F
Sbjct: 332 EFLKIPRAVWDVEAGGGVILDSGTSLTVLAKPAYRAVVAALSKGLAGLPR--VTMDP-FE 388
Query: 292 DVLCYNISSQP------KFPEVTIHFRGADVKLSP--SNLFRNISDEIMCSAFRGG--NA 341
CYN +S P++ +HF GA +L P + + + + C + G
Sbjct: 389 --YCYNWTSPSGKDADVAVPKMAVHFAGA-ARLEPPGKSYVIDAAPGVKCIGLQEGPWPG 445
Query: 342 NIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
V G I+Q L +DI+ + F+ SRCT+
Sbjct: 446 ISVIGNILQQEHLWEFDIKNRRLKFQRSRCTH 477
>gi|414866064|tpg|DAA44621.1| TPA: putative aspartic protease family protein [Zea mays]
Length = 454
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 112/402 (27%), Positives = 163/402 (40%), Gaps = 95/402 (23%)
Query: 39 LSIGTPPVDIFGSVDTGSDCTWTQCE-------PCPELDCFKQEPPLFDPKKSSTYNSIS 91
+++G PP ++ +DTGS+ +W +C P P Q P F+ SSTY +
Sbjct: 66 VAVGAPPQNVTMVLDTGSELSWLRCNGSRVPSTPPP------QAPAAFNGSASSTYAAAH 119
Query: 92 CSSSQCAV------VTSNCS---EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
CSS +C V C+ C S Y A AS + G LA +T P
Sbjct: 120 CSSPECQWRGRDLPVPPFCAGPPSNSCRVSLSY---ADASSADGILAADTFLLGGAP--P 174
Query: 143 VEMPNVIFGCG---HKNLASPTSDSK-QTGIIGLGPGNSSLISQMGTSIAGKFSYCL-PD 197
V +FGC A+ +SDS+ TG++G+ G+ S ++Q T +F+YC+ P
Sbjct: 175 VR---ALFGCVTSYSSATATNSSDSEAATGLLGMNRGSLSFVTQTATL---RFAYCIAPG 228
Query: 198 QGSSKINFGGIVAGAGVVS----TPLII---------RDHYYLSLEAISVGNQRLEFVSS 244
G + GG GA + TPLI R Y + LE I VG L S
Sbjct: 229 DGPGLLVLGG--DGAALAPQLNYTPLIQISRPLPYFDRVAYSVQLEGIRVGAALLPIPKS 286
Query: 245 -------STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--- 294
G VD+G T L + ++ LK N A + A G SD +
Sbjct: 287 VLAPDHTGAGQTMVDSGTQFTFLLADAYAPLKGEFLNQTSA-----LLAPLGESDFVFQG 341
Query: 295 ----CYNIS------SQPKFPEVTIHFRGADVKLSPSNLFRNISDE---------IMCSA 335
C+ S + PEV + RGA+V + L + E + C
Sbjct: 342 AFDACFRASEARVAAASQMLPEVGLVLRGAEVAVGGEKLLYRVPGERRGEGGAEAVWCLT 401
Query: 336 FRGGNANI------VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
F GN+++ V G Q N + YD++ V F P+RC
Sbjct: 402 F--GNSDMAGMSAYVIGHHHQQNVWVEYDLQNGRVGFAPARC 441
>gi|125564663|gb|EAZ10043.1| hypothetical protein OsI_32347 [Oryza sativa Indica Group]
Length = 330
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 92/358 (25%), Positives = 162/358 (45%), Gaps = 62/358 (17%)
Query: 46 VDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCS 105
+D+ DT SD WTQC+PC L C Q ++DP K+ TY +++ S
Sbjct: 1 MDVTLVFDTTSDLLWTQCQPC--LSCVAQAGDMYDPNKTETYANLTSS------------ 46
Query: 106 EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSK 165
+Y++ Y + SF+SG ATET + V + N+ FGCG +N + +
Sbjct: 47 ----NYNYTYSK---QSFTSGYFATETFALGN-----VTVANITFGCGTRNQGYYDNVAG 94
Query: 166 QTGIIGLGPGNSSLISQMGTSIAGKFSYCL---PDQGSSKINFGG-----------IVAG 211
G+ SL++Q+G +FSYC GSS + GG A
Sbjct: 95 VFGVGRG---GVSLLNQLGID---RFSYCFSSSGAPGSSAVFLGGSPELATNATTTPAAS 148
Query: 212 AGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSSSTGN-----IFVDTGVLRTLLPLEYHS 266
+V+ P +++ Y++ L ++VG R++ +S+ + +D+ T+L +
Sbjct: 149 TPMVADP-VLKSGYFVKLVGVTVGATRVDVAGASSAEGGGRALVIDSTSPVTVLDEATYG 207
Query: 267 NLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS---SQPKFPEV--TIHFRG--ADVKLS 319
++ + + A G LC+ ++ + P P V T+HF G AD+ L
Sbjct: 208 PVRRALVAQLAPLKEANANASAGVGLDLCFELAAGGATPTPPNVTMTLHFDGGAADLVLP 267
Query: 320 PSN-LFRNISDEIMCSAFRGGNAN--IVYGRIMQINFLIGYDIEQAMVSFKPSRCTNY 374
P+N L ++ + ++C ++N V G ++ L+ YD+ + +VSF+P C +
Sbjct: 268 PANYLAKDSAGGLICLTMTPSSSNGVPVLGSSALLDTLVLYDLAKNVVSFQPLDCAAF 325
>gi|413919745|gb|AFW59677.1| hypothetical protein ZEAMMB73_406599 [Zea mays]
Length = 246
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 68/203 (33%), Positives = 102/203 (50%), Gaps = 27/203 (13%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPP---LFDPKKSSTYNSI 90
++LM ++IGTPPV +DTGS +W QC PC E C KQ +FDP +S+T+
Sbjct: 41 LFLMPINIGTPPVMNLVGIDTGSTLSWVQCRPC-EPHCHKQAAKAGQIFDPSRSTTFRRA 99
Query: 91 SCSSSQCAVVT-------SNCSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGL 141
C+S +C VV +NC E C YS +Y G +A +++ + + L + L
Sbjct: 100 GCNSRECFVVKDALKLEFANCMEKVNTCLYSMIY-EGGWA-YTASKVVWDNLIIGTNISL 157
Query: 142 PVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK-FSYCLPDQGS 200
+ +FGC +L + K+ G +G G + S Q+ + I K FSYCLP +
Sbjct: 158 -----SFMFGC---SLDVEYGNYKEAGTVGFGTTSISFFEQVSSQINYKAFSYCLPSNET 209
Query: 201 SK--INFGGIVA-GAGVVSTPLI 220
+ +N G GA V+ TPL
Sbjct: 210 TTGYMNLGDYSGQGAHVLYTPLF 232
>gi|51091919|dbj|BAD35188.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|125596474|gb|EAZ36254.1| hypothetical protein OsJ_20576 [Oryza sativa Japonica Group]
gi|196212950|gb|ACG76111.1| S5 [Oryza sativa Japonica Group]
gi|340810891|gb|AEK75372.1| S5 [Oryza sativa]
gi|340810893|gb|AEK75373.1| S5 [Oryza sativa]
gi|340810899|gb|AEK75376.1| S5 [Oryza sativa]
gi|340810901|gb|AEK75377.1| S5 [Oryza sativa]
gi|340810933|gb|AEK75393.1| S5 [Oryza sativa]
gi|340810947|gb|AEK75400.1| S5 [Oryza sativa]
gi|340810949|gb|AEK75401.1| S5 [Oryza sativa]
gi|340810967|gb|AEK75410.1| S5 [Oryza sativa]
gi|340810969|gb|AEK75411.1| S5 [Oryza sativa]
gi|340810999|gb|AEK75426.1| S5 [Oryza rufipogon]
gi|340811017|gb|AEK75435.1| S5 [Oryza rufipogon]
gi|340811029|gb|AEK75441.1| S5 [Oryza nivara]
gi|340811051|gb|AEK75452.1| S5 [Oryza nivara]
gi|340811075|gb|AEK75464.1| S5 [Oryza nivara]
gi|340811077|gb|AEK75465.1| S5 [Oryza rufipogon]
gi|340811085|gb|AEK75469.1| S5 [Oryza nivara]
gi|340811096|gb|AEK75474.1| S5 [Oryza rufipogon]
gi|340811100|gb|AEK75476.1| S5 [Oryza rufipogon]
gi|340811114|gb|AEK75483.1| S5 [Oryza nivara]
Length = 472
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 106/380 (27%), Positives = 160/380 (42%), Gaps = 58/380 (15%)
Query: 32 DDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP---PLFDPKKSSTYN 88
D ++LM +S+G PPV ++DTGS +W QC+PC + C Q P+FDP +S T
Sbjct: 111 DFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCA-VHCHTQSAKAGPIFDPGRSYTSR 169
Query: 89 SISCSSSQCA-------VVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTS 139
+ CSS +C + +NC E + C+YS YG G ++S G + T+TL +
Sbjct: 170 RVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNG--WAYSVGKMVTDTLRIGDS- 226
Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMG----TSIAGKFSYCL 195
+++FGC S + GI G G + S Q+ SYCL
Sbjct: 227 -----FMDLMFGCSMDVKYS----EFEAGIFGFGSSSFSFFEQLAGYPDILSYKALSYCL 277
Query: 196 P-DQGSSKINFGGIVAGAGVVS--TPL---IIRDHYYLSLEAISVGNQRLEFVSSSTGNI 249
P D+ G A + TPL I R Y L++E + QRL +S+ +
Sbjct: 278 PTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRL---VTSSSEM 334
Query: 250 FVDTGVLRT-LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY------------ 296
VD+G RT L P + K++ M + A +CY
Sbjct: 335 IVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQ--ESYICYLSEHDYSGWNGT 392
Query: 297 --NISSQPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNA--NIVYGRIMQI 351
S+ P + I F GA + L P N+F N +C F A + + G +
Sbjct: 393 ITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPALRSQILGNRVTR 452
Query: 352 NFLIGYDIEQAMVSFKPSRC 371
+F +DI+ FK + C
Sbjct: 453 SFGTTFDIQGKQFGFKYAVC 472
>gi|340810993|gb|AEK75423.1| S5 [Oryza rufipogon]
gi|340811015|gb|AEK75434.1| S5 [Oryza nivara]
gi|340811021|gb|AEK75437.1| S5 [Oryza nivara]
Length = 474
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 106/380 (27%), Positives = 160/380 (42%), Gaps = 58/380 (15%)
Query: 32 DDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP---PLFDPKKSSTYN 88
D ++LM +S+G PPV ++DTGS +W QC+PC + C Q P+FDP +S T
Sbjct: 113 DFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCA-VHCHTQSAKAGPIFDPGRSYTSR 171
Query: 89 SISCSSSQCA-------VVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTS 139
+ CSS +C + +NC E + C+YS YG G ++S G + T+TL +
Sbjct: 172 RVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNG--WAYSVGKMVTDTLRIGDS- 228
Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMG----TSIAGKFSYCL 195
+++FGC S + GI G G + S Q+ SYCL
Sbjct: 229 -----FMDLMFGCSMDVKYS----EFEAGIFGFGSSSFSFFEQLAGYPDILSYKALSYCL 279
Query: 196 P-DQGSSKINFGGIVAGAGVVS--TPL---IIRDHYYLSLEAISVGNQRLEFVSSSTGNI 249
P D+ G A + TPL I R Y L++E + QRL +S+ +
Sbjct: 280 PTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRL---VTSSSEM 336
Query: 250 FVDTGVLRT-LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY------------ 296
VD+G RT L P + K++ M + A +CY
Sbjct: 337 IVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQ--ESYICYLSEHDYSGWNGT 394
Query: 297 --NISSQPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNA--NIVYGRIMQI 351
S+ P + I F GA + L P N+F N +C F A + + G +
Sbjct: 395 ITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPALRSQILGNRVTR 454
Query: 352 NFLIGYDIEQAMVSFKPSRC 371
+F +DI+ FK + C
Sbjct: 455 SFGTTFDIQGKQFGFKYAVC 474
>gi|340810915|gb|AEK75384.1| S5 [Oryza sativa]
gi|340810917|gb|AEK75385.1| S5 [Oryza sativa]
gi|340810919|gb|AEK75386.1| S5 [Oryza sativa]
gi|340810927|gb|AEK75390.1| S5 [Oryza sativa]
gi|340810975|gb|AEK75414.1| S5 [Oryza nivara]
gi|340810979|gb|AEK75416.1| S5 [Oryza nivara]
gi|340810995|gb|AEK75424.1| S5 [Oryza nivara]
gi|340811027|gb|AEK75440.1| S5 [Oryza nivara]
gi|340811063|gb|AEK75458.1| S5 [Oryza nivara]
Length = 357
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 108/379 (28%), Positives = 160/379 (42%), Gaps = 66/379 (17%)
Query: 37 MHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP---PLFDPKKSSTYNSISCS 93
M +S+G PPV ++DTGS +W QC+PC + C Q P+FDP +S T + CS
Sbjct: 1 MAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59
Query: 94 SSQCA-------VVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
S +C + +NC E + C+YS YG G ++S G + T+TL +
Sbjct: 60 SVKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNGW--AYSVGKMVTDTLRIGDS------ 111
Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG--------KFSYCLP 196
+++FGC S + GI G G + S Q+ AG FSYCLP
Sbjct: 112 FMDLMFGCSMDVKYS----EFEAGIFGFGSSSFSFFEQL----AGYPDILSYKAFSYCLP 163
Query: 197 -DQGSSKINFGGIVAGAGVVS--TPL---IIRDHYYLSLEAISVGNQRLEFVSSSTGNIF 250
D+ G A + TPL I R Y L++E + QRL +S+ +
Sbjct: 164 TDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRL---VTSSSEMI 220
Query: 251 VDTGVLRT-LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY------------- 296
VD+G RT L P + K++ M + A +CY
Sbjct: 221 VDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQ--ESYICYLSEHDYSGWNGTI 278
Query: 297 -NISSQPKFPEVTIHFRG-ADVKLSPSNLFRNISDEIMCSAFRGGNA--NIVYGRIMQIN 352
S+ P + I F G A + LSP N+F N +C F A + + G + +
Sbjct: 279 TPFSNWSALPLLEIGFAGGAALALSPRNVFYNDPHRGLCMTFAQNPALRSQILGNRVTRS 338
Query: 353 FLIGYDIEQAMVSFKPSRC 371
F +DI+ FK + C
Sbjct: 339 FGTTFDIQGKQFGFKYAAC 357
>gi|212275143|ref|NP_001130306.1| uncharacterized protein LOC100191400 precursor [Zea mays]
gi|194688798|gb|ACF78483.1| unknown [Zea mays]
gi|194703430|gb|ACF85799.1| unknown [Zea mays]
gi|194707192|gb|ACF87680.1| unknown [Zea mays]
gi|223944599|gb|ACN26383.1| unknown [Zea mays]
gi|223948667|gb|ACN28417.1| unknown [Zea mays]
gi|414887962|tpg|DAA63976.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 450
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 99/361 (27%), Positives = 156/361 (43%), Gaps = 47/361 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ S+GTPP + +VDT +D +W C C P FDP S++Y ++ C S
Sbjct: 112 YVVRASLGTPPQQLLLAVDTSNDASWIPCAGCAGCPTSSAAP--FDPASSASYRTVPCGS 169
Query: 95 SQCAVV-TSNCSEGD--CSYSFLYGRGAY-ASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
CA + C G C +S Y + A+ S +LA +G V+ F
Sbjct: 170 PLCAQAPNAACPPGGKACGFSLTYADSSLQAALSQDSLA--------VAGNAVKA--YTF 219
Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIV- 209
GC + A+ T+ Q + G S +SQ FSYCLP S +NF G +
Sbjct: 220 GCLQR--ATGTAAPPQGLLGLGR-GPLSFLSQTKDMYEATFSYCLPSFKS--LNFSGTLR 274
Query: 210 -----AGAGVVSTPLIIRDH----YYLSLEAISVGNQRL---EFVSSSTGNIFVDTGVLR 257
+ +TPL+ H YY+++ I VG + + F ++ +D+G +
Sbjct: 275 LGRNGQPQRIKTTPLLANPHRSSLYYVNMTGIRVGRKVVPIPAFDPATGAGTVLDSGTMF 334
Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVK 317
T L + ++ + + A PV +G GF C+N ++ +P VT+ F G V
Sbjct: 335 TRLVAPAYVAVRDEVRRRVGA-PVSSLG---GFDT--CFNTTAV-AWPPVTLLFDGMQVT 387
Query: 318 LSPSNL-----FRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
L N+ + IS M +A G N + V + Q N + +D+ V F RC
Sbjct: 388 LPEENVVIHSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARERC 447
Query: 372 T 372
T
Sbjct: 448 T 448
>gi|357114697|ref|XP_003559132.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 416
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 101/378 (26%), Positives = 163/378 (43%), Gaps = 56/378 (14%)
Query: 28 IISVDDIYLMHLSIGTP---PVDIFGSVDTGSDCTWTQCEPC-PELDCFKQEPPLFDPKK 83
+ S +Y + +SIGT + + G +DT + +W CEPC P L Q LF P
Sbjct: 60 LTSARFVYGVFVSIGTGQGFKLQVLG-LDTSTSMSWVMCEPCQPSL---PQAGHLFSPAA 115
Query: 84 SSTYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNS---TSG 140
S T++ + + C + G CS+ F F+SG L+ +T + + G
Sbjct: 116 SPTFHGVHSNDPVCTAPYRPTANG-CSFRF--------PFASGYLSRDTFHLRNGGLSGG 166
Query: 141 LPVE-MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD-- 197
P+E +P ++FGC H ++A +D G++ L SL++Q+ G+FSYCLP
Sbjct: 167 APIESVPGIMFGCAH-SVAGFHNDGTLGGVLSLSHLRLSLLTQLSARAGGRFSYCLPKPT 225
Query: 198 QGSSKINFGGIVAGAGVVS-------TPLIIRD----HYYLSLEAISVGNQRLE-----F 241
QG+ G + GA V+ T L +R YYLSL I++ +RL F
Sbjct: 226 QGNPH---GFLRLGADVLPPLPHSHMTALTVRSGSAPDYYLSLVGITLAEKRLRIDPRVF 282
Query: 242 VSSSTGNIFVDTGVLRTLLPLEYH-------SNLKSVMSNMIKAQPVKGVGAEPGFSDVL 294
+ G + ++ Y + +K + S+ +K P G GA F D +
Sbjct: 283 AAGRGGCSINPAATITAIMEPAYLVVERALVAYMKELGSDRVKKGPPGG-GAL--FFDRM 339
Query: 295 CYNISSQPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINF 353
++ Q + P + HF+ GA++ +P LF G V G Q+N
Sbjct: 340 YKSV--QARLPSMAFHFKDGAELWFTPEQLFEVHGMVAWFMMVGKGYRRTVIGAPQQVNT 397
Query: 354 LIGYDIEQAMVSFKPSRC 371
+D+ +SF C
Sbjct: 398 RFTFDVAAGRLSFASELC 415
>gi|356537015|ref|XP_003537027.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 476
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 103/358 (28%), Positives = 156/358 (43%), Gaps = 51/358 (14%)
Query: 52 VDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAV----VTSNC 104
+DTGSD W C CP+ E FD SST I CS C + C
Sbjct: 85 IDTGSDILWVNCNTCSNCPQSSQLGIELNFFDTVGSSTAALIPCSDLICTSGVQGAAAEC 144
Query: 105 SE--GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM---PNVIFGCGHKNLAS 159
S CSY+F YG G S +SG ++ + FN G P + ++FGC
Sbjct: 145 SPRVNQCSYTFQYGDG---SGTSGYYVSDAMYFNLIMGQPPAVNSTATIVFGCSISQSGD 201
Query: 160 PT-SDSKQTGIIGLGPGNSSLISQMGTS-IAGK-FSYCLPDQGSSKINFGGI-----VAG 211
T +D GI G GPG S++SQ+ + I K FS+CL G N GGI +
Sbjct: 202 LTKTDKAVDGIFGFGPGPLSVVSQLSSQGITPKVFSHCLKGDG----NGGGILVLGEILE 257
Query: 212 AGVVSTPLI-IRDHYYLSLEAISVGNQRLEF------VSSSTGNIFVDTGVLRTLLPLEY 264
+V +PL+ + HY L+L++I+V Q L +S++ G VD G L E
Sbjct: 258 PSIVYSPLVPSQPHYNLNLQSIAVNGQPLPINPAVFSISNNRGGTIVDCGTTLAYLIQEA 317
Query: 265 HSNLKSVMSNMI--KAQPVKGVGAEPGFSDVLCYNISSQ--PKFPEVTIHFR-GADVKLS 319
+ L + ++ + A+ G + CY +S+ FP V+++F GA + L
Sbjct: 318 YDPLVTAINTAVSQSARQTNSKGNQ-------CYLVSTSIGDIFPLVSLNFEGGASMVLK 370
Query: 320 PSN-LFRN---ISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
P L N E+ C F+ + G ++ + ++ YDI Q + + C+
Sbjct: 371 PEQYLMHNGYLDGAEMWCVGFQKLQEGASILGDLVLKDKIVVYDIAQQRIGWANYDCS 428
>gi|388502484|gb|AFK39308.1| unknown [Medicago truncatula]
Length = 425
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 102/382 (26%), Positives = 163/382 (42%), Gaps = 48/382 (12%)
Query: 1 AQNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTW 60
A+++ +L F + KS + I +II Y++ IGTPP + ++DT +D W
Sbjct: 60 AKDTTRLQFLDSLVARKSIVPIASGRQIIQ-SPTYIVRAKIGTPPQTLLLAMDTSNDAAW 118
Query: 61 TQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTS-NCSEGDCSYSFLYGRGA 119
C C C LF P+KS+T+ ++SC++ +C V + C +++ YG +
Sbjct: 119 IPCTACD--GCAST---LFAPEKSTTFKNVSCAAPECKQVPNPGCGVSSRNFNLTYGSSS 173
Query: 120 YASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSL 179
A+ NL +T+T + +P+ FGC K TS Q + G SL
Sbjct: 174 IAA----NLVQDTITLATD-----PVPSYTFGCVSKTTG--TSAPPQGLLGLGR-GPLSL 221
Query: 180 ISQMGTSIAGKFSYCLPD----QGSSKINFGGIVAGAGVVSTPLIIRDH----YYLSLEA 231
+SQ FSYCLP S + G + + TPL+ YY++LEA
Sbjct: 222 LSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVAQPKRIKYTPLLKNPRRSSLYYVNLEA 281
Query: 232 ISVGNQRLEF--------VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKG 283
I VG + ++ ++ G IF D+G + T L + ++ + P
Sbjct: 282 IRVGRKVVDIPPAALAFNPTTGAGTIF-DSGTVFTRLVAPVYVAVRDEFRR--RVGPKLT 338
Query: 284 VGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSN-LFRNISDEIMCSAFRGGNAN 342
V + GF CYN+ P +T F G +V L N L + + C A G N
Sbjct: 339 VTSLGGFDT--CYNVPI--VVPTITFIFTGMNVTLPQDNILIHSTAGSTTCLAMAGAPDN 394
Query: 343 I-----VYGRIMQINFLIGYDI 359
+ V + Q N + YD+
Sbjct: 395 VNSVLNVIANMQQQNHRVLYDV 416
>gi|357120129|ref|XP_003561782.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 452
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 99/394 (25%), Positives = 162/394 (41%), Gaps = 79/394 (20%)
Query: 39 LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA 98
+++GTPP ++ +DTGS+ +W C + + + P FD SS+Y + CSS C
Sbjct: 67 VAVGTPPQNVTMVLDTGSELSWLLC------NGSRHDAP-FDASASSSYAPVPCSSPACT 119
Query: 99 VVTSN------CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
+ + C C S Y A AS + G LA +T S+ MP +FGC
Sbjct: 120 WLGRDLPVRPFCDSSACRVSLSY---ADASSADGLLAADTFLLGSS-----PMP-ALFGC 170
Query: 153 --GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-PDQGSSKINFGGIV 209
+ + P S++ TG++G+ G S ++Q T +F+YC+ QG + GG
Sbjct: 171 ITSYSSSTDP-SETPPTGLLGMNRGGLSFVTQTATR---RFAYCIAAGQGPGILLLGGND 226
Query: 210 AGAGVVS--------TPLII---------RDHYYLSLEAISVGNQRLEFVS-------SS 245
+ S TPL+ R Y + LE I VG+ L +
Sbjct: 227 TETPLTSPPQQQLNYTPLVEISQPLPYFDRAAYTVQLEGIRVGSALLAIPKHLLTPDHTG 286
Query: 246 TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVG--AEPGF------------S 291
G VD+G T L + ++ LK+ +N + G+ EPGF +
Sbjct: 287 AGQTMVDSGTRFTFLLPDAYAALKAEFANQLTRSLDGGLAPLGEPGFVFQGAFDACFRGT 346
Query: 292 DVLCYNISSQPKFPEVTIHFRGADVKLSPSNLF--------RNISDEIMCSAF----RGG 339
+ ++ PEV + RGA+V ++ + R + + C F G
Sbjct: 347 EARVSAAAAGGLLPEVGLVLRGAEVVVAGAEKLLYRVPGERRGEGEGVWCLTFGSSDMAG 406
Query: 340 NANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
+ V G Q + + YD+ A + F +RC +
Sbjct: 407 VSAYVIGHHHQQDVWVEYDLRNARLGFAAARCAD 440
>gi|125554529|gb|EAZ00135.1| hypothetical protein OsI_22138 [Oryza sativa Indica Group]
Length = 472
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 105/383 (27%), Positives = 161/383 (42%), Gaps = 64/383 (16%)
Query: 32 DDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP---PLFDPKKSSTYN 88
D ++LM +S+G PPV ++DTGS +W QC+PC + C Q P+FDP +S T
Sbjct: 111 DFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCA-VHCHTQSAKAGPIFDPGRSYTSR 169
Query: 89 SISCSSSQCA-------VVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTS 139
+ CSS +C + +NC E + C+YS YG G ++S G + T+TL +
Sbjct: 170 RVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNG--WAYSVGKMVTDTLRIGDS- 226
Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMG----TSIAGKFSYCL 195
+++FGC S + GI G G + S Q+ FSYCL
Sbjct: 227 -----FMDLMFGCSMDVKYSEF----EAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCL 277
Query: 196 PDQGSSKINFGGIVAG--------AGVVSTPLII-RDHYYLSLEAISVGNQRLEFVSSST 246
P + + G ++ G G S I R Y L++E + QRL +S+
Sbjct: 278 P---TDETKPGYMILGRYDRAAMDGGYTSLFRSINRPTYSLTMEMLIANGQRL---VTSS 331
Query: 247 GNIFVDTGVLRT-LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY--------- 296
+ VD+G RT L P + K++ M + A +CY
Sbjct: 332 SEMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQ--ESYICYLSEHDYSGW 389
Query: 297 -----NISSQPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNA--NIVYGRI 348
S+ P + I F GA + L P N+F N +C F A + + G
Sbjct: 390 NGTITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPALRSQILGNR 449
Query: 349 MQINFLIGYDIEQAMVSFKPSRC 371
+ +F +DI+ FK + C
Sbjct: 450 VTRSFGTTFDIQGKQFGFKYAAC 472
>gi|115452187|ref|NP_001049694.1| Os03g0271900 [Oryza sativa Japonica Group]
gi|29893618|gb|AAP06872.1| hypothetical protein [Oryza sativa Japonica Group]
gi|108707424|gb|ABF95219.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|108707425|gb|ABF95220.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548165|dbj|BAF11608.1| Os03g0271900 [Oryza sativa Japonica Group]
gi|215715205|dbj|BAG94956.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737033|dbj|BAG95962.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215740994|dbj|BAG97489.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 447
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 110/397 (27%), Positives = 167/397 (42%), Gaps = 83/397 (20%)
Query: 39 LSIGTPPVDIFGSVDTGSDCTWTQCEP--CPELDCFKQEPPLFDPKKSSTYNSISCSSSQ 96
+++GTPP ++ +DTGS+ +W C P L P F+ SS+Y ++ C S+
Sbjct: 59 VAVGTPPQNVTMVLDTGSELSWLLCNGSYAPPLT------PAFNASGSSSYGAVPCPSTA 112
Query: 97 CAV------VTSNCS---EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
C V C C S Y A AS + G LAT+ TF T G P
Sbjct: 113 CEWRGRDLPVPPFCDTPPSNACRVSLSY---ADASSADGVLATD--TFLLTGGAPPVAVG 167
Query: 148 VIFGCGHKNLASPTSDS---------KQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-PD 197
FGC ++ ++S TG++G+ G S ++Q GT +F+YC+ P
Sbjct: 168 AYFGCITSYSSTTATNSNGTGTDVSEAATGLLGMNRGTLSFVTQTGTR---RFAYCIAPG 224
Query: 198 QGSSKINFG--GIVAGAGVVSTPLII---------RDHYYLSLEAISVGNQRLEFVSS-- 244
+G + G G VA + TPLI R Y + LE I VG L S
Sbjct: 225 EGPGVLLLGDDGGVA-PPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVGCALLPIPKSVL 283
Query: 245 -----STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGF----SDVLC 295
G VD+G T L + ++ LK+ ++ +A+ + EPGF + C
Sbjct: 284 TPDHTGAGQTMVDSGTQFTFLLADAYAALKAEFTS--QARLLLAPLGEPGFVFQGAFDAC 341
Query: 296 YN------ISSQPKFPEVTIHFRGADVKLSPSNLFRNISDE---------IMCSAFRGGN 340
+ ++ PEV + RGA+V +S L + E + C F GN
Sbjct: 342 FRGPEARVAAASGLLPEVGLVLRGAEVAVSGEKLLYMVPGERRGEGGAEAVWCLTF--GN 399
Query: 341 ANI------VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+++ V G Q N + YD++ V F P+RC
Sbjct: 400 SDMAGMSAYVIGHHHQQNVWVEYDLQNGRVGFAPARC 436
>gi|356513697|ref|XP_003525547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 252
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 92/171 (53%), Gaps = 24/171 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +G+ + + +DT SD TW QCEPC + C+ Q+ P+F P SS+Y S+SC+S
Sbjct: 65 YIVTMGLGSKNMTVI--IDTRSDLTWVQCEPC--MSCYNQQGPIFKPSTSSSYQSVSCNS 120
Query: 95 SQCAVV------TSNCSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
S C + T C + C+Y YG G+Y ++G+L E L+F V +
Sbjct: 121 STCQSLQFATGNTGACGSSNPSTCNYVVNYGDGSY---TNGDLGVEALSFGG-----VSV 172
Query: 146 PNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP 196
+ +FGCG N S G++GLG SL+SQ + G FSYCLP
Sbjct: 173 SDFVFGCGRNNKGLFGGVS---GLMGLGRSYLSLVSQTNATFGGVFSYCLP 220
>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
Length = 435
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 96/379 (25%), Positives = 155/379 (40%), Gaps = 65/379 (17%)
Query: 39 LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQC- 97
L++GTPP ++ +DTGS+ +W C F P+ S+T+ ++ C S++C
Sbjct: 65 LAVGTPPQNVTMVLDTGSELSWLLCATGRAAAAAADS---FRPRASATFAAVPCGSARCS 121
Query: 98 -----AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
A + + + C S Y G S S G LAT+ L FGC
Sbjct: 122 SRDLPAPPSCDAASRRCRVSLSYADG---SASDGALATDVFAVGDAPPL-----RSAFGC 173
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--------GSSKIN 204
S G++G+ G S ++Q T +FSYC+ D+ G S +
Sbjct: 174 MSAAYDSSPDAVATAGLLGMNRGALSFVTQASTR---RFSYCISDRDDAGVLLLGHSDLP 230
Query: 205 FGGIVAGAGVVSTPLII---RDHYYLSLEAISVGNQRLEFVSS-------STGNIFVDTG 254
F + TP + R Y + L I VG + L S G VD+G
Sbjct: 231 FLPLNYTPLYQPTPPLPYFDRVAYSVQLLGIRVGGKPLPIPPSVLAPDHTGAGQTMVDSG 290
Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFS-----DVLCYNI-SSQP----KF 304
T L + +S +K+ + + +P+ +P F+ D C+ + +P +
Sbjct: 291 TQFTFLLGDAYSAVKAEF--LKQTKPLLPALEDPSFAFQEAFDT-CFRVPKGRPPPSARL 347
Query: 305 PEVTIHFRGADVKLSPSNLF------RNISDEIMCSAFRGGNANI------VYGRIMQIN 352
P VT+ F GA + ++ L R +D + C F GNA++ V G Q+N
Sbjct: 348 PPVTLLFNGAQMSVAGDRLLYKVPGERRGADGVWCLTF--GNADMVPLTAYVIGHHHQMN 405
Query: 353 FLIGYDIEQAMVSFKPSRC 371
+ YD+E+ V P +C
Sbjct: 406 LWVEYDLERGRVGLAPVKC 424
>gi|297801286|ref|XP_002868527.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314363|gb|EFH44786.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 444
Score = 90.9 bits (224), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 98/380 (25%), Positives = 156/380 (41%), Gaps = 62/380 (16%)
Query: 36 LMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSS 95
++ L IGTP +DTGS +W QC P FDP SS+++ + CS
Sbjct: 82 ILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHP 141
Query: 96 QC-------AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
C + TS S C YS+ Y G +F+ GNL E TF+++ P +
Sbjct: 142 LCKPRIPDFTLPTSCDSNRLCHYSYFYADG---TFAEGNLVKEKFTFSNSQ----TTPPL 194
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ---------- 198
I GC A ++D K GI+G+ G S ISQ S KFSYC+P +
Sbjct: 195 ILGC-----AKESTDVK--GILGMNLGRLSFISQAKIS---KFSYCIPTRSNRPGLASTG 244
Query: 199 --------GSSKINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSS------ 244
S + ++ P + Y + L I +G +RL SS
Sbjct: 245 SFYLGENPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLLGIRIGQKRLNIPSSVFRPDA 304
Query: 245 -STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ-- 301
+G VD+G T L + +K + ++ ++ KG G + +C++ + Q
Sbjct: 305 GGSGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVY--GSTADMCFDGNHQMV 362
Query: 302 --PKFPEVTIHF-RGADVKLSPSNLFRNISDEIMC-----SAFRGGNANIVYGRIMQINF 353
++ F RG ++ + L N+ I C S+ G +NI+ G + Q N
Sbjct: 363 IGRLIGDLVFEFGRGVEILVEKQRLLVNVGGGIHCVGIGRSSMLGAASNII-GNVHQQNL 421
Query: 354 LIGYDIEQAMVSFKPSRCTN 373
+ +D+ V F + C+
Sbjct: 422 WVEFDVANRRVGFSKAECSR 441
>gi|226530102|ref|NP_001152414.1| PCS1 precursor [Zea mays]
gi|195656033|gb|ACG47484.1| PCS1 [Zea mays]
Length = 452
Score = 90.9 bits (224), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 112/402 (27%), Positives = 163/402 (40%), Gaps = 95/402 (23%)
Query: 39 LSIGTPPVDIFGSVDTGSDCTWTQCE-------PCPELDCFKQEPPLFDPKKSSTYNSIS 91
+++G PP ++ +DTGS+ +W +C P P Q P F+ SSTY +
Sbjct: 64 VAVGAPPQNVTMVLDTGSELSWLRCNGSRVPSTPPP------QAPAAFNGSASSTYAAAH 117
Query: 92 CSSSQCAV------VTSNCS---EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
CSS +C V C+ C S Y A AS + G LA +T P
Sbjct: 118 CSSPECQWRGRDLPVPPFCAGPPSXSCRVSLSY---ADASSADGILAADTFLLGGAP--P 172
Query: 143 VEMPNVIFGCG---HKNLASPTSDSK-QTGIIGLGPGNSSLISQMGTSIAGKFSYCL-PD 197
V +FGC A+ +SDS+ TG++G+ G+ S ++Q T +F+YC+ P
Sbjct: 173 V---XALFGCVTSYSSATATNSSDSEAATGLLGMNRGSLSFVTQTATL---RFAYCIAPG 226
Query: 198 QGSSKINFGGIVAGAGVVS----TPLII---------RDHYYLSLEAISVGNQRLEFVSS 244
G + GG GA + TPLI R Y + LE I VG L S
Sbjct: 227 DGPGLLVLGG--DGAALAPQLNYTPLIQISRPLPYFDRVAYSVQLEGIRVGAALLPIPKS 284
Query: 245 -------STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--- 294
G VD+G T L + ++ LK N A + A G SD +
Sbjct: 285 VLAPDHTGAGQTMVDSGTQFTFLLADAYAPLKGEFLNQTSA-----LLAPLGESDFVFQG 339
Query: 295 ----CYNIS------SQPKFPEVTIHFRGADVKLSPSNLFRNISDE---------IMCSA 335
C+ S + PEV + RGA+V + L + E + C
Sbjct: 340 AFDACFRASEARVAAASXMLPEVGLVLRGAEVAVGGEKLLYRVPGERRGEGGAEAVWCLT 399
Query: 336 FRGGNANI------VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
F GN+++ V G Q N + YD++ V F P+RC
Sbjct: 400 F--GNSDMAGMSAYVIGHHHQQNVWVEYDLQNGRVGFAPARC 439
>gi|168014188|ref|XP_001759635.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689174|gb|EDQ75547.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 485
Score = 90.9 bits (224), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 113/397 (28%), Positives = 166/397 (41%), Gaps = 53/397 (13%)
Query: 12 DNETPKSPISIIYQAEIISVDDI-----YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPC 66
D + ++ A ++ DD+ Y + IGTP + VDTGS T+ C C
Sbjct: 71 DRRFERRGRGLVEDARMVLHDDLLTKGYYTSRVFIGTPAQEFALIVDTGSTVTYVPCSSC 130
Query: 67 PELD----CFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYAS 122
CF P F P SS+Y ++SC+S C + C Y +Y A S
Sbjct: 131 THCGHHQACFD---PRFKPDNSSSYQTVSCNSPDCITKMCDARVHQCKYERVY---AEMS 184
Query: 123 FSSGNLATETLTFNSTSGLPVEMPN-VIFGCGHKNLASPTSD---SKQTGIIGLGPGNSS 178
S G L + L F + S L P+ ++FGC + T D GI+GLG G S
Sbjct: 185 SSKGVLGKDLLGFGNGSRL---QPHPLLFGCE----TAETGDLYLQHADGIMGLGRGPLS 237
Query: 179 LISQM-GT-SIAGKFSYCLP--DQGSSKINFGGIVAGAGVV---STPLIIRDHYY-LSLE 230
++ Q+ GT ++ FS C D+G + G I +V S P R +YY L L
Sbjct: 238 IVDQLVGTGAMEDSFSLCYGGMDEGGGSMVLGAIPPPPAMVFAKSDP--NRSNYYNLELS 295
Query: 231 AISVGNQRL----EFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKA-QPVKGVG 285
I V L E + G + +D+G LP + K ++ + + Q V G
Sbjct: 296 EIQVQGVSLNVPSEVFNGRLGTV-LDSGTTYAYLPDKAFDAFKDAITQQLGSLQAVP--G 352
Query: 286 AEPGFSDVLCYNISSQPK-----FPEVTIHFRG-ADVKLSPSN-LFRN--ISDEIMCSAF 336
+P + DV S K FP V F G V L+P N LF++ + F
Sbjct: 353 PDPSYPDVCFAGAGSDSKALGKHFPPVDFVFSGNQKVFLAPENYLFKHTKVPGAYCLGFF 412
Query: 337 RGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
+ +A + G I+ N L+ YD + F + CTN
Sbjct: 413 KNQDATTLLGGIVVRNTLVTYDRANHQIGFFKTNCTN 449
>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
Length = 449
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 95/377 (25%), Positives = 154/377 (40%), Gaps = 49/377 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCE-PCPELDCFKQEPPLFDPKK------SSTY 87
Y + +GTP DTGSD TW C+ C +C ++ K+ SS++
Sbjct: 83 YFVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANLSSSF 142
Query: 88 NSISCSSSQCAV------VTSNCSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNSTS 139
+I C + C + +NC C Y + Y G+ A G A ET+T
Sbjct: 143 KTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTAL---GFFANETVTVELKE 199
Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG 199
G +++ NV+ GC S G++GLG S + GKFSYCL D
Sbjct: 200 GRKMKLHNVLIGCSES--FQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCLVDHL 257
Query: 200 SSK-----INFGGIVAGAGVVS----TPLI---IRDHYYLSLEAISVGNQRLEFVS---- 243
S K + FG + +++ T L+ + Y +++ IS+G L+ S
Sbjct: 258 SHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEVWD 317
Query: 244 -SSTGNIFVDTGVLRTLLPL-EYHSNLKSVMSNMIKAQPVK-GVGAEPGFSDVLCYNIS- 299
G +D+G T L Y + ++ +++K + V+ +G C+N +
Sbjct: 318 VKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLE-----YCFNSTG 372
Query: 300 -SQPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAF--RGGNANIVYGRIMQINFLI 355
+ P + HF GA+ + + + +D + C F V G IMQ N L
Sbjct: 373 FEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNHLW 432
Query: 356 GYDIEQAMVSFKPSRCT 372
+D+ + F PS CT
Sbjct: 433 EFDLGLKKLGFAPSSCT 449
>gi|218201673|gb|EEC84100.1| hypothetical protein OsI_30414 [Oryza sativa Indica Group]
Length = 366
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 82/296 (27%), Positives = 123/296 (41%), Gaps = 45/296 (15%)
Query: 29 ISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYN 88
I D YL + IG + +DTGS WTQC+ CP C + P + +S T+
Sbjct: 76 IYEDVAYLAEMEIGERQQKQYLLIDTGSSLVWTQCDECPH--CHIGDVPPYGRSQSRTFQ 133
Query: 89 SISCS-----------SSQCAV----VTSNCSEGDCSYSFLYGRGAYASFSSGNLATETL 133
+SC +S C + C G C + LY G ++ +T
Sbjct: 134 EVSCGDDDDNDKEEAIASYCPAKPPGYITLCVNGRCMFKALYNLTGQGETVQGYMSMDTF 193
Query: 134 TFNSTSGLPVEMP-NVIFGCGHKN------LASPTSDSKQ-TGIIGLGPGNSSLISQMGT 185
F + ++FGC H+ + T+ K+ TGI+GLG G++S + Q G
Sbjct: 194 HFIDDRRFDYQAKFRMVFGCAHQENIVLTAVKECTTAVKECTGILGLGMGDASFLRQTGI 253
Query: 186 SIAGKFSYCLPD-------QGSSKINFGGIVAGAGVVSTPLIIR-DHYYLSLEAISVG-N 236
+ KFSYC P + S + FG +G PL++R YYL L AI+ N
Sbjct: 254 T---KFSYCAPPRMPGYSYRRDSWLRFGSHAQISG-KKVPLVMRWGKYYLPLTAITYTYN 309
Query: 237 QRLEFV-------SSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVG 285
+ + V ++ VDTG LP H +L M +IK++ G G
Sbjct: 310 ELMSPVPIIAYKSQEDYLHMMVDTGTSLLSLPTSLHDDLIKEMEAIIKSKKYDGRG 365
>gi|88174569|gb|ABD39359.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
Length = 321
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 91/329 (27%), Positives = 148/329 (44%), Gaps = 45/329 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP +DTGS TW C E D P F +S+T +SC +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTTWVFC----ECDGCHTNPRTFLQSRSTTCAKVSCGT 56
Query: 95 SQCAVVTSN--CSEG----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
S C + S+ C + DC + Y G S S G L +TLTF+ ++P+
Sbjct: 57 SMCLLGGSDPHCQDSENYPDCPFRVSYQDG---SASYGILYQDTLTFSDVQ----KIPSF 109
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK------ 202
FGC + + G++G+G G S++ Q + G FSYCLP Q S +
Sbjct: 110 TFGCNLDSFGA-NEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKT 167
Query: 203 ---INFGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSS--STGNIFVDT 253
+ G + V T ++ R + +++ L AISV +RL S S + D+
Sbjct: 168 TGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDS 227
Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHF 311
G + +P SV+S I+ ++ AE S+ CY++ S + P +++HF
Sbjct: 228 GSELSYIP----DRALSVLSQRIRELLLRRGAAEEE-SERNCYDMRSVDEGDMPAISLHF 282
Query: 312 -RGADVKLSPSNLF--RNISDE-IMCSAF 336
GA L +F R++ ++ + C AF
Sbjct: 283 DDGARFDLGSRGVFVERSVQEQDVWCLAF 311
>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 633
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 104/381 (27%), Positives = 162/381 (42%), Gaps = 47/381 (12%)
Query: 19 PISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPL 78
P S + + + ++ Y L IGTPP VD+GS T+ C C + C K + P
Sbjct: 78 PHSRMRLYDDLLINGYYTTRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQ--CGKHQDPK 135
Query: 79 FDPKKSSTYNSISCSSSQCAVVTSNCSEG--DCSYSFLYGRGAYASFSSGNLATETLTFN 136
F P+ SSTY + C+ + NC + C Y Y A S S G L + ++F
Sbjct: 136 FQPELSSTYQPVKCN------MDCNCDDDKEQCVYEREY---AEHSSSKGVLGEDLISFG 186
Query: 137 STSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYC 194
+ S L + +FGC S + GIIGLG G+ SL+ Q+ I+ F C
Sbjct: 187 NESQLTPQ--RAVFGCETVETGDLYS-QRADGIIGLGQGDLSLVDQLVDKGLISNSFGLC 243
Query: 195 LPDQGSSKINFGGIVAGAGVVSTPLIIRD-------HYYLSLEAISVGNQRLEF---VSS 244
G + G ++ G + +I D +Y + L I V ++L V
Sbjct: 244 Y---GGMDVGGGSMILGGFDYPSDMIFTDSDPDRSPYYNIDLTGIRVAGKKLSLNSRVFD 300
Query: 245 STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGV-GAEPGFSDVLCY------N 297
+D+G LP + + + M + P+K + G +P F D C+ +
Sbjct: 301 GEHGAVLDSGTTYAYLPDAAFAAFEEAV--MREVSPLKQIDGPDPNFKDT-CFLVAASND 357
Query: 298 ISSQPK-FPEVTIHFR-GADVKLSPSN-LFRN--ISDEIMCSAF-RGGNANIVYGRIMQI 351
+S K FP V + F+ G LSP N +FR+ + F G + + G I+
Sbjct: 358 VSELSKIFPSVEMIFKSGQSWLLSPENYMFRHSKVHGAYCLGVFPNGKDHTTLLGGIVVR 417
Query: 352 NFLIGYDIEQAMVSFKPSRCT 372
N L+ YD E + V F + C+
Sbjct: 418 NTLVVYDRENSKVGFWRTNCS 438
>gi|326501496|dbj|BAK02537.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 95/391 (24%), Positives = 159/391 (40%), Gaps = 60/391 (15%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCE--PCPELDCFKQEPPLFDPKKSSTYNSISC 92
Y + +GTP DTGSD TW +C + F P+ S T+ ISC
Sbjct: 94 YFVRFRVGTPAQPFLLVADTGSDLTWVKCRRPAANSSESGSGSGRAFRPEDSRTWAPISC 153
Query: 93 SSSQC------AVVTSNCSEGDCSYSFLYGRGAYASFSSGNL-ATETLTFNSTSGLPVEM 145
+S C ++ T C+Y + Y G+ A + G AT L+ ++
Sbjct: 154 ASDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSGRGREERKAKL 213
Query: 146 PNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----PDQGS 200
++ GC + P+ + G++ LG + S S + AG+FSYCL P +
Sbjct: 214 KGLVLGC-TSSYTGPSFEVSD-GVLSLGYSDVSFASHAASRFAGRFSYCLVDHLSPRNAT 271
Query: 201 SKINFG----------------------GIVAGAGVVSTPLII----RDHYYLSLEAISV 234
S + FG TPL++ R Y ++++A+SV
Sbjct: 272 SYLTFGPNPAVASSSSPSSPAPASCTAAAPRPRPRARQTPLLLDRRMRPFYDVAVKAVSV 331
Query: 235 GNQRLEFVSS-----STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPG 289
Q L+ + + G + +D+G T+L + + + +S + P V +P
Sbjct: 332 AGQFLKIPRAVWDVDAGGGVILDSGTSLTVLAKPAYRAVVAALSEGLAGLPR--VTMDP- 388
Query: 290 FSDVLCYNISS---QPKFPEVTIHFRGADVKLSP--SNLFRNISDEIMCSAFRGG--NAN 342
F CYN +S P++ +HF GA +L P + + + + C + G
Sbjct: 389 FE--YCYNWTSPSGDVTLPKMAVHFAGA-ARLEPPGKSYVIDAAPGVKCIGLQEGPWPGI 445
Query: 343 IVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
V G I+Q L +DI+ + F+ SRCT+
Sbjct: 446 SVIGNILQQEHLWEFDIKNRRLKFQRSRCTH 476
>gi|212274314|ref|NP_001130524.1| uncharacterized protein LOC100191623 [Zea mays]
gi|194689376|gb|ACF78772.1| unknown [Zea mays]
gi|224031455|gb|ACN34803.1| unknown [Zea mays]
gi|238011528|gb|ACR36799.1| unknown [Zea mays]
gi|238015454|gb|ACR38762.1| unknown [Zea mays]
Length = 304
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 86/317 (27%), Positives = 141/317 (44%), Gaps = 48/317 (15%)
Query: 90 ISCSSSQCA-VVTSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
+ C+ + C+ ++ +C D C+Y + YG G + + G ATE TF S+ G +
Sbjct: 1 MRCAGTLCSDILHHSCERPDTCTYRYNYGDG---TMTVGVYATERFTFASSGGGGLTTTT 57
Query: 148 VI--FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--- 202
V FGCG N+ S + S GI+G G SL+SQ+ +FSYCL S +
Sbjct: 58 VPLGFGCGSVNVGSLNNGS---GIVGFGRNPLSLVSQLSIR---RFSYCLTSYASRRQST 111
Query: 203 INFGGIVAG------AGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSS------- 245
+ FG + G V +TPL+ YY+ ++VG +RL S+
Sbjct: 112 LLFGSLSDGVYGDATGRVQTTPLLQSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDG 171
Query: 246 TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI------- 298
+G + VD+G TLLP + + ++ G E G +C+ +
Sbjct: 172 SGGVIVDSGTALTLLPAAVLAEVVRAFRQQLRLPFANGGNPEDG----VCFLVPAAWRRS 227
Query: 299 --SSQPKFPEVTIHFRGADVKLSPSN-LFRNISDEIMCSAFR-GGNANIVYGRIMQINFL 354
+SQ P + +HF+GAD+ L N + + +C G+ G ++Q +
Sbjct: 228 SSTSQMPVPRMVLHFQGADLDLPRRNYVLDDHRRGRLCLLLADSGDDGSTIGNLVQQDMR 287
Query: 355 IGYDIEQAMVSFKPSRC 371
+ YD+E +S P+RC
Sbjct: 288 VLYDLEAETLSIAPARC 304
>gi|340810945|gb|AEK75399.1| S5 [Oryza sativa]
gi|340810957|gb|AEK75405.1| S5 [Oryza sativa]
gi|340811007|gb|AEK75430.1| S5 [Oryza nivara]
gi|340811073|gb|AEK75463.1| S5 [Oryza rufipogon]
gi|340811094|gb|AEK75473.1| S5 [Oryza rufipogon]
Length = 357
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 108/379 (28%), Positives = 159/379 (41%), Gaps = 66/379 (17%)
Query: 37 MHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP---PLFDPKKSSTYNSISCS 93
M +S+G PPV ++DTGS +W QC+PC + C Q P+FDP +S T + CS
Sbjct: 1 MAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59
Query: 94 SSQCA-------VVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
S +C + +NC E + C+YS YG G ++S G + T+TL +
Sbjct: 60 SVKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNGW--AYSVGKMVTDTLRIGDS------ 111
Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG--------KFSYCLP 196
+++FGC S + GI G G + S Q+ AG FSYCLP
Sbjct: 112 FMDLMFGCSMDVKYS----EFEAGIFGFGSSSFSFFEQL----AGYPDILSYKAFSYCLP 163
Query: 197 -DQGSSKINFGGIVAGAGVVS--TPL---IIRDHYYLSLEAISVGNQRLEFVSSSTGNIF 250
D+ G A + TPL I R Y L+ E + QRL +S+ +
Sbjct: 164 TDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTTEMLIANGQRL---VTSSSEMI 220
Query: 251 VDTGVLRT-LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY------------- 296
VD+G RT L P + K++ M + A +CY
Sbjct: 221 VDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQ--ESYICYLSEHDYSGWNGTI 278
Query: 297 -NISSQPKFPEVTIHFRG-ADVKLSPSNLFRNISDEIMCSAFRGGNA--NIVYGRIMQIN 352
S+ P + I F G A + LSP N+F N +C F A + + G + +
Sbjct: 279 TPFSNWSALPLLEIGFAGGAALALSPRNVFYNDPHRGLCMTFAQNPALRSQILGNRVTRS 338
Query: 353 FLIGYDIEQAMVSFKPSRC 371
F +DI+ FK + C
Sbjct: 339 FGTTFDIQGKQFGFKYAAC 357
>gi|195626958|gb|ACG35309.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 97/361 (26%), Positives = 156/361 (43%), Gaps = 47/361 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ S+GTPP + +VDT +D +W C C P FDP S++Y ++ C S
Sbjct: 112 YVVRASLGTPPQQLLLAVDTSNDASWIPCAGCAGCPTSSAAP--FDPAASASYRTVPCGS 169
Query: 95 SQCAVV-TSNCSEGD--CSYSFLYGRGAY-ASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
CA + C G C +S Y + A+ S +LA +G V+ F
Sbjct: 170 PLCAQAPNAACPPGGKACGFSLTYADSSLQAALSQDSLA--------VAGNAVKA--YTF 219
Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIV- 209
GC + A+ T+ Q + G S +SQ FSYCLP S +NF G +
Sbjct: 220 GCLQR--ATGTAAPPQGLLGLGR-GPLSFLSQTKDMYEATFSYCLPSFKS--LNFSGTLR 274
Query: 210 -----AGAGVVSTPLIIRDH----YYLSLEAISVGNQRL---EFVSSSTGNIFVDTGVLR 257
+ +TPL+ H YY+++ + VG + + F ++ +D+G +
Sbjct: 275 LGRNGQPQRIKTTPLLANPHRSSLYYVNMTGVRVGRKVVPIPAFDPATGAGTVLDSGTMF 334
Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVK 317
T L + ++ + + A PV +G GF C+N ++ +P +T+ F G V
Sbjct: 335 TRLVAPAYVAVRDEVRRRVGA-PVSSLG---GFDT--CFNTTAV-AWPPMTLLFDGMQVT 387
Query: 318 LSPSNL-----FRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
L N+ + IS M +A G N + V + Q N + +D+ V F RC
Sbjct: 388 LPEENVVIHSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARERC 447
Query: 372 T 372
T
Sbjct: 448 T 448
>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 92/368 (25%), Positives = 151/368 (41%), Gaps = 57/368 (15%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +G+P + VDTGS+ TW C S ++ +++C+S
Sbjct: 113 YFAEVKVGSPGQRFWLVVDTGSEFTWLNC--------------------SKSFEAVTCAS 152
Query: 95 SQCAVVTSN------CSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
+C V S C + C Y Y G+ A G T+++T T+G ++
Sbjct: 153 RKCKVDLSELFSLSVCPKPSDPCLYDISYADGSSAK---GFFGTDSITVGLTNGKQGKLN 209
Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ-----GSS 201
N+ GC L + + GI+GLG S I + KFSYCL D SS
Sbjct: 210 NLTIGCTKSMLNGVNFNEETGGILGLGFAKDSFIDKAANKYGAKFSYCLVDHLSHRSVSS 269
Query: 202 KINFGGIVAG---AGVVSTPLIIRDHYY-LSLEAISVGNQRLE-----FVSSSTGNIFVD 252
+ GG + T LI+ +Y +++ IS+G Q L+ + ++ G +D
Sbjct: 270 NLTIGGHHNAKLLGEIRRTELILFPPFYGVNVVGISIGGQMLKIPPQVWDFNAEGGTLID 329
Query: 253 TG-VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTI 309
+G L +LL Y + +++ ++ K + V G E + C++ P +
Sbjct: 330 SGTTLTSLLLPAYEAVFEALTKSLTKVKRVTG---EDFDALEFCFDAEGFDDSVVPRLVF 386
Query: 310 HFRGADVKLSP--SNLFRNISDEIMCSA---FRGGNANIVYGRIMQINFLIGYDIEQAMV 364
HF G + P + +++ + C G V G IMQ N L +D+ V
Sbjct: 387 HFAGG-ARFEPPVKSYIIDVAPLVKCIGIVPIDGIGGASVIGNIMQQNHLWEFDLSTNTV 445
Query: 365 SFKPSRCT 372
F PS CT
Sbjct: 446 GFAPSTCT 453
>gi|340811098|gb|AEK75475.1| S5 [Oryza nivara]
Length = 357
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 108/379 (28%), Positives = 159/379 (41%), Gaps = 66/379 (17%)
Query: 37 MHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP---PLFDPKKSSTYNSISCS 93
M +S+G PPV ++DTGS +W QC+PC + C Q P+FDP +S T + CS
Sbjct: 1 MAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59
Query: 94 SSQCA-------VVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
S +C + +NC E + C+YS YG G ++S G + T+TL +
Sbjct: 60 SVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGW--AYSVGKMVTDTLRIGDS------ 111
Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG--------KFSYCLP 196
+++FGC S + GI G G + S Q+ AG FSYCLP
Sbjct: 112 FMDLMFGCSMDVKYSEF----EAGIFGFGSSSFSFFEQL----AGYPDILSYKAFSYCLP 163
Query: 197 -DQGSSKINFGGIVAGAGVVS--TPL---IIRDHYYLSLEAISVGNQRLEFVSSSTGNIF 250
D+ G A + TPL I R Y L+ E + QRL +S+ +
Sbjct: 164 TDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTTEMLIANGQRL---VTSSSEMI 220
Query: 251 VDTGVLRT-LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY------------- 296
VD+G RT L P + K++ M + A +CY
Sbjct: 221 VDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQ--ESYICYLSEHDYSGWNGTI 278
Query: 297 -NISSQPKFPEVTIHFRG-ADVKLSPSNLFRNISDEIMCSAFRGGNA--NIVYGRIMQIN 352
S+ P + I F G A + LSP N+F N +C F A + + G + +
Sbjct: 279 TPFSNWSALPLLEIGFAGGAALALSPRNVFYNDPHRGLCMTFAQNPALRSQILGNRVTRS 338
Query: 353 FLIGYDIEQAMVSFKPSRC 371
F +DI+ FK + C
Sbjct: 339 FGTTFDIQGKQFGFKYAAC 357
>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 489
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 100/373 (26%), Positives = 157/373 (42%), Gaps = 48/373 (12%)
Query: 35 YLMHLSIGTP-PVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP---PLFDPKKSSTYNSI 90
Y + + IGTP P DTGSD TW CE + C K P +F SS++ +I
Sbjct: 119 YFVSIRIGTPRPQKFILVTDTGSDLTWMNCEYWCK-SCPKPNPHPGRVFRANDSSSFRTI 177
Query: 91 SCSSSQCAVVTSN------CSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
CSS C + + C + C + + Y G A G A ET+T
Sbjct: 178 PCSSDDCKIELQDYFSLTECPNPNAPCLFDYRYLNGPRAI---GVFANETVTVGLNDHKK 234
Query: 143 VEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK 202
+ + +V+ GC + ++ G++GLG SL ++ KFSYCL D SS
Sbjct: 235 IRLFDVLIGCTE---SFNETNGFPDGVMGLGYRKHSLALRLAEIFGNKFSYCLVDHLSSS 291
Query: 203 -----INFGGI--VAGAGVVSTPLI---IRDHYYLSLEAISVGNQRLEFVS-----SSTG 247
++FG I + + T L+ I Y +++ ISVG L S + G
Sbjct: 292 NHKNFLSFGDIPEMKLPKMQHTELLLGYINAFYPVNVSGISVGGSMLSISSDIWNVTGVG 351
Query: 248 NIFVDTGVLRTLLPLEYHSN----LKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQ 301
+ VD+G T+L E + LK + K P++ P ++ C+ +
Sbjct: 352 GMIVDSGTSLTMLAGEAYDKVVDALKPIFDKHKKVVPIE----LPELNN-FCFEDKGFDR 406
Query: 302 PKFPEVTIHF-RGADVKLSPSNLFRNISDEIMCSAFRGGN--ANIVYGRIMQINFLIGYD 358
P + IHF GA K + ++++ I C + + + G +MQ N L YD
Sbjct: 407 AAVPRLLIHFADGAIFKPPVKSYIIDVAEGIKCLGIIKADFPGSSILGNVMQQNHLWEYD 466
Query: 359 IEQAMVSFKPSRC 371
+ + + F PS C
Sbjct: 467 LGRGKLGFGPSSC 479
>gi|18421660|ref|NP_568551.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|10177438|dbj|BAB10671.1| unnamed protein product [Arabidopsis thaliana]
gi|15809850|gb|AAL06853.1| AT5g37540/mpa22_p_70 [Arabidopsis thaliana]
gi|20260182|gb|AAM12989.1| unknown protein [Arabidopsis thaliana]
gi|23197748|gb|AAN15401.1| unknown protein [Arabidopsis thaliana]
gi|332006821|gb|AED94204.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 96/378 (25%), Positives = 156/378 (41%), Gaps = 62/378 (16%)
Query: 36 LMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSS 95
++ L IGTP +DTGS +W QC P FDP SS+++ + CS
Sbjct: 81 ILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHP 140
Query: 96 QC-------AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
C + TS S C YS+ Y G +F+ GNL E TF+++ P +
Sbjct: 141 LCKPRIPDFTLPTSCDSNRLCHYSYFYADG---TFAEGNLVKEKFTFSNSQ----TTPPL 193
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ---------- 198
I GC A ++D K GI+G+ G S ISQ S KFSYC+P +
Sbjct: 194 ILGC-----AKESTDEK--GILGMNLGRLSFISQAKIS---KFSYCIPTRSNRPGLASTG 243
Query: 199 --------GSSKINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSS------ 244
S + ++ P + Y + L+ I +G +RL S
Sbjct: 244 SFYLGDNPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFRPDA 303
Query: 245 -STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK 303
+G VD+G T L + +K + ++ ++ KG G + +C++ + +
Sbjct: 304 GGSGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVY--GSTADMCFDGNHSME 361
Query: 304 ----FPEVTIHF-RGADVKLSPSNLFRNISDEIMC-----SAFRGGNANIVYGRIMQINF 353
++ F RG ++ + +L N+ I C S+ G +NI+ G + Q N
Sbjct: 362 IGRLIGDLVFEFGRGVEILVEKQSLLVNVGGGIHCVGIGRSSMLGAASNII-GNVHQQNL 420
Query: 354 LIGYDIEQAMVSFKPSRC 371
+ +D+ V F + C
Sbjct: 421 WVEFDVTNRRVGFSKAEC 438
>gi|255685714|gb|ACU28346.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
Length = 91
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 48/106 (45%), Positives = 62/106 (58%), Gaps = 15/106 (14%)
Query: 37 MHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQ 96
M L IGTPP +I +DTGS+ WTQC PC L C+ Q+ P+FDP KSST+ C
Sbjct: 1 MKLQIGTPPFEIEAVLDTGSELIWTQCLPC--LHCYDQKAPIFDPSKSSTFKETRC---- 54
Query: 97 CAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
N + CSY +Y +Y + G LATET+T +STSG+P
Sbjct: 55 ------NTPDHSCSYKIVYDDKSY---TQGTLATETVTIHSTSGVP 91
>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
Length = 642
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 102/368 (27%), Positives = 158/368 (42%), Gaps = 44/368 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQE--------PPLFDPKKSST 86
Y L IGTP + VD+GS T+ C C + + E P F P SST
Sbjct: 92 YTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLSST 151
Query: 87 YNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
Y+ + C+ T + C+Y Y A S SSG L + ++F S L +
Sbjct: 152 YSPVKCNVD----CTCDNERSQCTYERQY---AEMSSSSGVLGEDIMSFGKESELKPQ-- 202
Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLP--DQGSSK 202
+FGC + S GI+GLG G S++ Q+ I+ FS C D G
Sbjct: 203 RAVFGCENTETGDLFSQHAD-GIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGT 261
Query: 203 INFGGIVAGAGVV---STPLIIRDHYY-LSLEAISVGNQRLEF---VSSSTGNIFVDTGV 255
+ GG+ A +V S P +R YY + L+ I V + L + +S +D+G
Sbjct: 262 MVLGGMPAPPDMVFSHSNP--VRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTVLDSGT 319
Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY-----NISSQPK-FPEVTI 309
LP + K ++N + + K G +P + D+ C+ N+S + FP+V +
Sbjct: 320 TYAYLPEQAFVAFKDAVTNKVNSLK-KIRGPDPNYKDI-CFAGAGRNVSQLSEVFPDVDM 377
Query: 310 HF-RGADVKLSPSN-LFRN--ISDEIMCSAFRGG-NANIVYGRIMQINFLIGYDIEQAMV 364
F G + LSP N LFR+ + F+ G + + G I+ N L+ YD +
Sbjct: 378 VFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKI 437
Query: 365 SFKPSRCT 372
F + C+
Sbjct: 438 GFWKTNCS 445
>gi|18408451|ref|NP_564867.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322615|gb|AAG51309.1|AC026480_16 unknown protein [Arabidopsis thaliana]
gi|14334808|gb|AAK59582.1| unknown protein [Arabidopsis thaliana]
gi|15293195|gb|AAK93708.1| unknown protein [Arabidopsis thaliana]
gi|332196351|gb|AEE34472.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 430
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 97/383 (25%), Positives = 158/383 (41%), Gaps = 72/383 (18%)
Query: 36 LMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPL----FDPKKSSTYNSIS 91
++ L IGTPP +DTGS +W QC K+ PP FDP SS+++++
Sbjct: 73 IISLPIGTPPQAQQMVLDTGSQLSWIQCH-------RKKLPPKPKTSFDPSLSSSFSTLP 125
Query: 92 CSSSQC-------AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
CS C + TS S C YS+ Y G +F+ GNL E +TF++T
Sbjct: 126 CSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADG---TFAEGNLVKEKITFSNTE----I 178
Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ------ 198
P +I GC T S GI+G+ G S +SQ S KFSYC+P +
Sbjct: 179 TPPLILGCA-------TESSDDRGILGMNRGRLSFVSQAKIS---KFSYCIPPKSNRPGF 228
Query: 199 ------------GSSKINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSS-- 244
S + ++ P + Y + + I G ++L S
Sbjct: 229 TPTGSFYLGDNPNSHGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVF 288
Query: 245 -----STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY--N 297
+G VD+G T L + +++ + + + KG G +D +C+ N
Sbjct: 289 RPDAGGSGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGY-VYGGTAD-MCFDGN 346
Query: 298 ISSQPKF--PEVTIHFRGADVKLSPSNLFRNISDEIMC-----SAFRGGNANIVYGRIMQ 350
++ P+ V + RG ++ + + N+ I C S+ G +NI+ G + Q
Sbjct: 347 VAMIPRLIGDLVFVFTRGVEILVPKERVLVNVGGGIHCVGIGRSSMLGAASNII-GNVHQ 405
Query: 351 INFLIGYDIEQAMVSFKPSRCTN 373
N + +D+ V F + C+
Sbjct: 406 QNLWVEFDVTNRRVGFAKADCSR 428
>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
Length = 641
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 102/368 (27%), Positives = 158/368 (42%), Gaps = 44/368 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQE--------PPLFDPKKSST 86
Y L IGTP + VD+GS T+ C C + + E P F P SST
Sbjct: 91 YTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLSST 150
Query: 87 YNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
Y+ + C+ T + C+Y Y A S SSG L + ++F S L +
Sbjct: 151 YSPVKCNVD----CTCDNERSQCTYERQY---AEMSSSSGVLGEDIMSFGKESELKPQ-- 201
Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLP--DQGSSK 202
+FGC + S GI+GLG G S++ Q+ I+ FS C D G
Sbjct: 202 RAVFGCENTETGDLFSQHAD-GIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGT 260
Query: 203 INFGGIVAGAGVV---STPLIIRDHYY-LSLEAISVGNQRLEF---VSSSTGNIFVDTGV 255
+ GG+ A +V S P +R YY + L+ I V + L + +S +D+G
Sbjct: 261 MVLGGMPAPPDMVFSHSNP--VRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTVLDSGT 318
Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY-----NISSQPK-FPEVTI 309
LP + K ++N + + K G +P + D+ C+ N+S + FP+V +
Sbjct: 319 TYAYLPEQAFVAFKDAVTNKVNSLK-KIRGPDPNYKDI-CFAGAGRNVSQLSEVFPDVDM 376
Query: 310 HF-RGADVKLSPSN-LFRN--ISDEIMCSAFRGG-NANIVYGRIMQINFLIGYDIEQAMV 364
F G + LSP N LFR+ + F+ G + + G I+ N L+ YD +
Sbjct: 377 VFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKI 436
Query: 365 SFKPSRCT 372
F + C+
Sbjct: 437 GFWKTNCS 444
>gi|340810959|gb|AEK75406.1| S5 [Oryza sativa]
gi|340810971|gb|AEK75412.1| S5 [Oryza rufipogon]
Length = 357
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 107/379 (28%), Positives = 159/379 (41%), Gaps = 66/379 (17%)
Query: 37 MHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP---PLFDPKKSSTYNSISCS 93
M +S+G PPV ++DTGS +W QC+PC + C Q P+FDP +S T + CS
Sbjct: 1 MAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59
Query: 94 SSQCA-------VVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
S +C + +NC E + C+YS YG G ++S G + T+TL +
Sbjct: 60 SVKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNG--WAYSVGKMVTDTLRIGDS------ 111
Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG--------KFSYCLP 196
+++FGC S + GI G G + S Q +AG FSYCLP
Sbjct: 112 FMDLMFGCSMDVKYS----EFEAGIFGFGSSSFSFFEQ----LAGYPDILSYKAFSYCLP 163
Query: 197 -DQGSSKINFGGIVAGAGVVS--TPL---IIRDHYYLSLEAISVGNQRLEFVSSSTGNIF 250
D+ G A + TPL I R Y L++E + QRL +S+ +
Sbjct: 164 TDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRL---VTSSSEMI 220
Query: 251 VDTGVLRT-LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY------------- 296
VD+G RT L P + K++ M + A +CY
Sbjct: 221 VDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQ--ESYICYLSEHDYSGWNGTI 278
Query: 297 -NISSQPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNA--NIVYGRIMQIN 352
S+ P + I F GA + L P N+F N +C F A + + G + +
Sbjct: 279 TPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPALRSQILGNRVTRS 338
Query: 353 FLIGYDIEQAMVSFKPSRC 371
F +DI+ FK + C
Sbjct: 339 FGTTFDIQGKQFGFKYAAC 357
>gi|79315693|ref|NP_001030891.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332646353|gb|AEE79874.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 499
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 96/378 (25%), Positives = 153/378 (40%), Gaps = 88/378 (23%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y M + +G+PP +DTGSD W QC PC DCF+Q
Sbjct: 170 YFMDVLVGSPPKHFSLILDTGSDLNWIQCLPC--YDCFQQN------------------- 208
Query: 95 SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFN-STSGLPVEMPNV---IF 150
C Y + YG +S ++G+ A ET T N +T+G E+ NV +F
Sbjct: 209 ----------DNQSCPYYYWYGD---SSNTTGDFAVETFTVNLTTNGGSSELYNVENMMF 255
Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSS--------- 201
GCGH N + G+ S SQ+ + FSYCL D+ S
Sbjct: 256 GCGHWNRGLFHGAAGLLGLGRG---PLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLIF 312
Query: 202 ----------KINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRL-------EFVSS 244
+NF VAG ++ YY+ +++I V + L S
Sbjct: 313 GEDKDLLSHPNLNFTSFVAGK-----ENLVDTFYYVQIKSILVAGEVLNIPEETWNISSD 367
Query: 245 STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSD--VL--CYNIS- 299
G +D+G + + +K+ ++ K G P + D +L C+N+S
Sbjct: 368 GAGGTIIDSGTTLSYFAEPAYEFIKNKIAE-------KAKGKYPVYRDFPILDPCFNVSG 420
Query: 300 -SQPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRG--GNANIVYGRIMQINFLI 355
+ PE+ I F GA N F ++++++C A G +A + G Q NF I
Sbjct: 421 IHNVQLPELGIAFADGAVWNFPTENSFIWLNEDLVCLAMLGTPKSAFSIIGNYQQQNFHI 480
Query: 356 GYDIEQAMVSFKPSRCTN 373
YD +++ + + P++C +
Sbjct: 481 LYDTKRSRLGYAPTKCAD 498
>gi|225217039|gb|ACN85323.1| aspartic proteinase nepenthesin-1 precursor [Oryza brachyantha]
Length = 287
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 84/289 (29%), Positives = 131/289 (45%), Gaps = 32/289 (11%)
Query: 101 TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASP 160
T CS G C Y YG G+Y + G A +TLT +S + FGCG +N
Sbjct: 13 TRGCSGGHCLYGVQYGDGSY---TIGFFAMDTLTLSSHD----AIKGFRFGCGERN---E 62
Query: 161 TSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--GSSKINFG---GIVAGAGVV 215
+ G++GLG G +SL Q G F++C P + G+ + FG A +
Sbjct: 63 GLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCFPARSSGTGYLEFGPGSSPAVSAKLS 122
Query: 216 STPLIIR---DHYYLSLEAISVGNQRL---EFVSSSTGNIFVDTGVLRTLLPLEYHSNLK 269
+TP++I YY+ + I VG + L + V ++ G I VD+G + T LP +S+L+
Sbjct: 123 TTPMLIDTGPTFYYVGMTGIRVGGKLLPIPQSVFAAAGTI-VDSGTVITRLPPAAYSSLR 181
Query: 270 SVMSNMIKAQPVKGVGAEPGFSDV-LCYNI--SSQPKFPEVTIHFRGA-DVKLSPSNLFR 325
S + + A+ K P S + CY++ +S+ P V++ F+G + + S +
Sbjct: 182 SAFAASMAARGYK---RAPALSLLDTCYDLTGASEVAIPTVSLLFQGGVSLDVDASGIIY 238
Query: 326 NISDEIMCSAFRGGNAN---IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
S C F G A + G F + YDI +V F P C
Sbjct: 239 AASVSQACLGFAGNEAADDVAIVGNTQLKTFGVVYDIASKVVGFCPGAC 287
>gi|88174577|gb|ABD39363.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 91/329 (27%), Positives = 148/329 (44%), Gaps = 45/329 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+ + +GTP +DTGS +W C E D P F +S+T +SC +
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSISWVFC----ECDGCHTNPRTFLQSRSTTCAKVSCGT 56
Query: 95 SQCAVVTSN--CSEG----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
S C + S+ C + DC + Y G S S G L +TLTF+ ++P+
Sbjct: 57 SMCLLGGSDPHCQDSENYPDCPFRVSYQDG---SASYGILYQDTLTFSDVQ----KIPSF 109
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK------ 202
FGC + + G++G+G G S++ Q + G FSYCLP Q S +
Sbjct: 110 TFGCNLDSFGA-NEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKT 167
Query: 203 ---INFGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSS--STGNIFVDT 253
+ G + V T ++ R + +++ L AISV +RL S S + D+
Sbjct: 168 TGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDS 227
Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHF 311
G + +P SV+S I+ ++ AE S+ CY++ S + P +++HF
Sbjct: 228 GSELSYIP----DRALSVLSQRIRELLLRRGAAEEE-SERNCYDMRSVDEGDMPAISLHF 282
Query: 312 -RGADVKLSPSNLF--RNISDE-IMCSAF 336
GA L S +F R++ ++ + C AF
Sbjct: 283 DDGARFDLGSSGVFVERSVQEQDVWCLAF 311
>gi|297597434|ref|NP_001043968.2| Os01g0696800 [Oryza sativa Japonica Group]
gi|255673588|dbj|BAF05882.2| Os01g0696800 [Oryza sativa Japonica Group]
Length = 334
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 92/332 (27%), Positives = 145/332 (43%), Gaps = 53/332 (15%)
Query: 77 PLFDPKKSSTYNSISCSSSQCA---------VVTSNCSEGDCSYSFLYGRGA-YASFSSG 126
PL P SS+ ++C C V G+CSY + YG ++ G
Sbjct: 13 PLLYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGGSGSGNCSYHYAYGNARDTHHYTEG 72
Query: 127 NLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS 186
L TET TF + P + FGC L S +G++GLG G SL++Q+
Sbjct: 73 ILMTETFTFGDDA---AAFPGIAFGC---TLRSEGGFGTGSGLVGLGRGKLSLVTQLNVE 126
Query: 187 IAGKFSYCLPDQGS--SKINFGGIVA-----GAGVVSTPL----IIRD--HYYLSLEAIS 233
F Y L S S I+FG + G +STPL +++D YY+ L IS
Sbjct: 127 ---AFGYRLSSDLSAPSPISFGSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGIS 183
Query: 234 VGNQRLEF--------VSSSTGNIFVDTGVLRTLLPLEYHSNLK-SVMSNMIKAQPVKGV 284
VG + ++ S+ G + D+G T+LP ++ ++ ++S M +P
Sbjct: 184 VGGKLVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAA 243
Query: 285 GAEPGFSDVLCYN-ISSQPKFPEVTIHFR-GADVKLSPSNLF-----RNISDEIMCSAFR 337
+ D++C+ SS FP + +HF GAD+ LS N +N S +
Sbjct: 244 NDD----DLICFTGGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNGETARCWSVVK 299
Query: 338 GGNANIVYGRIMQINFLIGYDIE-QAMVSFKP 368
A + G IMQ++F + +D+ A + F+P
Sbjct: 300 SSQALTIIGNIMQMDFHVVFDLSGNARMLFQP 331
>gi|196212948|gb|ACG76110.1| S5 [Oryza sativa Japonica Group]
gi|340810887|gb|AEK75370.1| S5 [Oryza sativa]
gi|340810903|gb|AEK75378.1| S5 [Oryza sativa]
gi|340810921|gb|AEK75387.1| S5 [Oryza sativa]
gi|340810955|gb|AEK75404.1| S5 [Oryza sativa]
gi|340811079|gb|AEK75466.1| S5 [Oryza nivara]
gi|340811090|gb|AEK75471.1| S5 [Oryza rufipogon]
gi|340811116|gb|AEK75484.1| S5 [Oryza nivara]
Length = 357
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 107/379 (28%), Positives = 159/379 (41%), Gaps = 66/379 (17%)
Query: 37 MHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP---PLFDPKKSSTYNSISCS 93
M +S+G PPV ++DTGS +W QC+PC + C Q P+FDP +S T + CS
Sbjct: 1 MAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59
Query: 94 SSQCA-------VVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
S +C + +NC E + C+YS YG G ++S G + T+TL +
Sbjct: 60 SVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNG--WAYSVGKMVTDTLRIGDS------ 111
Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG--------KFSYCLP 196
+++FGC S + GI G G + S Q +AG FSYCLP
Sbjct: 112 FMDLMFGCSMDVKYS----EFEAGIFGFGSSSFSFFEQ----LAGYPDILSYKAFSYCLP 163
Query: 197 -DQGSSKINFGGIVAGAGVVS--TPL---IIRDHYYLSLEAISVGNQRLEFVSSSTGNIF 250
D+ G A + TPL I R Y L++E + QRL +S+ +
Sbjct: 164 TDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRL---VTSSSEMI 220
Query: 251 VDTGVLRT-LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY------------- 296
VD+G RT L P + K++ M + A +CY
Sbjct: 221 VDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQ--ESYICYLSEHDYSGWNGTI 278
Query: 297 -NISSQPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNA--NIVYGRIMQIN 352
S+ P + I F GA + L P N+F N +C F A + + G + +
Sbjct: 279 TPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPALRSQILGNRVTRS 338
Query: 353 FLIGYDIEQAMVSFKPSRC 371
F +DI+ FK + C
Sbjct: 339 FGTTFDIQGKQFGFKYAAC 357
>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
Length = 378
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 95/377 (25%), Positives = 154/377 (40%), Gaps = 49/377 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCE-PCPELDCFKQEPP------LFDPKKSSTY 87
Y + +GTP DTGSD TW C+ C +C ++ +F SS++
Sbjct: 12 YSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANLSSSF 71
Query: 88 NSISCSSSQCAV------VTSNCSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNSTS 139
+I C + C + +NC C Y + Y G+ A G A ET+T
Sbjct: 72 KTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTAL---GFFANETVTVELKE 128
Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG 199
G +++ NV+ GC S G++GLG S + GKFSYCL D
Sbjct: 129 GRKMKLHNVLIGCSES--FQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCLVDHL 186
Query: 200 SSK-----INFGGIVAGAGVVS----TPLI---IRDHYYLSLEAISVGNQRLEFVS---- 243
S K + FG + +++ T L+ + Y +++ IS+G L+ S
Sbjct: 187 SHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEVWD 246
Query: 244 -SSTGNIFVDTGVLRTLLPL-EYHSNLKSVMSNMIKAQPVK-GVGAEPGFSDVLCYNIS- 299
G +D+G T L Y + ++ +++K + V+ +G C+N +
Sbjct: 247 VKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLE-----YCFNSTG 301
Query: 300 -SQPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAF--RGGNANIVYGRIMQINFLI 355
+ P + HF GA+ + + + +D + C F V G IMQ N L
Sbjct: 302 FEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNHLW 361
Query: 356 GYDIEQAMVSFKPSRCT 372
+D+ + F PS CT
Sbjct: 362 EFDLGLKKLGFAPSSCT 378
>gi|356542694|ref|XP_003539801.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 105/376 (27%), Positives = 174/376 (46%), Gaps = 50/376 (13%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +GTPP + + +DTGSD W C CP+ + + FDP+ SST + I
Sbjct: 76 LYYTKVKLGTPPREFYVQIDTGSDVLWVSCGSCNGCPQTSGLQIQLNYFDPRSSSTSSLI 135
Query: 91 SCSSSQC--AVVTSNCS----EGDCSYSFLYGRGAYAS-------FSSGNLATETLTFNS 137
SCS +C V TS+ S C+Y+F YG G+ S + TLT NS
Sbjct: 136 SCSDRRCRSGVQTSDASCSSQNNQCTYTFQYGDGSGTSGYYVSDLMHFAGIFEGTLTTNS 195
Query: 138 TSGLPVEMPNVIFGCGHKNLASPTSDSKQT-GIIGLGPGNSSLISQMGTS-IAGK-FSYC 194
++ +V+FGC T + GI G G S+ISQ+ IA + FS+C
Sbjct: 196 SA-------SVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSLQGIAPRVFSHC 248
Query: 195 LP--DQGSSKINFGGIVAGAGVVSTPLII-RDHYYLSLEAISVGNQRLE-----FVSSST 246
L + G + G IV +V +PL+ + HY L+L++ISV Q + F +S+
Sbjct: 249 LKGDNSGGGVLVLGEIVE-PNIVYSPLVQSQPHYNLNLQSISVNGQIVPIAPAVFATSNN 307
Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--- 303
VD+G L E ++ + ++ ++ Q V+ V + CY I++
Sbjct: 308 RGTIVDSGTTLAYLAEEAYNPFVNAITALVP-QSVRSVLSRGN----QCYLITTSSNVDI 362
Query: 304 FPEVTIHFR-GADVKLSPSNLF--RNISDE--IMCSAFRG--GNANIVYGRIMQINFLIG 356
FP+V+++F GA + L P + +N E + C F+ G + + G ++ + +
Sbjct: 363 FPQVSLNFAGGASLVLRPQDYLMQQNYIGEGSVWCIGFQRIPGQSITILGDLVLKDKIFV 422
Query: 357 YDIEQAMVSFKPSRCT 372
YD+ + + C+
Sbjct: 423 YDLAGQRIGWANYDCS 438
>gi|21618176|gb|AAM67226.1| unknown [Arabidopsis thaliana]
Length = 430
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 97/383 (25%), Positives = 158/383 (41%), Gaps = 72/383 (18%)
Query: 36 LMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPL----FDPKKSSTYNSIS 91
++ L IGTPP +DTGS +W QC K+ PP FDP SS+++++
Sbjct: 73 IISLPIGTPPQAQQMVLDTGSQLSWIQCH-------RKKLPPKPKTSFDPSLSSSFSTLP 125
Query: 92 CSSSQC-------AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
CS C + TS S C YS+ Y G +F+ GNL E +TF++T
Sbjct: 126 CSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADG---TFAEGNLVKEKITFSNTE----I 178
Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ------ 198
P +I GC T S GI+G+ G S +SQ S KFSYC+P +
Sbjct: 179 TPPLILGCA-------TESSDDRGILGMNRGRLSFVSQAKIS---KFSYCIPPKSNRPGF 228
Query: 199 ------------GSSKINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSS-- 244
S + ++ P + Y + + I G ++L S
Sbjct: 229 TPTGSFYLGDNPNSHGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVF 288
Query: 245 -----STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY--N 297
+G VD+G T L + +++ + + + KG G +D +C+ N
Sbjct: 289 RPDAGGSGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGY-VYGGTAD-MCFDGN 346
Query: 298 ISSQPKF--PEVTIHFRGADVKLSPSNLFRNISDEIMC-----SAFRGGNANIVYGRIMQ 350
++ P+ V + RG ++ + + N+ I C S+ G +NI+ G + Q
Sbjct: 347 VAMIPRLIGDLVFVFTRGVEIFVPKERVLVNVGGGIHCVGIGRSSMLGAASNII-GNVHQ 405
Query: 351 INFLIGYDIEQAMVSFKPSRCTN 373
N + +D+ V F + C+
Sbjct: 406 QNLWVEFDVTNRRVGFAKADCSR 428
>gi|88174589|gb|ABD39369.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 93/331 (28%), Positives = 151/331 (45%), Gaps = 49/331 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP +DTGS +W C E D P F +S+T +SC +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSASWVFC----ECDGCHTNPRTFLQSRSTTCAKVSCGT 56
Query: 95 SQCAVVTSN--CSEG----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
S C + S+ C + DC + Y G S S G L +TLTF+ ++P+
Sbjct: 57 SMCLLGGSDPHCQDSENYPDCPFRVSYQDG---SASYGILYQDTLTFSDVQ----KIPSF 109
Query: 149 IFGCGHKNLASPTSDS--KQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---- 202
FGC NL S ++ G++G+G G S++ Q + G FSYCLP Q S +
Sbjct: 110 TFGC---NLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFS 165
Query: 203 -----INFGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSS--STGNIFV 251
+ G + V T ++ R + +++ L AISV +RL S S +
Sbjct: 166 KTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVF 225
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTI 309
D+G + +P SV+S I+ ++ AE S+ CY++ S + P +++
Sbjct: 226 DSGSELSYIP----DRALSVLSQRIRELLLRRGAAEEE-SERNCYDMRSVDEGDMPAISL 280
Query: 310 HF-RGADVKLSPSNLF--RNISDE-IMCSAF 336
HF GA L +F R++ ++ + C AF
Sbjct: 281 HFDDGARFDLGSHGVFVERSVQEQDVWCLAF 311
>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 449
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 95/377 (25%), Positives = 154/377 (40%), Gaps = 49/377 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCE-PCPELDCFKQEPPLFDPKK------SSTY 87
Y + +GTP DTGSD TW C+ C +C ++ K+ SS++
Sbjct: 83 YSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANLSSSF 142
Query: 88 NSISCSSSQCAV------VTSNCSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNSTS 139
+I C + C + +NC C Y + Y G+ A G A ET+T
Sbjct: 143 KTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTAL---GFFANETVTVELKE 199
Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG 199
G +++ NV+ GC S G++GLG S + GKFSYCL D
Sbjct: 200 GRKMKLHNVLIGCSES--FQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCLVDHL 257
Query: 200 SSK-----INFGGIVAGAGVVS----TPLI---IRDHYYLSLEAISVGNQRLEFVS---- 243
S K + FG + +++ T L+ + Y +++ IS+G L+ S
Sbjct: 258 SHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEVWD 317
Query: 244 -SSTGNIFVDTGVLRTLLPL-EYHSNLKSVMSNMIKAQPVK-GVGAEPGFSDVLCYNIS- 299
G +D+G T L Y + ++ +++K + V+ +G C+N +
Sbjct: 318 VKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLE-----YCFNSTG 372
Query: 300 -SQPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAF--RGGNANIVYGRIMQINFLI 355
+ P + HF GA+ + + + +D + C F V G IMQ N L
Sbjct: 373 FEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNHLW 432
Query: 356 GYDIEQAMVSFKPSRCT 372
+D+ + F PS CT
Sbjct: 433 EFDLGLKKLGFAPSSCT 449
>gi|222632750|gb|EEE64882.1| hypothetical protein OsJ_19741 [Oryza sativa Japonica Group]
Length = 456
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 97/355 (27%), Positives = 144/355 (40%), Gaps = 38/355 (10%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPEL-DCFKQEPPL-FDPKKSSTYNSISC 92
Y + +GTP +DTGSD W P L +Q P + +N ++
Sbjct: 122 YFAQVGVGTPATTALMVLDTGSDVVWAPVRALPPLLRAVRQGSSTGAAPAPTPRWNCVAP 181
Query: 93 SSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
+ + C Y YG G S ++G+ A+ETLTF + + V GC
Sbjct: 182 ICRRLDSAGCDRRRNSCLYQVAYGDG---SVTAGDFASETLTFARGA----RVQRVAIGC 234
Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGA 212
GH N + S G+ S SQ+ S FSYCL D+ SS+ G
Sbjct: 235 GHDNEGLFIAASGLLGLGRG---RLSFPSQIARSFGRSFSYCLVDRTSSRRARPSRRWGG 291
Query: 213 GVVSTPLIIRDHYYLSLEAISVGNQRLEFVSSS---------TGNIFVDTGVLRTLLPLE 263
TP + YY+ L SVG R++ VS S G + +D+G T L
Sbjct: 292 ----TPRMA-TFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARP 346
Query: 264 YHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQP--KFPEVTIHFR-GADVKL 318
+ ++ + +A V G+ PG + CYN+S + K P V++H GA V L
Sbjct: 347 VYEAVR----DAFRAAAV-GLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMHLAGGASVAL 401
Query: 319 SPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
P N + + C A G + + + G I Q F + +D + V F P C
Sbjct: 402 PPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGFVPKSC 456
>gi|115465777|ref|NP_001056488.1| Os05g0591300 [Oryza sativa Japonica Group]
gi|113580039|dbj|BAF18402.1| Os05g0591300 [Oryza sativa Japonica Group]
Length = 453
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 61/180 (33%), Positives = 92/180 (51%), Gaps = 22/180 (12%)
Query: 32 DDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP---PLFDPKKSSTYN 88
D +L+ + +GTP V ++DTGS +W QC PC + C Q P+FDP SST+
Sbjct: 50 DFAFLIPVKLGTPAVQYLVTMDTGSSLSWVQCRPC-TIKCHVQPAKVGPIFDPSNSSTFR 108
Query: 89 SISCSSSQCA-------VVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTF--NS 137
+ CS+S C+ + + C E + C Y+ YG G ++S G T+ L
Sbjct: 109 HVGCSTSICSYLGRTLRIQSKACMEWEDICLYTMSYGGG--WAYSVGKAVTDRLVLGGGE 166
Query: 138 TSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK-FSYCLP 196
T+ + + N +FGC ++ + S K+ GI GLG N S Q+ ++ K FSYCLP
Sbjct: 167 TTRTTLSLANFVFGC---SMDTQYSTHKEAGIFGLGTSNYSF-EQIAPLLSYKAFSYCLP 222
>gi|220702733|gb|ACL81165.1| aspartyl protease [Mirabilis jalapa]
Length = 499
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 101/410 (24%), Positives = 155/410 (37%), Gaps = 83/410 (20%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCF-KQEPPLFDPKKSSTYNSISCS 93
Y + SI + + ++ +DTGSD W C P + C K EP P S + ISC
Sbjct: 94 YTLTFSINSQTLSVY--MDTGSDIVWFPCSPFECILCEGKFEPGTLTPLNVSKSSLISCK 151
Query: 94 SSQCA---------------------VVTSNCSEGDC-SYSFLYGRGAYASFSSGNLATE 131
S C+ + TS+CS C S+ + YG G+ + L
Sbjct: 152 SRACSTAHNSPSTSDLCAIAKCPLDEIETSDCSNYHCPSFYYAYGDGSLIA----KLHKH 207
Query: 132 TLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGT---SIA 188
L STS P + + FGC H L P G+ G G G+ SL +Q+ +
Sbjct: 208 NLIMPSTSNKPFSLKDFTFGCAHSALGEPI------GVAGFGFGSLSLPAQLANLSPDLG 261
Query: 189 GKFSYCLPDQG--SSKINFGGIVAGAGV-----------VSTPLIIRDH----YYLSLEA 231
+FSYCL S+K++ + V V TP++ Y +S+EA
Sbjct: 262 NQFSYCLVSHSFDSTKLHHPSPLILGKVKERDFDEITQFVYTPMLDNPKHPYFYSVSMEA 321
Query: 232 ISVGNQR-------LEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGV 284
ISVG+ R + G + VD+G T+LP +++++ + + + +
Sbjct: 322 ISVGSSRVRAPNALIRIDRDGNGGVVVDSGTTYTMLPTGFYNSVATELDRRVGRVFKRAS 381
Query: 285 GAEPGFSDVLCYNISSQPK------FPEVTIHFRG-ADVKLSPSNLFRNISD-------- 329
E CY + P + HF G V L N F D
Sbjct: 382 ETESKTGLSPCYYLEGNGVERLGLVVPRLAFHFGGNYSVVLPRRNYFYEFLDGEDEKKGR 441
Query: 330 EIMCSAFRGGNAN------IVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
++ C G G Q F + YD+E+ V F P +C +
Sbjct: 442 KVGCLMLMDGGDESEGGPGATLGNYQQQGFQVVYDLEERRVGFAPRKCAS 491
>gi|125527257|gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group]
Length = 484
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 98/414 (23%), Positives = 161/414 (38%), Gaps = 91/414 (21%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCE----------------PCPELDCFKQEPPL 78
Y + +GTP DTGSD TW +C P P ++
Sbjct: 87 YFVRFRVGTPAQPFLLVADTGSDLTWVKCHRAAAAASASPRNASSLPAPAPASPRRT--- 143
Query: 79 FDPKKSSTYNSISCSSSQC------AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATET 132
F P KS T+ I CSS+ C ++ C+Y + Y G+ A + G + + T
Sbjct: 144 FRPDKSRTWAPIPCSSATCRESLPFSLAACATPANPCAYDYRYKDGSAARGTVG-VDSAT 202
Query: 133 LTFNSTSGLPVEMPNVIFGC-----GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSI 187
+ + + ++ V+ GC G LAS G++ LG N S S+ +
Sbjct: 203 IALSGRAARKAKLRGVVLGCTTSYNGQSFLAS-------DGVLSLGYSNISFASRAASRF 255
Query: 188 AGKFSYCL-----PDQGSSKINFGGIVA------GAGVVS-------------------T 217
G+FSYCL P +S + FG A G+ S T
Sbjct: 256 GGRFSYCLVDHLAPRNATSYLTFGPNPAFSSRRPSEGIASCKPAPAPTPAPAGAPGARQT 315
Query: 218 PLII----RDHYYLSLEAISVGNQRLEFVSS-----STGNIFVDTGVLRTLLPLEYHSNL 268
PL++ R Y ++++ +SV + L+ + G +D+G T+L + +
Sbjct: 316 PLVLDHRTRPFYAVTVKGVSVAGELLKIPRAVWDVEQGGGAILDSGTSLTMLAKPAYRAV 375
Query: 269 KSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS------QPKFPEVTIHFRGADVKLSPSN 322
+ +S + P V +P F CYN +S P + +HF G+ P+
Sbjct: 376 VAALSKRLAGLPR--VTMDP-FD--YCYNWTSPSGSDVAAPLPMLAVHFAGSARLEPPAK 430
Query: 323 LFR-NISDEIMCSAFRGG--NANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
+ + + + C + G V G I+Q L YD++ + FK SRC +
Sbjct: 431 SYVIDAAPGVKCIGLQEGPWPGLSVIGNILQQEHLWEYDLKNRRLRFKRSRCMH 484
>gi|88174587|gb|ABD39368.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 93/331 (28%), Positives = 151/331 (45%), Gaps = 49/331 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP +DTGS +W C E D P F +S+T +SC +
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSASWVFC----ECDGCHTNPRTFLQSRSTTCAKVSCGT 56
Query: 95 SQCAVVTSN--CSEG----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
S C + S+ C + DC + Y G S S G L +TLTF+ ++P+
Sbjct: 57 SMCLLGGSDPHCQDSENYPDCPFRVSYQDG---SASYGILYQDTLTFSDVQ----KIPSF 109
Query: 149 IFGCGHKNLASPTSDS--KQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---- 202
FGC NL S ++ G++G+G G S++ Q + G FSYCLP Q S +
Sbjct: 110 TFGC---NLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFS 165
Query: 203 -----INFGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSS--STGNIFV 251
+ G + V T ++ R + +++ L AISV +RL S S +
Sbjct: 166 KTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVF 225
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTI 309
D+G + +P SV+S I+ ++ AE S+ CY++ S + P +++
Sbjct: 226 DSGSELSYIP----DRALSVLSQRIRELLLRRGAAEEE-SERNCYDMRSVDEGDMPAISL 280
Query: 310 HF-RGADVKLSPSNLF--RNISDE-IMCSAF 336
HF GA L +F R++ ++ + C AF
Sbjct: 281 HFDDGARFDLGSHGVFVERSVQEQDVWCLAF 311
>gi|88174597|gb|ABD39373.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174601|gb|ABD39375.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174603|gb|ABD39376.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 94/332 (28%), Positives = 153/332 (46%), Gaps = 51/332 (15%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP +DTGS +W C E D P F +S+T +SC +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFC----ECDGCHTNPRTFLQSRSTTCAKVSCGT 56
Query: 95 SQCAVVTSN--CSEG----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
S C + S+ C + DC + Y G S S G L +TLTF+ ++P+
Sbjct: 57 SMCLLGGSDPHCQDSENYPDCPFRVSYQDG---SASYGILYQDTLTFSDVQ----KIPSF 109
Query: 149 IFGCGHKNLASPTSDS--KQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---- 202
FGC NL S ++ G++G+G G S++ Q + G FSYCLP Q S +
Sbjct: 110 TFGC---NLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFS 165
Query: 203 -----INFGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEF---VSSSTGNIF 250
+ G + V T ++ R + +++ L AISV +RL + S G +F
Sbjct: 166 KTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVF 225
Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVT 308
D+G + +P SV+S I+ ++ AE S+ CY++ S + P ++
Sbjct: 226 -DSGSELSYIP----DRALSVLSQRIRELLLRRGAAEEE-SERNCYDMRSVDEGDMPAIS 279
Query: 309 IHF-RGADVKLSPSNLF--RNISDE-IMCSAF 336
+HF GA L +F R++ ++ + C AF
Sbjct: 280 LHFDDGARFDLGSHGVFVERSVQEQDVWCLAF 311
>gi|88174567|gb|ABD39358.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
Length = 323
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 93/332 (28%), Positives = 151/332 (45%), Gaps = 49/332 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP +DTGS +W C E D P F +S+T +SC +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFC----ECDGCHTNPRTFLQSRSTTCAKVSCGT 56
Query: 95 SQCAVVTSN--CSEG----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
S C + S+ C + DC + Y G S S G L +TLTF+ ++P
Sbjct: 57 SMCLLGGSDPHCQDSENYPDCPFRVSYQDG---SASYGILYQDTLTFSDVQ----KIPGF 109
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK------ 202
FGC + + G++G+G G S++ Q + G FSYCLP Q S +
Sbjct: 110 TFGCNMDSFGA-NEFGNVDGLLGMGAGQMSVLKQSSPTFDG-FSYCLPLQMSERGFFSKT 167
Query: 203 ---INFGGIVAG--AGVVSTPLIIR----DHYYLSLEAISVGNQRLEF---VSSSTGNIF 250
+ GG +A V T ++ R + +++ L AISV +RL + S G +F
Sbjct: 168 TGYFSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVF 227
Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVT 308
D+G + +P SV+S I+ ++ AE S+ CY++ S + P ++
Sbjct: 228 -DSGSELSYIP----DRALSVLSQRIRELLLRRGAAEEE-SERNCYDMRSVDEGDMPAIS 281
Query: 309 IHF-RGADVKLSPSNLF--RNISDE-IMCSAF 336
+HF GA L +F R++ ++ + C AF
Sbjct: 282 LHFDDGARFDLGSHGVFVERSVQEQDVWCLAF 313
>gi|21717171|gb|AAM76364.1|AC074196_22 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433290|gb|AAP54828.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|125532789|gb|EAY79354.1| hypothetical protein OsI_34483 [Oryza sativa Indica Group]
Length = 382
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 99/370 (26%), Positives = 158/370 (42%), Gaps = 54/370 (14%)
Query: 39 LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA 98
+IGTPP +D G WTQC C CF QE P FDP KSSTY C ++ C
Sbjct: 28 FTIGTPPQPASAFIDVGGLLVWTQCSQCSSSSCFNQELPPFDPTKSSTYRPEPCGTALCE 87
Query: 99 VVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHK 155
+ NCS C+Y +SG + T+ + + + +V FGC
Sbjct: 88 FFPASIRNCSGDVCAYE---ASTQLFEHTSGKIGTDAVAIGTATAA-----SVAFGC--- 136
Query: 156 NLASPTS--DSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-PDQGSSKIN-------- 204
+AS D +G +GL SL++QM + FS+CL P G N
Sbjct: 137 VMASDIKLMDGGPSGFVGLARTPLSLVAQMNVT---AFSHCLAPHDGGGGKNSRLFLGAA 193
Query: 205 -FGGIVAGAGVVSTPLI------IRDHYYL-SLEAISVGNQRLEFVSSSTGNIFVDTGVL 256
+ ++TP + I+ YYL +LE I G++ + V S + + T
Sbjct: 194 AKLAGGGKSAAMTTPFVKSSPDDIKSLYYLINLEGIKAGDEAIITVPQSGRTVLLQTFSP 253
Query: 257 RTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEP--GFSDV--LCYNISSQPKFPEVTIHFR 312
+ L + +LK ++ V G A P F + LC+ P+V + F+
Sbjct: 254 VSFLVDGVYQDLKKAVTAA-----VGGPTATPPEQFQSIFDLCFKRGGVSGAPDVVLTFQ 308
Query: 313 G-ADVKLSPSNLFRNISDEIMCSAF----RGGNANI----VYGRIMQINFLIGYDIEQAM 363
G A + + P+N ++ D+ +C A R + + + G + Q N YD+E+
Sbjct: 309 GAAALTVPPTNYLLDVGDDTVCVAIASSARLNSTEVAGMSILGGLQQQNVHFLYDLEKET 368
Query: 364 VSFKPSRCTN 373
+SF+ + C++
Sbjct: 369 LSFEAADCSS 378
>gi|88174571|gb|ABD39360.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
Length = 321
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 90/329 (27%), Positives = 147/329 (44%), Gaps = 45/329 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP +DTGS +W C E D P F +S+T +SC +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFC----ECDGCHTNPRTFLQSRSTTCAKVSCGT 56
Query: 95 SQCAVVTSN--CSEG----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
S C + S+ C + DC + Y G S S G L +TLTF+ ++P+
Sbjct: 57 SMCLLGGSDPHCQDSENYPDCPFRVSYQDG---SASYGILYQDTLTFSDVQ----KIPSF 109
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK------ 202
FGC + + G++G+G G S++ Q G FSYCLP Q S +
Sbjct: 110 TFGCNLDSFGA-NEFGNVDGLLGMGAGPMSVLKQSSPRFDG-FSYCLPLQKSERGFFSKT 167
Query: 203 ---INFGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSS--STGNIFVDT 253
+ G + V T ++ R + +++ L AISV +RL S S + D+
Sbjct: 168 TGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDS 227
Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHF 311
G + +P SV+S I+ ++ AE S+ CY++ S + P +++HF
Sbjct: 228 GSELSYIP----DRALSVLSQRIRELLLRRGAAEEE-SERNCYDMRSVDEGDMPAISLHF 282
Query: 312 -RGADVKLSPSNLF--RNISDE-IMCSAF 336
GA L +F R++ ++ + C AF
Sbjct: 283 DDGARFDLGSKGVFVERSVQEQDVWCLAF 311
>gi|359483137|ref|XP_002272278.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 402
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 85/317 (26%), Positives = 141/317 (44%), Gaps = 43/317 (13%)
Query: 82 KKSSTYNSISCSSSQCAVVTSN---CSEGD--CSYSFLYGRGAYASFSSGNLATETLTFN 136
K++ N+ S++Q V + N C C+Y+ YG G SF+ G L E L F
Sbjct: 101 KRTVPSNTEDVSNAQIPVTSGNSGVCGSAAPICNYAINYGDG---SFTRGELGHEKLKFG 157
Query: 137 STSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP 196
+ + + + IFGCG N S G++GLG + SLISQ G FSYCLP
Sbjct: 158 T-----ILVKDFIFGCGRNNKGLFGGVS---GLMGLGRSDLSLISQTSGIFGGVFSYCLP 209
Query: 197 D---QGSSKINFGG---------IVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSS 244
+GS + GG ++ A ++ P + + Y+++L IS+G L+ S
Sbjct: 210 STERKGSGSLILGGNSSVYRNSSPISYAKMIENPQLY-NFYFINLTGISIGGVALQAPSV 268
Query: 245 STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISS--Q 301
I VD+G + T LP + LK+ P P FS + C+N+S+ +
Sbjct: 269 GPSRILVDSGTVITRLPPTIYKALKAEFLKQFTGFP-----PAPAFSILDTCFNLSAYQE 323
Query: 302 PKFPEVTIHFRG-ADVKLSPSNLFRNISDE-----IMCSAFRGGNANIVYGRIMQINFLI 355
P + +HF G A++ + + +F + + + ++ + + G Q N +
Sbjct: 324 VDIPTIKMHFEGNAELTVDVTGVFYFVKSDASQVCLALASLEYQDEVAILGNYQQKNLRV 383
Query: 356 GYDIEQAMVSFKPSRCT 372
YD ++ V F C+
Sbjct: 384 IYDTKETKVGFALETCS 400
>gi|357464807|ref|XP_003602685.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355491733|gb|AES72936.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 440
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 78/259 (30%), Positives = 113/259 (43%), Gaps = 27/259 (10%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCE-PCPELDCFKQEPPLFDPKKSSTYNSISC 92
Y + ++IG PP F +DTGSD TW QC+ PC C + PL+ P + + C
Sbjct: 84 FYNVTINIGYPPRPYFLDIDTGSDLTWLQCDAPCSR--CSQTPHPLYRPSN----DLVPC 137
Query: 93 SSSQCAVV--TSNCS---EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
CA V T N E C Y Y Y+S G L + N T+G+ +++
Sbjct: 138 RHPLCASVHQTDNYECEVEHQCDYEVEYA-DHYSSL--GVLVNDVYVLNFTNGVQLKV-R 193
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQGSSKINF 205
+ GCG+ + +S G++GLG G SSLISQ+ + +CL QG I F
Sbjct: 194 MALGCGYDQIFPDSSYHPVDGMLGLGRGKSSLISQLNGQGLVRNVVGHCLSAQGGGYIFF 253
Query: 206 GGIVAGAGVVSTPLIIRD--HYYLSLEAISVGNQRLEFVSSSTGNIFV--DTGVLRTLLP 261
G + + + TP+ RD HY + +G +R F GN+ D G T
Sbjct: 254 GDVYDSSRLAWTPMSSRDYKHYSAGAAELVLGGKRTGF-----GNLLAVFDAGSSYTYFN 308
Query: 262 LEYHSNLKSVMSNMIKAQP 280
+ K + IK P
Sbjct: 309 SNAYQLTKELAGKPIKEAP 327
>gi|326525377|dbj|BAK07958.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 91/361 (25%), Positives = 147/361 (40%), Gaps = 56/361 (15%)
Query: 51 SVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSS-QCAVVTSNCSEGDC 109
++D G +W QC PC C Q P+FDP KS T+++I ++ C + G C
Sbjct: 114 ALDMGGGLSWMQCLPC--RHCLLQMSPVFDPTKSPTFSNIPAHNTVWCRPPYQPLANGAC 171
Query: 110 SYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGI 169
+ Y +A SG LA +T +F + + V + ++FGC H+ + GI
Sbjct: 172 GFDIAYRDNTHA---SGYLARDTFSFPAGNDDFVPLSAIVFGCAHQT-EHFKNQRAVAGI 227
Query: 170 IGLGPGNS-----SLISQMGTSIAGKFSYCLPDQGSSK---INFGGIV-----AGAGVVS 216
+GLG G + + Q+ + G+FSYC G S + FG + S
Sbjct: 228 LGLGMGPAGKPPTAFTKQVLPAHGGRFSYCPFVPGMSMYSYLRFGSDIPSHPPPNVHRQS 287
Query: 217 TPLIIRDH----YYLSLEAISVGNQRLEFVSSS--------TGNIFVDTGVLRTLLPLEY 264
TP++ H Y++ L +SVG RL V+ + G VD G T
Sbjct: 288 TPVLAPAHNSEAYFVKLAGVSVGANRLSGVTPAMFRRNAHGAGGCVVDIGTRMTAFIHSA 347
Query: 265 HSNLKSVMSNMIKAQ-----PVKG---VGAEPGFSDVLCYNISSQPKFPEVTIHFR-GAD 315
+ ++ + ++ + V+G V DVL P +T+HF GA
Sbjct: 348 YVHIDHAVRQHLQRRGAHIVVVRGNTCVQQPAPHHDVL----------PSMTLHFENGAW 397
Query: 316 VKLSPSNLFRNI---SDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQA--MVSFKPSR 370
+++ P ++F C F V G Q+N +D+ ++SF P
Sbjct: 398 LRVMPEHVFMPFVVGGHHYQCFGFVSSTDLTVIGARQQVNHRFIFDLHDTIPIMSFNPED 457
Query: 371 C 371
C
Sbjct: 458 C 458
>gi|88174581|gb|ABD39365.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 90/329 (27%), Positives = 147/329 (44%), Gaps = 45/329 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP +DTGS +W C E D P F +S+T +SC +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFC----ECDGCHTNPRTFLQSRSTTCAKVSCGT 56
Query: 95 SQCAVVTSN--CSEG----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
S C + S+ C + DC + Y G S S G L +TLTF+ ++P+
Sbjct: 57 SMCLLGGSDPHCQDSENYPDCPFRVSYQDG---SASYGILYQDTLTFSDVQ----KIPSF 109
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK------ 202
FGC + + G++G+G G S++ Q G FSYCLP Q S +
Sbjct: 110 TFGCNLDSFGA-NEFGNVDGLLGMGAGPMSVLKQSSPRFDG-FSYCLPLQKSERGFFSKT 167
Query: 203 ---INFGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSS--STGNIFVDT 253
+ G + V T ++ R + +++ L AISV +RL S S + D+
Sbjct: 168 TGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDS 227
Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHF 311
G + +P SV+S I+ ++ AE S+ CY++ S + P +++HF
Sbjct: 228 GSELSYIP----DRALSVLSQRIRELLLRRGAAEEE-SERNCYDMRSVDEGDMPAISLHF 282
Query: 312 -RGADVKLSPSNLF--RNISDE-IMCSAF 336
GA L +F R++ ++ + C AF
Sbjct: 283 DDGARFDLGRRGVFVERSVQEQDVWCLAF 311
>gi|326533786|dbj|BAK05424.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 412
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 87/339 (25%), Positives = 138/339 (40%), Gaps = 29/339 (8%)
Query: 51 SVDTGSDCTWTQCEPC-PELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNC-SEGD 108
++DT + +W CEPC P L Q LF P +S T+ + C S
Sbjct: 84 ALDTAASTSWVMCEPCRPPL---HQLGRLFSPAESPTFRGVRRDDPVCVPPYHRLHSTNG 140
Query: 109 CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTG 168
CS++F G A + +E S SG V FGC H D G
Sbjct: 141 CSFAFPSAIGYLARDTFHLRHSERSVVKSISG-------VAFGCAHTTTGFYNED-ILGG 192
Query: 169 IIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGVVSTP--------LI 220
++ L P S ++Q G+ G+FSYCLPD +S G I G V S P +
Sbjct: 193 VLSLSPSPLSFLTQFGSRAGGRFSYCLPDPTTSHNPSGFIQFGIEVPSLPRHAHTTTLTV 252
Query: 221 IRDHYYLSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNM-- 275
Y+LSL IS+GN+RL+ + +S G + + Y + +M+ M
Sbjct: 253 SASGYHLSLIGISLGNKRLDIDRHILTSHGCSINPAETITKIAEPAYIIVARELMAQMNE 312
Query: 276 IKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCS 334
+ ++ VKG + P + + + + + P + HF G D+ + LF+ I
Sbjct: 313 LGSKQVKGPPSSPLVFNKISRRVRA--RLPNMVFHFADGGDMWFTAGKLFQVIGTTARFL 370
Query: 335 AFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
G+ V G Q+N +++ ++F C+
Sbjct: 371 VEGHGSHRTVIGAAQQVNARFIFNVAAGRLTFAEELCSR 409
>gi|88174605|gb|ABD39377.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 93/331 (28%), Positives = 150/331 (45%), Gaps = 49/331 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP +DTGS +W C E D P F +S+T +SC +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFC----ECDGCHTNPRTFLQSRSTTCAKVSCGT 56
Query: 95 SQCAVVTSN--CSEG----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
S C + S+ C + DC + Y G S S G L +TLTF+ ++P+
Sbjct: 57 SMCLLGGSDPHCQDSENYPDCPFRVSYQDG---SASYGILYQDTLTFSDVQ----KIPSF 109
Query: 149 IFGCGHKNLASPTSDS--KQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---- 202
FGC NL S ++ G++G+G G S++ Q G FSYCLP Q S +
Sbjct: 110 TFGC---NLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDG-FSYCLPLQKSERGFFS 165
Query: 203 -----INFGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSS--STGNIFV 251
+ G + V T ++ R + +++ L AISV +RL S S +
Sbjct: 166 KTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVF 225
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTI 309
D+G + +P SV+S I+ ++ AE S+ CY++ S + P +++
Sbjct: 226 DSGSELSYIP----DRALSVLSQRIRELLLRRGAAEEE-SERNCYDMRSVDEGDMPAISL 280
Query: 310 HF-RGADVKLSPSNLF--RNISDE-IMCSAF 336
HF GA L +F R++ ++ + C AF
Sbjct: 281 HFDDGARFDLGSHGVFVERSVQEQDVWCLAF 311
>gi|340810981|gb|AEK75417.1| S5 [Oryza rufipogon]
Length = 357
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 107/379 (28%), Positives = 159/379 (41%), Gaps = 66/379 (17%)
Query: 37 MHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP---PLFDPKKSSTYNSISCS 93
M +S+G PPV ++DTGS +W QC+PC + C Q P+FDP +S T + CS
Sbjct: 1 MAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59
Query: 94 SSQCA-------VVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
S +C + +NC E + C+YS YG G ++S G + T+TL +
Sbjct: 60 SVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGW--AYSVGKMVTDTLRIGDS------ 111
Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG--------KFSYCLP 196
+++FGC S + GI G G + S Q+ AG SYCLP
Sbjct: 112 FMDLMFGCSMDVKYSEF----EAGIFGFGSSSFSFFEQL----AGYPDILSYKALSYCLP 163
Query: 197 -DQGSSKINFGGIVAGAGVVS--TPL---IIRDHYYLSLEAISVGNQRLEFVSSSTGNIF 250
D+ G A + TPL I R Y L++E + QRL +S+ +
Sbjct: 164 TDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRL---VTSSSEMI 220
Query: 251 VDTGVLRT-LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY------------- 296
VD+G RT L P + K++ M + A +CY
Sbjct: 221 VDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQ--ESYICYLSEHDYSGWNGTI 278
Query: 297 -NISSQPKFPEVTIHFRG-ADVKLSPSNLFRNISDEIMCSAFRGGNA--NIVYGRIMQIN 352
S+ P + I F G A + LSP N+F N +C F A + + G + +
Sbjct: 279 TPFSNWSALPLLEIGFAGGAALALSPRNVFYNDPHRGLCMTFAQNPALRSQILGNRVTRS 338
Query: 353 FLIGYDIEQAMVSFKPSRC 371
F +DI+ FK + C
Sbjct: 339 FGTTFDIQGKQFGFKYAVC 357
>gi|88174565|gb|ABD39357.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
Length = 323
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 93/332 (28%), Positives = 151/332 (45%), Gaps = 49/332 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP +DTGS +W C E D P F +S+T +SC +
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFC----ECDGCHTNPRTFLQSRSTTCAKVSCGT 56
Query: 95 SQCAVVTSN--CSEG----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
S C + S+ C + DC + Y G S S G L +TLTF+ ++P
Sbjct: 57 SMCLLGGSDPHCQDSENYPDCPFRVSYQDG---SASYGILYQDTLTFSDVQ----KIPGF 109
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK------ 202
FGC + + G++G+G G S++ Q + G FSYCLP Q S +
Sbjct: 110 TFGCNMDSFGA-NEFGNVDGLLGMGAGQMSVLKQSSPTFDG-FSYCLPLQMSERGFFSKT 167
Query: 203 ---INFGGIVAG--AGVVSTPLIIR----DHYYLSLEAISVGNQRLEF---VSSSTGNIF 250
+ GG +A V T ++ R + +++ L AISV +RL + S G +F
Sbjct: 168 TGYFSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVF 227
Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVT 308
D+G + +P SV+S I+ ++ AE S+ CY++ S + P ++
Sbjct: 228 -DSGSELSYIP----DRALSVLSQRIRELLLRRGAAEEE-SERNCYDMRSVDEGDMPAIS 281
Query: 309 IHF-RGADVKLSPSNLF--RNISDE-IMCSAF 336
+HF GA L +F R++ ++ + C AF
Sbjct: 282 LHFDDGARFDLGRHGVFVERSVQEQDVWCLAF 313
>gi|359806832|ref|NP_001241567.1| uncharacterized protein LOC100819698 precursor [Glycine max]
gi|255638149|gb|ACU19388.1| unknown [Glycine max]
Length = 437
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 89/366 (24%), Positives = 149/366 (40%), Gaps = 42/366 (11%)
Query: 29 ISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYN 88
I+ Y++ IGTP + ++DT +D +W C C + C P F P KS+T+
Sbjct: 92 ITQSPTYIVKAKIGTPAQTLLLAMDTSNDASWVPCTAC--VGCSTTTP--FAPAKSTTFK 147
Query: 89 SISCSSSQCAVVTS-NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
+ C +SQC V + C C+++F YG + A+ +L +T+T + +P
Sbjct: 148 KVGCGASQCKQVRNPTCDGSACAFNFTYGTSSVAA----SLVQDTVTLATD-----PVPA 198
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD----QGSSKI 203
FGC K S G+ + ++ S FSYCLP S +
Sbjct: 199 YAFGCIQKVTGSSVPPQGLLGLGRGPLSLLAQTQKLYQST---FSYCLPSFKTLNFSGSL 255
Query: 204 NFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGN-------QRLEFVSSSTGNIFVD 252
G + + TPL+ YY++L AI VG + L F +++ D
Sbjct: 256 RLGPVAQPKRIKFTPLLKNPRRSSLYYVNLVAIRVGRRIVDIPPEALAFNANTGAGTVFD 315
Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFR 312
+G + T L ++ +++ I V + GF CY ++ P +T F
Sbjct: 316 SGTVFTRLVEPAYNAVRNEFRRRIAVHKKLTVTSLGGFDT--CY--TAPIVAPTITFMFS 371
Query: 313 GADVKLSPSN-LFRNISDEIMCSAFRGGNANI-----VYGRIMQINFLIGYDIEQAMVSF 366
G +V L P N L + + + C A N+ V + Q N + +D+ + +
Sbjct: 372 GMNVTLPPDNILIHSTAGSVTCLAMAPAPDNVNSVLNVIANMQQQNHRVLFDVPNSRLGV 431
Query: 367 KPSRCT 372
CT
Sbjct: 432 ARELCT 437
>gi|224063191|ref|XP_002301033.1| predicted protein [Populus trichocarpa]
gi|222842759|gb|EEE80306.1| predicted protein [Populus trichocarpa]
Length = 536
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 105/371 (28%), Positives = 162/371 (43%), Gaps = 41/371 (11%)
Query: 31 VDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCE--PCPELDC------FKQEPPLFDPK 82
+D ++ + IGTP V ++D GSD W C+ C L ++ + P
Sbjct: 103 LDWLHYTWIDIGTPNVSFLVALDAGSDLLWVPCDCIQCAPLSASYYNISLDRDLSEYSPS 162
Query: 83 KSSTYNSISCSSSQCAVVTSNCS--EGDCSYSFLYGRGAYASFSSGNLATETLTFNST-- 138
SST +SC C SNC + C Y F Y + S+G L + L S
Sbjct: 163 LSSTSRHLSCDHQLCE-WGSNCKNPKDPCPYIFNYDDFENTT-SAGFLVEDKLHLASVGD 220
Query: 139 -SGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCL 195
+ + +V+ GCG K S + G++GLGPG+ S+ S + + I FS C
Sbjct: 221 HTARKMLQASVVLGCGRKQGGSFFDGAAPDGVMGLGPGDISVPSLLAKAGLIQNCFSLCF 280
Query: 196 PDQGSSKINFGGIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSSTGNIFV 251
+ S +I FG A STP + Y++ +E+ VGN L+ S V
Sbjct: 281 DENDSGRILFGD-RGHASQQSTPFLPIQGTYVAYFVGVESYCVGNSCLK---RSGFKALV 336
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTI 309
D+G T LP E ++ L S + A K + + G D CYN SSQ P + +
Sbjct: 337 DSGSSFTYLPSEVYNELVSEFDKQVNA---KRISFQDGLWDY-CYNASSQELHDIPAIQL 392
Query: 310 HF-RGAD-VKLSPS-NLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGY----DIEQA 362
F R + V +P+ ++ + + C + + + + YG I Q NF+IGY DIE
Sbjct: 393 KFPRNQNFVVHNPTYSIPHHQGFTMFCLSLQPTDGS--YGIIGQ-NFMIGYRMVFDIENL 449
Query: 363 MVSFKPSRCTN 373
+ + S C +
Sbjct: 450 KLGWSNSSCQD 460
>gi|88174558|gb|ABD39354.1| chloroplast nucleoid DNA-binding protein [Oryza meridionalis]
Length = 321
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 93/332 (28%), Positives = 151/332 (45%), Gaps = 51/332 (15%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP +DTGS +W C E D P F +S+T +SC +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFC----ECDGCHTNPRTFLQSRSTTCAKVSCGT 56
Query: 95 SQCAVVTSN--CSEG----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
S C + S+ C + DC + Y G S S G L +TLTF+ ++P
Sbjct: 57 SMCLLGGSDPHCQDSENYPDCPFRVSYQDG---SASYGILYQDTLTFSDVQ----KIPGF 109
Query: 149 IFGCGHKNLASPTSDS--KQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---- 202
FGC NL S ++ G++G+G G S++ Q + G FSYCLP Q S +
Sbjct: 110 TFGC---NLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQMSERGFFS 165
Query: 203 -----INFGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEF---VSSSTGNIF 250
+ G + V T ++ R + +++ L AISV +RL V S G +F
Sbjct: 166 KTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRKGVVF 225
Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVT 308
D+G + +P S L+ + ++ + GA S+ CY++ S + P ++
Sbjct: 226 -DSGSELSYIPDRALSVLRQRIRELLLKR-----GAAEEESERNCYDMRSVDEGDMPAIS 279
Query: 309 IHF-RGADVKLSPSNLF--RNISDE-IMCSAF 336
+HF GA L +F R++ ++ + C AF
Sbjct: 280 LHFDDGARFDLGSHGVFVERSVQEQDVWCLAF 311
>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 96/355 (27%), Positives = 154/355 (43%), Gaps = 40/355 (11%)
Query: 41 IGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV 100
IGTPP + VDTGS T+ C C + C + P F P S TY+ + C + C
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCNSCDQ--CGNHQDPKFQPDLSDTYHPVKC-NPDCTCD 58
Query: 101 TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASP 160
T N C+Y Y A S SSG L + ++F + S L + +FGC +
Sbjct: 59 TEN---DQCTYERQY---AEMSSSSGILGEDLVSFGNMSELKPQ--RAVFGCENAETGDL 110
Query: 161 TSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQGSSKINFGGIVAGAGVVSTP 218
S GI+GLG G+ S++ Q+ I FS C G ++ G +V G +
Sbjct: 111 FSQHAD-GIMGLGRGDLSIVDQLVEKGVINDSFSLCY---GGMEVGGGAMVLGQISPPSD 166
Query: 219 LII------RDHYY-LSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTLLP-LEYHSN 267
++ R YY + L + V ++L+ V +D+G LP +
Sbjct: 167 MVFSHSDPDRSPYYNIELRGLHVAGKKLDINPQVFDGKHGTILDSGTTYAYLPEAAFLPF 226
Query: 268 LKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ-----PKFPEVTIHF-RGADVKLSPS 321
++++ S + + ++ G +P ++DV S+ FP V + F G LSP
Sbjct: 227 IQAITSELHGLKQIR--GPDPNYNDVCFSGAGSEIPELYKTFPSVDMVFDNGEKYSLSPE 284
Query: 322 N-LFRN--ISDEIMCSAFRGG-NANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
N LF++ + F+ G + + G I+ N L+ YD E + V F + C+
Sbjct: 285 NYLFKHSKVHGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREHSKVGFWKTNCS 339
>gi|255685712|gb|ACU28345.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
Length = 91
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 47/106 (44%), Positives = 61/106 (57%), Gaps = 15/106 (14%)
Query: 37 MHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQ 96
M L IGTPP +I +DTGS+ WTQC PC L C+ Q+ P+FDP KSST+ C
Sbjct: 1 MKLQIGTPPFEIEAVLDTGSELIWTQCLPC--LHCYDQKAPIFDPSKSSTFKETRC---- 54
Query: 97 CAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
N + C Y +Y +Y + G LATET+T +STSG+P
Sbjct: 55 ------NTPDHSCXYKIVYDDKSY---TQGTLATETVTIHSTSGVP 91
>gi|356508308|ref|XP_003522900.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 439
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 105/400 (26%), Positives = 170/400 (42%), Gaps = 52/400 (13%)
Query: 1 AQNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTW 60
A++ +L F +S + I +II Y++ IG+PP + ++DT +D W
Sbjct: 65 AKDQARLQFLASMVAGRSVVPIASGRQIIQ-SPTYIVRAKIGSPPQTLLLAMDTSNDAAW 123
Query: 61 TQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQC-AVVTSNCSEGDCSYSFLYGRGA 119
C C C LF P+KS+T+ ++SC S QC V +C C+++ YG +
Sbjct: 124 IPCTACD--GCTST---LFAPEKSTTFKNVSCGSPQCNQVPNPSCGTSACTFNLTYGSSS 178
Query: 120 YASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSL 179
A+ N+ +T+T + +P+ FGC K + + G++GLG G SL
Sbjct: 179 IAA----NVVQDTVTLATD-----PIPDYTFGCVAKTTG---ASAPPQGLLGLGRGPLSL 226
Query: 180 ISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGVVSTPLIIR-----------DHYYLS 228
+SQ FSYCLP S +NF G + G V+ P+ I+ YY++
Sbjct: 227 LSQTQNLYQSTFSYCLPSFKS--LNFSGSLR-LGPVAQPIRIKYTPLLKNPRRSSLYYVN 283
Query: 229 LEAISVGN-------QRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSN--MIKAQ 279
L AI VG + L F +++ D+G + T L ++ ++ I A+
Sbjct: 284 LVAIRVGRKVVDIPPEALAFNAATGAGTVFDSGTVFTRLVAPAYTAVRDEFQRRVAIAAK 343
Query: 280 PVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSN-LFRNISDEIMCSAFRG 338
V + GF CY + P +T F G +V L N L + + C A
Sbjct: 344 ANLTVTSLGGFDT--CYTVPI--VAPTITFMFSGMNVTLPEDNILIHSTAGSTTCLAMAS 399
Query: 339 GNANI-----VYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
N+ V + Q N + YD+ + + CT
Sbjct: 400 APDNVNSVLNVIANMQQQNHRVLYDVPNSRLGVARELCTK 439
>gi|343172996|gb|AEL99201.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 96/355 (27%), Positives = 154/355 (43%), Gaps = 40/355 (11%)
Query: 41 IGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV 100
IGTPP + VDTGS T+ C C + C + P F P S TY+ + C + C
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCNSCDQ--CGNHQDPKFQPDLSDTYHPVKC-NPDCTCD 58
Query: 101 TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASP 160
T N C+Y Y A S SSG L + ++F + S L + +FGC +
Sbjct: 59 TEN---DQCTYERQY---AEMSSSSGILGEDLVSFGNMSELKPQ--RAVFGCENAETGDL 110
Query: 161 TSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQGSSKINFGGIVAGAGVVSTP 218
S GI+GLG G+ S++ Q+ I FS C G ++ G +V G +
Sbjct: 111 FSQHAD-GIMGLGRGDLSIVDQLVEKGVINDSFSLCY---GGMEVGGGAMVLGQISPPSD 166
Query: 219 LII------RDHYY-LSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTLLP-LEYHSN 267
++ R YY + L + V ++L+ V +D+G LP +
Sbjct: 167 MVFSHSDPDRSPYYNIELRGLHVAGKKLDINPQVFDGKHGTILDSGTTYAYLPEAAFLPF 226
Query: 268 LKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ-----PKFPEVTIHF-RGADVKLSPS 321
++++ S + + ++ G +P ++DV S+ FP V + F G LSP
Sbjct: 227 IQAITSELHGLKQIR--GPDPNYNDVCFSGAGSEIPELYKTFPSVDMVFDNGEKYSLSPE 284
Query: 322 N-LFRN--ISDEIMCSAFRGG-NANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
N LF++ + F+ G + + G I+ N L+ YD E + V F + C+
Sbjct: 285 NYLFKHSKVHGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREHSKVGFWKTNCS 339
>gi|88174575|gb|ABD39362.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 93/331 (28%), Positives = 150/331 (45%), Gaps = 49/331 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+ + +GTP +DTGS +W C E D P F +S+T +SC +
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSTSWVFC----ECDGCHTNPRTFLQSRSTTCAKVSCGT 56
Query: 95 SQCAVVTSN--CSEG----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
S C + S+ C + DC + Y G S S G L +TLTF+ ++P+
Sbjct: 57 SMCLLGGSDPHCQDSENYPDCPFRVSYQDG---SASYGILYQDTLTFSDVQ----KIPSF 109
Query: 149 IFGCGHKNLASPTSDS--KQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---- 202
FGC NL S ++ G++G+G G S++ Q + G FSYCLP Q S +
Sbjct: 110 TFGC---NLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFS 165
Query: 203 -----INFGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSS--STGNIFV 251
+ G + V T ++ R + +++ L AISV +RL S S +
Sbjct: 166 KTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVF 225
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTI 309
D+G + +P SV+S I+ ++ AE S+ CY++ S + P +++
Sbjct: 226 DSGSELSYIP----DRALSVLSQRIRELLLRRGAAEEE-SERNCYDMRSVDEGDMPAISL 280
Query: 310 HF-RGADVKLSPSNLF--RNISDE-IMCSAF 336
HF GA L +F R++ ++ + C AF
Sbjct: 281 HFDDGARFDLGRHGVFVERSVQEQDVWCLAF 311
>gi|242091057|ref|XP_002441361.1| hypothetical protein SORBIDRAFT_09g025220 [Sorghum bicolor]
gi|241946646|gb|EES19791.1| hypothetical protein SORBIDRAFT_09g025220 [Sorghum bicolor]
Length = 439
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 112/434 (25%), Positives = 172/434 (39%), Gaps = 104/434 (23%)
Query: 28 IISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFK-----QEPPLFDPK 82
+ + D YL+ L++GTPP +DTGSD TW C C + P F P
Sbjct: 18 VTAYTDGYLLSLNLGTPPQVFQVYLDTGSDLTWVPCGSSSSYQCLDCGSSVKPTPTFLPS 77
Query: 83 KSSTYNSISCSSSQCAVVTSN------CSEGDCS---------------YSFLYGRGAYA 121
+S++ C S C V S+ C+ C+ +S+ YG GA
Sbjct: 78 ESTSNTRDLCGSRFCVDVHSSDNRFDPCAAAGCAIPAFTGGQCPRPCPPFSYTYGGGALV 137
Query: 122 SFSSGNLATETLTFN-STSG-------LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLG 173
G+L+ +++T + ST G LPV P FGC ++ P GI G G
Sbjct: 138 ---LGSLSRDSVTLHGSTHGSGAGAGPLPVAFPGFGFGCVGSSIREP------LGIAGFG 188
Query: 174 PGNSSLISQMGTSIAGKFSYC-LPDQGSSKINFGG-IVAG----------AGVVSTPLII 221
G SL SQ+G + FS+C L + + NF +V G G V TP++
Sbjct: 189 RGALSLPSQLGF-LGKGFSHCFLGFRFARNPNFTSPLVMGDLALSSASTDGGFVFTPMLT 247
Query: 222 R----DHYYLSLEAISVGNQ----------RLEFV-SSSTGNIFVDTGVLRTLLPLEYHS 266
+ YY+ LE + +G+ L + + G + VDTG T LP +++
Sbjct: 248 SATYPNFYYVGLEGVVLGDDDGGSAMAAPPSLSGIDAQGNGGVLVDTGTTYTQLPDPFYA 307
Query: 267 NLKSVMSNMIKAQP----VKGVGAEPGFSDVLCYNIS------SQPKFPEVTIHFRGA-- 314
SV++++I A P + + A GF LC+ + + + P +T+H G
Sbjct: 308 ---SVLASLISAAPPYERSRDLEARTGFD--LCFKVPCARAPCADDELPPITLHLAGGAR 362
Query: 315 ----------------DVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYD 358
D + LF+ + E GG V G N + YD
Sbjct: 363 LALPKLSSYYPVTAIRDSVVVKCLLFQRMEMEDDGDGTSGGGPAAVLGSFQMQNVEVVYD 422
Query: 359 IEQAMVSFKPSRCT 372
+ V F+P C
Sbjct: 423 LAAGRVGFRPRDCA 436
>gi|224066811|ref|XP_002302227.1| predicted protein [Populus trichocarpa]
gi|222843953|gb|EEE81500.1| predicted protein [Populus trichocarpa]
Length = 422
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 94/366 (25%), Positives = 156/366 (42%), Gaps = 46/366 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCE-PCPELDCFKQEPPLFDPKKSSTYNSISCS 93
Y + L+IG PP +DTGSD TW QC+ PC C K L+ PK N + C+
Sbjct: 68 YSVILNIGNPPKAFDLDIDTGSDLTWVQCDAPCK--GCTKPLDKLYKPKN----NRVPCA 121
Query: 94 SSQC-AVVTSNCS--EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
SS C A+ +NC C Y Y A S G L ++ +G ++ P + F
Sbjct: 122 SSLCQAIQNNNCDIPTEQCDYEVEY---ADLGSSLGVLLSDYFPLRLNNGSLLQ-PRIAF 177
Query: 151 GCGH-KNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQGSSKINFGG 207
GCG+ + P S GI+GLG G +S++SQ+ T +C + FG
Sbjct: 178 GCGYDQKYLGPHSPPDTAGILGLGRGKASILSQLRTLGITQNVVGHCFSRVTGGFLFFGD 237
Query: 208 -IVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSSSTG----NIFVDTGVLRTLLPL 262
++ +G+ TP++ L S G L F TG + D+G T
Sbjct: 238 HLLPPSGITWTPMLRSSSDTL----YSSGPAELLFGGKPTGIKGLQLIFDSGSSYTYFNA 293
Query: 263 EYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--------FPEVTIHF--- 311
+ + ++ +++ + P+K E + +C+ + K F +TI+F
Sbjct: 294 QVYQSILNLVRKDLSGMPLKDAPEEKALA--VCWKTAKPIKSILDIKSFFKPLTINFIKA 351
Query: 312 RGADVKLSPSNLFRNISDEIMCSAFRG------GNANIVYGRIMQINFLIGYDIEQAMVS 365
+ ++L+P + D +C GN N++ G I + ++ YD E+ +
Sbjct: 352 KNVQLQLAPEDYLIITKDGNVCLGILNGGEQGLGNLNVI-GDIFMQDRVVVYDNERQQIG 410
Query: 366 FKPSRC 371
+ P+ C
Sbjct: 411 WFPTNC 416
>gi|88174583|gb|ABD39366.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 93/331 (28%), Positives = 150/331 (45%), Gaps = 49/331 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP +DTGS +W C E D P F +S+T +SC +
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFC----ECDGCHTNPRTFLQSRSTTCAKVSCGT 56
Query: 95 SQCAVVTSN--CSEG----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
S C + S+ C + DC + Y G S S G L +TLTF+ ++P+
Sbjct: 57 SMCLLGGSDPHCQDSENYPDCPFRVSYQDG---SASYGILYQDTLTFSDVQ----KIPSF 109
Query: 149 IFGCGHKNLASPTSDS--KQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---- 202
FGC NL S ++ G++G+G G S++ Q G FSYCLP Q S +
Sbjct: 110 TFGC---NLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDG-FSYCLPLQKSERGFFS 165
Query: 203 -----INFGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSS--STGNIFV 251
+ G + V T ++ R + +++ L AISV +RL S S +
Sbjct: 166 KTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVF 225
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTI 309
D+G + +P SV+S I+ ++ AE S+ CY++ S + P +++
Sbjct: 226 DSGSELSYIP----DRALSVLSQRIRELLLRRGAAEEE-SERNCYDMRSVDEGDMPAISL 280
Query: 310 HF-RGADVKLSPSNLF--RNISDE-IMCSAF 336
HF GA L +F R++ ++ + C AF
Sbjct: 281 HFDDGARFDLGSHGVFVERSVQEQDVWCLAF 311
>gi|356539555|ref|XP_003538263.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 438
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 103/400 (25%), Positives = 165/400 (41%), Gaps = 52/400 (13%)
Query: 1 AQNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTW 60
A++ +L F +S + I +II Y++ IGTPP + ++DT +D W
Sbjct: 64 AKDQARLQFLASMVAGRSIVPIASGRQIIQ-SPTYIVRAKIGTPPQTLLLAIDTSNDAAW 122
Query: 61 TQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTS-NCSEGDCSYSFLYGRGA 119
C C C LF P+KS+T+ ++SC S +C V S +C C+++ YG +
Sbjct: 123 IPCTACD--GCTST---LFAPEKSTTFKNVSCGSPECNKVPSPSCGTSACTFNLTYGSSS 177
Query: 120 YASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSL 179
A+ N+ +T+T + +P FGC K T G+ G SL
Sbjct: 178 IAA----NVVQDTVTLATD-----PIPGYTFGCVAKTTGPSTPPQGLLGLGR---GPLSL 225
Query: 180 ISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGVVSTPLIIR-----------DHYYLS 228
+SQ FSYCLP S +NF G + G V+ P+ I+ YY++
Sbjct: 226 LSQTQNLYQSTFSYCLPSFKS--LNFSGSLR-LGPVAQPIRIKYTPLLKNPRRSSLYYVN 282
Query: 229 LEAISVGNQ-------RLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMI--KAQ 279
L AI VG + L F +++ D+G + T L ++ ++ + A+
Sbjct: 283 LFAIRVGRKIVDIPPAALAFNAATGAGTVFDSGTVFTRLVAPVYTAVRDEFRRRVAMAAK 342
Query: 280 PVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSN-LFRNISDEIMCSAFRG 338
V + GF CY + P +T F G +V L N L + + C A
Sbjct: 343 ANLTVTSLGGFDT--CYTVPIVA--PTITFMFSGMNVTLPQDNILIHSTAGSTSCLAMAS 398
Query: 339 GNANI-----VYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
N+ V + Q N + YD+ + + CT
Sbjct: 399 APDNVNSVLNVIANMQQQNHRVLYDVPNSRLGVARELCTK 438
>gi|125543284|gb|EAY89423.1| hypothetical protein OsI_10930 [Oryza sativa Indica Group]
Length = 447
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 109/397 (27%), Positives = 166/397 (41%), Gaps = 83/397 (20%)
Query: 39 LSIGTPPVDIFGSVDTGSDCTWTQCEP--CPELDCFKQEPPLFDPKKSSTYNSISCSSSQ 96
+++GTPP ++ +DTGS+ +W C P L P F+ SS+Y ++ C S+
Sbjct: 59 VAVGTPPQNVTMVLDTGSELSWLLCNGSYAPPLT------PAFNASGSSSYGAVPCPSTA 112
Query: 97 CAV------VTSNCS---EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
C V C C S Y A AS + G LAT+ TF T G P
Sbjct: 113 CEWRGRDLPVPPFCDTPPSNACRVSLSY---ADASSADGVLATD--TFLLTGGAPPVAVG 167
Query: 148 VIFGCGHKNLASPTSDS---------KQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-PD 197
FGC ++ ++S TG++G+ G S ++Q GT +F+YC+ P
Sbjct: 168 AYFGCITSYSSTTATNSNGTGTDVSEAATGLLGMNRGTLSFVTQTGTR---RFAYCIAPG 224
Query: 198 QGSSKINFG--GIVAGAGVVSTPLII---------RDHYYLSLEAISVGNQRLEFVSS-- 244
+G + G G VA + TPLI R Y + LE I VG L S
Sbjct: 225 EGPGVLLLGDDGGVA-PPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVGCALLPIPKSVL 283
Query: 245 -----STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGF----SDVLC 295
G VD+G T L + ++ LK+ ++ +A+ + EPGF + C
Sbjct: 284 TPDHTGAGQTMVDSGTQFTFLLADAYAALKAEFTS--QARLLLAPLGEPGFVFQGAFDAC 341
Query: 296 YN------ISSQPKFPEVTIHFRGADVKLSPSNLFRNISDE---------IMCSAFRGGN 340
+ ++ P V + RGA+V +S L + E + C F GN
Sbjct: 342 FRGPEARVAAASGLLPVVGLVLRGAEVAVSGEKLLYMVPGERRGEGGAEAVWCLTF--GN 399
Query: 341 ANI------VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+++ V G Q N + YD++ V F P+RC
Sbjct: 400 SDMAGMSAYVIGHHHQQNVWVEYDLQNGRVGFAPARC 436
>gi|88174573|gb|ABD39361.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 90/329 (27%), Positives = 147/329 (44%), Gaps = 45/329 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+ + +GTP +DTGS +W C E D P F +S+T +SC +
Sbjct: 1 YVTSVGLGTPSKTQIVEIDTGSSTSWVFC----ECDGCHTNPRTFLQSRSTTCAKVSCGT 56
Query: 95 SQCAVVTSN--CSEG----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
S C + S+ C + DC + Y G S S G L +TLTF+ ++P+
Sbjct: 57 SMCLLGGSDPHCQDSENYPDCPFRVSYQDG---SASYGILYQDTLTFSDVQ----KIPSF 109
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK------ 202
FGC + + G++G+G G S++ Q + G FSYCLP Q S +
Sbjct: 110 TFGCNLDSFGA-NEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKT 167
Query: 203 ---INFGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSS--STGNIFVDT 253
+ G + V T ++ R + +++ L AISV +RL S S + D+
Sbjct: 168 TGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDS 227
Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHF 311
G + +P SV+S I+ ++ AE S+ CY++ S + P +++HF
Sbjct: 228 GSELSYIP----DRALSVLSQRIRELLLRRGAAEEE-SERNCYDMRSVDEGDMPAISLHF 282
Query: 312 -RGADVKLSPSNLF--RNISDE-IMCSAF 336
GA L +F R++ ++ + C AF
Sbjct: 283 DDGARFDLGSRGVFVERSVQEQDVWCLAF 311
>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 632
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 100/381 (26%), Positives = 161/381 (42%), Gaps = 47/381 (12%)
Query: 19 PISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPL 78
P S + + + ++ Y L IGTPP VD+GS T+ C C + C K + P
Sbjct: 77 PHSRMRLYDDLLINGYYTTRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQ--CGKHQDPK 134
Query: 79 FDPKKSSTYNSISCSSSQCAVVTSNCSEG--DCSYSFLYGRGAYASFSSGNLATETLTFN 136
F P+ SSTY + C+ + NC + C Y Y A S S G L + ++F
Sbjct: 135 FQPEMSSTYQPVKCN------MDCNCDDDREQCVYEREY---AEHSSSKGVLGEDLISFG 185
Query: 137 STSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYC 194
+ S L + +FGC S + GIIGLG G+ SL+ Q+ I+ F C
Sbjct: 186 NESQLTPQ--RAVFGCETVETGDLYS-QRADGIIGLGQGDLSLVDQLVDKGLISNSFGLC 242
Query: 195 LPDQGSSKINFGGIVAGAGVVSTPLIIRD-------HYYLSLEAISVGNQRLEF---VSS 244
G + G ++ G + ++ D +Y + L I V ++L V
Sbjct: 243 Y---GGMDVGGGSMILGGFDYPSDMVFTDSDPDRSPYYNIDLTGIRVAGKQLSLHSRVFD 299
Query: 245 STGNIFVDTGVLRTLLP-LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--- 300
+D+G LP + + ++VM + + + G +P F D C+ +++
Sbjct: 300 GEHGAVLDSGTTYAYLPDAAFAAFEEAVMREVSTLKQID--GPDPNFKDT-CFQVAASNY 356
Query: 301 ----QPKFPEVTIHFR-GADVKLSPSN-LFRN--ISDEIMCSAF-RGGNANIVYGRIMQI 351
FP V + F+ G LSP N +FR+ + F G + + G I+
Sbjct: 357 VSELSKIFPSVEMVFKSGQSWLLSPENYMFRHSKVHGAYCLGVFPNGKDHTTLLGGIVVR 416
Query: 352 NFLIGYDIEQAMVSFKPSRCT 372
N L+ YD E + V F + C+
Sbjct: 417 NTLVVYDRENSKVGFWRTNCS 437
>gi|357116170|ref|XP_003559856.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 460
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 99/379 (26%), Positives = 162/379 (42%), Gaps = 57/379 (15%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
YL+ S+GTPP + +VDT +D W C C C P F+P S+T+ + C +
Sbjct: 94 YLVRASLGTPPQRLLLAVDTSNDAAWVPCAGC--HGC-PTTAPSFNPASSATFRPVPCGA 150
Query: 95 SQCAVVTS-NC-----SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
C+ + +C S+ C +S YG + L+ + L + G+ +
Sbjct: 151 PPCSQAPNPSCTSLAKSKNSCGFSLSYGDSSL----DATLSQDNLAVTANGGV---IKGY 203
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGI 208
FGC K S S + G++GLG G ++Q G FSYCLP S NF G
Sbjct: 204 TFGCLTK---SNGSAAPAQGLLGLGRGPLGFVAQTKGIYEGTFSYCLPSYYRSAANFSGS 260
Query: 209 V--------AGAGVVSTPLIIRDH----YYLSLEAISVGNQR-------LEFVSSSTGNI 249
+ A + +TPL+ H YY+++ + +G + L F +++
Sbjct: 261 LTLGRKGQPAPEKMKTTPLLASPHRPSLYYVAMTGVRIGKKSVPIPPSALAFDAATGAGT 320
Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVK--------GVGAEPGFSDVLCYNISSQ 301
+D+G + L ++ ++ + + + V + GF CYN+S+
Sbjct: 321 VLDSGTMFARLAQPAYAAVRDEVRRRVAGSLRRRGGGGASVSVSSLGGFDT--CYNVSTV 378
Query: 302 PKFPEVTIHFRGA-DVKLSPSN-LFRNISDEIMCSAFR-----GGNANI-VYGRIMQINF 353
+P VT+ F G +V+L N + R+ C A G NA + V G + Q N
Sbjct: 379 -AWPAVTLVFGGGMEVRLPEENVVIRSTYGSTSCLAMAASPADGVNAALNVIGSLQQQNH 437
Query: 354 LIGYDIEQAMVSFKPSRCT 372
+ +D+ A V F RCT
Sbjct: 438 RVLFDVPNARVGFARERCT 456
>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
Length = 633
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 105/362 (29%), Positives = 157/362 (43%), Gaps = 40/362 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + IGTPP VDTGS T+ C C + C K + P F P SSTY + C S
Sbjct: 92 YTTRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQ--CGKHQDPNFQPDWSSTYQPLKC-S 148
Query: 95 SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
+C T + C Y Y A S SSG L + ++F S L + +FGC +
Sbjct: 149 MEC---TCDSEMMHCVYDRQY---AEMSSSSGVLGEDIVSFGKQSELKPQ--RTVFGCEN 200
Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCL--PDQGSSKINFGGIVA 210
S + GI+GLG G+ S++ Q+ I FS C D G + GGI
Sbjct: 201 VETGDIYS-QRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDVGGGAMVLGGISP 259
Query: 211 GAGVV---STPLIIRDHYY-LSLEAISVGNQRL---EFVSSSTGNIFVDTGVLRTLLPLE 263
AG+V S P R YY + L+ I + ++L V +D+G LP
Sbjct: 260 PAGMVFTHSDP--ARSAYYNIDLKEIHIAGKQLPINPMVFDGKYGTILDSGTTYAYLPEP 317
Query: 264 YHSNLK-SVMSNMIKAQPVKGVGAEPGFSDVLCY-----NISSQPK-FPEVTIHF-RGAD 315
K ++M + + ++ G + ++D+ C+ ++S K FP V + F G
Sbjct: 318 AFKAFKDAIMKELNSLKLIQ--GPDRNYNDI-CFSGVGSDVSQLSKTFPAVDLVFSNGNR 374
Query: 316 VKLSPSN-LFRN--ISDEIMCSAFRGGN-ANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ LSP N LF++ F+ N + G I+ N L+ YD E + F + C
Sbjct: 375 LSLSPENYLFQHSKAHGAYCLGIFQNENDQTTLLGGIIVRNTLVMYDREHLKIGFWKTNC 434
Query: 372 TN 373
+
Sbjct: 435 SE 436
>gi|255685716|gb|ACU28347.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
gi|255685726|gb|ACU28352.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
gi|255685728|gb|ACU28353.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
Length = 91
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 47/106 (44%), Positives = 61/106 (57%), Gaps = 15/106 (14%)
Query: 37 MHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQ 96
M L IGTPP +I +DTGS+ WTQC PC L C+ Q+ P+FDP KSST+ C
Sbjct: 1 MKLQIGTPPFEIEAVLDTGSELIWTQCLPC--LHCYDQKAPIFDPSKSSTFKETRC---- 54
Query: 97 CAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
N + C Y +Y +Y + G LATET+T +STSG+P
Sbjct: 55 ------NTPDHSCPYKIVYDDKSY---TQGTLATETVTIHSTSGVP 91
>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
gi|241946781|gb|EES19926.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
Length = 466
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 95/379 (25%), Positives = 159/379 (41%), Gaps = 55/379 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPP--LFDPKKSSTYNSISC 92
Y + +GTP DTGSD TW +C P +F S ++ I+C
Sbjct: 101 YFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAGAAAGTGAGSPARVFRTAASKSWAPIAC 160
Query: 93 SSSQCAVVT----SNCSE--GDCSYSFLYGRGAYASFSSGNLATETLTF----------- 135
SS C +NCS C+Y + Y G S + G + T++ T
Sbjct: 161 SSDTCTSYVPFSLANCSSPASPCAYDYRYRDG---SAARGVVGTDSATIALSSGSGRGGG 217
Query: 136 NSTSGLPVEMPNVIFGCGHKNLASPTSDSKQT--GIIGLGPGNSSLISQMGTSIAGKFSY 193
+S+ G ++ V+ GC A+ S Q+ G++ LG N S S+ G+FSY
Sbjct: 218 DSSGGRRAKLQGVVLGCA----ATYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSY 273
Query: 194 CL-----PDQGSSKINFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVS- 243
CL P +S + F G A A TPL++ Y ++++A+ V + L+ +
Sbjct: 274 CLVDHLAPRNATSYLTF-GPGATAPAAQTPLLLDRRMTPFYAVTVDAVYVAGEALDIPAD 332
Query: 244 ----SSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS 299
G +D+G T+L + + + +S + P V +P F CYN +
Sbjct: 333 VWDVDRNGGAILDSGTSLTILATPAYRAVVTALSKHLAGLPR--VTMDP-FE--YCYNWT 387
Query: 300 SQP--KFPEVTIHFRGADVKLSP--SNLFRNISDEIMCSAFRGGN--ANIVYGRIMQINF 353
+ P++ +HF G+ +L P + + + + C + G+ V G I+Q
Sbjct: 388 DAGALEIPKMEVHFAGS-ARLEPPAKSYVIDAAPGVKCIGVQEGSWPGVSVIGNILQQEH 446
Query: 354 LIGYDIEQAMVSFKPSRCT 372
L +D+ + FK +RC
Sbjct: 447 LWEFDLRDRWLRFKHTRCA 465
>gi|51038078|gb|AAT93881.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 481
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 101/419 (24%), Positives = 154/419 (36%), Gaps = 98/419 (23%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPC--------PELDCFKQEPPLFDPKKSST 86
Y+ IG PP VDTGSD WTQC C CF Q P ++ S T
Sbjct: 78 YIASYGIGDPPQPAEAVVDTGSDLVWTQCSTCRLPAAAAAGGGGCFPQNLPYYNFSLSRT 137
Query: 87 YNSISCSSSQ---CAVV--TSNCSEGD------CSYSFLYGRGAYASFSSGNLATETLTF 135
++ C C V T+ C+ G C + YG G + G L T+ TF
Sbjct: 138 ARAVPCDDDDGALCGVAPETAGCARGGGSGDDACVVAASYGAG----VALGVLGTDAFTF 193
Query: 136 NSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL 195
S+S + + FGC + SP + + +GIIGLG G SL+SQ+ A +FSYCL
Sbjct: 194 PSSSSV-----TLAFGCVSQTRISPGALNGASGIIGLGRGALSLVSQLN---ATEFSYCL 245
Query: 196 PDQGSSKINFGGIVAGAG------------------VVSTPLI-------IRDHYYLSLE 230
++ + G G V + P YYL L
Sbjct: 246 TPYFRDTVSPSHLFVGDGELAGLSAAAGGGGGGGAPVTTVPFAKNPKDSPFSTFYYLPLV 305
Query: 231 AISVGNQRLEFVSSS-----------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQ 279
++ GN + + + G +D+G T L H L ++ ++
Sbjct: 306 GLAAGNATVALPAGAFDLREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGS 365
Query: 280 ------PVKGVGAEPGFSDVLCYNIS------SQPKFPEVTIHFR-----GADVKLSPSN 322
P K GA LC + P + + F G ++ +
Sbjct: 366 GSLVPPPAKLGGALE-----LCVEAGDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEK 420
Query: 323 LFRNISDEIMCSAF---RGGNANI------VYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
+ + C A GNA + + G MQ + + YD+ ++SF+P+ C+
Sbjct: 421 YWARVEASTWCMAVVSSASGNATLPTNETTIIGNFMQQDMRVLYDLANGLLSFQPANCS 479
>gi|195627138|gb|ACG35399.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 431
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 90/368 (24%), Positives = 151/368 (41%), Gaps = 47/368 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ +GTP + ++DT +D TW+ C PC + F P SS+Y S+ C+S
Sbjct: 79 YVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSR----FIPASSSSYASLPCAS 134
Query: 95 SQCAVVTSNCSEGD---------CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
C + + C++S + A SF + +L ++TL + +
Sbjct: 135 DWCPLFEGQPCPANQDASAPLPACAFSKPF---ADTSFQA-SLGSDTLRLGKDA-----I 185
Query: 146 PNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG----SS 201
FGC +A PT++ + G++GLG G SL+SQ G++ G FSYCLP S
Sbjct: 186 AGYAFGC-VGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSTYNGVFSYCLPSYRSYYFSG 244
Query: 202 KINFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLE-------FVSSSTGNIF 250
+ G V TPL+ H YY+++ +SVG ++ F ++
Sbjct: 245 SLRLGAAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTV 304
Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIH 310
+D+G + T ++ L+ + A G + F + + P VT+H
Sbjct: 305 IDSGTVITRWTAPVYAALREEFRRQVAAP--SGYTSLGAFDTCFNTDEVAAGGAPPVTLH 362
Query: 311 FRGA-DVKLSPSN-LFRNISDEIMCSAFRGG-----NANIVYGRIMQINFLIGYDIEQAM 363
G D+ L N L + + + C A V + Q N + D+ +
Sbjct: 363 MDGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSR 422
Query: 364 VSFKPSRC 371
V F C
Sbjct: 423 VGFAREPC 430
>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 634
Score = 87.4 bits (215), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 105/362 (29%), Positives = 157/362 (43%), Gaps = 40/362 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + IGTPP VDTGS T+ C C + C K + P F P SSTY + C S
Sbjct: 92 YTTRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQ--CGKHQDPNFQPDWSSTYQPLKC-S 148
Query: 95 SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
+C T + C Y Y A S SSG L + ++F S L + +FGC +
Sbjct: 149 MEC---TCDSEMMHCVYDRQY---AEMSSSSGVLGEDIVSFGKQSELKPQ--RTVFGCEN 200
Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCL--PDQGSSKINFGGIVA 210
S + GI+GLG G+ S++ Q+ I FS C D G + GGI
Sbjct: 201 VETGDIYSQ-RADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDVGGGAMVLGGISP 259
Query: 211 GAGVV---STPLIIRDHYY-LSLEAISVGNQRL---EFVSSSTGNIFVDTGVLRTLLPLE 263
AG+V S P R YY + L+ I + ++L V +D+G LP
Sbjct: 260 PAGMVFTHSDP--ARSAYYNIDLKEIHIAGKQLPINPMVFDGKYGTILDSGTTYAYLPEP 317
Query: 264 YHSNLK-SVMSNMIKAQPVKGVGAEPGFSDVLCY-----NISSQPK-FPEVTIHF-RGAD 315
K ++M + + ++ G + ++D+ C+ ++S K FP V + F G
Sbjct: 318 AFKAFKDAIMKELNSLKLIQ--GPDRNYNDI-CFSGVGSDVSQLSKTFPAVDLVFSNGNR 374
Query: 316 VKLSPSN-LFRN--ISDEIMCSAFRGGN-ANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
+ LSP N LF++ F+ N + G I+ N L+ YD E + F + C
Sbjct: 375 LSLSPENYLFQHSKAHGAYCLGIFQNENDQTTLLGGIIVRNTLVMYDREHLKIGFWKTNC 434
Query: 372 TN 373
+
Sbjct: 435 SE 436
>gi|88174591|gb|ABD39370.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 87.4 bits (215), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 90/329 (27%), Positives = 147/329 (44%), Gaps = 45/329 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y+ + +GTP +DTGS +W C E D P F +S+T +SC +
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSTSWVFC----ECDGCHTNPRTFLQSRSTTCAKVSCGT 56
Query: 95 SQCAVVTSN--CSEG----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
S C + S+ C + DC + Y G S S G L +TLTF+ ++P+
Sbjct: 57 SMCLLGGSDPHCQDSENYPDCPFRVSYQDG---SASYGILYQDTLTFSDVQ----KIPSF 109
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK------ 202
FGC + + G++G+G G S++ Q + G FSYCLP Q S +
Sbjct: 110 TFGCNLDSFGA-NEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKT 167
Query: 203 ---INFGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSS--STGNIFVDT 253
+ G + V T ++ R + +++ L AISV +RL S S + D+
Sbjct: 168 TGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDS 227
Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHF 311
G + +P SV+S I+ ++ AE S+ CY++ S + P +++HF
Sbjct: 228 GSELSYIP----DRALSVLSQRIRELLLRRGAAEEE-SERNCYDMRSVDEGDMPAISLHF 282
Query: 312 -RGADVKLSPSNLF--RNISDE-IMCSAF 336
GA L +F R++ ++ + C AF
Sbjct: 283 DDGARFDLGIHGVFVERSVQEQDVWCLAF 311
>gi|88174561|gb|ABD39355.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
Length = 321
Score = 87.4 bits (215), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 90/329 (27%), Positives = 148/329 (44%), Gaps = 45/329 (13%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP +DTGS +W C E D P F +S+T +SC +
Sbjct: 1 YVISVGLGTPSKTQILEIDTGSSTSWVFC----ECDGCHTNPRTFLQSRSTTCAKVSCGT 56
Query: 95 SQCAVVTSN--CSEG----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
S C + S+ C + DC + Y G S S G L +TLTF+ ++P+
Sbjct: 57 SMCLLGGSDPHCQDSENYPDCPFRVSYQDG---SASYGILYQDTLTFSDVQ----KIPSF 109
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK------ 202
FGC + + G++G+G G S++ Q + G FSYCLP Q S +
Sbjct: 110 SFGCNMDSFGA-NEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQMSERGFFSKT 167
Query: 203 ---INFGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSS--STGNIFVDT 253
+ G + V T ++ R + +++ L AISV +RL S S + D+
Sbjct: 168 TGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVFDS 227
Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHF 311
G + +P SV+S I+ ++ AE S+ CY++ S + P +++HF
Sbjct: 228 GSELSYIP----DRALSVLSQRIRELLLRRGAAEEE-SERNCYDMRSVDEGDMPAISLHF 282
Query: 312 -RGADVKLSPSNLF--RNISDE-IMCSAF 336
GA L +F R++ ++ + C AF
Sbjct: 283 DDGARFDLGSHGVFVERSVQEQDVWCLAF 311
>gi|326515330|dbj|BAK03578.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 445
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 89/361 (24%), Positives = 151/361 (41%), Gaps = 54/361 (14%)
Query: 49 FGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEGD 108
F +DT S W +C C L +Q P+FDP SS+Y + +S C GD
Sbjct: 90 FLVLDTASSLPWMRCAHC--LPVQRQRSPVFDPSDSSSYRPLHPTSPLCRAPNPVLPAGD 147
Query: 109 -CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQT 167
CS+ + G + T+T+ + + LP+ +V FGC ++ D+K T
Sbjct: 148 KCSFHL-------PGEAHGYVGTDTIILGNPT-LPIH--SVAFGCAQ---STEGFDTKGT 194
Query: 168 --GIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGVVSTPLIIRDH- 224
G +G+G +SLI Q+ + +FSYCL G S G I GA + L++
Sbjct: 195 FAGTLGMGKLPTSLIMQIKDRVGSRFSYCLIGLGHSPGRNGFIRFGADIPDPTLLVHHRI 254
Query: 225 -----------------YYLSLEAIS--------VGNQRLEFVSSSTGNIFVDTGVLRTL 259
YY+ L IS + E S +G FVD G T
Sbjct: 255 KILPTPPHLPHGVADSAYYVKLLGISLNGTPIPGIRQAMFERRSDGSGGCFVDAGTQVTH 314
Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRG------ 313
L ++ ++ +++M++ K V +P FS + P++T+ F G
Sbjct: 315 LVPAAYAVVEEAVAHMVQQWGYKRV-RDPNFSLCFREHPGIWSHIPKLTLDFEGPASRTV 373
Query: 314 ADVKLSPSNLFRNISDE-IMC-SAFRGGNAN-IVYGRIMQINFLIGYDIEQAMVSFKPSR 370
A +++ NLF + ++ ++C +R + V G + Q++ +D+ ++F
Sbjct: 374 AHLEIVSRNLFLKVDNQPLVCFGVYRTSRGSPTVVGAMQQVDTRFIFDLHANTITFHRES 433
Query: 371 C 371
C
Sbjct: 434 C 434
>gi|297810815|ref|XP_002873291.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
gi|297319128|gb|EFH49550.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
Length = 439
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 101/398 (25%), Positives = 169/398 (42%), Gaps = 50/398 (12%)
Query: 1 AQNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTW 60
AQ+ +L + + +S + I +++ Y++ + IGTP + ++DT SD W
Sbjct: 66 AQDQARLQYLSSLVAGRSVVPIASGRQMLQ-STTYIVKVLIGTPAQPLLLAMDTSSDVAW 124
Query: 61 TQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTS-NCSEGDCSYSFLYGRGA 119
C C + C F P KS+++ ++SCS+ QC V + C CS++ YG +
Sbjct: 125 IPCSGC--VGCPSNT--AFSPAKSTSFKNVSCSAPQCKQVPNPACGARACSFNLTYGSSS 180
Query: 120 YASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSL 179
A+ NL+ +T+ + P++ FGC +K T Q + G SL
Sbjct: 181 IAA----NLSQDTIRLAAD---PIKA--FTFGCVNKVAGGGTIPPPQGLLGLGR-GPLSL 230
Query: 180 ISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGVVSTPLIIR-----------DHYYLS 228
+SQ + FSYCLP S + F G + G S P ++ YY++
Sbjct: 231 MSQAQSVYKSTFSYCLPSFRS--LTFSGSLR-LGPTSQPQRVKYTQLLRNPRRSSLYYVN 287
Query: 229 LEAISVGNQRLEF--------VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQP 280
L AI VG + ++ S+ G IF D+G + T L + +++ +K P
Sbjct: 288 LVAIRVGRKVVDLPPAAIAFNPSTGAGTIF-DSGTVYTRLAKPVYEAVRNEFRKRVKP-P 345
Query: 281 VKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNL-FRNISDEIMCSAFRGG 339
V + GF CY S Q K P +T F+G ++ + NL + + C A
Sbjct: 346 TAVVTSLGGFDT--CY--SGQVKVPTITFMFKGVNMTMPADNLMLHSTAGSTSCLAMASA 401
Query: 340 NANI-----VYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
N+ V + Q N + D+ + RC+
Sbjct: 402 PENVNSVVNVIASMQQQNHRVLIDVPNGRLGLARERCS 439
>gi|212722554|ref|NP_001131154.1| uncharacterized protein LOC100192462 precursor [Zea mays]
gi|194690728|gb|ACF79448.1| unknown [Zea mays]
Length = 431
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 90/368 (24%), Positives = 150/368 (40%), Gaps = 47/368 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ +GTP + ++DT +D TW+ C PC + F P SS+Y S+ C+S
Sbjct: 79 YVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSR----FIPASSSSYASLPCAS 134
Query: 95 SQCAVVTSNCSEGD---------CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
C + + C++S + A SF + +L ++TL + +
Sbjct: 135 DWCPLFEGQPCPANQDASAPLPACAFSKPF---ADTSFQA-SLGSDTLRLGKDA-----I 185
Query: 146 PNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG----SS 201
FGC +A PT++ + G++GLG G SL+SQ G+ G FSYCLP S
Sbjct: 186 AGYAFGC-VGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSYCLPSYRSYYFSG 244
Query: 202 KINFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLE-------FVSSSTGNIF 250
+ G V TPL+ H YY+++ +SVG ++ F ++
Sbjct: 245 SLRLGAAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTV 304
Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIH 310
+D+G + T ++ L+ + A G + F + + P VT+H
Sbjct: 305 IDSGTVITRWTAPVYAALREEFRRQVAAP--SGYTSLGAFDTCFNTDEVAAGGAPPVTLH 362
Query: 311 FRGA-DVKLSPSN-LFRNISDEIMCSAFRGG-----NANIVYGRIMQINFLIGYDIEQAM 363
G D+ L N L + + + C A V + Q N + D+ +
Sbjct: 363 MDGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSR 422
Query: 364 VSFKPSRC 371
V F C
Sbjct: 423 VGFAREPC 430
>gi|194698750|gb|ACF83459.1| unknown [Zea mays]
gi|194703964|gb|ACF86066.1| unknown [Zea mays]
gi|219886221|gb|ACL53485.1| unknown [Zea mays]
gi|219886359|gb|ACL53554.1| unknown [Zea mays]
gi|223950085|gb|ACN29126.1| unknown [Zea mays]
gi|414865218|tpg|DAA43775.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 431
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 90/368 (24%), Positives = 150/368 (40%), Gaps = 47/368 (12%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ +GTP + ++DT +D TW+ C PC + F P SS+Y S+ C+S
Sbjct: 79 YVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSR----FIPASSSSYASLPCAS 134
Query: 95 SQCAVVTSNCSEGD---------CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
C + + C++S + A SF + +L ++TL + +
Sbjct: 135 DWCPLFEGQPCPANQDASAPLPACAFSKPF---ADTSFQA-SLGSDTLRLGKDA-----I 185
Query: 146 PNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG----SS 201
FGC +A PT++ + G++GLG G SL+SQ G+ G FSYCLP S
Sbjct: 186 AGYAFGC-VGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSYCLPSYRSYYFSG 244
Query: 202 KINFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLE-------FVSSSTGNIF 250
+ G V TPL+ H YY+++ +SVG ++ F ++
Sbjct: 245 SLRLGAAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTV 304
Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIH 310
+D+G + T ++ L+ + A G + F + + P VT+H
Sbjct: 305 IDSGTVITRWTAPVYAALREEFRRQVAAP--SGYTSLGAFDTCFNTDEVAAGGAPPVTLH 362
Query: 311 FRGA-DVKLSPSN-LFRNISDEIMCSAFRGG-----NANIVYGRIMQINFLIGYDIEQAM 363
G D+ L N L + + + C A V + Q N + D+ +
Sbjct: 363 MDGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSR 422
Query: 364 VSFKPSRC 371
V F C
Sbjct: 423 VGFAREPC 430
>gi|413951280|gb|AFW83929.1| hypothetical protein ZEAMMB73_279135 [Zea mays]
Length = 451
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 105/386 (27%), Positives = 161/386 (41%), Gaps = 55/386 (14%)
Query: 17 KSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP 76
+S + I +++S+ Y+ +GTP + ++D +D W C
Sbjct: 90 RSFVPIAPGRQLLSIPS-YVARARLGTPAQALLVAIDPSNDAAWVPCA----ACAGCARA 144
Query: 77 PLFDPKKSSTYNSISCSSSQCAVVTS-NCSEG---DCSYSFLYGRGAYASFSSGNLATET 132
P FDP +SSTY + C + QC+ + +C G C+++ Y + + L +
Sbjct: 145 PSFDPTRSSTYRPVRCGAPQCSQAPAPSCPGGLGSSCAFNLSYAASTFQAL----LGQDA 200
Query: 133 LTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFS 192
L + + FGC H + + S Q G++G G G S SQ FS
Sbjct: 201 LALHDDVD---AVAAYTFGCLH--VVTGGSVPPQ-GLVGFGRGPLSFPSQTKDVYGSVFS 254
Query: 193 YCLPDQGSSKINFGGI--VAGAG----VVSTPLIIRDH----YYLSLEAISVGNQ----- 237
YCLP SS NF G + AG + +TPL+ H YY+++ I VG +
Sbjct: 255 YCLPSYKSS--NFSGTLRLGPAGQPKRIKTTPLLSNPHRPSLYYVNMVGIRVGGRPVPVP 312
Query: 238 --RLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKG-VGAEPGFSDVL 294
L F +S VD G + T L ++ ++ V + ++A PV G +G GF
Sbjct: 313 ASALAFDPTSGRGTIVDAGTMFTRLSAPVYAAVRDVFRSRVRA-PVAGPLG---GFDT-- 366
Query: 295 CYNISSQPKFPEVTIHFRG-ADVKLSPSN-LFRNISDEIMCSAFRGG------NANIVYG 346
CYN++ P VT F G V L N + R+ S I C A G A V
Sbjct: 367 CYNVTI--SVPTVTFSFDGRVSVTLPEENVVIRSSSGGIACLAMAAGPPDGVDAALNVLA 424
Query: 347 RIMQINFLIGYDIEQAMVSFKPSRCT 372
+ Q N + +D+ V F CT
Sbjct: 425 SMQQQNHRVLFDVANGRVGFSRELCT 450
>gi|255576064|ref|XP_002528927.1| pepsin A, putative [Ricinus communis]
gi|223531629|gb|EEF33456.1| pepsin A, putative [Ricinus communis]
Length = 493
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 99/407 (24%), Positives = 156/407 (38%), Gaps = 90/407 (22%)
Query: 44 PPVDIFGSVDTGSDCTWTQCEPCPELDCFKQE-------PPLFDPKKSSTYNSISCSSSQ 96
PP + +DTGSD W C+P + C + PP P+ SST S+ C SS
Sbjct: 92 PPQHVSLYLDTGSDLVWFPCKPFECILCEGKAENTTASTPP---PRLSSTARSVHCKSSA 148
Query: 97 CAVVTSN------CSEGDCSYSFLYGRGA--------YASFSSGNLATETLTFNSTSGLP 142
C+ SN C+ DC + Y ++ G+L ++ + LP
Sbjct: 149 CSAAHSNLPTSDLCAIADCPLESIETSDCHSFSCPSFYYAYGDGSLVAR--LYHDSIKLP 206
Query: 143 VEMP-----NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGT---SIAGKFSYC 194
+ P N FGC H LA P G+ G G G SL +Q+ + + +FSYC
Sbjct: 207 LATPSLSLHNFTFGCAHTALAEP------VGVAGFGRGVLSLPAQLASFAPQLGNRFSYC 260
Query: 195 LPDQ--GSSKINF-GGIVAGAGVVSTPLIIRDH-----------------YYLSLEAISV 234
L S ++ ++ G + +D Y + LE IS+
Sbjct: 261 LVSHSFNSDRLRLPSPLILGHSDDKEKRVNKDDVQFVYTSMLDNPKHPYFYCVGLEGISI 320
Query: 235 GNQRL---EFVS----SSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKA--QPVKGVG 285
G +++ EF+ +G + VD+G T+LP ++++ + N + + K V
Sbjct: 321 GKKKIPAPEFLKRVDREGSGGVVVDSGTTFTMLPASLYNSVVAEFDNRVGRVYERAKEVE 380
Query: 286 AEPGFSDVLCYNISSQPKFPEVTIHFRG--ADVKLSPSNLFRNISD---------EIMC- 333
+ G CY + P + +HF G + V L N F + D + C
Sbjct: 381 DKTGLGP--CYYYDTVVNIPSLVLHFVGNESSVVLPKKNYFYDFLDGGDGVRRKRRVGCL 438
Query: 334 -------SAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
A G G Q F + YD+EQ V F +C +
Sbjct: 439 MLMNGGEEAELTGGPGATLGNYQQHGFEVVYDLEQRRVGFARRKCAS 485
>gi|340810977|gb|AEK75415.1| S5 [Oryza rufipogon]
Length = 357
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 99/371 (26%), Positives = 148/371 (39%), Gaps = 50/371 (13%)
Query: 37 MHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP---PLFDPKKSSTYNSISCS 93
M +S+G PPV ++DTGS +W QC+PC + C Q P+FDP +S T + CS
Sbjct: 1 MAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59
Query: 94 SSQCA-------VVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
S +C + +NC E + C+YS YG G ++S G + T+TL +
Sbjct: 60 SVKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNG--WAYSVGKMVTDTLRIGDS------ 111
Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP-DQGSSKI 203
+++FGC S L SYCLP D+
Sbjct: 112 FMDLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKALSYCLPTDETKPGY 171
Query: 204 NFGGIVAGAGVVS--TPL---IIRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLRT 258
G A + TPL I R Y L++E + QRL +S+ + VD+G RT
Sbjct: 172 MILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRL---VTSSSEMIVDSGAQRT 228
Query: 259 -LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY--------------NISSQPK 303
L P + K++ M + A +CY S+
Sbjct: 229 SLWPSTFALLDKTITQAMSSIGYHRTSRARQ--ESYICYLSEHDYSGWNGTITPFSNWSA 286
Query: 304 FPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNA--NIVYGRIMQINFLIGYDIE 360
P + I F GA + L P N+F N +C F A + + G + +F +DI+
Sbjct: 287 LPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPALRSQILGNRVTRSFGTTFDIQ 346
Query: 361 QAMVSFKPSRC 371
FK + C
Sbjct: 347 GKQFGFKYAVC 357
>gi|340810961|gb|AEK75407.1| S5 [Oryza sativa]
gi|340811037|gb|AEK75445.1| S5 [Oryza rufipogon]
Length = 357
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 106/379 (27%), Positives = 158/379 (41%), Gaps = 66/379 (17%)
Query: 37 MHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP---PLFDPKKSSTYNSISCS 93
M +S+G PPV ++DTGS +W QC+PC + C Q P+FDP +S T + CS
Sbjct: 1 MAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59
Query: 94 SSQCA-------VVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
S +C + +NC E + C+YS YG G ++S G + T+TL +
Sbjct: 60 SVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNG--WAYSVGKMVTDTLRIGDS------ 111
Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG--------KFSYCLP 196
+++FGC S + GI G G + S Q+ AG SYCLP
Sbjct: 112 FMDLMFGCSMDVKYS----EFEAGIFGFGSSSFSFFEQL----AGYPDILSYKALSYCLP 163
Query: 197 -DQGSSKINFGGIVAGAGVVS--TPL---IIRDHYYLSLEAISVGNQRLEFVSSSTGNIF 250
D+ G A + TPL I R Y L++E + QRL +S+ +
Sbjct: 164 TDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRL---VTSSSEMI 220
Query: 251 VDTGVLRT-LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY------------- 296
VD+G RT L P + K++ M + A +CY
Sbjct: 221 VDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQ--ESYICYLSEHDYSGWNGTI 278
Query: 297 -NISSQPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNA--NIVYGRIMQIN 352
S+ P + I F GA + L P N+F N +C F A + + G + +
Sbjct: 279 TPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPALRSQILGNRVTRS 338
Query: 353 FLIGYDIEQAMVSFKPSRC 371
F +DI+ FK + C
Sbjct: 339 FGTTFDIQGKQFGFKYAVC 357
>gi|79507883|ref|NP_196320.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332003717|gb|AED91100.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 455
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 101/398 (25%), Positives = 167/398 (41%), Gaps = 50/398 (12%)
Query: 1 AQNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTW 60
AQ+ +L + + +S + I +++ Y++ IGTP + ++DT SD W
Sbjct: 82 AQDQARLQYLSSLVAGRSVVPIASGRQMLQ-STTYIVKALIGTPAQPLLLAMDTSSDVAW 140
Query: 61 TQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTS-NCSEGDCSYSFLYGRGA 119
C C + C F P KS+++ ++SCS+ QC V + C CS++ YG +
Sbjct: 141 IPCSGC--VGCPSNT--AFSPAKSTSFKNVSCSAPQCKQVPNPTCGARACSFNLTYGSSS 196
Query: 120 YASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSL 179
A+ NL+ +T+ + P++ FGC +K T Q + G SL
Sbjct: 197 IAA----NLSQDTIRLAAD---PIKA--FTFGCVNKVAGGGTIPPPQGLLGLGR-GPLSL 246
Query: 180 ISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGVVSTPLIIR-----------DHYYLS 228
+SQ + FSYCLP S + F G + G S P ++ YY++
Sbjct: 247 MSQAQSIYKSTFSYCLPSFRS--LTFSGSLR-LGPTSQPQRVKYTQLLRNPRRSSLYYVN 303
Query: 229 LEAISVGNQRLEF--------VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQP 280
L AI VG + ++ S+ G IF D+G + T L + +++ +K P
Sbjct: 304 LVAIRVGRKVVDLPPAAIAFNPSTGAGTIF-DSGTVYTRLAKPVYEAVRNEFRKRVK--P 360
Query: 281 VKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNL-FRNISDEIMCSAFRGG 339
V G D CY S Q K P +T F+G ++ + NL + + C A
Sbjct: 361 TTAVVTSLGGFDT-CY--SGQVKVPTITFMFKGVNMTMPADNLMLHSTAGSTSCLAMAAA 417
Query: 340 NANI-----VYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
N+ V + Q N + D+ + RC+
Sbjct: 418 PENVNSVVNVIASMQQQNHRVLIDVPNGRLGLARERCS 455
>gi|168000300|ref|XP_001752854.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696017|gb|EDQ82358.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 525
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 96/366 (26%), Positives = 161/366 (43%), Gaps = 43/366 (11%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWT--QCEPCPELDCFKQEPPL-----FDPKKSST 86
++ ++ IGTP V +DTGSD W +CE C L ++P + P SST
Sbjct: 110 LHYSYIDIGTPNVQFLVVLDTGSDLLWIPCECESCAPLSAESKDPRTSQLNPYTPSLSST 169
Query: 87 YNSISCSSSQCAVVTSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTS-GLPVE 144
+ CS C + ++ + D C Y Y + +SG L + + F S G PV+
Sbjct: 170 AKPVLCSDPLCEMSSTCMAPTDQCPYEINYVSANTS--TSGALYEDYMYFMRESGGNPVK 227
Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQGSSK 202
+P V GCG S + G++GLG + S+ +++ ++ +A FS C+ GS
Sbjct: 228 LP-VYLGCGKVQTGSLLKGAAPNGLMGLGTTDISVPNKLASTGQLADSFSLCISPGGSGT 286
Query: 203 INFGGIVAGAGVVSTPLI-----IRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLR 257
+ FG A +TP+I + D Y + +++I+VGN L S + DTG
Sbjct: 287 LTFGD-EGPAAQRTTPIIPKSVSMLDTYIVEIDSITVGNTNLLMASHA----LFDTGTSF 341
Query: 258 TLLPLEYHSNLKSVMSNMIK---AQPVKGVGAEPGFSDV-LCYNIS-SQPKFPEVTIHFR 312
T L K+V ++ AQ +P FS LCY S + + P V++
Sbjct: 342 TYLS-------KTVYPQFVQAYDAQMSLPKWNDPRFSKWDLCYQTSNTNFQVPVVSLALS 394
Query: 313 GADVKLSPSNLFRNISDE------IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSF 366
G + L + ++I D+ + + G + G+ N+ I Y+ + + +
Sbjct: 395 GGN-SLDVVSGLKSIVDDNNAMIAVCVTVMDSGAGLSIIGQNFMTNYSITYNRAKMTIGW 453
Query: 367 KPSRCT 372
PS C+
Sbjct: 454 TPSDCS 459
>gi|297723027|ref|NP_001173877.1| Os04g0336942 [Oryza sativa Japonica Group]
gi|255675342|dbj|BAH92605.1| Os04g0336942 [Oryza sativa Japonica Group]
Length = 388
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 82/299 (27%), Positives = 129/299 (43%), Gaps = 31/299 (10%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + IGTP V + +DTGS W C+ CP ++ +DP+ S + +
Sbjct: 82 LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEV 141
Query: 91 SCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP---N 147
C + C C Y Y G + G L T+ L ++ G P +
Sbjct: 142 KCDDTICTSRPPCNMTLRCPYITGYADGG---LTMGILFTDLLHYHQLYGNGQTQPTSTS 198
Query: 148 VIFGCGHKNLASPTSDSKQT-GIIGLGPGNSSLISQMGTSIAGK----FSYCLPDQGSSK 202
V FGCG + S + + GIIG G N + +SQ+ + AGK FS+CL
Sbjct: 199 VTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQL--AAAGKTKKIFSHCLDSTNGGG 256
Query: 203 INFGGIVAGAGVVSTPLIIRDHYY--LSLEAISVGNQRLE-----FVSSSTGNIFVDTGV 255
I G V V +TP++ + Y ++L++I+V L+ F ++ T F+D+G
Sbjct: 257 IFAIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDSGS 316
Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI--SSQPKFPEVTIHFR 312
LP +S L + + P +GA F C++ S KFP++T HF
Sbjct: 317 TLVYLPEIIYSEL---ILAVFAKHPDITMGAMYNFQ---CFHFLGSVDDKFPKITFHFE 369
>gi|224140237|ref|XP_002323490.1| predicted protein [Populus trichocarpa]
gi|222868120|gb|EEF05251.1| predicted protein [Populus trichocarpa]
Length = 478
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 96/371 (25%), Positives = 160/371 (43%), Gaps = 42/371 (11%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
+Y + +G+PP + +DTGSD W C CP + FD SST +
Sbjct: 65 LYFTKVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGLV 124
Query: 91 SCSSSQC--AVVT--SNCS--EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
CS C AV T + CS CSY+F Y G S +SG ++TL F++ G +
Sbjct: 125 HCSDPICTSAVQTTVTQCSPQTNQCSYTFQYEDG---SGTSGYYVSDTLYFDAILGESLV 181
Query: 145 MPN---VIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQ 198
+ + ++FGC T +D GI G G G S+ISQ+ T FS+CL +
Sbjct: 182 VNSSALIVFGCSTFQSGDLTMTDKAVDGIFGFGQGELSVISQLSTHGITPRVFSHCLKGE 241
Query: 199 GSSKINFGGI-VAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNIFV 251
G + G+V +PL+ + HY L+L++I+V + L F +S++ V
Sbjct: 242 GIGGGILVLGEILEPGMVYSPLVPSQPHYNLNLQSIAVNGKLLPIDPSVFATSNSQGTIV 301
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMI--KAQPVKGVGAEPGFSDVLCYNISS--QPKFPEV 307
D+G L E + S ++ ++ P+ G + CY +S+ FP
Sbjct: 302 DSGTTLAYLVAEAYDPFVSAVNVIVSPSVTPIISKGNQ-------CYLVSTSVSQMFPLA 354
Query: 308 TIHFR-GADVKLSPSNLF-----RNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQ 361
+ +F GA + L P + + C F+ + G ++ + + YD+ +
Sbjct: 355 SFNFAGGASMVLKPEDYLIPFGPSQGGSVMWCIGFQKVQGVTILGDLVLKDKIFVYDLVR 414
Query: 362 AMVSFKPSRCT 372
+ + C+
Sbjct: 415 QRIGWANYDCS 425
>gi|356498711|ref|XP_003518193.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 466
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 103/399 (25%), Positives = 156/399 (39%), Gaps = 84/399 (21%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSIS 91
Y + L GTP +DTGS W C C + + F P F PK SS+ +
Sbjct: 86 YSIDLEFGTPSQTFPFVLDTGSTLVWLPCSSHYLCSKCNSFSNTPK-FIPKNSSSSKFVG 144
Query: 92 CSSSQCAVVT----------------SNCSEGDCSYSFLYGRGAYASFSSGNLATETLTF 135
C++ +CA V +NCS+ +Y+ YG G+ A F L +E L F
Sbjct: 145 CTNPKCAWVFGPDVKSHCCRQDKAAFNNCSQTCPAYTVQYGLGSTAGF----LLSENLNF 200
Query: 136 NSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL 195
+ + + + GC ++ P GI G G G SL SQM + +FSYCL
Sbjct: 201 PTK-----KYSDFLLGCSVVSVYQPA------GIAGFGRGEESLPSQMNLT---RFSYCL 246
Query: 196 PDQ---GSSKINFGGIVAGA--------GVVSTPLI----------IRDHYYLSLEAISV 234
S+ I ++ A GV TP + +YY++L+ I V
Sbjct: 247 LSHQFDDSATITSNLVLETASSRDGKTNGVSYTPFLKNPTTKKNPAFGAYYYITLKRIVV 306
Query: 235 GNQR-------LEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAE 287
G +R LE G VD+G T + + + + + AE
Sbjct: 307 GEKRVRVPRRLLEPNVDGDGGFIVDSGSTFTFMERPIFDLVAQEFAKQVSY--TRAREAE 364
Query: 288 PGFSDVLCYNIS---SQPKFPEVTIHFRG-ADVKLSPSNLFRNI-----------SDEIM 332
F C+ ++ FPE+ FRG A ++L +N F + SD++
Sbjct: 365 KQFGLSPCFVLAGGAETASFPELRFEFRGGAKMRLPVANYFSLVGKGDVACLTIVSDDVA 424
Query: 333 CSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
S G A ++ G Q NF + YD+E F+ C
Sbjct: 425 GSGGTVGPA-VILGNYQQQNFYVEYDLENERFGFRSQSC 462
>gi|359476206|ref|XP_002262837.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 462
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 97/361 (26%), Positives = 153/361 (42%), Gaps = 50/361 (13%)
Query: 29 ISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYN 88
++ D +L+++ G P ++ +DTGSD TW +C C +C ++ P F+P SS+Y+
Sbjct: 123 LNEDGFFLVNVGFGKPQQNLNLIIDTGSDTTWIRCNSCSLGNCHNKKIPTFNPSLSSSYS 182
Query: 89 SISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
+ SC S T N + S+S G + +T P P
Sbjct: 183 NRSCIPSTKTNYTMNYEDN--------------SYSKGVFVCDEVTLK-----PDVFPKF 223
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNS-SLISQMGTSIAGKFSYCLPDQGSSKINFGG 207
F + +G++GL G SLISQ + KFSYC P +++ G
Sbjct: 224 QF---GCGDSGGGDFGSASGVLGLAQGEQYSLISQTASKFKKKFSYCFPHNENTR---GS 277
Query: 208 IVAGAGVVSTPLIIR----------DHYYLSLEAISVGNQRLEFVSS---STGNIFVDTG 254
++ G +S ++ Y++ L ISV +RL SS S G I +D+G
Sbjct: 278 LLFGEKAISASPSLKFTRLLNPSSGSVYFVELIGISVAKKRLNVSSSLFASPGTI-IDSG 336
Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS----SQPKFPEVTIH 310
+ T LP + L++ + P + D CYN+ K PE+ +H
Sbjct: 337 TVITHLPTAAYEALRTAFQQEMLHCPSVSPPPQEKPLDT-CYNLKGCGGRNIKLPEIVLH 395
Query: 311 FRG-ADVKLSPSN-LFRNISDEIMCSAF-RGGNAN--IVYGRIMQINFLIGYDIEQAMVS 365
F G DV L PS L+ N C AF R + + + G Q++ + YDIE +
Sbjct: 396 FVGEVDVSLHPSGILWANGDLTQACLAFARKSHPSHVTIIGNRQQVSLKVVYDIEGGRLG 455
Query: 366 F 366
F
Sbjct: 456 F 456
>gi|9759559|dbj|BAB11161.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|21553652|gb|AAM62745.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|109134179|gb|ABG25087.1| At5g07030 [Arabidopsis thaliana]
Length = 439
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 101/398 (25%), Positives = 167/398 (41%), Gaps = 50/398 (12%)
Query: 1 AQNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTW 60
AQ+ +L + + +S + I +++ Y++ IGTP + ++DT SD W
Sbjct: 66 AQDQARLQYLSSLVAGRSVVPIASGRQMLQ-STTYIVKALIGTPAQPLLLAMDTSSDVAW 124
Query: 61 TQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTS-NCSEGDCSYSFLYGRGA 119
C C + C F P KS+++ ++SCS+ QC V + C CS++ YG +
Sbjct: 125 IPCSGC--VGCPSNT--AFSPAKSTSFKNVSCSAPQCKQVPNPTCGARACSFNLTYGSSS 180
Query: 120 YASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSL 179
A+ NL+ +T+ + P++ FGC +K T Q + G SL
Sbjct: 181 IAA----NLSQDTIRLAAD---PIKA--FTFGCVNKVAGGGTIPPPQGLLGLGR-GPLSL 230
Query: 180 ISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGVVSTPLIIR-----------DHYYLS 228
+SQ + FSYCLP S + F G + G S P ++ YY++
Sbjct: 231 MSQAQSIYKSTFSYCLPSFRS--LTFSGSLR-LGPTSQPQRVKYTQLLRNPRRSSLYYVN 287
Query: 229 LEAISVGNQRLEF--------VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQP 280
L AI VG + ++ S+ G IF D+G + T L + +++ +K P
Sbjct: 288 LVAIRVGRKVVDLPPAAIAFNPSTGAGTIF-DSGTVYTRLAKPVYEAVRNEFRKRVK--P 344
Query: 281 VKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNL-FRNISDEIMCSAFRGG 339
V G D CY S Q K P +T F+G ++ + NL + + C A
Sbjct: 345 TTAVVTSLGGFDT-CY--SGQVKVPTITFMFKGVNMTMPADNLMLHSTAGSTSCLAMAAA 401
Query: 340 NANI-----VYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
N+ V + Q N + D+ + RC+
Sbjct: 402 PENVNSVVNVIASMQQQNHRVLIDVPNGRLGLARERCS 439
>gi|88174579|gb|ABD39364.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
gi|88174585|gb|ABD39367.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
gi|88174595|gb|ABD39372.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174599|gb|ABD39374.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174607|gb|ABD39378.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 92/330 (27%), Positives = 148/330 (44%), Gaps = 47/330 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP +DTGS +W C E D P F +S+T +SC +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFC----ECDGCHTNPRTFLQSRSTTCAKVSCGT 56
Query: 95 SQCAVVTSN--CSEG----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
S C + S+ C + DC + Y G S S G L +TLTF+ ++P
Sbjct: 57 SMCLLGGSDPHCQDSENYPDCPFRVSYQDG---SASYGILYQDTLTFSDVQ----KIPGF 109
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK------ 202
FGC + + G++G+G G S++ Q + FSYCLP Q S +
Sbjct: 110 SFGCNMDSFGA-NEFGNVDGLLGMGAGPMSVLKQSSPTFDC-FSYCLPLQKSERGFFSKT 167
Query: 203 ---INFGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEF---VSSSTGNIFVD 252
+ G + V T ++ R + +++ L AISV +RL V S G +F D
Sbjct: 168 TGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRKGVVF-D 226
Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIH 310
+G + +P SV+S I+ +K AE S+ CY++ S + P +++H
Sbjct: 227 SGSELSYIP----DRALSVLSQRIRELLLKRGAAEEE-SERNCYDMRSVDEGDMPAISLH 281
Query: 311 F-RGADVKLSPSNLF--RNISDE-IMCSAF 336
F GA L +F R++ ++ + C AF
Sbjct: 282 FDDGARFDLGSHGVFVERSVQEQDVWCLAF 311
>gi|88174563|gb|ABD39356.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
Length = 323
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 92/331 (27%), Positives = 149/331 (45%), Gaps = 47/331 (14%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y++ + +GTP +DTGS +W C E D P F +S+T +SC +
Sbjct: 1 YVISVGLGTPSKTQILEIDTGSSTSWVFC----ECDGCHTNPRTFLQSRSTTCAKVSCGT 56
Query: 95 SQCAVVTSN--CSEG----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
S C + S+ C + DC + Y G S S G L +TLTF+ ++P
Sbjct: 57 SMCLLGGSDPHCQDSENYPDCPFRVSYQDG---SASYGILYQDTLTFSDVQ----KIPGF 109
Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK------ 202
FGC + + G++G+G G S++ Q + G FSYCLP Q S +
Sbjct: 110 SFGCNMDSFGA-NEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQMSERGFFSKT 167
Query: 203 ---INFGGIVAG--AGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSS--STGNIFV 251
+ GG +A V T ++ R + +++ L AISV +RL S S +
Sbjct: 168 TGYFSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVF 227
Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTI 309
D+G + +P SV+S I+ ++ AE S+ CY++ S + P +++
Sbjct: 228 DSGSELSYIP----DRALSVLSQRIRELLLRRGAAEEE-SERNCYDMRSVDEGDMPAISL 282
Query: 310 HF-RGADVKLSPSNLF--RNISDE-IMCSAF 336
HF GA L +F R++ ++ + C AF
Sbjct: 283 HFDDGARFDLGSHGVFVERSVQEQDVWCLAF 313
>gi|356518800|ref|XP_003528065.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 438
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 94/370 (25%), Positives = 158/370 (42%), Gaps = 50/370 (13%)
Query: 34 IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCE-PCPELDCFKQEPPLFDPKKSSTYNSISC 92
Y + L+IG PP F +DTGSD TW QC+ PC C + PL+ P + + C
Sbjct: 76 FYNVTLNIGQPPRPYFLDIDTGSDLTWLQCDAPCSR--CSQTPHPLYRPSN----DFVPC 129
Query: 93 SSSQCAVVTS----NCS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
S CA + +C C Y Y Y+S G L + T N T+G+ +++
Sbjct: 130 RHSLCASLHHSDNYDCEVPHQCDYEVQYA-DHYSSL--GVLLHDVYTLNFTNGVQLKV-R 185
Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQGSSKINF 205
+ GCG+ + S G++GLG G +SL SQ+ + + +CL QG I F
Sbjct: 186 MALGCGYDQIFPDPSHHPLDGMLGLGRGKTSLTSQLNSQGLVRNVIGHCLSAQGGGYIFF 245
Query: 206 GGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSSSTG----NIFVDTGVLRTLLP 261
G + + + TP+ RD+ + S + G L F +G + DTG T
Sbjct: 246 GDVYDSSRLTWTPMSSRDYKHYS----AAGAAELLFGGKKSGIGSLHAVFDTGSSYTYFN 301
Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYN--------ISSQPKFPEVTIHFRG 313
+ L S + +P+K + LC+ + F + + F
Sbjct: 302 PYAYQALISWLGKESGGKPLKEAHDDQTLP--LCWRGRRPFRSIYEVRKYFKPIVLSFTS 359
Query: 314 -----ADVKLSP------SNLFRNISDEIMCSAFRG-GNANIVYGRIMQINFLIGYDIEQ 361
A ++ P SN+ N+ I+ + G G+ N++ G I +N ++ +D ++
Sbjct: 360 NGRSKAQFEMPPEAYLIISNM-GNVCLGILNGSEVGMGDLNLI-GDISMLNKVMVFDNDK 417
Query: 362 AMVSFKPSRC 371
++ + P+ C
Sbjct: 418 QLIGWTPADC 427
>gi|222629275|gb|EEE61407.1| hypothetical protein OsJ_15596 [Oryza sativa Japonica Group]
Length = 466
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 105/400 (26%), Positives = 159/400 (39%), Gaps = 90/400 (22%)
Query: 35 YLMHLSIGTPP----VDIFGSVDTGSDCTWTQCEPCPELDCFKQE----------PPLFD 80
Y + LS+G P V +F +DTGSD W C P + C + PP D
Sbjct: 88 YTLSLSVGPPSTASSVSLF--LDTGSDLVWFPCAPFTCMLCEGKATPGGNHSSPLPPPID 145
Query: 81 PKK------------SSTYNSISCSSSQC---AVVTSNCSEGDCS-YSFLYGRGAYASFS 124
++ SS S C++++C A+ T +C+ C + YG G+ +
Sbjct: 146 SRRISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLVA-- 203
Query: 125 SGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMG 184
NL + ++ + VE N F C H LA P G+ G G G SL +Q+
Sbjct: 204 --NLRRGRVGLAAS--MAVE--NFTFACAHTALAEPV------GVAGFGRGPLSLPAQLA 251
Query: 185 TSIAGKFSYCLPDQGSSKINFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLE 240
S++G G+S+ +F V TPL+ Y ++LEA+SVG +R++
Sbjct: 252 PSLSGSTDAAA--IGASETDF---------VYTPLLHNPKHPYFYSVALEAVSVGGKRIQ 300
Query: 241 -------FVSSSTGNIFVDTGVLRTLLPLEYHSNLKS---VMSNMIKAQPVKGVGAEPGF 290
G + VD+G T+LP + + + + +G A+ G
Sbjct: 301 AQPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAEGAEAQTGL 360
Query: 291 SDVLCYNIS-SQPKFPEVTIHFRG-ADVKLSPSNLFRNISDE----IMCSAFR------- 337
+ CY+ S S P V +HFRG A V L N F E + C
Sbjct: 361 AP--CYHYSPSDRAVPPVALHFRGNATVALPRRNYFMGFKSEEGRSVGCLMLMNVGGNND 418
Query: 338 ----GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
GG G Q F + YD++ V F RCT+
Sbjct: 419 DGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCTD 458
>gi|125571841|gb|EAZ13356.1| hypothetical protein OsJ_03278 [Oryza sativa Japonica Group]
Length = 447
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 48/128 (37%), Positives = 65/128 (50%), Gaps = 15/128 (11%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
Y + +GTP +DTGSD W QC PC C+ Q +FDP++SSTY + CSS
Sbjct: 86 YFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRR--CYAQRGQVFDPRRSSTYRRVPCSS 143
Query: 95 SQCAVV------TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
QC + + + G C Y YG G S S+G+LAT+ L F + + + NV
Sbjct: 144 PQCRALRFPGCDSGGAAGGGCRYMVAYGDG---SSSTGDLATDKLAFANDT----YVNNV 196
Query: 149 IFGCGHKN 156
GCG N
Sbjct: 197 TLGCGRDN 204
Score = 45.8 bits (107), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 26/90 (28%), Positives = 42/90 (46%), Gaps = 11/90 (12%)
Query: 295 CYNISSQPKF--PEVTIHFRG-ADVKLSPSNLF-------RNISDEIMCSAFRGGNANI- 343
CY++ +P P + +HF G AD+ L P N F R + C F + +
Sbjct: 358 CYDLRGRPAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDGLS 417
Query: 344 VYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
V G + Q F + +D+E+ + F P CT+
Sbjct: 418 VIGNVQQQGFRVVFDVEKERIGFAPKGCTS 447
>gi|359476199|ref|XP_003631804.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 421
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 94/305 (30%), Positives = 137/305 (44%), Gaps = 48/305 (15%)
Query: 24 YQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKK 83
+ + D +L+ ++ GTPP + +DTGS TWTQC+ C ++C + F+
Sbjct: 117 HNNNLFDEDGNFLVDVAFGTPPQNFMLILDTGSSITWTQCKAC--VNCLQDSHRYFNWSA 174
Query: 84 SSTYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
SSTY+S SC V +N Y+ YG S S GN +T+T +
Sbjct: 175 SSTYSSGSCIP---GTVENN-------YNMTYGD---DSTSVGNYGCDTMTLEPSD---- 217
Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG---- 199
FGCG N S G++GLG G S +SQ + FSYCLP++
Sbjct: 218 VFQKFQFGCGRNNKGD--FGSGVDGMLGLGQGQLSTVSQTASKFNKVFSYCLPEEDSIGS 275
Query: 200 ----------SSKINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSS---ST 246
SS + F +V G G + +Y+++L ISVGN+RL SS S
Sbjct: 276 LLFGEKATSQSSSLKFTSLVNGPGTLQE----SGYYFVNLSDISVGNERLNIPSSVFASP 331
Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQPKF 304
G I +D+ + T LP +S LK+ + P+ + G D+L CYN
Sbjct: 332 GTI-IDSRTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKG--DILDTCYNXXXX-XX 387
Query: 305 PEVTI 309
PE+TI
Sbjct: 388 PELTI 392
>gi|222640709|gb|EEE68841.1| hypothetical protein OsJ_27628 [Oryza sativa Japonica Group]
Length = 375
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 91/357 (25%), Positives = 149/357 (41%), Gaps = 78/357 (21%)
Query: 52 VDTGSDCTWTQCEPCPELDCFKQE--PPL--FDPKKSSTYNSISCSSSQCAVVTSNCSEG 107
VDTGSD WTQC+ + PPL P ++ + +C++S AV
Sbjct: 57 VDTGSDLIWTQCKLSSSTAAAARHGSPPLSRTAPARTGAFTR-TCTASAAAV-------- 107
Query: 108 DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQT 167
G LA+ET TF + + + + FGCG + S T
Sbjct: 108 ------------------GVLASETFTFGARRAVSLRLG---FGCGALSAGSLIG---AT 143
Query: 168 GIIGLGPGNSSLISQMGTSIAGKFSYCL---PDQGSSKINFGGI-----------VAGAG 213
GI+GL P + SLI+Q+ +FSYCL D+ +S + FG + +
Sbjct: 144 GILGLSPESLSLITQLKIQ---RFSYCLTPFADKKTSPLLFGAMADLSRHKTTRPIQTTA 200
Query: 214 VVSTPLIIRDHYYLSLEAISVGNQRLEFVSSST-------GNIFVDTGVLRTLLPLEYHS 266
+VS P + +YY+ L IS+G++RL ++S G VD+G L
Sbjct: 201 IVSNP-VETVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAAFE 259
Query: 267 NLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--------KFPEVTIHFRGADVKL 318
+K + ++++ PV E LC+ + + + P + +HF G +
Sbjct: 260 AVKEAVMDVVRL-PVANRTVE---DYELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMV 315
Query: 319 SPS-NLFRNISDEIMCSAF---RGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
P N F+ +MC A G+ + G + Q N + +D++ SF P++C
Sbjct: 316 LPRDNYFQEPRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQHHKFSFAPTQC 372
>gi|294463081|gb|ADE77078.1| unknown [Picea sitchensis]
Length = 370
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 99/384 (25%), Positives = 153/384 (39%), Gaps = 76/384 (19%)
Query: 52 VDTGSDCTWTQCEP------CPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV----- 100
+DTGSD W C CPE +F P+ SS+ + ++C+ S C +
Sbjct: 1 MDTGSDLVWVPCTRNYSCINCPED---SASNGVFLPRMSSSLHLVTCADSNCKTLYGNNT 57
Query: 101 ----------TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
NCSE Y YGRG+ A G L TETL LP+E
Sbjct: 58 ELLCQSCAGSLKNCSETCPPYGIQYGRGSTA----GLLLTETLN------LPLENGEGAR 107
Query: 151 GCGHKNLASPTSDSKQ-TGIIGLGPGNSSLISQMGTSIA-GKFSYCLPDQGSSKINFGGI 208
H + S+Q +GI G G G S+ SQ+G I +F+YCL + N +
Sbjct: 108 AITHFAVGCSIVSSQQPSGIAGFGRGALSMPSQLGEHIGKDRFAYCLQSHRFDEENKKSL 167
Query: 209 -VAGAGVVS-------TPLIIRD----------HYYLSLEAISVGNQRLE--------FV 242
V G + TP + +YY+ L +S+G +RL+ F
Sbjct: 168 MVLGDKALPNNIPLNYTPFLTNSRAPPSSQYGVYYYIGLRGVSIGGKRLKQLPSKLLRFD 227
Query: 243 SSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP 302
+ G +D+G T+ E ++ + ++ I + V + G LCY+++
Sbjct: 228 TKGNGGTIIDSGTTFTVFSDEIFKHIAAGFASQIGYRRAGEVEDKTGMG--LCYDVTGLE 285
Query: 303 K--FPEVTIHFR-GADVKLSPSNLFRNIS--DEIMCSAF--RG-----GNANIVYGRIMQ 350
PE HF+ G+D+ L +N F S D I + RG ++ G Q
Sbjct: 286 NIVLPEFAFHFKGGSDMVLPVANYFSYFSSFDSICLTMISSRGLLEVDSGPAVILGNDQQ 345
Query: 351 INFLIGYDIEQAMVSFKPSRCTNY 374
+F + YD E+ + F C +
Sbjct: 346 QDFYLLYDREKNRLGFTQQTCKTF 369
>gi|255685718|gb|ACU28348.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
gi|255685720|gb|ACU28349.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
gi|255685724|gb|ACU28351.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
Length = 91
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 47/106 (44%), Positives = 60/106 (56%), Gaps = 15/106 (14%)
Query: 37 MHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQ 96
M L IGTPP +I +DTGS+ WTQC PC L C+ Q+ P+FDP KSST+ C
Sbjct: 1 MKLQIGTPPFEIEAVLDTGSELIWTQCLPC--LHCYDQKAPIFDPSKSSTFKETRC---- 54
Query: 97 CAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
N C Y +Y +Y + G LATET+T +STSG+P
Sbjct: 55 ------NTPNHSCPYKIVYDDKSY---TLGTLATETVTIHSTSGVP 91
>gi|18409620|ref|NP_566966.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|13430562|gb|AAK25903.1|AF360193_1 unknown protein [Arabidopsis thaliana]
gi|4886277|emb|CAB43423.1| putative protein [Arabidopsis thaliana]
gi|14532764|gb|AAK64083.1| unknown protein [Arabidopsis thaliana]
gi|15450892|gb|AAK96717.1| Unknown protein [Arabidopsis thaliana]
gi|30387567|gb|AAP31949.1| At3g52500 [Arabidopsis thaliana]
gi|332645431|gb|AEE78952.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 469
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 110/403 (27%), Positives = 157/403 (38%), Gaps = 89/403 (22%)
Query: 35 YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEP---PLFDPKKSSTYN 88
Y + LS GTP I DTGS W C C D +P P F PK SS+
Sbjct: 90 YSVSLSFGTPSQTIPFVFDTGSSLVWLPCTSRYLCSGCDFSGLDPTLIPRFIPKNSSSSK 149
Query: 89 SISCSSSQCAVV-------------TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTF 135
I C S +C + T NC+ G Y YG G+ A G L TE L F
Sbjct: 150 IIGCQSPKCQFLYGPNVQCRGCDPNTRNCTVGCPPYILQYGLGSTA----GVLITEKLDF 205
Query: 136 NSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL 195
+ +P+ + GC S S + GI G G G SL SQM +FS+CL
Sbjct: 206 PD-----LTVPDFVVGC------SIISTRQPAGIAGFGRGPVSLPSQMNLK---RFSHCL 251
Query: 196 PDQGSSKINF---------GGIVAGA---GVVSTPL---------IIRDHYYLSLEAISV 234
+ N G +G+ G+ TP ++YYL+L I V
Sbjct: 252 VSRRFDDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYV 311
Query: 235 GNQRLEFV-------SSSTGNIFVDTGVLRTLLPLEYHS----NLKSVMSNMIKAQPV-K 282
G + ++ ++ G VD+G T + S MSN + + + K
Sbjct: 312 GRKHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEK 371
Query: 283 GVGAEPGFSDVLCYNISSQPKF--PEVTIHFRG-ADVKLSPSNLFRNI--SDEIMCSAFR 337
G P C+NIS + PE+ F+G A ++L SN F + +D + +
Sbjct: 372 ETGLGP------CFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVS 425
Query: 338 GGNAN--------IVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
N I+ G Q N+L+ YD+E F +C+
Sbjct: 426 DKTVNPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 468
>gi|255545620|ref|XP_002513870.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223546956|gb|EEF48453.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 535
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 103/415 (24%), Positives = 167/415 (40%), Gaps = 64/415 (15%)
Query: 5 QKLPFYNDNETPKSPISIIYQAEII-------------SVDDIYLMHLSIGTPPVDIFGS 51
Q L DN+ + + + Q +++ +D ++ + IGTP V +
Sbjct: 59 QYLQLLLDNDLKRQKMKLGAQNQLLFPSLGSHTFFYGNDLDWLHYTWIDIGTPNVSFLVA 118
Query: 52 VDTGSDCTWTQCEPCPELDCFKQEP-------PL------FDPKKSSTYNSISCSSSQCA 98
+D GSD +W C DC + P PL + P S+T +SC+ C
Sbjct: 119 LDAGSDLSWVPC------DCIQCAPLSASLYKPLDRDLSEYRPSLSTTSRHLSCNHQLCE 172
Query: 99 VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTF-----NSTSGLPVEMPNVIFGCG 153
+ S+C ++ + SSG L + L +S S +VI GCG
Sbjct: 173 -LGSHCKNLKDPCPYIADYADPNTSSSGFLVEDILHLASVSDDSNSTQKRVQASVILGCG 231
Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQGSSKINFGGIVAG 211
K + G++GLGPG+ S+ S + + I FS C GS I FG
Sbjct: 232 RKQTGGYLDGAAPDGVMGLGPGSISVPSLLAKAGLIRKSFSLCFDVNGSGTILFGD-QGH 290
Query: 212 AGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLPLEYHSN 267
STPL+ D Y + +E+ VGN L+ S VD+G T LP++ ++
Sbjct: 291 TSQKSTPLLPTQGNYDAYLIEVESYCVGNSCLK---QSGFKALVDSGASFTYLPIDVYNK 347
Query: 268 LKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFRGADVKLSPSNLF- 324
+ + AQ + G + CYN SS+ P + + F L ++ +
Sbjct: 348 IVLEFDKQVNAQRISSQGGPWNY----CYNTSSKQLDNVPAMRLSFLMNQSLLIHNSTYY 403
Query: 325 --RNISDEIMCSAFRGGNANIVYGRIMQINFLIGY----DIEQAMVSFKPSRCTN 373
+N + C + + N YG I Q N++ GY D+E + + S C +
Sbjct: 404 VPQNQEFAVFCLTLQPTDLN--YGIIGQ-NYMTGYRVVFDMENLKLGWSSSNCKD 455
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.317 0.134 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,059,136,327
Number of Sequences: 23463169
Number of extensions: 265984114
Number of successful extensions: 581998
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 822
Number of HSP's successfully gapped in prelim test: 2254
Number of HSP's that attempted gapping in prelim test: 575237
Number of HSP's gapped (non-prelim): 3282
length of query: 374
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 230
effective length of database: 8,980,499,031
effective search space: 2065514777130
effective search space used: 2065514777130
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 77 (34.3 bits)