BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 038667
(411 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 357 bits (916), Expect = 6e-96, Method: Compositional matrix adjust.
Identities = 187/417 (44%), Positives = 260/417 (62%), Gaps = 13/417 (3%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRS---HNTYQAQILPSNDIS 57
LIH SILSPY NPN + A R +R + S R+AYL +I+ N ++ +LPS
Sbjct: 38 LIHWGSILSPYFNPNASVAERAERIVKTSATRIAYLYAQIKGDIHMNDFELNLLPST--Y 95
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
LF VN S+GQP PQ MDTGS++LWV C PC+ C+ Q G + PS+SS+YA +PC
Sbjct: 96 EPLFLVNFSMGQPATPQLAIMDTGSNILWVRCAPCKRCTQQNGPLLDPSKSSTYASLPCT 155
Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
+ C Y P A C+ ++C Y Y G ++ ++TEQL F ++DE V VVFGC
Sbjct: 156 NTMCHYAPSAYCNRL-NQCGYNLSYATGLSSAGVLATEQLIFHSSDEGVNAVPSVVFGCS 214
Query: 178 FSTN--RNFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
++ +F+G+FGLG G +S V+++ S FSYC+G++ DP Y +N+L+ G+ A EG
Sbjct: 215 HENGDYKDRRFTGVFGLGKGITSFVTRMGSKFSYCLGNIADPHYGYNQLVFGEKANF-EG 273
Query: 236 DATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEA 295
+TPL+ ++GHYY+TLE ISV + LDI+ F + +IDSGT +TWL + A+ A
Sbjct: 274 YSTPLKVVNGHYYVTLEGISVGEKRLDIDSTAFSMKGNEKSALIDSGTALTWLAESAFRA 333
Query: 296 LRDEVMIRLEGEQMRSYSWPDKL-CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQ 354
L +EV L+G M W CY G +S DL GFP V FHF GGA L L+ +SMFYQ
Sbjct: 334 LDNEVRQLLDGVLMP--FWRGSFACYKGTVSQDLIGFPVVTFHFSGGADLDLDTESMFYQ 391
Query: 355 PRPDAFCMAVNPASI-NNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVLDD 410
PD C+AV AS N + +FS+IG+MAQQ+YN+ YD+ + FQR+DC++L D
Sbjct: 392 ATPDILCIAVRQASAYGNDFKSFSVIGLMAQQYYNMAYDLNSNKLFFQRIDCQLLVD 448
>gi|224091849|ref|XP_002309371.1| predicted protein [Populus trichocarpa]
gi|222855347|gb|EEE92894.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 330 bits (847), Expect = 6e-88, Method: Compositional matrix adjust.
Identities = 185/418 (44%), Positives = 254/418 (60%), Gaps = 28/418 (6%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSH---NTYQAQILPSNDIS 57
LIHRDSI+SPY N+ A R +R + S+ARL+YL KI N + PS S
Sbjct: 41 LIHRDSIVSPYYRSNDTVADRTERTMKASLARLSYLYAKIERDFDINDLWLNLHPS--AS 98
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ-LGTIFYPSRSSSYAYVPC 116
LF VN S+GQPPVPQ MDTGSSLLW+ C PC+ CS Q +G +F PS SS+Y + C
Sbjct: 99 EPLFLVNFSMGQPPVPQLAIMDTGSSLLWIQCAPCKSCSQQIIGPMFDPSISSTYDSLSC 158
Query: 117 DSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
+ CRY P C + +C+Y Q Y+ G + ++TEQL F ++DE V +V+FGC
Sbjct: 159 KNIICRYAPSGECDS-SSQCVYNQTYVEGLPSVGVIATEQLIFGSSDEGRNAVNNVLFGC 217
Query: 177 GFSTNRNFK---FSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIID 233
N N+K F+G+FGLG G +S+V+Q+ S FSYCIG++ DPDY +N+L+L +G +
Sbjct: 218 SHR-NGNYKDRRFTGVFGLGSGITSVVNQMGSKFSYCIGNIADPDYSYNQLVLSEGVNM- 275
Query: 234 EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAY 293
EG +TPL +DGHY + LE ISV L I+P+ FKR + V+IDSGT TWL + Y
Sbjct: 276 EGYSTPLDVVDGHYQVILEGISVGETRLVIDPSAFKRTEKQRRVIIDSGTAPTWLAENEY 335
Query: 294 EALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFY 353
AL EV L+ + + LCY G + DL GFP V FHF GA L ++ +
Sbjct: 336 RALEREVRNLLD-RFLTPFMRESFLCYKGKVGQDLVGFPAVTFHFAEGADLVVDTE---- 390
Query: 354 QPRPDAFCMAVNPASINNR-YVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVLDD 410
+ AS+ + + +FS+IG+MAQQ+YNV YD+ + + FQR+DCE+LD+
Sbjct: 391 ----------MRQASVYGKDFKDFSVIGLMAQQYYNVAYDLNKHKLFFQRIDCELLDE 438
>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
Length = 430
Score = 330 bits (845), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 180/427 (42%), Positives = 258/427 (60%), Gaps = 23/427 (5%)
Query: 1 LIHRDSILSPYNNPNE----NPAHRVQRGINISIARLAYLQEKI-RSHNTYQAQILPSND 55
LI R+S++ +NP+ P +Q +IS AR YLQ I + + Q+
Sbjct: 5 LIRRESVVR--HNPDARVPVTPEDHIQHMTDISSARFKYLQNSIVKELGSSDFQVDVHQA 62
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ--LGTIFYPSRSSSYAY 113
I SLF+VN S+GQPPVPQFT MDTGSSLLW+ C+PC+ CS + +F P+ SS++
Sbjct: 63 IKTSLFFVNFSVGQPPVPQFTIMDTGSSLLWIQCHPCKHCSSNHMIHPVFNPALSSTFVE 122
Query: 114 VPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
CD CRY P CS+ ++C+Y Q+Y+ G + ++ E+LTF + +T+ Q +
Sbjct: 123 CSCDDRFCRYAPNGHCSS--NKCVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQPIA 180
Query: 174 FGCGFSTNRNF--KFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAI 231
FGCG +F+GI GLG +SL QL S FSYCIG L + +Y +N+L+LG+ A
Sbjct: 181 FGCGHENGEQLESEFTGILGLGAKPTSLAVQLGSKFSYCIGDLANKNYGYNQLVLGEDAD 240
Query: 232 IDEGDATPLQF--IDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLV 289
I GD TP++F +G YY+ LE ISV + L+I P +FKR S GV++D+GT TWL
Sbjct: 241 I-LGDPTPIEFETENGIYYMNLEGISVGDKQLNIEPVVFKRRGSRTGVILDTGTLYTWLA 299
Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKD 349
AY L +E+ L+ ++ + + D LCYHG ++ +L GFP V FHF GGA+LA+E
Sbjct: 300 DIAYRELYNEIKSILD-PKLERFWFRDFLCYHGRVNEELIGFPVVTFHFAGGAELAMEAT 358
Query: 350 SMFYQPRP-----DAFCMAVNPASIN-NRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
SMFY + FCM+V P + + Y +F+ IG+MAQQ+YN+ YD+ + QR+
Sbjct: 359 SMFYPMTESDTYHNVFCMSVRPTTEHGGEYKDFTAIGLMAQQYYNIAYDLKERNIYLQRI 418
Query: 404 DCEVLDD 410
DC +LDD
Sbjct: 419 DCVLLDD 425
>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
Length = 408
Score = 327 bits (838), Expect = 7e-87, Method: Compositional matrix adjust.
Identities = 185/417 (44%), Positives = 253/417 (60%), Gaps = 28/417 (6%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
LIH+DSILS Y + + N R + R A++ ++I QA ++ D
Sbjct: 13 LIHQDSILSSYQSLDRNNVERRR------TRRAAFITDEI------QANMVA--DDRGQA 58
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
F VN S+G+PPVPQ +DTGS LLWV C PC DC Q IF PS+SS+Y + DS
Sbjct: 59 FLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDLSYDSPI 118
Query: 121 CRYFPYARCSAYKH--RCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
C P + Y H +CIY Y G +S ++TE + F+ +D+ T+ V VVFGCG
Sbjct: 119 CPNSPQKK---YNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCGH 175
Query: 179 STNRNF--KFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGD 236
S F + SGI GL G S+VS+L S FSYCIG L DP Y HN+L+LGDG + EG
Sbjct: 176 SNRGRFDGQQSGILGLSAGDQSIVSRLGSRFSYCIGDLFDPHYTHNQLVLGDGVKM-EGS 234
Query: 237 ATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYEA 295
+TP +G YY+TLE ISV LDINP +F+R +SG GGV++DSGT T+L K+ ++
Sbjct: 235 STPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTATFLAKDGFDP 294
Query: 296 LRDEV--MIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFY 353
L +E+ ++R +Q+ + P LCY G ++ DL+GFP + FHF GA L L+ +S+F
Sbjct: 295 LSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHFAEGADLVLDANSLFV 354
Query: 354 QPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVLDD 410
Q D FC+AV +++ N S+IG+MAQQ YNV YD+ K+ FQR DCE+L+D
Sbjct: 355 QKNQDVFCLAVLESNLKNIG---SVIGIMAQQHYNVAYDLIGKRVYFQRTDCELLED 408
>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 440
Score = 327 bits (838), Expect = 7e-87, Method: Compositional matrix adjust.
Identities = 185/417 (44%), Positives = 253/417 (60%), Gaps = 28/417 (6%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
LIH+DSILS Y + + N R + R A++ ++I QA ++ D
Sbjct: 45 LIHQDSILSSYQSLDRNNVERRR------TRRAAFITDEI------QANMVA--DDRGQA 90
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
F VN S+G+PPVPQ +DTGS LLWV C PC DC Q IF PS+SS+Y + DS
Sbjct: 91 FLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDLSYDSPI 150
Query: 121 CRYFPYARCSAYKH--RCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
C P + Y H +CIY Y G +S ++TE + F+ +D+ T+ V VVFGCG
Sbjct: 151 CPNSPQKK---YNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCGH 207
Query: 179 STNRNF--KFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGD 236
S F + SGI GL G S+VS+L S FSYCIG L DP Y HN+L+LGDG + EG
Sbjct: 208 SNRGRFDGQQSGILGLSAGDQSIVSRLGSRFSYCIGDLFDPHYTHNQLVLGDGVKM-EGS 266
Query: 237 ATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYEA 295
+TP +G YY+TLE ISV LDINP +F+R +SG GGV++DSGT T+L K+ ++
Sbjct: 267 STPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTATFLAKDGFDP 326
Query: 296 LRDEV--MIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFY 353
L +E+ ++R +Q+ + P LCY G ++ DL+GFP + FHF GA L L+ +S+F
Sbjct: 327 LSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHFAEGADLVLDANSLFV 386
Query: 354 QPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVLDD 410
Q D FC+AV +++ N S+IG+MAQQ YNV YD+ K+ FQR DCE+L+D
Sbjct: 387 QKNQDVFCLAVLESNLKNIG---SVIGIMAQQHYNVAYDLIGKRVYFQRTDCELLED 440
>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
Length = 408
Score = 327 bits (837), Expect = 9e-87, Method: Compositional matrix adjust.
Identities = 185/417 (44%), Positives = 253/417 (60%), Gaps = 28/417 (6%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
LIH+DSILS Y + + N R + R A++ ++I QA ++ D
Sbjct: 13 LIHQDSILSSYQSLDRNNVERRR------TRRAAFIXDEI------QANMVA--DDRGQA 58
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
F VN S+G+PPVPQ +DTGS LLWV C PC DC Q IF PS+SS+Y + DS
Sbjct: 59 FLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDLSYDSPI 118
Query: 121 CRYFPYARCSAYKH--RCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
C P + Y H +CIY Y G +S ++TE + F+ +D+ T+ V VVFGCG
Sbjct: 119 CPNSPQKK---YNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCGH 175
Query: 179 STNRNF--KFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGD 236
S F + SGI GL G S+VS+L S FSYCIG L DP Y HN+L+LGDG + EG
Sbjct: 176 SNRGRFDGQQSGILGLSAGDQSIVSRLGSRFSYCIGDLFDPHYTHNQLVLGDGVKM-EGS 234
Query: 237 ATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYEA 295
+TP +G YY+TLE ISV LDINP +F+R +SG GGV++DSGT T+L K+ ++
Sbjct: 235 STPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTATFLAKDGFDP 294
Query: 296 LRDEV--MIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFY 353
L +E+ ++R +Q+ + P LCY G ++ DL+GFP + FHF GA L L+ +S+F
Sbjct: 295 LSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHFAEGADLVLDANSLFV 354
Query: 354 QPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVLDD 410
Q D FC+AV +++ N S+IG+MAQQ YNV YD+ K+ FQR DCE+L+D
Sbjct: 355 QKNQDVFCLAVLESNLKNIG---SVIGIMAQQHYNVAYDLIGKRVYFQRTDCELLED 408
>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 458
Score = 327 bits (837), Expect = 9e-87, Method: Compositional matrix adjust.
Identities = 181/424 (42%), Positives = 251/424 (59%), Gaps = 17/424 (4%)
Query: 1 LIHRDSI--LSPYNNPNENPAHRVQRGINISIARLAYLQEKI-RSHNTYQAQILPSNDIS 57
LIHR+S+ L+P P ++ +IS AR YLQ I + + Q+ I
Sbjct: 33 LIHRESVARLNPNARVPITPEDHIKHLTDISSARFKYLQNSIDKELGSSNFQVDVEQAIK 92
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSP--QLGTIFYPSRSSSYAYVP 115
SLF VN S+GQPPVPQ T MDTGSSLLW+ C PC+ CS + +F P+ SS++
Sbjct: 93 TSLFLVNFSVGQPPVPQLTIMDTGSSLLWIQCQPCKHCSSDHMIHPVFNPALSSTFVECS 152
Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
CD CRY P C + ++C+Y Q+Y+ G + ++ E+LTF + +T+ Q + FG
Sbjct: 153 CDDRFCRYAPNGHCGS-SNKCVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQPIAFG 211
Query: 176 CGFSTNRNFK--FSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIID 233
CG+ + F+GI GLG +SL QL S FSYCIG L + +Y +N+L+LG+ A I
Sbjct: 212 CGYENGEQLESHFTGILGLGAKPTSLAVQLGSKFSYCIGDLANKNYGYNQLVLGEDADI- 270
Query: 234 EGDATPLQFIDGH--YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKE 291
GD TP++F + YY+ LE ISV L+I P +FKR GV++DSGT TWL
Sbjct: 271 LGDPTPIEFETENSIYYMNLEGISVGDTQLNIEPVVFKRRGPRTGVILDSGTLYTWLADI 330
Query: 292 AYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSM 351
AY L +E+ L+ ++ + + D LCYHG +S +L GFP V FHF GGA+LA+E SM
Sbjct: 331 AYRELYNEIKSILD-PKLERFWFRDFLCYHGRVSEELIGFPVVTFHFAGGAELAMEATSM 389
Query: 352 FY---QPRP-DAFCMAVNPASIN-NRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
FY +P + FCM+V P + Y F+ IG+MAQQ+YN+GYD+ K QR+DC
Sbjct: 390 FYPLSEPNTFNVFCMSVKPTKEHGGEYKEFTAIGLMAQQYYNIGYDLKEKNIYLQRIDCV 449
Query: 407 VLDD 410
LDD
Sbjct: 450 QLDD 453
>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 456
Score = 312 bits (800), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 174/421 (41%), Positives = 252/421 (59%), Gaps = 17/421 (4%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRS----HNTYQAQILPSNDI 56
LIHR+S L P + NE R +R SI R +L+ KI+ N ++ ++P N
Sbjct: 42 LIHRNSYLHPLYDQNETVEDRSKREQTSSIERFDFLESKIKELKSVGNEARSSLIPFN-- 99
Query: 57 SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPC 116
S F VN+SIG PPV Q +DTGSSLLWV C PC +C Q + F P +S S+ + C
Sbjct: 100 RGSGFLVNLSIGSPPVTQLVVVDTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTLGC 159
Query: 117 DSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
Y +C+ + ++ Y YL G + ++ E L F+ DE I ++ FGC
Sbjct: 160 GFPGYNYINGYKCNRF-NQAEYKLRYLGGDSSQGILAKESLLFETLDEGKIKKSNITFGC 218
Query: 177 G---FSTNRNFKFSGIFGLGI-GRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAII 232
G TN + ++G+FGLG ++ +QL + FSYCIG +++P Y HN L+LG G+ I
Sbjct: 219 GHMNIKTNNDDAYNGVFGLGAYPHITMATQLGNKFSYCIGDINNPLYTHNHLVLGQGSYI 278
Query: 233 DEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKE 291
EGD+TPLQ GHYY+TL++ISV + L I+PN FK D GGV+IDSG T L
Sbjct: 279 -EGDSTPLQIHFGHYYVTLQSISVGSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANG 337
Query: 292 AYEALRDEVMIRLEG--EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKD 349
+E L DE++ ++G E++ + + LC+ G++S DL GFP V FHF GGA L LE
Sbjct: 338 GFELLYDEIVDLMKGLLERIPTQRKFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESG 397
Query: 350 SMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVLD 409
S+F Q D FC+A+ P+ N+ +N S+IG++AQQ YNVG+D+ + + F+R+DC++LD
Sbjct: 398 SLFRQHGGDRFCLAILPS--NSELLNLSVIGILAQQNYNVGFDLEQMKVFFRRIDCQLLD 455
Query: 410 D 410
+
Sbjct: 456 E 456
>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 446
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 182/421 (43%), Positives = 245/421 (58%), Gaps = 34/421 (8%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
LIH +S LSPYN+ + H + + + + N Y + ++PS +
Sbjct: 47 LIHHESSLSPYNSKDTIWDHYSHKILKQTFS------------NDYISNLVPSPRYV--V 92
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
F +N SIG+PP+PQ MDTGSSL WV C+PC CS Q IF PS+SS+Y+ + C
Sbjct: 93 FLMNFSIGEPPIPQLAVMDTGSSLTWVMCHPCSSCSQQSVPIFDPSKSSTYSNLSC--SE 150
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG--F 178
C +C C Y+ Y+ + + EQLT + DES I V ++FGCG F
Sbjct: 151 CN-----KCDVVNGECPYSVEYVGSGSSQGIYAREQLTLETIDESIIKVPSLIFGCGRKF 205
Query: 179 STNRNF----KFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
S + N +G+FGLG GR SL+ FSYCIG+L + +Y N+L+LGD A + +
Sbjct: 206 SISSNGYPYQGINGVFGLGSGRFSLLPSFGKKFSYCIGNLRNTNYKFNRLVLGDKANM-Q 264
Query: 235 GDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRD--DSGGGVMIDSGTDVTWLVKEA 292
GD+T L I+G YY+ LEAIS+ GR LDI+P +F+R D+ GV+IDSG D TWL K
Sbjct: 265 GDSTTLNVINGLYYVNLEAISIGGRKLDIDPTLFERSITDNNSGVIIDSGADHTWLTKYG 324
Query: 293 YEALRDEVMIRLEGEQMRSYS---WPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKD 349
+E L EV LEG + + P LCY G++S DL GFP V FHF GA L L+
Sbjct: 325 FEVLSFEVENLLEGVLVLAQQDKHNPYTLCYSGVVSQDLSGFPLVTFHFAEGAVLDLDVT 384
Query: 350 SMFYQPRPDAFCMAVNPAS-INNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
SMF Q + FCMA+ P + + Y +FS IGM+AQQ YNVGYD+ R + FQR+DCE+L
Sbjct: 385 SMFIQTTENEFCMAMLPGNYFGDDYESFSSIGMLAQQNYNVGYDLNRMRVYFQRIDCELL 444
Query: 409 D 409
D
Sbjct: 445 D 445
>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 446
Score = 310 bits (795), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 180/420 (42%), Positives = 251/420 (59%), Gaps = 26/420 (6%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRS----HNTYQAQILPSNDI 56
LIH S+ P+ PNE R++ I S ARLAY+Q +I +N Y A + PS +
Sbjct: 39 LIHPGSVHHPHYKPNETAKDRMELDIEHSAARLAYIQARIEGSLVYNNDYTASVSPS--L 96
Query: 57 SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYA---Y 113
+ VN+SIGQP +PQ MDTGS +LW+ C PC +C LG +F PS SS+++
Sbjct: 97 TGRTILVNLSIGQPSIPQLVVMDTGSDILWIMCNPCTNCDNHLGLLFDPSMSSTFSPLCK 156
Query: 114 VPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
PC + C+ P +T Y+ S + L F+ TDE T + DV+
Sbjct: 157 TPCGFKGCKCDPIP----------FTISYVDNSSASGTFGRDILVFETTDEGTSQISDVI 206
Query: 174 FGCGFST--NRNFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAI 231
GCG + N + ++GI GL G +SL +Q+ FSYCIG+L DP Y +N+L LG+GA
Sbjct: 207 IGCGHNIGFNSDPGYNGILGLNNGPNSLATQIGRKFSYCIGNLADPYYNYNQLRLGEGAD 266
Query: 232 IDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVK 290
+ EG +TP + G YY+T+E ISV + LDI F+ +G GGV++DSGT +T+LV
Sbjct: 267 L-EGYSTPFEVYHGFYYVTMEGISVGEKRLDIALETFEMKRNGTGGVILDSGTTITYLVD 325
Query: 291 EAYEALRDEV--MIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEK 348
A++ L +EV +++ Q+ + P KLCY+GI+S DL GFP V FHF GA LAL+
Sbjct: 326 SAHKLLYNEVRNLLKWSFRQVIFENAPWKLCYYGIISRDLVGFPVVTFHFVDGADLALDT 385
Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
S F Q R D FCM V+PASI N ++ S+IG++AQQ YNVGYD+ + FQR+DCE+L
Sbjct: 386 GSFFSQ-RDDIFCMTVSPASILNTTISPSVIGLLAQQSYNVGYDLVNQFVYFQRIDCELL 444
>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 447
Score = 306 bits (783), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 176/420 (41%), Positives = 243/420 (57%), Gaps = 25/420 (5%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIR----SHNTYQAQILPSNDI 56
LIH S+ P+ PNE R++ I S ARLA +Q +I S+N Y+A++ PS +
Sbjct: 39 LIHPGSVHHPHYKPNETAKDRMELDIQHSAARLANIQARIEGSLVSNNDYKARVSPS--L 96
Query: 57 SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYA---Y 113
+ NISIGQPP+PQ MDTGS +LWV C PC +C LG +F PS+SS+++
Sbjct: 97 TGRTIMANISIGQPPIPQLVVMDTGSDILWVMCTPCTNCDNDLGLLFDPSKSSTFSPLCK 156
Query: 114 VPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
PCD E CR P +T Y S + + F+ TDE T + DV+
Sbjct: 157 TPCDFEGCRCDPIP----------FTVTYADNSTASGTFGRDTVVFETTDEGTSRISDVL 206
Query: 174 FGCGFSTNRNFK--FSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAI 231
FGCG + + +GI GL G SLV++L FSYCIG+L DP Y +++LILG+GA
Sbjct: 207 FGCGHNIGHDTDPGHNGILGLNNGPDSLVTKLGQKFSYCIGNLADPYYNYHQLILGEGAD 266
Query: 232 IDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVK 290
+ EG +TP + +G YY+T+E ISV + LDI P F+ +++ GGV+ID+G+ +T+LV
Sbjct: 267 L-EGYSTPFEVYNGFYYVTMEGISVGEKRLDIAPETFEMKENRAGGVIIDTGSTITFLVD 325
Query: 291 EAYEALRDEV--MIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEK 348
++ L EV ++ Q P C++G +S DL GFP V FHF GA LAL+
Sbjct: 326 SVHKLLSKEVRNLLGWSFRQATIEKSPWMQCFYGSISRDLVGFPVVTFHFSDGADLALDS 385
Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
S F Q + FCM V P S N SLIG++AQQ YNVGYD+ + FQR+DCE+L
Sbjct: 386 GSFFNQLNDNVFCMTVGPVSSLNIKSKPSLIGLLAQQSYNVGYDLVNQFVYFQRIDCELL 445
>gi|449457263|ref|XP_004146368.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 469
Score = 305 bits (781), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 175/434 (40%), Positives = 253/434 (58%), Gaps = 30/434 (6%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRS----HNTYQAQILPSNDI 56
LIHR+S L P + NE R +R SI R +L+ KI+ N ++ ++P N
Sbjct: 42 LIHRNSYLHPLYDQNETVEDRSKREQTSSIERFDFLESKIKELKSVGNEARSSLIPFN-- 99
Query: 57 SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPC 116
S F VN+SIG PPV Q +DTGSSLLWV C PC +C Q + F P +S S+ + C
Sbjct: 100 RGSGFLVNLSIGSPPVTQLVVVDTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTLGC 159
Query: 117 DSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDE------------ 164
Y +C+ + ++ Y YL G + ++ E L F+ DE
Sbjct: 160 GFPGYNYINGYKCNRF-NQAEYKLRYLGGDSSQGILAKESLLFETLDEGRVFQYNAISTQ 218
Query: 165 -STIHVQDVVFGCG---FSTNRNFKFSGIFGLGI-GRSSLVSQLNSSFSYCIGSLHDPDY 219
S I ++ FGCG TN + ++G+FGLG ++ +QL + FSYCIG +++P Y
Sbjct: 219 ISKIKKSNITFGCGHMNIKTNNDDAYNGVFGLGAYPHITMATQLGNKFSYCIGDINNPLY 278
Query: 220 LHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVM 278
HN L+LG G+ I EGD+TPLQ GHYY+TL++ISV + L I+PN FK D GGV+
Sbjct: 279 THNHLVLGQGSYI-EGDSTPLQIHFGHYYVTLQSISVGSKTLKIDPNAFKISSDGSGGVL 337
Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEG--EQMRSYSWPDKLCYHGIMSSDLKGFPTVRF 336
IDSG T L +E L DE++ ++G E++ + + LC+ G++S DL GFP V F
Sbjct: 338 IDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQRKFEGLCFKGVVSRDLVGFPAVTF 397
Query: 337 HFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
HF GGA L LE S+F Q D FC+A+ P+ N+ +N S+IG++AQQ YNVG+D+ +
Sbjct: 398 HFAGGADLVLESGSLFRQHGGDRFCLAILPS--NSELLNLSVIGILAQQNYNVGFDLEQM 455
Query: 397 QKTFQRMDCEVLDD 410
+ F+R+DC++LD+
Sbjct: 456 KVFFRRIDCQLLDE 469
>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 449
Score = 297 bits (761), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 172/420 (40%), Positives = 240/420 (57%), Gaps = 24/420 (5%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIR----SHNTYQAQILPSNDI 56
LIH S+ P+ PNE R++ I S AR AY+Q +I S+N Y+A++ PS +
Sbjct: 39 LIHPGSVHHPHYKPNETAKDRMELDIQHSAARFAYIQARIEGSLVSNNEYKARVSPS--L 96
Query: 57 SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYA---Y 113
+ NISIGQPP+PQ MDTGS +LWV C PC +C LG +F PS SS+++
Sbjct: 97 TGRTIMANISIGQPPIPQLVVMDTGSDILWVMCTPCTNCDNHLGLLFDPSMSSTFSPLCK 156
Query: 114 VPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
PCD + C +RC +T Y S + + F+ TDE T + DV+
Sbjct: 157 TPCDFKGC-----SRCDPIP----FTVTYADNSTASGMFGRDTVVFETTDEGTSRIPDVL 207
Query: 174 FGCGFSTNRNFK--FSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAI 231
FGCG + ++ +GI GL G SL +++ FSYCIG L DP Y +++LILG+GA
Sbjct: 208 FGCGHNIGQDTDPGHNGILGLNNGPDSLATKIGQKFSYCIGDLADPYYNYHQLILGEGAD 267
Query: 232 IDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVK 290
+ EG +TP + +G YY+T+E ISV + LDI P F+ + + GGV+ID+G+ +T+LV
Sbjct: 268 L-EGYSTPFEVHNGFYYVTMEGISVGEKRLDIAPETFEMKKNRTGGVIIDTGSTITFLVD 326
Query: 291 EAYEALRDEV--MIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEK 348
+ L EV ++ Q P C++G +S DL GFP V FHF GA LAL+
Sbjct: 327 SVHRLLSKEVRNLLGWSFRQTTIEKSPWMQCFYGSISRDLVGFPVVTFHFADGADLALDS 386
Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
S F Q + FCM V P S N SLIG++AQQ Y+VGYD+ + FQR+DCE+L
Sbjct: 387 GSFFNQLNDNVFCMTVGPVSSLNLKSKPSLIGLLAQQSYSVGYDLVNQFVYFQRIDCELL 446
>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 434
Score = 281 bits (718), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 160/357 (44%), Positives = 210/357 (58%), Gaps = 19/357 (5%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
F NISIG PPVPQ +DTGS L W+ C PC+ C PQ F+PSRSS+Y C+S
Sbjct: 88 FLANISIGDPPVPQLLLIDTGSDLTWIQCLPCK-CYPQTIPFFHPSRSSTYRNASCESA- 145
Query: 121 CRYFPYARCSAYKHR----CIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
P+A ++ C Y Y T ++ E+LTF+ +DE I ++VFGC
Sbjct: 146 ----PHAMPQIFRDEKTGNCRYHLRYRDFSNTRGILAKEKLTFQTSDEGLISKPNIVFGC 201
Query: 177 GFSTNRNFKFSGIFGLGIGRSSLVSQ-LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
G + ++SG+ GLG G S+V++ S FSYC GSL DP Y HN LILG+GA I EG
Sbjct: 202 GQDNSGFTQYSGVLGLGPGTFSIVTRNFGSKFSYCFGSLIDPTYPHNFLILGNGARI-EG 260
Query: 236 DATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEA 295
D TPLQ YY+ L+AIS+ ++LDI P IF+R S GG +ID+G T L +EAYE
Sbjct: 261 DPTPLQIFQDRYYLDLQAISLGEKLLDIEPGIFQRYRSKGGTVIDTGCSPTILAREAYET 320
Query: 296 LRDEVMIRLEGEQMRSYSWPD--KLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFY 353
L +E+ L R W CY G + DL GFP V FHF GGA+LAL+ +S+F
Sbjct: 321 LSEEIDFLLGEVLRRVKDWEQYTNHCYEGNLKLDLYGFPVVTFHFAGGAELALDVESLFV 380
Query: 354 QPRP-DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVLD 409
D+FC+A+ N + + S+IG MAQQ YNVGY++ + FQR DCE+LD
Sbjct: 381 SSESGDSFCLAMT----MNTFDDMSVIGAMAQQNYNVGYNLRTMKVYFQRTDCEILD 433
>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 424
Score = 280 bits (716), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 161/358 (44%), Positives = 213/358 (59%), Gaps = 21/358 (5%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
F NISIG PPVPQ +DTGS L W+HC PC+ C PQ F+PSRSS+Y C S
Sbjct: 78 FLANISIGNPPVPQLLLIDTGSDLTWIHCLPCK-CYPQTIPFFHPSRSSTYRNASCVSA- 135
Query: 121 CRYFPYARCSAYKHR----CIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
P+A ++ C Y Y T ++ E+LTF+ +D+ I Q++VFGC
Sbjct: 136 ----PHAMPQIFRDEKTGNCQYHLRYRDFSNTRGILAEEKLTFETSDDGLISKQNIVFGC 191
Query: 177 GFSTNRNFKFSGIFGLGIGRSSLVSQ-LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
G + K+SG+ GLG G S+V++ S FSYC GSL +P Y HN LILG+GA I EG
Sbjct: 192 GQDNSGFTKYSGVLGLGPGTFSIVTRNFGSKFSYCFGSLTNPTYPHNILILGNGAKI-EG 250
Query: 236 DATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEA 295
D TPLQ YY+ L+AIS ++LDI P F+R S GG +ID+G T L +EAYE
Sbjct: 251 DPTPLQIFQDRYYLDLQAISFGEKLLDIEPGTFQRYRSQGGTVIDTGCSPTILAREAYET 310
Query: 296 LRDEVMIRLEGEQMRSYSWPDKL---CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMF 352
L +E+ L GE +R D+ CY G + DL GFP V FHF GGA+LAL+ +S+F
Sbjct: 311 LSEEIDFLL-GEVLRRVKDWDQYTTPCYEGNLKLDLYGFPVVTFHFAGGAELALDVESLF 369
Query: 353 YQPRP-DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVLD 409
D+FC+A+ N + + S+IG MAQQ YNVGY++ + FQR DCE++D
Sbjct: 370 VSSESGDSFCLAMT----MNTFDDMSVIGAMAQQNYNVGYNLRTMKVYFQRTDCEIID 423
>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 273 bits (699), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 161/393 (40%), Positives = 221/393 (56%), Gaps = 21/393 (5%)
Query: 29 SIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVH 88
S+ RL YL K ++ A + P+ I F VNISIG PPV Q MDT S LLW+
Sbjct: 55 SVERLEYL--KAKATGDIIAHLSPNVPIIPQAFLVNISIGSPPVTQLLHMDTASDLLWLQ 112
Query: 89 CYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPET 148
C PC +C Q IF PSRS ++ C + P R +A C Y+ Y+ G +
Sbjct: 113 CRPCINCYAQSLPIFDPSRSYTHRNESCRTSQYS-MPSLRFNAKTRSCEYSMRYMDGTGS 171
Query: 149 SVFVSTEQLTFKNT--DESTIHVQDVVFGCGFST-NRNFKFSGIFGLGIGRSSLVSQLNS 205
++ E L F + S+ + DVVFGCG +GI GLG G SLV + +
Sbjct: 172 KGILAKEMLMFNTIYDESSSAALHDVVFGCGHDNYGEPLVGTGILGLGYGEFSLVHRFGT 231
Query: 206 SFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINP 265
FSYC GSL DP Y HN L+LGD GD TPL+ +G YY+T+EAISVDG +L I+P
Sbjct: 232 KFSYCFGSLDDPSYPHNVLVLGDDGANILGDTTPLEIYNGFYYVTIEAISVDGIILPIDP 291
Query: 266 NIFKRDDSG--GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL----C 319
+F R+ GG +ID+G +T LV+EAY+ L++++ EG + D + C
Sbjct: 292 WVFNRNHQTGLGGTIIDTGNSLTSLVEEAYKPLKNKIEDYFEGRFTAADVNQDDMFKVEC 351
Query: 320 YHGIMSSDL--KGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFS 377
Y+G + DL GFP V FHF GA+L+L+ S+F + P+ FC+AV P ++N+
Sbjct: 352 YNGNLERDLVESGFPIVTFHFSDGAELSLDVKSVFMKLSPNVFCLAVTPGNMNS------ 405
Query: 378 LIGMMAQQFYNVGYDIGRKQKTFQRMDCEVLDD 410
IG AQQ YN+GYD+ K+ +F+R+DC VL D
Sbjct: 406 -IGATAQQSYNIGYDLEAKKISFERIDCGVLFD 437
>gi|255537017|ref|XP_002509575.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549474|gb|EEF50962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 459
Score = 261 bits (666), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 164/434 (37%), Positives = 237/434 (54%), Gaps = 37/434 (8%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQ---------------EKIRSHNT 45
LIHRDSI SP NPN++ R +R + S AR Y+Q + + +
Sbjct: 39 LIHRDSIFSPAYNPNDSIKDRAKRMLKNSNARFDYVQAISKRNSAVVDYDGGDTSAADDA 98
Query: 46 YQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYP 105
Y+A +L F VN SIGQPPVPQ+ MDTGSSL W+ C PC +C Q G ++ P
Sbjct: 99 YEASLLSEL----CTFLVNFSIGQPPVPQYAVMDTGSSLTWIQCEPCINCHQQKGPLYNP 154
Query: 106 SRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES 165
S SS+Y F + + C Y+Q Y T + EQL F+ D+
Sbjct: 155 SSSSTYVSCSDFDRTDTTF----TATHGSDCNYSQTYADKTTTRGTYAREQLLFETPDDG 210
Query: 166 TIHVQDVVFGCGFSTNR----NFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLH 221
+ DV+FGCG + + SG+FGLG SS++S+L FSYCIG++ DP Y
Sbjct: 211 ITIMHDVIFGCGHNNTQLPGPTGYASGVFGLGDSGSSIISKLGFGFSYCIGNIGDPLYGF 270
Query: 222 NKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG---GVM 278
++L LG+ I EG +TPL G YYITL IS+ LDI+P +F+R D G ++
Sbjct: 271 HRLTLGNKLKI-EGYSTPL-VPRGLYYITLVGISIGQERLDIDPIVFQRVDLNGISSRIV 328
Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEG--EQMRSYSWPDKLCYHGIMSSDLKGFPTVRF 336
IDSG ++++ ++AY +RD+V L G + R + LCY G ++ DL+GFP F
Sbjct: 329 IDSGATLSYIPRQAYNVVRDKVSSILSGFLSRYRYIARHLSLCYIGKLNQDLQGFPDATF 388
Query: 337 HFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
H GA L + + +F+Q + C+A+ P + LIG++AQQ+YNV YD+ ++
Sbjct: 389 HLADGADLVFQVEGLFFQYTDNVLCLALVPTESDEETC---LIGLLAQQYYNVAYDLKQQ 445
Query: 397 QKTFQRMDCEVLDD 410
+ FQR++CE+LDD
Sbjct: 446 KLYFQRIECELLDD 459
>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 427
Score = 253 bits (647), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 151/383 (39%), Positives = 208/383 (54%), Gaps = 21/383 (5%)
Query: 29 SIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVH 88
S+ RL YL K ++ A + P+ I F VNISIG PP+ Q MDT S LLW+
Sbjct: 55 SVERLEYL--KAKTTGDIIAHLSPNVPIIPQAFLVNISIGSPPITQLLHMDTASDLLWIQ 112
Query: 89 CYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPET 148
C PC +C Q IF PSRS ++ C + P + +A C Y+ Y+ +
Sbjct: 113 CLPCINCYAQSLPIFDPSRSYTHRNETCRTSQYS-MPSLKFNANTRSCEYSMRYVDDTGS 171
Query: 149 SVFVSTEQLTFKNT--DESTIHVQDVVFGCGFST-NRNFKFSGIFGLGIGRSSLVSQLNS 205
++ E L F + S+ + DVVFGCG +GI GLG G SLV +
Sbjct: 172 KGILAREMLLFNTIYDESSSAALHDVVFGCGHDNYGEPLVGTGILGLGYGEFSLVHRFGK 231
Query: 206 SFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINP 265
FSYC GSL DP Y HN L+LGD GD TPL+ +G YY+T+EAISVDG +L I+P
Sbjct: 232 KFSYCFGSLDDPSYPHNVLVLGDDGANILGDTTPLEIHNGFYYVTIEAISVDGIILPIDP 291
Query: 266 NIFKRDDSG--GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL----C 319
+F R+ GG +ID+G +T LV+EAY+ L++ + EG + D + C
Sbjct: 292 RVFNRNHQTGLGGTIIDTGNSLTSLVEEAYKPLKNRIEDIFEGRFTAADVSQDDMIKMEC 351
Query: 320 YHGIMSSDL--KGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFS 377
Y+G DL GFP V FHF GA+L+L+ S+F + P+ FC+AV P ++N+
Sbjct: 352 YNGNFERDLVESGFPIVTFHFSEGAELSLDVKSLFMKLSPNVFCLAVTPGNLNS------ 405
Query: 378 LIGMMAQQFYNVGYDIGRKQKTF 400
IG AQQ YN+GYD+ + +F
Sbjct: 406 -IGATAQQSYNIGYDLEAMEVSF 427
>gi|356558304|ref|XP_003547447.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like [Glycine max]
Length = 336
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 140/360 (38%), Positives = 208/360 (57%), Gaps = 32/360 (8%)
Query: 55 DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYA-- 112
D N ++ +SIGQPP+PQ MDT S +LW+ C +G +F PS+SS+++
Sbjct: 3 DERNKPYWSILSIGQPPIPQLVIMDTSSDILWIMC-------NHVGLLFDPSKSSTFSPL 55
Query: 113 -YVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
PC + C+ P +Y + TS ++ + F+ TDE + D
Sbjct: 56 CKTPCGFKGCKCDPIPFNISYVDK----------SSTSGTFGSDTVVFETTDEGHSQIFD 105
Query: 172 VVFGCGFST--NRNFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDG 229
V+ CG + N + ++GI GL G +SL +++ FSYC+G+L DP Y +N+LIL +G
Sbjct: 106 VLVRCGHNIGFNTDPGYNGIRGLNNGPNSLATKIGQKFSYCVGNLADPYYNYNQLILCEG 165
Query: 230 AIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWL 288
A + EG +TP + G YY+TL+ I V + LDI P F+ + ++ GGV+ DSGT +T+L
Sbjct: 166 ADL-EGYSTPFEVHHGFYYVTLKGIIVGEKRLDIAPITFEIKGNNTGGVIRDSGTTITYL 224
Query: 289 VKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEK 348
V ++ L +EV + S+S+ +LC++GI+S DL GFP V FHF GA LAL+
Sbjct: 225 VDSVHKLLYNEV------RNLLSWSF-RQLCHYGIISRDLVGFPVVTFHFADGADLALDT 277
Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
S F Q CM V+PASI N ++ S+I ++AQQ YNVGYD+ FQR+DCE+L
Sbjct: 278 GSFFNQLN-SILCMTVSPASILNTTISPSVIELLAQQSYNVGYDLLTNFVYFQRIDCELL 336
>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 447
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 142/427 (33%), Positives = 218/427 (51%), Gaps = 36/427 (8%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
LI RDS LSP+ NP+E R+Q+ + SI+R + + S N+ Q+ ++ +N
Sbjct: 39 LISRDSPLSPFYNPSETQFDRLQKAFHRSISRANHFRANGVSTNSIQSPVISNN----GE 94
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ +NIS+G PPV DTGS LLW C PC C Q+ IF P++S +Y + C+ +
Sbjct: 95 YLMNISLGTPPVSMHGIADTGSDLLWRQCKPCDSCYEQIEPIFDPAKSKTYQILSCEGKS 154
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C + + CIY+ Y G TS ++ + LT +T + V VVFGCG +
Sbjct: 155 CSNLGGQGGCSDDNTCIYSYSYGDGSHTSGDLAVDTLTIGSTTGRPVSVPKVVFGCGHNN 214
Query: 181 NRNFK--FSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
F+ SG+ GLG G S++SQL FSYC+ L + + +K+ G I+
Sbjct: 215 GGTFELHGSGLVGLGGGPLSMISQLRPLIGGRFSYCLVPLGNDPSVSSKMHFGSRGIVSG 274
Query: 235 GDA--TPL--QFIDGHYYITLEAISVDGRML------DINPNIFKRDDSGGGVMIDSGTD 284
A TPL + D YY+TLE++SV + L + + D+ G ++IDSGT
Sbjct: 275 AGAVSTPLASRQPDTFYYLTLESMSVGSKKLAYKGFSKVGSPLADADE--GNIIIDSGTT 332
Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF--PTVRFHFRGGA 342
+T L ++ Y L V+ + G+ +R + LCY S+L G PT+ HF GA
Sbjct: 333 LTLLPQDFYGTLESNVVSAIGGKPVRDPNNVFSLCY-----SNLSGLRIPTITAHFV-GA 386
Query: 343 KLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
L L+ + F Q + D FC A+ P S + ++ G +AQ + VGYD+ + +F+
Sbjct: 387 DLELKPLNTFVQVQEDLFCFAMIPVS------DLAIFGNLAQMNFLVGYDLKSRTVSFKP 440
Query: 403 MDCEVLD 409
DC +D
Sbjct: 441 TDCTKID 447
>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 439
Score = 189 bits (479), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 128/415 (30%), Positives = 208/415 (50%), Gaps = 24/415 (5%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
LIHRDS SP+ +P++ A R+ S++R+ + + + Q++I+PS
Sbjct: 36 LIHRDSPHSPFFDPSKTQAERLTDAFRRSVSRVGRFRPTAMTSDGIQSRIVPSA----GE 91
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ +N+ IG PPVP +DTGS L W C PC C Q+ +F P SS+Y C +
Sbjct: 92 YLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPLFDPKNSSTYRDSSCGTSF 151
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C R + + +C + Y G T +++E LT +T + FGCG S+
Sbjct: 152 CLALGKDRSCSKEKKCTFRYSYADGSFTGGNLASETLTVDSTAGKPVSFPGFAFGCGHSS 211
Query: 181 NRNF--KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILG-DGAIID 233
F SGI GLG G SL+SQL S+ FSYC+ + + +++ G G +
Sbjct: 212 GGIFDKSSSGIVGLGGGELSLISQLKSTINGLFSYCLLPVSTDSSISSRINFGASGRVSG 271
Query: 234 EGD-ATPL--QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVK 290
G +TPL + D YY+TLE ISV + L K + G +++DSGT T+L +
Sbjct: 272 YGTVSTPLVQKSPDTFYYLTLEGISVGKKRLPYKGYSKKTEVEEGNIIVDSGTTYTFLPQ 331
Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS 350
E Y L V ++G+++R + LCY+ ++++ P + HF+ A + L+ +
Sbjct: 332 EFYSKLEKSVANSIKGKRVRDPNGIFSLCYN--TTAEINA-PIITAHFK-DANVELQPLN 387
Query: 351 MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
F + + D C V P S + ++G +AQ + VG+D+ +K+ +F+ DC
Sbjct: 388 TFMRMQEDLVCFTVAPTS------DIGVLGNLAQVNFLVGFDLRKKRVSFKAADC 436
>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 440
Score = 187 bits (475), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 145/426 (34%), Positives = 212/426 (49%), Gaps = 38/426 (8%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAY---LQEKIRSHNTYQAQILPSNDIS 57
LIHRDS SP+ NP E + R++ I+ S++R+ + + +K S N Q + S
Sbjct: 35 LIHRDSPKSPFYNPTETSSQRLRNAIHRSVSRVFHFTDISQKDASDNAPQIDL-----TS 89
Query: 58 NSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPC 116
NS Y+ NIS+G PP P DTGS LLW C PC DC Q+ +F P SS+Y V C
Sbjct: 90 NSGEYLMNISLGTPPFPIMAIADTGSDLLWTQCKPCDDCYTQVDPLFDPKASSTYKDVSC 149
Query: 117 DSEHCRYFP-YARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
S C A CS + C Y+ Y T ++ + LT +TD + +++++ G
Sbjct: 150 SSSQCTALENQASCSTEDNTCSYSTSYGDRSYTKGNIAVDTLTLGSTDTRPVQLKNIIIG 209
Query: 176 CGFSTNRNF--KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDG 229
CG + F K SGI GLG G SL++QL S FSYC+ L + +K+ G
Sbjct: 210 CGHNNAGTFNKKGSGIVGLGGGAVSLITQLGDSIDGKFSYCLVPLTSENDRTSKINFGTN 269
Query: 230 AIIDEGD--ATPL--QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG---VMIDSG 282
A++ +TPL + + YY+TL++ISV + + + DSG G ++IDSG
Sbjct: 270 AVVSGTGVVSTPLIAKSQETFYYLTLKSISVGSKEVQ-----YPGSDSGSGEGNIIIDSG 324
Query: 283 TDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGA 342
T +T L E Y L D V ++ E+ + LCY + DLK P + HF GA
Sbjct: 325 TTLTLLPTEFYSELEDAVASSIDAEKKQDPQTGLSLCYSA--TGDLK-VPAITMHFD-GA 380
Query: 343 KLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
+ L+ + F Q D C A + +FS+ G +AQ + VGYD K +F+
Sbjct: 381 DVNLKPSNCFVQISEDLVCFAFRGSP------SFSIYGNVAQMNFLVGYDTVSKTVSFKP 434
Query: 403 MDCEVL 408
DC +
Sbjct: 435 TDCAKM 440
>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 439
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 139/423 (32%), Positives = 206/423 (48%), Gaps = 29/423 (6%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIR-SHNTY-QAQILPSNDISN 58
LIH DS SP+ N E R+ + SI R YL SHN + I+P +
Sbjct: 31 LIHPDSSRSPFYNIRETQLQRISNVVTHSIKRAHYLNHVFSLSHNDLPKPTIIP---YAG 87
Query: 59 SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
S + ++ SIG PP + +DTGS +W C PC+ C Q IF PS+SS+Y + C S
Sbjct: 88 SYYVMSYSIGTPPFQLYGVVDTGSDGIWFQCKPCKPCLNQTSPIFNPSKSSTYKNIRCSS 147
Query: 119 EHCRYFPYARCSA-YKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
C+ RCS+ K +C Y YL + +S + LT + D S I +V GCG
Sbjct: 148 PICKRGEKTRCSSNRKRKCEYEITYLDRSGSQGDISKDTLTLNSNDGSPISFPKIVIGCG 207
Query: 178 FSTNRNFK--FSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAI 231
+ + SGI G G G S+VSQL SS FSYC+ SL + +KL GD A+
Sbjct: 208 HKNSLTTEGLASGIIGFGRGNFSIVSQLGSSIGGKFSYCLASLFSKANISSKLYFGDMAV 267
Query: 232 IDEGD--ATPL--QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
+ +TPL F G+Y+ LEA SV ++ + + D+ G V IDSG+ +T
Sbjct: 268 VSGHGVVSTPLIQSFYVGNYFTNLEAFSVGDHIIKLKDSSLIPDNEGNAV-IDSGSTITQ 326
Query: 288 LVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF--PTVRFHFRGGAKLA 345
L + Y L V+ ++ ++++ + LCY + LK + P + HFR GA +
Sbjct: 327 LPNDVYSQLETAVISMVKLKRVKDPTQQLSLCYK----TTLKKYEVPIITAHFR-GADVK 381
Query: 346 LEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
L + F Q + C A N ++ + + G +AQQ + VGYD + +F+ +C
Sbjct: 382 LNAFNTFIQMNHEVMCFAFNSSAF-----PWVVYGNIAQQNFLVGYDTLKNIISFKPTNC 436
Query: 406 EVL 408
L
Sbjct: 437 TKL 439
>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 434
Score = 185 bits (470), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 134/420 (31%), Positives = 204/420 (48%), Gaps = 36/420 (8%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
+IHRDS SP+ P E RV ++ S+ R + ++H +A I ND
Sbjct: 33 MIHRDSSRSPFFRPTETQFQRVANAVHRSVNRANHFH---KAHKAAKATI-TQND---GE 85
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ ++ S+G PP + +DTGS ++W+ C PC C Q IF PS+S++Y +P S
Sbjct: 86 YLISYSVGIPPFQLYGIIDTGSDMIWLQCKPCEKCYNQTTRIFDPSKSNTYKILPFSSTT 145
Query: 121 CRYFPYARCSAYKHR-CIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
C+ CS+ + C YT Y G + +S E LT +T+ S++ + V GCG +
Sbjct: 146 CQSVEDTSCSSDNRKMCEYTIYYGDGSYSQGDLSVETLTLGSTNGSSVKFRRTVIGCGRN 205
Query: 180 TNRNF--KFSGIFGLGIGRSSLVSQL-------NSSFSYCIGSLHDPDYLHNKLILGDGA 230
+F K SGI GLG G SL++QL FSYC+ S+ + + +KL GD A
Sbjct: 206 NTVSFEGKSSGIVGLGNGPVSLINQLRRRSSSIGRKFSYCLASMSN---ISSKLNFGDAA 262
Query: 231 IIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDV 285
++ GD T I H YY+TLEA SV ++ + F+ + G ++IDSGT +
Sbjct: 263 VV-SGDGTVSTPIVTHDPKVFYYLTLEAFSVGNNRIEFTSSSFRFGEK-GNIIIDSGTTL 320
Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLA 345
T L + Y L V +E ++++ LCY D P + HF GA +
Sbjct: 321 TLLPNDIYSKLESAVADLVELDRVKDPLKQLSLCYRSTF--DELNAPVIMAHF-SGADVK 377
Query: 346 LEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
L + F + C+A + I + G MAQQ + VGYD+ +K +F+ DC
Sbjct: 378 LNAVNTFIEVEQGVTCLAFISSKIG------PIFGNMAQQNFLVGYDLQKKIVSFKPTDC 431
>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 436
Score = 184 bits (467), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 140/424 (33%), Positives = 211/424 (49%), Gaps = 41/424 (9%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
LIHRDS+ SP P +N SI R + K N Q+ ++P DI L
Sbjct: 32 LIHRDSLKSPLYKPTQNKYQYFVDAARRSINRANHFY-KYSLANIPQSTVIP--DIGEYL 88
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ S+G PP + +DTGS ++W+ C PC++C Q +F PS+SSSY +PC S+
Sbjct: 89 --MTYSVGTPPFKLYGIVDTGSDIVWLQCEPCQECYNQTTPMFNPSKSSSYKNIPCPSKL 146
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C+ C+ K+ C Y+ Y + +S + LT ++T+ T+ ++V GCG T
Sbjct: 147 CQSMEDTSCND-KNYCEYSTYYGDNSHSGGDLSVDTLTLESTNGLTVSFPNIVIGCG--T 203
Query: 181 NRNFKF----SGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLH----NKLILGD 228
N + SGI G G G +S ++QL SS FSYC+ L + +KL GD
Sbjct: 204 NNILSYEGASSGIVGFGSGPASFITQLGSSTGGKFSYCLTPLFSVTNIQSNATSKLNFGD 263
Query: 229 GAIIDEGDA---TPLQFIDGH--YYITLEAISVDGRMLDIN--PNIFKRDDSGGGVMIDS 281
A + GD TP+ D YY+TLEA SV R ++I PN D+ G ++IDS
Sbjct: 264 AATV-SGDGVVTTPILKKDPETFYYLTLEAFSVGNRRVEIGGVPN----GDNEGNIIIDS 318
Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
GT +T L K+ Y L V+ ++ E++ + LCY + ++ FP + HF+ G
Sbjct: 319 GTTLTSLTKDDYSFLESAVVDLVKLERVDDPTQTLNLCYS--VKAEGYDFPIITMHFK-G 375
Query: 342 AKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQ 401
A + L S F FC+A + + ++ G +AQQ VGYD+ +K +F+
Sbjct: 376 ADVDLHPISTFVSVADGVFCLAFESSQ------DHAIFGNLAQQNLMVGYDLQQKIVSFK 429
Query: 402 RMDC 405
DC
Sbjct: 430 PSDC 433
>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 437
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 144/421 (34%), Positives = 205/421 (48%), Gaps = 31/421 (7%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQI-LPSNDISNS 59
LIHRDS SP+ NP E + R++ I+ S+ R+ + EK NT Q QI L SN +
Sbjct: 35 LIHRDSPKSPFYNPMETSSQRLRNAIHRSVNRVFHFTEK---DNTPQPQIDLTSN---SG 88
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSE 119
+ +N+SIG PP P DTGS LLW C PC DC Q+ +F P SS+Y V C S
Sbjct: 89 EYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCSSS 148
Query: 120 HCRYFP-YARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
C A CS + C Y+ Y T ++ + LT ++D + +++++ GCG
Sbjct: 149 QCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIGCGH 208
Query: 179 STNRNF--KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAII 232
+ F K SGI GLG G SL+ QL S FSYC+ L +K+ G AI+
Sbjct: 209 NNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFGTNAIV 268
Query: 233 DEGDATPLQFI-----DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
I + YY+TL++ISV + I + + S G ++IDSGT +T
Sbjct: 269 SGSGVVSTPLIAKASQETFYYLTLKSISVGSKQ--IQYSGSDSESSEGNIIIDSGTTLTL 326
Query: 288 LVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALE 347
L E Y L D V ++ E+ + LCY + DLK P + HF GA + L+
Sbjct: 327 LPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSA--TGDLK-VPVITMHFD-GADVKLD 382
Query: 348 KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
+ F Q D C A + +FS+ G +AQ + VGYD K +F+ DC
Sbjct: 383 SSNAFVQVSEDLVCFAFRGSP------SFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCAK 436
Query: 408 L 408
+
Sbjct: 437 M 437
>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
Length = 438
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 144/421 (34%), Positives = 205/421 (48%), Gaps = 31/421 (7%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQI-LPSNDISNS 59
LIHRDS SP+ NP E + R++ I+ S+ R+ + EK NT Q QI L SN +
Sbjct: 35 LIHRDSPKSPFYNPMETSSQRLRNAIHRSVNRVFHFTEK---DNTPQPQIDLTSN---SG 88
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSE 119
+ +N+SIG PP P DTGS LLW C PC DC Q+ +F P SS+Y V C S
Sbjct: 89 EYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCSSS 148
Query: 120 HCRYFP-YARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
C A CS + C Y+ Y T ++ + LT ++D + +++++ GCG
Sbjct: 149 QCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIGCGH 208
Query: 179 STNRNF--KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAII 232
+ F K SGI GLG G SL+ QL S FSYC+ L +K+ G AI+
Sbjct: 209 NNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFGTNAIV 268
Query: 233 DEGDATPLQFI-----DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
I + YY+TL++ISV + I + + S G ++IDSGT +T
Sbjct: 269 SGSGVVSTPLIAKASQETFYYLTLKSISVGSKQ--IQYSGSDSESSEGNIIIDSGTTLTL 326
Query: 288 LVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALE 347
L E Y L D V ++ E+ + LCY + DLK P + HF GA + L+
Sbjct: 327 LPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSA--TGDLK-VPVITMHFD-GADVKLD 382
Query: 348 KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
+ F Q D C A + +FS+ G +AQ + VGYD K +F+ DC
Sbjct: 383 SSNAFVQVSEDLVCFAFRGSP------SFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCAK 436
Query: 408 L 408
+
Sbjct: 437 M 437
>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
Length = 460
Score = 181 bits (459), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 136/420 (32%), Positives = 196/420 (46%), Gaps = 30/420 (7%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
L+ DS LSP++ N + R +R I S RL LQ + +A + N
Sbjct: 59 LVRTDSPLSPFSPGNISSTERFKRAIKRSQDRLEKLQMSVDEVKAVEAPVY----AGNGE 114
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
F + ++IG P + +DTGS L W C PC DC PQ I+ PS+SS+Y+ VPC S
Sbjct: 115 FLMKMAIGTPSLSFSAILDTGSDLTWTQCKPCTDCYPQPTPIYDPSQSSTYSKVPCSSSM 174
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C+ P CS C Y LY G ++S +F T +S H + FGCG
Sbjct: 175 CQALPMYSCSG--ANCEY--LYSYGDQSSTQGILSYESFTLTSQSLPH---IAFGCGQEN 227
Query: 181 NRNFKFSGIFGLGIGRS--SLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDE 234
G +G GR SL+SQL S FSYC+ S+ D + L +G A ++
Sbjct: 228 EGGGFSQGGGLVGFGRGPLSLISQLGQSLGNKFSYCLVSITDSPSKTSPLFIGKTASLNA 287
Query: 235 GDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRD-DSGGGVMIDSGTDVTWL 288
+ + YY++LE ISV G++LDI F D GGV+IDSGT VT+L
Sbjct: 288 KTVSSTPLVQSRSRPTFYYLSLEGISVGGQLLDIADGTFDLQLDGTGGVIIDSGTTVTYL 347
Query: 289 VKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEK 348
+ Y+ ++ V+ + Q+ + LC+ S FPT+ FHF GA L K
Sbjct: 348 EQSGYDVVKKAVISSINLPQVDGSNIGLDLCFEPQSGSSTSHFPTITFHFE-GADFNLPK 406
Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
++ Y C+A+ P++ S+ G + QQ Y + YD R +F C+ L
Sbjct: 407 ENYIYTDSSGIACLAMLPSN------GMSIFGNIQQQNYQILYDNERNVLSFAPTVCDTL 460
>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 433
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 133/419 (31%), Positives = 201/419 (47%), Gaps = 35/419 (8%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
+IHRDS SP+ +P E RV ++ SI R +L + S N+ P + ++L
Sbjct: 33 MIHRDSSRSPFFSPTETQFQRVANAVHRSINRANHLNQSFVSPNS------PETTVISAL 86
Query: 61 --FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
+ ++ S+G P + F +DTGS ++W+ C PC+ C Q IF S+S +Y +PC S
Sbjct: 87 GEYLISYSVGTPSLQVFGILDTGSDIIWLQCQPCKKCYEQTTPIFDSSKSQTYKTLPCPS 146
Query: 119 EHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
C+ CS+ KH C+Y+ Y+ G ++ +S E LT +T+ S + V GCG
Sbjct: 147 NTCQSVQGTFCSSRKH-CLYSIHYVDGSQSLGDLSVETLTLGSTNGSPVQFPGTVIGCGR 205
Query: 179 --STNRNFKFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAII 232
+ K SGI GLG G SL++QL+ S FSYC+ + +KL G+ A++
Sbjct: 206 YNAIGIEEKNSGIVGLGRGPMSLITQLSPSTGGKFSYCL--VPGLSTASSKLNFGNAAVV 263
Query: 233 DEGD--ATPLQFIDG--HYYITLEAISVDGRMLDINPNIFKRDDSG--GGVMIDSGTDVT 286
+TPL +G Y++TLEA SV ++ F SG G ++IDSGT +T
Sbjct: 264 SGRGTVSTPLFSKNGLVFYFLTLEAFSVGRNRIE-----FGSPGSGGKGNIIIDSGTTLT 318
Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
L Y L V + +++R + LCY P + HF GA + L
Sbjct: 319 ALPNGVYSKLEAAVAKTVILQRVRDPNQVLGLCYKVTPDKLDASVPVITAHFS-GADVTL 377
Query: 347 EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ F Q D C A P ++ G +AQQ VGYD+ +F+ DC
Sbjct: 378 NAINTFVQVADDVVCFAFQPTETG------AVFGNLAQQNLLVGYDLQMNTVSFKHTDC 430
>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 439
Score = 178 bits (451), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 131/416 (31%), Positives = 196/416 (47%), Gaps = 25/416 (6%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
+IHRDS SP P E P RV + SI R + ++ S ++ ++ ++ S
Sbjct: 35 MIHRDSSRSPLYRPTETPFQRVANAVRRSINRGNHFKKAFVSTDSAESTVVASQ----GE 90
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ + S+G PP +DTGS +LW+ C PC DC Q IF PS+S +Y +PC S
Sbjct: 91 YLMRYSVGSPPFQVLGIVDTGSDILWLQCEPCEDCYKQTTPIFDPSKSKTYKTLPCSSNT 150
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C CS+ + C Y+ Y G + +S E LT +TD S++H V GCG +
Sbjct: 151 CESLRNTACSS-DNVCEYSIDYGDGSHSDGDLSVETLTLGSTDGSSVHFPKTVIGCGHNN 209
Query: 181 NRNFKFSGIFGLGIGRSSL------VSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
F+ G +G+G + S + FSYC+ + +KL GD A++
Sbjct: 210 GGTFQEEGSGIVGLGGGPVSLISQLSSSIGGKFSYCLAPIFSESNSSSKLNFGDAAVVSG 269
Query: 235 GD--ATPLQFIDGH--YYITLEAISV-DGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLV 289
+TPL ++G Y++TLEA SV D R+ + G ++IDSGT +T L
Sbjct: 270 RGTVSTPLDPLNGQVFYFLTLEAFSVGDNRIEFSGSSSSGSGSGDGNIIIDSGTTLTLLP 329
Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKD 349
+E Y L V ++ E+ R S LCY +SD P + HF+ GA + L
Sbjct: 330 QEDYLNLESAVSDVIKLERARDPSKLLSLCYK--TTSDELDLPVITAHFK-GADVELNPI 386
Query: 350 SMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
S F C A + I ++ G +AQQ VGYD+ +K +F+ DC
Sbjct: 387 STFVPVEKGVVCFAFISSKIG------AIFGNLAQQNLLVGYDLVKKTVSFKPTDC 436
>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 441
Score = 177 bits (450), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 123/418 (29%), Positives = 200/418 (47%), Gaps = 24/418 (5%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
LIHRDS SP+ +P++ R+ + S +R+ ++ + + Q++++PS
Sbjct: 36 LIHRDSPHSPFFDPSKTRTERLTDAFHRSASRVGRFRQSAMTSDGIQSRLVPSA----GE 91
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ +N+SIG PPVP +DTGS L W C PC C Q+ F P SS+Y C +
Sbjct: 92 YIMNLSIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPFFDPKNSSTYRDSSCGTSF 151
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C R +C + Y G T ++ E LT +T + FGC +
Sbjct: 152 CLALGNDRSCRNGKKCTFMYSYADGSFTGGNLAVETLTVASTAGKPVSFPGFAFGCVHRS 211
Query: 181 NRNF--KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDE 234
F SGI GLG+ S++SQL S+ FSYC+ + + +++ G I+
Sbjct: 212 GGIFDEHSSGIVGLGVAELSMISQLKSTINGRFSYCLLPVFTDSSMSSRINFGRSGIVSG 271
Query: 235 GD--ATPLQFI--DGHYY-ITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLV 289
+TPL D +YY ITLE SV + L K + G +++DSGT T+L
Sbjct: 272 AGTVSTPLVMKGPDTYYYLITLEGFSVGKKRLSYKGFSKKAEVEEGNIIVDSGTTYTYLP 331
Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKD 349
E Y L + V ++G+++R + LCY+ + D P + HF+ A + L+
Sbjct: 332 LEFYVKLEESVAHSIKGKRVRDPNGISSLCYN--TTVDQIDAPIITAHFK-DANVELQPW 388
Query: 350 SMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
+ F + + D C V P S + ++G +AQ + VG+D+ +K+ +F+ DC +
Sbjct: 389 NTFLRMQEDLVCFTVLPTS------DIGILGNLAQVNFLVGFDLRKKRVSFKAADCTL 440
>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 435
Score = 177 bits (449), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 130/420 (30%), Positives = 201/420 (47%), Gaps = 35/420 (8%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
LIHRDS SP+ NP+ P+ R+ S++RL + + + ++ ++P
Sbjct: 33 LIHRDSPSSPFYNPSLTPSERIINAALRSMSRLQRVSHFLDENKLPESLLIPDKGEYLMR 92
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
FY IG PPV + +DTGSSL+W+ C PC +C PQ +F P +SS+Y Y CDS+
Sbjct: 93 FY----IGSPPVERLAMVDTGSSLIWLQCSPCHNCFPQETPLFEPLKSSTYKYATCDSQP 148
Query: 121 CRYF-PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES-TIHVQDVVFGCGF 178
C P R +CIY +Y + + TE L+F +T + T+ + +FGCG
Sbjct: 149 CTLLQPSQRDCGKLGQCIYGIMYGDKSFSVGILGTETLSFGSTGGAQTVSFPNTIFGCGV 208
Query: 179 STNRNF----KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILGDGA 230
N K GI GLG G SLVSQL + FSYC+ L +KL G A
Sbjct: 209 DNNFTIYTSNKVMGIAGLGAGPLSLVSQLGAQIGHKFSYCL--LPYDSTSTSKLKFGSEA 266
Query: 231 IIDEGD--ATPLQF---IDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDV 285
II +TPL + +Y++ LEA+++ +++ + G ++IDSGT +
Sbjct: 267 IITTNGVVSTPLIIKPSLPTYYFLNLEAVTIGQKVVSTG-------QTDGNIVIDSGTPL 319
Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLA 345
T+L Y + L + ++ P K C+ ++L P + F F G +
Sbjct: 320 TYLENTFYNNFVASLQETLGVKLLQDLPSPLKTCFPN--RANL-AIPDIAFQFTGASVAL 376
Query: 346 LEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
K+ + + C+AV P+S + SL G +AQ + V YD+ K+ +F DC
Sbjct: 377 RPKNVLIPLTDSNILCLAVVPSS----GIGISLFGSIAQYDFQVEYDLEGKKVSFAPTDC 432
>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 440
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 127/417 (30%), Positives = 196/417 (47%), Gaps = 31/417 (7%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
LIHRDS SP NP+E PA R+ R R E S NT + P +N
Sbjct: 39 LIHRDSPKSPLYNPSETPAERLDRFFR----RFMSFSEASISPNTPE----PPVSSNNGE 90
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ + ISIG PP + DTGS L+W C PC C Q +F PS+S+S+ V C+S+
Sbjct: 91 YLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPSKSTSFKEVSCESQQ 150
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
CR CS + C ++ Y G ++TE LT + + ++VFGCG +
Sbjct: 151 CRLLDTVSCSQPQKLCDFSYGYGDGSLAQGVIATETLTLNSNSGQPTSILNIVFGCGHNN 210
Query: 181 NRNFKFS--GIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAII 232
+ F + G+FG G SL SQ+ S+ FS C+ + +K+I G A +
Sbjct: 211 SGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKIIFGPEAEV 270
Query: 233 DEGD--ATPLQFID--GHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
D +TPL D +Y++TL+ ISV ++ + + + G V ID+GT T L
Sbjct: 271 SGSDVVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSS--SPMATKGNVFIDAGTPPTLL 328
Query: 289 VKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEK 348
++ Y L V + E ++ +LCY S+ L P + HF GA + L+
Sbjct: 329 PRDFYNRLVQGVKEAIPMEPVQDPDLQPQLCYR---SATLIDGPILTAHFD-GADVQLKP 384
Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ F P+ +C A+ P + + G Q + +G+D+ K+ +F+ +DC
Sbjct: 385 LNTFISPKEGVYCFAMQPIDGDT-----GIFGNFVQMNFLIGFDLDGKKVSFKAVDC 436
>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 435
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 135/421 (32%), Positives = 206/421 (48%), Gaps = 39/421 (9%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAY---LQEKIRSHNTYQAQILPSNDIS 57
LIHRDS SP+ NP E P+ R++ I+ S R+++ L E S N+ Q I P
Sbjct: 35 LIHRDSPKSPFYNPAETPSQRIRNAIHRSFNRVSHFTDLSEMDASLNSPQTDITPCG--- 91
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
+ +N+S+G PP P DTGS+L+W C PC DC Q+ +F P SS+Y V C
Sbjct: 92 -GEYLMNLSLGTPPSPIMAVADTGSNLIWTQCKPCDDCYTQVDPLFDPKASSTYKDVSCS 150
Query: 118 SEHCRYFP-YARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
S C A CS C Y Y G T + + LT +TD + +++++ GC
Sbjct: 151 SSQCTALENQASCSTEDKTCSYLVSYADGSYTMGKFAVDTLTLGSTDNRPVQLKNIIIGC 210
Query: 177 GFSTNRNF--KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDP----DYLHNKLIL 226
G + F K SG+ GLG G SL+ QL S FSYC+ +D ++ N ++
Sbjct: 211 GQNNAVTFRNKSSGVVGLGGGAVSLIKQLGDSIDGKFSYCLVPENDQTSKINFGTNAVVS 270
Query: 227 GDGAIIDEGDATPL--QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
G G + +TPL + D YY+TL++ISV + + + K G ++IDSGT
Sbjct: 271 GPGTV-----STPLVVKSRDTFYYLTLKSISVGSKNMQTPDSNIK-----GNMVIDSGTT 320
Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKL 344
+T L + Y + + V + ++ + LCY+ ++DL P + HF GA +
Sbjct: 321 LTLLPVKYYIEIENAVASLINADKSKDERIGSSLCYNA--TADL-NIPVITMHFE-GADV 376
Query: 345 ALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
L + F++ D C+A + N + G +AQ+ + VGYD K +F+ D
Sbjct: 377 KLYPYNSFFKVTEDLVCLAFGMSFYRN-----GIYGNVAQKNFLVGYDTASKTMSFKPTD 431
Query: 405 C 405
C
Sbjct: 432 C 432
>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
Length = 440
Score = 175 bits (443), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 126/417 (30%), Positives = 195/417 (46%), Gaps = 31/417 (7%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
LIHRDS SP NP+E PA R+ R R E S NT + P +N
Sbjct: 39 LIHRDSPKSPLYNPSETPAERLDRFFR----RFMSFSEASISPNTPE----PPVSSNNGE 90
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ + ISIG PP + DTGS L+W C PC C Q +F PS+S+S+ V C+S+
Sbjct: 91 YLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPSKSTSFKEVSCESQQ 150
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
CR CS + C ++ Y G ++TE LT + + ++VFGCG +
Sbjct: 151 CRLLDTVSCSQPQKLCDFSYGYGDGSLAQGVIATETLTLNSNSGQPXSIXNIVFGCGHNN 210
Query: 181 NRNFKFS--GIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAII 232
+ F + G+FG G SL SQ+ S+ FS C+ + +K+I G A +
Sbjct: 211 SGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKIIFGPEAEV 270
Query: 233 DEGD--ATPLQFID--GHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
+TPL D +Y++TL+ ISV ++ + + + G V ID+GT T L
Sbjct: 271 SGSXVVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSS--SPMATKGNVFIDAGTPPTLL 328
Query: 289 VKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEK 348
++ Y L V + E ++ +LCY S+ L P + HF GA + L+
Sbjct: 329 PRDFYNRLVQGVKEAIPMEPVQDPDLQPQLCYR---SATLIDGPILTAHFD-GADVQLKP 384
Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ F P+ +C A+ P + + G Q + +G+D+ K+ +F+ +DC
Sbjct: 385 LNTFISPKEGVYCFAMQPIDGDT-----GIFGNFVQMNFLIGFDLDGKKVSFKAVDC 436
>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
Length = 443
Score = 175 bits (443), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 133/417 (31%), Positives = 205/417 (49%), Gaps = 26/417 (6%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
LIHRDS LSP NPN R++ + SI+R+ + K N++Q ++P+
Sbjct: 38 LIHRDSPLSPLYNPNHTDFDRLRNAFSRSISRVNVFKTKAVDINSFQNDLVPNG----GE 93
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+++ +SIG P V DTGS L WV C PC C Q +F PSRSSSY ++ C S
Sbjct: 94 YFMKMSIGTPLVEVIVIADTGSDLTWVQCLPCDPCYRQKSPLFDPSRSSSYRHMLCGSRF 153
Query: 121 CRYFPYAR--CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
C + C+ + C Y Y T+ ++TE+ T +T +H+ +VFGCG
Sbjct: 154 CNALDVSEQACTMDTNICEYHYSYGDKSYTNGNLATEKFTIGSTSSRPVHLSPIVFGCGT 213
Query: 179 STNRNFK--FSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILGDGAII 232
F SGI GLG G SLVSQL+S FSYC+ L + + +K+ G ++I
Sbjct: 214 GNGGTFDELGSGIVGLGGGALSLVSQLSSIIKGKFSYCLVPLSEQSNVTSKIKFGTDSVI 273
Query: 233 D--EGDATPL--QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
+ +TPL + D +YY+TLEAISV + L + + G V+IDSGT +T+L
Sbjct: 274 SGPQVVSTPLVSKQPDTYYYVTLEAISVGNKRLPYTNGLLNGNVEKGNVIIDSGTTLTFL 333
Query: 289 VKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEK 348
E + L + ++ E++ +C+ DL P + HF A + L+
Sbjct: 334 DSEFFTELERVLEETVKAERVSDPRGLFSVCFRSAGDIDL---PVIAVHFN-DADVKLQP 389
Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ F + D C + ++ + G +AQ + VGYD+ ++ +F+ DC
Sbjct: 390 LNTFVKADEDLLCFTMISSN------QIGIFGNLAQMDFLVGYDLEKRTVSFKPTDC 440
>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 415
Score = 174 bits (440), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 133/424 (31%), Positives = 197/424 (46%), Gaps = 64/424 (15%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISN-- 58
LIHRDS SP+ P +N R+ + SI R+ + + Y P + +++
Sbjct: 33 LIHRDSSKSPFYQPTQNKYERIANAVRRSINRVNHFYK-------YSLTSTPQSTVNSDK 85
Query: 59 SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
+ ++ SIG PP F +DTGS L+W+ C PC+ C PQ+ IF PS SSSY +PC S
Sbjct: 86 GEYLMSYSIGTPPFKVFGFVDTGSDLVWLQCEPCKQCYPQITPIFDPSLSSSYQNIPCLS 145
Query: 119 EHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
+ C C + ++S E LT +T ++ + GCG+
Sbjct: 146 DTCHSMRTTSC-----------------DVRGYLSVETLTLDSTTGYSVSFPKTMIGCGY 188
Query: 179 STNRNFK--FSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHN---KLILGDG 229
F SGI GLG G SL SQL +S FSYC+G +L N KL GD
Sbjct: 189 RNTGTFHGPSSGIVGLGSGPMSLPSQLGTSIGGKFSYCLG-----PWLPNSTSKLNFGDA 243
Query: 230 AII--DEGDATPLQFIDGH--YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDV 285
AI+ D TP+ D YY+TLEA SV ++++ + ++ G ++IDSGT
Sbjct: 244 AIVYGDGAMTTPIVKKDAQSGYYLTLEAFSVGNKLIEFGGPTYGGNE--GNILIDSGTTF 301
Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLC----YHGIMSSDLKGFPTVRFHFRGG 341
T+L + Y V + E + + KLC YHG + P + HF+ G
Sbjct: 302 TFLPYDVYYRFESAVAEYINLEHVEDPNGTFKLCYNVAYHGFEA------PLITAHFK-G 354
Query: 342 AKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQ 401
A + L S F + C+A P+ ++ G +AQQ VGY++ + TF+
Sbjct: 355 ADIKLYYISTFIKVSDGIACLAFIPSQT-------AIFGNVAQQNLLVGYNLVQNTVTFK 407
Query: 402 RMDC 405
+DC
Sbjct: 408 PVDC 411
>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 129/415 (31%), Positives = 191/415 (46%), Gaps = 28/415 (6%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
LIHRDS SP P +N V SI R L + S NT ++ + ++
Sbjct: 32 LIHRDSSKSPLYKPAQNKFQHVVNAARRSINRANRLFKDSLS-NTPESTVY----VNGGE 86
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ + S+G PP + +DTGS ++W+ C PC C Q IF PS+SSSY +PC S
Sbjct: 87 YLMTYSVGTPPFNVYGVVDTGSDIVWLQCKPCEQCYKQTTPIFNPSKSSSYKNIPCSSNL 146
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C+ Y C+ ++ C YT + + +S E LT +T ++ V GCG +
Sbjct: 147 CQSVRYTSCNK-QNSCEYTINFSDQSYSQGELSVETLTLDSTTGHSVSFPKTVIGCGHNN 205
Query: 181 NRNFK--FSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDE 234
F+ SGI GLGIG SL +QL SS FSYC+ L +KL GD A++
Sbjct: 206 RGMFQGETSGIVGLGIGPVSLTTQLKSSIGGKFSYCLLPLLVDSNKTSKLNFGDAAVVSG 265
Query: 235 GDATPLQFI----DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVK 290
F+ YY+TLEA SV + ++ D G +++DSGT +T L
Sbjct: 266 DGVVSTPFVKKDPQAFYYLTLEAFSVGNKRIEFE---VLDDSEEGNIILDSGTTLTLLPS 322
Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS 350
Y L V ++ +++ + LCY ++SD FP + HF+ GA + L S
Sbjct: 323 HVYTNLESAVAQLVKLDRVDDPNQLLNLCYS--ITSDQYDFPIITAHFK-GADIKLNPIS 379
Query: 351 MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
F C+A + + G +AQ VGYD+ + +F+ DC
Sbjct: 380 TFAHVADGVVCLAFTSSQTG------PIFGNLAQLNLLVGYDLQQNIVSFKPSDC 428
>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 431
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 132/418 (31%), Positives = 191/418 (45%), Gaps = 26/418 (6%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
LIHRDS SP+ N E + R++ I S S N+ Q+ I +
Sbjct: 30 LIHRDSPKSPFYNSAETSSQRMRNAIRRSARSTLQFSNDDASPNSPQSFITSNR----GE 85
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ +NISIG PPVP DTGS L+W C PC DC Q +F P SS+Y V C S
Sbjct: 86 YLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSPLFDPKESSTYRKVSCSSSQ 145
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
CR A CS ++ C YT Y T V+ + +T ++ + +++++ GCG
Sbjct: 146 CRALEDASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGSSGRRPVSLRNMIIGCGHEN 205
Query: 181 NRNF--KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
F SGI GLG G +SLVSQL N FSYC+ L +K+ G I+
Sbjct: 206 TGTFDPAGSGIIGLGGGSTSLVSQLRKSINGKFSYCLVPFTSETGLTSKINFGTNGIVSG 265
Query: 235 GDATPLQFIDG----HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVK 290
+ +Y++ LEAISV + + IF + G ++IDSGT +T L
Sbjct: 266 DGVVSTSMVKKDPATYYFLNLEAISVGSKKIQFTSTIFGTGE--GNIVIDSGTTLTLLPS 323
Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS 350
Y L V ++ E+++ LCY SS K P + HF+GG + L +
Sbjct: 324 NFYYELESVVASTIKAERVQDPDGILSLCYRD--SSSFK-VPDITVHFKGG-DVKLGNLN 379
Query: 351 MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
F D C A + N + ++ G +AQ + VGYD +F++ DC +
Sbjct: 380 TFVAVSEDVSCFAF---AANEQ---LTIFGNLAQMNFLVGYDTVSGTVSFKKTDCSQM 431
>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 130/420 (30%), Positives = 209/420 (49%), Gaps = 37/420 (8%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNT--YQAQILPSNDISN 58
L HRDS+LSP + + R+ S++R A L + + Q+ I P +
Sbjct: 34 LFHRDSLLSPLEFSSLSHYDRLANAFRRSLSRSAALLNRAATSGAVGLQSSIGPGS---- 89
Query: 59 SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
+ +++SIG PPV DTGS L W C PC C QL IF P +S+S+++VPC++
Sbjct: 90 GEYLMSVSIGTPPVDYLGIADTGSDLTWAQCLPCLKCYQQLRPIFNPLKSTSFSHVPCNT 149
Query: 119 EHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
+ C C + C Y+ Y G T S L F+ + V+ V+ GCG
Sbjct: 150 QTCHAVDDGHC-GVQGVCDYSYTY--GDRT---YSKGDLGFEKITIGSSSVKSVI-GCGH 202
Query: 179 STNRNFKF-SGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAI 231
+++ F F SG+ GLG G+ SLVSQ++ + FSYC+ +L + + K+ G+ A+
Sbjct: 203 ASSGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLL--SHANGKINFGENAV 260
Query: 232 IDEGD--ATPL--QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
+ +TPL + +YYITLEAIS+ F + G V+IDSGT +T
Sbjct: 261 VSGPGVVSTPLISKNTVTYYYITLEAISIGNE----RHMAFAKQ---GNVIIDSGTTLTI 313
Query: 288 LVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCY-HGIMSSDLKGFPTVRFHFRGGAKLAL 346
L KE Y+ + ++ ++ ++++ LC+ GI ++ G P + HF GGA + L
Sbjct: 314 LPKELYDGVVSSLLKVVKAKRVKDPHGSLDLCFDDGINAAASLGIPVITAHFSGGANVNL 373
Query: 347 EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
+ F + + C+ + AS F +IG +AQ + +GYD+ K+ +F+ C
Sbjct: 374 LPINTFRKVADNVNCLTLKAASPTTE---FGIIGNLAQANFLIGYDLEAKRLSFKPTVCA 430
>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 172 bits (435), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 132/416 (31%), Positives = 200/416 (48%), Gaps = 27/416 (6%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
LIHRDS SPY P EN SI R + K +T ++ ++P
Sbjct: 32 LIHRDSPKSPYYKPTENKYQHFVDAARRSINRANHF-FKDSDTSTPESTVIPDR----GG 86
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ + S+G PP + DTGS ++W+ C PC C Q IF PS+SSSY +PC S+
Sbjct: 87 YLMTYSVGTPPTKIYGIADTGSDIVWLQCEPCEQCYNQTTPIFNPSKSSSYKNIPCSSKL 146
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C CS ++ C Y Y + +S + L+ ++T S + +V GCG
Sbjct: 147 CHSVRDTSCSD-QNSCQYKISYGDSSHSQGDLSVDTLSLESTSGSPVSFPKIVIGCGTDN 205
Query: 181 NRNF--KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLI-LGDGAIID 233
F SGI GLG G SL++QL SS FSYC+ L + + + ++ GD A++
Sbjct: 206 AGTFGGASSGIVGLGGGPVSLITQLGSSIGGKFSYCLVPLLNKESNASSILSFGDAAVV- 264
Query: 234 EGD---ATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLV 289
GD +TPL D Y++TL+A SV + ++ + DD G ++IDSGT +T +
Sbjct: 265 SGDGVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGGSSEGGDDE-GNIIIDSGTTLTLIP 323
Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKD 349
+ Y L V+ ++ +++ + LCY + S+ FP + HF+ GA + L
Sbjct: 324 SDVYTNLESAVVDLVKLDRVDDPNQQFSLCYS--LKSNEYDFPIITVHFK-GADVELHSI 380
Query: 350 SMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
S F C A P+ S+ G +AQQ VGYD+ +K +F+ DC
Sbjct: 381 STFVPITDGIVCFAFQPSPQLG-----SIFGNLAQQNLLVGYDLQQKTVSFKPTDC 431
>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
Length = 448
Score = 171 bits (434), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 122/374 (32%), Positives = 187/374 (50%), Gaps = 43/374 (11%)
Query: 61 FYVNISIGQPPVPQFTAM-DTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSE 119
+ ++++IG PP+ ++TAM DTGS L+W C PC C+ Q F P+RS++Y VPC S
Sbjct: 92 YLMDLAIGTPPL-RYTAMVDTGSDLIWTQCAPCVLCADQPTPYFRPARSATYRLVPCRSP 150
Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG-F 178
C PY C + C+Y Y T+ +++E TF + S + V DV FGCG
Sbjct: 151 LCAALPYPAC-FQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVSDVAFGCGNI 209
Query: 179 STNRNFKFSGIFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNKLILGDGAIIDEGDA 237
++ + SG+ GLG G SLVSQL S FSYC+ S P+ ++L G A ++ +A
Sbjct: 210 NSGQLANSSGMVGLGRGPLSLVSQLGPSRFSYCLTSFLSPE--PSRLNFGVFATLNGTNA 267
Query: 238 ---------TPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTD 284
TPL + Y+++L+ IS+ + L I+P +F +D G GGV IDSGT
Sbjct: 268 SSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGVFIDSGTS 327
Query: 285 VTWLVKEAYEALRDEVMIRL---------EGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
+TWL ++AY+A+R E++ L E + WP S P +
Sbjct: 328 LTWLQQDAYDAVRHELVSVLRPLPPTNDTEIGLETCFPWPPP-------PSVAVTVPDME 380
Query: 336 FHFRGGAKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
HF GGA + + ++ M C+A+ R + ++IG QQ ++ YDI
Sbjct: 381 LHFDGGANMTVPPENYMLIDGATGFLCLAM------IRSGDATIIGNYQQQNMHILYDIA 434
Query: 395 RKQKTFQRMDCEVL 408
+F C ++
Sbjct: 435 NSLLSFVPAPCNIV 448
>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
Length = 448
Score = 171 bits (434), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 122/374 (32%), Positives = 187/374 (50%), Gaps = 43/374 (11%)
Query: 61 FYVNISIGQPPVPQFTAM-DTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSE 119
+ ++++IG PP+ ++TAM DTGS L+W C PC C+ Q F P+RS++Y VPC S
Sbjct: 92 YLMDLAIGTPPL-RYTAMVDTGSDLIWTQCAPCVLCADQPTPYFRPARSATYRLVPCRSP 150
Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG-F 178
C PY C + C+Y Y T+ +++E TF + S + V DV FGCG
Sbjct: 151 LCAALPYPAC-FQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVSDVAFGCGNI 209
Query: 179 STNRNFKFSGIFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNKLILGDGAIIDEGDA 237
++ + SG+ GLG G SLVSQL S FSYC+ S P+ ++L G A ++ +A
Sbjct: 210 NSGQLANSSGMVGLGRGPLSLVSQLGPSRFSYCLTSFLSPE--PSRLNFGVFATLNGTNA 267
Query: 238 ---------TPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTD 284
TPL + Y+++L+ IS+ + L I+P +F +D G GGV IDSGT
Sbjct: 268 SSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGVFIDSGTS 327
Query: 285 VTWLVKEAYEALRDEVMIRL---------EGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
+TWL ++AY+A+R E++ L E + WP S P +
Sbjct: 328 LTWLQQDAYDAVRRELVSVLRPLPPTNDTEIGLETCFPWPPP-------PSVAVTVPDME 380
Query: 336 FHFRGGAKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
HF GGA + + ++ M C+A+ R + ++IG QQ ++ YDI
Sbjct: 381 LHFDGGANMTVPPENYMLIDGATGFLCLAM------IRSGDATIIGNYQQQNMHILYDIA 434
Query: 395 RKQKTFQRMDCEVL 408
+F C ++
Sbjct: 435 NSLLSFVPAPCNIV 448
>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
Length = 437
Score = 171 bits (433), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 132/424 (31%), Positives = 204/424 (48%), Gaps = 41/424 (9%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
LIHRDS LSP+ NP+ P+ R+ SI+RL + + +N +L + N
Sbjct: 33 LIHRDSPLSPFYNPSLTPSQRIINAALRSISRLNRVSNLLDQNNKLPQSVL---ILHNGE 89
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ + IG PPV + DTGS L+WV C PC C PQ +F P +SS++ C S+
Sbjct: 90 YLMRFYIGTPPVERLATADTGSDLIWVQCSPCASCFPQSTPLFQPLKSSTFMPTTCRSQP 149
Query: 121 CR-YFPYARCSAYKHRCIYTQLYLIGPETSV---FVSTEQLTFKNTDE-STIHVQDVVFG 175
C P + CIYT Y G + S +STE L F + T+ + FG
Sbjct: 150 CTLLLPEQKGCGKSGECIYT--YKYGDQYSFSEGLLSTETLRFDSQGGVQTVAFPNSFFG 207
Query: 176 CGFSTN----RNFKFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILG 227
CG N ++K +GI GLG G SLVSQ+ FSYC+ L +KL G
Sbjct: 208 CGLYNNITVFPSYKLTGIMGLGAGPLSLVSQIGDQIGHKFSYCLLPLGSTS--TSKLKFG 265
Query: 228 DGAIID-EG-DATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSG 282
+ +II EG +TP+ ++ +Y++ LEA++V + + + G V+IDSG
Sbjct: 266 NESIITGEGVVSTPMIIKPWLPTYYFLNLEAVTVAQKTVPTG-------STDGNVIIDSG 318
Query: 283 TDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGA 342
T +T+L + Y + L E ++ P C+ D FP + F F GA
Sbjct: 319 TLLTYLGESFYYNFAASLQESLAVELVQDVLSPLPFCFP---YRDNFVFPEIAFQFT-GA 374
Query: 343 KLALEKDSMFYQPRP-DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQ 401
+++L+ ++F + C+ + P+S++ S+ G +Q + V YD+ K+ +FQ
Sbjct: 375 RVSLKPANLFVMTEDRNTVCLMIAPSSVS----GISIFGSFSQIDFQVEYDLEGKKVSFQ 430
Query: 402 RMDC 405
DC
Sbjct: 431 PTDC 434
>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 454
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 126/368 (34%), Positives = 177/368 (48%), Gaps = 32/368 (8%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
N F +++SIG P + +DTGS L+W C PC DC Q +F PS SS+YA VPC
Sbjct: 102 NGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCS 161
Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
S C P ++C++ +C YT Y T ++TE T + + VVFGCG
Sbjct: 162 SASCSDLPTSKCTS-ASKCGYTYTYGDSSSTQGVLATETFTLAKS-----KLPGVVFGCG 215
Query: 178 FSTNRNFKFS---GIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIID 233
TN FS G+ GLG G SLVSQL FSYC+ SL D + ++ L+LG A I
Sbjct: 216 -DTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCLTSLDDTN--NSPLLLGSLAGIS 272
Query: 234 EG-------DATPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSG 282
E TPL YY++L+AI+V + + + F +DD GGV++DSG
Sbjct: 273 EASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSG 332
Query: 283 TDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS-DLKGFPTVRFHFRGG 341
T +T+L + Y AL+ ++ LC+ D P + FHF GG
Sbjct: 333 TSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGG 392
Query: 342 AKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
A L L ++ M A C+ V + +R S+IG QQ + YD+G +F
Sbjct: 393 ADLDLPAENYMVLDGGSGALCLTV----MGSR--GLSIIGNFQQQNFQFVYDVGHDTLSF 446
Query: 401 QRMDCEVL 408
+ C L
Sbjct: 447 APVQCNKL 454
>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
Length = 444
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 126/368 (34%), Positives = 177/368 (48%), Gaps = 32/368 (8%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
N F +++SIG P + +DTGS L+W C PC DC Q +F PS SS+YA VPC
Sbjct: 92 NGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCS 151
Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
S C P ++C++ +C YT Y T ++TE T + + VVFGCG
Sbjct: 152 SASCSDLPTSKCTS-ASKCGYTYTYGDSSSTQGVLATETFTLAKS-----KLPGVVFGCG 205
Query: 178 FSTNRNFKFS---GIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIID 233
TN FS G+ GLG G SLVSQL FSYC+ SL D + ++ L+LG A I
Sbjct: 206 -DTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCLTSLDDTN--NSPLLLGSLAGIS 262
Query: 234 EG-------DATPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSG 282
E TPL YY++L+AI+V + + + F +DD GGV++DSG
Sbjct: 263 EASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSG 322
Query: 283 TDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS-DLKGFPTVRFHFRGG 341
T +T+L + Y AL+ ++ LC+ D P + FHF GG
Sbjct: 323 TSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGG 382
Query: 342 AKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
A L L ++ M A C+ V + +R S+IG QQ + YD+G +F
Sbjct: 383 ADLDLPAENYMVLDGGSGALCLTV----MGSR--GLSIIGNFQQQNFQFVYDVGHDTLSF 436
Query: 401 QRMDCEVL 408
+ C L
Sbjct: 437 APVQCNKL 444
>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 132/416 (31%), Positives = 199/416 (47%), Gaps = 27/416 (6%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
LIHRDS SPY P EN SI R + K +T ++ ++P
Sbjct: 32 LIHRDSPKSPYYKPTENKYQHFVDAARRSINRANHF-FKDSDTSTPESTVIPDR----GG 86
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ + S+G PP + DTGS ++W+ C PC C Q IF PS+SSSY +PC S+
Sbjct: 87 YLMTYSVGTPPTKIYGIADTGSDIVWLQCEPCEQCYNQTTPIFNPSKSSSYKNIPCLSKL 146
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C CS ++ C Y Y + +S + L+ ++T S + V GCG
Sbjct: 147 CHSVRDTSCSD-QNSCQYKISYGDSSHSQGDLSVDTLSLESTSGSPVSFPKTVIGCGTDN 205
Query: 181 NRNF--KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLI-LGDGAIID 233
F SGI GLG G SL++QL SS FSYC+ L + + + ++ GD A++
Sbjct: 206 AGTFGGASSGIVGLGGGPVSLITQLGSSIGGKFSYCLVPLLNKESNASSILSFGDAAVV- 264
Query: 234 EGD---ATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLV 289
GD +TPL D Y++TL+A SV + ++ + DD G ++IDSGT +T +
Sbjct: 265 SGDGVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGGSSEGGDDE-GNIIIDSGTTLTLIP 323
Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKD 349
+ Y L V+ ++ +++ + LCY + S+ FP + HF+ GA + L
Sbjct: 324 SDVYTNLESAVVDLVKLDRVDDPNQQFSLCYS--LKSNEYDFPIITAHFK-GADIELHSI 380
Query: 350 SMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
S F C A P+ S+ G +AQQ VGYD+ +K +F+ DC
Sbjct: 381 STFVPITDGIVCFAFQPSPQLG-----SIFGNLAQQNLLVGYDLQQKTVSFKPTDC 431
>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
Length = 423
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 126/368 (34%), Positives = 177/368 (48%), Gaps = 32/368 (8%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
N F +++SIG P + +DTGS L+W C PC DC Q +F PS SS+YA VPC
Sbjct: 71 NGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCS 130
Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
S C P ++C++ +C YT Y T ++TE T + + VVFGCG
Sbjct: 131 SASCSDLPTSKCTS-ASKCGYTYTYGDSSSTQGVLATETFTLAKS-----KLPGVVFGCG 184
Query: 178 FSTNRNFKFS---GIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIID 233
TN FS G+ GLG G SLVSQL FSYC+ SL D + ++ L+LG A I
Sbjct: 185 -DTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCLTSLDDTN--NSPLLLGSLAGIS 241
Query: 234 EG-------DATPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSG 282
E TPL YY++L+AI+V + + + F +DD GGV++DSG
Sbjct: 242 EASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSG 301
Query: 283 TDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS-DLKGFPTVRFHFRGG 341
T +T+L + Y AL+ ++ LC+ D P + FHF GG
Sbjct: 302 TSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGG 361
Query: 342 AKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
A L L ++ M A C+ V + +R S+IG QQ + YD+G +F
Sbjct: 362 ADLDLPAENYMVLDGGSGALCLTV----MGSR--GLSIIGNFQQQNFQFVYDVGHDTLSF 415
Query: 401 QRMDCEVL 408
+ C L
Sbjct: 416 APVQCNKL 423
>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 412
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 118/413 (28%), Positives = 202/413 (48%), Gaps = 42/413 (10%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
LIH S SP+ NP E R+ +N SI R+ YL + S + + Q +P + +
Sbjct: 31 LIHPISSRSPFYNPKETQIQRISSILNYSINRVRYLNH-VFSFSPNKIQDVPLSSFMGAG 89
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ ++ SIG PP ++ +DTG+ +W C PC+ C Q +F+PS+SS+Y +PC S
Sbjct: 90 YVMSYSIGTPPFQLYSLIDTGNDNIWFQCKPCKPCLNQTSPMFHPSKSSTYKTIPCTSPI 149
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C+ +A H ++ + LT + + + I +++V GCG
Sbjct: 150 CK-------NADGH----------------YLGVDTLTLNSNNGTPISFKNIVIGCGHRN 186
Query: 181 NRNFK--FSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIID- 233
+ SG GL G S +SQLNSS FSYC+ L + + +KL GD + +
Sbjct: 187 QGPLEGYVSGNIGLARGPLSFISQLNSSIGGKFSYCLVPLFSKENVSSKLHFGDKSTVSG 246
Query: 234 -EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEA 292
+TP++ +G Y+++LEA SV ++ + + D+ G +IDSGT +T L K+
Sbjct: 247 LGTVSTPIKEENG-YFVSLEAFSVGDHIIKL-----ENSDNRGNSIIDSGTTMTILPKDV 300
Query: 293 YEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMF 352
Y L V+ ++ ++++ S LCY ++ L + HF G+++ L + F
Sbjct: 301 YSRLESVVLDMVKLKRVKDPSQQFNLCYQTTSTTLLTKVLIITAHF-SGSEVHLNALNTF 359
Query: 353 YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
Y + C A + + ++ G + QQ + VG+D+ +K +F+ DC
Sbjct: 360 YPITDEVICFAFVSGG---NFSSLAIFGNVVQQNFLVGFDLNKKTISFKPTDC 409
>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
Length = 437
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 123/358 (34%), Positives = 174/358 (48%), Gaps = 31/358 (8%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ +N+SIG P P MDTGS L+W C PC C Q IF P SSS++ +PC S+
Sbjct: 95 YLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQL 154
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C+ CS + C YT Y G ET + TE LTF ++ + ++ FGCG
Sbjct: 155 CQALQSPTCS--NNSCQYTYGYGDGSETQGSMGTETLTFG-----SVSIPNITFGCG-EN 206
Query: 181 NRNF---KFSGIFGLGIGRSSLVSQLN-SSFSYC---IGSLHDPDYLHNKLILGDGAIID 233
N+ F +G+ G+G G SL SQL+ + FSYC IGS + L L +
Sbjct: 207 NQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTPIGSSNSSTLLLGS--LANSVTAG 264
Query: 234 EGDATPLQF--IDGHYYITLEAISVDGRMLDINPNIFK--RDDSGGGVMIDSGTDVTWLV 289
+ T +Q I YYITL +SV L I+P++FK ++ GG++IDSGT +T+ V
Sbjct: 265 SPNTTLIQSSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGIIIDSGTTLTYFV 324
Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF--PTVRFHFRGGAKLALE 347
AY+A+R + ++ + S LC+ M SD PT HF GG L L
Sbjct: 325 DNAYQAVRQAFISQMNLSVVNGSSSGFDLCFQ--MPSDQSNLQIPTFVMHFDGG-DLVLP 381
Query: 348 KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
++ F P C+A+ +S S+ G + QQ V YD G +F C
Sbjct: 382 SENYFISPSNGLICLAMGSSS-----QGMSIFGNIQQQNLLVVYDTGNSVVSFLSAQC 434
>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 461
Score = 168 bits (425), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 124/405 (30%), Positives = 195/405 (48%), Gaps = 27/405 (6%)
Query: 20 HRVQRGINISIARLAYLQEKI--RSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTA 77
R++RG+ RL L + ++ T Q+ N F + ++IG PP
Sbjct: 68 ERLRRGVARGKNRLHRLNAMVLAAANATVGDQVKAPVVAGNGEFLMKLAIGSPPRSFSAI 127
Query: 78 MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCI 137
MDTGS L+W C PC+ C Q IF P +SSS+ + C SE C P + CS+ C
Sbjct: 128 MDTGSDLIWTQCKPCQQCFDQSTPIFDPKQSSSFYKISCSSELCGALPTSTCSS--DGCE 185
Query: 138 YTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRN--FKFSGIFGLGIG 195
Y Y T ++ E TF ++ E I + + FGCG N + + +G+ GLG G
Sbjct: 186 YLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLGFGCGNDNNGDGFSQGAGLVGLGRG 245
Query: 196 RSSLVSQLNS-SFSYCIGSLHDPDYLHNKLILGDGAII------DEGDATPL---QFIDG 245
SLVSQL F+YC+ ++ D + L+LG A I DE TPL
Sbjct: 246 PLSLVSQLKEQKFAYCLTAIDDSK--PSSLLLGSLANITPKTSKDEMKTTPLIKNPSQPS 303
Query: 246 HYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRL 304
YY++L+ ISV G L I + F+ DD GGV+IDSGT +T++ A+ +L++E + ++
Sbjct: 304 FYYLSLQGISVGGTQLSIPKSTFELHDDGSGGVIIDSGTTITYVENSAFTSLKNEFIAQM 363
Query: 305 EGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS-MFYQPRPDAFCMA 363
S + LC++ ++ P + FHF+ GA L L ++ M + C+A
Sbjct: 364 NLPVDDSGTGGLDLCFNLPAGTNQVEVPKLTFHFK-GADLELPGENYMIGDSKAGLLCLA 422
Query: 364 VNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
+ + S+ G + QQ + V +D+ + +F C+ +
Sbjct: 423 IGSSR------GMSIFGNLQQQNFMVVHDLQEETLSFLPTQCDSI 461
>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 440
Score = 168 bits (425), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 132/425 (31%), Positives = 192/425 (45%), Gaps = 40/425 (9%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIAR----LAYLQEKIRSHNTYQAQILPSNDI 56
LIHR+S LSP+ NP+ P+ R++ + S AR L Q RS T +P I
Sbjct: 33 LIHRESPLSPFYNPSLTPSERIKNTVLRSFARSKRRLRLSQNDDRSPGTIT---IPDEPI 89
Query: 57 SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPC 116
+ L + IG PPV +F DTGS L+WV C PC C PQ +F P +SS++ VPC
Sbjct: 90 TEYL--MRFYIGTPPVERFAIADTGSDLIWVQCAPCEKCVPQNAPLFDPRKSSTFKTVPC 147
Query: 117 DSEHCRYFPYAR--CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
DS+ C P ++ C +C Y +Y S + E + F + + + I + F
Sbjct: 148 DSQPCTLLPPSQRACVGKSGQCYYQYIYGDHTLVSGILGFESINFGSKNNA-IKFPKLTF 206
Query: 175 GCGFSTNRNFKFS----GIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLIL 226
GC FS N S G+ GLG+G SL+SQL FSYC L +K+
Sbjct: 207 GCTFSNNDTVDESKRNMGLVGLGVGPLSLISQLGYQIGRKFSYCFPPLSSNS--TSKMRF 264
Query: 227 GDGAIIDEGD---ATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
G+ AI+ + +TPL +YY+ LE +S+ + + + + G ++ID
Sbjct: 265 GNDAIVKQIKGVVSTPLIIKSIGPSYYYLNLEGVSIGNKKVKTS-----ESQTDGNILID 319
Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRG 340
SGT T L + Y V E ++ C+ K FP V F F
Sbjct: 320 SGTSFTILKQSFYNKFVALVKEVYGVEAVKIPPLVYNFCFEN--KGKRKRFPDVVFLFT- 376
Query: 341 GAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
GAK+ ++ ++F + CM P S + S+ G AQ Y V YD+ +F
Sbjct: 377 GAKVRVDASNLFEAEDNNLLCMVALPTSDEDD----SIFGNHAQIGYQVEYDLQGGMVSF 432
Query: 401 QRMDC 405
DC
Sbjct: 433 APADC 437
>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 459
Score = 167 bits (424), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 124/398 (31%), Positives = 186/398 (46%), Gaps = 42/398 (10%)
Query: 41 RSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ-L 99
R + T ++ ++ + ++V+I +G PP DTGS L+WV C CR+CS
Sbjct: 68 RPNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPP 127
Query: 100 GTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQL---------YLIGPETSV 150
+ F P SSS++ C HCR P+A H C +T+L Y G +S
Sbjct: 128 SSAFLPRHSSSFSPFHCFDPHCRLLPHAP----HHLCNHTRLHSPCRFLYSYADGSLSSG 183
Query: 151 FVSTEQLTFKNTDESTIHVQDVVFGCGFSTN----RNFKFS---GIFGLGIGRSSLVSQL 203
F S E T K+ S IH++ + FGCGF + +F+ G+ GLG G S SQL
Sbjct: 184 FFSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQL 243
Query: 204 ----NSSFSYCIG--SLHDPDYLHNKLILGDG------AIIDEGDATPLQ---FIDGHYY 248
+ FSYC+ +L P + L++G G + TPLQ YY
Sbjct: 244 GRRFGNKFSYCLMDYTLSPPPT--SFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYY 301
Query: 249 ITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGE 307
IT+ +I++DG L INP +++ D+ G GG ++DSGT +T+L K AYE + V R++
Sbjct: 302 ITIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLP 361
Query: 308 QMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPA 367
+ LC + S P +RF GGA A + F + C+A+
Sbjct: 362 NAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAV 421
Query: 368 SINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
N FS+IG + QQ + + +D + F R C
Sbjct: 422 ESGN---GFSVIGNLMQQGFLLEFDKEESRLGFTRRGC 456
>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 462
Score = 167 bits (424), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 127/417 (30%), Positives = 198/417 (47%), Gaps = 35/417 (8%)
Query: 15 NENPAHRVQRGINISIARLAYLQE----KIRSHNTYQAQILPSNDISNSLFYVNISIGQP 70
N ++QRGIN RL L + S+ I + F + +SIG P
Sbjct: 58 NLTKIQKIQRGINRGFHRLNRLGAVAVLAVASNPDDTNNIKAPTHGGSGEFLMELSIGNP 117
Query: 71 PVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCS 130
V +DTGS L+W C PC +C Q IF P +SSSY+ V C S C P + C+
Sbjct: 118 AVKYAAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCNALPRSNCN 177
Query: 131 AYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRN--FKFSG 188
K C Y Y T ++TE TF+ DE++I + FGCG + + SG
Sbjct: 178 EDKDSCEYLYTYGDYSSTRGLLATETFTFE--DENSI--SGIGFGCGVENEGDGFSQGSG 233
Query: 189 IFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDY--------LHNKLILGDGAIIDEGDATP 239
+ GLG G SL+SQL + FSYC+ S+ D + L + ++ GA +D G+ T
Sbjct: 234 LVGLGRGPLSLISQLKETKFSYCLTSIEDSEASSSLFIGSLASGIVNKTGANLD-GEVTK 292
Query: 240 LQFI------DGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEA 292
+ YY+ L+ I+V + L + + F+ +D GG++IDSGT +T+L + A
Sbjct: 293 TMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELSEDGTGGMIIDSGTTITYLEETA 352
Query: 293 YEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS-M 351
++ L++E R+ S S LC+ ++ P + FHF+ GA L L ++ M
Sbjct: 353 FKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPNAAKNIAVPKLIFHFK-GADLELPGENYM 411
Query: 352 FYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
C+A+ ++ S+ G + QQ +NV +D+ ++ TF +C L
Sbjct: 412 VADSSTGVLCLAMGSSN------GMSIFGNVQQQNFNVLHDLEKETVTFVPTECGKL 462
>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 444
Score = 167 bits (424), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 128/418 (30%), Positives = 193/418 (46%), Gaps = 25/418 (5%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEK--IRSHNTYQAQILPSNDISN 58
+IHRDS SPY P E RV + SI R + + + S NT ++ ++ S
Sbjct: 36 IIHRDSSRSPYYRPTETQFQRVANALRRSINRANHFNKPNLVASTNTAESTVIASQ---- 91
Query: 59 SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
+ ++ S+G PP +DTGS ++W+ C PC DC Q IF PS+S +Y +PC S
Sbjct: 92 GEYLMSYSVGTPPFQILGIVDTGSDIIWLQCQPCEDCYNQTTPIFDPSQSKTYKTLPCSS 151
Query: 119 EHCRYF-PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
C+ A CS+ C YT Y + +S E LT +TD S++ V GCG
Sbjct: 152 NICQSVQSAASCSSNNDECEYTITYGDNSHSQGDLSVETLTLGSTDGSSVQFPKTVIGCG 211
Query: 178 FSTNRNFKFSGIFGLGIGRSSL------VSQLNSSFSYCIGSLHDPDYLHNKLILGDGAI 231
+ F+ G +G+G + S + FSYC+ L +KL GD A+
Sbjct: 212 HNNKGTFQREGSGIVGLGGGPVSLISQLSSSIGGKFSYCLAPLFSQSNSSSKLNFGDEAV 271
Query: 232 IDEGDATPLQFID----GHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
+ + G Y++TLEA SV ++ + F+ G ++IDSGT +T
Sbjct: 272 VSGRGTVSTPIVPKNGLGFYFLTLEAFSVGDNRIEFGSSSFESSGGEGNIIIDSGTTLTI 331
Query: 288 LVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALE 347
L ++ Y L V +E E++ S +LCY SSD P + HF+ GA + L
Sbjct: 332 LPEDDYLNLESAVADAIELERVEDPSKFLRLCYR-TTSSDELNVPVITAHFK-GADVELN 389
Query: 348 KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
S F + C A + I + G +AQQ VGYD+ ++ +F+ DC
Sbjct: 390 PISTFIEVDEGVVCFAFRSSKIG------PIFGNLAQQNLLVGYDLVKQTVSFKPTDC 441
>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like, partial [Cucumis sativus]
Length = 716
Score = 167 bits (423), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 124/405 (30%), Positives = 195/405 (48%), Gaps = 27/405 (6%)
Query: 20 HRVQRGINISIARLAYLQEKI--RSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTA 77
R++RG+ RL L + ++ T Q+ N F + ++IG PP
Sbjct: 323 ERLRRGVARGKNRLHRLNAMVLAAANATVGDQVKAPVVAGNGEFLMKLAIGSPPRSFSAI 382
Query: 78 MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCI 137
MDTGS L+W C PC+ C Q IF P +SSS+ + C SE C P + CS+ C
Sbjct: 383 MDTGSDLIWTQCKPCQQCFDQSTPIFDPKQSSSFYKISCSSELCGALPTSTCSS--DGCE 440
Query: 138 YTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRN--FKFSGIFGLGIG 195
Y Y T ++ E TF ++ E I + + FGCG N + + +G+ GLG G
Sbjct: 441 YLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLGFGCGNDNNGDGFSQGAGLVGLGRG 500
Query: 196 RSSLVSQLNS-SFSYCIGSLHDPDYLHNKLILGDGAII------DEGDATPL---QFIDG 245
SLVSQL F+YC+ ++ D + L+LG A I DE TPL
Sbjct: 501 PLSLVSQLKEQKFAYCLTAIDDSK--PSSLLLGSLANITPKTSKDEMKTTPLIKNPSQPS 558
Query: 246 HYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRL 304
YY++L+ ISV G L I + F+ DD GGV+IDSGT +T++ A+ +L++E + ++
Sbjct: 559 FYYLSLQGISVGGTQLSIPKSTFELHDDGSGGVIIDSGTTITYVENSAFTSLKNEFIAQM 618
Query: 305 EGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS-MFYQPRPDAFCMA 363
S + LC++ ++ P + FHF+ GA L L ++ M + C+A
Sbjct: 619 NLPVDDSGTGGLDLCFNLPAGTNQVEVPKLTFHFK-GADLELPGENYMIGDSKAGLLCLA 677
Query: 364 VNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
+ + S+ G + QQ + V +D+ + +F C+ +
Sbjct: 678 IGSSR------GMSIFGNLQQQNFMVVHDLQEETLSFLPTQCDSI 716
>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 167 bits (423), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 129/418 (30%), Positives = 200/418 (47%), Gaps = 29/418 (6%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
LIHRDS LSP+ N E R+ + SI+R+ + + + +A +D++++
Sbjct: 36 LIHRDSPLSPFYNSEETDLQRINNALRRSISRVHHFDPIAAASVSPKAA---ESDVTSNR 92
Query: 61 --FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
+ +++S+G PP DTGS L+W C PC C Q+ +F P S +Y CD+
Sbjct: 93 GEYLMSLSLGTPPFKIMGIADTGSDLIWTQCKPCERCYKQVDPLFDPKSSKTYRDFSCDA 152
Query: 119 EHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
C + CS + C Y Y T V+++ +T +T S + V GCG
Sbjct: 153 RQCSLLDQSTCSG--NICQYQYSYGDRSYTMGNVASDTITLDSTTGSPVSFPKTVIGCGH 210
Query: 179 STNRNF--KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAII 232
+ F K SGI GLG G SL+SQ+ SS FSYC+ L +KL G A++
Sbjct: 211 ENDGTFSDKGSGIVGLGAGPLSLISQMGSSVGGKFSYCLVPLSSRAGNSSKLNFGSNAVV 270
Query: 233 DEG--DATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
+TPL + + Y++TLEA+SV + + + G ++IDSGT +T
Sbjct: 271 SGPGVQSTPLLSSETMSSFYFLTLEAMSVGNERIKFGDSSLGTGE--GNIIIDSGTTLTI 328
Query: 288 LVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALE 347
+ + + L V ++EG + S +CY +SDLK P + HF GA + L+
Sbjct: 329 VPDDFFSNLSTAVGNQVEGRRAEDPSGFLSVCYSA--TSDLK-VPAITAHFT-GADVKLK 384
Query: 348 KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ F Q D C+A AS + S+ G +AQ + V Y+I K +F+ DC
Sbjct: 385 PINTFVQVSDDVVCLAF--ASTTS---GISIYGNVAQMNFLVEYNIQGKSLSFKPTDC 437
>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 407
Score = 167 bits (423), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 123/382 (32%), Positives = 188/382 (49%), Gaps = 23/382 (6%)
Query: 36 LQEKIRSHNTYQAQILPSN-DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD 94
L + SH++Y+ + S + + + +SIG PP+ + DTGS L+W C PC
Sbjct: 34 LIRRNSSHDSYKPSTIQSPVSAYDCEYLMELSIGTPPIKIYAEADTGSDLVWFQCIPCTK 93
Query: 95 CSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVST 154
C Q +F P SSSY + C +E C + CS + C YT Y T ++
Sbjct: 94 CYKQQNPMFDPRSSSSYTNITCGTESCNKLDSSLCSTDQKTCNYTYSYADNSITQGVLAQ 153
Query: 155 EQLTFKNTDESTIHVQDVVFGCGFSTNR-NFKFSGIFGLGIGRSSLVSQLNSS------- 206
E LT +T + Q ++FGCG + + N + G+ GLG G SL+SQ+ SS
Sbjct: 154 ETLTLTSTTGEPVAFQGIIFGCGHNNSGFNDREMGLIGLGRGPLSLISQIGSSLGAGGNM 213
Query: 207 FSYCIGSLHDPDYLHNKLILGDGA-IIDEGD-ATPLQFIDGH-YYITLEAISVDGRMLDI 263
FS C+ + + +++ G G+ ++ G +TPL DG Y+ TL ISV+ L
Sbjct: 214 FSQCLVPFNTDPSITSQMNFGKGSEVLGNGTVSTPLISKDGTGYFATLLGISVEDINLPF 273
Query: 264 NPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGI 323
+ + G ++IDSGT +T+L +E Y L ++V ++ E R + +LCY
Sbjct: 274 SNGSSLGTITKGNILIDSGTTITYLPEEFYHRLIEQVRNKVALEPFRIDGY--ELCYQ-- 329
Query: 324 MSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMA 383
++L G PT+ HF GG L L MF + D FC AV N YV + G A
Sbjct: 330 TPTNLNG-PTLTIHFEGGDVL-LTPAQMFIPVQDDNFCFAV--FDTNEEYVTY---GNYA 382
Query: 384 QQFYNVGYDIGRKQKTFQRMDC 405
Q Y +G+D+ R+ +F+ DC
Sbjct: 383 QSNYLIGFDLERQVVSFKATDC 404
>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
Length = 445
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 133/422 (31%), Positives = 209/422 (49%), Gaps = 34/422 (8%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARL-AYLQEKIRSHNTYQAQILPSNDISNS 59
LIHRDS +SP NP R+Q + SI+R + + + T + I+P
Sbjct: 37 LIHRDSPISPLYNPKNTYFDRLQSSFHRSISRANRFTPNSVSAAKTLEYDIIPGG----G 92
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSE 119
+++ ISIG PP+ DTGS L+WV C PC++C Q IF P +SS+Y V C++
Sbjct: 93 EYFMRISIGTPPIEVLVIADTGSDLIWVQCQPCQECYKQKSPIFNPKQSSTYRRVLCETR 152
Query: 120 HCRYF--PYARCSAYK--HRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
+C CSA+ C Y+ Y T +++TE+ +T+ S +Q++ FG
Sbjct: 153 YCNALNSDMRACSAHGFFKACGYSYSYGDHSFTMGYLATERFIIGSTNNS---IQELAFG 209
Query: 176 CGFSTNRNF--KFSGIFGLGIGRSSLVSQL----NSSFSYC-IGSLHDPDYLHNKLILGD 228
CG S NF SGI GLG G SL+SQL ++ FSYC + L ++ K++ GD
Sbjct: 210 CGNSNGGNFDEVGSGIVGLGGGSLSLISQLGTKIDNKFSYCLVPILEKSNFSLGKIVFGD 269
Query: 229 GAIIDEGD---ATPL--QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGT 283
+ I D +TPL + + YY+TLEAISV L + + G ++IDSGT
Sbjct: 270 NSFISGSDTYVSTPLVSKEPETFYYLTLEAISVGNERLAYENSRNDGNVEKGNIIIDSGT 329
Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAK 343
+T+L + Y L + +EGE++ + +C+ + +L P + HF A
Sbjct: 330 TLTFLDSKLYNKLELVLEKAVEGERVSDPNGIFSICFRDKIGIEL---PIITVHFT-DAD 385
Query: 344 LALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
+ L+ + F + D C + P++ ++ G +AQ + VGYD+ + +F
Sbjct: 386 VELKPINTFAKAEEDLLCFTMIPSN------GIAIFGNLAQMNFLVGYDLDKNCVSFMPT 439
Query: 404 DC 405
DC
Sbjct: 440 DC 441
>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 461
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 128/419 (30%), Positives = 202/419 (48%), Gaps = 39/419 (9%)
Query: 15 NENPAHRVQRGINISIARL------AYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIG 68
N ++QRGIN RL A L + +T + P++ S F + +SIG
Sbjct: 57 NLTKIQKIQRGINRGFHRLNRLGAVAVLAVASKPDDTNNIKA-PTHGGSGE-FLMELSIG 114
Query: 69 QPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYAR 128
P V +DTGS L+W C PC +C Q IF P +SSSY+ V C S C P +
Sbjct: 115 NPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCNALPRSN 174
Query: 129 CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRN--FKF 186
C+ K C Y Y T ++TE TF+ DE++I + FGCG + +
Sbjct: 175 CNEDKDACEYLYTYGDYSSTRGLLATETFTFE--DENSI--SGIGFGCGVENEGDGFSQG 230
Query: 187 SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDY--------LHNKLILGDGAIIDEGDA 237
SG+ GLG G SL+SQL + FSYC+ S+ D + L + ++ GA +D G+
Sbjct: 231 SGLVGLGRGPLSLISQLKETKFSYCLTSIEDSEASSSLFIGSLASGIVNKTGASLD-GEV 289
Query: 238 TPLQFI------DGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVK 290
T + YY+ L+ I+V + L + + F+ +D GG++IDSGT +T+L +
Sbjct: 290 TKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGMIIDSGTTITYLEE 349
Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS 350
A++ L++E R+ S S LC+ ++ P + FHF+ GA L L ++
Sbjct: 350 TAFKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPDAAKNIAVPKMIFHFK-GADLELPGEN 408
Query: 351 -MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
M C+A+ ++ S+ G + QQ +NV +D+ ++ +F +C L
Sbjct: 409 YMVADSSTGVLCLAMGSSN------GMSIFGNVQQQNFNVLHDLEKETVSFVPTECGKL 461
>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
Full=Nepenthesin-I; Flags: Precursor
gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
Length = 437
Score = 164 bits (416), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 119/356 (33%), Positives = 170/356 (47%), Gaps = 27/356 (7%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ +N+SIG P P MDTGS L+W C PC C Q IF P SSS++ +PC S+
Sbjct: 95 YLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQL 154
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C+ CS + C YT Y G ET + TE LTF ++ + ++ FGCG
Sbjct: 155 CQALSSPTCS--NNFCQYTYGYGDGSETQGSMGTETLTFG-----SVSIPNITFGCG-EN 206
Query: 181 NRNF---KFSGIFGLGIGRSSLVSQLN-SSFSYC---IGSLHDPDYLHNKLILGDGAIID 233
N+ F +G+ G+G G SL SQL+ + FSYC IGS + L L +
Sbjct: 207 NQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTPIGSSTPSNLLLGS--LANSVTAG 264
Query: 234 EGDATPLQF--IDGHYYITLEAISVDGRMLDINPNIF--KRDDSGGGVMIDSGTDVTWLV 289
+ T +Q I YYITL +SV L I+P+ F ++ GG++IDSGT +T+ V
Sbjct: 265 SPNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFV 324
Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKD 349
AY+++R E + ++ + S LC+ PT HF GG L L +
Sbjct: 325 NNAYQSVRQEFISQINLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGG-DLELPSE 383
Query: 350 SMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ F P C+A+ +S S+ G + QQ V YD G +F C
Sbjct: 384 NYFISPSNGLICLAMGSSS-----QGMSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434
>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
Length = 437
Score = 164 bits (415), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 119/358 (33%), Positives = 172/358 (48%), Gaps = 31/358 (8%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ +N+SIG P P MDTGS L+W C PC C Q IF P SSS++ +PC S+
Sbjct: 95 YLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQL 154
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C+ CS + C YT Y G ET + TE LTF ++ + ++ FGCG
Sbjct: 155 CQALQSPTCS--NNSCQYTYGYGDGSETQGSMGTETLTFG-----SVSIPNITFGCG-EN 206
Query: 181 NRNF---KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGD 236
N+ F +G+ G+G G SL SQL+ + FSYC+ + + L+LG A
Sbjct: 207 NQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTPIGSST--SSTLLLGSLANSVTAG 264
Query: 237 ATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFK--RDDSGGGVMIDSGTDVTWLV 289
+ I+ YYITL +SV L I+P++FK ++ GG++IDSGT +T+
Sbjct: 265 SPNTTLIESSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGIIIDSGTTLTYFA 324
Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF--PTVRFHFRGGAKLALE 347
AY+A+R + ++ + S LC+ M SD PT HF GG L L
Sbjct: 325 DNAYQAVRQAFISQMNLSVVNGSSSGFDLCFQ--MPSDQSNLQIPTFVMHFDGG-DLVLP 381
Query: 348 KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
++ F P C+A+ +S S+ G + QQ V YD G +F C
Sbjct: 382 SENYFISPSNGLICLAMGSSS-----QGMSIFGNIQQQNLLVVYDTGNSVVSFLFAQC 434
>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 447
Score = 164 bits (415), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 137/430 (31%), Positives = 201/430 (46%), Gaps = 37/430 (8%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
LIHRDS LSP NP R+ SI+R L I S Q+ ++ ++
Sbjct: 30 LIHRDSPLSPLYNPKNTVTDRLNAAFLRSISRSRRLNN-ILSQTDLQSGLIGAD----GE 84
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
F+++I+IG PP+ F DTGS L WV C PC+ C + G IF +SS+Y PCDS +
Sbjct: 85 FFMSITIGTPPMKVFAIADTGSDLTWVQCKPCQQCYKENGPIFDKKKSSTYKSEPCDSRN 144
Query: 121 CRYFPYAR--CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
C + C K+ C Y Y + V+TE ++ + S + VFGCG+
Sbjct: 145 CHALSSSERGCDESKNVCKYRYSYGDQSFSKGDVATETISIDSASGSPVSFPGTVFGCGY 204
Query: 179 STNRNFKFSGIFGLGIGRS--SLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAI- 231
+ F +G +G+G SL+SQL SS FSYC+ + + LG +I
Sbjct: 205 NNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCLSHKSATTNGTSVINLGTNSIP 264
Query: 232 ----IDEGD-ATPL--QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG------GGVM 278
D G +TPL + +YY+TLEAISV + + + + +D G G ++
Sbjct: 265 SSLSKDSGVISTPLVDKEPRTYYYLTLEAISVGKKKIPYTGSSYNPNDGGIFSETSGNII 324
Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLK-GFPTVRFH 337
IDSGT +T L ++ V + G + S P L H S + G P + H
Sbjct: 325 IDSGTTLTLLDSGFFDKFGAAVEELVTG--AKRVSDPQGLLSHCFKSGSAEIGLPEITVH 382
Query: 338 FRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
F GA + L + F + D C+++ P + Y NF AQ + VGYD+ +
Sbjct: 383 FT-GADVRLSPINAFVKVSEDMVCLSMVPTTEVAIYGNF------AQMDFLVGYDLETRT 435
Query: 398 KTFQRMDCEV 407
+FQRMDC
Sbjct: 436 VSFQRMDCSA 445
>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
Length = 441
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 124/426 (29%), Positives = 193/426 (45%), Gaps = 34/426 (7%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDI---S 57
L H D+ S Y P + R I S AR+A LQ S I + + S
Sbjct: 32 LTHVDAGTS-YTKP-----QLLSRAIARSKARVAALQSAAVSPAPVADPITAARVLVTAS 85
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
+ + V+++IG PP+ MDTGS L+W C PC C+ Q F RS++Y +PC
Sbjct: 86 SGEYLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCAAQPTPYFDVKRSATYRALPCR 145
Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
S C C +K C+Y Y T+ ++ E TF + + ++ FGCG
Sbjct: 146 SSRCAALSSPSC--FKKMCVYQYYYGDTASTAGVLANETFTFGAASSTKVRAANISFGCG 203
Query: 178 -FSTNRNFKFSGIFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
+ SG+ G G G SLVSQL S FSYC+ S P ++L G A ++
Sbjct: 204 SLNAGELANSSGMVGFGRGPLSLVSQLGPSRFSYCLTSYLSPT--PSRLYFGVFANLNST 261
Query: 236 D---ATPLQ--------FIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGT 283
+ +P+Q + Y+++++ IS+ + L I+P +F +D G GGV+IDSGT
Sbjct: 262 NTSSGSPVQSTPFVINPALPNMYFLSVKGISLGTKRLPIDPLVFAINDDGTGGVIIDSGT 321
Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLK-GFPTVRFHFRGGA 342
+TWL ++AYEA+R + + M C+ ++ P FHF G
Sbjct: 322 SITWLQQDAYEAVRRGLASTIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDFVFHFDGAN 381
Query: 343 KLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
++ M C+A+ P S+ ++IG QQ ++ YDI +F
Sbjct: 382 MTLPPENYMLIASTTGYLCLAMAPTSVG------TIIGNYQQQNLHLLYDIANSFLSFVP 435
Query: 403 MDCEVL 408
C+++
Sbjct: 436 APCDII 441
>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
Length = 436
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 125/398 (31%), Positives = 186/398 (46%), Gaps = 29/398 (7%)
Query: 20 HRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMD 79
R+QR + RL L K S ++ + N F +N++IG P MD
Sbjct: 59 ERLQRAVKRGRLRLQRLSAKTAS---FEPSVEAPVHAGNGEFLMNLAIGTPAETYSAIMD 115
Query: 80 TGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYT 139
TGS L+W C PC+ C Q IF P +SSS++ +PC S+ C P + CS C Y
Sbjct: 116 TGSDLIWTQCKPCKVCFDQPTPIFDPEKSSSFSKLPCSSDLCVALPISSCS---DGCEYR 172
Query: 140 QLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFS---GIFGLGIGR 196
Y T ++TE TF + V + FGCG NR +S G+ GLG G
Sbjct: 173 YSYGDHSSTQGVLATETFTFGDA-----SVSKIGFGCG-EDNRGRAYSQGAGLVGLGRGP 226
Query: 197 SSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLE 252
SL+SQL FSYC+ S+ D + + L++G A + TPL YY++LE
Sbjct: 227 LSLISQLGVPKFSYCLTSIDDSKGI-STLLVGSEATVKSAIPTPLIQNPSRPSFYYLSLE 285
Query: 253 AISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS 311
ISV +L I + F +DD GG++IDSGT +T+L A+ AL+ E + +++ + S
Sbjct: 286 GISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTITYLKDSAFAALKKEFISQMKLDVDAS 345
Query: 312 YSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP-DAFCMAVNPASIN 370
S +LC+ P + FHF G L L K++ + C+ + +S
Sbjct: 346 GSTELELCFTLPPDGSPVDVPQLVFHFE-GVDLKLPKENYIIEDSALRVICLTMGSSS-- 402
Query: 371 NRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
S+ G QQ V +D+ ++ +F C L
Sbjct: 403 ----GMSIFGNFQQQNIVVLHDLEKETISFAPAQCNQL 436
>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
Length = 436
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 125/398 (31%), Positives = 186/398 (46%), Gaps = 29/398 (7%)
Query: 20 HRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMD 79
R+QR + RL L K S ++ + N F +N++IG P MD
Sbjct: 59 ERLQRAVKRGRLRLQRLSAKTAS---FEPSVEAPVHAGNGEFLMNLAIGTPAETYSAIMD 115
Query: 80 TGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYT 139
TGS L+W C PC+ C Q IF P +SSS++ +PC S+ C P + CS C Y
Sbjct: 116 TGSDLIWTQCKPCKVCFDQPTPIFDPEKSSSFSKLPCSSDLCVALPISSCS---DGCEYR 172
Query: 140 QLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFS---GIFGLGIGR 196
Y T ++TE TF + V + FGCG NR +S G+ GLG G
Sbjct: 173 YSYGDHSSTQGVLATETFTFGDAS-----VSKIGFGCG-EDNRGRAYSQGAGLVGLGRGP 226
Query: 197 SSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLE 252
SL+SQL FSYC+ S+ D + + L++G A + TPL YY++LE
Sbjct: 227 LSLISQLGVPKFSYCLTSIDDSKGI-STLLVGSEATVKSAIPTPLIQNPSRPSFYYLSLE 285
Query: 253 AISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS 311
ISV +L I + F +DD GG++IDSGT +T+L A+ AL+ E + +++ + S
Sbjct: 286 GISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTITYLKDNAFAALKKEFISQMKLDVDAS 345
Query: 312 YSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP-DAFCMAVNPASIN 370
S +LC+ P + FHF G L L K++ + C+ + +S
Sbjct: 346 GSTELELCFTLPPDGSPVEVPQLVFHFE-GVDLKLPKENYIIEDSALRVICLTMGSSS-- 402
Query: 371 NRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
S+ G QQ V +D+ ++ +F C L
Sbjct: 403 ----GMSIFGNFQQQNIVVLHDLEKETISFAPAQCNQL 436
>gi|356557010|ref|XP_003546811.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 437
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 127/421 (30%), Positives = 201/421 (47%), Gaps = 38/421 (9%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
LIHRDS LSP+ +P+ P+ R+ S +RL + + +N ++ ++P N
Sbjct: 36 LIHRDSPLSPFYDPSLTPSERITNAAFRSSSRLNRVSHFLDENNLPESLLIPEN----GE 91
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ + + IG PPV + DTGS L+WV C PC++C PQ +F P +SS++ CDS+
Sbjct: 92 YLMTLYIGTPPVERLAIADTGSDLIWVQCSPCQNCFPQDTPLFEPLKSSTFKAATCDSQP 151
Query: 121 CRYFPYARCSAYK-HRCIYTQLYLIGPETSVFVSTEQLTFKNT-DESTIHVQDVVFGCGF 178
C P ++ K +CIY+ Y T V TE L+F +T D T+ +FGCG
Sbjct: 152 CTSVPPSQRQCGKVGQCIYSYSYGDKSFTVGVVGTETLSFGSTGDAQTVSFPSSIFGCGV 211
Query: 179 STNRNFKFS--------GIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGA 230
N F S G S L Q+ FSYC+ L +KL G A
Sbjct: 212 YNNFTFHTSDKVTGLVGLGGGPLSLVSQLGPQIGYKFSYCL--LPFSSNSTSKLKFGSEA 269
Query: 231 IIDEGD--ATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDV 285
I+ +TPL Y++ LEA+++ +++ R D G ++IDSGT +
Sbjct: 270 IVTTNGVVSTPLIIKPLFPSFYFLNLEAVTIGQKVVPTG-----RTD--GNIIIDSGTVL 322
Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLA 345
T+L + Y + L E + +P K C+ D+ P + F F GA +A
Sbjct: 323 TYLEQTFYNNFVASLQEVLSVESAQDLPFPFKFCFP---YRDMT-IPVIAFQFT-GASVA 377
Query: 346 LEKDSMFYQPRP-DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
L+ ++ + + + C+AV P+S++ S+ G +AQ + V YD+ K+ +F D
Sbjct: 378 LQPKNLLIKLQDRNMLCLAVVPSSLS----GISIFGNVAQFDFQVVYDLEGKKVSFAPTD 433
Query: 405 C 405
C
Sbjct: 434 C 434
>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 443
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 128/413 (30%), Positives = 200/413 (48%), Gaps = 39/413 (9%)
Query: 28 ISIARLAYLQEKIRSHN-TYQAQILPSNDISNSLF----------------YVNISIGQP 70
++I L ++ I +HN + +++P N S LF + +SIG P
Sbjct: 10 LAILLLVFIFPSIEAHNGRFTVKLIPRNS-SQVLFNRITAQTPVSVHHYDYLMELSIGTP 68
Query: 71 PVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCS 130
PV + +DTGS L+W+ C PC +C QL +F P SS+Y+ + SE C CS
Sbjct: 69 PVKTYAQVDTGSDLIWLQCIPCTNCYKQLNPMFDPQSSSTYSNIAYGSESCSKLYSTSCS 128
Query: 131 AYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF--KFSG 188
++ C YT Y T ++ E LT +T + ++ V+FGCG + N F K G
Sbjct: 129 PDQNNCNYTYSYEDDSITEGVLAQETLTLTSTTGKPVALKGVIFGCGHNNNGVFNDKEMG 188
Query: 189 IFGLGIGRSSLVSQLNSS-----FSYCIGSLHDPDYLHNKLILGDGA-IIDEG-DATPLQ 241
I GLG G SLVSQ+ SS FS C+ H + + + G G+ ++ G +TPL
Sbjct: 189 IIGLGRGPLSLVSQIGSSFGGKMFSQCLVPFHTNPSITSPMSFGKGSEVLGNGVVSTPLV 248
Query: 242 FIDGH---YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRD 298
+ H Y++TL ISV+ L N + G ++IDSGT T L ++ Y L +
Sbjct: 249 SKNTHQAFYFVTLLGISVEDINLPFNDGSSLEPITKGNMVIDSGTPTTLLPEDFYHRLVE 308
Query: 299 EVMIRLEGEQMR-SYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP 357
EV ++ + + + +LCY ++LKG T+ HF GA + L +F +
Sbjct: 309 EVRNKVALDPIPIDPTLGYQLCYR--TPTNLKG-TTLTAHFE-GADVLLTPTQIFIPVQD 364
Query: 358 DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVLDD 410
FC A ++ +N Y + G AQ Y +G+D+ ++ +F+ DC L D
Sbjct: 365 GIFCFAFT-STFSNEY---GIYGNHAQSNYLIGFDLEKQLVSFKATDCTNLQD 413
>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 128/420 (30%), Positives = 195/420 (46%), Gaps = 51/420 (12%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
LIHRDS SP P +N + SI R + K NT Q+ ++P +
Sbjct: 32 LIHRDSSKSPLYQPTQNKYQHIVNAARRSINRANHFY-KTALTNTPQSTVIPDH----GE 86
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ + S+G PP + DTGS ++W+ C PC++C Q F PS+SS+Y +PC S+
Sbjct: 87 YLMTYSVGTPPFKLYGIADTGSDIVWLQCEPCKECYNQTTPKFKPSKSSTYKNIPCSSDL 146
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C+ +S + LT +++ I V GCG
Sbjct: 147 CKSGQQGN-----------------------LSVDTLTLESSTGHPISFPKTVIGCGTDN 183
Query: 181 NRNFK--FSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDE 234
+F+ SGI GLG G +SL++QL SS FSYC+ +KL GD A++
Sbjct: 184 TVSFEGASSGIVGLGGGPASLITQLGSSIDAKFSYCLLPNPVESNTTSKLNFGDTAVV-S 242
Query: 235 GD---ATPLQFIDG--HYYITLEAISVDGRMLDINPNIFKRDDSG---GGVMIDSGTDVT 286
GD +TP+ D YY+TLEA SV + ++ F+ +G G ++IDSGT +T
Sbjct: 243 GDGVVSTPIVKKDPIVFYYLTLEAFSVGNKRIE-----FEGSSNGGHEGNIIIDSGTTLT 297
Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
+ + Y L V+ ++ +++ + LCY ++SD FP + HF+ GA + L
Sbjct: 298 VIPTDVYNNLESAVLELVKLKRVNDPTRLFNLCYS--VTSDGYDFPIITTHFK-GADVKL 354
Query: 347 EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
S F C+A S S+ G +AQQ VGYD+ +K +F+ DC
Sbjct: 355 HPISTFVDVADGIVCLAFATTSAFIPSDVVSIFGNLAQQNLLVGYDLQQKIVSFKPTDCS 414
>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
Length = 542
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 117/414 (28%), Positives = 195/414 (47%), Gaps = 41/414 (9%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
LIHRDS SP+ +P++ A R+ S++R+ + + + Q++I+PS
Sbjct: 36 LIHRDSPHSPFFDPSKTQAERLTDAFRRSVSRVGRFRPTAMTSDGIQSRIVPSA----GE 91
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ +N+ IG PPVP +DTGS L W C PC C Q+ +F P SS+Y C +
Sbjct: 92 YLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPLFDPKNSSTYRDSSCGTSF 151
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C R + + +C + Y G T +++E LT +T + FGCG S+
Sbjct: 152 CLALGKDRSCSKEKKCTFRYSYADGSFTGGNLASETLTVDSTAGKPVSFPGFAFGCGHSS 211
Query: 181 NRNF--KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILG-DGAIID 233
F SGI GLG G SL+SQL S+ FSYC+ + + +++ G G +
Sbjct: 212 GGIFDKSSSGIVGLGGGELSLISQLKSTINGLFSYCLLPVSTDSSISSRINFGASGRVSG 271
Query: 234 EGD-ATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEA 292
G +TPL+ Y E G +++DSGT T+L +E
Sbjct: 272 YGTVSTPLRLPYKGYSKKTEV-------------------EEGNIIVDSGTTYTFLPQEF 312
Query: 293 YEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMF 352
Y L V ++G+++R + LCY+ ++++ P + HF+ A + L+ + F
Sbjct: 313 YSKLEKSVANSIKGKRVRDPNGIFSLCYN--TTAEINA-PIITAHFK-DANVELQPLNTF 368
Query: 353 YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
+ + D C V P S + ++G +AQ + VG+D+ +K+ ++ + E
Sbjct: 369 MRMQEDLVCFTVAPTS------DIGVLGNLAQVNFLVGFDLRKKRGFSKKAEVE 416
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 39/139 (28%), Positives = 72/139 (51%), Gaps = 9/139 (6%)
Query: 269 KRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDL 328
K + G +++DSGT T+L E Y L + V ++G+++R + LCY+ + D
Sbjct: 412 KAEVEEGNIIVDSGTTYTYLPLEFYVKLEESVAHSIKGKRVRDPNGISSLCYN--TTVDQ 469
Query: 329 KGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYN 388
P + HF+ A + L+ + F + + D C V P S + ++G +AQ +
Sbjct: 470 IDAPIITAHFK-DANVELQPWNTFLRMQEDLVCFTVLPTS------DIGILGNLAQVNFL 522
Query: 389 VGYDIGRKQKTFQRMDCEV 407
VG+D+ +K+ +F+ DC +
Sbjct: 523 VGFDLRKKRVSFKAADCTL 541
>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
Length = 449
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 132/438 (30%), Positives = 208/438 (47%), Gaps = 55/438 (12%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
LIHRDS LSP + PN + R+Q +I+R + H +Q +LPS
Sbjct: 31 LIHRDSPLSPLHTPNLTFSDRLQASFLRAISRQS-------RHVDFQTDLLPSG----GE 79
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ +N+SIG PP P DTGS L W+ PC C PQ G IF PS S+++ +PC +
Sbjct: 80 YMMNLSIGTPPFPILAIADTGSDLTWLQSKPCDQCYPQKGPIFDPSNSTTFHKLPCTTAP 139
Query: 121 CRYF-PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
C AR C YT Y T+ +++++ +T N +++ +++V FGCG
Sbjct: 140 CNALDESARSCTDPTTCGYTYSYGDHSYTTGYLASDTVTVGN---ASVQIRNVAFGCGTR 196
Query: 180 TNRNF--KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDP-------DYLHNKLIL 226
NF + SGI GLG G S VSQL + FSYC+ L + ++++
Sbjct: 197 NGGNFDEQGSGIVGLGGGNLSFVSQLGDTIGKKFSYCLLPLENEISSQPSDSPATSRIVF 256
Query: 227 GDGAIIDEGDATPLQFI---------DGHYYITLEAISVDGRMLDINPNIFKRD--DSG- 274
GD + + F +YY+T+EAI+V + L + + K DSG
Sbjct: 257 GDNPVFSSSSTNGVVFATTPLVNKEPSTYYYLTIEAITVGRKKLLYSSSSSKTASYDSGS 316
Query: 275 ------GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD-KLCYHGIMSSD 327
G ++IDSGT +T+L +E Y AL ++ ++ E++ LC+ +
Sbjct: 317 KSSVEEGNIIIDSGTTLTFLEEEFYGALEAALVEEIKMERVNDVKNSMFSLCFKS--GKE 374
Query: 328 LKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFY 387
P ++ HFRGGA + L+ + F + C + P + + + G +AQ +
Sbjct: 375 EVELPLMKVHFRGGADVELKPVNTFVRAEEGLVCFTMLPTN------DVGIYGNLAQMNF 428
Query: 388 NVGYDIGRKQKTFQRMDC 405
VGYD+G++ +F DC
Sbjct: 429 VVGYDLGKRTVSFLPADC 446
>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
Length = 516
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 123/359 (34%), Positives = 171/359 (47%), Gaps = 32/359 (8%)
Query: 67 IGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPY 126
IG P + +DTGS L+W C PC DC Q +F PS SS+YA VPC S C P
Sbjct: 173 IGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDLPT 232
Query: 127 ARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF 186
++C++ +C YT Y T ++TE T + + VVFGCG TN F
Sbjct: 233 SKCTS-ASKCGYTYTYGDSSSTQGVLATETFTLAKS-----KLPGVVFGCG-DTNEGDGF 285
Query: 187 S---GIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEG------- 235
S G+ GLG G SLVSQL FSYC+ SL D + ++ L+LG A I E
Sbjct: 286 SQGAGLVGLGRGPLSLVSQLGLDKFSYCLTSLDDTN--NSPLLLGSLAGISEASAAASSV 343
Query: 236 DATPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKE 291
TPL YY++L+AI+V + + + F +DD GGV++DSGT +T+L +
Sbjct: 344 QTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSITYLEVQ 403
Query: 292 AYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS-DLKGFPTVRFHFRGGAKLALEKDS 350
Y AL+ ++ LC+ D P + FHF GGA L L ++
Sbjct: 404 GYRALKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGADLDLPAEN 463
Query: 351 -MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
M A C+ V + +R S+IG QQ + YD+G +F + C L
Sbjct: 464 YMVLDGGSGALCLTV----MGSR--GLSIIGNFQQQNFQFVYDVGHDTLSFAPVQCNKL 516
>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 468
Score = 161 bits (408), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 118/368 (32%), Positives = 171/368 (46%), Gaps = 31/368 (8%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
N F +++SIG P + +DTGS L+W C PC +C Q +F PS SS+Y+ +PC
Sbjct: 115 NGEFLMDMSIGTPALAYAAIVDTGSDLVWTQCKPCVECFNQSTPVFDPSSSSTYSTLPCS 174
Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
S C P + C++ C YT Y T ++ E T T + V FGCG
Sbjct: 175 SSLCSDLPTSTCTSAAKDCGYTYTYGDASSTQGVLAAETFTLAKT-----KLPGVAFGCG 229
Query: 178 FSTNRNFKFS---GIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAII- 232
TN F+ G+ GLG G SLVSQL FSYC+ SL D + L+LG A I
Sbjct: 230 -DTNEGDGFTQGAGLVGLGRGPLSLVSQLGLGKFSYCLTSLDDTS--KSPLLLGSLAAIS 286
Query: 233 -DEGDATPLQFI--------DGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSG 282
D A +Q YY+TL+A++V + + + F +DD GGV++DSG
Sbjct: 287 TDTASAAAIQTTPLIKNPSQPSFYYVTLKALTVGSTRIPLPGSAFAVQDDGTGGVIVDSG 346
Query: 283 TDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS-DLKGFPTVRFHFRGG 341
T +T+L + Y L+ +++ + LC+ S D P + HF GG
Sbjct: 347 TSITYLELQGYRPLKKAFAAQMKLPVADGSAVGLDLCFKAPASGVDDVEVPKLVLHFDGG 406
Query: 342 AKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
A L L ++ M A C+ V + +R S+IG QQ YD+ + +F
Sbjct: 407 ADLDLPAENYMVLDSASGALCLTV----MGSR--GLSIIGNFQQQNIQFVYDVDKDTLSF 460
Query: 401 QRMDCEVL 408
+ C L
Sbjct: 461 APVQCAKL 468
>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 431
Score = 161 bits (407), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 127/418 (30%), Positives = 198/418 (47%), Gaps = 33/418 (7%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
+IHRDS SP+ E RV + S+ R + + N ++ P + +
Sbjct: 31 IIHRDSSRSPFYRATETQFQRVTNAVRRSMNRANHFNQISVYSNAVES---PVTLLDDGD 87
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ ++ S+G PP P + +DT S ++WV C C C +F PS S +Y +PC S
Sbjct: 88 YLMSYSLGTPPFPVYGIVDTASDIIWVQCQLCETCYNDTSPMFDPSYSKTYKNLPCSSTT 147
Query: 121 CRYFPYARCSAYKHR-CIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
C+ CS+ + + C +T Y G + + E +T + ++ +H V GC +
Sbjct: 148 CKSVQGTSCSSDERKICEHTVNYKDGSHSQGDLIVETVTLGSYNDPFVHFPRTVIGCIRN 207
Query: 180 TNRNFKFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAII--D 233
TN +F GI GLG G SLV QL+SS FSYC+ + D +KL GD A++ D
Sbjct: 208 TNVSFDSIGIVGLGGGPVSLVPQLSSSISKKFSYCLAPISDRS---SKLKFGDAAMVSGD 264
Query: 234 EGDATPLQFID--GHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKE 291
+T + F D YY+TLEA SV ++ + R G ++IDSGT T L +
Sbjct: 265 GTVSTRIVFKDWKKFYYLTLEAFSVGNNRIEFRSSS-SRSSGKGNIIIDSGTTFTVLPDD 323
Query: 292 AYEALRDEV--MIRLEGEQ--MRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALE 347
Y L V +++LE + ++ +S LCY + D P + HF GA + L
Sbjct: 324 VYSKLESAVADVVKLERAEDPLKQFS----LCYKS--TYDKVDVPVITAHF-SGADVKLN 376
Query: 348 KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ F C+A + + ++ G +AQQ + VGYD+ RK +F+ DC
Sbjct: 377 ALNTFIVASHRVVCLAFLSSQ------SGAIFGNLAQQNFLVGYDLQRKIVSFKPTDC 428
>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
Length = 452
Score = 161 bits (407), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 118/368 (32%), Positives = 170/368 (46%), Gaps = 29/368 (7%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
N F ++++IG P + +DTGS L+W C PC DC Q +F PS SS+YA VPC
Sbjct: 97 NGEFLMDVAIGTPALSYAAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCS 156
Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
S C P + C++ +C YT Y T +++E T + + V FGCG
Sbjct: 157 SALCSDLPTSTCTS-ASKCGYTYTYGDASSTQGVLASETFTLGKEKK---KLPGVAFGCG 212
Query: 178 FSTNRNFKFS---GIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIID 233
TN F+ G+ GLG G SLVSQL FSYC+ SL D D + L+LG A
Sbjct: 213 -DTNEGDGFTQGAGLVGLGRGPLSLVSQLGLDKFSYCLTSLDDGDG-KSPLLLGGSAAAI 270
Query: 234 EG-------DATPLQFIDGH---YYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSG 282
TPL YY++L ++V + + + F +DD GGV++DSG
Sbjct: 271 SESAATAPVQTTPLVKNPSQPSFYYVSLTGLTVGSTRITLPASAFAIQDDGTGGVIVDSG 330
Query: 283 TDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS-DLKGFPTVRFHFRGG 341
T +T+L + Y AL+ + ++ + LC+ G D P + HF GG
Sbjct: 331 TSITYLELQGYRALKKAFVAQMALPTVDGSEIGLDLCFQGPAKGVDEVQVPKLVLHFDGG 390
Query: 342 AKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
A L L ++ M A C+ V P+ S+IG QQ + YD+ +F
Sbjct: 391 ADLDLPAENYMVLDSASGALCLTVAPSR------GLSIIGNFQQQNFQFVYDVAGDTLSF 444
Query: 401 QRMDCEVL 408
+ C L
Sbjct: 445 APVQCNKL 452
>gi|297741705|emb|CBI32837.3| unnamed protein product [Vitis vinifera]
Length = 455
Score = 161 bits (407), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 136/410 (33%), Positives = 196/410 (47%), Gaps = 33/410 (8%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQ-AQILPSNDISN- 58
LIH S SPY N + + +++R AYL + R Q A +P I +
Sbjct: 47 LIHIHSPSSPYKNVKAESLAK-DTALESTLSRHAYL--RARQQKALQPADFVPPPLIRDK 103
Query: 59 SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
S F N+SIG PP + +DTGS L W+ C PC C Q I+ ++S SY + C+
Sbjct: 104 SAFLANLSIGNPPTNVYVVLDTGSDLFWIQCEPCDVCYKQKDPIYNRTKSDSYTEMLCNE 163
Query: 119 EHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
C + C+Y Y G TS +S E++ F + V FGCG
Sbjct: 164 PPCLSLGREGQCSDSGSCLYQTSYADGSRTSGLLSYEKVAFTSHYSDEDKTAQVGFGCGL 223
Query: 179 STNRNFKFSG----IFGLGIGRSSLVSQLNS------SFSYCIGSLHDPDYLHNKLILGD 228
N NF S + GLG G SLVSQL++ SF+YC G+L +P+ L+ GD
Sbjct: 224 Q-NLNFVTSSRDGGVLGLGPGLVSLVSQLSAIGKVSKSFAYCFGNLSNPN-AGGFLVFGD 281
Query: 229 GAIIDEGDATPLQFIDGHYYITLEAI--SVDGRMLDINPNIFKRD-DSGGGVMIDSGTDV 285
++ GD TP+ I YY+ L I V+ LDIN + F+R D GGV+IDSG+ +
Sbjct: 282 ATYLN-GDMTPM-VIAEFYYVNLLGIGLGVEEPRLDINSSSFERKPDGSGGVIIDSGSTL 339
Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRS--YSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAK 343
+ E YE +R+ V+ +L+ S S PD C+ G + DL FPT+ +
Sbjct: 340 SIFPPEVYEVVRNAVVDKLKKGYNISPLTSSPD--CFEGKIGRDLPLFPTLVLYLESTGI 397
Query: 344 LALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
L ++ S+F Q + FC+ S+IG +AQQ Y GY++
Sbjct: 398 LN-DRWSIFLQRYDELFCLGFTSGE------GLSIIGTLAQQSYKFGYNL 440
>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 161 bits (407), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 131/408 (32%), Positives = 195/408 (47%), Gaps = 27/408 (6%)
Query: 15 NENPAHRVQRGINISIARLAYLQEKIRSHNTY---QAQILPSNDISNSLFYVNISIGQPP 71
N RVQ GI +RL L + + ++ + Q+ N + + ++IG PP
Sbjct: 59 NLTKLERVQHGIKRGKSRLQKLNAMVLAASSTPDSEDQLEAPIHAGNGEYLIELAIGTPP 118
Query: 72 VPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSA 131
V +DTGS L+W C PC C Q IF P +SSS++ V C S C P + CS
Sbjct: 119 VSYPAVLDTGSDLIWTQCKPCTRCYKQPTPIFDPKKSSSFSKVSCGSSLCSALPSSTCS- 177
Query: 132 YKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF---SG 188
C Y Y T ++TE TF + ++ + V ++ FGCG N F SG
Sbjct: 178 --DGCEYVYSYGDYSMTQGVLATETFTFGKS-KNKVSVHNIGFGCG-EDNEGDGFEQASG 233
Query: 189 IFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNKLILGD-GAIIDEGD--ATPL---Q 241
+ GLG G SLVSQL FSYC+ + D + L+LG G + D + TPL
Sbjct: 234 LVGLGRGPLSLVSQLKEQRFSYCLTPIDDTK--ESVLLLGSLGKVKDAKEVVTTPLLKNP 291
Query: 242 FIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEV 300
YY++LEAISV L I + F+ DD GGV+IDSGT +T++ ++AYEAL+ E
Sbjct: 292 LQPSFYYLSLEAISVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTITYVQQKAYEALKKEF 351
Query: 301 MIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAF 360
+ + + ++ S LC+ S P + FHF+GG ++ M
Sbjct: 352 ISQTKLALDKTSSTGLDLCFSLPSGSTQVEIPKLVFHFKGGDLELPAENYMIGDSNLGVA 411
Query: 361 CMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
C+A+ +S S+ G + QQ V +D+ ++ +F C+ L
Sbjct: 412 CLAMGASS------GMSIFGNVQQQNILVNHDLEKETISFVPTSCDQL 453
>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 436
Score = 160 bits (406), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 124/398 (31%), Positives = 185/398 (46%), Gaps = 29/398 (7%)
Query: 20 HRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMD 79
R+QR + RL L K S +++ + N F + ++IG P MD
Sbjct: 59 ERLQRAMKRGKLRLQRLSAKTAS---FESSVEAPVHAGNGEFLMKLAIGTPAETYSAIMD 115
Query: 80 TGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYT 139
TGS L+W C PC+DC Q IF P +SSS++ +PC S+ C P + CS C Y
Sbjct: 116 TGSDLIWTQCKPCKDCFDQPTPIFDPKKSSSFSKLPCSSDLCAALPISSCS---DGCEYL 172
Query: 140 QLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFS---GIFGLGIGR 196
Y T ++TE F + V + FGCG N FS G+ GLG G
Sbjct: 173 YSYGDYSSTQGVLATETFAFGDA-----SVSKIGFGCG-EDNDGSGFSQGAGLVGLGRGP 226
Query: 197 SSLVSQLNS-SFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLE 252
SL+SQL FSYC+ S+ D + + L++G A + TPL YY++LE
Sbjct: 227 LSLISQLGEPKFSYCLTSMDDSKGI-SSLLVGSEATMKNAITTPLIQNPSQPSFYYLSLE 285
Query: 253 AISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS 311
ISV +L I + F ++D GG++IDSGT +T+L A+ AL+ E + +L+ + S
Sbjct: 286 GISVGDTLLPIEKSTFSIQNDGSGGLIIDSGTTITYLEDSAFAALKKEFISQLKLDVDES 345
Query: 312 YSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS-MFYQPRPDAFCMAVNPASIN 370
S LC+ + P + FHF GA L L ++ + C+ + +S
Sbjct: 346 GSTGLDLCFTLPPDASTVDVPQLVFHFE-GADLKLPAENYIIADSGLGVICLTMGSSS-- 402
Query: 371 NRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
S+ G QQ V +D+ ++ +F C L
Sbjct: 403 ----GMSIFGNFQQQNIVVLHDLEKETISFAPAQCNQL 436
>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 437
Score = 160 bits (406), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 124/422 (29%), Positives = 196/422 (46%), Gaps = 34/422 (8%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
LIH S SP+ N E+ R+ + S R+ YL P N + N +
Sbjct: 30 LIHPISSKSPFYNTAESHFQRMSNNMKHSTNRVHYLNHVFS---------FPPNKVPNIV 80
Query: 61 --------FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYA 112
+ ++ IG PP + MDT + +W C PC+ C +F PS+SS+Y
Sbjct: 81 VSPFMGDGYIISFLIGTPPFQLYGVMDTANDNIWFQCNPCKPCFNTTSPMFDPSKSSTYK 140
Query: 113 YVPCDSEHCRYFPYARCSA-YKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
+PC S C+ CS+ K C Y+ Y + +S + LT + +++ I ++
Sbjct: 141 TIPCSSPKCKNVENTHCSSDDKKVCEYSFTYGGEAYSQGDLSIDTLTLNSNNDTPISFKN 200
Query: 172 VVFGCGFSTNRNFK--FSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLI 225
+V GCG + SG GLG G S +SQLNSS FSYC+ L + + KL
Sbjct: 201 IVIGCGHRNKGPLEGYVSGNIGLGRGPLSFISQLNSSIGGKFSYCLVPLFSNEGISGKLH 260
Query: 226 LGDGAIIDEGD--ATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGT 283
GD +++ +TP+ + Y TL A+SV ++ N ++D+ G +IDSGT
Sbjct: 261 FGDKSVVSGVGTVSTPITAGEIGYSTTLNALSVGDHIIKFE-NSTSKNDNLGNTIIDSGT 319
Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAK 343
+T L + Y L V ++ E+ +S + KLCY + + P + HF GA
Sbjct: 320 TLTILPENVYSRLESIVTSMVKLERAKSPNQQFKLCYKATLKN--LDVPIITAHFN-GAD 376
Query: 344 LALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
+ L + FY + C A S+ N ++IG +AQQ + VG+D+ + +F+
Sbjct: 377 VHLNSLNTFYPIDHEVVCFAF--VSVGN--FPGTIIGNIAQQNFLVGFDLQKNIISFKPT 432
Query: 404 DC 405
DC
Sbjct: 433 DC 434
>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 441
Score = 160 bits (405), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 116/405 (28%), Positives = 189/405 (46%), Gaps = 29/405 (7%)
Query: 22 VQRGINISIARLAYLQEKIRSHNTY----QAQILPSNDISNSLFYVNISIGQPPVPQFTA 77
+ R I S AR+A LQ A++L + S+ + V+++IG PP+
Sbjct: 48 LSRAIARSKARVAALQSAAVLPPVVDPITAARVLVT--ASSGEYLVDLAIGTPPLYYTAI 105
Query: 78 MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCI 137
MDTGS L+W C PC C+ Q F +S++Y +PC S C C +K C+
Sbjct: 106 MDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRCASLSSPSC--FKKMCV 163
Query: 138 YTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG-FSTNRNFKFSGIFGLGIGR 196
Y Y T+ ++ E TF + + + ++ FGCG + SG+ G G G
Sbjct: 164 YQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLNAGDLANSSGMVGFGRGP 223
Query: 197 SSLVSQLNSS-FSYCIGSL--HDPDYLHNKLILGDGAIIDEGDATPLQ--------FIDG 245
SLVSQL S FSYC+ S P L+ + + + + +P+Q +
Sbjct: 224 LSLVSQLGPSRFSYCLTSYLSATPSRLYFG-VYANLSSTNTSSGSPVQSTPFVINPALPN 282
Query: 246 HYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRL 304
Y+++L+AIS+ ++L I+P +F +D G GGV+IDSGT +TWL ++AYEA+R ++ +
Sbjct: 283 MYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVSAI 342
Query: 305 EGEQMRSYSWPDKLCYHGIMSSDLK-GFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMA 363
M C+ ++ P + FHF L ++ M C+
Sbjct: 343 PLPAMNDTDIGLDTCFQWPPPPNVTVTVPDLVFHFDSANMTLLPENYMLIASTTGYLCLV 402
Query: 364 VNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
+ P + ++IG QQ ++ YDIG +F C+++
Sbjct: 403 MAPTGVG------TIIGNYQQQNLHLLYDIGNSFLSFVPAPCDII 441
>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 444
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 116/421 (27%), Positives = 188/421 (44%), Gaps = 32/421 (7%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
I RDS SP+ NP+E R+Q+ SI R + + S N Q+ ++
Sbjct: 38 FISRDSPHSPFYNPSETKYQRLQKAFRRSILRGNHFRAMRASPNDIQSDVISGG----GA 93
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ +NIS+G PPVP DTGS L+W C PC +C Q+ +F P S +Y + CD+E
Sbjct: 94 YLMNISLGTPPVPMLGIADTGSDLIWRQCLPCPNCYEQVEPLFDPKESETYKTLDCDNEF 153
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C+ + C Y+ Y T +S++ LT +T+ + FGCG
Sbjct: 154 CQDLGQQGSCDDDNTCTYSYSYGDRSYTRGDLSSDTLTIGSTEGDPASFPGIAFGCGHDN 213
Query: 181 NRNFKFSGIFGLGIGRS------SLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
F +G+G L S++ FSYC+ L + +K+ G ++
Sbjct: 214 GGTFNEKDGGLIGLGGGPLSLVMQLSSEVGGQFSYCLVPLSSDSTVSSKINFGKSGVVSG 273
Query: 235 GDATPLQFIDG----HYYITLEAISVDGRMLDINPNIFKRDDSG------GGVMIDSGTD 284
I G YY+TLE +SV + F + S G ++IDSGT
Sbjct: 274 SGTVSTPLIKGTPDTFYYLTLEGLSVGSETVAFKG--FSENKSSPAAVEEGNIIIDSGTT 331
Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKL 344
+T L ++ Y + + + G+ + LCY + + ++ PT+ HF GA +
Sbjct: 332 LTLLPQDFYTDVESALTNAIGGQTTTDPNGIFSLCYSSVNNLEI---PTITAHFT-GADV 387
Query: 345 ALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
L + F Q + D C ++ P+S N ++ G +AQ + VGYD+ + +F++ D
Sbjct: 388 QLPPLNTFVQVQEDLVCFSMIPSS------NLAIFGNLAQINFLVGYDLKNNKVSFKQTD 441
Query: 405 C 405
C
Sbjct: 442 C 442
>gi|356546376|ref|XP_003541602.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 450
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 128/421 (30%), Positives = 189/421 (44%), Gaps = 28/421 (6%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEK--IRSHNTYQAQILPSNDISN 58
+IHRDS SP E P RV + SI R + +K + S NT ++ + S
Sbjct: 39 MIHRDSSRSPLYRHTETPFQRVANAMRRSINRANHFNKKSFVASTNTAESTV----KASQ 94
Query: 59 SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
+ ++ S+G PP +DTGS + W+ C C DC Q IF PS+S +Y +PC S
Sbjct: 95 GEYLMSYSVGTPPFEILGVVDTGSGITWMQCQRCEDCYEQTTPIFDPSKSKTYKTLPCSS 154
Query: 119 EHCR-YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
C+ CS+ K C YT Y G + +S E LT +T+ S++ + V GCG
Sbjct: 155 NMCQSVISTPSCSSDKIGCKYTIKYGDGSHSQGDLSVETLTLGSTNGSSVQFPNTVIGCG 214
Query: 178 FSTNRNFK------FSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAI 231
+ F+ G S L S + FSYC+ + +KL GD A+
Sbjct: 215 HNNKGTFQGEGSGVVGLGGGPVSLISQLSSSIGGKFSYCLAPMFSQSNSSSKLNFGDAAV 274
Query: 232 IDEGDA--TPLQFIDGH---YYITLEAISVDGRMLDI--NPNIFKRDDSGGGVMIDSGTD 284
+ A TPL G YY+TLEA SV + ++ + + G ++IDSGT
Sbjct: 275 VSGLGAVSTPLVSKTGSEVFYYLTLEAFSVGDKRIEFVGGSSSSGSSNGEGNIIIDSGTT 334
Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKL 344
+T L +E Y L V ++ ++ S LCY S L P + HF+ GA +
Sbjct: 335 LTLLPQEDYSNLESAVADAIQANRVSDPSNFLSLCYQTTPSGQLD-VPVITAHFK-GADV 392
Query: 345 ALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
L S F Q C A + + + S+ G +AQ VGYD+ + +F+ D
Sbjct: 393 ELNPISTFVQVAEGVVCFAFHSSEV------VSIFGNLAQLNLLVGYDLMEQTVSFKPTD 446
Query: 405 C 405
C
Sbjct: 447 C 447
>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 130/407 (31%), Positives = 194/407 (47%), Gaps = 26/407 (6%)
Query: 15 NENPAHRVQRGINISIARLAYLQEKIRSHNTYQA--QILPSNDISNSLFYVNISIGQPPV 72
N RVQ GI +RL L + + +T + Q+ N + + ++IG PPV
Sbjct: 60 NLTKLERVQHGIKRGKSRLQRLNAMVLAASTLDSEDQLEAPIHAGNGEYLMELAIGTPPV 119
Query: 73 PQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAY 132
+DTGS L+W C PC C Q IF P +SSS++ V C S C P + CS
Sbjct: 120 SYPAVLDTGSDLIWTQCKPCTQCYKQPTPIFDPKKSSSFSKVSCGSSLCSAVPSSTCS-- 177
Query: 133 KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF---SGI 189
C Y Y T ++TE TF + ++ + V ++ FGCG N F SG+
Sbjct: 178 -DGCEYVYSYGDYSMTQGVLATETFTFGKS-KNKVSVHNIGFGCG-EDNEGDGFEQASGL 234
Query: 190 FGLGIGRSSLVSQLNS-SFSYCIGSLHDPDYLHNKLILGD-GAIIDEGD--ATPL---QF 242
GLG G SLVSQL FSYC+ + D + L+LG G + D + TPL
Sbjct: 235 VGLGRGPLSLVSQLKEPRFSYCLTPMDDTK--ESILLLGSLGKVKDAKEVVTTPLLKNPL 292
Query: 243 IDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVM 301
YY++LE ISV L I + F+ DD GGV+IDSGT +T++ ++A+EAL+ E +
Sbjct: 293 QPSFYYLSLEGISVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTITYIEQKAFEALKKEFI 352
Query: 302 IRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFC 361
+ + ++ S LC+ S P + FHF+GG ++ M C
Sbjct: 353 SQTKLPLDKTSSTGLDLCFSLPSGSTQVEIPKIVFHFKGGDLELPAENYMIGDSNLGVAC 412
Query: 362 MAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
+A+ +S S+ G + QQ V +D+ ++ +F C+ L
Sbjct: 413 LAMGASS------GMSIFGNVQQQNILVNHDLEKETISFVPTSCDQL 453
>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 436
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 135/432 (31%), Positives = 200/432 (46%), Gaps = 56/432 (12%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISN-S 59
LIHRDS LSP+ P+ P+ R+ IN ++ R Y + + + + L I N
Sbjct: 33 LIHRDSPLSPFYKPSLTPSDRI---INTAL-RSIYQLNRASHSDLNEKKTLERVRIPNHG 88
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSE 119
+ + IG PPV + DT S L+WV C PC C PQ +F P +SS++A + CDS+
Sbjct: 89 EYLMRFYIGTPPVERLAIADTASDLIWVQCSPCETCFPQDTPLFEPHKSSTFANLSCDSQ 148
Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
C C + C+YT Y G T + TE + F + T+ +FGCG
Sbjct: 149 PCTSSNIYYCPLVGNLCLYTNTYGDGSSTKGVLCTESIHFGS---QTVTFPKTIFGCG-- 203
Query: 180 TNRNF------KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKL----- 224
+N +F K +GI GLG G SLVSQL FSYC+ + K
Sbjct: 204 SNNDFMHQISNKVTGIVGLGAGPLSLVSQLGDQIGHKFSYCLLPFTSTSTIKLKFGNDTT 263
Query: 225 ILGDGAIIDEGDATPLQFIDGH----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
I G+G + +TPL ID H Y++ L I++ +ML + D + G ++ID
Sbjct: 264 ITGNGVV-----STPL-IIDPHYPSYYFLHLVGITIGQKMLQVR----TTDHTNGNIIID 313
Query: 281 SGTDVTWLVKEAYE----ALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRF 336
GT +T+L Y LR+ + I E +P C+ + FP + F
Sbjct: 314 LGTVLTYLEVNFYHNFVTLLREALGI---SETKDDIPYPFDFCFPNQANIT---FPKIVF 367
Query: 337 HFRGGAKLALEKDSMFYQ-PRPDAFCMAVNPASINNRYVN-FSLIGMMAQQFYNVGYDIG 394
F GAK+ L ++F++ + C+AV P + Y FS+ G +AQ + V YD
Sbjct: 368 QFT-GAKVFLSPKNLFFRFDDLNMICLAVLP----DFYAKGFSVFGNLAQVDFQVEYDRK 422
Query: 395 RKQKTFQRMDCE 406
K+ +F DC
Sbjct: 423 GKKVSFAPADCS 434
>gi|147858841|emb|CAN78694.1| hypothetical protein VITISV_037475 [Vitis vinifera]
Length = 442
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 135/410 (32%), Positives = 195/410 (47%), Gaps = 33/410 (8%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQ-AQILPSNDISN- 58
LIH S SPY N + + +++R AYL + R Q A +P I +
Sbjct: 34 LIHIHSPSSPYKNVKAESLAK-DTALESTLSRHAYL--RARQQKALQPADFVPPPLIRDK 90
Query: 59 SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
S F N+SIG PP + +DTGS L W+ C PC C Q I+ ++S SY + C+
Sbjct: 91 SAFLANLSIGNPPTNVYVVLDTGSDLFWIQCEPCDVCYKQKDPIYNRTKSDSYTEMLCNE 150
Query: 119 EHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
C + C+Y Y G TS +S E++ F + V FGCG
Sbjct: 151 PPCVSLGREGQCSDSGSCLYQTAYADGARTSGLLSYEKVAFTSHYSDEDKTAQVGFGCGL 210
Query: 179 STNRNFKFSG----IFGLGIGRSSLVSQLNS------SFSYCIGSLHDPDYLHNKLILGD 228
N NF S + GLG G SLVSQL++ SF+YC G++ +P+ L+ GD
Sbjct: 211 Q-NLNFITSNRDGGVLGLGPGLVSLVSQLSAIGKVSKSFAYCFGNISNPN-AGGFLVFGD 268
Query: 229 GAIIDEGDATPLQFIDGHYYITLEAI--SVDGRMLDINPNIFKRD-DSGGGVMIDSGTDV 285
++ GD TP+ I YY+ L I V LDIN + F+R D GGV+IDSG+ +
Sbjct: 269 ATYLN-GDMTPM-VIAEFYYVNLLGIGLGVGEPRLDINSSSFERKPDGSGGVIIDSGSTL 326
Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRS--YSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAK 343
+ E YE +R+ V+ +L+ S S PD C+ G + DL FPT+ +
Sbjct: 327 SVFPPEVYEVVRNAVVDKLKKGYNISPLTSSPD--CFEGKIERDLPLFPTLVLYLESTGI 384
Query: 344 LALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
L ++ S+F Q + FC+ S+IG +AQQ Y GY++
Sbjct: 385 LN-DRWSIFLQRYDELFCLGFTSGE------GLSIIGTLAQQSYKFGYNL 427
>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 418
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 125/417 (29%), Positives = 200/417 (47%), Gaps = 45/417 (10%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
L HRDS+LSP + + R+ S++R A L + + Q
Sbjct: 34 LFHRDSLLSPLEFSSLSHYDRLANAFRRSLSRSAALLNRAATSGAVGLQ----------- 82
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ IG PPV DTGS L W C PC C QL IF P +S+S+++VPC+++
Sbjct: 83 ---SSIIGTPPVDYLGIADTGSDLTWAQCLPCLKCYQQLRPIFNPLKSTSFSHVPCNTQT 139
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C C + C Y+ Y G T S L F+ + V+ V+ GCG ++
Sbjct: 140 CHAVDDGHC-GVQGVCDYSYTY--GDRT---YSKGDLGFEKITIGSSSVKSVI-GCGHAS 192
Query: 181 NRNFKF-SGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAIID 233
+ F F SG+ GLG G+ SLVSQ++ + FSYC+ +L + + K+ G A++
Sbjct: 193 SGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLL--SHANGKINFGQNAVVS 250
Query: 234 EGD--ATPL--QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLV 289
+TPL + +YYITLEAIS+ F + G V+IDSGT +++L
Sbjct: 251 GPGVVSTPLISKNTVTYYYITLEAISIGNE----RHMAFAKQ---GNVIIDSGTTLSFLP 303
Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPDKLCY-HGIMSSDLKGFPTVRFHFRGGAKLALEK 348
KE Y+ + ++ ++ ++++ LC+ GI + G P + F GGA + L
Sbjct: 304 KELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPIITAQFSGGANVNLLP 363
Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ F + + C+ + PAS + F +IG +A + +GYD+ K+ +F+ C
Sbjct: 364 VNTFQKVANNVNCLTLTPASPTDE---FGIIGNLALANFLIGYDLEAKRLSFKPTVC 417
>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 122/419 (29%), Positives = 203/419 (48%), Gaps = 37/419 (8%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTY--QAQILPSNDISN 58
L HRDS+LSP + + R+ S++R A L + ++ QA + P +
Sbjct: 34 LFHRDSLLSPLEFSSLSHYDRLTNAFRRSLSRSATLLNRAATNGALDLQAPLTPGS---- 89
Query: 59 SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
+ +++SIG PPV DTGS L+W C PC C Q IF P +S+S+++VPC+S
Sbjct: 90 GEYLMSVSIGTPPVDYIGMADTGSDLMWAQCLPCLKCYKQSRPIFDPLKSTSFSHVPCNS 149
Query: 119 EHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
++C+ + C A + C Y+ Y G +T + L F+ + V+ V+ GCG
Sbjct: 150 QNCKAIDDSHCGA-QGVCDYSYTY--GDQT---YTKGDLGFEKITIGSSSVKSVI-GCGH 202
Query: 179 S-TNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAI 231
SG+ GLG G+ SLVSQ++ + FSYC+ +L + + K+ G A+
Sbjct: 203 ESGGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLL--SHANGKINFGQNAV 260
Query: 232 IDEGDATPLQFIDG----HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
+ I +YY+TLEAIS+ G V+IDSGT +++
Sbjct: 261 VSGPGVVSTPLISKNPVTYYYVTLEAISIGNER-------HMASAKQGNVIIDSGTTLSF 313
Query: 288 LVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCY-HGIMSSDLKGFPTVRFHFRGGAKLAL 346
L KE Y+ + ++ ++ ++++ LC+ GI + G P + F GGA + L
Sbjct: 314 LPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPIITAQFSGGANVNL 373
Query: 347 EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ F + + C+ + PAS + F +IG +A + +GYD+ K+ +F+ C
Sbjct: 374 LPVNTFQKVANNVNCLTLTPASPTDE---FGIIGNLALANFLIGYDLEAKRLSFKPTVC 429
>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 439
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 137/421 (32%), Positives = 203/421 (48%), Gaps = 33/421 (7%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSH---NTYQAQILPSNDIS 57
LI+RDS SP+ NP E P R+ + S++R+ + S +T Q+++ IS
Sbjct: 33 LINRDSPKSPFYNPRETPTQRIVSAVRRSMSRVHHFSPTKNSDIFTDTAQSEM-----IS 87
Query: 58 NSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPC 116
N Y+ S+G P DTGS L+W C PC C Q +F P SS+Y + C
Sbjct: 88 NQGEYLMKFSLGTPAFDILAIADTGSDLIWTQCKPCDQCYEQDAPLFDPKSSSTYRDISC 147
Query: 117 DSEHCRYFPY-ARCSAYKHR-CIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
++ C A CS ++ C Y+ Y TS V+ + +T +T + + +
Sbjct: 148 STKQCDLLKEGASCSGEGNKTCHYSYSYGDRSFTSGNVAADTITLGSTSGRPVLLPKAII 207
Query: 175 GCGFSTNRNF--KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGD 228
GCG + +F K SGI GLG G SL+SQL S+ FSYC+ L +KL G
Sbjct: 208 GCGHNNGGSFTEKGSGIVGLGGGPISLISQLGSTIDGKFSYCLVPLSSNATNSSKLNFGS 267
Query: 229 GAIIDEG--DATPL--QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
I+ G +TPL + D Y++TLEA+SV + + F S G ++IDSGT
Sbjct: 268 NGIVSGGGVQSTPLISKDPDTFYFLTLEAVSVGSERIKFPGSSFGT--SEGNIIIDSGTT 325
Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKL 344
+T ++ + L V + G + S LCY + +DLK FP++ HF GA +
Sbjct: 326 LTLFPEDFFSELSSAVQDAVAGTPVEDPSGILSLCYS--IDADLK-FPSITAHFD-GADV 381
Query: 345 ALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
L + F Q C A NP IN+ ++ G +AQ + VGYD+ K +F+ D
Sbjct: 382 KLNPLNTFVQVSDTVLCFAFNP--INSG----AIFGNLAQMNFLVGYDLEGKTVSFKPTD 435
Query: 405 C 405
C
Sbjct: 436 C 436
>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
Length = 464
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 128/427 (29%), Positives = 195/427 (45%), Gaps = 51/427 (11%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNT------YQAQILPSN 54
L+HRD+I S P+ H+V + AR+ +L++++ + + ++++P
Sbjct: 67 LVHRDAI-SGATYPSRR--HQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVVPGV 123
Query: 55 DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYV 114
D + ++V + +G PP Q+ +D+GS ++WV C PC C Q +F P+ SSS++ V
Sbjct: 124 DDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGV 183
Query: 115 PCDSEHCRYFP--YARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
C S CR +C Y+ Y G T ++ E LT T VQ V
Sbjct: 184 SCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGT-----AVQGV 238
Query: 173 VFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILG 227
GCG + F +G+ GLG G SLV QL + FSYC+ S L+LG
Sbjct: 239 AIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASRGAGG--AGSLVLG 296
Query: 228 DGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVT 286
+ G + YY+ L I V G L + ++F+ +D GGV++D+GT VT
Sbjct: 297 RTEAVPRG-----RRASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAVT 351
Query: 287 WLVKEAYEALR---DEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----PTVRFHF 338
L +EAY ALR D M L + + S D CY DL G+ PTV F+F
Sbjct: 352 RLPREAYAALRGAFDGAMGAL--PRSPAVSLLDT-CY------DLSGYASVRVPTVSFYF 402
Query: 339 RGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQK 398
GA L L ++ + FC+A P+S S++G + Q+ + D
Sbjct: 403 DQGAVLTLPARNLLVEVGGAVFCLAFAPSS-----SGISILGNIQQEGIQITVDSANGYV 457
Query: 399 TFQRMDC 405
F C
Sbjct: 458 GFGPNTC 464
>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
Length = 488
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 118/361 (32%), Positives = 166/361 (45%), Gaps = 34/361 (9%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ + +G P + +DTGS ++W+ C PC C Q +F P++S S+A +PC S
Sbjct: 145 YFTRLGVGTPARYVYMVLDTGSDIVWIQCAPCIKCYSQTDPVFDPTKSRSFANIPCGSPL 204
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
CR Y CS K C+Y Y G T STE LTF+ T V VV GCG
Sbjct: 205 CRRLDYPGCSTKKQICLYQVSYGDGSFTVGEFSTETLTFRGT-----RVGRVVLGCGHDN 259
Query: 181 NRNFKFSGI----------FGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGA 230
F + F IGR + NS FSYC+G + ++ GD A
Sbjct: 260 EGLFVGAAGLLGLGRGRLSFPSQIGR-----RFNSKFSYCLGD-RSASSRPSSIVFGDSA 313
Query: 231 IIDEGDATPL---QFIDGHYYITLEAISVDG-RMLDINPNIFKRDDSG-GGVMIDSGTDV 285
I TPL +D YY+ L ISV G R+ I+ ++FK D +G GGV+IDSGT V
Sbjct: 314 ISRTTRFTPLLSNPKLDTFYYVELLGISVGGTRVSGISASLFKLDSTGNGGVIIDSGTSV 373
Query: 286 TWLVKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKL 344
T L + AY ALRD ++ ++ +S D C+ +++K PTV HFRG
Sbjct: 374 TRLTRAAYVALRDAFLVGASNLKRAPEFSLFDT-CFDLSGKTEVK-VPTVVLHFRGADVP 431
Query: 345 ALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
+ + +FC A S+IG + QQ + V YD+ + F
Sbjct: 432 LPASNYLIPVDNSGSFCFA-----FAGTASGLSIIGNIQQQGFRVVYDLATSRVGFAPRG 486
Query: 405 C 405
C
Sbjct: 487 C 487
>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags:
Precursor
gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 447
Score = 158 bits (400), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 132/434 (30%), Positives = 197/434 (45%), Gaps = 45/434 (10%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
LIHRDS LSP NP R+ S++R ++ S Q+ ++ ++
Sbjct: 30 LIHRDSPLSPIYNPQITVTDRLNAAFLRSVSRSRRFNHQL-SQTDLQSGLIGAD----GE 84
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
F+++I+IG PP+ F DTGS L WV C PC+ C + G IF +SS+Y PCDS +
Sbjct: 85 FFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYKENGPIFDKKKSSTYKSEPCDSRN 144
Query: 121 CRYFPYAR--CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
C+ C + C Y Y + V+TE ++ + S + VFGCG+
Sbjct: 145 CQALSSTERGCDESNNICKYRYSYGDQSFSKGDVATETVSIDSASGSPVSFPGTVFGCGY 204
Query: 179 STNRNFKFSGIFGLGIGRS--SLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAI- 231
+ F +G +G+G SL+SQL SS FSYC+ + + LG +I
Sbjct: 205 NNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCLSHKSATTNGTSVINLGTNSIP 264
Query: 232 ----IDEG-------DATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG------ 274
D G D PL + YY+TLEAISV + + + + +D G
Sbjct: 265 SSLSKDSGVVSTPLVDKEPLTY----YYLTLEAISVGKKKIPYTGSSYNPNDDGILSETS 320
Query: 275 GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLK-GFPT 333
G ++IDSGT +T L ++ V + G + S P L H S + G P
Sbjct: 321 GNIIIDSGTTLTLLEAGFFDKFSSAVEESVTG--AKRVSDPQGLLSHCFKSGSAEIGLPE 378
Query: 334 VRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
+ HF GA + L + F + D C+++ P + Y NF AQ + VGYD+
Sbjct: 379 ITVHFT-GADVRLSPINAFVKLSEDMVCLSMVPTTEVAIYGNF------AQMDFLVGYDL 431
Query: 394 GRKQKTFQRMDCEV 407
+ +FQ MDC
Sbjct: 432 ETRTVSFQHMDCSA 445
>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 374
Score = 158 bits (400), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 117/368 (31%), Positives = 171/368 (46%), Gaps = 45/368 (12%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ + +SIG PP + DTGS L W C PC C Q IF P +S+SY + CDS+
Sbjct: 25 YLMEVSIGTPPFKIYGIADTGSDLTWTSCVPCNKCYKQRNPIFDPQKSTSYRNISCDSKL 84
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C CS KH C YT Y T ++ E +T +T ++ ++ +VFGCG +
Sbjct: 85 CHKLDTGVCSPQKH-CNYTYAYASAAITQGVLAQETITLSSTKGESVPLKGIVFGCGHNN 143
Query: 181 NRNF--KFSGIFGLGIGRSSLVSQLNSS-----FSYCIGSLHDPDYLHNKLILGDGA--- 230
F + GI GLG G S +SQ+ SS FS C+ H + +K+ LG G+
Sbjct: 144 TGGFNDREMGIIGLGGGPVSFISQIGSSFGGKRFSQCLVPFHTDVSVSSKMSLGKGSEVS 203
Query: 231 --------IIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSG 282
++ + D TP Y++TL ISV L N + + + G V +DSG
Sbjct: 204 GKGVVSTPLVAKQDKTP-------YFVTLLGISVGNTYLHFNGSSSQSVEK-GNVFLDSG 255
Query: 283 TDVTWLVKEAYEAL----RDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHF 338
T T L + Y+ L R EV ++ +LCY ++L+G P + HF
Sbjct: 256 TPPTILPTQLYDRLVAQVRSEVAMK---PVTNDLDLGPQLCYR--TKNNLRG-PVLTAHF 309
Query: 339 RGGAKLALEKDSMFYQPRPDAFCMA-VNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
GG + L F P+ FC+ N +S Y NF AQ Y +G+D+ R+
Sbjct: 310 EGG-DVKLLPTQTFVSPKDGVFCLGFTNTSSDGGVYGNF------AQSNYLIGFDLDRQV 362
Query: 398 KTFQRMDC 405
+F+ MDC
Sbjct: 363 VSFKPMDC 370
>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
Length = 447
Score = 158 bits (400), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 116/362 (32%), Positives = 171/362 (47%), Gaps = 21/362 (5%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ + ++IG PPVP DTGS L W C PC+ C PQ I+ + SSS++ VPC S
Sbjct: 93 YLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPIYDTAVSSSFSPVPCASAT 152
Query: 121 CRYFPYAR-CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF- 178
C +R C+A C Y Y G ++ + TE LTF + V + FGCG
Sbjct: 153 CLPIWSSRNCTASSSPCRYRYAYGDGAYSAGVLGTETLTFPGAPG--VSVGGIAFGCGVD 210
Query: 179 STNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCI-----GSLHDPDYLHNKLILGDGAII 232
+ ++ +G GLG G SLV+QL FSYC+ SL P L +
Sbjct: 211 NGGLSYNSTGTVGLGRGSLSLVAQLGVGKFSYCLTDFFNTSLGSPVLFGALAELAAPSTG 270
Query: 233 DEGDATPL---QFIDGHYYITLEAISV-DGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
+TPL ++ YY++LE IS+ D R+ N RDD GG+++DSGT T+L
Sbjct: 271 AAVQSTPLVQSPYVPTWYYVSLEGISLGDARLPIPNGTFDLRDDGSGGMIVDSGTTFTFL 330
Query: 289 VKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS-DLKGFPTVRFHFRGGAKLALE 347
V+ A+ + D V L + + S D C+ L P + HF GGA + L
Sbjct: 331 VESAFRVVVDHVAGVLRQPVVNASSL-DSPCFPAATGEQQLPAMPDMVLHFAGGADMRLH 389
Query: 348 KDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
+D+ M + +FC+ + + + S++G QQ + +DI Q +F DC
Sbjct: 390 RDNYMSFNQEESSFCLNI----AGSPSADVSILGNFQQQNIQMLFDITVGQLSFMPTDCG 445
Query: 407 VL 408
L
Sbjct: 446 KL 447
>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 443
Score = 158 bits (399), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 130/419 (31%), Positives = 196/419 (46%), Gaps = 50/419 (11%)
Query: 19 AHRVQRGINISIARLAYLQE---KIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQF 75
A + R + S AR+A LQ + A+IL S + +++ IG PP
Sbjct: 46 AQLLSRAVRRSKARVAALQSLATTTAADAITVARIL--VLASEGEYLMSMGIGTPPRYYS 103
Query: 76 TAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHR 135
+DTGS L+W C PC C Q F P++S SYA +PC+S C Y C Y++
Sbjct: 104 AILDTGSDLIWTQCAPCMLCVDQPTPFFDPAQSPSYAKLPCNSPMCNALYYPLC--YRNV 161
Query: 136 CIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG-FSTNRNFKFSGIFGLGI 194
C+Y Y T+ +S E TF T+++ + V + FGCG + F SG+ G G
Sbjct: 162 CVYQYFYGDSANTAGVLSNETFTF-GTNDTRVTVPRIAFGCGNLNAGSLFNGSGMVGFGR 220
Query: 195 GRSSLVSQLNSS-FSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT---PLQ---FI---- 243
G SLVSQL S FSYC+ S P + ++L G A ++ A+ P+Q FI
Sbjct: 221 GPLSLVSQLGSPRFSYCLTSFMSP--VPSRLYFGAYATLNSTSASTGEPVQSTPFIVNPG 278
Query: 244 -DGHYYITLEAISVDGRMLDINPNIFKRDDS--GGGVMIDSGTDVTWLVKEAY----EAL 296
YY+ + ISV G +L I+P++F +D+ GGV+IDSG+ +T+L + AY +A
Sbjct: 279 LPTMYYLNMTGISVGGELLPIDPSVFAINDADGTGGVIIDSGSTITYLARAAYDMVHQAF 338
Query: 297 RDEVMIRLEGEQMRS------YSWPDKLCYHGIMSSDLKGFPTVRFHFRGG-AKLALEKD 349
D+V + L + + WP + P + FHF G +L LE +
Sbjct: 339 ADQVGLPLTNATSLADVLDTCFVWPPP-------PRKIVTMPELAFHFEGANMELPLE-N 390
Query: 350 SMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
M C+A+ + + S+IG Q ++V YD +F C V+
Sbjct: 391 YMLIDGDTGNLCLAIAASD------DGSIIGSFQHQNFHVLYDNENSLLSFTPATCNVM 443
>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 396
Score = 158 bits (399), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 123/385 (31%), Positives = 186/385 (48%), Gaps = 29/385 (7%)
Query: 35 YLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD 94
Y +++ H + +N + + +++G PPV + +DTGS L+W C PC+
Sbjct: 24 YKSDELHMHRLGSNGVFTRVTSNNGDYLMKLTLGTPPVDVYGLVDTGSDLVWAQCTPCQG 83
Query: 95 CSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVST 154
C Q +F P RS++Y +PCDSE C CS K C Y+ Y T ++
Sbjct: 84 CYRQKSPMFEPLRSNTYTPIPCDSEECNSLFGHSCSPQK-LCAYSYAYADSSVTKGVLAR 142
Query: 155 EQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGI--FGLGIGRSSLVSQL-----NSSF 207
E +TF +TD + V D+VFGCG S + F + + GLG G SLVSQ + F
Sbjct: 143 ETVTFSSTDGEPVVVGDIVFGCGHSNSGTFNENDMGIIGLGGGPLSLVSQFGNLYGSKRF 202
Query: 208 SYCIGSLHDPDYLHNKLILGDGA-IIDEG-DATPLQFIDGH--YYITLEAISVDGRMLDI 263
S C+ H + + GD + + EG ATPL +G Y +TLE ISV +
Sbjct: 203 SQCLVPFHADPHTLGTISFGDASDVSGEGVAATPLVSEEGQTPYLVTLEGISVGDTFVSF 262
Query: 264 NPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD---KLCY 320
N + S G +MIDSGT T+L +E Y+ L E ++++ + PD +LCY
Sbjct: 263 NSSEML---SKGNIMIDSGTPATYLPQEFYDRLVKE--LKVQSNMLPIDDDPDLGTQLCY 317
Query: 321 HGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIG 380
++L+G P + HF GA + L F P+ FC A+ + + Y+ G
Sbjct: 318 RS--ETNLEG-PILIAHFE-GADVQLMPIQTFIPPKDGVFCFAM-AGTTDGEYI----FG 368
Query: 381 MMAQQFYNVGYDIGRKQKTFQRMDC 405
AQ +G+D+ RK +F+ DC
Sbjct: 369 NFAQSNVLIGFDLDRKTVSFKATDC 393
>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 353
Score = 158 bits (399), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 114/365 (31%), Positives = 181/365 (49%), Gaps = 31/365 (8%)
Query: 63 VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR 122
+ +SIG P V +DTGS L+W C PC +C Q IF P +SSSY+ V C S C
Sbjct: 1 MELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCN 60
Query: 123 YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNR 182
P + C+ K C Y Y T ++TE TF+ DE++I + FGCG
Sbjct: 61 ALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFE--DENSIS--GIGFGCGVENEG 116
Query: 183 N--FKFSGIFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDY--------LHNKLILGDGAI 231
+ + SG+ GLG G SL+SQL + FSYC+ S+ D + L + ++ GA
Sbjct: 117 DGFSQGSGLVGLGRGPLSLISQLKETKFSYCLTSIEDSEASSSLFIGSLASGIVNKTGAS 176
Query: 232 IDEGDATPLQFI------DGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTD 284
+D G+ T + YY+ L+ I+V + L + + F+ +D GG++IDSGT
Sbjct: 177 LD-GEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGMIIDSGTT 235
Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKL 344
+T+L + A++ L++E R+ S S LC+ ++ P + FHF+ GA L
Sbjct: 236 ITYLEETAFKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPDAAKNIAVPKMIFHFK-GADL 294
Query: 345 ALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
L ++ M C+A+ ++ S+ G + QQ +NV +D+ ++ +F
Sbjct: 295 ELPGENYMVADSSTGVLCLAMGSSN------GMSIFGNVQQQNFNVLHDLEKETVSFVPT 348
Query: 404 DCEVL 408
+C L
Sbjct: 349 ECGKL 353
>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 157 bits (398), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 124/352 (35%), Positives = 174/352 (49%), Gaps = 22/352 (6%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ + +GQP P + +DTGS + W+ C PC DC Q IF P SSS+A +PC+S+
Sbjct: 155 YFSRVGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRSSSSFASLPCESQQ 214
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C+ + C A K C+Y Y G T TE LTF N+ + DV GCG
Sbjct: 215 CQALETSGCRASK--CLYQVSYGDGSFTVGEFVTETLTFGNSG----MINDVAVGCGHDN 268
Query: 181 NRNF-KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
F +G+ GLG G SL SQ+ SSFSYC+ + + L A D +A
Sbjct: 269 EGLFVGSAGLLGLGGGPLSLTSQMKASSFSYCL--VDRDSSSSSDLEFNSAAPSDSVNAP 326
Query: 239 PLQF--IDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYEA 295
L+ +D YY+ L +SV G++L I PN+F+ DDSG GG+++DSGT +T L +AY
Sbjct: 327 LLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYGGIIVDSGTAITRLQTQAYNT 386
Query: 296 LRDEVMIRLE-GEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL-EKDSMFY 353
LRD + R ++ ++ D CY + S PTV F F GG L L K+ +
Sbjct: 387 LRDAFVSRTPYLKKTNGFALFDT-CYD-LSSQSRVTIPTVSFEFAGGKSLQLPPKNYLIP 444
Query: 354 QPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
FC A P + + S+IG + QQ V YD+ F C
Sbjct: 445 VDSVGTFCFAFAPTT-----SSLSIIGNVQQQGTRVHYDLANSVVGFSPHKC 491
>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
Length = 473
Score = 157 bits (398), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 130/431 (30%), Positives = 196/431 (45%), Gaps = 50/431 (11%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNT------YQAQILPSN 54
L+HRD+I S P+ H+V + AR+ +L++++ + + ++++P
Sbjct: 67 LVHRDAI-SGATYPSRR--HQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVVPGV 123
Query: 55 DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYV 114
D + ++V + +G PP Q+ +D+GS ++WV C PC C Q +F P+ SSS++ V
Sbjct: 124 DDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGV 183
Query: 115 PCDSEHCRYFP--YARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
C S CR +C Y+ Y G T ++ E LT T VQ V
Sbjct: 184 SCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGT-----AVQGV 238
Query: 173 VFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILG 227
GCG + F +G+ GLG G SLV QL + FSYC+ S L+LG
Sbjct: 239 AIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASRGAGG--AGSLVLG 296
Query: 228 DGAIIDEGDA-TPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSG 282
+ G PL YY+ L I V G L + ++F+ +D GGV++D+G
Sbjct: 297 RTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTG 356
Query: 283 TDVTWLVKEAYEALR---DEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----PTV 334
T VT L +EAY ALR D M L + + S D CY DL G+ PTV
Sbjct: 357 TAVTRLPREAYAALRGAFDGAMGAL--PRSPAVSLLDT-CY------DLSGYASVRVPTV 407
Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
F+F GA L L ++ + FC+A P+S S++G + Q+ + D
Sbjct: 408 SFYFDQGAVLTLPARNLLVEVGGAVFCLAFAPSS-----SGISILGNIQQEGIQITVDSA 462
Query: 395 RKQKTFQRMDC 405
F C
Sbjct: 463 NGYVGFGPNTC 473
>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 427
Score = 157 bits (398), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 128/419 (30%), Positives = 206/419 (49%), Gaps = 42/419 (10%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
LIH++S SP+ N H+ + + + +++Q+ + T +N
Sbjct: 34 LIHKNSPNSPFYKSNN--FHKNKLRSFYQVPKKSFVQKSPYTRVTS----------NNGD 81
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ + +++G PPV + +DTGS L+W C PC C Q +F P RS +Y+ +PC+SE
Sbjct: 82 YLMKLTLGSPPVDIYGLVDTGSDLVWAQCTPCGGCYRQKSPMFEPLRSKTYSPIPCESEQ 141
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C +F Y+ CS K C Y+ Y T ++ E +TF +TD + V D++FGCG S
Sbjct: 142 CSFFGYS-CSPQK-MCAYSYSYADSSVTKGVLAREAITFSSTDGDPVVVGDIIFGCGHSN 199
Query: 181 NRNFKFSGIFGLGIGRS--SLVSQLNS-----SFSYCIGSLHDPDYLHNKLILGDGA-II 232
+ F + + +G+G SLVSQ+ + FS C+ H + + G+ + +
Sbjct: 200 SGTFNENDMGIIGMGGGPLSLVSQIGTLYGSKRFSQCLVPFHTDAHTSGTINFGEESDVS 259
Query: 233 DEG-DATPLQFIDGH--YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLV 289
EG TPL +G Y +TLE ISV + N + S G +MIDSGT T++
Sbjct: 260 GEGVVTTPLASEEGQTSYLVTLEGISVGDTFVRFNSS---ETLSKGNIMIDSGTPATYIP 316
Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPD---KLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
+E YE L +E ++++ + PD +LCY ++L+G P + HF GA + L
Sbjct: 317 QEFYERLVEE--LKVQSSLLPIEDDPDLGTQLCYRS--ETNLEG-PILTAHFE-GADVQL 370
Query: 347 EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
F P+ FC A+ S + Y+ G AQ +G+D+ RK +F+ DC
Sbjct: 371 LPIQTFIPPKDGVFCFAM-AGSTDGDYI----FGNFAQSNILMGFDLDRKTISFKPTDC 424
>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 157 bits (397), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 118/366 (32%), Positives = 173/366 (47%), Gaps = 26/366 (7%)
Query: 57 SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC-YPCRDCSPQLGTIFYPSRSSSYAYVP 115
S + + V+I+IG PP+P +DTGS L+W C PCR C PQ ++ P+RS++YA V
Sbjct: 88 STATYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVS 147
Query: 116 CDSEHCRYF--PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
C S C+ P++RCS C Y Y G T ++TE T S V+ V
Sbjct: 148 CRSPMCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLG----SDTAVRGVA 203
Query: 174 FGCGFST-NRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCI----GSLHDPDYLHNKLILG 227
FGCG SG+ G+G G SLVSQL + FSYC + P +L + L
Sbjct: 204 FGCGTENLGSTDNSSGLVGMGRGPLSLVSQLGVTRFSYCFTPFNATAASPLFLGSSARLS 263
Query: 228 DGAIIDEGDATP---LQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGT 283
A +P + +YY++LE I+V +L I+P +F+ G GGV+IDSGT
Sbjct: 264 SAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRLTPMGDGGVIIDSGT 323
Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAK 343
T L + A+ AL + R+ LC+ S + P + HF GA
Sbjct: 324 TFTALEERAFVALARALASRVRLPLASGAHLGLSLCFAA-ASPEAVEVPRLVLHFD-GAD 381
Query: 344 LALEKDSMFYQPRPDAF-CMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
+ L ++S + R C+ + A S++G M QQ ++ YD+ R +F+
Sbjct: 382 MELRRESYVVEDRSAGVACLGMVSAR------GMSVLGSMQQQNTHILYDLERGILSFEP 435
Query: 403 MDCEVL 408
C L
Sbjct: 436 AKCGEL 441
>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
Length = 441
Score = 157 bits (397), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 118/366 (32%), Positives = 173/366 (47%), Gaps = 26/366 (7%)
Query: 57 SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC-YPCRDCSPQLGTIFYPSRSSSYAYVP 115
S + + V+I+IG PP+P +DTGS L+W C PCR C PQ ++ P+RS++YA V
Sbjct: 88 STATYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVS 147
Query: 116 CDSEHCRYF--PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
C S C+ P++RCS C Y Y G T ++TE T S V+ V
Sbjct: 148 CRSPMCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLG----SDTAVRGVA 203
Query: 174 FGCGFST-NRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCI----GSLHDPDYLHNKLILG 227
FGCG SG+ G+G G SLVSQL + FSYC + P +L + L
Sbjct: 204 FGCGTENLGSTDNSSGLVGMGRGPLSLVSQLGVTRFSYCFTPFNATAASPLFLGSSARLS 263
Query: 228 DGAIIDEGDATP---LQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGT 283
A +P + +YY++LE I+V +L I+P +F+ G GGV+IDSGT
Sbjct: 264 SAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRLTPMGDGGVIIDSGT 323
Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAK 343
T L + A+ AL + R+ LC+ S + P + HF GA
Sbjct: 324 TFTALEESAFVALARALASRVRLPLASGAHLGLSLCFAA-ASPEAVEVPRLVLHFD-GAD 381
Query: 344 LALEKDSMFYQPRPDAF-CMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
+ L ++S + R C+ + A S++G M QQ ++ YD+ R +F+
Sbjct: 382 MELRRESYVVEDRSAGVACLGMVSAR------GMSVLGSMQQQNTHILYDLERGILSFEP 435
Query: 403 MDCEVL 408
C L
Sbjct: 436 AKCGEL 441
>gi|15235526|ref|NP_193028.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|5123933|emb|CAB45491.1| putative protein [Arabidopsis thaliana]
gi|7267994|emb|CAB78334.1| putative protein [Arabidopsis thaliana]
gi|332657803|gb|AEE83203.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 389
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 111/350 (31%), Positives = 162/350 (46%), Gaps = 21/350 (6%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ-LGTIFYPSRSSSYAYVPCDSE 119
F I G P QF MDTGSSL W C+PC DC Q + + P+ S +Y C+
Sbjct: 58 FMAEIHFGSPQKKQFLHMDTGSSLTWTQCFPCSDCYAQKIYPKYRPAASITYRDAMCEDS 117
Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG-F 178
H + P+ C Y Q YL ++ E +T D V V FGC
Sbjct: 118 HPKSNPHFAFDPLTRICTYQQHYLDETNIKGTLAQEMITVDTHDGGFKRVHGVYFGCNTL 177
Query: 179 STNRNFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
S F +GI GLG+G+ S++ + S FS+C+G + +P HN LILGDGA + +G T
Sbjct: 178 SDGSYFTGTGILGLGVGKYSIIGEFGSKFSFCLGEISEPKASHN-LILGDGANV-QGHPT 235
Query: 239 PLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRD 298
+ +GH LE+I V + +P V +D+G+ ++ L Y D
Sbjct: 236 VINITEGHTIFQLESIIVGEEITLDDPV---------QVFVDTGSTLSHLSTNLYYKFVD 286
Query: 299 EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPR-P 357
L G R S+ LCY L+ V F F GA+L++ ++F Q P
Sbjct: 287 -AFDDLIGS--RPLSYEPTLCYKADTIERLEKM-DVGFKFDVGAELSVNIHNIFIQQGPP 342
Query: 358 DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
+ C+A+ N + +IG++A Q YNVGYD+ K + DC++
Sbjct: 343 EIRCLAIQN---NKESFSHVIIGVIAMQGYNVGYDLSAKTAYINKQDCDM 389
>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 125/423 (29%), Positives = 196/423 (46%), Gaps = 36/423 (8%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
LIHRDS SP NP EN HRV + SI+ L NT +A I +
Sbjct: 34 LIHRDSPKSPMYNPLENHYHRVADTLRRSISHNTGLVT-----NTVEAPIYNNR----GE 84
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ + +S+G PP P DTGS ++W C PC +C Q +F PS+S++Y V C S
Sbjct: 85 YLMKLSVGTPPFPIIAVADTGSDIIWTQCVPCTNCYQQDLPMFNPSKSTTYRKVSCSSPV 144
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C + ++K C Y+ Y + + + LT +T + GCG
Sbjct: 145 CSFTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDTLTMGSTSGRVVAFPRTAIGCGHDN 204
Query: 181 NRNF--KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDE 234
+F SGI GLG+G +SL+ Q+ S+ FSYC+ + + D NKL G A +
Sbjct: 205 AGSFDANVSGIVGLGLGPASLIKQMGSAVGGKFSYCLTPIGNDDGGSNKLNFGSNANVSG 264
Query: 235 GDA--TPLQFID---GHYYITLEAISVDGRMLDINPNIFKRDDS--GG--GVMIDSGTDV 285
A TP+ D Y + L+A+SV GR N + +S GG ++IDSGT +
Sbjct: 265 SGAVSTPIYISDKFKSFYSLKLKAVSV-GR----NNTFYSTANSILGGKANIIIDSGTTL 319
Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLA 345
T L + Y + + ++ + + C+ ++D P + HF GA L
Sbjct: 320 TLLPVDLYHNFAKAISNSINLQRTDDPNQFLEYCFE--TTTDDYKVPFIAMHFE-GANLR 376
Query: 346 LEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
L+++++ + + C+A A N + S+ G +AQ + VGYD+ +F+ M+C
Sbjct: 377 LQRENVLIRVSDNVICLAFAGAQDN----DISIYGNIAQINFLVGYDVTNMSLSFKPMNC 432
Query: 406 EVL 408
+
Sbjct: 433 VAM 435
>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
Length = 473
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 129/431 (29%), Positives = 195/431 (45%), Gaps = 50/431 (11%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNT------YQAQILPSN 54
L+HRD+I S P+ H+V + AR+ +L++++ + + ++++P
Sbjct: 67 LVHRDAI-SGATYPSRR--HQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVVPGV 123
Query: 55 DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYV 114
D + ++V + +G PP Q+ +D+GS ++WV C PC C Q +F P+ SSS++ V
Sbjct: 124 DDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGV 183
Query: 115 PCDSEHCRYFP--YARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
C S CR +C Y+ Y G T ++ E LT T VQ V
Sbjct: 184 SCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGT-----AVQGV 238
Query: 173 VFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILG 227
GCG + F +G+ GLG G SL+ QL + FSYC+ S L+LG
Sbjct: 239 AIGCGHRNSGLFVGAAGLLGLGWGAMSLIGQLGGAAGGVFSYCLASRGAGG--AGSLVLG 296
Query: 228 DGAIIDEGDA-TPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSG 282
+ G PL YY+ L I V G L + +F+ +D GGV++D+G
Sbjct: 297 RTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDGLFQLTEDGAGGVVMDTG 356
Query: 283 TDVTWLVKEAYEALR---DEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----PTV 334
T VT L +EAY ALR D M L + + S D CY DL G+ PTV
Sbjct: 357 TAVTRLPREAYAALRGAFDGAMGAL--PRSPAVSLLDT-CY------DLSGYASVRVPTV 407
Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
F+F GA L L ++ + FC+A P+S S++G + Q+ + D
Sbjct: 408 SFYFDQGAVLTLPARNLLVEVGGAVFCLAFAPSS-----SGISILGNIQQEGIQITVDSA 462
Query: 395 RKQKTFQRMDC 405
F C
Sbjct: 463 NGYVGFGPNTC 473
>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 125/423 (29%), Positives = 196/423 (46%), Gaps = 36/423 (8%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
LIHRDS SP NP EN HRV + SI+ L NT +A I +
Sbjct: 34 LIHRDSPKSPMYNPLENHYHRVADTLRRSISHNTGLVT-----NTVEAPIYNNR----GE 84
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ + +S+G PP P DTGS ++W C PC +C Q +F PS+S++Y V C S
Sbjct: 85 YLMKLSVGTPPFPIIAVADTGSDIIWTQCEPCTNCYQQDLPMFNPSKSTTYRKVSCSSPV 144
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C + ++K C Y+ Y + + + LT +T + GCG
Sbjct: 145 CSFTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDTLTMGSTSGRVVAFPRTAIGCGHDN 204
Query: 181 NRNF--KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDE 234
+F SGI GLG+G +SL+ Q+ S+ FSYC+ + + D NKL G A +
Sbjct: 205 AGSFDANVSGIVGLGLGPASLIKQMGSAVGGKFSYCLTPIGNDDGGSNKLNFGSNANVSG 264
Query: 235 GDA--TPLQFID---GHYYITLEAISVDGRMLDINPNIFKRDDS--GG--GVMIDSGTDV 285
A TP+ D Y + L+A+SV GR N + +S GG ++IDSGT +
Sbjct: 265 SGAVSTPIYISDKFKSFYSLKLKAVSV-GR----NNTFYSTANSILGGKANIIIDSGTTL 319
Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLA 345
T L + Y + + ++ + + C+ ++D P + HF GA L
Sbjct: 320 TLLPVDLYHNFAKAISNSINLQRTDDPNQFLEYCFE--TTTDDYKVPFIAMHFE-GANLR 376
Query: 346 LEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
L+++++ + + C+A A N + S+ G +AQ + VGYD+ +F+ M+C
Sbjct: 377 LQRENVLIRVSDNVICLAFAGAQDN----DISIYGNIAQINFLVGYDVTNMSLSFKPMNC 432
Query: 406 EVL 408
+
Sbjct: 433 VAM 435
>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 413
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 120/362 (33%), Positives = 180/362 (49%), Gaps = 32/362 (8%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ + + IG PP+ +DTGS L+WV C PC C Q+ +F P +SS+Y + CDS
Sbjct: 64 YLMELYIGTPPIKISGTVDTGSDLIWVQCVPCLGCYNQINPMFDPLKSSTYTNISCDSPL 123
Query: 121 CRYFPY-ARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
C Y PY CS K RC YT Y T ++ E +T + I +Q ++FGCG +
Sbjct: 124 C-YKPYIGECSPEK-RCDYTYGYADSSLTKGVLAQETVTLTSNTGKPISLQGILFGCGHN 181
Query: 180 TNRNFK--FSGIFGLGIGRSSLVSQL-----NSSFSYCIGSLHDPDYLHNKLILGDGA-I 231
NF G+ GLG G +SLVSQ+ FS C+ + +++ G G+ +
Sbjct: 182 NTGNFNDHEMGLIGLGGGPTSLVSQIGPLFGGKKFSQCLVPFLTDITISSQMSFGKGSEV 241
Query: 232 IDEG-DATPLQFID---GHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
+ EG TPL + YY+TL ISV+ L +N I K G +++DSGT
Sbjct: 242 LGEGVVTTPLVQREQDMTSYYVTLLGISVEDTYLPMNSTIEK-----GNMLVDSGTPPNI 296
Query: 288 LVKEAYEALRDEVMIRLEGEQMR-SYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
L ++ Y+ + EV ++ E + S +LCY ++LKG PT+ +HF GA L L
Sbjct: 297 LPQQLYDRVYVEVKNKVPLEPITDDPSLGPQLCYR--TQTNLKG-PTLTYHFE-GANLLL 352
Query: 347 EKDSMFYQPRPDA---FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
F P P+ FC+A+ N + + G AQ Y +G+D+ R+ +F+
Sbjct: 353 TPIQTFIPPTPETKGVFCLAIT----NCANSDPGIYGNFAQTNYLIGFDLDRQIVSFKPT 408
Query: 404 DC 405
DC
Sbjct: 409 DC 410
>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 129/424 (30%), Positives = 199/424 (46%), Gaps = 34/424 (8%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
L+HRDS SP N + R + + S++R+ + Q R+ T + + S I+N
Sbjct: 35 LVHRDSPKSPLYNSQQTHLQRWNKAMRRSVSRVHHFQ---RTAATVSPKEVESEIIANGG 91
Query: 61 FYV-NISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSE 119
Y+ ++S+G PP DTGS L+W C PC C Q+ +F P S +Y + CD+
Sbjct: 92 EYLMSLSLGTPPFEILAIADTGSDLIWTQCTPCDKCYKQIAPLFDPKSSKTYRDLSCDTR 151
Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
C+ + + + C Y+ Y T+ ++ + +T +T+ ++ V GCG
Sbjct: 152 QCQNLGESSSCSSEQLCQYSYYYGDRSFTNGNLAVDTVTLPSTNGGPVYFPKTVIGCGRR 211
Query: 180 TNRNF--KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLH-NKLILGDGAII 232
N F K SGI GLG G SL+SQ+ SS FSYC+ + +KL G A++
Sbjct: 212 NNGTFDKKDSGIIGLGGGPMSLISQMGSSVGGKFSYCLVPFSSESAGNSSKLHFGRNAVV 271
Query: 233 DEG--DATPL--QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW- 287
+TPL + D YY+TLEA+SV + I S G ++IDSGT +T
Sbjct: 272 SGSGVQSTPLISKNPDTFYYLTLEAMSVGDK--KIEFGGSSFGGSEGNIIIDSGTSLTLF 329
Query: 288 ---LVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKL 344
E A+ + V + GE+ + S CY + DLK P + HF GA +
Sbjct: 330 PVNFFTEFATAVENAV---INGERTQDASGLLSHCYR--PTPDLK-VPVITAHFN-GADV 382
Query: 345 ALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
L+ + F D C+A N + ++ G +AQ + +GYDI K +F+ D
Sbjct: 383 VLQTLNTFILISDDVLCLAFNSTQ------SGAIFGNVAQMNFLIGYDIQGKSVSFKPTD 436
Query: 405 CEVL 408
C L
Sbjct: 437 CTQL 440
>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 455
Score = 155 bits (393), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 120/390 (30%), Positives = 184/390 (47%), Gaps = 48/390 (12%)
Query: 54 NDISNSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHCYPCRD--------CSPQLGTIFY 104
D+ N Y+ +SIG PP+ DTGS L+W C PC D C Q G ++
Sbjct: 79 KDLRNGGEYIMTLSIGTPPLSYRAIADTGSDLIWTQCAPCGDTVTDTDNQCFKQSGCLYN 138
Query: 105 PSRSSSYAYVPCDSEHCRYFPYARCSAYKH-------RCIYTQLYLIGPETSVFVSTEQL 157
PS S+++ +PC+S P + C+A C+Y Q Y G T+ S E
Sbjct: 139 PSSSTTFGVLPCNS------PLSMCAAMAGPSPPPGCACMYNQTYGTG-WTAGVQSVETF 191
Query: 158 TF-KNTDESTIHVQDVVFGCGFSTNRNFKFS-GIFGLGIGRSSLVSQLNS-SFSYCIGSL 214
TF ++ + V ++ FGC +++ ++ S G+ GLG G SLVSQL + +FSYC+
Sbjct: 192 TFGSSSTPPAVRVPNIAFGCSNASSNDWNGSAGLVGLGRGSMSLVSQLGAGAFSYCLTPF 251
Query: 215 HDPDYLHNKLILGDGAIIDEGDATPLQ---FIDG--------HYYITLEAISVDGRMLDI 263
D + + L+LG A P++ F+ G +YY+ L ISV L I
Sbjct: 252 QDANS-TSTLLLGPSAAAALKGTGPVRSTPFVAGPSKAPMSTYYYLNLTGISVGETALAI 310
Query: 264 NPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYE----ALRDEVMIRLEGEQMRSYSWPDKL 318
P+ F R D GG++IDSGT +T LV AY+ A+R ++ RL +S L
Sbjct: 311 PPDAFSLRADGTGGLIIDSGTTITTLVDSAYQQVRAAVRSLLVTRLPLAHGPDHSTGLDL 370
Query: 319 CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSL 378
C+ S+ P++ HF GGA + L ++ +C+A+ N S+
Sbjct: 371 CFALKASTPPPAMPSMTLHFEGGADMVLPVENYMILGS-GVWCLAMR----NQTVGAMSM 425
Query: 379 IGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
+G QQ +V YD+ ++ +F C L
Sbjct: 426 VGNYQQQNIHVLYDVRKETLSFAPAVCSSL 455
>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 155 bits (392), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 121/368 (32%), Positives = 171/368 (46%), Gaps = 34/368 (9%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
N F +++SIG P V +DTGS L+W C PC +C Q +F PS SS+YA +PC
Sbjct: 99 NGEFLMDMSIGTPAVAYAAIIDTGSDLVWTQCKPCVECFNQSTPVFDPSSSSTYAALPCS 158
Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
S C P ++C++ K C YT Y T ++ E T T + DV FGCG
Sbjct: 159 STLCSDLPSSKCTSAK--CGYTYTYGDSSSTQGVLAAETFTLAKT-----KLPDVAFGCG 211
Query: 178 FSTNRNFKFS---GIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIID 233
TN F+ G+ GLG G SLVSQL + FSYC+ SL D + L+LG A I
Sbjct: 212 -DTNEGDGFTQGAGLVGLGRGPLSLVSQLGLNKFSYCLTSLDDTS--KSPLLLGSLATIS 268
Query: 234 EG-------DATPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSG 282
E TPL YY+ L+ ++V + + + F +DD GGV++DSG
Sbjct: 269 ESAAAASSVQTTPLIRNPSQPSFYYVNLKGLTVGSTHITLPSSAFAVQDDGTGGVIVDSG 328
Query: 283 TDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS-DLKGFPTVRFHFRGG 341
T +T+L + Y AL+ +++ C+ S D P + FH G
Sbjct: 329 TSITYLELQGYRALKKAFAAQMKLPAADGSGIGLDTCFEAPASGVDQVEVPKLVFHLD-G 387
Query: 342 AKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
A L L ++ M A C+ V + +R S+IG QQ YD+G +F
Sbjct: 388 ADLDLPAENYMVLDSGSGALCLTV----MGSR--GLSIIGNFQQQNIQFVYDVGENTLSF 441
Query: 401 QRMDCEVL 408
+ C L
Sbjct: 442 APVQCAKL 449
>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
Full=Nepenthesin-II; Flags: Precursor
gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
Length = 438
Score = 155 bits (391), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 112/355 (31%), Positives = 169/355 (47%), Gaps = 25/355 (7%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ +N++IG P MDTGS L+W C PC C Q IF P SSS++ +PC+S++
Sbjct: 96 YLMNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQY 155
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C+ P C+ + C YT Y G T +++TE TF+ T V ++ FGCG
Sbjct: 156 CQDLPSETCN--NNECQYTYGYGDGSTTQGYMATETFTFE-----TSSVPNIAFGCG-ED 207
Query: 181 NRNF---KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAI-IDEG 235
N+ F +G+ G+G G SL SQL FSYC+ S + L LG A + EG
Sbjct: 208 NQGFGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCMTSYGSSS--PSTLALGSAASGVPEG 265
Query: 236 DATPL----QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVK 290
+ +YYITL+ I+V G L I + F+ +DD GG++IDSGT +T+L +
Sbjct: 266 SPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQ 325
Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS 350
+AY A+ ++ + S C+ P + F GG L L + +
Sbjct: 326 DAYNAVAQAFTDQINLPTVDESSSGLSTCFQQPSDGSTVQVPEISMQFDGGV-LNLGEQN 384
Query: 351 MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ P C+A+ +S + S+ G + QQ V YD+ +F C
Sbjct: 385 ILISPAEGVICLAMGSSS----QLGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 435
>gi|255553149|ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223543249|gb|EEF44781.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 449
Score = 155 bits (391), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 137/426 (32%), Positives = 205/426 (48%), Gaps = 36/426 (8%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQ-EKIRSHNTYQAQILPSNDISNS 59
LIHRDS +SP NP + R++ + SI+R + I + Q+ I+P
Sbjct: 36 LIHRDSSVSPLYNPRDTYFDRLRNSFHRSISRANRFKPNSISARALVQSDIVPGG----G 91
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSE 119
+ + ISIG P V DTGS L+WV C PC C Q IF P RSSSY V C +E
Sbjct: 92 EYLMRISIGNPQVEILAIADTGSDLIWVQCQPCEMCYKQNSPIFDPRRSSSYRNVLCGNE 151
Query: 120 HCRYFP-YAR-CSA--YKHRCIYTQLYLIGPETSVFVSTEQL----TFKNTDESTIHVQD 171
C AR C A + C YT Y + ++ E+ T NT + + Q+
Sbjct: 152 FCNKLDGEARSCDARGFVKTCGYTYSYGDQSFSDGHLAIERFGIGSTNSNTSAAIAYFQE 211
Query: 172 VVFGCGFSTNRNF--KFSGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLI 225
V FGCG F SGI GLG G SLVSQ L+ FSYC+ + +K+
Sbjct: 212 VAFGCGTKNGGTFDELGSGIIGLGGGSMSLVSQLGPKLSGKFSYCLVPTSEQSNYTSKIN 271
Query: 226 LGDGAIIDEGD----ATPL--QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
G+ I + +TPL + + +YY+TLEAISV+ + L N++ + G ++I
Sbjct: 272 FGNDINISGSNYNVVSTPLLPKKPETYYYLTLEAISVENKRLPYT-NLWNGEVEKGNIII 330
Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFR 339
DSGT +T+L E + L V ++GE++ +C+ + +L P + HF
Sbjct: 331 DSGTTLTFLDSEFFNNLDSAVEEAVKGERVSDPHGLFNICFKDEKAIEL---PIITAHFT 387
Query: 340 GGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKT 399
GA + L+ + F + D C + P++ + ++ G +AQ + VGYD+ +K +
Sbjct: 388 -GADVELQPVNTFAKVEEDLLCFTMIPSN------DIAIFGNLAQMNFLVGYDLEKKAVS 440
Query: 400 FQRMDC 405
F DC
Sbjct: 441 FLPTDC 446
>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
Length = 383
Score = 155 bits (391), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 128/398 (32%), Positives = 196/398 (49%), Gaps = 26/398 (6%)
Query: 22 VQRGINISIARLAYLQ--EKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMD 79
++R I S RL LQ + +H + + DI + + + ++IG P + MD
Sbjct: 1 MKRAIQRSQERLEKLQITSAVNTHQMKDIETPVTPDIGSGEYLIQMAIGTPALSLSAIMD 60
Query: 80 TGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYT 139
TGS L+W C PC DCS +I+ PS SS+Y+ V C S C+ C+ C Y
Sbjct: 61 TGSDLVWTKCNPCTDCSTS--SIYDPSSSSTYSKVLCQSSLCQPPSIFSCNN-DGDCEYV 117
Query: 140 QLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSL 199
Y TS +S E TF + +S + ++ FGCG K G+ G G G SL
Sbjct: 118 YPYGDRSSTSGILSDE--TFSISSQS---LPNITFGCGHDNQGFDKVGGLVGFGRGSLSL 172
Query: 200 VSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEGD--ATPL--QFIDGHYYITL 251
VSQL S FSYC+ S D + L +G+ A ++ +TPL HYY++L
Sbjct: 173 VSQLGPSMGNKFSYCLVSRTDSSK-TSPLFIGNTASLEATTVGSTPLVQSSSTNHYYLSL 231
Query: 252 EAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMR 310
E ISV G+ L I F + D GG++IDSGT +T+L + AY+A+++ ++ + Q
Sbjct: 232 EGISVGGQSLAIPTGTFDIQSDGSGGLIIDSGTTLTFLQQTAYDAVKEAMVSSINLPQAD 291
Query: 311 SYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASIN 370
LC++ SS+ GFP++ FHF+G +++ +F D C+A+ P N
Sbjct: 292 GQL---DLCFNQQGSSN-PGFPSMTFHFKGADYDVPKENYLFPDSTSDIVCLAMMPT--N 345
Query: 371 NRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
+ N ++ G + QQ Y + YD +F C+ L
Sbjct: 346 SNLGNMAIFGNVQQQNYQILYDNENNVLSFAPTACDTL 383
>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 453
Score = 155 bits (391), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 122/358 (34%), Positives = 170/358 (47%), Gaps = 28/358 (7%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ + +G PP + +DTGS ++W+ C PCR C Q IF P +S S+A +PC S
Sbjct: 110 YFTRLGVGTPPRYLYMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPYKSKSFAGIPCSSPL 169
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
CR + CS +H C+Y Y G T+ +TE LTF+ + V GCG
Sbjct: 170 CRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNK-----IAKVALGCGHHN 224
Query: 181 NRNF-KFSGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
F +G+ GLG GR S SQ N FSYC+ + ++ GD AI
Sbjct: 225 EGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVD-RSASSKPSSMVFGDAAISRLA 283
Query: 236 DATPL---QFIDGHYYITLEAISVDG-RMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVK 290
TPL +D YY+ L ISV G R+ ++P++FK D +G GGV+IDSGT VT L +
Sbjct: 284 RFTPLIRNPKLDTFYYVGLIGISVGGVRVRGVSPSLFKLDSAGNGGVIIDSGTSVTRLTR 343
Query: 291 EAYEALRDEVMIRLEGEQMR---SYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALE 347
AY ALRD R+ ++ +S D CY S +K PTV HFRG
Sbjct: 344 PAYTALRDA--FRVGARHLKRGPEFSLFDT-CYDLSGQSSVK-VPTVVLHFRGADMALPA 399
Query: 348 KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ + +FC A S+IG + QQ + V YD+ + F C
Sbjct: 400 TNYLIPVDENGSFCFA-----FAGTISGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC 452
>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
Length = 451
Score = 155 bits (391), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 125/427 (29%), Positives = 189/427 (44%), Gaps = 64/427 (14%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNT------YQAQILPSN 54
L+HRD+I S P+ H+V + AR+ +L++++ + + ++++P
Sbjct: 67 LVHRDAI-SGATYPSRR--HQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVVPGV 123
Query: 55 DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYV 114
D + ++V + +G PP Q+ +D+GS ++WV C PC C Q +F P+ SSS++ V
Sbjct: 124 DDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGV 183
Query: 115 PCDSEHCRYFP--YARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
C S CR +C Y+ Y G T ++ E LT T VQ V
Sbjct: 184 SCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGT-----AVQGV 238
Query: 173 VFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILG 227
GCG + F +G+ GLG G SLV QL + FSYC+ S
Sbjct: 239 AIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLAS-------------- 284
Query: 228 DGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVT 286
G YY+ L I V G L + ++F+ +D GGV++D+GT VT
Sbjct: 285 ------RGAGGAGSLASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAVT 338
Query: 287 WLVKEAYEALR---DEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----PTVRFHF 338
L +EAY ALR D M L + + S D CY DL G+ PTV F+F
Sbjct: 339 RLPREAYAALRGAFDGAMGAL--PRSPAVSLLDT-CY------DLSGYASVRVPTVSFYF 389
Query: 339 RGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQK 398
GA L L ++ + FC+A P+S S++G + Q+ + D
Sbjct: 390 DQGAVLTLPARNLLVEVGGAVFCLAFAPSS-----SGISILGNIQQEGIQITVDSANGYV 444
Query: 399 TFQRMDC 405
F C
Sbjct: 445 GFGPNTC 451
>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
Length = 456
Score = 154 bits (389), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 128/431 (29%), Positives = 194/431 (45%), Gaps = 41/431 (9%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQI------LPSN 54
IHRDS SP+ P+ P H R A L + + + + S
Sbjct: 34 FIHRDSARSPFAQPSL-PPHARALAAARRSLRGAALGRYVGGASPAPGPVPEADGGVESK 92
Query: 55 DISNS---LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYP--CRDCSPQLGTIFYPSRSS 109
I+ S L YVN+ G PP DTGS L+WV+C + +F+PSRS+
Sbjct: 93 IITRSFEYLMYVNV--GTPPAQMLAIADTGSDLVWVNCSSNGGGGGASDGAVVFHPSRST 150
Query: 110 SYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDEST--- 166
+Y+ + C S C+ A C A C Y Y G T +STE +F
Sbjct: 151 TYSLLSCQSAACQALSQASCDA-DSECQYQYAYGDGSRTIGVLSTETFSFAAAGGGGEGQ 209
Query: 167 IHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYL 220
+ V V FGC + +F+ G+ GLG G SLVSQL ++ FSYC+ +
Sbjct: 210 VRVPRVSFGCSTGSAGSFRSDGLVGLGAGALSLVSQLGAAARIARRFSYCLVPPYAAANS 269
Query: 221 HNKLILGDGAII-DEGDA-TPL--QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
+ L G A++ D G A TPL +D +Y + LE+++V G+ D+ R
Sbjct: 270 SSTLSFGARAVVSDPGAASTPLVPSEVDSYYTVALESVAVAGQ--DVASANSSR------ 321
Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCY--HGIMSSDLKGFPTV 334
+++DSGT +T+L L E+ R+ + + +LCY G ++ G P V
Sbjct: 322 IIVDSGTTLTFLDPALLRPLVAELERRIRLPRAQPPEQLLQLCYDVQGKSQAEDFGIPDV 381
Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
F GGA + L ++ F C+ + P S + S++G +AQQ ++VGYD+
Sbjct: 382 TLRFGGGASVTLRPENTFSLLEEGTLCLVLVPVSESQ---PVSILGNIAQQNFHVGYDLD 438
Query: 395 RKQKTFQRMDC 405
+ TF +DC
Sbjct: 439 ARTVTFAAVDC 449
>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
Length = 451
Score = 154 bits (389), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 124/429 (28%), Positives = 198/429 (46%), Gaps = 47/429 (10%)
Query: 9 SPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIG 68
SP+ +P + + + RL +L + + ++ ++ + ++V++ IG
Sbjct: 39 SPFPSPTQ--------ALALDTRRLHFLSLRRKPVPFVKSPVVSGASSGSGQYFVDLRIG 90
Query: 69 QPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ-LGTIFYPSRSSSYAYVPCDSEHCRYFP-- 125
QPP DTGS L+WV C CR+CS T+F+P SS+++ C CR P
Sbjct: 91 QPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDPVCRLVPKP 150
Query: 126 --YARCSAYK--HRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS-- 179
RC+ + C Y Y G TS + E + K + ++ V FGCGF
Sbjct: 151 GRAPRCNHTRIHSTCPYEYGYADGSLTSGLFARETTSLKTSSGKEAKLKSVAFGCGFRIS 210
Query: 180 ----TNRNFKFS-GIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGA 230
+ +F + G+ GLG G S SQL + FSYC+ + LI+GDG
Sbjct: 211 GQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPPTSYLIIGDG- 269
Query: 231 IIDEGDA------TPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMID 280
GDA TPL YY+ L+++ V+G L I+P+I++ DDSG GG ++D
Sbjct: 270 ----GDAVSKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVMD 325
Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYH--GIMSSDLKGFPTVRFHF 338
SGT + +L AY + V R++ + LC + G+ + K P ++F F
Sbjct: 326 SGTTLAFLADPAYRLVIAAVKQRIKLPNADELTPGFDLCVNVSGVTKPE-KILPRLKFEF 384
Query: 339 RGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQK 398
GGA + F + C+A+ S++ + V FS+IG + QQ + +D R +
Sbjct: 385 SGGAVFVPPPRNYFIETEEQIQCLAIQ--SVDPK-VGFSVIGNLMQQGFLFEFDRDRSRL 441
Query: 399 TFQRMDCEV 407
F R C +
Sbjct: 442 GFSRRGCAL 450
>gi|225427556|ref|XP_002266575.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 445
Score = 154 bits (389), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 122/421 (28%), Positives = 192/421 (45%), Gaps = 32/421 (7%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
I RDS SP+ NP+E R+Q+ SI R + + S N Q+ ++
Sbjct: 38 FISRDSPRSPFYNPSETKYQRLQKAFRRSILRGNHFRAIRASPNDIQSNVISGG----GS 93
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ +NIS+G PPV DTGS L+W C PC DC Q+ +F P +S +Y + C+++
Sbjct: 94 YLMNISLGTPPVSMLGIADTGSDLIWRQCLPCDDCYKQVEPLFDPKKSKTYKTLGCNNDF 153
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C+ + C + Y T +S+E T +T+ + FGCG S
Sbjct: 154 CQDLGQQGSCGDDNTCTSSYSYGDQSYTRRDLSSETFTIGSTEGDPASFPGLAFGCGHSN 213
Query: 181 NRNF--KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDE 234
F K SG+ GLG G SLV QL+S FSYC+ L +K+ G A++
Sbjct: 214 GGTFNEKDSGLIGLGGGPLSLVMQLSSKVGGQFSYCLVPLSSDSTASSKINFGKSAVVSG 273
Query: 235 GDATPLQFIDG----HYYITLEAISVDGRMLDINPNIFKRDDSG------GGVMIDSGTD 284
I G YY+TLE +S+ + F ++ S ++IDSGT
Sbjct: 274 SGTVSTPLIKGTPDTFYYLTLEGMSLGSEKVAFKG--FSKNKSSPAAAEESNIIIDSGTT 331
Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKL 344
+T L ++ Y + + + G+ LCY G+ ++ PT+ HF GA +
Sbjct: 332 LTLLPRDFYTDMESALTKVIGGQTTTDPRGTFSLCYSGVKKLEI---PTITAHFI-GADV 387
Query: 345 ALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
L + F Q + D C ++ P+S N ++ G ++Q + VGYD+ + +F+ D
Sbjct: 388 QLPPLNTFVQAQEDLVCFSMIPSS------NLAIFGNLSQMNFLVGYDLKNNKVSFKPTD 441
Query: 405 C 405
C
Sbjct: 442 C 442
>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 122/352 (34%), Positives = 173/352 (49%), Gaps = 22/352 (6%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ + +GQP P + +DTGS + W+ C PC DC Q IF P SSS+A +PC+S+
Sbjct: 155 YFSRVGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRSSSSFASLPCESQQ 214
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C+ + C A K C+Y Y G T E LTF N+ + +V GCG
Sbjct: 215 CQALETSGCRASK--CLYQVSYGDGSFTVGEFVIETLTFGNSG----MINNVAVGCGHDN 268
Query: 181 NRNF-KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
F +G+ GLG G SL SQ+ SSFSYC+ + + L A D +A
Sbjct: 269 EGLFVGSAGLLGLGGGSLSLTSQMKASSFSYCL--VDRDSSSSSDLEFNSAAPSDSVNAP 326
Query: 239 PLQF--IDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYEA 295
L+ +D YY+ L +SV G++L I PN+F+ DDSG GG+++DSGT +T L +AY
Sbjct: 327 LLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYGGIIVDSGTAITRLQTQAYNT 386
Query: 296 LRDEVMIRLE-GEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL-EKDSMFY 353
LRD + R ++ ++ D CY + S PTV F F GG L L K+ +
Sbjct: 387 LRDAFVSRTPYLKKTNGFALFDT-CYD-LSSQSRVTIPTVSFEFAGGKSLQLPPKNYLIP 444
Query: 354 QPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
FC A P + + S+IG + QQ V YD+ F C
Sbjct: 445 VDSVGTFCFAFAPTT-----SSLSIIGNVQQQGTRVHYDLANSVVGFSPHKC 491
>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
Length = 457
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 129/431 (29%), Positives = 196/431 (45%), Gaps = 43/431 (9%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQI------LPSN 54
IHRDS SPY +P +P H R L + A + + S
Sbjct: 37 FIHRDSARSPYRHPALSP-HARALAAARRSLRGEVLGRSYSGASPAAAPVSAADGGVESK 95
Query: 55 DISNS---LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC----RDCSPQLGTIFYPSR 107
I+ S L YVN+ G PP DTGS L+WV+C D +F P+R
Sbjct: 96 IITRSFEYLMYVNV--GTPPTQLLAIADTGSDLVWVNCSSSGGGLADADAGGNVVFQPTR 153
Query: 108 SSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTF-KNTDEST 166
SS+Y+ + C S C+ A C A C Y Y G T +STE +F +
Sbjct: 154 SSTYSQLSCQSNACQALSQASCDA-DSECQYQYSYGDGSRTIGVLSTETFSFVDGGGKGQ 212
Query: 167 IHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYL 220
+ V V FGC ++ F+ G+ GLG G SLVSQL ++ SYC+ +D +
Sbjct: 213 VRVPRVNFGCSTASAGTFRSDGLVGLGAGAFSLVSQLGATTHIDRKLSYCLIPSYDANS- 271
Query: 221 HNKLILGDGAIIDEGDA--TPL--QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
+ L G A++ E A TPL +D +Y + LE+++V G+ + D
Sbjct: 272 SSTLNFGSRAVVSEPGAASTPLVPSDVDSYYTVALESVAVGGQ------EVATHDSR--- 322
Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCY--HGIMSSDLKGFPTV 334
+++DSGT +T+L L E+ R++ ++++ +LCY G +D G P V
Sbjct: 323 IIVDSGTTLTFLDPALLGPLVTELERRIKLQRVQPPEQLLQLCYDVQGKSETDNFGIPDV 382
Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
F GGA + L ++ F + C+ + P S + S++G +AQQ ++VGYD+
Sbjct: 383 TLRFGGGAAVTLRPENTFSLLQEGTLCLVLVPVSESQ---PVSILGNIAQQNFHVGYDLD 439
Query: 395 RKQKTFQRMDC 405
+ TF DC
Sbjct: 440 ARTVTFAAADC 450
>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
Length = 485
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 126/439 (28%), Positives = 191/439 (43%), Gaps = 50/439 (11%)
Query: 1 LIHRDSILSPYNNPNE-NPAHRVQRGINISIARLAYLQEKIR------------------ 41
L+HRD++ N NE + A R+Q+ + AR+A + ++
Sbjct: 63 LVHRDAMKGNSNKNNELSYAERMQQRLKRDAARVAAINSRLELAVNGIKRSSLKPDSSSS 122
Query: 42 ---SHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ 98
+ + +Q+ ++ D + ++ I +G P Q +DTGS + W+ C PC DC Q
Sbjct: 123 FTMAESDFQSPVVSGMDQGSGEYFSRIGVGAPRRDQLMVLDTGSDVTWIQCEPCSDCYQQ 182
Query: 99 LGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLT 158
I+ P+ SSSY V C + C+ + CS C+Y Y G T +TE LT
Sbjct: 183 SDPIYNPALSSSYKLVGCQANLCQQLDVSGCS-RNGSCLYQVSYGDGSYTQGNFATETLT 241
Query: 159 FKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGR-----SSLVSQLNSSFSYCIGS 213
+Q+V GCG F + G S L + FSYC+
Sbjct: 242 LGGA-----PLQNVAIGCGHDNEGLFVGAAGLLGLGGGSLSFPSQLTDENGKIFSYCL-- 294
Query: 214 LHDPDYLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKR 270
+ + L G A+ + P+ +D YY++L ISV G+ML I+ ++F
Sbjct: 295 VDRDSESSSTLQFGRAAVPNGAVLAPMLKNSRLDTFYYVSLSGISVGGKMLSISDSVFGI 354
Query: 271 DDSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD--KLCYHGIMSSD 327
D SG GGV++DSGT VT L AY++LRD R + + S CY + S +
Sbjct: 355 DASGNGGVIVDSGTAVTRLQTAAYDSLRD--AFRAGTKNLPSTDGVSLFDTCYD-LSSKE 411
Query: 328 LKGFPTVRFHFRGGAKLAL-EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQF 386
PTV FHF GG ++L K+ + FC A P S + S++G + QQ
Sbjct: 412 SVDVPTVVFHFSGGGSMSLPAKNYLVPVDSMGTFCFAFAPTS-----SSLSIVGNIQQQG 466
Query: 387 YNVGYDIGRKQKTFQRMDC 405
V +D Q F C
Sbjct: 467 IRVSFDRANNQVGFAVNKC 485
>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 420
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 114/367 (31%), Positives = 172/367 (46%), Gaps = 44/367 (11%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ + +SIG PP + DTGS L W C PC +C Q +F P +S++Y + CDS+
Sbjct: 72 YLMELSIGTPPFKIYGIADTGSDLTWTSCVPCNNCYKQRNPMFDPQKSTTYRNISCDSKL 131
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C CS K RC YT Y T ++ E +T +T ++ ++ +VFGCG +
Sbjct: 132 CHKLDTGVCSPQK-RCNYTYAYASAAITRGVLAQETITLSSTKGKSVPLKGIVFGCGHNN 190
Query: 181 NRNFK--FSGIFGLGIGRSSLVSQLNSS-----FSYCIGSLHDPDYLHNKLILGDGA--- 230
F GI GLG G SL+SQ+ SS FS C+ H + +K+ G G+
Sbjct: 191 TGGFNDHEMGIIGLGGGPVSLISQMGSSFGGKRFSQCLVPFHTDVSVSSKMSFGKGSKVS 250
Query: 231 --------IIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSG 282
++ + D TP Y++TL ISV+ L N + ++ G + +DSG
Sbjct: 251 GKGVVSTPLVAKQDKTP-------YFVTLLGISVENTYLHFNGS--SQNVEKGNMFLDSG 301
Query: 283 TDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD---KLCYHGIMSSDLKGFPTVRFHFR 339
T T L + Y+ + +V R E PD +LCY ++L+G P + HF
Sbjct: 302 TPPTILPTQLYDQVVAQV--RSEVAMKPVTDDPDLGPQLCYR--TKNNLRG-PVLTAHFE 356
Query: 340 GGAKLALEKDSMFYQPRPDAFCMA-VNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQK 398
GA + L F P+ FC+ N +S Y NF AQ Y +G+D+ R+
Sbjct: 357 -GADVKLSPTQTFISPKDGVFCLGFTNTSSDGGVYGNF------AQSNYLIGFDLDRQVV 409
Query: 399 TFQRMDC 405
+F+ DC
Sbjct: 410 SFKPKDC 416
>gi|357449529|ref|XP_003595041.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484089|gb|AES65292.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 210
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 86/214 (40%), Positives = 125/214 (58%), Gaps = 25/214 (11%)
Query: 198 SLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVD 257
SL +Q++ FSYC+GSL D DY +N+LILG+ A + GD TP Q +G ++T+E IS+
Sbjct: 17 SLATQISKKFSYCMGSLTDKDYDYNQLILGEEAYL-AGDTTPFQVYNGVNHVTMEGISIG 75
Query: 258 GRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEV---MIRLEGEQMRSYSW 314
+ LDI P FK ++ V + YE L EV RL+ +++R
Sbjct: 76 QKSLDIAPGTFKMKNN---------------VNDVYELLCKEVRNLFQRLKFQEVRLQGS 120
Query: 315 PDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYV 374
P LCY G +S DLKGFP V F+F GGA + L+ + F Q + D FCM+V+P+
Sbjct: 121 PWALCYFGSVSRDLKGFPVVTFYFAGGAVIGLDTLNFFVQAKDDVFCMSVHPSH------ 174
Query: 375 NFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
+ S+IG++AQQ YNVGYD + + +DC++L
Sbjct: 175 DLSVIGLLAQQSYNVGYDKDKGLIYIESIDCQLL 208
>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 128/430 (29%), Positives = 189/430 (43%), Gaps = 48/430 (11%)
Query: 2 IHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILP-SNDISNSL 60
+H LS P + R+ R + + L L + S N +A+ S+ +++ L
Sbjct: 82 LHHLDALSSDETPQDLFNSRLARDAS-RVKSLTSLAAAVGSTNRTRARGPGFSSSVTSGL 140
Query: 61 ------FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYV 114
++ + +G P F +DTGS ++W+ C PC+ C Q +F P++S S+A +
Sbjct: 141 AQGSGEYFTRLGVGTPARYVFMVLDTGSDVVWIQCAPCKKCYSQTDPVFNPTKSRSFANI 200
Query: 115 PCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
PC S CR CS KH C+Y Y G T STE LTF+ T V V
Sbjct: 201 PCGSPLCRRLDSPGCSTKKHICLYQVSYGDGSFTYGEFSTETLTFRGT-----RVGRVAL 255
Query: 175 GCGFSTNRNF----------KFSGIFGLGIGRSSLVSQLNSSFSYCI---GSLHDPDYLH 221
GCG F + F IGR + + FSYC+ + P Y
Sbjct: 256 GCGHDNEGLFIGAAGLLGLGRGRLSFPSQIGR-----RFSRKFSYCLVDRSASSKPSY-- 308
Query: 222 NKLILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDG-RMLDINPNIFKRDDSG-GG 276
++ GD AI TPL +D YY+ L +SV G R+ I ++FK D +G GG
Sbjct: 309 --MVFGDSAISRTARFTPLVSNPKLDTFYYVELLGVSVGGTRVPGITASLFKLDSTGNGG 366
Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
V+IDSGT VT L + AY ALRD + ++ +S D C+ +++K PTV
Sbjct: 367 VIIDSGTSVTRLTRPAYVALRDAFRVGASNLKRAPEFSLFDT-CFDLSGKTEVK-VPTVV 424
Query: 336 FHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGR 395
HFRG + + +FC A S++G + QQ + V YD+
Sbjct: 425 LHFRGADVSLPASNYLIPVDNSGSFCFA-----FAGTMSGLSIVGNIQQQGFRVVYDLAA 479
Query: 396 KQKTFQRMDC 405
+ F C
Sbjct: 480 SRVGFAPRGC 489
>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
Length = 464
Score = 152 bits (383), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 127/429 (29%), Positives = 190/429 (44%), Gaps = 51/429 (11%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQ--------AQILP 52
L+ RD++ P+ H V + AR YL ++ S YQ ++++
Sbjct: 63 LVRRDAVTGS-TYPSRR--HAVLDLVARDNARAEYLASRL-SPAAYQPTGFSGSESKVVS 118
Query: 53 SNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYA 112
D + ++V + IG PP Q+ +D+GS ++WV C PC +C Q +F P+ S++++
Sbjct: 119 GLDEGSGEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLECYAQADPLFDPATSATFS 178
Query: 113 YVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
VPC S CR + C C Y Y G T ++ E LT T V+ V
Sbjct: 179 AVPCGSAVCRTLRTSGCGD-SGGCDYEVSYGDGSYTKGALALETLTLGGT-----AVEGV 232
Query: 173 VFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILG 227
GCG F +G+ GLG G SLV QL +FSYC+ S L+LG
Sbjct: 233 AIGCGHRNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRG-----AGSLVLG 287
Query: 228 DGAIIDEGDA-TPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSG 282
+ EG PL YY+ L I V L + ++F+ +D GGV++D+G
Sbjct: 288 RSEAVPEGAVWVPLVRNPQAPSFYYVGLSGIGVGDERLPLQEDLFQLTEDGAGGVVMDTG 347
Query: 283 TDVTWLVKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGF-----PTVRF 336
T VT L +EAY ALRD + + + S D CY DL G+ PTV F
Sbjct: 348 TAVTRLPQEAYAALRDAFVAAVGALPRAPGVSLLDT-CY------DLSGYTSVRVPTVSF 400
Query: 337 HFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
+F G A L L ++ + +C+A P+S S++G + Q+ + D
Sbjct: 401 YFDGAATLTLPARNLLLEVDGGIYCLAFAPSSSGP-----SILGNIQQEGIQITVDSANG 455
Query: 397 QKTFQRMDC 405
F C
Sbjct: 456 YIGFGPTTC 464
>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
Length = 524
Score = 152 bits (383), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 120/405 (29%), Positives = 178/405 (43%), Gaps = 46/405 (11%)
Query: 31 ARLAYLQEKIR------SHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSL 84
AR YL ++ + +++++ D + + V +S+G PP Q+ +D+GS +
Sbjct: 135 ARAEYLATRLSPAYQPPGFSGSESKVVSGLDEGSGEYLVRVSVGSPPTEQYLVVDSGSDV 194
Query: 85 LWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCS-AYKHRCIYTQLYL 143
+WV C PC +C Q +F P+ S++++ V C S CR P + C C Y Y
Sbjct: 195 MWVQCKPCLECYVQADPLFDPATSATFSGVSCGSAICRILPTSACGDGELGGCEYEVSYA 254
Query: 144 IGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQ 202
G T ++ E LT T V+ VV GCG F +G+ GLG G SLV Q
Sbjct: 255 DGSYTKGALALETLTLGGT-----AVEGVVIGCGHRNRGLFVGAAGLMGLGWGPMSLVGQ 309
Query: 203 L----NSSFSYCIGSLHD-----PDYLHNKLILGDGAIIDEGDA-TPL---QFIDGHYYI 249
L +FSYC+ S D L+LG + EG PL YY+
Sbjct: 310 LGGEVGGAFSYCLASRGGYGSGAADDDAGWLVLGRSEAVPEGAVWVPLVRNPRAPSFYYV 369
Query: 250 TLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQ 308
L I V L + +F+ +D G V++D+GT VT L +EAY ALRD + L G
Sbjct: 370 GLSGIEVGDERLPLQAGLFQLTEDGAGDVVMDTGTTVTRLPQEAYAALRDAFVGALAGAV 429
Query: 309 MRSYSWPDKL---CYHGIMSSDLKGF-----PTVRFHFRGGAKLALEKDSMFYQPRPDAF 360
R+ + CY DL G+ PTV F F G A+L L ++ + +
Sbjct: 430 PRAQGVSSSVLDTCY------DLSGYASVRVPTVSFCFDGDARLILAARNVLLEVDMGIY 483
Query: 361 CMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
C+A P+S S++G Q + D F +C
Sbjct: 484 CLAFAPSS-----SGLSIMGNTQQAGIQITVDSANGYIGFGPANC 523
>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 438
Score = 152 bits (383), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 125/419 (29%), Positives = 205/419 (48%), Gaps = 30/419 (7%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
LIHRDS SP+ P +N V ++ SI R+ + + + I D
Sbjct: 32 LIHRDSSKSPFYKPTQNKYQHVVDAVHRSINRVNHSNKNSLASTPESTVISYEGD----- 86
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ ++ S+G PP+ + +DTGS ++W+ C PC C Q F PS+SSSY + C S+
Sbjct: 87 YIMSYSVGTPPIKSYGIVDTGSDIVWLQCEPCEQCYNQTTPKFNPSKSSSYKNISCSSKL 146
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C+ C+ K C Y+ Y + +S E LT ++T + V GCG +
Sbjct: 147 CQSVRDTSCND-KKNCEYSINYGNQSHSQGDLSLETLTLESTTGRPVSFPKTVIGCGTNN 205
Query: 181 NRNFK--FSGIFGLGIGRSSLVSQLNSS----FSYCIG----SLHDPDYLHNKLILGDGA 230
+FK SG+ GLG G +SL++QL S FSYC+ +L + +KL GD A
Sbjct: 206 IGSFKRVSSGVVGLGGGPASLITQLGPSIGGKFSYCLVRMSITLKNMSMGSSKLNFGDVA 265
Query: 231 IIDEGD--ATPLQFIDGH--YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVT 286
I+ + +TP+ D YY+T+EA SV + ++ + + G ++IDS T VT
Sbjct: 266 IVSGHNVLSTPIVKKDHSFFYYLTIEAFSVGDKRVEFAGS--SKGVEEGNIIIDSSTIVT 323
Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
++ + Y L ++ + E++ + LCY+ + S + FP + HF+ GA + L
Sbjct: 324 FVPSDVYTKLNSAIVDLVTLERVDDPNQQFSLCYN-VSSDEEYDFPYMTAHFK-GADILL 381
Query: 347 EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ F + D C A P++ ++ G +QQ + VGYD+ +K +F+ +DC
Sbjct: 382 YATNTFVEVARDVLCFAFAPSN------GGAIFGSFSQQDFMVGYDLQQKTVSFKSVDC 434
>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
Length = 437
Score = 151 bits (382), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 111/355 (31%), Positives = 167/355 (47%), Gaps = 26/355 (7%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ +N++IG P MDTGS L+W C PC C Q IF P SSS++ +PC+S++
Sbjct: 96 YLMNVAIGTPASSLSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQY 155
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C+ P C + C YT Y G T +++TE TF+ T V ++ FGCG
Sbjct: 156 CQDLPSESC---YNDCQYTYGYGDGSSTQGYMATETFTFE-----TSSVPNIAFGCG-ED 206
Query: 181 NRNF---KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAI-IDEG 235
N+ F +G+ G+G G SL SQL FSYC+ + L LG A + EG
Sbjct: 207 NQGFGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCM--TSSGSSSPSTLALGSAASGVPEG 264
Query: 236 DATPL----QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVK 290
+ +YYITL+ I+V G L I + F+ +DD GG++IDSGT +T+L +
Sbjct: 265 SPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQ 324
Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS 350
+AY A+ ++ + S C+ P + F GG L L +++
Sbjct: 325 DAYNAVAQAFTDQINLSPVDESSSGLSTCFQLPSDGSTVQVPEISMQFDGGV-LNLGEEN 383
Query: 351 MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ P C+A+ +S S+ G + QQ V YD+ +F C
Sbjct: 384 VLISPAEGVICLAMGSSSQQ----GISIFGNIQQQETQVLYDLQNLAVSFVPTQC 434
>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
Length = 471
Score = 151 bits (381), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 128/434 (29%), Positives = 191/434 (44%), Gaps = 53/434 (12%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQ--------AQILP 52
L+ RD++ +P H V ++ AR YL ++ YQ ++++
Sbjct: 62 LVRRDAVT---GATYPSPRHAVLDLVSRDNARAEYLASRLSP--AYQPTDFFGSESKVVS 116
Query: 53 SNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYA 112
D + ++V + IG PP Q+ +D+GS ++WV C PC +C Q +F P+ S++++
Sbjct: 117 GLDEGSGEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLECYAQADPLFDPASSATFS 176
Query: 113 YVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
V C S CR + C C Y Y G T ++ E LT T V+ V
Sbjct: 177 AVSCGSAICRTLRTSGCGD-SGGCEYEVSYGDGSYTKGTLALETLTLGGT-----AVEGV 230
Query: 173 VFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCI----GSLHDPDYLHNK 223
GCG F +G+ GLG G SLV QL +FSYC+ GS
Sbjct: 231 AIGCGHRNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGGSGSGAADAAGS 290
Query: 224 LILGDGAIIDEGDA-TPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVM 278
L+LG + EG PL YY+ + I V L + +F+ +D GGGV+
Sbjct: 291 LVLGRSEAVPEGAVWVPLVRNPQAPSFYYVGVSGIGVGDERLPLQDGLFQLTEDGGGGVV 350
Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS--YSWPDKLCYHGIMSSDLKGF----- 331
+D+GT VT L +EAY ALRD + G R+ S D CY DL G+
Sbjct: 351 MDTGTAVTRLPQEAYAALRD-AFVGAVGALPRAPGVSLLDT-CY------DLSGYTSVRV 402
Query: 332 PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
PTV F+F G A L L ++ + +C+A P+S S++G + Q+ +
Sbjct: 403 PTVSFYFDGAATLTLPARNLLLEVDGGIYCLAFAPSS-----SGLSILGNIQQEGIQITV 457
Query: 392 DIGRKQKTFQRMDC 405
D F C
Sbjct: 458 DSANGYIGFGPATC 471
>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
protein [Arabidopsis thaliana]
gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 452
Score = 150 bits (380), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 121/424 (28%), Positives = 198/424 (46%), Gaps = 37/424 (8%)
Query: 9 SPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIG 68
SP+ +P + + + RL +L + + ++ ++ + ++V++ IG
Sbjct: 40 SPFPSPTQ--------ALALDTRRLHFLSLRRKPIPFVKSPVVSGAASGSGQYFVDLRIG 91
Query: 69 QPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ-LGTIFYPSRSSSYAYVPCDSEHCRYFPYA 127
QPP DTGS L+WV C CR+CS T+F+P SS+++ C CR P
Sbjct: 92 QPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDPVCRLVPKP 151
Query: 128 -RCSAYKH-----RCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS-- 179
R H C Y Y G TS + E + K + ++ V FGCGF
Sbjct: 152 DRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTSSGKEARLKSVAFGCGFRIS 211
Query: 180 ----TNRNFKFS-GIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGA 230
+ +F + G+ GLG G S SQL + FSYC+ + LI+G+G
Sbjct: 212 GQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPPTSYLIIGNGG 271
Query: 231 -IIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDV 285
I + TPL YY+ L+++ V+G L I+P+I++ DDSG GG ++DSGT +
Sbjct: 272 DGISKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGTTL 331
Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYH--GIMSSDLKGFPTVRFHFRGGAK 343
+L + AY ++ V R++ + + LC + G+ + K P ++F F GGA
Sbjct: 332 AFLAEPAYRSVIAAVRRRVKLPIADALTPGFDLCVNVSGVTKPE-KILPRLKFEFSGGAV 390
Query: 344 LALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
+ F + C+A+ S++ + V FS+IG + QQ + +D R + F R
Sbjct: 391 FVPPPRNYFIETEEQIQCLAIQ--SVDPK-VGFSVIGNLMQQGFLFEFDRDRSRLGFSRR 447
Query: 404 DCEV 407
C +
Sbjct: 448 GCAL 451
>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 479
Score = 150 bits (380), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 118/366 (32%), Positives = 174/366 (47%), Gaps = 23/366 (6%)
Query: 47 QAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPS 106
Q I+ + ++ + IG+P P + +DTGS + W+ C PC DC Q IF P+
Sbjct: 130 QGPIISGTSQGSGEYFSRVGIGKPSSPVYMVLDTGSDVNWIQCAPCADCYHQADPIFEPA 189
Query: 107 RSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDEST 166
S+SY+ + CD++ C+ + C + C+Y Y G T TE +T +
Sbjct: 190 SSTSYSPLSCDTKQCQSLDVSEC--RNNTCLYEVSYGDGSYTVGDFVTETITL-----GS 242
Query: 167 IHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKL 224
V +V GCG + F +G+ GLG G+ S SQ+N SSFSYC L D D
Sbjct: 243 ASVDNVAIGCGHNNEGLFIGAAGLLGLGGGKLSFPSQINASSFSYC---LVDRDSDSAST 299
Query: 225 ILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMID 280
+ + A++ PL + +D YY+ + +SV G +L I ++F+ D+SG GG++ID
Sbjct: 300 LEFNSALLPHAITAPLLRNRELDTFYYVGMTGLSVGGELLSIPESMFEMDESGNGGIIID 359
Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRG 340
SGT VT L AY ALRD + + + S CY + ++ PTV FH G
Sbjct: 360 SGTAVTRLQTAAYNALRDAFVKGTKDLPVTSEVALFDTCYDLSRKTSVE-VPTVTFHLAG 418
Query: 341 GAKLALEKDSMFYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKT 399
G L L + D FC A P S S+IG + QQ VG+D+
Sbjct: 419 GKVLPLPATNYLIPVDSDGTFCFAFAPTS-----SALSIIGNVQQQGTRVGFDLANSLVG 473
Query: 400 FQRMDC 405
F+ C
Sbjct: 474 FEPRQC 479
>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
Length = 336
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 100/345 (28%), Positives = 162/345 (46%), Gaps = 23/345 (6%)
Query: 78 MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCI 137
MDTGS L+W C PC C+ Q F +S++Y +PC S C C +K C+
Sbjct: 1 MDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRCASLSSPSC--FKKMCV 58
Query: 138 YTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG-FSTNRNFKFSGIFGLGIGR 196
Y Y T+ ++ E TF + + + ++ FGCG + SG+ G G G
Sbjct: 59 YQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLNAGDLANSSGMVGFGRGP 118
Query: 197 SSLVSQLNSS-FSYCIGSL--HDPDYLHNKLILGDGAIIDEGDATPLQ--------FIDG 245
SLVSQL S FSYC+ S P L+ + + + + +P+Q +
Sbjct: 119 LSLVSQLGPSRFSYCLTSYLSATPSRLYFG-VYANLSSTNTSSGSPVQSTPFVINPALPN 177
Query: 246 HYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRL 304
Y+++L+AIS+ ++L I+P +F +D G GGV+IDSGT +TWL ++AYEA+R ++ +
Sbjct: 178 MYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVSAI 237
Query: 305 EGEQMRSYSWPDKLCYHGIMSSDLK-GFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMA 363
M C+ ++ P + FHF L ++ M C+
Sbjct: 238 PLPAMNDTDIGLDTCFQWPPPPNVTVTVPDLVFHFDSANMTLLPENYMLIASTTGYLCLV 297
Query: 364 VNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
+ P + ++IG QQ ++ YDIG +F C+++
Sbjct: 298 MAPTGVG------TIIGNYQQQNLHLLYDIGNSFLSFVPAPCDII 336
>gi|15222357|ref|NP_174430.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322538|gb|AAG51267.1|AC027135_8 chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|67633408|gb|AAY78629.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332193236|gb|AEE31357.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 445
Score = 147 bits (372), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 134/427 (31%), Positives = 192/427 (44%), Gaps = 40/427 (9%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNS- 59
LIHRDS SP NP+ + R+ A+L+ RS L S ISN
Sbjct: 33 LIHRDSPHSPLYNPHHTVSDRLNA---------AFLRSISRSRRFTTKTDLQSGLISNGG 83
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSE 119
++++ISIG PP F DTGS L WV C PC+ C Q +F +SS+Y CDS+
Sbjct: 84 EYFMSISIGTPPSKVFAIADTGSDLTWVQCKPCQQCYKQNSPLFDKKKSSTYKTESCDSK 143
Query: 120 HCRYFPYAR--CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
C+ C K C Y Y T V+TE ++ ++ S++ VFGCG
Sbjct: 144 TCQALSEHEEGCDESKDICKYRYSYGDNSFTKGDVATETISIDSSSGSSVSFPGTVFGCG 203
Query: 178 FSTNRNFKFSGIFGLGIGRS--SLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAI 231
++ F+ +G +G+G SLVSQL SS FSYC+ + + LG +I
Sbjct: 204 YNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIGKKFSYCLSHTAATTNGTSVINLGTNSI 263
Query: 232 IDEGD------ATPLQFID--GHYYITLEAISVDGRMLDINPNIF----KRDDSGGGVMI 279
TPL D +Y++TLEA++V L + K G ++I
Sbjct: 264 PSNPSKDSATLTTPLIQKDPETYYFLTLEAVTVGKTKLPYTGGGYGLNGKSSKRTGNIII 323
Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLK-GFPTVRFHF 338
DSGT +T L Y+ V + G + S P L H S D + G P + HF
Sbjct: 324 DSGTTLTLLDSGFYDDFGTAVEESVTG--AKRVSDPQGLLTHCFKSGDKEIGLPAITMHF 381
Query: 339 RGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQK 398
A + L + F + D C+++ P + ++ G M Q + VGYD+ K
Sbjct: 382 T-NADVKLSPINAFVKLNEDTVCLSMIPTT------EVAIYGNMVQMDFLVGYDLETKTV 434
Query: 399 TFQRMDC 405
+FQRMDC
Sbjct: 435 SFQRMDC 441
>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 417
Score = 147 bits (371), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 118/364 (32%), Positives = 169/364 (46%), Gaps = 28/364 (7%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ + ++IG PPVP DTGS L W C PC+ C PQ ++ PS SS+++ VPC S
Sbjct: 66 YLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPVPCSSAT 125
Query: 121 CRYFPYAR---CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT-DESTIHVQDVVFGC 176
C P R CS C Y Y G + + TE LT ++ T+ V V FGC
Sbjct: 126 C--LPTWRSRNCSNPSSPCRYIYSYSDGAYSVGILGTETLTIGSSVPGQTVSVGSVAFGC 183
Query: 177 GFSTNRN-FKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
G + +G GLG G SL++QL FSYC+ + + + LG A +
Sbjct: 184 GTDNGGDSLNSTGTVGLGRGTLSLLAQLGVGKFSYCLTDFFN-STMDSPFFLGTLAELAP 242
Query: 235 G----DATPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVT 286
G +TPL Y++ L+ IS+ L I F R D GG+M+DSGT T
Sbjct: 243 GPGTVQSTPLLQSPLNPSRYFVNLQGISLGDVRLPIPNGTFDLRADGNGGMMVDSGTTFT 302
Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-PTVRFHFRGGAKLA 345
L K + + D V +L G+ + S D C+ S D + F P + HF GGA +
Sbjct: 303 ILAKSGFREVVDRVA-QLLGQPPVNASSLDSPCFP---SPDGEPFMPDLVLHFAGGADMR 358
Query: 346 LEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
L +D+ M Y +FC+ +I +S +G QQ + +D+ Q +F D
Sbjct: 359 LHRDNYMSYNEDDSSFCL-----NIVGSPSTWSRLGNFQQQNIQMLFDMTVGQLSFLPTD 413
Query: 405 CEVL 408
C L
Sbjct: 414 CSKL 417
>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 543
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 118/420 (28%), Positives = 199/420 (47%), Gaps = 46/420 (10%)
Query: 22 VQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL----FYVNISIGQPPVPQFTA 77
+Q+ N++ A +A L+ S + I+ + + SL +++++ +G PP +
Sbjct: 131 IQQQNNLANAFVASLES---SKGEFSGNIMATLESGASLGTGEYFLDMFVGTPPKHVWLI 187
Query: 78 MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYF----PYARCSAYK 133
+DTGS L W+ C PC DC Q G+ +YP SS+Y + C C+ P C A
Sbjct: 188 LDTGSDLSWIQCDPCYDCFEQNGSHYYPKDSSTYRNISCYDPRCQLVSSSDPLQHCKAEN 247
Query: 134 HRCIYTQLYLIGPETSVFVSTE----QLTFKNTDESTIHVQDVVFGCGFSTNRNFKF--S 187
C Y Y G T+ ++E LT+ N E V DV+FGCG N+ F + S
Sbjct: 248 QTCPYFYDYADGSNTTGDFASETFTVNLTWPNGKEKFKQVVDVMFGCG-HWNKGFFYGAS 306
Query: 188 GIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILG-DGAIIDEGDATPLQF 242
G+ GLG G S SQ+ S SFSYC+ L + +KLI G D +++ +
Sbjct: 307 GLLGLGRGPISFPSQIQSIYGHSFSYCLTDLFSNTSVSSKLIFGEDKELLNNHNLNFTTL 366
Query: 243 IDGH-------YYITLEAISVDGRMLDINPNIFKRDDS------GGGVMIDSGTDVTWLV 289
+ G YY+ +++I V G +LDI+ + GGG +IDSG+ +T+
Sbjct: 367 LAGEETPDETFYYLQIKSIMVGGEVLDISEQTWHWSSEGAAADAGGGTIIDSGSTLTFFP 426
Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPDKLCYH---GIMSSDLKGFPTVRFHFRGGAKLAL 346
AY+ +++ +++ +Q+ + + CY+ +M +L P HF G
Sbjct: 427 DSAYDIIKEAFEKKIKLQQIAADDFVMSPCYNVSGAMMQVEL---PDFGIHFADGGVWNF 483
Query: 347 EKDSMFYQPRPD-AFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
++ FYQ PD C+A+ + + ++IG + QQ +++ YD+ R + + C
Sbjct: 484 PAENYFYQYEPDEVICLAIMKTP---NHSHLTIIGNLLQQNFHILYDVKRSRLGYSPRRC 540
>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 124/433 (28%), Positives = 187/433 (43%), Gaps = 52/433 (12%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNT--------YQAQILP 52
L H D+ LS P+E + R+QR + + +A L +I N + + ++
Sbjct: 76 LDHIDA-LSSNKTPDELFSSRLQRD-SRRVKSIATLAAQIPGRNVTHAPRPGGFSSSVVS 133
Query: 53 SNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYA 112
+ ++ + +G P + +DTGS ++W+ C PCR C Q IF P +S +YA
Sbjct: 134 GLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYA 193
Query: 113 YVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
+PC S HCR A C+ + C+Y Y G T STE LTF+ V+ V
Sbjct: 194 TIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRN-----RVKGV 248
Query: 173 VFGCGFSTNRNF--------------KFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPD 218
GCG F F G G + N FSYC+
Sbjct: 249 ALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTG---------HRFNQKFSYCLVD-RSAS 298
Query: 219 YLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDG-RMLDINPNIFKRDDSG 274
+ ++ G+ A+ TPL +D YY+ L ISV G R+ + ++FK D G
Sbjct: 299 SKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIG 358
Query: 275 -GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFP 332
GGV+IDSGT VT L++ AY A+RD + + ++ +S D C+ +++K P
Sbjct: 359 NGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDT-CFDLSNMNEVK-VP 416
Query: 333 TVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
TV HFRG + + FC A S+IG + QQ + V YD
Sbjct: 417 TVVLHFRGADVSLPATNYLIPVDTNGKFCFA-----FAGTMGGLSIIGNIQQQGFRVVYD 471
Query: 393 IGRKQKTFQRMDC 405
+ + F C
Sbjct: 472 LASSRVGFAPGGC 484
>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
Length = 459
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 118/372 (31%), Positives = 170/372 (45%), Gaps = 31/372 (8%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ + ++IG PPVP DTGS L W C PC+ C PQ I+ + S+S++ VPC S
Sbjct: 95 YLMELAIGTPPVPFVALADTGSDLTWTQCKPCKLCFPQDTPIYDTAASASFSPVPCASAT 154
Query: 121 CRYFPYAR----CSAYKHR-CIYTQLYLIGPETSVFVSTEQLTFKNTDEST----IHVQD 171
C P R C+A C Y Y G ++ + TE LTF + + V
Sbjct: 155 C--LPIWRSSRNCTATTTSPCRYRYAYDDGAYSAGVLGTETLTFAGSSPGAPGPGVSVGG 212
Query: 172 VVFGCGF-STNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCI-----GSLHDPDYLHNKL 224
V FGCG + ++ +G GLG G SLV+QL FSYC+ SL P +
Sbjct: 213 VAFGCGVDNGGLSYNSTGTVGLGRGSLSLVAQLGVGKFSYCLTDFFNTSLGSPVLFGSLA 272
Query: 225 ILGDGAIIDEGDATPLQFIDG-----HYYITLEAISV-DGRMLDINPNIFKRDDSGGGVM 278
L + I + G YY++LE IS+ D R+ N RDD GG++
Sbjct: 273 ELAAPSTIGGAAVQSTPLVQGPYNPSRYYVSLEGISLGDARLPIPNGTFDLRDDGSGGMI 332
Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS-DLKGFPTVRFH 337
+DSGT T LV+ A+ + + V L + + S D C+ L P + H
Sbjct: 333 VDSGTIFTVLVESAFRVVVNHVAGVLNQPVVNASSL-DSPCFPATAGEQQLPDMPDMLLH 391
Query: 338 FRGGAKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
F GGA + L +D+ M + +FC+ N A + Y S++G QQ + +DI
Sbjct: 392 FAGGADMRLHRDNYMSFNQESSSFCL--NIAGAPSAY--GSILGNFQQQNIQMLFDITVG 447
Query: 397 QKTFQRMDCEVL 408
Q +F DC L
Sbjct: 448 QLSFVPTDCSKL 459
>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 127/404 (31%), Positives = 193/404 (47%), Gaps = 36/404 (8%)
Query: 20 HRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMD 79
R+Q GI + RL L + + ++ A+I N F +N++IG PP MD
Sbjct: 60 QRIQHGIKRANHRLERLNAMVLAASS-NAEINSPVLSGNGEFLMNLAIGTPPETYSAIMD 118
Query: 80 TGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYT 139
TGS L+W C PC C Q IF P +SSS++ + C S+ C+ P + CS C Y
Sbjct: 119 TGSDLIWTQCKPCTQCFDQPSPIFDPKKSSSFSKLSCSSQLCKALPQSSCS---DSCEYL 175
Query: 140 QLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF---SGIFGLGIGR 196
Y T ++TE TF + + +V FGCG N F SG+ GLG G
Sbjct: 176 YTYGDYSSTQGTMATETFTFGK-----VSIPNVGFGCG-EDNEGDGFTQGSGLVGLGRGP 229
Query: 197 SSLVSQLNSS-FSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT---------PLQFIDGH 246
SLVSQL + FSYC+ S+ D + L++G A ++ A PLQ
Sbjct: 230 LSLVSQLKEAKFSYCLTSIDDTK--TSTLLMGSLASVNGTSAAIRTTPLIQNPLQ--PSF 285
Query: 247 YYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLE 305
YY++LE ISV G L I + F+ +DD GG++IDSGT +T+L + A++ ++ E ++
Sbjct: 286 YYLSLEGISVGGTRLPIKESTFQLQDDGTGGLIIDSGTTITYLEESAFDLVKKEFTSQMG 345
Query: 306 GEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS-MFYQPRPDAFCMAV 364
S + +LCY+ + P + HF GA L L ++ M C+A+
Sbjct: 346 LPVDNSGATGLELCYNLPSDTSELEVPKLVLHFT-GADLELPGENYMIADSSMGVICLAM 404
Query: 365 NPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
+ S+ G + QQ V +D+ ++ +F +C L
Sbjct: 405 GSSG------GMSIFGNVQQQNMFVSHDLEKETLSFLPTNCGQL 442
>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 485
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 124/433 (28%), Positives = 186/433 (42%), Gaps = 52/433 (12%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNT--------YQAQILP 52
L H D+ LS P E + R+QR + + +A L +I N + + ++
Sbjct: 76 LDHIDA-LSSNKTPQELFSSRLQRD-SRRVKSIATLAAQIPGRNVTHAPRTGGFSSSVVS 133
Query: 53 SNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYA 112
+ ++ + +G P + +DTGS ++W+ C PCR C Q IF P +S +YA
Sbjct: 134 GLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYA 193
Query: 113 YVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
+PC S HCR A C+ + C+Y Y G T STE LTF+ V+ V
Sbjct: 194 TIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRN-----RVKGV 248
Query: 173 VFGCGFSTNRNF--------------KFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPD 218
GCG F F G G + N FSYC+
Sbjct: 249 ALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTG---------HRFNQKFSYCLVD-RSAS 298
Query: 219 YLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDG-RMLDINPNIFKRDDSG 274
+ ++ G+ A+ TPL +D YY+ L ISV G R+ + ++FK D G
Sbjct: 299 SKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVELLGISVGGTRVPGVAASLFKLDQIG 358
Query: 275 -GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFP 332
GGV+IDSGT VT L++ AY A+RD + + ++ +S D C+ +++K P
Sbjct: 359 NGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKALKRAPDFSLFDT-CFDLSNMNEVK-VP 416
Query: 333 TVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
TV HFRG + + FC A S+IG + QQ + V YD
Sbjct: 417 TVVLHFRGADVSLPATNYLIPVDTNGKFCFA-----FAGTMGGLSIIGNIQQQGFRVVYD 471
Query: 393 IGRKQKTFQRMDC 405
+ + F C
Sbjct: 472 LASSRVGFAPGGC 484
>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
gi|194702684|gb|ACF85426.1| unknown [Zea mays]
gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
Length = 439
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 114/369 (30%), Positives = 174/369 (47%), Gaps = 35/369 (9%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYVPCDSE 119
F + ++IG PP+P DTGS L+W C PC R C Q ++ PS S++++ +PC+S
Sbjct: 85 FLMTLAIGTPPLPFLAIADTGSDLIWTQCAPCSRQCFQQPTPLYNPSSSTTFSALPCNSS 144
Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTF-KNTDESTIHVQDVVFGC-- 176
C A C+Y Y G T VF TE TF +T + V + FGC
Sbjct: 145 ------LGLC-APACACMYNMTYGSG-WTYVFQGTETFTFGSSTPADQVRVPGIAFGCSN 196
Query: 177 ---GFSTNRNFKFSGIFGLGIGRSSLVSQLNS-SFSYCIGSLHDPDYLHNKLILGDGAII 232
GF+ + SG+ GLG G SLVSQL + FSYC+ D + L+ ++
Sbjct: 197 ASSGFNASSA---SGLVGLGRGSLSLVSQLGAPKFSYCLTPYQDTNSTSTLLLGPSASLN 253
Query: 233 DEGDATPLQFIDG----HYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTW 287
D G + F+ +YY+ L IS+ L I PN F + D GG++IDSGT +T
Sbjct: 254 DTGVVSSTPFVASPSSIYYYLNLTGISLGTTALPIPPNAFSLKADGTGGLIIDSGTTITM 313
Query: 288 LVKEAYEALRDEV--MIRLEGEQMRSYSWPDKLCYHGIMSSDL-KGFPTVRFHFRGGAKL 344
L AY+ +R V ++ L + + D LC+ S+ P++ HF GA +
Sbjct: 314 LGNTAYQQVRAAVLSLVTLPTTDGSAATGLD-LCFELPSSTSAPPSMPSMTLHFD-GADM 371
Query: 345 ALEKDSMFY-----QPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKT 399
L D+ +C+A+ + + V S++G QQ ++ YD+G++ +
Sbjct: 372 VLPADNYMMSLSDPDSDSSLWCLAMQNQTDTDGVV-VSILGNYQQQNMHILYDVGKETLS 430
Query: 400 FQRMDCEVL 408
F C L
Sbjct: 431 FAPAKCSTL 439
>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 451
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 116/379 (30%), Positives = 177/379 (46%), Gaps = 40/379 (10%)
Query: 55 DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYV 114
D S + +N+SIG PPV DTGSSL+W C PC +C+ + F P+ SS+++ +
Sbjct: 84 DNSAGAYNMNLSIGTPPVTFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSSTFSKL 143
Query: 115 PCDSEHCRYF--PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
PC S C++ PY C+A C+Y Y +G T+ +++TE L V
Sbjct: 144 PCASSLCQFLTSPYLTCNATG--CVYYYPYGMG-FTAGYLATETLHVGGAS-----FPGV 195
Query: 173 VFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAI 231
FGC SGI GLG SLVSQ+ FSYC+ S D D + ++ G A
Sbjct: 196 AFGCSTENGVGNSSSGIVGLGRSPLSLVSQVGVGRFSYCLRS--DADAGDSPILFGSLAK 253
Query: 232 IDEGD--ATPL-----QFIDGHYYITLEAISVDGRMLDINPNIF---KRDDSG--GGVMI 279
+ G+ +TPL +YY+ L I+V L + F + +G GG ++
Sbjct: 254 VTGGNVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPVTSTTFGFTRGAGAGLVGGTIV 313
Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK----LCYHGIMSSDLKG--FPT 333
DSGT +T+LVKE Y ++ + ++ + + + LC+ + G PT
Sbjct: 314 DSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFDLCFDATAAGGGSGVPVPT 373
Query: 334 VRFHFRGGAKLALEKDSMF------YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFY 387
+ F GGA+ A+ + S Q R C+ V PAS ++ S+IG + Q
Sbjct: 374 LVLRFAGGAEYAVRRRSYVGVVAVDSQGRAAVECLLVLPAS---EKLSISIIGNVMQMDL 430
Query: 388 NVGYDIGRKQKTFQRMDCE 406
+V YD+ +F DC
Sbjct: 431 HVLYDLDGGMFSFAPADCA 449
>gi|297846526|ref|XP_002891144.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297336986|gb|EFH67403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 445
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 131/427 (30%), Positives = 192/427 (44%), Gaps = 40/427 (9%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNS- 59
LIHRDS SP NP + R+ A+L+ RS L S ISN
Sbjct: 33 LIHRDSPHSPLYNPQHTVSDRLNA---------AFLRSISRSRRFSTKTDLQSGLISNGG 83
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSE 119
++++ISIG PP DTGS L WV C PC+ C Q +F +SS+Y CDS
Sbjct: 84 EYFMSISIGTPPSKFLAIADTGSDLTWVQCKPCQQCYKQNTPLFDKKKSSTYKTESCDSI 143
Query: 120 HCRYFPYAR--CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
C C ++ C Y Y T V+TE ++ ++ S + FGCG
Sbjct: 144 TCNALSEHEEGCDESRNACKYRYSYGDESFTKGEVATETISIDSSSGSPVSFPGTAFGCG 203
Query: 178 FSTNRNFKFSGIFGLGIGRS--SLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAI 231
++ F+ +G +G+G SLVSQL SS FSYC+ + + LG ++
Sbjct: 204 YNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIGKKFSYCLSHTSATTNGTSVINLGTNSM 263
Query: 232 IDEGD------ATPLQFID--GHYYITLEAISVDGRMLDINP----NIFKRDDSGGGVMI 279
+ TPL D +Y++TLEAI+V L ++ ++ G ++I
Sbjct: 264 TSKPSKDSAILTTPLIQKDPETYYFLTLEAITVGKTKLPYTGGGGYSLNRKSKKTGNIII 323
Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLK-GFPTVRFHF 338
DSGT +T L Y+ V + G + S P + H S D + G PT+ HF
Sbjct: 324 DSGTTLTLLDSGFYDDFGAVVEESVTG--AKRVSDPQGILTHCFKSGDKEIGLPTITMHF 381
Query: 339 RGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQK 398
GA + L + F + D C+++ P + ++ G M Q + VGYD+ K
Sbjct: 382 T-GADVKLSPINSFVKLSEDIVCLSMIPTT------EVAIYGNMVQMDFLVGYDLETKTV 434
Query: 399 TFQRMDC 405
+FQRMDC
Sbjct: 435 SFQRMDC 441
>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 124/420 (29%), Positives = 187/420 (44%), Gaps = 36/420 (8%)
Query: 11 YNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYV-NISIGQ 69
++NP+ + V+ + + R A ++ S D+ N Y+ ++IG
Sbjct: 37 HSNPDVSATEFVRDALRRDMHRHARFTRELASSGDRTVAAPTRKDLPNGGEYIMTLAIGT 96
Query: 70 PPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYVPCDS--EHCRYF-- 124
PP+ DTGS L+W C PC C Q G + PS S+++ +PC+S C
Sbjct: 97 PPLSYPAIADTGSDLIWTQCAPCGSQCFKQAGQPYNPSSSTTFGVLPCNSSVSMCAALAG 156
Query: 125 --PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNR 182
P CS C+Y Q Y G T+ S E TF +T V + FGC +++
Sbjct: 157 PSPPPGCS-----CMYNQTYGTG-WTAGIQSVETFTFGSTPADQTRVPGIAFGCSNASSD 210
Query: 183 NFKFS-GIFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPL 240
++ S G+ GLG G SLVSQL + FSYC+ D + + L+LG A ++
Sbjct: 211 DWNGSAGLVGLGRGSMSLVSQLGAGMFSYCLTPFQDANS-TSTLLLGPSAALNGTGVLTT 269
Query: 241 QFI--------DGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKE 291
F+ +YY+ L IS+ L I PN F R D GG++IDSGT +T LV
Sbjct: 270 PFVASPSKAPMSTYYYLNLTGISIGTTALSIPPNAFALRTDGTGGLIIDSGTTITSLVDA 329
Query: 292 AYEALRD--EVMIRLEGEQMRSYSWPDKLCYHGIM-SSDLKGFPTVRFHFRGGAKLALEK 348
AY+ +R E ++ L + D LC+ +S P++ FHF GA + L
Sbjct: 330 AYQQVRAAIESLVTLPVADGSDSTGLD-LCFALTSETSTPPSMPSMTFHFD-GADMVLPV 387
Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
D+ +C+A+ N S G QQ ++ YDI + +F C L
Sbjct: 388 DNYMILGS-GVWCLAMR----NQTVGAMSTFGNYQQQNVHLLYDIHEETLSFAPAKCSTL 442
>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 365
Score = 145 bits (366), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 106/361 (29%), Positives = 156/361 (43%), Gaps = 28/361 (7%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ + +G P +DTGS L WV C PC C Q +F P+ S+S+ + C S
Sbjct: 13 YLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGKCYSQNDALFLPNTSTSFTKLACGSAL 72
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C P+ C+ + C+Y Y G T+ + +T + V + FGCG
Sbjct: 73 CNGLPFPMCN--QTTCVYWYSYGDGSLTTGDFVYDTITMDGINGQKQQVPNFAFGCGHDN 130
Query: 181 NRNFKFS-GIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
+F + GI GLG G S SQL N FSYC+ P + L+ GD A+
Sbjct: 131 EGSFAGADGILGLGQGPLSFHSQLKSVYNGKFSYCLVDWLAPPTQTSPLLFGDAAVPILP 190
Query: 236 DATPLQF-----IDGHYYITLEAISVDGRMLDINPNIFKRDDSGG-GVMIDSGTDVTWLV 289
D L + +YY+ L ISV +L+I+ +F D GG G + DSGT VT L
Sbjct: 191 DVKYLPILANPKVPTYYYVKLNGISVGDNLLNISSTVFDIDSVGGAGTIFDSGTTVTQLA 250
Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPD-----KLCYHGIMSSDLKGFPTVRFHFRGGAKL 344
+ AY+ EV+ + M D LC G L P + FHF GG +
Sbjct: 251 EAAYK----EVLAAMNASTMAYSRKIDDISRLDLCLSGFPKDQLPTVPAMTFHFEGGDMV 306
Query: 345 ALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
+ Y ++C A+ + + ++IG + QQ + V YD ++ F D
Sbjct: 307 LPPSNYFIYLESSQSYCFAMTSSP------DVNIIGSVQQQNFQVYYDTAGRKLGFVPKD 360
Query: 405 C 405
C
Sbjct: 361 C 361
>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 536
Score = 145 bits (365), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 123/417 (29%), Positives = 198/417 (47%), Gaps = 46/417 (11%)
Query: 22 VQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL----FYVNISIGQPPVPQFTA 77
+Q+ N++ A +A L+ S + + I+ + + SL +++++ +G PP +
Sbjct: 130 IQQQNNLANAVVASLKS---SKDEFSGNIMATLESGASLGTGEYFIDMFVGTPPKHVWLI 186
Query: 78 MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYF----PYARCSAYK 133
+DTGS L W+ C PC DC Q G + P+ SSSY + C C+ P C
Sbjct: 187 LDTGSDLSWIQCDPCYDCFEQNGPHYNPNESSSYRNISCYDPRCQLVSSPDPLQHCKTEN 246
Query: 134 HRCIYTQLYLIGPETS----VFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF--KFS 187
C Y Y G T+ + T LT+ N E HV DV+FGCG N+ F
Sbjct: 247 QTCPYFYDYADGSNTTGDFALETFTVNLTWPNGKEKFKHVVDVMFGCG-HWNKGFFHGAG 305
Query: 188 GIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILG-DGAIIDE-------- 234
G+ GLG G S SQL S SFSYC+ L + +KLI G D +++
Sbjct: 306 GLLGLGRGPLSFPSQLQSIYGHSFSYCLTDLFSNTSVSSKLIFGEDKELLNHHNLNFTKL 365
Query: 235 --GDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKE 291
G+ TP D YY+ +++I V G +LDI + G GG +IDSG+ +T+
Sbjct: 366 LAGEETP---DDTFYYLQIKSIVVGGEVLDIPEKTWHWSSEGVGGTIIDSGSTLTFFPDS 422
Query: 292 AYEALRDEVMIRLEGEQMRSYSWPDKLCYH--GIMSSDLKGFPTVRFHFRGGAKLALEKD 349
AY+ +++ +++ +Q+ + + CY+ G M +L P HF GA +
Sbjct: 423 AYDVIKEAFEKKIKLQQIAADDFIMSPCYNVSGAMQVEL---PDYGIHFADGAVWNFPAE 479
Query: 350 SMFYQPRPD-AFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ FYQ PD C+A+ + + ++IG + QQ +++ YD+ R + + C
Sbjct: 480 NYFYQYEPDEVICLAILKTP---NHSHLTIIGNLLQQNFHILYDVKRSRLGYSPRRC 533
>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 455
Score = 145 bits (365), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 117/437 (26%), Positives = 202/437 (46%), Gaps = 45/437 (10%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
L+H+ +P+ +P+E A + R +++ + ++ N++++ ++ +
Sbjct: 33 LLHK----TPFTSPSEALAFDINRRLSL---LHHHRHQQQHKQNSFRSPVISGASSGSGQ 85
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ-LGTIFYPSRSSSYAYVPCDSE 119
++V++ IG PP DTGS L+WV C PCR+CS + G+ F+ S++Y+ + C S
Sbjct: 86 YFVSLRIGTPPQTLLLVADTGSDLIWVKCSPCRNCSHRSPGSAFFARHSTTYSAIHCYSP 145
Query: 120 HCRYFPYAR---CSAYK--HRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
C+ P+ C+ + C Y Y T+ F S E LT + + + F
Sbjct: 146 QCQLVPHPHPNPCNRTRLHSPCRYQYTYADSSTTTGFFSKEALTLNTSTGKVKKLNGLSF 205
Query: 175 GCGFS------TNRNFKFS-GIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNK 223
GCGF T +F+ + G+ GLG S SQL S FSYC+ +
Sbjct: 206 GCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGRRFGSKFSYCLMDYTLSPPPTSF 265
Query: 224 LILGDG---AIIDEG--DATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG- 274
L +G A+ +G TPL YYI ++ + V+G L INP+++ DD G
Sbjct: 266 LTIGGAQNVAVSKKGIMSFTPLLINPLSPTFYYIAIKGVYVNGVKLPINPSVWSIDDLGN 325
Query: 275 GGVMIDSGTDVTWLVKEAY----EALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG 330
GG +IDSGT +T++ + AY +A + V + E + LC + +
Sbjct: 326 GGTIIDSGTTLTFITEPAYTEILKAFKKRVKLPSPAEPTPGF----DLCMN-VSGVTRPA 380
Query: 331 FPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
P + F+ GG+ + + F + C+AV P S + FS++G + QQ + +
Sbjct: 381 LPRMSFNLAGGSVFSPPPRNYFIETGDQIKCLAVQPVSQDG---GFSVLGNLMQQGFLLE 437
Query: 391 YDIGRKQKTFQRMDCEV 407
+D + + F R C +
Sbjct: 438 FDRDKSRLGFTRRGCAL 454
>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
Length = 451
Score = 145 bits (365), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 118/430 (27%), Positives = 197/430 (45%), Gaps = 48/430 (11%)
Query: 10 PYNNPNENPAHRVQRGINISIARLAYLQEKIRS-HNTYQAQILPSN----DISNSLFYVN 64
PY + + V+ G S R A+L K+ + + + P++ +S+ +
Sbjct: 35 PYAGSSLSRHDVVRHGARASKTRAAWLTAKLAGVLSNRRGGVSPADVRLSPLSDQGHSLT 94
Query: 65 ISIGQPPVPQFTAMDTGSSLLWVHC-------YPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
+ IG PP P+ +DTGS L+W C R SP ++ P SS++A++PC
Sbjct: 95 VGIGTPPQPRKLIVDTGSDLIWTQCKLSSSTAVAARHGSPP---VYDPGESSTFAFLPCS 151
Query: 118 SEHCR--YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
C+ F + C++ K+RC+Y +Y V S E TF ++ + FG
Sbjct: 152 DRLCQEGQFSFKNCTS-KNRCVYEDVYGSAAAVGVLAS-ETFTFGARRAVSLRLG---FG 206
Query: 176 CG-FSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIID 233
CG S +GI GL SL++QL FSYC+ D + L+ G A +
Sbjct: 207 CGALSAGSLIGATGILGLSPESLSLITQLKIQRFSYCLTPFADKKT--SPLLFGAMADLS 264
Query: 234 EGDAT-PLQFI--------DGHYYITLEAISVDGRMLDI-NPNIFKRDDSGGGVMIDSGT 283
T P+Q +YY+ L IS+ + L + ++ R D GGG ++DSG+
Sbjct: 265 RHKTTRPIQTTAIVSNPVKTVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDSGS 324
Query: 284 DVTWLVKEAYEALRDEVM--IRLEGEQMRSYSWPDKLCY-----HGIMSSDLKGFPTVRF 336
V +LV+ A+EA+++ VM +RL + +LC+ + + P +
Sbjct: 325 TVAYLVEAAFEAVKEAVMDVVRLPVANRTVEDY--ELCFVLPRRTAAAAMEAVQVPPLVL 382
Query: 337 HFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
HF GGA + L +D+ F +PR C+AV + + S+IG + QQ +V +D+
Sbjct: 383 HFDGGAAMVLPRDNYFQEPRAGLMCLAVGKTTDGS---GVSIIGNVQQQNMHVLFDVQHH 439
Query: 397 QKTFQRMDCE 406
+ +F C+
Sbjct: 440 KFSFAPTQCD 449
>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 431
Score = 144 bits (364), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 116/393 (29%), Positives = 181/393 (46%), Gaps = 22/393 (5%)
Query: 30 IARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC 89
+ R A + ++R+ + Y A P + + ++IG PPVP DTGS L W C
Sbjct: 47 LMRRAAHRSRLRALSGYDANS-PRLHSVQVEYLMELAIGTPPVPFVALADTGSDLTWTQC 105
Query: 90 YPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYAR-CSAYKHRCIYTQLYLIGPET 148
PC+ C PQ ++ PS SS+++ VPC S C +R CS C Y Y G +
Sbjct: 106 QPCKLCFPQDTPVYDPSASSTFSPVPCSSATCLPVLRSRNCSTPSSLCRYGYSYSDGAYS 165
Query: 149 SVFVSTEQLTFKNT-DESTIHVQDVVFGCGFSTNRN-FKFSGIFGLGIGRSSLVSQLN-S 205
+ + TE LT ++ + V DV FGCG + +G GLG G SL++QL
Sbjct: 166 AGILGTETLTLGSSVPGQAVSVSDVAFGCGTDNGGDSLNSTGTVGLGRGTLSLLAQLGVG 225
Query: 206 SFSYCIGSLHDPDYLHNKLILGDGAIIDEG----DATPL---QFIDGHYYITLEAISVDG 258
FSYC+ + L + +LG A + G +TPL Y ++L+ I++
Sbjct: 226 KFSYCLTDFFN-STLDSPFLLGTLAELAPGPGAVQSTPLLQSPLNPSRYVVSLQGITLGD 284
Query: 259 RMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK 317
L I F +S GG+++DSGT + L + + + D V ++ G+ + S D
Sbjct: 285 VRLPIPNKTFDLHANSTGGMVVDSGTTFSILPESGFRVVVDHV-AQVLGQPPVNASSLDS 343
Query: 318 LCYHGIMSS-DLKGFPTVRFHFRGGAKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVN 375
C+ L P + HF GGA + L +D+ M Y +FC+ +I
Sbjct: 344 PCFPAPAGERQLPFMPDLVLHFAGGADMRLHRDNYMSYNQEDSSFCL-----NIVGTTST 398
Query: 376 FSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
+S++G QQ + +D+ Q +F DC L
Sbjct: 399 WSMLGNFQQQNIQMLFDMTVGQLSFLPTDCSKL 431
>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 144 bits (364), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 117/419 (27%), Positives = 192/419 (45%), Gaps = 32/419 (7%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDIS-NS 59
LIHR+ SP + N + ++ R A + ++ H + ++ + S N
Sbjct: 22 LIHREHPSSPLRS---NTSKTTTEIFLAAVKRGAERRAQLSKHILAEGRLFSTPVASGNG 78
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSE 119
+ ++IS G PP +DTGS L+W C PC C+ IF P +SS+Y V C S
Sbjct: 79 EYLIDISFGSPPQKASVIVDTGSDLIWTQCLPCETCNAAASVIFDPVKSSTYDTVSCASN 138
Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
C P+ C+ C Y +Y G TS L+ + T + +V FGCG +
Sbjct: 139 FCSSLPFQSCTT---SCKYDYMYGDGSSTS-----GALSTETVTVGTGTIPNVAFGCGHT 190
Query: 180 TNRNFK-FSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
+F +GI GLG G SL+SQ +S FSYC+ L + +++GD A
Sbjct: 191 NLGSFAGAAGIVGLGQGPLSLISQASSITSKKFSYCLVPLGSTK--TSPMLIGDSAAAGG 248
Query: 235 GDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVK 290
T L + YY L ISV G+ + F D SG GG ++DSGT +T+L
Sbjct: 249 VAYTALLTNTANPTFYYADLTGISVSGKAVTYPVGTFSIDASGQGGFILDSGTTLTYLET 308
Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS 350
A+ AL + + + + C+ ++ +PT+ FHF+ GA L ++
Sbjct: 309 GAFNALVAALKAEVPFPEADGSLYGLDYCFSTAGVAN-PTYPTMTFHFK-GADYELPPEN 366
Query: 351 MFYQ-PRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
+F + C+A+ ++ FS++G + QQ + + +D+ ++ F+ +CE +
Sbjct: 367 VFVALDTGGSICLAMAAST------GFSIMGNIQQQNHLIVHDLVNQRVGFKEANCETI 419
>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
Length = 459
Score = 144 bits (364), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 104/358 (29%), Positives = 168/358 (46%), Gaps = 23/358 (6%)
Query: 63 VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR 122
+ + +G PP P +D GS LLW C + QL +F +RSSS++ +PCDS+ C
Sbjct: 109 LTVGVGTPPQPSKVILDLGSDLLWTQCSLVGPTAKQLEPVFDAARSSSFSVLPCDSKLCE 168
Query: 123 YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNR 182
+ + +C Y Y I T V ++TE TF + ++ FGCG N
Sbjct: 169 AGTFTNKTCTDRKCAYENDYGIMTATGV-LATETFTFGAHHGVS---ANLTFGCGKLANG 224
Query: 183 NF-KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHD----PDYLHNKLILGDGAIIDEGD 236
+ SGI GL G S++ QL + FSYC+ D P LG +
Sbjct: 225 TIAEASGILGLSPGPLSMLKQLAITKFSYCLTPFADRKTSPVMFGAMADLGKYKTTGKVQ 284
Query: 237 ATPL---QFIDGHYYITLEAISVDGRMLDI-NPNIFKRDDSGGGVMIDSGTDVTWLVKEA 292
PL D +YY+ + +SV + LD+ + + D GG ++DS T + +LV+ A
Sbjct: 285 TIPLLKNPVEDIYYYVPMVGMSVGSKRLDVPQETLAIKPDGTGGTVLDSATTLAYLVEPA 344
Query: 293 YEALRDEVM--IRLEGEQMRSYSWPDKLCYHGIMSSDLKG--FPTVRFHFRGGAKLALEK 348
+ L+ VM I+L +P +C+ ++G P + HF G A+++L +
Sbjct: 345 FTELKKAVMEGIKLPVANRSVDDYP--VCFELPRGMSMEGVQVPPLVLHFDGDAEMSLPR 402
Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
D+ F +P P C+AV A ++IG + QQ +V YD+G ++ ++ C+
Sbjct: 403 DNYFQEPSPGMMCLAVMQAPFEGAP---NVIGNVQQQNMHVLYDVGNRKFSYAPTKCD 457
>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 479
Score = 144 bits (363), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 127/433 (29%), Positives = 191/433 (44%), Gaps = 54/433 (12%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQ---AQILPSNDIS 57
L+HRD++ S P+ H + AR+ YLQ ++ ++++
Sbjct: 73 LLHRDAV-SGRTYPSTR--HAMLGLAARDGARVEYLQRRLSPTTMTTEVGSEVVSGISEG 129
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
+ ++V + +G PP Q+ +D+GS ++W+ C PC +C Q +F P+ S+S+ VPCD
Sbjct: 130 SGEYFVRVGVGSPPTEQYLVVDSGSDVIWIQCRPCAECYQQADPLFDPAASASFTAVPCD 189
Query: 118 SEHCRYFPYARCS-AYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
S CR P A C Y Y G T ++ E LTF ++ VQ V GC
Sbjct: 190 SGVCRTLPGGSSGCADSGACRYQVSYGDGSYTQGVLAMETLTFGDSTP----VQGVAIGC 245
Query: 177 GFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAI 231
G F +G+ GLG G SLV QL +FSYC+ S D L+ G
Sbjct: 246 GHRNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLAS-RGADAGAGSLVFG---- 300
Query: 232 IDEGDATPLQFI----------DGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMID 280
DA P+ + YY+ L + V G L + +F +D GGGV++D
Sbjct: 301 --RDDAMPVGAVWVPLLRNAQQPSFYYVGLTGLGVGGERLPLQDGLFDLTEDGGGGVVMD 358
Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRS--YSWPDKLCYHGIMSSDLKGF-----PT 333
+GT VT L +AY ALRD + G+ R+ S D CY DL G+ PT
Sbjct: 359 TGTAVTRLPPDAYAALRDAFASTIGGDLPRAPGVSLLDT-CY------DLSGYASVRVPT 411
Query: 334 VRFHF-RGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
V +F R GA L L ++ + +C+A ++ S++G + QQ + D
Sbjct: 412 VALYFGRDGAALTLPARNLLVEMGGGVYCLAFAASA-----SGLSILGNIQQQGIQITVD 466
Query: 393 IGRKQKTFQRMDC 405
F C
Sbjct: 467 SANGYVGFGPSTC 479
>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 144 bits (363), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 118/420 (28%), Positives = 184/420 (43%), Gaps = 31/420 (7%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNT--YQAQILPSNDISN 58
L RDS LSP +NP+ + + S +R A L + S +T ++ I+P +
Sbjct: 32 LFRRDSPLSPLHNPSLSRYDSLIDAFRRSFSRSATLLTHLTSVSTACIRSPIIPDS---- 87
Query: 59 SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
F ++I IG PPV DTGS L W C PCR+C Q IF P RSSSY V C S
Sbjct: 88 GEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSCAS 147
Query: 119 EHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
+ CR C C Y Y T ++++Q+T + + V GCG
Sbjct: 148 DTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGDLASDQITI-----GSFKLPKTVIGCGH 202
Query: 179 STNRNF--------KFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGA 230
F G + + ++ + FSYC+ + + + G A
Sbjct: 203 QNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPTFFSNANITGTISFGRKA 262
Query: 231 IID--EGDATPL--QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVT 286
++ + +TPL + D Y++TLEAISV + I + G ++IDSGT +T
Sbjct: 263 VVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAANGISAMTNH-GNIIIDSGTTLT 321
Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
L + Y + + ++ +++ S +LCY DL P + HF GGA + L
Sbjct: 322 LLPRSLYYGVFSTLARVIKAKRVDDPSGILELCYSAGQVDDLN-IPIITAHFAGGADVKL 380
Query: 347 EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
+ F + C+ PA+ ++ G +AQ + VGYD+G K+ +F+ C
Sbjct: 381 LPVNTFAPVADNVTCLTFAPAT------QVAIFGNLAQINFEVGYDLGNKRLSFEPKLCA 434
>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 126/434 (29%), Positives = 196/434 (45%), Gaps = 55/434 (12%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQE-KIRSHNTYQAQILPSNDISNS 59
L H D+ N A + R + S AR+A LQ + A+IL S
Sbjct: 35 LTHVDA------NAGYTKAQLLSRAVARSRARVAALQSLATAADAITAARIL--LRFSEG 86
Query: 60 LFYVNISIGQPPVPQFTAM-DTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
+ +++ IG PP F+AM DTGS L+W C PC C Q F P++S+SYA +PC S
Sbjct: 87 EYLMDVGIGSPPR-YFSAMIDTGSDLIWTQCAPCLLCVEQPTPYFEPAKSTSYASLPCSS 145
Query: 119 EHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG- 177
C C +++ C+Y Y ++ ++ E TF T+ + + V V FGCG
Sbjct: 146 AMCNALYSPLC--FQNACVYQAFYGDSASSAGVLANETFTF-GTNSTRVAVPRVSFGCGN 202
Query: 178 FSTNRNFKFSGIFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNKLILGDGAIIDEGD 236
+ F SG+ G G G SLVSQL S FSYC+ S P ++L G A ++ +
Sbjct: 203 MNAGTLFNGSGMVGFGRGALSLVSQLGSPRFSYCLTSFMSPA--TSRLYFGAYATLNSTN 260
Query: 237 AT---PLQ--------FIDGHYYITLEAISVDGRMLDINPNIFK--RDDSGGGVMIDSGT 283
+ P+Q + Y++ + ISV G +L I+P++F D GGV+IDSGT
Sbjct: 261 TSSSGPVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTGGVIIDSGT 320
Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRS---------YSWPDKLCYHGIMSSDLKGFPTV 334
VT+L + AY ++ + + + + + WP + P +
Sbjct: 321 TVTFLAQPAYAMVQGAFVAWVGLPRANATPSDTFDTCFKWPPP-------PRRMVTLPEM 373
Query: 335 RFHFRGG-AKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
HF G +L LE + M C+A+ P+ + S+IG Q +++ YD+
Sbjct: 374 VLHFDGADMELPLE-NYMVMDGGTGNLCLAMLPSD------DGSIIGSFQHQNFHMLYDL 426
Query: 394 GRKQKTFQRMDCEV 407
+F C +
Sbjct: 427 ENSLLSFVPAPCNL 440
>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
Length = 481
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 127/429 (29%), Positives = 192/429 (44%), Gaps = 38/429 (8%)
Query: 1 LIHRDSILSPYNNPNENPAH-----RVQRGINISIARLAYLQEKIRSHNTYQAQI---LP 52
++HR SP P+H R Q ++ SI RLA + + + A LP
Sbjct: 68 VVHRHGPCSPLQARGGEPSHAEILDRDQDRVD-SIHRLAAARPSSTADDPSSASKGVSLP 126
Query: 53 SND---ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSS 109
+ + + + V++ +G P DTGS L WV C PC C Q +F PS+S+
Sbjct: 127 ARRGVPLGTANYIVSVGLGTPKRDLLVVFDTGSDLSWVQCKPCDGCYQQHDPLFDPSQST 186
Query: 110 SYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI-- 167
+Y+ VPC ++ CR CS+ K C Y +Y +T ++ + LT + S+
Sbjct: 187 TYSAVPCGAQECRRLDSGSCSSGK--CRYEVVYGDMSQTDGNLARDTLTLGPSSSSSSSD 244
Query: 168 HVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHN 222
+Q+ VFGCG F K G+FGLG R SL SQ + FSYC+ S
Sbjct: 245 QLQEFVFGCGDDDTGLFGKADGLFGLGRDRVSLASQAAAKYGAGFSYCLPS---SSTAEG 301
Query: 223 KLILGDGAIIDEGDATPLQFID--GHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
L LG A + + D YY+ L I V GR + ++P +F+ G +ID
Sbjct: 302 YLSLGSAAPPNARFTAMVTRSDTPSFYYLNLVGIKVAGRTVRVSPAVFRTP----GTVID 357
Query: 281 SGTDVTWLVKEAYEALRDE---VMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFH 337
SGT +T L AY ALR +M R ++ + S D CY + ++ P+V
Sbjct: 358 SGTVITRLPSRAYAALRSSFAGLMRRYSYKRAPALSILDT-CYDFTGRNKVQ-IPSVALL 415
Query: 338 FRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
F GGA L L + Y C+A + N + +++G M Q+ + V YD+ ++
Sbjct: 416 FDGGATLNLGFGEVLYVANKSQACLAF---ASNGDDTSIAILGNMQQKTFAVVYDVANQK 472
Query: 398 KTFQRMDCE 406
F C
Sbjct: 473 IGFGAKGCS 481
>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
Length = 438
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 126/434 (29%), Positives = 196/434 (45%), Gaps = 55/434 (12%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQE-KIRSHNTYQAQILPSNDISNS 59
L H D+ N A + R + S AR+A LQ + A+IL S
Sbjct: 32 LTHVDA------NAGYTKAQLLSRAVARSRARVAALQSLATAADAITAARIL--LRFSEG 83
Query: 60 LFYVNISIGQPPVPQFTAM-DTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
+ +++ IG PP F+AM DTGS L+W C PC C Q F P++S+SYA +PC S
Sbjct: 84 EYLMDVGIGSPPR-YFSAMIDTGSDLIWTQCAPCLLCVEQPTPYFEPAKSTSYASLPCSS 142
Query: 119 EHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG- 177
C C +++ C+Y Y ++ ++ E TF T+ + + V V FGCG
Sbjct: 143 AMCNALYSPLC--FQNACVYQAFYGDSASSAGVLANETFTF-GTNSTRVAVPRVSFGCGN 199
Query: 178 FSTNRNFKFSGIFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNKLILGDGAIIDEGD 236
+ F SG+ G G G SLVSQL S FSYC+ S P ++L G A ++ +
Sbjct: 200 MNAGTLFNGSGMVGFGRGALSLVSQLGSPRFSYCLTSFMSPA--TSRLYFGAYATLNSTN 257
Query: 237 AT---PLQ--------FIDGHYYITLEAISVDGRMLDINPNIFK--RDDSGGGVMIDSGT 283
+ P+Q + Y++ + ISV G +L I+P++F D GGV+IDSGT
Sbjct: 258 TSSSGPVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTGGVIIDSGT 317
Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRS---------YSWPDKLCYHGIMSSDLKGFPTV 334
VT+L + AY ++ + + + + + WP + P +
Sbjct: 318 TVTFLAQPAYAMVQGAFVAWVGLPRANATPSDTFDTCFKWPPP-------PRRMVTLPEM 370
Query: 335 RFHFRGG-AKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
HF G +L LE + M C+A+ P+ + S+IG Q +++ YD+
Sbjct: 371 VLHFDGADMELPLE-NYMVMDGGTGNLCLAMLPSD------DGSIIGSFQHQNFHMLYDL 423
Query: 394 GRKQKTFQRMDCEV 407
+F C +
Sbjct: 424 ENSLLSFVPAPCNL 437
>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
Length = 490
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 127/432 (29%), Positives = 191/432 (44%), Gaps = 57/432 (13%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNT------------YQA 48
L+HRD + N PA + R + + R A++ K ++ T + A
Sbjct: 72 LLHRDRFAA-----NATPAQLLARRLQRDVLRAAWIISKAAANGTPPPVAGLSSARGFVA 126
Query: 49 QILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRS 108
++ S ++ + I++G P V A+DT S L W+ C PCR C PQ G +F P S
Sbjct: 127 PVV-SRAPTSGEYIAKIAVGTPGVEALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHS 185
Query: 109 SSYAYVPCDSEHCRYFPYARCS-AYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI 167
+SY + ++ C+ + A + C+YT Y G T E LTF +
Sbjct: 186 TSYREMSFNAADCQALGRSGGGDAKRGTCVYTVGYGDGSTTVGDFIEETLTFAG----GV 241
Query: 168 HVQDVVFGCGFSTNRNF--KFSGIFGLGIGRSSLVSQL--NSSFSYC-IGSLHDPDYLHN 222
+ + GCG F +GI GLG G S +Q+ N +FSYC + L P L +
Sbjct: 242 RLPRISIGCGHDNKGLFGAPAAGILGLGRGLMSFPNQIDHNGTFSYCLVDFLSGPGSLSS 301
Query: 223 KLILGDGAIIDEGDATPLQF--------IDGHYYITLEAISVDGRMLDINPNIFKRD--- 271
L G GA+ + P+ F + YY+ L ISV G + P + +RD
Sbjct: 302 TLTFGAGAV---DTSPPVSFTPTVLNLNMPTFYYVRLTGISVGGVRV---PGVTERDLQL 355
Query: 272 ---DSGGGVMIDSGTDVTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMS 325
GGV++DSGT VT L + AY A RD V + L + S CY +
Sbjct: 356 DPYTGRGGVIVDSGTAVTRLARPAYTAFRDAFRAVAVDLGQVSIGGPSGFFDTCYT-VGG 414
Query: 326 SDLKGFPTVRFHFRGGAKLALE-KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQ 384
+K PTV HF G ++ L+ K+ + C A A+ + V S+IG + Q
Sbjct: 415 RGMKKVPTVSMHFAGSVEVKLQPKNYLIPVDSMGTVCFAF--AATGDHSV--SIIGNIQQ 470
Query: 385 QFYNVGYDIGRK 396
Q + + YDIG +
Sbjct: 471 QGFRIVYDIGGR 482
>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
Length = 379
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 99/319 (31%), Positives = 156/319 (48%), Gaps = 30/319 (9%)
Query: 22 VQRGINISIARLAYLQEKIRSHNTY----QAQILPSNDISNSLFYVNISIGQPPVPQFTA 77
+ R I S AR+A LQ A++L + S+ + V+++IG PP+
Sbjct: 48 LSRAIARSKARVAALQSAAVLPPVVDPITAARVLVT--ASSGEYLVDLAIGTPPLYYTAI 105
Query: 78 MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCI 137
MDTGS L+W C PC C+ Q F +S++Y +PC S C C +K C+
Sbjct: 106 MDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRCASLSSPSC--FKKMCV 163
Query: 138 YTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG-FSTNRNFKFSGIFGLGIGR 196
Y Y T+ ++ E TF + + + ++ FGCG + SG+ G G G
Sbjct: 164 YQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLNAGDLANSSGMVGFGRGP 223
Query: 197 SSLVSQLNSS-FSYCIGSL--HDPDYLHNKLILGDGAIIDEGDATPLQ--------FIDG 245
SLVSQL S FSYC+ S P L+ + + + + +P+Q +
Sbjct: 224 LSLVSQLGPSRFSYCLTSYLSATPSRLYFG-VYANLSSTNTSSGSPVQSTPFVINPALPN 282
Query: 246 HYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRL 304
Y+++L+AIS+ ++L I+P +F +D G GGV+IDSGT +TWL ++AYEA+R ++ +
Sbjct: 283 MYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVSAI 342
Query: 305 EGEQMRS--------YSWP 315
M + WP
Sbjct: 343 PLTAMNDTDIGLDTCFQWP 361
>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
Length = 485
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 123/433 (28%), Positives = 186/433 (42%), Gaps = 52/433 (12%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNT--------YQAQILP 52
L H D+ LS P E + R+QR + + +A L +I N + + ++
Sbjct: 76 LDHIDA-LSSNKTPQELFSSRLQRD-SRRVRSIATLAAQIPGRNVTHAPRPGGFSSSVVS 133
Query: 53 SNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYA 112
+ ++ + +G P + +DTGS ++W+ C PCR C Q IF P +S +YA
Sbjct: 134 GLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYA 193
Query: 113 YVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
+PC S HCR A C+ + C+Y Y G T STE LTF+ V+ V
Sbjct: 194 TIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRN-----RVKGV 248
Query: 173 VFGCGFSTNRNF--------------KFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPD 218
GCG F F G G + N FSYC+
Sbjct: 249 ALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTG---------HRFNQKFSYCLVD-RSAS 298
Query: 219 YLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDG-RMLDINPNIFKRDDSG 274
+ ++ G+ A+ TPL +D YY+ L ISV G R+ + ++FK D G
Sbjct: 299 SKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIG 358
Query: 275 -GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFP 332
GGV+IDSGT VT L++ AY A+RD + + ++ ++S D C+ +++K P
Sbjct: 359 NGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPNFSLFDT-CFDLSNMNEVK-VP 416
Query: 333 TVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
TV HFR + + FC A S+IG + QQ + V YD
Sbjct: 417 TVVLHFRRADVSLPATNYLIPVDTNGKFCFA-----FAGTMGGLSIIGNIQQQGFRVVYD 471
Query: 393 IGRKQKTFQRMDC 405
+ + F C
Sbjct: 472 LASSRVGFAPGGC 484
>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 127/402 (31%), Positives = 191/402 (47%), Gaps = 36/402 (8%)
Query: 20 HRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMD 79
H V+RG N + RL + S + +A +LP N F + ++IG PP +D
Sbjct: 61 HGVKRGRN-RLQRLQAMALVASSSSEIEAPVLPGN----GEFLMKLAIGTPPETYSAILD 115
Query: 80 TGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYT 139
TGS L+W C PC C Q IF P +SSS++ + C S+ C P + C+ + C Y
Sbjct: 116 TGSDLIWTQCKPCTQCFHQSTPIFDPKKSSSFSKLSCSSQLCEALPQSSCN---NGCEYL 172
Query: 140 QLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFS---GIFGLGIGR 196
Y T +++E LTF V +V FGCG + N FS G+ GLG G
Sbjct: 173 YSYGDYSSTQGILASETLTFGKAS-----VPNVAFGCG-ADNEGSGFSQGAGLVGLGRGP 226
Query: 197 SSLVSQLNS-SFSYCIGSLHDPDYLHNKLILGDGAIIDEGDA----TPLQFIDGH---YY 248
SLVSQL FSYC+ ++ D + L++G A ++ + TPL H YY
Sbjct: 227 LSLVSQLKEPKFSYCLTTVDDTKT--STLLMGSLASVNASSSAIKTTPLIHSPAHPSFYY 284
Query: 249 ITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGE 307
++LE ISV L I + F +DD GG++IDSGT +T+L + A+ + E ++
Sbjct: 285 LSLEGISVGDTRLPIKKSTFSLQDDGSGGLIIDSGTTITYLEESAFNLVAKEFTAKINLP 344
Query: 308 QMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS-MFYQPRPDAFCMAVNP 366
S S +C+ S P + FHF GA L L ++ M C+A+
Sbjct: 345 VDSSGSTGLDVCFTLPSGSTNIEVPKLVFHFD-GADLELPAENYMIGDSSMGVACLAMGS 403
Query: 367 ASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
+S S+ G + QQ V +D+ ++ +F C++L
Sbjct: 404 SS------GMSIFGNVQQQNMLVLHDLEKETLSFLPTQCDLL 439
>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
Length = 475
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 118/375 (31%), Positives = 170/375 (45%), Gaps = 36/375 (9%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
N F +++S+G P +P +DTGS L+W C PC +C Q +F P+ SS+YA +PC
Sbjct: 113 NGEFLMDLSVGTPALPYAAIVDTGSDLVWTQCKPCVECFNQTTPVFDPAASSTYAALPCS 172
Query: 118 SEHCRYFPYARCSAYKHRCI------YTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
S C P + C++ YT Y T ++TE T V
Sbjct: 173 SALCADLPTSTCASSSSSSSASSPCGYTYTYGDASSTQGVLATETFTLARQ-----KVPG 227
Query: 172 VVFGCGFSTNRNFKFS---GIFGLGIGRSSLVSQLN-SSFSYCIGSLHD-----PDYLHN 222
V FGCG TN F+ G+ GLG G SLVSQL FSYC+ SL D P L +
Sbjct: 228 VAFGCG-DTNEGDGFTQGAGLVGLGRGPLSLVSQLGIDRFSYCLTSLDDAAGRSPLLLGS 286
Query: 223 KLILGDGAIIDEGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFK-RDDSGGGVM 278
+ A TPL YY++L ++V L + + F +DD GGV+
Sbjct: 287 AAGISASAATAPAQTTPLVKNPSQPSFYYVSLTGLTVGSTRLALPSSAFAIQDDGTGGVI 346
Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYH---GIMSSDLK-GFPTV 334
+DSGT +T+L AY ALR + + + + LC+ G + D++ P +
Sbjct: 347 VDSGTSITYLELRAYRALRKAFVAHMSLPTVDASEIGLDLCFQGPAGAVDQDVQVQVPKL 406
Query: 335 RFHFRGGAKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
HF GGA L L ++ M A C+ V + +R S+IG QQ + YD+
Sbjct: 407 VLHFDGGADLDLPAENYMVLDSASGALCLTV----MASR--GLSIIGNFQQQNFQFVYDV 460
Query: 394 GRKQKTFQRMDCEVL 408
+F +C L
Sbjct: 461 AGDTLSFAPAECNKL 475
>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 131/414 (31%), Positives = 187/414 (45%), Gaps = 45/414 (10%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHN-TYQAQILPSNDIS-- 57
L+HRD + P N + + R + R+A L+ + + TY + S+ +S
Sbjct: 70 LVHRDKV--PTFNTSHDHRTRFNARMQRDTKRVAALRRHLAAGKPTYAEEAFGSDVVSGM 127
Query: 58 ---NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYV 114
+ ++V I +G PP Q+ +D+GS ++WV C PC C Q +F P+ SSSYA V
Sbjct: 128 EQGSGEYFVRIGVGSPPRNQYVVIDSGSDIIWVQCEPCTQCYHQSDPVFNPADSSSYAGV 187
Query: 115 PCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
C S C + A C ++ RC Y Y G T ++ E LTF T +++V
Sbjct: 188 SCASTVCSHVDNAGC--HEGRCRYEVSYGDGSYTKGTLALETLTFGRT-----LIRNVAI 240
Query: 175 GCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDG 229
GCG F +G+ GLG G S V QL +FSYC+ S L G
Sbjct: 241 GCGHHNQGMFVGAAGLLGLGSGPMSFVGQLGGQAGGTFSYCLVSRGIQS--SGLLQFGRE 298
Query: 230 AIIDEGDATPLQF---IDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDV 285
A+ PL YY+ L + V G + I+ ++FK + G GGV++D+GT V
Sbjct: 299 AVPVGAAWVPLIHNPRAQSFYYVGLSGLGVGGLRVPISEDVFKLSELGDGGVVMDTGTAV 358
Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----PTVRFHFRG 340
T L AYEA RD + + S CY DL GF PTV F+F G
Sbjct: 359 TRLPTAAYEAFRDAFIAQTTNLPRASGVSIFDTCY------DLFGFVSVRVPTVSFYFSG 412
Query: 341 GAKLALEKDSMFYQPRPD--AFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
G L L + F P D +FC A P+S S+IG + Q+ + D
Sbjct: 413 GPILTLPARN-FLIPVDDVGSFCFAFAPSS-----SGLSIIGNIQQEGIEISVD 460
>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
Length = 460
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 118/378 (31%), Positives = 173/378 (45%), Gaps = 39/378 (10%)
Query: 57 SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC-YPCRDCSPQLGTIFYPSRSSSYAYVP 115
S + + V+ +IG PP+ +DTGS L+W C PCR C PQ ++ P+RS +YA V
Sbjct: 96 STATYLVDFAIGTPPLALSAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSVTYANVS 155
Query: 116 CDSEHCRYFPYARCSAY-----------KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDE 164
C S C P R S+ + C Y Y G T ++TE TF
Sbjct: 156 CGSRLCDALPSLRPSSRCSASASAPAPERGGCTYYYSYGDGSSTDGVLATETFTFGA--G 213
Query: 165 STIHVQDVVFGCGF-STNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHN 222
+T+H D+ FGCG + SG+ G+G G SLVSQL + FSYC +D +
Sbjct: 214 TTVH--DLAFGCGTDNLGGTDNSSGLVGMGRGPLSLVSQLGVTKFSYCFTPFND-TTTSS 270
Query: 223 KLILGDGAIIDEGDATPLQFI--------DGHYYITLEAISVDGRMLDINPNIFKRDDSG 274
L LG A + A F+ +YY++LE I+V +L I+P +F+ SG
Sbjct: 271 PLFLGSSASLSPA-AKSTPFVPSPSGPRRSSYYYLSLEGITVGDTLLPIDPAVFRLTASG 329
Query: 275 -GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG--F 331
GG++IDSGT T L + A+ L V R+ +C+ +
Sbjct: 330 RGGLIIDSGTTFTALEERAFVVLARAVAARVALPLASGAHLGLSVCFAAPQGRGPEAVDV 389
Query: 332 PTVRFHFRGGAKLALEKDSMFYQPR-PDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
P + HF GA + L + S + R C+ + A S++G M QQ +V
Sbjct: 390 PRLVLHFD-GADMELPRSSAVVEDRVAGVACLGIVSAR------GMSVLGSMQQQNMHVR 442
Query: 391 YDIGRKQKTFQRMDCEVL 408
YD+GR +F+ +C L
Sbjct: 443 YDVGRDVLSFEPANCGEL 460
>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 500
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 117/347 (33%), Positives = 170/347 (48%), Gaps = 28/347 (8%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ I +G P + +DTGS + W+ C PC DC Q +F P+ SS+Y + C +
Sbjct: 162 YFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCSDCYQQSDPVFNPTSSSTYKSLTCSAPQ 221
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C + C + ++C+Y Y G T ++T+ +TF N+ + + DV GCG
Sbjct: 222 CSLLETSACRS--NKCLYQVSYGDGSFTVGELATDTVTFGNSGK----INDVALGCGHDN 275
Query: 181 NRNFK-FSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
F +G+ GLG G S+ +Q+ +SFSYC L D D + + + + GDAT
Sbjct: 276 EGLFTGAAGLLGLGGGALSITNQMKATSFSYC---LVDRDSGKSSSLDFNSVQLGSGDAT 332
Query: 239 -PL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAY 293
PL Q ID YY+ L SV G+ + + IF D SG GGV++D GT VT L +AY
Sbjct: 333 APLLRNQKIDTFYYVGLSGFSVGGQKVMMPDAIFDVDASGSGGVILDCGTAVTRLQTQAY 392
Query: 294 EALRDEVMIRLEGEQMRSYSWPD--KLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSM 351
+LRD ++L + S CY S +K PTV FHF GG L L +
Sbjct: 393 NSLRD-AFLKLTTNLKKGTSSISLFDTCYDFSSLSSVK-VPTVAFHFTGGKSLDLPAKN- 449
Query: 352 FYQPRPD--AFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
+ P D FC A P S + S+IG + QQ + YD+ K
Sbjct: 450 YLIPVDDNGTFCFAFAPTS-----SSLSIIGNVQQQGTRITYDLANK 491
>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
Length = 459
Score = 142 bits (357), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 127/429 (29%), Positives = 192/429 (44%), Gaps = 56/429 (13%)
Query: 1 LIHRDSILSPYNNPNENPA--HRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISN 58
L+HR +P ++ P+ R++R S AR Y+ + N L + +
Sbjct: 63 LVHRHGPCAPSTRSSDEPSLSERLRR----SRARSKYIMSRASKSNVSIPTHL-GGSVDS 117
Query: 59 SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD--CSPQLGTIFYPSRSSSYAYVPC 116
+ V + +G P V Q +DTGS L WV C PC C PQ +F PSRSS+YA +PC
Sbjct: 118 LEYVVTVGLGTPAVSQVLLIDTGSDLSWVQCAPCNSTTCYPQKDPLFDPSRSSTYAPIPC 177
Query: 117 DSEHCRYFPY----ARC---SAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHV 169
+++ CR + C S +C Y Y G +T+ S E LT + V
Sbjct: 178 NTDACRDLTRDGYGSDCTSGSGGGAQCGYAITYGDGSQTTGVYSNETLTMA----PGVTV 233
Query: 170 QDVVFGCGFSTNR-NFKFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKL 224
+D FGCG + N K+ G+ GLG SLV Q +S +FSYC+ + +D
Sbjct: 234 KDFHFGCGHDQDGPNDKYDGLLGLGGAPESLVVQTSSVYGGAFSYCLPAAND-----QAG 288
Query: 225 ILGDGAIIDEGDA---TPLQFIDGHYYIT-LEAISVDGRMLDINPNIFKRDDSGGGVMID 280
L GA +++ TP+ +Y+ + I+V G +D+ P+ F GG++ID
Sbjct: 289 FLALGAPVNDASGFVFTPMVREQQTFYVVNMTGITVGGEPIDVPPSAFS-----GGMIID 343
Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRG 340
SGT VT L AY AL+ + + D CY+ S++ P V F G
Sbjct: 344 SGTVVTELQHTAYAALQAAFRKAMAAYPLLPNGELDT-CYNFTGHSNVT-VPRVALTFSG 401
Query: 341 GAKLALEKDSMFYQPRPDAF----CMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
GA + L+ PD C+A A +N+ ++G + Q+ V YD+G
Sbjct: 402 GATVDLDV--------PDGILLDNCLAFQEAGPDNQP---GILGNVNQRTLEVLYDVGHG 450
Query: 397 QKTFQRMDC 405
+ F C
Sbjct: 451 RVGFGADAC 459
>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
gi|194708650|gb|ACF88409.1| unknown [Zea mays]
Length = 392
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 114/374 (30%), Positives = 173/374 (46%), Gaps = 39/374 (10%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSYAYVPCDSE 119
+ + ++IG PP+P DTGS L+W C PC C Q ++ PS S+++A +PC+S
Sbjct: 32 YLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPCNSS 91
Query: 120 ---------HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
P C+ C Y Y G TSVF +E TF +T V
Sbjct: 92 LSVCAAALAGTGTAPPPGCA-----CTYNVTYGSG-WTSVFQGSETFTFGSTPAGHARVP 145
Query: 171 DVVFGCGFSTN--RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILG 227
+ FGC +++ SG+ GLG GR SLVSQL FSYC+ D + + L+LG
Sbjct: 146 GIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVPKFSYCLTPYQDTNS-TSTLLLG 204
Query: 228 DGAIIDEG---DATPL------QFIDGHYYITLEAISVDGRMLDINPNIFKRD-DSGGGV 277
A ++ +TP ++ YY+ L IS+ L I P+ F + D GG+
Sbjct: 205 PSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFSLNADGTGGL 264
Query: 278 MIDSGTDVTWLVKEAYEALRDEV--MIRLEGEQMRSYSWPDKLCYHGIMSSDL-KGFPTV 334
+IDSGT +T L AY+ +R V ++ L + + D LC+ S+ P++
Sbjct: 265 IIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDGSADTGLD-LCFMLPSSTSAPPAMPSM 323
Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
HF GA + L DS +C+A+ N +++G QQ ++ YDIG
Sbjct: 324 TLHFN-GADMVLPADSYMMSDDSGLWCLAMQ----NQTDGEVNILGNYQQQNMHILYDIG 378
Query: 395 RKQKTFQRMDCEVL 408
++ +F C L
Sbjct: 379 QETLSFAPAKCSAL 392
>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 119/353 (33%), Positives = 166/353 (47%), Gaps = 25/353 (7%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ + IG+P + +DTGS + W+ C PC DC Q IF PS SSSY + CD+
Sbjct: 148 YFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSSSYEPLSCDTPQ 207
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C + C C+Y Y G T +TE LT +T VQ+V GCG S
Sbjct: 208 CNALEVSECR--NATCLYEVSYGDGSYTVGDFATETLTIGST-----LVQNVAVGCGHSN 260
Query: 181 NRNFKFSGIFGLGIGRSSLV-SQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
F + G + SQLN +SFSYC L D D + ++ +
Sbjct: 261 EGLFVGAAGLLGLGGGLLALPSQLNTTSFSYC---LVDRDSDSASTVDFGTSLSPDAVVA 317
Query: 239 PL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYE 294
PL +D YY+ L ISV G +L I + F+ D+SG GG++IDSGT VT L E Y
Sbjct: 318 PLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQTEIYN 377
Query: 295 ALRDE-VMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL-EKDSMF 352
+LRD V L+ E+ + D CY+ + ++ PTV FHF GG LAL K+ M
Sbjct: 378 SLRDSFVKGTLDLEKAAGVAMFDT-CYNLSAKTTVE-VPTVAFHFPGGKMLALPAKNYMI 435
Query: 353 YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
FC+A P + + ++IG + QQ V +D+ F C
Sbjct: 436 PVDSVGTFCLAFAPTA-----SSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 483
>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 114/374 (30%), Positives = 173/374 (46%), Gaps = 39/374 (10%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSYAYVPCDSE 119
+ + ++IG PP+P DTGS L+W C PC C Q ++ PS S+++A +PC+S
Sbjct: 90 YLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPCNSS 149
Query: 120 ---------HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
P C+ C Y Y G TSVF +E TF +T V
Sbjct: 150 LSVCAAALAGTGTAPPPGCA-----CTYNVTYGSG-WTSVFQGSETFTFGSTPAGQSRVP 203
Query: 171 DVVFGCGFSTN--RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILG 227
+ FGC +++ SG+ GLG GR SLVSQL FSYC+ D + + L+LG
Sbjct: 204 GIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVPKFSYCLTPYQDTNS-TSTLLLG 262
Query: 228 DGAIIDEG---DATPL------QFIDGHYYITLEAISVDGRMLDINPNIFKRD-DSGGGV 277
A ++ +TP ++ YY+ L IS+ L I P+ F + D GG+
Sbjct: 263 PSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFLLNADGTGGL 322
Query: 278 MIDSGTDVTWLVKEAYEALRDEV--MIRLEGEQMRSYSWPDKLCYHGIMSSDL-KGFPTV 334
+IDSGT +T L AY+ +R V ++ L + + D LC+ S+ P++
Sbjct: 323 IIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDGSAATGLD-LCFMLPSSTSAPPAMPSM 381
Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
HF GA + L DS +C+A+ N +++G QQ ++ YDIG
Sbjct: 382 TLHFN-GADMVLPADSYMMSDDSGLWCLAMQ----NQTDGEVNILGNYQQQNMHILYDIG 436
Query: 395 RKQKTFQRMDCEVL 408
++ +F C L
Sbjct: 437 QETLSFAPAKCSAL 450
>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 452
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 114/374 (30%), Positives = 173/374 (46%), Gaps = 39/374 (10%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSYAYVPCDSE 119
+ + ++IG PP+P DTGS L+W C PC C Q ++ PS S+++A +PC+S
Sbjct: 92 YLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPCNSS 151
Query: 120 ---------HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
P C+ C Y Y G TSVF +E TF +T V
Sbjct: 152 LSVCAAALAGTGTAPPPGCA-----CTYNVTYGSG-WTSVFQGSETFTFGSTPAGHARVP 205
Query: 171 DVVFGCGFSTN--RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILG 227
+ FGC +++ SG+ GLG GR SLVSQL FSYC+ D + + L+LG
Sbjct: 206 GIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVPKFSYCLTPYQDTNS-TSTLLLG 264
Query: 228 DGAIIDEG---DATPL------QFIDGHYYITLEAISVDGRMLDINPNIFKRD-DSGGGV 277
A ++ +TP ++ YY+ L IS+ L I P+ F + D GG+
Sbjct: 265 PSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFSLNADGTGGL 324
Query: 278 MIDSGTDVTWLVKEAYEALRDEV--MIRLEGEQMRSYSWPDKLCYHGIMSSDL-KGFPTV 334
+IDSGT +T L AY+ +R V ++ L + + D LC+ S+ P++
Sbjct: 325 IIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDGSADTGLD-LCFMLPSSTSAPPAMPSM 383
Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
HF GA + L DS +C+A+ N +++G QQ ++ YDIG
Sbjct: 384 TLHFN-GADMVLPADSYMMSDDSGLWCLAMQ----NQTDGEVNILGNYQQQNMHILYDIG 438
Query: 395 RKQKTFQRMDCEVL 408
++ +F C L
Sbjct: 439 QETLSFAPAKCSAL 452
>gi|255566006|ref|XP_002523991.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536718|gb|EEF38359.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 455
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 130/437 (29%), Positives = 188/437 (43%), Gaps = 52/437 (11%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
LIH DS SP+ N +E HR+ + + S R+A L S A I +
Sbjct: 42 LIHIDSPNSPFFNASETTTHRLAKALQRSANRVARLNPLSNSDEGVHASIFSGD----GN 97
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ + + IG PP A+DTGS+++W+ C C+DC Q +IF P SS+Y PCDS
Sbjct: 98 YLMKLLIGTPPTEIHAAIDTGSNVIWIPCINCKDCFNQSSSIFNPLASSTYQDAPCDSYQ 157
Query: 121 CRYFPYARCSAYKHRCIYT---QLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
C + C + + C+Y+ + L P + V T LT + + D V CG
Sbjct: 158 CET-TSSSCQS-DNVCLYSCDEKHQLNCPNGRIAVDTMTLTSSDGRPFPLPYSDFV--CG 213
Query: 178 FSTNRNFKFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAIID 233
S + F G+ GLG G SL S+L + FSYC+ + +K+ G + I
Sbjct: 214 NSIYKTFAGVGVIGLGRGALSLTSKLYHLSDGKFSYCLADYYSKQ--PSKINFGLQSFIS 271
Query: 234 EGDATPLQFIDGH------YYITLEAISVDGRMLDINPNIFKRDD----SGGGVMIDSGT 283
+ D + GH YY+TLE ISV + D ++ DD G ++IDSGT
Sbjct: 272 DDDLEVVSTTLGHHRHSGNYYVTLEGISVGEKRQD----LYYVDDPFAPPVGNMLIDSGT 327
Query: 284 DVTWLVKEAYEALRDEVMIRL-EGEQMRSYSWPDKLCYHGIMSSDLK-----------GF 331
T L K+ Y+ L V + E Q P + M + LK F
Sbjct: 328 MFTLLPKDFYDYLWSTVSYAIPENPQNH----PHNSRFPFSMDNTLKLSPCFWYYPELKF 383
Query: 332 PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
P + HF A + L D+ F + D C A V G Q + +GY
Sbjct: 384 PKITIHFT-DADVELSDDNSFIRVAEDVVCFAFAATQPGQSTV----YGSWQQMNFILGY 438
Query: 392 DIGRKQKTFQRMDCEVL 408
D+ R +F+R DC L
Sbjct: 439 DLKRGTVSFKRTDCSKL 455
>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 412
Score = 141 bits (356), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 131/416 (31%), Positives = 196/416 (47%), Gaps = 50/416 (12%)
Query: 20 HRVQRGINISIARLAYLQEKIR----SHN--TYQAQILPSNDISNSLFYVNISIGQPPVP 73
R+Q+ + R+ +Q +IR SHN Q QI S+ I+ +++G
Sbjct: 16 RRLQKQLISDDLRVRSMQNRIRRVVSSHNVEASQTQIPLSSGINLQTLNYIVTMGLGSTN 75
Query: 74 QFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYAR----- 128
+DTGS L WV C PC C Q G IF PS SSSY V C+S C+ +A
Sbjct: 76 MTVIIDTGSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVSCNSSTCQSLQFATGNTGA 135
Query: 129 CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFK--F 186
C + C Y Y G T+ + EQL+F + V D VFGCG RN K F
Sbjct: 136 CGSNPSTCNYVVNYGDGSYTNGELGVEQLSFGG-----VSVSDFVFGCG----RNNKGLF 186
Query: 187 SGIFGL-GIGRS--SLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEGDATP 239
G+ GL G+GRS SLVSQ N++ FSYC+ + L++G+ + + + + TP
Sbjct: 187 GGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTTESG--ASGSLVMGNESSVFK-NVTP 243
Query: 240 LQF--------IDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKE 291
+ + + Y + L I VDG L + P+ GGV+IDSGT +T L
Sbjct: 244 ITYTRMLPNPQLSNFYILNLTGIDVDGVALQV-PSF-----GNGGVLIDSGTVITRLPSS 297
Query: 292 AYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS 350
Y+AL+ + + G +S D C++ + D PT+ HF G A+L ++
Sbjct: 298 VYKALKALFLKQFTGFPSAPGFSILDT-CFN-LTGYDEVSIPTISMHFEGNAELKVDATG 355
Query: 351 MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
FY + DA + + AS+++ Y + ++IG Q+ V YD + + F C
Sbjct: 356 TFYVVKEDASQVCLALASLSDAY-DTAIIGNYQQRNQRVIYDTKQSKVGFAEESCS 410
>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 469
Score = 141 bits (355), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 128/421 (30%), Positives = 185/421 (43%), Gaps = 33/421 (7%)
Query: 2 IHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKI----RSHNTYQAQILPSNDIS 57
+H LS + P R+QR + ++YL E R + + ++
Sbjct: 64 LHHVDALSFNSTPETLFTTRLQRDA-ARVEAISYLAETAGTGKRVGTGFSSSVISGLAQG 122
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
+ ++ I +G PP + +DTGS ++W+ C PC+ C Q +F P +S S+A + C
Sbjct: 123 SGEYFTRIGVGTPPRYVYMVLDTGSDIVWIQCAPCKRCYAQSDPVFDPRKSRSFASIACR 182
Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
S C C+ K C+Y Y G T STE LTF+ T V V GCG
Sbjct: 183 SPLCHRLDSPGCNTQKQTCMYQVSYGDGSFTFGDFSTETLTFRRT-----RVARVALGCG 237
Query: 178 FSTNRNF-KFSGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAII 232
F +G+ GLG GR S SQ N FSYC+ + ++ GD A+
Sbjct: 238 HDNEGLFVGAAGLLGLGRGRLSFPSQTGRRFNHKFSYCLVD-RSASSKPSSMVFGDSAVS 296
Query: 233 DEGDATPL---QFIDGHYYITLEAISVDG-RMLDINPNIFKRDDSG-GGVMIDSGTDVTW 287
TPL +D YY+ L ISV G R+ I ++FK D +G GGV+IDSGT VT
Sbjct: 297 RTARFTPLVSNPKLDTFYYVELLGISVGGTRVPGITASLFKLDQTGNGGVIIDSGTSVTR 356
Query: 288 LVKEAYEALRDEVMIRLEGEQMR---SYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKL 344
L + AY A RD R ++ +S D C+ +++K PTV HFRG
Sbjct: 357 LTRPAYIAFRDA--FRAGASNLKRAPQFSLFDT-CFDLSGKTEVK-VPTVVLHFRGADVS 412
Query: 345 ALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
+ + FC+A S+IG + QQ + V YD+ + F
Sbjct: 413 LPASNYLIPVDTSGNFCLA-----FAGTMGGLSIIGNIQQQGFRVVYDLAGSRVGFAPHG 467
Query: 405 C 405
C
Sbjct: 468 C 468
>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 141 bits (355), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 116/421 (27%), Positives = 195/421 (46%), Gaps = 35/421 (8%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
+IHRD SP +P R ++ SI R+ Y ++ S N Q P + ++ L
Sbjct: 32 MIHRDFSKSPLYHPTVTKFQRAYNVVHRSINRVNYFTKEF-SLNKNQ----PVSTLTPEL 86
Query: 61 --FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
+ ++ S+G PP + MDTGS+++W+ C PC C Q IF PS+SSSY +PC S
Sbjct: 87 GEYLISYSVGTPPFKVYGFMDTGSNIVWLQCQPCNTCFNQTSPIFNPSKSSSYKNIPCTS 146
Query: 119 EHCRYF--PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
C+ + CS C Y+ Y ++ +S + LT +T S++ ++V GC
Sbjct: 147 STCKDTNDTHISCSNGGDVCEYSITYGGDAKSQGDLSNDSLTLDSTSGSSVLFPNIVIGC 206
Query: 177 GFST--NRNFKFSGIFGLGIGRSSLVSQLNSS-----FSYCIGSLHDPDYLHNKLILGDG 229
G N + SG+ G+G G SL+ Q+ SS FSYC+ + +KLI G+
Sbjct: 207 GHINVLQDNSQSSGVVGMGRGPMSLIKQVGSSSVGSKFSYCLIPYNSDSNSSSKLIFGED 266
Query: 230 AIIDEGD---ATPLQFIDG---HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGT 283
++ G+ +TP+ ++G +Y++TLEA SV ++ + + S ++IDSGT
Sbjct: 267 VVV-SGEIVVSTPMVKVNGQENYYFLTLEAFSVGNNRIEYGE---RSNASTQNILIDSGT 322
Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAK 343
+T L L V ++ ++ LCY+ + P + HF GA
Sbjct: 323 PLTMLPNLFLSKLVSYVAQEVKLPRIEPPDHHLSLCYN--TTGKQLNVPDITAHFN-GAD 379
Query: 344 LALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
+ L + F+ C ++ + G +AQ + YD+ ++ +F+
Sbjct: 380 VKLNSNGTFFPFEDGIMCFGFISSN------GLEIFGNIAQNNLLIDYDLEKEIISFKPT 433
Query: 404 D 404
D
Sbjct: 434 D 434
>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 141 bits (355), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 109/358 (30%), Positives = 171/358 (47%), Gaps = 29/358 (8%)
Query: 63 VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR 122
+ I IG PP+ +DTGS L+W+ C PC C Q+ +F P +SS+Y + CDS C
Sbjct: 70 MEIYIGTPPIKITGLVDTGSDLIWIQCAPCLGCYKQIKPMFDPLKSSTYNNISCDSPLCH 129
Query: 123 YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNR 182
CS K RC YT Y T ++ + TF + + + +FGCG +
Sbjct: 130 KLDTGVCSPEK-RCNYTYGYGDNSLTKGVLAQDTATFTSNTGKPVSLSRFLFGCGHNNTG 188
Query: 183 NFK--FSGIFGLGIGRSSLVSQL-----NSSFSYCIGSLHDPDYLHNKLILGDGA-IIDE 234
F G+ GLG G +SL+SQ+ FS C+ + +++ G G+ ++
Sbjct: 189 GFNDHEMGLIGLGGGPTSLISQIGPLFGGKKFSQCLVPFLTDIKISSRMSFGKGSQVLGN 248
Query: 235 G-DATPL--QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKE 291
G TPL + D Y++TL ISV+ +N I K + +++DSGT L ++
Sbjct: 249 GVVTTPLVPREKDTSYFVTLLGISVEDTYFPMNSTIGKAN-----MLVDSGTPPILLPQQ 303
Query: 292 AYEALRDEVMIRLEGEQMR-SYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS 350
Y+ + EV ++ + + S +LCY ++LKG PT+ FHF GA + L
Sbjct: 304 LYDKVFAEVRNKVALKPITDDPSLGTQLCYR--TQTNLKG-PTLTFHFV-GANVLLTPIQ 359
Query: 351 MFYQPRPDA---FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
F P P FC+A+ N + + G AQ Y +G+D+ R+ +F+ DC
Sbjct: 360 TFIPPTPQTKGIFCLAI----YNRTNSDPGVYGNFAQSNYLIGFDLDRQVVSFKPTDC 413
>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 496
Score = 141 bits (355), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 113/354 (31%), Positives = 168/354 (47%), Gaps = 26/354 (7%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+++ + IG+P + +DTGS + W+ C PC DC Q+ IF P+ SSS++ + C +
Sbjct: 160 YFLRVGIGRPSKTFYMVIDTGSDVNWLQCKPCDDCYQQVDPIFDPASSSSFSRLGCQTPQ 219
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
CR C C+Y Y G T +TE ++F N+ V V GCG
Sbjct: 220 CRNLDVFAC--RNDSCLYQVSYGDGSYTVGDFATETVSFGNSGS----VDKVAIGCGHDN 273
Query: 181 NRNFKFSGIFGLGIGRS-SLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
F + G SL SQ+ SSFSYC L + D + + + + A +
Sbjct: 274 EGLFVGAAGLIGLGGGPLSLTSQIKASSFSYC---LVNRDSVDSSTLEFNSAKPSDSVTA 330
Query: 239 PL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYE 294
P+ +D YY+ + +SV G L I P+IF+ D SG GG+++D GT VT L +AY
Sbjct: 331 PIFKNSKVDTFYYVGITGMSVGGEKLAIPPSIFEVDGSGKGGIIVDCGTAVTRLQTQAYN 390
Query: 295 ALRDE-VMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFY 353
ALRD V + + ++ D CY+ + ++ PTV F F GG L L S +
Sbjct: 391 ALRDTFVKLTKDLPSTSGFALFDT-CYNLSSRTSVR-VPTVAFLFDGGKSLPLPP-SNYL 447
Query: 354 QPRPDA--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
P A FC+A P + + S+IG + QQ V YD+ Q +F C
Sbjct: 448 IPVDSAGTFCLAFAPTT-----ASLSIIGNVQQQGTRVTYDLANSQVSFSSRKC 496
>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
Length = 496
Score = 141 bits (355), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 126/438 (28%), Positives = 187/438 (42%), Gaps = 48/438 (10%)
Query: 1 LIHRDSIL-SPYNNPNENPAHRVQRGINISIARLAYLQEKIRSH---------------- 43
L+HRDS+L N + R++ + AR+ L+++I
Sbjct: 75 LVHRDSLLFKGAANATASYERRLEEKLRREAARVRALEQRIERKLKLKKDPAGSYENVAG 134
Query: 44 --NTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGT 101
+ ++++ + + ++ I IG P Q+ +DTGS ++W+ C PCR+C Q
Sbjct: 135 VTAEFGSEVVSGMEQGSGEYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQADP 194
Query: 102 IFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKN 161
IF PS S S++ V CDS C C + C+Y Y G T +TE LTF
Sbjct: 195 IFNPSSSVSFSTVGCDSAVCSQLDANDC--HGGGCLYEVSYGDGSYTVGSYATETLTFGT 252
Query: 162 TDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRS-----SLVSQLNSSFSYCIGSLHD 216
T +Q+V GCG F + S L +Q +FSYC L D
Sbjct: 253 TS-----IQNVAIGCGHDNVGLFVGAAGLLGLGAGSLSFPAQLGTQTGRAFSYC---LVD 304
Query: 217 PDYLHN-KLILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPN-IFKRD 271
D + L G ++ TPL F+ YY+++ AISV G +LD P+ F+ D
Sbjct: 305 RDSESSGTLEFGPESVPIGSIFTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRID 364
Query: 272 DSG--GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLK 329
++ GG++IDSGT VT L AY+ALRD + + CY + +
Sbjct: 365 ETTGRGGIIIDSGTAVTRLQTSAYDALRDAFIAGTQHLPRADGISIFDTCYD-LSALQSV 423
Query: 330 GFPTVRFHFRGGAKLAL-EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYN 388
P V FHF GA L K+ + FC A PA N S++G + QQ
Sbjct: 424 SIPAVGFHFSNGAGFILPAKNCLIPMDSMGTFCFAFAPAD-----SNLSIMGNIQQQGIR 478
Query: 389 VGYDIGRKQKTFQRMDCE 406
V +D F C+
Sbjct: 479 VSFDSANSLVGFAIDQCQ 496
>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 119/426 (27%), Positives = 187/426 (43%), Gaps = 40/426 (9%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
LIHRDS SP N +E R+ + S R + E +T +A I +
Sbjct: 31 LIHRDSPKSPMYNSSETHFDRIVNALRRSSHRNTVVLES----DTAEAPIFNNG----GE 82
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ V IS+G PP DTGS ++W C PC +C Q +F PS+S++Y V C S
Sbjct: 83 YLVEISVGTPPFSIVAVADTGSDVIWTQCKPCSNCYQQNAPMFDPSKSTTYKNVACSSPV 142
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C Y + C+Y+ Y + ++ + +T ++T + V GCG
Sbjct: 143 CSYSGDGSSCSDDSECLYSIAYGDDSHSQGNLAVDTVTMQSTSGRPVAFPRTVIGCGHDN 202
Query: 181 NRNF--KFSGIFGLGIGRSSLVSQLNSS----FSYCI-----GSLHDPDYLH---NKLIL 226
F SGI GLG G +SLV+QL + FSYC+ GS +D L+ N +
Sbjct: 203 AGTFNANVSGIVGLGRGPASLVTQLGPATGGKFSYCLIPIGTGSTNDSTKLNFGSNANVS 262
Query: 227 GDGAIIDEGDATPL----QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSG 282
G G + +TP+ Q+ Y + LEA+SV + P + ++IDSG
Sbjct: 263 GSGTV-----STPIYSSAQY-KTFYSLKLEAVSVGDTKFNF-PEGASKLGGESNIIIDSG 315
Query: 283 TDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGA 342
T +T+L + + + + S C+ ++D P V HF GA
Sbjct: 316 TTLTYLPSALLNSFGSAISQSMSLPHAQDPSEFLDYCF--ATTTDDYEMPPVTMHFE-GA 372
Query: 343 KLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
+ L+++++F + D C+A +N ++ G +AQ + VGYDI +FQ
Sbjct: 373 DVPLQRENLFVRLSDDTICLAFGSFPDDNIFI----YGNIAQSNFLVGYDIKNLAVSFQP 428
Query: 403 MDCEVL 408
C +
Sbjct: 429 AHCGAV 434
>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 372
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 110/359 (30%), Positives = 163/359 (45%), Gaps = 22/359 (6%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
F V I +G PP +DTGS L W+ PCR C Q IF PS+SS+Y + C S
Sbjct: 25 FLVPIYLGTPPQKAVVIIDTGSDLTWIQSEPCRACFEQADPIFDPSKSSTYNKIACSSSA 84
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C + + CIY Y G T + S E +T +T + V+ G T
Sbjct: 85 CADLLGTQTCSAAANCIYAYGYGDGSVTRGYFSKETITATDTAGEEVKFGASVYNTG--T 142
Query: 181 NRNFKFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILGDGAIID-EG 235
+ GI GLG G S+ SQL S FSYC+ + + GD A+ E
Sbjct: 143 FGDTGGEGILGLGQGPVSMPSQLGSVLGNKFSYCLVDWLSAGSETSTMYFGDAAVPSGEV 202
Query: 236 DATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKE 291
TP+ H YYI ++ ISV G +LDI+ ++++ D G GG +IDSGT +T+L +E
Sbjct: 203 QYTPIVPNADHPTYYYIAVQGISVGGSLLDIDQSVYEIDSGGSGGTIIDSGTTITYLQQE 262
Query: 292 AYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSM 351
+ AL ++ S + D LC++ + FP + H G L L +
Sbjct: 263 VFNALVAAYTSQVRYPTTTSATGLD-LCFN-TRGTGSPVFPAMTIHLD-GVHLELPTANT 319
Query: 352 FYQPRPDAFCMAVNPASINNRYVNF--SLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
F + C+A A ++F ++ G + QQ +++ YD+ + F DC L
Sbjct: 320 FISLETNIICLAFASA------LDFPIAIFGNIQQQNFDIVYDLDNMRIGFAPADCASL 372
>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
Length = 749
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 118/396 (29%), Positives = 182/396 (45%), Gaps = 51/396 (12%)
Query: 46 YQAQILPSNDISNSL----FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGT 101
Y +Q++ + + SL +++++ IG PP +DTGS L W+ C PC C Q G
Sbjct: 173 YSSQLVATLESGVSLGSGEYFMDVFIGTPPKHYSLILDTGSDLNWIQCVPCIACFEQSGP 232
Query: 102 IFYPSRSSSYAYVPCDSEHCRYF----PYARCSAYKHRCIYTQLYLIGPETS----VFVS 153
+ P SSS+ + C C+ P C C Y Y T+ +
Sbjct: 233 YYDPKESSSFENITCHDPRCKLVSSPDPPKPCKDENQTCPYFYWYGDSSNTTGDFALETF 292
Query: 154 TEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIF-------GLGIGRSSLVSQLNS- 205
T LT N HV++V+FGCG NR G+F GLG G S SQL S
Sbjct: 293 TVNLTTPNGKSEQKHVENVMFGCG-HWNR-----GLFHGAAGLLGLGRGPLSFASQLQSI 346
Query: 206 ---SFSYCIGSLHDPDYLHNKLILG-DGAIIDEGDATPLQFIDGH-------YYITLEAI 254
SFSYC+ + + +KLI G D ++ + F+ G YY+ +++I
Sbjct: 347 YGHSFSYCLVDRNSDTSVSSKLIFGEDKELLSHPNLNFTSFVGGEENSVDTFYYVGIKSI 406
Query: 255 SVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYS 313
VDG +L I + + GGG +IDSGT +T+ + AYE +++ M +++G ++
Sbjct: 407 MVDGEVLKIPEETWHLSKEGGGGTIIDSGTTLTYFAEPAYEIIKEAFMKKIKGYELVEGF 466
Query: 314 WPDKLCYH--GIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAV--NPASI 369
P K CY+ GI +L F + F GA ++ F Q PD C+A+ P S
Sbjct: 467 PPLKPCYNVSGIEKMELPDFGIL---FSDGAMWDFPVENYFIQIEPDLVCLAILGTPKSA 523
Query: 370 NNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
S+IG QQ +++ YD+ + + + M C
Sbjct: 524 ------LSIIGNYQQQNFHILYDMKKSRLGYAPMKC 553
>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 495
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 126/415 (30%), Positives = 185/415 (44%), Gaps = 36/415 (8%)
Query: 2 IHRDSILSPYNNPNENPAHRVQRGIN-ISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
+HRDS + + R+Q +N +S + L LQ +I+ + + +
Sbjct: 106 LHRDS------SRVQAITTRLQLILNGVSKSDLKPLQTEIQPQD-LSTPVSSGTSQGSGE 158
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ + +G P + +DTGS + W+ C PC DC Q IF P+ SSSY+ + CDS+
Sbjct: 159 YFTRVGVGNPAKSYYMVLDTGSDINWIQCQPCSDCYQQSDPIFTPAASSSYSPLTCDSQQ 218
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C + C +C Y Y G T TE ++F + V + GCG
Sbjct: 219 CNSLQMSSC--RNGQCRYQVNYGDGSFTFGDFVTETMSFGGSGT----VNSIALGCGHDN 272
Query: 181 NRNFKFSGIFGLGIGRS-SLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
F + G SL SQL +SFSYC L + D + + + A + +
Sbjct: 273 EGLFVGAAGLLGLGGGPLSLTSQLKATSFSYC---LVNRDSAASSTLDFNSAPVGDSVIA 329
Query: 239 PL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYE 294
PL ID YY+ L +SV G +L I +FK DDSG GGV++D GT +T L EAY
Sbjct: 330 PLLKSSKIDTFYYVGLSGMSVGGELLRIPQEVFKLDDSGDGGVIVDCGTAITRLQSEAYN 389
Query: 295 ALRDEVMIRLEGEQMRSYSWPD--KLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMF 352
+LRD + +RS S CY S +K PTV FHF GG L + +
Sbjct: 390 SLRDSFVSM--SRHLRSTSGVALFDTCYDLSGQSSVK-VPTVSFHFDGGKSWDLPA-ANY 445
Query: 353 YQPRPDA--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
P A +C A P + + S+IG + QQ V +D+ + F C
Sbjct: 446 LIPVDSAGTYCFAFAPTT-----SSLSIIGNVQQQGTRVSFDLANNRVGFSTNKC 495
>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
Length = 448
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 107/366 (29%), Positives = 167/366 (45%), Gaps = 27/366 (7%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR--DCSPQLGTIFYPSRSSSYAYVPCDS 118
+ + +SIG PP+ DTGS L+W C PC C Q ++ P+ S+++ +PC+S
Sbjct: 92 YLMTLSIGTPPLSYPAIADTGSDLIWTQCAPCSGDQCFAQPAPLYNPASSTTFGVLPCNS 151
Query: 119 --EHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
C + C+Y Q Y G T+ +E TF + V + FGC
Sbjct: 152 SLSMCAGVLAGKAPPPGCACMYNQTYGTG-WTAGVQGSETFTFGSAAADQARVPGIAFGC 210
Query: 177 GFSTNRNFKFS-GIFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNKLILGDGAIIDE 234
+++ ++ S G+ GLG G SLVSQL + FSYC+ D + + L+LG A ++
Sbjct: 211 SNASSSDWNGSAGLVGLGRGSLSLVSQLGAGRFSYCLTPFQDTNS-TSTLLLGPSAALNG 269
Query: 235 GDATPLQFI--------DGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDV 285
F+ +YY+ L IS+ + L I+P+ F + D GG++IDSGT +
Sbjct: 270 TGVRSTPFVASPAKAPMSTYYYLNLTGISLGAKALSISPDAFSLKADGTGGLIIDSGTTI 329
Query: 286 TWLVKEAYEALRDEV--MIRLEGEQMRSYSWPDKLCYH-GIMSSDLKGFPTVRFHFRGGA 342
T LV AY+ +R V ++ L + D LCY +S P++ HF GA
Sbjct: 330 TSLVNAAYQQVRAAVQSLVTLPAIDGSDSTGLD-LCYALPTPTSAPPAMPSMTLHFD-GA 387
Query: 343 KLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
+ L DS +C+A+ N S G QQ ++ YD+ + +F
Sbjct: 388 DMVLPADSYMISGS-GVWCLAMR----NQTDGAMSTFGNYQQQNMHILYDVRNEMLSFAP 442
Query: 403 MDCEVL 408
C L
Sbjct: 443 AKCSTL 448
>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 420
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 113/390 (28%), Positives = 179/390 (45%), Gaps = 21/390 (5%)
Query: 30 IARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC 89
+ R A + ++R+ + Y A P + + ++IG+PPVP DTGS L W C
Sbjct: 41 LMRRAVHRSRLRALSGYDATS-PRLHSVQVEYLMELAIGKPPVPFVALADTGSDLTWTQC 99
Query: 90 YPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETS 149
PC+ C PQ ++ PS SS+++ +PC S C ++R C Y Y G ++
Sbjct: 100 QPCKLCFPQDTPVYDPSASSTFSPLPCSSATCLPI-WSRNCTPSSLCRYRYAYGDGAYSA 158
Query: 150 VFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRN-FKFSGIFGLGIGRSSLVSQLN-SSF 207
+ TE LT + + V V FGCG + +G GLG G SL++QL F
Sbjct: 159 GILGTETLTL-GPSSAPVSVGGVAFGCGTDNGGDSLNSTGTVGLGRGTLSLLAQLGVGKF 217
Query: 208 SYCIGSLHDPDYLHNKLILGDGAIIDEG----DATPLQFI---DGHYYITLEAISVDGRM 260
SYC+ + L + +LG A + G +TPL Y+++L+ IS+
Sbjct: 218 SYCLTDFFN-SALDSPFLLGTLAELAPGPSTVQSTPLLQSPQNPSRYFVSLQGISLGDVR 276
Query: 261 LDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLC 319
L I F R D GG+++DSGT T L + + + V R+ G+ + S D C
Sbjct: 277 LPIPNGTFDLRGDGTGGMIVDSGTTFTILAESGFREVVGRVA-RVLGQPPVNASSLDAPC 335
Query: 320 YHGIMSSDLKGFPTVRFHFRGGAKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSL 378
+ + + P + HF GGA + L +D+ M Y +FC+ + + + S+
Sbjct: 336 FPA-PAGEPPYMPDLVLHFAGGADMRLYRDNYMSYNEEDSSFCLNIAGTTPEST----SV 390
Query: 379 IGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
+G QQ + +D Q +F DC L
Sbjct: 391 LGNFQQQNIQMLFDTTVGQLSFLPTDCSKL 420
>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
Length = 533
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 111/376 (29%), Positives = 178/376 (47%), Gaps = 41/376 (10%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+++++ +G PP +DTGS L W+ C PC+ C Q G +F PS+S+S+ +PC++
Sbjct: 171 YFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPCKACFDQSGPVFDPSQSTSFKIIPCNAAA 230
Query: 121 CRYFPYARCSAYKHR-----CIYTQLYLIGPETSVFVSTEQLTFKNTDE-STIHVQDVVF 174
C + C + C Y Y TS ++ E L+ +D S++ ++D+V
Sbjct: 231 CDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLEIRDMVI 290
Query: 175 GCGFSTN-RNFKFSGIFGLGIGRSSLVSQLNS-----SFSYCIGSLHDPDYLHNKLILGD 228
GCG S G+ GLG G S SQL S SFSYC+ + + + + G
Sbjct: 291 GCGHSNKGLFQGAGGLLGLGQGALSFPSQLRSSPIGQSFSYCLVDRTNNLSVSSAISFGA 350
Query: 229 GAII----DEGDATPL----QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMI 279
G + D+ TP ++ YY+ ++ I +D +L I F +G GG +I
Sbjct: 351 GFALSRHFDQMRFTPFVRTNNSVETFYYLGIQGIKIDQELLPIPAERFAIAPNGSGGTII 410
Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK-----LCYHGIMSSDLKGFPTV 334
DSGT +T+L ++AY A+ + R+ SY D +CY+ + + FPT+
Sbjct: 411 DSGTTLTYLNRDAYRAVESAFLARI------SYPRADPFDILGICYNATGRTAVP-FPTL 463
Query: 335 RFHFRGGAKLALEKDSMFYQPRPD--AFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
F+ GA+L L +++ F QP P C+A+ P S+IG QQ + YD
Sbjct: 464 SIVFQNGAELDLPQENYFIQPDPQEAKHCLAILPTD------GMSIIGNFQQQNIHFLYD 517
Query: 393 IGRKQKTFQRMDCEVL 408
+ + F DC L
Sbjct: 518 VQHARLGFANTDCSAL 533
>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 109/361 (30%), Positives = 164/361 (45%), Gaps = 30/361 (8%)
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYV 114
+S + V + +G P DTGS WV C PC C Q G +F P++SS+YA V
Sbjct: 158 VSTGNYVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKGPLFDPAKSSTYANV 217
Query: 115 PCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
C C C+ C+Y Y G T F + + LT + ++ F
Sbjct: 218 SCTDSACADLDTNGCTG--GHCLYAVQYGDGSYTVGFFAQDTLTIAHD-----AIKGFRF 270
Query: 175 GCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDG 229
GCG N F K +G+ GLG G++SL Q +F+YC+ +L L G G
Sbjct: 271 GCGEKNNGLFGKTAGLMGLGRGKTSLTVQAYNKYGGAFAYCLPALTTGT---GYLDFGPG 327
Query: 230 AIIDEGDATPLQFIDGH--YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
+ + TP+ G YY+ + I V G+ + + ++F S G ++DSGT +T
Sbjct: 328 SAGNNARLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVF----STAGTLVDSGTVITR 383
Query: 288 LVKEAYEALR---DEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKL 344
L AY AL D+VM+ ++ YS D CY SD++ PTV F+GGA L
Sbjct: 384 LPATAYTALSSAFDKVMLARGYKKAPGYSILDT-CYDFTGLSDVE-LPTVSLVFQGGACL 441
Query: 345 ALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
++ + Y C+A + N + +++G Q+ Y V YD+G+K F
Sbjct: 442 DVDVSGIVYAISEAQVCLAF---ASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGS 498
Query: 405 C 405
C
Sbjct: 499 C 499
>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 430
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 123/423 (29%), Positives = 177/423 (41%), Gaps = 37/423 (8%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
LI R S +SP N V+ SI R + + I P D +
Sbjct: 30 LIPRHSPISPLYNSQMTQTELVKSAALRSITRSKRVNFIGQISPPLSPIITPIPD--HGE 87
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ + S+G P V + DTGS L W+ C PC+ C PQ +F P++SS+Y VPC+S+
Sbjct: 88 YLMRFSLGTPSVERLAIFDTGSDLSWLQCTPCKTCYPQEAPLFDPTQSSTYVDVPCESQP 147
Query: 121 CRYFPYAR--CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT--DESTIHVQDVVFGC 176
C FP + C + K +CIY Y T + + ++F +T + VFGC
Sbjct: 148 CTLFPQNQRECGSSK-QCIYLHQYGTDSFTIGRLGYDTISFSSTGMGQGGATFPKSVFGC 206
Query: 177 GFSTNRNFKFS----GIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILGD 228
F +N FK S G GLG G SL SQL FSYC+ KL G
Sbjct: 207 AFYSNFTFKISTKANGFVGLGPGPLSLASQLGDQIGHKFSYCMVPFSSTS--TGKLKFGS 264
Query: 229 GAIIDEGDATPLQF---IDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDV 285
A +E +TP +Y + LE I+V + + GG ++IDS +
Sbjct: 265 MAPTNEVVSTPFMINPSYPSYYVLNLEGITVGQKKV-------LTGQIGGNIIIDSVPIL 317
Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLA 345
T L + Y V + E P + C + + FP FHF GA +
Sbjct: 318 THLEQGIYTDFISSVKEAINVEVAEDAPTPFEYCVRNPTNLN---FPEFVFHFT-GADVV 373
Query: 346 LEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
L +MF + CM V P+ S+ G AQ + V YD+G K+ +F +C
Sbjct: 374 LGPKNMFIALDNNLVCMTVVPSK------GISIFGNWAQVNFQVEYDLGEKKVSFAPTNC 427
Query: 406 EVL 408
+
Sbjct: 428 STI 430
>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
Length = 497
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 116/364 (31%), Positives = 161/364 (44%), Gaps = 40/364 (10%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ + +G PP + +DTGS ++W+ C PC C Q +F P+ SS+Y VPC +
Sbjct: 153 YFTRLGVGTPPRYTYMVLDTGSDIMWIQCLPCAKCYGQTDPLFNPAASSTYRKVPCATPL 212
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C+ + C K C Y Y G T STE LTF+ ++ V GCG
Sbjct: 213 CKKLDISGCR-NKRYCEYQVSYGDGSFTVGDFSTETLTFRGQ-----VIRRVALGCGHDN 266
Query: 181 NRNFKFSGIFGLGIGR-----SSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
F + S +Q + FSYC+ + LI G AI
Sbjct: 267 EGLFIGAAGLLGLGRGSLSFPSQTGAQFSKRFSYCLVD-RSASGTASSLIFGKAAIPKSA 325
Query: 236 DATPLQF---IDGHYYITLEAISVDGRML-DINPNIFKRDDSG-GGVMIDSGTDVTWLVK 290
TPL +D YY+ L ISV GR L I ++F+ D +G GGV+IDSGT VT LV
Sbjct: 326 IFTPLLSNPKLDTFYYVELVGISVGGRRLTSIPASVFRMDATGNGGVIIDSGTSVTRLVD 385
Query: 291 EAYEALRDEVMIRLEGEQMRS---YSWPDKLCYHGIMSSDLKGF-----PTVRFHFRGGA 342
AY +RD R+ ++S +S D CY DL G PT+ FHF+GGA
Sbjct: 386 SAYSTMRDA--FRVGTGNLKSAGGFSLFDT-CY------DLSGLKTVKVPTLVFHFQGGA 436
Query: 343 KLALEKDSMFYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQ 401
++L + A FC A S+IG + QQ Y V +D + F+
Sbjct: 437 HISLPATNYLIPVDSSATFCFA-----FAGNTGGLSIIGNIQQQGYRVVFDSLANRVGFK 491
Query: 402 RMDC 405
C
Sbjct: 492 AGSC 495
>gi|124359514|gb|ABD28633.2| Peptidase aspartic, catalytic [Medicago truncatula]
Length = 181
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 78/198 (39%), Positives = 117/198 (59%), Gaps = 19/198 (9%)
Query: 211 IGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKR 270
+GSL D DY +N+LILG+ A + GD TP Q +G ++T+E IS+ + LDI P FK
Sbjct: 1 MGSLTDKDYDYNQLILGEEAYL-AGDTTPFQVYNGVNHVTMEGISIGQKSLDIAPGTFKM 59
Query: 271 DDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG 330
++G G G +T V+ ++ RL+ +++R P LCY G +S DLKG
Sbjct: 60 KNNGTG----GGLSLTQEVRNLFQ--------RLKFQEVRLQGSPWALCYFGSVSRDLKG 107
Query: 331 FPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
FP V F+F GGA + L+ + F Q + D FCM+V+P+ + S+IG++AQQ YNVG
Sbjct: 108 FPVVTFYFAGGAVIGLDTLNFFVQAKDDVFCMSVHPSH------DLSVIGLLAQQSYNVG 161
Query: 391 YDIGRKQKTFQRMDCEVL 408
YD + + +DC++L
Sbjct: 162 YDKDKGLIYIESIDCQLL 179
>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 469
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 106/366 (28%), Positives = 166/366 (45%), Gaps = 26/366 (7%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYVPCDS- 118
+ + ++IG PP+P DTGS L+W C PC C Q ++ P+ S++++ +PC+S
Sbjct: 112 YLMTLAIGTPPLPYAAVADTGSDLIWTQCAPCGTQCFEQPAPLYNPASSTTFSVLPCNSS 171
Query: 119 -EHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
C C+Y Q Y G T+ +E TF ++ V V FGC
Sbjct: 172 LSMCAGALAGAAPPPGCACMYNQTYGTG-WTAGVQGSETFTFGSSAADQARVPGVAFGCS 230
Query: 178 FSTNRNFKFS-GIFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
+++ ++ S G+ GLG G SLVSQL + FSYC+ D + + L+LG A ++
Sbjct: 231 NASSSDWNGSAGLVGLGRGSLSLVSQLGAGRFSYCLTPFQDTNS-TSTLLLGPSAALNGT 289
Query: 236 DATPLQFI--------DGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVT 286
F+ +YY+ L IS+ + L I+P F + D GG++IDSGT +T
Sbjct: 290 GVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTGGLIIDSGTTIT 349
Query: 287 WLVKEAYEALRDEV--MIRLEGEQMRSYSWPDKLCYH--GIMSSDLKGFPTVRFHFRGGA 342
L AY+ +R V ++ S S LC+ S+ P++ HF GA
Sbjct: 350 SLANAAYQQVRAAVKSLVTTLPTVDGSDSTGLDLCFALPAPTSAPPAVLPSMTLHFD-GA 408
Query: 343 KLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
+ L DS +C+A+ N S G QQ ++ YD+ + +F
Sbjct: 409 DMVLPADSYMISGS-GVWCLAMR----NQTDGAMSTFGNYQQQNMHILYDVREETLSFAP 463
Query: 403 MDCEVL 408
C L
Sbjct: 464 AKCSTL 469
>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
[Brachypodium distachyon]
Length = 540
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 112/357 (31%), Positives = 155/357 (43%), Gaps = 24/357 (6%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ I IG P + +DTGS + W+ C PC DC Q +F P+ SSSYA VPCDS H
Sbjct: 196 YFSRIGIGSPARQLYMVLDTGSDVTWLQCAPCADCYAQSDPLFDPALSSSYATVPCDSPH 255
Query: 121 CRYFPYARC----SAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
CR + C + C+Y Y G T +TE LT + +H DV GC
Sbjct: 256 CRALDASACHNNAANGNSSCVYEVAYGDGSYTVGDFATETLTLGGDGSAAVH--DVAIGC 313
Query: 177 GFSTNRNFKFSGIFGLGIGRS-SLVSQLNSS-FSYCIGSLHDPDYLHNKLILGDGAIIDE 234
G F + G S SQ++++ FSYC+ P + D + +
Sbjct: 314 GHDNEGLFVGAAGLLALGGGPLSFPSQISATEFSYCLVDRDSPSASTLQFGASDSSTV-- 371
Query: 235 GDATPLQ---FIDGHYYITLEAISVDGRML-DINPNIFKRDDSG-GGVMIDSGTDVTWLV 289
PL + YY+ L ISV G L DI P F D+ G GGV++DSGT VT L
Sbjct: 372 --TAPLMRSPRSNTFYYVALNGISVGGETLSDIPPAAFAMDEQGSGGVIVDSGTAVTRLQ 429
Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL-EK 348
AY ALRD + + S CY S ++ P V F GG +L L K
Sbjct: 430 SSAYSALRDAFVRGTQALPRASGVSLFDTCYDLAGRSSVQ-VPAVSLRFEGGGELKLPAK 488
Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ + +C+A A+ S++G + QQ V +D + F C
Sbjct: 489 NYLIPVDGAGTYCLAF--AATGGA---VSIVGNVQQQGIRVSFDTAKNTVGFSPNKC 540
>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
Length = 444
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 105/375 (28%), Positives = 169/375 (45%), Gaps = 41/375 (10%)
Query: 57 SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPC 116
S+ + + + IG P +DTGS L+W C PC C Q F P+ SS+Y + C
Sbjct: 88 SDGEYLMEMGIGTPARFYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPANSSTYRSLGC 147
Query: 117 DSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
+ C Y C Y+ C+Y Y T+ ++ E TF T+++ + + + FGC
Sbjct: 148 SAPACNALYYPLC--YQKTCVYQYFYGDSASTAGVLANETFTF-GTNDTRVTLPRISFGC 204
Query: 177 G-FSTNRNFKFSGIFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNKLILGDGAIIDE 234
G + SG+ G G G SLVSQL S FSYC+ S P + ++L G A ++
Sbjct: 205 GNLNAGSLANGSGMVGFGRGSLSLVSQLGSPRFSYCLTSFLSP--VRSRLYFGAYATLNS 262
Query: 235 GDATPLQ--------FIDGHYYITLEAISVDGRMLDINPNIFKRDDS--GGGVMIDSGTD 284
+A+ +Q + Y++ + ISV G L I+P + +D+ GG +IDSGT
Sbjct: 263 TNASTVQSTPFIINPALPTMYFLNMTGISVGGNRLPIDPAVLAINDTDGTGGTIIDSGTT 322
Query: 285 VTWLVKEAYEALRDEVMIRL----------EGEQMRS-YSWPDKLCYHGIMSSDLKGFPT 333
+T+L + AY A+R+ ++ L E + + + WP P
Sbjct: 323 ITYLAEPAYYAVREAFVLYLNSTLPLLDVTETSVLDTCFQWPPP-------PRQSVTLPQ 375
Query: 334 VRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
+ HF G ++ M P C+A+ +S + S+IG Q +NV YD+
Sbjct: 376 LVLHFDGADWELPLQNYMLVDPSTGGLCLAMATSS------DGSIIGSYQHQNFNVLYDL 429
Query: 394 GRKQKTFQRMDCEVL 408
+F C ++
Sbjct: 430 ENSLLSFVPAPCNLM 444
>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 472
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 112/389 (28%), Positives = 175/389 (44%), Gaps = 32/389 (8%)
Query: 41 RSHNTYQAQILPSNDISNSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQ 98
R+ T A+ D+ N Y+ ++IG PP+P DTGS L+W C PC C Q
Sbjct: 95 RTSTTVSART--RKDLPNGGEYLMTLAIGTPPLPYAAVADTGSDLIWTQCAPCGTQCFEQ 152
Query: 99 LGTIFYPSRSSSYAYVPCDS--EHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQ 156
++ P+ S++++ +PC+S C C+Y Q Y G T+ +E
Sbjct: 153 PAPLYNPASSTTFSVLPCNSSLSMCAGALAGAAPPPGCACMYYQTYGTG-WTAGVQGSET 211
Query: 157 LTFKNTDESTIHVQDVVFGCGFSTNRNFKFS-GIFGLGIGRSSLVSQLNSS-FSYCIGSL 214
TF ++ V V FGC +++ ++ S G+ GLG G SLVSQL + FSYC+
Sbjct: 212 FTFGSSAADQARVPGVAFGCSNASSSDWNGSAGLVGLGRGSLSLVSQLGAGRFSYCLTPF 271
Query: 215 HDPDYLHNKLILGDGAIIDEGDATPLQFI--------DGHYYITLEAISVDGRMLDINPN 266
D + + L+LG A ++ F+ +YY+ L IS+ + L I+P
Sbjct: 272 QDTNS-TSTLLLGPSAALNGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPG 330
Query: 267 IFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK----LCYH 321
F + D GG++IDSGT +T L AY+ +R V +L + + D LC+
Sbjct: 331 AFSLKPDGTGGLIIDSGTTITSLANAAYQQVRAAVKSQLV-TTLPTVDGSDSTGLDLCFA 389
Query: 322 --GIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLI 379
S+ P++ HF GA + L DS +C+A+ N S
Sbjct: 390 LPAPTSAPPAVLPSMTLHFD-GADMVLPADSYMISGS-GVWCLAMR----NQTDGAMSTF 443
Query: 380 GMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
G QQ ++ YD+ + +F C L
Sbjct: 444 GNYQQQNMHILYDVREETLSFAPAKCSTL 472
>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 385
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 118/386 (30%), Positives = 181/386 (46%), Gaps = 30/386 (7%)
Query: 37 QEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCS 96
Q K+ S + +QA ++ + + +++ +S+G PP + MDTGS +LW+ C PC C
Sbjct: 14 QTKVPSQD-FQAPVISGLSLGSGEYFIRVSVGTPPRGMYLVMDTGSDILWLQCAPCVSCY 72
Query: 97 PQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQ 156
Q +F P +SS+Y+ + C+S C C ++C+Y Y G ++ +T+
Sbjct: 73 HQCDEVFDPYKSSTYSTLGCNSRQCLNLDVGGC--VGNKCLYQVDYGDGSFSTGEFATDA 130
Query: 157 LTFKNTD-ESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNSS----FSYC 210
++ +T + + + GCG F +G+ GLG G S +Q+NS FSYC
Sbjct: 131 VSLNSTSGGGQVVLNKIPLGCGHDNEGYFVGAAGLLGLGKGPLSFPNQINSENGGRFSYC 190
Query: 211 IGSLHDPDYLHNKLILGDGAIIDEG-----DATPLQFIDGHYYITLEAISVDGRMLDINP 265
+ + LI GD A+ G A+ L+ + YY+ + ISV G +L I
Sbjct: 191 LTGRDTDSTERSSLIFGDAAVPPAGVRFTPQASNLR-VSTFYYLKMTGISVGGSILTIPT 249
Query: 266 NIFKRDDSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIM 324
+ F+ D G GGV+IDSGT VT L AY +LR+ + + CY+
Sbjct: 250 SAFQLDSLGNGGVIIDSGTSVTRLQNAAYASLREAFRAGTSDLVLTTEFSLFDTCYN--- 306
Query: 325 SSDLKG--FPTVRFHFRGGAKLALEKDSMFYQP--RPDAFCMAVNPASINNRYVNFSLIG 380
SDL PTV HF+GGA L L S + P FC+A + S+IG
Sbjct: 307 LSDLSSVDVPTVTLHFQGGADLKLPA-SNYLVPVDNSSTFCLAFAGTT------GPSIIG 359
Query: 381 MMAQQFYNVGYDIGRKQKTFQRMDCE 406
+ QQ + V YD Q F C+
Sbjct: 360 NIQQQGFRVIYDNLHNQVGFVPSQCD 385
>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
Length = 445
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 118/405 (29%), Positives = 177/405 (43%), Gaps = 42/405 (10%)
Query: 36 LQEKIRSHNTYQAQILPSNDISNSL----------FYVNISIGQPPVPQFTAMDTGSSLL 85
L+ + HN Q SN + S + + ++IG PPV DTGS L+
Sbjct: 51 LRRDMHRHNARQLAASSSNGTTVSAPTQISPTAGEYLMTLAIGTPPVSYQAIADTGSDLI 110
Query: 86 WVHCYPCRD-CSPQLGTIFYPSRSSSYAYVPCDS--EHCRYFPYARCSAYKHRCIYTQLY 142
W C PC C Q ++ PS S+++A +PC+S C C+Y Y
Sbjct: 111 WTQCAPCSSQCFQQPTPLYNPSSSTTFAVLPCNSSLSMCAAALAGTTPPPGCTCMYNMTY 170
Query: 143 LIGPETSVFVSTEQLTF-KNTDESTIHVQDVVFGC-----GFSTNRNFKFSGIFGLGIGR 196
G TSV+ +E TF +T + V + FGC GF+T+ SG+ GLG G
Sbjct: 171 GSG-WTSVYQGSETFTFGSSTPANQTGVPGIAFGCSNASGGFNTS---SASGLVGLGRGS 226
Query: 197 SSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFI--------DGHY 247
SLVSQL FSYC+ D + L+ ++ D G + F+ +Y
Sbjct: 227 LSLVSQLGVPKFSYCLTPYQDTNSTSTLLLGPSASLNDTGGVSSTPFVASPSDAPMSTYY 286
Query: 248 YITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEV--MIRL 304
Y+ L IS+ L I + D GG +IDSGT +T L AY+ +R V ++ L
Sbjct: 287 YLNLTGISLGTTALSIPTTALSLKADGTGGFIIDSGTTITLLGNTAYQQVRAAVVSLVTL 346
Query: 305 EGEQMRSYSWPDKLCYHGIMSSDL-KGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMA 363
S + LC+ S+ P++ HF GA + L DS + + +C+A
Sbjct: 347 PTTDGGSAATGLDLCFELPSSTSAPPTMPSMTLHFD-GADMVLPADS-YMMLDSNLWCLA 404
Query: 364 VNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
+ N S++G QQ ++ YD+G++ TF C L
Sbjct: 405 MQ----NQTDGGVSILGNYQQQNMHILYDVGQETLTFAPAKCSTL 445
>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 391
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 113/379 (29%), Positives = 172/379 (45%), Gaps = 39/379 (10%)
Query: 54 NDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAY 113
N + + + V+++IG PP P +DTGS L+W C PC C Q F PS SS+ +
Sbjct: 28 NGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSL 87
Query: 114 VPCDSEHCRYFPYARCSAYK----HRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHV 169
CDS C+ P A C + K C+YT Y T+ F+ ++ TF S V
Sbjct: 88 TSCDSTLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGAS---V 144
Query: 170 QDVVFGCGFSTNRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDP-------DY 219
V FGCG N FK +GI G G G SL SQL +FS+C ++ D
Sbjct: 145 PGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTTITGAIPSTVLLDL 204
Query: 220 LHNKLILGDGAIIDEGDATPL-QFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDS 273
+ G GA+ TPL Q+ YY++L+ I+V L + + F +
Sbjct: 205 PADLFSNGQGAV----QTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALTNG 260
Query: 274 GGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPT 333
GG +IDSGT +T L + Y+ +RDE +++ + + C+ S P
Sbjct: 261 TGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHYTCFSA-PSQAKPDVPK 319
Query: 334 VRFHFRGGAKLALEKDSMFYQPRPDA----FCMAVNPASINNRYVNFSLIGMMAQQFYNV 389
+ HF GA + L +++ ++ DA C+A+ N+ ++IG QQ +V
Sbjct: 320 LVLHFE-GATMDLPRENYVFEVPDDAGNSIICLAI------NKGDETTIIGNFQQQNMHV 372
Query: 390 GYDIGRKQKTFQRMDCEVL 408
YD+ +F C+ L
Sbjct: 373 LYDLQNNMLSFVAAQCDKL 391
>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
Length = 449
Score = 138 bits (348), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 110/376 (29%), Positives = 177/376 (47%), Gaps = 41/376 (10%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+++++ +G PP +DTGS L W+ C PC+ C Q G +F PS+S+S+ +PC++
Sbjct: 87 YFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPCKACFDQSGPVFDPSQSTSFKIIPCNAAA 146
Query: 121 CRYFPYARCSAYKHR-----CIYTQLYLIGPETSVFVSTEQLTFKNTDE-STIHVQDVVF 174
C + C + C Y Y TS ++ E L+ +D S++ ++D+V
Sbjct: 147 CDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLEIRDMVI 206
Query: 175 GCGFSTN-RNFKFSGIFGLGIGRSSLVSQLNS-----SFSYCIGSLHDPDYLHNKLILGD 228
GCG S G+ GLG G S SQL S SFSYC+ + + + + G
Sbjct: 207 GCGHSNKGLFQGAGGLLGLGQGALSFPSQLRSSPIGQSFSYCLVDRTNNLSVSSAISFGA 266
Query: 229 GAII----DEGDATPL----QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMI 279
G + D+ TP ++ YY+ ++ I +D +L I F +G GG +I
Sbjct: 267 GFALSRHFDQMKFTPFVRTNNSVETFYYLGIQGIKIDQELLPIPAERFAIATNGSGGTII 326
Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK-----LCYHGIMSSDLKGFPTV 334
DSGT +T+L ++AY A+ + R+ SY D +CY+ + + FP +
Sbjct: 327 DSGTTLTYLNRDAYRAVESAFLARI------SYPRADPFDILGICYNATGRAAVP-FPAL 379
Query: 335 RFHFRGGAKLALEKDSMFYQPRPD--AFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
F+ GA+L L +++ F QP P C+A+ P S+IG QQ + YD
Sbjct: 380 SIVFQNGAELDLPQENYFIQPDPQEAKHCLAILPTD------GMSIIGNFQQQNIHFLYD 433
Query: 393 IGRKQKTFQRMDCEVL 408
+ + F DC L
Sbjct: 434 VQHARLGFANTDCSAL 449
>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
Length = 455
Score = 138 bits (347), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 121/379 (31%), Positives = 175/379 (46%), Gaps = 45/379 (11%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLGTIFYPSRSSSYAYVPCDS 118
+ +NIS+G PP+ +DTGS+L+W C PC C P + P+RSS+++ +PC+
Sbjct: 91 YNMNISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQPARSSTFSRLPCNG 150
Query: 119 EHCRYFPYA---RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
C+Y P + R C Y Y G T+ +++TE LT + V FG
Sbjct: 151 SFCQYLPTSSRPRTCNATAACAYNYTYGSG-YTAGYLATETLTVGDGT-----FPKVAFG 204
Query: 176 CGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
C + N SGI GLG G SLVSQL FSYC+ S D + ++ G A + E
Sbjct: 205 CS-TENGVDNSSGIVGLGRGPLSLVSQLAVGRFSYCLRS-DMADGGASPILFGSLAKLTE 262
Query: 235 GD---ATPL---QFI--DGHYYITLEAISVDGRMLDINPNI--FKRDDSGGGVMIDSGTD 284
G +TPL ++ HYY+ L I+VD L + + F + GGG ++DSGT
Sbjct: 263 GSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLGGGTIVDSGTT 322
Query: 285 VTWLVKEAYEALRDEV---MIRLEGEQMRSYSWPD-KLCYHGIMSSDLKG--FPTVRFHF 338
+T+L K+ Y ++ M L S + D LCY K P + F
Sbjct: 323 LTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGGGGKAVRVPRLALRF 382
Query: 339 RGGAK---------LALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNV 389
GGAK +E DS Q R C+ V PA+ + + S+IG + Q ++
Sbjct: 383 AGGAKYNVPVQNYFAGVEADS---QGRVTVACLLVLPATDD---LPISIIGNLMQMDMHL 436
Query: 390 GYDIGRKQKTFQRMDCEVL 408
YDI +F DC L
Sbjct: 437 LYDIDGGMFSFAPADCAKL 455
>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
Length = 440
Score = 138 bits (347), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 122/426 (28%), Positives = 193/426 (45%), Gaps = 39/426 (9%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDI--SN 58
LIH DS SP+ N + + ++ SI+R L + + P I +N
Sbjct: 34 LIHHDSPPSPFYNSSMTRSQLIRNAAMRSISRANQLSLSLSHSLNQLKESSPEPIIIPNN 93
Query: 59 SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD--CSPQLGTIFYPSRSSSYAYVPC 116
+ + I IG P V + DTGS L WV C PC + C Q ++ P SS++ +PC
Sbjct: 94 GNYLMRIYIGTPSVERLAIADTGSDLTWVQCSPCDNTKCFAQNTPLYDPLNSSTFTLLPC 153
Query: 117 DSEHCRYFPYAR--CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
DS+ C PY++ CS Y CIY Y G + + + + + + F
Sbjct: 154 DSQPCTQLPYSQYVCSDYGD-CIYAYTY--GDNSYSYGGLSSDSIRLMLLQLHYNSKICF 210
Query: 175 GCG----FSTNRNFKFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLIL 226
GCG F+ +++ K +GI GLG G SLVSQL FSYC+ L ++KL
Sbjct: 211 GCGFQNKFTADKSGKTTGIVGLGAGPLSLVSQLGDEIGHKFSYCL--LPFSSNSNSKLKF 268
Query: 227 GDGAIIDEGD---ATPLQFIDG--HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDS 281
G+ AI+ +G+ +TPL YY+ LE I+V + + K + G ++IDS
Sbjct: 269 GEAAIV-QGNGVVSTPLIIKPDLPFYYLNLEGITVGAKTV-------KTGQTDGNIIIDS 320
Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
G+ +T+L + Y V + E+ + +P C+ + P V FHF GG
Sbjct: 321 GSTLTYLEESFYNEFVSLVKETVAVEEDQYIPYPFDFCF--TYKEGMSTPPDVVFHFTGG 378
Query: 342 AKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQ 401
+ L+ + + C V P+ + ++ G + Q ++VGYDI + +F
Sbjct: 379 -DVVLKPMNTLVLIEDNLICSTVVPS----HFDGIAIFGNLGQIDFHVGYDIQGGKVSFA 433
Query: 402 RMDCEV 407
DC +
Sbjct: 434 PTDCSL 439
>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 138 bits (347), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 128/406 (31%), Positives = 190/406 (46%), Gaps = 36/406 (8%)
Query: 16 ENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQF 75
E H V+RG + + R + S++ A +LP N F + ++IG PP
Sbjct: 57 ERIQHGVKRGRH-RLQRFKAMALVASSNSEIDAPVLPGN----GEFLMKLAIGTPPETYS 111
Query: 76 TAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHR 135
MDTGS L+W C PC C Q IF P +SSS++ + C S+ C P + CS
Sbjct: 112 AIMDTGSDLIWTQCKPCTQCFDQPTPIFDPKKSSSFSKLSCSSKLCEALPQSTCS---DG 168
Query: 136 CIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF---SGIFGL 192
C Y Y T +++E LTF + V +V FGCG N F SG+ GL
Sbjct: 169 CEYLYGYGDYSSTQGMLASETLTFGK-----VSVPEVAFGCG-EDNEGSGFSQGSGLVGL 222
Query: 193 GIGRSSLVSQLNS-SFSYCIGSLHDPDYLHNKLILGDGAIIDEGDA----TPL---QFID 244
G G SLVSQL FSYC+ S+ D + L++G A + D+ TPL
Sbjct: 223 GRGPLSLVSQLKEPKFSYCLTSVDDTK--ASTLLMGSLASVKASDSEIKTTPLIQNSAQP 280
Query: 245 GHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIR 303
YY++LE ISV L I + F ++D GG++IDSGT +T+L + A++ + E +
Sbjct: 281 SFYYLSLEGISVGDTSLPIKKSTFSLQEDGSGGLIIDSGTTITYLEQSAFDLVAKEFTSQ 340
Query: 304 LEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS-MFYQPRPDAFCM 362
+ S S ++C+ S P + FHF GA L L ++ M C+
Sbjct: 341 INLPVDNSGSTGLEVCFTLPSGSTDIEVPKLVFHFD-GADLELPAENYMIADASMGVACL 399
Query: 363 AVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
A+ +S S+ G + QQ V +D+ ++ +F C+ L
Sbjct: 400 AMGSSS------GMSIFGNIQQQNMLVLHDLEKETLSFLPTQCDEL 439
>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
Length = 449
Score = 137 bits (346), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 116/405 (28%), Positives = 186/405 (45%), Gaps = 46/405 (11%)
Query: 31 ARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC- 89
ARLA R A +P +S+ + + IG PP P+ +DTGS L+W C
Sbjct: 60 ARLA------RVLGNLSAADVPVAPLSDQGHSLTVGIGTPPQPRTLIVDTGSDLIWTQCS 113
Query: 90 ------YPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR--YFPYARCSAYKHRCIYTQL 141
S Q ++ P RSSS+AY+PC C+ F Y C A +RC+Y +L
Sbjct: 114 MLSRRTRTAASASRQREPLYEPRRSSSFAYLPCSDRLCQEGQFSYKNC-ARNNRCMYDEL 172
Query: 142 YLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG-FSTNRNFKFSGIFGLGIGRSSLV 200
Y E +++E TF + ++ + FGCG S SG+ GL G SLV
Sbjct: 173 Y-GSAEAGGVLASETFTFGVNAKVSLPLG---FGCGALSAGDLVGASGLMGLSPGIMSLV 228
Query: 201 SQLN-SSFSYCIGSLHDPDYLHNKLILGDGAII----DEGDATPLQFI------DGHYYI 249
SQL+ FSYC+ + + L+ G A + G + +YY+
Sbjct: 229 SQLSVPRFSYCLTPFAERK--TSPLLFGAMADLRRYRTTGTVQTTSILRNPAMETAYYYV 286
Query: 250 TLEAISVDGRMLDINPNIFK--RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVM--IRLE 305
L +S+ + LD+ + D GG ++DSG+ +++L + A+ A++ V+ +RL
Sbjct: 287 PLVGLSLGTKRLDVPATSLGMIKPDGSGGTIVDSGSTMSYLEETAFRAVKKAVVEAVRLP 346
Query: 306 GEQMRSYSWPD-KLCYH---GIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFC 361
+ D +LC+ G+ +K P V HF GGA + L +D+ F +PR C
Sbjct: 347 VANGTDEDYDDYELCFALPTGVAMEAVKTPPLV-LHFDGGAAMTLPRDNYFQEPRAGLMC 405
Query: 362 MAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
+AV + S+IG + QQ +V +D+ ++ +F C+
Sbjct: 406 LAVGTSPDG---FGVSIIGNVQQQNMHVLFDVRNQKFSFAPTKCD 447
>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
Length = 506
Score = 137 bits (346), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 113/353 (32%), Positives = 151/353 (42%), Gaps = 20/353 (5%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ + IG P + +DTGS + WV C PC DC Q +F PS S+SYA V CDS+
Sbjct: 166 YFSRVGIGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSQR 225
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
CR A C C+Y Y G T +TE LT + + V +V GCG
Sbjct: 226 CRDLDTAACRNATGACLYEVAYGDGSYTVGDFATETLTLGD----STPVGNVAIGCGHDN 281
Query: 181 NRNFKFSGIFGLGIGRS-SLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
F + G S SQ++ S+FSYC+ P + L GDGA
Sbjct: 282 EGLFVGAAGLLALGGGPLSFPSQISASTFSYCLVDRDSP--AASTLQFGDGAAEAGTVTA 339
Query: 239 PL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDS--GGGVMIDSGTDVTWLVKEAY 293
PL YY+ L ISV G+ L I + F D + GGV++DSGT VT L AY
Sbjct: 340 PLVRSPRTSTFYYVALSGISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQSAAY 399
Query: 294 EALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL-EKDSMF 352
ALRD + S CY + ++ P V F GG L L K+ +
Sbjct: 400 AALRDAFVQGAPSLPRTSGVSLFDTCYDLSDRTSVE-VPAVSLRFEGGGALRLPAKNYLI 458
Query: 353 YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+C+A P + S+IG + QQ V +D R F C
Sbjct: 459 PVDGAGTYCLAFAPTN-----AAVSIIGNVQQQGTRVSFDTARGAVGFTPNKC 506
>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
Length = 502
Score = 137 bits (346), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 117/346 (33%), Positives = 171/346 (49%), Gaps = 30/346 (8%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ I +G P + +DTGS + W+ C PC +C Q IF P+ SS++ + C
Sbjct: 164 YFSRIGVGTPAKEMYVVLDTGSDVNWIQCLPCSECYQQSDPIFDPTSSSTFKSLTCSDPK 223
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C + C + ++C+Y Y G T +T+ +TF + + V DV GCG
Sbjct: 224 CASLDVSACRS--NKCLYQVSYGDGSFTVGNYATDTVTFGESGK----VNDVALGCGHDN 277
Query: 181 NRNFK-FSGIFGLGIGRSSLVSQLNS-SFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
F +G+ GLG G S+ +Q+ + SFSYC L D D + + + I GDAT
Sbjct: 278 EGLFTGAAGLLGLGGGALSMTNQIKAKSFSYC---LVDRDSAKSSSLDFNSVQIGAGDAT 334
Query: 239 -PL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAY 293
PL +D YY+ L SV G+ + I ++F+ D SG GGV++D GT VT L +AY
Sbjct: 335 APLLRNSKMDTFYYVGLSGFSVGGQQVSIPSSLFEVDASGAGGVILDCGTAVTRLQTQAY 394
Query: 294 EALRDEVMIRLEGEQMRSYSWPDKL---CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS 350
+LRD ++L + + S P L CY S +K PTV FHF GG L L +
Sbjct: 395 NSLRD-AFVKLTTDFKKGTS-PISLFDTCYDFSSLSTVK-VPTVTFHFTGGKSLNLPAKN 451
Query: 351 MFYQPRPDA--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
+ P DA FC A P S + S+IG + QQ + YD+
Sbjct: 452 -YLIPIDDAGTFCFAFAPTS-----SSLSIIGNVQQQGTRITYDLA 491
>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
Length = 500
Score = 137 bits (345), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 111/366 (30%), Positives = 158/366 (43%), Gaps = 21/366 (5%)
Query: 47 QAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPS 106
Q ++ + + ++ + +G P + +DTGS + WV C PC DC Q +F PS
Sbjct: 149 QGPVVSGVGLGSGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPS 208
Query: 107 RSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDEST 166
S+SYA V CD+ C A C C+Y Y G T +TE LT + +
Sbjct: 209 LSTSYASVACDNPRCHDLDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTLGD----S 264
Query: 167 IHVQDVVFGCGFSTNRNFKFSGIFGLGIGRS-SLVSQLN-SSFSYCIGSLHDPDYLHNKL 224
V V GCG F + G S SQ++ ++FSYC+ P + L
Sbjct: 265 APVSSVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISATTFSYCLVDRDSPS--SSTL 322
Query: 225 ILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMID 280
GD A D PL YY+ L ISV G++L I P+ F D +G GGV++D
Sbjct: 323 QFGDAA--DAEVTAPLIRSPRTSTFYYVGLSGISVGGQILSIPPSAFAMDGTGAGGVIVD 380
Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRG 340
SGT VT L AY ALRD + + S CY + ++ P V F G
Sbjct: 381 SGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVE-VPAVSLRFAG 439
Query: 341 GAKLAL-EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKT 399
G +L L K+ + +C+A P + S+IG + QQ V +D +
Sbjct: 440 GGELRLPAKNYLIPVDGAGTYCLAFAPTN-----AAVSIIGNVQQQGTRVSFDTAKSTVG 494
Query: 400 FQRMDC 405
F C
Sbjct: 495 FTSNKC 500
>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 484
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 123/428 (28%), Positives = 182/428 (42%), Gaps = 42/428 (9%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNT--------YQAQILP 52
++HR SP P H +N AR+ + KI + + + LP
Sbjct: 77 VVHRQGPCSPLQARGAPPPH--AELLNDDQARVDSIHRKIAAAASPVLDQARGKKGVTLP 134
Query: 53 SN---DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSS 109
+ + + V++ +G P DTGS L WV C PC DC Q +F P+RSS
Sbjct: 135 AQRGISLGTGNYVVSMGLGTPARDMTVVFDTGSDLSWVQCTPCSDCYEQKDPLFDPARSS 194
Query: 110 SYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHV 169
+Y+ VPC S C+ CS K +C Y +Y +T ++ + LT +D +
Sbjct: 195 TYSAVPCASPECQGLDSRSCSRDK-KCRYEVVYGDQSQTDGALARDTLTLTQSDV----L 249
Query: 170 QDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKL 224
VFGCG F + G+ GLG + SL SQ S FSYC+ S L
Sbjct: 250 PGFVFGCGEQDTGLFGRADGLVGLGREKVSLSSQAASKYGAGFSYCLPSSPS---AAGYL 306
Query: 225 ILGDGAIIDEGDATPLQF---IDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDS 281
LG G T ++ YY+ L + V GR + ++P +F S G +IDS
Sbjct: 307 SLG-GPAPANARFTAMETRHDSPSFYYVRLVGVKVAGRTVRVSPIVF----SAAGTVIDS 361
Query: 282 GTDVTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHF 338
GT +T L Y ALR M R ++ + S D CY + ++ P+V F
Sbjct: 362 GTVITRLPPRVYAALRSAFARSMGRYGYKRAPALSILDT-CYDFTGHTTVR-IPSVALVF 419
Query: 339 RGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQK 398
GGA + L+ + Y + C+A P N + +IG Q+ V YD+ R++
Sbjct: 420 AGGAAVGLDFSGVLYVAKVSQACLAFAP---NGDGADAGIIGNTQQKTLAVVYDVARQKI 476
Query: 399 TFQRMDCE 406
F C
Sbjct: 477 GFGANGCS 484
>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 494
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 126/428 (29%), Positives = 184/428 (42%), Gaps = 36/428 (8%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRG---INISIARLAYLQEKIRSHNTYQAQILPSNDIS 57
++HRD + E HR+QR A R+ + A ++
Sbjct: 80 VVHRDDFVV-NATAAELLGHRLQRDGKRAARISAAAGAANGTRRTGSGVVAPVVSGLAQG 138
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
+ ++ I +G P P +DTGS ++W+ C PCR C Q G +F P RS SY V C
Sbjct: 139 SGEYFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYDQSGQVFDPRRSRSYGAVGCS 198
Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
+ CR C + C+Y Y G T+ +TE LTF V + GCG
Sbjct: 199 APLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFAG----GARVARIALGCG 254
Query: 178 FSTNRNFKFSGIFGLGIGRSSL------VSQLNSSFSYCI---GSLHDPDYLHNKLILGD 228
N + LG+GR SL + SFSYC+ S +P + + G
Sbjct: 255 HD-NEGLFVAAAGLLGLGRGSLSFPAQISRRYGRSFSYCLVDRTSSANPASHSSTVTFGS 313
Query: 229 GAIIDEGDA--TPL---QFIDGHYYITLEAISVDG-RMLDINPNIFKRDDSG--GGVMID 280
GA+ A TP+ ++ YY+ L ISV G R+ + + + D S GGV++D
Sbjct: 314 GAVGSTVAASFTPMVKNPRMETFYYVQLVGISVGGARVSGVADSDLRLDPSSGRGGVIVD 373
Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMR--SYSWPDKLCYHGIMSSDLKGFPTVRFHF 338
SGT VT L + AY ALRD G ++ +S D CY + + PTV HF
Sbjct: 374 SGTSVTRLARPAYSALRDAFRAAAAGLRLSPGGFSLFDT-CYD-LSGRKVVKVPTVSMHF 431
Query: 339 RGGAKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
GGA+ AL ++ + FC A A + S+IG + QQ + V +D ++
Sbjct: 432 AGGAEAALPPENYLIPVDSKGTFCFAF--AGTDG---GVSIIGNIQQQGFRVVFDGDGQR 486
Query: 398 KTFQRMDC 405
F C
Sbjct: 487 VGFVPKGC 494
>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
Length = 498
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 123/442 (27%), Positives = 189/442 (42%), Gaps = 58/442 (13%)
Query: 1 LIHRDSIL-SPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNT-------------- 45
++HRD++L N + R++ + R+ L+ +I T
Sbjct: 78 VVHRDALLLKNAANATASYERRLKEKLRREAVRVRGLERQIERTLTLNKDPVNRYENVAE 137
Query: 46 ----YQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGT 101
+ +++ + + ++ I +G P Q+ +DTGS + W+ C PCR+C Q
Sbjct: 138 VDADFGGEVVSGMEQGSGEYFTRIGVGTPTREQYMVLDTGSDVAWIQCEPCRECYSQADP 197
Query: 102 IFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKN 161
IF PS S+S++ V CDS C C + C+Y Y G ++ +TE LTF
Sbjct: 198 IFNPSYSASFSTVGCDSAVCSQLDAYDC--HSGGCLYEASYGDGSYSTGSFATETLTFGT 255
Query: 162 TDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGR-----SSLVSQLNSSFSYCIGSLHD 216
T V +V GCG F + + + +Q +FSYC+
Sbjct: 256 TS-----VANVAIGCGHKNVGLFIGAAGLLGLGAGALSFPNQIGTQTGHTFSYCLVDRES 310
Query: 217 PDYLHNKLILGDGAIIDEGDATPLQ---FIDGHYYITLEAISVDGRMLD-INPNIFKRDD 272
L G ++ TPL+ + YY+++ AISV G +LD I P +F+ D+
Sbjct: 311 DS--SGPLQFGPKSVPVGSIFTPLEKNPHLPTFYYLSVTAISVGGALLDSIPPEVFRIDE 368
Query: 273 SG--GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD-KLCYHGIMSSDLK 329
+ GG +IDSGT VT LV AY+A+RD + G+ R+ + CY DL
Sbjct: 369 TSGHGGFIIDSGTVVTRLVTSAYDAVRD-AFVAGTGQLPRTDAVSIFDTCY------DLS 421
Query: 330 GF-----PTVRFHFRGGAKLAL-EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMA 383
G PTV FHF GA L L K+ + FC A PA+ + S++G
Sbjct: 422 GLQFVSVPTVGFHFSNGASLILPAKNYLIPMDTVGTFCFAFAPAA-----SSVSIMGNTQ 476
Query: 384 QQFYNVGYDIGRKQKTFQRMDC 405
QQ V +D F C
Sbjct: 477 QQHIRVSFDSANSLVGFAFDQC 498
>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
Length = 398
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 107/364 (29%), Positives = 161/364 (44%), Gaps = 26/364 (7%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ IS+G P DTGS L+W+ C PC+ C Q IF P SSSY + C
Sbjct: 40 YVTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKDPIFDPEGSSSYTTMSCGDTL 99
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C P CS C Y+ Y G T +S+E +T +T + +++ FGCG
Sbjct: 100 CDSLPRKSCSP---NCDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAAKNIAFGCGHLN 156
Query: 181 NRNFK-FSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
+F SG+ GLG G S VSQL FSYC+ D + + GD +
Sbjct: 157 RGSFNDASGLVGLGRGNLSFVSQLGDLFGHKFSYCLVPWRDAPSKTSPMFFGDESSSHSS 216
Query: 236 DA------TPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDV 285
TP+ ++ YY+ L+ IS+ GR L I F + D GG++ DSGT +
Sbjct: 217 GKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDIKPDGSGGMIFDSGTTL 276
Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCY--HGIMSSDLKGFPTVRFHFRGGA- 342
T L Y+ + + ++ ++ S LCY G +S K P + FHF G
Sbjct: 277 TLLPDAPYQIVLRALRSKVSFPEIDGSSAGLDLCYDVSGSKASYKKKIPAMVFHFEGADH 336
Query: 343 KLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
+L +E + C+A+ +++ + + G M QQ + V YDIG + +
Sbjct: 337 QLPVENYFIAANDAGTIVCLAMVSSNM-----DIGIYGNMMQQNFRVMYDIGSSKIGWAP 391
Query: 403 MDCE 406
C+
Sbjct: 392 SQCD 395
>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
Length = 774
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 114/382 (29%), Positives = 178/382 (46%), Gaps = 40/382 (10%)
Query: 53 SNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYA 112
+N + ++ + V+++IG PP P +DTGS L+W C PC C + PS SS++
Sbjct: 407 ANGVPDTEYLVHLAIGTPPQPVQLILDTGSDLVWTQCRPCPVCFSRALGPLDPSNSSTFD 466
Query: 113 YVPCDSEHCRYFPYARCSAY---KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES-TIH 168
+PC S C ++ C + C+Y Y G T+ + E TF D +
Sbjct: 467 VLPCSSPVCDNLTWSSCGKHNWGNQTCVYVYAYADGSITTGHLDAETFTFAAADGTGQAT 526
Query: 169 VQDVVFGCGFSTNRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLI 225
V D+ FGCG N F +GI G G G SL SQL +FS+C ++ + + ++
Sbjct: 527 VPDLAFGCGLFNNGIFTSNETGIAGFGRGALSLPSQLKVDNFSHCFTAITGSE--PSSVL 584
Query: 226 LG---------DGAIIDEGDATPL-QFIDG--HYYITLEAISVDGRMLDINPNIFK-RDD 272
LG DGA+ +TPL Q YY++L+ I+V L I + F + D
Sbjct: 585 LGLPANLYSDADGAV----QSTPLVQNFSSLRAYYLSLKGITVGSTRLPIPESTFALKQD 640
Query: 273 SGGGVMIDSGTDVTWLVKEAYEALRDE--VMIRLEGEQMRSYSWPDKLCYHGIMSSDLK- 329
GG +IDSGT +T L ++AY+ + D +RL + S S +LC+ + K
Sbjct: 641 GTGGTIIDSGTGMTTLPQDAYKLVHDAFTAQVRLPVDNATSSSL-SRLCFSFSVPRRAKP 699
Query: 330 GFPTVRFHFRGGAKLALEKDSMFYQ---PRPDAFCMAVNPASINNRYVNFSLIGMMAQQF 386
P + HF GA L L +++ ++ C+A+N + ++IG QQ
Sbjct: 700 DVPKLVLHFE-GATLDLPRENYMFEFEDAGGSVTCLAINAGD------DLTIIGNYQQQN 752
Query: 387 YNVGYDIGRKQKTFQRMDCEVL 408
+V YD+ R +F C L
Sbjct: 753 LHVLYDLVRNMLSFVPAQCNRL 774
>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 355
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 102/356 (28%), Positives = 157/356 (44%), Gaps = 35/356 (9%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ + +G P +DTGS L WV C PC C Q ++F P+ S+S+ + C +E
Sbjct: 3 YLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGTCYSQNDSLFIPNTSTSFTKLACGTEL 62
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C PY C+ + C+Y Y G ++ + +T + V + FGCG
Sbjct: 63 CNGLPYPMCN--QTTCVYWYSYGDGSLSTGDFVYDTITMDGINGQKQQVPNFAFGCGHDN 120
Query: 181 NRNFKFS-GIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
+F + GI GLG G S SQL N FSYC+ P + L+ GD A+
Sbjct: 121 EGSFAGADGILGLGQGPLSFPSQLKTVFNGKFSYCLVDWLAPPTQTSPLLFGDAAVPTFP 180
Query: 236 DATPLQFIDG-----HYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLV 289
+ + +YY+ L ISV G++L+I+ F D G G + DSGT VT L
Sbjct: 181 GVKYISLLTNPKVPTYYYVKLNGISVGGKLLNISSTAFDIDSVGRAGTIFDSGTTVTQLA 240
Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPDK--------LCYHGIMSSDLKGFPTVRFHFRGG 341
E ++ EV+ + M +P K LC G L P++ FHF GG
Sbjct: 241 GEVHQ----EVLAAMNASTM---DYPRKSDDSSGLDLCLGGFAEGQLPTVPSMTFHFEGG 293
Query: 342 AKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD-IGRK 396
+ + ++C ++ + + ++IG + QQ + V YD +GRK
Sbjct: 294 DMELPPSNYFIFLESSQSYCFSMVSSP------DVTIIGSIQQQNFQVYYDTVGRK 343
>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 429
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 122/423 (28%), Positives = 196/423 (46%), Gaps = 40/423 (9%)
Query: 1 LIHRDSILSPY-----NNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSND 55
LI+R+ SP P+E V+RG R A L + + + + + S
Sbjct: 32 LIYREHQSSPLRSETLKTPSEIFIAAVKRGHE----RRARLAKHVLAGDQLFETPVASG- 86
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
N + ++IS G PP +DTGS L WV C PC+ C L F PS+S+SY +
Sbjct: 87 --NGEYLIDISYGNPPQKSTAIVDTGSDLNWVQCLPCKSCYETLSAKFDPSKSASYKTLG 144
Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
C S C+ P+ C+A C Y +Y G TS +ST+ +T T + +V FG
Sbjct: 145 CGSNFCQDLPFQSCAA---SCQYDYMYGDGSSTSGALSTDDVTI-----GTGKIPNVAFG 196
Query: 176 CGFSTNRNFKFSGIFGLGIGRS-SLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGA 230
CG S F +G SLVSQL + FSYC+ L + L +GD
Sbjct: 197 CGNSNLGTFAGAGGLVGLGKGPLSLVSQLGGTATKKFSYCLVPLGSTK--TSPLYIGDST 254
Query: 231 IIDEGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVT 286
+ TP+ + + YY L+ ISV+G+ ++ N F +G GG+++DSGT +T
Sbjct: 255 LAGGVAYTPMLTNNNYPTFYYAELQGISVEGKAVNYPANTFDIAATGRGGLILDSGTTLT 314
Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
+L +A+ + + L + + + C+ ++ +PTV FHF GA +AL
Sbjct: 315 YLDVDAFNPMVAALKAALPYPEADGSFYGLEYCFSTAGVAN-PTYPTVVFHFN-GADVAL 372
Query: 347 EKDSMFYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
D+ F + C+A+ ++ FS+ G + Q + + +D+ K+ F+ +C
Sbjct: 373 APDNTFIALDFEGTTCLAMASST------GFSIFGNIQQLNHVIVHDLVNKRIGFKSANC 426
Query: 406 EVL 408
E +
Sbjct: 427 ETI 429
>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
Length = 499
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 118/419 (28%), Positives = 191/419 (45%), Gaps = 41/419 (9%)
Query: 2 IHRDSILSPYNNPNENPAHRVQRGI-NISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
+HRD++ +N+ R+Q + +IS + L L+ +I+ + + +
Sbjct: 108 LHRDTVR--FNSLTA----RLQLALEDISKSDLKPLETEIKPED-LSTPVTSGTSQGSGE 160
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ + +G P + +DTGS + W+ C PC DC Q IF P+ SS+YA V C S+
Sbjct: 161 YFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPTASSTYAPVTCQSQQ 220
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C + C + +C+Y Y G T +TE ++F N+ V++V GCG
Sbjct: 221 CSSLEMSSCRS--GQCLYQVNYGDGSYTFGDFATESVSFGNSGS----VKNVALGCGHDN 274
Query: 181 NRNFKFSGIFGLGIGRS-SLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
F + G SL +QL +SFSYC+ ++ + L + +
Sbjct: 275 EGLFVGAAGLLGLGGGPLSLTNQLKATSFSYCL--VNRDSAGSSTLDFNSAQLGVDSVTA 332
Query: 239 PL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYE 294
PL + ID YY+ L +SV G+M+ I + F+ D+SG GG+++D GT +T L +AY
Sbjct: 333 PLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIVDCGTAITRLQTQAYN 392
Query: 295 ALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG-----FPTVRFHFRGGAKLALEKD 349
LRD + + ++ S CY DL G PTV FHF G L
Sbjct: 393 PLRDAFVRMTQNLKLTSAVALFDTCY------DLSGQASVRVPTVSFHFADGKSWNLPA- 445
Query: 350 SMFYQPRPDA--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
+ + P A +C A P + + S+IG + QQ V +D+ + F C+
Sbjct: 446 ANYLIPVDSAGTYCFAFAPTT-----SSLSIIGNVQQQGTRVTFDLANNRMGFSPNKCQ 499
>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 396
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 103/360 (28%), Positives = 162/360 (45%), Gaps = 36/360 (10%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
NS++ + + +G PP +DTGS + W C PC C Q IF PS+SS++ CD
Sbjct: 62 NSVYLMKLQVGTPPFEIQAIIDTGSEITWTQCLPCVHCYEQNAPIFDPSKSSTFKEKRCD 121
Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
C Y + H Y +G ++TE +T +T + + + GCG
Sbjct: 122 GHSCPY----EVDYFDHT------YTMGT-----LATETITLHSTSGEPFVMPETIIGCG 166
Query: 178 FSTNRNFK--FSGIFGLGIGRSSLVSQLNSSF----SYCIG--SLHDPDYLHNKLILGDG 229
+ N FK FSG+ GL G SSL++Q+ + SYC ++ N ++ GDG
Sbjct: 167 HN-NSWFKPSFSGMVGLNWGPSSLITQMGGEYPGLMSYCFSGQGTSKINFGANAIVAGDG 225
Query: 230 AIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLV 289
+ T + G YY+ L+A+SV ++ F + G ++IDSGT +T+
Sbjct: 226 VVSTTMFMTTAK--PGFYYLNLDAVSVGNTRIETMGTTFHALE--GNIVIDSGTTLTYFP 281
Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKD 349
+R V + + + D LCY+ S + FP + HF GG L L+K
Sbjct: 282 VSYCNLVRQAVEHVVTAVRAADPTGNDMLCYN---SDTIDIFPVITMHFSGGVDLVLDKY 338
Query: 350 SMFYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
+M+ + FC+A+ I N ++ G AQ + VGYD +F +C L
Sbjct: 339 NMYMESNNGGVFCLAI----ICNSPTQEAIFGNRAQNNFLVGYDSSSLLVSFSPTNCSAL 394
>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 504
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 110/366 (30%), Positives = 158/366 (43%), Gaps = 21/366 (5%)
Query: 47 QAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPS 106
Q ++ + + ++ + +G P + +DTGS + WV C PC DC Q +F PS
Sbjct: 153 QGPVVSGVGLGSGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPS 212
Query: 107 RSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDEST 166
S+SYA V CD+ C A C C+Y Y G T +TE LT + +
Sbjct: 213 LSTSYASVACDNPRCHDLDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTLGD----S 268
Query: 167 IHVQDVVFGCGFSTNRNFKFSGIFGLGIGRS-SLVSQLN-SSFSYCIGSLHDPDYLHNKL 224
V V GCG F + G S SQ++ ++FSYC+ P + L
Sbjct: 269 APVSSVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISATTFSYCLVDRDSPS--SSTL 326
Query: 225 ILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMID 280
GD A D PL YY+ L +SV G++L I P+ F D +G GGV++D
Sbjct: 327 QFGDAA--DAEVTAPLIRSPRTSTFYYVGLSGLSVGGQILSIPPSAFAMDSTGAGGVIVD 384
Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRG 340
SGT VT L AY ALRD + + S CY + ++ P V F G
Sbjct: 385 SGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVE-VPAVSLRFAG 443
Query: 341 GAKLAL-EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKT 399
G +L L K+ + +C+A P + S+IG + QQ V +D +
Sbjct: 444 GGELRLPAKNYLIPVDGAGTYCLAFAPTN-----AAVSIIGNVQQQGTRVSFDTAKSTVG 498
Query: 400 FQRMDC 405
F C
Sbjct: 499 FTTNKC 504
>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
Short=AtASPG1; Flags: Precursor
gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 500
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 111/346 (32%), Positives = 167/346 (48%), Gaps = 28/346 (8%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ I +G P + +DTGS + W+ C PC DC Q +F P+ SS+Y + C +
Sbjct: 162 YFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAPQ 221
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C + C + ++C+Y Y G T ++T+ +TF N+ + + +V GCG
Sbjct: 222 CSLLETSACRS--NKCLYQVSYGDGSFTVGELATDTVTFGNSGK----INNVALGCGHDN 275
Query: 181 NRNFKFSGIFGLGIGRS-SLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
F + G S+ +Q+ +SFSYC L D D + + + + GDAT
Sbjct: 276 EGLFTGAAGLLGLGGGVLSITNQMKATSFSYC---LVDRDSGKSSSLDFNSVQLGGGDAT 332
Query: 239 -PL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAY 293
PL + ID YY+ L SV G + + IF D SG GGV++D GT VT L +AY
Sbjct: 333 APLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAY 392
Query: 294 EALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL-EKD 349
+LRD ++ + L+ + S S D CY S +K PTV FHF GG L L K+
Sbjct: 393 NSLRDAFLKLTVNLK-KGSSSISLFDT-CYDFSSLSTVK-VPTVAFHFTGGKSLDLPAKN 449
Query: 350 SMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGR 395
+ FC A P S + S+IG + QQ + YD+ +
Sbjct: 450 YLIPVDDSGTFCFAFAPTS-----SSLSIIGNVQQQGTRITYDLSK 490
>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 114/352 (32%), Positives = 160/352 (45%), Gaps = 23/352 (6%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ + IG P + +DTGS + W+ C PC DC Q IF PS SSSY + CD+
Sbjct: 151 YFTRVGIGNPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSSSYEPLSCDTPQ 210
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C + C C+Y Y G T +TE LT +T VQ+V GCG S
Sbjct: 211 CNALEVSECR--NATCLYEVSYGDGSYTVGDFATETLTIGST-----LVQNVAVGCGHSN 263
Query: 181 NRNFKFSGIFGLGIGRSSLV-SQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
F + G + SQLN +SFSYC L D D + ++ +
Sbjct: 264 EGLFVGAAGLLGLGGGLLALPSQLNTTSFSYC---LVDRDSDSASTVEFGTSLPPDAVVA 320
Query: 239 PL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYE 294
PL +D YY+ L ISV G +L I + F+ D+SG GG++IDSGT VT L Y
Sbjct: 321 PLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQTGIYN 380
Query: 295 ALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL-EKDSMFY 353
+LRD + + + CY+ + ++ PTV FHF GG LAL K+ M
Sbjct: 381 SLRDSFLKGTSDLEKAAGVAMFDTCYNLSAKTTIE-VPTVAFHFPGGKMLALPAKNYMIP 439
Query: 354 QPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
FC+A P + + ++IG + QQ V +D+ F C
Sbjct: 440 VDSVGTFCLAFAPTA-----SSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 486
>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
Length = 357
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 114/357 (31%), Positives = 163/357 (45%), Gaps = 25/357 (7%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++V + IG P Q+ MDTGS + W+ C PC+ C Q +F P SSS+ + C +
Sbjct: 14 YFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRASSSFRRLSCSTPQ 73
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C+ C++ +RC+Y Y G T ++++ + S VVFGCG
Sbjct: 74 CKLLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSFSVSRGRTS-----PVVFGCGHDN 128
Query: 181 NRNF-KFSGIFGLGIGRSSLVSQLNS-SFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
F +G+ GLG G+ S SQL+S FSYC+ S + + L+ GD A+
Sbjct: 129 EGLFVGAAGLLGLGAGKLSFPSQLSSRKFSYCLVSRDNGVRASSALLFGDSALPTSASFA 188
Query: 239 PLQF-----IDGHYYITLEAISVDGRMLDINPNIFKRDDSG--GGVMIDSGTDVTWLVKE 291
Q +D YY L IS+ G +L I FK S GGV+IDSGT VT L
Sbjct: 189 YTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVIIDSGTSVTRLPTY 248
Query: 292 AYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS 350
AY +RD + + +S D CY + + PTV FHF GGA + L S
Sbjct: 249 AYTVMRDAFRSATQKLPRAADFSLFDT-CYDFSALTSVT-IPTVSFHFEGGASVQLPP-S 305
Query: 351 MFYQP--RPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ P FC A + S+ + S+IG + QQ V D+ + F C
Sbjct: 306 NYLVPVDTSGTFCFAFSKTSL-----DLSIIGNIQQQTMRVAIDLDSSRVGFAPRQC 357
>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 108/361 (29%), Positives = 163/361 (45%), Gaps = 30/361 (8%)
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYV 114
+S + V + +G P DTGS WV C PC C Q +F P++SS+YA V
Sbjct: 158 VSTGNYVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKEPLFDPAKSSTYANV 217
Query: 115 PCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
C C C+ C+Y Y G T F + + LT + ++ F
Sbjct: 218 SCTDSACADLDTNGCTG--GHCLYAVQYGDGSYTVGFFAQDTLTIAHD-----AIKGFRF 270
Query: 175 GCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDG 229
GCG N F K +G+ GLG G++SL Q +F+YC+ +L L G G
Sbjct: 271 GCGEKNNGLFGKTAGLMGLGRGKTSLTVQAYNKYGGAFAYCLPALTTGT---GYLDFGPG 327
Query: 230 AIIDEGDATPLQFIDGH--YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
+ + TP+ G YY+ + I V G+ + + ++F S G ++DSGT +T
Sbjct: 328 SAGNNARLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVF----STAGTLVDSGTVITR 383
Query: 288 LVKEAYEALR---DEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKL 344
L AY AL D+VM+ ++ YS D CY SD++ PTV F+GGA L
Sbjct: 384 LPATAYTALSSAFDKVMLARGYKKAPGYSILDT-CYDFTGLSDVE-LPTVSLVFQGGACL 441
Query: 345 ALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
++ + Y C+A + N + +++G Q+ Y V YD+G+K F
Sbjct: 442 DVDVSGIVYAISEAQVCLAF---ASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGS 498
Query: 405 C 405
C
Sbjct: 499 C 499
>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 418
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 108/376 (28%), Positives = 171/376 (45%), Gaps = 24/376 (6%)
Query: 46 YQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYP 105
+Q+ ++ + + + ++V+ +G PP +D+GS LLWV C PCR C Q ++ P
Sbjct: 49 FQSPVVSGSTLGSGQYFVDFFLGTPPQKFSLIVDSGSDLLWVQCSPCRQCYAQDSPLYVP 108
Query: 106 SRSSSYAYVPCDSEHCRYFPYAR---CS-AYKHRCIYTQLYLIGPETSVFVSTEQLTFKN 161
S SS+++ VPC S C P C Y C Y LY +TS S +++
Sbjct: 109 SNSSTFSPVPCLSSDCLLIPATEGFPCDFRYPGACAYEYLYA---DTS--SSKGVFAYES 163
Query: 162 TDESTIHVQDVVFGCGFSTNRNFKFS-GIFGLGIGRSSLVSQL----NSSFSYCIGSLHD 216
+ + V FGCG +F + G+ GLG G S SQ+ + F+YC+ + D
Sbjct: 164 ATVDGVRIDKVAFGCGSDNQGSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYLD 223
Query: 217 PDYLHNKLILGDGAI--IDEGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRD 271
P + + LI GD I I + TP+ YY+ +E ++V G+ L I+ + ++ D
Sbjct: 224 PTSVSSSLIFGDELISTIHDMQYTPIVSNPKSPTLYYVQIEKVTVGGKSLPISDSAWEID 283
Query: 272 DSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG 330
G GG + DSGT +T+ AY + + + S D LC + D
Sbjct: 284 LLGNGGSIFDSGTTLTYWFPSAYSHILAAFDSGVHYPRAESVQGLD-LCVE-LTGVDQPS 341
Query: 331 FPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
FP+ F GA E ++ F P+ C+A+ A + + F+ IG + QQ + V
Sbjct: 342 FPSFTIEFDDGAVFQPEAENYFVDVAPNVRCLAM--AGLASPLGGFNTIGNLLQQNFFVQ 399
Query: 391 YDIGRKQKTFQRMDCE 406
YD F C
Sbjct: 400 YDREENLIGFAPAKCS 415
>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
Length = 500
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 111/346 (32%), Positives = 167/346 (48%), Gaps = 28/346 (8%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ I +G P + +DTGS + W+ C PC DC Q +F P+ SS+Y + C +
Sbjct: 162 YFSRIGVGTPAKDMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAPQ 221
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C + C + ++C+Y Y G T ++T+ +TF N+ + + +V GCG
Sbjct: 222 CSLLETSACRS--NKCLYQVSYGDGSFTVGELATDTVTFGNSGK----INNVALGCGHDN 275
Query: 181 NRNFKFSGIFGLGIGRS-SLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
F + G S+ +Q+ +SFSYC L D D + + + + GDAT
Sbjct: 276 EGLFTGAAGLLGLGGGVLSITNQMKATSFSYC---LVDRDSGKSSSLDFNSVQLGGGDAT 332
Query: 239 -PL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAY 293
PL + ID YY+ L SV G + + IF D SG GGV++D GT VT L +AY
Sbjct: 333 APLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAY 392
Query: 294 EALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL-EKD 349
+LRD ++ + L+ + S S D CY S +K PTV FHF GG L L K+
Sbjct: 393 NSLRDAFLKLTVNLK-KGSSSISLFDT-CYDFSSLSTVK-VPTVAFHFTGGKSLDLPAKN 449
Query: 350 SMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGR 395
+ FC A P S + S+IG + QQ + YD+ +
Sbjct: 450 YLIPVDDSGTFCFAFAPTS-----SSLSIIGNVQQQGTRITYDLSK 490
>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 481
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 121/418 (28%), Positives = 184/418 (44%), Gaps = 50/418 (11%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNT--------YQAQILP 52
L+HRD I + +N + + +H I R+A L ++ + + A+++
Sbjct: 75 LVHRDKI-TAFNKSSYDHSHNFHARIQRDKKRVATLIRRLSPRDATSSYSVEEFGAEVVS 133
Query: 53 SNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYA 112
+ + +++ I +G PP Q+ +D+GS ++WV C PC C Q +F P+ S+S+
Sbjct: 134 GMNQGSGEYFIRIGVGSPPREQYVVIDSGSDIVWVQCQPCTQCYHQTDPVFDPADSASFM 193
Query: 113 YVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
VPC S C A C A C Y +Y G T ++ E LTF T V++V
Sbjct: 194 GVPCSSSVCERIENAGCHA--GGCRYEVMYGDGSYTKGTLALETLTFGRT-----VVRNV 246
Query: 173 VFGCGFSTNRNFKFSGIFGLGIGRS-SLVSQL----NSSFSYCIGSLHDPDYLHNKLILG 227
GCG F + G S SLV QL +FSYC+ S L G
Sbjct: 247 AIGCGHRNRGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTDS--AGSLEFG 304
Query: 228 DGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGT 283
GA+ PL YYI L + V G + I+ ++F+ ++ G GGV++D+GT
Sbjct: 305 RGAMPVGAAWIPLIRNPRAPSFYYIRLSGVGVGGMKVPISEDVFQLNEMGNGGVVMDTGT 364
Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----PTVRFHF 338
VT + AY A RD + + S CY +L GF PTV F+F
Sbjct: 365 AVTRIPTVAYVAFRDAFIGQTGNLPRASGVSIFDTCY------NLNGFVSVRVPTVSFYF 418
Query: 339 RGGAKLALEKDSMFYQPRPDA----FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
GG L L + F P D F A +P+ + S+IG + Q+ + +D
Sbjct: 419 AGGPILTLPARN-FLIPVDDVGTFCFAFAASPSGL-------SIIGNIQQEGIQISFD 468
>gi|242069057|ref|XP_002449805.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
gi|241935648|gb|EES08793.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
Length = 430
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 109/366 (29%), Positives = 167/366 (45%), Gaps = 36/366 (9%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ + ++IG PPVP DTGS L W C PC+ C Q I+ + SSS++ +PC S
Sbjct: 83 YLMELAIGTPPVPFIALADTGSDLTWTQCKPCKLCFGQDTPIYDTTTSSSFSPLPCSSAT 142
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF-S 179
C +RCS C Y Y G + + + I V + FGCG +
Sbjct: 143 CLPIWSSRCSTPSATCRYRYAYDDGA-------------YSPECAGISVGGIAFGCGVDN 189
Query: 180 TNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCI-----GSLHDPDYLHNKLILGDGAIID 233
++ +G GLG G SLV+QL FSYC+ SL P + + L +
Sbjct: 190 GGLSYNSTGTVGLGRGSLSLVAQLGVGKFSYCLTDFFNTSLSSPVFFGSLAELAASSASA 249
Query: 234 EG---DATPL---QFIDGHYYITLEAISVDGRMLDINPNIF--KRDDSGGGVMIDSGTDV 285
+ +TPL + YY++LE IS+ L I F DD GG+++DSGT
Sbjct: 250 DAAVVQSTPLVQSPYNPSRYYVSLEGISLGDARLPIPNGTFDLNDDDGSGGMIVDSGTIF 309
Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS--DLKGFPTVRFHFRGGAK 343
T LV+ + + D V L G+ + + S D+ C+ + +L P + HF GGA
Sbjct: 310 TILVETGFRVVVDHVAGVL-GQPVVNASSLDRPCFPAPAAGVQELPDMPDMVLHFAGGAD 368
Query: 344 LALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
+ L +D+ M + +FC+ + + + S++G QQ + +DI Q +F
Sbjct: 369 MRLHRDNYMSFNEEESSFCLNI----VGTESASGSVLGNFQQQNIQMLFDITVGQLSFMP 424
Query: 403 MDCEVL 408
DC L
Sbjct: 425 TDCSKL 430
>gi|326529233|dbj|BAK01010.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 441
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 136/428 (31%), Positives = 188/428 (43%), Gaps = 70/428 (16%)
Query: 20 HRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL------FYVNISIGQPPVP 73
RV+R + +S RLAY Q+ Q Q+ S D+S + + IG PP
Sbjct: 45 ERVRRAVAVSRERLAYTQQ--------QQQLRASGDVSAPVHLATRQYIAEYLIGDPPQR 96
Query: 74 QFTAMDTGSSLLWVHC-YPC--RDCSPQLGTIFYPSRSSSYAYVPC-DSEHCRYFPYARC 129
+DTGS+L+W C C + C+ Q + SRSS++A VPC DS
Sbjct: 97 AAALIDTGSNLIWTQCGTTCGLKACAKQDLPYYNLSRSSTFAAVPCADSAKLCAANGVHL 156
Query: 130 SAYKHRCIYTQLYLIGPETSVF--VSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFK-- 185
C + Y G SVF + TE TF++ + FGC S R K
Sbjct: 157 CGLDGSCTFAASYGAG---SVFGSLGTEAFTFQS------GAAKLGFGC-VSLTRITKGA 206
Query: 186 ---FSGIFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNKLILGDGAIIDEGDA--TP 239
SG+ GLG GR SLVSQ ++ FSYC+ + L +G A + G T
Sbjct: 207 LNGASGLIGLGRGRLSLVSQTGATKFSYCLTPYLRNHGASSHLFVGASASLSGGGGAVTS 266
Query: 240 LQFIDG--------HYYITLEAISVDGRMLDINPNIF--KRDDSG---GGVMIDSGTDVT 286
+ F+ YY+ L ISV L I F +R +G GGV+ID+G+ VT
Sbjct: 267 IPFVKSPEDYPYSTFYYLPLVGISVGETKLPIPSAAFELRRVAAGYWSGGVIIDTGSPVT 326
Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDK-----LCYHGIMSSDL-KGFPTVRFHFRG 340
L + AY AL DEV +L RS P LC + D+ K P + FHF G
Sbjct: 327 SLAEAAYSALSDEVARQLN----RSLVQPPADTGLDLC---VARQDVDKVVPVLVFHFGG 379
Query: 341 GAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
GA +A+ S + CM + ++IG QQ ++ YDIG+ + +F
Sbjct: 380 GADMAVSAGSYWGPVDKSTACMLIEEGGYE------TVIGNFQQQDVHLLYDIGKGELSF 433
Query: 401 QRMDCEVL 408
Q DC VL
Sbjct: 434 QTADCSVL 441
>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
Length = 484
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 116/388 (29%), Positives = 171/388 (44%), Gaps = 42/388 (10%)
Query: 41 RSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG 100
RS + ++ + +++ + +G P + +DTGS ++W+ C PC+ C Q
Sbjct: 116 RSAGGFSGVVISGLSQGSGEYFMRLGVGTPATNMYMVLDTGSDVVWLQCSPCKVCYNQSD 175
Query: 101 TIFYPSRSSSYAYVPCDSEHCRYF-PYARCSAYKHR-CIYTQLYLIGPETSVFVSTEQLT 158
+F P++S ++A VPC S CR + C + + + C+Y Y G T STE LT
Sbjct: 176 PVFNPAKSKTFATVPCGSRLCRRLDDSSECVSRRSKACLYQVSYGDGSFTVGDFSTETLT 235
Query: 159 FKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGR-----SSLVSQLNSSFSYCI-- 211
F V V GCG F + S ++ N FSYC+
Sbjct: 236 FHGA-----RVDHVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVD 290
Query: 212 -GSLHDPDYLHNKLILGDGAIIDEGDATPLQF---IDGHYYITLEAISVDG-RMLDINPN 266
S + ++ G+GA+ TPL +D YY+ L ISV G R+ ++ +
Sbjct: 291 RTSSGSSSKPPSTIVFGNGAVPKTAVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSES 350
Query: 267 IFKRDDSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMR---SYSWPDKLCYHG 322
FK D +G GGV+IDSGT VT L + AY ALRD RL +++ SYS D C+
Sbjct: 351 QFKLDATGNGGVIIDSGTSVTRLTQSAYVALRDA--FRLGATRLKRAPSYSLFDT-CF-- 405
Query: 323 IMSSDLKGF-----PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFS 377
DL G PTV FHF GG + + FC A + S
Sbjct: 406 ----DLSGMTTVKVPTVVFHFTGGEVSLPASNYLIPVNNQGRFCFA-----FAGTMGSLS 456
Query: 378 LIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+IG + QQ + V YD+ + F C
Sbjct: 457 IIGNIQQQGFRVAYDLVGSRVGFLSRAC 484
>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
Length = 452
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 111/434 (25%), Positives = 184/434 (42%), Gaps = 45/434 (10%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
++HRD++ P P R A+L L + + ++ ++ +
Sbjct: 34 VVHRDAVFPPRRG--APPGSFRCRHAAPHTAQLESLHSATAAADLLRSPVMSGVPFDSGE 91
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ I +G PP +DTGS L+W+ C PCR C Q+ ++ P S ++ +PC S
Sbjct: 92 YFAVIGVGDPPTHALVVIDTGSDLIWLQCLPCRRCYRQVTPLYDPRNSKTHRRIPCASPQ 151
Query: 121 CR-YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
CR Y C A C+Y +Y G +S ++T+ L D++ +H +V GCG
Sbjct: 152 CRGVLRYPGCDARTGGCVYMVVYGDGSASSGDLATDTLVLP--DDTRVH--NVTLGCGHD 207
Query: 180 TNRNF-KFSGIFGLGIGRSSLVSQLNSS----FSYCIGS-LHDPDYLHNKLILGDGAIID 233
+G+ G G G+ S +QL + FSYC+G + + L+ G +
Sbjct: 208 NEGLLASAAGLLGAGRGQLSFPTQLAPAYGHVFSYCLGDRMSRARNSSSYLVFGRTPELP 267
Query: 234 EGDATPLQFIDGH---YYITLEAISVDGRM--------LDINPNIFKRDDSGGGVMIDSG 282
TPL+ YY+ + SV G L +NP + GGV++DSG
Sbjct: 268 STAFTPLRTNPRRPSLYYVDMVGFSVGGERVAGFSNASLALNPATGR-----GGVVVDSG 322
Query: 283 TDVTWLVKEAYEALRDEVMIRLEGEQMR----SYSWPDKLCY--HGIMSSDLKGFPTVRF 336
T ++ ++AY A+RD + MR +S D CY HG P++
Sbjct: 323 TAISRFTRDAYAAVRDAFVSHAAAAGMRRLRNKFSVFDT-CYDVHGNGPGTGVRVPSIVL 381
Query: 337 HFRGGAKLALEKDSMFYQ----PRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
HF A +AL + + R FC+ + A +++G + QQ + V +D
Sbjct: 382 HFAAAADMALPQANYLIPVVGGDRRTYFCLGLQAADD-----GLNVLGNVQQQGFGVVFD 436
Query: 393 IGRKQKTFQRMDCE 406
+ R + F C
Sbjct: 437 VERGRIGFTPNGCS 450
>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 461
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 109/363 (30%), Positives = 159/363 (43%), Gaps = 38/363 (10%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ I +G P + +DTGS ++W+ C PCR C Q +F P++S +YA +PC +
Sbjct: 118 YFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQTDHVFDPTKSRTYAGIPCGAPL 177
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
CR CS C Y Y G T STE LTF+ V V GCG
Sbjct: 178 CRRLDSPGCSNKNKVCQYQVSYGDGSFTFGDFSTETLTFRRN-----RVTRVALGCGHDN 232
Query: 181 NRNFKFSGI----------FGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGA 230
F + F + GR + N FSYC+ + +I GD A
Sbjct: 233 EGLFTGAAGLLGLGRGRLSFPVQTGR-----RFNHKFSYCLVD-RSASAKPSSVIFGDSA 286
Query: 231 IIDEGDATPL---QFIDGHYYITLEAISVDGR-MLDINPNIFKRDDSG-GGVMIDSGTDV 285
+ TPL +D YY+ L ISV G + ++ ++F+ D +G GGV+IDSGT V
Sbjct: 287 VSRTAHFTPLIKNPKLDTFYYLELLGISVGGAPVRGLSASLFRLDAAGNGGVIIDSGTSV 346
Query: 286 TWLVKEAYEALRDEVMIRLEGEQMR---SYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGA 342
T L + AY ALRD R+ ++ +S D C+ +++K PTV HFRG
Sbjct: 347 TRLTRPAYIALRDA--FRIGASHLKRAPEFSLFDT-CFDLSGLTEVK-VPTVVLHFRGAD 402
Query: 343 KLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
+ + +FC A S+IG + QQ + + YD+ + F
Sbjct: 403 VSLPATNYLIPVDNSGSFCFA-----FAGTMSGLSIIGNIQQQGFRISYDLTGSRVGFAP 457
Query: 403 MDC 405
C
Sbjct: 458 RGC 460
>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 406
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 120/402 (29%), Positives = 187/402 (46%), Gaps = 29/402 (7%)
Query: 21 RVQRGIN-ISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMD 79
R+ + +N ++ +R Q K+ S + +QA ++ + + +++ IS+G PP + MD
Sbjct: 18 RINQTVNGLTRSRSRDRQTKVPSQD-FQAPVVSGLSLGSGEYFIRISVGTPPRRMYLVMD 76
Query: 80 TGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYT 139
TGS +LW+ C PC +C Q IF P +SS+Y+ + C + C C A ++C+Y
Sbjct: 77 TGSDILWLQCAPCVNCYHQSDAIFDPYKSSTYSTLGCSTRQCLNLDIGTCQA--NKCLYQ 134
Query: 140 QLYLIGPETSVFVSTEQLTFKNTDE-STIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRS 197
Y G T+ T+ ++ +T + + + GCG F +G+ GLG G
Sbjct: 135 VDYGDGSFTTGEFGTDDVSLNSTSGVGQVVLNKIPLGCGHDNEGYFVGAAGLLGLGKGPL 194
Query: 198 SLVSQLN----SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDA-TPLQF---IDGHYYI 249
S +Q++ FSYC+ + L+ G+ A+ G TP + YY+
Sbjct: 195 SFPNQVDPQNGGRFSYCLTDRETDSTEGSSLVFGEAAVPPAGARFTPQDSNMRVPTFYYL 254
Query: 250 TLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEG-E 307
+ ISV G +L I + F+ D G GGV+IDSGT VT L AY +LRD
Sbjct: 255 KMTGISVGGTILTIPTSAFQLDSLGNGGVIIDSGTSVTRLQNAAYASLRDAFRAGTSDLA 314
Query: 308 QMRSYSWPDKLCY--HGIMSSDLKGFPTVRFHFRGGAKLALEKDS-MFYQPRPDAFCMAV 364
+S D CY G+ S D+ PTV HF+GG L L + + + FC+A
Sbjct: 315 PTAGFSLFDT-CYDLSGLASVDV---PTVTLHFQGGTDLKLPASNYLIPVDNSNTFCLAF 370
Query: 365 NPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
+ S+IG + QQ + V YD Q F C
Sbjct: 371 AGTT------GPSIIGNIQQQGFRVIYDNLHNQVGFVPSQCN 406
>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
Length = 357
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 114/357 (31%), Positives = 162/357 (45%), Gaps = 25/357 (7%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++V + IG P Q+ MDTGS + W+ C PC+ C Q +F P SSS+ + C +
Sbjct: 14 YFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRASSSFRRLSCSTPQ 73
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C+ C++ +RC+Y Y G T ++++ S VVFGCG
Sbjct: 74 CKLLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSFLVSRGRTS-----PVVFGCGHDN 128
Query: 181 NRNF-KFSGIFGLGIGRSSLVSQLNS-SFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
F +G+ GLG G+ S SQL+S FSYC+ S + + L+ GD A+
Sbjct: 129 EGLFVGAAGLLGLGAGKLSFPSQLSSRKFSYCLVSRDNGVRASSALLFGDSALPTSASFA 188
Query: 239 PLQF-----IDGHYYITLEAISVDGRMLDINPNIFKRDDSG--GGVMIDSGTDVTWLVKE 291
Q +D YY L IS+ G +L I FK S GGV+IDSGT VT L
Sbjct: 189 YTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVIIDSGTSVTRLPTY 248
Query: 292 AYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS 350
AY +RD + + +S D CY + + PTV FHF GGA + L S
Sbjct: 249 AYTVMRDAFRSATQKLPRAADFSLFDT-CYDFSALTSVT-IPTVSFHFEGGASVQLPP-S 305
Query: 351 MFYQP--RPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ P FC A + S+ + S+IG + QQ V D+ + F C
Sbjct: 306 NYLVPVDTSGTFCFAFSKTSL-----DLSIIGNIQQQTMRVAIDLDSSRVGFAPRQC 357
>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 472
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 107/361 (29%), Positives = 159/361 (44%), Gaps = 34/361 (9%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ I +G P + +DTGS ++W+ C PCR C Q +F P++S +YA +PC +
Sbjct: 129 YFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQADPVFDPTKSRTYAGIPCGAPL 188
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
CR C+ C Y Y G T STE LTF+ T V V GCG
Sbjct: 189 CRRLDSPGCNNKNKVCQYQVSYGDGSFTFGDFSTETLTFRRT-----RVTRVALGCGHDN 243
Query: 181 NRNF----------KFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGA 230
F + F + GR + N FSYC+ + ++ GD A
Sbjct: 244 EGLFIGAAGLLGLGRGRLSFPVQTGR-----RFNQKFSYCLVD-RSASAKPSSVVFGDSA 297
Query: 231 IIDEGDATPL---QFIDGHYYITLEAISVDGR-MLDINPNIFKRDDSG-GGVMIDSGTDV 285
+ TPL +D YY+ L ISV G + ++ ++F+ D +G GGV+IDSGT V
Sbjct: 298 VSRTARFTPLIKNPKLDTFYYLELLGISVGGSPVRGLSASLFRLDAAGNGGVIIDSGTSV 357
Query: 286 TWLVKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKL 344
T L + AY ALRD + ++ +S D C+ +++K PTV HFRG
Sbjct: 358 TRLTRPAYIALRDAFRVGASHLKRAAEFSLFDT-CFDLSGLTEVK-VPTVVLHFRGADVS 415
Query: 345 ALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
+ + +FC A S+IG + QQ + V +D+ + F
Sbjct: 416 LPATNYLIPVDNSGSFCFA-----FAGTMSGLSIIGNIQQQGFRVSFDLAGSRVGFAPRG 470
Query: 405 C 405
C
Sbjct: 471 C 471
>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 135 bits (339), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 109/358 (30%), Positives = 158/358 (44%), Gaps = 28/358 (7%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ + +G PP + +DTGS ++W+ C PC C Q IF PS+S S+A +PC S
Sbjct: 130 YFTRLGVGTPPKYLYMVLDTGSDVVWLQCKPCTKCYSQTDQIFDPSKSKSFAGIPCYSPL 189
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
CR CS + C Y Y G T STE LTF+ V V GCG
Sbjct: 190 CRRLDSPGCSLKNNLCQYQVSYGDGSFTFGDFSTETLTFRRA-----AVPRVAIGCGHDN 244
Query: 181 NRNFKFSGIFGLGIGR-----SSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
F + + ++ N+ FSYC+ + ++ GD A+
Sbjct: 245 EGLFVGAAGLLGLGRGGLSFPTQTGTRFNNKFSYCLTD-RTASAKPSSIVFGDSAVSRTA 303
Query: 236 DATPL---QFIDGHYYITLEAISVDGR-MLDINPNIFKRDDSG-GGVMIDSGTDVTWLVK 290
TPL +D YY+ L ISV G + I+ + F+ D +G GGV+IDSGT VT L +
Sbjct: 304 RFTPLVKNPKLDTFYYVELLGISVGGAPVRGISASFFRLDSTGNGGVIIDSGTSVTRLTR 363
Query: 291 EAYEALRDEVMIRLEGEQMR---SYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALE 347
AY +LRD R+ ++ +S D CY S++K PTV HFRG
Sbjct: 364 PAYVSLRDA--FRVGASHLKRAPEFSLFDT-CYDLSGLSEVK-VPTVVLHFRGADVSLPA 419
Query: 348 KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ + +FC A S+IG + QQ + V +D+ + F C
Sbjct: 420 ANYLVPVDNSGSFCFA-----FAGTMSGLSIIGNIQQQGFRVVFDLAGSRVGFAPRGC 472
>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
Length = 469
Score = 135 bits (339), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 129/437 (29%), Positives = 194/437 (44%), Gaps = 58/437 (13%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDIS--- 57
L+HR +P N P + + S AR Y+ + P +D +
Sbjct: 59 LVHRYGPCAPSQYSNV-PTPSISETLRRSRARTNYIMSQASKSMGMGMASTPDDDDAAVT 117
Query: 58 ---------NSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHCYPCRD--CSPQLGTIFYP 105
+SL YV + G P VPQ MDTGS + WV C PC C PQ +F P
Sbjct: 118 IPTRLGGFVDSLEYVVTLGFGTPSVPQVLLMDTGSDVSWVQCTPCNSTKCYPQKDPLFDP 177
Query: 106 SRSSSYAYVPCDSEHCRYFP---YARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT 162
S+SS+YA + C+++ CR + C++ +C Y+ Y G + S E LT
Sbjct: 178 SKSSTYAPIACNTDACRKLGDHYHNGCTSGGTQCGYSVEYADGSHSRGVYSNETLTLA-- 235
Query: 163 DESTIHVQDVVFGCGFST-NRNFKFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDP 217
I V+D FGCG + K+ G+ GLG SLV Q +S +FSYC+ +L+
Sbjct: 236 --PGITVEDFHFGCGRDQRGPSDKYDGLLGLGGAPVSLVVQTSSVYGGAFSYCLPALNSE 293
Query: 218 DYLHNKLILGDGAIIDEGD--ATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDD 272
L+LG ++ TP++ + G+ Y +T+ ISV G+ L I + F+
Sbjct: 294 AGF---LVLGSPPSGNKSAFVFTPMRHLPGYATFYMVTMTGISVGGKPLHIPQSAFR--- 347
Query: 273 SGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFP 332
GG++IDSGT T L + AY AL + L+ + D CY+ S++ P
Sbjct: 348 --GGMIIDSGTVDTELPETAYNALEAALRKALKAYPLVPSDDFDT-CYNFTGYSNIT-VP 403
Query: 333 TVRFHFRGGAKLALEKDSMFYQPRPDAF----CMAVNPASINNRYVNFSLIGMMAQQFYN 388
V F F GGA + L+ P+ C+A + ++ +IG + Q+
Sbjct: 404 RVAFTFSGGATIDLDV--------PNGILVNDCLAFQESGPDD---GLGIIGNVNQRTLE 452
Query: 389 VGYDIGRKQKTFQRMDC 405
V YD GR F+ C
Sbjct: 453 VLYDAGRGNVGFRAGAC 469
>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
Length = 455
Score = 134 bits (338), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 120/379 (31%), Positives = 174/379 (45%), Gaps = 45/379 (11%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLGTIFYPSRSSSYAYVPCDS 118
+ +NIS+G PP+ +DTGS+L+W C PC C P + P+RSS+++ +PC+
Sbjct: 91 YNMNISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQPARSSTFSRLPCNG 150
Query: 119 EHCRYFPYA---RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
C+Y P + R C Y Y G T+ +++TE LT + V FG
Sbjct: 151 SFCQYLPTSSRPRTCNATAACAYNYTYGSG-YTAGYLATETLTVGDGT-----FPKVAFG 204
Query: 176 CGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
C + N SGI GLG G SLVSQL FSYC+ S D + ++ G A + E
Sbjct: 205 CS-TENGVDNSSGIVGLGRGPLSLVSQLAVGRFSYCLRS-DMADGGASPILFGSLAKLTE 262
Query: 235 GD---ATPL---QFI--DGHYYITLEAISVDGRMLDINPNI--FKRDDSGGGVMIDSGTD 284
+TPL ++ HYY+ L I+VD L + + F + GGG ++DSGT
Sbjct: 263 RSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLGGGTIVDSGTT 322
Query: 285 VTWLVKEAYEALRDEV---MIRLEGEQMRSYSWPD-KLCYHGIMSSDLKG--FPTVRFHF 338
+T+L K+ Y ++ M L S + D LCY K P + F
Sbjct: 323 LTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGGGGKAVRVPRLALRF 382
Query: 339 RGGAK---------LALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNV 389
GGAK +E DS Q R C+ V PA+ + + S+IG + Q ++
Sbjct: 383 AGGAKYNVPVQNYFAGVEADS---QGRVTVACLLVLPATDD---LPISIIGNLMQMDMHL 436
Query: 390 GYDIGRKQKTFQRMDCEVL 408
YDI +F DC L
Sbjct: 437 LYDIDGGMFSFAPADCAKL 455
>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
Length = 441
Score = 134 bits (338), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 141/450 (31%), Positives = 186/450 (41%), Gaps = 74/450 (16%)
Query: 1 LIHRDSILSPYNNPNENP--AHRVQRGINISIARLAYLQEKIRSHNTYQAQI-------- 50
L+HR +P P A R++R AR Y+ K T +
Sbjct: 21 LVHRHGPCAPSAASGGKPSLAERLRR----DRARTNYIVTKATGGRTAATALSDAAGGGT 76
Query: 51 -LPS--NDISNSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHCYPC--RDCSPQLGTIFY 104
+P+ D NSL YV + IG P V Q +DTGS L WV C PC +C Q +F
Sbjct: 77 SIPTFLGDSVNSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKDPLFD 136
Query: 105 PSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCI-----------YTQLYLIGPETSVFVS 153
PS SSSYA VPCDS+ CR AY H C Y Y T+ S
Sbjct: 137 PSSSSSYASVPCDSDACRKL---AAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYS 193
Query: 154 TEQLTFKNTDESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNSS----FS 208
TE LT K + V D FGCG + + KF G+ GLG SLVSQ +S FS
Sbjct: 194 TETLTLK----PGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFS 249
Query: 209 YCIGSLHDPDYLHNKLILGDGAIIDEGDATP-----------LQFIDGHYYITLEAISVD 257
YC+ P L GA + +T L + Y +TL ISV
Sbjct: 250 YCL-----PPTSGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVG 304
Query: 258 GRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK 317
G L I P+ F G++IDSGT +T L AY ALR + ++ S
Sbjct: 305 GAPLAIPPSAFSS-----GMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGV 359
Query: 318 L--CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVN 375
L CY +++ PT+ F GGA + L + C+A A +N
Sbjct: 360 LDTCYDFTGHANVT-VPTISLTFSGGATIDLAAPAGVLVDG----CLAFAGAGTDNA--- 411
Query: 376 FSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+IG + Q+ + V YD G+ F+ C
Sbjct: 412 IGIIGNVNQRTFEVLYDSGKGTVGFRAGAC 441
>gi|326499093|dbj|BAK06037.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 471
Score = 134 bits (338), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 130/436 (29%), Positives = 192/436 (44%), Gaps = 47/436 (10%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
IHRDS SP+++P+ RV S R A L + A S S
Sbjct: 39 FIHRDSARSPFHDPSLTAPARVLEAARRSTVRAAALSRSYVRVDAPSADGFVSELTSTPF 98
Query: 61 FYV-NISIGQPPVPQFTAMDTGSSLLWVHC---------YPCRDCSPQ-LGTIFYPSRSS 109
Y+ ++IG PP DTGS L+W++C RD Q G F PS+S+
Sbjct: 99 EYLMAVNIGTPPTRMVAIADTGSDLIWLNCSYGGDGPGLAAARDADAQPPGVQFDPSKST 158
Query: 110 SYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT-----DE 164
++ V CDS C P A C A +C Y+ Y G TS +STE TF + D
Sbjct: 159 TFRLVDCDSVACSELPEASCGA-DSKCRYSYSYGDGSHTSGVLSTETFTFADAPGARGDG 217
Query: 165 STIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPD 218
+T V +V FGC + + G+ GLG G SLVSQL + FSYC+ P
Sbjct: 218 TTTRVANVNFGCSTTFVGSSVGDGLVGLGGGDLSLVSQLGADTSLGRRFSYCL----VPY 273
Query: 219 YLHNKLILGDG---AIIDEGD-ATPL--QFIDGHYYITLEAISVDGRMLDINPNIFKRDD 272
+ L G A+ D G TPL + +Y + L ++ V + F+ D
Sbjct: 274 SVKASSALNFGPRAAVTDPGAVTTPLIPSQVKAYYIVELRSVKVGNKT-------FEAPD 326
Query: 273 SGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCY--HGIMSSDLKG 330
+++DSGT +T+L + + L E+ R++ +S LC+ G+ +
Sbjct: 327 R-SPLIVDSGTTLTFLPEALVDPLVKELTGRIKLPPAQSPERLLPLCFDVSGVREGQVAA 385
Query: 331 F-PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNV 389
P V GGA + L+ ++ F + + C+AV S + S+IG +AQQ +V
Sbjct: 386 MIPDVTVGLGGGAAVTLKAENTFVEVQEGTLCLAV---SAMSEQFPASIIGNIAQQNMHV 442
Query: 390 GYDIGRKQKTFQRMDC 405
GYD+ + TF C
Sbjct: 443 GYDLDKGTVTFAPAAC 458
>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 470
Score = 134 bits (337), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 105/345 (30%), Positives = 162/345 (46%), Gaps = 31/345 (8%)
Query: 78 MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCS---AYKH 134
+DTGS L WV C PCR C Q G +F PS S SY + C+S C+ C +
Sbjct: 137 VDTGSDLTWVQCEPCRSCYNQNGPLFKPSTSPSYQPILCNSTTCQSLELGACGSDPSTSA 196
Query: 135 RCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF-KFSGIFGLG 193
C Y Y G TS + E+L F I V + VFGCG + F SG+ GLG
Sbjct: 197 TCDYVVNYGDGSYTSGELGIEKLGFGG-----ISVSNFVFGCGRNNKGLFGGASGLMGLG 251
Query: 194 IGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQF------- 242
S++SQ N++ FSYC+ S D L++G+ + + + + TP+ +
Sbjct: 252 RSELSMISQTNATFGGVFSYCLPS-TDQAGASGSLVMGNQSGVFK-NVTPIAYTRMLPNL 309
Query: 243 -IDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVM 301
+ Y + L I V G L + + F GGV++DSGT ++ L Y+AL+ + +
Sbjct: 310 QLSNFYILNLTGIDVGGVSLHVQASSFGN----GGVILDSGTVISRLAPSVYKALKAKFL 365
Query: 302 IRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAF 360
+ G +S D C++ + D PT+ +F G A+L ++ +FY + DA
Sbjct: 366 EQFSGFPSAPGFSILDT-CFN-LTGYDQVNIPTISMYFEGNAELNVDATGIFYLVKEDAS 423
Query: 361 CMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ + AS+++ Y +IG Q+ V YD Q F + C
Sbjct: 424 RVCLALASLSDEY-EMGIIGNYQQRNQRVLYDAKLSQVGFAKEPC 467
>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 448
Score = 134 bits (337), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 107/363 (29%), Positives = 161/363 (44%), Gaps = 31/363 (8%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ ++ +G PP P +DTGS ++W+ C PC C QL ++ P SS+YA PC
Sbjct: 99 YFASVGVGTPPTPALLVIDTGSDVVWLQCKPCVHCYRQLSPLYDPRGSSTYAQTPCSPPQ 158
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
CR C C Y +Y TS ++T++L F N D S V +V GCG
Sbjct: 159 CRN--PQTCDGTTGGCGYRIVYGDASSTSGNLATDRLVFSN-DTS---VGNVTLGCGHDN 212
Query: 181 NRNF-KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
F +G+ G+ G +S +Q+ S F+YC+G + L+ G A
Sbjct: 213 EGLFGSAAGLLGVARGNNSFATQVADSYGRYFAYCLGDRTRSGSSSSYLVFGRTAPEPPS 272
Query: 236 DA-TPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSG---GGVMIDSGTDVTWL 288
TPL+ YY+ + SV G + N D GGV++DSGT +T
Sbjct: 273 SVFTPLRSNPRRPSLYYVDMVGFSVGGEPVTGFSNASLSLDPATGRGGVVVDSGTSITRF 332
Query: 289 VKEAYEALRDEVMIRLEGEQMRSYSWPDKL---CY--HGIMSSDLKGFPTVRFHFRGGAK 343
++AY ALRD R MR + CY G+ +D P V HF GGA
Sbjct: 333 ARDAYGALRDAFDARAAKVGMRKVGRGISVFDACYDLRGVAVADA---PGVVLHFAGGAD 389
Query: 344 LALEKDSMFYQPRPDAF-CMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
+AL ++ + C A+ A + S+IG + QQ + V +D+ ++ F+
Sbjct: 390 VALPPENYLVPEESGRYHCFALEAAG----HDGLSVIGNVLQQRFRVVFDVENERVGFEP 445
Query: 403 MDC 405
C
Sbjct: 446 NGC 448
>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
Length = 472
Score = 134 bits (337), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 111/363 (30%), Positives = 166/363 (45%), Gaps = 22/363 (6%)
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
IS+S + + + G PP +T +DTGS++ W+ C PC CS + F PS+SS+Y Y+
Sbjct: 119 ISSSNYIIKLGFGTPPQSFYTVLDTGSNIAWIPCNPCSGCSSK-QQPFEPSKSSTYNYLT 177
Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
C S+ C+ S C TQ Y E +S+E L+ + V++ VFG
Sbjct: 178 CASQQCQLLRVCTKSDNSVNCSLTQRYGDQSEVDEILSSETLSVGSQ-----QVENFVFG 232
Query: 176 CGFSTNRNFKFS-GIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGA 230
C + + + + G G S VSQ +S+FSYC+ SL + L+LG A
Sbjct: 233 CSNAARGLIQRTPSLVGFGRNPLSFVSQTATLYDSTFSYCLPSLFSSAF-TGSLLLGKEA 291
Query: 231 IIDEG-DATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG-GVMIDSGTDV 285
+ +G TPL YY+ L ISV ++ I D+S G G +IDSGT +
Sbjct: 292 LSAQGLKFTPLLSNSRYPSFYYVGLNGISVGEELVSIPAGTLSLDESTGRGTIIDSGTVI 351
Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLA 345
T LV+ AY A+RD +L M S + CY+ S D++ FP + HF L
Sbjct: 352 TRLVEPAYNAMRDSFRSQLSNLTMASPTDLFDTCYNR-PSGDVE-FPLITLHFDDNLDLT 409
Query: 346 LEKDSMFYQPRPDA--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
L D++ Y D C+A V S G QQ + +D+ +
Sbjct: 410 LPLDNILYPGNDDGSVLCLAFGLPPGGGDDV-LSTFGNYQQQKLRIVHDVAESRLGIASE 468
Query: 404 DCE 406
+C+
Sbjct: 469 NCD 471
>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 521
Score = 134 bits (337), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 142/450 (31%), Positives = 187/450 (41%), Gaps = 74/450 (16%)
Query: 1 LIHRDSILSPYNNPNENP--AHRVQRGINISIARLAYLQEKIRSHNTYQAQI-------- 50
L+HR +P P A R++R AR Y+ K T +
Sbjct: 101 LVHRHGPCAPSAASGGKPSLAERLRR----DRARTNYIVTKATGGRTAATALSDAAGGGT 156
Query: 51 -LPS--NDISNSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHCYPC--RDCSPQLGTIFY 104
+P+ D NSL YV + IG P V Q +DTGS L WV C PC +C Q +F
Sbjct: 157 SIPTFLGDSVNSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKDPLFD 216
Query: 105 PSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCI-----------YTQLYLIGPETSVFVS 153
PS SSSYA VPCDS+ CR AY H C Y Y T+ S
Sbjct: 217 PSSSSSYASVPCDSDACRKL---AAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYS 273
Query: 154 TEQLTFKNTDESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNSS----FS 208
TE LT K + V D FGCG + + KF G+ GLG SLVSQ +S FS
Sbjct: 274 TETLTLK----PGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFS 329
Query: 209 YCIGSLHDPDYLHNKLILGDGAIIDEGDATP-----------LQFIDGHYYITLEAISVD 257
YC+ P L GA + +T L + Y +TL ISV
Sbjct: 330 YCL-----PPTSGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVG 384
Query: 258 GRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK 317
G L I P+ F G++IDSGT +T L AY ALR + ++ S
Sbjct: 385 GAPLAIPPSAFSS-----GMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGV 439
Query: 318 L--CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVN 375
L CY +++ PT+ F GGA + L + D C+A A +N
Sbjct: 440 LDTCYDFTGHANVT-VPTISLTFSGGATIDLAAPAGVLV---DG-CLAFAGAGTDNA--- 491
Query: 376 FSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+IG + Q+ + V YD G+ F+ C
Sbjct: 492 IGIIGNVNQRTFEVLYDSGKGTVGFRAGAC 521
>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 559
Score = 134 bits (337), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 108/369 (29%), Positives = 171/369 (46%), Gaps = 31/369 (8%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+++++ +G PP +DTGS L W+ C PC C Q G + P SSS+ + C
Sbjct: 195 YFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKDSSSFRNISCHDPR 254
Query: 121 CRYF----PYARCSAYKHRCIYTQLYLIGPETS----VFVSTEQLTFKNTDESTIHVQDV 172
C+ P C A C Y Y G T+ + T LT N HV++V
Sbjct: 255 CQLVSSPDPPNPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGKSELKHVENV 314
Query: 173 VFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILG 227
+FGCG F +G+ GLG G S SQ+ S SFSYC+ + + +KLI G
Sbjct: 315 MFGCGHWNRGLFHGAAGLLGLGKGPLSFASQMQSLYGQSFSYCLVDRNSNASVSSKLIFG 374
Query: 228 -DGAIIDEGDATPLQF-------IDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVM 278
D ++ + F +D YY+ + ++ VD +L I + G GG +
Sbjct: 375 EDKELLSHPNLNFTSFGGGKDGSVDTFYYVQINSVMVDDEVLKIPEETWHLSSEGAGGTI 434
Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYH--GIMSSDLKGFPTVRF 336
IDSGT +T+ + AYE +++ + +++G ++ P K CY+ GI +L F +
Sbjct: 435 IDSGTTLTYFAEPAYEIIKEAFVRKIKGYELVEGLPPLKPCYNVSGIEKMELPDFGIL-- 492
Query: 337 HFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
F GA ++ F Q PD C+A+ + N S+IG QQ +++ YD+ +
Sbjct: 493 -FADGAVWNFPVENYFIQIDPDVVCLAI----LGNPRSALSIIGNYQQQNFHILYDMKKS 547
Query: 397 QKTFQRMDC 405
+ + M C
Sbjct: 548 RLGYAPMKC 556
>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 489
Score = 134 bits (337), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 115/409 (28%), Positives = 177/409 (43%), Gaps = 46/409 (11%)
Query: 28 ISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAM-DTGSSLLW 86
IS R ++ +T Q I D S ++V+I IG P +F + DTGS L W
Sbjct: 86 ISSLRHGTRRKAFEVSHTAQIPIHSGADSGQSQYFVSIRIGTPRPQKFILVTDTGSDLTW 145
Query: 87 VHC-YPCRDC---SPQLGTIFYPSRSSSYAYVPCDSEHCR-----YFPYARCSAYKHRCI 137
++C Y C+ C +P G +F + SSS+ +PC S+ C+ YF C C+
Sbjct: 146 MNCEYWCKSCPKPNPHPGRVFRANDSSSFRTIPCSSDDCKIELQDYFSLTECPNPNAPCL 205
Query: 138 YTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF-SGIFGLGIGR 196
+ YL GP + E +T D I + DV+ GC S N F G+ GLG +
Sbjct: 206 FDYRYLNGPRAIGVFANETVTVGLNDHKKIRLFDVLIGCTESFNETNGFPDGVMGLGYRK 265
Query: 197 SSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQ-------FIDG 245
SL +L + FSYC+ N L GD I E +Q +I+
Sbjct: 266 HSLALRLAEIFGNKFSYCLVDHLSSSNHKNFLSFGD---IPEMKLPKMQHTELLLGYINA 322
Query: 246 HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEV----- 300
Y + + ISV G ML I+ +I+ GG+++DSGT +T L EAY+ + D +
Sbjct: 323 FYPVNVSGISVGGSMLSISSDIWNVTGV-GGMIVDSGTSLTMLAGEAYDKVVDALKPIFD 381
Query: 301 ----MIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPR 356
++ +E ++ ++ + DK D P + HF GA S
Sbjct: 382 KHKKVVPIELPELNNFCFEDK-------GFDRAAVPRLLIHFADGAIFKPPVKSYIIDVA 434
Query: 357 PDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
C+ + I + S++G + QQ + YD+GR + F C
Sbjct: 435 EGIKCLGI----IKADFPGSSILGNVMQQNHLWEYDLGRGKLGFGPSSC 479
>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 491
Score = 134 bits (337), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 116/352 (32%), Positives = 162/352 (46%), Gaps = 22/352 (6%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ + IG PP + +DTGS + WV C PC DC Q IF PS SSSYA + C++
Sbjct: 155 YFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPCADCYQQADPIFEPSFSSSYAPLTCETHQ 214
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C+ + C C+Y Y G T +TE +T + + + +V GCG
Sbjct: 215 CKSLDVSECR--NDSCLYEVSYGDGSYTVGDFATETITL----DGSASLNNVAIGCGHDN 268
Query: 181 NRNF-KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
F +G+ GLG G S SQ+N SSFSYC L + D + + I
Sbjct: 269 EGLFVGAAGLLGLGGGSLSFPSQINASSFSYC---LVNRDTDSASTLEFNSPIPSHSVTA 325
Query: 239 PL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYE 294
PL +D YY+ + I V G+ML I + F+ D+SG GG+++DSGT VT L + Y
Sbjct: 326 PLLRNNQLDTFYYLGMTGIGVGGQMLSIPRSSFEVDESGNGGIIVDSGTAVTRLQSDVYN 385
Query: 295 ALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL-EKDSMFY 353
+LRD + + S CY S ++ PTV FHF G LAL K+ +
Sbjct: 386 SLRDSFVRGTQHLPSTSGVALFDTCYDLSSRSSVE-VPTVSFHFPDGKYLALPAKNYLIP 444
Query: 354 QPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
FC A P + S+IG + QQ V YD+ F C
Sbjct: 445 VDSAGTFCFAFAPTT-----SALSIIGNVQQQGTRVSYDLSNSLVGFSPNGC 491
>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
Length = 523
Score = 134 bits (337), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 123/427 (28%), Positives = 182/427 (42%), Gaps = 45/427 (10%)
Query: 1 LIHRDSILSPYNNPNENPAH-----RVQRGINISIARLAYLQEKIRSHNTYQAQILPSND 55
++HR SP P+H R Q ++ SI R+ + + LP++
Sbjct: 121 VVHRHGPCSPLLARGGEPSHAEILDRDQDRVD-SIHRMTAGPWTAGQSSASKGVSLPAHR 179
Query: 56 ---ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYA 112
+ + + V++ +G P DTGS L WV C PC +C Q +F PS+S++Y+
Sbjct: 180 GLRLGTANYIVSVGLGTPRRDLLVVFDTGSDLSWVQCKPCNNCYKQHDPLFDPSQSTTYS 239
Query: 113 YVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
VPC ++ C CS+ K C Y +Y +T ++ + LT S+ +Q
Sbjct: 240 AVPCGAQEC--LDSGTCSSGK--CRYEVVYGDMSQTDGNLARDTLTL---GPSSDQLQGF 292
Query: 173 VFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILG 227
VFGCG F + G+FGLG R SL SQ + FSYC+ S + L LG
Sbjct: 293 VFGCGDDDTGLFGRADGLFGLGRDRVSLASQAAARYGAGFSYCLPSSWRAE---GYLSLG 349
Query: 228 DGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
A T + YY+ L I V GR + + P +FK G +IDSGT
Sbjct: 350 SAAAPPHAQFTAMVTRSDTPSFYYLDLVGIKVAGRTVRVAPAVFKAP----GTVIDSGTV 405
Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL-----CYHGIMSSDLKGFPTVRFHFR 339
+T L AY ALR MR Y L CY + ++ P+V F
Sbjct: 406 ITRLPSRAYSALRSSFA-----GFMRRYKRAPALSILDTCYDFTGRTKVQ-IPSVALLFD 459
Query: 340 GGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKT 399
GGA L L + Y C+A + N + ++G M Q+ + V YD+ ++
Sbjct: 460 GGATLNLGFGGVLYVANRSQACLAF---ASNGDDTSVGILGNMQQKTFAVVYDLANQKIG 516
Query: 400 FQRMDCE 406
F C
Sbjct: 517 FGAKGCS 523
>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
Length = 398
Score = 134 bits (337), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 107/363 (29%), Positives = 158/363 (43%), Gaps = 24/363 (6%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ IS+G P DTGS L+W+ C PC+ C Q IF P SSSY + C
Sbjct: 40 YVTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKDPIFDPEGSSSYTTMSCGDTL 99
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C P CS C Y+ Y G T +S+E +T +T + +++ FGCG
Sbjct: 100 CDSLPRKSCSP---DCDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAAKNIAFGCGHLN 156
Query: 181 NRNFK-FSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
+F SG+ GLG G S VSQL FSYC+ D + + GD +
Sbjct: 157 RGSFNDASGLVGLGRGNLSFVSQLGDLFGHKFSYCLVPWRDAPSKTSPMFFGDESSSHSS 216
Query: 236 DA------TPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDV 285
TP+ ++ YY+ L+ IS+ GR L I F + D GG++ DSGT +
Sbjct: 217 GKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDIKPDGSGGMIFDSGTTL 276
Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCY--HGIMSSDLKGFPTVRFHFRGGAK 343
T L Y+ + + ++ ++ S LCY G +S P + FHF GA
Sbjct: 277 TLLPDAPYQIVLRALRSKISFPKIDGSSAGLDLCYDVSGSKASYKMKIPAMVFHFE-GAD 335
Query: 344 LALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
L ++ F + + S N ++ + G M QQ + V YDIG + +
Sbjct: 336 YQLPVENYFIAANDAGTIVCLAMVSSN---MDIGIYGNMMQQNFRVMYDIGSSKIGWAPS 392
Query: 404 DCE 406
C+
Sbjct: 393 QCD 395
>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 105/357 (29%), Positives = 162/357 (45%), Gaps = 31/357 (8%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ + +G P + +DTGS + W+ C PC DC Q IF P+ SS+YA V C S+
Sbjct: 20 YFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPTASSTYAPVTCQSQQ 79
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C + C + +C+Y Y G T +TE ++F N+ V++V GCG
Sbjct: 80 CSSLEMSSCRS--GQCLYQVNYGDGSYTFGDFATESVSFGNSGS----VKNVALGCGHDN 133
Query: 181 NRNFKFSGIFGLGIGRS-SLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
F + G SL +QL +SFSYC+ ++ + L + +
Sbjct: 134 EGLFVGAAGLLGLGGGPLSLTNQLKATSFSYCL--VNRDSAGSSTLDFNSAQLGVDSVTA 191
Query: 239 PL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYE 294
PL + ID YY+ L +SV G+M+ I + F+ D+SG GG+++D GT +T L +AY
Sbjct: 192 PLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIVDCGTAITRLQTQAYN 251
Query: 295 ALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG-----FPTVRFHFRGGAKLAL-EK 348
LRD + + ++ S CY DL G PTV FHF G L
Sbjct: 252 PLRDAFVRMTQNLKLTSAVALFDTCY------DLSGQASVRVPTVSFHFADGKSWNLPAA 305
Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ + +C A P + + S+IG + QQ V +D+ + F C
Sbjct: 306 NYLIPVDSAGTYCFAFAPTT-----SSLSIIGNVQQQGTRVTFDLANNRMGFSPNKC 357
>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 486
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 111/367 (30%), Positives = 163/367 (44%), Gaps = 23/367 (6%)
Query: 46 YQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYP 105
+++ I+ + ++ + IG+PP P + +DTGS + WV C PC +C Q IF P
Sbjct: 136 FESPIVSGASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPIFEP 195
Query: 106 SRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES 165
+ S+S+ + C++E C+ + C C+Y Y G T TE +T +T
Sbjct: 196 TSSASFTSLSCETEQCKSLDVSEC--RNGTCLYEVSYGDGSYTVGDFVTETVTLGST--- 250
Query: 166 TIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNK 223
+ ++ GCG + F +G+ GLG G S SQLN SSFSYC L D D
Sbjct: 251 --SLGNIAIGCGHNNEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYC---LVDRDSDSTS 305
Query: 224 LILGDGAIIDEGDATPLQF---IDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMI 279
+ + I + PL +D +Y+ L +SV G +L I F+ +D GG+++
Sbjct: 306 TLDFNSPITPDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGIIV 365
Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFR 339
DSGT VT L Y LRD + Q CY + S PTV FHF
Sbjct: 366 DSGTAVTRLQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYD-LSSKSRVEVPTVSFHFA 424
Query: 340 GGAKLAL-EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQK 398
G +L L K+ + FC A P S++G QQ VG+D+
Sbjct: 425 NGNELPLPAKNYLIPVDSEGTFCFAFAPTD-----STLSILGNAQQQGTRVGFDLANSLV 479
Query: 399 TFQRMDC 405
F C
Sbjct: 480 GFSPNKC 486
>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 118/389 (30%), Positives = 172/389 (44%), Gaps = 44/389 (11%)
Query: 41 RSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG 100
RS + ++ + +++ + +G P + +DTGS ++W+ C PC+ C Q
Sbjct: 118 RSAGGFSGAVISGLSQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQSD 177
Query: 101 TIFYPSRSSSYAYVPCDSEHCRYF-PYARCSAYKHR-CIYTQLYLIGPETSVFVSTEQLT 158
IF P +S ++A VPC S CR + C + + C+Y Y G T STE LT
Sbjct: 178 VIFDPKKSKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLT 237
Query: 159 FKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGR-----SSLVSQLNSSFSYCI-- 211
F V V GCG F + S S+ N FSYC+
Sbjct: 238 FHGA-----RVDHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKSRYNGKFSYCLVD 292
Query: 212 -GSLHDPDYLHNKLILGDGAIIDEGDATPLQF---IDGHYYITLEAISVDG-RMLDINPN 266
S + ++ G+ A+ TPL +D YY+ L ISV G R+ ++ +
Sbjct: 293 RTSSGSSSKPPSTIVFGNDAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSES 352
Query: 267 IFKRDDSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMR---SYSWPDKLCYHG 322
FK D +G GGV+IDSGT VT L + AY ALRD RL +++ SYS D C+
Sbjct: 353 QFKLDATGNGGVIIDSGTSVTRLTQSAYVALRDA--FRLGATKLKRAPSYSLFDT-CF-- 407
Query: 323 IMSSDLKGF-----PTVRFHFRGGAKLALEKDSMFYQPRPDA-FCMAVNPASINNRYVNF 376
DL G PTV FHF GG +++L + + FC A +
Sbjct: 408 ----DLSGMTTVKVPTVVFHF-GGGEVSLPASNYLIPVNTEGRFCFA-----FAGTMGSL 457
Query: 377 SLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
S+IG + QQ + V YD+ + F C
Sbjct: 458 SIIGNIQQQGFRVAYDLVGSRVGFLSRAC 486
>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 120/415 (28%), Positives = 181/415 (43%), Gaps = 47/415 (11%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSH-------NTYQAQILPS 53
++HRD + + N +++ HR+ + R+A L ++ S + + ++
Sbjct: 76 VVHRDQL--SFGNSDDH-RHRLDGRLKRDAKRVASLIRRLSSGGGGSYRVDDFGTDVISG 132
Query: 54 NDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAY 113
+ + ++V I +G PP Q+ +D+GS ++WV C PC C Q +F P+ S+S+
Sbjct: 133 MEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADSASFTG 192
Query: 114 VPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
V C S C A C A RC Y Y G T ++ E LTF T V+ V
Sbjct: 193 VSCSSSVCDRLENAGCHA--GRCRYEVSYGDGSYTKGTLALETLTFGRT-----MVRSVA 245
Query: 174 FGCGFSTNRNFKFSGIFGLGIGRS-SLVSQL----NSSFSYCIGSLHDPDYLHNKLILGD 228
GCG F + G S S V QL +FSYC+ S L+ G
Sbjct: 246 IGCGHRNRGMFVGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTDS--SGSLVFGR 303
Query: 229 GAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTD 284
A+ PL YYI L + V G + I+ +F+ + G GGV++D+GT
Sbjct: 304 EALPAGAAWVPLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMDTGTA 363
Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----PTVRFHFR 339
VT L AY+A RD + + + CY DL GF PTV F+F
Sbjct: 364 VTRLPTLAYQAFRDAFLAQTANLPRATGVAIFDTCY------DLLGFVSVRVPTVSFYFS 417
Query: 340 GGAKLALEKDSMFYQPRPDA--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
GG L L + F P DA FC A P++ S++G + Q+ + +D
Sbjct: 418 GGPILTLPARN-FLIPMDDAGTFCFAFAPST-----SGLSILGNIQQEGIQISFD 466
>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 110/354 (31%), Positives = 167/354 (47%), Gaps = 23/354 (6%)
Query: 47 QAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPS 106
Q+ I+ + ++ + IG+PP + +DTGS + WV C PC DC Q IF P+
Sbjct: 135 QSPIISGTSQGSGEYFSRVGIGKPPSQAYLILDTGSDVNWVQCAPCADCYQQADPIFEPA 194
Query: 107 RSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDEST 166
S+S++ + C++ CR + C C+Y Y G T TE +T +
Sbjct: 195 SSASFSTLSCNTRQCRSLDVSEC--RNDTCLYEVSYGDGSYTVGDFVTETITL-----GS 247
Query: 167 IHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKL 224
V +V GCG + F +G+ GLG G S SQ+N +SFSYC L D D
Sbjct: 248 APVDNVAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINATSFSYC---LVDRDSESAST 304
Query: 225 ILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMID 280
+ + + + PL +D YY+ L +SV G ++ I + F+ D+SG GGV++D
Sbjct: 305 LEFNSTLPPNAVSAPLLRNHHLDTFYYVGLTGLSVGGELVSIPESAFQIDESGNGGVIVD 364
Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRG 340
SGT +T L + Y +LRD + R + CY +++ PTV FHF
Sbjct: 365 SGTAITRLQTDVYNSLRDAFVKRTRDLPSTNGIALFDTCYDLSSKGNVE-VPTVSFHFPD 423
Query: 341 GAKLAL-EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
G +L L K+ + FC A P + + S+IG + QQ V YD+
Sbjct: 424 GKELPLPAKNYLVPLDSEGTFCFAFAPTA-----SSLSIIGNVQQQGTRVVYDL 472
>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
Length = 423
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 126/431 (29%), Positives = 187/431 (43%), Gaps = 62/431 (14%)
Query: 2 IHRDSILSPYNNPN-------ENPAHR-------VQRGINISIARL--AYLQEKIRSHNT 45
+HRDS SPY N N HR + I++ +A + + L +++ N
Sbjct: 1 MHRDSADSPYRPANATVHGLVRNRLHRDELRLLSISSRISLGVAGIPKSSLTNPLKNTNP 60
Query: 46 YQAQILPS------NDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQL 99
+ Q + +D S ++V++ +G PP DTGS +LW+ C PC+ C Q
Sbjct: 61 FLQQDFETPLRSGLSDGSGE-YFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCYGQT 119
Query: 100 GTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTF 159
+F PS SS++ + C S C+ C +++C+Y Y G T STE L+F
Sbjct: 120 DPLFNPSFSSTFQSITCGSSLCQQLLIRGC--RRNQCLYQVSYGDGSFTVGEFSTETLSF 177
Query: 160 KNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRS----SLVSQL-NSSFSYCIGSL 214
+ V V GCG + F + S V QL S FSYC+ +
Sbjct: 178 GSN-----AVNSVAIGCGHNNQGLFTGAAGLLGLGKGLLSFPSQVGQLYGSVFSYCLPTR 232
Query: 215 HDPDYLHNKLILGDGAIIDEGDATPLQF---IDGHYYITLEAISVDGRMLDINPNIFKRD 271
+ LI G+ A+ T L +D YY+ + I V G ++I D
Sbjct: 233 ESTGSV--PLIFGNQAVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVNIPAGSLSLD 290
Query: 272 DS--GGGVMIDSGTDVTWLVKEAYEALRDEVMIRL--EGEQMRSYSWPDKLCYHGIMSSD 327
S GGV++DSGT VT LV AY +RD + + + +S D CY D
Sbjct: 291 SSTGNGGVILDSGTAVTRLVTSAYNPMRDAFRAGMPSDAKMTSGFSLFDT-CY------D 343
Query: 328 LKG-----FPTVRFHFRGGAKLAL-EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGM 381
L G P V F F GGA +AL ++ M +C+A P S NFS+IG
Sbjct: 344 LSGRSSIMLPAVSFVFNGGATMALPAQNIMVPVDNSGTYCLAFAPNS-----ENFSIIGN 398
Query: 382 MAQQFYNVGYD 392
+ QQ + + +D
Sbjct: 399 IQQQSFRMSFD 409
>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
Length = 350
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 114/359 (31%), Positives = 158/359 (44%), Gaps = 29/359 (8%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ I IG P Q+ +DTGS ++W+ C PCR+C Q IF PS S S++ V CDS
Sbjct: 8 YFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQADPIFNPSSSVSFSTVGCDSAV 67
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C C + C+Y Y G T +TE LTF T +Q+V GCG
Sbjct: 68 CSQLDANDC--HGGGCLYEVSYGDGSYTVGSYATETLTFGTT-----SIQNVAIGCGHDN 120
Query: 181 NRNFKFSGIFGLGIGRS-----SLVSQLNSSFSYCIGSLHDPDYLHN-KLILGDGAIIDE 234
F + S L +Q +FSYC L D D + L G ++
Sbjct: 121 VGLFVGAAGLLGLGAGSLSFPAQLGTQTGRAFSYC---LVDRDSESSGTLEFGPESVPIG 177
Query: 235 GDATPL---QFIDGHYYITLEAISVDGRMLDINPN-IFKRDDSG--GGVMIDSGTDVTWL 288
TPL F+ YY+++ AISV G +LD P+ F+ D++ GG++IDSGT VT L
Sbjct: 178 SIFTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGIIIDSGTAVTRL 237
Query: 289 VKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL-E 347
AY+ALRD + + CY + + P V FHF GA L
Sbjct: 238 QTSAYDALRDAFIAGTQHLPRADGISIFDTCYD-LSALQSVSIPAVGFHFSNGAGFILPA 296
Query: 348 KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
K+ + FC A PA N S++G + QQ V +D F C+
Sbjct: 297 KNCLIPMDSMGTFCFAFAPAD-----SNLSIMGNIQQQGIRVSFDSANSLVGFAIDQCQ 350
>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
Length = 435
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 109/397 (27%), Positives = 178/397 (44%), Gaps = 38/397 (9%)
Query: 32 RLAYLQEKIRS------HNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLL 85
R+A+L + + +++ Q L N + + +NIS+G P + DTGS L+
Sbjct: 53 RIAFLSDATAAGKATTTNSSVSFQALLENGVGG--YNMNISVGTPLLTFPVVADTGSDLI 110
Query: 86 WVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIG 145
W C PC C Q F P+ SS+++ +PC S C++ P + + C+Y Y G
Sbjct: 111 WTQCAPCTKCFQQPAPPFQPASSSTFSKLPCTSSFCQFLPNSIRTCNATGCVYNYKYGSG 170
Query: 146 PETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN- 204
T+ +++TE T K D S V FGC SGI GLG G SL+ QL
Sbjct: 171 -YTAGYLATE--TLKVGDAS---FPSVAFGCSTENGVGNSTSGIAGLGRGALSLIPQLGV 224
Query: 205 SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFID------GHYYITLEAISVDG 258
FSYC+ S + ++ G A + +G+ F++ +YY+ L I+V
Sbjct: 225 GRFSYCLRSGSAAG--ASPILFGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGE 282
Query: 259 RMLDINPNI--FKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD 316
L + + F ++ GGG ++DSGT +T+L K+ YE ++ + + + +
Sbjct: 283 TDLPVTTSTFGFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTANVTTVNGTRGL 342
Query: 317 KLCYHGIMSSDLKGFPTVRFHFRGGAKLA-------LEKDSMFYQPRPDAFCMAVNPASI 369
LC+ P++ F GGA+ A +E DS Q C+ + PA
Sbjct: 343 DLCFKSTGGGGGIAVPSLVLRFDGGAEYAVPTYFAGVETDS---QGSVTVACLMMLPAKG 399
Query: 370 NNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
+ S+IG + Q ++ YD+ +F DC
Sbjct: 400 DQP---MSVIGNVMQMDMHLLYDLDGGIFSFSPADCA 433
>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
Length = 489
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 109/367 (29%), Positives = 156/367 (42%), Gaps = 47/367 (12%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ + +G PP + +DTGS ++W+ C PCR C Q +F P +S S++ + C S
Sbjct: 147 YFTRLGVGTPPKYVYMVLDTGSDVVWIQCAPCRKCYSQTDPVFDPKKSGSFSSISCRSPL 206
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C C++ + C+Y Y G T STE LTF+ T V V GCG
Sbjct: 207 CLRLDSPGCNS-RQSCLYQVAYGDGSFTFGEFSTETLTFRGT-----RVPKVALGCGHDN 260
Query: 181 NRNF--------------KFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLIL 226
F F GL GR FSYC+ + ++
Sbjct: 261 EGLFVGAAGLLGLGRGRLSFPTQTGLRFGR---------KFSYCLVD-RSASSKPSSVVF 310
Query: 227 GDGAIIDEGDATPL---QFIDGHYYITLEAISVDG-RMLDINPNIFKRDDSG-GGVMIDS 281
G A+ TPL +D YY+ L ISV G R+ I ++FK D +G GGV+IDS
Sbjct: 311 GQSAVSRTAVFTPLITNPKLDTFYYLELTGISVGGARVAGITASLFKLDTAGNGGVIIDS 370
Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMR---SYSWPDKLCYHGIMSSDLKGFPTVRFHF 338
GT VT L + AY +LRD R ++ YS D C+ +++K PTV HF
Sbjct: 371 GTSVTRLTRRAYVSLRDA--FRAGAADLKRAPDYSLFDT-CFDLSGKTEVK-VPTVVMHF 426
Query: 339 RGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQK 398
RG + + FC A S+IG + QQ + V +D+ +
Sbjct: 427 RGADVSLPATNYLIPVDTNGVFCFA-----FAGTMSGLSIIGNIQQQGFRVVFDVAASRI 481
Query: 399 TFQRMDC 405
F C
Sbjct: 482 GFAARGC 488
>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
Length = 423
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 126/431 (29%), Positives = 186/431 (43%), Gaps = 62/431 (14%)
Query: 2 IHRDSILSPYNNPN-------ENPAHR-------VQRGINISIARL--AYLQEKIRSHNT 45
+HRDS SPY N N HR + I++ +A + + L +++ N
Sbjct: 1 MHRDSADSPYRPANATVHGLVRNRLHRDELRLLSISSRISLGVAGIPKSSLTNPLKNTNP 60
Query: 46 YQAQILPS------NDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQL 99
+ Q + +D S ++V++ +G PP DTGS +LW+ C PC+ C Q
Sbjct: 61 FLQQDFETPLRSGLSDGSGE-YFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCYGQT 119
Query: 100 GTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTF 159
+F PS SS++ + C S C+ C +++C+Y Y G T STE L+F
Sbjct: 120 DPLFNPSFSSTFQSITCGSSLCQQLLIRGC--RRNQCLYQVSYGDGSFTVGEFSTETLSF 177
Query: 160 KNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRS----SLVSQL-NSSFSYCIGSL 214
+ V V GCG + F + S V QL S FSYC+ +
Sbjct: 178 GSN-----AVNSVAIGCGHNNQGLFTGAAGLLGLGKGLLSFPSQVGQLYGSVFSYCLPTR 232
Query: 215 HDPDYLHNKLILGDGAIIDEGDATPLQF---IDGHYYITLEAISVDGRMLDINPNIFKRD 271
+ LI G+ A+ T L +D YY+ + I V G + I D
Sbjct: 233 ESTGSV--PLIFGNQAVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVSIPAGSLSLD 290
Query: 272 DS--GGGVMIDSGTDVTWLVKEAYEALRDEVMIRL--EGEQMRSYSWPDKLCYHGIMSSD 327
S GGV++DSGT VT LV AY +RD + + + +S D CY D
Sbjct: 291 SSTGNGGVILDSGTAVTRLVTSAYNPMRDAFRAGMPSDAKMTSGFSLFDT-CY------D 343
Query: 328 LKG-----FPTVRFHFRGGAKLAL-EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGM 381
L G P V F F GGA +AL ++ M +C+A P S NFS+IG
Sbjct: 344 LSGRSSIMLPAVSFVFNGGATMALPAQNIMVPVDNSGTYCLAFAPNS-----ENFSIIGN 398
Query: 382 MAQQFYNVGYD 392
+ QQ + + +D
Sbjct: 399 IQQQSFRMSFD 409
>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 395
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 105/363 (28%), Positives = 164/363 (45%), Gaps = 49/363 (13%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ + + IG PP +DTGS +W C PC C Q IF PS+SS++ +
Sbjct: 65 YLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFKEI------ 118
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
RC + H C Y +Y T + TE +T +T + + + GCG
Sbjct: 119 -------RCDTHDHSCPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVMPETIIGCG-RN 170
Query: 181 NRNFK--FSGIFGLGIGRSSLVSQLNSSF----SYCIG--SLHDPDYLHNKLILGDGAI- 231
N FK F+G+ GL G SL++Q+ + SYC ++ N ++ GDG +
Sbjct: 171 NSGFKPGFAGVVGLDRGPKSLITQMGGEYPGLMSYCFAGKGTSKINFGANAIVAGDGVVS 230
Query: 232 --IDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLV 289
+ A P G YY+ L+A+SV ++ F G ++IDSG+ +T+
Sbjct: 231 TTVFVKTAKP-----GFYYLNLDAVSVGNTRIETVGTPFHALK--GNIVIDSGSTLTYF- 282
Query: 290 KEAYEALRDEVMIRLEGEQMRS---YSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
E+Y L +R EQ+ + + D LCY+ S + FP + HF GGA L L
Sbjct: 283 PESYCNL-----VRKAVEQVVTAVRFPRSDILCYY---SKTIDIFPVITMHFSGGADLVL 334
Query: 347 EKDSMFYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+K +M+ FC+A+ I N + ++ G AQ + VGYD +F+ +C
Sbjct: 335 DKYNMYVASNTGGVFCLAI----ICNSPIEEAIFGNRAQNNFLVGYDSSSLLVSFKPTNC 390
Query: 406 EVL 408
L
Sbjct: 391 SAL 393
>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
Length = 443
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 114/420 (27%), Positives = 183/420 (43%), Gaps = 59/420 (14%)
Query: 22 VQRGINISIARLAYLQEKIRSHNTYQAQILPSNDI---------SNSLFYVNISIGQPPV 72
+ R + S AR+A LQ A + P + I S+ + + + IG P
Sbjct: 50 LSRALRRSSARVATLQSL--------AALAPGDAITAARILVLASDGEYLMEMGIGTPTR 101
Query: 73 PQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAY 132
+DTGS L+W C PC C Q F P+RS++Y + C S C Y C Y
Sbjct: 102 YYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATYRSLGCASPACNALYYPLC--Y 159
Query: 133 KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG-FSTNRNFKFSGIFG 191
+ C+Y Y T+ ++ E TF T+E+ + + + FGCG + SG+ G
Sbjct: 160 QKVCVYQYFYGDSASTAGVLANETFTF-GTNETRVSLPGISFGCGNLNAGSLANGSGMVG 218
Query: 192 LGIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT--PLQ------- 241
G G SLVSQL S FSYC+ S P + ++L G A ++ +A+ P+Q
Sbjct: 219 FGRGSLSLVSQLGSPRFSYCLTSFLSP--VPSRLYFGVYATLNSTNASSEPVQSTPFVVN 276
Query: 242 -FIDGHYYITLEAISVDGRMLDINPNIFKRDDS--GGGVMIDSGTDVTWLVKEAYEALRD 298
+ Y++ + ISV G +L I+P +F +D+ GG +IDSGT +T+L + AY+A+R
Sbjct: 277 PALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDSGTTITYLAEPAYDAVRA 336
Query: 299 EVMIRLEGEQMR---------SYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKD 349
++ + + WP P + HF G ++
Sbjct: 337 AFASQITLPLLNVTDASVLDTCFQWPPP-------PRQSVTLPQLVLHFDGADWELPLQN 389
Query: 350 SMFYQPRP-DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
M P C+A+ +S + ++ Q +NV YD+ +F C ++
Sbjct: 390 YMLVDPSTGGGLCLAMASSSDGSIIGSYQ------HQNFNVLYDLENSLMSFVPAPCHLM 443
>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 389
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 105/363 (28%), Positives = 164/363 (45%), Gaps = 49/363 (13%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ + + IG PP +DTGS +W C PC C Q IF PS+SS++ +
Sbjct: 59 YLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFKEI------ 112
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
RC + H C Y +Y T + TE +T +T + + + GCG
Sbjct: 113 -------RCDTHDHSCPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVMPETIIGCG-RN 164
Query: 181 NRNFK--FSGIFGLGIGRSSLVSQLNSSF----SYCIG--SLHDPDYLHNKLILGDGAI- 231
N FK F+G+ GL G SL++Q+ + SYC ++ N ++ GDG +
Sbjct: 165 NSGFKPGFAGVVGLDRGPKSLITQMGGEYPGLMSYCFAGKGTSKINFGANAIVAGDGVVS 224
Query: 232 --IDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLV 289
+ A P G YY+ L+A+SV ++ F G ++IDSG+ +T+
Sbjct: 225 TTVFVKTAKP-----GFYYLNLDAVSVGNTRIETVGTPFHALK--GNIVIDSGSTLTYF- 276
Query: 290 KEAYEALRDEVMIRLEGEQMRS---YSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
E+Y L +R EQ+ + + D LCY+ S + FP + HF GGA L L
Sbjct: 277 PESYCNL-----VRKAVEQVVTAVRFPRSDILCYY---SKTIDIFPVITMHFSGGADLVL 328
Query: 347 EKDSMFYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+K +M+ FC+A+ I N + ++ G AQ + VGYD +F+ +C
Sbjct: 329 DKYNMYVASNTGGVFCLAI----ICNSPIEEAIFGNRAQNNFLVGYDSSSLLVSFKPTNC 384
Query: 406 EVL 408
L
Sbjct: 385 SAL 387
>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
Length = 538
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 122/437 (27%), Positives = 187/437 (42%), Gaps = 48/437 (10%)
Query: 1 LIHRDSIL-SPYNNPNENPAHRVQRGINISIARLAYLQEKIR-----------SHNT--- 45
++HRDS+L N + R++ + R+ L+++I SH
Sbjct: 118 VVHRDSLLVKDAANATASYERRLEETLRRDARRVRGLEQRIEKRLRLNKDPAGSHENVAE 177
Query: 46 ----YQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGT 101
+ +++ + ++ I +G P Q+ +DTGS ++W+ C PC C Q+
Sbjct: 178 VAAEFGGEVVSGMAQGSGEYFTRIGVGTPMREQYMVLDTGSDVVWIQCEPCSKCYSQVDP 237
Query: 102 IFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKN 161
IF PS S+S++ + C+S C Y C + C+Y Y G T +TE LTF
Sbjct: 238 IFNPSLSASFSTLGCNSAVCSYLDAYNC--HGGGCLYKVSYGDGSYTIGSFATEMLTFGT 295
Query: 162 TDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGR-----SSLVSQLNSSFSYCIGSLHD 216
T V++V GCG F + S L +Q +FSYC+
Sbjct: 296 TS-----VRNVAIGCGHDNAGLFVGAAGLLGLGAGLLSFPSQLGTQTGRAFSYCLVDRFS 350
Query: 217 PDYLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLD-INPNIFKRDD 272
L G ++ TPL + YY+ L +ISV G +LD + P++F+ D+
Sbjct: 351 ES--SGTLEFGPESVPLGSILTPLLTNPSLPTFYYVPLISISVGGALLDSVPPDVFRIDE 408
Query: 273 SG--GGVMIDSGTDVTWLVKEAYEALRDE-VMIRLEGEQMRSYSWPDKLCYHGIMSSDLK 329
+ GG ++DSGT VT L Y+A+RD V + + S D CY + L
Sbjct: 409 TSGRGGFIVDSGTAVTRLQTPVYDAVRDAFVAGTRQLPKAEGVSIFDT-CYD-LSGLPLV 466
Query: 330 GFPTVRFHFRGGAKLAL-EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYN 388
PTV FHF GA L L K+ M FC A PA+ + S++G + QQ
Sbjct: 467 NVPTVVFHFSNGASLILPAKNYMIPMDFMGTFCFAFAPAT-----SDLSIMGNIQQQGIR 521
Query: 389 VGYDIGRKQKTFQRMDC 405
V +D F C
Sbjct: 522 VSFDTANSLVGFALRQC 538
>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 116/389 (29%), Positives = 172/389 (44%), Gaps = 44/389 (11%)
Query: 41 RSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG 100
R+ + ++ + +++ + +G P + +DTGS ++W+ C PC+ C Q
Sbjct: 115 RTAGGFSGAVISGLSQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTD 174
Query: 101 TIFYPSRSSSYAYVPCDSEHCRYF-PYARCSAYKHR-CIYTQLYLIGPETSVFVSTEQLT 158
IF P +S ++A VPC S CR + C + + C+Y Y G T STE LT
Sbjct: 175 AIFDPKKSKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLT 234
Query: 159 FKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGR-----SSLVSQLNSSFSYCI-- 211
F V V GCG F + S ++ N FSYC+
Sbjct: 235 FHGA-----RVDHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVD 289
Query: 212 -GSLHDPDYLHNKLILGDGAIIDEGDATPLQF---IDGHYYITLEAISVDG-RMLDINPN 266
S + ++ G+ A+ TPL +D YY+ L ISV G R+ ++ +
Sbjct: 290 RTSSGSSSKPPSTIVFGNAAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSES 349
Query: 267 IFKRDDSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMR---SYSWPDKLCYHG 322
FK D +G GGV+IDSGT VT L + AY ALRD RL +++ SYS D C+
Sbjct: 350 QFKLDATGNGGVIIDSGTSVTRLTQPAYVALRDA--FRLGATKLKRAPSYSLFDT-CF-- 404
Query: 323 IMSSDLKGF-----PTVRFHFRGGAKLALEKDSMFYQPRPDA-FCMAVNPASINNRYVNF 376
DL G PTV FHF GG +++L + + FC A +
Sbjct: 405 ----DLSGMTTVKVPTVVFHF-GGGEVSLPASNYLIPVNTEGRFCFA-----FAGTMGSL 454
Query: 377 SLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
S+IG + QQ + V YD+ + F C
Sbjct: 455 SIIGNIQQQGFRVAYDLVGSRVGFLSRAC 483
>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 392
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 101/360 (28%), Positives = 164/360 (45%), Gaps = 38/360 (10%)
Query: 59 SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
+++ + + +G PP +DTGS L+W C PC +C Q IF PS SS++ C+
Sbjct: 59 NIYLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFKEKRCNG 118
Query: 119 EHCRY-FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
C Y YA + K ++TE +T +T + + GCG
Sbjct: 119 NSCHYKIIYADTTYSKGT----------------LATETVTIHSTSGEPFVMPETTIGCG 162
Query: 178 FSTNRNFK--FSGIFGLGIGRSSLVSQLNSSF----SYCIGS--LHDPDYLHNKLILGDG 229
+++ FK FSG+ GL G SSL++Q+ + SYC S ++ N ++ GDG
Sbjct: 163 HNSSW-FKPTFSGMVGLSWGPSSLITQMGGEYPGLMSYCFASQGTSKINFGTNAIVAGDG 221
Query: 230 AIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLV 289
+ T + G YY+ L+A+SV ++ F + G ++IDSGT +T+
Sbjct: 222 VVSTTMFLTTAK--PGLYYLNLDAVSVGDTHVETMGTTFHALE--GNIIIDSGTTLTYFP 277
Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKD 349
+R+ V + + + D LCY+ + + FP + HF GGA L L+K
Sbjct: 278 VSYCNLVREAVDHYVTAVRTADPTGNDMLCYY---TDTIDIFPVITMHFSGGADLVLDKY 334
Query: 350 SMFYQP-RPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
+M+ + FC+A+ I N ++ G AQ + VGYD +F +C L
Sbjct: 335 NMYIETITRGTFCLAI----ICNNPPQDAIFGNRAQNNFLVGYDSSSLLVSFSPTNCSAL 390
>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
Length = 412
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 109/360 (30%), Positives = 157/360 (43%), Gaps = 43/360 (11%)
Query: 57 SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC-YPCRDCSPQLGTIFYPSRSSSYAYVP 115
S + + V+I+IG PP+P +DTGS L+W C PCR C PQ ++ P+RS++YA V
Sbjct: 88 STATYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVS 147
Query: 116 CDSEHCRYF--PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
C S C+ P++RCS C Y Y G T ++TE T S V+ V
Sbjct: 148 CRSPMCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLG----SDTAVRGVA 203
Query: 174 FGCGFST-NRNFKFSGIFGLGIGRSSLVSQLNSSFSY--CIGSLHDPDYLHNKLILGDGA 230
FGCG SG+ G+G G SLVSQL + C
Sbjct: 204 FGCGTENLGSTDNSSGLVGMGRGPLSLVSQLGVTRPRRSCRARAAARGGGAPT------- 256
Query: 231 IIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLV 289
+P LE I+V +L I+P +F+ G GGV+IDSGT T L
Sbjct: 257 -----TTSP-----------LEGITVGDTLLPIDPAVFRLTPMGDGGVIIDSGTTFTALE 300
Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKD 349
+ A+ AL + R+ LC+ S + P + HF GA + L ++
Sbjct: 301 ERAFVALARALASRVRLPLASGAHLGLSLCFAA-ASPEAVEVPRLVLHFD-GADMELRRE 358
Query: 350 SMFYQPRPDAF-CMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
S + R C+ + A S++G M QQ ++ YD+ R +F+ C L
Sbjct: 359 SYVVEDRSAGVACLGMVSAR------GMSVLGSMQQQNTHILYDLERGILSFEPAKCGEL 412
>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 471
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 127/417 (30%), Positives = 184/417 (44%), Gaps = 36/417 (8%)
Query: 8 LSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNT------YQAQILPSNDISNSLF 61
LS P E R+QR I + +L+ L R+ + + + ++ + +
Sbjct: 71 LSFNRTPEELFHLRLQRDA-IRVKKLSSLGATSRNLSKPGGTTGFSSSVISGLAQGSGEY 129
Query: 62 YVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHC 121
+ I +G PP + +DTGS ++W+ C PC++C Q +F P +S S+A V C + C
Sbjct: 130 FTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCRTPLC 189
Query: 122 RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTN 181
R C+ + C+Y Y G T+ TE LTF+ T V+ V GCG
Sbjct: 190 RRLESPGCNQ-RQTCLYQVSYGDGSYTTGEFVTETLTFRRT-----KVEQVALGCGHDNE 243
Query: 182 RNF-KFSGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGD 236
F +G+ GLG G S SQ N FSYC+ + ++ G+ A+
Sbjct: 244 GLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCLVD-RSASSKPSSVVFGNSAVSRTAR 302
Query: 237 ATPLQF---IDGHYYITLEAISVDGRMLD-INPNIFKRDDSG-GGVMIDSGTDVTWLVKE 291
TPL +D YY+ L ISV G + I + FK D +G GGV+ID GT VT L K
Sbjct: 303 FTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTSVTRLNKP 362
Query: 292 AYEALRDEVMIRLEGEQMRS---YSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEK 348
AY ALRD R ++S +S D CY + +K PTV HFRG
Sbjct: 363 AYIALRDA--FRAGASSLKSAPEFSLFDT-CYDLSGKTTVK-VPTVVLHFRGADVSLPAS 418
Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ + FC A S+IG + QQ + V YD+ + F C
Sbjct: 419 NYLIPVDGSGRFCFA-----FAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGC 470
>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
gi|224034427|gb|ACN36289.1| unknown [Zea mays]
Length = 443
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 116/423 (27%), Positives = 184/423 (43%), Gaps = 65/423 (15%)
Query: 22 VQRGINISIARLAYLQEKIRSHNTYQAQILPSNDI---------SNSLFYVNISIGQPPV 72
+ R + S AR+A LQ A + P + I S+ + + + IG P
Sbjct: 50 LSRALRRSSARVATLQSL--------AALAPGDAITAARILVLASDGEYLMEMGIGTPTR 101
Query: 73 PQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAY 132
+DTGS L+W C PC C Q F P+RS++Y + C S C Y C Y
Sbjct: 102 YYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATYRSLGCASPACNALYYPLC--Y 159
Query: 133 KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF----SG 188
+ C+Y Y T+ ++ E TF T+E+ + + + FGCG N N SG
Sbjct: 160 QKVCVYQYFYGDSASTAGVLANETFTF-GTNETRVSLPGISFGCG---NLNAGLLANGSG 215
Query: 189 IFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT--PLQ---- 241
+ G G G SLVSQL S FSYC+ S P + ++L G A ++ +A+ P+Q
Sbjct: 216 MVGFGRGSLSLVSQLGSPRFSYCLTSFLSP--VPSRLYFGVYATLNSTNASSEPVQSTPF 273
Query: 242 ----FIDGHYYITLEAISVDGRMLDINPNIFKRDDS--GGGVMIDSGTDVTWLVKEAYEA 295
+ Y++ + ISV G +L I+P +F +D+ GG +IDSGT +T+L + AY+A
Sbjct: 274 VVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDSGTTITYLAEPAYDA 333
Query: 296 LRDEVMIRLEGEQMR---------SYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
+R ++ + + WP P + HF G
Sbjct: 334 VRAAFASQITLPLLNVTDASVLDTCFQWPPP-------PRQSVTLPQLVLHFDGADWELP 386
Query: 347 EKDSMFYQPRP-DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
++ M P C+A+ +S + ++ Q +NV YD+ +F C
Sbjct: 387 LQNYMLVDPSTGGGLCLAMASSSDGSIIGSYQ------HQNFNVLYDLENSLMSFVPAPC 440
Query: 406 EVL 408
++
Sbjct: 441 HLM 443
>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 492
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 114/365 (31%), Positives = 162/365 (44%), Gaps = 32/365 (8%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ I +G P P +DTGS ++W+ C PCR C Q G +F P RS SY V C +
Sbjct: 140 YFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYEQSGQVFDPRRSRSYNAVGCAAPL 199
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
CR C + C+Y Y G T+ +TE LTF V V GCG
Sbjct: 200 CRRLDSGGCDLRRSACLYQVAYGDGSVTAGDFATETLTFAG----GARVARVALGCGHD- 254
Query: 181 NRNFKFSGIFGLGIGRSSL------VSQLNSSFSYCI---GSLHDPDYLHNKLILGDGAI 231
N + LG+GR SL + SFSYC+ S + + + G GA+
Sbjct: 255 NEGLFVAAAGLLGLGRGSLSFPTQISRRYGRSFSYCLVDRTSSANTASRSSTVTFGSGAV 314
Query: 232 ID--EGDATPL---QFIDGHYYITLEAISVDG-RMLDINPNIFKRDDSG--GGVMIDSGT 283
TP+ ++ YY+ L ISV G R+ + + + D S GGV++DSGT
Sbjct: 315 GSTVASSFTPMVKNPRMETFYYVQLIGISVGGARVPGVANSDLRLDPSSGRGGVIVDSGT 374
Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMR--SYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
VT L + AY ALRD G ++ +S D CY + + PTV HF GG
Sbjct: 375 SVTRLARPAYSALRDAFRGAAAGLRLSPGGFSLFDT-CYD-LSGRKVVKVPTVSMHFAGG 432
Query: 342 AKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
A+ AL ++ + FC A A + S+IG + QQ + V +D ++ F
Sbjct: 433 AEAALPPENYLIPVDSKGTFCFAF--AGTDG---GVSIIGNIQQQGFRVVFDGDGQRVAF 487
Query: 401 QRMDC 405
C
Sbjct: 488 TPKGC 492
>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 498
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 112/377 (29%), Positives = 177/377 (46%), Gaps = 42/377 (11%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
L+ + +G PP +DTGS +LW++C C +C G F SS+ A V
Sbjct: 83 LYTTKVKMGTPPREFTVQIDTGSDILWINCNTCSNCPKSSGLGIELNFFDTVGSSTAALV 142
Query: 115 PCDSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETS-VFVST----EQLTFKNTDEST 166
PC C A+CS ++C YT Y G TS V+VS + + ++T +
Sbjct: 143 PCSDPMCASAIQGAAAQCSPQVNQCSYTFQYEDGSGTSGVYVSDAMYFDMILGQSTPANV 202
Query: 167 IHVQDVVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLH 215
+VFGC G T + GI G G G S+VSQL+S FS+C+
Sbjct: 203 ASSATIVFGCSTYQSGDLTKTDKAVDGILGFGPGELSVVSQLSSRGITPKVFSHCLKGDG 262
Query: 216 DPDYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG 274
+ + L+LG+ I++ +PL HY + L++I+V+G++L INP +F D
Sbjct: 263 NGGGI---LVLGE--ILEPSIVYSPLVPSQPHYNLNLQSIAVNGQVLSINPAVFATSDK- 316
Query: 275 GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTV 334
G +IDSGT +++LV+EAY+ L + V + + S+ CY + S D FPTV
Sbjct: 317 RGTIIDSGTTLSYLVQEAYDPLVNAVDTAVS-QFATSFISKGSQCYLVLTSID-DSFPTV 374
Query: 335 RFHFRGGAKLALEKDSMF----YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
F+F GGA + L+ +Q +C+ +++G + + V
Sbjct: 375 SFNFEGGASMDLKPSQYLLNRGFQDGAKMWCIGFQKVQ-----EGVTILGDLVLKDKIVV 429
Query: 391 YDIGRKQKTFQRMDCEV 407
YD+ R+Q + DC +
Sbjct: 430 YDLARQQIGWTNYDCSM 446
>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
Length = 469
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 124/366 (33%), Positives = 162/366 (44%), Gaps = 39/366 (10%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD--CSPQLGTIFYPSRSSSYAYVPCDS 118
+ V + G P VPQ +DTGS L WV C PC C PQ +F PS SS+YA VPC S
Sbjct: 122 YVVTLGFGTPAVPQVLLIDTGSDLSWVQCQPCNSSTCYPQKDPVFDPSASSTYAPVPCGS 181
Query: 119 EHCRYF---PYA----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
E CR YA S+ C Y Y G T STE LT + E+ V +
Sbjct: 182 EACRDLDPDSYANGCTNSSSGASLCQYGIQYGNGDTTVGVYSTETLTL--SPEAATVVNN 239
Query: 172 VVFGCGF-STNRNFKFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLIL 226
FGCG F G+ GLG SLVSQ + FSYC L + L L
Sbjct: 240 FSFGCGLVQKGVFDLFDGLLGLGGAPESLVSQTTGTYGGAFSYC---LPAGNSTAGFLAL 296
Query: 227 GDGAIIDEGDA----TPLQFIDGHYY-ITLEAISVDGRMLDINPNIFKRDDSGGGVMIDS 281
G A A TPLQ ++ +Y + L ISV G+ LDI P +F GG++IDS
Sbjct: 297 GAPATGGNNTAGFQFTPLQVVETTFYLVKLTGISVGGKQLDIEPTVFA-----GGMIIDS 351
Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL--CYHGIMSSDLKGFPTVRFHFR 339
GT VT L + AY ALR + + + + L CY ++++ PTV F
Sbjct: 352 GTIVTGLPETAYSALRTAFRSAMSAYPLLPPNDDEDLDTCYDFTGNTNVT-VPTVALTFE 410
Query: 340 GGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKT 399
GG + L+ S AF + + +IG + Q+ + V YD R
Sbjct: 411 GGVTIDLDVPSGVLLDGCLAFVAGASDG-------DTGIIGNVNQRTFEVLYDSARGHVG 463
Query: 400 FQRMDC 405
F+ C
Sbjct: 464 FRAGAC 469
>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 472
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 118/373 (31%), Positives = 163/373 (43%), Gaps = 53/373 (14%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR--DCSPQLGTIFYPSRSSSYAYVPCDS 118
+ V + IG P V Q +DTGS L WV C PC DC PQ +F PS+SS++A +PC S
Sbjct: 125 YVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNASDCYPQKDPLFDPSKSSTFATIPCAS 184
Query: 119 EHCRYFPY--------ARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
+ C+ P S +C Y Y G T STE L S+ V+
Sbjct: 185 DACKQLPVDGYDNGCTNNTSGMPPQCGYAIEYGNGAITEGVYSTETLALG----SSAVVK 240
Query: 171 DVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLI 225
FGCG + + KF G+ GLG SLVSQ S +FSYC+ L+ L
Sbjct: 241 SFRFGCGSDQHGPYDKFDGLLGLGGAPESLVSQTASVYGGAFSYCLPPLNSGAGF---LT 297
Query: 226 LGDGAIIDEGDA----TPLQF----IDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
LG + ++ TP+ I Y +TL ISV G+ LDI P +F + G
Sbjct: 298 LGAPNSTNNSNSGFVFTPMHAFSPKIATFYVVTLTGISVGGKALDIPPAVFAK-----GN 352
Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRL-EGEQMRSYSWPDKLCY----HGIMSSDLKGFP 332
++DSGT +T + AY+ALR + E + CY HG ++ P
Sbjct: 353 IVDSGTVITGIPTTAYKALRTAFRSAMAEYPLLPPADSALDTCYNFTGHGTVT-----VP 407
Query: 333 TVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
V F GGA + L+ S C+A A +F +IG + + V YD
Sbjct: 408 KVALTFVGGATVDLDVPSGVLVED----CLAFADAGDG----SFGIIGNVNTRTIEVLYD 459
Query: 393 IGRKQKTFQRMDC 405
G+ F+ C
Sbjct: 460 SGKGHLGFRAGAC 472
>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
Length = 509
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 114/354 (32%), Positives = 152/354 (42%), Gaps = 22/354 (6%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ + IG P + +DTGS + WV C PC DC Q +F PS S+SYA V CDS
Sbjct: 169 YFSRVGIGSPARELYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSPR 228
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
CR A C C+Y Y G T +TE LT + + V +V GCG
Sbjct: 229 CRDLDTAACRNATGACLYEVAYGDGSYTVGDFATETLTLGD----STPVTNVAIGCGHDN 284
Query: 181 NRNFKFSGIFGLGIGRS-SLVSQLN-SSFSYCIGSLHDPDYLHNKLILG-DGAIIDEGDA 237
F + G S SQ++ S+FSYC+ P + L G DGA D A
Sbjct: 285 EGLFVGAAGLLALGGGPLSFPSQISASTFSYCLVDRDSP--AASTLQFGADGAEADTVTA 342
Query: 238 TPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDS--GGGVMIDSGTDVTWLVKEA 292
PL YY+ L ISV G+ L I + F D + GGV++DSGT VT L A
Sbjct: 343 -PLVRSPRTGTFYYVALSGISVGGQALSIPSSAFAMDATSGSGGVIVDSGTAVTRLQSSA 401
Query: 293 YEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL-EKDSM 351
Y ALRD + S CY + ++ P V F GG L L K+ +
Sbjct: 402 YAALRDAFVRGTPSLPRTSGVSLFDTCYDLSDRTSVE-VPAVSLRFEGGGALRLPAKNYL 460
Query: 352 FYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+C+A P + S+IG + QQ V +D + F C
Sbjct: 461 IPVDGAGTYCLAFAPTN-----AAVSIIGNVQQQGTRVSFDTAKGVVGFTPNKC 509
>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 486
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 110/367 (29%), Positives = 162/367 (44%), Gaps = 23/367 (6%)
Query: 46 YQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYP 105
+++ I+ + ++ + IG+PP P + +DTGS + WV C PC +C Q F P
Sbjct: 136 FESPIVSGASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPXFEP 195
Query: 106 SRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES 165
+ S+S+ + C++E C+ + C C+Y Y G T TE +T +T
Sbjct: 196 TSSASFTSLSCETEQCKSLDVSEC--RNGTCLYEVSYGDGSYTVGDFVTETVTLGST--- 250
Query: 166 TIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNK 223
+ ++ GCG + F +G+ GLG G S SQLN SSFSYC L D D
Sbjct: 251 --SLGNIAIGCGHNNEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYC---LVDRDSDSTS 305
Query: 224 LILGDGAIIDEGDATPLQF---IDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMI 279
+ + I + PL +D +Y+ L +SV G +L I F+ +D GG+++
Sbjct: 306 TLDFNSPITPDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGIIV 365
Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFR 339
DSGT VT L Y LRD + Q CY + S PTV FHF
Sbjct: 366 DSGTAVTRLQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYD-LSSKSRVEVPTVSFHFA 424
Query: 340 GGAKLAL-EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQK 398
G +L L K+ + FC A P S++G QQ VG+D+
Sbjct: 425 NGNELPLPAKNYLIPVDSEGTFCFAFAPTD-----STLSILGNAQQQGTRVGFDLANSLV 479
Query: 399 TFQRMDC 405
F C
Sbjct: 480 GFSPNKC 486
>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
[Cucumis sativus]
Length = 384
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 116/358 (32%), Positives = 163/358 (45%), Gaps = 29/358 (8%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ I +G PP + +DTGS ++W+ C PC++C Q +F P +S S+A V C +
Sbjct: 42 YFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCRTPL 101
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
CR C+ + C+Y Y G T+ TE LTF+ T V+ V GCG
Sbjct: 102 CRRLESPGCNQ-RQTCLYQVSYGDGSYTTGEFVTETLTFRRT-----KVEQVALGCGHDN 155
Query: 181 NRNF-KFSGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
F +G+ GLG G S SQ N FSYC+ + ++ G+ A+
Sbjct: 156 EGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCLVD-RSASSKPSSVVFGNSAVSRTA 214
Query: 236 DATPLQF---IDGHYYITLEAISVDGRMLD-INPNIFKRDDSG-GGVMIDSGTDVTWLVK 290
TPL +D YY+ L ISV G + I + FK D +G GGV+ID GT VT L K
Sbjct: 215 RFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTSVTRLNK 274
Query: 291 EAYEALRDEVMIRLEGEQMRS---YSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALE 347
AY ALRD R ++S +S D CY + +K PTV HFRG
Sbjct: 275 PAYIALRDA--FRAGASSLKSAPEFSLFDT-CYDLSGKTTVK-VPTVVLHFRGADVSLPA 330
Query: 348 KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ + FC A + S+IG + QQ + V YD+ + F C
Sbjct: 331 SNYLIPVDGSGRFCFAFAGTT-----SGLSIIGNIQQQGFRVVYDLASSRVGFSPRGC 383
>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
Length = 479
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 109/375 (29%), Positives = 161/375 (42%), Gaps = 43/375 (11%)
Query: 52 PSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSY 111
P + + + V G P +DTGS + W+ C PC DC Q+ IF P +SSSY
Sbjct: 129 PGSKVGTGNYIVTAGFGTPAKNSLLIIDTGSDVTWIQCKPCSDCYSQVDPIFEPQQSSSY 188
Query: 112 AYVPCDSEHCRYFPYARCSAYKH----RCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI 167
++ C S C + H C+Y Y G + S E LT +
Sbjct: 189 KHLSCLSSAC-----TELTTMNHCRLGGCVYEINYGDGSRSQGDFSQETLTLGSDS---- 239
Query: 168 HVQDVVFGCGFSTNRNFKFS-GIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHN 222
FGCG + FK S G+ GLG S SQ S FSYC+ PD++ +
Sbjct: 240 -FPSFAFGCGHTNTGLFKGSAGLLGLGRTALSFPSQTKSKYGGQFSYCL-----PDFVSS 293
Query: 223 ----KLILGDGAIIDEGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSGG 275
+G G+I PL + Y++ L ISV G L I P + R G
Sbjct: 294 TSTGSFSVGQGSIPATATFVPLVSNSNYPSFYFVGLNGISVGGERLSIPPAVLGR----G 349
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTV 334
G ++DSGT +T LV +AY+AL+ + + +S D CY S ++ PT+
Sbjct: 350 GTIVDSGTVITRLVPQAYDALKTSFRSKTRNLPSAKPFSILDT-CYDLSSYSQVR-IPTI 407
Query: 335 RFHFRGGAKLALEKDSMFYQPRPDA--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
FHF+ A +A+ + + + D C+A AS + ++ ++IG QQ V +D
Sbjct: 408 TFHFQNNADVAVSAVGILFTIQSDGSQVCLAFASAS---QSISTNIIGNFQQQRMRVAFD 464
Query: 393 IGRKQKTFQRMDCEV 407
G + F C
Sbjct: 465 TGAGRIGFAPGSCAT 479
>gi|15226358|ref|NP_180389.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4803959|gb|AAD29831.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252998|gb|AEC08092.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 756
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 103/374 (27%), Positives = 165/374 (44%), Gaps = 62/374 (16%)
Query: 59 SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
S++ + + +G PP +DTGS ++W C PC +C Q IF PS+SS++ C+
Sbjct: 419 SIYLMKLQVGTPPFEIVAEIDTGSDIIWTQCMPCPNCYSQFAPIFDPSKSSTFREQRCNG 478
Query: 119 EHCRYFPYARCSAYKHRCIYT-QLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
C Y IY + Y G ++TE +T +T + + GCG
Sbjct: 479 NSCHY-----------EIIYADKTYSKG-----ILATETVTIPSTSGEPFVMAETKIGCG 522
Query: 178 FSTNRNFKF-------SGIFGLGIGRSSLVSQLNSSF----SYCIG--SLHDPDYLHNKL 224
N N ++ SGI GL +G SL+SQ++ + SYC ++ N +
Sbjct: 523 LD-NTNLQYSGFASSSSGIVGLNMGPLSLISQMDLPYPGLISYCFSGQGTSKINFGTNAI 581
Query: 225 ILGDGAIIDEGDATPLQFI---DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDS 281
+ GDG + + FI + YY+ L+A+SV+ ++ F +D G + IDS
Sbjct: 582 VAGDGTVAAD------MFIKKDNPFYYLNLDAVSVEDNLIATLGTPFHAED--GNIFIDS 633
Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
GT +T+ +R+ V + ++ + LCY+ S + FP + HF GG
Sbjct: 634 GTTLTYFPMSYCNLVREAVEQVVTAVKVPDMGSDNLLCYY---SDTIDIFPVITMHFSGG 690
Query: 342 AKLALEKDSMFYQPRPDA-FCMAVN------PASINNRYVNFSLIGMMAQQFYNVGYDIG 394
A L L+K +M+ + FC+A+ PA NR AQ + VGYD
Sbjct: 691 ADLVLDKYNMYLETITGGIFCLAIGCNDPSMPAVFGNR----------AQNNFLVGYDPS 740
Query: 395 RKQKTFQRMDCEVL 408
+F +C L
Sbjct: 741 SNVISFSPTNCSAL 754
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 107/387 (27%), Positives = 171/387 (44%), Gaps = 53/387 (13%)
Query: 22 VQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTG 81
+QR N S RL+ + +++ + Y + N ++ + + +G PP +DTG
Sbjct: 50 IQRRSNSSSFRLS--KNQLQGASPYADTLFDYN-----IYLMKLQVGTPPFEIAAEIDTG 102
Query: 82 SSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQL 141
S L+W C PC DC Q IF PS+SS++ C + C Y + Y
Sbjct: 103 SDLIWTQCMPCPDCYSQFDPIFDPSKSSTFNEQRCHGKSCHYEIIYEDNTYSKG------ 156
Query: 142 YLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST----NRNF--KFSGIFGLGIG 195
++TE +T +T + + GCG N F SGI GL +G
Sbjct: 157 ---------ILATETVTIHSTSGEPFVMAETTIGCGLHNTDLDNSGFASSSSGIVGLNMG 207
Query: 196 RSSLVSQLNSSF----SYCIG--SLHDPDYLHNKLILGDGAIIDEGDATPLQFI---DGH 246
SL+SQ++ + SYC ++ N ++ GDG + + FI +
Sbjct: 208 PRSLISQMDLPYPGLISYCFSGQGTSKINFGTNAIVAGDGTVAAD------MFIKKDNPF 261
Query: 247 YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEG 306
YY+ L+A+SV+ ++ F +D G ++IDSG+ VT+ +R V +
Sbjct: 262 YYLNLDAVSVEDNRIETLGTPFHAED--GNIVIDSGSTVTYFPVSYCNLVRKAVEQVVTA 319
Query: 307 EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDA-FCMAVN 365
++ S D LCY S + FP + HF GGA L L+K +M+ + FC+A+
Sbjct: 320 VRVPDPSGNDMLCY---FSETIDIFPVITMHFSGGADLVLDKYNMYMESNSGGLFCLAI- 375
Query: 366 PASINNRYVNFSLIGMMAQQFYNVGYD 392
I N ++ G AQ + VGYD
Sbjct: 376 ---ICNSPTQEAIFGNRAQNNFLVGYD 399
>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 107/434 (24%), Positives = 190/434 (43%), Gaps = 54/434 (12%)
Query: 8 LSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISI 67
+ P+ P++ ++ RL++ + + + ++ ++ + ++V++ +
Sbjct: 44 IKPFTTPSQ--------ALSFDSHRLSFFFSALHTPQSLKSPVVSGASTGSGQYFVDLRL 95
Query: 68 GQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQL-GTIFYPSRSSSYAYVPCDSEHCRYFPY 126
G PP DTGS L+WV C CR+C+ G+ F S++++ C C+ P
Sbjct: 96 GTPPQKLLLVADTGSDLVWVKCSACRNCTRHTPGSAFLARHSTTFSPNHCYDSACQLVPL 155
Query: 127 ARCSAYKHRCIYTQL---------YLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
+ HRC + +L Y G +TS F S E T + ++ + FGC
Sbjct: 156 PK----HHRCNHARLHSPCRYEYSYGDGSKTSGFFSKETTTLNTSSGREAKLKGIAFGCA 211
Query: 178 FS------TNRNFKFS-GIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLIL 226
F + +F + G+ GLG G SL SQL + FSYC+ + L++
Sbjct: 212 FRISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHRFGNKFSYCLMDHDISPSPTSYLLI 271
Query: 227 GDGAIIDEGDATP----LQFIDGH--------YYITLEAISVDGRMLDINPNIFKRDDSG 274
G + D P ++F H YYI +E++SVDG L INP+++ D+ G
Sbjct: 272 GS----TQNDVAPGKRRMRFTPLHINPLSPTFYYIGIESVSVDGIKLPINPSVWALDELG 327
Query: 275 -GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPT 333
GG ++DSGT +T+L + AY + + R+ + LC + + + P
Sbjct: 328 NGGTIVDSGTTLTFLPEPAYLQILTVIKRRVRLPSPAEPTPGFDLCVN-VSEIEHPRLPK 386
Query: 334 VRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
+ F G + + + F D C+A+ + FS+IG + QQ + + +D
Sbjct: 387 LSFKLGGDSVFSPPPRNYFVDTDEDVKCLALQAVMTPS---GFSVIGNLMQQGFLLEFDK 443
Query: 394 GRKQKTFQRMDCEV 407
R + F R C +
Sbjct: 444 DRTRLGFSRHGCAL 457
>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
Length = 454
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 107/385 (27%), Positives = 170/385 (44%), Gaps = 33/385 (8%)
Query: 47 QAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ-LGTIFYP 105
+A + I + + +++S+G PP P +DTGS L+W C PC DC Q + P
Sbjct: 76 RAGLGAGGGIVTNEYLMHVSVGTPPRPVALTLDTGSDLVWTQCAPCLDCFEQGAAPVLDP 135
Query: 106 SRSSSYAYVPCDSEHCRYFPYARCSAYK---HRCIYTQLYLIGPETSVFVSTEQLTFKNT 162
+ SS++A +PCD+ CR P+ C C+Y Y T ++T+ TF
Sbjct: 136 AASSTHAALPCDAPLCRALPFTSCGGRSWGDRSCVYVYHYGDRSLTVGQLATDSFTFGGD 195
Query: 163 DES-TIHVQDVVFGCGFSTNRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPD 218
D + + + V FGCG F+ +GI G G GR SL SQLN +SFSYC S+ D
Sbjct: 196 DNAGGLAARRVTFGCGHINKGIFQANETGIAGFGRGRWSLPSQLNVTSFSYCFTSMFDTK 255
Query: 219 YLHNKLILGDGA--------IIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINP 265
+ + LG A GD + I Y++ L ISV G + +
Sbjct: 256 S-SSVVTLGAAAAELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGISVGGARVAVPE 314
Query: 266 NIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMS 325
+ + +IDSG +T L ++ YEA++ E + ++ + S LC+ ++
Sbjct: 315 SRLRSS-----TIIDSGASITTLPEDVYEAVKAEFVSQVGLPAAAAGSAALDLCFALPVA 369
Query: 326 SDLK--GFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMA 383
+ + P + H GGA L + + ++ V A+ + V IG
Sbjct: 370 ALWRRPAVPALTLHLDGGADWELPRGNYVFEDYAARVLCVVLDAAAGEQVV----IGNYQ 425
Query: 384 QQFYNVGYDIGRKQKTFQRMDCEVL 408
QQ +V YD+ +F C+ L
Sbjct: 426 QQNTHVVYDLENDVLSFAPARCDKL 450
>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
Length = 474
Score = 131 bits (329), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 112/402 (27%), Positives = 178/402 (44%), Gaps = 35/402 (8%)
Query: 31 ARLAYLQEKIRSHNTYQAQILP---SNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWV 87
AR ++ S A++ P ++ + ++ + V+++IG PP P +DTGS L W
Sbjct: 78 ARSKARSARLLSGRAASARVDPGSYTDGVPDTEYLVHMAIGTPPQPVQLILDTGSDLTWT 137
Query: 88 HCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARC---SAYKHRCIYTQLYLI 144
C PC C Q F PSRS +++ +PCD CR ++ C S C+Y Y
Sbjct: 138 QCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLRICRDLTWSSCGEQSWGNGICVYAYAYAD 197
Query: 145 GPETSVFVSTEQLTFKNTDEST--IHVQDVVFGCGFSTNRNF--KFSGIFGLGIGRSSLV 200
T+ + ++ +F + D + V D+ FGCG N F +GI G G S+
Sbjct: 198 HSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNNGIFVSNETGIAGFSRGALSMP 257
Query: 201 SQLN-SSFSYCI----GSLHDPDYLHNKLIL-GDGAIIDEGDATPLQFIDGH------YY 248
+QL +FSYC GS P +L L D A G I H YY
Sbjct: 258 AQLKVDNFSYCFTAITGSEPSPVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYY 317
Query: 249 ITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGE 307
I+L+ ++V L I ++F ++D GG ++DSGT +T L + Y + D + + +
Sbjct: 318 ISLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLT 377
Query: 308 QMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDA----FCMA 363
S S +LC+ + P + HF GA L L +++ ++ C+A
Sbjct: 378 VHNSTSSLSQLCF-SVPPGAKPDVPALVLHFE-GATLDLPRENYMFEIEEAGGIRLTCLA 435
Query: 364 VNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+N + S+IG QQ +V YD+ +F C
Sbjct: 436 INAGE------DLSVIGNFQQQNMHVLYDLANDMLSFVPARC 471
>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 414
Score = 131 bits (329), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 129/417 (30%), Positives = 195/417 (46%), Gaps = 50/417 (11%)
Query: 20 HRVQRGINISIARLAYLQEKIR----SHN--TYQAQILPSNDISNSLFYVNISIGQPPVP 73
R+Q+ + + R+ +Q +IR +HN Q QI S+ I+ +++G
Sbjct: 16 RRLQKQLILDDLRVRSMQNRIRRVASTHNVEASQTQIPLSSGINLQTLNYIVTMGLGSKN 75
Query: 74 QFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYAR----- 128
+DTGS L WV C PC C Q G IF PS SSSY V C+S C+ +A
Sbjct: 76 MTVIIDTGSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVSCNSSTCQSLQFATGNTGA 135
Query: 129 C-SAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFK-- 185
C S+ C Y Y G T+ + E L+F + V D VFGCG RN K
Sbjct: 136 CGSSNPSTCNYVVNYGDGSYTNGELGVEALSFGG-----VSVSDFVFGCG----RNNKGL 186
Query: 186 FSGIFGL-GIGRS--SLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
F G+ GL G+GRS SLVSQ N++ FSYC+ + L++G+ + + + +A
Sbjct: 187 FGGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTTEAGS--SGSLVMGNESSVFK-NAN 243
Query: 239 PLQF--------IDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVK 290
P+ + + Y + L I V G L P F GG++IDSGT +T L
Sbjct: 244 PITYTRMLSNPQLSNFYILNLTGIDVGGVALKA-PLSFGN----GGILIDSGTVITRLPS 298
Query: 291 EAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKD 349
Y+AL+ E + + G +S D C++ + D PT+ F G A+L ++
Sbjct: 299 SVYKALKAEFLKKFTGFPSAPGFSILDT-CFN-LTGYDEVSIPTISLRFEGNAQLNVDAT 356
Query: 350 SMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
FY + DA + + AS+++ Y + ++IG Q+ V YD + + F C
Sbjct: 357 GTFYVVKEDASQVCLALASLSDAY-DTAIIGNYQQRNQRVIYDTKQSKVGFAEEPCS 412
>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
Length = 443
Score = 131 bits (329), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 104/367 (28%), Positives = 165/367 (44%), Gaps = 30/367 (8%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ V +++G P P +DTGS L+W C PCRDC Q + P+ SS+YA +PC +
Sbjct: 84 YLVRLAVGTPRRPVALTLDTGSDLVWTQCAPCRDCFDQDLPVLDPAASSTYAALPCGAAR 143
Query: 121 CRYFPYARCSAY---KHR-CIYTQLYLIGPETSVFVSTEQLTFKNTDES--TIHVQDVVF 174
CR P+ C HR CIY Y T ++T++ TF ++ S ++H + + F
Sbjct: 144 CRALPFTSCGVRTLGNHRSCIYAYHYGDKSLTVGEIATDRFTFGDSGGSGESLHTRRLTF 203
Query: 175 GCGFSTNRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAI 231
GCG F+ +GI G G GR SL SQLN +SFSYC S+ + L A+
Sbjct: 204 GCGHLNKGVFQSNETGIAGFGRGRWSLPSQLNVTSFSYCFTSMFESKSSLVTLGGSPAAL 263
Query: 232 ID-----EGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGT 283
E TP+ Y+++L+ ISV L + F+ +IDSG
Sbjct: 264 YSHAHSGEVRTTPILKNPSQPSLYFLSLKGISVGKTRLPVPETKFR------STIIDSGA 317
Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLK--GFPTVRFHFRGG 341
+T L +E YEA++ E ++ LC+ +++ + P++ H G
Sbjct: 318 SITTLPEEVYEAVKAEFAAQVGLPPSGVEGSALDLCFALPVTALWRRPAVPSLTLHLEGA 377
Query: 342 AKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQ 401
+ +F C+ ++ A ++IG QQ +V YD+ + +F
Sbjct: 378 DWELPRSNYVFEDLGARVMCIVLDAAPGEQ-----TVIGNFQQQNTHVVYDLENDRLSFA 432
Query: 402 RMDCEVL 408
C+ L
Sbjct: 433 PARCDRL 439
>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
gi|223949441|gb|ACN28804.1| unknown [Zea mays]
Length = 326
Score = 131 bits (329), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 112/336 (33%), Positives = 145/336 (43%), Gaps = 20/336 (5%)
Query: 78 MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCI 137
+DTGS + WV C PC DC Q +F PS S+SYA V CDS+ CR A C C+
Sbjct: 3 LDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSQRCRDLDTAACRNATGACL 62
Query: 138 YTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRS 197
Y Y G T +TE LT + + V +V GCG F + G
Sbjct: 63 YEVAYGDGSYTVGDFATETLTLGD----STPVGNVAIGCGHDNEGLFVGAAGLLALGGGP 118
Query: 198 -SLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLE 252
S SQ++ S+FSYC+ P + L GDGA PL YY+ L
Sbjct: 119 LSFPSQISASTFSYCLVDRDSP--AASTLQFGDGAAEAGTVTAPLVRSPRTSTFYYVALS 176
Query: 253 AISVDGRMLDINPNIFKRDDS--GGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMR 310
ISV G+ L I + F D + GGV++DSGT VT L AY ALRD +
Sbjct: 177 GISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQSAAYAALRDAFVQGAPSLPRT 236
Query: 311 SYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL-EKDSMFYQPRPDAFCMAVNPASI 369
S CY + ++ P V F GG L L K+ + +C+A P
Sbjct: 237 SGVSLFDTCYDLSDRTSVE-VPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCLAFAP--- 292
Query: 370 NNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
N V S+IG + QQ V +D R F C
Sbjct: 293 TNAAV--SIIGNVQQQGTRVSFDTARGAVGFTPNKC 326
>gi|357127503|ref|XP_003565419.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 486
Score = 130 bits (328), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 124/445 (27%), Positives = 190/445 (42%), Gaps = 53/445 (11%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNT------YQAQILPSN 54
IHRDS+ SP+++P P R S AR A L + ++ A ++
Sbjct: 44 FIHRDSVKSPFHDPALTPHGRALAAARRSAARAAELHHLLARRSSGAPSPGTGAGVVAEV 103
Query: 55 DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC---RDCSPQLGTIFYPSRSSSY 111
+ + I +G PPV DTGS L+WV C + + F PS SS+Y
Sbjct: 104 VSRQFEYLMAIEVGTPPVRVLAIADTGSDLVWVKCKGKDNDNNSTAPPSVYFVPSASSTY 163
Query: 112 AYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDEST----- 166
V CD++ CR A + C Y Y G S +STE TF +S+
Sbjct: 164 GRVGCDTKACRALSSAASCSPDGSCEYLYSYGDGSRASGQLSTETFTFSTIADSSKTNSH 223
Query: 167 ------------IHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FS 208
+ + + FGC +T F+ G+ GLG G SL SQL ++ FS
Sbjct: 224 GNNNNNSSSHGQVEIAKLDFGCSTTTTGTFRADGLVGLGGGPVSLASQLGATTSLGRKFS 283
Query: 209 YCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDG----HYYITLEAISVDGRMLDIN 264
YC+ + + + L G A++ E A I G +Y I L++I+V G
Sbjct: 284 YCLAPYANTN-ASSALNFGSRAVVSEPGAASTPLITGEVETYYTIALDSINVAGT----- 337
Query: 265 PNIFKRDDSGGG--VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCY-- 320
KR + +++DSGT +T+L L ++ R++ + S LCY
Sbjct: 338 ----KRPTTAAQAHIIVDSGTTLTYLDSALLTPLVKDLTRRIKLPRAESPEKILDLCYDI 393
Query: 321 HGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIG 380
G+ D G P V GG ++ L+ D+ F + C+A+ S + S++G
Sbjct: 394 SGVRGEDALGIPDVTLVLGGGGEVTLKPDNTFVVVQEGVLCLALVATSERQ---SVSILG 450
Query: 381 MMAQQFYNVGYDIGRKQKTFQRMDC 405
+AQQ +VGYD+ + TF DC
Sbjct: 451 NIAQQNLHVGYDLEKGTVTFAAADC 475
>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
Length = 507
Score = 130 bits (328), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 135/450 (30%), Positives = 191/450 (42%), Gaps = 61/450 (13%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILP-------- 52
L+HRDS N A + R + R A++ ++ T ++
Sbjct: 74 LLHRDSFAV-----NATGAELLARRLQRDELRAAWIISTAAANGTPPPDVVGLSTGRGLV 128
Query: 53 ----SNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRS 108
S ++ + I++G P V A+DT S L W+ C PCR C PQ G +F P S
Sbjct: 129 APVVSRAPTSGDYIAKIAVGTPAVEALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHS 188
Query: 109 SSYAYVPCDSEHCRYFPYARCS-AYKHRCIYTQLYLIG---PETSVFVS---TEQLTFKN 161
+SY + D+ C+ + A + CIYT LY G TS V E LTF
Sbjct: 189 TSYGEMNYDAPDCQALGRSGGGDAKRGTCIYTVLYGDGDGHGSTSTSVGDLVEETLTFAG 248
Query: 162 TDESTIHVQDVVFGCGFSTNRNFK--FSGIFGLGIGRSSLVSQL-----NSSFSYC-IGS 213
+ + GCG F +GI GL G+ S+ Q+ N+SFSYC +
Sbjct: 249 ----GVRQAYLSIGCGHDNKGLFGAPAAGILGLSRGQISIPHQIAFLGYNASFSYCLVDF 304
Query: 214 LHDPDYLHNKLILGDGAIIDEGDA--TPL---QFIDGHYYITLEAISVDGRMLDINPNIF 268
+ P + L G GA+ A TP Q + YY+ L +SV G + P +
Sbjct: 305 ISGPGSPSSTLTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRV---PGVT 361
Query: 269 KRD------DSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL---C 319
+RD GGV++DSGT VT L + AY A RD G S P L C
Sbjct: 362 ERDLQLDPYTGHGGVILDSGTTVTRLARPAYTAFRDAFRAAATGLGQVSTGGPSGLFDTC 421
Query: 320 YHGIMSSDLK---GFPTVRFHFRGGAKLALE-KDSMFYQPRPDAFCMAVNPASINNRYVN 375
Y + L+ P V HF GG +L+L+ K+ + C A A +R V
Sbjct: 422 YTVGGRAGLRHCVKVPAVSMHFAGGVELSLQPKNYLITVDSRGTVCFAF--AGTGDRSV- 478
Query: 376 FSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
S+IG + QQ + V YDIG ++ F C
Sbjct: 479 -SVIGNILQQGFRVVYDIGGQRVGFAPNSC 507
>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
Japonica Group]
Length = 446
Score = 130 bits (328), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 116/437 (26%), Positives = 188/437 (43%), Gaps = 55/437 (12%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
+ HRD++ P P +++ + AR A L + + + + +
Sbjct: 31 VFHRDALFPP--PPGAKRGSLLRQRLAADAARYASL---VDATGRLHSPVFSGIPFESGE 85
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ + +G P +DTGS L+W+ C PCR C Q G +F P RSS+Y VPC S
Sbjct: 86 YFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRSSTYRRVPCSSPQ 145
Query: 121 CRYFPYARC---SAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
CR + C A C Y Y G ++ ++T++L F N +V +V GCG
Sbjct: 146 CRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGDLATDKLAFAN----DTYVNNVTLGCG 201
Query: 178 FSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAII 232
F +G+ G+G G+ S+ +Q+ S F YC+G + L+ G
Sbjct: 202 RDNEGLFDSAAGLLGVGRGKISISTQVAPAYGSVFEYCLGDRTSRSTRSSYLVFGR---T 258
Query: 233 DEGDATPLQFIDGH------YYITLEAISVDGRMLDINPNIFKRDDSG---GGVMIDSGT 283
E +T + + YY+ + SV G + N D+ GGV++DSGT
Sbjct: 259 PEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLALDTATGRGGVVVDSGT 318
Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL---CYHGIMSSDLKGFPT-----VR 335
++ ++AY ALRD R MR + + CY DL+G P +
Sbjct: 319 AISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACY------DLRGRPAASAPLIV 372
Query: 336 FHFRGGAKLALEKDSMFY-----QPRPDAF--CMAVNPASINNRYVNFSLIGMMAQQFYN 388
HF GGA +AL ++ F + R ++ C+ A S+IG + QQ +
Sbjct: 373 LHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADD-----GLSVIGNVQQQGFR 427
Query: 389 VGYDIGRKQKTFQRMDC 405
V +D+ +++ F C
Sbjct: 428 VVFDVEKERIGFAPKGC 444
>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 392
Score = 130 bits (328), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 101/360 (28%), Positives = 163/360 (45%), Gaps = 38/360 (10%)
Query: 59 SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
+++ + + +G PP +DTGS L+W C PC +C Q IF PS SS++ C+
Sbjct: 59 NIYLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFKEKRCNG 118
Query: 119 EHCRY-FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
C Y YA + K ++TE +T +T + + GCG
Sbjct: 119 NSCHYKIIYADTTYSKGT----------------LATETVTIHSTSGEPFVMPETTIGCG 162
Query: 178 FSTNRNFK--FSGIFGLGIGRSSLVSQLNSSF----SYCIGS--LHDPDYLHNKLILGDG 229
+++ FK FSG+ GL G SSL++Q+ + SYC S ++ N ++ GDG
Sbjct: 163 HNSSW-FKPTFSGMVGLSWGPSSLITQMGGEYPGLMSYCFASQGTSKINFGTNAIVAGDG 221
Query: 230 AIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLV 289
+ T + G YY+ L+A+SV ++ F + G ++IDSGT +T+
Sbjct: 222 VVSTTMFLTTAK--PGLYYLNLDAVSVGDTHVETMGTTFHALE--GNIIIDSGTTLTYFP 277
Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKD 349
+R+ V + + + D LCY+ + + FP + HF GGA L L+K
Sbjct: 278 VSYCNLVREAVDHYVTAVRTADPTGNDMLCYY---TDTIDIFPVITMHFSGGADLVLDKY 334
Query: 350 SMFYQP-RPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
+M+ + FC+A+ I N ++ G AQ + VGYD F +C L
Sbjct: 335 NMYIETITRGTFCLAI----ICNNPPQDAIFGNRAQNNFLVGYDSSSLLVFFSPTNCSAL 390
>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
Length = 390
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 111/400 (27%), Positives = 170/400 (42%), Gaps = 40/400 (10%)
Query: 31 ARLAYLQEKIRSHN---------TYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTG 81
ARL ++ +I+S + AQ+ + + ++ + IG P + +DTG
Sbjct: 6 ARLRWIHHRIQSSDHRHRRGRSLLQTAQVSSGLSLGSGEYFARMGIGSPQRSYYLELDTG 65
Query: 82 SSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQL 141
S + W+ C PC C Q+ I+ PS SSSY V C S C+ Y+ C C Y +
Sbjct: 66 SDVTWIQCAPCSSCYSQVDPIYDPSNSSSYRRVYCGSALCQALDYSACQGMG--CSYRVV 123
Query: 142 YLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGR----- 196
Y +S + E +F S+ ++++ FGCG S + F+ G
Sbjct: 124 YGDSSASSGDLGIE--SFYLGPNSSTAMRNIAFGCGHSNSGLFRGEAGLLGMGGGTLSFF 181
Query: 197 SSLVSQLNSSFSYCIGSLHDP-DYLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLE 252
S + + + +FSYC+ + + LI G AI TPL ID YY L
Sbjct: 182 SQIAASIGPAFSYCLVDRYSQLQSRSSPLIFGRTAIPFAARFTPLLKNPRIDTFYYAILT 241
Query: 253 AISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS 311
ISV G L I P F +G GG ++DSGT VT +V AY LRD
Sbjct: 242 GISVGGTALPIPPAQFALTGNGTGGAILDSGTSVTRVVPAAYAVLRDAYRAASRNLPPAP 301
Query: 312 YSWPDKLCYHGIMSSDLKGFPTVR-----FHFRGGAKLALEKDSMFYQ-PRPDAFCMAVN 365
+ C+ + +G PTV+ HF + L ++ R FC+A
Sbjct: 302 GVYLLDTCF------NFQGLPTVQIPSLVLHFDNDVDMVLPGGNILIPVDRSGTFCLAFA 355
Query: 366 PASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
P+S+ S+IG + QQ + +G+D+ R +C
Sbjct: 356 PSSM-----PISVIGNVQQQTFRIGFDLQRSLIAIAPREC 390
>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
Length = 448
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 112/402 (27%), Positives = 178/402 (44%), Gaps = 35/402 (8%)
Query: 31 ARLAYLQEKIRSHNTYQAQILP---SNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWV 87
AR ++ S A++ P ++ + ++ + V+++IG PP P +DTGS L W
Sbjct: 52 ARSKARSARLLSGRAASARMDPGSYTDGVPDTEYLVHMAIGTPPQPVQLILDTGSDLTWT 111
Query: 88 HCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARC---SAYKHRCIYTQLYLI 144
C PC C Q F PSRS +++ +PCD CR ++ C S C+Y Y
Sbjct: 112 QCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLRICRDLTWSSCGEQSWGNGICVYAYAYAD 171
Query: 145 GPETSVFVSTEQLTFKNTDEST--IHVQDVVFGCGFSTNRNF--KFSGIFGLGIGRSSLV 200
T+ + ++ +F + D + V D+ FGCG N F +GI G G S+
Sbjct: 172 HSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNNGIFVSNETGIAGFSRGALSMP 231
Query: 201 SQLN-SSFSYCI----GSLHDPDYLHNKLIL-GDGAIIDEGDATPLQFIDGH------YY 248
+QL +FSYC GS P +L L D A G I H YY
Sbjct: 232 AQLKVDNFSYCFTAITGSEPSPVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYY 291
Query: 249 ITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGE 307
I+L+ ++V L I ++F ++D GG ++DSGT +T L + Y + D + + +
Sbjct: 292 ISLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLT 351
Query: 308 QMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDA----FCMA 363
S S +LC+ + P + HF GA L L +++ ++ C+A
Sbjct: 352 VHNSTSSLSQLCFS-VPPGAKPDVPALVLHFE-GATLDLPRENYMFEIEEAGGIRLTCLA 409
Query: 364 VNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+N + S+IG QQ +V YD+ +F C
Sbjct: 410 INAGE------DLSVIGNFQQQNMHVLYDLANDMLSFVPARC 445
>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
Length = 458
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 106/378 (28%), Positives = 168/378 (44%), Gaps = 34/378 (8%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DCSPQ-LGTIFYPSRSSSYAYVPCDS 118
++V+I +G PP DTGS L WV C C+ +CS G+ F S++++ C S
Sbjct: 83 YFVSIRLGSPPQTLLLVADTGSDLTWVRCSACKTNCSIHPPGSTFLARHSTTFSPTHCFS 142
Query: 119 EHCRYFPYARCSAYKH-----RCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
C+ P + H C Y +Y G +TS F S E T + + ++ +
Sbjct: 143 SLCQLVPQPNPNPCNHTRLHSTCRYEYVYSDGSKTSGFFSKETTTLNTSSGREMKLKSIA 202
Query: 174 FGCGFSTN------RNFK-FSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHN 222
FGCGF + +F SG+ GLG G S SQL SFSYC+ +
Sbjct: 203 FGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQLGRRFGRSFSYCLLDYTLSPPPTS 262
Query: 223 KLILGDGAIIDEGDATPLQFI--------DGHYYITLEAISVDGRMLDINPNIFKRDDSG 274
L++GD + + + + F YYI+++ + VDG L I+P+++ D+ G
Sbjct: 263 YLMIGDVVSTKKDNKSMMSFTPLLINPEAPTFYYISIKGVFVDGVKLHIDPSVWSLDELG 322
Query: 275 -GGVMIDSGTDVTWLVKEAYE----ALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLK 329
GG +IDSGT +T+L + AY A + EV + S LC + +
Sbjct: 323 NGGTVIDSGTTLTFLTEPAYREILSAFKREVKLPSPTPGGASTRSGFDLCVN-VTGVSRP 381
Query: 330 GFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNV 389
FP + G + + + F C+A+ P + FS+IG + QQ + +
Sbjct: 382 RFPRLSLELGGESLYSPPPRNYFIDISEGIKCLAIQP--VEAESGRFSVIGNLMQQGFLL 439
Query: 390 GYDIGRKQKTFQRMDCEV 407
+D G+ + F R C V
Sbjct: 440 EFDRGKSRLGFSRRGCAV 457
>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
Length = 373
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 103/362 (28%), Positives = 168/362 (46%), Gaps = 43/362 (11%)
Query: 73 PQFTAMDTGSSLLWVHC-------YPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR--Y 123
P+ +DTGS L+W C R SP ++ P SS++A++PC C+
Sbjct: 25 PRKLIVDTGSDLIWTQCKLSSSTAAAARHGSPP---VYDPGESSTFAFLPCSDRLCQEGQ 81
Query: 124 FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG-FSTNR 182
F + C++ K+RC+Y +Y V S E TF ++ + FGCG S
Sbjct: 82 FSFKNCTS-KNRCVYEDVYGSAAAVGVLAS-ETFTFGARRAVSLRLG---FGCGALSAGS 136
Query: 183 NFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT-PL 240
+GI GL SL++QL FSYC+ D + L+ G A + T P+
Sbjct: 137 LIGATGILGLSPESLSLITQLKIQRFSYCLTPFADKKT--SPLLFGAMADLSRHKTTRPI 194
Query: 241 QFI--------DGHYYITLEAISVDGRMLDI-NPNIFKRDDSGGGVMIDSGTDVTWLVKE 291
Q +YY+ L IS+ + L + ++ R D GGG ++DSG+ V +LV+
Sbjct: 195 QTTAIVSNPVETVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEA 254
Query: 292 AYEALRDEVM--IRLEGEQMRSYSWPDKLCY-----HGIMSSDLKGFPTVRFHFRGGAKL 344
A+EA+++ VM +RL + +LC+ + + P + HF GGA +
Sbjct: 255 AFEAVKEAVMDVVRLPVANRTVEDY--ELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAAM 312
Query: 345 ALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
L +D+ F +PR C+AV + + S+IG + QQ +V +D+ + +F
Sbjct: 313 VLPRDNYFQEPRAGLMCLAVGKTTDGS---GVSIIGNVQQQNMHVLFDVQHHKFSFAPTQ 369
Query: 405 CE 406
C+
Sbjct: 370 CD 371
>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 500
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 109/354 (30%), Positives = 153/354 (43%), Gaps = 25/354 (7%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ + +G+P + +DTGS + W+ C PC DC Q ++ PS S+SYA V CDS
Sbjct: 163 YFSRVGVGRPARQLYMVLDTGSDVTWLQCQPCADCYAQSDPVYDPSVSTSYATVGCDSPR 222
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
CR A C C+Y Y G T +TE LT + + V +V GCG
Sbjct: 223 CRDLDAAACRNSTGSCLYEVAYGDGSYTVGDFATETLTLGD----SAPVSNVAIGCGHDN 278
Query: 181 NRNFKFSGIFGLGIGRS-SLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
F + G S SQ++ ++FSYC+ P + L GD E A
Sbjct: 279 EGLFVGAAGLLALGGGPLSFPSQISATTFSYCLVDRDSPS--SSTLQFGD----SEQPAV 332
Query: 239 PLQFI-----DGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEA 292
I + YY+ L ISV G L I + F DD+G GGV++DSGT VT L A
Sbjct: 333 TAPLIRSPRTNTFYYVALSGISVGGEALSIPSSAFAMDDAGSGGVIVDSGTAVTRLQSGA 392
Query: 293 YEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL-EKDSM 351
Y ALR+ + + S CY S ++ P V F GG +L L K+ +
Sbjct: 393 YGALREAFVQGTQSLPRASGVSLFDTCYDLAGRSSVQ-VPAVALWFEGGGELKLPAKNYL 451
Query: 352 FYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+C+A S S+IG + QQ V +D + F C
Sbjct: 452 IPVDAAGTYCLAFAGTS-----GPVSIIGNVQQQGVRVSFDTAKNTVGFTADKC 500
>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
Length = 410
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 113/398 (28%), Positives = 177/398 (44%), Gaps = 35/398 (8%)
Query: 17 NPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFT 76
PA + R + S RL+ L ++ + AQ D + + SIG PP
Sbjct: 38 EPAINLTRAAHKSHQRLSMLAARLDDAASGSAQTPLQLDSGGGAYDMTFSIGTPPQELSA 97
Query: 77 AMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRC 136
DTGS L+W C C C PQ +YP++SSS++ +PC C P ++CSA C
Sbjct: 98 LADTGSDLIWAKCGACTRCVPQGSPSYYPNKSSSFSKLPCSGSLCSDLPSSQCSAGGAEC 157
Query: 137 IYTQLYLIGPE----TSVFVSTEQLTFKNTDESTIHVQDVVFGC-GFSTNRNFKFSGIFG 191
Y Y + + T ++ +E T + V + FGC S SG+ G
Sbjct: 158 DYKYSYGLASDPHHYTQGYLGSETFTLGSD-----AVPGIGFGCTTMSEGGYGSGSGLVG 212
Query: 192 LGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEG-DATPLQFIDGHYY- 248
LG G SLVSQLN +FSYC+ S + L+ G GA+ G +TPL +YY
Sbjct: 213 LGRGPLSLVSQLNVGAFSYCLTSDAAKT---SPLLFGSGALTGAGVQSTPLLRTSTYYYT 269
Query: 249 ITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQ 308
+ LE+IS+ G++ DSGT V +L + AY ++ V+ +
Sbjct: 270 VNLESISIGAAT--------TAGTGSSGIIFDSGTTVAFLAEPAYTLAKEAVLSQTTNLT 321
Query: 309 MRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPAS 368
M S ++C+ + FP++ HF GG + L ++ F C V
Sbjct: 322 MASGRDGYEVCFQ----TSGAVFPSMVLHFDGG-DMDLPTENYFGAVDDSVSCWIV---- 372
Query: 369 INNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
+ + S++G + Q Y++ YD+ + +FQ +C+
Sbjct: 373 --QKSPSLSIVGNIMQMNYHIRYDVEKSMLSFQPANCD 408
>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
Length = 474
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 112/402 (27%), Positives = 178/402 (44%), Gaps = 35/402 (8%)
Query: 31 ARLAYLQEKIRSHNTYQAQILP---SNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWV 87
AR ++ S A++ P ++ + ++ + V+++IG PP P +DTGS L W
Sbjct: 78 ARSKARSARLLSGRAASARMDPGSYTDGVPDTEYLVHMAIGTPPQPVQLILDTGSDLTWT 137
Query: 88 HCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARC---SAYKHRCIYTQLYLI 144
C PC C Q F PSRS +++ +PCD CR ++ C S C+Y Y
Sbjct: 138 QCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLRICRDLTWSSCGEQSWGNGICVYAYAYAD 197
Query: 145 GPETSVFVSTEQLTFKNTDEST--IHVQDVVFGCGFSTNRNF--KFSGIFGLGIGRSSLV 200
T+ + ++ +F + D + V D+ FGCG N F +GI G G S+
Sbjct: 198 HSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNNGIFVSNETGIAGFSRGALSMP 257
Query: 201 SQLN-SSFSYCI----GSLHDPDYLHNKLIL-GDGAIIDEGDATPLQFIDGH------YY 248
+QL +FSYC GS P +L L D A G I H YY
Sbjct: 258 AQLKVDNFSYCFTAITGSEPSPVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYY 317
Query: 249 ITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGE 307
I+L+ ++V L I ++F ++D GG ++DSGT +T L + Y + D + + +
Sbjct: 318 ISLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLT 377
Query: 308 QMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDA----FCMA 363
S S +LC+ + P + HF GA L L +++ ++ C+A
Sbjct: 378 VHNSTSSLSQLCFS-VPPGAKPDVPALVLHFE-GATLDLPRENYMFEIEEAGGIRLTCLA 435
Query: 364 VNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+N + S+IG QQ +V YD+ +F C
Sbjct: 436 INAGE------DLSVIGNFQQQNMHVLYDLANDMLSFVPARC 471
>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 486
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 106/378 (28%), Positives = 176/378 (46%), Gaps = 39/378 (10%)
Query: 57 SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLG---TIFYPSRSSSY 111
S L+Y + +G PP +DTGS +LWV+C C +C S QLG F SS+
Sbjct: 74 SVGLYYTKVKMGTPPKEFNVQIDTGSDILWVNCNTCSNCPQSSQLGIELNFFDTVGSSTA 133
Query: 112 AYVPCDSEHCR---YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFK---NTDES 165
A +PC C A CS ++C YT Y G TS + ++ + F +
Sbjct: 134 ALIPCSDPICTSRVQGAAAECSPRVNQCSYTFQYGDGSGTSGYYVSDAMYFSLIMGQPPA 193
Query: 166 TIHVQDVVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSL 214
+VFGC S T + GIFG G G S+VSQL+S FS+C+
Sbjct: 194 VNSSATIVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSRGITPKVFSHCLKGD 253
Query: 215 HDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG 274
D + + + +I+ +PL HY + L++I+V+G++L INP +F ++
Sbjct: 254 GDGGGVLVLGEILEPSIV----YSPLVPSQPHYNLNLQSIAVNGQLLPINPAVFSISNNR 309
Query: 275 GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG-FPT 333
GG ++D GT + +L++EAY+ L + + + R + CY ++S+ + FP+
Sbjct: 310 GGTIVDCGTTLAYLIQEAYDPLVTAINTAVS-QSARQTNSKGNQCY--LVSTSIGDIFPS 366
Query: 334 VRFHFRGGAKLALEKDSMF----YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNV 389
V +F GGA + L+ + Y + +C+ S++G + + V
Sbjct: 367 VSLNFEGGASMVLKPEQYLMHNGYLDGAEMWCIG-----FQKFQEGASILGDLVLKDKIV 421
Query: 390 GYDIGRKQKTFQRMDCEV 407
YDI +++ + DC +
Sbjct: 422 VYDIAQQRIGWANYDCSL 439
>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
Length = 436
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 109/398 (27%), Positives = 178/398 (44%), Gaps = 39/398 (9%)
Query: 32 RLAYLQEKIRS------HNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLL 85
R+A+L + + +++ Q L N + + +NIS+G P + DTGS L+
Sbjct: 53 RIAFLSDATAAGKATTTNSSVSFQALLENGVGG--YNMNISVGTPLLTFSVVADTGSDLI 110
Query: 86 WVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIG 145
W C PC C Q F P+ SS+++ +PC S C++ P + + C+Y Y G
Sbjct: 111 WTQCAPCTKCFQQPAPPFQPASSSTFSKLPCTSSFCQFLPNSIRTCNATGCVYNYKYGSG 170
Query: 146 PETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN- 204
T+ +++TE T K D S V FGC SGI GLG G SL+ QL
Sbjct: 171 -YTAGYLATE--TLKVGDAS---FPSVAFGCSTENGVGNSTSGIAGLGRGALSLIPQLGV 224
Query: 205 SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFID------GHYYITLEAISVDG 258
FSYC+ S + ++ G A + +G+ F++ +YY+ L I+V
Sbjct: 225 GRFSYCLRSGSAAG--ASPILFGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGE 282
Query: 259 RMLDINPNI--FKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD 316
L + + F ++ GGG ++DSGT +T+L K+ YE ++ + + + +
Sbjct: 283 TDLPVTTSTFGFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTADVTTVNGTRGL 342
Query: 317 KLCYHGIMSSDLK-GFPTVRFHFRGGAKLA-------LEKDSMFYQPRPDAFCMAVNPAS 368
LC+ P++ F GGA+ A +E DS Q C+ + PA
Sbjct: 343 DLCFKSTGGGGGGIAVPSLVLRFDGGAEYAVPTYFAGVETDS---QGSVTVACLMMLPAK 399
Query: 369 INNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
+ S+IG + Q ++ YD+ +F DC
Sbjct: 400 GDQP---MSVIGNVMQMDMHLLYDLDGGIFSFAPADCA 434
>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 455
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 110/374 (29%), Positives = 164/374 (43%), Gaps = 41/374 (10%)
Query: 50 ILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DCSPQLGTIFYPSRS 108
+ P + + + +G P P +DTGSSL W+ C PCR C Q G +F P S
Sbjct: 106 LTPGTSVGVGNYVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRVSCHRQSGPVFDPKTS 165
Query: 109 SSYAYVPCDSEHCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD 163
SSYA V C S C A CS + CIY Y + ++S + ++F
Sbjct: 166 SSYAAVSCSSPQCDGLSTATLNPAVCSP-SNVCIYQASYGDSSFSVGYLSKDTVSFGANS 224
Query: 164 ESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPD 218
V + +GCG F + +G+ GL + SL+ QL SFSYC+ S
Sbjct: 225 -----VPNFYYGCGQDNEGLFGRSAGLMGLARNKLSLLYQLAPTLGYSFSYCLPSTSSSG 279
Query: 219 YLHNKLILGDGAIIDEGDA-TPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG 274
YL G+ G + TP+ D Y+I+L ++V G+ L ++ + + +
Sbjct: 280 YLS------IGSYNPGGYSYTPMVSNTLDDSLYFISLSGMTVAGKPLAVSSSEYTSLPT- 332
Query: 275 GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMR--SYSWPDKLCYHGIMSSDLKGFP 332
+IDSGT +T L Y AL V ++G R +YS D C+ G +S L+ P
Sbjct: 333 ---IIDSGTVITRLPTSVYTALSKAVAAAMKGSTKRAAAYSILDT-CFEG-QASKLRAVP 387
Query: 333 TVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
V F GGA L L ++ C+A PA + ++IG QQ ++V YD
Sbjct: 388 AVSMAFSGGATLKLSAGNLLVDVDGATTCLAFAPAR------SAAIIGNTQQQTFSVVYD 441
Query: 393 IGRKQKTFQRMDCE 406
+ + F C
Sbjct: 442 VKSNRIGFAAAGCS 455
>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
Length = 359
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 109/366 (29%), Positives = 170/366 (46%), Gaps = 37/366 (10%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLGTIFYPSRSSSYAYVPCDS 118
+ + +SIG PP +DTGS L+W+ C C C TIF+ SSSY +PC+S
Sbjct: 5 YMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKLPCNS 64
Query: 119 EHCRYFPYA----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIH---VQD 171
HC A RC + C Y Y G TS V +++++F++ H
Sbjct: 65 THCSGMSSAGIGPRC---EETCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSFFDG 121
Query: 172 VVFGCGFSTNRNFKFS-GIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLIL 226
+FGCG ++ F+ G+ GLG SL+ QL FSYC+ S P + L L
Sbjct: 122 FLFGCGRKLKGDWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKSFLFL 181
Query: 227 GDGAIIDEGDATPLQFIDGH------YYITLEAISVDGRMLDINPNIFKRDDSGG----- 275
G A + D + G YY+ L++I+V G + + + S G
Sbjct: 182 GSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITVGGVPVVVYDKESGHNTSVGPFLAN 241
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLK-GFPTV 334
+IDSGT T L YEA+R + ++ + + + D LC++ S D GFP+V
Sbjct: 242 KTVIDSGTTYTLLTPPVYEAMRKSIEEQVILPTLGNSAGLD-LCFNS--SGDTSYGFPSV 298
Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
F+F +L L +++F D C+ S+++ + S+IG M QQ +++ YD+
Sbjct: 299 TFYFANQVQLVLPFENIFQVTSRDVVCL-----SMDSSGGDLSIIGNMQQQNFHILYDLV 353
Query: 395 RKQKTF 400
Q +F
Sbjct: 354 ASQISF 359
>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
Length = 448
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 103/436 (23%), Positives = 186/436 (42%), Gaps = 45/436 (10%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
++HR ++ R + + ++ + ++ ++ +
Sbjct: 28 VVHRGAVFPSRRGAPPGSLRRCRHAAPFTAQVASFHSIAADDDDRLRSPVMSGVPFDSGE 87
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ I++G PP +DTGS L+W+ C PCR C Q+ ++ P SS++ +PC S
Sbjct: 88 YFAVINVGDPPTRALVVIDTGSDLIWLQCVPCRHCYRQVTPLYDPRSSSTHRRIPCASPR 147
Query: 121 CR-YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
CR Y C A C+Y +Y G +S ++T++L F + HV +V GCG
Sbjct: 148 CRDVLRYPGCDARTGGCVYMVVYGDGSASSGDLATDRLVFPD----DTHVHNVTLGCGHD 203
Query: 180 TNRNFK-FSGIFGLGIGRSSLVSQLNSS----FSYCIGS-LHDPDYLHNKLILGDGAIID 233
+ +G+ G+G G+ S +QL + FSYC+G L + L+ G
Sbjct: 204 NVGLLESAAGLLGVGRGQLSFPTQLAPAYGHVFSYCLGDRLSRAQNGSSYLVFGRTPEPP 263
Query: 234 EGDATPLQFIDGH---YYITLEAISVDGRM--------LDINPNIFKRDDSGGGVMIDSG 282
TPL+ YY+ + SV G L +NP + GG+++DSG
Sbjct: 264 STAFTPLRTNPRRPSLYYVDMVGFSVGGERVTGFSNASLALNPATGR-----GGIVVDSG 318
Query: 283 TDVTWLVKEAYEALRDEV--------MIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTV 334
T ++ ++AY A+RD +R + + L +G ++ ++ P++
Sbjct: 319 TAISRFARDAYAAVRDAFDSHAAAAGTMRKLATKFSVFDACYDLRGNGAPAAAVR-VPSI 377
Query: 335 RFHFRGGAKLALEKDSMFYQ----PRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
HF GGA +AL + + R FC+ + A +++G + QQ + +
Sbjct: 378 VLHFAGGADMALPQANYLIPVQGGDRRTYFCLGLQAADD-----GLNVLGNVQQQGFGLV 432
Query: 391 YDIGRKQKTFQRMDCE 406
+D+ R + F C
Sbjct: 433 FDVERGRIGFTPNGCS 448
>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
Length = 390
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 105/379 (27%), Positives = 167/379 (44%), Gaps = 40/379 (10%)
Query: 54 NDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAY 113
N + + + V+++IG PP P +DTGS L+W C PC C Q F SRSS+ A
Sbjct: 28 NGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCKPCVSCFDQPLPYFDTSRSSTNAL 87
Query: 114 VPCDSEHCRYFPY----ARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHV 169
+PC+S C+ P + + C Y Y T ++ ++ TF + +
Sbjct: 88 LPCESTQCKLDPTVTVCVKLNQTVQTCAYYTSYGDNSVTIGLLAADKFTFV----AGTSL 143
Query: 170 QDVVFGCGFSTNRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDP-------DY 219
V FGCG + F +GI G G G SL SQL +FS+C ++ D
Sbjct: 144 PGVTFGCGLNNTGVFNSNETGIAGFGRGPLSLPSQLKVGNFSHCFTTITGAIPSTVLLDL 203
Query: 220 LHNKLILGDGAIIDEGDATPL-QFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDS 273
+ G GA+ TPL Q+ YY++L+ I+V L + + F +
Sbjct: 204 PADLFSNGQGAV----QTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALTNG 259
Query: 274 GGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPT 333
GG +IDSGT +T L + Y+ +RDE +++ + + C+ S P
Sbjct: 260 TGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHYTCFSA-PSQAKPDVPK 318
Query: 334 VRFHFRGGAKLALEKDSMFYQPRPDA----FCMAVNPASINNRYVNFSLIGMMAQQFYNV 389
+ HF GA + L +++ ++ DA C+A+N ++IG QQ +V
Sbjct: 319 LVLHFE-GATMDLPRENYVFEVPDDAGNSIICLAINKGD------ETTIIGNFQQQNMHV 371
Query: 390 GYDIGRKQKTFQRMDCEVL 408
YD+ +F C+ L
Sbjct: 372 LYDLQNNMLSFVAAQCDKL 390
>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 526
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 105/353 (29%), Positives = 161/353 (45%), Gaps = 20/353 (5%)
Query: 48 AQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSR 107
A + P S F V I +G PP + D + W+ C PC C Q +IF PS+
Sbjct: 174 ASLNPGITTGTSNFLVQIGVGGPPQKFYMIFDLQTDFTWLQCQPCIKCYDQPDSIFDPSQ 233
Query: 108 SSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI 167
SSSY + C+++HC P + CS + C Y Y G T + E ++F ES+
Sbjct: 234 SSSYTLLSCETKHCNLLPNSSCSDDGY-CRYNITYKDGTNTEGVLINETVSF----ESSG 288
Query: 168 HVQDVVFGCGFSTNRNFKFS-GIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLI 225
V V GC F S G FGLG G S S++N SS SYC+ D Y + L
Sbjct: 289 WVDRVSLGCSNKNQGPFVGSDGTFGLGRGSLSFPSRINASSMSYCLVESKD-GYSSSTLE 347
Query: 226 LGDGAIIDEGDATPLQ--FIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSG 282
A LQ + YY+ L+ I V G +D+ + F D G GG+++ S
Sbjct: 348 FNSPPCSGSVKAKLLQNPKAENLYYVGLKGIKVGGEKIDVPNSTFTIDPYGNGGMIVSSS 407
Query: 283 TDVTWLVKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
+ +T L + Y +RD + + + E+++++ D CY+ + S++ P + F G
Sbjct: 408 SLITMLENDTYNVVRDAFVAKTQHLERLKAFLQFDT-CYN-LSSNNTVELPILEFEVNDG 465
Query: 342 AKLALEKDSMFYQ-PRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
L K+S Y + FC A P+ +FS++G + Q V +D+
Sbjct: 466 KSWLLPKESYLYAVDKNGTFCFAFAPSK-----GSFSILGTLQQYGTRVTFDL 513
>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 112/407 (27%), Positives = 180/407 (44%), Gaps = 28/407 (6%)
Query: 15 NENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQ 74
N+N AH I+ A ++ + +Q+ ++ + + + ++V+ +G PP
Sbjct: 23 NDNGAHNSANPPVIT----AVIEGPPSHDHDFQSPVVSGSTLGSGQYFVDFFLGTPPQKF 78
Query: 75 FTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYAR---CS- 130
+D+GS LLWV C PC C Q ++ PS SS++ VPC S C P C
Sbjct: 79 SLIVDSGSDLLWVQCAPCLQCYAQDTPLYAPSNSSTFNPVPCLSPECLLIPATEGFPCDF 138
Query: 131 AYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFS-GI 189
Y C Y Y +TS +S +++ + + V FGCG +F + G+
Sbjct: 139 HYPGACAYEYRYA---DTS--LSKGVFAYESATVDDVRIDKVAFGCGRDNQGSFAAAGGV 193
Query: 190 FGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAI--IDEGDATPLQFI 243
GLG G S SQ+ + F+YC+ + DP + + LI GD I I + TP+
Sbjct: 194 LGLGQGPLSFGSQVGYAYGNKFAYCLVNYLDPTSVSSWLIFGDELISTIHDLQFTPIVSN 253
Query: 244 DGH---YYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYEALRDE 299
+ YY+ +E + V G L I+ + + D G GG + DSGT VT+ + AY +
Sbjct: 254 SRNPTLYYVQIEKVMVGGESLPISHSAWSLDFLGNGGSIFDSGTTVTYWLPPAYRNILAA 313
Query: 300 VMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDA 359
+ + S D LC + D FP+ GGA ++ + F P+
Sbjct: 314 FDKNVRYPRAASVQGLD-LCVD-VTGVDQPSFPSFTIVLGGGAVFQPQQGNYFVDVAPNV 371
Query: 360 FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
C+A+ A + + F+ IG + QQ + V YD + F C
Sbjct: 372 QCLAM--AGLPSSVGGFNTIGNLLQQNFLVQYDREENRIGFAPAKCS 416
>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 561
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 106/369 (28%), Positives = 170/369 (46%), Gaps = 31/369 (8%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+++++ +G PP +DTGS L W+ C PC C Q G + P SSS+ + C
Sbjct: 197 YFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKDSSSFRNISCHDPR 256
Query: 121 CRYF----PYARCSAYKHRCIYTQLYLIGPETS----VFVSTEQLTFKNTDESTIHVQDV 172
C+ P C A C Y Y G T+ + T LT N HV++V
Sbjct: 257 CQLVSAPDPPKPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGTSELKHVENV 316
Query: 173 VFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILG 227
+FGCG F +G+ GLG G S SQ+ S SFSYC+ + + +KLI G
Sbjct: 317 MFGCGHWNRGLFHGAAGLLGLGKGPLSFASQMQSLYGQSFSYCLVDRNSNASVSSKLIFG 376
Query: 228 -DGAIIDEGDATPLQF-------IDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVM 278
D ++ + F +D YY+ ++++ VD +L I + G GG +
Sbjct: 377 EDKELLSHPNLNFTSFGGGKDGSVDTFYYVQIKSVMVDDEVLKIPEETWHLSSEGAGGTI 436
Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYH--GIMSSDLKGFPTVRF 336
IDSGT +T+ + AYE +++ + +++G Q+ P K CY+ GI +L F +
Sbjct: 437 IDSGTTLTYFAEPAYEIIKEAFVRKIKGYQLVEGLPPLKPCYNVSGIEKMELPDFGIL-- 494
Query: 337 HFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
F A ++ F P+ C+A+ + N S+IG QQ +++ YD+ +
Sbjct: 495 -FADEAVWNFPVENYFIWIDPEVVCLAI----LGNPRSALSIIGNYQQQNFHILYDMKKS 549
Query: 397 QKTFQRMDC 405
+ + M C
Sbjct: 550 RLGYAPMKC 558
>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
Length = 446
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 115/437 (26%), Positives = 187/437 (42%), Gaps = 55/437 (12%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
+ HRD++ P P +++ + AR A L + + + + +
Sbjct: 31 VFHRDALFPP--PPGAKRGSLLRQRLAADAARYASL---VDATGRLHSPVFSGIPFESGE 85
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ + +G P +DTGS L+W+ C PCR C Q G +F P RSS+Y VPC S
Sbjct: 86 YFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRSSTYRRVPCSSPQ 145
Query: 121 CRYFPYARC---SAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
CR + C A C Y Y G ++ ++T++L F N +V +V GCG
Sbjct: 146 CRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGELATDKLAFAN----DTYVNNVTLGCG 201
Query: 178 FSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAII 232
F +G+ G+ G+ S+ +Q+ S F YC+G + L+ G
Sbjct: 202 RDNEGLFDSAAGLLGVARGKISISTQVAPAYGSVFEYCLGDRTSRSTRSSYLVFGR---T 258
Query: 233 DEGDATPLQFIDGH------YYITLEAISVDGRMLDINPNIFKRDDSG---GGVMIDSGT 283
E +T + + YY+ + SV G + N D+ GGV++DSGT
Sbjct: 259 PEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLALDTATGRGGVVVDSGT 318
Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL---CYHGIMSSDLKGFPT-----VR 335
++ ++AY ALRD R MR + + CY DL+G P +
Sbjct: 319 AISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACY------DLRGRPAASAPLIV 372
Query: 336 FHFRGGAKLALEKDSMFY-----QPRPDAF--CMAVNPASINNRYVNFSLIGMMAQQFYN 388
HF GGA +AL ++ F + R ++ C+ A S+IG + QQ +
Sbjct: 373 LHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADD-----GLSVIGNVQQQGFR 427
Query: 389 VGYDIGRKQKTFQRMDC 405
V +D+ +++ F C
Sbjct: 428 VVFDVEKERIGFAPKGC 444
>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
gi|224033441|gb|ACN35796.1| unknown [Zea mays]
gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
Length = 456
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 88/266 (33%), Positives = 129/266 (48%), Gaps = 24/266 (9%)
Query: 50 ILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSS 109
+ + I+ + + V++++G PP P +DTGS L+W C PCRDC Q + P+ SS
Sbjct: 75 VAAAGGIATNEYLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFDQGIPLLDPAASS 134
Query: 110 SYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTF-----KNTDE 164
+YA +PC + CR P+ C C+Y Y T ++T++ TF +N D
Sbjct: 135 TYAALPCGAPRCRALPFTSCGG--RSCVYVYHYGDKSVTVGKIATDRFTFGDNGRRNGDG 192
Query: 165 STIHVQDVVFGCGFSTNRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLH 221
S + + FGCG F+ +GI G G GR SL SQLN +SFSYC S+ D
Sbjct: 193 SLPATRRLTFGCGHFNKGVFQSNETGIAGFGRGRWSLPSQLNATSFSYCFTSMFDSKSSI 252
Query: 222 NKLILGDGAIID-----EGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDS 273
L A+ E TPL Y+++L+ ISV L + F+
Sbjct: 253 VTLGGAPAALYSHAHSGEVRTTPLFKNPSQPSLYFLSLKGISVGKTRLPVPETKFR---- 308
Query: 274 GGGVMIDSGTDVTWLVKEAYEALRDE 299
+IDSG +T L +E YEA++ E
Sbjct: 309 --STIIDSGASITTLPEEVYEAVKAE 332
>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
Length = 500
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 116/370 (31%), Positives = 161/370 (43%), Gaps = 41/370 (11%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ I +G P P +DTGS ++W+ C PCR C Q G +F P S SY V C +
Sbjct: 147 YFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDPRASHSYGAVDCAAPL 206
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
CR C + C+Y Y G T+ +TE LTF S V V GCG
Sbjct: 207 CRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFA----SGARVPRVALGCGHDN 262
Query: 181 NRNFKFSGIFGLGI-GRSSLVSQLN----SSFSYCI----GSLHDPDYLHNKLILGDGAI 231
F + G S SQ++ SFSYC+ S + + G GA+
Sbjct: 263 EGLFVAAAGLLGLGRGSLSFPSQISRRFGRSFSYCLVDRTSSSASATSRSSTVTFGSGAV 322
Query: 232 IDEGDA--TPL---QFIDGHYYITLEAISVDG-RMLDINPNIFKRDDSG--GGVMIDSGT 283
A TP+ ++ YY+ L ISV G R+ + + + D S GGV++DSGT
Sbjct: 323 GPSAAASFTPMVKNPRMETFYYVQLMGISVGGARVPGVAVSDLRLDPSTGRGGVIVDSGT 382
Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMR--SYSWPDKLCYHGIMSSDLKGF-----PTVRF 336
VT L + AY ALRD G ++ +S D CY DL G PTV
Sbjct: 383 SVTRLARPAYAALRDAFRAAAAGLRLSPGGFSLFDT-CY------DLSGLKVVKVPTVSM 435
Query: 337 HFRGGAKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGR 395
HF GGA+ AL ++ + FC A A + S+IG + QQ + V +D
Sbjct: 436 HFAGGAEAALPPENYLIPVDSRGTFCFAF--AGTDG---GVSIIGNIQQQGFRVVFDGDG 490
Query: 396 KQKTFQRMDC 405
++ F C
Sbjct: 491 QRLGFVPKGC 500
>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
Length = 494
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 133/444 (29%), Positives = 193/444 (43%), Gaps = 56/444 (12%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSH---------NTYQAQIL 51
L+HRDS N A + R + R A++ K ++ +T + +
Sbjct: 68 LLHRDSFAV-----NATAAELLARRLQRDELRAAWIISKAAANGTPPPVVGLSTGRGLVA 122
Query: 52 P--SNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSS 109
P S ++ + I++G P V A+DT S L W+ C PCR C PQ G +F P S+
Sbjct: 123 PVVSRAPTSGEYMAKIAVGTPAVQALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHST 182
Query: 110 SYAYVPCDSEHCRYFPYARCS-AYKHRCIYTQLYLIG-PETSVFVS---TEQLTFKNTDE 164
SY + D+ C+ + A + CIYT Y G TS V E LTF
Sbjct: 183 SYGEMNYDAPDCQALGRSGGGDAKRGTCIYTVQYGDGHGSTSTSVGDLVEETLTFAG--- 239
Query: 165 STIHVQDVVFGCGFSTNRNF--KFSGIFGLGIGRSSLVSQL-----NSSFSYC-IGSLHD 216
+ + GCG F +GI GLG G+ S+ Q+ N+SFSYC + +
Sbjct: 240 -GVRQAYLSIGCGHDNKGLFGAPAAGILGLGRGQISIPHQIAFLGYNASFSYCLVDFISG 298
Query: 217 PDYLHNKLILGDGAIIDEGDA--TPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRD 271
P + L G GA+ A TP Q + YY+ L +SV G + P + +RD
Sbjct: 299 PGSPSSTLTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRV---PGVTERD 355
Query: 272 ------DSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL---CYHG 322
GGV++DSGT VT L + AY A RD S P L CY
Sbjct: 356 LQLDPYTGRGGVILDSGTTVTRLARPAYVAFRDAFRAAATSLGQVSTGGPSGLFDTCYTV 415
Query: 323 IMSSDLKGFPTVRFHFRGGAKLALE-KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGM 381
+ +K P V HF GG +++L+ K+ + C A A +R V S+IG
Sbjct: 416 GGRAGVK-VPAVSMHFAGGVEVSLQPKNYLIPVDSRGTVCFAF--AGTGDRSV--SVIGN 470
Query: 382 MAQQFYNVGYDIGRKQKTFQRMDC 405
+ QQ + V YD+ ++ F +C
Sbjct: 471 ILQQGFRVVYDLAGQRVGFAPNNC 494
>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 384
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 110/362 (30%), Positives = 165/362 (45%), Gaps = 27/362 (7%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
N + + +++G PP +DTGS L WV C PCR C Q G F PS+S S+ C
Sbjct: 36 NGEYLMTLTLGSPPQSFDVIVDTGSDLNWVQCLPCRVCYQQPGPKFDPSKSRSFRKAACT 95
Query: 118 SEHCRY--FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
C P C+A + C Y Y T+ ++ E ++ N T V + FG
Sbjct: 96 DNLCNVSALPLKACAA--NVCQYQYTYGDQSNTNGDLAFETISLNN-GAGTQSVPNFAFG 152
Query: 176 CGFSTNRNFK-FSGIFGLGIGRSSLVSQLN----SSFSYCIGSLHDPDYLHNKLILGDGA 230
CG F +G+ GLG G SL SQL+ + FSYC+ SL+ + L G A
Sbjct: 153 CGTQNLGTFAGAAGLVGLGQGPLSLNSQLSHTFANKFSYCLVSLNSLS--ASPLTFGSIA 210
Query: 231 IIDEGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSG--GGVMIDSGTDV 285
T + H YY+ L +I V G+ L++ P++F D S GG +IDSGT +
Sbjct: 211 AAANIQYTSIVVNARHPTYYYVQLNSIEVGGQPLNLAPSVFAIDQSTGRGGTIIDSGTTI 270
Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLA 345
T L AY A+ + ++ ++ LC++ I P + F F+ GA
Sbjct: 271 TMLTLPAYSAVLRAYESFVNYPRLDGSAYGLDLCFN-IAGVSNPSVPDMVFKFQ-GADFQ 328
Query: 346 LEKDSMFYQPRPDA--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
+ +++F A C+A+ + FS+IG + QQ + V YD+ K+ F
Sbjct: 329 MRGENLFVLVDTSATTLCLAMGGSQ------GFSIIGNIQQQNHLVVYDLEAKKIGFATA 382
Query: 404 DC 405
DC
Sbjct: 383 DC 384
>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
Length = 475
Score = 128 bits (322), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 131/436 (30%), Positives = 186/436 (42%), Gaps = 55/436 (12%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARL----AYLQEKIRSHNTYQAQI--LPSN 54
+ HR S NN V+ + + AR+ + L +K+ +++ Q+Q LP+
Sbjct: 65 VTHRHGTCSRLNNGKATSPDHVEI-LRLDQARVNSIHSKLSKKLTTNHVSQSQSTDLPAK 123
Query: 55 D---ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSS 110
D + + + V + +G P DTGS L W C PC R C Q IF PS+S+S
Sbjct: 124 DGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTS 183
Query: 111 YAYVPCDSEHCRYFPYA-----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES 165
Y V C S C A CSA CIY Y + F++ ++ T ++D
Sbjct: 184 YYNVSCSSAACGSLSSATGNAGSCSA--SNCIYGIQYGDQSFSVGFLAKDKFTLTSSDV- 240
Query: 166 TIHVQDVVFGCGFSTNRNFKFSGIFG-LGIGRSSL------VSQLNSSFSYCIGSLHDPD 218
V FGCG N F+G+ G LG+GR L + N FSYC+ S
Sbjct: 241 ---FDGVYFGCG--ENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCLPS--SAS 293
Query: 219 YLHNKLILGDGAIIDEGDATPLQFI-DGH--YYITLEAISVDGRMLDINPNIFKRDDSGG 275
Y + L G I TP+ I DG Y + + AI+V G+ L I +F S
Sbjct: 294 YTGH-LTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVF----STP 348
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF---- 331
G +IDSGT +T L +AY ALR ++ S C+ DL GF
Sbjct: 349 GALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCF------DLSGFKTVT 402
Query: 332 -PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
P V F F GGA + L +FY + C+A + N+ N ++ G + QQ V
Sbjct: 403 IPKVAFSFSGGAVVELGSKGIFYAFKISQVCLAF---AGNSDDSNAAIFGNVQQQTLEVV 459
Query: 391 YDIGRKQKTFQRMDCE 406
YD + F C
Sbjct: 460 YDGAGGRVGFAPNGCS 475
>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
Length = 487
Score = 128 bits (322), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 113/381 (29%), Positives = 154/381 (40%), Gaps = 50/381 (13%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD--CSPQLGTIFYPSRSSSYAYVPCDS 118
+ V I IG PP DTGS L WV C PC D C PQ +F PS+SS+Y VPC +
Sbjct: 122 YVVTIGIGTPPRNFTVLFDTGSDLTWVQCLPCPDSSCYPQQEPLFDPSKSSTYVDVPCSA 181
Query: 119 EHCRY--FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
C RC A C Y+ Y ET ++ E T VVFGC
Sbjct: 182 PECHIGGVQQTRCGATS--CEYSVKYGDESETHGSLAEETFTLSPPSPLAPAATGVVFGC 239
Query: 177 GFS-----TNRNFKFSGIFGLGIGRSSLVSQLNSS-------FSYCIGSLHDPDYLHNKL 224
+ +G+ GLG G SS++SQ S FSYC L L
Sbjct: 240 SHEYISVFNDTGMGVAGLLGLGRGDSSILSQTRRSINSGGGVFSYC---LPPRGSSTGYL 296
Query: 225 ILGDGAIIDEGDATPLQF---------IDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
+G GA + + L F + Y + L +SV+G +DI + F
Sbjct: 297 TIGGGAAAPQQQYSNLSFTPLITTISQLRSAYVVNLAGVSVNGAAVDIPASAFSL----- 351
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL--CYHGIMSSDLKGFPT 333
G +IDSGT VT + AY LRDE + + +M L CY + D+ P
Sbjct: 352 GAVIDSGTVVTHMPAAAYYPLRDEFRLHMGSYKMLPEGSMKLLDTCYD-VTGQDVVTAPR 410
Query: 334 VRFHFRGGAKLALEKDS-MFYQPRPDA-------FCMAVNPASINNRYVNFSLIGMMAQQ 385
V F GGA++ ++ + P D C+A P ++G M Q+
Sbjct: 411 VALEFGGGARIDVDASGILLVLPAEDGSGQSLTLACLAFLP----TNSAGLVIVGNMQQR 466
Query: 386 FYNVGYDIGRKQKTFQRMDCE 406
YNV +D+ + F C
Sbjct: 467 AYNVVFDVDGGRIGFGPNGCS 487
>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
Length = 485
Score = 128 bits (322), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 131/438 (29%), Positives = 188/438 (42%), Gaps = 54/438 (12%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRS-----HNTYQAQILPSND 55
++HRD+ + E HR+QR R A + E + A ++
Sbjct: 69 VVHRDT-FAVNATAGELLKHRLQRDKR----RAARISEAAGAGGGNGRKGVAAPVVSGLA 123
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
+ ++ I +G P +DTGS ++WV C PCR C Q G +F P RSSSY V
Sbjct: 124 QGSGEYFTKIGVGTPATQALMVLDTGSDVVWVQCAPCRRCYEQSGPVFDPRRSSSYGAVG 183
Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
C + CR C + C+Y Y G T+ TE LTF V V G
Sbjct: 184 CGAALCRRLDSGGCDLRRGACMYQVAYGDGSVTAGDFVTETLTFAG----GARVARVALG 239
Query: 176 CGFSTNRNF-KFSGIFGLGIGRSSLVSQLN----SSFSYCI------GSLHDP-DYLHNK 223
CG F +G+ GLG G S +Q++ SFSYC+ G+ P + +
Sbjct: 240 CGHDNEGLFVAAAGLLGLGRGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSST 299
Query: 224 LILGDGAI-IDEGDATPL---QFIDGHYYITLEAISVDG-RMLDINPNIFKRDDSG--GG 276
+ G G++ TP+ ++ YY+ L ISV G R+ + + + D S GG
Sbjct: 300 VSFGAGSVGASSASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGG 359
Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS---YSWPDKLCYHGIMSSDLKG--- 330
V++DSGT VT L + +Y ALRD G S +S D CY DL G
Sbjct: 360 VIVDSGTSVTRLARASYSALRDAFRAAAAGGLRLSPGGFSLFDT-CY------DLGGRRV 412
Query: 331 --FPTVRFHFRGGAKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFY 387
PTV HF GGA+ AL ++ + FC A A + S+IG + QQ +
Sbjct: 413 VKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAF--AGTDG---GVSIIGNIQQQGF 467
Query: 388 NVGYDIGRKQKTFQRMDC 405
V +D ++ F C
Sbjct: 468 RVVFDGDGQRVGFAPKGC 485
>gi|242051593|ref|XP_002454942.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
gi|241926917|gb|EES00062.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
Length = 431
Score = 128 bits (321), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 117/418 (27%), Positives = 191/418 (45%), Gaps = 42/418 (10%)
Query: 4 RDSILSPYNNPNENPAHRV-QRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFY 62
R + PY + P H + +R S AR+A L+ ++ + +P IS+ +
Sbjct: 39 RAELHHPYAG-SSLPVHDMWRRSARASKARVARLEARLTGDMS-----VPLARISDEGYT 92
Query: 63 VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR 122
V I IG PP DT S L W C D + Q+ +F P++SSS+A+V C S+ C
Sbjct: 93 VTIGIGTPPQLHTLIADTASDLTWTQCNLFNDTAKQVEPLFDPAKSSSFAFVTCSSKLCT 152
Query: 123 Y--FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHV-QDVVFGCGFS 179
RCS R +Y + + E + ++ E T + ++ H+ FGCG
Sbjct: 153 EDNPGTKRCSNKTCRYVYPYVSV---EAAGVLAYESFTLSDNNQ---HICMSFGFGCGAL 206
Query: 180 TNRN-FKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDA 237
T+ N SGI G+ S+VSQL FSYC+ D + L G A +
Sbjct: 207 TDGNLLGASGILGMSPAILSMVSQLAIPKFSYCLTPYTDRK--SSPLFFGAWADLGRYKT 264
Query: 238 T-PLQ-FIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEA 295
T P+Q + +YY+ L +S+ R LD+ F GG ++D G V L + A+ A
Sbjct: 265 TGPIQKSLTFYYYVPLVGLSLGTRRLDVPAATFALKQ--GGTVVDLGCTVGQLAEPAFTA 322
Query: 296 LRDEVM----IRLEGEQMRSYSWPDKLCY---HGIMSSDLKGFPTVRFHFRGGAKLALEK 348
L++ V+ + L ++ Y K+C+ G+ ++ P V +F GGA + L +
Sbjct: 323 LKEAVLHTLNLPLTNRTVKDY----KVCFALPSGVAMGAVQTPPLV-LYFDGGADMVLPR 377
Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
D+ F +P C+A+ P S+IG + QQ +++ +D+ + F C+
Sbjct: 378 DNYFQEPTAGLMCLALVPGG------GMSIIGNVQQQNFHLLFDVHDSKFLFAPTICD 429
>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
Length = 446
Score = 128 bits (321), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 131/436 (30%), Positives = 184/436 (42%), Gaps = 55/436 (12%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARL----AYLQEKIRSHNTYQAQI--LPSN 54
+ HR S NN V+ + + AR+ + L +K+ + + +++ LP+
Sbjct: 36 VTHRHGTCSRLNNGKATSPDHVEI-LRLDQARVNSIHSKLSKKLATDHVSESKSTDLPAK 94
Query: 55 D---ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSS 110
D + + + V + +G P DTGS L W C PC R C Q IF PS+S+S
Sbjct: 95 DGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTS 154
Query: 111 YAYVPCDSEHCRYFPYA-----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES 165
Y V C S C A CSA CIY Y + F++ E+ T N+D
Sbjct: 155 YYNVSCSSAACGSLSSATGNAGSCSA--SNCIYGIQYGDQSFSVGFLAKEKFTLTNSDV- 211
Query: 166 TIHVQDVVFGCGFSTNRNFKFSGIFG-LGIGRSSL------VSQLNSSFSYCIGSLHDPD 218
V FGCG N F+G+ G LG+GR L + N FSYC+ S
Sbjct: 212 ---FDGVYFGCG--ENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCLPS--SAS 264
Query: 219 YLHNKLILGDGAIIDEGDATPLQFI-DG--HYYITLEAISVDGRMLDINPNIFKRDDSGG 275
Y L G I TP+ I DG Y + + AI+V G+ L I +F S
Sbjct: 265 YT-GHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVF----STP 319
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF---- 331
G +IDSGT +T L +AY ALR ++ S C+ DL GF
Sbjct: 320 GALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCF------DLSGFKTVT 373
Query: 332 -PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
P V F F GGA + L +FY + C+A + N+ N ++ G + QQ V
Sbjct: 374 IPKVAFSFSGGAVVELGSKGIFYVFKISQVCLAF---AGNSDDSNAAIFGNVQQQTLEVV 430
Query: 391 YDIGRKQKTFQRMDCE 406
YD + F C
Sbjct: 431 YDGAGGRVGFAPNGCS 446
>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 128 bits (321), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 117/370 (31%), Positives = 178/370 (48%), Gaps = 25/370 (6%)
Query: 44 NTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIF 103
N Q ++ + +++ + IG+PP + +DTGS + W+ C PC +C Q IF
Sbjct: 132 NALQGPVVSGTSQGSGEYFLRVGIGKPPSQAYVVLDTGSDVSWIQCAPCSECYQQSDPIF 191
Query: 104 YPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD 163
P S+SY+ + CD+ C+ + C C+Y Y G T +TE +T
Sbjct: 192 DPVSSNSYSPIRCDAPQCKSLDLSEC--RNGTCLYEVSYGDGSYTVGEFATETVTL---- 245
Query: 164 ESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLH 221
T V++V GCG + F +G+ GLG G+ S +Q+N +SFSYC+ + D D +
Sbjct: 246 -GTAAVENVAIGCGHNNEGLFVGAAGLLGLGGGKLSFPAQVNATSFSYCLVN-RDSDAVS 303
Query: 222 NKLILGDGAIIDEGDATPLQF---IDGHYYITLEAISVDGRMLDINPNIFKRD-DSGGGV 277
+ + + PL+ +D YY+ L+ ISV G L I +IF+ D GGG+
Sbjct: 304 T--LEFNSPLPRNVVTAPLRRNPELDTFYYLGLKGISVGGEALPIPESIFEVDAIGGGGI 361
Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRF 336
+IDSGT VT L E Y+ALRD + +G + S D CY + S + PTV F
Sbjct: 362 IIDSGTAVTRLRSEVYDALRDAFVKGAKGIPKANGVSLFDT-CYD-LSSRESVQVPTVSF 419
Query: 337 HFRGGAKLAL-EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGR 395
HF G +L L ++ + FC A P + + S++G + QQ VG+DI
Sbjct: 420 HFPEGRELPLPARNYLIPVDSVGTFCFAFAPTT-----SSLSIMGNVQQQGTRVGFDIAN 474
Query: 396 KQKTFQRMDC 405
F C
Sbjct: 475 SLVGFSADSC 484
>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 474
Score = 127 bits (320), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 130/436 (29%), Positives = 184/436 (42%), Gaps = 55/436 (12%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARL----AYLQEKIRSHNTYQAQI--LPSN 54
+ HR S NN V+ + + AR+ + L +K+ + + +++ LP+
Sbjct: 64 VTHRHGTCSRLNNGKATSPDHVEI-LRLDQARVNSIHSKLSKKLATDHVSESKSTDLPAK 122
Query: 55 D---ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSS 110
D + + + V + +G P DTGS L W C PC R C Q IF PS+S+S
Sbjct: 123 DGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTS 182
Query: 111 YAYVPCDSEHCRYFPYA-----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES 165
Y V C S C A CSA CIY Y + F++ E+ T N+D
Sbjct: 183 YYNVSCSSAACGSLSSATGNAGSCSA--SNCIYGIQYGDQSFSVGFLAKEKFTLTNSDV- 239
Query: 166 TIHVQDVVFGCGFSTNRNFKFSGIFG-LGIGRSSL------VSQLNSSFSYCIGSLHDPD 218
V FGCG N F+G+ G LG+GR L + N FSYC+ S
Sbjct: 240 ---FDGVYFGCG--ENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCLPS--SAS 292
Query: 219 YLHNKLILGDGAIIDEGDATPLQFI-DGH--YYITLEAISVDGRMLDINPNIFKRDDSGG 275
Y + L G I TP+ I DG Y + + AI+V G+ L I +F
Sbjct: 293 YTGH-LTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTP---- 347
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF---- 331
G +IDSGT +T L +AY ALR ++ S C+ DL GF
Sbjct: 348 GALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCF------DLSGFKTVT 401
Query: 332 -PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
P V F F GGA + L +FY + C+A + N+ N ++ G + QQ V
Sbjct: 402 IPKVAFSFSGGAVVELGSKGIFYVFKISQVCLAF---AGNSDDSNAAIFGNVQQQTLEVV 458
Query: 391 YDIGRKQKTFQRMDCE 406
YD + F C
Sbjct: 459 YDGAGGRVGFAPNGCS 474
>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 711
Score = 127 bits (320), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 103/363 (28%), Positives = 160/363 (44%), Gaps = 42/363 (11%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
NS++ + + +G PP +DTGS + W C PC C Q IF PS+SS+
Sbjct: 377 NSVYLMKLQVGTPPFEIEAVIDTGSEITWTQCLPCVHCYKQNAPIFDPSKSST------- 429
Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
F RC + H C Y Y T ++T+ +T +T + + + GCG
Sbjct: 430 ------FKEKRC--HDHSCPYEVDYFDKTYTKGTLATDTVTIHSTSGEPFVMAETIIGCG 481
Query: 178 FSTNRNFK--FSGIFGLGIGRSSLVSQLNSSF----SYCIGSLHDPDYLHNKLILGDGAI 231
N F+ F G GL G SL++Q+ + SYC +K+ G AI
Sbjct: 482 -RNNSWFRPSFEGFVGLNWGPLSLITQMGGEYPGLMSYCFAGNGT-----SKINFGTNAI 535
Query: 232 IDEGD-ATPLQFID----GHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVT 286
+ G + F+ G YY+ L+A+SV ++ F + G ++IDSGT +T
Sbjct: 536 VGGGGVVSTTMFVTTARPGFYYLNLDAVSVGDTRIETLGTPFHALE--GNIVIDSGTTLT 593
Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
+ + +R V + + D LCY+ S+ + FP + HF GGA L L
Sbjct: 594 YFPESYCNLVRQAVEHVVPAVPAADPTGNDLLCYY---SNTTEIFPVITMHFSGGADLVL 650
Query: 347 EKDSMFYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+K +MF + FC+A+ I N ++ G AQ + VGYD +F+ +C
Sbjct: 651 DKYNMFMESYSGGLFCLAI----ICNNPTQEAIFGNRAQNNFLVGYDSSSLLVSFKPTNC 706
Query: 406 EVL 408
L
Sbjct: 707 SAL 709
Score = 124 bits (311), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 98/337 (29%), Positives = 149/337 (44%), Gaps = 48/337 (14%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ + + IG PP +DTGS L+W C PC C Q IF PS+SS+
Sbjct: 65 YLMKLQIGTPPFEVEAVLDTGSELIWTQCLPCLHCYDQKAPIFDPSKSST---------- 114
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
F RC+ H C Y +Y T ++TE +T +T + + + GC
Sbjct: 115 ---FKETRCNTPDHSCPYKLVYDDKSYTQGTLATETVTIHSTSGVPFVMPETIIGCS-RN 170
Query: 181 NRNFKF----SGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGD 236
N F SGI GL G SL+SQ+ ++ GDG +
Sbjct: 171 NSGSGFRPSSSGIVGLSRGSLSLISQMGGAYP------------------GDGVVSTTMF 212
Query: 237 ATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEAL 296
A + G YY+ L+A+SV ++ F + G ++IDSGT +T+ +
Sbjct: 213 AKTAK--RGQYYLNLDAVSVGDTRIETVGTPFHALN--GNIVIDSGTPLTYFPVSYCNLV 268
Query: 297 RDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQ-P 355
R V + +++ S D LCY+ S+ ++ FP + HF GGA L L+K +M+ +
Sbjct: 269 RKAVERVVTADRVVDPSRNDMLCYY---SNTIEIFPVITVHFSGGADLVLDKYNMYMELN 325
Query: 356 RPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
R FC+A+ I N ++ G AQ + VGYD
Sbjct: 326 RGGVFCLAI----ICNNPTQVAIFGNRAQNNFLVGYD 358
>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
Length = 448
Score = 127 bits (320), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 98/330 (29%), Positives = 146/330 (44%), Gaps = 36/330 (10%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ + SIG+PP+ + +DTGS L+WV C PC C+P ++ P+RS S +PC S+
Sbjct: 87 YIMQFSIGEPPLLIWAEVDTGSDLMWVKCSPCNGCNPPPSPLYDPARSRSSGKLPCSSQL 146
Query: 121 CRYFPYAR-----CSAYKHRCIYTQLYLIGPE--TSVFVSTEQLTFKN---TDESTIHVQ 170
C+ R CS C Y Y + T + TE TF + + +
Sbjct: 147 CQALGRGRIISDQCSDDPPLCGYHYAYGHSGDHSTQGVLGTETFTFGDGYVANNVSFGRS 206
Query: 171 DVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNKLILGDG 229
D + G F +G+ GLG G SLVSQL + F+YC+ + DP+ +++ ++ G
Sbjct: 207 DTIDGSQFGGT-----AGLVGLGRGHLSLVSQLGAGRFAYCLAA--DPN-VYSTILFGSL 258
Query: 230 AIID--EGDATPLQFI-------DGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMI 279
A +D GD + + D HYY+ L+ ISV G L I F D GGV
Sbjct: 259 AALDTSAGDVSSTPLVTNPKPDRDTHYYVNLQGISVGGSRLPIKDGTFAINSDGSGGVFF 318
Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFR 339
DSG T L AY+ +R + + Q Y D C+ + P + HF
Sbjct: 319 DSGAIDTSLKDAAYQVVRQAITSEI---QRLGYDAGDDTCFVAANQQAVAQMPPLVLHFD 375
Query: 340 GGAKLALEKDSMFYQ----PRPDAFCMAVN 365
GA ++L + P CMA+
Sbjct: 376 DGADMSLNGRNYLKTSTKGPSEVLVCMAIK 405
>gi|20160862|dbj|BAB89801.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 488
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 114/366 (31%), Positives = 162/366 (44%), Gaps = 45/366 (12%)
Query: 57 SNSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
+N+ YV + IG PP A+D S L+W C +P F P RS++ A VP
Sbjct: 95 TNAGMYVFSYGIGTPPQQVSGALDISSDLVWTAC---GATAP-----FNPVRSTTVADVP 146
Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPE-TSVFVSTEQLTFKNTDESTIHVQDVVF 174
C + C+ F C A C YT +Y G T+ + TE TF +T + VVF
Sbjct: 147 CTDDACQQFAPQTCGAGASECAYTYMYGGGAANTTGLLGTEAFTFGDT-----RIDGVVF 201
Query: 175 GCGFSTNRNFK-FSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAII 232
GCG +F SG+ GLG G SLVSQL FSY D + ++ GD
Sbjct: 202 GCGLKNVGDFSGVSGVIGLGRGNLSLVSQLQVDRFSYHFAPDDSVD-TQSFILFGD---- 256
Query: 233 DEGDATP---------LQFIDGH---YYITLEAISVDGRMLDINPNIF--KRDDSGGGVM 278
DATP L D + YY+ L I VDG+ L I F + D GGV
Sbjct: 257 ---DATPQTSHTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGGVF 313
Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHF 338
+ VT L + AY+ LR V ++ + + LCY G + K P++ F
Sbjct: 314 LSITDLVTVLEEAAYKPLRQAVASKIGLPAVNGSALGLDLCYTGESLAKAK-VPSMALVF 372
Query: 339 RGGAKLALEKDSMFYQPRPDAF-CMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
GGA + LE + FY C+ + P+S + S++G + Q ++ YDI +
Sbjct: 373 AGGAVMELELGNYFYMDSTTGLACLTILPSSAGDG----SVLGSLIQVGTHMMYDINGSK 428
Query: 398 KTFQRM 403
F+ +
Sbjct: 429 LVFESL 434
>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 116/412 (28%), Positives = 176/412 (42%), Gaps = 60/412 (14%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSH-------NTYQAQILPS 53
++HRD + + N +++ HR+ + R+A L ++ S + + ++
Sbjct: 137 VVHRDQL--SFGNSDDH-RHRLDGRLKRDAKRVASLIRRLSSGGGGSYRVDDFGTDVISG 193
Query: 54 NDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAY 113
+ + ++V I +G PP Q+ +D+GS ++WV C PC C Q +F P+ S+S+
Sbjct: 194 MEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADSASFTG 253
Query: 114 VPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
V C S C A C A RC Y Y G T ++ E LTF T V+ V
Sbjct: 254 VSCSSSVCDRLENAGCHA--GRCRYEVSYGDGSYTKGTLALETLTFGRT-----MVRSVA 306
Query: 174 FGCGFSTNRNFKFSGIFGLGIGRS-SLVSQL----NSSFSYCIGSLHDPDYLHNKLILGD 228
GCG F + G S S V QL +FSYC+ S + N
Sbjct: 307 IGCGHRNRGMFVGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSAAWVPLVRNPR---- 362
Query: 229 GAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTW 287
YYI L + V G + I+ +F+ + G GGV++D+GT VT
Sbjct: 363 --------------APSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMDTGTAVTR 408
Query: 288 LVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----PTVRFHFRGGA 342
L AY+A RD + + + CY DL GF PTV F+F GG
Sbjct: 409 LPTLAYQAFRDAFLAQTANLPRATGVAIFDTCY------DLLGFVSVRVPTVSFYFSGGP 462
Query: 343 KLALEKDSMFYQPRPDA--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
L L + F P DA FC A P++ S++G + Q+ + +D
Sbjct: 463 ILTLPARN-FLIPMDDAGTFCFAFAPST-----SGLSILGNIQQEGIQISFD 508
>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
Length = 465
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 141/446 (31%), Positives = 187/446 (41%), Gaps = 68/446 (15%)
Query: 1 LIHRDSILSPYNNPNENP--AHRVQRGINISIARLAYLQEKIRSHNTYQAQI-------- 50
L+HR +P P A R++R AR Y+ K T +
Sbjct: 47 LVHRHGPCAPSAASGGKPSLAERLRR----DRARANYIVTKAAGGRTAATAVSDAVGGGG 102
Query: 51 --LPS--NDISNSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHCYPC--RDCSPQLGTIF 103
+P+ D +SL YV + IG P V Q +DTGS L WV C PC +C Q +F
Sbjct: 103 TSIPTFLGDSVDSLEYVVTLGIGTPAVQQIVLIDTGSDLSWVQCKPCGAGECYAQKDPLF 162
Query: 104 YPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLI--GPE------TSVFVSTE 155
PS SSSYA VPCDS+ CR AY H C L G E T+ STE
Sbjct: 163 DPSSSSSYASVPCDSDACRKL---AAGAYGHGCTSGAAALCEYGIEYGNRATTTGVYSTE 219
Query: 156 QLTFKNTDESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNSS----FSYC 210
LT K + V D FGCG + + KF G+ GLG SLVSQ +S FSYC
Sbjct: 220 TLTLK----PGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFSYC 275
Query: 211 IGSLHDPDYLHNKLILGDGAIIDEGDA------TPLQFIDG---HYYITLEAISVDGRML 261
L L LG A TP++ I Y +TL ISV G L
Sbjct: 276 ---LPPTSGGAGFLALGAPNSSSSSTAAAGFLFTPMRRIPSVPTFYVVTLTGISVGGAPL 332
Query: 262 DINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL--C 319
+ P+ F G++IDSGT +T L AY ALR + ++ S L C
Sbjct: 333 AVPPSAFSS-----GMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGAVLDTC 387
Query: 320 YHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLI 379
Y +++ PT+ F GGA + L + C+A A ++ +I
Sbjct: 388 YDFTGHTNVT-VPTIALTFSGGATIDLATPAGVLVDG----CLAFAGAGTDD---TIGII 439
Query: 380 GMMAQQFYNVGYDIGRKQKTFQRMDC 405
G + Q+ + V YD G+ F+ C
Sbjct: 440 GNVNQRTFEVLYDSGKGTVGFRAGAC 465
>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
Length = 465
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 114/367 (31%), Positives = 162/367 (44%), Gaps = 48/367 (13%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR--DCSPQLGTIFYPSRSSSYAYVPCDS 118
+ V + G P VPQ MDTGS + WV C PC +C PQ +F PS+SS+YA + C +
Sbjct: 125 YMVTLGFGTPSVPQVLLMDTGSDVSWVQCAPCNSTECYPQKDPLFDPSKSSTYAPIACGA 184
Query: 119 EHCRYFP---YARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
+ C C++ +C Y Y G T S E +TF I V+D FG
Sbjct: 185 DACNKLGDHYRNGCTSGGTQCGYRVEYGDGSSTRGVYSNETITFA----PGITVKDFHFG 240
Query: 176 CGFST-NRNFKFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLH-DPDYLHNKLILGDG 229
CG + KF G+ GLG SLV Q S +FSYC+ +L+ + +L +
Sbjct: 241 CGHDQRGPSDKFDGLLGLGGAPESLVVQTASVYGGAFSYCLPALNSEAGFLALGVRPSAA 300
Query: 230 AIIDEGDATPLQFID---GHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVT 286
TP+ + Y + + ISV G+ LDI + F+ GG++IDSGT VT
Sbjct: 301 TNTSAFVFTPMWHLPMDATSYMVNMTGISVGGKPLDIPRSAFR-----GGMLIDSGTIVT 355
Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
L + AY AL + M + D CY+ S++ P V F GGA + L
Sbjct: 356 ELPETAYNALNAALRKAFAAYPMVASEDFDT-CYNFTGYSNVT-VPRVALTFSGGATIDL 413
Query: 347 E-------KDSM-FYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQK 398
+ KD + F + PD V +IG + Q+ V YD G +
Sbjct: 414 DVPNGILVKDCLAFRESGPD---------------VGLGIIGNVNQRTLEVLYDAGHGKV 458
Query: 399 TFQRMDC 405
F+ C
Sbjct: 459 GFRAGAC 465
>gi|21717166|gb|AAM76359.1|AC074196_17 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433306|gb|AAP54835.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|125575546|gb|EAZ16830.1| hypothetical protein OsJ_32301 [Oryza sativa Japonica Group]
Length = 373
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 100/366 (27%), Positives = 159/366 (43%), Gaps = 31/366 (8%)
Query: 57 SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPC 116
S L+ N++IG PP P + +W C PCR C Q +F S SS+Y PC
Sbjct: 24 SQPLYMANLTIGTPPQPASAIIHLAGEFVWTQCSPCRRCFKQDLPLFNRSASSTYRPEPC 83
Query: 117 DSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
+ C P + CS C Y + G +TS T+ S + FGC
Sbjct: 84 GTALCESVPASTCSG-DGVCSYEVETMFG-DTSGIGGTDTFAIGTATAS------LAFGC 135
Query: 177 GFSTN--RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIID 233
+N + SG+ GLG SLV Q+N ++FSYC+ H + L+LG A +
Sbjct: 136 AMDSNIKQLLGASGVVGLGRTPWSLVGQMNATAFSYCLAP-HGAAGKKSALLLGASAKLA 194
Query: 234 EGDA---TPLQFI---DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
G + TPL Y I LE I ++ PN G V++D+ V++
Sbjct: 195 GGKSAATTPLVNTSDDSSDYMIHLEGIKFGDVIIAPPPN-------GSVVLVDTIFGVSF 247
Query: 288 LVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCY-----HGIMSSDLKGFPTVRFHFRGGA 342
LV A++A++ V + + M + + P LC+ +S L P V F+G A
Sbjct: 248 LVDAAFQAIKKAVTVAVGAAPMATPTKPFDLCFPKAAAAAGANSSLP-LPDVVLTFQGAA 306
Query: 343 KLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
L + Y C+A+ +++ N S++G + Q+ + +D+ ++ +F+
Sbjct: 307 ALTVPPSKYMYDAGNGTVCLAMMSSAMLNLTTELSILGRLHQENIHFLFDLDKETLSFEP 366
Query: 403 MDCEVL 408
DC L
Sbjct: 367 ADCSSL 372
>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
Length = 357
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 104/361 (28%), Positives = 156/361 (43%), Gaps = 31/361 (8%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ + IG P + +DTGS + W+ C PC C Q+ I+ PS SSSY V C S
Sbjct: 12 YFARMGIGNPQRSYYLELDTGSDVTWIQCAPCSSCYSQVDPIYDPSNSSSYRRVYCGSAL 71
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C+ Y+ C C Y +Y +S + E +F S+ ++++ FGCG S
Sbjct: 72 CQALDYSACQGMG--CSYRVVYGDSSASSGDLGIE--SFYLGPNSSTAMRNIAFGCGHSN 127
Query: 181 NRNFKFSGIFGLGIGR-----SSLVSQLNSSFSYCIGSLHDP-DYLHNKLILGDGAIIDE 234
+ F+ G S + + + +FSYC+ + + LI G AI
Sbjct: 128 SGLFRGEAGLLGMGGGTLSFFSQIAASIGPAFSYCLVDRYSQLQSRSSPLIFGRTAIPFA 187
Query: 235 GDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVK 290
TPL I+ YY L ISV G L I P F +G GG ++DSGT VT +V
Sbjct: 188 ARFTPLLKNPRINTFYYAVLTGISVGGTPLPIPPAQFALTGNGTGGAILDSGTSVTRVVP 247
Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR-----FHFRGGAKLA 345
AY LRD + C+ + +G PTV+ HF G +
Sbjct: 248 PAYAVLRDAYRAASRNLPPAPGVYLLDTCF------NFQGLPTVQIPSLVLHFDNGVDMV 301
Query: 346 LEKDSMFYQ-PRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
L ++ R FC+A P+S+ S+IG + QQ + +G+D+ R +
Sbjct: 302 LPGGNILIPVDRSGTFCLAFAPSSM-----PISVIGNVQQQTFRIGFDLQRSLIAIAPRE 356
Query: 405 C 405
C
Sbjct: 357 C 357
>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
Length = 359
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 107/366 (29%), Positives = 169/366 (46%), Gaps = 37/366 (10%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLGTIFYPSRSSSYAYVPCDS 118
+ + +SIG PP +DTGS L+W+ C C C TIF+ SSSY +PC+S
Sbjct: 5 YMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKLPCNS 64
Query: 119 EHCRYFPYA----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIH---VQD 171
HC A RC + C Y Y G TS V +++++F++ H
Sbjct: 65 THCSGMSSAGIGPRC---EETCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSFFDG 121
Query: 172 VVFGCGFSTNRNFKFS-GIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLIL 226
+FGC ++ F+ G+ GLG SL+ QL FSYC+ S P + L L
Sbjct: 122 FLFGCARKLKGDWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKSFLFL 181
Query: 227 GDGAIIDEGDATPLQFIDGH------YYITLEAISVDGRMLDINPNIFKRDDSGG----- 275
G A + D + G YY+ L++I++ G + + + S G
Sbjct: 182 GSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITIGGVPVVVYDKESGHNTSVGPFLAN 241
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLK-GFPTV 334
+IDSGT T L YEA+R + ++ + + + D LC++ S D GFP+V
Sbjct: 242 KTVIDSGTTYTLLTPPVYEAMRKSIEEQVILPTLGNSAGLD-LCFNS--SGDTSYGFPSV 298
Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
F+F +L L +++F D C+ S+++ + S+IG M QQ +++ YD+
Sbjct: 299 TFYFANQVQLVLPFENIFQVTSRDVVCL-----SMDSSGGDLSIIGNMQQQNFHILYDLV 353
Query: 395 RKQKTF 400
Q +F
Sbjct: 354 ASQISF 359
>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 516
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 103/364 (28%), Positives = 154/364 (42%), Gaps = 27/364 (7%)
Query: 52 PSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSS 110
P + + V + +G P DTGS WV C PC C Q +F P+RSS+
Sbjct: 170 PGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSST 229
Query: 111 YAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
YA V C + C CS C+Y Y G + F + + LT + D V+
Sbjct: 230 YANVSCAAPACSDLDTRGCSG--GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDA----VK 283
Query: 171 DVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLI 225
FGCG F + +G+ GLG G++SL Q F++C+ + L
Sbjct: 284 GFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTGT---GYLD 340
Query: 226 LGDGAIIDEGDATPLQFIDG--HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGT 283
G G+ TP+ +G YY+ L I V GR+L I ++F G ++DSGT
Sbjct: 341 FGAGSPAARLTTTPMLVDNGPTFYYVGLTGIRVGGRLLYIPQSVFAT----AGTIVDSGT 396
Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL--CYHGIMSSDLKGFPTVRFHFRGG 341
+T L AY +LR + + L CY S + PTV F+GG
Sbjct: 397 VITRLPPAAYSSLRSAFAAAMSARGYKKAPAVSLLDTCYDFAGMSQVA-IPTVSLLFQGG 455
Query: 342 AKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQ 401
A+L ++ + Y C+A + N + ++G + + V YDIG+K +F
Sbjct: 456 ARLDVDASGIMYAASASQVCLAF---AANEDGGDVGIVGNTQLKTFGVAYDIGKKVVSFS 512
Query: 402 RMDC 405
C
Sbjct: 513 PGAC 516
>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
Length = 373
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 112/373 (30%), Positives = 164/373 (43%), Gaps = 34/373 (9%)
Query: 67 IGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHC----R 122
IG PP +DT S L WV C +CSP F P SSS+ PC S C +
Sbjct: 5 IGTPPREVLLLVDTASELTWVQGTSCTNCSPTKVPPFNPGLSSSFISEPCTSSVCLGRSK 64
Query: 123 YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS-TN 181
+ C+ C + YL G E ++ E + ++ D + + DV+FGC
Sbjct: 65 LGFQSACNRSTGSCSFQVAYLDGSEAYGVIAREIFSLQSWDGAASTLGDVIFGCASKDLQ 124
Query: 182 RNFKF-SGIFGLGIGRSSLVSQLNS--------SFSYCIGSLHDPDYLHNKLILGDGAI- 231
R F SG GL G S +Q+ S FSYC + + +I GD I
Sbjct: 125 RPVDFSSGTLGLNRGSFSFPAQIGSRSKSGLSDRFSYCFPNRAEHLNSSGVIIFGDSGIP 184
Query: 232 ------IDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTD 284
+ P+ I YY+ L+ ISV G +L I + FK D G GG DSGT
Sbjct: 185 AHHFQYLSLEQEPPIASIVDFYYVGLQGISVGGELLHIPRSAFKIDRLGNGGTYFDSGTT 244
Query: 285 VTWLVKEAYEALRDEVMIR-LEGEQMRSYSWPDKLCYHGIMSSD--LKGFPTVRFHFRGG 341
V++LV+ A+ AL + R L + + +LCY + + D L P V HF+
Sbjct: 245 VSFLVEPAHTALVEAFGRRVLHLNRTSGSDFTKELCYD-VAAGDARLPTAPLVTLHFKNN 303
Query: 342 AKLALEKDSMFY----QPRPDAFCMA-VNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
+ L + S++ P+ C+A VN ++ VN +IG QQ Y + +D+ R
Sbjct: 304 VDMELREASVWVPLARTPQVVTICLAFVNAGAVAQGGVN--VIGNYQQQDYLIEHDLERS 361
Query: 397 QKTFQRMDCEVLD 409
+ F +C V+D
Sbjct: 362 RIGFAPANC-VMD 373
>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 492
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 109/352 (30%), Positives = 160/352 (45%), Gaps = 23/352 (6%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ + +GQP P + +DTGS + W+ C PC DC Q IF P+ SSSY + CD++
Sbjct: 157 YFSRVGVGQPSKPFYMVLDTGSDVNWLQCKPCSDCYQQSDPIFDPTASSSYNPLTCDAQQ 216
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C+ + C +C+Y Y G T TE ++F V V GCG
Sbjct: 217 CQDLEMSACR--NGKCLYQVSYGDGSFTVGEYVTETVSF-----GAGSVNRVAIGCGHDN 269
Query: 181 NRNF-KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
F +G+ GLG G SL SQ+ +SFSYC L D D + + + +
Sbjct: 270 EGLFVGSAGLLGLGGGPLSLTSQIKATSFSYC---LVDRDSGKSSTLEFNSPRPGDSVVA 326
Query: 239 PL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYE 294
PL Q ++ YY+ L +SV G ++ + P F D SG GGV++DSGT +T L +AY
Sbjct: 327 PLLKNQKVNTFYYVELTGVSVGGEIVTVPPETFAVDQSGAGGVIVDSGTAITRLRTQAYN 386
Query: 295 ALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL-EKDSMFY 353
++RD + + CY + S PTV FHF G AL K+ +
Sbjct: 387 SVRDAFKRKTSNLRPAEGVALFDTCYD-LSSLQSVRVPTVSFHFSGDRAWALPAKNYLIP 445
Query: 354 QPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+C A P + + S+IG + QQ V +D+ F C
Sbjct: 446 VDGAGTYCFAFAPTT-----SSMSIIGNVQQQGTRVSFDLANSLVGFSPNKC 492
>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
Length = 466
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 124/427 (29%), Positives = 189/427 (44%), Gaps = 47/427 (11%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHN-----TYQAQILPSN- 54
L HR SP + N+ PA +R + R AY++ K A +P+
Sbjct: 65 LHHRHGPCSPVPS-NKMPASLEER-LQRDQLRAAYIKRKFSGAKGGDVEQSDAATVPTTL 122
Query: 55 --DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYA 112
+S + + + IG P V Q +MDTGS + WV C PC C ++ ++F PS SS+Y+
Sbjct: 123 GTSLSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVDSLFDPSASSTYS 182
Query: 113 YVPCDSEHCRYFPYAR----CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIH 168
C S C ++ CS+ +C Y Y+ G T+ S++ LT +
Sbjct: 183 PFSCSSAACVQLSQSQQGNGCSS--SQCQYIVSYVDGSSTTGTYSSDTLTLGSN-----A 235
Query: 169 VQDVVFGCGFSTNRNF--KFSGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYLHN 222
++ FGC S + F + G+ GLG SLVSQ +FSYC+ P +
Sbjct: 236 IKGFQFGCSQSESGGFSDQTDGLMGLGGDAQSLVSQTAGTFGKAFSYCL-----PPTPGS 290
Query: 223 KLILGDGAIIDEG-DATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVM 278
L GA G TP+ I +Y + LEAI V G+ L+I ++F S G VM
Sbjct: 291 SGFLTLGAASRSGFVKTPMLRSTQIPTYYGVLLEAIRVGGQQLNIPTSVF----SAGSVM 346
Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHF 338
DSGT +T L AY AL ++ S C+ S + P+V F
Sbjct: 347 -DSGTVITRLPPTAYSALSSAFKAGMKKYPPAQPSGILDTCFDFSGQSSVS-IPSVALVF 404
Query: 339 RGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQK 398
GGA + L+ + + + D +C+A + N+ + IG + Q+ + V YD+G
Sbjct: 405 SGGAVVNLDFNGIMLE--LDNWCLAF---AANSDDSSLGFIGNVQQRTFEVLYDVGGGAV 459
Query: 399 TFQRMDC 405
F+ C
Sbjct: 460 GFRAGAC 466
>gi|296090179|emb|CBI39998.3| unnamed protein product [Vitis vinifera]
Length = 334
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 98/366 (26%), Positives = 160/366 (43%), Gaps = 62/366 (16%)
Query: 52 PSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSY 111
P +N + + ISIG PP + DTGS L+W C PC C Q +F PS+S+S+
Sbjct: 15 PPVSSNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPSKSTSF 74
Query: 112 AYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
V C+S+ CR ++ + +
Sbjct: 75 KEVSCESQQCRLL---------------------------------------DTPTSILN 95
Query: 172 VVFGCGFSTNRNFKFS--GIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYLHNK 223
+VFGCG + + F + G+FG G SL SQ+ S+ FS C+ + +K
Sbjct: 96 IVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSK 155
Query: 224 LILGDGAIIDEGD--ATPLQFID--GHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
+I G A + D +TPL D +Y++TL+ ISV ++ + + + G V I
Sbjct: 156 IIFGPEAEVSGSDVVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSS--SPMATKGNVFI 213
Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFR 339
D+GT T L ++ Y L V + E ++ +LCY S+ L P + HF
Sbjct: 214 DAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQLCYR---SATLIDGPILTAHFD 270
Query: 340 GGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKT 399
GA + L+ + F P+ +C A+ P + + G Q + +G+D+ K+ +
Sbjct: 271 -GADVQLKPLNTFISPKEGVYCFAMQPIDGDT-----GIFGNFVQMNFLIGFDLDGKKVS 324
Query: 400 FQRMDC 405
F+ +DC
Sbjct: 325 FKAVDC 330
>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
Length = 353
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 110/347 (31%), Positives = 157/347 (45%), Gaps = 25/347 (7%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ I +G P + DTGS + W+ C PCR C Q IF PS SSS+ + C S
Sbjct: 14 YFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDPIFNPSLSSSFKPLACASSI 73
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C CS K++C+Y Y G T STE L+F V+ V GCG +
Sbjct: 74 CGKLKIKGCS-RKNKCMYQVSYGDGSFTVGDFSTETLSFGEH-----AVRSVAMGCGRNN 127
Query: 181 NRNF-KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
F +G+ GLG G S SQ +S FSYC+ + L+ G A+ ++
Sbjct: 128 QGLFHGAAGLLGLGRGPLSFPSQTGTSYASVFSYCLPRRE--SAIAASLVFGPSAVPEKA 185
Query: 236 DAT---PLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKE 291
T P + +D +YY+ L I V G ++I P+ F G GGV++DSGT ++ L
Sbjct: 186 RFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGGVIVDSGTAISRLTTP 245
Query: 292 AYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSM 351
AY ALRD + S D CY + S P V F GGA + L D +
Sbjct: 246 AYTALRDAFRSLVTFPSAPGISLFDT-CYD-LSSMKTATLPAVVLDFDGGASMPLPADGI 303
Query: 352 FYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
+ +C+A P FS+IG + QQ + + D ++Q
Sbjct: 304 LVNVDDEGTYCLAFAP-----EEEAFSIIGNVQQQTFRISIDNQKEQ 345
>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
Length = 466
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 106/363 (29%), Positives = 163/363 (44%), Gaps = 32/363 (8%)
Query: 61 FYVNISIGQPPVPQFTAM-DTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSE 119
++V + +G P V +FT + DTGS L WV C SP G +F P S S+A +PC S+
Sbjct: 116 YFVKLRVGTP-VQEFTLVADTGSDLTWVKCA---GASPP-GRVFRPKTSRSWAPIPCSSD 170
Query: 120 HCRY---FPYARCSAYKHRCIYTQLYLIGPETSV-FVSTEQLTFKNTDESTIHVQDVVFG 175
C+ F A CS+ C Y Y G + V TE T ++DVV G
Sbjct: 171 TCKLDVPFTLANCSSPASPCTYDYRYKEGSAGARGIVGTESATIALPGGKVAQLKDVVLG 230
Query: 176 CGFSTN-RNFKFS-GIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDG 229
C S + ++F+ + G+ LG + S +Q SFSYC+ P L G G
Sbjct: 231 CSSSHDGQSFRSADGVLSLGNAKISFATQAAARFGGSFSYCLVDHLAPRNATGYLAFGPG 290
Query: 230 AIIDEGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVT 286
+ F+D Y + ++AI V G+ LDI ++ D GGV++DSG +T
Sbjct: 291 QVPRTPATQTKLFLDPEMPFYGVKVDAIHVAGKALDIPAEVW--DAKSGGVILDSGNTLT 348
Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG----FPTVRFHFRGGA 342
L AY+A+ + L+G S+ P + CY+ ++ G P + F G A
Sbjct: 349 VLAAPAYKAVVAALSKHLDGVPKVSFP-PFEHCYN--WTARRPGAPEIIPKLAVQFAGSA 405
Query: 343 KLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
+L S +P C+ V + S+IG + QQ + +D+ Q F++
Sbjct: 406 RLEPPAKSYVIDVKPGVKCIGVQ----EGEWPGLSVIGNIMQQEHLWEFDLKNMQVRFKQ 461
Query: 403 MDC 405
+C
Sbjct: 462 SNC 464
>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
Length = 434
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 112/382 (29%), Positives = 172/382 (45%), Gaps = 49/382 (12%)
Query: 54 NDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAY 113
N + + + V+++IG PP P +DTGS L+W C PC C Q F PS SS+ +
Sbjct: 75 NGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSL 134
Query: 114 VPCDSEHCRYFPYARCSAYK----HRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHV 169
CDS C+ P A C + K C+YT Y T+ F+ ++ TF S V
Sbjct: 135 TSCDSTLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGAS---V 191
Query: 170 QDVVFGCGFSTNRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLIL 226
V FGCG N FK +GI G G G SL SQL +FS+C +++ L +L
Sbjct: 192 PGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVNG---LKPSTVL 248
Query: 227 ----------GDGAIIDEGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDS 273
G GA+ +TPL + YY++L+ I+V L + + F +
Sbjct: 249 LDLPADLYKSGRGAV----QSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNG 304
Query: 274 GGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG--- 330
GG +IDSGT +T L Y +RD +++ + + C +S+ L+
Sbjct: 305 TGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFC----LSAPLRAKPY 360
Query: 331 FPTVRFHFRGGAKLALEKDSMFYQPRPDA----FCMAVNPASINNRYVNFSLIGMMAQQF 386
P + HF GA + L +++ ++ DA C+A+ + IG QQ
Sbjct: 361 VPKLVLHFE-GATMDLPRENYVFEVE-DAGSSILCLAIIEGG------EVTTIGNFQQQN 412
Query: 387 YNVGYDIGRKQKTFQRMDCEVL 408
+V YD+ + +F C+ L
Sbjct: 413 MHVLYDLQNSKLSFVPAQCDKL 434
>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
Length = 482
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 110/365 (30%), Positives = 154/365 (42%), Gaps = 37/365 (10%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ V G P +DTGS L W+ C PC DC Q+ IF P +SSSY +PC S
Sbjct: 137 YIVTAGFGTPAKNSLLIIDTGSDLTWIQCKPCADCYSQVDAIFEPKQSSSYKTLPCLSAT 196
Query: 121 CRYFPYARCSAYK---HRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
C + + C+Y Y G + S E LT + Q+ FGCG
Sbjct: 197 CTELITSESNPTPCLLGGCVYEINYGDGSSSQGDFSQETLTLGSDS-----FQNFAFGCG 251
Query: 178 FSTNRNFK-FSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPD----YLHNKLILGD 228
+ FK SG+ GLG S SQ S F+YC+ PD +G
Sbjct: 252 HTNTGLFKGSSGLLGLGQNSLSFPSQSKSKYGGQFAYCL-----PDFGSSTSTGSFSVGK 306
Query: 229 GAIIDEGDATPL--QFI-DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDV 285
G+I TPL F+ Y++ L ISV G L I P + R G ++DSGT +
Sbjct: 307 GSIPASAVFTPLVSNFMYPTFYFVGLNGISVGGDRLSIPPAVLGR----GSTIVDSGTVI 362
Query: 286 TWLVKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKL 344
T L+ +AY AL+ + + +S D CY S ++ PT+ FHF+ A +
Sbjct: 363 TRLLPQAYNALKTSFRSKTRDLPSAKPFSILDT-CYDLSRHSQVR-IPTITFHFQNNADV 420
Query: 345 ALEKDSMF--YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
A+ + Q C+A AS + F++IG QQ V +D G + F
Sbjct: 421 AVSDVGILVPVQNGGSQVCLAFASAS---QMDGFNIIGNFQQQRMRVAFDTGAGRIGFAS 477
Query: 403 MDCEV 407
C
Sbjct: 478 GSCAA 482
>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
Length = 464
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 114/432 (26%), Positives = 187/432 (43%), Gaps = 56/432 (12%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDIS--- 57
L HR SP + E P+H + + R AY+Q K+ S A+ L + ++
Sbjct: 62 LSHRHGPCSPVIS-KEKPSH--EETLRRDQLRAAYIQAKVSSRYNNVAKELQQSAVTIPT 118
Query: 58 -------NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC--RDCSPQLGTIFYPSRS 108
+ + + ++IG P V Q ++DTGS + WV C PC + CS Q +F P+ S
Sbjct: 119 SSGYSLGTTEYVITVTIGTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKDKLFDPAMS 178
Query: 109 SSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIH 168
++Y+ C S C K +C Y Y G T+ ++ L+ ++D
Sbjct: 179 ATYSAFSCGSAQCAQLGDEGNGCLKSQCQYIVKYGDGSNTAGTYGSDTLSLTSSDA---- 234
Query: 169 VQDVVFGCGFSTNRNFKF----SGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYL 220
V+ FGC ++R F G+ GLG SLVSQ ++ FSYC+
Sbjct: 235 VKSFQFGC---SHRAAGFVGELDGLMGLGGDTESLVSQTAATYGKAFSYCLPPPSSSGGG 291
Query: 221 HNKLILGDGAIIDEGDATPL-QF-IDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVM 278
L GA TP+ +F + Y + L+ I+V G ML++ ++F G +
Sbjct: 292 FLTLGAAGGASSSRYSHTPMVRFSVPTFYGVFLQGITVAGTMLNVPASVFS-----GASV 346
Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----PT 333
+DSGT +T L AY+ALR ++ + C+ D GF PT
Sbjct: 347 VDSGTVITQLPPTAYQALRTAFKKEMKAYPSAAPVGSLDTCF------DFSGFNTITVPT 400
Query: 334 VRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
V F GA + L+ + Y A C+A + + + ++G + Q+ + + +D+
Sbjct: 401 VTLTFSRGAAMDLDISGILY-----AGCLAFTATAHDG---DTGILGNVQQRTFEMLFDV 452
Query: 394 GRKQKTFQRMDC 405
G + F+ C
Sbjct: 453 GGRTIGFRSGAC 464
>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
gi|194696366|gb|ACF82267.1| unknown [Zea mays]
gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 411
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 97/302 (32%), Positives = 139/302 (46%), Gaps = 35/302 (11%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD--CSPQLGTIFYPSRSSSYAYVPCDS 118
+ V +S G P VPQ +DTGS + W+ C PC C PQ ++ PS SS+Y+ VPC S
Sbjct: 79 YVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPSHSSTYSAVPCAS 138
Query: 119 EHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
+ C+ Y +C + Y G T S ++LT VQ+ FG
Sbjct: 139 DVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDKLTL----APGAIVQNFYFG 194
Query: 176 CGFSTNR-NFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHD-PDYLHNKLILGDGAIID 233
CG + F G+ GLG R SL ++ FSYC+ S+ P +L LG G
Sbjct: 195 CGHGKHAVRGLFDGVLGLGRLRESLGARYGGVFSYCLPSVSSKPGFLA----LGAGKNPS 250
Query: 234 EGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVK 290
TP+ + G +TL I+V G+ LD+ P+ F GG+++DSGT +T L
Sbjct: 251 GFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFS-----GGMIVDSGTVITGLQS 305
Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----PTVRFHFRGGAKLA 345
AY ALR +E ++ D CY +L G+ P + F GGA +
Sbjct: 306 TAYRALRSAFRKAMEAYRLLPNGDLDT-CY------NLTGYKNVVVPKIALTFTGGATIN 358
Query: 346 LE 347
L+
Sbjct: 359 LD 360
>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
Length = 434
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 112/382 (29%), Positives = 172/382 (45%), Gaps = 49/382 (12%)
Query: 54 NDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAY 113
N + + + V+++IG PP P +DTGS L+W C PC C Q F PS SS+ +
Sbjct: 75 NGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSL 134
Query: 114 VPCDSEHCRYFPYARCSAYK----HRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHV 169
CDS C+ P A C + K C+YT Y T+ F+ ++ TF S V
Sbjct: 135 TSCDSTLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGAS---V 191
Query: 170 QDVVFGCGFSTNRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLIL 226
V FGCG N FK +GI G G G SL SQL +FS+C +++ L +L
Sbjct: 192 PGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVNG---LKPSTVL 248
Query: 227 ----------GDGAIIDEGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDS 273
G GA+ +TPL + YY++L+ I+V L + + F +
Sbjct: 249 LDLPADLYKSGRGAV----QSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFTLKNG 304
Query: 274 GGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG--- 330
GG +IDSGT +T L Y +RD +++ + + C +S+ L+
Sbjct: 305 TGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFC----LSAPLRAKPY 360
Query: 331 FPTVRFHFRGGAKLALEKDSMFYQPRPDA----FCMAVNPASINNRYVNFSLIGMMAQQF 386
P + HF GA + L +++ ++ DA C+A+ + IG QQ
Sbjct: 361 VPKLVLHFE-GATMDLPRENYVFEVE-DAGSSILCLAIIEGG------EVTTIGNFQQQN 412
Query: 387 YNVGYDIGRKQKTFQRMDCEVL 408
+V YD+ + +F C+ L
Sbjct: 413 MHVLYDLQNSKLSFVPAQCDKL 434
>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
Length = 420
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 110/347 (31%), Positives = 156/347 (44%), Gaps = 25/347 (7%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ I +G P + DTGS + W+ C PCR C Q IF PS SSS+ + C S
Sbjct: 81 YFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDPIFNPSLSSSFKPLACASSI 140
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C CS K+ C+Y Y G T STE L+F V+ V GCG +
Sbjct: 141 CGKLKIKGCS-RKNECMYQVSYGDGSFTVGDFSTETLSFGEH-----AVRSVAMGCGRNN 194
Query: 181 NRNF-KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
F +G+ GLG G S SQ +S FSYC+ + L+ G A+ ++
Sbjct: 195 QGLFHGAAGLLGLGRGPLSFPSQTGTSYASVFSYCLPRRE--SAIAASLVFGPSAVPEKA 252
Query: 236 DAT---PLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKE 291
T P + +D +YY+ L I V G ++I P+ F G GGV++DSGT ++ L
Sbjct: 253 RFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGGVIVDSGTAISRLTTP 312
Query: 292 AYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSM 351
AY ALRD + S D CY + S P V F GGA + L D +
Sbjct: 313 AYTALRDAFRSLVTFPSAPGISLFDT-CYD-LSSMKTATLPAVVLDFDGGASMPLPADGI 370
Query: 352 FYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
+ +C+A P FS+IG + QQ + + D ++Q
Sbjct: 371 LVNVDDEGTYCLAFAP-----EEEAFSIIGNVQQQTFRISIDNQKEQ 412
>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 475
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 128/416 (30%), Positives = 183/416 (43%), Gaps = 49/416 (11%)
Query: 1 LIHRDSI--LSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHN-TYQAQILPSNDIS 57
L+HRD + + Y++ R+QR R A L ++ + TY A+ S+ +S
Sbjct: 72 LVHRDKVPTFNTYHDHRTRFNARMQR----DTKRAASLLRRLAAGKPTYAAEAFGSDVVS 127
Query: 58 -----NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYA 112
+ ++V I +G PP Q+ MD+GS ++WV C PC C Q +F P+ SSS++
Sbjct: 128 GMEQGSGEYFVRIGVGSPPRNQYVVMDSGSDIIWVQCEPCTQCYHQSDPVFNPADSSSFS 187
Query: 113 YVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
V C S C + A C ++ RC Y Y G T ++ E +TF T +++V
Sbjct: 188 GVSCASTVCSHVDNAAC--HEGRCRYEVSYGDGSYTKGTLALETITFGRT-----LIRNV 240
Query: 173 VFGCGFSTNRNFKFSGIFGLGIGRS-SLVSQL----NSSFSYCIGSLHDPDYLHNKLILG 227
GCG F + G S V QL +FSYC+ S L G
Sbjct: 241 AIGCGHHNQGMFVGAAGLLGLGGGPMSFVGQLGGQTGGAFSYCLVSRGIES--SGLLEFG 298
Query: 228 DGAIIDEGDATPLQF---IDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGT 283
A+ PL YYI L + V G + I+ ++FK + G GGV++D+GT
Sbjct: 299 REAMPVGAAWVPLIHNPRAQSFYYIGLSGLGVGGLRVSISEDVFKLSELGDGGVVMDTGT 358
Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----PTVRFHF 338
VT L AYEA RD + + S CY DL GF PTV F+F
Sbjct: 359 AVTRLPTVAYEAFRDGFIAQTTNLPRASGVSIFDTCY------DLFGFVSVRVPTVSFYF 412
Query: 339 RGGAKLALEKDSMFYQPRPD--AFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
GG L L + F P D FC A P+S S+IG + Q+ + D
Sbjct: 413 SGGPILTLPARN-FLIPVDDVGTFCFAFAPSS-----SGLSIIGNIQQEGIQISVD 462
>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 124/435 (28%), Positives = 189/435 (43%), Gaps = 62/435 (14%)
Query: 11 YNNPNENPAHRVQRGINISIARLAYLQEKI------RSHNTYQAQILPSNDIS------- 57
+++ ++ A + AR++ LQ +I RS + A L ++
Sbjct: 51 FSSGGKSRAEEAHAVLASDAARVSSLQRRIGSYGLIRSSDAASASKLAQVPVTSGARLRT 110
Query: 58 -NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPC 116
N + V I G+ V +DT S L WV C PC C Q +F PS S SYA VPC
Sbjct: 111 LNYVATVGIGGGEATV----IVDTASELTWVQCEPCDACHDQQEPLFDPSSSPSYAAVPC 166
Query: 117 DSEHCRYFPYAR------CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
+S C A C C YT Y G + ++ ++L+ D +Q
Sbjct: 167 NSSSCDALRVATGMSGQACDDQPAACSYTLSYRDGSYSRGVLAHDRLSLAGED-----IQ 221
Query: 171 DVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVS----QLNSSFSYCIGSLHDPDYLHNKLI 225
VFGCG S F SG+ GLG + SL+S Q FSYC+ L+
Sbjct: 222 GFVFGCGTSNQGPFGGTSGLMGLGRSQLSLISQTMDQFGGVFSYCLPPKESGS--SGSLV 279
Query: 226 LGDGAIIDEGDATPLQF-------IDGHYYIT-LEAISVDGRMLDINPNIFKRDDSGGGV 277
LGD A + ++TP+ + + G +Y+ L I+V G D+ F GG
Sbjct: 280 LGDDASVYR-NSTPIVYTAMVSDPLQGPFYLANLTGITVGGE--DVQSPGFSA-GGGGKA 335
Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRL-EGEQMRSYSWPDKLCYHGIMSSDLKGF----- 331
++DSGT +T LV Y A+R E + +L E Q +S D C+ DL G
Sbjct: 336 IVDSGTIITSLVPSVYAAVRAEFVSQLAEYPQAAPFSILDT-CF------DLTGLREVQV 388
Query: 332 PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
P+++ F GGA++ ++ + Y DA + + AS+ + Y + +IG Q+ V +
Sbjct: 389 PSLKLVFDGGAEVEVDSKGVLYVVTGDASQVCLALASLKSEY-DTPIIGNYQQKNLRVIF 447
Query: 392 DIGRKQKTFQRMDCE 406
D Q F + C+
Sbjct: 448 DTVGSQIGFAQETCD 462
>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
Length = 483
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 122/440 (27%), Positives = 182/440 (41%), Gaps = 50/440 (11%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
LIHR+S+L + + R+ +++ K + + + S D++ +
Sbjct: 60 LIHRNSLLREAKEKLHTHEQLLLETLQRDEQRVRWIESKAQLAGKKKDEA-SSTDLNGPV 118
Query: 61 ----------FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSS 110
++V + +G P F +DTGS L W+ C PC+ C Q IF P SSS
Sbjct: 119 TSGLLYGSGEYFVRLGVGTPARSLFMVVDTGSDLPWLQCQPCKSCYKQADPIFDPRNSSS 178
Query: 111 YAYVPCDSEHCRYFPYARCSAYK---HRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI 167
+ +PC S C+ CS + RC Y Y G + S++ T ++
Sbjct: 179 FQRIPCLSPLCKALEIHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLGTGSKA-- 236
Query: 168 HVQDVVFGCGFSTNRNFKFSGI----------FGLGIGRSSLVSQLNSSFSYCIGSLHDP 217
V FGCGF F + F I SS S +SFSYC+ +P
Sbjct: 237 --MSVAFGCGFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDRSNP 294
Query: 218 -DYLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDS 273
+ LI G AI +PL +D YY + +SV G L I+ + S
Sbjct: 295 MTRSSSSLIFGAAAIPSTAALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQLSQS 354
Query: 274 G-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS---YSWPDKLCYH--GIMSSD 327
G GGV+IDSGT VT Y +RD R + S YS D CY+ G S D
Sbjct: 355 GSGGVIIDSGTSVTRFPTSVYATIRDA--FRNATTNLPSAPRYSLFDT-CYNFSGKASVD 411
Query: 328 LKGFPTVRFHFRGGAKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQF 386
+ P + HF GA L L + + +FC+A P S+ +IG + QQ
Sbjct: 412 V---PALVLHFENGADLQLPPTNYLIPINTAGSFCLAFAPTSM-----ELGIIGNIQQQS 463
Query: 387 YNVGYDIGRKQKTFQRMDCE 406
+ +G+D+ + F C+
Sbjct: 464 FRIGFDLQKSHLAFAPQQCK 483
>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 445
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 97/302 (32%), Positives = 139/302 (46%), Gaps = 35/302 (11%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD--CSPQLGTIFYPSRSSSYAYVPCDS 118
+ V +S G P VPQ +DTGS + W+ C PC C PQ ++ PS SS+Y+ VPC S
Sbjct: 113 YVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPSHSSTYSAVPCAS 172
Query: 119 EHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
+ C+ Y +C + Y G T S ++LT VQ+ FG
Sbjct: 173 DVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDKLTL----APGAIVQNFYFG 228
Query: 176 CGFSTNR-NFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHD-PDYLHNKLILGDGAIID 233
CG + F G+ GLG R SL ++ FSYC+ S+ P + L LG G
Sbjct: 229 CGHGKHAVRGLFDGVLGLGRLRESLGARYGGVFSYCLPSVSSKPGF----LALGAGKNPS 284
Query: 234 EGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVK 290
TP+ + G +TL I+V G+ LD+ P+ F GG+++DSGT +T L
Sbjct: 285 GFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFS-----GGMIVDSGTVITGLQS 339
Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----PTVRFHFRGGAKLA 345
AY ALR +E ++ D CY +L G+ P + F GGA +
Sbjct: 340 TAYRALRSAFRKAMEAYRLLPNGDLDT-CY------NLTGYKNVVVPKIALTFTGGATIN 392
Query: 346 LE 347
L+
Sbjct: 393 LD 394
>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 351
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 108/359 (30%), Positives = 165/359 (45%), Gaps = 28/359 (7%)
Query: 61 FYVNISIGQPPVPQFTAM-DTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSE 119
+ + IS+G PP QF+A+ DTGS L WV C PC C Q +F P SSSY+ C
Sbjct: 8 YVLQISLGTPP-QQFSAIVDTGSDLCWVQCAPCARCFEQPDPLFIPLASSSYSNASCTDS 66
Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
C P CS ++ C Y+ Y G T + E +T + + I FGCG +
Sbjct: 67 LCDALPRPTCS-MRNTCTYSYSYGDGSNTRGDFAFETVTLNGSTLARIG-----FGCGHN 120
Query: 180 TNRNFKFS-GIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDE 234
F + G+ GLG G SL SQLNSS FSYC+ + + G+ A
Sbjct: 121 QEGTFAGADGLIGLGQGPLSLPSQLNSSFTHIFSYCLVD-QSTTGTFSPITFGNAAENSR 179
Query: 235 GDATP-LQFID--GHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVK 290
TP LQ D +YY+ +E+ISV R + P+ F+ D +G GGV++DSGT +T+
Sbjct: 180 ASFTPLLQNEDNPSYYYVGVESISVGNRRVPTPPSAFRIDANGVGGVILDSGTTITYWRL 239
Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKLCY--HGIMSSDLKGFPTVRFHFRG-GAKLALE 347
A+ + E+ ++ + + LCY + +S L P++ H ++ +
Sbjct: 240 AAFIPILAELRRQISYPEADPTPYGLNLCYDISSVSASSLT-LPSMTVHLTNVDFEIPVS 298
Query: 348 KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
+ + C A++ + FS+IG + QQ + D+ + F DC
Sbjct: 299 NLWVLVDNFGETVCTAMSTSD------QFSIIGNVQQQNNLIVTDVANSRVGFLATDCS 351
>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 336
Score = 124 bits (312), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 113/351 (32%), Positives = 161/351 (45%), Gaps = 29/351 (8%)
Query: 67 IGQPPVPQFTAMDTGSSLLWVHCYPC---RDCSPQLGTIFYPSRSSSYAYVPCDSEHCRY 123
+GQP P F +DTGS + W+ C PC C Q+ IF P SSSY V CDSE C+
Sbjct: 3 VGQPQQPSFFVLDTGSDVTWLQCLPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQCQL 62
Query: 124 FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRN 183
A C+ + CIY Y G T ++TE LTF +++ + ++ GCG
Sbjct: 63 LDEAGCNV--NSCIYKVEYGDGSFTIGELATETLTFVHSN----SIPNISIGCGHDNEGL 116
Query: 184 F-KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYL---HNKLILGDGAI--IDEGD 236
F G+ GLG G S+ SQL SSFSYC+ + P + N D I + + D
Sbjct: 117 FVGADGLIGLGGGAISISSQLKASSFSYCLVDIDSPSFSTLDFNTDPPSDSLISPLVKND 176
Query: 237 ATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYEA 295
P Y+ + +SV G+ L I+ + F+ D+SG GG+++DSGT +T L + YE
Sbjct: 177 RFP-----SFRYVKVIGMSVGGKPLPISSSRFEIDESGLGGIIVDSGTTITQLPSDVYEV 231
Query: 296 LRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL-EKDSMFYQ 354
LR+ + P CY S+++ PT+ F G L L K+ +
Sbjct: 232 LREAFLGLTTNLPPAPEISPFDTCYDLSSQSNVE-VPTIAFILPGENSLQLPAKNCLIQV 290
Query: 355 PRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
FC+A A+ S+IG QQ V YD+ F C
Sbjct: 291 DSAGTFCLAFVSATF-----PLSIIGNFQQQGIRVSYDLTNSLVGFSTNKC 336
>gi|449433371|ref|XP_004134471.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449495479|ref|XP_004159853.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 424
Score = 124 bits (312), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 120/425 (28%), Positives = 195/425 (45%), Gaps = 35/425 (8%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIR-SHNTYQAQILPSNDISNS 59
LIH DS LSP+ N R++ ++ S +RL YL + S N + S + N
Sbjct: 12 LIHHDSPLSPFYNHTMTDTARIEATVHRSRSRLNYLYYINKLSENALDNDVSLSPTLVNE 71
Query: 60 --LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQ---LGTIFYPSRSSSYAY 113
+ ++ +IG P +DT + L+WV C C C P+ L T F S+S +Y
Sbjct: 72 GGEYLMSFNIGNPSSQVMGFLDTSNGLIWVQCSNCNSQCEPEKRGLTTKFLSSKSFTYEM 131
Query: 114 VPCDSEHCRYFP-YARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
PC S C + C++ C Y +Y TS +S++ F +D + V +
Sbjct: 132 EPCGSNFCNSLTGFQTCNSSDKWCKYRLVYGDNKATSGILSSDSFGFDTSDGMLVDVGFL 191
Query: 173 VFGCGFS--TNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDG 229
FGC + T ++G GL SL+SQL FSYC+ ++ +K+ G
Sbjct: 192 NFGCSEAPLTGDEQSYTGNVGLNQTPLSLISQLGIKKFSYCLVPFNNLGST-SKMYFGSL 250
Query: 230 AIIDEGDATPLQFIDGH-YYITLEAISV--DGRMLDINPNIFKRDDSGGGVMIDSGTDVT 286
+ G TPL + + YY+ + IS+ D D ++++ D G +ID+G +
Sbjct: 251 PVT-SGGQTPLLYPNSDAYYVKVLGISIGNDEPHFDGVFDVYEVRD---GWIIDTGITYS 306
Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPD-KLCYHGIMSSDLKGFPTVRFHFRGGAKLA 345
L +A+++L + + + Q + +LC+ ++DL+ FP V HF GA L
Sbjct: 307 SLETDAFDSLLAKFLTLKDFPQRKDDPKERFELCFELQNANDLESFPDVTVHFD-GADLI 365
Query: 346 LEKDSMFYQPRPDA-FCMAV----NPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
L +S F + D FC+A+ +P SI +G Q Y+VGYD+ + +F
Sbjct: 366 LNVESTFVKIEDDGIFCLALLRSGSPVSI---------LGNFQLQNYHVGYDLEAQVISF 416
Query: 401 QRMDC 405
+DC
Sbjct: 417 APVDC 421
>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 481
Score = 124 bits (312), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 120/409 (29%), Positives = 165/409 (40%), Gaps = 48/409 (11%)
Query: 26 INISIARLAYLQEKI-------RSHNTYQAQILPSND---ISNSLFYVNISIGQPPVPQF 75
+N+ R+ Y+Q ++ S + LP+ I ++ ++V + +G P
Sbjct: 91 MNLDNERVKYIQSRLSKNLGRENSVKELDSTTLPAKSGSLIGSANYFVVVGLGTPKRDLS 150
Query: 76 TAMDTGSSLLWVHCYPCR-DCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYA----RCS 130
DTGS L W C PC C Q IF PS+SSSY + C S C A RCS
Sbjct: 151 LVFDTGSDLTWTQCEPCAGSCYKQQDAIFDPSKSSSYINITCTSSLCTQLTSAGIKSRCS 210
Query: 131 AYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFS-GI 189
+ CIY Y + F+S E+LT TD V D +FGCG F S G+
Sbjct: 211 SSTTACIYGIQYGDKSTSVGFLSQERLTITATD----IVDDFLFGCGQDNEGLFSGSAGL 266
Query: 190 FGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEG-DATPLQFID 244
GLG S V Q N FSYC+ S L G A + TPL I
Sbjct: 267 IGLGRHPISFVQQTSSIYNKIFSYCLPSTSSS---LGHLTFGASAATNANLKYTPLSTIS 323
Query: 245 GH---YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVM 301
G Y + + ISV G L P + S GG +IDSGT +T L AY ALR
Sbjct: 324 GDNTFYGLDIVGISVGGTKL---PAVSSSTFSAGGSIIDSGTVITRLAPTAYAALRSAFR 380
Query: 302 IRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----PTVRFHFRGGAKLALEKDSMFYQPR 356
+E + + CY D G+ P + F F GG + L +
Sbjct: 381 QGMEKYPVANEDGLFDTCY------DFSGYKEISVPKIDFEFAGGVTVELPLVGILIGRS 434
Query: 357 PDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
C+A +N + ++ G + Q+ V YD+ + F C
Sbjct: 435 AQQVCLAFAANGNDN---DITIFGNVQQKTLEVVYDVEGGRIGFGAAGC 480
>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
Length = 493
Score = 124 bits (311), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 110/366 (30%), Positives = 156/366 (42%), Gaps = 33/366 (9%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ I +G P P +DTGS ++W+ C PCR C Q G +F P RSSSY V C +
Sbjct: 140 YFTKIGVGTPSTPALMVLDTGSDVVWLQCAPCRRCYDQSGPVFDPRRSSSYGAVDCAAPL 199
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
CR C + C+Y Y G T+ +TE LTF V V GCG
Sbjct: 200 CRRLDSGGCDLRRRACLYQVAYGDGSVTAGDFATETLTFAG----GARVARVALGCGHD- 254
Query: 181 NRNFKFSGIFGLGIGRSSL------VSQLNSSFSYCIGSLHDPDYLHNKLILGDGAII-- 232
N + LG+GR SL + SFSYC+ +
Sbjct: 255 NEGLFVAAAGLLGLGRGSLSFPTQISRRYGKSFSYCLVDRTSSSSSGAASRSRSSTVTFG 314
Query: 233 ----DEGDATPL---QFIDGHYYITLEAISVDG-RMLDINPNIFKRDDSG--GGVMIDSG 282
TP+ ++ YY+ L ISV G R+ + + + D S GGV++DSG
Sbjct: 315 PPSASAASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGVIVDSG 374
Query: 283 TDVTWLVKEAYEALRDEVMIRLEGEQMR--SYSWPDKLCYHGIMSSDLKGFPTVRFHFRG 340
T VT L + +Y ALRD G ++ +S D CY + + PTV HF G
Sbjct: 375 TSVTRLARPSYSALRDAFRAAAAGLRLSPGGFSLFDT-CYD-LGGRKVVKVPTVSMHFAG 432
Query: 341 GAKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKT 399
GA+ AL ++ + FC A A + S+IG + QQ + V +D ++
Sbjct: 433 GAEAALPPENYLIPVDSRGTFCFAF--AGTDG---GVSIIGNIQQQGFRVVFDGDGQRVG 487
Query: 400 FQRMDC 405
F C
Sbjct: 488 FAPKGC 493
>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 482
Score = 124 bits (311), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 123/431 (28%), Positives = 174/431 (40%), Gaps = 44/431 (10%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGI-NISIARLAYLQEKIRSH-------NTYQAQILP 52
++H+ S N+ + A I N+ R+ Y+Q ++ + + LP
Sbjct: 69 VVHKHGPCSQLNHSGKAEATISHNDIMNLDNERVKYIQSRLSKNLGGENRVKELDSTTLP 128
Query: 53 SND---ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DCSPQLGTIFYPSRS 108
+ I ++ +YV + +G P DTGS L W C PC C Q IF PS+S
Sbjct: 129 AKSGRLIGSADYYVVVGLGTPKRDLSLIFDTGSYLTWTQCEPCAGSCYKQQDPIFDPSKS 188
Query: 109 SSYAYVPCDSEHCRYFPYARCSAYK-HRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI 167
SSY + C S C F A CS+ CIY Y + F+S E+LT TD
Sbjct: 189 SSYTNIKCTSSLCTQFRSAGCSSSTDASCIYDVKYGDNSISRGFLSQERLTITATD---- 244
Query: 168 HVQDVVFGCGFSTNRNFK-FSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHN 222
V D +FGCG F+ +G+ GL S V Q N FSYC+ S P L +
Sbjct: 245 IVHDFLFGCGQDNEGLFRGTAGLMGLSRHPISFVQQTSSIYNKIFSYCLPS--TPSSLGH 302
Query: 223 KLILGDGAIIDEGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
A TP I G Y + + ISV G L P + S GG +I
Sbjct: 303 LTFGASAATNANLKYTPFSTISGENSFYGLDIVGISVGGTKL---PAVSSSTFSAGGSII 359
Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----PTV 334
DSGT +T L AY ALR + + + CY D G+ P +
Sbjct: 360 DSGTVITRLPPTAYAALRSAFRQFMMKYPVAYGTRLLDTCY------DFSGYKEISVPRI 413
Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
F F GG K+ L + Y C+A + N + ++ G + Q+ V YD+
Sbjct: 414 DFEFAGGVKVELPLVGILYGESAQQLCLAF---AANGNGNDITIFGNVQQKTLEVVYDVE 470
Query: 395 RKQKTFQRMDC 405
+ F C
Sbjct: 471 GGRIGFGAAGC 481
>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
Length = 503
Score = 124 bits (311), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 107/363 (29%), Positives = 158/363 (43%), Gaps = 42/363 (11%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSYAYVPCDSE 119
+ V I +G P DTGS WV C PC C Q +F P++S++YA + C S
Sbjct: 165 YVVPIRLGTPAARFTVVFDTGSDTTWVQCQPCVAYCYQQKEPLFTPTKSATYANISCTSS 224
Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
+C CS C+Y Y G T F + + LT V+D FGCG
Sbjct: 225 YCSDLDTRGCSG--GHCLYAVQYGDGSYTVGFYAQDTLTLGYDT-----VKDFRFGCG-E 276
Query: 180 TNRNF--KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAIID 233
NR K +G+ GLG G++S+ Q + F+YCI + +
Sbjct: 277 KNRGLFGKAAGLMGLGRGKTSVPVQAYDKYSGVFAYCIPATSSGTGFLD--FGPGAPAAA 334
Query: 234 EGDATPLQFIDG--HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKE 291
TP+ +G YY+ + I V G +L I +F S G ++DSGT +T L
Sbjct: 335 NARLTPMLVDNGPTFYYVGMTGIKVGGHLLSIPATVF----SDAGALVDSGTVITRLPPS 390
Query: 292 AYEALRDEVMIRLEG---EQMRSYSWPDKLCYHGIMSSDLKGF------PTVRFHFRGGA 342
AYE LR +EG + ++S D CY DL G+ P V F+GGA
Sbjct: 391 AYEPLRSAFAKGMEGLGYKTAPAFSILDT-CY------DLTGYQGSIALPAVSLVFQGGA 443
Query: 343 KLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
L ++ + Y C+A + N+ + +++G Q+ Y+V YD+G+K F
Sbjct: 444 CLDVDASGILYVADVSQACLAF---AANDDDTDMTIVGNTQQKTYSVLYDLGKKVVGFAP 500
Query: 403 MDC 405
C
Sbjct: 501 GAC 503
>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
Length = 515
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 101/374 (27%), Positives = 154/374 (41%), Gaps = 48/374 (12%)
Query: 52 PSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSS 110
P + + V + +G P DTGS WV C PC C Q +F P+ SS+
Sbjct: 170 PGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSST 229
Query: 111 YAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
YA V C + C + CS C+Y Y G + F + + LT + D V+
Sbjct: 230 YANVSCAAPACSDLDVSGCSG--GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDA----VK 283
Query: 171 DVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCI------------GS 213
FGCG + F + +G+ GLG G++SL Q F++C+ G+
Sbjct: 284 GFRFGCGERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPARSTGTGYLDFGA 343
Query: 214 LHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDS 273
P ++ G+G YY+ + I V GR+L I P++F +
Sbjct: 344 GSPPATTTTPMLTGNGPTF--------------YYVGMTGIRVGGRLLPIAPSVF----A 385
Query: 274 GGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL--CYHGIMSSDLKGF 331
G ++DSGT +T L AY +LR + R + L CY S +
Sbjct: 386 AAGTIVDSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVA-I 444
Query: 332 PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
PTV F+GGA L ++ + Y C+A + N + ++G + + V Y
Sbjct: 445 PTVSLLFQGGAALDVDASGIMYTVSASQVCLAF---AGNEDGGDVGIVGNTQLKTFGVAY 501
Query: 392 DIGRKQKTFQRMDC 405
DIG+K F C
Sbjct: 502 DIGKKVVGFSPGAC 515
>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
Length = 516
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 101/374 (27%), Positives = 154/374 (41%), Gaps = 48/374 (12%)
Query: 52 PSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSS 110
P + + V + +G P DTGS WV C PC C Q +F P+ SS+
Sbjct: 171 PGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSST 230
Query: 111 YAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
YA V C + C + CS C+Y Y G + F + + LT + D V+
Sbjct: 231 YANVSCAAPACSDLDVSGCSG--GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDA----VK 284
Query: 171 DVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCI------------GS 213
FGCG + F + +G+ GLG G++SL Q F++C+ G+
Sbjct: 285 GFRFGCGERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPPRSTGTGYLDFGA 344
Query: 214 LHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDS 273
P ++ G+G YY+ + I V GR+L I P++F +
Sbjct: 345 GSPPATTTTPMLTGNGPTF--------------YYVGMTGIRVGGRLLPIAPSVF----A 386
Query: 274 GGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL--CYHGIMSSDLKGF 331
G ++DSGT +T L AY +LR + R + L CY S +
Sbjct: 387 AAGTIVDSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVA-I 445
Query: 332 PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
PTV F+GGA L ++ + Y C+A + N + ++G + + V Y
Sbjct: 446 PTVSLLFQGGAALDVDASGIMYTVSASQVCLAF---AGNEDGGDVGIVGNTQLKTFGVAY 502
Query: 392 DIGRKQKTFQRMDC 405
DIG+K F C
Sbjct: 503 DIGKKVVGFSPGAC 516
>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
Length = 440
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 104/363 (28%), Positives = 160/363 (44%), Gaps = 28/363 (7%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ ++++IG PP P +DTGS L+W C PC C Q + SRSS++A CDS
Sbjct: 91 YLLHLAIGTPPQPVQLTLDTGSVLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDSTQ 150
Query: 121 CRYFPYARCSAYK--HRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
C+ P + C Y+ Y T F+ E ++F + V VVFGCG
Sbjct: 151 CKLDPSVTMCVNQTVQTCAYSYSYGDKSATIGFLDVETVSF----VAGASVPGVVFGCGL 206
Query: 179 STNRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSL--HDPDYLHNKLIL-----GD 228
+ F+ +GI G G G SL SQL +FS+C ++ P + L G
Sbjct: 207 NNTGIFRSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVSGRKPSTVLFDLPADLYKNGR 266
Query: 229 GAIIDEGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDV 285
G + TPL H YY++L+ I+V L + + F + GG +IDSGT
Sbjct: 267 GTV----QTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALKNGTGGTIIDSGTAF 322
Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLA 345
T L Y + DE ++ + S LC+ P + HF GA +
Sbjct: 323 TSLPPRVYRLVHDEFAAHVKLPVVPSNETGPLLCFSAPPLGKAPHVPKLVLHFE-GATMH 381
Query: 346 LEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
L +++ ++ + C ++ A I ++IG QQ +V YD+ + +F R C
Sbjct: 382 LPRENYVFEAKDGGNC-SICLAIIEGE---MTIIGNFQQQNMHVLYDLKNSKLSFVRAKC 437
Query: 406 EVL 408
+ L
Sbjct: 438 DKL 440
>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
Length = 475
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 112/394 (28%), Positives = 169/394 (42%), Gaps = 63/394 (15%)
Query: 32 RLAYLQEKIRSHNTY---------QAQILPSN---DISNSLFYVNISIGQPPVPQFTAMD 79
R Y+Q ++ +A +P+N I + V +S+G P V Q +D
Sbjct: 101 RAEYIQRRVSGAAAAAPGMQLAGSKAATVPANLGFSIGTLQYVVTVSLGTPAVAQTLEVD 160
Query: 80 TGSSLLWVHCYPCRD--CSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCI 137
TGS + WV C PC C Q +F P+RSSSY+ VPC + C +C
Sbjct: 161 TGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSSYSAVPCAAASCSQLALYSNGCSGGQCG 220
Query: 138 YTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFK-FSGIFGLGIGR 196
Y Y G T+ S++ LT ++ ++ +FGCG + F G+ GLG
Sbjct: 221 YVVSYGDGSTTTGVYSSDTLTLTGSNA----LKGFLFGCGHAQQGLFAGVDGLLGLGRQG 276
Query: 197 SSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFI--DGHYYIT 250
SLVSQ +S+ FSYC+ + + LG + TPL D YYI
Sbjct: 277 QSLVSQASSTYGGVFSYCLPPTQNS---VGYISLGGPSSTAGFSTTPLLTASNDPTYYIV 333
Query: 251 -LEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQM 309
L ISV G+ L I+ ++F G ++D+GT VT L AY ALR M
Sbjct: 334 MLAGISVGGQPLSIDASVFAS-----GAVVDTGTVVTRLPPTAYSALRSAFR-----AAM 383
Query: 310 RSYSWPDK-------LCY----HGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPD 358
Y +P CY +G ++ PT+ F GGA + L +
Sbjct: 384 APYGYPSAPATGILDTCYDFTRYGTVT-----LPTISIAFGGGAAMDLGTSGIL-----T 433
Query: 359 AFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
+ C+A P +++ S++G + Q+ + V +D
Sbjct: 434 SGCLAFAPTGGDSQA---SILGNVQQRSFEVRFD 464
>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
Length = 519
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 101/374 (27%), Positives = 154/374 (41%), Gaps = 48/374 (12%)
Query: 52 PSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSS 110
P + + V + +G P DTGS WV C PC C Q +F P+ SS+
Sbjct: 174 PGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSST 233
Query: 111 YAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
YA V C + C + CS C+Y Y G + F + + LT + D V+
Sbjct: 234 YANVSCAAPACSDLDVSGCSG--GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDA----VK 287
Query: 171 DVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCI------------GS 213
FGCG + F + +G+ GLG G++SL Q F++C+ G+
Sbjct: 288 GFRFGCGERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPARSTGTGYLDFGA 347
Query: 214 LHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDS 273
P ++ G+G YY+ + I V GR+L I P++F +
Sbjct: 348 GSPPATTTTPMLTGNGPTF--------------YYVGMTGIRVGGRLLPIAPSVF----A 389
Query: 274 GGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL--CYHGIMSSDLKGF 331
G ++DSGT +T L AY +LR + R + L CY S +
Sbjct: 390 AAGTIVDSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVA-I 448
Query: 332 PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
PTV F+GGA L ++ + Y C+A + N + ++G + + V Y
Sbjct: 449 PTVSLLFQGGAALDVDASGIMYTVSASQVCLAF---AGNEDGGDVGIVGNTQLKTFGVAY 505
Query: 392 DIGRKQKTFQRMDC 405
DIG+K F C
Sbjct: 506 DIGKKVVGFSPGAC 519
>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
gi|194705620|gb|ACF86894.1| unknown [Zea mays]
gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
Length = 477
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 118/434 (27%), Positives = 182/434 (41%), Gaps = 53/434 (12%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSN------ 54
++H SP + P+H G + R+ ++ K+ + T + P
Sbjct: 67 VVHGHGPCSPQESRRGAPSHTEILGRDQD--RVDAIRRKVAAVTTAASSSKPKGVPLQVG 124
Query: 55 -----DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSS 109
D +N ++ ++ +G P +DTGS W+ C PC DC Q +F PS+SS
Sbjct: 125 WGKYLDTTN--YFTSLRLGTPATDLLVELDTGSDQSWIQCKPCPDCYEQHEALFDPSKSS 182
Query: 110 SYAYVPCDSEHCRYFPYAR---CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDEST 166
+Y+ + C S C+ + CS+ K +C Y Y T ++ + LT TD
Sbjct: 183 TYSDITCSSRECQELGSSHKHNCSSDK-KCPYEITYADDSYTVGNLARDTLTLSPTDA-- 239
Query: 167 IHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLH 221
V VFGCG + +F + G+ GLG G++SL SQ+ + FSYC+ S P
Sbjct: 240 --VPGFVFGCGHNNAGSFGEIDGLLGLGRGKASLSSQVAARYGAGFSYCLPS--SPSAT- 294
Query: 222 NKLILGDGAIIDEGDATPLQFIDGH----YYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
L A +A + + G YY+ L I+V GR + + P++F + G
Sbjct: 295 GYLSFSGAAAAAPTNAQFTEMVAGQHPSFYYLNLTGITVAGRAIKVPPSVFA---TAAGT 351
Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFH 337
+IDSGT + L AY ALR V + + S CY DL G TVR
Sbjct: 352 IIDSGTAFSCLPPSAYAALRSSVRSAMGRYKRAPSSTIFDTCY------DLTGHETVRIP 405
Query: 338 -----FRGGAKLALEKDSMFYQ-PRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
F GA + L + Y C+A P N + ++G Q+ V Y
Sbjct: 406 SVALVFADGATVHLHPSGVLYTWSNVSQTCLAFLP---NPDDTSLGVLGNTQQRTLAVIY 462
Query: 392 DIGRKQKTFQRMDC 405
D+ ++ F C
Sbjct: 463 DVDNQKVGFGANGC 476
>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 384
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 104/363 (28%), Positives = 160/363 (44%), Gaps = 28/363 (7%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ ++++IG PP P +DTGS L+W C PC C Q + SRSS++A CDS
Sbjct: 35 YLLHLAIGTPPQPVQLTLDTGSVLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDSTQ 94
Query: 121 CRYFPYARCSAYK--HRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
C+ P + C Y+ Y T F+ E ++F + V VVFGCG
Sbjct: 95 CKLDPSVTMCVNQTVQTCAYSYSYGDKSATIGFLDVETVSF----VAGASVPGVVFGCGL 150
Query: 179 STNRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSL--HDPDYLHNKLIL-----GD 228
+ F+ +GI G G G SL SQL +FS+C ++ P + L G
Sbjct: 151 NNTGIFRSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVSGRKPSTVLFDLPADLYKNGR 210
Query: 229 GAIIDEGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDV 285
G + TPL H YY++L+ I+V L + + F + GG +IDSGT
Sbjct: 211 GTV----QTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALKNGTGGTIIDSGTAF 266
Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLA 345
T L Y + DE ++ + S LC+ P + HF GA +
Sbjct: 267 TSLPPRVYRLVHDEFAAHVKLPVVPSNETGPLLCFSAPPLGKAPHVPKLVLHFE-GATMH 325
Query: 346 LEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
L +++ ++ + C ++ A I ++IG QQ +V YD+ + +F R C
Sbjct: 326 LPRENYVFEAKDGGNC-SICLAIIEGE---MTIIGNFQQQNMHVLYDLKNSKLSFVRAKC 381
Query: 406 EVL 408
+ L
Sbjct: 382 DKL 384
>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 367
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 108/375 (28%), Positives = 167/375 (44%), Gaps = 39/375 (10%)
Query: 46 YQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYP 105
++A I ++ + +G P + +DTGS + W+ C PC +C Q +F P
Sbjct: 1 FEAPIFSGLAFGTGEYFAVVGVGTPRRDMYLVVDTGSDITWLQCAPCTNCYKQKDALFNP 60
Query: 106 SRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT-DE 164
S SSS+ + C S C C ++C+Y Y G T + T+ + +
Sbjct: 61 SSSSSFKVLDCSSSLCLNLDVMGC--LSNKCLYQADYGDGSFTMGELVTDNVVLDDAFGP 118
Query: 165 STIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLH-DPD 218
+ + ++ GCG F +GI GLG G S + L++S FSYC+ DP+
Sbjct: 119 GQVVLTNIPLGCGHDNEGTFGTAAGILGLGRGPLSFPNNLDASTRNIFSYCLPDRESDPN 178
Query: 219 YLHNKLILGDGAIIDEGDATPLQFI--------DGHYYITLEAISVDGRML-DINPNIFK 269
+ + L+ GD AI + ++FI +YY+ + ISV G +L +I ++F+
Sbjct: 179 H-KSTLVFGDAAIPHTATGS-VKFIPQLRNPRVATYYYVQITGISVGGNLLTNIPASVFQ 236
Query: 270 RDDSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD----KLCYHGIM 324
D G GG + DSGT +T L AY A+RD M S D CY
Sbjct: 237 LDSHGNGGTIFDSGTTITRLEARAYTAVRDA----FRAATMHLTSAADFKIFDTCYDFTG 292
Query: 325 SSDLKGFPTVRFHFRGGAKLALEKDSMFYQP--RPDAFCMAVNPASINNRYVNFSLIGMM 382
+ + PTV FHF+G + L S + P + FC A + + S+IG +
Sbjct: 293 MNSIS-VPTVTFHFQGDVDMRLPP-SNYIVPVSNNNIFCFAFAAS------MGPSVIGNV 344
Query: 383 AQQFYNVGYDIGRKQ 397
QQ + V YD KQ
Sbjct: 345 QQQSFRVIYDNVHKQ 359
>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
Length = 464
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 112/394 (28%), Positives = 169/394 (42%), Gaps = 63/394 (15%)
Query: 32 RLAYLQEKIRSHNTY---------QAQILPSN---DISNSLFYVNISIGQPPVPQFTAMD 79
R Y+Q ++ +A +P+N I + V +S+G P V Q +D
Sbjct: 90 RAEYIQRRVSGAAAAAPGMQLAGSKAATVPANLGFSIGTLQYVVTVSLGTPAVAQTLEVD 149
Query: 80 TGSSLLWVHCYPCRD--CSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCI 137
TGS + WV C PC C Q +F P+RSSSY+ VPC + C +C
Sbjct: 150 TGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSSYSAVPCAAASCSQLALYSNGCSGGQCG 209
Query: 138 YTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFK-FSGIFGLGIGR 196
Y Y G T+ S++ LT ++ ++ +FGCG + F G+ GLG
Sbjct: 210 YVVSYGDGSTTTGVYSSDTLTLTGSNA----LKGFLFGCGHAQQGLFAGVDGLLGLGRQG 265
Query: 197 SSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFI--DGHYYIT 250
SLVSQ +S+ FSYC+ + + LG + TPL D YYI
Sbjct: 266 QSLVSQASSTYGGVFSYCLPPTQNS---VGYISLGGPSSTAGFSTTPLLTASNDPTYYIV 322
Query: 251 -LEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQM 309
L ISV G+ L I+ ++F G ++D+GT VT L AY ALR M
Sbjct: 323 MLAGISVGGQPLSIDASVFAS-----GAVVDTGTVVTRLPPTAYSALRSAFR-----AAM 372
Query: 310 RSYSWPDK-------LCY----HGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPD 358
Y +P CY +G ++ PT+ F GGA + L +
Sbjct: 373 APYGYPSAPATGILDTCYDFTRYGTVT-----LPTISIAFGGGAAMDLGTSGIL-----T 422
Query: 359 AFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
+ C+A P +++ S++G + Q+ + V +D
Sbjct: 423 SGCLAFAPTGGDSQA---SILGNVQQRSFEVRFD 453
>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
Length = 476
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 101/356 (28%), Positives = 150/356 (42%), Gaps = 25/356 (7%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSYAYVPCDSE 119
F V + G P DTGS + W+ C PC C Q IF P++S++Y+ VPC
Sbjct: 135 FVVTVGFGTPAQTYTVIFDTGSDVSWIQCLPCSGHCYKQHDPIFDPTKSATYSVVPCGHP 194
Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
C ++CS C+Y Y G ++ +S E L+ ST + FGCG +
Sbjct: 195 QCAAADGSKCS--NGTCLYKVEYGDGSSSAGVLSHETLSLT----STRALPGFAFGCGQT 248
Query: 180 TNRNF-KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDE 234
+F G+ GLG G+ SL SQ +S FSYC+ S + H L +G
Sbjct: 249 NLGDFGDVDGLIGLGRGQLSLSSQAAASFGGTFSYCLPS---DNTTHGYLTIGPTTPASN 305
Query: 235 GDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLV 289
D + Y++ L +I + G +L + P +F D G +DSGT +T+L
Sbjct: 306 DDVQYTAMVQKQDYPSFYFVELVSIDIGGYILPVPPTLFTDD----GTFLDSGTILTYLP 361
Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKD 349
EAY ALRD + + P CY S + P V F F G+ L
Sbjct: 362 PEAYTALRDRFKFTMTQYKPAPAYDPFDTCYDFTGQSAIF-IPAVSFKFSDGSVFDLSFF 420
Query: 350 SMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ P A + + F+++G M Q+ V YD+ ++ F C
Sbjct: 421 GILIFPDDTAPAIGCLGFVARPSAMPFTIVGNMQQRNTEVIYDVAAEKIGFASASC 476
>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
Length = 504
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 96/319 (30%), Positives = 149/319 (46%), Gaps = 32/319 (10%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
L++ + +G PP F +DTGS +LWV C PC C G F P SS+ + +
Sbjct: 90 LYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKI 149
Query: 115 PCDSEHCRYF---PYARCSAYKHR-CIYTQLYLIGPETSVFVSTEQLTFKNT---DESTI 167
PC + C A C + C YT Y G TS + ++ + F + +++
Sbjct: 150 PCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDSVMGNEQTAN 209
Query: 168 HVQDVVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNS------SFSYCIGSLHD 216
+VFGC S T + GIFG G + S+VSQLNS FS+C L
Sbjct: 210 SSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHC---LKG 266
Query: 217 PDYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
D L+LG+ I++ G TPL HY + LE+I V+G+ L I+ ++F ++
Sbjct: 267 SDNGGGILVLGE--IVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNT-Q 323
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
G ++DSGT + +L AY+ + + + +RS C+ S D FPTV
Sbjct: 324 GTIVDSGTTLAYLADGAYDPFVNAITAAVS-PSVRSLVSKGNQCFVTSSSVD-SSFPTVS 381
Query: 336 FHFRGGAKLALEKDSMFYQ 354
+F GG + ++ ++ Q
Sbjct: 382 LYFMGGVAMTVKPENYLLQ 400
>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
Length = 472
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 122/372 (32%), Positives = 161/372 (43%), Gaps = 53/372 (14%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR--DCSPQLGTIFYPSRSSSYAYVPCDS 118
+ V + IG P V Q +DTGS L WV C PC C PQ ++ P+ SS+YA VPCDS
Sbjct: 127 YVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNSSSCYPQKDPLYDPTASSTYAPVPCDS 186
Query: 119 EHCR-YFPYARCSAYKHRCIY---TQLYLIGPE-----TSVFV-STEQLTFKNTDESTIH 168
+ C+ P AY H C T L G E T+V V STE LT +
Sbjct: 187 KACKDLVP----DAYDHGCTNSSGTSLCQYGIEYGNRDTTVGVYSTETLTL----SPQVS 238
Query: 169 VQDVVFGCGF-STNRNFKFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNK 223
V+D FGCG F G+ GLG SLVSQ +FSYC+ P
Sbjct: 239 VKDFGFGCGLVQQGTFDLFDGLLGLGGAPESLVSQTAETYGGAFSYCL-----PPGNSTT 293
Query: 224 LILGDGAIIDEGDATPLQFIDGH--------YYITLEAISVDGRMLDINPNIFKRDDSGG 275
L GA + D F H Y + L +SV G+ LDI P + G
Sbjct: 294 GFLALGAPTNNNDTAGFLFTPLHSLPEQATFYLVNLTGVSVGGKPLDIPPTVLS-----G 348
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL--CYHGIMSSDLKGFPT 333
G++IDSGT +T L AY ALR + + + D L CY+ +++ PT
Sbjct: 349 GMIIDSGTIITGLPDTAYSALRTAFRTAMSAYPLLPPNNDDVLDTCYNFTGIANVT-VPT 407
Query: 334 VRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
V F GGA + L+ S AF + + +IG + Q+ + V YD
Sbjct: 408 VALTFDGGATIDLDVPSGVLIQDCLAFAGGASDGDVG-------IIGNVNQRTFEVLYDS 460
Query: 394 GRKQKTFQRMDC 405
GR F+ C
Sbjct: 461 GRGHVGFRPGAC 472
>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 118/426 (27%), Positives = 186/426 (43%), Gaps = 55/426 (12%)
Query: 22 VQRGINISIARLAYLQEKIRSHNTYQAQILP----SNDISNSLFYVNISIGQP-PVPQFT 76
++R + S ARLA L RS A P +D+ +S + +++ IG P P
Sbjct: 55 LRRMVARSKARLASL----RSSACDTALTAPVDHGGSDVGSSEYLIHLGIGTPRPQRVVL 110
Query: 77 AMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSE---HCRYFPYARCSAYK 133
+DTGS L+W C C C Q +F S S +++ VPC H Y P + C+A
Sbjct: 111 HLDTGSDLVWTQCA-CTVCFDQPVPVFRASVSHTFSRVPCSDPLCGHAVYLPLSGCAARD 169
Query: 134 HRCIYTQLYLIGPETSVFVSTEQLTFKNTD--ESTIHVQDVVFGCG------FSTNRNFK 185
C Y Y+ T+ ++ + TFK D ++ V ++ FGCG F+ N+
Sbjct: 170 RSCFYAYGYMDHSITTGKMAEDTFTFKAPDRADTAAAVPNIRFGCGMMNYGLFTPNQ--- 226
Query: 186 FSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT-PLQ-- 241
SGI G G G SL SQL FSYC ++ + + +ILG E AT P+Q
Sbjct: 227 -SGIAGFGTGPLSLPSQLKVRRFSYCFTAMEESRV--SPVILGGEPENIEAHATGPIQST 283
Query: 242 -FIDG----------HYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLV 289
F G Y+++L ++V L N + F + D GG IDSGT +T+
Sbjct: 284 PFAPGPAGAPVGSQPFYFLSLRGVTVGETRLPFNASTFALKGDGSGGTFIDSGTAITFFP 343
Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPDK-LCYHGIMSSDLKGFPTVRFHFRGGAKLALEK 348
+ + +LR+ + ++ + Y+ PD LC+ P + H GA L +
Sbjct: 344 QAVFRSLREAFVAQVPLPVAKGYTDPDNLLCFSVPAKKKAPAVPKLILHLE-GADWELPR 402
Query: 349 DSMFYQPRPDA------FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
++ D C+ + A +N ++IG QQ ++ YD+ + F
Sbjct: 403 ENYVLDNDDDGSGAGRKLCVVILSAGNSNG----TIIGNFQQQNMHIVYDLESNKMVFAP 458
Query: 403 MDCEVL 408
C+ L
Sbjct: 459 ARCDKL 464
>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 494
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 102/372 (27%), Positives = 159/372 (42%), Gaps = 35/372 (9%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
L++ I IG P + +DTGS +LWV+C C C + G T++ P+ S+S V
Sbjct: 88 LYFTQIGIGTPSKGYYVQVDTGSDILWVNCISCDSCPRKSGLGIDLTLYDPTASASSKTV 147
Query: 115 PCDSEHCRYFPYA----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD---ESTI 167
C E C C+A C Y+ Y G T+ F + L + ++ +
Sbjct: 148 TCGQEFCATATNGGVPPSCAA-NSPCQYSITYGDGSSTTGFFVADFLQYDQVSGDGQTNL 206
Query: 168 HVQDVVFGCGFST-----NRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHD 216
V FGCG + N GI G G SS++SQL S+ FS+C+
Sbjct: 207 ANASVTFGCGAKIGGALGSSNVALDGILGFGQANSSMLSQLTSAGKVTKIFSHCL----- 261
Query: 217 PDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
D ++ I G ++ + TPL HY + L+ I V G L + NIF
Sbjct: 262 -DTVNGGGIFAIGNVVQPKVKTTPLVPGMPHYNVVLKTIDVGGSTLQLPTNIFDIGGGSR 320
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
G +IDSGT + +L + Y+A+ V +++ D LC+ S D GFP V
Sbjct: 321 GTIIDSGTTLAYLPEVVYKAVLSAVFSNHPDVTLKNVQ--DFLCFQYSGSVD-NGFPEVT 377
Query: 336 FHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR-YVNFSLIGMMAQQFYNVGYDIG 394
FHF G L + +Q D +C+ + ++ + L+G +A V YD+
Sbjct: 378 FHFDGDLPLVVYPHDYLFQNTEDVYCVGFQSGGVQSKDGKDMVLLGDLALSNKLVVYDLE 437
Query: 395 RKQKTFQRMDCE 406
+ + +C
Sbjct: 438 NQVIGWTNYNCS 449
>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 504
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 96/319 (30%), Positives = 148/319 (46%), Gaps = 32/319 (10%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
L++ + +G PP F +DTGS +LWV C PC C G F P SS+ + +
Sbjct: 90 LYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKI 149
Query: 115 PCDSEHCRY---FPYARCSAYKHR-CIYTQLYLIGPETSVFVSTEQLTFKNT---DESTI 167
PC + C A C + C YT Y G TS + ++ + F +++
Sbjct: 150 PCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTAN 209
Query: 168 HVQDVVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNS------SFSYCIGSLHD 216
+VFGC S T + GIFG G + S+VSQLNS FS+C L
Sbjct: 210 SSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHC---LKG 266
Query: 217 PDYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
D L+LG+ I++ G TPL HY + LE+I V+G+ L I+ ++F ++
Sbjct: 267 SDNGGGILVLGE--IVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNT-Q 323
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
G ++DSGT + +L AY+ + + + +RS C+ S D FPTV
Sbjct: 324 GTIVDSGTTLAYLADGAYDPFVNAITAAVS-PSVRSLVSKGNQCFVTSSSVD-SSFPTVS 381
Query: 336 FHFRGGAKLALEKDSMFYQ 354
+F GG + ++ ++ Q
Sbjct: 382 LYFMGGVAMTVKPENYLLQ 400
>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
Length = 407
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 112/369 (30%), Positives = 156/369 (42%), Gaps = 39/369 (10%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++V + +G P F +DTGS L W+ C PC+ C Q IF P SSS+ +PC S
Sbjct: 54 YFVRLGLGTPARSLFMVVDTGSDLPWLQCQPCKSCYKQADPIFDPRNSSSFQRIPCLSPL 113
Query: 121 CRYFPYARCSAYK---HRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
C+ CS + RC Y Y G + S++ T ++ V FGCG
Sbjct: 114 CKALEVHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLGTGSKA----MSVAFGCG 169
Query: 178 FSTNRNFKFSGI----------FGLGIGRSSLVSQLNSSFSYCIGSLHDP-DYLHNKLIL 226
F F + F I SS S +SFSYC+ +P + LI
Sbjct: 170 FDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDRSNPMTRSSSSLIF 229
Query: 227 GDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSG 282
G AI +PL +D YY + +SV G L I+ + SG GGV+IDSG
Sbjct: 230 GVAAIPSTAALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQLSQSGSGGVIIDSG 289
Query: 283 TDVTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYH--GIMSSDLKGFPTVRFH 337
T VT Y +RD I L YS D CY+ G S D+ P + H
Sbjct: 290 TSVTRFPTSVYATIRDAFRNATINLPSAPR--YSLFDT-CYNFSGKASVDV---PALVLH 343
Query: 338 FRGGAKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
F GA L L + + +FC+A P S+ +IG + QQ + +G+D+ +
Sbjct: 344 FENGADLQLPPTNYLIPINTAGSFCLAFAPTSM-----ELGIIGNIQQQSFRIGFDLQKS 398
Query: 397 QKTFQRMDC 405
F C
Sbjct: 399 HLAFAPQQC 407
>gi|357127505|ref|XP_003565420.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 466
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 121/447 (27%), Positives = 194/447 (43%), Gaps = 66/447 (14%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
LIHRDSI SP+++P R + +A L ++D+S+ L
Sbjct: 31 LIHRDSIKSPFHDPKLTRHDRFL---------------AAARRSRARAAALLASDVSSDL 75
Query: 61 FY------VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-------------------C 95
FY +++G PPV DTGS L+W+ C ++
Sbjct: 76 FYGDFEYLAAVNVGTPPVRFLAVADTGSDLVWLKCNTTQNNNGIVSSDSGNNSNSSPPPP 135
Query: 96 SPQLGTIFYPSRSSSYAYVPCDSEHCRYFPY-ARCSAYKHRCIYTQLYLIGPETSVFVST 154
P+ F P SSSY+ V CD C A C+ H C + Y G + ++
Sbjct: 136 PPEAVVYFNPFDSSSYSRVGCDGPSCLALATNASCNGDSHACDFRYSYRDGASATGLLAA 195
Query: 155 EQLTFK-NTDESTIHVQDVVFGCGFST-NRNFKFSGIFGLGIGRSSLVSQLNSSFSYCIG 212
+ TF N + T + FGC T R F+ G+ GLG G SL SQL FS+C+
Sbjct: 196 DTFTFGGNINNDTTSTASIDFGCATGTAGREFQADGMVGLGAGPLSLASQLGRKFSFCL- 254
Query: 213 SLHDPDYLHNKLILGDGAII-DEGDA-TPL----QFIDGHYYITLEAISVDGRMLDINPN 266
+ +D D + L G A++ D G A TPL +Y I+++++ V G+ + +
Sbjct: 255 TAYDIDDASSILNFGARAVVSDPGAATTPLIASSSNAAAYYAISIDSLKVAGQPVPGTTS 314
Query: 267 IFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIR-LEGEQMRSYSWPD---KLCYHG 322
+ K V++D+GT +T+L + A A E + R ++G + PD +LCY
Sbjct: 315 VSK-------VIVDTGTVLTFLDRAALLAPLTESLARVMDGAGLPRAPPPDETLELCYDV 367
Query: 323 IMSSDLKGF---PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLI 379
D+ G T+ GG ++ L + F + C+AV + + S++
Sbjct: 368 SRVKDVDGVIPDVTLVLGGGGGGEVRLTGEGTFVLVKEGVLCLAV--VTTSPELQPLSVL 425
Query: 380 GMMAQQFYNVGYDIGRKQKTFQRMDCE 406
G +A Q +VG D+ + TF +C+
Sbjct: 426 GNVALQDLHVGIDLDARTATFATANCD 452
>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
Length = 440
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 103/363 (28%), Positives = 160/363 (44%), Gaps = 28/363 (7%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ ++++IG PP P +DTGS L+W C PC C Q + SRSS++A CDS
Sbjct: 91 YLLHLAIGTPPQPVQLTLDTGSDLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDSTQ 150
Query: 121 CRYFPYARCSAYK--HRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
C+ P + C ++ Y T F+ E ++F + V VVFGCG
Sbjct: 151 CKLDPSVTMCVNQTVQTCAFSYSYGDKSATIGFLDVETVSF----VAGASVPGVVFGCGL 206
Query: 179 STNRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSL--HDPDYLHNKLIL-----GD 228
+ F+ +GI G G G SL SQL +FS+C ++ P + L G
Sbjct: 207 NNTGIFRSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVSGRKPSTVLFDLPADLYKNGR 266
Query: 229 GAIIDEGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDV 285
G + TPL H YY++L+ I+V L + + F + GG +IDSGT
Sbjct: 267 GTV----QTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALKNGTGGTIIDSGTAF 322
Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLA 345
T L Y + DE ++ + S LC+ P + HF GA +
Sbjct: 323 TSLPPRVYRLVHDEFAAHVKLPVVPSNETGPLLCFSAPPLGKAPHVPKLVLHFE-GATMH 381
Query: 346 LEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
L +++ ++ + C ++ A I ++IG QQ +V YD+ + +F R C
Sbjct: 382 LPRENYVFEAKDGGNC-SICLAIIEGE---MTIIGNFQQQNMHVLYDLKNSKLSFVRAKC 437
Query: 406 EVL 408
+ L
Sbjct: 438 DKL 440
>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
Length = 511
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 106/379 (27%), Positives = 164/379 (43%), Gaps = 43/379 (11%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+YV + +G P V MDTGS + W+ C PC+DC P L F P SSS+ +PC S
Sbjct: 139 YYVPLQVGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFNPRHSSSFFKLPCASST 198
Query: 121 CRYF-----PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT----DESTIHVQD 171
C P+ CS C+++ Y G +S ++ E + NT D + + +
Sbjct: 199 CTNVYQGVKPF--CSPSGRTCLFSIQYGDGSLSSGLLAMETIA-GNTPNFGDGEPVKLSN 255
Query: 172 VVFGCGFSTNRNFK--FSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHN--- 222
+ GC SG+ G+ S SQL+S FS+C PD + +
Sbjct: 256 ITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCF-----PDKIAHLNS 310
Query: 223 --KLILGDGAIID---------EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRD 271
+ G+ II + A P +D +YY+ L ISVD L ++ F D
Sbjct: 311 SGLVFFGESDIISPYLRYTPLVQNPAVPSASLD-YYYVGLVGISVDESRLPLSHKNFDID 369
Query: 272 D--SGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYH---GIMSS 326
GG +IDSGT T+L K A++A+R E + R + CY+ G +
Sbjct: 370 KVTGSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFTPCYNITSGTAAL 429
Query: 327 DLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQF 386
+ P++ HFRGG + L K+S+ + A + + + F++IG QQ
Sbjct: 430 ESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFLMSGDIPFNIIGNYQQQN 489
Query: 387 YNVGYDIGRKQKTFQRMDC 405
V YD+ + + C
Sbjct: 490 LWVEYDLEKLRLGIAPAQC 508
>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
Length = 468
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 116/433 (26%), Positives = 187/433 (43%), Gaps = 52/433 (12%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKI-RSHNTYQAQILPSNDISNS 59
L+HR +P ++ P+ R + + AR Y+ ++ + A + + S
Sbjct: 60 LVHRHGPCAPTQLSSDKPSSFTDR-LRRNRARSKYIMSRVSKGMMGDDADVSIPTHLGGS 118
Query: 60 L----FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD--CSPQLGTIFYPSRSSSYAY 113
+ + V + +G P V Q +DTGS L WV C PC C PQ +F PS+SS+YA
Sbjct: 119 VDSLEYVVTVGLGTPSVSQVLLIDTGSDLSWVQCQPCNSTTCYPQKDPLFDPSKSSTYAP 178
Query: 114 VPCDSEHCR------YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI 167
+PC+++ CR Y +C + Y G +T S E L +
Sbjct: 179 IPCNTDACRDLTDDGYGGGCASGDGAAQCGFAITYGDGSQTRGVYSNETLALA----PGV 234
Query: 168 HVQDVVFGCGFSTN-RNFKFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHD------ 216
V+D FGCG + N K+ G+ GLG SLV Q S +FSYC+ +L++
Sbjct: 235 AVKDFRFGCGHDQDGANDKYDGLLGLGGAPESLVVQTASVYGGAFSYCLPALNNQVGFLA 294
Query: 217 ---PDYLHNKLILGDGAIIDEGDATPL-QFIDGHYYITLEAISVDGRMLDINPNIFKRDD 272
++ G + TP+ + + Y + + I+V G +D+ P+ F
Sbjct: 295 LGGGGAPSGGVVNTSGFVF-----TPMIREEETFYVVNMTGITVGGEPIDVPPSAFS--- 346
Query: 273 SGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFP 332
GG++IDSGT VT L AY AL+ + + D CY S++ P
Sbjct: 347 --GGMIIDSGTVVTELQHTAYNALQAAFRKAMAAYPLVRNGELDT-CYDFSGYSNVT-LP 402
Query: 333 TVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
V F GGA + L+ + C+A + +++ ++G + Q+ V YD
Sbjct: 403 KVALTFSGGATIDLDVPNGILLDD----CLAFQESGPDDQP---GILGNVNQRTLEVLYD 455
Query: 393 IGRKQKTFQRMDC 405
GR + F+ C
Sbjct: 456 AGRGRVGFRAAVC 468
>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 115/370 (31%), Positives = 174/370 (47%), Gaps = 25/370 (6%)
Query: 44 NTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIF 103
N Q ++ + +++ + IG+PP + +DTGS + W+ C PC +C Q IF
Sbjct: 132 NALQGPVVSGTSQGSGEYFLRVGIGKPPSQAYVVLDTGSDVSWIQCAPCSECYQQSDPIF 191
Query: 104 YPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD 163
P S+SY+ + CD C+ + C C+Y Y G T +TE +T
Sbjct: 192 DPISSNSYSPIRCDEPQCKSLDLSEC--RNGTCLYEVSYGDGSYTVGEFATETVTL---- 245
Query: 164 ESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLH 221
+ V++V GCG + F +G+ GLG G+ S +Q+N +SFSYC+ + D D +
Sbjct: 246 -GSAAVENVAIGCGHNNEGLFVGAAGLLGLGGGKLSFPAQVNATSFSYCLVN-RDSDAVS 303
Query: 222 NKLILGDGAIIDEGDATPLQF---IDGHYYITLEAISVDGRMLDINPNIFKRD-DSGGGV 277
+ + + PL +D YY+ L+ ISV G L I + F+ D GGG+
Sbjct: 304 T--LEFNSPLPRNAATAPLMRNPELDTFYYLGLKGISVGGEALPIPESSFEVDAIGGGGI 361
Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRF 336
+IDSGT VT L E Y+ALRD + +G + S D CY + S + PTV F
Sbjct: 362 IIDSGTAVTRLRSEVYDALRDAFVKGAKGIPKANGVSLFDT-CYD-LSSRESVEIPTVSF 419
Query: 337 HFRGGAKLAL-EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGR 395
F G +L L ++ + FC A P + + S+IG + QQ VG+DI
Sbjct: 420 RFPEGRELPLPARNYLIPVDSVGTFCFAFAPTT-----SSLSIIGNVQQQGTRVGFDIAN 474
Query: 396 KQKTFQRMDC 405
F C
Sbjct: 475 SLVGFSVDSC 484
>gi|115434870|ref|NP_001042193.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|55296112|dbj|BAD67831.1| putative CDR1 [Oryza sativa Japonica Group]
gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa Japonica Group]
gi|113531724|dbj|BAF04107.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|125569253|gb|EAZ10768.1| hypothetical protein OsJ_00604 [Oryza sativa Japonica Group]
Length = 454
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 110/375 (29%), Positives = 174/375 (46%), Gaps = 38/375 (10%)
Query: 53 SNDISNSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG---TIFYPSRS 108
S +S S Y+ +++G PP DTGS L+WV C + + T F PSRS
Sbjct: 92 SKVVSRSFEYLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDPSRS 151
Query: 109 SSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES--- 165
S+Y V C ++ C A C + C Y Y G T+ +STE TF +
Sbjct: 152 STYGRVSCQTDACEALGRATCDDGSN-CAYLYAYGDGSNTTGVLSTETFTFDDGGSGRSP 210
Query: 166 -TIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPD 218
+ V V FGC +T +F G+ GLG G SLV+QL + FSYC+ P
Sbjct: 211 RQVRVGGVKFGCSTATAGSFPADGLVGLGGGAVSLVTQLGGATSLGRRFSYCL----VPH 266
Query: 219 YLHNKLILGDGAIIDEGD----ATPLQF--IDGHYYITLEAISVDGRMLDINPNIFKRDD 272
++ L GA+ D + +TPL +D +Y + L+++ V + +
Sbjct: 267 SVNASSALNFGALADVTEPGAASTPLVAGDVDTYYTVVLDSVKVGNKTV--------ASA 318
Query: 273 SGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYH--GIMSSDLKG 330
+ +++DSGT +T+L + DE+ R+ ++S +LCY+ G +
Sbjct: 319 ASSRIIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQSPDGLLQLCYNVAGREVEAGES 378
Query: 331 FPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
P + F GGA +AL+ ++ F + C+A+ A+ + V S++G +AQQ +VG
Sbjct: 379 IPDLTLEFGGGAAVALKPENAFVAVQEGTLCLAIV-ATTEQQPV--SILGNLAQQNIHVG 435
Query: 391 YDIGRKQKTFQRMDC 405
YD+ TF DC
Sbjct: 436 YDLDAGTVTFAGADC 450
>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
Length = 428
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 120/416 (28%), Positives = 179/416 (43%), Gaps = 35/416 (8%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
+ H +S SP+ PN + + ARL YL + + I I S
Sbjct: 36 VFHVNSPCSPFKQPN---TVSWESTLLKDKARLQYLSSLAKKPS---VPIASGRAIVQSP 89
Query: 61 FY-VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSE 119
Y V +IG P P A+DT + WV C C C+ + +F PS+SSS + CD+
Sbjct: 90 TYIVRANIGTPAQPMLVALDTSNDAAWVPCSGCVGCASSV--LFDPSKSSSSRNLQCDAP 147
Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC-GF 178
C+ P C+A K C + Y G ++ + LT N ++ FGC
Sbjct: 148 QCKQAPNPTCTAGKS-CGFNMTY-GGSTIEASLTQDTLTLAND-----VIKSYTFGCISK 200
Query: 179 STNRNFKFSGIFGLGIGRSSLVSQLN----SSFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
+T + G+ GLG G SL+SQ S+FSYC+ + ++ L LG
Sbjct: 201 ATGTSLPAQGLMGLGRGPLSLISQTQNLYMSTFSYCLPNSKSSNF-SGSLRLGPKYQPVR 259
Query: 235 GDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDS-GGGVMIDSGTDVTWLVK 290
TPL YY+ L I V +++DI + D S G G + DSGT T LV+
Sbjct: 260 IKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDASTGAGTIFDSGTVFTRLVE 319
Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS 350
AY A+R+E R++ S D CY G + +P+V F F G + L D+
Sbjct: 320 PAYVAVRNEFRRRIKNANATSLGGFDT-CYSGSVV-----YPSVTFMF-AGMNVTLPPDN 372
Query: 351 MF-YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ + C+A+ A+ NN ++I M QQ + V D+ + R C
Sbjct: 373 LLIHSSSGSTSCLAM-AAAPNNVNSVLNVIASMQQQNHRVLIDLPNSRLGISRETC 427
>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 474
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 133/447 (29%), Positives = 203/447 (45%), Gaps = 64/447 (14%)
Query: 3 HRDSILSPY-NNPNENPAHRVQRGINIS-IARLAYLQEKIRSHNTYQ----------AQI 50
H S SP N P++ V G+ S AR++ LQ +I S+ + A
Sbjct: 47 HISSSFSPGPNRPSKTSRGEVDGGVLSSDAARVSSLQRRIESYRSSSEGEEEEASKLALQ 106
Query: 51 LPSNDISN--SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRS 108
+P +N +L YV ++G +DT S L WV C PC C Q +F PS S
Sbjct: 107 VPITSGANLRTLNYV-ATVGLGAAEATVVVDTASELTWVQCQPCESCHDQQDPLFDPSSS 165
Query: 109 SSYAYVPCDSEHCRYF---------PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTF 159
SYA VPC+S C P A + + C Y Y G + ++ ++L
Sbjct: 166 PSYAAVPCNSSSCDALRVAMAAGTSPCADDNEQQPACSYALSYRDGSYSRGVLARDKLRL 225
Query: 160 KNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGL-GIGRS--SLVS----QLNSSFSYCI- 211
D ++ VFGCG ++N+ F G GL G+GRS SLVS Q FSYC+
Sbjct: 226 AGQD-----IEGFVFGCG-TSNQGAPFGGTSGLMGLGRSHVSLVSQTMDQFGGVFSYCLP 279
Query: 212 -------GSLHDPD----YLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRM 260
GSL D Y ++ I+ + D G PLQ Y++ L I+V G+
Sbjct: 280 MRESGSSGSLVLGDDSSAYRNSTPIVYTAMVSDSG---PLQ--GPFYFLNLTGITVGGQ- 333
Query: 261 LDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRL-EGEQMRSYSWPDKLC 319
++ F S G V+IDSGT +T LV Y A+R E + +L E Q ++S D C
Sbjct: 334 -EVESPWF----SAGRVIIDSGTIITTLVPSVYNAVRAEFLSQLAEYPQAPAFSILDT-C 387
Query: 320 YHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLI 379
++ +++ P+++F F G ++ ++ + Y DA + + AS+ + Y + S+I
Sbjct: 388 FNLTGLKEVQ-VPSLKFVFEGSVEVEVDSKGVLYFVSSDASQVCLALASLKSEY-DTSII 445
Query: 380 GMMAQQFYNVGYDIGRKQKTFQRMDCE 406
G Q+ V +D Q F + C+
Sbjct: 446 GNYQQKNLRVIFDTLGSQIGFAQETCD 472
>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 493
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 102/384 (26%), Positives = 168/384 (43%), Gaps = 57/384 (14%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
L+Y + +G PPV +DTGS +LWV C C C G F P SS+ + +
Sbjct: 77 LYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCNGCPQTSGLQIQLNFFDPGSSSTSSMI 136
Query: 115 PCDSEHC---RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
C + C + A CS+ ++C YT Y G TS + ++ + E ++
Sbjct: 137 ACSDQRCNNGKQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSMTTNS 196
Query: 172 ---VVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYC------- 210
VVFGC G T + GIFG G S++SQL+S FS+C
Sbjct: 197 TAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRIFSHCLKGDSSG 256
Query: 211 -----IGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINP 265
+G + +P+ ++ L+ A P HY + L++ISV+G+ L I+
Sbjct: 257 GGILVLGEIVEPNIVYTSLV----------PAQP------HYNLNLQSISVNGQTLQIDS 300
Query: 266 NIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMS 325
++F +S G ++DSGT + +L +EAY+ + + + +R+ CY I S
Sbjct: 301 SVFATSNS-RGTIVDSGTTLAYLAEEAYDPFVSAITAAIP-QSVRTVVSRGNQCYL-ITS 357
Query: 326 SDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP--DAFCMAVNPASINNRYVNFSLIGMMA 383
S FP V +F GGA + L Q A + I + + +++G +
Sbjct: 358 SVTDVFPQVSLNFAGGASMILRPQDYLIQQNSIGGAAVWCIGFQKIQGQGI--TILGDLV 415
Query: 384 QQFYNVGYDIGRKQKTFQRMDCEV 407
+ V YD+ ++ + DC +
Sbjct: 416 LKDKIVVYDLAGQRIGWANYDCSL 439
>gi|125532788|gb|EAY79353.1| hypothetical protein OsI_34482 [Oryza sativa Indica Group]
Length = 394
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 95/361 (26%), Positives = 160/361 (44%), Gaps = 31/361 (8%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ N +IG PP P +D L+W C C C Q +F P+ S++Y PC +
Sbjct: 51 YVANFTIGTPPQPASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPCGTPL 110
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C P + + C Y Q +T V T+ S + FGC ++
Sbjct: 111 CESIPSDSRNCSGNVCAY-QASTNAGDTGGKVGTDTFAVGTAKAS------LAFGCVVAS 163
Query: 181 NRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGD- 236
+ + SGI GLG SLV+Q ++FSYC+ HD ++ L LG A + G
Sbjct: 164 DIDTMGGPSGIVGLGRTPWSLVTQTGVAAFSYCLAP-HDAGK-NSALFLGSSAKLAGGGK 221
Query: 237 --ATPLQFIDG-------HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
+TP I G +Y + LE + M+ + P SG V++D+ + +++
Sbjct: 222 AASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPP-------SGSTVLLDTFSPISF 274
Query: 288 LVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALE 347
LV AY+A++ V + + M + P LC+ +S P + F FRGGA + +
Sbjct: 275 LVDGAYQAVKKAVTVAVGAPPMATPVEPFDLCFPKSGASGAA--PDLVFTFRGGAAMTVA 332
Query: 348 KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
+ + C+A+ ++ N SL+G + Q+ + +D+ ++ +F+ DC
Sbjct: 333 ASNYLLDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPADCTK 392
Query: 408 L 408
L
Sbjct: 393 L 393
>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 99/367 (26%), Positives = 168/367 (45%), Gaps = 31/367 (8%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSE 119
+ V++ IG PP Q +DTGS L W+ C+ P T+F PS SSS++ +PC+
Sbjct: 76 ILLVSLPIGTPPQSQQMILDTGSQLSWIQCHKKVPRKPPPSTVFDPSLSSSFSVLPCNHP 135
Query: 120 HCRY----FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
C+ F C Y+ Y G + E++TF +T +ST ++ G
Sbjct: 136 LCKPRIPDFTLPTSCDLNRLCHYSYFYADGTLAEGNLVREKITF-STSQST---PPLILG 191
Query: 176 CGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLH-DPDYL-HNKLILGDG--- 229
C + + GI G+ +GR S SQ + FSYC+ + P + LG+
Sbjct: 192 CAEDASDD---KGILGMNLGRLSFASQAKITKFSYCVPTRQVRPGFTPTGSFYLGENPNS 248
Query: 230 ------AIIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMIDS 281
+++ + + +D + + L+ I + + L+I + F+ D SG G MIDS
Sbjct: 249 AGFQYISLLTFSQSQRMPNLDPLAHTVALQGIRIGNKKLNIPVSAFRADPSGAGQSMIDS 308
Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMRS---YSWPDKLCYHGIMSSDLKGFPTVRFHF 338
G++ T+LV AY +R+EV +RL G +++ YS +C+ G + + F F
Sbjct: 309 GSEFTYLVDVAYNKVREEV-VRLAGPRLKKGYVYSGVSDMCFDGNAMEIGRLIGNMVFEF 367
Query: 339 RGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQK 398
G ++ +EK + C+ + + + N +IG QQ V +DI ++
Sbjct: 368 DKGVEIVIEKGRVLADVGGGVHCVGIGRSEMLGAASN--IIGNFHQQNLWVEFDIANRRV 425
Query: 399 TFQRMDC 405
F + DC
Sbjct: 426 GFGKADC 432
>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 414
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 125/433 (28%), Positives = 192/433 (44%), Gaps = 52/433 (12%)
Query: 3 HRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSH------NTYQAQILPSNDI 56
HRD S + + N ++Q+ + + R+ LQ +I+S + +QI S+ +
Sbjct: 3 HRDFCNSSGKSTDWN--KKLQKSLILDDFRVRSLQSRIKSIFSGNNIDALDSQIPLSSGV 60
Query: 57 S-NSLFY-VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYV 114
+L Y V + IG + +DTGS L WV C PCR C Q +F PS S SY +
Sbjct: 61 RLQTLNYIVTVEIGGRNMT--VIVDTGSDLTWVQCQPCRLCYNQQDPLFNPSGSPSYQTI 118
Query: 115 PCDSEHCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHV 169
C+S C+ YA C + C Y Y G T + EQL T HV
Sbjct: 119 LCNSSTCQSLQYATGNLGVCGSNTPTCNYVVNYGDGSYTRGDLGMEQLNL-----GTTHV 173
Query: 170 QDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKL 224
+ +FGCG + F SG+ GLG SLVSQ ++ FSYC+ + L
Sbjct: 174 SNFIFGCGRNNKGLFGGASGLMGLGKSDLSLVSQTSAIFEGVFSYCLPTTAAD--ASGSL 231
Query: 225 ILGDGAIIDEGDATPLQF--------IDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
ILG + + + + TP+ + + Y++ L IS+ G L PN + G
Sbjct: 232 ILGGNSSVYK-NTTPISYTRMIANPQLPTFYFLNLTGISIGGVALQA-PNYRQS-----G 284
Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
++IDSGT +T L Y L+ E + + G +S D C++ + D PT+R
Sbjct: 285 ILIDSGTVITRLPPPVYRDLKAEFLKQFSGFPSAPPFSILDT-CFN-LNGYDEVDIPTIR 342
Query: 336 FHFRGGAKLALEKDSMFYQPRPDA--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
F G A+L ++ +FY + DA C+A+ S ++ +IG Q+ V Y+
Sbjct: 343 MQFEGNAELTVDVTGIFYFVKTDASQVCLALASLSFDDE---IPIIGNYQQRNQRVIYNT 399
Query: 394 GRKQKTFQRMDCE 406
+ F C
Sbjct: 400 KESKLGFAAEACS 412
>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
Length = 474
Score = 122 bits (305), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 112/401 (27%), Positives = 169/401 (42%), Gaps = 59/401 (14%)
Query: 32 RLAYLQEKIRSHNTYQ---------AQILPSN---DISNSLFYVNISIGQPPVPQFTAMD 79
R Y+ ++ T Q +P+N +I + V +S+G P V Q +D
Sbjct: 99 RAEYILRRVSGRGTPQLWDSKAEAATATVPANWGFNIGTLNYVVTVSLGTPGVAQTLEVD 158
Query: 80 TGSSLLWVHCYPCR--DCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCI 137
TGS L WV C PC C Q +F P++SSSYA VPC C S +C
Sbjct: 159 TGSDLSWVQCTPCAAPACYSQKDPLFDPAQSSSYAAVPCGGPVCGGLGIYASSCSAAQCG 218
Query: 138 YTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRS 197
Y Y G +T+ S++ LT D V+ FGCG + + G+ GLG +
Sbjct: 219 YVVSYGDGSKTTGVYSSDTLTLSPNDA----VRGFFFGCGHAQSGFTGNDGLLGLGREEA 274
Query: 198 SLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDG-----HYY 248
SLV Q + FSYC+ + P + G G +T Q + +Y
Sbjct: 275 SLVEQTAGTYGGVFSYCLPT--RPSTTGYLTLGGPSGAAPPGFST-TQLLSSPNAATYYV 331
Query: 249 ITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQ 308
+ L ISV G+ L + ++F GG ++D+GT +T L AY ALR
Sbjct: 332 VMLTGISVGGQQLSVPSSVFA-----GGTVVDTGTVITRLPPTAYAALRSAFR-----SG 381
Query: 309 MRSYSWPDKLCYHGIMSS--DLKGF-----PTVRFHFRGGAKLALEKDSMFYQPRPDAFC 361
M SY +P GI+ + + G+ P V F GGA + L D + C
Sbjct: 382 MASYGYPSAPA-TGILDTCYNFSGYGTVTLPNVALTFSGGATVTLGADGIL-----SFGC 435
Query: 362 MAVNPASINNRYVNFSLIGMMAQQFYNV---GYDIGRKQKT 399
+A P+ + +++G + Q+ + V G +G K +
Sbjct: 436 LAFAPSGSDG---GMAILGNVQQRSFEVRIDGTSVGFKPSS 473
>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 459
Score = 121 bits (304), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 102/373 (27%), Positives = 167/373 (44%), Gaps = 37/373 (9%)
Query: 54 NDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRS 108
+ + L+Y I +G PP + +DTGS + WV+C PC +C +IF P +S
Sbjct: 41 DTFTTGLYYTRIYLGTPPQQFYVHVDTGSDVAWVNCVPCTNCKRASNVALPISIFDPEKS 100
Query: 109 SSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT----DE 164
+S + C E C ++CS C Y+ LY G T+ ++ + L+F
Sbjct: 101 TSKTSISCTDEECYLASNSKCSFNSMSCPYSTLYGDGSSTAGYLINDVLSFNQVPSGNST 160
Query: 165 STIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPD 218
+T + FGCG + + G+ G G SL SQL+ F++C L +
Sbjct: 161 ATSGTARLTFGCGSNQTGTWLTDGLVGFGQAEVSLPSQLSKQNVSVNIFAHC---LQGDN 217
Query: 219 YLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
L++G I + G TP+ HY + L I V G + P F +S GGV
Sbjct: 218 KGSGTLVIGH--IREPGLVYTPIVPKQSHYNVELLNIGVSGTNV-TTPTAFDLSNS-GGV 273
Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG-FPTVRF 336
++DSGT +T+LV+ AY+ + +V + MRS P + ++G FP V
Sbjct: 274 IMDSGTTLTYLVQPAYDQFQAKVR-----DCMRSGVLPVAFQFF----CTIEGYFPNVTL 324
Query: 337 HFRGGAKLALEKDSMFYQPR----PDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
+F GGA + L S Y+ A+C + ++ Y+++++ G + V YD
Sbjct: 325 YFAGGAAMLLSPSSYLYKEMLTTGLSAYCFSWLESTSVYGYLSYTIFGDNVLKDQLVVYD 384
Query: 393 IGRKQKTFQRMDC 405
+ ++ DC
Sbjct: 385 NVNNRIGWKNFDC 397
>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 121 bits (304), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 112/383 (29%), Positives = 172/383 (44%), Gaps = 38/383 (9%)
Query: 30 IARLAYLQEKIRSHNT-------YQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGS 82
+ R+ L ++ S +T + ++++ D + ++V I +G PP Q+ +D+GS
Sbjct: 5 VKRVVSLIRRVSSGSTASYGVEDFGSEVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGS 64
Query: 83 SLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLY 142
++WV C PC C Q +F P+ S+S+ V C S C A C++ RC Y Y
Sbjct: 65 DIVWVQCKPCTQCYHQTDPLFDPADSASFMGVSCSSAVCDQVDNAGCNS--GRCRYEVSY 122
Query: 143 LIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRS-SLVS 201
G T ++ E LT T VQ+V GCG F + G S S V
Sbjct: 123 GDGSSTKGTLALETLTLGRT-----VVQNVAIGCGHMNQGMFVGAAGLLGLGGGSMSFVG 177
Query: 202 QLN----SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLEAI 254
QL+ ++FSYC+ + + L G A+ PL +YYI L +
Sbjct: 178 QLSRERGNAFSYCL--VSRVTNSNGFLEFGSEAMPVGAAWIPLIRNPHSPSYYYIGLSGL 235
Query: 255 SVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYS 313
V + I+ +IF+ + G GGV++D+GT VT AYEA RD + + S
Sbjct: 236 GVGDMKVPISEDIFELTELGNGGVVMDTGTAVTRFPTVAYEAFRDAFIDQTGNLPRASGV 295
Query: 314 WPDKLCYH--GIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDA--FCMAVNPASI 369
CY+ G +S + PTV F+F GG L L ++ F P DA FC A P+
Sbjct: 296 SIFDTCYNLFGFLSVRV---PTVSFYFSGGPILTLPANN-FLIPVDDAGTFCFAFAPSP- 350
Query: 370 NNRYVNFSLIGMMAQQFYNVGYD 392
S++G + Q+ + D
Sbjct: 351 ----SGLSILGNIQQEGIQISVD 369
>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Glycine max]
Length = 392
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 118/411 (28%), Positives = 172/411 (41%), Gaps = 49/411 (11%)
Query: 26 INISIARLAYLQEKIRSH----NTYQ---AQILPSND---ISNSLFYVNISIGQPPVPQF 75
+N+ R+ Y+Q ++ + NT + + LP+ I ++ + V + +G P
Sbjct: 1 MNLDNERVKYIQSRLSKNLGRENTVKDLDSTTLPAESGSLIGSANYVVVVGLGTPKRDLS 60
Query: 76 TAMDTGSSLLWVHCYPCR-DCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFP----YARCS 130
DTGS L W C PC C Q IF PS+SSSY + C S C + CS
Sbjct: 61 LVFDTGSDLTWTQCEPCAGSCYKQQDAIFDPSKSSSYTNITCTSSLCTQLTSDGIKSECS 120
Query: 131 AYK-HRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFS-G 188
+ CIY Y + F+S E+LT TD V D +FGCG F S G
Sbjct: 121 SSTDASCIYDAKYGDNSTSVGFLSQERLTITATD----IVDDFLFGCGQDNEGLFNGSAG 176
Query: 189 IFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEGDA-TPLQFI 243
+ GLG S+V Q +S+ FSYC+ + L G A + TPL I
Sbjct: 177 LMGLGRHPISIVQQTSSNYNKIFSYCLPATSSS---LGHLTFGASAATNASLIYTPLSTI 233
Query: 244 DGH---YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEV 300
G Y + + +ISV G L P + S GG +IDSGT +T L Y ALR
Sbjct: 234 SGDNSFYGLDIVSISVGGTKL---PAVSSSTFSAGGSIIDSGTVITRLAPTVYAALRSAF 290
Query: 301 MIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----PTVRFHFRGGAKLALEKDSMFYQP 355
+E + + + CY DL G+ P + F F GG + L +
Sbjct: 291 RRXMEKYPVANEAGLLDTCY------DLSGYKEISVPRIDFEFSGGVTVELXHRGILXVE 344
Query: 356 RPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
C+A +N + ++ G + Q+ V YD+ + F C+
Sbjct: 345 SEQQVCLAFAANGSDN---DITVFGNVQQKTLEVVYDVKGGRIGFGAAGCK 392
>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
Length = 510
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 107/383 (27%), Positives = 164/383 (42%), Gaps = 51/383 (13%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+YV + +G P V MDTGS + W+ C PC+DC P L F P SSS+ +PC S
Sbjct: 138 YYVPLQLGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFNPRHSSSFFKLPCASST 197
Query: 121 CRYF-----PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT----DESTIHVQD 171
C P+ CS C+++ Y G +S ++ E + NT D + + +
Sbjct: 198 CTNVYQGVKPF--CSPSGRTCLFSIQYGDGSLSSGLLAMETIA-GNTPNFGDGEPVKLSN 254
Query: 172 VVFGCGFSTNRNFK--FSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHN--- 222
+ GC SG+ G+ S SQL+S FS+C PD + +
Sbjct: 255 ITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCF-----PDKIAHLNS 309
Query: 223 --KLILGDGAIID---------EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRD 271
+ G+ II + A P +D +YY+ L ISVD L ++ F D
Sbjct: 310 SGLVFFGESDIISPYLRYTPLVQNPAVPSASLD-YYYVGLVGISVDESRLPLSHKNFDID 368
Query: 272 D--SGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYH---GIMSS 326
GG +IDSGT T+L K A++A+R E + R + CY+ G +
Sbjct: 369 KVTGSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFTPCYNITSGTAAL 428
Query: 327 DLKGFPTVRFHFRGGAKLALEKDSMFY----QPRPDAFCMAVNPASINNRYVNFSLIGMM 382
+ P++ HFRGG + L K+S+ C+A + + F++IG
Sbjct: 429 ESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFQ----MSGDIPFNIIGNY 484
Query: 383 AQQFYNVGYDIGRKQKTFQRMDC 405
QQ V YD+ + + C
Sbjct: 485 QQQNLWVEYDLEKLRLGIAPAQC 507
>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 99/372 (26%), Positives = 169/372 (45%), Gaps = 34/372 (9%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
L++ + +G PP + +DTGS +LWV C C C G F P SS+ + +
Sbjct: 67 LYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASLI 126
Query: 115 PCDSEHCRYFPY---ARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI--HV 169
C + C A CS+ ++CIYT Y G TS + ++ L F S++
Sbjct: 127 SCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTNSS 186
Query: 170 QDVVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPD 218
+VFGC S T + GIFG G S++SQ++S FS+C+
Sbjct: 187 ASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDGGGG 246
Query: 219 YLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVM 278
+ + + I+ +PL HY + L++ISV+G+ L I+P +F + G +
Sbjct: 247 GILVLGEIVEEDIV----YSPLVPSQPHYNLNLQSISVNGKSLAIDPEVFAT-STNRGTI 301
Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG-FPTVRFH 337
+DSGT + +L +EAY+ + + + +R CY +++S +KG FPTV +
Sbjct: 302 VDSGTTLAYLAEEAYDPFVSAITEAVS-QSVRPLLSKGTQCY--LITSSVKGIFPTVSLN 358
Query: 338 FRGGAKLALEKDSMFYQPRP--DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGR 395
F GG + L+ + Q DA + I + +++G + + YD+
Sbjct: 359 FAGGVSMNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQ--GITILGDLVLKDKIFVYDLAG 416
Query: 396 KQKTFQRMDCEV 407
++ + DC +
Sbjct: 417 QRIGWANYDCSM 428
>gi|125528511|gb|EAY76625.1| hypothetical protein OsI_04577 [Oryza sativa Indica Group]
Length = 492
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 114/370 (30%), Positives = 162/370 (43%), Gaps = 49/370 (13%)
Query: 57 SNSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
+N+ YV + IG PP A+D S L+W C +P F P RS++ A VP
Sbjct: 95 TNAGMYVFSYGIGTPPQQVSGALDISSDLVWTAC---GATAP-----FNPVRSTTVADVP 146
Query: 116 CDSEHCRYFPYARC----SAYKHRCIYTQLYLIG-PETSVFVSTEQLTFKNTDESTIHVQ 170
C + C+ F C A C YT +Y G T+ + TE TF +T +
Sbjct: 147 CTDDACQQFAPQTCGAGAGAGSSECAYTYMYGGGAANTTGLLGTEAFTFGDT-----RID 201
Query: 171 DVVFGCGFSTNRNFK-FSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGD 228
VVFGCG +F SG+ GLG G SLVSQL FSY D + ++ GD
Sbjct: 202 GVVFGCGLQNVGDFSGVSGVIGLGRGNLSLVSQLQVDRFSYHFAPDDSVD-TQSFILFGD 260
Query: 229 GAIIDEGDATP---------LQFIDGH---YYITLEAISVDGRMLDINPNIF--KRDDSG 274
DATP L D + YY+ L I VDG+ L I F + D
Sbjct: 261 -------DATPQTSHTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGS 313
Query: 275 GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTV 334
GGV + VT L + AY+ LR V ++ + + LCY G + K P++
Sbjct: 314 GGVFLSITDLVTVLEEAAYKPLRQAVASKIGLPAVNGSALGLDLCYTGESLAKAK-VPSM 372
Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAF-CMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
F GGA + LE + FY C+ + P+S + S++G + Q ++ YDI
Sbjct: 373 ALVFAGGAVMELELGNYFYMDSTTGLACLTILPSSAGDG----SVLGSLIQVGTHMMYDI 428
Query: 394 GRKQKTFQRM 403
+ F+ +
Sbjct: 429 NGSKLVFESL 438
>gi|356539352|ref|XP_003538162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 99/319 (31%), Positives = 150/319 (47%), Gaps = 32/319 (10%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTI------FYPSRSSSYAY 113
L+Y + +G PP + +DTGS +LWV C C C PQ + F P SS+ +
Sbjct: 76 LYYTKVKLGTPPRELYVQIDTGSDVLWVSCGSCNGC-PQTSGLQIQLNYFDPGSSSTSSL 134
Query: 114 VPCDSEHCR---YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
+ C CR A CS ++C YT Y G TS + ++ + F + E T+
Sbjct: 135 ISCLDRRCRSGVQTSDASCSGRNNQCTYTFQYGDGSGTSGYYVSDLMHFASIFEGTLTTN 194
Query: 171 ---DVVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHD 216
VVFGC G T GIFG G S++SQL+S FS+C L
Sbjct: 195 SSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIAPRVFSHC---LKG 251
Query: 217 PDYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
+ L+LG+ I++ +PL HY + L++ISV+G+++ I P++F ++
Sbjct: 252 DNSGGGVLVLGE--IVEPNIVYSPLVPSQPHYNLNLQSISVNGQIVRIAPSVFATSNN-R 308
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
G ++DSGT + +L +EAY + + + +RS CY SS++ FP V
Sbjct: 309 GTIVDSGTTLAYLAEEAYNPFVIAIAAVIP-QSVRSVLSRGNQCYLITTSSNVDIFPQVS 367
Query: 336 FHFRGGAKLALEKDSMFYQ 354
+F GGA L L Q
Sbjct: 368 LNFAGGASLVLRPQDYLMQ 386
>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
Length = 325
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 99/344 (28%), Positives = 152/344 (44%), Gaps = 34/344 (9%)
Query: 75 FTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKH 134
F +DTGS + W+ C PC C Q ++F P+ S++Y +PC+S C+ S
Sbjct: 2 FLLIDTGSDITWIQCDPCPQCYKQQDSLFQPAGSATYKPLPCNSTMCQQLQSFSHSCLNS 61
Query: 135 RCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGL-G 193
C Y Y T + E LT ++ D + V + FGCG + N+ F+G GL G
Sbjct: 62 SCNYMVSYGDKSTTRGDFALETLTLRSDDTILVSVPNFAFGCGHA-NKGL-FNGAAGLMG 119
Query: 194 IGRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFID-- 244
+G+SS+ +S FSYC+ S+ L G+ A++D + TPL +D
Sbjct: 120 LGKSSIGFPAQTSVAFGKVFSYCLPSVSS-TIPSGILHFGEAAMLDYDVRFTPL--VDSS 176
Query: 245 ---GHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVM 301
Y++++ I+V +L I+ VM+DSGT ++ + AYE LRD
Sbjct: 177 SGPSQYFVSMTGINVGDELLPISAT----------VMVDSGTVISRFEQSAYERLRDAFT 226
Query: 302 IRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFC 361
L G Q P C+ + + D P + HFR A+L L + Y C
Sbjct: 227 QILPGLQTAVSVAPFDTCFR-VSTVDDINIPLITLHFRDDAELRLSPVHILYPVDDGVMC 285
Query: 362 MAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
A P+S S++G QQ YDI + + +C
Sbjct: 286 FAFAPSSSGR-----SVLGNFQQQNLRFVYDIPKSRLGISAFEC 324
>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
Length = 499
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 99/372 (26%), Positives = 169/372 (45%), Gaps = 34/372 (9%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
L++ + +G PP + +DTGS +LWV C C C G F P SS+ + +
Sbjct: 82 LYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASLI 141
Query: 115 PCDSEHCRYFPY---ARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI--HV 169
C + C A CS+ ++CIYT Y G TS + ++ L F S++
Sbjct: 142 SCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTNSS 201
Query: 170 QDVVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPD 218
+VFGC S T + GIFG G S++SQ++S FS+C+
Sbjct: 202 ASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDGGGG 261
Query: 219 YLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVM 278
+ + + I+ +PL HY + L++ISV+G+ L I+P +F + G +
Sbjct: 262 GILVLGEIVEEDIV----YSPLVPSQPHYNLNLQSISVNGKSLAIDPEVFAT-STNRGTI 316
Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG-FPTVRFH 337
+DSGT + +L +EAY+ + + + +R CY +++S +KG FPTV +
Sbjct: 317 VDSGTTLAYLAEEAYDPFVSAITEAVS-QSVRPLLSKGTQCY--LITSSVKGIFPTVSLN 373
Query: 338 FRGGAKLALEKDSMFYQPRP--DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGR 395
F GG + L+ + Q DA + I + +++G + + YD+
Sbjct: 374 FAGGVSMNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQ--GITILGDLVLKDKIFVYDLAG 431
Query: 396 KQKTFQRMDCEV 407
++ + DC +
Sbjct: 432 QRIGWANYDCSM 443
>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
Length = 443
Score = 121 bits (303), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 121/442 (27%), Positives = 181/442 (40%), Gaps = 56/442 (12%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQ--ILPSN---D 55
++HR SP P++ P+ + AR+ + I + Q LP+
Sbjct: 22 VMHRHGPCSPLQTPDDAPSDADL--LEHDQARVDSIHRMIANETAVVGQDVSLPAERGIS 79
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD--CSPQLGTIFYPSRSSSYAY 113
+ + V++ +G P DTGS L WV C PC C Q +F PS SS+++
Sbjct: 80 VGTGNYVVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYHQQDPLFAPSSSSTFSA 139
Query: 114 VPCDSEHCRYFPYAR--CSAYK--HRCIYTQLYLIGPETSVFVSTEQLTFKNT------D 163
V C C P AR CS+ RC Y +Y T + + LT T +
Sbjct: 140 VRCGEPEC---PRARQSCSSSPGDDRCPYEVVYGDKSRTVGHLGNDTLTLGTTPSTNASE 196
Query: 164 ESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPD 218
++ + VFGCG + F K G+FGLG G+ SL SQ FSYC+ S
Sbjct: 197 NNSNKLPGFVFGCGENNTGLFGKADGLFGLGRGKVSLSSQAAGKYGEGFSYCLPS--SSS 254
Query: 219 YLHNKLILGDGAIID-EGDATPL---QFIDGHYYITLEAISVDGRMLDIN--PNIFKRDD 272
H L LG A TP+ YY+ L I V GR + ++ P ++
Sbjct: 255 NAHGYLSLGTPAPAPAHARFTPMLNRSNTPSFYYVKLVGIRVAGRAIKVSSRPALWP--- 311
Query: 273 SGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK-------LCYHGIMS 325
G+++DSGT +T L AY ALR + M Y + CY
Sbjct: 312 --AGLIVDSGTVITRLAPRAYSALRTAFL-----SAMGKYGYKRAPRLSILDTCYDFTAH 364
Query: 326 SDLK-GFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQ 384
++ P V F GGA ++++ + Y + C+A P N + ++G Q
Sbjct: 365 ANATVSIPAVALVFAGGATISVDFSGVLYVAKVAQACLAFAP---NGNGRSAGILGNTQQ 421
Query: 385 QFYNVGYDIGRKQKTFQRMDCE 406
+ V YD+GR++ F C
Sbjct: 422 RTVAVVYDVGRQKIGFAAKGCS 443
>gi|115489316|ref|NP_001067145.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|77556903|gb|ABA99699.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113649652|dbj|BAF30164.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|125537189|gb|EAY83677.1| hypothetical protein OsI_38901 [Oryza sativa Indica Group]
Length = 446
Score = 121 bits (303), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 129/425 (30%), Positives = 178/425 (41%), Gaps = 54/425 (12%)
Query: 15 NENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQ 74
N V+R + RLA+L + P + + IG PP
Sbjct: 45 NYTAEELVRRAVAAGKQRLAFLDAAMAGGGDGGGVGAPVR-WATLQYVAEYLIGDPPQRA 103
Query: 75 FTAMDTGSSLLWVHCYPC--RDCSPQLGTIFYPSRSSSYAYVPCDSEHCR------YF-- 124
+DTGS L+W C C + C+ Q + S SS++A VPC + C +F
Sbjct: 104 EALIDTGSDLVWTQCSTCLRKVCARQALPYYNSSASSTFAPVPCAARICAANDDIIHFCD 163
Query: 125 PYARCSAYKHRCIYTQLYLIGPETSVFVS-TEQLTFKNTDESTIHVQDVVFGCGFSTNRN 183
A CS +G E F S T +L F + I VQ + G
Sbjct: 164 LAAGCSVIAGYGAGVVAGTLGTEAFAFQSGTAELAFGCVTFTRI-VQGALHGA------- 215
Query: 184 FKFSGIFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNK-----LILGDGAIID-EGD 236
SG+ GLG GR SLVSQ ++ FSYC+ Y HN L +G A + GD
Sbjct: 216 ---SGLIGLGRGRLSLVSQTGATKFSYCL-----TPYFHNNGATGHLFVGASASLGGHGD 267
Query: 237 ATPLQFIDG-----HYYITLEAISVDGRMLDINPNIFKRDDSG-----GGVMIDSGTDVT 286
QF+ G YY+ L ++V L I +F + GGV+IDSG+ T
Sbjct: 268 VMTTQFVKGPKGSPFYYLPLIGLTVGETRLPIPATVFDLREVAPGLFSGGVIIDSGSPFT 327
Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPD--KLCYHGIMSSDL-KGFPTVRFHFRGGAK 343
LV +AY+AL E+ RL G + D LC + D+ + P V FHFRGGA
Sbjct: 328 SLVHDAYDALASELAARLNGSLVAPPPDADDGALC---VARRDVGRVVPAVVFHFRGGAD 384
Query: 344 LALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
+A+ +S ++ P A A + Y S+IG QQ V YD+ +FQ
Sbjct: 385 MAVPAES-YWAPVDKA--AACMAIASAGPYRRQSVIGNYQQQNMRVLYDLANGDFSFQPA 441
Query: 404 DCEVL 408
DC L
Sbjct: 442 DCSAL 446
>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 560
Score = 121 bits (303), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 105/370 (28%), Positives = 167/370 (45%), Gaps = 32/370 (8%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+++++ +G PP +DTGS L W+ C PC C Q G + P SSS+ + C
Sbjct: 195 YFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCYACFEQNGPYYDPKDSSSFKNITCHDPR 254
Query: 121 CRYF----PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFK-NTDESTIH---VQDV 172
C+ P C C Y Y T+ + E T T E V++V
Sbjct: 255 CQLVSSPDPPQPCKGETQSCPYFYWYGDSSNTTGDFALETFTVNLTTPEGKPELKIVENV 314
Query: 173 VFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILG 227
+FGCG F +G+ GLG G S +QL S SFSYC+ + + +KLI G
Sbjct: 315 MFGCGHWNRGLFHGAAGLLGLGRGPLSFATQLQSLYGHSFSYCLVDRNSNSSVSSKLIFG 374
Query: 228 -DGAIIDEGDATPLQFIDGH-------YYITLEAISVDGRMLDINPNIFK-RDDSGGGVM 278
D ++ + F+ G YY+ +++I V G +L I + GGG +
Sbjct: 375 EDKELLSHPNLNFTSFVGGKENPVDTFYYVLIKSIMVGGEVLKIPEETWHLSAQGGGGTI 434
Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYH--GIMSSDLKGFPTVRF 336
IDSGT +T+ + AYE +++ M +++G + P K CY+ G+ +L F +
Sbjct: 435 IDSGTTLTYFAEPAYEIIKEAFMRKIKGFPLVETFPPLKPCYNVSGVEKMELPEFAIL-- 492
Query: 337 HFRGGAKLALEKDSMFYQPRP-DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGR 395
F GA ++ F Q P D C+A+ + S+IG QQ +++ YD+ +
Sbjct: 493 -FADGAMWDFPVENYFIQIEPEDVVCLAI----LGTPRSALSIIGNYQQQNFHILYDLKK 547
Query: 396 KQKTFQRMDC 405
+ + M C
Sbjct: 548 SRLGYAPMKC 557
>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 489
Score = 121 bits (303), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 121/420 (28%), Positives = 188/420 (44%), Gaps = 54/420 (12%)
Query: 19 AHRVQRGINISIARLAYLQEKIRS-------HNTYQAQILPSNDIS-NSLFY-VNISIGQ 69
+++R + + R+ LQ +I++ + + QI ++ I +L Y V + +G
Sbjct: 87 GKKMRRALLLDNIRVQSLQLRIKAMTSSTTEQSVSETQIPLTSGIKLETLNYIVTVELGG 146
Query: 70 PPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYAR- 128
+ +DTGS L WV C PCR C Q G ++ PS SSSY V C+S C+ A
Sbjct: 147 KNMSLI--VDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATG 204
Query: 129 ----CSAY----KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C + K C Y Y G T +++E + +T ++++VFGCG +
Sbjct: 205 NSGPCGGFNGVVKTTCEYVVSYGDGSYTRGDLASESIVLGDT-----KLENLVFGCGRNN 259
Query: 181 NRNF-KFSGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
F SG+ GLG SLVSQ N FSYC+ SL D L G+ + +
Sbjct: 260 KGLFGGASGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGA--SGTLSFGNDFSVYKN 317
Query: 236 DA----TPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
TPL + Y + L S+ G L K G G++IDSGT +T L
Sbjct: 318 STSVFYTPLVQNPQLRSFYILNLTGASIGGVEL-------KTLSFGRGILIDSGTVITRL 370
Query: 289 VKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALE 347
Y+A++ E + + G YS D C++ D+ PT++ F G A+L ++
Sbjct: 371 PPSIYKAVKTEFLKQFSGFPSAPGYSILDT-CFNLTSYEDIS-IPTIKMIFEGNAELEVD 428
Query: 348 KDSMFYQPRPDA--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+FY +PDA C+A+ S N +IG Q+ V YD +++ +C
Sbjct: 429 VTGVFYFVKPDASLVCLALASLSYENE---VGIIGNYQQKNQRVIYDTTQERLGIAGENC 485
>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
Length = 530
Score = 120 bits (302), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 95/318 (29%), Positives = 147/318 (46%), Gaps = 32/318 (10%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYVP 115
++ + +G PP F +DTGS +LWV C PC C G F P SS+ + +P
Sbjct: 117 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 176
Query: 116 CDSEHCRYF---PYARCSAYKHR-CIYTQLYLIGPETSVFVSTEQLTFKNT---DESTIH 168
C + C A C + C YT Y G TS + ++ + F +++
Sbjct: 177 CSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTANS 236
Query: 169 VQDVVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNS------SFSYCIGSLHDP 217
+VFGC S T + GIFG G + S+VSQLNS FS+C L
Sbjct: 237 SASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHC---LKGS 293
Query: 218 DYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
D L+LG+ I++ G TPL HY + LE+I V+G+ L I+ ++F ++ G
Sbjct: 294 DNGGGILVLGE--IVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNT-QG 350
Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRF 336
++DSGT + +L AY+ + + + +RS C+ S D FPTV
Sbjct: 351 TIVDSGTTLAYLADGAYDPFVNAITAAVS-PSVRSLVSKGNQCFVTSSSVD-SSFPTVSL 408
Query: 337 HFRGGAKLALEKDSMFYQ 354
+F GG + ++ ++ Q
Sbjct: 409 YFMGGVAMTVKPENYLLQ 426
>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 392
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 107/382 (28%), Positives = 163/382 (42%), Gaps = 30/382 (7%)
Query: 46 YQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYP 105
++ ++ + + ++V+ S+G P +DTGS L +V C PC C Q G ++ P
Sbjct: 19 FRTPLVSGTTLGSGQYFVDFSLGTPEQKFHLIVDTGSDLAFVQCAPCDLCYEQDGPLYQP 78
Query: 106 SRSSSYAYVPCDSEHCRYFPY---ARCSAY------KHRCIYTQLYLIGPETSVFVSTEQ 156
S SS++ VPCDS C P A CS+ + C Y Y T + E
Sbjct: 79 SNSSTFTPVPCDSAECLLIPAPVGAPCSSSYPESPPQGACSYEYRYGDNSSTVGVFAYET 138
Query: 157 LTFKNTDESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQ----LNSSFSYCI 211
T I V V FGCG +F G+ GLG G S SQ + F+YC+
Sbjct: 139 ATVGG-----IRVNHVAFGCGNRNQGSFVSAGGVLGLGQGALSFTSQAGYAFENKFAYCL 193
Query: 212 GSLHDPDYLHNKLILGDG--AIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPN 266
S P + + LI GD + I + TPL YY+ + I G L I +
Sbjct: 194 TSYLSPTSVFSSLIFGDDMMSTIHDLQFTPLVSNPLNPSVYYVQIVRICFGGETLLIPDS 253
Query: 267 IFKRDDSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMS 325
+K D G GG + DSGT VT+ +AY + + + LC + +
Sbjct: 254 AWKIDSVGNGGTIFDSGTTVTYWSPQAYARIIAAFEKSVPYPRAPPSPQGLPLCVN-VSG 312
Query: 326 SDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQ 385
D +P+ F GA + + F + P+ C+A+ +S + F++IG + QQ
Sbjct: 313 IDHPIYPSFTIEFDQGATYRPNQGNYFIEVSPNIDCLAMLESSSD----GFNVIGNIIQQ 368
Query: 386 FYNVGYDIGRKQKTFQRMDCEV 407
Y V YD + F +C+
Sbjct: 369 NYLVQYDREEHRIGFAHANCDA 390
>gi|115483166|ref|NP_001065176.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|21717159|gb|AAM76352.1|AC074196_10 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433285|gb|AAP54823.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639785|dbj|BAF27090.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|215692411|dbj|BAG87831.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 95/361 (26%), Positives = 160/361 (44%), Gaps = 31/361 (8%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ N +IG PP P +D L+W C C C Q +F P+ S++Y PC +
Sbjct: 51 YVANFTIGTPPQPASAVIDLAGELVWTQCKQCGRCFEQGTPLFDPTASNTYRAEPCGTPL 110
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C P + + C Y G +T V T+ S + FGC ++
Sbjct: 111 CESIPSDVRNCSGNVCAYEASTNAG-DTGGKVGTDTFAVGTAKAS------LAFGCVVAS 163
Query: 181 NRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGD- 236
+ + SGI GLG SLV+Q ++FSYC+ HD ++ L LG A + G
Sbjct: 164 DIDTMGGPSGIVGLGRTPWSLVTQTGVAAFSYCLAP-HDAGK-NSALFLGSSAKLAGGGK 221
Query: 237 --ATPLQFIDG-------HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
+TP I G +Y + LE + M+ + P SG V++D+ + +++
Sbjct: 222 AASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPP-------SGSTVLLDTFSPISF 274
Query: 288 LVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALE 347
LV AY+A++ V + + M + P LC+ +S P + F FRGGA + +
Sbjct: 275 LVDGAYQAVKKAVTVAVGAPPMATPVEPFDLCFPKSGASGAA--PDLVFTFRGGAAMTVP 332
Query: 348 KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
+ + C+A+ ++ N SL+G + Q+ + +D+ ++ +F+ DC
Sbjct: 333 ATNYLLDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPADCTK 392
Query: 408 L 408
L
Sbjct: 393 L 393
>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
Length = 350
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 110/362 (30%), Positives = 159/362 (43%), Gaps = 35/362 (9%)
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYV 114
I + + + + G P Q DTGS++ W+ C PC C PQ +F P+ SS+Y +
Sbjct: 11 IGTANYVITVGFGTPKKNQTVIFDTGSNVNWIQCKPCVVSCYPQQEPLFDPTLSSTYRNI 70
Query: 115 PCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
C S C CS C+Y Y G T F++TE T + + +F
Sbjct: 71 SCTSAACTGLSSRGCSG--STCVYGVTYGDGSSTVGFLATETFTLAAGNV----FNNFIF 124
Query: 175 GCGFSTNRNFKFSGIFGL-GIGRS--SLVSQLNSS----FSYCIGSLHDPD-YLH--NKL 224
GCG N F+G GL G+GRS SL SQL +S FSYC+ S YL+ N L
Sbjct: 125 GCG--QNNQGLFTGAAGLIGLGRSPYSLNSQLATSLGNIFSYCLPSTSSATGYLNIGNPL 182
Query: 225 -ILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGT 283
G A++ A L FID L ISV G L ++ +F+ G +IDSGT
Sbjct: 183 RTPGYTAMLTNSRAPTLYFID------LIGISVGGTRLALSSTVFQSV----GTIIDSGT 232
Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAK 343
+T L AY ALR + + + CY ++ + FPT++ H+ G
Sbjct: 233 VITRLPPTAYGALRTAFRAAMTQYTRAAAASILDTCYDFSRTTTVT-FPTIKLHYT-GLD 290
Query: 344 LALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
+ + +FY C+A + N+ +IG + Q+ V YD K+ F
Sbjct: 291 VTIPGAGVFYVISSSQVCLAF---AGNSDSTQIGIIGNVQQRTMEVTYDNALKRIGFAAG 347
Query: 404 DC 405
C
Sbjct: 348 AC 349
>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
Length = 520
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 108/421 (25%), Positives = 176/421 (41%), Gaps = 42/421 (9%)
Query: 15 NENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQ 74
N+N + Q+ N + A + + + +++++ +G PP
Sbjct: 109 NQNTVSQKQKKKNKEVVTTPVASSVEEQAGQLVATLESGMTLGSGEYFMDVLVGSPPKHF 168
Query: 75 FTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYF----PYARCS 130
+DTGS L W+ C PC DC Q G + P S+SY + C+ C P C
Sbjct: 169 SLILDTGSDLNWIQCLPCHDCFQQNGAFYDPKASASYKNITCNDPRCNLVSPPDPPKPCK 228
Query: 131 AYKHRCIYTQLYLIGPETS----VFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF 186
+ C Y Y T+ V T LT +V++++FGCG NR
Sbjct: 229 SDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTSGGSSELYNVENMMFGCGH-WNR---- 283
Query: 187 SGIF-------GLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILG-DGAIIDE 234
G+F GLG G S SQL S SFSYC+ + + +KLI G D ++
Sbjct: 284 -GLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLSH 342
Query: 235 GD-------ATPLQFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVT 286
+ A +D YY+ +++I V G +L+I + D GG +IDSGT ++
Sbjct: 343 PNLNFTSFVARKENLVDTFYYVQIKSIIVAGEVLNIPEETWNISSDGAGGTIIDSGTTLS 402
Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
+ + AYE +++++ + +G+ +P + D P + F GA
Sbjct: 403 YFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGIDSIQLPELGIAFADGAVWNF 462
Query: 347 EKDSMFYQPRPDAFCMAV--NPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
++ F D C+A+ P S FS+IG QQ +++ YD R + +
Sbjct: 463 PTENSFIWLNEDLVCLAILGTPKSA------FSIIGNYQQQNFHILYDTKRSRLGYAPTK 516
Query: 405 C 405
C
Sbjct: 517 C 517
>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
Length = 471
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 104/380 (27%), Positives = 171/380 (45%), Gaps = 33/380 (8%)
Query: 44 NTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTI 102
N+ + P I + +YV + +G PP +DTGSSL W+ C PC C Q +
Sbjct: 108 NSASIPLNPGLSIGSGNYYVKLGLGTPPKYYAMILDTGSSLSWLQCQPCAVYCHAQADPL 167
Query: 103 FYPSRSSSYAYVPCDSEHCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQL 157
+ PS S +Y + C S C A C + C+YT Y + ++S + L
Sbjct: 168 YDPSVSKTYKKLSCASVECSRLKAATLNDPLCETDSNACLYTASYGDTSFSIGYLSQDLL 227
Query: 158 TFKNTDESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIG 212
T ++ + +GCG F + +GI GL + S+++QL++ +FSYC+
Sbjct: 228 TLTSSQT----LPQFTYGCGQDNQGLFGRAAGIIGLARDKLSMLAQLSTKYGHAFSYCLP 283
Query: 213 SLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH----YYITLEAISVDGRMLDINPNIF 268
+ + L +G + TP+ D Y++ L AI+V GR LD+ ++
Sbjct: 284 TANSGSSGGGFLSIGSISPTSY-KFTPM-LTDSKNPSLYFLRLTAITVSGRPLDLAAAMY 341
Query: 269 KRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMR--SYSWPDKLCYHGIMSS 326
+ +IDSGT +T L Y ALR + + + + +YS D C+ G + S
Sbjct: 342 RVP-----TLIDSGTVITRLPMSMYAALRQAFVKIMSTKYAKAPAYSILDT-CFKGSLKS 395
Query: 327 DLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQF 386
+ P ++ F+GGA L L S+ + C+A +S N+ ++IG QQ
Sbjct: 396 -ISAVPEIKMIFQGGADLTLRAPSILIEADKGITCLAFAGSSGTNQ---IAIIGNRQQQT 451
Query: 387 YNVGYDIGRKQKTFQRMDCE 406
YN+ YD+ + F C
Sbjct: 452 YNIAYDVSTSRIGFAPGSCH 471
>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
gi|223948487|gb|ACN28327.1| unknown [Zea mays]
Length = 434
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 107/364 (29%), Positives = 157/364 (43%), Gaps = 44/364 (12%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSYAYVPCDSE 119
+ V + +G P DTGS WV C PC C Q +F P++S++YA + C S
Sbjct: 96 YVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKEPLFDPTKSATYANISCSSS 155
Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
+C + CS C+Y Y G T F + + LT +++ FGCG
Sbjct: 156 YCSDLYVSGCSG--GHCLYGIQYGDGSYTIGFYAQDTLTLAYDT-----IKNFRFGCG-E 207
Query: 180 TNRNF--KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAIID 233
NR + +G+ GLG G++SL Q F+YC+ + L LG GA
Sbjct: 208 KNRGLFGRAAGLLGLGRGKTSLPVQAYDKYGGVFAYCLPATSAGTGF---LDLGPGAPAA 264
Query: 234 EGDATPLQFIDG--HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKE 291
TP+ G YY+ + I V G +L I ++F S G ++DSGT +T L
Sbjct: 265 NARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVF----STAGTLVDSGTVITRLPPS 320
Query: 292 AYEALRDEVMIRLEG---EQMRSYSWPDKLCYHGIMSSDLKG-------FPTVRFHFRGG 341
AY LR ++G ++S D CY DL G P V F+GG
Sbjct: 321 AYAPLRSAFSKAMQGLGYSAAPAFSILDT-CY------DLTGHKGGSIALPAVSLVFQGG 373
Query: 342 AKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQ 401
A L ++ + Y C+A P N + +++G Q+ + V YDIG+K F
Sbjct: 374 ACLDVDASGILYVADVSQACLAFAP---NADDTDVAIVGNTQQKTHGVLYDIGKKIVGFA 430
Query: 402 RMDC 405
C
Sbjct: 431 PGAC 434
>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 477
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 108/385 (28%), Positives = 174/385 (45%), Gaps = 38/385 (9%)
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGT-IFYPSRSSSYAYV 114
I + + V++S+G PP P +DTGS L+W C PC +C Q + P+ SS++A V
Sbjct: 89 IVTNEYLVHLSVGTPPRPVALTLDTGSDLVWTQCAPCLNCFDQGAIPVLDPAASSTHAAV 148
Query: 115 PCDSEHCRYFPYARC----SAYKHR-CIYTQLYLIGPETSVFVSTEQLTF---KNTDEST 166
CD+ CR P+ C S++ R C+Y Y T +++++ TF N D
Sbjct: 149 RCDAPVCRALPFTSCGRGGSSWGERSCVYVYHYGDKSITVGKLASDRFTFGPGDNADGGG 208
Query: 167 IHVQDVVFGCGFSTNRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNK 223
+ + + FGCG F+ +GI G G GR SL SQL +SFSYC S+ +
Sbjct: 209 VSERRLTFGCGHFNKGIFQANETGIAGFGRGRWSLPSQLGVTSFSYCFTSMFESTSSLVT 268
Query: 224 LILGDGAIIDEG--DATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSGGGVM 278
L + + G +TPL Y+++L+AI+V + I P +R +
Sbjct: 269 LGVAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKAITVGATRIPI-PERRQRLREASAI- 326
Query: 279 IDSGTDVTWLVKEAYEALRDEVMIR-------LEGEQMR-SYSWPDKLCYHGIMSSDLKG 330
IDSG +T L ++ YEA++ E + + +EG + ++ P +G
Sbjct: 327 IDSGASITTLPEDVYEAVKAEFVAQVGLPVSAVEGSALDLCFALPSAAAPKSAFGWRWRG 386
Query: 331 --------FPTVRFHFRGGAKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGM 381
P + FH GGA L +++ +F C+ ++ A+ +IG
Sbjct: 387 RGRAMPVRVPRLVFHLGGGADWELPRENYVFEDYGARVMCLVLDAATGGGDQT--VVIGN 444
Query: 382 MAQQFYNVGYDIGRKQKTFQRMDCE 406
QQ +V YD+ +F CE
Sbjct: 445 YQQQNTHVVYDLENDVLSFAPARCE 469
>gi|242086034|ref|XP_002443442.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
gi|241944135|gb|EES17280.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
Length = 443
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 115/413 (27%), Positives = 174/413 (42%), Gaps = 34/413 (8%)
Query: 15 NENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQ 74
N RV+R I +S Q + S + + + +G PP
Sbjct: 46 NYTAPERVRRAIALS------RQINLASTRAEGGGVSAPVHWATRQYIAEYMVGDPPQRA 99
Query: 75 FTAMDTGSSLLWVHCYPC--RDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAY 132
+DTGSSL+W C C + C Q F S S S+A VPC + C Y A
Sbjct: 100 EALIDTGSSLIWTQCTACLRKVCVRQDLPYFNASSSGSFAPVPCQDKACAG-NYLHFCAL 158
Query: 133 KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGL 192
C + Y G F+ T+ TF+ + +T+ V F + + SG+ GL
Sbjct: 159 DGTCTFRVTYGAGGIIG-FLGTDAFTFQ-SGGATLAFGCVSFTRFAAPDVLHGASGLIGL 216
Query: 193 GIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT--PLQFIDG---- 245
G GR SL SQ + FSYC+ + + L +G A + G + F++
Sbjct: 217 GRGRLSLASQTGAKRFSYCLTPYFHNNGASSHLFVGAAASLSGGGGAVMSMAFVESPKDY 276
Query: 246 ----HYYITLEAISVDGRMLDINPNIF--KRDDSG---GGVMIDSGTDVTWLVKEAYEAL 296
YY+ L I+V L I F + + G GGV+IDSG+ T LV++AYE L
Sbjct: 277 PYSTFYYLPLVGITVGETKLAIPSTAFDLQEVEEGFWEGGVIIDSGSPFTSLVEDAYEPL 336
Query: 297 RDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDL-KGFPTVRFHFRGGAKLALEKDSMFYQP 355
E+ +L G + D + DL + PT+ HF GGA +AL ++ +
Sbjct: 337 MGELARQLNGSLVPPPGEDDGGMALCVARGDLDRVVPTLVLHFSGGADMALPPENYWAPL 396
Query: 356 RPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
CMA+ + S+IG QQ ++ +D+G + +FQ DC +
Sbjct: 397 EKSTACMAIVRGYLQ------SIIGNFQQQNMHILFDVGGGRLSFQNADCSTI 443
>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
Length = 499
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 109/365 (29%), Positives = 161/365 (44%), Gaps = 46/365 (12%)
Query: 61 FYVNISIGQPPVPQFTAM-DTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSYAYVPCDS 118
+ V + +G P +FT + DTGS WV C PC C Q +F P++S++YA + C S
Sbjct: 161 YVVPVRLGTP-AERFTVVFDTGSDTTWVQCQPCVAYCYRQKEPLFDPTKSATYANISCSS 219
Query: 119 EHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
+C + CS C+Y Y G T F + + LT +++ FGCG
Sbjct: 220 SYCSDLYVSGCSG--GHCLYGIQYGDGSYTIGFYAQDTLTLAYDT-----IKNFRFGCG- 271
Query: 179 STNRNF--KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAII 232
NR + +G+ GLG G++SL Q F+YC+ + L LG GA
Sbjct: 272 EKNRGLFGRAAGLLGLGRGKTSLPVQAYDKYGGVFAYCLPATSAGTGF---LDLGPGAPA 328
Query: 233 DEGDATPLQFIDG--HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVK 290
TP+ G YY+ + I V G +L I ++F S G ++DSGT +T L
Sbjct: 329 ANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVF----STAGTLVDSGTVITRLPP 384
Query: 291 EAYEALRDEVMIRLEG---EQMRSYSWPDKLCYHGIMSSDLKG-------FPTVRFHFRG 340
AY LR ++G ++S D CY DL G P V F+G
Sbjct: 385 SAYAPLRSAFSKAMQGLGYSAAPAFSILDT-CY------DLTGHKGGSIALPAVSLVFQG 437
Query: 341 GAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
GA L ++ + Y C+A P N + +++G Q+ + V YDIG+K F
Sbjct: 438 GACLDVDASGILYVADVSQACLAFAP---NADDTDVAIVGNTQQKTHGVLYDIGKKIVGF 494
Query: 401 QRMDC 405
C
Sbjct: 495 APGAC 499
>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
Length = 354
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 96/319 (30%), Positives = 148/319 (46%), Gaps = 33/319 (10%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTI------FYPSRSSSYAY 113
L+Y + +G PPV +DTGS +LWV C C C PQ + F P SS+ +
Sbjct: 24 LYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGC-PQTSGLQIQLNFFDPGSSSTSSM 82
Query: 114 VPCDSEHCR---YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
+ C + C A CS+ ++C YT Y G TS + ++ + E ++
Sbjct: 83 IACSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTTN 142
Query: 171 D---VVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHD 216
VVFGC G T + GIFG G S++SQL+S FS+C L
Sbjct: 143 STAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHC---LKG 199
Query: 217 PDYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
L+LG+ I++ T L HY + L++I+V+G+ L I+ ++F +S
Sbjct: 200 DSSGGGILVLGE--IVEPNIVYTSLVPAQPHYNLNLQSIAVNGQTLQIDSSVFATSNS-R 256
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
G ++DSGT + +L +EAY+ + + + S ++ CY I SS + FP V
Sbjct: 257 GTIVDSGTTLAYLAEEAYDPFVSAITASIPQSVHTAVSRGNQ-CYL-ITSSVTEVFPQVS 314
Query: 336 FHFRGGAKLALEKDSMFYQ 354
+F GGA + L Q
Sbjct: 315 LNFAGGASMILRPQDYLIQ 333
>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
Length = 367
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 104/370 (28%), Positives = 162/370 (43%), Gaps = 31/370 (8%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ + I +G PP +DTGS L+W+ C PC C Q I+ PS SS++A C +
Sbjct: 4 YTMEIELGSPPKKFNAIVDTGSDLVWIQCKPCSQCYSQSDPIYDPSASSTFAKTSCSTSS 63
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C+ P + CS+ CIY Y T + E LT +++ S+ + FGCG
Sbjct: 64 CQSLPASGCSSSAKTCIYGYQYGDSSSTQGDFALETLTLRSSGGSSKAFPNFQFGCGRLN 123
Query: 181 NRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
+ +F +GI GLG G+ SL +QL N+ FSYC+ D + LI G A G
Sbjct: 124 SGSFGGAAGIVGLGQGKISLSTQLGSAINNKFSYCLVDFDDDSSKTSPLIFGSSASTGSG 183
Query: 236 D-ATPLQFIDG---HYYITLEAISVDGRMLDINP--------------NIFKRDDSGGGV 277
+TP+ G +Y++ LE ISV G+ L + + + + GG
Sbjct: 184 AISTPIIPNSGRSTYYFVGLEGISVGGKQLSLATRAIDFLSVRSKKKLRVRALEVNSGGT 243
Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFH 337
+ DSGT +T L Y ++ + + + S LCY S + K FP +
Sbjct: 244 IFDSGTTLTLLDDAVYSKVKSAFASSVSLPTVDASSSGFDLCYDVSKSKNFK-FPALTLA 302
Query: 338 FRGGAKLALEKDSMF--YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGR 395
F+ G K + + + F C+A+ + + +L+ QQ Y+V YD G
Sbjct: 303 FK-GTKFSPPQKNYFVIVDTAETVACLAMGGSGSLGLGIIGNLM----QQNYHVVYDRGT 357
Query: 396 KQKTFQRMDC 405
+ C
Sbjct: 358 STISMSPAQC 367
>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 494
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 112/354 (31%), Positives = 153/354 (43%), Gaps = 40/354 (11%)
Query: 69 QPPVPQFTAMDTGSSLLWVHCYPC--RDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPY 126
Q V Q +DT S + WV C PC C Q ++ P++SS++A +PC S C+
Sbjct: 164 QDAVSQTVVVDTSSDIPWVQCLPCPIPQCHLQKDPLYDPAKSSTFAPIPCGSPACKELGS 223
Query: 127 A---RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRN 183
+ CS C Y Y G T+ T+ LT TI V+D FGC + +
Sbjct: 224 SYGNGCSPTTDECKYIVNYGDGKATTGTYVTDTLTM----SPTIVVKDFRFGCSHAVRGS 279
Query: 184 F--KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDA 237
F + +GI LG GR SL+ Q ++FSYCI +L + G +
Sbjct: 280 FSNQNAGILALGGGRGSLLEQTADAYGNAFSYCIPKPSSAGFLS---LGGPVEASLKFSY 336
Query: 238 TPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYE 294
TPL + Y + LEAI V G+ L + P F G ++DSG VT L + Y
Sbjct: 337 TPLIKNKHAPTFYIVHLEAIIVAGKQLAVPPTAFAT-----GAVMDSGAVVTQLPPQVYA 391
Query: 295 ALRDEVMIRLEGEQMRSYSWPDK---LCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSM 351
ALR R + P + CY D+K P V F GGA L LE S+
Sbjct: 392 ALR--AAFRSAMAAYGPLAAPVRNLDTCYDFTRFPDVK-VPKVSLVFAGGATLDLEPASI 448
Query: 352 FYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
C+A A+ V F IG + QQ Y V YD+G + F+R C
Sbjct: 449 ILD-----GCLAFA-ATPGEESVGF--IGNVQQQTYEVLYDVGGGKVGFRRGAC 494
>gi|115483168|ref|NP_001065177.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|21717168|gb|AAM76361.1|AC074196_19 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433289|gb|AAP54827.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639786|dbj|BAF27091.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|215686408|dbj|BAG87693.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 95/361 (26%), Positives = 159/361 (44%), Gaps = 31/361 (8%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ N +IG PP P +D L+W C C C Q +F P+ S++Y PC +
Sbjct: 51 YVANFTIGTPPQPASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPCGTPL 110
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C P + + C Y Q +T V T+ S + FGC ++
Sbjct: 111 CESIPSDSRNCSGNVCAY-QASTNAGDTGGKVGTDTFAVGTAKAS------LAFGCVVAS 163
Query: 181 NRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGD- 236
+ + SGI GLG SLV+Q ++FSYC+ HD ++ L LG A + G
Sbjct: 164 DIDTMGGPSGIVGLGRTPWSLVTQTGVAAFSYCLAP-HDAGR-NSALFLGSSAKLAGGGK 221
Query: 237 --ATPLQFIDG-------HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
+TP I G +Y + LE + M+ + P SG V++D+ + +++
Sbjct: 222 AASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPP-------SGSTVLLDTFSPISF 274
Query: 288 LVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALE 347
LV AY+A++ V + M + P LC+ +S P + F FRGGA + +
Sbjct: 275 LVDGAYQAVKKAVTAAVGAPPMATPVEPFDLCFPKSGASGAA--PDLVFTFRGGAAMTVP 332
Query: 348 KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
+ + C+A+ ++ N SL+G + Q+ + +D+ ++ +F+ DC
Sbjct: 333 ATNYLLDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPADCTK 392
Query: 408 L 408
L
Sbjct: 393 L 393
>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 460
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 109/387 (28%), Positives = 171/387 (44%), Gaps = 53/387 (13%)
Query: 44 NTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTI 102
N+ + P I + +Y+ + +G PP +DTGSSL W+ C PC C Q+ +
Sbjct: 103 NSANIPLNPGLSIGSGNYYLKLGLGSPPKYYTMILDTGSSLSWLQCKPCVVYCHSQVDPL 162
Query: 103 FYPSRSSSYAYVPCDSEHCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQL 157
F PS S++Y + C S C A C+A C+YT Y + ++S + L
Sbjct: 163 FEPSASNTYRPLYCSSSECSLLKAATLNDPLCTA-SGVCVYTASYGDASYSMGYLSRDLL 221
Query: 158 TFKNTDESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIG 212
T + + +GCG F K +GI GL + S+++QL+ +FSYC+
Sbjct: 222 TLTPSQT----LPSFTYGCGQDNEGLFGKAAGIVGLARDKLSMLAQLSPKYGYAFSYCLP 277
Query: 213 SLHDPDYLHNKLILGDGAIIDEGDATPLQFI----------DGHYYITLEAISVDGRMLD 262
+ G + G +P + Y++ L AI+V GR +
Sbjct: 278 TSTS----------SGGGFLSIGKISPSSYKFTPMIRNSQNPSLYFLRLAAITVAGRPVG 327
Query: 263 INPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLC 319
+ ++ +IDSGT VT L Y ALR+ ++M R EQ +YS D C
Sbjct: 328 VAAAGYQVP-----TIIDSGTVVTRLPISIYAALREAFVKIMSR-RYEQAPAYSILDT-C 380
Query: 320 YHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLI 379
+ G + S + G P +R F+GGA L+L ++ + C+A AS N ++I
Sbjct: 381 FKGSLKS-MSGAPEIRMIFQGGADLSLRAPNILIEADKGIACLAF--ASSN----QIAII 433
Query: 380 GMMAQQFYNVGYDIGRKQKTFQRMDCE 406
G QQ YN+ YD+ + F C
Sbjct: 434 GNHQQQTYNIAYDVSASKIGFAPGGCR 460
>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 94/367 (25%), Positives = 166/367 (45%), Gaps = 31/367 (8%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSE 119
+ V++ IG PP Q +DTGS L W+ C+ P ++F PS SSS++ +PC+
Sbjct: 81 ILLVSLPIGTPPQTQQMILDTGSQLSWIQCHKKVPRKPPPSSVFDPSLSSSFSVLPCNHP 140
Query: 120 HCRY----FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
C+ F C Y+ Y G + E++TF + + ++ G
Sbjct: 141 LCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKITFSRSQSTP----PLILG 196
Query: 176 CGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLH-DPDYL-HNKLILGDGA-- 230
C ++ GI G+ +GR S SQ + FSYC+ + P + LG+
Sbjct: 197 CAEESS---DAKGILGMNLGRLSFASQAKLTKFSYCVPTRQVRPGFTPTGSFYLGENPNS 253
Query: 231 -------IIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMIDS 281
++ + + +D Y + ++ I + + L+I + F+ D SG G MIDS
Sbjct: 254 GGFRYINLLTFSQSQRMPNLDPLAYTVAMQGIRIGNQKLNIPISAFRPDPSGAGQTMIDS 313
Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMRS---YSWPDKLCYHGIMSSDLKGFPTVRFHF 338
G++ T+LV EAY +R+EV +RL G +++ Y +C++G + + F F
Sbjct: 314 GSEFTYLVDEAYNKVREEV-VRLVGARLKKGYVYGGVSDMCFNGNAIEIGRLIGNMVFEF 372
Query: 339 RGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQK 398
G ++ +EK+ + C+ + + + N +IG QQ V +D+ ++
Sbjct: 373 DKGVEIVVEKERVLADVGGGVHCVGIGRSEMLGAASN--IIGNFHQQNIWVEFDLANRRV 430
Query: 399 TFQRMDC 405
F + DC
Sbjct: 431 GFGKADC 437
>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
Length = 481
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 118/392 (30%), Positives = 170/392 (43%), Gaps = 41/392 (10%)
Query: 39 KIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ 98
+ R + A +L + ++ + +G P +DTGS ++W+ C PCR C Q
Sbjct: 106 RPRRRGGFAAPLLSGLPQGSGEYFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRHCYAQ 165
Query: 99 LGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLT 158
G +F P RS SYA V C + CR A C ++ C+Y Y G T+ ++E LT
Sbjct: 166 SGRVFDPRRSRSYAAVDCVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLT 225
Query: 159 FKNTDESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCI-- 211
F VQ V GCG F SG+ GLG GR S SQ+ SFSYC+
Sbjct: 226 FAR----GARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLVD 281
Query: 212 -GSLHDPDYLHNKLIL---GDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRM---- 260
S P + + G A TP+ + YY+ L SV G
Sbjct: 282 RTSSVRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGV 341
Query: 261 ----LDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMR--SYSW 314
L +NP + GGV++DSGT VT L + YEA+RD G ++ +S
Sbjct: 342 SQSDLRLNPTTGR-----GGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSL 396
Query: 315 PDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS-MFYQPRPDAFCMAVNPASINNRY 373
D CY+ + + PTV H GGA +AL ++ + FC A+ A +
Sbjct: 397 FDT-CYN-LSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAM--AGTDG-- 450
Query: 374 VNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
S+IG + QQ + V +D ++ F C
Sbjct: 451 -GVSIIGNIQQQGFRVVFDGDAQRVGFVPKSC 481
>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
Length = 469
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 116/365 (31%), Positives = 159/365 (43%), Gaps = 44/365 (12%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD--CSPQLGTIFYPSRSSSYAYVPCDS 118
+ + +G P VPQ +DTGSSL WV C PC C PQ +F P+ SSSY+ VPCDS
Sbjct: 129 YVATVGLGTPAVPQTLILDTGSSLTWVQCKPCNSSQCYPQRLPLFDPNTSSSYSPVPCDS 188
Query: 119 EHCRYFPYA----RCSA-YKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
+ CR C++ C Y Y G + ST+ LT V+
Sbjct: 189 QECRALAAGIDGDGCTSDGDWGCAYEIHYGSGATPAGEYSTDALTLG----PGAIVKRFH 244
Query: 174 FGCGFSTNRNFKF---SGIFGLGIGRSSLVSQLNSS-----FSYCIGSLHDPDYLHNKLI 225
FGCG R KF G+ GLG SL Q ++ FS+C+ P +
Sbjct: 245 FGCGHHQQRG-KFDMADGVLGLGRLPQSLAWQASARRGGGVFSHCL-----PPTGVSTGF 298
Query: 226 LGDGAIIDEGD--ATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
L GA D TPL +D Y + AISV G++LDI P +F+ GV+ D
Sbjct: 299 LALGAPHDTSAFVFTPLLTMDDQPWFYQLMPTAISVAGQLLDIPPAVFRE-----GVITD 353
Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRG 340
SGT ++ L + AY ALR + + C++ D PTV FRG
Sbjct: 354 SGTVLSALQETAYTALRTAFRSAMAEYPLAPPVGHLDTCFN-FTGYDNVTVPTVSLTFRG 412
Query: 341 GAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
GA + L+ S AF S + Y LIG ++Q+ V YD+ ++ F
Sbjct: 413 GATVHLDASSGVLMDGCLAFW------SSGDEYTG--LIGSVSQRTIEVLYDMPGRKVGF 464
Query: 401 QRMDC 405
+ C
Sbjct: 465 RTGAC 469
>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
Length = 475
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 118/392 (30%), Positives = 170/392 (43%), Gaps = 41/392 (10%)
Query: 39 KIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ 98
+ R + A +L + ++ + +G P +DTGS ++W+ C PCR C Q
Sbjct: 100 RPRRRGGFAAPLLSGLPQGSGEYFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRHCYAQ 159
Query: 99 LGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLT 158
G +F P RS SYA V C + CR A C ++ C+Y Y G T+ ++E LT
Sbjct: 160 SGRVFDPRRSRSYAAVDCVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLT 219
Query: 159 FKNTDESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCI-- 211
F VQ V GCG F SG+ GLG GR S SQ+ SFSYC+
Sbjct: 220 FAR----GARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLVD 275
Query: 212 -GSLHDPDYLHNKLIL---GDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRM---- 260
S P + + G A TP+ + YY+ L SV G
Sbjct: 276 RTSSVRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGV 335
Query: 261 ----LDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMR--SYSW 314
L +NP + GGV++DSGT VT L + YEA+RD G ++ +S
Sbjct: 336 SQSDLRLNPTTGR-----GGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSL 390
Query: 315 PDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS-MFYQPRPDAFCMAVNPASINNRY 373
D CY+ + + PTV H GGA +AL ++ + FC A+ A +
Sbjct: 391 FDT-CYN-LSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAM--AGTDG-- 444
Query: 374 VNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
S+IG + QQ + V +D ++ F C
Sbjct: 445 -GVSIIGNIQQQGFRVVFDGDAQRVGFVPKSC 475
>gi|356537015|ref|XP_003537027.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 476
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 103/358 (28%), Positives = 166/358 (46%), Gaps = 41/358 (11%)
Query: 78 MDTGSSLLWVHCYPCRDC--SPQLG---TIFYPSRSSSYAYVPCDSEHCR---YFPYARC 129
+DTGS +LWV+C C +C S QLG F SS+ A +PC C A C
Sbjct: 85 IDTGSDILWVNCNTCSNCPQSSQLGIELNFFDTVGSSTAALIPCSDLICTSGVQGAAAEC 144
Query: 130 SAYKHRCIYTQLYLIGPETSVFVSTEQLTFK---NTDESTIHVQDVVFGCGFS-----TN 181
S ++C YT Y G TS + ++ + F + +VFGC S T
Sbjct: 145 SPRVNQCSYTFQYGDGSGTSGYYVSDAMYFNLIMGQPPAVNSTATIVFGCSISQSGDLTK 204
Query: 182 RNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
+ GIFG G G S+VSQL+S FS+C L L+LG+ I++
Sbjct: 205 TDKAVDGIFGFGPGPLSVVSQLSSQGITPKVFSHC---LKGDGNGGGILVLGE--ILEPS 259
Query: 236 DA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYE 294
+PL HY + L++I+V+G+ L INP +F ++ GG ++D GT + +L++EAY+
Sbjct: 260 IVYSPLVPSQPHYNLNLQSIAVNGQPLPINPAVFSISNNRGGTIVDCGTTLAYLIQEAYD 319
Query: 295 ALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG-FPTVRFHFRGGAKLALEKDSMF- 352
L + + + R + CY ++S+ + FP V +F GGA + L+ +
Sbjct: 320 PLVTAINTAVS-QSARQTNSKGNQCY--LVSTSIGDIFPLVSLNFEGGASMVLKPEQYLM 376
Query: 353 ---YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
Y + +C+ S++G + + V YDI +++ + DC +
Sbjct: 377 HNGYLDGAEMWCVG-----FQKLQEGASILGDLVLKDKIVVYDIAQQRIGWANYDCSL 429
>gi|125532796|gb|EAY79361.1| hypothetical protein OsI_34489 [Oryza sativa Indica Group]
Length = 405
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 97/373 (26%), Positives = 159/373 (42%), Gaps = 42/373 (11%)
Query: 57 SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPC 116
S L+ N +IG PP P +D L+W C PC+ C Q +F P++SS++ +PC
Sbjct: 53 SQGLYVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPC 112
Query: 117 DSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
S C P + + CIY G +T T+ E+ + FGC
Sbjct: 113 GSHLCESIPESSRNCTSDVCIYEAPTKAG-DTGGMAGTDTFAIGAAKET------LGFGC 165
Query: 177 GFSTNRNFKF----SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAI 231
T++ K SGI GLG SLV+Q+N ++FSYC+ L LG A
Sbjct: 166 VVMTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVTAFSYCLAGKS-----SGALFLGATAK 220
Query: 232 -IDEGDATPLQFI------------DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVM 278
+ G + F+ + +Y + L I G L SG V+
Sbjct: 221 QLAGGKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKAGGAPLQ------AASSSGSTVL 274
Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHF 338
+D+ + ++L AY+AL+ + + + + S P LC+ ++ D P + F F
Sbjct: 275 LDTVSRASYLADGAYKALKKALTAAVGVQPVASPPKPYDLCFSKAVAGDA---PELVFTF 331
Query: 339 RGGAKLALEKDSMFYQPRPDAFCMAV-NPASIN--NRYVNFSLIGMMAQQFYNVGYDIGR 395
GGA L + + C+ + + AS+N S++G + Q+ +V +D+
Sbjct: 332 DGGAALTVPPANYLLASGNGTVCLTIGSSASLNLTGELEGASILGSLQQENVHVLFDLKE 391
Query: 396 KQKTFQRMDCEVL 408
+ +F+ DC L
Sbjct: 392 ETLSFKPADCSSL 404
>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
Length = 452
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 105/368 (28%), Positives = 169/368 (45%), Gaps = 31/368 (8%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSYAYVPCDSE 119
+++ +S+G PP+ +DTGS L W C PC C Q ++ P+RSS+++ +PC S
Sbjct: 96 YHMILSVGTPPLAFPAIIDTGSDLTWTQCAPCTTACFAQPTPLYDPARSSTFSKLPCASP 155
Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTF---KNTDESTIHVQDVVFGC 176
C+ P A + C+Y Y +G T+ +++ + L +++ V FGC
Sbjct: 156 LCQALPSAFRACNATGCVYDYRYAVG-FTAGYLAADTLAIGDGDGDGDASSSFAGVAFGC 214
Query: 177 GFSTNRNF-KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAII-- 232
+ + SGI GLG SL+SQ+ FSYC+ S D D + ++ G A +
Sbjct: 215 STANGGDMDGASGIVGLGRSALSLLSQIGVGRFSYCLRS--DADAGASPILFGALANVTG 272
Query: 233 DEGDATPL-------QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTD 284
D+ +T L + +YY+ L I+V L + + F +G GGV++DSGT
Sbjct: 273 DKVQSTALLRNPVAARRRAPYYYVNLTGIAVGSTDLPVTSSTFGFTAAGAGGVIVDSGTT 332
Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMR--SYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGA 342
T+L + Y LR + + G R + LC+ ++D P + F F GGA
Sbjct: 333 FTYLAEAGYTMLRQAFLSQTAGLLTRVSGAQFDFDLCFEA-GAADTP-VPRLVFRFAGGA 390
Query: 343 KLALEKDSMF--YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
+ A+ + S F C+ V P S+IG + Q +V YD+ +F
Sbjct: 391 EYAVPRQSYFDAVDEGGRVACLLVLPTR------GVSVIGNVMQMDLHVLYDLDGATFSF 444
Query: 401 QRMDCEVL 408
DC L
Sbjct: 445 APADCASL 452
>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 97/374 (25%), Positives = 171/374 (45%), Gaps = 39/374 (10%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG------TIFYPSRSSSYAY 113
L+Y I +G PPV + +DTGS + W++C PC C + T + PSRSS+
Sbjct: 36 LYYTKIYLGTPPVGYYVQVDTGSDVTWLNCAPCTSCVTETQLPSIKLTTYDPSRSSTDGA 95
Query: 114 VPCDSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI--H 168
+ C +C C++ + C Y+ Y G T + + +TF+ +T
Sbjct: 96 LSCRDSNCGAALGSNEVSCTSAGY-CAYSTTYGDGSSTQGYFIQDVMTFQEIHNNTQVNG 154
Query: 169 VQDVVFGCGFSTNRNFKFS-----GIFGLGIGRSSLVSQLNS------SFSYCIGSLHDP 217
V FGCG + + N S G+ G G S+ SQL S F++C L
Sbjct: 155 TASVYFGCGTTQSGNLLMSSRALDGLIGFGQAAVSIPSQLASMGKVGNRFAHC---LQGD 211
Query: 218 DYLHNKLILGDGAIIDEGDATPLQFID-GHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
+ +++G + E + + + HY + ++ I+V+GR + + S GG
Sbjct: 212 NQGGGTIVIGS---VSEPNISYTPIVSRNHYAVGMQNIAVNGRNVTTPASFDTTSTSAGG 268
Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRF 336
V++DSGT + +LV AY + V E S+S +L + + + FPTV+
Sbjct: 269 VIMDSGTTLAYLVDPAYTQFVNAVST-FESSMFSSHSQCLQLAWCSLQAD----FPTVKL 323
Query: 337 HFRGGAKLALE-KDSMFYQPRPD---AFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
F GA + L ++ ++ QP + A+CM ++ Y+++S++G + + + V YD
Sbjct: 324 FFDAGAVMNLTPRNYLYSQPLQNGQAAYCMGWQKSTTKAGYLSYSILGDIVLKDHLVVYD 383
Query: 393 IGRKQKTFQRMDCE 406
+ ++ DC+
Sbjct: 384 NDNRVVGWKSFDCK 397
>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
Length = 475
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 112/397 (28%), Positives = 163/397 (41%), Gaps = 51/397 (12%)
Query: 39 KIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ 98
+ R + A +L + ++ + +G P +DTGS ++W+ C PCR C Q
Sbjct: 100 RPRRRGGFAAPLLSGLPQGSGEYFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRHCYAQ 159
Query: 99 LGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLT 158
G +F P RS SYA V C + CR A C ++ C+Y Y G T+ ++E LT
Sbjct: 160 SGRVFDPRRSRSYAAVDCVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLT 219
Query: 159 FKNTDESTIHVQDVVFGCGFSTNRNFKFSGI----------FGLGIGRSSLVSQLNSSFS 208
F VQ V GCG F + F I RS SFS
Sbjct: 220 FAR----GARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPTQIARS-----FGRSFS 270
Query: 209 YCI---GSLHDPDYLHNKLIL---GDGAIIDEGDATPL---QFIDGHYYITLEAISVDGR 259
YC+ S P + + G A TP+ + YY+ L SV G
Sbjct: 271 YCLVDRTSSVRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGA 330
Query: 260 M--------LDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMR- 310
L +NP + GGV++DSGT VT L + YEA+RD G ++
Sbjct: 331 RVKGVSQSDLRLNPTTGR-----GGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSP 385
Query: 311 -SYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS-MFYQPRPDAFCMAVNPAS 368
+S D CY+ + + PTV H GGA +AL ++ + FC A+ A
Sbjct: 386 GGFSLFDT-CYN-LSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAM--AG 441
Query: 369 INNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ S+IG + QQ + V +D ++ F C
Sbjct: 442 TDG---GVSIIGNIQQQGFRVVFDGDAQRVGFVPKSC 475
>gi|226492633|ref|NP_001149953.1| LOC100283580 precursor [Zea mays]
gi|195635701|gb|ACG37319.1| pepsin A [Zea mays]
Length = 491
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 103/381 (27%), Positives = 165/381 (43%), Gaps = 40/381 (10%)
Query: 51 LPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYP 105
LP++ L+Y I IG PP + +DTGS +LWV+C C C + G T + P
Sbjct: 77 LPTD---TGLYYTRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPTRSGLGIELTQYDP 133
Query: 106 SRSSSYAYVPCDSEHCRYFPYA----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKN 161
+ S + V C+ E C C + C + Y G T+ F T+ + +
Sbjct: 134 AGSGT--TVGCEQEFCVANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQYNQ 191
Query: 162 TD---ESTIHVQDVVFGCGFST-----NRNFKFSGIFGLGIGRSSLVSQLNSS------F 207
++T + FGCG + N GI G G SS++SQL ++ F
Sbjct: 192 VSGNGQTTTSNASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIF 251
Query: 208 SYCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPN 266
++C+ D + I G ++ + TPL HY + L+ ISV G L + +
Sbjct: 252 AHCL------DTVRGGGIFAIGNVVQPKVKTTPLVPNVTHYNVNLQGISVGGATLQLPTS 305
Query: 267 IFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS 326
F DS G +IDSGT + +L +E Y L V + + + +Y D +C+ S
Sbjct: 306 TFDSGDS-KGTIIDSGTTLAYLPREVYRTLLAAVFDKYQDLPLHNYQ--DFVCFQFSGSI 362
Query: 327 DLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR-YVNFSLIGMMAQQ 385
D GFP + F F+G L + D +Q R D +CM + + + L+G +
Sbjct: 363 D-DGFPVITFSFKGDLTLNVYPDDYLFQNRNDLYCMGFLDGGVQTKDGKDMLLLGDLVLS 421
Query: 386 FYNVGYDIGRKQKTFQRMDCE 406
V YD+ ++ + +C
Sbjct: 422 NKLVVYDLEKEVIGWTDYNCS 442
>gi|18405138|ref|NP_565911.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
gi|13877759|gb|AAK43957.1|AF370142_1 unknown protein [Arabidopsis thaliana]
gi|15293231|gb|AAK93726.1| unknown protein [Arabidopsis thaliana]
gi|20196976|gb|AAB87120.2| expressed protein [Arabidopsis thaliana]
gi|20197046|gb|AAM14894.1| expressed protein [Arabidopsis thaliana]
gi|330254616|gb|AEC09710.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 110/380 (28%), Positives = 166/380 (43%), Gaps = 53/380 (13%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
N V +++G PP +DTGS L W+HC SP LG++F P SS+Y+ VPC
Sbjct: 62 NVTLTVTLAVGDPPQNISMVLDTGSELSWLHCKK----SPNLGSVFNPVSSSTYSPVPCS 117
Query: 118 SEHCRY----FPY-ARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
S CR P A C H C Y TS+ + TF ++
Sbjct: 118 SPICRTRTRDLPIPASCDPKTHLCHVAISY--ADATSIEGNLAHETFV---IGSVTRPGT 172
Query: 173 VFGC---GFSTN--RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLIL 226
+FGC G S+N + K +G+ G+ G S V+QL S FSYCI + L+L
Sbjct: 173 LFGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFSKFSYCISGSDSSGF----LLL 228
Query: 227 GDGAIIDEG---------DATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
GD + G +TPL + D Y + LE I V ++L + ++F D +G G
Sbjct: 229 GDASYSWLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAG 288
Query: 277 -VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD-------KLCYH--GIMSS 326
M+DSGT T+L+ Y AL++E + + + +R PD LCY
Sbjct: 289 QTMVDSGTQFTFLMGPVYTALKNEFITQTK-SVLRLVDDPDFVFQGTMDLCYKVGSTTRP 347
Query: 327 DLKGFPTVRFHFRG------GAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIG 380
+ G P V FRG G KL + + + + +C + + + +IG
Sbjct: 348 NFSGLPMVSLMFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLG--IEAFVIG 405
Query: 381 MMAQQFYNVGYDIGRKQKTF 400
QQ + +D+ + + F
Sbjct: 406 HHHQQNVWMEFDLAKSRVGF 425
>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 459
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 115/407 (28%), Positives = 168/407 (41%), Gaps = 38/407 (9%)
Query: 24 RGINISIARLAYLQEKIRSHNTYQAQ----ILPSNDISNSLFYVNISIGQPPVPQFTAMD 79
R +S+ARL + R + Q Q + PS D+ + V++++G PP P +D
Sbjct: 66 RAAALSVARLGGSNKGARQQDQNQQQPGLPVRPSGDLE---YLVDLAVGTPPQPVSALLD 122
Query: 80 TGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYT 139
TGS L+W C PC C PQ IF P SSSY + C E C + C C Y
Sbjct: 123 TGSDLIWTQCAPCASCLPQPDPIFSPGASSSYEPMRCAGELCNDILHHSCQ-RPDTCTYR 181
Query: 140 QLYLIGPETSVFVSTEQLTF---KNTDESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIG 195
Y G T +TE+ TF + E+T + FGCG + SGI G G
Sbjct: 182 YSYGDGTTTRGVYATERFTFSSSSSGGETTKLSAPLGFGCGTMNKGSLNNGSGIVGFGRA 241
Query: 196 RSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGD--GAIIDEGDAT--PLQFIDGH---- 246
SLVSQL FSYC+ + L+ G G + D AT + +
Sbjct: 242 PLSLVSQLAIRRFSYCLTPYASGR--KSTLLFGSLRGGVYDAATATVQTTRLLRSRQNPT 299
Query: 247 -YYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTW----LVKEAYEALRDEV 300
YY+ ++V R L I + F R D GG ++DSGT +T ++ E A R +
Sbjct: 300 FYYVPFTGVTVGARRLRIPISAFALRPDGSGGAIVDSGTALTLFPAPVLAEVVRAFRSQ- 358
Query: 301 MIRLEGEQMRSYSWPDKLCYHGIMSSDLKG--FPTVRFHFRGGAKLALEKDSMFYQPRPD 358
+RL S D +C+ S + P + FH +G ++ + R
Sbjct: 359 -LRLPFAANGSSGPDDGVCFAAAASRVPRPAVVPRMVFHLQGADLDLPRRNYVLDDQRKG 417
Query: 359 AFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
C+ + + + + IG QQ V YD+ +F C
Sbjct: 418 NLCLLLADSGDSG-----TTIGNFVQQDMRVLYDLEADTLSFAPAQC 459
>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
Length = 452
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 109/417 (26%), Positives = 172/417 (41%), Gaps = 36/417 (8%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
LIH S SP+ PN + I RL +L+ RS +P S
Sbjct: 56 LIHIYSECSPFRPPNRTWESLMSEKIRGDANRLRFLKRTSRSSKEDANANVPVRSGSGE- 114
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ + + G P +T +DTGS + W+ C C+ C IF P++SSSY CDS+
Sbjct: 115 YIIQVDFGTPKQSMYTLIDTGSDVAWIPCKQCQGCH-STAPIFDPAKSSSYKPFACDSQP 173
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG--- 177
C+ C +C + LY G + ++++ +T + ++ + FGC
Sbjct: 174 CQEI-SGNCGG-NSKCQFEVLYGDGTQVDGTLASDAITLGSQ-----YLPNFSFGCAESL 226
Query: 178 ----FSTNRNFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIID 233
+S+ G + ++ +FSYC L L+LG A +
Sbjct: 227 SEDTYSSPGLMGLGGGSLSLLTQAPTAELFGGTFSYC---LPSSSTSSGSLVLGKEAAVS 283
Query: 234 EGDATPLQFID-----GHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
I Y++TL+AISV + + SGGG +IDSGT +T+L
Sbjct: 284 SSSLKFTTLIKDPSFPTFYFVTLKAISVGNTRISVPATNIA---SGGGTIIDSGTTITYL 340
Query: 289 VKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEK 348
V AY+ LRD +L Q D CY +SS PT+ H L L K
Sbjct: 341 VPSAYKDLRDAFRQQLSSLQPTPVEDMDT-CYD--LSSSSVDVPTITLHLDRNVDLVLPK 397
Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+++ C+A +S ++R S+IG + QQ + + +D+ Q F + C
Sbjct: 398 ENILITQESGLSCLAF--SSTDSR----SIIGNVQQQNWRIVFDVPNSQVGFAQEQC 448
>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 476
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 110/388 (28%), Positives = 167/388 (43%), Gaps = 53/388 (13%)
Query: 51 LPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYP 105
LPS S L+Y + +G P + +DTGS +LWV+C C C + G T++ P
Sbjct: 65 LPS---STGLYYTKVGLGSPAKEFYVQVDTGSDILWVNCAGCTACPKKSGLGMDLTLYDP 121
Query: 106 SRSSSYAYVPCDSEHCRYFPYARCSAYKH--RCIYTQLYLIGPETSVFVSTEQLTFKNTD 163
+ S + VPC C S K C Y+ Y G TS + LTF
Sbjct: 122 NGSKTSNAVPCGDGFCTDTYSGPISGCKQDMSCPYSITYGDGSTTSGSFVNDSLTFDEV- 180
Query: 164 ESTIHVQ----DVVFGCG------FSTNRNFKFSGIFGLGIGRSSLVSQLNSS------F 207
+H + V+FGCG S+N + GI G G SS++SQL +S F
Sbjct: 181 SGNLHTKPDNSSVIFGCGAKQSGSLSSNSDEALDGIIGFGQANSSVLSQLAASGKVKRIF 240
Query: 208 SYCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPN 266
S+C+ D H I G +++ + + TPL HY + L+ + VDG + +
Sbjct: 241 SHCL------DSHHGGGIFSIGQVMEPKFNTTPLVPRMAHYNVILKDMDVDGEPILLPLY 294
Query: 267 IFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEG-------EQMRSYSWPDKLC 319
+F SG G +IDSGT + +L Y L +V+ R G +Q + + DKL
Sbjct: 295 LFDS-GSGRGTIIDSGTTLAYLPLSIYNQLLPKVLGRQPGLKLMIVEDQFTCFHYSDKLD 353
Query: 320 YHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRY-VNFSL 378
+GFP V+FHF G + D +F + D +C+ +S + + L
Sbjct: 354 ---------EGFPVVKFHFEGLSLTVHPHDYLFLY-KEDIYCIGWQKSSTQTKEGRDLIL 403
Query: 379 IGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
IG + V YD+ + +C
Sbjct: 404 IGDLVLSNKLVVYDLENMVIGWTNFNCS 431
>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 488
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 94/372 (25%), Positives = 170/372 (45%), Gaps = 38/372 (10%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ--LG---TIFYPSRSSSYAYV 114
L++ I +G PP + +DTGS +LWV+C C C + LG T++ P S+S +
Sbjct: 81 LYFAKIGLGNPPKDYYVQVDTGSDILWVNCANCDKCPTKSDLGVKLTLYDPQSSTSATRI 140
Query: 115 PCDSEHC--RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ-- 170
CD + C Y + C Y+ +Y G T+ F + L F D T ++Q
Sbjct: 141 YCDDDFCAATYNGVLQGCTKDLPCQYSVVYGDGSSTAGFFVKDNLQF---DRVTGNLQTS 197
Query: 171 ----DVVFGCGFSTNRNFKFS-----GIFGLGIGRSSLVSQLNSS------FSYCIGSLH 215
V+FGCG + S GI G G SS++SQL ++ F++C+
Sbjct: 198 SANGSVIFGCGAKQSGELGTSSEALDGILGFGQANSSMISQLAAAGKVKRVFAHCL---- 253
Query: 216 DPDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG 274
D + I G ++ + + TP+ HY + ++ I V G +L++ +IF D
Sbjct: 254 --DNVKGGGIFAIGEVVSPKVNTTPMVPNQPHYNVVMKEIEVGGNVLELPTDIFDTGDR- 310
Query: 275 GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTV 334
G +IDSGT + +L + YE++ +++ G ++ ++ ++ + +GFP V
Sbjct: 311 RGTIIDSGTTLAYLPEVVYESMMTKIVSEQPG--LKLHTVEEQFTCFQYTGNVNEGFPVV 368
Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR-YVNFSLIGMMAQQFYNVGYDI 393
+FHF G L + +Q + +C + + ++ + +L+G + V YD+
Sbjct: 369 KFHFNGSLSLTVNPHDYLFQIHEEVWCFGWQNSGMQSKDGRDMTLLGDLVLSNKLVLYDL 428
Query: 394 GRKQKTFQRMDC 405
+ + +C
Sbjct: 429 ENQAIGWTDYNC 440
>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 102/373 (27%), Positives = 166/373 (44%), Gaps = 35/373 (9%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
L+Y + +G PPV +DTGS +LWV C C C G F P SS+ + +
Sbjct: 74 LYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSSMI 133
Query: 115 PCDSEHCR---YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
C + C A CS+ ++C YT Y G TS + ++ + E ++
Sbjct: 134 ACSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTTNS 193
Query: 172 ---VVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDP 217
VVFGC G T + GIFG G S++SQL+S FS+C L
Sbjct: 194 TAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHC---LKGD 250
Query: 218 DYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
L+LG+ I++ T L HY + L++I+V+G+ L I+ ++F +S G
Sbjct: 251 SSGGGILVLGE--IVEPNIVYTSLVPAQPHYNLNLQSIAVNGQTLQIDSSVFATSNS-RG 307
Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRF 336
++DSGT + +L +EAY+ + + + + + CY I SS + FP V
Sbjct: 308 TIVDSGTTLAYLAEEAYDPFVSAITASIP-QSVHTVVSRGNQCYL-ITSSVTEVFPQVSL 365
Query: 337 HFRGGAKLALEKDSMFYQPRP--DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
+F GGA + L Q A + I + + +++G + + V YD+
Sbjct: 366 NFAGGASMILRPQDYLIQQNSIGGAAVWCIGFQKIQGQGI--TILGDLVLKDKIVVYDLA 423
Query: 395 RKQKTFQRMDCEV 407
++ + DC +
Sbjct: 424 GQRIGWANYDCSL 436
>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
Length = 474
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 115/402 (28%), Positives = 166/402 (41%), Gaps = 59/402 (14%)
Query: 31 ARLAYLQEKIRSHNTYQAQI--LPSND---ISNSLFYVNISIGQPPVPQFTAMDTGSSLL 85
ARL+ KI H ++ + LP+ I + V + +G P DTGS +
Sbjct: 104 ARLS----KISGHGIFEEMVTKLPAQSGIAIGTGNYVVTVGLGTPKEDFTLVFDTGSGIT 159
Query: 86 WVHCYPCR-DCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYAR--CSAYKHRCIYTQLY 142
W C PC C PQ F P++S+SY V C S C P + CSA C+Y +Y
Sbjct: 160 WTQCQPCLGSCYPQKEQKFDPTKSTSYNNVSCSSASCNLLPTSERGCSASNSTCLYQIIY 219
Query: 143 LIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSS---- 198
+ F +TE LT ++D T + +FGCG S N G+FG G
Sbjct: 220 GDQSYSQGFFATETLTISSSDVFT----NFLFGCGQSNN------GLFGQAAGLLGLSSS 269
Query: 199 -------LVSQLNSSFSYCIGSL-HDPDYLHNKLILGDGAIIDEGDATPLQ-FIDGHYYI 249
+ FSYC+ S YL+ G + TP+ Y I
Sbjct: 270 SVSLPSQTAEKYQKQFSYCLPSTPSSTGYLNF-----GGKVSQTAGFTPISPAFSSFYGI 324
Query: 250 TLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQM 309
+ ISV G L I+P+IF G +IDSGT +T L AY+AL++ E+M
Sbjct: 325 DIVGISVAGSQLPIDPSIFTTS----GAIIDSGTVITRLPPTAYKALKEAF-----DEKM 375
Query: 310 RSY--SWPDKL---CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP-DAFCMA 363
+Y + D+L CY + FP V F+GG ++ ++ + Y C+A
Sbjct: 376 SNYPKTNGDELLDTCYD-FSNYTTVSFPKVSVSFKGGVEVDIDASGILYLVNGVKMVCLA 434
Query: 364 VNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ N F + G Q+ Y V YD + F C
Sbjct: 435 F---AANKDDSEFGIFGNHQQKTYEVVYDGAKGMIGFAAGAC 473
>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
Length = 425
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 101/355 (28%), Positives = 160/355 (45%), Gaps = 28/355 (7%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ V +IG P P A+DT + W+ C C CS + +F PS+SSS + C++
Sbjct: 88 YIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGCSSSV--LFDPSKSSSSRTLQCEAPQ 145
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC-GFS 179
C+ P C+ K C + Y G +++ + LT ++ + + FGC +
Sbjct: 146 CKQAPNPSCTVSKS-CGFNMTYG-GSTIEAYLTQDTLTL-----ASDVIPNYTFGCINKA 198
Query: 180 TNRNFKFSGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
+ + G+ GLG G SL+SQ S+FSYC+ + ++ L LG
Sbjct: 199 SGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNF-SGSLRLGPKNQPIRI 257
Query: 236 DATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRD-DSGGGVMIDSGTDVTWLVKE 291
TPL YY+ L I V +++DI + D +G G + DSGT T LV+
Sbjct: 258 KTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGTVYTRLVEP 317
Query: 292 AYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSM 351
AY A+R+E R++ S D CY G + FP+V F F G + L D++
Sbjct: 318 AYVAVRNEFRRRVKNANATSLGGFDT-CYSGSVV-----FPSVTFMF-AGMNVTLPPDNL 370
Query: 352 F-YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ + C+A+ A +N V ++I M QQ + V D+ + R C
Sbjct: 371 LIHSSAGNLSCLAMAAAPVNVNSV-LNVIASMQQQNHRVLIDVPNSRLGISRETC 424
>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 115/429 (26%), Positives = 177/429 (41%), Gaps = 65/429 (15%)
Query: 22 VQRGINISIARLAYLQ--------------EKIRSHNTYQAQILPSNDISNSLFYVNISI 67
++R + S AR A L ++ H + PS D+ + ++++I
Sbjct: 53 IRRAMQRSKARAAALSVARSGSGRVPGKSAQQGEQHQQPGVPVRPSGDLE---YLIDLAI 109
Query: 68 GQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYA 127
G PP P +DTGS L+W C PC C Q +F P+ SSSY + C + C +
Sbjct: 110 GTPPQPVSALLDTGSDLIWTQCAPCASCLAQPDPLFAPAASSSYVPMRCSGQLCNDILHH 169
Query: 128 RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG-FSTNRNFKF 186
C C Y Y G T +TE+ TF ++ + V + FGCG +
Sbjct: 170 SCQ-RPDTCTYRYNYGDGTTTLGVYATERFTFASSSGEKLSV-PLGFGCGTMNVGSLNNG 227
Query: 187 SGIFGLGIGRSSLVSQLN-SSFSYCI-------------GSLHDPDYLHNKLILGDGAII 232
SGI G G SLVSQL+ FSYC+ GSL D + GD A
Sbjct: 228 SGIVGFGRDPLSLVSQLSIRRFSYCLTPYTSTRKSTLMFGSLSD------GVFEGDDAAT 281
Query: 233 DEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTW- 287
+ T L + YY+ ++V R L I + F R D GGV++DSGT +T
Sbjct: 282 GQVQTTRLLQSRQNPTFYYVPFTGVTVGTRRLRIPLSAFALRPDGSGGVIVDSGTALTLF 341
Query: 288 ---LVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIM--------SSDLKGFPTVRF 336
++ E A R ++ + S S D +C+ M ++ + P + F
Sbjct: 342 PAAVLTEVLRAFRAQLRLPFTS----SSSPDDGVCFATPMAAGGRRASAATVVSVPRMAF 397
Query: 337 HFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
HF+G ++ + PR + C+ + + + + IG QQ V YD+ +
Sbjct: 398 HFQGADLELPRRNYVLDDPRRGSLCILLADSGDSG-----ATIGNFVQQDMRVLYDLEAE 452
Query: 397 QKTFQRMDC 405
+F C
Sbjct: 453 TLSFAPAQC 461
>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
Length = 461
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 108/406 (26%), Positives = 171/406 (42%), Gaps = 60/406 (14%)
Query: 42 SHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGT 101
SH I + + V++++G PP P +DTGS L+W C PCRDC Q
Sbjct: 73 SHAVRAGLGAGGGGIVTNEYLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFHQGLP 132
Query: 102 IFYPSRSSSYAYVPCDSEHCRYFPYARCSA--------YKHRCIYTQLYLIGPETSVFVS 153
+ P+ SS+YA +PC + CR P+ C C Y Y T ++
Sbjct: 133 LLDPAASSTYAALPCGAPRCRALPFTSCGGGGRSSWGNGNRSCAYIYHYGDKSVTVGEIA 192
Query: 154 TEQLTFKNTD---ESTIHVQDVVFGCGFSTNRNFKF--SGIFGLGIGRSSLVSQLN-SSF 207
T++ TF + +S + + + FGCG F+ +GI G G GR SL SQLN ++F
Sbjct: 193 TDRFTFGGDNGDGDSRLPTRRLTFGCGHFNKGVFQSNETGIAGFGRGRWSLPSQLNVTTF 252
Query: 208 SYCIGSLHDPDYLHNKLILGDGA------------IIDEGDATPLQFIDGH---YYITLE 252
SYC S+ + + L+ GA I E TPL Y+++L+
Sbjct: 253 SYCFTSMFES---KSSLVTLGGAPAAALLYSHAAHISGEVRTTPLLKNPSQPSLYFLSLK 309
Query: 253 AISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIR--------L 304
ISV L + P R +IDSG +T L + YEA++ E + +
Sbjct: 310 GISVGKTRLAV-PEAKLRS-----TIIDSGASITTLPEAVYEAVKAEFAAQVGLPPTGVV 363
Query: 305 EGEQMRSYSWPDKLCYHGIMSSDLK--GFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCM 362
EG + LC+ +++ + P++ H G + +F C+
Sbjct: 364 EGSAL-------DLCFALPVTALWRRPPVPSLTLHLDGADWELPRGNYVFEDLAARVMCV 416
Query: 363 AVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
++ A + ++IG QQ +V YD+ +F C+ L
Sbjct: 417 VLDAAPGDQ-----TVIGNFQQQNTHVVYDLENDWLSFAPARCDSL 457
>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 101/355 (28%), Positives = 160/355 (45%), Gaps = 28/355 (7%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ V +IG P P A+DT + W+ C C CS + +F PS+SSS + C++
Sbjct: 88 YIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGCSSSV--LFDPSKSSSSRTLQCEAPQ 145
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC-GFS 179
C+ P C+ K C + Y G +++ + LT ++ + + FGC +
Sbjct: 146 CKQAPNPSCTVSKS-CGFNMTYG-GSTIEAYLTQDTLTL-----ASDVIPNYTFGCINKA 198
Query: 180 TNRNFKFSGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
+ + G+ GLG G SL+SQ S+FSYC+ + ++ L LG
Sbjct: 199 SGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNF-SGSLRLGPKNQPIRI 257
Query: 236 DATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRD-DSGGGVMIDSGTDVTWLVKE 291
TPL YY+ L I V +++DI + D +G G + DSGT T LV+
Sbjct: 258 KTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGTVYTRLVEP 317
Query: 292 AYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSM 351
AY A+R+E R++ S D CY G + FP+V F F G + L D++
Sbjct: 318 AYVAVRNEFRRRVKNANATSLGGFDT-CYSGSVV-----FPSVTFMF-AGMNVTLPPDNL 370
Query: 352 F-YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ + C+A+ A +N V ++I M QQ + V D+ + R C
Sbjct: 371 LIHSSAGNLSCLAMAAAPVNVNSV-LNVIASMQQQNHRVLIDVPNSRLGISRETC 424
>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
gi|223948083|gb|ACN28125.1| unknown [Zea mays]
gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
Length = 466
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 115/433 (26%), Positives = 188/433 (43%), Gaps = 57/433 (13%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDIS--- 57
L+HR SP + E P+H G + R A + K+ S A+ L + ++
Sbjct: 63 LVHRHGPCSPVMS-KEKPSHEETLGRDQ--LRAANIHAKLSSPRNSSAKELQQSGVTIPT 119
Query: 58 -------NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC--RDCSPQLGTIFYPSRS 108
+ + +S+G P V Q ++DTGS + WV C PC + CS Q +F P++S
Sbjct: 120 SSGYSLGTPEYVITVSLGTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKDKLFDPAKS 179
Query: 109 SSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIH 168
++Y+ C S C C Y Y+ T+ ++ L +D
Sbjct: 180 ATYSAFSCSSAQCAQLGGEGNGCLNSHCQYIVKYVDHSNTTGTYGSDTLGLTTSDA---- 235
Query: 169 VQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNSS----FSYCI--GSLHDPDYLH 221
V++ FGC N + G+ GLG SLVSQ ++ FSYC+ S +L
Sbjct: 236 VKNFQFGCSHRANGFVGQLDGLMGLGGDTESLVSQTAATYGKAFSYCLPPSSSSAGGFLT 295
Query: 222 NKLILGDGAIIDEGDATPL-QF-IDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
G G TPL +F + Y + L+AI+V G L++ ++F G ++
Sbjct: 296 LGAAAG-GTSSSRYSRTPLVRFNVPTFYGVFLQAITVAGTKLNVPASVFS-----GASVV 349
Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS--DLKGFPTVR-- 335
DSGT +T L AY+ALR ++M++Y + GI+ + D G TVR
Sbjct: 350 DSGTVITQLPPTAYQALRTAFK-----KEMKAYPSAAPV---GILDTCFDFSGIKTVRVP 401
Query: 336 ---FHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
F GA + L+ +FY A C+A + + + ++G + Q+ + + +D
Sbjct: 402 VVTLTFSRGAVMDLDVSGIFY-----AGCLAFTATAQDG---DTGILGNVQQRTFEMLFD 453
Query: 393 IGRKQKTFQRMDC 405
+G F+ C
Sbjct: 454 VGGSTLGFRPGAC 466
>gi|223942467|gb|ACN25317.1| unknown [Zea mays]
gi|413936886|gb|AFW71437.1| pepsin A [Zea mays]
Length = 491
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 103/381 (27%), Positives = 164/381 (43%), Gaps = 40/381 (10%)
Query: 51 LPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYP 105
LP++ L+Y I IG PP + +DTGS +LWV+C C C + G T + P
Sbjct: 77 LPTD---TGLYYTRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPTRSGLGIELTQYDP 133
Query: 106 SRSSSYAYVPCDSEHCRYFPYA----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKN 161
+ S + V C+ E C C + C + Y G T+ F T+ + +
Sbjct: 134 AGSGT--TVGCEQEFCVANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQYNQ 191
Query: 162 TD---ESTIHVQDVVFGCGFST-----NRNFKFSGIFGLGIGRSSLVSQLNSS------F 207
++T + FGCG + N GI G G SS++SQL ++ F
Sbjct: 192 VSGNGQTTTSNASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIF 251
Query: 208 SYCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPN 266
++C+ D + I G ++ + TPL HY + L+ ISV G L + +
Sbjct: 252 AHCL------DTVRGGGIFAIGNVVQPKVKTTPLVPNVTHYNVNLQGISVGGATLQLPTS 305
Query: 267 IFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS 326
F DS G +IDSGT + +L +E Y L V + + + +Y D +C+ S
Sbjct: 306 TFDSGDS-KGTIIDSGTTLAYLPREVYRTLLAAVFDKYQDLPLHNYQ--DFVCFQFSGSI 362
Query: 327 DLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR-YVNFSLIGMMAQQ 385
D GFP + F F G L + D +Q R D +CM + + + L+G +
Sbjct: 363 D-DGFPVITFSFEGDLTLNVYPDDYLFQNRNDLYCMGFLDGGVQTKDGKDMLLLGDLVLS 421
Query: 386 FYNVGYDIGRKQKTFQRMDCE 406
V YD+ ++ + +C
Sbjct: 422 NKLVVYDLEKEVIGWTDYNCS 442
>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
Length = 445
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 101/365 (27%), Positives = 170/365 (46%), Gaps = 35/365 (9%)
Query: 63 VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR 122
+ +SIG PP P+ +DTGS L+W C + ++ P++SSS+A PCD C
Sbjct: 91 LTVSIGTPPQPRTLILDTGSDLIWTQCKLFDTRQHREKPLYDPAKSSSFAAAPCDGRLCE 150
Query: 123 YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNR 182
+ + +++CIYT Y T +++E TF ++ + FGCG T+
Sbjct: 151 TGSFNTKNCSRNKCIYTYNY-GSATTKGELASETFTFGEHRRVSVSLD---FGCGKLTSG 206
Query: 183 NFK-FSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT-P 239
+ SGI G+ R SLVSQL FSYC+ D + + + G A + + T P
Sbjct: 207 SLPGASGILGISPDRLSLVSQLQIPRFSYCLTPFLDRNTT-SHIFFGAMADLSKYRTTGP 265
Query: 240 LQFI------DG---HYYITLEAISVDGRMLDINPNIFK--RDDSGGGVMIDSGTDVTWL 288
+Q DG +YY+ L ISV + L++ + F RD S GG +DSG L
Sbjct: 266 IQTTSLVTNPDGSNYYYYVPLIGISVGTKRLNVPVSSFAIGRDGS-GGTFVDSGDTTGML 324
Query: 289 VKEAYEALRDEVM--IRLEGEQMRSYSWPDKLCYH------GIMSSDLKGFPTVRFHFRG 340
EAL++ ++ ++L + + +LC+ G + + ++ P + +HF G
Sbjct: 325 PSVVMEALKEAMVEAVKLPVVNATDHGYEYELCFQLPRNGGGAVETAVQ-VPPLVYHFDG 383
Query: 341 GAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
GA + L +DS + C+ ++ + ++IG QQ +V +D+ + +F
Sbjct: 384 GAAMLLRRDSYMVEVSAGRMCLVISSGARG------AIIGNYQQQNMHVLFDVENHEFSF 437
Query: 401 QRMDC 405
C
Sbjct: 438 APTQC 442
>gi|26451756|dbj|BAC42973.1| unknown protein [Arabidopsis thaliana]
Length = 442
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 110/380 (28%), Positives = 166/380 (43%), Gaps = 53/380 (13%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
N V +++G PP +DTGS L W+HC SP LG++F P SS+Y+ VPC
Sbjct: 62 NVTLTVTLAVGDPPQNISMVLDTGSELSWLHCKK----SPNLGSVFNPVSSSTYSPVPCS 117
Query: 118 SEHCRY----FPY-ARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
S CR P A C H C Y TS+ + TF ++
Sbjct: 118 SPICRTRTRDLPIPASCDPKTHLCHVAISY--ADATSIEGNLAHETFV---IGSVTRPGT 172
Query: 173 VFGC---GFSTN--RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLIL 226
+FGC G S+N + K +G+ G+ G S V+QL S FSYCI + L+L
Sbjct: 173 LFGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFSKFSYCISGSDSSVF----LLL 228
Query: 227 GDGAIIDEG---------DATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
GD + G +TPL + D Y + LE I V ++L + ++F D +G G
Sbjct: 229 GDASYSWLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAG 288
Query: 277 -VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD-------KLCYH--GIMSS 326
M+DSGT T+L+ Y AL++E + + + +R PD LCY
Sbjct: 289 QTMVDSGTQFTFLMGPVYTALKNEFITQTK-SVLRLVDDPDFVFQGTMDLCYKVGSTTRP 347
Query: 327 DLKGFPTVRFHFRG------GAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIG 380
+ G P V FRG G KL + + + + +C + + + +IG
Sbjct: 348 NFSGLPMVSLMFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLG--IEAFVIG 405
Query: 381 MMAQQFYNVGYDIGRKQKTF 400
QQ + +D+ + + F
Sbjct: 406 HHHQQNVWMEFDLAKSRVGF 425
>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 480
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 90/355 (25%), Positives = 159/355 (44%), Gaps = 40/355 (11%)
Query: 39 KIRSHNTYQ-AQILPSNDI---------SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVH 88
+++SH++++ A++L + D+ S L++ I +G PP + +DTGS +LWV+
Sbjct: 45 ELKSHDSFRHARMLANIDLPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVN 104
Query: 89 CYPCRDC--SPQLG---TIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYL 143
C PC C LG +++ SS+ V C+ C + + K C Y +Y
Sbjct: 105 CAPCPKCPVKTDLGIPLSLYDSKASSTSKNVGCEDAFCSFIMQSETCGAKKPCSYHVVYG 164
Query: 144 IGPETSVFVSTEQLTFKNTD---ESTIHVQDVVFGCGFSTNRNF-----KFSGIFGLGIG 195
G + + +T + Q+VVFGCG + + GI G G
Sbjct: 165 DGSTSDGDFVKDNITLDQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTESAVDGIMGFGQS 224
Query: 196 RSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAIIDE-GDATPLQFIDGHYY 248
+S++SQL + FS+C+ D ++ I G + TPL HY
Sbjct: 225 NTSVISQLAAGGSVKRIFSHCL------DNMNGGGIFAIGEVESPVVKTTPLVPNQVHYN 278
Query: 249 ITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQ 308
+ L+ + VDG +D+ P++ + GG +IDSGT + +L + Y +L +++ + +Q
Sbjct: 279 VILKGMDVDGEPIDLPPSL-ASTNGDGGTIIDSGTTLAYLPQNLYNSLIEKITAK---QQ 334
Query: 309 MRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMA 363
++ + + S+ K FP V HF KL++ + R D +C
Sbjct: 335 VKLHMVQETFACFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYLFSLREDMYCFG 389
>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 535
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 107/379 (28%), Positives = 171/379 (45%), Gaps = 50/379 (13%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+++++ +G PP +DTGS L W+ C PC DC Q G + P S+SY + C+ +
Sbjct: 170 YFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQNGAFYDPKASASYKNITCNDQR 229
Query: 121 CRYF----PYARCSAYKHRCIYTQLYLIGPETS----VFVSTEQLTFKNTDESTIHVQDV 172
C P C + C Y Y T+ V T LT +V+++
Sbjct: 230 CNLVSSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSELYNVENM 289
Query: 173 VFGCGFSTNRNFKFSGIF-------GLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLH 221
+FGCG NR G+F GLG G S SQL S SFSYC+ + +
Sbjct: 290 MFGCG-HWNR-----GLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVS 343
Query: 222 NKLILG-DGAIIDEGDATPLQFIDGH-------YYITLEAISVDGRMLDINPNIFK-RDD 272
+KLI G D ++ + F+ G YY+ +++I V G +L+I + D
Sbjct: 344 SKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETWNISSD 403
Query: 273 SGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQ--MRSYSWPDKLCYH--GIMSSDL 328
GG +IDSGT +++ + AYE +++++ + +G+ R + D C++ GI + L
Sbjct: 404 GAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDP-CFNVSGIHNVQL 462
Query: 329 KGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAV--NPASINNRYVNFSLIGMMAQQF 386
P + F GA ++ F D C+A+ P S FS+IG QQ
Sbjct: 463 ---PELGIAFADGAVWNFPTENSFIWLNEDLVCLAMLGTPKSA------FSIIGNYQQQN 513
Query: 387 YNVGYDIGRKQKTFQRMDC 405
+++ YD R + + C
Sbjct: 514 FHILYDTKRSRLGYAPTKC 532
>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 115/374 (30%), Positives = 173/374 (46%), Gaps = 25/374 (6%)
Query: 42 SHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC---RDCSPQ 98
S N+ A + ++ I +GQP F DTGS + W+ C PC C Q
Sbjct: 165 STNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQ 224
Query: 99 LGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLT 158
+G IF P SSSY+ + CDSE C A C A + CIY Y G T ++TE +
Sbjct: 225 IGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDA--NSCIYEVEYGDGSFTVGELATETFS 282
Query: 159 FKNTDESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHD 216
F++++ + ++ GCG F +G+ GLG G SL SQL +SFSYC L D
Sbjct: 283 FRHSNS----IPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLEATSFSYC---LVD 335
Query: 217 PDYLHNKLILGDGAIIDEGDATPLQFID---GHYYITLEAISVDGRMLDINPNIFKRDDS 273
D + + + + +PL D Y+ + +SV G+ L I+ + F+ D+S
Sbjct: 336 LDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDES 395
Query: 274 G-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFP 332
G GG+++DSGT +T + + Y+ LRD + + P CY S+++ P
Sbjct: 396 GSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVE-VP 454
Query: 333 TVRFHFRGGAKLAL-EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
T+ F G L L K+ +F FC+A P++ S+IG + QQ V Y
Sbjct: 455 TIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTF-----PLSIIGNVQQQGIRVSY 509
Query: 392 DIGRKQKTFQRMDC 405
D+ F C
Sbjct: 510 DLANSLVGFSTDKC 523
>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
Length = 452
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 114/421 (27%), Positives = 178/421 (42%), Gaps = 44/421 (10%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
LIH S SP+ PN + I RL +L+ RS +P S
Sbjct: 56 LIHIYSECSPFRPPNRTWESLMSEKIRGDANRLRFLKRTSRSSKQDANANVPVRSGSGE- 114
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ + + G P +T +DTGS + W+ C C+ C IF P++SSSY CDS+
Sbjct: 115 YIIQVDFGTPKQSMYTLIDTGSDVAWIPCKQCQGCH-STAPIFDPAKSSSYKPFACDSQP 173
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C+ C +C + Y G + ++++ +T + ++ + FGC S
Sbjct: 174 CQEI-SGNCGG-NSKCQFEVSYGDGTQVDGTLASDAITLGSQ-----YLPNFSFGCAESL 226
Query: 181 NRNFKFS-------GIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIID 233
+ + S G + ++ +FSYC L L+LG A +
Sbjct: 227 SEDTSPSPGLMGLGGGSLSLLTQAPTAELFGGTFSYC---LPSSSTSSGSLVLGKEAAVS 283
Query: 234 EGDATPLQF--------IDGHYYITLEAISVDGRMLDI-NPNIFKRDDSGGGVMIDSGTD 284
++ L+F I Y++TL+AISV + + NI SGGG +IDSGT
Sbjct: 284 ---SSSLKFTTLIKDPSIPTFYFVTLKAISVGNTRISVPGTNI----ASGGGTIIDSGTT 336
Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKL 344
+T LV AY ALRD +L Q D CY +SS PT+ H L
Sbjct: 337 ITHLVPSAYTALRDAFRQQLSSLQPTPVEDMDT-CYD--LSSSSVDVPTITLHLDRNVDL 393
Query: 345 ALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
L K+++ C+A +S ++R S+IG + QQ + + +D+ Q F +
Sbjct: 394 VLPKENILITQESGLACLAF--SSTDSR----SIIGNVQQQNWRIVFDVPNSQVGFAQEQ 447
Query: 405 C 405
C
Sbjct: 448 C 448
>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
Length = 401
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 98/323 (30%), Positives = 149/323 (46%), Gaps = 38/323 (11%)
Query: 54 NDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAY 113
N + + + V+++IG PP P +DTGS L+W C PC C Q F PS SS+ +
Sbjct: 75 NGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSL 134
Query: 114 VPCDSEHCRYFPYARCSAYK----HRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHV 169
CDS C+ P A C + K C+YT Y T+ F+ ++ TF S V
Sbjct: 135 TSCDSTLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGAS---V 191
Query: 170 QDVVFGCGFSTNRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLIL 226
V FGCG N FK +GI G G G SL SQL +FS+C +++ L +L
Sbjct: 192 PGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVNG---LKPSTVL 248
Query: 227 ----------GDGAIIDEGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDS 273
G GA+ +TPL + YY++L+ I+V L + + F +
Sbjct: 249 LDLPADLYKSGRGAV----QSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNG 304
Query: 274 GGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG--- 330
GG +IDSGT +T L Y +RD +++ + + C +S+ L+
Sbjct: 305 TGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFC----LSAPLRAKPY 360
Query: 331 FPTVRFHFRGGAKLALEKDSMFY 353
P + HF GA + L +++ +
Sbjct: 361 VPKLVLHFE-GATMDLPRENYVW 382
>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
Length = 478
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 89/355 (25%), Positives = 161/355 (45%), Gaps = 40/355 (11%)
Query: 39 KIRSHNTYQ-AQILPSNDI---------SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVH 88
+++SH++++ A++L + D+ S L++ I +G PP + +DTGS +LWV+
Sbjct: 42 ELKSHDSFRHARMLANIDLPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVN 101
Query: 89 CYPCRDC--SPQLG---TIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYL 143
C PC C LG +++ SS+ V C+ + C + + K C Y +Y
Sbjct: 102 CAPCPKCPVKTDLGIPLSLYDSKTSSTSKNVGCEDDFCSFIMQSETCGAKKPCSYHVVYG 161
Query: 144 IGPETSVFVSTEQLTFKNTD---ESTIHVQDVVFGCGFSTNRNF-----KFSGIFGLGIG 195
G + + +T + + Q+VVFGCG + + GI G G
Sbjct: 162 DGSTSDGDFIKDNITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQS 221
Query: 196 RSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAIIDE-GDATPLQFIDGHYY 248
+S++SQL + FS+C+ D ++ I G + TP+ HY
Sbjct: 222 NTSIISQLAAGGSTKRIFSHCL------DNMNGGGIFAVGEVESPVVKTTPIVPNQVHYN 275
Query: 249 ITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQ 308
+ L+ + VDG +D+ P++ + GG +IDSGT + +L + Y +L +++ + +Q
Sbjct: 276 VILKGMDVDGDPIDLPPSL-ASTNGDGGTIIDSGTTLAYLPQNLYNSLIEKITAK---QQ 331
Query: 309 MRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMA 363
++ + + S+ K FP V HF KL++ + R D +C
Sbjct: 332 VKLHMVQETFACFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYLFSLREDMYCFG 386
>gi|357439021|ref|XP_003589787.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355478835|gb|AES60038.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 456
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 115/423 (27%), Positives = 171/423 (40%), Gaps = 72/423 (17%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHN-----------TYQAQ 49
L HRD+I N R IN I R+ +L ++ + ++ +
Sbjct: 62 LFHRDNI----NLKKTTHKTRFISRINRDIKRVTFLLNRLNKNTQEQQTTTATEASFGSD 117
Query: 50 ILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSS 109
++ + + ++V I IG P + Q+ +D+GS ++W+ C PC C Q IF P+ S+
Sbjct: 118 VVSGTEEGSGEYFVRIGIGSPAIYQYMVIDSGSDIVWIQCEPCDQCYNQTDPIFNPATSA 177
Query: 110 SYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHV 169
S+ V C S C + K RC Y Y G T ++ E +T T +
Sbjct: 178 SFIGVACSSNVCNQLD-DDVACRKGRCGYQVAYGDGSYTKGTLALETITIGRT-----VI 231
Query: 170 QDVVFGCGFSTNRNFKFSGIFGLGIGRS-SLVSQLNS----SFSYC-------IGSLHDP 217
QD GCG F + G S V QL + +F YC +G++ P
Sbjct: 232 QDTAIGCGHWNEGMFVGAAGLLGLGGGPMSFVGQLGAQTGGAFGYCLVSRAMPVGAMWVP 291
Query: 218 DYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GG 276
+HN F YY++L ++V G + I+ IF+ D G GG
Sbjct: 292 -LIHNP------------------FYPSFYYVSLSGLAVGGIRVPISEQIFQLTDIGTGG 332
Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF----- 331
V++D+GT +T L AY A RD + + CY DL GF
Sbjct: 333 VVMDTGTAITRLPTVAYNAFRDAFIAQTTNLPRAPGVSIFDTCY------DLNGFVTVRV 386
Query: 332 PTVRFHFRGGAKLALEKDSMFYQPRPD--AFCMAVNPASINNRYVNFSLIGMMAQQFYNV 389
PTV F+F GG L + F P D FC A P+ S+IG + Q+ V
Sbjct: 387 PTVSFYFSGGQILTFPARN-FLIPADDVGTFCFAFAPSP-----SGLSIIGNIQQEGIQV 440
Query: 390 GYD 392
D
Sbjct: 441 SID 443
>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 482
Score = 118 bits (295), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 89/355 (25%), Positives = 161/355 (45%), Gaps = 40/355 (11%)
Query: 39 KIRSHNTYQ-AQILPSNDI---------SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVH 88
+++SH++++ A++L + D+ S L++ I +G PP + +DTGS +LWV+
Sbjct: 46 ELKSHDSFRHARMLANIDLPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVN 105
Query: 89 CYPCRDC--SPQLG---TIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYL 143
C PC C LG +++ SS+ V C+ + C + + K C Y +Y
Sbjct: 106 CAPCPKCPVKTDLGIPLSLYDSKTSSTSKNVGCEDDFCSFIMQSETCGAKKPCSYHVVYG 165
Query: 144 IGPETSVFVSTEQLTFKNTD---ESTIHVQDVVFGCGFSTNRNF-----KFSGIFGLGIG 195
G + + +T + + Q+VVFGCG + + GI G G
Sbjct: 166 DGSTSDGDFIKDNITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQS 225
Query: 196 RSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAIIDE-GDATPLQFIDGHYY 248
+S++SQL + FS+C+ D ++ I G + TP+ HY
Sbjct: 226 NTSIISQLAAGGSTKRIFSHCL------DNMNGGGIFAVGEVESPVVKTTPIVPNQVHYN 279
Query: 249 ITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQ 308
+ L+ + VDG +D+ P++ + GG +IDSGT + +L + Y +L +++ + +Q
Sbjct: 280 VILKGMDVDGDPIDLPPSL-ASTNGDGGTIIDSGTTLAYLPQNLYNSLIEKITAK---QQ 335
Query: 309 MRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMA 363
++ + + S+ K FP V HF KL++ + R D +C
Sbjct: 336 VKLHMVQETFACFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYLFSLREDMYCFG 390
>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
Length = 485
Score = 117 bits (294), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 107/360 (29%), Positives = 147/360 (40%), Gaps = 37/360 (10%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ V++ +G P DTGS L WV C PC DC Q +F PS SS+YA V C +
Sbjct: 149 YVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPCADCYEQQDPLFDPSLSSTYAAVACGAPE 208
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C+ + CS+ RC Y Y +T + + LT +D + VFGCG
Sbjct: 209 CQELDASGCSS-DSRCRYEVQYGDQSQTDGNLVRDTLTLSASDT----LPGFVFGCGDQN 263
Query: 181 NRNF-KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
F + G+FGLG + SL SQ S F+YC+ P + L G
Sbjct: 264 AGLFGQVDGLFGLGREKVSLPSQGAPSYGPGFTYCL-----PSSSSGRGYLSLGG-APPA 317
Query: 236 DATPLQFIDGH----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKE 291
+A DG YYI L I V GR + I F +IDSGT +T L
Sbjct: 318 NAQFTALADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGG---TVIDSGTVITRLPPR 374
Query: 292 AYEALRDEVMIRLEGEQMRSYSWPDKL-----CYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
AY LR M Y L CY PTV F GGA ++L
Sbjct: 375 AYAPLRAAFA-----RSMAQYKKAPALSILDTCYD-FTGHRTAQIPTVELAFAGGATVSL 428
Query: 347 EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
+ + Y + C+A P N + +++G Q+ + V YD+ ++ F C
Sbjct: 429 DFTGVLYVSKVSQACLAFAP---NADDSSIAILGNTQQKTFAVAYDVANQRIGFGAKGCS 485
>gi|358345197|ref|XP_003636668.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355502603|gb|AES83806.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 294
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 83/265 (31%), Positives = 125/265 (47%), Gaps = 28/265 (10%)
Query: 28 ISIARLAYLQEKIRSHNT-YQAQILPSNDISNSL---------------FYVNISIGQPP 71
ISI ++ I +HN + +++P N + + + +SIG PP
Sbjct: 10 ISILLFVFIFPHIEAHNGGFTGKLIPRNSSKDFFNRNTIQSPVSANHYDYLMELSIGTPP 69
Query: 72 VPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSA 131
V + DTGS L+W+ C PC +C QL +F SS+++ + C SE C CS
Sbjct: 70 VKIYAQADTGSDLIWLQCIPCTNCYKQLNPMFDSQSSSTFSNIACGSESCSKLYSTSCSP 129
Query: 132 YKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF--KFSGI 189
+ C Y Y+ G ET ++ E LT +T + + V+FGCG + N F K GI
Sbjct: 130 DQINCKYNYSYVDGSETQGVLAQETLTLTSTTGEPVAFKGVIFGCGHNNNGAFNDKEMGI 189
Query: 190 FGLGIGRSSLVSQLNSS-----FSYCIGSLHDPDYLHNKLILGDGA-IIDEG-DATPL-- 240
GLG G SLVSQ+ SS FS C+ + + + + G G+ ++ G +TPL
Sbjct: 190 IGLGRGPLSLVSQIGSSLGGNMFSQCLVPFNTNPSISSPMSFGKGSEVLGNGVVSTPLVS 249
Query: 241 -QFIDGHYYITLEAISVDGRMLDIN 264
Y++TL ISV+ L N
Sbjct: 250 KTTYQSFYFVTLLGISVEDINLPFN 274
>gi|22165126|gb|AAM93742.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|31433307|gb|AAP54836.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125575547|gb|EAZ16831.1| hypothetical protein OsJ_32302 [Oryza sativa Japonica Group]
Length = 405
Score = 117 bits (294), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 97/373 (26%), Positives = 159/373 (42%), Gaps = 42/373 (11%)
Query: 57 SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPC 116
S L+ N +IG PP P +D L+W C PC+ C Q +F P++SS++ +PC
Sbjct: 53 SQGLYVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPC 112
Query: 117 DSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
S C P + + CIY G +T T+ E+ + FGC
Sbjct: 113 GSHLCESIPESSRNCTSDVCIYEAPTKAG-DTGGKAGTDTFAIGAAKET------LGFGC 165
Query: 177 GFSTNRNFKF----SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAI 231
T++ K SGI GLG SLV+Q+N ++FSYC+ L LG A
Sbjct: 166 VVMTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVTAFSYCLAGKS-----SGALFLGATAK 220
Query: 232 -IDEGDATPLQFI------------DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVM 278
+ G + F+ + +Y + L I G L SG V+
Sbjct: 221 QLAGGKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKTGGAPLQ------AASSSGSTVL 274
Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHF 338
+D+ + ++L AY+AL+ + + + + S P LC+ ++ D P + F F
Sbjct: 275 LDTVSRASYLADGAYKALKKALTAAVGVQPVASPPKPYDLCFPKAVAGDA---PELVFTF 331
Query: 339 RGGAKLALEKDSMFYQPRPDAFCMAV-NPASIN--NRYVNFSLIGMMAQQFYNVGYDIGR 395
GGA L + + C+ + + AS+N S++G + Q+ +V +D+
Sbjct: 332 DGGAALTVPPANYLLASGNGTVCLTIGSSASLNLTGELEGASILGSLQQENVHVLFDLKE 391
Query: 396 KQKTFQRMDCEVL 408
+ +F+ DC L
Sbjct: 392 ETLSFKPADCSSL 404
>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
Length = 463
Score = 117 bits (294), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 102/360 (28%), Positives = 154/360 (42%), Gaps = 38/360 (10%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD--CSPQLGTIFYPSRSSSYAYVPCDS 118
+ +++ +G P V Q +DTGS + WV C PC + C Q G +F P++SS+Y V C +
Sbjct: 127 YVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPCHAQTGALFDPAKSSTYRAVSCAA 186
Query: 119 EHCRYFPYA--RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
C C A + C Y Y G T+ S + LT ++ V+ FGC
Sbjct: 187 AECAQLEQQGNGCGATNYECQYGVQYGDGSTTNGTYSRDTLTLSGASDA---VKGFQFGC 243
Query: 177 -----GFSTNRNFKFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILG 227
GFS + G+ GLG G SLVSQ +SFSYC+ G
Sbjct: 244 SHLESGFSDQTD----GLMGLGGGAQSLVSQTAAAYGNSFSYCLPPTSGSSGFLTLGGGG 299
Query: 228 DGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
+ + I Y L+ I+V G+ L ++P++F G ++DSGT +T
Sbjct: 300 GASGFVTTRMLRSKQIPTFYGARLQDIAVGGKQLGLSPSVFA-----AGSVVDSGTIITR 354
Query: 288 LVKEAYEALRDEVMIRLEGEQMRSYSWPDKL--CYHGIMSSDLKGFPTVRFHFRGGAKLA 345
L AY AL + +Q RS L C+ + + PTV F GGA +
Sbjct: 355 LPPTAYSALSSAFKAGM--KQYRSAPARSILDTCFDFAGQTQIS-IPTVALVFSGGAAID 411
Query: 346 LEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
L+ + + Y C+A + +IG + Q+ + V YD+G F+ C
Sbjct: 412 LDPNGIMY-----GNCLAFAATGDDG---TTGIIGNVQQRTFEVLYDVGSSTLGFRSGAC 463
>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 117 bits (293), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 102/320 (31%), Positives = 143/320 (44%), Gaps = 38/320 (11%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
L+Y I +G PP + +DTGS +LWV C C C G F P S + V
Sbjct: 80 LYYTKIRLGSPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTATPV 139
Query: 115 PCDSEHCRYFPYAR---CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI---H 168
C + C + + CS + C YT Y G TS F ++ L F S++
Sbjct: 140 SCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNS 199
Query: 169 VQDVVFGCGFSTNRNFKFS-----GIFGLGIGRSSLVSQLNSS------FSYCIGSLHDP 217
VVFGC S + S GIFG G S++SQL S FS+C L
Sbjct: 200 TAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGLAPRVFSHC---LKGE 256
Query: 218 DYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
+ L+LG+ I++ TPL HY + L +ISV+G+ L INP++F + G G
Sbjct: 257 NGGGGILVLGE--IVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSN-GQG 313
Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRF 336
+ID+GT + +L + AY E + + +R CY I +S FP V
Sbjct: 314 TIIDTGTTLAYLSEAAYVPFV-EAITNAVSQSVRPVVSKGNQCYV-IATSVADIFPPVSL 371
Query: 337 HFRGGAKLALEKDSMFYQPR 356
+F GGA SMF P+
Sbjct: 372 NFAGGA-------SMFLNPQ 384
>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
Length = 519
Score = 117 bits (293), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 99/357 (27%), Positives = 151/357 (42%), Gaps = 29/357 (8%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYVPCDSE 119
+ V + +G P DTGS WV C PC C Q +F P+RSS+YA + C +
Sbjct: 180 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTYANISCAAP 239
Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
C CS C+Y Y G + F + + LT + D V+ FGCG
Sbjct: 240 ACSDLDTRGCSG--GNCLYGVQYGDGSYSIGFFAMDTLTLSSYDA----VKGFRFGCGER 293
Query: 180 TNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
F + +G+ GLG G++SL Q F++C+ + L G G+
Sbjct: 294 NEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSSGT---GYLDFGPGSPAAA 350
Query: 235 GD--ATPLQFIDG--HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVK 290
G TP+ +G YY+ + I V G++L I ++F G ++DSGT +T L
Sbjct: 351 GARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFTT----AGTIVDSGTVITRLPP 406
Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKL--CYHGIMSSDLKGFPTVRFHFRGGAKLALEK 348
AY +LR + + L CY S + PTV F+GGA+L ++
Sbjct: 407 AAYSSLRSAFASAMAARGYKKAPAVSLLDTCYDFTGMSQVA-IPTVSLLFQGGARLDVDA 465
Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ Y C+ + N + ++G + + V YDIG+K F C
Sbjct: 466 SGIMYAASVSQVCLGF---AANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 519
>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 463
Score = 117 bits (293), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 102/360 (28%), Positives = 154/360 (42%), Gaps = 38/360 (10%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD--CSPQLGTIFYPSRSSSYAYVPCDS 118
+ +++ +G P V Q +DTGS + WV C PC + C Q G +F P++SS+Y V C +
Sbjct: 127 YVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPCYAQTGALFDPAKSSTYRAVSCAA 186
Query: 119 EHCRYFPYA--RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
C C A + C Y Y G T+ S + LT ++ V+ FGC
Sbjct: 187 AECAQLEQQGNGCGATNYECQYGVQYGDGSTTNGTYSRDTLTLSGASDA---VKGFQFGC 243
Query: 177 -----GFSTNRNFKFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILG 227
GFS + G+ GLG G SLVSQ +SFSYC+ G
Sbjct: 244 SHVESGFSDQTD----GLMGLGGGAQSLVSQTAAAYGNSFSYCLPPTSGSSGFLTLGGGG 299
Query: 228 DGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
+ + I Y L+ I+V G+ L ++P++F G ++DSGT +T
Sbjct: 300 GVSGFVTTRMLRSRQIPTFYGARLQDIAVGGKQLGLSPSVFA-----AGSVVDSGTIITR 354
Query: 288 LVKEAYEALRDEVMIRLEGEQMRSYSWPDKL--CYHGIMSSDLKGFPTVRFHFRGGAKLA 345
L AY AL + +Q RS L C+ + + PTV F GGA +
Sbjct: 355 LPPTAYSALSSAFKAGM--KQYRSAPARSILDTCFDFAGQTQIS-IPTVALVFSGGAAID 411
Query: 346 LEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
L+ + + Y C+A + +IG + Q+ + V YD+G F+ C
Sbjct: 412 LDPNGIMY-----GNCLAFAATGDDG---TTGIIGNVQQRTFEVLYDVGSSTLGFRSGAC 463
>gi|255566008|ref|XP_002523992.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536719|gb|EEF38360.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 121/421 (28%), Positives = 180/421 (42%), Gaps = 38/421 (9%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
LIHRDS SP N +E R+ + S R+ + I N+ A PS + N
Sbjct: 41 LIHRDSPNSPLFNASETTDIRLANAVERSADRVNRFNDLIS--NSITAAEFPS-ILDNGD 97
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFY-PSRSSSYAYVPCDSE 119
F + ISIG PP + TGS L+W+ C + C+ F+ P SS+Y VPCDS
Sbjct: 98 FLMKISIGIPPTELLVNVATGSDLVWIPCLSFKPCTHNCDLRFFDPMESSTYKNVPCDSY 157
Query: 120 HCRYFPYARCSAYKHRCIYT---QLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
C+ A C C Y+ + P+ + + T LT +T + + + F C
Sbjct: 158 RCQITNAATCQF--SDCFYSCDPRHQDSCPDGDLAMDT--LTLNSTTGKSFMLPNTGFIC 213
Query: 177 GFSTNRNFKFSGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAII 232
G ++ GI GLG G SL+++ ++ FS+CI +KL GD A++
Sbjct: 214 GNRIGGDYPGVGILGLGHGSLSLLNRISHLIDGKFSHCIVPYSSNQ--TSKLSFGDKAVV 271
Query: 233 DEGDA---TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLV 289
G A T L G Y TL + I+ D G+ +DSGT T+
Sbjct: 272 -SGSAMFSTRLDMTGGPYSYTLSFYGISVGNKSISAGGIGSDYYMNGLGMDSGTMFTYFP 330
Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPD-----KLCYHGIMSSDLKGFPTVRFHFRGGAKL 344
+ Y L +V ++ E + +PD +LCY S D PT+ HF GG+ +
Sbjct: 331 EYFYSQLEYDVRYAIQQEPL----YPDPTRRLRLCYR--YSPDFSP-PTITMHFEGGS-V 382
Query: 345 ALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
L + F + D C+A +S V G Q +GYD+ +F + D
Sbjct: 383 ELSSSNSFIRMTEDIVCLAFATSSSEQDAV----FGYWQQTNLLIGYDLDAGFLSFLKTD 438
Query: 405 C 405
C
Sbjct: 439 C 439
>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 109/383 (28%), Positives = 173/383 (45%), Gaps = 38/383 (9%)
Query: 30 IARLAYLQEKIRSHNT--YQAQILPSNDIS-----NSLFYVNISIGQPPVPQFTAMDTGS 82
+ R+A L ++ S + Y+ + S+ +S + ++V I +G PP Q+ +D+GS
Sbjct: 5 VKRVASLIHRLSSGSAAKYEVEDFGSDVVSGMNQGSGEYFVRIGLGSPPRSQYMVIDSGS 64
Query: 83 SLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLY 142
++WV C PC C Q +F P+ S+S+ V C S C A C++ RC Y Y
Sbjct: 65 DIVWVQCKPCTQCYHQTDPLFDPADSASFMGVSCSSAVCDRVENAGCNS--GRCRYEVSY 122
Query: 143 LIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRS----- 197
G T ++ E LTF T V++V GCG S F + G S
Sbjct: 123 GDGSYTKGTLALETLTFGRT-----VVRNVAIGCGHSNRGMFVGAAGLLGLGGGSMSFMG 177
Query: 198 SLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLEAI 254
L Q ++FSYC+ + + L G A+ PL YYI L +
Sbjct: 178 QLSGQTGNAFSYCL--VSRGTNTNGFLEFGSEAMPVGAAWIPLVRNPRAPSFYYIRLLGL 235
Query: 255 SVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYS 313
V + ++ ++F+ ++ G GGV++D+GT VT AYEA R+ + + + S
Sbjct: 236 GVGDTRVPVSEDVFQLNELGSGGVVMDTGTAVTRFPTVAYEAFRNAFIEQTQNLPRASGV 295
Query: 314 WPDKLCYH--GIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDA--FCMAVNPASI 369
CY+ G +S + PTV F+F GG L + ++ F P DA FC A P+
Sbjct: 296 SIFDTCYNLFGFLSVRV---PTVSFYFSGGPILTIPANN-FLIPVDDAGTFCFAFAPSP- 350
Query: 370 NNRYVNFSLIGMMAQQFYNVGYD 392
S++G + Q+ + D
Sbjct: 351 ----SGLSILGNIQQEGIQISVD 369
>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 397
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 107/375 (28%), Positives = 164/375 (43%), Gaps = 63/375 (16%)
Query: 59 SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
S++ + + +G PP +DTGS L+W C PC +C Q IF PS+SS+
Sbjct: 59 SIYLMRLQLGTPPFEIVAEIDTGSDLIWTQCMPCPNCYTQFAPIFDPSKSST-------- 110
Query: 119 EHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
F RC + + C Y +Y ++ ++TE +T ++T + + GCG
Sbjct: 111 -----FKEKRC--HGNSCPYEIIYADESYSTGILATETVTIQSTSGEPFVMAETSIGCGL 163
Query: 179 STNRNF-------KFSGIFGLGIGRSSLVSQLN----SSFSYCIGS--LHDPDYLHNKLI 225
+ N N SGI GL +G SSL+SQ++ SYC S ++ N ++
Sbjct: 164 N-NSNLMTPGYAASSSGIVGLNMGPSSLISQMDLPIPGLISYCFSSQGTSKINFGTNAVV 222
Query: 226 LGDGAIIDEGDATPLQFIDG---HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSG 282
GDG + + FI YY+ L+A+SV + ++ F D G + IDSG
Sbjct: 223 AGDGTVAAD------MFIKKDQPFYYLNLDAVSVGDKRIETLGTPFHAQD--GNIFIDSG 274
Query: 283 TDVTWLVKEAYEALRDEVMIRLEGEQMRS--YSWPDKLCYHGIMSSDLKGFPTVRFHFRG 340
T T+L +Y L E + + S + LCY+ ++ FP + HF G
Sbjct: 275 TTYTYL-PTSYCNLVREAVAASVVAANQVPDPSSENLLCYNW---DTMEIFPVITLHFAG 330
Query: 341 GAKLALEKDSMFYQP-RPDAFCMAVN------PASINNRYVNFSLIGMMAQQFYNVGYDI 393
GA L L+K +M+ + FC+A+ PA NR N L VGYD
Sbjct: 331 GADLVLDKYNMYVETITGGTFCLAIGCVDPSMPAIFGNRAHNNLL----------VGYDS 380
Query: 394 GRKQKTFQRMDCEVL 408
+F +C L
Sbjct: 381 STLVISFSPTNCSAL 395
>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
Length = 456
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 108/393 (27%), Positives = 164/393 (41%), Gaps = 29/393 (7%)
Query: 31 ARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCY 90
AR + + R+ + PS D+ + V+++IG PP P +DTGS L+W C
Sbjct: 75 ARFSGKNDDQRTTPPTGVSVRPSGDLE---YVVDLAIGTPPQPVSALLDTGSDLIWTQCA 131
Query: 91 PCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSV 150
PC C Q +F P S+SY + C + C + C C Y Y G T
Sbjct: 132 PCASCLAQPDPLFAPGESASYEPMRCAGQLCSDILHHGCE-MPDTCTYRYNYGDGTMTMG 190
Query: 151 FVSTEQLTFKNTDESTIHVQDVVFGCG-FSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFS 208
+TE+ TF ++ + + FGCG + SGI G G SLVSQL+ FS
Sbjct: 191 VYATERFTFTSSGGDRLMTVPLGFGCGSMNVGSLNNGSGIVGFGRNPLSLVSQLSIRRFS 250
Query: 209 YCIGSLHDPDYLHNKLILGDGAIIDEGDAT-PLQFI--------DGHYYITLEAISVDGR 259
YC+ S + L+ G + GDAT P+Q YY+ L ++V R
Sbjct: 251 YCLTSYGSGR--KSTLLFGSLSGGVYGDATGPVQTTPLLQSLQNPTFYYVHLAGLTVGAR 308
Query: 260 MLDINPNIFK-RDDSGGGVMIDSGTDVTWL----VKEAYEALRDEVMIRLE--GEQMRSY 312
L I + F R D GGV++DSGT +T L + E A R ++ + G
Sbjct: 309 RLRIPESAFALRPDGSGGVIVDSGTALTLLPGAVLAEVVRAFRQQLRLPFANGGNPEDGV 368
Query: 313 SWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR 372
+ + S+ P + FHF+ ++ + R C+ + + +
Sbjct: 369 CFLVPAAWRRSSSTSQVPVPRMVFHFQDADLDLPRRNYVLDDHRKGRLCLLLADSGDDG- 427
Query: 373 YVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
S IG + QQ V YD+ + +F C
Sbjct: 428 ----STIGNLVQQDMRVLYDLEAETLSFAPAQC 456
>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 106/374 (28%), Positives = 171/374 (45%), Gaps = 40/374 (10%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+++++ IG PP +DTGS L W+ C PC DC Q G + P SSS+ + C
Sbjct: 90 YFMDVFIGTPPKHYSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKESSSFRNIGCHDPR 149
Query: 121 CRYF----PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDEST----IHVQDV 172
C P C A C Y Y T+ +TE T T + V++V
Sbjct: 150 CHLVSSPDPPLPCKAENQTCPYFYWYGDSSNTTGDFATETFTVNLTSPTGKSEFKRVENV 209
Query: 173 VFGCGFSTNRNFK-FSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILG 227
+FGCG F SG+ GLG G S SQL S SFSYC+ + + +KLI G
Sbjct: 210 MFGCGHWNRGLFHGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLIFG 269
Query: 228 DG-----------AIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-G 275
+ + G P +D YY+ +++I V G +L+I + + G G
Sbjct: 270 EDKDLLNHPELNFTTLVGGKENP---VDTFYYVQIKSIMVGGEVLNIPESTWNMTSDGVG 326
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQM-RSYSWPDKLCYH--GIMSSDLKGFP 332
G ++DSGT +++ + AY+ ++D + +++G + + + D CY+ G+ DL F
Sbjct: 327 GTIVDSGTTLSYFTEPAYQIIKDAFVKKVKGYPIVQDFPILDP-CYNVSGVEKIDLPDFG 385
Query: 333 TVRFHFRGGAKLALEKDSMFYQPRP-DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
+ F GA ++ F + P + C+A+ + S+IG QQ ++V Y
Sbjct: 386 IL---FADGAVWNFPVENYFIRLDPEEVVCLAI----LGTPRSALSIIGNYQQQNFHVLY 438
Query: 392 DIGRKQKTFQRMDC 405
D + + + M+C
Sbjct: 439 DTKKSRLGYAPMNC 452
>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
Length = 471
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 106/359 (29%), Positives = 161/359 (44%), Gaps = 33/359 (9%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DCSPQLGTIFYPSRSSSYAYVPCDSE 119
+ V + +G P DTGS L W C PC C PQ F P++S+SY + C SE
Sbjct: 132 YAVTVGLGTPKKDFSLLFDTGSDLTWTQCEPCSGGCFPQNDEKFDPTKSTSYKNLSCSSE 191
Query: 120 HCRYF--PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
C+ A+ + + C+Y Y G T F++TE LT +D ++ V GCG
Sbjct: 192 PCKSIGKESAQGCSSSNSCLYGVKYGTG-YTVGFLATETLTITPSDV----FENFVIGCG 246
Query: 178 FSTNRN-FKFSGIFG-LGIGRS--SLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDG 229
RN +FSG G LG+GRS +L SQ +S+ FSYC L L G G
Sbjct: 247 ---ERNGGRFSGTAGLLGLGRSPVALPSQTSSTYKNLFSYC---LPASSSSTGHLSFG-G 299
Query: 230 AIIDEGDATPLQF-IDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
+ TP+ I Y + + ISV GR L I+P++F+ G +IDSGT +T+L
Sbjct: 300 GVSQAAKFTPITSKIPELYGLDVSGISVGGRKLPIDPSVFRT----AGTIIDSGTTLTYL 355
Query: 289 VKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYH-GIMSSDLKGFPTVRFHFRGGAKLALE 347
A+ AL + + + + CY ++D P + F GG ++ ++
Sbjct: 356 PSTAHSALSSAFQEMMTNYTLTKGTSGLQPCYDFSKHANDNITIPQISIFFEGGVEVDID 415
Query: 348 KDSMFYQPRP-DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+F + C+A N + ++ G + Q+ Y V YD+ + F C
Sbjct: 416 DSGIFIAANGLEEVCLAFKD---NGNDTDVAIFGNVQQKTYEVVYDVAKGMVGFAPGGC 471
>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 475
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 100/373 (26%), Positives = 167/373 (44%), Gaps = 41/373 (10%)
Query: 18 PAHRVQRGINISIARLAYLQEKIRSHNTYQ--AQILPSNDISNSLFYVNISIGQPPVPQF 75
P R +R +N A A + +I S LP+ L++ + +G PP +
Sbjct: 28 PVERRKRSLNAVKAHDARRRGRILSAVDLNLGGNGLPTE---TGLYFTKLGLGSPPKDYY 84
Query: 76 TAMDTGSSLLWVHCYPCRDC--SPQLG---TIFYPSRSSSYAYVPCDSEHCRYF---PYA 127
+DTGS +LWV+C C C LG T++ P S + + CD E C P
Sbjct: 85 VQVDTGSDILWVNCVKCSRCPRKSDLGIDLTLYDPKGSETSELISCDQEFCSATYDGPIP 144
Query: 128 RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDE---STIHVQDVVFGCG------F 178
C + + C Y+ Y G T+ + + LT+ + ++ + ++FGCG
Sbjct: 145 GCKS-EIPCPYSITYGDGSATTGYYVQDYLTYNHVNDNLRTAPQNSSIIFGCGAVQSGTL 203
Query: 179 STNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAII 232
S++ GI G G SS++SQL +S FS+C+ D + I G ++
Sbjct: 204 SSSSEEALDGIIGFGQSNSSVLSQLAASGKVKKIFSHCL------DNIRGGGIFAIGEVV 257
Query: 233 D-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKE 291
+ + TPL HY + L++I VD +L + +IF + G G +IDSGT + +L
Sbjct: 258 EPKVSTTPLVPRMAHYNVVLKSIEVDTDILQLPSDIFDSGN-GKGTIIDSGTTLAYLPAI 316
Query: 292 AYEALRDEVMIRLEGEQMRSYSWPDKL-CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS 350
Y+ L +VM R +++ Y + C+ + D +GFP V+ HF L +
Sbjct: 317 VYDELIPKVMAR--QPRLKLYLVEQQFSCFQYTGNVD-RGFPVVKLHFEDSLSLTVYPHD 373
Query: 351 MFYQPRPDAFCMA 363
+Q + +C+
Sbjct: 374 YLFQFKDGIWCIG 386
>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
Length = 485
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 107/360 (29%), Positives = 147/360 (40%), Gaps = 37/360 (10%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ V++ +G P DTGS L WV C PC DC Q +F PS SS+YA V C +
Sbjct: 149 YVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPCADCYEQQDPLFDPSLSSTYAAVACGAPE 208
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C+ + CS+ RC Y Y +T + + LT +D + VFGCG
Sbjct: 209 CQELDASGCSS-DSRCRYEVQYGDQSQTDGNLVRDTLTLSASDT----LPGFVFGCGDQN 263
Query: 181 NRNF-KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
F + G+FGLG + SL SQ S F+YC+ P + L G
Sbjct: 264 AGLFGQVDGLFGLGREKVSLPSQGAPSYGPGFTYCL-----PSSSSGRGYLSLGG-APPA 317
Query: 236 DATPLQFIDGH----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKE 291
+A DG YYI L I V GR + I F +IDSGT +T L
Sbjct: 318 NAQFTALADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGG---TVIDSGTVITRLPPR 374
Query: 292 AYEALRDEVMIRLEGEQMRSYSWPDKL-----CYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
AY LR M Y L CY PTV F GGA ++L
Sbjct: 375 AYAPLRAAFA-----RSMAQYKKAPALSILDTCYD-FTGHRTAQIPTVELAFAGGATVSL 428
Query: 347 EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
+ + Y + C+A P N + +++G Q+ + V YD+ ++ F C
Sbjct: 429 DFTGVLYVSKVSQACLAFAP---NADDSSIAILGNTQQKTFAVTYDVANQRIGFGAKGCS 485
>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
Length = 420
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 105/397 (26%), Positives = 174/397 (43%), Gaps = 53/397 (13%)
Query: 32 RLAYLQEKIRS------HNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLL 85
R+A+L + + +++ Q L N + + +NIS+G P + DTGS L+
Sbjct: 53 RIAFLSDATAAGKATTTNSSVSFQALLENGVGG--YNMNISVGTPLLTFSVVADTGSDLI 110
Query: 86 WVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIG 145
W C PC C Q F P+ SS+++ +PC S C++ P + + C+Y Y G
Sbjct: 111 WTQCAPCTKCFQQPAPPFQPASSSTFSKLPCTSSFCQFLPNSIRTCNATGCVYNYKYGSG 170
Query: 146 PETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLNS 205
T+ +++TE T K D S V FGC ST G LG+GR
Sbjct: 171 -YTAGYLATE--TLKVGDAS---FPSVAFGC--STENGL---GQLDLGVGR--------- 210
Query: 206 SFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFID------GHYYITLEAISVDGR 259
FSYC+ S + ++ G A + +G+ F++ +YY+ L I+V
Sbjct: 211 -FSYCLRSGSAAG--ASPILFGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGET 267
Query: 260 MLDINPNI--FKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK 317
L + + F ++ GGG ++DSGT +T+L K+ YE ++ + + + +
Sbjct: 268 DLPVTTSTFGFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTADVTTVNGTRGLD 327
Query: 318 LCYHGIMSSDLK-GFPTVRFHFRGGAKLA-------LEKDSMFYQPRPDAFCMAVNPASI 369
LC+ P++ F GGA+ A +E DS Q C+ + PA
Sbjct: 328 LCFKSTGGGGGGIAVPSLVLRFDGGAEYAVPTYFAGVETDS---QGSVTVACLMMLPAKG 384
Query: 370 NNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
+ S+IG + Q ++ YD+ +F DC
Sbjct: 385 DQP---MSVIGNVMQMDMHLLYDLDGGIFSFAPADCA 418
>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
gi|194692214|gb|ACF80191.1| unknown [Zea mays]
gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
Length = 441
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 102/360 (28%), Positives = 155/360 (43%), Gaps = 26/360 (7%)
Query: 61 FYVNISIGQPPVPQFTAM-DTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSE 119
++V + +G P +FT + DTGS L WV C SP G +F P S S+A VPC S+
Sbjct: 91 YFVKVLVGTP-AQEFTLVADTGSELTWVKC--AGGASPP-GLVFRPEASKSWAPVPCSSD 146
Query: 120 HCRY---FPYARCSAYKHRCIYTQLYLIGPETSV-FVSTEQLTFKNTDESTIHVQDVVFG 175
C+ F A CS+ C Y Y G ++ V T+ T +QDVV G
Sbjct: 147 TCKLDVPFSLANCSSSASPCSYDYRYKEGSAGALGVVGTDSATIALPGGKVAQLQDVVLG 206
Query: 176 CGFSTN-RNFK-FSGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDG 229
C + + ++FK G+ LG + S S+ SFSYC+ P L G G
Sbjct: 207 CSSTHDGQSFKSVDGVLSLGNAKISFASRAAARFGGSFSYCLVDHLAPRNATGYLAFGPG 266
Query: 230 AIIDEGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVT 286
+ F+D Y + ++A+ V G+ LDI ++ D GGV++DSGT +T
Sbjct: 267 QVPRTPATQTKLFLDPAMPFYGVKVDAVHVAGQALDIPAEVW--DPKSGGVILDSGTTLT 324
Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMS-SDLKGFPTVRFHFRGGAKLA 345
L AY+A+ + L G + P + CY+ P + F G A+L
Sbjct: 325 VLATPAYKAVVAALTKLLAGVPKVDFP-PFEHCYNWTAPRPGAPEIPKLAVQFTGCARLE 383
Query: 346 LEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
S +P C+ + + S+IG + QQ + +D+ + F C
Sbjct: 384 PPAKSYVIDVKPGVKCIGLQ----EGEWPGVSVIGNIMQQEHLWEFDLKNMEVRFMPSTC 439
>gi|356542694|ref|XP_003539801.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 99/319 (31%), Positives = 149/319 (46%), Gaps = 32/319 (10%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTI------FYPSRSSSYAY 113
L+Y + +G PP + +DTGS +LWV C C C PQ + F P SS+ +
Sbjct: 76 LYYTKVKLGTPPREFYVQIDTGSDVLWVSCGSCNGC-PQTSGLQIQLNYFDPRSSSTSSL 134
Query: 114 VPCDSEHCR---YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
+ C CR A CS+ ++C YT Y G TS + ++ + F E T+
Sbjct: 135 ISCSDRRCRSGVQTSDASCSSQNNQCTYTFQYGDGSGTSGYYVSDLMHFAGIFEGTLTTN 194
Query: 171 ---DVVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLN------SSFSYCIGSLHD 216
VVFGC G T GIFG G S++SQL+ FS+C L
Sbjct: 195 SSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSLQGIAPRVFSHC---LKG 251
Query: 217 PDYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
+ L+LG+ I++ +PL HY + L++ISV+G+++ I P +F ++
Sbjct: 252 DNSGGGVLVLGE--IVEPNIVYSPLVQSQPHYNLNLQSISVNGQIVPIAPAVFATSNN-R 308
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
G ++DSGT + +L +EAY + + L + +RS CY SS++ FP V
Sbjct: 309 GTIVDSGTTLAYLAEEAYNPFVNAIT-ALVPQSVRSVLSRGNQCYLITTSSNVDIFPQVS 367
Query: 336 FHFRGGAKLALEKDSMFYQ 354
+F GGA L L Q
Sbjct: 368 LNFAGGASLVLRPQDYLMQ 386
>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
Length = 505
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 101/367 (27%), Positives = 152/367 (41%), Gaps = 44/367 (11%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSYAYVPCDSE 119
F V + G P ++DTGS + W+ C PC C Q +F P++S++Y+ VPC
Sbjct: 161 FVVTVGFGSPAQNYTLSIDTGSDVSWIQCLPCSGHCYKQHDPVFDPTKSATYSAVPCGHP 220
Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
C +CS C+Y Y G T+ +S E L+ +T + + FGCG +
Sbjct: 221 QCAA-AGGKCSN-SGTCLYKVTYGDGSSTAGVLSHETLSLSSTRD----LPGFAFGCGQT 274
Query: 180 TNRNFKFSGIFGLGI-GRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
F G SL SQ ++FSYC+ S D H L +G
Sbjct: 275 NLGEFGGVDGLVGLGRGALSLPSQAAATFGATFSYCLPSY---DTTHGYLTMGSTTPAAS 331
Query: 235 GDATPLQFI--------DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVT 286
D +Q+ Y++ + +I + G +L + P +F RD G + DSGT +T
Sbjct: 332 NDDDDVQYTAMIQKEDYPSLYFVEVVSIDIGGYILPVPPTVFTRD----GTLFDSGTILT 387
Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG-----FPTVRFHFRGG 341
+L EAY +LRD + + P CY D G P V F F G
Sbjct: 388 YLPPEAYASLRDRFKFTMTQYKPAPAYDPFDTCY------DFTGHNAIFMPAVAFKFSDG 441
Query: 342 AKLALEKDSMFYQP---RPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQK 398
A L ++ P P C+A P + F++IG Q+ V YD+ ++
Sbjct: 442 AVFDLSPVAILIYPDDTAPATGCLAFVP---RPSTMPFNIIGNTQQRGTEVIYDVAAEKI 498
Query: 399 TFQRMDC 405
F + C
Sbjct: 499 GFGQFTC 505
>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
gi|238011188|gb|ACR36629.1| unknown [Zea mays]
Length = 342
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 116/356 (32%), Positives = 160/356 (44%), Gaps = 44/356 (12%)
Query: 78 MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCI 137
+DTGS ++WV C PCR C Q G +F P RSSSY V C + CR C + C+
Sbjct: 3 LDTGSDVVWVQCAPCRRCYEQSGPVFDPRRSSSYGAVGCGAALCRRLDSGGCDLRRGACM 62
Query: 138 YTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGR 196
Y Y G T+ TE LTF V V GCG F +G+ GLG G
Sbjct: 63 YQVAYGDGSVTAGDFVTETLTFAG----GARVARVALGCGHDNEGLFVAAAGLLGLGRGG 118
Query: 197 SSLVSQLN----SSFSYCI------GSLHDP-DYLHNKLILGDGAI-IDEGDATPL---Q 241
S +Q++ SFSYC+ G+ P + + + G G++ TP+
Sbjct: 119 LSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFGAGSVGASSASFTPMVRNP 178
Query: 242 FIDGHYYITLEAISVDG-RMLDINPNIFKRDDSG--GGVMIDSGTDVTWLVKEAYEALRD 298
++ YY+ L ISV G R+ + + + D S GGV++DSGT VT L + +Y ALRD
Sbjct: 179 RMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLARASYSALRD 238
Query: 299 EVMIRLEGEQMRS---YSWPDKLCYHGIMSSDLKG-----FPTVRFHFRGGAKLALEKDS 350
G S +S D CY DL G PTV HF GGA+ AL ++
Sbjct: 239 AFRAAAAGGLRLSPGGFSLFDT-CY------DLGGRRVVKVPTVSMHFAGGAEAALPPEN 291
Query: 351 -MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ FC A A + S+IG + QQ + V +D ++ F C
Sbjct: 292 YLIPVDSRGTFCFAF--AGTDG---GVSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 342
>gi|357133002|ref|XP_003568117.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 497
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 102/371 (27%), Positives = 165/371 (44%), Gaps = 37/371 (9%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYVP 115
+Y I IG PP P +DTGS +LWV+C C C + G ++ P SSS + V
Sbjct: 87 YYTKIEIGTPPKPFHVQVDTGSDILWVNCVSCDKCPTKSGLGIDLALYDPKGSSSGSAVS 146
Query: 116 CDSEHCRYF-----PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFK--NTDESTIH 168
CD++ C C+A K C Y Y G T+ ++ L + + + T H
Sbjct: 147 CDNKFCAATYGSGEKLPGCTAGK-PCEYRAEYGDGSSTAGSFVSDSLQYNQLSGNAQTRH 205
Query: 169 VQ-DVVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHD 216
+ +V+FGCG + N GI G G +S +SQL S+ FS+C+
Sbjct: 206 AKANVIFGCGAQQGGDLESTNQALDGIIGFGQSNTSTLSQLASAGEVKKIFSHCL----- 260
Query: 217 PDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
D + I G ++ + +TPL HY + L++I V G L + P+IF+ +
Sbjct: 261 -DTIKGGGIFAIGEVVQPKVKSTPLLPNMSHYNVNLQSIDVAGNALQLPPHIFETSEK-R 318
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
G +IDSGT +T+L + Y+ + V + + R+ LC+ S D GFP +
Sbjct: 319 GTIIDSGTTLTYLPELVYKDILAAVFQKHQDITFRTIQ--GFLCFEYSESVD-DGFPKIT 375
Query: 336 FHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYV-NFSLIGMMAQQFYNVGYDIG 394
FHF L + F+Q + +C+ + + L+G + V YD+
Sbjct: 376 FHFEDDLGLNVYPHDYFFQNGDNLYCLGFQNGGFQPKDAKDMVLLGDLVLSNKVVVYDLE 435
Query: 395 RKQKTFQRMDC 405
++ + +C
Sbjct: 436 KQVIGWTDYNC 446
>gi|242065058|ref|XP_002453818.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
gi|241933649|gb|EES06794.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
Length = 490
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 102/376 (27%), Positives = 160/376 (42%), Gaps = 39/376 (10%)
Query: 57 SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSY 111
+ L+Y I IG P + +DTGS +LWV+C C C G T + P+ S +
Sbjct: 81 ATGLYYTQIEIGSPSKGYYVQVDTGSDILWVNCIRCDGCPTTSGLGIELTQYDPAGSGT- 139
Query: 112 AYVPCDSEHC-----RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD--- 163
V CD E C P A C + C + Y G T+ F ++ + +
Sbjct: 140 -TVGCDQEFCVANSPNGLPPA-CPSTSSPCQFRIAYGDGSSTTGFYVSDSVQYNQVSGNG 197
Query: 164 ESTIHVQDVVFGCGFSTNRNFKFS-----GIFGLGIGRSSLVSQLNSS------FSYCIG 212
++T + FGCG + S GI G G SS++SQL ++ F++C+
Sbjct: 198 QTTPSNASITFGCGAQLGGDLGSSSQALDGILGFGQADSSMLSQLAAARKVRKIFAHCL- 256
Query: 213 SLHDPDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRD 271
D +H I G ++ + TPL HY + L+ ISV G L + + F
Sbjct: 257 -----DTVHGGGIFAIGNVVQPKVKTTPLVQNVTHYNVNLQGISVGGATLQLPSSTFDSG 311
Query: 272 DSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF 331
DS G +IDSGT + +L +E Y L V + + + +Y D +C+ S D GF
Sbjct: 312 DS-KGTIIDSGTTLAYLPREVYRTLLTAVFDKYQDLALHNYQ--DFVCFQFSGSID-DGF 367
Query: 332 PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR-YVNFSLIGMMAQQFYNVG 390
P V F F G L + +Q D +CM + + + L+G + V
Sbjct: 368 PVVTFSFEGEITLNVYPHDYLFQNENDLYCMGFLDGGVQTKDGKDMVLLGDLVLSNKLVV 427
Query: 391 YDIGRKQKTFQRMDCE 406
YD+ ++ + +C
Sbjct: 428 YDLEKQVIGWADYNCS 443
>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 634
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 106/395 (26%), Positives = 173/395 (43%), Gaps = 55/395 (13%)
Query: 38 EKIRSHNTYQAQILPSNDISNSLFYVN-ISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCS 96
++ SH+T A++ +D+ +Y I IG PP +DTGS+L +V C C C
Sbjct: 68 QRSESHSTATARMPLYDDLIPYGYYTTRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCG 127
Query: 97 PQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQ 156
F P SS+Y + C E C + C+Y + Y +S + +
Sbjct: 128 KHQDPNFQPDWSSTYQPLKCSME-------CTCDSEMMHCVYDRQYAEMSSSSGVLGEDI 180
Query: 157 LTFKNTDESTIHVQDVVFGCGFSTNRNF---KFSGIFGLGIGRSSLVSQL------NSSF 207
++F +S + Q VFGC + + GI GLG G S+V QL +SF
Sbjct: 181 VSFGK--QSELKPQRTVFGCENVETGDIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSF 238
Query: 208 SYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH--------YYITLEAISVDGR 259
S C G + +G GA++ G + P + H Y I L+ I + G+
Sbjct: 239 SLCYGGMD----------VGGGAMVLGGISPPAGMVFTHSDPARSAYYNIDLKEIHIAGK 288
Query: 260 MLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK-- 317
L INP +F D G ++DSGT +L + A++A +D +M L ++ PD+
Sbjct: 289 QLPINPMVF---DGKYGTILDSGTTYAYLPEPAFKAFKDAIMKEL--NSLKLIQGPDRNY 343
Query: 318 --LCYHGI---MSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP--DAFCMAVNPASIN 370
+C+ G+ +S K FP V F G +L+L ++ +Q A+C+ +
Sbjct: 344 NDICFSGVGSDVSQLSKTFPAVDLVFSNGNRLSLSPENYLFQHSKAHGAYCLGI----FQ 399
Query: 371 NRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
N +L+G + + V YD + F + +C
Sbjct: 400 NENDQTTLLGGIIVRNTLVMYDREHLKIGFWKTNC 434
>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 490
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 110/362 (30%), Positives = 151/362 (41%), Gaps = 49/362 (13%)
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYV 114
I + ++V + +G P DTGS L W C PC R C Q IF PS+S+SY+ +
Sbjct: 141 IGSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDVIFDPSKSTSYSNI 200
Query: 115 PCDSEHCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHV 169
C S C A CSA CIY Y + + S E+LT TD V
Sbjct: 201 TCTSALCTQLSTATGNDPGCSASTKACIYGIQYGDSSFSVGYFSRERLTVTATDV----V 256
Query: 170 QDVVFGCGFSTNRNFKFS-GIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKL 224
+ +FGCG + F S G+ GLG S V Q + FSYC+ S L
Sbjct: 257 DNFLFGCGQNNQGLFGGSAGLIGLGRHPISFVQQTAAKYRKIFSYCLPSTSSST---GHL 313
Query: 225 ILGDGAIIDEGDATPLQFI---DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDS 281
G A TP I Y + + AI+V G L ++ + F S GG +IDS
Sbjct: 314 SFGPAATGRYLKYTPFSTISRGSSFYGLDITAIAVGGVKLPVSSSTF----STGGAIIDS 369
Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL-----CYHGIMSSDLKGF----- 331
GT +T L AY ALR + M Y +L CY DL G+
Sbjct: 370 GTVITRLPPTAYGALRSAFR-----QGMSKYPSAGELSILDTCY------DLSGYKVFSI 418
Query: 332 PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
PT+ F F GG + L + + C+A + N + ++ G + Q+ V Y
Sbjct: 419 PTIEFSFAGGVTVKLPPQGILFVASTKQVCLAF---AANGDDSDVTIYGNVQQRTIEVVY 475
Query: 392 DI 393
D+
Sbjct: 476 DV 477
>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
Length = 633
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 106/395 (26%), Positives = 173/395 (43%), Gaps = 55/395 (13%)
Query: 38 EKIRSHNTYQAQILPSNDISNSLFYVN-ISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCS 96
++ SH+T A++ +D+ +Y I IG PP +DTGS+L +V C C C
Sbjct: 68 QRSESHSTATARMPLYDDLIPYGYYTTRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCG 127
Query: 97 PQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQ 156
F P SS+Y + C E C + C+Y + Y +S + +
Sbjct: 128 KHQDPNFQPDWSSTYQPLKCSME-------CTCDSEMMHCVYDRQYAEMSSSSGVLGEDI 180
Query: 157 LTFKNTDESTIHVQDVVFGCGFSTNRNF---KFSGIFGLGIGRSSLVSQL------NSSF 207
++F +S + Q VFGC + + GI GLG G S+V QL +SF
Sbjct: 181 VSFGK--QSELKPQRTVFGCENVETGDIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSF 238
Query: 208 SYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH--------YYITLEAISVDGR 259
S C G + +G GA++ G + P + H Y I L+ I + G+
Sbjct: 239 SLCYGGMD----------VGGGAMVLGGISPPAGMVFTHSDPARSAYYNIDLKEIHIAGK 288
Query: 260 MLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK-- 317
L INP +F D G ++DSGT +L + A++A +D +M L ++ PD+
Sbjct: 289 QLPINPMVF---DGKYGTILDSGTTYAYLPEPAFKAFKDAIMKEL--NSLKLIQGPDRNY 343
Query: 318 --LCYHGI---MSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP--DAFCMAVNPASIN 370
+C+ G+ +S K FP V F G +L+L ++ +Q A+C+ +
Sbjct: 344 NDICFSGVGSDVSQLSKTFPAVDLVFSNGNRLSLSPENYLFQHSKAHGAYCLGI----FQ 399
Query: 371 NRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
N +L+G + + V YD + F + +C
Sbjct: 400 NENDQTTLLGGIIVRNTLVMYDREHLKIGFWKTNC 434
>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
Length = 482
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 115/374 (30%), Positives = 158/374 (42%), Gaps = 45/374 (12%)
Query: 61 FYVNISIGQPPVPQFTAM-DTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSYAYVPCDS 118
+ V I IG P FT + DTGS L WV C PC D C Q +F PS+SS+Y VPC +
Sbjct: 126 YVVTIGIGTP-ARNFTVLFDTGSDLTWVQCKPCTDSCYQQQEPLFDPSKSSTYVDVPCGT 184
Query: 119 EHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
C+ + C Y+ Y T ++ E T S VVFGC
Sbjct: 185 PQCKIGGGQDLTCGGTTCEYSVKYGDQSVTRGNLAQEAFTLS---PSAPPAAGVVFGCSH 241
Query: 179 STNRNFK-------FSGIFGLGIGRSSLVSQL---NSS--FSYCIGSLHDPDYLHNKLIL 226
+ K +G+ GLG G SS++SQ NS FSYC L L +
Sbjct: 242 EYSSGVKGAEEEMSVAGLLGLGRGDSSILSQTRRGNSGDVFSYC---LPPRGSSAGYLTI 298
Query: 227 GDGAIIDEGDA-TPL----QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDS 281
G A + TPL + Y + L ISV G L I+ + F G +IDS
Sbjct: 299 GAAAPPQSNLSFTPLVTDNSQLSSVYVVNLVGISVSGAALPIDASAFYI-----GTVIDS 353
Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL--CYHGIMSSDLKGFPTVRFHFR 339
GT +T + AY LRDE + G M + L CY + D+ P V F
Sbjct: 354 GTVITHMPAAAYYVLRDEFRRHMGGYTMLPEGHVESLDTCYD-VTGHDVVTAPPVALEFG 412
Query: 340 GGAKLALEKDSMFYQPRPDA-------FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
GGA++ ++ + DA C+A P ++ F +IG M Q+ YNV +D
Sbjct: 413 GGARIDVDASGILLVFAVDASGQSLTLACLAFVPTNLP----GFVIIGNMQQRAYNVVFD 468
Query: 393 IGRKQKTFQRMDCE 406
+ ++ F C
Sbjct: 469 VEGRRIGFGANGCS 482
>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
Length = 466
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 110/398 (27%), Positives = 162/398 (40%), Gaps = 45/398 (11%)
Query: 32 RLAYLQEKI----------------RSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQF 75
R AY+Q K +SH T + S D + + + +G P Q
Sbjct: 90 RAAYIQRKFSGGGVNGSRGGAGDVQQSHATVPTTLGTSLDTLE--YLITVRLGSPGKSQT 147
Query: 76 TAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHR 135
+DTGS + WV C PC C Q +F PS SS+Y+ C S C +
Sbjct: 148 MLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCSSAACAQLGQEGNGCSSSQ 207
Query: 136 CIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG-FSTNRNFKFSGIFGLGI 194
C YT Y G T+ S++ L + V+ FGC + N + G+ GLG
Sbjct: 208 CQYTVTYGDGSSTTGTYSSDTLALGSN-----AVRKFQFGCSNVESGFNDQTDGLMGLGG 262
Query: 195 GRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPL---QFIDGHY 247
G SLVSQ ++FSYC+ P + L GA TP+ + Y
Sbjct: 263 GAQSLVSQTAGTFGAAFSYCL-----PATSSSSGFLTLGAGTSGFVKTPMLRSSQVPTFY 317
Query: 248 YITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGE 307
+ ++AI V GR L I ++F G ++DSGT +T L AY AL ++
Sbjct: 318 GVRIQAIRVGGRQLSIPTSVFS-----AGTIMDSGTVLTRLPPTAYSALSSAFKAGMKQY 372
Query: 308 QMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPA 367
S C+ S + PTV F GGA + + D + Q C+A
Sbjct: 373 PSAPPSGILDTCFDFSGQSSVS-IPTVALVFSGGAVVDIASDGIMLQTSNSILCLAF--- 428
Query: 368 SINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ N+ + +IG + Q+ + V YD+G F+ C
Sbjct: 429 AANSDDSSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 466
>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 114/374 (30%), Positives = 171/374 (45%), Gaps = 25/374 (6%)
Query: 42 SHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC---RDCSPQ 98
S N+ A + ++ I +GQP F DTGS + W+ C PC C Q
Sbjct: 165 STNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQ 224
Query: 99 LGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLT 158
+G IF P SSSY+ + CDSE C A C A + CIY Y G T ++TE +
Sbjct: 225 IGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDA--NSCIYEVEYGDGSFTVGELATETFS 282
Query: 159 FKNTDESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHD 216
F++++ + ++ GCG F G+ GLG G SL SQL +SFSYC L D
Sbjct: 283 FRHSNS----IPNLPIGCGHDNEGLFVGADGLIGLGGGAISLSSQLEATSFSYC---LVD 335
Query: 217 PDYLHNKLILGDGAIIDEGDATPLQFID---GHYYITLEAISVDGRMLDINPNIFKRDDS 273
D + + + + +PL D Y+ + +SV G+ L I+ + F+ D+S
Sbjct: 336 LDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDES 395
Query: 274 G-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFP 332
G GG+++DSGT +T + + Y+ LRD + + P CY S+++ P
Sbjct: 396 GSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVE-VP 454
Query: 333 TVRFHFRGGAKLAL-EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
T+ F G L L K+ + FC+A P++ S+IG + QQ V Y
Sbjct: 455 TIAFILPGENSLQLPAKNCLIQVDSAGTFCLAFLPSTF-----PLSIIGNVQQQGIRVSY 509
Query: 392 DIGRKQKTFQRMDC 405
D+ F C
Sbjct: 510 DLANSLVGFSTDKC 523
>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 506
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 106/370 (28%), Positives = 162/370 (43%), Gaps = 30/370 (8%)
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
+ + + V++ +G PP MDTGS L W+ C PC DC Q G IF P+ S SY V
Sbjct: 144 VGSGEYLVDVYLGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQSGPIFDPAASISYRNVT 203
Query: 116 CDSEHCRYF-PYARCSAYKHR------CIYTQLYLIGPETSVFVSTEQLTFKNTDESTIH 168
C + CR P A + + R C Y Y T+ ++ E T T T
Sbjct: 204 CGDDRCRLVSPPAESAPRECRRPRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTQSGTRR 263
Query: 169 VQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS-----SFSYCIGSLHDPDYLHN 222
V V FGCG F +G+ GLG G S SQL +FSYC+ + +
Sbjct: 264 VDGVAFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRGVYGGHAFSYCL--VEHGSAAGS 321
Query: 223 KLILG-DGAIIDEGDA-----TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
K+I G D A++ P D YY+ L++I V G ++I+ + S GG
Sbjct: 322 KIIFGHDDALLAHPQLNYTAFAPTTDADTFYYLQLKSILVGGEAVNISSDTL----SAGG 377
Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRF 336
+IDSGT +++ + AY+A+R + R+ +P + + ++ P +
Sbjct: 378 TIIDSGTTLSYFPEPAYQAIRQAFIDRMSPSYPLILGFPVLSPCYNVSGAEKVEVPELSL 437
Query: 337 HFRGGAKLALEKDSMFYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGR 395
F GA ++ F + P+ C+AV + S+IG QQ ++V YD+
Sbjct: 438 VFADGAAWEFPAENYFIRLEPEGIMCLAV----LGTPRSGMSIIGNYQQQNFHVLYDLEH 493
Query: 396 KQKTFQRMDC 405
+ F C
Sbjct: 494 NRLGFAPRRC 503
>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 101/355 (28%), Positives = 158/355 (44%), Gaps = 28/355 (7%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ V +IG P A+DT + W+ C C CS + +F PS+SSS + C++
Sbjct: 88 YIVRANIGTPAQAMLVALDTSNDAAWIPCSGCVGCSSSV--LFDPSKSSSSRTLQCEAPQ 145
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC-GFS 179
C+ P C+ K C + Y G +++ + LT +T + + FGC +
Sbjct: 146 CKQAPNPSCTVSKS-CGFNMTYG-GSAIEAYLTQDTLTL-----ATDVIPNYTFGCINKA 198
Query: 180 TNRNFKFSGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
+ + G+ GLG G SL+SQ S+FSYC+ + ++ L LG
Sbjct: 199 SGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNF-SGSLRLGPKNQPIRI 257
Query: 236 DATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRD-DSGGGVMIDSGTDVTWLVKE 291
TPL YY+ L I V +++DI + D +G G + DSGT T LV+
Sbjct: 258 KTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGTVYTRLVEP 317
Query: 292 AYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSM 351
AY A+R+E R++ S D CY G + FP+V F F G + L D++
Sbjct: 318 AYVAMRNEFRRRVKNANATSLGGFDT-CYSGSVV-----FPSVTFMF-AGMNVTLPPDNL 370
Query: 352 F-YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ + C+A+ A N V ++I M QQ + V D+ + R C
Sbjct: 371 LIHSSAGNLSCLAMAAAPTNVNSV-LNVIASMQQQNHRVLIDVPNSRLGISRETC 424
>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 509
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 94/322 (29%), Positives = 150/322 (46%), Gaps = 36/322 (11%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
L++ + +G P F +DTGS +LWV C PC C G F P SS+ + +
Sbjct: 90 LYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRI 149
Query: 115 PCDSEHCRY---FPYARCSAYKHR---CIYTQLYLIGPETSVFVSTEQLTFKNT---DES 165
C + C A C + C YT Y G TS + ++ + F+ +++
Sbjct: 150 TCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQT 209
Query: 166 TIHVQDVVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNS------SFSYCIGSL 214
+VFGC S T + GIFG G + S++SQLNS FS+C L
Sbjct: 210 ANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHC---L 266
Query: 215 HDPDYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDS 273
D L+LG+ I++ G TPL HY + LE+I+V+G+ L I+ ++F ++
Sbjct: 267 KGSDNGGGILVLGE--IVEPGLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNT 324
Query: 274 GGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDL-KGFP 332
G ++DSGT + +L AY+ + + +RS C+ I SS + FP
Sbjct: 325 -QGTIVDSGTTLAYLADGAYDPFVSAIAAAVS-PSVRSLVSKGSQCF--ITSSSVDSSFP 380
Query: 333 TVRFHFRGGAKLALEKDSMFYQ 354
TV +F GG ++++ ++ Q
Sbjct: 381 TVTLYFMGGVAMSVKPENYLLQ 402
>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
Length = 493
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 118/394 (29%), Positives = 166/394 (42%), Gaps = 41/394 (10%)
Query: 40 IRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQL 99
+ S + A ++ ++ + I++G P V AMDTGS + W+ C PCR C PQ
Sbjct: 113 LSSGGAFVAPVVSRAPTTSGEYMAKIAVGTPAVEALLAMDTGSDITWLQCQPCRRCYPQS 172
Query: 100 GTIFYPSRSSSYAYVPCDSEHCRYFPYARCS-AYKHRCIYTQLYLIGPETSV--FVSTEQ 156
G +F P S+SY + D+ C+ + A + C+Y Y T+V F+ E
Sbjct: 173 GPVFDPRHSTSYREMGYDAPDCQALGRSGGGDAKRMTCVYAVGYGDDGSTTVGDFIE-ET 231
Query: 157 LTFKNTDESTIHVQDVVFGCGFSTNRNFK--FSGIFGLGIGRSSLVSQLN------SSFS 208
LTF + V + GCG F +GI GLG G+ S SQ+ +SFS
Sbjct: 232 LTFAG----GVQVPHMSIGCGHDNKGLFAAPAAGILGLGRGQISCPSQIAALGYNVTSFS 287
Query: 209 YCIGS--LHDPDY-LHNKLILGDGAIIDEGDATP--------LQFIDGHYYITLEAISVD 257
YC+ L P + + L +GDGA G P L +Y +
Sbjct: 288 YCLADFFLSSPGRSVSSTLTIGDGAA--AGSPPPSFTPTVQNLNMATFYYVRLVGVSVGG 345
Query: 258 GRMLDINPNIFKRD--DSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWP 315
R+ + + K D GGV++DSGT VT L + AY A RD S P
Sbjct: 346 VRVPGVTEDDLKLDPYTGRGGVILDSGTAVTRLARRAYIAFRDAFRAAAVDLGQVSIGGP 405
Query: 316 DKL---CYHGIMSSDLKGFPTVRFHFRGGAKLAL-EKDSMFYQPRPDAFCMAVNPASINN 371
CY M PTV HF GG +L L K+ + C A A +
Sbjct: 406 SGFFDTCY--TMGGRAMKVPTVSMHFAGGVELTLPPKNYLIPVDSMGTVCFAF--AGTGD 461
Query: 372 RYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
R V S+IG + QQ + V Y+IG + F C
Sbjct: 462 RSV--SIIGNIQQQGFRVVYNIGGGRVGFAPNSC 493
>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 507
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 94/322 (29%), Positives = 150/322 (46%), Gaps = 36/322 (11%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
L++ + +G P F +DTGS +LWV C PC C G F P SS+ + +
Sbjct: 88 LYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRI 147
Query: 115 PCDSEHCRY---FPYARCSAYKHR---CIYTQLYLIGPETSVFVSTEQLTFKNT---DES 165
C + C A C + C YT Y G TS + ++ + F+ +++
Sbjct: 148 TCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQT 207
Query: 166 TIHVQDVVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNS------SFSYCIGSL 214
+VFGC S T + GIFG G + S++SQLNS FS+C L
Sbjct: 208 ANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHC---L 264
Query: 215 HDPDYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDS 273
D L+LG+ I++ G TPL HY + LE+I+V+G+ L I+ ++F ++
Sbjct: 265 KGSDNGGGILVLGE--IVEPGLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNT 322
Query: 274 GGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDL-KGFP 332
G ++DSGT + +L AY+ + + +RS C+ I SS + FP
Sbjct: 323 -QGTIVDSGTTLAYLADGAYDPFVSAIAAAVS-PSVRSLVSKGSQCF--ITSSSVDSSFP 378
Query: 333 TVRFHFRGGAKLALEKDSMFYQ 354
TV +F GG ++++ ++ Q
Sbjct: 379 TVTLYFMGGVAMSVKPENYLLQ 400
>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
Length = 423
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 94/322 (29%), Positives = 150/322 (46%), Gaps = 36/322 (11%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
L++ + +G P F +DTGS +LWV C PC C G F P SS+ + +
Sbjct: 4 LYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRI 63
Query: 115 PCDSEHCRY---FPYARCSAYKHR---CIYTQLYLIGPETSVFVSTEQLTFKNT---DES 165
C + C A C + C YT Y G TS + ++ + F+ +++
Sbjct: 64 TCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQT 123
Query: 166 TIHVQDVVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNS------SFSYCIGSL 214
+VFGC S T + GIFG G + S++SQLNS FS+C L
Sbjct: 124 ANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHC---L 180
Query: 215 HDPDYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDS 273
D L+LG+ I++ G TPL HY + LE+I+V+G+ L I+ ++F ++
Sbjct: 181 KGSDNGGGILVLGE--IVEPGLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNT 238
Query: 274 GGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDL-KGFP 332
G ++DSGT + +L AY+ + + +RS C+ I SS + FP
Sbjct: 239 -QGTIVDSGTTLAYLADGAYDPFVSAIAAAVS-PSVRSLVSKGSQCF--ITSSSVDSSFP 294
Query: 333 TVRFHFRGGAKLALEKDSMFYQ 354
TV +F GG ++++ ++ Q
Sbjct: 295 TVTLYFMGGVAMSVKPENYLLQ 316
>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 482
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 113/416 (27%), Positives = 171/416 (41%), Gaps = 46/416 (11%)
Query: 1 LIHRDSI------LSPYNNPNENPAHRVQRGIN-ISIARLAYLQEKIRSHNTYQAQILPS 53
L+HRD + +N+ + A RV + +S A +++ + ++
Sbjct: 76 LLHRDKLSHVHGHRRGFNDRMKRDAIRVATLVRRLSHGAPAAVKDSRYKVANFATDVISG 135
Query: 54 NDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAY 113
+ + ++V I +G PP Q+ +D+GS ++WV C PC C Q +F P+ SSS+A
Sbjct: 136 MEAGSGEYFVRIGVGSPPRNQYMVIDSGSDIVWVQCKPCSRCYQQSDPVFDPADSSSFAG 195
Query: 114 VPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
V C S+ C C+A RC Y Y G T ++ E LT + ++DV
Sbjct: 196 VSCGSDVCDRLENTGCNA--GRCRYEVSYGDGSYTKGTLALETLTVGQ-----VMIRDVA 248
Query: 174 FGCGFSTNRNFKFSGIFGLGIGRS-----SLVSQLNSSFSYCIGSLHDPDYLHNKLILGD 228
GCG + F + G S L Q +FSYC+ S L G
Sbjct: 249 IGCGHTNQGMFIGAAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGS--TGALEFGR 306
Query: 229 GAIIDEGDATPLQFID-----GHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSG 282
GA+ AT + I YYI L I V G + + F+ + G GV++D+G
Sbjct: 307 GAL--PVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEYGTNGVVMDTG 364
Query: 283 TDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----PTVRFH 337
T VT AY A RD + CY DL GF PTV F+
Sbjct: 365 TAVTRFPTAAYVAFRDSFTAQTSNLPRAPGVSIFDTCY------DLNGFESVRVPTVSFY 418
Query: 338 FRGGAKLAL-EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
F G L L ++ + FC+A P+ S+IG + Q+ + +D
Sbjct: 419 FSDGPVLTLPARNFLIPVDGGGTFCLAFAPSP-----SGLSIIGNIQQEGIQISFD 469
>gi|359482097|ref|XP_002271077.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 109/386 (28%), Positives = 175/386 (45%), Gaps = 54/386 (13%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
N V++++G PP +DTGS L W+ C + + T F P+RSSSY+ VPC
Sbjct: 82 NVSLTVSLTVGTPPQNVSMVLDTGSELSWLRC----NKTQTFQTTFDPNRSSSYSPVPCS 137
Query: 118 SEHC----RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
S C R FP C Y + ++++ N+D + +
Sbjct: 138 SLTCTDRTRDFPIPASCDSNQLCHAILSYADASSSEGNLASDTFYIGNSD-----MPGTI 192
Query: 174 FGC---GFSTN--RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILG 227
FGC FSTN + K +G+ G+ G S VSQ++ FSYCI D D+ L+LG
Sbjct: 193 FGCMDSSFSTNTEEDSKNTGLMGMNRGSLSFVSQMDFPKFSYCIS---DSDF-SGVLLLG 248
Query: 228 DG-----------AIIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGG 275
D +I +TPL + D Y + LE I V ++L + ++F D +G
Sbjct: 249 DANFSWLMPLNYTPLIQI--STPLPYFDRVAYTVQLEGIKVSSKLLPLPKSVFVPDHTGA 306
Query: 276 G-VMIDSGTDVTWLVKEAYEALRDEVM------IRLEGEQMRSYSWPDKLCYHGIMS-SD 327
G M+DSGT T+L+ Y ALR+E + +R+ + + LCY +S +
Sbjct: 307 GQTMVDSGTQFTFLLGPVYSALRNEFLNQTSQILRVLEDPNYVFQGGMDLCYRVPLSQTS 366
Query: 328 LKGFPTVRFHFRGGAKLALEKDSMFYQPRPD------AFCMAVNPASINNRYVNFSLIGM 381
L PTV FR GA++ + D + Y+ + +C + + V +IG
Sbjct: 367 LPWLPTVSLMFR-GAEMKVSGDRLLYRVPGEVRGSDSVYCFTFGNSDL--LAVEAYVIGH 423
Query: 382 MAQQFYNVGYDIGRKQKTFQRMDCEV 407
QQ + +D+ + + F ++ C++
Sbjct: 424 HHQQNVWMEFDLEKSRIGFAQVQCDL 449
>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 457
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 95/371 (25%), Positives = 159/371 (42%), Gaps = 39/371 (10%)
Query: 63 VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR 122
V++ IG PP Q +DTGS L W+ C+ P F PS SS+++ +PC C+
Sbjct: 99 VDLPIGTPPQVQPMVLDTGSQLSWIQCHKKAPAKPPPTASFDPSLSSTFSTLPCTHPVCK 158
Query: 123 Y----FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
F C Y+ Y G + E+ TF + + ++ GC
Sbjct: 159 PRIPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTFSRS----LFTPPLILGCAT 214
Query: 179 STNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGS-LHDPDYLHNKLILGDGAIIDEGD 236
+ GI G+ GR S SQ + FSYC+ + + P Y G + +
Sbjct: 215 ESTDP---RGILGMNRGRLSFASQSKITKFSYCVPTRVTRPGYTPT----GSFYLGHNPN 267
Query: 237 ATPLQFIDG---------------HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMID 280
+ ++I+ Y + L+ I + GR L+I+P +F+ D G G M+D
Sbjct: 268 SNTFRYIEMLTFARSQRMPNLDPLAYTVALQGIRIGGRKLNISPAVFRADAGGSGQTMLD 327
Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRS---YSWPDKLCYHGIMSSDLKGFPTVRFH 337
SG++ T+LV EAY+ +R EV +R G +M+ Y +C+ G + + F
Sbjct: 328 SGSEFTYLVNEAYDKVRAEV-VRAVGPRMKKGYVYGGVADMCFDGNAIEIGRLIGDMVFE 386
Query: 338 FRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
F G ++ + K+ + C+ + A+ + ++IG QQ V +D+ ++
Sbjct: 387 FEKGVQIVVPKERVLATVEGGVHCIGI--ANSDKLGAASNIIGNFHQQNLWVEFDLVNRR 444
Query: 398 KTFQRMDCEVL 408
F DC L
Sbjct: 445 MGFGTADCSRL 455
>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 518
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 100/357 (28%), Positives = 151/357 (42%), Gaps = 29/357 (8%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYVPCDSE 119
+ V + +G P DTGS WV C PC C Q +F P+RSS+YA V C +
Sbjct: 179 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPARSSTYANVSCAAP 238
Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
C CS C+Y Y G + F + + LT + D V+ FGCG
Sbjct: 239 ACFDLDTRGCSG--GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDA----VKGFRFGCGER 292
Query: 180 TNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
F + +G+ GLG G++SL Q F++C+ + L G G+
Sbjct: 293 NEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSSGT---GYLDFGPGSPAAA 349
Query: 235 GD--ATPLQFIDG--HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVK 290
G TP+ +G YY+ + I V G++L I ++F G ++DSGT +T L
Sbjct: 350 GARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFAT----AGTIVDSGTVITRLPP 405
Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKL--CYHGIMSSDLKGFPTVRFHFRGGAKLALEK 348
AY +LR + + + L CY S + PTV F+GGA L ++
Sbjct: 406 PAYSSLRSAFVSAMAARGYKKAPAVSLLDTCYDFTGMSQVA-IPTVSLLFQGGAILDVDA 464
Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ Y C+ + N + ++G + + V YDIG+K F C
Sbjct: 465 SGIMYAASVSQVCLGF---AANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 518
>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
melo]
Length = 412
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 106/391 (27%), Positives = 169/391 (43%), Gaps = 50/391 (12%)
Query: 52 PSNDIS---NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRS 108
PSN +S N V++++G PP +DTGS L W+HC SP L ++F P S
Sbjct: 28 PSNKLSFHHNVTLTVSLTVGSPPQQVTMVLDTGSELSWLHCKK----SPNLTSVFNPLSS 83
Query: 109 SSYAYVPCDSEHCRY----FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDE 164
SSY+ +PC S CR P K C Y ++++ ++
Sbjct: 84 SSYSPIPCSSPVCRTRTRDLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRIGSS-- 141
Query: 165 STIHVQDVVFGC---GFSTN--RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPD 218
+ +FGC GFS+N + K +G+ G+ G S V+QL FSYCI
Sbjct: 142 ---ALPGTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSYCISGRDS-- 196
Query: 219 YLHNKLILGDGAIIDEGD---------ATPLQFIDG-HYYITLEAISVDGRMLDINPNIF 268
L+ GD + G+ +TPL + D Y + L+ I V ++L + +IF
Sbjct: 197 --SGVLLFGDSHLSWLGNLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIF 254
Query: 269 KRDDSGGG-VMIDSGTDVTWLVKEAYEALRDEVMIRLE------GEQMRSYSWPDKLCYH 321
D +G G M+DSGT T+L+ Y ALR+E + + + G+ + LCY
Sbjct: 255 APDHTGAGQTMVDSGTQFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYR 314
Query: 322 GIMSSDLKGFPTVRFHFRG-----GAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNF 376
L P V FRG G ++ L K + + +C+ + + +
Sbjct: 315 VPAGGKLPELPAVSLMFRGAEMVVGGEVLLYKVPGMMKGKEWVYCLTFGNSDLLG--IEA 372
Query: 377 SLIGMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
+IG QQ + +D+ + + F C++
Sbjct: 373 FVIGHHHQQNVWMEFDLVKSRVGFVETRCDL 403
>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
Length = 454
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 93/323 (28%), Positives = 150/323 (46%), Gaps = 32/323 (9%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
L+Y I +G PP P + +DTGS +LWV+C PC C G F P SS+ + +
Sbjct: 40 LYYTRIELGTPPRPFYVQIDTGSDILWVNCKPCNACPLTSGLGVALNFFDPRGSSTASPL 99
Query: 115 PCDSEHC---RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFK---NTDESTIH 168
C C + C+ ++ C Y+ Y G T + +++ + N +
Sbjct: 100 SCIDSKCVSSNQISESVCTTDRY-CGYSFEYGDGSGTLGYYVSDEFDYNQYVNQYVTNNA 158
Query: 169 VQDVVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDP 217
+ FGC ++ T + GIFG G S+VSQLNS FS+C L
Sbjct: 159 SAKITFGCSYNQSGDLTKPDRAVDGIFGFGQNDLSVVSQLNSQGLAPKIFSHC---LEGA 215
Query: 218 DYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
D L+LG+ I + G TP+ HY + L+ I+V+G+ L I+P +F ++ G
Sbjct: 216 DPGGGILVLGE--ITEPGMVYTPIVPSQPHYNLNLQGIAVNGQQLSIDPQVFATTNT-RG 272
Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRF 336
+ID GT + +L +EAYE + ++ + + + + C+ + S D + FP+V
Sbjct: 273 TIIDCGTTLAYLAEEAYEPFVNTIIAAVS-QSTQPFMLKGNPCFLTVHSID-EIFPSVTL 330
Query: 337 HFRGGAKLALEKDSMFYQPRPDA 359
+F G KD + Q PD+
Sbjct: 331 YFEGAPMDLKPKDYLIQQLSPDS 353
>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
Length = 430
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 110/380 (28%), Positives = 164/380 (43%), Gaps = 41/380 (10%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCY---------PCRDCSPQLGTIFYPSRSSSY 111
+ V+++ G PP DTGS L+W+ C P + CS + F S+S++
Sbjct: 54 YLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRR--PAFVASKSATL 111
Query: 112 AYVPCDSEHCRYFPYAR-----CS-AYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES 165
+ VPC + C P R CS A C Y Y G T+ F++ + T N
Sbjct: 112 SVVPCSAAQCLLVPAPRGHGPSCSPAAPVPCGYAYDYADGSSTTGFLARDTATISNGTSG 171
Query: 166 TIHVQDVVFGCGFSTNRNFKFS---GIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPD 218
V+ V FGCG + N+ FS G+ GLG G+ S +Q S +FSYC+ L
Sbjct: 172 GAAVRGVAFGCG-TRNQGGSFSGTGGVIGLGQGQLSFPAQSGSLFAQTFSYCLLDLEGGR 230
Query: 219 YLHNK--LILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDS 273
+ L LG TPL YY+ + AI V R+L + + + D
Sbjct: 231 RGRSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWAIDVL 290
Query: 274 G-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS---YSWPDKLCYHGIMSSDLK 329
G GG +IDSG+ +T+L AY L + ++ S + +LCY+ SS L
Sbjct: 291 GNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQGLELCYNVSSSSSLA 350
Query: 330 ----GFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQ 385
GFP + F G L L + D C+A+ P F+++G + QQ
Sbjct: 351 PANGGFPRLTIDFAQGLSLELPTGNYLVDVADDVKCLAIRP---TLSPFAFNVLGNLMQQ 407
Query: 386 FYNVGYDIGRKQKTFQRMDC 405
Y+V +D + F R +C
Sbjct: 408 GYHVEFDRASARIGFARTEC 427
>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
Length = 659
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 103/379 (27%), Positives = 165/379 (43%), Gaps = 55/379 (14%)
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
+SN + + IG PP +DTGS++ +V C C C F P SS+Y V
Sbjct: 83 LSNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSDCEHCGKHQDPRFQPDESSTYHPVK 142
Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
C+ + C C+Y + Y +S + + ++F N +S + Q VFG
Sbjct: 143 CNMD-------CNCDHDGVNCVYERRYAEMSSSSGVLGEDIISFGN--QSEVVPQRAVFG 193
Query: 176 CGFSTNRNF---KFSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLIL 226
C + + GI GLG G+ S+V QL N SFS C G +H +
Sbjct: 194 CENVETGDLYSQRADGIMGLGRGQLSIVDQLVDKNVINDSFSLCYGGMH----------V 243
Query: 227 GDGAIIDEGDATPLQFI--------DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVM 278
G GA++ G P + +Y I L+ I V G+ L ++P+ F R G +
Sbjct: 244 GGGAMVLGGIPPPPDMVFSRSDPYRSPYYNIELKEIHVAGKPLKLSPSTFDRKH---GTV 300
Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK----LCYHGI---MSSDLKGF 331
+DSGT +L +EA+ A RD ++ + ++ PD +C+ G +S K F
Sbjct: 301 LDSGTTYAYLPEEAFVAFRDAIIKK--SHNLKQIHGPDPNYNDICFSGAGRDVSQLSKAF 358
Query: 332 PTVRFHFRGGAKLALEKDSMFYQPRP--DAFCMAVNPASINNRYVNFSLIGMMAQQFYNV 389
P V F G KL+L ++ +Q A+C+ I + +L+G + + V
Sbjct: 359 PEVDMVFSNGQKLSLTPENYLFQHTKVHGAYCLG-----IFRNGDSTTLLGGIIVRNTLV 413
Query: 390 GYDIGRKQKTFQRMDCEVL 408
YD ++ F + +C L
Sbjct: 414 TYDRENEKIGFWKTNCSEL 432
>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
Length = 426
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 102/339 (30%), Positives = 149/339 (43%), Gaps = 33/339 (9%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
L+Y + +G PP + +DTGS +LWV C C C G F P S + + +
Sbjct: 80 LYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPI 139
Query: 115 PCDSEHCRYFPYAR---CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI---H 168
C + C + + CS + C YT Y G TS F ++ L F S++
Sbjct: 140 SCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNS 199
Query: 169 VQDVVFGCGFSTNRNFKFS-----GIFGLGIGRSSLVSQLNSS------FSYCIGSLHDP 217
VVFGC S + S GIFG G S++SQL S FS+C L
Sbjct: 200 TAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHC---LKGE 256
Query: 218 DYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
+ L+LG+ I++ TPL HY + L +ISV+G+ L INP++F + G G
Sbjct: 257 NGGGGILVLGE--IVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSN-GQG 313
Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRF 336
+ID+GT + +L + AY E + + +R CY I +S FP V
Sbjct: 314 TIIDTGTTLAYLSEAAYVPFV-EAITNAVSQSVRPVVSKGNQCYV-ITTSVGDIFPPVSL 371
Query: 337 HFRGGAKLALEKDSMFYQPR--PDAFCMAVNPASINNRY 373
+F GGA + L Q A C S+ +R+
Sbjct: 372 NFAGGASMFLNPQDYLIQQNNVASALCFLGRYCSVVHRF 410
>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 488
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 104/379 (27%), Positives = 165/379 (43%), Gaps = 36/379 (9%)
Query: 51 LPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLGT---IFYP 105
LP++ L+Y I IG PP +DTGS +LWV+C C C LG ++ P
Sbjct: 76 LPTD---TGLYYTEIEIGTPPKQYHVQVDTGSDILWVNCISCNKCPRKSDLGIDLRLYDP 132
Query: 106 SRSSSYAYVPCDSEHC--RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT- 162
SSS + V CD + C Y A C Y+ +Y G T+ + ++ L +
Sbjct: 133 KGSSSGSTVSCDQKFCAATYGGKLPGCAKNIPCEYSVMYGDGSSTTGYFVSDSLQYNQVS 192
Query: 163 -DESTIHVQ-DVVFGCGFST-----NRNFKFSGIFGLGIGRSSLVSQLNSS------FSY 209
D T H V+FGCG + N GI G G +S++SQL ++ FS+
Sbjct: 193 GDGQTRHANASVIFGCGAQQGGDLGSTNQALDGIIGFGQSNTSMLSQLAAAGEVKKIFSH 252
Query: 210 CIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIF 268
C+ D + I G ++ + +TPL HY + LE+I+V G L + ++F
Sbjct: 253 CL------DTIKGGGIFAIGDVVQPKVKSTPLVPDMPHYNVNLESINVGGTTLQLPSHMF 306
Query: 269 KRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDL 328
+ + G +IDSGT +T+L + Y+ + V + S D LC S D
Sbjct: 307 ETGEK-KGTIIDSGTTLTYLPELVYKDVLAAVFAKHPDTTFHSVQ--DFLCIQYFQSVD- 362
Query: 329 KGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR-YVNFSLIGMMAQQFY 387
GFP + FHF L + F+Q + +C + ++ + L+G +
Sbjct: 363 DGFPKITFHFEDDLGLNVYPHDYFFQNGDNLYCFGFQNGGLQSKDGKDMVLLGDLVLSNK 422
Query: 388 NVGYDIGRKQKTFQRMDCE 406
V YD+ + + +C
Sbjct: 423 VVVYDLENQVVGWTDYNCS 441
>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 493
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 100/320 (31%), Positives = 144/320 (45%), Gaps = 38/320 (11%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
L+Y + +G PP + +DTGS +LWV C C C G F P S + + +
Sbjct: 80 LYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPI 139
Query: 115 PCDSEHCRYFPYAR---CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI---H 168
C + C + + CS + C YT Y G TS F ++ L F S++
Sbjct: 140 SCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNS 199
Query: 169 VQDVVFGCGFSTNRNFKFS-----GIFGLGIGRSSLVSQLNSS------FSYCIGSLHDP 217
VVFGC S + S GIFG G S++SQL S FS+C L
Sbjct: 200 TAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHC---LKGE 256
Query: 218 DYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
+ L+LG+ I++ TPL HY + L +ISV+G+ L INP++F + G G
Sbjct: 257 NGGGGILVLGE--IVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSN-GQG 313
Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRF 336
+ID+GT + +L + AY E + + +R CY I +S FP V
Sbjct: 314 TIIDTGTTLAYLSEAAYVPFV-EAITNAVSQSVRPVVSKGNQCYV-ITTSVGDIFPPVSL 371
Query: 337 HFRGGAKLALEKDSMFYQPR 356
+F GGA SMF P+
Sbjct: 372 NFAGGA-------SMFLNPQ 384
>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
Length = 626
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 110/415 (26%), Positives = 174/415 (41%), Gaps = 63/415 (15%)
Query: 27 NISIARLAYLQEKIRSHNTYQAQILPS-------NDISNSLFYVNISIGQPPVPQFTAMD 79
NIS R+ + R H Q LP+ + +SN + + IG PP +D
Sbjct: 38 NISAHRMPFDGHYSRRH--LQNSELPNARMRLFDDLLSNGYYTTRLFIGTPPQEFALIVD 95
Query: 80 TGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYT 139
TGS++ +V C C C F P SS+Y V C+ P C +C Y
Sbjct: 96 TGSTVTYVPCSSCEQCGKHQDPRFQPDLSSTYRPVKCN-------PSCNCDDEGKQCTYE 148
Query: 140 QLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF---KFSGIFGLGIGR 196
+ Y +S ++ + ++F N ES + Q VFGC + + GI GLG GR
Sbjct: 149 RRYAEMSSSSGVIAEDVVSFGN--ESELKPQRAVFGCENVETGDLYSQRADGIMGLGRGR 206
Query: 197 SSLVSQL------NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH---- 246
S+V QL SFS C G + +G GA++ + P + H
Sbjct: 207 LSVVDQLVDKGVIGDSFSLCYGGMD----------VGGGAMVLGQISPPPNMVFSHSNPY 256
Query: 247 ----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMI 302
Y I L+ + V G+ L + P +F D G ++DSGT + + A+ AL+D +M
Sbjct: 257 RSPYYNIELKELHVAGKPLKLKPKVF---DEKHGTVLDSGTTYAYFPEAAFHALKDAIMK 313
Query: 303 RLEGEQMRSYSWPDK----LCYHGI---MSSDLKGFPTVRFHFRGGAKLALEKDSMFYQ- 354
+ ++ PD +C+ G +S K FP V F G KL+L ++ ++
Sbjct: 314 EI--RHLKQIPGPDPNYHDICFSGAGREVSHLSKVFPEVNMVFGSGQKLSLSPENYLFRH 371
Query: 355 -PRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
A+C+ + N +L+G + + V YD + F + +C L
Sbjct: 372 TKVSGAYCLGI----FQNGNDLTTLLGGIVVRNTLVTYDRENDKIGFWKTNCSEL 422
>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
Length = 521
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 104/359 (28%), Positives = 154/359 (42%), Gaps = 33/359 (9%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYVPCDSE 119
+ V I +G P DTGS WV C PC C Q +F P+RSS+YA V C +
Sbjct: 182 YVVTIGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYKQQEKLFDPARSSTYANVSCAAP 241
Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
C Y R + H C+Y+ Y G + F + + LT + D V+ FGCG
Sbjct: 242 ACSDL-YTRGCSGGH-CLYSVQYGDGSYSIGFFAMDTLTLSSYDA----VKGFRFGCGER 295
Query: 180 TNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHD-PDYLHNKLILGDGAIID 233
F + +G+ GLG G++SL Q F++C+ + YL G A +
Sbjct: 296 NEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSSGTGYL--DFGPGSPAAVG 353
Query: 234 EGDATPLQFIDG--HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKE 291
TP+ +G YY+ + I V G++L I ++F S G ++DSGT +T L
Sbjct: 354 ARQTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVF----STAGTIVDSGTVITRLPPA 409
Query: 292 AYEALRDEVMIRLEGEQMRSYSWPDKL-----CYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
AY +LR + R Y L CY S++ P V F+GGA L +
Sbjct: 410 AYSSLRSAFASAMA---ARGYKKAPALSLLDTCYDFTGMSEVA-IPKVSLLFQGGAYLDV 465
Query: 347 EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ Y C+ + N + ++G + + V YDIG+K F C
Sbjct: 466 NASGIMYAASLSQVCLGF---AANEDDDDVGIVGNTQLKTFGVVYDIGKKTVGFSPGAC 521
>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 354
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 102/372 (27%), Positives = 165/372 (44%), Gaps = 38/372 (10%)
Query: 52 PSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSS 110
P I + +YV + +G P +DTGSSL W+ C PC C Q +F PS S +
Sbjct: 4 PGASIGSGNYYVKVGLGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKT 63
Query: 111 YAYVPCDSEHCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES 165
Y + C S C A C + C+YT Y + ++S + LT +
Sbjct: 64 YKSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQT- 122
Query: 166 TIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYL 220
+ V+GCG + F + +GI GLG + S++ Q++S +FSYC+ + +L
Sbjct: 123 ---LPGFVYGCGQDSEGLFGRAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTRGGGGFL 179
Query: 221 HNKLILGDGAIIDEG-DATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSGGG 276
+G ++ TP+ G+ Y++ L AI+V GR L + ++
Sbjct: 180 S----IGKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVP----- 230
Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS--YSWPDKLCYHGIMSSDLKGFPTV 334
+IDSGT +T L Y + + + + R+ +S D C+ G + D++ P V
Sbjct: 231 TIIDSGTVITRLPMSVYTPFQQAFVKIMSSKYARAPGFSILDT-CFKGNL-KDMQSVPEV 288
Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
R F+GGA L L ++ Q C+A + NN ++IG QQ + V +DI
Sbjct: 289 RLIFQGGADLNLRPVNVLLQVDEGLTCLAF---AGNN---GVAIIGNHQQQTFKVAHDIS 342
Query: 395 RKQKTFQRMDCE 406
+ F C
Sbjct: 343 TARIGFATGGCN 354
>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 102/375 (27%), Positives = 172/375 (45%), Gaps = 44/375 (11%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ--LG---TIFYPSRSSSYAYV 114
L+Y I IG PP + +DTGS ++WV+C C++C + LG T++ SSS +V
Sbjct: 84 LYYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRSNLGMDLTLYDIKESSSGKFV 143
Query: 115 PCDSEHCRYFP---YARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFK------NTDES 165
PCD E C+ C+A C Y ++Y G T+ + + + + TD +
Sbjct: 144 PCDQEFCKEINGGLLTGCTA-NISCPYLEIYGDGSSTAGYFVKDIVLYDQVSGDLKTDSA 202
Query: 166 TIHVQDVVFGCG------FSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGS 213
+VFGCG S++ GI G G SS++SQL SS F++C+
Sbjct: 203 N---GSIVFGCGARQSGDLSSSNEEALGGILGFGKANSSMISQLASSGKVKKMFAHCLNG 259
Query: 214 LHDPDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDD 272
++ I G ++ + + TPL HY + + A+ V L ++ + + D
Sbjct: 260 VNGGG------IFAIGHVVQPKVNMTPLLPDQPHYSVNMTAVQVGHAFLSLSTDTSTQGD 313
Query: 273 SGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFP 332
G +IDSGT + +L + YE L +++ + ++R+ + C+ S D GFP
Sbjct: 314 R-KGTIIDSGTTLAYLPEGIYEPLVYKIISQHPDLKVRTLH-DEYTCFQYSESVD-DGFP 370
Query: 333 TVRFHFRGGAKLAL-EKDSMFYQPRPDAFCMAVNPASINNR-YVNFSLIGMMAQQFYNVG 390
V F+F G L + D +F P D +C+ + +R N +L+G + V
Sbjct: 371 AVTFYFENGLSLKVYPHDYLF--PSGDFWCIGWQNSGTQSRDSKNMTLLGDLVLSNKLVF 428
Query: 391 YDIGRKQKTFQRMDC 405
YD+ + + +C
Sbjct: 429 YDLENQVIGWTEYNC 443
>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
Length = 494
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 95/337 (28%), Positives = 145/337 (43%), Gaps = 32/337 (9%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLG---TIFYPSRSSSYA 112
L++ I IG P + +DTGS +LWV+C C C LG T++ P S S
Sbjct: 87 TGLYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGE 146
Query: 113 YVPCDSEHC--RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD---ESTI 167
V CD + C Y C Y+ Y G T+ F T+ L + ++T
Sbjct: 147 LVTCDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTP 206
Query: 168 HVQDVVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHD 216
V FGCG + N GI G G SS++SQL ++ F++C+
Sbjct: 207 ANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCL----- 261
Query: 217 PDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
D ++ I G ++ + TPL HY + L+ I V G L + NIF +S
Sbjct: 262 -DTVNGGGIFAIGNVVQPKVKTTPLVSDMPHYNVILKGIDVGGTALGLPTNIFDSGNS-K 319
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
G +IDSGT + ++ + Y+AL M+ + + + + D C+ S D GFP V
Sbjct: 320 GTIIDSGTTLAYVPEGVYKALF--AMVFDKHQDISVQTLQDFSCFQYSGSVD-DGFPEVT 376
Query: 336 FHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR 372
FHF G L + +Q + +CM + +
Sbjct: 377 FHFEGDVSLIVSPHDYLFQNGKNLYCMGFQNGGVQTK 413
>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 546
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 104/375 (27%), Positives = 168/375 (44%), Gaps = 42/375 (11%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+++++ +G PP +DTGS L W+ C PC +C Q G + P +SSSY + C
Sbjct: 181 YFIDVFVGTPPKHFSLILDTGSDLNWIQCVPCYECFEQNGPHYDPGQSSSYRNIGCHDSR 240
Query: 121 CRYF----PYARCSAYKHRCIYTQLYLIGPETS----VFVSTEQLTFKNTDESTIHVQDV 172
C P C A C Y Y T+ + T LT + V++V
Sbjct: 241 CHLVSSPDPPQPCKAENQTCPYYYWYGDSSNTTGDFALETFTVNLTMSSGKPELRRVENV 300
Query: 173 VFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILG 227
+FGCG F +G+ GLG G S SQL S SFSYC+ + + +KLI G
Sbjct: 301 MFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDANVSSKLIFG 360
Query: 228 DG-----------AIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGG 275
+ + G P +D YY+ +++I V G +++I ++ D G
Sbjct: 361 EDKDLLSHPELNFTTLVAGKENP---VDTFYYVQIKSIVVGGEVVNIPEEKWQIATDGSG 417
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYH--GIMSSDLKGFPT 333
G +IDSGT +++ + AY+ +++ M +++G + + CY+ G+ DL F
Sbjct: 418 GTIIDSGTTLSYFAEPAYQVIKEAFMAKVKGYPVVKDFPVLEPCYNVTGVEQPDLPDFGI 477
Query: 334 VRFHFRGGAKLALEKDSMFYQPRP-DAFCMAV--NPASINNRYVNFSLIGMMAQQFYNVG 390
V F GA ++ F + P + C+A+ P S S+IG QQ +++
Sbjct: 478 V---FSDGAVWNFPVENYFIEIEPREVVCLAILGTPPSA------LSIIGNYQQQNFHIL 528
Query: 391 YDIGRKQKTFQRMDC 405
YD + + F C
Sbjct: 529 YDTKKSRLGFAPTKC 543
>gi|255583547|ref|XP_002532530.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223527742|gb|EEF29846.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 440
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 98/365 (26%), Positives = 159/365 (43%), Gaps = 32/365 (8%)
Query: 63 VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTI-FYPSRSSSYAYVPCDSEHC 121
V++ IG PP Q +DTGS L W+ C+ T F PS SSS++ +PC+ C
Sbjct: 82 VSLPIGTPPQTQQMVLDTGSQLSWIQCHKKSVPKKPPPTTSFDPSLSSSFSVLPCNHPLC 141
Query: 122 RY----FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
+ F C Y+ Y G + E++TF ++ + ++ GC
Sbjct: 142 KPRIPDFTLPTTCDQNRLCHYSYFYADGTYAEGSLVREKITFSSSQST----PPLILGCA 197
Query: 178 FSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPD--------YLHNKLILGD 228
++ GI G+ +GR S SQ S FSYC+ + YL N G
Sbjct: 198 EASTDE---KGILGMNLGRRSFASQAKISKFSYCVPTRQARAGLSSTGSFYLGNNPNSGR 254
Query: 229 GAIIDEGDATPLQFIDG----HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMIDSGT 283
I+ TP Q Y I ++ I + L+I+ +F+ D SG G +IDSG+
Sbjct: 255 FQYINLLTFTPSQRSPNLDPLAYTIPMQGIRMGNARLNISATLFRPDPSGAGQTIIDSGS 314
Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRS---YSWPDKLCYHGIMSSDLKGFPTVRFHFRG 340
+ T+LV EAY +R+EV +RL G +++ Y +C+ G + + F F
Sbjct: 315 EFTYLVDEAYNKVREEV-VRLVGPKLKKGYVYGGVSDMCFDGNPMEIGRLIGNMVFEFEK 373
Query: 341 GAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
G ++ ++K + C+ + + + N +IG QQ V YD+ ++
Sbjct: 374 GVEIVIDKWRVLADVGGGVHCIGIGRSEMLGAASN--IIGNFHQQNLWVEYDLANRRIGL 431
Query: 401 QRMDC 405
+ DC
Sbjct: 432 GKADC 436
>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 481
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 112/381 (29%), Positives = 163/381 (42%), Gaps = 49/381 (12%)
Query: 51 LPSND---ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPS 106
LPS I + V + +G P DTGS L W C PC R C Q IF PS
Sbjct: 125 LPSKSGSTIGTGNYVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARYCYHQQEPIFNPS 184
Query: 107 RSSSYAYVPCDSEHCRYFPYA-----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKN 161
+S+SY + C S C CSA C+Y Y + F + ++L +
Sbjct: 185 KSTSYTNISCSSPTCDELKSGTGNSPSCSA--STCVYGIQYGDQSYSVGFFAQDKLALTS 242
Query: 162 TDESTIHVQDVVFGCGFSTNRNFKFSGIFGL-GIGRS--SLVSQLNSS----FSYCIGSL 214
TD + +FGCG NR F G+ GL G+GR+ SLVSQ FSYC+ S
Sbjct: 243 TDV----FNNFLFGCG-QNNRGL-FVGVAGLIGLGRNALSLVSQTAQKYGKLFSYCLPST 296
Query: 215 HDPDYLHNKLILGDGAIIDEGDATPLQFIDGH----YYITLEAISVDGRMLDINPNIFKR 270
L G G + ++ Y++ L AISV GR L + ++F
Sbjct: 297 SSST---GYLTFGSGGGTSKAVKFTPSLVNSQGPSFYFLNLIAISVGGRKLSTSASVF-- 351
Query: 271 DDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSY--SWPDKL---CYHGIMS 325
S G +IDSGT ++ L AY LR +QM Y + P + CY
Sbjct: 352 --STAGTIIDSGTVISRLPPTAYSDLRASFQ-----QQMSKYPKAAPASILDTCYD-FSQ 403
Query: 326 SDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQ 385
D P + +F GA++ L+ +FY C+A + N+ + +++G + Q+
Sbjct: 404 YDTVDVPKINLYFSDGAEMDLDPSGIFYILNISQVCLAF---AGNSDATDIAILGNVQQK 460
Query: 386 FYNVGYDIGRKQKTFQRMDCE 406
++V YD+ + F CE
Sbjct: 461 TFDVVYDVAGGRIGFAPGGCE 481
>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
Length = 539
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 98/318 (30%), Positives = 142/318 (44%), Gaps = 31/318 (9%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
L+Y + +G PP + +DTGS +LWV C C C G F P S + + +
Sbjct: 80 LYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPI 139
Query: 115 PCDSEHCRYFPYAR---CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI---H 168
C + C + + CS + C YT Y G TS F ++ L F S++
Sbjct: 140 SCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNS 199
Query: 169 VQDVVFGCGFSTNRNFKFS-----GIFGLGIGRSSLVSQLNSS------FSYCIGSLHDP 217
VVFGC S + S GIFG G S++SQL S FS+C L
Sbjct: 200 TAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHC---LKGE 256
Query: 218 DYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
+ L+LG+ I++ TPL HY + L +ISV+G+ L INP++F + G G
Sbjct: 257 NGGGGILVLGE--IVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSN-GQG 313
Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRF 336
+ID+GT + +L + AY E + + +R CY I +S FP V
Sbjct: 314 TIIDTGTTLAYLSEAAYVPFV-EAITNAVSQSVRPVVSKGNQCYV-ITTSVGDIFPPVSL 371
Query: 337 HFRGGAKLALEKDSMFYQ 354
+F GGA + L Q
Sbjct: 372 NFAGGASMFLNPQDYLIQ 389
>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 527
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 100/377 (26%), Positives = 164/377 (43%), Gaps = 44/377 (11%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+++++ +G PP +DTGS L W+ C PC DC Q G + P S+S+ + C+
Sbjct: 160 YFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNGMFYDPKTSASFKNITCNDPR 219
Query: 121 CRYF----PYARCSAYKHRCIYTQLYLIGPETS----VFVSTEQLTFKNTDESTIHVQDV 172
C P +C + C Y Y T+ V T LT S V ++
Sbjct: 220 CSLISSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGGSSEYKVGNM 279
Query: 173 VFGCGFSTNRNFKFSGIFGLGIG-----------RSSLVSQLNSSFSYCIGSLHDPDYLH 221
+FGCG NR G+F G S L S SFSYC+ + +
Sbjct: 280 MFGCG-HWNR-----GLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSNTNVS 333
Query: 222 NKLILG-DGAIIDEGDATPLQFIDGH-------YYITLEAISVDGRMLDINPNIFK-RDD 272
+KLI G D +++ + F++G YYI +++I V G+ LDI + D
Sbjct: 334 SKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGKALDIPEETWNISSD 393
Query: 273 SGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQ--MRSYSWPDKLCYH--GIMSSDL 328
GG +IDSGT +++ + AYE ++++ +++ R + D C++ GI +++
Sbjct: 394 GDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDFPVLDP-CFNVSGIEENNI 452
Query: 329 KGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYN 388
P + F G ++ F D C+A+ + FS+IG QQ ++
Sbjct: 453 H-LPELGIAFVDGTVWNFPAENSFIWLSEDLVCLAI----LGTPKSTFSIIGNYQQQNFH 507
Query: 389 VGYDIGRKQKTFQRMDC 405
+ YD R + F C
Sbjct: 508 ILYDTKRSRLGFTPTKC 524
>gi|356531884|ref|XP_003534506.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 482
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 99/342 (28%), Positives = 149/342 (43%), Gaps = 38/342 (11%)
Query: 57 SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSY 111
S L+Y I +G P + +DTGS LWV+C C C + G T++ P+ S +
Sbjct: 73 STGLYYTKIGLG--PNDYYVQVDTGSDTLWVNCVGCTTCPKKSGLGMELTLYDPNSSKTS 130
Query: 112 AYVPCDSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIH 168
VPCD E C P + C C Y+ Y G TS + LTF
Sbjct: 131 KVVPCDDEFCTSTYDGPISGCKK-DMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLRT 189
Query: 169 VQD---VVFGCG------FSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGS 213
V D V+FGCG S+ + GI G G SS++SQL ++ FS+C+
Sbjct: 190 VPDNTSVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRVFSHCL-- 247
Query: 214 LHDPDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDD 272
D ++ I G ++ + TPL HY + L+ I V G + + +IF
Sbjct: 248 ----DTVNGGGIFAIGEVVQPKVKTTPLVPRMAHYNVVLKDIEVAGDPIQLPTDIFDS-T 302
Query: 273 SGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL-CYHGIMSSDL-KG 330
SG G +IDSGT + +L Y+ L ++ + + G M Y D+ C+H L
Sbjct: 303 SGRGTIIDSGTTLAYLPVSIYDQLLEKTLAQRSG--MELYLVEDQFTCFHYSDEKSLDDA 360
Query: 331 FPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR 372
FPTV+F F G L + + D +C+ ++ +
Sbjct: 361 FPTVKFTFEEGLTLTAYPHDYLFPFKEDMWCIGWQKSTAQTK 402
>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
Length = 494
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 101/372 (27%), Positives = 158/372 (42%), Gaps = 33/372 (8%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLG---TIFYPSRSSSYA 112
L++ I IG P + +DTGS +LWV+C C C LG T++ P S S
Sbjct: 87 TGLYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGE 146
Query: 113 YVPCDSEHC--RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD---ESTI 167
V CD + C Y C Y+ Y G T+ F T+ L + ++T
Sbjct: 147 LVTCDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTP 206
Query: 168 HVQDVVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHD 216
V FGCG + N GI G G SS++SQL ++ F++C+
Sbjct: 207 ANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCL----- 261
Query: 217 PDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
D ++ I G ++ + TPL HY + L+ I V G L + NIF +S
Sbjct: 262 -DTVNGGGIFAIGNVVQPKVKTTPLVPDMPHYNVILKGIDVGGTALGLPTNIFDSGNS-K 319
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
G +IDSGT + ++ + Y+AL M+ + + + + D C+ S D GFP V
Sbjct: 320 GTIIDSGTTLAYVPEGVYKALF--AMVFDKHQDISVQTLQDFSCFQYSGSVD-DGFPEVT 376
Query: 336 FHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR-YVNFSLIGMMAQQFYNVGYDIG 394
FHF G L + +Q + +CM + + + L+G + V YD+
Sbjct: 377 FHFEGDVSLIVSPHDYLFQNGKNLYCMGFQNGGVQTKDGKDMVLLGDLVLSNKLVLYDLE 436
Query: 395 RKQKTFQRMDCE 406
+ + +C
Sbjct: 437 NQAIGWADYNCS 448
>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
Short=AtASPG2; Flags: Precursor
gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
thaliana]
gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 470
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 120/418 (28%), Positives = 177/418 (42%), Gaps = 49/418 (11%)
Query: 1 LIHRDSILS-PYNNPNENPAHRVQRGINISIARLAYLQEKIRSH-------NTYQAQILP 52
L+HRD S Y N + R++R + A L + K+ N + + I+
Sbjct: 63 LLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPSSDSRYEVNDFGSDIVS 122
Query: 53 SNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYA 112
D + ++V I +G PP Q+ +D+GS ++WV C PC+ C Q +F P++S SY
Sbjct: 123 GMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYT 182
Query: 113 YVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
V C S C + C + C Y +Y G T ++ E LTF T V++V
Sbjct: 183 GVSCGSSVCDRIENSGC--HSGGCRYEVMYGDGSYTKGTLALETLTFAKT-----VVRNV 235
Query: 173 VFGCGFSTNRNFKFSGIFGLGIGRS-SLVSQLN----SSFSYCIGSLHDPDYLHNKLILG 227
GCG F + G S S V QL+ +F YC+ S L+ G
Sbjct: 236 AMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDS--TGSLVFG 293
Query: 228 DGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGT 283
A+ PL YY+ L+ + V G + + +F ++G GGV++D+GT
Sbjct: 294 REALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGT 353
Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----PTVRFHF 338
VT L AY A RD + S CY DL GF PTV F+F
Sbjct: 354 AVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCY------DLSGFVSVRVPTVSFYF 407
Query: 339 RGGAKLALEKDSMFYQPRPDA----FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
G L L + F P D+ F A +P + S+IG + Q+ V +D
Sbjct: 408 TEGPVLTLPARN-FLMPVDDSGTYCFAFAASPTGL-------SIIGNIQQEGIQVSFD 457
>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
gi|194704078|gb|ACF86123.1| unknown [Zea mays]
gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 471
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 105/370 (28%), Positives = 164/370 (44%), Gaps = 38/370 (10%)
Query: 52 PSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSS 110
P + + + +G P +DTGSSL W+ C PC C Q+G +F P SS+
Sbjct: 125 PGTSVGVGNYVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDPRASST 184
Query: 111 YAYVPCDSEHCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES 165
Y V C + C A CSA + CIY Y + ++ST+ ++F +T
Sbjct: 185 YTSVRCSASQCDELQAATLNPSACSA-SNVCIYQASYGDSSFSVGYLSTDTVSFGSTSYP 243
Query: 166 TIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYL 220
+ + +GCG F + +G+ GL + SL+ QL SFSYC+ + YL
Sbjct: 244 SFY-----YGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPTAASTGYL 298
Query: 221 HNKLILGDGAIIDEGDATPL--QFIDGH-YYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
+G TP+ +D Y+ITL +SV G L ++P+ + +
Sbjct: 299 S----IGPYNTGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSSLPT---- 350
Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQ-MRSYSWPDKLCYHGIMSSDLKGFPTVRF 336
+IDSGT +T L + AL V + G Q ++S D C+ G +S L+ PTV
Sbjct: 351 IIDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSILDT-CFEG-QASQLR-VPTVVM 407
Query: 337 HFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
F GGA + L ++ C+A P + ++IG QQ ++V YD+ +
Sbjct: 408 AFAGGASMKLTTRNVLIDVDDSTTCLAFAPTD------STAIIGNTQQQTFSVIYDVAQS 461
Query: 397 QKTFQRMDCE 406
+ F C
Sbjct: 462 RIGFSAGGCS 471
>gi|18408451|ref|NP_564867.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322615|gb|AAG51309.1|AC026480_16 unknown protein [Arabidopsis thaliana]
gi|14334808|gb|AAK59582.1| unknown protein [Arabidopsis thaliana]
gi|15293195|gb|AAK93708.1| unknown protein [Arabidopsis thaliana]
gi|332196351|gb|AEE34472.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 430
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 96/365 (26%), Positives = 169/365 (46%), Gaps = 32/365 (8%)
Query: 63 VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR 122
+++ IG PP Q +DTGS L W+ C+ + P+ T F PS SSS++ +PC C+
Sbjct: 74 ISLPIGTPPQAQQMVLDTGSQLSWIQCH-RKKLPPKPKTSFDPSLSSSFSTLPCSHPLCK 132
Query: 123 Y----FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
F C Y+ Y G + E++TF NT+ + ++ GC
Sbjct: 133 PRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTEIT----PPLILGCAT 188
Query: 179 STNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHD-PDYL-HNKLILGDG------ 229
++ + GI G+ GR S VSQ S FSYCI + P + LGD
Sbjct: 189 ESSDD---RGILGMNRGRLSFVSQAKISKFSYCIPPKSNRPGFTPTGSFYLGDNPNSHGF 245
Query: 230 ---AIIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMIDSGTD 284
+++ ++ + +D Y + + I + L+I+ ++F+ D G G M+DSG++
Sbjct: 246 KYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVFRPDAGGSGQTMVDSGSE 305
Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRS---YSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
T LV AY+ +R E+M R+ G +++ Y +C+ G ++ + + F F G
Sbjct: 306 FTHLVDAAYDKVRAEIMTRV-GRRLKKGYVYGGTADMCFDGNVAMIPRLIGDLVFVFTRG 364
Query: 342 AKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQ 401
++ + K+ + C+ + +S+ N +IG + QQ V +D+ ++ F
Sbjct: 365 VEILVPKERVLVNVGGGIHCVGIGRSSMLGAASN--IIGNVHQQNLWVEFDVTNRRVGFA 422
Query: 402 RMDCE 406
+ DC
Sbjct: 423 KADCS 427
>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
Length = 491
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 107/366 (29%), Positives = 152/366 (41%), Gaps = 44/366 (12%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD---CSPQLGTIFYPSRSSSYAYVPCD 117
F V + +G P P DTGS L WV C PC C PQ +F PS+SS+YA V C
Sbjct: 149 FVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAAVHCG 208
Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
C CS C+Y Y G T+ +S + L S+ + FGCG
Sbjct: 209 EPQCAA-AGGLCSEDNTTCLYLVHYGDGSSTTGVLSRDTLALT----SSRALAGFPFGCG 263
Query: 178 FSTNRNFKFSGIFG-----------LGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLIL 226
RN G FG S + + FSYC+ S + L +
Sbjct: 264 ---TRNL---GDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCLPSSNS---TTGYLTI 314
Query: 227 GDGAIIDEGDATPLQFI-----DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDS 281
G D G A + Y++ L +I + G +L + P +F R GG ++DS
Sbjct: 315 GATPATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYILPVPPAVFTR----GGTLLDS 370
Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL--CYHGIMSSDLKGFPTVRFHFR 339
GT +T+L +AYE LRD RL E+ D L CY S++ P V F F
Sbjct: 371 GTVLTYLPAQAYELLRDR--FRLTMERYTPAPPNDVLDACYDFAGESEVI-VPAVSFRFG 427
Query: 340 GGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKT 399
GA L+ + + C+A A+++ + S+IG Q+ V YD+ ++
Sbjct: 428 DGAVFELDFFGVMIFLDENVGCLAF--AAMDAGGLPLSIIGNTQQRSAEVIYDVAAEKIG 485
Query: 400 FQRMDC 405
F C
Sbjct: 486 FVPASC 491
>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 114 bits (285), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 104/361 (28%), Positives = 160/361 (44%), Gaps = 39/361 (10%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DCSPQLGTIFYPSRSSSYAYVPCDSE 119
+ + +G P P +DTGSSL W+ C PCR C Q G +F P SSSYA V C +
Sbjct: 137 YVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRVSCHRQSGPVFDPKTSSSYAAVSCSTP 196
Query: 120 HCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
C A CS+ CIY Y + ++S + ++F + V + +
Sbjct: 197 QCNDLSTATLNPAACSS-SDVCIYQASYGDSSFSVGYLSKDTVSFGSNS-----VPNFYY 250
Query: 175 GCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILGDG 229
GCG F + +G+ GL + SL+ QL SFSYC+ P + +
Sbjct: 251 GCGQDNEGLFGRSAGLMGLARNKLSLLYQLAPTLGYSFSYCL-----PSSSSSGYLSIGS 305
Query: 230 AIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVT 286
+ TP+ D Y+I L ++V G+ L ++ + + S +IDSGT +T
Sbjct: 306 YNPGQYSYTPMVSSTLDDSLYFIKLSGMTVAGKPLAVSSSEY----SSLPTIIDSGTVIT 361
Query: 287 WLVKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLA 345
L Y+AL V ++G ++ +YS D C+ G +S L+ P V F GGA L
Sbjct: 362 RLPTTVYDALSKAVAGAMKGTKRADAYSILDT-CFVG-QASSLR-VPAVSMAFSGGAALK 418
Query: 346 LEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
L ++ C+A PA + ++IG QQ ++V YD+ + F C
Sbjct: 419 LSAQNLLVDVDSSTTCLAFAPAR------SAAIIGNTQQQTFSVVYDVKSNRIGFAAGGC 472
Query: 406 E 406
Sbjct: 473 T 473
>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 557
Score = 114 bits (285), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 102/371 (27%), Positives = 168/371 (45%), Gaps = 34/371 (9%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+++++ IG PP +DTGS L W+ C PC DC Q G + P SSS+ + C
Sbjct: 192 YFMDVFIGTPPRHFSLILDTGSDLNWIQCVPCYDCFVQNGPYYDPKESSSFKNIGCHDPR 251
Query: 121 CRYF----PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDEST----IHVQDV 172
C P C A C Y Y T+ + E T T + V++V
Sbjct: 252 CHLVSSPDPPQPCKAENQTCPYFYWYGDSSNTTGDFALETFTVNLTSPAGKSEFKRVENV 311
Query: 173 VFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILG 227
+FGCG F +G+ GLG G S SQL S SFSYC+ + + +KLI G
Sbjct: 312 MFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLIFG 371
Query: 228 -DGAIIDEGDATPLQFIDGH-------YYITLEAISVDGRMLDINPNIFKRDDSG-GGVM 278
D +++ + + G YY+ +++I V G +L I + G GG +
Sbjct: 372 EDKDLLNHPEVNFTSLVAGKENPVDTFYYVQIKSIMVGGEVLKIPEETWHLSPEGAGGTI 431
Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYH--GIMSSDLKGFPTVR 335
+DSGT +++ + +YE ++D + +++G ++ + D CY+ G+ +L P R
Sbjct: 432 VDSGTTLSYFAEPSYEIIKDAFVKKVKGYPVIKDFPILDP-CYNVSGVEKMEL---PEFR 487
Query: 336 FHFRGGAKLALEKDSMFYQPRPDAF-CMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
F GA ++ F + P+ C+A+ + S+IG QQ +++ YD
Sbjct: 488 ILFEDGAVWNFPVENYFIKLEPEEIVCLAI----LGTPRSALSIIGNYQQQNFHILYDTK 543
Query: 395 RKQKTFQRMDC 405
+ + + M C
Sbjct: 544 KSRLGYAPMKC 554
>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
Length = 445
Score = 114 bits (285), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 107/366 (29%), Positives = 155/366 (42%), Gaps = 48/366 (13%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
L+HR +P + + P+ + S ARL+Y I S + +
Sbjct: 58 LLHRHGPCAPSLSTDTPPS--MSEMFRRSHARLSY----IVSGKKVSVPAHLGTSVKSLE 111
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD--CSPQLGTIFYPSRSSSYAYVPCDS 118
+ +S G P VPQ +DTGS L W+ C PC CSPQ +F PS SS+Y+ VPC S
Sbjct: 112 YVATVSFGTPAVPQVVVIDTGSDLTWLQCKPCSSGQCSPQKDPLFDPSHSSTYSAVPCAS 171
Query: 119 EHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
C+ Y + C + Y+ G T ++LT V+D FG
Sbjct: 172 GECKKLAADAYGSGCSNGQPCGFAISYVDGTSTVGVYGKDKLTL----APGAIVKDFYFG 227
Query: 176 CGFSTNRNFKFSGIFGLGIGRS-SLVSQ--LNSSFSYCIGSLHD-PDYLHNKLILGDGAI 231
CG S + S SL +Q FSYC+ +++ P +L G G
Sbjct: 228 CGHSKSSLPGLFDGLLGLGRLSESLGAQYGGGGGFSYCLPAVNSKPGFLA----FGAGRN 283
Query: 232 IDEGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
TP+ + G +TL I+V G+ LD+ P+ F GG+++DSGT VT L
Sbjct: 284 PSGFVFTPMGRVPGQPTFSTVTLAGITVGGKKLDLRPSAFS-----GGMIVDSGTVVTVL 338
Query: 289 VKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS--DLKGF-----PTVRFHFRGG 341
Y ALR E M++Y HG + + DL G+ P + F GG
Sbjct: 339 QSTVYRALRAAFR-----EAMKAYRL-----VHGDLDTCYDLTGYKNVVVPKIALTFSGG 388
Query: 342 AKLALE 347
A + L+
Sbjct: 389 ATINLD 394
>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 639
Score = 114 bits (285), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 103/403 (25%), Positives = 178/403 (44%), Gaps = 54/403 (13%)
Query: 32 RLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYP 91
RL +LQ ++ H++ L + ++N + + IG PP +DTGS++ +V C
Sbjct: 60 RLRHLQNLVKPHSSNARMRLHDDLLTNGYYTTRLWIGSPPQEFALIVDTGSTVTYVPCSN 119
Query: 92 CRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVF 151
C C F P SS+Y V C+++ C +C Y + Y +S
Sbjct: 120 CVQCGNHQDPRFQPELSSTYQPVKCNAD-------CNCDENGVQCTYERRYAEMSTSSGV 172
Query: 152 VSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF---KFSGIFGLGIGRSSLVSQL----- 203
++ + ++F ES + Q VFGC + + + GI GLG G S++ QL
Sbjct: 173 LAEDVMSFGK--ESELVPQRAVFGCETMESGDLYTQRADGIMGLGRGTLSVMDQLVGKGV 230
Query: 204 -NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH--------YYITLEAI 254
++SFS C G + +G GA++ G ++P + H Y I L+ I
Sbjct: 231 VSNSFSLCYGGMD----------VGGGAMVLGGISSPPGMVFSHSDPSRSPYYNIELKEI 280
Query: 255 SVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSW 314
V G+ L +NP F D G ++DSGT + ++AY A +D +M ++ ++ S
Sbjct: 281 HVAGKPLKLNPRTF---DGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKI--SFLKQISG 335
Query: 315 PD----KLCYHGI---MSSDLKGFPTVRFHFRGGAKLALEKDSMFYQ--PRPDAFCMAVN 365
PD +C+ G ++ K FP V F G K++L ++ ++ A+C+ +
Sbjct: 336 PDPNFKDICFSGAGRDVTELPKVFPEVDMVFANGQKISLSPENYLFRHTKVSGAYCLGI- 394
Query: 366 PASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
N +L+G + + V Y+ F + +C L
Sbjct: 395 ---FKNGNDQTTLLGGIIVRNTLVTYNRENSTIGFWKTNCSEL 434
>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 114 bits (285), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 106/370 (28%), Positives = 164/370 (44%), Gaps = 38/370 (10%)
Query: 52 PSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSS 110
P + + + +G P +DTGSSL W+ C PC C Q+G +F P SS+
Sbjct: 125 PGTSVGVGNYVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDPRASST 184
Query: 111 YAYVPCDSEHCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES 165
YA V C + C A CSA + CIY Y + +ST+ ++F +T
Sbjct: 185 YASVRCSASQCDELQAATLNPSACSA-SNVCIYQASYGDSSFSVGSLSTDTVSFGSTRYP 243
Query: 166 TIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYL 220
+ + +GCG F + +G+ GL + SL+ QL SFSYC+ + YL
Sbjct: 244 SFY-----YGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPTAASTGYL 298
Query: 221 HNKLILGDGAIIDEGDATPL--QFIDGH-YYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
+G TP+ +D Y+ITL +SV G L ++P+ + +
Sbjct: 299 S----IGPYNTGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSSLPT---- 350
Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQ-MRSYSWPDKLCYHGIMSSDLKGFPTVRF 336
+IDSGT +T L + AL V + G Q ++S D C+ G +S L+ PTV
Sbjct: 351 IIDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSILDT-CFEG-QASQLR-VPTVAM 407
Query: 337 HFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
F GGA + L ++ C+A P + ++IG QQ ++V YD+ +
Sbjct: 408 AFAGGASMKLTTRNVLIDVDDSTTCLAFAPTD------STAIIGNTQQQTFSVIYDVAQS 461
Query: 397 QKTFQRMDCE 406
+ F C
Sbjct: 462 RIGFSAGGCS 471
>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
Length = 470
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 107/409 (26%), Positives = 174/409 (42%), Gaps = 42/409 (10%)
Query: 12 NNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPP 71
N P+ P +++ + A L + + S + P + + + +G P
Sbjct: 90 NAPSRRPTTSLRKPKAAAGASGGPLDDSLAS-----VPLTPGTSVGVGNYVTELGLGTPA 144
Query: 72 VPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYAR-- 128
+DTGSSL W+ C PC C Q+G ++ P SS+YA VPC + C A
Sbjct: 145 TSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLYDPRASSTYATVPCSASQCDELQAATLN 204
Query: 129 ---CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF- 184
CS ++ CIY Y + ++S + ++F + + +GCG F
Sbjct: 205 PSACSV-RNVCIYQASYGDSSFSVGYLSRDTVSFGSGSYPNFY-----YGCGQDNEGLFG 258
Query: 185 KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLH-NKLILGDGAIIDEGDATP 239
+ +G+ GL + SL+ QL SFSYC+ + YL G + ++
Sbjct: 259 RSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPTPASTGYLSIGPYTSGHYSYTPMASSS- 317
Query: 240 LQFIDGH-YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRD 298
+D Y++TL +SV G L ++P + + +IDSGT +T L Y AL
Sbjct: 318 ---LDASLYFVTLSGMSVGGSPLAVSPAEYSSLPT----IIDSGTVITRLPTAVYTALSK 370
Query: 299 EVMIRLEGEQ-MRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP 357
V + G Q ++S D C+ G +S L+ P V F GGA L L ++
Sbjct: 371 AVAAAMVGVQSAPAFSILDT-CFQG-QASQLR-VPAVAMAFAGGATLKLATQNVLIDVDD 427
Query: 358 DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
C+A P + ++IG QQ ++V YD+ + + F C
Sbjct: 428 STTCLAFAPTD------STTIIGNTQQQTFSVVYDVAQSRIGFAAGGCS 470
>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
Length = 451
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 122/419 (29%), Positives = 186/419 (44%), Gaps = 46/419 (10%)
Query: 20 HRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTA-- 77
R+ R S AR A L ++ Y + + S+ + ++ +IG P PQ A
Sbjct: 49 ERLSRMAVRSRARAASLYQR---GGHYGQPVTATAVPSSGEYLIHFNIGTP-RPQRVALT 104
Query: 78 MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRY---FPYARCSAYKH 134
MDTGS L+W C PC C Q +F PS SS++ V C CR + C+
Sbjct: 105 MDTGSDLVWTQCTPCPVCFDQPFPLFDPSVSSTFRAVACPDPICRPSSGLSVSACALKTF 164
Query: 135 RCIYTQLYLIGPETSVFVSTEQLTFKNTD---ESTIHVQDVVFGCG------FSTNRNFK 185
RC Y Y T+ ++ + TF + + + V + FGCG F++N
Sbjct: 165 RCFYLCSYGDKSITAGYIFKDTFTFMSPNGEGAPPVAVSGLAFGCGDYNTGVFASNE--- 221
Query: 186 FSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNK---LILG---DGAIIDEGD-- 236
SGI G G G SL SQL FSYC+ S HD + NK + LG +G
Sbjct: 222 -SGIAGFGRGPLSLPSQLRVGRFSYCLTS-HD-ETESNKTSAVFLGTPPNGLRAHSSGPF 278
Query: 237 -ATPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKE 291
+TP+ YY++LE I+V L ++ ++F + D GG +IDSGT VT
Sbjct: 279 RSTPIIHSPSFPTFYYLSLEGITVGKTRLPVDSSVFALKKDGSGGTVIDSGTGVTTFPAA 338
Query: 292 AYEALRDEVMIRLEGEQMRSYS-WPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS 350
+E L++E + +L + + S + LC+ P + FH A + L +++
Sbjct: 339 VFEQLKNEFVAQLPLPRYDNTSEVGNLLCFQRPKGGKQVPVPKLIFHL-ASADMDLPREN 397
Query: 351 MFYQPR-PDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
Y P D+ M + IN V+ LIG QQ ++ YD+ + F C+ +
Sbjct: 398 --YIPEDTDSGVMCL---MINGAEVDMVLIGNFQQQNMHIVYDVENSKLLFASAQCDKM 451
>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
Length = 482
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 115/424 (27%), Positives = 170/424 (40%), Gaps = 47/424 (11%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
L+HRDS N + A + R + + R A++ K + + + + ++
Sbjct: 70 LVHRDSFAV-----NASAADLLARRLQRDMRRAAWIITKAATPADPENGTVVTGAPTSGE 124
Query: 61 FYVNISIGQP-----PVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
+ I++G P + D GS + W+ C PC C Q G ++ +SSS + V
Sbjct: 125 YIAKITVGTPYENDSSFEALLSPDMGSDVTWLQCMPCFRCYHQPGPVYNRLKSSSASDVG 184
Query: 116 CDSEHCRYF-PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
C + CR C + + C Y Y G ++ E LTF + V V
Sbjct: 185 CYAPACRALGSSGGCVQFLNECQYKVEYGDGSSSAGDFGVETLTFP----PGVRVPGVAI 240
Query: 175 GCGFSTNRNF--KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGD 228
GCG F +GI GLG G S SQ+ SFSYC+ + L G
Sbjct: 241 GCGSDNQGLFPAPAAGILGLGRGSLSFPSQIAGRYGRSFSYCLAG-QGTGGRSSTLTFGS 299
Query: 229 GAIIDEGDATPLQF--------IDGHYYITLEAISVDG-RMLDINPNIFKRDDSG--GGV 277
GA TP F + YY+ L ISV G R+ + + + D S GGV
Sbjct: 300 GASATTTTTTPPSFTPMLTNSRMYTFYYVGLVGISVGGVRVRGVTESDLRLDPSTGHGGV 359
Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK--------LCYHGIMSSDLK 329
++DSGT VT L AY A RD + ++ WP CY + +K
Sbjct: 360 IVDSGTAVTRLSGPAYAAFRDAFRV----AAVKELGWPSPGGPFAFFDTCYSSVRGRVMK 415
Query: 330 GFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNV 389
P V HF GG ++ L + + M A +R V S+IG + Q + V
Sbjct: 416 KVPAVSMHFAGGVEVKLPPQNYLIPVDSNKGTMCFAFAGSGDRGV--SIIGNIQLQGFRV 473
Query: 390 GYDI 393
YD+
Sbjct: 474 VYDV 477
>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 100/368 (27%), Positives = 154/368 (41%), Gaps = 29/368 (7%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGT-----IFYPSRSSSYAYVP 115
++V +G P P DTGS L WV C R SP +F P+ S S+A +P
Sbjct: 110 YFVQFRVGTPAQPFVLVADTGSDLTWVKCRGRRASSPDASPLASPRVFRPANSKSWAPIP 169
Query: 116 CDSEHCR-YFPY--ARCSAYK---HRCIYTQLYLIGPETSVFVSTEQLTFK---NTDEST 166
C S+ C+ Y P+ A CSA C Y Y V T+ T + +
Sbjct: 170 CSSDTCKSYVPFSLANCSAGTTPPAPCGYDYRYKDKSSARGVVGTDAATIALSGSGSDRK 229
Query: 167 IHVQDVVFGCGFSTN-RNFKFS-GIFGLGIGRSSLVS----QLNSSFSYCIGSLHDPDYL 220
+Q+VV GC S + ++F+ S G+ LG S S + FSYC+ P
Sbjct: 230 AKLQEVVLGCTTSYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRNA 289
Query: 221 HNKLILGDGAIIDEGDATPLQF---IDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
+ L G TPL + Y +T++A+SV G+ L+I ++ +GG +
Sbjct: 290 TSYLTFGPVGAAHSPSRTPLLLDAQVAPFYAVTVDAVSVAGKALNIPAEVWDVKKNGGAI 349
Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFH 337
+ DSGT +T L AY+A+ + +L R P + CY+ + P +
Sbjct: 350 L-DSGTSLTILATPAYKAVVAALSKQL-ARVPRVTMDPFEYCYNWTATRRPPAVPRLEVR 407
Query: 338 FRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
F G A+L S P C+ + + S+IG + QQ + +D+ +
Sbjct: 408 FAGSARLRPPTKSYVIDAAPGVKCIGLQ----EGVWPGVSVIGNILQQEHLWEFDLANRW 463
Query: 398 KTFQRMDC 405
FQ C
Sbjct: 464 LRFQESRC 471
>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 609
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 103/403 (25%), Positives = 178/403 (44%), Gaps = 54/403 (13%)
Query: 32 RLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYP 91
RL +LQ ++ H++ L + ++N + + IG PP +DTGS++ +V C
Sbjct: 60 RLRHLQNLVKPHSSNARMRLHDDLLTNGYYTTRLWIGSPPQEFALIVDTGSTVTYVPCSN 119
Query: 92 CRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVF 151
C C F P SS+Y V C+++ C +C Y + Y +S
Sbjct: 120 CVQCGNHQDPRFQPELSSTYQPVKCNAD-------CNCDENGVQCTYERRYAEMSTSSGV 172
Query: 152 VSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF---KFSGIFGLGIGRSSLVSQL----- 203
++ + ++F ES + Q VFGC + + + GI GLG G S++ QL
Sbjct: 173 LAEDVMSFGK--ESELVPQRAVFGCETMESGDLYTQRADGIMGLGRGTLSVMDQLVGKGV 230
Query: 204 -NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH--------YYITLEAI 254
++SFS C G + +G GA++ G ++P + H Y I L+ I
Sbjct: 231 VSNSFSLCYGGMD----------VGGGAMVLGGISSPPGMVFSHSDPSRSPYYNIELKEI 280
Query: 255 SVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSW 314
V G+ L +NP F D G ++DSGT + ++AY A +D +M ++ ++ S
Sbjct: 281 HVAGKPLKLNPRTF---DGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKI--SFLKQISG 335
Query: 315 PD----KLCYHGI---MSSDLKGFPTVRFHFRGGAKLALEKDSMFYQ--PRPDAFCMAVN 365
PD +C+ G ++ K FP V F G K++L ++ ++ A+C+ +
Sbjct: 336 PDPNFKDICFSGAGRDVTELPKVFPEVDMVFANGQKISLSPENYLFRHTKVSGAYCLGI- 394
Query: 366 PASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
N +L+G + + V Y+ F + +C L
Sbjct: 395 ---FKNGNDQTTLLGGIIVRNTLVTYNRENSTIGFWKTNCSEL 434
>gi|21618176|gb|AAM67226.1| unknown [Arabidopsis thaliana]
Length = 430
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 96/365 (26%), Positives = 169/365 (46%), Gaps = 32/365 (8%)
Query: 63 VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR 122
+++ IG PP Q +DTGS L W+ C+ + P+ T F PS SSS++ +PC C+
Sbjct: 74 ISLPIGTPPQAQQMVLDTGSQLSWIQCH-RKKLPPKPKTSFDPSLSSSFSTLPCSHPLCK 132
Query: 123 Y----FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
F C Y+ Y G + E++TF NT+ + ++ GC
Sbjct: 133 PRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTEIT----PPLILGCAT 188
Query: 179 STNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHD-PDYL-HNKLILGDG------ 229
++ + GI G+ GR S VSQ S FSYCI + P + LGD
Sbjct: 189 ESSDD---RGILGMNRGRLSFVSQAKISKFSYCIPPKSNRPGFTPTGSFYLGDNPNSHGF 245
Query: 230 ---AIIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMIDSGTD 284
+++ ++ + +D Y + + I + L+I+ ++F+ D G G M+DSG++
Sbjct: 246 KYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVFRPDAGGSGQTMVDSGSE 305
Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRS---YSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
T LV AY+ +R E+M R+ G +++ Y +C+ G ++ + + F F G
Sbjct: 306 FTHLVDAAYDKVRAEIMTRV-GRRLKKGYVYGGTADMCFDGNVAMIPRLIGDLVFVFTRG 364
Query: 342 AKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQ 401
++ + K+ + C+ + +S+ N +IG + QQ V +D+ ++ F
Sbjct: 365 VEIFVPKERVLVNVGGGIHCVGIGRSSMLGAASN--IIGNVHQQNLWVEFDVTNRRVGFA 422
Query: 402 RMDCE 406
+ DC
Sbjct: 423 KADCS 427
>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 519
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 100/357 (28%), Positives = 152/357 (42%), Gaps = 29/357 (8%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYVPCDSE 119
+ V + +G P DTGS WV C PC C Q +F P+RSS+YA V C +
Sbjct: 180 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTYANVSCAAP 239
Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
C CS C+Y Y G + F + + LT + D V+ FGCG
Sbjct: 240 ACSDLNIHGCSG--GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDA----VKGFRFGCGER 293
Query: 180 TNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
F + +G+ GLG G++SL Q F++C+ + L G G++
Sbjct: 294 NEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTGT---GYLDFGAGSLAAA 350
Query: 235 GD--ATPLQFIDG--HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVK 290
TP+ +G YY+ + I V G++L I ++F G ++DSGT +T L
Sbjct: 351 RARLTTPMLTENGPTFYYVGMTGIRVGGQLLSIPQSVFAT----AGTIVDSGTVITRLPP 406
Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKL--CYHGIMSSDLKGFPTVRFHFRGGAKLALEK 348
AY +LR + + L CY S + PTV F+GGA+L ++
Sbjct: 407 AAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVA-IPTVSLLFQGGARLDVDA 465
Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ Y C+A + N + ++G + + V YDIG+K F C
Sbjct: 466 SGIMYAASASQVCLAF---AANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYPGAC 519
>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 641
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 102/372 (27%), Positives = 166/372 (44%), Gaps = 40/372 (10%)
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
+SN + + IG PP +DTGS++ +V C C C F P SS+Y +
Sbjct: 83 LSNGYYTTRLFIGTPPQEFALIVDTGSTVTYVPCSTCEQCGKHQDPRFQPESSSTYKPMQ 142
Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
C+ P C +C Y + Y +S ++ + L+F N ES + Q +FG
Sbjct: 143 CN-------PSCNCDDEGKQCTYERRYAEMSSSSGLLAEDVLSFGN--ESELTPQRAIFG 193
Query: 176 C-GFSTNRNF--KFSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLIL 226
C T F + GI GLG G S+V QL +SFS C G + D + ++L
Sbjct: 194 CETVETGELFSQRADGIMGLGRGPLSVVDQLVIKEVVGNSFSLCYGGM---DVVGGAMVL 250
Query: 227 GD-GAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDV 285
G+ D A + +Y I L+ + V G+ L +NP +F D G ++DSGT
Sbjct: 251 GNIPPPPDMVFAHSDPYRSAYYNIELKELHVAGKRLKLNPRVF---DGKHGTVLDSGTTY 307
Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK----LCYHGI---MSSDLKGFPTVRFHF 338
+L +EA+ A +D ++ ++ ++ PD +C+ G +S K FP V F
Sbjct: 308 AYLPEEAFVAFKDAIIKEIKF--LKQIHGPDPSYNDICFSGAGRDVSQLSKIFPEVNMVF 365
Query: 339 RGGAKLALEKDSMFYQ--PRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
G KL+L ++ ++ A+C+ + N +L+G + + V YD
Sbjct: 366 GNGQKLSLSPENYLFRHTKVSGAYCLGI----FQNGKDPTTLLGGIVVRNTLVTYDRDND 421
Query: 397 QKTFQRMDCEVL 408
+ F + +C L
Sbjct: 422 KIGFWKTNCSEL 433
>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
Length = 468
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 95/311 (30%), Positives = 143/311 (45%), Gaps = 33/311 (10%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
L+Y + +G PP + +DTGS +LWV C C C G F P S + + +
Sbjct: 51 LYYTRLQLGTPPRDFYVQIDTGSDVLWVSCGSCNGCPVNSGLHIPLNFFDPGSSPTASLI 110
Query: 115 PCDSEHCRYFPYAR---CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI---H 168
C + C + CSA + C Y Y G TS + ++ L F ++
Sbjct: 111 SCSDQRCSLGLQSSDSVCSAQNNLCGYNFQYGDGSGTSGYYVSDLLHFDTVLGGSVMNNS 170
Query: 169 VQDVVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNS------SFSYCIGSLHDP 217
+VFGC G T + GIFG G S+VSQL S +FS+C L
Sbjct: 171 SAPIVFGCSALQTGDLTKSDRAVDGIFGFGQQDMSVVSQLASQGISPRAFSHC---LKGD 227
Query: 218 DYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
D L+LG+ I++ TPL HY + +++ISV+G+ L I+P++F S G
Sbjct: 228 DSGGGILVLGE--IVEPNIVYTPLVPSQPHYNLNMQSISVNGQTLAIDPSVFGTSSS-QG 284
Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG-FPTVR 335
+IDSGT + +L + AY+ + + +R Y CY ++SS + FP V
Sbjct: 285 TIIDSGTTLAYLAEAAYDPFISAIT-SIVSPSVRPYLSKGNHCY--LISSSINDIFPQVS 341
Query: 336 FHFRGGAKLAL 346
+F GGA + L
Sbjct: 342 LNFAGGASMIL 352
>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
Length = 471
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 119/419 (28%), Positives = 177/419 (42%), Gaps = 50/419 (11%)
Query: 1 LIHRDSILS-PYNNPNENPAHRVQRGINISIARLAYLQEKIRSH--------NTYQAQIL 51
L+HRD S Y N + R++R + A L + K+ N + + ++
Sbjct: 63 LLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVVVASSDSRYEVNDFGSDVV 122
Query: 52 PSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSY 111
D + ++V I +G PP Q+ +D+GS ++WV C PC+ C Q +F P++S SY
Sbjct: 123 SGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSY 182
Query: 112 AYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
V C S C + C + C Y +Y G T ++ E LTF T V++
Sbjct: 183 TGVSCGSSVCDRIENSGC--HSGGCRYEVMYGDGSYTKGTLALETLTFAKT-----VVRN 235
Query: 172 VVFGCGFSTNRNFKFSGIFGLGIGRS-SLVSQLN----SSFSYCIGSLHDPDYLHNKLIL 226
V GCG F + G S S V QL+ +F YC+ S L+
Sbjct: 236 VAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDS--TGSLVF 293
Query: 227 GDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSG 282
G A+ PL YY+ L+ + V G + + +F ++G GGV++D+G
Sbjct: 294 GREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTG 353
Query: 283 TDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----PTVRFH 337
T VT L AY A RD + S CY DL GF PTV F+
Sbjct: 354 TAVTRLPTGAYAAFRDGFKSQTANLPRASGVSIFDTCY------DLSGFVSVRVPTVSFY 407
Query: 338 FRGGAKLALEKDSMFYQPRPDA----FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
F G L L + F P D+ F A +P + S+IG + Q+ V +D
Sbjct: 408 FTEGPVLTLPARN-FLMPVDDSGTYCFAFAASPTGL-------SIIGNIQQEGIQVSFD 458
>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 485
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 104/375 (27%), Positives = 163/375 (43%), Gaps = 40/375 (10%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYA 112
L++ I IG P + +DTGS +LWV+C C C + G T++ PS SSS
Sbjct: 78 TGLYFTQIGIGTPAKSYYVQVDTGSDILWVNCVFCDTCPRKSGLGIELTLYDPSGSSSGT 137
Query: 113 YVPCDSEHC-----RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT---DE 164
V C + C P +A C Y+ Y G T+ F T+ L + +
Sbjct: 138 GVTCGQDFCVATHGGVIPSCVPAA---PCQYSISYGDGSSTTGFFVTDFLQYNQVSGNSQ 194
Query: 165 STIHVQDVVFGCGFSTNRNFKFS-----GIFGLGIGRSSLVSQLNSS------FSYCIGS 213
+T+ + FGCG + S GI G G SS++SQL ++ F++C+
Sbjct: 195 TTLANTSITFGCGAKIGGDLGSSSQALDGILGFGQSNSSMLSQLAAAGKVRKVFAHCL-- 252
Query: 214 LHDPDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDD 272
D ++ I G ++ + TPL HY + LEAI V G L + NIF +
Sbjct: 253 ----DTINGGGIFAIGDVVQPKVSTTPLVPGMPHYNVNLEAIDVGGVKLQLPTNIFDIGE 308
Query: 273 SGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFP 332
S G +IDSGT + +L Y A+ +V + +++ D C+ S D GFP
Sbjct: 309 S-KGTIIDSGTTLAYLPGVVYNAIMSKVFAQYGDMPLKNDQ--DFQCFRYSGSVD-DGFP 364
Query: 333 TVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR-YVNFSLIGMMAQQFYNVGY 391
+ FHF GG L + +Q + +CM + + + L+G +A V Y
Sbjct: 365 IITFHFEGGLPLNIHPHDYLFQ-NGELYCMGFQTGGLQTKDGKDMVLLGDLAFSNRLVLY 423
Query: 392 DIGRKQKTFQRMDCE 406
D+ + + +C
Sbjct: 424 DLENQVIGWTDYNCS 438
>gi|413925432|gb|AFW65364.1| hypothetical protein ZEAMMB73_378208 [Zea mays]
Length = 418
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 110/405 (27%), Positives = 180/405 (44%), Gaps = 40/405 (9%)
Query: 17 NPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFT 76
P R + S RL+ L ++ + + AQ D + + S+G PP
Sbjct: 37 EPTINFTRAAHRSRERLSILATRLGAASAGSAQSPLQMDSGGGAYDMTFSMGTPPQTLSA 96
Query: 77 AMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYF---PYARCSAYK 133
DTGS L+W C C+ C+P+ +YP++SSS++ +PC S CR A C +
Sbjct: 97 LADTGSDLIWAKCGACKRCAPRGSASYYPTKSSSFSKLPCSSALCRTLESQSLATCGGTR 156
Query: 134 HR---CIYTQLYLIGPE----TSVFVSTEQLTFKNTDESTIHVQDVVFGC-GFSTNRNFK 185
R C Y Y + T ++ +E T + VQ + FGC S
Sbjct: 157 ARGAVCSYRYSYGLSSNPHHYTQGYMGSETFTLGSD-----AVQGIGFGCTTMSEGGYGS 211
Query: 186 FSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEG-DATPLQFI 243
SG+ GLG G+ SLV QL +FSYC+ S DP + L+ G GA+ G +TPL +
Sbjct: 212 GSGLVGLGRGKLSLVRQLKVGAFSYCLTS--DPS-TSSPLLFGAGALTGPGVQSTPLVNL 268
Query: 244 DGH--YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVM 301
Y + L++IS+ P + G++ DSGT +T+L + AY ++
Sbjct: 269 KTSTFYTVNLDSISIGAAK---TPGTGRH-----GIIFDSGTTLTFLAEPAYTLAEAGLL 320
Query: 302 IRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFC 361
+ + ++C+ +S FP++ HF GG +AL+ ++ F C
Sbjct: 321 SQTTNLTRVPGTDGYEVCFQ---TSGGAVFPSMVLHFDGG-DMALKTENYFGAVNDSVSC 376
Query: 362 MAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
V + S++G + Q Y++ YD+ + +FQ +C+
Sbjct: 377 WLVQKSP-----SEMSIVGNIMQMDYHIRYDLDKSVLSFQPTNCD 416
>gi|414878073|tpg|DAA55204.1| TPA: hypothetical protein ZEAMMB73_344109 [Zea mays]
Length = 440
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 118/422 (27%), Positives = 181/422 (42%), Gaps = 45/422 (10%)
Query: 15 NENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQ 74
N + R++R + RLA + E + ++Q + IG PP
Sbjct: 36 NCSTEERMRRATERTHRRLASMGEASAPVHWAESQ-----------YIAEYLIGDPPQQA 84
Query: 75 FTAMDTGSSLLWVHCYPCR--DCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAY 132
+DTGS+L+W C C+ C Q + + PSRS + V C+ C RC+
Sbjct: 85 EAIIDTGSNLIWTQCSTCQPAGCFSQNLSFYDPSRSRTARPVACNDTACALGSETRCARD 144
Query: 133 KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFK----FSG 188
C Y G V + TE TF+ E+ + FGC +T SG
Sbjct: 145 NKACAVLTAYGAGVIGGV-LGTEAFTFQPQSENV----SLAFGCIAATRLTPGSLDGASG 199
Query: 189 IFGLGIGRSSLVSQL-NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDA--TPLQFIDG 245
I GLG G SLVSQL ++ FSYC+ ++L +G A + G A T + F+
Sbjct: 200 IIGLGRGNLSLVSQLGDNKFSYCLTPYFSQSTNTSRLFVGASAGLSSGGAPATSVPFLKN 259
Query: 246 --------HYYITLEAISVDGRMLDINPNIFK-RDDSGG---GVMIDSGTDVTWLVKEAY 293
YY+ L I+V L + F R + G G +IDSG+ T LV AY
Sbjct: 260 PDVDPFSTFYYLPLTGITVGDAKLAVPEAAFDLRQVATGLWAGTLIDSGSPFTSLVDVAY 319
Query: 294 EALRDEVMIRLEGEQMRSYSWPD--KLCYHGIMSSDLKGFPTVRFHF-RGGAKLALEKDS 350
+ALRDE++ +L + + + LC K P + HF GG +A+ ++
Sbjct: 320 QALRDELVQQLGASIVPPPAGAEGLDLCAAVAHGDVGKLVPPLVLHFGSGGGDVAVPPEN 379
Query: 351 MFYQPRPDA-FCMAVNPASINNRYVNF---SLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
++ P D+ CM V + N + ++IG QQ ++ YD+ + +FQ DC
Sbjct: 380 -YWGPVDDSTACMVVFSSGGPNSTLPMNETTIIGNYMQQDMHLLYDLEKGMLSFQPADCS 438
Query: 407 VL 408
+
Sbjct: 439 SM 440
>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 103/377 (27%), Positives = 163/377 (43%), Gaps = 54/377 (14%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
N + + IG PP +DTGS++ +V C C C F P SS+Y + C+
Sbjct: 80 NGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESSSTYKPIKCN 139
Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
+ C + +C+Y + Y +S + + ++F N +S + Q VFGC
Sbjct: 140 ID-------CICDSDGVQCVYERQYAEMSTSSGVLGEDVISFGN--QSELIPQRAVFGCE 190
Query: 177 GFSTNRNF--KFSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLILGD 228
T F + GI GLG G SLV QL N SFS C G + +G
Sbjct: 191 NMETGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMD----------IGG 240
Query: 229 GAIIDEGDATPLQFI--------DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
GA++ G + P I +Y + L+ I V G+ L ++ IF D G ++D
Sbjct: 241 GAMVLGGISPPSDMIFTYSDPVRSPYYNVDLKEIHVAGKKLPLSSGIF---DGRYGAVLD 297
Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD----KLCYHGIMSSDLK---GFPT 333
SGT +L EA+ A +D +M + ++ PD +C+ G S + FPT
Sbjct: 298 SGTTYAYLPAEAFSAFKDAIMDEI--HSLKKIDGPDPNFKDICFSGAGSDAAELSNKFPT 355
Query: 334 VRFHFRGGAKLALEKDSMFYQPRP--DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
V F G KL+L ++ F++ A+C+ + N +L+G + + V Y
Sbjct: 356 VDMVFENGQKLSLTPENYFFRHSKVHGAYCLGI----FENGNDQTTLLGGIVVRNTLVMY 411
Query: 392 DIGRKQKTFQRMDCEVL 408
D + F + +C L
Sbjct: 412 DRANSKIGFWKTNCSEL 428
>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
Length = 529
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 98/376 (26%), Positives = 163/376 (43%), Gaps = 42/376 (11%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+++++ +G PP +DTGS L W+ C PC DC Q + P S+S+ + C+
Sbjct: 162 YFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNEAFYDPKTSASFKNITCNDPR 221
Query: 121 CRYF----PYARCSAYKHRCIYTQLYLIGPETS----VFVSTEQLTFKNTDESTIHVQDV 172
C P +C + C Y Y T+ V T LT S V+++
Sbjct: 222 CSLISSPEPPVQCKSDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGRSSEYKVENM 281
Query: 173 VFGCGFSTNRNFKFSGIFGLGIG-----------RSSLVSQLNSSFSYCIGSLHDPDYLH 221
+FGCG NR G+F G S L S SFSYC+ + +
Sbjct: 282 MFGCG-HWNR-----GLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVS 335
Query: 222 NKLILG-DGAIIDEGDATPLQFIDGH-------YYITLEAISVDGRMLDINPNIFK-RDD 272
+KLI G D +++ + F++G YYI +++I V G LDI + D
Sbjct: 336 SKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGEALDIPEETWNISPD 395
Query: 273 SGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD-KLCYH--GIMSSDLK 329
GG +IDSGT +++ + AYE ++++ +++ + +P C++ GI +++
Sbjct: 396 GAGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYLVFRDFPVLDPCFNVSGIEENNIH 455
Query: 330 GFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNV 389
P + F GA ++ F D C+A+ + FS+IG QQ +++
Sbjct: 456 -LPELGIAFADGAVWNFPAENSFIWLSEDLVCLAI----LGTPKSTFSIIGNYQQQNFHI 510
Query: 390 GYDIGRKQKTFQRMDC 405
YD + F C
Sbjct: 511 LYDTKMSRLGFTPTKC 526
>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 103/377 (27%), Positives = 163/377 (43%), Gaps = 54/377 (14%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
N + + IG PP +DTGS++ +V C C C F P SS+Y + C+
Sbjct: 80 NGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESSSTYKPIKCN 139
Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
+ C + +C+Y + Y +S + + ++F N +S + Q VFGC
Sbjct: 140 ID-------CICDSDGVQCVYERQYAEMSTSSGVLGEDVISFGN--QSELIPQRAVFGCE 190
Query: 177 GFSTNRNF--KFSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLILGD 228
T F + GI GLG G SLV QL N SFS C G + +G
Sbjct: 191 NMETGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMD----------IGG 240
Query: 229 GAIIDEGDATPLQFI--------DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
GA++ G + P I +Y + L+ I V G+ L ++ IF D G ++D
Sbjct: 241 GAMVLGGISPPSDMIFTYSDPVRSPYYNVDLKEIHVAGKKLPLSSGIF---DGRYGAVLD 297
Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD----KLCYHGIMSSDLK---GFPT 333
SGT +L EA+ A +D +M + ++ PD +C+ G S + FPT
Sbjct: 298 SGTTYAYLPAEAFSAFKDAIMDEI--HSLKKIDGPDPNFKDICFSGAGSDAAELSNKFPT 355
Query: 334 VRFHFRGGAKLALEKDSMFYQPRP--DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
V F G KL+L ++ F++ A+C+ + N +L+G + + V Y
Sbjct: 356 VDMVFENGQKLSLTPENYFFRHSKVHGAYCLGI----FENGNDQTTLLGGIVVRNTLVMY 411
Query: 392 DIGRKQKTFQRMDCEVL 408
D + F + +C L
Sbjct: 412 DRANSKIGFWKTNCSEL 428
>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
gi|224030089|gb|ACN34120.1| unknown [Zea mays]
gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
Length = 491
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 106/403 (26%), Positives = 175/403 (43%), Gaps = 49/403 (12%)
Query: 40 IRSHN-TYQAQILPSNDI---------SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC 89
+R+H+ T ++L + D+ L+Y I +G PP + +DTGS +LWV+C
Sbjct: 55 LRAHDGTRHGRLLAAADLPLGGLGLPTDTGLYYTEIKLGTPPKHYYVQVDTGSDILWVNC 114
Query: 90 YPCRDCSPQLG-----TIFYPSRSSSYAYVPCDSEHCRYF---PYARCSAYKHRCIYTQL 141
C C + G T++ P SS+ + V CD C +C A C Y+
Sbjct: 115 ITCEQCPHKSGLGLDLTLYDPKASSTGSMVMCDQAFCAATFGGKLPKCGA-NVPCEYSVT 173
Query: 142 YLIGPETSVFVSTEQLTFKNT---DESTIHVQDVVFGCGFST-----NRNFKFSGIFGLG 193
Y G T T+ L F ++ V+FGCG + N GI G G
Sbjct: 174 YGDGSSTIGSFVTDALQFDQVTRDGQTQPANASVIFGCGAQQGGDLGSSNQALDGILGFG 233
Query: 194 IGRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFIDGH 246
+S++SQL ++ F++C+ D + I G ++ + TPL H
Sbjct: 234 EANTSMLSQLTTAGKVKKIFAHCL------DTIKGGGIFSIGDVVQPKVKTTPLVADKPH 287
Query: 247 YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRL-- 304
Y + L+ I V G L + +IF+ + G +IDSGT +T+L + ++ EVM+ +
Sbjct: 288 YNVNLKTIDVGGTTLQLPAHIFEPGEK-KGTIIDSGTTLTYLPELVFK----EVMLAVFN 342
Query: 305 EGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAV 364
+ + + + LC+ S D GFPT+ FHF L + F+ D +C+
Sbjct: 343 KHQDITFHDVQGFLCFQYPGSVD-DGFPTITFHFEDDLALHVYPHEYFFANGNDVYCVGF 401
Query: 365 -NPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
N AS + + L+G + V YD+ + + +C
Sbjct: 402 QNGASQSKDGKDIVLMGDLVLSNKLVIYDLENRVIGWTDYNCS 444
>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
Length = 453
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 120/422 (28%), Positives = 180/422 (42%), Gaps = 55/422 (13%)
Query: 22 VQRGINISIARLAYLQEKIRSHNTYQAQILPSND----------ISNSLFYV-NISIGQP 70
++R + S AR A L +R+ + I + + S L YV ++++G P
Sbjct: 49 IRRAMQRSKARAAAL-SVVRNGGGFYGSIAQAREREREPGMAVRASGDLEYVLDLAVGTP 107
Query: 71 PVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCS 130
P P +DTGS L+W C C C Q +F P SSSY + C + C + C
Sbjct: 108 PQPITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMSSSYEPMRCAGQLCGDILHHSC- 166
Query: 131 AYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG-FSTNRNFKFSGI 189
C Y Y G T + +TE+ TF ++ T V + FGCG + SGI
Sbjct: 167 VRPDTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVP-LGFGCGTMNVGSLNNASGI 225
Query: 190 FGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEG---DAT-PLQFI- 243
G G SLVSQL+ FSYC+ P K L G++ D G DAT P+Q
Sbjct: 226 VGFGRDPLSLVSQLSIRRFSYCL----TPYASSRKSTLQFGSLADVGLYDDATGPVQTTP 281
Query: 244 -------DGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTW----LVKE 291
YY+ ++V R L I + F R D GGV+IDSGT +T ++ E
Sbjct: 282 ILQSAQNPTFYYVAFTGVTVGARRLRIPASAFALRPDGSGGVIIDSGTALTLFPAAVLAE 341
Query: 292 AYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLK-------GFPTVRFHFRGGAKL 344
A R ++ + S D +C+ + P + FHF+ GA L
Sbjct: 342 VVRAFRSQLRLPFA----NGSSPDDGVCFAAPAVAAGGGRMARQVAVPRMVFHFQ-GADL 396
Query: 345 ALEKDSMFYQP-RPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
L +++ + R C+ + + + + IG QQ V YD+ R+ +F +
Sbjct: 397 DLPRENYVLEDHRRGHLCVLLGDSGDDG-----ATIGNFVQQDMRVVYDLERETLSFAPV 451
Query: 404 DC 405
+C
Sbjct: 452 EC 453
>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 107/379 (28%), Positives = 165/379 (43%), Gaps = 51/379 (13%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
N V +++G PP +DTGS L W+HC SP LG++F P SS+Y+ VPC
Sbjct: 58 NVTLTVTLAVGSPPQNISMVLDTGSELSWLHCKK----SPNLGSVFNPVSSSTYSPVPCS 113
Query: 118 SEHCRY----FPY-ARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
S CR P A C H C Y TS+ + TF ++
Sbjct: 114 SPICRTRTRDLPIPASCDPKTHFCHVAISY--ADATSIEGNLAHDTFV---IGSVTRPGT 168
Query: 173 VFGC---GFSTN--RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLIL 226
+FGC G S++ + K +G+ G+ G S V+QL S FSYCI L+L
Sbjct: 169 LFGCMDSGLSSDSEEDAKSTGLMGMNRGSLSFVNQLGFSKFSYCISGSDS----SGILLL 224
Query: 227 GDGAIIDEG---------DATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
GD + G TPL + D Y + LE I V ++L + ++F D +G G
Sbjct: 225 GDASYSWLGPIQYTPLVLQTTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAG 284
Query: 277 -VMIDSGTDVTWLVKEAYEALRDEVM------IRLEGEQMRSYSWPDKLCYHGIMSS--D 327
M+DSGT T+L+ Y AL++E + +R+ + + LCY S+ +
Sbjct: 285 QTMVDSGTQFTFLMGPVYTALKNEFIAQTKSVLRIVDDPNFVFQGTMDLCYRVGSSTRPN 344
Query: 328 LKGFPTVRFHFRG------GAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGM 381
G P + FRG G KL + + + + +C + + + +IG
Sbjct: 345 FTGLPVISLMFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLG--IEAFVIGH 402
Query: 382 MAQQFYNVGYDIGRKQKTF 400
QQ + +D+ + + F
Sbjct: 403 HHQQNVWMEFDLAKSRVGF 421
>gi|125558632|gb|EAZ04168.1| hypothetical protein OsI_26310 [Oryza sativa Indica Group]
gi|125600539|gb|EAZ40115.1| hypothetical protein OsJ_24558 [Oryza sativa Japonica Group]
Length = 453
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 111/378 (29%), Positives = 171/378 (45%), Gaps = 31/378 (8%)
Query: 54 NDISNSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSY 111
D+ N Y+ ++IG PP DTGS L+W C PC + C Q ++ PS S ++
Sbjct: 84 KDLPNGGEYIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTF 143
Query: 112 AYVPCDSEHCRYFPYARCSAYKH----RCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI 167
+PC S AR + C Y Q Y G TS +E TF ++ +
Sbjct: 144 RVLPCSSALNLCAAEARLAGATPPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQV 202
Query: 168 HVQDVVFGCGFSTNRNFKFSG-IFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNKLI 225
V + FGC +++ ++ S + GLG G SLVSQL + FSYC+ D + L+
Sbjct: 203 RVPGIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCLTPFQDTKS-KSTLL 261
Query: 226 LGDGAIIDEGDATPLQF-----------IDGHYYITLEAISVDGRMLDINPNIFK-RDDS 273
LG A + T ++ + +YY+ L ISV L I P F R D
Sbjct: 262 LGPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGAAALPIPPGAFALRADG 321
Query: 274 GGGVMIDSGTDVTWLVKEAYEALRDEV--MIRLEGEQMRSYSWPDKLCYHGIMSSDLKG- 330
GG++IDSGT +T LV AY+ +R V +++L + + D LC+ SS
Sbjct: 322 TGGLIIDSGTTITSLVDAAYKRVRAAVRSLVKLPVTDGSNATGLD-LCFALPSSSAPPAT 380
Query: 331 FPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
P++ HF GGA + L ++ +C+A+ + S +G QQ ++
Sbjct: 381 LPSMTLHFGGGADMVLPVENYMIL-DGGMWCLAMR----SQTDGELSTLGNYQQQNLHIL 435
Query: 391 YDIGRKQKTFQRMDCEVL 408
YD+ ++ +F C L
Sbjct: 436 YDVQKETLSFAPAKCSTL 453
>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 453
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 120/422 (28%), Positives = 180/422 (42%), Gaps = 55/422 (13%)
Query: 22 VQRGINISIARLAYLQEKIRSHNTYQAQILPSND----------ISNSLFYV-NISIGQP 70
++R + S AR A L +R+ + I + + S L YV ++++G P
Sbjct: 49 IRRAMQRSKARAAAL-SVVRNGGGFYGSIAQAREREREPGMAVRASGDLEYVLDLAVGTP 107
Query: 71 PVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCS 130
P P +DTGS L+W C C C Q +F P SSSY + C + C + C
Sbjct: 108 PQPITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMSSSYEPMRCAGQLCGDILHHSC- 166
Query: 131 AYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG-FSTNRNFKFSGI 189
C Y Y G T + +TE+ TF ++ T V + FGCG + SGI
Sbjct: 167 VRPDTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVP-LGFGCGTMNVGSLNNASGI 225
Query: 190 FGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEG---DAT-PLQFI- 243
G G SLVSQL+ FSYC+ P K L G++ D G DAT P+Q
Sbjct: 226 VGFGRDPLSLVSQLSIRRFSYCL----TPYASSRKSTLQFGSLADVGLYDDATGPVQTTP 281
Query: 244 -------DGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTW----LVKE 291
YY+ ++V R L I + F R D GGV+IDSGT +T ++ E
Sbjct: 282 ILQSAQNPTFYYVAFTGVTVGARRLRIPASAFALRPDGSGGVIIDSGTALTLFPVAVLAE 341
Query: 292 AYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLK-------GFPTVRFHFRGGAKL 344
A R ++ + S D +C+ + P + FHF+ GA L
Sbjct: 342 VVRAFRSQLRLPFA----NGSSPDDGVCFAAPAVAAGGGRMARQVAVPRMVFHFQ-GADL 396
Query: 345 ALEKDSMFYQP-RPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
L +++ + R C+ + + + + IG QQ V YD+ R+ +F +
Sbjct: 397 DLPRENYVLEDHRRGHLCVLLGDSGDDG-----ATIGNFVQQDMRVVYDLERETLSFAPV 451
Query: 404 DC 405
+C
Sbjct: 452 EC 453
>gi|357113696|ref|XP_003558637.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 432
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 105/399 (26%), Positives = 164/399 (41%), Gaps = 38/399 (9%)
Query: 31 ARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCY 90
ARL +L K S A + ++ S + V +G P P A+DT + W HC
Sbjct: 49 ARLLFLSSKAASTGVSSAPV--ASGQSPPSYVVRAGLGSPAQPILLALDTSADATWAHCS 106
Query: 91 PCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARC--------SAYKHRCIYTQLY 142
PC C P G++F P+ S+SYA +PC S C C SA C +T+ +
Sbjct: 107 PCGTC-PSSGSLFAPANSTSYAPLPCSSTMCTVLQGQPCPAQDPYDSSAPLPMCAFTKPF 165
Query: 143 LIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTN---RNFKFSGIFGLGIGRSSL 199
+ S K+ + + FGC + + N G+ GLG G +L
Sbjct: 166 ADASFQASLASDWLHLGKDA------IPNYAFGCVSAVSGPTANLPKQGLLGLGRGPMAL 219
Query: 200 VSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH---YYITLE 252
+SQ+ N FSYC+ S + Y L LG TP+ YY+ +
Sbjct: 220 LSQVGNMYNGVFSYCLPS-YKSYYFSGSLRLGAAGQPRGVRYTPMLKNPNRSSLYYVNVT 278
Query: 253 AISVDGRMLDINPNIFKRDD-SGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS 311
+SV + + F D +G G ++DSGT +T Y ALR+E + +
Sbjct: 279 GLSVGRAPVKVPAGSFAFDPATGAGTVVDSGTVITRWTPPVYAALREEFRRHVAAPSGYT 338
Query: 312 YSWPDKLCYHGIMSSDLKG--FPTVRFHFRGGAKLALE-KDSMFYQPRPDAFCMAVNPAS 368
C++ + ++ P V H GG LAL ++++ + C+A+ A
Sbjct: 339 SLGAFDTCFN---TDEVAAGVAPAVTVHMDGGLDLALPMENTLIHSSATPLACLAMAEAP 395
Query: 369 IN-NRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
N N VN ++ + QQ V +D+ + F R C
Sbjct: 396 QNVNAVVN--VLANLQQQNLRVVFDVANSRVGFARESCN 432
>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
Length = 2819
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 97/333 (29%), Positives = 150/333 (45%), Gaps = 44/333 (13%)
Query: 52 PSNDIS---NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRS 108
PSN +S N V++++G PP +DTGS L W+HC SP L ++F P S
Sbjct: 988 PSNKLSFHHNVTLTVSLTVGSPPQQVTMVLDTGSELSWLHCKK----SPNLTSVFNPLSS 1043
Query: 109 SSYAYVPCDSEHCRY----FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDE 164
SSY+ +PC S CR P K C Y ++++ ++
Sbjct: 1044 SSYSPIPCSSPICRTRTRDLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRIGSSA- 1102
Query: 165 STIHVQDVVFGC---GFSTN--RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPD 218
+ +FGC GFS+N + K +G+ G+ G S V+QL FSYCI
Sbjct: 1103 ----LPGTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSYCISGRDSSG 1158
Query: 219 YLHNKLILGDGAIIDEGD---------ATPLQFIDG-HYYITLEAISVDGRMLDINPNIF 268
L + GD + G+ +TPL + D Y + L+ I V ++L + +IF
Sbjct: 1159 VL----LFGDLHLSWLGNLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIF 1214
Query: 269 KRDDSGGG-VMIDSGTDVTWLVKEAYEALRDEVMIRLE------GEQMRSYSWPDKLCYH 321
D +G G M+DSGT T+L+ Y ALR+E + + + G+ + LCY
Sbjct: 1215 APDHTGAGQTMVDSGTQFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYS 1274
Query: 322 GIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQ 354
L P+V FR GA++ + + + Y+
Sbjct: 1275 VAAGGKLPTLPSVSLMFR-GAEMVVGGEVLLYR 1306
>gi|224140237|ref|XP_002323490.1| predicted protein [Populus trichocarpa]
gi|222868120|gb|EEF05251.1| predicted protein [Populus trichocarpa]
Length = 478
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 94/313 (30%), Positives = 147/313 (46%), Gaps = 31/313 (9%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
L++ + +G PP +DTGS +LWV C C +C G F S SS+ V
Sbjct: 65 LYFTKVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGLV 124
Query: 115 PCDSEHCR---YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
C C +CS ++C YT Y G TS + ++ L F ++ V
Sbjct: 125 HCSDPICTSAVQTTVTQCSPQTNQCSYTFQYEDGSGTSGYYVSDTLYFDAILGESLVVNS 184
Query: 172 ---VVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNS------SFSYCI-GSLHD 216
+VFGC G T + GIFG G G S++SQL++ FS+C+ G
Sbjct: 185 SALIVFGCSTFQSGDLTMTDKAVDGIFGFGQGELSVISQLSTHGITPRVFSHCLKGEGIG 244
Query: 217 PDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
L IL G + +PL HY + L++I+V+G++L I+P++F +S G
Sbjct: 245 GGILVLGEILEPGMVY-----SPLVPSQPHYNLNLQSIAVNGKLLPIDPSVFATSNS-QG 298
Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRF 336
++DSGT + +LV EAY+ V + + S ++ CY + +S + FP F
Sbjct: 299 TIVDSGTTLAYLVAEAYDPFVSAVNVIVSPSVTPIISKGNQ-CYL-VSTSVSQMFPLASF 356
Query: 337 HFRGGAKLALEKD 349
+F GGA + L+ +
Sbjct: 357 NFAGGASMVLKPE 369
>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
Length = 458
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 109/391 (27%), Positives = 169/391 (43%), Gaps = 36/391 (9%)
Query: 32 RLAYLQEKIRSHNTYQ---AQILPSN---DISNSLFYVNISIGQPPVPQFTAMDTGSSLL 85
R AY++ K + A +P+ +S + + + IG P V Q +MDTGS +
Sbjct: 87 RAAYIKRKFSGAGDIEQSDAATVPTTLGTSLSTLEYVITVGIGSPAVTQTMSMDTGSDVS 146
Query: 86 WVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYAR--CSAYKHRCIYTQLYL 143
WV C PC C ++ ++F PS SS+Y+ C S C ++ +C Y Y
Sbjct: 147 WVQCKPCSQCHSEVDSLFDPSSSSTYSPFSCSSAPCAQLSQSQEGNGCMSSQCQYIVNYG 206
Query: 144 IGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF--KFSGIFGLGIGRSSLVS 201
T+ S++ LT ++ + D FGC S + F + G+ GLG G SL S
Sbjct: 207 DSSSTTGTYSSDTLTLGSS-----AMTDFQFGCSQSESGGFNDQTDGLMGLGGGAQSLAS 261
Query: 202 Q----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLEAI 254
Q ++FSYC L L LG G+ TP+ I +Y + LE+I
Sbjct: 262 QTAGTFGTAFSYC---LPPTSGSSGFLTLGTGS--SGFVKTPMLRSTQIPTYYVVLLESI 316
Query: 255 SVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSW 314
V + L++ ++F G ++DSGT +T L AY AL ++ + S
Sbjct: 317 KVGSQQLNLPTSVFS-----AGSLMDSGTIITRLPPTAYSALSSAFKAGMQQYPPATPSG 371
Query: 315 PDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYV 374
C+ S + PTV F GGA + L D + + C+A P N
Sbjct: 372 ILDTCFDFSGQSSIS-IPTVTLVFSGGAAVDLAFDGIMLEISSSIRCLAFTP---NGDDS 427
Query: 375 NFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ +IG + Q+ + V YD+G F+ C
Sbjct: 428 SLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 458
>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 481
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 110/389 (28%), Positives = 166/389 (42%), Gaps = 56/389 (14%)
Query: 13 NPNENPAHRVQRGINISIARLAYLQEKIRSHNT-YQAQILPSNDI---------SNSLFY 62
N N N V R + LA I++H+ + + L D+ SN L+Y
Sbjct: 22 NANANLVFPVVRKFKGPVENLA----AIKAHDAGRRGRFLSVVDVALGGNGRPTSNGLYY 77
Query: 63 VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYVPCD 117
I +G P + +DTGS LWV+C C C + G T++ P+ S + VPCD
Sbjct: 78 TKIGLG--PKDYYVQVDTGSDTLWVNCVGCTACPKKSGLGMDLTLYDPNLSKTSKAVPCD 135
Query: 118 SEHCRYFPYARCSAYKH--RCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD---V 172
E C + S C Y+ Y G TS + LTF V D V
Sbjct: 136 DEFCTSTYDGQISGCTKGMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLRTVPDNTSV 195
Query: 173 VFGCG------FSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYL 220
+FGCG S+ + GI G G SS++SQL ++ FS+C+ D +
Sbjct: 196 IFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRIFSHCL------DSI 249
Query: 221 HNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
I G ++ + TPL HY + L+ I V G + + +I SG G +I
Sbjct: 250 SGGGIFAIGEVVQPKVKTTPLLQGMAHYNVVLKDIEVAGDPIQLPSDILDS-SSGRGTII 308
Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL-CYHGIMSSDLKG----FPTV 334
DSGT + +L Y+ L ++++ + G M+ Y D+ C+H SD + FPTV
Sbjct: 309 DSGTTLAYLPVSIYDQLLEKILAQRSG--MKLYLVEDQFTCFH---YSDEESVDDLFPTV 363
Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMA 363
+F F G L + + D +C+
Sbjct: 364 KFTFEEGLTLTTYPRDYLFLFKEDMWCVG 392
>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
Length = 494
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 95/326 (29%), Positives = 143/326 (43%), Gaps = 32/326 (9%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLG---TIFYPSRSSSYAYV 114
L++ I IG P + +DTGS +LWV+C C C LG T++ P S S V
Sbjct: 89 LYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELV 148
Query: 115 PCDSEHC--RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD---ESTIHV 169
CD + C Y C Y+ Y G T+ F T+ L + ++T
Sbjct: 149 TCDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTPAN 208
Query: 170 QDVVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPD 218
V FGCG + N GI G G SS++SQL ++ F++C+ D
Sbjct: 209 ASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCL------D 262
Query: 219 YLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
++ I G ++ + TPL HY + L+ I V G L + NIF +S G
Sbjct: 263 TVNGGGIFAIGNVVQPKVKTTPLVPDMPHYNVILKGIDVGGTALGLPTNIFDSGNS-KGT 321
Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFH 337
+IDSGT + ++ + Y+AL M+ + + + + D C+ S D GFP V FH
Sbjct: 322 IIDSGTTLAYVPEGVYKALF--AMVFDKHQDISVQTLQDFSCFQYSGSVD-DGFPEVTFH 378
Query: 338 FRGGAKLALEKDSMFYQPRPDAFCMA 363
F G L + +Q + +CM
Sbjct: 379 FEGDVSLIVSPHDYLFQNGKNLYCMG 404
>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
Length = 525
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 101/359 (28%), Positives = 149/359 (41%), Gaps = 33/359 (9%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYVPCDSE 119
+ V I +G P DTGS WV C PC C Q +F P+RSS+ A + C +
Sbjct: 186 YVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYEQQEKLFDPARSSTDANISCAAP 245
Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
C CS C+Y Y G + F + + LT + D ++ FGCG
Sbjct: 246 ACSDLYTKGCSG--GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDA----IKGFRFGCGER 299
Query: 180 TNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHD-PDYLHNKLILGDGAIID 233
F + +G+ GLG G++SL Q F++C + YL G +
Sbjct: 300 NEGLFGEAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFPARSSGTGYL--DFGPGSSPAVS 357
Query: 234 EGDATPLQFIDG--HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKE 291
TP+ +G YY+ L I V G++L I P++F G ++DSGT +T L
Sbjct: 358 TKLTTPMLVDNGLTFYYVGLTGIRVGGKLLSIPPSVFTT----AGTIVDSGTVITRLPPA 413
Query: 292 AYEALRDEVMIRLEGEQMRSYSWPDKL-----CYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
AY +LR + R Y L CY S + PTV F+GGA L +
Sbjct: 414 AYSSLRSAFASAIA---ARGYKKAPALSLLDTCYDFTGMSQVA-IPTVSLLFQGGASLDV 469
Query: 347 EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ + Y C+ + N + ++G + + V YDIG+K F C
Sbjct: 470 DASGIIYAASVSQACLGF---AANEEDDDVGIVGNTQLKTFGVVYDIGKKVVGFSPGAC 525
>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 394
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 95/318 (29%), Positives = 143/318 (44%), Gaps = 42/318 (13%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
N + I IG PP +DTGS++ +V C C C F P SS+Y V C+
Sbjct: 87 NGYYTTRIWIGTPPQTFALIVDTGSTVTYVPCSTCEQCGRHQDPKFEPELSSTYQPVSCN 146
Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
+ C + +C+Y + Y +S + + ++F N +S + Q +FGC
Sbjct: 147 ID-------CTCDNERKQCVYERQYAEMSSSSGVLGEDIISFGN--QSELVPQRAIFGCE 197
Query: 178 FSTNRNF---KFSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLILG- 227
+ + GI GLG G S+V QL + SFS C G + D +ILG
Sbjct: 198 NQETGDLYSQRADGIMGLGRGDLSIVDQLVEKGVISDSFSLCYGGM---DIGGGAMILGG 254
Query: 228 ----DGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGT 283
G + E D Q+ Y I L+AI V G+ L ++P+IF D G ++DSGT
Sbjct: 255 ISPPSGMVFAESDPVRSQY----YNIDLKAIHVAGKQLHLDPSIF---DGKHGTVLDSGT 307
Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD----KLCYHGIMS--SDLKG-FPTVRF 336
+L + A+ A +D +M L ++ PD +C+ G S S L FP V
Sbjct: 308 TYAYLPEAAFTAFKDAMMKEL--TSLKQIHGPDPNYNDICFSGAESDVSQLSNTFPAVEM 365
Query: 337 HFRGGAKLALEKDSMFYQ 354
F G KL+L ++ +Q
Sbjct: 366 VFSNGQKLSLSPENYLFQ 383
>gi|242085924|ref|XP_002443387.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
gi|241944080|gb|EES17225.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
Length = 460
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 112/371 (30%), Positives = 163/371 (43%), Gaps = 33/371 (8%)
Query: 67 IGQPPVPQFTAMDTGSSLLWVHCYPCR--DCSPQLGTIFYPSRSSSYAYVPCDSEHCRYF 124
IG PP +DTGS+L+W C CR C Q T + PSRS + V C+ C
Sbjct: 90 IGDPPQQAAAIIDTGSNLIWTQCSTCRANGCFGQDLTFYDPSRSRTAKPVACNDTACLLG 149
Query: 125 PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF 184
RC+ C Y G F+ TE TF + S +V + FGC ++
Sbjct: 150 SETRCARDGKACAVLTAYGAG-AIGGFLGTEVFTFGHGQSSENNVS-LAFGCITASRLTP 207
Query: 185 K----FSGIFGLGIGRSSLVSQL-NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGD--A 237
SGI GLG G+ SL SQL ++ FSYC+ + L +G A + G A
Sbjct: 208 GSLDGASGIIGLGRGKLSLPSQLGDNKFSYCLTPYFSDAANTSTLFVGASAGLSGGGAPA 267
Query: 238 TPLQFI--------DGHYYITLEAISVDGRMLDINPNIFKRDDSG----GGVMIDSGTDV 285
T + F+ D YY+ L I+V LD+ F + GG +IDSG+
Sbjct: 268 TSVPFLKNPDDDPFDSFYYLPLTGITVGTAKLDVPAAAFDLREVAPAKWGGTLIDSGSPF 327
Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPD--KLCYHGIMSSDL-KGFPTVRFHFRGGA 342
T L+ AY+ALRDE++ +L + + + LC G+ D K P + HF G
Sbjct: 328 TSLIDVAYQALRDELVRQLGASVVPPPAGAEGLDLCVGGVAPGDAGKLVPPLVLHFGSGG 387
Query: 343 KLALE---KDSMFYQPRPDA-FCMAVNPASINNRYVNF---SLIGMMAQQFYNVGYDIGR 395
+ ++ P D+ CM V + N + ++IG QQ ++ YD+G+
Sbjct: 388 GGGGDVVVPPENYWGPVDDSTACMVVFSSGGPNSTLPLNETTIIGNYMQQDMHLLYDLGQ 447
Query: 396 KQKTFQRMDCE 406
+FQ DC
Sbjct: 448 GVLSFQPADCS 458
>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
Length = 489
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 94/347 (27%), Positives = 158/347 (45%), Gaps = 41/347 (11%)
Query: 51 LPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYP 105
LP++ L++ I +G PP + +DTGS +LWV+C C C + G T + P
Sbjct: 77 LPTD---TGLYFTEIKLGTPPKRYYVQVDTGSDILWVNCISCEKCPRKSGLGLDLTFYDP 133
Query: 106 SRSSSYAYVPCDSEHCRYFPYAR---CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT 162
SSS + V CD C + C+A C Y+ +Y G T+ F T+ L F
Sbjct: 134 KASSSGSTVSCDQGFCAATYGGKLPGCTA-NVPCEYSVMYGDGSSTTGFFVTDALQFDQV 192
Query: 163 ---DESTIHVQDVVFGCGFST-----NRNFKFSGIFGLGIGRSSLVSQLNSS------FS 208
++ V FGCG + N GI G G +S++SQL ++ F+
Sbjct: 193 TGDGQTQPGNATVTFGCGAQQGGDLGSSNQALDGILGFGQANTSMLSQLAAAGKVKKIFA 252
Query: 209 YCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNI 267
+C+ D + I G ++ + TPL HY + L++I V G L + ++
Sbjct: 253 HCL------DTIKGGGIFAIGNVVQPKVKTTPLVADMPHYNVNLKSIDVGGTTLQLPAHV 306
Query: 268 FKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRL--EGEQMRSYSWPDKLCYHGIMS 325
F+ + G +IDSGT +T+L + ++ EVM + + + + ++ D +C+ S
Sbjct: 307 FETGER-KGTIIDSGTTLTYLPELVFK----EVMAAIFNKHQDIVFHNVQDFMCFQYPGS 361
Query: 326 SDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR 372
D GFPT+ FHF L + F+ D +C+ ++ ++
Sbjct: 362 VD-DGFPTITFHFEDDLALHVYPHEYFFPNGNDMYCVGFQNGALQSK 407
>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 506
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 94/321 (29%), Positives = 147/321 (45%), Gaps = 34/321 (10%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
L++ + +G P F +DTGS +LWV C PC C G F P SS+ + +
Sbjct: 88 LYFTRVKLGNPAKEYFVQIDTGSDILWVACSPCTGCPTSSGLNIQLEFFNPDSSSTSSRI 147
Query: 115 PCDSEHCRYF---PYARCSAY---KHRCIYTQLYLIGPETSVFVSTEQLTFKNT---DES 165
PC + C A C + C YT Y G TS F ++ + F +++
Sbjct: 148 PCSDDRCTAALQTGEAVCQSSDSPSSPCGYTFTYGDGSGTSGFYVSDTMYFDTVMGNEQT 207
Query: 166 TIHVQDVVFGCGFSTNRNF-----KFSGIFGLGIGRSSLVSQLNS------SFSYCIGSL 214
VVFGC S + + GIFG G + S+VSQL S +FS+C L
Sbjct: 208 ANSSASVVFGCSNSQSGDLMKTDRAVDGIFGFGQHQLSVVSQLYSLGVSPKTFSHC---L 264
Query: 215 HDPDYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDS 273
D L+LG+ I++ G TPL HY + LE+I+V G+ L I+ ++F ++
Sbjct: 265 KGSDNGGGILVLGE--IVEPGLVFTPLVPSQPHYNLNLESIAVSGQKLPIDSSLFATSNT 322
Query: 274 GGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPT 333
G ++DSGT + +LV AY+ + + + S + C+ S D FPT
Sbjct: 323 -QGTIVDSGTTLVYLVDGAYDPFINAIAAAVSPSVRSVVSKGIQ-CFVTTSSVD-SSFPT 379
Query: 334 VRFHFRGGAKLALEKDSMFYQ 354
+F+GG + ++ ++ Q
Sbjct: 380 ATLYFKGGVSMTVKPENYLLQ 400
>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
Length = 463
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 113/427 (26%), Positives = 190/427 (44%), Gaps = 49/427 (11%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHN-TYQAQILPSN----- 54
L+HR +P+ + PA + R+ + + RS N T + + S+
Sbjct: 65 LVHRFGPCNPHRT-STAPASSFNEILRRDKLRVDSIIQARRSMNLTSSVEHMKSSVPFYG 123
Query: 55 --DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYA 112
I+ S + VN+ IG P DTGS L+W C PC+ C P++ +F P++S+S+
Sbjct: 124 LSKITASDYIVNVGIGTPKKEMPLIFDTGSGLIWTQCKPCKACYPKV-PVFDPTKSASFK 182
Query: 113 YVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
+PC S+ C+ R +C Y Y+ ++ ++TE ++F + +++
Sbjct: 183 GLPCSSKLCQSI---RQGCSSPKCTYLTAYVDNSSSTGTLATETISFSHLK---YDFKNI 236
Query: 173 VFGCGFS-TNRNFKFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILG 227
+ GC + + SGI GL SL SQ + FSYCI S L G
Sbjct: 237 LIGCSDQVSGESLGESGIMGLNRSPISLASQTANIYDKLFSYCIPSTPGS---TGHLTFG 293
Query: 228 DGAIIDEGDATPLQFI--DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDV 285
G + ++ +P+ Y I + ISV GR L I+ + FK + IDSG +
Sbjct: 294 -GKVPNDVRFSPVSKTAPSSDYDIKMTGISVGGRKLLIDASAFKIAST-----IDSGAVL 347
Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK-----LCYHGIMSSDLKGFPTVRFHFRG 340
T L +AY ALR + R E M+ Y D+ CY S + P++ F G
Sbjct: 348 TRLPPKAYSALRS--VFR---EMMKGYPLLDQDDFLDTCYDFSNYSTV-AIPSISVFFEG 401
Query: 341 GAKLALEKDSMFYQ-PRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKT 399
G ++ ++ + +Q P +C+A A +++ S+ G Q+ Y V +D +++
Sbjct: 402 GVEMDIDVSGIMWQVPGSKVYCLAF--AELDDE---VSIFGNFQQKTYTVVFDGAKERIG 456
Query: 400 FQRMDCE 406
F C+
Sbjct: 457 FAPGGCD 463
>gi|414887402|tpg|DAA63416.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
Length = 407
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 102/373 (27%), Positives = 166/373 (44%), Gaps = 61/373 (16%)
Query: 19 AHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAM 78
A R+ + + A+ ++R H+ ++N + + IG PP +
Sbjct: 56 ASRLAASLRRGLGDGAHPNARMRLHDDL---------LTNGYYTTRLYIGTPPQEFALIV 106
Query: 79 DTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIY 138
D+GS++ +V C C C F P SSSY+ V C+ + C + K +C Y
Sbjct: 107 DSGSTVTYVPCASCEQCGNHQDPRFQPDLSSSYSPVKCNVD-------CTCDSDKKQCTY 159
Query: 139 TQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFS----GIFGLGI 194
+ Y +S + + ++F ES + Q VFGC S + FS GI GLG
Sbjct: 160 ERQYAEMSSSSGVLGEDIVSFGR--ESELKAQRAVFGCENSETGDL-FSQHADGIMGLGR 216
Query: 195 GRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFI----- 243
G+ S++ QL N SFS C G + +G GA++ G TP +
Sbjct: 217 GQLSIMDQLVEKGVINDSFSLCYGGMD----------IGGGAMVLGGVPTPSDMVFSRSD 266
Query: 244 ---DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEV 300
+Y I L+ I V G+ L ++ IF DS G ++DSGT +L ++A+ A +D V
Sbjct: 267 PLRSPYYNIELKEIHVAGKALRVDSRIF---DSKHGTVLDSGTTYAYLPEQAFMAFKDAV 323
Query: 301 MIRLEGEQMRSYSWPD----KLCYHGI---MSSDLKGFPTVRFHFRGGAKLALEKDS-MF 352
++ ++ PD +C+ G +S + FP V F G KL+L ++ +F
Sbjct: 324 TSKV--HSLKKIRGPDPSYKDICFAGARRNVSKLHEVFPDVDMVFGNGQKLSLTPENYLF 381
Query: 353 YQPRPD-AFCMAV 364
+ D A+C+ V
Sbjct: 382 RHSKVDGAYCLGV 394
>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
Length = 477
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 107/385 (27%), Positives = 151/385 (39%), Gaps = 46/385 (11%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCY-PCR--------DCSPQLGTIFYPSRSSSY 111
++V +G P P DTGS L WV C P D P G F P S ++
Sbjct: 97 YFVRFRVGTPAQPFLLVADTGSDLTWVKCRRPASANSSLSPADSGPGPGRAFRPEDSRTW 156
Query: 112 AYVPCDSEHCRY---FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLT--FKNTDEST 166
A + C S+ C F A C C Y Y G V TE T +E
Sbjct: 157 APISCASDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSGREERK 216
Query: 167 IHVQDVVFGCGFS-TNRNFKFS-GIFGLGIG----RSSLVSQLNSSFSYCIGSLHDPDYL 220
++ +V GC S T +F+ S G+ LG S S+ FSYC+ P
Sbjct: 217 AKLKGLVLGCSSSYTGPSFEASDGVLSLGYSGISFASHAASRFGGRFSYCLVDHLSPRNA 276
Query: 221 HNKLILGDGAIID--------------EGDATPL---QFIDGHYYITLEAISVDGRMLDI 263
+ L G + TPL + + Y ++L+AISV G L I
Sbjct: 277 TSYLTFGPNPAVSSPRASPSSCAAAAPRARQTPLLLDRRMRPFYDVSLKAISVAGEFLKI 336
Query: 264 NPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGI 323
P ++GGGV++DSGT +T L K AY A+ + L G + P + CY+
Sbjct: 337 -PRAVWDVEAGGGVILDSGTSLTVLAKPAYRAVVAALSKGLAGLPRVTMD-PFEYCYNWT 394
Query: 324 MSSDLK---GFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIG 380
S P + HF G A+L S P C+ + + S+IG
Sbjct: 395 SPSGKDADVAVPKMAVHFAGAARLEPPGKSYVIDAAPGVKCIGLQ----EGPWPGISVIG 450
Query: 381 MMAQQFYNVGYDIGRKQKTFQRMDC 405
+ QQ + +DI ++ FQR C
Sbjct: 451 NILQQEHLWEFDIKNRRLKFQRSRC 475
>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
Length = 514
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 104/371 (28%), Positives = 161/371 (43%), Gaps = 27/371 (7%)
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
+ + + V++ +G PP MDTGS L W+ C PC DC Q G +F P+ S SY V
Sbjct: 147 VGSGEYLVDLYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPATSLSYRNVT 206
Query: 116 CDSEHCRYF-----PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDE-STIHV 169
C C P A + C Y Y T+ ++ E T T ++ V
Sbjct: 207 CGDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAPGASRRV 266
Query: 170 QDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKL 224
DVVFGCG S F +G+ GLG G S SQL + +FSYC+ + + +K+
Sbjct: 267 DDVVFGCGHSNRGLFHGAAGLLGLGRGALSFASQLRAVYGHAFSYCL--VDHGSSVGSKI 324
Query: 225 ILGDGAII--------DEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGG 275
+ GD + + D YY+ L+ + V G L+I+P+ + D G
Sbjct: 325 VFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVGKDGSG 384
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
G +IDSGT +++ + AYE +R + R++ +P + + + P
Sbjct: 385 GTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSPCYNVSGVERVEVPEFS 444
Query: 336 FHFRGGAKLALEKDSMFYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
F GA ++ F + PD C+AV + S+IG QQ ++V YD+
Sbjct: 445 LLFADGAVWDFPAENYFVRLDPDGIMCLAV----LGTPRSAMSIIGNFQQQNFHVLYDLQ 500
Query: 395 RKQKTFQRMDC 405
+ F C
Sbjct: 501 NNRLGFAPRRC 511
>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 514
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 104/371 (28%), Positives = 161/371 (43%), Gaps = 27/371 (7%)
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
+ + + V++ +G PP MDTGS L W+ C PC DC Q G +F P+ S SY V
Sbjct: 147 VGSGEYLVDLYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASLSYRNVT 206
Query: 116 CDSEHCRYF-----PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDE-STIHV 169
C C P A + C Y Y T+ ++ E T T ++ V
Sbjct: 207 CGDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAPGASRRV 266
Query: 170 QDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKL 224
DVVFGCG S F +G+ GLG G S SQL + +FSYC+ + + +K+
Sbjct: 267 DDVVFGCGHSNRGLFHGAAGLLGLGRGALSFASQLRAVYGHAFSYCL--VDHGSSVGSKI 324
Query: 225 ILGDGAII--------DEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGG 275
+ GD + + D YY+ L+ + V G L+I+P+ + D G
Sbjct: 325 VFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVGKDGSG 384
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
G +IDSGT +++ + AYE +R + R++ +P + + + P
Sbjct: 385 GTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSPCYNVSGVERVEVPEFS 444
Query: 336 FHFRGGAKLALEKDSMFYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
F GA ++ F + PD C+AV + S+IG QQ ++V YD+
Sbjct: 445 LLFADGAVWDFPAENYFVRLDPDGIMCLAV----LGTPRSAMSIIGNFQQQNFHVLYDLQ 500
Query: 395 RKQKTFQRMDC 405
+ F C
Sbjct: 501 NNRLGFAPRRC 511
>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
Length = 462
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 119/421 (28%), Positives = 179/421 (42%), Gaps = 56/421 (13%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
L HR++ +P + AHR+ R + A + R+ + A ++ +
Sbjct: 82 LAHREAFAAPNATAAQLLAHRLARDAARAEAISVSARNVTRAGGGFSAPVVSGLAQGSGE 141
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ ++ +G PP P +DTGS ++W+ C PCR C Q G +F P RS SYA V C +
Sbjct: 142 YFASVGVGTPPTPALLVLDTGSDVVWLQCAPCRQCYAQSGRVFDPRRSRSYAAVRCGAPP 201
Query: 121 CRYFPYARCSAYKHR---CIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
CR R C+Y Y G T+ ++TE L F V V GCG
Sbjct: 202 CRGLDAGGGGGCDRRRGTCLYQVAYGDGSVTAGDLATETLWFAR----GARVPRVAVGCG 257
Query: 178 FSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAII 232
F +G+ GLG GR SL +Q FSYC D H +I
Sbjct: 258 HDNEGLFVAAAGLLGLGRGRLSLPTQTARRYGRRFSYC---FQGSDLDHRTIIR------ 308
Query: 233 DEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEA 292
T Q + G V R L ++P+ + GGV++DSGT VT L +
Sbjct: 309 -----TVHQHVGGA-----RVRGVGERSLRLDPSTGR-----GGVILDSGTSVTRLARPV 353
Query: 293 YEALRDEVMIRLEGEQMR--SYSWPDKLCYHGIMSSDLKG-----FPTVRFHFRGGAKLA 345
Y A+R+ G ++ +S D CY DL+G PTV H GGA++A
Sbjct: 354 YVAVREAFRAAAGGLRLAPGGFSLFDT-CY------DLRGRRVVKVPTVSVHLAGGAEVA 406
Query: 346 LEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
L ++ + FC+A+ A + S++G + QQ + V +D R++
Sbjct: 407 LPPENYLIPVDTRGTFCLAL--AGTDG---GVSIVGNIQQQGFRVVFDGDRQRVALVPKS 461
Query: 405 C 405
C
Sbjct: 462 C 462
>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 510
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 103/369 (27%), Positives = 160/369 (43%), Gaps = 25/369 (6%)
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
+ + + V + +G PP MDTGS L W+ C PC DC Q G +F P S+SY V
Sbjct: 145 VGSGEYLVEVYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFDQRGPVFDPMASTSYRNVT 204
Query: 116 CDSEHCRYF-----PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
C C P S+ C Y Y T+ ++ E T T S+ V
Sbjct: 205 CGDTRCGLVSPPAAPRTCRSSRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTASSSRRVD 264
Query: 171 DVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLI 225
VV GCG F +G+ GLG G S SQL + +FSYC+ + + +K++
Sbjct: 265 GVVLGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHAFSYCL--VDHGSAVGSKIV 322
Query: 226 LGDGAI------IDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIF--KRDDSGGGV 277
GD + ++ P + YY+ L+ I V G MLDI N + ++D GG
Sbjct: 323 FGDDNVLLSHPQLNYTAFAPSAAENTFYYVQLKGILVGGEMLDIPSNTWGVSKEDGSGGT 382
Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFH 337
+IDSGT +++ + AY+A+R + R++ +P + + + P
Sbjct: 383 IIDSGTTLSYFPEPAYKAIRQAFVDRMDKAYPLIADFPVLSPCYNVSGVERVEVPEFSLL 442
Query: 338 FRGGAKLALEKDSMFYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
F GA ++ F + + C+AV + S+IG QQ ++V YD+
Sbjct: 443 FADGAVWDFPAENYFIRLDTEGIMCLAV----LGTPRSAMSIIGNYQQQNFHVLYDLHHN 498
Query: 397 QKTFQRMDC 405
+ F C
Sbjct: 499 RLGFAPRRC 507
>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 486
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 106/366 (28%), Positives = 151/366 (41%), Gaps = 44/366 (12%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD---CSPQLGTIFYPSRSSSYAYVPCD 117
F V + +G P P DTGS L WV C PC C PQ +F PS+SS+YA V C
Sbjct: 144 FVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAAVHCG 203
Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
C CS C+Y Y G T+ +S + L ++ T FGCG
Sbjct: 204 EPQCAAA-GDLCSEDNTTCLYLVRYGDGSSTTGVLSRDTLALTSSRALT----GFPFGCG 258
Query: 178 FSTNRNFKFSGIFG-----------LGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLIL 226
RN G FG S + + FSYC+ S + L +
Sbjct: 259 ---TRNL---GDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCLPSSNS---TTGYLTI 309
Query: 227 GDGAIIDEGDATPLQFI-----DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDS 281
G D G A + Y++ L +I + G +L + P +F R GG ++DS
Sbjct: 310 GATPATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYVLPVPPAVFTR----GGTLLDS 365
Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL--CYHGIMSSDLKGFPTVRFHFR 339
GT +T+L +AY LRD RL E+ D L CY S++ P V F F
Sbjct: 366 GTVLTYLPAQAYALLRDR--FRLTMERYTPAPPNDVLDACYDFAGESEVV-VPAVSFRFG 422
Query: 340 GGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKT 399
GA L+ + + C+A A+++ + S+IG Q+ V YD+ ++
Sbjct: 423 DGAVFELDFFGVMIFLDENVGCLAF--AAMDTGGLPLSIIGNTQQRSAEVIYDVAAEKIG 480
Query: 400 FQRMDC 405
F C
Sbjct: 481 FVPASC 486
>gi|255563741|ref|XP_002522872.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223537956|gb|EEF39570.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 448
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 122/440 (27%), Positives = 183/440 (41%), Gaps = 61/440 (13%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
++HR S SP+ N R+ R + +S KIR+HN I S+ S
Sbjct: 32 IVHRYSRESPFYPGNITDYERITRLVELS---------KIRAHNL---AITTSSGFSPEA 79
Query: 61 FYVNIS-----------IGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSS 109
F + IS IG P VP + DTGS L W C PC QL IF + S
Sbjct: 80 FRLRISQDDTCYLVKVIIGSPGVPLYLVPDTGSGLFWTQCEPCTRRFRQLPPIFNSTASR 139
Query: 110 SYAYVPCDSEHCRYFPYA-RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIH 168
+Y +PC + C +C +C+Y Y G T+ + + L D +
Sbjct: 140 TYRDLPCQHQFCTNNQNVFQCR--DDKCVYRIAYAGGSATAGVAAQDILQSAENDRIPFY 197
Query: 169 VQDVVFGCGFSTNRNFK-------FSGIFGLGIGRSSLVSQLN----SSFSYCIG--SLH 215
FGC N+NF GI GL + SL+ Q+N + FSYC+ L
Sbjct: 198 -----FGCS-RDNQNFSTFESSGKGGGIIGLNMSPVSLLQQMNHITKNRFSYCLNLFDLS 251
Query: 216 DPDYLHNKLILGDGAIIDEGDATPLQFID----GHYYITLEAISVDGRMLDINPNIFK-R 270
P + + L G+ F+ +Y++ L +SV G + I P F +
Sbjct: 252 SPSHATSLLRFGNDIRKSRRKYLSTPFVSPRGMPNYFLNLIDVSVAGNRMQIPPGTFALK 311
Query: 271 DDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLE--GEQMRSYSWPDKLCYHGIMSSDL 328
D GG +IDSGT VT++ + AY + + G Q + +CY
Sbjct: 312 PDGTGGTIIDSGTAVTYISQTAYFPVITAFKNYFDQHGFQRVNIQLSGYICYKQ-QGHTF 370
Query: 329 KGFPTVRFHFRGGAKLALEKDSMFY--QPRPDAFCMAVNPASINNRYVNFSLIGMMAQQF 386
+P++ FHF+ GA +E + ++ Q R AFC+A+ P S R ++IG + Q
Sbjct: 371 HNYPSMAFHFQ-GADFFVEPEYVYLTVQDR-GAFCVALQPISPQQR----TIIGALNQAN 424
Query: 387 YNVGYDIGRKQKTFQRMDCE 406
YD +Q F +C+
Sbjct: 425 TQFIYDAANRQLLFTPENCQ 444
>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 436
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 107/406 (26%), Positives = 176/406 (43%), Gaps = 62/406 (15%)
Query: 47 QAQILPSNDIS----------NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCS 96
+ Q+LPS + N V++++G PP +DTGS L W+HC +
Sbjct: 39 KTQVLPSGSVPRPSSKLSFHHNVSLTVSLTVGSPPQTVTMVLDTGSELSWLHCKK----A 94
Query: 97 PQLGTIFYPSRSSSYAYVPCDSEHCRY----FPYARCSAYKHRCIYTQLYLIGPETSVFV 152
P L ++F P RSSSY+ +PC S CR F K C Y +
Sbjct: 95 PNLHSVFDPLRSSSYSPIPCTSPTCRTRTRDFSIPVSCDKKKLCHAIISYADASSIEGNL 154
Query: 153 STEQLTFKNTDESTIHVQDVVFGC---GFSTN--RNFKFSGIFGLGIGRSSLVSQLN-SS 206
+++ N+ + +FGC GFS+N + K +G+ G+ G S V+Q+
Sbjct: 155 ASDTFHIGNS-----AIPATIFGCMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQMGLQK 209
Query: 207 FSYCIGSLHDPDYLHNKLILGDGAIIDEGD---------ATPLQFIDG-HYYITLEAISV 256
FSYCI S D + L+ G+ + +TPL + D Y + LE I V
Sbjct: 210 FSYCI-SGQDSSGI---LLFGESSFSWLKALKYTPLVQISTPLPYFDRVAYTVQLEGIKV 265
Query: 257 DGRMLDINPNIFKRDDSGGG-VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWP 315
ML + +++ D +G G M+DSGT T+L+ Y AL++E +R ++ P
Sbjct: 266 ANSMLQLPKSVYAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNE-FVRQTKASLKVLEDP 324
Query: 316 D-------KLCYH-GIMSSDLKGFPTVRFHFRGGAKLALEKDSMFY------QPRPDAFC 361
+ LCY + L PTV FR GA++++ + + Y + +C
Sbjct: 325 NFVFQGAMDLCYRVPLTRRTLPPLPTVTLMFR-GAEMSVSAERLMYRVPGVIRGSDSVYC 383
Query: 362 MAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
+ + V +IG QQ + +D+ + + F + C++
Sbjct: 384 FTFGNSELLG--VESYIIGHHHQQNVWMEFDLAKSRVGFAEVRCDL 427
>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
Length = 500
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 109/428 (25%), Positives = 176/428 (41%), Gaps = 41/428 (9%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQI-------LPS 53
++HR SP + ++ + + R +Q ++ + T LP+
Sbjct: 91 IVHRHGPCSPLADAHDGKLPSHEEILAADQNRAKSIQRRVSTTTTVSRGKPKRNRPSLPA 150
Query: 54 ND---ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSS 109
+ + + V I +G P DTGS WV C PC C Q +F P+RSS
Sbjct: 151 SSGSALGTGNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYKQQEKLFDPARSS 210
Query: 110 SYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHV 169
+YA + C + C CS C+Y Y G + F + + LT + D +
Sbjct: 211 TYANISCAAPACSDLYIKGCSG--GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDA----I 264
Query: 170 QDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKL 224
+ FGCG + + +G+ GLG G++SL Q F++C + L
Sbjct: 265 KGFRFGCGERNEGLYGEAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFPARSSGT---GYL 321
Query: 225 ILGDGAI--IDEGDATPLQFIDG--HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
G G++ + TP+ +G YY+ L I V G++L I ++F G ++D
Sbjct: 322 DFGPGSLPAVSAKLTTPMLVDNGPTFYYVGLTGIRVGGKLLSIPQSVFTTS----GTIVD 377
Query: 281 SGTDVTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFH 337
SGT +T L AY +LR M ++ + S D CY S++ PTV
Sbjct: 378 SGTVITRLPPAAYSSLRSAFASAMAERGYKKAPALSLLDT-CYDFTGMSEVA-IPTVSLL 435
Query: 338 FRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
F+GGA L + + Y C+ + N + ++G + + V YDIG+K
Sbjct: 436 FQGGASLDVHASGIIYAASVSQACLGF---AGNKEDDDVGIVGNTQLKTFGVVYDIGKKV 492
Query: 398 KTFQRMDC 405
F C
Sbjct: 493 VGFCPGAC 500
>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 665
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 106/411 (25%), Positives = 175/411 (42%), Gaps = 57/411 (13%)
Query: 28 ISIARLAYLQEKIRSHNTYQAQI------LPSNDISNSLFYVNISIGQPPVPQFTAMDTG 81
+S + L E R +Q+Q+ L + +SN + + IG PP +DTG
Sbjct: 41 LSYSSLPPRVEDFRRRRLHQSQLPNAHMKLYDDLLSNGYYTTRLWIGTPPQEFALIVDTG 100
Query: 82 SSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQL 141
S++ +V C C+ C F P SSSY + C+ P C C+Y +
Sbjct: 101 STVTYVPCSTCKQCGKHQDPKFQPELSSSYKALKCN-------PDCNCDDEGKLCVYERR 153
Query: 142 YLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC-GFSTNRNF--KFSGIFGLGIGRSS 198
Y +S +S + ++F N ES + Q VFGC T F + GI GLG G+ S
Sbjct: 154 YAEMSSSSGVLSEDLISFGN--ESQLTPQRAVFGCENVETGDLFSQRADGIMGLGRGKLS 211
Query: 199 LVSQL------NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH------ 246
+V QL FS C G + +G GA++ + P + H
Sbjct: 212 VVDQLVDKGVIEDVFSLCYGGME----------VGGGAMVLGKISPPAGMVFSHSDPFRS 261
Query: 247 --YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRL 304
Y I L+ + V G+ L +NP +F + G ++DSGT + KEA+ A++D ++ +
Sbjct: 262 PYYNIDLKQMHVAGKSLKLNPKVF---NGKHGTVLDSGTTYAYFPKEAFIAIKDAIIKEI 318
Query: 305 EGEQMRSYSWP--DKLCYHGI---MSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP-- 357
+ P D +C+ G ++ FP + F G KL L ++ ++
Sbjct: 319 PSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIDMEFGNGQKLILSPENYLFRHTKVR 378
Query: 358 DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
A+C+ + P + +L+G + + V YD + F + +C L
Sbjct: 379 GAYCLGIFPDRDST-----TLLGGIVVRNTLVTYDRENDKLGFLKTNCSDL 424
>gi|21717175|gb|AAM76368.1|AC074196_26 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433292|gb|AAP54830.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 418
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 99/366 (27%), Positives = 162/366 (44%), Gaps = 38/366 (10%)
Query: 63 VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR 122
N +IG PP P +D L+W C C C Q +F P+ SS++ PC ++ C+
Sbjct: 69 ANFTIGTPPQPASAIIDVAGELVWTQCSMCSRCFKQDLPLFVPNASSTFRPEPCGTDACK 128
Query: 123 YFPYARCSAYKHRCIY--TQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
P + CS+ + C Y T +G T V+T+ S + FGC ++
Sbjct: 129 SIPTSNCSS--NMCTYEGTINSKLGGHTLGIVATDTFAIGTATAS------LGFGCVVAS 180
Query: 181 NRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGA-IIDEGD 236
+ SG+ GLG SSLVSQ+N + FSYC+ + HD +++L+LG A + G+
Sbjct: 181 GIDTMGGPSGLIGLGRAPSSLVSQMNITKFSYCL-TPHDSGK-NSRLLLGSSAKLAGGGN 238
Query: 237 ATPLQFIDG--------HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
+T F+ +Y I L+ I + + P SG V++ + +++L
Sbjct: 239 STTTPFVKTSPGDDMSQYYPIQLDGIKAGDAAIALPP-------SGNTVLVQTLAPMSFL 291
Query: 289 VKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHF-RGGAKLALE 347
V AY+AL+ EV + + P LC+ S+ P + F F +G A L +
Sbjct: 292 VDSAYQALKKEVTKAVGAAPTATPLQPFDLCFPKAGLSNASA-PDLVFTFQQGAAALTVP 350
Query: 348 KDSMFYQ--PRPDAFCMAVNPASINNRYV---NFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
CMA+ S N N +++G + Q+ + D+ +K +F+
Sbjct: 351 PPKYLIDVGEEKGTVCMAILSTSWLNTTALDENLNILGSLQQENTHFLLDLEKKTLSFEP 410
Query: 403 MDCEVL 408
DC L
Sbjct: 411 ADCSSL 416
>gi|22831049|dbj|BAC15912.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508281|dbj|BAD32130.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 453
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 111/378 (29%), Positives = 171/378 (45%), Gaps = 31/378 (8%)
Query: 54 NDISNSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSY 111
D+ N Y+ ++IG PP DTGS L+W C PC + C Q ++ PS S ++
Sbjct: 84 KDLPNGGEYIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTF 143
Query: 112 AYVPCDSEHCRYFPYARCSAYKH----RCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI 167
+PC S AR + C Y Q Y G TS +E TF ++ +
Sbjct: 144 RVLPCSSALNLCAAEARLAGATPPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQV 202
Query: 168 HVQDVVFGCGFSTNRNFKFSG-IFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNKLI 225
V + FGC +++ ++ S + GLG G SLVSQL + FSYC+ D + L+
Sbjct: 203 RVPGIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCLTPFQDTKS-KSTLL 261
Query: 226 LGDGAIIDEGDATPLQF-----------IDGHYYITLEAISVDGRMLDINPNIFK-RDDS 273
LG A + T ++ + +YY+ L ISV L I P F R D
Sbjct: 262 LGPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRADG 321
Query: 274 GGGVMIDSGTDVTWLVKEAYEALRDEV--MIRLEGEQMRSYSWPDKLCYHGIMSSDLKG- 330
GG++IDSGT +T LV AY+ +R V +++L + + D LC+ SS
Sbjct: 322 TGGLIIDSGTTITSLVDAAYKRVRAAVRSLVKLPVTDGSNATGLD-LCFALPSSSAPPAT 380
Query: 331 FPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
P++ HF GGA + L ++ +C+A+ + S +G QQ ++
Sbjct: 381 LPSMTLHFGGGADMVLPVENYMIL-DGGMWCLAMR----SQTDGELSTLGNYQQQNLHIL 435
Query: 391 YDIGRKQKTFQRMDCEVL 408
YD+ ++ +F C L
Sbjct: 436 YDVQKETLSFAPAKCSTL 453
>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
Length = 480
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 114/422 (27%), Positives = 186/422 (44%), Gaps = 50/422 (11%)
Query: 17 NPAHRVQRGINISIARLAYLQEKIRS----HNTYQA----QILPSNDISNSLFYVNISIG 68
N ++Q+ + R+ +Q +IR+ HN+ + QI ++ I+ ++IG
Sbjct: 79 NWNRKLQKQLIFDDLRVRSMQNRIRAKVSGHNSSEQSSEIQIPLASGINLETLNYIVTIG 138
Query: 69 QPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYAR 128
+DTGS L WV C PC C Q G +F PS SSSY + C+S C+ +
Sbjct: 139 LGNQNMTVIIDTGSDLTWVQCDPCMSCYSQQGPVFNPSNSSSYNSLLCNSSTCQNLQFTT 198
Query: 129 -----CSAYK-HRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNR 182
C + C +T Y G T + E L+F I V + VFGCG +
Sbjct: 199 GNTEACESNNPSSCNHTVSYGDGSFTDGELGVEHLSFGG-----ISVSNFVFGCGRNNKG 253
Query: 183 NF-KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEGDA 237
F SGI GLG S++SQ N++ FSYC+ + L++G+ + + + +
Sbjct: 254 LFGGVSGIMGLGRSNLSMISQTNTTFGGVFSYCLPTTDSG--ASGSLVIGNESSLFK-NL 310
Query: 238 TPLQF--------IDGHYYITLEAISVDGRMLDINPNIFKRDDS--GGGVMIDSGTDVTW 287
TP+ + + Y + L I V G + +D S GG++IDSGT +T
Sbjct: 311 TPIAYTSMVSNPQLSNFYVLNLTGIDVGGVAI--------QDTSFGNGGILIDSGTVITR 362
Query: 288 LVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALE 347
L Y AL+ E + + G + C++ + + PT+ HF L ++
Sbjct: 363 LAPSLYNALKAEFLKQFSGYPIAPALSILDTCFN-LTGIEEVSIPTLSMHFENNVDLNVD 421
Query: 348 KDSMFYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
+ Y P+ + C+A+ S N + ++IG Q+ V YD + + F R DC
Sbjct: 422 AVGILYMPKDGSQVCLALASLSDEN---DMAIIGNYQQRNQRVIYDAKQSKIGFAREDCS 478
Query: 407 VL 408
+
Sbjct: 479 FI 480
>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
Length = 509
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 105/354 (29%), Positives = 154/354 (43%), Gaps = 38/354 (10%)
Query: 69 QPPVPQFTAMDTGSSLLWVHCYPC--RDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYF-P 125
+P V Q +DT S + WV C+PC C Q ++ PS+S S C S CR P
Sbjct: 177 RPGVRQLMLLDTASDVAWVQCFPCPASQCYAQTDVLYDPSKSRSSESFACSSPTCRQLGP 236
Query: 126 YAR-CSAYKH---RCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTN 181
YA CS+ + +C Y Y G TS + +QL+ T + V FGC +
Sbjct: 237 YANGCSSSSNSAGQCQYRVRYPDGSTTSGTLVADQLSLSPTSQ----VPKFEFGCSHAAR 292
Query: 182 RNF---KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNK-LILGDGAIID 233
+F K +GI LG G SLVSQ ++ FSYC P H +LG
Sbjct: 293 GSFSRSKTAGIMALGRGVQSLVSQTSTKYGQVFSYCF----PPTASHKGFFVLGVPRRSS 348
Query: 234 EGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEA 292
A TP+ Y + LEAI+V G+ LD+ P +F G +DS T +T L A
Sbjct: 349 SRYAVTPMLKTPMLYQVRLEAIAVAGQRLDVPPTVF-----AAGAALDSRTVITRLPPTA 403
Query: 293 YEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHF-RGGAKLALEKDSM 351
Y+ALR ++ + + + CY S + PT+ F R GA + L+ +
Sbjct: 404 YQALRSAFRDKMSMYRPAAANGQLDTCYDFTGVSSIM-LPTISLVFDRTGAGVQLDPSGV 462
Query: 352 FYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ C+A + ++R +IG + Q V Y++ F+R C
Sbjct: 463 LF-----GSCLAFASTAGDDRATG--IIGFLQLQTIEVLYNVAGGSVGFRRGAC 509
>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
Length = 482
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 103/349 (29%), Positives = 153/349 (43%), Gaps = 40/349 (11%)
Query: 78 MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYAR-----CSAY 132
+DTGS L WV C PC+ C Q +F PS S SY V C S C+ A C +
Sbjct: 150 VDTGSDLSWVQCQPCKRCYNQQDPVFNPSTSPSYRTVLCSSPTCQSLQSATGNLGVCGSN 209
Query: 133 KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGL 192
C Y Y G T + TE L N+ V + +FGCG N F G GL
Sbjct: 210 PPSCNYVVNYGDGSYTRGELGTEHLDLGNSTA----VNNFIFGCG--RNNQGLFGGASGL 263
Query: 193 -GIGRS--SLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFID- 244
G+GRS SL+SQ ++ FSYC+ L++G + + + + TP+ +
Sbjct: 264 VGLGRSSLSLISQTSAMFGGVFSYCLPITETE--ASGSLVMGGNSSVYK-NTTPISYTRM 320
Query: 245 ------GHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRD 298
Y++ L I+V + + F +D G+MIDSGT +T L Y+AL+D
Sbjct: 321 IPNPQLPFYFLNLTGITVGS--VAVQAPSFGKD----GMMIDSGTVITRLPPSIYQALKD 374
Query: 299 EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPD 358
E + + G C++ +++ P ++ HF G A+L ++ +FY + D
Sbjct: 375 EFVKQFSGFPSAPAFMILDTCFNLSGYQEVE-IPNIKMHFEGNAELNVDVTGVFYFVKTD 433
Query: 359 A--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
A C+A+ S N +IG Q+ V YD F C
Sbjct: 434 ASQVCLAIASLSYENE---VGIIGNYQQKNQRVIYDTKGSMLGFAAEAC 479
>gi|357460823|ref|XP_003600693.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355489741|gb|AES70944.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 93/368 (25%), Positives = 161/368 (43%), Gaps = 37/368 (10%)
Query: 63 VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR 122
+N+ IG PP Q +DTGS L W+ C+ + + F PS SS+++ +PC C+
Sbjct: 77 INLPIGTPPQTQPMVLDTGSQLSWIQCHKKQPPTAS----FDPSLSSTFSILPCTHPLCK 132
Query: 123 Y----FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
F C Y+ Y G + E+ TF ++ ++ GC
Sbjct: 133 PRIPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTFSR----SVSTPPLILGCAT 188
Query: 179 STNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCI------------GSLHDPDYLHNKLI 225
+ GI G+ +GR S Q + FSYC+ GS + + +K
Sbjct: 189 ESTDP---RGILGMNLGRLSFAKQSKITKFSYCVPPRQTRPGFTPTGSFYLGNNPSSKGF 245
Query: 226 LGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMIDSGTD 284
G + P F Y I + I + G+ L+I+P +F+ D G G MIDSG++
Sbjct: 246 KYVGMMTSSRQRMP-NFDPLAYTIPMVGIRIAGKKLNISPAVFRADAGGSGQTMIDSGSE 304
Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRS---YSWPDKLCYHGIMSSDL-KGFPTVRFHFRG 340
T+LV EAY+ +R +V +R G +++ Y +C+ + + ++ + + F F
Sbjct: 305 FTYLVSEAYDKVRAQV-VRAVGPRLKKGYVYGGVADMCFDSVKAVEIGRLIGEMVFEFER 363
Query: 341 GAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
G ++ + K+ + C+ + S + ++IG QQ V +D+ R++ F
Sbjct: 364 GVEVVIPKERVLADVGGGVHCVGI--GSSDKLGAASNIIGNFHQQNLWVEFDLVRRRVGF 421
Query: 401 QRMDCEVL 408
+ DC L
Sbjct: 422 GKADCSRL 429
>gi|115472519|ref|NP_001059858.1| Os07g0533800 [Oryza sativa Japonica Group]
gi|113611394|dbj|BAF21772.1| Os07g0533800 [Oryza sativa Japonica Group]
Length = 458
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 111/378 (29%), Positives = 171/378 (45%), Gaps = 31/378 (8%)
Query: 54 NDISNSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSY 111
D+ N Y+ ++IG PP DTGS L+W C PC + C Q ++ PS S ++
Sbjct: 89 KDLPNGGEYIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTF 148
Query: 112 AYVPCDSEHCRYFPYARCSAYKH----RCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI 167
+PC S AR + C Y Q Y G TS +E TF ++ +
Sbjct: 149 RVLPCSSALNLCAAEARLAGATPPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQV 207
Query: 168 HVQDVVFGCGFSTNRNFKFSG-IFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNKLI 225
V + FGC +++ ++ S + GLG G SLVSQL + FSYC+ D + L+
Sbjct: 208 RVPGIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCLTPFQDTKS-KSTLL 266
Query: 226 LGDGAIIDEGDATPLQF-----------IDGHYYITLEAISVDGRMLDINPNIFK-RDDS 273
LG A + T ++ + +YY+ L ISV L I P F R D
Sbjct: 267 LGPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRADG 326
Query: 274 GGGVMIDSGTDVTWLVKEAYEALRDEV--MIRLEGEQMRSYSWPDKLCYHGIMSSDLKG- 330
GG++IDSGT +T LV AY+ +R V +++L + + D LC+ SS
Sbjct: 327 TGGLIIDSGTTITSLVDAAYKRVRAAVRSLVKLPVTDGSNATGLD-LCFALPSSSAPPAT 385
Query: 331 FPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
P++ HF GGA + L ++ +C+A+ + S +G QQ ++
Sbjct: 386 LPSMTLHFGGGADMVLPVENYMIL-DGGMWCLAMR----SQTDGELSTLGNYQQQNLHIL 440
Query: 391 YDIGRKQKTFQRMDCEVL 408
YD+ ++ +F C L
Sbjct: 441 YDVQKETLSFAPAKCSTL 458
>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
Length = 429
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 107/406 (26%), Positives = 175/406 (43%), Gaps = 62/406 (15%)
Query: 47 QAQILPSNDIS----------NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCS 96
+ Q+LPS + N V++++G PP +DTGS L W+HC +
Sbjct: 32 KTQVLPSGSVPRPSSKLSFHHNVSLTVSLTVGSPPQTVTMVLDTGSELSWLHCKK----A 87
Query: 97 PQLGTIFYPSRSSSYAYVPCDSEHCRY----FPYARCSAYKHRCIYTQLYLIGPETSVFV 152
P L ++F P RSSSY+ +PC S CR F K C Y +
Sbjct: 88 PNLHSVFDPLRSSSYSPIPCTSPTCRTRTRDFSIPVSCDKKKLCHAIISYADASSIEGNL 147
Query: 153 STEQLTFKNTDESTIHVQDVVFGC---GFSTN--RNFKFSGIFGLGIGRSSLVSQLN-SS 206
+++ N+ + +FGC GFS+N + K +G+ G+ G S V+Q+
Sbjct: 148 ASDTFHIGNSA-----IPATIFGCMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQMGLQK 202
Query: 207 FSYCIGSLHDPDYLHNKLILGDGAIIDEGD---------ATPLQFIDG-HYYITLEAISV 256
FSYCI S D + L+ G+ + +TPL + D Y + LE I V
Sbjct: 203 FSYCI-SGQDSSGI---LLFGESSFSWLKALKYTPLVQISTPLPYFDRVAYTVQLEGIKV 258
Query: 257 DGRMLDINPNIFKRDDSGGG-VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWP 315
ML + +++ D +G G M+DSGT T+L+ Y AL++E +R ++ P
Sbjct: 259 ANSMLQLPKSVYAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNE-FVRQTKASLKVLEDP 317
Query: 316 D-------KLCYH-GIMSSDLKGFPTVRFHFRGGAKLALEKDSMFY------QPRPDAFC 361
+ LCY + L PTV FR GA++++ + + Y + +C
Sbjct: 318 NFVFQGAMDLCYRVPLTRRTLPPLPTVTLMFR-GAEMSVSAERLMYRVPGVIRGSDSVYC 376
Query: 362 MAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
+ + V +IG QQ + +D+ + + F + C +
Sbjct: 377 FTFGNSELLG--VESYIIGHHHQQNVWMEFDLAKSRVGFAEVRCXL 420
>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 431
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 99/360 (27%), Positives = 153/360 (42%), Gaps = 30/360 (8%)
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
+ + + V IG P AMDT + W+ C C CS T+F +S+++ V
Sbjct: 91 VQSPTYIVRAKIGTPAQTMLLAMDTSNDAAWIPCSGCVGCS---STVFNNVKSTTFKTVG 147
Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
C++ C+ P ++C C + Y G + ++ + TD + FG
Sbjct: 148 CEAPQCKQVPNSKCGG--SACAFNMTY--GSSSIAANLSQDVVTLATDS----IPSYTFG 199
Query: 176 C-GFSTNRNFKFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGA 230
C +T + G+ GLG G SL+SQ S+FSYC+ S ++ L LG
Sbjct: 200 CLTEATGSSIPPQGLLGLGRGPMSLLSQTQNLYQSTFSYCLPSFRSLNF-SGSLRLGPVG 258
Query: 231 IIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVT 286
TPL YY+ L AI V R++DI P+ +G G + DSGT T
Sbjct: 259 QPKRIKTTPLLKNPRRSSLYYVNLMAIRVGRRVVDIPPSALAFNPTTGAGTIFDSGTVFT 318
Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
LV AY A+RD R+ + S D CY + + PT+ F F G + L
Sbjct: 319 RLVAPAYTAVRDAFRKRVGNATVTSLGGFDT-CYTSPIVA-----PTITFMF-SGMNVTL 371
Query: 347 EKDSMFYQPRPDAF-CMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
D++ + C+A+ A N V ++I M QQ + + +D+ + R C
Sbjct: 372 PPDNLLIHSTASSITCLAMAAAPDNVNSV-LNVIANMQQQNHRILFDVPNSRLGVAREPC 430
>gi|125571687|gb|EAZ13202.1| hypothetical protein OsJ_03122 [Oryza sativa Japonica Group]
Length = 453
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 122/412 (29%), Positives = 179/412 (43%), Gaps = 49/412 (11%)
Query: 24 RGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL------FYVNISIGQPPVPQFTA 77
R + S +RL+ L + S+ A P L + ++ IG P
Sbjct: 53 RAVQRSRSRLSMLAARAVSN----AGAAPGESAQTPLKKGSGDYAMSFGIGTPATGLSGE 108
Query: 78 MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCS------A 131
DTGS L+W C C CSP+ +YP+ SSS A+V C C P CS +
Sbjct: 109 ADTGSDLIWTKCGACARCSPRGSPSYYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGGS 168
Query: 132 YKHRCIYTQLYLIGPETSVFVS----TEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF- 186
C Y Y +T + TE TF + + FGC + F
Sbjct: 169 GSGNCSYHYAYGNARDTHHYTEGILMTETFTFG---DDAAAFPGIAFGCTLRSEGGFGTG 225
Query: 187 SGIFGLGIGRSSLVSQLN-SSFSYCIGS-LHDPDYLH-NKLILGDGAIIDEGDATPL--- 240
SG+ GLG G+ SLV+QLN +F Y + S L P + L G D +TPL
Sbjct: 226 SGLVGLGRGKLSLVTQLNVEAFGYRLSSDLSAPSPISFGSLADVTGGNGDSFMSTPLLTN 285
Query: 241 ---QFIDGHYYITLEAISVDGRMLDINPNIFK--RDDSGGGVMIDSGTDVTWLVKEAYEA 295
Q + YY+ L ISV G+++ I F R GGV+ DSGT +T L AY
Sbjct: 286 PVVQDLP-FYYVGLTGISVGGKLVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTL 344
Query: 296 LRDEVMIRLEGEQMRSYSWPDKL-CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS---- 350
+RDE++ ++ ++ + D L C+ G S FP++ HF GGA + L ++
Sbjct: 345 VRDELLSQMGFQKPPPAANDDDLICFTG--GSSTTTFPSMVLHFDGGADMDLSTENYLPQ 402
Query: 351 MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI-GRKQKTFQ 401
M Q A C +V +S ++IG + Q ++V +D+ G + FQ
Sbjct: 403 MQGQNGETARCWSVVKSS-----QALTIIGNIMQMDFHVVFDLSGNARMLFQ 449
>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 458
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 118/431 (27%), Positives = 183/431 (42%), Gaps = 55/431 (12%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHN--------TYQAQILP 52
L HR SP + E + R + R Y+Q K+ ++ A LP
Sbjct: 57 LSHRHGPCSPAPSTVEPTMAELLRRDQL---RAKYIQAKLSVNSGSGTDGVQQSAAITLP 113
Query: 53 SNDIS--NSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSS 109
+ S ++L YV +SIG P + Q +DTGS + WVHC+ L F P +SS
Sbjct: 114 TTLGSALDTLAYVITVSIGTPAMTQAVMIDTGSDVSWVHCHARAGAGSSL--FFDPGKSS 171
Query: 110 SYAYVPCDSEHC-RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIH 168
+Y C S C R + C YT Y G T+ ++ L +T++
Sbjct: 172 TYTPFSCSSAACTRLEGRDNGCSLNSTCQYTVRYGDGSNTTGTYGSDTLALNSTEK---- 227
Query: 169 VQDVVFGCGFSTNRNFKF-----SGIFGLGIGRSSLVSQL----NSSFSYCI-GSLHDPD 218
V++ FGC +++ G+ GLG G SLVSQ S+FSYC+ +
Sbjct: 228 VENFQFGCSETSDPGEGLDEDQTDGLMGLGGGAPSLVSQTAATYGSAFSYCLPATTRSSG 287
Query: 219 YLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
+L G + TP+ + Y++ L+ I+V G + I+P +F
Sbjct: 288 FLTLGASTGTSGFV----TTPMFRSRRAPTFYFVILQGINVGGDPVAISPTVFA-----A 338
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTV 334
G ++DSGT +T L AY AL + + R++S D C+ D P V
Sbjct: 339 GSIMDSGTIITRLPPRAYSALSAAFRAGMRRYPRARAFSILDT-CFD-FTGQDNVSIPAV 396
Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
F GGA + L+ D + Y C+A PA+ S+IG + Q+ + V +D+G
Sbjct: 397 ELVFSGGAVVDLDADGIMY-----GSCLAFAPATGGIG----SIIGNVQQRTFEVLHDVG 447
Query: 395 RKQKTFQRMDC 405
+ F+ C
Sbjct: 448 QSVLGFRPGAC 458
>gi|125527370|gb|EAY75484.1| hypothetical protein OsI_03384 [Oryza sativa Indica Group]
Length = 453
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 122/412 (29%), Positives = 179/412 (43%), Gaps = 49/412 (11%)
Query: 24 RGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL------FYVNISIGQPPVPQFTA 77
R + S +RL+ L + S+ A P L + ++ IG P
Sbjct: 53 RAVQRSRSRLSMLAARAVSN----AGAAPGESAQTPLKKGSGDYAMSFGIGTPATGLSGE 108
Query: 78 MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCS------A 131
DTGS L+W C C CSP+ +YP+ SSS A+V C C P CS +
Sbjct: 109 ADTGSDLIWTKCGACARCSPRGSPSYYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGGS 168
Query: 132 YKHRCIYTQLYLIGPETSVFVS----TEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF- 186
C Y Y +T + TE TF + + FGC + F
Sbjct: 169 GSGNCSYHYAYGNARDTHHYTEGILMTETFTFG---DDAAAFPGIAFGCTLRSEGGFGTG 225
Query: 187 SGIFGLGIGRSSLVSQLN-SSFSYCIGS-LHDPDYLH-NKLILGDGAIIDEGDATPL--- 240
SG+ GLG G+ SLV+QLN +F Y + S L P + L G D +TPL
Sbjct: 226 SGLVGLGRGKLSLVTQLNVEAFGYRLSSDLSAPSPISFGSLADVTGGNGDSFMSTPLLTN 285
Query: 241 ---QFIDGHYYITLEAISVDGRMLDINPNIFK--RDDSGGGVMIDSGTDVTWLVKEAYEA 295
Q + YY+ L ISV G+++ I F R GGV+ DSGT +T L AY
Sbjct: 286 PVVQDLP-FYYVGLTGISVGGKLVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTL 344
Query: 296 LRDEVMIRLEGEQMRSYSWPDKL-CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS---- 350
+RDE++ ++ ++ + D L C+ G S FP++ HF GGA + L ++
Sbjct: 345 VRDELLSQMGFQKPPPAANDDDLICFTG--GSSTTTFPSMVLHFDGGADMDLSTENYLPQ 402
Query: 351 MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI-GRKQKTFQ 401
M Q A C +V +S ++IG + Q ++V +D+ G + FQ
Sbjct: 403 MQGQNGETARCWSVVKSS-----QALTIIGNIMQMDFHVVFDLSGNARMLFQ 449
>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
Length = 586
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 101/398 (25%), Positives = 169/398 (42%), Gaps = 57/398 (14%)
Query: 38 EKIRSHNTYQAQI------LPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYP 91
E R +Q+Q+ L + +SN + + IG PP +DTGS++ +V C
Sbjct: 47 EDFRRRRLHQSQLPNAHMKLYDDLLSNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCST 106
Query: 92 CRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVF 151
C+ C F P S+SY + C+ P C C+Y + Y +S
Sbjct: 107 CKQCGKHQDPKFQPELSTSYQALKCN-------PDCNCDDEGKLCVYERRYAEMSSSSGV 159
Query: 152 VSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF---KFSGIFGLGIGRSSLVSQL----- 203
+S + ++F N ES + Q VFGC + + GI GLG G+ S+V QL
Sbjct: 160 LSEDLISFGN--ESQLSPQRAVFGCENEETGDLFSQRADGIMGLGRGKLSVVDQLVDKGV 217
Query: 204 -NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH--------YYITLEAI 254
FS C G + +G GA++ + P + H Y I L+ +
Sbjct: 218 IEDVFSLCYGGME----------VGGGAMVLGKISPPPGMVFSHSDPFRSPYYNIDLKQM 267
Query: 255 SVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSW 314
V G+ L +NP +F + G ++DSGT + KEA+ A++D V+ + +
Sbjct: 268 HVAGKSLKLNPKVF---NGKHGTVLDSGTTYAYFPKEAFIAIKDAVIKEIPSLKRIHGPD 324
Query: 315 P--DKLCYHGI---MSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP--DAFCMAVNPA 367
P D +C+ G ++ FP + F G KL L ++ ++ A+C+ + P
Sbjct: 325 PNYDDVCFSGAGRDVAEIHNFFPEIAMEFGNGQKLILSPENYLFRHTKVRGAYCLGIFPD 384
Query: 368 SINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ +L+G + + V YD + F + +C
Sbjct: 385 RDST-----TLLGGIVVRNTLVTYDRENDKLGFLKTNC 417
>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 631
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 101/398 (25%), Positives = 169/398 (42%), Gaps = 57/398 (14%)
Query: 38 EKIRSHNTYQAQI------LPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYP 91
E R +Q+Q+ L + +SN + + IG PP +DTGS++ +V C
Sbjct: 47 EDFRRRRLHQSQLPNAHMKLYDDLLSNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCST 106
Query: 92 CRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVF 151
C+ C F P S+SY + C+ P C C+Y + Y +S
Sbjct: 107 CKQCGKHQDPKFQPELSTSYQALKCN-------PDCNCDDEGKLCVYERRYAEMSSSSGV 159
Query: 152 VSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF---KFSGIFGLGIGRSSLVSQL----- 203
+S + ++F N ES + Q VFGC + + GI GLG G+ S+V QL
Sbjct: 160 LSEDLISFGN--ESQLSPQRAVFGCENEETGDLFSQRADGIMGLGRGKLSVVDQLVDKGV 217
Query: 204 -NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH--------YYITLEAI 254
FS C G + +G GA++ + P + H Y I L+ +
Sbjct: 218 IEDVFSLCYGGME----------VGGGAMVLGKISPPPGMVFSHSDPFRSPYYNIDLKQM 267
Query: 255 SVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSW 314
V G+ L +NP +F + G ++DSGT + KEA+ A++D V+ + +
Sbjct: 268 HVAGKSLKLNPKVF---NGKHGTVLDSGTTYAYFPKEAFIAIKDAVIKEIPSLKRIHGPD 324
Query: 315 P--DKLCYHGI---MSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP--DAFCMAVNPA 367
P D +C+ G ++ FP + F G KL L ++ ++ A+C+ + P
Sbjct: 325 PNYDDVCFSGAGRDVAEIHNFFPEIAMEFGNGQKLILSPENYLFRHTKVRGAYCLGIFPD 384
Query: 368 SINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ +L+G + + V YD + F + +C
Sbjct: 385 RDST-----TLLGGIVVRNTLVTYDRENDKLGFLKTNC 417
>gi|357118074|ref|XP_003560784.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial
[Brachypodium distachyon]
Length = 452
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 105/371 (28%), Positives = 152/371 (40%), Gaps = 42/371 (11%)
Query: 54 NDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSYA 112
++ F V + G P T DTGS L W+ C PC C Q +F P++SSSYA
Sbjct: 105 TNLKTPEFVVVVGFGSPAQTSATMFDTGSDLSWIQCQPCSGHCYKQHDPVFDPAKSSSYA 164
Query: 113 YVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
VPC + C A C+Y Y G T+ ++ E LTF ++ E T
Sbjct: 165 VVPCGTTECA---AAGGECNGTTCVYGVEYGDGSSTTGVLARETLTFSSSSEFT----GF 217
Query: 173 VFGCGFSTNRNF-----KFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILG 227
+FGCG + +F G S FSYC+ P Y L
Sbjct: 218 IFGCGETNLGDFGEVDGLLGLGRGSLSLSSQAAPAFGGIFSYCL-----PSYNTTPGYLS 272
Query: 228 DGAIIDEGDATPLQFI--------DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
GA G P+Q+ Y+I L +I++ G +L + P+ F + G ++
Sbjct: 273 IGATPVTGQ-IPVQYTAMVNKPDYPSFYFIELVSINIGGYVLPVPPSEFTKT----GTLL 327
Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFR 339
DSGT +T+L AY ALRD ++G + CY S + P V F+F
Sbjct: 328 DSGTILTYLPPPAYTALRDRFKFTMQGSKPAPPYDELDTCYDFTGQSGIL-IPGVSFNFS 386
Query: 340 GGAKLALEKDSMFYQP---RPDAFCMAV--NPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
GA L + P +P C+A PA + FS++G Q+ V YD+
Sbjct: 387 DGAVFNLNFFGIMTFPDDTKPAVGCLAFVSRPADM-----PFSVVGSTTQRSAEVIYDVP 441
Query: 395 RKQKTFQRMDC 405
++ F C
Sbjct: 442 AQKIGFIPASC 452
>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 488
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 118/429 (27%), Positives = 172/429 (40%), Gaps = 62/429 (14%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRG--INISIARLAYLQEKIR-------SHNTYQAQIL 51
++H+ S NN + + +N R+ Y+ +I S + + L
Sbjct: 73 VVHKHGPCSQLNNHDGKAKSKTPHSEILNQDKERVKYINSRISKNLGQDSSVSELDSVTL 132
Query: 52 PSND---ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSR 107
P+ I + ++V + +G P DTGS L W C PC R C Q IF PS+
Sbjct: 133 PAKSGSLIGSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDAIFDPSK 192
Query: 108 SSSYAYVPCDSEHCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT 162
S+SY+ + C S C A CSA CIY Y + + S E+L+ T
Sbjct: 193 STSYSNITCTSTLCTQLSTATGNEPGCSASTKACIYGIQYGDSSFSVGYFSRERLSVTAT 252
Query: 163 DESTIHVQDVVFGCGFSTNRNFKFS-GIFGLGIGRSSLVSQLNSS----FSYCIGSLHDP 217
D V + +FGCG + F S G+ GLG S V Q + FSYC+ +
Sbjct: 253 D----IVDNFLFGCGQNNQGLFGGSAGLIGLGRHPISFVQQTAAVYRKIFSYCLPATSSS 308
Query: 218 DYLHNKLILGDGAIIDEGDATPLQFI---DGHYYITLEAISVDGRMLDINPNIFKRDDSG 274
+L G TP I Y + + ISV G L ++ + F S
Sbjct: 309 T---GRLSFGT-TTTSYVKYTPFSTISRGSSFYGLDITGISVGGAKLPVSSSTF----ST 360
Query: 275 GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL-----CYHGIMSSDLK 329
GG +IDSGT +T L AY ALR + M Y +L CY DL
Sbjct: 361 GGAIIDSGTVITRLPPTAYTALRSAFR-----QGMSKYPSAGELSILDTCY------DLS 409
Query: 330 GF-----PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQ 384
G+ P + F F GG + L + Y C+A + N + ++ G + Q
Sbjct: 410 GYEVFSIPKIDFSFAGGVTVQLPPQGILYVASAKQVCLAF---AANGDDSDVTIYGNVQQ 466
Query: 385 QFYNVGYDI 393
+ V YD+
Sbjct: 467 KTIEVVYDV 475
>gi|363808270|ref|NP_001242239.1| uncharacterized protein LOC100801883 [Glycine max]
gi|255641727|gb|ACU21134.1| unknown [Glycine max]
Length = 475
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 93/358 (25%), Positives = 163/358 (45%), Gaps = 44/358 (12%)
Query: 40 IRSHNTYQ-AQILPSNDIS---------NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC 89
+R+H+ + +IL + D++ L++ + +G PP + +DTGS +LWV+C
Sbjct: 39 VRAHDVRRRGRILSAVDLNLGGNGLPTETGLYFTKLGLGSPPRDYYVQVDTGSDILWVNC 98
Query: 90 YPCRDC--SPQLG---TIFYPSRSSSYAYVPCDSEHCRYF---PYARCSAYKHRCIYTQL 141
C C LG T++ P S + V CD + C P C + + C Y+
Sbjct: 99 VECSRCPRKSDLGIDLTLYDPKGSETSDVVSCDQDFCSATFDGPIPGCKS-EIPCPYSIT 157
Query: 142 YLIGPETSVFVSTEQLTFKNTD---ESTIHVQDVVFGCG------FSTNRNFKFSGIFGL 192
Y G T+ + + LT+ + ++ ++FGCG ++ GI G
Sbjct: 158 YGDGSATTGYYVQDYLTYNRINGNLRTSPQNSSIIFGCGAVQSGTLGSSSEEALDGIIGF 217
Query: 193 GIGRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFIDG 245
G SS++SQL +S FS+C+ D + I G +++ + TPL
Sbjct: 218 GQANSSVLSQLAASGKVKKIFSHCL------DNVRGGGIFAIGEVVEPKVSTTPLVPRMA 271
Query: 246 HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLE 305
HY + L++I VD +L + +IF + G G +IDSGT + +L Y+ L +V+ R
Sbjct: 272 HYNVVLKSIEVDTDILQLPSDIFDSVN-GKGTVIDSGTTLAYLPDIVYDELIQKVLARQP 330
Query: 306 GEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMA 363
G ++ + C+ + D +GFP V+ HF+ L + +Q + +C+
Sbjct: 331 GLKLYLVEQQFR-CFLYTGNVD-RGFPVVKLHFKDSLSLTVYPHDYLFQFKDGIWCIG 386
>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 111 bits (277), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 104/388 (26%), Positives = 156/388 (40%), Gaps = 49/388 (12%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGT-----------IFYPSRSS 109
++V +G P P DTGS L WV C P + + + F P +S
Sbjct: 95 YFVRFRVGTPAQPFLLVADTGSDLTWVKCRPAKAAAASTNSSSSASASSPRRAFRPEKSK 154
Query: 110 SYAYVPCDSEHCRY---FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFK------ 160
++A +PC S+ C F + C C Y Y G V TE T
Sbjct: 155 TWAPIPCASDTCSKSLPFSLSTCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSSSSS 214
Query: 161 --NTDESTIHVQDVVFGC-GFSTNRNFKFS-GIFGLGIGRSSLVSQLNSSF----SYCIG 212
+Q +V GC G T +F+ S G+ LG S S S F SYC+
Sbjct: 215 SSKNKVKKAKLQGLVLGCTGSYTGPSFEASDGVLSLGYSNVSFASHAASRFGGRFSYCLV 274
Query: 213 SLHDPDYLHNKLILGDGAIIDE---------GDATPLQF---IDGHYYITLEAISVDGRM 260
P + L G + + TPL + Y ++++AISVDG +
Sbjct: 275 DHLSPRNATSYLTFGPNSALSGPCPAAAGPGARQTPLVLDSRMRPFYDVSIKAISVDGEL 334
Query: 261 LDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCY 320
L I ++++ D GGGV++DSGT +T L K AY A+ + +L R P + CY
Sbjct: 335 LKIPRDVWEVD-GGGGVIVDSGTSLTVLAKPAYRAVVAALGKKL-ARFPRVAMDPFEYCY 392
Query: 321 HGIMSS---DLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFS 377
+ S + P + HF G A+L S P C+ V + S
Sbjct: 393 NWTSPSRKDEGDDLPKLAVHFAGSARLEPPSKSYVIDAAPGVKCIGVQ----EGPWPGIS 448
Query: 378 LIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+IG + QQ + +D+ ++ F+R C
Sbjct: 449 VIGNILQQEHLWEFDLKNRRLRFKRSRC 476
>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 494
Score = 111 bits (277), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 96/314 (30%), Positives = 149/314 (47%), Gaps = 33/314 (10%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTI------FYPSRSSSYAY 113
L++ + +G PP +DTGS +LWV C C +C PQ + F + SS+
Sbjct: 80 LYFTRVKLGTPPREFNVQIDTGSDVLWVTCSSCSNC-PQTSGLGIQLNYFDTTSSSTARL 138
Query: 114 VPCDSEHCR---YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT-DESTI-- 167
VPC C +C ++C Y Y G TS + ++ F ES I
Sbjct: 139 VPCSHPICTSQIQTTATQCPPQSNQCSYAFQYGDGSGTSGYYVSDTFYFDAVLGESLIAN 198
Query: 168 HVQDVVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHD 216
+VFGC G T + GIFG G G S++SQL+S FS+C L
Sbjct: 199 SSAAIVFGCSTYQSGDLTKTDKAVDGIFGFGQGELSVISQLSSHGITPRVFSHC---LKG 255
Query: 217 PDYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
D L+LG+ I++ G +PL HY + L++I+V G++L I+P F S
Sbjct: 256 EDSGGGILVLGE--ILEPGIVYSPLVPSQPHYNLDLQSIAVSGQLLPIDPAAFAT-SSNR 312
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
G +ID+GT + +LV+EAY+ + + + + ++ CY + +S + FP V
Sbjct: 313 GTIIDTGTTLAYLVEEAYDPFVSAITAAVSQLATPTINKGNQ-CYL-VSNSVSEVFPPVS 370
Query: 336 FHFRGGAKLALEKD 349
F+F GGA + L+ +
Sbjct: 371 FNFAGGATMLLKPE 384
>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 110 bits (276), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 107/400 (26%), Positives = 166/400 (41%), Gaps = 53/400 (13%)
Query: 15 NENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQ 74
+E +QR ++ AR A + + + A + + + + V +S+G P V Q
Sbjct: 97 DEQRVEYIQRRVSGGGARGAKGALQQLATGSRSATVPTTMGVGTFQYVVTVSLGTPGVSQ 156
Query: 75 FTAMDTGSSLLWVHCYPCR--DCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAY 132
+DTGS + WV C PC C+ Q +F P++SS+Y+ VPC ++ C
Sbjct: 157 TVEVDTGSDVSWVQCKPCSAPACNSQRDQLFDPAKSSTYSAVPCGADACSELRIYEAGCS 216
Query: 133 KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFK-FSGIFG 191
+C Y Y G T+ ++ L + V +FGCG + F G+
Sbjct: 217 GSQCGYVVSYGDGSNTTGVYGSDTLALAPGNT----VGTFLFGCGHAQAGMFAGIDGLLA 272
Query: 192 LGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFID--- 244
LG SL SQ + FSYC+ S L LG G G AT
Sbjct: 273 LGRQSMSLKSQAAGAYGGVFSYCLPSKQS---AAGYLTLG-GPTSASGFATTGLLTAWAA 328
Query: 245 -GHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIR 303
Y + L ISV G+ + + + F GG ++D+GT +T L AY ALR
Sbjct: 329 PTFYMVMLTGISVGGQQVAVPASAFA-----GGTVVDTGTVITRLPPTAYAALRSAFR-- 381
Query: 304 LEGEQMRSYSWPDK-------LCY----HGIMSSDLKGFPTVRFHFRGGAKLALEKDSMF 352
+ Y +P CY +G+++ PTV F GGA LALE +
Sbjct: 382 ---GAIAPYGYPSAPANGILDTCYDFSRYGVVT-----LPTVALTFSGGATLALEAPGIL 433
Query: 353 YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
+ C+A P N + +++G + Q+ + V +D
Sbjct: 434 -----SSGCLAFAP---NGGDGDAAILGNVQQRSFAVRFD 465
>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
Length = 473
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 102/355 (28%), Positives = 163/355 (45%), Gaps = 51/355 (14%)
Query: 78 MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAY----- 132
+DT S L WV C PC C Q G +F P+ S SYA +PC+S C A SA
Sbjct: 142 VDTASELTWVQCAPCASCHDQQGPLFDPASSPSYAVLPCNSSSCDALQVATGSAAGACGG 201
Query: 133 --KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF-KFSGI 189
+ C YT Y G + ++ ++L+ + VFGCG S F SG+
Sbjct: 202 GEQPSCSYTLSYRDGSYSQGVLAHDKLSLAGE-----VIDGFVFGCGTSNQGPFGGTSGL 256
Query: 190 FGLGIGRSSLVS----QLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQF--- 242
GLG + SL+S Q FSYC+ L + + L+LGD + ++TP+ +
Sbjct: 257 MGLGRSQLSLISQTMDQFGGVFSYCL-PLKESES-SGSLVLGDDTSVYR-NSTPIVYTTM 313
Query: 243 ----IDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALR 297
+ G Y++ L I++ G+ + + S G V++DSGT +T LV Y A++
Sbjct: 314 VSDPVQGPFYFVNLTGITIGGQEV---------ESSAGKVIVDSGTIITSLVPSVYNAVK 364
Query: 298 DEVMIRL-EGEQMRSYSWPDKLCYHGIMSSDLKGF-----PTVRFHFRGGAKLALEKDSM 351
E + + E Q +S D C+ +L GF P+++F F G ++ ++ +
Sbjct: 365 AEFLSQFAEYPQAPGFSILDT-CF------NLTGFREVQIPSLKFVFEGNVEVEVDSSGV 417
Query: 352 FYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
Y D+ + + AS+ + Y S+IG Q+ V +D Q F + C+
Sbjct: 418 LYFVSSDSSQVCLALASLKSEY-ETSIIGNYQQKNLRVIFDTLGSQIGFAQETCD 471
>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 499
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 94/316 (29%), Positives = 153/316 (48%), Gaps = 36/316 (11%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLGT---IFYPSRSSSYAYV 114
L++ + +G P + +DTGS +LW++C C +C S LG F + SS+ A V
Sbjct: 82 LYFTKVKLGSPAKDFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAALV 141
Query: 115 PCDSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
C C Y + CS+ ++C YT Y G T+ + ++ + F V +
Sbjct: 142 SCADPICSYAVQTATSGCSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSMVAN 201
Query: 172 ----VVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHD 216
+VFGC G T + GIFG G G S++SQL+S FS+C L
Sbjct: 202 SSSTIVFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHC---LKG 258
Query: 217 PDYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
+ L+LG+ I++ +PL HY + L++I+V+G++L I+ N+F ++
Sbjct: 259 GENGGGVLVLGE--ILEPSIVYSPLVPSLPHYNLNLQSIAVNGQLLPIDSNVFATTNN-Q 315
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG--FPT 333
G ++DSGT + +LV+EAY D + + S ++ CY + S+ G FP
Sbjct: 316 GTIVDSGTTLAYLVQEAYNPFVDAITAAVSQFSKPIISKGNQ-CY---LVSNSVGDIFPQ 371
Query: 334 VRFHFRGGAKLALEKD 349
V +F GGA + L +
Sbjct: 372 VSLNFMGGASMVLNPE 387
>gi|255566002|ref|XP_002523989.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536716|gb|EEF38357.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 114/437 (26%), Positives = 180/437 (41%), Gaps = 60/437 (13%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
LI RDS SP+ N E A R A++ S+ Q+++ + S
Sbjct: 41 LIRRDSPNSPFYNALEAAATRSTNASQHYDAQIGRFNLMSDSYYASQSEL----NFSKGN 96
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ + IS+G PP D L W+ C C+DC+ G F+PS SS+Y C+S
Sbjct: 97 YLIKISVGTPPAEILALADITGDLTWLPCKTCQDCTKD-GFTFFPSESSTYTSAACESYQ 155
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGP--------ETSVFVSTEQLTFKNTDESTIHVQDV 172
C+ A C CI YL GP V+ + ++F ++ + +
Sbjct: 156 CQITNGAVCQT--KMCI----YLCGPLPQQRSSCTNKGLVAMDTISFHSSSGQALSYPNT 209
Query: 173 VFGCG-FSTNRNFKFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDP-----DYLHN 222
F CG F N ++ +GI GLG G S+ SQ+ N +FS C+ ++
Sbjct: 210 NFICGTFIDNWHYIGAGIVGLGRGLFSMTSQMKHLINGTFSQCLVPYSSKQSSKINFGLK 269
Query: 223 KLILGDGA----IIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVM 278
++ G+G I D+G++ G Y++ LEA+SV G + N + S +
Sbjct: 270 GVVSGEGVVSTPIADDGES-------GAYFLFLEAMSVGGNR--VANNFYSAPKS--NIY 318
Query: 279 IDSGTDVTWLVKEAYEALRDEVM-------IRLEGEQMRSYSWPDKLCYHGIMSSDLKGF 331
ID T T L + YE + EV I E+ S LCY D
Sbjct: 319 IDWRTTFTSLPHDFYENVEAEVRKAINLTPINYNNERKLS------LCYKSESDHDFDA- 371
Query: 332 PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASIN-NRYVNFSLIGMMAQQFYNVG 390
P + HF A + L + F + + C A + N + + ++ G Q + VG
Sbjct: 372 PPITMHFT-NADVQLSPLNTFVRMDWNVVCFAFLDGTFNATKRITHAVYGSWQQMNFIVG 430
Query: 391 YDIGRKQKTFQRMDCEV 407
YD+ +F++ DC +
Sbjct: 431 YDLKSSTVSFKQADCTL 447
>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 500
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 97/319 (30%), Positives = 144/319 (45%), Gaps = 33/319 (10%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
L+Y + +G PP + +DTGS +LWV C C C G F P S++ + V
Sbjct: 82 LYYTRVQLGNPPKDFYVQIDTGSDVLWVSCNSCNGCPATSGLQIPLNFFDPGSSTTASLV 141
Query: 115 PCDSEHCRYFPYAR---CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFK---NTDESTIH 168
C + C + C ++C Y Y G TS + + + ++ ++
Sbjct: 142 SCSDQICALGVQSSDSACFGQSNQCAYVFQYGDGSGTSGYYVMDMIHLDVVIDSSVTSNS 201
Query: 169 VQDVVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDP 217
VVFGC S T + GIFG G S++SQL+S FS+C L
Sbjct: 202 SASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQDLSVISQLSSRGIAPKVFSHC---LKGD 258
Query: 218 DYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
D L+LG+ I++ TPL HY + L++ISV+G++L I+P +F S G
Sbjct: 259 DSGGGILVLGE--IVEPNVVYTPLVPSQPHYNLNLQSISVNGQVLPISPAVFATSSS-QG 315
Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG-FPTVR 335
+IDSGT + +L +EAY A V + + +S CY + SS + FP V
Sbjct: 316 TIIDSGTTLAYLAEEAYNAFVVAVT-NIVSQSTQSVVLKGNRCY--VTSSSVSDIFPQVS 372
Query: 336 FHFRGGAKLALEKDSMFYQ 354
+F GGA L L Q
Sbjct: 373 LNFAGGASLVLGAQDYLIQ 391
>gi|255571588|ref|XP_002526740.1| aspartic-type endopeptidase, putative [Ricinus communis]
gi|223533929|gb|EEF35654.1| aspartic-type endopeptidase, putative [Ricinus communis]
Length = 471
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 115/430 (26%), Positives = 193/430 (44%), Gaps = 36/430 (8%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDIS--N 58
LIH S SP+ PN P ++ + S AR ++ KIRS ++ P + IS +
Sbjct: 47 LIHWSSPESPFYEPNLTPGELMRASVRTSRARGDRIR-KIRSSGISNSRKYPVSRISIID 105
Query: 59 SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYP--CRDCSPQLGTIFYPSRSSSYAYVPC 116
++ + +IG PPV + DTGS+++W+ C C +C Q +F P++SS+YA C
Sbjct: 106 KVYVMKFNIGSPPVETYAIPDTGSNIVWIQCGSPICTNCYKQKIPLFNPTKSSTYAIRLC 165
Query: 117 DSEHCRYF-----PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTF-KNTDESTIHVQ 170
C+ Y C + C Y Y + +ST+ +TF ++ E +
Sbjct: 166 GHRECKQALWGLGEYLGCKSSVQVCRYHISYEDHSFSEGTISTDIITFPEHIAEFGNYSL 225
Query: 171 DVVFGCGFSTNR-------NFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGS--LHDPDYL 220
+ FGCG++ + +F G+ GLG +SLV QL FSYCI + + P+
Sbjct: 226 RMFFGCGYNNSETPGQDPNSFTAPGVVGLGNEMASLVGQLTLGQFSYCISTPDVQKPNGT 285
Query: 221 HNKLILGDGAIIDEGDATPLQFIDGHY-YITLEAISVDGRMLDINPN-IFKRDDSG-GGV 277
++ G A I ++G Y + ++ I VD + P +F+ + G GG+
Sbjct: 286 I-EIRFGLAASISGHSTALANNLEGWYIFQNVDGIYVDDTKVKGYPEWVFQFAEGGIGGL 344
Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLE-GEQMRSYSWPD-KLCYHGIMSSDLKGFPTVR 335
++DSGT T L A +AL E+ ++E + +S + LCY+ + L P +
Sbjct: 345 IMDSGTTYTELYFSALDALIGELKEQIELAPDTQDHSNSNYSLCYNA-ANFLLTYVPAIE 403
Query: 336 FHFRGG--AKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
F A + + D +C+A+ S S+IG+ + +GYD+
Sbjct: 404 LKFTDNKEAYFPFTLRNAWIDNGNDQYCLAMFGTS------GISIIGIYQHRDIKIGYDL 457
Query: 394 GRKQKTFQRM 403
+F M
Sbjct: 458 KYNLVSFTEM 467
>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
gi|224030447|gb|ACN34299.1| unknown [Zea mays]
gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
Length = 512
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 105/395 (26%), Positives = 173/395 (43%), Gaps = 35/395 (8%)
Query: 36 LQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC 95
L E R T ++ + + ++ + +++ +G PP MDTGS L W+ C PC DC
Sbjct: 125 LSESERVVATVESGVA----VGSAEYLMDVYVGTPPRRFQMIMDTGSDLNWLQCAPCLDC 180
Query: 96 SPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAY-------KHRCIYTQLYLIGPET 148
Q G +F P+ SSSY + C C + A + C Y Y +
Sbjct: 181 FEQRGPVFDPAASSSYRNLTCGDPRCGHVAPPEAPAPRACRRPGEDPCPYYYWYGDQSNS 240
Query: 149 SVFVSTEQLTFKNTDE-STIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS- 205
+ ++ E T T ++ V VVFGCG F +G+ GLG G S SQL +
Sbjct: 241 TGDLALESFTVNLTAPGASSRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAV 300
Query: 206 ----SFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFI---------DGHYYITLE 252
+FSYC+ H D + +K++ G+ + L++ D YY+ L
Sbjct: 301 YGGHTFSYCLVD-HGSD-VASKVVFGEDDALALAAHPRLKYTAFAPASSPADTFYYVRLT 358
Query: 253 AISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS 311
+ V G +L+I+ + + + G GG +IDSGT +++ V+ AY+ +R + R+ G
Sbjct: 359 GVLVGGELLNISSDTWDASEGGSGGTIIDSGTTLSYFVEPAYQVIRRAFIDRMSGSYPPV 418
Query: 312 YSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDA-FCMAVNPASIN 370
+P + + + P + F GA ++ F + PD C+AV +
Sbjct: 419 PDFPVLSPCYNVSGVERPEVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAV----LG 474
Query: 371 NRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
S+IG QQ ++V YD+ + F C
Sbjct: 475 TPRTGMSIIGNFQQQNFHVAYDLHNNRLGFAPRRC 509
>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
Length = 429
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 109/380 (28%), Positives = 163/380 (42%), Gaps = 41/380 (10%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCY---------PCRDCSPQLGTIFYPSRSSSY 111
+ V+++ G PP DTGS L+W+ C P + CS + F S+S++
Sbjct: 53 YLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRR--PAFVASKSATL 110
Query: 112 AYVPCDSEHCRYFPYAR-----CS-AYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES 165
+ VPC + C P R CS A C Y Y G T+ F++ + T N
Sbjct: 111 SVVPCSAAQCLLVPAPRGHGPACSPAAPVPCGYAYDYADGSSTTGFLARDTATISNGTSG 170
Query: 166 TIHVQDVVFGCGFSTNRNFKFS---GIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPD 218
V+ V FGCG + N+ FS G+ GLG G+ S +Q S +FSYC+ L
Sbjct: 171 GAAVRGVAFGCG-TRNQGGSFSGTGGVIGLGQGQLSFPAQSGSLFAQTFSYCLLDLEGGR 229
Query: 219 YLHNK--LILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDS 273
+ L LG TPL YY+ + AI V R+L + + + D
Sbjct: 230 RGRSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWAIDVL 289
Query: 274 G-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS---YSWPDKLCYHGIMSSDLK 329
G GG +IDSG+ +T+L AY L + ++ S + +LCY+ SS
Sbjct: 290 GNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQGLELCYNVSSSSSSA 349
Query: 330 ----GFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQ 385
GFP + F G L L + D C+A+ P F+++G + QQ
Sbjct: 350 PANGGFPRLTIDFAQGLSLELPTGNYLVDVADDVKCLAIRP---TLSPFAFNVLGNLMQQ 406
Query: 386 FYNVGYDIGRKQKTFQRMDC 405
Y+V +D + F R +C
Sbjct: 407 GYHVEFDRASARIGFARTEC 426
>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 509
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 112/438 (25%), Positives = 173/438 (39%), Gaps = 51/438 (11%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQ--ILPSN---D 55
++HR SP P + P+ ++ AR+ + I + + LP+
Sbjct: 91 VMHRHGPCSPLQTPGDAPSDADL--LDQDQARVDSILGMITNETSAVGPGVSLPAERGIS 148
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD--CSPQLGTIFYPSRSSSYAY 113
+ + V++ +G P DTGS L WV C PC C Q +F PS SS+++
Sbjct: 149 VGTGNYVVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYKQQDPLFAPSDSSTFSA 208
Query: 114 VPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD-- 171
V C + CR S RC Y +Y T + + LT + ++
Sbjct: 209 VRCGARECRARQSCGGSPGDDRCPYEVVYGDKSRTQGHLGNDTLTLGTMAPANASAENDN 268
Query: 172 ----VVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQ----LNSSFSYCI--GSLHDPDYL 220
VFGCG + F + G+FGLG G+ SL SQ FSYC+ S P YL
Sbjct: 269 KLPGFVFGCGENNTGLFGQADGLFGLGRGKVSLSSQAAGKFGEGFSYCLPSSSSSAPGYL 328
Query: 221 HNKLILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDI-NPNIFKRDDSGGG 276
+ A TP+ YY+ L I V GR + + +P +
Sbjct: 329 SLGTPVPAPA---HAQFTPMLNRTTTPSFYYVKLVGIRVAGRAIRVSSPRVALP------ 379
Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK-------LCYHGIMSSDLK 329
+++DSGT +T L AY ALR + M Y + CY ++
Sbjct: 380 LIVDSGTVITRLAPRAYRALRAAFL-----SAMGKYGYKRAPRLSILDTCYDFTAHANAT 434
Query: 330 -GFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYN 388
P V F GGA ++++ + Y + C+A P N + ++G Q+
Sbjct: 435 VSIPAVALVFAGGATISVDFSGVLYVAKVAQACLAFAP---NGDGRSAGILGNTQQRTLA 491
Query: 389 VGYDIGRKQKTFQRMDCE 406
V YD+ R++ F C
Sbjct: 492 VVYDVARQKIGFAAKGCS 509
>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 476
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 125/417 (29%), Positives = 186/417 (44%), Gaps = 53/417 (12%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
L HRD + P N ++P R + I+ R++ L + S + Q S+ +S +
Sbjct: 75 LFHRDKL--PLNFDPDHP-RRFKERISRDSKRVSSLLRLLSSGSDEQVTDFGSDVVSGTE 131
Query: 61 -----FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
++V I +G PP Q+ +D+GS ++WV C PC +C Q +F P+ S++YA +
Sbjct: 132 QGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVFDPAGSATYAGIS 191
Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
CDS C A C+ RC Y Y G T ++ E LTF + ++++ G
Sbjct: 192 CDSSVCDRLDNAGCN--DGRCRYEVSYGDGSYTRGTLALETLTFGR-----VLIRNIAIG 244
Query: 176 CGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGA 230
CG F +G+ GLG G S V QL +FSYC+ S L G GA
Sbjct: 245 CGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTES--TGTLEFGRGA 302
Query: 231 IIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVT 286
+ PL YY+ L + V G + I IF+ D G GGV++D+GT VT
Sbjct: 303 MPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGTAVT 362
Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL-----CYHGIMSSDLKGF-----PTVRF 336
L AYEA RD + Q + D++ CY+ L GF PTV F
Sbjct: 363 RLPAPAYEAFRDTFI-----GQTANLPRSDRVSIFDTCYN------LNGFVSVRVPTVSF 411
Query: 337 HFRGGAKLAL-EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
+F GG L L ++ + FC A ++ S+IG + Q+ + D
Sbjct: 412 YFSGGPILTLPARNFLIPVDGEGTFCFAFAASA-----SGLSIIGNIQQEGIQISID 463
>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
Length = 472
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 102/355 (28%), Positives = 163/355 (45%), Gaps = 51/355 (14%)
Query: 78 MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAY----- 132
+DT S L WV C PC C Q G +F P+ S SYA +PC+S C A SA
Sbjct: 141 VDTASELTWVQCAPCASCHDQQGPLFDPASSPSYAVLPCNSSSCDALQVATGSAAGACGG 200
Query: 133 --KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF-KFSGI 189
+ C YT Y G + ++ ++L+ + VFGCG S F SG+
Sbjct: 201 GEQPSCSYTLSYRDGSYSQGVLAHDKLSLAGE-----VIDGFVFGCGTSNQGPFGGTSGL 255
Query: 190 FGLGIGRSSLVS----QLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQF--- 242
GLG + SL+S Q FSYC+ L + + L+LGD + ++TP+ +
Sbjct: 256 MGLGRSQLSLISQTMDQFGGVFSYCL-PLKESES-SGSLVLGDDTSVYR-NSTPIVYTTM 312
Query: 243 ----IDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALR 297
+ G Y++ L I++ G+ + + S G V++DSGT +T LV Y A++
Sbjct: 313 VSDPVQGPFYFVNLTGITIGGQEV---------ESSAGKVIVDSGTIITSLVPSVYNAVK 363
Query: 298 DEVMIRL-EGEQMRSYSWPDKLCYHGIMSSDLKGF-----PTVRFHFRGGAKLALEKDSM 351
E + + E Q +S D C+ +L GF P+++F F G ++ ++ +
Sbjct: 364 AEFLSQFAEYPQAPGFSILDT-CF------NLTGFREVQIPSLKFVFEGNVEVEVDSSGV 416
Query: 352 FYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
Y D+ + + AS+ + Y S+IG Q+ V +D Q F + C+
Sbjct: 417 LYFVSSDSSQVCLALASLKSEY-ETSIIGNYQQKNLRVIFDTLGSQIGFAQETCD 470
>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 507
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 96/323 (29%), Positives = 144/323 (44%), Gaps = 36/323 (11%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
L++ + +G PP + +DTGS +LWV C C C G T F P S++ A V
Sbjct: 83 LYFTRVQLGSPPKDFYVQIDTGSDVLWVSCSSCNGCPVTSGLQIPLTFFDPGSSTTAALV 142
Query: 115 PCDSEHCRYFPYAR---CSAYKHRCIYTQLYLIGPETS------------VFVSTEQLT- 158
C + C + CS+ ++C YT Y G TS + +S+ +L+
Sbjct: 143 SCSDQRCTAGIQSSDSLCSSRTNQCGYTFQYGDGSGTSGYYVADLMHLDTLLLSSGELSQ 202
Query: 159 FKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIG 212
T +S++ G T + GIFG G S++SQL S FS+C
Sbjct: 203 ICQTYDSSVSFMCSTLQTGDLTKSDRAVDGIFGFGQQEMSVISQLASQGITPRVFSHC-- 260
Query: 213 SLHDPDYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRD 271
L D L+LG+ I++ TPL HY + L++ISV G+ L I+P++F
Sbjct: 261 -LKGDDSGGGVLVLGE--IVEPNIVYTPLVPSQPHYNLYLQSISVAGQTLAIDPSVFGA- 316
Query: 272 DSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF 331
S G ++DSGT + +L + AY+ + + R+Y CY + SS F
Sbjct: 317 SSNQGTIVDSGTTLAYLAEGAYDPFVSAIT-SVVSLNARTYLSKGNQCYL-VTSSVNDVF 374
Query: 332 PTVRFHFRGGAKLALEKDSMFYQ 354
P V +F GGA L L Q
Sbjct: 375 PQVSLNFAGGASLILNPQDYLLQ 397
>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
Length = 436
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 121/423 (28%), Positives = 188/423 (44%), Gaps = 56/423 (13%)
Query: 19 AHRVQRGINISIARLAYLQEKIRS-------HNTYQAQILPSNDIS-NSLFY-VNISIGQ 69
+++R + + R+ LQ KI++ + + QI ++ I SL Y V + +G
Sbjct: 36 GKKMRRALVLDNIRVQSLQLKIKAMTSSTTEQSVSETQIPLTSGIKLESLNYIVTVELGG 95
Query: 70 PPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARC 129
+ +DTGS L WV C PCR C Q G ++ PS SSSY V C+S C+ A
Sbjct: 96 KNMSLI--VDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATS 153
Query: 130 SA---------YKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
++ K C Y Y G T +++E + +T +++ VFGCG
Sbjct: 154 NSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDT-----KLENFVFGCG-RN 207
Query: 181 NRNFKFSGIFGLGIGRS--SLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
N+ +G+GRS SLVSQ N FSYC+ SL D L G+ + +
Sbjct: 208 NKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDG--ASGSLSFGNDSSVYT 265
Query: 235 GDA----TPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
TPL + Y + L S+ G L K G G++IDSGT +T
Sbjct: 266 NSTSVSYTPLVQNPQLRSFYILNLTGASIGGVEL-------KSSSFGRGILIDSGTVITR 318
Query: 288 LVKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
L Y+A++ E + + G YS D C++ D+ P ++ F+G A+L +
Sbjct: 319 LPPSIYKAVKIEFLKQFSGFPTAPGYSILDT-CFNLTSYEDIS-IPIIKMIFQGNAELEV 376
Query: 347 EKDSMFYQPRPDA--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
+ +FY +PDA C+A+ S N +IG Q+ V YD +++ +
Sbjct: 377 DVTGVFYFVKPDASLVCLALASLSYENE---VGIIGNYQQKNQRVIYDTTQERLGIVGEN 433
Query: 405 CEV 407
C V
Sbjct: 434 CRV 436
>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 112/361 (31%), Positives = 175/361 (48%), Gaps = 29/361 (8%)
Query: 57 SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC---RDCSPQLGTIFYPSRSSSYAY 113
S + + I +GQP + DTGS + W+ C PC C Q IF P SSSY+
Sbjct: 144 SGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSP 203
Query: 114 VPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
+ C+S+ C+ A C++ CIY Y G T+ ++TE L+F N++ + ++
Sbjct: 204 LSCNSQQCKLLDKANCNS--DTCIYQVHYGDGSFTTGELATETLSFGNSNS----IPNLP 257
Query: 174 FGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAI 231
GCG F +G+ GLG G SL SQL SSFSYC+ +L D + + + +
Sbjct: 258 IGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNL---DSDSSSTLEFNSNM 314
Query: 232 IDEGDATPLQFID---GHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTW 287
+ +PL D + Y+ + ISV G+ L I+P F+ D+SG GG+++DSGT ++
Sbjct: 315 PSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDSGTIISR 374
Query: 288 LVKEAYEALRDEVMIRLEGEQMRS--YSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLA 345
L + YE+LR E ++L + S D CY+ S+++ PT+ F G L
Sbjct: 375 LPSDVYESLR-EAFVKLTSSLSPAPGISVFDT-CYNFSGQSNVE-VPTIAFVLSEGTSLR 431
Query: 346 L-EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
L ++ + +C+A I + + S+IG QQ V YD+ F
Sbjct: 432 LPARNYLIMLDTAGTYCLAF----IKTKS-SLSIIGSFQQQGIRVSYDLTNSLVGFSTNK 486
Query: 405 C 405
C
Sbjct: 487 C 487
>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
Length = 632
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 103/380 (27%), Positives = 170/380 (44%), Gaps = 56/380 (14%)
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
++N + + IG PP +D+GS++ +V C C C F P SSSY+ V
Sbjct: 84 LTNGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSSYSPVK 143
Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
C+ + C + K +C Y + Y +S + + ++F ES + Q VFG
Sbjct: 144 CNVD-------CTCDSDKKQCTYERQYAEMSSSSGVLGEDIVSFGR--ESELKPQRAVFG 194
Query: 176 CGFSTNRNFKFS----GIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLI 225
C S + FS GI GLG G+ S++ QL + SFS C G +
Sbjct: 195 CENSETGDL-FSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMD---------- 243
Query: 226 LGDGAIIDEGDATPLQFIDGH--------YYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
+G GA++ G P + H Y I L+ I V G+ L ++ +F +S G
Sbjct: 244 IGGGAMVLGGVPAPSDMVFSHSDPLRSPYYNIELKEIHVAGKALRVDSRVF---NSKHGT 300
Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK----LCYHGI---MSSDLKG 330
++DSGT +L ++A+ A +D V ++ ++ PD +C+ G +S +
Sbjct: 301 VLDSGTTYAYLPEQAFVAFKDAVTSKV--HSLKKIRGPDPNYKDICFAGAGRNVSKLHEV 358
Query: 331 FPTVRFHFRGGAKLALEKDS-MFYQPRPD-AFCMAVNPASINNRYVNFSLIGMMAQQFYN 388
FP V F G KL+L ++ +F + D A+C+ V N +L+G + +
Sbjct: 359 FPDVDMVFGNGQKLSLTPENYLFRHSKVDGAYCLGV----FQNGKDPTTLLGGIIVRNTL 414
Query: 389 VGYDIGRKQKTFQRMDCEVL 408
V YD ++ F + +C L
Sbjct: 415 VTYDRHNEKIGFWKTNCSEL 434
>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 500
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 101/376 (26%), Positives = 174/376 (46%), Gaps = 41/376 (10%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLGT---IFYPSRSSSYAYV 114
L++ + +G P + +DTGS +LW++C C +C S LG F + SS+ A V
Sbjct: 82 LYFTKVKLGSPAKEFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAALV 141
Query: 115 PCDSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT--DESTI-- 167
C C Y + CS+ ++C YT Y G T+ + ++ + F +S +
Sbjct: 142 SCGDPICSYAVQTATSECSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSVVAN 201
Query: 168 HVQDVVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHD 216
++FGC G T + GIFG G G S++SQL+S FS+C L
Sbjct: 202 SSSTIIFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHC---LKG 258
Query: 217 PDYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
+ L+LG+ I++ +PL HY + L++I+V+G++L I+ N+F ++
Sbjct: 259 GENGGGVLVLGE--ILEPSIVYSPLVPSQPHYNLNLQSIAVNGQLLPIDSNVFATTNN-Q 315
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG--FPT 333
G ++DSGT + +LV+EAY + + S ++ CY + S+ G FP
Sbjct: 316 GTIVDSGTTLAYLVQEAYNPFVKAITAAVSQFSKPIISKGNQ-CY---LVSNSVGDIFPQ 371
Query: 334 VRFHFRGGAKLALEKDS--MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
V +F GGA + L + M Y A + + F+++G + + Y
Sbjct: 372 VSLNFMGGASMVLNPEHYLMHYGFLDGAAMWCIGFQKVEQ---GFTILGDLVLKDKIFVY 428
Query: 392 DIGRKQKTFQRMDCEV 407
D+ ++ + DC +
Sbjct: 429 DLANQRIGWADYDCSL 444
>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
Length = 480
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 102/401 (25%), Positives = 178/401 (44%), Gaps = 43/401 (10%)
Query: 38 EKIRSHNTYQ-AQILPSNDI---------SNSLFYVNISIGQPPVPQFTAMDTGSSLLWV 87
+ +R+H+T + +IL + D+ L++ I IG P + +DTGS +LWV
Sbjct: 41 DALRAHDTRRHGRILSAVDLPLGGNGHPSEAGLYFAKIGIGTPSKDYYVQVDTGSDILWV 100
Query: 88 HCYPCRDCSPQ--LG---TIFYPSRSSSYAYVPCDSEHCRYF--PYARCSAYKHRCIYTQ 140
+C C C + LG T++ S++ V CD C + P C +C+Y+
Sbjct: 101 NCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVGCDDNFCSLYDGPLPGCKP-GLQCLYSV 159
Query: 141 LYLIGPETSVFVSTEQLTFKNTD---ESTIHVQDVVFGCGFSTNRNFKFS-----GIFGL 192
LY G T+ + + + + ++T VVFGCG + S GI G
Sbjct: 160 LYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGF 219
Query: 193 GIGRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFIDG 245
G SS++SQL SS FS+C+ D + I G +++ + + TPL
Sbjct: 220 GQANSSMLSQLASSGKVKKVFSHCL------DNVDGGGIFAIGEVVEPKVNITPLVQNQA 273
Query: 246 HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLE 305
HY + ++ I V G LD+ + F+ D G +IDSGT + + +E Y L ++++ +
Sbjct: 274 HYNVVMKEIEVGGDPLDVPSDAFESGDR-KGTIIDSGTTLAYFPQEVYVPLIEKILSQQP 332
Query: 306 GEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAV- 364
++ + C+ + D GFPTV HF L + +Q + +C+
Sbjct: 333 DLRLHTVEQA-FTCFDYTGNVD-DGFPTVTLHFDKSISLTVYPHEYLFQVKEFEWCIGWQ 390
Query: 365 NPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
N + + +L+G + V YD+ ++ + +C
Sbjct: 391 NSGAQTKDGKDLTLLGDLVLSNKLVVYDLEKQGIGWVEYNC 431
>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
Length = 493
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 98/368 (26%), Positives = 164/368 (44%), Gaps = 48/368 (13%)
Query: 40 IRSHN-TYQAQILPSNDI---------SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC 89
+R+H+ T ++L + D+ L+Y + +G PP + +DTGS +LWV+C
Sbjct: 57 LRAHDGTRHGRLLATADLPLGGLGLPTDTGLYYTEVRLGTPPKRFYVQVDTGSDILWVNC 116
Query: 90 YPCRDCSPQLG-----TIFYPSRSSSYAYVPCDSEHCRYFPYAR---CSAYKHRCIYTQL 141
C C + G T++ P SS+ + V CD C R CSA C Y+
Sbjct: 117 ITCDQCPHKSGLGLDLTLYDPKASSTGSTVMCDQGFCADTFGGRLPKCSA-NVPCEYSVT 175
Query: 142 YLIGPETSVFVSTEQLTFKNT---DESTIHVQDVVFGCGFSTNRNFKFS-----GIFGLG 193
Y G T + L F ++ V+FGCG + S GI G G
Sbjct: 176 YGDGSSTVGSFVNDALQFDQVTGDGQTQPANASVIFGCGAQQGGDLGSSSQALDGILGFG 235
Query: 194 IGRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFIDGH 246
+S++SQL ++ F++C+ D + I G ++ + TPL H
Sbjct: 236 EANTSMLSQLATAGKVKKIFAHCL------DTIKGGGIFAIGDVVQPKVKTTPLVADKPH 289
Query: 247 YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRL-- 304
Y + L+ I V G L++ +IFK + G +IDSGT +T+L + ++ +VM+ +
Sbjct: 290 YNVNLKTIDVGGTTLELPADIFKPGEK-RGTIIDSGTTLTYLPELVFK----KVMLAVFN 344
Query: 305 EGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAV 364
+ + + + D LC+ S D GFPT+ FHF L + F+ D +C+
Sbjct: 345 KHQDITFHDVQDFLCFEYSGSVD-DGFPTLTFHFEDDLALHVYPHEYFFPNGNDVYCVGF 403
Query: 365 NPASINNR 372
++ ++
Sbjct: 404 QNGALQSK 411
>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
Length = 478
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 88/326 (26%), Positives = 151/326 (46%), Gaps = 36/326 (11%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLG---TIFYPSRSSSYAYV 114
L++ I +G P + +DTGS +LWV+C C C +G T++ P RS + +V
Sbjct: 68 LYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFV 127
Query: 115 PCDSEHCRYFPYAR---CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD---ESTIH 168
C+ C R C A ++ C Y+ Y G T+ + + LTF + +
Sbjct: 128 SCEHNFCSSTYEGRILGCKA-ENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNPHTATQ 186
Query: 169 VQDVVFGCG------FSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHD 216
++FGCG F+++ GI G G SS++SQL +S FS+C+
Sbjct: 187 NSSIIFGCGAAQSGTFASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCL----- 241
Query: 217 PDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
D I G +++ + TPL HY + L+ I VDG +L + + F ++ G
Sbjct: 242 -DTNVGGGIFSIGEVVEPKVKTTPLVPNMAHYNVILKNIEVDGDILQLPSDTFDSEN-GK 299
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL-CYHGIMSSDLKGFPTV 334
G +IDSGT + +L + Y+ L +V+ + +++ Y ++ C+ + D GFP V
Sbjct: 300 GTVIDSGTTLAYLPRIVYDQLMSKVLAKQ--PRLKVYLVEEQYSCFQYTGNVD-SGFPIV 356
Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAF 360
+ HF L + + + D++
Sbjct: 357 KLHFEDSLSLTVYPHDYLFNYKGDSY 382
>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
Length = 339
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 101/366 (27%), Positives = 155/366 (42%), Gaps = 51/366 (13%)
Query: 67 IGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPY 126
+G PP P ++ G+ L+W H P +C Q F P S R P+
Sbjct: 1 MGTPPNPVKLKLENGNELIWNHSNPSPECFEQAFPYFEPLTFS------------RGLPF 48
Query: 127 ARCSAYK----HRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNR 182
A C + K C+YT Y T+ F+ ++ TF S V V FGCG N
Sbjct: 49 ASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGAS---VPGVAFGCGLFNNG 105
Query: 183 NFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDP-------DYLHNKLILGDGAII 232
FK +GI G G G SL SQL +FS+C ++ D + G GA+
Sbjct: 106 VFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTTITGAIPSTVLLDLPADLFSNGQGAV- 164
Query: 233 DEGDATPL-QFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVT 286
TPL Q+ YY++L+ I+V L + + F + GG +IDSGT +T
Sbjct: 165 ---QTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALTNGTGGTIIDSGTSIT 221
Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
L + Y+ +RDE +++ + + C+ S P + HF GA + L
Sbjct: 222 SLPPQVYQVVRDEFAAQIKLPVVPGNATGHYTCFSA-PSQAKPDVPKLVLHFE-GATMDL 279
Query: 347 EKDSMFYQPRPDA----FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
+++ ++ DA C+A+N ++IG QQ +V YD+ +F
Sbjct: 280 PRENYVFEVPDDAGNSIICLAINKGD------ETTIIGNFQQQNMHVLYDLQNNMLSFVA 333
Query: 403 MDCEVL 408
C+ L
Sbjct: 334 AQCDKL 339
>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 629
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 101/380 (26%), Positives = 167/380 (43%), Gaps = 56/380 (14%)
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
++N + + IG PP +D+GS++ +V C C C F P SS+Y+ V
Sbjct: 80 LTNGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVK 139
Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
C ++ C + K +C Y + Y +S + + ++F ES + Q VFG
Sbjct: 140 CSAD-------CTCDSDKSQCTYERQYAEMSSSSGVLGEDIVSFGT--ESELKPQRAVFG 190
Query: 176 CGFSTNRNFKFS----GIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLI 225
C S + FS GI GLG G+ S++ QL SFS C G +
Sbjct: 191 CENSETGDL-FSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMD---------- 239
Query: 226 LGDGAIIDEGDATPLQFI--------DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
+G GA++ P + +Y I L+ I V G+ L ++P IF DS G
Sbjct: 240 IGGGAMVLGAMPAPPDMVFSRSDPVRSPYYNIELKEIHVAGKALRLDPRIF---DSKHGT 296
Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK----LCYHGI---MSSDLKG 330
++DSGT +L ++A+ A +D V ++ ++ PD +C+ G +S +
Sbjct: 297 VLDSGTTYAYLPEQAFVAFKDAVTSKV--RPLKKIRGPDPNYKDICFAGAGRNVSQLSQA 354
Query: 331 FPTVRFHFRGGAKLALEKDSMFYQ--PRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYN 388
FP V F G KL+L ++ ++ A+C+ V N +L+G + +
Sbjct: 355 FPDVDMVFGDGQKLSLSPENYLFRHSKVEGAYCLGV----FQNGKDPTTLLGGIVVRNTL 410
Query: 389 VGYDIGRKQKTFQRMDCEVL 408
V YD ++ F + +C L
Sbjct: 411 VTYDRHNEKIGFWKTNCSEL 430
>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 452
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 103/366 (28%), Positives = 162/366 (44%), Gaps = 36/366 (9%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSYAYVPCDSE 119
+YV + +G P +DTGSS W+ C PC C Q +F PS S +Y VPC S
Sbjct: 103 YYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCTIYCHIQEDPVFNPSASKTYKTVPCSSS 162
Query: 120 HCRYFPYA-----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
C A CS + C+Y Y + ++S + LT + + V+
Sbjct: 163 QCSSLKSATLNEPTCSKQSNACVYKASYGDSSFSLGYLSQDVLTLTPSQT----LSSFVY 218
Query: 175 GCGFSTNRNF-KFSGIFGLGIGRSSLVSQLN----SSFSYCI-GSLHDPDYLHNKLI-LG 227
GCG F + GI GL S++SQL+ ++FSYC+ S P+ + +G
Sbjct: 219 GCGQDNQGLFGRTDGIIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSPKEGFLSIG 278
Query: 228 DGAIIDEGDA--TPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSG 282
++ TPL + Y+I LE+I+V GR L + + +K +IDSG
Sbjct: 279 TSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVP-----TIIDSG 333
Query: 283 TDVTWLVKEAYEALRDEVMIRLEG--EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRG 340
T +T L Y L++ + L +Q S D C+ G ++ + P +R F+G
Sbjct: 334 TVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDT-CFKGSLAGISEVAPDIRIIFKG 392
Query: 341 GAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
GA L L+ + + C+A+ +S + ++IG QQ V YD+G + F
Sbjct: 393 GADLQLKGHNSLVELETGITCLAMAGSS------SIAIIGNYQQQTVKVAYDVGNSRVGF 446
Query: 401 QRMDCE 406
C+
Sbjct: 447 APGGCQ 452
>gi|356553263|ref|XP_003544977.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 445
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 93/372 (25%), Positives = 164/372 (44%), Gaps = 40/372 (10%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
V + IG PP PQ +DTGS L W+ C+ + +P + F PS SSS+ +PC
Sbjct: 88 LVVTLPIGTPPQPQQMVLDTGSQLSWIQCH---NKTPPTAS-FDPSLSSSFYVLPCTHPL 143
Query: 121 CRY----FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
C+ F C Y+ Y G + E+L F + + ++ GC
Sbjct: 144 CKPRVPDFTLPTTCDQNRLCHYSYFYADGTYAEGNLVREKLAFSPSQTT----PPLILGC 199
Query: 177 GFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
S +R+ + GI G+ +GR S Q + FSYC+ + + +N G + +
Sbjct: 200 S-SESRDAR--GILGMNLGRLSFPFQAKVTKFSYCVPTRQPAN--NNNFPTGSFYLGNNP 254
Query: 236 DATPLQFIDG---------------HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMI 279
++ +++ Y + ++ I + GR L+I P++F+ + G G M+
Sbjct: 255 NSARFRYVSMLTFPQSQRMPNLDPLAYTVPMQGIRIGGRKLNIPPSVFRPNAGGSGQTMV 314
Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS---YSWPDKLCYHGIMSSDLKGFPTVRF 336
DSG++ T+LV AY+ +R+E+ IR+ G +++ Y +C+ G + V F
Sbjct: 315 DSGSEFTFLVDVAYDRVREEI-IRVLGPRVKKGYVYGGVADMCFDGNAMEIGRLLGDVAF 373
Query: 337 HFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
F G ++ + K+ + C+ + + N +IG QQ V +D+ +
Sbjct: 374 EFEKGVEIVVPKERVLADVGGGVHCVGIGRSERLGAASN--IIGNFHQQNLWVEFDLANR 431
Query: 397 QKTFQRMDCEVL 408
+ F DC L
Sbjct: 432 RIGFGVADCSRL 443
>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 484
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 121/423 (28%), Positives = 188/423 (44%), Gaps = 56/423 (13%)
Query: 19 AHRVQRGINISIARLAYLQEKIRS-------HNTYQAQILPSNDIS-NSLFY-VNISIGQ 69
+++R + + R+ LQ KI++ + + QI ++ I SL Y V + +G
Sbjct: 84 GKKMRRALVLDNIRVQSLQLKIKAMTSSTTEQSVSETQIPLTSGIKLESLNYIVTVELGG 143
Query: 70 PPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARC 129
+ +DTGS L WV C PCR C Q G ++ PS SSSY V C+S C+ A
Sbjct: 144 KNMSLI--VDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATS 201
Query: 130 SA---------YKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
++ K C Y Y G T +++E + +T +++ VFGCG
Sbjct: 202 NSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDT-----KLENFVFGCG-RN 255
Query: 181 NRNFKFSGIFGLGIGRS--SLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
N+ +G+GRS SLVSQ N FSYC+ SL D L G+ + +
Sbjct: 256 NKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDG--ASGSLSFGNDSSVYT 313
Query: 235 GDA----TPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
TPL + Y + L S+ G L K G G++IDSGT +T
Sbjct: 314 NSTSVSYTPLVQNPQLRSFYILNLTGASIGGVEL-------KSSSFGRGILIDSGTVITR 366
Query: 288 LVKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
L Y+A++ E + + G YS D C++ D+ P ++ F+G A+L +
Sbjct: 367 LPPSIYKAVKIEFLKQFSGFPTAPGYSILDT-CFNLTSYEDIS-IPIIKMIFQGNAELEV 424
Query: 347 EKDSMFYQPRPDA--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
+ +FY +PDA C+A+ S N +IG Q+ V YD +++ +
Sbjct: 425 DVTGVFYFVKPDASLVCLALASLSYENE---VGIIGNYQQKNQRVIYDTTQERLGIVGEN 481
Query: 405 CEV 407
C V
Sbjct: 482 CRV 484
>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
Length = 479
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 106/369 (28%), Positives = 158/369 (42%), Gaps = 48/369 (13%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSP---QLGTIFYPSRSSSYAYVPCD 117
+ +++ +G P + Q +DTGS + WV C PC SP G +F P+ SS+YA C
Sbjct: 135 YVISVGLGSPAMTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFNCS 194
Query: 118 SEHCRYFPYA----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
+ C + C A K RC Y Y G T+ S++ LT +D V+
Sbjct: 195 AAACAQLGDSGEANGCDA-KSRCQYIVKYGDGSNTTGTYSSDVLTLSGSDV----VRGFQ 249
Query: 174 FGCG---FSTNRNFKFSGIFGLGIGRSSLVSQLNS----SFSYCI-GSLHDPDYLH-NKL 224
FGC + K G+ GLG SLVSQ + SFSYC+ + +L
Sbjct: 250 FGCSHAELGAGMDDKTDGLIGLGGDAQSLVSQTAARYGKSFSYCLPATPASSGFLTLGAP 309
Query: 225 ILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDS 281
G G TP+ + + +Y+ LE I+V G+ L ++P++F G ++DS
Sbjct: 310 ASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLSPSVFA-----AGSLVDS 364
Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL-----CYHGIMSSDLKGFPTVRF 336
GT +T L AY AL M Y+ + L C++ D PTV
Sbjct: 365 GTVITRLPPAAYAALSSAFR-----AGMTRYARAEPLGILDTCFN-FTGLDKVSIPTVAL 418
Query: 337 HFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
F GGA + L+ + C+A P + F IG + Q+ + V YD+G
Sbjct: 419 VFAGGAVVDLDAHGIV-----SGGCLAFAPTRDDK---AFGTIGNVQQRTFEVLYDVGGG 470
Query: 397 QKTFQRMDC 405
F+ C
Sbjct: 471 VFGFRAGAC 479
>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
Length = 452
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 103/366 (28%), Positives = 162/366 (44%), Gaps = 36/366 (9%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSYAYVPCDSE 119
+YV + +G P +DTGSS W+ C PC C Q +F PS S +Y VPC S
Sbjct: 103 YYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCTIYCHIQEDPVFNPSASKTYKTVPCSSS 162
Query: 120 HCRYFPYA-----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
C A CS + C+Y Y + ++S + LT + + V+
Sbjct: 163 QCSSLKSATLNEPTCSKQSNACVYKASYGDSSFSLGYLSQDVLTLTPSQT----LSSFVY 218
Query: 175 GCGFSTNRNF-KFSGIFGLGIGRSSLVSQLN----SSFSYCI-GSLHDPDYLHNKLI-LG 227
GCG F + GI GL S++SQL+ ++FSYC+ S P+ + +G
Sbjct: 219 GCGQDNQGLFGRTDGIIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSPKEGFLSIG 278
Query: 228 DGAIIDEGDA--TPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSG 282
++ TPL + Y+I LE+I+V GR L + + +K +IDSG
Sbjct: 279 TSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVP-----TIIDSG 333
Query: 283 TDVTWLVKEAYEALRDEVMIRLEG--EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRG 340
T +T L Y L++ + L +Q S D C+ G ++ + P +R F+G
Sbjct: 334 TVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDT-CFKGSLAGISEVAPDIRIIFKG 392
Query: 341 GAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
GA L L+ + + C+A+ +S + ++IG QQ V YD+G + F
Sbjct: 393 GADLQLKGHNSLVELETGITCLAMAGSS------SIAIIGNYQQQTVKVAYDVGNSRVGF 446
Query: 401 QRMDCE 406
C+
Sbjct: 447 APGGCQ 452
>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 449
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 101/324 (31%), Positives = 152/324 (46%), Gaps = 47/324 (14%)
Query: 63 VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHC- 121
V++++G PP +DTGS L W+HC ++ S T F P SSSY+ +PC S C
Sbjct: 75 VSLTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSST-FNPVWSSSYSPIPCSSSTCT 133
Query: 122 ---RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG- 177
R FP C T Y + ++T+ ++ + +VVFGC
Sbjct: 134 DQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSSG-----IPNVVFGCMD 188
Query: 178 --FSTN--RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDG--- 229
FS+N + K +G+ G+ G S VSQ+ FSYCI S +D L+LGD
Sbjct: 189 SIFSSNSEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCI-SEYD---FSGLLLLGDANFS 244
Query: 230 --------AIIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMI 279
+I+ +TPL + D Y + LE I V ++L I ++F+ D +G G M+
Sbjct: 245 WLAPLNYTPLIEM--STPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAGQTMV 302
Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIM---------SSDLKG 330
DSGT T+L+ AY ALRD + + G +R Y + G M + L
Sbjct: 303 DSGTQFTFLLGPAYTALRDHFLNKTAGS-LRVYE-DSNFVFQGAMDLCYRVPTNQTRLPP 360
Query: 331 FPTVRFHFRGGAKLALEKDSMFYQ 354
P+V FR GA++ + D + Y+
Sbjct: 361 LPSVTLVFR-GAEMTVTGDRILYR 383
>gi|125533812|gb|EAY80360.1| hypothetical protein OsI_35532 [Oryza sativa Indica Group]
Length = 428
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 99/384 (25%), Positives = 162/384 (42%), Gaps = 43/384 (11%)
Query: 35 YLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD 94
Y+ K +T Q+ + SL+ +++ +G P Q +DTGSS WV C C
Sbjct: 56 YISNKTSRLSTQAVQVGWDRGLQTSLYVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDG 114
Query: 95 CSPQLGTIFYPSRSSSYAYVPCDSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVF 151
C T F SRS++ A V C + C P+ + S C + Y G +
Sbjct: 115 CHTNPRT-FLQSRSTTCAKVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGI 173
Query: 152 VSTEQLTFKNTDESTIHVQDVVFGC---GFSTNRNFKFSGIFGLGIGRSSLVSQLN---S 205
+ + LTF + + + FGC F N G+ G+G G S++ Q +
Sbjct: 174 LYQDTLTFSDVQK----IPSFTFGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFD 229
Query: 206 SFSYCIGSLHDPDYLHNKLI--LGDGAIIDEGDATPLQFIDGH-----YYITLEAISVDG 258
FSYC+ +K G + D + + +++ L AISVDG
Sbjct: 230 GFSYCLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDG 289
Query: 259 RMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRD---EVMIRLEGEQMRSYSWP 315
L ++P+IF R GV+ DSG++++++ A L E+++R + S
Sbjct: 290 ERLGLSPSIFSRK----GVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEES---- 341
Query: 316 DKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQ---PRPDAFCMAVNPASINNR 372
++ CY + S D P + HF GA+ L +F + D +C+A P
Sbjct: 342 ERNCYD-MRSVDEGDMPAISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTE---- 396
Query: 373 YVNFSLIGMMAQQFYNVGYDIGRK 396
+ S+IG + Q V YD+ R+
Sbjct: 397 --SVSIIGSLMQTSKEVVYDLKRQ 418
>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 112/361 (31%), Positives = 175/361 (48%), Gaps = 29/361 (8%)
Query: 57 SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC---RDCSPQLGTIFYPSRSSSYAY 113
S + + I +GQP + DTGS + W+ C PC C Q IF P SSSY+
Sbjct: 144 SGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSP 203
Query: 114 VPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
+ C+S+ C+ A C++ CIY Y G T+ ++TE L+F N++ + ++
Sbjct: 204 LSCNSQQCKLLDKANCNS--DTCIYQVHYGDGSFTTGELATETLSFGNSNS----IPNLP 257
Query: 174 FGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAI 231
GCG F +G+ GLG G SL SQL SSFSYC+ +L D + + + +
Sbjct: 258 IGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNL---DSDSSSTLEFNSYM 314
Query: 232 IDEGDATPLQFID---GHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTW 287
+ +PL D + Y+ + ISV G+ L I+P F+ D+SG GG+++DSGT ++
Sbjct: 315 PSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDSGTIISR 374
Query: 288 LVKEAYEALRDEVMIRLEGEQMRS--YSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLA 345
L + YE+LR E ++L + S D CY+ S+++ PT+ F G L
Sbjct: 375 LPSDVYESLR-EAFVKLTSSLSPAPGISVFDT-CYNFSGQSNVE-VPTIAFVLSEGTSLR 431
Query: 346 L-EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
L ++ + +C+A I + + S+IG QQ V YD+ F
Sbjct: 432 LPARNYLIMLDTAGTYCLAF----IKTK-SSLSIIGSFQQQGIRVSYDLTNSIVGFSTNK 486
Query: 405 C 405
C
Sbjct: 487 C 487
>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
Length = 452
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 112/411 (27%), Positives = 169/411 (41%), Gaps = 35/411 (8%)
Query: 22 VQRGINISIARLAYL-----QEKIRSHNTYQ--AQILPSNDISNSLFYVNISIGQPPVPQ 74
++R + S AR A L + + N Q A +LP + + V+++IG PP P
Sbjct: 50 IRRAMRRSKARAAALSAVRNRARFSGKNEQQTPAGVLPVRPSGDLEYVVDLAIGTPPQPV 109
Query: 75 FTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKH 134
+DTGS L+W C PC C Q +F P +S+SY + C C + C
Sbjct: 110 SALLDTGSDLIWTQCAPCASCLSQPDPLFAPGQSASYEPMRCAGTLCSDILHHSCE-RPD 168
Query: 135 RCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV--FGCG-FSTNRNFKFSGIFG 191
C Y Y G T +TE+ TF ++ + V FGCG + SGI G
Sbjct: 169 TCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVPLGFGCGSVNVGSLNNGSGIVG 228
Query: 192 LGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDA------TPLQFID 244
G SLVSQL+ FSYC+ S + L+ G + GDA TPL
Sbjct: 229 FGRNPLSLVSQLSIRRFSYCLTSYA--SRRQSTLLFGSLSDGVYGDATGRVQTTPLLQSP 286
Query: 245 GH---YYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWL----VKEAYEAL 296
+ YY+ ++V R L I + F R D GGV++DSGT +T L + E A
Sbjct: 287 QNPTFYYVHFTGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALTLLPAAVLAEVVRAF 346
Query: 297 RDEVMIRLE--GEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQ 354
R ++ + G + + S+ P + HF+G ++ +
Sbjct: 347 RQQLRLPFANGGNPEDGVCFLVPAAWRRSSSTSQMPVPRMVLHFQGADLDLPRRNYVLDD 406
Query: 355 PRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
R C+ + + + S IG + QQ V YD+ + + C
Sbjct: 407 HRRGRLCLLLADSGDDG-----STIGNLVQQDMRVLYDLEAETLSIAPARC 452
>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
Length = 470
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 103/328 (31%), Positives = 141/328 (42%), Gaps = 44/328 (13%)
Query: 48 AQILPSN---DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR--DCSPQLGTI 102
A +P+N DI S + V S+G P + Q +DTGS L WV C PC C Q +
Sbjct: 121 AATVPANWGYDIGTSNYVVTASLGTPGMAQTLEVDTGSDLSWVQCKPCAAPSCYRQKDPL 180
Query: 103 FYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT 162
F P++SSSYA VPC C + +C Y Y G T+ S++ LT
Sbjct: 181 FDPAQSSSYAAVPCGRSACAGLGIYASACSAAQCGYVVSYGDGSNTTGVYSSDTLTL--- 237
Query: 163 DESTIHVQDVVFGCGFSTNRNFKFSGIFG-LGIGRS--SLVSQLNSS----FSYCIGSLH 215
+ VQ +FGCG + + F+GI G LG GR SLV Q + FSYC L
Sbjct: 238 -AANATVQGFLFGCGHAQSGGL-FTGIDGLLGFGREQPSLVQQTAGAYGGVFSYC---LP 292
Query: 216 DPDYLHNKLILGDGAIIDEGDAT----PLQFIDGHYYITLEAISVDGRMLDINPNIFKRD 271
L LG + + G +T P +Y + L ISV G+ L + + F
Sbjct: 293 TKSSTTGYLTLGGPSGVAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQPLSVPASAFAA- 351
Query: 272 DSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS--DLK 329
G ++D+GT +T L AY ALR M SY + GI+ +
Sbjct: 352 ----GTVVDTGTVITRLPPAAYAALRSAFR-----SGMASYPSAPPI---GILDTCYSFA 399
Query: 330 GFPTVR-----FHFRGGAKLALEKDSMF 352
G+ TV F GA + L D +
Sbjct: 400 GYGTVNLTSVALTFSSGATMTLGADGIM 427
>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 99/348 (28%), Positives = 155/348 (44%), Gaps = 39/348 (11%)
Query: 78 MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYAR-----CSAY 132
+DTGS L WV C PC C Q +F PS+S SY V C+S CR A C +
Sbjct: 81 VDTGSDLSWVQCQPCNRCYNQQDPVFNPSKSPSYRTVLCNSLTCRSLQLATGNSGVCGSN 140
Query: 133 KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF-KFSGIFG 191
C Y Y G TS V E L NT V + +FGCG F SG+ G
Sbjct: 141 PPTCNYVVNYGDGSYTSGEVGMEHLNLGNTT-----VNNFIFGCGRKNQGLFGGASGLVG 195
Query: 192 LGIGRSSLVSQLN----SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQF----- 242
LG SL+SQ++ FSYC+ + L++G + + + + TP+ +
Sbjct: 196 LGRTDLSLISQISPMFGGVFSYCLPTTEAE--ASGSLVMGGNSSVYK-NTTPISYTRMIH 252
Query: 243 --IDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEV 300
+ Y++ L I+V G +++ F +D ++IDSGT ++ L Y+AL+ E
Sbjct: 253 NPLLPFYFLNLTGITVGG--VEVQAPSFGKDR----MIIDSGTVISRLPPSIYQALKAEF 306
Query: 301 MIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDA 359
+ + G S+ D C++ ++K P ++ +F G A+L ++ +FY + DA
Sbjct: 307 VKQFSGYPSAPSFMILDS-CFNLSGYQEVK-IPDIKMYFEGSAELNVDVTGVFYSVKTDA 364
Query: 360 --FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
C+A+ + +IG Q+ + YD F C
Sbjct: 365 SQVCLAIASLPYEDE---VGIIGNYQQKNQRIIYDTKGSMLGFAEEAC 409
>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
vinifera]
Length = 561
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 101/401 (25%), Positives = 176/401 (43%), Gaps = 43/401 (10%)
Query: 38 EKIRSHNTYQ-AQILPSNDI---------SNSLFYVNISIGQPPVPQFTAMDTGSSLLWV 87
+ +R+H+T + +IL + D+ L++ I IG P + +DTGS +LWV
Sbjct: 122 DALRAHDTRRHGRILSAVDLPLGGNGHPSEAGLYFAKIGIGTPSKDYYVQVDTGSDILWV 181
Query: 88 HCYPCRDCSPQ--LG---TIFYPSRSSSYAYVPCDSEHCRYF--PYARCSAYKHRCIYTQ 140
+C C C + LG T++ S++ V CD C + P C +C+Y+
Sbjct: 182 NCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVGCDDNFCSLYDGPLPGCKP-GLQCLYSV 240
Query: 141 LYLIGPETSVFVSTEQLTFKNTD---ESTIHVQDVVFGCGFSTNRNFKFS-----GIFGL 192
LY G T+ + + + + ++T VVFGCG + S GI G
Sbjct: 241 LYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGF 300
Query: 193 GIGRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFIDG 245
G SS++SQL SS FS+C+ D + I G +++ + + TPL
Sbjct: 301 GQANSSMLSQLASSGKVKKVFSHCL------DNVDGGGIFAIGEVVEPKVNITPLVQNQA 354
Query: 246 HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLE 305
HY + ++ I V G LD+ + F+ D G +IDSGT + + +E Y L ++++ +
Sbjct: 355 HYNVVMKEIEVGGDPLDVPSDAFESGDR-KGTIIDSGTTLAYFPQEVYVPLIEKILS--Q 411
Query: 306 GEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMA-V 364
+R ++ + GFPTV HF L + +Q + +C+
Sbjct: 412 QPDLRLHTVEQAFTCFDYTGNVDDGFPTVTLHFDKSISLTVYPHEYLFQVKEFEWCIGWQ 471
Query: 365 NPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
N + + +L+G + V YD+ ++ + +C
Sbjct: 472 NSGAQTKDGKDLTLLGDLVLSNKLVVYDLEKQGIGWVEYNC 512
>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
Length = 393
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 109/400 (27%), Positives = 171/400 (42%), Gaps = 49/400 (12%)
Query: 31 ARLAYLQEKIRSHNTYQAQILPSNDISNSL------FYVNISIGQPPVPQFTAMDTGSSL 84
AR+ ++ R++++ + + + D+ + L + ++IS+G P DTGS L
Sbjct: 21 ARVRWMAA--RANSSSWSSMAGTTDVESPLHPDGGGYVMDISVGTPGKRFRAIADTGSDL 78
Query: 85 LWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLI 144
+WV PC CS GTIF P +SS++ + C S+ C P C C Y+ Y
Sbjct: 79 VWVQSEPCTGCSG--GTIFDPRQSSTFREMDCSSQLCAELP-GSCEPGSSTCSYSYEYGS 135
Query: 145 GPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQL- 203
G ET + + ++ T + + GCG + G+ GLG G SL SQL
Sbjct: 136 G-ETEGEFARDTISLGTTSDGSQKFPSFAVGCGMVNSGFDGVDGLVGLGQGPVSLTSQLS 194
Query: 204 ---NSSFSYCI-----GSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAIS 255
+S FSYC+ S P L I P +Y +T+ I+
Sbjct: 195 AAIDSKFSYCLVDINSQSESSPLLFGPSAALHGTGIQSTKITPPSDTYPTYYLLTVNGIA 254
Query: 256 VDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEAL--RDEVMI---RLEGEQMR 310
V G+ + S G +IDSGT +T++ Y + R E M+ R++G M
Sbjct: 255 VAGQTM----------GSPGTTIIDSGTTLTYVPSGVYGRVLSRMESMVTLPRVDGSSMG 304
Query: 311 SYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFY--QPRPDAFCMAVNPAS 368
LCY + + K FP + GA + + F D C+A+ AS
Sbjct: 305 L-----DLCYDRSSNRNYK-FPALTIRL-AGATMTPPSSNYFLVVDDSGDTVCLAMGSAS 357
Query: 369 INNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
+ S+IG + QQ Y++ YD G + +F + CE L
Sbjct: 358 ----GLPVSIIGNVMQQGYHILYDRGSSELSFVQAKCESL 393
>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
Length = 502
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 105/376 (27%), Positives = 152/376 (40%), Gaps = 57/376 (15%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYVPCDSE 119
+ VN+ +G P DTGS L W C PC + C Q IF PS S +Y+ + C S
Sbjct: 154 YIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSTSKTYSNISCTSA 213
Query: 120 HCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
C A CS+ C+Y Y T F + ++LT D +F
Sbjct: 214 ACSSLKSATGNSPGCSS--SNCVYGIQYGDSSFTIGFFAKDKLTLTQNDV----FDGFMF 267
Query: 175 GCGFSTNRNF-KFSGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDG 229
GCG + F K +G+ GLG S+V Q FSYC+ + + L G+G
Sbjct: 268 GCGQNNKGLFGKTAGLIGLGRDPLSIVQQTAQKFGKYFSYCLPTSRGSN---GHLTFGNG 324
Query: 230 AIIDEGDA-------TPLQFIDG--HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
+ A TP G +Y+I + ISV G+ L I+P +F+ G +ID
Sbjct: 325 NGVKASKAVKNGITFTPFASSQGTAYYFIDVLGISVGGKALSISPMLFQN----AGTIID 380
Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL-----CYHGIMSSDLKGF---- 331
SGT +T L AY +L+ + M Y L CY DL +
Sbjct: 381 SGTVITRLPSTAYGSLKSAFK-----QFMSKYPTAPALSLLDTCY------DLSNYTSIS 429
Query: 332 -PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
P + F+F G A + L+ + + C+A + N + + G + QQ V
Sbjct: 430 IPKISFNFNGNANVELDPNGILITNGASQVCLAF---AGNGDDDSIGIFGNIQQQTLEVV 486
Query: 391 YDIGRKQKTFQRMDCE 406
YD+ Q F C
Sbjct: 487 YDVAGGQLGFGYKGCS 502
>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
Length = 332
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 96/345 (27%), Positives = 157/345 (45%), Gaps = 33/345 (9%)
Query: 78 MDTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYAR-----CSA 131
+DTGSSL W+ C PC C Q ++ PS S +Y + C S C A C
Sbjct: 3 LDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKTYKKLSCASVECSRLKAATLNDPLCET 62
Query: 132 YKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF-KFSGIF 190
+ C+YT Y + ++S + LT ++ + +GCG F + +GI
Sbjct: 63 DSNACLYTASYGDTSFSIGYLSQDLLTLTSSQT----LPQFTYGCGQDNQGLFGRAAGII 118
Query: 191 GLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH 246
GL + S+++QL++ +FSYC+ + + L +G + TP+ D
Sbjct: 119 GLARDKLSMLAQLSTKYGHAFSYCLPTANSGSSGGGFLSIGSISPTSY-KFTPM-LTDSK 176
Query: 247 ----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMI 302
Y++ L AI+V GR LD+ +++ +IDSGT +T L Y ALR +
Sbjct: 177 NPSLYFLRLTAITVSGRPLDLAAAMYRVP-----TLIDSGTVITRLPMSMYAALRQAFVK 231
Query: 303 RLEGEQMR--SYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAF 360
+ + + +YS D C+ G + S + P ++ F+GGA L L S+ +
Sbjct: 232 IMSTKYAKAPAYSILDT-CFKGSLKS-ISAVPEIKMIFQGGADLTLRAPSILIEADKGIT 289
Query: 361 CMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
C+A +S N+ ++IG QQ YN+ YD+ + F C
Sbjct: 290 CLAFAGSSGTNQ---IAIIGNRQQQTYNIAYDVSTSRIGFAPGSC 331
>gi|195627138|gb|ACG35399.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 431
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 113/420 (26%), Positives = 170/420 (40%), Gaps = 35/420 (8%)
Query: 8 LSPYNN---PNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVN 64
LS Y+N P+ +P + ARL +L K S + + S S + V
Sbjct: 24 LSVYHNVHPPSPSPLESIIALARADDARLLFLSSKAASSGGVTSAPVASGQTPPS-YVVR 82
Query: 65 ISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYF 124
+G P A+DT + W HC PC C G+ F P+ SSSYA +PC S+ C F
Sbjct: 83 AGLGTPVQQLLLALDTSADATWSHCAPCDTC--PAGSRFIPASSSSYASLPCASDWCPLF 140
Query: 125 PYARC------SAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
C SA C +++ + +TS S T + ++ + FGC
Sbjct: 141 EGQPCPANQDASAPLPACAFSKPFA---DTSFQASLGSDTLRLGKDA---IAGYAFGCVG 194
Query: 179 ST---NRNFKFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAI 231
+ N G+ GLG G SL+SQ S+ FSYC+ S Y L LG
Sbjct: 195 AVAGPTTNLPKQGLLGLGRGPMSLLSQTGSTYNGVFSYCLPSYRS-YYFSGSLRLGAAGQ 253
Query: 232 IDEGDATPLQFIDGH----YYITLEAISVDGRMLDINPNIFKRDD-SGGGVMIDSGTDVT 286
TPL + H YY+ + +SV + + F D +G G +IDSGT +T
Sbjct: 254 PRNVRYTPL-LTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTVIT 312
Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
Y ALR+E ++ + C++ G P V H GG L L
Sbjct: 313 RWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFN-TDEVAAGGAPPVTLHMDGGVDLTL 371
Query: 347 E-KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
++++ + C+A+ A N +++ + QQ V D+ + F R C
Sbjct: 372 PMENTLIHSSATPLACLAMAEAP-QNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 430
>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 484
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 121/423 (28%), Positives = 188/423 (44%), Gaps = 56/423 (13%)
Query: 19 AHRVQRGINISIARLAYLQEKIRS-------HNTYQAQILPSNDIS-NSLFY-VNISIGQ 69
+++R + + R+ LQ KI++ + + QI ++ I SL Y V + +G
Sbjct: 84 GKKMRRALVLDNIRVQSLQLKIKAMTSSTTEQSVSETQIPLTSGIKLESLNYIVTVELGG 143
Query: 70 PPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARC 129
+ +DTGS L WV C PCR C Q G ++ PS SSSY V C+S C+ A
Sbjct: 144 KNMSLI--VDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATS 201
Query: 130 SA---------YKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
++ K C Y Y G T +++E + +T +++ VFGCG
Sbjct: 202 NSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDT-----KLENFVFGCG-RN 255
Query: 181 NRNFKFSGIFGLGIGRS--SLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
N+ +G+GRS SLVSQ N FSYC+ SL D L G+ + +
Sbjct: 256 NKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDG--ASGSLSFGNDSSVYT 313
Query: 235 GDA----TPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
TPL + Y + L S+ G L K G G++IDSGT +T
Sbjct: 314 NSTSVSYTPLVQNPQLRSFYILNLTGASIGGVEL-------KSSSFGRGILIDSGTVITR 366
Query: 288 LVKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
L Y+A++ E + + G YS D C++ D+ P ++ F+G A+L +
Sbjct: 367 LPPSIYKAVKIEFLKQFSGFPTAPGYSILDT-CFNLTSYEDIS-IPIIKMIFQGNAELEV 424
Query: 347 EKDSMFYQPRPDA--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
+ +FY +PDA C+A+ S N +IG Q+ V YD +++ +
Sbjct: 425 DVTGVFYFVKPDASLVCLALASLSYENE---VGIIGNYQQKNQRVIYDSTQERLGIVGEN 481
Query: 405 CEV 407
C V
Sbjct: 482 CRV 484
>gi|168022164|ref|XP_001763610.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685103|gb|EDQ71500.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 308
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 84/290 (28%), Positives = 131/290 (45%), Gaps = 47/290 (16%)
Query: 54 NDI-SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCS-----PQLGTIFYPSR 107
NDI + L+Y IS+G PP + +DTGS++ WV C PC C P + F P +
Sbjct: 33 NDIFAMGLYYTRISLGTPPQQFYVDVDTGSNVAWVKCAPCTGCEHSGDVPVPMSTFDPRK 92
Query: 108 SSSYAYVPCDSEHCRYF-PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKN--TDE 164
S++ + C C +CS + C Y+ LY G T+ + + TF +D
Sbjct: 93 STTKISISCTDAECGVLNKKLQCSPERLSCPYSLLYGDGSSTAGYYLNDVFTFNQVPSDN 152
Query: 165 STIH--VQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYC------ 210
ST +VFGCG + ++ G+ G G SL +QL F++C
Sbjct: 153 STAKSGTARLVFGCGGTQTGSWSVDGLLGFGPTTVSLPNQLAQQNISVNIFAHCLQGDVS 212
Query: 211 ------IGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDIN 264
IG++ +PD ++ TP+ F + HY + L I + GR +
Sbjct: 213 GRGSLVIGTIREPDLVY----------------TPMVFGEDHYNVQLLNIGISGRNV-TT 255
Query: 265 PNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSW 314
P F + + GGV+IDSGT +T+LV+ AY+ R V + + + W
Sbjct: 256 PASFDLEYT-GGVIIDSGTTLTYLVQPAYDEFRRGVSVFKQSSDLAVAFW 304
>gi|356499109|ref|XP_003518386.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 428
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 103/392 (26%), Positives = 168/392 (42%), Gaps = 56/392 (14%)
Query: 52 PSNDIS---NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRS 108
PS +S N V++++G PP +DTGS L W+HC P L + F P S
Sbjct: 48 PSRKLSFHHNVTLTVSLTVGSPPQNVTMVLDTGSELSWLHCKKL----PNLNSTFNPLLS 103
Query: 109 SSYAYVPCDSEHCR------YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT 162
SSY PC+S C P A C C Y ++ E +
Sbjct: 104 SSYTPTPCNSSICTTRTRDLTIP-ASCDPNNKLCHVIVSYADASSAEGTLAAETFSLAGA 162
Query: 163 DESTIHVQDVVFGC----GFST--NRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLH 215
+ +FGC G+++ N + K +G+ G+ G SLV+Q++ FSYCI
Sbjct: 163 AQP-----GTLFGCMDSAGYTSDINEDSKTTGLMGMNRGSLSLVTQMSLPKFSYCISG-- 215
Query: 216 DPDYLHNKLILGDGA----------IIDEGDATPLQFIDGHYYITLEAISVDGRMLDINP 265
D L L+LGDG ++ ++P F Y + LE I V ++L +
Sbjct: 216 -EDAL-GVLLLGDGTDAPSPLQYTPLVTATTSSPY-FNRVAYTVQLEGIKVSEKLLQLPK 272
Query: 266 NIFKRDDSGGG-VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMR------SYSWPDKL 318
++F D +G G M+DSGT T+L+ Y +L+DE + + +G R + L
Sbjct: 273 SVFVPDHTGAGQTMVDSGTQFTFLLGSVYSSLKDEFLEQTKGVLTRIEDPNFVFEGAMDL 332
Query: 319 CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDA---FCMAVNPASINNRYVN 375
CYH S P V F GA++ + + + Y+ + +C + + +
Sbjct: 333 CYHAPAS--FAAVPAVTLVFS-GAEMRVSGERLLYRVSKGSDWVYCFTFGNSDLLG--IE 387
Query: 376 FSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
+IG QQ + +D+ + + F + C++
Sbjct: 388 AYVIGHHHQQNVWMEFDLLKSRVGFTQTTCDL 419
>gi|18421660|ref|NP_568551.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|10177438|dbj|BAB10671.1| unnamed protein product [Arabidopsis thaliana]
gi|15809850|gb|AAL06853.1| AT5g37540/mpa22_p_70 [Arabidopsis thaliana]
gi|20260182|gb|AAM12989.1| unknown protein [Arabidopsis thaliana]
gi|23197748|gb|AAN15401.1| unknown protein [Arabidopsis thaliana]
gi|332006821|gb|AED94204.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 97/370 (26%), Positives = 171/370 (46%), Gaps = 34/370 (9%)
Query: 63 VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG--TIFYPSRSSSYAYVPCDSEH 120
+++ IG P Q +DTGS L W+ C+P + P T F PS SSS++ +PC
Sbjct: 82 LSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHPL 141
Query: 121 CRY----FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
C+ F C Y+ Y G + E+ TF N+ + ++ GC
Sbjct: 142 CKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQTT----PPLILGC 197
Query: 177 GFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHN--KLILGDG---- 229
+ GI G+ +GR S +SQ S FSYCI + + L + LGD
Sbjct: 198 AKESTDE---KGILGMNLGRLSFISQAKISKFSYCIPTRSNRPGLASTGSFYLGDNPNSR 254
Query: 230 -----AIIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMIDSG 282
+++ + + +D Y + L+ I + + L+I ++F+ D G G M+DSG
Sbjct: 255 GFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFRPDAGGSGQTMVDSG 314
Query: 283 TDVTWLVKEAYEALRDEVMIRLEGEQMRS---YSWPDKLCYHGIMSSDL-KGFPTVRFHF 338
++ T LV AY+ +++E+ +RL G +++ Y +C+ G S ++ + + F F
Sbjct: 315 SEFTHLVDVAYDKVKEEI-VRLVGSRLKKGYVYGSTADMCFDGNHSMEIGRLIGDLVFEF 373
Query: 339 RGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQK 398
G ++ +EK S+ C+ + +S+ N +IG + QQ V +D+ ++
Sbjct: 374 GRGVEILVEKQSLLVNVGGGIHCVGIGRSSMLGAASN--IIGNVHQQNLWVEFDVTNRRV 431
Query: 399 TFQRMDCEVL 408
F + +C +L
Sbjct: 432 GFSKAECRLL 441
>gi|296089645|emb|CBI39464.3| unnamed protein product [Vitis vinifera]
Length = 477
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 105/409 (25%), Positives = 178/409 (43%), Gaps = 62/409 (15%)
Query: 40 IRSH-NTYQAQILPSNDI---------SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC 89
+++H N+ Q +IL D+ + L+Y I IG P + +DTGS ++WV+C
Sbjct: 67 LKAHDNSRQLRILAGVDLPLGGTGRPEAVGLYYAKIGIGTPARDYYVQVDTGSDIMWVNC 126
Query: 90 YPCRDCSPQ--LG---TIFYPSRSSSYAYVPCDSEHCRYF---PYARCSAYKHRCIYTQL 141
C +C + LG T++ S + V CD + C P + C A C YT++
Sbjct: 127 IQCNECPKKSSLGMELTLYDIKESLTGKLVSCDQDFCYAINGGPPSYCIA-NMSCSYTEI 185
Query: 142 YLIGPETSVFVSTEQLTFKNTD---ESTIHVQDVVFGCGFSTNRNFK----FSGIFGLGI 194
Y G + + + + + E+T V+FGC + + + GI G G
Sbjct: 186 YADGSSSFGYFVRDIVQYDQVSGDLETTSANGSVIFGCSATQSGDLSSEEALDGILGFGK 245
Query: 195 GRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFIDGHY 247
+S++SQL SS F++C+ D L+ I G I+ + + TPL HY
Sbjct: 246 SNTSMISQLASSGKVRKMFAHCL------DGLNGGGIFAIGHIVQPKVNTTPLVPNQTHY 299
Query: 248 YITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGE 307
+ ++A+ V G L++ ++F D G +IDSGT + +L + Y+ L ++
Sbjct: 300 NVNMKAVEVGGYFLNLPTDVFDVGDK-KGTIIDSGTTLAYLPEVVYDQLLSKI------- 351
Query: 308 QMRSYSWPDKLCYHGI--------MSSDL-KGFPTVRFHFRGGAKLALEKDSMFYQPRPD 358
+SW L H I S L GFP V FHF L + +
Sbjct: 352 ----FSWQSDLKVHTIHDQFTCFQYSESLDDGFPAVTFHFENSLYLKVHPHEYLFS-YDG 406
Query: 359 AFCMAVNPASINNR-YVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
+C+ + + +R N +L+G +A V YD+ + + +C+
Sbjct: 407 LWCIGWQNSGMQSRDRRNITLLGDLALSNKLVLYDLENQVIGWTEYNCK 455
>gi|357166728|ref|XP_003580821.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 109/361 (30%), Positives = 161/361 (44%), Gaps = 45/361 (12%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDIS--- 57
L+HRD++ S +P+ H V + AR+AYLQ ++ + + + +
Sbjct: 61 LLHRDTV-SGTKHPSRR--HAVLALASRDTARVAYLQRRLSPSPSPSSTSSVESGGTIVS 117
Query: 58 --NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
+ + V + IG PP+ Q DTGS ++WV C PC DC Q +F P+ S+S++ VP
Sbjct: 118 HGSGEYLVRVGIGSPPLEQHLVADTGSDVIWVQCSPCSDCYAQGDPLFDPANSASFSPVP 177
Query: 116 CDSEHCRY---FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
C+S CR + + C C Y Y T+ ++ E LT E VQ V
Sbjct: 178 CNSGVCRAAARYSSSSCGGGGGECEYKVSYGDKSYTNGVLALETLTLDGGTE----VQGV 233
Query: 173 VFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILG 227
GCG F + +G+ GLG G SLV QL +FSYC+ + + + +
Sbjct: 234 AMGCGHENRGLFAEAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLAGYYSGEGSGSGSL-- 291
Query: 228 DGAIIDEGDATPLQFI----------DGHYYITLEAISVDGRMLDIN-PNIFKRDDSGGG 276
++ DA P + YY+ + + V G L + DD GGG
Sbjct: 292 ---VLGREDAAPTGAVWVPLVRNPDAPSFYYVGVNGLGVAGERLQLQDGLFDLGDDGGGG 348
Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS--YSWPDKLCYHGIMSSDLKGFPTV 334
V++D+GT VT L EAY ALR E R+ S D CY DL G+ +V
Sbjct: 349 VVMDTGTAVTRLPAEAYAALRGAFAGAFEEGAPRAPGVSLFDT-CY------DLSGYASV 401
Query: 335 R 335
R
Sbjct: 402 R 402
>gi|359478045|ref|XP_002267046.2| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 502
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 105/408 (25%), Positives = 177/408 (43%), Gaps = 62/408 (15%)
Query: 40 IRSH-NTYQAQILPSNDI---------SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC 89
+++H N+ Q +IL D+ + L+Y I IG P + +DTGS ++WV+C
Sbjct: 67 LKAHDNSRQLRILAGVDLPLGGTGRPEAVGLYYAKIGIGTPARDYYVQVDTGSDIMWVNC 126
Query: 90 YPCRDCSPQ--LG---TIFYPSRSSSYAYVPCDSEHCRYF---PYARCSAYKHRCIYTQL 141
C +C + LG T++ S + V CD + C P + C A C YT++
Sbjct: 127 IQCNECPKKSSLGMELTLYDIKESLTGKLVSCDQDFCYAINGGPPSYCIA-NMSCSYTEI 185
Query: 142 YLIGPETSVFVSTEQLTFKNTD---ESTIHVQDVVFGCGFSTNRNFK----FSGIFGLGI 194
Y G + + + + + E+T V+FGC + + + GI G G
Sbjct: 186 YADGSSSFGYFVRDIVQYDQVSGDLETTSANGSVIFGCSATQSGDLSSEEALDGILGFGK 245
Query: 195 GRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFIDGHY 247
+S++SQL SS F++C+ D L+ I G I+ + + TPL HY
Sbjct: 246 SNTSMISQLASSGKVRKMFAHCL------DGLNGGGIFAIGHIVQPKVNTTPLVPNQTHY 299
Query: 248 YITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGE 307
+ ++A+ V G L++ ++F D G +IDSGT + +L + Y+ L ++
Sbjct: 300 NVNMKAVEVGGYFLNLPTDVFDVGDK-KGTIIDSGTTLAYLPEVVYDQLLSKI------- 351
Query: 308 QMRSYSWPDKLCYHGI--------MSSDL-KGFPTVRFHFRGGAKLALEKDSMFYQPRPD 358
+SW L H I S L GFP V FHF L + +
Sbjct: 352 ----FSWQSDLKVHTIHDQFTCFQYSESLDDGFPAVTFHFENSLYLKVHPHEYLFS-YDG 406
Query: 359 AFCMAVNPASINNR-YVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+C+ + + +R N +L+G +A V YD+ + + +C
Sbjct: 407 LWCIGWQNSGMQSRDRRNITLLGDLALSNKLVLYDLENQVIGWTEYNC 454
>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 468
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 101/371 (27%), Positives = 162/371 (43%), Gaps = 34/371 (9%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGT-----IFYPSRSSSYAYVP 115
++V + +G P P DTGS L WV C S +F P+ S S++ +P
Sbjct: 104 YFVRLRVGTPAQPFVLVADTGSDLTWVKCSSPSSSSSSPAASPPQRVFRPAGSKSWSPLP 163
Query: 116 CDSEHCR-YFPY--ARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFK---NTDESTIHV 169
CDS+ C+ Y P+ A CS+ C Y Y V + T N +
Sbjct: 164 CDSDTCKSYVPFSLANCSSPPDPCSYDYRYKDNSSARGVVGLDSATVSLSGNDGTRKAKL 223
Query: 170 QDVVFGCGFSTN-RNFKFS-GIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNK 223
Q+VV GC S + ++FK S G+ LG S S+ S FSYC+ P +
Sbjct: 224 QEVVLGCTTSYDGQSFKSSDGVLSLGNSNISFASRAASRFGGRFSYCLVDHLAPRNATSF 283
Query: 224 LILGDGAIIDEGDA----TPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSG 274
L G+G D+ TPL ++ Y+++++A++V G L+I P+++ +G
Sbjct: 284 LTFGNGDSSPGDDSSSRRTPLVLLEDARTRPFYFVSVDAVTVAGERLEILPDVWDFRKNG 343
Query: 275 GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTV 334
G ++ DSGT +T L AY+A+ + + G + P + CY+ S P +
Sbjct: 344 GAIL-DSGTSLTILATPAYDAVVKAISKQFAGVPRVNMD-PFEYCYNWTGVS--AEIPRM 399
Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
F G A LA S P C+ V + + S+IG + QQ + +D+
Sbjct: 400 ELRFAGAATLAPPGKSYVIDTAPGVKCIGV----VEGAWPGVSVIGNILQQEHLWEFDLA 455
Query: 395 RKQKTFQRMDC 405
+ F++ C
Sbjct: 456 NRWLRFKQSRC 466
>gi|297720449|ref|NP_001172586.1| Os01g0776900 [Oryza sativa Japonica Group]
gi|255673740|dbj|BAH91316.1| Os01g0776900 [Oryza sativa Japonica Group]
Length = 381
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 86/285 (30%), Positives = 132/285 (46%), Gaps = 31/285 (10%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
L++ + +G PP F +DTGS +LWV C PC C G F P SS+ + +
Sbjct: 90 LYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKI 149
Query: 115 PCDSEHCRYF---PYARCSAYKHR-CIYTQLYLIGPETSVFVSTEQLTFKNT---DESTI 167
PC + C A C + C YT Y G TS + ++ + F +++
Sbjct: 150 PCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTAN 209
Query: 168 HVQDVVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNS------SFSYCIGSLHD 216
+VFGC S T + GIFG G + S+VSQLNS FS+C L
Sbjct: 210 SSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHC---LKG 266
Query: 217 PDYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
D L+LG+ I++ G TPL HY + LE+I V+G+ L I+ ++F ++
Sbjct: 267 SDNGGGILVLGE--IVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNT-Q 323
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCY 320
G ++DSGT + +L AY+ + + + +RS C+
Sbjct: 324 GTIVDSGTTLAYLADGAYDPFVNAITAAVS-PSVRSLVSKGNQCF 367
>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 478
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 111/369 (30%), Positives = 160/369 (43%), Gaps = 49/369 (13%)
Query: 55 DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR---DCSPQLGTIFYPSRSSSY 111
DI + V S+G P V Q +DTGS L WV C PC C Q +F P++SSSY
Sbjct: 134 DIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFDPAQSSSY 193
Query: 112 AYVPCDSEHCRYFP-YARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
A VPC C YA + +C Y Y G T+ S++ LT + VQ
Sbjct: 194 AAVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASSA----VQ 249
Query: 171 DVVFGCGFSTNRNFK-FSGIFGLGIGRSSLVSQLNSS----FSYCIGSL-HDPDYLHNKL 224
FGCG + + F G+ GLG + SLV Q + FSYC+ + YL L
Sbjct: 250 GFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYL--TL 307
Query: 225 ILGDGAIIDEGDAT----PLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
LG + G +T P +Y + L ISV G+ L + + F GG ++D
Sbjct: 308 GLGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFA-----GGTVVD 362
Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS--DLKGF-----PT 333
+GT +T L AY ALR M SY +P +GI+ + + G+ P
Sbjct: 363 TGTVITRLPPTAYAALRSAFR-----SGMASYGYPTAPS-NGILDTCYNFAGYGTVTLPN 416
Query: 334 VRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNV---G 390
V F GA + L D + C+A P+ + +++G + Q+ + V G
Sbjct: 417 VALTFGSGATVMLGADGIL-----SFGCLAFAPSGSDG---GMAILGNVQQRSFEVRIDG 468
Query: 391 YDIGRKQKT 399
+G K +
Sbjct: 469 TSVGFKPSS 477
>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
Length = 519
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 103/357 (28%), Positives = 156/357 (43%), Gaps = 29/357 (8%)
Query: 61 FYVNISIGQPPVPQFTAM-DTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYVPCDS 118
+ V + +G P V ++T + DTGS WV C PC C Q +F P+RSS+YA V C +
Sbjct: 180 YVVTVGLGTP-VSRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTYANVSCAA 238
Query: 119 EHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
C CS C+Y Y G + F + + LT + D V+ FGCG
Sbjct: 239 PACSDLNIHGCSG--GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDA----VKGFRFGCGE 292
Query: 179 STNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLH-DPDYLHNKLILGDGAII 232
F + +G+ GLG G++SL Q F++C+ + YL G A
Sbjct: 293 RNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTGTGYL--DFGAGSLAAA 350
Query: 233 DEGDATPLQFIDG--HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVK 290
TP+ +G YY+ + I V G++L I ++F G ++DSGT +T L
Sbjct: 351 SARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFAT----AGTIVDSGTVITRLPP 406
Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKL--CYHGIMSSDLKGFPTVRFHFRGGAKLALEK 348
AY +LR + + L CY S + PTV F+GGA+L ++
Sbjct: 407 AAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVA-IPTVSLLFQGGARLDVDA 465
Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ Y C+A + N + ++G + + V YDIG+K F C
Sbjct: 466 SGIMYAASASQVCLAF---AANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYPGAC 519
>gi|255549236|ref|XP_002515672.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223545215|gb|EEF46724.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 492
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 96/372 (25%), Positives = 166/372 (44%), Gaps = 38/372 (10%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLG---TIFYPSRSSSYAYV 114
L+Y + IG P + +DTGS ++WV+C CR+C + LG T++ S S V
Sbjct: 85 LYYAKVGIGTPSKDYYVQVDTGSDIMWVNCIQCRECPRTSSLGMELTLYNIKDSVSGKLV 144
Query: 115 PCDSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD---ESTIH 168
PCD E C P + C+A C Y ++Y G T+ + + + + ++T
Sbjct: 145 PCDEEFCYEVNGGPLSGCTA-NMSCPYLEIYGDGSSTAGYFVKDVVQYDRVSGDLQTTSS 203
Query: 169 VQDVVFGCGFSTNRNF------KFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHD 216
V+FGCG + + GI G G SS++SQL ++ F++C+
Sbjct: 204 NGSVIFGCGARQSGDLGPTSEEALDGILGFGKSNSSMISQLAATRKVKKIFAHCL----- 258
Query: 217 PDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
D ++ I G ++ + + TPL HY + + A+ V L + F+ D G
Sbjct: 259 -DGINGGGIFAIGHVVQPKVNMTPLIPNQPHYNVNMTAVQVGEDFLHLPTEEFEAGDRKG 317
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK-LCYHGIMSSDLKGFPTV 334
+ IDSGT + +L + YE L +++ + ++ + D+ C+ S D GFP V
Sbjct: 318 AI-IDSGTTLAYLPEIVYEPLVSKIIS--QQPDLKVHIVRDEYTCFQYSGSVD-DGFPNV 373
Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR-YVNFSLIGMMAQQFYNVGYDI 393
FHF L + + P +C+ + + +R N +L+G + V YD+
Sbjct: 374 TFHFENSVFLKVHPHEYLF-PFEGLWCIGWQNSGMQSRDRRNMTLLGDLVLSNKLVLYDL 432
Query: 394 GRKQKTFQRMDC 405
+ + +C
Sbjct: 433 ENQAIGWTEYNC 444
>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 488
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 100/375 (26%), Positives = 170/375 (45%), Gaps = 44/375 (11%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ--LG---TIFYPSRSSSYAYV 114
L+Y I IG PP + +DTGS ++WV+C C++C + LG T++ SSS V
Sbjct: 82 LYYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRSSLGMDLTLYDIKESSSGKLV 141
Query: 115 PCDSEHCRYFP---YARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFK------NTDES 165
PCD E C+ C+A C Y ++Y G T+ + + + + TD +
Sbjct: 142 PCDQEFCKEINGGLLTGCTA-NISCPYLEIYGDGSSTAGYFVKDIVLYDQVSGDLKTDSA 200
Query: 166 TIHVQDVVFGCG------FSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGS 213
+VFGCG S++ GI G G SS++SQL SS F++C+
Sbjct: 201 N---GSIVFGCGARQSGDLSSSNEEALDGILGFGKANSSMISQLASSGKVKKMFAHCLNG 257
Query: 214 LHDPDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDD 272
++ I G ++ + + TPL HY + + A+ V L ++ + + D
Sbjct: 258 VNGGG------IFAIGHVVQPKVNMTPLLPDQPHYSVNMTAVQVGHTFLSLSTDTSAQGD 311
Query: 273 SGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFP 332
G +IDSGT + +L + YE L +++ + ++++ + C+ S D GFP
Sbjct: 312 R-KGTIIDSGTTLAYLPEGIYEPLVYKMISQHPDLKVQTLH-DEYTCFQYSESVD-DGFP 368
Query: 333 TVRFHFRGGAKLAL-EKDSMFYQPRPDAFCMAVNPASINNR-YVNFSLIGMMAQQFYNVG 390
V F F G L + D +F P + +C+ + +R N +L+G + V
Sbjct: 369 AVTFFFENGLSLKVYPHDYLF--PSVNFWCIGWQNSGTQSRDSKNMTLLGDLVLSNKLVF 426
Query: 391 YDIGRKQKTFQRMDC 405
YD+ + + +C
Sbjct: 427 YDLENQAIGWAEYNC 441
>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
gi|194690124|gb|ACF79146.1| unknown [Zea mays]
gi|194708040|gb|ACF88104.1| unknown [Zea mays]
gi|223950469|gb|ACN29318.1| unknown [Zea mays]
gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
Length = 500
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 106/360 (29%), Positives = 160/360 (44%), Gaps = 52/360 (14%)
Query: 78 MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYAR--------- 128
+DT S L WV C PC C Q G +F PS S SYA VPCDS C
Sbjct: 158 VDTASELTWVQCAPCESCHDQQGPLFDPSSSPSYAAVPCDSPSCDALQQQLATGAGAGAP 217
Query: 129 -CSAYK-HRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF 186
C A + C Y Y G + ++ ++L+ + VFGCG ++N+ F
Sbjct: 218 PCDAGRPAACSYALSYRDGSYSRGVLAHDRLSLAGE-----VIDGFVFGCG-TSNQGPPF 271
Query: 187 SGIFGL-GIGRSSL------VSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATP 239
G GL G+GRS L V Q FSYC+ + D L+LGD ++TP
Sbjct: 272 GGTSGLMGLGRSQLSLVSQTVDQFGGVFSYCLPLSRESD-ASGSLVLGDDPSAYR-NSTP 329
Query: 240 LQF----------IDGHYY-ITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
+ + + G +Y + L I+V G+ ++ F ++DSGT +T L
Sbjct: 330 VVYTSMVSNSDPLLQGPFYLVNLTGITVGGQ--EVESTGFSAR-----AIVDSGTVITSL 382
Query: 289 VKEAYEALRDEVMIRL-EGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALE 347
V Y A+R E M +L E Q +S D C++ +++ P++ F GGA++ ++
Sbjct: 383 VPSVYNAVRAEFMSQLAEYPQAPGFSILDT-CFNMTGLKEVQ-VPSLTLVFDGGAEVEVD 440
Query: 348 KDSMFYQPRPDA--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ Y D+ C+AV AS+ + S+IG Q+ V +D Q F + C
Sbjct: 441 SGGVLYFVSSDSSQVCLAV--ASLKSED-ETSIIGNYQQKNLRVVFDTSASQVGFAQETC 497
>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 427
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 100/383 (26%), Positives = 162/383 (42%), Gaps = 53/383 (13%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
N ++++IG PP +DTGS L W+HC P L + F P SSSY PC+
Sbjct: 56 NVTLTISLTIGSPPQNVTMVLDTGSELSWLHCKKL----PNLNSTFNPLLSSSYTPTPCN 111
Query: 118 SEHCR------YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
S C P A C C Y ++ E + +
Sbjct: 112 SSVCMTRTRDLTIP-ASCDPNNKLCHVIVSYADASSAEGTLAAETFSLAGAAQP-----G 165
Query: 172 VVFGC----GFST--NRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKL 224
+FGC G+++ N + K +G+ G+ G SLV+Q+ FSYCI + L
Sbjct: 166 TLFGCMDSAGYTSDINEDAKTTGLMGMNRGSLSLVTQMVLPKFSYCISG----EDAFGVL 221
Query: 225 ILGDG----------AIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG 274
+LGDG ++ ++P F Y + LE I V ++L + ++F D +G
Sbjct: 222 LLGDGPSAPSPLQYTPLVTATTSSPY-FDRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTG 280
Query: 275 GG-VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMR------SYSWPDKLCYHGIMSSD 327
G M+DSGT T+L+ Y +L+DE + + +G R + LCYH S
Sbjct: 281 AGQTMVDSGTQFTFLLGPVYNSLKDEFLEQTKGVLTRIEDPNFVFEGAMDLCYHAPAS-- 338
Query: 328 LKGFPTVRFHFRGGAKLALEKDSMFY---QPRPDAFCMAVNPASINNRYVNFSLIGMMAQ 384
L P V F GA++ + + + Y + R +C + + + +IG Q
Sbjct: 339 LAAVPAVTLVF-SGAEMRVSGERLLYRVSKGRDWVYCFTFGNSDLLG--IEAYVIGHHHQ 395
Query: 385 QFYNVGYDIGRKQKTFQRMDCEV 407
Q + +D+ + + F C++
Sbjct: 396 QNVWMEFDLVKSRVGFTETTCDL 418
>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
Length = 462
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 101/360 (28%), Positives = 151/360 (41%), Gaps = 32/360 (8%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSYAYVPCDSE 119
F V + G P DTGS + W+ C PC C Q IF P++S++Y+ VPC
Sbjct: 120 FVVTVGFGTPAQTYTLMFDTGSDVSWIQCLPCSGHCYKQHDPIFDPTKSATYSAVPCGHP 179
Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
C +CS+ C+Y Y G T+ +S E L+ S + FGCG +
Sbjct: 180 QCAAA-GGKCSS-NGTCLYKVQYGDGSSTAGVLSHETLSLT----SARALPGFAFGCGET 233
Query: 180 TNRNF-KFSGIFGLGIGRSSL----VSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
+F G+ GLG G+ SL + ++FSYC+ S + H L +G
Sbjct: 234 NLGDFGDVDGLIGLGRGQLSLSSQAAASFGAAFSYCLPSYNT---SHGYLTIGTTTPASG 290
Query: 235 GDAT------PLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
D Q Y++ L +I V G +L + P +F RD G ++DSGT +T+L
Sbjct: 291 SDGVRYTAMIQKQDYPSFYFVDLVSIVVGGFVLPVPPILFTRD----GTLLDSGTVLTYL 346
Query: 289 VKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEK 348
EAY ALRD + + P CY + P V F F G+ L
Sbjct: 347 PPEAYTALRDRFKFTMTQYKPAPAYDPFDTCYD-FAGQNAIFMPLVSFKFSDGSSFDLSP 405
Query: 349 DSMFYQP---RPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ P P C+A P + F+++G Q+ + YD+ ++ F C
Sbjct: 406 FGVLIFPDDTAPATGCLAFVP---RPSTMPFTIVGNTQQRNTEMIYDVAAEKIGFVSGSC 462
>gi|255565759|ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 447
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 121/429 (28%), Positives = 183/429 (42%), Gaps = 54/429 (12%)
Query: 13 NPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPV 72
NP+++ ++ ++ S+AR +H+ Q P S + +++S G PP
Sbjct: 38 NPSQDHLQKLNYLVSTSLAR---------AHHLKNPQTTPVFSHSYGGYSISLSFGTPPQ 88
Query: 73 PQFTAMDTGSSLLWVHC---YPCRDCS-PQLGTIFYPSRSSSYAYVPCDSEHCRY----- 123
MDTGSS +W C Y C +CS + F P SSS + C + C +
Sbjct: 89 TLSFVMDTGSSFVWFPCTLRYLCNNCSFTSRISPFLPKHSSSSKIIGCKNPKCSWIHQTD 148
Query: 124 FPYARCSAYKHRCI-----YTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
C C Y LY G V +S E L + V + + GC
Sbjct: 149 LRCTDCDNNSRNCSQICPPYLILYGSGTTGGVALS-ETLHLHG-----LIVPNFLVGCSV 202
Query: 179 STNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLH-DPDYLHNKLILGDGAIIDEGD 236
++R + +GI G G G SSL SQL + FSYC+ S D + L+L + D+
Sbjct: 203 FSSR--QPAGIAGFGRGPSSLPSQLGLTKFSYCLLSHKFDDTQESSSLVLDSQSDSDKKT 260
Query: 237 A----TPL---------QFIDGHYYITLEAISVDGRMLDINPNIFKRD-DSGGGVMIDSG 282
A TPL +YY++L IS+ GR + I D D GG +IDSG
Sbjct: 261 AALMYTPLVKNPKVQDKPAFSVYYYVSLRRISIGGRSVKIPYKYLSPDKDGNGGTIIDSG 320
Query: 283 TDVTWLVKEAYEALRDEVMIRL---EGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFR 339
T T++ EA+E L +E + ++ E M K C++ + +L+ P +R HF+
Sbjct: 321 TTFTYMSTEAFEILSNEFISQVKNYERALMVEALSGLKPCFNVSGAKELE-LPQLRLHFK 379
Query: 340 GGA--KLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
GGA +L LE F R A V + L Q FY V YD+ ++
Sbjct: 380 GGADVELPLENYFAFLGSREVACFTVVTDGAEKASGPGMILGNFQMQNFY-VEYDLQNER 438
Query: 398 KTFQRMDCE 406
F++ C+
Sbjct: 439 LGFKKESCK 447
>gi|115451209|ref|NP_001049205.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|49532749|dbj|BAD26705.1| Radc1 [Oryza sativa Japonica Group]
gi|108706569|gb|ABF94364.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113547676|dbj|BAF11119.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|215692805|dbj|BAG88249.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767626|dbj|BAG99854.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 438
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 102/402 (25%), Positives = 158/402 (39%), Gaps = 40/402 (9%)
Query: 31 ARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCY 90
ARL +L K + A + ++ + + V +G P A+DT + W HC
Sbjct: 51 ARLLFLSSKAATAGVSSAPV--ASGQAPPSYVVRAGLGSPSQQLLLALDTSADATWAHCS 108
Query: 91 PCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKH------------RCIY 138
PC C ++F P+ SSSYA +PC S C F C A + C +
Sbjct: 109 PCGTCPSS--SLFAPANSSSYASLPCSSSWCPLFQGQACPAPQGGGDAAPPPATLPTCAF 166
Query: 139 TQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTN---RNFKFSGIFGLGIG 195
++ + + S K+ + + FGC S N G+ GLG G
Sbjct: 167 SKPFADASFQAALASDTLRLGKDA------IPNYTFGCVSSVTGPTTNMPRQGLLGLGRG 220
Query: 196 RSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH----Y 247
+L+SQ N FSYC+ S Y L LG G + H Y
Sbjct: 221 PMALLSQAGSLYNGVFSYCLPSYRS-YYFSGSLRLGAGGGQPRSVRYTPMLRNPHRSSLY 279
Query: 248 YITLEAISVDGRMLDINPNIFKRDDS-GGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEG 306
Y+ + +SV + + F D + G G ++DSGT +T Y ALR+E ++
Sbjct: 280 YVNVTGLSVGHAWVKVPAGSFAFDAATGAGTVVDSGTVITRWTAPVYAALREEFRRQVAA 339
Query: 307 EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALE-KDSMFYQPRPDAFCMAVN 365
+ C++ G P V H GG LAL ++++ + C+A+
Sbjct: 340 PSGYTSLGAFDTCFN-TDEVAAGGAPAVTVHMDGGVDLALPMENTLIHSSATPLACLAMA 398
Query: 366 PASIN-NRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
A N N VN +I + QQ V +D+ + F + C
Sbjct: 399 EAPQNVNSVVN--VIANLQQQNIRVVFDVANSRVGFAKESCN 438
>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 101/380 (26%), Positives = 167/380 (43%), Gaps = 56/380 (14%)
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
++N + + IG PP +D+GS++ +V C C C F P SS+Y+ V
Sbjct: 83 LTNGYYTTRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVK 142
Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
C+ + C + K++C Y + Y +S + + ++F ES + Q VFG
Sbjct: 143 CNVD-------CTCDSDKNQCTYERQYAEMSSSSGVLGEDIVSFGT--ESELKPQRAVFG 193
Query: 176 CGFSTNRNFKFS----GIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLI 225
C S + FS GI GLG G+ S++ QL SFS C G +
Sbjct: 194 CENSETGDL-FSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMD---------- 242
Query: 226 LGDGAIIDEGDATPLQFIDGH--------YYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
+G GA++ P I H Y I L+ + V G+ L ++P IF D G
Sbjct: 243 IGGGAMVLGAMPAPPGMIYTHSNAVRSPYYNIELKEMHVAGKALRVDPRIF---DGKHGT 299
Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK----LCYHGI---MSSDLKG 330
++DSGT +L ++A+ A +D V ++ ++ PD +C+ G +S +
Sbjct: 300 VLDSGTTYAYLPEQAFVAFKDAVSSQV--HPLKKIRGPDSNYKDICFAGAGRNVSQLSEV 357
Query: 331 FPTVRFHFRGGAKLALEKDSMFYQPRP--DAFCMAVNPASINNRYVNFSLIGMMAQQFYN 388
FP V F G KL+L ++ ++ A+C+ V N +L+G + +
Sbjct: 358 FPKVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGV----FQNGKDPTTLLGGIVVRNTL 413
Query: 389 VGYDIGRKQKTFQRMDCEVL 408
V YD ++ F + +C L
Sbjct: 414 VTYDRHNEKIGFWKTNCSEL 433
>gi|15241713|ref|NP_195839.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75181297|sp|Q9LZL3.1|PCS1L_ARATH RecName: Full=Aspartic proteinase PCS1; AltName: Full=Aspartic
protease 38; Short=AtASP38; AltName: Full=Protein EMBRYO
DEFECTIVE 24; AltName: Full=Protein PROMOTION OF CELL
SURVIVAL 1; Flags: Precursor
gi|7340693|emb|CAB82992.1| putative protein [Arabidopsis thaliana]
gi|50897174|gb|AAT85726.1| At5g02190 [Arabidopsis thaliana]
gi|53828617|gb|AAU94418.1| At5g02190 [Arabidopsis thaliana]
gi|110742159|dbj|BAE99007.1| hypothetical protein [Arabidopsis thaliana]
gi|332003059|gb|AED90442.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 453
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 112/385 (29%), Positives = 165/385 (42%), Gaps = 69/385 (17%)
Query: 70 PPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR-----YF 124
PP +DTGS L W+ C R +P F P+RSSSY+ +PC S CR +
Sbjct: 82 PPQNISMVIDTGSELSWLRCN--RSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRDFL 139
Query: 125 PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC-----GFS 179
A C + K C T Y + ++ E F N+ + +++FGC G
Sbjct: 140 IPASCDSDK-LCHATLSYADASSSEGNLAAEIFHFGNSTNDS----NLIFGCMGSVSGSD 194
Query: 180 TNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHD-PDYLHNKLILGDGAIIDEGD- 236
+ K +G+ G+ G S +SQ+ FSYCI D P + L+LGD
Sbjct: 195 PEEDTKTTGLLGMNRGSLSFISQMGFPKFSYCISGTDDFPGF----LLLGDSNFTWLTPL 250
Query: 237 --------ATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMIDSGTDVT 286
+TPL + D Y + L I V+G++L I ++ D +G G M+DSGT T
Sbjct: 251 NYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQTMVDSGTQFT 310
Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPD-------KLCYH----GIMSSDLKGFPTVR 335
+L+ Y ALR + R G + Y PD LCY I S L PTV
Sbjct: 311 FLLGPVYTALRSHFLNRTNG-ILTVYEDPDFVFQGTMDLCYRISPVRIRSGILHRLPTVS 369
Query: 336 FHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYV------NFSLIGMMA------ 383
F GA++A+ + Y+ V ++ N V N L+GM A
Sbjct: 370 LVFE-GAEIAVSGQPLLYR---------VPHLTVGNDSVYCFTFGNSDLMGMEAYVIGHH 419
Query: 384 -QQFYNVGYDIGRKQKTFQRMDCEV 407
QQ + +D+ R + ++C+V
Sbjct: 420 HQQNMWIEFDLQRSRIGLAPVECDV 444
>gi|212722554|ref|NP_001131154.1| uncharacterized protein LOC100192462 precursor [Zea mays]
gi|194690728|gb|ACF79448.1| unknown [Zea mays]
Length = 431
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 113/420 (26%), Positives = 169/420 (40%), Gaps = 35/420 (8%)
Query: 8 LSPYNN---PNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVN 64
LS Y+N P+ +P + ARL +L K S + + S S + V
Sbjct: 24 LSVYHNVHPPSPSPLESIIALARADDARLLFLSSKAASSGGITSAPVASGQTPPS-YVVR 82
Query: 65 ISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYF 124
+G P A+DT + W HC PC C G+ F P+ SSSYA +PC S+ C F
Sbjct: 83 AGLGTPVQQLLLALDTSADATWSHCAPCDTC--PAGSRFIPASSSSYASLPCASDWCPLF 140
Query: 125 PYARC------SAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
C SA C +++ + +TS S T + ++ + FGC
Sbjct: 141 EGQPCPANQDASAPLPACAFSKPFA---DTSFQASLGSDTLRLGKDA---IAGYAFGCVG 194
Query: 179 ST---NRNFKFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAI 231
+ N G+ GLG G SL+SQ N FSYC+ S Y L LG
Sbjct: 195 AVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSYCLPSYRS-YYFSGSLRLGAAGQ 253
Query: 232 IDEGDATPLQFIDGH----YYITLEAISVDGRMLDINPNIFKRDD-SGGGVMIDSGTDVT 286
TPL + H YY+ + +SV + + F D +G G +IDSGT +T
Sbjct: 254 PRNVRYTPL-LTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTVIT 312
Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
Y ALR+E ++ + C++ G P V H GG L L
Sbjct: 313 RWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFN-TDEVAAGGAPPVTLHMDGGVDLTL 371
Query: 347 E-KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
++++ + C+A+ A N +++ + QQ V D+ + F R C
Sbjct: 372 PMENTLIHSSATPLACLAMAEAP-QNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 430
>gi|194698750|gb|ACF83459.1| unknown [Zea mays]
gi|194703964|gb|ACF86066.1| unknown [Zea mays]
gi|219886221|gb|ACL53485.1| unknown [Zea mays]
gi|219886359|gb|ACL53554.1| unknown [Zea mays]
gi|223950085|gb|ACN29126.1| unknown [Zea mays]
gi|414865218|tpg|DAA43775.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 431
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 113/420 (26%), Positives = 169/420 (40%), Gaps = 35/420 (8%)
Query: 8 LSPYNN---PNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVN 64
LS Y+N P+ +P + ARL +L K S + + S S + V
Sbjct: 24 LSVYHNVHPPSPSPLESIIALARADDARLLFLSSKAASSGGVTSAPVASGQTPPS-YVVR 82
Query: 65 ISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYF 124
+G P A+DT + W HC PC C G+ F P+ SSSYA +PC S+ C F
Sbjct: 83 AGLGTPVQQLLLALDTSADATWSHCAPCDTC--PAGSRFIPASSSSYASLPCASDWCPLF 140
Query: 125 PYARC------SAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
C SA C +++ + +TS S T + ++ + FGC
Sbjct: 141 EGQPCPANQDASAPLPACAFSKPFA---DTSFQASLGSDTLRLGKDA---IAGYAFGCVG 194
Query: 179 ST---NRNFKFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAI 231
+ N G+ GLG G SL+SQ N FSYC+ S Y L LG
Sbjct: 195 AVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSYCLPSYRS-YYFSGSLRLGAAGQ 253
Query: 232 IDEGDATPLQFIDGH----YYITLEAISVDGRMLDINPNIFKRDD-SGGGVMIDSGTDVT 286
TPL + H YY+ + +SV + + F D +G G +IDSGT +T
Sbjct: 254 PRNVRYTPL-LTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTVIT 312
Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
Y ALR+E ++ + C++ G P V H GG L L
Sbjct: 313 RWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFN-TDEVAAGGAPPVTLHMDGGVDLTL 371
Query: 347 E-KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
++++ + C+A+ A N +++ + QQ V D+ + F R C
Sbjct: 372 PMENTLIHSSATPLACLAMAEAP-QNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 430
>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 494
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 114/380 (30%), Positives = 156/380 (41%), Gaps = 43/380 (11%)
Query: 48 AQILPSND---ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIF 103
A LP+ D I + ++V + +G P DTGS L W C PC + C Q IF
Sbjct: 137 ATTLPAKDGSIIGSGNYFVTVGLGTPKKDFSLIFDTGSDLTWTQCEPCVKSCYNQKEAIF 196
Query: 104 YPSRSSSYAYVPCDSEHCRYFPYARCSAY---KHRCIYTQLYLIGPETSVFVSTEQLTFK 160
PS+S+SYA + C S C A + + C+Y Y + F E+L+
Sbjct: 197 NPSQSTSYANISCGSTLCDSLASATGNIFNCASSTCVYGIQYGDSSFSIGFFGKEKLSLT 256
Query: 161 NTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRS--SLVSQL----NSSFSYCIGSL 214
TD D FGCG N+ LG+GR SLVSQ N FSYC L
Sbjct: 257 ATDV----FNDFYFGCG-QNNKGLFGGAAGLLGLGRDKLSLVSQTAQRYNKIFSYC---L 308
Query: 215 HDPDYLHNKLILGDGAIIDEGDATPLQFIDG---HYYITLEAISVDGRMLDINPNIFKRD 271
L G G+ TPL I G Y + L ISV GR L I+P++F
Sbjct: 309 PSSSSSTGFLTFG-GSTSKSASFTPLATISGGSSFYGLDLTGISVGGRKLAISPSVF--- 364
Query: 272 DSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL-----CYHGIMSS 326
S G +IDSGT +T L AY AL + M Y L C+ +
Sbjct: 365 -STAGTIIDSGTVITRLPPAAYSALSSTFR-----KLMSQYPAAPALSILDTCFD-FSNH 417
Query: 327 DLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQF 386
D P + F GG + ++K +FY C+A + N+ + ++ G + Q+
Sbjct: 418 DTISVPKIGLFFSGGVVVDIDKTGIFYVNDLTQVCLAF---AGNSDASDVAIFGNVQQKT 474
Query: 387 YNVGYDIGRKQKTFQRMDCE 406
V YD + F C
Sbjct: 475 LEVVYDGAAGRVGFAPAGCS 494
>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
Length = 506
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 100/391 (25%), Positives = 161/391 (41%), Gaps = 57/391 (14%)
Query: 25 GINISIARLAYLQEKIRSHNTYQAQILPSNDIS----NSLFYVNISIGQPPVPQFTAMDT 80
G NIS R + R A LP + L++ I +G PP + +DT
Sbjct: 50 GANISALRA---HDGRRHGRLLAAADLPLGGLGLPTDTGLYFTEIKLGTPPKRYYVQVDT 106
Query: 81 GSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYVPCDSEHCRYFPYAR---CSAY 132
GS +LWV+C C C + G T + P SSS + V CD C + C+A
Sbjct: 107 GSDILWVNCISCSKCPRKSGLGLDLTFYDPKASSSGSTVSCDQGFCAATYGGKLPGCTA- 165
Query: 133 KHRCIYTQLYLIGPETSVFVSTEQLTFKNT---DESTIHVQDVVFGCGFST-----NRNF 184
C Y+ +Y G T+ F T+ L F ++ + FGCG N N
Sbjct: 166 NVPCEYSVMYGDGSSTTGFFITDALQFDQVTGDGQTQPGNATITFGCGAQQGGDLGNSNQ 225
Query: 185 KFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
GI G G +S++SQL ++ F++C+ D + I G ++
Sbjct: 226 ALDGILGFGQANTSMLSQLAAAGKAKKIFAHCL------DTIKGGGIFAIGNVVQPKCYF 279
Query: 239 PLQFIDG-----------------HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDS 281
F G HY + L++I V G L + ++F+ + G +IDS
Sbjct: 280 VFFFAHGLLNIPLFLLVMILLSRPHYNVNLKSIDVGGTTLQLPAHVFETGEK-KGTIIDS 338
Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
GT +T+L + ++ + D V + + ++ D LC+ S D GFPT+ FHF
Sbjct: 339 GTTLTYLPELVFKQVMDVVFSKH--RDIAFHNLQDFLCFQYSGSVD-DGFPTITFHFEDD 395
Query: 342 AKLALEKDSMFYQPRPDAFCMAVNPASINNR 372
L + F+ D +C+ ++ ++
Sbjct: 396 LALHVYPHEYFFPNGNDIYCVGFQNGALQSK 426
>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 467
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 112/434 (25%), Positives = 179/434 (41%), Gaps = 65/434 (14%)
Query: 22 VQRGINISIARLAYLQEKIRSHNTYQAQILPSNDI--SNSLFYVNISIGQPPVPQFTAMD 79
++R I S RLA + ++ ++ ++ + + + V + +G P A+D
Sbjct: 47 LRRAIQRSRDRLASIAPRLLPTSSRNKVVVAEAPVLSAGGEYLVKLGLGTPQHCFTAAID 106
Query: 80 TGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARC-----SAYKH 134
T S L+W C PC C QL +F P S+SYA VPC+S+ C RC S +
Sbjct: 107 TASDLIWTQCQPCVKCYKQLDPVFNPVASTSYAVVPCNSDTCDELDTHRCARDGDSDDED 166
Query: 135 RCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST--NRNFKFSGIFGL 192
C YT Y T ++ ++L + + VVFGC S+ + SG+ GL
Sbjct: 167 ACQYTYSYGGNATTRGILAVDRLAIGDD-----VFRGVVFGCSSSSVGGPPPQVSGVVGL 221
Query: 193 GIGRSSLVSQLN-SSFSYCIGSLHDP-DYLHNKLILGDGAIIDEGDATPLQFI------- 243
G G SLVSQL+ F YC L P +L+LG A +A+ +
Sbjct: 222 GRGALSLVSQLSVRRFMYC---LPPPVSRSAGRLVLGADAAATVRNASERVVVPMSTGSR 278
Query: 244 -DGHYYITLEAISVDGRMLDINP----NIFKRDDSGG----------------------G 276
+YY+ L+ IS+ R + N + G G
Sbjct: 279 YPSYYYLNLDGISIGDRAMSFRSRNRMNATTPGTAAGAPASPVSGSGDGDGSGTGPDAYG 338
Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCY---HGIMSSDLKGFPT 333
++ID + +T+L + YE + D++ + + LC+ G+ S + P
Sbjct: 339 MIIDIASTITFLEESLYEEMVDDLEEEIRLPRGSGSDLGLDLCFILPEGVPMSRVYA-PP 397
Query: 334 VRFHFRGGAKLALEKDSMFYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
V F G L L+K+ MF + R C+ V + S++G QQ V Y+
Sbjct: 398 VSLAFE-GVWLRLDKEQMFVEDRASGMMCLMV------GKTDGVSILGNYQQQNMQVMYN 450
Query: 393 IGRKQKTFQRMDCE 406
+ R + TF + CE
Sbjct: 451 LRRGRITFIKTACE 464
>gi|125542690|gb|EAY88829.1| hypothetical protein OsI_10302 [Oryza sativa Indica Group]
Length = 440
Score = 108 bits (269), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 102/401 (25%), Positives = 158/401 (39%), Gaps = 40/401 (9%)
Query: 31 ARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCY 90
ARL +L K + A + ++ + + V +G P A+DT + W HC
Sbjct: 53 ARLLFLSSKAATAGVSSAPV--ASGQAPPSYVVRAGLGSPSQQLLLALDTSADATWAHCS 110
Query: 91 PCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKH------------RCIY 138
PC C ++F P+ SSSYA +PC S C F C A + C +
Sbjct: 111 PCGTCPSS--SLFAPANSSSYASLPCSSSWCPLFQGQACPAPQGGGDAAPPPATLPTCAF 168
Query: 139 TQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTN---RNFKFSGIFGLGIG 195
++ + + S K+ + + FGC S N G+ GLG G
Sbjct: 169 SKPFADASFQAALASDTLRLGKDA------IPNYTFGCVSSVTGPTTNMPRQGLLGLGRG 222
Query: 196 RSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH----Y 247
+L+SQ N FSYC+ S Y L LG G + H Y
Sbjct: 223 PMALLSQAGSLYNGVFSYCLPSYRS-YYFSGSLRLGAGGGQPRSVRYTPMLRNPHRSSLY 281
Query: 248 YITLEAISVDGRMLDINPNIFKRDD-SGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEG 306
Y+ + +SV + + F D +G G ++DSGT +T Y ALR+E ++
Sbjct: 282 YVNVTGLSVGRAWVKVPAGSFAFDAATGAGTVVDSGTVITRWTAPVYAALREEFRRQVAA 341
Query: 307 EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALE-KDSMFYQPRPDAFCMAVN 365
+ C++ G P V H GG LAL ++++ + C+A+
Sbjct: 342 PSGYTSLGAFDTCFN-TDEVAAGGAPAVTVHMDGGVDLALPMENTLIHSSATPLACLAMA 400
Query: 366 PASIN-NRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
A N N VN +I + QQ V +D+ + F + C
Sbjct: 401 EAPQNVNSVVN--VIANLQQQNIRVVFDVANSRIGFAKESC 439
>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 463
Score = 108 bits (269), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 106/372 (28%), Positives = 168/372 (45%), Gaps = 35/372 (9%)
Query: 55 DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSYAY 113
I + +YV I +G P +DTGSSL W+ C PC C Q+ IF PS S +Y
Sbjct: 107 SIGSGNYYVKIGLGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSTSKTYKA 166
Query: 114 VPCDSEHCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIH 168
+PC S C + CS C+Y Y + ++S + LT ++ +
Sbjct: 167 LPCSSSQCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYLSQDVLTLTPSEAPS-- 224
Query: 169 VQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLN----SSFSYCI---GSLHDPDYL 220
V+GCG F + SGI GL + S++ QL+ ++FSYC+ S + L
Sbjct: 225 -SGFVYGCGQDNQGLFGRSSGIIGLANDKISMLGQLSKKYGNAFSYCLPSSFSAPNSSSL 283
Query: 221 HNKLILGDGAIIDEG-DATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
L +G ++ TPL Q I Y++ L I+V G+ L ++ + +
Sbjct: 284 SGFLSIGASSLTSSPYKFTPLVKNQKIPSLYFLDLTTITVAGKPLGVSASSYNVP----- 338
Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGE--QMRSYSWPDKLCYHGIMSSDLKGFPTV 334
+IDSGT +T L Y AL+ ++ + + Q +S D C+ G + ++ P +
Sbjct: 339 TIIDSGTVITRLPVAVYNALKKSFVLIMSKKYAQAPGFSILDT-CFKGSV-KEMSTVPEI 396
Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
+ FRGGA L L+ + + C+A+ AS N S+IG QQ + V YD+
Sbjct: 397 QIIFRGGAGLELKAHNSLVEIEKGTTCLAI-AASSN----PISIIGNYQQQTFKVAYDVA 451
Query: 395 RKQKTFQRMDCE 406
+ F C+
Sbjct: 452 NFKIGFAPGGCQ 463
>gi|449445943|ref|XP_004140731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 430
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 102/382 (26%), Positives = 171/382 (44%), Gaps = 44/382 (11%)
Query: 57 SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP- 115
S++ V++ IG PP P +DTGS L W+ C+ + +L + P +S +
Sbjct: 62 SSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHD-KKVKKRLPPLPKPKTASFDPSLSS 120
Query: 116 ------CDSEHCRY----FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES 165
C+ C+ F C Y+ Y G + E+ TF +
Sbjct: 121 SFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKS--- 177
Query: 166 TIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSL--HDPDYLHN 222
+ V+ GC ++ N GI G+ GR S +SQ S FSYC+ S +P L
Sbjct: 178 -LSTPPVILGCAQASTEN---RGILGMNHGRLSFISQAKISKFSYCVPSRTGSNPTGL-- 231
Query: 223 KLILGDGAIIDEGD-ATPLQFIDGH---------YYITLEAISVDGRMLDINPNIFKRDD 272
LGD + T L F + Y + ++AI + G+ L+I P FK D
Sbjct: 232 -FYLGDNPNSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDA 290
Query: 273 SGGG-VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS-YSWPD--KLCYHGIMSSDL 328
G G MIDSG+D+T+LV EAYE +++EV +RL G M+ Y + D +C+ +++++
Sbjct: 291 GGSGQTMIDSGSDLTYLVDEAYEKVKEEV-VRLVGAMMKKGYVYADVADMCFDAGVTAEV 349
Query: 329 -KGFPTVRFHFRGGAKLALEK-DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQF 386
+ + F F G ++ + + + + + C+ + + + ++IG + QQ
Sbjct: 350 GRRIGGISFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRS--ERLGIGSNIIGTVHQQN 407
Query: 387 YNVGYDIGRKQKTFQRMDCEVL 408
V YD+ K+ F +C L
Sbjct: 408 MWVEYDLANKRVGFGGAECSRL 429
>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 517
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 101/351 (28%), Positives = 149/351 (42%), Gaps = 27/351 (7%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYVPCDSE 119
+ V + +G P DTGS WV C PC C Q +F P RSS+YA V C +
Sbjct: 178 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPVRSSTYANVSCAAP 237
Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
C CS C+Y Y G + F + + LT + D V+ FGCG
Sbjct: 238 ACSDLNIHGCSG--GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDA----VKGFRFGCGER 291
Query: 180 TNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLH-DPDYLHNKLILGDGAIID 233
F + +G+ GLG G++SL Q F++C+ + YL G A
Sbjct: 292 NEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTGTGYL--DFGAGSPAAAS 349
Query: 234 EGDATPLQFIDG--HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKE 291
TP+ +G YYI + I V G++L I ++F G ++DSGT +T L
Sbjct: 350 ARLTTPMLTDNGPTFYYIGMTGIRVGGQLLSIPQSVFAT----AGTIVDSGTVITRLPPP 405
Query: 292 AYEALRDEVMIRLEGEQMRSYSWPDKL--CYHGIMSSDLKGFPTVRFHFRGGAKLALEKD 349
AY +LR + + L CY S + PTV F+GGA+L ++
Sbjct: 406 AYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVA-IPTVSLLFQGGARLDVDAS 464
Query: 350 SMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
+ Y C+A + N + ++G + + V YDIG+K F
Sbjct: 465 GIMYAASASQVCLAF---AANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGF 512
>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 458
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 100/370 (27%), Positives = 158/370 (42%), Gaps = 39/370 (10%)
Query: 52 PSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DCSPQLGTIFYPSRSSS 110
P + + + +G P +DTGSSL W+ C PC C Q G +F P SS+
Sbjct: 113 PGASVGVGNYVTRMGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSST 172
Query: 111 YAYVPCDSEHCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES 165
YA V C ++ C P A CS+ + CIY Y + ++S + ++F +T
Sbjct: 173 YASVGCSAQQCSDLPSATLNPSACSS-SNVCIYQASYGDSSFSVGYLSKDTVSFGSTS-- 229
Query: 166 TIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYL 220
+ + +GCG F + +G+ GL + SL+ QL SF+YC+ P
Sbjct: 230 ---LPNFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFTYCL-----PSSS 281
Query: 221 HNKLILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
+ + + TP+ D Y+I L ++V G L ++ + + +
Sbjct: 282 SSGYLSLGSYNPGQYSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLPT---- 337
Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRF 336
+IDSGT +T L Y AL V ++G + +YS D C+ G S P V
Sbjct: 338 IIDSGTVITRLPTSVYSALSKAVAAAMKGTSRASAYSILDT-CFKGQASR--VSAPAVTM 394
Query: 337 HFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
F GGA L L ++ C+A PA + ++IG QQ ++V YD+
Sbjct: 395 SFAGGAALKLSAQNLLVDVDDSTTCLAFAPAR------SAAIIGNTQQQTFSVVYDVKSS 448
Query: 397 QKTFQRMDCE 406
+ F C
Sbjct: 449 RIGFAAGGCS 458
>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 640
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 100/377 (26%), Positives = 160/377 (42%), Gaps = 54/377 (14%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
N + + IG PP +DTGS++ +V C C C F P S +Y V C
Sbjct: 86 NGYYTTRLWIGTPPQRFALIVDTGSTVTYVPCSTCEHCGRHQDPKFQPDLSETYQPVKCT 145
Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
P C ++C+Y + Y +S + + ++F N E + Q VFGC
Sbjct: 146 -------PDCNCDGDTNQCMYDRQYAEMSSSSGVLGEDVVSFGNLSE--LAPQRAVFGCE 196
Query: 178 FSTNRNF---KFSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLILGD 228
+ + GI GLG G S++ QL + SFS C G + +G
Sbjct: 197 NDETGDLYSQRADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMD----------VGG 246
Query: 229 GAIIDEGDATPLQFIDGH--------YYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
GA+I G + P + H Y I L+ + V G+ L +NP +F D G ++D
Sbjct: 247 GAMILGGISPPEDMVFTHSDPDRSPYYNINLKEMHVAGKKLQLNPKVF---DGKHGTVLD 303
Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD----KLCYHGI---MSSDLKGFPT 333
SGT +L + A+ A + +M E ++ + PD +C+ G +S K FP
Sbjct: 304 SGTTYAYLPETAFLAFKRAIM--KERNSLKQINGPDPNYKDICFTGAGIDVSQLAKSFPV 361
Query: 334 VRFHFRGGAKLALEKDSMFYQPRP--DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
V F G KL+L ++ ++ A+C+ V +N +L+G + + V Y
Sbjct: 362 VDMVFENGHKLSLSPENYLFRHSKVRGAYCLGV----FSNGRDPTTLLGGIFVRNTLVMY 417
Query: 392 DIGRKQKTFQRMDCEVL 408
D + F + +C L
Sbjct: 418 DRENSKIGFWKTNCSEL 434
>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 101/380 (26%), Positives = 167/380 (43%), Gaps = 56/380 (14%)
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
++N + + IG PP +D+GS++ +V C C C F P SS+Y+ V
Sbjct: 83 LTNGYYTTRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVK 142
Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
C+ + C + K++C Y + Y +S + + ++F ES + Q VFG
Sbjct: 143 CNVD-------CTCDSDKNQCTYERQYAEMSSSSGVLGEDIVSFGT--ESELKPQRAVFG 193
Query: 176 CGFSTNRNFKFS----GIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLI 225
C S + FS GI GLG G+ S++ QL SFS C G +
Sbjct: 194 CENSETGDL-FSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMD---------- 242
Query: 226 LGDGAIIDEGDATPLQFIDGH--------YYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
+G GA++ P I H Y I L+ + V G+ L ++P IF D G
Sbjct: 243 IGGGAMVLGAMPAPPGMIYTHSNAVRSPYYNIELKEMHVAGKALRVDPRIF---DGKHGT 299
Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK----LCYHGI---MSSDLKG 330
++DSGT +L ++A+ A +D V ++ ++ PD +C+ G +S +
Sbjct: 300 VLDSGTTYAYLPEQAFVAFKDAVSSQV--HPLKKIRGPDPNYKDICFAGAGRNVSQLSEV 357
Query: 331 FPTVRFHFRGGAKLALEKDSMFYQPRP--DAFCMAVNPASINNRYVNFSLIGMMAQQFYN 388
FP V F G KL+L ++ ++ A+C+ V N +L+G + +
Sbjct: 358 FPKVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGV----FQNGKDPTTLLGGIVVRNTL 413
Query: 389 VGYDIGRKQKTFQRMDCEVL 408
V YD ++ F + +C L
Sbjct: 414 VTYDRHNEKIGFWKTNCSEL 433
>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
Length = 510
Score = 107 bits (268), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 114/411 (27%), Positives = 176/411 (42%), Gaps = 41/411 (9%)
Query: 23 QRGINISIARLAYLQEKIRSHNTYQAQILPSN-DISNSLFYVNISIGQPPVPQFTAMDTG 81
+R +AR+ R+ + + S + + + +++ +G PP MDTG
Sbjct: 110 RRAARSGVARMPASSSPRRALSERMVATVESGVAVGSGEYLIDVYVGTPPRRFRMIMDTG 169
Query: 82 SSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYF-----PYARCSAYKHRC 136
S L W+ C PC DC Q G +F P+ SSSY V C + C P A + C
Sbjct: 170 SDLNWLQCAPCLDCFEQRGPVFDPAASSSYRNVTCGDQRCGLVAPPEAPRACRRPAEDSC 229
Query: 137 IYTQLYLIGPETSVFVSTEQLTFKNTDE-STIHVQDVVFGCGFSTNRNF-KFSGIFGLGI 194
Y Y T+ ++ E T T ++ V VVFGCG F +G+ GLG
Sbjct: 230 PYYYWYGDQSNTTGDLALESFTVNLTAPGASRRVDGVVFGCGHRNRGLFHGAAGLLGLGR 289
Query: 195 GRSSLVSQLNS----SFSYCIGSLHDPD------YLHNKLILGDGAIIDEGDATPLQFID 244
G S SQL + +FSYC+ H D + + L+L + A D
Sbjct: 290 GPLSFASQLRAVYGHTFSYCLVE-HGSDAGSKVVFGEDYLVLAHPQLKYTAFAPTSSPAD 348
Query: 245 GHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRD---EV 300
YY+ L+ + V G +L+I+ + + D GG +IDSGT +++ V+ AY+ +R ++
Sbjct: 349 TFYYVKLKGVLVGGDLLNISSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRQAFVDL 408
Query: 301 MIRLEGEQMRSYSW-PD----KLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQP 355
M RL Y PD CY+ + + P + F GA ++ F +
Sbjct: 409 MSRL-------YPLIPDFPVLNPCYN-VSGVERPEVPELSLLFADGAVWDFPAENYFVRL 460
Query: 356 RPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
PD C+AV S+IG QQ ++V YD+ + F C
Sbjct: 461 DPDGIMCLAVR----GTPRTGMSIIGNFQQQNFHVVYDLQNNRLGFAPRRC 507
>gi|224128838|ref|XP_002320434.1| predicted protein [Populus trichocarpa]
gi|222861207|gb|EEE98749.1| predicted protein [Populus trichocarpa]
Length = 485
Score = 107 bits (268), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 94/372 (25%), Positives = 167/372 (44%), Gaps = 38/372 (10%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLG---TIFYPSRSSSYAYV 114
L+Y I IG P + +DTGS ++WV+C CR+C + LG T++ + S + V
Sbjct: 77 LYYAKIGIGTPTKDYYVQVDTGSDIMWVNCIQCRECPKTSSLGIDLTLYNINESDTGKLV 136
Query: 115 PCDSEHCRYFPYAR---CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD---ESTIH 168
PCD E C + C+A C Y ++Y G T+ + + + + ++T
Sbjct: 137 PCDQEFCYEINGGQLPGCTA-NMSCPYLEIYGDGSSTAGYFVKDVVQYARVSGDLKTTAA 195
Query: 169 VQDVVFGCG------FSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHD 216
V+FGCG ++ GI G G SS++SQL + F++C+
Sbjct: 196 NGSVIFGCGARQSGDLGSSNEEALDGILGFGKSNSSMISQLAVTGKVKKIFAHCL----- 250
Query: 217 PDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
D + I G ++ + + TPL HY + + A+ V L + ++F+ D G
Sbjct: 251 -DGTNGGGIFVIGHVVQPKVNMTPLIPNQPHYNVNMTAVQVGHEFLSLPTDVFEAGDRKG 309
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK-LCYHGIMSSDLKGFPTV 334
+ IDSGT + +L + Y+ L +++ + ++ ++ D+ C+ S D GFP V
Sbjct: 310 AI-IDSGTTLAYLPEMVYKPLVSKIIS--QQPDLKVHTVRDEYTCFQYSDSLD-DGFPNV 365
Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR-YVNFSLIGMMAQQFYNVGYDI 393
FHF L + + P +C+ + + +R N +L+G + V YD+
Sbjct: 366 TFHFENSVILKVYPHEYLF-PFEGLWCIGWQNSGVQSRDRRNMTLLGDLVLSNKLVLYDL 424
Query: 394 GRKQKTFQRMDC 405
+ + +C
Sbjct: 425 ENQAIGWTEYNC 436
>gi|222637181|gb|EEE67313.1| hypothetical protein OsJ_24553 [Oryza sativa Japonica Group]
Length = 414
Score = 107 bits (268), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 95/339 (28%), Positives = 153/339 (45%), Gaps = 39/339 (11%)
Query: 94 DCSPQLGTIFYPSRSSSYAYVPCDSEHCRYF--PYARCSAYKHRCIYTQLYLIGPETSVF 151
+C+ + F P+ SS+++ +PC S C++ PY C+A C+Y Y +G T+ +
Sbjct: 87 ECAARPAPPFQPASSSTFSKLPCASSLCQFLTSPYLTCNATG--CVYYYPYGMG-FTAGY 143
Query: 152 VSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYC 210
++TE L V FGC SGI GLG SLVSQ+ FSYC
Sbjct: 144 LATETLHVGGAS-----FPGVAFGCSTENGVGNSSSGIVGLGRSPLSLVSQVGVGRFSYC 198
Query: 211 IGSLHDPDYLHNKLILGDGAIIDEGDATPLQFID------GHYYITLEAISVDGRMLDIN 264
+ S D D + ++ G A + G ++P + +YY+ L I+V L +
Sbjct: 199 LRS--DADAGDSPILFGSLAKVTGGKSSPAILENPEMPSSSYYYVNLTGITVGATDLPVT 256
Query: 265 PNIF---KRDDSG--GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK-- 317
F + +G GG ++DSGT +T+LVKE Y ++ + ++ + + +
Sbjct: 257 STTFGFTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFG 316
Query: 318 --LCYHGIMSSDLKG--FPTVRFHFRGGAKLALEKDSMF------YQPRPDAFCMAVNPA 367
LC+ + G PT+ F GGA+ A+ + S Q R C+ V PA
Sbjct: 317 FDLCFDANAAGGGSGVPVPTLVLRFAGGAEYAVRRRSYVGVVEVDSQGRAAVECLLVLPA 376
Query: 368 SINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
S ++ S+IG + Q +V YD+ +F DC
Sbjct: 377 S---EKLSISIIGNVMQMDLHVLYDLDGGMFSFAPADCA 412
>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 107 bits (268), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 107/395 (27%), Positives = 165/395 (41%), Gaps = 43/395 (10%)
Query: 15 NENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQ 74
+E +QR ++ AR A + + + A + + + + V +S+G P V Q
Sbjct: 97 DEQRVEYIQRRVSGGGARGAKGALQQLATGSRSATVPTTMGVGTFQYVVTVSLGTPGVSQ 156
Query: 75 FTAMDTGSSLLWVHCYPCR--DCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAY 132
+DTGS + WV C PC C+ Q +F P++SS+Y+ VPC ++ C
Sbjct: 157 TVEVDTGSDVSWVQCKPCSAPACNSQRDQLFDPAKSSTYSAVPCGADACSELRIYEAGCS 216
Query: 133 KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFK-FSGIFG 191
+C Y Y G T+ ++ L + V +FGCG + F G+
Sbjct: 217 GSQCGYVVSYGDGSNTTGVYGSDTLALAPGNT----VGTFLFGCGHAQAGMFAGIDGLLA 272
Query: 192 LGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFID--- 244
LG SL SQ + FSYC+ S L LG G G AT
Sbjct: 273 LGRQSMSLKSQAAGAYGGVFSYCLPSKQS---AAGYLTLG-GPSSASGFATTGLLTAWAA 328
Query: 245 -GHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEV--M 301
Y + L ISV G+ + + + F GG ++D+GT +T L AY ALR
Sbjct: 329 PTFYMVMLTGISVGGQQVAVPASAFA-----GGTVVDTGTVITRLPPTAYAALRSAFRGA 383
Query: 302 IRLEGEQMRSYSWPDKLCY----HGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP 357
I G + CY +G+++ PTV F GGA LALE +
Sbjct: 384 IAPCGYPSAPANGILDTCYDFSRYGVVT-----LPTVALTFSGGATLALEAPGIL----- 433
Query: 358 DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
+ C+A P N + +++G + Q+ + V +D
Sbjct: 434 SSGCLAFAP---NGGDGDAAILGNVQQRSFAVRFD 465
>gi|449485448|ref|XP_004157171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 430
Score = 107 bits (268), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 101/382 (26%), Positives = 171/382 (44%), Gaps = 44/382 (11%)
Query: 57 SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP- 115
S++ V++ IG PP P +DTGS L W+ C+ + +L + P +S +
Sbjct: 62 SSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHD-KKIKKRLPPLPKPKTTSFDPSLSS 120
Query: 116 ------CDSEHCRY----FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES 165
C+ C+ F C Y+ Y G + E+ TF +
Sbjct: 121 SFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKS--- 177
Query: 166 TIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSL--HDPDYLHN 222
+ V+ GC ++ N GI G+ GR S +SQ S FSYC+ S +P L
Sbjct: 178 -LSTPPVILGCAQASTEN---RGILGMNRGRLSFISQAKISKFSYCVPSRTGSNPTGL-- 231
Query: 223 KLILGDGAIIDEGD-ATPLQFIDGH---------YYITLEAISVDGRMLDINPNIFKRDD 272
LGD + T L F + Y + ++AI + G+ L++ P FK D
Sbjct: 232 -FYLGDNPNSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNVPPAAFKPDA 290
Query: 273 SGGG-VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS-YSWPD--KLCYHGIMSSDL 328
G G MIDSG+D+T+LV EAYE +++EV +RL G M+ Y + D +C+ +++++
Sbjct: 291 GGSGQTMIDSGSDLTYLVDEAYEKVKEEV-VRLVGAMMKKGYVYADVADMCFDAGVTAEV 349
Query: 329 -KGFPTVRFHFRGGAKLALEK-DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQF 386
+ + F F G ++ + + + + + C+ + + + ++IG + QQ
Sbjct: 350 GRRIGGISFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRS--ERLGIGSNIIGTVHQQN 407
Query: 387 YNVGYDIGRKQKTFQRMDCEVL 408
V YD+ K+ F +C L
Sbjct: 408 MWVEYDLANKRVGFGGAECSRL 429
>gi|326521034|dbj|BAJ92880.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 448
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 107/390 (27%), Positives = 162/390 (41%), Gaps = 33/390 (8%)
Query: 31 ARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCY 90
+RL YL + Y + + V +G PP A+DT + W+ C
Sbjct: 78 SRLLYLDSLAVAGRAYAPIASGRQLLQTPTYVVRARLGTPPQQLLLAVDTSNDAAWIPCS 137
Query: 91 PCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSV 150
C C T F P+ S SY VPC S C P CS C ++ Y ++S+
Sbjct: 138 GCAGC--PTTTPFNPAASKSYRAVPCGSPACSRAPNPSCSLNTKSCGFSLTYA---DSSL 192
Query: 151 FVSTEQLTFKNTDESTIHVQDVVFGC-GFSTNRNFKFSGIFGLGIGRSSLVSQLN----S 205
+ Q + ++ V+ FGC +T G+ GLG G S +SQ
Sbjct: 193 EAALSQDSLAVANDV---VKSYTFGCLQKATGTATPPQGLLGLGRGPLSFLSQTKDMYEG 249
Query: 206 SFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH----YYITLEAISVDGRML 261
+FSYC+ S ++ L LG TPL ++ H YY+++ I V +++
Sbjct: 250 TFSYCLPSFKSLNF-SGTLRLGRKGQPLRIKTTPL-LVNPHRSSLYYVSMTGIRVGKKVV 307
Query: 262 DINPNIFKRDD-SGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCY 320
I P D +G G ++DSGT T LV AY A+RDEV R+ G + S D CY
Sbjct: 308 PIPPAALAFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEVRRRIRGAPLSSLGGFDT-CY 366
Query: 321 HGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQP---RPDAFCMAVNPASINNRYVNFS 377
+ + +P V F F G ++ L D++ MA P +N +
Sbjct: 367 NTTVK-----WPPVTFMFT-GMQVTLPADNLVIHSTYGTTSCLAMAAAPDGVNTV---LN 417
Query: 378 LIGMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
+I M QQ + + +D+ + F R C
Sbjct: 418 VIASMQQQNHRILFDVPNGRVGFAREQCTA 447
>gi|356540369|ref|XP_003538662.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Glycine max]
Length = 364
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 107/362 (29%), Positives = 168/362 (46%), Gaps = 42/362 (11%)
Query: 57 SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPC 116
+N + + +++G PPV + +DT S L+W C PC+ C Q +F P +
Sbjct: 27 NNGDYLMKLTLGTPPVDVYGLVDTDSDLVWAQCTPCQGCYKQKNPMFDPLK--------- 77
Query: 117 DSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
C F CS K C Y Y T ++ E TF +TD I V+ ++FGC
Sbjct: 78 ---ECNSFFDHSCSPEK-ACDYVYAYADDSATKGMLAKEIATFSSTDGKPI-VESIIFGC 132
Query: 177 GFSTNRNFKFSGI--FGLGIGRSSLVSQL-----NSSFSYCIGSLHDPDYLHNKLILGDG 229
G + F + + GLG G SLVSQ+ + FS C+ H + + LG+
Sbjct: 133 GHNNTGVFNENDMGLIGLGGGPLSLVSQMGNLYGSKRFSQCLVPFHADPHTSGTISLGEA 192
Query: 230 A-IIDEG-DATPLQFIDGH--YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDV 285
+ + EG TPL +G Y +TLE ISV + N + S G +MIDSGT
Sbjct: 193 SDVSGEGVVTTPLVSEEGQTPYLVTLEGISVGDTFVPFNSSEML---SKGNIMIDSGTPE 249
Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPD---KLCYHGIMSSDLKGFPTVRFHFRGGA 342
T+L +E Y+ L +E+ +++ + + PD +LCY ++L+G P + HF GA
Sbjct: 250 TYLPQEFYDRLVEELKVQINLPPI--HVDPDLGTQLCYKS--ETNLEG-PILTAHFE-GA 303
Query: 343 KLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
+ L F P+ FC A+ + + Y+ G AQ +G+D+ ++ F+
Sbjct: 304 DVKLLPLQTFIPPKDGVFCFAMT-GTTDGLYI----FGNFAQSNVLIGFDLDKRIVFFKP 358
Query: 403 MD 404
D
Sbjct: 359 TD 360
>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 439
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 113/426 (26%), Positives = 171/426 (40%), Gaps = 43/426 (10%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISI-------ARLAYLQEKIRSHNTYQAQILPS 53
+IH SP+N H+ +N I AR+ YL + S I
Sbjct: 37 VIHVYGQCSPFNQ------HKAGSWVNTVINMASKDPARVTYLSSLVASPKATSVPIASG 90
Query: 54 NDISN-SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYA 112
+ N + V + +G P F +DT WV C C CS F P+ SS+YA
Sbjct: 91 QQVLNIGNYVVRVKLGTPGQLMFMVLDTSRDAAWVPCADCAGCS---SPTFSPNTSSTYA 147
Query: 113 YVPCDSEHCRYFPYARC-SAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
+ C C C + C + Q Y S +S + L + +
Sbjct: 148 SLQCSVPQCTQVRGLSCPTTGTAACFFNQTYGGDSSFSAMLSQDSLGL-----AVDTLPS 202
Query: 172 VVFGC-GFSTNRNFKFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLIL 226
FGC + G+ GLG G SL+SQ S FSYC S Y L L
Sbjct: 203 YSFGCVNAVSGSTLPPQGLLGLGRGPMSLLSQSGSLYSGVFSYCFPSFKS-YYFSGSLRL 261
Query: 227 GDGAIIDEGDATPLQFIDGH----YYITLEAISVDGRMLDINPNIFKRD-DSGGGVMIDS 281
G TPL + H YY+ L +SV ++ + P + D ++G G +IDS
Sbjct: 262 GPLGQPKNIRTTPL-LRNPHRPTLYYVNLTGVSVGRVLVPVAPELLAFDPNTGAGTIIDS 320
Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRG- 340
GT +T V+ Y A+RDE +++G ++ C+ + D+ P V FHF G
Sbjct: 321 GTVITRFVEPVYAAIRDEFRKQVKGPFATIGAF--DTCFAAT-NEDIA--PPVTFHFTGM 375
Query: 341 GAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
KL LE +++ + C+A+ A+ NN ++I + QQ + +D+ +
Sbjct: 376 DLKLPLE-NTLIHSSAGSLACLAMA-AAPNNVNSVLNVIANLQQQNLRIMFDVTNSRLGI 433
Query: 401 QRMDCE 406
R C
Sbjct: 434 ARELCN 439
>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
Length = 451
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 102/373 (27%), Positives = 165/373 (44%), Gaps = 37/373 (9%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
L+Y + +G PP + +DTGS +LWV C C C G F P S + + +
Sbjct: 89 LYYTRLQLGSPPRDFYVQIDTGSDVLWVSCSSCNGCPVSSGLHIPLNFFDPGSSPTASLI 148
Query: 115 PCDSEHCRYFPYAR---CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
C + C + C+A ++C YT Y G TS + ++ L F ++
Sbjct: 149 SCSDQRCSLGLQSSDSVCAAQNNQCGYTFQYGDGSGTSGYYVSDLLHFDTILGGSVMKNS 208
Query: 172 ---VVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDP 217
+VFGC G T + GIFG G S++SQL S FS+C L
Sbjct: 209 SAPIVFGCSTLQTGDLTKPDRAVDGIFGFGQQDMSVISQLASQGITPRVFSHC---LKGD 265
Query: 218 DYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
D L+LG+ I++ TPL HY + L++I V+G+ L I+P++F S G
Sbjct: 266 DSGGGILVLGE--IVEPNIVYTPLVPSQPHYNLNLQSIYVNGQTLAIDPSVFAT-SSNQG 322
Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG-FPTVR 335
+IDSGT + +L + AY+ + + + Y CY + SS + FP V
Sbjct: 323 TIIDSGTTLAYLTEAAYDPFISAITSTVS-PSVSPYLSKGNQCY--LTSSSINDVFPQVS 379
Query: 336 FHFRGGAKLAL-EKDSMFYQPRPDAFCM-AVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
+F GG + L +D + Q + + V I + + +++G + + YDI
Sbjct: 380 LNFAGGTSMILIPQDYLIQQSSINGAALWCVGFQKIQGQEI--TILGDLVLKDKIFVYDI 437
Query: 394 GRKQKTFQRMDCE 406
++ + DC+
Sbjct: 438 AGQRIGWANYDCK 450
>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 488
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 100/397 (25%), Positives = 165/397 (41%), Gaps = 42/397 (10%)
Query: 40 IRSHNTYQ-AQILPSNDI---------SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC 89
+R+H+ ++ +++L + DI S L++ I +G P +DTGS +LWV+C
Sbjct: 54 LRAHDVHRHSRLLSAIDIPLGGDSQPESIGLYFAKIGLGTPSRDFHVQVDTGSDILWVNC 113
Query: 90 YPCRDCSPQLGTI----FYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIG 145
C C + + + SS+ V C C Y C Y +Y G
Sbjct: 114 AGCIRCPRKSDLVELTPYDVDASSTAKSVSCSDNFCSYVNQRSECHSGSTCQYVIMYGDG 173
Query: 146 PETSVFVSTE----QLTFKNTDESTIHVQDVVFGCGFSTNRNFKFS-----GIFGLGIGR 196
T+ ++ + L N + + ++FGCG + S GI G G
Sbjct: 174 SSTNGYLVKDVVHLDLVTGNRQTGSTN-GTIIFGCGSKQSGQLGESQAAVDGIMGFGQSN 232
Query: 197 SSLVSQLNS------SFSYCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFIDGHYYI 249
SS +SQL S SF++C+ D + I G ++ + TP+ HY +
Sbjct: 233 SSFISQLASQGKVKRSFAHCL------DNNNGGGIFAIGEVVSPKVKTTPMLSKSAHYSV 286
Query: 250 TLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQM 309
L AI V +L+++ N F D GV+IDSGT + +L Y L +E++ +
Sbjct: 287 NLNAIEVGNSVLELSSNAFDSGDD-KGVIIDSGTTLVYLPDAVYNPLLNEILASHPELTL 345
Query: 310 RSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASI 369
+ C+H + L FPTV F F LA+ +Q R D +C +
Sbjct: 346 HTVQ-ESFTCFH--YTDKLDRFPTVTFQFDKSVSLAVYPREYLFQVREDTWCFGWQNGGL 402
Query: 370 NNR-YVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ + +++G MA V YDI + + +C
Sbjct: 403 QTKGGASLTILGDMALSNKLVVYDIENQVIGWTNHNC 439
>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 95/313 (30%), Positives = 145/313 (46%), Gaps = 31/313 (9%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
L++ + +G PP +DTGS +LWV C C +C G F S SS+ V
Sbjct: 65 LYFTKVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGQV 124
Query: 115 PCDSEHCR---YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT-DESTIHVQ 170
C C +CS+ +C YT Y G TS + ++ L F +S I
Sbjct: 125 RCSDPICTSAVQTTATQCSSQTDQCSYTFQYGDGSGTSGYYVSDTLYFDAILGQSLIDNS 184
Query: 171 D--VVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDP 217
+VFGC G T + GIFG G G S++SQL++ FS+C L
Sbjct: 185 SALIVFGCSAYQSGDLTKTDKAVDGIFGFGQGELSVISQLSTRGITPRVFSHC---LKGD 241
Query: 218 DYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
L+LG+ I++ G +PL HY + L +I+V+G++L I+P F +S G
Sbjct: 242 GSGGGILVLGE--ILEPGIVYSPLVPSQPHYNLNLLSIAVNGQLLPIDPAAFATSNS-QG 298
Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRF 336
++DSGT + +LV EAY+ V + + + CY + +S + FP F
Sbjct: 299 TIVDSGTTLAYLVAEAYDPFVSAVN-AIVSPSVTPITSKGNQCYL-VSTSVSQMFPLASF 356
Query: 337 HFRGGAKLALEKD 349
+F GGA + L+ +
Sbjct: 357 NFAGGASMVLKPE 369
>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
gi|241946781|gb|EES19926.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
Length = 466
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 102/376 (27%), Positives = 157/376 (41%), Gaps = 39/376 (10%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHC------YPCRDCSPQLGTIFYPSRSSSYAYV 114
++V +G P P DTGS L WV C SP +F + S S+A +
Sbjct: 101 YFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAGAAAGTGAGSP--ARVFRTAASKSWAPI 158
Query: 115 PCDSEHC-RYFPY--ARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFK----------- 160
C S+ C Y P+ A CS+ C Y Y G V T+ T
Sbjct: 159 ACSSDTCTSYVPFSLANCSSPASPCAYDYRYRDGSAARGVVGTDSATIALSSGSGRGGGD 218
Query: 161 NTDESTIHVQDVVFGCGFSTN-RNFKFS-GIFGLGIGRSSLVS----QLNSSFSYCIGSL 214
++ +Q VV GC + + ++F+ S G+ LG S S + FSYC+
Sbjct: 219 SSGGRRAKLQGVVLGCAATYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDH 278
Query: 215 HDPDYLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRD 271
P + L G GA TPL + + Y +T++A+ V G LDI +++ D
Sbjct: 279 LAPRNATSYLTFGPGATAPAAQ-TPLLLDRRMTPFYAVTVDAVYVAGEALDIPADVWDVD 337
Query: 272 DSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF 331
+GG ++ DSGT +T L AY A+ + L G + P + CY+ + L+
Sbjct: 338 RNGGAIL-DSGTSLTILATPAYRAVVTALSKHLAGLPRVTMD-PFEYCYNWTDAGALE-I 394
Query: 332 PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
P + HF G A+L S P C+ V S + S+IG + QQ + +
Sbjct: 395 PKMEVHFAGSARLEPPAKSYVIDAAPGVKCIGVQEGS----WPGVSVIGNILQQEHLWEF 450
Query: 392 DIGRKQKTFQRMDCEV 407
D+ + F+ C +
Sbjct: 451 DLRDRWLRFKHTRCAL 466
>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
Length = 517
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 112/380 (29%), Positives = 174/380 (45%), Gaps = 41/380 (10%)
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
+ + + +++ +G PP MDTGS L W+ C PC DC Q+G +F P+ SSSY V
Sbjct: 146 VGSGEYLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFDQVGPVFDPAASSSYRNVT 205
Query: 116 CDSEHCRYF-----PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDE-STIHV 169
C + C P A + C Y Y T+ ++ E T T ++ V
Sbjct: 206 CGDQRCGLVAPPEPPRACRRPGEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRV 265
Query: 170 QDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKL 224
DVVFGCG F +G+ GLG G S SQL + +FSYC+ H D + +K+
Sbjct: 266 DDVVFGCGHWNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHTFSYCLVD-HGSD-VASKV 323
Query: 225 ILGDGAIIDEGDATP-LQFI---------DGHYYITLEAISVDGRMLDINPNIF---KRD 271
+ G+ + A P L + D YY+ L+ + V G +L+I+ + + + +
Sbjct: 324 VFGEDDALALAAAHPQLNYTAFAPASSPADTFYYVKLKGVLVGGELLNISSDTWGVGEGE 383
Query: 272 DSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSW-PD----KLCYHGIMSS 326
GG +IDSGT +++ V+ AY+ +R + R+ RSY PD CY+ +
Sbjct: 384 GGSGGTIIDSGTTLSYFVEPAYQVIRQAFIDRMG----RSYPLIPDFPVLSPCYN-VSGV 438
Query: 327 DLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQ 385
D P + F GA ++ F + PD C+AV + S+IG QQ
Sbjct: 439 DRPEVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAV----LGTPRTGMSIIGNFQQQ 494
Query: 386 FYNVGYDIGRKQKTFQRMDC 405
++V YD+ + F C
Sbjct: 495 NFHVVYDLKNNRLGFAPRRC 514
>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 463
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 115/427 (26%), Positives = 174/427 (40%), Gaps = 53/427 (12%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQ---AQILPSND-- 55
++H+ S N N N + V+ + +R+ + K+ H+ + A LP+
Sbjct: 69 VVHKHGPCSQLNQQNGNAPNLVEILLE-DQSRVDSIHAKLSDHSGVKETDAAKLPTKSGM 127
Query: 56 -ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYV 114
+ + V+I +G P DTGS L W C F P++S+SYA V
Sbjct: 128 SLGTGNYIVSIGLGSPKKDLMLIFDTGSDLTWARCSAAE--------TFDPTKSTSYANV 179
Query: 115 PCDSEHCRYFPYA-----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHV 169
C + C A RC+A C+Y Y G + F+ E+LT +TD
Sbjct: 180 SCSTPLCSSVISATGNPSRCAA--STCVYGIQYGDGSYSIGFLGKERLTIGSTD----IF 233
Query: 170 QDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKL 224
+ FGCG + F K +G+ GLG + S+VSQ N FSYC+ S +L
Sbjct: 234 NNFYFGCGQDVDGLFGKAAGLLGLGRDKLSVVSQTAPKYNQLFSYCLPSSSSTGFLSFGS 293
Query: 225 ILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
A + P F Y + L I+V G+ L I ++F S G +IDSGT
Sbjct: 294 SQSKSAKFTPLSSGPSSF----YNLDLTGITVGGQKLAIPLSVF----STAGTIIDSGTV 345
Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL-----CYHGIMSSDLKGFPTVRFHFR 339
VT L AY ALR + M SY L CY +K P + F
Sbjct: 346 VTRLPPAAYSALRSAFR-----KAMASYPMGKPLSILDTCYDFSKYKTIK-VPKIVISFS 399
Query: 340 GGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKT 399
GG + +++ +F C+A + N + ++ G Q+ + V YD+ +
Sbjct: 400 GGVDVDVDQAGIFVANGLKQVCLAF---AGNTGARDTAIFGNTQQRNFEVVYDVSGGKVG 456
Query: 400 FQRMDCE 406
F C
Sbjct: 457 FAPASCS 463
>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
Length = 444
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 103/383 (26%), Positives = 165/383 (43%), Gaps = 48/383 (12%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
N +++IG PP +DTGS L W+ C P +IF P S +Y +PC
Sbjct: 64 NVTLTASLTIGTPPQNITMVLDTGSELSWLRCKK----EPNFTSIFNPLASKTYTKIPCS 119
Query: 118 SEHCRYFPYARCS--AYKHRCIYTQL-YLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
S+ C+ R S C +L + I L F+ ++ VF
Sbjct: 120 SQTCK----TRTSDLTLPVTCDPAKLCHFIISYADASSVEGHLAFETFRFGSLTRPATVF 175
Query: 175 GC---GFSTN--RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGD 228
GC G S+N + K +G+ G+ G S V+Q+ FSYCI L +L +LG+
Sbjct: 176 GCMDSGSSSNTEEDAKTTGLMGMNRGSLSFVNQMGFRKFSYCISGLDSTGFL----LLGE 231
Query: 229 G--AIIDEGDATPLQFIDG--------HYYITLEAISVDGRMLDINPNIFKRDDSGGG-V 277
+ + + TPL I Y + LE I V+ ++L + ++F D +G G
Sbjct: 232 ARYSWLKPLNYTPLVQISTPLPYFDRVAYSVQLEGIKVNNKVLPLPKSVFVPDHTGAGQT 291
Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEG------EQMRSYSWPDKLCYH-GIMSSDLKG 330
M+DSGT T+L+ Y ALR E +++ G E + LCY SS L
Sbjct: 292 MVDSGTQFTFLLGPVYSALRKEFLLQTAGVLRVLNEPQYVFQGAMDLCYLIDSTSSTLPN 351
Query: 331 FPTVRFHFRGGAKLALEKDSMFY------QPRPDAFCMAVNPASINNRYVNFSLIGMMAQ 384
P V+ FR GA++++ + Y + + +C + + ++ LIG Q
Sbjct: 352 LPVVKLMFR-GAEMSVSGQRLLYRVPGEVRGKDSVWCFTFGNS--DELGISSFLIGHHQQ 408
Query: 385 QFYNVGYDIGRKQKTFQRMDCEV 407
Q + YD+ + F + C++
Sbjct: 409 QNVWMEYDLENSRIGFAELRCDL 431
>gi|159464048|ref|XP_001690254.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158284242|gb|EDP09992.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 485
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 97/368 (26%), Positives = 163/368 (44%), Gaps = 38/368 (10%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
+S FY + +G P +DTGS++ ++ C C C F P +S++ + C
Sbjct: 10 HSYFYTTLKLGTPERTFSVIIDTGSTITYIPCKDCSHCGKHTAEWFDPDKSTTAKKLACG 69
Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
C C+ RC Y++ Y + ++ + F ++D +VFGC
Sbjct: 70 DPLCN-CGTPSCTCNNDRCYYSRTYAERSSSEGWMIEDTFGFPDSDSPV----RLVFGCE 124
Query: 177 GFSTNRNFK--FSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLILGD 228
T ++ GI G+G ++ SQL FS C G D L+LGD
Sbjct: 125 NGETGEIYRQMADGIMGMGNNHNAFQSQLVQRKVIEDVFSLCFGYPKD-----GILLLGD 179
Query: 229 GAIIDEGDA--TP-LQFIDGHYY-ITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
+ + + TP L + HYY + ++ I+V+G+ L + ++F R G G ++DSGT
Sbjct: 180 VTLPEGANTVYTPLLTHLHLHYYNVKMDGITVNGQTLAFDASVFDR---GYGTVLDSGTT 236
Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD----KLCYHGIMS--SDL-KGFPTVRFH 337
T+L +A++A+ V +E + ++S D +C+ G DL K FP F
Sbjct: 237 FTYLPTDAFKAMAKAVGDYVEKKGLQSTPGADPQYNDICWKGAPDQFKDLDKYFPPAEFV 296
Query: 338 FRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
F GGAKL L + +P +C+ I + + +L+G ++ + V YD +
Sbjct: 297 FGGGAKLTLPPLRYLFLSKPAEYCLG-----IFDNGNSGALVGGVSVRDVVVTYDRRNSK 351
Query: 398 KTFQRMDC 405
F M C
Sbjct: 352 VGFTTMAC 359
>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 477
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 103/369 (27%), Positives = 151/369 (40%), Gaps = 52/369 (14%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSYAYVPCDSE 119
F V + G P +DTGS L W+ C PC C Q F P++SSSYA VPC +
Sbjct: 137 FVVVVGFGTPAQTAAIILDTGSDLSWIQCKPCSGHCYRQHDPDFDPAKSSSYAAVPCGTP 196
Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
C A C+Y Y G T+ +S + LTF ++ + T FGCG
Sbjct: 197 VCAA---AGGMCNGTTCLYGVQYGDGSSTTGVLSRDTLTFNSSSKFT----GFTFGCGEK 249
Query: 180 TNRNF-----KFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
+F G S FSYC+ P Y L GA
Sbjct: 250 NIGDFGEVDGLLGLGRGKLSLPSQAAPSFGGVFSYCL-----PSYNTTPGYLNIGAT-KP 303
Query: 235 GDATPLQFI--------DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVT 286
P+Q+ Y+I L +I++ G +L + P++F + G ++DSGT +T
Sbjct: 304 TSTVPVQYTAMIKKPQYPSFYFIELVSINIGGYILPVPPSVFTKT----GTLLDSGTILT 359
Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG-----FPTVRFHFRGG 341
+L AY +LRD ++G + P CY D G P V F+F G
Sbjct: 360 YLPPPAYTSLRDRFKFTMQGNKPAPPYEPLDTCY------DFTGQGAIVIPAVSFNFSDG 413
Query: 342 AKLALEKDSMFYQP---RPDAFCMAV--NPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
A L+ + P +P C+A PA++ FS++G Q+ V YD+ +
Sbjct: 414 AVFDLDFYGIMIFPDDAKPLIGCLAFVSRPAAM-----PFSIVGNTQQRAAEVIYDVPSQ 468
Query: 397 QKTFQRMDC 405
+ F + C
Sbjct: 469 KIGFIPISC 477
>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 634
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 101/377 (26%), Positives = 162/377 (42%), Gaps = 54/377 (14%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
N + + IG PP +DTGS++ +V C C C F P SS+Y V C
Sbjct: 81 NGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPESSSTYQPVKCT 140
Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
+ C + + +C+Y + Y +S + + ++F N +S + Q VFGC
Sbjct: 141 ID-------CNCDSDRMQCVYERQYAEMSTSSGVLGEDLISFGN--QSELAPQRAVFGCE 191
Query: 178 FSTNRNF---KFSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLILGD 228
+ GI GLG G S++ QL + SFS C G + +G
Sbjct: 192 NVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVISDSFSLCYGGMD----------VGG 241
Query: 229 GAIIDEGDATPLQFI--------DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
GA++ G + P +Y I L+ I V G+ L +N N+F D G ++D
Sbjct: 242 GAMVLGGISPPSDMAFAYSDPVRSPYYNIDLKEIHVAGKRLPLNANVF---DGKHGTVLD 298
Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD----KLCYHGI---MSSDLKGFPT 333
SGT +L + A+ A +D ++ L + ++ S PD +C+ G +S K FP
Sbjct: 299 SGTTYAYLPEAAFLAFKDAIVKEL--QSLKKISGPDPNYNDICFSGAGIDVSQLSKSFPV 356
Query: 334 VRFHFRGGAKLALEKDS-MFYQPR-PDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
V F G K L ++ MF + A+C+ V N +L+G + + V Y
Sbjct: 357 VDMVFENGQKYTLSPENYMFRHSKVRGAYCLGV----FQNGNDQTTLLGGIIVRNTLVVY 412
Query: 392 DIGRKQKTFQRMDCEVL 408
D + + F + +C L
Sbjct: 413 DREQTKIGFWKTNCAEL 429
>gi|449446233|ref|XP_004140876.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 498
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 94/371 (25%), Positives = 164/371 (44%), Gaps = 36/371 (9%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLGTIFYP---SRSSSYAYV 114
L+Y I IG P + +DTGS ++WV+C CR+C + LG P S++ V
Sbjct: 86 LYYAKIGIGTPSKDYYVQVDTGSDIVWVNCIQCRECPRTSSLGMELTPYDLEESTTGKLV 145
Query: 115 PCDSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD---ESTIH 168
CD + C P + C+ C Y Q+Y G T+ + + + + E+T
Sbjct: 146 SCDEQFCLEVNGGPLSGCTT-NMSCPYLQIYGDGSSTAGYFVKDYVQYNRVSGDLETTAA 204
Query: 169 VQDVVFGCG------FSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHD 216
+ FGCG ++ GI G G SS++SQL S+ F++C+
Sbjct: 205 NGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCL----- 259
Query: 217 PDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
D + I G ++ + + TPL HY + + + V +L+I+ ++F+ D
Sbjct: 260 -DGTNGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVGHIILNISADVFEAGDR-K 317
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
G +IDSGT + +L + YE L +++ + ++++ K C+ D GFP V
Sbjct: 318 GTIIDSGTTLAYLPELIYEPLVAKILSQQHNLEVQTIHGEYK-CFQYSERVD-DGFPPVI 375
Query: 336 FHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR-YVNFSLIGMMAQQFYNVGYDIG 394
FHF L + +Q + +C+ + + +R N +L G + V YD+
Sbjct: 376 FHFENSLLLKVYPHEYLFQ-YENLWCIGWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDLE 434
Query: 395 RKQKTFQRMDC 405
+ + +C
Sbjct: 435 NQTIGWTEYNC 445
>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 752
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 105/384 (27%), Positives = 164/384 (42%), Gaps = 51/384 (13%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+++++ IG PP +DTGS L W+ C PC DC Q G + P S S+ + C+
Sbjct: 196 YFIDVFIGSPPKHFSLILDTGSDLNWIQCVPCFDCFEQNGPYYDPKDSISFRNITCNDPR 255
Query: 121 CRYF----PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDEST-----IHVQD 171
C+ P C C Y Y T+ + E T T +T V++
Sbjct: 256 CQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRVEN 315
Query: 172 VVFGCGFSTNRNFKFSGIF-------GLGIGRSSLVSQLNS----SFSYCIGSLHDPDYL 220
V+FGCG NR G+F GLG G S SQL S SFSYC+ +
Sbjct: 316 VMFGCG-HWNR-----GLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRDSDTSV 369
Query: 221 HNKLILGDG-----------AIIDEGDATPLQFIDGHYYITLEAISVDGRMLDI-NPNIF 268
+KLI G+ + G P +D YY+ +++I V G L I N
Sbjct: 370 SSKLIFGEDKDLLTHPELNFTSLIAGKENP---VDTFYYLQIKSIFVGGEKLQIPEENWN 426
Query: 269 KRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDL 328
D GG +IDSGT +++ AY +++ + +++G ++ CY+ + +D
Sbjct: 427 LSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYN-VSGTDE 485
Query: 329 KGFPTVRFHFRGGAKLALEKDSMFYQ-PRPDAFCMAV--NPASINNRYVNFSLIGMMAQQ 385
FP F GA ++ F + + D C+A+ P S S+IG QQ
Sbjct: 486 LNFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSA------LSIIGNYQQQ 539
Query: 386 FYNVGYDIGRKQKTFQRMDCEVLD 409
+++ YD + + M C ++
Sbjct: 540 NFHILYDTKNSRLGYAPMRCAEIE 563
>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
Length = 428
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 97/384 (25%), Positives = 163/384 (42%), Gaps = 43/384 (11%)
Query: 35 YLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD 94
Y+ K +T Q+ + SL+ +++ +G P Q +DTGSS WV C C
Sbjct: 56 YITNKTSRLSTKAVQVGWDRGLQTSLYVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDG 114
Query: 95 CSPQLGTIFYPSRSSSYAYVPCDSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVF 151
C T F SRS++ A V C + C P+ + S C + Y G +
Sbjct: 115 CHTNPRT-FLQSRSTTCAKVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGI 173
Query: 152 VSTEQLTFKNTDESTIHVQDVVFGC---GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS-- 206
+ + LTF + + + FGC F N G+ G+G G S++ Q + +
Sbjct: 174 LYQDTLTFSDVQK----IPGFSFGCNMDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFD 229
Query: 207 -FSYCIGSLHDPDYLHNKLI--LGDGAIIDEGDATPLQFIDGH-----YYITLEAISVDG 258
FSYC+ +K G + D + + +++ L AISVDG
Sbjct: 230 CFSYCLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDG 289
Query: 259 RMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRD---EVMIRLEGEQMRSYSWP 315
L ++P++F R GV+ DSG++++++ A L E++++ + S
Sbjct: 290 ERLGLSPSVFSRK----GVVFDSGSELSYIPDRALSVLSQRIRELLLKRGAAEEES---- 341
Query: 316 DKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQ---PRPDAFCMAVNPASINNR 372
++ CY + S D P + HF GA+ L +F + D +C+A P
Sbjct: 342 ERNCYD-MRSVDEGDMPAISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTE---- 396
Query: 373 YVNFSLIGMMAQQFYNVGYDIGRK 396
+ S+IG + Q V YD+ R+
Sbjct: 397 --SVSIIGSLMQTSKEVVYDLKRQ 418
>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
Length = 393
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 108/400 (27%), Positives = 169/400 (42%), Gaps = 49/400 (12%)
Query: 31 ARLAYLQEKIRSHNTYQAQILPSNDISNSL------FYVNISIGQPPVPQFTAMDTGSSL 84
AR+ ++ R++++ + + + D+ + L + ++IS+G P DTGS L
Sbjct: 21 ARVRWMAA--RANSSSWSSMAGTTDVESPLHPDGGGYVMDISVGTPGKRFRAIADTGSDL 78
Query: 85 LWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLI 144
+WV PC CS GTIF P +SS++ + C S+ C P C C Y+ Y
Sbjct: 79 VWVQSEPCTGCSG--GTIFDPRQSSTFREMDCSSQLCTELP-GSCEPGSSACSYSYEYGS 135
Query: 145 GPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQL- 203
G ET + + ++ T + GCG + G+ GLG G SL SQL
Sbjct: 136 G-ETEGEFARDTISLGTTSGGSQKFPSFAVGCGMVNSGFDGVDGLVGLGQGPVSLTSQLS 194
Query: 204 ---NSSFSYCI-----GSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAIS 255
+S FSYC+ S P L I P +Y +T+ I+
Sbjct: 195 AAIDSKFSYCLVDINSQSESSPLLFGPSAALHGTGIQSTKITPPSDTYPTYYLLTVNGIA 254
Query: 256 VDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEAL--RDEVMI---RLEGEQMR 310
V G+ + S G +IDSGT +T++ Y + R E M+ R++G M
Sbjct: 255 VAGQTM----------GSPGTTIIDSGTTLTYVPSGVYGRVLSRMESMVTLPRVDGSSMG 304
Query: 311 SYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFY--QPRPDAFCMAVNPAS 368
LCY + + K FP + GA + + F D C+A+ A
Sbjct: 305 L-----DLCYDRSSNRNYK-FPALTIRL-AGATMTPPSSNYFLVVDDSGDTVCLAMGSAG 357
Query: 369 INNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
+ S+IG + QQ Y++ YD G + +F + CE L
Sbjct: 358 ----GLPVSIIGNVMQQGYHILYDRGSSELSFVQAKCESL 393
>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 488
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 100/398 (25%), Positives = 168/398 (42%), Gaps = 44/398 (11%)
Query: 40 IRSHNTYQ-AQILPSNDI---------SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC 89
+R+H+ ++ +++L + D+ S L++ I +G P +DTGS +LWV+C
Sbjct: 54 LRAHDVHRHSRLLSAIDLPLGGDSQPESIGLYFAKIGLGTPSRDFHVQVDTGSDILWVNC 113
Query: 90 YPCRDCSPQLGTI----FYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIG 145
C C + + + SS+ V C C Y C Y LY G
Sbjct: 114 AGCIRCPRKSDLVELTPYDADASSTAKSVSCSDNFCSYVNQRSECHSGSTCQYVILYGDG 173
Query: 146 PETSVFVSTE----QLTFKNTDESTIHVQDVVFGCGFSTNRNFKFS-----GIFGLGIGR 196
T+ ++ + L N + + ++FGCG + S GI G G
Sbjct: 174 SSTNGYLVRDVVHLDLVTGNRQTGSTN-GTIIFGCGSKQSGQLGESQAAVDGIMGFGQSN 232
Query: 197 SSLVSQLNS------SFSYCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFIDGHYYI 249
SS +SQL S SF++C+ D + I G ++ + TP+ HY +
Sbjct: 233 SSFISQLASQGKVKRSFAHCL------DNNNGGGIFAIGEVVSPKVKTTPMLSKSAHYSV 286
Query: 250 TLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQM 309
L AI V +L ++ + F D GV+IDSGT + +L Y L ++++ +++
Sbjct: 287 NLNAIEVGNSVLQLSSDAFDSGDD-KGVIIDSGTTLVYLPDAVYNPLMNQILA--SHQEL 343
Query: 310 RSYSWPDKL-CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPAS 368
++ D C+H I L FPTV F F LA+ +Q R D +C
Sbjct: 344 NLHTVQDSFTCFHYI--DRLDRFPTVTFQFDKSVSLAVYPQEYLFQVREDTWCFGWQNGG 401
Query: 369 INNR-YVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ + + +++G MA V YDI + + +C
Sbjct: 402 LQTKGGASLTILGDMALSNKLVVYDIENQVIGWTNHNC 439
>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 757
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 105/384 (27%), Positives = 164/384 (42%), Gaps = 51/384 (13%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+++++ IG PP +DTGS L W+ C PC DC Q G + P S S+ + C+
Sbjct: 196 YFIDVFIGSPPKHFSLILDTGSDLNWIQCVPCFDCFEQNGPYYDPKDSISFRNITCNDPR 255
Query: 121 CRYF----PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDEST-----IHVQD 171
C+ P C C Y Y T+ + E T T +T V++
Sbjct: 256 CQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRVEN 315
Query: 172 VVFGCGFSTNRNFKFSGIF-------GLGIGRSSLVSQLNS----SFSYCIGSLHDPDYL 220
V+FGCG NR G+F GLG G S SQL S SFSYC+ +
Sbjct: 316 VMFGCG-HWNR-----GLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRDSDTSV 369
Query: 221 HNKLILGDG-----------AIIDEGDATPLQFIDGHYYITLEAISVDGRMLDI-NPNIF 268
+KLI G+ + G P +D YY+ +++I V G L I N
Sbjct: 370 SSKLIFGEDKDLLTHPELNFTSLIAGKENP---VDTFYYLQIKSIFVGGEKLQIPEENWN 426
Query: 269 KRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDL 328
D GG +IDSGT +++ AY +++ + +++G ++ CY+ + +D
Sbjct: 427 LSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYN-VSGTDE 485
Query: 329 KGFPTVRFHFRGGAKLALEKDSMFYQ-PRPDAFCMAV--NPASINNRYVNFSLIGMMAQQ 385
FP F GA ++ F + + D C+A+ P S S+IG QQ
Sbjct: 486 LNFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSA------LSIIGNYQQQ 539
Query: 386 FYNVGYDIGRKQKTFQRMDCEVLD 409
+++ YD + + M C ++
Sbjct: 540 NFHILYDTKNSRLGYAPMRCAEIE 563
>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 104/370 (28%), Positives = 155/370 (41%), Gaps = 58/370 (15%)
Query: 67 IGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPY 126
IG PP +DTGS++ +V C C C F P S +Y V C+ P
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCNSCDQCGNHQDPKFQPDLSDTYHPVKCN-------PD 54
Query: 127 ARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF 186
C +C Y + Y +S + + ++F N E + Q VFGC + + F
Sbjct: 55 CTCDTENDQCTYERQYAEMSSSSGILGEDLVSFGNMSE--LKPQRAVFGCENAETGDL-F 111
Query: 187 S----GIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGD 236
S GI GLG G S+V QL N SFS C G + +G GA++
Sbjct: 112 SQHADGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGME----------VGGGAMVLGQI 161
Query: 237 ATPLQFIDGH--------YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
+ P + H Y I L + V G+ LDINP +F D G ++DSGT +L
Sbjct: 162 SPPSDMVFSHSDPDRSPYYNIELRGLHVAGKKLDINPQVF---DGKHGTILDSGTTYAYL 218
Query: 289 VKEAYEALRDEVMIRLEG-EQMRSYSWPDK----LCYHGIMSSD---LKGFPTVRFHFRG 340
+ A+ + L G +Q+R PD +C+ G S K FP+V F
Sbjct: 219 PEAAFLPFIQAITSELHGLKQIRG---PDPNYNDVCFSGAGSEIPELYKTFPSVDMVFDN 275
Query: 341 GAKLALEKDSMFYQPRP--DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQK 398
G K +L ++ ++ A+C+ V N +L+G + + V YD +
Sbjct: 276 GEKYSLSPENYLFKHSKVHGAYCLGV----FQNGKDPTTLLGGIVVRNTLVTYDREHSKV 331
Query: 399 TFQRMDCEVL 408
F + +C VL
Sbjct: 332 GFWKTNCSVL 341
>gi|343172996|gb|AEL99201.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 104/370 (28%), Positives = 155/370 (41%), Gaps = 58/370 (15%)
Query: 67 IGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPY 126
IG PP +DTGS++ +V C C C F P S +Y V C+ P
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCNSCDQCGNHQDPKFQPDLSDTYHPVKCN-------PD 54
Query: 127 ARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF 186
C +C Y + Y +S + + ++F N E + Q VFGC + + F
Sbjct: 55 CTCDTENDQCTYERQYAEMSSSSGILGEDLVSFGNMSE--LKPQRAVFGCENAETGDL-F 111
Query: 187 S----GIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGD 236
S GI GLG G S+V QL N SFS C G + +G GA++
Sbjct: 112 SQHADGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGME----------VGGGAMVLGQI 161
Query: 237 ATPLQFIDGH--------YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
+ P + H Y I L + V G+ LDINP +F D G ++DSGT +L
Sbjct: 162 SPPSDMVFSHSDPDRSPYYNIELRGLHVAGKKLDINPQVF---DGKHGTILDSGTTYAYL 218
Query: 289 VKEAYEALRDEVMIRLEG-EQMRSYSWPDK----LCYHGIMSSD---LKGFPTVRFHFRG 340
+ A+ + L G +Q+R PD +C+ G S K FP+V F
Sbjct: 219 PEAAFLPFIQAITSELHGLKQIRG---PDPNYNDVCFSGAGSEIPELYKTFPSVDMVFDN 275
Query: 341 GAKLALEKDSMFYQPRP--DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQK 398
G K +L ++ ++ A+C+ V N +L+G + + V YD +
Sbjct: 276 GEKYSLSPENYLFKHSKVHGAYCLGV----FQNGKDPTTLLGGIVVRNTLVTYDREHSKV 331
Query: 399 TFQRMDCEVL 408
F + +C VL
Sbjct: 332 GFWKTNCSVL 341
>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
gi|223973065|gb|ACN30720.1| unknown [Zea mays]
Length = 631
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 102/380 (26%), Positives = 169/380 (44%), Gaps = 56/380 (14%)
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
++N + + IG PP +D+GS++ +V C C C F P SSSY+ V
Sbjct: 83 LTNGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCSSCEQCGNHQDPRFQPDLSSSYSPVK 142
Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
C+ + C + K +C Y + Y +S + + ++F ES + Q +FG
Sbjct: 143 CNVD-------CTCDSDKKQCTYERQYAEMSSSSGVLGEDIVSFGR--ESELKPQHAIFG 193
Query: 176 CGFSTNRNFKFS----GIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLI 225
C S + FS GI GLG G+ S++ QL + SFS C G +
Sbjct: 194 CENSETGDL-FSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMD---------- 242
Query: 226 LGDGAIIDEGDATPLQFI--------DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
+G GA++ G P I +Y I L+ I V G+ L + IF +S G
Sbjct: 243 IGGGAMVLGGMLAPPDMIFSNSDPLRSPYYNIELKEIHVAGKALRVESRIF---NSKHGT 299
Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK----LCYHGI---MSSDLKG 330
++DSGT +L ++A+ A ++ V ++ ++ PD +C+ G +S +
Sbjct: 300 VLDSGTTYAYLPEQAFVAFKEAVTSKV--HSLKKIRGPDPSYKDICFAGAGRNVSKLHEV 357
Query: 331 FPTVRFHFRGGAKLALEKDS-MFYQPRPD-AFCMAVNPASINNRYVNFSLIGMMAQQFYN 388
FP V F G KL+L ++ +F + D A+C+ V N +L+G + +
Sbjct: 358 FPDVDMVFGNGQKLSLTPENYLFRHSKVDGAYCLGV----FQNGKDPTTLLGGIIVRNTL 413
Query: 389 VGYDIGRKQKTFQRMDCEVL 408
V YD ++ F + +C L
Sbjct: 414 VTYDRHNEKIGFWKTNCSEL 433
>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
sylvestris]
Length = 502
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 106/377 (28%), Positives = 148/377 (39%), Gaps = 59/377 (15%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYVPCDSE 119
+ VN+ +G P DTGS L W C PC + C Q IF PS S +Y+ + C S
Sbjct: 154 YIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSASKTYSNISCTST 213
Query: 120 HCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
C A CS+ C+Y Y T F + + LT D +F
Sbjct: 214 ACSGLKSATGNSPGCSS--SNCVYGIQYGDSSFTVGFFAKDTLTLTQNDV----FDGFMF 267
Query: 175 GCGFSTNRNF--KFSGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGD 228
GCG NR K +G+ GLG S+V Q FSYC+ + + L G+
Sbjct: 268 GCG-QNNRGLFGKTAGLIGLGRDPLSIVQQTAQKFGKYFSYCLPTSRGSN---GHLTFGN 323
Query: 229 GAIIDEGDA-------TPLQFIDGH--YYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
G + A TP G Y+I + ISV G+ L I+P +F+ G +I
Sbjct: 324 GNGVKTSKAVKNGITFTPFASSQGATFYFIDVLGISVGGKALSISPMLFQN----AGTII 379
Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL-----CYHGIMSSDLKGF--- 331
DSGT +T L Y +L+ + M Y L CY DL +
Sbjct: 380 DSGTVITRLPSTVYGSLKSTFK-----QFMSKYPTAPALSLLDTCY------DLSNYTSI 428
Query: 332 --PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNV 389
P + F+F G A + LE + + C+A + N + G + QQ V
Sbjct: 429 SIPKISFNFNGNANVDLEPNGILITNGASQVCLAF---AGNGDDDTIGIFGNIQQQTLEV 485
Query: 390 GYDIGRKQKTFQRMDCE 406
YD+ Q F C
Sbjct: 486 VYDVAGGQLGFGYKGCS 502
>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
Length = 333
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 99/357 (27%), Positives = 155/357 (43%), Gaps = 39/357 (10%)
Query: 65 ISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DCSPQLGTIFYPSRSSSYAYVPCDSEHCRY 123
+ +G P +DTGSSL W+ C PC C Q G +F P SS+YA V C ++ C
Sbjct: 1 MGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCSAQQCSD 60
Query: 124 FPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
P A CS+ + CIY Y + ++S + ++F +T + + +GCG
Sbjct: 61 LPSATLNPSACSS-SNVCIYQASYGDSSFSVGYLSKDTVSFGSTS-----LPNFYYGCGQ 114
Query: 179 STNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILGDGAIID 233
F + +G+ GL + SL+ QL SF+YC+ P + +
Sbjct: 115 DNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFTYCL-----PSSSSSGYLSLGSYNPG 169
Query: 234 EGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVK 290
+ TP+ D Y+I L ++V G L ++ + + + +IDSGT +T L
Sbjct: 170 QYSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLPT----IIDSGTVITRLPT 225
Query: 291 EAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKD 349
Y AL V ++G + +YS D C+ G S P V F GGA L L
Sbjct: 226 SVYSALSKAVAAAMKGTSRASAYSILDT-CFKGQASR--VSAPAVTMSFAGGAALKLSAQ 282
Query: 350 SMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
++ C+A PA + ++IG QQ ++V YD+ + F C
Sbjct: 283 NLLVDVDDSTTCLAFAPAR------SAAIIGNTQQQTFSVVYDVKSSRIGFAAGGCS 333
>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
Length = 631
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 108/425 (25%), Positives = 178/425 (41%), Gaps = 69/425 (16%)
Query: 11 YNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQP 70
Y N PA +RG+ HN L + ++N + + IG P
Sbjct: 54 YPNATRLPASSARRGLG-------------DGHNPNARMRLHDDLLTNGYYTTRLYIGTP 100
Query: 71 PVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCS 130
+D+GS++ +V C C C F P SS+Y+ V C+ + C
Sbjct: 101 SQEFALIVDSGSTVTYVPCATCEQCGNHQDPRFQPDLSSTYSPVKCNVD-------CTCD 153
Query: 131 AYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFS--- 187
+ +C Y + Y +S + + ++F ES + Q VFGC +T FS
Sbjct: 154 NERSQCTYERQYAEMSSSSGVLGEDIMSFGK--ESELKPQRAVFGCE-NTETGDLFSQHA 210
Query: 188 -GIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPL 240
GI GLG G+ S++ QL + SFS C G + +G G ++ G P
Sbjct: 211 DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMD----------VGGGTMVLGGMPAPP 260
Query: 241 QFIDGH--------YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEA 292
+ H Y I L+ I V G+ L ++P IF +S G ++DSGT +L ++A
Sbjct: 261 DMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIF---NSKHGTVLDSGTTYAYLPEQA 317
Query: 293 YEALRDEVMIRLEGEQMRSYSWPDK----LCYHGI---MSSDLKGFPTVRFHFRGGAKLA 345
+ A +D V ++ ++ PD +C+ G +S + FP V F G KL+
Sbjct: 318 FVAFKDAVTNKV--NSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVDMVFGNGQKLS 375
Query: 346 LEKDSMFYQ--PRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
L ++ ++ A+C+ V N +L+G + + V YD ++ F +
Sbjct: 376 LSPENYLFRHSKVEGAYCLGV----FQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKT 431
Query: 404 DCEVL 408
+C L
Sbjct: 432 NCSEL 436
>gi|224090744|ref|XP_002309070.1| predicted protein [Populus trichocarpa]
gi|222855046|gb|EEE92593.1| predicted protein [Populus trichocarpa]
Length = 404
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 102/352 (28%), Positives = 153/352 (43%), Gaps = 64/352 (18%)
Query: 47 QAQILPSNDIS----------NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC-----YP 91
+ Q++PS + N V++++G PP +DTGS L W+HC YP
Sbjct: 7 KTQVIPSGSVPRSPNKPPFHHNVSLIVSLTVGTPPQNVSMVIDTGSELSWLHCNKTLSYP 66
Query: 92 CRDCSPQLGTIFYPSRSSSYAYVPCDSEHC----RYFPYARCSAYKHRCIYTQLYLIGPE 147
T F P+RS+SY +PC S C + FP + C T Y
Sbjct: 67 ---------TTFDPTRSTSYQTIPCSSPTCTNRTQDFPIPASCDSNNLCHATLSYADASS 117
Query: 148 TSVFVSTEQLTFKNTDESTIHVQDVVFGCG---FSTN--RNFKFSGIFGLGIGRSSLVSQ 202
+ ++++ ++D + +VFGC FS+N + K +G+ G+ G S VSQ
Sbjct: 118 SDGNLASDVFHIGSSD-----ISGLVFGCMDSVFSSNSDEDSKSTGLMGMNRGSLSFVSQ 172
Query: 203 LN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGD---------ATPLQFIDG-HYYITL 251
L FSYCI L+LG+ + +TPL + D Y + L
Sbjct: 173 LGFPKFSYCISGTD----FSGLLLLGESNLTWSVPLNYTPLIQISTPLPYFDRVAYTVQL 228
Query: 252 EAISVDGRMLDINPNIFKRDDSGGG-VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMR 310
E I V ++L I + F+ D +G G M+DSGT T+L+ Y ALR + +R
Sbjct: 229 EGIKVLDKLLPIPKSTFEPDHTGAGQTMVDSGTQFTFLLGPVYNALR-SAFLNQTSSVLR 287
Query: 311 SYSWPD-------KLCYHGIMSSD-LKGFPTVRFHFRGGAKLALEKDSMFYQ 354
PD LCY +S L PTV FR GA++ + D + Y+
Sbjct: 288 VLEDPDFVFQGAMDLCYLVPLSQRVLPLLPTVTLVFR-GAEMTVSGDRVLYR 338
>gi|357502759|ref|XP_003621668.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355496683|gb|AES77886.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 481
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 98/376 (26%), Positives = 172/376 (45%), Gaps = 38/376 (10%)
Query: 57 SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ--LG---TIFYPSRSSSY 111
S L+Y I IG P + +DTG+ ++WV+C C++C + LG T++ SSS
Sbjct: 69 SVGLYYAKIGIGTPSKDYYLQVDTGTDMMWVNCIQCKECPTRSNLGMDLTLYNIKESSSG 128
Query: 112 AYVPCDSEHCRYFP---YARCSAYKH-RCIYTQLYLIGPETSVFVSTEQLTFKNT--DES 165
VPCD E C+ C++ + C Y ++Y G T+ + + + F D
Sbjct: 129 KLVPCDQELCKEINGGLLTGCTSKTNDSCPYLEIYGDGSSTAGYFVKDVVLFDQVSGDLK 188
Query: 166 TIHVQ-DVVFGCGFSTNRNFKFS------GIFGLGIGRSSLVSQLNSS------FSYCIG 212
T V+FGCG + + +S GI G G S++SQL+SS F++C+
Sbjct: 189 TASANGSVIFGCGARQSGDLSYSNEEALDGILGFGKANYSMISQLSSSGKVKKMFAHCLN 248
Query: 213 SLHDPDYLHNKLILGDGAIIDEG-DATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRD 271
++ I G ++ + TPL HY + + AI V L+++ + ++
Sbjct: 249 GVNGGG------IFAIGHVVQPTVNTTPLLPDQPHYSVNMTAIQVGHTFLNLSTDASEQR 302
Query: 272 DSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF 331
DS G +IDSGT + +L Y+ L +++ + ++++ + C+ S D GF
Sbjct: 303 DS-KGTIIDSGTTLAYLPDGIYQPLVYKILSQQPNLKVQTLH-DEYTCFQYSGSVD-DGF 359
Query: 332 PTVRFHFRGGAKLAL-EKDSMFYQPRPDAFCMAVNPASINNR-YVNFSLIGMMAQQFYNV 389
P V F+F G L + D +F + +C+ + +R N +L+G + V
Sbjct: 360 PNVTFYFENGLSLKVYPHDYLFLS--ENLWCIGWQNSGAQSRDSKNMTLLGDLVLSNKLV 417
Query: 390 GYDIGRKQKTFQRMDC 405
YD+ + + +C
Sbjct: 418 FYDLENQVIGWTEYNC 433
>gi|125553832|gb|EAY99437.1| hypothetical protein OsI_21406 [Oryza sativa Indica Group]
Length = 409
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 102/358 (28%), Positives = 154/358 (43%), Gaps = 43/358 (12%)
Query: 68 GQPPVPQFTAMDTGSSLLWVHCYPCR--DCSPQLGTIFYPSRSSSYAYVPCDSEHC-RYF 124
G V Q +D+GS + WV C PC C PQ +F P+ S++YA VPC S C R
Sbjct: 75 GTSAVSQTVIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYAAVPCSSAACARLG 134
Query: 125 PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS---TN 181
PY R +C + Y G + S++ LT D V+ +FGC + +
Sbjct: 135 PYRRGCLANSQCQFGITYANGATATGTYSSDDLTLGPYDV----VRGFLFGCAHADQGST 190
Query: 182 RNFKFSGIFGLGIGRSSLVSQLNSS----FSYCI-GSLHDPDYLHNKLILGDGAIIDEGD 236
++ +G LG G S V Q S FSYC+ S ++ + A++
Sbjct: 191 FSYDVAGTLALGGGSQSFVQQTASQYSRVFSYCVPPSTSSFGFIMFGVPPQRAALVPTFV 250
Query: 237 ATPL----QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEA 292
+TPL Y + L +I V GR L + P +F +IDS T ++ + A
Sbjct: 251 STPLLSSSTMSPTFYRVLLRSIIVAGRPLPVPPTVFSASS-----VIDSATVISRIPPTA 305
Query: 293 YEALRDEVMIRLEGEQMRSYSWPDKL---CY--HGIMSSDLKGFPTVRFHFRGGAKLALE 347
Y+ALR + M + P + CY G+ S L P++ F GGA + L+
Sbjct: 306 YQALRAAFRSAMT---MYRPAPPVSILDTCYDFSGVRSITL---PSIALVFDGGATVNLD 359
Query: 348 KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ Q C+A P + ++R F IG + Q+ V YD+ K F+ C
Sbjct: 360 AAGILLQ-----GCLAFAPTA-SDRMPGF--IGNVQQRTLEVVYDVPGKAIRFRSAAC 409
>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 473
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 107/389 (27%), Positives = 166/389 (42%), Gaps = 44/389 (11%)
Query: 38 EKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCS 96
+++R + I + + V++ +G P DTGS L W C PC R C
Sbjct: 108 DRLRGSKATKIPAKSGATIGSGNYIVSVGLGTPKKYLSLIFDTGSDLTWTQCQPCARYCY 167
Query: 97 PQLGTIFYPSRSSSYAYVPCDSEHCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVF 151
Q +F PS+S++Y+ + C S C CSA + CIY Y + +
Sbjct: 168 NQKDPVFVPSQSTTYSNISCSSPDCSQLESGTGNQPGCSAAR-ACIYGIQYGDQSFSVGY 226
Query: 152 VSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF--KFSGIFGLGIGRSSLVSQLNSS--- 206
+ E LT +TD +++ +FGCG NR +G+ GLG + S+V Q
Sbjct: 227 FAKETLTLTSTDV----IENFLFGCG-QNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQ 281
Query: 207 -FSYCI-GSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDG---HYYITLEAISVDGRML 261
FSYC+ + YL G GA+ TP+ G Y + + + V G +
Sbjct: 282 VFSYCLPKTSSSTGYLTFGGGGGGGAL----KYTPITKAHGVANFYGVDIVGMKVGGTQI 337
Query: 262 DINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL--- 318
I+ ++F S G +IDSGT +T L +AY AL+ + M Y +L
Sbjct: 338 PISSSVF----STSGAIIDSGTVITRLPPDAYSALKSAFE-----KGMAKYPKAPELSIL 388
Query: 319 --CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNF 376
CY S ++ P V F F+GG +L L+ + Y C+A + N
Sbjct: 389 DTCYDLSKYSTIQ-IPKVGFVFKGGEELDLDGIGIMYGASTSQVCLAF---AGNQDPSTV 444
Query: 377 SLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
++IG + Q+ V YD+G + F C
Sbjct: 445 AIIGNVQQKTLQVVYDVGGGKIGFGYNGC 473
>gi|115465373|ref|NP_001056286.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|113579837|dbj|BAF18200.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|125553268|gb|EAY98977.1| hypothetical protein OsI_20935 [Oryza sativa Indica Group]
Length = 494
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 99/389 (25%), Positives = 153/389 (39%), Gaps = 50/389 (12%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCS---------------PQLGTIFYP 105
++V +G P P DTGS L WV C S +F P
Sbjct: 110 YFVRFRVGTPAQPFVLIADTGSDLTWVKCRGAASPSHATATASPAAAPSPAVAPPRVFRP 169
Query: 106 SRSSSYAYVPCDSEHCRY---FPYARCSAYKHRCIYTQLY--------LIGPETSVFVST 154
S +++ +PC SE C+ F A CS+ C Y Y ++G +++ +
Sbjct: 170 GDSKTWSPIPCSSETCKSTIPFSLANCSSSTAACSYDYRYNDNSAARGVVGTDSATVALS 229
Query: 155 EQLTFKNTDESTIHVQDVVFGCGFS-TNRNFKFS-GIFGLGIGRSSLVSQLNS----SFS 208
+ +Q VV GC + + F+ S G+ LG S S+ S FS
Sbjct: 230 GGRGGGGGGDRKAKLQGVVLGCTTAHAGQGFEASDGVLSLGYSNISFASRAASRFGGRFS 289
Query: 209 YCIGSLHDPDYLHNKLILGDG------AIIDEGDATPLQF---IDGHYYITLEAISVDGR 259
YC+ P + L G G + G TPL + Y + ++++SVDG
Sbjct: 290 YCLVDHLAPRNATSYLTFGAGPDAASSSAPAPGSRTPLLLDARVRPFYAVAVDSVSVDGV 349
Query: 260 MLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLC 319
LDI ++ S GG +IDSGT +T L AY+A+ + +L G + P C
Sbjct: 350 ALDIPAEVWDV-GSNGGTIIDSGTSLTVLATPAYKAVVAALSEQLAGLPRVAMD-PFDYC 407
Query: 320 YHGIMSSDLKG---FPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNF 376
Y+ D G P + F G A+L S P C+ V +
Sbjct: 408 YNWTARGDGGGDLAVPKLAVQFAGSARLEPPAKSYVIDAAPGVKCIGVQ----EGAWPGV 463
Query: 377 SLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
S+IG + QQ + +D+ + F++ C
Sbjct: 464 SVIGNILQQEHLWEFDLNNRWLRFRQTSC 492
>gi|359482287|ref|XP_002263129.2| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
gi|297740017|emb|CBI30199.3| unnamed protein product [Vitis vinifera]
Length = 502
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 97/312 (31%), Positives = 149/312 (47%), Gaps = 33/312 (10%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
L++ + +G PP +DTGS +LWV C C DC G + F PS SS+ + V
Sbjct: 85 LYFTKVKLGSPPREFNVQIDTGSDILWVTCNSCNDCPRTSGLGIELSFFDPSSSSTTSLV 144
Query: 115 PCDSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ- 170
C C A CS ++C Y+ Y G T+ + ++ L F ++
Sbjct: 145 SCSHPICTSLVQTTAAECSPQSNQCSYSFHYGDGSGTTGYYVSDMLYFDTVLGDSLIANS 204
Query: 171 --DVVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNS------SFSYCIGSLHDP 217
+VFGC G T + GIFG G S+VSQL+S FS+C+ D
Sbjct: 205 SASIVFGCSTYQSGDLTKVDKAIDGIFGFGQQDLSVVSQLSSLGITPKVFSHCLKGEGDG 264
Query: 218 DYLHNKLILGDGAIIDEGDA--TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
KL+LG+ I E + +PL HY + L++ISV+G++L I+P +F ++
Sbjct: 265 G---GKLVLGE---ILEPNIIYSPLVPSQSHYNLNLQSISVNGQLLPIDPAVFATSNN-Q 317
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
G ++DSGT +T+LV+ AY+ + + S ++ CY S D + FP V
Sbjct: 318 GTIVDSGTTLTYLVETAYDPFVSAITATVSSSTTPVLSKGNQ-CYLVSTSVD-EIFPPVS 375
Query: 336 FHFRGGAKLALE 347
+F GGA + L+
Sbjct: 376 LNFAGGASMVLK 387
>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 97/372 (26%), Positives = 164/372 (44%), Gaps = 38/372 (10%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ--LG---TIFYPSRSSSYAYV 114
L+Y I IG P + +DTGS ++WV+C C+ C + LG T++ S S V
Sbjct: 79 LYYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLV 138
Query: 115 PCDSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD---ESTIH 168
CD + C P + C A C Y ++Y G T+ + + + + + ++
Sbjct: 139 SCDDDFCYQISGGPLSGCKA-NMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTA 197
Query: 169 VQDVVFGCG------FSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHD 216
V+FGCG ++ GI G G SS++SQL SS F++C+
Sbjct: 198 NGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCL----- 252
Query: 217 PDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
D + I G ++ + + TPL HY + + A+ V L I ++F+ D G
Sbjct: 253 -DGRNGGGIFAIGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLFQPGDRKG 311
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
+ IDSGT + +L + YE L ++ + ++ D C+ D +GFP V
Sbjct: 312 AI-IDSGTTLAYLPEIIYEPLVKKITSQEPALKVHIVD-KDYKCFQYSGRVD-EGFPNVT 368
Query: 336 FHFRGGAKLAL-EKDSMFYQPRPDAFCMAVNPASINNR-YVNFSLIGMMAQQFYNVGYDI 393
FHF L + D +F P +C+ +++ +R N +L+G + V YD+
Sbjct: 369 FHFENSVFLRVYPHDYLF--PHEGMWCIGWQNSAMQSRDRRNMTLLGDLVLSNKLVLYDL 426
Query: 394 GRKQKTFQRMDC 405
+ + +C
Sbjct: 427 ENQLIGWTEYNC 438
>gi|359488213|ref|XP_002263620.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 434
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 95/370 (25%), Positives = 163/370 (44%), Gaps = 44/370 (11%)
Query: 63 VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR 122
V++ IG PP Q +DTGS L W+ C P T F P SSS++ +PC+ C+
Sbjct: 80 VSLPIGTPPQTQQMVLDTGSQLSWIQCKVPPKTPP---TAFDPLLSSSFSVLPCNHSLCK 136
Query: 123 -----YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
Y C C Y+ Y G + E+ TF ++ + ++ GC
Sbjct: 137 PRVPDYTLPTSCDQ-NRLCHYSYFYADGTYAEGNLVREKFTFSSSQTT----PPLILGCA 191
Query: 178 FSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCI------------GSLH---DPDYLH 221
++ GI G+ +GR S S S FSYC+ GS + +P
Sbjct: 192 TDSSDT---QGILGMNLGRLSFSSLAKISKFSYCVPPRRSQSGSSPTGSFYLGPNPSSAG 248
Query: 222 NKLILGDGAIIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMI 279
K + ++ + + +D Y + + I ++G+ L+I+ + F+ D SG G +I
Sbjct: 249 FKYV----NLMTYRQSQRMPNLDPLAYTLPMLGIRINGKKLNISTSAFRADPSGAGQTLI 304
Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS---YSWPDKLCYHGIMSSDLKGFPTVRF 336
DSGT T+LV EAY +++E+ ++L G +++ Y +C+ G + + F
Sbjct: 305 DSGTWFTFLVDEAYSKVKEEI-VKLAGPKLKKGYVYGGSLDMCFDGDAMVIGRMIGNMAF 363
Query: 337 HFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
F G ++ +E++ M C+ + + + N +IG QQ V +D+ +
Sbjct: 364 EFENGVEIVVEREKMLADVGGGVQCLGIGRSDLLGVASN--IIGNFHQQDLWVEFDLVGR 421
Query: 397 QKTFQRMDCE 406
+ F R DC
Sbjct: 422 RVGFGRTDCS 431
>gi|125527257|gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group]
Length = 484
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 114/438 (26%), Positives = 169/438 (38%), Gaps = 72/438 (16%)
Query: 32 RLAYLQEK--IRSHNTYQAQILPSND---ISNSLFYVNISIGQPPVPQFTAMDTGSSLLW 86
R+A++ + R+ T A +P + ++V +G P P DTGS L W
Sbjct: 53 RMAFISSRGRRRAAETASAFAMPLSSGAYTGTGQYFVRFRVGTPAQPFLLVADTGSDLTW 112
Query: 87 VHCY----------------PC-RDCSPQLGTIFYPSRSSSYAYVPCDSEHCRY---FPY 126
V C+ P SP+ F P +S ++A +PC S CR F
Sbjct: 113 VKCHRAAAAASASPRNASSLPAPAPASPR--RTFRPDKSRTWAPIPCSSATCRESLPFSL 170
Query: 127 ARCSAYKHRCIYTQLYLIGPET--SVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTN-RN 183
A C+ + C Y Y G +V V + + ++ VV GC S N ++
Sbjct: 171 AACATPANPCAYDYRYKDGSAARGTVGVDSATIALSGRAARKAKLRGVVLGCTTSYNGQS 230
Query: 184 FKFS-GIFGLGIGRSSLVSQLNSSF----SYCIGSLHDPDYLHNKLILGDGAIID----- 233
F S G+ LG S S+ S F SYC+ P + L G
Sbjct: 231 FLASDGVLSLGYSNISFASRAASRFGGRFSYCLVDHLAPRNATSYLTFGPNPAFSSRRPS 290
Query: 234 EGDA--------------------TPLQF---IDGHYYITLEAISVDGRMLDINPNIFKR 270
EG A TPL Y +T++ +SV G +L I P
Sbjct: 291 EGIASCKPAPAPTPAPAGAPGARQTPLVLDHRTRPFYAVTVKGVSVAGELLKI-PRAVWD 349
Query: 271 DDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGI--MSSDL 328
+ GGG ++DSGT +T L K AY A+ + RL G + P CY+ SD+
Sbjct: 350 VEQGGGAILDSGTSLTMLAKPAYRAVVAALSKRLAGLPRVTMD-PFDYCYNWTSPSGSDV 408
Query: 329 KG-FPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFY 387
P + HF G A+L S P C+ + + S+IG + QQ +
Sbjct: 409 AAPLPMLAVHFAGSARLEPPAKSYVIDAAPGVKCIGLQ----EGPWPGLSVIGNILQQEH 464
Query: 388 NVGYDIGRKQKTFQRMDC 405
YD+ ++ F+R C
Sbjct: 465 LWEYDLKNRRLRFKRSRC 482
>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 473
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 97/400 (24%), Positives = 170/400 (42%), Gaps = 46/400 (11%)
Query: 38 EKIRSHNTYQ-AQILPSNDI---------SNSLFYVNISIGQPPVPQFTAMDTGSSLLWV 87
E +SH+T + +++L S D+ S L++ I +G PP +DTGS +LWV
Sbjct: 41 EHFKSHDTRRHSRMLASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILWV 100
Query: 88 HCYPCRDCSPQLGTIFYPS-----RSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLY 142
+C PC +C + F+ S SS+ V CD + C + + C Y +Y
Sbjct: 101 NCKPCPECPSKTNLNFHLSLFDVNASSTSKKVGCDDDFCSFISQSDSCQPAVGCSYHIVY 160
Query: 143 LIGPETSVFVSTEQLTFKNTD---ESTIHVQDVVFGCGFSTNRNFKFS-----GIFGLGI 194
+ ++LT + ++ Q+VVFGCG + S G+ G G
Sbjct: 161 ADESTSEGNFIRDKLTLEQVTGDLQTGPLGQEVVFGCGSDQSGQLGKSDSAVDGVMGFGQ 220
Query: 195 GRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFIDGHY 247
+S++SQL ++ FS+C+ D + I G + + TP+ HY
Sbjct: 221 SNTSVLSQLAATGDAKRVFSHCL------DNVKGGGIFAVGVVDSPKVKTTPMVPNQMHY 274
Query: 248 YITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGE 307
+ L + VDG LD+ P+I + GG ++DSGT + + K Y++L + ++ R +
Sbjct: 275 NVMLMGMDVDGTALDLPPSIMRN----GGTIVDSGTTLAYFPKVLYDSLIETILAR---Q 327
Query: 308 QMRSYSWPDKL-CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNP 366
++ + D C+ + D+ FP V F F KL + + + +C
Sbjct: 328 PVKLHIVEDTFQCFSFSENVDV-AFPPVSFEFEDSVKLTVYPHDYLFTLEKELYCFGWQA 386
Query: 367 ASINN-RYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ L+G + V YD+ + + +C
Sbjct: 387 GGLTTGERTEVILLGDLVLSNKLVVYDLENEVIGWADHNC 426
>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
Length = 415
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 105/375 (28%), Positives = 160/375 (42%), Gaps = 61/375 (16%)
Query: 54 NDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAY 113
N + + + V+++IG PP P +DTGS L+W C PC C Q F PS SS+ +
Sbjct: 82 NGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSL 141
Query: 114 VPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
CDS C+ P A +++ TF S V V
Sbjct: 142 TSCDSTLCQGLPVAS----------------------LPRSDKFTFVGAGAS---VPGVA 176
Query: 174 FGCGFSTNRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDP-------DYLHNK 223
FGCG N FK +GI G G G SL SQL +FS+C ++ D +
Sbjct: 177 FGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTTITGAIPSTVLLDLPADL 236
Query: 224 LILGDGAIIDEGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
G GA+ TPL + YY++L+ I+V L + + F + GG +ID
Sbjct: 237 FSNGQGAV----QTTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGTGGTIID 292
Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG---FPTVRFH 337
SGT +T L Y +RD +++ + + C +S+ L+ P + H
Sbjct: 293 SGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFC----LSAPLRAKPYVPKLVLH 348
Query: 338 FRGGAKLALEKDSMFYQPRPDA----FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
F GA + L +++ ++ DA C+A+ I V + IG QQ +V YD+
Sbjct: 349 FE-GATMDLPRENYVFEVE-DAGSSILCLAI----IEGGEV--TTIGNFQQQNMHVLYDL 400
Query: 394 GRKQKTFQRMDCEVL 408
+ +F C+ L
Sbjct: 401 QNSKLSFVPAQCDKL 415
>gi|226532674|ref|NP_001151415.1| pepsin A precursor [Zea mays]
gi|195646632|gb|ACG42784.1| pepsin A [Zea mays]
Length = 492
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 98/377 (25%), Positives = 158/377 (41%), Gaps = 39/377 (10%)
Query: 57 SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSY 111
+ L+Y I IG PP + +DTGS +LWV+ C C + G T + P+ S +
Sbjct: 81 ATGLYYTRIEIGSPPKGYYVQVDTGSDILWVNGISCDGCPTRSGLGIELTQYDPAGSGT- 139
Query: 112 AYVPCDSEHCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD--- 163
V C+ E C A C + C + Y G T+ F T+ + +
Sbjct: 140 -TVGCEQEFCVANSAASGVPPACPSAASPCQFRITYGDGSSTTGFYVTDFVQYNQVSGNG 198
Query: 164 ESTIHVQDVVFGCGFSTNRNFKFS-----GIFGLGIGRSSLVSQLNSS------FSYCIG 212
++T + FGCG + S GI G G +S++SQL ++ F++C+
Sbjct: 199 QTTPSNVSITFGCGAQLGGDLGSSSQALDGILGFGQSDASMLSQLAAARKVRKIFAHCL- 257
Query: 213 SLHDPDYLHNKLILGDGAIIDEG--DATPLQFIDGHYYITLEAISVDGRMLDINPNIFKR 270
D + I G ++ TPL HY + L+ ISV G L + + F
Sbjct: 258 -----DTVRGGGIFAIGNVVQPPIVKTTPLVPNATHYNVNLQGISVGGATLQLPTSTFDS 312
Query: 271 DDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG 330
DS G +IDSGT + +L +E Y L V + +R+Y D +C+ S D +
Sbjct: 313 GDS-KGTIIDSGTTLAYLPREVYRTLLTAVFDKHPDLAVRNYE--DFICFQFSGSLDEE- 368
Query: 331 FPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR-YVNFSLIGMMAQQFYNV 389
FP + F F G L + +Q D +CM + + + L+G + V
Sbjct: 369 FPVITFSFEGDLTLNVYPHDYLFQNGNDLYCMGFLDGGVQTKDGKDMVLLGDLVLSNKLV 428
Query: 390 GYDIGRKQKTFQRMDCE 406
YD+ ++ + +C
Sbjct: 429 VYDLEKQVIGWTDYNCS 445
>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 484
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 97/372 (26%), Positives = 165/372 (44%), Gaps = 38/372 (10%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ--LG---TIFYPSRSSSYAYV 114
L+Y I IG P + +DTGS ++WV+C C+ C + LG T++ S S V
Sbjct: 79 LYYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLV 138
Query: 115 PCDSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD---ESTIH 168
CD + C P + C A C Y ++Y G T+ + + + + + ++
Sbjct: 139 SCDDDFCYQISGGPLSGCKA-NMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTA 197
Query: 169 VQDVVFGCG------FSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHD 216
V+FGCG ++ GI G G SS++SQL SS F++C+
Sbjct: 198 NGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCL----- 252
Query: 217 PDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
D + I G ++ + + TPL HY + + A+ V L+I ++F+ D G
Sbjct: 253 -DGRNGGGIFAIGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLNIPADLFQPGDRKG 311
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
+ IDSGT + +L + YE L ++ + ++ D C+ D +GFP V
Sbjct: 312 AI-IDSGTTLAYLPEIIYEPLVKKITSQEPALKVHIVD-KDYKCFQYSGRVD-EGFPNVT 368
Query: 336 FHFRGGAKLAL-EKDSMFYQPRPDAFCMAVNPASINNR-YVNFSLIGMMAQQFYNVGYDI 393
FHF L + D +F P +C+ +++ +R N +L+G + V YD+
Sbjct: 369 FHFENSVFLRVYPHDYLF--PYEGMWCIGWQNSAMQSRDRRNMTLLGDLVLSNKLVLYDL 426
Query: 394 GRKQKTFQRMDC 405
+ + +C
Sbjct: 427 ENQLIGWTEYNC 438
>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
Length = 525
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 108/388 (27%), Positives = 168/388 (43%), Gaps = 49/388 (12%)
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
+ + + +++ +G PP MDTGS L W+ C PC DC Q G +F P+ SSSY V
Sbjct: 146 VGSGEYLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASSSYRNVT 205
Query: 116 CDSEHCRYFPYARCSAY----------KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDE- 164
C C + + C Y Y T+ ++ E T T
Sbjct: 206 CGDHRCGHVAPPPEPEASSPRTCRRPGEDPCPYYYWYGDQSNTTGDLALESFTVNLTAPG 265
Query: 165 STIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDY 219
++ V VVFGCG F +G+ GLG G S SQL + +FSYC+ H D
Sbjct: 266 ASRRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHTFSYCLVD-HGSD- 323
Query: 220 LHNKLILGDGAIIDEGDATP-LQFI------------DGHYYITLEAISVDGRMLDINPN 266
+ +K++ G+ A P L++ D YY+ L+ + V G +L+I+ +
Sbjct: 324 VGSKVVFGEDDDALALAAHPQLKYTAFAPASSSSSPADTFYYVKLKGVLVGGELLNISSD 383
Query: 267 IFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL-----CY 320
+ D GG +IDSGT +++ V+ AY+ +R M R+ RSY + CY
Sbjct: 384 TWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRHAFMDRMS----RSYPLVPEFPVLSPCY 439
Query: 321 HGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDA---FCMAVNPASINNRYVNFS 377
+ + + P + F GA ++ F + PD C+AV + S
Sbjct: 440 N-VSGVERPEVPELSLLFADGAVWDFPAENYFIRLDPDGGSIMCLAV----LGTPRTGMS 494
Query: 378 LIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+IG QQ ++V YD+ + F C
Sbjct: 495 IIGNFQQQNFHVVYDLQNNRLGFAPRRC 522
>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
Length = 464
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 110/429 (25%), Positives = 178/429 (41%), Gaps = 55/429 (12%)
Query: 22 VQRGINISIARLAYLQEKIRSHNTYQAQILPSNDI--SNSLFYVNISIGQPPVPQFTAMD 79
++R I S RLA + + + ++ I + + V + IG PP A+D
Sbjct: 48 LRRAIQRSRYRLAGIGMARGEAASARKAVVAETPIMPAGGEYLVKLGIGTPPYKFTAAID 107
Query: 80 TGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCS-AYKHRCIY 138
T S L+W C PC C Q+ +F P SS+YA +PC S+ C RC C Y
Sbjct: 108 TASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHDDDESCQY 167
Query: 139 TQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF---KFSGIFGLGIG 195
T Y T ++ ++L + V FGC S+ + SG+ GLG G
Sbjct: 168 TYTYSGNATTEGTLAVDKLVIGED-----AFRGVAFGCSTSSTGGAPPPQASGVVGLGRG 222
Query: 196 RSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGD----ATPLQ---FIDGHY 247
SLVSQL+ F+YC+ + KL+LG A A P++ +Y
Sbjct: 223 PLSLVSQLSVRRFAYCLPP--PASRIPGKLVLGADADAARNATNRIAVPMRRDPRYPSYY 280
Query: 248 YITLEAISVDGRMLDI---------------------NPN---IFKRDDSGGGVMIDSGT 283
Y+ L+ + + R + + +PN + D + G++ID +
Sbjct: 281 YLNLDGLLIGDRTMSLPPTTTTTATATATAPAPAPTPSPNATAVAVGDANRYGMIIDIAS 340
Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCY---HGIMSSDLKGFPTVRFHFRG 340
+T+L Y+ L +++ + + + S LC+ G+ + D P V F
Sbjct: 341 TITFLEASLYDELVNDLEVEIRLPRGTGSSLGLDLCFILPDGV-AFDRVYVPAVALAF-D 398
Query: 341 GAKLALEKDSMFYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKT 399
G L L+K +F + R C+ V A + S++G QQ V Y++ R + T
Sbjct: 399 GRWLRLDKARLFAEDRESGMMCLMVGRAEAG----SVSILGNFQQQNMQVLYNLRRGRVT 454
Query: 400 FQRMDCEVL 408
F + C L
Sbjct: 455 FVQSPCGAL 463
>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 442
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 95/321 (29%), Positives = 146/321 (45%), Gaps = 41/321 (12%)
Query: 63 VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHC- 121
++I++G PP +DTGS L W+HC + F P+ SSSY + C S C
Sbjct: 68 ISITVGTPPQNMSMVIDTGSELSWLHCNTNTTATIPY-PFFNPNISSSYTPISCSSPTCT 126
Query: 122 ---RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC-- 176
R FP + C T Y + ++++ F ++ I VFGC
Sbjct: 127 TRTRDFPIPASCDSNNLCHATLSYADASSSEGNLASDTFGFGSSFNPGI-----VFGCMN 181
Query: 177 -GFSTN--RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAII 232
+STN + +G+ G+ +G SLVSQL FSYCI L+LG+
Sbjct: 182 SSYSTNSESDSNTTGLMGMNLGSLSLVSQLKIPKFSYCISG----SDFSGILLLGESNFS 237
Query: 233 DEGD---------ATPLQFID-GHYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMIDS 281
G +TPL + D Y + LE I + ++L+I+ N+F D +G G M D
Sbjct: 238 WGGSLNYTPLVQISTPLPYFDRSAYTVRLEGIKISDKLLNISGNLFVPDHTGAGQTMFDL 297
Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD-------KLCYH-GIMSSDLKGFPT 333
GT ++L+ Y ALRDE + + G +R+ P+ LCY + S+L P+
Sbjct: 298 GTQFSYLLGPVYNALRDEFLNQTNG-TLRALDDPNFVFQIAMDLCYRVPVNQSELPELPS 356
Query: 334 VRFHFRGGAKLALEKDSMFYQ 354
V F GA++ + D + Y+
Sbjct: 357 VSLVFE-GAEMRVFGDQLLYR 376
>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
Length = 464
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 110/429 (25%), Positives = 178/429 (41%), Gaps = 55/429 (12%)
Query: 22 VQRGINISIARLAYLQEKIRSHNTYQAQILPSNDI--SNSLFYVNISIGQPPVPQFTAMD 79
++R I S RLA + + + ++ I + + V + IG PP A+D
Sbjct: 48 LRRAIQRSRYRLAGIGMARGEAASARKAVVAETPIMPAGGEYLVKLGIGTPPYKFTAAID 107
Query: 80 TGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCS-AYKHRCIY 138
T S L+W C PC C Q+ +F P SS+YA +PC S+ C RC C Y
Sbjct: 108 TASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHDDDESCQY 167
Query: 139 TQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF---KFSGIFGLGIG 195
T Y T ++ ++L + V FGC S+ + SG+ GLG G
Sbjct: 168 TYTYSGNATTEGTLAVDKLVIGED-----AFRGVAFGCSTSSTGGAPPPQASGVVGLGRG 222
Query: 196 RSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGD----ATPLQ---FIDGHY 247
SLVSQL+ F+YC+ + KL+LG A A P++ +Y
Sbjct: 223 PLSLVSQLSVRRFAYCLPP--PASRIPGKLVLGADADAARNATNRIAVPMRRDPRYPSYY 280
Query: 248 YITLEAISVDGRMLDI---------------------NPN---IFKRDDSGGGVMIDSGT 283
Y+ L+ + + R + + +PN + D + G++ID +
Sbjct: 281 YLNLDGLLIGDRAMSLPPTTTTTATATATAPAPAPTPSPNATAVAVGDANRYGMIIDIAS 340
Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCY---HGIMSSDLKGFPTVRFHFRG 340
+T+L Y+ L +++ + + + S LC+ G+ + D P V F
Sbjct: 341 TITFLEASLYDELVNDLEVEIRLPRGTGSSLGLDLCFILPDGV-AFDRVYVPAVALAF-D 398
Query: 341 GAKLALEKDSMFYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKT 399
G L L+K +F + R C+ V A + S++G QQ V Y++ R + T
Sbjct: 399 GRWLRLDKARLFAEDRESGMMCLMVGRAEAG----SVSILGNFQQQNMQVLYNLRRGRVT 454
Query: 400 FQRMDCEVL 408
F + C L
Sbjct: 455 FVQSPCGAL 463
>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
vinifera]
Length = 560
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 100/400 (25%), Positives = 173/400 (43%), Gaps = 42/400 (10%)
Query: 38 EKIRSHNTYQ-AQILPSNDI---------SNSLFYVNISIGQPPVPQFTAMDTGSSLLWV 87
+ +R+H+T + +IL + D+ L++ I IG P + +DTGS +LWV
Sbjct: 122 DALRAHDTRRHGRILSAVDLPLGGNGHPSEAGLYFAKIGIGTPSKDYYVQVDTGSDILWV 181
Query: 88 HCYPCRDCSPQ--LG---TIFYPSRSSSYAYVPCDSEHCRYF--PYARCSAYKHRCIYTQ 140
+C C C + LG T++ S++ V CD C + P C +C+Y+
Sbjct: 182 NCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVGCDDNFCSLYDGPLPGCKP-GLQCLYSV 240
Query: 141 LYLIGPETSVFVSTEQLTFKNTD---ESTIHVQDVVFGCGFSTNRNFKFS-----GIFGL 192
LY G T+ + + + + ++T VVFGCG + S GI G
Sbjct: 241 LYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGF 300
Query: 193 GIGRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFIDG 245
G SS++SQL SS FS+C+ D + I G +++ + + TPL
Sbjct: 301 GQANSSMLSQLASSGKVKKVFSHCL------DNVDGGGIFAIGEVVEPKVNITPLVQNQA 354
Query: 246 HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLE 305
HY + ++ I V G LD+ + F+ D G +IDSGT + + +E Y L ++++ +
Sbjct: 355 HYNVVMKEIEVGGDPLDVPSDAFESGDR-KGTIIDSGTTLAYFPQEVYVPLIEKILS--Q 411
Query: 306 GEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVN 365
+R ++ + GFPTV HF L + +Q + N
Sbjct: 412 QPDLRLHTVEQAFTCFDYTGNVDDGFPTVTLHFDKSISLTVYPHEYLFQHEFEWCIGWQN 471
Query: 366 PASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ + +L+G + V YD+ ++ + +C
Sbjct: 472 SGAQTKDGKDLTLLGDLVLSNKLVVYDLEKQGIGWVEYNC 511
>gi|297806153|ref|XP_002870960.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
gi|297316797|gb|EFH47219.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
Length = 453
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 108/379 (28%), Positives = 164/379 (43%), Gaps = 57/379 (15%)
Query: 70 PPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR-----YF 124
PP +DTGS L W+ C R +P F P+RSSSY+ +PC S CR +
Sbjct: 82 PPQNISMVIDTGSELSWLRCN--RSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRDFL 139
Query: 125 PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC-----GFS 179
A C + K C T Y + ++ E F N+ + +++FGC G
Sbjct: 140 IPASCDSDK-LCHATLSYADASSSEGNLAAEIFHFGNSTNDS----NLIFGCMGSVSGSD 194
Query: 180 TNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHD-PDYLHNKLILGDGAIIDEGD- 236
+ K +G+ G+ G S +SQ+ FSYCI D P + L+LGD
Sbjct: 195 PEEDTKTTGLLGMNRGSLSFISQMGFPKFSYCISGTDDFPGF----LLLGDSNFTWLTPL 250
Query: 237 --------ATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMIDSGTDVT 286
+TPL + D Y + L I V+G++L I ++ D +G G M+DSGT T
Sbjct: 251 NYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLLPDHTGAGQTMVDSGTQFT 310
Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPD-------KLCYH----GIMSSDLKGFPTVR 335
+L+ Y ALR + + + G + Y P+ LCY I + L PTV
Sbjct: 311 FLLGPVYTALRSDFLNQTNG-ILTVYEDPEFVFQGTMDLCYRISPFRIRTGILHRLPTVS 369
Query: 336 FHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMA-------QQFYN 388
F GA++A+ + Y+ A N + + N L+GM A QQ
Sbjct: 370 LVFE-GAEIAVSGQPLLYR---VPHLTAGNDSVYCFTFGNSDLMGMEAYVIGHHHQQNMW 425
Query: 389 VGYDIGRKQKTFQRMDCEV 407
+ +D+ R + + C+V
Sbjct: 426 IEFDLQRSRIGLAPVQCDV 444
>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
Length = 460
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 111/432 (25%), Positives = 175/432 (40%), Gaps = 59/432 (13%)
Query: 1 LIHRDSILSPYNNPN----ENPAHRVQRGINISIARLAYLQEKIR---------SHNTYQ 47
L HR SP E+ HR Q R AY++ K + Q
Sbjct: 61 LHHRHGPCSPLPTKKMPSLEDRLHRDQL-------RAAYIKRKFSGDVKKDGQGAGGVEQ 113
Query: 48 AQILPSNDISNSL----FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIF 103
+ + + SL + + + +G P Q +D+GS + WV C PC C Q+ +F
Sbjct: 114 SHVTVPTTLGTSLNTLEYLITVRLGSPAKTQTVLIDSGSDVSWVQCKPCLQCHSQVDPLF 173
Query: 104 YPSRSSSYAYVPCDSEHCRYFPY--ARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKN 161
PS SS+Y+ C S C CS+ +C Y Y G T+ S++ L +
Sbjct: 174 DPSLSSTYSPFSCSSAACAQLGQDGNGCSS-SSQCQYIVRYADGSSTTGTYSSDTLALGS 232
Query: 162 TDESTIHVQDVVFGCG-FSTNRNFKFSGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHD 216
+ + FGC + N G+ GLG G SL SQ ++FSYC+
Sbjct: 233 NT-----ISNFQFGCSHVESGFNDLTDGLMGLGGGAPSLASQTAGTFGTAFSYCL----- 282
Query: 217 PDYLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDS 273
P + L GA TP+ + Y + LEAI V G L I ++F
Sbjct: 283 PPTPSSSGFLTLGAGTSGFVKTPMLRSSPVPTFYGVRLEAIRVGGTQLSIPTSVFS---- 338
Query: 274 GGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPT 333
G+++DSGT +T L + AY AL ++ + C+ S ++ P+
Sbjct: 339 -AGMVMDSGTIITRLPRTAYSALSSAFKAGMKQYRPAPPRSIMDTCFDFSGQSSVR-LPS 396
Query: 334 VRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
V F GGA + L+ + + C+A + N+ + ++G + Q+ + V YD+
Sbjct: 397 VALVFSGGAVVNLDANGIIL-----GNCLAF---AANSDDSSPGIVGNVQQRTFEVLYDV 448
Query: 394 GRKQKTFQRMDC 405
G F+ C
Sbjct: 449 GGGAVGFKAGAC 460
>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 461
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 107/426 (25%), Positives = 169/426 (39%), Gaps = 39/426 (9%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
L HRD++L P R++ I R + + K S + + D +
Sbjct: 53 LAHRDTLL-------PKPLSRIEDVIGADQKRHSLISRKRNSTVGVKMDLGSGIDYGTAQ 105
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHC-YPCRDCSPQLGTIFYPSRSSSYAYVPCDSE 119
++ I +G P +DTGS L WV+C Y R + +F S S+ V C ++
Sbjct: 106 YFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRARGKDNR--RVFRADESKSFKTVGCLTQ 163
Query: 120 HCR-----YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
C+ F C C Y Y G + E +T T+ + +
Sbjct: 164 TCKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRMARLPGHLI 223
Query: 175 GCGFS-TNRNFKFS-GIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILGD 228
GC S T ++F+ + G+ GL S S S FSYC+ + N LI G
Sbjct: 224 GCSSSFTGQSFQGADGVLGLAFSDFSFTSTATSLYGAKFSYCLVDHLSNKNVSNYLIFGS 283
Query: 229 GAIIDEG--DATPLQF--IDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
TPL I Y I + IS+ MLDI ++ SGGG ++DSGT
Sbjct: 284 SRSTKTAFRRTTPLDLTRIPPFYAINVIGISLGYDMLDIPSQVWDA-TSGGGTILDSGTS 342
Query: 285 VTWLVKEAYEALRDEVMIRL-EGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAK 343
+T L AY+ + + L E ++++ P + C+ ++ P + FH +GGA+
Sbjct: 343 LTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCFSFTSGFNVSKLPQLTFHLKGGAR 402
Query: 344 LALEKDSMFYQPRPDAFCM----AVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKT 399
+ S P C+ A PA+ ++IG + QQ Y +D+ +
Sbjct: 403 FEPHRKSYLVDAAPGVKCLGFVSAGTPAT--------NVIGNIMQQNYLWEFDLMASTLS 454
Query: 400 FQRMDC 405
F C
Sbjct: 455 FAPSAC 460
>gi|297605079|ref|NP_001056639.2| Os06g0121800 [Oryza sativa Japonica Group]
gi|255676668|dbj|BAF18553.2| Os06g0121800 [Oryza sativa Japonica Group]
Length = 487
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 103/353 (29%), Positives = 147/353 (41%), Gaps = 32/353 (9%)
Query: 66 SIGQPPVPQFTAMDTGSSLLWVHCYPCR--DCSPQLGTIFYPSRSSSYAYVPCDSEHCRY 123
+I P + Q ++DT L W+ C PC +C PQ +F P RS + A VPC S C
Sbjct: 154 AIDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGE 213
Query: 124 FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRN 183
++C Y Y G TS + LT + ST+ V + FGC + N
Sbjct: 214 LGRYGAGCSNNQCQYFVDYGDGRATSGTYMVDALTL---NPSTV-VMNFRFGCSHAVRGN 269
Query: 184 FKF--SGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDA 237
F SG LG GR SL+SQ ++FSYC+ +L + DG
Sbjct: 270 FSASTSGTMSLGGGRQSLLSQTAATFGNAFSYCVPDPSSSGFL-SLGGPADGGGAGRFAR 328
Query: 238 TPL----QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAY 293
TPL I Y + L I V GR L++ P +F GG ++DS +T L AY
Sbjct: 329 TPLVRNPSIIPTLYLVRLRGIEVGGRRLNVPPVVFA-----GGAVMDSSVIITQLPPTAY 383
Query: 294 EALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMF 352
ALR + ++ CY + + + P V F GGA + L+ +
Sbjct: 384 RALRLAFRSAMAAYPRVAGGRAGLDTCYDFVRFTSVT-VPAVSLVFDGGAVVRLDAMGVM 442
Query: 353 YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ C+A P + IG + QQ + V YD+G F+R C
Sbjct: 443 VE-----GCLAFVPTPGD---FALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 487
>gi|449451076|ref|XP_004143288.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 488
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 105/375 (28%), Positives = 173/375 (46%), Gaps = 46/375 (12%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLG---TIFYPSRSSSYAYV 114
L++ + +G P +DTGS +LWV C PC C S LG +F ++SSS +
Sbjct: 83 LYFTKVKLGNPAREFNVQIDTGSDILWVTCSPCDGCPDSSGLGIELNLFDTTKSSSARVL 142
Query: 115 PCDSEHCRYFPYA--RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKN-TDESTI--HV 169
PC C +C C Y+ Y TS F T+ + F ESTI
Sbjct: 143 PCTDPICAAVSTTTDQCLTQTDHCSYSFHYRDRSGTSGFYVTDSMHFDILLGESTIANSS 202
Query: 170 QDVVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPD 218
+VFGC G T GIFG G G S++SQL+S FS+C+ +
Sbjct: 203 ATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHCLKGGENGG 262
Query: 219 YLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
+ L+LG+ I++ +PL HY + L++I++ G++ NP +F ++ G
Sbjct: 263 GI---LVLGE--ILEPSIVYSPLIPSQPHYTLKLQSIALSGQLFP-NPTMFPISNA-GET 315
Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMS-SDLKGFPTVRF 336
+IDSGT + +LV+E Y+ + + + + S + C+ MS +D+ FP +RF
Sbjct: 316 IIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQ-CFRVSMSVADI--FPVLRF 372
Query: 337 HFRGGAKLA------LEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
+F G A + L+ DS+ +P +C+ A +++G + + +
Sbjct: 373 NFEGIASMVVTPEEYLQFDSIVREPA--LWCIGFQKAE-----DGLNILGDLVLKDKIIV 425
Query: 391 YDIGRKQKTFQRMDC 405
YD+ R++ + DC
Sbjct: 426 YDLARQRIGWANYDC 440
>gi|357155293|ref|XP_003577072.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Brachypodium distachyon]
Length = 429
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 103/394 (26%), Positives = 175/394 (44%), Gaps = 66/394 (16%)
Query: 50 ILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC----YPCRDCSPQLGTIFYP 105
++ +++I F+++IS+G PPV +DTGS+L WV C C +P+ G++F P
Sbjct: 64 VVGNHEIHEGKFFMDISLGTPPVANLVTVDTGSTLSWVVCQRCQISCHTTAPEAGSVFDP 123
Query: 106 SRSSSYAYVPCDSEHCR------YFPYARCSAYKHRCIYTQLYLIGPE---TSVFVSTEQ 156
+S++Y V C S C P+ C C+Y+ Y GP ++ + T++
Sbjct: 124 DKSTTYELVGCSSRDCADVQRSLVAPFG-CIEETDTCLYSLRYGSGPSGQYSAGRLGTDK 182
Query: 157 LTFKNTDESTIHVQDVVFGCGFSTNRNFKF--SGIFGLGIGRSSLVSQLN-----SSFSY 209
LT + S+ + +FGC S + +FK SG+ G G S +Q+ +FSY
Sbjct: 183 LTLAS---SSSIIDGFIFGC--SGDDSFKGYESGVIGFGGANFSFFNQVARQTNYRAFSY 237
Query: 210 C------------IGSLHDPDYLHNKLI--LGDGAIIDEGDATPLQFIDGHYYITLEAIS 255
C IG+ + ++ LI GD ++ LQ ID +
Sbjct: 238 CFPGDHTAEGFLSIGAYPKDELVYTNLIPHFGDRSVYS------LQQID---------MM 282
Query: 256 VDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWP 315
VDG L ++ + + + +++DSGT T+L+ ++A + ++ + S +
Sbjct: 283 VDGNRLQVDQSEYTKR----MMVVDSGTVDTFLLGPVFDAFSKAMASAMQAKGFLSDTVG 338
Query: 316 DKLCY--HGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQ--PRPDAFCMAVNPASINN 371
+ C+ +G S D PTV F G L L +++F+ P D C+A P
Sbjct: 339 TETCFRPNGGDSVDSGDLPTVEMRFI-GTTLKLPPENVFHDLLPSHDKICLAFKPDVAGV 397
Query: 372 RYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
R N ++G A + V YD+ FQ C
Sbjct: 398 R--NVQILGNKATXSFRVVYDLQAMYFGFQAGAC 429
>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
Length = 434
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 98/369 (26%), Positives = 156/369 (42%), Gaps = 42/369 (11%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYP------SRSSSYAY 113
L++ + +G PP +DTGS LLWV+C+PC C P + P S+S +
Sbjct: 35 LYFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPCIGC-PAFSDLKIPIVPYDVKASASSSK 93
Query: 114 VPCDSEHCRYFPYARCSAY--KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
VPC C S +++C Y+ Y G T ++ + L + +T
Sbjct: 94 VPCSDPSCTLITQISESGCNDQNQCGYSFQYGDGSGTLGYLVEDVLHYMVNATAT----- 148
Query: 172 VVFGCGFSTNRNFKFS-----GIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYL 220
V+FGCGF + + S GI G G S SQL F++C L +
Sbjct: 149 VIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHC---LDGGERG 205
Query: 221 HNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
L+LG+ I + TPL HY + L++ISV+ L I+P +F +D G + D
Sbjct: 206 GGILVLGN-VIEPDIQYTPLVPYMSHYNVVLQSISVNNANLTIDPKLFS-NDVMQGTIFD 263
Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRG 340
SGT + +L EAY+A V + + P LC + K FP V +F G
Sbjct: 264 SGTTLAYLPDEAYQAFTQAVSLVVA---------PFLLCDTRLSRFIYKLFPNVVLYFEG 314
Query: 341 GAKLALEKDSMFYQ---PRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
+ + + Q +CM + +++ G + + V YD+ R +
Sbjct: 315 ASMTLTPAEYLIRQASAANAPIWCMGWQSMGSAESELQYTIFGDLVLKNKLVVYDLERGR 374
Query: 398 KTFQRMDCE 406
++ DC+
Sbjct: 375 IGWRPFDCK 383
>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
Length = 388
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 99/371 (26%), Positives = 157/371 (42%), Gaps = 42/371 (11%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYP------SRSSSYAY 113
L++ + +G PP +DTGS LLWV+C+PC C P + P S+S +
Sbjct: 35 LYFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPCIGC-PAFSDLKIPIVPYDVKASASSSK 93
Query: 114 VPCDSEHCRYFPYARCSAY--KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
VPC C S +++C Y+ Y G T ++ + L + +T
Sbjct: 94 VPCSDPSCTLITQISESGCNDQNQCGYSFQYGDGSGTLGYLVEDVLHYMVNATAT----- 148
Query: 172 VVFGCGFSTNRNFKFS-----GIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYL 220
V+FGCGF + + S GI G G S SQL F++C L +
Sbjct: 149 VIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHC---LDGGERG 205
Query: 221 HNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
L+LG+ I + TPL HY + L++ISV+ L I+P +F +D G + D
Sbjct: 206 GGILVLGN-VIEPDIQYTPLVPYMYHYNVVLQSISVNNANLTIDPKLFS-NDVMQGTIFD 263
Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRG 340
SGT + +L EAY+A V + + P LC + K FP V +F G
Sbjct: 264 SGTTLAYLPDEAYQAFTQAVSLVVA---------PFLLCDTRLSRFIYKLFPNVVLYFEG 314
Query: 341 GAKLALEKDSMFYQ---PRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
+ + + Q +CM + +++ G + + V YD+ R +
Sbjct: 315 ASMTLTPAEYLIRQASAANAPIWCMGWQSMGSAESELQYTIFGDLVLKNKLVVYDLERGR 374
Query: 398 KTFQRMDCEVL 408
++ DC+ L
Sbjct: 375 IGWRPFDCKFL 385
>gi|356498711|ref|XP_003518193.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 466
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 108/450 (24%), Positives = 195/450 (43%), Gaps = 60/450 (13%)
Query: 2 IHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLF 61
+H + + + + + +P H ++ ++ SI R +L ++H ++ P + + +
Sbjct: 31 LHLSPLFTNHPSSSSHPFHTLKLAVSTSITRAHHL----KNHKPNKSLETPVHPKTYGGY 86
Query: 62 YVNISIGQPPVPQFTAMDTGSSLLWVHC---YPCRDCSPQLGT-IFYPSRSSSYAYVPCD 117
+++ G P +DTGS+L+W+ C Y C C+ T F P SSS +V C
Sbjct: 87 SIDLEFGTPSQTFPFVLDTGSTLVWLPCSSHYLCSKCNSFSNTPKFIPKNSSSSKFVGCT 146
Query: 118 SEHCRYF---------------PYARCSAYKHRC-IYTQLYLIGPETSVFVSTEQLTFKN 161
+ C + + CS C YT Y +G T+ F+ +E L F
Sbjct: 147 NPKCAWVFGPDVKSHCCRQDKAAFNNCS---QTCPAYTVQYGLG-STAGFLLSENLNFP- 201
Query: 162 TDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGS--LHDPD 218
T D + GC S ++ +GI G G G SL SQ+N + FSYC+ S D
Sbjct: 202 ----TKKYSDFLLGC--SVVSVYQPAGIAGFGRGEESLPSQMNLTRFSYCLLSHQFDDSA 255
Query: 219 YLHNKLILGDGAIIDEGDATPLQF--------------IDGHYYITLEAISVDGRMLDIN 264
+ + L+L + A +G + + +YYITL+ I V + + +
Sbjct: 256 TITSNLVL-ETASSRDGKTNGVSYTPFLKNPTTKKNPAFGAYYYITLKRIVVGEKRVRVP 314
Query: 265 PNIFKRD-DSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL--CYH 321
+ + + D GG ++DSG+ T++ + ++ + E ++ + R L C+
Sbjct: 315 RRLLEPNVDGDGGFIVDSGSTFTFMERPIFDLVAQEFAKQVSYTRAREAEKQFGLSPCFV 374
Query: 322 GIMSSDLKGFPTVRFHFRGGAKLALEKDSMF-YQPRPDAFCMAV---NPASINNRYVNFS 377
++ FP +RF FRGGAK+ L + F + D C+ + + A
Sbjct: 375 LAGGAETASFPELRFEFRGGAKMRLPVANYFSLVGKGDVACLTIVSDDVAGSGGTVGPAV 434
Query: 378 LIGMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
++G QQ + V YD+ ++ F+ C+
Sbjct: 435 ILGNYQQQNFYVEYDLENERFGFRSQSCQT 464
>gi|18409620|ref|NP_566966.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|13430562|gb|AAK25903.1|AF360193_1 unknown protein [Arabidopsis thaliana]
gi|4886277|emb|CAB43423.1| putative protein [Arabidopsis thaliana]
gi|14532764|gb|AAK64083.1| unknown protein [Arabidopsis thaliana]
gi|15450892|gb|AAK96717.1| Unknown protein [Arabidopsis thaliana]
gi|30387567|gb|AAP31949.1| At3g52500 [Arabidopsis thaliana]
gi|332645431|gb|AEE78952.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 469
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 125/464 (26%), Positives = 197/464 (42%), Gaps = 85/464 (18%)
Query: 8 LSPYNNPNENPAH---RVQRGINISIARLAYL---------QEKIRSHNTYQAQIL--PS 53
LSP+++ +++P ++R SIAR L ++ + S T A ++ P
Sbjct: 23 LSPFSHSDQSPKDPYLSLRRLAESSIARAHKLKHGTSIKPDEDALSSTTTASATVVKSPL 82
Query: 54 NDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC---YPCRDC-----SPQLGTIFYP 105
+ S + V++S G P DTGSSL+W+ C Y C C P L F P
Sbjct: 83 SAKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVWLPCTSRYLCSGCDFSGLDPTLIPRFIP 142
Query: 106 SRSSSYAYVPCDSEHCRYF--PYARCSA---YKHRCI-----YTQLYLIGPETSVFVSTE 155
SSS + C S C++ P +C C Y Y +G V + TE
Sbjct: 143 KNSSSSKIIGCQSPKCQFLYGPNVQCRGCDPNTRNCTVGCPPYILQYGLGSTAGVLI-TE 201
Query: 156 QLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSL 214
+L F + + V D V GC + R + +GI G G G SL SQ+N FS+C+ S
Sbjct: 202 KLDFPD-----LTVPDFVVGCSIISTR--QPAGIAGFGRGPVSLPSQMNLKRFSHCLVSR 254
Query: 215 H-DPDYLHNKLILGDGAIIDEGDATP---------------LQFIDGHYYITLEAISVDG 258
D + L L G+ + G TP F++ +YY+ L I V
Sbjct: 255 RFDDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLE-YYYLNLRRIYVGR 313
Query: 259 RMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK 317
+ + I +G GG ++DSG+ T++ + +E + +E QM +Y+
Sbjct: 314 KHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEF-----ASQMSNYTREKD 368
Query: 318 L--------CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMF-YQPRPDAFCMAVNPAS 368
L C++ D+ P + F F+GGAKL L + F + D C+ V
Sbjct: 369 LEKETGLGPCFNISGKGDVT-VPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTV---- 423
Query: 369 INNRYVNFS-------LIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
++++ VN S ++G QQ Y V YD+ + F + C
Sbjct: 424 VSDKTVNPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKC 467
>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
Length = 439
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 106/422 (25%), Positives = 165/422 (39%), Gaps = 31/422 (7%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
L HRD++L P R++ I R + + K S + + D +
Sbjct: 31 LAHRDTLL-------PKPLSRIEDVIGADQKRHSLISRKRNSTVGVKMDLGSGIDYGTAQ 83
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHC-YPCRDCSPQLGTIFYPSRSSSYAYVPCDSE 119
++ I +G P +DTGS L WV+C Y R + +F S S+ V C ++
Sbjct: 84 YFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRARGKDNR--RVFRADESKSFKTVGCLTQ 141
Query: 120 HCR-----YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
C+ F C C Y Y G + E +T T+ + +
Sbjct: 142 TCKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRMARLPGHLI 201
Query: 175 GCGFS-TNRNFKFS-GIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILGD 228
GC S T ++F+ + G+ GL S S S FSYC+ + N LI G
Sbjct: 202 GCSSSFTGQSFQGADGVLGLAFSDFSFTSTATSLYGAKFSYCLVDHLSNKNVSNYLIFGS 261
Query: 229 GAIIDEG--DATPLQF--IDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
TPL I Y I + IS+ MLDI ++ SGGG ++DSGT
Sbjct: 262 SRSTKTAFRRTTPLDLTRIPPFYAINVIGISLGYDMLDIPSQVWDA-TSGGGTILDSGTS 320
Query: 285 VTWLVKEAYEALRDEVMIRL-EGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAK 343
+T L AY+ + + L E ++++ P + C+ ++ P + FH +GGA+
Sbjct: 321 LTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCFSFTSGFNVSKLPQLTFHLKGGAR 380
Query: 344 LALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
+ S P C+ A V IG + QQ Y +D+ +F
Sbjct: 381 FEPHRKSYLVDAAPGVKCLGFVSAGTPATNV----IGNIMQQNYLWEFDLMASTLSFAPS 436
Query: 404 DC 405
C
Sbjct: 437 AC 438
>gi|77553049|gb|ABA95845.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 372
Score = 105 bits (261), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 99/378 (26%), Positives = 169/378 (44%), Gaps = 37/378 (9%)
Query: 48 AQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR----DCSPQLGTIF 103
+ ++ + + + +++ IS+G PPV +DTGS+L WV C C+ D + + G IF
Sbjct: 12 STVIGDDSMRKNKYFMGISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIF 71
Query: 104 YPSRSSSYAYVPCDSEHCR------YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQL 157
P SS+Y+ V C +E C Y C CIY+ Y G + ++ ++L
Sbjct: 72 NPYNSSTYSKVGCSTEACNGMHMDLAVEYG-CVEEDDTCIYSLRYGSGEYSVGYLGKDRL 130
Query: 158 TFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQL-----NSSFSYCIG 212
T S + + +FGCG N +GI G G S +Q+ ++FSYC
Sbjct: 131 TLA----SNRSIDNFIFGCGEDNLYNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCFP 186
Query: 213 SLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH--YYITLEAISVDGRMLDINPNIFKR 270
H+ + L +G A T L + D Y I + V+G L+I+P I+
Sbjct: 187 RDHENE---GSLTIGPYARDINLMWTKLIYYDHKPAYAIQQLDMMVNGIRLEIDPYIYIS 243
Query: 271 DDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD-KLCY-HGIMSSDL 328
+ ++DSGT T+++ ++AL D+ M + + + W + ++C+ S++
Sbjct: 244 KMT----IVDSGTADTYILSPVFDAL-DKAMTKEMQAKGYTRGWDERRICFISNSGSANW 298
Query: 329 KGFPTVRFHF-RGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFY 387
FPTV R KL +E + FY+ + C P R V ++G A + +
Sbjct: 299 NDFPTVEMKLIRSTLKLPVE--NAFYESSNNVICSTFLPDDAGVRGV--QMLGNRAVRSF 354
Query: 388 NVGYDIGRKQKTFQRMDC 405
+ +DI F+ C
Sbjct: 355 KLVFDIQAMNFGFKARAC 372
>gi|218197468|gb|EEC79895.1| hypothetical protein OsI_21423 [Oryza sativa Indica Group]
Length = 471
Score = 105 bits (261), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 103/353 (29%), Positives = 147/353 (41%), Gaps = 32/353 (9%)
Query: 66 SIGQPPVPQFTAMDTGSSLLWVHCYPCR--DCSPQLGTIFYPSRSSSYAYVPCDSEHCRY 123
+I P + Q ++DT L W+ C PC +C PQ +F P RS + A VPC S C
Sbjct: 138 AIDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGE 197
Query: 124 FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRN 183
++C Y Y G TS + LT + ST+ V + FGC + N
Sbjct: 198 LGRYGAGCSNNQCQYFVDYGDGRATSGTYMVDALTL---NPSTV-VMNFRFGCSHAVRGN 253
Query: 184 FKF--SGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDA 237
F SG LG GR SL+SQ ++FSYC+ +L + DG
Sbjct: 254 FSASTSGTMSLGGGRQSLLSQTAATFGNAFSYCVPDPSSSGFL-SLGGPADGGGAGRFAR 312
Query: 238 TPL----QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAY 293
TPL I Y + L I V GR L++ P +F GG ++DS +T L AY
Sbjct: 313 TPLVRNPSIIPTLYLVRLRGIEVGGRRLNVPPVVFA-----GGAVMDSSVIITQLPPTAY 367
Query: 294 EALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMF 352
ALR + ++ CY + + + P V F GGA + L+ +
Sbjct: 368 RALRLAFRSAMAAYPRVAGGRAGLDTCYDFVRFTSVT-VPAVSLVFDGGAVVRLDAMGVM 426
Query: 353 YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ C+A P + IG + QQ + V YD+G F+R C
Sbjct: 427 VE-----GCLAFVPTPGD---FALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 471
>gi|218187618|gb|EEC70045.1| hypothetical protein OsI_00635 [Oryza sativa Indica Group]
Length = 570
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 104/361 (28%), Positives = 162/361 (44%), Gaps = 51/361 (14%)
Query: 53 SNDISNSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG---TIFYPSRS 108
S +S S Y+ +++G PP DTGS L+WV C + + T F PSRS
Sbjct: 92 SKVVSRSFEYLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDPSRS 151
Query: 109 SSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES--- 165
S+Y V C ++ C A C + C Y Y G T+ +STE TF +
Sbjct: 152 STYGRVSCQTDACEALGRATCDDGSN-CAYLYAYGDGSNTTGVLSTETFTFDDGGAGRSP 210
Query: 166 -TIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPD 218
+ + V FGC +T +F G+ GLG G SLV+QL + FSYC+ P
Sbjct: 211 RQVRIGGVKFGCSTATAGSFPADGLVGLGGGAVSLVTQLGGATSLGRRFSYCL----VPH 266
Query: 219 YLHNKLILGDGAIIDEGD----ATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG 274
++ L GA+ D + +TPL G+ + A S
Sbjct: 267 SVNASSALNFGALADVTEPGAASTPLV---GNKTVASAASSR------------------ 305
Query: 275 GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYH--GIMSSDLKGFP 332
+++DSGT +T+L + DE+ R+ ++S +LCY+ G + P
Sbjct: 306 --IIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQSPDGLLQLCYNVAGREVEAGESIP 363
Query: 333 TVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
+ F GGA +AL+ ++ F + C+A+ A+ + V S++G +AQQ +VGYD
Sbjct: 364 DLTLEFGGGAAVALKPENAFVAVQEGTLCLAIV-ATTEQQPV--SILGNLAQQNIHVGYD 420
Query: 393 I 393
+
Sbjct: 421 L 421
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 39/131 (29%), Positives = 67/131 (51%), Gaps = 5/131 (3%)
Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYH--GIMSSDLKGFPTV 334
+++DSGT +T+L + DE+ R+ ++S +LCY+ G + P +
Sbjct: 439 IIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQSPDGLLQLCYNVAGREVEAGESIPDL 498
Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
F GGA +AL+ ++ F + C+A+ A+ + V S++G +AQQ +VGYD+
Sbjct: 499 TLEFGGGAAVALKPENAFVAVQEGTLCLAIV-ATTEQQPV--SILGNLAQQNIHVGYDLD 555
Query: 395 RKQKTFQRMDC 405
TF DC
Sbjct: 556 AGTVTFAVADC 566
>gi|21717169|gb|AAM76362.1|AC074196_20 putative nucleoid DNA binding protein, 3'-partial [Oryza sativa
Japonica Group]
Length = 377
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 88/333 (26%), Positives = 140/333 (42%), Gaps = 40/333 (12%)
Query: 57 SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPC 116
S L+ N +IG PP P +D L+W C PC+ C Q +F P++SS++ +PC
Sbjct: 53 SQGLYVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPC 112
Query: 117 DSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
S C P + + CIY G +T T+ E+ + FGC
Sbjct: 113 GSHLCESIPESSRNCTSDVCIYEAPTKAG-DTGGKAGTDTFAIGAAKET------LGFGC 165
Query: 177 GFSTNRNFKF----SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAI 231
T++ K SGI GLG SLV+Q+N ++FSYC+ L LG A
Sbjct: 166 VVMTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVTAFSYCLAGKS-----SGALFLGATAK 220
Query: 232 -IDEGDATPLQFI------------DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVM 278
+ G + F+ + +Y + L I G L SG V+
Sbjct: 221 QLAGGKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKTGGAPLQ------AASSSGSTVL 274
Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHF 338
+D+ + ++L AY+AL+ + + + + S P LC+ ++ D P + F F
Sbjct: 275 LDTVSRASYLADGAYKALKKALTAAVGVQPVASPPKPYDLCFPKAVAGDA---PELVFTF 331
Query: 339 RGGAKLALEKDSMFYQPRPDAFCMAV-NPASIN 370
GGA L + + C+ + + AS+N
Sbjct: 332 DGGAALTVPPANYLLASGNGTVCLTIGSSASLN 364
>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 496
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 98/351 (27%), Positives = 152/351 (43%), Gaps = 39/351 (11%)
Query: 78 MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYF-PYARCSAY---- 132
+DTGS L WV C PCR C Q +F PS SSS+ +PC+S C P A S
Sbjct: 160 VDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNK 219
Query: 133 -KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF-KFSGIF 190
C Y Y G + + E+LT T+ + + +FGCG + F SG+
Sbjct: 220 NSTSCDYQIDYGDGSYSRGELGFEKLTLGKTE-----IDNFIFGCGRNNKGLFGGASGLM 274
Query: 191 GLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQF---- 242
GL SLVSQ S FSYC+ + L LG + + +P+ +
Sbjct: 275 GLARSELSLVSQTSSLFGSVFSYCLPTTGVGS--SGSLTLGGADFSNFKNISPISYTRMI 332
Query: 243 ----IDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV--MIDSGTDVTWLVKEAYEAL 296
+ Y++ L IS+ G L++ R S GV ++DSGT +T L Y+A
Sbjct: 333 QNPQMSNFYFLNLTGISIGGVNLNV-----PRLSSNEGVLSLLDSGTVITRLSPSIYKAF 387
Query: 297 RDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPR 356
+ E + G + C++ + + PTV+F F G A++ ++ + +FY +
Sbjct: 388 KAEFEKQFSGYRTTPGFSILNTCFN-LTGYEEVNIPTVKFIFEGNAEMIVDVEGVFYFVK 446
Query: 357 PDA--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
DA C+A ++ + +IG Q+ V Y+ + F C
Sbjct: 447 SDASQICLAFASLGYEDQTM---IIGNYQQKNQRVIYNSKESKVGFAGEPC 494
>gi|357160409|ref|XP_003578755.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 373
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 99/391 (25%), Positives = 169/391 (43%), Gaps = 44/391 (11%)
Query: 40 IRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQ 98
+++ N + ++ + I + F++ IS+G P V +DTGS++ WV C C C Q
Sbjct: 2 VQAANIPDSAVIGDDSIRKNQFFMGISLGTPAVFNLVTIDTGSTISWVQCQYCIVHCYTQ 61
Query: 99 ---LGTIFYPSRSSSYAYVPCDSEHCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSV 150
G F S SS+Y V C ++ C ++ C + CIY+ Y G ++
Sbjct: 62 DQRAGPTFNTSSSSTYRRVGCSAQVCHDMHVSQNIPSGCVEEEDSCIYSLRYASGEYSAG 121
Query: 151 FVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-----S 205
++S ++LT N + +Q +FGCG N +GI G G S +Q+ S
Sbjct: 122 YLSQDRLTLAN----SYSIQKFIFGCGSDNRYNGHSAGIIGFGNKSYSFFNQIAQLTNYS 177
Query: 206 SFSYCIGSLHDPDYL---------HNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISV 256
+FSYC S + + NKLIL + D G P+ Y + + V
Sbjct: 178 AFSYCFPSNQENEGFLSIGPYVRDSNKLILTQ--LFDYGAHLPV------YALQQFDMMV 229
Query: 257 DGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD 316
+G L ++P ++ + ++DSGT T+++ + AL + + E S
Sbjct: 230 NGMRLQVDPPVYTTRMT----VVDSGTVETFVLSPVFRALDRALTKAMVAEGYVRGSDSK 285
Query: 317 KLCYHGIMSS-DLKGFPTVRFHFRGGAKLALEKDSMFYQPRPD-AFCMAVNPASINNRYV 374
++C+H S D P V F + L L +++FY D + C P V
Sbjct: 286 EICFHSNGDSVDWSKLPVVEIKF-SRSILKLPAENVFYYETSDGSICSTFQPDDAGVPGV 344
Query: 375 NFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
++G A + + V +DI ++ F+ C
Sbjct: 345 Q--ILGNRATRSFRVVFDIQQRNFGFEAGAC 373
>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 417
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 91/309 (29%), Positives = 138/309 (44%), Gaps = 36/309 (11%)
Query: 78 MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYF-PYARCSAY---- 132
+DTGS L WV C PCR C Q +F PS SSS+ +PC+S C P A S
Sbjct: 81 VDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNK 140
Query: 133 -KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF-KFSGIF 190
C Y Y G + + E+LT T+ + + +FGCG + F SG+
Sbjct: 141 NSTSCDYQIDYGDGSYSRGELGFEKLTLGKTE-----IDNFIFGCGRNNKGLFGGASGLM 195
Query: 191 GLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQF---- 242
GL SLVSQ S FSYC+ + L LG + + +P+ +
Sbjct: 196 GLARSELSLVSQTSSLFGSVFSYCLPTTGVGS--SGSLTLGGADFSNFKNISPISYTRMI 253
Query: 243 ----IDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV--MIDSGTDVTWLVKEAYEAL 296
+ Y++ L IS+ G L++ R S GV ++DSGT +T L Y+A
Sbjct: 254 QNPQMSNFYFLNLTGISIGGVNLNV-----PRLSSNEGVLSLLDSGTVITRLSPSIYKAF 308
Query: 297 RDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPR 356
+ E + G + C++ + + PTV+F F G A++ ++ + +FY +
Sbjct: 309 KAEFEKQFSGYRTTPGFSILNTCFN-LTGYEEVNIPTVKFIFEGNAEMIVDVEGVFYFVK 367
Query: 357 PDA--FCMA 363
DA C+A
Sbjct: 368 SDASQICLA 376
>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 683
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 101/377 (26%), Positives = 163/377 (43%), Gaps = 54/377 (14%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
N + + IG PP +DTGS++ +V C C C F P SS+Y V C
Sbjct: 78 NGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPDLSSTYQPVKCT 137
Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
+ C + +C+Y + Y +S + + ++F N +S + Q VFGC
Sbjct: 138 LD-------CNCDNDRMQCVYERQYAEMSTSSGVLGEDVVSFGN--QSELAPQRAVFGCE 188
Query: 178 FSTNRNF---KFSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLILGD 228
+ GI GLG G S++ QL + SFS C G + +G
Sbjct: 189 NVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMD----------VGG 238
Query: 229 GAIIDEGDATPLQFI--------DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
GA++ G + P + +Y I L+ I V G+ L +NP++F D G ++D
Sbjct: 239 GAMVLGGISPPSDMVFAQSDPVRSPYYNIDLKEIHVAGKRLPLNPSVF---DGKHGSVLD 295
Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD----KLCYHGI---MSSDLKGFPT 333
SGT +L +EA+ A ++ ++ L + S PD LC+ G +S K FP
Sbjct: 296 SGTTYAYLPEEAFLAFKEAIVKEL--QSFSQISGPDPNYNDLCFSGAGIDVSQLSKTFPV 353
Query: 334 VRFHFRGGAKLALEKDS-MFYQPR-PDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
V F G K +L ++ MF + A+C+ + N +L+G + + V Y
Sbjct: 354 VDMIFGNGHKYSLSPENYMFRHSKVRGAYCLGI----FQNGKDPTTLLGGIVVRNTLVLY 409
Query: 392 DIGRKQKTFQRMDCEVL 408
D + + F + +C L
Sbjct: 410 DREQTKIGFWKTNCAEL 426
>gi|356537161|ref|XP_003537098.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 601
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 110/445 (24%), Positives = 188/445 (42%), Gaps = 74/445 (16%)
Query: 15 NENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQ 74
N +P H +Q ++ SI R +L ++HN + + + + +++ G PP
Sbjct: 174 NSHPFHTLQLAVSTSITRAHHL----KNHNNPSSLKTLVHPKTYGGYSIDLKFGTPPQTF 229
Query: 75 FTAMDTGSSLLWVHCYP---CRDC---SPQLGTIFYPSRSSSYAYVPCDSEHCRYF---- 124
+DTGSSL+W+ CY C C S F P S S +V C + C +
Sbjct: 230 PFVLDTGSSLVWLPCYSHYLCSKCNSFSNNNTPKFIPKDSFSSKFVGCRNPKCAWVFGSD 289
Query: 125 -----------PYARCSAYKHRC-IYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
++ + C YT Y +G T+ F+ +E L F + V D
Sbjct: 290 VTSHCCKLAKAAFSNNNNCSQTCPAYTVQYGLG-STAGFLLSENLNFPAKN-----VSDF 343
Query: 173 VFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAI 231
+ GC S ++ GI G G G SL +Q+N + FSYC+ S + N ++ +
Sbjct: 344 LVGC--SVVSVYQPGGIAGFGRGEESLPAQMNLTRFSYCLLSHQFDESPENSDLVMEATN 401
Query: 232 IDEGD--------------ATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GG 276
EG +T +YYITL I V + + + + + D +G GG
Sbjct: 402 SGEGKKTNGVSYTAFLKNPSTKKPAFGAYYYITLRKIVVGEKRVRVPRRMLEPDVNGDGG 461
Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL--CYHGIMSSDLKGFPTV 334
++DSG+ +T++ + ++ + +E + ++ + R L C+ ++ FP +
Sbjct: 462 FIVDSGSTLTFMERPIFDLVAEEFVKQVNYTRARELEKQFGLSPCFVLAGGAETASFPEM 521
Query: 335 RFHFRGGAKLALEKDSMFYQ-PRPDAFCM------------AVNPASINNRYVNFSLIGM 381
RF FRGGAK+ L + F + + D C+ AV PA I +G
Sbjct: 522 RFEFRGGAKMRLPVANYFSRVGKGDVACLTIVSDDVAGQGGAVGPAVI---------LGN 572
Query: 382 MAQQFYNVGYDIGRKQKTFQRMDCE 406
QQ + V D+ ++ F+ C+
Sbjct: 573 YQQQNFYVECDLENERFGFRSQSCQ 597
>gi|296085499|emb|CBI29231.3| unnamed protein product [Vitis vinifera]
Length = 308
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 100/379 (26%), Positives = 161/379 (42%), Gaps = 89/379 (23%)
Query: 40 IRSHNTYQAQILPSNDISNSL------FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR 93
I+++NT Q+ NDI +++ + +NIS+G PPV DTGS L+W C PC
Sbjct: 3 IQTNNTGN-QLASPNDIQSNVISGGGSYLMNISLGTPPVSMLGIADTGSDLIWRQCLPCD 61
Query: 94 DCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVS 153
DC Q+ +F P +S +Y +T ++S
Sbjct: 62 DCYKQVEPLFDPKKSKTY-----------------------------------KTLGYLS 86
Query: 154 TEQLTFKNTDESTIHVQDVVFGCGFSTNRNF--KFSGIFGLGIGRSSLVSQLNSS----F 207
+E T +T+ + FGCG S F K SG+ GLG G SLV QL+S F
Sbjct: 87 SETFTIGSTEGDPASFPGLAFGCGHSNGGTFNEKDSGLIGLGGGPLSLVMQLSSKVGGQF 146
Query: 208 SYCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPN 266
SYC+ L +K+ G A++ G ++P + +
Sbjct: 147 SYCLVPLSSDSTASSKINFGKSAVVSGSGTSSPAAAEESN-------------------- 186
Query: 267 IFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS 326
++IDSGT +T L ++ Y + + + G+ LCY G+
Sbjct: 187 ----------IIIDSGTTLTLLPRDFYTDMESALTKVIGGQTTTDPRGTFSLCYSGVKKL 236
Query: 327 DLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQF 386
++ PT+ HF GA + L + F Q + D C ++ P+S N ++ G ++Q
Sbjct: 237 EI---PTITAHFI-GADVQLPPLNTFVQAQEDLVCFSMIPSS------NLAIFGNLSQMN 286
Query: 387 YNVGYDIGRKQKTFQRMDC 405
+ VGYD+ + +F+ DC
Sbjct: 287 FLVGYDLKNNKVSFKPTDC 305
>gi|449518783|ref|XP_004166415.1| PREDICTED: aspartic proteinase-like protein 2-like, partial
[Cucumis sativus]
Length = 420
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 90/346 (26%), Positives = 155/346 (44%), Gaps = 36/346 (10%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLGTIFYP---SRSSSYAYV 114
L+Y I IG P + +DTGS ++WV+C CR+C + LG P S++ V
Sbjct: 86 LYYAKIGIGTPSKDYYVQVDTGSDIVWVNCIQCRECPRTSSLGMELTPYDLEESTTGKLV 145
Query: 115 PCDSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD---ESTIH 168
CD + C P + C+ C Y Q+Y G T+ + + + + E+T
Sbjct: 146 SCDEQFCLEVNGGPLSGCTT-NMSCPYLQIYGDGSSTAGYFVKDYVQYNRVSGDLETTAA 204
Query: 169 VQDVVFGCG------FSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHD 216
+ FGCG ++ GI G G SS++SQL S+ F++C+
Sbjct: 205 NGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCL----- 259
Query: 217 PDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
D + I G ++ + + TPL HY + + + V +L+I+ ++F+ D
Sbjct: 260 -DGTNGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVGHIILNISADVFEAGDR-K 317
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
G +IDSGT + +L + YE L +++ + ++++ K C+ D GFP V
Sbjct: 318 GTIIDSGTTLAYLPELIYEPLVAKILSQQHNLEVQTIHGEYK-CFQYSERVD-DGFPPVI 375
Query: 336 FHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR-YVNFSLIG 380
FHF L + +Q + +C+ + + +R N +L G
Sbjct: 376 FHFENSLLLKVYPHEYLFQ-YENLWCIGWQNSGMQSRDRKNVTLFG 420
>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 663
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 99/377 (26%), Positives = 162/377 (42%), Gaps = 54/377 (14%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
N + + IG PP +DTGS++ +V C C C F P SS+Y V C
Sbjct: 109 NGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPESSSTYQPVKCT 168
Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
+ C + +C+Y + Y +S + + ++F N +S + Q VFGC
Sbjct: 169 ID-------CNCDGDRMQCVYERQYAEMSTSSGVLGEDVISFGN--QSELAPQRAVFGCE 219
Query: 178 FSTNRNF---KFSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLILGD 228
+ GI GLG G S++ QL + SFS C G + +G
Sbjct: 220 NVETGDLYSQHADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMD----------VGG 269
Query: 229 GAIIDEGDATPLQFI--------DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
GA++ G + P +Y I L+ + V G+ L +N N+F D G ++D
Sbjct: 270 GAMVLGGISPPSDMTFAYSDPDRSPYYNIDLKEMHVAGKRLPLNANVF---DGKHGTVLD 326
Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK----LCYHGI---MSSDLKGFPT 333
SGT +L + A+ A +D ++ L + ++ S PD +C+ G +S K FP
Sbjct: 327 SGTTYAYLPEAAFLAFKDAIVKEL--QSLKQISGPDPNYNDICFSGAGNDVSQLSKSFPV 384
Query: 334 VRFHFRGGAKLALEKDS-MFYQPR-PDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
V F G K +L ++ MF + A+C+ + N +L+G + + V Y
Sbjct: 385 VDMVFGNGHKYSLSPENYMFRHSKVRGAYCLGI----FQNGNDQTTLLGGIIVRNTLVMY 440
Query: 392 DIGRKQKTFQRMDCEVL 408
D + + F + +C L
Sbjct: 441 DREQTKIGFWKTNCAEL 457
>gi|56784779|dbj|BAD82000.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 486
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 97/361 (26%), Positives = 152/361 (42%), Gaps = 26/361 (7%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC---SPQLGTI--FYPSRSSSYA 112
++ ++ S+G PP +D S +W+ C C C +P + FY SS+
Sbjct: 94 TGMYVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATCGADAPAATSAPPFYAFLSSTIR 153
Query: 113 YVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPE--TSVFVSTEQLTFKNTDESTIHVQ 170
V C + C+ CSA C Y+ +Y G T+ ++ + F +T+
Sbjct: 154 EVRCANRGCQRLVPQTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAF-----ATVRAD 208
Query: 171 DVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDG 229
V+FGC +T + G+ GLG G S VSQL FSY + D L L D
Sbjct: 209 GVIFGCAVATEGD--IGGVIGLGRGELSPVSQLQIGRFSYYLAPDDAVDVGSFILFLDDA 266
Query: 230 AI-IDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTD 284
+TPL + YY+ L I VDG L I F + D GGV++
Sbjct: 267 KPRTSRAVSTPLVASRASRSLYYVELAGIRVDGEDLAIPRGTFDLQADGSGGVVLSITIP 326
Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKL 344
VT+L AY+ +R + ++E LCY + K P++ F GGA +
Sbjct: 327 VTFLDAGAYKVVRQAMASKIELRAADGSELGLDLCYTSESLATAK-VPSMALVFAGGAVM 385
Query: 345 ALEKDSMFYQPRPDAF-CMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
LE + FY C+ + P+ + SL+G + Q ++ YDI + F+ +
Sbjct: 386 ELEMGNYFYMDSTTGLECLTILPSPAGDG----SLLGSLIQVGTHMIYDISGSRLVFESL 441
Query: 404 D 404
+
Sbjct: 442 E 442
>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
Length = 469
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 107/383 (27%), Positives = 171/383 (44%), Gaps = 59/383 (15%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLGT---IFYPSRSSSYAYV 114
L++ + +G PP +DTGS +LWV C C +C S LG F S + V
Sbjct: 99 LYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSV 158
Query: 115 PCDSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
C C A+CS ++C Y+ Y G TS + T+ F ++
Sbjct: 159 TCSDPICSSVFQTTAAQCSE-NNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANS 217
Query: 172 ---VVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDP 217
+VFGC G T + GIFG G G+ S+VSQL+S FS+C L
Sbjct: 218 SAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHC---LKGD 274
Query: 218 DYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
+LG+ I+ G +PL HY + L +I V+G+ML ++ +F+ ++ G
Sbjct: 275 GSGGGVFVLGE--ILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNT-RG 331
Query: 277 VMIDSGTDVTWLVKEAYEALRDEVM---------IRLEGEQMRSYSWPDKLCYHGIMSSD 327
++D+GT +T+LVKEAY+ + + I GEQ CY + +S
Sbjct: 332 TIVDTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNGEQ----------CYL-VSTSI 380
Query: 328 LKGFPTVRFHFRGGAKLALE-KDSMFYQPRPDA---FCMAVNPASINNRYVNFSLIGMMA 383
FP+V +F GGA + L +D +F+ D +C+ A +++G +
Sbjct: 381 SDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQ-----TILGDLV 435
Query: 384 QQFYNVGYDIGRKQKTFQRMDCE 406
+ YD+ R++ + DC+
Sbjct: 436 LKDKVFVYDLARQRIGWASYDCK 458
>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 512
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 107/384 (27%), Positives = 171/384 (44%), Gaps = 59/384 (15%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLGT---IFYPSRSSSYAYV 114
L++ + +G PP +DTGS +LWV C C +C S LG F S + V
Sbjct: 104 LYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSV 163
Query: 115 PCDSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
C C A+CS ++C Y+ Y G TS + T+ F ++
Sbjct: 164 TCSDPICSSVFQTTAAQCSE-NNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANS 222
Query: 172 ---VVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDP 217
+VFGC G T + GIFG G G+ S+VSQL+S FS+C L
Sbjct: 223 SAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHC---LKGD 279
Query: 218 DYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
+LG+ I+ G +PL HY + L +I V+G+ML ++ +F+ ++ G
Sbjct: 280 GSGGGVFVLGE--ILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNT-RG 336
Query: 277 VMIDSGTDVTWLVKEAYEALRDEVM---------IRLEGEQMRSYSWPDKLCYHGIMSSD 327
++D+GT +T+LVKEAY+ + + I GEQ CY + +S
Sbjct: 337 TIVDTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNGEQ----------CYL-VSTSI 385
Query: 328 LKGFPTVRFHFRGGAKLALE-KDSMFYQPRPDA---FCMAVNPASINNRYVNFSLIGMMA 383
FP+V +F GGA + L +D +F+ D +C+ A +++G +
Sbjct: 386 SDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQ-----TILGDLV 440
Query: 384 QQFYNVGYDIGRKQKTFQRMDCEV 407
+ YD+ R++ + DC +
Sbjct: 441 LKDKVFVYDLARQRIGWASYDCSM 464
>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
Length = 473
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 109/394 (27%), Positives = 169/394 (42%), Gaps = 40/394 (10%)
Query: 32 RLAYLQEKIRSHNTYQAQ--ILP---SNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLW 86
R+ + ++ SH +Q + LP I + + V + +G P DTGS L W
Sbjct: 99 RVDSIHARLSSHGVFQEKQATLPVQSGASIGSGDYAVTVGLGTPKKEFTLIFDTGSDLTW 158
Query: 87 VHCYPC-RDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPY---ARCSAYKHRCIYTQLY 142
C PC + C Q P++S+SY + C S C+ CS+ C+Y Y
Sbjct: 159 TQCEPCAKTCYKQKEPRLDPTKSTSYKNISCSSAFCKLLDTEGGESCSS--PTCLYQVQY 216
Query: 143 LIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFK-FSGIFGLGIGRSSLVS 201
G + F +TE LT +++ ++ +FGCG + F+ +G+ GLG + SL S
Sbjct: 217 GDGSYSIGFFATETLTLSSSNV----FKNFLFGCGQQNSGLFRGAAGLLGLGRTKLSLPS 272
Query: 202 QLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEG-DATPLQ--FIDGHYY-ITLEA 253
Q FSYC+ P +K L G + + TPL F +Y + +
Sbjct: 273 QTAQKYKKLFSYCL-----PASSSSKGYLSFGGQVSKTVKFTPLSEDFKSTPFYGLDITE 327
Query: 254 ISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVM-IRLEGEQMRSY 312
+SV G L I+ +IF S G +IDSGT +T L AY AL + + Y
Sbjct: 328 LSVGGNKLSIDASIF----STSGTVIDSGTVITRLPSTAYSALSSAFQKLMTDYPSTDGY 383
Query: 313 SWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP-DAFCMAVNPASINN 371
S D CY + +K P V F+GG ++ ++ + Y C+A + N
Sbjct: 384 SIFDT-CYDFSKNETIK-IPKVGVSFKGGVEMDIDVSGILYPVNGLKKVCLAF---AGNG 438
Query: 372 RYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
V ++ G Q+ Y V YD + + F C
Sbjct: 439 DDVKAAIFGNTQQKTYQVVYDDAKGRVGFAPSGC 472
>gi|218186446|gb|EEC68873.1| hypothetical protein OsI_37494 [Oryza sativa Indica Group]
Length = 353
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 99/365 (27%), Positives = 163/365 (44%), Gaps = 37/365 (10%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR----DCSPQLGTIFYPSRSSSYAYVPC 116
+++ IS+G PPV +DTGS+L WV C C+ D + + G IF P SS+Y+ V C
Sbjct: 6 YFMGISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSKVGC 65
Query: 117 DSEHCR------YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
+E C Y C CIY+ Y G + ++ ++LT S +
Sbjct: 66 STEACNGMHMDLAVEYG-CVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLA----SNRSID 120
Query: 171 DVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQL-----NSSFSYCIGSLHDPDYLHNKLI 225
+ +FGCG N +GI G G S +Q+ ++FSYC H+ + L
Sbjct: 121 NFIFGCGEDNLYNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPRDHENE---GSLT 177
Query: 226 LGDGAIIDEGDATPLQFIDGH--YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGT 283
+G A T L + D Y I + V+G L+I+P I+ + ++DSGT
Sbjct: 178 IGPYARDINLMWTKLIYYDHKPAYAIQQLDMMVNGIRLEIDPYIYISKMT----IVDSGT 233
Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD-KLCY-HGIMSSDLKGFPTVRFHF-RG 340
T+++ ++AL D+ M + + + W + ++C+ S++ FPTV R
Sbjct: 234 ADTYILSPVFDAL-DKAMTKEMQAKGYTRGWDERRICFISNSGSANWNDFPTVEMKLIRS 292
Query: 341 GAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
KL +E + FY+ + C P R V ++G A + + + +DI F
Sbjct: 293 TLKLPVE--NAFYESSNNVICSTFLPDDAGVRGV--QMLGNRAVRSFKLVFDIQAMNFGF 348
Query: 401 QRMDC 405
+ C
Sbjct: 349 KARAC 353
>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 461
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 118/428 (27%), Positives = 173/428 (40%), Gaps = 51/428 (11%)
Query: 1 LIHRDSILSPYNNPN----ENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDI 56
L HR SP E HR Q R AY+Q K + S+
Sbjct: 62 LHHRHGPCSPLPTKKMPTLEETLHRDQL-------RAAYIQRKFSGGGGAGGDVQRSDAT 114
Query: 57 S--------NSLFY-VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSR 107
N+L Y + + +G P Q +DTGS + WV C PC C Q +F PS
Sbjct: 115 VPTALGTSLNTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSS 174
Query: 108 SSSYAYVPCDSEHCRYFPYA--RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES 165
SS+Y+ C S C CS+ +C Y Y G T+ S++ L ++
Sbjct: 175 SSTYSPFSCGSAACAQLGQEGNGCSS-SSQCQYIVTYGDGSSTTGTYSSDTLALGSS--- 230
Query: 166 TIHVQDVVFGCG-FSTNRNFKFSGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYL 220
V+ FGC + N + G+ GLG G SLVSQ L +FSYC+
Sbjct: 231 --AVKSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGF 288
Query: 221 HNKLILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
L G+ TP+ + Y + L+AI V GR L I ++F G
Sbjct: 289 LT-LGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFS-----AGT 342
Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFH 337
++DSGT +T L AY AL ++ S C+ S + P+V
Sbjct: 343 VMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVS-IPSVALV 401
Query: 338 FRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
F GGA ++L+ + + C+A + N+ + +IG + Q+ + V YD+GR
Sbjct: 402 FSGGAVVSLDASGIIL-----SNCLAF---AANSDDSSLGIIGNVQQRTFEVLYDVGRGV 453
Query: 398 KTFQRMDC 405
F+ C
Sbjct: 454 VGFRAGAC 461
>gi|326501496|dbj|BAK02537.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 105/387 (27%), Positives = 151/387 (39%), Gaps = 48/387 (12%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCY-PCRDCSPQLGT---IFYPSRSSSYAYVPC 116
++V +G P P DTGS L WV C P + S F P S ++A + C
Sbjct: 94 YFVRFRVGTPAQPFLLVADTGSDLTWVKCRRPAANSSESGSGSGRAFRPEDSRTWAPISC 153
Query: 117 DSEHCRY---FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTF----KNTDESTIHV 169
S+ C F A C C Y Y G V TE T + +E +
Sbjct: 154 ASDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSGRGREERKAKL 213
Query: 170 QDVVFGCGFS-TNRNFKFS-GIFGLGIGRSSLVSQLNSSF----SYCIGSLHDPDYLHNK 223
+ +V GC S T +F+ S G+ LG S S S F SYC+ P +
Sbjct: 214 KGLVLGCTSSYTGPSFEVSDGVLSLGYSDVSFASHAASRFAGRFSYCLVDHLSPRNATSY 273
Query: 224 LILGDGAIIDEGDA----------------------TPLQF---IDGHYYITLEAISVDG 258
L G + + TPL + Y + ++A+SV G
Sbjct: 274 LTFGPNPAVASSSSPSSPAPASCTAAAPRPRPRARQTPLLLDRRMRPFYDVAVKAVSVAG 333
Query: 259 RMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL 318
+ L I P D+GGGV++DSGT +T L K AY A+ + L G + P +
Sbjct: 334 QFLKI-PRAVWDVDAGGGVILDSGTSLTVLAKPAYRAVVAALSEGLAGLPRVTMD-PFEY 391
Query: 319 CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSL 378
CY+ S P + HF G A+L S P C+ + + S+
Sbjct: 392 CYNWTSPSGDVTLPKMAVHFAGAARLEPPGKSYVIDAAPGVKCIGLQ----EGPWPGISV 447
Query: 379 IGMMAQQFYNVGYDIGRKQKTFQRMDC 405
IG + QQ + +DI ++ FQR C
Sbjct: 448 IGNILQQEHLWEFDIKNRRLKFQRSRC 474
>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 458
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 89/359 (24%), Positives = 148/359 (41%), Gaps = 23/359 (6%)
Query: 67 IGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTI-FYPSRSSSYAYVPCDSEHCRYFP 125
+G PP A+D + WV C C C+P + F P++SS+Y V C + C P
Sbjct: 106 LGTPPQTLLVAIDPSNDAAWVPCSACLGCAPGASSPSFDPTQSSTYRPVRCGAPQCAQVP 165
Query: 126 YA--RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC---GFST 180
A C A L + + L+ +++ + + FGC +
Sbjct: 166 PATPSCPAGPGASCAFNLSYASSTLHAVLGQDALSLSDSNGAAVPDDHYTFGCLRVVTGS 225
Query: 181 NRNFKFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEGD 236
+ G+ G G G S +SQ ++ FSYC+ S ++ L LG
Sbjct: 226 GGSVPPQGLVGFGRGPLSFLSQTKATYGSIFSYCLPSYKSSNF-SGTLRLGPAGQPRRIK 284
Query: 237 ATPLQFIDGH----YYITLEAISVDGRMLDINPNIFKRDDSG--GGVMIDSGTDVTWLVK 290
TPL + H YY+ + + V+G+ + I + D + GG ++D+GT T L
Sbjct: 285 TTPL-LSNPHRPSLYYVAMVGVRVNGKAVPIPASALALDAATGRGGTIVDAGTMFTRLSP 343
Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL-EKD 349
AY ALR+ + + D CY+ + K P V F F GGA++ L E++
Sbjct: 344 PAYAALRNAFRRGVSAPAAPALGGFDT-CYY---VNGTKSVPAVAFVFAGGARVTLPEEN 399
Query: 350 SMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
+ C+A+ + +++ M QQ + V +D+G + F R C +
Sbjct: 400 VVISSTSGGVACLAMAAGPSDGVNAGLNVLASMQQQNHRVVFDVGNGRVGFSRELCTAV 458
>gi|357481195|ref|XP_003610883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512218|gb|AES93841.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 315
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 93/310 (30%), Positives = 143/310 (46%), Gaps = 30/310 (9%)
Query: 112 AYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
A+ CDS C CS K RC YT Y T ++ + TF + + +
Sbjct: 17 AHNSCDSPLCHKLDTGVCSPEK-RCNYTYGYGDNSLTKGVLAQDTATFTSNTGKLVSLSR 75
Query: 172 VVFGCGFSTNRNFK--FSGIFGLGIGRSSLVSQL-----NSSFSYCIGSLHDPDYLHNKL 224
+FGCG + F G+ GLG G +SL+SQ+ FS C+ + +++
Sbjct: 76 FLFGCGHNNTGGFNDHEMGLIGLGGGPTSLISQIGPLFGGKKFSQCLVPFLTDIKISSRM 135
Query: 225 ILGDGAII--DEGDATPLQFID---GHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
G G+ + D TPL + Y++TL ISV+ L +N I K G +++
Sbjct: 136 SFGKGSQVLGDGVVTTPLVQREQDMTSYFVTLLGISVEDTYLPMNSTIEK-----GNMLV 190
Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS-YSWPDKLCYHGIMSSDLKGFPTVRFHF 338
DSGT L ++ Y+ + EV + E + + S +LCY ++LKG PT+ +HF
Sbjct: 191 DSGTPPNILPQQLYDRVYVEVKNNVPLELITNDPSLGPQLCYR--TQTNLKG-PTLTYHF 247
Query: 339 RGGAKLALEKDSMFYQPRPDA---FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGR 395
GA L L F P P+ FC+A+N N N + G AQ Y +G+D+ R
Sbjct: 248 E-GANLLLTPIQTFIPPTPETKGVFCLAIN----NYTNSNGGVYGNFAQSNYLIGFDLDR 302
Query: 396 KQKTFQRMDC 405
+ +F+ DC
Sbjct: 303 QVVSFKATDC 312
>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 105/375 (28%), Positives = 168/375 (44%), Gaps = 42/375 (11%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSP------QLGTIFYPSRSSSYAY 113
L+Y + +G PP +DTGS +LWV C C C QL + F P SSS +
Sbjct: 83 LYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQL-SFFDPGVSSSASL 141
Query: 114 VPCDSEHC--RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
V C C + + CS + C Y+ Y G TS F ++ ++F ST+ +
Sbjct: 142 VSCSDRRCYSNFQTESGCSP-NNLCSYSFKYGDGSGTSGFYISDFMSFDTVITSTLAINS 200
Query: 172 ---VVFGCGFSTNRNFK-----FSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDP 217
VFGC + + GIFGLG G S++SQL FS+C L
Sbjct: 201 SAPFVFGCSNLQTGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHC---LKGD 257
Query: 218 DYLHNKLILGDGAIIDEGDA--TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
++LG I D TPL HY + L++I+V+G++L I+P++F +G
Sbjct: 258 KSGGGIMVLGQ---IKRPDTVYTPLVPSQPHYNVNLQSIAVNGQILPIDPSVFTI-ATGD 313
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
G +ID+GT + +L EAY + + + R ++ C+ I + D+ FP V
Sbjct: 314 GTIIDTGTTLAYLPDEAYSPFIQAIANAVS-QYGRPITYESYQCFE-ITAGDVDVFPEVS 371
Query: 336 FHFRGGAKLALEKDS---MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
F GGA + L + +F +C+ S + +++G + + V YD
Sbjct: 372 LSFAGGASMVLRPHAYLQIFSSSGSSIWCIGFQRMS----HRRITILGDLVLKDKVVVYD 427
Query: 393 IGRKQKTFQRMDCEV 407
+ R++ + DC +
Sbjct: 428 LVRQRIGWAEYDCSL 442
>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 507
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 107/384 (27%), Positives = 171/384 (44%), Gaps = 59/384 (15%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLGT---IFYPSRSSSYAYV 114
L++ + +G PP +DTGS +LWV C C +C S LG F S + V
Sbjct: 99 LYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSV 158
Query: 115 PCDSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
C C A+CS ++C Y+ Y G TS + T+ F ++
Sbjct: 159 TCSDPICSSVFQTTAAQCSE-NNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANS 217
Query: 172 ---VVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDP 217
+VFGC G T + GIFG G G+ S+VSQL+S FS+C L
Sbjct: 218 SAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHC---LKGD 274
Query: 218 DYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
+LG+ I+ G +PL HY + L +I V+G+ML ++ +F+ ++ G
Sbjct: 275 GSGGGVFVLGE--ILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNT-RG 331
Query: 277 VMIDSGTDVTWLVKEAYEALRDEVM---------IRLEGEQMRSYSWPDKLCYHGIMSSD 327
++D+GT +T+LVKEAY+ + + I GEQ CY + +S
Sbjct: 332 TIVDTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNGEQ----------CYL-VSTSI 380
Query: 328 LKGFPTVRFHFRGGAKLALE-KDSMFYQPRPDA---FCMAVNPASINNRYVNFSLIGMMA 383
FP+V +F GGA + L +D +F+ D +C+ A +++G +
Sbjct: 381 SDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQ-----TILGDLV 435
Query: 384 QQFYNVGYDIGRKQKTFQRMDCEV 407
+ YD+ R++ + DC +
Sbjct: 436 LKDKVFVYDLARQRIGWASYDCSM 459
>gi|357127507|ref|XP_003565421.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 438
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 111/429 (25%), Positives = 187/429 (43%), Gaps = 62/429 (14%)
Query: 2 IHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQE--------KIRSHNTYQAQILPS 53
IHRDS+ S +++P P R+++ S+AR A+ + A ++
Sbjct: 9 IHRDSVKSLFHDPTLTPEARLRQAARRSMARHAHAARINNSAAAAGASGSDDSDADVVSP 68
Query: 54 NDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAY 113
N + + + + PPV DTGSSL+W+ C +L P+ SSSYA
Sbjct: 69 MVPQNFEYLMALDVSTPPVRMLALADTGSSLVWLKC--------KLPAAHTPA-SSSYAR 119
Query: 114 VPCDSEHCRYF-PYARCSAY---KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHV 169
+PCD+ C+ A C A + C+Y + G T+ V+ + TF +
Sbjct: 120 LPCDAFACKALGDAASCRATGSGNNICVYRYAFADGSCTAGPVTVDAFTFSTRLD----- 174
Query: 170 QDVVFGCGFSTN-RNFKFSGIFGLGIGRSSLVSQLNS------SFSYCIGSLHDPDYLHN 222
FGC T + G+ GL G SLVSQL++ FSYC+ + + +
Sbjct: 175 ----FGCATRTEGLSVPDDGLVGLANGPISLVSQLSAKTPFAHKFSYCLVPYSSSETVSS 230
Query: 223 KLILGDGAIIDE---GDATPLQFIDGH----YYITLEAISVDGRMLDINPNIFKRDDSGG 275
L G AI+ TPL + G Y I L++I V G+ + + K
Sbjct: 231 SLNFGSHAIVSSSPGAATTPL--VAGRNKSFYTIALDSIKVAGKPVPLQTTTTK------ 282
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCY--HGIMSSDL-KGFP 332
+++DSGT +T+L K + L + ++ +++S +CY D+ K P
Sbjct: 283 -LIVDSGTMLTYLPKAVLDPLVAALTAAIKLPRVKSPETLYAVCYDVRRRAPEDVGKSIP 341
Query: 333 TVRFHFRGGAKLALEKDSMF-YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
V GG ++ L + F + + C+A+ + + F ++G +AQQ +VG+
Sbjct: 342 DVTLVLGGGGEVRLPWGNTFVVENKGTTVCLAL----VESHLPEF-ILGNVAQQNLHVGF 396
Query: 392 DIGRKQKTF 400
D+ R+ +F
Sbjct: 397 DLERRTVSF 405
>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 507
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 107/384 (27%), Positives = 171/384 (44%), Gaps = 59/384 (15%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLGT---IFYPSRSSSYAYV 114
L++ + +G PP +DTGS +LWV C C +C S LG F S + V
Sbjct: 99 LYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSFTAGSV 158
Query: 115 PCDSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
C C A+CS ++C Y+ Y G TS + T+ F ++
Sbjct: 159 TCSDPICSSVFQTTAAQCSE-NNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANS 217
Query: 172 ---VVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDP 217
+VFGC G T + GIFG G G+ S+VSQL+S FS+C L
Sbjct: 218 SAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHC---LKGD 274
Query: 218 DYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
+LG+ I+ G +PL HY + L +I V+G++L I+ +F+ ++ G
Sbjct: 275 GSGGGVFVLGE--ILVPGMVYSPLLPSQPHYNLNLLSIGVNGQILPIDAAVFEASNT-RG 331
Query: 277 VMIDSGTDVTWLVKEAYEALRDEV---------MIRLEGEQMRSYSWPDKLCYHGIMSSD 327
++D+GT +T+LVKEAY+ + + +I GEQ CY + +S
Sbjct: 332 TIVDTGTTLTYLVKEAYDPFLNAISNSVSQLVTLIISNGEQ----------CYL-VSTSI 380
Query: 328 LKGFPTVRFHFRGGAKLALE-KDSMFYQPRPDA---FCMAVNPASINNRYVNFSLIGMMA 383
FP V +F GGA + L +D +F+ D +C+ A +++G +
Sbjct: 381 SDMFPPVSLNFAGGASMMLRPQDYLFHYGFYDGASMWCIGFQKAPEEQ-----TILGDLV 435
Query: 384 QQFYNVGYDIGRKQKTFQRMDCEV 407
+ YD+ R++ + DC +
Sbjct: 436 LKDKVFVYDLARQRIGWANYDCSM 459
>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
Length = 447
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 111/419 (26%), Positives = 184/419 (43%), Gaps = 51/419 (12%)
Query: 27 NISIARLAYLQEK--IRSHNTYQAQI-LPSNDISNSLFYVNISIGQPPVPQFTAMDTGSS 83
+I++A L+ L ++ T ++ LP+ S + V S+G PP +DTGSS
Sbjct: 37 SINLAALSSLSRARHLKRPPTLTGKVTLPAYPRSYGGYSVIFSLGTPPQKVSLVLDTGSS 96
Query: 84 LLWVHC------YPCRDCS-----PQLGTIFYPSRSSSYAYVPCDSEHCRYF--PYARCS 130
L+W C Y C++C+ P I+ ++SS+ +PC S C + CS
Sbjct: 97 LVWTPCTIPTATYTCQNCTFSGVDPTKIPIYARNKSSTVQSLPCRSPKCNWVFGSDLNCS 156
Query: 131 AYKHRCIYTQL-YLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGI 189
K RC Y L Y +G T VS + L + + D +FGC +NR + GI
Sbjct: 157 TTK-RCPYYGLEYGLGSTTGQLVS-DVLGLSKLN----RIPDFLFGCSLVSNRQPE--GI 208
Query: 190 FGLGIGRSSLVSQLN-SSFSYCIGSLH-DPDYLHNKLILGDGAIIDEGDATPLQFI---- 243
G G G +S+ +QL + FSYC+ S D L+L G + A + +
Sbjct: 209 AGFGRGLASIPAQLGLTKFSYCLVSHRFDDTPQSGDLVLHRGRRHADAAANGVAYAPFTK 268
Query: 244 -------DGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYEA 295
+YYI+L I V G+ + I P G GG+++DSG+ T++ + ++
Sbjct: 269 SPALSPYSEYYYISLSKILVGGKDVPIPPRYLVPSKEGDGGMIVDSGSTFTFMERIIFDP 328
Query: 296 LRDEV---MIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMF 352
+ E+ M + + + S CY+ S++ P + F F+GGA + L F
Sbjct: 329 VARELEKHMTKYKRAKEIEDSSGLGPCYNITGQSEVD-VPKLTFSFKGGANMDLPLTDYF 387
Query: 353 YQPRPDAFCMAV-----NPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
CM V P S + ++G QQ + + YD+ +++ F+ C+
Sbjct: 388 SLVTDGVVCMTVLTDPDEPGSTTGPAI---ILGNYQQQNFYIEYDLKKQRFGFKPQQCD 443
>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
Length = 496
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 110/384 (28%), Positives = 159/384 (41%), Gaps = 49/384 (12%)
Query: 59 SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
+LF + + IG +DTGS + V C + +F P+ S SY VPC S
Sbjct: 98 ALFSMQLGIGSLQKNLSAIIDTGSEAVLVQC------GSRSRPVFDPAASQSYRQVPCIS 151
Query: 119 EHCRYFPYAR-------CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ- 170
+ C C C Y+ Y ++ S + + +T+ S VQ
Sbjct: 152 QLCLAVQQQTSNGSSQPCVNSSATCTYSLSYGDSRNSTGDFSQDVIFLNSTNSSGQAVQF 211
Query: 171 -DVVFGCGFSTNR---NFKFSGIFGLGIGRSSLVSQLN-----SSFSYCIGSLHDPDYLH 221
DV FGC S + GI G G SL SQL S FSYC S
Sbjct: 212 RDVAFGCAHSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQPWQPRAT 271
Query: 222 NKLILGDGAI---------IDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDD 272
+ LGD + + + TP + YY+ L +ISVDG+ L I + FK D
Sbjct: 272 GVIFLGDSGLSKSKVGYTPLLDNPVTPAR--SQLYYVGLTSISVDGKTLAIPESAFKLDP 329
Query: 273 SG--GGVMIDSGTDVTWLVKEAYEALRDEVMIR----LEGEQMRSYSWPDKLCYHGIMSS 326
S GG ++DSGT T +V +AY A R+ L + + + D CY+ S
Sbjct: 330 STGDGGTVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKVGAAAGFDD--CYNISAGS 387
Query: 327 DLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDA-----FCMAVNPASINNRYVNFSLIGM 381
L G P VR + +L L + +F P A C+A+ +S + + +++G
Sbjct: 388 SLPGVPEVRLSLQNNVRLELRFEHLFV-PVSAAGNEVTVCLAI-LSSQKSGFGKINVLGN 445
Query: 382 MAQQFYNVGYDIGRKQKTFQRMDC 405
Q Y V YD R + F+R DC
Sbjct: 446 YQQSNYLVEYDNERSRVGFERADC 469
>gi|218189440|gb|EEC71867.1| hypothetical protein OsI_04576 [Oryza sativa Indica Group]
Length = 508
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 97/361 (26%), Positives = 152/361 (42%), Gaps = 26/361 (7%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC---SPQLGTI--FYPSRSSSYA 112
++ ++ S+G PP +D S +W+ C C C +P + FY SS+
Sbjct: 94 TGMYVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATCGADAPAATSAPPFYAFLSSTIR 153
Query: 113 YVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPE--TSVFVSTEQLTFKNTDESTIHVQ 170
V C + C+ CSA C Y+ +Y G T+ ++ + F +T+
Sbjct: 154 EVRCANRGCQRLVPQTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAF-----ATVRAD 208
Query: 171 DVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDG 229
V+FGC +T + G+ GLG G SLVSQL FSY + D L L D
Sbjct: 209 GVIFGCAVATEGD--IGGVIGLGRGELSLVSQLQIGRFSYYLAPDDAVDVGSFILFLDDA 266
Query: 230 AI-IDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTD 284
+TPL + YY+ L I VDG L I F + D GGV++
Sbjct: 267 KPRTSRAVSTPLVANRASRSLYYVELAGIRVDGEDLAIPRGTFDLQADGSGGVVLSITIP 326
Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKL 344
VT+L AY+ +R + ++ LCY + K P++ F GGA +
Sbjct: 327 VTFLDAGAYKVVRQAMASKIGLRAADGSELGLDLCYTSESLATAK-VPSMALVFAGGAVM 385
Query: 345 ALEKDSMFYQPRPDAF-CMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
LE + FY C+ + P+ + SL+G + Q ++ YDI + F+ +
Sbjct: 386 ELEMGNYFYMDSTTGLECLTILPSPAGDG----SLLGSLIQVGTHMIYDISGSRLVFESL 441
Query: 404 D 404
+
Sbjct: 442 E 442
>gi|125575542|gb|EAZ16826.1| hypothetical protein OsJ_32298 [Oryza sativa Japonica Group]
Length = 396
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 97/375 (25%), Positives = 153/375 (40%), Gaps = 53/375 (14%)
Query: 63 VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR 122
N +IG PP P +D L+W C C C Q +F P+ SS++ PC ++ C+
Sbjct: 45 ANFTIGTPPQPASAIIDVAGELVWTQCSRCSRCFKQDLPLFIPNASSTFRPEPCGTDACK 104
Query: 123 YFPYARCSAYKHRCIY-----------TQLYLIGPET-SVFVSTEQLTFKNTDESTIHVQ 170
P + CS C Y T L ++G ET ++ +T L F S I
Sbjct: 105 STPTSNCSG--DVCTYESTTNIRLDRHTTLGIVGTETFAIGTATASLAFGCVVASDIDTM 162
Query: 171 DVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDG 229
D SG GLG SLV+Q+ + FSYC+ ++L LG
Sbjct: 163 DGT-------------SGFIGLGRTPRSLVAQMKLTKFSYCLSPRGTGK--SSRLFLGSS 207
Query: 230 AIIDEGDATPLQ-FI------DGHYY--ITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
A + G++T FI D H+Y ++L+AI + SGG +++
Sbjct: 208 AKLAGGESTSTAPFIKTSPDDDSHHYYLLSLDAIRAGNTTIATA-------QSGGILVMH 260
Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEG---EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFH 337
+ + + LV AY A + V + G + M + P LC+ P + F
Sbjct: 261 TVSPFSLLVDSAYRAFKKAVTEAVGGAAEQPMATPPQPFDLCFKKAAGFSRATAPDLVFT 320
Query: 338 FRGGAKLALEKDSMFYQ--PRPDAFCMAVNPASINNR--YVNFSLIGMMAQQFYNVGYDI 393
F+G A L + D C A+ + NR S++G + Q+ + YD+
Sbjct: 321 FQGAAALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDL 380
Query: 394 GRKQKTFQRMDCEVL 408
++ +F+ DC L
Sbjct: 381 KKETLSFEPADCSSL 395
>gi|226492150|ref|NP_001146362.1| hypothetical protein precursor [Zea mays]
gi|219886805|gb|ACL53777.1| unknown [Zea mays]
gi|414878074|tpg|DAA55205.1| TPA: hypothetical protein ZEAMMB73_415404 [Zea mays]
Length = 440
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 102/369 (27%), Positives = 157/369 (42%), Gaps = 36/369 (9%)
Query: 67 IGQPPVPQFTAMDTGSSLLWVHCYPCR-DCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFP 125
IG PP +DTGS+L+W C CR C Q + PSRS + V C+ C
Sbjct: 77 IGDPPQRAEAIIDTGSNLIWTQCSRCRPTCFRQNLPYYDPSRSRAARAVGCNDAACALGS 136
Query: 126 YARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC----GFSTN 181
+C + C Y G + ++TE LTF++ S +VFGC S
Sbjct: 137 ETQCLSDNKTCAVVTGYGAG-NIAGTLATENLTFQSETVS------LVFGCIVVTKLSPG 189
Query: 182 RNFKFSGIFGLGIGRSSLVSQL-NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDA--T 238
SGI GLG G+ SL SQL ++ FSYC+ + + +++G A + G A T
Sbjct: 190 SLNGASGIIGLGRGKLSLPSQLGDTRFSYCLTPYFEDTIEPSHMVVGASAGLINGSASST 249
Query: 239 PLQFI-----------DGHYYITLEAISVDGRMLDINPNIFK-RDDSGG---GVMIDSGT 283
P+ + YY+ L I+ L + F R + G G IDSG
Sbjct: 250 PVTTVPFVRSPSDDPFSTFYYLPLTGITAGKVKLAVPSAAFDLRQVAPGMWTGTFIDSGA 309
Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAK 343
+T LV AY+ALR E+ +L ++ + + + P + HF GG+
Sbjct: 310 PLTSLVDVAYQALRAELARQLGAALVQPLAGTTGFDLCVALKDAERLVPPLVLHFGGGSG 369
Query: 344 LALE---KDSMFYQPRPDAFCMAVNPASINNRYVNF---SLIGMMAQQFYNVGYDIGRKQ 397
+ + ++ P A V +S++ + + ++IG QQ +V YD+
Sbjct: 370 TGTDLVVPPANYWAPVDSATACMVVFSSVDRKSLPMNETTVIGNYMQQNMHVLYDLAGGV 429
Query: 398 KTFQRMDCE 406
+FQ DC
Sbjct: 430 LSFQPADCS 438
>gi|297810815|ref|XP_002873291.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
gi|297319128|gb|EFH49550.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
Length = 439
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 111/424 (26%), Positives = 169/424 (39%), Gaps = 41/424 (9%)
Query: 1 LIHRDSILSPYNNPNE-NPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSND---- 55
+ H DS SP+ +P+ + RV + + ARL YL + + ++P
Sbjct: 39 IFHIDSPCSPFKSPSPLSWEARVLQTLAQDQARLQYLSSLVAGRS-----VVPIASGRQM 93
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
+ ++ + V + IG P P AMDT S + W+ C C C T F P++S+S+ V
Sbjct: 94 LQSTTYIVKVLIGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSN--TAFSPAKSTSFKNVS 151
Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
C + C+ P C A C + Y +S+ + Q T + + ++ FG
Sbjct: 152 CSAPQCKQVPNPACGA--RACSFNLTY---GSSSIAANLSQDTIRLAAD---PIKAFTFG 203
Query: 176 C-------GFSTNRNFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGD 228
C G G S S S+FSYC+ S + L LG
Sbjct: 204 CVNKVAGGGTIPPPQGLLGLGRGPLSLMSQAQSVYKSTFSYCLPSFRSLTF-SGSLRLGP 262
Query: 229 GAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDS-GGGVMIDSGTD 284
+ T L YY+ L AI V +++D+ P + S G G + DSGT
Sbjct: 263 TSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTV 322
Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD-KLCYHGIMSSDLKGFPTVRFHFRGGAK 343
T L K YEA+R+E R++ S CY G + PT+ F F+G
Sbjct: 323 YTRLAKPVYEAVRNEFRKRVKPPTAVVTSLGGFDTCYSGQVK-----VPTITFMFKGVNM 377
Query: 344 LALEKDSMFYQPRPDAFCMAVNPASIN-NRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
+ M + C+A+ A N N VN +I M QQ + V D+ + R
Sbjct: 378 TMPADNLMLHSTAGSTSCLAMASAPENVNSVVN--VIASMQQQNHRVLIDVPNGRLGLAR 435
Query: 403 MDCE 406
C
Sbjct: 436 ERCS 439
>gi|297801286|ref|XP_002868527.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314363|gb|EFH44786.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 444
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 96/373 (25%), Positives = 168/373 (45%), Gaps = 38/373 (10%)
Query: 63 VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG--TIFYPSRSSSYAYVPCDSEH 120
+++ IG P Q +DTGS L W+ C+P + P T F PS SSS++ +PC
Sbjct: 83 LSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHPL 142
Query: 121 CRY----FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
C+ F C Y+ Y G + E+ TF N+ + ++ GC
Sbjct: 143 CKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQTT----PPLILGC 198
Query: 177 GFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHN--KLILGDG---- 229
+ GI G+ +GR S +SQ S FSYCI + + L + LG+
Sbjct: 199 A---KESTDVKGILGMNLGRLSFISQAKISKFSYCIPTRSNRPGLASTGSFYLGENPNSR 255
Query: 230 -----AIIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMIDSG 282
+++ + + +D Y + L I + + L+I ++F+ D G G M+DSG
Sbjct: 256 GFKYVSLLTFPQSQRMPNLDPLAYTVPLLGIRIGQKRLNIPSSVFRPDAGGSGQTMVDSG 315
Query: 283 TDVTWLVKEAYEALRDEVMIRLEGEQMRS---YSWPDKLCY---HGIMSSDLKGFPTVRF 336
++ T LV AY+ +++E+ +RL G +++ Y +C+ H ++ L G + F
Sbjct: 316 SEFTHLVDVAYDKVKEEI-VRLVGSRLKKGYVYGSTADMCFDGNHQMVIGRLIG--DLVF 372
Query: 337 HFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
F G ++ +EK + C+ + +S+ N +IG + QQ V +D+ +
Sbjct: 373 EFGRGVEILVEKQRLLVNVGGGIHCVGIGRSSMLGAASN--IIGNVHQQNLWVEFDVANR 430
Query: 397 QKTFQRMDCEVLD 409
+ F + +C L
Sbjct: 431 RVGFSKAECSRLS 443
>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 492
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 105/375 (28%), Positives = 169/375 (45%), Gaps = 42/375 (11%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSP------QLGTIFYPSRSSSYAY 113
L+Y + +G PP +DTGS +LWV C C C QL + F P SSS +
Sbjct: 83 LYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQL-SFFDPGVSSSASL 141
Query: 114 VPCDSEHC--RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
V C C + + CS + C Y+ Y G TS + ++ ++F ST+ +
Sbjct: 142 VSCSDRRCYSNFQTESGCSP-NNLCSYSFKYGDGSGTSGYYISDFMSFDTVITSTLAINS 200
Query: 172 ---VVFGCGFSTNRNFK-----FSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDP 217
VFGC + + + GIFGLG G S++SQL FS+C L
Sbjct: 201 SAPFVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHC---LKGD 257
Query: 218 DYLHNKLILGDGAIIDEGDA--TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
++LG I D TPL HY + L++I+V+G++L I+P++F +G
Sbjct: 258 KSGGGIMVLGQ---IKRPDTVYTPLVPSQPHYNVNLQSIAVNGQILPIDPSVFTI-ATGD 313
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
G +ID+GT + +L EAY V + + R ++ C+ I + D+ FP V
Sbjct: 314 GTIIDTGTTLAYLPDEAYSPFIQAVANAVS-QYGRPITYESYQCFE-ITAGDVDVFPQVS 371
Query: 336 FHFRGGAKLALEKDS---MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
F GGA + L + +F +C+ S + +++G + + V YD
Sbjct: 372 LSFAGGASMVLGPRAYLQIFSSSGSSIWCIGFQRMS----HRRITILGDLVLKDKVVVYD 427
Query: 393 IGRKQKTFQRMDCEV 407
+ R++ + DC +
Sbjct: 428 LVRQRIGWAEYDCSL 442
>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
Length = 461
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 118/428 (27%), Positives = 173/428 (40%), Gaps = 51/428 (11%)
Query: 1 LIHRDSILSPYNNPN----ENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDI 56
L HR SP E HR Q R AY+Q K + S+
Sbjct: 62 LHHRHGPCSPLPTKKMPTLEETLHRDQL-------RAAYIQRKFSGGGGAGGDVQRSDAT 114
Query: 57 S--------NSLFY-VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSR 107
N+L Y + + +G P Q +DTGS + WV C PC C Q +F PS
Sbjct: 115 VPTALGTSLNTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSS 174
Query: 108 SSSYAYVPCDSEHCRYFPYA--RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES 165
SS+Y+ C S C CS+ +C Y Y G T+ S++ L ++
Sbjct: 175 SSTYSPFSCGSADCAQLGQEGNGCSS-SSQCQYIVTYGDGSSTTGTYSSDTLALGSS--- 230
Query: 166 TIHVQDVVFGCG-FSTNRNFKFSGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYL 220
V+ FGC + N + G+ GLG G SLVSQ L +FSYC+
Sbjct: 231 --AVRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGF 288
Query: 221 HNKLILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
L G+ TP+ + Y + L+AI V GR L I ++F G
Sbjct: 289 LT-LGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFS-----AGT 342
Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFH 337
++DSGT +T L AY AL ++ S C+ S + P+V
Sbjct: 343 VMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVS-IPSVALV 401
Query: 338 FRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
F GGA ++L+ + + C+A + N+ + +IG + Q+ + V YD+GR
Sbjct: 402 FSGGAVVSLDASGIIL-----SNCLAF---AGNSDDSSLGIIGNVQQRTFEVLYDVGRGV 453
Query: 398 KTFQRMDC 405
F+ C
Sbjct: 454 VGFRAGAC 461
>gi|255581545|ref|XP_002531578.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223528808|gb|EEF30814.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 98/378 (25%), Positives = 166/378 (43%), Gaps = 48/378 (12%)
Query: 63 VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR 122
V++++G PP +DTGS L W+HC + L ++F P S +Y+ VPC S C+
Sbjct: 71 VSLTVGSPPQNVTMVLDTGSELSWLHCKKTQF----LNSVFNPLSSKTYSKVPCLSPTCK 126
Query: 123 YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTE-QLTFKNTDESTIHVQDVVFGC---GF 178
R C T+L + + S E L F+ ++ +FGC GF
Sbjct: 127 --TRTRDLTIPVSCDATKLCHVIVSYADATSIEGNLAFETFRLGSLTKPATIFGCMDSGF 184
Query: 179 STN--RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
S+N + K +G+ G+ G S V+Q+ FSYCI L+LG+ +
Sbjct: 185 SSNSEEDSKTTGLIGMNRGSLSFVNQMGYPKFSYCISGFDSAGV----LLLGNASFPWLK 240
Query: 236 D---------ATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMIDSGTD 284
+TPL + D Y + LE I V ++L + ++F D +G G M+DSGT
Sbjct: 241 PLSYTPLVQISTPLPYFDRVAYTVQLEGIKVKNKVLSLPKSVFVPDHTGAGQTMVDSGTQ 300
Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMS---------SDLKGFPTVR 335
T+L+ Y AL++E + + G + D + G M +L+ P V
Sbjct: 301 FTFLLGPVYTALKNEFLSQTRG--ILKVLNDDNFVFQGAMDLCYLLDSSRPNLQNLPVVS 358
Query: 336 FHFRGGAKLALEKDSMFYQP------RPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNV 389
F+ GA++++ + + Y+ R +C + + V +IG QQ +
Sbjct: 359 LMFQ-GAEMSVSGERLLYRVPGEVRGRDSVWCFTFGNSDLLG--VEAFVIGHHHQQNVWM 415
Query: 390 GYDIGRKQKTFQRMDCEV 407
+D+ + + + C+V
Sbjct: 416 EFDLEKSRIGLADVRCDV 433
>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
Length = 512
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 104/358 (29%), Positives = 166/358 (46%), Gaps = 46/358 (12%)
Query: 78 MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYAR--------- 128
+DT S L WV C PC C Q +F PS S SYA VPC+S C A
Sbjct: 168 VDTASELTWVQCAPCESCHDQQDPLFDPSSSPSYAAVPCNSSSCDALQLATGGTSGGAAA 227
Query: 129 CSAYKHR---CIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFK 185
C C YT Y G + ++ ++L+ + VFGCG ++N+
Sbjct: 228 CQGQDQSAAACSYTLSYRDGSYSRGVLAHDRLSLAGE-----VIDGFVFGCG-TSNQGPP 281
Query: 186 FSGIFGL-GIGRS--SLVS----QLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
F G GL G+GRS SLVS Q FSYC+ L + D L++GD + + ++T
Sbjct: 282 FGGTSGLMGLGRSQLSLVSQTMDQFGGVFSYCL-PLKESDS-SGSLVIGDDSSVYR-NST 338
Query: 239 PLQF-------IDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVK 290
P+ + + G Y++ L I+V G+ ++ + F GG +IDSGT +T LV
Sbjct: 339 PIVYASMVSDPLQGPFYFVNLTGITVGGQ--EVESSGFSSGGGGGKAIIDSGTVITSLVP 396
Query: 291 EAYEALRDEVMIRL-EGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKD 349
Y A++ E + + E Q +S D C++ +++ P+++ F GG ++ ++
Sbjct: 397 SIYNAVKAEFLSQFAEYPQAPGFSILDT-CFNMTGLREVQ-VPSLKLVFDGGVEVEVDSG 454
Query: 350 SMFYQPRPDA--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ Y D+ C+A+ P + + Y ++IG Q+ V +D Q F + C
Sbjct: 455 GVLYFVSSDSSQVCLAMAP--LKSEY-ETNIIGNYQQKNLRVIFDTSGSQVGFAQETC 509
>gi|357162717|ref|XP_003579500.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 488
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 102/396 (25%), Positives = 168/396 (42%), Gaps = 53/396 (13%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHC---YPCRDC-----SPQLGTIFYPSRSSSYA 112
+ ++S+G PP P +DTGS L WV C Y CR+C + +F+P SSS
Sbjct: 91 YAFSVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNCSSSPSAMSAMAVFHPKNSSSSR 150
Query: 113 YVPCDSEHCRYF---PYARCSAYKHR-----C-IYTQLYLIGPETSVFVS-TEQLTFKNT 162
V C + CR+ + C + + C Y +Y G + + +S T +L+ ++
Sbjct: 151 LVGCRNPACRWIHSKSPSTCGSTGNNGNGDVCPPYLVVYGSGSTSGLLISDTLRLSPSSS 210
Query: 163 DESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGS--LHDPDY 219
+ ++ GC + SG+ G G G S+ SQL FSYC+ S D
Sbjct: 211 SSAPAPFRNFAIGCSIVSVHQ-PPSGLAGFGRGAPSVPSQLKVPKFSYCLLSRRFDDNSA 269
Query: 220 LHNKLILGDGAIIDEGDATPLQFI------------DGHYYITLEAISVDGRMLDINPNI 267
+ +L+LGD + T +Q++ +YY+ L ISV G+ +++
Sbjct: 270 VSGELVLGDAMVPAGKKKTTMQYVPLLNNAASKPPYSVYYYLALTGISVGGKPVNLPSRA 329
Query: 268 FKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMS-- 325
F SGGG +IDSGT T+L ++ + + + G RS D L +
Sbjct: 330 FV-PSSGGGAIIDSGTTFTYLDPTVFKPVAAAMESAVGGRYNRSRPVEDALGLRPCFALP 388
Query: 326 ---SDLKGFPTVRFHFRGGAKLALEKDSMF--------YQPRPDAFCMAV-----NPASI 369
P + F+GGA + L ++ F P A C+AV
Sbjct: 389 PGPGGAMELPDLELKFKGGAVMRLPVENYFVAAGPAGGPAAGPVAICLAVVSDLPASGGD 448
Query: 370 NNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
++G QQ Y++ YD+G+++ F++ C
Sbjct: 449 GAAAGPAIILGSFQQQNYHIEYDLGKERLGFRQQPC 484
>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
Length = 395
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 98/392 (25%), Positives = 175/392 (44%), Gaps = 45/392 (11%)
Query: 48 AQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYP---CRDCSPQLGTIFY 104
++++ + I + ++V + +G P +DTGS L W+ C P + S +
Sbjct: 14 SRLVSGSSIGSGQYFVELRVGTPAKKFPLIIDTGSDLTWIQCNPPNTTANSSSPPAPWYD 73
Query: 105 PSRSSSYAYVPCDSEHCRYFPY---ARCSAYKHR-CIYTQLYLIGPETSVFVSTEQLTFK 160
S SSSY +PC + C + P + CS C YT Y T+ ++ E ++ K
Sbjct: 74 KSSSSSYREIPCTDDECLFLPAPIGSSCSIKSPSPCDYTYGYSDQSRTTGILAYETISMK 133
Query: 161 NTDES----------TIHVQDVVFGCGF-STNRNF-KFSGIFGLGIGRSSLVSQ-----L 203
+ S TI +++V GC S +F SG+ GLG G SL +Q L
Sbjct: 134 SRKRSGKRAGNHKTRTIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTAL 193
Query: 204 NSSFSYCIGSLHDPDYLHNK-----LILGDGAIIDEGDATPL---QFIDGHYYITLEAIS 255
FSYC+ DYL L++G + TP+ YY+ + ++
Sbjct: 194 GGIFSYCL-----VDYLRGSNASSFLVMGR-TRWRKLAHTPIVRNPAAQSFYYVNVTGVA 247
Query: 256 VDGRMLD-INPNIFKRDDSGG-GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYS 313
VDG+ +D I + + D G G + DSGT +++L + AY + + + + +
Sbjct: 248 VDGKPVDGIASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIP 307
Query: 314 WPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRY 373
+LCY+ ++ KG P + F+GGA + L ++ + C+A+ + N
Sbjct: 308 EGFELCYN--VTRMEKGMPKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQKVTTTN-- 363
Query: 374 VNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+++G + QQ +++ YD+ + + F+ C
Sbjct: 364 -GSNILGNLLQQDHHIEYDLAKARIGFKWSPC 394
>gi|224144963|ref|XP_002325476.1| predicted protein [Populus trichocarpa]
gi|222862351|gb|EEE99857.1| predicted protein [Populus trichocarpa]
Length = 372
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 86/360 (23%), Positives = 161/360 (44%), Gaps = 30/360 (8%)
Query: 59 SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ--LG---TIFYPSRSSSYAY 113
SL++ I +G P + +DTGS +LWV+C C C + LG T++ P+ S S
Sbjct: 25 SLYFAKIGLGNPSKDYYVQVDTGSDILWVNCIGCDKCPTKSDLGIKLTLYDPASSVSATR 84
Query: 114 VPCDSEHCRYFPYARCSAYKHR--CIYTQLYLIGPETSVFVSTEQLTFKNTD---ESTIH 168
V CD + C K C Y +Y G T+ + ++ + F+ ++ +
Sbjct: 85 VSCDDDFCTSTYNGLLPDCKKELPCQYNVVYGDGSSTAGYFVSDAVQFERVTGNLQTGLS 144
Query: 169 VQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGD 228
V FGCG + GLG +L L +F++C+ D ++ I
Sbjct: 145 NGTVTFGCGAQQSG--------GLGTSGEALDGIL-GAFAHCL------DNVNGGGIFAI 189
Query: 229 GAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
G ++ + + TP+ HY + ++ I V G +L++ ++F D G +IDSGT + +
Sbjct: 190 GELVSPKVNTTPMVPNQAHYNVYMKEIEVGGTVLELPTDVFDSGDR-RGTIIDSGTTLAY 248
Query: 288 LVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALE 347
L + Y+++ +E+ + G + + +C+ + D GFP ++FHF+ L +
Sbjct: 249 LPEVVYDSMMNEIRSQQPGLSLHTVE-EQFICFKYSGNVD-DGFPDIKFHFKDSLTLTVY 306
Query: 348 KDSMFYQPRPDAFCMAVNPASINNR-YVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
+Q D +C + ++ + +L+G + V YDI + + +C+
Sbjct: 307 PHDYLFQISEDIWCFGWQNGGMQSKDGRDMTLLGDLVLSNKLVLYDIENQAIGWTEYNCK 366
>gi|226531872|ref|NP_001147022.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195606574|gb|ACG25117.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 491
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 105/392 (26%), Positives = 163/392 (41%), Gaps = 51/392 (13%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHC---YPCRDCSPQLGT---IFYPSRSSSYAYV 114
+ S+G PP P +DTGS L WV C Y CR+CS + +F+P SSS V
Sbjct: 99 YAFTASLGTPPQPLPVLLDTGSHLTWVPCTSSYECRNCSSPSASAVPVFHPKNSSSSRLV 158
Query: 115 PCDSEHCRYFPYAR----------CSAYKHRCIYTQLYLIGPETSVF--VSTEQLTFKNT 162
C + C++ A CS C + P V+ ST L +T
Sbjct: 159 GCRNPSCQWVHSAANLATKCRRAPCSPGAANCPAAASNVCPPYAVVYGSGSTAGLLIADT 218
Query: 163 DESTIH-VQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYL 220
+ V V GC + SG+ G G G S+ +QL FSYC+ S D
Sbjct: 219 LRAPGRAVPGFVLGCSLVSVHQ-PPSGLAGFGRGAPSVPAQLGLPKFSYCLLSRRFDD-- 275
Query: 221 HNKLILGDGAIIDEGDATPLQFI-------------DGHYYITLEAISVDGRMLDINPNI 267
N + G + G +Q++ +YY+ L ++V G+ + +
Sbjct: 276 -NAAVSGSLVLGGTGGGEGMQYVPLVKSAAGDKLPYGVYYYLALRGVTVGGKAVRLPARA 334
Query: 268 FKRDDSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL----CYHG 322
F + +G GG ++DSGT T+L ++ + D V+ + G RS D L C+
Sbjct: 335 FAGNAAGSGGTIVDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDAEDGLGLHPCFAL 394
Query: 323 IMSSDLKGFPTVRFHFRGGAKLALEKDSMFY---QPRPDAFCMAV------NPASINNRY 373
+ P + FHF GGA + L ++ F + +A C+AV + N
Sbjct: 395 PQGARSMALPELSFHFEGGAVMQLPVENYFVVAGRGAVEAICLAVVTDFGGGSGAGNEGS 454
Query: 374 VNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
++G QQ Y V YD+ +++ F+R C
Sbjct: 455 GPAIILGSFQQQNYLVEYDLEKERLGFRRQSC 486
>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 102/388 (26%), Positives = 170/388 (43%), Gaps = 58/388 (14%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
N V+++ G P +DTGS L W+HC P +IF P S +Y +PC
Sbjct: 64 NVTLTVSLTAGTPLQNITMVLDTGSELSWLHCKK----EPNFNSIFNPLASKTYTKIPCS 119
Query: 118 SEHC----RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
S C R P C + Y +SV L F+ ++ V
Sbjct: 120 SPTCETRTRDLPLPVSCDPAKLCHFIISY--ADASSV---EGNLAFETFRVGSVTGPATV 174
Query: 174 FGC---GFSTN--RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILG 227
FGC GFS+N + K +G+ G+ G S V+Q+ FSYCI D D L+LG
Sbjct: 175 FGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVNQMGFRKFSYCIS---DRDS-SGVLLLG 230
Query: 228 DGA-----------IIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGG 275
+ + +++ +TPL + D Y + LE I V ++L + ++F D +G
Sbjct: 231 EASFSWLKPLNYTPLVEM--STPLPYFDRVAYSVQLEGIRVSDKVLSLPKSVFVPDHTGA 288
Query: 276 G-VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMS--------- 325
G M+DSGT T+L+ Y AL+ E +++ +G +R + P + + G M
Sbjct: 289 GQTMVDSGTQFTFLLGPVYSALKQEFLLQTKG-VLRVLNEP-RYVFQGAMDLCYLIEPTR 346
Query: 326 SDLKGFPTVRFHFRGGAKLALEKDSMFYQP------RPDAFCMAVNPASINNRYVNFSLI 379
+ L P V FR GA++++ + Y+ + +C + ++ + +I
Sbjct: 347 AALPNLPVVNLMFR-GAEMSVSGQRLLYRVPGEVRGKDSVWCFTFGNS--DSLGIESFVI 403
Query: 380 GMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
G QQ + YD+ + + F + C++
Sbjct: 404 GHHQQQNVWMEYDLEKSRIGFAEVRCDL 431
>gi|222640709|gb|EEE68841.1| hypothetical protein OsJ_27628 [Oryza sativa Japonica Group]
Length = 375
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 94/353 (26%), Positives = 154/353 (43%), Gaps = 50/353 (14%)
Query: 73 PQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAY 132
P+ +DTGS L+W C S S++ A R P AR A+
Sbjct: 52 PRKLIVDTGSDLIWTQCKL--------------SSSTAAAARHGSPPLSRTAP-ARTGAF 96
Query: 133 KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG-FSTNRNFKFSGIFG 191
C + +G +++E TF ++ + FGCG S +GI G
Sbjct: 97 TRTCTASAAA-VG-----VLASETFTFGARRAVSLRLG---FGCGALSAGSLIGATGILG 147
Query: 192 LGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT-PLQFI------ 243
L SL++QL FSYC+ D + L+ G A + T P+Q
Sbjct: 148 LSPESLSLITQLKIQRFSYCLTPFADKKT--SPLLFGAMADLSRHKTTRPIQTTAIVSNP 205
Query: 244 --DGHYYITLEAISVDGRMLDI-NPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEV 300
+YY+ L IS+ + L + ++ R D GGG ++DSG+ V +LV+ A+EA+++ V
Sbjct: 206 VETVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAVKEAV 265
Query: 301 M--IRLEGEQMRSYSWPDKLCY-----HGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFY 353
M +RL + +LC+ + + P + HF GGA + L +D+ F
Sbjct: 266 MDVVRLPVANRTVEDY--ELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDNYFQ 323
Query: 354 QPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
+PR C+AV + + S+IG + QQ +V +D+ + +F C+
Sbjct: 324 EPRAGLMCLAVGKTTDGS---GVSIIGNVQQQNMHVLFDVQHHKFSFAPTQCD 373
>gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 101/426 (23%), Positives = 164/426 (38%), Gaps = 33/426 (7%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
L HRD++ NP R++ I R + + K + + + D +
Sbjct: 35 LAHRDTLW-------PNPLSRIEDIIGADQKRHSLISRKRKFKGGVKMDLGSGIDYGTAQ 87
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHC-YPCRDCSP-QLGTIFYPSRSSSYAYVPCDS 118
++ + +G P +DTGS L WV+C Y R + +F S S+ V C +
Sbjct: 88 YFTEVRVGTPAKKFRVVVDTGSELTWVNCRYRGRGKGKVKNRRVFRAEESKSFKTVGCFT 147
Query: 119 EHCR-----YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
+ C+ F + C C Y Y G + E +T T+ ++ ++
Sbjct: 148 QTCKVDLMNLFSLSTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRKARLRGLL 207
Query: 174 FGCGFSTNRNFKFS--GIFGLGIGRSSLVSQLNSSF----SYCIGSLHDPDYLHNKLILG 227
GC S + G+ GL S S S F SYC+ + N LI G
Sbjct: 208 VGCSSSFSGQSFQGADGVLGLAFSDFSFTSTATSLFGAKLSYCLVDHLSNKNISNYLIFG 267
Query: 228 -----DGAIIDEGDATP--LQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
G TP L I Y I + IS+ MLDI ++ +GGG ++D
Sbjct: 268 YSSSSTSTKTAPGRTTPLDLTLIPPFYAINIIGISIGDDMLDIPTQVWDA-TTGGGTILD 326
Query: 281 SGTDVTWLVKEAYEALRDEVMIRL-EGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFR 339
SGT +T L + AY+ + + L E ++++ P + C+ + P + FH +
Sbjct: 327 SGTSLTLLAEAAYKPVVTGLARYLVELKRVKPEGIPIEYCFSSTSGFNESKLPQLTFHLK 386
Query: 340 GGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKT 399
GGA+ + S P C+ A V +G + QQ Y +D+ +
Sbjct: 387 GGARFEPHRKSYLVDAAPGVKCLGFMSAGTPATNV----VGNIMQQNYLWEFDLMASTLS 442
Query: 400 FQRMDC 405
F C
Sbjct: 443 FAPSTC 448
>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 531
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 118/428 (27%), Positives = 173/428 (40%), Gaps = 51/428 (11%)
Query: 1 LIHRDSILSPYNNPN----ENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDI 56
L HR SP E HR Q R AY+Q K + S+
Sbjct: 132 LHHRHGPCSPLPTKKMPTLEETLHRDQ-------LRAAYIQRKFSGGGGAGGDVQRSDAT 184
Query: 57 S--------NSLFY-VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSR 107
N+L Y + + +G P Q +DTGS + WV C PC C Q +F PS
Sbjct: 185 VPTALGTSLNTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSS 244
Query: 108 SSSYAYVPCDSEHCRYFPYA--RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES 165
SS+Y+ C S C CS+ +C Y Y G T+ S++ L ++
Sbjct: 245 SSTYSPFSCGSADCAQLGQEGNGCSS-SSQCQYIVTYGDGSSTTGTYSSDTLALGSS--- 300
Query: 166 TIHVQDVVFGCG-FSTNRNFKFSGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYL 220
V+ FGC + N + G+ GLG G SLVSQ L +FSYC+
Sbjct: 301 --AVRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGF 358
Query: 221 HNKLILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
L G+ TP+ + Y + L+AI V GR L I ++F G
Sbjct: 359 LT-LGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFS-----AGT 412
Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFH 337
++DSGT +T L AY AL ++ S C+ S + P+V
Sbjct: 413 VMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVS-IPSVALV 471
Query: 338 FRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
F GGA ++L+ + + C+A + N+ + +IG + Q+ + V YD+GR
Sbjct: 472 FSGGAVVSLDASGIIL-----SNCLAF---AGNSDDSSLGIIGNVQQRTFEVLYDVGRGV 523
Query: 398 KTFQRMDC 405
F+ C
Sbjct: 524 VGFRAGAC 531
>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
Length = 469
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 97/352 (27%), Positives = 140/352 (39%), Gaps = 42/352 (11%)
Query: 72 VPQFTAMDTGSSLLWVHCYPCRD--CSPQLGTIFYPSRSSSYAYVPCDSEHCRYF-PYAR 128
V Q +DT S + WV C PC C PQ ++ P++SSS C+S C PYA
Sbjct: 142 VTQTMVLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYAN 201
Query: 129 CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF-- 186
++C Y Y G T+ ++ LT V+ FGC +F F
Sbjct: 202 GCTNNNQCQYRVRYPDGTSTAGTYISDLLTIT----PATAVRSFQFGCSHGVQGSFSFGS 257
Query: 187 --SGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIID---------EG 235
+GI LG G SLVSQ +++ P LG + +
Sbjct: 258 SAAGIMALGGGPESLVSQTAATYGRVFSHCFPPPTRRGFFTLGVPRVAAWRYVLTPMLKN 317
Query: 236 DATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEA 295
A P F Y + LEAI+V G+ + + P +F G +DS T +T L AY+A
Sbjct: 318 PAIPPTF----YMVRLEAIAVAGQRIAVPPTVF-----AAGAALDSRTAITRLPPTAYQA 368
Query: 296 LRDEVMIRLEGEQMRSYSWPDKLCY--HGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFY 353
LR R+ Q P CY G+ S L P + F A + L+ + +
Sbjct: 369 LRQAFRDRMAMYQPAPPKGPLDTCYDMAGVRSFAL---PRITLVFDKNAAVELDPSGVLF 425
Query: 354 QPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
Q C+A A N++ +IG + Q V Y+I F+ C
Sbjct: 426 Q-----GCLAFT-AGPNDQVPG--IIGNIQLQTLEVLYNIPAALVGFRHAAC 469
>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 494
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 97/352 (27%), Positives = 140/352 (39%), Gaps = 42/352 (11%)
Query: 72 VPQFTAMDTGSSLLWVHCYPCRD--CSPQLGTIFYPSRSSSYAYVPCDSEHCRYF-PYAR 128
V Q +DT S + WV C PC C PQ ++ P++SSS C+S C PYA
Sbjct: 167 VTQTMVLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYAN 226
Query: 129 CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF-- 186
++C Y Y G T+ ++ LT V+ FGC +F F
Sbjct: 227 GCTNNNQCQYRVRYPDGTSTAGTYISDLLTIT----PATAVRSFQFGCSHGVQGSFSFGS 282
Query: 187 --SGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIID---------EG 235
+GI LG G SLVSQ +++ P LG + +
Sbjct: 283 SAAGIMALGGGPESLVSQTAATYGRVFSHCFPPPTRRGFFTLGVPRVAAWRYVLTPMLKN 342
Query: 236 DATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEA 295
A P F Y + LEAI+V G+ + + P +F G +DS T +T L AY+A
Sbjct: 343 PAIPPTF----YMVRLEAIAVAGQRIAVPPTVF-----AAGAALDSRTAITRLPPTAYQA 393
Query: 296 LRDEVMIRLEGEQMRSYSWPDKLCY--HGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFY 353
LR R+ Q P CY G+ S L P + F A + L+ + +
Sbjct: 394 LRQAFRDRMAMYQPAPPKGPLDTCYDMAGVRSFAL---PRITLVFDKNAAVELDPSGVLF 450
Query: 354 QPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
Q C+A A N++ +IG + Q V Y+I F+ C
Sbjct: 451 Q-----GCLAFT-AGPNDQVPG--IIGNIQLQTLEVLYNIPAALVGFRHAAC 494
>gi|222616654|gb|EEE52786.1| hypothetical protein OsJ_35257 [Oryza sativa Japonica Group]
Length = 346
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 99/363 (27%), Positives = 161/363 (44%), Gaps = 37/363 (10%)
Query: 63 VNISIGQPPVPQFTAMDTGSSLLWVHCYPCR----DCSPQLGTIFYPSRSSSYAYVPCDS 118
+ IS+G PPV +DTGS+L WV C C+ D + + G IF P SS+Y+ V C +
Sbjct: 1 MGISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSKVGCST 60
Query: 119 EHCR------YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
E C Y C CIY+ Y G + ++ ++LT S + +
Sbjct: 61 EACNGMHMDLAVEYG-CVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLA----SNRSIDNF 115
Query: 173 VFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-----SSFSYCIGSLHDPDYLHNKLILG 227
+FGCG N +GI G G S +Q+ ++FSYC H+ + L +G
Sbjct: 116 IFGCGEDNLYNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPRDHENE---GSLTIG 172
Query: 228 DGAIIDEGDATPLQFIDGH--YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDV 285
A T L + D Y I + V+G L+I+P I+ + ++DSGT
Sbjct: 173 PYARDINLMWTKLIYYDHKPAYAIQQLDMMVNGIRLEIDPYIYISKMT----IVDSGTAD 228
Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPD-KLCY-HGIMSSDLKGFPTVRFHF-RGGA 342
T+++ ++AL D+ M + + + W + ++C+ S++ FPTV R
Sbjct: 229 TYILSPVFDAL-DKAMTKEMQAKGYTRGWDERRICFISNSGSANWNDFPTVEMKLIRSTL 287
Query: 343 KLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
KL +E + FY+ + C P R V ++G A + + + +DI F+
Sbjct: 288 KLPVE--NAFYESSNNVICSTFLPDDAGVRGV--QMLGNRAVRSFKLVFDIQAMNFGFKA 343
Query: 403 MDC 405
C
Sbjct: 344 RAC 346
>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
Length = 385
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 109/393 (27%), Positives = 164/393 (41%), Gaps = 40/393 (10%)
Query: 32 RLAYLQEKIRSHNTYQAQILPSNDIS--------NSLFY-VNISIGQPPVPQFTAMDTGS 82
R AY+Q K + S+ N+L Y + + +G P Q +DTGS
Sbjct: 14 RAAYIQRKFSGGGGAGGDVQRSDATVPTALGTSLNTLEYLITVGLGSPATSQTMLIDTGS 73
Query: 83 SLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYA--RCSAYKHRCIYTQ 140
+ WV C PC C Q +F PS SS+Y+ C S C CS+ +C Y
Sbjct: 74 DVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSADCAQLGQEGNGCSS-SSQCQYIV 132
Query: 141 LYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG-FSTNRNFKFSGIFGLGIGRSSL 199
Y G T+ S++ L ++ V+ FGC + N + G+ GLG G SL
Sbjct: 133 TYGDGSSTTGTYSSDTLALGSS-----AVRSFQFGCSNVESGFNDQTDGLMGLGGGAQSL 187
Query: 200 VSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLE 252
VSQ L +FSYC+ L G+ TP+ + Y + L+
Sbjct: 188 VSQTAGTLGRAFSYCLPPTPSSSGFLT-LGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQ 246
Query: 253 AISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSY 312
AI V GR L I ++F G ++DSGT +T L AY AL ++
Sbjct: 247 AIRVGGRQLSIPASVFS-----AGTVMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQP 301
Query: 313 SWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR 372
S C+ S + P+V F GGA ++L+ + + C+A + N+
Sbjct: 302 SGILDTCFDFSGQSSVS-IPSVALVFSGGAVVSLDASGIIL-----SNCLAF---AGNSD 352
Query: 373 YVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ +IG + Q+ + V YD+GR F+ C
Sbjct: 353 DSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 385
>gi|223973231|gb|ACN30803.1| unknown [Zea mays]
Length = 459
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 113/432 (26%), Positives = 182/432 (42%), Gaps = 59/432 (13%)
Query: 29 SIARLAYLQEKIRSHNTYQAQI-LPSNDISNSLF-------YVNISIGQPPVPQFTAMDT 80
S+AR +L+ + +H++ + PS + +L+ S+G PP P +DT
Sbjct: 27 SLARALHLKRRDPNHHSQKGSGGHPSVPATAALYPHSYGGYAFTASLGTPPQPLPVLLDT 86
Query: 81 GSSLLWVHC---YPCRDCSPQLGT---IFYPSRSSSYAYVPCDSEHCRYFPYAR------ 128
GS L WV C Y CR+CS + +F+P SSS V C + C++ A
Sbjct: 87 GSHLTWVPCTSSYECRNCSSPSASAVPVFHPKNSSSSRLVGCRNPSCQWVHSAANLATKC 146
Query: 129 ----CSAYKHRCIYTQLYLIGPETSVF--VSTEQLTFKNTDESTIH-VQDVVFGCGFSTN 181
CS C + P V+ ST L +T + V V GC +
Sbjct: 147 RRAPCSPGAANCPAAASNVCPPYAVVYGSGSTAGLLIADTLRAPGRAVPGFVLGCSLVSV 206
Query: 182 RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPL 240
SG+ G G G S+ +QL FSYC+ S D N + G + G +
Sbjct: 207 HQ-PPSGLAGFGRGAPSVPAQLGLPKFSYCLLSRRFDD---NAAVSGSLVLGGTGGGEGM 262
Query: 241 QFI-------------DGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVT 286
Q++ +YY+ L ++V G+ + + F + +G GG ++DSGT T
Sbjct: 263 QYVPLVKSAAGDKLPYGVYYYLALRGVTVGGKAVRLPARAFAANAAGSGGTIVDSGTTFT 322
Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL----CYHGIMSSDLKGFPTVRFHFRGGA 342
+L ++ + D V+ + G RS D+L C+ + P + FHF GGA
Sbjct: 323 YLDPTVFQPVADAVVAAVGGRYKRSKDAEDELGLHPCFALPQGARSMALPELSFHFEGGA 382
Query: 343 KLALEKDSMFY---QPRPDAFCMAV------NPASINNRYVNFSLIGMMAQQFYNVGYDI 393
+ L ++ F + +A C+AV + N ++G QQ Y V YD+
Sbjct: 383 VMQLPVENYFVVAGRGAVEAICLAVVTDFSGGSGAGNEGSGPAIILGSFQQQNYLVEYDL 442
Query: 394 GRKQKTFQRMDC 405
+++ F+R C
Sbjct: 443 EKERLGFRRQSC 454
>gi|255571584|ref|XP_002526738.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223533927|gb|EEF35652.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 457
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 119/432 (27%), Positives = 191/432 (44%), Gaps = 54/432 (12%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDIS--N 58
L+H S SP+ PN A Q I S AR ++ I S N + P + +S +
Sbjct: 40 LLHWLSTESPFYEPNLTLAELTQASIRTSGARGDSIRS-IMSGNITSSMKYPISRMSYTD 98
Query: 59 SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYP--CRDCSPQLGTIFYPSRSSSYAYVPC 116
+ + SIG P V + D+GSSL+W+ C CR+C Q +F PS+S +Y C
Sbjct: 99 KAYVMKFSIGSPAVDTYAIPDSGSSLVWLQCGTPYCRNCYRQKIPLFNPSKSVTYMKRLC 158
Query: 117 DSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTF-KNTDESTIHVQDV 172
++ CR Y RC C Y + YL T +ST+ TF ++ + +
Sbjct: 159 NTAECRVALGDEYWRCKKPNQICKYHEDYLDDSYTEGVISTDIFTFPEHISGFGNYTLRI 218
Query: 173 VFGCGFSTN--RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLIL--G 227
+FGCG++ + ++F G+ GL ++SLV Q++ FSYC+ S+ L + + G
Sbjct: 219 IFGCGYNNSDPQHFYPPGLVGLTNNKASLVGQMDVDQFSYCV-SIDTEQNLKGSMEIRFG 277
Query: 228 DGAIIDEGDATPLQFIDGHY-YITLEAISVDGRMLDINPN-IFKRDDSG-GGVMIDSGTD 284
A I + DG Y + ++ I V+ ++ P +FK + G GG+ +D+GT
Sbjct: 278 LAASISGHSTQLVPNSDGWYIFKNVDGIYVNEFEVEGYPAWVFKYTEGGQGGLTMDTGTT 337
Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD------KLCYHGIMSSDLKG--FPTVRF 336
T E + ++ D +I+L E + D +LCY S D G P +
Sbjct: 338 YT----ELHNSVMDP-LIKLLEEHITIVPEKDYSNSGFELCY---FSDDFLGATLPDIEL 389
Query: 337 HFRGGAKLALEKDSMFYQPRPDAF--------CMAVNPASINNRYVNFSLIGMMAQQFYN 388
F KD+ F +A+ C+A+ R S+IGM +
Sbjct: 390 RFTD------NKDTYFSFNTRNAWTPNGRSQMCLAM------FRTNGMSIIGMHQLRDIK 437
Query: 389 VGYDIGRKQKTF 400
+GYD+ +F
Sbjct: 438 IGYDLHHNIVSF 449
>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
Length = 368
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 108/365 (29%), Positives = 154/365 (42%), Gaps = 49/365 (13%)
Query: 78 MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYAR-------CS 130
+DTGS + V C + +F P+ S SY VPC S+ C C
Sbjct: 16 IDTGSEAVLVQC------GSRSRPVFDPAASQSYRQVPCISQLCLAVQQQTSNGSSQPCV 69
Query: 131 AYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ--DVVFGCGFSTNR---NFK 185
C Y+ Y ++ S + + +T+ S+ VQ DV FGC S +
Sbjct: 70 NSSAACTYSLSYGDSRNSTGDFSQDVIFLNSTNSSSQAVQFRDVAFGCAHSPQGFLVDLG 129
Query: 186 FSGIFGLGIGRSSLVSQLN-----SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDA-TP 239
GI G G SL SQL S FSYC S + LGD + + TP
Sbjct: 130 SLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQPWQPRATGVIFLGDSGLSKSKVSYTP 189
Query: 240 LQFIDGH--------YYITLEAISVDGRMLDINPNIFKRDDSG--GGVMIDSGTDVTWLV 289
L +D YY+ L +ISVDG+ L I + FK D S GG ++DSGT T +V
Sbjct: 190 L--LDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPSTGDGGTVLDSGTTFTRVV 247
Query: 290 KEAYEALRDEVMIR----LEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLA 345
+AY A R+ L + + + D CY+ S L G P VR + +L
Sbjct: 248 DDAYTAFRNAFAASNRSGLRKKVGAAAGFDD--CYNISAGSSLPGVPEVRLSLQNNVRLE 305
Query: 346 LEKDSMFYQPRPDA-----FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
L + +F P A C+A+ +S + + +++G Q Y V YD R + F
Sbjct: 306 LRFEHLFV-PVSAAGNEVTVCLAI-LSSQKSGFGKINVLGNYQQSNYLVEYDNERSRVGF 363
Query: 401 QRMDC 405
+R DC
Sbjct: 364 ERADC 368
>gi|357159298|ref|XP_003578403.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 442
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 103/367 (28%), Positives = 157/367 (42%), Gaps = 41/367 (11%)
Query: 67 IGQPPVPQFTAMDTGSSLLWVHCYPC---RDCSPQLGTIFYPSRSSSYAYVPC--DSEHC 121
IG PP +DTGS L+W C + C+ Q + S+SS++ VPC + C
Sbjct: 92 IGSPPQRTEALIDTGSDLIWTQCATTCLPKSCAKQGLPYYNLSQSSTFVPVPCADKAGFC 151
Query: 122 RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC----G 177
C C + Y G + TE F++ S + FGC
Sbjct: 152 AANGVHLC-GLDGSCTFIASYGAGRVIGS-LGTESFAFESGTTS------LAFGCVSLTR 203
Query: 178 FSTNRNFKFSGIFGLGIGRSSLVSQLNSS-FSYCIGS-LHDPDYLHNKLILGDGAIIDEG 235
++ SG+ GLG GR SLVSQ+ ++ FSYC+ H + + ++ G
Sbjct: 204 ITSGALNDASGLIGLGRGRLSLVSQIGATRFSYCLTPYFHSSGASSHLFVGASASLGGGG 263
Query: 236 DATPLQFIDG--------HYYITLEAISV-DGRMLDINPNIFK-----RDDSGGGVMIDS 281
+ P F+ YY+ LE I+V R+ +N F+ + GGV+ID+
Sbjct: 264 ASMP--FVKSPKDYPYSTFYYLPLEGITVGKTRLPAVNSTTFQLRQLFKGYWAGGVIIDT 321
Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
G+ +T L AYEAL++EV +L + L K P + FHF GG
Sbjct: 322 GSPLTQLASHAYEALKEEVAAQLGNGSLVPAPEDSGLELCVAREGFQKVVPALVFHFGGG 381
Query: 342 AKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQ 401
A +A+ S + A CM + + S+IG QQ ++ YD+ R + +FQ
Sbjct: 382 ADMAVPAASYWAPVDKAAACMMILEGGYD------SIIGNFQQQDMHLLYDLRRGRFSFQ 435
Query: 402 RMDCEVL 408
DC +L
Sbjct: 436 TADCTML 442
>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 503
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 106/359 (29%), Positives = 152/359 (42%), Gaps = 32/359 (8%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYVPCDSE 119
+ V I +G PP DTGS WV C PC C Q +F P++SS+YA V C
Sbjct: 163 YVVPIGLGTPPSRFTVVFDTGSDTTWVQCRPCVVSCYKQKDRLFDPAKSSTYANVSCADP 222
Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
C + C+A C+Y Y G T F + + L ++ FGCG
Sbjct: 223 ACADLDASGCNA--GHCLYGIQYGDGSYTVGFFAKDTLAVAQD-----AIKGFKFGCG-E 274
Query: 180 TNRNF--KFSGIFGLGIGRSSLVSQL----NSSFSYCI-GSLHDPDYLHNKLILGDGAII 232
NR + +G+ GLG G +S+ Q SFSYC+ S YL + +
Sbjct: 275 KNRGLFGQTAGLLGLGRGPTSITVQAYEKYGGSFSYCLPASSAATGYLEFGPLSPSSSGS 334
Query: 233 DEGDATPLQFIDG--HYYITLEAISVDGRMLDINP-NIFKRDDSGGGVMIDSGTDVTWLV 289
+ TP+ G YY+ L I V G+ L P ++F S G ++DSGT +T L
Sbjct: 335 NA-KTTPMLTDKGPTFYYVGLTGIRVGGKQLGAIPESVF----SNSGTLVDSGTVITRLP 389
Query: 290 KEAYEALRDEVMIRLEG---EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
AY AL + ++ +YS D CY S + PTV F+GGA L L
Sbjct: 390 DTAYAALSSAFAAAMAASGYKKAAAYSILDT-CYDFTGLSQVS-LPTVSLVFQGGACLDL 447
Query: 347 EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ + Y C+ + N + ++G Q+ Y V YD+ +K F C
Sbjct: 448 DASGIVYAISQSQVCLGF---ASNGDDESVGIVGNTQQRTYGVLYDVSKKVVGFAPGAC 503
>gi|297819968|ref|XP_002877867.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323705|gb|EFH54126.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 469
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 122/463 (26%), Positives = 190/463 (41%), Gaps = 83/463 (17%)
Query: 8 LSPYNNPNENPAH---RVQRGINISIARLAYL---------QEKIRSHNTYQAQILPSND 55
LSP+++ +++P ++R SIAR L +E + S T A ++ S+
Sbjct: 23 LSPFSHSDQSPKDPYLSLRRLAESSIARAHKLKHGTSIKPDEEALSSTATASATVVKSHL 82
Query: 56 ISNSL--FYVNISIGQPPVPQFTAMDTGSSLLWVHC---YPCRDCS-----PQLGTIFYP 105
S + V++S G P DTGSSL+W C Y C DC+ P F P
Sbjct: 83 SPKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVWFPCTSRYLCSDCNFSGLDPTQIPRFIP 142
Query: 106 SRSSSYAYVPCDSEHCRYFPYAR-----CSAYKHRCI-----YTQLYLIGPETSVFVSTE 155
SSS + C + C++ A C C Y Y +G + +S E
Sbjct: 143 KNSSSSRVIGCQNPKCQFLFGANVQCRGCDPNTRNCTVPCPPYILQYGLGSTAGILIS-E 201
Query: 156 QLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSL 214
+L F + + V D V GC + R +GI G G G SL SQ+ SFS+C+ S
Sbjct: 202 KLDFPD-----LTVPDFVVGCSVISTRT--PAGIAGFGRGPESLPSQMKLKSFSHCLVSR 254
Query: 215 H-DPDYLHNKLILGDGAIIDEGDATP---------------LQFIDGHYYITLEAISVDG 258
D + L L G+ G TP F++ +YY+ L I V
Sbjct: 255 RFDDTNVTTDLGLDTGSGHKSGSKTPGLSYTPFRKNPNVSNTAFLE-YYYLNLRRIYVGS 313
Query: 259 RMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK 317
+ + I +G GG ++DSG+ T++ + +E + +E QM +Y+
Sbjct: 314 KHVKIPYKFLAPGTNGNGGSIVDSGSTFTFMERPVFELVAEEF-----ATQMSNYTREKD 368
Query: 318 L--------CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMF-YQPRPDAFCM------ 362
L C++ D+ P + F F+GGAK+ L + F + D C+
Sbjct: 369 LEKVSGIAPCFNISGKGDVT-VPELIFEFKGGAKMELPLSNYFSFVGNADTVCLTVVSDN 427
Query: 363 AVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
VNP + ++G QQ Y V YD+ + F + C
Sbjct: 428 TVNPGGGTGPAI---ILGSFQQQNYLVEYDLENDRFGFAKKKC 467
>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
Length = 641
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 110/435 (25%), Positives = 181/435 (41%), Gaps = 79/435 (18%)
Query: 11 YNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQP 70
Y N PA +RG+ HN L + ++N + + IG P
Sbjct: 54 YPNATRLPASSARRGLG-------------DGHNPNARMRLHDDLLTNGYYTTRLYIGTP 100
Query: 71 PVPQFTAMDTGSSLLWVHCYPCRDC------SPQL----GTIFYPSRSSSYAYVPCDSEH 120
+D+GS++ +V C C C SP + F P SS+Y+ V C+ +
Sbjct: 101 SQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLSSTYSPVKCNVD- 159
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C + +C Y + Y +S + + ++F ES + Q VFGC +T
Sbjct: 160 ------CTCDNERSQCTYERQYAEMSSSSGVLGEDIMSFGK--ESELKPQRAVFGCE-NT 210
Query: 181 NRNFKFS----GIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLILGDGA 230
FS GI GLG G+ S++ QL + SFS C G + +G G
Sbjct: 211 ETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMD----------VGGGT 260
Query: 231 IIDEGDATPLQFIDGH--------YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSG 282
++ G P + H Y I L+ I V G+ L ++P IF +S G ++DSG
Sbjct: 261 MVLGGMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIF---NSKHGTVLDSG 317
Query: 283 TDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK----LCYHGI---MSSDLKGFPTVR 335
T +L ++A+ A +D V ++ ++ PD +C+ G +S + FP V
Sbjct: 318 TTYAYLPEQAFVAFKDAVTNKV--NSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVD 375
Query: 336 FHFRGGAKLALEKDSMFYQ--PRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
F G KL+L ++ ++ A+C+ V N +L+G + + V YD
Sbjct: 376 MVFGNGQKLSLSPENYLFRHSKVEGAYCLGV----FQNGKDPTTLLGGIVVRNTLVTYDR 431
Query: 394 GRKQKTFQRMDCEVL 408
++ F + +C L
Sbjct: 432 HNEKIGFWKTNCSEL 446
>gi|147819672|emb|CAN76394.1| hypothetical protein VITISV_020864 [Vitis vinifera]
Length = 507
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 106/421 (25%), Positives = 181/421 (42%), Gaps = 60/421 (14%)
Query: 38 EKIRSHNTYQ-AQILPSNDI---------SNSLFYVNISIGQPPVPQFTAMDTGSSLLWV 87
+ +R+H+T + +IL + D+ L++ I IG P + +DTGS +LWV
Sbjct: 45 DALRAHDTRRHGRILSAVDLPLGGNGHPSEAGLYFAKIGIGTPSKDYYVQVDTGSDILWV 104
Query: 88 HCYPCRDCSPQ--LG---TIFYPSRSSSYAYVPCDSEHCRYF--PYARCSAYKHRCIYTQ 140
+C C C + LG T++ S++ V CD C + P C +C+Y+
Sbjct: 105 NCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVGCDDNFCSLYDGPLPGCKP-GLQCLYSV 163
Query: 141 LYLIGPETSVFVSTEQLTFKNTD---ESTIHVQDVVFGCGFSTNRNFKFS-----GIFGL 192
LY G T+ + + + + ++T VVFGCG + S GI G
Sbjct: 164 LYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGF 223
Query: 193 GIGRSSLVSQLNSS------FSYC-----------IGSLHDPDYLHNKLILGDGAIIDEG 235
G SS++SQL SS FS+C IG + +P + +L + +I
Sbjct: 224 GQANSSMLSQLASSGKVKKVFSHCLDNVDGGGIFAIGEVVEPKV---RFLLMNSVMI--- 277
Query: 236 DATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEA 295
L HY + ++ I V G LD+ + F+ D G +IDSGT + + +E Y
Sbjct: 278 --VVLFLSRAHYNVVMKEIEVGGDPLDVPSDAFESGDR-KGTIIDSGTTLAYFPQEVYVP 334
Query: 296 LRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQP 355
L ++++ + ++ + C+ + D GFPTV HF L + +Q
Sbjct: 335 LIEKILSQQPDLRLHTVEQA-FTCFDYTGNVD-DGFPTVTLHFDKSISLTVYPHEYLFQV 392
Query: 356 RPDAFCMAV-NPASINNRYVNFSLIGMMAQQFYNVGYDIGR-----KQKTFQRMDCEVLD 409
+ +C+ N + + +L+G AQ + G +G+ ++ R C V +
Sbjct: 393 KEFEWCIGWQNSGAQTKDGKDLTLLGEDAQCTCHFGSCMGQYKVFPHRELISRPHCPVEE 452
Query: 410 D 410
D
Sbjct: 453 D 453
>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
Length = 452
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 102/356 (28%), Positives = 151/356 (42%), Gaps = 48/356 (13%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSP---QLGTIFYPSRSSSYAYVPCD 117
+ +++ +G P V Q +DTGS + WV C PC SP G +F P+ SS+YA C
Sbjct: 108 YVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFNCS 167
Query: 118 SEHCRYFPYA----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
+ C + C A K RC Y Y G T+ S++ LT +D V+
Sbjct: 168 AAACAQLGDSGEANGCDA-KSRCQYIVKYGDGSNTTGTYSSDVLTLSGSDV----VRGFQ 222
Query: 174 FGCG---FSTNRNFKFSGIFGLGIGRSSLVSQLNS----SFSYCI-GSLHDPDYLH-NKL 224
FGC + K G+ GLG S VSQ + SF YC+ + +L
Sbjct: 223 FGCSHAELGAGMDDKTDGLIGLGGDAQSPVSQTAARYGKSFFYCLPATPASSGFLTLGAP 282
Query: 225 ILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDS 281
G G TP+ + + +Y+ LE I+V G+ L ++P++F G ++DS
Sbjct: 283 ASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLSPSVFA-----AGSLVDS 337
Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL-----CYHGIMSSDLKGFPTVRF 336
GT +T L AY AL M Y+ + L C++ D PTV
Sbjct: 338 GTVITRLPPAAYAALSSAFR-----AGMTRYARAEPLGILDTCFN-FTGLDKVSIPTVAL 391
Query: 337 HFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
F GGA + L+ + C+A P + F IG + Q+ + V YD
Sbjct: 392 VFAGGAVVDLDAHGIV-----SGGCLAFAPTRDDKA---FGTIGNVQQRTFEVLYD 439
>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
Length = 427
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 97/392 (24%), Positives = 174/392 (44%), Gaps = 45/392 (11%)
Query: 48 AQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYP---CRDCSPQLGTIFY 104
++++ + I + ++V + +G P +DTGS L W+ C P + S +
Sbjct: 46 SRLVSGSSIGSGQYFVELRVGTPAKKFPLIVDTGSDLTWIQCNPPNTTANSSSPPAPWYD 105
Query: 105 PSRSSSYAYVPCDSEHCRYFPY---ARCS-AYKHRCIYTQLYLIGPETSVFVSTEQLTFK 160
S SSSY +PC + C++ P + CS C YT Y T+ ++ E ++ K
Sbjct: 106 KSSSSSYREIPCTDDECQFLPAPIGSSCSITSPSPCDYTYGYSDQSRTTGILAYETISMK 165
Query: 161 NTDES----------TIHVQDVVFGCGF-STNRNF-KFSGIFGLGIGRSSLVSQ-----L 203
+ S I +++V GC S +F SG+ GLG G SL +Q L
Sbjct: 166 SRKRSGKRAGNHKTRRIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTAL 225
Query: 204 NSSFSYCIGSLHDPDYLHNK-----LILGDGAIIDEGDATPL---QFIDGHYYITLEAIS 255
FSYC+ DYL L++G TP+ YY+ + ++
Sbjct: 226 GGIFSYCL-----VDYLRGSNASSFLVMGRTHWRKLAH-TPIVRNPAAQSFYYVNVTGVA 279
Query: 256 VDGRMLD-INPNIFKRDDSGG-GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYS 313
VDG+ +D I + + D G G + DSGT +++L + AY + + + + +
Sbjct: 280 VDGKPVDGIASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIP 339
Query: 314 WPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRY 373
+LCY+ ++ KG P + F+GGA + L ++ + C+A+ + N
Sbjct: 340 EGFELCYN--VTRMEKGMPKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQKVTTTN-- 395
Query: 374 VNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+++G + QQ +++ YD+ + + F+ C
Sbjct: 396 -GSNILGNLLQQDHHIEYDLAKARIGFKWSPC 426
>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
Length = 642
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 110/435 (25%), Positives = 181/435 (41%), Gaps = 79/435 (18%)
Query: 11 YNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQP 70
Y N PA +RG+ HN L + ++N + + IG P
Sbjct: 55 YPNATRLPASSARRGLG-------------DGHNPNARMRLHDDLLTNGYYTTRLYIGTP 101
Query: 71 PVPQFTAMDTGSSLLWVHCYPCRDC------SPQL----GTIFYPSRSSSYAYVPCDSEH 120
+D+GS++ +V C C C SP + F P SS+Y+ V C+ +
Sbjct: 102 SQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLSSTYSPVKCNVD- 160
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C + +C Y + Y +S + + ++F ES + Q VFGC +T
Sbjct: 161 ------CTCDNERSQCTYERQYAEMSSSSGVLGEDIMSFGK--ESELKPQRAVFGCE-NT 211
Query: 181 NRNFKFS----GIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLILGDGA 230
FS GI GLG G+ S++ QL + SFS C G + +G G
Sbjct: 212 ETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMD----------VGGGT 261
Query: 231 IIDEGDATPLQFIDGH--------YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSG 282
++ G P + H Y I L+ I V G+ L ++P IF +S G ++DSG
Sbjct: 262 MVLGGMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIF---NSKHGTVLDSG 318
Query: 283 TDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK----LCYHGI---MSSDLKGFPTVR 335
T +L ++A+ A +D V ++ ++ PD +C+ G +S + FP V
Sbjct: 319 TTYAYLPEQAFVAFKDAVTNKV--NSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVD 376
Query: 336 FHFRGGAKLALEKDSMFYQ--PRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
F G KL+L ++ ++ A+C+ V N +L+G + + V YD
Sbjct: 377 MVFGNGQKLSLSPENYLFRHSKVEGAYCLGV----FQNGKDPTTLLGGIVVRNTLVTYDR 432
Query: 394 GRKQKTFQRMDCEVL 408
++ F + +C L
Sbjct: 433 HNEKIGFWKTNCSEL 447
>gi|125536523|gb|EAY83011.1| hypothetical protein OsI_38231 [Oryza sativa Indica Group]
Length = 469
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 112/398 (28%), Positives = 178/398 (44%), Gaps = 42/398 (10%)
Query: 34 AYLQEKIRSH-NTYQAQILPSNDISNSL--FYVNISIGQPPVPQFTAM-DTGSSLLWVHC 89
+L++++R+ N Q Q L S + +NI++G P + + D S +W C
Sbjct: 58 GFLKKQLRNRGNKQQQQQLGGEAASGAAPPLVINITVGTPVAQTVSGLVDITSYFVWAQC 117
Query: 90 YPCRDCS---PQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARC--------SAYKHRCIY 138
PC + P T F P+ S++++ +PC S+ C C + RC
Sbjct: 118 APCAAAAGCLPPPATAFRPNGSATFSPLPCSSDMCLPVLRETCGRAGAAANATAGARCDS 177
Query: 139 TQLYLIG--PETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFK-FSGIFGLGIG 195
L G TS +++T+ TF T V VVFGC ++ +F SG+ G+G G
Sbjct: 178 YSLTYGGSAANTSGYLATDTFTFGAT-----AVPGVVFGCSDASYGDFAGASGVIGIGRG 232
Query: 196 RSSLVSQLN-SSFSYCIGSLHDPD--YLHNKLILGDGAI--IDEGDATPL---QFIDGHY 247
SL+SQL FSY + + D + + GD A+ G +TPL Y
Sbjct: 233 NLSLISQLQFGKFSYQLLAPEATDDGSADSVIRFGDDAVPKTKRGQSTPLLSSTLYPDFY 292
Query: 248 YITLEAISVDGRMLDINP-NIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLE 305
Y+ L + VDG LD P F R + GGV++ S T VT+L + AY+ +R V R+
Sbjct: 293 YVNLTGVRVDGNRLDAIPAGTFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAAVASRIG 352
Query: 306 GEQMR-SYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAF-CMA 363
+ S + LCY+ + +K P + F GGA + L + FY C+
Sbjct: 353 LPAVNGSAALELDLCYNASSMAKVK-VPKLTLVFDGGADMDLSAANYFYIDNDTGLECLT 411
Query: 364 VNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQ 401
+ P+ S++G + Q N+ YD+ + TF+
Sbjct: 412 MLPSQ------GGSVLGTLLQTGTNMIYDVDAGRLTFE 443
>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
Length = 464
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 100/358 (27%), Positives = 138/358 (38%), Gaps = 38/358 (10%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYVPCDSE 119
+ V I IG P DTGS L W C PC C Q F PS SS+Y V C S
Sbjct: 132 YIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSTYQNVSCSSP 191
Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
C CSA C+Y+ Y T F++ E+ T N+D ++DV FGCG +
Sbjct: 192 MCE--DAESCSA--SNCVYSIGYGDKSFTQGFLAKEKFTLTNSDV----LEDVYFGCGEN 243
Query: 180 TNRNFKFSGIFGLGIGR-----SSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
F + + N+ FSYC+ S H L G I +
Sbjct: 244 NQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTSNSTGH--LTFGSAGISES 301
Query: 235 GDATPLQFIDG--HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEA 292
TP+ +Y I + ISV + L I PN F + G +IDSGT T L +
Sbjct: 302 VKFTPISSFPSAFNYGIDIIGISVGDKELAITPNSFSTE----GAIIDSGTVFTRLPTKV 357
Query: 293 YEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMF 352
Y LR ++ + S CY D +PT+ F F GG + L+ +
Sbjct: 358 YAELRSVFKEKMSSYKSTSGYGLFDTCYD-FTGLDTVTYPTIAFSFAGGTVVELDGSGIS 416
Query: 353 YQPRPDAFCMAVN-----PASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ C+A PA + G + Q +V YD+ + F C
Sbjct: 417 LPIKISQVCLAFAGNDDLPA----------IFGNVQQTTLDVVYDVAGGRVGFAPNGC 464
>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 110/395 (27%), Positives = 160/395 (40%), Gaps = 54/395 (13%)
Query: 32 RLAYLQEKIRSHNTYQAQILP-----SNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLW 86
R Y+Q K+ + Q L + + + + + IG P V Q +DTGS + W
Sbjct: 95 RAKYIQRKLSGTDGLQPLDLTVPTTLGSALDTMEYVITVGIGSPAVTQTMMIDTGSDVSW 154
Query: 87 VHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGP 146
V C S T+F PS+S++YA C S C C Y Y G
Sbjct: 155 VRCN-----STDGLTLFDPSKSTTYAPFSCSSAACAQLGNNGDGCSNSGCQYRVQYGDGS 209
Query: 147 ETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF---KFSGIFGLGIGRSSLVSQL 203
T+ S++ L +D V D FGC +F K G+ GLG SLVSQ
Sbjct: 210 NTTGTYSSDTLALSASDT----VTDFHFGCSHH-EEDFDGEKIDGLMGLGGDAQSLVSQT 264
Query: 204 NS----SFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQF----IDGHYYITLEAIS 255
+ SFSYC L + L G G T Y + L+ IS
Sbjct: 265 AATYGKSFSYC---LPPTNRTSGFLTFGAPNGTSGGFVTTPMLRWPKAPTLYGVLLQDIS 321
Query: 256 VDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEV---MIRLEGEQMRSY 312
V G L I P++ G ++DSGT +TWL + AY AL M RL ++
Sbjct: 322 VGGTPLGIQPSVLSN-----GSVMDSGTVITWLPRRAYSALSSAFRSSMTRLRHQRAAPL 376
Query: 313 SWPDKLCYH--GIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASIN 370
D CY G+++ + P V GGA + L+ + + Q C+A S +
Sbjct: 377 GILDT-CYDFTGLVNVSI---PAVSLVLDGGAVVDLDGNGIMIQD-----CLAFAATSGD 427
Query: 371 NRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
S+IG + Q+ + V +D+G+ F+ C
Sbjct: 428 ------SIIGNVQQRTFEVLHDVGQGVFGFRSGAC 456
>gi|125554529|gb|EAZ00135.1| hypothetical protein OsI_22138 [Oryza sativa Indica Group]
Length = 472
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 114/409 (27%), Positives = 177/409 (43%), Gaps = 66/409 (16%)
Query: 37 QEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYP----C 92
+E+I S ++ + ++ + I++ LF + +S+G+PPV A+DTGS+L WV C P C
Sbjct: 90 EEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHC 149
Query: 93 RDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPY------ARCSAYKHRCIYTQLYLIGP 146
S + G IF P RS + V C S C Y A C + C Y+ Y G
Sbjct: 150 HTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGW 209
Query: 147 ETSV-FVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN- 204
SV + T+ L ++ D++FGC + +GIFG G S QL
Sbjct: 210 AYSVGKMVTDTLRIGDS------FMDLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAG 263
Query: 205 -------SSFSYCIGSLH-DPDYLHNKLILG--DGAIIDEGDATPLQFIDG-HYYITLEA 253
+FSYC+ + P Y +ILG D A +D G + + I+ Y +T+E
Sbjct: 264 YPDILSYKAFSYCLPTDETKPGY----MILGRYDRAAMDGGYTSLFRSINRPTYSLTMEM 319
Query: 254 ISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMI---------RL 304
+ +G+ L S +++DSG T L + AL D+ + R
Sbjct: 320 LIANGQRLVT---------SSSEMIVDSGAQRTSLWPSTF-ALLDKTITQAMSSIGYHRT 369
Query: 305 EGEQMRSY----SWPDKLCYHGIMS--SDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPD 358
+ SY S D ++G ++ S+ P + F GGA LAL ++FY
Sbjct: 370 SRARQESYICYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHR 429
Query: 359 AFCM--AVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
CM A NPA + ++G + + +DI KQ F+ C
Sbjct: 430 GLCMTFAQNPA------LRSQILGNRVTRSFGTTFDIQGKQFGFKYAAC 472
>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
Length = 543
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 113/439 (25%), Positives = 170/439 (38%), Gaps = 71/439 (16%)
Query: 14 PNENPAHR---VQRGINISIARLAYLQEKIRSHNTYQAQI--------LPSNDISNSLFY 62
P+++PA ++R + +R Q +IR+ A L S +L Y
Sbjct: 126 PDDDPAAHDRYLRRLLAADESRANSFQLRIRNDRAAAASTQSGSAEVPLTSGIRFQTLNY 185
Query: 63 VNI------SIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPC 116
V S G P +DTGS L WV C PC C Q +F P+ S++YA V C
Sbjct: 186 VTTIALGGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRC 245
Query: 117 DSEHCRYFPYA------RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
++ C A C RC Y Y G + ++T+ + +
Sbjct: 246 NASACAASLKAATGTPGSCGGGNERCYYALAYGDGSFSRGVLATDTVALGGAS-----LD 300
Query: 171 DVVFGCGFSTNRNFKFSGIFGL-GIGRS--SLVSQL----NSSFSYCIGSLHDPDYLHNK 223
VFGCG S NR F G GL G+GR+ SLVSQ FSYC+ + D +
Sbjct: 301 GFVFGCGLS-NRGL-FGGTAGLMGLGRTELSLVSQTALRYGGVFSYCLPATTSGDASGSL 358
Query: 224 LILGDGAIIDEGDATPLQFID--------GHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
+ GD + + TP+ + Y++ + +V G L +
Sbjct: 359 SLGGDASSYR--NTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTAL------AAQGLGAS 410
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK-------LCYHGIMSSDL 328
V+IDSGT +T L Y +R E Q + +P CY + D
Sbjct: 411 NVLIDSGTVITRLAPSVYRGVRAEFT-----RQFAAAGYPTAPGFSILDTCYD-LTGHDE 464
Query: 329 KGFPTVRFHFRGGAKLALEKDSMFYQPRPDA--FCMAVNPASINNRYVNFSLIGMMAQQF 386
P + GGA++ ++ M + R D C+A+ S ++ +IG Q+
Sbjct: 465 VKVPLLTLRLEGGAEVTVDAAGMLFVVRKDGSQVCLAMASLSYEDQT---PIIGNYQQKN 521
Query: 387 YNVGYDIGRKQKTFQRMDC 405
V YD + F DC
Sbjct: 522 KRVVYDTVGSRLGFADEDC 540
>gi|242117573|dbj|BAH80056.1| hypothetical protein [Oryza sativa Indica Group]
Length = 469
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 112/398 (28%), Positives = 178/398 (44%), Gaps = 42/398 (10%)
Query: 34 AYLQEKIRSH-NTYQAQILPSNDISNSL--FYVNISIGQPPVPQFTAM-DTGSSLLWVHC 89
+L++++R+ N Q Q L S + +NI++G P + + D S +W C
Sbjct: 58 GFLKKQLRNRGNKQQQQQLGGEAASGAAPPLVINITVGTPVAQTVSGLVDITSYFVWAQC 117
Query: 90 YPCRDCS---PQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARC--------SAYKHRCIY 138
PC + P T F P+ S++++ +PC S+ C C + RC
Sbjct: 118 APCAAAAGCLPPPATAFRPNGSATFSPLPCSSDMCLPVLRETCGRAGAAANATAGARCDS 177
Query: 139 TQLYLIG--PETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFK-FSGIFGLGIG 195
L G TS +++T+ TF T V VVFGC ++ +F SG+ G+G G
Sbjct: 178 YSLTYGGSAANTSGYLATDTFTFGAT-----AVPGVVFGCSDASYGDFAGASGVIGIGRG 232
Query: 196 RSSLVSQLN-SSFSYCIGSLHDPD--YLHNKLILGDGAI--IDEGDATPL---QFIDGHY 247
SL+SQL FSY + + D + + GD A+ G +TPL Y
Sbjct: 233 NLSLISQLQFGKFSYQLLAPEATDDGSADSVIRFGDDAVPKTKRGRSTPLLSSTLYPDFY 292
Query: 248 YITLEAISVDGRMLDINP-NIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLE 305
Y+ L + VDG LD P F R + GGV++ S T VT+L + AY+ +R V R+
Sbjct: 293 YVNLTGVRVDGNRLDAIPAGTFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAAVASRIG 352
Query: 306 GEQMR-SYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAF-CMA 363
+ S + LCY+ + +K P + F GGA + L + FY C+
Sbjct: 353 LPAVNGSAALELDLCYNASSMAKVK-VPKLTLVFDGGADMDLSAANYFYIDNDTGLECLT 411
Query: 364 VNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQ 401
+ P+ S++G + Q N+ YD+ + TF+
Sbjct: 412 MLPSQ------GGSVLGTLLQTGTNMIYDVDAGRLTFE 443
>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 469
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 100/380 (26%), Positives = 156/380 (41%), Gaps = 53/380 (13%)
Query: 50 ILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DCSPQLGTIFYPSRS 108
+ P ++ + + +G P +DTGSSL W+ C PC C Q G +F P S
Sbjct: 120 LTPGASVAVGNYVTRLGLGTPATSYVMVVDTGSSLTWLQCSPCSVSCHRQAGPVFDPRAS 179
Query: 109 SSYAYVPCDSEHCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD 163
+YA V C S C A CS + CIY Y + ++S + ++F +
Sbjct: 180 GTYAAVQCSSSECGELQAATLNPSACSV-SNVCIYQASYGDSSYSVGYLSKDTVSFGSGS 238
Query: 164 ESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPD 218
+ +GCG F + +G+ GL + SL+ QL S FSYC+ +
Sbjct: 239 FPGFY-----YGCGQDNEGLFGRSAGLIGLAKNKLSLLYQLAPSLGYAFSYCLPTSS--- 290
Query: 219 YLHNKLILGDGAIIDEGDATPLQF---------IDGH-YYITLEAISVDGRMLDINPNIF 268
+ G P Q+ +D Y++TL ISV G L + P+ +
Sbjct: 291 --------AAAGYLSIGSYNPGQYSYTPMASSSLDASLYFVTLSGISVAGAPLAVPPSEY 342
Query: 269 KRDDSGGGVMIDSGTDVTWLVKEAYEAL--RDEVMIRLEGEQMRSYSWPDKLCYHGIMSS 326
+ + +IDSGT +T L Y AL + + +YS D C+ G ++
Sbjct: 343 RSLPT----IIDSGTVITRLPPNVYTALSRAVAAAMASAAPRAPTYSILDT-CFRG-SAA 396
Query: 327 DLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQF 386
L+ P V F GGA LAL ++ C+A P ++IG QQ
Sbjct: 397 GLR-VPRVDMAFAGGATLALSPGNVLIDVDDSTTCLAFAPTG------GTAIIGNTQQQT 449
Query: 387 YNVGYDIGRKQKTFQRMDCE 406
++V YD+ + + F C
Sbjct: 450 FSVVYDVAQSRIGFAAGGCS 469
>gi|340810987|gb|AEK75420.1| S5 [Oryza rufipogon]
gi|340810989|gb|AEK75421.1| S5 [Oryza rufipogon]
gi|340810991|gb|AEK75422.1| S5 [Oryza rufipogon]
gi|340811001|gb|AEK75427.1| S5 [Oryza rufipogon]
gi|340811019|gb|AEK75436.1| S5 [Oryza rufipogon]
gi|340811104|gb|AEK75478.1| S5 [Oryza rufipogon]
gi|340811124|gb|AEK75488.1| S5 [Oryza rufipogon]
Length = 472
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 116/410 (28%), Positives = 177/410 (43%), Gaps = 68/410 (16%)
Query: 37 QEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYP----C 92
+E+I S ++ + ++ + I++ LF + +S+G+PPV A+DTGS+L WV C P C
Sbjct: 90 EEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHC 149
Query: 93 RDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPY------ARCSAYKHRCIYTQLYLIGP 146
S + G IF P RS + V C S C Y A C ++ C Y+ Y G
Sbjct: 150 HTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKENSCTYSVTYGNGW 209
Query: 147 ETSV-FVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN- 204
SV + T+ L ++ D++FGC + +GIFG G S QL
Sbjct: 210 AYSVGKMVTDTLRIGDS------FMDLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAG 263
Query: 205 -------SSFSYCIGSLH-DPDYLHNKLILG--DGAIIDEGDATPL--QFIDGHYYITLE 252
+FSYC+ + P Y +ILG D A +D G TPL Y +T+E
Sbjct: 264 YPDILSYKAFSYCLPTDETKPGY----MILGRYDRAAMD-GGYTPLFRSINRPTYSLTME 318
Query: 253 AISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMI---------R 303
+ +G+ L S +++DSG T L + AL D+ + R
Sbjct: 319 MLIANGQRLVT---------SSSEMIVDSGAQRTSLWPSTF-ALLDKTITQAMSSIGYHR 368
Query: 304 LEGEQMRSY----SWPDKLCYHGIMS--SDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP 357
+ SY S D ++G ++ S+ P + F GGA LAL ++FY
Sbjct: 369 TSRARQESYICYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPH 428
Query: 358 DAFCM--AVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
CM A NPA + ++G + + +DI KQ F+ C
Sbjct: 429 RGLCMTFAQNPA------LRSQILGNRVTRSFGTTFDIQGKQFGFKYAAC 472
>gi|384252236|gb|EIE25712.1| acid protease [Coccomyxa subellipsoidea C-169]
Length = 599
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 108/388 (27%), Positives = 165/388 (42%), Gaps = 66/388 (17%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQ-LGTIFYPSRSSSYAYVPCDS 118
FY + +G P +DTGS++ +V C C R+C P F P+ SSS A + CDS
Sbjct: 62 FYATLHLGTPARQFAVIVDTGSTITYVPCASCGRNCGPHHKDAAFDPASSSSSAVIGCDS 121
Query: 119 EHCRYF-PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
+ C P CS K C Y + Y ++ + ++QL ++ +VVFGC
Sbjct: 122 DKCICGRPPCGCSE-KRECTYQRTYAEQSSSAGLLVSDQLQLRDG------AVEVVFGCE 174
Query: 178 FSTNR---NFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGD 228
N + GI GLG SLV+QL S F+ C GS+ L+LGD
Sbjct: 175 TKETGEIYNQEADGILGLGNSEVSLVNQLAGSGVIDDVFALCFGSVEG----DGALMLGD 230
Query: 229 GAIIDEGD-ATPLQFI-------DGHYY-ITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
+D + LQ+ HYY + LEA+ V G+ L + P +R + G G ++
Sbjct: 231 ---VDAAEYDVALQYTALLSSLAHPHYYSVQLEALWVGGQQLPVKP---ERYEEGYGTVL 284
Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD----------KLCY-------HG 322
DSGT T+L EA++ ++ V + S PD +C+ H
Sbjct: 285 DSGTTFTYLPSEAFQLFKEAVSAYALEHGLNSVKGPDPKEKSFAQFHDICFGGAPHAGHA 344
Query: 323 IMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPD--AFCMAV--NPASINNRYVNFSL 378
S K FP F G +L + + + A+C+ V N AS +L
Sbjct: 345 DQSKLEKVFPVFELQFADGVRLRTGPLNYLFMHTGEMGAYCLGVFDNGAS-------GTL 397
Query: 379 IGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
+G ++ + V YD ++ F C+
Sbjct: 398 LGGISFRNILVQYDRRNRRVGFGAASCQ 425
>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 478
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 90/372 (24%), Positives = 157/372 (42%), Gaps = 34/372 (9%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ--LGT---IFYPSRSSSYA 112
L+Y I IG PP +DTGS +LWV+C C +C + +G ++ P SS+
Sbjct: 70 TGLYYARIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSSTST 129
Query: 113 YVPCDSEHCRY---FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFK---NTDEST 166
+ CD C P C C Y +Y G T+ + + + + +++
Sbjct: 130 LITCDQPFCSATYDAPIPGCKP-DLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHKTS 188
Query: 167 IHVQDVVFGCGFSTNRNFKFS-----GIFGLGIGRSSLVSQLNSS------FSYCIGSLH 215
+VFGCG + S GI G G SS++SQL ++ F++C+
Sbjct: 189 ETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCL---- 244
Query: 216 DPDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG 274
D + I G +++ + TP+ HY + L + V LD+ +F+
Sbjct: 245 --DSISGGGIFAIGEVVEPKLKTTPVVPNQAHYNVVLNGVKVGDTALDLPLGLFETSYKR 302
Query: 275 GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTV 334
G + IDSGT + +L Y L ++++ ++R+ D+ + GFPTV
Sbjct: 303 GAI-IDSGTTLAYLPDSIYLPLMEKILGAQPDLKLRTVD--DQFTCFVFDKNVDDGFPTV 359
Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVN-FSLIGMMAQQFYNVGYDI 393
F F L + +Q R D +C+ + ++ N +L+G + Q V Y++
Sbjct: 360 TFKFEESLILTIYPHEYLFQIRDDVWCVGWQNSGAQSKDGNEVTLLGDLVLQNKLVYYNL 419
Query: 394 GRKQKTFQRMDC 405
+ + +C
Sbjct: 420 ENQTIGWTEYNC 431
>gi|340810931|gb|AEK75392.1| S5 [Oryza sativa]
gi|340810983|gb|AEK75418.1| S5 [Oryza nivara]
gi|340810985|gb|AEK75419.1| S5 [Oryza nivara]
gi|340810997|gb|AEK75425.1| S5 [Oryza nivara]
gi|340811011|gb|AEK75432.1| S5 [Oryza nivara]
gi|340811013|gb|AEK75433.1| S5 [Oryza nivara]
gi|340811041|gb|AEK75447.1| S5 [Oryza nivara]
gi|340811043|gb|AEK75448.1| S5 [Oryza nivara]
Length = 474
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 116/410 (28%), Positives = 176/410 (42%), Gaps = 68/410 (16%)
Query: 37 QEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYP----C 92
+E+I S ++ + ++ + I++ LF + +S+G+PPV A+DTGS+L WV C P C
Sbjct: 92 EEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHC 151
Query: 93 RDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPY------ARCSAYKHRCIYTQLYLIGP 146
S + G IF P RS + V C S C Y A C + C Y+ Y G
Sbjct: 152 HTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGW 211
Query: 147 ETSV-FVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN- 204
SV + T+ L ++ D++FGC + +GIFG G S QL
Sbjct: 212 AYSVGKMVTDTLRIGDS------FMDLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAG 265
Query: 205 -------SSFSYCIGSLH-DPDYLHNKLILG--DGAIIDEGDATPL--QFIDGHYYITLE 252
+FSYC+ + P Y +ILG D A +D G TPL Y +T+E
Sbjct: 266 YPDILSYKAFSYCLPTDETKPGY----MILGRYDRAAMD-GGYTPLFRSINRPTYSLTME 320
Query: 253 AISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMI---------R 303
+ +G+ L S +++DSG T L + AL D+ + R
Sbjct: 321 MLIANGQRLVT---------SSSEMIVDSGAQRTSLWPSTF-ALLDKTITQAMSSIGYHR 370
Query: 304 LEGEQMRSY----SWPDKLCYHGIMS--SDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP 357
+ SY S D ++G ++ S+ P + F GGA LAL ++FY
Sbjct: 371 TSRARQESYICYLSEHDYSGWNGTITPFSNWSALPPLEIGFAGGAALALSPRNVFYNDPH 430
Query: 358 DAFCM--AVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
CM A NPA + ++G + + +DI KQ F+ C
Sbjct: 431 RGLCMTFAQNPA------LRSQILGNRVTRSFGTTFDIQGKQFGFKYAAC 474
>gi|357510893|ref|XP_003625735.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355500750|gb|AES81953.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 535
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 80/264 (30%), Positives = 129/264 (48%), Gaps = 41/264 (15%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
L++ + +G P + +DTGS +LW++C C +C G F + SS+ A V
Sbjct: 70 LYFTKVKMGSPAKEFYVQIDTGSDILWLNCNTCNNCPKSSGLGIDLNYFDTASSSTAALV 129
Query: 115 PCDSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVF---------VSTEQLTFKNT 162
C C Y ++CS+ ++C YT Y G TS + V Q F N+
Sbjct: 130 SCSDPVCSYAVQTATSQCSSQANQCSYTFQYGDGSGTSGYYVYDAMYFDVIMGQSVFSNS 189
Query: 163 DESTIHVQDVVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCI 211
+ VVFGC G GIFG G G S+VSQ++S FS+C+
Sbjct: 190 SST------VVFGCSTYQSGDLARTEKAVDGIFGFGPGALSVVSQVSSQGMAPKVFSHCL 243
Query: 212 GSLHDPDYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKR 270
+ L+LG+ I++ TPL + HY + L++I+V+G++L I+ ++F
Sbjct: 244 KGQGSGGGI---LVLGE--ILEPNIVYTPLVPLQPHYNLNLQSIAVNGQILPIDQDVFAT 298
Query: 271 DDSGGGVMIDSGTDVTWLVKEAYE 294
++ G ++DSGT + +LV+EAY+
Sbjct: 299 GNN-RGTIVDSGTTLAYLVQEAYD 321
>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
thaliana]
gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 464
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 99/358 (27%), Positives = 138/358 (38%), Gaps = 38/358 (10%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DCSPQLGTIFYPSRSSSYAYVPCDSE 119
+ V I IG P DTGS L W C PC C Q F PS SS+Y V C S
Sbjct: 132 YIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSTYQNVSCSSP 191
Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
C CSA C+Y+ +Y T F++ E+ T N+D ++DV FGCG +
Sbjct: 192 MCE--DAESCSA--SNCVYSIVYGDKSFTQGFLAKEKFTLTNSDV----LEDVYFGCGEN 243
Query: 180 TNRNFKFSGIFGLGIGR-----SSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
F + + N+ FSYC+ S H L G I +
Sbjct: 244 NQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTSNSTGH--LTFGSAGISES 301
Query: 235 GDATPLQFIDG--HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEA 292
TP+ +Y I + ISV + L I PN F + G +IDSGT T L +
Sbjct: 302 VKFTPISSFPSAFNYGIDIIGISVGDKELAITPNSFSTE----GAIIDSGTVFTRLPTKV 357
Query: 293 YEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMF 352
Y LR ++ + S CY D +PT+ F F G + L+ +
Sbjct: 358 YAELRSVFKEKMSSYKSTSGYGLFDTCYD-FTGLDTVTYPTIAFSFAGSTVVELDGSGIS 416
Query: 353 YQPRPDAFCMAVN-----PASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ C+A PA + G + Q +V YD+ + F C
Sbjct: 417 LPIKISQVCLAFAGNDDLPA----------IFGNVQQTTLDVVYDVAGGRVGFAPNGC 464
>gi|340810907|gb|AEK75380.1| S5 [Oryza sativa]
Length = 472
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 116/410 (28%), Positives = 176/410 (42%), Gaps = 68/410 (16%)
Query: 37 QEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYP----C 92
+E+I S ++ + ++ + I++ LF + +S+G+PPV A+DTGS+L WV C P C
Sbjct: 90 EEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHC 149
Query: 93 RDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPY------ARCSAYKHRCIYTQLYLIGP 146
S + G IF P RS + V C S C Y A C + C Y+ Y G
Sbjct: 150 HTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGW 209
Query: 147 ETSV-FVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN- 204
SV + T+ L ++ D++FGC + +GIFG G S QL
Sbjct: 210 AYSVGKMVTDTLRIGDS------FMDLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAG 263
Query: 205 -------SSFSYCIGSLH-DPDYLHNKLILG--DGAIIDEGDATPL--QFIDGHYYITLE 252
+FSYC+ + P Y +ILG D A +D G TPL Y +T+E
Sbjct: 264 YPDILSYKAFSYCLPTDETKPGY----MILGRYDRAAMD-GGYTPLFRSINRPTYSLTME 318
Query: 253 AISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMI---------R 303
+ +G+ L S +++DSG T L + AL D+ + R
Sbjct: 319 MLIANGQRLVT---------SSSEMIVDSGAQRTSLWPSTF-ALLDKTITQAMSSIGYHR 368
Query: 304 LEGEQMRSY----SWPDKLCYHGIMS--SDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP 357
+ SY S D ++G ++ S+ P + F GGA LAL ++FY
Sbjct: 369 TSRARQESYICYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALSPRNVFYNDPH 428
Query: 358 DAFCM--AVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
CM A NPA + ++G + + +DI KQ F+ C
Sbjct: 429 RGLCMTFAQNPA------LRSQILGNRVTRSFGTTFDIQGKQFGFKYAAC 472
>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
2-like [Cucumis sativus]
Length = 478
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 90/372 (24%), Positives = 158/372 (42%), Gaps = 34/372 (9%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ--LGT---IFYPSRSSSYA 112
L+Y I IG PP +DTGS +LWV+C C +C + +G ++ P SS+
Sbjct: 70 TGLYYARIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSSTST 129
Query: 113 YVPCDSEHCRY---FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFK---NTDEST 166
+ CD C P C C Y +Y G T+ + + + + +++
Sbjct: 130 LITCDQPFCSATYDAPIPGCKP-DLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHKTS 188
Query: 167 IHVQDVVFGCGFSTNRNFKFS-----GIFGLGIGRSSLVSQLNSS------FSYCIGSLH 215
+VFGCG + S GI G G SS++SQL ++ F++C+
Sbjct: 189 ETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCL---- 244
Query: 216 DPDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG 274
D + I G +++ + TP+ HY + L + V LD+ +F+
Sbjct: 245 --DSISGGGIFAIGEVVEPKLXNTPVVPNQAHYNVVLNGVKVGDTALDLPLGLFETSYKR 302
Query: 275 GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTV 334
G + IDSGT + +L + Y L ++++ ++R+ D+ + GFPTV
Sbjct: 303 GAI-IDSGTTLAYLPESIYLPLMEKILGAQPDLKLRTVD--DQFTCFVFDKNVDDGFPTV 359
Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVN-FSLIGMMAQQFYNVGYDI 393
F F L + +Q R D +C+ + ++ N +L+G + Q V Y++
Sbjct: 360 TFKFEESLILTIYPHEYLFQIRDDVWCVGWQNSGAQSKDGNEVTLLGDLVLQNKLVYYNL 419
Query: 394 GRKQKTFQRMDC 405
+ + +C
Sbjct: 420 ENQTIGWTEYNC 431
>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 490
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 118/412 (28%), Positives = 176/412 (42%), Gaps = 67/412 (16%)
Query: 31 ARLAYLQEKIR------SHNTYQAQILPSNDIS---NSLFYVNISIGQPPVPQFTAMDTG 81
+R+A +Q ++ S+ LPS S + + V + +G P DTG
Sbjct: 108 SRVASIQSRLAKNLAGGSNLKASKATLPSKSASTLGSGNYVVTVGLGSPKRDLTFIFDTG 167
Query: 82 SSLLWVHCYPCRD-CSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYAR-----CSAYKHR 135
S L W C PC C Q IF PS S SY+ V CDS C A CS+
Sbjct: 168 SDLTWTQCEPCVGYCYQQREHIFDPSTSLSYSNVSCDSPSCEKLESATGNSPGCSS--ST 225
Query: 136 CIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFG-LGI 194
C+Y Y G + F + E+L+ +TD + FGCG NR F G G LG+
Sbjct: 226 CLYGIRYGDGSYSIGFFAREKLSLTSTDV----FNNFQFGCG-QNNRGL-FGGTAGLLGL 279
Query: 195 GRS--SLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH-- 246
R+ SLVSQ FSYC L L G G +GD+ ++F
Sbjct: 280 ARNPLSLVSQTAQKYGKVFSYC---LPSSSSSTGYLSFGSG----DGDSKAVKFTPSEVN 332
Query: 247 ------YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEV 300
Y++ + ISV R L I ++F S G +IDSGT ++ L Y +++ +V
Sbjct: 333 SDYPSFYFLDMVGISVGERKLPIPKSVF----STAGTIIDSGTVISRLPPTVYSSVQ-KV 387
Query: 301 MIRLEGE--QMRSYSWPDKLCYHGIMSSDLKGFPTVR-----FHFRGGAKLALEKDSMFY 353
L + +++ S D CY DL + TV+ +F GGA++ L + + Y
Sbjct: 388 FRELMSDYPRVKGVSILDT-CY------DLSKYKTVKVPKIILYFSGGAEMDLAPEGIIY 440
Query: 354 QPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ C+A S ++ ++IG + Q+ +V YD + F C
Sbjct: 441 VLKVSQVCLAFAGNSDDDE---VAIIGNVQQKTIHVVYDDAEGRVGFAPSGC 489
>gi|16209647|gb|AAL14384.1| AT3g52500/F22O6_120 [Arabidopsis thaliana]
Length = 469
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 124/464 (26%), Positives = 196/464 (42%), Gaps = 85/464 (18%)
Query: 8 LSPYNNPNENPAH---RVQRGINISIARLAYL---------QEKIRSHNTYQAQIL--PS 53
LSP+++ +++P ++R SIAR L ++ + S T A ++ P
Sbjct: 23 LSPFSHSDQSPKDPYLSLRRLAESSIARAHKLKHGTSIKPDEDALSSTTTASATVVKSPL 82
Query: 54 NDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC---YPCRDCS-----PQLGTIFYP 105
+ S + V++S G P DTGSSL+ + C Y C C P L F P
Sbjct: 83 SAKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVCLPCTSRYLCSGCDFSGLDPTLIPRFIP 142
Query: 106 SRSSSYAYVPCDSEHCRYF--PYARCSA---YKHRCI-----YTQLYLIGPETSVFVSTE 155
SSS + C S C++ P +C C Y Y +G V + TE
Sbjct: 143 KNSSSSKIIGCQSPKCQFLYGPNVQCRGCDPNTRNCTVGCPPYILQYGLGSTAGVLI-TE 201
Query: 156 QLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSL 214
+L F + + V D V GC + R + +GI G G G SL SQ+N FS+C+ S
Sbjct: 202 KLDFPD-----LTVPDFVVGCSIISTR--QPAGIAGFGRGPVSLPSQMNLKRFSHCLVSR 254
Query: 215 H-DPDYLHNKLILGDGAIIDEGDATP---------------LQFIDGHYYITLEAISVDG 258
D + L L G+ + G TP F++ +YY+ L I V
Sbjct: 255 RFDDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLE-YYYLNLRRIYVGR 313
Query: 259 RMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK 317
+ + I +G GG ++DSG+ T++ + +E + +E QM +Y+
Sbjct: 314 KHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEF-----ASQMSNYTREKD 368
Query: 318 L--------CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMF-YQPRPDAFCMAVNPAS 368
L C++ D+ P + F F+GGAKL L + F + D C+ V
Sbjct: 369 LEKETGLGPCFNISGKGDVT-VPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTV---- 423
Query: 369 INNRYVNFS-------LIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
++++ VN S ++G QQ Y V YD+ + F + C
Sbjct: 424 VSDKTVNPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKC 467
>gi|196212952|gb|ACG76112.1| S5 [Oryza sativa Indica Group]
gi|338809989|gb|AEJ08560.1| S5 [Oryza barthii]
gi|340810883|gb|AEK75368.1| S5 [Oryza sativa]
gi|340810885|gb|AEK75369.1| S5 [Oryza sativa]
gi|340810889|gb|AEK75371.1| S5 [Oryza sativa]
gi|340810895|gb|AEK75374.1| S5 [Oryza sativa]
gi|340810897|gb|AEK75375.1| S5 [Oryza sativa]
gi|340810905|gb|AEK75379.1| S5 [Oryza sativa]
gi|340810909|gb|AEK75381.1| S5 [Oryza sativa]
gi|340810911|gb|AEK75382.1| S5 [Oryza sativa]
gi|340810913|gb|AEK75383.1| S5 [Oryza sativa]
gi|340810923|gb|AEK75388.1| S5 [Oryza sativa]
gi|340810925|gb|AEK75389.1| S5 [Oryza sativa]
gi|340810929|gb|AEK75391.1| S5 [Oryza sativa]
gi|340810935|gb|AEK75394.1| S5 [Oryza sativa]
gi|340810937|gb|AEK75395.1| S5 [Oryza sativa]
gi|340810939|gb|AEK75396.1| S5 [Oryza sativa]
gi|340810941|gb|AEK75397.1| S5 [Oryza sativa]
gi|340810943|gb|AEK75398.1| S5 [Oryza sativa]
gi|340810951|gb|AEK75402.1| S5 [Oryza sativa]
gi|340810953|gb|AEK75403.1| S5 [Oryza sativa]
gi|340810963|gb|AEK75408.1| S5 [Oryza sativa]
gi|340810965|gb|AEK75409.1| S5 [Oryza sativa]
gi|340810973|gb|AEK75413.1| S5 [Oryza nivara]
gi|340811003|gb|AEK75428.1| S5 [Oryza rufipogon]
gi|340811005|gb|AEK75429.1| S5 [Oryza rufipogon]
gi|340811009|gb|AEK75431.1| S5 [Oryza rufipogon]
gi|340811023|gb|AEK75438.1| S5 [Oryza rufipogon]
gi|340811025|gb|AEK75439.1| S5 [Oryza nivara]
gi|340811031|gb|AEK75442.1| S5 [Oryza rufipogon]
gi|340811033|gb|AEK75443.1| S5 [Oryza rufipogon]
gi|340811035|gb|AEK75444.1| S5 [Oryza nivara]
gi|340811039|gb|AEK75446.1| S5 [Oryza rufipogon]
gi|340811049|gb|AEK75451.1| S5 [Oryza nivara]
gi|340811053|gb|AEK75453.1| S5 [Oryza rufipogon]
gi|340811055|gb|AEK75454.1| S5 [Oryza nivara]
gi|340811057|gb|AEK75455.1| S5 [Oryza rufipogon]
gi|340811059|gb|AEK75456.1| S5 [Oryza rufipogon]
gi|340811061|gb|AEK75457.1| S5 [Oryza rufipogon]
gi|340811065|gb|AEK75459.1| S5 [Oryza nivara]
gi|340811067|gb|AEK75460.1| S5 [Oryza nivara]
gi|340811069|gb|AEK75461.1| S5 [Oryza nivara]
gi|340811071|gb|AEK75462.1| S5 [Oryza rufipogon]
gi|340811081|gb|AEK75467.1| S5 [Oryza nivara]
gi|340811083|gb|AEK75468.1| S5 [Oryza nivara]
gi|340811087|gb|AEK75470.1| S5 [Oryza nivara]
gi|340811092|gb|AEK75472.1| S5 [Oryza nivara]
gi|340811102|gb|AEK75477.1| S5 [Oryza rufipogon]
gi|340811106|gb|AEK75479.1| S5 [Oryza rufipogon]
gi|340811108|gb|AEK75480.1| S5 [Oryza rufipogon]
gi|340811110|gb|AEK75481.1| S5 [Oryza rufipogon]
gi|340811112|gb|AEK75482.1| S5 [Oryza rufipogon]
gi|340811118|gb|AEK75485.1| S5 [Oryza nivara]
gi|340811120|gb|AEK75486.1| S5 [Oryza rufipogon]
Length = 472
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 116/410 (28%), Positives = 176/410 (42%), Gaps = 68/410 (16%)
Query: 37 QEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYP----C 92
+E+I S ++ + ++ + I++ LF + +S+G+PPV A+DTGS+L WV C P C
Sbjct: 90 EEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHC 149
Query: 93 RDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPY------ARCSAYKHRCIYTQLYLIGP 146
S + G IF P RS + V C S C Y A C + C Y+ Y G
Sbjct: 150 HTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGW 209
Query: 147 ETSV-FVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN- 204
SV + T+ L ++ D++FGC + +GIFG G S QL
Sbjct: 210 AYSVGKMVTDTLRIGDS------FMDLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAG 263
Query: 205 -------SSFSYCIGSLH-DPDYLHNKLILG--DGAIIDEGDATPL--QFIDGHYYITLE 252
+FSYC+ + P Y +ILG D A +D G TPL Y +T+E
Sbjct: 264 YPDILSYKAFSYCLPTDETKPGY----MILGRYDRAAMD-GGYTPLFRSINRPTYSLTME 318
Query: 253 AISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMI---------R 303
+ +G+ L S +++DSG T L + AL D+ + R
Sbjct: 319 MLIANGQRLVT---------SSSEMIVDSGAQRTSLWPSTF-ALLDKTITQAMSSIGYHR 368
Query: 304 LEGEQMRSY----SWPDKLCYHGIMS--SDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP 357
+ SY S D ++G ++ S+ P + F GGA LAL ++FY
Sbjct: 369 TSRARQESYICYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPH 428
Query: 358 DAFCM--AVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
CM A NPA + ++G + + +DI KQ F+ C
Sbjct: 429 RGLCMTFAQNPA------LRSQILGNRVTRSFGTTFDIQGKQFGFKYAAC 472
>gi|308081797|ref|NP_001182920.1| uncharacterized protein LOC100501208 [Zea mays]
gi|238008190|gb|ACR35130.1| unknown [Zea mays]
gi|413922182|gb|AFW62114.1| hypothetical protein ZEAMMB73_927324 [Zea mays]
Length = 269
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 79/273 (28%), Positives = 130/273 (47%), Gaps = 30/273 (10%)
Query: 152 VSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFK-FSGIFGLGIGRSSLVSQLN-SSFSY 209
++TE TF + ++ FGCG TN SGI G+ G S++ QL+ + FSY
Sbjct: 7 LATETFTFGAHQNFS---ANLTFGCGKLTNGTIAGASGIMGVSPGPLSVLKQLSITKFSY 63
Query: 210 CIGSLHDPDYLHNKLILGDGAIIDEG--------DATPL---QFIDGHYYITLEAISVDG 258
C+ D H + GA+ D G PL D +YY+ + IS+
Sbjct: 64 CLTPFTD----HKTSPVMFGAMADLGKYKTTGKVQTIPLLKNPVEDIYYYVPMVGISIGS 119
Query: 259 RMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVM--IRLEGEQMRSYSWP 315
+ LD+ I R D GG ++DS T + +LV+ A++ L+ VM ++L +P
Sbjct: 120 KRLDVPEAILALRPDGTGGTVLDSATTLAYLVEPAFKELKKAVMEGMKLPAANRSIDDYP 179
Query: 316 DKLCYHGIMSSDLKGF--PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRY 373
+C+ ++G P + HF G A+++L +DS F +P P C+AV A
Sbjct: 180 --VCFELPRGMSMEGVQVPPLVLHFAGDAEMSLPRDSYFQEPSPGMMCLAVMQAPFEGAP 237
Query: 374 VNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
++IG + QQ +V YD+G ++ ++ C+
Sbjct: 238 ---NVIGNVQQQNMHVLYDLGNRKFSYAPTKCD 267
>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
Length = 488
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 103/372 (27%), Positives = 154/372 (41%), Gaps = 36/372 (9%)
Query: 54 NDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAY 113
+S + + ++ +G P +DTGS WV C PC DC Q +F P+ SS+Y+
Sbjct: 132 KSLSTTNYVASLRLGTPATELVVELDTGSDQSWVQCKPCADCYEQRDPVFDPTASSTYSA 191
Query: 114 VPCDSEHCRYFP-----YARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIH 168
VPC + C+ S C Y Y T ++ + LT + +
Sbjct: 192 VPCGARECQELASSSSSRNCSSDNNKNCPYEVSYDDDSHTVGDLARDTLTLSPSPSPSPA 251
Query: 169 --VQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLH 221
V VFGCG S F + G+ GLG+G++SL SQ+ ++FSYC+ P
Sbjct: 252 DTVPGFVFGCGHSNAGTFGEVDGLLGLGLGKASLPSQVAARYGAAFSYCL-----PSSPS 306
Query: 222 NKLILGDGAIIDEGDATPLQFIDGH----YYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
L G +A + + G YY+ L I V GR + + + F + G
Sbjct: 307 AAGYLSFGGAAARANAQFTEMVTGQDPTSYYLNLTGIVVAGRAIKVPASAFA---TAAGT 363
Query: 278 MIDSGTDVTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTV 334
+IDSGT + L AY ALR M R ++ S D CY ++ P V
Sbjct: 364 IIDSGTAFSRLPPSAYAALRSSFRSAMGRYRYKRAPSSPIFDT-CYDFTGHETVR-IPAV 421
Query: 335 RFHFRGGAKLALEKDSMFYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
F GA + L + Y A C+A P + ++G Q+ V YD+
Sbjct: 422 ELVFADGATVHLHPSGVLYTWNDVAQTCLAFVPNH------DLGILGNTQQRTLAVIYDV 475
Query: 394 GRKQKTFQRMDC 405
G ++ F R C
Sbjct: 476 GSQRIGFGRKGC 487
>gi|125558627|gb|EAZ04163.1| hypothetical protein OsI_26305 [Oryza sativa Indica Group]
Length = 404
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 103/370 (27%), Positives = 153/370 (41%), Gaps = 69/370 (18%)
Query: 55 DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYV 114
D S + +N+SIG PPV DTGSSL+W C PC +C+ + F P+ SS+++ +
Sbjct: 84 DNSAGAYNMNLSIGTPPVTFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSSTFSKL 143
Query: 115 PCDSEHCRYF--PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
PC S C++ PY C+A C+Y Y +G T+ +++TE L V
Sbjct: 144 PCASSLCQFLTSPYRTCNATG--CVYYYPYGMG-FTAGYLATETLHVGGAS-----FPGV 195
Query: 173 VFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAI 231
FGC SGI GLG SLVSQ+ + FSYC+ S + D + ++ G A
Sbjct: 196 TFGCSTENGVGNSSSGIVGLGRSPLSLVSQVGVARFSYCLRS--NADAGDSPILFGSLAK 253
Query: 232 IDEGD--ATPL-----QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
+ G+ +TPL +YY+ L I+V L +
Sbjct: 254 VTGGNVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPM--------------------- 292
Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFR--GGA 342
M L + + LC+ + G P R GGA
Sbjct: 293 ---------------AMANLTTVNGTRFGF--DLCFDATAAGGGGGVPVPTLVLRFAGGA 335
Query: 343 KLALEKDSMF------YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
+ A+ + S F Q R C+ V PAS ++ S+IG + Q +V YD+
Sbjct: 336 EYAVRRRSYFGVVEVDSQGRAAVECLLVLPAS---EKLSISIIGNVMQMDLHVLYDLDGG 392
Query: 397 QKTFQRMDCE 406
+F DC
Sbjct: 393 MFSFAPADCA 402
>gi|224102847|ref|XP_002312826.1| predicted protein [Populus trichocarpa]
gi|222849234|gb|EEE86781.1| predicted protein [Populus trichocarpa]
Length = 445
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 114/446 (25%), Positives = 187/446 (41%), Gaps = 54/446 (12%)
Query: 6 SILSPYNNPNENPA------HRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNS 59
SI P +P N ++ + S+AR +L+ + T L S+
Sbjct: 8 SITIPLQHPQTNQIPFQDQYQKLNHLVTTSLARARHLKNPQTTPATTTTAPLFSHSYGG- 66
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHC---YPCRDCSPQLGTI------FYPSRSSS 110
+ V++S G PP MDTGS ++W C Y C+ CS + F P SSS
Sbjct: 67 -YSVSLSFGTPPQTLSFIMDTGSDIVWFPCTSHYLCKHCSFSSSSPSSRIQPFIPKESSS 125
Query: 111 YAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
+ C + C + ++ + + I + L P +F + E T+H+
Sbjct: 126 SKLLGCKNPKCSWIHHSNINCDQDCSIKSCLNQTCPPYMIFYGSGTTGGVALSE-TLHLH 184
Query: 171 DVV---FGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLH-DPDYLHNKLI 225
+ F G S + + +GI G G G SSL SQL FSYC+ S D D + +
Sbjct: 185 SLSKPNFLVGCSVFSSHQPAGIAGFGRGLSSLPSQLGLGKFSYCLLSHRFDDDTKKSSSL 244
Query: 226 LGDGAIIDEGDATP----LQFIDG-----------HYYITLEAISVDGRMLDINPNIFKR 270
+ D +D T F+ +YY+ L I+V G + +
Sbjct: 245 VLDMEQLDSDKKTNALVYTPFVKNPKVDNKSSFSVYYYLGLRRITVGGHHVKVPYKYLSP 304
Query: 271 -DDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL----CYHGIMS 325
+D GGV+IDSGT T++ +EA+E L DE IR + R D + C++ +
Sbjct: 305 GEDGNGGVIIDSGTTFTFMAREAFEPLSDE-FIRQIKDYRRVKEIEDAIGLRPCFN-VSD 362
Query: 326 SDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAV------NPASINNRYVNFSLI 379
+ FP +R +F+GGA +AL ++ F + C+ V P + + ++
Sbjct: 363 AKTVSFPELRLYFKGGADVALPVENYFAFVGGEVACLTVVTDGVAGPERVGGPGM---IL 419
Query: 380 GMMAQQFYNVGYDIGRKQKTFQRMDC 405
G Q + V YD+ ++ F++ C
Sbjct: 420 GNFQMQNFYVEYDLRNERLGFKQEKC 445
>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
Length = 494
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 96/401 (23%), Positives = 174/401 (43%), Gaps = 47/401 (11%)
Query: 41 RSHN-TYQAQILPSNDI---------SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCY 90
R+H+ + + ++L + DI L+Y I IG P + +DTGS +LWV+C
Sbjct: 59 RAHDGSRRGRLLAAADIPLGGLGLPTDTGLYYTEIGIGTPTKRYYVQVDTGSDILWVNCI 118
Query: 91 PCRDCSPQLG-----TIFYPSRSSSYAYVPCDSEHC--RYFPYARCSAYKHRCIYTQLYL 143
C C + G T++ P SS+ + V CD C Y C Y+ Y
Sbjct: 119 SCDRCPRKSGLGLELTLYDPKDSSTGSKVSCDQGFCAATYGGLLPGCTTSLPCEYSVTYG 178
Query: 144 IGPETSVFVSTEQLTFKNTD---ESTIHVQDVVFGCGFST-----NRNFKFSGIFGLGIG 195
G T+ + ++ L F ++ V FGCG + N GI G G
Sbjct: 179 DGSSTTGYFVSDLLQFDQVSGDGQTRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQS 238
Query: 196 RSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFIDGHYY 248
+S++SQL+++ F++C+ D ++ I G ++ + TPL HY
Sbjct: 239 NTSMLSQLSAAGKVKKIFAHCL------DTINGGGIFAIGNVVQPKVKTTPLVPNMPHYN 292
Query: 249 ITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRL--EG 306
+ L++I V G L + ++F + G +IDSGT +T+L + Y+ E+M+ + +
Sbjct: 293 VNLKSIDVGGTALKLPSHMFDTGEK-KGTIIDSGTTLTYLPEIVYK----EIMLAVFAKH 347
Query: 307 EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNP 366
+ + ++ + LC+ + D FP + FHF L + F++ + +C+
Sbjct: 348 KDITFHNVQEFLCFQYVGRVD-DDFPKITFHFENDLPLNVYPHDYFFENGDNLYCVGFQN 406
Query: 367 ASINNR-YVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
+ ++ L+G + V YD+ + + +C
Sbjct: 407 GGLQSKDGKGMVLLGDLVLSNKLVVYDLENQVIGWTEYNCS 447
>gi|21717173|gb|AAM76366.1|AC074196_24 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433291|gb|AAP54829.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 413
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 98/369 (26%), Positives = 154/369 (41%), Gaps = 39/369 (10%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ N +IG PP P +D L+W C CR C Q +F P+ SS++ PC +
Sbjct: 62 YVANFTIGTPPQPASAIVDVAGELVWTQCSACRRCFKQDLPVFVPNASSTFKPEPCGTAV 121
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV--VFGCGF 178
C P CS C Y GP T + +T F TD I V FGC
Sbjct: 122 CESIPTRSCSG--DVCSYK-----GPPTQLRGNTSG--FAATDTFAIGTATVRLAFGCVV 172
Query: 179 STNRNF--KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
+++ + SG GLG SLV+Q+ + FSYC+ + ++L LG A + G
Sbjct: 173 ASDIDTMDGPSGFIGLGRTPWSLVAQMKLTRFSYCLSPRNTGK--SSRLFLGSSAKLAGG 230
Query: 236 DATPLQ-FI------DGHYY--ITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVT 286
++T FI D H+Y ++L+AI + SGG +++ + + +
Sbjct: 231 ESTSTAPFIKTSPDDDSHHYYLLSLDAIRAGNTTIATA-------QSGGILVMHTVSPFS 283
Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDK---LCYHGIMSSDLKGFPTVRFHFRGGAK 343
LV AY A + V + G + P + LC+ P + F F+G A
Sbjct: 284 LLVDSAYRAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGAAA 343
Query: 344 LALEKDSMFYQ--PRPDAFCMAVNPASINNR--YVNFSLIGMMAQQFYNVGYDIGRKQKT 399
L + D C A+ + NR S++G + Q+ + YD+ ++ +
Sbjct: 344 LTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKETLS 403
Query: 400 FQRMDCEVL 408
F+ DC L
Sbjct: 404 FEPADCSSL 412
>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 447
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 101/375 (26%), Positives = 155/375 (41%), Gaps = 30/375 (8%)
Query: 53 SNDISNSLFYVNISIGQPPVPQFTAM--DTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSS 110
S+ + + + ++ IG P PQ A+ DTGS ++W C PC DC Q F S S +
Sbjct: 84 SHVVGYTEYLIHFGIGTP-RPQQVALEVDTGSDVVWTQCRPCFDCFTQPLPRFDTSASDT 142
Query: 111 YAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
V C CR C + C Y Y T ++ + TF + V
Sbjct: 143 VHGVLCTDPICRALRPHAC--FLGGCTYQVNYGDNSVTIGQLAKDSFTFDGKGGGKVTVP 200
Query: 171 DVVFGCGFSTNRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILG 227
D+VFGCG NF +GI G G G SL QL SSFSYC ++ + + LG
Sbjct: 201 DLVFGCGQYNTGNFHSNETGIAGFGRGPLSLPRQLGVSSFSYCFTTIFESK--STPVFLG 258
Query: 228 DGAIID------EGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIF-KRDDSGGGV 277
GA D G F+ H YY++L+ I+V L + + F + D GG
Sbjct: 259 -GAPADGLRAHATGPILSTPFLPNHPEYYYLSLKGITVGKTRLAVPESAFVVKADGSGGT 317
Query: 278 MIDSGTDVTWLVKEAYEALRDEVM--IRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
+IDSGT +T + + +L + + + L P C+ D P +
Sbjct: 318 IIDSGTAITAFPRAVFRSLWEAFVAQVPLPHTSYNDTGEPTLQCFSTESVPDASKVPVPK 377
Query: 336 --FHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
H G ++ M P D C+ V A ++R ++IG QQ ++ +D+
Sbjct: 378 MTLHLEGADWELPRENYMAEYPDSDQLCVVVL-AGDDDR----TMIGNFQQQNMHIVHDL 432
Query: 394 GRKQKTFQRMDCEVL 408
+ + C+ +
Sbjct: 433 AGNKLVIEPAQCDKM 447
>gi|115475303|ref|NP_001061248.1| Os08g0207800 [Oryza sativa Japonica Group]
gi|45735815|dbj|BAD12851.1| unknown protein [Oryza sativa Japonica Group]
gi|113623217|dbj|BAF23162.1| Os08g0207800 [Oryza sativa Japonica Group]
gi|125602549|gb|EAZ41874.1| hypothetical protein OsJ_26419 [Oryza sativa Japonica Group]
Length = 449
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 110/390 (28%), Positives = 173/390 (44%), Gaps = 46/390 (11%)
Query: 50 ILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTI--FYPSR 107
+ P + ++ + IG+ Q+ +DTGSSL+W C C C +G + + S+
Sbjct: 71 VTPFRIYEDVVYLAEMEIGERQQKQYLLIDTGSSLVWTQCDECPHC--HIGDVPPYGRSQ 128
Query: 108 SSSYAYVPC------DSEH--CRYFPYARCSAY-----KHRCIYTQLY-LIGPETSVFVS 153
S ++ V C D E Y P A+ Y RC++ LY L G +V
Sbjct: 129 SRTFQEVSCGDDDDNDKEEAIASYCP-AKPPGYITLCVNGRCMFKALYNLTGQGETVQGY 187
Query: 154 TEQLTFKNTDESTIHVQD---VVFGCGFSTNRNF----KFSGIFGLGIGRSSLVSQLN-S 205
TF D+ Q +VFGC N + +GI GLG+G +S + Q +
Sbjct: 188 MSMDTFHFIDDRRFDYQAKFRMVFGCAHQENIVLTAVKECTGILGLGMGDASFLRQTGIT 247
Query: 206 SFSYCIGSLHDPDY---LHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVD-GRML 261
FSYC+ P Y H+ L G A I G PL G YY+ L AI+ ++
Sbjct: 248 KFSYCVPP-RMPGYSYRRHSWLRFGSHAQI-SGKKVPLVMRWGKYYLPLTAITYTYNELM 305
Query: 262 DINPNI-FKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEV--MIRLEGEQMRSYSWPDKL 318
P I +K + +M+D+GT + L ++ L E+ +I+ E + WP K
Sbjct: 306 SPVPIIAYKSQEDYLHMMVDTGTSLLSLPTSLHDDLIKEMEAIIKSENIMEGATRWP-KH 364
Query: 319 CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQ---PRPDAFCMAVNPASINNRYVN 375
CY M ++K TV F GG + L ++F + + A C+AVN +++
Sbjct: 365 CYKRTM-DEVKDI-TVTLSFDGGLDIELFTSALFIKTETTKGPAVCLAVNRVDDSSK--- 419
Query: 376 FSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+++GM AQ NVGYD+ ++ + C
Sbjct: 420 -AILGMFAQTNINVGYDLLSREIAMDPIRC 448
>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 91/372 (24%), Positives = 152/372 (40%), Gaps = 59/372 (15%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ + +G P + +DTGS W++C S S+ V C S
Sbjct: 113 YFAEVKVGSPGQRFWLVVDTGSEFTWLNC------------------SKSFEAVTCASRK 154
Query: 121 CR-----YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
C+ F + C C+Y Y G F T+ +T T+ + ++ G
Sbjct: 155 CKVDLSELFSLSVCPKPSDPCLYDISYADGSSAKGFFGTDSITVGLTNGKQGKLNNLTIG 214
Query: 176 CGFS----TNRNFKFSGIFGLGIGRSSLV----SQLNSSFSYCIGSLHDPDYLHNKLILG 227
C S N N + GI GLG + S + ++ + FSYC+ + + L +G
Sbjct: 215 CTKSMLNGVNFNEETGGILGLGFAKDSFIDKAANKYGAKFSYCLVDHLSHRSVSSNLTIG 274
Query: 228 ---DGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
+ ++ E T L Y + + IS+ G+ML I P ++ + + GG +IDSGT
Sbjct: 275 GHHNAKLLGEIRRTELILFPPFYGVNVVGISIGGQMLKIPPQVWDFN-AEGGTLIDSGTT 333
Query: 285 VTWLVKEAYEALRDEV------MIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----PT 333
+T L+ AYEA+ + + + R+ GE + + C+ D +GF P
Sbjct: 334 LTSLLLPAYEAVFEALTKSLTKVKRVTGEDFDAL----EFCF------DAEGFDDSVVPR 383
Query: 334 VRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
+ FHF GGA+ S P C+ + P + S+IG + QQ + +D+
Sbjct: 384 LVFHFAGGARFEPPVKSYIIDVAPLVKCIGIVPI---DGIGGASVIGNIMQQNHLWEFDL 440
Query: 394 GRKQKTFQRMDC 405
F C
Sbjct: 441 STNTVGFAPSTC 452
>gi|242053991|ref|XP_002456141.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
gi|241928116|gb|EES01261.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
Length = 519
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 105/417 (25%), Positives = 151/417 (36%), Gaps = 78/417 (18%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-------------------- 100
++V +G P P DTGS L WV C+ +P G
Sbjct: 107 YFVRFRVGTPARPFLLVADTGSDLTWVKCHRHDHDAPAPGYGYAAPASNDSSTSSLSAAA 166
Query: 101 -------TIFYPSRSSSYAYVPCDSEHCRY---FPYARCSAYKHRCIYTQLYLIGPETSV 150
+F P RS ++A +PC S+ C F A C C Y Y G
Sbjct: 167 ASSSSHARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTPGSPCAYDYRYKDGSAARG 226
Query: 151 FVSTEQLTF------KNTDESTIHVQDVVFGCGFS-TNRNFKFS-GIFGLGIGRSSLVS- 201
V T+ T + ++ VV GC S T +F S G+ LG S S
Sbjct: 227 TVGTDSATIALSGRGAKKKQRQAKLRGVVLGCTTSYTGDSFLASDGVLSLGYSNISFASR 286
Query: 202 ---QLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDA--------------------- 237
+ FSYC+ P + L G +
Sbjct: 287 AAARFGGRFSYCLVDHLAPRNATSYLTFGPNPAVSSSPPSKTACAGGGSPAAAPPGPGGA 346
Query: 238 --TPLQF---IDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEA 292
TPL + Y +T+ ISVDG +L I P + GGG ++DSGT +T LV A
Sbjct: 347 RQTPLLLDHRMRPFYAVTVNGISVDGELLRI-PRLVWDVAKGGGAILDSGTSLTVLVSPA 405
Query: 293 YEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS---DLK-GFPTVRFHFRGGAKLALEK 348
Y A+ + +L G + P CY+ S DL P + HF G A+L
Sbjct: 406 YRAVVAALNKKLAGLPRVTMD-PFDYCYNWTSPSTGEDLTVAMPELAVHFAGSARLQPPA 464
Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
S P C+ + + S+IG + QQ + +D+ ++ F+R C
Sbjct: 465 KSYVIDAAPGVKCIGLQ----EGEWPGVSVIGNILQQEHLWEFDLKNRRLRFKRSRC 517
>gi|449482385|ref|XP_004156266.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 491
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 93/313 (29%), Positives = 148/313 (47%), Gaps = 33/313 (10%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLG---TIFYPSRSSSYAYV 114
L++ + +G P +DTGS +LWV C PC C S LG +F ++SSS +
Sbjct: 83 LYFTKVKLGNPAREFNVQIDTGSDILWVTCSPCDGCPDSSGLGIELNLFDTTKSSSARVL 142
Query: 115 PCDSEHCRYFPYA--RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKN-TDESTI--HV 169
PC C +C C Y+ Y TS F T+ + F ESTI
Sbjct: 143 PCTDPICAAVSTTTDQCLTQTDHCSYSFHYRDRSGTSGFYVTDSMHFDILLGESTIANSS 202
Query: 170 QDVVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPD 218
+VFGC G T GIFG G G S++SQL+S FS+C+ +
Sbjct: 203 ATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHCLKGGENGG 262
Query: 219 YLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
+ L+LG+ I++ +PL HY + L++I++ G++ NP +F ++ G
Sbjct: 263 GI---LVLGE--ILEPSIVYSPLIPSQPHYTLKLQSIALSGQLFP-NPTMFPISNA-GET 315
Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMS-SDLKGFPTVRF 336
+IDSGT + +LV+E Y+ + + + + S + C+ MS +D+ FP +RF
Sbjct: 316 IIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQ-CFRVSMSVADI--FPVLRF 372
Query: 337 HFRGGAKLALEKD 349
+F G A + + +
Sbjct: 373 NFEGIASMVVTPE 385
>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
Length = 475
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 87/356 (24%), Positives = 154/356 (43%), Gaps = 43/356 (12%)
Query: 38 EKIRSHNTYQ-AQILPSNDI---------SNSLFYVNISIGQPPVPQFTAMDTGSSLLWV 87
E +SH+T + +++L S D+ S L++ I +G PP +DTGS +LW+
Sbjct: 41 EHFKSHDTRRHSRMLASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILWI 100
Query: 88 HCYPCRDCSPQLG-----TIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLY 142
+C PC C + ++F + SS+ V CD + C + + C Y +Y
Sbjct: 101 NCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVGCDDDFCSFISQSDSCQPALGCSYHIVY 160
Query: 143 LIGPETSVFVSTEQLTFKNT--DESTIHV-QDVVFGCGFST-----NRNFKFSGIFGLGI 194
+ + LT + D T + Q+VVFGCG N + G+ G G
Sbjct: 161 ADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQ 220
Query: 195 GRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFIDGHY 247
+S++SQL ++ FS+C+ D + I G + + TP+ HY
Sbjct: 221 SNTSVLSQLAATGDAKRVFSHCL------DNVKGGGIFAVGVVDSPKVKTTPMVPNQMHY 274
Query: 248 YITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGE 307
+ L + VDG LD+ +I + GG ++DSGT + + K Y++L + ++ R +
Sbjct: 275 NVMLMGMDVDGTSLDLPRSIVRN----GGTIVDSGTTLAYFPKVLYDSLIETILAR---Q 327
Query: 308 QMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMA 363
++ + + ++ + FP V F F KL + + + +C
Sbjct: 328 PVKLHIVEETFQCFSFSTNVDEAFPPVSFEFEDSVKLTVYPHDYLFTLEEELYCFG 383
>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
gi|223948009|gb|ACN28088.1| unknown [Zea mays]
gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
Length = 507
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 103/374 (27%), Positives = 147/374 (39%), Gaps = 56/374 (14%)
Query: 66 SIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFP 125
S G P +DTGS L WV C PC C Q +F P+ S++YA V C++ C
Sbjct: 153 SSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRCNASACADSL 212
Query: 126 YA---------RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
A A +C Y Y G + ++T+ + + VFGC
Sbjct: 213 RAATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVALGGA-----SLGGFVFGC 267
Query: 177 GFSTNRNFKFSGIFGL-GIGRS--SLVSQLNSS----FSYCIGSLHDPDYLHN-KLILGD 228
G S NR F G GL G+GR+ SLVSQ S FSYC+ + D + L GD
Sbjct: 268 GLS-NRGL-FGGTAGLMGLGRTELSLVSQTASRYGGVFSYCLPAATSGDASGSLSLGGGD 325
Query: 229 GAIIDEGDATPLQFID--------GHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
A + TP+ + Y++ + +V G L + V+ID
Sbjct: 326 DAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTAL------AAQGLGASNVLID 379
Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK-------LCYHGIMSSDLKGFPT 333
SGT +T L Y A+R E M Q + +P CY + D P
Sbjct: 380 SGTVITRLAPSVYRAVRAEFM-----RQFGAAGYPAAPGFSILDTCYD-LTGHDEVKVPL 433
Query: 334 VRFHFRGGAKLALEKDSMFYQPRPDA--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
+ GGA + ++ M + R D C+A+ S + +IG Q+ V Y
Sbjct: 434 LTLRLEGGADVTVDAAGMLFVVRKDGSQVCLAMASLSYED---ETPIIGNYQQKNKRVVY 490
Query: 392 DIGRKQKTFQRMDC 405
D + F DC
Sbjct: 491 DTLGSRLGFADEDC 504
>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 645
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 101/391 (25%), Positives = 165/391 (42%), Gaps = 38/391 (9%)
Query: 36 LQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC 95
L+E H+ L + + N + + IG PP +DTGS++ +V C CR C
Sbjct: 68 LKESDSEHHPNARMRLYDDLLRNGYYTARLWIGTPPQRFALIVDTGSTVTYVPCSTCRHC 127
Query: 96 SPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTE 155
F P S +Y V C + C + +C Y + Y +S + +
Sbjct: 128 GSHQDPKFRPEDSETYQPVKCTWQ-------CNCDNDRKQCTYERRYAEMSTSSGALGED 180
Query: 156 QLTFKNTDESTIHVQDVVFGCGFSTN---RNFKFSGIFGLGIGRSSLVSQL------NSS 206
++F N E + Q +FGC N + GI GLG G S++ QL + S
Sbjct: 181 VVSFGNQTE--LSPQRAIFGCENDETGDIYNQRADGIMGLGRGDLSIMDQLVEKKVISDS 238
Query: 207 FSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPN 266
FS C G + + A + + P++ +Y I L+ I V G+ L +NP
Sbjct: 239 FSLCYGGMGVGGGAMVLGGISPPADMVFTRSDPVR--SPYYNIDLKEIHVAGKRLHLNPK 296
Query: 267 IFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK----LCYHG 322
+F D G ++DSGT +L + A+ A + +M E ++ S PD +C+ G
Sbjct: 297 VF---DGKHGTVLDSGTTYAYLPESAFLAFKHAIM--KETHSLKRISGPDPRYNDICFSG 351
Query: 323 I---MSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP--DAFCMAVNPASINNRYVNFS 377
+S K FP V F G KL+L ++ ++ A+C+ V +N +
Sbjct: 352 AEIDVSQISKSFPVVEMVFGNGHKLSLSPENYLFRHSKVRGAYCLGV----FSNGNDPTT 407
Query: 378 LIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
L+G + + V YD + F + +C L
Sbjct: 408 LLGGIVVRNTLVMYDREHTKIGFWKTNCSEL 438
>gi|218194599|gb|EEC77026.1| hypothetical protein OsI_15382 [Oryza sativa Indica Group]
Length = 409
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 85/337 (25%), Positives = 150/337 (44%), Gaps = 36/337 (10%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
L+Y I IG P + +DTGS +LWV+C C C + G T++ P SS+ + V
Sbjct: 3 LYYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKV 62
Query: 115 PCDSEHC--RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD---ESTIHV 169
CD C Y C Y+ Y G T+ + ++ L F ++
Sbjct: 63 SCDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRPAN 122
Query: 170 QDVVFGCGFST-----NRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPD 218
V FGCG + N GI G G +S++SQL+++ F++C+ D
Sbjct: 123 STVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCL------D 176
Query: 219 YLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
++ I G ++ + TPL HY + L++I V G L + ++F + G
Sbjct: 177 TINGGGIFAIGNVVQPKVKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEK-KGT 235
Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRL--EGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
+IDSGT +T+L + Y+ E+M+ + + + + ++ + LC+ + D FP +
Sbjct: 236 IIDSGTTLTYLPEIVYK----EIMLAVFAKHKDITFHNVQEFLCFQYVGRVD-DDFPKIT 290
Query: 336 FHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR 372
FHF L + F++ + +C+ + ++
Sbjct: 291 FHFENDLPLNVYPHDYFFENGDNLYCVGFQNGGLQSK 327
>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 633
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 105/380 (27%), Positives = 161/380 (42%), Gaps = 55/380 (14%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
N + + IG PP +D+GS++ +V C C C F P SS+Y V C+
Sbjct: 91 NGYYTTRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQCGKHQDPKFQPELSSTYQPVKCN 150
Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
+ C K +C+Y + Y + + + ++F N ES + Q VFGC
Sbjct: 151 MD-------CNCDDDKEQCVYEREYAEHSSSKGVLGEDLISFGN--ESQLTPQRAVFGCE 201
Query: 178 FSTNRNF---KFSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLILGD 228
+ + GI GLG G SLV QL ++SF C G + +G
Sbjct: 202 TVETGDLYSQRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMD----------VGG 251
Query: 229 GAIIDEGDATP--LQFIDG------HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
G++I G P + F D +Y I L I V G+ L +N +F D G ++D
Sbjct: 252 GSMILGGFDYPSDMIFTDSDPDRSPYYNIDLTGIRVAGKKLSLNSRVF---DGEHGAVLD 308
Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD----KLCYHGIMSSDL----KGFP 332
SGT +L A+ A + VM E ++ PD C+ S+D+ K FP
Sbjct: 309 SGTTYAYLPDAAFAAFEEAVM--REVSPLKQIDGPDPNFKDTCFLVAASNDVSELSKIFP 366
Query: 333 TVRFHFRGGAKLALEKDS-MFYQPRP-DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
+V F+ G L ++ MF + A+C+ V P N + +L+G + + V
Sbjct: 367 SVEMIFKSGQSWLLSPENYMFRHSKVHGAYCLGVFP----NGKDHTTLLGGIVVRNTLVV 422
Query: 391 YDIGRKQKTFQRMDCEVLDD 410
YD + F R +C L D
Sbjct: 423 YDRENSKVGFWRTNCSELSD 442
>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
Length = 564
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 98/377 (25%), Positives = 157/377 (41%), Gaps = 54/377 (14%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
N + + IG PP +DTGSS+ +V C C C F P SS+Y V C+
Sbjct: 10 NGYYTTRLWIGTPPQRFALIVDTGSSVTYVPCSSCEQCGRHQDPKFQPDLSSTYQSVKCN 69
Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
+ C K +C+Y + Y +S + + ++F N S + Q VFGC
Sbjct: 70 ID-------CNCDDEKQQCVYERQYAEMSTSSGVLGEDIISFGNL--SALAPQRAVFGCE 120
Query: 178 FSTNRNF---KFSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLILGD 228
+ GI G+G G S+V L N SFS C + + +G
Sbjct: 121 NMETGDLYSQHADGIMGMGRGDLSIVDHLVDKGVINDSFSLC----------YGGMGIGG 170
Query: 229 GAIIDEGDATPLQFI--------DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
GA++ G + P + +Y I L+ I V G+ L +NP +F D G ++D
Sbjct: 171 GAMVLGGISPPSNMVFSQSDPVRSPYYNIDLKEIHVAGKPLPLNPTVF---DGKHGTILD 227
Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD----KLCYHGI---MSSDLKGFPT 333
SGT +L + A+ + +D +M L ++ PD +C+ G +S FP
Sbjct: 228 SGTTYAYLPEAAFVSFKDAIMKEL--HSLKPIRGPDPNYNDICFSGAGSDISQLSSSFPA 285
Query: 334 VRFHFRGGAKLALEKDSMFYQPRP--DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
V F G KL L ++ ++ A+C+ + N +L+G + + V Y
Sbjct: 286 VEMVFGNGQKLLLSPENYLFRHSKVHGAYCLGI----FQNGKDPTTLLGGIVVRNTLVLY 341
Query: 392 DIGRKQKTFQRMDCEVL 408
D + F + +C L
Sbjct: 342 DRENSKIGFWKTNCSEL 358
>gi|224114179|ref|XP_002332420.1| predicted protein [Populus trichocarpa]
gi|222832373|gb|EEE70850.1| predicted protein [Populus trichocarpa]
Length = 449
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 109/444 (24%), Positives = 184/444 (41%), Gaps = 53/444 (11%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQIL-PSNDISNS 59
LIH+DS SP N P ++ + A L + + ++ +++ P +
Sbjct: 18 LIHKDSPQSPLYPGNLPPGEQILQPAACPFAGLHHQTSMMSTNKAVMNRMMSPLTSYGDP 77
Query: 60 -LFYVNISIGQPPVPQ--------FTAMDTGSSLLWVHCYPCRD----CSPQLGTIFYPS 106
LF + +G + +DTG+ L W+ C C++ C P + S
Sbjct: 78 FLFLAQVGVGSFQEKSHRTHFKTYYFQIDTGNELSWIQCEGCQNKGNMCFPHKDPPYTSS 137
Query: 107 RSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDEST 166
+S SY V C+ +H P +C + C Y Y G TS ++ E TF +
Sbjct: 138 QSKSYKPVSCN-QHSFCEP-NQCK--EGLCAYNVTYGPGSYTSGNLANETFTFYSNHGKH 193
Query: 167 IHVQDVVFGCGF-STNRNFKF-------SGIFGLGIGRSSLVSQLNS----SFSYCIGSL 214
++ + FGC S N + F SG+ G+G G S ++QL S FSYCI +
Sbjct: 194 TALKSISFGCSTDSRNMIYAFLLDKNPVSGVLGMGWGPRSFLAQLGSISHGKFSYCITA- 252
Query: 215 HDPDYLHNKLILGDGAIIDEGDATPLQFID----GHYYITLEAISVDGRMLDINP-NIFK 269
+ HN + ++ + + + Y++ L ISV+G L+I ++
Sbjct: 253 ---NNTHNTYLRFGKHVVKSKNLQTTKIMQVKPSAAYHVNLLGISVNGVKLNITKTDLAV 309
Query: 270 RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS----YSWPDKLCYHGIMS 325
R D G +ID+GT T LVK ++ L + L Q + LCY +
Sbjct: 310 RKDGSRGCIIDAGTLATLLVKPIFDTLHTALSNHLSSNQNLKRWVIHKLHKDLCYEQLSD 369
Query: 326 SDLKGFPTVRFHFRGGAKLALEKDSMFYQPR---PDAFCMAVNPASINNRYVNFSLIGMM 382
+ K P V FH A L ++ +++F + FC+++ S +++ ++IG
Sbjct: 370 AGRKNLPVVTFHLE-NADLEVKPEAIFLFREFEGKNVFCLSM--LSDDSK----TIIGAY 422
Query: 383 AQQFYNVGYDIGRKQKTFQRMDCE 406
Q YD + +F DCE
Sbjct: 423 QQMKQKFVYDTKARVLSFGPEDCE 446
>gi|6850312|gb|AAF29389.1|AC009999_9 Contains similarity to nucellin from Hordeum vulgare gb|U87148.
ESTs gb|T22068, gb|F14251, gb|F14237, gb|F14242 come
from this gene [Arabidopsis thaliana]
Length = 388
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 85/304 (27%), Positives = 134/304 (44%), Gaps = 38/304 (12%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ--LG---TIFYPSRSSSYAYV 114
L+Y I IG P + +DTGS ++WV+C C+ C + LG T++ S S V
Sbjct: 79 LYYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLV 138
Query: 115 PCDSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD---ESTIH 168
CD + C P + C A C Y ++Y G T+ + + + + + ++
Sbjct: 139 SCDDDFCYQISGGPLSGCKA-NMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTA 197
Query: 169 VQDVVFGCG------FSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHD 216
V+FGCG ++ GI G G SS++SQL SS F++C+
Sbjct: 198 NGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCL----- 252
Query: 217 PDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
D + I G ++ + + TPL HY + + A+ V L I ++F+ D G
Sbjct: 253 -DGRNGGGIFAIGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLFQPGDRKG 311
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
+ IDSGT + +L + YE L ++ E D C+ D +GFP V
Sbjct: 312 AI-IDSGTTLAYLPEIIYEPL-----VKKEPALKVHIVDKDYKCFQYSGRVD-EGFPNVT 364
Query: 336 FHFR 339
FHF
Sbjct: 365 FHFE 368
>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family
[Arabidopsis thaliana]
Length = 449
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 87/356 (24%), Positives = 154/356 (43%), Gaps = 43/356 (12%)
Query: 38 EKIRSHNTYQ-AQILPSNDI---------SNSLFYVNISIGQPPVPQFTAMDTGSSLLWV 87
E +SH+T + +++L S D+ S L++ I +G PP +DTGS +LW+
Sbjct: 41 EHFKSHDTRRHSRMLASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILWI 100
Query: 88 HCYPCRDCSPQLG-----TIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLY 142
+C PC C + ++F + SS+ V CD + C + + C Y +Y
Sbjct: 101 NCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVGCDDDFCSFISQSDSCQPALGCSYHIVY 160
Query: 143 LIGPETSVFVSTEQLTFKNT--DESTIHV-QDVVFGCGFST-----NRNFKFSGIFGLGI 194
+ + LT + D T + Q+VVFGCG N + G+ G G
Sbjct: 161 ADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQ 220
Query: 195 GRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFIDGHY 247
+S++SQL ++ FS+C+ D + I G + + TP+ HY
Sbjct: 221 SNTSVLSQLAATGDAKRVFSHCL------DNVKGGGIFAVGVVDSPKVKTTPMVPNQMHY 274
Query: 248 YITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGE 307
+ L + VDG LD+ +I + GG ++DSGT + + K Y++L + ++ R +
Sbjct: 275 NVMLMGMDVDGTSLDLPRSIVRN----GGTIVDSGTTLAYFPKVLYDSLIETILAR---Q 327
Query: 308 QMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMA 363
++ + + ++ + FP V F F KL + + + +C
Sbjct: 328 PVKLHIVEETFQCFSFSTNVDEAFPPVSFEFEDSVKLTVYPHDYLFTLEEELYCFG 383
>gi|9759559|dbj|BAB11161.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|21553652|gb|AAM62745.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|109134179|gb|ABG25087.1| At5g07030 [Arabidopsis thaliana]
Length = 439
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 110/424 (25%), Positives = 167/424 (39%), Gaps = 41/424 (9%)
Query: 1 LIHRDSILSPYNNPNE-NPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSND---- 55
+ H DS SP+ + + + RV + + ARL YL + + ++P
Sbjct: 39 IFHIDSPCSPFKSSSPLSWEARVLQTLAQDQARLQYLSSLVAGRS-----VVPIASGRQM 93
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
+ ++ + V IG P P AMDT S + W+ C C C T F P++S+S+ V
Sbjct: 94 LQSTTYIVKALIGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSN--TAFSPAKSTSFKNVS 151
Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
C + C+ P C A C + Y +S+ + Q T + + ++ FG
Sbjct: 152 CSAPQCKQVPNPTCGA--RACSFNLTY---GSSSIAANLSQDTIRLAAD---PIKAFTFG 203
Query: 176 C-------GFSTNRNFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGD 228
C G G S S S+FSYC+ S + L LG
Sbjct: 204 CVNKVAGGGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPSFRSLTF-SGSLRLGP 262
Query: 229 GAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDS-GGGVMIDSGTD 284
+ T L YY+ L AI V +++D+ P + S G G + DSGT
Sbjct: 263 TSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTV 322
Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD-KLCYHGIMSSDLKGFPTVRFHFRGGAK 343
T L K YEA+R+E R++ S CY G + PT+ F F+G
Sbjct: 323 YTRLAKPVYEAVRNEFRKRVKPTTAVVTSLGGFDTCYSGQVK-----VPTITFMFKGVNM 377
Query: 344 LALEKDSMFYQPRPDAFCMAVNPASIN-NRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
+ M + C+A+ A N N VN +I M QQ + V D+ + R
Sbjct: 378 TMPADNLMLHSTAGSTSCLAMAAAPENVNSVVN--VIASMQQQNHRVLIDVPNGRLGLAR 435
Query: 403 MDCE 406
C
Sbjct: 436 ERCS 439
>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
Length = 469
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 96/360 (26%), Positives = 155/360 (43%), Gaps = 27/360 (7%)
Query: 55 DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYV 114
D + + SIG PP DTGS L+W C + + ++P+ SS++ +
Sbjct: 94 DGGGGAYDMEFSIGTPPQKLTALADTGSDLIWTKCDAGGGAAWGGSSSYHPNASSTFTRL 153
Query: 115 PCDSEHC---RYFPYARCSAYKHRCIYTQLYLIGPE---TSVFVSTEQLTFKNTDESTIH 168
PC C R + ARC+A C Y Y +G + T F+ +E T
Sbjct: 154 PCSDRLCAALRSYSLARCAAGGAECDYKYAYGLGDDPDFTQGFLGSETFTLGGD-----A 208
Query: 169 VQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS-SFSYCIGSLHDPDYLHNKLIL 226
V V FGC + ++ + +G+ GLG G SLVSQL++ +F YC+ + + L+
Sbjct: 209 VPGVGFGCTTALEGDYGEGAGLVGLGRGPLSLVSQLDAGTFMYCLTADASK---ASPLLF 265
Query: 227 GDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVT 286
G A + G +Q T A+++ R + I GGV+ DSGT +T
Sbjct: 266 GALATM-TGAGAGVQSTGLLASTTFYAVNL--RSITIGSATTAGVGGPGGVVFDSGTTLT 322
Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
+L + AY + + + + + CY S+ L P + HF GGA +AL
Sbjct: 323 YLAEPAYTEAKAAFLSQTTSLTPVEGRYGFEACYEKPDSARL--IPAMVLHFDGGADMAL 380
Query: 347 EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
+ + C V R + S+IG + Q Y V +D+ + +FQ +C+
Sbjct: 381 PVANYVVEVDDGVVCWVV------QRSPSLSIIGNIMQMNYLVLHDVRKSVLSFQPANCD 434
>gi|79507883|ref|NP_196320.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332003717|gb|AED91100.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 455
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 110/424 (25%), Positives = 167/424 (39%), Gaps = 41/424 (9%)
Query: 1 LIHRDSILSPYNNPNE-NPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSND---- 55
+ H DS SP+ + + + RV + + ARL YL + + ++P
Sbjct: 55 IFHIDSPCSPFKSSSPLSWEARVLQTLAQDQARLQYLSSLVAGRS-----VVPIASGRQM 109
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
+ ++ + V IG P P AMDT S + W+ C C C T F P++S+S+ V
Sbjct: 110 LQSTTYIVKALIGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSN--TAFSPAKSTSFKNVS 167
Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
C + C+ P C A C + Y +S+ + Q T + + ++ FG
Sbjct: 168 CSAPQCKQVPNPTCGA--RACSFNLTY---GSSSIAANLSQDTIRLAAD---PIKAFTFG 219
Query: 176 C-------GFSTNRNFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGD 228
C G G S S S+FSYC+ S + L LG
Sbjct: 220 CVNKVAGGGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPSFRSLTF-SGSLRLGP 278
Query: 229 GAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDS-GGGVMIDSGTD 284
+ T L YY+ L AI V +++D+ P + S G G + DSGT
Sbjct: 279 TSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTV 338
Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD-KLCYHGIMSSDLKGFPTVRFHFRGGAK 343
T L K YEA+R+E R++ S CY G + PT+ F F+G
Sbjct: 339 YTRLAKPVYEAVRNEFRKRVKPTTAVVTSLGGFDTCYSGQVK-----VPTITFMFKGVNM 393
Query: 344 LALEKDSMFYQPRPDAFCMAVNPASIN-NRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
+ M + C+A+ A N N VN +I M QQ + V D+ + R
Sbjct: 394 TMPADNLMLHSTAGSTSCLAMAAAPENVNSVVN--VIASMQQQNHRVLIDVPNGRLGLAR 451
Query: 403 MDCE 406
C
Sbjct: 452 ERCS 455
>gi|21717160|gb|AAM76353.1|AC074196_11 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433304|gb|AAP54833.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125575544|gb|EAZ16828.1| hypothetical protein OsJ_32300 [Oryza sativa Japonica Group]
Length = 419
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 104/425 (24%), Positives = 175/425 (41%), Gaps = 64/425 (15%)
Query: 19 AHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDI------SNSLFYVNISIGQPPV 72
AH ++RG++ Q+ +R A P S + + N +IG PP
Sbjct: 23 AHGLRRGLD---------QQGMRGRILADATAAPPGGAVVPLHWSGAHYVANFTIGTPPQ 73
Query: 73 PQFTAMDTGSSLLWVHCYPCRD--CSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCS 130
+D L+W C CR C Q +F PS S++Y C S C+ P CS
Sbjct: 74 AVSGIVDLSGELVWTQCAACRSSGCFKQELPVFDPSASNTYRAEQCGSPLCKSIPTRNCS 133
Query: 131 AYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF---- 186
C Y + G +T ST+ + N + + FGC +++ +
Sbjct: 134 G-DGECGYEAPSMFG-DTFGIASTDAIAIGNAEGR------LAFGCVVASDGSIDGAMDG 185
Query: 187 -SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGA-IIDEGDATPLQFI 243
SG GLG SLV Q N ++FSYC+ +LH P + L LG A + G + P +
Sbjct: 186 PSGFVGLGRTPWSLVGQSNVTAFSYCL-ALHGPGK-KSALFLGASAKLAGAGKSNPPTPL 243
Query: 244 -------------DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVM----IDSGTDVT 286
D +Y + LE I + D+ SGGG + +++ ++
Sbjct: 244 LGQHASNTSDDGSDPYYTVQLEGI----KAGDV---AVAAASSGGGAITVLQLETFRPLS 296
Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
+L AY+AL V L M + P LC+ ++ + G P + F F+GGA L
Sbjct: 297 YLPDAAYQALEKVVTAALGSPSMANPPEPFDLCFQ---NAAVSGVPDLVFTFQGGATLTA 353
Query: 347 EKDSMFYQP--RPDAFCMAV-NPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
+ C+++ + +++ S++G + Q+ + +D+ ++ +F+
Sbjct: 354 QPSKYLLGDGNGNGTVCLSILSSTRLDSADDGVSILGSLLQENVHFLFDLEKETLSFEPA 413
Query: 404 DCEVL 408
DC L
Sbjct: 414 DCSSL 418
>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 492
Score = 97.8 bits (242), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 94/315 (29%), Positives = 154/315 (48%), Gaps = 35/315 (11%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLGTI--FYPSRSSSYAYVP 115
L++ + +G PP+ +DTGS +LWV+C C C S LG F+ + SSS + +
Sbjct: 78 LYFTKVKLGTPPMEFTVQIDTGSDILWVNCNSCNGCPRSSGLGIQLNFFDASSSSSSSLV 137
Query: 116 ------CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT-DESTI- 167
C+S +C ++C YT Y G TS + +E + F +S I
Sbjct: 138 SCSDPICNSAFQT--TATQCLTQSNQCSYTFQYGDGSGTSGYYVSESMYFDMVMGQSMIA 195
Query: 168 -HVQDVVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLH 215
VVFGC G T + GIFG G G S++SQL++ FS+C+
Sbjct: 196 NSSASVVFGCSTYQSGDLTKSDHAIDGIFGFGPGDLSVISQLSARGITPKVFSHCLKGEG 255
Query: 216 DPDYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG 274
+ + L+LG+ +++ G +PL HY + L++ISV+G+ L I+P++F +
Sbjct: 256 NGGGI---LVLGE--VLEPGIVYSPLVPSQPHYNLYLQSISVNGQTLPIDPSVFATSIN- 309
Query: 275 GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTV 334
G +IDSGT + +LV+EAY + + + S ++ CY + +S + FP V
Sbjct: 310 RGTIIDSGTTLAYLVEEAYTPFVSAITAAVSQSVTPTISKGNQ-CYL-VSTSVGEIFPLV 367
Query: 335 RFHFRGGAKLALEKD 349
+F G A + L+ +
Sbjct: 368 SLNFAGSASMVLKPE 382
>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
Length = 351
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 101/363 (27%), Positives = 147/363 (40%), Gaps = 36/363 (9%)
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DCSPQLGTIFYPSRSSSYAYV 114
I + + + + G P Q DTGS + W+ C PC C Q +F PS SS+Y V
Sbjct: 11 IGSGNYVITVGFGTPTRTQTVVFDTGSDVNWLQCKPCAVRCYAQQEPLFDPSLSSTYRNV 70
Query: 115 PCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
C C CS+ C+Y Y G T F++ + TF T ++ +F
Sbjct: 71 SCTEPACVGLSTRGCSS--STCLYGVFYGDGSSTIGFLAMD--TFMLTPAQ--KFKNFIF 124
Query: 175 GCGFSTNRNFKFSGIFGL-GIGRSSLVS-------QLNSSFSYCIGSLHDPDYLHN---- 222
GCG N F G GL G+GRSS S L + FSYC+ S N
Sbjct: 125 GCG--QNNTGLFQGTAGLVGLGRSSTYSLNSQVAPSLGNVFSYCLPSTSSATGYLNIGNP 182
Query: 223 KLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSG 282
+ G A++ + L FID L ISV G L ++ +F+ G +IDSG
Sbjct: 183 QNTPGYTAMLTDTRVPTLYFID------LIGISVGGTRLSLSSTVFQSV----GTIIDSG 232
Query: 283 TDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGA 342
T +T L AY AL+ V + + CY ++ + +P + HF G
Sbjct: 233 TVITRLPPTAYSALKTAVRAAMTQYTLAPAVTILDTCYDFSRTTSVV-YPVIVLHF-AGL 290
Query: 343 KLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
+ + +F+ C+A + N +IG + Q V YD K+ F
Sbjct: 291 DVRIPATGVFFVFNSSQVCLAF---AGNTDSTMIGIIGNVQQLTMEVTYDNELKRIGFSA 347
Query: 403 MDC 405
C
Sbjct: 348 GAC 350
>gi|326500408|dbj|BAK06293.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 475
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 98/350 (28%), Positives = 142/350 (40%), Gaps = 40/350 (11%)
Query: 74 QFTAMDTGSSLLWVHCYPCR--DCSPQLGTIFYPSRSSSYAYVPCDSEHCRYF-PYAR-C 129
Q A+DT + W+ C PC C PQ +F P+ SS+ A V C S CR PY C
Sbjct: 148 QTMAIDTTVDVPWIQCAPCPIPQCYPQRDPLFDPTTSSTAAAVRCRSPACRSLGPYGNGC 207
Query: 130 S--AYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFK-- 185
S + C Y Y T+ T+ LT T V++ FGC + F
Sbjct: 208 SNRSANAECRYLIEYSDDRATAGTYMTDTLTISGTTA----VRNFRFGCSHAVRGRFSDL 263
Query: 186 FSGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGD---AT 238
+G LG G SL++Q L ++FSYC+ +L +G A + T
Sbjct: 264 TAGTMSLGGGAQSLLAQTARSLGNAFSYCVPQASASGFLS----IGGPATTNSTTVFATT 319
Query: 239 PL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEA 295
PL Y + L+ I V GR L I P F G ++DS +T L AY A
Sbjct: 320 PLVRSAINPSLYLVRLQGIVVAGRRLGIPPVAFS-----AGAVMDSSAVITQLPPTAYRA 374
Query: 296 LRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQP 355
LR + + CY + ++++ P V F GGA + L+ ++
Sbjct: 375 LRRAFRNAMRAYPRSGATGTLDTCYDFLGLTNVR-VPAVSLVFGGGAVVVLDPPAVMI-- 431
Query: 356 RPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
C+A S + + IG + QQ + V YD+ F+R C
Sbjct: 432 ---GGCLAFTATSSD---LALGFIGNVQQQTHEVLYDVAAGGVGFRRGAC 475
>gi|356508308|ref|XP_003522900.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 439
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 98/363 (26%), Positives = 142/363 (39%), Gaps = 31/363 (8%)
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
I + + V IG PP AMDT + W+ C C C+ T+F P +S+++ V
Sbjct: 93 IQSPTYIVRAKIGSPPQTLLLAMDTSNDAAWIPCTACDGCT---STLFAPEKSTTFKNVS 149
Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
C S C P C C + Y +S+ + Q T +T + D FG
Sbjct: 150 CGSPQCNQVPNPSCG--TSACTFNLTY---GSSSIAANVVQDTVT---LATDPIPDYTFG 201
Query: 176 C-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGA 230
C G S G S + S+FSYC+ S ++ L LG A
Sbjct: 202 CVAKTTGASAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNF-SGSLRLGPVA 260
Query: 231 IIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVT 286
TPL YY+ L AI V +++DI P +G G + DSGT T
Sbjct: 261 QPIRIKYTPLLKNPRRSSLYYVNLVAIRVGRKVVDIPPEALAFNAATGAGTVFDSGTVFT 320
Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPD----KLCYHGIMSSDLKGFPTVRFHFRGGA 342
LV AY A+RDE R+ + + CY + + PT+ F F G
Sbjct: 321 RLVAPAYTAVRDEFQRRVAIAAKANLTVTSLGGFDTCYTVPIVA-----PTITFMFSGMN 375
Query: 343 KLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
E + + + C+A+ A N V ++I M QQ + V YD+ + R
Sbjct: 376 VTLPEDNILIHSTAGSTTCLAMASAPDNVNSV-LNVIANMQQQNHRVLYDVPNSRLGVAR 434
Query: 403 MDC 405
C
Sbjct: 435 ELC 437
>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 511
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 110/434 (25%), Positives = 179/434 (41%), Gaps = 57/434 (13%)
Query: 17 NPAHRVQRGINISIARLAYLQEKIRSHNT--YQAQILPSNDISNSLFYVNISIGQPPVPQ 74
+P + ++ S+ R +L+ NT + P S + V+++ G PP
Sbjct: 89 DPFKTINLLLSASLNRAQHLKTPQSKSNTSIQNVSLFPR---SYGAYSVSLAFGTPPQNL 145
Query: 75 FTAMDTGSSLLWVHC---YPCRDCS-----PQLGTIFYPSRSSSYAYVPCDSEHCRYF-- 124
DTGSSL+W C Y C CS P + F P SSS V C + C +
Sbjct: 146 SFIFDTGSSLVWFPCTAGYRCSRCSFPYVDPATISKFVPKLSSSVKVVGCRNPKCAWIFG 205
Query: 125 PYAR-----CSAYKHRCI-----YTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
P + C++ +C Y Y G + +S E L +N V D +
Sbjct: 206 PNLKSRCRNCNSKSRKCSDSCPGYGLQYGSGATAGILLS-ETLDLENK-----RVPDFLV 259
Query: 175 GCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSL-HDPDYLHNKLILGDGAII 232
GC S + +GI G G G SL SQ+ FS+C+ S D + + L+L G+
Sbjct: 260 GC--SVMSVHQPAGIAGFGRGPESLPSQMRLKRFSHCLVSRGFDDSPVSSPLVLDSGSES 317
Query: 233 DEGDATPLQF-------------IDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVM 278
DE + +YY++L I + G+ + D +G GG +
Sbjct: 318 DESKTKSFIYAPFRENPSVSNAAFREYYYLSLRRILIGGKPVKFPYKYLVPDSTGNGGAI 377
Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRL----EGEQMRSYSWPDKLCYHGIMSSDLKGFPTV 334
IDSG+ T+L K +EA+ DE+ +L + + + S + C++ + FP V
Sbjct: 378 IDSGSTFTFLDKPIFEAIADELEKQLVKYPRAKDVEAQSG-LRPCFNIPKEEESAEFPDV 436
Query: 335 RFHFRGGAKLALEKD---SMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
F+GG KL+L + +M M + A + ++G QQ V Y
Sbjct: 437 VLKFKGGGKLSLAAENYLAMVTDEGVVCLTMMTDEAVVGGGGGPAIILGAFQQQNVLVEY 496
Query: 392 DIGRKQKTFQRMDC 405
D+ +++ F++ C
Sbjct: 497 DLAKQRIGFRKQKC 510
>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 640
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 99/392 (25%), Positives = 166/392 (42%), Gaps = 38/392 (9%)
Query: 35 YLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD 94
+LQ H+ L + + N + + IG PP +DTGS++ +V C C+
Sbjct: 67 HLQGSQSEHHPNARMRLFDDLLRNGYYTTRLWIGTPPQRFALIVDTGSTVTYVPCSTCKH 126
Query: 95 CSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVST 154
C F P S +Y V C + C + +C Y + Y +S +
Sbjct: 127 CGSHQDPKFRPEASETYQPVKCTWQ-------CNCDDDRKQCTYERRYAEMSTSSGVLGE 179
Query: 155 EQLTFKNTDESTIHVQDVVFGCGFSTN---RNFKFSGIFGLGIGRSSLVSQL------NS 205
+ ++F N +S + Q +FGC N + GI GLG G S++ QL +
Sbjct: 180 DVVSFGN--QSELSPQRAIFGCENDETGDIYNQRADGIMGLGRGDLSIMDQLVEKKVISD 237
Query: 206 SFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINP 265
+FS C G + + A + + P++ +Y I L+ I V G+ L +NP
Sbjct: 238 AFSLCYGGMGVGGGAMVLGGISPPADMVFTHSDPVR--SPYYNIDLKEIHVAGKRLHLNP 295
Query: 266 NIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK----LCYH 321
+F D G ++DSGT +L + A+ A + +M E ++ S PD +C+
Sbjct: 296 KVF---DGKHGTVLDSGTTYAYLPESAFLAFKHAIM--KETHSLKRISGPDPHYNDICFS 350
Query: 322 GI---MSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP--DAFCMAVNPASINNRYVNF 376
G +S K FP V F G KL+L ++ ++ A+C+ V +N
Sbjct: 351 GAEINVSQLSKSFPVVEMVFGNGHKLSLSPENYLFRHSKVRGAYCLGV----FSNGNDPT 406
Query: 377 SLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
+L+G + + V YD + F + +C L
Sbjct: 407 TLLGGIVVRNTLVMYDREHSKIGFWKTNCSEL 438
>gi|356503843|ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 474
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 114/450 (25%), Positives = 189/450 (42%), Gaps = 69/450 (15%)
Query: 7 ILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNIS 66
++ P+++ + +P H ++ + S+ R +L K R++N+ P+ S + ++++
Sbjct: 41 LIKPHSS-DSDPFHSLKFAASASLTRAHHL--KHRNNNSPSVATTPAYPKSYGGYSIDLN 97
Query: 67 IGQPPVPQFTAMDTGSSLLWVHC---YPCRDCS-PQLGT----IFYPSRSSSYAYVPCDS 118
+G PP +DTGSSL+W C Y C C+ P + T F P SS+ + C +
Sbjct: 98 LGTPPQTSPFVLDTGSSLVWFPCTSRYLCSHCNFPNIDTTKIPTFIPKNSSTAKLLGCRN 157
Query: 119 EHCRY-------FPYARCSAYKHRC-----IYTQLYLIGPETSVFVSTEQLTFKNTDEST 166
C Y F +C C Y Y +G T+ F+ + L F
Sbjct: 158 PKCGYIFGSDVQFRCPQCKPESQNCSLTCPAYIIQYGLG-STAGFLLLDNLNFPGKT--- 213
Query: 167 IHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNK-L 224
V + GC + R + SGI G G G+ SL SQ+N FSYC+ S D + L
Sbjct: 214 --VPQFLVGCSILSIR--QPSGIAGFGRGQESLPSQMNLKRFSYCLVSHRFDDTPQSSDL 269
Query: 225 ILGDGAIIDEGDA-------TPLQ--------FIDGHYYITLEAISVDGRMLDINPNIFK 269
+L I GD TP + +YY+TL + V G+ + I P F
Sbjct: 270 VL---QISSTGDTKTNGLSYTPFRSNPSTNNPAFKEYYYLTLRKVIVGGKDVKI-PYTFL 325
Query: 270 R--DDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK----LCYHGI 323
D GG ++DSG+ T++ + Y + E + +LE R+ + C++ I
Sbjct: 326 EPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFVKQLEKNYSRAEDAETQSGLSPCFN-I 384
Query: 324 MSSDLKGFPTVRFHFRGGAKLALEKDSMF-YQPRPDAFCMAV------NPASINNRYVNF 376
FP + F F+GGAK+ + F + C+ V P +
Sbjct: 385 SGVKTVTFPELTFKFKGGAKMTQPLQNYFSLVGDAEVVCLTVVSDGGAGPPKTTGPAI-- 442
Query: 377 SLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
++G QQ + + YD+ ++ F C
Sbjct: 443 -ILGNYQQQNFYIEYDLENERFGFGPRSCR 471
>gi|51091919|dbj|BAD35188.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|125596474|gb|EAZ36254.1| hypothetical protein OsJ_20576 [Oryza sativa Japonica Group]
gi|196212950|gb|ACG76111.1| S5 [Oryza sativa Japonica Group]
gi|340810891|gb|AEK75372.1| S5 [Oryza sativa]
gi|340810893|gb|AEK75373.1| S5 [Oryza sativa]
gi|340810899|gb|AEK75376.1| S5 [Oryza sativa]
gi|340810901|gb|AEK75377.1| S5 [Oryza sativa]
gi|340810933|gb|AEK75393.1| S5 [Oryza sativa]
gi|340810947|gb|AEK75400.1| S5 [Oryza sativa]
gi|340810949|gb|AEK75401.1| S5 [Oryza sativa]
gi|340810967|gb|AEK75410.1| S5 [Oryza sativa]
gi|340810969|gb|AEK75411.1| S5 [Oryza sativa]
gi|340810999|gb|AEK75426.1| S5 [Oryza rufipogon]
gi|340811017|gb|AEK75435.1| S5 [Oryza rufipogon]
gi|340811029|gb|AEK75441.1| S5 [Oryza nivara]
gi|340811051|gb|AEK75452.1| S5 [Oryza nivara]
gi|340811075|gb|AEK75464.1| S5 [Oryza nivara]
gi|340811077|gb|AEK75465.1| S5 [Oryza rufipogon]
gi|340811085|gb|AEK75469.1| S5 [Oryza nivara]
gi|340811096|gb|AEK75474.1| S5 [Oryza rufipogon]
gi|340811100|gb|AEK75476.1| S5 [Oryza rufipogon]
gi|340811114|gb|AEK75483.1| S5 [Oryza nivara]
Length = 472
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 115/410 (28%), Positives = 175/410 (42%), Gaps = 68/410 (16%)
Query: 37 QEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYP----C 92
+E+I S ++ + ++ + I++ LF + +S+G+PPV A+DTGS+L WV C P C
Sbjct: 90 EEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHC 149
Query: 93 RDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPY------ARCSAYKHRCIYTQLYLIGP 146
S + G IF P RS + V C S C Y A C + C Y+ Y G
Sbjct: 150 HTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGW 209
Query: 147 ETSV-FVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN- 204
SV + T+ L ++ D++FGC + +GIFG G S QL
Sbjct: 210 AYSVGKMVTDTLRIGDS------FMDLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAG 263
Query: 205 -------SSFSYCIGSLH-DPDYLHNKLILG--DGAIIDEGDATPL--QFIDGHYYITLE 252
+ SYC+ + P Y +ILG D A +D G TPL Y +T+E
Sbjct: 264 YPDILSYKALSYCLPTDETKPGY----MILGRYDRAAMD-GGYTPLFRSINRPTYSLTME 318
Query: 253 AISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMI---------R 303
+ +G+ L S +++DSG T L + AL D+ + R
Sbjct: 319 MLIANGQRLVT---------SSSEMIVDSGAQRTSLWPSTF-ALLDKTITQAMSSIGYHR 368
Query: 304 LEGEQMRSY----SWPDKLCYHGIMS--SDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP 357
+ SY S D ++G ++ S+ P + F GGA LAL ++FY
Sbjct: 369 TSRARQESYICYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPH 428
Query: 358 DAFCM--AVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
CM A NPA + ++G + + +DI KQ F+ C
Sbjct: 429 RGLCMTFAQNPA------LRSQILGNRVTRSFGTTFDIQGKQFGFKYAVC 472
>gi|340810993|gb|AEK75423.1| S5 [Oryza rufipogon]
gi|340811015|gb|AEK75434.1| S5 [Oryza nivara]
gi|340811021|gb|AEK75437.1| S5 [Oryza nivara]
Length = 474
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 115/410 (28%), Positives = 175/410 (42%), Gaps = 68/410 (16%)
Query: 37 QEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYP----C 92
+E+I S ++ + ++ + I++ LF + +S+G+PPV A+DTGS+L WV C P C
Sbjct: 92 EEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHC 151
Query: 93 RDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPY------ARCSAYKHRCIYTQLYLIGP 146
S + G IF P RS + V C S C Y A C + C Y+ Y G
Sbjct: 152 HTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGW 211
Query: 147 ETSV-FVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN- 204
SV + T+ L ++ D++FGC + +GIFG G S QL
Sbjct: 212 AYSVGKMVTDTLRIGDS------FMDLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAG 265
Query: 205 -------SSFSYCIGSLH-DPDYLHNKLILG--DGAIIDEGDATPL--QFIDGHYYITLE 252
+ SYC+ + P Y +ILG D A +D G TPL Y +T+E
Sbjct: 266 YPDILSYKALSYCLPTDETKPGY----MILGRYDRAAMD-GGYTPLFRSINRPTYSLTME 320
Query: 253 AISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMI---------R 303
+ +G+ L S +++DSG T L + AL D+ + R
Sbjct: 321 MLIANGQRLVT---------SSSEMIVDSGAQRTSLWPSTF-ALLDKTITQAMSSIGYHR 370
Query: 304 LEGEQMRSY----SWPDKLCYHGIMS--SDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP 357
+ SY S D ++G ++ S+ P + F GGA LAL ++FY
Sbjct: 371 TSRARQESYICYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPH 430
Query: 358 DAFCM--AVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
CM A NPA + ++G + + +DI KQ F+ C
Sbjct: 431 RGLCMTFAQNPA------LRSQILGNRVTRSFGTTFDIQGKQFGFKYAVC 474
>gi|168030587|ref|XP_001767804.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680886|gb|EDQ67318.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 102/396 (25%), Positives = 164/396 (41%), Gaps = 57/396 (14%)
Query: 48 AQILPSNDISNSLFYVN-ISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTI---- 102
A++ +D+ +Y + + IG PP +DTGS++ +V C C C +
Sbjct: 26 ARMTLHDDLLTKGYYTSRVFIGTPPNEFALIVDTGSTVTYVPCSSCTHCGHHQASFSTHR 85
Query: 103 -------FYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTE 155
F P SSSY + C S C C + H+C Y ++Y + + +
Sbjct: 86 LFCRDPRFKPENSSSYQKIGCRSSDC---ITGLCDSNSHQCKYERMYAEMSTSKGVLGKD 142
Query: 156 QLTFKNTDESTIHVQDVVFGCGFSTNRNFKFS---GIFGLGIGRSSLVSQL------NSS 206
L F S + Q + FGC + + + GI GLG G S+V QL S
Sbjct: 143 LLDFGPA--SRLQSQLLSFGCETAESGDLYLQVADGIMGLGRGPLSIVDQLVGNGAIEDS 200
Query: 207 FSYCIGSLHDPDYLHNKLILG-----DGAIIDEGDATPLQFIDGHYYITLEAISVDGRML 261
FS C G + D ++LG G + + D + Y + L I V G L
Sbjct: 201 FSLCYGGM---DEGGGSMVLGAIPAPSGMVFAKSDPRRSNY----YNLELTEIQVQGASL 253
Query: 262 DINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD----K 317
++ N+F + G ++DSGT +L A+EA D V+ +L +++ PD
Sbjct: 254 KLDSNVF---NGKFGTILDSGTTYAYLPDRAFEAFTDAVVAQL--GSLQAVDGPDPNYPD 308
Query: 318 LCYHGIMSSDL---KGFPTVRFHFRGGAKLALEKDSMFYQ--PRPDAFCMAVNPASINNR 372
+CY G + K FP V F F K++L ++ ++ P A+C+ N+
Sbjct: 309 ICYAGAGTDTKELGKHFPLVDFVFAENQKVSLAPENYLFKHTKVPGAYCLGF----FKNQ 364
Query: 373 YVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
L G++ + V YD Q F + +C L
Sbjct: 365 DATTLLGGIIVRNML-VTYDRYNHQIGFLKTNCTEL 399
>gi|255581508|ref|XP_002531560.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223528821|gb|EEF30826.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 407
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 105/377 (27%), Positives = 161/377 (42%), Gaps = 43/377 (11%)
Query: 63 VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHC- 121
V++++G PP +DTGS L W++C + T F +RS SY +PC S C
Sbjct: 33 VSLTVGTPPQNVSMVIDTGSELSWLYCNKTTTTT-SYPTTFNQTRSISYRPIPCSSSTCT 91
Query: 122 ---RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG- 177
R F C T Y + ++++ +D + +VFGC
Sbjct: 92 NQTRDFSIPASCDSNSLCHATLSYADASSSEGNLASDTFHMGASD-----IPGMVFGCMD 146
Query: 178 --FSTN--RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAII 232
FS+N + K +G+ G+ G S VSQ+ FSYCI L+LG+
Sbjct: 147 SVFSSNSDEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISGTD----FSGMLLLGESNFT 202
Query: 233 DEGD---------ATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMIDS 281
+TPL + D Y + LE I V R+L I ++F+ D +G G M+DS
Sbjct: 203 WAVPLNYTPLVQISTPLPYFDRIAYTVQLEGIKVSDRLLPIPKSVFEPDHTGAGQTMVDS 262
Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD-------KLCYHGIMSSD-LKGFPT 333
GT T+L+ AY ALR E + + G +R PD LCY +S L PT
Sbjct: 263 GTQFTFLLGPAYTALRSEFLNQTTGF-LRVLEDPDFVFQGAMDLCYRVPISQRVLPRLPT 321
Query: 334 VRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR---YVNFSLIGMMAQQFYNVG 390
V F G ++ ++ P +V+ S N V +IG QQ +
Sbjct: 322 VSLVFNGAEMTVADERVLYRVPGEIRGNDSVHCLSFGNSDLLGVEAYVIGHHHQQNVWME 381
Query: 391 YDIGRKQKTFQRMDCEV 407
+D+ R + ++ C++
Sbjct: 382 FDLERSRIGLAQVRCDL 398
>gi|125532795|gb|EAY79360.1| hypothetical protein OsI_34488 [Oryza sativa Indica Group]
Length = 342
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 87/366 (23%), Positives = 146/366 (39%), Gaps = 62/366 (16%)
Query: 57 SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPC 116
S L+ N++IG PP P + +W C PCR C Q +F
Sbjct: 24 SQPLYMANLTIGTPPQPASAIIHLAGEFVWTQCSPCRRCFKQDLPLF------------- 70
Query: 117 DSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
+ Y+ ++ IG + + T + FGC
Sbjct: 71 -------------NRYEVETMFGDTSGIGGTDTFAIGTA-------------TASLAFGC 104
Query: 177 GFSTN--RNFKFSGIFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNKLILGDGAIID 233
+N + SG+ GLG SLV Q+N++ FSYC+ H + L+LG A +
Sbjct: 105 AMDSNIKQLLGASGVVGLGRTPWSLVGQMNATAFSYCLAP-HGAAGKKSALLLGASAKLA 163
Query: 234 EGDA---TPLQFID---GHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
G + TPL Y I LE I +++ PN G V++D+ V++
Sbjct: 164 GGKSAATTPLVNTSDDSSDYMIHLEGIKFGDVIIEPPPN-------GSVVLVDTIFGVSF 216
Query: 288 LVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCY-----HGIMSSDLKGFPTVRFHFRGGA 342
LV A+ A++ V + + M + + P LC+ +S L P V F+G A
Sbjct: 217 LVDAAFHAIKKAVTVAVGAAPMATPTKPFDLCFPKAAAAAGANSSLP-LPDVVLTFQGAA 275
Query: 343 KLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
L + Y C+A+ +++ N S++G + Q+ + +D+ ++ +F+
Sbjct: 276 ALTVPPSKYMYDAGNGTVCLAMMSSAMLNLTTELSILGRLHQENIHFLFDLDKETLSFEP 335
Query: 403 MDCEVL 408
DC L
Sbjct: 336 ADCSSL 341
>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
Length = 458
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 102/369 (27%), Positives = 157/369 (42%), Gaps = 36/369 (9%)
Query: 52 PSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DCSPQLGTIFYPSRSSS 110
P + + + +G P +DTGSSL W+ C PC C Q G +F P SSS
Sbjct: 112 PGTSVGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPRSSSS 171
Query: 111 YAYVPCDSEHCRYFPYA-----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES 165
YA V C + C A CS + CIY Y + ++S + ++F +T
Sbjct: 172 YASVSCSAPQCDALTTATLNPSTCST-SNVCIYQASYGDSSFSVGYLSKDTVSFGSTS-- 228
Query: 166 TIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYL 220
V + +GCG F + +G+ GL + SL+ QL SFSYC+ +
Sbjct: 229 ---VPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSGY 285
Query: 221 HNKLILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
+ G + TP+ D Y+I + I+V G+ L ++ + + S
Sbjct: 286 LSIGSYNPG----QYSYTPMAKSSLDDSLYFIKMTGITVAGKPLSVSASAY----SSLPT 337
Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFH 337
+IDSGT +T L + Y AL V ++G S C+ G +S L+ P V
Sbjct: 338 IIDSGTVITRLPTDVYSALSKAVAGAMKGTPRASAFSILDTCFQG-QASRLR-VPQVSMA 395
Query: 338 FRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
F GGA L L+ ++ C+A PA + ++IG QQ ++V YD+ +
Sbjct: 396 FAGGAALKLKATNLLVDVDSATTCLAFAPAR------SAAIIGNTQQQTFSVVYDVKNSK 449
Query: 398 KTFQRMDCE 406
F C
Sbjct: 450 IGFAAGGCS 458
>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
Length = 452
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 100/365 (27%), Positives = 153/365 (41%), Gaps = 29/365 (7%)
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
+ + V +G PP A+DT + W+ C C C F P+ S+SY VP
Sbjct: 105 LQTPTYVVRARLGTPPQQLLLAVDTSNDAAWIPCAGCAGCPTSSAPPFDPAASTSYRSVP 164
Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
C S C P A C C ++ Y ++S+ + Q + ++ V+ FG
Sbjct: 165 CGSPLCAQAPNAACPPGGKACGFSLTYA---DSSLQAALSQDSLAVAGDA---VKTYTFG 218
Query: 176 C-GFSTNRNFKFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGA 230
C +T G+ GLG G S +SQ +FSYC+ S ++ L LG
Sbjct: 219 CLQKATGTAAPPQGLLGLGRGPLSFLSQTRDMYQGTFSYCLPSFKSLNF-SGTLRLGRNG 277
Query: 231 IIDEGDATPLQFIDGH----YYITLEAISVDGRMLDINPNIFKRDD-SGGGVMIDSGTDV 285
TPL + H YY+ + I V +++ I P D +G G ++DSGT
Sbjct: 278 QPPRIKTTPL-LANPHRSSLYYVNMTGIRVGRKVVPIPPPALAFDPATGAGTVLDSGTMF 336
Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLA 345
T LV AY A+RDEV R+ G + S D C++ + +P V F G
Sbjct: 337 TRLVAPAYVAVRDEVRRRV-GAPVSSLGGFDT-CFN----TTAVAWPPVTLLFDGMQVTL 390
Query: 346 LEKDSMFYQPRPDAFC--MAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
E++ + + C MA P +N ++I M QQ + V +D+ + F R
Sbjct: 391 PEENVVIHSTYGTISCLAMAAAPDGVNTV---LNVIASMQQQNHRVLFDVPNGRVGFARE 447
Query: 404 DCEVL 408
C +
Sbjct: 448 RCTAV 452
>gi|168014188|ref|XP_001759635.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689174|gb|EDQ75547.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 485
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 111/443 (25%), Positives = 180/443 (40%), Gaps = 66/443 (14%)
Query: 8 LSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQ-------------AQILPSN 54
++P N N N + V+ N A + L E R + A+++ +
Sbjct: 32 VNPSENENGNGLNGVRIQSNCGSALVLPLVESKRHGHVVDRRFERRGRGLVEDARMVLHD 91
Query: 55 DISNSLFYVN-ISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTI---FYPSRSSS 110
D+ +Y + + IG P +DTGS++ +V C C C F P SSS
Sbjct: 92 DLLTKGYYTSRVFIGTPAQEFALIVDTGSTVTYVPCSSCTHCGHHQACFDPRFKPDNSSS 151
Query: 111 YAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
Y V C+S C C A H+C Y ++Y + + + L F N S +
Sbjct: 152 YQTVSCNSPDCI---TKMCDARVHQCKYERVYAEMSSSKGVLGKDLLGFGNG--SRLQPH 206
Query: 171 DVVFGCGFSTNRNFKFS---GIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLH 221
++FGC + + GI GLG G S+V QL SFS C G + +
Sbjct: 207 PLLFGCETAETGDLYLQHADGIMGLGRGPLSIVDQLVGTGAMEDSFSLCYGGMDE----- 261
Query: 222 NKLILGDGAIIDEGDATPLQFI--------DGHYYITLEAISVDGRMLDINPNIFKRDDS 273
G G+++ P + +Y + L I V G L++ +F +
Sbjct: 262 -----GGGSMVLGAIPPPPAMVFAKSDPNRSNYYNLELSEIQVQGVSLNVPSEVF---NG 313
Query: 274 GGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQM---RSYSWPDKLCYHGIMSSDL-- 328
G ++DSGT +L +A++A +D + +L Q S+PD +C+ G S
Sbjct: 314 RLGTVLDSGTTYAYLPDKAFDAFKDAITQQLGSLQAVPGPDPSYPD-VCFAGAGSDSKAL 372
Query: 329 -KGFPTVRFHFRGGAKLALEKDSMFYQ--PRPDAFCMAVNPASINNRYVNFSLIGMMAQQ 385
K FP V F F G K+ L ++ ++ P A+C+ N+ +L+G + +
Sbjct: 373 GKHFPPVDFVFSGNQKVFLAPENYLFKHTKVPGAYCLGF----FKNQDAT-TLLGGIVVR 427
Query: 386 FYNVGYDIGRKQKTFQRMDCEVL 408
V YD Q F + +C L
Sbjct: 428 NTLVTYDRANHQIGFFKTNCTNL 450
>gi|242073262|ref|XP_002446567.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
gi|241937750|gb|EES10895.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
Length = 453
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 116/425 (27%), Positives = 175/425 (41%), Gaps = 64/425 (15%)
Query: 22 VQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTG 81
++R + S+ R + + R +A ++P + V + IG P A+DT
Sbjct: 54 IRRAVQRSLDRPG-VAARNRKAVVGEAPLVPRG----GEYLVKLGIGTPQHYFSAAIDTA 108
Query: 82 SSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHR-CIYTQ 140
S L+W+ C PC C QL IF P SSSYA VPC S+ C RC + C Y
Sbjct: 109 SDLVWLQCQPCVSCYRQLDPIFNPRLSSSYAVVPCSSDTCSQLDGHRCDEDDDQACRYNY 168
Query: 141 LYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST--NRNFKFSGIFGLGIGRSS 198
Y T+ ++ ++L + H VV GC S+ + SG+ GL G S
Sbjct: 169 KYSGNAVTNGTLAIDKLAVGG---NVFHA--VVLGCSDSSVGGPPPQASGLVGLARGPLS 223
Query: 199 LVSQLN-SSFSYCIGSLHDP-DYLHNKLILGDGAIIDE----GDATPLQFIDGHYYITLE 252
L+SQL+ F YC L P KL+LG GA D D + Y +
Sbjct: 224 LLSQLSVRRFMYC---LPPPMSRTPGKLVLGAGAGADAVRNVSDRVTVTMSSSTRYPSYY 280
Query: 253 AISVDGRML-DINPNIFKRDDS--------------------GGGVMIDSGTDVTWLVKE 291
++ DG + D P +R S G+++D + +++L
Sbjct: 281 YLNFDGLAVGDQTPGTIRRPTSPPATGGGVGGGGGDGGSGANAYGMIVDVASTISFLEAS 340
Query: 292 AYEALRDEVMIRLEGEQMRSYSWPDK-----LCY---HGIMSSDLKGFPTVRFHFRGGAK 343
Y+ L D+ LE E + P LC+ G+ D PTV F G
Sbjct: 341 LYDELADD----LEEEIRLPRATPSTRLGLDLCFILPEGV-GIDRVYVPTVSMSF-DGRW 394
Query: 344 LALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
L LE+D +F + D M + + R S++G QQ +V Y++ R + TF +
Sbjct: 395 LELERDRLFLE---DGRMMCL----MIGRTSGVSILGNYQQQNMHVLYNLRRGKITFAKA 447
Query: 404 DCEVL 408
C+ L
Sbjct: 448 SCDSL 452
>gi|21717162|gb|AAM76355.1|AC074196_13 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433286|gb|AAP54824.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 397
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 85/359 (23%), Positives = 158/359 (44%), Gaps = 33/359 (9%)
Query: 64 NISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRY 123
N +IG PP +D L+W C C C Q +F P+ SS++ PC ++ C+
Sbjct: 57 NFTIGTPPQAASAFIDLTGELVWTQCSQCIHCFKQDLPVFVPNASSTFKPEPCGTDVCKS 116
Query: 124 FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRN 183
P +C++ C Y + +G T V+T+ +++ FGC +++ +
Sbjct: 117 IPTPKCAS--DVCAYDGVTGLGGHTVGIVATDTFAIGTAAPASLG-----FGCVVASDID 169
Query: 184 FKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDA--- 237
SG GLG SLV+Q+ + FSYC+ HD +++L LG A + G A
Sbjct: 170 TMGGPSGFIGLGRTPWSLVAQMKLTRFSYCLAP-HDTGK-NSRLFLGASAKLAGGGAWTP 227
Query: 238 ----TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSG-TDVTWLVKEA 292
+P + +Y I LE I + D + + ++ V++ + V+ LV
Sbjct: 228 FVKTSPNDGMSQYYPIELEEI----KAGDATITMPRGRNT---VLVQTAVVRVSLLVDSV 280
Query: 293 YEALRDEVMIRLEGEQMRS-YSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSM 351
Y+ + VM + + P ++C+ + + G P + F F+ GA L + +
Sbjct: 281 YQEFKKAVMASVGAAPTATPVGAPFEVCFP---KAGVSGAPDLVFTFQAGAALTVPPANY 337
Query: 352 FYQPRPDAFCMAVNPASINNRYV--NFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
+ D C++V ++ N +++G Q+ ++ +D+ + +F+ DC L
Sbjct: 338 LFDVGNDTVCLSVMSIALLNITALDGLNILGSFQQENVHLLFDLDKDMLSFEPADCSSL 396
>gi|238011160|gb|ACR36615.1| unknown [Zea mays]
Length = 461
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 104/413 (25%), Positives = 151/413 (36%), Gaps = 76/413 (18%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHC---------------YPCRDCSPQLGT---- 101
++V +G P P DTGS L WV C Y +P
Sbjct: 55 YFVRFRVGTPARPFLLVADTGSDLTWVKCRRHAAPAPAPAPAPGYNYGYGAPASNDSSSV 114
Query: 102 ---------IFYPSRSSSYAYVPCDSEHCRY---FPYARCSAYKHRCIYTQLYLIGPETS 149
+F P RS ++A +PC S+ C F A C C Y Y G
Sbjct: 115 SAAASSPARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTPGSPCAYEYRYKDGSAAR 174
Query: 150 VFVSTEQLTFK------NTDESTIHVQDVVFGCGFS-TNRNFKFS-GIFGLGIGRSSLVS 201
V T+ T + ++ VV GC S T +F S G+ LG S S
Sbjct: 175 GTVGTDSATIALSGRRAGKKQRRAKLRGVVLGCTTSYTGESFLASDGVLSLGYSNVSFAS 234
Query: 202 Q----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDA----------------TPLQ 241
+ FSYC+ P + L G + A TPL
Sbjct: 235 RAAARFGGRFSYCLVDHLAPRNATSYLTFGPNPAVSSASASRTACAGSAAAPGARQTPLL 294
Query: 242 F---IDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRD 298
+ Y + + +SVDG +L I P + GGG ++DSGT +T LV AY A+
Sbjct: 295 LDHRMRPFYAVAVNGVSVDGELLRI-PRLVWDVQKGGGAILDSGTSLTVLVSPAYRAVVA 353
Query: 299 EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG------FPTVRFHFRGGAKLALEKDSMF 352
+ +L G + P CY+ +S L G P + HF G A+L S
Sbjct: 354 ALGKKLVGLPRVAMD-PFDYCYN--WTSPLTGEDLAVAVPALAVHFAGSARLQPPPKSYV 410
Query: 353 YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
P C+ + + S+IG + QQ + +D+ ++ F+R C
Sbjct: 411 IDAAPGVKCIGLQ----EGDWPGVSVIGNILQQEHLWEFDLKNRRLRFKRSRC 459
>gi|218184944|gb|EEC67371.1| hypothetical protein OsI_34484 [Oryza sativa Indica Group]
Length = 396
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 97/369 (26%), Positives = 154/369 (41%), Gaps = 39/369 (10%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ N +IG PP P +D L+W C CR C Q +F P+ SS++ PC +
Sbjct: 45 YVANFTIGTPPQPASAIVDVAGELVWTQCSACRRCFKQDLPVFVPNASSTFKPEPCGTAV 104
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV--VFGCGF 178
C P CS C Y GP T + +T F TD I V FGC
Sbjct: 105 CESIPTRSCSG--DVCSYK-----GPPTQLRGNTSG--FAATDTFAIGTATVRLAFGCVV 155
Query: 179 STNRNF--KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
+++ + SG GLG SLV+Q+ + FSYC+ + ++L LG A +
Sbjct: 156 ASDIDTMDGPSGFIGLGRTPWSLVAQMKLTRFSYCLSPRNTGK--SSRLFLGSSAKLAGS 213
Query: 236 DATPLQ-FI------DG--HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVT 286
++T FI DG +Y ++L+AI + SGG +++ + + +
Sbjct: 214 ESTSTAPFIKTSPDDDGSNYYLLSLDAIRAGNTTI-------ATAQSGGILVMHTVSPFS 266
Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDK---LCYHGIMSSDLKGFPTVRFHFRGGAK 343
LV AY+A + V + G + P + LC+ P + F F+G A
Sbjct: 267 LLVDSAYKAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGAAA 326
Query: 344 LALEKDSMFYQ--PRPDAFCMAVNPASINNR--YVNFSLIGMMAQQFYNVGYDIGRKQKT 399
L + D C A+ + NR S++G + Q+ + YD+ ++ +
Sbjct: 327 LTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKETLS 386
Query: 400 FQRMDCEVL 408
F+ DC L
Sbjct: 387 FEPADCSSL 395
>gi|326490656|dbj|BAJ89995.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 111/400 (27%), Positives = 168/400 (42%), Gaps = 58/400 (14%)
Query: 32 RLAYLQEKIRSHNTYQAQILPSN-----DISNSL----FYVNISIGQPPVPQFTAMDTGS 82
R AY+ K N + S+ + SL + + + +G P V Q +DTGS
Sbjct: 89 RAAYITRKYSGVNGSAGDVEGSDVTVPTTLGTSLDTLEYLITVGMGSPAVAQTMLIDTGS 148
Query: 83 SLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLY 142
+ WV C PC C Q ++F PS SS+Y+ C S C CS+ +C YT Y
Sbjct: 149 DVSWVQCKPCSQCHSQADSLFDPSSSSTYSAFSCTSAACAQLRQRGCSS--SQCQYTVKY 206
Query: 143 LIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF---KFSGIFGLGIGRSSL 199
G S S++ L ++ V++ FGC S + N + +G+ GLG G SL
Sbjct: 207 GDGSTGSGTYSSDTLALGSST-----VENFQFGCSQSESGNLLQDQTAGLMGLGGGAESL 261
Query: 200 VSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLE 252
+Q +FSYC+ G ++ TP+ + +Y + L+
Sbjct: 262 ATQTAGTFGKAFSYCLPPTPGSSGFLTLGASTSGFVVK----TPMLRSTQVPSYYGVLLQ 317
Query: 253 AISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSY 312
AI V GR L+I + F G ++DSGT +T L + AY AL M+ Y
Sbjct: 318 AIRVGGRQLNIPASAFS-----AGSIMDSGTIITRLPRTAYSALSSAFK-----AGMKQY 367
Query: 313 SWPDKLCYHGIMSS--DLKG-----FPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVN 365
+ GI + D G PTV F GGA + L D + C+A
Sbjct: 368 PPAQPM---GIFDTCFDFSGQSSVSIPTVALVFSGGAVVDLASDGIIL-----GSCLAF- 418
Query: 366 PASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ N+ + +IG + Q+ + V YD+G F+ C
Sbjct: 419 --AANSDDTSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 456
>gi|357159746|ref|XP_003578546.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 530
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 103/375 (27%), Positives = 158/375 (42%), Gaps = 48/375 (12%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGT--------IFYPSRSSSY 111
L Y +++G P V A+DTGS L WV C C C+P ++ P +SS+
Sbjct: 98 LHYAVVALGTPNVTFLVALDTGSDLFWVPC-DCIKCAPLASPDYGDLKFDMYSPRKSSTS 156
Query: 112 AYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD--ESTIHV 169
VPC S C P A CSA + C Y+ YL +S V E + + T+ +S I
Sbjct: 157 RKVPCSSSLCD--PQADCSAASNSCPYSIQYLSENTSSKGVLVEDVLYLTTESGQSKITQ 214
Query: 170 QDVVFGCGFSTNRNFKFS----GIFGLGIGRSSLVSQLNS------SFSYCIGSLHDPDY 219
+ FGCG + +F S G+ GLG+ S+ S L S SFS C G +
Sbjct: 215 APITFGCGQVQSGSFLGSAAPNGLLGLGMDSKSVPSLLASKGIAANSFSMCFG-----ED 269
Query: 220 LHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
H ++ GD D+ + TPL + Y IS+ G M+ K D+ ++
Sbjct: 270 GHGRINFGDTGSSDQLE-TPLNIYKQNPYYN---ISITGAMVG-----GKSFDTKFSAVV 320
Query: 280 DSGTDVTWLVKEAYEALRDEVMIRL-EGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHF 338
DSGT T L Y + ++ E + S P + CY I + P +
Sbjct: 321 DSGTSFTALSDPMYTEITSTFNAQVKESRKHLDASMPFEYCYS-ISAQGAVNPPNISLTA 379
Query: 339 RGGAKLALEKDSMFY---QPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGR 395
+GG+ + + RP A+C+A+ + + VN LIG + +D R
Sbjct: 380 KGGSIFPVNGPIITITDTSSRPIAYCLAI----MKSEGVN--LIGENFMSGLKIVFDRER 433
Query: 396 KQKTFQRMDCEVLDD 410
++ +C D+
Sbjct: 434 LVLGWKTFNCYNFDN 448
>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
Length = 449
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 89/381 (23%), Positives = 162/381 (42%), Gaps = 33/381 (8%)
Query: 52 PSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC-YPC--RDCSPQLG------TI 102
P+ D ++V +G P DTGS L W+ C Y C R+CS + +
Sbjct: 74 PAADYGIGQYFVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRV 133
Query: 103 FYPSRSSSYAYVPCDSEHCR-----YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQL 157
F+ + SSS+ +PC ++ C+ F C C Y Y G F + E +
Sbjct: 134 FHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETV 193
Query: 158 TFKNTDESTIHVQDVVFGCGFS-TNRNFKFS-GIFGLGIGRSSL----VSQLNSSFSYCI 211
T + + + + +V+ GC S ++F+ + G+ GLG + S + FSYC+
Sbjct: 194 TVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCL 253
Query: 212 GSLHDPDYLHNKLILGDG----AIIDEGDATPLQF--IDGHYYITLEAISVDGRMLDINP 265
+ N L G A+++ T L ++ Y + + IS+ G ML I
Sbjct: 254 VDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPS 313
Query: 266 NIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIR-LEGEQMRSYSWPDKLCYHGIM 324
++ +GG ++ DSG+ +T+L + AY+ + + + L+ ++ P + C++
Sbjct: 314 EVWDVKGAGGTIL-DSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTG 372
Query: 325 SSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQ 384
+ P + FHF GA+ S C+ ++ + S++G + Q
Sbjct: 373 FEE-SLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGF----VSVAWPGTSVVGNIMQ 427
Query: 385 QFYNVGYDIGRKQKTFQRMDC 405
Q + +D+G K+ F C
Sbjct: 428 QNHLWEFDLGLKKLGFAPSSC 448
>gi|356528675|ref|XP_003532925.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Glycine max]
Length = 342
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 98/412 (23%), Positives = 159/412 (38%), Gaps = 112/412 (27%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
LIHRDS LSP+ NP+ P+ R I+ A L+ + N IL N N
Sbjct: 33 LIHRDSPLSPFYNPSLTPSER------ITDAALS------SNENKLPESILIPN---NGE 77
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ + + IG PPV + DTGS +WV C PC++C
Sbjct: 78 YLMRLYIGTPPVERLVIADTGSDFIWVQCSPCQNC------------------------- 112
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES-TIHVQDVVFGCGFS 179
+C+Y +Y T V TE L+F +T + T+ + +FGCG +
Sbjct: 113 --------------QCVYLNIYANKSFTIEVVGTETLSFDSTGGAQTVSFPNSIFGCGAN 158
Query: 180 TNRNF----KFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
N F K +G+ GL G+ SLVSQL + Y L + +I +G +
Sbjct: 159 NNLTFRSSDKATGLVGLVAGQLSLVSQLGAQIGYKFSYLK---FGSEAIITTNGVV---- 211
Query: 236 DATPLQFIDG--HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAY 293
+TPL Y++ LE +++ +++
Sbjct: 212 -STPLIIKPSLPLYFLNLEVVTIGQKVVPTE----------------------------- 241
Query: 294 EALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFY 353
L E ++ +P K C+ D P + F F G + K+ +
Sbjct: 242 ---------TLGVESVQDLPFPFKFCFP---YRDNMTVPAIAFQFTGASVALRPKNLLIK 289
Query: 354 QPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ +AV P++ + + S+ G++AQ + V YD+ K+ + DC
Sbjct: 290 LQDRNMLXLAVVPSASSLSVI--SIFGIIAQFDFQVLYDLDGKKVSVAPTDC 339
>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
Length = 525
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 102/362 (28%), Positives = 150/362 (41%), Gaps = 54/362 (14%)
Query: 78 MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYA------RCSA 131
+DTGS L WV C PC C Q +F PS S+SYA VPC++ C A C+
Sbjct: 181 VDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSCAT 240
Query: 132 Y--------KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRN 183
RC Y+ Y G + ++T+ + V VFGCG S NR
Sbjct: 241 VGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGAS-----VDGFVFGCGLS-NRG 294
Query: 184 FKFSGIFGL-GIGRS--SLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGD 236
F G GL G+GR+ SLVSQ FSYC+ + D + + GD + +
Sbjct: 295 L-FGGTAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAATSGDAAGSLSLGGDTSSYR--N 351
Query: 237 ATPLQFID--------GHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
ATP+ + Y++ + SV G + V++DSGT +T L
Sbjct: 352 ATPVSYTRMIADPAQPPFYFMNVTGASVGGAAVAAAGLGAAN------VLLDSGTVITRL 405
Query: 289 VKEAYEALRDEVMIRLEGEQMRS---YSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLA 345
Y A+R E + E+ + +S D CY+ + D P + GGA +
Sbjct: 406 APSVYRAVRAEFARQFGAERYPAAPPFSLLDA-CYN-LTGHDEVKVPLLTLRLEGGADMT 463
Query: 346 LEKDSMFYQPRPDA--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
++ M + R D C+A+ S ++ +IG Q+ V YD + F
Sbjct: 464 VDAAGMLFMARKDGSQVCLAMASLSFEDQT---PIIGNYQQKNKRVVYDTVGSRLGFADE 520
Query: 404 DC 405
DC
Sbjct: 521 DC 522
>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
Length = 524
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 102/362 (28%), Positives = 150/362 (41%), Gaps = 54/362 (14%)
Query: 78 MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYA------RCSA 131
+DTGS L WV C PC C Q +F PS S+SYA VPC++ C A C+
Sbjct: 180 VDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSCAT 239
Query: 132 Y--------KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRN 183
RC Y+ Y G + ++T+ + V VFGCG S NR
Sbjct: 240 VGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGAS-----VDGFVFGCGLS-NRG 293
Query: 184 FKFSGIFGL-GIGRS--SLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGD 236
F G GL G+GR+ SLVSQ FSYC+ + D + + GD + +
Sbjct: 294 L-FGGTAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAATSGDAAGSLSLGGDTSSYR--N 350
Query: 237 ATPLQFID--------GHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
ATP+ + Y++ + SV G + V++DSGT +T L
Sbjct: 351 ATPVSYTRMIADPAQPPFYFMNVTGASVGGAAVAAAGLGAAN------VLLDSGTVITRL 404
Query: 289 VKEAYEALRDEVMIRLEGEQMRS---YSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLA 345
Y A+R E + E+ + +S D CY+ + D P + GGA +
Sbjct: 405 APSVYRAVRAEFARQFGAERYPAAPPFSLLDA-CYN-LTGHDEVKVPLLTLRLEGGADMT 462
Query: 346 LEKDSMFYQPRPDA--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
++ M + R D C+A+ S ++ +IG Q+ V YD + F
Sbjct: 463 VDAAGMLFMARKDGSQVCLAMASLSFEDQT---PIIGNYQQKNKRVVYDTVGSRLGFADE 519
Query: 404 DC 405
DC
Sbjct: 520 DC 521
>gi|297597434|ref|NP_001043968.2| Os01g0696800 [Oryza sativa Japonica Group]
gi|255673588|dbj|BAF05882.2| Os01g0696800 [Oryza sativa Japonica Group]
Length = 334
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 100/328 (30%), Positives = 147/328 (44%), Gaps = 39/328 (11%)
Query: 102 IFYPSRSSSYAYVPCDSEHCRYFPYARCS------AYKHRCIYTQLYLIGPETSVFVS-- 153
+ YP+ SSS A+V C C P CS + C Y Y +T +
Sbjct: 14 LLYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGGSGSGNCSYHYAYGNARDTHHYTEGI 73
Query: 154 --TEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF-SGIFGLGIGRSSLVSQLN-SSFSY 209
TE TF + + FGC + F SG+ GLG G+ SLV+QLN +F Y
Sbjct: 74 LMTETFTFG---DDAAAFPGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQLNVEAFGY 130
Query: 210 CIGS-LHDPDYLH-NKLILGDGAIIDEGDATPL------QFIDGHYYITLEAISVDGRML 261
+ S L P + L G D +TPL Q + YY+ L ISV G+++
Sbjct: 131 RLSSDLSAPSPISFGSLADVTGGNGDSFMSTPLLTNPVVQDLP-FYYVGLTGISVGGKLV 189
Query: 262 DINPNIFKRDDS--GGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL- 318
I F D S GGV+ DSGT +T L AY +RDE++ ++ ++ + D L
Sbjct: 190 QIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDDDLI 249
Query: 319 CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS----MFYQPRPDAFCMAVNPASINNRYV 374
C+ G S FP++ HF GGA + L ++ M Q A C +V +S
Sbjct: 250 CFTG--GSSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNGETARCWSVVKSS-----Q 302
Query: 375 NFSLIGMMAQQFYNVGYDI-GRKQKTFQ 401
++IG + Q ++V +D+ G + FQ
Sbjct: 303 ALTIIGNIMQMDFHVVFDLSGNARMLFQ 330
>gi|125532792|gb|EAY79357.1| hypothetical protein OsI_34486 [Oryza sativa Indica Group]
Length = 396
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 92/371 (24%), Positives = 154/371 (41%), Gaps = 41/371 (11%)
Query: 57 SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYP-CRDCSPQLGTIFYPSRSSSYAYVP 115
S + + VN++IG PP P +D G L+W C CR C Q +F + SS++ P
Sbjct: 47 SQAFYVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHCRRCFKQDLPLFDTNASSTFRPEP 106
Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
C + C P C+ + TS + ++ T + FG
Sbjct: 107 CGAAVCESIPTRSCAGDGGGACGYEA-----STSFGRTVGRIGTDAVAIGTAATARLAFG 161
Query: 176 CGFSTNRNFKFSGIFGLGIGRS--SLVSQLN-SSFSYCIGSLHDPDY-LHNKLILGDGAI 231
C ++ + + +G+GR+ SL +Q+N ++FSYC L PD + L LG A
Sbjct: 162 CAVASEMDTMWGSSGSVGLGRTNLSLAAQMNATAFSYC---LAPPDTGKSSALFLGASAK 218
Query: 232 I---DEGDAT---------PLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
+ +G T P + Y + LEAI + + SG +M+
Sbjct: 219 LAGAGKGAGTTPFVKTSTPPHSGLSRSYLLRLEAIRAGNATIAM-------PQSGNTIMV 271
Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFR 339
+ T VT LV Y LR V + + LC+ +S G P + F+
Sbjct: 272 STATPVTALVDSVYRDLRKAVADAVGAAPVPPPVQNYDLCFPKASASG--GAPDLVLAFQ 329
Query: 340 GGAKLALEKDSMFYQPRPDAFCMAV--NPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
GGA++ + S + D C+A+ +PA S++G + Q ++ +D+ ++
Sbjct: 330 GGAEMTVPVSSYLFDAGNDTACVAILGSPA-----LGGVSILGSLQQVNIHLLFDLDKET 384
Query: 398 KTFQRMDCEVL 408
+F+ DC L
Sbjct: 385 LSFEPADCSAL 395
>gi|357456413|ref|XP_003598487.1| Peptidase A1, putative [Medicago truncatula]
gi|355487535|gb|AES68738.1| Peptidase A1, putative [Medicago truncatula]
Length = 414
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 96/359 (26%), Positives = 142/359 (39%), Gaps = 27/359 (7%)
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
I + + V IG PP AMDT + W+ C C C+ T+F P +S+++ V
Sbjct: 73 IQSPTYIVRAKIGTPPQTLLLAMDTSNDAAWIPCTACDGCA---STLFAPEKSTTFKNVS 129
Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
C + C+ P C C + Y +S+ + Q T +T V FG
Sbjct: 130 CAAPECKQVPNPGCGV--SSCNFNLTY---GSSSIAANLVQDTIT---LATDPVPSYTFG 181
Query: 176 C-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGA 230
C G S G S + S+FSYC+ S ++ L LG A
Sbjct: 182 CVSKTTGTSAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNF-SGSLRLGPVA 240
Query: 231 IIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVT 286
TPL YY+ LEAI V +++DI P +G G + DSGT T
Sbjct: 241 QPKRIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGAGTIFDSGTVFT 300
Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
LV Y A+RDE R+ + + CY+ + PT+ F F G
Sbjct: 301 RLVAPVYVAVRDEFRRRVGPKLTVTSLGGFDTCYNVPIV-----VPTITFIFTGMNVTLP 355
Query: 347 EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ + + + C+A+ A N V ++I M QQ + V YD+ + R C
Sbjct: 356 QDNILIHSTAGSTTCLAMAGAPDNVNSV-LNVIANMQQQNHRVLYDVPNSRVGVARELC 413
>gi|326495450|dbj|BAJ85821.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 491
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 104/357 (29%), Positives = 143/357 (40%), Gaps = 49/357 (13%)
Query: 74 QFTAMDTGSSLLWVHCYPC--RDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYF-PYAR-C 129
Q A+DT + W+ C PC C PQ F P RSS+ A V C S CR YA C
Sbjct: 159 QTMAIDTTEDVPWIQCLPCLIPQCYPQRNAFFDPRRSSTGAPVRCGSRACRTLGGYANGC 218
Query: 130 SAYKHR--CIYTQLYLIGPETSVFVSTEQLTFKN--TDESTIHVQDVV----FGCGFSTN 181
S C+Y Y S +LT TD TI FGC +
Sbjct: 219 SKPNSTGDCLYRIEY----------SDHRLTLGTYMTDTLTISPSTTFLNFRFGCSHAVR 268
Query: 182 RNF--KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLH--NKLILGDGAIID 233
F + SG LG G SL+SQ ++FSYC+ +L + DG
Sbjct: 269 GKFSAQASGTMSLGGGPQSLLSQTARAYGNAFSYCVPGPSAAGFLSIGGPVNGDDGGGSG 328
Query: 234 EGDATPL----QFIDGHYYIT-LEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
TPL I+ Y+ L+ I V GR L++ P +F GG ++DS +T L
Sbjct: 329 AFATTPLVRSANVINPTIYVVRLQGIEVAGRRLNVPPVVFS-----GGTVMDSSAVITQL 383
Query: 289 VKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEK 348
AY ALR + + R+ + C+ + S + PTV F GGA + L
Sbjct: 384 PPTAYRALRLAFRNAMRAYKTRAPTGNLDTCFDFVGVSKVT-VPTVSLVFDGGAVIELGL 442
Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
S+ C+A P + + IG + QQ + V YD+ F+ C
Sbjct: 443 LSVLLDS-----CLAFAPMAAD---FALGFIGNVQQQTHEVLYDVAGGAVGFRHGAC 491
>gi|449442281|ref|XP_004138910.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449506266|ref|XP_004162699.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 482
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 92/372 (24%), Positives = 157/372 (42%), Gaps = 34/372 (9%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ--LGTIFYPSRSSSYAY-- 113
+ L++ I +G P + +DTGS +LWV+C C +C + LG SS +
Sbjct: 71 SGLYFAKIGLGTPVQDYYVQVDTGSDILWVNCAGCTNCPKKSDLGIELSLYSPSSSSTSN 130
Query: 114 -VPCDSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD---EST 166
V C+ + C P C+ + C Y Y G T+ + + + ++T
Sbjct: 131 RVTCNQDFCTSTYDGPIPGCTP-ELLCEYRVAYGDGSSTAGYFVRDHVVLDRVTGNFQTT 189
Query: 167 IHVQDVVFGCGFSTNRNF-----KFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLH 215
+VFGCG + GI G G SS++SQL SS F++C+
Sbjct: 190 STNGSIVFGCGAQQSGQLGATSAALDGILGFGQANSSMISQLASSGKVKRVFAHCL---- 245
Query: 216 DPDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG 274
D ++ I G ++ + TPL HY + ++AI VD +L++ ++F D
Sbjct: 246 --DNINGGGIFAIGEVVQPKVRTTPLVPQQAHYNVFMKAIEVDNEVLNLPTDVFDTDLR- 302
Query: 275 GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTV 334
G +IDSGT + + YE L ++ R ++ + C+ + D GFPTV
Sbjct: 303 KGTIIDSGTTLAYFPDVIYEPLISKIFARQSTLKLHTVE-EQFTCFEYDGNVD-DGFPTV 360
Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR-YVNFSLIGMMAQQFYNVGYDI 393
FHF L + + + +C+ + +R + L+G + Q V YD+
Sbjct: 361 TFHFEDSLSLTVYPHEYLFDIDSNKWCVGWQNSGAQSRDGKDMILLGDLVLQNRLVMYDL 420
Query: 394 GRKQKTFQRMDC 405
+ + +C
Sbjct: 421 ENQTIGWTEYNC 432
>gi|357491933|ref|XP_003616254.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517589|gb|AES99212.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 442
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 92/370 (24%), Positives = 161/370 (43%), Gaps = 33/370 (8%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR---DCSPQLGTIFYPSRSSSYAYVPCD 117
V + IG PP Q +DTGS L W+ C+ + P + F PS SSS+ +PC+
Sbjct: 82 LVVTLPIGTPPQLQQMVLDTGSQLSWIQCHNKKTPQKKQPPTTSSFDPSLSSSFFVLPCN 141
Query: 118 SEHCR-----YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
C+ + C A C Y+ Y G + E++ F + + +
Sbjct: 142 HPLCKPRVPDFSLPTDCDA-NSLCHYSYFYADGTYAEGNLVREKIAFSPSQTTP----PI 196
Query: 173 VFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGA- 230
+ GC ++ GI G+ +GR SQ + FSYC+ + LG+
Sbjct: 197 ILGCATQSD---DARGILGMNLGRLGFPSQAKITKFSYCVPT-KQAQPASGSFYLGNNPA 252
Query: 231 --------IIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMID 280
++ G + + +D Y + L+ IS+ G+ L+I P++FK + G G MID
Sbjct: 253 SSSFRYVNLLTFGQSQRMPNLDPLAYTLPLQGISIGGKKLNIPPSVFKPNAGGSGQTMID 312
Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD--KLCYHGIMSSDLKGFPTVRFHF 338
SG++ T+LV EAY +R+E++ ++ + + Y + +C+ G + + F F
Sbjct: 313 SGSEFTYLVDEAYNVIREELVKKVGPKIKKGYMYGGVADICFDGDAIEIGRLVGDMVFEF 372
Query: 339 RGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQK 398
G ++ + K+ + C+ + + N +IG QQ V +D+ ++
Sbjct: 373 EKGVQIVIPKERVLATVDGGVHCLGMGRSERLGAGGN--IIGNFHQQNLWVEFDLANRRV 430
Query: 399 TFQRMDCEVL 408
F DC L
Sbjct: 431 GFGEADCSKL 440
>gi|357117301|ref|XP_003560410.1| PREDICTED: uncharacterized protein LOC100833752 [Brachypodium
distachyon]
Length = 473
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 84/314 (26%), Positives = 133/314 (42%), Gaps = 60/314 (19%)
Query: 78 MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPC-DSEHCR--YFPYARCSAYKH 134
MD + W+ C PC C PQL +F P++S ++ V ++ CR Y P
Sbjct: 120 MDMAAGFSWMQCAPCHPCLPQLNPVFDPAKSPTFRPVSGHNAVLCRPPYHPL-----QDG 174
Query: 135 RCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF------SG 188
RC + Y G + +++ + +F D + H+ +VFGC NR +F +G
Sbjct: 175 RCGFGIAYRNGASAAGYLARDTFSFPTGDNNFQHLPGIVFGC---ANRIARFDTHGALAG 231
Query: 189 IFGLGIGR-----SSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGA--IIDEGDA 237
+ G+G+G + + QL FSYC ++ G A + G+
Sbjct: 232 VLGMGMGAEGKPLTGFMRQLYHNGGGRFSYC------------PIVPGTTAYSFLRFGND 279
Query: 238 TPLQFIDG----------------HYYITLEAISVDG-RMLDINPNIFKRDDSG-GGVMI 279
P Q G YY+ L ISV R+ + P +F+RD G GG I
Sbjct: 280 IPSQPPAGVHRQSMAVLAPTTTSEAYYVKLAGISVGALRVPGVTPEMFERDQHGRGGCAI 339
Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD-KLCYHGIMSSDLKGFPTVRFHF 338
D GT +T +V+ AY + V L+ + R P LC H + + + P++ HF
Sbjct: 340 DIGTKMTAIVQTAYAHVEAAVRGHLQRNRARFVQSPGHHLCVHRTPAIEER-LPSMTLHF 398
Query: 339 RGGAKLALEKDSMF 352
GG L ++ +F
Sbjct: 399 VGGPWLRVKPQHLF 412
>gi|326524806|dbj|BAK04339.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 460
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 91/346 (26%), Positives = 146/346 (42%), Gaps = 37/346 (10%)
Query: 77 AMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRC 136
A+D ++LLW+ C P ++ QL F P++S S+ +P ++ C P + C
Sbjct: 102 ALDMTTNLLWMQCKPVQEPFTQLPPPFEPAKSPSFRRLPGNNAFCLPAPRGHRRTVQDPC 161
Query: 137 IYTQLYLIG-PETSVFVSTEQLTFKNTDESTIHVQDVVFGC-----GFSTNRNFKFSGIF 190
+ + L G + +S E L F + + V VV GC GF+ N + +G+
Sbjct: 162 KFHSIRLDGSADARGVLSNETLAFAASGQQQTEVTGVVIGCTHNSKGFNFNSHGVLAGVL 221
Query: 191 GLGIGRSSLVSQLNS---------SFSYCIGSLHDPDYLHNKLILGDGAIIDEGD--ATP 239
GLG SL+ L FSYC+ S H+ + D + + +T
Sbjct: 222 GLGRQAPSLIWTLGQHRHGTVQVHRFSYCLPSHGSSSSDHHTFLRFDDDVPNTQHMVSTK 281
Query: 240 LQFIDG-------HYYITLEAISVDGRMLDINPNIFKRDDSG----GGVMIDSGTDVTWL 288
+ ++D Y+++L ISV G+ L +FKR G G D+GT +
Sbjct: 282 IMYMDSTTSRDFRAYFVSLTGISVAGKPLQDVKELFKRHVHGQVWTSGCAFDAGTPTMVM 341
Query: 289 VKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHF-RGGAKLALE 347
+ AY L+D V+ L+ ++ S LC+ S + PTV F A+L L
Sbjct: 342 IMPAYNKLKDAVVRHLKPLGLQIVSGQYHLCFRAT-SQLWQHLPTVMLQFAETEARLVLP 400
Query: 348 KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
+F D C+AV R + ++IG M Q YD+
Sbjct: 401 PQRLFVAVGYD-ICLAV------VRSYDITIIGAMQQVDKRFVYDV 439
>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 457
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 104/374 (27%), Positives = 164/374 (43%), Gaps = 39/374 (10%)
Query: 55 DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSYAY 113
I + +YV I +G P +DTGSSL W+ C PC C Q+ IF PS S +Y
Sbjct: 101 SIGSGNYYVKIGVGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSVSKTYKA 160
Query: 114 VPCDSEHCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIH 168
+ C S C + CS C+Y Y + ++S + LT S
Sbjct: 161 LSCSSSQCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYLSQDVLTL---TPSAAP 217
Query: 169 VQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLN----SSFSYCIGSLHDPDYLHNK 223
V+GCG F + +GI GL + S++ QL+ ++FSYC+ S N
Sbjct: 218 SSGFVYGCGQDNQGLFGRSAGIIGLANDKLSMLGQLSNKYGNAFSYCLPSSFSAQ--PNS 275
Query: 224 LILGDGAI-IDEGDATPLQF--------IDGHYYITLEAISVDGRMLDINPNIFKRDDSG 274
+ G +I ++P +F I Y++ L I+V G+ L ++ + +
Sbjct: 276 SVSGFLSIGASSLSSSPYKFTPLVKNPKIPSLYFLGLTTITVAGKPLGVSASSYNVP--- 332
Query: 275 GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGE--QMRSYSWPDKLCYHGIMSSDLKGFP 332
+IDSGT +T L Y AL+ ++ + + Q +S D C+ G + ++ P
Sbjct: 333 --TIIDSGTVITRLPVAIYNALKKSFVMIMSKKYAQAPGFSILDT-CFKGSV-KEMSTVP 388
Query: 333 TVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
+R FRGGA L L+ + + C+A+ AS N S+IG QQ + V YD
Sbjct: 389 EIRIIFRGGAGLELKVHNSLVEIEKGTTCLAI-AASSN----PISIIGNYQQQTFTVAYD 443
Query: 393 IGRKQKTFQRMDCE 406
+ + F C+
Sbjct: 444 VANSKIGFAPGGCQ 457
>gi|449527515|ref|XP_004170756.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Cucumis
sativus]
Length = 364
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 102/361 (28%), Positives = 140/361 (38%), Gaps = 28/361 (7%)
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
I + F V IG P A+DT + W+ C C C T+F +SSS+ +P
Sbjct: 21 IQSPTFVVRAKIGTPAQTLLLALDTSNDAAWIPCSGCIGCPST--TVFSSDKSSSFRPLP 78
Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
C S C P CS C + Y G T L N +T V FG
Sbjct: 79 CQSPQCNQVPNPSCSG--SACGFNLTY--GSSTVA----ADLVQDNLTLATDSVPSYTFG 130
Query: 176 C-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGA 230
C G S G S S+FSYC+ S ++ L LG A
Sbjct: 131 CIRKATGSSVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPSFKSVNF-SGSLRLGPVA 189
Query: 231 IIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVT 286
TPL YY+ L +I V +++DI P+ +G G +IDSGT T
Sbjct: 190 QPIRIKYTPLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNSATGAGTVIDSGTTFT 249
Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
LV AY A+RDE R+ S CY + S PT+ F F G + L
Sbjct: 250 RLVAPAYTAVRDEFRRRVGRNVTVSSLGGFDTCYTVPIIS-----PTITFMF-AGMNVTL 303
Query: 347 EKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
D+ + + C+A+ A N V ++I M QQ + + +DI + R C
Sbjct: 304 PPDNFLIHSTSGSTTCLAMAAAPDNVNSV-LNVIASMQQQNHRILFDIPNSRVGVARESC 362
Query: 406 E 406
Sbjct: 363 S 363
>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 450
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 94/365 (25%), Positives = 152/365 (41%), Gaps = 41/365 (11%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DCSPQLGTIFYPSRSSSYAYVPCDSE 119
+ + +G P +D+GSSL W+ C PC C PQ G ++ P SS+YA VPC +
Sbjct: 108 YITRLGLGTPTTTYVMVVDSGSSLTWLQCAPCAVSCHPQAGPLYDPRASSTYAAVPCSAP 167
Query: 120 HCRYFPYA-----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
C A CS C Y Y G + ++S + ++ ++ +
Sbjct: 168 QCAELQAATLNPSSCSG-SGVCQYQASYGDGSFSFGYLSKDTVSLSSSGS----FPGFYY 222
Query: 175 GCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDG 229
GCG F + +G+ GL + SL+SQL +SF+YC+ + + L G
Sbjct: 223 GCGQDNVGLFGRAAGLIGLARNKLSLLSQLAPSVGNSFAYCLPT----SAAASAGYLSFG 278
Query: 230 AIIDEGDATPLQF-------IDGH-YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDS 281
+ D + + +D Y+++L +SV G L + + + + +IDS
Sbjct: 279 SNSDNKNPGKYSYTSMVSSSLDASLYFVSLAGMSVAGSPLAVPSSEYGSLPT----IIDS 334
Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
GT +T L Y AL V L +YS + C+ G ++ P V F GG
Sbjct: 335 GTVITRLPTPVYTALSKAVGAALAAPSAPAYSI-LQTCFKGQVAK--LPVPAVNMAFAGG 391
Query: 342 AKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQ 401
A L L ++ C+A P + ++IG QQ ++V YD+ + F
Sbjct: 392 ATLRLTPGNVLVDVNETTTCLAFAPTD------STAIIGNTQQQTFSVVYDVKGSRIGFA 445
Query: 402 RMDCE 406
C
Sbjct: 446 AGGCS 450
>gi|21717154|gb|AAM76347.1|AC074196_5 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433293|gb|AAP54831.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125532791|gb|EAY79356.1| hypothetical protein OsI_34485 [Oryza sativa Indica Group]
Length = 397
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 95/376 (25%), Positives = 152/376 (40%), Gaps = 54/376 (14%)
Query: 63 VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR 122
N +IG PP P +D L+W C C C Q +F P+ SS++ PC ++ C+
Sbjct: 45 ANFTIGTPPQPASAIIDVAGELVWTQCSRCSRCFKQDLPLFIPNASSTFRPEPCGTDACK 104
Query: 123 YFPYARCSAYKHRCIY-----------TQLYLIGPET-SVFVSTEQLTFKNTDESTIHVQ 170
P + CS C Y T L ++G ET ++ +T L F S I
Sbjct: 105 STPTSNCSG--DVCTYESTTNIRLDRHTTLGIVGTETFAIGTATASLAFGCVVASDIDTM 162
Query: 171 DVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDG 229
D SG GLG SLV+Q+ + FSYC+ ++L LG
Sbjct: 163 DGT-------------SGFIGLGRTPRSLVAQMKLTKFSYCLSPRGTGK--SSRLFLGSS 207
Query: 230 AIIDEGDATPLQ-FI------DGHYY--ITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
A + G++T FI D H+Y ++L+AI + SGG +++
Sbjct: 208 AKLAGGESTSTAPFIKTSPDDDSHHYYLLSLDAIRAGNTTIATA-------QSGGILVMH 260
Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK---LCYHGIMSSDLKGFPTVRFH 337
+ + + LV AY A + V + G + P + LC+ P + F
Sbjct: 261 TVSPFSLLVDSAYRAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFT 320
Query: 338 FRGGAKLALEKDSMFY---QPRPDAFCMAVNPASINNR--YVNFSLIGMMAQQFYNVGYD 392
F+GG + + D C A+ + NR S++G + Q+ + YD
Sbjct: 321 FQGGGAALTVPPAKYLIDVGEEKDTACAAILSMARLNRTGLEGVSVLGSLQQENVHFLYD 380
Query: 393 IGRKQKTFQRMDCEVL 408
+ ++ +F+ DC L
Sbjct: 381 LKKETLSFEPADCSSL 396
>gi|449449334|ref|XP_004142420.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 441
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 102/361 (28%), Positives = 141/361 (39%), Gaps = 28/361 (7%)
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
I + F V IG P A+DT + W+ C C C T+F +SSS+ +P
Sbjct: 98 IQSPTFVVRAKIGTPAQTLLLALDTSNDAAWIPCSGCIGCPST--TVFSSDKSSSFRPLP 155
Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
C S C P CS C + Y G T L N +T V FG
Sbjct: 156 CQSPQCNQVPNPSCSG--SACGFNLTY--GSSTVA----ADLVQDNLTLATDSVPSYTFG 207
Query: 176 C-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGA 230
C G S G S S+FSYC+ S ++ L LG A
Sbjct: 208 CIRKATGSSVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPSFKSVNF-SGSLRLGPVA 266
Query: 231 IIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDD-SGGGVMIDSGTDVT 286
TPL YY+ L +I V +++DI P+ + +G G +IDSGT T
Sbjct: 267 QPIRIKYTPLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNSATGAGTVIDSGTTFT 326
Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
LV AY A+RDE R+ S CY + S PT+ F F G + L
Sbjct: 327 RLVAPAYTAVRDEFRRRVGRNVTVSSLGGFDTCYTVPIIS-----PTITFMF-AGMNVTL 380
Query: 347 EKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
D+ + + C+A+ A N V ++I M QQ + + +DI + R C
Sbjct: 381 PPDNFLIHSTAGSTTCLAMAAAPDNVNSV-LNVIASMQQQNHRILFDIPNSRVGVARESC 439
Query: 406 E 406
Sbjct: 440 S 440
>gi|356537928|ref|XP_003537458.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 445
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 103/381 (27%), Positives = 160/381 (41%), Gaps = 42/381 (11%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
N V++++G PP +DTGS L W+HC ++ + ++F P SSSY +PC
Sbjct: 67 NVTLTVSLTVGTPPQSVTMVLDTGSELSWLHCKKQQN----INSVFNPHLSSSYTPIPCM 122
Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ-DVVFG- 175
S C+ R C L + + F S E +T + Q ++FG
Sbjct: 123 SPICKT--RTRDFLIPVSCDSNNLCHVTVSYADFTSLEGNLASDTFAISGSGQPGIIFGS 180
Query: 176 --CGFSTNRN--FKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGA 230
GFS+N N K +G+ G+ G S V+Q+ FSYCI S D + L+ GD
Sbjct: 181 MDSGFSSNANEDSKTTGLMGMNRGSLSFVTQMGFPKFSYCI-SGKDASGV---LLFGDAT 236
Query: 231 IIDEGDA---------TPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMI 279
G TPL + D Y + L I V + L + IF D +G G M+
Sbjct: 237 FKWLGPLKYTPLVKMNTPLPYFDRVAYTVRLMGIRVGSKPLQVPKEIFAPDHTGAGQTMV 296
Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEG------EQMRSYSWPDKLCYHGIMSSDLKGFPT 333
DSGT T+L+ Y ALR+E + + G + + LC+ + P
Sbjct: 297 DSGTRFTFLLGSVYTALRNEFVAQTRGVLTLLEDPNFVFEGAMDLCFRVRRGGVVPAVPA 356
Query: 334 VRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMA-------QQF 386
V F GA++++ + + Y+ D N + N L+G+ A QQ
Sbjct: 357 VTMVFE-GAEMSVSGERLLYRVGGDGDVAKGNGDVYCLTFGNSDLLGIEAYVIGHHHQQN 415
Query: 387 YNVGYDIGRKQKTFQRMDCEV 407
+ +D+ + F CE+
Sbjct: 416 VWMEFDLVNSRVGFADTKCEL 436
>gi|326520109|dbj|BAK03979.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 104/392 (26%), Positives = 167/392 (42%), Gaps = 59/392 (15%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQ---ILPSNDIS 57
++HR+ +P ++ P R + R+ L ++ S +A ++ +N +
Sbjct: 64 ILHREHPCAP---ASKRPVRRSPSALQEYHTRVRRLANRLSSCPADEATASGLIFANGVP 120
Query: 58 NSLF-YVN-ISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
+ YV + +G P +DT SSL WV C PC + L F P+ SS+Y V
Sbjct: 121 WDYYSYVTQVQLGTPAKTHNVLVDTASSLSWVGCEPCINAC--LIPTFNPNASSTYKVVG 178
Query: 116 CDSEHCRYFPYA-----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
C S C P A C A C Y Q Y + VS++ LT+ + Q
Sbjct: 179 CGSALCNAVPSATMARKSCMAPTEGCSYRQSYHDYSLSVGVVSSDTLTYG------LGSQ 232
Query: 171 DVVFGC-GFSTNRNFKFSGIFGLGIGRSSLVSQLN-----SSFSYCI------GSLHDPD 218
+FGC ++SGI G+ + + SL SQ+ + SYC G L
Sbjct: 233 KFIFGCCNLFRGVGGRYSGILGMSVNKFSLFSQMTVGHRYRAMSYCFPHPRNQGFLQFGR 292
Query: 219 YLHNKLILGDGAIIDEGDATPLQFIDGH-YYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
Y +K +L TPL +IDG+ Y++ + + V+ LD+ SG
Sbjct: 293 YDEHKSLL---------RFTPL-YIDGNNYFVHVSNVMVETMSLDV-------QSSGNQT 335
Query: 278 M---IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHG---IMSSDLKGF 331
M D+GT T L + + +L D V +EG R + + C+ + DL
Sbjct: 336 MRCFFDTGTPYTMLPQSLFVSLSDTVGNLVEG-YYRVGASTGQTCFQADGNWIEGDLY-M 393
Query: 332 PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMA 363
PTV+ F+ GA++ L + + + P+ FC+A
Sbjct: 394 PTVKIEFQNGARITLNSEDLMFMEEPNVFCLA 425
>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 387
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 100/363 (27%), Positives = 156/363 (42%), Gaps = 37/363 (10%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYVPCDSE 119
+ V +++G P + A+DTGS + W C PC C Q T F P +SSSY V C S
Sbjct: 45 YLVKMALGTPKLSLSLALDTGSDITWTQCEPCVGSCYRQAQTKFDPRKSSSYKNVSCSSS 104
Query: 120 HCRYFPYARCS--AYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
CR + + CIY Y G + F +TE+LT +D + + +FGCG
Sbjct: 105 SCRIITDSGGARGCVSSTCIYKVQYGDGSYSVGFFATEKLTISPSDV----ISNFLFGCG 160
Query: 178 FSTNRNFKFSGIFGLGIGRSSLVS-----QLNSSFSYCIGSLHDPDYLHNKLILGDGAII 232
F ++ + N+ F+YC+ S H L LG G +
Sbjct: 161 QQNAGRFGRIAGLLGLGRGKLSLALQTSEKYNNLFTYCLPSFSSSSTGH--LTLG-GQVP 217
Query: 233 DEGDATPLQ--FIDGHYY-ITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLV 289
TPL F + +Y I ++ +SV G +L I+ ++F S G +IDSGT +T L
Sbjct: 218 KSVKFTPLSPAFKNTPFYGIDIKGLSVGGHVLPIDASVF----SNAGAIIDSGTVITRLQ 273
Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPDKL-----CYHGIMSSDLKGFPTVRFHFRGGAKL 344
Y AL + + M+ Y D CY ++ P + F F+GG ++
Sbjct: 274 PTVYSALSSKFQ-----QLMKDYPKTDGFSILDTCYD-FSGNESISVPRISFFFKGGVEV 327
Query: 345 ALEKDSMFYQPRP-DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
++ + D C+A P N+ +F + G QQ Y+V +D+ + + F
Sbjct: 328 DIKFFGILTVINAWDKVCLAFAP---NDDDGDFVVFGNSQQQTYDVVHDLAKGRIGFAPS 384
Query: 404 DCE 406
C
Sbjct: 385 GCN 387
>gi|88174565|gb|ABD39357.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
Length = 323
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 86/331 (25%), Positives = 142/331 (42%), Gaps = 39/331 (11%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ +++ +G P Q +DTGSS WV C C C T F SRS++ A V C +
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58
Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
C P+ + S C + Y G + + + LTF + + + FGC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPGFTFGCN 114
Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS---FSYCIGSLHDPDYLHNKLI----LG 227
F N G+ G+G G+ S++ Q + + FSYC+ +K LG
Sbjct: 115 MDSFGANEFGNVDGLLGMGAGQMSVLKQSSPTFDGFSYCLPLQMSERGFFSKTTGYFSLG 174
Query: 228 DGAIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSG 282
D + + +++ L AISVDG L ++P+IF R GV+ DSG
Sbjct: 175 GKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRK----GVVFDSG 230
Query: 283 TDVTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFR 339
++++++ A L E+++R + S ++ CY + S D P + HF
Sbjct: 231 SELSYIPDRALSVLSQRIRELLLRRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFD 285
Query: 340 GGAKLALEKDSMFYQ---PRPDAFCMAVNPA 367
GA+ L + +F + D +C+A P
Sbjct: 286 DGARFDLGRHGVFVERSVQEQDVWCLAFAPT 316
>gi|115466068|ref|NP_001056633.1| Os06g0119600 [Oryza sativa Japonica Group]
gi|55296446|dbj|BAD68569.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|55296924|dbj|BAD68375.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594673|dbj|BAF18547.1| Os06g0119600 [Oryza sativa Japonica Group]
gi|215694767|dbj|BAG89958.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737752|dbj|BAG96882.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 495
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 100/357 (28%), Positives = 147/357 (41%), Gaps = 42/357 (11%)
Query: 68 GQPPVPQFTAMDTGSSLLWVHCYPCR--DCSPQLGTIFYPSRSSSYAYVPCDSEHCRYF- 124
G V Q +D+GS + WV C PC C Q +F P+ S++YA VPC S C
Sbjct: 162 GTSAVTQTVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLG 221
Query: 125 PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS---TN 181
PY R + +C + Y G + S + LT D ++ FGC + +
Sbjct: 222 PYRRGCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYDV----IRGFRFGCAHADRGSA 277
Query: 182 RNFKFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILG----DGAIID 233
++ +G LG G SLV Q + FSYC+ L+LG +I
Sbjct: 278 FDYDVAGSLALGGGSQSLVQQTATRYGRVFSYCLPPTASS---LGFLVLGVPPERAQLIP 334
Query: 234 EGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVK 290
+TPL Y + L AI V GR L + P +F +IDS T ++ L
Sbjct: 335 SFVSTPLLSSSMAPTFYRVLLRAIIVAGRPLAVPPAVFSASS-----VIDSSTIISRLPP 389
Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKLCY--HGIMSSDLKGFPTVRFHFRGGAKLALEK 348
AY+ALR + + CY G+ S L P++ F GGA + L+
Sbjct: 390 TAYQALRAAFRSAMTMYRAAPPVSILDTCYDFTGVRSITL---PSIALVFDGGATVNLDA 446
Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ C+A P + ++R F IG + Q+ V YD+ K F+ C
Sbjct: 447 AGILL-----GSCLAFAPTA-SDRMPGF--IGNVQQKTLEVVYDVPAKAMRFRTAAC 495
>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 632
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 104/380 (27%), Positives = 161/380 (42%), Gaps = 55/380 (14%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
N + + IG PP +D+GS++ +V C C C F P SS+Y V C+
Sbjct: 90 NGYYTTRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQCGKHQDPKFQPEMSSTYQPVKCN 149
Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
+ C + +C+Y + Y + + + ++F N ES + Q VFGC
Sbjct: 150 MD-------CNCDDDREQCVYEREYAEHSSSKGVLGEDLISFGN--ESQLTPQRAVFGCE 200
Query: 178 FSTNRNF---KFSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLILGD 228
+ + GI GLG G SLV QL ++SF C G + +G
Sbjct: 201 TVETGDLYSQRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMD----------VGG 250
Query: 229 GAIIDEGDATP--LQFIDG------HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
G++I G P + F D +Y I L I V G+ L ++ +F D G ++D
Sbjct: 251 GSMILGGFDYPSDMVFTDSDPDRSPYYNIDLTGIRVAGKQLSLHSRVF---DGEHGAVLD 307
Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD----KLCYHGIMS---SDL-KGFP 332
SGT +L A+ A + VM E ++ PD C+ S S+L K FP
Sbjct: 308 SGTTYAYLPDAAFAAFEEAVM--REVSTLKQIDGPDPNFKDTCFQVAASNYVSELSKIFP 365
Query: 333 TVRFHFRGGAKLALEKDS-MFYQPRP-DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
+V F+ G L ++ MF + A+C+ V P N + +L+G + + V
Sbjct: 366 SVEMVFKSGQSWLLSPENYMFRHSKVHGAYCLGVFP----NGKDHTTLLGGIVVRNTLVV 421
Query: 391 YDIGRKQKTFQRMDCEVLDD 410
YD + F R +C L D
Sbjct: 422 YDRENSKVGFWRTNCSELSD 441
>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 94.7 bits (234), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 102/391 (26%), Positives = 157/391 (40%), Gaps = 37/391 (9%)
Query: 32 RLAYLQEKIRSHNTYQAQILPSNDISNSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHCY 90
RL YL + T I P + YV + +G P F +DT + WV C
Sbjct: 69 RLKYL-STLADQKTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSNDAAWVPCS 127
Query: 91 PCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAY-KHRCIYTQLYLIGPETS 149
C CS T F P+ S++ + C C C A C++ Q Y G ++S
Sbjct: 128 GCTGCS---STTFLPNASTTLGSLDCSGAQCSQVRGFSCPATGSSACLFNQSY--GGDSS 182
Query: 150 VFVSTEQLTFKNTDESTIHVQDVV----FGC-GFSTNRNFKFSGIFGLGIGRSSLVSQLN 204
LT ++ DV+ FGC + + G+ GLG G SL+SQ
Sbjct: 183 -------LTATLVQDAITLANDVIPGFTFGCINAVSGGSIPPQGLLGLGRGPISLISQAG 235
Query: 205 S----SFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH----YYITLEAISV 256
+ FSYC+ S Y L LG TPL + H YY+ L +SV
Sbjct: 236 AMYSGVFSYCLPSFKS-YYFSGSLKLGPVGQPKSIRTTPL-LRNPHRPSLYYVNLTGVSV 293
Query: 257 DGRMLDINPNIFKRD-DSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWP 315
+ I D ++G G +IDSGT +T V+ Y A+RDE ++ G + S
Sbjct: 294 GRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNGP-ISSLGAF 352
Query: 316 DKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVN 375
D C+ ++ P + HF G + ++S+ + C+++ A+ NN
Sbjct: 353 DT-CFAATNEAEA---PAITLHFEGLNLVLPMENSLIHSSSGSLACLSMA-AAPNNVNSV 407
Query: 376 FSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
++I + QQ + +D + R C
Sbjct: 408 LNVIANLQQQNLRIMFDTTNSRLGIARELCN 438
>gi|218184943|gb|EEC67370.1| hypothetical protein OsI_34481 [Oryza sativa Indica Group]
Length = 367
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 84/359 (23%), Positives = 158/359 (44%), Gaps = 33/359 (9%)
Query: 64 NISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRY 123
N +IG PP +D L+W C C C Q +F P+ SS++ PC ++ C+
Sbjct: 27 NFTIGTPPQAASAFIDLTGELVWTQCSQCIHCFKQDLPVFVPNASSTFKPEPCGTDVCKS 86
Query: 124 FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRN 183
P +C++ C + + +G T V+T+ +++ FGC +++ +
Sbjct: 87 IPTPKCAS--DVCAFDGVTGLGGHTVGIVATDTFAIGTAAPASLG-----FGCVVASDID 139
Query: 184 FKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDA--- 237
SG GLG SLV+Q+ + FSYC+ HD +++L LG A + G A
Sbjct: 140 TMGGPSGFIGLGRTPWSLVAQMKLTRFSYCLAP-HDTGK-NSRLFLGASAKLAGGGAWTP 197
Query: 238 ----TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSG-TDVTWLVKEA 292
+P + +Y I LE I + D + + ++ V++ + V+ LV
Sbjct: 198 FVKTSPNDGMSQYYPIELEEI----KAGDATITMPRGRNT---VLVQTAVVRVSLLVDSV 250
Query: 293 YEALRDEVMIRLEGEQMRS-YSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSM 351
Y+ + VM + + P ++C+ + + G P + F F+ GA L + +
Sbjct: 251 YQEFKKAVMASVGAAPTATPVGEPFEVCFP---KAGVSGAPDLVFTFQAGAALTVPPANY 307
Query: 352 FYQPRPDAFCMAVNPASINNRYV--NFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
+ D C++V ++ N +++G Q+ ++ +D+ + +F+ DC L
Sbjct: 308 LFDVGNDTVCLSVMSIALLNITALDGLNILGSFQQENVHLLFDLDKDMLSFEPADCSSL 366
>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 96/378 (25%), Positives = 153/378 (40%), Gaps = 31/378 (8%)
Query: 48 AQILPSNDISNSLFYVNISIGQP-PVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPS 106
A + +N NS + +++SIG P P +DTGS ++W C PC +C Q F +
Sbjct: 79 APVGRANTDVNSEYLIHLSIGAPRSQPVVLTLDTGSDVVWTQCEPCAECFTQPLPRFDTA 138
Query: 107 RSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKN-TDES 165
S++ V C C C + H C Y Y G + + TF +
Sbjct: 139 ASNTVRSVACSDPLCNAHSEHGC--FLHGCTYVSGYGDGSLSFGHFLRDSFTFDDGKGGG 196
Query: 166 TIHVQDVVFGCGFSTNRNF--KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHN 222
+ V D+ FGCG F +GI G G G SL SQL FSYC + + +
Sbjct: 197 KVTVPDIGFGCGMYNAGRFLQTETGIAGFGRGPLSLPSQLKVRQFSYCFTTRFEAK--SS 254
Query: 223 KLILGDGAIIDEGDATPL---QFI--------DGHYYITLEAISVDGRMLDINPNIFKRD 271
+ LG + P+ F+ + HY ++ + ++V L + P I +
Sbjct: 255 PVFLGGAGDLKAHATGPILSTPFVRSLPPGTDNSHYVLSFKGVTVGKTRLPV-PEI--KA 311
Query: 272 DSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF 331
D G IDSGTD+T + L+ I + + D +C+
Sbjct: 312 DGSGATFIDSGTDITTFPDAVFRQLK-SAFIAQAALPVNKTADEDDICFS-WDGKKTAAM 369
Query: 332 PTVRFHFRGGAKLALEKDSMFYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
P + FH GA L +++ + R C+AV+ + +R +LIG QQ ++
Sbjct: 370 PKLVFHLE-GADWDLPRENYVTEDRESGQVCVAVSTSGQMDR----TLIGNFQQQNTHIV 424
Query: 391 YDIGRKQKTFQRMDCEVL 408
YD+ + C+ L
Sbjct: 425 YDLAAGKLLLVPAQCDKL 442
>gi|88174581|gb|ABD39365.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 85/329 (25%), Positives = 140/329 (42%), Gaps = 37/329 (11%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ +++ +G P Q +DTGSS WV C C C T F SRS++ A V C +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58
Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
C P+ + S C + Y G + + + LTF + + + FGC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPSFTFGCN 114
Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLN---SSFSYCIGSLHDPDYLHNKLI--LGDG 229
F N G+ G+G G S++ Q + FSYC+ +K G
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDGFSYCLPLQKSERGFFSKTTGYFSLG 174
Query: 230 AIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
+ D + + +++ L AISVDG L ++P+IF R GV+ DSG++
Sbjct: 175 KVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK----GVVFDSGSE 230
Query: 285 VTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
++++ A L E+++R + S ++ CY + S D P + HF G
Sbjct: 231 LSYIPDRALSVLSQRIRELLLRRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFDDG 285
Query: 342 AKLALEKDSMFYQ---PRPDAFCMAVNPA 367
A+ L + +F + D +C+A P
Sbjct: 286 ARFDLGRRGVFVERSVQEQDVWCLAFAPT 314
>gi|88174569|gb|ABD39359.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
Length = 321
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 85/329 (25%), Positives = 140/329 (42%), Gaps = 37/329 (11%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ +++ +G P Q +DTGSS WV C C C T F SRS++ A V C +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTTWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58
Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
C P+ + S C + Y G + + + LTF + + + FGC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPSFTFGCN 114
Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS---FSYCIGSLHDPDYLHNKLI--LGDG 229
F N G+ G+G G S++ Q + + FSYC+ +K G
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSERGFFSKTTGYFSLG 174
Query: 230 AIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
+ D + + +++ L AISVDG L ++P+IF R GV+ DSG++
Sbjct: 175 KVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK----GVVFDSGSE 230
Query: 285 VTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
++++ A L E+++R + S ++ CY + S D P + HF G
Sbjct: 231 LSYIPDRALSVLSQRIRELLLRRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFDDG 285
Query: 342 AKLALEKDSMFYQ---PRPDAFCMAVNPA 367
A+ L +F + D +C+A P
Sbjct: 286 ARFDLGSRGVFVERSVQEQDVWCLAFAPT 314
>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 449
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 89/381 (23%), Positives = 161/381 (42%), Gaps = 33/381 (8%)
Query: 52 PSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC-YPC--RDCSPQLG------TI 102
P+ D + V +G P DTGS L W+ C Y C R+CS + +
Sbjct: 74 PAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRV 133
Query: 103 FYPSRSSSYAYVPCDSEHCR-----YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQL 157
F+ + SSS+ +PC ++ C+ F C C Y Y G F + E +
Sbjct: 134 FHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETV 193
Query: 158 TFKNTDESTIHVQDVVFGCGFS-TNRNFKFS-GIFGLGIGRSSL----VSQLNSSFSYCI 211
T + + + + +V+ GC S ++F+ + G+ GLG + S + FSYC+
Sbjct: 194 TVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCL 253
Query: 212 GSLHDPDYLHNKLILGDG----AIIDEGDATPLQF--IDGHYYITLEAISVDGRMLDINP 265
+ N L G A+++ T L ++ Y + + IS+ G ML I
Sbjct: 254 VDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPS 313
Query: 266 NIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIR-LEGEQMRSYSWPDKLCYHGIM 324
++ +GG ++ DSG+ +T+L + AY+ + + + L+ ++ P + C++
Sbjct: 314 EVWDVKGAGGTIL-DSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTG 372
Query: 325 SSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQ 384
+ P + FHF GA+ S C+ ++ + S++G + Q
Sbjct: 373 FEE-SLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGF----VSVAWPGTSVVGNIMQ 427
Query: 385 QFYNVGYDIGRKQKTFQRMDC 405
Q + +D+G K+ F C
Sbjct: 428 QNHLWEFDLGLKKLGFAPSSC 448
>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
Length = 378
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 91/383 (23%), Positives = 163/383 (42%), Gaps = 37/383 (9%)
Query: 52 PSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC-YPC--RDCSPQLG------TI 102
P+ D + V +G P DTGS L W+ C Y C R+CS + +
Sbjct: 3 PAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRV 62
Query: 103 FYPSRSSSYAYVPCDSEHCR-----YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQL 157
F+ + SSS+ +PC ++ C+ F C C Y Y G F + E +
Sbjct: 63 FHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETV 122
Query: 158 TFKNTDESTIHVQDVVFGCGFS-TNRNFKFS-GIFGLGIGRSSL----VSQLNSSFSYCI 211
T + + + + +V+ GC S ++F+ + G+ GLG + S + FSYC+
Sbjct: 123 TVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCL 182
Query: 212 GSLHDPDYLHNKLILGDG----AIIDEGDATPLQF--IDGHYYITLEAISVDGRMLDINP 265
+ N L G A+++ T L ++ Y + + IS+ G ML I
Sbjct: 183 VDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPS 242
Query: 266 NIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIR-LEGEQMRSYSWPDKLCYH--G 322
++ + GG ++DSG+ +T+L + AY+ + + + L+ ++ P + C++ G
Sbjct: 243 EVWDVKGA-GGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTG 301
Query: 323 IMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMM 382
S + P + FHF GA+ S C+ ++ + S++G +
Sbjct: 302 FEESLV---PRLVFHFADGAEFEPPVKSYVISAADGVRCLGF----VSVAWPGTSVVGNI 354
Query: 383 AQQFYNVGYDIGRKQKTFQRMDC 405
QQ + +D+G K+ F C
Sbjct: 355 MQQNHLWEFDLGLKKLGFAPSSC 377
>gi|255586856|ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223525945|gb|EEF28342.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 533
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 101/376 (26%), Positives = 163/376 (43%), Gaps = 57/376 (15%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----------TIFYPSRS 108
L Y N+SIG P + A+DTGS L W+ C C + G I+ P+ S
Sbjct: 112 LHYANVSIGTPSLSYLVALDTGSDLFWLPC-DCTNSGCVQGLQFPSGEQIDFNIYRPNAS 170
Query: 109 SSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIH 168
S+ +PC++ C +RC + + C Y YL +S V E L TD++
Sbjct: 171 STSQTIPCNNTLCSR--QSRCPSAQSTCPYQVQYLSNGTSSTGVLVEDLLHLTTDDAQSR 228
Query: 169 VQD--VVFGCGFSTNRNF----KFSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHD 216
D ++FGCG +F +G+FGLG+ S+ S L ++SFS C G
Sbjct: 229 ALDAKIIFGCGRVQTGSFLDGAAPNGLFGLGMTNISVPSTLAREGYTSNSFSMCFGR--- 285
Query: 217 PDYLHNKLILGDGAIIDEGDATPLQFIDGH--YYITLEAISVDGRMLDINPNIFKRDDSG 274
D + ++ GD +G+ TP H Y +++ I+V GR D+ +
Sbjct: 286 -DGI-GRISFGDTGSSGQGE-TPFNLRQLHPTYNVSITKINVGGRDADLEFS-------- 334
Query: 275 GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYS-WPDKLCYHGIMSSDLKGF-- 331
+ DSGT T+L AY + + I + ++ S S P + CY MSS+
Sbjct: 335 --AIFDSGTSFTYLNDPAYTLISESFNIGAKEKRYSSISDIPFEYCYE--MSSNQTNLEI 390
Query: 332 PTVRFHFRGGAKLALEKD--SMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNV 389
PTV +GG++ + + Q +C+A+ + + VN +IG Y +
Sbjct: 391 PTVNLVMQGGSQFNVTDPIVIVILQGGASIYCLAI----VKSGDVN--IIGQNFMTGYRI 444
Query: 390 GYDIGRKQKTFQRMDC 405
++ R ++ DC
Sbjct: 445 VFNRERNVLGWKASDC 460
>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 445
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 90/352 (25%), Positives = 140/352 (39%), Gaps = 27/352 (7%)
Query: 67 IGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPY 126
+G P A+D + WV C C C+ + F P++SS+Y VPC S C P
Sbjct: 108 LGTPAQTLLVAIDPSNDAAWVPCSACAGCAASSPS-FSPTQSSTYRTVPCGSPQCAQVPS 166
Query: 127 ARCSA-YKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC-GFSTNRNF 184
C A C + Y +V + L +N V FGC + +
Sbjct: 167 PSCPAGVGSSCGFNLTYAASTFQAVL-GQDSLALENN-----VVVSYTFGCLRVVSGNSV 220
Query: 185 KFSGIFGLGIGRSSLVSQLN----SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPL 240
G+ G G G S +SQ S FSYC+ + ++ L LG TPL
Sbjct: 221 PPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPNYRSSNF-SGTLKLGPIGQPKRIKTTPL 279
Query: 241 QFIDGH----YYITLEAISVDGRMLDINPNIFKRDD-SGGGVMIDSGTDVTWLVKEAYEA 295
+ + H YY+ + I V +++ + + + +G G +ID+GT T L Y A
Sbjct: 280 LY-NPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTMFTRLAAPVYAA 338
Query: 296 LRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL-EKDSMFYQ 354
+RD R+ D CY+ +S PTV F F G + L E++ M +
Sbjct: 339 VRDAFRGRVRTPVAPPLGGFDT-CYNVTVS-----VPTVTFMFAGAVAVTLPEENVMIHS 392
Query: 355 PRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
C+A+ + +++ M QQ V +D+ + F R C
Sbjct: 393 SSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRELCT 444
>gi|125579874|gb|EAZ21020.1| hypothetical protein OsJ_36669 [Oryza sativa Japonica Group]
Length = 382
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 84/247 (34%), Positives = 114/247 (46%), Gaps = 31/247 (12%)
Query: 182 RNFKFSGIFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNK-----LILGDGAIID-E 234
R+ SG+ GLG GR SLVSQ ++ FSYC+ Y HN L +G A +
Sbjct: 147 RSMAPSGLMGLGRGRLSLVSQTGATKFSYCL-----TPYFHNNGATGHLFVGASASLGGH 201
Query: 235 GDATPLQFIDG-----HYYITLEAISVDGRMLDINPNIFKRDDSG-----GGVMIDSGTD 284
GD QF+ G YY+ L ++V L I +F + GGV+IDSG+
Sbjct: 202 GDVMTTQFVKGPKGSPFYYLPLIGLTVGETRLPIPATVFDLREVAPGLFSGGVIIDSGSP 261
Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD--KLCYHGIMSSDL-KGFPTVRFHFRGG 341
T LV +AY+AL E+ RL G + D LC + D+ + P V FHFRGG
Sbjct: 262 FTSLVHDAYDALASELAARLNGSLVAPPPDADDGALC---VARRDVGRVVPAVVFHFRGG 318
Query: 342 AKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQ 401
A +A+ +S + P A + Y S+IG QQ V YD+ +FQ
Sbjct: 319 ADMAVPAESYW---APVDKAAACMAIASAGPYRRQSVIGNYQQQNMRVLYDLANGDFSFQ 375
Query: 402 RMDCEVL 408
DC L
Sbjct: 376 PADCSAL 382
>gi|88174577|gb|ABD39363.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 85/329 (25%), Positives = 140/329 (42%), Gaps = 37/329 (11%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ ++ +G P Q +DTGSS+ WV C C C T F SRS++ A V C +
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSISWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58
Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
C P+ + S C + Y G + + + LTF + + + FGC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPSFTFGCN 114
Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS---FSYCIGSLHDPDYLHNKLI--LGDG 229
F N G+ G+G G S++ Q + + FSYC+ +K G
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSERGFFSKTTGYFSLG 174
Query: 230 AIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
+ D + + +++ L AISVDG L ++P+IF R GV+ DSG++
Sbjct: 175 KVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK----GVVFDSGSE 230
Query: 285 VTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
++++ A L E+++R + S ++ CY + S D P + HF G
Sbjct: 231 LSYIPDRALSVLSQRIRELLLRRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFDDG 285
Query: 342 AKLALEKDSMFYQ---PRPDAFCMAVNPA 367
A+ L +F + D +C+A P
Sbjct: 286 ARFDLGSSGVFVERSVQEQDVWCLAFAPT 314
>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
Length = 426
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 90/351 (25%), Positives = 140/351 (39%), Gaps = 27/351 (7%)
Query: 67 IGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPY 126
+G P A+D + WV C C C+ + F P++SS+Y VPC S C P
Sbjct: 89 LGTPAQTLLVAIDPSNDAAWVPCSACAGCAASSPS-FSPTQSSTYRTVPCGSPQCAQVPS 147
Query: 127 ARCSA-YKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC-GFSTNRNF 184
C A C + Y +V + L +N V FGC + +
Sbjct: 148 PSCPAGVGSSCGFNLTYAASTFQAVL-GQDSLALENN-----VVVSYTFGCLRVVSGNSV 201
Query: 185 KFSGIFGLGIGRSSLVSQLN----SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPL 240
G+ G G G S +SQ S FSYC+ + ++ L LG TPL
Sbjct: 202 PPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPNYRSSNF-SGTLKLGPIGQPKRIKTTPL 260
Query: 241 QFIDGH----YYITLEAISVDGRMLDINPNIFKRDD-SGGGVMIDSGTDVTWLVKEAYEA 295
+ + H YY+ + I V +++ + + + +G G +ID+GT T L Y A
Sbjct: 261 LY-NPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTMFTRLAAPVYAA 319
Query: 296 LRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL-EKDSMFYQ 354
+RD R+ D CY+ +S PTV F F G + L E++ M +
Sbjct: 320 VRDAFRGRVRTPVAPPLGGFDT-CYNVTVS-----VPTVTFMFAGAVAVTLPEENVMIHS 373
Query: 355 PRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
C+A+ + +++ M QQ V +D+ + F R C
Sbjct: 374 SSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRELC 424
>gi|88174567|gb|ABD39358.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
Length = 323
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 86/331 (25%), Positives = 141/331 (42%), Gaps = 39/331 (11%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ +++ +G P Q +DTGSS WV C C C T F SRS++ A V C +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58
Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
C P+ + S C + Y G + + + LTF + + + FGC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPGFTFGCN 114
Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS---FSYCIGSLHDPDYLHNKLI----LG 227
F N G+ G+G G+ S++ Q + + FSYC+ +K LG
Sbjct: 115 MDSFGANEFGNVDGLLGMGAGQMSVLKQSSPTFDGFSYCLPLQMSERGFFSKTTGYFSLG 174
Query: 228 DGAIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSG 282
D + + +++ L AISVDG L ++P+IF R GV+ DSG
Sbjct: 175 GKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRK----GVVFDSG 230
Query: 283 TDVTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFR 339
++++++ A L E+++R + S ++ CY + S D P + HF
Sbjct: 231 SELSYIPDRALSVLSQRIRELLLRRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFD 285
Query: 340 GGAKLALEKDSMFYQ---PRPDAFCMAVNPA 367
GA+ L +F + D +C+A P
Sbjct: 286 DGARFDLGSHGVFVERSVQEQDVWCLAFAPT 316
>gi|326517745|dbj|BAK03791.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 556
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 97/378 (25%), Positives = 151/378 (39%), Gaps = 38/378 (10%)
Query: 50 ILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DCSPQL--GTIFYPS 106
++ + DI+N LF + I +G PPV A+DTG++L +V C PC C Q G IF PS
Sbjct: 195 LIQNGDINNFLFLMPIKLGTPPVWNLVAVDTGATLSFVQCEPCTLRCHKQTDAGEIFDPS 254
Query: 107 RSSSYAYVPCDSEHCRYFPYA------RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFK 160
+S S++ V C CR A C + C+Y+ + SV
Sbjct: 255 KSESFSRVGCSENKCRTVQRALHLQSKACMEKEDSCLYSMTFGGTSSYSVGKLVRDRLAI 314
Query: 161 NTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-----SSFSYCIGSLH 215
D +FGC T + +G+ G S Q+ +FSYC S
Sbjct: 315 GKYAKGYSFPDFLFGCSLDTEYHQYEAGLVGFADEPFSFFEQVAPLVNYKAFSYCFPSDR 374
Query: 216 DPDYLHNKLILGDGAIIDEGDATPLQFI--DGHYYITLEAISVDGRMLDINPNIFKRDDS 273
L +GD ++ TPL Y + L+ + V+G L P+
Sbjct: 375 RKT---GYLSIGDYTRVNS-TYTPLFLARQQSRYALKLDEVLVNGMALVTTPS------- 423
Query: 274 GGGVMIDSGTDVTWLVKEAYEALRDEVM--IRLEGEQMRSYSWPDKLCY---HGIMSSDL 328
+++DSG+ T L+ + + L + +R G Y D +C+ H SD
Sbjct: 424 --EMIVDSGSRWTILLSDTFTQLDAAITEAMRPLGYNRNYYRGSDYICFEDAHFQQFSDW 481
Query: 329 KGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCM-AVNPASINNRYVNFSLIGMMAQQFY 387
P V F G K+ L+ S F+ C + AS+ + L+G +
Sbjct: 482 AALPVVELKFDMGVKMVLQPQSSFHFNNDYGLCTYFMRDASLGS---GVQLLGNTMTRSV 538
Query: 388 NVGYDIGRKQKTFQRMDC 405
+ +DI Q F++ DC
Sbjct: 539 GITFDIQGGQFGFRKGDC 556
>gi|357116170|ref|XP_003559856.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 460
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 99/376 (26%), Positives = 157/376 (41%), Gaps = 41/376 (10%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ V S+G PP A+DT + WV C C C P F P+ S+++ VPC +
Sbjct: 94 YLVRASLGTPPQRLLLAVDTSNDAAWVPCAGCHGC-PTTAPSFNPASSATFRPVPCGAPP 152
Query: 121 CRYFPYARCSAY---KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
C P C++ K+ C ++ Y ++S+ + Q T + ++ FGC
Sbjct: 153 CSQAPNPSCTSLAKSKNSCGFSLSYG---DSSLDATLSQDNLAVTANGGV-IKGYTFGCL 208
Query: 178 FSTNRNFKFS-GIFGLGIGRSSLVSQLNS----SFSYCIGSLH-DPDYLHNKLILG-DGA 230
+N + + G+ GLG G V+Q +FSYC+ S + L LG G
Sbjct: 209 TKSNGSAAPAQGLLGLGRGPLGFVAQTKGIYEGTFSYCLPSYYRSAANFSGSLTLGRKGQ 268
Query: 231 IIDEGDATPLQFIDGH----YYITLEAISVDGRMLDINPNIFKRDD-SGGGVMIDSGTDV 285
E T H YY+ + + + + + I P+ D +G G ++DSGT
Sbjct: 269 PAPEKMKTTPLLASPHRPSLYYVAMTGVRIGKKSVPIPPSALAFDAATGAGTVLDSGTMF 328
Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----------PTV 334
L + AY A+RDEV R+ G R + S L GF P V
Sbjct: 329 ARLAQPAYAAVRDEVRRRVAGSLRRR-----GGGGASVSVSSLGGFDTCYNVSTVAWPAV 383
Query: 335 RFHFRGGAKLALEKDSMFYQP---RPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
F GG ++ L ++++ + MA +PA N +N +IG + QQ + V +
Sbjct: 384 TLVFGGGMEVRLPEENVVIRSTYGSTSCLAMAASPADGVNAALN--VIGSLQQQNHRVLF 441
Query: 392 DIGRKQKTFQRMDCEV 407
D+ + F R C
Sbjct: 442 DVPNARVGFARERCTA 457
>gi|242063796|ref|XP_002453187.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
gi|241933018|gb|EES06163.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
Length = 493
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 104/370 (28%), Positives = 155/370 (41%), Gaps = 41/370 (11%)
Query: 61 FYVNISIGQPP-VPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYVPCDS 118
+ + + +G PP Q +DTGS + WV C PC + C PQ+ +F PS SS+Y+ C S
Sbjct: 140 YVITVRLGSPPGKSQTMLIDTGSDISWVRCKPCWQQCRPQVDPLFDPSLSSTYSPFSCSS 199
Query: 119 EHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
C A + +C Y +Y G + + ++ +T+ V FG
Sbjct: 200 AACAQLFQEGNANGCSSSGQCQYIAMYGDGSVGTTGTYSSDTLALGSNSNTVVVSKFRFG 259
Query: 176 CGFS-TNRNFKFSGIFGLGIGRSSLVSQL-----NSSFSYCIGSLHDPDYLHNKLILGDG 229
C + T +G+ GLG G SLVSQ ++FSYC+ L LG
Sbjct: 260 CSHAETGITGLTAGLMGLGGGAQSLVSQTAGTFGTTAFSYCLPPTPSSSGF---LTLGAA 316
Query: 230 AIIDEG-DATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDV 285
G TP+ + Y + LEAI V GR L I +F G+++DSGT V
Sbjct: 317 GTSSAGFVKTPMLRSSQVPAFYGVRLEAIRVGGRQLSIPTTVFS-----AGMIMDSGTVV 371
Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS--DLKG-----FPTVRFHF 338
T L AY +L M+ Y G + + D+ G PTV F
Sbjct: 372 TRLPPTAYSSLSSAFK-----AGMKQYPPAPSSAGGGFLDTCFDMSGQSSVSMPTVALVF 426
Query: 339 R--GGAKLALEKDSMFYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGR 395
GGA + L+ + Q + FC+A S + + +IG + Q+ + V YD+
Sbjct: 427 SGAGGAVVNLDASGILLQMETSSIFCLAFVATSDDG---STGIIGNVQQRTFQVLYDVAG 483
Query: 396 KQKTFQRMDC 405
F+ C
Sbjct: 484 GAVGFKAGAC 493
>gi|88174556|gb|ABD39353.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
Length = 321
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 85/329 (25%), Positives = 141/329 (42%), Gaps = 37/329 (11%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ +++ +G P Q +DTGSS WV C C C T F SRS++ A V C +
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58
Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
C P+ + S C + Y G + + + LTF + + + FGC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPGFSFGCN 114
Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS---FSYCIGSLHDPDYLHNKLI--LGDG 229
F N G+ G+G G S++ Q + + FSYC+ +K G
Sbjct: 115 MDSFGANEFGNVDGLLGMGAGAMSVLKQSSPTFDCFSYCLPLQKSERGFFSKTTGYFSLG 174
Query: 230 AIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
+ D + + +++ L AISVDG L ++P+IF R GV+ DSG++
Sbjct: 175 KVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSRK----GVVFDSGSE 230
Query: 285 VTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
++++ A L E+++R + S ++ CY + S D P + HF G
Sbjct: 231 LSYIPDRALSVLSQRIRELLLRRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFDDG 285
Query: 342 AKLALEKDSMFYQ---PRPDAFCMAVNPA 367
A+ L + +F + D +C+A P
Sbjct: 286 ARFDLGRGGVFVERSVQEQDVWCLAFAPT 314
>gi|88174571|gb|ABD39360.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
Length = 321
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 85/329 (25%), Positives = 139/329 (42%), Gaps = 37/329 (11%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ +++ +G P Q +DTGSS WV C C C T F SRS++ A V C +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58
Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
C P+ + S C + Y G + + + LTF + + + FGC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPSFTFGCN 114
Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLN---SSFSYCIGSLHDPDYLHNKLI--LGDG 229
F N G+ G+G G S++ Q + FSYC+ +K G
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDGFSYCLPLQKSERGFFSKTTGYFSLG 174
Query: 230 AIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
+ D + + +++ L AISVDG L ++P+IF R GV+ DSG++
Sbjct: 175 KVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK----GVVFDSGSE 230
Query: 285 VTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
++++ A L E+++R + S ++ CY + S D P + HF G
Sbjct: 231 LSYIPDRALSVLSQRIRELLLRRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFDDG 285
Query: 342 AKLALEKDSMFYQ---PRPDAFCMAVNPA 367
A+ L +F + D +C+A P
Sbjct: 286 ARFDLGSKGVFVERSVQEQDVWCLAFAPT 314
>gi|413944032|gb|AFW76681.1| hypothetical protein ZEAMMB73_606599 [Zea mays]
Length = 315
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 89/330 (26%), Positives = 138/330 (41%), Gaps = 45/330 (13%)
Query: 108 SSSYAYVPCDSEHCRY---FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDE 164
SS++ V C CR + C+ +C Y Y T+ + + TF + +
Sbjct: 2 SSTFKAVACPDPICRPSSGVSVSACAMENFQCFYLCSYGDRSITAGHIFKDTFTFMSPNG 61
Query: 165 STIHVQDVVFGCG------FSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCI------ 211
+ V ++ FGCG F +N SGI G G G SL SQL FSYC+
Sbjct: 62 VPVAVSELAFGCGDYNTGLFVSNE----SGIAGFGRGPQSLPSQLKVGRFSYCLTLVTES 117
Query: 212 -------GSLHDPDYLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRML 261
G+ DPD L +TP+ I YY++LE I+V L
Sbjct: 118 KSSVVILGTPPDPDGLR-------AHTTGPFQSTPIIYNPLIPTFYYLSLEGITVGKTRL 170
Query: 262 DINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQM-RSYSWPDKLC 319
+ ++F + D GG +IDSGT +T L + +E L++E++ + + + D+LC
Sbjct: 171 PFDKSVFALKKDGSGGTVIDSGTSLTTLPEAVFELLQEELVAQFPLPRYDNTPEVGDRLC 230
Query: 320 YHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFY-QPRPDAFCMAVNPASINNRYVNFSL 378
+ P + H GA + L +D+ F +P C+ +N A L
Sbjct: 231 FRRPKGGKQVPVPKLILHL-AGADMDLPRDNYFVEEPDSGVMCLQINGA----EDTTMVL 285
Query: 379 IGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
IG QQ +V YD+ + F C+ L
Sbjct: 286 IGNFQQQNMHVVYDVENNKLLFAPAQCDKL 315
>gi|125532793|gb|EAY79358.1| hypothetical protein OsI_34487 [Oryza sativa Indica Group]
Length = 419
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 102/419 (24%), Positives = 175/419 (41%), Gaps = 52/419 (12%)
Query: 19 AHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAM 78
AH ++RG++ R L + + ++P + S + + N +IG PP +
Sbjct: 23 AHGLRRGLDRQGMRGRILADATAAPP--GGAVVPLH-WSGACYVANFTIGTPPQAVSGIV 79
Query: 79 DTGSSLLWVHCYPCRD--CSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRC 136
D L+W C CR C Q +F PS S++Y C S C+ P CS C
Sbjct: 80 DLSGELVWTQCAACRSSGCFKQELPVFDPSASNTYRAEQCGSPLCKSIPTRNCSG-DGEC 138
Query: 137 IYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF-----SGIFG 191
Y + G +T ST+ + N + + FGC +++ + SG G
Sbjct: 139 GYEAPSMFG-DTFGIASTDAIAIGNAE------GRLAFGCVVASDGSIDGAMDGPSGFVG 191
Query: 192 LGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGA-IIDEGDATPLQFI------ 243
LG SLV Q N ++FSYC+ + H P + L LG A + G + P +
Sbjct: 192 LGRTPWSLVGQSNVTAFSYCL-APHGPGK-KSALFLGASAKLAGAGKSNPPTPLLGQHAS 249
Query: 244 -------DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVM----IDSGTDVTWLVKEA 292
D +Y + LE I + D+ SGGG + +++ +++L A
Sbjct: 250 NTSDDGSDPYYTVQLEGI----KAGDV---AVAAASSGGGAITILQLETFRPLSYLPDAA 302
Query: 293 YEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMF 352
Y+AL V L M + P LC+ ++ + G P + F F+GGA L
Sbjct: 303 YQALEKVVTAALGSPSMANPPEPFDLCFQ---NAAVSGVPDLVFTFQGGATLTAPPSKYL 359
Query: 353 YQP--RPDAFCMAV-NPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
C+++ + +++ S++G + Q+ + +D+ ++ +F+ DC L
Sbjct: 360 LGDGNGNGTVCLSILSSTRLDSADDGVSILGSLLQENVHFLFDLEKETLSFEPADCSSL 418
>gi|388502484|gb|AFK39308.1| unknown [Medicago truncatula]
Length = 425
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 93/347 (26%), Positives = 137/347 (39%), Gaps = 27/347 (7%)
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
I + + V IG PP AMDT + W+ C C C+ T+F P +S+++ V
Sbjct: 88 IQSPTYIVRAKIGTPPQTLLLAMDTSNDAAWIPCTACDGCA---STLFAPEKSTTFKNVS 144
Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
C + C+ P C L +S+ + Q T +T V FG
Sbjct: 145 CAAPECKQVPNPGCGVSSR-----NFNLTYGSSSIAANLVQDTIT---LATDPVPSYTFG 196
Query: 176 C-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGA 230
C G S G S + S+FSYC+ S ++ L LG A
Sbjct: 197 CVSKTTGTSAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNF-SGSLRLGPVA 255
Query: 231 IIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVT 286
TPL YY+ LEAI V +++DI P +G G + DSGT T
Sbjct: 256 QPKRIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGAGTIFDSGTVFT 315
Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
LV Y A+RDE R+ + + CY+ + PT+ F F G
Sbjct: 316 RLVAPVYVAVRDEFRRRVGPKLTVTSLGGFDTCYNVPIV-----VPTITFIFTGMNVTLP 370
Query: 347 EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
+ + + + C+A+ A N V ++I M QQ + V YD+
Sbjct: 371 QDNILIHSTAGSTTCLAMAGAPDNVNSV-LNVIANMQQQNHRVLYDV 416
>gi|147801191|emb|CAN68822.1| hypothetical protein VITISV_007106 [Vitis vinifera]
Length = 443
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 63/193 (32%), Positives = 89/193 (46%), Gaps = 4/193 (2%)
Query: 67 IGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPY 126
+G P + DTGS L+W+ C PC C Q IF P+ S +Y V DS C
Sbjct: 63 LGVPSTLVYGIADTGSELIWLQCLPCTHCYNQTPPIFDPAESYTYETVSSDSPICNAVRR 122
Query: 127 ARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF 186
C C Y Y G T +ST+ F++ + + V + FGC T K
Sbjct: 123 ISCREGDKSCCYQHTYGDGTTTKGTLSTDVFAFEDPTRTIVEVGYLTFGCSHDTKARLKG 182
Query: 187 --SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFI 243
+G+ GL +SLVSQL FSYC+ + D +++ G A+I G L+
Sbjct: 183 HQAGVVGLNRHPNSLVSQLKVKKFSYCM-VIPDDHGSGSRMYFGSRAVILGGKTPLLKGD 241
Query: 244 DGHYYITLEAISV 256
HY++TL+ ISV
Sbjct: 242 YSHYFVTLKGISV 254
Score = 47.0 bits (110), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 37/110 (33%), Positives = 50/110 (45%), Gaps = 3/110 (2%)
Query: 95 CSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIG-PETSVFVS 153
C Q IF PS+SS+Y+ VP D+ C C + C Y Y G T +S
Sbjct: 334 CFNQTPPIFDPSKSSTYSTVPWDAPTCYQAGGYACHIDEEDCCYRISYGSGSTSTEGTIS 393
Query: 154 TEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF--SGIFGLGIGRSSLVS 201
+ F++ ++ + V +VFGC T FK GI GL SLVS
Sbjct: 394 IDAFAFEDNRQNMVDVXHLVFGCSDYTTGTFKGYEVGIVGLNQDSLSLVS 443
>gi|195626958|gb|ACG35309.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 107/390 (27%), Positives = 163/390 (41%), Gaps = 33/390 (8%)
Query: 31 ARLAYLQE-KIRSHNTYQAQILPSNDISNSLFYV-NISIGQPPVPQFTAMDTGSSLLWVH 88
+RL YL +R A I + +L YV S+G PP A+DT + W+
Sbjct: 80 SRLLYLDSLAVRGRARAYAPIASGRQLLQTLTYVVRASLGTPPQQLLLAVDTSNDASWIP 139
Query: 89 CYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPET 148
C C C F P+ S+SY VPC S C P A C C ++ Y ++
Sbjct: 140 CAGCAGCPTSSAAPFDPAASASYRTVPCGSPLCAQAPNAACPPGGKACGFSLTYA---DS 196
Query: 149 SVFVSTEQLTFKNTDESTIHVQDVVFGC-GFSTNRNFKFSGIFGLGIGRSSLVSQLN--- 204
S+ + Q + + V+ FGC +T G+ GLG G S +SQ
Sbjct: 197 SLQAALSQDSLAVAGNA---VKAYTFGCLQRATGTAAPPQGLLGLGRGPLSFLSQTKDMY 253
Query: 205 -SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH----YYITLEAISVDGR 259
++FSYC+ S ++ L LG TPL + H YY+ + + V +
Sbjct: 254 EATFSYCLPSFKSLNF-SGTLRLGRNGQPQRIKTTPL-LANPHRSSLYYVNMTGVRVGRK 311
Query: 260 MLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLC 319
++ I F +G G ++DSGT T LV AY A+RDEV R+ G + S D C
Sbjct: 312 VVPI--PAFD-PATGAGTVLDSGTMFTRLVAPAYVAVRDEVRRRV-GAPVSSLGGFDT-C 366
Query: 320 YHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFC--MAVNPASINNRYVNFS 377
++ + +P + F G E++ + + C MA P +N +
Sbjct: 367 FN----TTAVAWPPMTLLFDGMQVTLPEENVVIHSTYGTISCLAMAAAPDGVNTV---LN 419
Query: 378 LIGMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
+I M QQ + V +D+ + F R C
Sbjct: 420 VIASMQQQNHRVLFDVPNGRVGFARERCTA 449
>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 100/379 (26%), Positives = 157/379 (41%), Gaps = 43/379 (11%)
Query: 43 HNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DCSPQLGT 101
T A I+P+ + V + +G P + DTGS L W C PC C PQ
Sbjct: 126 QTTIPASIVPTGGA----YVVTVGLGTPKKDFTLSFDTGSDLTWTQCEPCLGGCFPQNQP 181
Query: 102 IFYPSRSSSYAYVPCDSEHCRY-----FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQ 156
F P+ S+SY V C SE C+ +P C + C+Y Y G T F++TE
Sbjct: 182 KFDPTTSTSYKNVSCSSEFCKLIAEGNYPAQDC--ISNTCLYGIQYGSG-YTIGFLATET 238
Query: 157 LTFKNTDESTIHVQDVVFGCGFSTNRNFK-FSGIFGLGIGRSSLVSQLNSS----FSYCI 211
L ++D ++ +FGC + F +G+ GLG +L SQ + FSYC+
Sbjct: 239 LAIASSD----VFKNFLFGCSEESRGTFNGTTGLLGLGRSPIALPSQTTNKYKNLFSYCL 294
Query: 212 -GSLHDPDYLHNKLILGDGAIIDEGDATPLQ-FIDGHYYITLEAISVDGRMLDINPNIFK 269
S +L + + A +TP+ + Y + ISV GR L IN +I +
Sbjct: 295 PASPSSTGHLSFGVEVSQAA-----KSTPISPKLKQLYGLNTVGISVRGRELPINGSISR 349
Query: 270 RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCY--HGIMSSD 327
+IDSGT T+L Y AL + + + + + CY I +
Sbjct: 350 -------TIIDSGTTFTFLPSPTYSALGSAFREMMANYTLTNGTSSFQPCYDFSNIGNGT 402
Query: 328 LKGFPTVRFHFRGGAKLALEKDSMFYQPRP-DAFCMAVNPASINNRYVNFSLIGMMAQQF 386
L P + F GG ++ ++ + C+A ++ +F++ G Q+
Sbjct: 403 LT-IPGISIFFEGGVEVEIDVSGIMIPVNGLKEVCLAFADTGSDS---DFAIFGNYQQKT 458
Query: 387 YNVGYDIGRKQKTFQRMDC 405
Y V YD+ + F C
Sbjct: 459 YEVIYDVAKGMVGFAPKGC 477
>gi|88174589|gb|ABD39369.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 85/329 (25%), Positives = 140/329 (42%), Gaps = 37/329 (11%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ +++ +G P Q +DTGSS WV C C C T F SRS++ A V C +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSASWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58
Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
C P+ + S C + Y G + + + LTF + + + FGC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPSFTFGCN 114
Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS---FSYCIGSLHDPDYLHNKLI--LGDG 229
F N G+ G+G G S++ Q + + FSYC+ +K G
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSERGFFSKTTGYFSLG 174
Query: 230 AIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
+ D + + +++ L AISVDG L ++P+IF R GV+ DSG++
Sbjct: 175 KVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK----GVVFDSGSE 230
Query: 285 VTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
++++ A L E+++R + S ++ CY + S D P + HF G
Sbjct: 231 LSYIPDRALSVLSQRIRELLLRRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFDDG 285
Query: 342 AKLALEKDSMFYQ---PRPDAFCMAVNPA 367
A+ L +F + D +C+A P
Sbjct: 286 ARFDLGSHGVFVERSVQEQDVWCLAFAPT 314
>gi|88174558|gb|ABD39354.1| chloroplast nucleoid DNA-binding protein [Oryza meridionalis]
Length = 321
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 84/329 (25%), Positives = 141/329 (42%), Gaps = 37/329 (11%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ +++ +G P Q +DTGSS WV C C C T F SRS++ A V C +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58
Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
C P+ + S C + Y G + + + LTF + + + FGC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPGFTFGCN 114
Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS---FSYCIGSLHDPDYLHNKLI--LGDG 229
F N G+ G+G G S++ Q + + FSYC+ +K G
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQMSERGFFSKTTGYFSLG 174
Query: 230 AIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
+ D + + +++ L AISVDG L ++P++F R GV+ DSG++
Sbjct: 175 KVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRK----GVVFDSGSE 230
Query: 285 VTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
++++ A LR E++++ + S ++ CY + S D P + HF G
Sbjct: 231 LSYIPDRALSVLRQRIRELLLKRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFDDG 285
Query: 342 AKLALEKDSMFYQ---PRPDAFCMAVNPA 367
A+ L +F + D +C+A P
Sbjct: 286 ARFDLGSHGVFVERSVQEQDVWCLAFAPT 314
>gi|88174575|gb|ABD39362.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 85/329 (25%), Positives = 140/329 (42%), Gaps = 37/329 (11%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ ++ +G P Q +DTGSS WV C C C T F SRS++ A V C +
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58
Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
C P+ + S C + Y G + + + LTF + + + FGC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPSFTFGCN 114
Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLNSSF---SYCIGSLHDPDYLHNKLI--LGDG 229
F N G+ G+G G S++ Q + +F SYC+ +K G
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSERGFFSKTTGYFSLG 174
Query: 230 AIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
+ D + + +++ L AISVDG L ++P+IF R GV+ DSG++
Sbjct: 175 KVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK----GVVFDSGSE 230
Query: 285 VTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
++++ A L E+++R + S ++ CY + S D P + HF G
Sbjct: 231 LSYIPDRALSVLSQRIRELLLRRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFDDG 285
Query: 342 AKLALEKDSMFYQ---PRPDAFCMAVNPA 367
A+ L + +F + D +C+A P
Sbjct: 286 ARFDLGRHGVFVERSVQEQDVWCLAFAPT 314
>gi|88174597|gb|ABD39373.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174601|gb|ABD39375.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174603|gb|ABD39376.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 85/329 (25%), Positives = 140/329 (42%), Gaps = 37/329 (11%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ +++ +G P Q +DTGSS WV C C C T F SRS++ A V C +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58
Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
C P+ + S C + Y G + + + LTF + + + FGC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPSFTFGCN 114
Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS---FSYCIGSLHDPDYLHNKLI--LGDG 229
F N G+ G+G G S++ Q + + FSYC+ +K G
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSERGFFSKTTGYFSLG 174
Query: 230 AIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
+ D + + +++ L AISVDG L ++P+IF R GV+ DSG++
Sbjct: 175 KVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK----GVVFDSGSE 230
Query: 285 VTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
++++ A L E+++R + S ++ CY + S D P + HF G
Sbjct: 231 LSYIPDRALSVLSQRIRELLLRRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFDDG 285
Query: 342 AKLALEKDSMFYQ---PRPDAFCMAVNPA 367
A+ L +F + D +C+A P
Sbjct: 286 ARFDLGSHGVFVERSVQEQDVWCLAFAPT 314
>gi|88174587|gb|ABD39368.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 85/329 (25%), Positives = 140/329 (42%), Gaps = 37/329 (11%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ +++ +G P Q +DTGSS WV C C C T F SRS++ A V C +
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSASWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58
Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
C P+ + S C + Y G + + + LTF + + + FGC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPSFTFGCN 114
Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS---FSYCIGSLHDPDYLHNKLI--LGDG 229
F N G+ G+G G S++ Q + + FSYC+ +K G
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSERGFFSKTTGYFSLG 174
Query: 230 AIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
+ D + + +++ L AISVDG L ++P+IF R GV+ DSG++
Sbjct: 175 KVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK----GVVFDSGSE 230
Query: 285 VTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
++++ A L E+++R + S ++ CY + S D P + HF G
Sbjct: 231 LSYIPDRALSVLSQRIRELLLRRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFDDG 285
Query: 342 AKLALEKDSMFYQ---PRPDAFCMAVNPA 367
A+ L +F + D +C+A P
Sbjct: 286 ARFDLGSHGVFVERSVQEQDVWCLAFAPT 314
>gi|88174605|gb|ABD39377.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 85/329 (25%), Positives = 139/329 (42%), Gaps = 37/329 (11%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ +++ +G P Q +DTGSS WV C C C T F SRS++ A V C +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58
Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
C P+ + S C + Y G + + + LTF + + + FGC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPSFTFGCN 114
Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLN---SSFSYCIGSLHDPDYLHNKLI--LGDG 229
F N G+ G+G G S++ Q + FSYC+ +K G
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDGFSYCLPLQKSERGFFSKTTGYFSLG 174
Query: 230 AIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
+ D + + +++ L AISVDG L ++P+IF R GV+ DSG++
Sbjct: 175 KVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK----GVVFDSGSE 230
Query: 285 VTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
++++ A L E+++R + S ++ CY + S D P + HF G
Sbjct: 231 LSYIPDRALSVLSQRIRELLLRRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFDDG 285
Query: 342 AKLALEKDSMFYQ---PRPDAFCMAVNPA 367
A+ L +F + D +C+A P
Sbjct: 286 ARFDLGSHGVFVERSVQEQDVWCLAFAPT 314
>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 491
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 99/356 (27%), Positives = 144/356 (40%), Gaps = 40/356 (11%)
Query: 72 VPQFTAMDTGSSLLWVHCYPCRD--CSPQLGTIFYPSRSSSYAYVPCDSEHCRYF-PYAR 128
V Q +DT S + WV C PC C Q ++ PS+SSS A PC S CR PYA
Sbjct: 154 VAQTMVIDTASDVPWVQCAPCPAPHCHAQTDVLYDPSKSSSSAAFPCSSPACRNLGPYAN 213
Query: 129 -CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF- 186
C+ +C Y Y G ++ ++ LT N + + + FGC + + F
Sbjct: 214 GCTPAGDQCQYRVQYPDGSASAGTYISDVLTL-NPAKPASAISEFRFGCSHALLQPGSFS 272
Query: 187 ---SGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNK-LILGDGAIIDEGDA- 237
SGI LG G SL +Q ++ FSYC+ P +H+ ILG + A
Sbjct: 273 NKTSGIMALGRGAQSLPTQTKATYGDVFSYCL----PPTPVHSGFFILGVPRVAASRYAV 328
Query: 238 TPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYE 294
TP+ + Y + L AI V G+ L + P +F G ++DS T VT L AY
Sbjct: 329 TPMLRSKAAPMLYLVRLIAIEVAGKRLPVPPAVF-----AAGAVMDSRTIVTRLPPTAYM 383
Query: 295 ALRDEVMIRLEGEQMRSYSWPDKLCY----HGIMSSDLKGFPTVRFHFRG-GAKLALEKD 349
ALR + + + + CY P + F G + L+
Sbjct: 384 ALRAAFVAEMRAYRAAAPKEHLDTCYDFSGAAPGGGGGVKLPKITLVFDGPNGAVELDPS 443
Query: 350 SMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ C+A P N +IG + QQ V Y++ F+R C
Sbjct: 444 GVLLD-----GCLAFAP---NTDDQMTGIIGNVQQQALEVLYNVDGATVGFRRGAC 491
>gi|88174583|gb|ABD39366.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 85/329 (25%), Positives = 139/329 (42%), Gaps = 37/329 (11%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ +++ +G P Q +DTGSS WV C C C T F SRS++ A V C +
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58
Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
C P+ + S C + Y G + + + LTF + + + FGC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPSFTFGCN 114
Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLN---SSFSYCIGSLHDPDYLHNKLI--LGDG 229
F N G+ G+G G S++ Q + FSYC+ +K G
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDGFSYCLPLQKSERGFFSKTTGYFSLG 174
Query: 230 AIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
+ D + + +++ L AISVDG L ++P+IF R GV+ DSG++
Sbjct: 175 KVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK----GVVFDSGSE 230
Query: 285 VTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
++++ A L E+++R + S ++ CY + S D P + HF G
Sbjct: 231 LSYIPDRALSVLSQRIRELLLRRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFDDG 285
Query: 342 AKLALEKDSMFYQ---PRPDAFCMAVNPA 367
A+ L +F + D +C+A P
Sbjct: 286 ARFDLGSHGVFVERSVQEQDVWCLAFAPT 314
>gi|21717157|gb|AAM76350.1|AC074196_8 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433294|gb|AAP54832.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 396
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 91/371 (24%), Positives = 153/371 (41%), Gaps = 41/371 (11%)
Query: 57 SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYP-CRDCSPQLGTIFYPSRSSSYAYVP 115
S + + VN++IG PP P +D G L+W C CR C Q +F + SS++ P
Sbjct: 47 SQAFYVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHCRRCFKQDLPLFDTNASSTFRPEP 106
Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
C + C P C+ + T + T+ + T + FG
Sbjct: 107 CGAAVCESIPTRSCAGDGGGACGYEASTSFGRTVGRIGTDAVAI-----GTAATARLAFG 161
Query: 176 CGFSTNRNFKFSGIFGLGIGRS--SLVSQLN-SSFSYCIGSLHDPDY-LHNKLILGDGAI 231
C ++ + + +G+GR+ SL +Q+N ++FSYC L PD + L LG A
Sbjct: 162 CAVASEMDTMWGSSGSVGLGRTNLSLAAQMNATAFSYC---LAPPDTGKSSALFLGASAK 218
Query: 232 I---DEGDAT---------PLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
+ +G T P + Y + LEAI + + SG + +
Sbjct: 219 LAGAGKGAGTTPFVKTSTPPNSGLSRSYLLRLEAIRAGNATIAM-------PQSGNTITV 271
Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFR 339
+ T VT LV Y LR V + + LC+ +S G P + F+
Sbjct: 272 STATPVTALVDSVYRDLRKAVADAVGAAPVPPPVQNYDLCFPKASASG--GAPDLVLAFQ 329
Query: 340 GGAKLALEKDSMFYQPRPDAFCMAV--NPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
GGA++ + S + D C+A+ +PA S++G + Q ++ +D+ ++
Sbjct: 330 GGAEMTVPVSSYLFDAGNDTACVAILGSPA-----LGGVSILGSLQQVNIHLLFDLDKET 384
Query: 398 KTFQRMDCEVL 408
+F+ DC L
Sbjct: 385 LSFEPADCSAL 395
>gi|115465777|ref|NP_001056488.1| Os05g0591300 [Oryza sativa Japonica Group]
gi|113580039|dbj|BAF18402.1| Os05g0591300 [Oryza sativa Japonica Group]
Length = 453
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 104/411 (25%), Positives = 164/411 (39%), Gaps = 76/411 (18%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DCSPQ---LGTIFYPSRSSSYAYVPC 116
F + + +G P V MDTGSSL WV C PC C Q +G IF PS SS++ +V C
Sbjct: 53 FLIPVKLGTPAVQYLVTMDTGSSLSWVQCRPCTIKCHVQPAKVGPIFDPSNSSTFRHVGC 112
Query: 117 DSEHCRYFPYA------RCSAYKHRCIYTQLYLIGPETSVFVS-TEQLTF--KNTDESTI 167
+ C Y C ++ C+YT Y G SV + T++L T +T+
Sbjct: 113 STSICSYLGRTLRIQSKACMEWEDICLYTMSYGGGWAYSVGKAVTDRLVLGGGETTRTTL 172
Query: 168 HVQDVVFGCGFSTN-RNFKFSGIFGLGIGRSSL--VSQLNS--SFSYCIGSLHDPDYLHN 222
+ + VFGC T K +GIFGLG S ++ L S +FSYC+ S D H
Sbjct: 173 SLANFVFGCSMDTQYSTHKEAGIFGLGTSNYSFEQIAPLLSYKAFSYCLPS----DEAHQ 228
Query: 223 K-LILGDGAIIDEGDATPLQFIDGH----YYITLEA--ISVDGRMLDINPNIFKRDDSGG 275
L +G D P G Y I + ++V+G + +
Sbjct: 229 GYLSIGP----DSSGGVPTSMFPGTPRPVYSIGMTGLTVTVNGEVRSLVSGSGSSPSPSS 284
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLE--GEQMRSYSWPDKLCYHGIMSSDLKGF-- 331
+++DSG +T L+ + L D ++ +E G + + + ++LC+ + SD + +
Sbjct: 285 LMVVDSGAKLTLLLASTFGQLEDAIIPAMESLGYSLNTAAGQNQLCF--LTESDRQNYLQ 342
Query: 332 ------------PTVRFHFRGGAKLALEKDSMFY-------------------------Q 354
P F G L L + FY Q
Sbjct: 343 RKPPPPSNWSALPVFHISFTLGLTLTLPPKNAFYLDARYRQEFLTIAWKQIYVRSIFPFQ 402
Query: 355 PRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
D + + A + + ++G + + + YDI RKQ F+ +C
Sbjct: 403 YFSDILGLCSSFARDDYLESGYQILGNLVTKSCGITYDIPRKQFRFRTGEC 453
>gi|88174554|gb|ABD39352.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
Length = 321
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 85/329 (25%), Positives = 140/329 (42%), Gaps = 37/329 (11%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ +++ +G P Q +DTGSS WV C C C T F SRS++ A V C +
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58
Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
C P+ + S C + Y G + + + LTF + + + FGC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPGFSFGCN 114
Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS---FSYCIGSLHDPDYLHNKLI--LGDG 229
F N G+ G+G G S++ Q + + FSYC+ +K G
Sbjct: 115 MDSFGANEFGNVDGLLGMGAGAMSVLKQSSPTFDCFSYCLPLQKSERGFFSKTTGYFSLG 174
Query: 230 AIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
+ D + + +++ L AISVDG L ++P+IF R GV+ DSG++
Sbjct: 175 KVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSRK----GVVFDSGSE 230
Query: 285 VTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
++++ A L E+++R + S ++ CY + S D P + HF G
Sbjct: 231 LSYIPDRALSVLSQRIRELLLRRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFDDG 285
Query: 342 AKLALEKDSMFYQ---PRPDAFCMAVNPA 367
A+ L +F + D +C+A P
Sbjct: 286 ARFDLGSHGVFVERSVQEQDVWCLAFAPT 314
>gi|212275143|ref|NP_001130306.1| uncharacterized protein LOC100191400 precursor [Zea mays]
gi|194688798|gb|ACF78483.1| unknown [Zea mays]
gi|194703430|gb|ACF85799.1| unknown [Zea mays]
gi|194707192|gb|ACF87680.1| unknown [Zea mays]
gi|223944599|gb|ACN26383.1| unknown [Zea mays]
gi|223948667|gb|ACN28417.1| unknown [Zea mays]
gi|414887962|tpg|DAA63976.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 450
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 108/390 (27%), Positives = 162/390 (41%), Gaps = 33/390 (8%)
Query: 31 ARLAYLQE-KIRSHNTYQAQILPSNDISNSLFYV-NISIGQPPVPQFTAMDTGSSLLWVH 88
+RL YL +R A I + + YV S+G PP A+DT + W+
Sbjct: 80 SRLLYLDSLAVRGRARAYAPIASGRQLLQTPTYVVRASLGTPPQQLLLAVDTSNDASWIP 139
Query: 89 CYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPET 148
C C C F P+ S+SY VPC S C P A C C ++ Y ++
Sbjct: 140 CAGCAGCPTSSAAPFDPASSASYRTVPCGSPLCAQAPNAACPPGGKACGFSLTYA---DS 196
Query: 149 SVFVSTEQLTFKNTDESTIHVQDVVFGC-GFSTNRNFKFSGIFGLGIGRSSLVSQLN--- 204
S+ + Q + + V+ FGC +T G+ GLG G S +SQ
Sbjct: 197 SLQAALSQDSLAVAGNA---VKAYTFGCLQRATGTAAPPQGLLGLGRGPLSFLSQTKDMY 253
Query: 205 -SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH----YYITLEAISVDGR 259
++FSYC+ S ++ L LG TPL + H YY+ + I V +
Sbjct: 254 EATFSYCLPSFKSLNF-SGTLRLGRNGQPQRIKTTPL-LANPHRSSLYYVNMTGIRVGRK 311
Query: 260 MLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLC 319
++ I F +G G ++DSGT T LV AY A+RDEV R+ G + S D C
Sbjct: 312 VVPI--PAFD-PATGAGTVLDSGTMFTRLVAPAYVAVRDEVRRRV-GAPVSSLGGFDT-C 366
Query: 320 YHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFC--MAVNPASINNRYVNFS 377
++ + +P V F G E++ + + C MA P +N +
Sbjct: 367 FN----TTAVAWPPVTLLFDGMQVTLPEENVVIHSTYGTISCLAMAAAPDGVNTV---LN 419
Query: 378 LIGMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
+I M QQ + V +D+ + F R C
Sbjct: 420 VIASMQQQNHRVLFDVPNGRVGFARERCTA 449
>gi|356528623|ref|XP_003532899.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 507
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 98/419 (23%), Positives = 158/419 (37%), Gaps = 97/419 (23%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCY------------------------------ 90
++ + +G P + A DTGS W +C
Sbjct: 111 YFTEVKVGSPGQRFWLAADTGSEFTWFNCVMRNATTTATTKKTRKNKTKKKHHHHSKRNR 170
Query: 91 ---------------PCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR-----YFPYARCS 130
PC+ +F P RS S+ V C S+ C+ F + C
Sbjct: 171 TRTTRRTKKKKAKSNPCKG-------VFCPHRSKSFQAVTCASQKCKIDLSQLFSLSLCP 223
Query: 131 AYKHRCIYTQLYLIGPETSVFVSTEQLT--FKNTDESTIHVQDVVFGCGFSTNRNFKFS- 187
C+Y Y G F T+ +T KN E ++ ++ GC S F+
Sbjct: 224 KPSDPCLYDISYADGSSAKGFFGTDTITVDLKNGKEGKLN--NLTIGCTKSMENGVNFNE 281
Query: 188 ---GIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILG---DGAIIDEGDA 237
GI GLG + S + + + FSYC+ + + L +G + ++ E
Sbjct: 282 DTGGILGLGFAKDSFIDKAAYEYGAKFSYCLVDHLSHRNVSSYLTIGGHHNAKLLGEIKR 341
Query: 238 TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALR 297
T L Y + + IS+ G+ML I P ++ + S GG +IDSGT +T L+ AYE +
Sbjct: 342 TELILFPPFYGVNVVGISIGGQMLKIPPQVWDFN-SQGGTLIDSGTTLTALLVPAYEPVF 400
Query: 298 DEVM------IRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----PTVRFHFRGGAKLAL 346
+ ++ R+ GE + + C+ D +GF P + FHF GGA+
Sbjct: 401 EALIKSLTKVKRVTGEDFGALDF----CF------DAEGFDDSVVPRLVFHFAGGARFEP 450
Query: 347 EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
S P C+ + P + S+IG + QQ + +D+ F C
Sbjct: 451 PVKSYIIDVAPLVKCIGIVPI---DGIGGASVIGNIMQQNHLWEFDLSTNTIGFAPSIC 506
>gi|88174573|gb|ABD39361.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 85/329 (25%), Positives = 139/329 (42%), Gaps = 37/329 (11%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ ++ +G P Q +DTGSS WV C C C T F SRS++ A V C +
Sbjct: 1 YVTSVGLGTPSKTQIVEIDTGSSTSWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58
Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
C P+ + S C + Y G + + + LTF + + + FGC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPSFTFGCN 114
Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLNSSF---SYCIGSLHDPDYLHNKLI--LGDG 229
F N G+ G+G G S++ Q + +F SYC+ +K G
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSERGFFSKTTGYFSLG 174
Query: 230 AIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
+ D + + +++ L AISVDG L ++P+IF R GV+ DSG++
Sbjct: 175 KVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK----GVVFDSGSE 230
Query: 285 VTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
++++ A L E+++R + S ++ CY + S D P + HF G
Sbjct: 231 LSYIPDRALSVLSQRIRELLLRRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFDDG 285
Query: 342 AKLALEKDSMFYQ---PRPDAFCMAVNPA 367
A+ L +F + D +C+A P
Sbjct: 286 ARFDLGSRGVFVERSVQEQDVWCLAFAPT 314
>gi|115457772|ref|NP_001052486.1| Os04g0334700 [Oryza sativa Japonica Group]
gi|113564057|dbj|BAF14400.1| Os04g0334700 [Oryza sativa Japonica Group]
Length = 482
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 96/369 (26%), Positives = 154/369 (41%), Gaps = 37/369 (10%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTI----FYPSRSS-SYAYV 114
L+Y +I IG P V + +DTGS WV+ C+ C + + FY RSS S V
Sbjct: 82 LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEV 141
Query: 115 PCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFK---NTDESTIHVQD 171
CD C P + RC Y Y G T + T+ L + ++
Sbjct: 142 KCDDTICTSRPPCNMTL---RCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTS 198
Query: 172 VVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYL 220
V FGCG N GI G G + +SQL ++ FS+C+ D
Sbjct: 199 VTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCL------DST 252
Query: 221 HNKLILGDGAIID-EGDATPLQFIDGHYY-ITLEAISVDGRMLDINPNIFKRDDSGGGVM 278
+ I G +++ + TP+ + Y+ + L++I+V G L + NIF + G
Sbjct: 253 NGGGIFAIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKT-KGTF 311
Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS-YSWPDKLCYHGIMSSDLKGFPTVRFH 337
IDSG+ + +L + Y L V + M + Y++ C+H + S D K FP + FH
Sbjct: 312 IDSGSTLVYLPEIIYSELILAVFAKHPDITMGAMYNFQ---CFHFLGSVDDK-FPKITFH 367
Query: 338 FRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
F L + + + +C A I+ Y + ++G M V YD+ ++
Sbjct: 368 FENDLTLDVYPYDYLLEYEGNQYCFGFQDAGIHG-YKDMIILGDMVISNKVVVYDMEKQA 426
Query: 398 KTFQRMDCE 406
+ +C
Sbjct: 427 IGWTEHNCS 435
>gi|125543639|gb|EAY89778.1| hypothetical protein OsI_11320 [Oryza sativa Indica Group]
Length = 488
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 88/314 (28%), Positives = 138/314 (43%), Gaps = 36/314 (11%)
Query: 116 CDSEHCRYFPYARCSAYK----HRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
CDS C+ A C K C+YT Y T+ + ++ TF + V
Sbjct: 190 CDSTLCQGLLVASCGNTKFWPNQTCVYTYYYNDKSVTTGLLEVDKFTFG----AGASVPG 245
Query: 172 VVFGCGFSTNRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDP-------DYLH 221
V FGCG N FK +GI G G G SL SQL +FS+C +++ D L
Sbjct: 246 VAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVNGLKQSTVLLDLLA 305
Query: 222 NKLILGDGAIIDEGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSGGGVM 278
+ G GA+ +TPL + YY++L+ I+V L + + F + GG +
Sbjct: 306 DLYKNGRGAV----QSTPLIQNSANPTLYYLSLKGITVGSTRLPVPESAFALTNGTGGTI 361
Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHF 338
IDSGT +T L + Y+ +RDE +++ + + C+ S P + HF
Sbjct: 362 IDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGPYTCFSA-PSQAKPDVPKLVLHF 420
Query: 339 RGGAKLALEKDSMFYQPRPDA----FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
GA + L +++ ++ DA C+A+N + R + IG QQ +V YD+
Sbjct: 421 E-GATMDLPRENYVFEVPDDAGNSMICLAINELG-DER----ATIGNFQQQNMHVLYDLQ 474
Query: 395 RKQKTFQRMDCEVL 408
+F C+ L
Sbjct: 475 NNMLSFVAAQCDKL 488
Score = 47.4 bits (111), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 34/140 (24%), Positives = 61/140 (43%), Gaps = 6/140 (4%)
Query: 253 AISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSY 312
I+V L + + F + GG +IDSGT +T L + Y+ +RDE +++ +
Sbjct: 41 GITVGSTRLPVPESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGN 100
Query: 313 SWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDA----FCMAVNPAS 368
+ C+ S P + HF GA + L +++ ++ DA C+A+N
Sbjct: 101 ATGPYTCFSA-PSQAKPDVPKLVLHFE-GATMDLPRENYVFEVPDDAGNSIICLAINKGD 158
Query: 369 INNRYVNFSLIGMMAQQFYN 388
NF M A +++
Sbjct: 159 ETTIIGNFQQQNMHALPYFD 178
>gi|79315693|ref|NP_001030891.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332646353|gb|AEE79874.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 499
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 98/371 (26%), Positives = 161/371 (43%), Gaps = 70/371 (18%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+++++ +G PP +DTGS L W+ C PC DC Q D++
Sbjct: 170 YFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQ-----------------NDNQS 212
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
C Y+ + Y + +V T LT +V++++FGCG
Sbjct: 213 CPYYYW-----------YGDSSNTTGDFAVETFTVNLTTNGGSSELYNVENMMFGCG-HW 260
Query: 181 NRNFKFSGIF-------GLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILG-D 228
NR G+F GLG G S SQL S SFSYC+ + + +KLI G D
Sbjct: 261 NR-----GLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLIFGED 315
Query: 229 GAIIDEGDATPLQFIDGH-------YYITLEAISVDGRMLDINPNIFK-RDDSGGGVMID 280
++ + F+ G YY+ +++I V G +L+I + D GG +ID
Sbjct: 316 KDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETWNISSDGAGGTIID 375
Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQ--MRSYSWPDKLCYH--GIMSSDLKGFPTVRF 336
SGT +++ + AYE +++++ + +G+ R + D C++ GI + L P +
Sbjct: 376 SGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDP-CFNVSGIHNVQL---PELGI 431
Query: 337 HFRGGAKLALEKDSMFYQPRPDAFCMAV--NPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
F GA ++ F D C+A+ P S FS+IG QQ +++ YD
Sbjct: 432 AFADGAVWNFPTENSFIWLNEDLVCLAMLGTPKSA------FSIIGNYQQQNFHILYDTK 485
Query: 395 RKQKTFQRMDC 405
R + + C
Sbjct: 486 RSRLGYAPTKC 496
>gi|52076082|dbj|BAD46595.1| aspartic proteinase nepenthesin II -like [Oryza sativa Japonica
Group]
Length = 476
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 103/381 (27%), Positives = 160/381 (41%), Gaps = 61/381 (16%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC----SPQLGT----IFYPSRSSSY 111
L Y +++G P V A+DTGS L WV C C C SP G+ ++ P++S++
Sbjct: 61 LHYAVVALGTPNVTFLVALDTGSDLFWVPC-DCLKCAPFQSPNYGSLKFDVYSPAQSTTS 119
Query: 112 AYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD--ESTIHV 169
VPC S C C + + C Y+ YL +S V E + + +D +S I
Sbjct: 120 RKVPCSSNLCDL--QNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKIVT 177
Query: 170 QDVVFGCGFSTNRNFKFS----GIFGLGIGRSSLVSQLNS------SFSYCIGSLHDPDY 219
++FGCG +F S G+ GLG+ S+ S L S SFS C G D
Sbjct: 178 APIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFG-----DD 232
Query: 220 LHNKLILGDGAIIDEGDATPLQFI--DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
H ++ GD D+ + TPL + +Y IT+ I+V + +
Sbjct: 233 GHGRINFGDTGSSDQKE-TPLNVYKQNPYYNITITGITVGSKSISTE----------FSA 281
Query: 278 MIDSGTDVTWLVKEAYEALRD--EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
++DSGT T L Y + + IR M S P + CY +S++ P V
Sbjct: 282 IVDSGTSFTALSDPMYTQITSSFDAQIR-SSRNMLDSSMPFEFCYS--VSANGIVHPNVS 338
Query: 336 FHFRGGAKLALE------KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNV 389
+GG+ + D+ F P +C+A+ + + VN LIG V
Sbjct: 339 LTAKGGSIFPVNDPIITITDNAF---NPVGYCLAI----MKSEGVN--LIGENFMSGLKV 389
Query: 390 GYDIGRKQKTFQRMDCEVLDD 410
+D R ++ +C D+
Sbjct: 390 VFDRERMVLGWKNFNCYNFDE 410
>gi|218202547|gb|EEC84974.1| hypothetical protein OsI_32231 [Oryza sativa Indica Group]
Length = 513
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 103/381 (27%), Positives = 160/381 (41%), Gaps = 61/381 (16%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC----SPQLGT----IFYPSRSSSY 111
L Y +++G P V A+DTGS L WV C C C SP G+ ++ P++S++
Sbjct: 98 LHYAVVALGTPNVTFLVALDTGSDLFWVPC-DCLKCAPLQSPNYGSLKFDVYSPAQSTTS 156
Query: 112 AYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD--ESTIHV 169
VPC S C C + + C Y+ YL +S V E + + +D +S I
Sbjct: 157 RKVPCSSNLCDL--QNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKIVT 214
Query: 170 QDVVFGCGFSTNRNFKFS----GIFGLGIGRSSLVSQLNS------SFSYCIGSLHDPDY 219
++FGCG +F S G+ GLG+ S+ S L S SFS C G D
Sbjct: 215 APIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFG-----DD 269
Query: 220 LHNKLILGDGAIIDEGDATPLQFI--DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
H ++ GD D+ + TPL + +Y IT+ I+V + +
Sbjct: 270 GHGRINFGDTGSSDQKE-TPLNVYKQNPYYNITITGITVGSKSISTE----------FSA 318
Query: 278 MIDSGTDVTWLVKEAYEALRD--EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
++DSGT T L Y + + IR M S P + CY +S++ P V
Sbjct: 319 IVDSGTSFTALSDPMYTQITSSFDAQIR-SSRNMLDSSMPFEFCYS--VSANGIVHPNVS 375
Query: 336 FHFRGGAKLALE------KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNV 389
+GG+ + D+ F P +C+A+ + + VN LIG V
Sbjct: 376 LTAKGGSIFPVNDPIITITDNAF---NPVGYCLAI----MKSEGVN--LIGENFMSGLKV 426
Query: 390 GYDIGRKQKTFQRMDCEVLDD 410
+D R ++ +C D+
Sbjct: 427 VFDRERMVLGWKNFNCYNFDE 447
>gi|449495082|ref|XP_004159729.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
1-like [Cucumis sativus]
Length = 524
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 94/377 (24%), Positives = 154/377 (40%), Gaps = 51/377 (13%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGT--------IFYPSRSSSY 111
L+Y +++G P VP A+DTGS L W+ C C +C L T I+ P+ SS+
Sbjct: 106 LYYAEVTVGTPGVPYLVALDTGSDLFWLPC-DCVNCITGLNTTQGPVNFNIYSPNNSSTS 164
Query: 112 AYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTE---QLTFKNTDESTIH 168
V C S C + +CS+ C Y YL +S E LT + ++
Sbjct: 165 KEVQCSSSLCSHL--DQCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHLTTNDVQSKPVN 222
Query: 169 VQDVVFGCGFSTNRNFKFS----GIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPD 218
+ + GCG + F S G+FGLGI S+ S L ++SFS C G
Sbjct: 223 AR-ITLGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPAR--- 278
Query: 219 YLHNKLILGDGAIIDEGDATPLQFIDGH--YYITLEAISVDGRMLDINPNIFKRDDSGGG 276
++ GD + + TP H Y +++ I V G + D++
Sbjct: 279 --MGRIEFGDKGSPGQNE-TPFNLGRRHPTYNVSITQIGVGGHISDLD----------VA 325
Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYS-WPDKLCYHGIMSSDLKGFPTVR 335
V+ DSGT T+L AY D+ +E +Q S P + CY + +P +
Sbjct: 326 VIFDSGTSFTYLNDPAYSLFADKFASMVEEKQFTMNSDIPFENCYELSPNQTTFTYPLMN 385
Query: 336 FHFRGGAKLALEKDSMFYQPRPD-AFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
+GG + + FC+A+ R + ++IG Y++ +D
Sbjct: 386 LTMKGGGHFVINHPIVLISTESKRLFCLAI------ARSDSINIIGQNFMTGYHIVFDRE 439
Query: 395 RKQKTFQRMDCEVLDDD 411
+ ++ +C +D+
Sbjct: 440 KMVLGWKESNCTGYEDE 456
>gi|449456843|ref|XP_004146158.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 547
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 94/377 (24%), Positives = 154/377 (40%), Gaps = 51/377 (13%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGT--------IFYPSRSSSY 111
L+Y +++G P VP A+DTGS L W+ C C +C L T I+ P+ SS+
Sbjct: 129 LYYAEVTVGTPGVPYLVALDTGSDLFWLPC-DCVNCITGLNTTQGPVNFNIYSPNNSSTS 187
Query: 112 AYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTE---QLTFKNTDESTIH 168
V C S C + +CS+ C Y YL +S E LT + ++
Sbjct: 188 KEVQCSSSLCSHL--DQCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHLTTNDVQSKPVN 245
Query: 169 VQDVVFGCGFSTNRNFKFS----GIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPD 218
+ + GCG + F S G+FGLGI S+ S L ++SFS C G
Sbjct: 246 AR-ITLGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPAR--- 301
Query: 219 YLHNKLILGDGAIIDEGDATPLQFIDGH--YYITLEAISVDGRMLDINPNIFKRDDSGGG 276
++ GD + + TP H Y +++ I V G + D++
Sbjct: 302 --MGRIEFGDKGSPGQNE-TPFNLGRRHPTYNVSITQIGVGGHISDLD----------VA 348
Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYS-WPDKLCYHGIMSSDLKGFPTVR 335
V+ DSGT T+L AY D+ +E +Q S P + CY + +P +
Sbjct: 349 VIFDSGTSFTYLNDPAYSLFADKFASMVEEKQFTMNSDIPFENCYELSPNQTTFTYPLMN 408
Query: 336 FHFRGGAKLALEKDSMFYQPRPD-AFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
+GG + + FC+A+ R + ++IG Y++ +D
Sbjct: 409 LTMKGGGHFVINHPIVLISTESKRLFCLAI------ARSDSINIIGQNFMTGYHIVFDRE 462
Query: 395 RKQKTFQRMDCEVLDDD 411
+ ++ +C +D+
Sbjct: 463 KMVLGWKESNCTGYEDE 479
>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
Length = 481
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 101/362 (27%), Positives = 151/362 (41%), Gaps = 61/362 (16%)
Query: 70 PPVPQFTAMDTGSSLLWVHCYPCR--DCSPQLGTIFYPSRSSSYAYVPCDSEHCRYF-PY 126
P V Q +D+ S + WV C PC C PQ+ + + PSRS S A C S C PY
Sbjct: 155 PGVIQTVVLDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPSSAPFSCSSPTCTALGPY 214
Query: 127 ARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF-- 184
A A ++C Y Y G TS + LT + V FGC + +F
Sbjct: 215 ANGCA-NNQCQYLVRYPDGSSTSGAYIADLLTLDAGNA----VSGFKFGCSHAEQGSFDA 269
Query: 185 KFSGIFGLGIGRSSLVSQLNS----SFSYCIG---------SLHDPDYLHNKLILGDGAI 231
+ +GI LG G SL+SQ S +FSYCI +L P ++ ++
Sbjct: 270 RAAGIMALGGGPESLLSQTASRYGNAFSYCIPATASDSGFFTLGVPRRASSRYVV----- 324
Query: 232 IDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
TP+ + Y + L I+V G+ L + P +F G ++DS T +T L
Sbjct: 325 ------TPMVRFRQAATFYGVLLRTITVGGQRLGVAPAVFA-----AGSVLDSRTAITRL 373
Query: 289 VKEAYEALRDEVMIRLEGEQMRSY-SWPDK----LCYHGIMSSDLKGFPTVRFHFRGGAK 343
AY+ALR M Y S P K CY +++ P + F A
Sbjct: 374 PPTAYQALRSAFR-----SSMTMYRSAPPKGYLDTCYDFTGVVNIR-LPKISLVFDRNAV 427
Query: 344 LALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
L L+ + + C+A ++ ++R ++G + QQ V YD+G F++
Sbjct: 428 LPLDPSGILFN-----DCLAFT-SNADDRMPG--VLGSVQQQTIEVLYDVGGGAVGFRQG 479
Query: 404 DC 405
C
Sbjct: 480 AC 481
>gi|118486912|gb|ABK95290.1| unknown [Populus trichocarpa]
Length = 438
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 102/392 (26%), Positives = 160/392 (40%), Gaps = 39/392 (9%)
Query: 32 RLAYLQEKIRSHNTYQAQILPSNDISNSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHCY 90
RL YL + T I P + YV + +G P F +DT + WV
Sbjct: 69 RLKYL-STLADQKTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSNDAAWV--- 124
Query: 91 PCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAY-KHRCIYTQLYLIGPETS 149
PC C+ T F P+ S++ + C C C A C++ Q Y G ++S
Sbjct: 125 PCSGCTGFSSTTFLPNASTTLGSLDCSGAQCSQVRGFSCPATGSSACLFNQSY--GGDSS 182
Query: 150 VFVSTEQLTFKNTDESTIHVQDVV----FGC-GFSTNRNFKFSGIFGLGIGRSSLVSQLN 204
LT ++ DV+ FGC + + G+ GLG G SL+SQ
Sbjct: 183 -------LTATLVQDAITLANDVIPGFTFGCINAVSGGSIPPQGLLGLGRGPISLISQAG 235
Query: 205 S----SFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH----YYITLEAISV 256
+ FSYC+ S Y L LG TPL + H YY+ L +SV
Sbjct: 236 AMYSGVFSYCLPSFKS-YYFSGSLKLGPVGQPKSIRTTPL-LRNPHRPSLYYVNLTGVSV 293
Query: 257 DGRMLDINPN--IFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSW 314
GR+ P+ + ++G G +IDSGT +T V+ Y A+RDE ++ G + S
Sbjct: 294 -GRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNGP-ISSLGA 351
Query: 315 PDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYV 374
D C+ ++ P + HF G + ++S+ + C+++ A+ NN
Sbjct: 352 FDT-CFAATNEAEA---PAITLHFEGLNLVLPMENSLIHSSSGSLACLSMA-AAPNNVNS 406
Query: 375 NFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
++I + QQ + +D + R C
Sbjct: 407 VLNVIANLQQQNLRIMFDTTNSRLGIARELCN 438
>gi|125571060|gb|EAZ12575.1| hypothetical protein OsJ_02481 [Oryza sativa Japonica Group]
Length = 501
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 101/378 (26%), Positives = 142/378 (37%), Gaps = 56/378 (14%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ I +G P P +DTGS ++W+ C PCR C Q G +F P S SY V C +
Sbjct: 147 YFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDPRASHSYGAVDCAAPL 206
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
CR C + C+Y Y G T+ +TE LTF S V V GCG
Sbjct: 207 CRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFA----SGARVPRVALGCGHDN 262
Query: 181 NRNFKFSGIFGLGI-GRSSLVSQLN----SSFSYCI-------------------GSLHD 216
F + G S SQ++ SFSYC+ GS
Sbjct: 263 EGLFVAAAGLLGLGRGSLSFPSQISRRFGRSFSYCLVDRTSSSASATSRSSTVTFGSGAR 322
Query: 217 PDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
L +++ DG +GD L+ GH +P+ + GG
Sbjct: 323 -GALGRRVLHPDGEEPQDGDVL-LRAAHGHQRRRRARPGRGRVRPPPDPSTGR-----GG 375
Query: 277 VMIDSG-TDVTWLVKEAYEALRDEVMIRLEGEQMR--SYSWPDKLCYHGIMSSDLKGF-- 331
V++DSG W G ++ +S D CY DL G
Sbjct: 376 VIVDSGRPSPAWARAGRTPPCATRSRAAAAGLRLSPGGFSLFDT-CY------DLSGLKV 428
Query: 332 ---PTVRFHFRGGAKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFY 387
PTV HF GGA+ AL ++ + FC A A + S+IG + QQ +
Sbjct: 429 VKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAF--AGTDG---GVSIIGNIQQQGF 483
Query: 388 NVGYDIGRKQKTFQRMDC 405
V +D ++ F C
Sbjct: 484 RVVFDGDGQRLGFVPKGC 501
>gi|224072901|ref|XP_002303933.1| predicted protein [Populus trichocarpa]
gi|222841365|gb|EEE78912.1| predicted protein [Populus trichocarpa]
Length = 370
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 93/360 (25%), Positives = 143/360 (39%), Gaps = 28/360 (7%)
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
I + + V +G PP A+D W+ C C CS T+F +S+++ +
Sbjct: 30 IQSPSYIVKAKVGTPPQTLLMALDNSYDAAWIPCKGCVGCS---STVFNTVKSTTFKTLG 86
Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
C + C+ P C C + Y +S +S LT S V FG
Sbjct: 87 CGAPQCKQVPNPICGG--STCTWNTTY----GSSTILS--NLTRDTIALSMDPVPYYAFG 138
Query: 176 C-GFSTNRNFKFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGA 230
C +T + G+ G G G S +SQ S+FSYC+ S ++ L LG
Sbjct: 139 CIQKATGSSVPPQGLLGFGRGPLSFLSQTQNLYKSTFSYCLPSFRTLNF-SGSLRLGPVG 197
Query: 231 IIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVT 286
TPL YY+ L I V +++DI + +G G + DSGT T
Sbjct: 198 QPPRIKTTPLLKNPRRSSLYYVKLNGIRVGRKIVDIPRSALAFNPTTGAGTIFDSGTVFT 257
Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
LV AY A+R+E R+ + S D CY + PT+ F F G
Sbjct: 258 RLVAPAYIAVRNEFRKRVGNATVSSLGGFDT-CYSVPIVP-----PTITFMFSGMNVTMP 311
Query: 347 EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
++ + + C+A+ A N V ++I M QQ + + +D+ + R C
Sbjct: 312 PENLLIHSTAGVTSCLAMAAAPDNVNSV-LNVIASMQQQNHRILFDVPNSRLGVAREQCS 370
>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 533
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 107/382 (28%), Positives = 154/382 (40%), Gaps = 66/382 (17%)
Query: 61 FYVNISIGQPPVPQFTAM-DTGSSLLWVHCYPC--RDCSPQLGTIFYPSRSSSYAYVPCD 117
+ I++G T + DTGS L WV C PC C Q +F P+ S ++A VPC
Sbjct: 180 YVTTIALGGGGAKNLTVIVDTGSDLTWVQCEPCPGSSCYAQRDPLFDPAASPTFAAVPCG 239
Query: 118 SEHCRYF---------PYARCSA-YKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI 167
S C AR + + RC Y Y G + ++ + L +T
Sbjct: 240 SPACAASLKDATGAPGSCARSAGNSEQRCYYALSYGDGSFSRGVLAQDTLGLG----TTT 295
Query: 168 HVQDVVFGCGFSTNRNFKFSGIFGL-GIGRS--SLVSQLNSSF----SYCI-------GS 213
+ VFGCG S NR F G GL G+GR+ SLVSQ + F SYC+ GS
Sbjct: 296 KLDGFVFGCGLS-NRGL-FGGTAGLMGLGRTDLSLVSQTAARFGGVFSYCLPATTTSTGS 353
Query: 214 LHD--------PDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINP 265
L P+ + ++I D T F Y+I + +V G P
Sbjct: 354 LSLGPGPSSSFPNMAYTRMI---------ADPTQPPF----YFINITGAAVGGGAALTAP 400
Query: 266 NIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMS 325
G V++DSGT +T L Y+A+R E R E +S D CY +
Sbjct: 401 GF-----GAGNVLVDSGTVITRLAPSVYKAVRAEFARRFEYPAAPGFSILDA-CYD-LTG 453
Query: 326 SDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDA--FCMAVNPASINNRYVNFSLIGMMA 383
D P + GGA++ ++ M + R D C+A+ ++ +IG
Sbjct: 454 RDEVNVPLLTLTLEGGAQVTVDAAGMLFVVRKDGSQVCLAMASLPYEDQT---PIIGNYQ 510
Query: 384 QQFYNVGYDIGRKQKTFQRMDC 405
Q+ V YD + F DC
Sbjct: 511 QRNKRVVYDTVGSRLGFADEDC 532
>gi|88174561|gb|ABD39355.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
Length = 321
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 85/329 (25%), Positives = 140/329 (42%), Gaps = 37/329 (11%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ +++ +G P Q +DTGSS WV C C C T F SRS++ A V C +
Sbjct: 1 YVISVGLGTPSKTQILEIDTGSSTSWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58
Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
C P+ + S C + Y G + + + LTF + + + FGC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPSFSFGCN 114
Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS---FSYCIGSLHDPDYLHNKLI--LGDG 229
F N G+ G+G G S++ Q + + FSYC+ +K G
Sbjct: 115 MDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQMSERGFFSKTTGYFSLG 174
Query: 230 AIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
+ D + + +++ L AISVDG L ++P+IF R GV+ DSG++
Sbjct: 175 KVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSRK----GVVFDSGSE 230
Query: 285 VTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
++++ A L E+++R + S ++ CY + S D P + HF G
Sbjct: 231 LSYIPDRALSVLSQRIRELLLRRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFDDG 285
Query: 342 AKLALEKDSMFYQ---PRPDAFCMAVNPA 367
A+ L +F + D +C+A P
Sbjct: 286 ARFDLGSHGVFVERSVQEQDVWCLAFAPT 314
>gi|115480451|ref|NP_001063819.1| Os09g0542100 [Oryza sativa Japonica Group]
gi|113632052|dbj|BAF25733.1| Os09g0542100, partial [Oryza sativa Japonica Group]
Length = 490
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 104/381 (27%), Positives = 161/381 (42%), Gaps = 61/381 (16%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC----SPQLGT----IFYPSRSSSY 111
L Y +++G P V A+DTGS L WV C C C SP G+ ++ P++S++
Sbjct: 75 LHYAVVALGTPNVTFLVALDTGSDLFWVPC-DCLKCAPFQSPNYGSLKFDVYSPAQSTTS 133
Query: 112 AYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD--ESTIHV 169
VPC S C C + + C Y+ YL +S V E + + +D +S I
Sbjct: 134 RKVPCSSNLCDL--QNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKIVT 191
Query: 170 QDVVFGCGFSTNRNFKFS----GIFGLGIGRSSLVSQLNS------SFSYCIGSLHDPDY 219
++FGCG +F S G+ GLG+ S+ S L S SFS C G D
Sbjct: 192 APIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFG-----DD 246
Query: 220 LHNKLILGDGAIIDEGDATPLQFI--DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
H ++ GD D+ + TPL + +Y IT+ I+V + + F
Sbjct: 247 GHGRINFGDTGSSDQKE-TPLNVYKQNPYYNITITGITVGSKSISTE---FS-------A 295
Query: 278 MIDSGTDVTWLVKEAYEALRD--EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
++DSGT T L Y + + IR M S P + CY +S++ P V
Sbjct: 296 IVDSGTSFTALSDPMYTQITSSFDAQIR-SSRNMLDSSMPFEFCYS--VSANGIVHPNVS 352
Query: 336 FHFRGGAKLALE------KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNV 389
+GG+ + D+ F P +C+A+ + + VN LIG V
Sbjct: 353 LTAKGGSIFPVNDPIITITDNAF---NPVGYCLAI----MKSEGVN--LIGENFMSGLKV 403
Query: 390 GYDIGRKQKTFQRMDCEVLDD 410
+D R ++ +C D+
Sbjct: 404 VFDRERMVLGWKNFNCYNFDE 424
>gi|359806832|ref|NP_001241567.1| uncharacterized protein LOC100819698 precursor [Glycine max]
gi|255638149|gb|ACU19388.1| unknown [Glycine max]
Length = 437
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 106/388 (27%), Positives = 155/388 (39%), Gaps = 33/388 (8%)
Query: 31 ARLAYLQEKIRSHNTYQAQILPSNDISNSLFY-VNISIGQPPVPQFTAMDTGSSLLWVHC 89
AR+ YL + + I I+ S Y V IG P AMDT + WV C
Sbjct: 69 ARMQYLSSLVARRSI--VPIASGRQITQSPTYIVKAKIGTPAQTLLLAMDTSNDASWVPC 126
Query: 90 YPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETS 149
C CS T F P++S+++ V C + C+ C C + Y +S
Sbjct: 127 TACVGCS--TTTPFAPAKSTTFKKVGCGASQCKQVRNPTCDG--SACAFNFTY---GTSS 179
Query: 150 VFVSTEQLTFKNTDESTIHVQDVVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLN 204
V S Q T +T V FGC G S G +
Sbjct: 180 VAASLVQDTV---TLATDPVPAYAFGCIQKVTGSSVPPQGLLGLGRGPLSLLAQTQKLYQ 236
Query: 205 SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRML 261
S+FSYC+ S ++ L LG A TPL YY+ L AI V R++
Sbjct: 237 STFSYCLPSFKTLNF-SGSLRLGPVAQPKRIKFTPLLKNPRRSSLYYVNLVAIRVGRRIV 295
Query: 262 DINPNIFKRD-DSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL-- 318
DI P + ++G G + DSGT T LV+ AY A+R+E R+ + + +
Sbjct: 296 DIPPEALAFNANTGAGTVFDSGTVFTRLVEPAYNAVRNEFRRRIAVHKKLTVTSLGGFDT 355
Query: 319 CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAF-CMAVNPASINNRYVNFS 377
CY + + PT+ F F G + L D++ + C+A+ PA N V +
Sbjct: 356 CYTAPIVA-----PTITFMF-SGMNVTLPPDNILIHSTAGSVTCLAMAPAPDNVNSV-LN 408
Query: 378 LIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+I M QQ + V +D+ + R C
Sbjct: 409 VIANMQQQNHRVLFDVPNSRLGVARELC 436
>gi|88174591|gb|ABD39370.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 85/329 (25%), Positives = 139/329 (42%), Gaps = 37/329 (11%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ ++ +G P Q +DTGSS WV C C C T F SRS++ A V C +
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58
Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
C P+ + S C + Y G + + + LTF + + + FGC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPSFTFGCN 114
Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLNSSF---SYCIGSLHDPDYLHNKLI--LGDG 229
F N G+ G+G G S++ Q + +F SYC+ +K G
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSERGFFSKTTGYFSLG 174
Query: 230 AIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
+ D + + +++ L AISVDG L ++P+IF R GV+ DSG++
Sbjct: 175 KVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK----GVVFDSGSE 230
Query: 285 VTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
++++ A L E+++R + S ++ CY + S D P + HF G
Sbjct: 231 LSYIPDRALSVLSQRIRELLLRRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFDDG 285
Query: 342 AKLALEKDSMFYQ---PRPDAFCMAVNPA 367
A+ L +F + D +C+A P
Sbjct: 286 ARFDLGIHGVFVERSVQEQDVWCLAFAPT 314
>gi|42565828|ref|NP_190704.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645262|gb|AEE78783.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 488
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 100/374 (26%), Positives = 155/374 (41%), Gaps = 54/374 (14%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGT---------IFYPSRSSS 110
L Y N++IG P A+DTGS L W+ C C + T I+ PS+S S
Sbjct: 88 LHYANVTIGTPAQWFLVALDTGSDLFWLPCNCNSTCVRSMETDQGERIKLNIYNPSKSKS 147
Query: 111 YAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
+ V C+S C RC + C Y YL S V E + +T+E
Sbjct: 148 SSKVTCNSTLCAL--RNRCISPVSDCPYRIRYLSPGSKSTGVLVEDVIHMSTEEGEARDA 205
Query: 171 DVVFGCGFSTNRNFK---FSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLH 221
+ FGC S FK +GI GL I ++ + L + SFS C G P+
Sbjct: 206 RITFGCSESQLGLFKEVAVNGIMGLAIADIAVPNMLVKAGVASDSFSMCFG----PN--- 258
Query: 222 NKLILGDGAII--DEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRD----DSGG 275
G G I D+G + L+ T + ++ D++ FK D+
Sbjct: 259 -----GKGTISFGDKGSSDQLE--------TPLSGTISPMFYDVSITKFKVGKVTVDTEF 305
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQM-RSYSWPDKLCYHGIMSSDLKGFPTV 334
DSGT VTWL++ Y AL + + ++ +S P + CY +SD P+V
Sbjct: 306 TATFDSGTAVTWLIEPYYTALTTNFHLSVPDRRLSKSVDSPFEFCYIITSTSDEDKLPSV 365
Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAF---CMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
F +GGA + + + +F C+AV + +FS+IG Y + +
Sbjct: 366 SFEMKGGAAYDVFSPILVFDTSDGSFQVYCLAV----LKQVNADFSIIGQNFMTNYRIVH 421
Query: 392 DIGRKQKTFQRMDC 405
D R+ +++ +C
Sbjct: 422 DRERRILGWKKSNC 435
>gi|30680102|ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|17978947|gb|AAL47439.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|22655368|gb|AAM98276.1| At2g17760/At2g17760 [Arabidopsis thaliana]
gi|330251585|gb|AEC06679.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 513
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 100/372 (26%), Positives = 157/372 (42%), Gaps = 51/372 (13%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGT---------IFYPSRSSS 110
L Y N+++G P A+DTGS L W+ C C +C +L I+ P+ SS+
Sbjct: 103 LHYANVTVGTPSDWFMVALDTGSDLFWLPC-DCTNCVRELKAPGGSSLDLNIYSPNASST 161
Query: 111 YAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQ-LTFKNTDESTIHV 169
VPC+S C RC++ + C Y YL +S V E L + D+S+ +
Sbjct: 162 STKVPCNSTLCTRGD--RCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAI 219
Query: 170 -QDVVFGCGFSTNRNF----KFSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPD 218
V FGCG F +G+FGLG+ S+ S L +SFS C G+
Sbjct: 220 PARVTFGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDG--- 276
Query: 219 YLHNKLILGDGAIIDEGDATPLQFIDGH--YYITLEAISVDGRMLDINPNIFKRDDSGGG 276
++ GD +D+ + TPL H Y IT+ ISV G D+ +
Sbjct: 277 --AGRISFGDKGSVDQRE-TPLNIRQPHPTYNITVTKISVGGNTGDLEFD---------- 323
Query: 277 VMIDSGTDVTWLVKEAYEALRDEV-MIRLEGE-QMRSYSWPDKLCYHGIMSSDLKGFPTV 334
+ DSGT T+L AY + + + L+ Q P + CY + D +P V
Sbjct: 324 AVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALSPNKDSFQYPAV 383
Query: 335 RFHFRGGAKLALEKDSMFYQPR-PDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
+GG+ + + + D +C+A+ + + S+IG Y V +D
Sbjct: 384 NLTMKGGSSYPVYHPLVVIPMKDTDVYCLAI------MKIEDISIIGQNFMTGYRVVFDR 437
Query: 394 GRKQKTFQRMDC 405
+ ++ DC
Sbjct: 438 EKLILGWKESDC 449
>gi|225464832|ref|XP_002272243.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 467
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 98/382 (25%), Positives = 161/382 (42%), Gaps = 40/382 (10%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHC---YPCRDCSPQL----GTIFYPSRSSSYAY 113
+ + +S G PP MDTGS L+W C Y CR+CS IF P SSS
Sbjct: 90 YSIPLSFGTPPQTLPLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSSNIFIPKSSSSSKV 149
Query: 114 VPCDSEHCRYFPYARCSAYKHRCIYT--QLYLIGPETSVF----VSTEQLTFKNTDESTI 167
+ C + C + ++ + C T I P VF ++ + + D
Sbjct: 150 LGCVNPKCGWIHGSKVQSRCRDCEPTSPNCTQICPPYLVFYGSGITGGIMLSETLDLPGK 209
Query: 168 HVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLIL 226
V + + GC S + +GI G G G SL SQL FSYC+ S D + ++
Sbjct: 210 GVPNFIVGC--SVLSTSQPAGISGFGRGPPSLPSQLGLKKFSYCLLSRRYDDTTESSSLV 267
Query: 227 GDGAIIDEGDATP----LQFIDG-----------HYYITLEAISVDGRMLDIN-PNIFKR 270
DG D G+ T F+ +YY+ L I+V G+ + I +
Sbjct: 268 LDGE-SDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITVGGKHVKIPYKYLIPG 326
Query: 271 DDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL--CYHGIMSSDL 328
D GG +IDSGT T++ E +E + E +++ ++ L C++ I +
Sbjct: 327 ADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQSKRATEVEGITGLRPCFN-ISGLNT 385
Query: 329 KGFPTVRFHFRGGAKLALE-KDSMFYQPRPDAFCMAVNPASINNRYVNFS---LIGMMAQ 384
FP + FRGGA++ L + + + D C+ + + + ++G Q
Sbjct: 386 PSFPELTLKFRGGAEMELPLANYVAFLGGDDVVCLTIVTDGAAGKEFSGGPAIILGNFQQ 445
Query: 385 QFYNVGYDIGRKQKTFQRMDCE 406
Q + V YD+ ++ F++ C+
Sbjct: 446 QNFYVEYDLRNERLGFRQQSCK 467
>gi|51038078|gb|AAT93881.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 481
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 105/410 (25%), Positives = 154/410 (37%), Gaps = 68/410 (16%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR----------DCSPQLGTIFYPSRSSS 110
+ + IG PP P +DTGS L+W C CR C PQ + S S +
Sbjct: 78 YIASYGIGDPPQPAEAVVDTGSDLVWTQCSTCRLPAAAAAGGGGCFPQNLPYYNFSLSRT 137
Query: 111 YAYVPCDSEH---CRYFP-YARCS----AYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT 162
VPCD + C P A C+ + C+ Y G V T+ TF ++
Sbjct: 138 ARAVPCDDDDGALCGVAPETAGCARGGGSGDDACVVAASYGAGVALGVL-GTDAFTFPSS 196
Query: 163 DESTIHVQDVVFGC----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS-FSYCIGSLHDP 217
T+ FGC S SGI GLG G SLVSQLN++ FSYC+
Sbjct: 197 SSVTL-----AFGCVSQTRISPGALNGASGIIGLGRGALSLVSQLNATEFSYCLTPYFRD 251
Query: 218 DYLHNKLILGDG-----------AIIDEGDATPLQFIDG--------HYYITLEAISVDG 258
+ L +GDG T + F YY+ L ++
Sbjct: 252 TVSPSHLFVGDGELAGLSAAAGGGGGGGAPVTTVPFAKNPKDSPFSTFYYLPLVGLAAGN 311
Query: 259 RMLDINPNIFKRDDS-----GGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQM---- 309
+ + F ++ GG +IDSG+ T LV A+ AL E+ +L G
Sbjct: 312 ATVALPAGAFDLREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGSLVPP 371
Query: 310 -RSYSWPDKLCYHGIMSSD---LKGFPTVRFHF----RGGAKLALEKDSMFYQPRPDAFC 361
+LC D P + F GG +L + + + + +C
Sbjct: 372 PAKLGGALELCVEAGDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEKYWARVEASTWC 431
Query: 362 MAVNPASINNRYV---NFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
MAV ++ N + ++IG QQ V YD+ +FQ +C +
Sbjct: 432 MAVVSSASGNATLPTNETTIIGNFMQQDMRVLYDLANGLLSFQPANCSAV 481
>gi|88174563|gb|ABD39356.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
Length = 323
Score = 91.3 bits (225), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 86/331 (25%), Positives = 140/331 (42%), Gaps = 39/331 (11%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ +++ +G P Q +DTGSS WV C C C T F SRS++ A V C +
Sbjct: 1 YVISVGLGTPSKTQILEIDTGSSTSWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58
Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
C P+ + S C + Y G + + + LTF + + + FGC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPGFSFGCN 114
Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS---FSYCIGSLHDPDYLHNKLI----LG 227
F N G+ G+G G S++ Q + + FSYC+ +K LG
Sbjct: 115 MDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQMSERGFFSKTTGYFSLG 174
Query: 228 DGAIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSG 282
D + + +++ L AISVDG L ++P+IF R GV+ DSG
Sbjct: 175 GKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRK----GVVFDSG 230
Query: 283 TDVTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFR 339
++++++ A L E+++R + S ++ CY + S D P + HF
Sbjct: 231 SELSYIPDRALSVLSQRIRELLLRRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFD 285
Query: 340 GGAKLALEKDSMFYQ---PRPDAFCMAVNPA 367
GA+ L +F + D +C+A P
Sbjct: 286 DGARFDLGSHGVFVERSVQEQDVWCLAFAPT 316
>gi|115473845|ref|NP_001060521.1| Os07g0658600 [Oryza sativa Japonica Group]
gi|22775625|dbj|BAC15479.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|50510141|dbj|BAD31106.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113612057|dbj|BAF22435.1| Os07g0658600 [Oryza sativa Japonica Group]
Length = 449
Score = 91.3 bits (225), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 104/390 (26%), Positives = 156/390 (40%), Gaps = 31/390 (7%)
Query: 31 ARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCY 90
+RL YL Y + + V +G P A+DT + W+ C
Sbjct: 77 SRLLYLDSLAVKGRAYAPIASGRQLLQTPTYVVRARLGTPAQQLLLAVDTSNDAAWIPCS 136
Query: 91 PCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSV 150
C C + F P+ S+SY VPC S C P CS C ++ Y ++S+
Sbjct: 137 GCAGC--PTSSPFNPAASASYRPVPCGSPQCVLAPNPSCSPNAKSCGFSLSYA---DSSL 191
Query: 151 FVSTEQLTFKNTDESTIHVQDVVFGC-GFSTNRNFKFSGIFGLGIGRSSLVSQLN----S 205
+ Q T + V+ FGC +T G+ GLG G S +SQ +
Sbjct: 192 QAALSQDTLAVAGDV---VKAYTFGCLQRATGTAAPPQGLLGLGRGPLSFLSQTKDMYGA 248
Query: 206 SFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH----YYITLEAISVDGRML 261
+FSYC+ S ++ L LG TPL + H YY+ + I V +++
Sbjct: 249 TFSYCLPSFKSLNF-SGTLRLGRNGQPRRIKTTPL-LANPHRSSLYYVNMTGIRVGKKVV 306
Query: 262 DINPNIFKRDD-SGGGVMIDSGTDVTWLVKEAYEALRDEVMIRL-EGEQMRSYSWPDKLC 319
I + D +G G ++DSGT T LV Y ALRDEV R+ G S C
Sbjct: 307 SIPASALAFDPATGAGTVLDSGTMFTRLVAPVYLALRDEVRRRVGAGAAAVSSLGGFDTC 366
Query: 320 YHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFC--MAVNPASINNRYVNFS 377
Y+ ++ +P V F G E++ + + C MA P +N +
Sbjct: 367 YNTTVA-----WPPVTLLFDGMQVTLPEENVVIHTTYGTTSCLAMAAAPDGVNTV---LN 418
Query: 378 LIGMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
+I M QQ + V +D+ + F R C
Sbjct: 419 VIASMQQQNHRVLFDVPNGRVGFARESCTA 448
>gi|32479948|emb|CAE01594.1| OSJNBa0008A08.2 [Oryza sativa Japonica Group]
gi|38347627|emb|CAE05222.2| OSJNBa0011K22.4 [Oryza sativa Japonica Group]
gi|38567678|emb|CAE75961.1| B1159F04.24 [Oryza sativa Japonica Group]
gi|116309512|emb|CAH66578.1| OSIGBa0137O04.4 [Oryza sativa Indica Group]
Length = 431
Score = 90.9 bits (224), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 95/362 (26%), Positives = 151/362 (41%), Gaps = 37/362 (10%)
Query: 57 SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTI----FYPSRSS-SY 111
L+Y +I IG P V + +DTGS WV+ C+ C + + FY RSS S
Sbjct: 55 GTGLYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSS 114
Query: 112 AYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFK---NTDESTIH 168
V CD C P + RC Y Y G T + T+ L + ++
Sbjct: 115 KEVKCDDTICTSRPPCNMTL---RCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPT 171
Query: 169 VQDVVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDP 217
V FGCG N GI G G + +SQL ++ FS+C+
Sbjct: 172 STSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCL------ 225
Query: 218 DYLHNKLILGDGAIID-EGDATPLQFIDGHYY-ITLEAISVDGRMLDINPNIFKRDDSGG 275
D + I G +++ + TP+ + Y+ + L++I+V G L + NIF +
Sbjct: 226 DSTNGGGIFAIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKT-K 284
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS-YSWPDKLCYHGIMSSDLKGFPTV 334
G IDSG+ + +L + Y L V + M + Y++ C+H + S D K FP +
Sbjct: 285 GTFIDSGSTLVYLPEIIYSELILAVFAKHPDITMGAMYNFQ---CFHFLGSVDDK-FPKI 340
Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
FHF L + + + +C A I+ Y + ++G M V YD+
Sbjct: 341 TFHFENDLTLDVYPYDYLLEYEGNQYCFGFQDAGIHG-YKDMIILGDMVISNKVVVYDME 399
Query: 395 RK 396
++
Sbjct: 400 KQ 401
>gi|356502091|ref|XP_003519855.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 519
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 98/377 (25%), Positives = 158/377 (41%), Gaps = 52/377 (13%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFY---------PSRSSS 110
L Y + IG P V A+DTGS L WV C C C+ T F P+ SS+
Sbjct: 99 LHYTTVQIGTPGVKFMVALDTGSDLFWVPC-DCTRCAASDSTAFASDFDLNVYNPNGSST 157
Query: 111 YAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIH-- 168
V C++ C + ++C C Y Y + ETS + T E H
Sbjct: 158 SKKVTCNNSLCTH--RSQCLGTFSNCPYMVSY-VSAETSTSGILVEDVLHLTQEDNHHDL 214
Query: 169 -VQDVVFGCGFSTNRNF----KFSGIFGLGIGRSSLVSQLN------SSFSYCIGSLHDP 217
+V+FGCG + +F +G+FGLG+ + S+ S L+ SFS C G
Sbjct: 215 VEANVIFGCGQIQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGR---- 270
Query: 218 DYLHNKLILGDGAIIDEGDATPLQFIDGH--YYITLEAISVDGRMLDINPNIFKRDDSGG 275
D + ++ GD D+ D TP H Y IT+ + V ++D+
Sbjct: 271 DGI-GRISFGDKGSFDQ-DETPFNLNPSHPTYNITVTQVRVGTTVIDVEFT--------- 319
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYS-WPDKLCYHGIMSSDLKGFPTV 334
+ DSGT T+LV Y L + +++ + RS S P + CY ++ P+V
Sbjct: 320 -ALFDSGTSFTYLVDPTYTRLTESFHSQVQDRRHRSDSRIPFEYCYDMSPDANTSLIPSV 378
Query: 335 RFHFRGGAKLALEKDSMFYQPRPD-AFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
GG+ A+ + + + +C+AV ++ ++IG Y V +D
Sbjct: 379 SLTMGGGSHFAVYDPIIIISTQSELVYCLAVVKSA------ELNIIGQNFMTGYRVVFDR 432
Query: 394 GRKQKTFQRMDCEVLDD 410
+ +++ DC ++D
Sbjct: 433 EKLVLGWKKFDCYDIED 449
>gi|32526671|dbj|BAC79194.1| chloroplast nucleoid DNA-binding protein -like protein [Oryza
sativa Japonica Group]
Length = 732
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 101/380 (26%), Positives = 160/380 (42%), Gaps = 59/380 (15%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC----SPQLGT----IFYPSRSSSY 111
L Y +++G P V A+DTGS L WV C C C SP G+ ++ P++S++
Sbjct: 98 LHYAVVALGTPNVTFLVALDTGSDLFWVPC-DCLKCAPFQSPNYGSLKFDVYSPAQSTTS 156
Query: 112 AYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD--ESTIHV 169
VPC S C C + + C Y+ YL +S V E + + +D +S I
Sbjct: 157 RKVPCSSNLCDL--QNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKIVT 214
Query: 170 QDVVFGCGFSTNRNFKFS----GIFGLGIGRSSLVSQLNS------SFSYCIGSLHDPDY 219
++FGCG +F S G+ GLG+ S+ S L S SFS C G D
Sbjct: 215 APIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFG-----DD 269
Query: 220 LHNKLILGDGAIIDEGDATPLQFI--DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
H ++ GD D+ + TPL + +Y IT+ I+V + +
Sbjct: 270 GHGRINFGDTGSSDQKE-TPLNVYKQNPYYNITITGITVGSKSISTE----------FSA 318
Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQ-MRSYSWPDKLCYHGIMSSDLKGFPTVRF 336
++DSGT T L Y + ++ + M S P + CY +S++ P V
Sbjct: 319 IVDSGTSFTALSDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCYS--VSANGIVHPNVSL 376
Query: 337 HFRGGAKLALE------KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
+GG+ + D+ F P +C+A+ + + VN LIG V
Sbjct: 377 TAKGGSIFPVNDPIITITDNAF---NPVGYCLAI----MKSEGVN--LIGENFMSGLKVV 427
Query: 391 YDIGRKQKTFQRMDCEVLDD 410
+D R ++ +C D+
Sbjct: 428 FDRERMVLGWKNFNCYNFDE 447
>gi|218194598|gb|EEC77025.1| hypothetical protein OsI_15381 [Oryza sativa Indica Group]
Length = 422
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 95/359 (26%), Positives = 151/359 (42%), Gaps = 37/359 (10%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTI----FYPSRSS-SYAYV 114
L+Y +I IG P V + +DTGS WV+ C+ C + + FY RSS S V
Sbjct: 58 LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEV 117
Query: 115 PCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFK---NTDESTIHVQD 171
CD C P + RC Y Y G T + T+ L + ++
Sbjct: 118 KCDDTICTSRPPCNMTL---RCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTS 174
Query: 172 VVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYL 220
V FGCG N GI G G + +SQL ++ FS+C+ D
Sbjct: 175 VTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCL------DST 228
Query: 221 HNKLILGDGAIID-EGDATPLQFIDGHYY-ITLEAISVDGRMLDINPNIFKRDDSGGGVM 278
+ I G +++ + TP+ + Y+ + L++I+V G L + NIF + G
Sbjct: 229 NGGGIFAIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKT-KGTF 287
Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS-YSWPDKLCYHGIMSSDLKGFPTVRFH 337
IDSG+ + +L + Y L V + M + Y++ C+H + S D K FP + FH
Sbjct: 288 IDSGSTLVYLPEIIYSELILAVFAKHPDITMGAMYNFQ---CFHFLGSVDDK-FPKITFH 343
Query: 338 FRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
F L + + + +C A I+ Y + ++G M V YD+ ++
Sbjct: 344 FENDLTLDVYPYDYLLEYEGNQYCFGFQDAGIHG-YKDMIILGDMVISNKVVVYDMEKQ 401
>gi|297832400|ref|XP_002884082.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297329922|gb|EFH60341.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 513
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 99/372 (26%), Positives = 157/372 (42%), Gaps = 51/372 (13%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGT---------IFYPSRSSS 110
L Y N+++G P A+DTGS L W+ C C +C +L I+ P+ SS+
Sbjct: 103 LHYANVTVGTPSDWFLVALDTGSDLFWLPC-DCTNCVRELKAPGGSSLDLNIYSPNASST 161
Query: 111 YAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQ-LTFKNTDESTIHV 169
VPC+S C RC++ + C Y YL +S V E L + D+S+ +
Sbjct: 162 STKVPCNSTLCTRGD--RCASPESNCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAI 219
Query: 170 -QDVVFGCGFSTNRNF----KFSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPD 218
V GCG F +G+FGLG+ S+ S L +SFS C G+
Sbjct: 220 PARVTLGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDG--- 276
Query: 219 YLHNKLILGDGAIIDEGDATPLQFIDGH--YYITLEAISVDGRMLDINPNIFKRDDSGGG 276
++ GD +D+ + TPL H Y IT+ ISV+G D+ +
Sbjct: 277 --AGRISFGDKGSVDQRE-TPLNIRQPHPTYNITVTKISVEGNTGDLEFD---------- 323
Query: 277 VMIDSGTDVTWLVKEAYEALRDEV-MIRLEGE-QMRSYSWPDKLCYHGIMSSDLKGFPTV 334
+ DSGT T+L AY + + + L+ Q P + CY + D +P V
Sbjct: 324 AVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALSPNKDSFQYPAV 383
Query: 335 RFHFRGGAKLALEKDSMFYQPR-PDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
+GG+ + + + D +C+A+ + + S+IG Y V +D
Sbjct: 384 NLTMKGGSSYPVYHPLVVIPMKDTDVYCLAI------LKIEDISIIGQNFMTGYRVVFDR 437
Query: 394 GRKQKTFQRMDC 405
+ ++ DC
Sbjct: 438 EKLILGWKESDC 449
>gi|116311058|emb|CAH67989.1| OSIGBa0142I02-OSIGBa0101B20.32 [Oryza sativa Indica Group]
Length = 488
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 99/383 (25%), Positives = 160/383 (41%), Gaps = 50/383 (13%)
Query: 58 NSLFYVNISIGQPP---VPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFY-PSRSSSYAY 113
S + V + IG P P++ DTGS L W C PC +CS + PS+S ++
Sbjct: 120 GSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRR 179
Query: 114 VPCDSEHCRYFPYARCSAY------KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI 167
+ C C C+A C++ + Y G S + ++ F +
Sbjct: 180 LSCFDPMCEL-----CTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGG 234
Query: 168 H--VQDVVFGCGFSTN----RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGS------- 213
+ +DV FGC + R + +GI LGIG+ S V+QL FSYCI +
Sbjct: 235 YQLERDVAFGCAHVEDSKAVRGYS-TGILALGIGKPSFVTQLGVDRFSYCIPASEITDDD 293
Query: 214 -LHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVD--GRMLDINP-NIFK 269
D + + L G A + G P + Y + L+++ GR+ P ++
Sbjct: 294 DDDDEERSASFLRFGSHARM-TGKRAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPVYV 352
Query: 270 RDDSGGGVM---IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS 326
+ M +DSGT + WL + L+ + + + + P CY G M +
Sbjct: 353 AGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDLTHPSLYCYLGNM-T 411
Query: 327 DLKGFPTVRFHFRGGAKLALEKDSMFYQPR---PDAFCMAVNPASINNRYVNFSLIGMMA 383
D++ +V F GGA L L S+F+ D C+AV N +++G+
Sbjct: 412 DVEAV-SVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAG-------NRAILGVYP 463
Query: 384 QQFYNVGYDIGRKQKTFQRMDCE 406
Q+ NVGYD+ + F R C+
Sbjct: 464 QRNINVGYDLSTMEIAFDRDQCD 486
>gi|222637611|gb|EEE67743.1| hypothetical protein OsJ_25435 [Oryza sativa Japonica Group]
Length = 396
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 99/360 (27%), Positives = 149/360 (41%), Gaps = 31/360 (8%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ V +G P A+DT + W+ C C C + F P+ S+SY VPC S
Sbjct: 54 YVVRARLGTPAQQLLLAVDTSNDAAWIPCSGCAGC--PTSSPFNPAASASYRPVPCGSPQ 111
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC-GFS 179
C P CS C ++ Y ++S+ + Q T + V+ FGC +
Sbjct: 112 CVLAPNPSCSPNAKSCGFSLSYA---DSSLQAALSQDTLAVAGDV---VKAYTFGCLQRA 165
Query: 180 TNRNFKFSGIFGLGIGRSSLVSQLN----SSFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
T G+ GLG G S +SQ ++FSYC+ S ++ L LG
Sbjct: 166 TGTAAPPQGLLGLGRGPLSFLSQTKDMYGATFSYCLPSFKSLNF-SGTLRLGRNGQPRRI 224
Query: 236 DATPLQFIDGH----YYITLEAISVDGRMLDINPNIFKRDD-SGGGVMIDSGTDVTWLVK 290
TPL + H YY+ + I V +++ I + D +G G ++DSGT T LV
Sbjct: 225 KTTPL-LANPHRSSLYYVNMTGIRVGKKVVSIPASALAFDPATGAGTVLDSGTMFTRLVA 283
Query: 291 EAYEALRDEVMIRL-EGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKD 349
Y ALRDEV R+ G S CY+ ++ +P V F G E++
Sbjct: 284 PVYLALRDEVRRRVGAGAAAVSSLGGFDTCYNTTVA-----WPPVTLLFDGMQVTLPEEN 338
Query: 350 SMFYQPRPDAFC--MAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
+ + C MA P +N ++I M QQ + V +D+ + F R C
Sbjct: 339 VVIHTTYGTTSCLAMAAAPDGVNTV---LNVIASMQQQNHRVLFDVPNGRVGFARESCTA 395
>gi|356570895|ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 470
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 113/450 (25%), Positives = 187/450 (41%), Gaps = 68/450 (15%)
Query: 7 ILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNIS 66
+L+ ++ + +P H V+ + S+ R +L K R++N+ P+ S + ++++
Sbjct: 36 LLTKPHSSDSDPFHSVKLAASSSLTRAHHL--KHRNNNSPSVATTPAYPKSYGGYSIDLN 93
Query: 67 IGQPPVPQFTAMDTGSSLLWVHC---YPCRDCS-----PQLGTIFYPSRSSSYAYVPCDS 118
+G PP +DTGSSL+W C Y C C+ P F P SS+ + C +
Sbjct: 94 LGTPPQTSPFVLDTGSSLVWFPCTSHYLCSHCNFPNIDPTKIPTFIPKNSSTAKLLGCRN 153
Query: 119 EHCRYF----PYARCSAYK----HRC-----IYTQLYLIGPETSVFVSTEQLTFKNTDES 165
C Y +RC K C Y Y +G T+ F+ + L F
Sbjct: 154 PKCGYLFGPDVESRCPQCKKPGSQNCSLTCPSYIIQYGLG-ATAGFLLLDNLNFPGKT-- 210
Query: 166 TIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNK- 223
V + GC + R + SGI G G G+ SL SQ+N FSYC+ S D +
Sbjct: 211 ---VPQFLVGCSILSIR--QPSGIAGFGRGQESLPSQMNLKRFSYCLVSHRFDDTPQSSD 265
Query: 224 LILGDGAIIDEGDA-------TPLQ-------FIDGHYYITLEAISVDGRMLDINPNIFK 269
L+L I GD TP + +YY+TL + V G + I +
Sbjct: 266 LVL---QISSTGDTKTNGLSYTPFRSNPSNNSVFREYYYVTLRKLIVGGVDVKIPYKFLE 322
Query: 270 R-DDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK----LCYHGIM 324
D GG ++DSG+ T++ + Y + E + +L + R + + C++ I
Sbjct: 323 PGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFLRQLGKKYSREENVEAQSGLSPCFN-IS 381
Query: 325 SSDLKGFPTVRFHFRGGAKLALEKDSMF-YQPRPDAFCMAV-------NPASINNRYVNF 376
FP F F+GGAK++ + F + + C V P + +
Sbjct: 382 GVKTISFPEFTFQFKGGAKMSQPLLNYFSFVGDAEVLCFTVVSDGGAGQPKTAGPAII-- 439
Query: 377 SLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
+G QQ + V YD+ ++ F +C+
Sbjct: 440 --LGNYQQQNFYVEYDLENERFGFGPRNCK 467
>gi|215766660|dbj|BAG98888.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 433
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 95/359 (26%), Positives = 151/359 (42%), Gaps = 37/359 (10%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTI----FYPSRSS-SYAYV 114
L+Y +I IG P V + +DTGS WV+ C+ C + + FY RSS S V
Sbjct: 82 LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEV 141
Query: 115 PCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFK---NTDESTIHVQD 171
CD C P + RC Y Y G T + T+ L + ++
Sbjct: 142 KCDDTICTSRPPCNMTL---RCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTS 198
Query: 172 VVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYL 220
V FGCG N GI G G + +SQL ++ FS+C+ D
Sbjct: 199 VTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCL------DST 252
Query: 221 HNKLILGDGAIID-EGDATPLQFIDGHYY-ITLEAISVDGRMLDINPNIFKRDDSGGGVM 278
+ I G +++ + TP+ + Y+ + L++I+V G L + NIF + G
Sbjct: 253 NGGGIFAIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKT-KGTF 311
Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS-YSWPDKLCYHGIMSSDLKGFPTVRFH 337
IDSG+ + +L + Y L V + M + Y++ C+H + S D K FP + FH
Sbjct: 312 IDSGSTLVYLPEIIYSELILAVFAKHPDITMGAMYNFQ---CFHFLGSVDDK-FPKITFH 367
Query: 338 FRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
F L + + + +C A I+ Y + ++G M V YD+ ++
Sbjct: 368 FENDLTLDVYPYDYLLEYEGNQYCFGFQDAGIHG-YKDMIILGDMVISNKVVVYDMEKQ 425
>gi|296086208|emb|CBI31649.3| unnamed protein product [Vitis vinifera]
Length = 761
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 95/387 (24%), Positives = 154/387 (39%), Gaps = 85/387 (21%)
Query: 47 QAQILPSNDIS----------NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCS 96
+ Q+LPS + N V++++G PP +DTGS L W+HC +
Sbjct: 351 KTQVLPSGSVPRPSSKLSFHHNVSLTVSLTVGSPPQTVTMVLDTGSELSWLHC----KKA 406
Query: 97 PQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQ 156
P L ++F P RSSSY+ +PC S CR +++ + LIG Q
Sbjct: 407 PNLHSVFDPLRSSSYSPIPCTSPTCRTRTHSKTTG-----------LIGMNRGSLSFVTQ 455
Query: 157 LTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHD 216
+ + I QD SGI G SSFS+ +
Sbjct: 456 MGLQKFS-YCISGQDS--------------SGILLFG----------ESSFSWLKALKYT 490
Query: 217 PDYLHNKLILGDGAIIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGG 275
P +TPL + D Y + LE I V ML + +++ D +G
Sbjct: 491 PLV---------------QISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGA 535
Query: 276 G-VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD-------KLCYH-GIMSS 326
G M+DSGT T+L+ Y AL++E +R ++ P+ LCY +
Sbjct: 536 GQTMVDSGTQFTFLLGPVYTALKNE-FVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTRR 594
Query: 327 DLKGFPTVRFHFRGGAKLALEKDSMFY------QPRPDAFCMAVNPASINNRYVNFSLIG 380
L PTV FR GA++++ + + Y + +C + + V +IG
Sbjct: 595 TLPPLPTVTLMFR-GAEMSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLG--VESYIIG 651
Query: 381 MMAQQFYNVGYDIGRKQKTFQRMDCEV 407
QQ + +D+ + + F + C++
Sbjct: 652 HHHQQNVWMEFDLAKSRVGFAEVRCDL 678
>gi|218195474|gb|EEC77901.1| hypothetical protein OsI_17222 [Oryza sativa Indica Group]
Length = 467
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 99/383 (25%), Positives = 160/383 (41%), Gaps = 50/383 (13%)
Query: 58 NSLFYVNISIGQPP---VPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFY-PSRSSSYAY 113
S + V + IG P P++ DTGS L W C PC +CS + PS+S ++
Sbjct: 99 GSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRR 158
Query: 114 VPCDSEHCRYFPYARCSAY------KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI 167
+ C C C+A C++ + Y G S + ++ F +
Sbjct: 159 LSCFDPMCEL-----CTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGG 213
Query: 168 H--VQDVVFGCGFSTN----RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGS------- 213
+ +DV FGC + R + +GI LGIG+ S V+QL FSYCI +
Sbjct: 214 YQLERDVAFGCAHVEDSKAVRGYS-TGILALGIGKPSFVTQLGVDRFSYCIPASEITDDD 272
Query: 214 -LHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVD--GRMLDINP-NIFK 269
D + + L G A + G P + Y + L+++ GR+ P ++
Sbjct: 273 DDDDEERSASFLRFGSHARM-TGKRAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPVYV 331
Query: 270 RDDSGGGVM---IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS 326
+ M +DSGT + WL + L+ + + + + P CY G M +
Sbjct: 332 AGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDLTHPSLYCYLGNM-T 390
Query: 327 DLKGFPTVRFHFRGGAKLALEKDSMFYQPR---PDAFCMAVNPASINNRYVNFSLIGMMA 383
D++ +V F GGA L L S+F+ D C+AV N +++G+
Sbjct: 391 DVEAV-SVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAG-------NRAILGVYP 442
Query: 384 QQFYNVGYDIGRKQKTFQRMDCE 406
Q+ NVGYD+ + F R C+
Sbjct: 443 QRNINVGYDLSTMEIAFDRDQCD 465
>gi|449461377|ref|XP_004148418.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449518059|ref|XP_004166061.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 436
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 103/394 (26%), Positives = 165/394 (41%), Gaps = 40/394 (10%)
Query: 31 ARLAYLQEKIRSHNTYQAQILPSNDISNSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHC 89
AR+ YL + + T A I + N YV + +G P + +DT + W C
Sbjct: 65 ARIRYL-SSLTAQKTVAAPIASGQQVLNVGNYVVRVQLGTPGQTMYMVLDTSNDAAWAPC 123
Query: 90 YPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKH-RCIYTQLYLIGPET 148
C CS T F SS++A + C C C + C++ Q Y G ++
Sbjct: 124 SGCIGCSST--TTFSAQNSSTFATLDCSKPECTQARGLSCPTTGNVDCLFNQTY--GGDS 179
Query: 149 SVFVSTEQLTFKNTDESTIH-----VQDVVFGC-GFSTNRNFKFSGIFGLGIGRSSLVSQ 202
+ + Q ++H + + FGC ++ + G+ GLG G SL+SQ
Sbjct: 180 TFSATLVQ--------DSLHLGPNVIPNFSFGCISSASGSSIPPQGLMGLGRGPLSLISQ 231
Query: 203 LNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH----YYITLEAI 254
S FSYC+ S Y L LG TPL + H YY+ L I
Sbjct: 232 SGSLYSGLFSYCLPSFKS-YYFSGSLKLGPVGQPKAIRTTPL-LHNPHRPSLYYVNLTGI 289
Query: 255 SVDGRMLDINPNIFKRD-DSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYS 313
SV ++ I+P + D ++G G +IDSGT +T V Y A+RDE ++ G +
Sbjct: 290 SVGRVLVPISPELLAFDPNTGAGTIIDSGTVITRFVPAIYTAVRDEFRKQVGGSFSPLGA 349
Query: 314 WPDKLCYHGIMSSDLKGFPTVRFHFRG-GAKLALEKDSMFYQPRPDAFCMAVNPASINNR 372
+ + +S+ P + H G KL +E +S+ + C+A+ A+ NN
Sbjct: 350 FDTCFATNNEVSA-----PAITLHLSGLDLKLPME-NSLIHSSAGSLACLAMA-AAPNNV 402
Query: 373 YVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
++I + QQ + + +DI + R C
Sbjct: 403 NSVVNVIANLQQQNHRILFDINNSKLGIARELCN 436
>gi|88174579|gb|ABD39364.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
gi|88174585|gb|ABD39367.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
gi|88174595|gb|ABD39372.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174599|gb|ABD39374.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174607|gb|ABD39378.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 83/329 (25%), Positives = 140/329 (42%), Gaps = 37/329 (11%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ +++ +G P Q +DTGSS WV C C C T F SRS++ A V C +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58
Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
C P+ + S C + Y G + + + LTF + + + FGC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPGFSFGCN 114
Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS---FSYCIGSLHDPDYLHNKLI--LGDG 229
F N G+ G+G G S++ Q + + FSYC+ +K G
Sbjct: 115 MDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDCFSYCLPLQKSERGFFSKTTGYFSLG 174
Query: 230 AIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
+ D + + +++ L AISVDG L ++P++F R GV+ DSG++
Sbjct: 175 KVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRK----GVVFDSGSE 230
Query: 285 VTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
++++ A L E++++ + S ++ CY + S D P + HF G
Sbjct: 231 LSYIPDRALSVLSQRIRELLLKRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFDDG 285
Query: 342 AKLALEKDSMFYQ---PRPDAFCMAVNPA 367
A+ L +F + D +C+A P
Sbjct: 286 ARFDLGSHGVFVERSVQEQDVWCLAFAPT 314
>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
gi|194700872|gb|ACF84520.1| unknown [Zea mays]
Length = 351
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 99/359 (27%), Positives = 150/359 (41%), Gaps = 55/359 (15%)
Query: 70 PPVPQFTAMDTGSSLLWVHCYPCR--DCSPQLGTIFYPSRSSSYAYVPCDSEHCRYF-PY 126
P V Q +D+ S + WV C PC C PQ+ + + PSRS + A C S C PY
Sbjct: 25 PGVIQTVVLDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPTSAAFSCSSPTCTALGPY 84
Query: 127 ARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF-- 184
A A ++C Y Y G TS + LT + V FGC + +F
Sbjct: 85 ANGCA-NNQCQYLVRYPDGSSTSGAYIADLLTLDAGNA----VSGFKFGCSHAEQGSFDA 139
Query: 185 KFSGIFGLGIGRSSLVSQLNS----SFSYCIG---------SLHDPDYLHNKLILGDGAI 231
+ +GI LG G SL+SQ S +FSYCI +L P ++ ++
Sbjct: 140 RAAGIMALGGGPESLLSQTASRYGNAFSYCIPATASDSGFFTLGVPRRASSRYVV----- 194
Query: 232 IDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
TP+ + Y + L I+V G+ L + P +F G ++DS T +T L
Sbjct: 195 ------TPMVRFRQAATFYGVLLRTITVGGQRLGVAPAVFA-----AGSVLDSRTAITRL 243
Query: 289 VKEAYEALRDEVMIRLEGEQMRSYSWPDKL--CYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
AY+ALR R RS L CY +++ P + F A L L
Sbjct: 244 PPTAYQALR--AAFRSSMTMYRSAPPKGYLDTCYDFTGVVNIR-LPKISLVFDRNAVLPL 300
Query: 347 EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ + + C+A ++ ++R ++G + QQ V YD+G F++ C
Sbjct: 301 DPSGILFND-----CLAFT-SNADDRMPG--VLGSVQQQTIEVLYDVGGGAVGFRQGAC 351
>gi|356559244|ref|XP_003547910.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 515
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 97/377 (25%), Positives = 157/377 (41%), Gaps = 52/377 (13%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFY---------PSRSSS 110
L Y + IG P V A+DTGS L WV C C C+ + F P+ SS+
Sbjct: 95 LHYTTVQIGTPGVKFMVALDTGSDLFWVPC-DCTRCAATDSSAFASDFDLNVYNPNGSST 153
Query: 111 YAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIH-- 168
V C++ C + ++C C Y Y + ETS + T E H
Sbjct: 154 SKKVTCNNSLCMH--RSQCLGTLSNCPYMVSY-VSAETSTSGILVEDVLHLTQEDNHHDL 210
Query: 169 -VQDVVFGCGFSTNRNF----KFSGIFGLGIGRSSLVSQLN------SSFSYCIGSLHDP 217
+V+FGCG + +F +G+FGLG+ + S+ S L+ SFS C G
Sbjct: 211 VEANVIFGCGQIQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGR---- 266
Query: 218 DYLHNKLILGDGAIIDEGDATPLQFIDGH--YYITLEAISVDGRMLDINPNIFKRDDSGG 275
D + ++ GD D+ D TP H Y IT+ + V ++D+
Sbjct: 267 DGI-GRISFGDKGSFDQ-DETPFNLNPSHPTYNITVTQVRVGTTLIDVEFT--------- 315
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYS-WPDKLCYHGIMSSDLKGFPTV 334
+ DSGT T+LV Y L + +++ + RS S P + CY ++ P+V
Sbjct: 316 -ALFDSGTSFTYLVDPTYTRLTESFHSQVQDRRHRSDSRIPFEYCYDMSPDANTSLIPSV 374
Query: 335 RFHFRGGAKLALEKDSMFYQPRPD-AFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
GG+ A+ + + + +C+AV + ++IG Y V +D
Sbjct: 375 SLTMGGGSHFAVYDPIIIISTQSELVYCLAV------VKTAELNIIGQNFMTGYRVVFDR 428
Query: 394 GRKQKTFQRMDCEVLDD 410
+ +++ DC ++D
Sbjct: 429 EKLVLGWKKFDCYDIED 445
>gi|414587000|tpg|DAA37571.1| TPA: hypothetical protein ZEAMMB73_036171 [Zea mays]
Length = 459
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 109/389 (28%), Positives = 157/389 (40%), Gaps = 63/389 (16%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ V + G P A+DT S L+W+ C PC C QL +F P SSSYA VPC S+
Sbjct: 92 YLVKLGTGTPQHFFSAAIDTASDLVWMQCQPCVSCYRQLDPVFNPKLSSSYAVVPCTSDT 151
Query: 121 CRYFPYARCSAYKH-RCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
C RC C YT Y T ++ ++L H VVFGC S
Sbjct: 152 CAQLDGHRCHEDDDGACQYTYKYSGHGVTKGTLAIDKLAIGG---DVFHA--VVFGCSDS 206
Query: 180 T--NRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDP-DYLHNKLILGDG--AIID 233
+ + SG+ GLG G SLVSQL+ F YC L P KL+LG G A+ +
Sbjct: 207 SVGGPAAQASGLVGLGRGPLSLVSQLSVHRFMYC---LPPPMSRTSGKLVLGAGADAVRN 263
Query: 234 EGDATPLQFID-----GHYYITLEAISVDGRMLDINPNIFKRDDS--------------- 273
D + +YY+ L+ ++V D P + S
Sbjct: 264 MSDRVTVTMSSSTRYPSYYYLNLDGLAVG----DQTPGTTRNATSPPSGGAGGGGGGGGG 319
Query: 274 ---------GGGVMIDSGTDVTWLVKEAYEALRD--EVMIRLEGEQMRSYSWPDKLCY-- 320
G+++D + +++L Y+ L D E IRL S LC+
Sbjct: 320 GIVGAGGANAYGMIVDVASTISFLETSLYDELADDLEEEIRLP-RATPSLRLGLDLCFIL 378
Query: 321 -HGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLI 379
G+ D PTV F G L L++D +F D M + + R S++
Sbjct: 379 PEGV-GMDRVYVPTVSLSFD-GRWLELDRDRLFVT---DGRMMCL----MIGRTSGVSIL 429
Query: 380 GMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
G Q V +++ R + TF + C+ L
Sbjct: 430 GNFQLQNMRVLFNLRRGKITFAKASCDSL 458
>gi|222629809|gb|EEE61941.1| hypothetical protein OsJ_16693 [Oryza sativa Japonica Group]
Length = 648
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 101/408 (24%), Positives = 167/408 (40%), Gaps = 72/408 (17%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHC---YPCRDCSPQLGT----IFYPSRSSSYAY 113
+ +S+G PP P +DTGS L WV C Y CR+CS +F+P SSS
Sbjct: 89 YAFTVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNCSSLSAASPLHVFHPKNSSSSRL 148
Query: 114 VPCDSEHCRYF---------------PYARCSAYKHRC-----IYTQLYLIGPETSVFVS 153
+ C + C + P A C+ Y +Y G + +S
Sbjct: 149 IGCRNPSCLWIHSPDHLSDCRAASSCPGANCTPRNANANNVCPPYLVVYGSGSTAGLLIS 208
Query: 154 TEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIG 212
T + + V++ V GC ++ SG+ G G G S+ SQL + FSYC+
Sbjct: 209 D---TLRTPGRA---VRNFVIGCSLASVHQPP-SGLAGFGRGAPSVPSQLGLTKFSYCLL 261
Query: 213 S--LHDPDYLHNKLILGD------------GAIIDEGDATPLQFIDGHYYITLEAISVDG 258
S D + +LILG + A P + +YY+ L AI+V G
Sbjct: 262 SRRFDDNAAVSGELILGGAGGKDGGVGMQYAPLARSASARPPYSV--YYYLALTAITVGG 319
Query: 259 RMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL 318
+ + + F +GGG ++DSGT ++ + +E + V+ + G RS + L
Sbjct: 320 KSVQLPERAFVAGGAGGGAIVDSGTTFSYFDRTVFEPVAAAVVAAVGGRYSRSKVVEEGL 379
Query: 319 ----CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP----------DAFCMAV 364
C+ + P + HF+GG+ + L ++ F P +A C+AV
Sbjct: 380 GLSPCFAMPPGTKTMELPEMSLHFKGGSVMNLPVENYFVVAGPAPSGGAPAMAEAICLAV 439
Query: 365 -------NPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ + + ++G QQ Y + YD+ +++ F+R C
Sbjct: 440 VSDVPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLEKERLGFRRQQC 487
>gi|125548492|gb|EAY94314.1| hypothetical protein OsI_16081 [Oryza sativa Indica Group]
Length = 417
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 89/324 (27%), Positives = 137/324 (42%), Gaps = 49/324 (15%)
Query: 22 VQRGINISIARLAYLQEKIRSHNTYQAQILPSNDI--SNSLFYVNISIGQPPVPQFTAMD 79
++R I S RLA + + + ++ I + + V + IG PP A+D
Sbjct: 48 LRRAIQRSRYRLAGIGMARGEAASARKAVVAETPIMPAGGEYLVKLGIGTPPYKFTAAID 107
Query: 80 TGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCS-AYKHRCIY 138
T S L+W C PC C Q+ +F P SS+YA +PC S+ C RC C Y
Sbjct: 108 TASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHDDDESCQY 167
Query: 139 TQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF---KFSGIFGLGIG 195
T Y T ++ ++L + V FGC S+ + SG+ GLG G
Sbjct: 168 TYTYSGNATTEGTLAVDKLVIGED-----AFRGVAFGCSTSSTGGAPPPQASGVVGLGRG 222
Query: 196 RSSLVSQLN-SSFSYCIGSLHDP-DYLHNKLILGDGAIIDEGD----ATPLQF---IDGH 246
SLVSQL+ F+YC L P + KL+LG A A P++ +
Sbjct: 223 PLSLVSQLSVRRFAYC---LPPPASRIPGKLVLGADADAARNATNRIAVPMRRDPRYPSY 279
Query: 247 YYITLEAISVDGRMLDI---------------------NPN---IFKRDDSGGGVMIDSG 282
YY+ L+ + + R + + +PN + D + G++ID
Sbjct: 280 YYLNLDGLLIGDRTMSLPPTTTTTATATATATAPAPTPSPNATAVAVGDANRYGMIIDIA 339
Query: 283 TDVTWLVKEAYEALRD--EVMIRL 304
+ +T+L Y+ L + EV IRL
Sbjct: 340 STITFLEASLYDELVNDLEVEIRL 363
>gi|449434470|ref|XP_004135019.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
gi|449517144|ref|XP_004165606.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 508
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 100/375 (26%), Positives = 164/375 (43%), Gaps = 51/375 (13%)
Query: 59 SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQL-----GTIF---YPSRSSS 110
+L+Y N+SIG P + A+DTGS L W+ C C C L G + Y S +SS
Sbjct: 102 NLYYANVSIGTPGLYFLVALDTGSDLFWLPC-ECTKCPTYLTKRDNGKFWLNHYSSNASS 160
Query: 111 YAY-VPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHV 169
+ VPC S C +CS+ K C Y YL +S + + TD+S +
Sbjct: 161 TSIRVPCSSSLCEL--ANQCSSNKSSCPYQTHYLSENSSSAGYLVQDILHMATDDSQLKP 218
Query: 170 QDVVFGCGFSTNRNFKFS------GIFGLGIGRSSLVSQLNS------SFSYCIGSLHDP 217
DV G + KFS G+ GLG+G+ S+ S L S SFS C G
Sbjct: 219 VDVKVTLGCGKVQTGKFSNVTAPNGLIGLGMGKVSVPSFLASQGLTTDSFSMCFGY---- 274
Query: 218 DYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
Y + ++ GD + + + TP Y +T+ I V R +++
Sbjct: 275 -YGYGRIDFGDIGPVGQRE-TPFNPASLSYNVTILQIIVTNRPTNVHLT----------A 322
Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYS-WPDKLCYHGIMSSDLKGFPTVRF 336
+IDSG T+L Y + + + +E E+++S S +P + CY +++ + P + F
Sbjct: 323 IIDSGASFTYLTDPFYSIITENMDAAMELERIKSDSDFPFEYCYRLSLATIFQQ-PNLNF 381
Query: 337 HFRGGAKLALEKD--SMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
GG K + S+ P A C+A+ ++ + ++IG Y V ++
Sbjct: 382 TMEGGRKFDVITSYVSVDTDDGP-ALCLAIVKST------DINVIGHNFFGGYRVVFNRE 434
Query: 395 RKQKTFQRMDCEVLD 409
+ ++ +DC+ D
Sbjct: 435 KMTLGWKEVDCDSYD 449
>gi|115460260|ref|NP_001053730.1| Os04g0595000 [Oryza sativa Japonica Group]
gi|113565301|dbj|BAF15644.1| Os04g0595000, partial [Oryza sativa Japonica Group]
Length = 471
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 99/385 (25%), Positives = 160/385 (41%), Gaps = 52/385 (13%)
Query: 58 NSLFYVNISIGQPP---VPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFY-PSRSSSYAY 113
S + V + IG P P++ DTGS L W C PC +CS + PS+S ++
Sbjct: 101 GSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRR 160
Query: 114 VPCDSEHCRYFPYARCSAY------KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI 167
+ C C C+A C++ + Y G S + ++ F +
Sbjct: 161 LSCFDPMCEL-----CTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGG 215
Query: 168 H--VQDVVFGCGFSTN----RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGS------- 213
+ +DV FGC + R + +GI LGIG+ S V+QL FSYCI +
Sbjct: 216 YQLERDVAFGCAHVEDSKAVRGYS-TGILALGIGKPSFVTQLGVDRFSYCIPASEITDDD 274
Query: 214 ---LHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVD--GRMLDINP-NI 267
D + + L G A + G P + Y + L+++ GR+ P +
Sbjct: 275 DDDDDDEERSASFLRFGSHARM-TGKRAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPV 333
Query: 268 FKRDDSGGGVM---IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIM 324
+ + M +DSGT + WL + L+ + + + + P CY G M
Sbjct: 334 YVAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDLTHPSLYCYLGNM 393
Query: 325 SSDLKGFPTVRFHFRGGAKLALEKDSMFYQPR---PDAFCMAVNPASINNRYVNFSLIGM 381
+D++ +V F GGA L L S+F+ D C+AV N +++G+
Sbjct: 394 -TDVEAV-SVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAG-------NRAILGV 444
Query: 382 MAQQFYNVGYDIGRKQKTFQRMDCE 406
Q+ NVGYD+ + F R C+
Sbjct: 445 YPQRNINVGYDLSTMEIAFDRDQCD 469
>gi|226509408|ref|NP_001141440.1| uncharacterized protein LOC100273550 precursor [Zea mays]
gi|194704586|gb|ACF86377.1| unknown [Zea mays]
gi|413938617|gb|AFW73168.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 478
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 108/370 (29%), Positives = 152/370 (41%), Gaps = 51/370 (13%)
Query: 55 DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR---DCSPQLGTIFYPSRSSSY 111
DI + V S+G P V Q +DTGS L WV C PC C Q +F P++SSSY
Sbjct: 134 DIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSY 193
Query: 112 AYVPCDSEHCRYFP-YARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
A VPC C YA + +C Y Y G T+ S++ LT + VQ
Sbjct: 194 AAVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASSA----VQ 249
Query: 171 DVVFGCGFSTNRNFK-FSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLI 225
FGCG + + F G+ GLG + SLV Q + FSYC L L
Sbjct: 250 GFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYC---LPTKPSTAGYLT 306
Query: 226 LGDGAIIDEGDA------TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
LG G P +Y + L ISV G+ L + + F ++
Sbjct: 307 LGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGG-----TVV 361
Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS--DLKGF-----P 332
D+GT VT L AY ALR M SY +P +GI+ + + G+ P
Sbjct: 362 DTGTVVTRLPPTAYAALRSAFR-----SGMASYGYPTAPS-NGILDTCYNFAGYGTVTLP 415
Query: 333 TVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNV--- 389
V F GA + L D + C+A P+ + +++G + Q+ + V
Sbjct: 416 NVALTFGSGATVTLGADGIL-----SFGCLAFAPSGSDG---GMAILGNVQQRSFEVRID 467
Query: 390 GYDIGRKQKT 399
G +G K +
Sbjct: 468 GTSVGFKPSS 477
>gi|32489096|emb|CAE03928.1| OSJNba0093F12.2 [Oryza sativa Japonica Group]
gi|58532027|emb|CAD41565.3| OSJNBa0006A01.20 [Oryza sativa Japonica Group]
Length = 489
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 99/385 (25%), Positives = 160/385 (41%), Gaps = 52/385 (13%)
Query: 58 NSLFYVNISIGQPP---VPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFY-PSRSSSYAY 113
S + V + IG P P++ DTGS L W C PC +CS + PS+S ++
Sbjct: 119 GSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRR 178
Query: 114 VPCDSEHCRYFPYARCSAY------KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI 167
+ C C C+A C++ + Y G S + ++ F +
Sbjct: 179 LSCFDPMCEL-----CTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGG 233
Query: 168 H--VQDVVFGCGFSTN----RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGS------- 213
+ +DV FGC + R + +GI LGIG+ S V+QL FSYCI +
Sbjct: 234 YQLERDVAFGCAHVEDSKAVRGYS-TGILALGIGKPSFVTQLGVDRFSYCIPASEITDDD 292
Query: 214 ---LHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVD--GRMLDINP-NI 267
D + + L G A + G P + Y + L+++ GR+ P +
Sbjct: 293 DDDDDDEERSASFLRFGSHARM-TGKRAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPV 351
Query: 268 FKRDDSGGGVM---IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIM 324
+ + M +DSGT + WL + L+ + + + + P CY G M
Sbjct: 352 YVAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDLTHPSLYCYLGNM 411
Query: 325 SSDLKGFPTVRFHFRGGAKLALEKDSMFYQPR---PDAFCMAVNPASINNRYVNFSLIGM 381
+D++ +V F GGA L L S+F+ D C+AV N +++G+
Sbjct: 412 -TDVEAV-SVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAG-------NRAILGV 462
Query: 382 MAQQFYNVGYDIGRKQKTFQRMDCE 406
Q+ NVGYD+ + F R C+
Sbjct: 463 YPQRNINVGYDLSTMEIAFDRDQCD 487
>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 80/258 (31%), Positives = 113/258 (43%), Gaps = 22/258 (8%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
N V++++G PP +DTGS L W+ C P + F P SS++A VPC
Sbjct: 82 NVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCAPAGARNKFSAMSFRPRASSTFAAVPCA 141
Query: 118 SEHCRY----FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
S CR P A C RC + Y G + ++T+ +
Sbjct: 142 SAQCRSRDLPSPPA-CDGASSRCSVSLSYADGSSSDGALATDVFAVGSGPP-----LRAA 195
Query: 174 FGC---GF-STNRNFKFSGIFGLGIGRSSLVSQLNS-SFSYCIGSLHDPDYL---HNKL- 224
FGC F S+ +G+ G+ G S VSQ ++ FSYCI D L H+ L
Sbjct: 196 FGCMSSAFDSSPDGVASAGLLGMNRGALSFVSQASTRRFSYCISDRDDAGVLLLGHSDLP 255
Query: 225 -ILGDGAIIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMIDS 281
L A PL + D Y + L I V G+ L I ++ D +G G M+DS
Sbjct: 256 TFLPLNYTPMYQPALPLPYFDRVAYSVQLLGIRVGGKHLPIPASVLAPDHTGAGQTMVDS 315
Query: 282 GTDVTWLVKEAYEALRDE 299
GT T+L+ +AY AL+ E
Sbjct: 316 GTQFTFLLGDAYSALKAE 333
>gi|90399145|emb|CAJ86169.1| H0913C04.10 [Oryza sativa Indica Group]
gi|125550292|gb|EAY96114.1| hypothetical protein OsI_17992 [Oryza sativa Indica Group]
Length = 491
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 101/408 (24%), Positives = 167/408 (40%), Gaps = 72/408 (17%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHC---YPCRDCSPQLGT----IFYPSRSSSYAY 113
+ +S+G PP P +DTGS L WV C Y CR+CS +F+P SSS
Sbjct: 89 YAFTVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNCSSLSAASPLHVFHPKNSSSSRL 148
Query: 114 VPCDSEHCRYF---------------PYARCSAYKHRC-----IYTQLYLIGPETSVFVS 153
+ C + C + P A C+ Y +Y G + +S
Sbjct: 149 IGCRNPSCLWIHSPDHLSDCRAASSCPGANCTPRNANANNVCPPYLVVYGSGSTAGLLIS 208
Query: 154 TEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIG 212
T + + V++ V GC ++ SG+ G G G S+ SQL + FSYC+
Sbjct: 209 D---TLRTPGRA---VRNFVIGCSLASVHQPP-SGLAGFGRGAPSVPSQLGLTKFSYCLL 261
Query: 213 S--LHDPDYLHNKLILGD------------GAIIDEGDATPLQFIDGHYYITLEAISVDG 258
S D + +LILG + A P + +YY+ L AI+V G
Sbjct: 262 SRRFDDNAAVSGELILGGAGGKDGGVGMQYAPLARSASARPPYSV--YYYLALTAITVGG 319
Query: 259 RMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL 318
+ + + F +GGG ++DSGT ++ + +E + V+ + G RS + L
Sbjct: 320 KSVQLPERAFVAGGAGGGAIVDSGTTFSYFDRTVFEPVAAAVVAAVGGRYSRSKVVEEGL 379
Query: 319 ----CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP----------DAFCMAV 364
C+ + P + HF+GG+ + L ++ F P +A C+AV
Sbjct: 380 GLSPCFAMPPGTKTMELPEMSLHFKGGSVMNLPVENYFVVAGPAPSGGAPAMAEAICLAV 439
Query: 365 -------NPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+ + + ++G QQ Y + YD+ +++ F+R C
Sbjct: 440 VSDVPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLEKERLGFRRQQC 487
>gi|413938607|gb|AFW73158.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 478
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 108/370 (29%), Positives = 152/370 (41%), Gaps = 51/370 (13%)
Query: 55 DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR---DCSPQLGTIFYPSRSSSY 111
DI + V S+G P V Q +DTGS L WV C PC C Q +F P++SSSY
Sbjct: 134 DIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFDPAQSSSY 193
Query: 112 AYVPCDSEHCRYFP-YARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
A VPC C YA + +C Y Y G T+ S++ LT + VQ
Sbjct: 194 AAVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASSA----VQ 249
Query: 171 DVVFGCGFSTNRNFK-FSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLI 225
FGCG + + F G+ GLG + SLV Q + FSYC L L
Sbjct: 250 GFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYC---LPTKPSTAGYLT 306
Query: 226 LGDGAIIDEGDA------TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
LG G P +Y + L ISV G+ L + + F ++
Sbjct: 307 LGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGG-----TVV 361
Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS--DLKGF-----P 332
D+GT VT L AY ALR M SY +P +GI+ + + G+ P
Sbjct: 362 DTGTVVTRLPPTAYAALRSAFR-----SGMASYGYPTAPS-NGILDTCYNFAGYGTVTLP 415
Query: 333 TVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNV--- 389
V F GA + L D + C+A P+ + +++G + Q+ + V
Sbjct: 416 NVALTFGSGATVTLGADGIL-----SFGCLAFAPSGSDG---GMAILGNVQQRSFEVRID 467
Query: 390 GYDIGRKQKT 399
G +G K +
Sbjct: 468 GTSVGFKPSS 477
>gi|125575541|gb|EAZ16825.1| hypothetical protein OsJ_32297 [Oryza sativa Japonica Group]
Length = 416
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 94/366 (25%), Positives = 156/366 (42%), Gaps = 54/366 (14%)
Query: 63 VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR 122
N +IG PP P +D PC +P+ SS++ PC ++ C+
Sbjct: 69 ANFTIGTPPQPASAIIDVAGP------APCS----------FPNASSTFRPEPCGTDACK 112
Query: 123 YFPYARCSAYKHRCIY--TQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
P + CS+ + C Y T +G T V+T+ S + FGC ++
Sbjct: 113 SIPTSNCSS--NMCTYEGTINSKLGGHTLGIVATDTFAIGTATAS------LGFGCVVAS 164
Query: 181 NRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGA-IIDEGD 236
+ SG+ GLG SSLVSQ+N + FSYC+ + HD +++L+LG A + G+
Sbjct: 165 GIDTMGGPSGLIGLGRAPSSLVSQMNITKFSYCL-TPHDSGK-NSRLLLGSSAKLAGGGN 222
Query: 237 ATPLQFIDG--------HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
+T F+ +Y I L+ I + + P SG V++ + +++L
Sbjct: 223 STTTPFVKTSPGDDMSQYYPIQLDGIKAGDAAIALPP-------SGNTVLVQTLAPMSFL 275
Query: 289 VKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHF-RGGAKLALE 347
V AY+AL+ EV + + P LC+ S+ P + F F +G A L +
Sbjct: 276 VDSAYQALKKEVTKAVGAAPTATPLQPFDLCFPKAGLSNASA-PDLVFTFQQGAAALTVP 334
Query: 348 KDSMFYQ--PRPDAFCMAVNPASINNRYV---NFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
CMA+ S N N +++G + Q+ + D+ +K +F+
Sbjct: 335 PPKYLIDVGEEKGTVCMAILSTSWLNTTALDENLNILGSLQQENTHFLLDLEKKTLSFEP 394
Query: 403 MDCEVL 408
DC L
Sbjct: 395 ADCAHL 400
>gi|356539555|ref|XP_003538263.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 438
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 95/363 (26%), Positives = 141/363 (38%), Gaps = 31/363 (8%)
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
I + + V IG PP A+DT + W+ C C C+ T+F P +S+++ V
Sbjct: 92 IQSPTYIVRAKIGTPPQTLLLAIDTSNDAAWIPCTACDGCT---STLFAPEKSTTFKNVS 148
Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
C S C P C C + Y +S+ + Q T +T + FG
Sbjct: 149 CGSPECNKVPSPSCGT--SACTFNLTY---GSSSIAANVVQDTVT---LATDPIPGYTFG 200
Query: 176 C-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGA 230
C G ST G S + S+FSYC+ S ++ L LG A
Sbjct: 201 CVAKTTGPSTPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNF-SGSLRLGPVA 259
Query: 231 IIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVT 286
TPL YY+ L AI V +++DI P +G G + DSGT T
Sbjct: 260 QPIRIKYTPLLKNPRRSSLYYVNLFAIRVGRKIVDIPPAALAFNAATGAGTVFDSGTVFT 319
Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPD----KLCYHGIMSSDLKGFPTVRFHFRGGA 342
LV Y A+RDE R+ + + CY + + PT+ F F G
Sbjct: 320 RLVAPVYTAVRDEFRRRVAMAAKANLTVTSLGGFDTCYTVPIVA-----PTITFMFSGMN 374
Query: 343 KLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
+ + + + C+A+ A N V ++I M QQ + V YD+ + R
Sbjct: 375 VTLPQDNILIHSTAGSTSCLAMASAPDNVNSV-LNVIANMQQQNHRVLYDVPNSRLGVAR 433
Query: 403 MDC 405
C
Sbjct: 434 ELC 436
>gi|222629462|gb|EEE61594.1| hypothetical protein OsJ_16002 [Oryza sativa Japonica Group]
Length = 468
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 99/385 (25%), Positives = 160/385 (41%), Gaps = 52/385 (13%)
Query: 58 NSLFYVNISIGQPP---VPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFY-PSRSSSYAY 113
S + V + IG P P++ DTGS L W C PC +CS + PS+S ++
Sbjct: 98 GSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRR 157
Query: 114 VPCDSEHCRYFPYARCSAY------KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI 167
+ C C C+A C++ + Y G S + ++ F +
Sbjct: 158 LSCFDPMCEL-----CTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGG 212
Query: 168 H--VQDVVFGCGFSTN----RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGS------- 213
+ +DV FGC + R + +GI LGIG+ S V+QL FSYCI +
Sbjct: 213 YQLERDVAFGCAHVEDSKAVRGYS-TGILALGIGKPSFVTQLGVDRFSYCIPASEITDDD 271
Query: 214 ---LHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVD--GRMLDINP-NI 267
D + + L G A + G P + Y + L+++ GR+ P +
Sbjct: 272 DDDDDDEERSASFLRFGSHARM-TGKRAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPV 330
Query: 268 FKRDDSGGGVM---IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIM 324
+ + M +DSGT + WL + L+ + + + + P CY G M
Sbjct: 331 YVAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDLTHPSLYCYLGNM 390
Query: 325 SSDLKGFPTVRFHFRGGAKLALEKDSMFYQPR---PDAFCMAVNPASINNRYVNFSLIGM 381
+D++ +V F GGA L L S+F+ D C+AV N +++G+
Sbjct: 391 -TDVEAV-SVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAG-------NRAILGV 441
Query: 382 MAQQFYNVGYDIGRKQKTFQRMDCE 406
Q+ NVGYD+ + F R C+
Sbjct: 442 YPQRNINVGYDLSTMEIAFDRDQCD 466
>gi|222642011|gb|EEE70143.1| hypothetical protein OsJ_30189 [Oryza sativa Japonica Group]
Length = 671
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 94/335 (28%), Positives = 144/335 (42%), Gaps = 55/335 (16%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC----SPQLGT----IFYPSRSSSY 111
L Y +++G P V A+DTGS L WV C C C SP G+ ++ P++S++
Sbjct: 34 LHYAVVALGTPNVTFLVALDTGSDLFWVPC-DCLKCAPFQSPNYGSLKFDVYSPAQSTTS 92
Query: 112 AYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD--ESTIHV 169
VPC S C C + + C Y+ YL +S V E + + +D +S I
Sbjct: 93 RKVPCSSNLCDL--QNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKIVT 150
Query: 170 QDVVFGCGFSTNRNFKFS----GIFGLGIGRSSLVSQLNS------SFSYCIGSLHDPDY 219
++FGCG +F S G+ GLG+ S+ S L S SFS C G D
Sbjct: 151 APIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFG-----DD 205
Query: 220 LHNKLILGDGAIIDEGDATPLQFI--DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
H ++ GD D+ + TPL + +Y IT+ I+V + + F
Sbjct: 206 GHGRINFGDTGSSDQKE-TPLNVYKQNPYYNITITGITVGSKSISTE---FS-------A 254
Query: 278 MIDSGTDVTWLVKEAYEALRD--EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
++DSGT T L Y + + IR M S P + CY +S++ P V
Sbjct: 255 IVDSGTSFTALSDPMYTQITSSFDAQIR-SSRNMLDSSMPFEFCYS--VSANGIVHPNVS 311
Query: 336 FHFRGGAKLALE------KDSMFYQPRPDAFCMAV 364
+GG+ + D+ F P +C+A+
Sbjct: 312 LTAKGGSIFPVNDPIITITDNAF---NPVGYCLAI 343
>gi|108707839|gb|ABF95634.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 330
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 84/302 (27%), Positives = 132/302 (43%), Gaps = 43/302 (14%)
Query: 116 CDSEHCRYFPYARCSAYK----HRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
CDS C+ A C K C+YT Y T+ + ++ TF + V
Sbjct: 38 CDSTLCQGLLVASCGNTKFWPNQTCVYTYYYNDKSVTTGLIEVDKFTFG----AGASVPG 93
Query: 172 VVFGCGFSTNRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLIL-- 226
V FGCG N FK +GI G G G SL SQL +FS+C +++ L +L
Sbjct: 94 VAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVNG---LKQSTVLLD 150
Query: 227 --------GDGAIIDEGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSGG 275
G GA+ +TPL + YY++L+ I+V L + + F + G
Sbjct: 151 LPADLYKNGRGAV----QSTPLIQNSANPTFYYLSLKGITVGSTRLPVPESAFALTNGTG 206
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
G +IDSGT +T L + Y+ +RDE +++ + + C+ S P +
Sbjct: 207 GTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGPYTCFSA-PSQAKPDVPKLV 265
Query: 336 FHFRGGAKLALEKDSMFYQPRPDA----FCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
HF GA + L +++ ++ DA C+A+N ++IG QQ +V Y
Sbjct: 266 LHFE-GATMDLPRENYVFEVPDDAGNSIICLAINKGD------ETTIIGNFQQQNMHVLY 318
Query: 392 DI 393
D+
Sbjct: 319 DL 320
>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
Length = 442
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 81/259 (31%), Positives = 114/259 (44%), Gaps = 23/259 (8%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTI--FYPSRSSSYAYVP 115
N V++++G PP +DTGS L W+ C P + F P S ++A VP
Sbjct: 63 NVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASVP 122
Query: 116 CDSEHCRY----FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
CDS CR P A C +C + Y G + ++TE T
Sbjct: 123 CDSAQCRSRDLPSPPA-CDGASKQCRVSLSYADGSSSDGALATEVFTVGQGPP-----LR 176
Query: 172 VVFGC---GFSTNRN-FKFSGIFGLGIGRSSLVSQLNS-SFSYCIGSLHDPDYL---HNK 223
FGC F T+ + +G+ G+ G S VSQ ++ FSYCI D L H+
Sbjct: 177 AAFGCMATAFDTSPDGVATAGLLGMNRGALSFVSQASTRRFSYCISDRDDAGVLLLGHSD 236
Query: 224 L-ILGDGAIIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMID 280
L L A PL + D Y + L I V G+ L I ++ D +G G M+D
Sbjct: 237 LPFLPLNYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTMVD 296
Query: 281 SGTDVTWLVKEAYEALRDE 299
SGT T+L+ +AY AL+ E
Sbjct: 297 SGTQFTFLLGDAYSALKAE 315
>gi|388498308|gb|AFK37220.1| unknown [Lotus japonicus]
Length = 363
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 70/192 (36%), Positives = 94/192 (48%), Gaps = 25/192 (13%)
Query: 36 LQEKIRSHNTYQAQI---LPSNDISNSLFY-VNISIGQPPVPQFTAMDTGSSLLWVHCYP 91
L++ + SH+ +QI L S +L Y V + +G + +DTGS L WV C P
Sbjct: 116 LRKMVSSHSVEVSQIQIPLASGVNFQTLNYIVTMELGGQDMT--VIIDTGSDLTWVQCEP 173
Query: 92 CRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPY-----ARCSAYKHRCIYTQLYLIGP 146
C C Q G +F PS SSSY +PC+S C+ C + C Y Y G
Sbjct: 174 CMSCYNQQGPVFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACESNPSNCSYAVNYGDGS 233
Query: 147 ETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGL-GIGRS--SLVSQL 203
T+ + E L+F I V + VFGCG N F G+ GL G+GRS SL+SQ
Sbjct: 234 YTNGELGAEHLSFGG-----ISVSNFVFGCG--KNNKGLFGGVSGLMGLGRSNLSLISQT 286
Query: 204 NSS----FSYCI 211
NS+ FSYC+
Sbjct: 287 NSTFGGVFSYCL 298
>gi|326515330|dbj|BAK03578.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 445
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 108/384 (28%), Positives = 155/384 (40%), Gaps = 64/384 (16%)
Query: 63 VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR 122
V I G+ F +DT SSL W+ C C Q +F PS SSSY + S CR
Sbjct: 78 VTIGTGRGKSTYFLVLDTASSLPWMRCAHCLPVQRQRSPVFDPSDSSSYRPLHPTSPLCR 137
Query: 123 YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST-- 180
P A +C + +L G E +V T+ + N T+ + V FGC ST
Sbjct: 138 A-PNPVLPA-GDKCSF---HLPG-EAHGYVGTDTIILGN---PTLPIHSVAFGCAQSTEG 188
Query: 181 -NRNFKFSGIFGLGIGRSSLVSQLN----SSFSYC-IGSLHDPD---------------- 218
+ F+G G+G +SL+ Q+ S FSYC IG H P
Sbjct: 189 FDTKGTFAGTLGMGKLPTSLIMQIKDRVGSRFSYCLIGLGHSPGRNGFIRFGADIPDPTL 248
Query: 219 YLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRML-DINPNIF-KRDDSGGG 276
+H+++ I+ P D YY+ L IS++G + I +F +R D GG
Sbjct: 249 LVHHRI-----KILPTPPHLPHGVADSAYYVKLLGISLNGTPIPGIRQAMFERRSDGSGG 303
Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD-KLCYH---GIMSSDLKGFP 332
+D+GT VT LV AY + + V ++ + P+ LC+ GI S P
Sbjct: 304 CFVDAGTQVTHLVPAAYAVVEEAVAHMVQQWGYKRVRDPNFSLCFREHPGIWSH----IP 359
Query: 333 TVRFHFRGGAK-----LALEKDSMFY----QPRPDAFCMAVNPASINNRYVNFSLIGMMA 383
+ F G A L + ++F QP C V S + V +G M
Sbjct: 360 KLTLDFEGPASRTVAHLEIVSRNLFLKVDNQP---LVCFGVYRTSRGSPTV----VGAMQ 412
Query: 384 QQFYNVGYDIGRKQKTFQRMDCEV 407
Q +D+ TF R CE
Sbjct: 413 QVDTRFIFDLHANTITFHRESCEA 436
>gi|224091907|ref|XP_002309394.1| predicted protein [Populus trichocarpa]
gi|222855370|gb|EEE92917.1| predicted protein [Populus trichocarpa]
Length = 469
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 104/436 (23%), Positives = 181/436 (41%), Gaps = 62/436 (14%)
Query: 15 NENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQ 74
++NP + ++S++R +++ + + + P S + ++++ G PP
Sbjct: 49 SKNPWGALNHLASLSLSRAHHIKSPKTKFSLLKTPLFPR---SYGGYSISLNFGTPPQTT 105
Query: 75 FTAMDTGSSLLWVHC---YPCRDCS-PQLGT----IFYPSRSSSYAYVPCDSEHCRYF-- 124
MDTGSSL+W C Y C C P + F P +SSS + C + C +
Sbjct: 106 KFVMDTGSSLVWFPCTSRYLCSRCDFPNIEVTGIPTFIPKQSSSSNLIGCKNHKCSWLFG 165
Query: 125 PYAR-----CSAYKHRCI-----YTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
P + C C Y Y +G + +S E L F + + +
Sbjct: 166 PKVQSKCQECDPTTQNCTQSCPPYVIQYGLGSTAGLLLS-ETLDFPHKKT----IPGFLV 220
Query: 175 GCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGS-LHDPDYLHNKLILGDGAII 232
GC + R + GI G G SL SQL FSYC+ S D + L+L G+
Sbjct: 221 GCSLFSIRQPE--GIAGFGRSPESLPSQLGLKKFSYCLVSHAFDDTPASSDLVLDTGSGS 278
Query: 233 DEGDA-----TPLQ------FIDGHYYITLEAISVDGRMLDINPN-IFKRDDSGGGVMID 280
D+ TP Q F D +YY+ L I + + + + D GG ++D
Sbjct: 279 DDTKTPGLSYTPFQKNPTAAFRD-YYYVLLRNIVIGDTHVKVPYKFLVPGSDGNGGTIVD 337
Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL--------CYHGIMSSDLKGFP 332
SGT T++ K YE + E +Q+ Y+ ++ C++ I P
Sbjct: 338 SGTTFTFMEKPVYELVAKEFE-----KQVAHYTVATEVQNQTGLRPCFN-ISGEKSVSVP 391
Query: 333 TVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFS---LIGMMAQQFYNV 389
FHF+GGAK+AL + F C+ + +++ + ++G Q+ ++V
Sbjct: 392 EFIFHFKGGAKMALPLANYFSFVDSGVICLTIVSDNMSGSGIGGGPAIILGNYQQRNFHV 451
Query: 390 GYDIGRKQKTFQRMDC 405
+D+ ++ F++ +C
Sbjct: 452 EFDLKNERFGFKQQNC 467
>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 358
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 75/278 (26%), Positives = 123/278 (44%), Gaps = 28/278 (10%)
Query: 37 QEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-C 95
++ IR + + P I + +YV + G P +DTGSSL W+ C PC C
Sbjct: 94 KKDIRFPKSVSVPLNPGASIGSGNYYVKVGFGSPARYYSMIVDTGSSLSWLQCKPCVVYC 153
Query: 96 SPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSV 150
Q +F PS S +Y + C S C A C + C+YT Y +
Sbjct: 154 HVQADPLFDPSASKTYKSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMG 213
Query: 151 FVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS---- 205
++S + LT + + V+GCG ++ F + +GI GLG + S++ Q++S
Sbjct: 214 YLSQDLLTLAPSQT----LPGFVYGCGQDSDGLFGRAAGILGLGRNKLSMLGQVSSKFGY 269
Query: 206 SFSYCIGSLHDPDYLHNKLILGDGAIIDEG-DATPLQFIDGH---YYITLEAISVDGRML 261
+FSYC+ + +L +G ++ TP+ G+ Y++ L AI+V GR L
Sbjct: 270 AFSYCLPTRGGGGFLS----IGKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRAL 325
Query: 262 DINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDE 299
+ ++ +IDSGT +T L Y +
Sbjct: 326 GVAAAQYRVP-----TIIDSGTVITRLPMSVYTPFQQA 358
>gi|125543284|gb|EAY89423.1| hypothetical protein OsI_10930 [Oryza sativa Indica Group]
Length = 447
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 104/397 (26%), Positives = 160/397 (40%), Gaps = 57/397 (14%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
N V +++G PP +DTGS L W+ C +P L F S SSSY VPC
Sbjct: 52 NVSLTVPVAVGTPPQNVTMVLDTGSELSWLLCN--GSYAPPLTPAFNASGSSSYGAVPCP 109
Query: 118 SEHCRY------FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
S C + P + + C + Y ++T+ TF T +
Sbjct: 110 STACEWRGRDLPVPPFCDTPPSNACRVSLSYADASSADGVLATD--TFLLTGGAPPVAVG 167
Query: 172 VVFGC-------------GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS-FSYCIGSLHDP 217
FGC G T+ + +G+ G+ G S V+Q + F+YCI P
Sbjct: 168 AYFGCITSYSSTTATNSNGTGTDVSEAATGLLGMNRGTLSFVTQTGTRRFAYCIAPGEGP 227
Query: 218 DYLHNKLILGDGAIIDEGDATPLQFIDG--------HYYITLEAISVDGRMLDINPNIFK 269
L L+ DG + + TPL I Y + LE I V +L I ++
Sbjct: 228 GVL---LLGDDGGVAPPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVGCALLPIPKSVLT 284
Query: 270 RDDSGGG-VMIDSGTDVTWLVKEAYEALRDE------VMIRLEGEQMRSYSWPDKLCYHG 322
D +G G M+DSGT T+L+ +AY AL+ E +++ GE + C+ G
Sbjct: 285 PDHTGAGQTMVDSGTQFTFLLADAYAALKAEFTSQARLLLAPLGEPGFVFQGAFDACFRG 344
Query: 323 -----IMSSDLKGFPTVRFHFRGGAKLALEKDSMFY----QPRPDAFCMAVNPASINNR- 372
+S L P V R GA++A+ + + Y + R + AV + N
Sbjct: 345 PEARVAAASGL--LPVVGLVLR-GAEVAVSGEKLLYMVPGERRGEGGAEAVWCLTFGNSD 401
Query: 373 --YVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
++ +IG QQ V YD+ + F C++
Sbjct: 402 MAGMSAYVIGHHHQQNVWVEYDLQNGRVGFAPARCDL 438
>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
Length = 492
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 104/406 (25%), Positives = 168/406 (41%), Gaps = 62/406 (15%)
Query: 36 LQEKIRSHNTYQAQILPS------NDISNSLFYVN-ISIGQPPVPQFTAMDTGSSLLWVH 88
L+ SH ++L S +D+ +Y + + IG PP +DTGS++ +V
Sbjct: 3 LELVANSHRRRDRELLGSARMDLHDDLLTKGYYTSRVKIGTPPHEFSLIVDTGSTVTYVP 62
Query: 89 CYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPET 148
C C C F P+ SSSY + C SE F C + Y + Y +
Sbjct: 63 CSSCTHCGNHQDPRFSPALSSSYKPLECGSECSTGF----CDGSRK---YQRQYAEKSTS 115
Query: 149 SVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF---KFSGIFGLGIGRSSLVSQL-- 203
S + + + F N+ S + Q +VFGC + + GI GLG G S++ QL
Sbjct: 116 SGVLGKDVIGFSNS--SDLGGQRLVFGCETAETGDLYDQTADGIIGLGRGPLSIIDQLVE 173
Query: 204 ----NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFI----DGH----YYITL 251
FS C G + + G GA+I G P + D H Y + L
Sbjct: 174 KNAMEDVFSLCYGGMDE----------GGGAMILGGFQPPKDMVFTASDPHRSPYYNLML 223
Query: 252 EAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS 311
+ I V G L + P +F D G ++DSGT + A++A + V ++ ++
Sbjct: 224 KGIRVGGSPLRLKPEVF---DGKYGTVLDSGTTYAYFPGAAFQAFKSAVKEQV--GSLKE 278
Query: 312 YSWPDK----LCYHGI---MSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP--DAFCM 362
PD+ +CY G +S+ + FP+V F F G + L ++ ++ A+C+
Sbjct: 279 VPGPDEKFKDICYAGAGTNVSNLSQFFPSVDFVFGDGQSVTLSPENYLFRHTKISGAYCL 338
Query: 363 AVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
V N L G++ + V Y+ G+ F + C L
Sbjct: 339 GV----FENGDPTTLLGGIIVRNML-VTYNRGKASIGFLKTKCNDL 379
>gi|413938615|gb|AFW73166.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 386
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 108/370 (29%), Positives = 152/370 (41%), Gaps = 51/370 (13%)
Query: 55 DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR---DCSPQLGTIFYPSRSSSY 111
DI + V S+G P V Q +DTGS L WV C PC C Q +F P++SSSY
Sbjct: 42 DIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSY 101
Query: 112 AYVPCDSEHCRYFP-YARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
A VPC C YA + +C Y Y G T+ S++ LT + VQ
Sbjct: 102 AAVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASSA----VQ 157
Query: 171 DVVFGCGFSTNRNFK-FSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLI 225
FGCG + + F G+ GLG + SLV Q + FSYC L L
Sbjct: 158 GFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYC---LPTKPSTAGYLT 214
Query: 226 LGDGAIIDEGDA------TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
LG G P +Y + L ISV G+ L + + F ++
Sbjct: 215 LGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGT-----VV 269
Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS--DLKGF-----P 332
D+GT VT L AY ALR M SY +P +GI+ + + G+ P
Sbjct: 270 DTGTVVTRLPPTAYAALRSAFR-----SGMASYGYPTAPS-NGILDTCYNFAGYGTVTLP 323
Query: 333 TVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNV--- 389
V F GA + L D + C+A P+ + +++G + Q+ + V
Sbjct: 324 NVALTFGSGATVTLGADGIL-----SFGCLAFAPSGSDG---GMAILGNVQQRSFEVRID 375
Query: 390 GYDIGRKQKT 399
G +G K +
Sbjct: 376 GTSVGFKPSS 385
>gi|357507805|ref|XP_003624191.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499206|gb|AES80409.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 406
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 98/353 (27%), Positives = 148/353 (41%), Gaps = 49/353 (13%)
Query: 84 LLWVHCYPCRDCSPQLG---TIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKH--RCIY 138
LL + C C S LG T++ P+ S + VPC C S K C Y
Sbjct: 28 LLQLGCTACPKKS-GLGMDLTLYDPNGSKTSNAVPCGDGFCTDTYSGPISGCKQDMSCPY 86
Query: 139 TQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ----DVVFGCG------FSTNRNFKFSG 188
+ Y G TS + LTF +H + V+FGCG S+N + G
Sbjct: 87 SITYGDGSTTSGSFVNDSLTFDEV-SGNLHTKPDNSSVIFGCGAKQSGSLSSNSDEALDG 145
Query: 189 IFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQ 241
I G G SS++SQL +S FS+C+ D H I G +++ + + TPL
Sbjct: 146 IIGFGQANSSVLSQLAASGKVKRIFSHCL------DSHHGGGIFSIGQVMEPKFNTTPLV 199
Query: 242 FIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVM 301
HY + L+ + VDG + + +F SG G +IDSGT + +L Y L +V+
Sbjct: 200 PRMAHYNVILKDMDVDGEPILLPLYLFDSG-SGRGTIIDSGTTLAYLPLSIYNQLLPKVL 258
Query: 302 IRLEG-------EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQ 354
R G +Q + + DKL +GFP V+FHF G + D +F
Sbjct: 259 GRQPGLKLMIVEDQFTCFHYSDKLD---------EGFPVVKFHFEGLSLTVHPHDYLFLY 309
Query: 355 PRPDAFCMAVNPASINNRY-VNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
+ D +C+ +S + + LIG + V YD+ + +C
Sbjct: 310 -KEDIYCIGWQKSSTQTKEGRDLILIGDLVLSNKLVVYDLENMVIGWTNFNCS 361
>gi|115452187|ref|NP_001049694.1| Os03g0271900 [Oryza sativa Japonica Group]
gi|29893618|gb|AAP06872.1| hypothetical protein [Oryza sativa Japonica Group]
gi|108707424|gb|ABF95219.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|108707425|gb|ABF95220.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548165|dbj|BAF11608.1| Os03g0271900 [Oryza sativa Japonica Group]
gi|215715205|dbj|BAG94956.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737033|dbj|BAG95962.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215740994|dbj|BAG97489.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 447
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 104/397 (26%), Positives = 160/397 (40%), Gaps = 57/397 (14%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
N V +++G PP +DTGS L W+ C +P L F S SSSY VPC
Sbjct: 52 NVSLTVPVAVGTPPQNVTMVLDTGSELSWLLCN--GSYAPPLTPAFNASGSSSYGAVPCP 109
Query: 118 SEHCRY------FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
S C + P + + C + Y ++T+ TF T +
Sbjct: 110 STACEWRGRDLPVPPFCDTPPSNACRVSLSYADASSADGVLATD--TFLLTGGAPPVAVG 167
Query: 172 VVFGC-------------GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS-FSYCIGSLHDP 217
FGC G T+ + +G+ G+ G S V+Q + F+YCI P
Sbjct: 168 AYFGCITSYSSTTATNSNGTGTDVSEAATGLLGMNRGTLSFVTQTGTRRFAYCIAPGEGP 227
Query: 218 DYLHNKLILGDGAIIDEGDATPLQFIDG--------HYYITLEAISVDGRMLDINPNIFK 269
L L+ DG + + TPL I Y + LE I V +L I ++
Sbjct: 228 GVL---LLGDDGGVAPPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVGCALLPIPKSVLT 284
Query: 270 RDDSGGG-VMIDSGTDVTWLVKEAYEALRDE------VMIRLEGEQMRSYSWPDKLCYHG 322
D +G G M+DSGT T+L+ +AY AL+ E +++ GE + C+ G
Sbjct: 285 PDHTGAGQTMVDSGTQFTFLLADAYAALKAEFTSQARLLLAPLGEPGFVFQGAFDACFRG 344
Query: 323 -----IMSSDLKGFPTVRFHFRGGAKLALEKDSMFY----QPRPDAFCMAVNPASINNR- 372
+S L P V R GA++A+ + + Y + R + AV + N
Sbjct: 345 PEARVAAASGL--LPEVGLVLR-GAEVAVSGEKLLYMVPGERRGEGGAEAVWCLTFGNSD 401
Query: 373 --YVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
++ +IG QQ V YD+ + F C++
Sbjct: 402 MAGMSAYVIGHHHQQNVWVEYDLQNGRVGFAPARCDL 438
>gi|449467979|ref|XP_004151699.1| PREDICTED: probable aspartic protease At2g35615-like, partial
[Cucumis sativus]
Length = 209
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 58/186 (31%), Positives = 93/186 (50%), Gaps = 13/186 (6%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTY--QAQILPSNDISN 58
L HRDS+LSP + + R+ S++R A L + ++ QA + P +
Sbjct: 34 LFHRDSLLSPLEFSSLSHYDRLTNAFRRSLSRSATLLNRAATNGALDLQAPLTPGS---- 89
Query: 59 SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
+ +++SIG PPV DTGS L+W C PC C Q IF P +S+S+++VPC+S
Sbjct: 90 GEYLMSVSIGTPPVDYIGMADTGSDLMWAQCLPCLKCYKQSRPIFDPLKSTSFSHVPCNS 149
Query: 119 EHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
++C+ + C A + C Y+ Y G +T + L F+ + V+ V+ GCG
Sbjct: 150 QNCKAIDDSHCGA-QGVCDYS--YTYGDQT---YTKGDLGFEKITIGSSSVKSVI-GCGH 202
Query: 179 STNRNF 184
+ F
Sbjct: 203 ESGGGF 208
>gi|168063189|ref|XP_001783556.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664943|gb|EDQ51645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 414
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 98/381 (25%), Positives = 157/381 (41%), Gaps = 47/381 (12%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC-YPCRDCSPQLGTIFYPSRSSSYAYVPC 116
+ L+Y+ + IG P + MDTGS L W+ C PCR C+ ++ P R+ V C
Sbjct: 28 DGLYYMAMRIGNPAKLYYLDMDTGSDLTWLQCDAPCRSCAVGPHGLYDPKRAR---VVDC 84
Query: 117 DSEHCRYFPYA---RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
C CS +C Y Y+ G T + + +T T+ + + V+
Sbjct: 85 RRPTCAQVQRGGQFTCSGDVRQCDYEVDYVDGSSTMGILVEDTITLVLTNGTRFQTRAVI 144
Query: 174 FGCGFSTNRNFKFS-----GIFGLGIGRSSLVSQL------NSSFSYCI-GSLHDPDYL- 220
GCG+ + G+ GL + SL SQL N+ +C+ G + YL
Sbjct: 145 -GCGYDQQGTLAKAPAVTDGVIGLSSSKISLPSQLAAKGIANNVIGHCLAGGSNGGGYLF 203
Query: 221 -HNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
+ L+ G PL ++G Y L +I G +L++ D GG M
Sbjct: 204 FGDTLVPALGMTWTPMIGRPL--VEG-YQARLRSIKYGGEVLELEGTT----DDVGGAMF 256
Query: 280 DSGTDVTWLVKEAYEALRDEV--------MIRLEGEQMRSYSWPDKLCYHGIMSSDLKG- 330
DSGT T+LV AY A+ V + R++ + + W + + +D+
Sbjct: 257 DSGTSFTYLVPNAYTAVLSAVVRQAQRSGLERIKTDTTLPFCWRGPSPFESV--ADVSAY 314
Query: 331 FPTVRFHFRG------GAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQ 384
F TV F G G L L + C+ V AS+ + V +++G ++
Sbjct: 315 FKTVTLDFGGSTWWSSGKLLELSPEGYLIVSTQGNVCLGVLDASVASLEVT-NILGDISM 373
Query: 385 QFYNVGYDIGRKQKTFQRMDC 405
+ Y V YD R+Q + R +C
Sbjct: 374 RGYLVVYDNMREQIGWVRRNC 394
>gi|255563739|ref|XP_002522871.1| DNA binding protein, putative [Ricinus communis]
gi|223537955|gb|EEF39569.1| DNA binding protein, putative [Ricinus communis]
Length = 414
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 109/435 (25%), Positives = 170/435 (39%), Gaps = 89/435 (20%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHN---TYQAQIL-PSNDI 56
LIHRDS SP+ + R+ R + S KIR+HN + ++ P
Sbjct: 36 LIHRDSPESPFYPGKLTNSERISRLVEFS---------KIRAHNFDSGFSSEAFRPPVFQ 86
Query: 57 SNSLFYVNISIGQPPVPQFTAMDTGSSLLWV----HCYPCRDCSPQLGTIFYPSRSSSYA 112
+ + V + IG P +P + DTGS+L+W + + CR+
Sbjct: 87 DFTCYLVKVRIGNPGIPLYLVPDTGSALIWTVNNQNIFQCRN------------------ 128
Query: 113 YVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
++C YT+ Y G T+ + + L + ++ +
Sbjct: 129 ---------------------NKCSYTRRYDDGSITTGVAAQDILQSEGSERIPFY---- 163
Query: 173 VFGCGFSTNRNF-------KFSGIFGLGIGRSSLVSQLN----SSFSYCIGSLHDPDYLH 221
FGC N+NF K G+ GL SL+ QL+ FSYC+
Sbjct: 164 -FGCS-RDNQNFSVFEHTGKSGGVMGLNTSPVSLLQQLSHITQRRFSYCLNPYQHGSEPP 221
Query: 222 NKLILGDGAIIDEG----DATPLQFIDG--HYYITLEAISVDGRMLDINPNIFK-RDDSG 274
+L G I +G +TPL +Y++ L ++V G+ L + P F R D
Sbjct: 222 PSSLLRFGNDIRKGRRRFQSTPLMSSPDRPNYFLNLLDMTVAGQRLHLPPGTFALRQDGT 281
Query: 275 GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD-KLCYHGIMSSDLKGFPT 333
GG +IDSGT +T++ + AY L + + P+ LCY + +
Sbjct: 282 GGTIIDSGTGLTFITQTAYPRLISAFQNYFDHRGFQRVHIPEFDLCYSFRGNHTFHDHAS 341
Query: 334 VRFHFRGGAKLALEKDSMFYQPRPD--AFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
+ FHF A ++ D + Y P D AFC+A+ P R V IG + Q Y
Sbjct: 342 MTFHFE-RADFTVQADYV-YLPMEDDNAFCVALQPTPPQQRTV----IGAINQGNTRFIY 395
Query: 392 DIGRKQKTFQRMDCE 406
D Q F +C
Sbjct: 396 DAAAHQLLFIAENCR 410
>gi|413946455|gb|AFW79104.1| hypothetical protein ZEAMMB73_209101 [Zea mays]
Length = 480
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 102/381 (26%), Positives = 155/381 (40%), Gaps = 46/381 (12%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC---SPQLGTIFYPSRSSSYAYVPCD 117
++V +G P P DTGS L WV C D +P+ +F + S S+A + C
Sbjct: 112 YFVRFRVGTPAQPFVLVADTGSDLTWVKCSGAGDGTGDAPR--RVFRAAASRSWAPIACS 169
Query: 118 SEHC-RYFPY--ARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTF-------KNTDESTI 167
S+ C Y P+ A CS+ C Y Y G V T+ T ++
Sbjct: 170 SDTCTSYVPFSLANCSSPASPCAYDYRYNDGSAARGVVGTDSATIALSGSESRDGGGRRA 229
Query: 168 HVQDVVFGCGFS-TNRNFKFS-GIFGLGIGRSSLVS----QLNSSFSYCIGSLHDPDYLH 221
+Q VV GC S ++F+ S G+ LG S S + FSYC+ P
Sbjct: 230 KLQGVVLGCTASYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRNAT 289
Query: 222 NKLILGDGAIIDEGDA------------TPL---QFIDGHYYITLEAISVDGRMLDINPN 266
+ L G EG A TPL + + Y + ++A+ V G LDI +
Sbjct: 290 SYLTFGPPG--PEGGAAASSSSSSAAARTPLLLDRRMSPFYAVAVDAVHVAGEALDIPAD 347
Query: 267 IFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS 326
++ GGG ++DSGT +T L AY A+ + RL G S P + CY+ ++
Sbjct: 348 VWDV-ARGGGAILDSGTSLTVLATPAYRAVVAALSERLAGLPRVSMD-PFEYCYN--WTA 403
Query: 327 DLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQF 386
P + F G A+L S P C+ V + S+IG + QQ
Sbjct: 404 AALEIPGLEVRFAGSARLQPPAKSYVVDAAPGVKCIGVQ----EGAWPGVSVIGNILQQD 459
Query: 387 YNVGYDIGRKQKTFQRMDCEV 407
+ +D+ + F+ C +
Sbjct: 460 HLWEFDLRDRWLRFKHTRCAL 480
>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
Length = 435
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 83/292 (28%), Positives = 125/292 (42%), Gaps = 33/292 (11%)
Query: 24 RGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSS 83
R + + L K+R H+ N V++++G PP +DTGS
Sbjct: 37 RSRQVPVGALPRPPSKLRFHH-------------NVSLTVSLAVGTPPQNVTMVLDTGSE 83
Query: 84 LLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHC--RYFPYA-RCSAYKHRCIYTQ 140
L W+ C R + + F P S+++A VPC S C R P C A RC +
Sbjct: 84 LSWLLCATGRAAAAAADS-FRPRASATFAAVPCGSARCSSRDLPAPPSCDAASRRCRVSL 142
Query: 141 LYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF----STNRNFKFSGIFGLGIGR 196
Y G + ++T+ F D + FGC S+ +G+ G+ G
Sbjct: 143 SYADGSASDGALATD--VFAVGDAPPLRS---AFGCMSAAYDSSPDAVATAGLLGMNRGA 197
Query: 197 SSLVSQLNSS-FSYCIGSLHDPDYL---HNKL-ILGDGAIIDEGDATPLQFIDG-HYYIT 250
S V+Q ++ FSYCI D L H+ L L PL + D Y +
Sbjct: 198 LSFVTQASTRRFSYCISDRDDAGVLLLGHSDLPFLPLNYTPLYQPTPPLPYFDRVAYSVQ 257
Query: 251 LEAISVDGRMLDINPNIFKRDDSGGG-VMIDSGTDVTWLVKEAYEALRDEVM 301
L I V G+ L I P++ D +G G M+DSGT T+L+ +AY A++ E +
Sbjct: 258 LLGIRVGGKPLPIPPSVLAPDHTGAGQTMVDSGTQFTFLLGDAYSAVKAEFL 309
>gi|413944596|gb|AFW77245.1| hypothetical protein ZEAMMB73_545774 [Zea mays]
gi|414876929|tpg|DAA54060.1| TPA: hypothetical protein ZEAMMB73_875469 [Zea mays]
Length = 459
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 98/368 (26%), Positives = 152/368 (41%), Gaps = 38/368 (10%)
Query: 55 DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC--YPCRDCSPQLGTIFYPSRSSSYA 112
D S + + S+G PP DTGS L+W C C PQ + P+ SS++A
Sbjct: 85 DDSGGAYDMEFSMGTPPQKLTALADTGSDLIWAKCGGACTTSCEPQGSPSYLPNASSTFA 144
Query: 113 YVPCDSEHC---RYFPYARCSAYKHRCIYTQLYLIGPE----TSVFVSTEQLTFKNTDES 165
+PC C R A C+A C Y Y +G + T F++ E T
Sbjct: 145 KLPCSDRLCSLLRSDSVAWCAAAGAECDYRYSYGLGDDDHHYTQGFLARETFTLGAD--- 201
Query: 166 TIHVQDVVFGCGFSTNRNFKFSGIFGLGI-GRSSLVSQLN-SSFSYCIGSLHDPDYLHNK 223
V V FGC ++ + G SLVSQLN S+F YC+ S +
Sbjct: 202 --AVPSVRFGCTTASEGGYGSGSGLVGLGRGPLSLVSQLNASTFMYCLTSDASK---ASP 256
Query: 224 LILGDGAIID--EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDS 281
L+ G A + + +T L Y + L +IS+ P + + + GV+ DS
Sbjct: 257 LLFGSLASLTGAQVQSTGLLASTTFYAVNLRSISIGSA---TTPGVGEPE----GVVFDS 309
Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLK--GFPTVRFHFR 339
GT +T+L + AY + + + +Q+ + C+ + L PT+ HF
Sbjct: 310 GTTLTYLAEPAYSEAKAAFLSQTSLDQVEDTDGFEA-CFQKPANGRLSNAAVPTMVLHFD 368
Query: 340 GGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKT 399
GA +AL + + C V R + S+IG + Q Y V +D+ R +
Sbjct: 369 -GADMALPVANYVVEVEDGVVCWIV------QRSPSLSIIGNIMQVNYLVLHDVHRSVLS 421
Query: 400 FQRMDCEV 407
FQ +C+
Sbjct: 422 FQPANCDT 429
>gi|125571841|gb|EAZ13356.1| hypothetical protein OsJ_03278 [Oryza sativa Japonica Group]
Length = 447
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 51/180 (28%), Positives = 81/180 (45%), Gaps = 12/180 (6%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
+ HRD++ P P +++ + AR A L + + + + +
Sbjct: 31 VFHRDALFPP--PPGAKRGSLLRQRLAADAARYASL---VDATGRLHSPVFSGIPFESGE 85
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
++ + +G P +DTGS L+W+ C PCR C Q G +F P RSS+Y VPC S
Sbjct: 86 YFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRSSTYRRVPCSSPQ 145
Query: 121 CRYFPYARC---SAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
CR + C A C Y Y G ++ ++T++L F N +V +V GCG
Sbjct: 146 CRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGDLATDKLAFAN----DTYVNNVTLGCG 201
>gi|222632750|gb|EEE64882.1| hypothetical protein OsJ_19741 [Oryza sativa Japonica Group]
Length = 456
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 108/388 (27%), Positives = 156/388 (40%), Gaps = 52/388 (13%)
Query: 39 KIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ 98
+ R + A +L ++ + +G P +DTGS ++W P R P
Sbjct: 100 RPRRRGGFAAPLLSGLPQGTGEYFAQVGVGTPATTALMVLDTGSDVVWA---PVRALPPL 156
Query: 99 LGTIFYPSRSSSYAYVP-----CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVS 153
L + S S+ A P C + CR A C ++ C+Y Y G T+ +
Sbjct: 157 LRAVRQGS-STGAAPAPTPRWNCVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFA 215
Query: 154 TEQLTFKNTDESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFS 208
+E LTF VQ V GCG F SG+ GLG GR S SQ+ SFS
Sbjct: 216 SETLTFAR----GARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFS 271
Query: 209 YCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRM-------- 260
YC+ G TP + YY+ L SV G
Sbjct: 272 YCLVDRTSSRRARPSRRWG---------GTPR--MATFYYVHLLGFSVGGARVKGVSQSD 320
Query: 261 LDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMR--SYSWPDKL 318
L +NP + GGV++DSGT VT L + YEA+RD G ++ +S D
Sbjct: 321 LRLNPTTGR-----GGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDT- 374
Query: 319 CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFS 377
CY+ + + PTV H GGA +AL ++ + FC A+ A + S
Sbjct: 375 CYN-LSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAM--AGTDG---GVS 428
Query: 378 LIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
+IG + QQ + V +D ++ F C
Sbjct: 429 IIGNIQQQGFRVVFDGDAQRVGFVPKSC 456
>gi|115457778|ref|NP_001052489.1| Os04g0337000 [Oryza sativa Japonica Group]
gi|113564060|dbj|BAF14403.1| Os04g0337000, partial [Oryza sativa Japonica Group]
Length = 321
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 79/302 (26%), Positives = 137/302 (45%), Gaps = 35/302 (11%)
Query: 57 SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSY 111
+ L+Y I IG P + +DTGS +LWV+C C C + G T++ P SS+
Sbjct: 29 ATRLYYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTG 88
Query: 112 AYVPCDSEHC--RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD---EST 166
+ V CD C Y C Y+ Y G T+ + ++ L F ++
Sbjct: 89 SKVSCDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTR 148
Query: 167 IHVQDVVFGCGFST-----NRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLH 215
V FGCG + N GI G G +S++SQL+++ F++C+
Sbjct: 149 PANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCL---- 204
Query: 216 DPDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG 274
D ++ I G ++ + TPL HY + L++I V G L + ++F +
Sbjct: 205 --DTINGGGIFAIGNVVQPKVKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEK- 261
Query: 275 GGVMIDSGTDVTWLVKEAYEALRDEVMIRL--EGEQMRSYSWPDKLCYHGIMSSDLKGFP 332
G +IDSGT +T+L + Y+ E+M+ + + + + ++ + LC+ + L+ P
Sbjct: 262 KGTIIDSGTTLTYLPEIVYK----EIMLAVFAKHKDITFHNVQEFLCFQYVGRYTLQHTP 317
Query: 333 TV 334
+V
Sbjct: 318 SV 319
>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
Length = 418
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 93/365 (25%), Positives = 148/365 (40%), Gaps = 51/365 (13%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCY--PCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
+ V+++ G PP +DTGS + W C P C Q +F PS SSS+A +PC S
Sbjct: 88 YLVHLAAGTPPQEVQLTLDTGSDITWTQCKRCPASACFNQTLPLFDPSASSSFASLPCSS 147
Query: 119 EHCRYFPYARCS--AYKHRCIYTQLYLIGPETSVFVSTEQLTFKN--TDESTIHVQDVVF 174
C P A C Y+ Y G + + E TF + + S+ V +VF
Sbjct: 148 PACETTPPCGGGNDATSRPCNYSISYGDGSVSRGEIGREVFTFASGTGEGSSAAVPGLVF 207
Query: 175 GCGFSTNRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAI 231
GCG + F +GI G G G SL SQL +FS+C ++ + ++LG +
Sbjct: 208 GCGHANRGVFTSNETGIAGFGRGSLSLPSQLKVGNFSHCFTTITGSKT--SAVLLGLPGV 265
Query: 232 IDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKE 291
A+PL G Y S +SGT +T L
Sbjct: 266 APP-SASPLGRRRGSYRCRSTPRSS-----------------------NSGTSITSLPPR 301
Query: 292 AYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSM 351
Y A+R+E +++ + + C+ + PT+ HF GA + L +++
Sbjct: 302 TYRAVREEFAAQVKLPVVPGNATDPFTCFSAPLRGPKPDVPTMALHFE-GATMRLPQENY 360
Query: 352 FYQPRPD--------AFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
++ D C+AV I + ++G + QQ +V YD+ + +F
Sbjct: 361 VFEVVDDDDAGNSSRIICLAV----IEGGEI---ILGNIQQQNMHVLYDLQNSKLSFVPA 413
Query: 404 DCEVL 408
C+ L
Sbjct: 414 QCDQL 418
>gi|326533540|dbj|BAK05301.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 410
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 94/386 (24%), Positives = 160/386 (41%), Gaps = 58/386 (15%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHC----YPCRDCSPQLGTIFYPSRSSSYAYVPC 116
FY ++IG+P P F +DTGS+L W+ C + C+ C P+ +Y + V C
Sbjct: 38 FYATLNIGEPAKPYFLDVDTGSNLTWLECHHPVHGCKGCHPRPPHPYYTPADGNLKVV-C 96
Query: 117 DSEHC----RYFP-YARCSAY-KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
S C R P CS HRC Y Y+ G ++ ++T+ ++ D+ I
Sbjct: 97 GSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVTG-KSEGDLATDIISVNGRDKKRI--- 152
Query: 171 DVVFGCGFSTNRNF-----KFSGIFGLGIGRSSLVSQLN-------SSFSYCIGSLHDPD 218
FGCG+ GI GLG+G++ L +QL + +C+ S
Sbjct: 153 --AFGCGYKQEEPADSPPSPVDGILGLGMGKAGLAAQLKGHKMIKENVIGHCLSSKGK-- 208
Query: 219 YLHNKLILGDGAIIDEG-DATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
L +GD G P++ +Y L + +D + + NP
Sbjct: 209 ---GVLYVGDFNPPTRGVTWAPMRESLFYYSPGLAEVFIDKQPIRGNPTF--------EA 257
Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEG---EQMRSYSWPDKLCYHGI--------MSS 326
+ DSG+ T + + Y + +V + L E+++ + P LC+ G + +
Sbjct: 258 VFDSGSTYTHVPAQIYNEIVSKVRVTLSESSLEEVKGRALP--LCWKGKKPFGSVNDVKN 315
Query: 327 DLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINN--RYVNFSLIGMMAQ 384
K H RG + L + + + C+A+ AS++ + +NF LIG +
Sbjct: 316 QFKALSLKITHARGTSNLDIPPQNYLFVKEDGETCLAILDASLDPVLKELNFILIGAVTM 375
Query: 385 QFYNVGYDIGRKQKTFQRMDCEVLDD 410
Q V YD +KQ + R C+ + +
Sbjct: 376 QDLFVIYDNEKKQLGWVRAQCDRVQE 401
>gi|225431324|ref|XP_002269880.1| PREDICTED: aspartic proteinase-like protein 1 [Vitis vinifera]
gi|297739017|emb|CBI28369.3| unnamed protein product [Vitis vinifera]
Length = 518
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 93/380 (24%), Positives = 156/380 (41%), Gaps = 58/380 (15%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGT---------IFYPSRSSS 110
L Y +S+G P A+DTGS L WV C C C+P GT I+ P SS+
Sbjct: 102 LHYTTVSLGTPGKKFLVALDTGSDLFWVPC-DCSRCAPTEGTTYASDFELSIYNPKGSST 160
Query: 111 YAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
V CD+ C + RC C Y Y+ ++ + E + T+++
Sbjct: 161 SRKVTCDNSLCAH--RNRCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTTEDNRQEFV 218
Query: 171 D--VVFGCGFSTNRNF----KFSGIFGLGIGRSSLVSQLNS------SFSYCIGSLHDPD 218
+ V FGCG +F +G+FGLG+ + S+ S L+ SFS C G PD
Sbjct: 219 EAYVTFGCGQVQTGSFLDIAAPNGLFGLGLEKISVPSILSKEGFTADSFSMCFG----PD 274
Query: 219 YLHNKLILGDGAIIDEGDATPLQFIDGH--YYITLEAISVDGRMLDINPNIFKRDDSGGG 276
+ ++ GD D+ + TP H Y IT+ + V ++D++
Sbjct: 275 GI-GRISFGDKGSPDQ-EETPFNLNALHPTYNITVTQVRVGTTLIDLDFT---------- 322
Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD-----KLCYHGIMSSDLKGF 331
+ DSGT T+LV Y V+ + S PD + CY +
Sbjct: 323 ALFDSGTSFTYLVDPIYT----NVLKSFHSQAQDSRRPPDSRIPFEFCYDMSPGENTSLI 378
Query: 332 PTVRFHFRGGAKLALEKDSMFYQPRPD-AFCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
P++ +GG++ + + + + +CMAV R ++IG Y +
Sbjct: 379 PSMSLTMKGGSQFPVYDPIIIISSQSELIYCMAV------VRSAELNIIGQNFMTGYRII 432
Query: 391 YDIGRKQKTFQRMDCEVLDD 410
+D + ++ +C+ +++
Sbjct: 433 FDREKLVLGWKEFECDDIEN 452
>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
Length = 395
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 88/350 (25%), Positives = 148/350 (42%), Gaps = 49/350 (14%)
Query: 53 SNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSR 107
++ +S L++ + +G P +DTGS +LWV+C PC C + T++ P
Sbjct: 21 ADPLSGGLYFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRE 80
Query: 108 SSSYAYVPCDSEHC---RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDE 164
SS+ + V C C R F A+CS + C Y Y G + + + + +
Sbjct: 81 SSTTSLVSCSDPLCVRGRRFAEAQCSQTTNNCEYIFSYGDGSTSEGYYVRDAMQYNVISS 140
Query: 165 STIH--VQDVVFGCGFSTNRNFKFS-----GIFGLGIGRSSLVSQLNSS------FSYCI 211
+ + V+FGC + S GI G G S+ +QL + FS+C+
Sbjct: 141 NGLANTTSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCL 200
Query: 212 GSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRD 271
+ + + + TPL HY + L ISV+ L I+ F
Sbjct: 201 EGEKRGGGILVIGGIAEPGMT----YTPLVPDSVHYNVVLRGISVNSNRLPIDAEDFSST 256
Query: 272 DSGGGVMIDSGTDVTWLVKEAY----EALRDEVM---IRLEGEQMRSYSWPDKLCYHGIM 324
+ GV++DSGT + + AY +A+R+ +R++G + + +L
Sbjct: 257 ND-TGVIMDSGTTLAYFPSGAYNVFVQAIREATSATPVRVQGMDTQCFLVSGRL------ 309
Query: 325 SSDLKGFPTVRFHFRGGAKLALEKDS--MFYQPRP----DAFCMAVNPAS 368
SDL FP V +F GGA + L+ D+ M+ P D +C+ +S
Sbjct: 310 -SDL--FPNVTLNFEGGA-MELQPDNYLMWGGTAPTGTTDVWCIGWQSSS 355
>gi|449434468|ref|XP_004135018.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 568
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 84/310 (27%), Positives = 137/310 (44%), Gaps = 53/310 (17%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTI---------FYPSRSSS 110
L+Y N+S+G P + A+DTGS L W+ C C C L T + P+ S++
Sbjct: 103 LYYANVSVGTPSLDFLVALDTGSDLFWLPC-ECSSCFTYLNTSNGGKFMLNHYSPNDSTT 161
Query: 111 YAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
+ VPC S C RC++ ++ C Y YL +S+ E + TD+S +
Sbjct: 162 SSTVPCTSSLCN-----RCTSNQNVCPYEMRYLSANTSSIGYLVEDVLHLATDDSLLKPV 216
Query: 171 D--VVFGCGFSTNRNFKFS----GIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPD 218
+ + FGCG F + G+ GLG+ + S+ S L ++SFS C G+
Sbjct: 217 EAKITFGCGTVQTGIFATTAAPNGLIGLGMEKISVPSFLADQGLTSNSFSMCFGADG--- 273
Query: 219 YLHNKLILGDGAIIDEGDATPLQFIDGH--YYITLEAISVDGRMLDINPNIFKRDDSGGG 276
+ ++ GD D+ TP + + Y +T I+V G PN D
Sbjct: 274 --YGRIDFGDTGPADQ-KQTPFNTMLEYQSYNVTFNVINVGGE-----PN-----DVPFT 320
Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYS-----WPDKLCYHGIMSSDLKGF 331
+ DSGT T+L + AY + ++ G +++ YS +P + CY + +
Sbjct: 321 AIFDSGTSFTYLTEPAYSTITKQMD---AGMKLKRYSLFGPNFPFEYCYEIPPGAKEFQY 377
Query: 332 PTVRFHFRGG 341
T+ F +GG
Sbjct: 378 LTLNFTMKGG 387
>gi|194708432|gb|ACF88300.1| unknown [Zea mays]
Length = 452
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 98/373 (26%), Positives = 155/373 (41%), Gaps = 51/373 (13%)
Query: 80 TGSSLLWVHC---YPCRDCSPQLGT---IFYPSRSSSYAYVPCDSEHCRYFPYAR----- 128
+GS L WV C Y CR+CS + +F+P SSS V C + C++ A
Sbjct: 79 SGSHLTWVPCTSSYECRNCSSPSASAVPVFHPKNSSSSRLVGCRNPSCQWVHSAANLATK 138
Query: 129 -----CSAYKHRCIYTQLYLIGPETSVFVS--TEQLTFKNTDESTIH-VQDVVFGCGFST 180
CS C + P V+ S T L +T + V V GC +
Sbjct: 139 CRRAPCSPGAANCPAAASNVCPPYAVVYGSGSTAGLLIADTLRAPGRAVPGFVLGCSLVS 198
Query: 181 NRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATP 239
SG+ G G G S+ +QL FSYC+ S D N + G + G
Sbjct: 199 VHQPP-SGLAGFGRGAPSVPAQLGLPKFSYCLLSRRFDD---NAAVSGSLVLGGTGGGEG 254
Query: 240 LQFI-------------DGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDV 285
+Q++ +YY+ L ++V G+ + + F + +G GG ++DSGT
Sbjct: 255 MQYVPLVKSAAGDKLPYGVYYYLALRGVTVGGKAVRLPARAFAANAAGSGGTIVDSGTTF 314
Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL----CYHGIMSSDLKGFPTVRFHFRGG 341
T+L ++ + D V+ + G RS D+L C+ + P + FHF GG
Sbjct: 315 TYLDPTVFQPVADAVVAAVGGRYKRSKDAEDELGLHPCFALPQGARSMALPELSFHFEGG 374
Query: 342 AKLALEKDSMFY---QPRPDAFCMAV------NPASINNRYVNFSLIGMMAQQFYNVGYD 392
A + L ++ F + +A C+AV + N ++G QQ Y V YD
Sbjct: 375 AVMQLPVENYFVVAGRGAVEAICLAVVTDFSGGSGAGNEGSGPAIILGSFQQQNYLVEYD 434
Query: 393 IGRKQKTFQRMDC 405
+ +++ F+R C
Sbjct: 435 LEKERLGFRRQSC 447
>gi|88174593|gb|ABD39371.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 82/329 (24%), Positives = 139/329 (42%), Gaps = 37/329 (11%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ +++ +G P Q +DTGSS WV C C C T F SRS++ A V C +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58
Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
C P+ + S C + Y G + + + LTF + + + FGC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPGFSFGCN 114
Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS---FSYCIGSLHDPDYLHNKLI--LGDG 229
F N G+ G+G G S++ Q + + FSYC+ +K G
Sbjct: 115 MDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDCFSYCLPLQKSERGFFSKTTGYFSLG 174
Query: 230 AIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
+ D + + +++ L AISVDG L ++P++F R GV+ DSG++
Sbjct: 175 KVATRTDVRYTKMVARKKNTELFFVDLIAISVDGERLGLSPSVFSRK----GVVFDSGSE 230
Query: 285 VTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
++++ A L E++++ + S ++ CY + S D P + HF
Sbjct: 231 LSYIPDRALSVLSQRIRELLLKRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFDDA 285
Query: 342 AKLALEKDSMFYQ---PRPDAFCMAVNPA 367
A+ L +F + D +C+A P
Sbjct: 286 ARFDLGSHGVFVERSVQEQDVWCLAFAPT 314
>gi|222618833|gb|EEE54965.1| hypothetical protein OsJ_02555 [Oryza sativa Japonica Group]
Length = 393
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 93/347 (26%), Positives = 132/347 (38%), Gaps = 87/347 (25%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSP---QLGTIFYPSRSSSYAYVPCD 117
+ +++ +G P V Q +DTGS + WV C PC SP G +F P+ SS+YA C
Sbjct: 106 YVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFNCS 165
Query: 118 SEHCRYFPYA----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
+ C + C A K RC Y Y G T+ F+
Sbjct: 166 AAACAQLGDSGEANGCDA-KSRCQYIVKYGDGSNTT------GTGFQ------------- 205
Query: 174 FGCG---FSTNRNFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGA 230
FGC + K G+ GLG SLVSQ + S P Y
Sbjct: 206 FGCSHAELGAGMDDKTDGLIGLGGDAQSLVSQTAAR------SKKVPTY----------- 248
Query: 231 IIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVK 290
Y+ LE I+V G+ L ++P++F G ++DSGT +T L
Sbjct: 249 ----------------YFAALEDIAVGGKKLGLSPSVFA-----AGSLVDSGTVITRLPP 287
Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKL-----CYHGIMSSDLKGFPTVRFHFRGGAKLA 345
AY AL M Y+ + L C++ D PTV F GGA +
Sbjct: 288 AAYAALSSAFR-----AGMTRYARAEPLGILDTCFN-FTGLDKVSIPTVALVFAGGAVVD 341
Query: 346 LEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
L+ + C+A P + F IG + Q+ + V YD
Sbjct: 342 LDAHGIV-----SGGCLAFAPTRDDKA---FGTIGNVQQRTFEVLYD 380
>gi|196212948|gb|ACG76110.1| S5 [Oryza sativa Japonica Group]
gi|340810887|gb|AEK75370.1| S5 [Oryza sativa]
gi|340810903|gb|AEK75378.1| S5 [Oryza sativa]
gi|340810921|gb|AEK75387.1| S5 [Oryza sativa]
gi|340810955|gb|AEK75404.1| S5 [Oryza sativa]
gi|340811079|gb|AEK75466.1| S5 [Oryza nivara]
gi|340811090|gb|AEK75471.1| S5 [Oryza rufipogon]
gi|340811116|gb|AEK75484.1| S5 [Oryza nivara]
Length = 357
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 109/383 (28%), Positives = 161/383 (42%), Gaps = 66/383 (17%)
Query: 63 VNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DC---SPQLGTIFYPSRSSSYAYVPCDS 118
+ +S+G+PPV A+DTGS+L WV C PC C S + G IF P RS + V C S
Sbjct: 1 MAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVRCSS 60
Query: 119 EHCRYFPY------ARCSAYKHRCIYTQLYLIGPETSV-FVSTEQLTFKNTDESTIHVQD 171
C Y A C + C Y+ Y G SV + T+ L ++ D
Sbjct: 61 VKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS------FMD 114
Query: 172 VVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN--------SSFSYCIGSLH-DPDYLHN 222
++FGC + +GIFG G S QL +FSYC+ + P Y
Sbjct: 115 LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCLPTDETKPGY--- 171
Query: 223 KLILG--DGAIIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
+ILG D A +D G + I+ Y +T+E + +G+ L S +++
Sbjct: 172 -MILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVT---------SSSEMIV 221
Query: 280 DSGTDVTWLVKEAYEALRDEVMI---------RLEGEQMRSY----SWPDKLCYHGIMS- 325
DSG T L + AL D+ + R + SY S D ++G ++
Sbjct: 222 DSGAQRTSLWPSTF-ALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITP 280
Query: 326 -SDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCM--AVNPASINNRYVNFSLIGMM 382
S+ P + F GGA LAL ++FY CM A NPA + ++G
Sbjct: 281 FSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPA------LRSQILGNR 334
Query: 383 AQQFYNVGYDIGRKQKTFQRMDC 405
+ + +DI KQ F+ C
Sbjct: 335 VTRSFGTTFDIQGKQFGFKYAAC 357
>gi|25347778|pir||B84556 hypothetical protein At2g17760 [imported] - Arabidopsis thaliana
Length = 473
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 101/381 (26%), Positives = 158/381 (41%), Gaps = 60/381 (15%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGT---------IFYPSRSSS 110
L Y N+++G P A+DTGS L W+ C C +C +L I+ P+ SS+
Sbjct: 54 LHYANVTVGTPSDWFMVALDTGSDLFWLPC-DCTNCVRELKAPGGSSLDLNIYSPNASST 112
Query: 111 YAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQ-LTFKNTDESTIHV 169
VPC+S C RC++ + C Y YL +S V E L + D+S+ +
Sbjct: 113 STKVPCNSTLCTR--GDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAI 170
Query: 170 -QDVVFGCGFSTNRNF----KFSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPD 218
V FGCG F +G+FGLG+ S+ S L +SFS C G+
Sbjct: 171 PARVTFGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDG--- 227
Query: 219 YLHNKLILGDGAIIDEGDATPLQFIDGH--YYITLEAISVDGRMLDINPNIFKRDDSGGG 276
++ GD +D+ + TPL H Y IT+ ISV G D+ +
Sbjct: 228 --AGRISFGDKGSVDQRE-TPLNIRQPHPTYNITVTKISVGGNTGDLEFD---------- 274
Query: 277 VMIDSGTDVTWLVKEAYEALRDEV-MIRLEGE-QMRSYSWPDKLCY---------HGIMS 325
+ DSGT T+L AY + + + L+ Q P + CY H +
Sbjct: 275 AVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALRLPLYSGHHHPN 334
Query: 326 SDLKGFPTVRFHFRGGAKLALEKDSMFYQPR-PDAFCMAVNPASINNRYVNFSLIGMMAQ 384
D +P V +GG+ + + + D +C+A+ + + S+IG
Sbjct: 335 KDSFQYPAVNLTMKGGSSYPVYHPLVVIPMKDTDVYCLAI------MKIEDISIIGQNFM 388
Query: 385 QFYNVGYDIGRKQKTFQRMDC 405
Y V +D + ++ DC
Sbjct: 389 TGYRVVFDREKLILGWKESDC 409
>gi|118482048|gb|ABK92955.1| unknown [Populus trichocarpa]
Length = 425
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 97/363 (26%), Positives = 151/363 (41%), Gaps = 34/363 (9%)
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
+ + + V ++G P A+DT + W+ C C CS T+F S+++ +
Sbjct: 85 VQSPTYIVKANVGTPAQTFLMALDTSNDAAWIPCNGCVGCS---STVFNSVTSTTFKTLG 141
Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
CD+ C+ P C C + Y G T + LT ST V FG
Sbjct: 142 CDAPQCKQVPNPTCGG--STCTWNTTY--GGSTIL----SNLTRDTIALSTDIVPGYTFG 193
Query: 176 C-GFSTNRNFKFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGA 230
C +T + G+ GLG G S +SQ S+FSYC+ S ++ L LG
Sbjct: 194 CIQKTTGSSVPPQGLLGLGRGPLSFLSQTQDLYKSTFSYCLPSFRTLNF-SGTLRLGPAG 252
Query: 231 IIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRD-DSGGGVMIDSGTDVT 286
TPL YY+ L I V +++DI + + +G G + DSGT T
Sbjct: 253 QPLRIKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFNPTTGAGTIFDSGTVFT 312
Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
LV Y A+RDE R+ + S D CY G + + PT+ F F G + L
Sbjct: 313 RLVAPVYTAVRDEFRKRVGNAIVSSLGGFDT-CYTGPIVA-----PTMTFMF-SGMNVTL 365
Query: 347 EKDSMFYQPRPDA---FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
D++ + + MA P ++N+ ++I M QQ + + +D+ + R
Sbjct: 366 PTDNLLIRSTAGSTSCLAMAAAPDNVNSV---LNVIANMQQQNHRILFDVPNSRIGVARE 422
Query: 404 DCE 406
C
Sbjct: 423 PCS 425
>gi|224057272|ref|XP_002299201.1| predicted protein [Populus trichocarpa]
gi|118483775|gb|ABK93780.1| unknown [Populus trichocarpa]
gi|222846459|gb|EEE84006.1| predicted protein [Populus trichocarpa]
Length = 425
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 97/363 (26%), Positives = 150/363 (41%), Gaps = 34/363 (9%)
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
+ + + V ++G P A+DT + W+ C C CS T+F S+++ +
Sbjct: 85 VQSPTYIVKANVGTPAQTFLMALDTSNDAAWIPCNGCVGCS---STVFNSVTSTTFKTLG 141
Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
CD+ C+ P C C + Y G T + LT ST V FG
Sbjct: 142 CDAPQCKQVPNPTCGG--STCTWNTTY--GGSTIL----SNLTRDTIALSTDIVPGYTFG 193
Query: 176 C-GFSTNRNFKFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGA 230
C +T + G+ GLG G S +SQ S+FSYC+ S ++ L LG
Sbjct: 194 CIQKTTGSSVPPQGLLGLGRGPLSFLSQTQDLYKSTFSYCLPSFRTLNF-SGTLRLGPAG 252
Query: 231 IIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVT 286
TPL YY+ L I V +++DI + +G G + DSGT T
Sbjct: 253 QPLRIKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFNPTTGAGTIFDSGTVFT 312
Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
LV Y A+RDE R+ + S D CY G + + PT+ F F G + L
Sbjct: 313 RLVAPVYTAVRDEFRKRVGNAIVSSLGGFDT-CYTGPIVA-----PTMTFMF-SGMNVTL 365
Query: 347 EKDSMFYQPRPDA---FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
D++ + + MA P ++N+ ++I M QQ + + +D+ + R
Sbjct: 366 PPDNLLIRSTAGSTSCLAMAAAPDNVNSV---LNVIANMQQQNHRILFDVPNSRIGVARE 422
Query: 404 DCE 406
C
Sbjct: 423 PCS 425
>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
Length = 441
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 80/259 (30%), Positives = 113/259 (43%), Gaps = 23/259 (8%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTI--FYPSRSSSYAYVP 115
N V++++G PP +DTGS L W+ C P + F P S ++A VP
Sbjct: 62 NVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASVP 121
Query: 116 CDSEHCRY----FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
C S CR P A C +C + Y G + ++TE T
Sbjct: 122 CGSAQCRSRDLPSPPA-CDGASKQCRVSLSYADGSSSDGALATEVFTVGQGPP-----LR 175
Query: 172 VVFGC---GFSTNRN-FKFSGIFGLGIGRSSLVSQLNS-SFSYCIGSLHDPDYL---HNK 223
FGC F T+ + +G+ G+ G S VSQ ++ FSYCI D L H+
Sbjct: 176 AAFGCMATAFDTSPDGVATAGLLGMNRGALSFVSQASTRRFSYCISDRDDAGVLLLGHSD 235
Query: 224 L-ILGDGAIIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMID 280
L L A PL + D Y + L I V G+ L I ++ D +G G M+D
Sbjct: 236 LPFLPLNYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTMVD 295
Query: 281 SGTDVTWLVKEAYEALRDE 299
SGT T+L+ +AY AL+ E
Sbjct: 296 SGTQFTFLLGDAYSALKAE 314
>gi|190896584|gb|ACE96805.1| aspartyl protease [Populus tremula]
gi|190896586|gb|ACE96806.1| aspartyl protease [Populus tremula]
gi|190896588|gb|ACE96807.1| aspartyl protease [Populus tremula]
gi|190896590|gb|ACE96808.1| aspartyl protease [Populus tremula]
gi|190896592|gb|ACE96809.1| aspartyl protease [Populus tremula]
gi|190896594|gb|ACE96810.1| aspartyl protease [Populus tremula]
gi|190896596|gb|ACE96811.1| aspartyl protease [Populus tremula]
gi|190896598|gb|ACE96812.1| aspartyl protease [Populus tremula]
gi|190896600|gb|ACE96813.1| aspartyl protease [Populus tremula]
gi|190896602|gb|ACE96814.1| aspartyl protease [Populus tremula]
gi|190896604|gb|ACE96815.1| aspartyl protease [Populus tremula]
gi|190896606|gb|ACE96816.1| aspartyl protease [Populus tremula]
gi|190896610|gb|ACE96818.1| aspartyl protease [Populus tremula]
gi|190896612|gb|ACE96819.1| aspartyl protease [Populus tremula]
gi|190896614|gb|ACE96820.1| aspartyl protease [Populus tremula]
gi|190896616|gb|ACE96821.1| aspartyl protease [Populus tremula]
gi|190896618|gb|ACE96822.1| aspartyl protease [Populus tremula]
gi|190896620|gb|ACE96823.1| aspartyl protease [Populus tremula]
gi|190896622|gb|ACE96824.1| aspartyl protease [Populus tremula]
gi|190896624|gb|ACE96825.1| aspartyl protease [Populus tremula]
gi|190896626|gb|ACE96826.1| aspartyl protease [Populus tremula]
gi|190896628|gb|ACE96827.1| aspartyl protease [Populus tremula]
gi|190896630|gb|ACE96828.1| aspartyl protease [Populus tremula]
gi|190896632|gb|ACE96829.1| aspartyl protease [Populus tremula]
gi|190896634|gb|ACE96830.1| aspartyl protease [Populus tremula]
gi|190896636|gb|ACE96831.1| aspartyl protease [Populus tremula]
gi|190896638|gb|ACE96832.1| aspartyl protease [Populus tremula]
gi|190896640|gb|ACE96833.1| aspartyl protease [Populus tremula]
gi|190896642|gb|ACE96834.1| aspartyl protease [Populus tremula]
gi|190896644|gb|ACE96835.1| aspartyl protease [Populus tremula]
gi|190896646|gb|ACE96836.1| aspartyl protease [Populus tremula]
gi|190896648|gb|ACE96837.1| aspartyl protease [Populus tremula]
gi|190896650|gb|ACE96838.1| aspartyl protease [Populus tremula]
gi|190896652|gb|ACE96839.1| aspartyl protease [Populus tremula]
gi|190896654|gb|ACE96840.1| aspartyl protease [Populus tremula]
gi|190896656|gb|ACE96841.1| aspartyl protease [Populus tremula]
gi|190896658|gb|ACE96842.1| aspartyl protease [Populus tremula]
Length = 339
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 91/323 (28%), Positives = 131/323 (40%), Gaps = 32/323 (9%)
Query: 32 RLAYLQEKIRSHNTYQAQILPSNDISNSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHCY 90
RL YL + T I P + YV + +G P F +DT + WV C
Sbjct: 16 RLKYL-STLADQKTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSNDAAWVPCS 74
Query: 91 PCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAY-KHRCIYTQLYLIGPETS 149
C CS T F P+ S++ + C C C A C++ Q Y G ++S
Sbjct: 75 GCTGCS---STTFLPNASTTLGSLDCSEAQCSQVRGFSCPATGSSACLFNQSY--GGDSS 129
Query: 150 VFVSTEQ--LTFKNTDESTIHVQDVVFGC-GFSTNRNFKFSGIFGLGIGRSSLVSQLNS- 205
+ + Q +T N + FGC + + G+ GLG G SL+SQ +
Sbjct: 130 LAATLVQDAITLAND-----VIPGFTFGCINAVSGGSIPPQGLLGLGRGPISLISQAGAM 184
Query: 206 ---SFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH----YYITLEAISVDG 258
FSYC+ S Y L LG TPL + H YY+ L +SV
Sbjct: 185 YSGVFSYCLPSFKS-YYFSGSLKLGPVGQPKSIRTTPL-LRNPHRPSLYYVNLTGVSVGR 242
Query: 259 RMLDINPNIFKRD-DSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK 317
+ I D ++G G +IDSGT +T V+ Y A+RDE ++ G + S D
Sbjct: 243 IKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNGP-ISSLGAFDT 301
Query: 318 LCYHGIMSSDLKGFPTVRFHFRG 340
C+ ++ P V HF G
Sbjct: 302 -CFAATNEAEA---PAVTLHFEG 320
>gi|224133616|ref|XP_002327639.1| predicted protein [Populus trichocarpa]
gi|222836724|gb|EEE75117.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 84/294 (28%), Positives = 130/294 (44%), Gaps = 49/294 (16%)
Query: 57 SNSLF-----YVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQL----GT----IF 103
SN LF Y N+S+G P V A+DTGS+LLW+ C C C L GT I+
Sbjct: 53 SNGLFGYILHYANVSVGTPSVSFLVALDTGSNLLWLPC-DCSSCVHSLRSPSGTVDLNIY 111
Query: 104 YPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLI-GPETSVFVSTEQLTFKNT 162
P+ SS+ VPC+S C RC + + C Y +YL G T+ ++ + L +
Sbjct: 112 SPNTSSTSEKVPCNSTLCSQTQRDRCPSDQSNCPYQVVYLSNGTSTTGYIVQDLLHLISD 171
Query: 163 DESTIHV-QDVVFGCGFSTNRNF----KFSGIFGLGIGRSSLVSQL------NSSFSYCI 211
D + V + FGCG +F +G+FGLG+ S+ S L + SFS C
Sbjct: 172 DSQSKAVDAKITFGCGKVQTGSFLTGGAPNGLFGLGMSNISVPSTLAHNGYTSGSFSMCF 231
Query: 212 GSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPN 266
P+ + ++ GD +G+ + F G Y I++ S+ G+ D+
Sbjct: 232 ----SPNGI-GRISFGDKGSTGQGETS---FNQGQPRSSLYNISITQTSIGGQASDL--- 280
Query: 267 IFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCY 320
++ + DSGT T+L AY + + ++ + S P CY
Sbjct: 281 VYS-------AIFDSGTSFTYLNDPAYTLIAESFNKLVKETRRSSTQVPFDYCY 327
>gi|148910602|gb|ABR18371.1| unknown [Picea sitchensis]
Length = 446
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 105/430 (24%), Positives = 175/430 (40%), Gaps = 61/430 (14%)
Query: 12 NNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPP 71
N+ + A V R N RL Q I S L N + L+YV + +G P
Sbjct: 38 NDEEGSKASFVSRDTNRIGRRLQAHQTAIFS--------LKGNVVPYGLYYVTMLVGNPS 89
Query: 72 VPQFTAMDTGSSLLWVHC-YPCRDCSP--------QLGTIFYPSRSSSYAYVPCDSEHCR 122
P F +D+GS L W+ C PC C+ + G++ PS+ A V S H
Sbjct: 90 KPYFLDVDSGSELTWIQCDAPCISCAKGPHPLYKLKKGSLV-PSKDPLCAAVQAGSGH-- 146
Query: 123 YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNR 182
Y RC Y Y + F+ + + T++ T+ + VFGCG++
Sbjct: 147 ---YHNHKEASQRCDYDVAYADHGYSEGFLVRDSVRALLTNK-TVLTANSVFGCGYNQRE 202
Query: 183 NFKFS-----GIFGLGIGRSSLVSQ------LNSSFSYCI-GSLHDPDYLHNKLILGDGA 230
+ S GI GLG G +SL SQ + + +CI G+ D Y + GD
Sbjct: 203 SLPVSDARTDGILGLGSGMASLPSQWAKQGLIKNVIGHCIFGAGRDGGY----MFFGD-D 257
Query: 231 IIDEGDATPLQFID----GHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVT 286
++ T + + HYY+ ++ + LD + + K GG++ DSG+ T
Sbjct: 258 LVSTSAMTWVPMLGRPSIKHYYVGAAQMNFGNKPLDKDGDGKKL----GGIIFDSGSTYT 313
Query: 287 WLVKEAYEALRDEVMIRLEGEQMR--------SYSWPDKLCYHGIMSSDLKGFP-TVRFH 337
+ +AY A V L G+Q+ S W K + + + P T++F
Sbjct: 314 YFTNQAYGAFLSVVKENLSGKQLEQDSSDSFLSLCWRRKEGFRSVAEAAAYFKPLTLKFR 373
Query: 338 FRGGAKLALEKDSMFYQPRPDAFCMAV-NPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
++ + + + C+ + N +I V+ +++G ++ Q V YD +
Sbjct: 374 STKTKQMEIFPEGYLVVNKKGNVCLGILNGTAIG--IVDTNVLGDISFQGQLVVYDNEKN 431
Query: 397 QKTFQRMDCE 406
Q + R DC+
Sbjct: 432 QIGWARSDCQ 441
>gi|340810915|gb|AEK75384.1| S5 [Oryza sativa]
gi|340810917|gb|AEK75385.1| S5 [Oryza sativa]
gi|340810919|gb|AEK75386.1| S5 [Oryza sativa]
gi|340810927|gb|AEK75390.1| S5 [Oryza sativa]
gi|340810975|gb|AEK75414.1| S5 [Oryza nivara]
gi|340810979|gb|AEK75416.1| S5 [Oryza nivara]
gi|340810995|gb|AEK75424.1| S5 [Oryza nivara]
gi|340811027|gb|AEK75440.1| S5 [Oryza nivara]
gi|340811063|gb|AEK75458.1| S5 [Oryza nivara]
Length = 357
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 110/383 (28%), Positives = 162/383 (42%), Gaps = 66/383 (17%)
Query: 63 VNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DC---SPQLGTIFYPSRSSSYAYVPCDS 118
+ +S+G+PPV A+DTGS+L WV C PC C S + G IF P RS + V C S
Sbjct: 1 MAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVRCSS 60
Query: 119 EHC---RY---FPYARCSAYKHRCIYTQLYLIGPETSV-FVSTEQLTFKNTDESTIHVQD 171
C RY A C + C Y+ Y G SV + T+ L ++ D
Sbjct: 61 VKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS------FMD 114
Query: 172 VVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN--------SSFSYCIGSLH-DPDYLHN 222
++FGC + +GIFG G S QL +FSYC+ + P Y
Sbjct: 115 LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCLPTDETKPGY--- 171
Query: 223 KLILG--DGAIIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
+ILG D A +D G + I+ Y +T+E + +G+ L S +++
Sbjct: 172 -MILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVT---------SSSEMIV 221
Query: 280 DSGTDVTWLVKEAYEALRDEVMI---------RLEGEQMRSY----SWPDKLCYHGIMS- 325
DSG T L + AL D+ + R + SY S D ++G ++
Sbjct: 222 DSGAQRTSLWPSTF-ALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITP 280
Query: 326 -SDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCM--AVNPASINNRYVNFSLIGMM 382
S+ P + F GGA LAL ++FY CM A NPA + ++G
Sbjct: 281 FSNWSALPLLEIGFAGGAALALSPRNVFYNDPHRGLCMTFAQNPA------LRSQILGNR 334
Query: 383 AQQFYNVGYDIGRKQKTFQRMDC 405
+ + +DI KQ F+ C
Sbjct: 335 VTRSFGTTFDIQGKQFGFKYAAC 357
>gi|326513976|dbj|BAJ92138.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 342
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 72/256 (28%), Positives = 127/256 (49%), Gaps = 21/256 (8%)
Query: 167 IHVQDVV-FGCG-FSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNK 223
+HV+ + FGCG S SG+ GL G SL+SQL+ FSYC+ + +
Sbjct: 88 VHVRRALGFGCGALSAGSLVGASGLMGLSPGTMSLISQLSVPRFSYCLTPFAERKT--SP 145
Query: 224 LILGDGAIIDEGDAT-PLQ--------FIDG-HYYITLEAISVDGRMLDI-NPNIFKRDD 272
++ G A + + + T P+Q +D +YY+ L +S+ + L + ++ D
Sbjct: 146 MLFGAMADLRKYNTTGPIQTTAILRNPAMDTFYYYVPLVGLSLGTKRLRVPAASLAINPD 205
Query: 273 SGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCY---HGIMSSDLK 329
GG ++DSG+ + L +A++A++ V+ ++ +LC+ G+ + +K
Sbjct: 206 GTGGTIVDSGSTMAHLAGKAFDAVKKAVLEAVKLPVFNGTVEDYELCFAVPSGVAMAAVK 265
Query: 330 GFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNV 389
P V HF GGA +AL +D+ F +PR C+AV S + S+IG + QQ +V
Sbjct: 266 TPPLV-LHFDGGAAMALPRDNYFQEPRAGLMCLAVA-RSPEDLGAPISIIGNVQQQNMHV 323
Query: 390 GYDIGRKQKTFQRMDC 405
+D+ ++ +F C
Sbjct: 324 LFDVHNQKFSFAPTKC 339
>gi|340811098|gb|AEK75475.1| S5 [Oryza nivara]
Length = 357
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 109/383 (28%), Positives = 160/383 (41%), Gaps = 66/383 (17%)
Query: 63 VNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DC---SPQLGTIFYPSRSSSYAYVPCDS 118
+ +S+G+PPV A+DTGS+L WV C PC C S + G IF P RS + V C S
Sbjct: 1 MAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVRCSS 60
Query: 119 EHCRYFPY------ARCSAYKHRCIYTQLYLIGPETSV-FVSTEQLTFKNTDESTIHVQD 171
C Y A C + C Y+ Y G SV + T+ L ++ D
Sbjct: 61 VKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS------FMD 114
Query: 172 VVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN--------SSFSYCIGSLH-DPDYLHN 222
++FGC + +GIFG G S QL +FSYC+ + P Y
Sbjct: 115 LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCLPTDETKPGY--- 171
Query: 223 KLILG--DGAIIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
+ILG D A +D G + I+ Y +T E + +G+ L S +++
Sbjct: 172 -MILGRYDRAAMDGGYTPLFRSINRPTYSLTTEMLIANGQRLVT---------SSSEMIV 221
Query: 280 DSGTDVTWLVKEAYEALRDEVMI---------RLEGEQMRSY----SWPDKLCYHGIMS- 325
DSG T L + AL D+ + R + SY S D ++G ++
Sbjct: 222 DSGAQRTSLWPSTF-ALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITP 280
Query: 326 -SDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCM--AVNPASINNRYVNFSLIGMM 382
S+ P + F GGA LAL ++FY CM A NPA + ++G
Sbjct: 281 FSNWSALPLLEIGFAGGAALALSPRNVFYNDPHRGLCMTFAQNPA------LRSQILGNR 334
Query: 383 AQQFYNVGYDIGRKQKTFQRMDC 405
+ + +DI KQ F+ C
Sbjct: 335 VTRSFGTTFDIQGKQFGFKYAAC 357
>gi|222628951|gb|EEE61083.1| hypothetical protein OsJ_14969 [Oryza sativa Japonica Group]
Length = 367
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 77/280 (27%), Positives = 117/280 (41%), Gaps = 28/280 (10%)
Query: 22 VQRGINISIARLAYLQEKIRSHNTYQAQILPSNDI--SNSLFYVNISIGQPPVPQFTAMD 79
++R I S RLA + + + ++ I + + V + IG PP A+D
Sbjct: 48 LRRAIQRSRYRLAGIGMARGEAASARKAVVAETPIMPAGGEYLVKLGIGTPPYKFTAAID 107
Query: 80 TGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCS-AYKHRCIY 138
T S L+W C PC C Q+ +F P SS+YA +PC S+ C RC C Y
Sbjct: 108 TASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHDDDESCQY 167
Query: 139 TQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF---KFSGIFGLGIG 195
T Y T ++ ++L + V FGC S+ + SG+ GLG G
Sbjct: 168 TYTYSGNATTEGTLAVDKLVIGED-----AFRGVAFGCSTSSTGGAPPPQASGVVGLGRG 222
Query: 196 RSSLVSQLN-----------SSFSYCIGSLHDP--DYLHNKLILGDGAIIDEGDATPLQF 242
SLVSQL+ S+ ++ SL+D + L ++ L G G
Sbjct: 223 PLSLVSQLSVRRYGMIIDIASTITFLEASLYDELVNDLEVEIRLPRGTGSSLGLDLCFIL 282
Query: 243 IDG----HYYITLEAISVDGRMLDINPNIFKRDDSGGGVM 278
DG Y+ A++ DGR L ++ +D G+M
Sbjct: 283 PDGVAFDRVYVPAVALAFDGRWLRLDKARLFAEDRESGMM 322
>gi|190896608|gb|ACE96817.1| aspartyl protease [Populus tremula]
Length = 339
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 91/323 (28%), Positives = 131/323 (40%), Gaps = 32/323 (9%)
Query: 32 RLAYLQEKIRSHNTYQAQILPSNDISNSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHCY 90
RL YL + T I P + YV + +G P F +DT + WV C
Sbjct: 16 RLKYL-STLADQKTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSNDAAWVPCS 74
Query: 91 PCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAY-KHRCIYTQLYLIGPETS 149
C CS T F P+ S++ + C C C A C++ Q Y G ++S
Sbjct: 75 GCTGCS---STTFLPNASTTLGSLDCSEAQCSQVRGFSCPATGSSACLFNQSY--GGDSS 129
Query: 150 VFVSTEQ--LTFKNTDESTIHVQDVVFGC-GFSTNRNFKFSGIFGLGIGRSSLVSQLNS- 205
+ + Q +T N + FGC + + G+ GLG G SL+SQ +
Sbjct: 130 LAATLVQDAITLAND-----VIPGFTFGCINAVSGGSIPPQGLLGLGRGPISLISQAGAM 184
Query: 206 ---SFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH----YYITLEAISVDG 258
FSYC+ S Y L LG TPL + H YY+ L +SV
Sbjct: 185 YSGVFSYCLPSFKS-YYFSGSLKLGPVGQPKSIRTTPL-LRNPHRPSLYYVNLTGVSVGR 242
Query: 259 RMLDINPNIFKRD-DSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK 317
+ I D ++G G +IDSGT +T V+ Y A+RDE ++ G + S D
Sbjct: 243 IKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNGP-ISSLGAFDT 301
Query: 318 LCYHGIMSSDLKGFPTVRFHFRG 340
C+ ++ P V HF G
Sbjct: 302 -CFAETNEAEA---PAVTLHFEG 320
>gi|340810959|gb|AEK75406.1| S5 [Oryza sativa]
gi|340810971|gb|AEK75412.1| S5 [Oryza rufipogon]
Length = 357
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 110/383 (28%), Positives = 162/383 (42%), Gaps = 66/383 (17%)
Query: 63 VNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DC---SPQLGTIFYPSRSSSYAYVPCDS 118
+ +S+G+PPV A+DTGS+L WV C PC C S + G IF P RS + V C S
Sbjct: 1 MAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVRCSS 60
Query: 119 EHC---RY---FPYARCSAYKHRCIYTQLYLIGPETSV-FVSTEQLTFKNTDESTIHVQD 171
C RY A C + C Y+ Y G SV + T+ L ++ D
Sbjct: 61 VKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS------FMD 114
Query: 172 VVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN--------SSFSYCIGSLH-DPDYLHN 222
++FGC + +GIFG G S QL +FSYC+ + P Y
Sbjct: 115 LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCLPTDETKPGY--- 171
Query: 223 KLILG--DGAIIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
+ILG D A +D G + I+ Y +T+E + +G+ L S +++
Sbjct: 172 -MILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVT---------SSSEMIV 221
Query: 280 DSGTDVTWLVKEAYEALRDEVMI---------RLEGEQMRSY----SWPDKLCYHGIMS- 325
DSG T L + AL D+ + R + SY S D ++G ++
Sbjct: 222 DSGAQRTSLWPSTF-ALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITP 280
Query: 326 -SDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCM--AVNPASINNRYVNFSLIGMM 382
S+ P + F GGA LAL ++FY CM A NPA + ++G
Sbjct: 281 FSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPA------LRSQILGNR 334
Query: 383 AQQFYNVGYDIGRKQKTFQRMDC 405
+ + +DI KQ F+ C
Sbjct: 335 VTRSFGTTFDIQGKQFGFKYAAC 357
>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
Length = 388
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 93/386 (24%), Positives = 161/386 (41%), Gaps = 52/386 (13%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
L++ + +G P +DTGS +LWV+C PC C + T++ P SS+ + V
Sbjct: 1 LYFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSLV 60
Query: 115 PCDSEHC---RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIH--V 169
C C R F A+CS + C Y Y G + + + + + + +
Sbjct: 61 SCSDPLCVRGRRFAEAQCSQATNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANTT 120
Query: 170 QDVVFGCGFSTNRNFKFS-----GIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPD 218
V+FGC + S GI G G S+ +QL + FS+C+
Sbjct: 121 SQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLEGEKRGG 180
Query: 219 YLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVM 278
+ + + + TPL HY + L ISV+ L I+ F + GV+
Sbjct: 181 GILVIGGIAEPGMT----YTPLVPDSVHYNVVLRGISVNSNRLPIDAEDFSSTND-TGVI 235
Query: 279 IDSGTDVTWLVKEAY----EALRDEVM---IRLEGEQMRSYSWPDKLCYHGIMSSDLKGF 331
+DSGT + + AY +A+R+ +R++G + + +L SDL F
Sbjct: 236 MDSGTTLAYFPSGAYNVFVQAIREATSATPVRVQGMDTQCFLVSGRL-------SDL--F 286
Query: 332 PTVRFHFRGGAKLALEKDS--MFYQPRP----DAFCMAVNPASIN---NRYVNFSLIGMM 382
P V +F GGA + L+ D+ M+ P D +C+ +S + +++G +
Sbjct: 287 PNVTLNFEGGA-MELQPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDI 345
Query: 383 AQQFYNVGYDIGRKQKTFQRMDCEVL 408
+ V YD+ + + +C+ L
Sbjct: 346 VLKDKLVVYDLDNSRIGWMSYNCKFL 371
>gi|226495123|ref|NP_001141522.1| uncharacterized protein LOC100273634 precursor [Zea mays]
gi|194704920|gb|ACF86544.1| unknown [Zea mays]
gi|223949445|gb|ACN28806.1| unknown [Zea mays]
gi|413924531|gb|AFW64463.1| pepsin A [Zea mays]
Length = 515
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 97/388 (25%), Positives = 160/388 (41%), Gaps = 57/388 (14%)
Query: 52 PSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG---------TI 102
P ND+ L+Y + +G P A+DTGS L WV C C C+P G I
Sbjct: 88 PGNDL-GWLYYAWVDVGTPATSFLVALDTGSDLFWVPC-DCIQCAPLSGYRGNLDRDLRI 145
Query: 103 FYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT 162
+ P+ S++ ++PC E C+ P C+ K C Y Y TS + E N
Sbjct: 146 YRPAESTTSRHLPCSHELCQSVP--GCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNY 203
Query: 163 DESTIHVQ-DVVFGCGFSTNRNF----KFSGIFGLGIGRSSLVSQL------NSSFSYCI 211
E + V V+ GCG + ++ G+ GLG+ S+ S L +SFS C
Sbjct: 204 REDHVPVNASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQNSFSMCF 263
Query: 212 GSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLD---INPNIF 268
+ ++ GD + + +TP F+ + + A++VD + + F
Sbjct: 264 K-----EDSSGRIFFGDQGVPSQ-QSTP--FVPLYGKLQTYAVNVDKSCIGHKCLEGTSF 315
Query: 269 KRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDL 328
K ++DSGT T L + Y+A E ++ ++ K CY ++
Sbjct: 316 K-------ALVDSGTSFTSLPFDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSA-SPLEM 367
Query: 329 KGFPTVRFHFRGGAKLALEKDSMFYQPRPDA---FCMAVNPASINNRYVNFSLIGMMAQQ 385
PT+ F L + + + A FC+AV P++ IG++AQ
Sbjct: 368 PDVPTITLTFAADKSLQAVNPILPFNDKQGALAGFCLAVLPST--------EPIGIIAQN 419
Query: 386 F---YNVGYDIGRKQKTFQRMDCEVLDD 410
F Y+V +D + + R +C ++D
Sbjct: 420 FLVGYHVVFDRESMKLGWYRSECRYVED 447
>gi|108862257|gb|ABA96612.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 451
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 99/385 (25%), Positives = 163/385 (42%), Gaps = 54/385 (14%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC-YPCRDCSPQLGTIFYPSRSSSYAYVPC 116
+ L+YV +SIG PP P F +DTGS L W+ C PC CS ++ P+++ VPC
Sbjct: 55 HGLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLYRPTKNK---LVPC 111
Query: 117 DSEHCRYFPYA-----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
+ C +C + K +C Y Y + + T+ + + S +
Sbjct: 112 VDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRLANSSIVR-PG 170
Query: 172 VVFGCGF-----STNRNFKFSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYL 220
+ FGCG+ S+ G+ GLG G SL+SQL + +C+ +
Sbjct: 171 LAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTRGG---- 226
Query: 221 HNKLILGDGAIIDEGDAT--PLQFIDGHYYITLEAISV--DGRMLDINPNIFKRDDSGGG 276
L GD I+ AT P+ Y + + ++ GR L + P
Sbjct: 227 -GFLFFGD-DIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPM---------E 275
Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEG--EQMRSYSWPDKLCYHGI--MSSDL---K 329
V+ DSG+ T+ + Y+AL D + L +++ +S P LC+ G S L K
Sbjct: 276 VVFDSGSSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLP--LCWKGKKPFKSVLDVKK 333
Query: 330 GFPTVRFHFRGGAKLALE---KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQF 386
F TV F G K +E ++ + +A +N + + + +N ++G + Q
Sbjct: 334 EFRTVVLSFSNGKKALMEIPPENYLIVTKYGNACLGILNGSEVGLKDLN--IVGDITMQD 391
Query: 387 YNVGYDIGRKQKTFQRMDCEVLDDD 411
V YD R Q + R C+ + +D
Sbjct: 392 QMVIYDNERGQIGWIRAPCDRIPND 416
>gi|297723027|ref|NP_001173877.1| Os04g0336942 [Oryza sativa Japonica Group]
gi|255675342|dbj|BAH92605.1| Os04g0336942 [Oryza sativa Japonica Group]
Length = 388
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 85/302 (28%), Positives = 130/302 (43%), Gaps = 36/302 (11%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTI----FYPSRSS-SYAYV 114
L+Y +I IG P V + +DTGS WV+ C+ C + + FY RSS S V
Sbjct: 82 LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEV 141
Query: 115 PCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFK---NTDESTIHVQD 171
CD C P + RC Y Y G T + T+ L + ++
Sbjct: 142 KCDDTICTSRPPCNMTL---RCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTS 198
Query: 172 VVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYL 220
V FGCG N GI G G + +SQL ++ FS+C+ D
Sbjct: 199 VTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCL------DST 252
Query: 221 HNKLILGDGAIID-EGDATPLQFIDGHYY-ITLEAISVDGRMLDINPNIFKRDDSGGGVM 278
+ I G +++ + TP+ + Y+ + L++I+V G L + NIF + G
Sbjct: 253 NGGGIFAIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKT-KGTF 311
Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS-YSWPDKLCYHGIMSSDLKGFPTVRFH 337
IDSG+ + +L + Y L V + M + Y++ C+H + S D K FP + FH
Sbjct: 312 IDSGSTLVYLPEIIYSELILAVFAKHPDITMGAMYNFQ---CFHFLGSVDDK-FPKITFH 367
Query: 338 FR 339
F
Sbjct: 368 FE 369
>gi|242074844|ref|XP_002447358.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
gi|241938541|gb|EES11686.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
Length = 497
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 108/395 (27%), Positives = 162/395 (41%), Gaps = 54/395 (13%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHC---YPCRDCSPQLGT---IFYPSRSSSYAYV 114
+ S+G PP P +DTGS L WV C Y CR+CS +F+P SSS V
Sbjct: 103 YAFTASLGTPPQPLPVLLDTGSQLTWVPCTSNYDCRNCSSPFAAAVPVFHPKNSSSSRLV 162
Query: 115 PCDSEHCRYF----PYARCSAYKHRCI-YTQLYLIGPETSVFV---STEQLTFKNTDEST 166
C + C + A+C A R T + P +V ST L +T +
Sbjct: 163 GCRNPSCLWVHSAEHVAKCRAPCSRGANCTPASNVCPPYAVVYGSGSTAGLLIADTLRAP 222
Query: 167 IH-VQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGS--LHDPDYLHN 222
V V GC + SG+ G G G S+ +QL S FSYC+ S D +
Sbjct: 223 GRAVSGFVLGCSLVSVHQ-PPSGLAGFGRGAPSVPAQLGLSKFSYCLLSRRFDDNAAVSG 281
Query: 223 KLILG---DG------AIIDEGDATPLQFIDGHYYITLEAISVDGRMLDI-NPNIFKRDD 272
L+LG DG GD P +YY+ L ++V G+ + +
Sbjct: 282 SLVLGGDNDGMQYVPLVKSAAGDKQPYAV---YYYLALSGVTVGGKAVRLPARAFAANAA 338
Query: 273 SGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL----CYHGIMSSDL 328
GG ++DSGT T+L ++ + D V+ + G RS + L C+ +
Sbjct: 339 GSGGAIVDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDVEEGLGLHPCFALPQGAKS 398
Query: 329 KGFPTVRFHFRGGAKLALEKDSMFY----QPRP---------DAFCMAV-----NPASIN 370
P + HF+GGA + L ++ F P P +A C+AV + +
Sbjct: 399 MALPELSLHFKGGAVMQLPLENYFVVAGRAPVPGAGAGAGAAEAICLAVVTDFGGSGAGD 458
Query: 371 NRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
++G QQ Y V YD+ +++ F+R C
Sbjct: 459 EGGGPAIILGSFQQQNYLVEYDLEKERLGFRRQPC 493
>gi|224142001|ref|XP_002324349.1| predicted protein [Populus trichocarpa]
gi|222865783|gb|EEF02914.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 105/377 (27%), Positives = 151/377 (40%), Gaps = 37/377 (9%)
Query: 48 AQILPSND---ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIF 103
+ +P+ D + + + V + +G P DTGS + W C PC R C Q IF
Sbjct: 133 STTIPAKDGSTVGSGNYIVTVGLGTPKKDLSLIFDTGSDITWTQCQPCARSCYKQKEQIF 192
Query: 104 YPSRSSSYAYVPCDSEHCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQLT 158
PS+S+SY + C S C A C++ C+Y Y + F TE+LT
Sbjct: 193 DPSQSTSYTNISCSSSICNSLTSATGNTPGCAS--SACVYGIQYGDSSFSVGFFGTEKLT 250
Query: 159 FKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRS--SLVSQL----NSSFSYCIG 212
+TD ++ FGCG N+ LG+GR S+VSQ N FSYC
Sbjct: 251 LTSTDA----FNNIYFGCG-QNNQGLFGGSAGLLGLGRDKLSVVSQTAQKYNKIFSYC-- 303
Query: 213 SLHDPDYLHNKLILGDGAIIDEGDATPLQFIDG---HYYITLEAISVDGRMLDINPNIFK 269
L L G G+ TPL I Y + ISV G+ L I+ ++F
Sbjct: 304 -LPSSSSSTGFLTFG-GSASKNAKFTPLSTISAGPSFYGLDFTGISVGGKKLAISASVF- 360
Query: 270 RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLK 329
S G +IDSGT +T L AY ALR + M CY S
Sbjct: 361 ---STAGAIIDSGTVITRLPPAAYSALRASFRNLMSKYPMTKALSILDTCYD-FSSYTTI 416
Query: 330 GFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNV 389
P + F F G ++ ++ + Y C+A + N+ + + G + Q+ V
Sbjct: 417 SVPKIGFSFSSGIEVDIDATGILYASSLSQVCLAF---AGNSDATDVFIFGNVQQKTLEV 473
Query: 390 GYDIGRKQKTFQRMDCE 406
YD + F C
Sbjct: 474 FYDGSAGKVGFAPGGCS 490
>gi|195619700|gb|ACG31680.1| pepsin A [Zea mays]
Length = 485
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 98/390 (25%), Positives = 162/390 (41%), Gaps = 61/390 (15%)
Query: 52 PSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG---------TI 102
P ND+ L+Y + +G P A+DTGS L WV C C C+P G I
Sbjct: 58 PGNDL-GWLYYAWVDVGTPATSFLVALDTGSDLFWVPC-DCIQCAPLSGYRGNLDRDLRI 115
Query: 103 FYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT 162
+ P+ S++ ++PC E C+ P C+ K C Y Y TS + E N
Sbjct: 116 YRPAESTTSRHLPCSHELCQSVP--GCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNY 173
Query: 163 DESTIHVQ-DVVFGCGFSTNRNF----KFSGIFGLGIGRSSLVSQL------NSSFSYCI 211
E + V V+ GCG + ++ G+ GLG+ S+ S L +SFS C
Sbjct: 174 REDHVPVNASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQNSFSMCF 233
Query: 212 GSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLD---INPNIF 268
+ ++ GD + + +TP F+ + + A++VD + + F
Sbjct: 234 K-----EDSSGRIFFGDQGVPSQ-QSTP--FVPLYGKLQTYAVNVDKSCIGHKCLEGTSF 285
Query: 269 KRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQM--RSYSWPDKLCYHGIMSS 326
K ++DSGT T L + Y+A E ++ ++ +W K CY
Sbjct: 286 K-------ALVDSGTSFTSLPLDVYKAFTMEFDKQMNATRVPYEDTTW--KYCYSA-SPL 335
Query: 327 DLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDA---FCMAVNPASINNRYVNFSLIGMMA 383
++ PT+ F L + + + A FC+AV P++ IG++A
Sbjct: 336 EMPDVPTITLTFAADKSLQAVNPILPFNDKQGALAGFCLAVLPST--------EPIGIIA 387
Query: 384 QQF---YNVGYDIGRKQKTFQRMDCEVLDD 410
Q F Y+V +D + + R +C ++D
Sbjct: 388 QNFLVGYHVVFDRESMKLGWYRSECHDVED 417
>gi|388495448|gb|AFK35790.1| unknown [Medicago truncatula]
Length = 434
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 95/360 (26%), Positives = 141/360 (39%), Gaps = 28/360 (7%)
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
I + + V G PP A+DT S W+ C C CS F P +S+S+ V
Sbjct: 92 IQSPTYIVKAKFGTPPQTLLLALDTSSDAAWIPCSGCVGCS--TSKPFAPIKSTSFRNVS 149
Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
C S HC+ P C C + Y +S+ S Q T + + FG
Sbjct: 150 CGSPHCKQVPNPTCGG--SACAFNFTYG---SSSIAASVVQDTLTLAADP---IPGYTFG 201
Query: 176 C-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGA 230
C G S + G S + S+FSYC+ S ++ L LG
Sbjct: 202 CVNKTTGSSAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCLPSFKSINF-SGSLRLGPVY 260
Query: 231 IIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVT 286
TPL YY+ L AI V +++DI P +G G + DSGT T
Sbjct: 261 QPKRIKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTTGAGTIFDSGTVFT 320
Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
L + Y A+R+E R+ + + CY+ + PT+ F F G +AL
Sbjct: 321 RLAEPVYTAVRNEFRRRVGPKLPVTTLGGFDTCYNVPIV-----VPTITFLF-SGMNVAL 374
Query: 347 EKDSM-FYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
D++ + C+A+ A N V ++I M QQ + V +D+ + R C
Sbjct: 375 PPDNIVIHSTAGSTTCLAMAGAPDNVNSV-LNVIANMQQQNHRVLFDVPNSRIGIARELC 433
>gi|357492303|ref|XP_003616440.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517775|gb|AES99398.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 521
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 97/412 (23%), Positives = 162/412 (39%), Gaps = 83/412 (20%)
Query: 30 IARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC 89
+ ++++L+ K + + LP N V++++G PP +DTGS L W+HC
Sbjct: 7 VVQISFLKVKTLPQTSLSPRKLPFQH--NVTLTVSLTVGSPPQRVTMVLDTGSELSWLHC 64
Query: 90 YPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR-----YFPYARCSAYKHRCIYTQLYLI 144
P L IF P SSSY PC S C C A K C ++
Sbjct: 65 KKL----PNLNFIFNPLVSSSYTPTPCTSPICTTQTRDLINPVSCDANK-LCHIITFFVG 119
Query: 145 GPETSVFVSTEQLTFKNTDESTIHVQDVVFGC----GFSTNRNFKFSGIFGLGIGRSSLV 200
GP + +VFGC S + + K +G+ G+ +G S
Sbjct: 120 GPAQ---------------------RGMVFGCMDTGTSSGDEDSKTTGLMGMDLGSLSFS 158
Query: 201 SQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAIS---- 255
+Q+ FSYCI + G ++ E A P + HY ++ +
Sbjct: 159 NQMRLPKFSYCISNKDS-----------TGVLVLENIANPPRLGPLHYTPLVKKTTPLPY 207
Query: 256 VDGRMLDINPNIFKRDDSGGG-VMIDSGTDVTWLVKEAYEALRDEVMIRLE------GEQ 308
+ + F D +G G M+DS T T+L + Y AL++E I+ + G+
Sbjct: 208 FNRNCCLFQKSAFLPDHTGAGQTMVDSATQFTFLRQPVYTALKNEFAIQTKNILTPLGDP 267
Query: 309 MRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPAS 368
+ LC+ + S L P V F GA+L + + + Y+ V+ +
Sbjct: 268 KFVFQGVMDLCFRVPIGSTLPVLPVVTLMF-DGAELRVTGERLLYK---------VSNVA 317
Query: 369 INNRYV------NFSLIGMMA-------QQFYNVGYDIGRKQKTFQRMDCEV 407
+N ++ N L+G+ A Q+ + YD+ + F +C+V
Sbjct: 318 KSNSWIYCFTFGNSDLLGIEAFIIGHHHQRNVWMEYDLANSRIGFSDTNCDV 369
>gi|302802500|ref|XP_002983004.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
gi|300149157|gb|EFJ15813.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
Length = 332
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 91/361 (25%), Positives = 152/361 (42%), Gaps = 45/361 (12%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSE 119
++Y I++G PP MDTGS L WV C P CSP + F S++Y + C +
Sbjct: 2 VYYSTITLGSPPKDFSLVMDTGSDLTWVRCDP---CSPDCSSTFDRLASNTYKALTCADD 58
Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
+ + Y S +TQ + V T ++ +DE VFGCG
Sbjct: 59 YS--YGYGDGS-------FTQ-------GDLSVDTLKMAGAASDELE-EFPGFVFGCGSL 101
Query: 180 TNRNFKFS-GIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNK-LILGDGAI-- 231
GI L G S SQ+ + FSYC+ + L ++ G+ A+
Sbjct: 102 LKGLISGEVGILALSPGSLSFPSQIGEKYGNKFSYCLLRQTAQNSLKKSPMVFGEAAVEL 161
Query: 232 -------IDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
+ E TP+ +Y + L+ ISV + LD++P+ F + + DSGT
Sbjct: 162 KEPGSGKLQELQYTPIGESSIYYTVRLDGISVGNQRLDLSPSAF-LNGQDKPTIFDSGTT 220
Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKL 344
+T L ++++ + + G + + D C+ + S +G P + FHF GGA
Sbjct: 221 LTMLPPGVCDSIKQSLASMVSGAEFVAIKGLDA-CFR-VPPSSGQGLPDITFHFNGGADF 278
Query: 345 ALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
+ + S + C+ P + S+ G + QQ + V +D+ ++ F+ D
Sbjct: 279 -VTRPSNYVIDLGSLQCLIFVPTN------EVSIFGNLQQQDFFVLHDMDNRRIGFKETD 331
Query: 405 C 405
C
Sbjct: 332 C 332
>gi|326529727|dbj|BAK04810.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 488
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 113/447 (25%), Positives = 181/447 (40%), Gaps = 67/447 (14%)
Query: 20 HRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDI----SNSLFYVNISIGQPPVPQF 75
H + R S+AR ++R H+ QA P S + ++S+G PP P
Sbjct: 45 HPLSRLARASLAR----ASRLRGHHQGQAASSPVRAALYPHSYGGYAFSLSLGTPPQPLP 100
Query: 76 TAMDTGSSLLWVHC---YPCRDCSPQLGT--IFYPSRSSSYAYVPCDSEHCRYF------ 124
+DTGS L WV C Y C++CS G+ +F+P SSS V C S C +
Sbjct: 101 VLLDTGSHLTWVPCTSNYQCQNCSAAAGSFPVFHPKSSSSSLLVSCSSPSCLWIHSKSHL 160
Query: 125 -----PYARCSAYKHRCIYTQLYLIGPETSVF--VSTEQLTFKNT---DESTIHVQDVVF 174
A C C T + P V+ ST L +T ++
Sbjct: 161 SDCARDSAPCRPSTANCSATATNVCPPYLVVYGSGSTAGLLVSDTLRLSPRGAASRNFAV 220
Query: 175 GCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGS--LHDPDYLHNKLILGDGA- 230
GC ++ SG+ G G G S+ +QL + FSYC+ S D + +L+LG +
Sbjct: 221 GCSLASVHQ-PPSGLAGFGRGAPSVPAQLGVNKFSYCLLSRRFDDDAAISGELVLGASSA 279
Query: 231 -----------IIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFK--RDDSGGGV 277
++ A P + +YY++L I+V G+ + + GGG
Sbjct: 280 GKAKAMMQYAPLLKNAGARPPYSV--YYYLSLTGIAVGGKSVALPARALAPVSGGGGGGA 337
Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL----CYHGIMSSDLKGFPT 333
+IDSGT T+L ++ + ++ + G RS L C+ + P
Sbjct: 338 IIDSGTTFTYLDPTVFKPVAAAMVAAVGGRYNRSKDVEGALGLRPCFALPAGARTMDLPE 397
Query: 334 VRFHFRGGAKLALEKDSMFYQP------RPDAFCMAV--------NPASINNRYVNFSLI 379
+ HF GGA++ L ++ F P+A C+AV A ++ ++
Sbjct: 398 LSLHFSGGAEMRLPIENYFLAAGPASGVAPEAICLAVVSDVSSASGGAGVSGGGGPAIIL 457
Query: 380 GMMAQQFYNVGYDIGRKQKTFQRMDCE 406
G QQ Y V YD+ + + F++ C
Sbjct: 458 GSFQQQNYQVEYDLEKNRLGFRQQPCS 484
>gi|2570402|gb|AAB97155.1| EEA1 [Hordeum vulgare subsp. vulgare]
Length = 410
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 94/390 (24%), Positives = 162/390 (41%), Gaps = 66/390 (16%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYP----CRDCSPQLGTIFYPSRSSSYAYVPC 116
FY ++IG+P P F +DTGS+L W+ C+P C+ C P+ +Y V C
Sbjct: 38 FYATLNIGEPAKPYFLDVDTGSNLTWLECHPPVHGCKGCHPRPPHPYYTPADGKLKVV-C 96
Query: 117 DSEHC----RYFP-YARCSAY-KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
S C R P CS HRC Y Y+ G ++ ++T+ ++ D+ I
Sbjct: 97 GSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVTG-KSEGDLATDIISVNGRDKKRI--- 152
Query: 171 DVVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLN-------SSFSYCIGSLHDPD 218
FGCG+ + +GI GLG+G++ +QL + +C+ S
Sbjct: 153 --AFGCGYKQEEPPDSPPSPVNGILGLGMGKAGFAAQLKGLKMIKENVIGHCLSSKG--- 207
Query: 219 YLHNKLILGDGAIIDEG-DATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
L +GD G P++ +Y L + +D + + NP
Sbjct: 208 --KGVLYVGDFNPPTRGVTWAPMRESLFYYSPGLAEVFIDKQPIRGNPTF--------EA 257
Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEG-------EQMRSYSWPDKLCYHGI------- 323
+ DSG+ T + + Y +E++ ++ G E+++ + P LC+ G
Sbjct: 258 VFDSGSTYTHVPAQIY----NEIVSKVRGTFSESSLEEVKGRALP--LCWKGKKPFGSVN 311
Query: 324 -MSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINN--RYVNFSLIG 380
+ + K H RG L + + + C+A+ AS++ + +NF LIG
Sbjct: 312 DVKNQFKALSLKITHARGTNNLDIPPQNYLFVKEDGETCLAILDASLDPVLKELNFILIG 371
Query: 381 MMAQQFYNVGYDIGRKQKTFQRMDCEVLDD 410
+ Q V YD +KQ + R C+ + +
Sbjct: 372 AVTMQDLFVIYDNEKKQLGWVRAQCDRVQE 401
>gi|356567798|ref|XP_003552102.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 520
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 109/418 (26%), Positives = 169/418 (40%), Gaps = 64/418 (15%)
Query: 22 VQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTG 81
++R I + AR L SH + + ND L Y I IG P A+D G
Sbjct: 63 LRRKIKVGGARYQLL---FPSHGSKTMSL--GNDF-GWLHYTWIDIGTPSTSFLVALDAG 116
Query: 82 SSLLWVHCYPCRDCSPQLGTIFY-----------PSRSSSYAYVPCDSEHCRYFPYARCS 130
S LLW+ C C C+P L + +Y PSRS S ++ C + C + C
Sbjct: 117 SDLLWIPC-DCVQCAP-LSSSYYSNLDRDLNEYSPSRSLSSKHLSCSHQLCDKG--SNCK 172
Query: 131 AYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIH--VQ-DVVFGCGFSTNRNF--- 184
+ + +C Y YL +S + E + + S + VQ VV GCG + +
Sbjct: 173 SSQQQCPYMVSYLSENTSSSGLLVEDILHLQSGGSLSNSSVQAPVVLGCGMKQSGGYLDG 232
Query: 185 -KFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYL------HNKLILGD-GAIIDEGD 236
G+ GLG G SS+ S L S G +HD L ++ GD G I +
Sbjct: 233 VAPDGLLGLGPGESSVPSFLAKS-----GLIHDSFSLCFNEDDSGRIFFGDQGPTIQQST 287
Query: 237 A-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEA 295
+ PL + Y I +E+ V L + FK V +DSGT T+L Y A
Sbjct: 288 SFLPLDGLYSTYIIGVESCCVGNSCLKMTS--FK-------VQVDSGTSFTFLPGHVYGA 338
Query: 296 LRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQP 355
+ +E ++ G + P + CY S +L P++ F+ + +
Sbjct: 339 IAEEFDQQVNGSRSSFEGSPWEYCYV-PSSQELPKVPSLTLTFQQNNSFVVYDPVFVFYG 397
Query: 356 RPD--AFCMAVNPASINNRYVNFSLIGMMAQQF---YNVGYDIGRKQKTFQRMDCEVL 408
FC+A+ P + +G + Q F Y + +D G K+ + R +C+ L
Sbjct: 398 NEGVIGFCLAIQPTEGD--------MGTIGQNFMTGYRLVFDRGNKKLAWSRSNCQDL 447
>gi|340810945|gb|AEK75399.1| S5 [Oryza sativa]
gi|340810957|gb|AEK75405.1| S5 [Oryza sativa]
gi|340811007|gb|AEK75430.1| S5 [Oryza nivara]
gi|340811073|gb|AEK75463.1| S5 [Oryza rufipogon]
gi|340811094|gb|AEK75473.1| S5 [Oryza rufipogon]
Length = 357
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 110/383 (28%), Positives = 161/383 (42%), Gaps = 66/383 (17%)
Query: 63 VNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DC---SPQLGTIFYPSRSSSYAYVPCDS 118
+ +S+G+PPV A+DTGS+L WV C PC C S + G IF P RS + V C S
Sbjct: 1 MAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVRCSS 60
Query: 119 EHC---RY---FPYARCSAYKHRCIYTQLYLIGPETSV-FVSTEQLTFKNTDESTIHVQD 171
C RY A C + C Y+ Y G SV + T+ L ++ D
Sbjct: 61 VKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS------FMD 114
Query: 172 VVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN--------SSFSYCIGSLH-DPDYLHN 222
++FGC + +GIFG G S QL +FSYC+ + P Y
Sbjct: 115 LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCLPTDETKPGY--- 171
Query: 223 KLILG--DGAIIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
+ILG D A +D G + I+ Y +T E + +G+ L S +++
Sbjct: 172 -MILGRYDRAAMDGGYTPLFRSINRPTYSLTTEMLIANGQRLVT---------SSSEMIV 221
Query: 280 DSGTDVTWLVKEAYEALRDEVMI---------RLEGEQMRSY----SWPDKLCYHGIMS- 325
DSG T L + AL D+ + R + SY S D ++G ++
Sbjct: 222 DSGAQRTSLWPSTF-ALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITP 280
Query: 326 -SDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCM--AVNPASINNRYVNFSLIGMM 382
S+ P + F GGA LAL ++FY CM A NPA + ++G
Sbjct: 281 FSNWSALPLLEIGFAGGAALALSPRNVFYNDPHRGLCMTFAQNPA------LRSQILGNR 334
Query: 383 AQQFYNVGYDIGRKQKTFQRMDC 405
+ + +DI KQ F+ C
Sbjct: 335 VTRSFGTTFDIQGKQFGFKYAAC 357
>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
Length = 462
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 97/380 (25%), Positives = 152/380 (40%), Gaps = 51/380 (13%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPC-DSE 119
+Y +I +G P +DTGS L W+ C PC+ C+P + TI+ +RS SY V C +S+
Sbjct: 100 YYTSIKLGSPGQEAILIVDTGSELTWLKCLPCKVCAPSVDTIYDAARSVSYKPVTCNNSQ 159
Query: 120 HC---RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKN-TDESTIHVQDVVFG 175
C YA C A +C + Y G + +ST+ L + + VQD FG
Sbjct: 160 LCSNSSQGTYAYC-ARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTVQDFAFG 218
Query: 176 C--GFSTNRNFKFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILGDG 229
C G SGI GL G+ +L QL FS+C + G+
Sbjct: 219 CAQGDLELVPTGASGILGLNAGKMALPMQLGQRFGWKFSHCFPDRSSHLNSTGVVFFGNA 278
Query: 230 AIIDEG------DATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGT 283
+ E T + Y++ L+ +S++ L + P G V++DSG+
Sbjct: 279 ELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVLLPR-------GSVVILDSGS 331
Query: 284 DVTWLV-------KEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDL----KGFP 332
+ V +EA+ R + LEG+ S+ D + + D+ + P
Sbjct: 332 SFSSFVRPFHSQLREAFLKHRPPSLKHLEGD-----SFGDLGTCFKVSNDDIDELHRTLP 386
Query: 333 TVRFHFRGGAKLALEKDSMF-----YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFY 387
++ F G + + + YQ C A N ++IG QQ
Sbjct: 387 SLSLVFEDGVTIGIPSIGVLLPVARYQNHVK-MCFAFEDGGPN----PVNVIGNYQQQNL 441
Query: 388 NVGYDIGRKQKTFQRMDCEV 407
V YDI R + F R C +
Sbjct: 442 WVEYDIQRSRVGFARASCVI 461
>gi|302789618|ref|XP_002976577.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
gi|300155615|gb|EFJ22246.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
Length = 437
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 104/435 (23%), Positives = 171/435 (39%), Gaps = 55/435 (12%)
Query: 1 LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQI---LPSNDIS 57
L HR S L + NE + G+ +S L +L E + I L N
Sbjct: 26 LQHRYSGLEGSSKQNE------KLGLGMSKQHLQHLVEHNDRRGRFLQGISFPLKGNYSD 79
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCS------PQLGTIFYPSRSSSY 111
L+Y I +G P +DTGS +LWV C PCR C P L + S+S
Sbjct: 80 LGLYYTEIGLGNPVQKLKVIVDTGSDILWVKCSPCRSCLSKQDIIPPLSIYNLSASSTSS 139
Query: 112 AYVPCDSEHCRYFPYARCSAYKHRCIYTQLYL-IGPETSVFVSTEQLTFKNTDESTIHVQ 170
D S C Y Y +V + + +T
Sbjct: 140 VSSCSDPLCTGEEVVCSRSGNNSACAYVSSYQDKSASVGAYVRDDMHYVLHGGNAT--TS 197
Query: 171 DVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYLHNKL 224
+ FGC + ++ GI G G+ ++ +Q+ + FS+C+G + L
Sbjct: 198 RIFFGCATNITGSWPVDGIMGFGLISKTVPNQIATQRNMSRVFSHCLGG---EKHGGGIL 254
Query: 225 ILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIF---KRDDSGGGVMIDS 281
G+ E TPL + HY + L +ISV+ ++L I+P F + + GV+IDS
Sbjct: 255 EFGEAPNTTEMVFTPLLNVTTHYNVDLLSISVNSKVLPIDPKEFSYVRNSTNNTGVIIDS 314
Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL----CYHGIMSSDL---KGFPTV 334
GT L +A L E+ + + + KL C++ + S L FP V
Sbjct: 315 GTTFVLLTTKANRMLFQEI------KSLTTAKLGPKLEGLECFY--LKSGLTMETSFPNV 366
Query: 335 RFHFRGGAKLALEKDSMF----YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
F GG+ + L+ D+ Y+ + + +C A + A ++ G + + V
Sbjct: 367 TLTFSGGSTMKLKPDNYLVMAEYKKKRNGYCYAWSSAD------GLTIFGEIVLKDKLVF 420
Query: 391 YDIGRKQKTFQRMDC 405
YD+ ++ ++ +C
Sbjct: 421 YDVENRRIGWKGQNC 435
>gi|357489329|ref|XP_003614952.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355516287|gb|AES97910.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 530
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 88/360 (24%), Positives = 149/360 (41%), Gaps = 46/360 (12%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPS-----------RS 108
L Y I IG P V A+DTGS + WV C C +C+P L FY + S
Sbjct: 101 LHYTWIDIGTPNVSFLVALDTGSDMFWVPC-DCIECAP-LSAAFYNALDRDLNQYSPSLS 158
Query: 109 SSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLI-GPETSVFVSTEQLTFKNTDESTI 167
SS ++PC + C + C +K RC Y + Y +S F+ ++L + + +
Sbjct: 159 SSSRHLPCGHQLCNQ--NSNCKGFKDRCPYIKEYTSDNTSSSGFLIEDKLHLASNNATKN 216
Query: 168 HVQ-DVVFGCGFSTNRNF----KFSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHD 216
+Q V+ GCG + F +G+ GLG G S+ + L +S S C+
Sbjct: 217 SIQASVILGCGRKQSGYFLEGAAPNGMLGLGPGSISVPALLAKAGLIRNSISICLN---- 272
Query: 217 PDYLHNKLILGDGAIIDEGDATPLQFIDG---HYYITLEAISVDGRMLDINPNIFKRDDS 273
+ +++ GD + +TP DG +Y++ +E V F ++
Sbjct: 273 -EKGSGRILFGDQGHATQRRSTPFLLDDGELLNYFVGVERFCVGS---------FCYKET 322
Query: 274 GGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPT 333
ID+GT T+L K YE + E ++ ++ S D C + S + FP
Sbjct: 323 EFKAFIDTGTSFTYLPKGVYETVVAEFEKQVHATRITSQIQSDFNCCYNASSRESNNFPP 382
Query: 334 VRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
++F F ++ + C+AV ++ + +A Q + +GYD+
Sbjct: 383 MKFTFSKNQSFIIQNPFISMDQEDTTICLAV--VQSDDELITIGRKYTIACQNFLMGYDM 440
>gi|357515189|ref|XP_003627883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521905|gb|AET02359.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 95/360 (26%), Positives = 141/360 (39%), Gaps = 28/360 (7%)
Query: 56 ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
I + + V G PP A+DT S W+ C C CS F P +S+S+ V
Sbjct: 92 IQSPTYIVKAKFGTPPQTLLLALDTSSDAAWIPCSGCVGCS--TSKPFAPIKSTSFRNVS 149
Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
C S HC+ P C C + Y +S+ S Q T +T + FG
Sbjct: 150 CGSPHCKQVPNPTCGG--SACAFNFTYG---SSSIAASVVQDTL---TLATDPIPGYTFG 201
Query: 176 C-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGA 230
C G S + G S + S+FSYC+ S ++ L LG
Sbjct: 202 CVNKTTGSSAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCLPSFKSINF-SGSLRLGPVY 260
Query: 231 IIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVT 286
TPL YY+ L AI V +++DI P +G G + DSGT T
Sbjct: 261 QPKRIKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTTGAGTIFDSGTVFT 320
Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
L + Y A+R+E R+ + + CY+ + PT+ F F G + L
Sbjct: 321 RLAEPVYTAVRNEFRRRVGPKLPVTTLGGFDTCYNVPIV-----VPTITFLF-SGMNVTL 374
Query: 347 EKDSM-FYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
D++ + C+A+ A N V ++I M QQ + V +D+ + R C
Sbjct: 375 PPDNIVIHSTAGSTTCLAMAGAPDNVNSV-LNVIANMQQQNHRVLFDVPNSRIGIARELC 433
>gi|77555282|gb|ABA98078.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 409
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 85/292 (29%), Positives = 133/292 (45%), Gaps = 32/292 (10%)
Query: 123 YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNR 182
YF +A+C+ + TS +++T+ TF T V VVFGC ++
Sbjct: 111 YFVWAQCAPLTYGGS-------AANTSGYLATDTFTFGAT-----AVPGVVFGCSDASYG 158
Query: 183 NFK-FSGIFGLGIGRSSLVSQLN-SSFSYCIGS--LHDPDYLHNKLILGDGAI--IDEGD 236
+F SG+ G+G G SL+SQL FSY + + D + + GD A+ G
Sbjct: 159 DFAGASGVIGIGRGNLSLISQLQFGKFSYQLLAPEATDDGSADSVIRFGDDAVPKTKRGR 218
Query: 237 ATPL---QFIDGHYYITLEAISVDGRMLDINP-NIFK-RDDSGGGVMIDSGTDVTWLVKE 291
+TPL YY+ L + VDG LD P F R + GGV++ S T VT+L +
Sbjct: 219 STPLLSSTLYPDFYYVNLTGVRVDGNRLDAIPAGTFDLRANGTGGVILSSTTPVTYLEQA 278
Query: 292 AYEALRDEVMIRLEGEQMR-SYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS 350
AY+ +R V R+ + S + LCY+ + +K P + F GGA + L +
Sbjct: 279 AYDVVRAAVASRIGLPAVNGSAALELDLCYNASSMAKVK-VPKLTLVFDGGADMDLSAAN 337
Query: 351 MFYQPRPDAF-CMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQ 401
FY C+ + P+ S++G + Q N+ YD+ + TF+
Sbjct: 338 YFYIDNDTGLECLTMLPSQ------GGSVLGTLLQTGTNMIYDVDAGRLTFE 383
>gi|302764208|ref|XP_002965525.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
gi|300166339|gb|EFJ32945.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
Length = 464
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 94/384 (24%), Positives = 161/384 (41%), Gaps = 44/384 (11%)
Query: 37 QEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCS 96
+E+ H+ Q P + + ++Y +I++G PP MDTGS L WV C PC S
Sbjct: 103 EEEEVEHDLAQT---PVSFTNGGVYYSSITLGSPPKDFSLVMDTGSDLTWVRCDPC---S 156
Query: 97 PQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQ 156
P + F S++Y + C ++ R R ++ +L+ G T +
Sbjct: 157 PDCSSTFDRLASNTYKALTC-ADDLRLPVLLR--------LWRRLFHSGRS---LRDTLK 204
Query: 157 LTFKNTDESTIHVQDVVFGCGFSTNRNFKFS-GIFGLGIGRSSLVSQLN----SSFSYCI 211
+ +DE VFGCG GI L G S SQ+ + FSYC+
Sbjct: 205 MAGAASDELE-EFPGFVFGCGSLLKGLISGEVGILALSPGSLSFPSQIGEKYGNKFSYCL 263
Query: 212 GSLHDPDYLHNK-LILGDGAI---------IDEGDATPLQFIDGHYYITLEAISVDGRML 261
+ L ++ G+ A+ E TP+ +Y + L+ ISV + L
Sbjct: 264 LRQTAQNSLKKSPMVFGEAAVELKEPGSGKPQELQYTPIGESSIYYTVRLDGISVGNQRL 323
Query: 262 DINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYH 321
D++P+ F + + DSGT +T L ++++ + + G + + D C+
Sbjct: 324 DLSPSTF-LNGQDKPTIFDSGTTLTMLPSGVCDSIKQSLASMVSGAEFVAIKGLDA-CFR 381
Query: 322 GIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGM 381
+ S +G P + FHF GGA + + S + C+ P + S+ G
Sbjct: 382 -VPPSSGQGLPDITFHFNGGADF-VTRPSNYVIDLGSLQCLIFVPTN------EVSIFGN 433
Query: 382 MAQQFYNVGYDIGRKQKTFQRMDC 405
+ QQ + V +D+ ++ F+ DC
Sbjct: 434 LQQQDFFVLHDMDNRRIGFKETDC 457
>gi|340810981|gb|AEK75417.1| S5 [Oryza rufipogon]
Length = 357
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 108/383 (28%), Positives = 160/383 (41%), Gaps = 66/383 (17%)
Query: 63 VNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DC---SPQLGTIFYPSRSSSYAYVPCDS 118
+ +S+G+PPV A+DTGS+L WV C PC C S + G IF P RS + V C S
Sbjct: 1 MAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVRCSS 60
Query: 119 EHCRYFPY------ARCSAYKHRCIYTQLYLIGPETSV-FVSTEQLTFKNTDESTIHVQD 171
C Y A C + C Y+ Y G SV + T+ L ++ D
Sbjct: 61 VKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS------FMD 114
Query: 172 VVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN--------SSFSYCIGSLH-DPDYLHN 222
++FGC + +GIFG G S QL + SYC+ + P Y
Sbjct: 115 LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKALSYCLPTDETKPGY--- 171
Query: 223 KLILG--DGAIIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
+ILG D A +D G + I+ Y +T+E + +G+ L S +++
Sbjct: 172 -MILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVT---------SSSEMIV 221
Query: 280 DSGTDVTWLVKEAYEALRDEVMI---------RLEGEQMRSY----SWPDKLCYHGIMS- 325
DSG T L + AL D+ + R + SY S D ++G ++
Sbjct: 222 DSGAQRTSLWPSTF-ALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITP 280
Query: 326 -SDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCM--AVNPASINNRYVNFSLIGMM 382
S+ P + F GGA LAL ++FY CM A NPA + ++G
Sbjct: 281 FSNWSALPLLEIGFAGGAALALSPRNVFYNDPHRGLCMTFAQNPA------LRSQILGNR 334
Query: 383 AQQFYNVGYDIGRKQKTFQRMDC 405
+ + +DI KQ F+ C
Sbjct: 335 VTRSFGTTFDIQGKQFGFKYAVC 357
>gi|125595861|gb|EAZ35641.1| hypothetical protein OsJ_19928 [Oryza sativa Japonica Group]
Length = 629
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 96/340 (28%), Positives = 141/340 (41%), Gaps = 48/340 (14%)
Query: 68 GQPPVPQFTAMDTGSSLLWVHCYPCR--DCSPQLGTIFYPSRSSSYAYVPCDSEHCRYF- 124
G V Q +D+GS + WV C PC C Q +F P+ S++YA VPC S C
Sbjct: 71 GTSAVTQTVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLG 130
Query: 125 PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS---TN 181
PY R + +C + Y G + S + LT D ++ FGC + +
Sbjct: 131 PYRRGCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYDV----IRGFRFGCAHADRGSA 186
Query: 182 RNFKFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILG----DGAIID 233
++ +G LG G SLV Q + FSYC+ L+LG +I
Sbjct: 187 FDYDVAGSLALGGGSQSLVQQTATRYGRVFSYCLPPTASS---LGFLVLGVPPERAQLIP 243
Query: 234 EGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVK 290
+TPL Y + L AI V GR L + P +F +IDS T ++ L
Sbjct: 244 SFVSTPLLSSSMAPTFYRVLLRAIIVAGRPLAVPPAVFSASS-----VIDSSTIISRLPP 298
Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKL---CY--HGIMSSDLKGFPTVRFHFRGGAKLA 345
AY+ALR M + P + CY G+ S L P++ F GGA +
Sbjct: 299 TAYQALRAAFR---SAMTMYRAAPPVSILDTCYDFTGVRSITL---PSIALVFDGGATVN 352
Query: 346 LEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQ 385
L+ + C+A P + ++R F IG + Q+
Sbjct: 353 LDAAGILL-----GSCLAFAPTA-SDRMPGF--IGNVQQK 384
>gi|297740190|emb|CBI30372.3| unnamed protein product [Vitis vinifera]
Length = 445
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 106/377 (28%), Positives = 154/377 (40%), Gaps = 56/377 (14%)
Query: 12 NNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPP 71
NP+ +P + + S+ R +L+ + NT P S + V++S G P
Sbjct: 61 KNPSSDPWQLLSHLTSASLTRAHHLKHR---KNTSSVNT-PLFAHSYGGYSVSLSFGTPS 116
Query: 72 VPQFTAMDTGSSLLWVHC---YPCRDCS-----PQLGTIFYPSRSSSYAYVPCDSEHCRY 123
MDTGSSL+W C Y C CS P F P SSS V C + C +
Sbjct: 117 QTLSFVMDTGSSLVWFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSSAKIVGCLNPKCGF 176
Query: 124 FPYARCSAY-KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNR 182
+ SA C + T + E L F E D V GC ++R
Sbjct: 177 VMDSENSANCTKACPTYAIQYGLGTTVGLLLLESLVFAERTE-----PDFVVGCSILSSR 231
Query: 183 NFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLH---DPDYLHNKLILGDGAIIDEGDA- 237
+ SGI G G G SSL Q+ FSYC+ S P L +G + D+
Sbjct: 232 --QPSGIAGFGRGPSSLPKQMGLKKFSYCLLSHRFDDSPKSSKMTLYVGPDSKDDKTGGL 289
Query: 238 --TPLQ--------FIDGHYYITLEAISVDGRMLDINPNIF--KRDDSGGGVMIDSGTDV 285
TP + +YY+TL I V + + + P F D GG ++DSG+
Sbjct: 290 SYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDKRVKV-PYSFMVAGSDGNGGTIVDSGSTF 348
Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPD--------KLCYH--GIMSSDLKGFPTVR 335
T++ K +EA+ E QM +Y+ K C++ G+ S L P++
Sbjct: 349 TFMEKPVFEAVATEF-----DRQMANYTRAADVEALSGLKPCFNLSGVGSVAL---PSLV 400
Query: 336 FHFRGGAKLALEKDSMF 352
F F+GGAK+ L + F
Sbjct: 401 FQFKGGAKMELPVANYF 417
>gi|358346736|ref|XP_003637421.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
gi|355503356|gb|AES84559.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
Length = 280
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 59/186 (31%), Positives = 87/186 (46%), Gaps = 12/186 (6%)
Query: 31 ARLAYLQEKIRSH---NTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWV 87
AR+ Y+ K+ + + I+ + ++ I IG+PP + +DTGS + WV
Sbjct: 99 ARVKYITTKLNQNFNTDKLSGPIISGTSQGSGEYFSRIGIGEPPSQAYMVLDTGSDISWV 158
Query: 88 HCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPE 147
C PC DC Q IF P+ S+SYA + C++ CRY ++C C+Y Y G
Sbjct: 159 QCAPCADCYRQADPIFEPTASASYAPLSCEAAQCRYLDQSQCR--NGNCLYQVSYGDGSY 216
Query: 148 TSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRS-SLVSQLNS- 205
T TE +T V++V GCG + F + G S +QLNS
Sbjct: 217 TVGDFVTETVTI-----GVNKVKNVALGCGHNNEGLFVGAAGLIGLGGGPLSFPAQLNST 271
Query: 206 SFSYCI 211
SFSYC+
Sbjct: 272 SFSYCL 277
>gi|218197465|gb|EEC79892.1| hypothetical protein OsI_21411 [Oryza sativa Indica Group]
Length = 720
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 96/340 (28%), Positives = 141/340 (41%), Gaps = 48/340 (14%)
Query: 68 GQPPVPQFTAMDTGSSLLWVHCYPCR--DCSPQLGTIFYPSRSSSYAYVPCDSEHCRYF- 124
G V Q +D+GS + WV C PC C Q +F P+ S++YA VPC S C
Sbjct: 162 GTSAVTQTVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLG 221
Query: 125 PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS---TN 181
PY R + +C + Y G + S + LT D ++ FGC + +
Sbjct: 222 PYRRGCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYDV----IRGFRFGCAHADRGSA 277
Query: 182 RNFKFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILG----DGAIID 233
++ +G LG G SLV Q + FSYC+ L+LG +I
Sbjct: 278 FDYDVAGSLALGGGSQSLVQQTATRYGRVFSYCLPPTAS---SLGFLVLGVPPERAQLIP 334
Query: 234 EGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVK 290
+TPL Y + L AI V GR L + P +F +IDS T ++ L
Sbjct: 335 SFVSTPLLSSSMAPTFYRVLLRAIIVAGRPLAVPPAVFSASS-----VIDSSTIISRLPP 389
Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKL---CY--HGIMSSDLKGFPTVRFHFRGGAKLA 345
AY+ALR M + P + CY G+ S L P++ F GGA +
Sbjct: 390 TAYQALRAAFR---SAMTMYRAAPPVSILDTCYDFTGVRSITL---PSIALVFDGGATVN 443
Query: 346 LEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQ 385
L+ + C+A P + ++R F IG + Q+
Sbjct: 444 LDAAGILL-----GSCLAFAPTA-SDRMPGF--IGNVQQK 475
>gi|224140036|ref|XP_002323393.1| predicted protein [Populus trichocarpa]
gi|222868023|gb|EEF05154.1| predicted protein [Populus trichocarpa]
Length = 459
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 106/436 (24%), Positives = 174/436 (39%), Gaps = 62/436 (14%)
Query: 15 NENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQ 74
++ P + ++S++R +++ + + + + P S + ++++ G PP
Sbjct: 40 SKKPWGSLNHLASLSLSRAHHIKSPKTNFSLIKTPLFPR---SYGGYSISLNFGTPPQTT 96
Query: 75 FTAMDTGSSLLWVHC---YPCRDCS-PQLGT----IFYPSRSSSYAYVPCDSEHCRYF-- 124
MDTGSSL+W C Y C +C+ P + F P SSS + C + C
Sbjct: 97 KFVMDTGSSLVWFPCTSRYLCSECNFPNIKKTGIPTFLPKLSSSSKLIGCKNPRCSMIFG 156
Query: 125 -----PYARCSAYKHRCIYT-QLYLI---GPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
C + C T Y+I T+ + +E L F N + D + G
Sbjct: 157 PEIQSKCQECDSTAQNCTQTCPPYVIQYGSGSTAGLLLSETLDFPNKKT----IPDFLVG 212
Query: 176 CG-FSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGS-LHDPDYLHNKLIL----GD 228
C FS + GI G G SL SQL FSYC+ S D + L+L G
Sbjct: 213 CSIFSIKQP---EGIAGFGRSPESLPSQLGLKKFSYCLVSHAFDDTPTSSDLVLDTGSGS 269
Query: 229 GAIIDEGDA------TPLQFIDGHYYITLEAISVDGRMLDINPNIF--KRDDSGGGVMID 280
G G + P +YY+ L I + + + P F D GG ++D
Sbjct: 270 GVTKTAGLSHTPFLKNPTTAFRDYYYVLLRNIVIGDTHVKV-PYKFLVPGTDGNGGTIVD 328
Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL--------CYHGIMSSDLKGFP 332
SGT T++ YE + E +QM Y+ ++ CY+ I P
Sbjct: 329 SGTTFTFMENPVYELVAKEFE-----KQMAHYTVATEIQNLTGLRPCYN-ISGEKSLSVP 382
Query: 333 TVRFHFRGGAKLALEKDSMFYQPRPDAFCMAV---NPASINNRYVNFSLIGMMAQQFYNV 389
+ F F+GGAK+AL + F C+ + N A ++G Q+ + V
Sbjct: 383 DLIFQFKGGAKMALPLSNYFSIVDSGVICLTIVSDNVAGPGLGGGPAIILGNYQQRNFYV 442
Query: 390 GYDIGRKQKTFQRMDC 405
+D+ ++ F++ C
Sbjct: 443 EFDLENEKFGFKQQSC 458
>gi|222637182|gb|EEE67314.1| hypothetical protein OsJ_24556 [Oryza sativa Japonica Group]
Length = 304
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 97/360 (26%), Positives = 148/360 (41%), Gaps = 76/360 (21%)
Query: 63 VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC----SPQLGTIFYPSRSSSYAYVPCDS 118
+ +++G PPV A+ S L WV C PC C +P G Y R++S ++ P
Sbjct: 1 MELAVGTPPV-TVQALFGISDLCWVECTPCSGCNNNAAPPAGARLY-DRANSSSFSPLAD 58
Query: 119 EHCRY-FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
C Y + Y ++ Y+ G + TE + F + D +T VQ FGC
Sbjct: 59 TECGYRYVYGATDTDRN-------YVKG-----ILGTETIKFGSNDAAT--VQSFTFGCT 104
Query: 178 FSTNRNFKF---SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIID 233
+ RN F +G+ GLG + SLV QL FSYC+ S +P+ + + ++ G A +D
Sbjct: 105 NTVYRNDLFDGNTGVVGLGRSKLSLVGQLGLDRFSYCLAS--NPN-VASPVLFGSTASMD 161
Query: 234 EG--DATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKE 291
+TPL D +YY+ L ISVDG L I PN R +
Sbjct: 162 GNGVSSTPLLPDDANYYVNLLGISVDGTRLAI-PNDTAR------------------MSR 202
Query: 292 AYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSM 351
YEA + G + + D S ++ PT+ HF G L +
Sbjct: 203 TYEA--------VNGSGLLCFLVDDA-------SKNVVTVPTMTMHFDGMDMELLFGNYF 247
Query: 352 FYQPRP------DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
Y + D C+ + +S +R IG Q ++V Y++ + Q DC
Sbjct: 248 AYTGKQSGGGGGDVLCLMIGKSSTGSR------IGNYLQMDFHVLYELKNSVLSVQPADC 301
>gi|326505434|dbj|BAJ95388.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 529
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 104/375 (27%), Positives = 151/375 (40%), Gaps = 50/375 (13%)
Query: 60 LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC----SPQLGT----IFYPSRSSSY 111
L Y +++G P V A+DTGS L WV C C C SP G ++ P +SS+
Sbjct: 107 LHYAVVALGTPNVTFLVALDTGSDLFWVPC-DCLKCAPLSSPDYGNLKFDVYSPRKSSTS 165
Query: 112 AYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDE--STIHV 169
VPC S C CSA + C Y YL +S V E + + T+ S I
Sbjct: 166 RKVPCSSNMCDL--QTECSAASNSCPYKIEYLSDNTSSKGVLVEDVMYLATESGHSKITQ 223
Query: 170 QDVVFGCGFSTNRNFKFS----GIFGLGIGRSSLVSQLNS------SFSYCIGSLHDPDY 219
+ FGCG +F S G+ GLG+ S+ S L S SFS C G +
Sbjct: 224 APITFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASQGVAANSFSMCFG-----ED 278
Query: 220 LHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
H ++ GD D+ + TPL + Y IS+ G M K + ++
Sbjct: 279 GHGRINFGDTGSADQLE-TPLNIYKHNPYYN---ISIVGAMAG-----GKTFSTKFSAVV 329
Query: 280 DSGTDVTWLVKEAYEALRDEVMIRL-EGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHF 338
DSGT T L Y + ++ E S P + CY I S P +
Sbjct: 330 DSGTSFTALSDPMYTEITSAFDKQVKEKRNPADSSLPFEYCYT-ISSKGAVSPPNISLTA 388
Query: 339 RGGAKLALEKDSMF----YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
+GG+ + KD + P +C+A+ + + VN LIG V +D
Sbjct: 389 KGGSVFPV-KDPIITITDISSSPVGYCLAI----MKSEGVN--LIGENFMSGLKVVFDRE 441
Query: 395 RKQKTFQRMDCEVLD 409
R ++ +C +D
Sbjct: 442 RLVLGWKSFNCYSVD 456
>gi|413923876|gb|AFW63808.1| hypothetical protein ZEAMMB73_793799 [Zea mays]
Length = 415
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 90/363 (24%), Positives = 145/363 (39%), Gaps = 59/363 (16%)
Query: 78 MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARC-------- 129
+DTGS++ W C SRS + + +PC S C C
Sbjct: 73 VDTGSNIFWTTEKEC-------------SRSKTRSMLPCCSPKCEQRASCGCRRSELKAE 119
Query: 130 SAYKHRCIYTQLYL--IGPETSVFVSTEQLTFKNTDESTI----HVQDVVFGCGFSTNRN 183
+ + +C Y Y T+ + ++LT + ++V GC S
Sbjct: 120 AEKETKCTYAIKYGGNANDSTAGVLYEDKLTIVAVASKAVPGSQSFEEVAIGCSTSATLK 179
Query: 184 FK---FSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT- 238
FK G+FGLG +SL QLN S FSYC+ S PD L + L+L + G
Sbjct: 180 FKDPSIKGVFGLGRSATSLPRQLNFSKFSYCLSSYQKPD-LPSYLLLTAAPDMATGAVGG 238
Query: 239 ----------PLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
P Y++ L+ IS+ G L P + + SGG + +D+GT T L
Sbjct: 239 AAAVATTALQPNSDYKTRYFVDLQGISIGGTRL---PAVSTK--SGGNMFVDTGTSFTRL 293
Query: 289 VKEAYEALRDEVMIRLEGEQMRSYSWPDK----LCYH--GIMSSDLKGFPTVRFHFRGGA 342
+ L E + R+ E+ P + +CY + + P + HF A
Sbjct: 294 EGTVFAKLVTE-LDRIMKERKYVKEQPGRNNGQICYSPPSTAADESSKLPDMVLHFADSA 352
Query: 343 KLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
+ L DS ++ C+A++ ++I S++G Q ++ D G ++ +F R
Sbjct: 353 NMVLPWDSYLWK-TTSKLCLAIDKSNIKG---GISVLGNFQMQNTHMLLDTGNEKLSFVR 408
Query: 403 MDC 405
DC
Sbjct: 409 ADC 411
>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
Length = 462
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 97/382 (25%), Positives = 152/382 (39%), Gaps = 55/382 (14%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD-SE 119
+Y +I +G P +DTGS L W+ C PC+ C+P + TI+ +RS+SY V C+ S+
Sbjct: 100 YYTSIKLGSPGQEAILIVDTGSELTWLQCLPCKVCAPSVDTIYDAARSASYRPVTCNNSQ 159
Query: 120 HCR---YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT-DESTIHVQDVVFG 175
C YA C A +C + Y G + +ST+ L + + VQD FG
Sbjct: 160 LCSNSSQGTYAYC-ARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTVQDFAFG 218
Query: 176 C--GFSTNRNFKFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILGDG 229
C G SGI GL G+ +L QL FS+C + G+
Sbjct: 219 CAQGDLELVPTGASGILGLNAGKMALPMQLGQRFGWKFSHCFPDRSSHLNSTGVVFFGNA 278
Query: 230 AIIDEG------DATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGT 283
+ E T + Y++ L+ +S++ L P G V++DSG+
Sbjct: 279 ELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVFLPR-------GSVVILDSGS 331
Query: 284 DVTWLVK-------EAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDL----KGFP 332
+ V+ EA+ R + LEG+ S+ D + + D+ + P
Sbjct: 332 SFSSFVRPFHSQLREAFLKHRPPSLKHLEGD-----SFGDLGTCFKVSNDDIDELHRTLP 386
Query: 333 TVRFHFRGGAKLALEKDSMFYQPRPDA-------FCMAVNPASINNRYVNFSLIGMMAQQ 385
++ F G + + + P A C A N ++IG QQ
Sbjct: 387 SLSLVFEDGVTIGIPSIGVLL---PVARFQNHVKMCFAFEDGGPNP----VNVIGNYQQQ 439
Query: 386 FYNVGYDIGRKQKTFQRMDCEV 407
V YDI R + F R C +
Sbjct: 440 NLWVEYDIQRSRVGFARASCVI 461
>gi|340810961|gb|AEK75407.1| S5 [Oryza sativa]
gi|340811037|gb|AEK75445.1| S5 [Oryza rufipogon]
Length = 357
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 108/383 (28%), Positives = 160/383 (41%), Gaps = 66/383 (17%)
Query: 63 VNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DC---SPQLGTIFYPSRSSSYAYVPCDS 118
+ +S+G+PPV A+DTGS+L WV C PC C S + G IF P RS + V C S
Sbjct: 1 MAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVRCSS 60
Query: 119 EHCRYFPY------ARCSAYKHRCIYTQLYLIGPETSV-FVSTEQLTFKNTDESTIHVQD 171
C Y A C + C Y+ Y G SV + T+ L ++ D
Sbjct: 61 VKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS------FMD 114
Query: 172 VVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN--------SSFSYCIGSLH-DPDYLHN 222
++FGC + +GIFG G S QL + SYC+ + P Y
Sbjct: 115 LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKALSYCLPTDETKPGY--- 171
Query: 223 KLILG--DGAIIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
+ILG D A +D G + I+ Y +T+E + +G+ L S +++
Sbjct: 172 -MILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVT---------SSSEMIV 221
Query: 280 DSGTDVTWLVKEAYEALRDEVMI---------RLEGEQMRSY----SWPDKLCYHGIMS- 325
DSG T L + AL D+ + R + SY S D ++G ++
Sbjct: 222 DSGAQRTSLWPSTF-ALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITP 280
Query: 326 -SDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCM--AVNPASINNRYVNFSLIGMM 382
S+ P + F GGA LAL ++FY CM A NPA + ++G
Sbjct: 281 FSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPA------LRSQILGNR 334
Query: 383 AQQFYNVGYDIGRKQKTFQRMDC 405
+ + +DI KQ F+ C
Sbjct: 335 VTRSFGTTFDIQGKQFGFKYAVC 357
>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 103/379 (27%), Positives = 153/379 (40%), Gaps = 41/379 (10%)
Query: 47 QAQILP---SNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTI 102
QA LP I + V + +G P DTGS + W C PC + C Q
Sbjct: 102 QATTLPVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPR 161
Query: 103 FYPSRSSSYAYVPCDSEHCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQL 157
PS S+SY + C S C+ + CS+ C+Y Y G + F +TE L
Sbjct: 162 LNPSTSTSYKNISCSSALCKLVASGKKFSQSCSS--STCLYQVQYGDGSYSIGFFATETL 219
Query: 158 TFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGI-GRSSLVSQLNSS----FSYCI- 211
T +++ ++ +FGCG N F + + +L SQ + FSYC+
Sbjct: 220 TLSSSNV----FKNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCLP 275
Query: 212 GSLHDPDYLHNKLILGDGAIIDEGDATPLQ--FIDGHYY-ITLEAISVDGRMLDINPNIF 268
S YL LG G + TPL F +Y + + +SV GR L I+ + F
Sbjct: 276 ASSSSKGYLS----LG-GQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAF 330
Query: 269 KRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVM-IRLEGEQMRSYSWPDKLCYHGIMSSD 327
G +IDSGT +T L AY L + + YS D CY D
Sbjct: 331 S-----AGTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDT-CYD-FSKYD 383
Query: 328 LKGFPTVRFHFRGGAKLALEKDSMFYQPRP-DAFCMAVNPASINNRYVNFSLIGMMAQQF 386
P V F+GG ++ ++ + Y C+A + N+ + S+ G + Q+
Sbjct: 384 TVRIPKVGVTFKGGVEMDIDVSGILYPVNGLKKVCLAF---AGNDDDSDTSIFGNVQQRT 440
Query: 387 YNVGYDIGRKQKTFQRMDC 405
Y V YD + + F C
Sbjct: 441 YQVVYDGAKGRVGFAPGGC 459
>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 105/398 (26%), Positives = 160/398 (40%), Gaps = 44/398 (11%)
Query: 32 RLAYLQEKIRSHNTY---QAQILP---SNDISNSLFYVNISIGQPPVPQFTAMDTGSSLL 85
R+ + ++ S + QA LP I + V + +G P DTGS +
Sbjct: 36 RVDSIHARLSSRGMFPEKQATTLPVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDIT 95
Query: 86 WVHCYPC-RDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYAR-----CSAYKHRCIYT 139
W C PC + C Q PS S+SY + C S C+ + CS+ C+Y
Sbjct: 96 WTQCEPCVKTCYKQKEPRLNPSTSTSYKNISCSSALCKLVASGKKFSQSCSS--STCLYQ 153
Query: 140 QLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGI-GRSS 198
Y G + F +TE LT +++ ++ +FGCG N F + + +
Sbjct: 154 VQYGDGSYSIGFFATETLTLSSSNV----FKNFLFGCGQQNNGLFGGAAGLLGLGRTKLA 209
Query: 199 LVSQLNSS----FSYCI-GSLHDPDYLHNKLILGDGAIIDEGDATPLQ--FIDGHYY-IT 250
L SQ + FSYC+ S YL LG G + TPL F +Y +
Sbjct: 210 LPSQTAKTYKKLFSYCLPASSSSKGYLS----LG-GQVSKSVKFTPLSADFDSTPFYGLD 264
Query: 251 LEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVM-IRLEGEQM 309
+ +SV GR L I+ + F G +IDSGT +T L AY L + +
Sbjct: 265 ITGLSVGGRQLSIDESAFS-----AGTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPST 319
Query: 310 RSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP-DAFCMAVNPAS 368
YS D CY D P V F+GG ++ ++ + Y C+A +
Sbjct: 320 SGYSIFDT-CYD-FSKYDTVRIPKVGVTFKGGVEMDIDVSGILYPVNGLKKVCLAF---A 374
Query: 369 INNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
N+ + S+ G + Q+ Y V YD + + F C
Sbjct: 375 GNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGFAPGGCS 412
>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
Length = 444
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 106/432 (24%), Positives = 170/432 (39%), Gaps = 55/432 (12%)
Query: 13 NPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPV 72
P P R + L K+R H+ N V++++G PP
Sbjct: 28 RPAAKPRAFPLRARQVPAGALPRPPSKLRFHH-------------NVSLTVSLAVGTPPQ 74
Query: 73 PQFTAMDTGSSLLWVHCYPCRDCSPQ------LGTIFYPSRSSSYAYVPCDSEHC--RYF 124
+DTGS L W+ C R S +G F P S+++A VPC S C R
Sbjct: 75 NVTMVLDTGSELSWLLCATGRQGSAAAGAAAAMGESFRPRASATFAAVPCGSTQCSSRDL 134
Query: 125 PYA-RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF----S 179
P C +C + Y G + ++T+ F + + FGC S
Sbjct: 135 PAPPSCDGASRQCHVSLSYADGSASDGALATD--VFAVGEAPPLRS---AFGCMSTAYDS 189
Query: 180 TNRNFKFSGIFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDYL---HNKL-ILGDGAIIDE 234
+ +G+ G+ G S V+Q ++ FSYCI D L H+ L L
Sbjct: 190 SPDGVATAGLLGMNRGTLSFVTQASTRRFSYCISDRDDAGVLLLGHSDLPFLPLNYTPLY 249
Query: 235 GDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMIDSGTDVTWLVKEA 292
PL + D Y + L I V G+ L I ++ D +G G M+DSGT T+L+ +A
Sbjct: 250 QPTLPLPYFDRVAYSVQLLGIRVGGKALPIPASVLAPDHTGAGQTMVDSGTQFTFLLGDA 309
Query: 293 YEALRDEVMIR----LEGEQMRSYSWPDKL--CYHGIMSSDLKG--FPTVRFHFRGGAKL 344
Y AL+ E + + L S+++ + L C+ P V F GA++
Sbjct: 310 YSALKAEFLKQTKPLLRALDDPSFAFQEALDTCFRVPAGRPPPSARLPPVTLLFN-GAEM 368
Query: 345 ALEKDSMFYQPRPD------AFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQK 398
++ D + Y+ + +C+ A + + +IG Q V YD+ R +
Sbjct: 369 SVAGDRLLYKVPGEHRGADGVWCLTFGNADMVP--LTAYVIGHHHQMNLWVEYDLERGRV 426
Query: 399 TFQRMDCEVLDD 410
+ C+V +
Sbjct: 427 GLAPVKCDVASE 438
>gi|223950045|gb|ACN29106.1| unknown [Zea mays]
Length = 392
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 90/363 (24%), Positives = 145/363 (39%), Gaps = 59/363 (16%)
Query: 78 MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARC-------- 129
+DTGS++ W C SRS + + +PC S C C
Sbjct: 50 VDTGSNIFWTTEKEC-------------SRSKTRSMLPCCSPKCEQRASCGCRRSELKAE 96
Query: 130 SAYKHRCIYTQLYL--IGPETSVFVSTEQLTFKNTDESTI----HVQDVVFGCGFSTNRN 183
+ + +C Y Y T+ + ++LT + ++V GC S
Sbjct: 97 AEKETKCTYAIKYGGNANDSTAGVLYEDKLTIVAVASKAVPGSQSFEEVAIGCSTSATLK 156
Query: 184 FK---FSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT- 238
FK G+FGLG +SL QLN S FSYC+ S PD L + L+L + G
Sbjct: 157 FKDPSIKGVFGLGRSATSLPRQLNFSKFSYCLSSYQKPD-LPSYLLLTAAPDMATGAVGG 215
Query: 239 ----------PLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
P Y++ L+ IS+ G L P + + SGG + +D+GT T L
Sbjct: 216 AAAVATTALQPNSDYKTRYFVDLQGISIGGTRL---PAVSTK--SGGNMFVDTGTSFTRL 270
Query: 289 VKEAYEALRDEVMIRLEGEQMRSYSWPDK----LCYH--GIMSSDLKGFPTVRFHFRGGA 342
+ L E + R+ E+ P + +CY + + P + HF A
Sbjct: 271 EGTVFAKLVTE-LDRIMKERKYVKEQPGRNNGQICYSPPSTAADESSKLPDMVLHFADSA 329
Query: 343 KLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
+ L DS ++ C+A++ ++I S++G Q ++ D G ++ +F R
Sbjct: 330 NMVLPWDSYLWK-TTSKLCLAIDKSNIKG---GISVLGNFQMQNTHMLLDTGNEKLSFVR 385
Query: 403 MDC 405
DC
Sbjct: 386 ADC 388
>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
Length = 472
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 103/379 (27%), Positives = 153/379 (40%), Gaps = 41/379 (10%)
Query: 47 QAQILP---SNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTI 102
QA LP I + V + +G P DTGS + W C PC + C Q
Sbjct: 114 QATTLPVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPR 173
Query: 103 FYPSRSSSYAYVPCDSEHCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQL 157
PS S+SY + C S C+ + CS+ C+Y Y G + F +TE L
Sbjct: 174 LNPSTSTSYKNISCSSALCKLVASGKKFSQSCSS--STCLYQVQYGDGSYSIGFFATETL 231
Query: 158 TFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGI-GRSSLVSQLNSS----FSYCI- 211
T +++ ++ +FGCG N F + + +L SQ + FSYC+
Sbjct: 232 TLSSSNV----FKNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCLP 287
Query: 212 GSLHDPDYLHNKLILGDGAIIDEGDATPLQ--FIDGHYY-ITLEAISVDGRMLDINPNIF 268
S YL LG G + TPL F +Y + + +SV GR L I+ + F
Sbjct: 288 ASSSSKGYLS----LG-GQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAF 342
Query: 269 KRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVM-IRLEGEQMRSYSWPDKLCYHGIMSSD 327
G +IDSGT +T L AY L + + YS D CY D
Sbjct: 343 S-----AGTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDT-CYD-FSKYD 395
Query: 328 LKGFPTVRFHFRGGAKLALEKDSMFYQPRP-DAFCMAVNPASINNRYVNFSLIGMMAQQF 386
P V F+GG ++ ++ + Y C+A + N+ + S+ G + Q+
Sbjct: 396 TVRIPKVGVTFKGGVEMDIDVSGILYPVNGLKKVCLAF---AGNDDDSDTSIFGNVQQRT 452
Query: 387 YNVGYDIGRKQKTFQRMDC 405
Y V YD + + F C
Sbjct: 453 YQVVYDGAKGRVGFAPGGC 471
>gi|218186522|gb|EEC68949.1| hypothetical protein OsI_37668 [Oryza sativa Indica Group]
Length = 421
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 98/380 (25%), Positives = 160/380 (42%), Gaps = 54/380 (14%)
Query: 58 NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC-YPCRDCSPQLGTIFYPSRSSSYAYVPC 116
+ L+YV +SIG PP P F +DTGS L W+ C PC CS ++ P+++ VPC
Sbjct: 55 HGLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLYRPTKNK---LVPC 111
Query: 117 DSEHCRYFPYA-----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
+ C +C + K +C Y Y + + T+ + + S +
Sbjct: 112 VDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRLANSSIVR-PG 170
Query: 172 VVFGCGF-----STNRNFKFSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYL 220
+ FGCG+ S+ G+ GLG G SL+SQL + +C+ +
Sbjct: 171 LAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTRGG---- 226
Query: 221 HNKLILGDGAIIDEGDAT--PLQFIDGHYYITLEAISV--DGRMLDINPNIFKRDDSGGG 276
L GD I+ AT P+ Y + + ++ GR L + P
Sbjct: 227 -GFLFFGD-DIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPM---------E 275
Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEG--EQMRSYSWPDKLCYHGI--MSSDL---K 329
V+ DSG+ T+ + Y+AL D + L +++ +S P LC+ G S L K
Sbjct: 276 VVFDSGSSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLP--LCWKGKKPFKSVLDVKK 333
Query: 330 GFPTVRFHFRGGAKLALE---KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQF 386
F TV F G K +E ++ + +A +N + + + +N ++G + Q
Sbjct: 334 EFKTVVLSFSNGKKALMEIPPENYLIVTKYGNACLGILNGSEVGLKDLN--IVGDITMQD 391
Query: 387 YNVGYDIGRKQKTFQRMDCE 406
V YD R Q + R C+
Sbjct: 392 QMVIYDNERGQIGWIRAPCD 411
>gi|225440731|ref|XP_002280866.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 469
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 117/449 (26%), Positives = 176/449 (39%), Gaps = 78/449 (17%)
Query: 12 NNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPP 71
NP+ +P + + S+ R +L+ + NT P S + V++S G P
Sbjct: 45 KNPSSDPWQLLSHLTSASLTRAHHLKHR---KNTSSVNT-PLFAHSYGGYSVSLSFGTPS 100
Query: 72 VPQFTAMDTGSSLLWVHC---YPCRDCS-----PQLGTIFYPSRSSSYAYVPCDSEHCRY 123
MDTGSSL+W C Y C CS P F P SSS V C + C +
Sbjct: 101 QTLSFVMDTGSSLVWFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSSAKIVGCLNPKCGF 160
Query: 124 F----PYARCSAYKHR-------CIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
RC C + T + E L F E D
Sbjct: 161 VMDSEVRTRCPGCDQNSANCTKACPTYAIQYGLGTTVGLLLLESLVFAERTE-----PDF 215
Query: 173 VFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLH---DPDYLHNKLILGD 228
V GC ++R + SGI G G G SSL Q+ FSYC+ S P L +G
Sbjct: 216 VVGCSILSSR--QPSGIAGFGRGPSSLPKQMGLKKFSYCLLSHRFDDSPKSSKMTLYVGP 273
Query: 229 GAIIDEGDA---TPLQ--------FIDGHYYITLEAISVDGRMLDINPNIF--KRDDSGG 275
+ D+ TP + +YY+TL I V + + + P F D G
Sbjct: 274 DSKDDKTGGLSYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDKRVKV-PYSFMVAGSDGNG 332
Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD--------KLCYH--GIMS 325
G ++DSG+ T++ K +EA+ E QM +Y+ K C++ G+ S
Sbjct: 333 GTIVDSGSTFTFMEKPVFEAVATEF-----DRQMANYTRAADVEALSGLKPCFNLSGVGS 387
Query: 326 SDLKGFPTVRFHFRGGAKLALEKDSMF-YQPRPDAFCMAVNPASINNRYVNFSL------ 378
L P++ F F+GGAK+ L + F C+ + ++N V +L
Sbjct: 388 VAL---PSLVFQFKGGAKMELPVANYFSLVGDLSVLCLTI----VSNEAVGSTLSSGPSI 440
Query: 379 -IGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
+G Q + YD+ ++ F+R C+
Sbjct: 441 ILGNYQSQNFYTEYDLENERFGFRRQRCK 469
>gi|242041951|ref|XP_002468370.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
gi|241922224|gb|EER95368.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
Length = 408
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 91/352 (25%), Positives = 136/352 (38%), Gaps = 30/352 (8%)
Query: 61 FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
+ V +G P A+DT + W HC PC C G+ F P+ SSSYA +PC S+
Sbjct: 79 YVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTC--PAGSRFIPASSSSYASLPCASDW 136
Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS- 179
C F + + + +G V + Q + + CG++
Sbjct: 137 CPLF--------RRPAVPGEPGRVGAAADVRL--LQAASRTPRSGVLAATR----CGWAR 182
Query: 180 TNRNFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATP 239
T SG L S S+ N FSYC+ S Y L LG TP
Sbjct: 183 TPSPATRSGPMSL---LSQTGSRYNGVFSYCLPSYRS-YYFSGSLRLGAAGQPRNVRYTP 238
Query: 240 LQFIDGH----YYITLEAISVDGRMLDINPNIFKRDDS-GGGVMIDSGTDVTWLVKEAYE 294
L + H YY+ + +SV ++ F D S G G +IDSGT +T Y
Sbjct: 239 L-LTNPHRPSLYYVNVTGLSVGRALVKAPAGSFAFDPSTGAGTVIDSGTVITRWTAPVYA 297
Query: 295 ALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALE-KDSMFY 353
ALRDE ++ + C++ G P V H GG L L ++++ +
Sbjct: 298 ALRDEFRRQVAAPSGYTSLGAFDTCFN-TDEVAAGGAPPVTLHMGGGVDLTLPMENTLIH 356
Query: 354 QPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
C+A+ A N +++ + QQ V D+ + F R C
Sbjct: 357 SSATPLACLAMAEAP-QNVNSVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 407
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.322 0.139 0.431
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,815,809,171
Number of Sequences: 23463169
Number of extensions: 294895144
Number of successful extensions: 538252
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1207
Number of HSP's successfully gapped in prelim test: 1110
Number of HSP's that attempted gapping in prelim test: 531949
Number of HSP's gapped (non-prelim): 2711
length of query: 411
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 266
effective length of database: 8,957,035,862
effective search space: 2382571539292
effective search space used: 2382571539292
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 78 (34.7 bits)