BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 036636
(341 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255563739|ref|XP_002522871.1| DNA binding protein, putative [Ricinus communis]
gi|223537955|gb|EEF39569.1| DNA binding protein, putative [Ricinus communis]
Length = 414
Score = 180 bits (456), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 120/348 (34%), Positives = 184/348 (52%), Gaps = 34/348 (9%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYAIFDPRKSSSFQKINCDHPDCTYFKCVNEQCVYTM 60
+V++ IG P + L+ DTGSALI+ + + Q I F+C N +C YT
Sbjct: 92 LVKVRIGNPGIPLYLVPDTGSALIWTVNN-------QNI---------FQCRNNKCSYTR 135
Query: 61 KYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSR 120
+Y D S+T G AA + + +G + F+ FGCS DN F G GV+GL+
Sbjct: 136 RYDDGSITTGVAAQDILQ--SEGSERIPFY---FGCSRDNQNFSVFEHTGKSGGVMGLNT 190
Query: 121 VTISFISQLGSIIKKRFSYCLVIPLPNGEYT--SSYLKFGTDMGYRRPSTQATKFINHPN 178
+S + QL I ++RFSYCL P +G SS L+FG D+ R Q+T ++ P+
Sbjct: 191 SPVSLLQQLSHITQRRFSYCLN-PYQHGSEPPPSSLLRFGNDIRKGRRRFQSTPLMSSPD 249
Query: 179 N-FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVS 237
Y+L+L D+++ +R++ PP TF + G GG IIDSG+ LT+ Y +L F +
Sbjct: 250 RPNYFLNLLDMTVAGQRLHLPPGTFALRQDGTGGTIIDSGTGLTFITQTAYPRLISAFQN 309
Query: 238 YFER--FQLAQLSDCPEPIQLCYFLP--ETFNRFPSMAFYFEDANLRIDGENVFIIDYEN 293
YF+ FQ + PE LCY TF+ SM F+FE A+ + + V++ ++
Sbjct: 310 YFDHRGFQRVHI---PE-FDLCYSFRGNHTFHDHASMTFHFERADFTVQADYVYLPMEDD 365
Query: 294 HFFLLAVAP-HDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCSDDS 340
+ F +A+ P +IG+ Q +TRF+YD L F+ ENC +D+
Sbjct: 366 NAFCVALQPTPPQQRTVIGAINQGNTRFIYDAAAHQLLFIAENCRNDA 413
>gi|255563741|ref|XP_002522872.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223537956|gb|EEF39570.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 448
Score = 174 bits (440), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 120/368 (32%), Positives = 188/368 (51%), Gaps = 40/368 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+V++ IG+P + L+ DTGS L + IF+ S +++ + C H C
Sbjct: 92 LVKVIIGSPGVPLYLVPDTGSGLFWTQCEPCTRRFRQLPPIFNSTASRTYRDLPCQHQFC 151
Query: 47 T----YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
T F+C +++CVY + YA S T G AA + ++ E I FGCS DN
Sbjct: 152 TNNQNVFQCRDDKCVYRIAYAGGSATAGVAAQD---ILQSAENDRI--PFYFGCSRDNQN 206
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL-VIPLPNGEYTSSYLKFGTDM 161
F G G++GL+ +S + Q+ I K RFSYCL + L + + +S L+FG D+
Sbjct: 207 FSTFESSGKGGGIIGLNMSPVSLLQQMNHITKNRFSYCLNLFDLSSPSHATSLLRFGNDI 266
Query: 162 GYRRPSTQATKFIN---HPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
R +T F++ PN Y+L+L D+S+ RM PP TF + G GG IIDSG+
Sbjct: 267 RKSRRKYLSTPFVSPRGMPN--YFLNLIDVSVAGNRMQIPPGTFALKPDGTGGTIIDSGT 324
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL----CYFLP-ETFNRFPSMAFY 273
+TY Y+ + F +YF++ +++ IQL CY TF+ +PSMAF+
Sbjct: 325 AVTYISQTAYFPVITAFKNYFDQHGFQRVN-----IQLSGYICYKQQGHTFHNYPSMAFH 379
Query: 274 FEDANLRIDGENVFIIDYENHFFLLAVAP-HDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
F+ A+ ++ E V++ + F +A+ P +IG+ Q +T+F+YD L F
Sbjct: 380 FQGADFFVEPEYVYLTVQDRGAFCVALQPISPQQRTIIGALNQANTQFIYDAANRQLLFT 439
Query: 333 KENCSDDS 340
ENC D +
Sbjct: 440 PENCQDHA 447
>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 461
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 117/364 (32%), Positives = 182/364 (50%), Gaps = 45/364 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+++L IG+P + I+DTGS LI+ IFDP++SSSF KI+C C
Sbjct: 112 LMKLAIGSPPRSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKQSSSFYKISCSSELC 171
Query: 47 TYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
C ++ C Y Y D S T+G A ET + E + G FGC NDN+G
Sbjct: 172 GALPTSTCSSDGCEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLGFGCGNDNNG- 230
Query: 104 DEDARDG--ALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
DG AG++GL R +S +SQL +++F+YCL + S L G+ +
Sbjct: 231 -----DGFSQGAGLVGLGRGPLSLVSQLK---EQKFAYCLTAI---DDSKPSSLLLGS-L 278
Query: 162 GYRRPST-----QATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
P T + T I +P+ +FYYLSL+ IS+ +++ P TF++ G GG II
Sbjct: 279 ANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDDGSGGVII 338
Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR--FPSMAF 272
DSG+ +TY + + L +F++ + L + LC+ LP N+ P + F
Sbjct: 339 DSGTTITYVENSAFTSLKNEFIA---QMNLPVDDSGTGGLDLCFNLPAGTNQVEVPKLTF 395
Query: 273 YFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
+F+ A+L + GEN I D + LA+ + ++ G+ QQ++ V+DL + LSF+
Sbjct: 396 HFKGADLELPGENYMIGDSKAGLLCLAIGSSRGM-SIFGNLQQQNFMVVHDLQEETLSFL 454
Query: 333 KENC 336
C
Sbjct: 455 PTQC 458
>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like, partial [Cucumis sativus]
Length = 716
Score = 164 bits (416), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 117/364 (32%), Positives = 182/364 (50%), Gaps = 45/364 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+++L IG+P + I+DTGS LI+ IFDP++SSSF KI+C C
Sbjct: 367 LMKLAIGSPPRSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKQSSSFYKISCSSELC 426
Query: 47 TYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
C ++ C Y Y D S T+G A ET + E + G FGC NDN+G
Sbjct: 427 GALPTSTCSSDGCEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLGFGCGNDNNG- 485
Query: 104 DEDARDG--ALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
DG AG++GL R +S +SQL +++F+YCL + S L G+ +
Sbjct: 486 -----DGFSQGAGLVGLGRGPLSLVSQLK---EQKFAYCLTAI---DDSKPSSLLLGS-L 533
Query: 162 GYRRPST-----QATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
P T + T I +P+ +FYYLSL+ IS+ +++ P TF++ G GG II
Sbjct: 534 ANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDDGSGGVII 593
Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR--FPSMAF 272
DSG+ +TY + + L +F++ + L + LC+ LP N+ P + F
Sbjct: 594 DSGTTITYVENSAFTSLKNEFIA---QMNLPVDDSGTGGLDLCFNLPAGTNQVEVPKLTF 650
Query: 273 YFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
+F+ A+L + GEN I D + LA+ + ++ G+ QQ++ V+DL + LSF+
Sbjct: 651 HFKGADLELPGENYMIGDSKAGLLCLAIGSSRGM-SIFGNLQQQNFMVVHDLQEETLSFL 709
Query: 333 KENC 336
C
Sbjct: 710 PTQC 713
>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 443
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 116/363 (31%), Positives = 169/363 (46%), Gaps = 37/363 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ + IGTP + ILDTGS LI+ FDP +S S+ K+ C+ P C
Sbjct: 90 LMSMGIGTPPRYYSAILDTGSDLIWTQCAPCMLCVDQPTPFFDPAQSPSYAKLPCNSPMC 149
Query: 47 T---YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
Y C CVY Y D + T G ++ET + G + + FGC N N G
Sbjct: 150 NALYYPLCYRNVCVYQYFYGDSANTAGVLSNETFT-FGTNDTRVTVPRIAFGCGNLNAGS 208
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL---VIPLPNGEYTSSYLKFGTD 160
+ +G++G R +S +SQLGS RFSYCL + P+P+ Y +Y +
Sbjct: 209 LFNG-----SGMVGFGRGPLSLVSQLGS---PRFSYCLTSFMSPVPSRLYFGAYATLNST 260
Query: 161 MGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDIT-VSGEGGCIIDSG 217
Q+T FI +P YYL++ IS+ E + P F I G GG IIDSG
Sbjct: 261 SASTGEPVQSTPFIVNPGLPTMYYLNMTGISVGGELLPIDPSVFAINDADGTGGVIIDSG 320
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR---FPSMAFYF 274
S +TY Y +H+ F L + + + C+ P + P +AF+F
Sbjct: 321 STITYLARAAYDMVHQAFADQVG-LPLTNATSLADVLDTCFVWPPPPRKIVTMPELAFHF 379
Query: 275 EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
E AN+ + EN +ID + LA+A DD ++IGS Q ++ +YD LLSF
Sbjct: 380 EGANMELPLENYMLIDGDTGNLCLAIAASDD-GSIIGSFQHQNFHVLYDNENSLLSFTPA 438
Query: 335 NCS 337
C+
Sbjct: 439 TCN 441
>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
Length = 383
Score = 155 bits (391), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 109/357 (30%), Positives = 171/357 (47%), Gaps = 40/357 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYAIFDP------------RKSSSFQKINCDHPDC-- 46
++++ IGTP+ + I+DTGS L++ +P SS++ K+ C C
Sbjct: 43 LIQMAIGTPALSLSAIMDTGSDLVWTKCNPCTDCSTSSIYDPSSSSTYSKVLCQSSLCQP 102
Query: 47 -TYFKCVNE-QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
+ F C N+ C Y Y D+S T G + ET S+ + FGC +DN GFD
Sbjct: 103 PSIFSCNNDGDCEYVYPYGDRSSTSGILSDETFSISSQS-----LPNITFGCGHDNQGFD 157
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
+ + G++G R ++S +SQLG + +FSYCLV + + +S L G
Sbjct: 158 K------VGGLVGFGRGSLSLVSQLGPSMGNKFSYCLVSRTDSSK--TSPLFIGNTASLE 209
Query: 165 RPSTQATKFINHPN-NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
+ +T + + N YYLSL+ IS+ + + P TFDI G GG IIDSG+ LT+
Sbjct: 210 ATTVGSTPLVQSSSTNHYYLSLEGISVGGQSLAIPTGTFDIQSDGSGGLIIDSGTTLTFL 269
Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-FPSMAFYFEDANLRID 282
Y + E VS L + LC+ + N FPSM F+F+ A+ +
Sbjct: 270 QQTAYDAVKEAMVSSIN------LPQADGQLDLCFNQQGSSNPGFPSMTFHFKGADYDVP 323
Query: 283 GENVFIIDYENHFFLLAVAPHDD---LVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
EN D + LA+ P + +A+ G+ QQ++ + +YD ++LSF C
Sbjct: 324 KENYLFPDSTSDIVCLAMMPTNSNLGNMAIFGNVQQQNYQILYDNENNVLSFAPTAC 380
>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 436
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 115/358 (32%), Positives = 174/358 (48%), Gaps = 44/358 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+++L IGTP++ I+DTGS LI+ IFDP+KSSSF K+ C C
Sbjct: 98 LMKLAIGTPAETYSAIMDTGSDLIWTQCKPCKDCFDQPTPIFDPKKSSSFSKLPCSSDLC 157
Query: 47 TYF--KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS--NDNHG 102
++ C Y Y D S T+G A ET + G A FGC ND G
Sbjct: 158 AALPISSCSDGCEYLYSYGDYSSTQGVLATETFAF-----GDASVSKIGFGCGEDNDGSG 212
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
F + A G++GL R +S ISQLG + +FSYCL + + + SS L G++
Sbjct: 213 FSQGA------GLVGLGRGPLSLISQLG---EPKFSYCLT-SMDDSKGISSLL-VGSEAT 261
Query: 163 YRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
+ T T I +P+ +FYYLSL+ IS+ + + TF I G GG IIDSG+ +
Sbjct: 262 MKNAIT--TPLIQNPSQPSFYYLSLEGISVGDTLLPIEKSTFSIQNDGSGGLIIDSGTTI 319
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR--FPSMAFYFEDAN 278
TY + L ++F+S + +L + LC+ LP + P + F+FE A+
Sbjct: 320 TYLEDSAFAALKKEFIS---QLKLDVDESGSTGLDLCFTLPPDASTVDVPQLVFHFEGAD 376
Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
L++ EN I D L + + ++ G+ QQ++ ++DL + +SF C
Sbjct: 377 LKLPAENYIIADSGLGVICLTMGSSSGM-SIFGNFQQQNIVVLHDLEKETISFAPAQC 433
>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
Length = 441
Score = 150 bits (380), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 107/361 (29%), Positives = 163/361 (45%), Gaps = 37/361 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+V L IGTP I+DTGS LI+ FD ++S++++ + C C
Sbjct: 90 LVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCAAQPTPYFDVKRSATYRALPCRSSRC 149
Query: 47 TYFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
C + CVY Y D + T G A+ET + K FGC + N G
Sbjct: 150 AALSSPSCFKKMCVYQYYYGDTASTAGVLANETFTFGAASSTKVRAANISFGCGSLNAGE 209
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL---VIPLPNGEYTSSYLKFGTD 160
++ +G++G R +S +SQLG RFSYCL + P P+ Y + +
Sbjct: 210 LANS-----SGMVGFGRGPLSLVSQLG---PSRFSYCLTSYLSPTPSRLYFGVFANLNST 261
Query: 161 MGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
Q+T F+ +P N Y+LS+K IS+ +R+ P F I G GG IIDSG+
Sbjct: 262 NTSSGSPVQSTPFVINPALPNMYFLSVKGISLGTKRLPIDPLVFAINDDGTGGVIIDSGT 321
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN---RFPSMAFYFE 275
+T+ D Y + S L ++D + C+ P N P F+F+
Sbjct: 322 SITWLQQDAYEAVRRGLAS---TIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDFVFHFD 378
Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
AN+ + EN +I + LA+AP + +IG+ QQ++ +YD+ LSFV
Sbjct: 379 GANMTLPPENYMLIASTTGYLCLAMAP-TSVGTIIGNYQQQNLHLLYDIANSFLSFVPAP 437
Query: 336 C 336
C
Sbjct: 438 C 438
>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 461
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 113/369 (30%), Positives = 172/369 (46%), Gaps = 51/369 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ L IG P+ I+DTGS LI+ IFDP KSSS+ K+ C C
Sbjct: 108 LMELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLC 167
Query: 47 TYFKCVN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS--ND 99
N + C Y Y D S T+G A ET + E + G FGC N+
Sbjct: 168 NALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTF----EDENSISGIGFGCGVENE 223
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY----L 155
GF + + G++GL R +S ISQL + +FSYCL + + E +SS L
Sbjct: 224 GDGFSQGS------GLVGLGRGPLSLISQLK---ETKFSYCLT-SIEDSEASSSLFIGSL 273
Query: 156 KFG----TDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
G T T+ + +P+ +FYYL L+ I++ +R++ TF++ G
Sbjct: 274 ASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGT 333
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRF-- 267
GG IIDSG+ +TY + L E+F S R L + LC+ LP+
Sbjct: 334 GGMIIDSGTTITYLEETAFKVLKEEFTS---RMSLPVDDSGSTGLDLCFKLPDAAKNIAV 390
Query: 268 PSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNID 327
P M F+F+ A+L + GEN + D LA+ + + ++ G+ QQ++ ++DL +
Sbjct: 391 PKMIFHFKGADLELPGENYMVADSSTGVLCLAMGSSNGM-SIFGNVQQQNFNVLHDLEKE 449
Query: 328 LLSFVKENC 336
+SFV C
Sbjct: 450 TVSFVPTEC 458
>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 118/358 (32%), Positives = 181/358 (50%), Gaps = 38/358 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ L IGTP +LDTGS LI+ IFDP+KSSSF K++C C
Sbjct: 109 LIELAIGTPPVSYPAVLDTGSDLIWTQCKPCTRCYKQPTPIFDPKKSSSFSKVSCGSSLC 168
Query: 47 TYF--KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
+ ++ C Y Y D S+T+G A ET + GK + K H FGC DN G
Sbjct: 169 SALPSSTCSDGCEYVYSYGDYSMTQGVLATETFT-FGKSKNKVSVHNIGFGCGEDNEG-- 225
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
D + A +G++GL R +S +SQL ++RFSYCL P + S L G+ +G
Sbjct: 226 -DGFEQA-SGLVGLGRGPLSLVSQLK---EQRFSYCLT---PIDDTKESVLLLGS-LGKV 276
Query: 165 RPSTQ--ATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
+ + + T + +P +FYYLSL+ IS+ + R++ TF++ G GG IIDSG+ +
Sbjct: 277 KDAKEVVTTPLLKNPLQPSFYYLSLEAISVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTI 336
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP--ETFNRFPSMAFYFEDAN 278
TY Y L ++F+S + +LA + LC+ LP T P + F+F+ +
Sbjct: 337 TYVQQKAYEALKKEFIS---QTKLALDKTSSTGLDLCFSLPSGSTQVEIPKLVFHFKGGD 393
Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
L + EN I D LA+ + ++ G+ QQ++ +DL + +SFV +C
Sbjct: 394 LELPAENYMIGDSNLGVACLAMGASSGM-SIFGNVQQQNILVNHDLEKETISFVPTSC 450
>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
Length = 444
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 111/365 (30%), Positives = 169/365 (46%), Gaps = 43/365 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ + IGTP++ ILDTGS LI+ FDP SS+++ + C P C
Sbjct: 93 LMEMGIGTPARFYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPANSSTYRSLGCSAPAC 152
Query: 47 T---YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
Y C + CVY Y D + T G A+ET + G + + FGC N N G
Sbjct: 153 NALYYPLCYQKTCVYQYFYGDSASTAGVLANETFT-FGTNDTRVTLPRISFGCGNLNAGS 211
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL---VIPLPNGEYTSSYLKFGTD 160
+ +G++G R ++S +SQLGS RFSYCL + P+ + Y +Y +
Sbjct: 212 LANG-----SGMVGFGRGSLSLVSQLGS---PRFSYCLTSFLSPVRSRLYFGAYATLNST 263
Query: 161 MGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDIT-VSGEGGCIIDSG 217
+ Q+T FI +P Y+L++ IS+ R+ P I G GG IIDSG
Sbjct: 264 ---NASTVQSTPFIINPALPTMYFLNMTGISVGGNRLPIDPAVLAINDTDGTGGTIIDSG 320
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP--IQLCYFLPETFNR---FPSMAF 272
+ +TY Y+ + E FV Y L D E + C+ P + P +
Sbjct: 321 TTITYLAEPAYYAVREAFVLYLN--STLPLLDVTETSVLDTCFQWPPPPRQSVTLPQLVL 378
Query: 273 YFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
+F+ A+ + +N ++D LA+A D ++IGS Q ++ +YDL LLSFV
Sbjct: 379 HFDGADWELPLQNYMLVDPSTGGLCLAMATSSD-GSIIGSYQHQNFNVLYDLENSLLSFV 437
Query: 333 KENCS 337
C+
Sbjct: 438 PAPCN 442
>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
Length = 452
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 112/366 (30%), Positives = 177/366 (48%), Gaps = 47/366 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ + IGTP+ I+DTGS L++ +FDP SS++ + C C
Sbjct: 101 LMDVAIGTPALSYAAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSALC 160
Query: 47 TYF---KCVN-EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC--SNDN 100
+ C + +C YT Y D S T+G A ET ++ G+ K G FGC +N+
Sbjct: 161 SDLPTSTCTSASKCGYTYTYGDASSTQGVLASETFTL---GKEKKKLPGVAFGCGDTNEG 217
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
GF + A G++GL R +S +SQLG +FSYCL L +G+ S L G+
Sbjct: 218 DGFTQGA------GLVGLGRGPLSLVSQLG---LDKFSYCLT-SLDDGDGKSPLLLGGSA 267
Query: 161 MGYRRPS----TQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
+ Q T + +P+ +FYY+SL +++ + R+ P F I G GG I+
Sbjct: 268 AAISESAATAPVQTTPLVKNPSQPSFYYVSLTGLTVGSTRITLPASAFAIQDDGTGGVIV 327
Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF---NRFPSMA 271
DSG+ +TY Y L + FV+ + L + + LC+ P + P +
Sbjct: 328 DSGTSITYLELQGYRALKKAFVA---QMALPTVDGSEIGLDLCFQGPAKGVDEVQVPKLV 384
Query: 272 FYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
+F+ A+L + EN ++D + L VAP L ++IG+ QQ++ +FVYD+ D LS
Sbjct: 385 LHFDGGADLDLPAENYMVLDSASGALCLTVAPSRGL-SIIGNFQQQNFQFVYDVAGDTLS 443
Query: 331 FVKENC 336
F C
Sbjct: 444 FAPVQC 449
>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
Length = 437
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 107/358 (29%), Positives = 168/358 (46%), Gaps = 41/358 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ L IGTP++ I+DTGS LI+ IF+P+ SSSF + C C
Sbjct: 96 LMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQLC 155
Query: 47 TYFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
+ C N C YT Y D S T+G ET++ G FGC +N GF
Sbjct: 156 QALQSPTCSNNSCQYTYGYGDGSETQGSMGTETLTF-----GSVSIPNITFGCGENNQGF 210
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
+ G AG++G+ R +S SQL +FSYC+ P G TSS L G+
Sbjct: 211 GQ----GNGAGLVGMGRGPLSLPSQLD---VTKFSYCMT---PIGSSTSSTLLLGSLANS 260
Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDI-TVSGEGGCIIDSGSVL 220
+ T I FYY++L +S+ + + P F + + +G GG IIDSG+ L
Sbjct: 261 VTAGSPNTTLIESSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGIIIDSGTTL 320
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP--ETFNRFPSMAFYFEDAN 278
TYF + Y + + F+S + L+ ++ LC+ +P ++ + P+ +F+ +
Sbjct: 321 TYFADNAYQAVRQAFIS---QMNLSVVNGSSSGFDLCFQMPSDQSNLQIPTFVMHFDGGD 377
Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
L + EN F I N LA+ +++ G+ QQ++ VYD ++SF+ C
Sbjct: 378 LVLPSENYF-ISPSNGLICLAMGSSSQGMSIFGNIQQQNLLVVYDTGNSVVSFLFAQC 434
>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 353
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 113/368 (30%), Positives = 171/368 (46%), Gaps = 51/368 (13%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
+ L IG P+ I+DTGS LI+ IFDP KSSS+ K+ C C
Sbjct: 1 MELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCN 60
Query: 48 YFKCVN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS--NDN 100
N + C Y Y D S T+G A ET + E + G FGC N+
Sbjct: 61 ALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTF----EDENSISGIGFGCGVENEG 116
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY----LK 156
GF + + G++GL R +S ISQL + +FSYCL + + E +SS L
Sbjct: 117 DGFSQGS------GLVGLGRGPLSLISQLK---ETKFSYCLT-SIEDSEASSSLFIGSLA 166
Query: 157 FG----TDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
G T T+ + +P+ +FYYL L+ I++ +R++ TF++ G G
Sbjct: 167 SGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTG 226
Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRF--P 268
G IIDSG+ +TY + L E+F S R L + LC+ LP+ P
Sbjct: 227 GMIIDSGTTITYLEETAFKVLKEEFTS---RMSLPVDDSGSTGLDLCFKLPDAAKNIAVP 283
Query: 269 SMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDL 328
M F+F+ A+L + GEN + D LA+ + + ++ G+ QQ++ ++DL +
Sbjct: 284 KMIFHFKGADLELPGENYMVADSSTGVLCLAMGSSNGM-SIFGNVQQQNFNVLHDLEKET 342
Query: 329 LSFVKENC 336
+SFV C
Sbjct: 343 VSFVPTEC 350
>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 441
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 109/361 (30%), Positives = 162/361 (44%), Gaps = 37/361 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+V L IGTP I+DTGS LI+ FD +KS++++ + C C
Sbjct: 90 LVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRC 149
Query: 47 TYFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
C + CVY Y D + T G A+ET + K FGC + N G
Sbjct: 150 ASLSSPSCFKKMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLNAGD 209
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPL---PNGEYTSSYLKFGTD 160
++ +G++G R +S +SQLG RFSYCL L P+ Y Y +
Sbjct: 210 LANS-----SGMVGFGRGPLSLVSQLG---PSRFSYCLTSYLSATPSRLYFGVYANLSST 261
Query: 161 MGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
Q+T F+ +P N Y+LSLK IS+ + + P F I G GG IIDSG+
Sbjct: 262 NTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGT 321
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN---RFPSMAFYFE 275
+T+ D Y + VS L ++D + C+ P N P + F+F+
Sbjct: 322 SITWLQQDAYEAVRRGLVS---AIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDLVFHFD 378
Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
AN+ + EN +I + L +AP + +IG+ QQ++ +YD+ LSFV
Sbjct: 379 SANMTLLPENYMLIASTTGYLCLVMAP-TGVGTIIGNYQQQNLHLLYDIGNSFLSFVPAP 437
Query: 336 C 336
C
Sbjct: 438 C 438
>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
Length = 437
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 105/355 (29%), Positives = 170/355 (47%), Gaps = 36/355 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ + IGTP+ + I+DTGS LI+ IF+P+ SSSF + C+ C
Sbjct: 97 LMNVAIGTPASSLSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYC 156
Query: 47 TYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
C N+ C YT Y D S T+G+ A ET + E ++ + A FGC DN GF
Sbjct: 157 QDLPSESCYND-CQYTYGYGDGSSTQGYMATETFTF----ETSSVPNIA-FGCGEDNQGF 210
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
+ G AG++G+ +S SQLG +FSYC+ + T + + +
Sbjct: 211 GQ----GNGAGLIGMGWGPLSLPSQLG---VGQFSYCMTSSGSSSPSTLALGSAASGVPE 263
Query: 164 RRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
PST +P +YY++L+ I++ + + P TF + G GG IIDSG+ LTY
Sbjct: 264 GSPSTTLIHSSLNPT-YYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYL 322
Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPE--TFNRFPSMAFYFEDANLRI 281
D Y + + F ++ L+ + + + C+ LP + + P ++ F+ L +
Sbjct: 323 PQDAYNAVAQAFT---DQINLSPVDESSSGLSTCFQLPSDGSTVQVPEISMQFDGGVLNL 379
Query: 282 DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
ENV I E L + +++ G+ QQ++T+ +YDL +SFV C
Sbjct: 380 GEENVLISPAEGVICLAMGSSSQQGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 434
>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
Length = 437
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 106/358 (29%), Positives = 167/358 (46%), Gaps = 41/358 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ L IGTP++ I+DTGS LI+ IF+P+ SSSF + C C
Sbjct: 96 LMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQLC 155
Query: 47 TYFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
+ C N C YT Y D S T+G ET++ G FGC +N GF
Sbjct: 156 QALQSPTCSNNSCQYTYGYGDGSETQGSMGTETLTF-----GSVSIPNITFGCGENNQGF 210
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
+ G AG++G+ R +S SQL +FSYC+ P G SS L G+
Sbjct: 211 GQ----GNGAGLVGMGRGPLSLPSQLD---VTKFSYCMT---PIGSSNSSTLLLGSLANS 260
Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDI-TVSGEGGCIIDSGSVL 220
+ T I FYY++L +S+ + + P F + + +G GG IIDSG+ L
Sbjct: 261 VTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGIIIDSGTTL 320
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP--ETFNRFPSMAFYFEDAN 278
TYF + Y + + F+S + L+ ++ LC+ +P ++ + P+ +F+ +
Sbjct: 321 TYFVDNAYQAVRQAFIS---QMNLSVVNGSSSGFDLCFQMPSDQSNLQIPTFVMHFDGGD 377
Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
L + EN F I N LA+ +++ G+ QQ++ VYD ++SF+ C
Sbjct: 378 LVLPSENYF-ISPSNGLICLAMGSSSQGMSIFGNIQQQNLLVVYDTGNSVVSFLSAQC 434
>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
Full=Nepenthesin-I; Flags: Precursor
gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
Length = 437
Score = 148 bits (374), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 107/358 (29%), Positives = 165/358 (46%), Gaps = 41/358 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ L IGTP++ I+DTGS LI+ IF+P+ SSSF + C C
Sbjct: 96 LMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQLC 155
Query: 47 TYFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
C N C YT Y D S T+G ET++ G FGC +N GF
Sbjct: 156 QALSSPTCSNNFCQYTYGYGDGSETQGSMGTETLTF-----GSVSIPNITFGCGENNQGF 210
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
+ G AG++G+ R +S SQL +FSYC+ P G T S L G+
Sbjct: 211 GQ----GNGAGLVGMGRGPLSLPSQLD---VTKFSYCMT---PIGSSTPSNLLLGSLANS 260
Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDI-TVSGEGGCIIDSGSVL 220
+ T I FYY++L +S+ + R+ P F + + +G GG IIDSG+ L
Sbjct: 261 VTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTL 320
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN--RFPSMAFYFEDAN 278
TYF ++ Y + ++F+S + L ++ LC+ P + + P+ +F+ +
Sbjct: 321 TYFVNNAYQSVRQEFIS---QINLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGGD 377
Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
L + EN F I N LA+ +++ G+ QQ++ VYD ++SF C
Sbjct: 378 LELPSENYF-ISPSNGLICLAMGSSSQGMSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434
>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
Length = 436
Score = 148 bits (373), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 112/356 (31%), Positives = 171/356 (48%), Gaps = 40/356 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ L IGTP++ I+DTGS LI+ IFDP KSSSF K+ C C
Sbjct: 98 LMNLAIGTPAETYSAIMDTGSDLIWTQCKPCKVCFDQPTPIFDPEKSSSFSKLPCSSDLC 157
Query: 47 TYF--KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
++ C Y Y D S T+G A ET + G A FGC DN G
Sbjct: 158 VALPISSCSDGCEYRYSYGDHSSTQGVLATETFTF-----GDASVSKIGFGCGEDNRGRA 212
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
AG++GL R +S ISQLG +FSYCL + + + S+ L G++ +
Sbjct: 213 YSQG----AGLVGLGRGPLSLISQLG---VPKFSYCLT-SIDDSKGISTLL-VGSEATVK 263
Query: 165 RPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
S T I +P+ +FYYLSL+ IS+ + + TF I G GG IIDSG+ +TY
Sbjct: 264 --SAIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTITY 321
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN--RFPSMAFYFEDANLR 280
+ + L ++F+S + +L + ++LC+ LP + P + F+FE +L+
Sbjct: 322 LKDNAFAALKKEFIS---QMKLDVDASGSTELELCFTLPPDGSPVEVPQLVFHFEGVDLK 378
Query: 281 IDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ EN I D L + + ++ G+ QQ++ ++DL + +SF C
Sbjct: 379 LPKENYIIEDSALRVICLTMGSSSGM-SIFGNFQQQNIVVLHDLEKETISFAPAQC 433
>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
Full=Nepenthesin-II; Flags: Precursor
gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
Length = 438
Score = 148 bits (373), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 104/357 (29%), Positives = 168/357 (47%), Gaps = 39/357 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ + IGTP I+DTGS LI+ IF+P+ SSSF + C+ C
Sbjct: 97 LMNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYC 156
Query: 47 TYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
C N +C YT Y D S T+G+ A ET + E ++ + A FGC DN GF
Sbjct: 157 QDLPSETCNNNECQYTYGYGDGSTTQGYMATETFTF----ETSSVPNIA-FGCGEDNQGF 211
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
+ G AG++G+ +S SQLG +FSYC+ G + S L G+
Sbjct: 212 GQ----GNGAGLIGMGWGPLSLPSQLG---VGQFSYCMT---SYGSSSPSTLALGSAASG 261
Query: 164 RRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
+ +T I+ N +YY++L+ I++ + + P TF + G GG IIDSG+ LT
Sbjct: 262 VPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLT 321
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPE--TFNRFPSMAFYFEDANL 279
Y D Y + + F ++ L + + + C+ P + + P ++ F+ L
Sbjct: 322 YLPQDAYNAVAQAFT---DQINLPTVDESSSGLSTCFQQPSDGSTVQVPEISMQFDGGVL 378
Query: 280 RIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ +N+ I E L + +++ G+ QQ++T+ +YDL +SFV C
Sbjct: 379 NLGEQNILISPAEGVICLAMGSSSQLGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 435
>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 462
Score = 147 bits (372), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 111/369 (30%), Positives = 172/369 (46%), Gaps = 51/369 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ L IG P+ I+DTGS LI+ IFDP KSSS+ K+ C C
Sbjct: 109 LMELSIGNPAVKYAAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLC 168
Query: 47 TYFKCVN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS--ND 99
N + C Y Y D S T+G A ET + E + G FGC N+
Sbjct: 169 NALPRSNCNEDKDSCEYLYTYGDYSSTRGLLATETFTF----EDENSISGIGFGCGVENE 224
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY----L 155
GF + + G++GL R +S ISQL + +FSYCL + + E +SS L
Sbjct: 225 GDGFSQGS------GLVGLGRGPLSLISQLK---ETKFSYCLT-SIEDSEASSSLFIGSL 274
Query: 156 KFG----TDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
G T T+ + +P+ +FYYL L+ I++ +R++ TF+++ G
Sbjct: 275 ASGIVNKTGANLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELSEDGT 334
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRF-- 267
GG IIDSG+ +TY + L E+F S R L + LC+ LP
Sbjct: 335 GGMIIDSGTTITYLEETAFKVLKEEFTS---RMSLPVDDSGSTGLDLCFKLPNAAKNIAV 391
Query: 268 PSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNID 327
P + F+F+ A+L + GEN + D LA+ + + ++ G+ QQ++ ++DL +
Sbjct: 392 PKLIFHFKGADLELPGENYMVADSSTGVLCLAMGSSNGM-SIFGNVQQQNFNVLHDLEKE 450
Query: 328 LLSFVKENC 336
++FV C
Sbjct: 451 TVTFVPTEC 459
>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 435
Score = 147 bits (372), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 109/363 (30%), Positives = 178/363 (49%), Gaps = 45/363 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
++R +IG+P L ++DTGS+LI+ +F+P KSS+++ CD C
Sbjct: 90 LMRFYIGSPPVERLAMVDTGSSLIWLQCSPCHNCFPQETPLFEPLKSSTYKYATCDSQPC 149
Query: 47 TYFKCVNE------QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAI-FHGALFGCSND 99
T + QC+Y + Y D+S + G ET+S G + + F +FGC D
Sbjct: 150 TLLQPSQRDCGKLGQCIYGIMYGDKSFSVGILGTETLSFGSTGGAQTVSFPNTIFGCGVD 209
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
N+ F + + G+ GL +S +SQLG+ I +FSYCL LP ++S LKFG+
Sbjct: 210 NN-FTIYTSNKVM-GIAGLGAGPLSLVSQLGAQIGHKFSYCL---LPYDSTSTSKLKFGS 264
Query: 160 DMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
+ +T I P+ +Y+L+L+ ++I + ++ T +G +IDSG
Sbjct: 265 EAIITTNGVVSTPLIIKPSLPTYYFLNLEAVTIGQKVVS--------TGQTDGNIVIDSG 316
Query: 218 SVLTYFHSDVYWKLHEKFVSYF-ERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFE 275
+ LTY + Y FV+ E + L D P P++ C+ P N P +AF F
Sbjct: 317 TPLTYLENTFY----NNFVASLQETLGVKLLQDLPSPLKTCF--PNRANLAIPDIAFQFT 370
Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLNIDLLSFVKE 334
A++ + +NV I +++ LAV P + ++L GS Q D + YDL +SF
Sbjct: 371 GASVALRPKNVLIPLTDSNILCLAVVPSSGIGISLFGSIAQYDFQVEYDLEGKKVSFAPT 430
Query: 335 NCS 337
+C+
Sbjct: 431 DCA 433
>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 372
Score = 147 bits (371), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 109/366 (29%), Positives = 173/366 (47%), Gaps = 50/366 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
+V +++GTP + ++I+DTGS L + IFDP KSS++ KI C C
Sbjct: 26 LVPIYLGTPPQKAVVIIDTGSDLTWIQSEPCRACFEQADPIFDPSKSSTYNKIACSSSAC 85
Query: 47 -----TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVI-GKGE----GKAIFHGALFGC 96
T C+Y Y D SVT+G+ + ETI+ GE G ++++ FG
Sbjct: 86 ADLLGTQTCSAAANCIYAYGYGDGSVTRGYFSKETITATDTAGEEVKFGASVYNTGTFG- 144
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
D G+LGL + +S SQLGS++ +FSYCLV L G TS+ +
Sbjct: 145 ------------DTGGEGILGLGQGPVSMPSQLGSVLGNKFSYCLVDWLSAGSETST-MY 191
Query: 157 FGTDMGYRRPSTQATKFI---NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
FG D Q T + +HP +YY++++ IS+ ++ ++I G GG I
Sbjct: 192 FG-DAAVPSGEVQYTPIVPNADHPT-YYYIAVQGISVGGSLLDIDQSVYEIDSGGSGGTI 249
Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-FPSMAF 272
IDSG+ +TY +V+ L + S + + LC+ T + FP+M
Sbjct: 250 IDSGTTITYLQQEVFNALVAAYTSQVRYPTTTSATG----LDLCFNTRGTGSPVFPAMTI 305
Query: 273 YFEDANLRIDGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLNIDLLSF 331
+ + +L + N F I E + LA A D +A+ G+ QQ++ VYDL+ + F
Sbjct: 306 HLDGVHLELPTANTF-ISLETNIICLAFASALDFPIAIFGNIQQQNFDIVYDLDNMRIGF 364
Query: 332 VKENCS 337
+C+
Sbjct: 365 APADCA 370
>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 454
Score = 147 bits (370), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 112/367 (30%), Positives = 175/367 (47%), Gaps = 52/367 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ + IGTP+ I+DTGS L++ +FDP SS++ + C C
Sbjct: 106 LMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASC 165
Query: 47 TYF---KCVN-EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC--SNDN 100
+ KC + +C YT Y D S T+G A ET ++ K+ G +FGC +N+
Sbjct: 166 SDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTL-----AKSKLPGVVFGCGDTNEG 220
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
GF + A G++GL R +S +SQLG +FSYCL + +S L G+
Sbjct: 221 DGFSQGA------GLVGLGRGPLSLVSQLG---LDKFSYCLT---SLDDTNNSPLLLGSL 268
Query: 161 MG-----YRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
G S Q T I +P+ +FYY+SLK I++ + R++ P F + G GG I
Sbjct: 269 AGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVI 328
Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF---NRFPSM 270
+DSG+ +TY Y L + F + + L + LC+ P P +
Sbjct: 329 VDSGTSITYLEVQGYRALKKAFAA---QMALPAADGSGVGLDLCFRAPAKGVDQVEVPRL 385
Query: 271 AFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
F+F+ A+L + EN ++D + L V L ++IG+ QQ++ +FVYD+ D L
Sbjct: 386 VFHFDGGADLDLPAENYMVLDGGSGALCLTVMGSRGL-SIIGNFQQQNFQFVYDVGHDTL 444
Query: 330 SFVKENC 336
SF C
Sbjct: 445 SFAPVQC 451
>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
Length = 444
Score = 147 bits (370), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 112/367 (30%), Positives = 175/367 (47%), Gaps = 52/367 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ + IGTP+ I+DTGS L++ +FDP SS++ + C C
Sbjct: 96 LMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASC 155
Query: 47 TYF---KCVN-EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC--SNDN 100
+ KC + +C YT Y D S T+G A ET ++ K+ G +FGC +N+
Sbjct: 156 SDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTL-----AKSKLPGVVFGCGDTNEG 210
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
GF + A G++GL R +S +SQLG +FSYCL + +S L G+
Sbjct: 211 DGFSQGA------GLVGLGRGPLSLVSQLG---LDKFSYCLT---SLDDTNNSPLLLGSL 258
Query: 161 MG-----YRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
G S Q T I +P+ +FYY+SLK I++ + R++ P F + G GG I
Sbjct: 259 AGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVI 318
Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF---NRFPSM 270
+DSG+ +TY Y L + F + + L + LC+ P P +
Sbjct: 319 VDSGTSITYLEVQGYRALKKAFAA---QMALPAADGSGVGLDLCFRAPAKGVDQVEVPRL 375
Query: 271 AFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
F+F+ A+L + EN ++D + L V L ++IG+ QQ++ +FVYD+ D L
Sbjct: 376 VFHFDGGADLDLPAENYMVLDGGSGALCLTVMGSRGL-SIIGNFQQQNFQFVYDVGHDTL 434
Query: 330 SFVKENC 336
SF C
Sbjct: 435 SFAPVQC 441
>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
Length = 436
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 112/356 (31%), Positives = 170/356 (47%), Gaps = 40/356 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ L IGTP++ I+DTGS LI+ IFDP KSSSF K+ C C
Sbjct: 98 LMNLAIGTPAETYSAIMDTGSDLIWTQCKPCKVCFDQPTPIFDPEKSSSFSKLPCSSDLC 157
Query: 47 TYF--KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
++ C Y Y D S T+G A ET + G A FGC DN G
Sbjct: 158 VALPISSCSDGCEYRYSYGDHSSTQGVLATETFTF-----GDASVSKIGFGCGEDNRGRA 212
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
AG++GL R +S ISQLG +FSYCL + + + S+ L G++ +
Sbjct: 213 YSQG----AGLVGLGRGPLSLISQLG---VPKFSYCLT-SIDDSKGISTLL-VGSEATVK 263
Query: 165 RPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
S T I +P+ +FYYLSL+ IS+ + + TF I G GG IIDSG+ +TY
Sbjct: 264 --SAIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTITY 321
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR--FPSMAFYFEDANLR 280
+ L ++F+S + +L + ++LC+ LP + P + F+FE +L+
Sbjct: 322 LKDSAFAALKKEFIS---QMKLDVDASGSTELELCFTLPPDGSPVDVPQLVFHFEGVDLK 378
Query: 281 IDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ EN I D L + + ++ G+ QQ++ ++DL + +SF C
Sbjct: 379 LPKENYIIEDSALRVICLTMGSSSGM-SIFGNFQQQNIVVLHDLEKETISFAPAQC 433
>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 439
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 110/362 (30%), Positives = 175/362 (48%), Gaps = 38/362 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+++ +GTP+ +L I DTGS LI+ +FDP+ SS+++ I+C C
Sbjct: 93 LMKFSLGTPAFDILAIADTGSDLIWTQCKPCDQCYEQDAPLFDPKSSSTYRDISCSTKQC 152
Query: 47 TYFK----CV---NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
K C N+ C Y+ Y D+S T G A +TI++ + A+ GC ++
Sbjct: 153 DLLKEGASCSGEGNKTCHYSYSYGDRSFTSGNVAADTITLGSTSGRPVLLPKAIIGCGHN 212
Query: 100 NHG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
N G F E IS ISQLGS I +FSYCLV PL + SS L FG
Sbjct: 213 NGGSFTEKGSGIVGL-----GGGPISLISQLGSTIDGKFSYCLV-PLSSNATNSSKLNFG 266
Query: 159 TDMGYRRPSTQATKFINH-PNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
++ Q+T I+ P+ FY+L+L+ +S+ +ER+ FP +F + EG IIDSG
Sbjct: 267 SNGIVSGGGVQSTPLISKDPDTFYFLTLEAVSVGSERIKFPGSSFGTS---EGNIIIDSG 323
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDA 277
+ LT F D + +L + + + D + LCY + +FPS+ +F+ A
Sbjct: 324 TTLTLFPEDFFSELSS---AVQDAVAGTPVEDPSGILSLCYSIDADL-KFPSITAHFDGA 379
Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+++++ N F + + A P + A+ G+ Q + YDL +SF +C+
Sbjct: 380 DVKLNPLNTF-VQVSDTVLCFAFNPINS-GAIFGNLAQMNFLVGYDLEGKTVSFKPTDCT 437
Query: 338 DD 339
D
Sbjct: 438 QD 439
>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
Length = 423
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 112/367 (30%), Positives = 175/367 (47%), Gaps = 52/367 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ + IGTP+ I+DTGS L++ +FDP SS++ + C C
Sbjct: 75 LMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASC 134
Query: 47 TYF---KCVN-EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC--SNDN 100
+ KC + +C YT Y D S T+G A ET ++ K+ G +FGC +N+
Sbjct: 135 SDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTL-----AKSKLPGVVFGCGDTNEG 189
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
GF + A G++GL R +S +SQLG +FSYCL + +S L G+
Sbjct: 190 DGFSQGA------GLVGLGRGPLSLVSQLG---LDKFSYCLT---SLDDTNNSPLLLGSL 237
Query: 161 MG-----YRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
G S Q T I +P+ +FYY+SLK I++ + R++ P F + G GG I
Sbjct: 238 AGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVI 297
Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF---NRFPSM 270
+DSG+ +TY Y L + F + + L + LC+ P P +
Sbjct: 298 VDSGTSITYLEVQGYRALKKAFAA---QMALPAADGSGVGLDLCFRAPAKGVDQVEVPRL 354
Query: 271 AFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
F+F+ A+L + EN ++D + L V L ++IG+ QQ++ +FVYD+ D L
Sbjct: 355 VFHFDGGADLDLPAENYMVLDGGSGALCLTVMGSRGL-SIIGNFQQQNFQFVYDVGHDTL 413
Query: 330 SFVKENC 336
SF C
Sbjct: 414 SFAPVQC 420
>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
Length = 516
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 112/362 (30%), Positives = 172/362 (47%), Gaps = 52/362 (14%)
Query: 6 IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYF-- 49
IGTP+ I+DTGS L++ +FDP SS++ + C C+
Sbjct: 173 IGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDLPT 232
Query: 50 -KCVN-EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC--SNDNHGFDE 105
KC + +C YT Y D S T+G A ET ++ K+ G +FGC +N+ GF +
Sbjct: 233 SKCTSASKCGYTYTYGDSSSTQGVLATETFTLA-----KSKLPGVVFGCGDTNEGDGFSQ 287
Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG--- 162
A G++GL R +S +SQLG +FSYCL + +S L G+ G
Sbjct: 288 GA------GLVGLGRGPLSLVSQLG---LDKFSYCLT---SLDDTNNSPLLLGSLAGISE 335
Query: 163 --YRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
S Q T I +P+ +FYY+SLK I++ + R++ P F + G GG I+DSG+
Sbjct: 336 ASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGT 395
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF---NRFPSMAFYFE 275
+TY Y L + F + + L + LC+ P P + F+F+
Sbjct: 396 SITYLEVQGYRALKKAFAA---QMALPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFD 452
Query: 276 -DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
A+L + EN ++D + L V L ++IG+ QQ++ +FVYD+ D LSF
Sbjct: 453 GGADLDLPAENYMVLDGGSGALCLTVMGSRGL-SIIGNFQQQNFQFVYDVGHDTLSFAPV 511
Query: 335 NC 336
C
Sbjct: 512 QC 513
>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 117/358 (32%), Positives = 180/358 (50%), Gaps = 38/358 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ L IGTP +LDTGS LI+ IFDP+KSSSF K++C C
Sbjct: 109 LMELAIGTPPVSYPAVLDTGSDLIWTQCKPCTQCYKQPTPIFDPKKSSSFSKVSCGSSLC 168
Query: 47 TYF--KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
+ ++ C Y Y D S+T+G A ET + GK + K H FGC DN G
Sbjct: 169 SAVPSSTCSDGCEYVYSYGDYSMTQGVLATETFT-FGKSKNKVSVHNIGFGCGEDNEG-- 225
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
D + A +G++GL R +S +SQL + RFSYCL P + S L G+ +G
Sbjct: 226 -DGFEQA-SGLVGLGRGPLSLVSQLK---EPRFSYCLT---PMDDTKESILLLGS-LGKV 276
Query: 165 RPSTQ--ATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
+ + + T + +P +FYYLSL+ IS+ + R++ TF++ G GG IIDSG+ +
Sbjct: 277 KDAKEVVTTPLLKNPLQPSFYYLSLEGISVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTI 336
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP--ETFNRFPSMAFYFEDAN 278
TY + L ++F+S + L + S + LC+ LP T P + F+F+ +
Sbjct: 337 TYIEQKAFEALKKEFISQ-TKLPLDKTSST--GLDLCFSLPSGSTQVEIPKIVFHFKGGD 393
Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
L + EN I D LA+ + ++ G+ QQ++ +DL + +SFV +C
Sbjct: 394 LELPAENYMIGDSNLGVACLAMGASSGM-SIFGNVQQQNILVNHDLEKETISFVPTSC 450
>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
Length = 398
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 106/363 (29%), Positives = 167/363 (46%), Gaps = 36/363 (9%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
+ + +GTP+K +I DTGS LI+ IFDP SSS+ ++C C
Sbjct: 41 VTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKDPIFDPEGSSSYTTMSCGDTLC 100
Query: 47 TYF--KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
K + C Y+ Y D S T+G + ET+++ K FGC + N G
Sbjct: 101 DSLPRKSCSPNCDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAAKNIAFGCGHLNRGSF 160
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
DA +G++GL R +SF+SQLG + +FSYCLV P + +S + FG +
Sbjct: 161 NDA-----SGLVGLGRGNLSFVSQLGDLFGHKFSYCLV-PWRDAPSKTSPMFFGDESSSH 214
Query: 165 RPSTQA----TKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
+ T I++P +FYY+ LKDISI + P +FDI G GG I DSG+
Sbjct: 215 SSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDIKPDGSGGMIFDSGT 274
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET----FNRFPSMAFYF 274
LT Y + S + ++ + LCY + + + P+M F+F
Sbjct: 275 TLTLLPDAPYQIVLRALRS---KVSFPEIDGSSAGLDLCYDVSGSKASYKKKIPAMVFHF 331
Query: 275 EDANLRIDGENVFIIDYE-NHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
E A+ ++ EN FI + LA+ + + + G+ Q++ R +YD+ + +
Sbjct: 332 EGADHQLPVENYFIAANDAGTIVCLAMVSSNMDIGIYGNMMQQNFRVMYDIGSSKIGWAP 391
Query: 334 ENC 336
C
Sbjct: 392 SQC 394
>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
Length = 398
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 106/363 (29%), Positives = 167/363 (46%), Gaps = 36/363 (9%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
+ + +GTP+K +I DTGS LI+ IFDP SSS+ ++C C
Sbjct: 41 VTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKDPIFDPEGSSSYTTMSCGDTLC 100
Query: 47 TYF--KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
K + C Y+ Y D S T+G + ET+++ K FGC + N G
Sbjct: 101 DSLPRKSCSPDCDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAAKNIAFGCGHLNRGSF 160
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
DA +G++GL R +SF+SQLG + +FSYCLV P + +S + FG +
Sbjct: 161 NDA-----SGLVGLGRGNLSFVSQLGDLFGHKFSYCLV-PWRDAPSKTSPMFFGDESSSH 214
Query: 165 RPSTQA----TKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
+ T I++P +FYY+ LKDISI + P +FDI G GG I DSG+
Sbjct: 215 SSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDIKPDGSGGMIFDSGT 274
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN----RFPSMAFYF 274
LT Y + S + ++ + LCY + + + P+M F+F
Sbjct: 275 TLTLLPDAPYQIVLRALRS---KISFPKIDGSSAGLDLCYDVSGSKASYKMKIPAMVFHF 331
Query: 275 EDANLRIDGENVFIIDYE-NHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
E A+ ++ EN FI + LA+ + + + G+ Q++ R +YD+ + +
Sbjct: 332 EGADYQLPVENYFIAANDAGTIVCLAMVSSNMDIGIYGNMMQQNFRVMYDIGSSKIGWAP 391
Query: 334 ENC 336
C
Sbjct: 392 SQC 394
>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
Length = 448
Score = 144 bits (362), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 109/364 (29%), Positives = 165/364 (45%), Gaps = 39/364 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ L IGTP ++DTGS LI+ F P +S++++ + C P C
Sbjct: 93 LMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCADQPTPYFRPARSATYRLVPCRSPLC 152
Query: 47 T---YFKCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
Y C CVY Y D++ T G A ET + K + FGC N N G
Sbjct: 153 AALPYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVSDVAFGCGNINSG 212
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL---VIPLPNGEYTSSYLKF-G 158
++ +G++GL R +S +SQLG RFSYCL + P P+ + G
Sbjct: 213 QLANS-----SGMVGLGRGPLSLVSQLG---PSRFSYCLTSFLSPEPSRLNFGVFATLNG 264
Query: 159 TDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
T+ Q+T + + + Y++SLK IS+ +R+ P F I G GG IDS
Sbjct: 265 TNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGVFIDS 324
Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN---RFPSMAFY 273
G+ LT+ D Y + + VS L +D ++ C+ P + P M +
Sbjct: 325 GTSLTWLQQDAYDAVRRELVSVLR--PLPPTNDTEIGLETCFPWPPPPSVAVTVPDMELH 382
Query: 274 FE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
F+ AN+ + EN +ID F LA+ D +IG+ QQ++ +YD+ LLSFV
Sbjct: 383 FDGGANMTVPPENYMLIDGATGFLCLAMIRSGD-ATIIGNYQQQNMHILYDIANSLLSFV 441
Query: 333 KENC 336
C
Sbjct: 442 PAPC 445
>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
Length = 448
Score = 143 bits (361), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 109/364 (29%), Positives = 165/364 (45%), Gaps = 39/364 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ L IGTP ++DTGS LI+ F P +S++++ + C P C
Sbjct: 93 LMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCADQPTPYFRPARSATYRLVPCRSPLC 152
Query: 47 T---YFKCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
Y C CVY Y D++ T G A ET + K + FGC N N G
Sbjct: 153 AALPYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVSDVAFGCGNINSG 212
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL---VIPLPNGEYTSSYLKF-G 158
++ +G++GL R +S +SQLG RFSYCL + P P+ + G
Sbjct: 213 QLANS-----SGMVGLGRGPLSLVSQLG---PSRFSYCLTSFLSPEPSRLNFGVFATLNG 264
Query: 159 TDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
T+ Q+T + + + Y++SLK IS+ +R+ P F I G GG IDS
Sbjct: 265 TNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGVFIDS 324
Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN---RFPSMAFY 273
G+ LT+ D Y + + VS L +D ++ C+ P + P M +
Sbjct: 325 GTSLTWLQQDAYDAVRHELVSVLR--PLPPTNDTEIGLETCFPWPPPPSVAVTVPDMELH 382
Query: 274 FE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
F+ AN+ + EN +ID F LA+ D +IG+ QQ++ +YD+ LLSFV
Sbjct: 383 FDGGANMTVPPENYMLIDGATGFLCLAMIRSGD-ATIIGNYQQQNMHILYDIANSLLSFV 441
Query: 333 KENC 336
C
Sbjct: 442 PAPC 445
>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 468
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 109/371 (29%), Positives = 173/371 (46%), Gaps = 57/371 (15%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ + IGTP+ I+DTGS L++ +FDP SS++ + C C
Sbjct: 119 LMDMSIGTPALAYAAIVDTGSDLVWTQCKPCVECFNQSTPVFDPSSSSTYSTLPCSSSLC 178
Query: 47 TYF---KCVN--EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC--SND 99
+ C + + C YT Y D S T+G A ET ++ K G FGC +N+
Sbjct: 179 SDLPTSTCTSAAKDCGYTYTYGDASSTQGVLAAETFTL-----AKTKLPGVAFGCGDTNE 233
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVI-------PLPNGEYTS 152
GF + A G++GL R +S +SQLG +FSYCL PL G
Sbjct: 234 GDGFTQGA------GLVGLGRGPLSLVSQLG---LGKFSYCLTSLDDTSKSPLLLG---- 280
Query: 153 SYLKFGTDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
S TD + Q T I +P+ +FYY++LK +++ + R+ P F + G G
Sbjct: 281 SLAAISTDTA-SAAAIQTTPLIKNPSQPSFYYVTLKALTVGSTRIPLPGSAFAVQDDGTG 339
Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF---NRF 267
G I+DSG+ +TY Y L + F + + +L + LC+ P +
Sbjct: 340 GVIVDSGTSITYLELQGYRPLKKAFAA---QMKLPVADGSAVGLDLCFKAPASGVDDVEV 396
Query: 268 PSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNI 326
P + +F+ A+L + EN ++D + L V L ++IG+ QQ++ +FVYD++
Sbjct: 397 PKLVLHFDGGADLDLPAENYMVLDSASGALCLTVMGSRGL-SIIGNFQQQNIQFVYDVDK 455
Query: 327 DLLSFVKENCS 337
D LSF C+
Sbjct: 456 DTLSFAPVQCA 466
>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 142 bits (357), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 104/366 (28%), Positives = 171/366 (46%), Gaps = 50/366 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ + IGTP+ I+DTGS L++ +FDP SS++ + C C
Sbjct: 103 LMDMSIGTPAVAYAAIIDTGSDLVWTQCKPCVECFNQSTPVFDPSSSSTYAALPCSSTLC 162
Query: 47 TYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC--SNDNH 101
+ KC + +C YT Y D S T+G A ET ++ K FGC +N+
Sbjct: 163 SDLPSSKCTSAKCGYTYTYGDSSSTQGVLAAETFTL-----AKTKLPDVAFGCGDTNEGD 217
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT-- 159
GF + A G++GL R +S +SQLG +FSYCL + + S L G+
Sbjct: 218 GFTQGA------GLVGLGRGPLSLVSQLG---LNKFSYCLT---SLDDTSKSPLLLGSLA 265
Query: 160 ---DMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
+ S Q T I +P+ +FYY++LK +++ + + P F + G GG I+
Sbjct: 266 TISESAAAASSVQTTPLIRNPSQPSFYYVNLKGLTVGSTHITLPSSAFAVQDDGTGGVIV 325
Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF---NRFPSMA 271
DSG+ +TY Y L + F + + +L + C+ P + P +
Sbjct: 326 DSGTSITYLELQGYRALKKAFAA---QMKLPAADGSGIGLDTCFEAPASGVDQVEVPKLV 382
Query: 272 FYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSF 331
F+ + A+L + EN ++D + L V L ++IG+ QQ++ +FVYD+ + LSF
Sbjct: 383 FHLDGADLDLPAENYMVLDSGSGALCLTVMGSRGL-SIIGNFQQQNIQFVYDVGENTLSF 441
Query: 332 VKENCS 337
C+
Sbjct: 442 APVQCA 447
>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
Length = 336
Score = 140 bits (353), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 102/345 (29%), Positives = 154/345 (44%), Gaps = 37/345 (10%)
Query: 17 LDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYFK---CVNEQCVYT 59
+DTGS LI+ FD +KS++++ + C C C + CVY
Sbjct: 1 MDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRCASLSSPSCFKKMCVYQ 60
Query: 60 MKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLS 119
Y D + T G A+ET + K FGC + N G ++ +G++G
Sbjct: 61 YYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLNAGDLANS-----SGMVGFG 115
Query: 120 RVTISFISQLGSIIKKRFSYCLVIPL---PNGEYTSSYLKFGTDMGYRRPSTQATKFINH 176
R +S +SQLG RFSYCL L P+ Y Y + Q+T F+ +
Sbjct: 116 RGPLSLVSQLG---PSRFSYCLTSYLSATPSRLYFGVYANLSSTNTSSGSPVQSTPFVIN 172
Query: 177 PN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEK 234
P N Y+LSLK IS+ + + P F I G GG IIDSG+ +T+ D Y +
Sbjct: 173 PALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRG 232
Query: 235 FVSYFERFQLAQLSDCPEPIQLCYFLPETFN---RFPSMAFYFEDANLRIDGENVFIIDY 291
VS L ++D + C+ P N P + F+F+ AN+ + EN +I
Sbjct: 233 LVS---AIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDLVFHFDSANMTLLPENYMLIAS 289
Query: 292 ENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ L +AP + +IG+ QQ++ +YD+ LSFV C
Sbjct: 290 TTGYLCLVMAP-TGVGTIIGNYQQQNLHLLYDIGNSFLSFVPAPC 333
>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
Length = 408
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 110/359 (30%), Positives = 171/359 (47%), Gaps = 38/359 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+V +G P L+ +DTGS L++ IFDP KSS++ ++ D P C
Sbjct: 60 LVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDLSYDSPIC 119
Query: 47 -----TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
+ +N QC+Y YAD S + G A E I +G +FGC + N
Sbjct: 120 PNSPQKKYNHLN-QCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCGHSNR 178
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
G DG +G+LGLS S +S+LGS RFSYC + L + YT + L G +
Sbjct: 179 G----RFDGQQSGILGLSAGDQSIVSRLGS----RFSYC-IGDLFDPHYTHNQLVLGDGV 229
Query: 162 GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
ST F N FYY++L+ IS+ R++ P+ F T SG+GG ++DSG+ T
Sbjct: 230 KMEGSSTPFHTF----NGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTAT 285
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY--FLPETFNRFPSMAFYF-EDAN 278
+ D + L + + R Q+ P LCY + E FP +AF+F E A+
Sbjct: 286 FLAKDGFDPLSNE-IQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHFAEGAD 344
Query: 279 LRIDGENVFIIDYENHFFLLAVAPH-DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
L +D ++F+ ++ F L + + ++ ++IG Q+ YDL + F + +C
Sbjct: 345 LVLDANSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGKRVYFQRTDC 403
>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
Length = 408
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 110/359 (30%), Positives = 171/359 (47%), Gaps = 38/359 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+V +G P L+ +DTGS L++ IFDP KSS++ ++ D P C
Sbjct: 60 LVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDLSYDSPIC 119
Query: 47 -----TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
+ +N QC+Y YAD S + G A E I +G +FGC + N
Sbjct: 120 PNSPQKKYNHLN-QCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCGHSNR 178
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
G DG +G+LGLS S +S+LGS RFSYC + L + YT + L G +
Sbjct: 179 G----RFDGQQSGILGLSAGDQSIVSRLGS----RFSYC-IGDLFDPHYTHNQLVLGDGV 229
Query: 162 GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
ST F N FYY++L+ IS+ R++ P+ F T SG+GG ++DSG+ T
Sbjct: 230 KMEGSSTPFHTF----NGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTAT 285
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY--FLPETFNRFPSMAFYF-EDAN 278
+ D + L + + R Q+ P LCY + E FP +AF+F E A+
Sbjct: 286 FLAKDGFDPLSNE-IQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHFAEGAD 344
Query: 279 LRIDGENVFIIDYENHFFLLAVAPH-DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
L +D ++F+ ++ F L + + ++ ++IG Q+ YDL + F + +C
Sbjct: 345 LVLDANSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGKRVYFQRTDC 403
>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 107/351 (30%), Positives = 163/351 (46%), Gaps = 30/351 (8%)
Query: 6 IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYFK- 50
+GTP + I DTGS +++ IF+P KSSS++ I C C +
Sbjct: 93 VGTPPTKIYGIADTGSDIVWLQCEPCEQCYNQTTPIFNPSKSSSYKNIPCSSKLCHSVRD 152
Query: 51 --CVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDA 107
C ++ C Y + Y D S ++G + +T+S+ F + GC DN G
Sbjct: 153 TSCSDQNSCQYKISYGDSSHSQGDLSVDTLSLESTSGSPVSFPKIVIGCGTDNAG----T 208
Query: 108 RDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY-TSSYLKFGTDMGYRRP 166
GA +G++GL +S I+QLGS I +FSYCLV PL N E SS L FG
Sbjct: 209 FGGASSGIVGLGGGPVSLITQLGSSIGGKFSYCLV-PLLNKESNASSILSFGDAAVVSGD 267
Query: 167 STQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSD 226
+T I FY+L+L+ S+ N+R+ F + EG IIDSG+ LT SD
Sbjct: 268 GVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGGSSEG--GDDEGNIIIDSGTTLTLIPSD 325
Query: 227 VYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDGENV 286
VY L V + +L ++ D + LCY L FP + +F+ A++ + +
Sbjct: 326 VYTNLESAVV---DLVKLDRVDDPNQQFSLCYSLKSNEYDFPIITVHFKGADVELHSIST 382
Query: 287 FIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
F + + A P L ++ G+ Q++ YDL +SF +C+
Sbjct: 383 F-VPITDGIVCFAFQPSPQLGSIFGNLAQQNLLVGYDLQQKTVSFKPTDCT 432
>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 440
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 110/359 (30%), Positives = 171/359 (47%), Gaps = 38/359 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+V +G P L+ +DTGS L++ IFDP KSS++ ++ D P C
Sbjct: 92 LVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDLSYDSPIC 151
Query: 47 -----TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
+ +N QC+Y YAD S + G A E I +G +FGC + N
Sbjct: 152 PNSPQKKYNHLN-QCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCGHSNR 210
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
G DG +G+LGLS S +S+LGS RFSYC + L + YT + L G +
Sbjct: 211 G----RFDGQQSGILGLSAGDQSIVSRLGS----RFSYC-IGDLFDPHYTHNQLVLGDGV 261
Query: 162 GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
ST F N FYY++L+ IS+ R++ P+ F T SG+GG ++DSG+ T
Sbjct: 262 KMEGSSTPFHTF----NGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTAT 317
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY--FLPETFNRFPSMAFYF-EDAN 278
+ D + L + + R Q+ P LCY + E FP +AF+F E A+
Sbjct: 318 FLAKDGFDPLSNE-IQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHFAEGAD 376
Query: 279 LRIDGENVFIIDYENHFFLLAVAPH-DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
L +D ++F+ ++ F L + + ++ ++IG Q+ YDL + F + +C
Sbjct: 377 LVLDANSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGKRVYFQRTDC 435
>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
gi|224034427|gb|ACN36289.1| unknown [Zea mays]
Length = 443
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 106/362 (29%), Positives = 164/362 (45%), Gaps = 38/362 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ + IGTP++ ILDTGS LI+ FDP +S++++ + C P C
Sbjct: 91 LMEMGIGTPTRYYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATYRSLGCASPAC 150
Query: 47 T---YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
Y C + CVY Y D + T G A+ET + G E + G FGC N N G
Sbjct: 151 NALYYPLCYQKVCVYQYFYGDSASTAGVLANETFT-FGTNETRVSLPGISFGCGNLNAGL 209
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL---VIPLPNGEYTSSYLKFGTD 160
+ +G++G R ++S +SQLGS RFSYCL + P+P+ Y Y +
Sbjct: 210 LANG-----SGMVGFGRGSLSLVSQLGS---PRFSYCLTSFLSPVPSRLYFGVYATLNST 261
Query: 161 MGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDIT-VSGEGGCIIDSG 217
P Q+T F+ +P Y+L++ IS+ + P F I G GG IIDSG
Sbjct: 262 NASSEP-VQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDSG 320
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR---FPSMAFYF 274
+ +TY Y + F S L ++D + C+ P + P + +F
Sbjct: 321 TTITYLAEPAYDAVRAAFASQIT-LPLLNVTDA-SVLDTCFQWPPPPRQSVTLPQLVLHF 378
Query: 275 EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
+ A+ + +N ++D L ++IGS Q ++ +YDL L+SFV
Sbjct: 379 DGADWELPLQNYMLVDPSTGGGLCLAMASSSDGSIIGSYQHQNFNVLYDLENSLMSFVPA 438
Query: 335 NC 336
C
Sbjct: 439 PC 440
>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 440
Score = 138 bits (347), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 107/369 (28%), Positives = 168/369 (45%), Gaps = 51/369 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++R +IGTP I DTGS LI+ +FDPRKSS+F+ + CD C
Sbjct: 93 LMRFYIGTPPVERFAIADTGSDLIWVQCAPCEKCVPQNAPLFDPRKSSTFKTVPCDSQPC 152
Query: 47 TYF-----KCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS-N 98
T CV + QC Y Y D ++ G E+I+ G F FGC+ +
Sbjct: 153 TLLPPSQRACVGKSGQCYYQYIYGDHTLVSGILGFESIN-FGSKNNAIKFPKLTFGCTFS 211
Query: 99 DNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
+N DE R+ G++GL +S ISQLG I ++FSYC P ++S ++FG
Sbjct: 212 NNDTVDESKRN---MGLVGLGVGPLSLISQLGYQIGRKFSYCFP---PLSSNSTSKMRFG 265
Query: 159 TDMGYRRPSTQATKFINHP-------NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGG 211
D + Q ++ P ++YYL+L+ +SI N+++ D G
Sbjct: 266 NDAIVK----QIKGVVSTPLIIKSIGPSYYYLNLEGVSIGNKKVKTSESQTD------GN 315
Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYF-ERFQLAQLSDCPEPIQLCYFLPETFNRFPSM 270
+IDSG+ T Y KFV+ E + + + P C+ RFP +
Sbjct: 316 ILIDSGTSFTILKQSFY----NKFVALVKEVYGVEAVKIPPLVYNFCFENKGKRKRFPDV 371
Query: 271 AFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
F F A +R+D N+F + N ++A+ D+ ++ G+ Q + YDL ++S
Sbjct: 372 VFLFTGAKVRVDASNLFEAEDNNLLCMVALPTSDEDDSIFGNHAQIGYQVEYDLQGGMVS 431
Query: 331 FVKENCSDD 339
F +C+ D
Sbjct: 432 FAPADCAKD 440
>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
Length = 460
Score = 138 bits (347), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 106/358 (29%), Positives = 174/358 (48%), Gaps = 38/358 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++++ IGTPS ILDTGS L + I+DP +SS++ K+ C C
Sbjct: 116 LMKMAIGTPSLSFSAILDTGSDLTWTQCKPCTDCYPQPTPIYDPSQSSTYSKVPCSSSMC 175
Query: 47 T---YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
+ C C Y Y DQS T+G ++E+ ++ + ++ H A FGC +N
Sbjct: 176 QALPMYSCSGANCEYLYSYGDQSSTQGILSYESFTLTSQ----SLPHIA-FGCGQEN--- 227
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
E G++G R +S ISQLG + +FSYCLV + + +S L G
Sbjct: 228 -EGGGFSQGGGLVGFGRGPLSLISQLGQSLGNKFSYCLV-SITDSPSKTSPLFIGKTASL 285
Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
+ +T + + FYYLSL+ IS+ + ++ TFD+ + G GG IIDSG+ +T
Sbjct: 286 NAKTVSSTPLVQSRSRPTFYYLSLEGISVGGQLLDIADGTFDLQLDGTGGVIIDSGTTVT 345
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPE---TFNRFPSMAFYFEDAN 278
Y Y + + +S L Q+ + LC F P+ + + FP++ F+FE A+
Sbjct: 346 YLEQSGYDVVKKAVIS---SINLPQVDGSNIGLDLC-FEPQSGSSTSHFPTITFHFEGAD 401
Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ EN D + LA+ P + + ++ G+ QQ++ + +YD ++LSF C
Sbjct: 402 FNLPKENYIYTD-SSGIACLAMLPSNGM-SIFGNIQQQNYQILYDNERNVLSFAPTVC 457
>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 137 bits (346), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 100/356 (28%), Positives = 168/356 (47%), Gaps = 32/356 (8%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ L +GTP ++ I DTGS LI+ +FDP+ S +++ +CD C
Sbjct: 96 LMSLSLGTPPFKIMGIADTGSDLIWTQCKPCERCYKQVDPLFDPKSSKTYRDFSCDARQC 155
Query: 47 TYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
+ C C Y Y D+S T G A +TI++ F + GC ++N G
Sbjct: 156 SLLDQSTCSGNICQYQYSYGDRSYTMGNVASDTITLDSTTGSPVSFPKTVIGCGHENDGT 215
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
D +G++GL +S ISQ+GS + +FSYCLV PL + SS L FG++
Sbjct: 216 FSDKG----SGIVGLGAGPLSLISQMGSSVGGKFSYCLV-PLSSRAGNSSKLNFGSNAVV 270
Query: 164 RRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
P Q+T ++ ++FY+L+L+ +S+ NER+ F + +GEG IIDSG+ LT
Sbjct: 271 SGPGVQSTPLLSSETMSSFYFLTLEAMSVGNERIKFGDSSLG---TGEGNIIIDSGTTLT 327
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRI 281
D + L + E + D + +CY + P++ +F A++++
Sbjct: 328 IVPDDFFSNLSTAVGNQVEG---RRAEDPSGFLSVCYSATSDL-KVPAITAHFTGADVKL 383
Query: 282 DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
N F + + LA A +++ G+ Q + Y++ LSF +C+
Sbjct: 384 KPINTF-VQVSDDVVCLAFASTTSGISIYGNVAQMNFLVEYNIQGKSLSFKPTDCT 438
>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 137 bits (345), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 105/358 (29%), Positives = 160/358 (44%), Gaps = 40/358 (11%)
Query: 6 IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYF-- 49
IG+P + ++DTGS LI+ F+P KS+S+ + C C
Sbjct: 94 IGSPPRYFSAMIDTGSDLIWTQCAPCLLCVEQPTPYFEPAKSTSYASLPCSSAMCNALYS 153
Query: 50 -KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDAR 108
C CVY Y D + + G A+ET + G + FGC N N G +
Sbjct: 154 PLCFQNACVYQAFYGDSASSAGVLANETFT-FGTNSTRVAVPRVSFGCGNMNAGTLFNG- 211
Query: 109 DGALAGVLGLSRVTISFISQLGSIIKKRFSYCL---VIPLPNGEYTSSYLKFGTDMGYRR 165
+G++G R +S +SQLGS RFSYCL + P + Y +Y +
Sbjct: 212 ----SGMVGFGRGALSLVSQLGS---PRFSYCLTSFMSPATSRLYFGAYATLNSTNTSSS 264
Query: 166 PSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDIT-VSGEGGCIIDSGSVLTY 222
Q+T FI +P Y+L++ IS+ + + P F I G GG IIDSG+ +T+
Sbjct: 265 GPVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTGGVIIDSGTTVTF 324
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCP-EPIQLCYFLPETFNR---FPSMAFYFEDAN 278
Y + FV++ L + + P + C+ P R P M +F+ A+
Sbjct: 325 LAQPAYAMVQGAFVAWV---GLPRANATPSDTFDTCFKWPPPPRRMVTLPEMVLHFDGAD 381
Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ + EN ++D LA+ P DD ++IGS Q ++ +YDL LLSFV C
Sbjct: 382 MELPLENYMVMDGGTGNLCLAMLPSDD-GSIIGSFQHQNFHMLYDLENSLLSFVPAPC 438
>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
Length = 438
Score = 137 bits (345), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 105/358 (29%), Positives = 160/358 (44%), Gaps = 40/358 (11%)
Query: 6 IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYF-- 49
IG+P + ++DTGS LI+ F+P KS+S+ + C C
Sbjct: 91 IGSPPRYFSAMIDTGSDLIWTQCAPCLLCVEQPTPYFEPAKSTSYASLPCSSAMCNALYS 150
Query: 50 -KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDAR 108
C CVY Y D + + G A+ET + G + FGC N N G +
Sbjct: 151 PLCFQNACVYQAFYGDSASSAGVLANETFT-FGTNSTRVAVPRVSFGCGNMNAGTLFNG- 208
Query: 109 DGALAGVLGLSRVTISFISQLGSIIKKRFSYCL---VIPLPNGEYTSSYLKFGTDMGYRR 165
+G++G R +S +SQLGS RFSYCL + P + Y +Y +
Sbjct: 209 ----SGMVGFGRGALSLVSQLGS---PRFSYCLTSFMSPATSRLYFGAYATLNSTNTSSS 261
Query: 166 PSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDIT-VSGEGGCIIDSGSVLTY 222
Q+T FI +P Y+L++ IS+ + + P F I G GG IIDSG+ +T+
Sbjct: 262 GPVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTGGVIIDSGTTVTF 321
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCP-EPIQLCYFLPETFNR---FPSMAFYFEDAN 278
Y + FV++ L + + P + C+ P R P M +F+ A+
Sbjct: 322 LAQPAYAMVQGAFVAWV---GLPRANATPSDTFDTCFKWPPPPRRMVTLPEMVLHFDGAD 378
Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ + EN ++D LA+ P DD ++IGS Q ++ +YDL LLSFV C
Sbjct: 379 MELPLENYMVMDGGTGNLCLAMLPSDD-GSIIGSFQHQNFHMLYDLENSLLSFVPAPC 435
>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
Length = 443
Score = 137 bits (345), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 106/362 (29%), Positives = 164/362 (45%), Gaps = 38/362 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ + IGTP++ ILDTGS LI+ FDP +S++++ + C P C
Sbjct: 91 LMEMGIGTPTRYYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATYRSLGCASPAC 150
Query: 47 T---YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
Y C + CVY Y D + T G A+ET + G E + G FGC N N G
Sbjct: 151 NALYYPLCYQKVCVYQYFYGDSASTAGVLANETFT-FGTNETRVSLPGISFGCGNLNAGS 209
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL---VIPLPNGEYTSSYLKFGTD 160
+ +G++G R ++S +SQLGS RFSYCL + P+P+ Y Y +
Sbjct: 210 LANG-----SGMVGFGRGSLSLVSQLGS---PRFSYCLTSFLSPVPSRLYFGVYATLNST 261
Query: 161 MGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDIT-VSGEGGCIIDSG 217
P Q+T F+ +P Y+L++ IS+ + P F I G GG IIDSG
Sbjct: 262 NASSEP-VQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDSG 320
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR---FPSMAFYF 274
+ +TY Y + F S L ++D + C+ P + P + +F
Sbjct: 321 TTITYLAEPAYDAVRAAFASQIT-LPLLNVTDA-SVLDTCFQWPPPPRQSVTLPQLVLHF 378
Query: 275 EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
+ A+ + +N ++D L ++IGS Q ++ +YDL L+SFV
Sbjct: 379 DGADWELPLQNYMLVDPSTGGGLCLAMASSSDGSIIGSYQHQNFNVLYDLENSLMSFVPA 438
Query: 335 NC 336
C
Sbjct: 439 PC 440
>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 137 bits (344), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 107/351 (30%), Positives = 163/351 (46%), Gaps = 30/351 (8%)
Query: 6 IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYFK- 50
+GTP + I DTGS +++ IF+P KSSS++ I C C +
Sbjct: 93 VGTPPTKIYGIADTGSDIVWLQCEPCEQCYNQTTPIFNPSKSSSYKNIPCLSKLCHSVRD 152
Query: 51 --CVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDA 107
C ++ C Y + Y D S ++G + +T+S+ F + GC DN G
Sbjct: 153 TSCSDQNSCQYKISYGDSSHSQGDLSVDTLSLESTSGSPVSFPKTVIGCGTDNAG----T 208
Query: 108 RDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY-TSSYLKFGTDMGYRRP 166
GA +G++GL +S I+QLGS I +FSYCLV PL N E SS L FG
Sbjct: 209 FGGASSGIVGLGGGPVSLITQLGSSIGGKFSYCLV-PLLNKESNASSILSFGDAAVVSGD 267
Query: 167 STQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSD 226
+T I FY+L+L+ S+ N+R+ F + EG IIDSG+ LT SD
Sbjct: 268 GVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGGSSEG--GDDEGNIIIDSGTTLTLIPSD 325
Query: 227 VYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDGENV 286
VY L V + +L ++ D + LCY L FP + +F+ A++ + +
Sbjct: 326 VYTNLESAVV---DLVKLDRVDDPNQQFSLCYSLKSNEYDFPIITAHFKGADIELHSIST 382
Query: 287 FIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
F + + A P L ++ G+ Q++ YDL +SF +C+
Sbjct: 383 F-VPITDGIVCFAFQPSPQLGSIFGNLAQQNLLVGYDLQQKTVSFKPTDCT 432
>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 496
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 111/362 (30%), Positives = 159/362 (43%), Gaps = 54/362 (14%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
+R+ IG PSK +++DTGS + + IFDP SSSF ++ C P C
Sbjct: 162 LRVGIGRPSKTFYMVIDTGSDVNWLQCKPCDDCYQQVDPIFDPASSSSFSRLGCQTPQCR 221
Query: 48 ---YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
F C N+ C+Y + Y D S T G A ET+S G + GC +DN G
Sbjct: 222 NLDVFACRNDSCLYQVSYGDGSYTVGDFATETVSFGNSGSVDKV----AIGCGHDNEGL- 276
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
G GL + +S I FSYCLV SS L+F +
Sbjct: 277 -------FVGAAGLIGLGGGPLSLTSQIKASSFSYCLV---NRDSVDSSTLEFNS----A 322
Query: 165 RPSTQATKFI---NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
+PS T I + + FYY+ + +S+ E++ PP F++ SG+GG I+D G+ +T
Sbjct: 323 KPSDSVTAPIFKNSKVDTFYYVGITGMSVGGEKLAIPPSIFEVDGSGKGGIIVDCGTAVT 382
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP-----IQLCYFL-PETFNRFPSMAFYFE 275
+ Y L + FV + D P CY L T R P++AF F+
Sbjct: 383 RLQTQAYNALRDTFVKLTK--------DLPSTSGFALFDTCYNLSSRTSVRVPTVAFLFD 434
Query: 276 DA-NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
+L + N I F LA AP +++IG+ QQ+ TR YDL +SF
Sbjct: 435 GGKSLPLPPSNYLIPVDSAGTFCLAFAPTTASLSIIGNVQQQGTRVTYDLANSQVSFSSR 494
Query: 335 NC 336
C
Sbjct: 495 KC 496
>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 103/358 (28%), Positives = 175/358 (48%), Gaps = 33/358 (9%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+++L +GTP ++ + DTGS +I+ +F+P KS++++K++C P C
Sbjct: 86 LMKLSVGTPPFPIIAVADTGSDIIWTQCEPCTNCYQQDLPMFNPSKSTTYRKVSCSSPVC 145
Query: 47 TYFKCVNE-----QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAI-FHGALFGCSNDN 100
++ N C Y++ Y D S ++G A +T++ +G G+ + F GC +DN
Sbjct: 146 SFTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDTLT-MGSTSGRVVAFPRTAIGCGHDN 204
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
G + D ++G++GL S I Q+GS + +FSYCL P+ N + S+ L FG++
Sbjct: 205 AG----SFDANVSGIVGLGLGPASLIKQMGSAVGGKFSYCLT-PIGNDDGGSNKLNFGSN 259
Query: 161 MGYRRPSTQATK-FINHP-NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
+T +I+ +FY L LK +S+ R N T + + G+ IIDSG+
Sbjct: 260 ANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVG--RNNTFYSTANSILGGKANIIIDSGT 317
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDAN 278
LT D+Y H + L + D + ++ C+ + P +A +FE AN
Sbjct: 318 TLTLLPVDLY---HNFAKAISNSINLQRTDDPNQFLEYCFETTTDDYKVPFIAMHFEGAN 374
Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
LR+ ENV I +N L D+ +++ G+ Q + YD+ LSF NC
Sbjct: 375 LRLQRENVLIRVSDNVICLAFAGAQDNDISIYGNIAQINFLVGYDVTNMSLSFKPMNC 432
>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 103/358 (28%), Positives = 175/358 (48%), Gaps = 33/358 (9%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+++L +GTP ++ + DTGS +I+ +F+P KS++++K++C P C
Sbjct: 86 LMKLSVGTPPFPIIAVADTGSDIIWTQCVPCTNCYQQDLPMFNPSKSTTYRKVSCSSPVC 145
Query: 47 TYFKCVNE-----QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAI-FHGALFGCSNDN 100
++ N C Y++ Y D S ++G A +T++ +G G+ + F GC +DN
Sbjct: 146 SFTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDTLT-MGSTSGRVVAFPRTAIGCGHDN 204
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
G + D ++G++GL S I Q+GS + +FSYCL P+ N + S+ L FG++
Sbjct: 205 AG----SFDANVSGIVGLGLGPASLIKQMGSAVGGKFSYCLT-PIGNDDGGSNKLNFGSN 259
Query: 161 MGYRRPSTQATK-FINHP-NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
+T +I+ +FY L LK +S+ R N T + + G+ IIDSG+
Sbjct: 260 ANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVG--RNNTFYSTANSILGGKANIIIDSGT 317
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDAN 278
LT D+Y H + L + D + ++ C+ + P +A +FE AN
Sbjct: 318 TLTLLPVDLY---HNFAKAISNSINLQRTDDPNQFLEYCFETTTDDYKVPFIAMHFEGAN 374
Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
LR+ ENV I +N L D+ +++ G+ Q + YD+ LSF NC
Sbjct: 375 LRLQRENVLIRVSDNVICLAFAGAQDNDISIYGNIAQINFLVGYDVTNMSLSFKPMNC 432
>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
gi|194708650|gb|ACF88409.1| unknown [Zea mays]
Length = 392
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 103/377 (27%), Positives = 160/377 (42%), Gaps = 59/377 (15%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDH-- 43
++ L IGTP I DTGS LI+ +++P S++F + C+
Sbjct: 33 LMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPCNSSL 92
Query: 44 ---------------PDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAI 88
P C C Y + Y T F ET + G A
Sbjct: 93 SVCAAALAGTGTAPPPGC--------ACTYNVTYGS-GWTSVFQGSETFTFGSTPAGHAR 143
Query: 89 FHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNG 148
G FGCS + GF+ + +G++GL R +S +SQLG +FSYCL P +
Sbjct: 144 VPGIAFGCSTASSGFNASSA----SGLVGLGRGRLSLVSQLG---VPKFSYCLT-PYQDT 195
Query: 149 EYTSSYLKFGTDMGYRRPSTQATKFINHP-----NNFYYLSLKDISIDNERMNFPPDTFD 203
TS+ L + +T F+ P N FYYL+L IS+ ++ PPD F
Sbjct: 196 NSTSTLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFS 255
Query: 204 ITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET 263
+ G GG IIDSG+ +T + Y ++ VS +D + LC+ LP +
Sbjct: 256 LNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDGSAD--TGLDLCFMLPSS 313
Query: 264 FN---RFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRF 320
+ PSM +F A++ + ++ + D + L D V ++G+ QQ++
Sbjct: 314 TSAPPAMPSMTLHFNGADMVLPADSYMMSDDSGLWCLAMQNQTDGEVNILGNYQQQNMHI 373
Query: 321 VYDLNIDLLSFVKENCS 337
+YD+ + LSF CS
Sbjct: 374 LYDIGQETLSFAPAKCS 390
>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
Length = 453
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 113/368 (30%), Positives = 167/368 (45%), Gaps = 51/368 (13%)
Query: 4 LFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT-- 47
L +GTP + + +LDTGS LI+ +F PR SSS++ + C C
Sbjct: 102 LAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMSSSYEPMRCAGQLCGDI 161
Query: 48 -YFKCVN-EQCVYTMKYADQSVTKGFAAHETIS-VIGKGEGKAIFHGALFGCSNDNHGFD 104
+ CV + C Y Y D + T G+ A E + GE +++ G FGC N G
Sbjct: 162 LHHSCVRPDTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVPLG--FGCGTMNVGSL 219
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT--DMG 162
+A +G++G R +S +SQL SI +RFSYCL P S L+FG+ D+G
Sbjct: 220 NNA-----SGIVGFGRDPLSLVSQL-SI--RRFSYCLT---PYASSRKSTLQFGSLADVG 268
Query: 163 YRRPST---QATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
+T Q T + N FYY++ +++ R+ P F + G GG IIDSG
Sbjct: 269 LYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGARRLRIPASAFALRPDGSGGVIIDSG 328
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR---------FP 268
+ LT F + V ++ F S R A S + + C+ P P
Sbjct: 329 TALTLFPAAVLAEVVRAFRSQL-RLPFANGSSPDDGV--CFAAPAVAAGGGRMARQVAVP 385
Query: 269 SMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDL 328
M F+F+ A+L + EN + D+ + + D A IG+ Q+D R VYDL +
Sbjct: 386 RMVFHFQGADLDLPRENYVLEDHRRGHLCVLLGDSGDDGATIGNFVQQDMRVVYDLERET 445
Query: 329 LSFVKENC 336
LSF C
Sbjct: 446 LSFAPVEC 453
>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 452
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 103/377 (27%), Positives = 160/377 (42%), Gaps = 59/377 (15%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDH-- 43
++ L IGTP I DTGS LI+ +++P S++F + C+
Sbjct: 93 LMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPCNSSL 152
Query: 44 ---------------PDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAI 88
P C C Y + Y T F ET + G A
Sbjct: 153 SVCAAALAGTGTAPPPGC--------ACTYNVTYGS-GWTSVFQGSETFTFGSTPAGHAR 203
Query: 89 FHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNG 148
G FGCS + GF+ + +G++GL R +S +SQLG +FSYCL P +
Sbjct: 204 VPGIAFGCSTASSGFNASSA----SGLVGLGRGRLSLVSQLG---VPKFSYCLT-PYQDT 255
Query: 149 EYTSSYLKFGTDMGYRRPSTQATKFINHP-----NNFYYLSLKDISIDNERMNFPPDTFD 203
TS+ L + +T F+ P N FYYL+L IS+ ++ PPD F
Sbjct: 256 NSTSTLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFS 315
Query: 204 ITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET 263
+ G GG IIDSG+ +T + Y ++ VS +D + LC+ LP +
Sbjct: 316 LNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDGSAD--TGLDLCFMLPSS 373
Query: 264 FN---RFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRF 320
+ PSM +F A++ + ++ + D + L D V ++G+ QQ++
Sbjct: 374 TSAPPAMPSMTLHFNGADMVLPADSYMMSDDSGLWCLAMQNQTDGEVNILGNYQQQNMHI 433
Query: 321 VYDLNIDLLSFVKENCS 337
+YD+ + LSF CS
Sbjct: 434 LYDIGQETLSFAPAKCS 450
>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
Length = 456
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 106/368 (28%), Positives = 165/368 (44%), Gaps = 46/368 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+V L IGTP + V +LDTGS LI+ +F P +S+S++ + C C
Sbjct: 103 VVDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLAQPDPLFAPGESASYEPMRCAGQLC 162
Query: 47 T---YFKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
+ + C + + C Y Y D ++T G A E + G + + FGC + N G
Sbjct: 163 SDILHHGCEMPDTCTYRYNYGDGTMTMGVYATERFTFTSSGGDRLMTVPLGFGCGSMNVG 222
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
+ +G++G R +S +SQL SI +RFSYCL G S L FG+ G
Sbjct: 223 SLNNG-----SGIVGFGRNPLSLVSQL-SI--RRFSYCLT---SYGSGRKSTLLFGSLSG 271
Query: 163 Y----RRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
Q T + N FYY+ L +++ R+ P F + G GG I+DS
Sbjct: 272 GVYGDATGPVQTTPLLQSLQNPTFYYVHLAGLTVGARRLRIPESAFALRPDGSGGVIVDS 331
Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR--------FP 268
G+ LT V L E ++ ++ +L + +C+ +P + R P
Sbjct: 332 GTALTLLPGAV---LAEVVRAFRQQLRLPFANGGNPEDGVCFLVPAAWRRSSSTSQVPVP 388
Query: 269 SMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDL 328
M F+F+DA+L + N + D+ L +A D + IG+ Q+D R +YDL +
Sbjct: 389 RMVFHFQDADLDLPRRNYVLDDHRKGRLCLLLADSGDDGSTIGNLVQQDMRVLYDLEAET 448
Query: 329 LSFVKENC 336
LSF C
Sbjct: 449 LSFAPAQC 456
>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
Length = 774
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 111/367 (30%), Positives = 166/367 (45%), Gaps = 40/367 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
+V L IGTP + V LILDTGS L++ DP SS+F + C P C
Sbjct: 416 LVHLAIGTPPQPVQLILDTGSDLVWTQCRPCPVCFSRALGPLDPSNSSTFDVLPCSSPVC 475
Query: 47 ---TYFKCV-----NEQCVYTMKYADQSVTKGFAAHETIS-VIGKGEGKAIFHGALFGCS 97
T+ C N+ CVY YAD S+T G ET + G G+A FGC
Sbjct: 476 DNLTWSSCGKHNWGNQTCVYVYAYADGSITTGHLDAETFTFAAADGTGQATVPDLAFGCG 535
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
N+G G+ G R +S SQL FS+C + E +S L
Sbjct: 536 LFNNGIFTSNE----TGIAGFGRGALSLPSQLKV---DNFSHCFTA-ITGSEPSSVLLGL 587
Query: 158 GTDM-GYRRPSTQATKFINHPNNF--YYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
++ + Q+T + + ++ YYLSLK I++ + R+ P TF + G GG II
Sbjct: 588 PANLYSDADGAVQSTPLVQNFSSLRAYYLSLKGITVGSTRLPIPESTFALKQDGTGGTII 647
Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-FPSMAFY 273
DSG+ +T D Y +H+ F + S + + +P P + +
Sbjct: 648 DSGTGMTTLPQDAYKLVHDAFTAQVRLPVDNATSSSLSRLCFSFSVPRRAKPDVPKLVLH 707
Query: 274 FEDANLRIDGENVFIIDYEN---HFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
FE A L + EN ++ ++E+ LA+ DDL +IG+ QQ++ +YDL ++LS
Sbjct: 708 FEGATLDLPREN-YMFEFEDAGGSVTCLAINAGDDLT-IIGNYQQQNLHVLYDLVRNMLS 765
Query: 331 FVKENCS 337
FV C+
Sbjct: 766 FVPAQCN 772
>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
Length = 437
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 106/362 (29%), Positives = 168/362 (46%), Gaps = 42/362 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++R +IGTP L DTGS LI+ +F P KSS+F C C
Sbjct: 91 LMRFYIGTPPVERLATADTGSDLIWVQCSPCASCFPQSTPLFQPLKSSTFMPTTCRSQPC 150
Query: 47 TYFKCVNE------QCVYTMKYADQ-SVTKGFAAHETISVIGKGEGKAI-FHGALFGCSN 98
T + +C+YT KY DQ S ++G + ET+ +G + + F + FGC
Sbjct: 151 TLLLPEQKGCGKSGECIYTYKYGDQYSFSEGLLSTETLRFDSQGGVQTVAFPNSFFGCGL 210
Query: 99 DNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
N+ + L G++GL +S +SQ+G I +FSYCL LP G ++S LKFG
Sbjct: 211 YNNITVFPSYK--LTGIMGLGAGPLSLVSQIGDQIGHKFSYCL---LPLGSTSTSKLKFG 265
Query: 159 TDMGYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
+ +T I P +Y+L+L+ +++ + + T S +G IIDS
Sbjct: 266 NESIITGEGVVSTPMIIKPWLPTYYFLNLEAVTVAQKTVP--------TGSTDGNVIIDS 317
Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFED 276
G++LTY Y+ S E + + D P+ C+ + F FP +AF F
Sbjct: 318 GTLLTYLGESFYYNFA---ASLQESLAVELVQDVLSPLPFCFPYRDNF-VFPEIAFQFTG 373
Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHD-DLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
A + + N+F++ + + L +AP +++ GS Q D + YDL +SF +
Sbjct: 374 ARVSLKPANLFVMTEDRNTVCLMIAPSSVSGISIFGSFSQIDFQVEYDLEGKKVSFQPTD 433
Query: 336 CS 337
CS
Sbjct: 434 CS 435
>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 560
Score = 134 bits (338), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 121/370 (32%), Positives = 163/370 (44%), Gaps = 44/370 (11%)
Query: 4 LFIGTPSKGVLLILDTGSAL-------IYAIF-------DPRKSSSFQKINCDHPDCTYF 49
+F+GTP K LILDTGS L YA F DP+ SSSF+ I C P C
Sbjct: 199 VFVGTPPKHFSLILDTGSDLNWIQCVPCYACFEQNGPYYDPKDSSSFKNITCHDPRCQLV 258
Query: 50 ---------KCVNEQCVYTMKYADQSVTKGFAAHETISV-IGKGEGKA---IFHGALFGC 96
K + C Y Y D S T G A ET +V + EGK I +FGC
Sbjct: 259 SSPDPPQPCKGETQSCPYFYWYGDSSNTTGDFALETFTVNLTTPEGKPELKIVENVMFGC 318
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
+ N G A R +SF +QL S+ FSYCLV N SS L
Sbjct: 319 GHWNRGLFHGAAGLLGL-----GRGPLSFATQLQSLYGHSFSYCLVDRNSNSS-VSSKLI 372
Query: 157 FGTDMGY-RRPSTQATKFINHPNN----FYYLSLKDISIDNERMNFPPDTFDITVSGEGG 211
FG D P+ T F+ N FYY+ +K I + E + P +T+ ++ G GG
Sbjct: 373 FGEDKELLSHPNLNFTSFVGGKENPVDTFYYVLIKSIMVGGEVLKIPEETWHLSAQGGGG 432
Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSM 270
IIDSG+ LTYF Y + E F+ + F L + P++ CY + P
Sbjct: 433 TIIDSGTTLTYFAEPAYEIIKEAFMRKIKGFPLVETF---PPLKPCYNVSGVEKMELPEF 489
Query: 271 AFYFED-ANLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDL 328
A F D A EN FI I+ E+ L + +++IG+ QQ++ +YDL
Sbjct: 490 AILFADGAMWDFPVENYFIQIEPEDVVCLAILGTPRSALSIIGNYQQQNFHILYDLKKSR 549
Query: 329 LSFVKENCSD 338
L + C+D
Sbjct: 550 LGYAPMKCAD 559
>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 439
Score = 134 bits (338), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 107/359 (29%), Positives = 168/359 (46%), Gaps = 34/359 (9%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ L+IGTP V+ I+DTGS L + +FDP+ SS+++ +C C
Sbjct: 93 LMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPLFDPKNSSTYRDSSCGTSFC 152
Query: 47 TYF----KCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
C E+ C + YAD S T G A ET++V F G FGC + +
Sbjct: 153 LALGKDRSCSKEKKCTFRYSYADGSFTGGNLASETLTVDSTAGKPVSFPGFAFGCGHSSG 212
Query: 102 G-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
G FD+ + +G++GL +S ISQL S I FSYCL +P+ SS + FG
Sbjct: 213 GIFDKSS-----SGIVGLGGGELSLISQLKSTINGLFSYCL-LPVSTDSSISSRINFGAS 266
Query: 161 MGYRRPSTQATKFINH-PNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
T +T + P+ FYYL+L+ IS+ +R+ + + V EG I+DSG+
Sbjct: 267 GRVSGYGTVSTPLVQKSPDTFYYLTLEGISVGKKRLPYKGYSKKTEVE-EGNIIVDSGTT 325
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANL 279
T+ + Y KL + S + ++ D LCY N P + +F+DAN+
Sbjct: 326 YTFLPQEFYSKLEK---SVANSIKGKRVRDPNGIFSLCYNTTAEINA-PIITAHFKDANV 381
Query: 280 RIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCSD 338
+ N F+ E+ VAP D + ++G+ Q + +DL +SF +C+
Sbjct: 382 ELQPLNTFMRMQED-LVCFTVAPTSD-IGVLGNLAQVNFLVGFDLRKKRVSFKAADCTQ 438
>gi|357119741|ref|XP_003561592.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 410
Score = 134 bits (337), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 101/355 (28%), Positives = 161/355 (45%), Gaps = 37/355 (10%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
V + G ++ +L LDTG++ + +F P S +FQ + D P CT
Sbjct: 72 VSIGTGEGTRRKVLALDTGASTSWLMCEPCQPPLPQVGHLFSPAASPTFQGVRGDGPVCT 131
Query: 48 Y-FKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAI--FHGALFGCSNDNHGFD 104
++ ++ C + +A G+ + +T + G + G +FGC++ GF
Sbjct: 132 VPYRHTDKGCSFRFPFA-----AGYLSRDTFHLRSGRSGTVMESVPGIMFGCAHSVTGFH 186
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
D G L+GVL LS +SF++ LG RFSYCL P P S+L+FG D+
Sbjct: 187 ND---GTLSGVLSLSHSPLSFLTLLGGRSSGRFSYCL--PKPTTHNPDSFLRFGADVPSL 241
Query: 165 RPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
P T ++ Y+L++ IS+ N+R++ F + GGC I+ +T
Sbjct: 242 PPHAHTTTLVHAGVPGYHLNIVGISLGNKRLHIDRHVF----AAGGGCSINPAVTITRIM 297
Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR--FPSMAFYFED-ANLRI 281
Y + V++ + ++ P LC+ + R P M+F+FED A LR
Sbjct: 298 ELAYLAVEHALVAHMKELGSGRVKGMPG-RSLCFDHMDRSVRVQLPGMSFHFEDGAELRF 356
Query: 282 DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
E +F + FL+ H V IG+ QQ DTRF +D+ L+FV E C
Sbjct: 357 AAEQLFDVRVMAACFLVVGRGHHQTV--IGAAQQVDTRFTFDIAAGRLAFVPETC 409
>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
sylvestris]
Length = 502
Score = 134 bits (337), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 104/369 (28%), Positives = 159/369 (43%), Gaps = 53/369 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
+V + +GTP K + LI DTGS L + IFDP S ++ I+C
Sbjct: 155 IVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSASKTYSNISCTSTA 214
Query: 46 CTYFK--------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
C+ K C + CVY ++Y D S T GF A +T+++ +F G +FGC
Sbjct: 215 CSGLKSATGNSPGCSSSNCVYGIQYGDSSFTVGFFAKDTLTL----TQNDVFDGFMFGCG 270
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
+N G G AG++GL R +S + Q K FSYC LP ++ +L F
Sbjct: 271 QNNRGL-----FGKTAGLIGLGRDPLSIVQQTAQKFGKYFSYC----LPTSRGSNGHLTF 321
Query: 158 GTDMGYRRPSTQATKFINHP------NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGG 211
G G + P FY++ + IS+ + ++ P F G
Sbjct: 322 GNGNGVKTSKAVKNGITFTPFASSQGATFYFIDVLGISVGGKALSISPMLFQ-----NAG 376
Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPE-TFNRFPSM 270
IIDSG+V+T S VY L F + ++ A + CY L T P +
Sbjct: 377 TIIDSGTVITRLPSTVYGSLKSTFKQFMSKYPTAPALSL---LDTCYDLSNYTSISIPKI 433
Query: 271 AFYFE-DANLRIDGENVFIIDYENHFFL-LAVAPHDDLVALIGSQQQRDTRFVYDLNIDL 328
+F F +AN+ ++ + I + + L A DD + + G+ QQ+ VYD+
Sbjct: 434 SFNFNGNANVDLEPNGILITNGASQVCLAFAGNGDDDTIGIFGNIQQQTLEVVYDVAGGQ 493
Query: 329 LSFVKENCS 337
L F + CS
Sbjct: 494 LGFGYKGCS 502
>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 134 bits (337), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 115/360 (31%), Positives = 172/360 (47%), Gaps = 45/360 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+++L IGTP + ILDTGS LI+ IFDP+KSSSF K++C C
Sbjct: 98 LMKLAIGTPPETYSAILDTGSDLIWTQCKPCTQCFHQSTPIFDPKKSSSFSKLSCSSQLC 157
Query: 47 TYF--KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG-- 102
N C Y Y D S T+G A ET++ GKA FGC DN G
Sbjct: 158 EALPQSSCNNGCEYLYSYGDYSSTQGILASETLTF-----GKASVPNVAFGCGADNEGSG 212
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
F + A G++GL R +S +SQL + +FSYCL + +S L G+
Sbjct: 213 FSQGA------GLVGLGRGPLSLVSQLK---EPKFSYCLTT---VDDTKTSTLLMGSLAS 260
Query: 163 YRRPST--QATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
S+ + T I+ P +FYYLSL+ IS+ + R+ TF + G GG IIDSG+
Sbjct: 261 VNASSSAIKTTPLIHSPAHPSFYYLSLEGISVGDTRLPIKKSTFSLQDDGSGGLIIDSGT 320
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP--ETFNRFPSMAFYFED 276
+TY + + ++F + + L S + +C+ LP T P + F+F+
Sbjct: 321 TITYLEESAFNLVAKEFTA---KINLPVDSSGSTGLDVCFTLPSGSTNIEVPKLVFHFDG 377
Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
A+L + EN I D LA+ + ++ G+ QQ++ ++DL + LSF+ C
Sbjct: 378 ADLELPAENYMIGDSSMGVACLAMGSSSGM-SIFGNVQQQNMLVLHDLEKETLSFLPTQC 436
>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
Length = 441
Score = 134 bits (336), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 100/369 (27%), Positives = 168/369 (45%), Gaps = 52/369 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
+V + IGTP + +LDTGS LI+ ++ P +S+++ ++C P
Sbjct: 93 LVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRSPM 152
Query: 46 CT-----YFKCV--NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSN 98
C + +C + C Y Y D + T G A ET ++ G A+ G FGC
Sbjct: 153 CQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTL---GSDTAV-RGVAFGCGT 208
Query: 99 DNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
+N G +++ +G++G+ R +S +SQLG RFSYC P +S L G
Sbjct: 209 ENLGSTDNS-----SGLVGMGRGPLSLVSQLG---VTRFSYCFT---PFNATAASPLFLG 257
Query: 159 TDMGYRRPSTQATKFINHPN-------NFYYLSLKDISIDNERMNFPPDTFDITVSGEGG 211
+ + + T F+ P+ ++YYLSL+ I++ + + P F +T G+GG
Sbjct: 258 SSARLSS-AAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRLTPMGDGG 316
Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL--PETFNRFPS 269
IIDSG+ T + L S R +L S + LC+ PE P
Sbjct: 317 VIIDSGTTFTALEESAFVALARALAS---RVRLPLASGAHLGLSLCFAAASPEAVE-VPR 372
Query: 270 MAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
+ +F+ A++ + E+ + D L + + +++GS QQ++T +YDL +L
Sbjct: 373 LVLHFDGADMELRRESYVVEDRSAGVACLGMVSARGM-SVLGSMQQQNTHILYDLERGIL 431
Query: 330 SFVKENCSD 338
SF C +
Sbjct: 432 SFEPAKCGE 440
>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 453
Score = 134 bits (336), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 113/368 (30%), Positives = 166/368 (45%), Gaps = 51/368 (13%)
Query: 4 LFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT-- 47
L +GTP + + +LDTGS LI+ +F PR SSS++ + C C
Sbjct: 102 LAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMSSSYEPMRCAGQLCGDI 161
Query: 48 -YFKCVN-EQCVYTMKYADQSVTKGFAAHETIS-VIGKGEGKAIFHGALFGCSNDNHGFD 104
+ CV + C Y Y D + T G+ A E + GE +++ G FGC N G
Sbjct: 162 LHHSCVRPDTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVPLG--FGCGTMNVGSL 219
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT--DMG 162
+A +G++G R +S +SQL SI +RFSYCL P S L+FG+ D+G
Sbjct: 220 NNA-----SGIVGFGRDPLSLVSQL-SI--RRFSYCLT---PYASSRKSTLQFGSLADVG 268
Query: 163 YRRPST---QATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
+T Q T + N FYY++ +++ R+ P F + G GG IIDSG
Sbjct: 269 LYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGARRLRIPASAFALRPDGSGGVIIDSG 328
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR---------FP 268
+ LT F V ++ F S R A S + + C+ P P
Sbjct: 329 TALTLFPVAVLAEVVRAFRSQL-RLPFANGSSPDDGV--CFAAPAVAAGGGRMARQVAVP 385
Query: 269 SMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDL 328
M F+F+ A+L + EN + D+ + + D A IG+ Q+D R VYDL +
Sbjct: 386 RMVFHFQGADLDLPRENYVLEDHRRGHLCVLLGDSGDDGATIGNFVQQDMRVVYDLERET 445
Query: 329 LSFVKENC 336
LSF C
Sbjct: 446 LSFAPVEC 453
>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 100/369 (27%), Positives = 168/369 (45%), Gaps = 52/369 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
+V + IGTP + +LDTGS LI+ ++ P +S+++ ++C P
Sbjct: 93 LVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRSPM 152
Query: 46 CT-----YFKCV--NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSN 98
C + +C + C Y Y D + T G A ET ++ G A+ G FGC
Sbjct: 153 CQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTL---GSDTAV-RGVAFGCGT 208
Query: 99 DNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
+N G +++ +G++G+ R +S +SQLG RFSYC P +S L G
Sbjct: 209 ENLGSTDNS-----SGLVGMGRGPLSLVSQLG---VTRFSYCFT---PFNATAASPLFLG 257
Query: 159 TDMGYRRPSTQATKFINHPN-------NFYYLSLKDISIDNERMNFPPDTFDITVSGEGG 211
+ + + T F+ P+ ++YYLSL+ I++ + + P F +T G+GG
Sbjct: 258 SSARLSS-AAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRLTPMGDGG 316
Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL--PETFNRFPS 269
IIDSG+ T + L S R +L S + LC+ PE P
Sbjct: 317 VIIDSGTTFTALEERAFVALARALAS---RVRLPLASGAHLGLSLCFAAASPEAVE-VPR 372
Query: 270 MAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
+ +F+ A++ + E+ + D L + + +++GS QQ++T +YDL +L
Sbjct: 373 LVLHFDGADMELRRESYVVEDRSAGVACLGMVSARGM-SVLGSMQQQNTHILYDLERGIL 431
Query: 330 SFVKENCSD 338
SF C +
Sbjct: 432 SFEPAKCGE 440
>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
Length = 476
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 102/364 (28%), Positives = 158/364 (43%), Gaps = 51/364 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
+V + GTP++ +I DTGS + + IFDP KS+++ + C HP
Sbjct: 136 VVTVGFGTPAQTYTVIFDTGSDVSWIQCLPCSGHCYKQHDPIFDPTKSATYSVVPCGHPQ 195
Query: 46 CTYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C KC N C+Y ++Y D S + G +HET+S+ G FGC N G
Sbjct: 196 CAAADGSKCSNGTCLYKVEYGDGSSSAGVLSHETLSLT----STRALPGFAFGCGQTNLG 251
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
G + G++GL R +S SQ + FSYC LP+ T YL G
Sbjct: 252 -----DFGDVDGLIGLGRGQLSLSSQAAASFGGTFSYC----LPSDNTTHGYLTIGPTTP 302
Query: 163 YRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
Q T + + +FY++ L I I + PP F + G +DSG++L
Sbjct: 303 ASNDDVQYTAMVQKQDYPSFYFVELVSIDIGGYILPVPPTLFT-----DDGTFLDSGTIL 357
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCY-FLPETFNRFPSMAFYFEDA 277
TY + Y L ++F +F + Q P +P CY F ++ P+++F F D
Sbjct: 358 TYLPPEAYTALRDRF-----KFTMTQYKPAPAYDPFDTCYDFTGQSAIFIPAVSFKFSDG 412
Query: 278 ---NLRIDGENVFIIDYENHFFLLA--VAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
+L G +F D L P ++G+ QQR+T +YD+ + + F
Sbjct: 413 SVFDLSFFGILIFPDDTAPAIGCLGFVARPSAMPFTIVGNMQQRNTEVIYDVAAEKIGFA 472
Query: 333 KENC 336
+C
Sbjct: 473 SASC 476
>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 355
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 103/357 (28%), Positives = 159/357 (44%), Gaps = 30/357 (8%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
+ + +GTP + +I+DTGS L + ++F P S+SF K+ C C
Sbjct: 4 LATVRLGTPERVFSVIVDTGSDLTWVQCSPCGTCYSQNDSLFIPNTSTSFTKLACGTELC 63
Query: 47 T---YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
Y C CVY Y D S++ G ++TI++ G K FGC +DN G
Sbjct: 64 NGLPYPMCNQTTCVYWYSYGDGSLSTGDFVYDTITMDGINGQKQQVPNFAFGCGHDNEGS 123
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
A G+LGL + +SF SQL ++ +FSYCLV L TS L FG
Sbjct: 124 FAGAD-----GILGLGQGPLSFPSQLKTVFNGKFSYCLVDWLAPPTQTSPLL-FGDAAVP 177
Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
P + + +P +YY+ L IS+ + +N FDI G G I DSG+ +T
Sbjct: 178 TFPGVKYISLLTNPKVPTYYYVKLNGISVGGKLLNISSTAFDIDSVGRAGTIFDSGTTVT 237
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY--FLPETFNRFPSMAFYFEDANL 279
+V+ ++ + + SD + LC F PSM F+FE ++
Sbjct: 238 QLAGEVHQEVLAAMNA--STMDYPRKSDDSSGLDLCLGGFAEGQLPTVPSMTFHFEGGDM 295
Query: 280 RIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ N FI + + ++ D V +IGS QQ++ + YD + FV ++C
Sbjct: 296 ELPPSNYFIFLESSQSYCFSMVSSPD-VTIIGSIQQQNFQVYYDTVGRKIGFVPKSC 351
>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 384
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 112/364 (30%), Positives = 161/364 (44%), Gaps = 47/364 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ L +G+P + +I+DTGS L + FDP KS SF+K C C
Sbjct: 40 LMTLTLGSPPQSFDVIVDTGSDLNWVQCLPCRVCYQQPGPKFDPSKSRSFRKAACTDNLC 99
Query: 47 TYFK-----CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
C C Y Y DQS T G A ETIS + G G FGC N
Sbjct: 100 NVSALPLKACAANVCQYQYTYGDQSNTNGDLAFETIS-LNNGAGTQSVPNFAFGCGTQNL 158
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
G A AG++GL + +S SQL +FSYCLV ++S L FG+
Sbjct: 159 GTFAGA-----AGLVGLGQGPLSLNSQLSHTFANKFSYCLV---SLNSLSASPLTFGSIA 210
Query: 162 GYRRPSTQATKFI---NHPNNFYYLSLKDISIDNERMNFPPDTFDITVS-GEGGCIIDSG 217
+ Q T + HP +YY+ L I + + +N P F I S G GG IIDSG
Sbjct: 211 A--AANIQYTSIVVNARHP-TYYYVQLNSIEVGGQPLNLAPSVFAIDQSTGRGGTIIDSG 267
Query: 218 SVLTYFHSDVY---WKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFY 273
+ +T Y + +E FV+Y +L + LC+ + N P M F
Sbjct: 268 TTITMLTLPAYSAVLRAYESFVNY------PRLDGSAYGLDLCFNIAGVSNPSVPDMVFK 321
Query: 274 FEDANLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
F+ A+ ++ GEN+F+ +D LA+ ++IG+ QQ++ VYDL + F
Sbjct: 322 FQGADFQMRGENLFVLVDTSATTLCLAMGGSQGF-SIIGNIQQQNHLVVYDLEAKKIGFA 380
Query: 333 KENC 336
+C
Sbjct: 381 TADC 384
>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 96/358 (26%), Positives = 169/358 (47%), Gaps = 32/358 (8%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+V + +GTP ++ + DTGS +I+ +FDP KS++++ + C P C
Sbjct: 84 LVEISVGTPPFSIVAVADTGSDVIWTQCKPCSNCYQQNAPMFDPSKSTTYKNVACSSPVC 143
Query: 47 TYF----KCVNE-QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
+Y C ++ +C+Y++ Y D S ++G A +T+++ F + GC +DN
Sbjct: 144 SYSGDGSSCSDDSECLYSIAYGDDSHSQGNLAVDTVTMQSTSGRPVAFPRTVIGCGHDNA 203
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT-SSYLKFGTD 160
G + ++G++GL R S ++QLG +FSYCL IP+ G S+ L FG++
Sbjct: 204 G----TFNANVSGIVGLGRGPASLVTQLGPATGGKFSYCL-IPIGTGSTNDSTKLNFGSN 258
Query: 161 MGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
T +T + FY L L+ +S+ + + NFP + GE IIDSG+
Sbjct: 259 ANVSGSGTVSTPIYSSAQYKTFYSLKLEAVSVGDTKFNFPEGASKL--GGESNIIIDSGT 316
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDAN 278
LTY S + L+ + + L D E + C+ P + +FE A+
Sbjct: 317 TLTYLPSAL---LNSFGSAISQSMSLPHAQDPSEFLDYCFATTTDDYEMPPVTMHFEGAD 373
Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ + EN+F+ ++ L + DD + + G+ Q + YD+ +SF +C
Sbjct: 374 VPLQRENLFVRLSDDTICLAFGSFPDDNIFIYGNIAQSNFLVGYDIKNLAVSFQPAHC 431
>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 431
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 102/357 (28%), Positives = 175/357 (49%), Gaps = 34/357 (9%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ + IGTP +L I DTGS LI+ +FDP++SS+++K++C C
Sbjct: 87 LMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSPLFDPKESSTYRKVSCSSSQC 146
Query: 47 TYFK---CVNEQ--CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
+ C ++ C YT+ Y D S TKG A +T+++ G + GC ++N
Sbjct: 147 RALEDASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGSSGRRPVSLRNMIIGCGHENT 206
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
G D A +G++GL + S +SQL I +FSYCLV P + +S + FGT+
Sbjct: 207 G----TFDPAGSGIIGLGGGSTSLVSQLRKSINGKFSYCLV-PFTSETGLTSKINFGTNG 261
Query: 162 GYRRPSTQATKFINH-PNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
+T + P +Y+L+L+ IS+ ++++ F F +GEG +IDSG+ L
Sbjct: 262 IVSGDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQFTSTIFG---TGEGNIVIDSGTTL 318
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLR 280
T S+ Y++L S + ++ D + LCY +F + P + +F+ +++
Sbjct: 319 TLLPSNFYYELESVVAS---TIKAERVQDPDGILSLCYRDSSSF-KVPDITVHFKGGDVK 374
Query: 281 IDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+ N F+ E+ A A ++ L + G+ Q + YD +SF K +CS
Sbjct: 375 LGNLNTFVAVSED-VSCFAFAANEQLT-IFGNLAQMNFLVGYDTVSGTVSFKKTDCS 429
>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 561
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 118/381 (30%), Positives = 166/381 (43%), Gaps = 67/381 (17%)
Query: 4 LFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYF 49
+F+GTP K LILDTGS L + +DP+ SSSF+ I+C P C
Sbjct: 201 VFVGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKDSSSFRNISCHDPRCQLV 260
Query: 50 ---------KCVNEQCVYTMKYADQSVTKGFAAHETISVI-----GKGEGKAIFHGALFG 95
K N+ C Y Y D S T G A ET +V G E K + +FG
Sbjct: 261 SAPDPPKPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGTSELKHV-ENVMFG 319
Query: 96 CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYL 155
C + N G A + +SF SQ+ S+ + FSYCLV N SS L
Sbjct: 320 CGHWNRGLFHGAAGLLGL-----GKGPLSFASQMQSLYGQSFSYCLVDRNSNAS-VSSKL 373
Query: 156 KFGTDMGYRRPSTQATKFINHPN---------------NFYYLSLKDISIDNERMNFPPD 200
FG D + ++HPN FYY+ +K + +D+E + P +
Sbjct: 374 IFGED----------KELLSHPNLNFTSFGGGKDGSVDTFYYVQIKSVMVDDEVLKIPEE 423
Query: 201 TFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL 260
T+ ++ G GG IIDSG+ LTYF Y + E FV + +QL + P P++ CY +
Sbjct: 424 TWHLSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYQLVE--GLP-PLKPCYNV 480
Query: 261 PETFN-RFPSMAFYFED-ANLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRD 317
P F D A EN FI ID E + P L ++IG+ QQ++
Sbjct: 481 SGIEKMELPDFGILFADEAVWNFPVENYFIWIDPEVVCLAILGNPRSAL-SIIGNYQQQN 539
Query: 318 TRFVYDLNIDLLSFVKENCSD 338
+YD+ L + C+D
Sbjct: 540 FHILYDMKKSRLGYAPMKCAD 560
>gi|255563737|ref|XP_002522870.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537954|gb|EEF39568.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 341
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 79/250 (31%), Positives = 131/250 (52%), Gaps = 5/250 (2%)
Query: 94 FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSS 153
FGCS DN F +R G G++GL+ +S + QL ++ +RFSYCL P + +S
Sbjct: 91 FGCSKDNRNFSAFSRTGKTDGIMGLNMSPVSILQQLRNVTNQRFSYCLT-PYGSRPPATS 149
Query: 154 YLKFGTDMGYRRPSTQATKFINHPN-NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
L+FG D+ +T F++ P+ Y+L+L D+S+ +R+ PP+TF + G GG
Sbjct: 150 LLRFGNDISTWGRGFYSTPFVDPPDMPNYFLNLLDLSVAGQRLRLPPETFALKRDGTGGT 209
Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFER--FQLAQLSDCPEPIQLCYFLPETFNRFPSM 270
IIDSG+ LT Y L ++F+ F + D ++ + TF S+
Sbjct: 210 IIDSGTGLTLVVQPAYRHLLGALQNHFDHHGFHRVHIPDTNLELRYNFAQNRTFQNHASL 269
Query: 271 AFYFEDANLRIDGENVFII-DYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
++F+ A+ ++ +++ + EN F + +A H + A+IG+ Q +TRFVY+ L
Sbjct: 270 TYHFQGADFTVEPRYAYVVYNDENAFCVALLASHIEGRAIIGALHQANTRFVYNAAKRRL 329
Query: 330 SFVKENCSDD 339
F EN +D
Sbjct: 330 KFKAENFQND 339
>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 384
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 108/364 (29%), Positives = 165/364 (45%), Gaps = 46/364 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ L IGTP + V L LDTGS L++ +D +SS+F +CD C
Sbjct: 36 LLHLAIGTPPQPVQLTLDTGSVLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDSTQC 95
Query: 47 ----TYFKCVNEQ---CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
+ CVN+ C Y+ Y D+S T GF ET+S + A G +FGC +
Sbjct: 96 KLDPSVTMCVNQTVQTCAYSYSYGDKSATIGFLDVETVSFVAG----ASVPGVVFGCGLN 151
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYL-KFG 158
N G G+ G R +S SQL FS+C +G S+ L
Sbjct: 152 NTGIFRSNE----TGIAGFGRGPLSLPSQLKV---GNFSHCFTAV--SGRKPSTVLFDLP 202
Query: 159 TDMGYR--RPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
D+ Y+ R + Q T I +P + FYYLSLK I++ + R+ P F + +G GG II
Sbjct: 203 ADL-YKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALK-NGTGGTII 260
Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPE--TFNRFPSMAF 272
DSG+ T VY +H++F ++ + + P LC+ P P +
Sbjct: 261 DSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGP---LLCFSAPPLGKAPHVPKLVL 317
Query: 273 YFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
+FE A + + EN + + +A + + +IG+ QQ++ +YDL LSFV
Sbjct: 318 HFEGATMHLPRENYVFEAKDGGNCSICLAIIEGEMTIIGNFQQQNMHVLYDLKNSKLSFV 377
Query: 333 KENC 336
+ C
Sbjct: 378 RAKC 381
>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 365
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 106/360 (29%), Positives = 163/360 (45%), Gaps = 36/360 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
+ + +GTP + +I+DTGS L + A+F P S+SF K+ C C
Sbjct: 14 LATVRLGTPERVFSVIVDTGSDLTWVQCSPCGKCYSQNDALFLPNTSTSFTKLACGSALC 73
Query: 47 T---YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
+ C CVY Y D S+T G ++TI++ G K FGC +DN G
Sbjct: 74 NGLPFPMCNQTTCVYWYSYGDGSLTTGDFVYDTITMDGINGQKQQVPNFAFGCGHDNEGS 133
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
A G+LGL + +SF SQL S+ +FSYCLV L TS L FG
Sbjct: 134 FAGAD-----GILGLGQGPLSFHSQLKSVYNGKFSYCLVDWLAPPTQTSPLL-FGDAAVP 187
Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
P + + +P +YY+ L IS+ + +N FDI G G I DSG+ +T
Sbjct: 188 ILPDVKYLPILANPKVPTYYYVKLNGISVGDNLLNISSTVFDIDSVGGAGTIFDSGTTVT 247
Query: 222 YF----HSDVYWKLHEKFVSYFERF-QLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFED 276
+ +V ++ ++Y + +++L C L F + P+M F+FE
Sbjct: 248 QLAEAAYKEVLAAMNASTMAYSRKIDDISRLDLC-----LSGFPKDQLPTVPAMTFHFEG 302
Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
++ + N FI + + A+ D V +IGS QQ++ + YD L FV ++C
Sbjct: 303 GDMVLPPSNYFIYLESSQSYCFAMTSSPD-VNIIGSVQQQNFQVYYDTAGRKLGFVPKDC 361
>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
Length = 440
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 108/364 (29%), Positives = 165/364 (45%), Gaps = 46/364 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ L IGTP + V L LDTGS L++ +D +SS+F +CD C
Sbjct: 92 LLHLAIGTPPQPVQLTLDTGSVLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDSTQC 151
Query: 47 ----TYFKCVNEQ---CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
+ CVN+ C Y+ Y D+S T GF ET+S + A G +FGC +
Sbjct: 152 KLDPSVTMCVNQTVQTCAYSYSYGDKSATIGFLDVETVSFVAG----ASVPGVVFGCGLN 207
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYL-KFG 158
N G G+ G R +S SQL FS+C +G S+ L
Sbjct: 208 NTGIFRSNE----TGIAGFGRGPLSLPSQLKV---GNFSHCFTAV--SGRKPSTVLFDLP 258
Query: 159 TDMGYR--RPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
D+ Y+ R + Q T I +P + FYYLSLK I++ + R+ P F + +G GG II
Sbjct: 259 ADL-YKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALK-NGTGGTII 316
Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPE--TFNRFPSMAF 272
DSG+ T VY +H++F ++ + + P LC+ P P +
Sbjct: 317 DSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGP---LLCFSAPPLGKAPHVPKLVL 373
Query: 273 YFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
+FE A + + EN + + +A + + +IG+ QQ++ +YDL LSFV
Sbjct: 374 HFEGATMHLPRENYVFEAKDGGNCSICLAIIEGEMTIIGNFQQQNMHVLYDLKNSKLSFV 433
Query: 333 KENC 336
+ C
Sbjct: 434 RAKC 437
>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 430
Score = 131 bits (330), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 108/365 (29%), Positives = 175/365 (47%), Gaps = 53/365 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
++R +GTPS L I DTGS L + +FDP +SS++ + C+ C
Sbjct: 89 LMRFSLGTPSVERLAIFDTGSDLSWLQCTPCKTCYPQEAPLFDPTQSSTYVDVPCESQPC 148
Query: 47 TYF-----KC-VNEQCVYTMKYADQSVTKGFAAHETISV--IGKGEGKAIFHGALFGCS- 97
T F +C ++QC+Y +Y S T G ++TIS G G+G A F ++FGC+
Sbjct: 149 TLFPQNQRECGSSKQCIYLHQYGTDSFTIGRLGYDTISFSSTGMGQGGATFPKSVFGCAF 208
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
N F + G +GL +S SQLG I +FSYC+V P ++ LKF
Sbjct: 209 YSNFTFKISTKAN---GFVGLGPGPLSLASQLGDQIGHKFSYCMV---PFSSTSTGKLKF 262
Query: 158 GTDMGYRRPSTQ--ATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
G+ P+ + +T F+ +P+ ++Y L+L+ I++ +++ +T G I
Sbjct: 263 GS----MAPTNEVVSTPFMINPSYPSYYVLNLEGITVGQKKV--------LTGQIGGNII 310
Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYF-ERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAF 272
IDS +LT+ +Y F+S E + D P P + C P N FP F
Sbjct: 311 IDSVPILTHLEQGIY----TDFISSVKEAINVEVAEDAPTPFEYCVRNPTNLN-FPEFVF 365
Query: 273 YFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
+F A++ + +N+F I +N+ + V P +++ G+ Q + + YDL +SF
Sbjct: 366 HFTGADVVLGPKNMF-IALDNNLVCMTVVPSKG-ISIFGNWAQVNFQVEYDLGEKKVSFA 423
Query: 333 KENCS 337
NCS
Sbjct: 424 PTNCS 428
>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 407
Score = 131 bits (330), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 101/361 (27%), Positives = 169/361 (46%), Gaps = 40/361 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
++ L IGTP + DTGS L++ +FDPR SSS+ I C C
Sbjct: 61 LMELSIGTPPIKIYAEADTGSDLVWFQCIPCTKCYKQQNPMFDPRSSSSYTNITCGTESC 120
Query: 47 TYFK---CVNEQ--CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
C +Q C YT YAD S+T+G A ET+++ F G +FGC ++N
Sbjct: 121 NKLDSSLCSTDQKTCNYTYSYADNSITQGVLAQETLTLTSTTGEPVAFQGIIFGCGHNNS 180
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSII---KKRFSYCLVIPLPNGEYTSSYLKFG 158
GF++ G++GL R +S ISQ+GS + FS CLV P +S + FG
Sbjct: 181 GFNDRE-----MGLIGLGRGPLSLISQIGSSLGAGGNMFSQCLV-PFNTDPSITSQMNFG 234
Query: 159 TDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
T +T I+ Y+ +L IS+++ + F + T++ +G +IDSG+
Sbjct: 235 KGSEVLGNGTVSTPLISKDGTGYFATLLGISVEDINLPFSNGSSLGTIT-KGNILIDSGT 293
Query: 219 VLTYFHSDVYWKLHEKFVS--YFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFED 276
+TY + Y +L E+ + E F++ + +LCY P N P++ +FE
Sbjct: 294 TITYLPEEFYHRLIEQVRNKVALEPFRI-------DGYELCYQTPTNLNG-PTLTIHFEG 345
Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
++ + +FI +++F +++ V G+ Q + +DL ++SF +C
Sbjct: 346 GDVLLTPAQMFIPVQDDNFCFAVFDTNEEYVTY-GNYAQSNYLIGFDLERQVVSFKATDC 404
Query: 337 S 337
+
Sbjct: 405 T 405
>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 131 bits (329), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 101/377 (26%), Positives = 159/377 (42%), Gaps = 59/377 (15%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDH-- 43
++ L IGTP I DTGS LI+ +++P S++F + C+
Sbjct: 91 LMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPCNSSL 150
Query: 44 ---------------PDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAI 88
P C C Y + Y T F ET + G++
Sbjct: 151 SVCAAALAGTGTAPPPGC--------ACTYNVTYGS-GWTSVFQGSETFTFGSTPAGQSR 201
Query: 89 FHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNG 148
G FGCS + GF+ + +G++GL R +S +SQLG +FSYCL P +
Sbjct: 202 VPGIAFGCSTASSGFNASSA----SGLVGLGRGRLSLVSQLG---VPKFSYCLT-PYQDT 253
Query: 149 EYTSSYLKFGTDMGYRRPSTQATKFINHP-----NNFYYLSLKDISIDNERMNFPPDTFD 203
TS+ L + +T F+ P N FYYL+L IS+ ++ PPD F
Sbjct: 254 NSTSTLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFL 313
Query: 204 ITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET 263
+ G GG IIDSG+ +T + Y ++ VS + LC+ LP +
Sbjct: 314 LNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTD--GSAATGLDLCFMLPSS 371
Query: 264 FN---RFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRF 320
+ PSM +F A++ + ++ + D + L D V ++G+ QQ++
Sbjct: 372 TSAPPAMPSMTLHFNGADMVLPADSYMMSDDSGLWCLAMQNQTDGEVNILGNYQQQNMHI 431
Query: 321 VYDLNIDLLSFVKENCS 337
+YD+ + LSF CS
Sbjct: 432 LYDIGQETLSFAPAKCS 448
>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
Length = 502
Score = 131 bits (329), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 102/369 (27%), Positives = 157/369 (42%), Gaps = 53/369 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
+V + +GTP K + LI DTGS L + IFDP S ++ I+C
Sbjct: 155 IVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSTSKTYSNISCTSAA 214
Query: 46 CTYFK--------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
C+ K C + CVY ++Y D S T GF A + +++ +F G +FGC
Sbjct: 215 CSSLKSATGNSPGCSSSNCVYGIQYGDSSFTIGFFAKDKLTLTQN----DVFDGFMFGCG 270
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
+N G G AG++GL R +S + Q K FSYC LP ++ +L F
Sbjct: 271 QNNKGLF-----GKTAGLIGLGRDPLSIVQQTAQKFGKYFSYC----LPTSRGSNGHLTF 321
Query: 158 GTDMGYRRPSTQATKFINHP------NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGG 211
G G + P +Y++ + IS+ + ++ P F G
Sbjct: 322 GNGNGVKASKAVKNGITFTPFASSQGTAYYFIDVLGISVGGKALSISPMLFQ-----NAG 376
Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPE-TFNRFPSM 270
IIDSG+V+T S Y L F + ++ A + CY L T P +
Sbjct: 377 TIIDSGTVITRLPSTAYGSLKSAFKQFMSKYPTAPALSL---LDTCYDLSNYTSISIPKI 433
Query: 271 AFYFE-DANLRIDGENVFIIDYENHFFL-LAVAPHDDLVALIGSQQQRDTRFVYDLNIDL 328
+F F +AN+ +D + I + + L A DD + + G+ QQ+ VYD+
Sbjct: 434 SFNFNGNANVELDPNGILITNGASQVCLAFAGNGDDDSIGIFGNIQQQTLEVVYDVAGGQ 493
Query: 329 LSFVKENCS 337
L F + CS
Sbjct: 494 LGFGYKGCS 502
>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 131 bits (329), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 121/371 (32%), Positives = 166/371 (44%), Gaps = 46/371 (12%)
Query: 4 LFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYF 49
+FIGTP K LILDTGS L + +DP++SSSF+ I C P C
Sbjct: 94 VFIGTPPKHYSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKESSSFRNIGCHDPRCHLV 153
Query: 50 ---------KCVNEQCVYTMKYADQSVTKGFAAHETISV-----IGKGEGKAIFHGALFG 95
K N+ C Y Y D S T G A ET +V GK E K + +FG
Sbjct: 154 SSPDPPLPCKAENQTCPYFYWYGDSSNTTGDFATETFTVNLTSPTGKSEFKRV-ENVMFG 212
Query: 96 CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYL 155
C + N G A +G+LGL R +SF SQL S+ FSYCLV + SS L
Sbjct: 213 CGHWNRGLFHGA-----SGLLGLGRGPLSFSSQLQSLYGHSFSYCLV-DRNSDTNVSSKL 266
Query: 156 KFGTDMGY-RRPSTQATKFINHPNN----FYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
FG D P T + N FYY+ +K I + E +N P T+++T G G
Sbjct: 267 IFGEDKDLLNHPELNFTTLVGGKENPVDTFYYVQIKSIMVGGEVLNIPESTWNMTSDGVG 326
Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPS 269
G I+DSG+ L+YF Y + + FV + + + Q +P CY + P
Sbjct: 327 GTIVDSGTTLSYFTEPAYQIIKDAFVKKVKGYPIVQDFPILDP---CYNVSGVEKIDLPD 383
Query: 270 MAFYFED-ANLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNID 327
F D A EN FI +D E L + +++IG+ QQ++ +YD
Sbjct: 384 FGILFADGAVWNFPVENYFIRLDPEEVVCLAILGTPRSALSIIGNYQQQNFHVLYDTKKS 443
Query: 328 LLSFVKENCSD 338
L + NC+D
Sbjct: 444 RLGYAPMNCAD 454
>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
Length = 749
Score = 131 bits (329), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 117/373 (31%), Positives = 165/373 (44%), Gaps = 45/373 (12%)
Query: 4 LFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYF 49
+FIGTP K LILDTGS L + +DP++SSSF+ I C P C
Sbjct: 196 VFIGTPPKHYSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKESSSFENITCHDPRCKLV 255
Query: 50 ---------KCVNEQCVYTMKYADQSVTKGFAAHETISVI-----GKGEGKAIFHGALFG 95
K N+ C Y Y D S T G A ET +V GK E K + +FG
Sbjct: 256 SSPDPPKPCKDENQTCPYFYWYGDSSNTTGDFALETFTVNLTTPNGKSEQKHV-ENVMFG 314
Query: 96 CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYL 155
C + N G A R +SF SQL SI FSYCLV + SS L
Sbjct: 315 CGHWNRGLFHGAAGLLGL-----GRGPLSFASQLQSIYGHSFSYCLV-DRNSDTSVSSKL 368
Query: 156 KFGTDMGY-RRPSTQATKFI----NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
FG D P+ T F+ N + FYY+ +K I +D E + P +T+ ++ G G
Sbjct: 369 IFGEDKELLSHPNLNFTSFVGGEENSVDTFYYVGIKSIMVDGEVLKIPEETWHLSKEGGG 428
Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPS 269
G IIDSG+ LTYF Y + E F+ + ++L + P P++ CY + P
Sbjct: 429 GTIIDSGTTLTYFAEPAYEIIKEAFMKKIKGYELVE--GFP-PLKPCYNVSGIEKMELPD 485
Query: 270 MAFYFED-ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDL 328
F D A EN FI + L + +++IG+ QQ++ +YD+
Sbjct: 486 FGILFSDGAMWDFPVENYFIQIEPDLVCLAILGTPKSALSIIGNYQQQNFHILYDMKKSR 545
Query: 329 LSFVKENCSDDSA 341
L + C+ ++
Sbjct: 546 LGYAPMKCTATTS 558
>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
Length = 350
Score = 130 bits (327), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 98/356 (27%), Positives = 148/356 (41%), Gaps = 43/356 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
++ + GTP K +I DTGS + + +FDP SS+++ I+C
Sbjct: 17 VITVGFGTPKKNQTVIFDTGSNVNWIQCKPCVVSCYPQQEPLFDPTLSSTYRNISCTSAA 76
Query: 46 CTYFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
CT C CVY + Y D S T GF A ET ++ +F+ +FGC +N G
Sbjct: 77 CTGLSSRGCSGSTCVYGVTYGDGSSTVGFLATETFTL----AAGNVFNNFIFGCGQNNQG 132
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
A AG++GL R S SQL + + FSYC LP+ + YL G +
Sbjct: 133 LFTGA-----AGLIGLGRSPYSLNSQLATSLGNIFSYC----LPSTSSATGYLNIGNPL- 182
Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
R P A + Y++ L IS+ R+ F G IIDSG+V+T
Sbjct: 183 -RTPGYTAMLTNSRAPTLYFIDLIGISVGGTRLALSSTVFQSV-----GTIIDSGTVITR 236
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFEDANLRI 281
Y L F + ++ A + + CY F T FP++ ++ ++ I
Sbjct: 237 LPPTAYGALRTAFRAAMTQYTRAAAASI---LDTCYDFSRTTTVTFPTIKLHYTGLDVTI 293
Query: 282 DGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
G VF + + L D + +IG+ QQR YD + + F C
Sbjct: 294 PGAGVFYVISSSQVCLAFAGNSDSTQIGIIGNVQQRTMEVTYDNALKRIGFAAGAC 349
>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
Length = 443
Score = 130 bits (327), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 105/360 (29%), Positives = 167/360 (46%), Gaps = 38/360 (10%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
+++ IGTP V++I DTGS L + +FDP +SSS++ + C C
Sbjct: 96 MKMSIGTPLVEVIVIADTGSDLTWVQCLPCDPCYRQKSPLFDPSRSSSYRHMLCGSRFCN 155
Query: 48 YFKCVNEQ--------CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
V+EQ C Y Y D+S T G A E ++ +FGC
Sbjct: 156 ALD-VSEQACTMDTNICEYHYSYGDKSYTNGNLATEKFTIGSTSSRPVHLSPIVFGCGTG 214
Query: 100 NHG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
N G FDE G LS +SQL SIIK +FSYCLV PL +S +KFG
Sbjct: 215 NGGTFDELGSGIVGLGGGALS-----LVSQLSSIIKGKFSYCLV-PLSEQSNVTSKIKFG 268
Query: 159 TDMGYRRPSTQATKFIN-HPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
TD P +T ++ P+ +YY++L+ IS+ N+R+ + + V +G IIDSG
Sbjct: 269 TDSVISGPQVVSTPLVSKQPDTYYYVTLEAISVGNKRLPYTNGLLNGNVE-KGNVIIDSG 327
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDA 277
+ LT+ S+ + +L E + ++SD +C+ + P +A +F DA
Sbjct: 328 TTLTFLDSEFFTELERVLE---ETVKAERVSDPRGLFSVCFRSAGDID-LPVIAVHFNDA 383
Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
++++ N F+ E+ ++ + + + G+ Q D YDL +SF +C+
Sbjct: 384 DVKLQPLNTFVKADEDLLCFTMISSNQ--IGIFGNLAQMDFLVGYDLEKRTVSFKPTDCT 441
>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 481
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 102/364 (28%), Positives = 160/364 (43%), Gaps = 50/364 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
+V + +GTP + + I DTGS L + IF+P KS+S+ I+C P
Sbjct: 139 VVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARYCYHQQEPIFNPSKSTSYTNISCSSPT 198
Query: 46 CTYFK--------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
C K C CVY ++Y DQS + GF A + +++ +F+ LFGC
Sbjct: 199 CDELKSGTGNSPSCSASTCVYGIQYGDQSYSVGFFAQDKLALTSTD----VFNNFLFGCG 254
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
+N G +AG++GL R +S +SQ K FSYC LP+ ++ YL F
Sbjct: 255 QNNRGLFV-----GVAGLIGLGRNALSLVSQTAQKYGKLFSYC----LPSTSSSTGYLTF 305
Query: 158 GTDMGYRRPSTQATKFINHPN-NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
G+ G + +N +FY+L+L IS+ +++ F G IIDS
Sbjct: 306 GSGGGTSKAVKFTPSLVNSQGPSFYFLNLIAISVGGRKLSTSASVFST-----AGTIIDS 360
Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE 275
G+V++ Y L F ++ A + + CY F P + YF
Sbjct: 361 GTVISRLPPTAYSDLRASFQQQMSKYPKAAPASI---LDTCYDFSQYDTVDVPKINLYFS 417
Query: 276 D-ANLRIDGENVFIIDYENHFFLLAVAPHDDL--VALIGSQQQRDTRFVYDLNIDLLSFV 332
D A + +D +F I + LA A + D +A++G+ QQ+ VYD+ + F
Sbjct: 418 DGAEMDLDPSGIFYILNISQ-VCLAFAGNSDATDIAILGNVQQKTFDVVYDVAGGRIGFA 476
Query: 333 KENC 336
C
Sbjct: 477 PGGC 480
>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 463
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 104/358 (29%), Positives = 164/358 (45%), Gaps = 50/358 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA------IFDPRKSSSFQKINCDHPDCTYF----- 49
+V + +G+P K ++LI DTGS L +A FDP KS+S+ ++C P C+
Sbjct: 135 IVSIGLGSPKKDLMLIFDTGSDLTWARCSAAETFDPTKSTSYANVSCSTPLCSSVISATG 194
Query: 50 ---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
+C CVY ++Y D S + GF E ++ IG + IF+ FGC G D D
Sbjct: 195 NPSRCAASTCVYGIQYGDGSYSIGFLGKERLT-IGSTD---IFNNFYFGC-----GQDVD 245
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
G AG+LGL R +S +SQ + FSYC LP+ T +L FG+ +
Sbjct: 246 GLFGKAAGLLGLGRDKLSVVSQTAPKYNQLFSYC----LPSSSST-GFLSFGSS---QSK 297
Query: 167 STQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSD 226
S + T + P++FY L L I++ +++ P F G IIDSG+V+T
Sbjct: 298 SAKFTPLSSGPSSFYNLDLTGITVGGQKLAIPLSVFSTA-----GTIIDSGTVVTRLPPA 352
Query: 227 VYWKLHEKFVSYFERFQLAQLSDCPEPIQL---CY-FLPETFNRFPSMAFYFEDA-NLRI 281
Y L F + + + P+ + CY F + P + F ++ +
Sbjct: 353 AYSALRSAFRKAMASYPMGK------PLSILDTCYDFSKYKTIKVPKIVISFSGGVDVDV 406
Query: 282 DGENVFIIDYENHFFLLAVAPHDDL--VALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
D +F+ + LA A + A+ G+ QQR+ VYD++ + F +CS
Sbjct: 407 DQAGIFVANGLKQ-VCLAFAGNTGARDTAIFGNTQQRNFEVVYDVSGGKVGFAPASCS 463
>gi|356557010|ref|XP_003546811.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 437
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 109/362 (30%), Positives = 164/362 (45%), Gaps = 44/362 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ L+IGTP L I DTGS LI+ +F+P KSS+F+ CD C
Sbjct: 93 LMTLYIGTPPVERLAIADTGSDLIWVQCSPCQNCFPQDTPLFEPLKSSTFKAATCDSQPC 152
Query: 47 TYFKCVNEQC------VYTMKYADQSVTKGFAAHETISVIGKGEGKAI-FHGALFGCSND 99
T QC +Y+ Y D+S T G ET+S G+ + + F ++FGC
Sbjct: 153 TSVPPSQRQCGKVGQCIYSYSYGDKSFTVGVVGTETLSFGSTGDAQTVSFPSSIFGCGVY 212
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
N+ F D V +SQLG I +FSYCL LP ++S LKFG+
Sbjct: 213 NN-FTFHTSDKVTGLVGLGGGPLSL-VSQLGPQIGYKFSYCL---LPFSSNSTSKLKFGS 267
Query: 160 DMGYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
+ +T I P +FY+L+L+ ++I + + T +G IIDSG
Sbjct: 268 EAIVTTNGVVSTPLIIKPLFPSFYFLNLEAVTIGQKVVP--------TGRTDGNIIIDSG 319
Query: 218 SVLTYFHSDVYWKLHEKFVSYF-ERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFED 276
+VLTY Y FV+ E + D P P + C+ P P +AF F
Sbjct: 320 TVLTYLEQTFY----NNFVASLQEVLSVESAQDLPFPFKFCF--PYRDMTIPVIAFQFTG 373
Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHD-DLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
A++ + +N+ I + + LAV P +++ G+ Q D + VYDL +SF +
Sbjct: 374 ASVALQPKNLLIKLQDRNMLCLAVVPSSLSGISIFGNVAQFDFQVVYDLEGKKVSFAPTD 433
Query: 336 CS 337
C+
Sbjct: 434 CT 435
>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
Length = 452
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 102/366 (27%), Positives = 160/366 (43%), Gaps = 43/366 (11%)
Query: 6 IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT---- 47
+G P L+++DTGS LI+ ++DPR S + ++I C P C
Sbjct: 98 VGDPPTHALVVIDTGSDLIWLQCLPCRRCYRQVTPLYDPRNSKTHRRIPCASPQCRGVLR 157
Query: 48 YFKC--VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
Y C CVY + Y D S + G A +T+ + H GC +DN G
Sbjct: 158 YPGCDARTGGCVYMVVYGDGSASSGDLATDTLVL----PDDTRVHNVTLGCGHDNEGLLA 213
Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
A AG+LG R +SF +QL FSYCL + +SSYL FG
Sbjct: 214 SA-----AGLLGAGRGQLSFPTQLAPAYGHVFSYCLGDRMSRARNSSSYLVFGRTP--EL 266
Query: 166 PSTQATKFINHPN--NFYYLSLKDISIDNERM-NFPPDTFDIT-VSGEGGCIIDSGSVLT 221
PST T +P + YY+ + S+ ER+ F + + +G GG ++DSG+ ++
Sbjct: 267 PSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVAGFSNASLALNPATGRGGVVVDSGTAIS 326
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL----PETFNRFPSMAFYF-ED 276
F D Y + + FVS+ + +L + CY + P T R PS+ +F
Sbjct: 327 RFTRDAYAAVRDAFVSHAAAAGMRRLRNKFSVFDTCYDVHGNGPGTGVRVPSIVLHFAAA 386
Query: 277 ANLRIDGENVFIIDY---ENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
A++ + N I +F L + DD + ++G+ QQ+ V+D+ + F
Sbjct: 387 ADMALPQANYLIPVVGGDRRTYFCLGLQAADDGLNVLGNVQQQGFGVVFDVERGRIGFTP 446
Query: 334 ENCSDD 339
CS +
Sbjct: 447 NGCSGE 452
>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 436
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 112/365 (30%), Positives = 166/365 (45%), Gaps = 45/365 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++R +IGTP L I DT S LI+ +F+P KSS+F ++CD C
Sbjct: 91 LMRFYIGTPPVERLAIADTASDLIWVQCSPCETCFPQDTPLFEPHKSSTFANLSCDSQPC 150
Query: 47 T-----YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
T Y V C+YT Y D S TKG E+I G F +FGC ++N
Sbjct: 151 TSSNIYYCPLVGNLCLYTNTYGDGSSTKGVLCTESIHF---GSQTVTFPKTIFGCGSNND 207
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
+ + + G++GL +S +SQLG I +FSYCL LP ++ LKFG D
Sbjct: 208 FMHQISNK--VTGIVGLGAGPLSLVSQLGDQIGHKFSYCL---LPFTSTSTIKLKFGNDT 262
Query: 162 GYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
+T I P+ ++Y+L L I+I + + T D T G IID G+V
Sbjct: 263 TITGNGVVSTPLIIDPHYPSYYFLHLVGITIGQKMLQV--RTTDHT---NGNIIIDLGTV 317
Query: 220 LTYFHSDVYWKLHEKFVSYF-ERFQLAQLS-DCPEPIQLCYFLPETFN-RFPSMAFYFED 276
LTY + Y FV+ E +++ D P P C+ P N FP + F F
Sbjct: 318 LTYLEVNFY----HNFVTLLREALGISETKDDIPYPFDFCF--PNQANITFPKIVFQFTG 371
Query: 277 ANLRIDGENVFIIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
A + + +N+F + + LAV P + ++ G+ Q D + YD +SF
Sbjct: 372 AKVFLSPKNLFFRFDDLNMICLAVLPDFYAKGFSVFGNLAQVDFQVEYDRKGKKVSFAPA 431
Query: 335 NCSDD 339
+CS +
Sbjct: 432 DCSKN 436
>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
Length = 393
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 107/356 (30%), Positives = 159/356 (44%), Gaps = 47/356 (13%)
Query: 4 LFIGTPSKGVLLILDTGSALIY------------AIFDPRKSSSFQKINCDHPDCTYFKC 51
+ +GTP K I DTGS L++ IFDPR+SS+F++++C CT
Sbjct: 59 ISVGTPGKRFRAIADTGSDLVWVQSEPCTGCSGGTIFDPRQSSTFREMDCSSQLCTELPG 118
Query: 52 VNE----QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDA 107
E C Y+ +Y T+G A +TIS+ G F GC N GFD
Sbjct: 119 SCEPGSSACSYSYEYGSGE-TEGEFARDTISLGTTSGGSQKFPSFAVGCGMVNSGFD--- 174
Query: 108 RDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPS 167
+ G++GL + +S SQL + I +FSYCLV N + SS L FG
Sbjct: 175 ---GVDGLVGLGQGPVSLTSQLSAAIDSKFSYCLVD--INSQSESSPLLFGPSAALHGTG 229
Query: 168 TQATKFINHPNN----FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
Q+TK I P++ +Y L++ I++ + M P G IIDSG+ LTY
Sbjct: 230 IQSTK-ITPPSDTYPTYYLLTVNGIAVAGQTMGSP-----------GTTIIDSGTTLTYV 277
Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFEDANLRID 282
S VY ++ + S L ++ + LCY N +FP++ A +
Sbjct: 278 PSGVYGRVLSRMESM---VTLPRVDGSSMGLDLCYDRSSNRNYKFPALTIRLAGATMTPP 334
Query: 283 GENVF-IIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
N F ++D LA+ L V++IG+ Q+ +YD LSFV+ C
Sbjct: 335 SSNYFLVVDDSGDTVCLAMGSAGGLPVSIIGNVMQQGYHILYDRGSSELSFVQAKC 390
>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
Length = 502
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 103/356 (28%), Positives = 159/356 (44%), Gaps = 42/356 (11%)
Query: 3 RLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTY 48
R+ +GTP+K + ++LDTGS + + IFDP SS+F+ + C P C
Sbjct: 167 RIGVGTPAKEMYVVLDTGSDVNWIQCLPCSECYQQSDPIFDPTSSSTFKSLTCSDPKCAS 226
Query: 49 F---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
C + +C+Y + Y D S T G A +T++ GE + AL GC +DN G
Sbjct: 227 LDVSACRSNKCLYQVSYGDGSFTVGNYATDTVTF---GESGKVNDVAL-GCGHDNEGL-- 280
Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
G GL + +S I K FSYCLV SS L F +
Sbjct: 281 ------FTGAAGLLGLGGGALSMTNQIKAKSFSYCLV---DRDSAKSSSLDFNSVQIGAG 331
Query: 166 PSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHS 225
+T + + FYY+ L S+ ++++ P F++ SG GG I+D G+ +T +
Sbjct: 332 DATAPLLRNSKMDTFYYVGLSGFSVGGQQVSIPSSLFEVDASGAGGVILDCGTAVTRLQT 391
Query: 226 DVYWKLHEKFVSYFERFQLAQLSDCPEPIQL---CY-FLPETFNRFPSMAFYFEDA-NLR 280
Y L + FV F+ PI L CY F + + P++ F+F +L
Sbjct: 392 QAYNSLRDAFVKLTTDFKKGT-----SPISLFDTCYDFSSLSTVKVPTVTFHFTGGKSLN 446
Query: 281 IDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ +N I + F A AP +++IG+ QQ+ TR YDL +L+ C
Sbjct: 447 LPAKNYLIPIDDAGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLANNLIGLSANKC 502
>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
Length = 479
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 106/363 (29%), Positives = 169/363 (46%), Gaps = 49/363 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
+V GTP+K LLI+DTGS + + IF+P++SSS++ ++C C
Sbjct: 139 IVTAGFGTPAKNSLLIIDTGSDVTWIQCKPCSDCYSQVDPIFEPQQSSSYKHLSCLSSAC 198
Query: 47 TYFKCVNE----QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
T +N CVY + Y D S ++G + ET+++ G F FGC + N G
Sbjct: 199 TELTTMNHCRLGGCVYEINYGDGSRSQGDFSQETLTL-----GSDSFPSFAFGCGHTNTG 253
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
+ + AG+LGL R +SF SQ S +FSYC LP+ ++S F G
Sbjct: 254 LFKGS-----AGLLGLGRTALSFPSQTKSKYGGQFSYC----LPDFVSSTSTGSFSVGQG 304
Query: 163 YRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
+ +++ N +FY++ L IS+ ER++ PP V G GG I+DSG+V+
Sbjct: 305 SIPATATFVPLVSNSNYPSFYFVGLNGISVGGERLSIPP-----AVLGRGGTIVDSGTVI 359
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPI-QLCYFLPE-TFNRFPSMAFYFE-DA 277
T Y L F S A+ P I CY L + R P++ F+F+ +A
Sbjct: 360 TRLVPQAYDALKTSFRSKTRNLPSAK----PFSILDTCYDLSSYSQVRIPTITFHFQNNA 415
Query: 278 NLRIDGENV-FIIDYENHFFLLAVAPHDDLVA--LIGSQQQRDTRFVYDLNIDLLSFVKE 334
++ + + F I + LA A ++ +IG+ QQ+ R +D + F
Sbjct: 416 DVAVSAVGILFTIQSDGSQVCLAFASASQSISTNIIGNFQQQRMRVAFDTGAGRIGFAPG 475
Query: 335 NCS 337
+C+
Sbjct: 476 SCA 478
>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 559
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 113/380 (29%), Positives = 164/380 (43%), Gaps = 65/380 (17%)
Query: 4 LFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYF 49
+F+GTP K LILDTGS L + +DP+ SSSF+ I+C P C
Sbjct: 199 VFVGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKDSSSFRNISCHDPRCQLV 258
Query: 50 ---------KCVNEQCVYTMKYADQSVTKGFAAHETISVI-----GKGEGKAIFHGALFG 95
K N+ C Y Y D S T G A ET +V GK E K + +FG
Sbjct: 259 SSPDPPNPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGKSELKHV-ENVMFG 317
Query: 96 CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYL 155
C + N G A + +SF SQ+ S+ + FSYCLV N SS L
Sbjct: 318 CGHWNRGLFHGAAGLLGL-----GKGPLSFASQMQSLYGQSFSYCLVDRNSNAS-VSSKL 371
Query: 156 KFGTDMGYRRPSTQATKFINHPN---------------NFYYLSLKDISIDNERMNFPPD 200
FG D + ++HPN FYY+ + + +D+E + P +
Sbjct: 372 IFGED----------KELLSHPNLNFTSFGGGKDGSVDTFYYVQINSVMVDDEVLKIPEE 421
Query: 201 TFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL 260
T+ ++ G GG IIDSG+ LTYF Y + E FV + ++L + P P++ CY +
Sbjct: 422 TWHLSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYELVE--GLP-PLKPCYNV 478
Query: 261 PETFN-RFPSMAFYFED-ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDT 318
P F D A EN FI + L + +++IG+ QQ++
Sbjct: 479 SGIEKMELPDFGILFADGAVWNFPVENYFIQIDPDVVCLAILGNPRSALSIIGNYQQQNF 538
Query: 319 RFVYDLNIDLLSFVKENCSD 338
+YD+ L + C+D
Sbjct: 539 HILYDMKKSRLGYAPMKCAD 558
>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 491
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 107/356 (30%), Positives = 165/356 (46%), Gaps = 44/356 (12%)
Query: 3 RLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTY 48
R+ IG+P K V +++DTGS + + IF+P SSS+ + C+ C
Sbjct: 158 RVGIGSPPKHVYMVVDTGSDVNWVQCAPCADCYQQADPIFEPSFSSSYAPLTCETHQCKS 217
Query: 49 F---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
+C N+ C+Y + Y D S T G A ETI++ +G A + GC +DN G
Sbjct: 218 LDVSECRNDSCLYEVSYGDGSYTVGDFATETITL----DGSASLNNVAIGCGHDNEGLFV 273
Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
A ++SF SQ+ + FSYCLV ++S L+F + +
Sbjct: 274 GAAGLLGL-----GGGSLSFPSQINA---SSFSYCLV---NRDTDSASTLEFNSPI---- 318
Query: 166 PSTQATKFI---NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
PS T + N + FYYL + I + + ++ P +F++ SG GG I+DSG+ +T
Sbjct: 319 PSHSVTAPLLRNNQLDTFYYLGMTGIGVGGQMLSIPRSSFEVDESGNGGIIVDSGTAVTR 378
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFEDAN-LR 280
SDVY L + FV + L S CY L + P+++F+F D L
Sbjct: 379 LQSDVYNSLRDSFVRGTQ--HLPSTSGVAL-FDTCYDLSSRSSVEVPTVSFHFPDGKYLA 435
Query: 281 IDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ +N I F A AP +++IG+ QQ+ TR YDL+ L+ F C
Sbjct: 436 LPAKNYLIPVDSAGTFCFAFAPTTSALSIIGNVQQQGTRVSYDLSNSLVGFSPNGC 491
>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
Length = 440
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 107/364 (29%), Positives = 165/364 (45%), Gaps = 46/364 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ L IGTP + V L LDTGS L++ +D +SS+F +CD C
Sbjct: 92 LLHLAIGTPPQPVQLTLDTGSDLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDSTQC 151
Query: 47 ----TYFKCVNEQ---CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
+ CVN+ C ++ Y D+S T GF ET+S + A G +FGC +
Sbjct: 152 KLDPSVTMCVNQTVQTCAFSYSYGDKSATIGFLDVETVSFVAG----ASVPGVVFGCGLN 207
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYL-KFG 158
N G G+ G R +S SQL FS+C +G S+ L
Sbjct: 208 NTGIFRSNE----TGIAGFGRGPLSLPSQLKV---GNFSHCFTAV--SGRKPSTVLFDLP 258
Query: 159 TDMGYR--RPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
D+ Y+ R + Q T I +P + FYYLSLK I++ + R+ P F + +G GG II
Sbjct: 259 ADL-YKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALK-NGTGGTII 316
Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPE--TFNRFPSMAF 272
DSG+ T VY +H++F ++ + + P LC+ P P +
Sbjct: 317 DSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGP---LLCFSAPPLGKAPHVPKLVL 373
Query: 273 YFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
+FE A + + EN + + +A + + +IG+ QQ++ +YDL LSFV
Sbjct: 374 HFEGATMHLPRENYVFEAKDGGNCSICLAIIEGEMTIIGNFQQQNMHVLYDLKNSKLSFV 433
Query: 333 KENC 336
+ C
Sbjct: 434 RAKC 437
>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 113/360 (31%), Positives = 169/360 (46%), Gaps = 45/360 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ L IGTP + I+DTGS LI+ IFDP+KSSSF K++C C
Sbjct: 101 LMNLAIGTPPETYSAIMDTGSDLIWTQCKPCTQCFDQPSPIFDPKKSSSFSKLSCSSQLC 160
Query: 47 TYF--KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
++ C Y Y D S T+G A ET + GK FGC DN G
Sbjct: 161 KALPQSSCSDSCEYLYTYGDYSSTQGTMATETFTF-----GKVSIPNVGFGCGEDNEG-- 213
Query: 105 EDARDG--ALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT--D 160
DG +G++GL R +S +SQL + +FSYCL + +S L G+
Sbjct: 214 ----DGFTQGSGLVGLGRGPLSLVSQLK---EAKFSYCLT---SIDDTKTSTLLMGSLAS 263
Query: 161 MGYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
+ + + T I +P +FYYLSL+ IS+ R+ TF + G GG IIDSG+
Sbjct: 264 VNGTSAAIRTTPLIQNPLQPSFYYLSLEGISVGGTRLPIKESTFQLQDDGTGGLIIDSGT 323
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRF--PSMAFYFED 276
+TY + + ++F S + L + ++LCY LP + P + +F
Sbjct: 324 TITYLEESAFDLVKKEFTS---QMGLPVDNSGATGLELCYNLPSDTSELEVPKLVLHFTG 380
Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
A+L + GEN I D LA+ + ++ G+ QQ++ +DL + LSF+ NC
Sbjct: 381 ADLELPGENYMIADSSMGVICLAMGSSGGM-SIFGNVQQQNMFVSHDLEKETLSFLPTNC 439
>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 108/358 (30%), Positives = 162/358 (45%), Gaps = 47/358 (13%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
R+ IG P++ V ++LDTGS + + IF+P SSS++ ++CD P C
Sbjct: 150 TRVGIGKPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSSSYEPLSCDTPQCN 209
Query: 48 YFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
+ C N C+Y + Y D S T G A ET+++ G + GC + N G
Sbjct: 210 ALEVSECRNATCLYEVSYGDGSYTVGDFATETLTI-----GSTLVQNVAVGCGHSNEGLF 264
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
A ++ SQL + FSYCLV ++S + FGT +
Sbjct: 265 VGAAGLLGL-----GGGLLALPSQLNT---TSFSYCLV---DRDSDSASTVDFGTSLS-- 311
Query: 165 RPSTQATKFINHP-NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
+ A NH + FYYL L IS+ E + P +F++ SG GG IIDSG+ +T
Sbjct: 312 PDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRL 371
Query: 224 HSDVYWKLHEKFVSY---FERFQLAQLSDCPEPIQLCYFL-PETFNRFPSMAFYFEDAN- 278
+++Y L + FV E+ + D CY L +T P++AF+F
Sbjct: 372 QTEIYNSLRDSFVKGTLDLEKAAGVAMFDT------CYNLSAKTTVEVPTVAFHFPGGKM 425
Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
L + +N I F LA AP +A+IG+ QQ+ TR +DL L+ F C
Sbjct: 426 LALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 483
>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 106/362 (29%), Positives = 158/362 (43%), Gaps = 52/362 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ + G+P + +I+DTGS LI+ IFDP KSS++ ++C C
Sbjct: 81 LIDISFGSPPQKASVIVDTGSDLIWTQCLPCETCNAAASVIFDPVKSSTYDTVSCASNFC 140
Query: 47 TY--FKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
+ F+ C Y Y D S T G +S G FGC + N G
Sbjct: 141 SSLPFQSCTTSCKYDYMYGDGSSTSG-----ALSTETVTVGTGTIPNVAFGCGHTNLGSF 195
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
A AG++GL + +S ISQ SI K+FSYCLV P G +S + G D
Sbjct: 196 AGA-----AGIVGLGQGPLSLISQASSITSKKFSYCLV---PLGSTKTSPMLIG-DSAAA 246
Query: 165 RPSTQATKFINHPN-NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
N N FYY L IS+ + + +P TF I SG+GG I+DSG+ LTY
Sbjct: 247 GGVAYTALLTNTANPTFYYADLTGISVSGKAVTYPVGTFSIDASGQGGFILDSGTTLTYL 306
Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEP--------IQLCYFLPETFN-RFPSMAFYF 274
+ F A ++ P P + C+ N +P+M F+F
Sbjct: 307 ETGA-----------FNALVAALKAEVPFPEADGSLYGLDYCFSTAGVANPTYPTMTFHF 355
Query: 275 EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
+ A+ + ENVF+ LA+A +++G+ QQ++ V+DL + F +
Sbjct: 356 KGADYELPPENVFVALDTGGSICLAMAASTGF-SIMGNIQQQNHLIVHDLVNQRVGFKEA 414
Query: 335 NC 336
NC
Sbjct: 415 NC 416
>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
Length = 393
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 106/356 (29%), Positives = 159/356 (44%), Gaps = 47/356 (13%)
Query: 4 LFIGTPSKGVLLILDTGSALIY------------AIFDPRKSSSFQKINCDHPDCTYFKC 51
+ +GTP K I DTGS L++ IFDPR+SS+F++++C C
Sbjct: 59 ISVGTPGKRFRAIADTGSDLVWVQSEPCTGCSGGTIFDPRQSSTFREMDCSSQLCAELPG 118
Query: 52 VNE----QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDA 107
E C Y+ +Y T+G A +TIS+ +G F GC N GFD
Sbjct: 119 SCEPGSSTCSYSYEYGSGE-TEGEFARDTISLGTTSDGSQKFPSFAVGCGMVNSGFD--- 174
Query: 108 RDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPS 167
+ G++GL + +S SQL + I +FSYCLV N + SS L FG
Sbjct: 175 ---GVDGLVGLGQGPVSLTSQLSAAIDSKFSYCLVD--INSQSESSPLLFGPSAALHGTG 229
Query: 168 TQATKFINHPNN----FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
Q+TK I P++ +Y L++ I++ + M P G IIDSG+ LTY
Sbjct: 230 IQSTK-ITPPSDTYPTYYLLTVNGIAVAGQTMGSP-----------GTTIIDSGTTLTYV 277
Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFEDANLRID 282
S VY ++ + S L ++ + LCY N +FP++ A +
Sbjct: 278 PSGVYGRVLSRMESM---VTLPRVDGSSMGLDLCYDRSSNRNYKFPALTIRLAGATMTPP 334
Query: 283 GENVF-IIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
N F ++D LA+ L V++IG+ Q+ +YD LSFV+ C
Sbjct: 335 SSNYFLVVDDSGDTVCLAMGSASGLPVSIIGNVMQQGYHILYDRGSSELSFVQAKC 390
>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 106/355 (29%), Positives = 163/355 (45%), Gaps = 43/355 (12%)
Query: 3 RLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTY 48
R+ IG P LILDTGS + + IF+P S+SF ++C+ C
Sbjct: 152 RVGIGKPPSQAYLILDTGSDVNWVQCAPCADCYQQADPIFEPASSASFSTLSCNTRQCRS 211
Query: 49 F---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
+C N+ C+Y + Y D S T G ETI++ G A GC ++N G
Sbjct: 212 LDVSECRNDTCLYEVSYGDGSYTVGDFVTETITL-----GSAPVDNVAIGCGHNNEGLFV 266
Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
A ++SF SQ+ + FSYCLV ++S L+F + +
Sbjct: 267 GAAGLLGL-----GGGSLSFPSQINA---TSFSYCLV---DRDSESASTLEFNSTL---P 312
Query: 166 PSTQATKFI--NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
P+ + + +H + FYY+ L +S+ E ++ P F I SG GG I+DSG+ +T
Sbjct: 313 PNAVSAPLLRNHHLDTFYYVGLTGLSVGGELVSIPESAFQIDESGNGGVIVDSGTAITRL 372
Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFEDAN-LRI 281
+DVY L + FV +R + ++ CY L N P+++F+F D L +
Sbjct: 373 QTDVYNSLRDAFV---KRTRDLPSTNGIALFDTCYDLSSKGNVEVPTVSFHFPDGKELPL 429
Query: 282 DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+N + F A AP +++IG+ QQ+ TR VYDL L+ FV C
Sbjct: 430 PAKNYLVPLDSEGTFCFAFAPTASSLSIIGNVQQQGTRVVYDLVNHLVGFVPNKC 484
>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
[Brachypodium distachyon]
Length = 540
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 104/362 (28%), Positives = 155/362 (42%), Gaps = 48/362 (13%)
Query: 3 RLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTY 48
R+ IG+P++ + ++LDTGS + + +FDP SSS+ + CD P C
Sbjct: 199 RIGIGSPARQLYMVLDTGSDVTWLQCAPCADCYAQSDPLFDPALSSSYATVPCDSPHCRA 258
Query: 49 FKCV---------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
N CVY + Y D S T G A ET+++ G+G A H GC +D
Sbjct: 259 LDASACHNNAANGNSSCVYEVAYGDGSYTVGDFATETLTL--GGDGSAAVHDVAIGCGHD 316
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
N G A +SF SQ I FSYCLV ++S L+FG
Sbjct: 317 NEGLFVGAAGLLAL-----GGGPLSFPSQ---ISATEFSYCLV---DRDSPSASTLQFGA 365
Query: 160 DMGYRRPSTQATKFINHP--NNFYYLSLKDISIDNERM-NFPPDTFDITVSGEGGCIIDS 216
ST + P N FYY++L IS+ E + + PP F + G GG I+DS
Sbjct: 366 S----DSSTVTAPLMRSPRSNTFYYVALNGISVGGETLSDIPPAAFAMDEQGSGGVIVDS 421
Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP-ETFNRFPSMAFYFE 275
G+ +T S Y L + FV + A CY L + + P+++ FE
Sbjct: 422 GTAVTRLQSSAYSALRDAFVRGTQALPRASGVSL---FDTCYDLAGRSSVQVPAVSLRFE 478
Query: 276 DAN-LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
L++ +N I + LA A V+++G+ QQ+ R +D + + F
Sbjct: 479 GGGELKLPAKNYLIPVDGAGTYCLAFAATGGAVSIVGNVQQQGIRVSFDTAKNTVGFSPN 538
Query: 335 NC 336
C
Sbjct: 539 KC 540
>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 490
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 104/370 (28%), Positives = 162/370 (43%), Gaps = 62/370 (16%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
+V + +G+P + + I DTGS L + IFDP S S+ ++CD P
Sbjct: 148 VVTVGLGSPKRDLTFIFDTGSDLTWTQCEPCVGYCYQQREHIFDPSTSLSYSNVSCDSPS 207
Query: 46 CTYFK--------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
C + C + C+Y ++Y D S + GF A E +S+ +F+ FGC
Sbjct: 208 CEKLESATGNSPGCSSSTCLYGIRYGDGSYSIGFFAREKLSLTSTD----VFNNFQFGCG 263
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
+N G G AG+LGL+R +S +SQ K FSYC LP+ ++ YL F
Sbjct: 264 QNNRGLF-----GGTAGLLGLARNPLSLVSQTAQKYGKVFSYC----LPSSSSSTGYLSF 314
Query: 158 GTDMGYRRPSTQATKFINHPNN-----FYYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
G+ G ++A KF N FY+L + IS+ ++ P F G
Sbjct: 315 GSGDG----DSKAVKFTPSEVNSDYPSFYFLDMVGISVGERKLPIPKSVFST-----AGT 365
Query: 213 IIDSGSVLTYFHSDVY---WKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFP 268
IIDSG+V++ VY K+ + +S + R + + D CY L + + P
Sbjct: 366 IIDSGTVISRLPPTVYSSVQKVFRELMSDYPRVKGVSILD------TCYDLSKYKTVKVP 419
Query: 269 SMAFYFE-DANLRIDGEN-VFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNI 326
+ YF A + + E ++++ A DD VA+IG+ QQ+ VYD
Sbjct: 420 KIILYFSGGAEMDLAPEGIIYVLKVSQVCLAFAGNSDDDEVAIIGNVQQKTIHVVYDDAE 479
Query: 327 DLLSFVKENC 336
+ F C
Sbjct: 480 GRVGFAPSGC 489
>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
Length = 529
Score = 128 bits (322), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 114/370 (30%), Positives = 162/370 (43%), Gaps = 42/370 (11%)
Query: 4 LFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYF 49
+ +GTP K LILDTGS L + A +DP+ S+SF+ I C+ P C+
Sbjct: 166 VLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNEAFYDPKTSASFKNITCNDPRCSLI 225
Query: 50 ---------KCVNEQCVYTMKYADQSVTKGFAAHETISV-IGKGEGKAI---FHGALFGC 96
K N+ C Y Y D+S T G A ET +V + EG++ +FGC
Sbjct: 226 SSPEPPVQCKSDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGRSSEYKVENMMFGC 285
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
+ N G A R +SF SQL S+ FSYCLV + SS L
Sbjct: 286 GHWNRGLFSGASGLLGL-----GRGPLSFSSQLQSLYGHSFSYCLV-DRNSDTNVSSKLI 339
Query: 157 FGTDMGY-RRPSTQATKFINHPNN----FYYLSLKDISIDNERMNFPPDTFDITVSGEGG 211
FG D + T F+N N FYY+ +K I + E ++ P +T++I+ G GG
Sbjct: 340 FGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGEALDIPEETWNISPDGAGG 399
Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPETFNRFPS 269
IIDSG+ L+YF Y + KF + L D P +P + E P
Sbjct: 400 TIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYLV-FRDFPVLDPCFNVSGIEENNIHLPE 458
Query: 270 MAFYFED-ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDL 328
+ F D A EN FI E+ L + ++IG+ QQ++ +YD +
Sbjct: 459 LGIAFADGAVWNFPAENSFIWLSEDLVCLAILGTPKSTFSIIGNYQQQNFHILYDTKMSR 518
Query: 329 LSFVKENCSD 338
L F C+D
Sbjct: 519 LGFTPTKCAD 528
>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 128 bits (322), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 108/358 (30%), Positives = 163/358 (45%), Gaps = 47/358 (13%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
R+ IG P++ V ++LDTGS + + IF+P SSS++ ++CD P C
Sbjct: 153 TRVGIGNPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSSSYEPLSCDTPQCN 212
Query: 48 YFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
+ C N C+Y + Y D S T G A ET+++ G + GC + N G
Sbjct: 213 ALEVSECRNATCLYEVSYGDGSYTVGDFATETLTI-----GSTLVQNVAVGCGHSNEGLF 267
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
A ++ SQL + FSYCLV ++S ++FGT +
Sbjct: 268 VGAAGLLGL-----GGGLLALPSQLNT---TSFSYCLV---DRDSDSASTVEFGTSL--P 314
Query: 165 RPSTQATKFINHP-NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
+ A NH + FYYL L IS+ E + P +F++ SG GG IIDSG+ +T
Sbjct: 315 PDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRL 374
Query: 224 HSDVYWKLHEKFV---SYFERFQLAQLSDCPEPIQLCYFL-PETFNRFPSMAFYFEDAN- 278
+ +Y L + F+ S E+ + D CY L +T P++AF+F
Sbjct: 375 QTGIYNSLRDSFLKGTSDLEKAAGVAMFD------TCYNLSAKTTIEVPTVAFHFPGGKM 428
Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
L + +N I F LA AP +A+IG+ QQ+ TR +DL L+ F C
Sbjct: 429 LALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 486
>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
Length = 434
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 115/366 (31%), Positives = 162/366 (44%), Gaps = 47/366 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+V L IGTP + V L LDTGS LI+ FDP SS+ +CD C
Sbjct: 83 LVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTLC 142
Query: 47 TYFKCV---------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
N+ CVYT Y D+SVT GF + + +G G A G FGC
Sbjct: 143 QGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAG---ASVPGVAFGCG 199
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSS-YLK 156
N+G + G+ G R +S SQL FS+C NG S+ L
Sbjct: 200 LFNNGVFKSNE----TGIAGFGRGPLSLPSQLKV---GNFSHCFTA--VNGLKPSTVLLD 250
Query: 157 FGTDMGYR--RPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
D+ Y+ R + Q+T I +P N FYYLSLK I++ + R+ P F + +G GG
Sbjct: 251 LPADL-YKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFTLK-NGTGGT 308
Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRF-PSMA 271
IIDSG+ +T + VY + + F + + +L +S C P + P +
Sbjct: 309 IIDSGTAMTSLPTRVYRLVRDAFAA---QVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLV 365
Query: 272 FYFEDANLRIDGEN-VFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
+FE A + + EN VF ++ L V IG+ QQ++ +YDL LS
Sbjct: 366 LHFEGATMDLPRENYVFEVEDAGSSILCLAIIEGGEVTTIGNFQQQNMHVLYDLQNSKLS 425
Query: 331 FVKENC 336
FV C
Sbjct: 426 FVPAQC 431
>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
Length = 434
Score = 128 bits (321), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 115/366 (31%), Positives = 162/366 (44%), Gaps = 47/366 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+V L IGTP + V L LDTGS LI+ FDP SS+ +CD C
Sbjct: 83 LVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTLC 142
Query: 47 TYFKCV---------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
N+ CVYT Y D+SVT GF + + +G G A G FGC
Sbjct: 143 QGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAG---ASVPGVAFGCG 199
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSS-YLK 156
N+G + G+ G R +S SQL FS+C NG S+ L
Sbjct: 200 LFNNGVFKSNE----TGIAGFGRGPLSLPSQLKV---GNFSHCFTA--VNGLKPSTVLLD 250
Query: 157 FGTDMGYR--RPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
D+ Y+ R + Q+T I +P N FYYLSLK I++ + R+ P F + +G GG
Sbjct: 251 LPADL-YKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALK-NGTGGT 308
Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRF-PSMA 271
IIDSG+ +T + VY + + F + + +L +S C P + P +
Sbjct: 309 IIDSGTAMTSLPTRVYRLVRDAFAA---QVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLV 365
Query: 272 FYFEDANLRIDGEN-VFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
+FE A + + EN VF ++ L V IG+ QQ++ +YDL LS
Sbjct: 366 LHFEGATMDLPRENYVFEVEDAGSSILCLAIIEGGEVTTIGNFQQQNMHVLYDLQNSKLS 425
Query: 331 FVKENC 336
FV C
Sbjct: 426 FVPAQC 431
>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 459
Score = 127 bits (320), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 105/367 (28%), Positives = 154/367 (41%), Gaps = 43/367 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+V L +GTP + V +LDTGS LI+ IF P SSS++ + C C
Sbjct: 105 LVDLAVGTPPQPVSALLDTGSDLIWTQCAPCASCLPQPDPIFSPGASSSYEPMRCAGELC 164
Query: 47 T---YFKCVN-EQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGALFGCSND 99
+ C + C Y Y D + T+G A E + GE + FGC
Sbjct: 165 NDILHHSCQRPDTCTYRYSYGDGTTTRGVYATERFTFSSSSSGGETTKLSAPLGFGCGTM 224
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
N G + +G++G R +S +SQL +RFSYCL P S L FG+
Sbjct: 225 NKGSLNNG-----SGIVGFGRAPLSLVSQLA---IRRFSYCLT---PYASGRKSTLLFGS 273
Query: 160 DMG----YRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
G + Q T+ + N FYY+ +++ R+ P F + G GG I
Sbjct: 274 LRGGVYDAATATVQTTRLLRSRQNPTFYYVPFTGVTVGARRLRIPISAFALRPDGSGGAI 333
Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF----NRFPS 269
+DSG+ LT F + V ++ F S A S P+ +C+ + P
Sbjct: 334 VDSGTALTLFPAPVLAEVVRAFRSQLRLPFAANGSSGPD-DGVCFAAAASRVPRPAVVPR 392
Query: 270 MAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
M F+ + A+L + N + D L +A D IG+ Q+D R +YDL D L
Sbjct: 393 MVFHLQGADLDLPRRNYVLDDQRKGNLCLLLADSGDSGTTIGNFVQQDMRVLYDLEADTL 452
Query: 330 SFVKENC 336
SF C
Sbjct: 453 SFAPAQC 459
>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
Length = 475
Score = 127 bits (320), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 106/376 (28%), Positives = 166/376 (44%), Gaps = 58/376 (15%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ L +GTP+ I+DTGS L++ +FDP SS++ + C C
Sbjct: 117 LMDLSVGTPALPYAAIVDTGSDLVWTQCKPCVECFNQTTPVFDPAASSTYAALPCSSALC 176
Query: 47 TYFKCVNEQCV-----------YTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFG 95
YT Y D S T+G A ET ++ + G FG
Sbjct: 177 ADLPTSTCASSSSSSSASSPCGYTYTYGDASSTQGVLATETFTL-----ARQKVPGVAFG 231
Query: 96 C--SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSS 153
C +N+ GF + A G++GL R +S +SQLG RFSYCL L + S
Sbjct: 232 CGDTNEGDGFTQGA------GLVGLGRGPLSLVSQLG---IDRFSYCLT-SLDDAAGRSP 281
Query: 154 YL---KFGTDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSG 208
L G Q T + +P+ +FYY+SL +++ + R+ P F I G
Sbjct: 282 LLLGSAAGISASAATAPAQTTPLVKNPSQPSFYYVSLTGLTVGSTRLALPSSAFAIQDDG 341
Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP------E 262
GG I+DSG+ +TY Y L + FV++ L + + LC+ P +
Sbjct: 342 TGGVIVDSGTSITYLELRAYRALRKAFVAHMS---LPTVDASEIGLDLCFQGPAGAVDQD 398
Query: 263 TFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFV 321
+ P + +F+ A+L + EN ++D + L V L ++IG+ QQ++ +FV
Sbjct: 399 VQVQVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVMASRGL-SIIGNFQQQNFQFV 457
Query: 322 YDLNIDLLSFVKENCS 337
YD+ D LSF C+
Sbjct: 458 YDVAGDTLSFAPAECN 473
>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
gi|194702684|gb|ACF85426.1| unknown [Zea mays]
gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
Length = 439
Score = 127 bits (320), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 101/371 (27%), Positives = 169/371 (45%), Gaps = 53/371 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDH-- 43
++ L IGTP L I DTGS LI+ +++P S++F + C+
Sbjct: 86 LMTLAIGTPPLPFLAIADTGSDLIWTQCAPCSRQCFQQPTPLYNPSSSTTFSALPCNSSL 145
Query: 44 ----PDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAI-FHGALFGCSN 98
P C C+Y M Y T F ET + + G FGCSN
Sbjct: 146 GLCAPAC--------ACMYNMTYGS-GWTYVFQGTETFTFGSSTPADQVRVPGIAFGCSN 196
Query: 99 DNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
+ GF+ + +G++GL R ++S +SQLG+ +FSYCL P + TS+ L
Sbjct: 197 ASSGFNASSA----SGLVGLGRGSLSLVSQLGA---PKFSYCLT-PYQDTNSTSTLLLGP 248
Query: 159 TDMGYRRPSTQATKFINHPNN-FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
+ +T F+ P++ +YYL+L IS+ + PP+ F + G GG IIDSG
Sbjct: 249 SASLNDTGVVSSTPFVASPSSIYYYLNLTGISLGTTALPIPPNAFSLKADGTGGLIIDSG 308
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN---RFPSMAFYF 274
+ +T + Y ++ +S + LC+ LP + + PSM +F
Sbjct: 309 TTITMLGNTAYQQVRAAVLSLVTLPTTD--GSAATGLDLCFELPSSTSAPPSMPSMTLHF 366
Query: 275 EDANLRIDGENVFI----IDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNI 326
+ A++ + +N + D ++ + LA+ D +V+++G+ QQ++ +YD+
Sbjct: 367 DGADMVLPADNYMMSLSDPDSDSSLWCLAMQNQTDTDGVVVSILGNYQQQNMHILYDVGK 426
Query: 327 DLLSFVKENCS 337
+ LSF CS
Sbjct: 427 ETLSFAPAKCS 437
>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
Length = 357
Score = 127 bits (319), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 110/359 (30%), Positives = 169/359 (47%), Gaps = 41/359 (11%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
VR+ IG+P+K L++DTGS + + A+FDPR SSSF++++C P C
Sbjct: 16 VRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRASSSFRRLSCSTPQCK 75
Query: 48 YF-----KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
+ +C+Y + Y D S T G A ++ SV +G + +FGC +DN G
Sbjct: 76 LLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSFSV-SRGRTSPV----VFGCGHDNEG 130
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
A G L SF SQL S ++FSYCLV NG SS L FG
Sbjct: 131 LFVGAAGLLGLGAGKL-----SFPSQLSS---RKFSYCLV-SRDNGVRASSALLFGDSAL 181
Query: 163 YRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVS-GEGGCIIDSGSV 219
S T+ + +P + FYY L ISI ++ P F ++ S G GG IIDSG+
Sbjct: 182 PTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVIIDSGTS 241
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE-DA 277
+T + Y + + F S ++ L + +D CY F T P+++F+FE A
Sbjct: 242 VTRLPTYAYTVMRDAFRSATQK--LPRAADF-SLFDTCYDFSALTSVTIPTVSFHFEGGA 298
Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
++++ N + + F A + +++IG+ QQ+ R DL+ + F C
Sbjct: 299 SVQLPPSNYLVPVDTSGTFCFAFSKTSLDLSIIGNIQQQTMRVAIDLDSSRVGFAPRQC 357
>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
Length = 509
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 108/361 (29%), Positives = 161/361 (44%), Gaps = 50/361 (13%)
Query: 3 RLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTY 48
R+ IG+P++ + ++LDTGS + + +FDP S+S+ ++CD P C
Sbjct: 172 RVGIGSPARELYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSPRCRD 231
Query: 49 F---KCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
C N C+Y + Y D S T G A ET+++ G+ + + A+ GC +DN G
Sbjct: 232 LDTAACRNATGACLYEVAYGDGSYTVGDFATETLTL---GDSTPVTNVAI-GCGHDNEGL 287
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
A G LS F SQ I FSYCLV +S L+FG D
Sbjct: 288 FVGAAGLLALGGGPLS-----FPSQ---ISASTFSYCLV---DRDSPAASTLQFGADGA- 335
Query: 164 RRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDI-TVSGEGGCIIDSGSVL 220
T + P FYY++L IS+ + ++ P F + SG GG I+DSG+ +
Sbjct: 336 -EADTVTAPLVRSPRTGTFYYVALSGISVGGQALSIPSSAFAMDATSGSGGVIVDSGTAV 394
Query: 221 TYFHSDVYWKLHEKFV---SYFERFQLAQLSDCPEPIQLCYFLPE-TFNRFPSMAFYFED 276
T S Y L + FV R L D CY L + T P+++ FE
Sbjct: 395 TRLQSSAYAALRDAFVRGTPSLPRTSGVSLFD------TCYDLSDRTSVEVPAVSLRFEG 448
Query: 277 AN-LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
LR+ +N I + LA AP + V++IG+ QQ+ TR +D ++ F
Sbjct: 449 GGALRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKGVVGFTPNK 508
Query: 336 C 336
C
Sbjct: 509 C 509
>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 482
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 109/363 (30%), Positives = 164/363 (45%), Gaps = 49/363 (13%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPDC 46
V + +GTP + + LI DTGS L + IFDP KSSS+ I C C
Sbjct: 142 VVVGLGTPKRDLSLIFDTGSYLTWTQCEPCAGSCYKQQDPIFDPSKSSSYTNIKCTSSLC 201
Query: 47 TYFKCV------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
T F+ + C+Y +KY D S+++GF + E +++ I H LFGC DN
Sbjct: 202 TQFRSAGCSSSTDASCIYDVKYGDNSISRGFLSQERLTITATD----IVHDFLFGCGQDN 257
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
G AG++GLSR ISF+ Q SI K FSYC LP+ + +L FG
Sbjct: 258 EGLFR-----GTAGLMGLSRHPISFVQQTSSIYNKIFSYC----LPSTPSSLGHLTFGAS 308
Query: 161 MGYRRPSTQATKF--INHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
+ + T F I+ N+FY L + IS+ ++ P T S GG IIDSG+
Sbjct: 309 AA-TNANLKYTPFSTISGENSFYGLDIVGISVGGTKL---PAVSSSTFSA-GGSIIDSGT 363
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFEDA 277
V+T Y L F + ++ +A + + CY F P + F F
Sbjct: 364 VITRLPPTAYAALRSAFRQFMMKYPVAYGT---RLLDTCYDFSGYKEISVPRIDFEFA-G 419
Query: 278 NLRIDGENVFIIDYEN-HFFLLAVAPHDDL--VALIGSQQQRDTRFVYDLNIDLLSFVKE 334
++++ V I+ E+ LA A + + + + G+ QQ+ VYD+ + F
Sbjct: 420 GVKVELPLVGILYGESAQQLCLAFAANGNGNDITIFGNVQQKTLEVVYDVEGGRIGFGAA 479
Query: 335 NCS 337
C+
Sbjct: 480 GCN 482
>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 391
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 111/369 (30%), Positives = 164/369 (44%), Gaps = 49/369 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+V L IGTP + V L LDTGS LI+ FDP SS+ +CD C
Sbjct: 36 LVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTLC 95
Query: 47 TYFKCV---------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
N+ CVYT Y D+SVT GF + + +G G A G FGC
Sbjct: 96 QGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAG---ASVPGVAFGCG 152
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
N+G + G+ G R +S SQL FS+C + ++ L
Sbjct: 153 LFNNGVFKSNE----TGIAGFGRGPLSLPSQLK---VGNFSHCFTT-ITGAIPSTVLLDL 204
Query: 158 GTDM-GYRRPSTQATKFINHPNN-----FYYLSLKDISIDNERMNFPPDTFDITVSGEGG 211
D+ + + Q T I + N YYLSLK I++ + R+ P F +T +G GG
Sbjct: 205 PADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALT-NGTGG 263
Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP-ETFNRFPSM 270
IIDSG+ +T VY + ++F + + +L + C+ P + P +
Sbjct: 264 TIIDSGTSITSLPPQVYQVVRDEFAA---QIKLPVVPGNATGHYTCFSAPSQAKPDVPKL 320
Query: 271 AFYFEDANLRIDGEN-VFII--DYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNID 327
+FE A + + EN VF + D N LA+ D+ +IG+ QQ++ +YDL +
Sbjct: 321 VLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKGDE-TTIIGNFQQQNMHVLYDLQNN 379
Query: 328 LLSFVKENC 336
+LSFV C
Sbjct: 380 MLSFVAAQC 388
>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
Length = 357
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 108/359 (30%), Positives = 169/359 (47%), Gaps = 41/359 (11%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
VR+ IG+P+K L++DTGS + + A+FDPR SSSF++++C P C
Sbjct: 16 VRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRASSSFRRLSCSTPQCK 75
Query: 48 YF-----KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
+ +C+Y + Y D S T G A ++ ++ +G + +FGC +DN G
Sbjct: 76 LLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSF-LVSRGRTSPV----VFGCGHDNEG 130
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
A G L SF SQL S ++FSYCLV NG SS L FG
Sbjct: 131 LFVGAAGLLGLGAGKL-----SFPSQLSS---RKFSYCLV-SRDNGVRASSALLFGDSAL 181
Query: 163 YRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVS-GEGGCIIDSGSV 219
S T+ + +P + FYY L ISI ++ P F ++ S G GG IIDSG+
Sbjct: 182 PTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVIIDSGTS 241
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE-DA 277
+T + Y + + F S ++ L + +D CY F T P+++F+FE A
Sbjct: 242 VTRLPTYAYTVMRDAFRSATQK--LPRAADF-SLFDTCYDFSALTSVTIPTVSFHFEGGA 298
Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
++++ N + + F A + +++IG+ QQ+ R DL+ + F C
Sbjct: 299 SVQLPPSNYLVPVDTSGTFCFAFSKTSLDLSIIGNIQQQTMRVAIDLDSSRVGFAPRQC 357
>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
Length = 464
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 98/361 (27%), Positives = 150/361 (41%), Gaps = 54/361 (14%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
VR+ +G+P L++D+GS +I+ +FDP SSSF ++C C
Sbjct: 132 VRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGVSCGSAICR 191
Query: 48 YFKCVNE-------QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
+C Y++ Y D S TKG A ET+++ G G GC + N
Sbjct: 192 TLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTL-----GGTAVQGVAIGCGHRN 246
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
G A AG+LGL +S + QLG FSYCL G + L G
Sbjct: 247 SGLFVGA-----AGLLGLGWGAMSLVGQLGGAAGGVFSYCLA---SRGAGGAGSLVLG-- 296
Query: 161 MGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
T+A ++FYY+ L I + ER+ F +T G GG ++D+G+ +
Sbjct: 297 ------RTEAVPRGRRASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAV 350
Query: 221 TYFHSDVYWKLHEKF---VSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYF-E 275
T + Y L F + R L D CY L + R P+++FYF +
Sbjct: 351 TRLPREAYAALRGAFDGAMGALPRSPAVSLLDT------CYDLSGYASVRVPTVSFYFDQ 404
Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
A L + N +++ F LA AP ++++G+ QQ + D + F
Sbjct: 405 GAVLTLPARN-LLVEVGGAVFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPNT 463
Query: 336 C 336
C
Sbjct: 464 C 464
>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
Length = 533
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 116/368 (31%), Positives = 172/368 (46%), Gaps = 45/368 (12%)
Query: 4 LFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYF 49
+F+G P + LLI+DTGS L + +FDP +S+SF+ I C+ C
Sbjct: 175 VFVGNPPRHFLLIIDTGSDLTWLQCKPCKACFDQSGPVFDPSQSTSFKIIPCNAAACDLV 234
Query: 50 ----------KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAI-FHGALFGCSN 98
K + C Y Y D S T G A E++SV ++ + GC +
Sbjct: 235 VHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLEIRDMVIGCGH 294
Query: 99 DNHGFDEDARDGALAGVLGLSRVTISFISQL-GSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
N G+LGL + +SF SQL S I + FSYCLV N SS + F
Sbjct: 295 SNK-----GLFQGAGGLLGLGQGALSFPSQLRSSPIGQSFSYCLV-DRTNNLSVSSAISF 348
Query: 158 GTDMGYRRPSTQA--TKFINHPNN---FYYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
G R Q T F+ N+ FYYL ++ I ID E + P + F I +G GG
Sbjct: 349 GAGFALSRHFDQMRFTPFVRTNNSVETFYYLGIQGIKIDQELLPIPAERFAIAPNGSGGT 408
Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMA 271
IIDSG+ LTY + D Y + F++ + +D + + +CY T FP+++
Sbjct: 409 IIDSGTTLTYLNRDAYRAVESAFLARISYPR----ADPFDILGICYNATGRTAVPFPTLS 464
Query: 272 FYFED-ANLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
F++ A L + EN FI D + LA+ P D + ++IG+ QQ++ F+YD+ L
Sbjct: 465 IVFQNGAELDLPQENYFIQPDPQEAKHCLAILPTDGM-SIIGNFQQQNIHFLYDVQHARL 523
Query: 330 SFVKENCS 337
F +CS
Sbjct: 524 GFANTDCS 531
>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
Length = 523
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 98/357 (27%), Positives = 154/357 (43%), Gaps = 42/357 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+V + +GTP + +L++ DTGS L + +FDP +S+++ + C +C
Sbjct: 189 IVSVGLGTPRRDLLVVFDTGSDLSWVQCKPCNNCYKQHDPLFDPSQSTTYSAVPCGAQEC 248
Query: 47 T-YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
C + +C Y + Y D S T G A +T+++ G G +FGC +D+ G
Sbjct: 249 LDSGTCSSGKCRYEVVYGDMSQTDGNLARDTLTL---GPSSDQLQGFVFGCGDDDTGLF- 304
Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
G G+ GL R +S SQ + FSYCL P+ YL G+
Sbjct: 305 ----GRADGLFGLGRDRVSLASQAAARYGAGFSYCL----PSSWRAEGYLSLGSAAA--P 354
Query: 166 PSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
P Q T + + +FYYL L I + + P F G +IDSG+V+T
Sbjct: 355 PHAQFTAMVTRSDTPSFYYLDLVGIKVAGRTVRVAPAVFKAP-----GTVIDSGTVITRL 409
Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE-DANLRI 281
S Y L F + R++ A + CY F T + PS+A F+ A L +
Sbjct: 410 PSRAYSALRSSFAGFMRRYKRAPALSI---LDTCYDFTGRTKVQIPSVALLFDGGATLNL 466
Query: 282 D-GENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
G +++ + A D V ++G+ QQ+ VYDL + F + CS
Sbjct: 467 GFGGVLYVANRSQACLAFASNGDDTSVGILGNMQQKTFAVVYDLANQKIGFGAKGCS 523
>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
Length = 452
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 104/366 (28%), Positives = 160/366 (43%), Gaps = 45/366 (12%)
Query: 4 LFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPDC-- 46
L +GTP I+DTGS L + ++DP +SS+F K+ C P C
Sbjct: 100 LSVGTPPLAFPAIIDTGSDLTWTQCAPCTTACFAQPTPLYDPARSSTFSKLPCASPLCQA 159
Query: 47 ---TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGE---GKAIFHGALFGCSNDN 100
+ C CVY +YA T G+ A +T+++ + F G FGCS N
Sbjct: 160 LPSAFRACNATGCVYDYRYA-VGFTAGYLAADTLAIGDGDGDGDASSSFAGVAFGCSTAN 218
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
G DGA +G++GL R +S +SQ+G RFSYCL + + +S + FG
Sbjct: 219 GG----DMDGA-SGIVGLGRSALSLLSQIG---VGRFSYCL---RSDADAGASPILFGAL 267
Query: 161 MGYRRPSTQATKFINHP------NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
Q+T + +P +YY++L I++ + + TF T +G GG I+
Sbjct: 268 ANVTGDKVQSTALLRNPVAARRRAPYYYVNLTGIAVGSTDLPVTSSTFGFTAAGAGGVIV 327
Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYF 274
DSG+ TY Y L + F+S L ++S LC+ P + F F
Sbjct: 328 DSGTTFTYLAEAGYTMLRQAFLSQTAGL-LTRVSGAQFDFDLCFEAGAADTPVPRLVFRF 386
Query: 275 E-DANLRIDGENVF-IIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
A + ++ F +D L V P V++IG+ Q D +YDL+ SF
Sbjct: 387 AGGAEYAVPRQSYFDAVDEGGRVACLLVLPTRG-VSVIGNVMQMDLHVLYDLDGATFSFA 445
Query: 333 KENCSD 338
+C+
Sbjct: 446 PADCAS 451
>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 500
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 105/357 (29%), Positives = 165/357 (46%), Gaps = 45/357 (12%)
Query: 3 RLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTY 48
R+ +G P++ + ++LDTGS + + ++DP S+S+ + CD P C
Sbjct: 166 RVGVGRPARQLYMVLDTGSDVTWLQCQPCADCYAQSDPVYDPSVSTSYATVGCDSPRCRD 225
Query: 49 F---KCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
C N C+Y + Y D S T G A ET+++ G+ + + A+ GC +DN G
Sbjct: 226 LDAAACRNSTGSCLYEVAYGDGSYTVGDFATETLTL---GDSAPVSNVAI-GCGHDNEGL 281
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
AG+L L +SF SQ+ + FSYCLV +SS L+FG
Sbjct: 282 FV-----GAAGLLALGGGPLSFPSQISATT---FSYCLV---DRDSPSSSTLQFGDS--- 327
Query: 164 RRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
+P+ A I P N FYY++L IS+ E ++ P F + +G GG I+DSG+ +T
Sbjct: 328 EQPAVTA-PLIRSPRTNTFYYVALSGISVGGEALSIPSSAFAMDDAGSGGVIVDSGTAVT 386
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP-ETFNRFPSMAFYFEDAN-L 279
S Y L E FV + A CY L + + P++A +FE L
Sbjct: 387 RLQSGAYGALREAFVQGTQSLPRASGVSL---FDTCYDLAGRSSVQVPAVALWFEGGGEL 443
Query: 280 RIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
++ +N I + LA A V++IG+ QQ+ R +D + + F + C
Sbjct: 444 KLPAKNYLIPVDAAGTYCLAFAGTSGPVSIIGNVQQQGVRVSFDTAKNTVGFTADKC 500
>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 109/376 (28%), Positives = 160/376 (42%), Gaps = 58/376 (15%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ L IGTP + V +LDTGS LI+ +F P SSS+ + C C
Sbjct: 104 LIDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLAQPDPLFAPAASSSYVPMRCSGQLC 163
Query: 47 T---YFKCVN-EQCVYTMKYADQSVTKGFAAHETIS-VIGKGEGKAIFHGALFGCSNDNH 101
+ C + C Y Y D + T G A E + GE ++ G FGC N
Sbjct: 164 NDILHHSCQRPDTCTYRYNYGDGTTTLGVYATERFTFASSSGEKLSVPLG--FGCGTMNV 221
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT-- 159
G + +G++G R +S +SQL SI +RFSYCL P S L FG+
Sbjct: 222 GSLNNG-----SGIVGFGRDPLSLVSQL-SI--RRFSYCLT---PYTSTRKSTLMFGSLS 270
Query: 160 -------DMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
D + Q T+ + N FYY+ +++ R+ P F + G G
Sbjct: 271 DGVFEGDDAATGQ--VQTTRLLQSRQNPTFYYVPFTGVTVGTRRLRIPLSAFALRPDGSG 328
Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP--------- 261
G I+DSG+ LT F + V L E ++ + +L S +C+ P
Sbjct: 329 GVIVDSGTALTLFPAAV---LTEVLRAFRAQLRLPFTSSSSPDDGVCFATPMAAGGRRAS 385
Query: 262 -ETFNRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRF 320
T P MAF+F+ A+L + N + D + +A D A IG+ Q+D R
Sbjct: 386 AATVVSVPRMAFHFQGADLELPRRNYVLDDPRRGSLCILLADSGDSGATIGNFVQQDMRV 445
Query: 321 VYDLNIDLLSFVKENC 336
+YDL + LSF C
Sbjct: 446 LYDLEAETLSFAPAQC 461
>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 441
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 107/359 (29%), Positives = 166/359 (46%), Gaps = 34/359 (9%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ L IGTP V+ I+DTGS L + FDP+ SS+++ +C C
Sbjct: 93 IMNLSIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPFFDPKNSSTYRDSSCGTSFC 152
Query: 47 TYF----KCVN-EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
C N ++C + YAD S T G A ET++V F G FGC + +
Sbjct: 153 LALGNDRSCRNGKKCTFMYSYADGSFTGGNLAVETLTVASTAGKPVSFPGFAFGCVHRSG 212
Query: 102 G-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
G FDE + +G++GL +S ISQL S I RFSYCL +P+ SS + FG
Sbjct: 213 GIFDEHS-----SGIVGLGVAELSMISQLKSTINGRFSYCL-LPVFTDSSMSSRINFGRS 266
Query: 161 MGYRRPSTQATKFI-NHPNNFYYL-SLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
T +T + P+ +YYL +L+ S+ +R+++ + V EG I+DSG+
Sbjct: 267 GIVSGAGTVSTPLVMKGPDTYYYLITLEGFSVGKKRLSYKGFSKKAEVE-EGNIIVDSGT 325
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDAN 278
TY + Y KL E S + ++ D LCY P + +F+DAN
Sbjct: 326 TYTYLPLEFYVKLEE---SVAHSIKGKRVRDPNGISSLCYNTTVDQIDAPIITAHFKDAN 382
Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+ + N F+ E+ V P D + ++G+ Q + +DL +SF +C+
Sbjct: 383 VELQPWNTFLRMQED-LVCFTVLPTSD-IGILGNLAQVNFLVGFDLRKKRVSFKAADCT 439
>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
Length = 449
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 115/368 (31%), Positives = 171/368 (46%), Gaps = 45/368 (12%)
Query: 4 LFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYF 49
+F+G P + LLI+DTGS L + +FDP +S+SF+ I C+ C
Sbjct: 91 VFVGNPPRHFLLIIDTGSDLTWLQCKPCKACFDQSGPVFDPSQSTSFKIIPCNAAACDLV 150
Query: 50 ----------KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAI-FHGALFGCSN 98
K + C Y Y D S T G A E++SV ++ + GC +
Sbjct: 151 VHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLEIRDMVIGCGH 210
Query: 99 DNHGFDEDARDGALAGVLGLSRVTISFISQL-GSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
N G+LGL + +SF SQL S I + FSYCLV N SS + F
Sbjct: 211 SNK-----GLFQGAGGLLGLGQGALSFPSQLRSSPIGQSFSYCLV-DRTNNLSVSSAISF 264
Query: 158 GTDMGYRRPSTQA--TKFINHPNN---FYYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
G R Q T F+ N+ FYYL ++ I ID E + P + F I +G GG
Sbjct: 265 GAGFALSRHFDQMKFTPFVRTNNSVETFYYLGIQGIKIDQELLPIPAERFAIATNGSGGT 324
Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMA 271
IIDSG+ LTY + D Y + F++ + +D + + +CY FP+++
Sbjct: 325 IIDSGTTLTYLNRDAYRAVESAFLARISYPR----ADPFDILGICYNATGRAAVPFPALS 380
Query: 272 FYFED-ANLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
F++ A L + EN FI D + LA+ P D + ++IG+ QQ++ F+YD+ L
Sbjct: 381 IVFQNGAELDLPQENYFIQPDPQEAKHCLAILPTDGM-SIIGNFQQQNIHFLYDVQHARL 439
Query: 330 SFVKENCS 337
F +CS
Sbjct: 440 GFANTDCS 447
>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 101/351 (28%), Positives = 167/351 (47%), Gaps = 35/351 (9%)
Query: 6 IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT---Y 48
+GTP V ++DTGS +++ IF+P KSSS++ I C C Y
Sbjct: 93 VGTPPFNVYGVVDTGSDIVWLQCKPCEQCYKQTTPIFNPSKSSSYKNIPCSSNLCQSVRY 152
Query: 49 FKCVNEQ--CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
C N+Q C YT+ ++DQS ++G + ET+++ F + GC ++N G +
Sbjct: 153 TSC-NKQNSCEYTINFSDQSYSQGELSVETLTLDSTTGHSVSFPKTVIGCGHNNRGMFQ- 210
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
G +G++GL +S +QL S I +FSYCL +PL +S L FG
Sbjct: 211 ---GETSGIVGLGIGPVSLTTQLKSSIGGKFSYCL-LPLLVDSNKTSKLNFGDAAVVSGD 266
Query: 167 STQATKFINH-PNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHS 225
+T F+ P FYYL+L+ S+ N+R+ F + D S EG I+DSG+ LT S
Sbjct: 267 GVVSTPFVKKDPQAFYYLTLEAFSVGNKRIEF--EVLDD--SEEGNIILDSGTTLTLLPS 322
Query: 226 DVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDGEN 285
VY L + +L ++ D + + LCY + FP + +F+ A+++++ +
Sbjct: 323 HVYTNLESAVA---QLVKLDRVDDPNQLLNLCYSITSDQYDFPIITAHFKGADIKLNPIS 379
Query: 286 VFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
F + L + + G+ Q + YDL +++SF +C
Sbjct: 380 TFAHVADGVVCLAFTSSQTG--PIFGNLAQLNLLVGYDLQQNIVSFKPSDC 428
>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
Length = 524
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 95/367 (25%), Positives = 159/367 (43%), Gaps = 46/367 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+VR+ +G+P L++D+GS +++ +FDP S++F ++C C
Sbjct: 172 LVRVSVGSPPTEQYLVVDSGSDVMWVQCKPCLECYVQADPLFDPATSATFSGVSCGSAIC 231
Query: 47 TYF---KCVNEQ---CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
C + + C Y + YAD S TKG A ET+++ G G + GC + N
Sbjct: 232 RILPTSACGDGELGGCEYEVSYADGSYTKGALALETLTL-----GGTAVEGVVIGCGHRN 286
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSS------- 153
G A AG++GL +S + QLG + FSYCL G Y S
Sbjct: 287 RGLFVGA-----AGLMGLGWGPMSLVGQLGGEVGGAFSYCLA---SRGGYGSGAADDDAG 338
Query: 154 YLKFGTDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGG 211
+L G + + +P +FYY+ L I + +ER+ F +T G G
Sbjct: 339 WLVLGRSEAVPEGAVW-VPLVRNPRAPSFYYVGLSGIEVGDERLPLQAGLFQLTEDGAGD 397
Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSM 270
++D+G+ +T + Y L + FV + CY L + R P++
Sbjct: 398 VVMDTGTTVTRLPQEAYAALRDAFVGALAGAVPRAQGVSSSVLDTCYDLSGYASVRVPTV 457
Query: 271 AFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
+F F+ DA L + NV +++ + + LA AP ++++G+ QQ + D +
Sbjct: 458 SFCFDGDARLILAARNV-LLEVDMGIYCLAFAPSSSGLSIMGNTQQAGIQITVDSANGYI 516
Query: 330 SFVKENC 336
F NC
Sbjct: 517 GFGPANC 523
>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
Length = 500
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 95/353 (26%), Positives = 157/353 (44%), Gaps = 36/353 (10%)
Query: 3 RLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTY 48
R+ +GTP+K + L+LDTGS + + +F+P SS+++ + C P C+
Sbjct: 165 RIGVGTPAKDMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAPQCSL 224
Query: 49 FK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
+ C + +C+Y + Y D S T G A +T++ G+ + GC +DN G
Sbjct: 225 LETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGK----INNVALGCGHDNEGL-- 278
Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
G GL + +S + FSYCLV SS L F +
Sbjct: 279 ------FTGAAGLLGLGGGVLSITNQMKATSFSYCLV---DRDSGKSSSLDFNSVQLGGG 329
Query: 166 PSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHS 225
+T + FYY+ L S+ E++ P FD+ SG GG I+D G+ +T +
Sbjct: 330 DATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQT 389
Query: 226 DVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFEDA-NLRIDG 283
Y L + F+ L + S CY F + + P++AF+F +L +
Sbjct: 390 QAYNSLRDAFLKL--TVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPA 447
Query: 284 ENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+N I ++ F A AP +++IG+ QQ+ TR YDL+ +++ C
Sbjct: 448 KNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500
>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
Length = 472
Score = 124 bits (311), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 100/365 (27%), Positives = 155/365 (42%), Gaps = 48/365 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY-------------AIFDPRKSSSFQKINCDHPDCT 47
+++L GTP + +LDTGS + + F+P KSS++ + C C
Sbjct: 125 IIKLGFGTPPQSFYTVLDTGSNIAWIPCNPCSGCSSKQQPFEPSKSSTYNYLTCASQQCQ 184
Query: 48 YFKCVNE-----QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
+ + C T +Y DQS + ET+SV G +FGCSN G
Sbjct: 185 LLRVCTKSDNSVNCSLTQRYGDQSEVDEILSSETLSV-----GSQQVENFVFGCSNAARG 239
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
+ ++G R +SF+SQ ++ FSYCL L + +T S L +G
Sbjct: 240 LIQRT-----PSLVGFGRNPLSFVSQTATLYDSTFSYCLP-SLFSSAFTGSLL-----LG 288
Query: 163 YRRPSTQATKFINHPNN-----FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
S Q KF +N FYY+ L IS+ E ++ P T + S G IIDSG
Sbjct: 289 KEALSAQGLKFTPLLSNSRYPSFYYVGLNGISVGEELVSIPAGTLSLDESTGRGTIIDSG 348
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFED- 276
+V+T Y + + F S +A +D CY P FP + +F+D
Sbjct: 349 TVITRLVEPAYNAMRDSFRSQLSNLTMASPTDL---FDTCYNRPSGDVEFPLITLHFDDN 405
Query: 277 ANLRIDGENVFIIDYENHFFL-----LAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSF 331
+L + +N+ ++ L L DD+++ G+ QQ+ R V+D+ L
Sbjct: 406 LDLTLPLDNILYPGNDDGSVLCLAFGLPPGGGDDVLSTFGNYQQQKLRIVHDVAESRLGI 465
Query: 332 VKENC 336
ENC
Sbjct: 466 ASENC 470
>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
Short=AtASPG1; Flags: Precursor
gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 500
Score = 124 bits (311), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 95/353 (26%), Positives = 157/353 (44%), Gaps = 36/353 (10%)
Query: 3 RLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTY 48
R+ +GTP+K + L+LDTGS + + +F+P SS+++ + C P C+
Sbjct: 165 RIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAPQCSL 224
Query: 49 FK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
+ C + +C+Y + Y D S T G A +T++ G+ + GC +DN G
Sbjct: 225 LETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGK----INNVALGCGHDNEGL-- 278
Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
G GL + +S + FSYCLV SS L F +
Sbjct: 279 ------FTGAAGLLGLGGGVLSITNQMKATSFSYCLV---DRDSGKSSSLDFNSVQLGGG 329
Query: 166 PSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHS 225
+T + FYY+ L S+ E++ P FD+ SG GG I+D G+ +T +
Sbjct: 330 DATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQT 389
Query: 226 DVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFEDA-NLRIDG 283
Y L + F+ L + S CY F + + P++AF+F +L +
Sbjct: 390 QAYNSLRDAFLKL--TVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPA 447
Query: 284 ENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+N I ++ F A AP +++IG+ QQ+ TR YDL+ +++ C
Sbjct: 448 KNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500
>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 482
Score = 124 bits (311), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 101/362 (27%), Positives = 155/362 (42%), Gaps = 51/362 (14%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
VR+ +G+P + +++D+GS +++ +FDP SSSF ++C C
Sbjct: 145 VRIGVGSPPRNQYMVIDSGSDIVWVQCKPCSRCYQQSDPVFDPADSSSFAGVSCGSDVCD 204
Query: 48 YFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
+ C +C Y + Y D S TKG A ET++V G+ + GC + N G
Sbjct: 205 RLENTGCNAGRCRYEVSYGDGSYTKGTLALETLTV-----GQVMIRDVAIGCGHTNQGMF 259
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
A ++SFI QLG FSYCLV G ++ L+FG G
Sbjct: 260 IGAAGLLGL-----GGGSMSFIGQLGGQTGGAFSYCLV---SRGTGSTGALEFG--RGAL 309
Query: 165 RPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
I +P +FYY+ L I + R++ P +TF +T G G ++D+G+ +T
Sbjct: 310 PVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEYGTNGVVMDTGTAVTR 369
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEP-----IQLCYFLPETFN--RFPSMAFYFE 275
F + Y + F AQ S+ P CY L F R P+++FYF
Sbjct: 370 FPTAAYVAFRDSFT--------AQTSNLPRAPGVSIFDTCYDL-NGFESVRVPTVSFYFS 420
Query: 276 DAN-LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
D L + N I F LA AP +++IG+ QQ + +D + F
Sbjct: 421 DGPVLTLPARNFLIPVDGGGTFCLAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN 480
Query: 335 NC 336
C
Sbjct: 481 IC 482
>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
Length = 440
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 105/362 (29%), Positives = 164/362 (45%), Gaps = 45/362 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
++R++IGTPS L I DTGS L + ++DP SS+F + CD
Sbjct: 97 LMRIYIGTPSVERLAIADTGSDLTWVQCSPCDNTKCFAQNTPLYDPLNSSTFTLLPCDSQ 156
Query: 45 DCTY-----FKCVN-EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL-FGCS 97
CT + C + C+Y Y D S + G + ++I ++ + ++ + FGC
Sbjct: 157 PCTQLPYSQYVCSDYGDCIYAYTYGDNSYSYGGLSSDSIRLMLL---QLHYNSKICFGCG 213
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
N F D + G G++GL +S +SQLG I +FSYCL LP ++S LKF
Sbjct: 214 FQNK-FTAD-KSGKTTGIVGLGAGPLSLVSQLGDEIGHKFSYCL---LPFSSNSNSKLKF 268
Query: 158 GTDMGYRRPSTQATKFINHPN-NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
G + +T I P+ FYYL+L+ I++ + + T +G IIDS
Sbjct: 269 GEAAIVQGNGVVSTPLIIKPDLPFYYLNLEGITVGAKTVK--------TGQTDGNIIIDS 320
Query: 217 GSVLTYFHSDVYWKLHEKFVSYF-ERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE 275
GS LTY Y +FVS E + + P P C+ E + P + F+F
Sbjct: 321 GSTLTYLEESFY----NEFVSLVKETVAVEEDQYIPYPFDFCFTYKEGMSTPPDVVFHFT 376
Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
++ + N ++ +N V H D +A+ G+ Q D YD+ +SF +
Sbjct: 377 GGDVVLKPMNTLVLIEDNLICSTVVPSHFDGIAIFGNLGQIDFHVGYDIQGGKVSFAPTD 436
Query: 336 CS 337
CS
Sbjct: 437 CS 438
>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
Length = 500
Score = 124 bits (310), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 107/362 (29%), Positives = 161/362 (44%), Gaps = 55/362 (15%)
Query: 3 RLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC-- 46
R+ +G+P++ + ++LDTGS + + +FDP S+S+ + CD+P C
Sbjct: 166 RVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSTSYASVACDNPRCHD 225
Query: 47 -TYFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
C N C+Y + Y D S T G A ET+++ G+ + A+ GC +DN G
Sbjct: 226 LDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTL---GDSAPVSSVAI-GCGHDNEGL 281
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG--TDM 161
A G LS F SQ+ + FSYCLV +SS L+FG D
Sbjct: 282 FVGAAGLLALGGGPLS-----FPSQISATT---FSYCLV---DRDSPSSSTLQFGDAADA 330
Query: 162 GYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
P I P + FYY+ L IS+ + ++ PP F + +G GG I+DSG+
Sbjct: 331 EVTAP------LIRSPRTSTFYYVGLSGISVGGQILSIPPSAFAMDGTGAGGVIVDSGTA 384
Query: 220 LTYFHSDVYWKLHEKFV---SYFERFQLAQLSDCPEPIQLCYFLPE-TFNRFPSMAFYFE 275
+T S Y L + FV R L D CY L + T P+++ F
Sbjct: 385 VTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFD------TCYDLSDRTSVEVPAVSLRFA 438
Query: 276 DAN-LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
LR+ +N I + LA AP + V++IG+ QQ+ TR +D + F
Sbjct: 439 GGGELRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKSTVGFTSN 498
Query: 335 NC 336
C
Sbjct: 499 KC 500
>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 447
Score = 124 bits (310), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 106/363 (29%), Positives = 155/363 (42%), Gaps = 38/363 (10%)
Query: 1 MVRLFIGTP-SKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPD 45
++ IGTP + V L +DTGS +++ FD S + + C P
Sbjct: 93 LIHFGIGTPRPQQVALEVDTGSDVVWTQCRPCFDCFTQPLPRFDTSASDTVHGVLCTDPI 152
Query: 46 CTYFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C + C C Y + Y D SVT G A ++ + GKG GK +FGC N G
Sbjct: 153 CRALRPHACFLGGCTYQVNYGDNSVTIGQLAKDSFTFDGKGGGKVTVPDLVFGCGQYNTG 212
Query: 103 -FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
F + G+ G R +S QLG FSYC + + T +L
Sbjct: 213 NFHSNE-----TGIAGFGRGPLSLPRQLGV---SSFSYCFTT-IFESKSTPVFLGGAPAD 263
Query: 162 GYRRPSTQ---ATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
G R +T +T F+ NHP +YYLSLK I++ R+ P F + G GG IIDSG
Sbjct: 264 GLRAHATGPILSTPFLPNHPE-YYYLSLKGITVGKTRLAVPESAFVVKADGSGGTIIDSG 322
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY---FLPETFN-RFPSMAFY 273
+ +T F V+ L E FV+ +D EP C+ +P+ P M +
Sbjct: 323 TAITAFPRAVFRSLWEAFVAQVP-LPHTSYNDTGEPTLQCFSTESVPDASKVPVPKMTLH 381
Query: 274 FEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
E A+ + EN ++ + V DD +IG+ QQ++ V+DL + L
Sbjct: 382 LEGADWELPRENYMAEYPDSDQLCVVVLAGDDDRTMIGNFQQQNMHIVHDLAGNKLVIEP 441
Query: 334 ENC 336
C
Sbjct: 442 AQC 444
>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 504
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 106/362 (29%), Positives = 161/362 (44%), Gaps = 55/362 (15%)
Query: 3 RLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC-- 46
R+ +G+P++ + ++LDTGS + + +FDP S+S+ + CD+P C
Sbjct: 170 RVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSTSYASVACDNPRCHD 229
Query: 47 -TYFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
C N C+Y + Y D S T G A ET+++ G+ + A+ GC +DN G
Sbjct: 230 LDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTL---GDSAPVSSVAI-GCGHDNEGL 285
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG--TDM 161
A G LS F SQ+ + FSYCLV +SS L+FG D
Sbjct: 286 FVGAAGLLALGGGPLS-----FPSQISATT---FSYCLV---DRDSPSSSTLQFGDAADA 334
Query: 162 GYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
P I P + FYY+ L +S+ + ++ PP F + +G GG I+DSG+
Sbjct: 335 EVTAP------LIRSPRTSTFYYVGLSGLSVGGQILSIPPSAFAMDSTGAGGVIVDSGTA 388
Query: 220 LTYFHSDVYWKLHEKFV---SYFERFQLAQLSDCPEPIQLCYFLPE-TFNRFPSMAFYFE 275
+T S Y L + FV R L D CY L + T P+++ F
Sbjct: 389 VTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFD------TCYDLSDRTSVEVPAVSLRFA 442
Query: 276 DAN-LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
LR+ +N I + LA AP + V++IG+ QQ+ TR +D + F
Sbjct: 443 GGGELRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKSTVGFTTN 502
Query: 335 NC 336
C
Sbjct: 503 KC 504
>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
Length = 455
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 109/387 (28%), Positives = 167/387 (43%), Gaps = 77/387 (19%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHPD 45
+ + +GTP +I+DTGS LI+A + P +SS+F ++ C+
Sbjct: 93 MNISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQPARSSTFSRLPCNGSF 152
Query: 46 CTYFKC--------VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
C Y C Y Y T G+ A ET++V G F FGCS
Sbjct: 153 CQYLPTSSRPRTCNATAACAYNYTYGS-GYTAGYLATETLTV-----GDGTFPKVAFGCS 206
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
+N G D + G++GL R +S +SQL RFSYCL + +G +S + F
Sbjct: 207 TEN-GVDNSS------GIVGLGRGPLSLVSQLA---VGRFSYCLRSDMADGG--ASPILF 254
Query: 158 GTDMGY-RRPSTQATKFINHP----NNFYYLSLKDISIDNERMNFPPDTFDITVSG-EGG 211
G+ R Q+T + +P + YY++L I++D+ + TF T +G GG
Sbjct: 255 GSLAKLTERSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLGGG 314
Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERF-QLAQLSDCPEPIQLCYFLPETFN----- 265
I+DSG+ LTY D Y + + F S Q S P + LCY P
Sbjct: 315 TIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCY-KPSAGGGGKAV 373
Query: 266 RFPSMAFYFEDANLRIDGENVFIIDYENHFF--------------LLAVAPHDDL-VALI 310
R P +A LR G + + +N+F LL + DDL +++I
Sbjct: 374 RVPRLA-------LRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDLPISII 426
Query: 311 GSQQQRDTRFVYDLNIDLLSFVKENCS 337
G+ Q D +YD++ + SF +C+
Sbjct: 427 GNLMQMDMHLLYDIDGGMFSFAPADCA 453
>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 108/362 (29%), Positives = 172/362 (47%), Gaps = 45/362 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+++L IGTP + I+DTGS LI+ IFDP+KSSSF K++C C
Sbjct: 98 LMKLAIGTPPETYSAIMDTGSDLIWTQCKPCTQCFDQPTPIFDPKKSSSFSKLSCSSKLC 157
Query: 47 TYF--KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG-- 102
++ C Y Y D S T+G A ET++ GK FGC DN G
Sbjct: 158 EALPQSTCSDGCEYLYGYGDYSSTQGMLASETLTF-----GKVSVPEVAFGCGEDNEGSG 212
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
F + + G++GL R +S +SQL + +FSYCL + +S L G+
Sbjct: 213 FSQGS------GLVGLGRGPLSLVSQLK---EPKFSYCLT---SVDDTKASTLLMGSLAS 260
Query: 163 YRRPSTQ--ATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
+ ++ T I + +FYYLSL+ IS+ + + TF + G GG IIDSG+
Sbjct: 261 VKASDSEIKTTPLIQNSAQPSFYYLSLEGISVGDTSLPIKKSTFSLQEDGSGGLIIDSGT 320
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP--ETFNRFPSMAFYFED 276
+TY + + ++F S + L + +++C+ LP T P + F+F+
Sbjct: 321 TITYLEQSAFDLVAKEFTS---QINLPVDNSGSTGLEVCFTLPSGSTDIEVPKLVFHFDG 377
Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
A+L + EN I D LA+ + ++ G+ QQ++ ++DL + LSF+ C
Sbjct: 378 ADLELPAENYMIADASMGVACLAMGSSSGM-SIFGNIQQQNMLVLHDLEKETLSFLPTQC 436
Query: 337 SD 338
+
Sbjct: 437 DE 438
>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
Length = 464
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 96/359 (26%), Positives = 153/359 (42%), Gaps = 47/359 (13%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
VR+ IG+P L++D+GS +I+ +FDP S++F + C C
Sbjct: 129 VRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLECYAQADPLFDPATSATFSAVPCGSAVCR 188
Query: 48 YFKCV----NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
+ + C Y + Y D S TKG A ET+++ G G GC + N G
Sbjct: 189 TLRTSGCGDSGGCDYEVSYGDGSYTKGALALETLTL-----GGTAVEGVAIGCGHRNRGL 243
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
A AG+LGL +S + QLG FSYCL + L G
Sbjct: 244 FVGA-----AGLLGLGWGPMSLVGQLGGAAGGAFSYCLA------SRGAGSLVLGRSEAV 292
Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
+ + +P +FYY+ L I + +ER+ D F +T G GG ++D+G+ +T
Sbjct: 293 PEGAVW-VPLVRNPQAPSFYYVGLSGIGVGDERLPLQEDLFQLTEDGAGGVVMDTGTAVT 351
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPE-TFNRFPSMAFYFED-A 277
+ Y L + FV+ + L P + CY L T R P+++FYF+ A
Sbjct: 352 RLPQEAYAALRDAFVA-----AVGALPRAPGVSLLDTCYDLSGYTSVRVPTVSFYFDGAA 406
Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
L + N +++ + + LA AP +++G+ QQ + D + F C
Sbjct: 407 TLTLPARN-LLLEVDGGIYCLAFAPSSSGPSILGNIQQEGIQITVDSANGYIGFGPTTC 464
>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
Length = 510
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 112/376 (29%), Positives = 171/376 (45%), Gaps = 54/376 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
++ +++GTP + +I+DTGS L + +FDP SSS++ + C C
Sbjct: 150 LIDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASSSYRNVTCGDQRC 209
Query: 47 TYF----------KCVNEQCVYTMKYADQSVTKGFAAHE--TISVIGKGEGKAIFHGALF 94
+ + C Y Y DQS T G A E T+++ G + + G +F
Sbjct: 210 GLVAPPEAPRACRRPAEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRV-DGVVF 268
Query: 95 GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY 154
GC + N G AG+LGL R +SF SQL ++ FSYCLV +G S
Sbjct: 269 GCGHRNRGLFH-----GAAGLLGLGRGPLSFASQLRAVYGHTFSYCLV---EHGSDAGSK 320
Query: 155 LKFGTD-MGYRRPSTQATKFI---NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
+ FG D + P + T F + + FYY+ LK + + + +N DT+D+ G G
Sbjct: 321 VVFGEDYLVLAHPQLKYTAFAPTSSPADTFYYVKLKGVLVGGDLLNISSDTWDVGKDGSG 380
Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL-----PETFN 265
G IIDSG+ L+YF Y + + FV R + D P + CY + PE
Sbjct: 381 GTIIDSGTTLSYFVEPAYQVIRQAFVDLMSRL-YPLIPDFPV-LNPCYNVSGVERPEV-- 436
Query: 266 RFPSMAFYFED-ANLRIDGENVFIIDYENHFFLLAV--APHDDLVALIGSQQQRDTRFVY 322
P ++ F D A EN F+ + LAV P + ++IG+ QQ++ VY
Sbjct: 437 --PELSLLFADGAVWDFPAENYFVRLDPDGIMCLAVRGTPRTGM-SIIGNFQQQNFHVVY 493
Query: 323 DLNIDLLSFVKENCSD 338
DL + L F C++
Sbjct: 494 DLQNNRLGFAPRRCAE 509
>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 102/361 (28%), Positives = 161/361 (44%), Gaps = 44/361 (12%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
RL +GTP++ V ++LDTGS +++ +F+P KS SF I C P C
Sbjct: 149 TRLGVGTPARYVFMVLDTGSDVVWIQCAPCKKCYSQTDPVFNPTKSRSFANIPCGSPLCR 208
Query: 48 YF---KCVNEQ--CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C ++ C+Y + Y D S T G + ET++ G G+ GC +DN G
Sbjct: 209 RLDSPGCSTKKHICLYQVSYGDGSFTYGEFSTETLTFRGTRVGRVAL-----GCGHDNEG 263
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
A R +SF SQ+G ++FSYCLV + SY+ FG
Sbjct: 264 LFIGAAGLLGL-----GRGRLSFPSQIGRRFSRKFSYCLVD--RSASSKPSYMVFGDSAI 316
Query: 163 YRRPSTQATKFINHP--NNFYYLSLKDISIDNERM-NFPPDTFDITVSGEGGCIIDSGSV 219
R + + T +++P + FYY+ L +S+ R+ F + +G GG IIDSG+
Sbjct: 317 SR--TARFTPLVSNPKLDTFYYVELLGVSVGGTRVPGITASLFKLDSTGNGGVIIDSGTS 374
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCYFLP-ETFNRFPSMAFYFED 276
+T Y L + F R + L PE C+ L +T + P++ +F
Sbjct: 375 VTRLTRPAYVALRDAF-----RVGASNLKRAPEFSLFDTCFDLSGKTEVKVPTVVLHFRG 429
Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
A++ + N I + F A A ++++G+ QQ+ R VYDL + F C
Sbjct: 430 ADVSLPASNYLIPVDNSGSFCFAFAGTMSGLSIVGNIQQQGFRVVYDLAASRVGFAPRGC 489
Query: 337 S 337
+
Sbjct: 490 A 490
>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
Length = 473
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 98/362 (27%), Positives = 149/362 (41%), Gaps = 47/362 (12%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
VR+ +G+P L++D+GS +I+ +FDP SSSF ++C C
Sbjct: 132 VRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGVSCGSAICR 191
Query: 48 YFKCVNE-------QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
+C Y++ Y D S TKG A ET+++ G G GC + N
Sbjct: 192 TLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTL-----GGTAVQGVAIGCGHRN 246
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
G A AG+LGL +S I QLG FSYCL G + L G
Sbjct: 247 SGLFVGA-----AGLLGLGWGAMSLIGQLGGAAGGVFSYCLA---SRGAGGAGSLVLGRT 298
Query: 161 MGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
+ N ++FYY+ L I + ER+ F +T G GG ++D+G+
Sbjct: 299 EAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDGLFQLTEDGAGGVVMDTGTA 358
Query: 220 LTYFHSDVYWKLHEKF---VSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYF- 274
+T + Y L F + R L D CY L + R P+++FYF
Sbjct: 359 VTRLPREAYAALRGAFDGAMGALPRSPAVSLLDT------CYDLSGYASVRVPTVSFYFD 412
Query: 275 EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
+ A L + N +++ F LA AP ++++G+ QQ + D + F
Sbjct: 413 QGAVLTLPARN-LLVEVGGAVFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPN 471
Query: 335 NC 336
C
Sbjct: 472 TC 473
>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
Length = 448
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 102/376 (27%), Positives = 162/376 (43%), Gaps = 61/376 (16%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDH- 43
++ L IGTP I DTGS LI+ +++P S++F + C+
Sbjct: 93 LMTLSIGTPPLSYPAIADTGSDLIWTQCAPCSGDQCFAQPAPLYNPASSTTFGVLPCNSS 152
Query: 44 --------------PDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIF 89
P C C+Y Y T G ET + +A
Sbjct: 153 LSMCAGVLAGKAPPPGCA--------CMYNQTYG-TGWTAGVQGSETFTFGSAAADQARV 203
Query: 90 HGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGE 149
G FGCSN + D + AG++GL R ++S +SQLG+ RFSYCL P +
Sbjct: 204 PGIAFGCSNASSS-DWNGS----AGLVGLGRGSLSLVSQLGA---GRFSYCLT-PFQDTN 254
Query: 150 YTSSYLKFGTDMGYRRPSTQATKFINHP-----NNFYYLSLKDISIDNERMNFPPDTFDI 204
TS+ L G ++T F+ P + +YYL+L IS+ + ++ PD F +
Sbjct: 255 STSTLL-LGPSAALNGTGVRSTPFVASPAKAPMSTYYYLNLTGISLGAKALSISPDAFSL 313
Query: 205 TVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF 264
G GG IIDSG+ +T + Y ++ V SD + LCY LP
Sbjct: 314 KADGTGGLIIDSGTTITSLVNAAYQQVRAA-VQSLVTLPAIDGSDS-TGLDLCYALPTPT 371
Query: 265 N---RFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFV 321
+ PSM +F+ A++ + ++ ++I + L D ++ G+ QQ++ +
Sbjct: 372 SAPPAMPSMTLHFDGADMVLPADS-YMISGSGVWCLAMRNQTDGAMSTFGNYQQQNMHIL 430
Query: 322 YDLNIDLLSFVKENCS 337
YD+ ++LSF CS
Sbjct: 431 YDVRNEMLSFAPAKCS 446
>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
Length = 455
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 109/387 (28%), Positives = 167/387 (43%), Gaps = 77/387 (19%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHPD 45
+ + +GTP +I+DTGS LI+A + P +SS+F ++ C+
Sbjct: 93 MNISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQPARSSTFSRLPCNGSF 152
Query: 46 CTYFKC--------VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
C Y C Y Y T G+ A ET++V G F FGCS
Sbjct: 153 CQYLPTSSRPRTCNATAACAYNYTYGS-GYTAGYLATETLTV-----GDGTFPKVAFGCS 206
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
+N G D + G++GL R +S +SQL RFSYCL + +G +S + F
Sbjct: 207 TEN-GVDNSS------GIVGLGRGPLSLVSQLA---VGRFSYCLRSDMADGG--ASPILF 254
Query: 158 GTDMGYRRPST-QATKFINHP----NNFYYLSLKDISIDNERMNFPPDTFDITVSG-EGG 211
G+ S Q+T + +P + YY++L I++D+ + TF T +G GG
Sbjct: 255 GSLAKLTEGSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLGGG 314
Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERF-QLAQLSDCPEPIQLCYFLPETFN----- 265
I+DSG+ LTY D Y + + F S Q S P + LCY P
Sbjct: 315 TIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCY-KPSAGGGGKAV 373
Query: 266 RFPSMAFYFEDANLRIDGENVFIIDYENHFF--------------LLAVAPHDDL-VALI 310
R P +A LR G + + +N+F LL + DDL +++I
Sbjct: 374 RVPRLA-------LRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDLPISII 426
Query: 311 GSQQQRDTRFVYDLNIDLLSFVKENCS 337
G+ Q D +YD++ + SF +C+
Sbjct: 427 GNLMQMDMHLLYDIDGGMFSFAPADCA 453
>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
Length = 449
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 109/381 (28%), Positives = 166/381 (43%), Gaps = 58/381 (15%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
M+ L IGTP +L I DTGS L + IFDP S++F K+ C C
Sbjct: 81 MMNLSIGTPPFPILAIADTGSDLTWLQSKPCDQCYPQKGPIFDPSNSTTFHKLPCTTAPC 140
Query: 47 TYF-----KCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
C + C YT Y D S T G+ A +T++V G FGC N
Sbjct: 141 NALDESARSCTDPTTCGYTYSYGDHSYTTGYLASDTVTV---GNASVQIRNVAFGCGTRN 197
Query: 101 HG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPL-------PNGEYTS 152
G FDE G LS F+SQLG I K+FSYCL +PL P+ +
Sbjct: 198 GGNFDEQGSGIVGLGGGNLS-----FVSQLGDTIGKKFSYCL-LPLENEISSQPSDSPAT 251
Query: 153 SYLKFGTDMGYRRPSTQA-----TKFIN-HPNNFYYLSLKDISIDNERMNFPP------- 199
S + FG + + ST T +N P+ +YYL+++ I++ +++ +
Sbjct: 252 SRIVFGDNPVFSSSSTNGVVFATTPLVNKEPSTYYYLTIEAITVGRKKLLYSSSSSKTAS 311
Query: 200 -DTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPI-QLC 257
D+ + EG IIDSG+ LT+ + Y L V E ++ +++D + LC
Sbjct: 312 YDSGSKSSVEEGNIIIDSGTTLTFLEEEFYGALEAALV---EEIKMERVNDVKNSMFSLC 368
Query: 258 YFLPETFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQR 316
+ + P M +F A++ + N F + E + P +D V + G+ Q
Sbjct: 369 FKSGKEEVELPLMKVHFRGGADVELKPVNTF-VRAEEGLVCFTMLPTND-VGIYGNLAQM 426
Query: 317 DTRFVYDLNIDLLSFVKENCS 337
+ YDL +SF+ +CS
Sbjct: 427 NFVVGYDLGKRTVSFLPADCS 447
>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 450
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 96/365 (26%), Positives = 159/365 (43%), Gaps = 51/365 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
+ RL +GTP+ ++++D+GS+L + ++DPR SS++ + C P
Sbjct: 109 ITRLGLGTPTTTYVMVVDSGSSLTWLQCAPCAVSCHPQAGPLYDPRASSTYAAVPCSAPQ 168
Query: 46 CTYFKCVNEQ---------CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
C + C Y Y D S + G+ + +T+S+ G F G +GC
Sbjct: 169 CAELQAATLNPSSCSGSGVCQYQASYGDGSFSFGYLSKDTVSLSSSGS----FPGFYYGC 224
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
DN G G AG++GL+R +S +SQL + F+YCL + ++ YL
Sbjct: 225 GQDNVGLF-----GRAAGLIGLARNKLSLLSQLAPSVGNSFAYCLPT---SAAASAGYLS 276
Query: 157 FGTDMGYRRPSTQA-TKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
FG++ + P + T ++ + Y++SL +S+ + P + G I
Sbjct: 277 FGSNSDNKNPGKYSYTSMVSSSLDASLYFVSLAGMSVAGSPLAVPSSEY-----GSLPTI 331
Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFY 273
IDSG+V+T + VY L + + S +Q C+ P++
Sbjct: 332 IDSGTVITRLPTPVYTALSKAVGAALAAPSAPAYSI----LQTCFKGQVAKLPVPAVNMA 387
Query: 274 FE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
F A LR+ NV ++D LA AP D A+IG+ QQ+ VYD+ + F
Sbjct: 388 FAGGATLRLTPGNV-LVDVNETTTCLAFAPTDS-TAIIGNTQQQTFSVVYDVKGSRIGFA 445
Query: 333 KENCS 337
CS
Sbjct: 446 AGGCS 450
>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
Length = 473
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 97/362 (26%), Positives = 149/362 (41%), Gaps = 47/362 (12%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
VR+ +G+P L++D+GS +I+ +FDP SSSF ++C C
Sbjct: 132 VRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGVSCGSAICR 191
Query: 48 YFKCVNE-------QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
+C Y++ Y D S TKG A ET+++ G G GC + N
Sbjct: 192 TLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTL-----GGTAVQGVAIGCGHRN 246
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
G A AG+LGL +S + QLG FSYCL G + L G
Sbjct: 247 SGLFVGA-----AGLLGLGWGAMSLVGQLGGAAGGVFSYCLA---SRGAGGAGSLVLGRT 298
Query: 161 MGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
+ N ++FYY+ L I + ER+ F +T G GG ++D+G+
Sbjct: 299 EAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTA 358
Query: 220 LTYFHSDVYWKLHEKF---VSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYF- 274
+T + Y L F + R L D CY L + R P+++FYF
Sbjct: 359 VTRLPREAYAALRGAFDGAMGALPRSPAVSLLDT------CYDLSGYASVRVPTVSFYFD 412
Query: 275 EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
+ A L + N +++ F LA AP ++++G+ QQ + D + F
Sbjct: 413 QGAVLTLPARN-LLVEVGGAVFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPN 471
Query: 335 NC 336
C
Sbjct: 472 TC 473
>gi|357117301|ref|XP_003560410.1| PREDICTED: uncharacterized protein LOC100833752 [Brachypodium
distachyon]
Length = 473
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 96/339 (28%), Positives = 154/339 (45%), Gaps = 35/339 (10%)
Query: 27 IFDPRKSSSFQKIN------CDHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVI 80
+FDP KS +F+ ++ C P Y + +C + + Y + + G+ A +T S
Sbjct: 144 VFDPAKSPTFRPVSGHNAVLCRPP---YHPLQDGRCGFGIAYRNGASAAGYLARDTFSFP 200
Query: 81 GKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLS-----RVTISFISQLGSIIKK 135
G +FGC+N FD GALAGVLG+ + F+ QL
Sbjct: 201 TGDNNFQHLPGIVFGCANRIARFDTH---GALAGVLGMGMGAEGKPLTGFMRQLYHNGGG 257
Query: 136 RFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPS-----TQATKFINHPNNFYYLSLKDISI 190
RFSYC ++P G S+L+FG D+ + P+ + A + YY+ L IS+
Sbjct: 258 RFSYCPIVP---GTTAYSFLRFGNDIPSQPPAGVHRQSMAVLAPTTTSEAYYVKLAGISV 314
Query: 191 DNERM-NFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSD 249
R+ P+ F+ G GGC ID G+ +T Y + + +R + A+
Sbjct: 315 GALRVPGVTPEMFERDQHGRGGCAIDIGTKMTAIVQTAYAHVEAAVRGHLQRNR-ARFVQ 373
Query: 250 CPEPIQLCYFLPETFNRFPSMAFYFEDAN-LRIDGENVFII----DYENHFFLLAVAPHD 304
P + P R PSM +F LR+ +++F++ + L + P D
Sbjct: 374 SPGHHLCVHRTPAIEERLPSMTLHFVGGPWLRVKPQHLFLVVGSPTGGGEYLCLGLVP-D 432
Query: 305 DLVALIGSQQQRDTRFVYDL--NIDLLSFVKENCSDDSA 341
+ +IG+ QQ DTRF++DL NI ++SF E+C D+
Sbjct: 433 AEMTVIGAMQQIDTRFIFDLHNNIPIVSFNPEDCHLDAG 471
>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
Length = 445
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 109/368 (29%), Positives = 166/368 (45%), Gaps = 47/368 (12%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
+R+ IGTP VL+I DTGS LI+ IF+P++SS+++++ C+ C
Sbjct: 96 MRISIGTPPIEVLVIADTGSDLIWVQCQPCQECYKQKSPIFNPKQSSTYRRVLCETRYCN 155
Query: 48 YFK-----CVN----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSN 98
C + C Y+ Y D S T G+ A E + G FGC N
Sbjct: 156 ALNSDMRACSAHGFFKACGYSYSYGDHSFTMGYLATERFII---GSTNNSIQELAFGCGN 212
Query: 99 DNHG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
N G FDE ++S ISQLG+ I +FSYCLV L ++ + F
Sbjct: 213 SNGGNFDEVGSGIVGL-----GGGSLSLISQLGTKIDNKFSYCLVPILEKSNFSLGKIVF 267
Query: 158 GTDMGYRRPSTQATKFI--NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
G + T + + P FYYL+L+ IS+ NER+ + D V +G IID
Sbjct: 268 GDNSFISGSDTYVSTPLVSKEPETFYYLTLEAISVGNERLAYENSRNDGNVE-KGNIIID 326
Query: 216 SGSVLTYFHSDVYWKLH---EKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAF 272
SG+ LT+ S +Y KL EK V + ++SD +C F + P +
Sbjct: 327 SGTTLTFLDSKLYNKLELVLEKAV------EGERVSDPNGIFSIC-FRDKIGIELPIITV 379
Query: 273 YFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
+F DA++ + N F E + P + +A+ G+ Q + YDL+ + +SF+
Sbjct: 380 HFTDADVELKPINTF-AKAEEDLLCFTMIPSNG-IAIFGNLAQMNFLVGYDLDKNCVSFM 437
Query: 333 KENCSDDS 340
+CS S
Sbjct: 438 PTDCSGHS 445
>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
Length = 451
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 108/378 (28%), Positives = 162/378 (42%), Gaps = 55/378 (14%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPDC 46
V L IG P + +LLI DTGS L++ +F PR SS+F +C P C
Sbjct: 85 VDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDPVC 144
Query: 47 TYF-------KC----VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFG 95
+C ++ C Y YAD S+T G A ET S+ +A FG
Sbjct: 145 RLVPKPGRAPRCNHTRIHSTCPYEYGYADGSLTSGLFARETTSLKTSSGKEAKLKSVAFG 204
Query: 96 CSNDNHG--FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT-- 151
C G + +GA GV+GL R ISF SQLG +FSYCL+ +YT
Sbjct: 205 CGFRISGQSVSGTSFNGA-NGVMGLGRGPISFASQLGRRFGNKFSYCLM------DYTLS 257
Query: 152 ---SSYLKFGTDMGYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITV 206
+SYL G D G T + +P FYY+ LK + ++ ++ P ++I
Sbjct: 258 PPPTSYLIIG-DGGDAVSKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDD 316
Query: 207 SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL-----P 261
SG GG ++DSG+ L + Y + + +R +L + LC + P
Sbjct: 317 SGNGGTVMDSGTTLAFLADPAYRLV---IAAVKQRIKLPNADELTPGFDLCVNVSGVTKP 373
Query: 262 ETFNRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLV--ALIGSQQQRDTR 319
E P + F F + + + I+ E LA+ D V ++IG+ Q+
Sbjct: 374 EKI--LPRLKFEFSGGAVFVPPPRNYFIETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFL 431
Query: 320 FVYDLNIDLLSFVKENCS 337
F +D + L F + C+
Sbjct: 432 FEFDRDRSRLGFSRRGCA 449
>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
Length = 448
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 110/373 (29%), Positives = 168/373 (45%), Gaps = 48/373 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDC 46
+V + IGTP + V LILDTGS L + F+P +S +F + CD C
Sbjct: 86 LVHMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLRIC 145
Query: 47 ---TYFKCVNEQ-----CVYTMKYADQSVTKGFAAHETISVIGKGE--GKAIFHGALFGC 96
T+ C + CVY YAD S+T G +T S G A FGC
Sbjct: 146 RDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGC 205
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
N+G G+ G SR +S +QL FSYC + E + +L
Sbjct: 206 GLFNNGIFVSNE----TGIAGFSRGALSMPAQLK---VDNFSYCFTA-ITGSEPSPVFLG 257
Query: 157 FGTDM-----GYRRPSTQATKFINHPNN---FYYLSLKDISIDNERMNFPPDTFDITVSG 208
++ G Q+T I + ++ YY+SLK +++ R+ P F + G
Sbjct: 258 VPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDG 317
Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-F 267
GG I+DSG+ +T VY + + FV+ + +L + QLC+ +P
Sbjct: 318 TGGTIVDSGTGMTMLPEAVYNLVCDAFVA---QTKLTVHNSTSSLSQLCFSVPPGAKPDV 374
Query: 268 PSMAFYFEDANLRIDGEN-VFIIDYENHFFL--LAVAPHDDLVALIGSQQQRDTRFVYDL 324
P++ +FE A L + EN +F I+ L LA+ +DL ++IG+ QQ++ +YDL
Sbjct: 375 PALVLHFEGATLDLPRENYMFEIEEAGGIRLTCLAINAGEDL-SVIGNFQQQNMHVLYDL 433
Query: 325 NIDLLSFVKENCS 337
D+LSFV C+
Sbjct: 434 ANDMLSFVPARCN 446
>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 486
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 109/364 (29%), Positives = 164/364 (45%), Gaps = 50/364 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA-----------------IFDPRKSSSFQKINCDH 43
+V + +GTP++ LI DTGS L + +FDP KSS++ ++C
Sbjct: 145 VVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAAVHCGE 204
Query: 44 PDCTYFKCV----NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
P C + N C+Y ++Y D S T G + +T+++ G FGC
Sbjct: 205 PQCAAAGDLCSEDNTTCLYLVRYGDGSSTTGVLSRDTLALTSS----RALTGFPFGCGTR 260
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
N G G + G+LGL R +S SQ + FSYCL P+ T+ YL G
Sbjct: 261 NLG-----DFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCL----PSSNSTTGYLTIGA 311
Query: 160 DMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
+ Q T + P +FY++ L I I + PP F GG ++DSG
Sbjct: 312 TPATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYVLPVPPAVFT-----RGGTLLDSG 366
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFED 276
+VLTY + Y L ++F ER+ A +D + CY F E+ P+++F F D
Sbjct: 367 TVLTYLPAQAYALLRDRFRLTMERYTPAPPNDV---LDACYDFAGESEVVVPAVSFRFGD 423
Query: 277 -ANLRIDGENVFIIDYENHFFLLAVAPHDD---LVALIGSQQQRDTRFVYDLNIDLLSFV 332
A +D V I EN LA A D +++IG+ QQR +YD+ + + FV
Sbjct: 424 GAVFELDFFGVMIFLDEN-VGCLAFAAMDTGGLPLSIIGNTQQRSAEVIYDVAAEKIGFV 482
Query: 333 KENC 336
+C
Sbjct: 483 PASC 486
>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
Length = 474
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 110/373 (29%), Positives = 168/373 (45%), Gaps = 48/373 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDC 46
+V + IGTP + V LILDTGS L + F+P +S +F + CD C
Sbjct: 112 LVHMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLRIC 171
Query: 47 ---TYFKCVNEQ-----CVYTMKYADQSVTKGFAAHETISVIGKGE--GKAIFHGALFGC 96
T+ C + CVY YAD S+T G +T S G A FGC
Sbjct: 172 RDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGC 231
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
N+G G+ G SR +S +QL FSYC + E + +L
Sbjct: 232 GLFNNGIFVSNE----TGIAGFSRGALSMPAQLK---VDNFSYCFTA-ITGSEPSPVFLG 283
Query: 157 FGTDM-----GYRRPSTQATKFINHPNN---FYYLSLKDISIDNERMNFPPDTFDITVSG 208
++ G Q+T I + ++ YY+SLK +++ R+ P F + G
Sbjct: 284 VPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDG 343
Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-F 267
GG I+DSG+ +T VY + + FV+ + +L + QLC+ +P
Sbjct: 344 TGGTIVDSGTGMTMLPEAVYNLVCDAFVA---QTKLTVHNSTSSLSQLCFSVPPGAKPDV 400
Query: 268 PSMAFYFEDANLRIDGEN-VFIIDYENHFFL--LAVAPHDDLVALIGSQQQRDTRFVYDL 324
P++ +FE A L + EN +F I+ L LA+ +DL ++IG+ QQ++ +YDL
Sbjct: 401 PALVLHFEGATLDLPRENYMFEIEEAGGIRLTCLAINAGEDL-SVIGNFQQQNMHVLYDL 459
Query: 325 NIDLLSFVKENCS 337
D+LSFV C+
Sbjct: 460 ANDMLSFVPARCN 472
>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
Length = 448
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 103/366 (28%), Positives = 158/366 (43%), Gaps = 45/366 (12%)
Query: 6 IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT---- 47
+G P L+++DTGS LI+ ++DPR SS+ ++I C P C
Sbjct: 94 VGDPPTRALVVIDTGSDLIWLQCVPCRHCYRQVTPLYDPRSSSTHRRIPCASPRCRDVLR 153
Query: 48 YFKC--VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
Y C CVY + Y D S + G A + + H GC +DN G E
Sbjct: 154 YPGCDARTGGCVYMVVYGDGSASSGDLATDRLVF----PDDTHVHNVTLGCGHDNVGLLE 209
Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
A AG+LG+ R +SF +QL FSYCL L + SSYL FG
Sbjct: 210 SA-----AGLLGVGRGQLSFPTQLAPAYGHVFSYCLGDRLSRAQNGSSYLVFGRTP--EP 262
Query: 166 PSTQATKFINHPN--NFYYLSLKDISIDNERM-NFPPDTFDIT-VSGEGGCIIDSGSVLT 221
PST T +P + YY+ + S+ ER+ F + + +G GG ++DSG+ ++
Sbjct: 263 PSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVTGFSNASLALNPATGRGGIVVDSGTAIS 322
Query: 222 YFHSDVYWKLHEKFVSYFERF-QLAQLSDCPEPIQLCYFL-----PETFNRFPSMAFYFE 275
F D Y + + F S+ + +L+ CY L P R PS+ +F
Sbjct: 323 RFARDAYAAVRDAFDSHAAAAGTMRKLATKFSVFDACYDLRGNGAPAAAVRVPSIVLHFA 382
Query: 276 -DANLRIDGENVFII---DYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSF 331
A++ + N I +F L + DD + ++G+ QQ+ V+D+ + F
Sbjct: 383 GGADMALPQANYLIPVQGGDRRTYFCLGLQAADDGLNVLGNVQQQGFGLVFDVERGRIGF 442
Query: 332 VKENCS 337
CS
Sbjct: 443 TPNGCS 448
>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 514
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 104/372 (27%), Positives = 165/372 (44%), Gaps = 45/372 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
+V L++GTP + +I+DTGS L + +FDP S S++ + C P C
Sbjct: 153 LVDLYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASLSYRNVTCGDPRC 212
Query: 47 TYF----------KCVNEQCVYTMKYADQSVTKGFAAHE--TISVIGKGEGKAIFHGALF 94
+ ++ C Y Y DQS T G A E T+++ G + + +F
Sbjct: 213 GLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAPGASRRV-DDVVF 271
Query: 95 GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY 154
GC + N G A R +SF SQL ++ FSYCLV +G S
Sbjct: 272 GCGHSNRGLFHGAAGLLGL-----GRGALSFASQLRAVYGHAFSYCLV---DHGSSVGSK 323
Query: 155 LKFGTD---MGYRRP--STQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
+ FG D +G+ R + A + FYY+ LK + + E++N P T+D+ G
Sbjct: 324 IVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVGKDGS 383
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFP 268
GG IIDSG+ L+YF Y + FV ++ ++D P + CY + P
Sbjct: 384 GGTIIDSGTTLSYFAEPAYEVIRRAFVERMDK-AYPLVADFPV-LSPCYNVSGVERVEVP 441
Query: 269 SMAFYFED-ANLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNI 326
+ F D A EN F+ +D + L + +++IG+ QQ++ +YDL
Sbjct: 442 EFSLLFADGAVWDFPAENYFVRLDPDGIMCLAVLGTPRSAMSIIGNFQQQNFHVLYDLQN 501
Query: 327 DLLSFVKENCSD 338
+ L F C++
Sbjct: 502 NRLGFAPRRCAE 513
>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
Length = 474
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 110/373 (29%), Positives = 168/373 (45%), Gaps = 48/373 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDC 46
+V + IGTP + V LILDTGS L + F+P +S +F + CD C
Sbjct: 112 LVHMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLRIC 171
Query: 47 ---TYFKCVNEQ-----CVYTMKYADQSVTKGFAAHETISVIGKGE--GKAIFHGALFGC 96
T+ C + CVY YAD S+T G +T S G A FGC
Sbjct: 172 RDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGC 231
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
N+G G+ G SR +S +QL FSYC + E + +L
Sbjct: 232 GLFNNGIFVSNE----TGIAGFSRGALSMPAQLK---VDNFSYCFTA-ITGSEPSPVFLG 283
Query: 157 FGTDM-----GYRRPSTQATKFINHPNN---FYYLSLKDISIDNERMNFPPDTFDITVSG 208
++ G Q+T I + ++ YY+SLK +++ R+ P F + G
Sbjct: 284 VPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDG 343
Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-F 267
GG I+DSG+ +T VY + + FV+ + +L + QLC+ +P
Sbjct: 344 TGGTIVDSGTGMTMLPEAVYNLVCDAFVA---QTKLTVHNSTSSLSQLCFSVPPGAKPDV 400
Query: 268 PSMAFYFEDANLRIDGEN-VFIIDYENHFFL--LAVAPHDDLVALIGSQQQRDTRFVYDL 324
P++ +FE A L + EN +F I+ L LA+ +DL ++IG+ QQ++ +YDL
Sbjct: 401 PALVLHFEGATLDLPRENYMFEIEEAGGIRLTCLAINAGEDL-SVIGNFQQQNMHVLYDL 459
Query: 325 NIDLLSFVKENCS 337
D+LSFV C+
Sbjct: 460 ANDMLSFVPARCN 472
>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
Length = 514
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 104/372 (27%), Positives = 165/372 (44%), Gaps = 45/372 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
+V L++GTP + +I+DTGS L + +FDP S S++ + C P C
Sbjct: 153 LVDLYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPATSLSYRNVTCGDPRC 212
Query: 47 TYF----------KCVNEQCVYTMKYADQSVTKGFAAHE--TISVIGKGEGKAIFHGALF 94
+ ++ C Y Y DQS T G A E T+++ G + + +F
Sbjct: 213 GLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAPGASRRV-DDVVF 271
Query: 95 GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY 154
GC + N G A R +SF SQL ++ FSYCLV +G S
Sbjct: 272 GCGHSNRGLFHGAAGLLGL-----GRGALSFASQLRAVYGHAFSYCLV---DHGSSVGSK 323
Query: 155 LKFGTD---MGYRRP--STQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
+ FG D +G+ R + A + FYY+ LK + + E++N P T+D+ G
Sbjct: 324 IVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVGKDGS 383
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFP 268
GG IIDSG+ L+YF Y + FV ++ ++D P + CY + P
Sbjct: 384 GGTIIDSGTTLSYFAEPAYEVIRRAFVERMDK-AYPLVADFPV-LSPCYNVSGVERVEVP 441
Query: 269 SMAFYFED-ANLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNI 326
+ F D A EN F+ +D + L + +++IG+ QQ++ +YDL
Sbjct: 442 EFSLLFADGAVWDFPAENYFVRLDPDGIMCLAVLGTPRSAMSIIGNFQQQNFHVLYDLQN 501
Query: 327 DLLSFVKENCSD 338
+ L F C++
Sbjct: 502 NRLGFAPRRCAE 513
>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
Length = 506
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 107/361 (29%), Positives = 160/361 (44%), Gaps = 50/361 (13%)
Query: 3 RLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTY 48
R+ IG+P++ + ++LDTGS + + +FDP S+S+ ++CD C
Sbjct: 169 RVGIGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSQRCRD 228
Query: 49 F---KCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
C N C+Y + Y D S T G A ET+++ G+ + + A+ GC +DN G
Sbjct: 229 LDTAACRNATGACLYEVAYGDGSYTVGDFATETLTL---GDSTPVGNVAI-GCGHDNEGL 284
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
A G LS F SQ I FSYCLV +S L+FG G
Sbjct: 285 FVGAAGLLALGGGPLS-----FPSQ---ISASTFSYCLV---DRDSPAASTLQFGD--GA 331
Query: 164 RRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDI-TVSGEGGCIIDSGSVL 220
T + P + FYY++L IS+ + ++ P F + SG GG I+DSG+ +
Sbjct: 332 AEAGTVTAPLVRSPRTSTFYYVALSGISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAV 391
Query: 221 TYFHSDVYWKLHEKFVS---YFERFQLAQLSDCPEPIQLCYFLPE-TFNRFPSMAFYFED 276
T S Y L + FV R L D CY L + T P+++ FE
Sbjct: 392 TRLQSAAYAALRDAFVQGAPSLPRTSGVSLFD------TCYDLSDRTSVEVPAVSLRFEG 445
Query: 277 AN-LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
LR+ +N I + LA AP + V++IG+ QQ+ TR +D + F
Sbjct: 446 GGALRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTARGAVGFTPNK 505
Query: 336 C 336
C
Sbjct: 506 C 506
>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
Length = 488
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 106/365 (29%), Positives = 166/365 (45%), Gaps = 52/365 (14%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
RL +GTP++ V ++LDTGS +++ +FDP KS SF I C P C
Sbjct: 147 TRLGVGTPARYVYMVLDTGSDIVWIQCAPCIKCYSQTDPVFDPTKSRSFANIPCGSPLCR 206
Query: 48 ---YFKCVNEQ--CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
Y C ++ C+Y + Y D S T G + ET++ G G+ + GC +DN G
Sbjct: 207 RLDYPGCSTKKQICLYQVSYGDGSFTVGEFSTETLTFRGTRVGRVVL-----GCGHDNEG 261
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS----SYLKFG 158
A R +SF SQ+G +FSYCL G+ ++ S + FG
Sbjct: 262 LFVGAAGLLGL-----GRGRLSFPSQIGRRFNSKFSYCL------GDRSASSRPSSIVFG 310
Query: 159 TDMGYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMN-FPPDTFDITVSGEGGCIID 215
D R +T+ T +++P + FYY+ L IS+ R++ F + +G GG IID
Sbjct: 311 -DSAISR-TTRFTPLLSNPKLDTFYYVELLGISVGGTRVSGISASLFKLDSTGNGGVIID 368
Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCYFLP-ETFNRFPSMAF 272
SG+ +T Y L + F+ + L PE C+ L +T + P++
Sbjct: 369 SGTSVTRLTRAAYVALRDAFL-----VGASNLKRAPEFSLFDTCFDLSGKTEVKVPTVVL 423
Query: 273 YFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
+F A++ + N I + F A A +++IG+ QQ+ R VYDL + F
Sbjct: 424 HFRGADVPLPASNYLIPVDNSGSFCFAFAGTASGLSIIGNIQQQGFRVVYDLATSRVGFA 483
Query: 333 KENCS 337
C+
Sbjct: 484 PRGCA 488
>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 494
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 106/366 (28%), Positives = 161/366 (43%), Gaps = 46/366 (12%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
++ +GTP+ L++LDTGS +++ +FDPR+S S+ + C P C
Sbjct: 144 TKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYDQSGQVFDPRRSRSYGAVGCSAPLCR 203
Query: 48 YFKC-----VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
+ C+Y + Y D SVT G A ET++ G G + AL GC +DN G
Sbjct: 204 RLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFAG---GARVARIAL-GCGHDNEG 259
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV--IPLPNGEYTSSYLKFGTD 160
A R ++SF +Q+ + FSYCLV N SS + FG+
Sbjct: 260 LFVAAAGLLGL-----GRGSLSFPAQISRRYGRSFSYCLVDRTSSANPASHSSTVTFGSG 314
Query: 161 MGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITV---SGEGGCIID 215
+ T + +P FYY+ L IS+ R++ D+ D+ + SG GG I+D
Sbjct: 315 AVGSTVAASFTPMVKNPRMETFYYVQLVGISVGGARVSGVADS-DLRLDPSSGRGGVIVD 373
Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL---CYFLP-ETFNRFPSMA 271
SG+ +T Y L + F R A L P L CY L + P+++
Sbjct: 374 SGTSVTRLARPAYSALRDAF-----RAAAAGLRLSPGGFSLFDTCYDLSGRKVVKVPTVS 428
Query: 272 FYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
+F A + EN I F A A D V++IG+ QQ+ R V+D + +
Sbjct: 429 MHFAGGAEAALPPENYLIPVDSKGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVG 488
Query: 331 FVKENC 336
FV + C
Sbjct: 489 FVPKGC 494
>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
Length = 445
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 97/376 (25%), Positives = 162/376 (43%), Gaps = 58/376 (15%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDH-- 43
++ L IGTP I DTGS LI+ +++P S++F + C+
Sbjct: 87 LMTLAIGTPPVSYQAIADTGSDLIWTQCAPCSSQCFQQPTPLYNPSSSTTFAVLPCNSSL 146
Query: 44 -------------PDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAI-F 89
P CT C+Y M Y T + ET +
Sbjct: 147 SMCAAALAGTTPPPGCT--------CMYNMTYGS-GWTSVYQGSETFTFGSSTPANQTGV 197
Query: 90 HGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGE 149
G FGCSN + GF+ + +G++GL R ++S +SQLG +FSYCL P +
Sbjct: 198 PGIAFGCSNASGGFNTSSA----SGLVGLGRGSLSLVSQLG---VPKFSYCLT-PYQDTN 249
Query: 150 YTSSYLKFGTDMGYRRPSTQATKFINHPNN-----FYYLSLKDISIDNERMNFPPDTFDI 204
TS+ L + +T F+ P++ +YYL+L IS+ ++ P +
Sbjct: 250 STSTLLLGPSASLNDTGGVSSTPFVASPSDAPMSTYYYLNLTGISLGTTALSIPTTALSL 309
Query: 205 TVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF 264
G GG IIDSG+ +T + Y ++ VS + LC+ LP +
Sbjct: 310 KADGTGGFIIDSGTTITLLGNTAYQQVRAAVVSLVT-LPTTDGGSAATGLDLCFELPSST 368
Query: 265 N---RFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFV 321
+ PSM +F+ A++ + ++ ++D N + L D V+++G+ QQ++ +
Sbjct: 369 SAPPTMPSMTLHFDGADMVLPADSYMMLD-SNLWCLAMQNQTDGGVSILGNYQQQNMHIL 427
Query: 322 YDLNIDLLSFVKENCS 337
YD+ + L+F CS
Sbjct: 428 YDVGQETLTFAPAKCS 443
>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 492
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 107/359 (29%), Positives = 155/359 (43%), Gaps = 51/359 (14%)
Query: 3 RLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTY 48
R+ +G PSK ++LDTGS + + IFDP SSS+ + CD C
Sbjct: 160 RVGVGQPSKPFYMVLDTGSDVNWLQCKPCSDCYQQSDPIFDPTASSSYNPLTCDAQQCQD 219
Query: 49 FK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
+ C N +C+Y + Y D S T G ET+S G + GC +DN G
Sbjct: 220 LEMSACRNGKCLYQVSYGDGSFTVGEYVTETVSF-----GAGSVNRVAIGCGHDNEGL-- 272
Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
G GL + +S I FSYCLV SS L+F + R
Sbjct: 273 ------FVGSAGLLGLGGGPLSLTSQIKATSFSYCLV---DRDSGKSSTLEFNSP----R 319
Query: 166 P--STQATKFINHP-NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
P S A N N FYY+ L +S+ E + PP+TF + SG GG I+DSG+ +T
Sbjct: 320 PGDSVVAPLLKNQKVNTFYYVELTGVSVGGEIVTVPPETFAVDQSGAGGVIVDSGTAITR 379
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL---CYFLPETFN-RFPSMAFYFE-DA 277
+ Y + + F + A E + L CY L + R P+++F+F D
Sbjct: 380 LRTQAYNSVRDAFKRKTSNLRPA------EGVALFDTCYDLSSLQSVRVPTVSFHFSGDR 433
Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ +N I + A AP +++IG+ QQ+ TR +DL L+ F C
Sbjct: 434 AWALPAKNYLIPVDGAGTYCFAFAPTTSSMSIIGNVQQQGTRVSFDLANSLVGFSPNKC 492
>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 527
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 109/369 (29%), Positives = 159/369 (43%), Gaps = 40/369 (10%)
Query: 4 LFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYF 49
+ +GTP K LILDTGS L + +DP+ S+SF+ I C+ P C+
Sbjct: 164 VLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNGMFYDPKTSASFKNITCNDPRCSLI 223
Query: 50 -------KCV--NEQCVYTMKYADQSVTKGFAAHETISV----IGKGEGKAIFHGALFGC 96
+C N+ C Y Y D+S T G A ET +V G + +FGC
Sbjct: 224 SSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGGSSEYKVGNMMFGC 283
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
+ N G A R +SF SQL S+ FSYCLV N SS L
Sbjct: 284 GHWNRGLFSGASGLLGL-----GRGPLSFSSQLQSLYGHSFSYCLVDRNSNTN-VSSKLI 337
Query: 157 FGTDMGY-RRPSTQATKFINHPNN----FYYLSLKDISIDNERMNFPPDTFDITVSGEGG 211
FG D + T F+N N FYY+ +K I + + ++ P +T++I+ G+GG
Sbjct: 338 FGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGKALDIPEETWNISSDGDGG 397
Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYF-ERFQLAQLSDCPEPIQLCYFLPETFNRFPSM 270
IIDSG+ L+YF Y + KF E + + + +P + E P +
Sbjct: 398 TIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDFPVLDPCFNVSGIEENNIHLPEL 457
Query: 271 AFYFEDANL-RIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
F D + EN FI E+ L + ++IG+ QQ++ +YD L
Sbjct: 458 GIAFVDGTVWNFPAENSFIWLSEDLVCLAILGTPKSTFSIIGNYQQQNFHILYDTKRSRL 517
Query: 330 SFVKENCSD 338
F C+D
Sbjct: 518 GFTPTKCAD 526
>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 557
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 115/372 (30%), Positives = 160/372 (43%), Gaps = 48/372 (12%)
Query: 4 LFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYF 49
+FIGTP + LILDTGS L + +DP++SSSF+ I C P C
Sbjct: 196 VFIGTPPRHFSLILDTGSDLNWIQCVPCYDCFVQNGPYYDPKESSSFKNIGCHDPRCHLV 255
Query: 50 ---------KCVNEQCVYTMKYADQSVTKGFAAHETISV-----IGKGEGKAIFHGALFG 95
K N+ C Y Y D S T G A ET +V GK E K + +FG
Sbjct: 256 SSPDPPQPCKAENQTCPYFYWYGDSSNTTGDFALETFTVNLTSPAGKSEFKRV-ENVMFG 314
Query: 96 CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYL 155
C + N G A R +SF SQL S+ FSYCLV + SS L
Sbjct: 315 CGHWNRGLFHGAAGLLGL-----GRGPLSFSSQLQSLYGHSFSYCLV-DRNSDTNVSSKL 368
Query: 156 KFGTDMGY-RRPSTQATKFI----NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
FG D P T + N + FYY+ +K I + E + P +T+ ++ G G
Sbjct: 369 IFGEDKDLLNHPEVNFTSLVAGKENPVDTFYYVQIKSIMVGGEVLKIPEETWHLSPEGAG 428
Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPS 269
G I+DSG+ L+YF Y + + FV + + + + D P + CY + P
Sbjct: 429 GTIVDSGTTLSYFAEPSYEIIKDAFVKKVKGYPV--IKDFPI-LDPCYNVSGVEKMELPE 485
Query: 270 MAFYFED-ANLRIDGENVFIIDYENHFFLLAV--APHDDLVALIGSQQQRDTRFVYDLNI 326
FED A EN FI LA+ P L ++IG+ QQ++ +YD
Sbjct: 486 FRILFEDGAVWNFPVENYFIKLEPEEIVCLAILGTPRSAL-SIIGNYQQQNFHILYDTKK 544
Query: 327 DLLSFVKENCSD 338
L + C+D
Sbjct: 545 SRLGYAPMKCAD 556
>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 395
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 107/355 (30%), Positives = 160/355 (45%), Gaps = 47/355 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+++L IGTP + +LDTGS I+ IFDP KSS+F++I CD D
Sbjct: 66 LMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFKEIRCDTHD- 124
Query: 47 TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
C Y + Y +S TKG ET+++ + + GC +N GF
Sbjct: 125 -------HSCPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVMPETIIGCGRNNSGFKP- 176
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
AGV+GL R S I+Q+G SYC G+ TS + FG +
Sbjct: 177 ----GFAGVVGLDRGPKSLITQMGGEYPGLMSYCFA-----GKGTSK-INFGANAIVAGD 226
Query: 167 STQATK-FINHPN-NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
+T F+ FYYL+L +S+ N R+ F + +G +IDSGS LTYF
Sbjct: 227 GVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFH---ALKGNIVIDSGSTLTYF- 282
Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE-DANLRIDG 283
+ Y L K V Q+ P LCY+ +T + FP + +F A+L +D
Sbjct: 283 PESYCNLVRKAVE-----QVVTAVRFPRSDILCYY-SKTIDIFPVITMHFSGGADLVLDK 336
Query: 284 ENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
N+++ F LA+ + + A+ G++ Q + YD + L+SF NCS
Sbjct: 337 YNMYVASNTGGVFCLAIICNSPIEEAIFGNRAQNNFLVGYDSSSLLVSFKPTNCS 391
>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
Length = 451
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 103/370 (27%), Positives = 166/370 (44%), Gaps = 43/370 (11%)
Query: 1 MVRLFIGTP-SKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPD 45
++ IGTP + V L +DTGS L++ +FDP SS+F+ + C P
Sbjct: 88 LIHFNIGTPRPQRVALTMDTGSDLVWTQCTPCPVCFDQPFPLFDPSVSSTFRAVACPDPI 147
Query: 46 C------TYFKCVNE--QCVYTMKYADQSVTKGFAAHETISVI---GKGEGKAIFHGALF 94
C + C + +C Y Y D+S+T G+ +T + + G+G G F
Sbjct: 148 CRPSSGLSVSACALKTFRCFYLCSYGDKSITAGYIFKDTFTFMSPNGEGAPPVAVSGLAF 207
Query: 95 GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY 154
GC + N G +G+ G R +S SQL RFSYCL +S
Sbjct: 208 GCGDYNTGVFASNE----SGIAGFGRGPLSLPSQL---RVGRFSYCLTSHDETESNKTSA 260
Query: 155 LKFGTDM-GYRRPST---QATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSG 208
+ GT G R S+ ++T I+ P+ FYYLSL+ I++ R+ F + G
Sbjct: 261 VFLGTPPNGLRAHSSGPFRSTPIIHSPSFPTFYYLSLEGITVGKTRLPVDSSVFALKKDG 320
Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-- 266
GG +IDSG+ +T F + V+ +L +FV+ + S+ LC+ P+ +
Sbjct: 321 SGGTVIDSGTGVTTFPAAVFEQLKNEFVAQLPLPRYDNTSEVGN--LLCFQRPKGGKQVP 378
Query: 267 FPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNI 326
P + F+ A++ + EN D ++ L + + + LIG+ QQ++ VYD+
Sbjct: 379 VPKLIFHLASADMDLPRENYIPEDTDSGVMCLMINGAEVDMVLIGNFQQQNMHIVYDVEN 438
Query: 327 DLLSFVKENC 336
L F C
Sbjct: 439 SKLLFASAQC 448
>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 389
Score = 121 bits (304), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 107/355 (30%), Positives = 160/355 (45%), Gaps = 47/355 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+++L IGTP + +LDTGS I+ IFDP KSS+F++I CD D
Sbjct: 60 LMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFKEIRCDTHD- 118
Query: 47 TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
C Y + Y +S TKG ET+++ + + GC +N GF
Sbjct: 119 -------HSCPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVMPETIIGCGRNNSGFKP- 170
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
AGV+GL R S I+Q+G SYC G+ TS + FG +
Sbjct: 171 ----GFAGVVGLDRGPKSLITQMGGEYPGLMSYCFA-----GKGTSK-INFGANAIVAGD 220
Query: 167 STQATK-FINHPN-NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
+T F+ FYYL+L +S+ N R+ F + +G +IDSGS LTYF
Sbjct: 221 GVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFH---ALKGNIVIDSGSTLTYF- 276
Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE-DANLRIDG 283
+ Y L K V Q+ P LCY+ +T + FP + +F A+L +D
Sbjct: 277 PESYCNLVRKAVE-----QVVTAVRFPRSDILCYY-SKTIDIFPVITMHFSGGADLVLDK 330
Query: 284 ENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
N+++ F LA+ + + A+ G++ Q + YD + L+SF NCS
Sbjct: 331 YNMYVASNTGGVFCLAIICNSPIEEAIFGNRAQNNFLVGYDSSSLLVSFKPTNCS 385
>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
Length = 489
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 106/360 (29%), Positives = 159/360 (44%), Gaps = 43/360 (11%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
RL +GTP K V ++LDTGS +++ +FDP+KS SF I+C P C
Sbjct: 149 TRLGVGTPPKYVYMVLDTGSDVVWIQCAPCRKCYSQTDPVFDPKKSGSFSSISCRSPLCL 208
Query: 48 YF---KCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
C + Q C+Y + Y D S T G + ET++ G K GC +DN G
Sbjct: 209 RLDSPGCNSRQSCLYQVAYGDGSFTFGEFSTETLTFRGTRVPKVAL-----GCGHDNEGL 263
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
A R +SF +Q G ++FSYCLV + + +S + FG
Sbjct: 264 FVGAAGLLGL-----GRGRLSFPTQTGLRFGRKFSYCLVDRSASSKPSS--VVFGQSAVS 316
Query: 164 RRPSTQATKFINHP--NNFYYLSLKDISIDNERM-NFPPDTFDITVSGEGGCIIDSGSVL 220
R + T I +P + FYYL L IS+ R+ F + +G GG IIDSG+ +
Sbjct: 317 R--TAVFTPLITNPKLDTFYYLELTGISVGGARVAGITASLFKLDTAGNGGVIIDSGTSV 374
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCYFLP-ETFNRFPSMAFYFEDA 277
T Y L + F R A L P+ C+ L +T + P++ +F A
Sbjct: 375 TRLTRRAYVSLRDAF-----RAGAADLKRAPDYSLFDTCFDLSGKTEVKVPTVVMHFRGA 429
Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
++ + N I N F A A +++IG+ QQ+ R V+D+ + F C+
Sbjct: 430 DVSLPATNYLIPVDTNGVFCFAFAGTMSGLSIIGNIQQQGFRVVFDVAASRIGFAARGCA 489
>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
Length = 452
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 105/373 (28%), Positives = 162/373 (43%), Gaps = 54/373 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+V L IGTP + V +LDTGS LI+ +F P +S+S++ + C C
Sbjct: 97 VVDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLSQPDPLFAPGQSASYEPMRCAGTLC 156
Query: 47 T---YFKCVN-EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL--FGCSNDN 100
+ + C + C Y Y D ++T G A E + G G FGC + N
Sbjct: 157 SDILHHSCERPDTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVPLGFGCGSVN 216
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS---SYLKF 157
G + +G++G R +S +SQL SI +RFSYCL Y S S L F
Sbjct: 217 VGSLNNG-----SGIVGFGRNPLSLVSQL-SI--RRFSYCLT------SYASRRQSTLLF 262
Query: 158 GT----DMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGG 211
G+ G Q T + P N FYY+ +++ R+ P F + G GG
Sbjct: 263 GSLSDGVYGDATGRVQTTPLLQSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDGSGG 322
Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR----- 266
I+DSG+ LT + V L E ++ ++ +L + +C+ +P + R
Sbjct: 323 VIVDSGTALTLLPAAV---LAEVVRAFRQQLRLPFANGGNPEDGVCFLVPAAWRRSSSTS 379
Query: 267 ---FPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYD 323
P M +F+ A+L + N + D+ L +A D + IG+ Q+D R +YD
Sbjct: 380 QMPVPRMVLHFQGADLDLPRRNYVLDDHRRGRLCLLLADSGDDGSTIGNLVQQDMRVLYD 439
Query: 324 LNIDLLSFVKENC 336
L + LS C
Sbjct: 440 LEAETLSIAPARC 452
>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
Length = 445
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 106/366 (28%), Positives = 161/366 (43%), Gaps = 49/366 (13%)
Query: 6 IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC----- 46
IGTP + LILDTGS LI+ ++DP KSSSF CD C
Sbjct: 95 IGTPPQPRTLILDTGSDLIWTQCKLFDTRQHREKPLYDPAKSSSFAAAPCDGRLCETGSF 154
Query: 47 TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
C +C+YT Y + TKG A ET + GE + + FGC G
Sbjct: 155 NTKNCSRNKCIYTYNYGS-ATTKGELASETFTF---GEHRRVSVSLDFGCGKLTSGSLPG 210
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT--DMGYR 164
A +G+LG+S +S +SQL RFSYCL L T+S++ FG D+
Sbjct: 211 A-----SGILGISPDRLSLVSQLQ---IPRFSYCLTPFL--DRNTTSHIFFGAMADLSKY 260
Query: 165 RPS--TQATKFINHP---NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
R + Q T + +P N +YY+ L IS+ +R+N P +F I G GG +DSG
Sbjct: 261 RTTGPIQTTSLVTNPDGSNYYYYVPLIGISVGTKRLNVPVSSFAIGRDGSGGTFVDSGDT 320
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP-------ETFNRFPSMAF 272
S V L E V + + +D +LC+ LP ET + P + +
Sbjct: 321 TGMLPSVVMEALKEAMVEAV-KLPVVNATDHGYEYELCFQLPRNGGGAVETAVQVPPLVY 379
Query: 273 YFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
+F+ + + ++++ L ++ A+IG+ QQ++ ++D+ SF
Sbjct: 380 HFDGGAAMLLRRDSYMVEVSAGRMCLVIS-SGARGAIIGNYQQQNMHVLFDVENHEFSFA 438
Query: 333 KENCSD 338
C+
Sbjct: 439 PTQCNQ 444
>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 500
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 96/356 (26%), Positives = 156/356 (43%), Gaps = 42/356 (11%)
Query: 3 RLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTY 48
R+ +GTP+K + L+LDTGS + + +F+P SS+++ + C P C+
Sbjct: 165 RIGVGTPAKEMYLVLDTGSDVNWIQCEPCSDCYQQSDPVFNPTSSSTYKSLTCSAPQCSL 224
Query: 49 FK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
+ C + +C+Y + Y D S T G A +T++ G+ + GC +DN G
Sbjct: 225 LETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGK----INDVALGCGHDNEGL-- 278
Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
G GL + +S + FSYCLV SS L F +
Sbjct: 279 ------FTGAAGLLGLGGGALSITNQMKATSFSYCLV---DRDSGKSSSLDFNSVQLGSG 329
Query: 166 PSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHS 225
+T + FYY+ L S+ +++ P FD+ SG GG I+D G+ +T +
Sbjct: 330 DATAPLLRNQKIDTFYYVGLSGFSVGGQKVMMPDAIFDVDASGSGGVILDCGTAVTRLQT 389
Query: 226 DVYWKLHEKFVSYFERFQLAQLSDCPEPIQL---CY-FLPETFNRFPSMAFYFEDA-NLR 280
Y L + F+ + S I L CY F + + P++AF+F +L
Sbjct: 390 QAYNSLRDAFLKLTTNLKKGTSS-----ISLFDTCYDFSSLSSVKVPTVAFHFTGGKSLD 444
Query: 281 IDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ +N I +N F A AP +++IG+ QQ+ TR YDL ++ C
Sbjct: 445 LPAKNYLIPVDDNGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLANKIIGLSGNKC 500
>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 469
Score = 120 bits (302), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 98/371 (26%), Positives = 159/371 (42%), Gaps = 50/371 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
++ L IGTP + DTGS LI+ +++P S++F + C+
Sbjct: 113 LMTLAIGTPPLPYAAVADTGSDLIWTQCAPCGTQCFEQPAPLYNPASSTTFSVLPCNS-- 170
Query: 46 CTYFKCVNE----------QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFG 95
+ C C+Y Y T G ET + +A G FG
Sbjct: 171 -SLSMCAGALAGAAPPPGCACMYNQTYG-TGWTAGVQGSETFTFGSSAADQARVPGVAFG 228
Query: 96 CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYL 155
CSN + D + AG++GL R ++S +SQLG+ RFSYCL P + TS+ L
Sbjct: 229 CSNASSS-DWNGS----AGLVGLGRGSLSLVSQLGA---GRFSYCLT-PFQDTNSTSTLL 279
Query: 156 KFGTDMGYRRPSTQATKFINHP-----NNFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
G ++T F+ P + +YYL+L IS+ + + P F + G G
Sbjct: 280 -LGPSAALNGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTG 338
Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR---- 266
G IIDSG+ +T + Y ++ S SD + LC+ LP +
Sbjct: 339 GLIIDSGTTITSLANAAYQQVRAAVKSLVTTLPTVDGSDS-TGLDLCFALPAPTSAPPAV 397
Query: 267 FPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNI 326
PSM +F+ A++ + ++ ++I + L D ++ G+ QQ++ +YD+
Sbjct: 398 LPSMTLHFDGADMVLPADS-YMISGSGVWCLAMRNQTDGAMSTFGNYQQQNMHILYDVRE 456
Query: 327 DLLSFVKENCS 337
+ LSF CS
Sbjct: 457 ETLSFAPAKCS 467
>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
Length = 471
Score = 120 bits (302), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 95/362 (26%), Positives = 155/362 (42%), Gaps = 44/362 (12%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
VR+ IG+P L++D+GS +I+ +FDP S++F ++C C
Sbjct: 127 VRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLECYAQADPLFDPASSATFSAVSCGSAICR 186
Query: 48 YFKCV----NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
+ + C Y + Y D S TKG A ET+++ G G GC + N G
Sbjct: 187 TLRTSGCGDSGGCEYEVSYGDGSYTKGTLALETLTL-----GGTAVEGVAIGCGHRNRGL 241
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS---SYLKFGTD 160
A AG+LGL +S + QLG FSYCL +G + L G
Sbjct: 242 FVGA-----AGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGGSGSGAADAAGSLVLGRS 296
Query: 161 MGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
+ + +P +FYY+ + I + +ER+ F +T G GG ++D+G+
Sbjct: 297 EAVPEGAVW-VPLVRNPQAPSFYYVGVSGIGVGDERLPLQDGLFQLTEDGGGGVVMDTGT 355
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPE-TFNRFPSMAFYFE 275
+T + Y L + FV + L P + CY L T R P+++FYF+
Sbjct: 356 AVTRLPQEAYAALRDAFVG-----AVGALPRAPGVSLLDTCYDLSGYTSVRVPTVSFYFD 410
Query: 276 D-ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
A L + N +++ + + LA AP ++++G+ QQ + D + F
Sbjct: 411 GAATLTLPARN-LLLEVDGGIYCLAFAPSSSGLSILGNIQQEGIQITVDSANGYIGFGPA 469
Query: 335 NC 336
C
Sbjct: 470 TC 471
>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 447
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 104/360 (28%), Positives = 165/360 (45%), Gaps = 43/360 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
M + IG P L+++DTGS +++ +FDP KSS+F + C P C
Sbjct: 102 MANISIGQPPIPQLVVMDTGSDILWVMCTPCTNCDNDLGLLFDPSKSSTFSPL-CKTP-C 159
Query: 47 TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
+ C + +T+ YAD S G +T+ EG + LFGC + N G D D
Sbjct: 160 DFEGCRCDPIPFTVTYADNSTASGTFGRDTVVFETTDEGTSRISDVLFGCGH-NIGHDTD 218
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV-IPLPNGEYTSSYLKFGTDM-GYR 164
G+LGL+ S +++LG ++FSYC+ + P Y L G D+ GY
Sbjct: 219 P---GHNGILGLNNGPDSLVTKLG----QKFSYCIGNLADPYYNYHQLILGEGADLEGYS 271
Query: 165 RPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
P N FYY++++ IS+ +R++ P+TF++ + GG IID+GS +T+
Sbjct: 272 TP-------FEVYNGFYYVTMEGISVGEKRLDIAPETFEMKENRAGGVIIDTGSTITFLV 324
Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF--LPETFNRFPSMAFYFED-ANLRI 281
V+ KL K V + Q + P C++ + FP + F+F D A+L +
Sbjct: 325 DSVH-KLLSKEVRNLLGWSFRQATIEKSPWMQCFYGSISRDLVGFPVVTFHFSDGADLAL 383
Query: 282 DGENVFIIDYENHFFLLAVAPHDDL-----VALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
D F ++ F + V P L +LIG Q+ YDL + F + +C
Sbjct: 384 D-SGSFFNQLNDNVFCMTVGPVSSLNIKSKPSLIGLLAQQSYNVGYDLVNQFVYFQRIDC 442
>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 510
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 104/369 (28%), Positives = 160/369 (43%), Gaps = 41/369 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
+V +++GTP + +I+DTGS L + +FDP S+S++ + C C
Sbjct: 151 LVEVYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFDQRGPVFDPMASTSYRNVTCGDTRC 210
Query: 47 ----------TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
T ++ C Y Y DQS T G A E +V G + GC
Sbjct: 211 GLVSPPAAPRTCRSSRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTASSSRRVDGVVLGC 270
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
+ N G A R +SF SQL ++ FSYCLV +G S +
Sbjct: 271 GHRNRGLFHGAAGLLGL-----GRGPLSFASQLRAVYGHAFSYCLV---DHGSAVGSKIV 322
Query: 157 FGTD-MGYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVS-GEGGC 212
FG D + P T F N FYY+ LK I + E ++ P +T+ ++ G GG
Sbjct: 323 FGDDNVLLSHPQLNYTAFAPSAAENTFYYVQLKGILVGGEMLDIPSNTWGVSKEDGSGGT 382
Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMA 271
IIDSG+ L+YF Y + + FV ++ ++D P + CY + P +
Sbjct: 383 IIDSGTTLSYFPEPAYKAIRQAFVDRMDK-AYPLIADFPV-LSPCYNVSGVERVEVPEFS 440
Query: 272 FYFED-ANLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
F D A EN FI +D E L + +++IG+ QQ++ +YDL+ + L
Sbjct: 441 LLFADGAVWDFPAENYFIRLDTEGIMCLAVLGTPRSAMSIIGNYQQQNFHVLYDLHHNRL 500
Query: 330 SFVKENCSD 338
F C++
Sbjct: 501 GFAPRRCAE 509
>gi|224114179|ref|XP_002332420.1| predicted protein [Populus trichocarpa]
gi|222832373|gb|EEE70850.1| predicted protein [Populus trichocarpa]
Length = 449
Score = 120 bits (302), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 89/327 (27%), Positives = 151/327 (46%), Gaps = 27/327 (8%)
Query: 28 FDPRKSSSFQKINCD-HPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGK 86
+ +S S++ ++C+ H C +C C Y + Y S T G A+ET +
Sbjct: 134 YTSSQSKSYKPVSCNQHSFCEPNQCKEGLCAYNVTYGPGSYTSGNLANETFTFYSNHGKH 193
Query: 87 AIFHGALFGCSNDN----HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV 142
FGCS D+ + F D ++GVLG+ SF++QLGSI +FSYC+
Sbjct: 194 TALKSISFGCSTDSRNMIYAFLLDKN--PVSGVLGMGWGPRSFLAQLGSISHGKFSYCIT 251
Query: 143 IPLPNGEYTSSYLKFGTDMGYRRPSTQATKFIN-HPNNFYYLSLKDISIDNERMNFPPDT 201
++YL+FG + + + Q TK + P+ Y+++L IS++ ++N
Sbjct: 252 A----NNTHNTYLRFGKHV-VKSKNLQTTKIMQVKPSAAYHVNLLGISVNGVKLNITKTD 306
Query: 202 FDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSY------FERFQLAQLSDCPEPIQ 255
+ G GCIID+G++ T ++ LH ++ +R+ + +L
Sbjct: 307 LAVRKDGSRGCIIDAGTLATLLVKPIFDTLHTALSNHLSSNQNLKRWVIHKLHK-----D 361
Query: 256 LCYFLPETFNR--FPSMAFYFEDANLRIDGENVFII-DYENHFFLLAVAPHDDLVALIGS 312
LCY R P + F+ E+A+L + E +F+ ++E DD +IG+
Sbjct: 362 LCYEQLSDAGRKNLPVVTFHLENADLEVKPEAIFLFREFEGKNVFCLSMLSDDSKTIIGA 421
Query: 313 QQQRDTRFVYDLNIDLLSFVKENCSDD 339
QQ +FVYD +LSF E+C +
Sbjct: 422 YQQMKQKFVYDTKARVLSFGPEDCEKN 448
>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
Length = 447
Score = 120 bits (301), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 103/365 (28%), Positives = 164/365 (44%), Gaps = 43/365 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ L IGTP + + DTGS L + I+D SSSF + C C
Sbjct: 94 LMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPIYDTAVSSSFSPVPCASATC 153
Query: 47 ----TYFKCV--NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
+ C + C Y Y D + + G ET++ G G ++ G FGC DN
Sbjct: 154 LPIWSSRNCTASSSPCRYRYAYGDGAYSAGVLGTETLTFPGA-PGVSV-GGIAFGCGVDN 211
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
G ++ G +GL R ++S ++QLG +FSYCL N S L FG
Sbjct: 212 GGLSYNS-----TGTVGLGRGSLSLVAQLG---VGKFSYCLT-DFFNTSLGSPVL-FGAL 261
Query: 161 MGYRRPST----QATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
PST Q+T + P +YY+SL+ IS+ + R+ P TFD+ G GG I+
Sbjct: 262 AELAAPSTGAAVQSTPLVQSPYVPTWYYVSLEGISLGDARLPIPNGTFDLRDDGSGGMIV 321
Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYF 274
DSG+ T+ + ++ V+ R + S P + P M +F
Sbjct: 322 DSGTTFTFLVESAF-RVVVDHVAGVLRQPVVNASSLDSPCFPAATGEQQLPAMPDMVLHF 380
Query: 275 E-DANLRIDGENVFIIDYENHFFLLAVA--PHDDLVALIGSQQQRDTRFVYDLNIDLLSF 331
A++R+ +N + E F L +A P D V+++G+ QQ++ + ++D+ + LSF
Sbjct: 381 AGGADMRLHRDNYMSFNQEESSFCLNIAGSPSAD-VSILGNFQQQNIQMLFDITVGQLSF 439
Query: 332 VKENC 336
+ +C
Sbjct: 440 MPTDC 444
>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
Length = 491
Score = 120 bits (301), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 109/364 (29%), Positives = 163/364 (44%), Gaps = 50/364 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA-----------------IFDPRKSSSFQKINCDH 43
+V + +GTP++ LI DTGS L + +FDP KSS++ ++C
Sbjct: 150 VVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAAVHCGE 209
Query: 44 PDCTYFKCV----NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
P C + N C+Y + Y D S T G + +T+++ G FGC
Sbjct: 210 PQCAAAGGLCSEDNTTCLYLVHYGDGSSTTGVLSRDTLALTSS----RALAGFPFGCGTR 265
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
N G G + G+LGL R +S SQ + FSYCL P+ T+ YL G
Sbjct: 266 NLG-----DFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCL----PSSNSTTGYLTIGA 316
Query: 160 DMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
+ Q T + P +FY++ L I I + PP F GG ++DSG
Sbjct: 317 TPATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYILPVPPAVFT-----RGGTLLDSG 371
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFED 276
+VLTY + Y L ++F ER+ A +D + CY F E+ P+++F F D
Sbjct: 372 TVLTYLPAQAYELLRDRFRLTMERYTPAPPNDV---LDACYDFAGESEVIVPAVSFRFGD 428
Query: 277 -ANLRIDGENVFIIDYENHFFLLAVAPHDD---LVALIGSQQQRDTRFVYDLNIDLLSFV 332
A +D V I EN LA A D +++IG+ QQR +YD+ + + FV
Sbjct: 429 GAVFELDFFGVMIFLDEN-VGCLAFAAMDAGGLPLSIIGNTQQRSAEVIYDVAAEKIGFV 487
Query: 333 KENC 336
+C
Sbjct: 488 PASC 491
>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 417
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 91/362 (25%), Positives = 168/362 (46%), Gaps = 38/362 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ L IGTP + + DTGS L + ++DP SS+F + C C
Sbjct: 67 LMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPVPCSSATC 126
Query: 47 --TYFK--CVNEQ--CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL-FGCSND 99
T+ C N C Y Y+D + + G ET+++ G+ + G++ FGC D
Sbjct: 127 LPTWRSRNCSNPSSPCRYIYSYSDGAYSVGILGTETLTIGSSVPGQTVSVGSVAFGCGTD 186
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
N G ++ G +GL R T+S ++QLG +FSYCL + + +L
Sbjct: 187 NGGDSLNS-----TGTVGLGRGTLSLLAQLG---VGKFSYCLTDFFNSTMDSPFFLGTLA 238
Query: 160 DMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
++ + Q+T + P N Y+++L+ IS+ + R+ P TFD+ G GG ++DSG
Sbjct: 239 ELAPGPGTVQSTPLLQSPLNPSRYFVNLQGISLGDVRLPIPNGTFDLRADGNGGMMVDSG 298
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLS-DCPEPIQLCYFLPETFNRFPSMAFYFE- 275
+ T + ++ ++ + + S D P C+ P+ P + +F
Sbjct: 299 TTFTILAKSGFREVVDRVAQLLGQPPVNASSLDSP-----CFPSPDGEPFMPDLVLHFAG 353
Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
A++R+ +N + ++ F L + + +G+ QQ++ + ++D+ + LSF+ +
Sbjct: 354 GADMRLHRDNYMSYNEDDSSFCLNIVGSPSTWSRLGNFQQQNIQMLFDMTVGQLSFLPTD 413
Query: 336 CS 337
CS
Sbjct: 414 CS 415
>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 440
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 100/361 (27%), Positives = 169/361 (46%), Gaps = 41/361 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ + +GTP ++ I DTGS L++ +FDP+ SS+++ ++C C
Sbjct: 95 LMNISLGTPPFPIMAIADTGSDLLWTQCKPCDDCYTQVDPLFDPKASSTYKDVSCSSSQC 154
Query: 47 TYFK----CVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
T + C E C Y+ Y D+S TKG A +T+++ + GC ++N
Sbjct: 155 TALENQASCSTEDNTCSYSTSYGDRSYTKGNIAVDTLTLGSTDTRPVQLKNIIIGCGHNN 214
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
G + +G++GL +S I+QLG I +FSYCLV PL + +S + FGT+
Sbjct: 215 AG----TFNKKGSGIVGLGGGAVSLITQLGDSIDGKFSYCLV-PLTSENDRTSKINFGTN 269
Query: 161 MGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
+T I FYYL+LK IS+ ++ + +P + SGEG IIDSG+
Sbjct: 270 AVVSGTGVVSTPLIAKSQETFYYLTLKSISVGSKEVQYPGSD---SGSGEGNIIIDSGTT 326
Query: 220 LTYFHSDVYWKLHEKFVSYF--ERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDA 277
LT ++ Y +L + S E+ Q Q + LCY + P++ +F+ A
Sbjct: 327 LTLLPTEFYSELEDAVASSIDAEKKQDPQTG-----LSLCYSATGDL-KVPAITMHFDGA 380
Query: 278 NLRIDGENVFIIDYENHF-FLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
++ + N F+ E+ F +P ++ G+ Q + YD +SF +C
Sbjct: 381 DVNLKPSNCFVQISEDLVCFAFRGSPS---FSIYGNVAQMNFLVGYDTVSKTVSFKPTDC 437
Query: 337 S 337
+
Sbjct: 438 A 438
>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
gi|194705620|gb|ACF86894.1| unknown [Zea mays]
gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
Length = 477
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 106/362 (29%), Positives = 156/362 (43%), Gaps = 50/362 (13%)
Query: 4 LFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYF 49
L +GTP+ +L+ LDTGS + A+FDP KSS++ I C +C
Sbjct: 138 LRLGTPATDLLVELDTGSDQSWIQCKPCPDCYEQHEALFDPSKSSTYSDITCSSRECQEL 197
Query: 50 ------KC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C +++C Y + YAD S T G A +T+++ G +FGC ++N G
Sbjct: 198 GSSHKHNCSSDKKCPYEITYADDSYTVGNLARDTLTL----SPTDAVPGFVFGCGHNNAG 253
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
G + G+LGL R S SQ+ + FSYC LP+ + YL F
Sbjct: 254 -----SFGEIDGLLGLGRGKASLSSQVAARYGAGFSYC----LPSSPSATGYLSFSGAAA 304
Query: 163 YRRPSTQATKFI--NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
+ Q T+ + HP +FYYL+L I++ + PP F G IIDSG+
Sbjct: 305 AAPTNAQFTEMVAGQHP-SFYYLNLTGITVAGRAIKVPPSVFATAA----GTIIDSGTAF 359
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP--ETFNRFPSMAFYFED-A 277
+ Y L S R++ A S CY L ET R PS+A F D A
Sbjct: 360 SCLPPSAYAALRSSVRSAMGRYKRAPSSTI---FDTCYDLTGHETV-RIPSVALVFADGA 415
Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVAL--IGSQQQRDTRFVYDLNIDLLSFVKEN 335
+ + V LA P+ D +L +G+ QQR +YD++ + F
Sbjct: 416 TVHLHPSGVLYTWSNVSQTCLAFLPNPDDTSLGVLGNTQQRTLAVIYDVDNQKVGFGANG 475
Query: 336 CS 337
C+
Sbjct: 476 CA 477
>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 447
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 107/361 (29%), Positives = 166/361 (45%), Gaps = 36/361 (9%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
++ + +GTP + I DTGS L++ IFDP KS ++Q ++C+ C
Sbjct: 96 LMNISLGTPPVSMHGIADTGSDLLWRQCKPCDSCYEQIEPIFDPAKSKTYQILSCEGKSC 155
Query: 47 TYFK----CVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAI-FHGALFGCSNDN 100
+ C ++ C+Y+ Y D S T G A +T++ IG G+ + +FGC ++N
Sbjct: 156 SNLGGQGGCSDDNTCIYSYSYGDGSHTSGDLAVDTLT-IGSTTGRPVSVPKVVFGCGHNN 214
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
G E G + L +S ISQL +I RFSYCLV PL N SS + FG+
Sbjct: 215 GGTFELHGSGLVG----LGGGPLSMISQLRPLIGGRFSYCLV-PLGNDPSVSSKMHFGSR 269
Query: 161 MGYRRPSTQATKFIN-HPNNFYYLSLKDISIDNERM---NFPPDTFDITVSGEGGCIIDS 216
+T + P+ FYYL+L+ +S+ ++++ F + + EG IIDS
Sbjct: 270 GIVSGAGAVSTPLASRQPDTFYYLTLESMSVGSKKLAYKGFSKVGSPLADADEGNIIIDS 329
Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFED 276
G+ LT D Y L VS + D LCY R P++ +F
Sbjct: 330 GTTLTLLPQDFYGTLESNVVSAIGG---KPVRDPNNVFSLCYSNLSGL-RIPTITAHFVG 385
Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
A+L + N F + + F A+ P DL A+ G+ Q + YDL +SF +C
Sbjct: 386 ADLELKPLNTF-VQVQEDLFCFAMIPVSDL-AIFGNLAQMNFLVGYDLKSRTVSFKPTDC 443
Query: 337 S 337
+
Sbjct: 444 T 444
>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 488
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 104/354 (29%), Positives = 156/354 (44%), Gaps = 56/354 (15%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPDC 46
V + +GTP + + LI DTGS L + AIFDP KS+S+ I C C
Sbjct: 147 VVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDAIFDPSKSTSYSNITCTSTLC 206
Query: 47 TYFKCVN----------EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
T + C+Y ++Y D S + G+ + E +SV I LFGC
Sbjct: 207 TQLSTATGNEPGCSASTKACIYGIQYGDSSFSVGYFSRERLSVTATD----IVDNFLFGC 262
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
+N G G AG++GL R ISF+ Q ++ +K FSYC LP ++ L
Sbjct: 263 GQNNQGLF-----GGSAGLIGLGRHPISFVQQTAAVYRKIFSYC----LPATSSSTGRLS 313
Query: 157 FG-TDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
FG T Y + + +T I+ ++FY L + IS+ ++ TF GG IID
Sbjct: 314 FGTTTTSYVKYTPFST--ISRGSSFYGLDITGISVGGAKLPVSSSTFST-----GGAIID 366
Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLA-QLSDCPEPIQLCYFLP--ETFNRFPSMAF 272
SG+V+T Y L F ++ A +LS + CY L E F+ P + F
Sbjct: 367 SGTVITRLPPTAYTALRSAFRQGMSKYPSAGELS----ILDTCYDLSGYEVFS-IPKIDF 421
Query: 273 YFEDA-NLRIDGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDL 324
F +++ + + + L A DD V + G+ QQ+ VYD+
Sbjct: 422 SFAGGVTVQLPPQGILYVASAKQVCLAFAANGDDSDVTIYGNVQQKTIEVVYDV 475
>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
Length = 430
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 104/381 (27%), Positives = 156/381 (40%), Gaps = 53/381 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY---------AIFDPRK------------SSSFQKI 39
+V + GTP + VLLI DTGS LI+ F P+K S++ +
Sbjct: 55 LVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFVASKSATLSVV 114
Query: 40 NCDHPDCTYFKCVNEQ-----------CVYTMKYADQSVTKGFAAHETISVIGKGEGKAI 88
C C C Y YAD S T GF A +T ++ G A
Sbjct: 115 PCSAAQCLLVPAPRGHGPSCSPAAPVPCGYAYDYADGSSTTGFLARDTATISNGTSGGAA 174
Query: 89 FHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNG 148
G FGC N G GV+GL + +SF +Q GS+ + FSYCL + L G
Sbjct: 175 VRGVAFGCGTRNQGGSFSG----TGGVIGLGQGQLSFPAQSGSLFAQTFSYCL-LDLEGG 229
Query: 149 EY--TSSYLKFGTDMGYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDI 204
+SS+L G RR + T +++P FYY+ + I + N + P + I
Sbjct: 230 RRGRSSSFLFLGRPE--RRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWAI 287
Query: 205 TVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY------ 258
V G GG +IDSGS LTY Y L F + ++ + + ++LCY
Sbjct: 288 DVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQGLELCYNVSSSS 347
Query: 259 FLPETFNRFPSMAFYFEDA-NLRIDGENVFIIDYENHFFLLAVAPHDDLVA--LIGSQQQ 315
L FP + F +L + N +++D + LA+ P A ++G+ Q
Sbjct: 348 SLAPANGGFPRLTIDFAQGLSLELPTGN-YLVDVADDVKCLAIRPTLSPFAFNVLGNLMQ 406
Query: 316 RDTRFVYDLNIDLLSFVKENC 336
+ +D + F + C
Sbjct: 407 QGYHVEFDRASARIGFARTEC 427
>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
Length = 462
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 108/368 (29%), Positives = 166/368 (45%), Gaps = 58/368 (15%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
+V + GTP++ L+ DTGS + + IFDP KS+++ + C HP
Sbjct: 121 VVTVGFGTPAQTYTLMFDTGSDVSWIQCLPCSGHCYKQHDPIFDPTKSATYSAVPCGHPQ 180
Query: 46 CTYF--KC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C KC N C+Y ++Y D S T G +HET+S+ +A+ G FGC N G
Sbjct: 181 CAAAGGKCSSNGTCLYKVQYGDGSSTAGVLSHETLSLT---SARAL-PGFAFGCGETNLG 236
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM- 161
G + G++GL R +S SQ + FSYC LP+ + YL GT
Sbjct: 237 -----DFGDVDGLIGLGRGQLSLSSQAAASFGAAFSYC----LPSYNTSHGYLTIGTTTP 287
Query: 162 -----GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
G R T + ++P +FY++ L I + + PP F G ++DS
Sbjct: 288 ASGSDGVRY--TAMIQKQDYP-SFYFVDLVSIVVGGFVLPVPPILFT-----RDGTLLDS 339
Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCY-FLPETFNRFPSMAFY 273
G+VLTY + Y L ++F +F + Q P +P CY F + P ++F
Sbjct: 340 GTVLTYLPPEAYTALRDRF-----KFTMTQYKPAPAYDPFDTCYDFAGQNAIFMPLVSFK 394
Query: 274 FEDA---NLRIDGENVFIIDYENHFFLLAVAPHDDLV--ALIGSQQQRDTRFVYDLNIDL 328
F D +L G +F D LA P + ++G+ QQR+T +YD+ +
Sbjct: 395 FSDGSSFDLSPFGVLIFPDDTAPATGCLAFVPRPSTMPFTIVGNTQQRNTEMIYDVAAEK 454
Query: 329 LSFVKENC 336
+ FV +C
Sbjct: 455 IGFVSGSC 462
>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 367
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 100/360 (27%), Positives = 160/360 (44%), Gaps = 43/360 (11%)
Query: 6 IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT---Y 48
+GTP + + L++DTGS + + A+F+P SSSF+ ++C C
Sbjct: 22 VGTPRRDMYLVVDTGSDITWLQCAPCTNCYKQKDALFNPSSSSSFKVLDCSSSLCLNLDV 81
Query: 49 FKCVNEQCVYTMKYADQSVTKGFAAHETISVIGK-GEGKAIFHGALFGCSNDNHGFDEDA 107
C++ +C+Y Y D S T G + + + G G+ + GC +DN G
Sbjct: 82 MGCLSNKCLYQADYGDGSFTMGELVTDNVVLDDAFGPGQVVLTNIPLGCGHDNEG----- 136
Query: 108 RDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT---SSYLKFGTDMGYR 164
G AG+LGL R +SF + L + + FSYC LP+ E S L FG D
Sbjct: 137 TFGTAAGILGLGRGPLSFPNNLDASTRNIFSYC----LPDRESDPNHKSTLVFG-DAAIP 191
Query: 165 RPSTQATKFINHPNN-----FYYLSLKDISI-DNERMNFPPDTFDITVSGEGGCIIDSGS 218
+T + KFI N +YY+ + IS+ N N P F + G GG I DSG+
Sbjct: 192 HTATGSVKFIPQLRNPRVATYYYVQITGISVGGNLLTNIPASVFQLDSHGNGGTIFDSGT 251
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE-D 276
+T + Y + + F + L +D + CY F P++ F+F+ D
Sbjct: 252 TITRLEARAYTAVRDAFRA--ATMHLTSAADF-KIFDTCYDFTGMNSISVPTVTFHFQGD 308
Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
++R+ N + N+ F A A ++IG+ QQ+ R +YD + + + C
Sbjct: 309 VDMRLPPSNYIVPVSNNNIFCFAFAASMG-PSVIGNVQQQSFRVIYDNVHKQIGLLPDQC 367
>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 757
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 115/378 (30%), Positives = 163/378 (43%), Gaps = 49/378 (12%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
+ +FIG+P K LILDTGS L + +DP+ S SF+ I C+ P C
Sbjct: 198 IDVFIGSPPKHFSLILDTGSDLNWIQCVPCFDCFEQNGPYYDPKDSISFRNITCNDPRCQ 257
Query: 48 YF---------KCVNEQCVYTMKYADQSVTKG------FAAHETISVIGKGEGKAIFHGA 92
K + C Y Y D S T G F + T S GK E + +
Sbjct: 258 LVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRV-ENV 316
Query: 93 LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS 152
+FGC + N G A R +SF SQL S+ FSYCLV + S
Sbjct: 317 MFGCGHWNRGLFHGAAGLLGL-----GRGPLSFSSQLQSLYGHSFSYCLV-DRDSDTSVS 370
Query: 153 SYLKFGTDMGY-RRPSTQATKFI----NHPNNFYYLSLKDISIDNERMNFPPDTFDITVS 207
S L FG D P T I N + FYYL +K I + E++ P + ++++
Sbjct: 371 SKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQIPEENWNLSAD 430
Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-R 266
G GG IIDSG+ L+YF Y + E F+ + ++L + D P + CY + T
Sbjct: 431 GAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVE--DFP-ILHPCYNVSGTDELN 487
Query: 267 FPSMAFYFED-ANLRIDGENVFIIDYENHFFLLAV--APHDDLVALIGSQQQRDTRFVYD 323
FP F D A EN FI + LA+ P L ++IG+ QQ++ +YD
Sbjct: 488 FPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSAL-SIIGNYQQQNFHILYD 546
Query: 324 LNIDLLSFVKENCSDDSA 341
L + C++ A
Sbjct: 547 TKNSRLGYAPMRCAEIEA 564
>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Glycine max]
Length = 392
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 105/370 (28%), Positives = 155/370 (41%), Gaps = 59/370 (15%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
+V + +GTP + + L+ DTGS L + AIFDP KSSS+ I C
Sbjct: 47 VVVVGLGTPKRDLSLVFDTGSDLTWTQCEPCAGSCYKQQDAIFDPSKSSSYTNITCTSSL 106
Query: 46 CTYF-------KC---VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFG 95
CT +C + C+Y KY D S + GF + E +++ I LFG
Sbjct: 107 CTQLTSDGIKSECSSSTDASCIYDAKYGDNSTSVGFLSQERLTITAT----DIVDDFLFG 162
Query: 96 CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYL 155
C DN G + AG++GL R IS + Q S K FSYC LP + +L
Sbjct: 163 CGQDNEGLFNGS-----AGLMGLGRHPISIVQQTSSNYNKIFSYC----LPATSSSLGHL 213
Query: 156 KFGTDMGYRRPSTQAT------KFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
FG +T A+ I+ N+FY L + IS+ ++ P T S
Sbjct: 214 TFGAS-----AATNASLIYTPLSTISGDNSFYGLDIVSISVGGTKL---PAVSSSTFSA- 264
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFP 268
GG IIDSG+V+T VY L F E++ +A + + CY L P
Sbjct: 265 GGSIIDSGTVITRLAPTVYAALRSAFRRXMEKYPVANEAGL---LDTCYDLSGYKEISVP 321
Query: 269 SMAFYFEDA-NLRIDGENVFIIDYENHFFL-LAVAPHDDLVALIGSQQQRDTRFVYDLNI 326
+ F F + + + ++ E L A D+ + + G+ QQ+ VYD+
Sbjct: 322 RIDFEFSGGVTVELXHRGILXVESEQQVCLAFAANGSDNDITVFGNVQQKTLEVVYDVKG 381
Query: 327 DLLSFVKENC 336
+ F C
Sbjct: 382 GRIGFGAAGC 391
>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
Length = 357
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 101/363 (27%), Positives = 155/363 (42%), Gaps = 47/363 (12%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
R+ IG P + L LDTGS + + I+DP SSS++++ C C
Sbjct: 14 ARMGIGNPQRSYYLELDTGSDVTWIQCAPCSSCYSQVDPIYDPSNSSSYRRVYCGSALCQ 73
Query: 48 ---YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
Y C C Y + Y D S + G E+ +G A+ + A FGC + N G
Sbjct: 74 ALDYSACQGMGCSYRVVYGDSSASSGDLGIESF-YLGPNSSTAMRNIA-FGCGHSNSGLF 131
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG-TDMGY 163
T+SF SQ+ + I FSYCLV + SS L FG T + +
Sbjct: 132 RGEAGLLGM-----GGGTLSFFSQIAASIGPAFSYCLVDRYSQLQSRSSPLIFGRTAIPF 186
Query: 164 RRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
+ + T + +P N FYY L IS+ + PP F +T +G GG I+DSG+ +T
Sbjct: 187 ---AARFTPLLKNPRINTFYYAVLTGISVGGTPLPIPPAQFALTGNGTGGAILDSGTSVT 243
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-------RFPSMAFYF 274
Y L + + + P Y L FN + PS+ +F
Sbjct: 244 RVVPPAYAVLRDAYRAASRNL---------PPAPGVYLLDTCFNFQGLPTVQIPSLVLHF 294
Query: 275 EDA-NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
++ ++ + G N+ I + F LA AP +++IG+ QQ+ R +DL L++
Sbjct: 295 DNGVDMVLPGGNILIPVDRSGTFCLAFAPSSMPISVIGNVQQQTFRIGFDLQRSLIAIAP 354
Query: 334 ENC 336
C
Sbjct: 355 REC 357
>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 459
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 102/377 (27%), Positives = 157/377 (41%), Gaps = 52/377 (13%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPDC 46
V + +GTP + +LL+ DTGS L++ + F PR SSSF +C P C
Sbjct: 90 VDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFLPRHSSSFSPFHCFDPHC 149
Query: 47 TYFK------C----VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
C ++ C + YAD S++ GF + ET ++ + G FGC
Sbjct: 150 RLLPHAPHHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTTLKSLSGSEIHLKGLSFGC 209
Query: 97 SNDNHGFD-EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT---- 151
G A+ GV+GL R +ISF SQLG +FSYCL+ +YT
Sbjct: 210 GFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRFGNKFSYCLM------DYTLSPP 263
Query: 152 -SSYLKFGTDMGYRRPSTQATKFINHP-------NNFYYLSLKDISIDNERMNFPPDTFD 203
+S+L G + + P T ATK P FYY+++ I+ID ++ P ++
Sbjct: 264 PTSFLMIGGGL-HSLPLTNATKISYTPLQINPLSPTFYYITIHSITIDGVKLPINPAVWE 322
Query: 204 ITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET 263
I G GG ++DSG+ LTY Y E S R +L ++ LC
Sbjct: 323 IDEQGNGGTVVDSGTTLTYLTKTAY---EEVLKSVRRRVKLPNAAELTPGFDLCVNASGE 379
Query: 264 FNR--FPSMAFYFEDANLRIDGENVFIIDYENHFFLLAV--APHDDLVALIGSQQQRDTR 319
R P + F + + ++ E LA+ + ++IG+ Q+
Sbjct: 380 SRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAVESGNGFSVIGNLMQQGFL 439
Query: 320 FVYDLNIDLLSFVKENC 336
+D L F + C
Sbjct: 440 LEFDKEESRLGFTRRGC 456
>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
Length = 483
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 112/369 (30%), Positives = 168/369 (45%), Gaps = 51/369 (13%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHP--- 44
VRL +GTP++ + +++DTGS L + IFDPR SSSFQ+I C P
Sbjct: 131 VRLGVGTPARSLFMVVDTGSDLPWLQCQPCKSCYKQADPIFDPRNSSSFQRIPCLSPLCK 190
Query: 45 -----DCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
C+ + +C Y + Y D S + G + + ++ G G A FGC
Sbjct: 191 ALEIHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTL---GTGSKAMSVA-FGC--- 243
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQL-----GSIIKKRFSYCLVIPLPNGEYTSSY 154
GFD + AG+LGL +SF SQ+ S FSYCLV +SS
Sbjct: 244 --GFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDRSNPMTRSSSS 301
Query: 155 LKFGTDMGYRRPSTQA-TKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGG 211
L FG PST A + + +P + FYY ++ +S+ ++ + ++ SG GG
Sbjct: 302 LIFGAAA---IPSTAALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQLSQSGSGG 358
Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCY-FLPETFNRFP 268
IIDSG+ +T F + VY + + F R L P CY F + P
Sbjct: 359 VIIDSGTSVTRFPTSVYATIRDAF-----RNATTNLPSAPRYSLFDTCYNFSGKASVDVP 413
Query: 269 SMAFYFED-ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNID 327
++ +FE+ A+L++ N I F LA AP + +IG+ QQ+ R +DL
Sbjct: 414 ALVLHFENGADLQLPPTNYLIPINTAGSFCLAFAPTSMELGIIGNIQQQSFRIGFDLQKS 473
Query: 328 LLSFVKENC 336
L+F + C
Sbjct: 474 HLAFAPQQC 482
>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 752
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 119/378 (31%), Positives = 168/378 (44%), Gaps = 49/378 (12%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
+ +FIG+P K LILDTGS L + +DP+ S SF+ I C+ P C
Sbjct: 198 IDVFIGSPPKHFSLILDTGSDLNWIQCVPCFDCFEQNGPYYDPKDSISFRNITCNDPRCQ 257
Query: 48 YF---------KCVNEQCVYTMKYADQSVTKG------FAAHETISVIGKGEGKAIFHGA 92
K + C Y Y D S T G F + T S GK E + +
Sbjct: 258 LVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRV-ENV 316
Query: 93 LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS 152
+FGC + N G AG+LGL R +SF SQL S+ FSYCLV + S
Sbjct: 317 MFGCGHWNRGLFH-----GAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLV-DRDSDTSVS 370
Query: 153 SYLKFGTDMGY-RRPSTQATKFI----NHPNNFYYLSLKDISIDNERMNFPPDTFDITVS 207
S L FG D P T I N + FYYL +K I + E++ P + ++++
Sbjct: 371 SKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQIPEENWNLSAD 430
Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-R 266
G GG IIDSG+ L+YF Y + E F+ + ++L + D P + CY + T
Sbjct: 431 GAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVE--DFP-ILHPCYNVSGTDELN 487
Query: 267 FPSMAFYFED-ANLRIDGENVFIIDYENHFFLLAV--APHDDLVALIGSQQQRDTRFVYD 323
FP F D A EN FI + LA+ P L ++IG+ QQ++ +YD
Sbjct: 488 FPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSAL-SIIGNYQQQNFHILYD 546
Query: 324 LNIDLLSFVKENCSDDSA 341
L + C++ A
Sbjct: 547 TKNSRLGYAPMRCAEIEA 564
>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
Length = 390
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 107/370 (28%), Positives = 162/370 (43%), Gaps = 52/370 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+V L IGTP + V L LDTGS LI+ FD +SS+ + C+ C
Sbjct: 36 LVHLAIGTPPQPVQLTLDTGSDLIWTQCKPCVSCFDQPLPYFDTSRSSTNALLPCESTQC 95
Query: 47 ----TYFKCVN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
T CV + C Y Y D SVT G A + + + G FGC
Sbjct: 96 KLDPTVTVCVKLNQTVQTCAYYTSYGDNSVTIGLLAADKFTFVAGTS----LPGVTFGCG 151
Query: 98 NDNHG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
+N G F+ + G+ G R +S SQL FS+C + ++ L
Sbjct: 152 LNNTGVFNSNE-----TGIAGFGRGPLSLPSQLK---VGNFSHCFTT-ITGAIPSTVLLD 202
Query: 157 FGTDM-GYRRPSTQATKFINHPNN-----FYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
D+ + + Q T I + N YYLSLK I++ + R+ P F +T +G G
Sbjct: 203 LPADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALT-NGTG 261
Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP-ETFNRFPS 269
G IIDSG+ +T VY + ++F + + +L + C+ P + P
Sbjct: 262 GTIIDSGTSITSLPPQVYQVVRDEFAA---QIKLPVVPGNATGHYTCFSAPSQAKPDVPK 318
Query: 270 MAFYFEDANLRIDGEN-VFII--DYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNI 326
+ +FE A + + EN VF + D N LA+ D+ +IG+ QQ++ +YDL
Sbjct: 319 LVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKGDE-TTIIGNFQQQNMHVLYDLQN 377
Query: 327 DLLSFVKENC 336
++LSFV C
Sbjct: 378 NMLSFVAAQC 387
>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
Length = 451
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 95/361 (26%), Positives = 146/361 (40%), Gaps = 67/361 (18%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
VR+ +G+P L++D+GS +I+ +FDP SSSF ++C C
Sbjct: 132 VRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGVSCGSAICR 191
Query: 48 YFKCVNE-------QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
+C Y++ Y D S TKG A ET+++ G G GC + N
Sbjct: 192 TLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTL-----GGTAVQGVAIGCGHRN 246
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
G A AG+LGL +S + QLG FSYCL
Sbjct: 247 SGLFVGA-----AGLLGLGWGAMSLVGQLGGAAGGVFSYCLA------------------ 283
Query: 161 MGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
S A + ++FYY+ L I + ER+ F +T G GG ++D+G+ +
Sbjct: 284 ------SRGAGGAGSLASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAV 337
Query: 221 TYFHSDVYWKLHEKF---VSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYF-E 275
T + Y L F + R L D CY L + R P+++FYF +
Sbjct: 338 TRLPREAYAALRGAFDGAMGALPRSPAVSLLDT------CYDLSGYASVRVPTVSFYFDQ 391
Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
A L + N +++ F LA AP ++++G+ QQ + D + F
Sbjct: 392 GAVLTLPARN-LLVEVGGAVFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPNT 450
Query: 336 C 336
C
Sbjct: 451 C 451
>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 536
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 114/383 (29%), Positives = 162/383 (42%), Gaps = 65/383 (16%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
+ +F+GTP K V LILDTGS L + ++P +SSS++ I+C P C
Sbjct: 172 IDMFVGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQNGPHYNPNESSSYRNISCYDPRCQ 231
Query: 48 ---------YFKCVNEQCVYTMKYADQSVTKGFAAHETISV-IGKGEGKAIFH---GALF 94
+ K N+ C Y YAD S T G A ET +V + GK F +F
Sbjct: 232 LVSSPDPLQHCKTENQTCPYFYDYADGSNTTGDFALETFTVNLTWPNGKEKFKHVVDVMF 291
Query: 95 GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY 154
GC + N GF A R +SF SQL SI FSYCL N SS
Sbjct: 292 GCGHWNKGFFHGAGGLLGL-----GRGPLSFPSQLQSIYGHSFSYCLTDLFSNTS-VSSK 345
Query: 155 LKFGTDMGYRRPSTQATKFINHPN---------------NFYYLSLKDISIDNERMNFPP 199
L FG D + +NH N FYYL +K I + E ++ P
Sbjct: 346 LIFGED----------KELLNHHNLNFTKLLAGEETPDDTFYYLQIKSIVVGGEVLDIPE 395
Query: 200 DTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF 259
T+ + G GG IIDSGS LT+F Y + E F ++ +L Q++ + CY
Sbjct: 396 KTWHWSSEGVGGTIIDSGSTLTFFPDSAYDVIKEAFE---KKIKLQQIAADDFIMSPCYN 452
Query: 260 LPETFN-RFPSMAFYFED-ANLRIDGENVFIIDYENHFFLLAV--APHDDLVALIGSQQQ 315
+ P +F D A EN F + LA+ P+ + +IG+ Q
Sbjct: 453 VSGAMQVELPDYGIHFADGAVWNFPAENYFYQYEPDEVICLAILKTPNHSHLTIIGNLLQ 512
Query: 316 RDTRFVYDLNIDLLSFVKENCSD 338
++ +YD+ L + C++
Sbjct: 513 QNFHILYDVKRSRLGYSPRRCAE 535
>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
Length = 390
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 101/363 (27%), Positives = 156/363 (42%), Gaps = 47/363 (12%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
R+ IG+P + L LDTGS + + I+DP SSS++++ C C
Sbjct: 47 ARMGIGSPQRSYYLELDTGSDVTWIQCAPCSSCYSQVDPIYDPSNSSSYRRVYCGSALCQ 106
Query: 48 ---YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
Y C C Y + Y D S + G E+ +G A+ + A FGC + N G
Sbjct: 107 ALDYSACQGMGCSYRVVYGDSSASSGDLGIESF-YLGPNSSTAMRNIA-FGCGHSNSGLF 164
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG-TDMGY 163
T+SF SQ+ + I FSYCLV + SS L FG T + +
Sbjct: 165 RGEAGLLGM-----GGGTLSFFSQIAASIGPAFSYCLVDRYSQLQSRSSPLIFGRTAIPF 219
Query: 164 RRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
+ + T + +P + FYY L IS+ + PP F +T +G GG I+DSG+ +T
Sbjct: 220 ---AARFTPLLKNPRIDTFYYAILTGISVGGTALPIPPAQFALTGNGTGGAILDSGTSVT 276
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-------RFPSMAFYF 274
Y L + + + P Y L FN + PS+ +F
Sbjct: 277 RVVPAAYAVLRDAYRAASRNL---------PPAPGVYLLDTCFNFQGLPTVQIPSLVLHF 327
Query: 275 E-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
+ D ++ + G N+ I + F LA AP +++IG+ QQ+ R +DL L++
Sbjct: 328 DNDVDMVLPGGNILIPVDRSGTFCLAFAPSSMPISVIGNVQQQTFRIGFDLQRSLIAIAP 387
Query: 334 ENC 336
C
Sbjct: 388 REC 390
>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
Length = 520
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 110/376 (29%), Positives = 161/376 (42%), Gaps = 56/376 (14%)
Query: 4 LFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYF 49
+ +G+P K LILDTGS L + A +DP+ S+S++ I C+ P C
Sbjct: 159 VLVGSPPKHFSLILDTGSDLNWIQCLPCHDCFQQNGAFYDPKASASYKNITCNDPRCNLV 218
Query: 50 ---------KCVNEQCVYTMKYADQSVTKGFAAHETISV--IGKGEGKAIFH--GALFGC 96
K N+ C Y Y D S T G A ET +V G +++ +FGC
Sbjct: 219 SPPDPPKPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTSGGSSELYNVENMMFGC 278
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
+ N G A R +SF SQL S+ FSYCLV + SS L
Sbjct: 279 GHWNRGLFHGAAGLLGL-----GRGPLSFSSQLQSLYGHSFSYCLV-DRNSDTNVSSKLI 332
Query: 157 FGTDMGY-RRPSTQATKFINHPNN----FYYLSLKDISIDNERMNFPPDTFDITVSGEGG 211
FG D P+ T F+ N FYY+ +K I + E +N P +T++I+ G GG
Sbjct: 333 FGEDKDLLSHPNLNFTSFVARKENLVDTFYYVQIKSIIVAGEVLNIPEETWNISSDGAGG 392
Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFN----- 265
IIDSG+ L+YF Y F + ++A+ + P+ + L FN
Sbjct: 393 TIIDSGTTLSYFAEPAY---------EFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGID 443
Query: 266 --RFPSMAFYFED-ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVY 322
+ P + F D A EN FI E+ L + ++IG+ QQ++ +Y
Sbjct: 444 SIQLPELGIAFADGAVWNFPTENSFIWLNEDLVCLAILGTPKSAFSIIGNYQQQNFHILY 503
Query: 323 DLNIDLLSFVKENCSD 338
D L + C+D
Sbjct: 504 DTKRSRLGYAPTKCAD 519
>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 492
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 109/366 (29%), Positives = 166/366 (45%), Gaps = 46/366 (12%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
++ +GTP+ L++LDTGS +++ +FDPR+S S+ + C P C
Sbjct: 142 TKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYEQSGQVFDPRRSRSYNAVGCAAPLCR 201
Query: 48 YFKC-----VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C+Y + Y D SVT G A ET++ G G + AL GC +DN G
Sbjct: 202 RLDSGGCDLRRSACLYQVAYGDGSVTAGDFATETLTFAG---GARVARVAL-GCGHDNEG 257
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV--IPLPNGEYTSSYLKFGTD 160
A AG+LGL R ++SF +Q+ + FSYCLV N SS + FG+
Sbjct: 258 LFV-----AAAGLLGLGRGSLSFPTQISRRYGRSFSYCLVDRTSSANTASRSSTVTFGSG 312
Query: 161 MGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITV---SGEGGCIID 215
++ T + +P FYY+ L IS+ R+ ++ D+ + SG GG I+D
Sbjct: 313 AVGSTVASSFTPMVKNPRMETFYYVQLIGISVGGARVPGVANS-DLRLDPSSGRGGVIVD 371
Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL---CYFLP-ETFNRFPSMA 271
SG+ +T Y L + F R A L P L CY L + P+++
Sbjct: 372 SGTSVTRLARPAYSALRDAF-----RGAAAGLRLSPGGFSLFDTCYDLSGRKVVKVPTVS 426
Query: 272 FYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
+F A + EN I F A A D V++IG+ QQ+ R V+D + ++
Sbjct: 427 MHFAGGAEAALPPENYLIPVDSKGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVA 486
Query: 331 FVKENC 336
F + C
Sbjct: 487 FTPKGC 492
>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
Length = 460
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 100/378 (26%), Positives = 163/378 (43%), Gaps = 59/378 (15%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
+V IGTP + +LDTGS LI+ ++ P +S ++ ++C
Sbjct: 101 LVDFAIGTPPLALSAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSVTYANVSCGSRL 160
Query: 46 CTYFKCVNEQ----------------CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIF 89
C + C Y Y D S T G A ET + G G +
Sbjct: 161 CDALPSLRPSSRCSASASAPAPERGGCTYYYSYGDGSSTDGVLATETFTF---GAGTTV- 216
Query: 90 HGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGE 149
H FGC DN G +++ +G++G+ R +S +SQLG +FSYC P N
Sbjct: 217 HDLAFGCGTDNLGGTDNS-----SGLVGMGRGPLSLVSQLG---VTKFSYCFT-PF-NDT 266
Query: 150 YTSSYLKFGTDMGYRRPSTQATKFINHPN-----NFYYLSLKDISIDNERMNFPPDTFDI 204
TSS L G+ P+ ++T F+ P+ ++YYLSL+ I++ + + P F +
Sbjct: 267 TTSSPLFLGSSASLS-PAAKSTPFVPSPSGPRRSSYYYLSLEGITVGDTLLPIDPAVFRL 325
Query: 205 TVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF 264
T SG GG IIDSG+ T + L + LA S + +C+ P+
Sbjct: 326 TASGRGGLIIDSGTTFTALEERAFVVLARAVAARVA-LPLA--SGAHLGLSVCFAAPQGR 382
Query: 265 NR----FPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRF 320
P + +F+ A++ + + + D L + + +++GS QQ++
Sbjct: 383 GPEAVDVPRLVLHFDGADMELPRSSAVVEDRVAGVACLGIVSARGM-SVLGSMQQQNMHV 441
Query: 321 VYDLNIDLLSFVKENCSD 338
YD+ D+LSF NC +
Sbjct: 442 RYDVGRDVLSFEPANCGE 459
>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
Length = 407
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 112/369 (30%), Positives = 168/369 (45%), Gaps = 51/369 (13%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHP--- 44
VRL +GTP++ + +++DTGS L + IFDPR SSSFQ+I C P
Sbjct: 56 VRLGLGTPARSLFMVVDTGSDLPWLQCQPCKSCYKQADPIFDPRNSSSFQRIPCLSPLCK 115
Query: 45 -----DCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
C+ + +C Y + Y D S + G + + ++ G G A FGC
Sbjct: 116 ALEVHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTL---GTGSKAMSVA-FGC--- 168
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQL-----GSIIKKRFSYCLVIPLPNGEYTSSY 154
GFD + AG+LGL +SF SQ+ S FSYCLV +SS
Sbjct: 169 --GFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDRSNPMTRSSSS 226
Query: 155 LKFGTDMGYRRPSTQA-TKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGG 211
L FG PST A + + +P + FYY ++ +S+ ++ + ++ SG GG
Sbjct: 227 LIFGVAA---IPSTAALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQLSQSGSGG 283
Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCY-FLPETFNRFP 268
IIDSG+ +T F + VY + + F R L P CY F + P
Sbjct: 284 VIIDSGTSVTRFPTSVYATIRDAF-----RNATINLPSAPRYSLFDTCYNFSGKASVDVP 338
Query: 269 SMAFYFED-ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNID 327
++ +FE+ A+L++ N I F LA AP + +IG+ QQ+ R +DL
Sbjct: 339 ALVLHFENGADLQLPPTNYLIPINTAGSFCLAFAPTSMELGIIGNIQQQSFRIGFDLQKS 398
Query: 328 LLSFVKENC 336
L+F + C
Sbjct: 399 HLAFAPQQC 407
>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 105/362 (29%), Positives = 169/362 (46%), Gaps = 42/362 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ L +GTP +L I DTGS LI+ +FDP+ S +++ ++CD C
Sbjct: 94 LMSLSLGTPPFEILAIADTGSDLIWTQCTPCDKCYKQIAPLFDPKSSKTYRDLSCDTRQC 153
Query: 47 TYF----KCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
C +EQ C Y+ Y D+S T G A +T+++ G F + GC N+
Sbjct: 154 QNLGESSSCSSEQLCQYSYYYGDRSFTNGNLAVDTVTLPSTNGGPVYFPKTVIGCGRRNN 213
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
G D +D +G++GL +S ISQ+GS + +FSYCLV SS L FG +
Sbjct: 214 G-TFDKKD---SGIIGLGGGPMSLISQMGSSVGGKFSYCLVPFSSESAGNSSKLHFGRNA 269
Query: 162 GYRRPSTQATKFIN-HPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
Q+T I+ +P+ FYYL+L+ +S+ ++++ EG IIDSG+ L
Sbjct: 270 VVSGSGVQSTPLISKNPDTFYYLTLEAMSVGDKKIE---FGGSSFGGSEGNIIIDSGTSL 326
Query: 221 TYFHSDVYWKLH---EKFVSYFERFQLAQ--LSDCPEPIQLCYFLPETFNRFPSMAFYFE 275
T F + + + E V ER Q A LS C P P+ + P + +F
Sbjct: 327 TLFPVNFFTEFATAVENAVINGERTQDASGLLSHCYRPT------PDL--KVPVITAHFN 378
Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
A++ + N FI+ ++ L + A+ G+ Q + YD+ +SF +
Sbjct: 379 GADVVLQTLNTFILISDDVLCLAFNSTQSG--AIFGNVAQMNFLIGYDIQGKSVSFKPTD 436
Query: 336 CS 337
C+
Sbjct: 437 CT 438
>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
Length = 446
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 98/363 (26%), Positives = 151/363 (41%), Gaps = 47/363 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
+V + +GTP + LI DTGS L + IF+P KS+S+ ++C
Sbjct: 105 IVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAA 164
Query: 46 CTYFK--------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
C C C+Y ++Y DQS + GF A E ++ +F G FGC
Sbjct: 165 CGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKEKFTL----TNSDVFDGVYFGCG 220
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
+N G +AG+LGL R +SF SQ + K FSYC LP+ + +L F
Sbjct: 221 ENNQGLFT-----GVAGLLGLGRDKLSFPSQTATAYNKIFSYC----LPSSASYTGHLTF 271
Query: 158 GTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
G+ R I +FY L++ I++ +++ P F G +IDSG
Sbjct: 272 GSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTP-----GALIDSG 326
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERF-QLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE- 275
+V+T Y L F + ++ + +S L F T P +AF F
Sbjct: 327 TVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVT---IPKVAFSFSG 383
Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLNIDLLSFVKE 334
A + + + +F + + L DD A+ G+ QQ+ VYD + F
Sbjct: 384 GAVVELGSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPN 443
Query: 335 NCS 337
CS
Sbjct: 444 GCS 446
>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 474
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 98/363 (26%), Positives = 151/363 (41%), Gaps = 47/363 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
+V + +GTP + LI DTGS L + IF+P KS+S+ ++C
Sbjct: 133 IVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAA 192
Query: 46 CTYFK--------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
C C C+Y ++Y DQS + GF A E ++ +F G FGC
Sbjct: 193 CGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKEKFTLTNS----DVFDGVYFGCG 248
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
+N G +AG+LGL R +SF SQ + K FSYC LP+ + +L F
Sbjct: 249 ENNQGLFT-----GVAGLLGLGRDKLSFPSQTATAYNKIFSYC----LPSSASYTGHLTF 299
Query: 158 GTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
G+ R I +FY L++ I++ +++ P F G +IDSG
Sbjct: 300 GSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTP-----GALIDSG 354
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERF-QLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE- 275
+V+T Y L F + ++ + +S L F T P +AF F
Sbjct: 355 TVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVT---IPKVAFSFSG 411
Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLNIDLLSFVKE 334
A + + + +F + + L DD A+ G+ QQ+ VYD + F
Sbjct: 412 GAVVELGSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPN 471
Query: 335 NCS 337
CS
Sbjct: 472 GCS 474
>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 484
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 98/359 (27%), Positives = 153/359 (42%), Gaps = 43/359 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+V + +GTP++ + ++ DTGS L + +FDP +SS++ + C P+C
Sbjct: 147 VVSMGLGTPARDMTVVFDTGSDLSWVQCTPCSDCYEQKDPLFDPARSSTYSAVPCASPEC 206
Query: 47 TYF---KCV-NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C +++C Y + Y DQS T G A +T+++ + G +FGC + G
Sbjct: 207 QGLDSRSCSRDKKCRYEVVYGDQSQTDGALARDTLTLTQSD----VLPGFVFGCGEQDTG 262
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
G G++GL R +S SQ S FSYC LP+ + YL G
Sbjct: 263 LF-----GRADGLVGLGREKVSLSSQAASKYGAGFSYC----LPSSPSAAGYLSLGGPAP 313
Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
T + P +FYY+ L + + + P F G +IDSG+V+T
Sbjct: 314 ANARFTAMETRHDSP-SFYYVRLVGVKVAGRTVRVSPIVFS-----AAGTVIDSGTVITR 367
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE-DANLR 280
VY L F R+ + + + CY F T R PS+A F A +
Sbjct: 368 LPPRVYAALRSAFARSMGRYGYKR-APALSILDTCYDFTGHTTVRIPSVALVFAGGAAVG 426
Query: 281 IDGENVFIIDYENHFFLLAVAPHDDL--VALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+D V + + LA AP+ D +IG+ QQ+ VYD+ + F CS
Sbjct: 427 LDFSGVLYVAKVSQ-ACLAFAPNGDGADAGIIGNTQQKTLAVVYDVARQKIGFGANGCS 484
>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
Length = 429
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 103/381 (27%), Positives = 156/381 (40%), Gaps = 53/381 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY---------AIFDPRK------------SSSFQKI 39
+V + GTP + VLLI DTGS LI+ F P+K S++ +
Sbjct: 54 LVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFVASKSATLSVV 113
Query: 40 NCDHPDCTYFKCVNEQ-----------CVYTMKYADQSVTKGFAAHETISVIGKGEGKAI 88
C C C Y YAD S T GF A +T ++ G A
Sbjct: 114 PCSAAQCLLVPAPRGHGPACSPAAPVPCGYAYDYADGSSTTGFLARDTATISNGTSGGAA 173
Query: 89 FHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNG 148
G FGC N G GV+GL + +SF +Q GS+ + FSYCL + L G
Sbjct: 174 VRGVAFGCGTRNQGGSFSG----TGGVIGLGQGQLSFPAQSGSLFAQTFSYCL-LDLEGG 228
Query: 149 EY--TSSYLKFGTDMGYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDI 204
+SS+L G RR + T +++P FYY+ + I + N + P + I
Sbjct: 229 RRGRSSSFLFLGRPE--RRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWAI 286
Query: 205 TVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL---- 260
V G GG +IDSGS LTY Y L F + ++ + + ++LCY +
Sbjct: 287 DVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQGLELCYNVSSSS 346
Query: 261 --PETFNRFPSMAFYFEDA-NLRIDGENVFIIDYENHFFLLAVAPHDDLVA--LIGSQQQ 315
FP + F +L + N +++D + LA+ P A ++G+ Q
Sbjct: 347 SSAPANGGFPRLTIDFAQGLSLELPTGN-YLVDVADDVKCLAIRPTLSPFAFNVLGNLMQ 405
Query: 316 RDTRFVYDLNIDLLSFVKENC 336
+ +D + F + C
Sbjct: 406 QGYHVEFDRASARIGFARTEC 426
>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 486
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 99/355 (27%), Positives = 155/355 (43%), Gaps = 43/355 (12%)
Query: 3 RLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTY 48
R+ IG P V ++LDTGS + + IF+P S+SF ++C+ C
Sbjct: 154 RVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPIFEPTSSASFTSLSCETEQCKS 213
Query: 49 F---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
+C N C+Y + Y D S T G ET+++ G GC ++N G
Sbjct: 214 LDVSECRNGTCLYEVSYGDGSYTVGDFVTETVTL-----GSTSLGNIAIGCGHNNEGLFI 268
Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
A ++SF SQL + FSYCLV ++S L F + +
Sbjct: 269 GAAGLLGL-----GGGSLSFPSQLNA---SSFSYCLV---DRDSDSTSTLDFNSPI---T 314
Query: 166 PSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
P +PN F+YL L +S+ + P +F ++ G GG I+DSG+ +T
Sbjct: 315 PDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGIIVDSGTAVTRL 374
Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL-PETFNRFPSMAFYFEDAN-LRI 281
+ VY L + FV Q A+ CY L ++ P+++F+F + N L +
Sbjct: 375 QTTVYNVLRDAFVKSTHDLQTARGVAL---FDTCYDLSSKSRVEVPTVSFHFANGNELPL 431
Query: 282 DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+N I F A AP D ++++G+ QQ+ TR +DL L+ F C
Sbjct: 432 PAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVGFDLANSLVGFSPNKC 486
>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
protein [Arabidopsis thaliana]
gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 452
Score = 117 bits (294), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 108/379 (28%), Positives = 161/379 (42%), Gaps = 57/379 (15%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPDC 46
V L IG P + +LLI DTGS L++ +F PR SS+F +C P C
Sbjct: 86 VDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDPVC 145
Query: 47 TYFK-------C----VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFG 95
C ++ C Y YAD S+T G A ET S+ +A FG
Sbjct: 146 RLVPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTSSGKEARLKSVAFG 205
Query: 96 CSNDNHG--FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT-- 151
C G + +GA GV+GL R ISF SQLG +FSYCL+ +YT
Sbjct: 206 CGFRISGQSVSGTSFNGA-NGVMGLGRGPISFASQLGRRFGNKFSYCLM------DYTLS 258
Query: 152 ---SSYLKFGTDMGYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITV 206
+SYL G + G T + +P FYY+ LK + ++ ++ P ++I
Sbjct: 259 PPPTSYLIIG-NGGDGISKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDD 317
Query: 207 SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP-IQLCYFL----- 260
SG GG ++DSG+ L + Y ++ R ++D P LC +
Sbjct: 318 SGNGGTVVDSGTTLAFLAEPAY----RSVIAAVRRRVKLPIADALTPGFDLCVNVSGVTK 373
Query: 261 PETFNRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLV--ALIGSQQQRDT 318
PE P + F F + + + I+ E LA+ D V ++IG+ Q+
Sbjct: 374 PEKI--LPRLKFEFSGGAVFVPPPRNYFIETEEQIQCLAIQSVDPKVGFSVIGNLMQQGF 431
Query: 319 RFVYDLNIDLLSFVKENCS 337
F +D + L F + C+
Sbjct: 432 LFEFDRDRSRLGFSRRGCA 450
>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 429
Score = 117 bits (294), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 105/362 (29%), Positives = 162/362 (44%), Gaps = 52/362 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSAL--------------IYAIFDPRKSSSFQKINCDHPDC 46
++ + G P + I+DTGS L + A FDP KS+S++ + C C
Sbjct: 91 LIDISYGNPPQKSTAIVDTGSDLNWVQCLPCKSCYETLSAKFDPSKSASYKTLGCGSNFC 150
Query: 47 T--YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
F+ C Y Y D S T G + + +++ G GK FGC N N G
Sbjct: 151 QDLPFQSCAASCQYDYMYGDGSSTSGALSTDDVTI---GTGK--IPNVAFGCGNSNLGTF 205
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
A + +S +SQLG K+FSYCLV PL + + + Y+ T G
Sbjct: 206 AGAGGLVGL-----GKGPLSLVSQLGGTATKKFSYCLV-PLGSTKTSPLYIGDSTLAGGV 259
Query: 165 RPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
+ T N+P FYY L+ IS++ + +N+P +TFDI +G GG I+DSG+ LTY
Sbjct: 260 AYTPMLTNN-NYPT-FYYAELQGISVEGKAVNYPANTFDIAATGRGGLILDSGTTLTYLD 317
Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCPEP--------IQLCYFLPETFN-RFPSMAFYFE 275
D F A + P P ++ C+ N +P++ F+F
Sbjct: 318 VDA-----------FNPMVAALKAALPYPEADGSFYGLEYCFSTAGVANPTYPTVVFHFN 366
Query: 276 DANLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
A++ + +N FI +D+E LA+A ++ G+ QQ + V+DL + F
Sbjct: 367 GADVALAPDNTFIALDFEGT-TCLAMASSTGF-SIFGNIQQLNHVIVHDLVNKRIGFKSA 424
Query: 335 NC 336
NC
Sbjct: 425 NC 426
>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 447
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 113/376 (30%), Positives = 166/376 (44%), Gaps = 58/376 (15%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
+ + IGTP V I DTGS L + IFD +KSS+++ CD +C
Sbjct: 87 MSITIGTPPMKVFAIADTGSDLTWVQCKPCQQCYKENGPIFDKKKSSTYKSEPCDSRNCH 146
Query: 48 YFKCVNEQC-------VYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
C Y Y DQS +KG A ETIS+ F G +FGC +N
Sbjct: 147 ALSSSERGCDESKNVCKYRYSYGDQSFSKGDVATETISIDSASGSPVSFPGTVFGCGYNN 206
Query: 101 HG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV--IPLPNGEYTSSYLKF 157
G FDE G L S ISQLGS I K+FSYCL NG +S +
Sbjct: 207 GGTFDETGSGIIGLGGGHL-----SLISQLGSSISKKFSYCLSHKSATTNG---TSVINL 258
Query: 158 GTDMGYRRPSTQ-------ATKFIN-HPNNFYYLSLKDISIDNERM-----NFPPDTFDI 204
GT+ PS+ +T ++ P +YYL+L+ IS+ +++ ++ P+ I
Sbjct: 259 GTN---SIPSSLSKDSGVISTPLVDKEPRTYYYLTLEAISVGKKKIPYTGSSYNPNDGGI 315
Query: 205 TVSGEGGCIIDSGSVLTYFHS---DVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP 261
G IIDSG+ LT S D + E+ V+ +R +SD + C+
Sbjct: 316 FSETSGNIIIDSGTTLTLLDSGFFDKFGAAVEELVTGAKR-----VSDPQGLLSHCFKSG 370
Query: 262 ETFNRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFV 321
P + +F A++R+ N F+ E+ L++ P + VA+ G+ Q D
Sbjct: 371 SAEIGLPEITVHFTGADVRLSPINAFVKVSED-MVCLSMVPTTE-VAIYGNFAQMDFLVG 428
Query: 322 YDLNIDLLSFVKENCS 337
YDL +SF + +CS
Sbjct: 429 YDLETRTVSFQRMDCS 444
>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
Length = 470
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 94/363 (25%), Positives = 157/363 (43%), Gaps = 53/363 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
+ L +GTP+ +++DTGS+L + ++DPR SS++ + C
Sbjct: 135 VTELGLGTPATSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLYDPRASSTYATVPCSASQ 194
Query: 46 CTYFKC---------VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
C + V C+Y Y D S + G+ + +T+S G + +GC
Sbjct: 195 CDELQAATLNPSACSVRNVCIYQASYGDSSFSVGYLSRDTVSF-----GSGSYPNFYYGC 249
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
DN G G AG++GL+R +S + QL + FSYCL P G YL
Sbjct: 250 GQDNEGLF-----GRSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPTPASTG-----YLS 299
Query: 157 FGT-DMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
G G+ + A+ ++ + Y+++L +S+ + P + + IID
Sbjct: 300 IGPYTSGHYSYTPMASSSLDA--SLYFVTLSGMSVGGSPLAVSPAEYSSLPT-----IID 352
Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE 275
SG+V+T + VY L + + Q A + C+ + R P++A F
Sbjct: 353 SGTVITRLPTAVYTALSKAVAAAMVGVQSAPAFSI---LDTCFQGQASQLRVPAVAMAFA 409
Query: 276 -DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
A L++ +NV +ID ++ LA AP D +IG+ QQ+ VYD+ + F
Sbjct: 410 GGATLKLATQNV-LIDVDDSTTCLAFAPTDS-TTIIGNTQQQTFSVVYDVAQSRIGFAAG 467
Query: 335 NCS 337
CS
Sbjct: 468 GCS 470
>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 546
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 112/373 (30%), Positives = 157/373 (42%), Gaps = 46/373 (12%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
+ +F+GTP K LILDTGS L + +DP +SSS++ I C C
Sbjct: 183 IDVFVGTPPKHFSLILDTGSDLNWIQCVPCYECFEQNGPHYDPGQSSSYRNIGCHDSRCH 242
Query: 48 YF---------KCVNEQCVYTMKYADQSVTKGFAAHETISV-----IGKGEGKAIFHGAL 93
K N+ C Y Y D S T G A ET +V GK E + + +
Sbjct: 243 LVSSPDPPQPCKAENQTCPYYYWYGDSSNTTGDFALETFTVNLTMSSGKPELRRV-ENVM 301
Query: 94 FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSS 153
FGC + N G A R +SF SQL S+ FSYCLV + SS
Sbjct: 302 FGCGHWNRGLFHGAAGLLGL-----GRGPLSFSSQLQSLYGHSFSYCLVDRNSDAN-VSS 355
Query: 154 YLKFGTDMGY-RRPSTQATKFI----NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSG 208
L FG D P T + N + FYY+ +K I + E +N P + + I G
Sbjct: 356 KLIFGEDKDLLSHPELNFTTLVAGKENPVDTFYYVQIKSIVVGGEVVNIPEEKWQIATDG 415
Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-F 267
GG IIDSG+ L+YF Y + E F++ + + + + EP CY +
Sbjct: 416 SGGTIIDSGTTLSYFAEPAYQVIKEAFMAKVKGYPVVKDFPVLEP---CYNVTGVEQPDL 472
Query: 268 PSMAFYFED-ANLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLN 325
P F D A EN FI I+ L + +++IG+ QQ++ +YD
Sbjct: 473 PDFGIVFSDGAVWNFPVENYFIEIEPREVVCLAILGTPPSALSIIGNYQQQNFHILYDTK 532
Query: 326 IDLLSFVKENCSD 338
L F C+D
Sbjct: 533 KSRLGFAPTKCAD 545
>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 96/357 (26%), Positives = 153/357 (42%), Gaps = 43/357 (12%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
R+ +G P++ ++LDTGS + + IFDP SS++ + C C+
Sbjct: 22 TRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPTASSTYAPVTCQSQQCS 81
Query: 48 YFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
+ C + QC+Y + Y D S T G A E++S G K + GC +DN G
Sbjct: 82 SLEMSSCRSGQCLYQVNYGDGSYTFGDFATESVSFGNSGSVKNV----ALGCGHDNEGL- 136
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY--LKFGTDMG 162
G GL + +S + FSYCLV G T + + G D
Sbjct: 137 -------FVGAAGLLGLGGGPLSLTNQLKATSFSYCLVNRDSAGSSTLDFNSAQLGVD-- 187
Query: 163 YRRPSTQATKFINHP-NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
S A N + FYY+ L +S+ + ++ P TF + SG GG I+D G+ +T
Sbjct: 188 ----SVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIVDCGTAIT 243
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP-ETFNRFPSMAFYFEDA-NL 279
+ Y L + FV + +L+ CY L + R P+++F+F D +
Sbjct: 244 RLQTQAYNPLRDAFVRMTQNL---KLTSAVALFDTCYDLSGQASVRVPTVSFHFADGKSW 300
Query: 280 RIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ N I + A AP +++IG+ QQ+ TR +DL + + F C
Sbjct: 301 NLPAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVTFDLANNRMGFSPNKC 357
>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
Length = 463
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 101/353 (28%), Positives = 154/353 (43%), Gaps = 40/353 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA-------------IFDPRKSSSFQKINCDHPDCT 47
+V + IGTP K + LI DTGS LI+ +FDP KS+SF+ + C C
Sbjct: 133 IVNVGIGTPKKEMPLIFDTGSGLIWTQCKPCKACYPKVPVFDPTKSASFKGLPCSSKLCQ 192
Query: 48 YFK--CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
+ C + +C Y Y D S + G A ETIS K F L GCS+ G
Sbjct: 193 SIRQGCSSPKCTYLTAYVDNSSSTGTLATETISF---SHLKYDFKNILIGCSDQVSG--- 246
Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
+G++GL+R IS SQ +I K FSYC +P+ ++ +L FG +
Sbjct: 247 --ESLGESGIMGLNRSPISLASQTANIYDKLFSYC----IPSTPGSTGHLTFGGKVPNDV 300
Query: 166 PSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHS 225
+ +K P++ Y + + IS+ ++ F I + IDSG+VLT
Sbjct: 301 RFSPVSK--TAPSSDYDIKMTGISVGGRKLLIDASAFKIAST------IDSGAVLTRLPP 352
Query: 226 DVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFEDA-NLRIDG 283
Y L F + + L D + CY F + PS++ +FE + ID
Sbjct: 353 KAYSALRSVFREMMKGYPLLDQDDF---LDTCYDFSNYSTVAIPSISVFFEGGVEMDIDV 409
Query: 284 ENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ + + LA A DD V++ G+ QQ+ V+D + + F C
Sbjct: 410 SGIMWQVPGSKVYCLAFAELDDEVSIFGNFQQKTYTVVFDGAKERIGFAPGGC 462
>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
Length = 499
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 98/357 (27%), Positives = 154/357 (43%), Gaps = 43/357 (12%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
R+ +G P++ ++LDTGS + + IFDP SS++ + C C+
Sbjct: 163 TRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPTASSTYAPVTCQSQQCS 222
Query: 48 YFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
+ C + QC+Y + Y D S T G A E++S G K + GC +DN G
Sbjct: 223 SLEMSSCRSGQCLYQVNYGDGSYTFGDFATESVSFGNSGSVKNV----ALGCGHDNEGLF 278
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY--LKFGTDMG 162
A G LS +QL + FSYCLV G T + + G D
Sbjct: 279 VGAAGLLGLGGGPLS-----LTNQLKAT---SFSYCLVNRDSAGSSTLDFNSAQLGVD-- 328
Query: 163 YRRPSTQATKFINHP-NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
S A N + FYY+ L +S+ + ++ P TF + SG GG I+D G+ +T
Sbjct: 329 ----SVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIVDCGTAIT 384
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP-ETFNRFPSMAFYFEDA-NL 279
+ Y L + FV + +L+ CY L + R P+++F+F D +
Sbjct: 385 RLQTQAYNPLRDAFVRMTQNL---KLTSAVALFDTCYDLSGQASVRVPTVSFHFADGKSW 441
Query: 280 RIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ N I + A AP +++IG+ QQ+ TR +DL + + F C
Sbjct: 442 NLPAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVTFDLANNRMGFSPNKC 498
>gi|255553149|ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223543249|gb|EEF44781.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 449
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 103/368 (27%), Positives = 160/368 (43%), Gaps = 45/368 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++R+ IG P +L I DTGS LI+ IFDPR+SSS++ + C + C
Sbjct: 94 LMRISIGNPQVEILAIADTGSDLIWVQCQPCEMCYKQNSPIFDPRRSSSYRNVLCGNEFC 153
Query: 47 TYF---------KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGK----AIFHGAL 93
+ + C YT Y DQS + G A E + A F
Sbjct: 154 NKLDGEARSCDARGFVKTCGYTYSYGDQSFSDGHLAIERFGIGSTNSNTSAAIAYFQEVA 213
Query: 94 FGCSNDNHG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS 152
FGC N G FDE ++S +SQLG + +FSYCLV YTS
Sbjct: 214 FGCGTKNGGTFDELGSGIIGL-----GGGSMSLVSQLGPKLSGKFSYCLVPTSEQSNYTS 268
Query: 153 SYLKFGTDM---GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
+ FG D+ G P +YYL+L+ IS++N+R+ + + ++ V +
Sbjct: 269 K-INFGNDINISGSNYNVVSTPLLPKKPETYYYLTLEAISVENKRLPY-TNLWNGEVE-K 325
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPS 269
G IIDSG+ LT+ S+ + L + E + ++SD +C F E P
Sbjct: 326 GNIIIDSGTTLTFLDSEFFNNLDS---AVEEAVKGERVSDPHGLFNIC-FKDEKAIELPI 381
Query: 270 MAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
+ +F A++ + N F E + P +D +A+ G+ Q + YDL +
Sbjct: 382 ITAHFTGADVELQPVNTF-AKVEEDLLCFTMIPSND-IAIFGNLAQMNFLVGYDLEKKAV 439
Query: 330 SFVKENCS 337
SF+ +C+
Sbjct: 440 SFLPTDCT 447
>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 453
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 108/361 (29%), Positives = 157/361 (43%), Gaps = 44/361 (12%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
RL +GTP + + ++LDTGS +++ IF+P KS SF I C P C
Sbjct: 112 TRLGVGTPPRYLYMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPYKSKSFAGIPCSSPLCR 171
Query: 48 YF---KCVNEQ--CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C + C+Y + Y D S T G A ET++ G K GC + N G
Sbjct: 172 RLDSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVAL-----GCGHHNEG 226
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
A R +SF SQ G +FSYCLV + + +S + FG D
Sbjct: 227 LFVGAAGLLGL-----GRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSS--MVFG-DAA 278
Query: 163 YRRPSTQATKFINHP--NNFYYLSLKDISIDNERM-NFPPDTFDITVSGEGGCIIDSGSV 219
R + + T I +P + FYY+ L IS+ R+ P F + +G GG IIDSG+
Sbjct: 279 ISRLA-RFTPLIRNPKLDTFYYVGLIGISVGGVRVRGVSPSLFKLDSAGNGGVIIDSGTS 337
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCYFLP-ETFNRFPSMAFYFED 276
+T Y L + F R L PE CY L ++ + P++ +F
Sbjct: 338 VTRLTRPAYTALRDAF-----RVGARHLKRGPEFSLFDTCYDLSGQSSVKVPTVVLHFRG 392
Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
A++ + N I EN F A A +++IG+ QQ+ R VYDL + F C
Sbjct: 393 ADMALPATNYLIPVDENGSFCFAFAGTISGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC 452
Query: 337 S 337
+
Sbjct: 453 T 453
>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 104/375 (27%), Positives = 163/375 (43%), Gaps = 61/375 (16%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDH-- 43
++ L IGTP I DTGS LI+ ++P S++F + C+
Sbjct: 89 IMTLAIGTPPLSYPAIADTGSDLIWTQCAPCGSQCFKQAGQPYNPSSSTTFGVLPCNSSV 148
Query: 44 ------------PDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHG 91
P C+ C+Y Y T G + ET + + G
Sbjct: 149 SMCAALAGPSPPPGCS--------CMYNQTYG-TGWTAGIQSVETFTFGSTPADQTRVPG 199
Query: 92 ALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT 151
FGCSN + D +G+ AG++GL R ++S +SQLG+ FSYCL P + T
Sbjct: 200 IAFGCSNAS----SDDWNGS-AGLVGLGRGSMSLVSQLGA---GMFSYCLT-PFQDANST 250
Query: 152 SSYLKFGTDMGYRRPSTQATKFINHPN-----NFYYLSLKDISIDNERMNFPPDTFDITV 206
S+ L G T F+ P+ +YYL+L ISI ++ PP+ F +
Sbjct: 251 STLL-LGPSAALNGTGVLTTPFVASPSKAPMSTYYYLNLTGISIGTTALSIPPNAFALRT 309
Query: 207 SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL---PET 263
G GG IIDSG+ +T Y ++ S +A SD + LC+ L T
Sbjct: 310 DGTGGLIIDSGTTITSLVDAAYQQVRAAIESLVT-LPVADGSDS-TGLDLCFALTSETST 367
Query: 264 FNRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHD-DLVALIGSQQQRDTRFVY 322
PSM F+F+ A++ + +N I+ + + LA+ ++ G+ QQ++ +Y
Sbjct: 368 PPSMPSMTFHFDGADMVLPVDNYMILG--SGVWCLAMRNQTVGAMSTFGNYQQQNVHLLY 425
Query: 323 DLNIDLLSFVKENCS 337
D++ + LSF CS
Sbjct: 426 DIHEETLSFAPAKCS 440
>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 434
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 108/365 (29%), Positives = 171/365 (46%), Gaps = 47/365 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSS-----FQKINC 41
++ +G P + I+DTGS +I+ IFDP KS++ F C
Sbjct: 87 LISYSVGIPPFQLYGIIDTGSDMIWLQCKPCEKCYNQTTRIFDPSKSNTYKILPFSSTTC 146
Query: 42 DHPDCTYFKCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAI-FHGALFGCSND 99
+ T N + C YT+ Y D S ++G + ET++ +G G ++ F + GC +
Sbjct: 147 QSVEDTSCSSDNRKMCEYTIYYGDGSYSQGDLSVETLT-LGSTNGSSVKFRRTVIGCGRN 205
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQL---GSIIKKRFSYCLVIPLPNGEYTSSYLK 156
N + +G +G++GL +S I+QL S I ++FSYCL + N SS L
Sbjct: 206 N----TVSFEGKSSGIVGLGNGPVSLINQLRRRSSSIGRKFSYCLA-SMSN---ISSKLN 257
Query: 157 FGTDMGYRRPSTQATKFINH-PNNFYYLSLKDISIDNERMNFPPDTFDITVSGE-GGCII 214
FG T +T + H P FYYL+L+ S+ N R+ F +F GE G II
Sbjct: 258 FGDAAVVSGDGTVSTPIVTHDPKVFYYLTLEAFSVGNNRIEFTSSSFRF---GEKGNIII 314
Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRF--PSMAF 272
DSG+ LT +D+Y KL E L ++ D + + LCY TF+ P +
Sbjct: 315 DSGTTLTLLPNDIYSKLESAVADLVE---LDRVKDPLKQLSLCY--RSTFDELNAPVIMA 369
Query: 273 YFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
+F A+++++ N F I+ E LA + + G+ Q++ YDL ++SF
Sbjct: 370 HFSGADVKLNAVNTF-IEVEQGVTCLAFI-SSKIGPIFGNMAQQNFLVGYDLQKKIVSFK 427
Query: 333 KENCS 337
+CS
Sbjct: 428 PTDCS 432
>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
Length = 446
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 100/374 (26%), Positives = 161/374 (43%), Gaps = 62/374 (16%)
Query: 6 IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYFK- 50
+GTPS +L++DTGS L++ +FDPR+SS+++++ C P C +
Sbjct: 92 VGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRSSTYRRVPCSSPQCRALRF 151
Query: 51 -------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
C Y + Y D S + G A + ++ + GC DN G
Sbjct: 152 PGCDSGGAAGGGCRYMVAYGDGSSSTGELATDKLAF----ANDTYVNNVTLGCGRDNEGL 207
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS-----SYLKFG 158
+ A AG+LG++R IS +Q+ F YCL G+ TS SYL FG
Sbjct: 208 FDSA-----AGLLGVARGKISISTQVAPAYGSVFEYCL------GDRTSRSTRSSYLVFG 256
Query: 159 TDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERM-NFPPDTFDI-TVSGEGGCII 214
PST T +++P + YY+ + S+ ER+ F + + T +G GG ++
Sbjct: 257 RTP--EPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLALDTATGRGGVVV 314
Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL---PET-------- 263
DSG+ ++ F D Y L + F + + +L+ CY L P
Sbjct: 315 DSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRGRPAASAPLIVLH 374
Query: 264 FNRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYD 323
F MA E+ L +DG Y L DD +++IG+ QQ+ R V+D
Sbjct: 375 FAGGADMALPPENYFLPVDGGRRRAASYRR---CLGFEAADDGLSVIGNVQQQGFRVVFD 431
Query: 324 LNIDLLSFVKENCS 337
+ + + F + C+
Sbjct: 432 VEKERIGFAPKGCT 445
>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 385
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 96/360 (26%), Positives = 156/360 (43%), Gaps = 39/360 (10%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
+R+ +GTP +G+ L++DTGS +++ +FDP KSS++ + C+ C
Sbjct: 39 IRVSVGTPPRGMYLVMDTGSDILWLQCAPCVSCYHQCDEVFDPYKSSTYSTLGCNSRQCL 98
Query: 48 YF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIG-KGEGKAIFHGALFGCSNDNHGF 103
CV +C+Y + Y D S + G A + +S+ G G+ + + GC +DN G+
Sbjct: 99 NLDVGGCVGNKCLYQVDYGDGSFSTGEFATDAVSLNSTSGGGQVVLNKIPLGCGHDNEGY 158
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
A + +SF +Q+ S RFSYCL G T S + G
Sbjct: 159 FVGAAGLLGL-----GKGPLSFPNQINSENGGRFSYCLT-----GRDTDSTERSSLIFGD 208
Query: 164 RRPSTQATKFINHPNN-----FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
+F +N FYYL + IS+ + P F + G GG IIDSG+
Sbjct: 209 AAVPPAGVRFTPQASNLRVSTFYYLKMTGISVGGSILTIPTSAFQLDSLGNGGVIIDSGT 268
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFE-D 276
+T + Y L E F + L CY L + + P++ +F+
Sbjct: 269 SVTRLQNAAYASLREAFRAGTSDLVLTTEFSL---FDTCYNLSDLSSVDVPTVTLHFQGG 325
Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
A+L++ N + + F LA A ++IG+ QQ+ R +YD + + FV C
Sbjct: 326 ADLKLPASNYLVPVDNSSTFCLAFAGTTG-PSIIGNIQQQGFRVIYDNLHNQVGFVPSQC 384
>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
Length = 481
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 93/361 (25%), Positives = 155/361 (42%), Gaps = 42/361 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+V + +GTP + +L++ DTGS L + +FDP +S+++ + C +C
Sbjct: 139 IVSVGLGTPKRDLLVVFDTGSDLSWVQCKPCDGCYQQHDPLFDPSQSTTYSAVPCGAQEC 198
Query: 47 TYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAI--FHGALFGCSNDNH 101
C + +C Y + Y D S T G A +T+++ + +FGC +D+
Sbjct: 199 RRLDSGSCSSGKCRYEVVYGDMSQTDGNLARDTLTLGPSSSSSSSDQLQEFVFGCGDDDT 258
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
G G G+ GL R +S SQ + FSYCL P+ YL G+
Sbjct: 259 GLF-----GKADGLFGLGRDRVSLASQAAAKYGAGFSYCL----PSSSTAEGYLSLGSAA 309
Query: 162 GYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
P+ + T + + +FYYL+L I + + P F G +IDSG+V
Sbjct: 310 ---PPNARFTAMVTRSDTPSFYYLNLVGIKVAGRTVRVSPAVFRTP-----GTVIDSGTV 361
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFED-A 277
+T S Y L F R+ + + + CY F + PS+A F+ A
Sbjct: 362 ITRLPSRAYAALRSSFAGLMRRYSYKR-APALSILDTCYDFTGRNKVQIPSVALLFDGGA 420
Query: 278 NLRID-GENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
L + GE +++ + A D +A++G+ QQ+ VYD+ + F + C
Sbjct: 421 TLNLGFGEVLYVANKSQACLAFASNGDDTSIAILGNMQQKTFAVVYDVANQKIGFGAKGC 480
Query: 337 S 337
S
Sbjct: 481 S 481
>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 99/360 (27%), Positives = 159/360 (44%), Gaps = 57/360 (15%)
Query: 11 KGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYFKCVN--- 53
+ + +I+DTGS L + +F+P KS S++ + C+ C +
Sbjct: 75 RKMTVIVDTGSDLSWVQCQPCNRCYNQQDPVFNPSKSPSYRTVLCNSLTCRSLQLATGNS 134
Query: 54 -------EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
C Y + Y D S T G E +++ G + +FGC N G
Sbjct: 135 GVCGSNPPTCNYVVNYGDGSYTSGEVGMEHLNL-----GNTTVNNFIFGCGRKNQGLF-- 187
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
G +G++GL R +S ISQ+ + FSYCL P E + S + G Y+
Sbjct: 188 ---GGASGLVGLGRTDLSLISQISPMFGGVFSYCL--PTTEAEASGSLVMGGNSSVYKNT 242
Query: 167 STQA-TKFINHPN-NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
+ + T+ I++P FY+L+L I++ + P +F G+ IIDSG+V++
Sbjct: 243 TPISYTRMIHNPLLPFYFLNLTGITVGGVEVQAP--SF-----GKDRMIIDSGTVISRLP 295
Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL--CYFLPETFN-RFPSMAFYFE-DANLR 280
+Y L +FV F + A P + L C+ L + P + YFE A L
Sbjct: 296 PSIYQALKAEFVKQFSGYPSA-----PSFMILDSCFNLSGYQEVKIPDIKMYFEGSAELN 350
Query: 281 IDGENVFI---IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+D VF D +A P++D V +IG+ QQ++ R +YD +L F +E CS
Sbjct: 351 VDVTGVFYSVKTDASQVCLAIASLPYEDEVGIIGNYQQKNQRIIYDTKGSMLGFAEEACS 410
>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
Length = 436
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 105/374 (28%), Positives = 161/374 (43%), Gaps = 65/374 (17%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
+ + +GTP ++ DTGS LI+ F P SS+F K+ C C
Sbjct: 88 MNISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLPCTSSFCQ 147
Query: 48 YF-----KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
+ C CVY KY T G+ A ET+ V G A F FGCS +N
Sbjct: 148 FLPNSIRTCNATGCVYNYKYGS-GYTAGYLATETLKV-----GDASFPSVAFGCSTEN-- 199
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
+ +G+ GL R +S I QLG RFSYCL G +S + FG+
Sbjct: 200 ----GVGNSTSGIAGLGRGALSLIPQLG---VGRFSYCLRSGSAAG---ASPILFGSLAN 249
Query: 163 YRRPSTQATKFINHPN---NFYYLSLKDISIDNERMNFPPDTFDITVSG-EGGCIIDSGS 218
+ Q+T F+N+P ++YY++L I++ + TF T +G GG I+DSG+
Sbjct: 250 LTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGT 309
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLS--DCPEPIQLCY----------FLPETFNR 266
LTY D Y + + F+S Q A ++ + + LC+ +P R
Sbjct: 310 TLTYLAKDGYEMVKQAFLS-----QTADVTTVNGTRGLDLCFKSTGGGGGGIAVPSLVLR 364
Query: 267 FPSMAFYFED---ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYD 323
F A Y A + D + + ++ A D +++IG+ Q D +YD
Sbjct: 365 FDGGAEYAVPTYFAGVETDSQGSVTV----ACLMMLPAKGDQPMSVIGNVMQMDMHLLYD 420
Query: 324 LNIDLLSFVKENCS 337
L+ + SF +C+
Sbjct: 421 LDGGIFSFAPADCA 434
>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
Length = 482
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 99/367 (26%), Positives = 168/367 (45%), Gaps = 66/367 (17%)
Query: 11 KGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYFKCVN--- 53
+ + +I+DTGS L + +F+P S S++ + C P C +
Sbjct: 144 RKMTVIVDTGSDLSWVQCQPCKRCYNQQDPVFNPSTSPSYRTVLCSSPTCQSLQSATGNL 203
Query: 54 -------EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
C Y + Y D S T+G E + + G A+ + +FGC +N G
Sbjct: 204 GVCGSNPPSCNYVVNYGDGSYTRGELGTEHLDL---GNSTAV-NNFIFGCGRNNQGLF-- 257
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
G +G++GL R ++S ISQ ++ FSYCL P+ E + S + G Y+
Sbjct: 258 ---GGASGLVGLGRSSLSLISQTSAMFGGVFSYCL--PITETEASGSLVMGGNSSVYKNT 312
Query: 167 STQA-TKFINHPN-NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
+ + T+ I +P FY+L+L I++ + + P +F G+ G +IDSG+V+T
Sbjct: 313 TPISYTRMIPNPQLPFYFLNLTGITVGSVAVQAP--SF-----GKDGMMIDSGTVITRLP 365
Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET-FN-------RFPSMAFYFE- 275
+Y L ++FV F F P + + +T FN P++ +FE
Sbjct: 366 PSIYQALKDEFVKQFSGF----------PSAPAFMILDTCFNLSGYQEVEIPNIKMHFEG 415
Query: 276 DANLRIDGENVF-IIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
+A L +D VF + + LA+A +++ V +IG+ QQ++ R +YD +L F
Sbjct: 416 NAELNVDVTGVFYFVKTDASQVCLAIASLSYENEVGIIGNYQQKNQRVIYDTKGSMLGFA 475
Query: 333 KENCSDD 339
E C+ D
Sbjct: 476 AEACTFD 482
>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
Length = 435
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 105/373 (28%), Positives = 161/373 (43%), Gaps = 64/373 (17%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
+ + +GTP ++ DTGS LI+ F P SS+F K+ C C
Sbjct: 88 MNISVGTPLLTFPVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLPCTSSFCQ 147
Query: 48 YF-----KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
+ C CVY KY T G+ A ET+ V G A F FGCS +N
Sbjct: 148 FLPNSIRTCNATGCVYNYKYGS-GYTAGYLATETLKV-----GDASFPSVAFGCSTEN-- 199
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
+ +G+ GL R +S I QLG RFSYCL G +S + FG+
Sbjct: 200 ----GVGNSTSGIAGLGRGALSLIPQLG---VGRFSYCLRSGSAAG---ASPILFGSLAN 249
Query: 163 YRRPSTQATKFINHPN---NFYYLSLKDISIDNERMNFPPDTFDITVSG-EGGCIIDSGS 218
+ Q+T F+N+P ++YY++L I++ + TF T +G GG I+DSG+
Sbjct: 250 LTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGT 309
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLS--DCPEPIQLCY---------FLPETFNRF 267
LTY D Y + + F+S Q A ++ + + LC+ +P RF
Sbjct: 310 TLTYLAKDGYEMVKQAFLS-----QTANVTTVNGTRGLDLCFKSTGGGGGIAVPSLVLRF 364
Query: 268 PSMAFYFED---ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDL 324
A Y A + D + + ++ A D +++IG+ Q D +YDL
Sbjct: 365 DGGAEYAVPTYFAGVETDSQGSVTV----ACLMMLPAKGDQPMSVIGNVMQMDMHLLYDL 420
Query: 325 NIDLLSFVKENCS 337
+ + SF +C+
Sbjct: 421 DGGIFSFSPADCA 433
>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 98/362 (27%), Positives = 160/362 (44%), Gaps = 51/362 (14%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
VR+ +G+P + +++D+GS +I+ +F+P SSS+ ++C C+
Sbjct: 136 VRIGVGSPPRNQYVVIDSGSDIIWVQCEPCTQCYHQSDPVFNPADSSSYAGVSCASTVCS 195
Query: 48 YF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
+ C +C Y + Y D S TKG A ET++ G+ + GC + N G
Sbjct: 196 HVDNAGCHEGRCRYEVSYGDGSYTKGTLALETLTF-----GRTLIRNVAIGCGHHNQGMF 250
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
A AG+LGL +SF+ QLG FSYCLV G +S L+FG +
Sbjct: 251 VGA-----AGLLGLGSGPMSFVGQLGGQAGGTFSYCLV---SRGIQSSGLLQFGREA--- 299
Query: 165 RPSTQA-TKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
P A I++P +FYY+ L + + R+ D F ++ G+GG ++D+G+ +T
Sbjct: 300 VPVGAAWVPLIHNPRAQSFYYVGLSGLGVGGLRVPISEDVFKLSELGDGGVVMDTGTAVT 359
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL-----CYFLPETFN-RFPSMAFYFE 275
+ Y + F+ AQ ++ P + CY L + R P+++FYF
Sbjct: 360 RLPTAAYEAFRDAFI--------AQTTNLPRASGVSIFDTCYDLFGFVSVRVPTVSFYFS 411
Query: 276 DAN-LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
L + N I + F A AP +++IG+ QQ D + F
Sbjct: 412 GGPILTLPARNFLIPVDDVGSFCFAFAPSSSGLSIIGNIQQEGIEISVDGANGFVGFGPN 471
Query: 335 NC 336
C
Sbjct: 472 VC 473
>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 470
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 104/366 (28%), Positives = 157/366 (42%), Gaps = 57/366 (15%)
Query: 6 IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYFK- 50
+G S+ + +I+DTGS L + +F P S S+Q I C+ C +
Sbjct: 126 MGLGSQNMSVIVDTGSDLTWVQCEPCRSCYNQNGPLFKPSTSPSYQPILCNSTTCQSLEL 185
Query: 51 -------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
+ C Y + Y D S T G E + G G +FGC +N G
Sbjct: 186 GACGSDPSTSATCDYVVNYGDGSYTSGELGIEKL-----GFGGISVSNFVFGCGRNNKGL 240
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
G +G++GL R +S ISQ + FSYCL P + S L G G
Sbjct: 241 F-----GGASGLMGLGRSELSMISQTNATFGGVFSYCL--PSTDQAGASGSLVMGNQSGV 293
Query: 164 RRPSTQATKFINHPN----NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
+ T PN NFY L+L I + ++ +F G GG I+DSG+V
Sbjct: 294 FKNVTPIAYTRMLPNLQLSNFYILNLTGIDVGGVSLHVQASSF-----GNGGVILDSGTV 348
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLP--ETFNRFPSMAFYFE 275
++ VY L KF+ F F A P + C+ L + N P+++ YFE
Sbjct: 349 ISRLAPSVYKALKAKFLEQFSGFPSA-----PGFSILDTCFNLTGYDQVN-IPTISMYFE 402
Query: 276 -DANLRIDGENVF-IIDYENHFFLLAVAPHDD--LVALIGSQQQRDTRFVYDLNIDLLSF 331
+A L +D +F ++ + LA+A D + +IG+ QQR+ R +YD + + F
Sbjct: 403 GNAELNVDATGIFYLVKEDASRVCLALASLSDEYEMGIIGNYQQRNQRVLYDAKLSQVGF 462
Query: 332 VKENCS 337
KE C+
Sbjct: 463 AKEPCT 468
>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags:
Precursor
gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 447
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 110/372 (29%), Positives = 163/372 (43%), Gaps = 50/372 (13%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
+ + IGTP V I DTGS L + IFD +KSS+++ CD +C
Sbjct: 87 MSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYKENGPIFDKKKSSTYKSEPCDSRNCQ 146
Query: 48 YFKCV-------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
N C Y Y DQS +KG A ET+S+ F G +FGC +N
Sbjct: 147 ALSSTERGCDESNNICKYRYSYGDQSFSKGDVATETVSIDSASGSPVSFPGTVFGCGYNN 206
Query: 101 HG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV--IPLPNGEYTSSYLKF 157
G FDE G L S ISQLGS I K+FSYCL NG +S +
Sbjct: 207 GGTFDETGSGIIGLGGGHL-----SLISQLGSSISKKFSYCLSHKSATTNG---TSVINL 258
Query: 158 GTD----MGYRRPSTQATKFIN-HPNNFYYLSLKDISIDNERM-----NFPPDTFDITVS 207
GT+ + +T ++ P +YYL+L+ IS+ +++ ++ P+ I
Sbjct: 259 GTNSIPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGKKKIPYTGSSYNPNDDGILSE 318
Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLA--QLSDCPEPIQLCYFLPETFN 265
G IIDSG+ LT + + +KF S E ++SD + C+
Sbjct: 319 TSGNIIIDSGTTLTLLEAGFF----DKFSSAVEESVTGAKRVSDPQGLLSHCFKSGSAEI 374
Query: 266 RFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLN 325
P + +F A++R+ N F+ E+ L++ P + VA+ G+ Q D YDL
Sbjct: 375 GLPEITVHFTGADVRLSPINAFVKLSED-MVCLSMVPTTE-VAIYGNFAQMDFLVGYDLE 432
Query: 326 IDLLSFVKENCS 337
+SF +CS
Sbjct: 433 TRTVSFQHMDCS 444
>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 413
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 102/362 (28%), Positives = 167/362 (46%), Gaps = 40/362 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ L+IGTP + +DTGS LI+ +FDP KSS++ I+CD P C
Sbjct: 65 LMELYIGTPPIKISGTVDTGSDLIWVQCVPCLGCYNQINPMFDPLKSSTYTNISCDSPLC 124
Query: 47 --TYF-KCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
Y +C E+ C YT YAD S+TKG A ET+++ G LFGC ++N G
Sbjct: 125 YKPYIGECSPEKRCDYTYGYADSSLTKGVLAQETVTLTSNTGKPISLQGILFGCGHNNTG 184
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSII-KKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
D G++GL S +SQ+G + K+FS CLV P SS + FG
Sbjct: 185 NFNDHE----MGLIGLGGGPTSLVSQIGPLFGGKKFSQCLV-PFLTDITISSQMSFGKGS 239
Query: 162 GYRRPSTQATKFINHPNNF--YYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
T + + YY++L IS+++ + + T+ +G ++DSG+
Sbjct: 240 EVLGEGVVTTPLVQREQDMTSYYVTLLGISVEDTYL-----PMNSTIE-KGNMLVDSGTP 293
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-PIQLCYFLPETFNRFPSMAFYFEDAN 278
+Y ++ +V + L ++D P QLCY +T + P++ ++FE AN
Sbjct: 294 PNILPQQLYDRV---YVEVKNKVPLEPITDDPSLGPQLCYRT-QTNLKGPTLTYHFEGAN 349
Query: 279 LRIDGENVFI---IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
L + FI + + F L + + G+ Q + +DL+ ++SF +
Sbjct: 350 LLLTPIQTFIPPTPETKGVFCLAITNCANSDPGIYGNFAQTNYLIGFDLDRQIVSFKPTD 409
Query: 336 CS 337
C+
Sbjct: 410 CT 411
>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 490
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 98/353 (27%), Positives = 152/353 (43%), Gaps = 53/353 (15%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPDC 46
V + +GTP + + LI DTGS L + IFDP KS+S+ I C C
Sbjct: 148 VVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDVIFDPSKSTSYSNITCTSALC 207
Query: 47 TYFKCVN----------EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
T + C+Y ++Y D S + G+ + E ++V + LFGC
Sbjct: 208 TQLSTATGNDPGCSASTKACIYGIQYGDSSFSVGYFSRERLTVTATD----VVDNFLFGC 263
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
+N G G AG++GL R ISF+ Q + +K FSYC LP+ ++ +L
Sbjct: 264 GQNNQGLF-----GGSAGLIGLGRHPISFVQQTAAKYRKIFSYC----LPSTSSSTGHLS 314
Query: 157 FGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
FG R I+ ++FY L + I++ ++ TF GG IIDS
Sbjct: 315 FGPAATGRYLKYTPFSTISRGSSFYGLDITAIAVGGVKLPVSSSTFST-----GGAIIDS 369
Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLA-QLSDCPEPIQLCYFLP--ETFNRFPSMAFY 273
G+V+T Y L F ++ A +LS + CY L + F+ P++ F
Sbjct: 370 GTVITRLPPTAYGALRSAFRQGMSKYPSAGELS----ILDTCYDLSGYKVFS-IPTIEFS 424
Query: 274 FEDA-NLRIDGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDL 324
F +++ + + + L A DD V + G+ QQR VYD+
Sbjct: 425 FAGGVTVKLPPQGILFVASTKQVCLAFAANGDDSDVTIYGNVQQRTIEVVYDV 477
>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 444
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 106/364 (29%), Positives = 170/364 (46%), Gaps = 40/364 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
++ + +GTP +L I DTGS LI+ +FDP++S +++ ++CD+ C
Sbjct: 95 LMNISLGTPPVPMLGIADTGSDLIWRQCLPCPNCYEQVEPLFDPKESETYKTLDCDNEFC 154
Query: 47 TYF----KCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEG-KAIFHGALFGCSNDN 100
C ++ C Y+ Y D+S T+G + +T++ IG EG A F G FGC +DN
Sbjct: 155 QDLGQQGSCDDDNTCTYSYSYGDRSYTRGDLSSDTLT-IGSTEGDPASFPGIAFGCGHDN 213
Query: 101 HG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
G F+E G LS V QL S + +FSYCLV PL + SS + FG
Sbjct: 214 GGTFNEKDGGLIGLGGGPLSLVM-----QLSSEVGGQFSYCLV-PLSSDSTVSSKINFGK 267
Query: 160 DMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERM---NFPPDTFDITVSGEGGCIID 215
T +T I P+ FYYL+L+ +S+ +E + F + EG IID
Sbjct: 268 SGVVSGSGTVSTPLIKGTPDTFYYLTLEGLSVGSETVAFKGFSENKSSPAAVEEGNIIID 327
Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPI-QLCYFLPETFNRFPSMAFYF 274
SG+ LT D Y + + Q + P I LCY P++ +F
Sbjct: 328 SGTTLTLLPQDFYTDVESALTNAIG----GQTTTDPNGIFSLCYSSVNNL-EIPTITAHF 382
Query: 275 EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
A++++ N F + + ++ P +L A+ G+ Q + YDL + +SF +
Sbjct: 383 TGADVQLPPLNTF-VQVQEDLVCFSMIPSSNL-AIFGNLAQINFLVGYDLKNNKVSFKQT 440
Query: 335 NCSD 338
+C++
Sbjct: 441 DCTE 444
>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
Japonica Group]
Length = 446
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 100/374 (26%), Positives = 160/374 (42%), Gaps = 62/374 (16%)
Query: 6 IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYFK- 50
+GTPS +L++DTGS L++ +FDPR+SS+++++ C P C +
Sbjct: 92 VGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRSSTYRRVPCSSPQCRALRF 151
Query: 51 -------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
C Y + Y D S + G A + ++ + GC DN G
Sbjct: 152 PGCDSGGAAGGGCRYMVAYGDGSSSTGDLATDKLAF----ANDTYVNNVTLGCGRDNEGL 207
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS-----SYLKFG 158
+ A AG+LG+ R IS +Q+ F YCL G+ TS SYL FG
Sbjct: 208 FDSA-----AGLLGVGRGKISISTQVAPAYGSVFEYCL------GDRTSRSTRSSYLVFG 256
Query: 159 TDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERM-NFPPDTFDI-TVSGEGGCII 214
PST T +++P + YY+ + S+ ER+ F + + T +G GG ++
Sbjct: 257 RTP--EPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLALDTATGRGGVVV 314
Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL---PET-------- 263
DSG+ ++ F D Y L + F + + +L+ CY L P
Sbjct: 315 DSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRGRPAASAPLIVLH 374
Query: 264 FNRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYD 323
F MA E+ L +DG Y L DD +++IG+ QQ+ R V+D
Sbjct: 375 FAGGADMALPPENYFLPVDGGRRRAASYRR---CLGFEAADDGLSVIGNVQQQGFRVVFD 431
Query: 324 LNIDLLSFVKENCS 337
+ + + F + C+
Sbjct: 432 VEKERIGFAPKGCT 445
>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 418
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 109/371 (29%), Positives = 165/371 (44%), Gaps = 50/371 (13%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
V F+GTP + LI+D+GS L++ ++ P SS+F + C DC
Sbjct: 66 VDFFLGTPPQKFSLIVDSGSDLLWVQCSPCRQCYAQDSPLYVPSNSSTFSPVPCLSSDCL 125
Query: 48 Y------FKC---VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSN 98
F C C Y YAD S +KG A+E+ +V G K F GC +
Sbjct: 126 LIPATEGFPCDFRYPGACAYEYLYADTSSSKGVFAYESATVDGVRIDKVAF-----GCGS 180
Query: 99 DNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
DN G A GVLGL + +SF SQ+G +F+YCLV L + SS L FG
Sbjct: 181 DNQG-----SFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYL-DPTSVSSSLIFG 234
Query: 159 TDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
++ Q T +++P + YY+ ++ +++ + + ++I + G GG I DS
Sbjct: 235 DELISTIHDMQYTPIVSNPKSPTLYYVQIEKVTVGGKSLPISDSAWEIDLLGNGGSIFDS 294
Query: 217 GSVLTYFHSDVYWKLHEKFVS--YFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFY 273
G+ LTY+ Y + F S ++ R + Q D LC L FPS
Sbjct: 295 GTTLTYWFPSAYSHILAAFDSGVHYPRAESVQGLD------LCVELTGVDQPSFPSFTIE 348
Query: 274 FED-ANLRIDGENVFIIDYENHFFLLAVAPHDDLVA---LIGSQQQRDTRFVYDLNIDLL 329
F+D A + + EN F +D + LA+A + IG+ Q++ YD +L+
Sbjct: 349 FDDGAVFQPEAENYF-VDVAPNVRCLAMAGLASPLGGFNTIGNLLQQNFFVQYDREENLI 407
Query: 330 SFVKENCSDDS 340
F CS S
Sbjct: 408 GFAPAKCSSHS 418
>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 535
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 108/369 (29%), Positives = 160/369 (43%), Gaps = 42/369 (11%)
Query: 4 LFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYF 49
+ +G+P K LILDTGS L + A +DP+ S+S++ I C+ C
Sbjct: 174 VLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQNGAFYDPKASASYKNITCNDQRCNLV 233
Query: 50 ---------KCVNEQCVYTMKYADQSVTKGFAAHETISV--IGKGEGKAIFH--GALFGC 96
K N+ C Y Y D S T G A ET +V G +++ +FGC
Sbjct: 234 SSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSELYNVENMMFGC 293
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
+ N G A R +SF SQL S+ FSYCLV + SS L
Sbjct: 294 GHWNRGLFHGAAGLLGL-----GRGPLSFSSQLQSLYGHSFSYCLV-DRNSDTNVSSKLI 347
Query: 157 FGTDMGY-RRPSTQATKFINHPNN----FYYLSLKDISIDNERMNFPPDTFDITVSGEGG 211
FG D P+ T F+ N FYY+ +K I + E +N P +T++I+ G GG
Sbjct: 348 FGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETWNISSDGAGG 407
Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSM 270
IIDSG+ L+YF Y + K ++ + + D P + C+ + N + P +
Sbjct: 408 TIIDSGTTLSYFAEPAYEFIKNK-IAEKAKGKYPVYRDFP-ILDPCFNVSGIHNVQLPEL 465
Query: 271 AFYFED-ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
F D A EN FI E+ L + ++IG+ QQ++ +YD L
Sbjct: 466 GIAFADGAVWNFPTENSFIWLNEDLVCLAMLGTPKSAFSIIGNYQQQNFHILYDTKRSRL 525
Query: 330 SFVKENCSD 338
+ C+D
Sbjct: 526 GYAPTKCAD 534
>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 495
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 101/357 (28%), Positives = 154/357 (43%), Gaps = 44/357 (12%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
R+ +G P+K ++LDTGS + + IF P SSS+ + CD C
Sbjct: 161 TRVGVGNPAKSYYMVLDTGSDINWIQCQPCSDCYQQSDPIFTPAASSSYSPLTCDSQQCN 220
Query: 48 YFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
+ C N QC Y + Y D S T G ET+S G G +I GC +DN G
Sbjct: 221 SLQMSSCRNGQCRYQVNYGDGSFTFGDFVTETMSFGGSGTVNSI----ALGCGHDNEGLF 276
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM--- 161
A G LS SQL + FSYCLV SS L F +
Sbjct: 277 VGAAGLLGLGGGPLS-----LTSQLKA---TSFSYCLV---NRDSAASSTLDFNSAPVGD 325
Query: 162 GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
P +++K + FYY+ L +S+ E + P + F + SG+GG I+D G+ +T
Sbjct: 326 SVIAPLLKSSKI----DTFYYVGLSGMSVGGELLRIPQEVFKLDDSGDGGVIVDCGTAIT 381
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP-ETFNRFPSMAFYFEDA-NL 279
S+ Y L + FVS + + CY L ++ + P+++F+F+ +
Sbjct: 382 RLQSEAYNSLRDSFVSMSRHL---RSTSGVALFDTCYDLSGQSSVKVPTVSFHFDGGKSW 438
Query: 280 RIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ N I + A AP +++IG+ QQ+ TR +DL + + F C
Sbjct: 439 DLPAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVSFDLANNRVGFSTNKC 495
>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
Length = 475
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 97/363 (26%), Positives = 149/363 (41%), Gaps = 47/363 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
+V + +GTP + LI DTGS L + IF+P KS+S+ ++C
Sbjct: 134 IVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAA 193
Query: 46 CTYFK--------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
C C C+Y ++Y DQS + GF A + ++ +F G FGC
Sbjct: 194 CGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKDKFTL----TSSDVFDGVYFGCG 249
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
+N G +AG+LGL R +SF SQ + K FSYC LP+ + +L F
Sbjct: 250 ENNQGLFT-----GVAGLLGLGRDKLSFPSQTATAYNKIFSYC----LPSSASYTGHLTF 300
Query: 158 GTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
G+ R I +FY L++ I++ +++ P F G +IDSG
Sbjct: 301 GSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTP-----GALIDSG 355
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERF-QLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFED 276
+V+T Y L F + ++ + +S L F T P +AF F
Sbjct: 356 TVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVT---IPKVAFSFSG 412
Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPH--DDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
+ G ++ LA A + D A+ G+ QQ+ VYD + F
Sbjct: 413 GAVVELGSKGIFYAFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPN 472
Query: 335 NCS 337
CS
Sbjct: 473 GCS 475
>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 433
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 103/363 (28%), Positives = 166/363 (45%), Gaps = 43/363 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
++ +GTPS V ILDTGS +I+ IFD KS +++ + C C
Sbjct: 90 LISYSVGTPSLQVFGILDTGSDIIWLQCQPCKKCYEQTTPIFDSSKSQTYKTLPCPSNTC 149
Query: 47 T----YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAI-FHGALFGCSNDNH 101
F + C+Y++ Y D S + G + ET++ +G G + F G + GC N
Sbjct: 150 QSVQGTFCSSRKHCLYSIHYVDGSQSLGDLSVETLT-LGSTNGSPVQFPGTVIGCGRYNA 208
Query: 102 -GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
G +E +G++GL R +S I+QL +FSYCLV P SS L FG
Sbjct: 209 IGIEEKN-----SGIVGLGRGPMSLITQLSPSTGGKFSYCLV---PGLSTASSKLNFGNA 260
Query: 161 MGYRRPSTQATK-FINHPNNFYYLSLKDISIDNERMNF-PPDTFDITVSGEGGCIIDSGS 218
T +T F + FY+L+L+ S+ R+ F P + G+G IIDSG+
Sbjct: 261 AVVSGRGTVSTPLFSKNGLVFYFLTLEAFSVGRNRIEFGSPGS-----GGKGNIIIDSGT 315
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFN-RFPSMAFYFED 276
LT + VY KL + L ++ D + + LCY P+ + P + +F
Sbjct: 316 TLTALPNGVYSKLEAAVA---KTVILQRVRDPNQVLGLCYKVTPDKLDASVPVITAHFSG 372
Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
A++ ++ N F + + A P + A+ G+ Q++ YDL ++ +SF +C
Sbjct: 373 ADVTLNAINTF-VQVADDVVCFAFQP-TETGAVFGNLAQQNLLVGYDLQMNTVSFKHTDC 430
Query: 337 SDD 339
+
Sbjct: 431 TKQ 433
>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 472
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 98/372 (26%), Positives = 159/372 (42%), Gaps = 51/372 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
++ L IGTP + DTGS LI+ +++P S++F + C+
Sbjct: 115 LMTLAIGTPPLPYAAVADTGSDLIWTQCAPCGTQCFEQPAPLYNPASSTTFSVLPCNS-- 172
Query: 46 CTYFKCVNE----------QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFG 95
+ C C+Y Y T G ET + +A G FG
Sbjct: 173 -SLSMCAGALAGAAPPPGCACMYYQTYG-TGWTAGVQGSETFTFGSSAADQARVPGVAFG 230
Query: 96 CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYL 155
CSN + D + AG++GL R ++S +SQLG+ RFSYCL P + TS+ L
Sbjct: 231 CSNASSS-DWNGS----AGLVGLGRGSLSLVSQLGA---GRFSYCLT-PFQDTNSTSTLL 281
Query: 156 KFGTDMGYRRPSTQATKFINHP-----NNFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
G ++T F+ P + +YYL+L IS+ + + P F + G G
Sbjct: 282 -LGPSAALNGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTG 340
Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYF-ERFQLAQLSDCPEPIQLCYFLPETFNR--- 266
G IIDSG+ +T + Y ++ S SD + LC+ LP +
Sbjct: 341 GLIIDSGTTITSLANAAYQQVRAAVKSQLVTTLPTVDGSDS-TGLDLCFALPAPTSAPPA 399
Query: 267 -FPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLN 325
PSM +F+ A++ + ++ ++I + L D ++ G+ QQ++ +YD+
Sbjct: 400 VLPSMTLHFDGADMVLPADS-YMISGSGVWCLAMRNQTDGAMSTFGNYQQQNMHILYDVR 458
Query: 326 IDLLSFVKENCS 337
+ LSF CS
Sbjct: 459 EETLSFAPAKCS 470
>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
Length = 493
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 104/367 (28%), Positives = 161/367 (43%), Gaps = 47/367 (12%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
++ +GTPS L++LDTGS +++ +FDPR+SSS+ ++C P C
Sbjct: 142 TKIGVGTPSTPALMVLDTGSDVVWLQCAPCRRCYDQSGPVFDPRRSSSYGAVDCAAPLCR 201
Query: 48 YFKC-----VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C+Y + Y D SVT G A ET++ G G + AL GC +DN G
Sbjct: 202 RLDSGGCDLRRRACLYQVAYGDGSVTAGDFATETLTFAG---GARVARVAL-GCGHDNEG 257
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
A R ++SF +Q+ K FSYCLV + ++ + +
Sbjct: 258 LFVAAAGLLGL-----GRGSLSFPTQISRRYGKSFSYCLVDRTSSSSSGAASRSRSSTVT 312
Query: 163 YRRPSTQATKF---INHPN--NFYYLSLKDISIDNERMNFPPDTFDITV---SGEGGCII 214
+ PS A F + +P FYY+ L IS+ R+ ++ D+ + +G GG I+
Sbjct: 313 FGPPSASAASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAES-DLRLDPSTGRGGVIV 371
Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL---CYFL-PETFNRFPSM 270
DSG+ +T Y L + F R A L P L CY L + P++
Sbjct: 372 DSGTSVTRLARPSYSALRDAF-----RAAAAGLRLSPGGFSLFDTCYDLGGRKVVKVPTV 426
Query: 271 AFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
+ +F A + EN I F A A D V++IG+ QQ+ R V+D + +
Sbjct: 427 SMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRV 486
Query: 330 SFVKENC 336
F + C
Sbjct: 487 GFAPKGC 493
>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 455
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 101/376 (26%), Positives = 162/376 (43%), Gaps = 50/376 (13%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAIFDP---------------RKSSSFQKINCDHPDC 46
V L IGTP + +LL+ DTGS LI+ P R S+++ I+C P C
Sbjct: 88 VSLRIGTPPQTLLLVADTGSDLIWVKCSPCRNCSHRSPGSAFFARHSTTYSAIHCYSPQC 147
Query: 47 TYFK------C----VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
C ++ C Y YAD S T GF + E +++ +G FGC
Sbjct: 148 QLVPHPHPNPCNRTRLHSPCRYQYTYADSSTTTGFFSKEALTLNTSTGKVKKLNGLSFGC 207
Query: 97 SNDNHG--FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT--- 151
G + +GA GV+GL R ISF SQLG +FSYCL+ +YT
Sbjct: 208 GFRISGPSLTGASFEGA-QGVMGLGRAPISFSSQLGRRFGSKFSYCLM------DYTLSP 260
Query: 152 --SSYLKFGTDMGY---RRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDI 204
+S+L G ++ T + +P FYY+++K + ++ ++ P + I
Sbjct: 261 PPTSFLTIGGAQNVAVSKKGIMSFTPLLINPLSPTFYYIAIKGVYVNGVKLPINPSVWSI 320
Query: 205 TVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPE-T 263
G GG IIDSG+ LT+ Y ++ + F +R +L ++ LC + T
Sbjct: 321 DDLGNGGTIIDSGTTLTFITEPAYTEILKAFK---KRVKLPSPAEPTPGFDLCMNVSGVT 377
Query: 264 FNRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFV 321
P M+F ++ + I+ + LAV P D +++G+ Q+
Sbjct: 378 RPALPRMSFNLAGGSVFSPPPRNYFIETGDQIKCLAVQPVSQDGGFSVLGNLMQQGFLLE 437
Query: 322 YDLNIDLLSFVKENCS 337
+D + L F + C+
Sbjct: 438 FDRDKSRLGFTRRGCA 453
>gi|297846526|ref|XP_002891144.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297336986|gb|EFH67403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 445
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 106/372 (28%), Positives = 167/372 (44%), Gaps = 48/372 (12%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
+ + IGTP L I DTGS L + +FD +KSS+++ +CD C
Sbjct: 87 MSISIGTPPSKFLAIADTGSDLTWVQCKPCQQCYKQNTPLFDKKKSSTYKTESCDSITCN 146
Query: 48 YFKCVNE-------QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
E C Y Y D+S TKG A ETIS+ F G FGC +N
Sbjct: 147 ALSEHEEGCDESRNACKYRYSYGDESFTKGEVATETISIDSSSGSPVSFPGTAFGCGYNN 206
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL--VIPLPNGEYTSSYLKFG 158
G E+ G + L +S +SQLGS I K+FSYCL NG +S + G
Sbjct: 207 GGTFEETGSGIIG----LGGGPLSLVSQLGSSIGKKFSYCLSHTSATTNG---TSVINLG 259
Query: 159 TDMGYRRPSTQA----TKFINH-PNNFYYLSLKDISIDNERMNFPPD---TFDITVSGEG 210
T+ +PS + T I P +Y+L+L+ I++ ++ + + + G
Sbjct: 260 TNSMTSKPSKDSAILTTPLIQKDPETYYFLTLEAITVGKTKLPYTGGGGYSLNRKSKKTG 319
Query: 211 GCIIDSGSVLTYFHS---DVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRF 267
IIDSG+ LT S D + + E+ V+ +R +SD + C+ +
Sbjct: 320 NIIIDSGTTLTLLDSGFYDDFGAVVEESVTGAKR-----VSDPQGILTHCFKSGDKEIGL 374
Query: 268 PSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNID 327
P++ +F A++++ N F+ E+ L++ P + VA+ G+ Q D YDL
Sbjct: 375 PTITMHFTGADVKLSPINSFVKLSED-IVCLSMIPTTE-VAIYGNMVQMDFLVGYDLETK 432
Query: 328 LLSFVKENCSDD 339
+SF + +CS +
Sbjct: 433 TVSFQRMDCSGN 444
>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 469
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 94/363 (25%), Positives = 156/363 (42%), Gaps = 51/363 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
+ RL +GTP+ ++++DTGS+L + +FDPR S ++ + C +
Sbjct: 132 VTRLGLGTPATSYVMVVDTGSSLTWLQCSPCSVSCHRQAGPVFDPRASGTYAAVQCSSSE 191
Query: 46 CTYFKC---------VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
C + V+ C+Y Y D S + G+ + +T+S G F G +GC
Sbjct: 192 CGELQAATLNPSACSVSNVCIYQASYGDSSYSVGYLSKDTVSF-----GSGSFPGFYYGC 246
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
DN G G AG++GL++ +S + QL + FSYC LP + YL
Sbjct: 247 GQDNEGLF-----GRSAGLIGLAKNKLSLLYQLAPSLGYAFSYC----LPTSSAAAGYLS 297
Query: 157 FGT-DMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
G+ + G + A+ ++ + Y+++L IS+ + PP + + IID
Sbjct: 298 IGSYNPGQYSYTPMASSSLD--ASLYFVTLSGISVAGAPLAVPPSEYRSLPT-----IID 350
Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE 275
SG+V+T +VY L + + + C+ R P + F
Sbjct: 351 SGTVITRLPPNVYTALSRAVAAAMASAAPRAPTY--SILDTCFRGSAAGLRVPRVDMAFA 408
Query: 276 -DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
A L + NV +ID ++ LA AP A+IG+ QQ+ VYD+ + F
Sbjct: 409 GGATLALSPGNV-LIDVDDSTTCLAFAPTGG-TAIIGNTQQQTFSVVYDVAQSRIGFAAG 466
Query: 335 NCS 337
CS
Sbjct: 467 GCS 469
>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
Length = 511
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 106/381 (27%), Positives = 164/381 (43%), Gaps = 53/381 (13%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
V L +GTP+ V+LI+DTGS + + F+PR SSSF K+ C CT
Sbjct: 141 VPLQVGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFNPRHSSSFFKLPCASSTCT 200
Query: 48 --------YFKCVNEQCVYTMKYADQSVTKGFAAHETIS--VIGKGEGKAI-FHGALFGC 96
+ C+++++Y D S++ G A ETI+ G+G+ + GC
Sbjct: 201 NVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIAGNTPNFGDGEPVKLSNITLGC 260
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
++ D + +G+LG+ R ISF SQL S ++FS+C P SS L
Sbjct: 261 AD----IDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCF--PDKIAHLNSSGLV 314
Query: 157 FGTDMGYRRPSTQATKFINHPN------NFYYLSLKDISIDNERMNFPPDTFDI-TVSGE 209
F + P + T + +P ++YY+ L IS+D R+ FDI V+G
Sbjct: 315 FFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKNFDIDKVTGS 374
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSD------CPEPIQLCYFLPET 263
GG IIDSG+ TY + + +F++ LA++ D C L T
Sbjct: 375 GGTIIDSGTAFTYLKKPAFQAMRREFLARTS--HLAKVDDNSGFTPCYNITSGTAALEST 432
Query: 264 FNRFPSMAFYFEDANLRIDGENVFIIDY----ENHFFLLAVAPHDDL-VALIGSQQQRDT 318
PS+ +F + +N +I E LA D+ +IG+ QQ++
Sbjct: 433 I--LPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFLMSGDIPFNIIGNYQQQNL 490
Query: 319 RFVYDLNIDLLSFVKENCSDD 339
YDL L C+ D
Sbjct: 491 WVEYDLEKLRLGIAPAQCATD 511
>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 486
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 98/355 (27%), Positives = 154/355 (43%), Gaps = 43/355 (12%)
Query: 3 RLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTY 48
R+ IG P V ++LDTGS + + F+P S+SF ++C+ C
Sbjct: 154 RVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPXFEPTSSASFTSLSCETEQCKS 213
Query: 49 F---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
+C N C+Y + Y D S T G ET+++ G GC ++N G
Sbjct: 214 LDVSECRNGTCLYEVSYGDGSYTVGDFVTETVTL-----GSTSLGNIAIGCGHNNEGLFI 268
Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
A ++SF SQL + FSYCLV ++S L F + +
Sbjct: 269 GAAGLLGL-----GGGSLSFPSQLNA---SSFSYCLV---DRDSDSTSTLDFNSPI---T 314
Query: 166 PSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
P +PN F+YL L +S+ + P +F ++ G GG I+DSG+ +T
Sbjct: 315 PDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGIIVDSGTAVTRL 374
Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL-PETFNRFPSMAFYFEDAN-LRI 281
+ VY L + FV Q A+ CY L ++ P+++F+F + N L +
Sbjct: 375 QTTVYNVLRDAFVKSTHDLQTARGVAL---FDTCYDLSSKSRVEVPTVSFHFANGNELPL 431
Query: 282 DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+N I F A AP D ++++G+ QQ+ TR +DL L+ F C
Sbjct: 432 PAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVGFDLANSLVGFSPNKC 486
>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
Length = 510
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 106/381 (27%), Positives = 164/381 (43%), Gaps = 53/381 (13%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
V L +GTP+ V+LI+DTGS + + F+PR SSSF K+ C CT
Sbjct: 140 VPLQLGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFNPRHSSSFFKLPCASSTCT 199
Query: 48 --------YFKCVNEQCVYTMKYADQSVTKGFAAHETIS--VIGKGEGKAI-FHGALFGC 96
+ C+++++Y D S++ G A ETI+ G+G+ + GC
Sbjct: 200 NVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIAGNTPNFGDGEPVKLSNITLGC 259
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
++ D + +G+LG+ R ISF SQL S ++FS+C P SS L
Sbjct: 260 AD----IDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCF--PDKIAHLNSSGLV 313
Query: 157 FGTDMGYRRPSTQATKFINHPN------NFYYLSLKDISIDNERMNFPPDTFDI-TVSGE 209
F + P + T + +P ++YY+ L IS+D R+ FDI V+G
Sbjct: 314 FFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKNFDIDKVTGS 373
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSD------CPEPIQLCYFLPET 263
GG IIDSG+ TY + + +F++ LA++ D C L T
Sbjct: 374 GGTIIDSGTAFTYLKKPAFQAMRREFLARTS--HLAKVDDNSGFTPCYNITSGTAALEST 431
Query: 264 FNRFPSMAFYFEDANLRIDGENVFIIDY----ENHFFLLAVAPHDDL-VALIGSQQQRDT 318
PS+ +F + +N +I E LA D+ +IG+ QQ++
Sbjct: 432 I--LPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFQMSGDIPFNIIGNYQQQNL 489
Query: 319 RFVYDLNIDLLSFVKENCSDD 339
YDL L C+ D
Sbjct: 490 WVEYDLEKLRLGIAPAQCATD 510
>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 467
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 100/394 (25%), Positives = 172/394 (43%), Gaps = 77/394 (19%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+V+L +GTP +DT S LI+ +F+P S+S+ + C+ C
Sbjct: 89 LVKLGLGTPQHCFTAAIDTASDLIWTQCQPCVKCYKQLDPVFNPVASTSYAVVPCNSDTC 148
Query: 47 TYF---KCVNE-------QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
+C + C YT Y + T+G A + +++ G +F G +FGC
Sbjct: 149 DELDTHRCARDGDSDDEDACQYTYSYGGNATTRGILAVDRLAI-----GDDVFRGVVFGC 203
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
S+ + G ++GV+GL R +S +SQL +RF YCL P+ ++ L
Sbjct: 204 SSSSVGGPPPQ----VSGVVGLGRGALSLVSQLSV---RRFMYCLPPPV---SRSAGRLV 253
Query: 157 FGTDMG--YRRPSTQATKFINHPN---NFYYLSLKDISIDNERMNFPP-DTFDITVSGEG 210
G D R S + ++ + ++YYL+L ISI + M+F + + T G
Sbjct: 254 LGADAAATVRNASERVVVPMSTGSRYPSYYYLNLDGISIGDRAMSFRSRNRMNATTPGTA 313
Query: 211 ------------------------GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQ 246
G IID S +T+ +Y ++ + E +L +
Sbjct: 314 AGAPASPVSGSGDGDGSGTGPDAYGMIIDIASTITFLEESLYEEMVDDL---EEEIRLPR 370
Query: 247 LSDCPEPIQLCYFLPE--TFNRF--PSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAP 302
S + LC+ LPE +R P ++ FE LR+D E +F+ D + L V
Sbjct: 371 GSGSDLGLDLCFILPEGVPMSRVYAPPVSLAFEGVWLRLDKEQMFVEDRASGMMCLMVGK 430
Query: 303 HDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
D V+++G+ QQ++ + +Y+L ++F+K C
Sbjct: 431 TDG-VSILGNYQQQNMQVMYNLRRGRITFIKTAC 463
>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
Length = 459
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 99/373 (26%), Positives = 156/373 (41%), Gaps = 47/373 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ L IGTP + + DTGS L + I+D S+SF + C C
Sbjct: 96 LMELAIGTPPVPFVALADTGSDLTWTQCKPCKLCFPQDTPIYDTAASASFSPVPCASATC 155
Query: 47 TYF--------KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGK----AIFHGALF 94
C Y Y D + + G ET++ G G G F
Sbjct: 156 LPIWRSSRNCTATTTSPCRYRYAYDDGAYSAGVLGTETLTFAGSSPGAPGPGVSVGGVAF 215
Query: 95 GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY 154
GC DN G ++ G +GL R ++S ++QLG +FSYCL N S
Sbjct: 216 GCGVDNGGLSYNS-----TGTVGLGRGSLSLVAQLG---VGKFSYCLT-DFFNTSLGSPV 266
Query: 155 LKFGTDMGYRRPST------QATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITV 206
L FG+ PST Q+T + P N YY+SL+ IS+ + R+ P TFD+
Sbjct: 267 L-FGSLAELAAPSTIGGAAVQSTPLVQGPYNPSRYYVSLEGISLGDARLPIPNGTFDLRD 325
Query: 207 SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR 266
G GG I+DSG++ T + + + + S P +
Sbjct: 326 DGSGGMIVDSGTIFTVLVESAFRVVVNHVAGVLNQ-PVVNASSLDSPCFPATAGEQQLPD 384
Query: 267 FPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLV-ALIGSQQQRDTRFVYDL 324
P M +F A++R+ +N + E+ F L +A +++G+ QQ++ + ++D+
Sbjct: 385 MPDMLLHFAGGADMRLHRDNYMSFNQESSSFCLNIAGAPSAYGSILGNFQQQNIQMLFDI 444
Query: 325 NIDLLSFVKENCS 337
+ LSFV +CS
Sbjct: 445 TVGQLSFVPTDCS 457
>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 420
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 96/363 (26%), Positives = 162/363 (44%), Gaps = 42/363 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ L IG P + + DTGS L + ++DP SS+F + C C
Sbjct: 72 LMELAIGKPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPLPCSSATC 131
Query: 47 TYFKCVN----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
N C Y Y D + + G ET++ +G G FGC DN G
Sbjct: 132 LPIWSRNCTPSSLCRYRYAYGDGAYSAGILGTETLT-LGPSSAPVSVGGVAFGCGTDNGG 190
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT--D 160
++ G +GL R T+S ++QLG +FSYCL N S +L GT +
Sbjct: 191 DSLNS-----TGTVGLGRGTLSLLAQLG---VGKFSYCLTDFF-NSALDSPFL-LGTLAE 240
Query: 161 MGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
+ + Q+T + P N Y++SL+ IS+ + R+ P TFD+ G GG I+DSG+
Sbjct: 241 LAPGPSTVQSTPLLQSPQNPSRYFVSLQGISLGDVRLPIPNGTFDLRGDGTGGMIVDSGT 300
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQL-AQLSDCPEPIQLCYFLPETFNRF-PSMAFYFE- 275
T + ++ + + + A D P C+ P + P + +F
Sbjct: 301 TFTILAESGFREVVGRVARVLGQPPVNASSLDAP-----CFPAPAGEPPYMPDLVLHFAG 355
Query: 276 DANLRIDGENVFIIDYENHFFLLAVA-PHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
A++R+ +N + E+ F L +A + +++G+ QQ++ + ++D + LSF+
Sbjct: 356 GADMRLYRDNYMSYNEEDSSFCLNIAGTTPESTSVLGNFQQQNIQMLFDTTVGQLSFLPT 415
Query: 335 NCS 337
+CS
Sbjct: 416 DCS 418
>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
Length = 350
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 101/359 (28%), Positives = 157/359 (43%), Gaps = 43/359 (11%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
R+ IGTP++ ++LDTGS +++ IF+P S SF + CD C+
Sbjct: 10 TRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQADPIFNPSSSVSFSTVGCDSAVCS 69
Query: 48 YFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
C C+Y + Y D S T G A ET++ G GC +DN G
Sbjct: 70 QLDANDCHGGGCLYEVSYGDGSYTVGSYATETLTF-----GTTSIQNVAIGCGHDNVGLF 124
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
A ++SF +QLG+ + FSYCLV +S L+FG +
Sbjct: 125 VGAAGLLGL-----GAGSLSFPAQLGTQTGRAFSYCLV---DRDSESSGTLEFGPE---S 173
Query: 165 RP-STQATKFINHP--NNFYYLSLKDISIDNERMN-FPPDTFDI-TVSGEGGCIIDSGSV 219
P + T + +P FYYLS+ IS+ ++ P + F I +G GG IIDSG+
Sbjct: 174 VPIGSIFTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGIIIDSGTA 233
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFED-A 277
+T + Y L + F++ + A D CY L + P++ F+F + A
Sbjct: 234 VTRLQTSAYDALRDAFIAGTQHLPRA---DGISIFDTCYDLSALQSVSIPAVGFHFSNGA 290
Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ +N I F A AP D ++++G+ QQ+ R +D L+ F + C
Sbjct: 291 GFILPAKNCLIPMDSMGTFCFAFAPADSNLSIMGNIQQQGIRVSFDSANSLVGFAIDQC 349
>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
Length = 496
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 101/359 (28%), Positives = 156/359 (43%), Gaps = 43/359 (11%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
R+ IGTP++ ++LDTGS +++ IF+P S SF + CD C+
Sbjct: 156 TRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQADPIFNPSSSVSFSTVGCDSAVCS 215
Query: 48 YFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
C C+Y + Y D S T G A ET++ G GC +DN G
Sbjct: 216 QLDANDCHGGGCLYEVSYGDGSYTVGSYATETLTF-----GTTSIQNVAIGCGHDNVGLF 270
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD---M 161
A ++SF +QLG+ + FSYCLV +S L+FG + +
Sbjct: 271 VGAAGLLGL-----GAGSLSFPAQLGTQTGRAFSYCLV---DRDSESSGTLEFGPESVPI 322
Query: 162 GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMN-FPPDTFDI-TVSGEGGCIIDSGSV 219
G A F+ FYYLS+ IS+ ++ P + F I +G GG IIDSG+
Sbjct: 323 GSIFTPLVANPFL---PTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGIIIDSGTA 379
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFED-A 277
+T + Y L + F++ + A D CY L + P++ F+F + A
Sbjct: 380 VTRLQTSAYDALRDAFIAGTQHLPRA---DGISIFDTCYDLSALQSVSIPAVGFHFSNGA 436
Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ +N I F A AP D ++++G+ QQ+ R +D L+ F + C
Sbjct: 437 GFILPAKNCLIPMDSMGTFCFAFAPADSNLSIMGNIQQQGIRVSFDSANSLVGFAIDQC 495
>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 481
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 106/368 (28%), Positives = 155/368 (42%), Gaps = 58/368 (15%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPDC 46
V + +GTP + + L+ DTGS L + AIFDP KSSS+ I C C
Sbjct: 138 VVVGLGTPKRDLSLVFDTGSDLTWTQCEPCAGSCYKQQDAIFDPSKSSSYINITCTSSLC 197
Query: 47 TYF-------KCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
T +C + C+Y ++Y D+S + GF + E +++ I LFGC
Sbjct: 198 TQLTSAGIKSRCSSSTTACIYGIQYGDKSTSVGFLSQERLTITATD----IVDDFLFGCG 253
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
DN G + AG++GL R ISF+ Q SI K FSYC LP+ + +L F
Sbjct: 254 QDNEGLFSGS-----AGLIGLGRHPISFVQQTSSIYNKIFSYC----LPSTSSSLGHLTF 304
Query: 158 G------TDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGG 211
G ++ Y ST I+ N FY L + IS+ ++ P T S GG
Sbjct: 305 GASAATNANLKYTPLST-----ISGDNTFYGLDIVGISVGGTKL---PAVSSSTFSA-GG 355
Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSM 270
IIDSG+V+T Y L F E++ +A CY F P +
Sbjct: 356 SIIDSGTVITRLAPTAYAALRSAFRQGMEKYPVANEDGL---FDTCYDFSGYKEISVPKI 412
Query: 271 AFYFEDA-NLRIDGENVFIIDYENHFFL-LAVAPHDDLVALIGSQQQRDTRFVYDLNIDL 328
F F + + + I L A +D+ + + G+ QQ+ VYD+
Sbjct: 413 DFEFAGGVTVELPLVGILIGRSAQQVCLAFAANGNDNDITIFGNVQQKTLEVVYDVEGGR 472
Query: 329 LSFVKENC 336
+ F C
Sbjct: 473 IGFGAAGC 480
>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 473
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 98/365 (26%), Positives = 157/365 (43%), Gaps = 52/365 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
+V + +GTP K + LI DTGS L + +F P +S+++ I+C PD
Sbjct: 132 IVSVGLGTPKKYLSLIFDTGSDLTWTQCQPCARYCYNQKDPVFVPSQSTTYSNISCSSPD 191
Query: 46 CTYFKC---------VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
C+ + C+Y ++Y DQS + G+ A ET+++ + LFGC
Sbjct: 192 CSQLESGTGNQPGCSAARACIYGIQYGDQSFSVGYFAKETLTLT----STDVIENFLFGC 247
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
+N G G+ AG++GL + IS + Q + FSYC LP ++ YL
Sbjct: 248 GQNNRGL-----FGSAAGLIGLGQDKISIVKQTAQKYGQVFSYC----LPKTSSSTGYLT 298
Query: 157 FGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
FG G + NFY + + + + ++ F + G IIDS
Sbjct: 299 FGGGGGGGALKYTPITKAHGVANFYGVDIVGMKVGGTQIPISSSVFSTS-----GAIIDS 353
Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCYFLPE-TFNRFPSMAFY 273
G+V+T D Y L S FE+ +A+ PE + CY L + + + P + F
Sbjct: 354 GTVITRLPPDAYSALK----SAFEK-GMAKYPKAPELSILDTCYDLSKYSTIQIPKVGFV 408
Query: 274 FEDA-NLRIDGENVFIIDYENHFFLLAVAPHD-DLVALIGSQQQRDTRFVYDLNIDLLSF 331
F+ L +DG + + L D VA+IG+ QQ+ + VYD+ + F
Sbjct: 409 FKGGEELDLDGIGIMYGASTSQVCLAFAGNQDPSTVAIIGNVQQKTLQVVYDVGGGKIGF 468
Query: 332 VKENC 336
C
Sbjct: 469 GYNGC 473
>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 443
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 100/367 (27%), Positives = 162/367 (44%), Gaps = 45/367 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
++ L IGTP +DTGS LI+ +FDP+ SS++ I C
Sbjct: 60 LMELSIGTPPVKTYAQVDTGSDLIWLQCIPCTNCYKQLNPMFDPQSSSTYSNIAYGSESC 119
Query: 47 TYF---KCVNEQ--CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
+ C +Q C YT Y D S+T+G A ET+++ G +FGC ++N+
Sbjct: 120 SKLYSTSCSPDQNNCNYTYSYEDDSITEGVLAQETLTLTSTTGKPVALKGVIFGCGHNNN 179
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSII-KKRFSYCLVIPLPNGEYTSSYLKFGTD 160
G D G++GL R +S +SQ+GS K FS CLV P +S + FG
Sbjct: 180 GVFNDKE----MGIIGLGRGPLSLVSQIGSSFGGKMFSQCLV-PFHTNPSITSPMSFGKG 234
Query: 161 MGYRRPSTQATKFI--NHPNNFYYLSLKDISIDNERMNFP-PDTFDITVSGEGGCIIDSG 217
+T + N FY+++L IS+ E +N P D + +G +IDSG
Sbjct: 235 SEVLGNGVVSTPLVSKNTHQAFYFVTLLGISV--EDINLPFNDGSSLEPITKGNMVIDSG 292
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPI------QLCYFLPETFNRFPSMA 271
+ T D Y +L E+ + +++ P PI QLCY P + ++
Sbjct: 293 TPTTLLPEDFYHRLVEEVRN--------KVALDPIPIDPTLGYQLCYRTPTNL-KGTTLT 343
Query: 272 FYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSF 331
+FE A++ + +FI + F + + + G+ Q + +DL L+SF
Sbjct: 344 AHFEGADVLLTPTQIFIPVQDGIFCFAFTSTFSNEYGIYGNHAQSNYLIGFDLEKQLVSF 403
Query: 332 VKENCSD 338
+C++
Sbjct: 404 KATDCTN 410
>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 455
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 105/380 (27%), Positives = 166/380 (43%), Gaps = 55/380 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA----------------------IFDPRKSSSFQK 38
++ L IGTP I DTGS LI+ +++P S++F
Sbjct: 88 IMTLSIGTPPLSYRAIADTGSDLIWTQCAPCGDTVTDTDNQCFKQSGCLYNPSSSTTFGV 147
Query: 39 INCDHP--DCTYFKCVNE----QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAI-FHG 91
+ C+ P C + C+Y Y T G + ET + A+
Sbjct: 148 LPCNSPLSMCAAMAGPSPPPGCACMYNQTYG-TGWTAGVQSVETFTFGSSSTPPAVRVPN 206
Query: 92 ALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT 151
FGCSN + + +G+ AG++GL R ++S +SQLG+ FSYCL P + T
Sbjct: 207 IAFGCSNAS----SNDWNGS-AGLVGLGRGSMSLVSQLGA---GAFSYCLT-PFQDANST 257
Query: 152 SSYLKFGTDMGYRRPST---QATKFINHPN-----NFYYLSLKDISIDNERMNFPPDTFD 203
S+ L G T ++T F+ P+ +YYL+L IS+ + PPD F
Sbjct: 258 STLL-LGPSAAAALKGTGPVRSTPFVAGPSKAPMSTYYYLNLTGISVGETALAIPPDAFS 316
Query: 204 ITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYF-ERFQLAQLSDCPEPIQLCYFLPE 262
+ G GG IIDSG+ +T Y ++ S R LA D + LC+ L
Sbjct: 317 LRADGTGGLIIDSGTTITTLVDSAYQQVRAAVRSLLVTRLPLAHGPDHSTGLDLCFALKA 376
Query: 263 TF--NRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHD-DLVALIGSQQQRDT 318
+ PSM +FE A++ + EN I+ + + LA+ ++++G+ QQ++
Sbjct: 377 STPPPAMPSMTLHFEGGADMVLPVENYMILG--SGVWCLAMRNQTVGAMSMVGNYQQQNI 434
Query: 319 RFVYDLNIDLLSFVKENCSD 338
+YD+ + LSF CS
Sbjct: 435 HVLYDVRKETLSFAPAVCSS 454
>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
Length = 471
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 102/366 (27%), Positives = 151/366 (41%), Gaps = 59/366 (16%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPDC 46
V + +GTP K L+ DTGS L + FDP KS+S++ ++C C
Sbjct: 134 VTVGLGTPKKDFSLLFDTGSDLTWTQCEPCSGGCFPQNDEKFDPTKSTSYKNLSCSSEPC 193
Query: 47 TYFKCVNEQ-------CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
+ Q C+Y +KY T GF A ET+++ +F + GC
Sbjct: 194 KSIGKESAQGCSSSNSCLYGVKYG-TGYTVGFLATETLTITPS----DVFENFVIGCGER 248
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
N G R AG+LGL R ++ SQ S K FSYC LP ++ +L FG
Sbjct: 249 NGG-----RFSGTAGLLGLGRSPVALPSQTSSTYKNLFSYC----LPASSSSTGHLSFGG 299
Query: 160 DMGYRRPSTQATKF---INHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
+ +QA KF + Y L + IS+ ++ P F G IIDS
Sbjct: 300 GV------SQAAKFTPITSKIPELYGLDVSGISVGGRKLPIDPSVFRTA-----GTIIDS 348
Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN---RFPSMAFY 273
G+ LTY S + L F + L + + +Q CY + N P ++ +
Sbjct: 349 GTTLTYLPSTAHSALSSAFQEMMTNYTLTKGT---SGLQPCYDFSKHANDNITIPQISIF 405
Query: 274 FEDA-NLRIDGENVFIIDYENHFFLLAVA--PHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
FE + ID +FI LA +D VA+ G+ QQ+ VYD+ ++
Sbjct: 406 FEGGVEVDIDDSGIFIAANGLEEVCLAFKDNGNDTDVAIFGNVQQKTYEVVYDVAKGMVG 465
Query: 331 FVKENC 336
F C
Sbjct: 466 FAPGGC 471
>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 477
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 110/394 (27%), Positives = 157/394 (39%), Gaps = 72/394 (18%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
+V L +GTP + V L LDTGS L++ + DP SS+ + CD P
Sbjct: 95 LVHLSVGTPPRPVALTLDTGSDLVWTQCAPCLNCFDQGAIPVLDPAASSTHAAVRCDAPV 154
Query: 46 CTYFKCVN----------EQCVYTMKYADQSVTKGFAAHETISVIGKGE---GKAIFHGA 92
C + CVY Y D+S+T G A + + G G+ G +
Sbjct: 155 CRALPFTSCGRGGSSWGERSCVYVYHYGDKSITVGKLASDRFT-FGPGDNADGGGVSERR 213
Query: 93 L-FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT 151
L FGC + N G + G+ G R S SQLG FSYC E T
Sbjct: 214 LTFGCGHFNKGIFQANE----TGIAGFGRGRWSLPSQLGVT---SFSYCFTSMF---EST 263
Query: 152 SSYLKFGTDMG--YRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVS 207
SS + G + Q+T + P+ + Y+LSLK I++ R+ P +
Sbjct: 264 SSLVTLGVAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKAITVGATRIPIPERRQRLR-- 321
Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-- 265
E IIDSG+ +T DVY + +FV+ + L + + LC+ LP
Sbjct: 322 -EASAIIDSGASITTLPEDVYEAVKAEFVA---QVGLPVSAVEGSALDLCFALPSAAAPK 377
Query: 266 ----------------RFPSMAFYFED-ANLRIDGENVFIIDYENHFFLL---AVAPHDD 305
R P + F+ A+ + EN DY L A D
Sbjct: 378 SAFGWRWRGRGRAMPVRVPRLVFHLGGGADWELPRENYVFEDYGARVMCLVLDAATGGGD 437
Query: 306 LVALIGSQQQRDTRFVYDLNIDLLSFVKENCSDD 339
+IG+ QQ++T VYDL D+LSF C D
Sbjct: 438 QTVVIGNYQQQNTHVVYDLENDVLSFAPARCECD 471
>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 449
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 103/363 (28%), Positives = 165/363 (45%), Gaps = 48/363 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQ---KINCDH 43
M + IG P L+++DTGS +++ +FDP SS+F K CD
Sbjct: 102 MANISIGQPPIPQLVVMDTGSDILWVMCTPCTNCDNHLGLLFDPSMSSTFSPLCKTPCDF 161
Query: 44 PDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
C+ +C + +T+ YAD S G +T+ EG + LFGC H
Sbjct: 162 KGCS--RC--DPIPFTVTYADNSTASGMFGRDTVVFETTDEGTSRIPDVLFGC---GHNI 214
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV-IPLPNGEYTSSYLKFGTDM- 161
+D D G+LGL+ S +++G ++FSYC+ + P Y L G D+
Sbjct: 215 GQDT-DPGHNGILGLNNGPDSLATKIG----QKFSYCIGDLADPYYNYHQLILGEGADLE 269
Query: 162 GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
GY P F H N FYY++++ IS+ +R++ P+TF++ + GG IID+GS +T
Sbjct: 270 GYSTP------FEVH-NGFYYVTMEGISVGEKRLDIAPETFEMKKNRTGGVIIDTGSTIT 322
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF--LPETFNRFPSMAFYFED-AN 278
+ V+ +L K V + Q + P C++ + FP + F+F D A+
Sbjct: 323 FLVDSVH-RLLSKEVRNLLGWSFRQTTIEKSPWMQCFYGSISRDLVGFPVVTFHFADGAD 381
Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDL-----VALIGSQQQRDTRFVYDLNIDLLSFVK 333
L +D F ++ F + V P L +LIG Q+ YDL + F +
Sbjct: 382 LALD-SGSFFNQLNDNVFCMTVGPVSSLNLKSKPSLIGLLAQQSYSVGYDLVNQFVYFQR 440
Query: 334 ENC 336
+C
Sbjct: 441 IDC 443
>gi|15226358|ref|NP_180389.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4803959|gb|AAD29831.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252998|gb|AEC08092.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 756
Score = 114 bits (285), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 98/354 (27%), Positives = 164/354 (46%), Gaps = 40/354 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+++L +GTP ++ +DTGS +I+ IFDP KSS+F++ C+
Sbjct: 422 LMKLQVGTPPFEIVAEIDTGSDIIWTQCMPCPNCYSQFAPIFDPSKSSTFREQRCN---- 477
Query: 47 TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
C Y + YAD++ +KG A ET+++ + GC DN
Sbjct: 478 ------GNSCHYEIIYADKTYSKGILATETVTIPSTSGEPFVMAETKIGCGLDNTNLQYS 531
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
+ +G++GL+ +S ISQ+ SYC +G+ TS + FGT+
Sbjct: 532 GFASSSSGIVGLNMGPLSLISQMDLPYPGLISYCF-----SGQGTSK-INFGTNAIVAGD 585
Query: 167 STQATK-FINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHS 225
T A FI N FYYL+L +S+++ + F + +G IDSG+ LTYF
Sbjct: 586 GTVAADMFIKKDNPFYYLNLDAVSVEDNLIATLGTPFH---AEDGNIFIDSGTTLTYFPM 642
Query: 226 DVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE-DANLRIDGE 284
Y L + V + ++ D LCY+ +T + FP + +F A+L +D
Sbjct: 643 S-YCNLVREAVE--QVVTAVKVPDMGSDNLLCYY-SDTIDIFPVITMHFSGGADLVLDKY 698
Query: 285 NVFIIDYENHFFLLAVAPHD-DLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
N+++ F LA+ +D + A+ G++ Q + YD + +++SF NCS
Sbjct: 699 NMYLETITGGIFCLAIGCNDPSMPAVFGNRAQNNFLVGYDPSSNVISFSPTNCS 752
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 101/351 (28%), Positives = 157/351 (44%), Gaps = 50/351 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+++L +GTP + +DTGS LI+ IFDP KSS+F +
Sbjct: 83 LMKLQVGTPPFEIAAEIDTGSDLIWTQCMPCPDCYSQFDPIFDPSKSSTFNEQ------- 135
Query: 47 TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
+C + C Y + Y D + +KG A ET+++ + GC N D
Sbjct: 136 ---RCHGKSCHYEIIYEDNTYSKGILATETVTIHSTSGEPFVMAETTIGCGLHNTDLDNS 192
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
+ +G++GL+ S ISQ+ SYC +G+ TS + FGT+
Sbjct: 193 GFASSSSGIVGLNMGPRSLISQMDLPYPGLISYCF-----SGQGTSK-INFGTNAIVAGD 246
Query: 167 STQATK-FINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHS 225
T A FI N FYYL+L +S+++ R+ F + +G +IDSGS +TYF
Sbjct: 247 GTVAADMFIKKDNPFYYLNLDAVSVEDNRIETLGTPFH---AEDGNIVIDSGSTVTYFPV 303
Query: 226 DVYWKLHEKFVSYFERFQLAQLSDCPEPI---QLCYFLPETFNRFPSMAFYFE-DANLRI 281
Y L K V Q+ P+P LCYF ET + FP + +F A+L +
Sbjct: 304 S-YCNLVRKAVE-----QVVTAVRVPDPSGNDMLCYF-SETIDIFPVITMHFSGGADLVL 356
Query: 282 DGENVFIIDYENHFFLLAV---APHDDLVALIGSQQQRDTRFVYDLNIDLL 329
D N+++ F LA+ +P + A+ G++ Q + YD + LL
Sbjct: 357 DKYNMYMESNSGGLFCLAIICNSPTQE--AIFGNRAQNNFLVGYDSSSLLL 405
>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
[Cucumis sativus]
Length = 384
Score = 114 bits (285), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 101/360 (28%), Positives = 157/360 (43%), Gaps = 43/360 (11%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
R+ +GTP K V ++LDTGS +++ +F+P KS SF K+ C P C
Sbjct: 44 TRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCRTPLCR 103
Query: 48 YFK---CVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
+ C Q C+Y + Y D S T G ET++ + GC +DN G
Sbjct: 104 RLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFR-----RTKVEQVALGCGHDNEGL 158
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
A R +SF SQ G ++FSYCLV + + +S + FG
Sbjct: 159 FVGAAGLLGL-----GRGGLSFPSQAGRTFNQKFSYCLVDRSASSKPSS--VVFGNSAVS 211
Query: 164 RRPSTQATKFINHP--NNFYYLSLKDISIDNERMN-FPPDTFDITVSGEGGCIIDSGSVL 220
R + + T + +P + FYY+ L IS+ ++ F + +G GG IID G+ +
Sbjct: 212 R--TARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTSV 269
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCYFLP-ETFNRFPSMAFYFEDA 277
T + Y L + F R + L PE CY L +T + P++ +F A
Sbjct: 270 TRLNKPAYIALRDAF-----RAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGA 324
Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
++ + N I + F A A +++IG+ QQ+ R VYDL + F C+
Sbjct: 325 DVSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA 384
>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
Length = 482
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 102/368 (27%), Positives = 166/368 (45%), Gaps = 55/368 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
+V GTP+K LLI+DTGS L + AIF+P++SSS++ + C C
Sbjct: 138 IVTAGFGTPAKNSLLIIDTGSDLTWIQCKPCADCYSQVDAIFEPKQSSSYKTLPCLSATC 197
Query: 47 TYF--------KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSN 98
T C+ CVY + Y D S ++G + ET+++ G F FGC +
Sbjct: 198 TELITSESNPTPCLLGGCVYEINYGDGSSSQGDFSQETLTL-----GSDSFQNFAFGCGH 252
Query: 99 DNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
N G + + +G+LGL + ++SF SQ S +F+YC LP+ ++S F
Sbjct: 253 TNTGLFKGS-----SGLLGLGQNSLSFPSQSKSKYGGQFAYC----LPDFGSSTSTGSFS 303
Query: 159 TDMGYRRPSTQATKFIN---HPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
G S T ++ +P FY++ L IS+ +R++ PP V G G I+D
Sbjct: 304 VGKGSIPASAVFTPLVSNFMYP-TFYFVGLNGISVGGDRLSIPP-----AVLGRGSTIVD 357
Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYF 274
SG+V+T Y L F S A+ + CY L R P++ F+F
Sbjct: 358 SGTVITRLLPQAYNALKTSFRSKTRDLPSAKPFSI---LDTCYDLSRHSQVRIPTITFHF 414
Query: 275 E-DANLRIDGENVFIIDYENH----FFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
+ +A++ + + ++ +N A A D +IG+ QQ+ R +D +
Sbjct: 415 QNNADVAVSDVGI-LVPVQNGGSQVCLAFASASQMDGFNIIGNFQQQRMRVAFDTGAGRI 473
Query: 330 SFVKENCS 337
F +C+
Sbjct: 474 GFASGSCA 481
>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
Length = 538
Score = 114 bits (284), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 102/362 (28%), Positives = 162/362 (44%), Gaps = 49/362 (13%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
R+ +GTP + ++LDTGS +++ IF+P S+SF + C+ C+
Sbjct: 199 TRIGVGTPMREQYMVLDTGSDVVWIQCEPCSKCYSQVDPIFNPSLSASFSTLGCNSAVCS 258
Query: 48 Y---FKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
Y + C C+Y + Y D S T G A E ++ G GC +DN G
Sbjct: 259 YLDAYNCHGGGCLYKVSYGDGSYTIGSFATEMLTF-----GTTSVRNVAIGCGHDNAGLF 313
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD---M 161
A +SF SQLG+ + FSYCLV +S L+FG + +
Sbjct: 314 VGAAGLLGL-----GAGLLSFPSQLGTQTGRAFSYCLVDRF---SESSGTLEFGPESVPL 365
Query: 162 GYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMN-FPPDTFDI-TVSGEGGCIIDSG 217
G + T + +P+ FYY+ L IS+ ++ PPD F I SG GG I+DSG
Sbjct: 366 G-----SILTPLLTNPSLPTFYYVPLISISVGGALLDSVPPDVFRIDETSGRGGFIVDSG 420
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPE-TFNRFPSMAFYFED 276
+ +T + VY + + FV+ + A+ CY L P++ F+F +
Sbjct: 421 TAVTRLQTPVYDAVRDAFVAGTRQLPKAEGVSI---FDTCYDLSGLPLVNVPTVVFHFSN 477
Query: 277 -ANLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
A+L + +N I +D+ F A AP ++++G+ QQ+ R +D L+ F
Sbjct: 478 GASLILPAKNYMIPMDFMGT-FCFAFAPATSDLSIMGNIQQQGIRVSFDTANSLVGFALR 536
Query: 335 NC 336
C
Sbjct: 537 QC 538
>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 114 bits (284), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 109/358 (30%), Positives = 161/358 (44%), Gaps = 47/358 (13%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
+R+ IG P ++LDTGS + + IFDP S+S+ I CD P C
Sbjct: 151 LRVGIGKPPSQAYVVLDTGSDVSWIQCAPCSECYQQSDPIFDPISSNSYSPIRCDEPQCK 210
Query: 48 YF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
+C N C+Y + Y D S T G A ET+++ G A GC ++N G
Sbjct: 211 SLDLSECRNGTCLYEVSYGDGSYTVGEFATETVTL-----GSAAVENVAIGCGHNNEGLF 265
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS-SYLKFGTDMGY 163
A G LS F +Q+ + FSYCLV N + + S L+F + +
Sbjct: 266 VGAAGLLGLGGGKLS-----FPAQVNAT---SFSYCLV----NRDSDAVSTLEFNSPL-- 311
Query: 164 RRPSTQATK-FINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
P AT + +P + FYYL LK IS+ E + P +F++ G GG IIDSG+ +
Sbjct: 312 --PRNAATAPLMRNPELDTFYYLGLKGISVGGEALPIPESSFEVDAIGGGGIIIDSGTAV 369
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYF-EDAN 278
T S+VY L + FV + A CY L + P+++F F E
Sbjct: 370 TRLRSEVYDALRDAFVKGAKGIPKANGVSL---FDTCYDLSSRESVEIPTVSFRFPEGRE 426
Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
L + N I F A AP +++IG+ QQ+ TR +D+ L+ F ++C
Sbjct: 427 LPLPARNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVGFDIANSLVGFSVDSC 484
>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 351
Score = 114 bits (284), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 100/365 (27%), Positives = 163/365 (44%), Gaps = 50/365 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++++ +GTP + I+DTGS L + +F P SSS+ +C C
Sbjct: 9 VLQISLGTPPQQFSAIVDTGSDLCWVQCAPCARCFEQPDPLFIPLASSSYSNASCTDSLC 68
Query: 47 TYFK----CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
+ C Y+ Y D S T+G A ET+++ G + FGC ++ G
Sbjct: 69 DALPRPTCSMRNTCTYSYSYGDGSNTRGDFAFETVTLNGSTLARI-----GFGCGHNQEG 123
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
A G++GL + +S SQL S FSYCLV G + S + FG
Sbjct: 124 TFAGAD-----GLIGLGQGPLSLPSQLNSSFTHIFSYCLVDQSTTGTF--SPITFGNAAE 176
Query: 163 YRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
R S T + + +N +YY+ ++ IS+ N R+ PP F I +G GG I+DSG+ +
Sbjct: 177 NSRAS--FTPLLQNEDNPSYYYVGVESISVGNRRVPTPPSAFRIDANGVGGVILDSGTTI 234
Query: 221 TYFHSDVYWKLHEKFVSYFE--RFQLAQLSDCPEP--IQLCYFLPETFNR---FPSMAFY 273
T YW+L F+ R Q++ P P + LCY + PSM +
Sbjct: 235 T------YWRL-AAFIPILAELRRQISYPEADPTPYGLNLCYDISSVSASSLTLPSMTVH 287
Query: 274 FEDANLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
+ + I N+++ +D A++ D ++IG+ QQ++ V D+ + F+
Sbjct: 288 LTNVDFEIPVSNLWVLVDNFGETVCTAMSTSDQF-SIIGNVQQQNNLIVTDVANSRVGFL 346
Query: 333 KENCS 337
+CS
Sbjct: 347 ATDCS 351
>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 479
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 99/355 (27%), Positives = 153/355 (43%), Gaps = 43/355 (12%)
Query: 3 RLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTY 48
R+ IG PS V ++LDTGS + + IF+P S+S+ ++CD C
Sbjct: 147 RVGIGKPSSPVYMVLDTGSDVNWIQCAPCADCYHQADPIFEPASSTSYSPLSCDTKQCQS 206
Query: 49 F---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
+C N C+Y + Y D S T G ETI++ G A GC ++N G
Sbjct: 207 LDVSECRNNTCLYEVSYGDGSYTVGDFVTETITL-----GSASVDNVAIGCGHNNEGLFI 261
Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
A G LS F SQ+ + FSYCLV ++S L+F + +
Sbjct: 262 GAAGLLGLGGGKLS-----FPSQINA---SSFSYCLV---DRDSDSASTLEFNSAL---L 307
Query: 166 PSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
P + + + FYY+ + +S+ E ++ P F++ SG GG IIDSG+ +T
Sbjct: 308 PHAITAPLLRNRELDTFYYVGMTGLSVGGELLSIPESMFEMDESGNGGIIIDSGTAVTRL 367
Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP-ETFNRFPSMAFYFEDAN-LRI 281
+ Y L + FV + + CY L +T P++ F+ L +
Sbjct: 368 QTAAYNALRDAFVKGTKDLPVTSEVAL---FDTCYDLSRKTSVEVPTVTFHLAGGKVLPL 424
Query: 282 DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
N I + F A AP +++IG+ QQ+ TR +DL L+ F C
Sbjct: 425 PATNYLIPVDSDGTFCFAFAPTSSALSIIGNVQQQGTRVGFDLANSLVGFEPRQC 479
>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
Length = 379
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 87/287 (30%), Positives = 125/287 (43%), Gaps = 33/287 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+V L IGTP I+DTGS LI+ FD +KS++++ + C C
Sbjct: 90 LVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRC 149
Query: 47 TYFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
C + CVY Y D + T G A+ET + K FGC + N G
Sbjct: 150 ASLSSPSCFKKMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLNAGD 209
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPL---PNGEYTSSYLKFGTD 160
++ +G++G R +S +SQLG RFSYCL L P+ Y Y +
Sbjct: 210 LANS-----SGMVGFGRGPLSLVSQLG---PSRFSYCLTSYLSATPSRLYFGVYANLSST 261
Query: 161 MGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
Q+T F+ +P N Y+LSLK IS+ + + P F I G GG IIDSG+
Sbjct: 262 NTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGT 321
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN 265
+T+ D Y + VS L ++D + C+ P N
Sbjct: 322 SITWLQQDAYEAVRRGLVSAIP---LTAMNDTDIGLDTCFQWPPPPN 365
>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 543
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 114/377 (30%), Positives = 164/377 (43%), Gaps = 51/377 (13%)
Query: 4 LFIGTPSKGVLLILDTGSALIYAIFDP--------------RKSSSFQKINCDHPDCT-- 47
+F+GTP K V LILDTGS L + DP + SS+++ I+C P C
Sbjct: 175 MFVGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQNGSHYYPKDSSTYRNISCYDPRCQLV 234
Query: 48 -------YFKCVNEQCVYTMKYADQSVTKGFAAHETISV-IGKGEGKAIFH---GALFGC 96
+ K N+ C Y YAD S T G A ET +V + GK F +FGC
Sbjct: 235 SSSDPLQHCKAENQTCPYFYDYADGSNTTGDFASETFTVNLTWPNGKEKFKQVVDVMFGC 294
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
+ N GF A +G+LGL R ISF SQ+ SI FSYCL N SS L
Sbjct: 295 GHWNKGFFYGA-----SGLLGLGRGPISFPSQIQSIYGHSFSYCLTDLFSNTS-VSSKLI 348
Query: 157 FGTDM----GYRRPSTQATKFINHPN-NFYYLSLKDISIDNERMNFPPDTFDIT-----V 206
FG D + T P+ FYYL +K I + E ++ T+ +
Sbjct: 349 FGEDKELLNNHNLNFTTLLAGEETPDETFYYLQIKSIMVGGEVLDISEQTWHWSSEGAAA 408
Query: 207 SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR 266
GG IIDSGS LT+F Y + E F ++ +L Q++ + CY + +
Sbjct: 409 DAGGGTIIDSGSTLTFFPDSAYDIIKEAFE---KKIKLQQIAADDFVMSPCYNVSGAMMQ 465
Query: 267 --FPSMAFYFEDANL-RIDGENVFIIDYENHFFLLAV--APHDDLVALIGSQQQRDTRFV 321
P +F D + EN F + LA+ P+ + +IG+ Q++ +
Sbjct: 466 VELPDFGIHFADGGVWNFPAENYFYQYEPDEVICLAIMKTPNHSHLTIIGNLLQQNFHIL 525
Query: 322 YDLNIDLLSFVKENCSD 338
YD+ L + C++
Sbjct: 526 YDVKRSRLGYSPRRCAE 542
>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 420
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 99/359 (27%), Positives = 170/359 (47%), Gaps = 36/359 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ L IGTP + I DTGS L + +FDP+KS++++ I+CD C
Sbjct: 73 LMELSIGTPPFKIYGIADTGSDLTWTSCVPCNNCYKQRNPMFDPQKSTTYRNISCDSKLC 132
Query: 47 ----TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAI-FHGALFGCSNDNH 101
T ++C YT YA ++T+G A ETI+ + +GK++ G +FGC ++N
Sbjct: 133 HKLDTGVCSPQKRCNYTYAYASAAITRGVLAQETIT-LSSTKGKSVPLKGIVFGCGHNNT 191
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSII-KKRFSYCLVIPLPNGEYTSSYLKFGTD 160
G D G++GL +S ISQ+GS KRFS CLV P SS + FG
Sbjct: 192 GGFNDHE----MGIIGLGGGPVSLISQMGSSFGGKRFSQCLV-PFHTDVSVSSKMSFGKG 246
Query: 161 MGYRRPSTQATKFINHPNNF-YYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
+T + + Y+++L IS++N ++F + ++ +G +DSG+
Sbjct: 247 SKVSGKGVVSTPLVAKQDKTPYFVTLLGISVENTYLHFNGSSQNVE---KGNMFLDSGTP 303
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-PIQLCYFLPETFNRFPSMAFYFEDAN 278
T + +Y ++ + S + ++D P+ QLCY R P + +FE A+
Sbjct: 304 PTILPTQLYDQVVAQVRS---EVAMKPVTDDPDLGPQLCYRTKNNL-RGPVLTAHFEGAD 359
Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+++ FI + F L D + G+ Q + +DL+ ++SF ++C+
Sbjct: 360 VKLSPTQTFISPKDGVFCLGFTNTSSD-GGVYGNFAQSNYLIGFDLDRQVVSFKPKDCT 417
>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 438
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 104/357 (29%), Positives = 173/357 (48%), Gaps = 37/357 (10%)
Query: 6 IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYFK- 50
+GTP I+DTGS +++ F+P KSSS++ I+C C +
Sbjct: 93 VGTPPIKSYGIVDTGSDIVWLQCEPCEQCYNQTTPKFNPSKSSSYKNISCSSKLCQSVRD 152
Query: 51 --CVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDA 107
C +++ C Y++ Y +QS ++G + ET+++ F + GC +N G +
Sbjct: 153 TSCNDKKNCEYSINYGNQSHSQGDLSLETLTLESTTGRPVSFPKTVIGCGTNNIG----S 208
Query: 108 RDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV---IPLPNGEYTSSYLKFGTDMGYR 164
+GV+GL S I+QLG I +FSYCLV I L N SS L FG
Sbjct: 209 FKRVSSGVVGLGGGPASLITQLGPSIGGKFSYCLVRMSITLKNMSMGSSKLNFGDVAIVS 268
Query: 165 RPSTQATKFINHPNN-FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
+ +T + ++ FYYL+++ S+ ++R+ F + + EG IIDS +++T+
Sbjct: 269 GHNVLSTPIVKKDHSFFYYLTIEAFSVGDKRVEFAGSSKGVE---EGNIIIDSSTIVTFV 325
Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP--ETFNRFPSMAFYFEDANLRI 281
SDVY KL+ V + L ++ D + LCY + E ++ FP M +F+ A++ +
Sbjct: 326 PSDVYTKLNSAIV---DLVTLERVDDPNQQFSLCYNVSSDEEYD-FPYMTAHFKGADILL 381
Query: 282 DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCSD 338
N F ++ A AP + A+ GS Q+D YDL +SF +C++
Sbjct: 382 YATNTF-VEVARDVLCFAFAPSNG-GAIFGSFSQQDFMVGYDLQQKTVSFKSVDCTE 436
>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 471
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 101/360 (28%), Positives = 157/360 (43%), Gaps = 43/360 (11%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
R+ +GTP K V ++LDTGS +++ +F+P KS SF K+ C P C
Sbjct: 131 TRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCRTPLCR 190
Query: 48 YFK---CVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
+ C Q C+Y + Y D S T G ET++ + GC +DN G
Sbjct: 191 RLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFR-----RTKVEQVALGCGHDNEGL 245
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
A R +SF SQ G ++FSYCLV + + +S + FG
Sbjct: 246 FVGAAGLLGL-----GRGGLSFPSQAGRTFNQKFSYCLVDRSASSKPSS--VVFGNSAVS 298
Query: 164 RRPSTQATKFINHP--NNFYYLSLKDISIDNERMN-FPPDTFDITVSGEGGCIIDSGSVL 220
R + + T + +P + FYY+ L IS+ ++ F + +G GG IID G+ +
Sbjct: 299 R--TARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTSV 356
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCYFLP-ETFNRFPSMAFYFEDA 277
T + Y L + F R + L PE CY L +T + P++ +F A
Sbjct: 357 TRLNKPAYIALRDAF-----RAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGA 411
Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
++ + N I + F A A +++IG+ QQ+ R VYDL + F C+
Sbjct: 412 DVSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA 471
>gi|357118074|ref|XP_003560784.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial
[Brachypodium distachyon]
Length = 452
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 103/363 (28%), Positives = 154/363 (42%), Gaps = 50/363 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
+V + G+P++ + DTGS L + +FDP KSSS+ + C +
Sbjct: 113 VVVVGFGSPAQTSATMFDTGSDLSWIQCQPCSGHCYKQHDPVFDPAKSSSYAVVPCGTTE 172
Query: 46 CTYF--KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
C +C CVY ++Y D S T G A ET++ E F G +FGC N G
Sbjct: 173 CAAAGGECNGTTCVYGVEYGDGSSTTGVLARETLTFSSSSE----FTGFIFGCGETNLG- 227
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
D DG L G ++ G I FSYC LP+ T YL G
Sbjct: 228 DFGEVDGLLGLGRGSLSLSSQAAPAFGGI----FSYC----LPSYNTTPGYLSIGATPVT 279
Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
+ Q T +N P+ +FY++ L I+I + PP F T G ++DSG++LT
Sbjct: 280 GQIPVQYTAMVNKPDYPSFYFIELVSINIGGYVLPVPPSEFTKT-----GTLLDSGTILT 334
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCY-FLPETFNRFPSMAFYFEDA- 277
Y Y L ++F +F + P + + CY F ++ P ++F F D
Sbjct: 335 YLPPPAYTALRDRF-----KFTMQGSKPAPPYDELDTCYDFTGQSGILIPGVSFNFSDGA 389
Query: 278 --NLRIDGENVFIIDYENHFFLLAVA--PHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
NL G F D + LA P D +++GS QR +YD+ + F+
Sbjct: 390 VFNLNFFGIMTFPDDTKPAVGCLAFVSRPADMPFSVVGSTTQRSAEVIYDVPAQKIGFIP 449
Query: 334 ENC 336
+C
Sbjct: 450 ASC 452
>gi|357143657|ref|XP_003573000.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Brachypodium distachyon]
Length = 464
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 96/359 (26%), Positives = 145/359 (40%), Gaps = 49/359 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY-----AIFDPRKSSSFQKINCDHPDCTYFK----- 50
++ + IG+P+ + +DTGS + + ++DP SS++ +C P C
Sbjct: 132 VITVSIGSPAVAXTMFIDTGSDVSWLRCKSRLYDPGTSSTYAPFSCSAPACAQLGRRGTG 191
Query: 51 -CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARD 109
CVY++KY D S T G +T+++ G E + G FGCS HGF+ED D
Sbjct: 192 CSSGSTCVYSVKYGDGSNTTGTYGSDTLTLAGTSE--PLISGFQFGCSAVEHGFEEDNTD 249
Query: 110 GALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQ 169
G++GL SF+SQ + FSYC LP +S +L G +
Sbjct: 250 ----GLMGLGGDAQSFVSQTAATYGSAFSYC----LPPTWNSSGFLTLGAPSSSTSAAFS 301
Query: 170 ATKFI--NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDV 227
T + FY L L+ IS+ + + P F G I+DSG+V+T
Sbjct: 302 TTPMLRSKQAATFYGLLLRGISVGGKTLEIPSSVF------SAGSIVDSGTVITRLPPTA 355
Query: 228 YWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF--------NRF--PSMAFYFEDA 277
Y L F R+Q +P L F N F PS+A D
Sbjct: 356 YGALSAAFRDGMARYQY-------QPAAPRGLLDTCFDFTGHGEGNNFTVPSVALVL-DG 407
Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+D I+ ++ A D +IG+ QQR +YD+ + F C
Sbjct: 408 GAVVDLHPNGIV--QDGCLAFAATDDDGRTGIIGNVQQRTFEVLYDVGQSVFGFRPGAC 464
>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 90/357 (25%), Positives = 156/357 (43%), Gaps = 41/357 (11%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
VR+ +G+P + +++D+GS +++ +FDP S+SF ++C C
Sbjct: 45 VRIGLGSPPRSQYMVIDSGSDIVWVQCKPCTQCYHQTDPLFDPADSASFMGVSCSSAVCD 104
Query: 48 YFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
+ C + +C Y + Y D S TKG A ET++ G+ + GC + N G
Sbjct: 105 RVENAGCNSGRCRYEVSYGDGSYTKGTLALETLTF-----GRTVVRNVAIGCGHSNRGMF 159
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
A ++SF+ QL FSYCLV G T+ +L+FG++
Sbjct: 160 VGAAGLLGL-----GGGSMSFMGQLSGQTGNAFSYCLV---SRGTNTNGFLEFGSEA--- 208
Query: 165 RPSTQA-TKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
P A + +P +FYY+ L + + + R+ D F + G GG ++D+G+ +T
Sbjct: 209 MPVGAAWIPLVRNPRAPSFYYIRLLGLGVGDTRVPVSEDVFQLNELGSGGVVMDTGTAVT 268
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFEDANLR 280
F + Y F+ E+ Q + CY L + R P+++FYF +
Sbjct: 269 RFPTVAYEAFRNAFI---EQTQNLPRASGVSIFDTCYNLFGFLSVRVPTVSFYFSGGPIL 325
Query: 281 IDGENVFIIDYENH-FFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
N F+I ++ F A AP ++++G+ QQ + D + + F C
Sbjct: 326 TIPANNFLIPVDDAGTFCFAFAPSPSGLSILGNIQQEGIQISVDEANEFVGFGPNIC 382
>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 98/352 (27%), Positives = 158/352 (44%), Gaps = 50/352 (14%)
Query: 6 IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYFKC 51
+GTP + I DTGS +++ F P KSS+++ I C C
Sbjct: 93 VGTPPFKLYGIADTGSDIVWLQCEPCKECYNQTTPKFKPSKSSTYKNIPCSSDLC----- 147
Query: 52 VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGA 111
+S +G + +T+++ F + GC DN + +GA
Sbjct: 148 -------------KSGQQGNLSVDTLTLESSTGHPISFPKTVIGCGTDN----TVSFEGA 190
Query: 112 LAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQAT 171
+G++GL S I+QLGS I +FSYCL +P P T+S L FG +T
Sbjct: 191 SSGIVGLGGGPASLITQLGSSIDAKFSYCL-LPNPVESNTTSKLNFGDTAVVSGDGVVST 249
Query: 172 KFINH-PNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWK 230
+ P FYYL+L+ S+ N+R+ F + EG IIDSG+ LT +DVY
Sbjct: 250 PIVKKDPIVFYYLTLEAFSVGNKRIEFEGSS---NGGHEGNIIIDSGTTLTVIPTDVYNN 306
Query: 231 LHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDGENVFIID 290
L + E +L +++D LCY + FP + +F+ A++++ + F +D
Sbjct: 307 LES---AVLELVKLKRVNDPTRLFNLCYSVTSDGYDFPIITTHFKGADVKLHPISTF-VD 362
Query: 291 YENHFFLLAVAPH-----DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+ LA A D+V++ G+ Q++ YDL ++SF +CS
Sbjct: 363 VADGIVCLAFATTSAFIPSDVVSIFGNLAQQNLLVGYDLQQKIVSFKPTDCS 414
>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 92/362 (25%), Positives = 154/362 (42%), Gaps = 51/362 (14%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
VR+ +G+P + +++D+GS +++ +FDP S+SF ++C C
Sbjct: 142 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADSASFTGVSCSSSVCD 201
Query: 48 YFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
+ C +C Y + Y D S TKG A ET++ G+ + GC + N G
Sbjct: 202 RLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTF-----GRTMVRSVAIGCGHRNRGMF 256
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
A ++SF+ QLG FSYCLV G +S L FG +
Sbjct: 257 VGAAGLLGL-----GGGSMSFVGQLGGQTGGAFSYCLV---SRGTDSSGSLVFGREA--- 305
Query: 165 RPSTQA-TKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
P+ A + +P +FYY+ L + + R+ + F +T G+GG ++D+G+ +T
Sbjct: 306 LPAGAAWVPLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMDTGTAVT 365
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL-----CY-FLPETFNRFPSMAFYFE 275
+ Y + F LAQ ++ P + CY L R P+++FYF
Sbjct: 366 RLPTLAYQAFRDAF--------LAQTANLPRATGVAIFDTCYDLLGFVSVRVPTVSFYFS 417
Query: 276 DAN-LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
L + N I + F A AP ++++G+ QQ + +D + F
Sbjct: 418 GGPILTLPARNFLIPMDDAGTFCFAFAPSTSGLSILGNIQQEGIQISFDGANGYVGFGPN 477
Query: 335 NC 336
C
Sbjct: 478 IC 479
>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
gi|223949441|gb|ACN28804.1| unknown [Zea mays]
Length = 326
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 104/349 (29%), Positives = 154/349 (44%), Gaps = 50/349 (14%)
Query: 15 LILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYF---KCVNE--Q 55
++LDTGS + + +FDP S+S+ ++CD C C N
Sbjct: 1 MVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSQRCRDLDTAACRNATGA 60
Query: 56 CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGV 115
C+Y + Y D S T G A ET+++ G+ + + A+ GC +DN G AG+
Sbjct: 61 CLYEVAYGDGSYTVGDFATETLTL---GDSTPVGNVAI-GCGHDNEGLFV-----GAAGL 111
Query: 116 LGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFIN 175
L L +SF SQ I FSYCLV +S L+FG G T +
Sbjct: 112 LALGGGPLSFPSQ---ISASTFSYCLV---DRDSPAASTLQFGD--GAAEAGTVTAPLVR 163
Query: 176 HP--NNFYYLSLKDISIDNERMNFPPDTFDI-TVSGEGGCIIDSGSVLTYFHSDVYWKLH 232
P + FYY++L IS+ + ++ P F + SG GG I+DSG+ +T S Y L
Sbjct: 164 SPRTSTFYYVALSGISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQSAAYAALR 223
Query: 233 EKFVS---YFERFQLAQLSDCPEPIQLCYFLPE-TFNRFPSMAFYFEDAN-LRIDGENVF 287
+ FV R L D CY L + T P+++ FE LR+ +N
Sbjct: 224 DAFVQGAPSLPRTSGVSLFD------TCYDLSDRTSVEVPAVSLRFEGGGALRLPAKNYL 277
Query: 288 IIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
I + LA AP + V++IG+ QQ+ TR +D + F C
Sbjct: 278 IPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTARGAVGFTPNKC 326
>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 106/370 (28%), Positives = 155/370 (41%), Gaps = 58/370 (15%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
+RL +GTP+ V ++LDTGS +++ IFDP+KS +F + C C
Sbjct: 140 MRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQSDVIFDPKKSKTFATVPCGSRLCR 199
Query: 48 YF----KCVNEQ---CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL-----FG 95
+CV + C+Y + Y D S T+G + ET++ FHGA G
Sbjct: 200 RLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLT----------FHGARVDHVPLG 249
Query: 96 CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV--IPLPNGEYTSS 153
C +DN G A R +SF SQ S +FSYCLV + S
Sbjct: 250 CGHDNEGLFVGAAGLLGL-----GRGGLSFPSQTKSRYNGKFSYCLVDRTSSGSSSKPPS 304
Query: 154 YLKFGTDMGYRRPSTQA-TKFINHP--NNFYYLSLKDISIDNERM-NFPPDTFDITVSGE 209
+ FG D P T T + +P + FYYL L IS+ R+ F + +G
Sbjct: 305 TIVFGNDA---VPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGN 361
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPE-TFNR 266
GG IIDSG+ +T Y L + F R +L P C+ L T +
Sbjct: 362 GGVIIDSGTSVTRLTQSAYVALRDAF-----RLGATKLKRAPSYSLFDTCFDLSGMTTVK 416
Query: 267 FPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNI 326
P++ F+F + + N I F A A +++IG+ QQ+ R YDL
Sbjct: 417 VPTVVFHFGGGEVSLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVG 476
Query: 327 DLLSFVKENC 336
+ F+ C
Sbjct: 477 SRVGFLSRAC 486
>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 102/363 (28%), Positives = 159/363 (43%), Gaps = 58/363 (15%)
Query: 3 RLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTY 48
R+ +G P+K ++LDTGS + + IFDPR SSSF + C+ C
Sbjct: 158 RVGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRSSSSFASLPCESQQCQA 217
Query: 49 FK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
+ C +C+Y + Y D S T G ET++ G + + GC +DN G
Sbjct: 218 LETSGCRASKCLYQVSYGDGSFTVGEFVTETLTFGNSG----MINDVAVGCGHDNEGL-- 271
Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
G GL + +S + FSYCLV +SS L+F
Sbjct: 272 ------FVGSAGLLGLGGGPLSLTSQMKASSFSYCLV---DRDSSSSSDLEFN------- 315
Query: 166 PSTQATKFINHP-------NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
S + +N P + FYY+ L +S+ + ++ PP+ F + SG GG I+DSG+
Sbjct: 316 -SAAPSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYGGIIVDSGT 374
Query: 219 VLTYFHSDVYWKLHEKFVS---YFERFQLAQLSDCPEPIQLCYFL-PETFNRFPSMAFYF 274
+T + Y L + FVS Y ++ L D CY L ++ P+++F F
Sbjct: 375 AITRLQTQAYNTLRDAFVSRTPYLKKTNGFALFDT------CYDLSSQSRVTIPTVSFEF 428
Query: 275 EDA-NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
+L++ +N I F A AP +++IG+ QQ+ TR YDL ++ F
Sbjct: 429 AGGKSLQLPPKNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVHYDLANSVVGFSP 488
Query: 334 ENC 336
C
Sbjct: 489 HKC 491
>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 102/363 (28%), Positives = 159/363 (43%), Gaps = 58/363 (15%)
Query: 3 RLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTY 48
R+ +G P+K ++LDTGS + + IFDPR SSSF + C+ C
Sbjct: 158 RVGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRSSSSFASLPCESQQCQA 217
Query: 49 FK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
+ C +C+Y + Y D S T G ET++ G + + GC +DN G
Sbjct: 218 LETSGCRASKCLYQVSYGDGSFTVGEFVIETLTFGNSG----MINNVAVGCGHDNEGL-- 271
Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
G GL + +S + FSYCLV +SS L+F
Sbjct: 272 ------FVGSAGLLGLGGGSLSLTSQMKASSFSYCLV---DRDSSSSSDLEFN------- 315
Query: 166 PSTQATKFINHP-------NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
S + +N P + FYY+ L +S+ + ++ PP+ F + SG GG I+DSG+
Sbjct: 316 -SAAPSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYGGIIVDSGT 374
Query: 219 VLTYFHSDVYWKLHEKFVS---YFERFQLAQLSDCPEPIQLCYFL-PETFNRFPSMAFYF 274
+T + Y L + FVS Y ++ L D CY L ++ P+++F F
Sbjct: 375 AITRLQTQAYNTLRDAFVSRTPYLKKTNGFALFDT------CYDLSSQSRVTIPTVSFEF 428
Query: 275 EDA-NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
+L++ +N I F A AP +++IG+ QQ+ TR YDL ++ F
Sbjct: 429 AGGKSLQLPPKNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVHYDLANSVVGFSP 488
Query: 334 ENC 336
C
Sbjct: 489 HKC 491
>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
Length = 339
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 101/352 (28%), Positives = 155/352 (44%), Gaps = 37/352 (10%)
Query: 6 IGTPSKGVLLILDTGSALIYAIFDPRKSSSFQKINCDHP-----DCTYFKC------VNE 54
+GTP V L L+ G+ LI+ +P Q P + C N+
Sbjct: 1 MGTPPNPVKLKLENGNELIWNHSNPSPECFEQAFPYFEPLTFSRGLPFASCGSPKFWPNQ 60
Query: 55 QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAG 114
CVYT Y D+SVT GF + + +G G A G FGC N+G + G
Sbjct: 61 TCVYTYSYGDKSVTTGFLEVDKFTFVGAG---ASVPGVAFGCGLFNNGVFKSNE----TG 113
Query: 115 VLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM-GYRRPSTQATKF 173
+ G R +S SQL FS+C + ++ L D+ + + Q T
Sbjct: 114 IAGFGRGPLSLPSQLKV---GNFSHCFTT-ITGAIPSTVLLDLPADLFSNGQGAVQTTPL 169
Query: 174 INHPNN-----FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVY 228
I + N YYLSLK I++ + R+ P F +T +G GG IIDSG+ +T VY
Sbjct: 170 IQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALT-NGTGGTIIDSGTSITSLPPQVY 228
Query: 229 WKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP-ETFNRFPSMAFYFEDANLRIDGEN-V 286
+ ++F + + +L + C+ P + P + +FE A + + EN V
Sbjct: 229 QVVRDEFAA---QIKLPVVPGNATGHYTCFSAPSQAKPDVPKLVLHFEGATMDLPRENYV 285
Query: 287 FII--DYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
F + D N LA+ D+ +IG+ QQ++ +YDL ++LSFV C
Sbjct: 286 FEVPDDAGNSIICLAINKGDE-TTIIGNFQQQNMHVLYDLQNNMLSFVAAQC 336
>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
Length = 497
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 102/359 (28%), Positives = 163/359 (45%), Gaps = 44/359 (12%)
Query: 3 RLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTY 48
RL +GTP + ++LDTGS +++ +F+P SS+++K+ C P C
Sbjct: 156 RLGVGTPPRYTYMVLDTGSDIMWIQCLPCAKCYGQTDPLFNPAASSTYRKVPCATPLCKK 215
Query: 49 F---KCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
C N++ C Y + Y D S T G + ET++ G+ + GC +DN G
Sbjct: 216 LDISGCRNKRYCEYQVSYGDGSFTVGDFSTETLTFRGQ-----VIRRVALGCGHDNEGLF 270
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
A R ++SF SQ G+ KRFSYCLV +G T+S L FG +
Sbjct: 271 IGAAGLLGL-----GRGSLSFPSQTGAQFSKRFSYCLVDRSASG--TASSLIFGKAAIPK 323
Query: 165 RPSTQATKFINHP--NNFYYLSLKDISIDNERM-NFPPDTFDITVSGEGGCIIDSGSVLT 221
S T +++P + FYY+ L IS+ R+ + P F + +G GG IIDSG+ +T
Sbjct: 324 --SAIFTPLLSNPKLDTFYYVELVGISVGGRRLTSIPASVFRMDATGNGGVIIDSGTSVT 381
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPETFN-RFPSMAFYFE-DA 277
Y + + F R L CY L + P++ F+F+ A
Sbjct: 382 RLVDSAYSTMRDAF-----RVGTGNLKSAGGFSLFDTCYDLSGLKTVKVPTLVFHFQGGA 436
Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
++ + N I + F A A + +++IG+ QQ+ R V+D + + F +C
Sbjct: 437 HISLPATNYLIPVDSSATFCFAFAGNTGGLSIIGNIQQQGYRVVFDSLANRVGFKAGSC 495
>gi|326525377|dbj|BAK07958.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 95/344 (27%), Positives = 153/344 (44%), Gaps = 48/344 (13%)
Query: 27 IFDPRKSSSFQKIN------CDHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVI 80
+FDP KS +F I C P Y N C + + Y D + G+ A +T S
Sbjct: 139 VFDPTKSPTFSNIPAHNTVWCRPP---YQPLANGACGFDIAYRDNTHASGYLARDTFSFP 195
Query: 81 GKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGL-----SRVTISFISQLGSIIKK 135
+ +FGC++ F A+AG+LGL + +F Q+
Sbjct: 196 AGNDDFVPLSAIVFGCAHQTEHFKNQR---AVAGILGLGMGPAGKPPTAFTKQVLPAHGG 252
Query: 136 RFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPST---QATKFI--NHPNNFYYLSLKDISI 190
RFSYC +P G SYL+FG+D+ P Q+T + H + Y++ L +S+
Sbjct: 253 RFSYCPFVP---GMSMYSYLRFGSDIPSHPPPNVHRQSTPVLAPAHNSEAYFVKLAGVSV 309
Query: 191 DNERMN-FPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFER----FQLA 245
R++ P F G GGC++D G+ +T F Y + + +R +
Sbjct: 310 GANRLSGVTPAMFRRNAHGAGGCVVDIGTRMTAFIHSAYVHIDHAVRQHLQRRGAHIVVV 369
Query: 246 QLSDC---PEPIQLCYFLPETFNRFPSMAFYFED-ANLRIDGENVFI--IDYENHFFLLA 299
+ + C P P + PSM +FE+ A LR+ E+VF+ + +H+
Sbjct: 370 RGNTCVQQPAPHH---------DVLPSMTLHFENGAWLRVMPEHVFMPFVVGGHHYQCFG 420
Query: 300 VAPHDDLVALIGSQQQRDTRFVYDLN--IDLLSFVKENCSDDSA 341
DL +IG++QQ + RF++DL+ I ++SF E+C D A
Sbjct: 421 FVSSTDLT-VIGARQQVNHRFIFDLHDTIPIMSFNPEDCHLDGA 463
>gi|15222357|ref|NP_174430.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322538|gb|AAG51267.1|AC027135_8 chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|67633408|gb|AAY78629.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332193236|gb|AEE31357.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 445
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 105/372 (28%), Positives = 165/372 (44%), Gaps = 48/372 (12%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
+ + IGTP V I DTGS L + +FD +KSS+++ +CD C
Sbjct: 87 MSISIGTPPSKVFAIADTGSDLTWVQCKPCQQCYKQNSPLFDKKKSSTYKTESCDSKTCQ 146
Query: 48 YFKCVNEQC-------VYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
E C Y Y D S TKG A ETIS+ F G +FGC +N
Sbjct: 147 ALSEHEEGCDESKDICKYRYSYGDNSFTKGDVATETISIDSSSGSSVSFPGTVFGCGYNN 206
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL--VIPLPNGEYTSSYLKFG 158
G E+ G + L +S +SQLGS I K+FSYCL NG +S + G
Sbjct: 207 GGTFEETGSGIIG----LGGGPLSLVSQLGSSIGKKFSYCLSHTAATTNG---TSVINLG 259
Query: 159 TDMGYRRPS----TQATKFINH-PNNFYYLSLKDISIDNERMNFPPDTFDI---TVSGEG 210
T+ PS T T I P +Y+L+L+ +++ ++ + + + + G
Sbjct: 260 TNSIPSNPSKDSATLTTPLIQKDPETYYFLTLEAVTVGKTKLPYTGGGYGLNGKSSKRTG 319
Query: 211 GCIIDSGSVLTYFHS---DVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRF 267
IIDSG+ LT S D + E+ V+ +R +SD + C+ +
Sbjct: 320 NIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAKR-----VSDPQGLLTHCFKSGDKEIGL 374
Query: 268 PSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNID 327
P++ +F +A++++ N F+ E+ L + + VA+ G+ Q D YDL
Sbjct: 375 PAITMHFTNADVKLSPINAFVKLNEDTVCLSMIPTTE--VAIYGNMVQMDFLVGYDLETK 432
Query: 328 LLSFVKENCSDD 339
+SF + +CS +
Sbjct: 433 TVSFQRMDCSGN 444
>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 469
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 100/361 (27%), Positives = 156/361 (43%), Gaps = 44/361 (12%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
R+ +GTP + V ++LDTGS +++ +FDPRKS SF I C P C
Sbjct: 128 TRIGVGTPPRYVYMVLDTGSDIVWIQCAPCKRCYAQSDPVFDPRKSRSFASIACRSPLCH 187
Query: 48 YF-----KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
+ C+Y + Y D S T G + ET++ + GC +DN G
Sbjct: 188 RLDSPGCNTQKQTCMYQVSYGDGSFTFGDFSTETLTFR-----RTRVARVALGCGHDNEG 242
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
A R +SF SQ G +FSYCLV + + +S + FG
Sbjct: 243 LFVGAAGLLGL-----GRGRLSFPSQTGRRFNHKFSYCLVDRSASSKPSS--MVFGDSAV 295
Query: 163 YRRPSTQATKFINHP--NNFYYLSLKDISIDNERM-NFPPDTFDITVSGEGGCIIDSGSV 219
R + + T +++P + FYY+ L IS+ R+ F + +G GG IIDSG+
Sbjct: 296 SR--TARFTPLVSNPKLDTFYYVELLGISVGGTRVPGITASLFKLDQTGNGGVIIDSGTS 353
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCYFLP-ETFNRFPSMAFYFED 276
+T Y + F R + L P+ C+ L +T + P++ +F
Sbjct: 354 VTRLTRPAYIAFRDAF-----RAGASNLKRAPQFSLFDTCFDLSGKTEVKVPTVVLHFRG 408
Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
A++ + N I + F LA A +++IG+ QQ+ R VYDL + F C
Sbjct: 409 ADVSLPASNYLIPVDTSGNFCLAFAGTMGGLSIIGNIQQQGFRVVYDLAGSRVGFAPHGC 468
Query: 337 S 337
+
Sbjct: 469 A 469
>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
Length = 487
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 100/383 (26%), Positives = 161/383 (42%), Gaps = 64/383 (16%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
+V + IGTP + ++ DTGS L + +FDP KSS++ + C P
Sbjct: 123 VVTIGIGTPPRNFTVLFDTGSDLTWVQCLPCPDSSCYPQQEPLFDPSKSSTYVDVPCSAP 182
Query: 45 DC-----TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
+C +C C Y++KY D+S T G A ET ++ G +FGCS++
Sbjct: 183 ECHIGGVQQTRCGATSCEYSVKYGDESETHGSLAEETFTLSPPSPLAPAATGVVFGCSHE 242
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKR---FSYCLVIPLPNGEYTSSYLK 156
D G +AG+LGL R S +SQ I FSYCL P G T YL
Sbjct: 243 YISVFNDTGMG-VAGLLGLGRGDSSILSQTRRSINSGGGVFSYCLP---PRGSST-GYLT 297
Query: 157 FGTDMGYRRPSTQATKF--------INHPNNFYYLSLKDISIDNERMNFPPDTFDITVSG 208
G G P Q + I+ + Y ++L +S++ ++ P F +
Sbjct: 298 IGG--GAAAPQQQYSNLSFTPLITTISQLRSAYVVNLAGVSVNGAAVDIPASAFSL---- 351
Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP----IQLCY-FLPET 263
G +IDSG+V+T+ + Y+ L ++F R + PE + CY +
Sbjct: 352 --GAVIDSGTVVTHMPAAAYYPLRDEF-----RLHMGSYKMLPEGSMKLLDTCYDVTGQD 404
Query: 264 FNRFPSMAFYF-EDANLRIDGENVFII----DYENHFFLLA----VAPHDDLVALIGSQQ 314
P +A F A + +D + ++ D LA + + + ++G+ Q
Sbjct: 405 VVTAPRVALEFGGGARIDVDASGILLVLPAEDGSGQSLTLACLAFLPTNSAGLVIVGNMQ 464
Query: 315 QRDTRFVYDLNIDLLSFVKENCS 337
QR V+D++ + F CS
Sbjct: 465 QRAYNVVFDVDGGRIGFGPNGCS 487
>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 106/355 (29%), Positives = 159/355 (44%), Gaps = 41/355 (11%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
+R+ IG P ++LDTGS + + IFDP S+S+ I CD P C
Sbjct: 151 LRVGIGKPPSQAYVVLDTGSDVSWIQCAPCSECYQQSDPIFDPVSSNSYSPIRCDAPQCK 210
Query: 48 YF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
+C N C+Y + Y D S T G A ET+++ G A GC ++N G
Sbjct: 211 SLDLSECRNGTCLYEVSYGDGSYTVGEFATETVTL-----GTAAVENVAIGCGHNNEGLF 265
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS-SYLKFGTDMGY 163
A G LS F +Q+ + FSYCLV N + + S L+F + +
Sbjct: 266 VGAAGLLGLGGGKLS-----FPAQVNAT---SFSYCLV----NRDSDAVSTLEFNSPLP- 312
Query: 164 RRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
R T + + FYYL LK IS+ E + P F++ G GG IIDSG+ +T
Sbjct: 313 RNVVTAPLRRNPELDTFYYLGLKGISVGGEALPIPESIFEVDAIGGGGIIIDSGTAVTRL 372
Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYF-EDANLRI 281
S+VY L + FV + A CY L + + P+++F+F E L +
Sbjct: 373 RSEVYDALRDAFVKGAKGIPKANGVSL---FDTCYDLSSRESVQVPTVSFHFPEGRELPL 429
Query: 282 DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
N I F A AP ++++G+ QQ+ TR +D+ L+ F ++C
Sbjct: 430 PARNYLIPVDSVGTFCFAFAPTTSSLSIMGNVQQQGTRVGFDIANSLVGFSADSC 484
>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
Length = 474
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 96/365 (26%), Positives = 149/365 (40%), Gaps = 60/365 (16%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
+V + +GTP L +DTGS L + +FDP +SSS+ + C P
Sbjct: 141 VVTVSLGTPGVAQTLEVDTGSDLSWVQCTPCAAPACYSQKDPLFDPAQSSSYAAVPCGGP 200
Query: 45 DCTYF-----KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
C C QC Y + Y D S T G + +T+++ + F FGC +
Sbjct: 201 VCGGLGIYASSCSAAQCGYVVSYGDGSKTTGVYSSDTLTLSPNDAVRGFF----FGCGHA 256
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
GF + G+LGL R S + Q FSYC LP T+ YL G
Sbjct: 257 QSGFTGN------DGLLGLGREEASLVEQTAGTYGGVFSYC----LPTRPSTTGYLTLGG 306
Query: 160 DMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
G P T+ ++ PN +Y + L IS+ ++++ P F GG ++D+G
Sbjct: 307 PSGAAPPGFSTTQLLSSPNAATYYVVMLTGISVGGQQLSVPSSVF------AGGTVVDTG 360
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE- 275
+V+T Y L F S + + + CY F P++A F
Sbjct: 361 TVITRLPPTAYAALRSAFRSGMASYGYPS-APATGILDTCYNFSGYGTVTLPNVALTFSG 419
Query: 276 DANLRIDGENVFIIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFVYDLNIDLLS--F 331
A + + + + F LA AP D +A++G+ QQR +++ ID S F
Sbjct: 420 GATVTLGADGIL------SFGCLAFAPSGSDGGMAILGNVQQRS----FEVRIDGTSVGF 469
Query: 332 VKENC 336
+C
Sbjct: 470 KPSSC 474
>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
Length = 401
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 101/327 (30%), Positives = 145/327 (44%), Gaps = 47/327 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+V L IGTP + V L LDTGS LI+ FDP SS+ +CD C
Sbjct: 83 LVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTLC 142
Query: 47 TYFKCV---------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
N+ CVYT Y D+SVT GF + + +G G A G FGC
Sbjct: 143 QGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAG---ASVPGVAFGCG 199
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSS-YLK 156
N+G + G+ G R +S SQL FS+C NG S+ L
Sbjct: 200 LFNNGVFKSNE----TGIAGFGRGPLSLPSQLKV---GNFSHCFTA--VNGLKPSTVLLD 250
Query: 157 FGTDMGYR--RPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
D+ Y+ R + Q+T I +P N FYYLSLK I++ + R+ P F + +G GG
Sbjct: 251 LPADL-YKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALK-NGTGGT 308
Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRF-PSMA 271
IIDSG+ +T + VY + + F + + +L +S C P + P +
Sbjct: 309 IIDSGTAMTSLPTRVYRLVRDAFAA---QVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLV 365
Query: 272 FYFEDANLRIDGEN-VFIIDYENHFFL 297
+FE A + + EN V++ Y +
Sbjct: 366 LHFEGATMDLPRENYVWLKHYPKRLLI 392
>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
Length = 452
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 95/357 (26%), Positives = 160/357 (44%), Gaps = 43/357 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY-------------AIFDPRKSSSFQKINCDHPDCT 47
++++ GTP + + ++DTGS + + IFDP KSSS++ CD C
Sbjct: 116 IIQVDFGTPKQSMYTLIDTGSDVAWIPCKQCQGCHSTAPIFDPAKSSSYKPFACDSQPCQ 175
Query: 48 YFK--C-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
C N +C + + Y D + G A + I++ G FGC+
Sbjct: 176 EISGNCGGNSKCQFEVSYGDGTQVDGTLASDAITL-----GSQYLPNFSFGCA---ESLS 227
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
ED LG +++ + + FSYC LP+ +S L G +
Sbjct: 228 EDTSPSPGLMGLGGGSLSLLTQAPTAELFGGTFSYC----LPSSSTSSGSLVLGKEAAVS 283
Query: 165 RPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
S + T I P+ FY+++LK IS+ N R++ P ++ GG IIDSG+ +T+
Sbjct: 284 SSSLKFTTLIKDPSIPTFYFVTLKAISVGNTRISVP----GTNIASGGGTIIDSGTTITH 339
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCP-EPIQLCYFLPETFNRFPSMAFYFE-DANLR 280
Y L + F R QL+ L P E + CY L + P++ + + + +L
Sbjct: 340 LVPSAYTALRDAF-----RQQLSSLQPTPVEDMDTCYDLSSSSVDVPTITLHLDRNVDLV 394
Query: 281 IDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+ EN+ I E+ LA + D ++IG+ QQ++ R V+D+ + F +E C+
Sbjct: 395 LPKENILITQ-ESGLACLAFSSTDSR-SIIGNVQQQNWRIVFDVPNSQVGFAQEQCA 449
>gi|413944032|gb|AFW76681.1| hypothetical protein ZEAMMB73_606599 [Zea mays]
Length = 315
Score = 110 bits (276), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 94/324 (29%), Positives = 145/324 (44%), Gaps = 33/324 (10%)
Query: 33 SSSFQKINCDHPDC------TYFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGE 84
SS+F+ + C P C + C E QC Y Y D+S+T G +T + +
Sbjct: 2 SSTFKAVACPDPICRPSSGVSVSACAMENFQCFYLCSYGDRSITAGHIFKDTFTFMSPNG 61
Query: 85 GKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIP 144
FGC + N G +G+ G R S SQL RFSYCL +
Sbjct: 62 VPVAVSELAFGCGDYNTGLFVSNE----SGIAGFGRGPQSLPSQLKV---GRFSYCLTL- 113
Query: 145 LPNGEYTSSYLKFGTDM---GYRRPST---QATKFINHP--NNFYYLSLKDISIDNERMN 196
E SS + GT G R +T Q+T I +P FYYLSL+ I++ R+
Sbjct: 114 --VTESKSSVVILGTPPDPDGLRAHTTGPFQSTPIIYNPLIPTFYYLSLEGITVGKTRLP 171
Query: 197 FPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-PIQ 255
F F + G GG +IDSG+ LT V+ L E+ V+ +F L + + PE +
Sbjct: 172 FDKSVFALKKDGSGGTVIDSGTSLTTLPEAVFELLQEELVA---QFPLPRYDNTPEVGDR 228
Query: 256 LCYFLPETFNR--FPSMAFYFEDANLRIDGENVFIIDYENHFFLLAV-APHDDLVALIGS 312
LC+ P+ + P + + A++ + +N F+ + ++ L + D + LIG+
Sbjct: 229 LCFRRPKGGKQVPVPKLILHLAGADMDLPRDNYFVEEPDSGVMCLQINGAEDTTMVLIGN 288
Query: 313 QQQRDTRFVYDLNIDLLSFVKENC 336
QQ++ VYD+ + L F C
Sbjct: 289 FQQQNMHVVYDVENNKLLFAPAQC 312
>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 448
Score = 110 bits (276), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 94/356 (26%), Positives = 157/356 (44%), Gaps = 37/356 (10%)
Query: 6 IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYFKC 51
+GTP LL++DTGS +++ ++DPR SS++ + C P C +
Sbjct: 105 VGTPPTPALLVIDTGSDVVWLQCKPCVHCYRQLSPLYDPRGSSTYAQTPCSPPQCRNPQT 164
Query: 52 VNEQ---CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDAR 108
+ C Y + Y D S T G A + + GC +DN G
Sbjct: 165 CDGTTGGCGYRIVYGDASSTSGNLATDRLVF----SNDTSVGNVTLGCGHDNEGLF---- 216
Query: 109 DGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPST 168
G+ AG+LG++R SF +Q+ + F+YCL +G +SSYL FG PS+
Sbjct: 217 -GSAAGLLGVARGNNSFATQVADSYGRYFAYCLGDRTRSGS-SSSYLVFGR-TAPEPPSS 273
Query: 169 QATKFINHPN--NFYYLSLKDISIDNERM-NFPPDTFDI-TVSGEGGCIIDSGSVLTYFH 224
T ++P + YY+ + S+ E + F + + +G GG ++DSG+ +T F
Sbjct: 274 VFTPLRSNPRRPSLYYVDMVGFSVGGEPVTGFSNASLSLDPATGRGGVVVDSGTSITRFA 333
Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPE-TFNRFPSMAFYFE-DANLRID 282
D Y L + F + + + ++ CY L P + +F A++ +
Sbjct: 334 RDAYGALRDAFDARAAKVGMRKVGRGISVFDACYDLRGVAVADAPGVVLHFAGGADVALP 393
Query: 283 GENVFIIDYEN--HFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
EN + + H F L A HD L ++IG+ Q+ R V+D+ + + F C
Sbjct: 394 PENYLVPEESGRYHCFALEAAGHDGL-SVIGNVLQQRFRVVFDVENERVGFEPNGC 448
>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 489
Score = 110 bits (276), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 101/363 (27%), Positives = 157/363 (43%), Gaps = 61/363 (16%)
Query: 11 KGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT--------- 47
K + LI+DTGS L + ++DP SSS++ + C+ C
Sbjct: 147 KNMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATGNS 206
Query: 48 -----YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
+ V C Y + Y D S T+G A E+I + G +FGC +N G
Sbjct: 207 GPCGGFNGVVKTTCEYVVSYGDGSYTRGDLASESIVL-----GDTKLENLVFGCGRNNKG 261
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
G +G++GL R ++S +SQ FSYCL L +G S L FG D
Sbjct: 262 LF-----GGASGLMGLGRSSVSLVSQTLKTFNGVFSYCLP-SLEDG--ASGTLSFGNDFS 313
Query: 163 YRRPSTQA--TKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
+ ST T + +P +FY L+L SI + T+S G +IDSG+
Sbjct: 314 VYKNSTSVFYTPLVQNPQLRSFYILNLTGASIGGVELK--------TLSFGRGILIDSGT 365
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFE-D 276
V+T +Y + +F+ F F A + C+ L + P++ FE +
Sbjct: 366 VITRLPPSIYKAVKTEFLKQFSGFPSAPGYSI---LDTCFNLTSYEDISIPTIKMIFEGN 422
Query: 277 ANLRIDGENVF-IIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
A L +D VF + + LA+A +++ V +IG+ QQ++ R +YD + L
Sbjct: 423 AELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIAG 482
Query: 334 ENC 336
ENC
Sbjct: 483 ENC 485
>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
Length = 543
Score = 110 bits (276), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 98/366 (26%), Positives = 154/366 (42%), Gaps = 58/366 (15%)
Query: 7 GTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC------ 46
G+P+ + +I+DTGS L + +FDP S+++ + C+ C
Sbjct: 197 GSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRCNASACAASLKA 256
Query: 47 ---TYFKCV--NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
T C NE+C Y + Y D S ++G A +T+++ G A G +FGC N
Sbjct: 257 ATGTPGSCGGGNERCYYALAYGDGSFSRGVLATDTVAL-----GGASLDGFVFGCGLSNR 311
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
G G AG++GL R +S +SQ FSYCL P S L G D
Sbjct: 312 GLF-----GGTAGLMGLGRTELSLVSQTALRYGGVFSYCL--PATTSGDASGSLSLGGDA 364
Query: 162 GYRRPSTQA--TKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
R +T T+ I P FY+L++ ++ + G +IDSG
Sbjct: 365 SSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALA-------AQGLGASNVLIDSG 417
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPETFN-RFPSMAFYF 274
+V+T VY + +F +F A P + CY L + P +
Sbjct: 418 TVITRLAPSVYRGVRAEFT---RQFAAAGYPTAPGFSILDTCYDLTGHDEVKVPLLTLRL 474
Query: 275 ED-ANLRIDGENV-FIIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFVYDLNIDLLS 330
E A + +D + F++ + LA+A ++D +IG+ QQ++ R VYD L
Sbjct: 475 EGGAEVTVDAAGMLFVVRKDGSQVCLAMASLSYEDQTPIIGNYQQKNKRVVYDTVGSRLG 534
Query: 331 FVKENC 336
F E+C
Sbjct: 535 FADEDC 540
>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 431
Score = 110 bits (276), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 92/367 (25%), Positives = 162/367 (44%), Gaps = 45/367 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ L IGTP + + DTGS L + ++DP SS+F + C C
Sbjct: 78 LMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPVPCSSATC 137
Query: 47 TYF----KCVNEQ--CVYTMKYADQSVTKGFAAHETISVIGKGEGKAI-FHGALFGCSND 99
C C Y Y+D + + G ET+++ G+A+ FGC D
Sbjct: 138 LPVLRSRNCSTPSSLCRYGYSYSDGAYSAGILGTETLTLGSSVPGQAVSVSDVAFGCGTD 197
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
N G ++ G +GL R T+S ++QLG +FSYCL N S +L GT
Sbjct: 198 NGGDSLNS-----TGTVGLGRGTLSLLAQLG---VGKFSYCLT-DFFNSTLDSPFL-LGT 247
Query: 160 --DMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
++ + Q+T + P N Y +SL+ I++ + R+ P TFD+ + GG ++D
Sbjct: 248 LAELAPGPGAVQSTPLLQSPLNPSRYVVSLQGITLGDVRLPIPNKTFDLHANSTGGMVVD 307
Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQL-AQLSDCPEPIQLCYFLPETFNRFPSMA--- 271
SG+ + + + + + + A D P C+ P + P M
Sbjct: 308 SGTTFSILPESGFRVVVDHVAQVLGQPPVNASSLDSP-----CFPAPAGERQLPFMPDLV 362
Query: 272 -FYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
+ A++R+ +N + E+ F L + +++G+ QQ++ + ++D+ + LS
Sbjct: 363 LHFAGGADMRLHRDNYMSYNQEDSSFCLNIVGTTSTWSMLGNFQQQNIQMLFDMTVGQLS 422
Query: 331 FVKENCS 337
F+ +CS
Sbjct: 423 FLPTDCS 429
>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 431
Score = 110 bits (276), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 99/356 (27%), Positives = 163/356 (45%), Gaps = 44/356 (12%)
Query: 6 IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYFK- 50
+GTP V I+DT S +I+ +FDP S +++ + C C +
Sbjct: 94 LGTPPFPVYGIVDTASDIIWVQCQLCETCYNDTSPMFDPSYSKTYKNLPCSSTTCKSVQG 153
Query: 51 --CVNEQ---CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC-SNDNHGFD 104
C +++ C +T+ Y D S ++G ET+++ + F + GC N N FD
Sbjct: 154 TSCSSDERKICEHTVNYKDGSHSQGDLIVETVTLGSYNDPFVHFPRTVIGCIRNTNVSFD 213
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
G++GL +S + QL S I K+FSYCL P+ + SS LKFG
Sbjct: 214 S-------IGIVGLGGGPVSLVPQLSSSISKKFSYCLA-PISD---RSSKLKFGDAAMVS 262
Query: 165 RPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
T +T+ + FYYL+L+ S+ N R+ F + SG+G IIDSG+ T
Sbjct: 263 GDGTVSTRIVFKDWKKFYYLTLEAFSVGNNRIEF--RSSSSRSSGKGNIIIDSGTTFTVL 320
Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR--FPSMAFYFEDANLRI 281
DVY KL + +L + D + LCY T+++ P + +F A++++
Sbjct: 321 PDDVYSKLESAVA---DVVKLERAEDPLKQFSLCY--KSTYDKVDVPVITAHFSGADVKL 375
Query: 282 DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+ N FI+ L ++ A+ G+ Q++ YDL ++SF +C+
Sbjct: 376 NALNTFIVASHRVVCLAFLSSQSG--AIFGNLAQQNFLVGYDLQRKIVSFKPTDCT 429
>gi|356499109|ref|XP_003518386.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 428
Score = 110 bits (276), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 97/374 (25%), Positives = 165/374 (44%), Gaps = 57/374 (15%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY----------AIFDPRKSSSFQKINCDHPDCTY--- 48
V L +G+P + V ++LDTGS L + + F+P SSS+ C+ CT
Sbjct: 62 VSLTVGSPPQNVTMVLDTGSELSWLHCKKLPNLNSTFNPLLSSSYTPTPCNSSICTTRTR 121
Query: 49 -----FKC--VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
C N+ C + YAD S +G A ET S+ G + G LFGC D+
Sbjct: 122 DLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSLAGAAQ-----PGTLFGCM-DSA 175
Query: 102 GFDEDA-RDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL-------VIPLPNGEYTSS 153
G+ D D G++G++R ++S ++Q+ +FSYC+ V+ L +G S
Sbjct: 176 GYTSDINEDSKTTGLMGMNRGSLSLVTQMS---LPKFSYCISGEDALGVLLLGDGTDAPS 232
Query: 154 YLKFGTDMGYRRPSTQATKFINHPNNF-YYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
L++ P AT + N Y + L+ I + + + P F +G G
Sbjct: 233 PLQY-------TPLVTATTSSPYFNRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTGAGQT 285
Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-----PIQLCYFLPETFNRF 267
++DSG+ T+ VY L ++F+ + L ++ D P + LCY P +F
Sbjct: 286 MVDSGTQFTFLLGSVYSSLKDEFLEQTKGV-LTRIED-PNFVFEGAMDLCYHAPASFAAV 343
Query: 268 PSMAFYFEDANLRIDGEN-VFIIDYENHFFLLAVAPHDDLVAL----IGSQQQRDTRFVY 322
P++ F A +R+ GE ++ + + + + DL+ + IG Q++ +
Sbjct: 344 PAVTLVFSGAEMRVSGERLLYRVSKGSDWVYCFTFGNSDLLGIEAYVIGHHHQQNVWMEF 403
Query: 323 DLNIDLLSFVKENC 336
DL + F + C
Sbjct: 404 DLLKSRVGFTQTTC 417
>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 110 bits (276), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 89/357 (24%), Positives = 161/357 (45%), Gaps = 41/357 (11%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
VR+ +G+P + +++D+GS +++ +FDP S+SF ++C C
Sbjct: 45 VRIGVGSPPRSQYMVIDSGSDIVWVQCKPCTQCYHQTDPLFDPADSASFMGVSCSSAVCD 104
Query: 48 YF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
C + +C Y + Y D S TKG A ET+++ G+ + GC + N G
Sbjct: 105 QVDNAGCNSGRCRYEVSYGDGSSTKGTLALETLTL-----GRTVVQNVAIGCGHMNQGMF 159
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
A ++SF+ QL FSYCLV + N ++ +L+FG++
Sbjct: 160 VGAAGLLGL-----GGGSMSFVGQLSRERGNAFSYCLVSRVTN---SNGFLEFGSEA--- 208
Query: 165 RPSTQA-TKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
P A I +P++ +YY+ L + + + ++ D F++T G GG ++D+G+ +T
Sbjct: 209 MPVGAAWIPLIRNPHSPSYYYIGLSGLGVGDMKVPISEDIFELTELGNGGVVMDTGTAVT 268
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFEDANLR 280
F + Y + F+ + L + S CY L + R P+++FYF +
Sbjct: 269 RFPTVAYEAFRDAFID--QTGNLPRASGV-SIFDTCYNLFGFLSVRVPTVSFYFSGGPIL 325
Query: 281 IDGENVFIIDYENH-FFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
N F+I ++ F A AP ++++G+ QQ + D + + F C
Sbjct: 326 TLPANNFLIPVDDAGTFCFAFAPSPSGLSILGNIQQEGIQISVDGANEFVGFGPNVC 382
>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 392
Score = 110 bits (275), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 107/373 (28%), Positives = 157/373 (42%), Gaps = 50/373 (13%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
V +GTP + LI+DTGS L + ++ P SS+F + CD +C
Sbjct: 36 VDFSLGTPEQKFHLIVDTGSDLAFVQCAPCDLCYEQDGPLYQPSNSSTFTPVPCDSAECL 95
Query: 48 YFK------CVNE--------QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL 93
C + C Y +Y D S T G A+ET +V G + H A
Sbjct: 96 LIPAPVGAPCSSSYPESPPQGACSYEYRYGDNSSTVGVFAYETATV----GGIRVNHVA- 150
Query: 94 FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPL-PNGEYTS 152
FGC N N G A GVLGL + +SF SQ G + +F+YCL L P ++S
Sbjct: 151 FGCGNRNQGSFVSA-----GGVLGLGQGALSFTSQAGYAFENKFAYCLTSYLSPTSVFSS 205
Query: 153 SYLKFGTDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
L FG DM Q T +++P N YY+ + I E + P + I G G
Sbjct: 206 --LIFGDDMMSTIHDLQFTPLVSNPLNPSVYYVQIVRICFGGETLLIPDSAWKIDSVGNG 263
Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFER-FQLAQLSDCPEPIQLCYFLPETFNR-FP 268
G I DSG+ +TY+ Y ++ ++ FE+ + P+ + LC + + +P
Sbjct: 264 GTIFDSGTTVTYWSPQAYARI----IAAFEKSVPYPRAPPSPQGLPLCVNVSGIDHPIYP 319
Query: 269 SMAFYF-EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNID 327
S F + A R + N FI N L + D +IG+ Q++ YD
Sbjct: 320 SFTIEFDQGATYRPNQGNYFIEVSPNIDCLAMLESSSDGFNVIGNIIQQNYLVQYDREEH 379
Query: 328 LLSFVKENCSDDS 340
+ F NC S
Sbjct: 380 RIGFAHANCDAPS 392
>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
Length = 452
Score = 110 bits (275), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 96/357 (26%), Positives = 160/357 (44%), Gaps = 43/357 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY-------------AIFDPRKSSSFQKINCDHPDCT 47
++++ GTP + + ++DTGS + + IFDP KSSS++ CD C
Sbjct: 116 IIQVDFGTPKQSMYTLIDTGSDVAWIPCKQCQGCHSTAPIFDPAKSSSYKPFACDSQPCQ 175
Query: 48 YFK--CV-NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
C N +C + + Y D + G A + I++ G FGC+
Sbjct: 176 EISGNCGGNSKCQFEVLYGDGTQVDGTLASDAITL-----GSQYLPNFSFGCA---ESLS 227
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
ED LG +++ + + FSYCL P+ +S L G +
Sbjct: 228 EDTYSSPGLMGLGGGSLSLLTQAPTAELFGGTFSYCL----PSSSTSSGSLVLGKEAAVS 283
Query: 165 RPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
S + T I P+ FY+++LK IS+ N R++ P ++ GG IIDSG+ +TY
Sbjct: 284 SSSLKFTTLIKDPSFPTFYFVTLKAISVGNTRISVPA----TNIASGGGTIIDSGTTITY 339
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCP-EPIQLCYFLPETFNRFPSMAFYFE-DANLR 280
Y L + F R QL+ L P E + CY L + P++ + + + +L
Sbjct: 340 LVPSAYKDLRDAF-----RQQLSSLQPTPVEDMDTCYDLSSSSVDVPTITLHLDRNVDLV 394
Query: 281 IDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+ EN+ I E+ LA + D ++IG+ QQ++ R V+D+ + F +E C+
Sbjct: 395 LPKENILITQ-ESGLSCLAFSSTDSR-SIIGNVQQQNWRIVFDVPNSQVGFAQEQCA 449
>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 496
Score = 110 bits (275), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 105/370 (28%), Positives = 166/370 (44%), Gaps = 61/370 (16%)
Query: 6 IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYFK- 50
+G + LI+DTGS L + +F+P SSSF + C+ P C +
Sbjct: 149 VGIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQP 208
Query: 51 -------CVNEQ---CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
C N+ C Y + Y D S ++G E +++ GK +FGC +N
Sbjct: 209 TAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTL-----GKTEIDNFIFGCGRNN 263
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
G G +G++GL+R +S +SQ S+ FSYCL P + S G D
Sbjct: 264 KGLF-----GGASGLMGLARSELSLVSQTSSLFGSVFSYCL--PTTGVGSSGSLTLGGAD 316
Query: 161 MG-YRRPSTQA-TKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGG-CIID 215
++ S + T+ I +P +NFY+L+L ISI +N P + S EG ++D
Sbjct: 317 FSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLS-----SNEGVLSLLD 371
Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLP--ETFNRFPSMA 271
SG+V+T +Y + F + FE+ Q + P + C+ L E N P++
Sbjct: 372 SGTVITRLSPSIY----KAFKAEFEK-QFSGYRTTPGFSILNTCFNLTGYEEVN-IPTVK 425
Query: 272 FYFE-DANLRIDGENVFII---DYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNID 327
F FE +A + +D E VF D A ++D +IG+ QQ++ R +Y+
Sbjct: 426 FIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKES 485
Query: 328 LLSFVKENCS 337
+ F E CS
Sbjct: 486 KVGFAGEPCS 495
>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
Length = 473
Score = 110 bits (275), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 101/361 (27%), Positives = 149/361 (41%), Gaps = 49/361 (13%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI---------------FDPRKSSSFQKINCDHPDC 46
V + +GTP K LI DTGS L + DP KS+S++ I+C C
Sbjct: 135 VTVGLGTPKKEFTLIFDTGSDLTWTQCEPCAKTCYKQKEPRLDPTKSTSYKNISCSSAFC 194
Query: 47 TYF------KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
C + C+Y ++Y D S + GF A ET+++ +F LFGC N
Sbjct: 195 KLLDTEGGESCSSPTCLYQVQYGDGSYSIGFFATETLTL----SSSNVFKNFLFGCGQQN 250
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
G A AG+LGL R +S SQ KK FSYC LP + YL FG
Sbjct: 251 SGLFRGA-----AGLLGLGRTKLSLPSQTAQKYKKLFSYC----LPASSSSKGYLSFGGQ 301
Query: 161 MGYRRPSTQATK-FINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
+ T ++ F + P FY L + ++S+ ++ + D ++ G +IDSG+V
Sbjct: 302 VSKTVKFTPLSEDFKSTP--FYGLDITELSVGGNKL-----SIDASIFSTSGTVIDSGTV 354
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFEDA- 277
+T S Y L F + +D CY F + P + F+
Sbjct: 355 ITRLPSTAYSALSSAFQKLMTDY---PSTDGYSIFDTCYDFSKNETIKIPKVGVSFKGGV 411
Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLV--ALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
+ ID + LA A + D V A+ G+ QQ+ + VYD + F
Sbjct: 412 EMDIDVSGILYPVNGLKKVCLAFAGNGDDVKAAIFGNTQQKTYQVVYDDAKGRVGFAPSG 471
Query: 336 C 336
C
Sbjct: 472 C 472
>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 417
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 105/370 (28%), Positives = 166/370 (44%), Gaps = 61/370 (16%)
Query: 6 IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYFK- 50
+G + LI+DTGS L + +F+P SSSF + C+ P C +
Sbjct: 70 VGIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQP 129
Query: 51 -------CVNEQ---CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
C N+ C Y + Y D S ++G E +++ GK +FGC +N
Sbjct: 130 TAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTL-----GKTEIDNFIFGCGRNN 184
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
G G +G++GL+R +S +SQ S+ FSYCL P + S G D
Sbjct: 185 KGLF-----GGASGLMGLARSELSLVSQTSSLFGSVFSYCL--PTTGVGSSGSLTLGGAD 237
Query: 161 MG-YRRPSTQA-TKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGG-CIID 215
++ S + T+ I +P +NFY+L+L ISI +N P + S EG ++D
Sbjct: 238 FSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLS-----SNEGVLSLLD 292
Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLP--ETFNRFPSMA 271
SG+V+T +Y + F + FE+ Q + P + C+ L E N P++
Sbjct: 293 SGTVITRLSPSIY----KAFKAEFEK-QFSGYRTTPGFSILNTCFNLTGYEEVN-IPTVK 346
Query: 272 FYFE-DANLRIDGENVFII---DYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNID 327
F FE +A + +D E VF D A ++D +IG+ QQ++ R +Y+
Sbjct: 347 FIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKES 406
Query: 328 LLSFVKENCS 337
+ F E CS
Sbjct: 407 KVGFAGEPCS 416
>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 446
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 99/360 (27%), Positives = 167/360 (46%), Gaps = 44/360 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+V L IG PS L+++DTGS +++ +FDP SS+F + C P C
Sbjct: 102 LVNLSIGQPSIPQLVVMDTGSDILWIMCNPCTNCDNHLGLLFDPSMSSTFSPL-CKTP-C 159
Query: 47 TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
+ C + +T+ Y D S G + + EG + + GC + N GF+ D
Sbjct: 160 GFKGCKCDPIPFTISYVDNSSASGTFGRDILVFETTDEGTSQISDVIIGCGH-NIGFNSD 218
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV-IPLPNGEYTSSYLKFGTDM-GYR 164
G+LGL+ S +Q+G ++FSYC+ + P Y L G D+ GY
Sbjct: 219 P---GYNGILGLNNGPNSLATQIG----RKFSYCIGNLADPYYNYNQLRLGEGADLEGYS 271
Query: 165 RPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
P + H FYY++++ IS+ +R++ +TF++ +G GG I+DSG+ +TY
Sbjct: 272 TPFE-----VYH--GFYYVTMEGISVGEKRLDIALETFEMKRNGTGGVILDSGTTITYLV 324
Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF--LPETFNRFPSMAFYFED-ANLRI 281
+ L+ + V ++ Q+ P +LCY+ + FP + F+F D A+L +
Sbjct: 325 DSAHKLLYNE-VRNLLKWSFRQVIFENAPWKLCYYGIISRDLVGFPVVTFHFVDGADLAL 383
Query: 282 DGENVFIIDYENHFFLLAVAPHDDL-----VALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
D + F + F + V+P L ++IG Q+ YDL + F + +C
Sbjct: 384 DTGSFF--SQRDDIFCMTVSPASILNTTISPSVIGLLAQQSYNVGYDLVNQFVYFQRIDC 441
>gi|357114697|ref|XP_003559132.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 416
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 91/321 (28%), Positives = 143/321 (44%), Gaps = 26/321 (8%)
Query: 27 IFDPRKSSSFQKINCDHPDCTY-FKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKG-E 84
+F P S +F ++ + P CT ++ C + +A G+ + +T + G
Sbjct: 110 LFSPAASPTFHGVHSNDPVCTAPYRPTANGCSFRFPFAS-----GYLSRDTFHLRNGGLS 164
Query: 85 GKAIFH---GALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL 141
G A G +FGC++ GF D G L GVL LS + +S ++QL + RFSYCL
Sbjct: 165 GGAPIESVPGIMFGCAHSVAGFHND---GTLGGVLSLSHLRLSLLTQLSARAGGRFSYCL 221
Query: 142 VIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNNF--YYLSLKDISIDNERMNFPP 199
P P +L+ G D+ P + T + YYLSL I++ +R+ P
Sbjct: 222 --PKPTQGNPHGFLRLGADVLPPLPHSHMTALTVRSGSAPDYYLSLVGITLAEKRLRIDP 279
Query: 200 DTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF 259
F +G GGC I+ + +T Y + V+Y + ++ P +F
Sbjct: 280 RVF---AAGRGGCSINPAATITAIMEPAYLVVERALVAYMKELGSDRVKKGPPGGGALFF 336
Query: 260 ---LPETFNRFPSMAFYFED-ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQ 315
R PSMAF+F+D A L E +F + +F++ + V IG+ QQ
Sbjct: 337 DRMYKSVQARLPSMAFHFKDGAELWFTPEQLFEVHGMVAWFMMVGKGYRRTV--IGAPQQ 394
Query: 316 RDTRFVYDLNIDLLSFVKENC 336
+TRF +D+ LSF E C
Sbjct: 395 VNTRFTFDVAAGRLSFASELC 415
>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 437
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 97/360 (26%), Positives = 170/360 (47%), Gaps = 38/360 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ + IGTP ++ I DTGS L++ +FDP+ SS+++ ++C C
Sbjct: 91 LMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCSSSQC 150
Query: 47 TYFK----CV--NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
T + C + C Y++ Y D S TKG A +T+++ + GC ++N
Sbjct: 151 TALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIGCGHNN 210
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
G + +G++GL +S I QLG I +FSYCLV PL + + +S + FGT+
Sbjct: 211 AG----TFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLV-PLTSKKDQTSKINFGTN 265
Query: 161 MGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
+T I + FYYL+LK IS+ ++++ + + + S EG IIDSG+
Sbjct: 266 AIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQY---SGSDSESSEGNIIIDSGT 322
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDAN 278
LT ++ Y +L + S + + D + LCY + P + +F+ A+
Sbjct: 323 TLTLLPTEFYSELEDAVASSIDA---EKKQDPQSGLSLCYSATGDL-KVPVITMHFDGAD 378
Query: 279 LRIDGENVFIIDYENHF-FLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+++D N F+ E+ F +P ++ G+ Q + YD +SF +C+
Sbjct: 379 VKLDSSNAFVQVSEDLVCFAFRGSPS---FSIYGNVAQMNFLVGYDTVSKTVSFKPTDCA 435
>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
Length = 505
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 105/372 (28%), Positives = 160/372 (43%), Gaps = 64/372 (17%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
+V + G+P++ L +DTGS + + +FDP KS+++ + C HP
Sbjct: 162 VVTVGFGSPAQNYTLSIDTGSDVSWIQCLPCSGHCYKQHDPVFDPTKSATYSAVPCGHPQ 221
Query: 46 CTYF--KCVNE-QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C KC N C+Y + Y D S T G +HET+S+ + G FGC N G
Sbjct: 222 CAAAGGKCSNSGTCLYKVTYGDGSSTAGVLSHETLSLSSTRD----LPGFAFGCGQTNLG 277
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
G + G++GL R +S SQ + FSYC LP+ + T YL MG
Sbjct: 278 -----EFGGVDGLVGLGRGALSLPSQAAATFGATFSYC----LPSYDTTHGYLT----MG 324
Query: 163 YRRPST-------QATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
P+ Q T I + + Y++ + I I + PP F G +
Sbjct: 325 STTPAASNDDDDVQYTAMIQKEDYPSLYFVEVVSIDIGGYILPVPPTVFT-----RDGTL 379
Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCY-FLPETFNRFPSM 270
DSG++LTY + Y L ++F +F + Q P +P CY F P++
Sbjct: 380 FDSGTILTYLPPEAYASLRDRF-----KFTMTQYKPAPAYDPFDTCYDFTGHNAIFMPAV 434
Query: 271 AFYFEDANLRIDGENVFIIDYENHFF----LLAVAPHDDLVA--LIGSQQQRDTRFVYDL 324
AF F D + D V I+ Y + LA P + +IG+ QQR T +YD+
Sbjct: 435 AFKFSDGAV-FDLSPVAILIYPDDTAPATGCLAFVPRPSTMPFNIIGNTQQRGTEVIYDV 493
Query: 325 NIDLLSFVKENC 336
+ + F + C
Sbjct: 494 AAEKIGFGQFTC 505
>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
gi|224030447|gb|ACN34299.1| unknown [Zea mays]
gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
Length = 512
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 108/380 (28%), Positives = 169/380 (44%), Gaps = 57/380 (15%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
++ +++GTP + +I+DTGS L + +FDP SSS++ + C P C
Sbjct: 147 LMDVYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASSSYRNLTCGDPRC 206
Query: 47 TYF------------KCVNEQCVYTMKYADQSVTKGFAAHE--TISVIGKGEGKAIFHGA 92
+ + + C Y Y DQS + G A E T+++ G + G
Sbjct: 207 GHVAPPEAPAPRACRRPGEDPCPYYYWYGDQSNSTGDLALESFTVNLTAPGASSRV-DGV 265
Query: 93 LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKR-FSYCLVIPLPNGEYT 151
+FGC + N G AG+LGL R +SF SQL ++ FSYCLV +G
Sbjct: 266 VFGCGHRNRGLFH-----GAAGLLGLGRGPLSFASQLRAVYGGHTFSYCLV---DHGSDV 317
Query: 152 SSYLKFGTDMGYR---RPSTQATKFI---NHPNNFYYLSLKDISIDNERMNFPPDTFDIT 205
+S + FG D P + T F + + FYY+ L + + E +N DT+D +
Sbjct: 318 ASKVVFGEDDALALAAHPRLKYTAFAPASSPADTFYYVRLTGVLVGGELLNISSDTWDAS 377
Query: 206 VSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL----- 260
G GG IIDSG+ L+YF Y + F+ + D P + CY +
Sbjct: 378 EGGSGGTIIDSGTTLSYFVEPAYQVIRRAFIDRMSG-SYPPVPDFPV-LSPCYNVSGVER 435
Query: 261 PETFNRFPSMAFYFED-ANLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDT 318
PE P ++ F D A EN FI +D + L + +++IG+ QQ++
Sbjct: 436 PEV----PELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTGMSIIGNFQQQNF 491
Query: 319 RFVYDLNIDLLSFVKENCSD 338
YDL+ + L F C++
Sbjct: 492 HVAYDLHNNRLGFAPRRCAE 511
>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
Length = 438
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 97/360 (26%), Positives = 170/360 (47%), Gaps = 38/360 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ + IGTP ++ I DTGS L++ +FDP+ SS+++ ++C C
Sbjct: 91 LMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCSSSQC 150
Query: 47 TYFK----CV--NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
T + C + C Y++ Y D S TKG A +T+++ + GC ++N
Sbjct: 151 TALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIGCGHNN 210
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
G + +G++GL +S I QLG I +FSYCLV PL + + +S + FGT+
Sbjct: 211 AG----TFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLV-PLTSKKDQTSKINFGTN 265
Query: 161 MGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
+T I + FYYL+LK IS+ ++++ + + + S EG IIDSG+
Sbjct: 266 AIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQY---SGSDSESSEGNIIIDSGT 322
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDAN 278
LT ++ Y +L + S + + D + LCY + P + +F+ A+
Sbjct: 323 TLTLLPTEFYSELEDAVASSIDA---EKKQDPQSGLSLCYSATGDL-KVPVITMHFDGAD 378
Query: 279 LRIDGENVFIIDYENHF-FLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+++D N F+ E+ F +P ++ G+ Q + YD +SF +C+
Sbjct: 379 VKLDSSNAFVQVSEDLVCFAFRGSPS---FSIYGNVAQMNFLVGYDTVSKTVSFKPTDCA 435
>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
gi|194704078|gb|ACF86123.1| unknown [Zea mays]
gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 471
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 92/364 (25%), Positives = 159/364 (43%), Gaps = 54/364 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
+ +L +GTPS +++DTGS+L + +FDPR SS++ + C
Sbjct: 135 VTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDPRASSTYTSVRCSASQ 194
Query: 46 CTYFKCV---------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
C + + C+Y Y D S + G+ + +T+S G + +GC
Sbjct: 195 CDELQAATLNPSACSASNVCIYQASYGDSSFSVGYLSTDTVSF-----GSTSYPSFYYGC 249
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
DN G G AG++GL+R +S + QL + FSYCL G YL
Sbjct: 250 GQDNEGLF-----GRSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPTAASTG-----YLS 299
Query: 157 FGT-DMG-YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
G + G Y + A+ ++ + Y+++L +S+ + P + + II
Sbjct: 300 IGPYNTGHYYSYTPMASSSLD--ASLYFITLSGMSVGGSPLAVSPSEYSSLPT-----II 352
Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYF 274
DSG+V+T + V+ L + + AQ + + C+ + R P++ F
Sbjct: 353 DSGTVITRLPTAVHTALSKAVA---QAMAGAQRAPAFSILDTCFEGQASQLRVPTVVMAF 409
Query: 275 E-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
A++++ NV +ID ++ LA AP D A+IG+ QQ+ +YD+ + F
Sbjct: 410 AGGASMKLTTRNV-LIDVDDSTTCLAFAPTDS-TAIIGNTQQQTFSVIYDVAQSRIGFSA 467
Query: 334 ENCS 337
CS
Sbjct: 468 GGCS 471
>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
Length = 439
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 101/362 (27%), Positives = 157/362 (43%), Gaps = 42/362 (11%)
Query: 6 IGTPSKGVLLILDTGSALIYA-------------IFDPRKSSSFQKINCDHPDC------ 46
+GTP+K +++DTGS L + +F +S SF+ + C C
Sbjct: 90 VGTPAKKFRVVVDTGSELTWVNCRYRARGKDNRRVFRADESKSFKTVGCLTQTCKVDLMN 149
Query: 47 ----TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGK-AIFHGALFGCSNDNH 101
T + C Y +YAD S +G A ETI+V G G+ A G L GCS+
Sbjct: 150 LFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITV-GLTNGRMARLPGHLIGCSSSFT 208
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
G DG VLGL+ SF S S+ +FSYCLV L N + S+YL FG+
Sbjct: 209 GQSFQGADG----VLGLAFSDFSFTSTATSLYGAKFSYCLVDHLSN-KNVSNYLIFGSSR 263
Query: 162 GYRRPSTQATKF-INHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
+ + T + FY +++ IS+ + ++ P +D T SG GG I+DSG+ L
Sbjct: 264 STKTAFRRTTPLDLTRIPPFYAINVIGISLGYDMLDIPSQVWDAT-SG-GGTILDSGTSL 321
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCYFLPETFN--RFPSMAFYFED 276
T Y ++ Y + + PE PI+ C+ FN + P + F+ +
Sbjct: 322 TLLADAAYKQVVTGLARYLVELKRVK----PEGVPIEYCFSFTSGFNVSKLPQLTFHLKG 377
Query: 277 ANLRIDGENVFIIDYENHFFLLA-VAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
+++D L V+ +IG+ Q++ + +DL LSF
Sbjct: 378 GARFEPHRKSYLVDAAPGVKCLGFVSAGTPATNVIGNIMQQNYLWEFDLMASTLSFAPSA 437
Query: 336 CS 337
C+
Sbjct: 438 CT 439
>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 98/361 (27%), Positives = 153/361 (42%), Gaps = 44/361 (12%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
RL +GTP++ V ++LDTGS +++ IFDPRKS ++ I C P C
Sbjct: 144 TRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPHCR 203
Query: 48 YFKCVN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
+ C+Y + Y D S T G + ET++ + G GC +DN G
Sbjct: 204 RLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFR-----RNRVKGVALGCGHDNEG 258
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
A + +SF Q G ++FSYCLV + + +S + FG
Sbjct: 259 LFVGAAGLLGL-----GKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSS--VVFGNAAV 311
Query: 163 YRRPSTQATKFINHP--NNFYYLSLKDISIDNERM-NFPPDTFDITVSGEGGCIIDSGSV 219
R + T +++P + FYY+ L IS+ R+ F + G GG IIDSG+
Sbjct: 312 SR--IARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTS 369
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCYFLPETFN-RFPSMAFYFED 276
+T Y + + F R L P+ C+ L + P++ +F
Sbjct: 370 VTRLIRPAYIAMRDAF-----RVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRG 424
Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
A++ + N I N F A A +++IG+ QQ+ R VYDL + F C
Sbjct: 425 ADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484
Query: 337 S 337
+
Sbjct: 485 A 485
>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
Length = 500
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 109/373 (29%), Positives = 157/373 (42%), Gaps = 61/373 (16%)
Query: 3 RLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTY 48
++ +GTP L++LDTGS +++ +FDPR S S+ ++C P C
Sbjct: 150 KIGVGTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDPRASHSYGAVDCAAPLCRR 209
Query: 49 FKC-----VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
+ C+Y + Y D SVT G A ET++ G + AL GC +DN G
Sbjct: 210 LDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFA---SGARVPRVAL-GCGHDNEGL 265
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV---IPLPNGEYTSSYLKFGTD 160
A R ++SF SQ+ + FSYCLV + SS + FG+
Sbjct: 266 FVAAAGLLGL-----GRGSLSFPSQISRRFGRSFSYCLVDRTSSSASATSRSSTVTFGS- 319
Query: 161 MGYRRPSTQA--TKFINHPN--NFYYLSLKDISIDNER--------MNFPPDTFDITVSG 208
G PS A T + +P FYY+ L IS+ R + P T G
Sbjct: 320 -GAVGPSAAASFTPMVKNPRMETFYYVQLMGISVGGARVPGVAVSDLRLDPST------G 372
Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL---CYFLPE-TF 264
GG I+DSG+ +T Y L + F R A L P L CY L
Sbjct: 373 RGGVIVDSGTSVTRLARPAYAALRDAF-----RAAAAGLRLSPGGFSLFDTCYDLSGLKV 427
Query: 265 NRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYD 323
+ P+++ +F A + EN I F A A D V++IG+ QQ+ R V+D
Sbjct: 428 VKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFD 487
Query: 324 LNIDLLSFVKENC 336
+ L FV + C
Sbjct: 488 GDGQRLGFVPKGC 500
>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 93/364 (25%), Positives = 159/364 (43%), Gaps = 54/364 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
+ +L +GTPS +++DTGS+L + +FDPR SS++ + C
Sbjct: 135 VTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDPRASSTYASVRCSASQ 194
Query: 46 CTYFKCV---------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
C + + C+Y Y D S + G + +T+S G + +GC
Sbjct: 195 CDELQAATLNPSACSASNVCIYQASYGDSSFSVGSLSTDTVSF-----GSTRYPSFYYGC 249
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
DN G G AG++GL+R +S + QL + FSYCL G YL
Sbjct: 250 GQDNEGLF-----GRSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPTAASTG-----YLS 299
Query: 157 FGT-DMG-YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
G + G Y + A+ ++ + Y+++L +S+ + P + + II
Sbjct: 300 IGPYNTGHYYSYTPMASSSLD--ASLYFITLSGMSVGGSPLAVSPSEYSSLPT-----II 352
Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYF 274
DSG+V+T + V+ L + + AQ + + C+ + R P++A F
Sbjct: 353 DSGTVITRLPTAVHTALSKAVA---QAMAGAQRAPAFSILDTCFEGQASQLRVPTVAMAF 409
Query: 275 E-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
A++++ NV +ID ++ LA AP D A+IG+ QQ+ +YD+ + F
Sbjct: 410 AGGASMKLTTRNV-LIDVDDSTTCLAFAPTDS-TAIIGNTQQQTFSVIYDVAQSRIGFSA 467
Query: 334 ENCS 337
CS
Sbjct: 468 GGCS 471
>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 477
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 105/363 (28%), Positives = 155/363 (42%), Gaps = 50/363 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
+V + GTP++ +ILDTGS L + FDP KSSS+ + C P
Sbjct: 138 VVVVGFGTPAQTAAIILDTGSDLSWIQCKPCSGHCYRQHDPDFDPAKSSSYAAVPCGTPV 197
Query: 46 CTYFK--CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
C C C+Y ++Y D S T G + +T++ + F G FGC N G
Sbjct: 198 CAAAGGMCNGTTCLYGVQYGDGSSTTGVLSRDTLTF----NSSSKFTGFTFGCGEKNIG- 252
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
G + G+LGL R +S SQ FSYCL P+ T YL G
Sbjct: 253 ----DFGEVDGLLGLGRGKLSLPSQAAPSFGGVFSYCL----PSYNTTPGYLNIGATKPT 304
Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
Q T I P +FY++ L I+I + PP F T G ++DSG++LT
Sbjct: 305 STVPVQYTAMIKKPQYPSFYFIELVSINIGGYILPVPPSVFTKT-----GTLLDSGTILT 359
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCY-FLPETFNRFPSMAFYFEDA- 277
Y Y L ++F +F + P EP+ CY F + P+++F F D
Sbjct: 360 YLPPPAYTSLRDRF-----KFTMQGNKPAPPYEPLDTCYDFTGQGAIVIPAVSFNFSDGA 414
Query: 278 --NLRIDGENVFIIDYENHFFLLAVA--PHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
+L G +F D + LA P +++G+ QQR +YD+ + F+
Sbjct: 415 VFDLDFYGIMIFPDDAKPLIGCLAFVSRPAAMPFSIVGNTQQRAAEVIYDVPSQKIGFIP 474
Query: 334 ENC 336
+C
Sbjct: 475 ISC 477
>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 461
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 100/360 (27%), Positives = 157/360 (43%), Gaps = 38/360 (10%)
Query: 6 IGTPSKGVLLILDTGSALIYA-------------IFDPRKSSSFQKINCDHPDC------ 46
+GTP+K +++DTGS L + +F +S SF+ + C C
Sbjct: 112 VGTPAKKFRVVVDTGSELTWVNCRYRARGKDNRRVFRADESKSFKTVGCLTQTCKVDLMN 171
Query: 47 ----TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGK-AIFHGALFGCSNDNH 101
T + C Y +YAD S +G A ETI+V G G+ A G L GCS+
Sbjct: 172 LFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITV-GLTNGRMARLPGHLIGCSSSFT 230
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
G DG VLGL+ SF S S+ +FSYCLV L N + S+YL FG+
Sbjct: 231 GQSFQGADG----VLGLAFSDFSFTSTATSLYGAKFSYCLVDHLSN-KNVSNYLIFGSSR 285
Query: 162 GYRRPSTQATKF-INHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
+ + T + FY +++ IS+ + ++ P +D T SG GG I+DSG+ L
Sbjct: 286 STKTAFRRTTPLDLTRIPPFYAINVIGISLGYDMLDIPSQVWDAT-SG-GGTILDSGTSL 343
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN--RFPSMAFYFEDAN 278
T Y ++ Y +L ++ PI+ C+ FN + P + F+ +
Sbjct: 344 TLLADAAYKQVVTGLARYL--VELKRVKPEGVPIEYCFSFTSGFNVSKLPQLTFHLKGGA 401
Query: 279 LRIDGENVFIIDYENHFFLLA-VAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+++D L V+ +IG+ Q++ + +DL LSF C+
Sbjct: 402 RFEPHRKSYLVDAAPGVKCLGFVSAGTPATNVIGNIMQQNYLWEFDLMASTLSFAPSACT 461
>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 485
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 98/361 (27%), Positives = 153/361 (42%), Gaps = 44/361 (12%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
RL +GTP++ V ++LDTGS +++ IFDPRKS ++ I C P C
Sbjct: 144 TRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPHCR 203
Query: 48 YFKCVN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
+ C+Y + Y D S T G + ET++ + G GC +DN G
Sbjct: 204 RLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFR-----RNRVKGVALGCGHDNEG 258
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
A + +SF Q G ++FSYCLV + + +S + FG
Sbjct: 259 LFVGAAGLLGL-----GKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSS--VVFGNAAV 311
Query: 163 YRRPSTQATKFINHP--NNFYYLSLKDISIDNERM-NFPPDTFDITVSGEGGCIIDSGSV 219
R + T +++P + FYY+ L IS+ R+ F + G GG IIDSG+
Sbjct: 312 SR--IARFTPLLSNPKLDTFYYVELLGISVGGTRVPGVAASLFKLDQIGNGGVIIDSGTS 369
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCYFLPETFN-RFPSMAFYFED 276
+T Y + + F R L P+ C+ L + P++ +F
Sbjct: 370 VTRLIRPAYIAMRDAF-----RVGAKALKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRG 424
Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
A++ + N I N F A A +++IG+ QQ+ R VYDL + F C
Sbjct: 425 ADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484
Query: 337 S 337
+
Sbjct: 485 A 485
>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
Length = 443
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 106/379 (27%), Positives = 153/379 (40%), Gaps = 70/379 (18%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+VRL +GTP + V L LDTGS L++ + DP SS++ + C C
Sbjct: 85 LVRLAVGTPRRPVALTLDTGSDLVWTQCAPCRDCFDQDLPVLDPAASSTYAALPCGAARC 144
Query: 47 ---TYFKCV------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGA--LFG 95
+ C + C+Y Y D+S+T G A + + G H FG
Sbjct: 145 RALPFTSCGVRTLGNHRSCIYAYHYGDKSLTVGEIATDRFTFGDSGGSGESLHTRRLTFG 204
Query: 96 CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYL 155
C + N G + G+ G R S SQL FSYC E SS +
Sbjct: 205 CGHLNKGVFQSNE----TGIAGFGRGRWSLPSQLNVT---SFSYCFTSMF---ESKSSLV 254
Query: 156 KFGTDMG-----YRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSG 208
G + T + +P+ + Y+LSLK IS+ R+ P F T
Sbjct: 255 TLGGSPAALYSHAHSGEVRTTPILKNPSQPSLYFLSLKGISVGKTRLPVPETKFRST--- 311
Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQ-----LCYFLPET 263
IIDSG+ +T +VY + +F AQ+ P ++ LC+ LP T
Sbjct: 312 ----IIDSGASITTLPEEVYEAVKAEFA--------AQVGLPPSGVEGSALDLCFALPVT 359
Query: 264 --FNR--FPSMAFYFEDANLRIDGENVFIIDYENHFF--LLAVAPHDDLVALIGSQQQRD 317
+ R PS+ + E A+ + N D +L AP + V IG+ QQ++
Sbjct: 360 ALWRRPAVPSLTLHLEGADWELPRSNYVFEDLGARVMCIVLDAAPGEQTV--IGNFQQQN 417
Query: 318 TRFVYDLNIDLLSFVKENC 336
T VYDL D LSF C
Sbjct: 418 THVVYDLENDRLSFAPARC 436
>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 451
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 105/380 (27%), Positives = 159/380 (41%), Gaps = 64/380 (16%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
+ L IGTP ++ DTGS+LI+ F P SS+F K+ C C
Sbjct: 92 MNLSIGTPPVTFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSSTFSKLPCASSLCQ 151
Query: 48 -----YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
Y C CVY Y T G+ A ET+ V G A F G FGCS +N
Sbjct: 152 FLTSPYLTCNATGCVYYYPYG-MGFTAGYLATETLHV-----GGASFPGVAFGCSTENGV 205
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
+ + G++GL R +S +SQ+G RFSYCL G+ S + FG+
Sbjct: 206 GNSSS------GIVGLGRSPLSLVSQVG---VGRFSYCLRSDADAGD---SPILFGSLAK 253
Query: 163 YRRPSTQATKFINHP----NNFYYLSLKDISIDNERMNFPPDTFDITVSGE----GGCII 214
+ Q+T + +P +++YY++L I++ + TF T GG I+
Sbjct: 254 VTGGNVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPVTSTTFGFTRGAGAGLVGGTIV 313
Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-PIQLCY-----------FLPE 262
DSG+ LTY + Y + F+S L + LC+ +P
Sbjct: 314 DSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFDLCFDATAAGGGSGVPVPT 373
Query: 263 TFNRFPSMAFYFEDANLRIDGENVFIIDYENHFF---LLAVAPHDDL-VALIGSQQQRDT 318
RF A Y A R V +D + LL + + L +++IG+ Q D
Sbjct: 374 LVLRFAGGAEY---AVRRRSYVGVVAVDSQGRAAVECLLVLPASEKLSISIIGNVMQMDL 430
Query: 319 RFVYDLNIDLLSFVKENCSD 338
+YDL+ + SF +C++
Sbjct: 431 HVLYDLDGGMFSFAPADCAN 450
>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
Length = 464
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 102/389 (26%), Positives = 170/389 (43%), Gaps = 71/389 (18%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+V+L IGTP +DT S LI+ +F+PR SS++ + C C
Sbjct: 90 LVKLGIGTPPYKFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTC 149
Query: 47 TYF---KCV---NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
+C +E C YT Y+ + T+G A + + + G+ F G FGCS +
Sbjct: 150 DELDVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLVI-----GEDAFRGVAFGCSTSS 204
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
G A +GV+GL R +S +SQL +RF+YCL P L G D
Sbjct: 205 TG---GAPPPQASGVVGLGRGPLSLVSQLSV---RRFAYCLP---PPASRIPGKLVLGAD 255
Query: 161 MGYRRPSTQ--ATKFINHPN--NFYYLSLKDISIDNERMNF------------------- 197
R +T A P ++YYL+L + I + M+
Sbjct: 256 ADAARNATNRIAVPMRRDPRYPSYYYLNLDGLLIGDRTMSLPPTTTTTATATATAPAPAP 315
Query: 198 --PPDTFDITV--SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFE-RFQLAQLSDCPE 252
P+ + V + G IID S +T+ + +Y ++ V+ E +L + +
Sbjct: 316 TPSPNATAVAVGDANRYGMIIDIASTITFLEASLY----DELVNDLEVEIRLPRGTGSSL 371
Query: 253 PIQLCYFLPE--TFNRF--PSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHD-DLV 307
+ LC+ LP+ F+R P++A F+ LR+D +F D E+ L V + V
Sbjct: 372 GLDLCFILPDGVAFDRVYVPAVALAFDGRWLRLDKARLFAEDRESGMMCLMVGRAEAGSV 431
Query: 308 ALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+++G+ QQ++ + +Y+L ++FV+ C
Sbjct: 432 SILGNFQQQNMQVLYNLRRGRVTFVQSPC 460
>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
Length = 464
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 102/389 (26%), Positives = 170/389 (43%), Gaps = 71/389 (18%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+V+L IGTP +DT S LI+ +F+PR SS++ + C C
Sbjct: 90 LVKLGIGTPPYKFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTC 149
Query: 47 TYF---KCV---NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
+C +E C YT Y+ + T+G A + + + G+ F G FGCS +
Sbjct: 150 DELDVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLVI-----GEDAFRGVAFGCSTSS 204
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
G A +GV+GL R +S +SQL +RF+YCL P L G D
Sbjct: 205 TG---GAPPPQASGVVGLGRGPLSLVSQLSV---RRFAYCLP---PPASRIPGKLVLGAD 255
Query: 161 MGYRRPSTQ--ATKFINHPN--NFYYLSLKDISIDNERMNF------------------- 197
R +T A P ++YYL+L + I + M+
Sbjct: 256 ADAARNATNRIAVPMRRDPRYPSYYYLNLDGLLIGDRAMSLPPTTTTTATATATAPAPAP 315
Query: 198 --PPDTFDITV--SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFE-RFQLAQLSDCPE 252
P+ + V + G IID S +T+ + +Y ++ V+ E +L + +
Sbjct: 316 TPSPNATAVAVGDANRYGMIIDIASTITFLEASLY----DELVNDLEVEIRLPRGTGSSL 371
Query: 253 PIQLCYFLPE--TFNRF--PSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHD-DLV 307
+ LC+ LP+ F+R P++A F+ LR+D +F D E+ L V + V
Sbjct: 372 GLDLCFILPDGVAFDRVYVPAVALAFDGRWLRLDKARLFAEDRESGMMCLMVGRAEAGSV 431
Query: 308 ALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+++G+ QQ++ + +Y+L ++FV+ C
Sbjct: 432 SILGNFQQQNMQVLYNLRRGRVTFVQSPC 460
>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
Length = 459
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 97/371 (26%), Positives = 152/371 (40%), Gaps = 67/371 (18%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCD-- 42
+V + +GTP+ +L++DTGS L + +FDP +SS++ I C+
Sbjct: 121 VVTVGLGTPAVSQVLLIDTGSDLSWVQCAPCNSTTCYPQKDPLFDPSRSSTYAPIPCNTD 180
Query: 43 ----------HPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGA 92
DCT QC Y + Y D S T G ++ET++ + G FH
Sbjct: 181 ACRDLTRDGYGSDCTSGSGGGAQCGYAITYGDGSQTTGVYSNETLT-MAPGVTVKDFH-- 237
Query: 93 LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS 152
FGC G D+D + G+LGL S + Q S+ FSYC LP +
Sbjct: 238 -FGC-----GHDQDGPNDKYDGLLGLGGAPESLVVQTSSVYGGAFSYC----LPAANDQA 287
Query: 153 SYLKFGTDMGYRRPSTQATKFINHP-----NNFYYLSLKDISIDNERMNFPPDTFDITVS 207
+L G P A+ F+ P FY +++ I++ E ++ PP F
Sbjct: 288 GFLALGA------PVNDASGFVFTPMVREQQTFYVVNMTGITVGGEPIDVPPSAF----- 336
Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNR 266
GG IIDSG+V+T Y L F + L + + CY F +
Sbjct: 337 -SGGMIIDSGTVVTELQHTAYAALQAAFRKAMAAYPLLPNGE----LDTCYNFTGHSNVT 391
Query: 267 FPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLN 325
P +A F A + +D + ++D N P D+ ++G+ QR +YD+
Sbjct: 392 VPRVALTFSGGATVDLDVPDGILLD--NCLAFQEAGP-DNQPGILGNVNQRTLEVLYDVG 448
Query: 326 IDLLSFVKENC 336
+ F + C
Sbjct: 449 HGRVGFGADAC 459
>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 105/370 (28%), Positives = 155/370 (41%), Gaps = 58/370 (15%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
+RL +GTP+ V ++LDTGS +++ AIFDP+KS +F + C C
Sbjct: 137 MRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKKSKTFATVPCGSRLCR 196
Query: 48 YF----KCVNEQ---CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL-----FG 95
+CV + C+Y + Y D S T+G + ET++ FHGA G
Sbjct: 197 RLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLT----------FHGARVDHVPLG 246
Query: 96 CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV--IPLPNGEYTSS 153
C +DN G A R +SF SQ + +FSYCLV + S
Sbjct: 247 CGHDNEGLFVGAAGLLGL-----GRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPS 301
Query: 154 YLKFGTDMGYRRPSTQA-TKFINHP--NNFYYLSLKDISIDNERM-NFPPDTFDITVSGE 209
+ FG P T T + +P + FYYL L IS+ R+ F + +G
Sbjct: 302 TIVFGNAA---VPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGN 358
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPE-TFNR 266
GG IIDSG+ +T Y L + F R +L P C+ L T +
Sbjct: 359 GGVIIDSGTSVTRLTQPAYVALRDAF-----RLGATKLKRAPSYSLFDTCFDLSGMTTVK 413
Query: 267 FPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNI 326
P++ F+F + + N I F A A +++IG+ QQ+ R YDL
Sbjct: 414 VPTVVFHFGGGEVSLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVG 473
Query: 327 DLLSFVKENC 336
+ F+ C
Sbjct: 474 SRVGFLSRAC 483
>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
Length = 525
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 113/390 (28%), Positives = 168/390 (43%), Gaps = 69/390 (17%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
++ +++GTP + +I+DTGS L + +FDP SSS++ + C C
Sbjct: 152 LMDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASSSYRNVTCGDHRC 211
Query: 47 ---------------TYFKCVNEQCVYTMKYADQSVTKGFAAHE--TISVIGKGEGKAIF 89
T + + C Y Y DQS T G A E T+++ G + +
Sbjct: 212 GHVAPPPEPEASSPRTCRRPGEDPCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRV- 270
Query: 90 HGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGE 149
G +FGC + N G AG+LGL R +SF SQL ++ FSYCLV +G
Sbjct: 271 DGVVFGCGHRNRGLFH-----GAAGLLGLGRGPLSFASQLRAVYGHTFSYCLV---DHGS 322
Query: 150 YTSSYLKFGTD----MGYRRPSTQATKFINHP------NNFYYLSLKDISIDNERMNFPP 199
S + FG D P + T F + FYY+ LK + + E +N
Sbjct: 323 DVGSKVVFGEDDDALALAAHPQLKYTAFAPASSSSSPADTFYYVKLKGVLVGGELLNISS 382
Query: 200 DTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-PI-QLC 257
DT+D+ G GG IIDSG+ L+YF Y + F+ R PE P+ C
Sbjct: 383 DTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRHAFMDRMSR----SYPLVPEFPVLSPC 438
Query: 258 YFL-----PETFNRFPSMAFYFED-ANLRIDGENVFI---IDYENHFFLLAVAPHDDLVA 308
Y + PE P ++ F D A EN FI D + L + ++
Sbjct: 439 YNVSGVERPEV----PELSLLFADGAVWDFPAENYFIRLDPDGGSIMCLAVLGTPRTGMS 494
Query: 309 LIGSQQQRDTRFVYDLNIDLLSFVKENCSD 338
+IG+ QQ++ VYDL + L F C++
Sbjct: 495 IIGNFQQQNFHVVYDLQNNRLGFAPRRCAE 524
>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 90/361 (24%), Positives = 156/361 (43%), Gaps = 48/361 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY------------AIFDPRKSSSFQKINCDHPDCTY 48
+VR IGTP++ +L+ LDT + + +FDP KSSS + + C+ P C
Sbjct: 89 IVRANIGTPAQAMLVALDTSNDAAWIPCSGCVGCSSSVLFDPSKSSSSRTLQCEAPQCKQ 148
Query: 49 FK----CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
V++ C + M Y ++ + + +T+++ + FGC N G
Sbjct: 149 APNPSCTVSKSCGFNMTYGGSAI-EAYLTQDTLTL-----ATDVIPNYTFGCINKASGTS 202
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT--SSYLKFGTDMG 162
A+ G++GL R +S ISQ ++ + FSYCL PN + + S L+ G
Sbjct: 203 LPAQ-----GLMGLGRGPLSLISQSQNLYQSTFSYCL----PNSKSSNFSGSLRLGPKNQ 253
Query: 163 YRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
R + T + +P ++ YY++L I + N+ ++ P + G I DSG+V
Sbjct: 254 PIR--IKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGTVY 311
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLR 280
T Y + +F + L CY FPS+ F F N+
Sbjct: 312 TRLVEPAYVAMRNEFRRRVKNANATSLGG----FDTCY---SGSVVFPSVTFMFAGMNVT 364
Query: 281 IDGENVFIIDYENHFFLLAVAPH----DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ +N+ I + LA+A + ++ +I S QQ++ R + D+ L +E C
Sbjct: 365 LPPDNLLIHSSAGNLSCLAMAAAPTNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRETC 424
Query: 337 S 337
+
Sbjct: 425 T 425
>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
Length = 485
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 98/361 (27%), Positives = 152/361 (42%), Gaps = 44/361 (12%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
RL +GTP++ V ++LDTGS +++ IFDPRKS ++ I C P C
Sbjct: 144 TRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPHCR 203
Query: 48 YFKCVN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
+ C+Y + Y D S T G + ET++ + G GC +DN G
Sbjct: 204 RLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFR-----RNRVKGVALGCGHDNEG 258
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
A + +SF Q G ++FSYCLV + + +S + FG
Sbjct: 259 LFVGAAGLLGL-----GKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSS--VVFGNAAV 311
Query: 163 YRRPSTQATKFINHP--NNFYYLSLKDISIDNERM-NFPPDTFDITVSGEGGCIIDSGSV 219
R + T +++P + FYY+ L IS+ R+ F + G GG IIDSG+
Sbjct: 312 SR--IARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTS 369
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPETFN-RFPSMAFYFED 276
+T Y + + F R L P C+ L + P++ +F
Sbjct: 370 VTRLIRPAYIAMRDAF-----RVGAKTLKRAPNFSLFDTCFDLSNMNEVKVPTVVLHFRR 424
Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
A++ + N I N F A A +++IG+ QQ+ R VYDL + F C
Sbjct: 425 ADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484
Query: 337 S 337
+
Sbjct: 485 A 485
>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 479
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 97/361 (26%), Positives = 152/361 (42%), Gaps = 42/361 (11%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
VR+ +G+P L++D+GS +I+ +FDP S+SF + CD C
Sbjct: 135 VRVGVGSPPTEQYLVVDSGSDVIWIQCRPCAECYQQADPLFDPAASASFTAVPCDSGVCR 194
Query: 48 YFK-----CVNE-QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
C + C Y + Y D S T+G A ET++ G+ + G GC + N
Sbjct: 195 TLPGGSSGCADSGACRYQVSYGDGSYTQGVLAMETLTF---GDSTPV-QGVAIGCGHRNR 250
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
G A AG+LGL +S + QLG FSYCL + + L FG D
Sbjct: 251 GLFVGA-----AGLLGLGWGPMSLVGQLGGAAGGAFSYCLASR--GADAGAGSLVFGRDD 303
Query: 162 GYRRPSTQATKFIN-HPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
+ N +FYY+ L + + ER+ FD+T G GG ++D+G+ +
Sbjct: 304 AMPVGAVWVPLLRNAQQPSFYYVGLTGLGVGGERLPLQDGLFDLTEDGGGGVVMDTGTAV 363
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPETFN-RFPSMAFYF--E 275
T D Y L + F S L P + CY L + R P++A YF +
Sbjct: 364 TRLPPDAYAALRDAFASTIG----GDLPRAPGVSLLDTCYDLSGYASVRVPTVALYFGRD 419
Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
A L + N +++ + LA A ++++G+ QQ+ + D + F
Sbjct: 420 GAALTLPARN-LLVEMGGGVYCLAFAASASGLSILGNIQQQGIQITVDSANGYVGFGPST 478
Query: 336 C 336
C
Sbjct: 479 C 479
>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 96/367 (26%), Positives = 150/367 (40%), Gaps = 58/367 (15%)
Query: 6 IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYFKC 51
+G +I+DT S L + +FDP S S+ + C+ C +
Sbjct: 117 VGIGGGEATVIVDTASELTWVQCEPCDACHDQQEPLFDPSSSPSYAAVPCNSSSCDALRV 176
Query: 52 VN-----------EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
C YT+ Y D S ++G AH+ +S+ G+ G +FGC N
Sbjct: 177 ATGMSGQACDDQPAACSYTLSYRDGSYSRGVLAHDRLSLAGED-----IQGFVFGCGTSN 231
Query: 101 HG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
G F G +G++GL R +S ISQ FSYCL P +S L G
Sbjct: 232 QGPF------GGTSGLMGLGRSQLSLISQTMDQFGGVFSYCLP---PKESGSSGSLVLGD 282
Query: 160 DMGYRRPSTQA--TKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
D R ST T ++ P FY +L I++ E + P + G G I+D
Sbjct: 283 DASVYRNSTPIVYTAMVSDPLQGPFYLANLTGITVGGEDVQSP----GFSAGGGGKAIVD 338
Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERF-QLAQLSDCPEPIQLCYFLPETFN-RFPSMAFY 273
SG+++T VY + +FVS + Q A S + C+ L + PS+
Sbjct: 339 SGTIITSLVPSVYAAVRAEFVSQLAEYPQAAPFS----ILDTCFDLTGLREVQVPSLKLV 394
Query: 274 FE-DANLRIDGENVFII---DYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
F+ A + +D + V + D LA + +IG+ QQ++ R ++D +
Sbjct: 395 FDGGAEVEVDSKGVLYVVTGDASQVCLALASLKSEYDTPIIGNYQQKNLRVIFDTVGSQI 454
Query: 330 SFVKENC 336
F +E C
Sbjct: 455 GFAQETC 461
>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 106/369 (28%), Positives = 162/369 (43%), Gaps = 55/369 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
+V + IG+P LL +DT S L++ IFDP +S + + +C
Sbjct: 86 LVNISIGSPPVTQLLHMDTASDLLWLQCRPCINCYAQSLPIFDPSRSYTHRNESCRTSQY 145
Query: 47 TY----FKCVNEQCVYTMKYADQSVTKGFAAHETI--SVIGKGEGKAIFHGALFGCSNDN 100
+ F C Y+M+Y D + +KG A E + + I A H +FGC +DN
Sbjct: 146 SMPSLRFNAKTRSCEYSMRYMDGTGSKGILAKEMLMFNTIYDESSSAALHDVVFGCGHDN 205
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
+G G+LGL S + + G+ +FSYC L + Y + L G D
Sbjct: 206 YG-----EPLVGTGILGLGYGEFSLVHRFGT----KFSYCFG-SLDDPSYPHNVLVLGDD 255
Query: 161 MGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDIT-VSGEGGCIIDSGSV 219
T + N FYY++++ IS+D + P F+ +G GG IID+G+
Sbjct: 256 GANILGDTTPLEIYN---GFYYVTIEAISVDGIILPIDPWVFNRNHQTGLGGTIIDTGNS 312
Query: 220 LTYFHSDVYWKLHEKFVSYFE-RFQLAQLSDCPEPIQLCYFLPETFN----------RFP 268
LT + Y L K YFE RF A ++ Q F E +N FP
Sbjct: 313 LTSLVEEAYKPLKNKIEDYFEGRFTAADVN------QDDMFKVECYNGNLERDLVESGFP 366
Query: 269 SMAFYFED-ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNID 327
+ F+F D A L +D ++VF + + F LAV P + + IG+ Q+ YDL
Sbjct: 367 IVTFHFSDGAELSLDVKSVF-MKLSPNVFCLAVTPGN--MNSIGATAQQSYNIGYDLEAK 423
Query: 328 LLSFVKENC 336
+SF + +C
Sbjct: 424 KISFERIDC 432
>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
Length = 466
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 90/361 (24%), Positives = 152/361 (42%), Gaps = 51/361 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
++ + IG+P+ + +DTGS + + ++FDP SS++ +C C
Sbjct: 132 VITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVDSLFDPSASSTYSPFSCSSAAC 191
Query: 47 TYFK-------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
C + QC Y + Y D S T G + +T+++ G G FGCS
Sbjct: 192 VQLSQSQQGNGCSSSQCQYIVSYVDGSSTTGTYSSDTLTL-----GSNAIKGFQFGCSQS 246
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG- 158
G D D G++GL S +SQ K FSYCL P P +S +L G
Sbjct: 247 ESGGFSDQTD----GLMGLGGDAQSLVSQTAGTFGKAFSYCLP-PTPG---SSGFLTLGA 298
Query: 159 -TDMGY-RRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
+ G+ + P ++T+ +Y + L+ I + +++N P F G ++DS
Sbjct: 299 ASRSGFVKTPMLRSTQI----PTYYGVLLEAIRVGGQQLNIPTSVF------SAGSVMDS 348
Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE 275
G+V+T Y L F + +++ AQ S + C+ F ++ PS+A F
Sbjct: 349 GTVITRLPPTAYSALSSAFKAGMKKYPPAQPSGI---LDTCFDFSGQSSVSIPSVALVFS 405
Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
+ N +++ +N A D + IG+ QQR +YD+ + F
Sbjct: 406 GGAVVNLDFNGIMLELDNWCLAFAANSDDSSLGFIGNVQQRTFEVLYDVGGGAVGFRAGA 465
Query: 336 C 336
C
Sbjct: 466 C 466
>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
Length = 451
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 102/370 (27%), Positives = 160/370 (43%), Gaps = 57/370 (15%)
Query: 6 IGTPSKGVLLILDTGSALIYA------------------IFDPRKSSSFQKINCDHPDC- 46
IGTP + LI+DTGS LI+ ++DP +SS+F + C C
Sbjct: 97 IGTPPQPRKLIVDTGSDLIWTQCKLSSSTAVAARHGSPPVYDPGESSTFAFLPCSDRLCQ 156
Query: 47 ----TYFKCVNE-QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
++ C ++ +CVY Y + G A ET + G +A+ FGC +
Sbjct: 157 EGQFSFKNCTSKNRCVYEDVYGSAAAV-GVLASETFTF---GARRAVSLRLGFGCGALSA 212
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
G A G+LGLS ++S I+QL +RFSYCL P + +S L FG
Sbjct: 213 GSLIGA-----TGILGLSPESLSLITQLK---IQRFSYCLT---PFADKKTSPLLFGAMA 261
Query: 162 GYRRPST----QATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
R T Q T +++P +YY+ L IS+ ++R+ P + + G GG I+D
Sbjct: 262 DLSRHKTTRPIQTTAIVSNPVKTVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVD 321
Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE 275
SGS + Y + + E + + +L + E +LC+ LP A
Sbjct: 322 SGSTVAYLVEAAFEAVKE---AVMDVVRLPVANRTVEDYELCFVLPRRTAAAAMEAVQVP 378
Query: 276 DANLRIDGENVFIIDYENHF-------FLLAVAPHDD--LVALIGSQQQRDTRFVYDLNI 326
L DG ++ +N+F LAV D V++IG+ QQ++ ++D+
Sbjct: 379 PLVLHFDGGAAMVLPRDNYFQEPRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQH 438
Query: 327 DLLSFVKENC 336
SF C
Sbjct: 439 HKFSFAPTQC 448
>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
Length = 429
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 93/379 (24%), Positives = 162/379 (42%), Gaps = 62/379 (16%)
Query: 2 VRLFIGTPSKGVLLILDTGSAL----------IYAIFDPRKSSSFQKINCDHPDCTY--- 48
V L +G+P + V ++LDTGS L ++++FDP +SSS+ I C P C
Sbjct: 58 VSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHSVFDPLRSSSYSPIPCTSPTCRTRTR 117
Query: 49 -----FKCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C ++ C + YAD S +G A +T + G + +FGC +
Sbjct: 118 DFSIPVSCDKKKLCHAIISYADASSIEGNLASDTFHI-----GNSAIPATIFGCMDSGFS 172
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
+ D D G++G++R ++SF++Q+G ++FSYC+ +G+ +S L FG
Sbjct: 173 SNSD-EDSKTTGLIGMNRGSLSFVTQMG---LQKFSYCI-----SGQDSSGILLFGESSF 223
Query: 163 YRRPSTQATKF--INHPNNF-----YYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
+ + T I+ P + Y + L+ I + N + P + +G G ++D
Sbjct: 224 SWLKALKYTPLVQISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGAGQTMVD 283
Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP-------IQLCYFLP---ETFN 265
SG+ T+ VY L +FV R A L +P + LCY +P T
Sbjct: 284 SGTQFTFLLGPVYTALKNEFV----RQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRTLP 339
Query: 266 RFPSMAFYFEDANLRIDGENVF-----IIDYENHFFLLAVAPHDDLVA---LIGSQQQRD 317
P++ F A + + E + +I + + + L +IG Q++
Sbjct: 340 PLPTVTLMFRGAEMSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGVESYIIGHHHQQN 399
Query: 318 TRFVYDLNIDLLSFVKENC 336
+DL + F + C
Sbjct: 400 VWMEFDLAKSRVGFAEVRC 418
>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 455
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 88/365 (24%), Positives = 157/365 (43%), Gaps = 55/365 (15%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
+ R+ +GTP+K ++++DTGS+L + +FDP+ SSS+ ++C P
Sbjct: 118 VTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRVSCHRQSGPVFDPKTSSSYAAVSCSSPQ 177
Query: 46 CTYFKCV---------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
C + C+Y Y D S + G+ + +T+S G +GC
Sbjct: 178 CDGLSTATLNPAVCSPSNVCIYQASYGDSSFSVGYLSKDTVSF-----GANSVPNFYYGC 232
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
DN G G AG++GL+R +S + QL + FSYCL +S YL
Sbjct: 233 GQDNEGLF-----GRSAGLMGLARNKLSLLYQLAPTLGYSFSYCL-----PSTSSSGYLS 282
Query: 157 FGTDMGYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
G+ Y T +++ ++ Y++SL +++ + + + + II
Sbjct: 283 IGS---YNPGGYSYTPMVSNTLDDSLYFISLSGMTVAGKPLAVSSSEYTSLPT-----II 334
Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFY 273
DSG+V+T + VY L + + + + + + C+ P+++
Sbjct: 335 DSGTVITRLPTSVYTALSKAVAAAMKGST--KRAAAYSILDTCFEGQASKLRAVPAVSMA 392
Query: 274 FE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
F A L++ N ++D + LA AP A+IG+ QQ+ VYD+ + + F
Sbjct: 393 FSGGATLKLSAGN-LLVDVDGATTCLAFAPARS-AAIIGNTQQQTFSVVYDVKSNRIGFA 450
Query: 333 KENCS 337
CS
Sbjct: 451 AAGCS 455
>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 85/362 (23%), Positives = 159/362 (43%), Gaps = 51/362 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
+ R+ +GTP+K ++++DTGS+L + +FDP+ SSS+ ++C P
Sbjct: 138 VTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRVSCHRQSGPVFDPKTSSSYAAVSCSTPQ 197
Query: 46 CTYFKCV---------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
C ++ C+Y Y D S + G+ + +T+S G +GC
Sbjct: 198 CNDLSTATLNPAACSSSDVCIYQASYGDSSFSVGYLSKDTVSF-----GSNSVPNFYYGC 252
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
DN G G AG++GL+R +S + QL + FSYC LP+ +
Sbjct: 253 GQDNEGLF-----GRSAGLMGLARNKLSLLYQLAPTLGYSFSYC----LPSSSSSGYLSI 303
Query: 157 FGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
+ G + + ++ ++ Y++ L +++ + + + + IIDS
Sbjct: 304 GSYNPGQYSYTPMVSSTLD--DSLYFIKLSGMTVAGKPLAVSSSEYSSLPT-----IIDS 356
Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE- 275
G+V+T + VY L + + + A D + C+ + R P+++ F
Sbjct: 357 GTVITRLPTTVYDALSKAVAGAMKGTKRA---DAYSILDTCFVGQASSLRVPAVSMAFSG 413
Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
A L++ +N ++D ++ LA AP A+IG+ QQ+ VYD+ + + F
Sbjct: 414 GAALKLSAQN-LLVDVDSSTTCLAFAPARS-AAIIGNTQQQTFSVVYDVKSNRIGFAAGG 471
Query: 336 CS 337
C+
Sbjct: 472 CT 473
>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 436
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 93/379 (24%), Positives = 162/379 (42%), Gaps = 62/379 (16%)
Query: 2 VRLFIGTPSKGVLLILDTGSAL----------IYAIFDPRKSSSFQKINCDHPDCTY--- 48
V L +G+P + V ++LDTGS L ++++FDP +SSS+ I C P C
Sbjct: 65 VSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHSVFDPLRSSSYSPIPCTSPTCRTRTR 124
Query: 49 -----FKCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C ++ C + YAD S +G A +T + G + +FGC +
Sbjct: 125 DFSIPVSCDKKKLCHAIISYADASSIEGNLASDTFHI-----GNSAIPATIFGCMDSGFS 179
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
+ D D G++G++R ++SF++Q+G ++FSYC+ +G+ +S L FG
Sbjct: 180 SNSD-EDSKTTGLIGMNRGSLSFVTQMG---LQKFSYCI-----SGQDSSGILLFGESSF 230
Query: 163 YRRPSTQATKF--INHPNNF-----YYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
+ + T I+ P + Y + L+ I + N + P + +G G ++D
Sbjct: 231 SWLKALKYTPLVQISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGAGQTMVD 290
Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP-------IQLCYFLP---ETFN 265
SG+ T+ VY L +FV R A L +P + LCY +P T
Sbjct: 291 SGTQFTFLLGPVYTALKNEFV----RQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRTLP 346
Query: 266 RFPSMAFYFEDANLRIDGENVF-----IIDYENHFFLLAVAPHDDLVA---LIGSQQQRD 317
P++ F A + + E + +I + + + L +IG Q++
Sbjct: 347 PLPTVTLMFRGAEMSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGVESYIIGHHHQQN 406
Query: 318 TRFVYDLNIDLLSFVKENC 336
+DL + F + C
Sbjct: 407 VWMEFDLAKSRVGFAEVRC 425
>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
Length = 423
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 93/358 (25%), Positives = 156/358 (43%), Gaps = 41/358 (11%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
V L +GTP + V ++ DTGS +++ +F+P SS+FQ I C C
Sbjct: 83 VSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCYGQTDPLFNPSFSSTFQSITCGSSLCQ 142
Query: 48 YF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
C QC+Y + Y D S T G + ET+S G + GC ++N G
Sbjct: 143 QLLIRGCRRNQCLYQVSYGDGSFTVGEFSTETLSF-----GSNAVNSVAIGCGHNNQGLF 197
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY-LKFGTDMGY 163
A + +SF SQ+G + FSYC LP E T S L FG
Sbjct: 198 TGAAGLLGL-----GKGLLSFPSQVGQLYGSVFSYC----LPTRESTGSVPLIFGNQA-- 246
Query: 164 RRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDI-TVSGEGGCIIDSGSVL 220
+ Q T + +P + FYY+ + I + +N P + + + +G GG I+DSG+ +
Sbjct: 247 VASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVNIPAGSLSLDSSTGNGGVILDSGTAV 306
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP-ETFNRFPSMAFYFE-DAN 278
T + Y + + F + A+++ CY L + P+++F F A
Sbjct: 307 TRLVTSAYNPMRDAFRAGMP--SDAKMTSGFSLFDTCYDLSGRSSIMLPAVSFVFNGGAT 364
Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ + +N+ + + + LA AP+ + ++IG+ QQ+ R +D + + C
Sbjct: 365 MALPAQNIMVPVDNSGTYCLAFAPNSENFSIIGNIQQQSFRMSFDSTGNRVGIGANQC 422
>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 436
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 97/357 (27%), Positives = 160/357 (44%), Gaps = 40/357 (11%)
Query: 6 IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYFK- 50
+GTP + I+DTGS +++ +F+P KSSS++ I C C +
Sbjct: 93 VGTPPFKLYGIVDTGSDIVWLQCEPCQECYNQTTPMFNPSKSSSYKNIPCPSKLCQSMED 152
Query: 51 --CVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDA 107
C ++ C Y+ Y D S + G + +T+++ F + GC +N +
Sbjct: 153 TSCNDKNYCEYSTYYGDNSHSGGDLSVDTLTLESTNGLTVSFPNIVIGCGTNNI----LS 208
Query: 108 RDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPN----GEYTSSYLKFGTDMGY 163
+GA +G++G SFI+QLGS +FSYCL PL + +S L FG
Sbjct: 209 YEGASSGIVGFGSGPASFITQLGSSTGGKFSYCLT-PLFSVTNIQSNATSKLNFGDAATV 267
Query: 164 RRPSTQATKFINH-PNNFYYLSLKDISIDNERMNFP--PDTFDITVSGEGGCIIDSGSVL 220
T + P FYYL+L+ S+ N R+ P+ EG IIDSG+ L
Sbjct: 268 SGDGVVTTPILKKDPETFYYLTLEAFSVGNRRVEIGGVPNG-----DNEGNIIIDSGTTL 322
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLR 280
T D Y L V + +L ++ D + + LCY + FP + +F+ A++
Sbjct: 323 TSLTKDDYSFLESAVV---DLVKLERVDDPTQTLNLCYSVKAEGYDFPIITMHFKGADVD 379
Query: 281 IDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+ + F+ + F L + D A+ G+ Q++ YDL ++SF +C+
Sbjct: 380 LHPISTFVSVADGVFCLAFESSQDH--AIFGNLAQQNLMVGYDLQQKIVSFKPSDCT 434
>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 90/361 (24%), Positives = 156/361 (43%), Gaps = 48/361 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY------------AIFDPRKSSSFQKINCDHPDCTY 48
+VR IGTP++ +L+ LDT + + +FDP KSSS + + C+ P C
Sbjct: 89 IVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGCSSSVLFDPSKSSSSRTLQCEAPQCKQ 148
Query: 49 FK----CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
V++ C + M Y ++ + + +T+++ + FGC N G
Sbjct: 149 APNPSCTVSKSCGFNMTYGGSTI-EAYLTQDTLTLASD-----VIPNYTFGCINKASGTS 202
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT--SSYLKFGTDMG 162
A+ G++GL R +S ISQ ++ + FSYC LPN + + S L+ G
Sbjct: 203 LPAQ-----GLMGLGRGPLSLISQSQNLYQSTFSYC----LPNSKSSNFSGSLRLGPKNQ 253
Query: 163 YRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
R + T + +P ++ YY++L I + N+ ++ P + G I DSG+V
Sbjct: 254 PIR--IKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGTVY 311
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLR 280
T Y + +F + L CY FPS+ F F N+
Sbjct: 312 TRLVEPAYVAVRNEFRRRVKNANATSLGG----FDTCY---SGSVVFPSVTFMFAGMNVT 364
Query: 281 IDGENVFIIDYENHFFLLAVAPH----DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ +N+ I + LA+A + ++ +I S QQ++ R + D+ L +E C
Sbjct: 365 LPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRETC 424
Query: 337 S 337
+
Sbjct: 425 T 425
>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
Length = 425
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 90/361 (24%), Positives = 156/361 (43%), Gaps = 48/361 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY------------AIFDPRKSSSFQKINCDHPDCTY 48
+VR IGTP++ +L+ LDT + + +FDP KSSS + + C+ P C
Sbjct: 89 IVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGCSSSVLFDPSKSSSSRTLQCEAPQCKQ 148
Query: 49 FK----CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
V++ C + M Y ++ + + +T+++ + FGC N G
Sbjct: 149 APNPSCTVSKSCGFNMTYGGSTI-EAYLTQDTLTLASD-----VIPNYTFGCINKASGTS 202
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT--SSYLKFGTDMG 162
A+ G++GL R +S ISQ ++ + FSYC LPN + + S L+ G
Sbjct: 203 LPAQ-----GLMGLGRGPLSLISQSQNLYQSTFSYC----LPNSKSSNFSGSLRLGPKNQ 253
Query: 163 YRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
R + T + +P ++ YY++L I + N+ ++ P + G I DSG+V
Sbjct: 254 PIR--IKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGTVY 311
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLR 280
T Y + +F + L CY FPS+ F F N+
Sbjct: 312 TRLVEPAYVAVRNEFRRRVKNANATSLGG----FDTCY---SGSVVFPSVTFMFAGMNVT 364
Query: 281 IDGENVFIIDYENHFFLLAVAPH----DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ +N+ I + LA+A + ++ +I S QQ++ R + D+ L +E C
Sbjct: 365 LPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRETC 424
Query: 337 S 337
+
Sbjct: 425 T 425
>gi|225427556|ref|XP_002266575.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 445
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 105/363 (28%), Positives = 166/363 (45%), Gaps = 40/363 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
++ + +GTP +L I DTGS LI+ +FDP+KS +++ + C++ C
Sbjct: 95 LMNISLGTPPVSMLGIADTGSDLIWRQCLPCDDCYKQVEPLFDPKKSKTYKTLGCNNDFC 154
Query: 47 TYF----KCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEG-KAIFHGALFGCSNDN 100
C ++ C + Y DQS T+ + ET + IG EG A F G FGC + N
Sbjct: 155 QDLGQQGSCGDDNTCTSSYSYGDQSYTRRDLSSETFT-IGSTEGDPASFPGLAFGCGHSN 213
Query: 101 HG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
G F+E G LS V QL S + +FSYCLV PL + SS + FG
Sbjct: 214 GGTFNEKDSGLIGLGGGPLSLVM-----QLSSKVGGQFSYCLV-PLSSDSTASSKINFGK 267
Query: 160 DMGYRRPSTQATKFIN-HPNNFYYLSLKDISIDNERM---NFPPDTFDITVSGEGGCIID 215
T +T I P+ FYYL+L+ +S+ +E++ F + + E IID
Sbjct: 268 SAVVSGSGTVSTPLIKGTPDTFYYLTLEGMSLGSEKVAFKGFSKNKSSPAAAEESNIIID 327
Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-PIQLCYFLPETFNRFPSMAFYF 274
SG+ LT D Y + S + Q + P LCY + P++ +F
Sbjct: 328 SGTTLTLLPRDFYTDME----SALTKVIGGQTTTDPRGTFSLCYSGVKKL-EIPTITAHF 382
Query: 275 EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
A++++ N F+ E+ ++ P +L A+ G+ Q + YDL + +SF
Sbjct: 383 IGADVQLPPLNTFVQAQED-LVCFSMIPSSNL-AIFGNLSQMNFLVGYDLKNNKVSFKPT 440
Query: 335 NCS 337
+C+
Sbjct: 441 DCT 443
>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 427
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 100/359 (27%), Positives = 168/359 (46%), Gaps = 36/359 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+++L +G+P + ++DTGS L++A +F+P +S ++ I C+ C
Sbjct: 83 LMKLTLGSPPVDIYGLVDTGSDLVWAQCTPCGGCYRQKSPMFEPLRSKTYSPIPCESEQC 142
Query: 47 TYF--KCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG- 102
++F C ++ C Y+ YAD SVTKG A E I+ + +FGC + N G
Sbjct: 143 SFFGYSCSPQKMCAYSYSYADSSVTKGVLAREAITFSSTDGDPVVVGDIIFGCGHSNSGT 202
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSII-KKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
F+E+ G L S +SQ+G++ KRFS CLV P +TS + FG +
Sbjct: 203 FNENDMGIIGMGGGPL-----SLVSQIGTLYGSKRFSQCLV-PFHTDAHTSGTINFGEES 256
Query: 162 GYRRPSTQATKFINHPNNFYYL-SLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
T + YL +L+ IS+ + + F T+S +G +IDSG+
Sbjct: 257 DVSGEGVVTTPLASEEGQTSYLVTLEGISVGDTFVRF---NSSETLS-KGNIMIDSGTPA 312
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-PIQLCYFLPETFNRFPSMAFYFEDANL 279
TY + Y +L E+ + L + D P+ QLCY ET P + +FE A++
Sbjct: 313 TYIPQEFYERLVEELKV---QSSLLPIEDDPDLGTQLCY-RSETNLEGPILTAHFEGADV 368
Query: 280 RIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCSD 338
++ FI ++ F A+A D + G+ Q + +DL+ +SF +C++
Sbjct: 369 QLLPIQTFIPP-KDGVFCFAMAGSTDGDYIFGNFAQSNILMGFDLDRKTISFKPTDCTN 426
>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
gi|223948009|gb|ACN28088.1| unknown [Zea mays]
gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
Length = 507
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 98/370 (26%), Positives = 157/370 (42%), Gaps = 60/370 (16%)
Query: 7 GTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC------ 46
G+P+ + +I+DTGS L + +FDP S+++ + C+ C
Sbjct: 155 GSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRCNASACADSLRA 214
Query: 47 ---TYFKCVN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSN 98
T C + E+C Y + Y D S ++G A +T+++ G A G +FGC
Sbjct: 215 ATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVAL-----GGASLGGFVFGCGL 269
Query: 99 DNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
N G G AG++GL R +S +SQ S FSYCL S L G
Sbjct: 270 SNRGLF-----GGTAGLMGLGRTELSLVSQTASRYGGVFSYCLPAATSGDASGSLSLGGG 324
Query: 159 TDMG--YRRPSTQA-TKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
D YR + A T+ I P FY+L++ ++ + G +
Sbjct: 325 DDAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALA-------AQGLGASNVL 377
Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPETFN-RFPSM 270
IDSG+V+T VY + +F+ +F A P + CY L + P +
Sbjct: 378 IDSGTVITRLAPSVYRAVRAEFM---RQFGAAGYPAAPGFSILDTCYDLTGHDEVKVPLL 434
Query: 271 AFYFE-DANLRIDGENV-FIIDYENHFFLLAVA--PHDDLVALIGSQQQRDTRFVYDLNI 326
E A++ +D + F++ + LA+A ++D +IG+ QQ++ R VYD
Sbjct: 435 TLRLEGGADVTVDAAGMLFVVRKDGSQVCLAMASLSYEDETPIIGNYQQKNKRVVYDTLG 494
Query: 327 DLLSFVKENC 336
L F E+C
Sbjct: 495 SRLGFADEDC 504
>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
Length = 351
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 92/358 (25%), Positives = 148/358 (41%), Gaps = 46/358 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
++ + GTP++ ++ DTGS + + +FDP SS+++ ++C P
Sbjct: 17 VITVGFGTPTRTQTVVFDTGSDVNWLQCKPCAVRCYAQQEPLFDPSLSSTYRNVSCTEPA 76
Query: 46 CTYFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C C + C+Y + Y D S T GF A +T + + F +FGC +N G
Sbjct: 77 CVGLSTRGCSSSTCLYGVFYGDGSSTIGFLAMDTFMLTPAQK----FKNFIFGCGQNNTG 132
Query: 103 FDEDARDGALAGVLGLSR-VTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
+ AG++GL R T S SQ+ + FSYC LP+ + YL G
Sbjct: 133 LFQGT-----AGLVGLGRSSTYSLNSQVAPSLGNVFSYC----LPSTSSATGYLNIGNPQ 183
Query: 162 GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
P A Y++ L IS+ R++ F G IIDSG+V+T
Sbjct: 184 --NTPGYTAMLTDTRVPTLYFIDLIGISVGGTRLSLSSTVFQ-----SVGTIIDSGTVIT 236
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFEDANLR 280
Y L + ++ LA + CY F T +P + +F ++R
Sbjct: 237 RLPPTAYSALKTAVRAAMTQYTLAPAVTI---LDTCYDFSRTTSVVYPVIVLHFAGLDVR 293
Query: 281 IDGENVFIIDYENHFFLLAVAPHDD--LVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
I VF + + + LA A + D ++ +IG+ QQ YD + + F C
Sbjct: 294 IPATGVFFV-FNSSQVCLAFAGNTDSTMIGIIGNVQQLTMEVTYDNELKRIGFSAGAC 350
>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
Length = 428
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 90/361 (24%), Positives = 152/361 (42%), Gaps = 48/361 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA------------IFDPRKSSSFQKINCDHPDCTY 48
+VR IGTP++ +L+ LDT + + +FDP KSSS + + CD P C
Sbjct: 92 IVRANIGTPAQPMLVALDTSNDAAWVPCSGCVGCASSVLFDPSKSSSSRNLQCDAPQCKQ 151
Query: 49 FK----CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
+ C + M Y ++ +T+++ + FGC + G
Sbjct: 152 APNPTCTAGKSCGFNMTYGGSTIEASLT-QDTLTL-----ANDVIKSYTFGCISKATGTS 205
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT--SSYLKFGTDMG 162
A+ G++GL R +S ISQ ++ FSYCL PN + + S L+ G
Sbjct: 206 LPAQ-----GLMGLGRGPLSLISQTQNLYMSTFSYCL----PNSKSSNFSGSLRLGPK-- 254
Query: 163 YRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
Y+ + T + +P ++ YY++L I + N+ ++ P S G I DSG+V
Sbjct: 255 YQPVRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDASTGAGTIFDSGTVF 314
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLR 280
T Y + +F + L CY +PS+ F F N+
Sbjct: 315 TRLVEPAYVAVRNEFRRRIKNANATSLGG----FDTCY---SGSVVYPSVTFMFAGMNVT 367
Query: 281 IDGENVFI--IDYENHFFLLAVAPH--DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ +N+ I +A AP+ + ++ +I S QQ++ R + DL L +E C
Sbjct: 368 LPPDNLLIHSSSGSTSCLAMAAAPNNVNSVLNVIASMQQQNHRVLIDLPNSRLGISRETC 427
Query: 337 S 337
+
Sbjct: 428 T 428
>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
Length = 459
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 94/363 (25%), Positives = 162/363 (44%), Gaps = 51/363 (14%)
Query: 6 IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC----- 46
+GTP + +ILD GS L++ +FD +SSSF + CD C
Sbjct: 113 VGTPPQPSKVILDLGSDLLWTQCSLVGPTAKQLEPVFDAARSSSFSVLPCDSKLCEAGTF 172
Query: 47 TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
T C + +C Y Y + T G A ET + G + FGC +G +
Sbjct: 173 TNKTCTDRKCAYENDYGIMTAT-GVLATETFTF---GAHHGVSANLTFGCGKLANGTIAE 228
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT--DMGYR 164
A +G+LGLS +S + QL +FSYCL P + +S + FG D+G
Sbjct: 229 A-----SGILGLSPGPLSMLKQLAIT---KFSYCLT---PFADRKTSPVMFGAMADLGKY 277
Query: 165 RPS--TQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
+ + Q + +P + +YY+ + +S+ ++R++ P +T I G GG ++DS + L
Sbjct: 278 KTTGKVQTIPLLKNPVEDIYYYVPMVGMSVGSKRLDVPQETLAIKPDGTGGTVLDSATTL 337
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN----RFPSMAFYFE- 275
Y + +L + + E +L + + +C+ LP + + P + +F+
Sbjct: 338 AYLVEPAFTELKK---AVMEGIKLPVANRSVDDYPVCFELPRGMSMEGVQVPPLVLHFDG 394
Query: 276 DANLRIDGENVFIIDYENHFFLLAV--APHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
DA + + +N F + LAV AP + +IG+ QQ++ +YD+ S+
Sbjct: 395 DAEMSLPRDNYF-QEPSPGMMCLAVMQAPFEGAPNVIGNVQQQNMHVLYDVGNRKFSYAP 453
Query: 334 ENC 336
C
Sbjct: 454 TKC 456
>gi|125542690|gb|EAY88829.1| hypothetical protein OsI_10302 [Oryza sativa Indica Group]
Length = 440
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 99/375 (26%), Positives = 164/375 (43%), Gaps = 56/375 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA------------IFDPRKSSSFQKINCDHPDCTY 48
+VR +G+PS+ +LL LDT + +A +F P SSS+ + C C
Sbjct: 82 VVRAGLGSPSQQLLLALDTSADATWAHCSPCGTCPSSSLFAPANSSSYASLPCSSSWCPL 141
Query: 49 FK---CVNEQ--------------CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHG 91
F+ C Q C ++ +AD S A+ +T+ + GK
Sbjct: 142 FQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASFQAALAS-DTLRL-----GKDAIPN 195
Query: 92 ALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT 151
FGC + G + G+LGL R ++ +SQ GS+ FSYCL P Y
Sbjct: 196 YTFGCVSSVTGPTTNMPR---QGLLGLGRGPMALLSQAGSLYNGVFSYCL--PSYRSYYF 250
Query: 152 SSYLKFGTDMGYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
S L+ G G R S + T + +P ++ YY+++ +S+ + P +F +
Sbjct: 251 SGSLRLGAGGGQPR-SVRYTPMLRNPHRSSLYYVNVTGLSVGRAWVKVPAGSFAFDAATG 309
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCYFLPE-TFNR 266
G ++DSG+V+T + + VY L E+F R Q+A S C+ E
Sbjct: 310 AGTVVDSGTVITRWTAPVYAALREEF-----RRQVAAPSGYTSLGAFDTCFNTDEVAAGG 364
Query: 267 FPSMAFYFEDA-NLRIDGENVFIIDYENHFFLLAV--APH--DDLVALIGSQQQRDTRFV 321
P++ + + +L + EN I LA+ AP + +V +I + QQ++ R V
Sbjct: 365 APAVTVHMDGGVDLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNIRVV 424
Query: 322 YDLNIDLLSFVKENC 336
+D+ + F KE+C
Sbjct: 425 FDVANSRIGFAKESC 439
>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
Length = 457
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 101/373 (27%), Positives = 165/373 (44%), Gaps = 63/373 (16%)
Query: 6 IGTPSKGVLLILDTGSALIY------------------AIFDPRKSSSFQKINCDHPDCT 47
+GTP +L I DTGS L++ +F P +SS++ +++C C
Sbjct: 109 VGTPPTQLLAIADTGSDLVWVNCSSSGGGLADADAGGNVVFQPTRSSTYSQLSCQSNACQ 168
Query: 48 YF---KC-VNEQCVYTMKYADQSVTKGFAAHETISVI-GKGEGKAIFHGALFGCSNDNHG 102
C + +C Y Y D S T G + ET S + G G+G+ FGCS + G
Sbjct: 169 ALSQASCDADSECQYQYSYGDGSRTIGVLSTETFSFVDGGGKGQVRVPRVNFGCSTASAG 228
Query: 103 -FDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYTSSYLKFGT 159
F D G++GL S +SQLG+ I ++ SYCL IP + +SS L FG+
Sbjct: 229 TFRSD-------GLVGLGAGAFSLVSQLGATTHIDRKLSYCL-IPSYDAN-SSSTLNFGS 279
Query: 160 DMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
P +T + + +++Y ++L+ +++ + + T D + I+DSG+
Sbjct: 280 RAVVSEPGAASTPLVPSDVDSYYTVALESVAVGGQEVA----THDSRI------IVDSGT 329
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP-IQLCYFLPETFNRFPSMAFYFEDA 277
LT+ + L V+ ER Q PE +QLCY + + + F D
Sbjct: 330 TLTFLDPALLGPL----VTELERRIKLQRVQPPEQLLQLCY---DVQGKSETDNFGIPDV 382
Query: 278 NLRIDGENVFIIDYENHFFL-------LAVAP--HDDLVALIGSQQQRDTRFVYDLNIDL 328
LR G + EN F L L + P V+++G+ Q++ YDL+
Sbjct: 383 TLRFGGGAAVTLRPENTFSLLQEGTLCLVLVPVSESQPVSILGNIAQQNFHVGYDLDART 442
Query: 329 LSFVKENCSDDSA 341
++F +C+ SA
Sbjct: 443 VTFAAADCARSSA 455
>gi|115451209|ref|NP_001049205.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|49532749|dbj|BAD26705.1| Radc1 [Oryza sativa Japonica Group]
gi|108706569|gb|ABF94364.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113547676|dbj|BAF11119.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|215692805|dbj|BAG88249.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767626|dbj|BAG99854.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 438
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 99/375 (26%), Positives = 165/375 (44%), Gaps = 56/375 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA------------IFDPRKSSSFQKINCDHPDCTY 48
+VR +G+PS+ +LL LDT + +A +F P SSS+ + C C
Sbjct: 80 VVRAGLGSPSQQLLLALDTSADATWAHCSPCGTCPSSSLFAPANSSSYASLPCSSSWCPL 139
Query: 49 FK---CVNEQ--------------CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHG 91
F+ C Q C ++ +AD S A+ +T+ + GK
Sbjct: 140 FQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASFQAALAS-DTLRL-----GKDAIPN 193
Query: 92 ALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT 151
FGC + G + G+LGL R ++ +SQ GS+ FSYCL P Y
Sbjct: 194 YTFGCVSSVTGPTTNMPR---QGLLGLGRGPMALLSQAGSLYNGVFSYCL--PSYRSYYF 248
Query: 152 SSYLKFGTDMGYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
S L+ G G R S + T + +P ++ YY+++ +S+ + + P +F +
Sbjct: 249 SGSLRLGAGGGQPR-SVRYTPMLRNPHRSSLYYVNVTGLSVGHAWVKVPAGSFAFDAATG 307
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCYFLPE-TFNR 266
G ++DSG+V+T + + VY L E+F R Q+A S C+ E
Sbjct: 308 AGTVVDSGTVITRWTAPVYAALREEF-----RRQVAAPSGYTSLGAFDTCFNTDEVAAGG 362
Query: 267 FPSMAFYFEDA-NLRIDGENVFIIDYENHFFLLAV--APH--DDLVALIGSQQQRDTRFV 321
P++ + + +L + EN I LA+ AP + +V +I + QQ++ R V
Sbjct: 363 APAVTVHMDGGVDLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNIRVV 422
Query: 322 YDLNIDLLSFVKENC 336
+D+ + F KE+C
Sbjct: 423 FDVANSRVGFAKESC 437
>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 414
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 97/360 (26%), Positives = 152/360 (42%), Gaps = 56/360 (15%)
Query: 11 KGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYFKCVN--- 53
+ + +I+DTGS L + +F+P S S+Q I C+ C +
Sbjct: 76 RNMTVIVDTGSDLTWVQCQPCRLCYNQQDPLFNPSGSPSYQTILCNSSTCQSLQYATGNL 135
Query: 54 -------EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
C Y + Y D S T+G E +++ G +FGC +N G
Sbjct: 136 GVCGSNTPTCNYVVNYGDGSYTRGDLGMEQLNL-----GTTHVSNFIFGCGRNNKGLF-- 188
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
G +G++GL + +S +SQ +I + FSYCL P + + S + G Y+
Sbjct: 189 ---GGASGLMGLGKSDLSLVSQTSAIFEGVFSYCL--PTTAADASGSLILGGNSSVYKNT 243
Query: 167 STQA-TKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
+ + T+ I +P FY+L+L ISI + P + G +IDSG+V+T
Sbjct: 244 TPISYTRMIANPQLPTFYFLNLTGISIGGVALQAPNYR-------QSGILIDSGTVITRL 296
Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR--FPSMAFYFE-DANLR 280
VY L +F+ F F A P I F ++ P++ FE +A L
Sbjct: 297 PPPVYRDLKAEFLKQFSGFPSAP----PFSILDTCFNLNGYDEVDIPTIRMQFEGNAELT 352
Query: 281 IDGENVFI---IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+D +F D LA DD + +IG+ QQR+ R +Y+ L F E CS
Sbjct: 353 VDVTGIFYFVKTDASQVCLALASLSFDDEIPIIGNYQQRNQRVIYNTKESKLGFAAEACS 412
>gi|326518194|dbj|BAK07349.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 435
Score = 107 bits (267), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 97/336 (28%), Positives = 147/336 (43%), Gaps = 34/336 (10%)
Query: 26 AIFDPRKSSSFQKINCDHPDCT--YFKCVNEQC-VYTMKYADQSVTKGFAAHETISVIGK 82
A+F S ++ P CT Y V +C YT + G+ + G
Sbjct: 109 AVFKSAVSPRYKDTKATDPKCTPPYTPSVGNRCSFYTTSW--NVAAHGYLGSDMFGFAGS 166
Query: 83 -GEGKAIFHGA-----LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIK 134
G G HG FGC++ GF E G LAG L LSR SF+SQL + +
Sbjct: 167 PGTGG---HGTDVDKLTFGCAHTTDGF-ERLNHGVLAGALSLSRHPTSFLSQLTARRLAD 222
Query: 135 KRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFI---NHPNNFYYLSLKDISID 191
RFSYCL + +L+FG D+ R +T + + YY+ + IS++
Sbjct: 223 SRFSYCLFPGQSHPNARHGFLRFGRDI-PRHDHAHSTSLLFTGRGSGSMYYIGVTSISLN 281
Query: 192 NERM-NFPPDTFDITV-SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSD 249
+R+ P F + GG ++D G+ LT + Y + + V+Y Q
Sbjct: 282 GKRIIGLQPAFFRRNPQTRRGGSVVDPGTPLTRLVREAYNIVEAELVAYM---QTQGSRR 338
Query: 250 CPEPIQ---LCYFLPETFNRFPSMAFYFED--ANLRIDGENVFIIDYENHFFLLAVAPHD 304
P P+Q LC F+ PSM + A L I E +F+ H L V D
Sbjct: 339 APAPVQGHRLC-FVSWGHAHLPSMTINMNEDRAKLFIKPELLFLKVTHEHLCFLVVP--D 395
Query: 305 DLVALIGSQQQRDTRFVYDLNIDLLSFVKENCSDDS 340
+ + ++G+ QQ DTRF +DL+ + L F +E+C+ D+
Sbjct: 396 EEMTVLGAAQQVDTRFTFDLHANRLYFAQEHCTADT 431
>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
Length = 474
Score = 107 bits (267), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 101/363 (27%), Positives = 156/363 (42%), Gaps = 50/363 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
+V + +GTP + L+ DTGS + + FDP KS+S+ ++C
Sbjct: 136 VVTVGLGTPKEDFTLVFDTGSGITWTQCQPCLGSCYPQKEQKFDPTKSTSYNNVSCSSAS 195
Query: 46 CTYF-------KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSN 98
C N C+Y + Y DQS ++GF A ET+++ +F LFGC
Sbjct: 196 CNLLPTSERGCSASNSTCLYQIIYGDQSYSQGFFATETLTI----SSSDVFTNFLFGCGQ 251
Query: 99 DNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
N+G G AG+LGLS ++S SQ +K+FSYC LP+ ++ YL FG
Sbjct: 252 SNNGLF-----GQAAGLLGLSSSSVSLPSQTAEKYQKQFSYC----LPSTPSSTGYLNFG 302
Query: 159 TDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
G + T ++FY + + IS+ ++ P F + G IIDSG+
Sbjct: 303 ---GKVSQTAGFTPISPAFSSFYGIDIVGISVAGSQLPIDPSIFTTS-----GAIIDSGT 354
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFEDA 277
V+T Y L E F E+ ++ E + CY F T FP ++ F+
Sbjct: 355 VITRLPPTAYKALKEAFD---EKMSNYPKTNGDELLDTCYDFSNYTTVSFPKVSVSFKGG 411
Query: 278 -NLRIDGENVFIIDYENHFFLLAVAPH--DDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
+ ID + + LA A + D + G+ QQ+ VYD ++ F
Sbjct: 412 VEVDIDASGILYLVNGVKMVCLAFAANKDDSEFGIFGNHQQKTYEVVYDGAKGMIGFAAG 471
Query: 335 NCS 337
CS
Sbjct: 472 ACS 474
>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 107 bits (267), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 99/382 (25%), Positives = 159/382 (41%), Gaps = 62/382 (16%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINC----- 41
V L +GTP + +LL+ DTGS L++ + F R S++F +C
Sbjct: 91 VDLRLGTPPQKLLLVADTGSDLVWVKCSACRNCTRHTPGSAFLARHSTTFSPNHCYDSAC 150
Query: 42 ------DHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFG 95
H C + + ++ C Y Y D S T GF + ET ++ +A G FG
Sbjct: 151 QLVPLPKHHRCNHAR-LHSPCRYEYSYGDGSKTSGFFSKETTTLNTSSGREAKLKGIAFG 209
Query: 96 CSNDNHGFD-EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL----VIPLPNGEY 150
C+ G A GV+GL R IS SQLG +FSYCL + P P
Sbjct: 210 CAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHRFGNKFSYCLMDHDISPSP---- 265
Query: 151 TSSYLKFGTDMGYRRPSTQATKFIN-HPN----NFYYLSLKDISIDNERMNFPPDTFDIT 205
+SYL G+ P + +F H N FYY+ ++ +S+D ++ P + +
Sbjct: 266 -TSYLLIGSTQNDVAPGKRRMRFTPLHINPLSPTFYYIGIESVSVDGIKLPINPSVWALD 324
Query: 206 VSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN 265
G GG I+DSG+ LT+ Y ++ R +L ++ LC + E +
Sbjct: 325 ELGNGGTIVDSGTTLTFLPEPAYLQI---LTVIKRRVRLPSPAEPTPGFDLCVNVSEIEH 381
Query: 266 -RFPSMAFYFEDANLRIDGENVFIIDYENHF---------FLLAVAPHDDLVALIGSQQQ 315
R P ++F ++ G++VF N+F L ++IG+ Q
Sbjct: 382 PRLPKLSF-------KLGGDSVFSPPPRNYFVDTDEDVKCLALQAVMTPSGFSVIGNLMQ 434
Query: 316 RDTRFVYDLNIDLLSFVKENCS 337
+ +D + L F + C+
Sbjct: 435 QGFLLEFDKDRTRLGFSRHGCA 456
>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
Length = 498
Score = 107 bits (267), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 101/356 (28%), Positives = 155/356 (43%), Gaps = 37/356 (10%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
R+ +GTP++ ++LDTGS + + IF+P S+SF + CD C+
Sbjct: 159 TRIGVGTPTREQYMVLDTGSDVAWIQCEPCRECYSQADPIFNPSYSASFSTVGCDSAVCS 218
Query: 48 Y---FKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
+ C + C+Y Y D S + G A ET++ G GC + N G
Sbjct: 219 QLDAYDCHSGGCLYEASYGDGSYSTGSFATETLTF-----GTTSVANVAIGCGHKNVGLF 273
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
A +SF +Q+G+ FSYCLV +S L+FG
Sbjct: 274 IGAAGLLGL-----GAGALSFPNQIGTQTGHTFSYCLV---DRESDSSGPLQFGPKSVPV 325
Query: 165 RPSTQATKFINHPNNFYYLSLKDISIDNERMN-FPPDTFDI-TVSGEGGCIIDSGSVLTY 222
+ H FYYLS+ IS+ ++ PP+ F I SG GG IIDSG+V+T
Sbjct: 326 GSIFTPLEKNPHLPTFYYLSVTAISVGGALLDSIPPEVFRIDETSGHGGFIIDSGTVVTR 385
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPE-TFNRFPSMAFYFED-ANLR 280
+ Y + + FV+ QL + +D CY L F P++ F+F + A+L
Sbjct: 386 LVTSAYDAVRDAFVA--GTGQLPR-TDAVSIFDTCYDLSGLQFVSVPTVGFHFSNGASLI 442
Query: 281 IDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ +N I F A AP V+++G+ QQ+ R +D L+ F + C
Sbjct: 443 LPAKNYLIPMDTVGTFCFAFAPAASSVSIMGNTQQQHIRVSFDSANSLVGFAFDQC 498
>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
Length = 461
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 103/383 (26%), Positives = 151/383 (39%), Gaps = 68/383 (17%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+V L +GTP + V L LDTGS L++ + DP SS++ + C P C
Sbjct: 93 LVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFHQGLPLLDPAASSTYAALPCGAPRC 152
Query: 47 -------------TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGK---GEGKAIFH 90
+ + N C Y Y D+SVT G A + + G G+ +
Sbjct: 153 RALPFTSCGGGGRSSWGNGNRSCAYIYHYGDKSVTVGEIATDRFTFGGDNGDGDSRLPTR 212
Query: 91 GALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY 150
FGC + N G + G+ G R S SQL FSYC E
Sbjct: 213 RLTFGCGHFNKGVFQSNE----TGIAGFGRGRWSLPSQLNVTT---FSYCFTSMF---ES 262
Query: 151 TSSYLKFG---------TDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPP 199
SS + G + + + T + +P+ + Y+LSLK IS+ R+ P
Sbjct: 263 KSSLVTLGGAPAAALLYSHAAHISGEVRTTPLLKNPSQPSLYFLSLKGISVGKTRLAVPE 322
Query: 200 DTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF 259
T IIDSG+ +T VY + +F + + + LC+
Sbjct: 323 AKLRST-------IIDSGASITTLPEAVYEAVKAEFAAQVGLPPTGVVEG--SALDLCFA 373
Query: 260 LPET--FNR--FPSMAFYFEDANLRIDGENVFIIDYENHFF--LLAVAPHDDLVALIGSQ 313
LP T + R PS+ + + A+ + N D +L AP D V IG+
Sbjct: 374 LPVTALWRRPPVPSLTLHLDGADWELPRGNYVFEDLAARVMCVVLDAAPGDQTV--IGNF 431
Query: 314 QQRDTRFVYDLNIDLLSFVKENC 336
QQ++T VYDL D LSF C
Sbjct: 432 QQQNTHVVYDLENDWLSFAPARC 454
>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 107/364 (29%), Positives = 162/364 (44%), Gaps = 53/364 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY-----------------AIFDPRKSSSFQKINCDH 43
+ ++ +G P K L+ DTGS + + IFDP+ SSS+ ++C+
Sbjct: 149 LAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCNS 208
Query: 44 PDCTYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
C C ++ C+Y + Y D S T G A ET+S G +I + + GC +DN
Sbjct: 209 QQCKLLDKANCNSDTCIYQVHYGDGSFTTGELATETLSF---GNSNSIPNLPI-GCGHDN 264
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY-TSSYLKFGT 159
G IS SQL + FSYCLV N + +SS L+F +
Sbjct: 265 EGLFAGGAGLIGL-----GGGAISLSSQLKA---SSFSYCLV----NLDSDSSSTLEFNS 312
Query: 160 DMGYRRPSTQATKFINHPNNFY---YLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
+M PS T + + F+ Y+ + IS+ + + P F+I SG GG I+DS
Sbjct: 313 NM----PSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDS 368
Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCY-FLPETFNRFPSMAFY 273
G++++ SDVY L E FV + LS P CY F ++ P++AF
Sbjct: 369 GTIISRLPSDVYESLREAFVKL-----TSSLSPAPGISVFDTCYNFSGQSNVEVPTIAFV 423
Query: 274 F-EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
E +LR+ N I+ + LA +++IGS QQ+ R YDL L+ F
Sbjct: 424 LSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLVGFS 483
Query: 333 KENC 336
C
Sbjct: 484 TNKC 487
>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 439
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 100/360 (27%), Positives = 162/360 (45%), Gaps = 46/360 (12%)
Query: 6 IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTY--- 48
IGTP + ++DTGS I+ IF+P KSS+++ I C P C
Sbjct: 96 IGTPPFQLYGVVDTGSDGIWFQCKPCKPCLNQTSPIFNPSKSSTYKNIRCSSPICKRGEK 155
Query: 49 FKCVN---EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
+C + +C Y + Y D+S ++G + +T+++ F + GC + N E
Sbjct: 156 TRCSSNRKRKCEYEITYLDRSGSQGDISKDTLTLNSNDGSPISFPKIVIGCGHKNSLTTE 215
Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG--- 162
G +G++G R S +SQLGS I +FSYCL L + SS L FG DM
Sbjct: 216 ----GLASGIIGFGRGNFSIVSQLGSSIGGKFSYCLA-SLFSKANISSKLYFG-DMAVVS 269
Query: 163 ----YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
P Q+ N+ N S+ D I + + PD EG +IDSGS
Sbjct: 270 GHGVVSTPLIQSFYVGNYFTNLEAFSVGDHIIKLKDSSLIPDN-------EGNAVIDSGS 322
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDAN 278
+T +DVY +L +S +L ++ D + + LCY P + +F A+
Sbjct: 323 TITQLPNDVYSQLETAVISM---VKLKRVKDPTQQLSLCYKTTLKKYEVPIITAHFRGAD 379
Query: 279 LRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
++++ N FI +++E F + +V G+ Q++ YD +++SF NC+
Sbjct: 380 VKLNAFNTFIQMNHEVMCFAFNSSAFPWVV--YGNIAQQNFLVGYDTLKNIISFKPTNCT 437
>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 506
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 112/380 (29%), Positives = 155/380 (40%), Gaps = 66/380 (17%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
+V +++GTP + +I+DTGS L + IFDP S S++ + C C
Sbjct: 150 LVDVYLGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQSGPIFDPAASISYRNVTCGDDRC 209
Query: 47 TYFKCVNEQ------------CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALF 94
E C Y Y DQS T G A E +V G G F
Sbjct: 210 RLVSPPAESAPRECRRPRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTQSGTRRVDGVAF 269
Query: 95 GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKR-FSYCLVIPLPNGEYTSS 153
GC + N G AG+LGL R +SF SQL + FSYCLV +G S
Sbjct: 270 GCGHRNRGLFH-----GAAGLLGLGRGPLSFASQLRGVYGGHAFSYCLV---EHGSAAGS 321
Query: 154 YLKFGTDMG-YRRPSTQATKF--INHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
+ FG D P T F + FYYL LK I + E +N DT G
Sbjct: 322 KIIFGHDDALLAHPQLNYTAFAPTTDADTFYYLQLKSILVGGEAVNISSDTLS-----AG 376
Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFV-----SYFERFQLAQLSDC-----PEPIQLCYFL 260
G IIDSG+ L+YF Y + + F+ SY LS C E +++
Sbjct: 377 GTIIDSGTTLSYFPEPAYQAIRQAFIDRMSPSYPLILGFPVLSPCYNVSGAEKVEV---- 432
Query: 261 PETFNRFPSMAFYFED-ANLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDT 318
P ++ F D A EN FI ++ E L + +++IG+ QQ++
Sbjct: 433 -------PELSLVFADGAAWEFPAENYFIRLEPEGIMCLAVLGTPRSGMSIIGNYQQQNF 485
Query: 319 RFVYDLNIDLLSFVKENCSD 338
+YDL + L F C+D
Sbjct: 486 HVLYDLEHNRLGFAPRRCAD 505
>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
Length = 449
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 106/382 (27%), Positives = 161/382 (42%), Gaps = 72/382 (18%)
Query: 6 IGTPSKGVLLILDTGSALIYA---------------------IFDPRKSSSFQKINCDHP 44
IGTP + LI+DTGS LI+ +++PR+SSSF + C
Sbjct: 90 IGTPPQPRTLIVDTGSDLIWTQCSMLSRRTRTAASASRQREPLYEPRRSSSFAYLPCSDR 149
Query: 45 DC-----TYFKCV-NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSN 98
C +Y C N +C+Y Y G A ET + G + FGC
Sbjct: 150 LCQEGQFSYKNCARNNRCMYDELYGSAEA-GGVLASETFTF---GVNAKVSLPLGFGCGA 205
Query: 99 DNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
+ G A +G++GLS +S +SQL RFSYCL P E +S L FG
Sbjct: 206 LSAGDLVGA-----SGLMGLSPGIMSLVSQLS---VPRFSYCLT---PFAERKTSPLLFG 254
Query: 159 TDMGYRRPST----QATKFINHP---NNFYYLSLKDISIDNERMNFPPDTFD-ITVSGEG 210
RR T Q T + +P +YY+ L +S+ +R++ P + I G G
Sbjct: 255 AMADLRRYRTTGTVQTTSILRNPAMETAYYYVPLVGLSLGTKRLDVPATSLGMIKPDGSG 314
Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP-EPIQLCYFLPETFN---- 265
G I+DSGS ++Y + + + V R +A +D + +LC+ LP
Sbjct: 315 GTIVDSGSTMSYLEETAFRAVKKAVVEAV-RLPVANGTDEDYDDYELCFALPTGVAMEAV 373
Query: 266 RFPSMAFYFEDANLRIDGENVFIIDYENHF-------FLLAVAPHDD--LVALIGSQQQR 316
+ P + +F DG + +N+F LAV D V++IG+ QQ+
Sbjct: 374 KTPPLVLHF-------DGGAAMTLPRDNYFQEPRAGLMCLAVGTSPDGFGVSIIGNVQQQ 426
Query: 317 DTRFVYDLNIDLLSFVKENCSD 338
+ ++D+ SF C D
Sbjct: 427 NMHVLFDVRNQKFSFAPTKCDD 448
>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 533
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 97/371 (26%), Positives = 151/371 (40%), Gaps = 65/371 (17%)
Query: 7 GTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHPDCTYF- 49
G +K + +I+DTGS L + +FDP S +F + C P C
Sbjct: 188 GGGAKNLTVIVDTGSDLTWVQCEPCPGSSCYAQRDPLFDPAASPTFAAVPCGSPACAASL 247
Query: 50 --------KCVN------EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAI-FHGALF 94
C ++C Y + Y D S ++G A +T+ G G G +F
Sbjct: 248 KDATGAPGSCARSAGNSEQRCYYALSYGDGSFSRGVLAQDTL-----GLGTTTKLDGFVF 302
Query: 95 GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY 154
GC N G G AG++GL R +S +SQ + FSYCL P ++
Sbjct: 303 GCGLSNRGLF-----GGTAGLMGLGRTDLSLVSQTAARFGGVFSYCL----PATTTSTGS 353
Query: 155 LKFGTDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
L G P+ T+ I P FY++++ ++ P G G
Sbjct: 354 LSLGPGPSSSFPNMAYTRMIADPTQPPFYFINITGAAVGGGAALTAPGF------GAGNV 407
Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP--ETFNRFPSM 270
++DSG+V+T VY + +F FE S + CY L + N P +
Sbjct: 408 LVDSGTVITRLAPSVYKAVRAEFARRFEYPAAPGFSI----LDACYDLTGRDEVN-VPLL 462
Query: 271 AFYFED-ANLRIDGENV-FIIDYENHFFLLAVA--PHDDLVALIGSQQQRDTRFVYDLNI 326
E A + +D + F++ + LA+A P++D +IG+ QQR+ R VYD
Sbjct: 463 TLTLEGGAQVTVDAAGMLFVVRKDGSQVCLAMASLPYEDQTPIIGNYQQRNKRVVYDTVG 522
Query: 327 DLLSFVKENCS 337
L F E+C+
Sbjct: 523 SRLGFADEDCT 533
>gi|326515330|dbj|BAK03578.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 445
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 89/361 (24%), Positives = 157/361 (43%), Gaps = 48/361 (13%)
Query: 15 LILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYFKCV---NEQCV 57
L+LDT S+L + +FDP SSS++ ++ P C V ++C
Sbjct: 91 LVLDTASSLPWMRCAHCLPVQRQRSPVFDPSDSSSYRPLHPTSPLCRAPNPVLPAGDKCS 150
Query: 58 YTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLG 117
+ + G+ +TI + G H FGC+ GFD G AG LG
Sbjct: 151 FHLP----GEAHGYVGTDTIIL---GNPTLPIHSVAFGCAQSTEGFDTK---GTFAGTLG 200
Query: 118 LSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG-------YRRPSTQA 170
+ ++ S I Q+ + RFSYCL I L + + +++FG D+ +R
Sbjct: 201 MGKLPTSLIMQIKDRVGSRFSYCL-IGLGHSPGRNGFIRFGADIPDPTLLVHHRIKILPT 259
Query: 171 TKFINH--PNNFYYLSLKDISIDNERM-NFPPDTFDITVSGEGGCIIDSGSVLTYFHSDV 227
+ H ++ YY+ L IS++ + F+ G GGC +D+G+ +T+
Sbjct: 260 PPHLPHGVADSAYYVKLLGISLNGTPIPGIRQAMFERRSDGSGGCFVDAGTQVTHLVPAA 319
Query: 228 YWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF-LPETFNRFPSMAFYFED------ANLR 280
Y + E +++ ++ D LC+ P ++ P + FE A+L
Sbjct: 320 YAVVEEAVAHMVQQWGYKRVRD--PNFSLCFREHPGIWSHIPKLTLDFEGPASRTVAHLE 377
Query: 281 IDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCSDD 339
I N+F+ +D + ++G+ QQ DTRF++DL+ + ++F +E+C D
Sbjct: 378 IVSRNLFLKVDNQPLVCFGVYRTSRGSPTVVGAMQQVDTRFIFDLHANTITFHRESCEAD 437
Query: 340 S 340
+
Sbjct: 438 T 438
>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
Length = 412
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 89/362 (24%), Positives = 152/362 (41%), Gaps = 67/362 (18%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
+V + IGTP + +LDTGS LI+ ++ P +S+++ ++C P
Sbjct: 93 LVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRSPM 152
Query: 46 C-----TYFKCV--NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSN 98
C + +C + C Y Y D + T G A ET ++ G A+ G FGC
Sbjct: 153 CQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTL---GSDTAV-RGVAFGCGT 208
Query: 99 DNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
+N G +++ +G++G+ R +S +SQLG +R S
Sbjct: 209 ENLGSTDNS-----SGLVGMGRGPLSLVSQLGVTRPRR----------------SCRARA 247
Query: 159 TDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
G P+T + L+ I++ + + P F +T G+GG IIDSG+
Sbjct: 248 AARGGGAPTTTS-------------PLEGITVGDTLLPIDPAVFRLTPMGDGGVIIDSGT 294
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL--PETFNRFPSMAFYFED 276
T + L S R +L S + LC+ PE P + +F+
Sbjct: 295 TFTALEERAFVALARALAS---RVRLPLASGAHLGLSLCFAAASPEAVE-VPRLVLHFDG 350
Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
A++ + E+ + D L + + +++GS QQ++T +YDL +LSF C
Sbjct: 351 ADMELRRESYVVEDRSAGVACLGMVSARGM-SVLGSMQQQNTHILYDLERGILSFEPAKC 409
Query: 337 SD 338
+
Sbjct: 410 GE 411
>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 475
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 95/362 (26%), Positives = 156/362 (43%), Gaps = 51/362 (14%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
VR+ +G+P + +++D+GS +I+ +F+P SSSF ++C C+
Sbjct: 138 VRIGVGSPPRNQYVVMDSGSDIIWVQCEPCTQCYHQSDPVFNPADSSSFSGVSCASTVCS 197
Query: 48 YF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
+ C +C Y + Y D S TKG A ETI+ G+ + GC + N G
Sbjct: 198 HVDNAACHEGRCRYEVSYGDGSYTKGTLALETITF-----GRTLIRNVAIGCGHHNQGMF 252
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
A +SF+ QLG FSYCLV G +S L+FG +
Sbjct: 253 VGAAGLLGL-----GGGPMSFVGQLGGQTGGAFSYCLV---SRGIESSGLLEFGREA--- 301
Query: 165 RPSTQA-TKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
P A I++P +FYY+ L + + R++ D F ++ G+GG ++D+G+ +T
Sbjct: 302 MPVGAAWVPLIHNPRAQSFYYIGLSGLGVGGLRVSISEDVFKLSELGDGGVVMDTGTAVT 361
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL-----CYFLPETFN-RFPSMAFYFE 275
+ Y + F+ AQ ++ P + CY L + R P+++FYF
Sbjct: 362 RLPTVAYEAFRDGFI--------AQTTNLPRASGVSIFDTCYDLFGFVSVRVPTVSFYFS 413
Query: 276 DAN-LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
L + N I + F A AP +++IG+ QQ + D + F
Sbjct: 414 GGPILTLPARNFLIPVDDVGTFCFAFAPSSSGLSIIGNIQQEGIQISVDGANGFVGFGPN 473
Query: 335 NC 336
C
Sbjct: 474 VC 475
>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 99/393 (25%), Positives = 150/393 (38%), Gaps = 69/393 (17%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA-------------------------IFDPRKSSSF 36
VR +GTP++ LL+ DTGS L + F P KS ++
Sbjct: 97 VRFRVGTPAQPFLLVADTGSDLTWVKCRPAKAAAASTNSSSSASASSPRRAFRPEKSKTW 156
Query: 37 QKINCDHPDCTYF--------KCVNEQCVYTMKYADQSVTKGFAAHETISVI-------- 80
I C C+ C Y +Y D S +G E+ ++
Sbjct: 157 APIPCASDTCSKSLPFSLSTCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSSSSSSS 216
Query: 81 GKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYC 140
KA G + GC+ G +A D GVL L +SF S S RFSYC
Sbjct: 217 KNKVKKAKLQGLVLGCTGSYTGPSFEASD----GVLSLGYSNVSFASHAASRFGGRFSYC 272
Query: 141 LVIPLPNGEYTSSYLKFGTDMGYRRPSTQA-------TKFI--NHPNNFYYLSLKDISID 191
LV L + +SYL FG + P A T + + FY +S+K IS+D
Sbjct: 273 LVDHL-SPRNATSYLTFGPNSALSGPCPAAAGPGARQTPLVLDSRMRPFYDVSIKAISVD 331
Query: 192 NERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP 251
E + P D ++ V G GG I+DSG+ LT Y + RF +
Sbjct: 332 GELLKIPRDVWE--VDGGGGVIVDSGTSLTVLAKPAYRAVVAALGKKLARFPRVAM---- 385
Query: 252 EPIQLCYFLPETFNR-----FPSMAFYFEDANLRIDGENVFIIDYENHFFLLAV--APHD 304
+P + CY + P +A +F + ++ID + V P
Sbjct: 386 DPFEYCYNWTSPSRKDEGDDLPKLAVHFAGSARLEPPSKSYVIDAAPGVKCIGVQEGPWP 445
Query: 305 DLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+++IG+ Q++ + +DL L F + C+
Sbjct: 446 G-ISVIGNILQQEHLWEFDLKNRRLRFKRSRCT 477
>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
Length = 423
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 92/358 (25%), Positives = 156/358 (43%), Gaps = 41/358 (11%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
V L +GTP + V ++ DTGS +++ +F+P SS+FQ I C C
Sbjct: 83 VSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCYGQTDPLFNPSFSSTFQSITCGSSLCQ 142
Query: 48 YF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
C QC+Y + Y D S T G + ET+S G + GC ++N G
Sbjct: 143 QLLIRGCRRNQCLYQVSYGDGSFTVGEFSTETLSF-----GSNAVNSVAIGCGHNNQGLF 197
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY-LKFGTDMGY 163
A + +SF SQ+G + FSYC LP E T S L FG
Sbjct: 198 TGAAGLLGL-----GKGLLSFPSQVGQLYGSVFSYC----LPTRESTGSVPLIFGNQA-- 246
Query: 164 RRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDI-TVSGEGGCIIDSGSVL 220
+ Q T + +P + FYY+ + I + ++ P + + + +G GG I+DSG+ +
Sbjct: 247 VASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVSIPAGSLSLDSSTGNGGVILDSGTAV 306
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP-ETFNRFPSMAFYFE-DAN 278
T + Y + + F + A+++ CY L + P+++F F A
Sbjct: 307 TRLVTSAYNPMRDAFRAGMP--SDAKMTSGFSLFDTCYDLSGRSSIMLPAVSFVFNGGAT 364
Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ + +N+ + + + LA AP+ + ++IG+ QQ+ R +D + + C
Sbjct: 365 MALPAQNIMVPVDNSGTYCLAFAPNSENFSIIGNIQQQSFRMSFDSTGNRVGIGANQC 422
>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 427
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 95/374 (25%), Positives = 161/374 (43%), Gaps = 57/374 (15%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY----------AIFDPRKSSSFQKINCDHPDCTY--- 48
+ L IG+P + V ++LDTGS L + + F+P SSS+ C+ C
Sbjct: 61 ISLTIGSPPQNVTMVLDTGSELSWLHCKKLPNLNSTFNPLLSSSYTPTPCNSSVCMTRTR 120
Query: 49 -----FKC--VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
C N+ C + YAD S +G A ET S+ G + G LFGC D+
Sbjct: 121 DLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSLAGAAQ-----PGTLFGCM-DSA 174
Query: 102 GFDEDA-RDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
G+ D D G++G++R ++S ++Q+ + +FSYC+ +GE L G
Sbjct: 175 GYTSDINEDAKTTGLMGMNRGSLSLVTQM---VLPKFSYCI-----SGEDAFGVLLLGD- 225
Query: 161 MGYRRPST-QATKFINHPNN-------FYYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
G PS Q T + + Y + L+ I + + + P F +G G
Sbjct: 226 -GPSAPSPLQYTPLVTATTSSPYFDRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTGAGQT 284
Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-----PIQLCYFLPETFNRF 267
++DSG+ T+ VY L ++F+ + L ++ D P + LCY P +
Sbjct: 285 MVDSGTQFTFLLGPVYNSLKDEFLEQTKGV-LTRIED-PNFVFEGAMDLCYHAPASLAAV 342
Query: 268 PSMAFYFEDANLRIDGEN-VFIIDYENHFFLLAVAPHDDLVAL----IGSQQQRDTRFVY 322
P++ F A +R+ GE ++ + + + DL+ + IG Q++ +
Sbjct: 343 PAVTLVFSGAEMRVSGERLLYRVSKGRDWVYCFTFGNSDLLGIEAYVIGHHHQQNVWMEF 402
Query: 323 DLNIDLLSFVKENC 336
DL + F + C
Sbjct: 403 DLVKSRVGFTETTC 416
>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 412
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 107/368 (29%), Positives = 158/368 (42%), Gaps = 62/368 (16%)
Query: 6 IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYFKC 51
+G S + +I+DTGS L + IF P SSS+Q ++C+ C +
Sbjct: 69 MGLGSTNMTVIIDTGSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVSCNSSTCQSLQF 128
Query: 52 VN----------EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
C Y + Y D S T G E +S G +FGC +N
Sbjct: 129 ATGNTGACGSNPSTCNYVVNYGDGSYTNGELGVEQLSF-----GGVSVSDFVFGCGRNNK 183
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY-TSSYLKFGTD 160
G G ++G++GL R +S +SQ + FSYCL P E S L G +
Sbjct: 184 GLF-----GGVSGLMGLGRSYLSLVSQTNATFGGVFSYCL----PTTESGASGSLVMGNE 234
Query: 161 MGYRRPSTQATKFINHPN----NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
+ T T PN NFY L+L I +D + P +F G GG +IDS
Sbjct: 235 SSVFKNVTPITYTRMLPNPQLSNFYILNLTGIDVDGVALQVP--SF-----GNGGVLIDS 287
Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPETFN-RFPSMAFY 273
G+V+T S VY L F+ F F A P + C+ L P+++ +
Sbjct: 288 GTVITRLPSSVYKALKALFLKQFTGFPSA-----PGFSILDTCFNLTGYDEVSIPTISMH 342
Query: 274 FE-DANLRIDGENVF-IIDYENHFFLLAVAPHDDL--VALIGSQQQRDTRFVYDLNIDLL 329
FE +A L++D F ++ + LA+A D A+IG+ QQR+ R +YD +
Sbjct: 343 FEGNAELKVDATGTFYVVKEDASQVCLALASLSDAYDTAIIGNYQQRNQRVIYDTKQSKV 402
Query: 330 SFVKENCS 337
F +E+CS
Sbjct: 403 GFAEESCS 410
>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
Length = 482
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 100/379 (26%), Positives = 157/379 (41%), Gaps = 65/379 (17%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
+V + IGTP++ ++ DTGS L + +FDP KSS++ + C P
Sbjct: 127 VVTIGIGTPARNFTVLFDTGSDLTWVQCKPCTDSCYQQQEPLFDPSKSSTYVDVPCGTPQ 186
Query: 46 CTY-----FKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND- 99
C C C Y++KY DQSVT+G A E ++ A G +FGCS++
Sbjct: 187 CKIGGGQDLTCGGTTCEYSVKYGDQSVTRGNLAQEAFTLSPSAPPAA---GVVFGCSHEY 243
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKR------FSYCLVIPLPNGEYTSS 153
+ G + ++AG+LGL R S +SQ +R FSYC LP ++
Sbjct: 244 SSGVKGAEEEMSVAGLLGLGRGDSSILSQ-----TRRGNSGDVFSYC----LPPRGSSAG 294
Query: 154 YLKFGTDMGYRRPSTQATKFI------NHPNNFYYLSLKDISIDNERMNFPPDTFDITVS 207
YL G P F + ++ Y ++L IS+ + F I
Sbjct: 295 YLTIGAAA----PPQSNLSFTPLVTDNSQLSSVYVVNLVGISVSGAALPIDASAFYI--- 347
Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNR 266
G +IDSG+V+T+ + Y+ L ++F + + + E + CY
Sbjct: 348 ---GTVIDSGTVITHMPAAAYYVLRDEFRRHMGGYTMLPEGHV-ESLDTCYDVTGHDVVT 403
Query: 267 FPSMAFYF-EDANLRIDGEN---VFIIDYENHFFLLA----VAPHDDLVALIGSQQQRDT 318
P +A F A + +D VF +D LA V + +IG+ QQR
Sbjct: 404 APPVALEFGGGARIDVDASGILLVFAVDASGQSLTLACLAFVPTNLPGFVIIGNMQQRAY 463
Query: 319 RFVYDLNIDLLSFVKENCS 337
V+D+ + F CS
Sbjct: 464 NVVFDVEGRRIGFGANGCS 482
>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 631
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 100/373 (26%), Positives = 154/373 (41%), Gaps = 67/373 (17%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
RL+IGTP + LI+DTGS + Y F P S+S+Q + C+ PDC
Sbjct: 78 TRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQALKCN-PDCN 136
Query: 48 YFKCVNEQ--CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
C +E CVY +YA+ S + G + + IS E + A+FGC N+ G
Sbjct: 137 ---CDDEGKLCVYERRYAEMSSSSGVLSEDLISF--GNESQLSPQRAVFGCENEETG--- 188
Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
D G++GL R +S + QL +I+ FS C Y + G M
Sbjct: 189 DLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLC---------YGGMEVGGGA-MVL 238
Query: 164 RRPSTQATKFINHPNNF----YYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
+ S +H + F Y + LK + + + + P F+ G+ G ++DSG+
Sbjct: 239 GKISPPPGMVFSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFN----GKHGTVLDSGTT 294
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF-----LPETFNRFPSMAFYF 274
YF + + + + + + D P +C+ + E N FP +A F
Sbjct: 295 YAYFPKEAFIAIKDAVIKEIPSLKRIHGPD-PNYDDVCFSGAGRDVAEIHNFFPEIAMEF 353
Query: 275 EDANLRIDGENVFIIDYENHFF---------LLAVAPHDDLVALIGSQQQRDTRFVYDLN 325
+ I+ EN+ F L + P D L+G R+T YD
Sbjct: 354 GNGQ-------KLILSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRE 406
Query: 326 IDLLSFVKENCSD 338
D L F+K NCSD
Sbjct: 407 NDKLGFLKTNCSD 419
>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
Length = 586
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 100/373 (26%), Positives = 154/373 (41%), Gaps = 67/373 (17%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
RL+IGTP + LI+DTGS + Y F P S+S+Q + C+ PDC
Sbjct: 78 TRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQALKCN-PDCN 136
Query: 48 YFKCVNEQ--CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
C +E CVY +YA+ S + G + + IS E + A+FGC N+ G
Sbjct: 137 ---CDDEGKLCVYERRYAEMSSSSGVLSEDLISF--GNESQLSPQRAVFGCENEETG--- 188
Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
D G++GL R +S + QL +I+ FS C Y + G M
Sbjct: 189 DLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLC---------YGGMEVGGGA-MVL 238
Query: 164 RRPSTQATKFINHPNNF----YYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
+ S +H + F Y + LK + + + + P F+ G+ G ++DSG+
Sbjct: 239 GKISPPPGMVFSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFN----GKHGTVLDSGTT 294
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF-----LPETFNRFPSMAFYF 274
YF + + + + + + D P +C+ + E N FP +A F
Sbjct: 295 YAYFPKEAFIAIKDAVIKEIPSLKRIHGPD-PNYDDVCFSGAGRDVAEIHNFFPEIAMEF 353
Query: 275 EDANLRIDGENVFIIDYENHFF---------LLAVAPHDDLVALIGSQQQRDTRFVYDLN 325
+ I+ EN+ F L + P D L+G R+T YD
Sbjct: 354 GNGQ-------KLILSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRE 406
Query: 326 IDLLSFVKENCSD 338
D L F+K NCSD
Sbjct: 407 NDKLGFLKTNCSD 419
>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
Length = 490
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 100/365 (27%), Positives = 155/365 (42%), Gaps = 42/365 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
+ ++ +GTP LL LDT S L + +FDPR S+S+++++ + DC
Sbjct: 139 IAKIAVGTPGVEALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYREMSFNAADC 198
Query: 47 TYFK------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
CVYT+ Y D S T G ET++ G I GC +DN
Sbjct: 199 QALGRSGGGDAKRGTCVYTVGYGDGSTTVGDFIEETLTFAGGVRLPRIS----IGCGHDN 254
Query: 101 HG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
G F A AG+LGL R +SF +Q+ FSYCLV L SS L FG
Sbjct: 255 KGLFGAPA-----AGILGLGRGLMSFPNQIDH--NGTFSYCLVDFLSGPGSLSSTLTFGA 307
Query: 160 DMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITV---SGEGGCII 214
P T + + N FYY+ L IS+ R+ + D+ + +G GG I+
Sbjct: 308 GAVDTSPPVSFTPTVLNLNMPTFYYVRLTGISVGGVRVPGVTER-DLQLDPYTGRGGVIV 366
Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL-PETFNRFPSMAFY 273
DSG+ +T Y + F + + CY + + P+++ +
Sbjct: 367 DSGTAVTRLARPAYTAFRDAFRAVAVDLGQVSIGGPSGFFDTCYTVGGRGMKKVPTVSMH 426
Query: 274 FEDA-NLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSF 331
F + +++ +N I +D A D V++IG+ QQ+ R VYD+ + F
Sbjct: 427 FAGSVEVKLQPKNYLIPVDSMGTVCFAFAATGDHSVSIIGNIQQQGFRIVYDIG-GRVGF 485
Query: 332 VKENC 336
+C
Sbjct: 486 APNSC 490
>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 431
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 87/357 (24%), Positives = 151/357 (42%), Gaps = 42/357 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY-----------AIFDPRKSSSFQKINCDHPDCTYF 49
+VR IGTP++ +LL +DT + + +F+ KS++F+ + C+ P C
Sbjct: 97 IVRAKIGTPAQTMLLAMDTSNDAAWIPCSGCVGCSSTVFNNVKSTTFKTVGCEAPQCKQV 156
Query: 50 ---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
KC C + M Y S+ AA+ + V+ FGC + G
Sbjct: 157 PNSKCGGSACAFNMTYGSSSI----AANLSQDVVTLATDS--IPSYTFGCLTEATGSSIP 210
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
+ G+LGL R +S +SQ ++ + FSYCL P S L+ G +R
Sbjct: 211 PQ-----GLLGLGRGPMSLLSQTQNLYQSTFSYCL--PSFRSLNFSGSLRLGPVGQPKR- 262
Query: 167 STQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
+ T + +P ++ YY++L I + ++ PP + G I DSG+V T
Sbjct: 263 -IKTTPLLKNPRRSSLYYVNLMAIRVGRRVVDIPPSALAFNPTTGAGTIFDSGTVFTRLV 321
Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDGE 284
+ Y + + F + L CY P P++ F F N+ + +
Sbjct: 322 APAYTAVRDAFRKRVGNATVTSLGG----FDTCYTSPIV---APTITFMFSGMNVTLPPD 374
Query: 285 NVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
N+ I + LA+A D ++ +I + QQ++ R ++D+ L +E C+
Sbjct: 375 NLLIHSTASSITCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRLGVAREPCT 431
>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 96/368 (26%), Positives = 153/368 (41%), Gaps = 44/368 (11%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
V F+GTP + LI+D+GS L++ ++ P SS+F + C P+C
Sbjct: 67 VDFFLGTPPQKFSLIVDSGSDLLWVQCAPCLQCYAQDTPLYAPSNSSTFNPVPCLSPECL 126
Query: 48 Y------FKC---VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSN 98
F C C Y +YAD S++KG A+E+ +V FGC
Sbjct: 127 LIPATEGFPCDFHYPGACAYEYRYADTSLSKGVFAYESATV-----DDVRIDKVAFGCGR 181
Query: 99 DNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
DN G A GVLGL + +SF SQ+G +F+YCLV L + SS+L FG
Sbjct: 182 DNQG-----SFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYL-DPTSVSSWLIFG 235
Query: 159 TDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
++ Q T +++ N YY+ ++ + + E + + + G GG I DS
Sbjct: 236 DELISTIHDLQFTPIVSNSRNPTLYYVQIEKVMVGGESLPISHSAWSLDFLGNGGSIFDS 295
Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFE 275
G+ +TY+ Y + F + A + + + LC + FPS
Sbjct: 296 GTTVTYWLPPAYRNILAAFDKNVRYPRAASV----QGLDLCVDVTGVDQPSFPSFTIVLG 351
Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVA---LIGSQQQRDTRFVYDLNIDLLSFV 332
+ + + +D + LA+A V IG+ Q++ YD + + F
Sbjct: 352 GGAVFQPQQGNYFVDVAPNVQCLAMAGLPSSVGGFNTIGNLLQQNFLVQYDREENRIGFA 411
Query: 333 KENCSDDS 340
CS S
Sbjct: 412 PAKCSSHS 419
>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 446
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 96/360 (26%), Positives = 158/360 (43%), Gaps = 36/360 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ IG P L ++DTGS+L + IFDP KSS++ ++C +C
Sbjct: 94 LMNFSIGEPPIPQLAVMDTGSSLTWVMCHPCSSCSQQSVPIFDPSKSSTYSNLSCS--EC 151
Query: 47 TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
VN +C Y+++Y ++G A E +++ E +FGC
Sbjct: 152 NKCDVVNGECPYSVEYVGSGSSQGIYAREQLTLETIDESIIKVPSLIFGCGRKFSISSNG 211
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
+ GV GL S + G K+FSYC + L N Y + L G +
Sbjct: 212 YPYQGINGVFGLGSGRFSLLPSFG----KKFSYC-IGNLRNTNYKFNRLVLGDKANMQGD 266
Query: 167 STQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVS-GEGGCIIDSGSVLTYFHS 225
ST +N N YY++L+ ISI +++ P F+ +++ G IIDSG+ T+
Sbjct: 267 STT----LNVINGLYYVNLEAISIGGRKLDIDPTLFERSITDNNSGVIIDSGADHTWLTK 322
Query: 226 DVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY--FLPETFNRFPSMAFYF-EDANLRID 282
+ L + + E + D P LCY + + + FP + F+F E A L +D
Sbjct: 323 YGFEVLSFEVENLLEGVLVLAQQDKHNPYTLCYSGVVSQDLSGFPLVTFHFAEGAVLDLD 382
Query: 283 GENVFIIDYENHFFLLAVAPHD------DLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
++FI EN F +A+ P + + + IG Q++ YDLN + F + +C
Sbjct: 383 VTSMFIQTTENE-FCMAMLPGNYFGDDYESFSSIGMLAQQNYNVGYDLNRMRVYFQRIDC 441
>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 437
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 97/358 (27%), Positives = 166/358 (46%), Gaps = 33/358 (9%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
++ IGTP + ++DT + I+ +FDP KSS+++ I C P C
Sbjct: 90 IISFLIGTPPFQLYGVMDTANDNIWFQCNPCKPCFNTTSPMFDPSKSSTYKTIPCSSPKC 149
Query: 47 TYFK---CVNEQ---CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
+ C ++ C Y+ Y ++ ++G + +T+++ + F + GC + N
Sbjct: 150 KNVENTHCSSDDKKVCEYSFTYGGEAYSQGDLSIDTLTLNSNNDTPISFKNIVIGCGHRN 209
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
G +G ++G +GL R +SFISQL S I +FSYCLV PL + E S L FG
Sbjct: 210 KG----PLEGYVSGNIGLGRGPLSFISQLNSSIGGKFSYCLV-PLFSNEGISGKLHFGDK 264
Query: 161 MGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
T +T I Y +L +S+ + + F T G IIDSG+ L
Sbjct: 265 SVVSGVGTVSTP-ITAGEIGYSTTLNALSVGDHIIKFENSTSK--NDNLGNTIIDSGTTL 321
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLR 280
T +VY +L E V+ + + A+ + + +LCY P + +F A++
Sbjct: 322 TILPENVYSRL-ESIVTSMVKLERAKSPN--QQFKLCYKATLKNLDVPIITAHFNGADVH 378
Query: 281 IDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
++ N F ID+E F V+ + +IG+ Q++ +DL +++SF +C+
Sbjct: 379 LNSLNTFYPIDHEVVCFAF-VSVGNFPGTIIGNIAQQNFLVGFDLQKNIISFKPTDCT 435
>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
melo]
Length = 412
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 97/378 (25%), Positives = 164/378 (43%), Gaps = 61/378 (16%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY----------AIFDPRKSSSFQKINCDHPDCTY--- 48
V L +G+P + V ++LDTGS L + ++F+P SSS+ I C P C
Sbjct: 42 VSLTVGSPPQQVTMVLDTGSELSWLHCKKSPNLTSVFNPLSSSSYSPIPCSSPVCRTRTR 101
Query: 49 -----FKCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C ++ C + YAD S +G A + + G + G LFGC + G
Sbjct: 102 DLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRI-----GSSALPGTLFGCMDS--G 154
Query: 103 FDEDAR-DGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG--- 158
F ++ D G++G++R ++SF++QLG +FSYC+ +G +S L FG
Sbjct: 155 FSSNSEEDAKTTGLMGMNRGSLSFVTQLG---LPKFSYCI-----SGRDSSGVLLFGDSH 206
Query: 159 ----TDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
++ Y +T Y + L I + N+ + P F +G G ++
Sbjct: 207 LSWLGNLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTMV 266
Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-----PIQLCYFLPE--TFNRF 267
DSG+ T+ VY L +F+ + LA L D P + LCY +P
Sbjct: 267 DSGTQFTFLLGPVYTALRNEFLEQ-TKGVLAPLGD-PNFVFQGAMDLCYRVPAGGKLPEL 324
Query: 268 PSMAFYFEDANLRIDGENVF-----IIDYENHFFLLAVAPHDDLVAL----IGSQQQRDT 318
P+++ F A + + GE + ++ + + L + DL+ + IG Q++
Sbjct: 325 PAVSLMFRGAEMVVGGEVLLYKVPGMMKGKEWVYCLTFG-NSDLLGIEAFVIGHHHQQNV 383
Query: 319 RFVYDLNIDLLSFVKENC 336
+DL + FV+ C
Sbjct: 384 WMEFDLVKSRVGFVETRC 401
>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 711
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 101/357 (28%), Positives = 159/357 (44%), Gaps = 50/357 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINC-DHPD 45
+++L +GTP + ++DTGS + + IFDP KSS+F++ C DH
Sbjct: 381 LMKLQVGTPPFEIEAVIDTGSEITWTQCLPCVHCYKQNAPIFDPSKSSTFKEKRCHDH-- 438
Query: 46 CTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
C Y + Y D++ TKG A +T+++ + + GC +N F
Sbjct: 439 ---------SCPYEVDYFDKTYTKGTLATDTVTIHSTSGEPFVMAETIIGCGRNNSWFRP 489
Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM---G 162
+ G +GL+ +S I+Q+G SYC NG +S + FGT+ G
Sbjct: 490 -----SFEGFVGLNWGPLSLITQMGGEYPGLMSYCFA---GNG---TSKINFGTNAIVGG 538
Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
ST P FYYL+L +S+ + R+ F EG +IDSG+ LTY
Sbjct: 539 GGVVSTTMFVTTARP-GFYYLNLDAVSVGDTRIETLGTPFHAL---EGNIVIDSGTTLTY 594
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE-DANLRI 281
F + Y L + V + +D LCY+ T FP + +F A+L +
Sbjct: 595 F-PESYCNLVRQAVEHV--VPAVPAADPTGNDLLCYY-SNTTEIFPVITMHFSGGADLVL 650
Query: 282 DGENVFIIDYENHFFLLAVAPHDDLV-ALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
D N+F+ Y F LA+ ++ A+ G++ Q + YD + L+SF NCS
Sbjct: 651 DKYNMFMESYSGGLFCLAIICNNPTQEAIFGNRAQNNFLVGYDSSSLLVSFKPTNCS 707
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 97/340 (28%), Positives = 147/340 (43%), Gaps = 64/340 (18%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+++L IGTP V +LDTGS LI+ IFDP KSS+F++ C+ PD
Sbjct: 66 LMKLQIGTPPFEVEAVLDTGSELIWTQCLPCLHCYDQKAPIFDPSKSSTFKETRCNTPD- 124
Query: 47 TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
C Y + Y D+S T+G A ET+++ + + GCS +N G
Sbjct: 125 -------HSCPYKLVYDDKSYTQGTLATETVTIHSTSGVPFVMPETIIGCSRNNSG---S 174
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
+ +G++GLSR ++S ISQ+G G Y + T
Sbjct: 175 GFRPSSSGIVGLSRGSLSLISQMG-----------------GAYPGDGVVSTTMFAKTAK 217
Query: 167 STQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSD 226
Q YYL+L +S+ + R+ F G +IDSG+ LTYF
Sbjct: 218 RGQ-----------YYLNLDAVSVGDTRIETVGTPFHAL---NGNIVIDSGTPLTYFPVS 263
Query: 227 VYWKLHEKFVSYFERFQLA-QLSDCPEPIQLCYFLPETFNRFPSMAFYFE-DANLRIDGE 284
Y L K V ER A ++ D LCY+ T FP + +F A+L +D
Sbjct: 264 -YCNLVRKAV---ERVVTADRVVDPSRNDMLCYY-SNTIEIFPVITVHFSGGADLVLDKY 318
Query: 285 NVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYD 323
N+++ F LA+ ++ VA+ G++ Q + YD
Sbjct: 319 NMYMELNRGGVFCLAIICNNPTQVAIFGNRAQNNFLVGYD 358
>gi|125543639|gb|EAY89778.1| hypothetical protein OsI_11320 [Oryza sativa Indica Group]
Length = 488
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 91/293 (31%), Positives = 138/293 (47%), Gaps = 27/293 (9%)
Query: 53 NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGAL 112
N+ CVYT Y D+SVT G + + G G ++ G FGC N+G +
Sbjct: 211 NQTCVYTYYYNDKSVTTGLLEVDKFTF---GAGASV-PGVAFGCGLFNNGVFKSNE---- 262
Query: 113 AGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNG-EYTSSYLKFGTDMGYR--RPSTQ 169
G+ G R +S SQL FS+C NG + ++ L D+ Y+ R + Q
Sbjct: 263 TGIAGFGRGPLSLPSQLKV---GNFSHCFTAV--NGLKQSTVLLDLLADL-YKNGRGAVQ 316
Query: 170 ATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDV 227
+T I + N YYLSLK I++ + R+ P F +T +G GG IIDSG+ +T V
Sbjct: 317 STPLIQNSANPTLYYLSLKGITVGSTRLPVPESAFALT-NGTGGTIIDSGTSITSLPPQV 375
Query: 228 YWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-FPSMAFYFEDANLRIDGEN- 285
Y + ++F + + + + P C+ P P + +FE A + + EN
Sbjct: 376 YQVVRDEFAAQIKLPVVPGNATGP---YTCFSAPSQAKPDVPKLVLHFEGATMDLPRENY 432
Query: 286 VFII--DYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
VF + D N LA+ D A IG+ QQ++ +YDL ++LSFV C
Sbjct: 433 VFEVPDDAGNSMICLAINELGDERATIGNFQQQNMHVLYDLQNNMLSFVAAQC 485
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 37/134 (27%), Positives = 65/134 (48%), Gaps = 9/134 (6%)
Query: 188 ISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQL 247
I++ + R+ P F +T +G GG IIDSG+ +T VY + ++F + + +
Sbjct: 42 ITVGSTRLPVPESAFALT-NGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGN 100
Query: 248 SDCPEPIQLCYFLP-ETFNRFPSMAFYFEDANLRIDGEN-VFII--DYENHFFLLAVAPH 303
+ P C+ P + P + +FE A + + EN VF + D N LA+
Sbjct: 101 ATGP---YTCFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKG 157
Query: 304 DDLVALIGSQQQRD 317
D+ +IG+ QQ++
Sbjct: 158 DE-TTIIGNFQQQN 170
>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 427
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 104/360 (28%), Positives = 156/360 (43%), Gaps = 47/360 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
+V + IG+P LL +DT S L++ IFDP +S + + C
Sbjct: 86 LVNISIGSPPITQLLHMDTASDLLWIQCLPCINCYAQSLPIFDPSRSYTHRNETCRTSQY 145
Query: 47 TY----FKCVNEQCVYTMKYADQSVTKGFAAHETI--SVIGKGEGKAIFHGALFGCSNDN 100
+ F C Y+M+Y D + +KG A E + + I A H +FGC +DN
Sbjct: 146 SMPSLKFNANTRSCEYSMRYVDDTGSKGILAREMLLFNTIYDESSSAALHDVVFGCGHDN 205
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
+G G+LGL S + + G K+FSYC L + Y + L G D
Sbjct: 206 YG-----EPLVGTGILGLGYGEFSLVHRFG----KKFSYCFG-SLDDPSYPHNVLVLGDD 255
Query: 161 MGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDIT-VSGEGGCIIDSGSV 219
T + N FYY++++ IS+D + P F+ +G GG IID+G+
Sbjct: 256 GANILGDTTPLEI---HNGFYYVTIEAISVDGIILPIDPRVFNRNHQTGLGGTIIDTGNS 312
Query: 220 LTYFHSDVYWKLHEKFVSYFE-RFQLAQLSDCPEPIQLCYFLPETFNR------FPSMAF 272
LT + Y L + FE RF A +S CY F R FP + F
Sbjct: 313 LTSLVEEAYKPLKNRIEDIFEGRFTAADVSQDDMIKMECY--NGNFERDLVESGFPIVTF 370
Query: 273 YF-EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSF 331
+F E A L +D +++F + + F LAV P + + IG+ Q+ YDL +SF
Sbjct: 371 HFSEGAELSLDVKSLF-MKLSPNVFCLAVTPGN--LNSIGATAQQSYNIGYDLEAMEVSF 427
>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 336
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 106/356 (29%), Positives = 146/356 (41%), Gaps = 47/356 (13%)
Query: 6 IGTPSKGVLLILDTGSAL-----------------IYAIFDPRKSSSFQKINCDHPDCTY 48
+G P + +LDTGS + I IFDP SSS+ ++CD C
Sbjct: 3 VGQPQQPSFFVLDTGSDVTWLQCLPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQCQL 62
Query: 49 FK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
C C+Y ++Y D S T G A ET++ + I GC +DN G
Sbjct: 63 LDEAGCNVNSCIYKVEYGDGSFTIGELATETLTFVHSNSIPNIS----IGCGHDNEGLFV 118
Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV-IPLPNGEYTSSYLKFGTDMGYR 164
A IS SQL + FSYCLV I P + S L F TD
Sbjct: 119 GADGLIGL-----GGGAISISSQLKA---SSFSYCLVDIDSP----SFSTLDFNTDPPSD 166
Query: 165 RPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
+ K P+ F Y+ + +S+ + + F+I SG GG I+DSG+ +T
Sbjct: 167 SLISPLVKNDRFPS-FRYVKVIGMSVGGKPLPISSSRFEIDESGLGGIIVDSGTTITQLP 225
Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCYFLPETFN-RFPSMAFYFEDAN-LR 280
SDVY L E F+ L PE P CY L N P++AF N L+
Sbjct: 226 SDVYEVLREAFLGL-----TTNLPPAPEISPFDTCYDLSSQSNVEVPTIAFILPGENSLQ 280
Query: 281 IDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ +N I F LA +++IG+ QQ+ R YDL L+ F C
Sbjct: 281 LPAKNCLIQVDSAGTFCLAFVSATFPLSIIGNFQQQGIRVSYDLTNSLVGFSTNKC 336
>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 374
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 96/357 (26%), Positives = 156/357 (43%), Gaps = 31/357 (8%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ + IGTP + I DTGS L + IFDP+KS+S++ I+CD C
Sbjct: 26 LMEVSIGTPPFKIYGIADTGSDLTWTSCVPCNKCYKQRNPIFDPQKSTSYRNISCDSKLC 85
Query: 47 ----TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
T + C YT YA ++T+G A ETI++ G +FGC ++N G
Sbjct: 86 HKLDTGVCSPQKHCNYTYAYASAAITQGVLAQETITLSSTKGESVPLKGIVFGCGHNNTG 145
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSII-KKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
D G++GL +SFISQ+GS KRFS CLV P SS + G
Sbjct: 146 GFNDRE----MGIIGLGGGPVSFISQIGSSFGGKRFSQCLV-PFHTDVSVSSKMSLGKGS 200
Query: 162 GYRRPSTQATKFINHPNNF-YYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
+T + + Y+++L IS+ N ++F + +G +DSG+
Sbjct: 201 EVSGKGVVSTPLVAKQDKTPYFVTLLGISVGNTYLHFNGSSSQSVE--KGNVFLDSGTPP 258
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLR 280
T + +Y +L + S E +D QLCY R P + +FE +++
Sbjct: 259 TILPTQLYDRLVAQVRS--EVAMKPVTNDLDLGPQLCYRTKNNL-RGPVLTAHFEGGDVK 315
Query: 281 IDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+ F + ++ F L + G+ Q + +DL+ ++SF +C+
Sbjct: 316 LLPTQTF-VSPKDGVFCLGFTNTSSDGGVYGNFAQSNYLIGFDLDRQVVSFKPMDCT 371
>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
Length = 469
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 102/369 (27%), Positives = 153/369 (41%), Gaps = 64/369 (17%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
+V L GTPS +L++DTGS + + +FDP KSS++ I C+
Sbjct: 132 VVTLGFGTPSVPQVLLMDTGSDVSWVQCTPCNSTKCYPQKDPLFDPSKSSTYAPIACNTD 191
Query: 45 DCTYFK------CVN--EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
C C + QC Y+++YAD S ++G ++ET++ + G FH FGC
Sbjct: 192 ACRKLGDHYHNGCTSGGTQCGYSVEYADGSHSRGVYSNETLT-LAPGITVEDFH---FGC 247
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
D G D DG +LGL +S + Q S+ FSYCL P N E + +L
Sbjct: 248 GRDQRG-PSDKYDG----LLGLGGAPVSLVVQTSSVYGGAFSYCL--PALNSE--AGFLV 298
Query: 157 FGTDMGYRRPSTQATKFINHP-------NNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
G+ PS + F+ P FY +++ IS+ + ++ P F
Sbjct: 299 LGSP-----PSGNKSAFVFTPMRHLPGYATFYMVTMTGISVGGKPLHIPQSAF------R 347
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFP 268
GG IIDSG+V T Y L + + L D CY N P
Sbjct: 348 GGMIIDSGTVDTELPETAYNALEAALRKALKAYPLVPSDD----FDTCYNFTGYSNITVP 403
Query: 269 SMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNID 327
+AF F A + +D N ++ N + DD + +IG+ QR +YD
Sbjct: 404 RVAFTFSGGATIDLDVPNGILV---NDCLAFQESGPDDGLGIIGNVNQRTLEVLYDAGRG 460
Query: 328 LLSFVKENC 336
+ F C
Sbjct: 461 NVGFRAGAC 469
>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
Length = 456
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 100/373 (26%), Positives = 162/373 (43%), Gaps = 61/373 (16%)
Query: 6 IGTPSKGVLLILDTGSALIY----------------AIFDPRKSSSFQKINCDHPDCTYF 49
+GTP +L I DTGS L++ +F P +S+++ ++C C
Sbjct: 106 VGTPPAQMLAIADTGSDLVWVNCSSNGGGGGASDGAVVFHPSRSTTYSLLSCQSAACQAL 165
Query: 50 ---KC-VNEQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGALFGCSNDNHG 102
C + +C Y Y D S T G + ET S G GEG+ FGCS + G
Sbjct: 166 SQASCDADSECQYQYAYGDGSRTIGVLSTETFSFAAAGGGGEGQVRVPRVSFGCSTGSAG 225
Query: 103 -FDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYTSSYLKFGT 159
F D G++GL +S +SQLG+ I +RFSYCLV P +SS L FG
Sbjct: 226 SFRSD-------GLVGLGAGALSLVSQLGAAARIARRFSYCLVPPYAAAN-SSSTLSFGA 277
Query: 160 DMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
P +T + + +++Y ++L+ +++ + D+ + I+DSG+
Sbjct: 278 RAVVSDPGAASTPLVPSEVDSYYTVALESVAVAGQ---------DVASANSSRIIVDSGT 328
Query: 219 VLTYFHSDVYWKLHEKFVSYFE-RFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDA 277
LT+ L V+ E R +L + + +QLCY + + + F D
Sbjct: 329 TLTFLDP----ALLRPLVAELERRIRLPRAQPPEQLLQLCY---DVQGKSQAEDFGIPDV 381
Query: 278 NLRIDGENVFIIDYENHFFL-------LAVAP--HDDLVALIGSQQQRDTRFVYDLNIDL 328
LR G + EN F L L + P V+++G+ Q++ YDL+
Sbjct: 382 TLRFGGGASVTLRPENTFSLLEEGTLCLVLVPVSESQPVSILGNIAQQNFHVGYDLDART 441
Query: 329 LSFVKENCSDDSA 341
++F +C+ SA
Sbjct: 442 VTFAAVDCTRSSA 454
>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
Length = 466
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 90/360 (25%), Positives = 148/360 (41%), Gaps = 36/360 (10%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA----------IFDPRKSSSFQKINCDHPDC----- 46
V+L +GTP + L+ DTGS L + +F P+ S S+ I C C
Sbjct: 118 VKLRVGTPVQEFTLVADTGSDLTWVKCAGASPPGRVFRPKTSRSWAPIPCSSDTCKLDVP 177
Query: 47 -TYFKCVN--EQCVYTMKYADQSV-TKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
T C + C Y +Y + S +G E+ ++ G A + GCS+ + G
Sbjct: 178 FTLANCSSPASPCTYDYRYKEGSAGARGIVGTESATIALPGGKVAQLKDVVLGCSSSHDG 237
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
+ DG VL L ISF +Q + FSYCLV L T YL FG
Sbjct: 238 QSFRSADG----VLSLGNAKISFATQAAARFGGSFSYCLVDHLAPRNATG-YLAFGPGQV 292
Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
R P+TQ F++ FY + + I + + ++ P + +D + GG I+DSG+ LT
Sbjct: 293 PRTPATQTKLFLDPEMPFYGVKVDAIHVAGKALDIPAEVWD---AKSGGVILDSGNTLTV 349
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL----PETFNRFPSMAFYFEDAN 278
+ Y + + + P + CY P P +A F +
Sbjct: 350 LAAPAYKAVVAALSKHLDGVPKVSF----PPFEHCYNWTARRPGAPEIIPKLAVQFAGSA 405
Query: 279 LRIDGENVFIIDYENHFFLLAVAPHD-DLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
++ID + + V + +++IG+ Q++ + +DL + F + NC+
Sbjct: 406 RLEPPAKSYVIDVKPGVKCIGVQEGEWPGLSVIGNIMQQEHLWEFDLKNMQVRFKQSNCT 465
>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 104 bits (259), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 84/348 (24%), Positives = 142/348 (40%), Gaps = 68/348 (19%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
VR+ +G+P + +++D+GS +++ +FDP S+SF ++C C
Sbjct: 203 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADSASFTGVSCSSSVCD 262
Query: 48 YFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
+ C +C Y + Y D S TKG A ET++ G+ + GC + N G
Sbjct: 263 RLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTF-----GRTMVRSVAIGCGHRNRGMF 317
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
A ++SF+ QLG FSYCLV
Sbjct: 318 VGAAGLLGL-----GGGSMSFVGQLGGQTGGAFSYCLV---------------------- 350
Query: 165 RPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
S + +P +FYY+ L + + R+ + F +T G+GG ++D+G+ +T
Sbjct: 351 --SAAWVPLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMDTGTAVTR 408
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL-----CY-FLPETFNRFPSMAFYFED 276
+ Y + F LAQ ++ P + CY L R P+++FYF
Sbjct: 409 LPTLAYQAFRDAF--------LAQTANLPRATGVAIFDTCYDLLGFVSVRVPTVSFYFSG 460
Query: 277 AN-LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYD 323
L + N I + F A AP ++++G+ QQ + +D
Sbjct: 461 GPILTLPARNFLIPMDDAGTFCFAFAPSTSGLSILGNIQQEGIQISFD 508
>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
Length = 473
Score = 104 bits (259), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 98/368 (26%), Positives = 159/368 (43%), Gaps = 65/368 (17%)
Query: 6 IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYFKC 51
+G +I+DT S L + +FDP S S+ + C+ C +
Sbjct: 131 VGLGGGEATVIVDTASELTWVQCAPCASCHDQQGPLFDPASSPSYAVLPCNSSSCDALQV 190
Query: 52 VN----------EQ--CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
EQ C YT+ Y D S ++G AH+ +S+ G+ + G +FGC
Sbjct: 191 ATGSAAGACGGGEQPSCSYTLSYRDGSYSQGVLAHDKLSLAGE-----VIDGFVFGCGTS 245
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
N G G +G++GL R +S ISQ FSYCL PL E +S L G
Sbjct: 246 NQG-----PFGGTSGLMGLGRSQLSLISQTMDQFGGVFSYCL--PLKESE-SSGSLVLGD 297
Query: 160 DMGYRRPSTQA--TKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
D R ST T ++ P FY+++L I+I + + S G I+D
Sbjct: 298 DTSVYRNSTPIVYTTMVSDPVQGPFYFVNLTGITIGGQEVE----------SSAGKVIVD 347
Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPETFN-RFPSMAF 272
SG+++T VY + +F+S F + A P + C+ L + PS+ F
Sbjct: 348 SGTIITSLVPSVYNAVKAEFLSQFAEYPQA-----PGFSILDTCFNLTGFREVQIPSLKF 402
Query: 273 YFE-DANLRIDGENV-FIIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFVYDLNIDL 328
FE + + +D V + + ++ LA+A + ++IG+ QQ++ R ++D
Sbjct: 403 VFEGNVEVEVDSSGVLYFVSSDSSQVCLALASLKSEYETSIIGNYQQKNLRVIFDTLGSQ 462
Query: 329 LSFVKENC 336
+ F +E C
Sbjct: 463 IGFAQETC 470
>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 104 bits (259), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 96/380 (25%), Positives = 162/380 (42%), Gaps = 57/380 (15%)
Query: 1 MVRLFIGTP-SKGVLLILDTGSALIYA-------------IFDPRKSSSFQKINCDHPDC 46
++ L IGTP + V+L LDTGS L++ +F S +F ++ C P C
Sbjct: 95 LIHLGIGTPRPQRVVLHLDTGSDLVWTQCACTVCFDQPVPVFRASVSHTFSRVPCSDPLC 154
Query: 47 TYF--------KCVNEQCVYTMKYADQSVTKGFAAHETISVIG--KGEGKAIFHGALFGC 96
+ + C Y Y D S+T G A +T + + + A FGC
Sbjct: 155 GHAVYLPLSGCAARDRSCFYAYGYMDHSITTGKMAEDTFTFKAPDRADTAAAVPNIRFGC 214
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
N+G + +G+ G +S SQL +RFSYC E S +
Sbjct: 215 GMMNYGLFTPNQ----SGIAGFGTGPLSLPSQLKV---RRFSYCFTA---MEESRVSPVI 264
Query: 157 FGTD----MGYRRPSTQATKFINHPNN-------FYYLSLKDISIDNERMNFPPDTFDIT 205
G + + Q+T F P FY+LSL+ +++ R+ F TF +
Sbjct: 265 LGGEPENIEAHATGPIQSTPFAPGPAGAPVGSQPFYFLSLRGVTVGETRLPFNASTFALK 324
Query: 206 VSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP--ET 263
G GG IDSG+ +T+F V+ L E FV+ +A+ P+ + LC+ +P +
Sbjct: 325 GDGSGGTFIDSGTAITFFPQAVFRSLREAFVAQVP-LPVAKGYTDPDNL-LCFSVPAKKK 382
Query: 264 FNRFPSMAFYFEDANLRIDGENVFIIDYEN-------HFFLLAVAPHDDLVALIGSQQQR 316
P + + E A+ + EN +++D ++ ++ ++ + +IG+ QQ+
Sbjct: 383 APAVPKLILHLEGADWELPREN-YVLDNDDDGSGAGRKLCVVILSAGNSNGTIIGNFQQQ 441
Query: 317 DTRFVYDLNIDLLSFVKENC 336
+ VYDL + + F C
Sbjct: 442 NMHIVYDLESNKMVFAPARC 461
>gi|326532334|dbj|BAK05096.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 437
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 89/328 (27%), Positives = 146/328 (44%), Gaps = 19/328 (5%)
Query: 26 AIFDPRKSSSFQKINCDHPDCT--YFKCVNEQC-VYTMKYADQSVTKGFAAHETISVIGK 82
A+FD +S ++ + P CT Y V +C YT + G+ + + G
Sbjct: 110 AVFDSAESPRYKHMKATDPMCTPPYTPSVGNRCSFYTTTW--NVAAHGYLGSDMFAFAGT 167
Query: 83 GEG--KAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFS 138
G G +FGC++ G E G LAG L LSR +SF+SQL + + RFS
Sbjct: 168 GAGGHSTDVDQLIFGCAHTTDGL-ERLSHGVLAGALSLSRHPMSFLSQLTARGLADSRFS 226
Query: 139 YCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNER-M 195
YCL + +L+FG D+ + + P + Y++ + IS++ R M
Sbjct: 227 YCLFPEQSHPIAKHGFLRFGRDIPRHDHAHSTSLLFTGPGSGGMYHIRVVGISLNGRRIM 286
Query: 196 NFPPDTFDITV-SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPI 254
P F + + GG ++D G+ LT Y + + V+ ++ + +
Sbjct: 287 RLQPAMFTRNLQTRRGGSVVDPGTPLTRLVRQAYDIVEAEVVANMQKQGARRAKAQVQGH 346
Query: 255 QLCYFLPETFNRFPSMA--FYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGS 312
+LC F+ PS+ Y + A L I E +F V P D+ + ++G+
Sbjct: 347 RLC-FVSWGHVHLPSLTINMYEDTAKLFIKPELLFR-KVTARLLCFTVMP-DEEMTVLGA 403
Query: 313 QQQRDTRFVYDLNIDLLSFVKENCSDDS 340
QQ DTRF +DL+ + L F +ENC+ D+
Sbjct: 404 AQQMDTRFTFDLHANRLYFAQENCNADT 431
>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 91/362 (25%), Positives = 152/362 (41%), Gaps = 61/362 (16%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY---------AIFDPRKSSSFQKINCDHPDCTYFK- 50
++ + IG+P+ +++DTGS + + +FDP KS+++ +C C
Sbjct: 130 VITVGIGSPAVTQTMMIDTGSDVSWVRCNSTDGLTLFDPSKSTTYAPFSCSSAACAQLGN 189
Query: 51 ----CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
C N C Y ++Y D S T G + +T++ + + FH FGCS+ FD +
Sbjct: 190 NGDGCSNSGCQYRVQYGDGSNTTGTYSSDTLA-LSASDTVTDFH---FGCSHHEEDFDGE 245
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
DG ++GL S +SQ + K FSYC LP TS +L FG P
Sbjct: 246 KIDG----LMGLGGDAQSLVSQTAATYGKSFSYC----LPPTNRTSGFLTFGA------P 291
Query: 167 STQATKFINHP-------NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
+ + F+ P Y + L+DIS+ + P G ++DSG+V
Sbjct: 292 NGTSGGFVTTPMLRWPKAPTLYGVLLQDISVGGTPLGIQPSVL------SNGSVMDSGTV 345
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL---CYFLPETFN-RFPSMAFYFE 275
+T+ Y L F S R + + + P+ + CY N P+++ +
Sbjct: 346 ITWLPRRAYSALSSAFRSSMTRLRHQRAA----PLGILDTCYDFTGLVNVSIPAVSLVLD 401
Query: 276 -DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
A + +DG + I D A D ++IG+ QQR ++D+ + F
Sbjct: 402 GGAVVDLDGNGIMIQD----CLAFAATSGD---SIIGNVQQRTFEVLHDVGQGVFGFRSG 454
Query: 335 NC 336
C
Sbjct: 455 AC 456
>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 489
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 98/371 (26%), Positives = 160/371 (43%), Gaps = 48/371 (12%)
Query: 2 VRLFIGTPS-KGVLLILDTGSALIYA------------------IFDPRKSSSFQKINCD 42
V + IGTP + +L+ DTGS L + +F SSSF+ I C
Sbjct: 121 VSIRIGTPRPQKFILVTDTGSDLTWMNCEYWCKSCPKPNPHPGRVFRANDSSSFRTIPCS 180
Query: 43 HPDCT-----YFKCV-----NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGA 92
DC YF N C++ +Y + G A+ET++V K
Sbjct: 181 SDDCKIELQDYFSLTECPNPNAPCLFDYRYLNGPRAIGVFANETVTVGLNDHKKIRLFDV 240
Query: 93 LFGCS---NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGE 149
L GC+ N+ +GF + GV+GL S +L I +FSYCLV L +
Sbjct: 241 LIGCTESFNETNGFPD--------GVMGLGYRKHSLALRLAEIFGNKFSYCLVDHLSSSN 292
Query: 150 YTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSG 208
+ ++L FG + P Q T+ + + N FY +++ IS+ ++ D +++T G
Sbjct: 293 H-KNFLSFGDIPEMKLPKMQHTELLLGYINAFYPVNVSGISVGGSMLSISSDIWNVT--G 349
Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-- 266
GG I+DSG+ LT + Y K+ + F++ + + PE C F + F+R
Sbjct: 350 VGGMIVDSGTSLTMLAGEAYDKVVDALKPIFDKHKKVVPIELPELNNFC-FEDKGFDRAA 408
Query: 267 FPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHD-DLVALIGSQQQRDTRFVYDLN 325
P + +F D + +IID L + D +++G+ Q++ + YDL
Sbjct: 409 VPRLLIHFADGAIFKPPVKSYIIDVAEGIKCLGIIKADFPGSSILGNVMQQNHLWEYDLG 468
Query: 326 IDLLSFVKENC 336
L F +C
Sbjct: 469 RGKLGFGPSSC 479
>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
Length = 485
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 93/358 (25%), Positives = 152/358 (42%), Gaps = 44/358 (12%)
Query: 3 RLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTY 48
R+ +G P + L++LDTGS + + I++P SSS++ + C C
Sbjct: 148 RIGVGAPRRDQLMVLDTGSDVTWIQCEPCSDCYQQSDPIYNPALSSSYKLVGCQANLCQQ 207
Query: 49 FKCV----NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
N C+Y + Y D S T+G A ET+++ G A GC +DN G
Sbjct: 208 LDVSGCSRNGSCLYQVSYGDGSYTQGNFATETLTL-----GGAPLQNVAIGCGHDNEGLF 262
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT----D 160
A ++SF SQL K FSYCLV +SS L+FG +
Sbjct: 263 VGAAGLLGL-----GGGSLSFPSQLTDENGKIFSYCLV---DRDSESSSTLQFGRAAVPN 314
Query: 161 MGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
P + ++ + FYY+SL IS+ + ++ F I SG GG I+DSG+ +
Sbjct: 315 GAVLAPMLKNSRL----DTFYYVSLSGISVGGKMLSISDSVFGIDASGNGGVIVDSGTAV 370
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFE-DAN 278
T + Y L + F + + +D CY L + P++ F+F +
Sbjct: 371 TRLQTAAYDSLRDAFRAGTKNL---PSTDGVSLFDTCYDLSSKESVDVPTVVFHFSGGGS 427
Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ + +N + F A AP ++++G+ QQ+ R +D + + F C
Sbjct: 428 MSLPAKNYLVPVDSMGTFCFAFAPTSSSLSIVGNIQQQGIRVSFDRANNQVGFAVNKC 485
>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 103 bits (258), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 106/364 (29%), Positives = 161/364 (44%), Gaps = 53/364 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY-----------------AIFDPRKSSSFQKINCDH 43
+ ++ +G P K L+ DTGS + + IFDP+ SSS+ ++C+
Sbjct: 149 LAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCNS 208
Query: 44 PDCTYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
C C ++ C+Y + Y D S T G A ET+S G +I + + GC +DN
Sbjct: 209 QQCKLLDKANCNSDTCIYQVHYGDGSFTTGELATETLSF---GNSNSIPNLPI-GCGHDN 264
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY-TSSYLKFGT 159
G IS SQL + FSYCLV N + +SS L+F +
Sbjct: 265 EGLFAGGAGLIGL-----GGGAISLSSQLKA---SSFSYCLV----NLDSDSSSTLEFNS 312
Query: 160 DMGYRRPSTQATKFINHPNNFY---YLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
M PS T + + F+ Y+ + IS+ + + P F+I SG GG I+DS
Sbjct: 313 YM----PSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDS 368
Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCY-FLPETFNRFPSMAFY 273
G++++ SDVY L E FV + LS P CY F ++ P++AF
Sbjct: 369 GTIISRLPSDVYESLREAFVKL-----TSSLSPAPGISVFDTCYNFSGQSNVEVPTIAFV 423
Query: 274 F-EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
E +LR+ N I+ + LA +++IGS QQ+ R YDL ++ F
Sbjct: 424 LSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSIVGFS 483
Query: 333 KENC 336
C
Sbjct: 484 TNKC 487
>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
Length = 472
Score = 103 bits (258), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 98/368 (26%), Positives = 159/368 (43%), Gaps = 65/368 (17%)
Query: 6 IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYFKC 51
+G +I+DT S L + +FDP S S+ + C+ C +
Sbjct: 130 VGLGGGEATVIVDTASELTWVQCAPCASCHDQQGPLFDPASSPSYAVLPCNSSSCDALQV 189
Query: 52 VN----------EQ--CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
EQ C YT+ Y D S ++G AH+ +S+ G+ + G +FGC
Sbjct: 190 ATGSAAGACGGGEQPSCSYTLSYRDGSYSQGVLAHDKLSLAGE-----VIDGFVFGCGTS 244
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
N G G +G++GL R +S ISQ FSYCL PL E +S L G
Sbjct: 245 NQG-----PFGGTSGLMGLGRSQLSLISQTMDQFGGVFSYCL--PLKESE-SSGSLVLGD 296
Query: 160 DMGYRRPSTQA--TKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
D R ST T ++ P FY+++L I+I + + S G I+D
Sbjct: 297 DTSVYRNSTPIVYTTMVSDPVQGPFYFVNLTGITIGGQEVE----------SSAGKVIVD 346
Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPETFN-RFPSMAF 272
SG+++T VY + +F+S F + A P + C+ L + PS+ F
Sbjct: 347 SGTIITSLVPSVYNAVKAEFLSQFAEYPQA-----PGFSILDTCFNLTGFREVQIPSLKF 401
Query: 273 YFE-DANLRIDGENV-FIIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFVYDLNIDL 328
FE + + +D V + + ++ LA+A + ++IG+ QQ++ R ++D
Sbjct: 402 VFEGNVEVEVDSSGVLYFVSSDSSQVCLALASLKSEYETSIIGNYQQKNLRVIFDTLGSQ 461
Query: 329 LSFVKENC 336
+ F +E C
Sbjct: 462 IGFAQETC 469
>gi|224090744|ref|XP_002309070.1| predicted protein [Populus trichocarpa]
gi|222855046|gb|EEE92593.1| predicted protein [Populus trichocarpa]
Length = 404
Score = 103 bits (258), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 86/323 (26%), Positives = 149/323 (46%), Gaps = 54/323 (16%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY----------AIFDPRKSSSFQKINCDHPDCTY-- 48
+V L +GTP + V +++DTGS L + FDP +S+S+Q I C P CT
Sbjct: 32 IVSLTVGTPPQNVSMVIDTGSELSWLHCNKTLSYPTTFDPTRSTSYQTIPCSSPTCTNRT 91
Query: 49 ------FKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND-- 99
C N C T+ YAD S + G A + + G + G +FGC +
Sbjct: 92 QDFPIPASCDSNNLCHATLSYADASSSDGNLASDVFHI-----GSSDISGLVFGCMDSVF 146
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG- 158
+ DED++ G++G++R ++SF+SQLG +FSYC+ +G S L G
Sbjct: 147 SSNSDEDSKS---TGLMGMNRGSLSFVSQLG---FPKFSYCI-----SGTDFSGLLLLGE 195
Query: 159 TDMGYRRPSTQATKF-INHPNNF-----YYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
+++ + P I+ P + Y + L+ I + ++ + P TF+ +G G
Sbjct: 196 SNLTWSVPLNYTPLIQISTPLPYFDRVAYTVQLEGIKVLDKLLPIPKSTFEPDHTGAGQT 255
Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-----PIQLCYFLP---ETF 264
++DSG+ T+ VY L F++ + + ++ + P+ + LCY +P
Sbjct: 256 MVDSGTQFTFLLGPVYNALRSAFLN--QTSSVLRVLEDPDFVFQGAMDLCYLVPLSQRVL 313
Query: 265 NRFPSMAFYFEDANLRIDGENVF 287
P++ F A + + G+ V
Sbjct: 314 PLLPTVTLVFRGAEMTVSGDRVL 336
>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 103 bits (258), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 99/361 (27%), Positives = 151/361 (41%), Gaps = 44/361 (12%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
RL +GTP K + ++LDTGS +++ IFDP KS SF I C P C
Sbjct: 132 TRLGVGTPPKYLYMVLDTGSDVVWLQCKPCTKCYSQTDQIFDPSKSKSFAGIPCYSPLCR 191
Query: 48 YF-----KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
N C Y + Y D S T G + ET++ +A GC +DN G
Sbjct: 192 RLDSPGCSLKNNLCQYQVSYGDGSFTFGDFSTETLTF-----RRAAVPRVAIGCGHDNEG 246
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
A R +SF +Q G+ +FSYCL + + +S + FG
Sbjct: 247 LFVGAAGLLGL-----GRGGLSFPTQTGTRFNNKFSYCLTDRTASAKPSS--IVFGDSAV 299
Query: 163 YRRPSTQATKFINHP--NNFYYLSLKDISIDNERMN-FPPDTFDITVSGEGGCIIDSGSV 219
R + + T + +P + FYY+ L IS+ + F + +G GG IIDSG+
Sbjct: 300 SR--TARFTPLVKNPKLDTFYYVELLGISVGGAPVRGISASFFRLDSTGNGGVIIDSGTS 357
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCYFLPETFN-RFPSMAFYFED 276
+T Y L + F R + L PE CY L + P++ +F
Sbjct: 358 VTRLTRPAYVSLRDAF-----RVGASHLKRAPEFSLFDTCYDLSGLSEVKVPTVVLHFRG 412
Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
A++ + N + + F A A +++IG+ QQ+ R V+DL + F C
Sbjct: 413 ADVSLPAANYLVPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRVVFDLAGSRVGFAPRGC 472
Query: 337 S 337
+
Sbjct: 473 A 473
>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
Length = 477
Score = 103 bits (258), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 102/393 (25%), Positives = 153/393 (38%), Gaps = 72/393 (18%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA-----------------------IFDPRKSSSFQK 38
VR +GTP++ LL+ DTGS L + F P S ++
Sbjct: 99 VRFRVGTPAQPFLLVADTGSDLTWVKCRRPASANSSLSPADSGPGPGRAFRPEDSRTWAP 158
Query: 39 INCDHPDCTYF--------KCVNEQCVYTMKYADQSVTKGFAAHE--TISVIGKGEGKAI 88
I+C CT C Y +Y D S +G E TI++ G+ E KA
Sbjct: 159 ISCASDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSGREERKAK 218
Query: 89 FHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNG 148
G + GCS+ G +A D GVL L ISF S S RFSYCLV L +
Sbjct: 219 LKGLVLGCSSSYTGPSFEASD----GVLSLGYSGISFASHAASRFGGRFSYCLVDHL-SP 273
Query: 149 EYTSSYLKFGTDMGYRRP------------STQATKFI--NHPNNFYYLSLKDISIDNER 194
+SYL FG + P + T + FY +SLK IS+ E
Sbjct: 274 RNATSYLTFGPNPAVSSPRASPSSCAAAAPRARQTPLLLDRRMRPFYDVSLKAISVAGEF 333
Query: 195 MNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKL---HEKFVSYFERFQLAQLSDCP 251
+ P +D+ GG I+DSG+ LT Y + K ++ R +
Sbjct: 334 LKIPRAVWDVEAG--GGVILDSGTSLTVLAKPAYRAVVAALSKGLAGLPRVTM------- 384
Query: 252 EPIQLCYFLPETFNR-----FPSMAFYFEDANLRIDGENVFIIDYENHFFLLAV--APHD 304
+P + CY + P MA +F A ++ID + + P
Sbjct: 385 DPFEYCYNWTSPSGKDADVAVPKMAVHFAGAARLEPPGKSYVIDAAPGVKCIGLQEGPWP 444
Query: 305 DLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+++IG+ Q++ + +D+ L F + C+
Sbjct: 445 G-ISVIGNILQQEHLWEFDIKNRRLKFQRSRCT 476
>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
Length = 485
Score = 103 bits (258), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 101/370 (27%), Positives = 155/370 (41%), Gaps = 50/370 (13%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
++ +GTP+ L++LDTGS +++ +FDPR+SSS+ + C C
Sbjct: 131 TKIGVGTPATQALMVLDTGSDVVWVQCAPCRRCYEQSGPVFDPRRSSSYGAVGCGAALCR 190
Query: 48 YFKCVN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C+Y + Y D SVT G ET++ G G + AL GC +DN G
Sbjct: 191 RLDSGGCDLRRGACMYQVAYGDGSVTAGDFVTETLTFAG---GARVARVAL-GCGHDNEG 246
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV------IPLPNGEYTSSYLK 156
A R +SF +Q+ + FSYCLV G + SS +
Sbjct: 247 LFVAAAGLLGL-----GRGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVS 301
Query: 157 FGTDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITV---SGEGG 211
FG S T + +P FYY+ L IS+ R+ ++ D+ + +G GG
Sbjct: 302 FGAGS-VGASSASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAES-DLRLDPSTGRGG 359
Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL---CYFL-PETFNRF 267
I+DSG+ +T Y L + F + L P L CY L +
Sbjct: 360 VIVDSGTSVTRLARASYSALRDAFRAA----AAGGLRLSPGGFSLFDTCYDLGGRRVVKV 415
Query: 268 PSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNI 326
P+++ +F A + EN I F A A D V++IG+ QQ+ R V+D +
Sbjct: 416 PTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDG 475
Query: 327 DLLSFVKENC 336
+ F + C
Sbjct: 476 QRVGFAPKGC 485
>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 103 bits (258), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 99/361 (27%), Positives = 159/361 (44%), Gaps = 39/361 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
++ ++IGTP + ++DTGS LI+ +FDP KSS++ I+CD P C
Sbjct: 69 LMEIYIGTPPIKITGLVDTGSDLIWIQCAPCLGCYKQIKPMFDPLKSSTYNNISCDSPLC 128
Query: 47 ----TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
T ++C YT Y D S+TKG A +T + LFGC ++N G
Sbjct: 129 HKLDTGVCSPEKRCNYTYGYGDNSLTKGVLAQDTATFTSNTGKPVSLSRFLFGCGHNNTG 188
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSII-KKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
D G++GL S ISQ+G + K+FS CLV P SS + FG
Sbjct: 189 GFNDHE----MGLIGLGGGPTSLISQIGPLFGGKKFSQCLV-PFLTDIKISSRMSFGKGS 243
Query: 162 GYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
T + + Y+++L IS+ E FP + + G+ ++DSG+
Sbjct: 244 QVLGNGVVTTPLVPREKDTSYFVTLLGISV--EDTYFPMN----STIGKANMLVDSGTPP 297
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-PIQLCYFLPETFNRFPSMAFYFEDANL 279
+Y K+ F + L ++D P QLCY +T + P++ F+F AN+
Sbjct: 298 ILLPQQLYDKV---FAEVRNKVALKPITDDPSLGTQLCYRT-QTNLKGPTLTFHFVGANV 353
Query: 280 RIDGENVFI--IDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ FI F LA+ + + G+ Q + +DL+ ++SF +C
Sbjct: 354 LLTPIQTFIPPTPQTKGIFCLAIYNRTNSDPGVYGNFAQSNYLIGFDLDRQVVSFKPTDC 413
Query: 337 S 337
+
Sbjct: 414 T 414
>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 665
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 100/373 (26%), Positives = 153/373 (41%), Gaps = 67/373 (17%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
RL+IGTP + LI+DTGS + Y F P SSS++ + C+ PDC
Sbjct: 82 TRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSSSYKALKCN-PDCN 140
Query: 48 YFKCVNEQ--CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
C +E CVY +YA+ S + G + + IS E + A+FGC N G
Sbjct: 141 ---CDDEGKLCVYERRYAEMSSSSGVLSEDLISF--GNESQLTPQRAVFGCENVETG--- 192
Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
D G++GL R +S + QL +I+ FS C Y + G M
Sbjct: 193 DLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLC---------YGGMEVGGGA-MVL 242
Query: 164 RRPSTQATKFINHPNNF----YYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
+ S A +H + F Y + LK + + + + P F+ G+ G ++DSG+
Sbjct: 243 GKISPPAGMVFSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFN----GKHGTVLDSGTT 298
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF-----LPETFNRFPSMAFYF 274
YF + + + + + + D P +C+ + E N FP + F
Sbjct: 299 YAYFPKEAFIAIKDAIIKEIPSLKRIHGPD-PNYDDVCFSGAGRDVAEIHNFFPEIDMEF 357
Query: 275 EDANLRIDGENVFIIDYENHFF---------LLAVAPHDDLVALIGSQQQRDTRFVYDLN 325
+ I+ EN+ F L + P D L+G R+T YD
Sbjct: 358 GNGQ-------KLILSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRE 410
Query: 326 IDLLSFVKENCSD 338
D L F+K NCSD
Sbjct: 411 NDKLGFLKTNCSD 423
>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
Length = 378
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 99/371 (26%), Positives = 152/371 (40%), Gaps = 49/371 (13%)
Query: 6 IGTPSKGVLLILDTGSALIYA-----------------------IFDPRKSSSFQKINCD 42
+GTPS+ +L+ DTGS L + +F SSSF+ I C
Sbjct: 18 VGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANLSSSFKTIPCL 77
Query: 43 HPDCT-----YFKCVN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGA 92
C F N C Y +Y+D S GF A+ET++V K K H
Sbjct: 78 TDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVELKEGRKMKLHNV 137
Query: 93 LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS 152
L GCS G A D GV+GL SF + +FSYCLV L + + S
Sbjct: 138 LIGCSESFQGQSFQAAD----GVMGLGYSKYSFAIKAAEKFGGKFSYCLVDHLSH-KNVS 192
Query: 153 SYLKFGTDMGYR---RPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
+YL FG+ T + N+FY +++ ISI + P + +D V G
Sbjct: 193 NYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEVWD--VKGA 250
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR--F 267
GG I+DSGS LT+ Y + +F+ ++ P++ C F F
Sbjct: 251 GGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDI--GPLEYC-FNSTGFEESLV 307
Query: 268 PSMAFYFEDANLRIDGENVFIIDYENHFFLLA-VAPHDDLVALIGSQQQRDTRFVYDLNI 326
P + F+F D ++I + L V+ +++G+ Q++ + +DL +
Sbjct: 308 PRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNHLWEFDLGL 367
Query: 327 DLLSFVKENCS 337
L F +C+
Sbjct: 368 KKLGFAPSSCT 378
>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 468
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 99/375 (26%), Positives = 146/375 (38%), Gaps = 52/375 (13%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCD 42
VRL +GTP++ +L+ DTGS L + +F P S S+ + CD
Sbjct: 106 VRLRVGTPAQPFVLVADTGSDLTWVKCSSPSSSSSSPAASPPQRVFRPAGSKSWSPLPCD 165
Query: 43 HPDCTY---FKCVN-----EQCVYTMKYADQSVTKGFAA--HETISVIGK-GEGKAIFHG 91
C F N + C Y +Y D S +G T+S+ G G KA
Sbjct: 166 SDTCKSYVPFSLANCSSPPDPCSYDYRYKDNSSARGVVGLDSATVSLSGNDGTRKAKLQE 225
Query: 92 ALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT 151
+ GC+ G + DG VL L ISF S+ S RFSYCLV L T
Sbjct: 226 VVLGCTTSYDGQSFKSSDG----VLSLGNSNISFASRAASRFGGRFSYCLVDHLAPRNAT 281
Query: 152 SSYLKFGTDMGY--------RRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFD 203
S+L FG R P P FY++S+ +++ ER+ PD +D
Sbjct: 282 -SFLTFGNGDSSPGDDSSSRRTPLVLLEDARTRP--FYFVSVDAVTVAGERLEILPDVWD 338
Query: 204 ITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET 263
GG I+DSG+ LT + Y + + F + +P + CY
Sbjct: 339 F--RKNGGAILDSGTSLTILATPAYDAVVKAISKQFAGVPRVNM----DPFEYCYNWTGV 392
Query: 264 FNRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHD-DLVALIGSQQQRDTRFVY 322
P M F A ++ID + V V++IG+ Q++ + +
Sbjct: 393 SAEIPRMELRFAGAATLAPPGKSYVIDTAPGVKCIGVVEGAWPGVSVIGNILQQEHLWEF 452
Query: 323 DLNIDLLSFVKENCS 337
DL L F + C+
Sbjct: 453 DLANRWLRFKQSRCA 467
>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
Length = 420
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 97/363 (26%), Positives = 154/363 (42%), Gaps = 53/363 (14%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
R+ +GTP++ V ++ DTGS + + IF+P SSSF+ + C C
Sbjct: 83 ARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDPIFNPSLSSSFKPLACASSICG 142
Query: 48 YFK---CVNE-QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
K C + +C+Y + Y D S T G + ET+S G+ GC +N G
Sbjct: 143 KLKIKGCSRKNECMYQVSYGDGSFTVGDFSTETLSF-----GEHAVRSVAMGCGRNNQGL 197
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT-SSYLKFGTDMG 162
A R +SF SQ G+ FSYCL P E ++ L FG
Sbjct: 198 FHGAAGLLGL-----GRGPLSFPSQTGTSYASVFSYCL----PRRESAIAASLVFGPSAV 248
Query: 163 YRRPSTQATKFINHPN----NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
+ + TK + PN +YY+ L I + +N PPD F + G GG I+DSG+
Sbjct: 249 PEK--ARFTKLL--PNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGGVIVDSGT 304
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL---CYFLPETFN-RFPSMAFYF 274
++ + Y L + F S L P I L CY L P++ F
Sbjct: 305 AISRLTTPAYTALRDAFRS------LVTFPSAPG-ISLFDTCYDLSSMKTATLPAVVLDF 357
Query: 275 E-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
+ A++ + + + + + + LA AP ++ ++IG+ QQ+ R D + +
Sbjct: 358 DGGASMPLPADGILVNVDDEGTYCLAFAPEEEAFSIIGNVQQQTFRISIDNQKEQMGIAP 417
Query: 334 ENC 336
+ C
Sbjct: 418 DQC 420
>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
gi|194690124|gb|ACF79146.1| unknown [Zea mays]
gi|194708040|gb|ACF88104.1| unknown [Zea mays]
gi|223950469|gb|ACN29318.1| unknown [Zea mays]
gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
Length = 500
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 95/375 (25%), Positives = 158/375 (42%), Gaps = 68/375 (18%)
Query: 6 IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYFK- 50
+G +I+DT S L + +FDP S S+ + CD P C +
Sbjct: 147 VGLGGGEATVIVDTASELTWVQCAPCESCHDQQGPLFDPSSSPSYAAVPCDSPSCDALQQ 206
Query: 51 ------------C---VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFG 95
C C Y + Y D S ++G AH+ +S+ G+ + G +FG
Sbjct: 207 QLATGAGAGAPPCDAGRPAACSYALSYRDGSYSRGVLAHDRLSLAGE-----VIDGFVFG 261
Query: 96 CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYL 155
C N G G +G++GL R +S +SQ FSYCL PL S L
Sbjct: 262 CGTSNQG----PPFGGTSGLMGLGRSQLSLVSQTVDQFGGVFSYCL--PLSRESDASGSL 315
Query: 156 KFGTDMGYRRPSTQA--TKFINHPNN-----FYYLSLKDISIDNERMNFPPDTFDITVSG 208
G D R ST T +++ + FY ++L I++ + + +
Sbjct: 316 VLGDDPSAYRNSTPVVYTSMVSNSDPLLQGPFYLVNLTGITVGGQEVE--------STGF 367
Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPETFN- 265
I+DSG+V+T VY + +F+S QLA+ P + C+ +
Sbjct: 368 SARAIVDSGTVITSLVPSVYNAVRAEFMS-----QLAEYPQAPGFSILDTCFNMTGLKEV 422
Query: 266 RFPSMAFYFED-ANLRID-GENVFIIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFV 321
+ PS+ F+ A + +D G ++ + ++ LAVA +D ++IG+ QQ++ R V
Sbjct: 423 QVPSLTLVFDGGAEVEVDSGGVLYFVSSDSSQVCLAVASLKSEDETSIIGNYQQKNLRVV 482
Query: 322 YDLNIDLLSFVKENC 336
+D + + F +E C
Sbjct: 483 FDTSASQVGFAQETC 497
>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 396
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 101/358 (28%), Positives = 159/358 (44%), Gaps = 52/358 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+++L +GTP + I+DTGS + + IFDP KSS+F++ CD
Sbjct: 66 LMKLQVGTPPFEIQAIIDTGSEITWTQCLPCVHCYEQNAPIFDPSKSSTFKEKRCD---- 121
Query: 47 TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
C Y + Y D + T G A ETI++ + + GC ++N F
Sbjct: 122 ------GHSCPYEVDYFDHTYTMGTLATETITLHSTSGEPFVMPETIIGCGHNNSWFKP- 174
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD---MGY 163
+ +G++GL+ S I+Q+G SYC +G+ TS + FG + G
Sbjct: 175 ----SFSGMVGLNWGPSSLITQMGGEYPGLMSYCF-----SGQGTSK-INFGANAIVAGD 224
Query: 164 RRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
ST P FYYL+L +S+ N R+ TF EG +IDSG+ LTYF
Sbjct: 225 GVVSTTMFMTTAKP-GFYYLNLDAVSVGNTRIETMGTTFHAL---EGNIVIDSGTTLTYF 280
Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDA-NLRID 282
Y L + V + + +D LCY +T + FP + +F +L +D
Sbjct: 281 PVS-YCNLVRQAVEHV--VTAVRAADPTGNDMLCYN-SDTIDIFPVITMHFSGGVDLVLD 336
Query: 283 GENVFIIDYENHFFLLAV---APHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
N+++ F LA+ +P + A+ G++ Q + YD + L+SF NCS
Sbjct: 337 KYNMYMESNNGGVFCLAIICNSPTQE--AIFGNRAQNNFLVGYDSSSLLVSFSPTNCS 392
>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
Length = 475
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 103/363 (28%), Positives = 160/363 (44%), Gaps = 47/363 (12%)
Query: 6 IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYFKC 51
+GTP+ L++LDTGS +++ +FDPR+S S+ ++C P C
Sbjct: 128 VGTPATTALMVLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRSRSYAAVDCVAPICRRLDS 187
Query: 52 VN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
C+Y + Y D SVT G A ET++ +G A GC +DN G
Sbjct: 188 AGCDRRRNSCLYQVAYGDGSVTAGDFASETLT-FARG---ARVQRVAIGCGHDNEGLFI- 242
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV---IPLPNGEYTSSYLKFGTDMGY 163
A +G+LGL R +SF SQ+ + FSYCLV + SS + FG
Sbjct: 243 ----AASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSVRPSSTRSSTVTFGAGAVA 298
Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITV---SGEGGCIIDSGS 218
T +P FYY+ L S+ R+ + D+ + +G GG I+DSG+
Sbjct: 299 AAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQS-DLRLNPTTGRGGVILDSGT 357
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL---CYFLP-ETFNRFPSMAFYF 274
+T VY + + F R L P L CY L + P+++ +
Sbjct: 358 SVTRLARPVYEAVRDAF-----RAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMHL 412
Query: 275 E-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
A++ + EN I + F A+A D V++IG+ QQ+ R V+D + + FV
Sbjct: 413 AGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGFVP 472
Query: 334 ENC 336
++C
Sbjct: 473 KSC 475
>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
Length = 484
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 98/362 (27%), Positives = 152/362 (41%), Gaps = 42/362 (11%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
+RL +GTP+ + ++LDTGS +++ +F+P KS +F + C C
Sbjct: 138 MRLGVGTPATNMYMVLDTGSDVVWLQCSPCKVCYNQSDPVFNPAKSKTFATVPCGSRLCR 197
Query: 48 YF----KCVNEQ---CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
+CV+ + C+Y + Y D S T G + ET++ G + H AL GC +DN
Sbjct: 198 RLDDSSECVSRRSKACLYQVSYGDGSFTVGDFSTETLTF----HGARVDHVAL-GCGHDN 252
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
G A R +SF SQ + +FSYCLV +G +
Sbjct: 253 EGLFVGAAGLLGL-----GRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVFG 307
Query: 161 MGYRRPSTQATKFINHP--NNFYYLSLKDISIDNERM-NFPPDTFDITVSGEGGCIIDSG 217
G + T + +P + FYYL L IS+ R+ F + +G GG IIDSG
Sbjct: 308 NGAVPKTAVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVIIDSG 367
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPE-TFNRFPSMAFYF 274
+ +T Y L + F R +L P C+ L T + P++ F+F
Sbjct: 368 TSVTRLTQSAYVALRDAF-----RLGATRLKRAPSYSLFDTCFDLSGMTTVKVPTVVFHF 422
Query: 275 EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
+ + N I F A A +++IG+ QQ+ R YDL + F+
Sbjct: 423 TGGEVSLPASNYLIPVNNQGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGSRVGFLSR 482
Query: 335 NC 336
C
Sbjct: 483 AC 484
>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 414
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 106/369 (28%), Positives = 161/369 (43%), Gaps = 62/369 (16%)
Query: 6 IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYFKC 51
+G SK + +I+DTGS L + IF P SSS+Q ++C+ C +
Sbjct: 69 MGLGSKNMTVIIDTGSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVSCNSSTCQSLQF 128
Query: 52 VN-----------EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
C Y + Y D S T G E +S G +FGC +N
Sbjct: 129 ATGNTGACGSSNPSTCNYVVNYGDGSYTNGELGVEALSF-----GGVSVSDFVFGCGRNN 183
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSS-YLKFGT 159
G G ++G++GL R +S +SQ + FSYCL P E SS L G
Sbjct: 184 KGLF-----GGVSGLMGLGRSYLSLVSQTNATFGGVFSYCL----PTTEAGSSGSLVMGN 234
Query: 160 DMGYRRPSTQAT--KFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
+ + + T + +++P +NFY L+L I + + P +F G GG +ID
Sbjct: 235 ESSVFKNANPITYTRMLSNPQLSNFYILNLTGIDVGGVALK-APLSF-----GNGGILID 288
Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPETFN-RFPSMAF 272
SG+V+T S VY L +F+ F F A P + C+ L P+++
Sbjct: 289 SGTVITRLPSSVYKALKAEFLKKFTGFPSA-----PGFSILDTCFNLTGYDEVSIPTISL 343
Query: 273 YFE-DANLRIDGENVF-IIDYENHFFLLAVAPHDDL--VALIGSQQQRDTRFVYDLNIDL 328
FE +A L +D F ++ + LA+A D A+IG+ QQR+ R +YD
Sbjct: 344 RFEGNAQLNVDATGTFYVVKEDASQVCLALASLSDAYDTAIIGNYQQRNQRVIYDTKQSK 403
Query: 329 LSFVKENCS 337
+ F +E CS
Sbjct: 404 VGFAEEPCS 412
>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
Length = 481
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 103/363 (28%), Positives = 160/363 (44%), Gaps = 47/363 (12%)
Query: 6 IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYFKC 51
+GTP+ L++LDTGS +++ +FDPR+S S+ ++C P C
Sbjct: 134 VGTPATTALMVLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRSRSYAAVDCVAPICRRLDS 193
Query: 52 VN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
C+Y + Y D SVT G A ET++ +G A GC +DN G
Sbjct: 194 AGCDRRRNSCLYQVAYGDGSVTAGDFASETLT-FARG---ARVQRVAIGCGHDNEGLFI- 248
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV---IPLPNGEYTSSYLKFGTDMGY 163
A +G+LGL R +SF SQ+ + FSYCLV + SS + FG
Sbjct: 249 ----AASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSVRPSSTRSSTVTFGAGAVA 304
Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITV---SGEGGCIIDSGS 218
T +P FYY+ L S+ R+ + D+ + +G GG I+DSG+
Sbjct: 305 AAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQS-DLRLNPTTGRGGVILDSGT 363
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL---CYFLP-ETFNRFPSMAFYF 274
+T VY + + F R L P L CY L + P+++ +
Sbjct: 364 SVTRLARPVYEAVRDAF-----RAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMHL 418
Query: 275 E-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
A++ + EN I + F A+A D V++IG+ QQ+ R V+D + + FV
Sbjct: 419 AGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGFVP 478
Query: 334 ENC 336
++C
Sbjct: 479 KSC 481
>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
Length = 353
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 97/363 (26%), Positives = 154/363 (42%), Gaps = 53/363 (14%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
R+ +GTP++ V ++ DTGS + + IF+P SSSF+ + C C
Sbjct: 16 ARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDPIFNPSLSSSFKPLACASSICG 75
Query: 48 YFK---CVNE-QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
K C + +C+Y + Y D S T G + ET+S G+ GC +N G
Sbjct: 76 KLKIKGCSRKNKCMYQVSYGDGSFTVGDFSTETLSF-----GEHAVRSVAMGCGRNNQGL 130
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT-SSYLKFGTDMG 162
A R +SF SQ G+ FSYCL P E ++ L FG
Sbjct: 131 FHGAAGLLGL-----GRGPLSFPSQTGTSYASVFSYCL----PRRESAIAASLVFGPSAV 181
Query: 163 YRRPSTQATKFINHPN----NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
+ + TK + PN +YY+ L I + +N PPD F + G GG I+DSG+
Sbjct: 182 PEK--ARFTKLL--PNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGGVIVDSGT 237
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL---CYFLPETFN-RFPSMAFYF 274
++ + Y L + F S L P I L CY L P++ F
Sbjct: 238 AISRLTTPAYTALRDAFRS------LVTFPSAPG-ISLFDTCYDLSSMKTATLPAVVLDF 290
Query: 275 E-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
+ A++ + + + + + + LA AP ++ ++IG+ QQ+ R D + +
Sbjct: 291 DGGASMPLPADGILVNVDDEGTYCLAFAPEEEAFSIIGNVQQQTFRISIDNQKEQMGIAP 350
Query: 334 ENC 336
+ C
Sbjct: 351 DQC 353
>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 406
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 94/363 (25%), Positives = 154/363 (42%), Gaps = 43/363 (11%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
+R+ +GTP + + L++DTGS +++ AIFDP KSS++ + C C
Sbjct: 60 IRISVGTPPRRMYLVMDTGSDILWLQCAPCVNCYHQSDAIFDPYKSSTYSTLGCSTRQCL 119
Query: 48 YF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIG-KGEGKAIFHGALFGCSNDNHGF 103
C +C+Y + Y D S T G + +S+ G G+ + + GC +DN G+
Sbjct: 120 NLDIGTCQANKCLYQVDYGDGSFTTGEFGTDDVSLNSTSGVGQVVLNKIPLGCGHDNEGY 179
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
A + +SF +Q+ RFSYCL + SS + G
Sbjct: 180 FVGAAGLLGL-----GKGPLSFPNQVDPQNGGRFSYCLTDRETDSTEGSSLV-----FGE 229
Query: 164 RRPSTQATKFINHPNN-----FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
+F +N FYYL + IS+ + P F + G GG IIDSG+
Sbjct: 230 AAVPPAGARFTPQDSNMRVPTFYYLKMTGISVGGTILTIPTSAFQLDSLGNGGVIIDSGT 289
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPETFN-RFPSMAFYFE 275
+T + Y L + F R + L+ CY L + P++ +F+
Sbjct: 290 SVTRLQNAAYASLRDAF-----RAGTSDLAPTAGFSLFDTCYDLSGLASVDVPTVTLHFQ 344
Query: 276 DA-NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
+L++ N I ++ F LA A ++IG+ QQ+ R +YD + + FV
Sbjct: 345 GGTDLKLPASNYLIPVDNSNTFCLAFAGTTG-PSIIGNIQQQGFRVIYDNLHNQVGFVPS 403
Query: 335 NCS 337
C+
Sbjct: 404 QCN 406
>gi|359483137|ref|XP_002272278.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 402
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 86/294 (29%), Positives = 128/294 (43%), Gaps = 38/294 (12%)
Query: 56 CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGV 115
C Y + Y D S T+G HE + G + +FGC +N G G ++G+
Sbjct: 133 CNYAINYGDGSFTRGELGHEKLKF-----GTILVKDFIFGCGRNNKGLF-----GGVSGL 182
Query: 116 LGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQA-TKFI 174
+GL R +S ISQ I FSYCL P + + S + G YR S + K I
Sbjct: 183 MGLGRSDLSLISQTSGIFGGVFSYCL--PSTERKGSGSLILGGNSSVYRNSSPISYAKMI 240
Query: 175 NHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLH 232
+P NFY+++L ISI + P G ++DSG+V+T +Y L
Sbjct: 241 ENPQLYNFYFINLTGISIGGVALQAPS-------VGPSRILVDSGTVITRLPPTIYKALK 293
Query: 233 EKFVSYFERFQLAQLSDCPEP----IQLCYFLPETFN-RFPSMAFYFE-DANLRIDGENV 286
+F+ F F P P + C+ L P++ +FE +A L +D V
Sbjct: 294 AEFLKQFTGFP-------PAPAFSILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGV 346
Query: 287 FII---DYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
F D LA + D VA++G+ QQ++ R +YD + F E CS
Sbjct: 347 FYFVKSDASQVCLALASLEYQDEVAILGNYQQKNLRVIYDTKETKVGFALETCS 400
>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 396
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 97/360 (26%), Positives = 165/360 (45%), Gaps = 37/360 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+++L +GTP V ++DTGS L++A +F+P +S+++ I CD +C
Sbjct: 51 LMKLTLGTPPVDVYGLVDTGSDLVWAQCTPCQGCYRQKSPMFEPLRSNTYTPIPCDSEEC 110
Query: 47 TYF---KCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C ++ C Y+ YAD SVTKG A ET++ + +FGC + N G
Sbjct: 111 NSLFGHSCSPQKLCAYSYAYADSSVTKGVLARETVTFSSTDGEPVVVGDIVFGCGHSNSG 170
Query: 103 -FDEDARDGALAGVLGLSRVTISFISQLGSII-KKRFSYCLVIPLPNGEYTSSYLKFGTD 160
F+E+ +S +SQ G++ KRFS CLV P +T + FG
Sbjct: 171 TFNENDMGIIGL-----GGGPLSLVSQFGNLYGSKRFSQCLV-PFHADPHTLGTISFGDA 224
Query: 161 MGYRRPSTQATKFINHPNNFYYL-SLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
AT ++ YL +L+ IS+ + ++F + +G +IDSG+
Sbjct: 225 SDVSGEGVAATPLVSEEGQTPYLVTLEGISVGDTFVSFNSS----EMLSKGNIMIDSGTP 280
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-PIQLCYFLPETFNRFPSMAFYFEDAN 278
TY + Y +L ++ + + + D P+ QLCY ET P + +FE A+
Sbjct: 281 ATYLPQEFYDRLVKELKV---QSNMLPIDDDPDLGTQLCY-RSETNLEGPILIAHFEGAD 336
Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCSD 338
+++ FI ++ F A+A D + G+ Q + +DL+ +SF +CS+
Sbjct: 337 VQLMPIQTFIPP-KDGVFCFAMAGTTDGEYIFGNFAQSNVLIGFDLDRKTVSFKATDCSN 395
>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
Length = 454
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 98/380 (25%), Positives = 154/380 (40%), Gaps = 57/380 (15%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
++ + +GTP + V L LDTGS L++ + DP SS+ + CD P
Sbjct: 91 LMHVSVGTPPRPVALTLDTGSDLVWTQCAPCLDCFEQGAAPVLDPAASSTHAALPCDAPL 150
Query: 46 C---TYFKCV-----NEQCVYTMKYADQSVTKGFAAHETISVIGK-GEGKAIFHGALFGC 96
C + C + CVY Y D+S+T G A ++ + G G FGC
Sbjct: 151 CRALPFTSCGGRSWGDRSCVYVYHYGDRSLTVGQLATDSFTFGGDDNAGGLAARRVTFGC 210
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
+ N G + G+ G R S SQL FSYC +SS +
Sbjct: 211 GHINKGIFQANE----TGIAGFGRGRWSLPSQLN---VTSFSYCFTSMFDT--KSSSVVT 261
Query: 157 FG--------TDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITV 206
G T + T+ I +P+ + Y++ L+ IS+ R+ P +
Sbjct: 262 LGAAAAELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGISVGGARVAVPESRLRSST 321
Query: 207 SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP--ETF 264
IIDSG+ +T DVY + +FVS + L + + LC+ LP +
Sbjct: 322 ------IIDSGASITTLPEDVYEAVKAEFVS---QVGLPAAAAGSAALDLCFALPVAALW 372
Query: 265 NR--FPSMAFYFED-ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFV 321
R P++ + + A+ + N DY + + +IG+ QQ++T V
Sbjct: 373 RRPAVPALTLHLDGGADWELPRGNYVFEDYAARVLCVVLDAAAGEQVVIGNYQQQNTHVV 432
Query: 322 YDLNIDLLSFVKENCSDDSA 341
YDL D+LSF C +A
Sbjct: 433 YDLENDVLSFAPARCDKLAA 452
>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
Length = 542
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 98/344 (28%), Positives = 147/344 (42%), Gaps = 53/344 (15%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ L+IGTP V+ I+DTGS L + +FDP+ SS+++ +C C
Sbjct: 93 LMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPLFDPKNSSTYRDSSCGTSFC 152
Query: 47 TYF----KCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
C E+ C + YAD S T G A ET++V F G FGC + +
Sbjct: 153 LALGKDRSCSKEKKCTFRYSYADGSFTGGNLASETLTVDSTAGKPVSFPGFAFGCGHSSG 212
Query: 102 G-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
G FD+ + +G++GL +S ISQL S I FSYCL +P+ SS + FG
Sbjct: 213 GIFDKSS-----SGIVGLGGGELSLISQLKSTINGLFSYCL-LPVSTDSSISSRINFGAS 266
Query: 161 MGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
T +T L K S E EG I+DSG+
Sbjct: 267 GRVSGYGTVSTPL--------RLPYKGYSKKTEVE-------------EGNIIVDSGTTY 305
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLR 280
T+ + Y KL + S + ++ D LCY N P + +F+DAN+
Sbjct: 306 TFLPQEFYSKLEK---SVANSIKGKRVRDPNGIFSLCYNTTAEINA-PIITAHFKDANVE 361
Query: 281 IDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDL 324
+ N F+ E+ VAP D + ++G+ Q + +DL
Sbjct: 362 LQPLNTFMRMQED-LVCFTVAPTSD-IGVLGNLAQVNFLVGFDL 403
Score = 43.9 bits (102), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 35/129 (27%), Positives = 57/129 (44%), Gaps = 5/129 (3%)
Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFP 268
EG I+DSG+ TY + Y KL E S + ++ D LCY P
Sbjct: 417 EGNIIVDSGTTYTYLPLEFYVKLEE---SVAHSIKGKRVRDPNGISSLCYNTTVDQIDAP 473
Query: 269 SMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDL 328
+ +F+DAN+ + N F+ E+ V P D + ++G+ Q + +DL
Sbjct: 474 IITAHFKDANVELQPWNTFLRMQED-LVCFTVLPTSD-IGILGNLAQVNFLVGFDLRKKR 531
Query: 329 LSFVKENCS 337
+SF +C+
Sbjct: 532 VSFKAADCT 540
>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
Length = 449
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 99/371 (26%), Positives = 152/371 (40%), Gaps = 49/371 (13%)
Query: 6 IGTPSKGVLLILDTGSALIYA-----------------------IFDPRKSSSFQKINCD 42
+GTPS+ +L+ DTGS L + +F SSSF+ I C
Sbjct: 89 VGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANLSSSFKTIPCL 148
Query: 43 HPDCT-----YFKCVN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGA 92
C F N C Y +Y+D S GF A+ET++V K K H
Sbjct: 149 TDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVELKEGRKMKLHNV 208
Query: 93 LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS 152
L GCS G A D GV+GL SF + +FSYCLV L + + S
Sbjct: 209 LIGCSESFQGQSFQAAD----GVMGLGYSKYSFAIKAAEKFGGKFSYCLVDHLSH-KNVS 263
Query: 153 SYLKFGTDMGYR---RPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
+YL FG+ T + N+FY +++ ISI + P + +D V G
Sbjct: 264 NYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEVWD--VKGA 321
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR--F 267
GG I+DSGS LT+ Y + +F+ ++ P++ C F F
Sbjct: 322 GGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDI--GPLEYC-FNSTGFEESLV 378
Query: 268 PSMAFYFEDANLRIDGENVFIIDYENHFFLLA-VAPHDDLVALIGSQQQRDTRFVYDLNI 326
P + F+F D ++I + L V+ +++G+ Q++ + +DL +
Sbjct: 379 PRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNHLWEFDLGL 438
Query: 327 DLLSFVKENCS 337
L F +C+
Sbjct: 439 KKLGFAPSSCT 449
>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 526
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 96/352 (27%), Positives = 152/352 (43%), Gaps = 54/352 (15%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
+V++ +G P + +I D + + +IFDP +SSS+ ++C+ C
Sbjct: 188 LVQIGVGGPPQKFYMIFDLQTDFTWLQCQPCIKCYDQPDSIFDPSQSSSYTLLSCETKHC 247
Query: 47 TYF---KCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C ++ C Y + Y D + T+G +ET+S E GCSN N G
Sbjct: 248 NLLPNSSCSDDGYCRYNITYKDGTNTEGVLINETVSF----ESSGWVDRVSLGCSNKNQG 303
Query: 103 --FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
D G GL R ++SF S++ + SYCLV Y+SS L+F +
Sbjct: 304 PFVGSD-------GTFGLGRGSLSFPSRINA---SSMSYCLV--ESKDGYSSSTLEFNSP 351
Query: 161 MGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
+ K + +P N YY+ LK I + E+++ P TF I G GG I+ S S
Sbjct: 352 ---PCSGSVKAKLLQNPKAENLYYVGLKGIKVGGEKIDVPNSTFTIDPYGNGGMIVSSSS 408
Query: 219 VLTYFHSDVYWKLHEKFVS---YFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYF 274
++T +D Y + + FV+ + ER + D CY L P + F
Sbjct: 409 LITMLENDTYNVVRDAFVAKTQHLERLKAFLQFD------TCYNLSSNNTVELPILEFEV 462
Query: 275 EDAN--LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDL 324
D L ++ +D +N F A AP +++G+ QQ TR +DL
Sbjct: 463 NDGKSWLLPKESYLYAVD-KNGTFCFAFAPSKGSFSILGTLQQYGTRVTFDL 513
>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 463
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 89/361 (24%), Positives = 150/361 (41%), Gaps = 50/361 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGS----------------ALIYAIFDPRKSSSFQKINCDHP 44
++ + +GTP+ + +DTGS A A+FDP KSS+++ ++C
Sbjct: 128 VISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPCYAQTGALFDPAKSSTYRAVSCAAA 187
Query: 45 DCTYFK-------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
+C + N +C Y ++Y D S T G + +T+++ G + F FGCS
Sbjct: 188 ECAQLEQQGNGCGATNYECQYGVQYGDGSTTNGTYSRDTLTLSGASDAVKGFQ---FGCS 244
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
+ GF D D G++GL S +SQ + FSYCL P +
Sbjct: 245 HVESGF-SDQTD----GLMGLGGGAQSLVSQTAAAYGNSFSYCLP---PTSGSSGFLTLG 296
Query: 158 GTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
G +T+ + P FY L+DI++ +++ P F G ++DSG
Sbjct: 297 GGGGVSGFVTTRMLRSRQIP-TFYGARLQDIAVGGKQLGLSPSVF------AAGSVVDSG 349
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE- 275
+++T Y L F + ++++ A + C+ F +T P++A F
Sbjct: 350 TIITRLPPTAYSALSSAFKAGMKQYRSAPARSI---LDTCFDFAGQTQISIPTVALVFSG 406
Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
A + +D + Y N A D +IG+ QQR +YD+ L F
Sbjct: 407 GAAIDLDPNGIM---YGNCLAFAATG-DDGTTGIIGNVQQRTFEVLYDVGSSTLGFRSGA 462
Query: 336 C 336
C
Sbjct: 463 C 463
>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
Length = 463
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 89/361 (24%), Positives = 150/361 (41%), Gaps = 50/361 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGS----------------ALIYAIFDPRKSSSFQKINCDHP 44
++ + +GTP+ + +DTGS A A+FDP KSS+++ ++C
Sbjct: 128 VISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPCHAQTGALFDPAKSSTYRAVSCAAA 187
Query: 45 DCTYFK-------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
+C + N +C Y ++Y D S T G + +T+++ G + F FGCS
Sbjct: 188 ECAQLEQQGNGCGATNYECQYGVQYGDGSTTNGTYSRDTLTLSGASDAVKGFQ---FGCS 244
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
+ GF D D G++GL S +SQ + FSYCL P +
Sbjct: 245 HLESGF-SDQTD----GLMGLGGGAQSLVSQTAAAYGNSFSYCLP---PTSGSSGFLTLG 296
Query: 158 GTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
G +T+ + P FY L+DI++ +++ P F G ++DSG
Sbjct: 297 GGGGASGFVTTRMLRSKQIP-TFYGARLQDIAVGGKQLGLSPSVF------AAGSVVDSG 349
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE- 275
+++T Y L F + ++++ A + C+ F +T P++A F
Sbjct: 350 TIITRLPPTAYSALSSAFKAGMKQYRSAPARSI---LDTCFDFAGQTQISIPTVALVFSG 406
Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
A + +D + Y N A D +IG+ QQR +YD+ L F
Sbjct: 407 GAAIDLDPNGIM---YGNCLAFAATG-DDGTTGIIGNVQQRTFEVLYDVGSSTLGFRSGA 462
Query: 336 C 336
C
Sbjct: 463 C 463
>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
Length = 485
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 92/360 (25%), Positives = 147/360 (40%), Gaps = 47/360 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+V + +GTP+K +I DTGS L + +FDP SS++ + C P+C
Sbjct: 150 VVSVGLGTPAKQYAVIFDTGSDLSWVQCKPCADCYEQQDPLFDPSLSSTYAAVACGAPEC 209
Query: 47 TYFKC----VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
+ +C Y ++Y DQS T G +T+++ G +FGC + N G
Sbjct: 210 QELDASGCSSDSRCRYEVQYGDQSQTDGNLVRDTLTL----SASDTLPGFVFGCGDQNAG 265
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
G + G+ GL R +S SQ F+YC LP+ YL G G
Sbjct: 266 L-----FGQVDGLFGLGREKVSLPSQGAPSYGPGFTYC----LPSSSSGRGYLSLG---G 313
Query: 163 YRRPSTQATKFINHPN-NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
+ Q T + +FYY+ L I + + P F +IDSG+V+T
Sbjct: 314 APPANAQFTALADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGG----TVIDSGTVIT 369
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE-DANL 279
Y L F +++ A + CY F + P++ F A +
Sbjct: 370 RLPPRAYAPLRAAFARSMAQYKKAPALSI---LDTCYDFTGHRTAQIPTVELAFAGGATV 426
Query: 280 RIDGENVFIIDYENHFFLLAVAPH--DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+D V + + LA AP+ D +A++G+ QQ+ YD+ + F + CS
Sbjct: 427 SLDFTGVLYVSKVSQ-ACLAFAPNADDSSIAILGNTQQKTFAVTYDVANQRIGFGAKGCS 485
>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
Length = 485
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 92/360 (25%), Positives = 147/360 (40%), Gaps = 47/360 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+V + +GTP+K +I DTGS L + +FDP SS++ + C P+C
Sbjct: 150 VVSVGLGTPAKQYAVIFDTGSDLSWVQCKPCADCYEQQDPLFDPSLSSTYAAVACGAPEC 209
Query: 47 TYFKC----VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
+ +C Y ++Y DQS T G +T+++ G +FGC + N G
Sbjct: 210 QELDASGCSSDSRCRYEVQYGDQSQTDGNLVRDTLTL----SASDTLPGFVFGCGDQNAG 265
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
G + G+ GL R +S SQ F+YC LP+ YL G G
Sbjct: 266 L-----FGQVDGLFGLGREKVSLPSQGAPSYGPGFTYC----LPSSSSGRGYLSLG---G 313
Query: 163 YRRPSTQATKFINHPN-NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
+ Q T + +FYY+ L I + + P F +IDSG+V+T
Sbjct: 314 APPANAQFTALADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGG----TVIDSGTVIT 369
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE-DANL 279
Y L F +++ A + CY F + P++ F A +
Sbjct: 370 RLPPRAYAPLRAAFARSMAQYKKAPALSI---LDTCYDFTGHRTAQIPTVELAFAGGATV 426
Query: 280 RIDGENVFIIDYENHFFLLAVAPH--DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+D V + + LA AP+ D +A++G+ QQ+ YD+ + F + CS
Sbjct: 427 SLDFTGVLYVSKVSQ-ACLAFAPNADDSSIAILGNTQQKTFAVAYDVANQRIGFGAKGCS 485
>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 449
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 99/371 (26%), Positives = 152/371 (40%), Gaps = 49/371 (13%)
Query: 6 IGTPSKGVLLILDTGSALIYA-----------------------IFDPRKSSSFQKINCD 42
+GTPS+ +L+ DTGS L + +F SSSF+ I C
Sbjct: 89 VGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANLSSSFKTIPCL 148
Query: 43 HPDCT-----YFKCVN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGA 92
C F N C Y +Y+D S GF A+ET++V K K H
Sbjct: 149 TDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVELKEGRKMKLHNV 208
Query: 93 LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS 152
L GCS G A D GV+GL SF + +FSYCLV L + + S
Sbjct: 209 LIGCSESFQGQSFQAAD----GVMGLGYSKYSFAIKAAEKFGGKFSYCLVDHLSH-KNVS 263
Query: 153 SYLKFGTDMGYR---RPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
+YL FG+ T + N+FY +++ ISI + P + +D V G
Sbjct: 264 NYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEVWD--VKGA 321
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR--F 267
GG I+DSGS LT+ Y + +F+ ++ P++ C F F
Sbjct: 322 GGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDI--GPLEYC-FNSTGFEESLV 378
Query: 268 PSMAFYFEDANLRIDGENVFIIDYENHFFLLA-VAPHDDLVALIGSQQQRDTRFVYDLNI 326
P + F+F D ++I + L V+ +++G+ Q++ + +DL +
Sbjct: 379 PRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNHLWEFDLGL 438
Query: 327 DLLSFVKENCS 337
L F +C+
Sbjct: 439 KKLGFAPSSCT 449
>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
Length = 373
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 98/361 (27%), Positives = 155/361 (42%), Gaps = 57/361 (15%)
Query: 15 LILDTGSALIYA------------------IFDPRKSSSFQKINCDHPDC-----TYFKC 51
LI+DTGS LI+ ++DP +SS+F + C C ++ C
Sbjct: 28 LIVDTGSDLIWTQCKLSSSTAAAARHGSPPVYDPGESSTFAFLPCSDRLCQEGQFSFKNC 87
Query: 52 VNE-QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDG 110
++ +CVY Y + G A ET + G +A+ FGC + G A
Sbjct: 88 TSKNRCVYEDVYGSAAAV-GVLASETFTF---GARRAVSLRLGFGCGALSAGSLIGA--- 140
Query: 111 ALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPST-- 168
G+LGLS ++S I+QL +RFSYCL P + +S L FG R T
Sbjct: 141 --TGILGLSPESLSLITQLK---IQRFSYCLT---PFADKKTSPLLFGAMADLSRHKTTR 192
Query: 169 --QATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
Q T +++P +YY+ L IS+ ++R+ P + + G GG I+DSGS + Y
Sbjct: 193 PIQTTAIVSNPVETVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLV 252
Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDGE 284
+ + E + + +L + E +LC+ LP A L DG
Sbjct: 253 EAAFEAVKE---AVMDVVRLPVANRTVEDYELCFVLPRRTAAAAMEAVQVPPLVLHFDGG 309
Query: 285 NVFIIDYENHF-------FLLAVAPHDD--LVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
++ +N+F LAV D V++IG+ QQ++ ++D+ SF
Sbjct: 310 AAMVLPRDNYFQEPRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQHHKFSFAPTQ 369
Query: 336 C 336
C
Sbjct: 370 C 370
>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 461
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 101/360 (28%), Positives = 152/360 (42%), Gaps = 44/360 (12%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
R+ +GTP++ V ++LDTGS +++ +FDP KS ++ I C P C
Sbjct: 120 TRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQTDHVFDPTKSRTYAGIPCGAPLCR 179
Query: 48 YF---KCVNEQ--CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C N+ C Y + Y D S T G + ET++ + AL GC +DN G
Sbjct: 180 RLDSPGCSNKNKVCQYQVSYGDGSFTFGDFSTETLTF----RRNRVTRVAL-GCGHDNEG 234
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
A R +SF Q G +FSYCLV + + +S + FG
Sbjct: 235 LFTGAAGLLGL-----GRGRLSFPVQTGRRFNHKFSYCLVDRSASAKPSS--VIFGDSAV 287
Query: 163 YRRPSTQATKFINHP--NNFYYLSLKDISIDNERMN-FPPDTFDITVSGEGGCIIDSGSV 219
R + T I +P + FYYL L IS+ + F + +G GG IIDSG+
Sbjct: 288 SR--TAHFTPLIKNPKLDTFYYLELLGISVGGAPVRGLSASLFRLDAAGNGGVIIDSGTS 345
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCYFLPE-TFNRFPSMAFYFED 276
+T Y L + F R + L PE C+ L T + P++ +F
Sbjct: 346 VTRLTRPAYIALRDAF-----RIGASHLKRAPEFSLFDTCFDLSGLTEVKVPTVVLHFRG 400
Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
A++ + N I + F A A +++IG+ QQ+ R YDL + F C
Sbjct: 401 ADVSLPATNYLIPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRISYDLTGSRVGFAPRGC 460
>gi|255537017|ref|XP_002509575.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549474|gb|EEF50962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 459
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 101/361 (27%), Positives = 150/361 (41%), Gaps = 42/361 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYAIFDPRKSSSFQK---------------INCDHPD 45
+V IG P ++DTGS+L + +P + QK + D D
Sbjct: 111 LVNFSIGQPPVPQYAVMDTGSSLTWIQCEPCINCHQQKGPLYNPSSSSTYVSCSDFDRTD 170
Query: 46 CTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
T+ C Y+ YAD++ T+G A E + +G I H +FGC ++N
Sbjct: 171 TTFTATHGSDCNYSQTYADKTTTRGTYAREQLLFETPDDGITIMHDVIFGCGHNNTQL-- 228
Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM---G 162
G +GV GL S IS+LG FSYC + + + Y L G + G
Sbjct: 229 PGPTGYASGVFGLGDSGSSIISKLGF----GFSYC-IGNIGDPLYGFHRLTLGNKLKIEG 283
Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFD-ITVSG-EGGCIIDSGSVL 220
Y P P YY++L ISI ER++ P F + ++G +IDSG+ L
Sbjct: 284 YSTPLV--------PRGLYYITLVGISIGQERLDIDPIVFQRVDLNGISSRIVIDSGATL 335
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF--LPETFNRFPSMAFYFED-A 277
+Y Y + +K S F L++ + LCY L + FP F+ D A
Sbjct: 336 SYIPRQAYNVVRDKVSSILSGF-LSRYRYIARHLSLCYIGKLNQDLQGFPDATFHLADGA 394
Query: 278 NLRIDGENVFIIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
+L E +F Y ++ LA+ P D+ LIG Q+ YDL L F +
Sbjct: 395 DLVFQVEGLF-FQYTDNVLCLALVPTESDEETCLIGLLAQQYYNVAYDLKQQKLYFQRIE 453
Query: 336 C 336
C
Sbjct: 454 C 454
>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
Length = 475
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 102/363 (28%), Positives = 160/363 (44%), Gaps = 47/363 (12%)
Query: 6 IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYFKC 51
+GTP+ L++LDTGS +++ +FDPR+S S+ ++C P C
Sbjct: 128 VGTPATTALMVLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRSRSYAAVDCVAPICRRLDS 187
Query: 52 VN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
C+Y + Y D SVT G A ET++ +G A GC +DN G
Sbjct: 188 AGCDRRRNSCLYQVAYGDGSVTAGDFASETLT-FARG---ARVQRVAIGCGHDNEGLFI- 242
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV---IPLPNGEYTSSYLKFGTDMGY 163
A +G+LGL R +SF +Q+ + FSYCLV + SS + FG
Sbjct: 243 ----AASGLLGLGRGRLSFPTQIARSFGRSFSYCLVDRTSSVRPSSTRSSTVTFGAGAVA 298
Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITV---SGEGGCIIDSGS 218
T +P FYY+ L S+ R+ + D+ + +G GG I+DSG+
Sbjct: 299 AAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQS-DLRLNPTTGRGGVILDSGT 357
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL---CYFLP-ETFNRFPSMAFYF 274
+T VY + + F R L P L CY L + P+++ +
Sbjct: 358 SVTRLARPVYEAVRDAF-----RAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMHL 412
Query: 275 E-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
A++ + EN I + F A+A D V++IG+ QQ+ R V+D + + FV
Sbjct: 413 AGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGFVP 472
Query: 334 ENC 336
++C
Sbjct: 473 KSC 475
>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 92/361 (25%), Positives = 151/361 (41%), Gaps = 50/361 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
+V + +GTP+ ++ DTGS + +FDP KSS++ ++C
Sbjct: 164 VVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKEPLFDPAKSSTYANVSCTDSA 223
Query: 46 CTYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C C C+Y ++Y D S T GF A +T+++ G FGC N+G
Sbjct: 224 CADLDTNGCTGGHCLYAVQYGDGSYTVGFFAQDTLTIA-----HDAIKGFRFGCGEKNNG 278
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
G AG++GL R S Q + F+YC LP + YL FG G
Sbjct: 279 L-----FGKTAGLMGLGRGKTSLTVQAYNKYGGAFAYC----LPALTTGTGYLDFGP--G 327
Query: 163 YRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
+ + T + + FYY+ + I + +++ F G ++DSG+V+T
Sbjct: 328 SAGNNARLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVFSTA-----GTLVDSGTVIT 382
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQ-LSDCP--EPIQLCY-FLPETFNRFPSMAFYFE-D 276
+ Y L S F++ LA+ P + CY F + P+++ F+
Sbjct: 383 RLPATAYTALS----SAFDKVMLARGYKKAPGYSILDTCYDFTGLSDVELPTVSLVFQGG 438
Query: 277 ANLRIDGEN-VFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
A L +D V+ I A D+ VA++G+ QQ+ +YDL + F +
Sbjct: 439 ACLDVDVSGIVYAISEAQVCLAFASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGS 498
Query: 336 C 336
C
Sbjct: 499 C 499
>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 92/361 (25%), Positives = 151/361 (41%), Gaps = 50/361 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
+V + +GTP+ ++ DTGS + +FDP KSS++ ++C
Sbjct: 164 VVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKGPLFDPAKSSTYANVSCTDSA 223
Query: 46 CTYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C C C+Y ++Y D S T GF A +T+++ G FGC N+G
Sbjct: 224 CADLDTNGCTGGHCLYAVQYGDGSYTVGFFAQDTLTIA-----HDAIKGFRFGCGEKNNG 278
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
G AG++GL R S Q + F+YC LP + YL FG G
Sbjct: 279 L-----FGKTAGLMGLGRGKTSLTVQAYNKYGGAFAYC----LPALTTGTGYLDFGP--G 327
Query: 163 YRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
+ + T + + FYY+ + I + +++ F G ++DSG+V+T
Sbjct: 328 SAGNNARLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVFSTA-----GTLVDSGTVIT 382
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQ-LSDCP--EPIQLCY-FLPETFNRFPSMAFYFE-D 276
+ Y L S F++ LA+ P + CY F + P+++ F+
Sbjct: 383 RLPATAYTALS----SAFDKVMLARGYKKAPGYSILDTCYDFTGLSDVELPTVSLVFQGG 438
Query: 277 ANLRIDGEN-VFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
A L +D V+ I A D+ VA++G+ QQ+ +YDL + F +
Sbjct: 439 ACLDVDVSGIVYAISEAQVCLAFASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGS 498
Query: 336 C 336
C
Sbjct: 499 C 499
>gi|224091849|ref|XP_002309371.1| predicted protein [Populus trichocarpa]
gi|222855347|gb|EEE92894.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 101/360 (28%), Positives = 156/360 (43%), Gaps = 53/360 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
+V +G P L I+DTGS+L++ +FDP SS++ ++C +
Sbjct: 103 LVNFSMGQPPVPQLAIMDTGSSLLWIQCAPCKSCSQQIIGPMFDPSISSTYDSLSCKNII 162
Query: 46 CTYF---KC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
C Y +C + QCVY Y + + G A E + EG+ + LFGCS+ N
Sbjct: 163 CRYAPSGECDSSSQCVYNQTYVEGLPSVGVIATEQLIFGSSDEGRNAVNNVLFGCSHRNG 222
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL-VIPLPNGEYTSSYLKFGTD 160
+ +D GV GL S ++Q+GS +FSYC+ I P+ Y L G +
Sbjct: 223 NY----KDRRFTGVFGLGSGITSVVNQMGS----KFSYCIGNIADPDYSYNQLVLSEGVN 274
Query: 161 M-GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
M GY P ++ + Y + L+ IS+ R+ P F T + IIDSG+
Sbjct: 275 MEGYSTP-------LDVVDGHYQVILEGISVGETRLVIDPSAFKRT-EKQRRVIIDSGTA 326
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY--FLPETFNRFPSMAFYF-ED 276
T+ + Y L + + +RF L+ LCY + + FP++ F+F E
Sbjct: 327 PTWLAENEYRALEREVRNLLDRF----LTPFMRESFLCYKGKVGQDLVGFPAVTFHFAEG 382
Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
A+L +D E Y F +V IG Q+ YDLN L F + +C
Sbjct: 383 ADLVVDTEMRQASVYGKDFKDFSV---------IGLMAQQYYNVAYDLNKHKLFFQRIDC 433
>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
Length = 494
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 104/374 (27%), Positives = 157/374 (41%), Gaps = 52/374 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
M ++ +GTP+ LL LDT S L + +FDPR S+S+ ++N D PDC
Sbjct: 135 MAKIAVGTPAVQALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYGEMNYDAPDC 194
Query: 47 TYFK------CVNEQCVYTMKYAD----QSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
C+YT++Y D S + G ET++ G G +A GC
Sbjct: 195 QALGRSGGGDAKRGTCIYTVQYGDGHGSTSTSVGDLVEETLTFAG-GVRQAYLS---IGC 250
Query: 97 SNDNHG-FDEDARDGALAGVLGLSRVTISFISQLGSI-IKKRFSYCLVIPLPNGEYTSSY 154
+DN G F A AG+LGL R IS Q+ + FSYCLV + SS
Sbjct: 251 GHDNKGLFGAPA-----AGILGLGRGQISIPHQIAFLGYNASFSYCLVDFISGPGSPSST 305
Query: 155 LKFGTDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITV-----S 207
L FG P T + + N FYY+ L +S+ R+ P + + +
Sbjct: 306 LTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRV---PGVTERDLQLDPYT 362
Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLS-DCPEPI-QLCYFLPETFN 265
G GG I+DSG+ +T Y + + L Q+S P + CY +
Sbjct: 363 GRGGVILDSGTTVTRLARPAY--VAFRDAFRAAATSLGQVSTGGPSGLFDTCYTVGGRAG 420
Query: 266 -RFPSMAFYFEDA-NLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVY 322
+ P+++ +F + + +N I +D D V++IG+ Q+ R VY
Sbjct: 421 VKVPAVSMHFAGGVEVSLQPKNYLIPVDSRGTVCFAFAGTGDRSVSVIGNILQQGFRVVY 480
Query: 323 DLNIDLLSFVKENC 336
DL + F NC
Sbjct: 481 DLAGQRVGFAPNNC 494
>gi|79315693|ref|NP_001030891.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332646353|gb|AEE79874.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 499
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 103/346 (29%), Positives = 152/346 (43%), Gaps = 32/346 (9%)
Query: 4 LFIGTPSKGVLLILDTGSALIYAIFDPRKSSSFQKINCDHPDCTYFKCVNEQCVYTMKYA 63
+ +G+P K LILDTGS L + Q + C DC + + N+ C Y Y
Sbjct: 174 VLVGSPPKHFSLILDTGSDLNW----------IQCLPC--YDC-FQQNDNQSCPYYYWYG 220
Query: 64 DQSVTKGFAAHETISV--IGKGEGKAIFH--GALFGCSNDNHGFDEDARDGALAGVLGLS 119
D S T G A ET +V G +++ +FGC + N G A
Sbjct: 221 DSSNTTGDFAVETFTVNLTTNGGSSELYNVENMMFGCGHWNRGLFHGAAGLLGL-----G 275
Query: 120 RVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY-RRPSTQATKFINHPN 178
R +SF SQL S+ FSYCLV + SS L FG D P+ T F+
Sbjct: 276 RGPLSFSSQLQSLYGHSFSYCLV-DRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVAGKE 334
Query: 179 N----FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEK 234
N FYY+ +K I + E +N P +T++I+ G GG IIDSG+ L+YF Y + K
Sbjct: 335 NLVDTFYYVQIKSILVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNK 394
Query: 235 FVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFED-ANLRIDGENVFIIDYE 292
++ + + D P + C+ + N + P + F D A EN FI E
Sbjct: 395 -IAEKAKGKYPVYRDFP-ILDPCFNVSGIHNVQLPELGIAFADGAVWNFPTENSFIWLNE 452
Query: 293 NHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCSD 338
+ L + ++IG+ QQ++ +YD L + C+D
Sbjct: 453 DLVCLAMLGTPKSAFSIIGNYQQQNFHILYDTKRSRLGYAPTKCAD 498
>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 92/359 (25%), Positives = 154/359 (42%), Gaps = 43/359 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY-----------AIFDPRKSSSFQKINCDHPDCTY- 48
+VR+ +GTP + + ++LDT + + F P S++ ++C C+
Sbjct: 99 VVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGCSSTTFLPNASTTLGSLDCSGAQCSQV 158
Query: 49 --FKCV---NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
F C + C++ Y S + I++ + G FGC N G
Sbjct: 159 RGFSCPATGSSACLFNQSYGGDSSLTATLVQDAITL-----ANDVIPGFTFGCINAVSGG 213
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
+ G+LGL R IS ISQ G++ FSYCL P Y S LK G +G
Sbjct: 214 SIPPQ-----GLLGLGRGPISLISQAGAMYSGVFSYCL--PSFKSYYFSGSLKLG-PVGQ 265
Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
+ S + T + +P+ + YY++L +S+ ++ P + + G IIDSG+V+T
Sbjct: 266 PK-SIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVIT 324
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRI 281
F VY+ + ++F R Q+ C F P++ +FE NL +
Sbjct: 325 RFVQPVYFAIRDEF-----RKQVNGPISSLGAFDTC-FAATNEAEAPAITLHFEGLNLVL 378
Query: 282 DGENVFIIDYENHFFLL--AVAPH--DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
EN I L A AP+ + ++ +I + QQ++ R ++D L +E C
Sbjct: 379 PMENSLIHSSSGSLACLSMAAAPNNVNSVLNVIANLQQQNLRIMFDTTNSRLGIARELC 437
>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
Length = 466
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 86/360 (23%), Positives = 149/360 (41%), Gaps = 51/360 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ + +G+P K +++DTGS + + +FDP SS++ +C C
Sbjct: 134 LITVRLGSPGKSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCSSAAC 193
Query: 47 TYFK-----CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
C + QC YT+ Y D S T G + +T+++ G FGCSN
Sbjct: 194 AQLGQEGNGCSSSQCQYTVTYGDGSSTTGTYSSDTLAL-----GSNAVRKFQFGCSNVES 248
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
GF++ G++GL S +SQ FSYC LP +S +L G
Sbjct: 249 GFNDQTD-----GLMGLGGGAQSLVSQTAGTFGAAFSYC----LPATSSSSGFLTLGAGT 299
Query: 162 G--YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
+ P ++++ FY + ++ I + +++ P F G I+DSG+V
Sbjct: 300 SGFVKTPMLRSSQV----PTFYGVRIQAIRVGGRQLSIPTSVF------SAGTIMDSGTV 349
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFEDAN 278
LT Y L F + +++ A S + C+ F ++ P++A F
Sbjct: 350 LTRLPPTAYSALSSAFKAGMKQYPSAPPSGI---LDTCFDFSGQSSVSIPTVALVFSGGA 406
Query: 279 LRIDGENVFIIDYENHFFLLAVAPH--DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ + ++ N LA A + D + +IG+ QQR +YD+ + F C
Sbjct: 407 VVDIASDGIMLQTSNSILCLAFAANSDDSSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 466
>gi|118486912|gb|ABK95290.1| unknown [Populus trichocarpa]
Length = 438
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 92/359 (25%), Positives = 154/359 (42%), Gaps = 43/359 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY-----------AIFDPRKSSSFQKINCDHPDCTY- 48
+VR+ +GTP + + ++LDT + + F P S++ ++C C+
Sbjct: 99 VVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGFSSTTFLPNASTTLGSLDCSGAQCSQV 158
Query: 49 --FKCV---NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
F C + C++ Y S + I++ + G FGC N G
Sbjct: 159 RGFSCPATGSSACLFNQSYGGDSSLTATLVQDAITL-----ANDVIPGFTFGCINAVSGG 213
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
+ G+LGL R IS ISQ G++ FSYCL P Y S LK G +G
Sbjct: 214 SIPPQ-----GLLGLGRGPISLISQAGAMYSGVFSYCL--PSFKSYYFSGSLKLG-PVGQ 265
Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
+ S + T + +P+ + YY++L +S+ ++ P + + G IIDSG+V+T
Sbjct: 266 PK-SIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVIT 324
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRI 281
F VY+ + ++F R Q+ C F P++ +FE NL +
Sbjct: 325 RFVQPVYFAIRDEF-----RKQVNGPISSLGAFDTC-FAATNEAEAPAITLHFEGLNLVL 378
Query: 282 DGENVFIIDYENHFFLL--AVAPH--DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
EN I L A AP+ + ++ +I + QQ++ R ++D L +E C
Sbjct: 379 PMENSLIHSSSGSLACLSMAAAPNNVNSVLNVIANLQQQNLRIMFDTTNSRLGIARELC 437
>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 397
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 94/357 (26%), Positives = 153/357 (42%), Gaps = 45/357 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++RL +GTP ++ +DTGS LI+ IFDP KSS+F++
Sbjct: 62 LMRLQLGTPPFEIVAEIDTGSDLIWTQCMPCPNCYTQFAPIFDPSKSSTFKEK------- 114
Query: 47 TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
+C C Y + YAD+S + G A ET+++ + GC +N
Sbjct: 115 ---RCHGNSCPYEIIYADESYSTGILATETVTIQSTSGEPFVMAETSIGCGLNNSNLMTP 171
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
+ +G++GL+ S ISQ+ I SYC +S + FGT+
Sbjct: 172 GYAASSSGIVGLNMGPSSLISQMDLPIPGLISYCF------SSQGTSKINFGTNAVVAGD 225
Query: 167 STQAT-KFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHS 225
T A FI FYYL+L +S+ ++R+ F + +G IDSG+ TY +
Sbjct: 226 GTVAADMFIKKDQPFYYLNLDAVSVGDKRIETLGTPFH---AQDGNIFIDSGTTYTYLPT 282
Query: 226 DVYWKLHEKFVSYFERFQLAQLSDCPEPIQ---LCYFLPETFNRFPSMAFYFE-DANLRI 281
+ V + + P+P LCY +T FP + +F A+L +
Sbjct: 283 S-----YCNLVREAVAASVVAANQVPDPSSENLLCYNW-DTMEIFPVITLHFAGGADLVL 336
Query: 282 DGENVFIIDYENHFFLLAVAPHD-DLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
D N+++ F LA+ D + A+ G++ + YD + ++SF NCS
Sbjct: 337 DKYNMYVETITGGTFCLAIGCVDPSMPAIFGNRAHNNLLVGYDSSTLVISFSPTNCS 393
>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 435
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 96/358 (26%), Positives = 152/358 (42%), Gaps = 40/358 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ L +GTP ++ + DTGS LI+ +FDP+ SS+++ ++C C
Sbjct: 95 LMNLSLGTPPSPIMAVADTGSNLIWTQCKPCDDCYTQVDPLFDPKASSTYKDVSCSSSQC 154
Query: 47 TYFK----CVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
T + C E C Y + YAD S T G A +T+++ + GC +N
Sbjct: 155 TALENQASCSTEDKTCSYLVSYADGSYTMGKFAVDTLTLGSTDNRPVQLKNIIIGCGQNN 214
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
+ G + L +S I QLG I +FSYCLV P + TS + FGT+
Sbjct: 215 AVTFRNKSSGVVG----LGGGAVSLIKQLGDSIDGKFSYCLV---PENDQTSK-INFGTN 266
Query: 161 MGYRRPSTQATKF-INHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
P T +T + + FYYL+LK IS+ ++ M P +G +IDSG+
Sbjct: 267 AVVSGPGTVSTPLVVKSRDTFYYLTLKSISVGSKNMQTPDSNI------KGNMVIDSGTT 320
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANL 279
LT Y ++ S + D LCY N P + +FE A++
Sbjct: 321 LTLLPVKYYIEIENAVASLINA---DKSKDERIGSSLCYNATADLN-IPVITMHFEGADV 376
Query: 280 RIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
++ N F E+ LA + G+ Q++ YD +SF +C+
Sbjct: 377 KLYPYNSFFKVTED-LVCLAFGMSFYRNGIYGNVAQKNFLVGYDTASKTMSFKPTDCA 433
>gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 98/370 (26%), Positives = 152/370 (41%), Gaps = 52/370 (14%)
Query: 6 IGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHPDC--- 46
+GTP+K +++DTGS L + +F +S SF+ + C C
Sbjct: 94 VGTPAKKFRVVVDTGSELTWVNCRYRGRGKGKVKNRRVFRAEESKSFKTVGCFTQTCKVD 153
Query: 47 -------TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
+ + C Y +YAD S +G A ETI+V KA G L GCS+
Sbjct: 154 LMNLFSLSTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRKARLRGLLVGCSSS 213
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
G D GVLGL+ SF S S+ + SYCLV L N + S+YL FG
Sbjct: 214 FSGQSFQGAD----GVLGLAFSDFSFTSTATSLFGAKLSYCLVDHLSN-KNISNYLIFGY 268
Query: 160 DMGYRRPSTQATKF----INHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
T + + FY +++ ISI ++ ++ P +D T GG I+D
Sbjct: 269 SSSSTSTKTAPGRTTPLDLTLIPPFYAINIIGISIGDDMLDIPTQVWDATTG--GGTILD 326
Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCYFLPETFN--RFPSMA 271
SG+ LT Y + Y + + PE PI+ C+ FN + P +
Sbjct: 327 SGTSLTLLAEAAYKPVVTGLARYLVELKRVK----PEGIPIEYCFSSTSGFNESKLPQLT 382
Query: 272 FYFEDANLRIDGENVFIIDYENHF----FLLAVAPHDDLVALIGSQQQRDTRFVYDLNID 327
F+ + +++D F+ A P ++V G+ Q++ + +DL
Sbjct: 383 FHLKGGARFEPHRKSYLVDAAPGVKCLGFMSAGTPATNVV---GNIMQQNYLWEFDLMAS 439
Query: 328 LLSFVKENCS 337
LSF C+
Sbjct: 440 TLSFAPSTCT 449
>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 494
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 100/365 (27%), Positives = 147/365 (40%), Gaps = 54/365 (14%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPDC 46
V + +GTP K LI DTGS L + AIF+P +S+S+ I+C C
Sbjct: 155 VTVGLGTPKKDFSLIFDTGSDLTWTQCEPCVKSCYNQKEAIFNPSQSTSYANISCGSTLC 214
Query: 47 --------TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSN 98
F C + CVY ++Y D S + GF E +S+ +F+ FGC
Sbjct: 215 DSLASATGNIFNCASSTCVYGIQYGDSSFSIGFFGKEKLSLTATD----VFNDFYFGCGQ 270
Query: 99 DNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
+N G A R +S +SQ K FSYC LP+ ++ +L FG
Sbjct: 271 NNKGLFGGAAGLLGL-----GRDKLSLVSQTAQRYNKIFSYC----LPSSSSSTGFLTFG 321
Query: 159 TDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
+ S I+ ++FY L L IS+ ++ P F G IIDSG+
Sbjct: 322 GSTS-KSASFTPLATISGGSSFYGLDLTGISVGGRKLAISPSVFST-----AGTIIDSGT 375
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCY-FLPETFNRFPSMAFYFE 275
V+T Y L F R ++Q P + C+ F P + +F
Sbjct: 376 VITRLPPAAYSALSSTF-----RKLMSQYPAAPALSILDTCFDFSNHDTISVPKIGLFFS 430
Query: 276 DA-NLRIDGENVFIIDYENHFFLLAVAPHDDL--VALIGSQQQRDTRFVYDLNIDLLSFV 332
+ ID +F ++ LA A + D VA+ G+ QQ+ VYD + F
Sbjct: 431 GGVVVDIDKTGIFYVNDLTQ-VCLAFAGNSDASDVAIFGNVQQKTLEVVYDGAAGRVGFA 489
Query: 333 KENCS 337
CS
Sbjct: 490 PAGCS 494
>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
Length = 525
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 92/363 (25%), Positives = 148/363 (40%), Gaps = 51/363 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
+V + +GTP+ ++ DTGS + +FDP +SS+ I+C P
Sbjct: 187 VVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYEQQEKLFDPARSSTDANISCAAPA 246
Query: 46 CT--YFK-CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C+ Y K C C+Y ++Y D S + GF A +T+++ G FGC N G
Sbjct: 247 CSDLYTKGCSGGHCLYGVQYGDGSYSIGFFAMDTLTL----SSYDAIKGFRFGCGERNEG 302
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
+A AG+LGL R S Q F++C P + YL FG
Sbjct: 303 LFGEA-----AGLLGLGRGKTSLPVQAYDKYGGVFAHC----FPARSSGTGYLDFGPGSS 353
Query: 163 YRRPSTQATK-FINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
+ T +++ FYY+ L I + + ++ PP F G I+DSG+V+T
Sbjct: 354 PAVSTKLTTPMLVDNGLTFYYVGLTGIRVGGKLLSIPPSVFTTA-----GTIVDSGTVIT 408
Query: 222 YFHSDVYWKLHEKFVS------YFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE 275
Y L F S Y + L+ L C + F + P+++ F+
Sbjct: 409 RLPPAAYSSLRSAFASAIAARGYKKAPALSLLDTCYD------FTGMSQVAIPTVSLLFQ 462
Query: 276 -DANLRIDGEN-VFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
A+L +D ++ A DD V ++G+ Q + VYD+ ++ F
Sbjct: 463 GGASLDVDASGIIYAASVSQACLGFAANEEDDDVGIVGNTQLKTFGVVYDIGKKVVGFSP 522
Query: 334 ENC 336
C
Sbjct: 523 GAC 525
>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 458
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 94/368 (25%), Positives = 153/368 (41%), Gaps = 67/368 (18%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA------------IFDPRKSSSFQKINCDHPDCTY 48
++ + IGTP+ +++DTGS + + FDP KSS++ +C CT
Sbjct: 126 VITVSIGTPAMTQAVMIDTGSDVSWVHCHARAGAGSSLFFDPGKSSTYTPFSCSSAACTR 185
Query: 49 FK------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC---SND 99
+ +N C YT++Y D S T G +T++ + E F FGC S+
Sbjct: 186 LEGRDNGCSLNSTCQYTVRYGDGSNTTGTYGSDTLA-LNSTEKVENFQ---FGCSETSDP 241
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
G DED D G++GL S +SQ + FSYC LP +S +L G
Sbjct: 242 GEGLDEDQTD----GLMGLGGGAPSLVSQTAATYGSAFSYC----LPATTRSSGFLTLGA 293
Query: 160 DMGYRRPSTQATKFINHP-------NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
ST + F+ P FY++ L+ I++ + + P F G
Sbjct: 294 -------STGTSGFVTTPMFRSRRAPTFYFVILQGINVGGDPVAISPTVF------AAGS 340
Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMA 271
I+DSG+++T Y L F + R+ A+ + C+ F + P++
Sbjct: 341 IMDSGTIITRLPPRAYSALSAAFRAGMRRYPRARAFSI---LDTCFDFTGQDNVSIPAVE 397
Query: 272 FYFEDANLRIDGENVFIIDYENHFF--LLAVAPHDDLV-ALIGSQQQRDTRFVYDLNIDL 328
F G V +D + + LA AP + ++IG+ QQR ++D+ +
Sbjct: 398 LVFS-------GGAVVDLDADGIMYGSCLAFAPATGGIGSIIGNVQQRTFEVLHDVGQSV 450
Query: 329 LSFVKENC 336
L F C
Sbjct: 451 LGFRPGAC 458
>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 516
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 89/357 (24%), Positives = 142/357 (39%), Gaps = 41/357 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
+V + +GTP+ ++ DTGS + +FDP +SS++ ++C P
Sbjct: 180 VVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTYANVSCAAPA 239
Query: 46 CTYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C+ C C+Y ++Y D S + GF A +T+++ G FGC N G
Sbjct: 240 CSDLDTRGCSGGHCLYGVQYGDGSYSIGFFAMDTLTL----SSYDAVKGFRFGCGERNEG 295
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
+A AG+LGL R S Q F++C LP + YL FG
Sbjct: 296 LFGEA-----AGLLGLGRGKTSLPVQTYDKYGGVFAHC----LPARSTGTGYLDFGAGSP 346
Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
R +T N P FYY+ L I + + P F G I+DSG+V+T
Sbjct: 347 AARLTTTPMLVDNGP-TFYYVGLTGIRVGGRLLYIPQSVFATA-----GTIVDSGTVITR 400
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE-DANLR 280
Y L F + + + + CY F + P+++ F+ A L
Sbjct: 401 LPPAAYSSLRSAFAAAMSARGYKK-APAVSLLDTCYDFAGMSQVAIPTVSLLFQGGARLD 459
Query: 281 IDGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+D + + L A D V ++G+ Q + YD+ ++SF C
Sbjct: 460 VDASGIMYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVSFSPGAC 516
>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
Length = 427
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 106/383 (27%), Positives = 160/383 (41%), Gaps = 65/383 (16%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY-----------------AIFDPRKSSSFQKINCDHP 44
V L +GTP+K LI+DTGS L + +D SSS+++I C
Sbjct: 61 VELRVGTPAKKFPLIVDTGSDLTWIQCNPPNTTANSSSPPAPWYDKSSSSSYREIPCTDD 120
Query: 45 DCTYFKC---------VNEQCVYTMKYADQSVTKGFAAHETISV-----IGKGEG----- 85
+C + C YT Y+DQS T G A+ETIS+ GK G
Sbjct: 121 ECQFLPAPIGSSCSITSPSPCDYTYGYSDQSRTTGILAYETISMKSRKRSGKRAGNHKTR 180
Query: 86 KAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQ-----LGSIIKKRFSYC 140
+ GCS ++ G A +GVLGL + IS +Q LG I FSYC
Sbjct: 181 RIRIKNVALGCSRESVG----ASFLGASGVLGLGQGPISLATQTRHTALGGI----FSYC 232
Query: 141 LVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMN-F 197
LV L G SS+L G +R+ T + +P +FYY+++ +++D + ++
Sbjct: 233 LVDYL-RGSNASSFLVMGRTH-WRK--LAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGI 288
Query: 198 PPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVS--YFERFQLAQLSDCPEPIQ 255
+ I G G I DSG+ L+Y Y K+ + Y R Q + PE +
Sbjct: 289 ASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQ-----EIPEGFE 343
Query: 256 LCYFLPETFNRFPSMAFYFE-DANLRIDGENVFIIDYEN-HFFLLAVAPHDDLVALIGSQ 313
LCY + P + F+ A + + N ++ EN L + ++G+
Sbjct: 344 LCYNVTRMEKGMPKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQKVTTTNGSNILGNL 403
Query: 314 QQRDTRFVYDLNIDLLSFVKENC 336
Q+D YDL + F C
Sbjct: 404 LQQDHHIEYDLAKARIGFKWSPC 426
>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 93/378 (24%), Positives = 164/378 (43%), Gaps = 60/378 (15%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY----------AIFDPRKSSSFQKINCDHPDCTY--- 48
V L GTP + + ++LDTGS L + +IF+P S ++ KI C P C
Sbjct: 69 VSLTAGTPLQNITMVLDTGSELSWLHCKKEPNFNSIFNPLASKTYTKIPCSSPTCETRTR 128
Query: 49 -----FKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C + C + + YAD S +G A ET V G +FGC + G
Sbjct: 129 DLPLPVSCDPAKLCHFIISYADASSVEGNLAFETFRV-----GSVTGPATVFGCMDS--G 181
Query: 103 FDEDAR-DGALAGVLGLSRVTISFISQLGSIIKKRFSYCL-------VIPLPNGEYTSSY 154
F ++ D G++G++R ++SF++Q+G ++FSYC+ V+ L GE + S+
Sbjct: 182 FSSNSEEDAKTTGLMGMNRGSLSFVNQMGF---RKFSYCISDRDSSGVLLL--GEASFSW 236
Query: 155 LKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
LK + Y +T Y + L+ I + ++ ++ P F +G G ++
Sbjct: 237 LK---PLNYTPLVEMSTPLPYFDRVAYSVQLEGIRVSDKVLSLPKSVFVPDHTGAGQTMV 293
Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-----PIQLCYFLPETFNRFPS 269
DSG+ T+ VY L ++F+ + + ++ + P + LCY + T P+
Sbjct: 294 DSGTQFTFLLGPVYSALKQEFL--LQTKGVLRVLNEPRYVFQGAMDLCYLIEPTRAALPN 351
Query: 270 MA---FYFEDANLRIDGENVFI-----IDYENHFFLLAVAPHDDLVA---LIGSQQQRDT 318
+ F A + + G+ + + ++ + D L +IG QQ++
Sbjct: 352 LPVVNLMFRGAEMSVSGQRLLYRVPGEVRGKDSVWCFTFGNSDSLGIESFVIGHHQQQNV 411
Query: 319 RFVYDLNIDLLSFVKENC 336
YDL + F + C
Sbjct: 412 WMEYDLEKSRIGFAEVRC 429
>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
Length = 395
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 106/383 (27%), Positives = 157/383 (40%), Gaps = 65/383 (16%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY-----------------AIFDPRKSSSFQKINCDHP 44
V L +GTP+K LI+DTGS L + +D SSS+++I C
Sbjct: 29 VELRVGTPAKKFPLIIDTGSDLTWIQCNPPNTTANSSSPPAPWYDKSSSSSYREIPCTDD 88
Query: 45 DCTYFKC---------VNEQCVYTMKYADQSVTKGFAAHETISVIG-KGEGKAI------ 88
+C + C YT Y+DQS T G A+ETIS+ K GK
Sbjct: 89 ECLFLPAPIGSSCSIKSPSPCDYTYGYSDQSRTTGILAYETISMKSRKRSGKRAGNHKTR 148
Query: 89 ---FHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQ-----LGSIIKKRFSYC 140
GCS ++ G A +GVLGL + IS +Q LG I FSYC
Sbjct: 149 TIRIKNVALGCSRESVG----ASFLGASGVLGLGQGPISLATQTRHTALGGI----FSYC 200
Query: 141 LVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMN-F 197
LV L G SS+L G R T + +P +FYY+++ +++D + ++
Sbjct: 201 LVDYL-RGSNASSFLVMGRT---RWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGI 256
Query: 198 PPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVS--YFERFQLAQLSDCPEPIQ 255
+ I G G I DSG+ L+Y Y K+ + Y R Q + PE +
Sbjct: 257 ASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQ-----EIPEGFE 311
Query: 256 LCYFLPETFNRFPSMAFYFE-DANLRIDGENVFIIDYEN-HFFLLAVAPHDDLVALIGSQ 313
LCY + P + F+ A + + N ++ EN L + ++G+
Sbjct: 312 LCYNVTRMEKGMPKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQKVTTTNGSNILGNL 371
Query: 314 QQRDTRFVYDLNIDLLSFVKENC 336
Q+D YDL + F C
Sbjct: 372 LQQDHHIEYDLAKARIGFKWSPC 394
>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
Length = 367
Score = 100 bits (250), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 99/371 (26%), Positives = 154/371 (41%), Gaps = 45/371 (12%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
+ + +G+P K I+DTGS L++ I+DP SS+F K +C C
Sbjct: 6 MEIELGSPPKKFNAIVDTGSDLVWIQCKPCSQCYSQSDPIYDPSASSTFAKTSCSTSSCQ 65
Query: 48 YFKCVN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
+ C+Y +Y D S T+G A ET+++ G F FGC N G
Sbjct: 66 SLPASGCSSSAKTCIYGYQYGDSSSTQGDFALETLTLRSSGGSSKAFPNFQFGCGRLNSG 125
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
G AG++GL + IS +QLGS I +FSYCLV + +S L FG+
Sbjct: 126 -----SFGGAAGIVGLGQGKISLSTQLGSAINNKFSYCLV-DFDDDSSKTSPLIFGSSAS 179
Query: 163 YRRPSTQATKFINHP-NNFYYLSLKDISIDNERMNFPPDTFD-ITVSGE----------- 209
+ N + +Y++ L+ IS+ ++++ D ++V +
Sbjct: 180 TGSGAISTPIIPNSGRSTYYFVGLEGISVGGKQLSLATRAIDFLSVRSKKKLRVRALEVN 239
Query: 210 -GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RF 267
GG I DSG+ LT VY K+ F S L + LCY + ++ N +F
Sbjct: 240 SGGTIFDSGTTLTLLDDAVYSKVKSAFAS---SVSLPTVDASSSGFDLCYDVSKSKNFKF 296
Query: 268 PSMAFYFEDANLRIDGENVF-IIDYENHFFLLAVAPHDDLVALIGSQ-QQRDTRFVYDLN 325
P++ F+ +N F I+D LA+ L I Q++ VYD
Sbjct: 297 PALTLAFKGTKFSPPQKNYFVIVDTAETVACLAMGGSGSLGLGIIGNLMQQNYHVVYDRG 356
Query: 326 IDLLSFVKENC 336
+S C
Sbjct: 357 TSTISMSPAQC 367
>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 456
Score = 100 bits (250), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 97/360 (26%), Positives = 165/360 (45%), Gaps = 37/360 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
+V L IG+P L+++DTGS+L++ + FDP KS SF+ + C P
Sbjct: 105 LVNLSIGSPPVTQLVVVDTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTLGCGFPGY 164
Query: 47 TY---FKCVN-EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
Y +KC Q Y ++Y ++G A E++ EGK FGC + N
Sbjct: 165 NYINGYKCNRFNQAEYKLRYLGGDSSQGILAKESLLFETLDEGKIKKSNITFGCGHMN-- 222
Query: 103 FDEDARDGALAGVLGLSRVT-ISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
+ D A GV GL I+ +QLG+ +FSYC + + N YT ++L G
Sbjct: 223 -IKTNNDDAYNGVFGLGAYPHITMATQLGN----KFSYC-IGDINNPLYTHNHLVLGQGS 276
Query: 162 GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
ST H YY++L+ IS+ ++ + P+ F I+ G GG +IDSG T
Sbjct: 277 YIEGDSTPLQIHFGH----YYVTLQSISVGSKTLKIDPNAFKISSDGSGGVLIDSGMTYT 332
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY--FLPETFNRFPSMAFYFE-DAN 278
+ + L+++ V + L ++ + LC+ + FP++ F+F A+
Sbjct: 333 KLANGGFELLYDEIVDLMKGL-LERIPTQRKFEGLCFKGVVSRDLVGFPAVTFHFAGGAD 391
Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVAL--IGSQQQRDTRFVYDLNIDLLSFVKENC 336
L ++ ++F + F L + + +L+ L IG Q++ +DL + F + +C
Sbjct: 392 LVLESGSLFRQHGGDRFCLAILPSNSELLNLSVIGILAQQNYNVGFDLEQMKVFFRRIDC 451
>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
gi|223948083|gb|ACN28125.1| unknown [Zea mays]
gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
Length = 466
Score = 100 bits (250), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 91/368 (24%), Positives = 148/368 (40%), Gaps = 64/368 (17%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
++ + +GTP+ ++ +DTGS + + +FDP KS+++ +C
Sbjct: 131 VITVSLGTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKDKLFDPAKSATYSAFSCSSA 190
Query: 45 DCTYFK-----CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
C C+N C Y +KY D S T G +T+ + K FGCS+
Sbjct: 191 QCAQLGGEGNGCLNSHCQYIVKYVDHSNTTGTYGSDTLGLTTSDAVKNF----QFGCSHR 246
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
+GF G L G++GL T S +SQ + K FSYCL P+ +L G
Sbjct: 247 ANGFV-----GQLDGLMGLGGDTESLVSQTAATYGKAFSYCLP---PSSSSAGGFLTLGA 298
Query: 160 DMGYRRPSTQATKFINHP------NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
G T ++++ P FY + L+ I++ ++N P F G +
Sbjct: 299 AAG----GTSSSRYSRTPLVRFNVPTFYGVFLQAITVAGTKLNVPASVF------SGASV 348
Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL---CY-FLPETFNRFPS 269
+DSG+V+T Y L F + + A P+ + C+ F R P
Sbjct: 349 VDSGTVITQLPPTAYQALRTAFKKEMKAYPSA------APVGILDTCFDFSGIKTVRVPV 402
Query: 270 MAFYF-EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDL 328
+ F A + +D +F Y A A D ++G+ QQR ++D+
Sbjct: 403 VTLTFSRGAVMDLDVSGIF---YAGCLAFTATA-QDGDTGILGNVQQRTFEMLFDVGGST 458
Query: 329 LSFVKENC 336
L F C
Sbjct: 459 LGFRPGAC 466
>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
Length = 468
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 96/368 (26%), Positives = 147/368 (39%), Gaps = 56/368 (15%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
+V + +GTPS +L++DTGS L + +FDP KSS++ I C+
Sbjct: 125 VVTVGLGTPSVSQVLLIDTGSDLSWVQCQPCNSTTCYPQKDPLFDPSKSSTYAPIPCNTD 184
Query: 45 DCTYFK-------CVN----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL 93
C C + QC + + Y D S T+G ++ET++ + G F
Sbjct: 185 ACRDLTDDGYGGGCASGDGAAQCGFAITYGDGSQTRGVYSNETLA-LAPGVAVKDFR--- 240
Query: 94 FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSS 153
FGC G D+D + G+LGL S + Q S+ FSYCL P N +
Sbjct: 241 FGC-----GHDQDGANDKYDGLLGLGGAPESLVVQTASVYGGAFSYCL--PALNNQVGFL 293
Query: 154 YLKFGTDMGYRRPSTQA---TKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
L G +T T I FY +++ I++ E ++ PP F G
Sbjct: 294 ALGGGGAPSGGVVNTSGFVFTPMIREEETFYVVNMTGITVGGEPIDVPPSAF------SG 347
Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPS 269
G IIDSG+V+T Y L F + L + + + CY N P
Sbjct: 348 GMIIDSGTVVTELQHTAYNALQAAFRKAMAAYPLVRNGE----LDTCYDFSGYSNVTLPK 403
Query: 270 MAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDL 328
+A F A + +D N ++D + DD ++G+ QR +YD
Sbjct: 404 VALTFSGGATIDLDVPNGILLD---DCLAFQESGPDDQPGILGNVNQRTLEVLYDAGRGR 460
Query: 329 LSFVKENC 336
+ F C
Sbjct: 461 VGFRAAVC 468
>gi|297741705|emb|CBI32837.3| unnamed protein product [Vitis vinifera]
Length = 455
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 93/356 (26%), Positives = 156/356 (43%), Gaps = 40/356 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
+ L IG P V ++LDTGS L + I++ KS S+ ++ C+ P C
Sbjct: 107 LANLSIGNPPTNVYVVLDTGSDLFWIQCEPCDVCYKQKDPIYNRTKSDSYTEMLCNEPPC 166
Query: 47 TYF----KCVNE-QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
+C + C+Y YAD S T G ++E ++ + FGC N
Sbjct: 167 LSLGREGQCSDSGSCLYQTSYADGSRTSGLLSYEKVAFTSHYSDEDKTAQVGFGCGLQNL 226
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV-IPLPNGEYTSSYLKFGTD 160
F +RDG + G+ +S +S +G + K F+YC + PN +L FG D
Sbjct: 227 NFVTSSRDGGVLGLGPGLVSLVSQLSAIGKV-SKSFAYCFGNLSNPNA---GGFLVFG-D 281
Query: 161 MGYRRPSTQATKFINHPNNFYYLSLKDIS--IDNERMNFPPDTFDITVSGEGGCIIDSGS 218
Y FYY++L I ++ R++ +F+ G GG IIDSGS
Sbjct: 282 ATYLNGDMTPMVIAE----FYYVNLLGIGLGVEEPRLDINSSSFERKPDGSGGVIIDSGS 337
Query: 219 VLTYFHSDVYWKLHEKFVSYFER-FQLAQLSDCPEPIQLCY--FLPETFNRFPSMAFYFE 275
L+ F +VY + V ++ + ++ L+ P+ C+ + FP++ Y E
Sbjct: 338 TLSIFPPEVYEVVRNAVVDKLKKGYNISPLTSSPD----CFEGKIGRDLPLFPTLVLYLE 393
Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSF 331
+ D ++F+ Y+ F L + L ++IG+ Q+ +F Y+L + LS
Sbjct: 394 STGILNDRWSIFLQRYD-ELFCLGFTSGEGL-SIIGTLAQQSYKFGYNLELSTLSI 447
>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 476
Score = 100 bits (249), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 92/358 (25%), Positives = 150/358 (41%), Gaps = 43/358 (12%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
VR+ +G+P + +++D+GS +++ +FDP S+++ I+CD C
Sbjct: 139 VRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVFDPAGSATYAGISCDSSVCD 198
Query: 48 YF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
C + +C Y + Y D S T+G A ET++ G+ + GC + N G
Sbjct: 199 RLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTF-----GRVLIRNIAIGCGHMNRGMF 253
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
A +SF+ QLG FSYCLV G ++ L+FG G
Sbjct: 254 IGAAGLLGL-----GGGAMSFVGQLGGQTGGAFSYCLV---SRGTESTGTLEFG--RGAM 303
Query: 165 RPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
I +P +FYY+ L + + R+ P F++T G GG ++D+G+ +T
Sbjct: 304 PVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGTAVTR 363
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQL--SDCPEPIQLCYFLPETFN-RFPSMAFYFEDAN- 278
+ Y + F+ Q A L SD CY L + R P+++FYF
Sbjct: 364 LPAPAYEAFRDTFIG-----QTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPI 418
Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
L + N I F A A +++IG+ QQ + D + + F C
Sbjct: 419 LTLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC 476
>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 100 bits (249), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 92/357 (25%), Positives = 164/357 (45%), Gaps = 42/357 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
++ +GTP V +DTGS +++ IF+P KSSS++ I C C
Sbjct: 90 LISYSVGTPPFKVYGFMDTGSNIVWLQCQPCNTCFNQTSPIFNPSKSSSYKNIPCTSSTC 149
Query: 47 -----TYFKCVN--EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
T+ C N + C Y++ Y + ++G +++++++ +F + GC +
Sbjct: 150 KDTNDTHISCSNGGDVCEYSITYGGDAKSQGDLSNDSLTLDSTSGSSVLFPNIVIGCGHI 209
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLG-SIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
N D +GV+G+ R +S I Q+G S + +FSYCL IP + +SS L FG
Sbjct: 210 NVLQDNSQS----SGVVGMGRGPMSLIKQVGSSSVGSKFSYCL-IPYNSDSNSSSKLIFG 264
Query: 159 TDM---GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
D+ G ST K +N N+Y+L+L+ S+ N R+ + + + + +ID
Sbjct: 265 EDVVVSGEIVVSTPMVK-VNGQENYYFLTLEAFSVGNNRIEYG----ERSNASTQNILID 319
Query: 216 SGSVLTYFHSDVYWKLHEKFVSYF-ERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYF 274
SG+ LT + KL VSY + +L ++ + LCY P + +F
Sbjct: 320 SGTPLTMLPNLFLSKL----VSYVAQEVKLPRIEPPDHHLSLCYNTTGKQLNVPDITAHF 375
Query: 275 EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSF 331
A+++++ F +E+ + L + G+ Q + YDL +++SF
Sbjct: 376 NGADVKLNSNGTF-FPFEDGIMCFGFISSNGL-EIFGNIAQNNLLIDYDLEKEIISF 430
>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 392
Score = 100 bits (249), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 97/358 (27%), Positives = 155/358 (43%), Gaps = 52/358 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+++L +GTP + +DTGS LI+ IFDP SS+F++ C+
Sbjct: 62 LMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFKEKRCN---- 117
Query: 47 TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
C Y + YAD + +KG A ET+++ + GC +++ F
Sbjct: 118 ------GNSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCGHNSSWFKP- 170
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
+G++GLS S I+Q+G SYC +S + FGT+
Sbjct: 171 ----TFSGMVGLSWGPSSLITQMGGEYPGLMSYCF------ASQGTSKINFGTNAIVAGD 220
Query: 167 STQATK--FINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
+T YYL+L +S+ + + TF + EG IIDSG+ LTYF
Sbjct: 221 GVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFH---ALEGNIIIDSGTTLTYFP 277
Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE-DANLRIDG 283
Y L + V ++ + +D LCY+ +T + FP + +F A+L +D
Sbjct: 278 VS-YCNLVREAVDHY--VTAVRTADPTGNDMLCYYT-DTIDIFPVITMHFSGGADLVLDK 333
Query: 284 ENVFIIDYENHFFLLAV----APHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
N++I F LA+ P D A+ G++ Q + YD + L+SF NCS
Sbjct: 334 YNMYIETITRGTFCLAIICNNPPQD---AIFGNRAQNNFLVGYDSSSLLVSFSPTNCS 388
>gi|326524806|dbj|BAK04339.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 460
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 94/342 (27%), Positives = 161/342 (47%), Gaps = 40/342 (11%)
Query: 28 FDPRKSSSFQKINCDHPDCT-----YFKCVNEQCVYTMKYADQSV-TKGFAAHETISVIG 81
F+P KS SF+++ ++ C + + V + C + D S +G ++ET++
Sbjct: 128 FEPAKSPSFRRLPGNNAFCLPAPRGHRRTVQDPCKFHSIRLDGSADARGVLSNETLAFAA 187
Query: 82 KGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLG-----SIIKKR 136
G+ + G + GC++++ GF+ ++ G LAGVLGL R S I LG ++ R
Sbjct: 188 SGQQQTEVTGVVIGCTHNSKGFNFNSH-GVLAGVLGLGRQAPSLIWTLGQHRHGTVQVHR 246
Query: 137 FSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQ---ATKFI----NHPNNF--YYLSLKD 187
FSYCL + ++L+F D+ P+TQ +TK + +F Y++SL
Sbjct: 247 FSYCLPSHGSSSSDHHTFLRFDDDV----PNTQHMVSTKIMYMDSTTSRDFRAYFVSLTG 302
Query: 188 ISIDNERMNFPPDTFDITVSGE---GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQL 244
IS+ + + + F V G+ GC D+G+ Y KL + V + + L
Sbjct: 303 ISVAGKPLQDVKELFKRHVHGQVWTSGCAFDAGTPTMVMIMPAYNKLKDAVVRHLKPLGL 362
Query: 245 AQLSDCPEPIQLCY-FLPETFNRFPSMAFYFED--ANLRIDGENVFI-IDYENHFFLLAV 300
+S LC+ + + P++ F + A L + + +F+ + Y+ LAV
Sbjct: 363 QIVSG---QYHLCFRATSQLWQHLPTVMLQFAETEARLVLPPQRLFVAVGYD---ICLAV 416
Query: 301 APHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN-CSDDSA 341
D + +IG+ QQ D RFVYD+ + FV EN C D+
Sbjct: 417 VRSYD-ITIIGAMQQVDKRFVYDVRHGRIYFVPENACHADAG 457
>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 472
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 100/374 (26%), Positives = 152/374 (40%), Gaps = 65/374 (17%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
+V L IGTP+ +++DTGS L + +FDP KSS+F I C
Sbjct: 126 VVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNASDCYPQKDPLFDPSKSSTFATIPCASD 185
Query: 45 DCTYFK-------CVNE------QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHG 91
C C N QC Y ++Y + ++T+G + ET+++ A+
Sbjct: 186 ACKQLPVDGYDNGCTNNTSGMPPQCGYAIEYGNGAITEGVYSTETLAL----GSSAVVKS 241
Query: 92 ALFGCSNDNHG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY 150
FGC +D HG +D+ G+LGL S +SQ S+ FSYCL PL +G
Sbjct: 242 FRFGCGSDQHGPYDK------FDGLLGLGGAPESLVSQTASVYGGAFSYCLP-PLNSG-- 292
Query: 151 TSSYLKFGTDMGYRRPS-----TQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDIT 205
+ +L G + T F FY ++L IS+ + ++ PP F
Sbjct: 293 -AGFLTLGAPNSTNNSNSGFVFTPMHAFSPKIATFYVVTLTGISVGGKALDIPPAVF--- 348
Query: 206 VSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETF 264
G I+DSG+V+T + Y L F S + L +D + CY F
Sbjct: 349 ---AKGNIVDSGTVITGIPTTAYKALRTAFRSAMAEYPLLPPAD--SALDTCYNFTGHGT 403
Query: 265 NRFPSMAFYF-EDANLRIDGENVFIIDYENHFFLLAVA-PHDDLVALIGSQQQRDTRFVY 322
P +A F A + +D + +++ LA A D +IG+ R +Y
Sbjct: 404 VTVPKVALTFVGGATVDLDVPSGVLVED-----CLAFADAGDGSFGIIGNVNTRTIEVLY 458
Query: 323 DLNIDLLSFVKENC 336
D L F C
Sbjct: 459 DSGKGHLGFRAGAC 472
>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
Length = 458
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 98/384 (25%), Positives = 161/384 (41%), Gaps = 60/384 (15%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY----------------AIFDPRKSSSFQKINC---- 41
V + +G+P + +LL+ DTGS L + + F R S++F +C
Sbjct: 85 VSIRLGSPPQTLLLVADTGSDLTWVRCSACKTNCSIHPPGSTFLARHSTTFSPTHCFSSL 144
Query: 42 ------DHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFG 95
+P+ ++ C Y Y+D S T GF + ET ++ + FG
Sbjct: 145 CQLVPQPNPNPCNHTRLHSTCRYEYVYSDGSKTSGFFSKETTTLNTSSGREMKLKSIAFG 204
Query: 96 CSNDNHG--FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT-- 151
C G + +GA +GV+GL R ISF SQLG + FSYCL+ +YT
Sbjct: 205 CGFHASGPSLIGSSFNGA-SGVMGLGRGPISFASQLGRRFGRSFSYCLL------DYTLS 257
Query: 152 ---SSYLKFGTDMGYRRPSTQATKF----IN-HPNNFYYLSLKDISIDNERMNFPPDTFD 203
+SYL G + ++ + F IN FYY+S+K + +D +++ P +
Sbjct: 258 PPPTSYLMIGDVVSTKKDNKSMMSFTPLLINPEAPTFYYISIKGVFVDGVKLHIDPSVWS 317
Query: 204 ITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP------EPIQLC 257
+ G GG +IDSG+ LT+ Y + +S F+R ++ S P LC
Sbjct: 318 LDELGNGGTVIDSGTTLTFLTEPAY----REILSAFKR-EVKLPSPTPGGASTRSGFDLC 372
Query: 258 YFLPE-TFNRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAP---HDDLVALIGSQ 313
+ + RFP ++ +L + ID LA+ P ++IG+
Sbjct: 373 VNVTGVSRPRFPRLSLELGGESLYSPPPRNYFIDISEGIKCLAIQPVEAESGRFSVIGNL 432
Query: 314 QQRDTRFVYDLNIDLLSFVKENCS 337
Q+ +D L F + C+
Sbjct: 433 MQQGFLLEFDRGKSRLGFSRRGCA 456
>gi|212274314|ref|NP_001130524.1| uncharacterized protein LOC100191623 [Zea mays]
gi|194689376|gb|ACF78772.1| unknown [Zea mays]
gi|224031455|gb|ACN34803.1| unknown [Zea mays]
gi|238011528|gb|ACR36799.1| unknown [Zea mays]
gi|238015454|gb|ACR38762.1| unknown [Zea mays]
Length = 304
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 87/316 (27%), Positives = 133/316 (42%), Gaps = 45/316 (14%)
Query: 40 NCDHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL--FGCS 97
+C+ PD C Y Y D ++T G A E + G G FGC
Sbjct: 15 SCERPD---------TCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVPLGFGCG 65
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS---SY 154
+ N G + +G++G R +S +SQL SI +RFSYCL Y S S
Sbjct: 66 SVNVGSLNNG-----SGIVGFGRNPLSLVSQL-SI--RRFSYCLT------SYASRRQST 111
Query: 155 LKFGT----DMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSG 208
L FG+ G Q T + P N FYY+ +++ R+ P F + G
Sbjct: 112 LLFGSLSDGVYGDATGRVQTTPLLQSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDG 171
Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-- 266
GG I+DSG+ LT + V L E ++ ++ +L + +C+ +P + R
Sbjct: 172 SGGVIVDSGTALTLLPAAV---LAEVVRAFRQQLRLPFANGGNPEDGVCFLVPAAWRRSS 228
Query: 267 ------FPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRF 320
P M +F+ A+L + N + D+ L +A D + IG+ Q+D R
Sbjct: 229 STSQMPVPRMVLHFQGADLDLPRRNYVLDDHRRGRLCLLLADSGDDGSTIGNLVQQDMRV 288
Query: 321 VYDLNIDLLSFVKENC 336
+YDL + LS C
Sbjct: 289 LYDLEAETLSIAPARC 304
>gi|168014188|ref|XP_001759635.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689174|gb|EDQ75547.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 485
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 97/370 (26%), Positives = 148/370 (40%), Gaps = 56/370 (15%)
Query: 3 RLFIGTPSKGVLLILDTGSALIY-------------AIFDPR----KSSSFQKINCDHPD 45
R+FIGTP++ LI+DTGS + Y A FDPR SSS+Q ++C+ PD
Sbjct: 102 RVFIGTPAQEFALIVDTGSTVTYVPCSSCTHCGHHQACFDPRFKPDNSSSYQTVSCNSPD 161
Query: 46 CTYFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIF-HGALFGCSNDNHG 102
C C QC Y YA+ S +KG + ++G G G + H LFGC G
Sbjct: 162 CITKMCDARVHQCKYERVYAEMSSSKGVLGKD---LLGFGNGSRLQPHPLLFGCETAETG 218
Query: 103 FDEDARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
D G++GL R +S + QL ++ FS C Y G+
Sbjct: 219 ---DLYLQHADGIMGLGRGPLSIVDQLVGTGAMEDSFSLC---------YGGMDEGGGSM 266
Query: 161 MGYRRPSTQATKFINH-PN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
+ P A F PN N+Y L L +I + +N P + F+ G G ++DSG
Sbjct: 267 VLGAIPPPPAMVFAKSDPNRSNYYNLELSEIQVQGVSLNVPSEVFN----GRLGTVLDSG 322
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDA 277
+ Y + + Q D P +C+ + ++ ++ +F
Sbjct: 323 TTYAYLPDKAFDAFKDAITQQLGSLQAVPGPD-PSYPDVCFAGAGSDSK--ALGKHFPPV 379
Query: 278 NLRIDGENVFIIDYENHFFLLAVAP---------HDDLVALIGSQQQRDTRFVYDLNIDL 328
+ G + EN+ F P + D L+G R+T YD
Sbjct: 380 DFVFSGNQKVFLAPENYLFKHTKVPGAYCLGFFKNQDATTLLGGIVVRNTLVTYDRANHQ 439
Query: 329 LSFVKENCSD 338
+ F K NC++
Sbjct: 440 IGFFKTNCTN 449
>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 481
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 86/361 (23%), Positives = 145/361 (40%), Gaps = 49/361 (13%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
+R+ +G+P + +++D+GS +++ +FDP S+SF + C C
Sbjct: 144 IRIGVGSPPREQYVVIDSGSDIVWVQCQPCTQCYHQTDPVFDPADSASFMGVPCSSSVCE 203
Query: 48 YFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
+ C C Y + Y D S TKG A ET++ G+ + GC + N G
Sbjct: 204 RIENAGCHAGGCRYEVMYGDGSYTKGTLALETLTF-----GRTVVRNVAIGCGHRNRGMF 258
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
A ++S + QLG FSYCLV G ++ L+FG G
Sbjct: 259 VGAAGLLGL-----GGGSMSLVGQLGGQTGGAFSYCLV---SRGTDSAGSLEFG--RGAM 308
Query: 165 RPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
I +P +FYY+ L + + ++ D F + G GG ++D+G+ +T
Sbjct: 309 PVGAAWIPLIRNPRAPSFYYIRLSGVGVGGMKVPISEDVFQLNEMGNGGVVMDTGTAVTR 368
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL-----CYFLPETFN-RFPSMAFYFED 276
+ Y + F+ Q + P + CY L + R P+++FYF
Sbjct: 369 IPTVAYVAFRDAFI--------GQTGNLPRASGVSIFDTCYNLNGFVSVRVPTVSFYFAG 420
Query: 277 AN-LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
L + N I + F A A +++IG+ QQ + +D + F
Sbjct: 421 GPILTLPARNFLIPVDDVGTFCFAFAASPSGLSIIGNIQQEGIQISFDGANGFVGFGPNV 480
Query: 336 C 336
C
Sbjct: 481 C 481
>gi|356537928|ref|XP_003537458.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 445
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 99/380 (26%), Positives = 162/380 (42%), Gaps = 62/380 (16%)
Query: 2 VRLFIGTPSKGVLLILDTGSAL----------IYAIFDPRKSSSFQKINCDHPDCTY--- 48
V L +GTP + V ++LDTGS L I ++F+P SSS+ I C P C
Sbjct: 72 VSLTVGTPPQSVTMVLDTGSELSWLHCKKQQNINSVFNPHLSSSYTPIPCMSPICKTRTR 131
Query: 49 -----FKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C N C T+ YAD + +G A +T ++ G G+ IF G++ + G
Sbjct: 132 DFLIPVSCDSNNLCHVTVSYADFTSLEGNLASDTFAISGSGQPGIIF-GSM------DSG 184
Query: 103 FDEDA-RDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
F +A D G++G++R ++SF++Q+G +FSYC+ +G+ S L FG
Sbjct: 185 FSSNANEDSKTTGLMGMNRGSLSFVTQMGF---PKFSYCI-----SGKDASGVLLFGDAT 236
Query: 162 GYRRPSTQATKFI--NHPNNF-----YYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
+ T + N P + Y + L I + ++ + P + F +G G ++
Sbjct: 237 FKWLGPLKYTPLVKMNTPLPYFDRVAYTVRLMGIRVGSKPLQVPKEIFAPDHTGAGQTMV 296
Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-----PIQLCYFLPE--TFNRF 267
DSG+ T+ VY L +FV+ R L L D P + LC+ +
Sbjct: 297 DSGTRFTFLLGSVYTALRNEFVAQ-TRGVLTLLED-PNFVFEGAMDLCFRVRRGGVVPAV 354
Query: 268 PSMAFYFEDANLRIDGENVF--------IIDYENHFFLLAVAPHDDL---VALIGSQQQR 316
P++ FE A + + GE + + + L D L +IG Q+
Sbjct: 355 PAVTMVFEGAEMSVSGERLLYRVGGDGDVAKGNGDVYCLTFGNSDLLGIEAYVIGHHHQQ 414
Query: 317 DTRFVYDLNIDLLSFVKENC 336
+ +DL + F C
Sbjct: 415 NVWMEFDLVNSRVGFADTKC 434
>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
Length = 480
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 93/364 (25%), Positives = 156/364 (42%), Gaps = 54/364 (14%)
Query: 6 IGTPSKGVLLILDTGSALIYAIFDPRKSSSFQK--------------INCDHPDCTYFKC 51
IG ++ + +I+DTGS L + DP S Q+ + C+ C +
Sbjct: 137 IGLGNQNMTVIIDTGSDLTWVQCDPCMSCYSQQGPVFNPSNSSSYNSLLCNSSTCQNLQF 196
Query: 52 VN-----------EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
C +T+ Y D S T G E +S G +FGC +N
Sbjct: 197 TTGNTEACESNNPSSCNHTVSYGDGSFTDGELGVEHLSF-----GGISVSNFVFGCGRNN 251
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
G G ++G++GL R +S ISQ + FSYCL P + + S +
Sbjct: 252 KGLF-----GGVSGIMGLGRSNLSMISQTNTTFGGVFSYCL--PTTDSGASGSLVIGNES 304
Query: 161 MGYRRPSTQA-TKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
++ + A T +++P +NFY L+L I + + T G GG +IDSG
Sbjct: 305 SLFKNLTPIAYTSMVSNPQLSNFYVLNLTGIDVGGVAI-------QDTSFGNGGILIDSG 357
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFED 276
+V+T +Y L +F+ F + +A + C+ L P+++ +FE+
Sbjct: 358 TVITRLAPSLYNALKAEFLKQFSGYPIA---PALSILDTCFNLTGIEEVSIPTLSMHFEN 414
Query: 277 -ANLRIDGENVFIIDYENHFFLLAVAPHDDL--VALIGSQQQRDTRFVYDLNIDLLSFVK 333
+L +D + + + LA+A D +A+IG+ QQR+ R +YD + F +
Sbjct: 415 NVDLNVDAVGILYMPKDGSQVCLALASLSDENDMAIIGNYQQRNQRVIYDAKQSKIGFAR 474
Query: 334 ENCS 337
E+CS
Sbjct: 475 EDCS 478
>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
Length = 444
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 93/375 (24%), Positives = 163/375 (43%), Gaps = 58/375 (15%)
Query: 4 LFIGTPSKGVLLILDTGSALIY----------AIFDPRKSSSFQKINCDHPDCTY----- 48
L IGTP + + ++LDTGS L + +IF+P S ++ KI C C
Sbjct: 71 LTIGTPPQNITMVLDTGSELSWLRCKKEPNFTSIFNPLASKTYTKIPCSSQTCKTRTSDL 130
Query: 49 ---FKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC--SNDNHG 102
C + C + + YAD S +G A ET G +FGC S +
Sbjct: 131 TLPVTCDPAKLCHFIISYADASSVEGHLAFETFRF-----GSLTRPATVFGCMDSGSSSN 185
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL-----VIPLPNGEYTSSYLKF 157
+EDA+ G++G++R ++SF++Q+G ++FSYC+ L GE S+LK
Sbjct: 186 TEEDAKT---TGLMGMNRGSLSFVNQMGF---RKFSYCISGLDSTGFLLLGEARYSWLK- 238
Query: 158 GTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
+ Y +T Y + L+ I ++N+ + P F +G G ++DSG
Sbjct: 239 --PLNYTPLVQISTPLPYFDRVAYSVQLEGIKVNNKVLPLPKSVFVPDHTGAGQTMVDSG 296
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-----PIQLCYFLPETFNRFPSM-- 270
+ T+ VY L ++F+ + + ++ + P+ + LCY + T + P++
Sbjct: 297 TQFTFLLGPVYSALRKEFL--LQTAGVLRVLNEPQYVFQGAMDLCYLIDSTSSTLPNLPV 354
Query: 271 -AFYFEDANLRIDGENVFI-----IDYENHFFLLAVAPHDDLVA---LIGSQQQRDTRFV 321
F A + + G+ + + ++ + D+L LIG QQ++
Sbjct: 355 VKLMFRGAEMSVSGQRLLYRVPGEVRGKDSVWCFTFGNSDELGISSFLIGHHQQQNVWME 414
Query: 322 YDLNIDLLSFVKENC 336
YDL + F + C
Sbjct: 415 YDLENSRIGFAELRC 429
>gi|357133002|ref|XP_003568117.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 497
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 102/388 (26%), Positives = 171/388 (44%), Gaps = 81/388 (20%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
++ IGTP K + +DTGS +++ A++DP+ SSS ++CD
Sbjct: 89 TKIEIGTPPKPFHVQVDTGSDILWVNCVSCDKCPTKSGLGIDLALYDPKGSSSGSAVSCD 148
Query: 43 H-------------PDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGK 86
+ P CT + C Y +Y D S T G +++ + G + +
Sbjct: 149 NKFCAATYGSGEKLPGCT----AGKPCEYRAEYGDGSSTAGSFVSDSLQYNQLSGNAQTR 204
Query: 87 AIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIP 144
+FGC G D ++ + AL G++G + S +SQL S +KK FS+CL
Sbjct: 205 HAKANVIFGCGA-QQGGDLESTNQALDGIIGFGQSNTSTLSQLASAGEVKKIFSHCLDTI 263
Query: 145 LPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPN-NFYYLSLKDISIDNERMNFPPDTFD 203
G + + +P ++T + PN + Y ++L+ I + + PP F+
Sbjct: 264 KGGGIFAIGEV--------VQPKVKSTPLL--PNMSHYNVNLQSIDVAGNALQLPPHIFE 313
Query: 204 ITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQ--LCYFLP 261
S + G IIDSG+ LTY VY + + F++ Q IQ LC+
Sbjct: 314 --TSEKRGTIIDSGTTLTYLPELVYKDI---LAAVFQKHQDITF----RTIQGFLCFEYS 364
Query: 262 ETF-NRFPSMAFYFE-DANLRI--------DGENVFIIDYENHFFLLAVAPHD--DLVAL 309
E+ + FP + F+FE D L + +G+N++ + ++N F P D D+V L
Sbjct: 365 ESVDDGFPKITFHFEDDLGLNVYPHDYFFQNGDNLYCLGFQNGGF----QPKDAKDMV-L 419
Query: 310 IGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+G + VYDL ++ + NCS
Sbjct: 420 LGDLVLSNKVVVYDLEKQVIGWTDYNCS 447
>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 472
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 98/361 (27%), Positives = 153/361 (42%), Gaps = 46/361 (12%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
R+ +GTP++ V ++LDTGS +++ +FDP KS ++ I C P C
Sbjct: 131 TRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQADPVFDPTKSRTYAGIPCGAPLCR 190
Query: 48 YF---KCVNEQ--CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C N+ C Y + Y D S T G + ET++ + GC +DN G
Sbjct: 191 RLDSPGCNNKNKVCQYQVSYGDGSFTFGDFSTETLTFR-----RTRVTRVALGCGHDNEG 245
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
A R +SF Q G ++FSYCLV + + +S + FG
Sbjct: 246 LFIGAAGLLGL-----GRGRLSFPVQTGRRFNQKFSYCLVDRSASAKPSS--VVFGDSAV 298
Query: 163 YRRPSTQATKFINHP--NNFYYLSLKDISIDNERM-NFPPDTFDITVSGEGGCIIDSGSV 219
R + + T I +P + FYYL L IS+ + F + +G GG IIDSG+
Sbjct: 299 SR--TARFTPLIKNPKLDTFYYLELLGISVGGSPVRGLSASLFRLDAAGNGGVIIDSGTS 356
Query: 220 LTYFHSDVYWKLHEKF---VSYFERFQLAQLSDCPEPIQLCYFLPE-TFNRFPSMAFYFE 275
+T Y L + F S+ +R L D C+ L T + P++ +F
Sbjct: 357 VTRLTRPAYIALRDAFRVGASHLKRAAEFSLFD------TCFDLSGLTEVKVPTVVLHFR 410
Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
A++ + N I + F A A +++IG+ QQ+ R +DL + F
Sbjct: 411 GADVSLPATNYLIPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRVSFDLAGSRVGFAPRG 470
Query: 336 C 336
C
Sbjct: 471 C 471
>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
Length = 500
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 89/361 (24%), Positives = 147/361 (40%), Gaps = 47/361 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
+V + +GTP+ ++ DTGS + +FDP +SS++ I+C P
Sbjct: 162 VVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYKQQEKLFDPARSSTYANISCAAPA 221
Query: 46 CT--YFK-CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C+ Y K C C+Y ++Y D S + GF A +T+++ G FGC N G
Sbjct: 222 CSDLYIKGCSGGHCLYGVQYGDGSYSIGFFAMDTLTL----SSYDAIKGFRFGCGERNEG 277
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
+A AG+LGL R S Q F++C P + YL FG
Sbjct: 278 LYGEA-----AGLLGLGRGKTSLPVQAYDKYGGVFAHC----FPARSSGTGYLDFGPG-- 326
Query: 163 YRRPSTQAT----KFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
P+ A +++ FYY+ L I + + ++ P F + G I+DSG+
Sbjct: 327 -SLPAVSAKLTTPMLVDNGPTFYYVGLTGIRVGGKLLSIPQSVFTTS-----GTIVDSGT 380
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE-D 276
V+T Y L F S + + + CY F + P+++ F+
Sbjct: 381 VITRLPPAAYSSLRSAFASAMAERGYKK-APALSLLDTCYDFTGMSEVAIPTVSLLFQGG 439
Query: 277 ANLRIDGEN-VFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
A+L + ++ A DD V ++G+ Q + VYD+ ++ F
Sbjct: 440 ASLDVHASGIIYAASVSQACLGFAGNKEDDDVGIVGNTQLKTFGVVYDIGKKVVGFCPGA 499
Query: 336 C 336
C
Sbjct: 500 C 500
>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
Length = 521
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 92/364 (25%), Positives = 149/364 (40%), Gaps = 53/364 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
+V + +GTP+ ++ DTGS + +FDP +SS++ ++C P
Sbjct: 183 VVTIGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYKQQEKLFDPARSSTYANVSCAAPA 242
Query: 46 CT--YFK-CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C+ Y + C C+Y+++Y D S + GF A +T+++ G FGC N G
Sbjct: 243 CSDLYTRGCSGGHCLYSVQYGDGSYSIGFFAMDTLTL----SSYDAVKGFRFGCGERNEG 298
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
+A AG+LGL R S Q F++C LP + YL FG
Sbjct: 299 LFGEA-----AGLLGLGRGKTSLPVQTYDKYGGVFAHC----LPARSSGTGYLDFGPGSP 349
Query: 163 YRRPSTQATKFI--NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
+ Q T + N P FYY+ + I + + ++ P F G I+DSG+V+
Sbjct: 350 AAVGARQTTPMLTDNGP-TFYYVGMTGIRVGGQLLSIPQSVFSTA-----GTIVDSGTVI 403
Query: 221 TYFHSDVYWKLHEKFVS------YFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYF 274
T Y L F S Y + L+ L C + F + P ++ F
Sbjct: 404 TRLPPAAYSSLRSAFASAMAARGYKKAPALSLLDTCYD------FTGMSEVAIPKVSLLF 457
Query: 275 E-DANLRIDGENVFIIDYENHFFL-LAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
+ A L ++ + + L A DD V ++G+ Q + VYD+ + F
Sbjct: 458 QGGAYLDVNASGIMYAASLSQVCLGFAANEDDDDVGIVGNTQLKTFGVVYDIGKKTVGFS 517
Query: 333 KENC 336
C
Sbjct: 518 PGAC 521
>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
Length = 415
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 100/355 (28%), Positives = 149/355 (41%), Gaps = 51/355 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+V L IGTP + V L LDTGS LI+ FDP SS+ +CD C
Sbjct: 90 LVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTLC 149
Query: 47 TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
V ++ +D+ + +G G A G FGC N+G +
Sbjct: 150 QGLP------VASLPRSDK-----------FTFVGAG---ASVPGVAFGCGLFNNGVFKS 189
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM-GYRR 165
G+ G R +S SQL FS+C + ++ L D+ +
Sbjct: 190 NE----TGIAGFGRGPLSLPSQLKV---GNFSHCFTT-ITGAIPSTVLLDLPADLFSNGQ 241
Query: 166 PSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
+ Q T I +P N FYYLSLK I++ + R+ P F + +G GG IIDSG+ +T
Sbjct: 242 GAVQTTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALK-NGTGGTIIDSGTAMTSL 300
Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRF-PSMAFYFEDANLRID 282
+ VY + + F + + +L +S C P + P + +FE A + +
Sbjct: 301 PTRVYRLVRDAFAA---QVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEGATMDLP 357
Query: 283 GEN-VFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
EN VF ++ L V IG+ QQ++ +YDL LSFV C
Sbjct: 358 RENYVFEVEDAGSSILCLAIIEGGEVTTIGNFQQQNMHVLYDLQNSKLSFVPAQC 412
>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 415
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 93/347 (26%), Positives = 153/347 (44%), Gaps = 43/347 (12%)
Query: 6 IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYFKC 51
IGTP V +DTGS L++ IFDP SSS+Q I C
Sbjct: 94 IGTPPFKVFGFVDTGSDLVWLQCEPCKQCYPQITPIFDPSLSSSYQNI----------PC 143
Query: 52 VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGA 111
+++ C ++M+ V +G+ + ET+++ F + GC N G G
Sbjct: 144 LSDTC-HSMRTTSCDV-RGYLSVETLTLDSTTGYSVSFPKTMIGCGYRNTG----TFHGP 197
Query: 112 LAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQAT 171
+G++GL +S SQLG+ I +FSYCL LPN ++S L FG T
Sbjct: 198 SSGIVGLGSGPMSLPSQLGTSIGGKFSYCLGPWLPN---STSKLNFGDAAIVYGDGAMTT 254
Query: 172 KFINH-PNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWK 230
+ + YYL+L+ S+ N+ + F T+ EG +IDSG+ T+ DVY++
Sbjct: 255 PIVKKDAQSGYYLTLEAFSVGNKLIEFGGPTYG---GNEGNILIDSGTTFTFLPYDVYYR 311
Query: 231 LHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDGENVFIID 290
Y L + D +LCY + P + +F+ A++++ + F I
Sbjct: 312 FESAVAEY---INLEHVEDPNGTFKLCYNVAYHGFEAPLITAHFKGADIKLYYISTF-IK 367
Query: 291 YENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+ LA P A+ G+ Q++ Y+L + ++F +C+
Sbjct: 368 VSDGIACLAFIPSQ--TAIFGNVAQQNLLVGYNLVQNTVTFKPVDCT 412
>gi|147858841|emb|CAN78694.1| hypothetical protein VITISV_037475 [Vitis vinifera]
Length = 442
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 93/356 (26%), Positives = 154/356 (43%), Gaps = 40/356 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
+ L IG P V ++LDTGS L + I++ KS S+ ++ C+ P C
Sbjct: 94 LANLSIGNPPTNVYVVLDTGSDLFWIQCEPCDVCYKQKDPIYNRTKSDSYTEMLCNEPPC 153
Query: 47 TYF----KCVNE-QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
+C + C+Y YAD + T G ++E ++ + FGC N
Sbjct: 154 VSLGREGQCSDSGSCLYQTAYADGARTSGLLSYEKVAFTSHYSDEDKTAQVGFGCGLQNL 213
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV-IPLPNGEYTSSYLKFGTD 160
F RDG + G+ +S +S +G + K F+YC I PN +L FG D
Sbjct: 214 NFITSNRDGGVLGLGPGLVSLVSQLSAIGKV-SKSFAYCFGNISNPNA---GGFLVFG-D 268
Query: 161 MGYRRPSTQATKFINHPNNFYYLSLKDIS--IDNERMNFPPDTFDITVSGEGGCIIDSGS 218
Y FYY++L I + R++ +F+ G GG IIDSGS
Sbjct: 269 ATYLNGDMTPMVIAE----FYYVNLLGIGLGVGEPRLDINSSSFERKPDGSGGVIIDSGS 324
Query: 219 VLTYFHSDVYWKLHEKFVSYFER-FQLAQLSDCPEPIQLCY--FLPETFNRFPSMAFYFE 275
L+ F +VY + V ++ + ++ L+ P+ C+ + FP++ Y E
Sbjct: 325 TLSVFPPEVYEVVRNAVVDKLKKGYNISPLTSSPD----CFEGKIERDLPLFPTLVLYLE 380
Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSF 331
+ D ++F+ Y+ F L + L ++IG+ Q+ +F Y+L + LS
Sbjct: 381 STGILNDRWSIFLQRYD-ELFCLGFTSGEGL-SIIGTLAQQSYKFGYNLELSTLSI 434
>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 496
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 103/361 (28%), Positives = 159/361 (44%), Gaps = 52/361 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDC 46
+V + GTP + LILDTGS++ + FDP S ++ +C
Sbjct: 163 LVDVAFGTPPQKFTLILDTGSSITWTQCKPCVRCLKASRRHFDPSASLTYSLGSCIPSTV 222
Query: 47 TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
Y M Y D+S + G +T+++ E +F FGC +N G D
Sbjct: 223 GN--------TYNMTYGDKSTSVGNYGCDTMTL----EHSDVFPKFQFGCGRNNEG---D 267
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
GA G+LGL + +S +SQ S KK FSYC LP + S L FG +
Sbjct: 268 FGSGA-DGMLGLGQGQLSTVSQTASKFKKVFSYC----LPEEDSIGSLL-FGEKATSQSS 321
Query: 167 STQATKFINHP-------NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
S + T +N P + +Y++ L DIS+ N+R+N P F G IIDSG+V
Sbjct: 322 SLKFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLNIPSSVF-----ASPGTIIDSGTV 376
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQ-LSDCPEPIQLCYFLPETFN-RFPSMAFYF-ED 276
+T Y L F ++ L+ + + CY L + P + +F E
Sbjct: 377 ITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGEG 436
Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
A++R++G+ V I + LA A + +L +IG++QQ +YD+ + F C
Sbjct: 437 ADVRLNGKRV-IWGNDASRLCLAFAGNSELT-IIGNRQQVSLTVLYDIQGGRIGFGGNGC 494
Query: 337 S 337
S
Sbjct: 495 S 495
>gi|125527257|gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group]
Length = 484
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 106/407 (26%), Positives = 155/407 (38%), Gaps = 85/407 (20%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA-----------------------------IFDPRK 32
VR +GTP++ LL+ DTGS L + F P K
Sbjct: 89 VRFRVGTPAQPFLLVADTGSDLTWVKCHRAAAAASASPRNASSLPAPAPASPRRTFRPDK 148
Query: 33 SSSFQKINCDHPDC------TYFKCVNEQ--CVYTMKYADQSVTKGFAA--HETISVIGK 82
S ++ I C C + C C Y +Y D S +G TI++ G+
Sbjct: 149 SRTWAPIPCSSATCRESLPFSLAACATPANPCAYDYRYKDGSAARGTVGVDSATIALSGR 208
Query: 83 GEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV 142
KA G + GC+ +G A DG VL L ISF S+ S RFSYCLV
Sbjct: 209 AARKAKLRGVVLGCTTSYNGQSFLASDG----VLSLGYSNISFASRAASRFGGRFSYCLV 264
Query: 143 IPLPNGEYTSSYLKFGTDMGY--RRPS----------------------TQATKFINHPN 178
L T SYL FG + + RRPS Q ++H
Sbjct: 265 DHLAPRNAT-SYLTFGPNPAFSSRRPSEGIASCKPAPAPTPAPAGAPGARQTPLVLDHRT 323
Query: 179 N-FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVS 237
FY +++K +S+ E + P +D V GG I+DSG+ LT Y V+
Sbjct: 324 RPFYAVTVKGVSVAGELLKIPRAVWD--VEQGGGAILDSGTSLTMLAKPAY----RAVVA 377
Query: 238 YFERFQLAQLSDCP-EPIQLCYFL-----PETFNRFPSMAFYFEDANLRIDGENVFIIDY 291
+ +LA L +P CY + P +A +F + ++ID
Sbjct: 378 ALSK-RLAGLPRVTMDPFDYCYNWTSPSGSDVAAPLPMLAVHFAGSARLEPPAKSYVIDA 436
Query: 292 ENHFFLLAV--APHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ + P L ++IG+ Q++ + YDL L F + C
Sbjct: 437 APGVKCIGLQEGPWPGL-SVIGNILQQEHLWEYDLKNRRLRFKRSRC 482
>gi|242073262|ref|XP_002446567.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
gi|241937750|gb|EES10895.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
Length = 453
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 91/382 (23%), Positives = 167/382 (43%), Gaps = 67/382 (17%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
+V+L IGTP +DT S L++ IF+PR SSS+ + C C
Sbjct: 89 LVKLGIGTPQHYFSAAIDTASDLVWLQCQPCVSCYRQLDPIFNPRLSSSYAVVPCSSDTC 148
Query: 47 TYF---KCV---NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
+ +C ++ C Y KY+ +VT G A + ++V G +FH + GCS+ +
Sbjct: 149 SQLDGHRCDEDDDQACRYNYKYSGNAVTNGTLAIDKLAV-----GGNVFHAVVLGCSDSS 203
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
G +G++GL+R +S +SQL +RF YCL P+ T L G
Sbjct: 204 VGGPPPQ----ASGLVGLARGPLSLLSQLSV---RRFMYCLPPPM---SRTPGKLVLGAG 253
Query: 161 MG---YRRPSTQATKFINHPN---NFYYLSLKDISIDNE------RMNFPPDTFDITV-- 206
G R S + T ++ ++YYL+ +++ ++ R PP T
Sbjct: 254 AGADAVRNVSDRVTVTMSSSTRYPSYYYLNFDGLAVGDQTPGTIRRPTSPPATGGGVGGG 313
Query: 207 -------SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-PIQLCY 258
+ G I+D S +++ + +Y +L + E +L + + + LC+
Sbjct: 314 GGDGGSGANAYGMIVDVASTISFLEASLYDELADDL---EEEIRLPRATPSTRLGLDLCF 370
Query: 259 FLPETFN----RFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQ 314
LPE P+++ F+ L ++ + +F+ D ++ V+++G+ Q
Sbjct: 371 ILPEGVGIDRVYVPTVSMSFDGRWLELERDRLFLEDGRMMCLMIG---RTSGVSILGNYQ 427
Query: 315 QRDTRFVYDLNIDLLSFVKENC 336
Q++ +Y+L ++F K +C
Sbjct: 428 QQNMHVLYNLRRGKITFAKASC 449
>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
Length = 471
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 87/356 (24%), Positives = 148/356 (41%), Gaps = 39/356 (10%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
VR+ +G+P + +++D+GS +++ +FDP KS S+ ++C C
Sbjct: 134 VRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCGSSVCD 193
Query: 48 YFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
+ C + C Y + Y D S TKG A ET++ K + GC + N G
Sbjct: 194 RIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTF-----AKTVVRNVAMGCGHRNRGMF 248
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
A ++SF+ QL F YCLV G ++ L FG +
Sbjct: 249 IGAAGLLGI-----GGGSMSFVGQLSGQTGGAFGYCLV---SRGTDSTGSLVFGREA--L 298
Query: 165 RPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
+ +P +FYY+ LK + + R+ P FD+T +G+GG ++D+G+ +T
Sbjct: 299 PVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTR 358
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYF-EDANLR 280
+ Y + F S A CY L + R P+++FYF E L
Sbjct: 359 LPTGAYAAFRDGFKSQTANLPRASGVSI---FDTCYDLSGFVSVRVPTVSFYFTEGPVLT 415
Query: 281 IDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ N + ++ + A A +++IG+ QQ + +D + F C
Sbjct: 416 LPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFGPNVC 471
>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
Length = 443
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 97/381 (25%), Positives = 158/381 (41%), Gaps = 67/381 (17%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
+V + +GTP++ + ++ DTGS L + +F P SS+F + C P
Sbjct: 86 VVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYHQQDPLFAPSSSSTFSAVRCGEP 145
Query: 45 DCTYFK--CV----NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAI------FHGA 92
+C + C +++C Y + Y D+S T G ++T+++ A G
Sbjct: 146 ECPRARQSCSSSPGDDRCPYEVVYGDKSRTVGHLGNDTLTLGTTPSTNASENNSNKLPGF 205
Query: 93 LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS 152
+FGC +N G G G+ GL R +S SQ + FSYCL N
Sbjct: 206 VFGCGENNTGLF-----GKADGLFGLGRGKVSLSSQAAGKYGEGFSYCLPSSSSNAH--- 257
Query: 153 SYLKFGTDMGYRRPSTQATKF---INHPN--NFYYLSLKDISIDNE--RMNFPPDTFDIT 205
YL GT P+ +F +N N +FYY+ L I + +++ P +
Sbjct: 258 GYLSLGTPA----PAPAHARFTPMLNRSNTPSFYYVKLVGIRVAGRAIKVSSRPALW--- 310
Query: 206 VSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERF---QLAQLSDCPEPIQLCYFLPE 262
G I+DSG+V+T Y L F+S ++ + +LS + CY
Sbjct: 311 ---PAGLIVDSGTVITRLAPRAYSALRTAFLSAMGKYGYKRAPRLSI----LDTCYDFTA 363
Query: 263 TFN---RFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDD--LVALIGSQQQR 316
N P++A F A + +D V + LA AP+ + ++G+ QQR
Sbjct: 364 HANATVSIPAVALVFAGGATISVDFSGVLYVAKVAQ-ACLAFAPNGNGRSAGILGNTQQR 422
Query: 317 DTRFVYDLNIDLLSFVKENCS 337
VYD+ + F + CS
Sbjct: 423 TVAVVYDVGRQKIGFAAKGCS 443
>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
Length = 470
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 94/357 (26%), Positives = 139/357 (38%), Gaps = 65/357 (18%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
+V +GTP L +DTGS L + +FDP +SSS+ + C
Sbjct: 138 VVTASLGTPGMAQTLEVDTGSDLSWVQCKPCAAPSCYRQKDPLFDPAQSSSYAAVPCGRS 197
Query: 45 DCTYF-----KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
C C QC Y + Y D S T G + +T+++ A G LFGC +
Sbjct: 198 ACAGLGIYASACSAAQCGYVVSYGDGSNTTGVYSSDTLTL----AANATVQGFLFGCGHA 253
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
G D G+LG R S + Q FSYC LP T+ YL G
Sbjct: 254 QSGGLFTGID----GLLGFGREQPSLVQQTAGAYGGVFSYC----LPTKSSTTGYLTLGG 305
Query: 160 DMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
G P T+ + PN +Y + L IS+ + ++ P F G ++D+G
Sbjct: 306 PSGV-APGFSTTQLLPSPNAPTYYVVMLTGISVGGQPLSVPASAF------AAGTVVDTG 358
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL---CY-FLPETFNRFPSMAFY 273
+V+T Y L F S + A PI + CY F S+A
Sbjct: 359 TVITRLPPAAYAALRSAFRSGMASYPSA------PPIGILDTCYSFAGYGTVNLTSVALT 412
Query: 274 FED-ANLRIDGENVFIIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFVYDLNID 327
F A + + + + F LA A D +A++G+ QQR +++ ID
Sbjct: 413 FSSGATMTLGADGIM------SFGCLAFASSGSDGSMAILGNVQQRS----FEVRID 459
>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 103/366 (28%), Positives = 161/366 (43%), Gaps = 57/366 (15%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ + IGTP L I DTGS L +A IF+P KS+SF + C+ C
Sbjct: 93 LMSVSIGTPPVDYLGIADTGSDLTWAQCLPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTC 152
Query: 47 TYF---KC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C V C Y+ Y D++ +KG E I+ IG K++ GC + + G
Sbjct: 153 HAVDDGHCGVQGVCDYSYTYGDRTYSKGDLGFEKIT-IGSSSVKSV-----IGCGHASSG 206
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLG--SIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
G +GV+GL +S +SQ+ S I +RFSYCL L + + + FG +
Sbjct: 207 -----GFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLL---SHANGKINFGEN 258
Query: 161 MGYRRPSTQATKFINHPN-NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
P +T I+ +YY++L+ ISI NER + + +G IIDSG+
Sbjct: 259 AVVSGPGVVSTPLISKNTVTYYYITLEAISIGNER--------HMAFAKQGNVIIDSGTT 310
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSM------AFY 273
LT ++Y + S + + ++ D + LC+ + N S+ A +
Sbjct: 311 LTILPKELYDGV---VSSLLKVVKAKRVKDPHGSLDLCF--DDGINAAASLGIPVITAHF 365
Query: 274 FEDANLRIDGENVF--IIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSF 331
AN+ + N F + D N L A +P + +IG+ Q + YDL LSF
Sbjct: 366 SGGANVNLLPINTFRKVADNVNCLTLKAASPTTEF-GIIGNLAQANFLIGYDLEAKRLSF 424
Query: 332 VKENCS 337
C+
Sbjct: 425 KPTVCA 430
>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
Length = 464
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 90/361 (24%), Positives = 143/361 (39%), Gaps = 51/361 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
++ + IGTP+ ++ +DTGS + + +FDP S+++ +C
Sbjct: 130 VITVTIGTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKDKLFDPAMSATYSAFSCGSA 189
Query: 45 DCTYFK-----CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
C C+ QC Y +KY D S T G +T+S+ K+ FGCS+
Sbjct: 190 QCAQLGDEGNGCLKSQCQYIVKYGDGSNTAGTYGSDTLSLTSSDAVKSF----QFGCSHR 245
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
GF G L G++GL T S +SQ + K FSYCL P +G +L G
Sbjct: 246 AAGFV-----GELDGLMGLGGDTESLVSQTAATYGKAFSYCLPPPSSSG---GGFLTLGA 297
Query: 160 DMGYRRPSTQATKFIN-HPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
G T + FY + L+ I++ +N P F G ++DSG+
Sbjct: 298 AGGASSSRYSHTPMVRFSVPTFYGVFLQGITVAGTMLNVPASVF------SGASVVDSGT 351
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRF--PSMAFYF-E 275
V+T Y L F + + A + C+ FN P++ F
Sbjct: 352 VITQLPPTAYQALRTAFKKEMKAYPSAAPVGS---LDTCFDF-SGFNTITVPTVTLTFSR 407
Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
A + +D + Y A A HD ++G+ QQR ++D+ + F
Sbjct: 408 GAAMDLDISGIL---YAGCLAFTATA-HDGDTGILGNVQQRTFEMLFDVGGRTIGFRSGA 463
Query: 336 C 336
C
Sbjct: 464 C 464
>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 97/378 (25%), Positives = 147/378 (38%), Gaps = 59/378 (15%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCD 42
V+ +GTP++ +L+ DTGS L + +F P S S+ I C
Sbjct: 112 VQFRVGTPAQPFVLVADTGSDLTWVKCRGRRASSPDASPLASPRVFRPANSKSWAPIPCS 171
Query: 43 HPDCTY---FKCVN--------EQCVYTMKYADQSVTKGFAAHETISVIGKGEG---KAI 88
C F N C Y +Y D+S +G + ++ G G KA
Sbjct: 172 SDTCKSYVPFSLANCSAGTTPPAPCGYDYRYKDKSSARGVVGTDAATIALSGSGSDRKAK 231
Query: 89 FHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNG 148
+ GC+ G + DG VL L ISF S+ + RFSYCLV L
Sbjct: 232 LQEVVLGCTTSYDGQSFQSSDG----VLSLGNSNISFASRAAARFGGRFSYCLVDHLAPR 287
Query: 149 EYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSG 208
T SYL FG PS FY +++ +S+ + +N P + +D V
Sbjct: 288 NAT-SYLTFGPVGAAHSPSRTPLLLDAQVAPFYAVTVDAVSVAGKALNIPAEVWD--VKK 344
Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFP 268
GG I+DSG+ LT + Y + R + +P + CY T R P
Sbjct: 345 NGGAILDSGTSLTILATPAYKAVVAALSKQLARVPRVTM----DPFEYCYNW--TATRRP 398
Query: 269 SMAFYFE-----DANLRIDGENVFIIDYENHFFLL----AVAPHDDLVALIGSQQQRDTR 319
E A LR ++ ++ID + V P V++IG+ Q++
Sbjct: 399 PAVPRLEVRFAGSARLRPPTKS-YVIDAAPGVKCIGLQEGVWPG---VSVIGNILQQEHL 454
Query: 320 FVYDLNIDLLSFVKENCS 337
+ +DL L F + C+
Sbjct: 455 WEFDLANRWLRFQESRCA 472
>gi|18405138|ref|NP_565911.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
gi|13877759|gb|AAK43957.1|AF370142_1 unknown protein [Arabidopsis thaliana]
gi|15293231|gb|AAK93726.1| unknown protein [Arabidopsis thaliana]
gi|20196976|gb|AAB87120.2| expressed protein [Arabidopsis thaliana]
gi|20197046|gb|AAM14894.1| expressed protein [Arabidopsis thaliana]
gi|330254616|gb|AEC09710.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 83/324 (25%), Positives = 145/324 (44%), Gaps = 56/324 (17%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY----------AIFDPRKSSSFQKINCDHPDCTY--- 48
V L +G P + + ++LDTGS L + ++F+P SS++ + C P C
Sbjct: 67 VTLAVGDPPQNISMVLDTGSELSWLHCKKSPNLGSVFNPVSSSTYSPVPCSSPICRTRTR 126
Query: 49 -----FKC--VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC--SND 99
C C + YAD + +G AHET + G G LFGC S
Sbjct: 127 DLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVI-----GSVTRPGTLFGCMDSGL 181
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG- 158
+ +EDA+ G++G++R ++SF++QLG +FSYC+ +G +S +L G
Sbjct: 182 SSNSEEDAKS---TGLMGMNRGSLSFVNQLGF---SKFSYCI-----SGSDSSGFLLLGD 230
Query: 159 ------TDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
+ Y Q+T Y + L+ I + ++ ++ P F +G G
Sbjct: 231 ASYSWLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQT 290
Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-----PIQLCYFLPET---- 263
++DSG+ T+ VY L +F++ + + +L D P+ + LCY + T
Sbjct: 291 MVDSGTQFTFLMGPVYTALKNEFIT--QTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPN 348
Query: 264 FNRFPSMAFYFEDANLRIDGENVF 287
F+ P ++ F A + + G+ +
Sbjct: 349 FSGLPMVSLMFRGAEMSVSGQKLL 372
>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
Length = 452
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 87/371 (23%), Positives = 158/371 (42%), Gaps = 60/371 (16%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINC----- 41
V++ +G+P+K +I+DTGS+ + +F+P S +++ + C
Sbjct: 105 VKMGLGSPTKYYTMIVDTGSSFSWLQCQPCTIYCHIQEDPVFNPSASKTYKTVPCSSSQC 164
Query: 42 --------DHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL 93
+ P C+ + CVY Y D S + G+ + + +++ +
Sbjct: 165 SSLKSATLNEPTCSK---QSNACVYKASYGDSSFSLGYLSQDVLTLTPS----QTLSSFV 217
Query: 94 FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPL--PNGEYT 151
+GC DN G G G++GL+ +S +SQL FSYCL PN
Sbjct: 218 YGCGQDNQGLF-----GRTDGIIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSP-K 271
Query: 152 SSYLKFGTDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGE 209
+L GT S + T + +PNN Y++ L+ I++ + ++ +
Sbjct: 272 EGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVPT--- 328
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYF-ERFQLAQLSDCPEPIQLCY--FLPETFNR 266
IIDSG+V+T + VY L +V+ +++Q A + C+ L
Sbjct: 329 ---IIDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISL---LDTCFKGSLAGISEV 382
Query: 267 FPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLN 325
P + F+ A+L++ G N +++ E LA+A +A+IG+ QQ+ + YD+
Sbjct: 383 APDIRIIFKGGADLQLKGHNS-LVELETGITCLAMAGSSS-IAIIGNYQQQTVKVAYDVG 440
Query: 326 IDLLSFVKENC 336
+ F C
Sbjct: 441 NSRVGFAPGGC 451
>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 452
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 87/371 (23%), Positives = 158/371 (42%), Gaps = 60/371 (16%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINC----- 41
V++ +G+P+K +I+DTGS+ + +F+P S +++ + C
Sbjct: 105 VKMGLGSPTKYYTMIVDTGSSFSWLQCQPCTIYCHIQEDPVFNPSASKTYKTVPCSSSQC 164
Query: 42 --------DHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL 93
+ P C+ + CVY Y D S + G+ + + +++ +
Sbjct: 165 SSLKSATLNEPTCSK---QSNACVYKASYGDSSFSLGYLSQDVLTLTPS----QTLSSFV 217
Query: 94 FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPL--PNGEYT 151
+GC DN G G G++GL+ +S +SQL FSYCL PN
Sbjct: 218 YGCGQDNQGLF-----GRTDGIIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSP-K 271
Query: 152 SSYLKFGTDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGE 209
+L GT S + T + +PNN Y++ L+ I++ + ++ +
Sbjct: 272 EGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVPT--- 328
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYF-ERFQLAQLSDCPEPIQLCY--FLPETFNR 266
IIDSG+V+T + VY L +V+ +++Q A + C+ L
Sbjct: 329 ---IIDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISL---LDTCFKGSLAGISEV 382
Query: 267 FPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLN 325
P + F+ A+L++ G N +++ E LA+A +A+IG+ QQ+ + YD+
Sbjct: 383 APDIRIIFKGGADLQLKGHNS-LVELETGITCLAMAGSSS-IAIIGNYQQQTVKVAYDVG 440
Query: 326 IDLLSFVKENC 336
+ F C
Sbjct: 441 NSRVGFAPGGC 451
>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
Length = 420
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 98/374 (26%), Positives = 153/374 (40%), Gaps = 81/374 (21%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
+ + +GTP ++ DTGS LI+ F P SS+F K+ C C
Sbjct: 88 MNISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLPCTSSFCQ 147
Query: 48 YF-----KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
+ C CVY KY T G+ A ET+ V G A F FGCS +N
Sbjct: 148 FLPNSIRTCNATGCVYNYKYG-SGYTAGYLATETLKV-----GDASFPSVAFGCSTEN-- 199
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
GL ++ + RFSYCL G +S + FG+
Sbjct: 200 --------------GLGQLDLGV---------GRFSYCLRSGSAAG---ASPILFGSLAN 233
Query: 163 YRRPSTQATKFINHPN---NFYYLSLKDISIDNERMNFPPDTFDITVSG-EGGCIIDSGS 218
+ Q+T F+N+P ++YY++L I++ + TF T +G GG I+DSG+
Sbjct: 234 LTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGT 293
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLS--DCPEPIQLCY----------FLPETFNR 266
LTY D Y + + F+S Q A ++ + + LC+ +P R
Sbjct: 294 TLTYLAKDGYEMVKQAFLS-----QTADVTTVNGTRGLDLCFKSTGGGGGGIAVPSLVLR 348
Query: 267 FPSMAFYFED---ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYD 323
F A Y A + D + + ++ A D +++IG+ Q D +YD
Sbjct: 349 FDGGAEYAVPTYFAGVETDSQGSVTV----ACLMMLPAKGDQPMSVIGNVMQMDMHLLYD 404
Query: 324 LNIDLLSFVKENCS 337
L+ + SF +C+
Sbjct: 405 LDGGIFSFAPADCA 418
>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
Short=AtASPG2; Flags: Precursor
gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
thaliana]
gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 470
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 87/356 (24%), Positives = 148/356 (41%), Gaps = 39/356 (10%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
VR+ +G+P + +++D+GS +++ +FDP KS S+ ++C C
Sbjct: 133 VRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCGSSVCD 192
Query: 48 YFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
+ C + C Y + Y D S TKG A ET++ K + GC + N G
Sbjct: 193 RIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTF-----AKTVVRNVAMGCGHRNRGMF 247
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
A ++SF+ QL F YCLV G ++ L FG +
Sbjct: 248 IGAAGLLGI-----GGGSMSFVGQLSGQTGGAFGYCLV---SRGTDSTGSLVFGREA--L 297
Query: 165 RPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
+ +P +FYY+ LK + + R+ P FD+T +G+GG ++D+G+ +T
Sbjct: 298 PVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTR 357
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYF-EDANLR 280
+ Y + F S A CY L + R P+++FYF E L
Sbjct: 358 LPTAAYVAFRDGFKSQTANLPRASGVSI---FDTCYDLSGFVSVRVPTVSFYFTEGPVLT 414
Query: 281 IDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ N + ++ + A A +++IG+ QQ + +D + F C
Sbjct: 415 LPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470
>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
gi|223948487|gb|ACN28327.1| unknown [Zea mays]
Length = 434
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 89/364 (24%), Positives = 144/364 (39%), Gaps = 54/364 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
+V + +GTP++ ++ DTGS + +FDP KS+++ I+C
Sbjct: 97 VVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKEPLFDPTKSATYANISCSSSY 156
Query: 46 CTYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C+ C C+Y ++Y D S T GF A +T+++ FGC N G
Sbjct: 157 CSDLYVSGCSGGHCLYGIQYGDGSYTIGFYAQDTLTLA-----YDTIKNFRFGCGEKNRG 211
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
G AG+LGL R S Q F+YC LP + +L D+G
Sbjct: 212 L-----FGRAAGLLGLGRGKTSLPVQAYDKYGGVFAYC----LPATSAGTGFL----DLG 258
Query: 163 YRRPSTQA---TKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
P+ A ++ FYY+ + I + + P F G ++DSG+V
Sbjct: 259 PGAPAANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFSTA-----GTLVDSGTV 313
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPETFN---RFPSMAFYF 274
+T Y L F + Q S P + CY L P+++ F
Sbjct: 314 ITRLPPSAYAPLRSAFS---KAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLVF 370
Query: 275 E-DANLRIDGENV-FIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
+ A L +D + ++ D A D VA++G+ QQ+ +YD+ ++ F
Sbjct: 371 QGGACLDVDASGILYVADVSQACLAFAPNADDTDVAIVGNTQQKTHGVLYDIGKKIVGFA 430
Query: 333 KENC 336
C
Sbjct: 431 PGAC 434
>gi|357166728|ref|XP_003580821.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 95/368 (25%), Positives = 154/368 (41%), Gaps = 44/368 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+VR+ IG+P L+ DTGS +I+ +FDP S+SF + C+ C
Sbjct: 124 LVRVGIGSPPLEQHLVADTGSDVIWVQCSPCSDCYAQGDPLFDPANSASFSPVPCNSGVC 183
Query: 47 --------TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSN 98
+ +C Y + Y D+S T G A ET+++ G E G GC +
Sbjct: 184 RAAARYSSSSCGGGGGECEYKVSYGDKSYTNGVLALETLTLDGGTE----VQGVAMGCGH 239
Query: 99 DNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
+N G +A AG+LGL +S + QLG FSYCL S L G
Sbjct: 240 ENRGLFAEA-----AGLLGLGWGPMSLVGQLGGAAGGAFSYCLAGYYSGEGSGSGSLVLG 294
Query: 159 TDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
+ + + +P+ +FYY+ + + + ER+ FD+ G GG ++D+
Sbjct: 295 REDAAPTGAVW-VPLVRNPDAPSFYYVGVNGLGVAGERLQLQDGLFDLGDDGGGGVVMDT 353
Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYF- 274
G+ +T ++ Y L F FE + A + CY L + R P++A YF
Sbjct: 354 GTAVTRLPAEAYAALRGAFAGAFE--EGAPRAPGVSLFDTCYDLSGYASVRVPTVALYFG 411
Query: 275 ------EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDL 328
E A+L + N+ + + + LA A +++G+ QQ+ D
Sbjct: 412 GGGQGQEAASLTLPARNLLVPVDDGGTYCLAFAAVASGPSILGNIQQQGIEITVDSASGY 471
Query: 329 LSFVKENC 336
+ F C
Sbjct: 472 VGFGPATC 479
>gi|449461377|ref|XP_004148418.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449518059|ref|XP_004166061.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 436
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 90/362 (24%), Positives = 158/362 (43%), Gaps = 48/362 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA------------IFDPRKSSSFQKINCDHPDCTY 48
+VR+ +GTP + + ++LDT + +A F + SS+F ++C P+CT
Sbjct: 96 VVRVQLGTPGQTMYMVLDTSNDAAWAPCSGCIGCSSTTTFSAQNSSTFATLDCSKPECTQ 155
Query: 49 FKCV------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
+ + N C++ Y S +++ + G + FGC + G
Sbjct: 156 ARGLSCPTTGNVDCLFNQTYGGDSTFSATLVQDSLHL-----GPNVIPNFSFGCISSASG 210
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
+ G++GL R +S ISQ GS+ FSYCL P Y S LK G +G
Sbjct: 211 SSIPPQ-----GLMGLGRGPLSLISQSGSLYSGLFSYCL--PSFKSYYFSGSLKLG-PVG 262
Query: 163 YRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
+ + + T +++P+ + YY++L IS+ + P+ + G IIDSG+V+
Sbjct: 263 QPK-AIRTTPLLHNPHRPSLYYVNLTGISVGRVLVPISPELLAFDPNTGAGTIIDSGTVI 321
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRF--PSMAFYFEDAN 278
T F +Y + ++F R Q+ C+ T N P++ + +
Sbjct: 322 TRFVPAIYTAVRDEF-----RKQVGGSFSPLGAFDTCF---ATNNEVSAPAITLHLSGLD 373
Query: 279 LRIDGENVFIIDYENHFFLLAVAPHD----DLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
L++ EN I LA+A +V +I + QQ++ R ++D+N L +E
Sbjct: 374 LKLPMENSLIHSSAGSLACLAMAAAPNNVNSVVNVIANLQQQNHRILFDINNSKLGIARE 433
Query: 335 NC 336
C
Sbjct: 434 LC 435
>gi|26451756|dbj|BAC42973.1| unknown protein [Arabidopsis thaliana]
Length = 442
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 83/324 (25%), Positives = 145/324 (44%), Gaps = 56/324 (17%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY----------AIFDPRKSSSFQKINCDHPDCTY--- 48
V L +G P + + ++LDTGS L + ++F+P SS++ + C P C
Sbjct: 67 VTLAVGDPPQNISMVLDTGSELSWLHCKKSPNLGSVFNPVSSSTYSPVPCSSPICRTRTR 126
Query: 49 -----FKC--VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC--SND 99
C C + YAD + +G AHET + G G LFGC S
Sbjct: 127 DLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVI-----GSVTRPGTLFGCMDSGL 181
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG- 158
+ +EDA+ G++G++R ++SF++QLG +FSYC+ +G +S +L G
Sbjct: 182 SSNSEEDAKS---TGLMGMNRGSLSFVNQLGF---SKFSYCI-----SGSDSSVFLLLGD 230
Query: 159 ------TDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
+ Y Q+T Y + L+ I + ++ ++ P F +G G
Sbjct: 231 ASYSWLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQT 290
Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-----PIQLCYFLPET---- 263
++DSG+ T+ VY L +F++ + + +L D P+ + LCY + T
Sbjct: 291 MVDSGTQFTFLMGPVYTALKNEFIT--QTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPN 348
Query: 264 FNRFPSMAFYFEDANLRIDGENVF 287
F+ P ++ F A + + G+ +
Sbjct: 349 FSGLPMVSLMFRGAEMSVSGQKLL 372
>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 91/366 (24%), Positives = 149/366 (40%), Gaps = 59/366 (16%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
+V + +GTP K L DTGS L + FDP S+S++ ++C
Sbjct: 141 VVTVGLGTPKKDFTLSFDTGSDLTWTQCEPCLGGCFPQNQPKFDPTTSTSYKNVSCSSEF 200
Query: 46 CTYFK--------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
C C++ C+Y ++Y T GF A ET+++ +F LFGCS
Sbjct: 201 CKLIAEGNYPAQDCISNTCLYGIQYGS-GYTIGFLATETLAI----ASSDVFKNFLFGCS 255
Query: 98 NDNHG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
++ G F+ G+LGL R I+ SQ + K FSYC LP ++ +L
Sbjct: 256 EESRGTFN------GTTGLLGLGRSPIALPSQTTNKYKNLFSYC----LPASPSSTGHLS 305
Query: 157 FGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
FG ++ ST + + Y L+ IS+ + P I+ + IIDS
Sbjct: 306 FGVEVSQAAKSTPISPKLKQ---LYGLNTVGISVRGREL---PINGSISRT-----IIDS 354
Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN---RFPSMAFY 273
G+ T+ S Y L F + L + +P CY N P ++ +
Sbjct: 355 GTTFTFLPSPTYSALGSAFREMMANYTLTNGTSSFQP---CYDFSNIGNGTLTIPGISIF 411
Query: 274 FE---DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
FE + + + G + + + A D A+ G+ QQ+ +YD+ ++
Sbjct: 412 FEGGVEVEIDVSGIMIPVNGLKEVCLAFADTGSDSDFAIFGNYQQKTYEVIYDVAKGMVG 471
Query: 331 FVKENC 336
F + C
Sbjct: 472 FAPKGC 477
>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
Length = 464
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 98/356 (27%), Positives = 154/356 (43%), Gaps = 44/356 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
+V + IGTP + L+ DTGS L + F+P SS++Q ++C P
Sbjct: 133 IVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSTYQNVSCSSPM 192
Query: 46 CTYFK-CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG-F 103
C + C CVY++ Y D+S T+GF A E ++ + FGC +N G F
Sbjct: 193 CEDAESCSASNCVYSIGYGDKSFTQGFLAKEKFTLTNSD----VLEDVYFGCGENNQGLF 248
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
D A L +S +Q + FSYCL N ++ +L FG+ G
Sbjct: 249 DGVAGLLGLG------PGKLSLPAQTTTTYNNIFSYCLPSFTSN---STGHLTFGS-AGI 298
Query: 164 RRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
S + T + P+ F Y + + IS+ ++ + P++F G IIDSG+V T
Sbjct: 299 SE-SVKFTPISSFPSAFNYGIDIIGISVGDKELAITPNSFSTE-----GAIIDSGTVFTR 352
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFEDAN-LR 280
+ VY +L F E+ + + CY F +P++AF F +
Sbjct: 353 LPTKVYAELRSVFK---EKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGGTVVE 409
Query: 281 IDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+DG + + + LA A +DDL A+ G+ QQ VYD+ + F C
Sbjct: 410 LDGSGIS-LPIKISQVCLAFAGNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464
>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 103/357 (28%), Positives = 145/357 (40%), Gaps = 41/357 (11%)
Query: 2 VRLFIGTPSKGVLLILDTGSAL-----------------IYAIFDPRKSSSFQKINCDHP 44
R+ +G P + + DTGS + I IFDP+ SSS+ ++CD
Sbjct: 186 ARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDPKSSSSYSPLSCDSE 245
Query: 45 DCTYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
C C C+Y ++Y D S T G A ET S GC +DN
Sbjct: 246 QCHLLDEAACDANSCIYEVEYGDGSFTVGELATETFSFRHSNS----IPNLPIGCGHDNE 301
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
G A IS SQL + FSYCLV +SS L F D
Sbjct: 302 GLFVGADGLIGL-----GGGAISLSSQLEAT---SFSYCLV---DLDSESSSTLDFNADQ 350
Query: 162 GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
++ K P F Y+ + +S+ + + +F+I SG GG I+DSG+ +T
Sbjct: 351 PSDSLTSPLVKNDRFPT-FRYVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSGTTIT 409
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFEDAN-L 279
SDVY L + FV + A P CY L N P++AF N L
Sbjct: 410 EIPSDVYDVLRDAFVGLTKNLPPAPGV---SPFDTCYDLSSQSNVEVPTIAFILPGENSL 466
Query: 280 RIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
++ +N I F LA P +++IG+ QQ+ R YDL L+ F + C
Sbjct: 467 QLPAKNCLIQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANSLVGFSTDKC 523
>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
Length = 499
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 89/364 (24%), Positives = 144/364 (39%), Gaps = 54/364 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
+V + +GTP++ ++ DTGS + +FDP KS+++ I+C
Sbjct: 162 VVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKEPLFDPTKSATYANISCSSSY 221
Query: 46 CTYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C+ C C+Y ++Y D S T GF A +T+++ FGC N G
Sbjct: 222 CSDLYVSGCSGGHCLYGIQYGDGSYTIGFYAQDTLTLA-----YDTIKNFRFGCGEKNRG 276
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
G AG+LGL R S Q F+YC LP + +L D+G
Sbjct: 277 L-----FGRAAGLLGLGRGKTSLPVQAYDKYGGVFAYC----LPATSAGTGFL----DLG 323
Query: 163 YRRPSTQA---TKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
P+ A ++ FYY+ + I + + P F G ++DSG+V
Sbjct: 324 PGAPAANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFSTA-----GTLVDSGTV 378
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPETFN---RFPSMAFYF 274
+T Y L F + Q S P + CY L P+++ F
Sbjct: 379 ITRLPPSAYAPLRSAFS---KAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLVF 435
Query: 275 E-DANLRIDGENV-FIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
+ A L +D + ++ D A D VA++G+ QQ+ +YD+ ++ F
Sbjct: 436 QGGACLDVDASGILYVADVSQACLAFAPNADDTDVAIVGNTQQKTHGVLYDIGKKIVGFA 495
Query: 333 KENC 336
C
Sbjct: 496 PGAC 499
>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 460
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 89/366 (24%), Positives = 156/366 (42%), Gaps = 59/366 (16%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPDC 46
++L +G+P K +ILDTGS+L + +F+P S++++ + C +C
Sbjct: 122 LKLGLGSPPKYYTMILDTGSSLSWLQCKPCVVYCHSQVDPLFEPSASNTYRPLYCSSSEC 181
Query: 47 TYFKCVNEQ---------CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
+ K CVYT Y D S + G+ + + +++ +GC
Sbjct: 182 SLLKAATLNDPLCTASGVCVYTASYGDASYSMGYLSRDLLTL----TPSQTLPSFTYGCG 237
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
DN G G AG++GL+R +S ++QL FSYCL +G +L
Sbjct: 238 QDNEGLF-----GKAAGIVGLARDKLSMLAQLSPKYGYAFSYCLPTSTSSG---GGFL-- 287
Query: 158 GTDMGYRRPST-QATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
+G PS+ + T I + N Y+L L I++ + + + II
Sbjct: 288 --SIGKISPSSYKFTPMIRNSQNPSLYFLRLAAITVAGRPVGVAAAGYQVPT------II 339
Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLP-ETFNRFPSMA 271
DSG+V+T +Y L E FV R + P + C+ ++ + P +
Sbjct: 340 DSGTVVTRLPISIYAALREAFVKIMSR----RYEQAPAYSILDTCFKGSLKSMSGAPEIR 395
Query: 272 FYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
F+ A+L + N+ +I+ + LA A + +A+IG+ QQ+ YD++ +
Sbjct: 396 MIFQGGADLSLRAPNI-LIEADKGIACLAFASSNQ-IAIIGNHQQQTYNIAYDVSASKIG 453
Query: 331 FVKENC 336
F C
Sbjct: 454 FAPGGC 459
>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
thaliana]
gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 464
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 98/356 (27%), Positives = 155/356 (43%), Gaps = 44/356 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
+V + IGTP + L+ DTGS L + F+P SS++Q ++C P
Sbjct: 133 IVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSTYQNVSCSSPM 192
Query: 46 CTYFK-CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG-F 103
C + C CVY++ Y D+S T+GF A E ++ + FGC +N G F
Sbjct: 193 CEDAESCSASNCVYSIVYGDKSFTQGFLAKEKFTLTNSD----VLEDVYFGCGENNQGLF 248
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
D A L +S +Q + FSYCL N ++ +L FG+ G
Sbjct: 249 DGVAGLLGLG------PGKLSLPAQTTTTYNNIFSYCLPSFTSN---STGHLTFGS-AGI 298
Query: 164 RRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
S + T + P+ F Y + + IS+ ++ + P++F G IIDSG+V T
Sbjct: 299 SE-SVKFTPISSFPSAFNYGIDIIGISVGDKELAITPNSFSTE-----GAIIDSGTVFTR 352
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFEDAN-LR 280
+ VY +L F E+ + + CY F +P++AF F + +
Sbjct: 353 LPTKVYAELRSVFK---EKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGSTVVE 409
Query: 281 IDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+DG + + + LA A +DDL A+ G+ QQ VYD+ + F C
Sbjct: 410 LDGSGIS-LPIKISQVCLAFAGNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464
>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 392
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 96/358 (26%), Positives = 154/358 (43%), Gaps = 52/358 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+++L +GTP + +DTGS LI+ IFDP SS+F++ C+
Sbjct: 62 LMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFKEKRCN---- 117
Query: 47 TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
C Y + YAD + +KG A ET+++ + GC +++ F
Sbjct: 118 ------GNSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCGHNSSWFKP- 170
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
+G++GLS S I+Q+G SYC +S + FGT+
Sbjct: 171 ----TFSGMVGLSWGPSSLITQMGGEYPGLMSYCF------ASQGTSKINFGTNAIVAGD 220
Query: 167 STQATK--FINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
+T YYL+L +S+ + + TF + EG IIDSG+ LTYF
Sbjct: 221 GVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFH---ALEGNIIIDSGTTLTYFP 277
Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE-DANLRIDG 283
Y L + V ++ + +D LCY+ +T + FP + +F A+L +D
Sbjct: 278 VS-YCNLVREAVDHY--VTAVRTADPTGNDMLCYYT-DTIDIFPVITMHFSGGADLVLDK 333
Query: 284 ENVFIIDYENHFFLLAV----APHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
N++I F LA+ P D A+ G++ Q + YD + L+ F NCS
Sbjct: 334 YNMYIETITRGTFCLAIICNNPPQD---AIFGNRAQNNFLVGYDSSSLLVFFSPTNCS 388
>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 103/362 (28%), Positives = 164/362 (45%), Gaps = 43/362 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ +FIGTP V+ I DTGS L + IF+PR+SSS++K++C C
Sbjct: 91 LMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSCASDTC 150
Query: 47 TYFKCVN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
+ + + C Y Y D+S T G A + I++ G + GC + N
Sbjct: 151 RSLESYHCGPDLQSCSYGYSYGDRSFTYGDLASDQITI-----GSFKLPKTVIGCGHQNG 205
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYTSSYLKFGT 159
G G +G++GL ++S +SQ+ +I +K RFSYCL N T + + FG
Sbjct: 206 G----TFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPTFFSNANITGT-ISFGR 260
Query: 160 DMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
+T + P+ FY+L+L+ IS+ +R F ++ G IIDSG+
Sbjct: 261 KAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKR--FKAANGISAMTNHGNIIIDSGT 318
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPI-QLCYFLPETFN-RFPSMAFYFE- 275
LT +Y+ + S R A+ D P I +LCY + + P + +F
Sbjct: 319 TLTLLPRSLYYGV----FSTLARVIKAKRVDDPSGILELCYSAGQVDDLNIPIITAHFAG 374
Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
A++++ N F +N L AP VA+ G+ Q + YDL LSF +
Sbjct: 375 GADVKLLPVNTFAPVADN-VTCLTFAPATQ-VAIFGNLAQINFEVGYDLGNKRLSFEPKL 432
Query: 336 CS 337
C+
Sbjct: 433 CA 434
>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
Length = 440
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 89/360 (24%), Positives = 162/360 (45%), Gaps = 37/360 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++++ IGTP V I DTGS L++ +FDP KS+SF++++C+ C
Sbjct: 92 LMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPSKSTSFKEVSCESQQC 151
Query: 47 TYFKCVN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
V+ + C ++ Y D S+ +G A ET+++ +FGC ++N
Sbjct: 152 RLLDTVSCSQPQKLCDFSYGYGDGSLAQGVIATETLTLNSNSGQPXSIXNIVFGCGHNNS 211
Query: 102 G-FDEDARDGALAGVLGLSRVTISFISQLGSII--KKRFSYCLVIPLPNGEYTSSYLKFG 158
G F+E+ G+ G +S SQ+ S + ++FS CLV P +S + FG
Sbjct: 212 GTFNENE-----MGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLV-PFRTDPSITSKIIFG 265
Query: 159 TDMGYRRPSTQATKFINHPN-NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
+ +T + + +Y+++L IS+ ++ F + ++ +G ID+G
Sbjct: 266 PEAEVSGSXVVSTPLVTKDDPTYYFVTLDGISVGDKLFPF---SSSSPMATKGNVFIDAG 322
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDA 277
+ T D Y +L + E + + D QLCY T P + +F+ A
Sbjct: 323 TPPTLLPRDFYNRLVQGVK---EAIPMEPVQDPDLQPQLCY-RSATLIDGPILTAHFDGA 378
Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
++++ N FI E + A+ P D + G+ Q + +DL+ +SF +C+
Sbjct: 379 DVQLKPLNTFISPKEG-VYCFAMQPIDGDTGIFGNFVQMNFLIGFDLDGKKVSFKAVDCT 437
>gi|357439021|ref|XP_003589787.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355478835|gb|AES60038.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 456
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 90/359 (25%), Positives = 142/359 (39%), Gaps = 57/359 (15%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
VR+ IG+P+ +++D+GS +++ IF+P S+SF + C C
Sbjct: 131 VRIGIGSPAIYQYMVIDSGSDIVWIQCEPCDQCYNQTDPIFNPATSASFIGVACSSNVCN 190
Query: 48 YF----KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
C +C Y + Y D S TKG A ETI++ G+ + GC + N G
Sbjct: 191 QLDDDVACRKGRCGYQVAYGDGSYTKGTLALETITI-----GRTVIQDTAIGCGHWNEGM 245
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
A +SF+ QLG+ F YCLV S + G
Sbjct: 246 FVGAAGLLGL-----GGGPMSFVGQLGAQTGGAFGYCLV---------SRAMPVG----- 286
Query: 164 RRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
I++P +FYY+SL +++ R+ F +T G GG ++D+G+ +T
Sbjct: 287 ----AMWVPLIHNPFYPSFYYVSLSGLAVGGIRVPISEQIFQLTDIGTGGVVMDTGTAIT 342
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPETFN-RFPSMAFYFEDAN 278
+ Y + F++ Q L P CY L R P+++FYF
Sbjct: 343 RLPTVAYNAFRDAFIA-----QTTNLPRAPGVSIFDTCYDLNGFVTVRVPTVSFYFSGGQ 397
Query: 279 -LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
L N I + F A AP +++IG+ QQ + D + F C
Sbjct: 398 ILTFPARNFLIPADDVGTFCFAFAPSPSGLSIIGNIQQEGIQVSIDGTNGFVGFGPNVC 456
>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 440
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 89/360 (24%), Positives = 162/360 (45%), Gaps = 37/360 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++++ IGTP V I DTGS L++ +FDP KS+SF++++C+ C
Sbjct: 92 LMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPSKSTSFKEVSCESQQC 151
Query: 47 TYFKCVN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
V+ + C ++ Y D S+ +G A ET+++ +FGC ++N
Sbjct: 152 RLLDTVSCSQPQKLCDFSYGYGDGSLAQGVIATETLTLNSNSGQPTSILNIVFGCGHNNS 211
Query: 102 G-FDEDARDGALAGVLGLSRVTISFISQLGSII--KKRFSYCLVIPLPNGEYTSSYLKFG 158
G F+E+ G+ G +S SQ+ S + ++FS CLV P +S + FG
Sbjct: 212 GTFNENE-----MGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLV-PFRTDPSITSKIIFG 265
Query: 159 TDMGYRRPSTQATKFINHPN-NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
+ +T + + +Y+++L IS+ ++ F + ++ +G ID+G
Sbjct: 266 PEAEVSGSDVVSTPLVTKDDPTYYFVTLDGISVGDKLFPF---SSSSPMATKGNVFIDAG 322
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDA 277
+ T D Y +L + E + + D QLCY T P + +F+ A
Sbjct: 323 TPPTLLPRDFYNRLVQGVK---EAIPMEPVQDPDLQPQLCY-RSATLIDGPILTAHFDGA 378
Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
++++ N FI E + A+ P D + G+ Q + +DL+ +SF +C+
Sbjct: 379 DVQLKPLNTFISPKEG-VYCFAMQPIDGDTGIFGNFVQMNFLIGFDLDGKKVSFKAVDCT 437
>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 93/365 (25%), Positives = 155/365 (42%), Gaps = 50/365 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+V +G P+ L I+DTGS +++ + DP KSS++ + C + C
Sbjct: 100 LVNFSMGQPATPQLAIMDTGSNILWVRCAPCKRCTQQNGPLLDPSKSSTYASLPCTNTMC 159
Query: 47 TYFKCVN----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
Y QC Y + YA + G A E + EG +FGCS++N
Sbjct: 160 HYAPSAYCNRLNQCGYNLSYATGLSSAGVLATEQLIFHSSDEGVNAVPSVVFGCSHENG- 218
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
D +D GV GL + SF++++GS +FSYCL + + Y + L FG
Sbjct: 219 ---DYKDRRFTGVFGLGKGITSFVTRMGS----KFSYCLG-NIADPHYGYNQLVFGEKAN 270
Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
+ ST + N YY++L+ IS+ +R++ F + E +IDSG+ LT+
Sbjct: 271 FEGYSTP----LKVVNGHYYVTLEGISVGEKRLDIDSTAFSMK-GNEKSALIDSGTALTW 325
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY--FLPETFNRFPSMAFYFE-DANL 279
+ L + + + CY + + FP + F+F A+L
Sbjct: 326 LAESAFRALDNEVRQLLDGVLMPFWRGSFA----CYKGTVSQDLIGFPVVTFHFSGGADL 381
Query: 280 RIDGENVFIIDYENHFFLLAVAPHDDLV--------ALIGSQQQRDTRFVYDLNIDLLSF 331
+D E++F Y+ +L +A ++IG Q+ YDLN + L F
Sbjct: 382 DLDTESMF---YQATPDILCIAVRQASAYGNDFKSFSVIGLMAQQYYNMAYDLNSNKLFF 438
Query: 332 VKENC 336
+ +C
Sbjct: 439 QRIDC 443
>gi|147776733|emb|CAN74676.1| hypothetical protein VITISV_038368 [Vitis vinifera]
Length = 389
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 82/280 (29%), Positives = 123/280 (43%), Gaps = 38/280 (13%)
Query: 56 CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGV 115
C Y + Y D S T+G HE + G + +FGC +N G G ++G+
Sbjct: 76 CNYAINYGDGSFTRGELGHEKLKF-----GTILVKDFIFGCGRNNKGLF-----GGVSGL 125
Query: 116 LGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQA-TKFI 174
+GL R +S ISQ I FSYCL P + + S + G YR S + K I
Sbjct: 126 MGLGRSDLSLISQTSGIFGGVFSYCL--PSTERKGSGSLILGGNSSVYRNSSPISYAKMI 183
Query: 175 NHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLH 232
+P NFY+++L ISI + P G ++DSG+V+T +Y L
Sbjct: 184 ENPQLYNFYFINLTGISIGGVALQAPS-------VGPSRILVDSGTVITRLPPTIYKALK 236
Query: 233 EKFVSYFERFQLAQLSDCPEP----IQLCYFLPETFN-RFPSMAFYFE-DANLRIDGENV 286
+F+ F F P P + C+ L P++ +FE +A L +D V
Sbjct: 237 AEFLKQFTGFP-------PAPAFSILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGV 289
Query: 287 FII---DYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYD 323
F D LA + D VA++G+ QQ++ R +YD
Sbjct: 290 FYFVKSDASQVCLALASLEYQDEVAILGNYQQKNLRVIYD 329
>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
Length = 479
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 92/375 (24%), Positives = 149/375 (39%), Gaps = 70/375 (18%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY-----------------AIFDPRKSSSFQKINCDH 43
++ + +G+P+ +++DTGS + + A+FDP SS++ NC
Sbjct: 136 VISVGLGSPAMTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFNCSA 195
Query: 44 PDCTYFKCVNE--------QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFG 95
C E +C Y +KY D S T G + + +++ G + G FG
Sbjct: 196 AACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYSSDVLTL----SGSDVVRGFQFG 251
Query: 96 CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL-VIPLPNGEYTSSY 154
CS+ G D + G++GL S +SQ + K FSYCL P +G T
Sbjct: 252 CSHAELGAGMDDK---TDGLIGLGGDAQSLVSQTAARYGKSFSYCLPATPASSGFLTLGA 308
Query: 155 LKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
G G R +T +Y+ +L+DI++ +++ P F G ++
Sbjct: 309 PASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLSPSVF------AAGSLV 362
Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-------RF 267
DSG+V+T Y L F + R+ A EP+ + L FN
Sbjct: 363 DSGTVITRLPPAAYAALSSAFRAGMTRYARA------EPLGI---LDTCFNFTGLDKVSI 413
Query: 268 PSMAFYFEDANLRIDGENVFIIDYENHFFL----LAVAP--HDDLVALIGSQQQRDTRFV 321
P++A F ++D + H + LA AP D IG+ QQR +
Sbjct: 414 PTVALVFAGGA---------VVDLDAHGIVSGGCLAFAPTRDDKAFGTIGNVQQRTFEVL 464
Query: 322 YDLNIDLLSFVKENC 336
YD+ + F C
Sbjct: 465 YDVGGGVFGFRAGAC 479
>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 101/367 (27%), Positives = 164/367 (44%), Gaps = 59/367 (16%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ + IGTP + + DTGS L++A IFDP KS+SF + C+ +C
Sbjct: 93 LMSVSIGTPPVDYIGMADTGSDLMWAQCLPCLKCYKQSRPIFDPLKSTSFSHVPCNSQNC 152
Query: 47 TYFKCVNEQ-------CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
K +++ C Y+ Y DQ+ TKG E I+ IG K++ GC
Sbjct: 153 ---KAIDDSHCGAQGVCDYSYTYGDQTYTKGDLGFEKIT-IGSSSVKSV-----IGC--- 200
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLG--SIIKKRFSYCLVIPLPNGEYTSSYLKF 157
G + G +GV+GL +S +SQ+ S I +RFSYCL L + + + F
Sbjct: 201 --GHESGGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLL---SHANGKINF 255
Query: 158 GTDMGYRRPSTQATKFIN-HPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
G + P +T I+ +P +YY++L+ ISI NER + + +G IIDS
Sbjct: 256 GQNAVVSGPGVVSTPLISKNPVTYYYVTLEAISIGNER--------HMASAKQGNVIIDS 307
Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY---FLPETFNRFPSMAFY 273
G+ L++ ++Y + S + + ++ D LC+ T + P +
Sbjct: 308 GTTLSFLPKELYDGV---VSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPIITAQ 364
Query: 274 FE-DANLRIDGENVF--IIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
F AN+ + N F + + N L +P D+ +IG+ + YDL LS
Sbjct: 365 FSGGANVNLLPVNTFQKVANNVNCLTLTPASPTDEF-GIIGNLALANFLIGYDLEAKRLS 423
Query: 331 FVKENCS 337
F C+
Sbjct: 424 FKPTVCT 430
>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 474
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 96/373 (25%), Positives = 154/373 (41%), Gaps = 67/373 (17%)
Query: 6 IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYFKC 51
+G + +++DT S L + +FDP S S+ + C+ C +
Sbjct: 124 VGLGAAEATVVVDTASELTWVQCQPCESCHDQQDPLFDPSSSPSYAAVPCNSSSCDALRV 183
Query: 52 V-----------NEQ---CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
NEQ C Y + Y D S ++G A + + + G+ G +FGC
Sbjct: 184 AMAAGTSPCADDNEQQPACSYALSYRDGSYSRGVLARDKLRLAGQD-----IEGFVFGCG 238
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSS-YLK 156
N G A G +G++GL R +S +SQ FSYC LP E SS L
Sbjct: 239 TSNQG----APFGGTSGLMGLGRSHVSLVSQTMDQFGGVFSYC----LPMRESGSSGSLV 290
Query: 157 FGTDMGYRRPSTQA--TKFINHP----NNFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
G D R ST T ++ FY+L+L I++ + + P + G
Sbjct: 291 LGDDSSAYRNSTPIVYTAMVSDSGPLQGPFYFLNLTGITVGGQEVESPWFS-------AG 343
Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPETFN-RF 267
IIDSG+++T VY + +F+S QLA+ P + C+ L +
Sbjct: 344 RVIIDSGTIITTLVPSVYNAVRAEFLS-----QLAEYPQAPAFSILDTCFNLTGLKEVQV 398
Query: 268 PSMAFYFEDA-NLRIDGENVFII---DYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYD 323
PS+ F FE + + +D + V D LA + ++IG+ QQ++ R ++D
Sbjct: 399 PSLKFVFEGSVEVEVDSKGVLYFVSSDASQVCLALASLKSEYDTSIIGNYQQKNLRVIFD 458
Query: 324 LNIDLLSFVKENC 336
+ F +E C
Sbjct: 459 TLGSQIGFAQETC 471
>gi|356558304|ref|XP_003547447.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like [Glycine max]
Length = 336
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 99/353 (28%), Positives = 159/353 (45%), Gaps = 52/353 (14%)
Query: 4 LFIGTPSKGVLLILDTGSALIYA-------IFDPRKSSSFQKINCDHPDCTYFKCVNEQC 56
L IG P L+I+DT S +++ +FDP KSS+F + C P C + C +
Sbjct: 13 LSIGQPPIPQLVIMDTSSDILWIMCNHVGLLFDPSKSSTFSPL-CKTP-CGFKGCKCDPI 70
Query: 57 VYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVL 116
+ + Y D+S T G +T+ EG + L C + N GF+ D G+
Sbjct: 71 PFNISYVDKSSTSGTFGSDTVVFETTDEGHSQIFDVLVRCGH-NIGFNTDP---GYNGIR 126
Query: 117 GLSRVTISFISQLGSIIKKRFSYCLV-IPLPNGEYTSSYLKFGTDM-GYRRPSTQATKFI 174
GL+ S +++G ++FSYC+ + P Y L G D+ GY P F
Sbjct: 127 GLNNGPNSLATKIG----QKFSYCVGNLADPYYNYNQLILCEGADLEGYSTP------FE 176
Query: 175 NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEK 234
H + FYY++LK I + +R++ P TF+I + GG I DSG+ +TY V+ L+ +
Sbjct: 177 VH-HGFYYVTLKGIIVGEKRLDIAPITFEIKGNNTGGVIRDSGTTITYLVDSVHKLLYNE 235
Query: 235 ---FVSYFERFQLAQLSDCPEPIQLCYF--LPETFNRFPSMAFYFED-ANLRIDGENVFI 288
+S+ R QLC++ + FP + F+F D A+L +D + F
Sbjct: 236 VRNLLSWSFR-------------QLCHYGIISRDLVGFPVVTFHFADGADLALDTGSFF- 281
Query: 289 IDYENHFFLLAVAPHDDLVALIGSQ-----QQRDTRFVYDLNIDLLSFVKENC 336
+ N + V+P L I Q+ YDL + + F + +C
Sbjct: 282 -NQLNSILCMTVSPASILNTTISPSVIELLAQQSYNVGYDLLTNFVYFQRIDC 333
>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
Length = 465
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 100/371 (26%), Positives = 149/371 (40%), Gaps = 59/371 (15%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
+V L IGTP+ ++++DTGS L + +FDP SSS+ + CD
Sbjct: 119 VVTLGIGTPAVQQIVLIDTGSDLSWVQCKPCGAGECYAQKDPLFDPSSSSSYASVPCDSD 178
Query: 45 DCTYFK-------CVNEQ---CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALF 94
C C + C Y ++Y +++ T G + ET++ + G A F F
Sbjct: 179 ACRKLAAGAYGHGCTSGAAALCEYGIEYGNRATTTGVYSTETLT-LKPGVVVADFG---F 234
Query: 95 GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY 154
GC + HG E G+LGL S +SQ S FSYC LP + +
Sbjct: 235 GCGDHQHGPYEK-----FDGLLGLGGAPESLVSQTSSQFGGPFSYC----LPPTSGGAGF 285
Query: 155 LKFGTDMGYRRPSTQATKFINHP-------NNFYYLSLKDISIDNERMNFPPDTFDITVS 207
L G ST A F+ P FY ++L IS+ + PP F
Sbjct: 286 LALGAPNSSSS-STAAAGFLFTPMRRIPSVPTFYVVTLTGISVGGAPLAVPPSAF----- 339
Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNR 266
G +IDSG+V+T + Y L F S ++L S+ + CY F T
Sbjct: 340 -SSGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNG-AVLDTCYDFTGHTNVT 397
Query: 267 FPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLN 325
P++A F A + + ++D A A DD + +IG+ QR +YD
Sbjct: 398 VPTIALTFSGGATIDLATPAGVLVD---GCLAFAGAGTDDTIGIIGNVNQRTFEVLYDSG 454
Query: 326 IDLLSFVKENC 336
+ F C
Sbjct: 455 KGTVGFRAGAC 465
>gi|359476204|ref|XP_002262813.2| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 460
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 98/361 (27%), Positives = 156/361 (43%), Gaps = 54/361 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+V + GTP + LILDTGS++ + FD SS++
Sbjct: 129 LVDVAFGTPXTEIXLILDTGSSITWTQCKACVNCLQDSNRYFDSSASSTY---------- 178
Query: 47 TYFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
++ C+ + Y M Y D S + G +T+++ E +F FGC +N G
Sbjct: 179 SFGSCIPSTVENNYNMTYGDDSTSVGNYGCDTMTL----EPSDVFQKFQFGCGRNNKGDF 234
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
DG +LGL + +S +SQ S K FSYCL P + S L FG +
Sbjct: 235 GSGVDG----MLGLGQGQLSTVSQTASKFNKVFSYCL----PEEDSIGSLL-FGEKATSQ 285
Query: 165 RPSTQATKFINHP-----NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
S + T +N P + +Y+++L DIS+ NER+N P F G IIDS +V
Sbjct: 286 SSSLKFTSLVNGPGTLQESGYYFVNLSDISVGNERLNIPSSVF-----ASPGTIIDSRTV 340
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQ-LSDCPEPIQLCYFLPETFN-RFPSMAFYF-ED 276
+T Y L F ++ L+ + + CY L + P + +F
Sbjct: 341 ITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGGG 400
Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
A++R++G N+ + + LA A +L +IG++QQ +YD+ + F C
Sbjct: 401 ADVRLNGTNI-VWGSDASRLCLAFAGTSELT-IIGNRQQLSLTVLYDIQGRRIGFGGNGC 458
Query: 337 S 337
S
Sbjct: 459 S 459
>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
Length = 517
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 108/380 (28%), Positives = 167/380 (43%), Gaps = 57/380 (15%)
Query: 1 MVRLFIGTPSKGVLLILDTGSAL--------------IYAIFDPRKSSSFQKINCDHPDC 46
++ +++GTP + +I+DTGS L + +FDP SSS++ + C C
Sbjct: 152 LMDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFDQVGPVFDPAASSSYRNVTCGDQRC 211
Query: 47 TYF----------KCVNEQCVYTMKYADQSVTKGFAAHE--TISVIGKGEGKAIFHGALF 94
+ + C Y Y DQS T G A E T+++ G + + +F
Sbjct: 212 GLVAPPEPPRACRRPGEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRV-DDVVF 270
Query: 95 GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY 154
GC + N G AG+LGL R +SF SQL ++ FSYCLV +G +S
Sbjct: 271 GCGHWNRGLFH-----GAAGLLGLGRGPLSFASQLRAVYGHTFSYCLVD---HGSDVASK 322
Query: 155 LKFG----TDMGYRRPSTQATKFI---NHPNNFYYLSLKDISIDNERMNFPPDTF--DIT 205
+ FG + P T F + + FYY+ LK + + E +N DT+
Sbjct: 323 VVFGEDDALALAAAHPQLNYTAFAPASSPADTFYYVKLKGVLVGGELLNISSDTWGVGEG 382
Query: 206 VSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL----- 260
G GG IIDSG+ L+YF Y + + F+ R + D P + CY +
Sbjct: 383 EGGSGGTIIDSGTTLSYFVEPAYQVIRQAFIDRMGR-SYPLIPDFPV-LSPCYNVSGVDR 440
Query: 261 PETFNRFPSMAFYFED-ANLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDT 318
PE P ++ F D A EN FI +D + L + +++IG+ QQ++
Sbjct: 441 PEV----PELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTGMSIIGNFQQQNF 496
Query: 319 RFVYDLNIDLLSFVKENCSD 338
VYDL + L F C++
Sbjct: 497 HVVYDLKNNRLGFAPRRCAE 516
>gi|218194599|gb|EEC77026.1| hypothetical protein OsI_15382 [Oryza sativa Indica Group]
Length = 409
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 96/383 (25%), Positives = 167/383 (43%), Gaps = 73/383 (19%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
+ IGTP+K + +DTGS +++ ++DP+ SS+ K++CD
Sbjct: 6 TEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKVSCD 65
Query: 43 H-----------PDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAI 88
P CT + C Y++ Y D S T G+ + + V G G+ +
Sbjct: 66 QGFCAATYGGLLPGCT----TSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRPA 121
Query: 89 FHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLP 146
FGC + G D + + AL G++G + S +SQL + +KK F++CL
Sbjct: 122 NSTVTFGCGSQQGG-DLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDTING 180
Query: 147 NGEYTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDIT 205
G + + +P + T + N P+ Y ++LK I + + P FD
Sbjct: 181 GGIFAIGNV--------VQPKVKTTPLVPNMPH--YNVNLKSIDVGGTALKLPSHMFD-- 228
Query: 206 VSGE-GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPET 263
+GE G IIDSG+ LTY VY E ++ F + + + E LC+ ++
Sbjct: 229 -TGEKKGTIIDSGTTLTYLPEIVY---KEIMLAVFAKHKDITFHNVQE--FLCFQYVGRV 282
Query: 264 FNRFPSMAFYFE-DANLRI--------DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQ 314
+ FP + F+FE D L + +G+N++ + ++N + + L+G
Sbjct: 283 DDDFPKITFHFENDLPLNVYPHDYFFENGDNLYCVGFQNGGLQ---SKDGKGMVLLGDLV 339
Query: 315 QRDTRFVYDLNIDLLSFVKENCS 337
+ VYDL ++ + + NCS
Sbjct: 340 LSNKLVVYDLENQVIGWTEYNCS 362
>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 509
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 95/378 (25%), Positives = 156/378 (41%), Gaps = 64/378 (16%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
+V + +GTP++ + ++ DTGS L + +F P SS+F + C
Sbjct: 155 VVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYKQQDPLFAPSDSSTFSAVRCGAR 214
Query: 45 DCTYFKCV-----NEQCVYTMKYADQSVTKGFAAHETISVI------GKGEGKAIFHGAL 93
+C + +++C Y + Y D+S T+G ++T+++ E G +
Sbjct: 215 ECRARQSCGGSPGDDRCPYEVVYGDKSRTQGHLGNDTLTLGTMAPANASAENDNKLPGFV 274
Query: 94 FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSS 153
FGC +N G A G+ GL R +S SQ + FSYCL +
Sbjct: 275 FGCGENNTGLFGQAD-----GLFGLGRGKVSLSSQAAGKFGEGFSYCLPS---SSSSAPG 326
Query: 154 YLKFGTDMGYRRPS-TQATKFINHPN--NFYYLSLKDISIDNE--RMNFPPDTFDITVSG 208
YL GT + P+ Q T +N +FYY+ L I + R++ P +
Sbjct: 327 YLSLGTPV--PAPAHAQFTPMLNRTTTPSFYYVKLVGIRVAGRAIRVSSPRVALPL---- 380
Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERF---QLAQLSDCPEPIQLCYFLPETFN 265
I+DSG+V+T Y L F+S ++ + +LS + CY N
Sbjct: 381 ----IVDSGTVITRLAPRAYRALRAAFLSAMGKYGYKRAPRLSI----LDTCYDFTAHAN 432
Query: 266 ---RFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDD--LVALIGSQQQRDTR 319
P++A F A + +D V + LA AP+ D ++G+ QQR
Sbjct: 433 ATVSIPAVALVFAGGATISVDFSGVLYVAKVAQ-ACLAFAPNGDGRSAGILGNTQQRTLA 491
Query: 320 FVYDLNIDLLSFVKENCS 337
VYD+ + F + CS
Sbjct: 492 VVYDVARQKIGFAAKGCS 509
>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
Length = 2819
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 97/377 (25%), Positives = 161/377 (42%), Gaps = 63/377 (16%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY----------AIFDPRKSSSFQKINCDHPDCTY--- 48
V L +G+P + V ++LDTGS L + ++F+P SSS+ I C P C
Sbjct: 1002 VSLTVGSPPQQVTMVLDTGSELSWLHCKKSPNLTSVFNPLSSSSYSPIPCSSPICRTRTR 1061
Query: 49 -----FKCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C ++ C + YAD S +G A + + G + G LFGC + G
Sbjct: 1062 DLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRI-----GSSALPGTLFGCMDS--G 1114
Query: 103 FDEDAR-DGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG--- 158
F ++ D G++G++R ++SF++QLG +FSYC+ +G +S L FG
Sbjct: 1115 FSSNSEEDAKTTGLMGMNRGSLSFVTQLG---LPKFSYCI-----SGRDSSGVLLFGDLH 1166
Query: 159 ----TDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
++ Y +T Y + L I + N+ + P F +G G ++
Sbjct: 1167 LSWLGNLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTMV 1226
Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-----PIQLCYFLPE--TFNRF 267
DSG+ T+ VY L +F+ + LA L D P + LCY +
Sbjct: 1227 DSGTQFTFLLGPVYTALRNEFLEQ-TKGVLAPLGD-PNFVFQGAMDLCYSVAAGGKLPTL 1284
Query: 268 PSMAFYFEDANLRIDGENVFIIDYE----NHFFLLAVAPHDDLVAL----IGSQQQRDTR 319
PS++ F A + + GE + E N + + DL+ + IG Q++
Sbjct: 1285 PSVSLMFRGAEMVVGGEVLLYRVPEMMKGNEWVYCLTFGNSDLLGIEAFVIGHHHQQNVW 1344
Query: 320 FVYDLNIDLLSFVKENC 336
+ DL++F + C
Sbjct: 1345 ----MEFDLVAFAADLC 1357
>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
Length = 519
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 88/359 (24%), Positives = 143/359 (39%), Gaps = 43/359 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
+V + +GTP+ ++ DTGS + +FDP +SS++ I+C P
Sbjct: 181 VVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTYANISCAAPA 240
Query: 46 CTYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C+ C C+Y ++Y D S + GF A +T+++ G FGC N G
Sbjct: 241 CSDLDTRGCSGGNCLYGVQYGDGSYSIGFFAMDTLTL----SSYDAVKGFRFGCGERNEG 296
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
+A AG+LGL R S Q F++C LP + YL FG
Sbjct: 297 LFGEA-----AGLLGLGRGKTSLPVQTYDKYGGVFAHC----LPARSSGTGYLDFGPGSP 347
Query: 163 YRRPSTQATKFI--NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
+ T + N P FYY+ + I + + ++ P F G I+DSG+V+
Sbjct: 348 AAAGARLTTPMLTDNGP-TFYYVGMTGIRVGGQLLSIPQSVFTTA-----GTIVDSGTVI 401
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE-DAN 278
T Y L F S + + + CY F + P+++ F+ A
Sbjct: 402 TRLPPAAYSSLRSAFASAMAARGYKK-APAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAR 460
Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
L +D + + L A D V ++G+ Q + YD+ ++ F C
Sbjct: 461 LDVDASGIMYAASVSQVCLGFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 519
>gi|359482097|ref|XP_002271077.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 89/379 (23%), Positives = 170/379 (44%), Gaps = 62/379 (16%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY----------AIFDPRKSSSFQKINCDHPDCT---- 47
V L +GTP + V ++LDTGS L + FDP +SSS+ + C CT
Sbjct: 87 VSLTVGTPPQNVSMVLDTGSELSWLRCNKTQTFQTTFDPNRSSSYSPVPCSSLTCTDRTR 146
Query: 48 ----YFKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC--SNDN 100
C N+ C + YAD S ++G A +T + G + G +FGC S+ +
Sbjct: 147 DFPIPASCDSNQLCHAILSYADASSSEGNLASDTFYI-----GNSDMPGTIFGCMDSSFS 201
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
+ED+++ G++G++R ++SF+SQ+ +FSYC+ + +++ L +
Sbjct: 202 TNTEEDSKN---TGLMGMNRGSLSFVSQMD---FPKFSYCIS----DSDFSGVLLLGDAN 251
Query: 161 MGYRRPSTQATKF-INHPNNF-----YYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
+ P I+ P + Y + L+ I + ++ + P F +G G ++
Sbjct: 252 FSWLMPLNYTPLIQISTPLPYFDRVAYTVQLEGIKVSSKLLPLPKSVFVPDHTGAGQTMV 311
Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-----PIQLCYFLP---ETFNR 266
DSG+ T+ VY L +F++ + Q+ ++ + P + LCY +P +
Sbjct: 312 DSGTQFTFLLGPVYSALRNEFLN--QTSQILRVLEDPNYVFQGGMDLCYRVPLSQTSLPW 369
Query: 267 FPSMAFYFEDANLRIDGENVFI-----IDYENHFFLLAVAPHDDLVA----LIGSQQQRD 317
P+++ F A +++ G+ + + + + + DL+A +IG Q++
Sbjct: 370 LPTVSLMFRGAEMKVSGDRLLYRVPGEVRGSDSVYCFTFG-NSDLLAVEAYVIGHHHQQN 428
Query: 318 TRFVYDLNIDLLSFVKENC 336
+DL + F + C
Sbjct: 429 VWMEFDLEKSRIGFAQVQC 447
>gi|357120129|ref|XP_003561782.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 452
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 95/393 (24%), Positives = 151/393 (38%), Gaps = 73/393 (18%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY---------AIFDPRKSSSFQKINCDHPDCTYFK-- 50
V + +GTP + V ++LDTGS L + A FD SSS+ + C P CT+
Sbjct: 65 VPVAVGTPPQNVTMVLDTGSELSWLLCNGSRHDAPFDASASSSYAPVPCSSPACTWLGRD 124
Query: 51 ------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
C + C ++ YAD S G A +T ++G A LFGC ++
Sbjct: 125 LPVRPFCDSSACRVSLSYADASSADGLLAADTF-LLGSSPMPA-----LFGCIT-SYSSS 177
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG--TDMG 162
D + G+LG++R +SF++Q + +RF+YC+ G+ L G T+
Sbjct: 178 TDPSETPPTGLLGMNRGGLSFVTQTAT---RRFAYCIAA----GQGPGILLLGGNDTETP 230
Query: 163 YRRPSTQATKF-----INHPNNF-----YYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
P Q + I+ P + Y + L+ I + + + P +G G
Sbjct: 231 LTSPPQQQLNYTPLVEISQPLPYFDRAAYTVQLEGIRVGSALLAIPKHLLTPDHTGAGQT 290
Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP-------IQLCYFLPETFN 265
++DSG+ T+ D Y L +F + R L+ EP C+ E
Sbjct: 291 MVDSGTRFTFLLPDAYAALKAEFANQLTRSLDGGLAPLGEPGFVFQGAFDACFRGTEARV 350
Query: 266 RFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHD----------------DLVA- 308
+ + L + G V + E LL P + D+
Sbjct: 351 SAAAAGGLLPEVGLVLRGAEVVVAGAEK---LLYRVPGERRGEGEGVWCLTFGSSDMAGV 407
Query: 309 ---LIGSQQQRDTRFVYDLNIDLLSFVKENCSD 338
+IG Q+D YDL L F C+D
Sbjct: 408 SAYVIGHHHQQDVWVEYDLRNARLGFAAARCAD 440
>gi|359476195|ref|XP_002268758.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
gi|296082174|emb|CBI21179.3| unnamed protein product [Vitis vinifera]
Length = 460
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 100/361 (27%), Positives = 146/361 (40%), Gaps = 66/361 (18%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYAIFDPRKSSSFQKINCDHPDCTYFKCVNEQ----- 55
+V + GTP + LI+DTGS + I C+ C+ C N++
Sbjct: 130 LVNVGFGTPQQKFNLIIDTGSDTTW-------------IQCN--SCSLGNCHNKKTFNPS 174
Query: 56 ---------CV------YTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
C+ YTMKY D S +KG + +++ +F FGC +
Sbjct: 175 LSSSYSNRSCIPSTDTNYTMKYEDNSYSKGVFVCDEVTL-----KPDVFPKFQFGCGDSG 229
Query: 101 HGFDEDARDGALAGVLGLSR-VTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
G G +GVLGL++ S ISQ S KK+FSYC P E+T L FG
Sbjct: 230 GG-----EFGTASGVLGLAKGEQYSLISQTASKFKKKFSYC----FPPKEHTLGSLLFGE 280
Query: 160 DMGYRRPSTQATKFINHPNNF-YYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
PS + T+ +N P+ Y++ L IS+ +R+N F G IIDSG+
Sbjct: 281 KAISASPSLKFTQLLNPPSGLGYFVELIGISVAKKRLNVSSSLF-----ASPGTIIDSGT 335
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP--IQLCYFLPETFNR---FPSMAFY 273
V+T + Y L F E +S P+ + CY L R P + +
Sbjct: 336 VITRLPTAAYEALRTAFQQ--EMLHCPSISPPPQEKLLDTCYNLKGCGGRNIKLPEIVLH 393
Query: 274 F---EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
F D +L G D A + V +IG++QQ + VYD+ L
Sbjct: 394 FVGEVDVSLHPSGILWANGDLTQACLAFARKSNPSHVTIIGNRQQVSLKVVYDIEGGRLG 453
Query: 331 F 331
F
Sbjct: 454 F 454
>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
Length = 494
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 96/379 (25%), Positives = 166/379 (43%), Gaps = 73/379 (19%)
Query: 6 IGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCDH--- 43
IGTP+K + +DTGS +++ ++DP+ SS+ K++CD
Sbjct: 95 IGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKVSCDQGFC 154
Query: 44 --------PDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGA 92
P CT + C Y++ Y D S T G+ + + V G G+ +
Sbjct: 155 AATYGGLLPGCT----TSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRPANSTV 210
Query: 93 LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEY 150
FGC + G D + + AL G++G + S +SQL + +KK F++CL G +
Sbjct: 211 TFGCGS-QQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDTINGGGIF 269
Query: 151 TSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
+ +P + T + N P+ Y ++LK I + + P FD +GE
Sbjct: 270 AIGNV--------VQPKVKTTPLVPNMPH--YNVNLKSIDVGGTALKLPSHMFD---TGE 316
Query: 210 -GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRF 267
G IIDSG+ LTY VY E ++ F + + + E LC+ ++ + F
Sbjct: 317 KKGTIIDSGTTLTYLPEIVY---KEIMLAVFAKHKDITFHNVQE--FLCFQYVGRVDDDF 371
Query: 268 PSMAFYFE-DANLRI--------DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDT 318
P + F+FE D L + +G+N++ + ++N + + L+G +
Sbjct: 372 PKITFHFENDLPLNVYPHDYFFENGDNLYCVGFQNGGLQ---SKDGKGMVLLGDLVLSNK 428
Query: 319 RFVYDLNIDLLSFVKENCS 337
VYDL ++ + + NCS
Sbjct: 429 LVVYDLENQVIGWTEYNCS 447
>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 424
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 96/358 (26%), Positives = 156/358 (43%), Gaps = 39/358 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY-------------AIFDPRKSSSFQKINCDHPDCT 47
+ + IG P LL++DTGS L + F P +SS+++ +C
Sbjct: 79 LANISIGNPPVPQLLLIDTGSDLTWIHCLPCKCYPQTIPFFHPSRSSTYRNASCVSAPHA 138
Query: 48 YFKCVNEQ----CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
+ ++ C Y ++Y D S T+G A E ++ +G +FGC DN GF
Sbjct: 139 MPQIFRDEKTGNCQYHLRYRDFSNTRGILAEEKLTFETSDDGLISKQNIVFGCGQDNSGF 198
Query: 104 DEDARDGALAGVLGLSRVTISFISQ-LGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
+ +GVLGL T S +++ GS +FSYC L N Y + L G
Sbjct: 199 TK------YSGVLGLGPGTFSIVTRNFGS----KFSYCFG-SLTNPTYPHNILILGNGAK 247
Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
T F + YYL L+ IS + ++ P TF S +GG +ID+G T
Sbjct: 248 IEGDPTPLQIF----QDRYYLDLQAISFGEKLLDIEPGTFQRYRS-QGGTVIDTGCSPTI 302
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY--FLPETFNRFPSMAFYFE-DANL 279
+ Y L E+ + + L ++ D + CY L FP + F+F A L
Sbjct: 303 LAREAYETLSEE-IDFLLGEVLRRVKDWDQYTTPCYEGNLKLDLYGFPVVTFHFAGGAEL 361
Query: 280 RIDGENVFIIDYENHFFLLAVAPHD-DLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+D E++F+ F LA+ + D +++IG+ Q++ Y+L + F + +C
Sbjct: 362 ALDVESLFVSSESGDSFCLAMTMNTFDDMSVIGAMAQQNYNVGYNLRTMKVYFQRTDC 419
>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 102/357 (28%), Positives = 144/357 (40%), Gaps = 41/357 (11%)
Query: 2 VRLFIGTPSKGVLLILDTGSAL-----------------IYAIFDPRKSSSFQKINCDHP 44
R+ +G P + + DTGS + I IFDP+ SSS+ ++CD
Sbjct: 186 ARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDPKSSSSYSPLSCDSE 245
Query: 45 DCTYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
C C C+Y ++Y D S T G A ET S GC +DN
Sbjct: 246 QCHLLDEAACDANSCIYEVEYGDGSFTVGELATETFSFRHSNS----IPNLPIGCGHDNE 301
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
G A IS SQL + FSYCLV +SS L F D
Sbjct: 302 GLFVGAAGLIGL-----GGGAISLSSQLEAT---SFSYCLV---DLDSESSSTLDFNADQ 350
Query: 162 GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
++ K P F Y+ + +S+ + + +F+I SG GG I+DSG+ +T
Sbjct: 351 PSDSLTSPLVKNDRFPT-FRYVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSGTTIT 409
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFEDAN-L 279
SDVY L + FV + A P CY L N P++AF N L
Sbjct: 410 EIPSDVYDVLRDAFVGLTKNLPPAPGV---SPFDTCYDLSSQSNVEVPTIAFILPGENSL 466
Query: 280 RIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
++ +N F LA P +++IG+ QQ+ R YDL L+ F + C
Sbjct: 467 QLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANSLVGFSTDKC 523
>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 434
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 95/358 (26%), Positives = 158/358 (44%), Gaps = 39/358 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY-------------AIFDPRKSSSFQKINCDHPDCT 47
+ + IG P LL++DTGS L + F P +SS+++ +C+
Sbjct: 89 LANISIGDPPVPQLLLIDTGSDLTWIQCLPCKCYPQTIPFFHPSRSSTYRNASCESAPHA 148
Query: 48 YFKCVNEQ----CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
+ ++ C Y ++Y D S T+G A E ++ EG +FGC DN GF
Sbjct: 149 MPQIFRDEKTGNCRYHLRYRDFSNTRGILAKEKLTFQTSDEGLISKPNIVFGCGQDNSGF 208
Query: 104 DEDARDGALAGVLGLSRVTISFISQ-LGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
+ +GVLGL T S +++ GS +FSYC L + Y ++L G
Sbjct: 209 TQ------YSGVLGLGPGTFSIVTRNFGS----KFSYCFG-SLIDPTYPHNFLILGNGAR 257
Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
T F + YYL L+ IS+ + ++ P F S +GG +ID+G T
Sbjct: 258 IEGDPTPLQIF----QDRYYLDLQAISLGEKLLDIEPGIFQRYRS-KGGTVIDTGCSPTI 312
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY--FLPETFNRFPSMAFYFE-DANL 279
+ Y L E+ + + L ++ D + CY L FP + F+F A L
Sbjct: 313 LAREAYETLSEE-IDFLLGEVLRRVKDWEQYTNHCYEGNLKLDLYGFPVVTFHFAGGAEL 371
Query: 280 RIDGENVFIIDYENHFFLLAVAPHD-DLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+D E++F+ F LA+ + D +++IG+ Q++ Y+L + F + +C
Sbjct: 372 ALDVESLFVSSESGDSFCLAMTMNTFDDMSVIGAMAQQNYNVGYNLRTMKVYFQRTDC 429
>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
Length = 503
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 82/358 (22%), Positives = 142/358 (39%), Gaps = 42/358 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
+V + +GTP+ ++ DTGS + +F P KS+++ I+C
Sbjct: 166 VVPIRLGTPAARFTVVFDTGSDTTWVQCQPCVAYCYQQKEPLFTPTKSATYANISCTSSY 225
Query: 46 CTYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C+ C C+Y ++Y D S T GF A +T+++ G FGC N G
Sbjct: 226 CSDLDTRGCSGGHCLYAVQYGDGSYTVGFYAQDTLTL-----GYDTVKDFRFGCGEKNRG 280
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
G AG++GL R S Q F+YC +P + +L FG
Sbjct: 281 L-----FGKAAGLMGLGRGKTSVPVQAYDKYSGVFAYC----IPATSSGTGFLDFGPGAP 331
Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
+ +++ FYY+ + I + ++ P F + G ++DSG+V+T
Sbjct: 332 AAANARLTPMLVDNGPTFYYVGMTGIKVGGHLLSIPATVFS-----DAGALVDSGTVITR 386
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP--ETFNRFPSMAFYFE-DANL 279
Y L F E + + + CY L + P+++ F+ A L
Sbjct: 387 LPPSAYEPLRSAFAKGMEGLGY-KTAPAFSILDTCYDLTGYQGSIALPAVSLVFQGGACL 445
Query: 280 RIDGENV-FIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+D + ++ D A D + ++G+ QQ+ +YDL ++ F C
Sbjct: 446 DVDASGILYVADVSQACLAFAANDDDTDMTIVGNTQQKTYSVLYDLGKKVVGFAPGAC 503
>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
Length = 507
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 100/376 (26%), Positives = 153/376 (40%), Gaps = 50/376 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
+ ++ +GTP+ LL LDT S L + +FDPR S+S+ ++N D PDC
Sbjct: 142 IAKIAVGTPAVEALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYGEMNYDAPDC 201
Query: 47 TYFK------CVNEQCVYTMKYAD------QSVTKGFAAHETISVIGKGEGKAIFHGALF 94
C+YT+ Y D S + G ET++ G G +A
Sbjct: 202 QALGRSGGGDAKRGTCIYTVLYGDGDGHGSTSTSVGDLVEETLTFAG-GVRQAYLS---I 257
Query: 95 GCSNDNHG-FDEDARDGALAGVLGLSRVTISFISQLGSI-IKKRFSYCLVIPLPNGEYTS 152
GC +DN G F A AG+LGLSR IS Q+ + FSYCLV + S
Sbjct: 258 GCGHDNKGLFGAPA-----AGILGLSRGQISIPHQIAFLGYNASFSYCLVDFISGPGSPS 312
Query: 153 SYLKFGTDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITV---S 207
S L FG P T + + N FYY+ L +S+ R+ + D+ + +
Sbjct: 313 STLTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTER-DLQLDPYT 371
Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-- 265
G GG I+DSG+ +T Y + F + CY +
Sbjct: 372 GHGGVILDSGTTVTRLARPAYTAFRDAFRAAATGLGQVSTGGPSGLFDTCYTVGGRAGLR 431
Query: 266 ---RFPSMAFYFEDA-NLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRF 320
+ P+++ +F L + +N I +D D V++IG+ Q+ R
Sbjct: 432 HCVKVPAVSMHFAGGVELSLQPKNYLITVDSRGTVCFAFAGTGDRSVSVIGNILQQGFRV 491
Query: 321 VYDLNIDLLSFVKENC 336
VYD+ + F +C
Sbjct: 492 VYDIGGQRVGFAPNSC 507
>gi|449457263|ref|XP_004146368.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 469
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 97/373 (26%), Positives = 168/373 (45%), Gaps = 50/373 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
+V L IG+P L+++DTGS+L++ + FDP KS SF+ + C P
Sbjct: 105 LVNLSIGSPPVTQLVVVDTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTLGCGFPGY 164
Query: 47 TY---FKCVN-EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL--------- 93
Y +KC Q Y ++Y ++G A E++ EG+ + A+
Sbjct: 165 NYINGYKCNRFNQAEYKLRYLGGDSSQGILAKESLLFETLDEGRVFQYNAISTQISKIKK 224
Query: 94 ----FGCSNDNHGFDEDARDGALAGVLGLSRVT-ISFISQLGSIIKKRFSYCLVIPLPNG 148
FGC + N + D A GV GL I+ +QLG+ +FSYC + + N
Sbjct: 225 SNITFGCGHMN---IKTNNDDAYNGVFGLGAYPHITMATQLGN----KFSYC-IGDINNP 276
Query: 149 EYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSG 208
YT ++L G ST H YY++L+ IS+ ++ + P+ F I+ G
Sbjct: 277 LYTHNHLVLGQGSYIEGDSTPLQIHFGH----YYVTLQSISVGSKTLKIDPNAFKISSDG 332
Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY--FLPETFNR 266
GG +IDSG T + + L+++ V + L ++ + LC+ +
Sbjct: 333 SGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGL-LERIPTQRKFEGLCFKGVVSRDLVG 391
Query: 267 FPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVAL--IGSQQQRDTRFVYD 323
FP++ F+F A+L ++ ++F + F L + + +L+ L IG Q++ +D
Sbjct: 392 FPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSELLNLSVIGILAQQNYNVGFD 451
Query: 324 LNIDLLSFVKENC 336
L + F + +C
Sbjct: 452 LEQMKVFFRRIDC 464
>gi|302143530|emb|CBI22091.3| unnamed protein product [Vitis vinifera]
Length = 360
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 91/302 (30%), Positives = 128/302 (42%), Gaps = 23/302 (7%)
Query: 50 KCVNEQCVYTMKYADQSVTKGFAAHETISV-----IGKGEGKAIFHGALFGCSNDNHGFD 104
K N+ C Y Y D S T G A ET +V GK E + + +FGC + N G
Sbjct: 68 KAENQTCPYYYWYGDSSNTTGDFALETFTVNLTMSSGKPELRRV-ENVMFGCGHWNRGLF 126
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY- 163
A R +SF SQL S+ FSYCLV + SS L FG D
Sbjct: 127 HGAAGLLGL-----GRGPLSFSSQLQSLYGHSFSYCLVDRNSDAN-VSSKLIFGEDKDLL 180
Query: 164 RRPSTQATKFI----NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
P T + N + FYY+ +K I + E +N P + + I G GG IIDSG+
Sbjct: 181 SHPELNFTTLVAGKENPVDTFYYVQIKSIVVGGEVVNIPEEKWQIATDGSGGTIIDSGTT 240
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-FPSMAFYFED-A 277
L+YF Y + E F++ + + + + EP CY + P F D A
Sbjct: 241 LSYFAEPAYQVIKEAFMAKVKGYPVVKDFPVLEP---CYNVTGVEQPDLPDFGIVFSDGA 297
Query: 278 NLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
EN FI I+ L + +++IG+ QQ++ +YD L F C
Sbjct: 298 VWNFPVENYFIEIEPREVVCLAILGTPPSALSIIGNYQQQNFHILYDTKKSRLGFAPTKC 357
Query: 337 SD 338
+D
Sbjct: 358 AD 359
>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
Length = 465
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 101/371 (27%), Positives = 151/371 (40%), Gaps = 66/371 (17%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
MV L GTPS +L++DTGS + + +FDP KSS++ I C
Sbjct: 126 MVTLGFGTPSVPQVLLMDTGSDVSWVQCAPCNSTECYPQKDPLFDPSKSSTYAPIACGAD 185
Query: 45 DCTYFK------CVN--EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
C C + QC Y ++Y D S T+G ++ETI+ G FH FGC
Sbjct: 186 ACNKLGDHYRNGCTSGGTQCGYRVEYGDGSSTRGVYSNETIT-FAPGITVKDFH---FGC 241
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
+D G D DG +LGL S + Q S+ FSYCL P N E + +L
Sbjct: 242 GHDQRG-PSDKFDG----LLGLGGAPESLVVQTASVYGGAFSYCL--PALNSE--AGFLA 292
Query: 157 FGTDMGYRRPS--TQATKFINHP-------NNFYYLSLKDISIDNERMNFPPDTFDITVS 207
G RPS T + F+ P Y +++ IS+ + ++ P F
Sbjct: 293 LGV-----RPSAATNTSAFVFTPMWHLPMDATSYMVNMTGISVGGKPLDIPRSAF----- 342
Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNR 266
GG +IDSG+++T Y L+ F + + D CY F +
Sbjct: 343 -RGGMLIDSGTIVTELPETAYNALNAALRKAFAAYPMVASED----FDTCYNFTGYSNVT 397
Query: 267 FPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLN 325
P +A F A + +D N ++ ++ P D + +IG+ QR +YD
Sbjct: 398 VPRVALTFSGGATIDLDVPNGILV--KDCLAFRESGP-DVGLGIIGNVNQRTLEVLYDAG 454
Query: 326 IDLLSFVKENC 336
+ F C
Sbjct: 455 HGKVGFRAGAC 465
>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
Length = 988
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 99/368 (26%), Positives = 159/368 (43%), Gaps = 58/368 (15%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDC 46
+V + GTP + LILDTGS++ + FD SS++
Sbjct: 128 LVDVAFGTPPQKFKLILDTGSSITWTQCKACVHCLKDSHRHFDSLASSTY---------- 177
Query: 47 TYFKCVNEQC--VYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
++ C+ Y M Y D+S + G +T+++ E +F FGC +N G
Sbjct: 178 SFGSCIPSTVGNTYNMTYGDKSTSVGNYGCDTMTL----EPSDVFQKFQFGCGRNNEG-- 231
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
D GA G+LGL + +S +SQ S KK FSYC LP S L FG +
Sbjct: 232 -DFGSGA-DGMLGLGQGQLSTVSQTASKFKKVFSYC----LPEENSIGSLL-FGEKATSQ 284
Query: 165 RPSTQATKFINHP-------NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
S + T +N P + +Y++ L DIS+ N+R+N P F G IIDSG
Sbjct: 285 SSSLKFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLNIPSSVF-----ASPGTIIDSG 339
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQ-LSDCPEPIQLCYFLPETFN-RFPSMAFYFE 275
+V+T Y L F ++ L+ + + CY L + P +F
Sbjct: 340 TVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKENDMLDTCYNLSGRKDVLLPEXVLHFG 399
Query: 276 D-ANLRIDGENVFIIDYENHFFLL----AVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
D A++R++G+ V + + L + + + + +IG++QQ +YD+ +
Sbjct: 400 DGADVRLNGKRVVWGNDASRLCLAFAGNSKSTMNPELTIIGNRQQVSLTVLYDIRGRRIG 459
Query: 331 FVKENCSD 338
F CS+
Sbjct: 460 FGGNGCSN 467
>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
Length = 430
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 98/363 (26%), Positives = 153/363 (42%), Gaps = 64/363 (17%)
Query: 2 VRLFIGTPSKGVLLILDTGSAL----------------IYAIFDPRKSSSFQKINCDHPD 45
V +G P I+DTGS+L I+ +F+P SS+F + +CD
Sbjct: 70 VNFSVGQPPVPQFTIMDTGSSLLWIQCHPCKHCSSNHMIHPVFNPALSSTFVECSCDDRF 129
Query: 46 CTYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C Y C + +CVY Y + +KG A E ++ + FGC ++N
Sbjct: 130 CRYAPNGHCSSNKCVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQPIAFGCGHEN-- 187
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD-- 160
+ + G+LGL S QLGS +FSYC + L N Y + L G D
Sbjct: 188 --GEQLESEFTGILGLGAKPTSLAVQLGS----KFSYC-IGDLANKNYGYNQLVLGEDAD 240
Query: 161 -MGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
+G P T+ N YY++L+ IS+ ++++N P F S G I+D+G++
Sbjct: 241 ILGDPTPIEFETE-----NGIYYMNLEGISVGDKQLNIEPVVFKRRGS-RTGVILDTGTL 294
Query: 220 LTYFHSDVYWKLHEKFVSY----FERFQLAQLSDCPEPIQLCYF--LPETFNRFPSMAFY 273
T+ Y +L+ + S ERF LCY + E FP + F+
Sbjct: 295 YTWLADIAYRELYNEIKSILDPKLERFWFRDF--------LCYHGRVNEELIGFPVVTFH 346
Query: 274 FE-DANLRIDGENVFI----IDYENHFFLLAVAP-------HDDLVALIGSQQQRDTRFV 321
F A L ++ ++F D ++ F ++V P + D A IG Q+
Sbjct: 347 FAGGAELAMEATSMFYPMTESDTYHNVFCMSVRPTTEHGGEYKDFTA-IGLMAQQYYNIA 405
Query: 322 YDL 324
YDL
Sbjct: 406 YDL 408
>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
Length = 471
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 86/368 (23%), Positives = 159/368 (43%), Gaps = 57/368 (15%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPDC 46
V+L +GTP K +ILDTGS+L + ++DP S +++K++C +C
Sbjct: 127 VKLGLGTPPKYYAMILDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKTYKKLSCASVEC 186
Query: 47 TYFKCV----------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
+ K + C+YT Y D S + G+ + + +++ +GC
Sbjct: 187 SRLKAATLNDPLCETDSNACLYTASYGDTSFSIGYLSQDLLTL----TSSQTLPQFTYGC 242
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
DN G G AG++GL+R +S ++QL + FSYC LP SS
Sbjct: 243 GQDNQGLF-----GRAAGIIGLARDKLSMLAQLSTKYGHAFSYC----LPTANSGSSGGG 293
Query: 157 FGTDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
F + S + T + N Y+L L I++ ++ + + +I
Sbjct: 294 FLSIGSISPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAMYRVPT------LI 347
Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLP-ETFNRFPSMA 271
DSG+V+T +Y L + FV + + + P + C+ ++ + P +
Sbjct: 348 DSGTVITRLPMSMYAALRQAFV----KIMSTKYAKAPAYSILDTCFKGSLKSISAVPEIK 403
Query: 272 FYFE-DANLRIDGENVFIIDYENHFFLLAVAPHD--DLVALIGSQQQRDTRFVYDLNIDL 328
F+ A+L + ++ +I+ + LA A + +A+IG++QQ+ YD++
Sbjct: 404 MIFQGGADLTLRAPSI-LIEADKGITCLAFAGSSGTNQIAIIGNRQQQTYNIAYDVSTSR 462
Query: 329 LSFVKENC 336
+ F +C
Sbjct: 463 IGFAPGSC 470
>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
Length = 626
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 97/379 (25%), Positives = 147/379 (38%), Gaps = 78/379 (20%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
RLFIGTP + LI+DTGS + Y F P SS+++ + C+ P C
Sbjct: 79 TRLFIGTPPQEFALIVDTGSTVTYVPCSSCEQCGKHQDPRFQPDLSSTYRPVKCN-PSCN 137
Query: 48 YFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
C +E QC Y +YA+ S + G A + +S + E K A+FGC N G
Sbjct: 138 ---CDDEGKQCTYERRYAEMSSSSGVIAEDVVSFGNESELKP--QRAVFGCENVETG--- 189
Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCL--------------VIPLPNGE 149
D G++GL R +S + QL +I FS C + P PN
Sbjct: 190 DLYSQRADGIMGLGRGRLSVVDQLVDKGVIGDSFSLCYGGMDVGGGAMVLGQISPPPN-- 247
Query: 150 YTSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
+ F YR P +Y + LK++ + + + P FD +
Sbjct: 248 -----MVFSHSNPYRSP-------------YYNIELKELHVAGKPLKLKPKVFD----EK 285
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPS 269
G ++DSG+ YF + L + + E L Q+ P+P
Sbjct: 286 HGTVLDSGTTYAYFPEAAFHALKDAIMK--EIRHLKQIPG-PDPNYHDICFSGAGREVSH 342
Query: 270 MAFYFEDANLRIDGENVFIIDYENHFF----------LLAVAPHDDLVALIGSQQQRDTR 319
++ F + N+ + EN+ F L +DL L+G R+T
Sbjct: 343 LSKVFPEVNMVFGSGQKLSLSPENYLFRHTKVSGAYCLGIFQNGNDLTTLLGGIVVRNTL 402
Query: 320 FVYDLNIDLLSFVKENCSD 338
YD D + F K NCS+
Sbjct: 403 VTYDRENDKIGFWKTNCSE 421
>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
gi|194692214|gb|ACF80191.1| unknown [Zea mays]
gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
Length = 441
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 88/364 (24%), Positives = 148/364 (40%), Gaps = 44/364 (12%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA-----------IFDPRKSSSFQKINCDHPDCTY-- 48
V++ +GTP++ L+ DTGS L + +F P S S+ + C C
Sbjct: 93 VKVLVGTPAQEFTLVADTGSELTWVKCAGGASPPGLVFRPEASKSWAPVPCSSDTCKLDV 152
Query: 49 -FKCVN-----EQCVYTMKYADQSVTK-GFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
F N C Y +Y + S G ++ ++ G A + GCS+ +
Sbjct: 153 PFSLANCSSSASPCSYDYRYKEGSAGALGVVGTDSATIALPGGKVAQLQDVVLGCSSTHD 212
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
G + DG VL L ISF S+ + FSYCLV L T YL FG
Sbjct: 213 GQSFKSVDG----VLSLGNAKISFASRAAARFGGSFSYCLVDHLAPRNATG-YLAFGPGQ 267
Query: 162 GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
R P+TQ F++ FY + + + + + ++ P + +D GG I+DSG+ LT
Sbjct: 268 VPRTPATQTKLFLDPAMPFYGVKVDAVHVAGQALDIPAEVWDPK---SGGVILDSGTTLT 324
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPE----PIQLCY--FLPET-FNRFPSMAFYF 274
+ Y + V+ + L+ P+ P + CY P P +A F
Sbjct: 325 VLATPAY----KAVVAALTKL----LAGVPKVDFPPFEHCYNWTAPRPGAPEIPKLAVQF 376
Query: 275 EDANLRIDGENVFIIDYENHFFLLAVAPHD-DLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
++ID + + + + V++IG+ Q++ + +DL + F+
Sbjct: 377 TGCARLEPPAKSYVIDVKPGVKCIGLQEGEWPGVSVIGNIMQQEHLWEFDLKNMEVRFMP 436
Query: 334 ENCS 337
C+
Sbjct: 437 STCT 440
>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 503
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 91/359 (25%), Positives = 145/359 (40%), Gaps = 42/359 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
+V + +GTP ++ DTGS + +FDP KSS++ ++C P
Sbjct: 164 VVPIGLGTPPSRFTVVFDTGSDTTWVQCRPCVVSCYKQKDRLFDPAKSSTYANVSCADPA 223
Query: 46 CTYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C C C+Y ++Y D S T GF A +T++V + G FGC N G
Sbjct: 224 CADLDASGCNAGHCLYGIQYGDGSYTVGFFAKDTLAVA-----QDAIKGFKFGCGEKNRG 278
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
G AG+LGL R S Q FSYC LP + YL+FG
Sbjct: 279 L-----FGQTAGLLGLGRGPTSITVQAYEKYGGSFSYC----LPASSAATGYLEFGPLSP 329
Query: 163 YRRPSTQATK--FINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
S T + FYY+ L I + +++ P+ +V G ++DSG+V+
Sbjct: 330 SSSGSNAKTTPMLTDKGPTFYYVGLTGIRVGGKQLGAIPE----SVFSNSGTLVDSGTVI 385
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE-DAN 278
T D + + + + + CY F + P+++ F+ A
Sbjct: 386 TRL-PDTAYAALSSAFAAAMAASGYKKAAAYSILDTCYDFTGLSQVSLPTVSLVFQGGAC 444
Query: 279 LRIDGEN-VFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
L +D V+ I A D+ V ++G+ QQR +YD++ ++ F C
Sbjct: 445 LDLDASGIVYAISQSQVCLGFASNGDDESVGIVGNTQQRTYGVLYDVSKKVVGFAPGAC 503
>gi|224072901|ref|XP_002303933.1| predicted protein [Populus trichocarpa]
gi|222841365|gb|EEE78912.1| predicted protein [Populus trichocarpa]
Length = 370
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 86/357 (24%), Positives = 148/357 (41%), Gaps = 42/357 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTG--SALI---------YAIFDPRKSSSFQKINCDHPDCTYF 49
+V+ +GTP + +L+ LD +A I +F+ KS++F+ + C P C
Sbjct: 36 IVKAKVGTPPQTLLMALDNSYDAAWIPCKGCVGCSSTVFNTVKSTTFKTLGCGAPQCKQV 95
Query: 50 K---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
C C + Y ++ ++ T I ++ FGC G
Sbjct: 96 PNPICGGSTCTWNTTYGSSTI----LSNLTRDTIALSMDPVPYYA--FGCIQKATGSSVP 149
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
+ G+LG R +SF+SQ ++ K FSYCL P S L+ G +G + P
Sbjct: 150 PQ-----GLLGFGRGPLSFLSQTQNLYKSTFSYCL--PSFRTLNFSGSLRLG-PVG-QPP 200
Query: 167 STQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
+ T + +P ++ YY+ L I + + ++ P + G I DSG+V T
Sbjct: 201 RIKTTPLLKNPRRSSLYYVKLNGIRVGRKIVDIPRSALAFNPTTGAGTIFDSGTVFTRLV 260
Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDGE 284
+ Y + +F ++ L CY +P P++ F F N+ + E
Sbjct: 261 APAYIAVRNEFRKRVGNATVSSLGG----FDTCYSVPIV---PPTITFMFSGMNVTMPPE 313
Query: 285 NVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
N+ I LA+A D ++ +I S QQ++ R ++D+ L +E CS
Sbjct: 314 NLLIHSTAGVTSCLAMAAAPDNVNSVLNVIASMQQQNHRILFDVPNSRLGVAREQCS 370
>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
Length = 385
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 86/360 (23%), Positives = 152/360 (42%), Gaps = 51/360 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ + +G+P+ +++DTGS + + +FDP SS++ +C DC
Sbjct: 53 LITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSADC 112
Query: 47 TYFK------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
+ QC Y + Y D S T G + +T+++ G + FGCSN
Sbjct: 113 AQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLAL-----GSSAVRSFQFGCSNVE 167
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
GF++ G++GL S +SQ + + FSYCL P P+ +S +L G
Sbjct: 168 SGFNDQTD-----GLMGLGGGAQSLVSQTAGTLGRAFSYCLP-PTPS---SSGFLTLGAA 218
Query: 161 MGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
G T + FY + L+ I + +++ P F G ++DSG+
Sbjct: 219 GGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVF------SAGTVMDSGT 272
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE-D 276
V+T Y L F + +++ AQ S + C+ F ++ PS+A F
Sbjct: 273 VITRLPPTAYSALSSAFKAGMKQYPPAQPSGI---LDTCFDFSGQSSVSIPSVALVFSGG 329
Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
A + +D + + ++ A D + +IG+ QQR +YD+ ++ F C
Sbjct: 330 AVVSLDASGIIL----SNCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 385
>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
Length = 325
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 84/348 (24%), Positives = 149/348 (42%), Gaps = 46/348 (13%)
Query: 13 VLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYFK-----CVN 53
+ L++DTGS + + ++F P S++++ + C+ C + C+N
Sbjct: 1 MFLLIDTGSDITWIQCDPCPQCYKQQDSLFQPAGSATYKPLPCNSTMCQQLQSFSHSCLN 60
Query: 54 EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALA 113
C Y + Y D+S T+G A ET+++ FGC + N G A A
Sbjct: 61 SSCNYMVSYGDKSTTRGDFALETLTLRSDDTILVSVPNFAFGCGHANKGLFNGA-----A 115
Query: 114 GVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD--MGYRRPSTQAT 171
G++GL + +I F +Q K FSYCL P + S L FG + Y T
Sbjct: 116 GLMGLGKSSIGFPAQTSVAFGKVFSYCL--PSVSSTIPSGILHFGEAAMLDYDVRFTPLV 173
Query: 172 KFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKL 231
+ P+ Y++S+ I++ +E + ++DSG+V++ F Y +L
Sbjct: 174 DSSSGPSQ-YFVSMTGINVGDELLPI-----------SATVMVDSGTVISRFEQSAYERL 221
Query: 232 HEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYF-EDANLRIDGENVFII 289
+ F Q A P C+ + + P + +F +DA LR+ ++ +
Sbjct: 222 RDAFTQILPGLQTAVSV---APFDTCFRVSTVDDINIPLITLHFRDDAELRLSPVHI-LY 277
Query: 290 DYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
++ A AP +++G+ QQ++ RFVYD+ L C+
Sbjct: 278 PVDDGVMCFAFAPSSSGRSVLGNFQQQNLRFVYDIPKSRLGISAFECN 325
>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 96/365 (26%), Positives = 152/365 (41%), Gaps = 47/365 (12%)
Query: 1 MVRLFIGTP-SKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPD 45
++ L IG P S+ V+L LDTGS +++ FD S++ + + C P
Sbjct: 93 LIHLSIGAPRSQPVVLTLDTGSDVVWTQCEPCAECFTQPLPRFDTAASNTVRSVACSDPL 152
Query: 46 C---TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVI-GKGEGKAIFHGALFGCSNDNH 101
C + C C Y Y D S++ G ++ + GKG GK FGC N
Sbjct: 153 CNAHSEHGCFLHGCTYVSGYGDGSLSFGHFLRDSFTFDDGKGGGKVTVPDIGFGCGMYNA 212
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
G G+ G R +S SQL ++FSYC E SS + G
Sbjct: 213 GRFLQTE----TGIAGFGRGPLSLPSQLKV---RQFSYCFTTRF---EAKSSPVFLGGAG 262
Query: 162 GYRRPSTQ---ATKFINH-----PNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
+ +T +T F+ N+ Y LS K +++ R+ P +I G G
Sbjct: 263 DLKAHATGPILSTPFVRSLPPGTDNSHYVLSFKGVTVGKTRLPVP----EIKADGSGATF 318
Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAF 272
IDSG+ +T F V+ +L F++ ++ + +C+ + + P + F
Sbjct: 319 IDSGTDITTFPDAVFRQLKSAFIAQ----AALPVNKTADEDDICFSWDGKKTAAMPKLVF 374
Query: 273 YFEDANLRIDGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLNIDLLSF 331
+ E A+ + EN D E+ +AV+ + LIG+ QQ++T VYDL L
Sbjct: 375 HLEGADWDLPRENYVTEDRESGQVCVAVSTSGQMDRTLIGNFQQQNTHIVYDLAAGKLLL 434
Query: 332 VKENC 336
V C
Sbjct: 435 VPAQC 439
>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
Length = 461
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 86/360 (23%), Positives = 152/360 (42%), Gaps = 51/360 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ + +G+P+ +++DTGS + + +FDP SS++ +C DC
Sbjct: 129 LITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSADC 188
Query: 47 TYFK------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
+ QC Y + Y D S T G + +T+++ G + FGCSN
Sbjct: 189 AQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLAL-----GSSAVRSFQFGCSNVE 243
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
GF++ G++GL S +SQ + + FSYCL P P+ +S +L G
Sbjct: 244 SGFNDQTD-----GLMGLGGGAQSLVSQTAGTLGRAFSYCLP-PTPS---SSGFLTLGAA 294
Query: 161 MGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
G T + FY + L+ I + +++ P F G ++DSG+
Sbjct: 295 GGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVF------SAGTVMDSGT 348
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE-D 276
V+T Y L F + +++ AQ S + C+ F ++ PS+A F
Sbjct: 349 VITRLPPTAYSALSSAFKAGMKQYPPAQPSGI---LDTCFDFSGQSSVSIPSVALVFSGG 405
Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
A + +D + + ++ A D + +IG+ QQR +YD+ ++ F C
Sbjct: 406 AVVSLDASGIIL----SNCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 461
>gi|326533786|dbj|BAK05424.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 412
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 85/323 (26%), Positives = 141/323 (43%), Gaps = 27/323 (8%)
Query: 27 IFDPRKSSSFQKINCDHPDCT--YFKCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKG 83
+F P +S +F+ + D P C Y + + C + A G+ A +T +
Sbjct: 109 LFSPAESPTFRGVRRDDPVCVPPYHRLHSTNGCSFAFPSA-----IGYLARDTFH-LRHS 162
Query: 84 EGKAI--FHGALFGCSNDNHGF-DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYC 140
E + G FGC++ GF +ED L GVL LS +SF++Q GS RFSYC
Sbjct: 163 ERSVVKSISGVAFGCAHTTTGFYNEDI----LGGVLSLSPSPLSFLTQFGSRAGGRFSYC 218
Query: 141 LVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPD 200
L P S +++FG ++ P T + + Y+LSL IS+ N+R++
Sbjct: 219 LPDPT-TSHNPSGFIQFGIEV-PSLPRHAHTTTLTVSASGYHLSLIGISLGNKRLD---- 272
Query: 201 TFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP-EPIQLCYF 259
D + GC I+ +T Y + + ++ Q+ P P+
Sbjct: 273 -IDRHILTSHGCSINPAETITKIAEPAYIIVARELMAQMNELGSKQVKGPPSSPLVFNKI 331
Query: 260 LPETFNRFPSMAFYFED-ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDT 318
R P+M F+F D ++ +F + FL V H +IG+ QQ +
Sbjct: 332 SRRVRARLPNMVFHFADGGDMWFTAGKLFQVIGTTARFL--VEGHGSHRTVIGAAQQVNA 389
Query: 319 RFVYDLNIDLLSFVKENCSDDSA 341
RF++++ L+F +E CS ++A
Sbjct: 390 RFIFNVAAGRLTFAEELCSREAA 412
>gi|115434870|ref|NP_001042193.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|55296112|dbj|BAD67831.1| putative CDR1 [Oryza sativa Japonica Group]
gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa Japonica Group]
gi|113531724|dbj|BAF04107.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|125569253|gb|EAZ10768.1| hypothetical protein OsJ_00604 [Oryza sativa Japonica Group]
Length = 454
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 89/374 (23%), Positives = 152/374 (40%), Gaps = 55/374 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY-----------------AIFDPRKSSSFQKINCDH 43
++ + +G+P + +L I DTGS L++ FDP +SS++ +++C
Sbjct: 102 LMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDPSRSSTYGRVSCQT 161
Query: 44 PDCTYFKCVN----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFH----GALFG 95
C C Y Y D S T G + ET + G G++ G FG
Sbjct: 162 DACEALGRATCDDGSNCAYLYAYGDGSNTTGVLSTETFTFDDGGSGRSPRQVRVGGVKFG 221
Query: 96 CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYTSS 153
CS G ++GL +S ++QLG + +RFSYCLV P+ SS
Sbjct: 222 CSTATAGSFPADG------LVGLGGGAVSLVTQLGGATSLGRRFSYCLV---PHSVNASS 272
Query: 154 YLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
L FG P +T + + +Y + L + + N+ + +
Sbjct: 273 ALNFGALADVTEPGAASTPLVAGDVDTYYTVVLDSVKVGNKT---------VASAASSRI 323
Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP----ETFNRFP 268
I+DSG+ LT+ + + ++ R L + +QLCY + E P
Sbjct: 324 IVDSGTTLTFLDPSLLGPIVDEL---SRRITLPPVQSPDGLLQLCYNVAGREVEAGESIP 380
Query: 269 SMAFYF-EDANLRIDGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLNI 326
+ F A + + EN F+ E L VA + V+++G+ Q++ YDL+
Sbjct: 381 DLTLEFGGGAAVALKPENAFVAVQEGTLCLAIVATTEQQPVSILGNLAQQNIHVGYDLDA 440
Query: 327 DLLSFVKENCSDDS 340
++F +C+ S
Sbjct: 441 GTVTFAGADCAGSS 454
>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 439
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 86/359 (23%), Positives = 149/359 (41%), Gaps = 43/359 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA-----------IFDPRKSSSFQKINCDHPDCTYF 49
+VR+ +GTP + + ++LDT + F P SS++ + C P CT
Sbjct: 100 VVRVKLGTPGQLMFMVLDTSRDAAWVPCADCAGCSSPTFSPNTSSTYASLQCSVPQCTQV 159
Query: 50 KCVN------EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
+ ++ C + Y S + +++ G FGC N G
Sbjct: 160 RGLSCPTTGTAACFFNQTYGGDSSFSAMLSQDSL-----GLAVDTLPSYSFGCVNAVSGS 214
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
+ G+LGL R +S +SQ GS+ FSYC P Y S L+ G +G
Sbjct: 215 TLPPQ-----GLLGLGRGPMSLLSQSGSLYSGVFSYCF--PSFKSYYFSGSLRLG-PLGQ 266
Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
+ + + T + +P+ YY++L +S+ + P+ + G IIDSG+V+T
Sbjct: 267 PK-NIRTTPLLRNPHRPTLYYVNLTGVSVGRVLVPVAPELLAFDPNTGAGTIIDSGTVIT 325
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRI 281
F VY + ++F R Q+ C F + P + F+F +L++
Sbjct: 326 RFVEPVYAAIRDEF-----RKQVKGPFATIGAFDTC-FAATNEDIAPPVTFHFTGMDLKL 379
Query: 282 DGENVFIIDYENHFFLLAVAPH----DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
EN I LA+A + ++ +I + QQ++ R ++D+ L +E C
Sbjct: 380 PLENTLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNLRIMFDVTNSRLGIARELC 438
>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
Length = 359
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 101/365 (27%), Positives = 162/365 (44%), Gaps = 45/365 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY----------------AIFDPRKSSSFQKINCDHP 44
M+ L IGTP + + ++DTGS L++ IF SSS++K+ C+
Sbjct: 6 MMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKLPCNST 65
Query: 45 DCTYFKCVN------EQCVYTMKYADQSVTKGFAAHETISVIGKGEG---KAIFHGALFG 95
C+ E C Y +Y D S T G + IS G G ++ F G LFG
Sbjct: 66 HCSGMSSAGIGPRCEETCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSFFDGFLFG 125
Query: 96 CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYL 155
C+ G D + G ++GL + + S I QLG + +FSYCLV + S+L
Sbjct: 126 CARKLKG-DWNFTQG----LIGLGQKSHSLIQQLGDKLGYKFSYCLV-SYDSPPSAKSFL 179
Query: 156 KFGTDMGYRRPSTQATKFINHPN---NFYYLSLKDISIDN-ERMNFPPDTFDITVSG--- 208
G+ R +T ++ + YY+ L+ I+I + + ++ T G
Sbjct: 180 FLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITIGGVPVVVYDKESGHNTSVGPFL 239
Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP-ETFNRF 267
+IDSG+ T VY + + S E+ L L + + LC+ +T F
Sbjct: 240 ANKTVIDSGTTYTLLTPPVYEAMRK---SIEEQVILPTLGN-SAGLDLCFNSSGDTSYGF 295
Query: 268 PSMAFYFED-ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNI 326
PS+ FYF + L + EN+F + + L + DL ++IG+ QQ++ +YDL
Sbjct: 296 PSVTFYFANQVQLVLPFENIFQVTSRDVVCLSMDSSGGDL-SIIGNMQQQNFHILYDLVA 354
Query: 327 DLLSF 331
+SF
Sbjct: 355 SQISF 359
>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 85/349 (24%), Positives = 149/349 (42%), Gaps = 31/349 (8%)
Query: 6 IGTPSKGVLLILDTGSALIYAIFDPRKSSSFQKINCDHPDC----------TYFKCVNEQ 55
+G+P + L++DTGS + S SF+ + C C + ++
Sbjct: 119 VGSPGQRFWLVVDTGSEFTWL----NCSKSFEAVTCASRKCKVDLSELFSLSVCPKPSDP 174
Query: 56 CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSN---DNHGFDEDARDGAL 112
C+Y + YAD S KGF ++I+V + + GC+ + F+E+
Sbjct: 175 CLYDISYADGSSAKGFFGTDSITVGLTNGKQGKLNNLTIGCTKSMLNGVNFNEET----- 229
Query: 113 AGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATK 172
G+LGL SFI + + +FSYCLV L + +S+ G + T+
Sbjct: 230 GGILGLGFAKDSFIDKAANKYGAKFSYCLVDHLSHRSVSSNLTIGGHHNAKLLGEIRRTE 289
Query: 173 FINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLH 232
I P FY +++ ISI + + PP +D + EGG +IDSG+ LT Y +
Sbjct: 290 LILFP-PFYGVNVVGISIGGQMLKIPPQVWDF--NAEGGTLIDSGTTLTSLLLPAYEAVF 346
Query: 233 EKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR--FPSMAFYFEDANLRIDGENVFIID 290
E + + D + ++ C F E F+ P + F+F +IID
Sbjct: 347 EALTKSLTKVKRVTGEDF-DALEFC-FDAEGFDDSVVPRLVFHFAGGARFEPPVKSYIID 404
Query: 291 YENHFFLLAVAPHDDL--VALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+ + P D + ++IG+ Q++ + +DL+ + + F C+
Sbjct: 405 VAPLVKCIGIVPIDGIGGASVIGNIMQQNHLWEFDLSTNTVGFAPSTCT 453
>gi|108707839|gb|ABF95634.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 330
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 86/281 (30%), Positives = 134/281 (47%), Gaps = 28/281 (9%)
Query: 53 NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGAL 112
N+ CVYT Y D+SVT G + + G G ++ G FGC N+G +
Sbjct: 59 NQTCVYTYYYNDKSVTTGLIEVDKFTF---GAGASV-PGVAFGCGLFNNGVFKSNE---- 110
Query: 113 AGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNG-EYTSSYLKFGTDMGYR--RPSTQ 169
G+ G R +S SQL FS+C NG + ++ L D+ Y+ R + Q
Sbjct: 111 TGIAGFGRGPLSLPSQLKV---GNFSHCFTAV--NGLKQSTVLLDLPADL-YKNGRGAVQ 164
Query: 170 ATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDV 227
+T I + N FYYLSLK I++ + R+ P F +T +G GG IIDSG+ +T V
Sbjct: 165 STPLIQNSANPTFYYLSLKGITVGSTRLPVPESAFALT-NGTGGTIIDSGTSITSLPPQV 223
Query: 228 YWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP-ETFNRFPSMAFYFEDANLRIDGEN- 285
Y + ++F + + + + P C+ P + P + +FE A + + EN
Sbjct: 224 YQVVRDEFAAQIKLPVVPGNATGP---YTCFSAPSQAKPDVPKLVLHFEGATMDLPRENY 280
Query: 286 VFII--DYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDL 324
VF + D N LA+ D+ +IG+ QQ++ +YDL
Sbjct: 281 VFEVPDDAGNSIICLAINKGDE-TTIIGNFQQQNMHVLYDL 320
>gi|9759559|dbj|BAB11161.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|21553652|gb|AAM62745.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|109134179|gb|ABG25087.1| At5g07030 [Arabidopsis thaliana]
Length = 439
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 90/358 (25%), Positives = 147/358 (41%), Gaps = 39/358 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY------------AIFDPRKSSSFQKINCDHPDCTY 48
+V+ IGTP++ +LL +DT S + + F P KS+SF+ ++C P C
Sbjct: 100 IVKALIGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSNTAFSPAKSTSFKNVSCSAPQCKQ 159
Query: 49 FK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
C C + + Y S+ + +TI + FGC N G
Sbjct: 160 VPNPTCGARACSFNLTYGSSSIAANLS-QDTIRLAADP-----IKAFTFGCVNKVAGGGT 213
Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
L G+ R +S +SQ SI K FSYCL P S L+ G +R
Sbjct: 214 IPPPQGLLGL---GRGPLSLMSQAQSIYKSTFSYCL--PSFRSLTFSGSLRLGPTSQPQR 268
Query: 166 PSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
+ T+ + +P ++ YY++L I + + ++ PP S G I DSG+V T
Sbjct: 269 --VKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTVYTRL 326
Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDG 283
VY + +F + S CY + P++ F F+ N+ +
Sbjct: 327 AKPVYEAVRNEFRKRVKPTTAVVTSL--GGFDTCY---SGQVKVPTITFMFKGVNMTMPA 381
Query: 284 ENVFI--IDYENHFFLLAVAPH--DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+N+ + +A AP + +V +I S QQ++ R + D+ L +E CS
Sbjct: 382 DNLMLHSTAGSTSCLAMAAAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGLARERCS 439
>gi|79507883|ref|NP_196320.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332003717|gb|AED91100.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 455
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 90/358 (25%), Positives = 147/358 (41%), Gaps = 39/358 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY------------AIFDPRKSSSFQKINCDHPDCTY 48
+V+ IGTP++ +LL +DT S + + F P KS+SF+ ++C P C
Sbjct: 116 IVKALIGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSNTAFSPAKSTSFKNVSCSAPQCKQ 175
Query: 49 FK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
C C + + Y S+ + +TI + FGC N G
Sbjct: 176 VPNPTCGARACSFNLTYGSSSIAANLS-QDTIRLAADP-----IKAFTFGCVNKVAGGGT 229
Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
L G+ R +S +SQ SI K FSYCL P S L+ G +R
Sbjct: 230 IPPPQGLLGL---GRGPLSLMSQAQSIYKSTFSYCL--PSFRSLTFSGSLRLGPTSQPQR 284
Query: 166 PSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
+ T+ + +P ++ YY++L I + + ++ PP S G I DSG+V T
Sbjct: 285 --VKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTVYTRL 342
Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDG 283
VY + +F + S CY + P++ F F+ N+ +
Sbjct: 343 AKPVYEAVRNEFRKRVKPTTAVVTSL--GGFDTCY---SGQVKVPTITFMFKGVNMTMPA 397
Query: 284 ENVFI--IDYENHFFLLAVAPH--DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+N+ + +A AP + +V +I S QQ++ R + D+ L +E CS
Sbjct: 398 DNLMLHSTAGSTSCLAMAAAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGLARERCS 455
>gi|125555051|gb|EAZ00657.1| hypothetical protein OsI_22678 [Oryza sativa Indica Group]
Length = 435
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 93/342 (27%), Positives = 146/342 (42%), Gaps = 52/342 (15%)
Query: 20 GSALIYAIFDPRKSSSFQKINCDHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISV 79
G A F+P +SSSF I C P+C +C C +T+++ + +V G +T+++
Sbjct: 121 GGAPCDPAFEPSRSSSFAAIPCGSPECA-VECTGASCPFTIQFGNVTVANGTLVRDTLTL 179
Query: 80 IGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQL----GSIIKK 135
A F G FGC G D D DGA+ G++ LSR + S S++ +
Sbjct: 180 ----PPSATFAGFTFGC--IEVGADADTFDGAV-GLIDLSRSSHSLASRVISNGATTSAA 232
Query: 136 RFSYCLVIPLPNGEYTSS--YLKFGT--------DMGYRRPSTQATKFINHPNNFYYLSL 185
FSYC LP+ TSS +L G D+ Y S+ NHPN+ Y++ L
Sbjct: 233 AFSYC----LPSSSATSSRGFLSIGASRPEYSGGDIKYAPMSSNP----NHPNS-YFVEL 283
Query: 186 KDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLA 245
IS+ E + PP F G ++++ + T+ Y L + F R +A
Sbjct: 284 VGISVGGEDLPVPPAVF-----AAHGTLLEAATEFTFLAPAAYAALRDAF-----RRDMA 333
Query: 246 QLSDCP--EPIQLCYFLPETFN-RFPSMAFYFEDAN-LRID-GENVFIIDYENHFFLLAV 300
P + CY L + P++A F L +D + ++ D + F +A
Sbjct: 334 PYPAAPPFRVLDTCYNLTGLASLAVPTVALRFAGGTELELDVRQMMYFADPSSVFSSVAC 393
Query: 301 ------APHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
V++IG+ QR T VYDL + F+ C
Sbjct: 394 LAFAAAPLPAFPVSVIGTLAQRSTEVVYDLRGGRVGFIPGRC 435
>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
gi|238011188|gb|ACR36629.1| unknown [Zea mays]
Length = 342
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 97/357 (27%), Positives = 147/357 (41%), Gaps = 50/357 (14%)
Query: 15 LILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYFKCVN-----EQ 55
++LDTGS +++ +FDPR+SSS+ + C C
Sbjct: 1 MVLDTGSDVVWVQCAPCRRCYEQSGPVFDPRRSSSYGAVGCGAALCRRLDSGGCDLRRGA 60
Query: 56 CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGV 115
C+Y + Y D SVT G ET++ G G + AL GC +DN G A
Sbjct: 61 CMYQVAYGDGSVTAGDFVTETLTFAG---GARVARVAL-GCGHDNEGLFVAAAGLLGL-- 114
Query: 116 LGLSRVTISFISQLGSIIKKRFSYCLV------IPLPNGEYTSSYLKFGTDMGYRRPSTQ 169
R +SF +Q+ + FSYCLV G + SS + FG S
Sbjct: 115 ---GRGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFGAGS-VGASSAS 170
Query: 170 ATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITV---SGEGGCIIDSGSVLTYFH 224
T + +P FYY+ L IS+ R+ ++ D+ + +G GG I+DSG+ +T
Sbjct: 171 FTPMVRNPRMETFYYVQLVGISVGGARVPGVAES-DLRLDPSTGRGGVIVDSGTSVTRLA 229
Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL---CYFL-PETFNRFPSMAFYFE-DANL 279
Y L + F + L P L CY L + P+++ +F A
Sbjct: 230 RASYSALRDAFRAA----AAGGLRLSPGGFSLFDTCYDLGGRRVVKVPTVSMHFAGGAEA 285
Query: 280 RIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ EN I F A A D V++IG+ QQ+ R V+D + + F + C
Sbjct: 286 ALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 342
>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 518
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 90/364 (24%), Positives = 145/364 (39%), Gaps = 53/364 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
+V + +GTP+ ++ DTGS + +FDP +SS++ ++C P
Sbjct: 180 VVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPARSSTYANVSCAAPA 239
Query: 46 C---TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C C C+Y ++Y D S + GF A +T+++ G FGC N G
Sbjct: 240 CFDLDTRGCSGGHCLYGVQYGDGSYSIGFFAMDTLTL----SSYDAVKGFRFGCGERNEG 295
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
+A AG+LGL R S Q F++C LP + YL FG
Sbjct: 296 LFGEA-----AGLLGLGRGKTSLPVQTYDKYGGVFAHC----LPARSSGTGYLDFGPGSP 346
Query: 163 YRRPSTQATKFI--NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
+ T + N P FYY+ + I + + ++ P F G I+DSG+V+
Sbjct: 347 AAAGARLTTPMLTDNGP-TFYYVGMTGIRVGGQLLSIPQSVFATA-----GTIVDSGTVI 400
Query: 221 TYFHSDVYWKLHEKFVSY-----FERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYF 274
T Y L FVS +++ L D CY F + P+++ F
Sbjct: 401 TRLPPPAYSSLRSAFVSAMAARGYKKAPAVSLLD------TCYDFTGMSQVAIPTVSLLF 454
Query: 275 E-DANLRIDGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLNIDLLSFV 332
+ A L +D + + L A D V ++G+ Q + YD+ ++ F
Sbjct: 455 QGGAILDVDASGIMYAASVSQVCLGFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFS 514
Query: 333 KENC 336
C
Sbjct: 515 PGAC 518
>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 100/368 (27%), Positives = 158/368 (42%), Gaps = 56/368 (15%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
RL+IGTP + LI+DTGS + Y FDP SS+++ I C+ DC
Sbjct: 85 TRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESSSTYKPIKCNI-DCI 143
Query: 48 YFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
C ++ QCVY +YA+ S + G + IS + E I A+FGC N G
Sbjct: 144 ---CDSDGVQCVYERQYAEMSTSSGVLGEDVISFGNQSE--LIPQRAVFGCENMETG--- 195
Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCL-VIPLPNGEYTSSYLKFGTDM- 161
D G++GL +S + QL I FS C + + G + +DM
Sbjct: 196 DLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGGGAMVLGGISPPSDMI 255
Query: 162 -GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
Y P + P +Y + LK+I + +++ FD G G ++DSG+
Sbjct: 256 FTYSDP-------VRSP--YYNVDLKEIHVAGKKLPLSSGIFD----GRYGAVLDSGTTY 302
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP--IQLCYF-----LPETFNRFPSMAFY 273
Y ++ + + + E L ++ D P+P +C+ E N+FP++
Sbjct: 303 AYLPAEAFSAFKDAIMD--EIHSLKKI-DGPDPNFKDICFSGAGSDAAELSNKFPTVDMV 359
Query: 274 FEDAN-LRIDGENVFIIDYENH--FFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
FE+ L + EN F + H + L +D L+G R+T +YD +
Sbjct: 360 FENGQKLSLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYDRANSKIG 419
Query: 331 FVKENCSD 338
F K NCS+
Sbjct: 420 FWKTNCSE 427
>gi|296082170|emb|CBI21175.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 89/362 (24%), Positives = 140/362 (38%), Gaps = 92/362 (25%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
+V + +G+P + + I DTGS L + IFDP S S+ ++CD P
Sbjct: 90 VVTVGLGSPKRDLTFIFDTGSDLTWTQCEPCVGYCYQQREHIFDPSTSLSYSNVSCDSPS 149
Query: 46 CTYFK--------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
C + C + C+Y ++Y D S + GF A E +S+ +F+ FGC
Sbjct: 150 CEKLESATGNSPGCSSSTCLYGIRYGDGSYSIGFFAREKLSLTST----DVFNNFQFGCG 205
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
+N G G AG+LGL+R +S +SQ K FSYC LP+ ++ YL F
Sbjct: 206 QNNRGLF-----GGTAGLLGLARNPLSLVSQTAQKYGKVFSYC----LPSSSSSTGYLSF 256
Query: 158 GTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
G+ G ++A KF PP +
Sbjct: 257 GSGDG----DSKAVKFTPR--------------------LPPTVYSSV------------ 280
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFE- 275
K+ + +S + R + + D CY L + + P + YF
Sbjct: 281 -----------QKVFRELMSDYPRVKGVSILD------TCYDLSKYKTVKVPKIILYFSG 323
Query: 276 DANLRIDGEN-VFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
A + + E ++++ A DD VA+IG+ QQ+ VYD + F
Sbjct: 324 GAEMDLAPEGIIYVLKVSQVCLAFAGNSDDDEVAIIGNVQQKTIHVVYDDAEGRVGFAPS 383
Query: 335 NC 336
C
Sbjct: 384 GC 385
>gi|357113696|ref|XP_003558637.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 432
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 91/372 (24%), Positives = 157/372 (42%), Gaps = 54/372 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA-------------IFDPRKSSSFQKINCDHPDCT 47
+VR +G+P++ +LL LDT + +A +F P S+S+ + C CT
Sbjct: 78 VVRAGLGSPAQPILLALDTSADATWAHCSPCGTCPSSGSLFAPANSTSYAPLPCSSTMCT 137
Query: 48 YFK---CVNE----------QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALF 94
+ C + C +T +AD S A+ GK F
Sbjct: 138 VLQGQPCPAQDPYDSSAPLPMCAFTKPFADASFQASLASDWL------HLGKDAIPNYAF 191
Query: 95 GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY 154
GC + G + G+LGL R ++ +SQ+G++ FSYCL P Y S
Sbjct: 192 GCVSAVSGPTANLPK---QGLLGLGRGPMALLSQVGNMYNGVFSYCL--PSYKSYYFSGS 246
Query: 155 LKFGTDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
L+ G R + T + +PN + YY+++ +S+ + P +F + G
Sbjct: 247 LRLGAAGQPR--GVRYTPMLKNPNRSSLYYVNVTGLSVGRAPVKVPAGSFAFDPATGAGT 304
Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCYFLPETFNRF-PS 269
++DSG+V+T + VY L E+F R +A S C+ E P+
Sbjct: 305 VVDSGTVITRWTPPVYAALREEF-----RRHVAAPSGYTSLGAFDTCFNTDEVAAGVAPA 359
Query: 270 MAFYFEDA-NLRIDGENVFIIDYENHFFLLAV--APH--DDLVALIGSQQQRDTRFVYDL 324
+ + + +L + EN I LA+ AP + +V ++ + QQ++ R V+D+
Sbjct: 360 VTVHMDGGLDLALPMENTLIHSSATPLACLAMAEAPQNVNAVVNVLANLQQQNLRVVFDV 419
Query: 325 NIDLLSFVKENC 336
+ F +E+C
Sbjct: 420 ANSRVGFARESC 431
>gi|115465373|ref|NP_001056286.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|113579837|dbj|BAF18200.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|125553268|gb|EAY98977.1| hypothetical protein OsI_20935 [Oryza sativa Indica Group]
Length = 494
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 103/394 (26%), Positives = 154/394 (39%), Gaps = 70/394 (17%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA-----------------------------IFDPRK 32
VR +GTP++ +LI DTGS L + +F P
Sbjct: 112 VRFRVGTPAQPFVLIADTGSDLTWVKCRGAASPSHATATASPAAAPSPAVAPPRVFRPGD 171
Query: 33 SSSFQKINCDHPDCTY---FKCVN-----EQCVYTMKYADQSVTKGFAAHETISVI---- 80
S ++ I C C F N C Y +Y D S +G ++ +V
Sbjct: 172 SKTWSPIPCSSETCKSTIPFSLANCSSSTAACSYDYRYNDNSAARGVVGTDSATVALSGG 231
Query: 81 ----GKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKR 136
G G+ KA G + GC+ + G +A D GVL L ISF S+ S R
Sbjct: 232 RGGGGGGDRKAKLQGVVLGCTTAHAGQGFEASD----GVLSLGYSNISFASRAASRFGGR 287
Query: 137 FSYCLVIPLPNGEYTSSYLKFG-----TDMGYRRPSTQATKFIN-HPNNFYYLSLKDISI 190
FSYCLV L T SYL FG P ++ ++ FY +++ +S+
Sbjct: 288 FSYCLVDHLAPRNAT-SYLTFGAGPDAASSSAPAPGSRTPLLLDARVRPFYAVAVDSVSV 346
Query: 191 DNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDC 250
D ++ P + +D V GG IIDSG+ LT + Y K V QLA L
Sbjct: 347 DGVALDIPAEVWD--VGSNGGTIIDSGTSLTVLATPAY-----KAVVAALSEQLAGLPRV 399
Query: 251 P-EPIQLCYFLPETFN-----RFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHD 304
+P CY + P +A F + ++ID + V
Sbjct: 400 AMDPFDYCYNWTARGDGGGDLAVPKLAVQFAGSARLEPPAKSYVIDAAPGVKCIGVQEGA 459
Query: 305 -DLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
V++IG+ Q++ + +DLN L F + +C+
Sbjct: 460 WPGVSVIGNILQQEHLWEFDLNNRWLRFRQTSCT 493
>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 100/368 (27%), Positives = 158/368 (42%), Gaps = 56/368 (15%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
RL+IGTP + LI+DTGS + Y FDP SS+++ I C+ DC
Sbjct: 85 TRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESSSTYKPIKCNI-DCI 143
Query: 48 YFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
C ++ QCVY +YA+ S + G + IS + E I A+FGC N G
Sbjct: 144 ---CDSDGVQCVYERQYAEMSTSSGVLGEDVISFGNQSE--LIPQRAVFGCENMETG--- 195
Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCL-VIPLPNGEYTSSYLKFGTDM- 161
D G++GL +S + QL I FS C + + G + +DM
Sbjct: 196 DLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGGGAMVLGGISPPSDMI 255
Query: 162 -GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
Y P + P +Y + LK+I + +++ FD G G ++DSG+
Sbjct: 256 FTYSDP-------VRSP--YYNVDLKEIHVAGKKLPLSSGIFD----GRYGAVLDSGTTY 302
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP--IQLCYF-----LPETFNRFPSMAFY 273
Y ++ + + + E L ++ D P+P +C+ E N+FP++
Sbjct: 303 AYLPAEAFSAFKDAIMD--EIHSLKKI-DGPDPNFKDICFSGAGSDAAELSNKFPTVDMV 359
Query: 274 FEDAN-LRIDGENVFIIDYENH--FFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
FE+ L + EN F + H + L +D L+G R+T +YD +
Sbjct: 360 FENGQKLSLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYDRANSKIG 419
Query: 331 FVKENCSD 338
F K NCS+
Sbjct: 420 FWKTNCSE 427
>gi|356528623|ref|XP_003532899.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 507
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 86/330 (26%), Positives = 141/330 (42%), Gaps = 29/330 (8%)
Query: 26 AIFDPRKSSSFQKINCDHPDC----------TYFKCVNEQCVYTMKYADQSVTKGFAAHE 75
+F P +S SFQ + C C + ++ C+Y + YAD S KGF +
Sbjct: 189 GVFCPHRSKSFQAVTCASQKCKIDLSQLFSLSLCPKPSDPCLYDISYADGSSAKGFFGTD 248
Query: 76 TISVIGKGEGKAIFHGALFGCSN---DNHGFDEDARDGALAGVLGLSRVTISFISQLGSI 132
TI+V K + + GC+ + F+ED G+LGL SFI +
Sbjct: 249 TITVDLKNGKEGKLNNLTIGCTKSMENGVNFNEDT-----GGILGLGFAKDSFIDKAAYE 303
Query: 133 IKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR-RPSTQATKFINHPNNFYYLSLKDISID 191
+FSYCLV L + SSYL G + + T+ I P FY +++ ISI
Sbjct: 304 YGAKFSYCLVDHLSH-RNVSSYLTIGGHHNAKLLGEIKRTELILFP-PFYGVNVVGISIG 361
Query: 192 NERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP 251
+ + PP +D + +GG +IDSG+ LT Y + E + + + D
Sbjct: 362 GQMLKIPPQVWDF--NSQGGTLIDSGTTLTALLVPAYEPVFEALIKSLTKVKRVTGEDF- 418
Query: 252 EPIQLCYFLPETFNR--FPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDL--V 307
+ C F E F+ P + F+F +IID + + P D +
Sbjct: 419 GALDFC-FDAEGFDDSVVPRLVFHFAGGARFEPPVKSYIIDVAPLVKCIGIVPIDGIGGA 477
Query: 308 ALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
++IG+ Q++ + +DL+ + + F C+
Sbjct: 478 SVIGNIMQQNHLWEFDLSTNTIGFAPSICT 507
>gi|255581545|ref|XP_002531578.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223528808|gb|EEF30814.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 82/320 (25%), Positives = 141/320 (44%), Gaps = 50/320 (15%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY----------AIFDPRKSSSFQKINCDHPDCTY--- 48
V L +G+P + V ++LDTGS L + ++F+P S ++ K+ C P C
Sbjct: 71 VSLTVGSPPQNVTMVLDTGSELSWLHCKKTQFLNSVFNPLSSKTYSKVPCLSPTCKTRTR 130
Query: 49 -----FKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C + C + YAD + +G A ET + G +FGC + G
Sbjct: 131 DLTIPVSCDATKLCHVIVSYADATSIEGNLAFETFRL-----GSLTKPATIFGCMDS--G 183
Query: 103 FDEDAR-DGALAGVLGLSRVTISFISQLGSIIKKRFSYCL-------VIPLPNGEYTSSY 154
F ++ D G++G++R ++SF++Q+G +FSYC+ V+ L N + +
Sbjct: 184 FSSNSEEDSKTTGLIGMNRGSLSFVNQMG---YPKFSYCISGFDSAGVLLLGNASF--PW 238
Query: 155 LKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
LK + Y +T Y + L+ I + N+ ++ P F +G G ++
Sbjct: 239 LK---PLSYTPLVQISTPLPYFDRVAYTVQLEGIKVKNKVLSLPKSVFVPDHTGAGQTMV 295
Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSD----CPEPIQLCYFLPET---FNRF 267
DSG+ T+ VY L +F+S R L L+D + LCY L +
Sbjct: 296 DSGTQFTFLLGPVYTALKNEFLSQ-TRGILKVLNDDNFVFQGAMDLCYLLDSSRPNLQNL 354
Query: 268 PSMAFYFEDANLRIDGENVF 287
P ++ F+ A + + GE +
Sbjct: 355 PVVSLMFQGAEMSVSGERLL 374
>gi|297810815|ref|XP_002873291.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
gi|297319128|gb|EFH49550.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
Length = 439
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 89/358 (24%), Positives = 148/358 (41%), Gaps = 39/358 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY------------AIFDPRKSSSFQKINCDHPDCTY 48
+V++ IGTP++ +LL +DT S + + F P KS+SF+ ++C P C
Sbjct: 100 IVKVLIGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSNTAFSPAKSTSFKNVSCSAPQCKQ 159
Query: 49 FK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
C C + + Y S+ + +TI + FGC N G
Sbjct: 160 VPNPACGARACSFNLTYGSSSIAANLS-QDTIRLAADP-----IKAFTFGCVNKVAGGGT 213
Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
L G+ R +S +SQ S+ K FSYCL P S L+ G +R
Sbjct: 214 IPPPQGLLGL---GRGPLSLMSQAQSVYKSTFSYCL--PSFRSLTFSGSLRLGPTSQPQR 268
Query: 166 PSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
+ T+ + +P ++ YY++L I + + ++ PP S G I DSG+V T
Sbjct: 269 --VKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTVYTRL 326
Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDG 283
VY + +F + S CY + P++ F F+ N+ +
Sbjct: 327 AKPVYEAVRNEFRKRVKPPTAVVTSL--GGFDTCY---SGQVKVPTITFMFKGVNMTMPA 381
Query: 284 ENVFIIDYENHFFLLAVAPH----DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+N+ + LA+A + +V +I S QQ++ R + D+ L +E CS
Sbjct: 382 DNLMLHSTAGSTSCLAMASAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGLARERCS 439
>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 79/328 (24%), Positives = 141/328 (42%), Gaps = 64/328 (19%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY----------AIFDPRKSSSFQKINCDHPDC----- 46
V L +G+P + + ++LDTGS L + ++F+P SS++ + C P C
Sbjct: 63 VTLAVGSPPQNISMVLDTGSELSWLHCKKSPNLGSVFNPVSSSTYSPVPCSSPICRTRTR 122
Query: 47 -----------TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFG 95
T+F C + YAD + +G AH+T + G G LFG
Sbjct: 123 DLPIPASCDPKTHF------CHVAISYADATSIEGNLAHDTFVI-----GSVTRPGTLFG 171
Query: 96 CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYL 155
C + D + D G++G++R ++SF++QLG +FSYC+ +G +S L
Sbjct: 172 CMDSGLSSDSE-EDAKSTGLMGMNRGSLSFVNQLG---FSKFSYCI-----SGSDSSGIL 222
Query: 156 KFG-------TDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSG 208
G + Y Q T Y + L+ I + ++ ++ P F +G
Sbjct: 223 LLGDASYSWLGPIQYTPLVLQTTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTG 282
Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-----PIQLCYFLPET 263
G ++DSG+ T+ VY L +F++ + + ++ D P + LCY + +
Sbjct: 283 AGQTMVDSGTQFTFLMGPVYTALKNEFIA--QTKSVLRIVDDPNFVFQGTMDLCYRVGSS 340
Query: 264 ----FNRFPSMAFYFEDANLRIDGENVF 287
F P ++ F A + + G+ +
Sbjct: 341 TRPNFTGLPVISLMFRGAEMSVSGQKLL 368
>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 531
Score = 94.0 bits (232), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 85/360 (23%), Positives = 150/360 (41%), Gaps = 51/360 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ + +G+P+ +++DTGS + + +FDP SS++ +C DC
Sbjct: 199 LITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSADC 258
Query: 47 TYFK------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
+ QC Y + Y D S T G + +T+++ G + FGCSN
Sbjct: 259 AQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLAL-----GSSAVRSFQFGCSNVE 313
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
GF++ G++GL S +SQ + + FSYC LP +S +L G
Sbjct: 314 SGFNDQTD-----GLMGLGGGAQSLVSQTAGTLGRAFSYC----LPPTPSSSGFLTLGAA 364
Query: 161 MGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
G T + FY + L+ I + +++ P F G ++DSG+
Sbjct: 365 GGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVF------SAGTVMDSGT 418
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE-D 276
V+T Y L F + +++ AQ S + C+ F ++ PS+A F
Sbjct: 419 VITRLPPTAYSALSSAFKAGMKQYPPAQPSGI---LDTCFDFSGQSSVSIPSVALVFSGG 475
Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
A + +D + + ++ A D + +IG+ QQR +YD+ ++ F C
Sbjct: 476 AVVSLDASGIIL----SNCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 531
>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 94.0 bits (232), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 90/360 (25%), Positives = 146/360 (40%), Gaps = 48/360 (13%)
Query: 5 FIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCTYFK 50
+IGTP + LI+DTGS + Y F P S ++ + C+ PDCT
Sbjct: 1 WIGTPPQEFALIVDTGSTVTYVPCNSCDQCGNHQDPKFQPDLSDTYHPVKCN-PDCT-CD 58
Query: 51 CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG--FDEDAR 108
N+QC Y +YA+ S + G + +S E K A+FGC N G F + A
Sbjct: 59 TENDQCTYERQYAEMSSSSGILGEDLVSFGNMSELKP--QRAVFGCENAETGDLFSQHAD 116
Query: 109 DGALAGVLGLSRVTISFISQL--GSIIKKRFSYCL-VIPLPNGEYTSSYLKFGTDMGYRR 165
G++GL R +S + QL +I FS C + + G + +DM +
Sbjct: 117 -----GIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLGQISPPSDMVFSH 171
Query: 166 PSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHS 225
+ +Y + L+ + + ++++ P FD G+ G I+DSG+ Y
Sbjct: 172 SDPDRSP-------YYNIELRGLHVAGKKLDINPQVFD----GKHGTILDSGTTYAYLPE 220
Query: 226 DVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF-----LPETFNRFPSMAFYFEDA-NL 279
+ + S + + D P +C+ +PE + FPS+ F++
Sbjct: 221 AAFLPFIQAITSELHGLKQIRGPD-PNYNDVCFSGAGSEIPELYKTFPSVDMVFDNGEKY 279
Query: 280 RIDGENVFIIDYENH--FFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+ EN + H + L D L+G R+T YD + F K NCS
Sbjct: 280 SLSPENYLFKHSKVHGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREHSKVGFWKTNCS 339
>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
Length = 512
Score = 94.0 bits (232), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 90/373 (24%), Positives = 157/373 (42%), Gaps = 62/373 (16%)
Query: 6 IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYFK- 50
+G +I+DT S L + +FDP S S+ + C+ C +
Sbjct: 157 VGLGGGEATVIVDTASELTWVQCAPCESCHDQQDPLFDPSSSPSYAAVPCNSSSCDALQL 216
Query: 51 -----------CVNE-----QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALF 94
C + C YT+ Y D S ++G AH+ +S+ G+ + G +F
Sbjct: 217 ATGGTSGGAAACQGQDQSAAACSYTLSYRDGSYSRGVLAHDRLSLAGE-----VIDGFVF 271
Query: 95 GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY 154
GC N G G +G++GL R +S +SQ FSYCL PL + +S
Sbjct: 272 GCGTSNQG----PPFGGTSGLMGLGRSQLSLVSQTMDQFGGVFSYCL--PLKESD-SSGS 324
Query: 155 LKFGTDMGYRRPSTQA--TKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
L G D R ST ++ P FY+++L I++ + + +
Sbjct: 325 LVIGDDSSVYRNSTPIVYASMVSDPLQGPFYFVNLTGITVGGQEVESSGFSSGGGGG--- 381
Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPETFN-RF 267
IIDSG+V+T +Y + +F+S F + A P + C+ + +
Sbjct: 382 KAIIDSGTVITSLVPSIYNAVKAEFLSQFAEYPQA-----PGFSILDTCFNMTGLREVQV 436
Query: 268 PSMAFYFEDA-NLRID-GENVFIIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFVYD 323
PS+ F+ + +D G ++ + ++ LA+AP + +IG+ QQ++ R ++D
Sbjct: 437 PSLKLVFDGGVEVEVDSGGVLYFVSSDSSQVCLAMAPLKSEYETNIIGNYQQKNLRVIFD 496
Query: 324 LNIDLLSFVKENC 336
+ + F +E C
Sbjct: 497 TSGSQVGFAQETC 509
>gi|343172996|gb|AEL99201.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 94.0 bits (232), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 90/360 (25%), Positives = 146/360 (40%), Gaps = 48/360 (13%)
Query: 5 FIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCTYFK 50
+IGTP + LI+DTGS + Y F P S ++ + C+ PDCT
Sbjct: 1 WIGTPPQEFALIVDTGSTVTYVPCNSCDQCGNHQDPKFQPDLSDTYHPVKCN-PDCT-CD 58
Query: 51 CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG--FDEDAR 108
N+QC Y +YA+ S + G + +S E K A+FGC N G F + A
Sbjct: 59 TENDQCTYERQYAEMSSSSGILGEDLVSFGNMSELKP--QRAVFGCENAETGDLFSQHAD 116
Query: 109 DGALAGVLGLSRVTISFISQL--GSIIKKRFSYCL-VIPLPNGEYTSSYLKFGTDMGYRR 165
G++GL R +S + QL +I FS C + + G + +DM +
Sbjct: 117 -----GIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLGQISPPSDMVFSH 171
Query: 166 PSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHS 225
+ +Y + L+ + + ++++ P FD G+ G I+DSG+ Y
Sbjct: 172 SDPDRSP-------YYNIELRGLHVAGKKLDINPQVFD----GKHGTILDSGTTYAYLPE 220
Query: 226 DVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF-----LPETFNRFPSMAFYFEDA-NL 279
+ + S + + D P +C+ +PE + FPS+ F++
Sbjct: 221 AAFLPFIQAITSELHGLKQIRGPD-PNYNDVCFSGAGSEIPELYKTFPSVDMVFDNGEKY 279
Query: 280 RIDGENVFIIDYENH--FFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+ EN + H + L D L+G R+T YD + F K NCS
Sbjct: 280 SLSPENYLFKHSKVHGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREHSKVGFWKTNCS 339
>gi|414587000|tpg|DAA37571.1| TPA: hypothetical protein ZEAMMB73_036171 [Zea mays]
Length = 459
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 91/384 (23%), Positives = 164/384 (42%), Gaps = 69/384 (17%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
+V+L GTP +DT S L++ +F+P+ SSS+ + C C
Sbjct: 93 LVKLGTGTPQHFFSAAIDTASDLVWMQCQPCVSCYRQLDPVFNPKLSSSYAVVPCTSDTC 152
Query: 47 TYF---KCVNE---QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
+C + C YT KY+ VTKG A + +++ G +FH +FGCS+ +
Sbjct: 153 AQLDGHRCHEDDDGACQYTYKYSGHGVTKGTLAIDKLAI-----GGDVFHAVVFGCSDSS 207
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
G +G++GL R +S +SQL RF YCL P+ TS L G
Sbjct: 208 VG----GPAAQASGLVGLGRGPLSLVSQLS---VHRFMYCLPPPM---SRTSGKLVLGAG 257
Query: 161 M-GYRRPSTQATKFINHPN---NFYYLSLKDISIDNE------RMNFPPDTFDITVSGEG 210
R S + T ++ ++YYL+L +++ ++ PP G G
Sbjct: 258 ADAVRNMSDRVTVTMSSSTRYPSYYYLNLDGLAVGDQTPGTTRNATSPPSGGAGGGGGGG 317
Query: 211 -------------GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-PIQL 256
G I+D S +++ + +Y +L + E +L + + + L
Sbjct: 318 GGGIVGAGGANAYGMIVDVASTISFLETSLYDELADDLE---EEIRLPRATPSLRLGLDL 374
Query: 257 CYFLPETFNR----FPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGS 312
C+ LPE P+++ F+ L +D + +F+ D ++ V+++G+
Sbjct: 375 CFILPEGVGMDRVYVPTVSLSFDGRWLELDRDRLFVTDGRMMCLMIG---RTSGVSILGN 431
Query: 313 QQQRDTRFVYDLNIDLLSFVKENC 336
Q ++ R +++L ++F K +C
Sbjct: 432 FQLQNMRVLFNLRRGKITFAKASC 455
>gi|125596976|gb|EAZ36756.1| hypothetical protein OsJ_21092 [Oryza sativa Japonica Group]
Length = 435
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 93/342 (27%), Positives = 146/342 (42%), Gaps = 52/342 (15%)
Query: 20 GSALIYAIFDPRKSSSFQKINCDHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISV 79
G A F+P +SSSF I C P+C +C C +T+++ + +V G +T+++
Sbjct: 121 GGAPCDPAFEPSRSSSFAAIPCGSPECA-VECTGASCPFTIQFGNVTVANGTLVRDTLTL 179
Query: 80 IGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQL----GSIIKK 135
A F G FGC G D D DGA+ G++ LSR + S S++ +
Sbjct: 180 ----PPSATFAGFTFGC--IEVGADADTFDGAV-GLIDLSRSSHSLASRVISNGATTSAA 232
Query: 136 RFSYCLVIPLPNGEYTSS--YLKFGT--------DMGYRRPSTQATKFINHPNNFYYLSL 185
FSYC LP+ TSS +L G D+ Y S+ NHPN+ Y++ L
Sbjct: 233 AFSYC----LPSSSATSSRGFLSIGASRPEYSGGDIKYAPMSSNP----NHPNS-YFVDL 283
Query: 186 KDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLA 245
IS+ E + PP F G ++++ + T+ Y L + F R +A
Sbjct: 284 VGISVGGEDLPVPPAVF-----AAHGTLLEAATEFTFLAPAAYAALRDAF-----RKDMA 333
Query: 246 QLSDCP--EPIQLCYFLPETFN-RFPSMAFYFEDAN-LRID-GENVFIIDYENHFFLLAV 300
P + CY L + P++A F L +D + ++ D + F +A
Sbjct: 334 PYPAAPPFRVLDTCYNLTGLASLAVPAVALRFAGGTELELDVRQMMYFADPSSVFSSVAC 393
Query: 301 ------APHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
V++IG+ QR T VYDL + F+ C
Sbjct: 394 LAFAAAPLPAFPVSVIGTLAQRSTEVVYDLRGGRVGFIPGRC 435
>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
Length = 460
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 86/366 (23%), Positives = 150/366 (40%), Gaps = 67/366 (18%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ + +G+P+K +++D+GS + + +FDP SS++ +C C
Sbjct: 132 LITVRLGSPAKTQTVLIDSGSDVSWVQCKPCLQCHSQVDPLFDPSLSSTYSPFSCSSAAC 191
Query: 47 TYFK------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
+ QC Y ++YAD S T G + +T+++ G FGCS+
Sbjct: 192 AQLGQDGNGCSSSSQCQYIVRYADGSSTTGTYSSDTLAL-----GSNTISNFQFGCSHVE 246
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
GF+ D D G++GL S SQ FSYCL P P+ +S +L G
Sbjct: 247 SGFN-DLTD----GLMGLGGGAPSLASQTAGTFGTAFSYCLP-PTPS---SSGFLTLGAG 297
Query: 161 MGYRRPSTQATKFINHP-------NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
+ F+ P FY + L+ I + +++ P F G +
Sbjct: 298 T---------SGFVKTPMLRSSPVPTFYGVRLEAIRVGGTQLSIPTSVF------SAGMV 342
Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQ-LCY-FLPETFNRFPSMA 271
+DSG+++T Y L F + ++++ A P I C+ F ++ R PS+A
Sbjct: 343 MDSGTIITRLPRTAYSALSSAFKAGMKQYRPAP----PRSIMDTCFDFSGQSSVRLPSVA 398
Query: 272 FYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
F A + +D + + + A D ++G+ QQR +YD+ +
Sbjct: 399 LVFSGGAVVNLDANGIIL----GNCLAFAANSDDSSPGIVGNVQQRTFEVLYDVGGGAVG 454
Query: 331 FVKENC 336
F C
Sbjct: 455 FKAGAC 460
>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
Length = 359
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 100/365 (27%), Positives = 161/365 (44%), Gaps = 45/365 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY----------------AIFDPRKSSSFQKINCDHP 44
M+ L IGTP + + ++DTGS L++ IF SSS++K+ C+
Sbjct: 6 MMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKLPCNST 65
Query: 45 DCTYFKCVN------EQCVYTMKYADQSVTKGFAAHETISVIGKGEG---KAIFHGALFG 95
C+ E C Y +Y D S T G + IS G G ++ F G LFG
Sbjct: 66 HCSGMSSAGIGPRCEETCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSFFDGFLFG 125
Query: 96 CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYL 155
C G D + G ++GL + + S I QLG + +FSYCLV + S+L
Sbjct: 126 CGRKLKG-DWNFTQG----LIGLGQKSHSLIQQLGDKLGYKFSYCLV-SYDSPPSAKSFL 179
Query: 156 KFGTDMGYRRPSTQATKFINHPN---NFYYLSLKDISIDN-ERMNFPPDTFDITVSG--- 208
G+ R +T ++ + YY+ L+ I++ + + ++ T G
Sbjct: 180 FLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITVGGVPVVVYDKESGHNTSVGPFL 239
Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP-ETFNRF 267
+IDSG+ T VY + + S E+ L L + + LC+ +T F
Sbjct: 240 ANKTVIDSGTTYTLLTPPVYEAMRK---SIEEQVILPTLGN-SAGLDLCFNSSGDTSYGF 295
Query: 268 PSMAFYFED-ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNI 326
PS+ FYF + L + EN+F + + L + DL ++IG+ QQ++ +YDL
Sbjct: 296 PSVTFYFANQVQLVLPFENIFQVTSRDVVCLSMDSSGGDL-SIIGNMQQQNFHILYDLVA 354
Query: 327 DLLSF 331
+SF
Sbjct: 355 SQISF 359
>gi|222637181|gb|EEE67313.1| hypothetical protein OsJ_24553 [Oryza sativa Japonica Group]
Length = 414
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 93/340 (27%), Positives = 142/340 (41%), Gaps = 51/340 (15%)
Query: 28 FDPRKSSSFQKINCDHPDCT-----YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGK 82
F P SS+F K+ C C Y C CVY Y T G+ A ET+ V
Sbjct: 96 FQPASSSTFSKLPCASSLCQFLTSPYLTCNATGCVYYYPYG-MGFTAGYLATETLHV--- 151
Query: 83 GEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV 142
G A F G FGCS +N G + +G++GL R +S +SQ+G RFSYCL
Sbjct: 152 --GGASFPGVAFGCSTEN-GVGNSS-----SGIVGLGRSPLSLVSQVG---VGRFSYCLR 200
Query: 143 IPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHP----NNFYYLSLKDISIDNERMNFP 198
G+ S + FG+ + + + +P +++YY++L I++ +
Sbjct: 201 SDADAGD---SPILFGS-LAKVTGGKSSPAILENPEMPSSSYYYVNLTGITVGATDLPVT 256
Query: 199 PDTFDITVSGE----GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-P 253
TF T GG I+DSG+ LTY + Y + F+S L +
Sbjct: 257 STTFGFTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFG 316
Query: 254 IQLCY-----------FLPETFNRFPSMAFYFEDANLRIDGENVFIIDYENHFF---LLA 299
LC+ +P RF A Y A R V +D + LL
Sbjct: 317 FDLCFDANAAGGGSGVPVPTLVLRFAGGAEY---AVRRRSYVGVVEVDSQGRAAVECLLV 373
Query: 300 VAPHDDL-VALIGSQQQRDTRFVYDLNIDLLSFVKENCSD 338
+ + L +++IG+ Q D +YDL+ + SF +C++
Sbjct: 374 LPASEKLSISIIGNVMQMDLHVLYDLDGGMFSFAPADCAN 413
>gi|449433371|ref|XP_004134471.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449495479|ref|XP_004159853.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 424
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 104/368 (28%), Positives = 167/368 (45%), Gaps = 50/368 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY------------------AIFDPRKSSSFQKINCD 42
++ IG PS V+ LDT + LI+ F KS +++ C
Sbjct: 76 LMSFNIGNPSSQVMGFLDTSNGLIWVQCSNCNSQCEPEKRGLTTKFLSSKSFTYEMEPCG 135
Query: 43 HPDC---TYFKCVN---EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL-FG 95
C T F+ N + C Y + Y D T G + ++ +G + G L FG
Sbjct: 136 SNFCNSLTGFQTCNSSDKWCKYRLVYGDNKATSGILSSDSFG-FDTSDGMLVDVGFLNFG 194
Query: 96 CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYL 155
CS DE + G +GL++ +S ISQLG K+FSYCLV P N TS +
Sbjct: 195 CSEAPLTGDEQS----YTGNVGLNQTPLSLISQLG---IKKFSYCLV-PFNNLGSTSK-M 245
Query: 156 KFGTDMGYRRPSTQATKF-INHPN-NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
FG+ P T + + +PN + YY+ + ISI N+ +F FD+ G I
Sbjct: 246 YFGS-----LPVTSGGQTPLLYPNSDAYYVKVLGISIGNDEPHFD-GVFDV-YEVRDGWI 298
Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP-EPIQLCYFLPET--FNRFPSM 270
ID+G + +D + L KF++ + Q D P E +LC+ L FP +
Sbjct: 299 IDTGITYSSLETDAFDSLLAKFLTLKD---FPQRKDDPKERFELCFELQNANDLESFPDV 355
Query: 271 AFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
+F+ A+L ++ E+ F+ ++ F LA+ V+++G+ Q ++ YDL ++S
Sbjct: 356 TVHFDGADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYHVGYDLEAQVIS 415
Query: 331 FVKENCSD 338
F +C+D
Sbjct: 416 FAPVDCAD 423
>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 461
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 85/360 (23%), Positives = 151/360 (41%), Gaps = 51/360 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ + +G+P+ +++DTGS + + +FDP SS++ +C C
Sbjct: 129 LITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSAAC 188
Query: 47 TYFK------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
+ QC Y + Y D S T G + +T+++ G + FGCSN
Sbjct: 189 AQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLAL-----GSSAVKSFQFGCSNVE 243
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
GF++ G++GL S +SQ + + FSYCL P P+ +S +L G
Sbjct: 244 SGFNDQTD-----GLMGLGGGAQSLVSQTAGTLGRAFSYCLP-PTPS---SSGFLTLGAA 294
Query: 161 MGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
G T + FY + L+ I + +++ P F G ++DSG+
Sbjct: 295 GGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVF------SAGTVMDSGT 348
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE-D 276
V+T Y L F + +++ AQ S + C+ F ++ PS+A F
Sbjct: 349 VITRLPPTAYSALSSAFKAGMKQYPPAQPSGI---LDTCFDFSGQSSVSIPSVALVFSGG 405
Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
A + +D + + ++ A D + +IG+ QQR +YD+ ++ F C
Sbjct: 406 AVVSLDASGIIL----SNCLAFAANSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 461
>gi|54290724|dbj|BAD62394.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 523
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 93/342 (27%), Positives = 146/342 (42%), Gaps = 52/342 (15%)
Query: 20 GSALIYAIFDPRKSSSFQKINCDHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISV 79
G A F+P +SSSF I C P+C +C C +T+++ + +V G +T+++
Sbjct: 209 GGAPCDPAFEPSRSSSFAAIPCGSPECA-VECTGASCPFTIQFGNVTVANGTLVRDTLTL 267
Query: 80 IGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQL----GSIIKK 135
A F G FGC G D D DGA+ G++ LSR + S S++ +
Sbjct: 268 ----PPSATFAGFTFGCI--EVGADADTFDGAV-GLIDLSRSSHSLASRVISNGATTSAA 320
Query: 136 RFSYCLVIPLPNGEYTSS--YLKFGT--------DMGYRRPSTQATKFINHPNNFYYLSL 185
FSYC LP+ TSS +L G D+ Y S+ NHPN+ Y++ L
Sbjct: 321 AFSYC----LPSSSATSSRGFLSIGASRPEYSGGDIKYAPMSSNP----NHPNS-YFVDL 371
Query: 186 KDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLA 245
IS+ E + PP F G ++++ + T+ Y L + F R +A
Sbjct: 372 VGISVGGEDLPVPPAVF-----AAHGTLLEAATEFTFLAPAAYAALRDAF-----RKDMA 421
Query: 246 QLSDCP--EPIQLCYFLPETFN-RFPSMAFYFEDAN-LRID-GENVFIIDYENHFFLLA- 299
P + CY L + P++A F L +D + ++ D + F +A
Sbjct: 422 PYPAAPPFRVLDTCYNLTGLASLAVPAVALRFAGGTELELDVRQMMYFADPSSVFSSVAC 481
Query: 300 -----VAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
V++IG+ QR T VYDL + F+ C
Sbjct: 482 LAFAAAPLPAFPVSVIGTLAQRSTEVVYDLRGGRVGFIPGRC 523
>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
Length = 524
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 97/372 (26%), Positives = 152/372 (40%), Gaps = 72/372 (19%)
Query: 13 VLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYF--------- 49
+ +I+DTGS L + +FDP S+S+ + C+ C
Sbjct: 176 LTVIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPG 235
Query: 50 KCV----------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
C +E+C Y++ Y D S ++G A +T+++ G A G +FGC
Sbjct: 236 SCATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVAL-----GGASVDGFVFGCGLS 290
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
N G G AG++GL R +S +SQ FSYCL P + L G
Sbjct: 291 NRGLF-----GGTAGLMGLGRTELSLVSQTAPRFGGVFSYCL--PAATSGDAAGSLSLGG 343
Query: 160 DMGYRRPSTQA--TKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
D R +T T+ I P FY++++ S+ + G ++D
Sbjct: 344 DTSSYRNATPVSYTRMIADPAQPPFYFMNVTGASVGGAAVA-------AAGLGAANVLLD 396
Query: 216 SGSVLTYFHSDVYWKLHEKFVSYF--ERFQLAQLSDCPEPIQL---CYFLPETFN-RFPS 269
SG+V+T VY + +F F ER+ A P L CY L + P
Sbjct: 397 SGTVITRLAPSVYRAVRAEFARQFGAERYPAA------PPFSLLDACYNLTGHDEVKVPL 450
Query: 270 MAFYFED-ANLRIDGENV-FIIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFVYDLN 325
+ E A++ +D + F+ + LA+A +D +IG+ QQ++ R VYD
Sbjct: 451 LTLRLEGGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTV 510
Query: 326 IDLLSFVKENCS 337
L F E+CS
Sbjct: 511 GSRLGFADEDCS 522
>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 418
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 100/360 (27%), Positives = 156/360 (43%), Gaps = 53/360 (14%)
Query: 5 FIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYF- 49
IGTP L I DTGS L +A IF+P KS+SF + C+ C
Sbjct: 85 IIGTPPVDYLGIADTGSDLTWAQCLPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTCHAVD 144
Query: 50 --KC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
C V C Y+ Y D++ +KG E I+ IG K++ GC + + G
Sbjct: 145 DGHCGVQGVCDYSYTYGDRTYSKGDLGFEKIT-IGSSSVKSV-----IGCGHASSG---- 194
Query: 107 ARDGALAGVLGLSRVTISFISQLG--SIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
G +GV+GL +S +SQ+ S I +RFSYCL L + + + FG +
Sbjct: 195 -GFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLL---SHANGKINFGQNAVVS 250
Query: 165 RPSTQATKFINHPN-NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
P +T I+ +YY++L+ ISI NER + + +G IIDSG+ L++
Sbjct: 251 GPGVVSTPLISKNTVTYYYITLEAISIGNER--------HMAFAKQGNVIIDSGTTLSFL 302
Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY---FLPETFNRFPSMAFYFE-DANL 279
++Y + S + + ++ D LC+ T + P + F AN+
Sbjct: 303 PKELYDGV---VSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPIITAQFSGGANV 359
Query: 280 RIDGENVF--IIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+ N F + + N L +P D+ +IG+ + YDL LSF C+
Sbjct: 360 NLLPVNTFQKVANNVNCLTLTPASPTDEF-GIIGNLALANFLIGYDLEAKRLSFKPTVCT 418
>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
Length = 525
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 97/372 (26%), Positives = 152/372 (40%), Gaps = 72/372 (19%)
Query: 13 VLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYF--------- 49
+ +I+DTGS L + +FDP S+S+ + C+ C
Sbjct: 177 LTVIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPG 236
Query: 50 KCV----------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
C +E+C Y++ Y D S ++G A +T+++ G A G +FGC
Sbjct: 237 SCATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVAL-----GGASVDGFVFGCGLS 291
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
N G G AG++GL R +S +SQ FSYCL P + L G
Sbjct: 292 NRGLF-----GGTAGLMGLGRTELSLVSQTAPRFGGVFSYCL--PAATSGDAAGSLSLGG 344
Query: 160 DMGYRRPSTQA--TKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
D R +T T+ I P FY++++ S+ + G ++D
Sbjct: 345 DTSSYRNATPVSYTRMIADPAQPPFYFMNVTGASVGGAAVA-------AAGLGAANVLLD 397
Query: 216 SGSVLTYFHSDVYWKLHEKFVSYF--ERFQLAQLSDCPEPIQL---CYFLPETFN-RFPS 269
SG+V+T VY + +F F ER+ A P L CY L + P
Sbjct: 398 SGTVITRLAPSVYRAVRAEFARQFGAERYPAA------PPFSLLDACYNLTGHDEVKVPL 451
Query: 270 MAFYFED-ANLRIDGENV-FIIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFVYDLN 325
+ E A++ +D + F+ + LA+A +D +IG+ QQ++ R VYD
Sbjct: 452 LTLRLEGGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTV 511
Query: 326 IDLLSFVKENCS 337
L F E+CS
Sbjct: 512 GSRLGFADEDCS 523
>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 412
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 92/349 (26%), Positives = 149/349 (42%), Gaps = 51/349 (14%)
Query: 6 IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYFKC 51
IGTP + ++DTG+ I+ +F P KSS+++ I C P C
Sbjct: 96 IGTPPFQLYSLIDTGNDNIWFQCKPCKPCLNQTSPMFHPSKSSTYKTIPCTSPIC----- 150
Query: 52 VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGA 111
K AD + +T+++ F + GC + N G +G
Sbjct: 151 ---------KNADGH----YLGVDTLTLNSNNGTPISFKNIVIGCGHRNQG----PLEGY 193
Query: 112 LAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQAT 171
++G +GL+R +SFISQL S I +FSYCLV PL + E SS L FG T +T
Sbjct: 194 VSGNIGLARGPLSFISQLNSSIGGKFSYCLV-PLFSKENVSSKLHFGDKSTVSGLGTVST 252
Query: 172 KFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKL 231
N Y++SL+ S+ + + G IIDSG+ +T DVY +L
Sbjct: 253 PI--KEENGYFVSLEAFSVGDHIIKLE------NSDNRGNSIIDSGTTMTILPKDVYSRL 304
Query: 232 HEKFVSYFERFQLAQLSDCPEPIQLCYFLPET--FNRFPSMAFYFEDANLRIDGENVFI- 288
+ + +L ++ D + LCY T + + +F + + ++ N F
Sbjct: 305 ESVVL---DMVKLKRVKDPSQQFNLCYQTTSTTLLTKVLIITAHFSGSEVHLNALNTFYP 361
Query: 289 IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
I E F + +A+ G+ Q++ +DLN +SF +C+
Sbjct: 362 ITDEVICFAFVSGGNFSSLAIFGNVVQQNFLVGFDLNKKTISFKPTDCT 410
>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 519
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 85/363 (23%), Positives = 141/363 (38%), Gaps = 51/363 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
+V + +GTP+ ++ DTGS + +FDP +SS++ ++C P
Sbjct: 181 VVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTYANVSCAAPA 240
Query: 46 CTYFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C+ C C+Y ++Y D S + GF A +T+++ G FGC N G
Sbjct: 241 CSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTL----SSYDAVKGFRFGCGERNEG 296
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT-DM 161
+A AG+LGL R S Q F++C LP + YL FG +
Sbjct: 297 LFGEA-----AGLLGLGRGKTSLPVQTYDKYGGVFAHC----LPARSTGTGYLDFGAGSL 347
Query: 162 GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
R + FYY+ + I + + ++ P F G I+DSG+V+T
Sbjct: 348 AAARARLTTPMLTENGPTFYYVGMTGIRVGGQLLSIPQSVFATA-----GTIVDSGTVIT 402
Query: 222 YFHSDVYWKLH-----EKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE 275
Y L +++ L D CY F + P+++ F+
Sbjct: 403 RLPPAAYSSLRYAFAAAMAARGYKKAPAVSLLD------TCYDFTGMSQVAIPTVSLLFQ 456
Query: 276 -DANLRIDGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLNIDLLSFVK 333
A L +D + + L A D V ++G+ Q + YD+ ++ F
Sbjct: 457 GGARLDVDASGIMYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYP 516
Query: 334 ENC 336
C
Sbjct: 517 GAC 519
>gi|242051593|ref|XP_002454942.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
gi|241926917|gb|EES00062.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
Length = 431
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 94/369 (25%), Positives = 153/369 (41%), Gaps = 63/369 (17%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
V + IGTP + LI DT S L + +FDP KSSSF + C CT
Sbjct: 93 VTIGIGTPPQLHTLIADTASDLTWTQCNLFNDTAKQVEPLFDPAKSSSFAFVTCSSKLCT 152
Query: 48 -----YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
+C N+ C Y Y G A+E+ ++ + + G FGC G
Sbjct: 153 EDNPGTKRCSNKTCRYVYPYVSVEAA-GVLAYESFTLSDNNQHICMSFG--FGC-----G 204
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG--TD 160
D +G+LG+S +S +SQL +FSYCL P + SS L FG D
Sbjct: 205 ALTDGNLLGASGILGMSPAILSMVSQLA---IPKFSYCLT---PYTDRKSSPLFFGAWAD 258
Query: 161 MGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
+G + + K + +YY+ L +S+ R++ P TF + +GG ++D G +
Sbjct: 259 LGRYKTTGPIQKSLTF---YYYVPLVGLSLGTRRLDVPAATFALK---QGGTVVDLGCTV 312
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN----RFPSMAFYFED 276
+ L E + L + + ++C+ LP + P + YF
Sbjct: 313 GQLAEPAFTALKE---AVLHTLNLPLTNRTVKDYKVCFALPSGVAMGAVQTPPLVLYF-- 367
Query: 277 ANLRIDGENVFIIDYENHF-------FLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
DG ++ +N+F LA+ P + ++IG+ QQ++ ++D++
Sbjct: 368 -----DGGADMVLPRDNYFQEPTAGLMCLALVPGGGM-SIIGNVQQQNFHLLFDVHDSKF 421
Query: 330 SFVKENCSD 338
F C D
Sbjct: 422 LFAPTICDD 430
>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 458
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 81/362 (22%), Positives = 152/362 (41%), Gaps = 51/362 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
+ R+ +GTP+ ++++DTGS+L + +F+P+ SS++ + C
Sbjct: 123 VTRMGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCSAQQ 182
Query: 46 CTYF--------KCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
C+ C + C+Y Y D S + G+ + +T+S G +GC
Sbjct: 183 CSDLPSATLNPSACSSSNVCIYQASYGDSSFSVGYLSKDTVSF-----GSTSLPNFYYGC 237
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
DN G G AG++GL+R +S + QL + F+YC LP+ +
Sbjct: 238 GQDNEGLF-----GRSAGLIGLARNKLSLLYQLAPSLGYSFTYC----LPSSSSSGYLSL 288
Query: 157 FGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
+ G + + ++ ++ Y++ L +++ P + + IIDS
Sbjct: 289 GSYNPGQYSYTPMVSSSLD--DSLYFIKLSGMTVAGN-----PLSVSSSAYSSLPTIIDS 341
Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE- 275
G+V+T + VY L + + + A + C+ + P++ F
Sbjct: 342 GTVITRLPTSVYSALSKAVAAAMKGTSRASAYSI---LDTCFKGQASRVSAPAVTMSFAG 398
Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
A L++ +N ++D ++ LA AP A+IG+ QQ+ VYD+ + F
Sbjct: 399 GAALKLSAQN-LLVDVDDSTTCLAFAPARS-AAIIGNTQQQTFSVVYDVKSSRIGFAAGG 456
Query: 336 CS 337
CS
Sbjct: 457 CS 458
>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
Length = 493
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 103/372 (27%), Positives = 161/372 (43%), Gaps = 49/372 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
M ++ +GTP+ LL +DTGS + + +FDPR S+S++++ D PDC
Sbjct: 135 MAKIAVGTPAVEALLAMDTGSDITWLQCQPCRRCYPQSGPVFDPRHSTSYREMGYDAPDC 194
Query: 47 TYFK------CVNEQCVYTMKYADQ-SVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
CVY + Y D S T G ET++ G G + H ++ GC +D
Sbjct: 195 QALGRSGGGDAKRMTCVYAVGYGDDGSTTVGDFIEETLTFAG---GVQVPHMSI-GCGHD 250
Query: 100 NHG-FDEDARDGALAGVLGLSRVTISFISQLGSII--KKRFSYCLV---IPLPNGEYTSS 153
N G F A AG+LGL R IS SQ+ ++ FSYCL + P G SS
Sbjct: 251 NKGLFAAPA-----AGILGLGRGQISCPSQIAALGYNVTSFSYCLADFFLSSP-GRSVSS 304
Query: 154 YLKFGTDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITV---SG 208
L G P T + + N FYY+ L +S+ R+ + D+ + +G
Sbjct: 305 TLTIGDGAAAGSPPPSFTPTVQNLNMATFYYVRLVGVSVGGVRVPGVTED-DLKLDPYTG 363
Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLS-DCPEP-IQLCYFLPETFNR 266
GG I+DSG+ +T Y + + L Q+S P CY + +
Sbjct: 364 RGGVILDSGTAVTRLARRAY--IAFRDAFRAAAVDLGQVSIGGPSGFFDTCYTMGGRAMK 421
Query: 267 FPSMAFYFEDA-NLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDL 324
P+++ +F L + +N I +D D V++IG+ QQ+ R VY++
Sbjct: 422 VPTVSMHFAGGVELTLPPKNYLIPVDSMGTVCFAFAGTGDRSVSIIGNIQQQGFRVVYNI 481
Query: 325 NIDLLSFVKENC 336
+ F +C
Sbjct: 482 GGGRVGFAPNSC 493
>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
Length = 488
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 93/366 (25%), Positives = 147/366 (40%), Gaps = 46/366 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+ L +GTP+ +++ LDTGS + +FDP SS++ + C +C
Sbjct: 140 VASLRLGTPATELVVELDTGSDQSWVQCKPCADCYEQRDPVFDPTASSTYSAVPCGAREC 199
Query: 47 TYFKCVN----------EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKA--IFHGALF 94
+ + C Y + Y D S T G A +T+++ G +F
Sbjct: 200 QELASSSSSRNCSSDNNKNCPYEVSYDDDSHTVGDLARDTLTLSPSPSPSPADTVPGFVF 259
Query: 95 GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY 154
GC + N G G + G+LGL S SQ+ + FSYC LP+ + Y
Sbjct: 260 GCGHSNAG-----TFGEVDGLLGLGLGKASLPSQVAARYGAAFSYC----LPSSPSAAGY 310
Query: 155 LKFGTDMGYRRPSTQATKFINHPN-NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
L FG R + Q T+ + + YYL+L I + + P F G I
Sbjct: 311 LSFGGAAA--RANAQFTEMVTGQDPTSYYLNLTGIVVAGRAIKVPASAFATAA----GTI 364
Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAF 272
IDSG+ + Y L F S R++ + P CY F R P++
Sbjct: 365 IDSGTAFSRLPPSAYAALRSSFRSAMGRYRYKRAPSSPI-FDTCYDFTGHETVRIPAVEL 423
Query: 273 YFED-ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSF 331
F D A + + V + LA P+ DL ++G+ QQR +YD+ + F
Sbjct: 424 VFADGATVHLHPSGVLYTWNDVAQTCLAFVPNHDL-GILGNTQQRTLAVIYDVGSQRIGF 482
Query: 332 VKENCS 337
++ C+
Sbjct: 483 GRKGCA 488
>gi|115452187|ref|NP_001049694.1| Os03g0271900 [Oryza sativa Japonica Group]
gi|29893618|gb|AAP06872.1| hypothetical protein [Oryza sativa Japonica Group]
gi|108707424|gb|ABF95219.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|108707425|gb|ABF95220.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548165|dbj|BAF11608.1| Os03g0271900 [Oryza sativa Japonica Group]
gi|215715205|dbj|BAG94956.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737033|dbj|BAG95962.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215740994|dbj|BAG97489.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 447
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 98/395 (24%), Positives = 157/395 (39%), Gaps = 75/395 (18%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI------------FDPRKSSSFQKINCDHPDCTYF 49
V + +GTP + V ++LDTGS L + + F+ SSS+ + C C +
Sbjct: 57 VPVAVGTPPQNVTMVLDTGSELSWLLCNGSYAPPLTPAFNASGSSSYGAVPCPSTACEWR 116
Query: 50 K--------C---VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC-- 96
C + C ++ YAD S G A +T + G A+ GA FGC
Sbjct: 117 GRDLPVPPFCDTPPSNACRVSLSYADASSADGVLATDTFLLTGGAPPVAV--GAYFGCIT 174
Query: 97 ------SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY 150
+ +++G D + A G+LG++R T+SF++Q G+ +RF+YC+ GE
Sbjct: 175 SYSSTTATNSNGTGTDVSEAA-TGLLGMNRGTLSFVTQTGT---RRFAYCIA----PGE- 225
Query: 151 TSSYLKFGTDMGYRRPSTQATKF-INHPNNF-----YYLSLKDISIDNERMNFPPDTFDI 204
L G D G P I+ P + Y + L+ I + + P
Sbjct: 226 GPGVLLLGDDGGVAPPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVGCALLPIPKSVLTP 285
Query: 205 TVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP-------IQLC 257
+G G ++DSG+ T+ +D Y L +F S R LA L EP C
Sbjct: 286 DHTGAGQTMVDSGTQFTFLLADAYAALKAEFTSQ-ARLLLAPLG---EPGFVFQGAFDAC 341
Query: 258 YFLPE-----TFNRFPSMAFYFEDANLRIDGEN-VFIIDYENHFFLLAVA------PHDD 305
+ PE P + A + + GE ++++ E A A + D
Sbjct: 342 FRGPEARVAAASGLLPEVGLVLRGAEVAVSGEKLLYMVPGERRGEGGAEAVWCLTFGNSD 401
Query: 306 LVAL----IGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ + IG Q++ YDL + F C
Sbjct: 402 MAGMSAYVIGHHHQQNVWVEYDLQNGRVGFAPARC 436
>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
gi|241946781|gb|EES19926.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
Length = 466
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 91/376 (24%), Positives = 146/376 (38%), Gaps = 53/376 (14%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA------------------IFDPRKSSSFQKINCDH 43
VR +GTP++ +L+ DTGS L + +F S S+ I C
Sbjct: 103 VRFRVGTPAQPFVLVADTGSDLTWVKCRGAGAAAGTGAGSPARVFRTAASKSWAPIACSS 162
Query: 44 PDCTYF------KCVN--EQCVYTMKYADQSVTKGFAAHETISVI-----------GKGE 84
CT + C + C Y +Y D S +G ++ ++ G
Sbjct: 163 DTCTSYVPFSLANCSSPASPCAYDYRYRDGSAARGVVGTDSATIALSSGSGRGGGDSSGG 222
Query: 85 GKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIP 144
+A G + GC+ G + DG VL L ISF S+ + RFSYCLV
Sbjct: 223 RRAKLQGVVLGCAATYDGQSFQSSDG----VLSLGNSNISFASRAAARFGGRFSYCLVDH 278
Query: 145 LPNGEYTSSYLKFGTDMGYRRPSTQATKFINHP-NNFYYLSLKDISIDNERMNFPPDTFD 203
L T SYL FG G P+ Q ++ FY +++ + + E ++ P D +D
Sbjct: 279 LAPRNAT-SYLTFGP--GATAPAAQTPLLLDRRMTPFYAVTVDAVYVAGEALDIPADVWD 335
Query: 204 ITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET 263
V GG I+DSG+ LT + Y + + + +P + CY +
Sbjct: 336 --VDRNGGAILDSGTSLTILATPAYRAVVTALSKHLAGLPRVTM----DPFEYCYNWTDA 389
Query: 264 FN-RFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHD-DLVALIGSQQQRDTRFV 321
P M +F + ++ID + V V++IG+ Q++ +
Sbjct: 390 GALEIPKMEVHFAGSARLEPPAKSYVIDAAPGVKCIGVQEGSWPGVSVIGNILQQEHLWE 449
Query: 322 YDLNIDLLSFVKENCS 337
+DL L F C+
Sbjct: 450 FDLRDRWLRFKHTRCA 465
>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
Length = 519
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 91/365 (24%), Positives = 146/365 (40%), Gaps = 58/365 (15%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
+V + +GTP+ ++ DTGS + +FDP SS++ ++C P
Sbjct: 184 VVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTYANVSCAAPA 243
Query: 46 CTYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C+ C C+Y ++Y D S + GF A +T+++ G FGC N G
Sbjct: 244 CSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTL----SSYDAVKGFRFGCGERNDG 299
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
+A AG+LGL R S Q F++C LP + YL FG
Sbjct: 300 LFGEA-----AGLLGLGRGKTSLPVQTYGKYGGVFAHC----LPARSTGTGYLDFGAG-- 348
Query: 163 YRRPSTQATKFI--NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
P+T T + N P FYY+ + I + + P F G I+DSG+V+
Sbjct: 349 -SPPATTTTPMLTGNGP-TFYYVGMTGIRVGGRLLPIAPSVFAAA-----GTIVDSGTVI 401
Query: 221 TYFHSDVYWKLHEKFVSY-----FERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYF 274
T Y L F + + + L D CY F + P+++ F
Sbjct: 402 TRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLD------TCYDFTGMSQVAIPTVSLLF 455
Query: 275 E-DANLRIDGENVFIIDYENHFFLLAVAPHDDL--VALIGSQQQRDTRFVYDLNIDLLSF 331
+ A L +D + + LA A ++D V ++G+ Q + YD+ ++ F
Sbjct: 456 QGGAALDVDASGIMYTVSASQ-VCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGF 514
Query: 332 VKENC 336
C
Sbjct: 515 SPGAC 519
>gi|222640709|gb|EEE68841.1| hypothetical protein OsJ_27628 [Oryza sativa Japonica Group]
Length = 375
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 91/338 (26%), Positives = 141/338 (41%), Gaps = 36/338 (10%)
Query: 15 LILDTGSALIYAIFDPRKSSSFQKINCDHPDCTYFKCVNEQC-VYTMKYADQSVTKGFAA 73
LI+DTGS LI+ K SS H + + +T + G A
Sbjct: 55 LIVDTGSDLIWTQC---KLSSSTAAAARHGSPPLSRTAPARTGAFTRTCTASAAAVGVLA 111
Query: 74 HETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSII 133
ET + G +A+ FGC + G A G+LGLS ++S I+QL
Sbjct: 112 SETFTF---GARRAVSLRLGFGCGALSAGSLIGA-----TGILGLSPESLSLITQLK--- 160
Query: 134 KKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPST----QATKFINHP--NNFYYLSLKD 187
+RFSYCL P + +S L FG R T Q T +++P +YY+ L
Sbjct: 161 IQRFSYCLT---PFADKKTSPLLFGAMADLSRHKTTRPIQTTAIVSNPVETVYYYVPLVG 217
Query: 188 ISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQL 247
IS+ ++R+ P + + G GG I+DSGS + Y + + E + + +L
Sbjct: 218 ISLGHKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAVKE---AVMDVVRLPVA 274
Query: 248 SDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDGENVFIIDYENHF-------FLLAV 300
+ E +LC+ LP A L DG ++ +N+F LAV
Sbjct: 275 NRTVEDYELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDNYFQEPRAGLMCLAV 334
Query: 301 APHDD--LVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
D V++IG+ QQ++ ++D+ SF C
Sbjct: 335 GKTTDGSGVSIIGNVQQQNMHVLFDVQHHKFSFAPTQC 372
>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
Length = 493
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 98/375 (26%), Positives = 161/375 (42%), Gaps = 65/375 (17%)
Query: 6 IGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCDHPDC 46
+GTP K + +DTGS +++ ++DP+ SS+ + CD C
Sbjct: 94 LGTPPKRFYVQVDTGSDILWVNCITCDQCPHKSGLGLDLTLYDPKASSTGSTVMCDQGFC 153
Query: 47 T------YFKC-VNEQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGALFGC 96
KC N C Y++ Y D S T G ++ + V G G+ + +FGC
Sbjct: 154 ADTFGGRLPKCSANVPCEYSVTYGDGSSTVGSFVNDALQFDQVTGDGQTQPANASVIFGC 213
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYTSSY 154
G D + AL G+LG S +SQL + +KK F++CL G +
Sbjct: 214 GA-QQGGDLGSSSQALDGILGFGEANTSMLSQLATAGKVKKIFAHCLDTIKGGGIFAIGD 272
Query: 155 LKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEG-GC 212
+ +P + T + + P+ Y ++LK I + + P D F GE G
Sbjct: 273 V--------VQPKVKTTPLVADKPH--YNVNLKTIDVGGTTLELPADIFK---PGEKRGT 319
Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMA 271
IIDSG+ LTY V+ K+ ++ F + Q D + LC+ + + FP++
Sbjct: 320 IIDSGTTLTYLPELVFKKV---MLAVFNKHQDITFHDVQD--FLCFEYSGSVDDGFPTLT 374
Query: 272 FYFE-DANLRI--------DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVY 322
F+FE D L + +G +V+ + ++N L D+V L+G + VY
Sbjct: 375 FHFEDDLALHVYPHEYFFPNGNDVYCVGFQNG--ALQSKDGKDIV-LMGDLVLSNKLVVY 431
Query: 323 DLNIDLLSFVKENCS 337
DL ++ + NCS
Sbjct: 432 DLENRVIGWTDYNCS 446
>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 445
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 90/366 (24%), Positives = 149/366 (40%), Gaps = 64/366 (17%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY----------------AIFDPRKSSSFQKINCDHP 44
+VR+ GTP+ ++++DTGS + + ++DP SS++ + C
Sbjct: 114 VVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPSHSSTYSAVPCASD 173
Query: 45 DCTYFK-------CVN-EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
C C + +QC + + YAD + T G + + +++ AI FGC
Sbjct: 174 VCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDKLTL----APGAIVQNFYFGC 229
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
+ H A G GVLGL R+ S ++ G + FSYCL P+ +L
Sbjct: 230 GHGKH-----AVRGLFDGVLGLGRLRESLGARYGGV----FSYCL----PSVSSKPGFLA 276
Query: 157 FGTDMGYRRPS----TQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
G + PS T P F ++L I++ ++++ P F GG
Sbjct: 277 LGAG---KNPSGFVFTPMGTVPGQPT-FSTVTLAGINVGGKKLDLRPSAF------SGGM 326
Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMA 271
I+DSG+V+T S Y L F E ++L D + CY L N P +A
Sbjct: 327 IVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNGD----LDTCYNLTGYKNVVVPKIA 382
Query: 272 FYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
F A + +D N ++ N A + D ++G+ QR ++D +
Sbjct: 383 LTFTGGATINLDVPNGILV---NGCLAFAESGPDGSAGVLGNVNQRAFEVLFDTSTSKFG 439
Query: 331 FVKENC 336
F + C
Sbjct: 440 FRAKAC 445
>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 91/369 (24%), Positives = 144/369 (39%), Gaps = 69/369 (18%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
+V + +GTP + +DTGS + + +FDP KSS++ + C
Sbjct: 144 VVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQRDQLFDPAKSSTYSAVPCGAD 203
Query: 45 DCTYFK-----CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
C+ + C QC Y + Y D S T G +T++ + G F LFGC +
Sbjct: 204 ACSELRIYEAGCSGSQCGYVVSYGDGSNTTGVYGSDTLA-LAPGNTVGTF---LFGCGHA 259
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
G + G+L L R ++S SQ FSYC LP+ + + YL G
Sbjct: 260 QAGMFA-----GIDGLLALGRQSMSLKSQAAGAYGGVFSYC----LPSKQSAAGYLTLGG 310
Query: 160 DMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
+T FY + L IS+ +++ P F GG ++D+G+V
Sbjct: 311 PTSASGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAF------AGGTVVDTGTV 364
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP----IQLCYFLPETFNRF-----PSM 270
+T Y L F R +A P + CY F+R+ P++
Sbjct: 365 ITRLPPTAYAALRSAF-----RGAIAPYGYPSAPANGILDTCY----DFSRYGVVTLPTV 415
Query: 271 AFYFE-DANLRIDGENVFIIDYENHFFLLAVAPH--DDLVALIGSQQQRDTRFVYDLNID 327
A F A L ++ + LA AP+ D A++G+ QQR F +
Sbjct: 416 ALTFSGGATLALEAPGIL------SSGCLAFAPNGGDGDAAILGNVQQRS--FAVRFDGS 467
Query: 328 LLSFVKENC 336
+ F+ C
Sbjct: 468 TVGFMPGAC 476
>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 480
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 90/378 (23%), Positives = 161/378 (42%), Gaps = 67/378 (17%)
Query: 3 RLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCDH 43
++ +G+P K + +DTGS +++ +++D + SS+ + + C+
Sbjct: 80 KIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKASSTSKNVGCED 139
Query: 44 PDCTYF----KC-VNEQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGALFG 95
C++ C + C Y + Y D S + G + I+ V G + +FG
Sbjct: 140 AFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFVKDNITLDQVTGNLRTAPLAQEVVFG 199
Query: 96 CSNDNHGFDEDARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTSS 153
C + G + A+ G++G + S ISQL G +K+ FS+CL G +
Sbjct: 200 CGKNQSG-QLGQTESAVDGIMGFGQSNTSVISQLAAGGSVKRIFSHCLDNMNGGGIFAIG 258
Query: 154 YLKFGTDMGYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEGGC 212
++ P + T + PN +Y + LK + +D E ++ PP + +G+GG
Sbjct: 259 EVE--------SPVVKTTPLV--PNQVHYNVILKGMDVDGEPIDLPPSL--ASTNGDGGT 306
Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAF 272
IIDSG+ L Y ++Y L EK + Q +L E F T FP +
Sbjct: 307 IIDSGTTLAYLPQNLYNSLIEKITAK----QQVKLHMVQETFACFSFTSNTDKAFPVVNL 362
Query: 273 YFEDA------------NLRIDGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTR 319
+FED+ +LR E+++ +++ + D V L+G +
Sbjct: 363 HFEDSLKLSVYPHDYLFSLR---EDMYCFGWQSG----GMTTQDGADVILLGDLVLSNKL 415
Query: 320 FVYDLNIDLLSFVKENCS 337
VYDL +++ + NCS
Sbjct: 416 VVYDLENEVIGWADHNCS 433
>gi|242069057|ref|XP_002449805.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
gi|241935648|gb|EES08793.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
Length = 430
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 99/372 (26%), Positives = 158/372 (42%), Gaps = 62/372 (16%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++ L IGTP + + DTGS L + I+D SSSF + C C
Sbjct: 84 LMELAIGTPPVPFIALADTGSDLTWTQCKPCKLCFGQDTPIYDTTTSSSFSPLPCSSATC 143
Query: 47 TYF---KCV--NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
+C + C Y Y D + + A ISV G FGC DN
Sbjct: 144 LPIWSSRCSTPSATCRYRYAYDDGAYSPECAG---ISV----------GGIAFGCGVDNG 190
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
G ++ G +GL R ++S ++QLG +FSYCL SS + FG+
Sbjct: 191 GLSYNS-----TGTVGLGRGSLSLVAQLG---VGKFSYCLTDFF--NTSLSSPVFFGSLA 240
Query: 162 GYRRPS-------TQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVS-GEGG 211
S Q+T + P N YY+SL+ IS+ + R+ P TFD+ G GG
Sbjct: 241 ELAASSASADAAVVQSTPLVQSPYNPSRYYVSLEGISLGDARLPIPNGTFDLNDDDGSGG 300
Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLS---DC-PEPIQLCYFLPETFNRF 267
I+DSG++ T + + + + + S C P P LP+
Sbjct: 301 MIVDSGTIFTILVETGFRVVVDHVAGVLGQPVVNASSLDRPCFPAPAAGVQELPD----M 356
Query: 268 PSMAFYFED-ANLRIDGENVFIIDYENHFFLLAVAPHDDLV-ALIGSQQQRDTRFVYDLN 325
P M +F A++R+ +N + E F L + + +++G+ QQ++ + ++D+
Sbjct: 357 PDMVLHFAGGADMRLHRDNYMSFNEEESSFCLNIVGTESASGSVLGNFQQQNIQMLFDIT 416
Query: 326 IDLLSFVKENCS 337
+ LSF+ +CS
Sbjct: 417 VGQLSFMPTDCS 428
>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
gi|194696366|gb|ACF82267.1| unknown [Zea mays]
gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 411
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 90/366 (24%), Positives = 149/366 (40%), Gaps = 64/366 (17%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY----------------AIFDPRKSSSFQKINCDHP 44
+VR+ GTP+ ++++DTGS + + ++DP SS++ + C
Sbjct: 80 VVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPSHSSTYSAVPCASD 139
Query: 45 DCTYFK-------CVN-EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
C C + +QC + + YAD + T G + + +++ AI FGC
Sbjct: 140 VCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDKLTL----APGAIVQNFYFGC 195
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
+ H A G GVLGL R+ S ++ G + FSYCL P+ +L
Sbjct: 196 GHGKH-----AVRGLFDGVLGLGRLRESLGARYGGV----FSYCL----PSVSSKPGFLA 242
Query: 157 FGTDMGYRRPS----TQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
G + PS T P F ++L I++ ++++ P F GG
Sbjct: 243 LGAG---KNPSGFVFTPMGTVPGQPT-FSTVTLAGINVGGKKLDLRPSAF------SGGM 292
Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMA 271
I+DSG+V+T S Y L F E ++L D + CY L N P +A
Sbjct: 293 IVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNGD----LDTCYNLTGYKNVVVPKIA 348
Query: 272 FYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
F A + +D N ++ N A + D ++G+ QR ++D +
Sbjct: 349 LTFTGGATINLDVPNGILV---NGCLAFAESGPDGSAGVLGNVNQRAFEVLFDTSTSKFG 405
Query: 331 FVKENC 336
F + C
Sbjct: 406 FRAKAC 411
>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 91/369 (24%), Positives = 144/369 (39%), Gaps = 69/369 (18%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
+V + +GTP + +DTGS + + +FDP KSS++ + C
Sbjct: 144 VVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQRDQLFDPAKSSTYSAVPCGAD 203
Query: 45 DCTYFK-----CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
C+ + C QC Y + Y D S T G +T++ + G F LFGC +
Sbjct: 204 ACSELRIYEAGCSGSQCGYVVSYGDGSNTTGVYGSDTLA-LAPGNTVGTF---LFGCGHA 259
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
G + G+L L R ++S SQ FSYC LP+ + + YL G
Sbjct: 260 QAGMFA-----GIDGLLALGRQSMSLKSQAAGAYGGVFSYC----LPSKQSAAGYLTLGG 310
Query: 160 DMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
+T FY + L IS+ +++ P F GG ++D+G+V
Sbjct: 311 PSSASGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAF------AGGTVVDTGTV 364
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP----IQLCYFLPETFNRF-----PSM 270
+T Y L F R +A P + CY F+R+ P++
Sbjct: 365 ITRLPPTAYAALRSAF-----RGAIAPCGYPSAPANGILDTCY----DFSRYGVVTLPTV 415
Query: 271 AFYFE-DANLRIDGENVFIIDYENHFFLLAVAPH--DDLVALIGSQQQRDTRFVYDLNID 327
A F A L ++ + LA AP+ D A++G+ QQR F +
Sbjct: 416 ALTFSGGATLALEAPGIL------SSGCLAFAPNGGDGDAAILGNVQQRS--FAVRFDGS 467
Query: 328 LLSFVKENC 336
+ F+ C
Sbjct: 468 TVGFMPGAC 476
>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
Length = 659
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 89/363 (24%), Positives = 149/363 (41%), Gaps = 47/363 (12%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
RL+IGTP + LI+DTGS + Y F P +SS++ + C+ DC
Sbjct: 90 TRLWIGTPPQEFALIVDTGSTVTYVPCSDCEHCGKHQDPRFQPDESSTYHPVKCNM-DC- 147
Query: 48 YFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
C ++ CVY +YA+ S + G + IS + E + A+FGC N G
Sbjct: 148 --NCDHDGVNCVYERRYAEMSSSSGVLGEDIISFGNQSE--VVPQRAVFGCENVETG--- 200
Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCL-VIPLPNGEYTSSYLKFGTDMG 162
D G++GL R +S + QL ++I FS C + + G + DM
Sbjct: 201 DLYSQRADGIMGLGRGQLSIVDQLVDKNVINDSFSLCYGGMHVGGGAMVLGGIPPPPDMV 260
Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
+ R + +Y + LK+I + + + P TFD + G ++DSG+ Y
Sbjct: 261 FSRSDPYRSP-------YYNIELKEIHVAGKPLKLSPSTFD----RKHGTVLDSGTTYAY 309
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF-----LPETFNRFPSMAFYFEDA 277
+ + + + + D P +C+ + + FP + F +
Sbjct: 310 LPEEAFVAFRDAIIKKSHNLKQIHGPD-PNYNDICFSGAGRDVSQLSKAFPEVDMVFSNG 368
Query: 278 N-LRIDGENVFIIDYENH-FFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
L + EN + H + L + + D L+G R+T YD + + F K N
Sbjct: 369 QKLSLTPENYLFQHTKVHGAYCLGIFRNGDSTTLLGGIIVRNTLVTYDRENEKIGFWKTN 428
Query: 336 CSD 338
CS+
Sbjct: 429 CSE 431
>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
Length = 478
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 91/377 (24%), Positives = 168/377 (44%), Gaps = 60/377 (15%)
Query: 3 RLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCDH 43
++ +G+PSK + +DTGS +++ ++DP++S + + ++C+H
Sbjct: 72 KIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFVSCEH 131
Query: 44 PDC--TY----FKCVNEQ-CVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGAL 93
C TY C E C Y++ Y D S T G+ + ++ V G +
Sbjct: 132 NFCSSTYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNPHTATQNSSII 191
Query: 94 FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYT 151
FGC G + + AL G++G + S +SQL + +KK FS+CL + G ++
Sbjct: 192 FGCGAAQSGTFASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDTNVGGGIFS 251
Query: 152 SSYLKFGTDMGYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEG 210
+ P + T + PN +Y + LK+I +D + + P DTFD + +G+
Sbjct: 252 IGEV--------VEPKVKTTPLV--PNMAHYNVILKNIEVDGDILQLPSDTFD-SENGK- 299
Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSM 270
G +IDSG+ L Y VY +L K ++ R ++ + + Q + + FP +
Sbjct: 300 GTVIDSGTTLAYLPRIVYDQLMSKVLAKQPRLKVYLVEEQYSCFQ---YTGNVDSGFPIV 356
Query: 271 AFYFEDA-NLRI---------DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRF 320
+FED+ +L + G++ + I ++ + + + L+G +
Sbjct: 357 KLHFEDSLSLTVYPHDYLFNYKGDSYWCIGWQKS---ASETKNGKDMTLLGDFVLSNKLV 413
Query: 321 VYDLNIDLLSFVKENCS 337
VYDL + + NCS
Sbjct: 414 VYDLENMTIGWTDYNCS 430
>gi|255566008|ref|XP_002523992.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536719|gb|EEF38360.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 91/360 (25%), Positives = 157/360 (43%), Gaps = 41/360 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
++++ IG P +L+ + TGS L++ FDP +SS+++ + CD
Sbjct: 99 LMKISIGIPPTELLVNVATGSDLVWIPCLSFKPCTHNCDLRFFDPMESSTYKNVPCDSYR 158
Query: 46 CTYFK---CVNEQCVYTMKYADQ-SVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
C C C Y+ Q S G A +T+++ + F C N
Sbjct: 159 CQITNAATCQFSDCFYSCDPRHQDSCPDGDLAMDTLTLNSTTGKSFMLPNTGFICGNRIG 218
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
G D G+LGL ++S ++++ +I +FS+C+V P +S L FG
Sbjct: 219 G------DYPGVGILGLGHGSLSLLNRISHLIDGKFSHCIV---PYSSNQTSKLSFGDKA 269
Query: 162 GYRRPSTQATKFINHPNNF-YYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
+ +T+ + Y LS IS+ N+ ++ D ++G G +DSG++
Sbjct: 270 VVSGSAMFSTRLDMTGGPYSYTLSFYGISVGNKSISAGGIGSDYYMNGLG---MDSGTMF 326
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP---IQLCYFLPETFNRFPSMAFYFEDA 277
TYF Y +L Y R+ + Q P+P ++LCY F+ P++ +FE
Sbjct: 327 TYFPEYFYSQLE-----YDVRYAIQQEPLYPDPTRRLRLCYRYSPDFSP-PTITMHFEGG 380
Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
++ + N FI E+ L + A+ G QQ + YDL+ LSF+K +C+
Sbjct: 381 SVELSSSNSFIRMTEDIVCLAFATSSSEQDAVFGYWQQTNLLIGYDLDAGFLSFLKTDCT 440
>gi|125543284|gb|EAY89423.1| hypothetical protein OsI_10930 [Oryza sativa Indica Group]
Length = 447
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 98/395 (24%), Positives = 157/395 (39%), Gaps = 75/395 (18%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI------------FDPRKSSSFQKINCDHPDCTYF 49
V + +GTP + V ++LDTGS L + + F+ SSS+ + C C +
Sbjct: 57 VPVAVGTPPQNVTMVLDTGSELSWLLCNGSYAPPLTPAFNASGSSSYGAVPCPSTACEWR 116
Query: 50 K--------C---VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC-- 96
C + C ++ YAD S G A +T + G A+ GA FGC
Sbjct: 117 GRDLPVPPFCDTPPSNACRVSLSYADASSADGVLATDTFLLTGGAPPVAV--GAYFGCIT 174
Query: 97 ------SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY 150
+ +++G D + A G+LG++R T+SF++Q G+ +RF+YC+ GE
Sbjct: 175 SYSSTTATNSNGTGTDVSEAA-TGLLGMNRGTLSFVTQTGT---RRFAYCIA----PGE- 225
Query: 151 TSSYLKFGTDMGYRRPSTQATKF-INHPNNF-----YYLSLKDISIDNERMNFPPDTFDI 204
L G D G P I+ P + Y + L+ I + + P
Sbjct: 226 GPGVLLLGDDGGVAPPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVGCALLPIPKSVLTP 285
Query: 205 TVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP-------IQLC 257
+G G ++DSG+ T+ +D Y L +F S R LA L EP C
Sbjct: 286 DHTGAGQTMVDSGTQFTFLLADAYAALKAEFTSQ-ARLLLAPLG---EPGFVFQGAFDAC 341
Query: 258 YFLPE-----TFNRFPSMAFYFEDANLRIDGEN-VFIIDYENHFFLLAVA------PHDD 305
+ PE P + A + + GE ++++ E A A + D
Sbjct: 342 FRGPEARVAAASGLLPVVGLVLRGAEVAVSGEKLLYMVPGERRGEGGAEAVWCLTFGNSD 401
Query: 306 LVAL----IGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ + IG Q++ YDL + F C
Sbjct: 402 MAGMSAYVIGHHHQQNVWVEYDLQNGRVGFAPARC 436
>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
Length = 489
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 94/379 (24%), Positives = 161/379 (42%), Gaps = 73/379 (19%)
Query: 6 IGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCDH--- 43
+GTP K + +DTGS +++ +DP+ SSS ++CD
Sbjct: 90 LGTPPKRYYVQVDTGSDILWVNCISCEKCPRKSGLGLDLTFYDPKASSSGSTVSCDQGFC 149
Query: 44 --------PDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGA 92
P CT N C Y++ Y D S T GF + + V G G+ +
Sbjct: 150 AATYGGKLPGCT----ANVPCEYSVMYGDGSSTTGFFVTDALQFDQVTGDGQTQPGNATV 205
Query: 93 LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEY 150
FGC G D + + AL G+LG + S +SQL + +KK F++CL G +
Sbjct: 206 TFGCGA-QQGGDLGSSNQALDGILGFGQANTSMLSQLAAAGKVKKIFAHCLDTIKGGGIF 264
Query: 151 TSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
+ +P + T + + P+ Y ++LK I + + P F+ +GE
Sbjct: 265 AIGNV--------VQPKVKTTPLVADMPH--YNVNLKSIDVGGTTLQLPAHVFE---TGE 311
Query: 210 -GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF-NRF 267
G IIDSG+ LTY V+ ++ + + + D +C+ P + + F
Sbjct: 312 RKGTIIDSGTTLTYLPELVFKEVMAAIFNKHQDIVFHNVQDF-----MCFQYPGSVDDGF 366
Query: 268 PSMAFYFE-DANLRI--------DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDT 318
P++ F+FE D L + +G +++ + ++N L D+V L+G +
Sbjct: 367 PTITFHFEDDLALHVYPHEYFFPNGNDMYCVGFQNG--ALQSKDGKDIV-LMGDLVLSNK 423
Query: 319 RFVYDLNIDLLSFVKENCS 337
+YDL ++ + NCS
Sbjct: 424 LVIYDLENQVIGWTDYNCS 442
>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
Length = 515
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 91/365 (24%), Positives = 146/365 (40%), Gaps = 58/365 (15%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
+V + +GTP+ ++ DTGS + +FDP SS++ ++C P
Sbjct: 180 VVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTYANVSCAAPA 239
Query: 46 CTYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C+ C C+Y ++Y D S + GF A +T+++ G FGC N G
Sbjct: 240 CSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTL----SSYDAVKGFRFGCGERNDG 295
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
+A AG+LGL R S Q F++C LP + YL FG
Sbjct: 296 LFGEA-----AGLLGLGRGKTSLPVQTYGKYGGVFAHC----LPARSTGTGYLDFGAG-- 344
Query: 163 YRRPSTQATKFI--NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
P+T T + N P FYY+ + I + + P F G I+DSG+V+
Sbjct: 345 -SPPATTTTPMLTGNGP-TFYYVGMTGIRVGGRLLPIAPSVF-----AAAGTIVDSGTVI 397
Query: 221 TYFHSDVYWKLHEKFVSY-----FERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYF 274
T Y L F + + + L D CY F + P+++ F
Sbjct: 398 TRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLD------TCYDFTGMSQVAIPTVSLLF 451
Query: 275 E-DANLRIDGENVFIIDYENHFFLLAVAPHDDL--VALIGSQQQRDTRFVYDLNIDLLSF 331
+ A L +D + + LA A ++D V ++G+ Q + YD+ ++ F
Sbjct: 452 QGGAALDVDASGIMYTVSASQ-VCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGF 510
Query: 332 VKENC 336
C
Sbjct: 511 SPGAC 515
>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
Length = 516
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 91/365 (24%), Positives = 146/365 (40%), Gaps = 58/365 (15%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
+V + +GTP+ ++ DTGS + +FDP SS++ ++C P
Sbjct: 181 VVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTYANVSCAAPA 240
Query: 46 CTYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C+ C C+Y ++Y D S + GF A +T+++ G FGC N G
Sbjct: 241 CSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTL----SSYDAVKGFRFGCGERNDG 296
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
+A AG+LGL R S Q F++C LP + YL FG
Sbjct: 297 LFGEA-----AGLLGLGRGKTSLPVQTYGKYGGVFAHC----LPPRSTGTGYLDFGAG-- 345
Query: 163 YRRPSTQATKFI--NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
P+T T + N P FYY+ + I + + P F G I+DSG+V+
Sbjct: 346 -SPPATTTTPMLTGNGP-TFYYVGMTGIRVGGRLLPIAPSVF-----AAAGTIVDSGTVI 398
Query: 221 TYFHSDVYWKLHEKFVSY-----FERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYF 274
T Y L F + + + L D CY F + P+++ F
Sbjct: 399 TRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLD------TCYDFTGMSQVAIPTVSLLF 452
Query: 275 E-DANLRIDGENVFIIDYENHFFLLAVAPHDDL--VALIGSQQQRDTRFVYDLNIDLLSF 331
+ A L +D + + LA A ++D V ++G+ Q + YD+ ++ F
Sbjct: 453 QGGAALDVDASGIMYTVSASQ-VCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGF 511
Query: 332 VKENC 336
C
Sbjct: 512 SPGAC 516
>gi|359478045|ref|XP_002267046.2| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 502
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 99/393 (25%), Positives = 166/393 (42%), Gaps = 94/393 (23%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
++ IGTP++ + +DTGS +++ ++D ++S + + ++CD
Sbjct: 100 AKIGIGTPARDYYVQVDTGSDIMWVNCIQCNECPKKSSLGMELTLYDIKESLTGKLVSCD 159
Query: 43 HPDCTYFK------CV-NEQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGA 92
C C+ N C YT YAD S + G+ + + V G E +
Sbjct: 160 QDFCYAINGGPPSYCIANMSCSYTEIYADGSSSFGYFVRDIVQYDQVSGDLETTSANGSV 219
Query: 93 LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEY 150
+FGCS G + + + AL G+LG + S ISQL S ++K F++CL G +
Sbjct: 220 IFGCSATQSG--DLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLNGGGIF 277
Query: 151 TSSYLKFGTDMGYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGE 209
++ +P T + PN +Y +++K + + +N P D FD V +
Sbjct: 278 AIGHI--------VQPKVNTTPLV--PNQTHYNVNMKAVEVGGYFLNLPTDVFD--VGDK 325
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL-CYFLPETF-NRF 267
G IIDSG+ L Y VY +L K S+ ++ + D Q C+ E+ + F
Sbjct: 326 KGTIIDSGTTLAYLPEVVYDQLLSKIFSWQSDLKVHTIHD-----QFTCFQYSESLDDGF 380
Query: 268 PSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHD-----DLVALIGSQ----QQRDT 318
P++ F+FE++ L V PH+ D + IG Q Q RD
Sbjct: 381 PAVTFHFENS------------------LYLKVHPHEYLFSYDGLWCIGWQNSGMQSRDR 422
Query: 319 R--------------FVYDLNIDLLSFVKENCS 337
R +YDL ++ + + NCS
Sbjct: 423 RNITLLGDLALSNKLVLYDLENQVIGWTEYNCS 455
>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 640
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 92/364 (25%), Positives = 149/364 (40%), Gaps = 48/364 (13%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
RL+IGTP + LI+DTGS + Y F P S ++Q + C PDC
Sbjct: 91 TRLWIGTPPQRFALIVDTGSTVTYVPCSTCEHCGRHQDPKFQPDLSETYQPVKCT-PDCN 149
Query: 48 YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDA 107
N QC+Y +YA+ S + G + +S E A+FGC ND G D
Sbjct: 150 CDGDTN-QCMYDRQYAEMSSSSGVLGEDVVSFGNLSELAP--QRAVFGCENDETG---DL 203
Query: 108 RDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCL-VIPLPNGEYTSSYLKFGTDMGYR 164
G++GL R +S + QL +I FS C + + G + DM +
Sbjct: 204 YSQRADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGGGAMILGGISPPEDMVFT 263
Query: 165 RPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
+ +Y ++LK++ + +++ P FD G+ G ++DSG+ Y
Sbjct: 264 HSDPDRSP-------YYNINLKEMHVAGKKLQLNPKVFD----GKHGTVLDSGTTYAYLP 312
Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCPEP--IQLCYF-----LPETFNRFPSMAFYFEDA 277
+ + ER L Q+ + P+P +C+ + + FP + FE+
Sbjct: 313 ETAFLAFKRAIMK--ERNSLKQI-NGPDPNYKDICFTGAGIDVSQLAKSFPVVDMVFENG 369
Query: 278 N-LRIDGENVFI--IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
+ L + EN + L + D L+G R+T +YD + F K
Sbjct: 370 HKLSLSPENYLFRHSKVRGAYCLGVFSNGRDPTTLLGGIFVRNTLVMYDRENSKIGFWKT 429
Query: 335 NCSD 338
NCS+
Sbjct: 430 NCSE 433
>gi|326531368|dbj|BAK05035.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 412
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 88/318 (27%), Positives = 141/318 (44%), Gaps = 27/318 (8%)
Query: 27 IFDPRKSSSFQKINCDHPDCT--YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGE 84
+F P S +F ++ + P CT Y K N C + S G+ + +T + G
Sbjct: 113 LFSPGASPTFHGVHSNDPVCTVPYRKTAN-GCSFHF-----SSITGYLSRDTFH-LRTGR 165
Query: 85 GKAIFHG---ALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL 141
A+ +FGC++ + GF D L GVL LS + +S ++QLG+ RFSYCL
Sbjct: 166 AGAVRESIPRVVFGCAHSSTGFHND---NTLGGVLSLSHLPLSLLTQLGAHASGRFSYCL 222
Query: 142 VIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPN-NFYYLSLKDISIDNERMNFPPD 200
P G L G D+ P + T + HP + Y+L+L I+ +R+
Sbjct: 223 --PKSTGHNPHGSLFLGADVPSPPPHSHTTNLVIHPGVSGYHLNLIGITRGYKRLK---- 276
Query: 201 TFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP-EPIQLCYF 259
D V C I+ +T+ +Y + + V+ + ++ P P+
Sbjct: 277 -IDKRVLVSHSCSINPAETITHIAEPIYLVVEKALVARMKELGSDRVKGPPGGPLWFDRM 335
Query: 260 LPETFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDT 318
+ P+MAF+FE A L + +F + N F++A + V IG+ QQ +T
Sbjct: 336 YQSVKEQLPNMAFHFEGGAELWFTSDRLFEVHGMNARFMVAGRGYRRTV--IGAAQQVNT 393
Query: 319 RFVYDLNIDLLSFVKENC 336
RF +D+ LSFV E C
Sbjct: 394 RFTFDVARGKLSFVSEVC 411
>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 354
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 85/366 (23%), Positives = 154/366 (42%), Gaps = 58/366 (15%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPDC 46
V++ +G+P++ +I+DTGS+L + +FDP S +++ ++C C
Sbjct: 15 VKVGLGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSCTSSQC 74
Query: 47 TYF----------KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
+ + + CVYT Y D S + G+ + + +++ G ++GC
Sbjct: 75 SSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTL----APSQTLPGFVYGC 130
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
D+ G G AG+LGL R +S + Q+ S FSYCL P G +L
Sbjct: 131 GQDSEGLF-----GRAAGILGLGRNKLSMLGQVSSKFGYAFSYCL--PTRGG---GGFLS 180
Query: 157 FGTDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
G + + T P N Y+L L I++ + + + II
Sbjct: 181 IG-KASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPT------II 233
Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLP-ETFNRFPSMA 271
DSG+V+T VY + FV + ++ + P + C+ + P +
Sbjct: 234 DSGTVITRLPMSVYTPFQQAFV----KIMSSKYARAPGFSILDTCFKGNLKDMQSVPEVR 289
Query: 272 FYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
F+ A+L + NV ++ + LA A ++ VA+IG+ QQ+ + +D++ +
Sbjct: 290 LIFQGGADLNLRPVNV-LLQVDEGLTCLAFAGNNG-VAIIGNHQQQTFKVAHDISTARIG 347
Query: 331 FVKENC 336
F C
Sbjct: 348 FATGGC 353
>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 442
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 85/323 (26%), Positives = 144/323 (44%), Gaps = 53/323 (16%)
Query: 2 VRLFIGTPSKGVLLILDTGSAL-------------IYAIFDPRKSSSFQKINCDHPDCTY 48
+ + +GTP + + +++DTGS L Y F+P SSS+ I+C P CT
Sbjct: 68 ISITVGTPPQNMSMVIDTGSELSWLHCNTNTTATIPYPFFNPNISSSYTPISCSSPTCTT 127
Query: 49 --------FKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
C N C T+ YAD S ++G A +T G G + G +FGC N
Sbjct: 128 RTRDFPIPASCDSNNLCHATLSYADASSSEGNLASDTF-----GFGSSFNPGIVFGCMNS 182
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL-------VIPLPNGEYTS 152
++ + ++ D G++G++ ++S +SQL +FSYC+ ++ L GE
Sbjct: 183 SYSTNSES-DSNTTGLMGMNLGSLSLVSQLK---IPKFSYCISGSDFSGILLL--GE--- 233
Query: 153 SYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
S +G + Y +T + Y + L+ I I ++ +N + F +G G
Sbjct: 234 SNFSWGGSLNYTPLVQISTPLPYFDRSAYTVRLEGIKISDKLLNISGNLFVPDHTGAGQT 293
Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-----PIQLCYFLP---ETF 264
+ D G+ +Y VY L ++F++ L L D P + LCY +P
Sbjct: 294 MFDLGTQFSYLLGPVYNALRDEFLNQ-TNGTLRALDD-PNFVFQIAMDLCYRVPVNQSEL 351
Query: 265 NRFPSMAFYFEDANLRIDGENVF 287
PS++ FE A +R+ G+ +
Sbjct: 352 PELPSVSLVFEGAEMRVFGDQLL 374
>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 458
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 86/370 (23%), Positives = 151/370 (40%), Gaps = 47/370 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYAI---------------FDPRKSSSFQKINCDHPD 45
+ R +GTP + +L+ +D + + FDP +SS+++ + C P
Sbjct: 101 VARARLGTPPQTLLVAIDPSNDAAWVPCSACLGCAPGASSPSFDPTQSSTYRPVRCGAPQ 160
Query: 46 CTYFKCVNEQC--------VYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGA-LFGC 96
C C + + YA ++ + +S + G A+ FGC
Sbjct: 161 CAQVPPATPSCPAGPGASCAFNLSYASSTL-HAVLGQDALS-LSDSNGAAVPDDHYTFGC 218
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
G G++G R +SF+SQ + FSYCL P S L+
Sbjct: 219 LRVVTGSGGSVPP---QGLVGFGRGPLSFLSQTKATYGSIFSYCL--PSYKSSNFSGTLR 273
Query: 157 FGTDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDI-TVSGEGGCI 213
G RR + T +++P+ + YY+++ + ++ + + P + +G GG I
Sbjct: 274 LGPAGQPRR--IKTTPLLSNPHRPSLYYVAMVGVRVNGKAVPIPASALALDAATGRGGTI 331
Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFY 273
+D+G++ T Y L F R A + CY++ T P++AF
Sbjct: 332 VDAGTMFTRLSPPAYAALRNA----FRRGVSAPAAPALGGFDTCYYVNGT-KSVPAVAFV 386
Query: 274 FE-DANLRIDGENVFIIDYENHFFLLAVA--PHDDLVA---LIGSQQQRDTRFVYDLNID 327
F A + + ENV I LA+A P D + A ++ S QQ++ R V+D+
Sbjct: 387 FAGGARVTLPEENVVISSTSGGVACLAMAAGPSDGVNAGLNVLASMQQQNHRVVFDVGNG 446
Query: 328 LLSFVKENCS 337
+ F +E C+
Sbjct: 447 RVGFSRELCT 456
>gi|255581508|ref|XP_002531560.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223528821|gb|EEF30826.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 407
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 94/382 (24%), Positives = 159/382 (41%), Gaps = 65/382 (17%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY-------------AIFDPRKSSSFQKINCDHPDCTY 48
V L +GTP + V +++DTGS L + F+ +S S++ I C CT
Sbjct: 33 VSLTVGTPPQNVSMVIDTGSELSWLYCNKTTTTTSYPTTFNQTRSISYRPIPCSSSTCTN 92
Query: 49 --------FKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
C N C T+ YAD S ++G A +T + G + G +FGC +
Sbjct: 93 QTRDFSIPASCDSNSLCHATLSYADASSSEGNLASDTFHM-----GASDIPGMVFGCMDS 147
Query: 100 --NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
+ DED+++ G++G++R ++SF+SQ+G +FSYC+ +G S L
Sbjct: 148 VFSSNSDEDSKN---TGLMGMNRGSLSFVSQMG---FPKFSYCI-----SGTDFSGMLLL 196
Query: 158 G-------TDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
G + Y +T Y + L+ I + + + P F+ +G G
Sbjct: 197 GESNFTWAVPLNYTPLVQISTPLPYFDRIAYTVQLEGIKVSDRLLPIPKSVFEPDHTGAG 256
Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-----PIQLCYFLP---E 262
++DSG+ T+ Y L +F++ F L L D P+ + LCY +P
Sbjct: 257 QTMVDSGTQFTFLLGPAYTALRSEFLNQTTGF-LRVLED-PDFVFQGAMDLCYRVPISQR 314
Query: 263 TFNRFPSMAFYFEDANLRIDGENVFI-----IDYENHFFLLAVAPHDDL---VALIGSQQ 314
R P+++ F A + + E V I + L+ D L +IG
Sbjct: 315 VLPRLPTVSLVFNGAEMTVADERVLYRVPGEIRGNDSVHCLSFGNSDLLGVEAYVIGHHH 374
Query: 315 QRDTRFVYDLNIDLLSFVKENC 336
Q++ +DL + + C
Sbjct: 375 QQNVWMEFDLERSRIGLAQVRC 396
>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 683
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 95/366 (25%), Positives = 158/366 (43%), Gaps = 52/366 (14%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
RL+IGTP + LI+DTGS + Y F P SS++Q + C DC
Sbjct: 83 TRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPDLSSTYQPVKCTL-DC- 140
Query: 48 YFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
C N+ QCVY +YA+ S + G + +S + E A+FGC N G
Sbjct: 141 --NCDNDRMQCVYERQYAEMSTSSGVLGEDVVSFGNQSELAP--QRAVFGCENVETG--- 193
Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCL-VIPLPNGEYTSSYLKFGTDMG 162
D G++GL R +S + QL +++ FS C + + G + +DM
Sbjct: 194 DLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGGAMVLGGISPPSDMV 253
Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
+ + + + P +Y + LK+I + +R+ P FD G+ G ++DSG+ Y
Sbjct: 254 FAQ-----SDPVRSP--YYNIDLKEIHVAGKRLPLNPSVFD----GKHGSVLDSGTTYAY 302
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEP--IQLCYF-----LPETFNRFPSMAFYFE 275
+ + E V + F +Q+S P+P LC+ + + FP + F
Sbjct: 303 LPEEAFLAFKEAIVKELQSF--SQISG-PDPNYNDLCFSGAGIDVSQLSKTFPVVDMIFG 359
Query: 276 DAN-LRIDGEN-VFIIDYENHFFLLAVAPH-DDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
+ + + EN +F + L + + D L+G R+T +YD + F
Sbjct: 360 NGHKYSLSPENYMFRHSKVRGAYCLGIFQNGKDPTTLLGGIVVRNTLVLYDREQTKIGFW 419
Query: 333 KENCSD 338
K NC++
Sbjct: 420 KTNCAE 425
>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
Length = 458
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 86/362 (23%), Positives = 154/362 (42%), Gaps = 50/362 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
+ R+ +GTP+K ++++DTGS+L + +F+PR SSS+ ++C P
Sbjct: 122 VTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPRSSSSYASVSCSAPQ 181
Query: 46 CTYFKCV---------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
C + C+Y Y D S + G+ + +T+S G +GC
Sbjct: 182 CDALTTATLNPSTCSTSNVCIYQASYGDSSFSVGYLSKDTVSF-----GSTSVPNFYYGC 236
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
DN G G AG++GL+R +S + QL + FSYCL + +
Sbjct: 237 GQDNEGLF-----GQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPT---SSSSSGYLSI 288
Query: 157 FGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
+ G + A ++ ++ Y++ + I++ + ++ + + IIDS
Sbjct: 289 GSYNPGQYSYTPMAKSSLD--DSLYFIKMTGITVAGKPLSVSASAYSSLPT-----IIDS 341
Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE- 275
G+V+T +DVY L + + A + C+ + R P ++ F
Sbjct: 342 GTVITRLPTDVYSALSKAVAGAMKGTPRASAFSI---LDTCFQGQASRLRVPQVSMAFAG 398
Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
A L++ N ++D ++ LA AP A+IG+ QQ+ VYD+ + F
Sbjct: 399 GAALKLKATN-LLVDVDSATTCLAFAPARS-AAIIGNTQQQTFSVVYDVKNSKIGFAAGG 456
Query: 336 CS 337
CS
Sbjct: 457 CS 458
>gi|125536523|gb|EAY83011.1| hypothetical protein OsI_38231 [Oryza sativa Indica Group]
Length = 469
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 82/328 (25%), Positives = 143/328 (43%), Gaps = 43/328 (13%)
Query: 28 FDPRKSSSFQKINCDHPDCTYFKCVNEQCV----------------YTMKYADQSV-TKG 70
F P S++F + C C + E C Y++ Y + T G
Sbjct: 134 FRPNGSATFSPLPCSSDMC--LPVLRETCGRAGAAANATAGARCDSYSLTYGGSAANTSG 191
Query: 71 FAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLG 130
+ A +T + G G +FGCS+ ++G A +GV+G+ R +S ISQL
Sbjct: 192 YLATDTFTF-----GATAVPGVVFGCSDASYGDFAGA-----SGVIGIGRGNLSLISQL- 240
Query: 131 SIIKKRFSYCLVIPLPNGEYTS-SYLKFGTDMGYRRPSTQATKFINHP--NNFYYLSLKD 187
+FSY L+ P + ++ S ++FG D + Q+T ++ +FYY++L
Sbjct: 241 --QFGKFSYQLLAPEATDDGSADSVIRFGDDAVPKTKRGQSTPLLSSTLYPDFYYVNLTG 298
Query: 188 ISIDNERMN-FPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQ 246
+ +D R++ P TFD+ +G GG I+ S + +TY Y + S R L
Sbjct: 299 VRVDGNRLDAIPAGTFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAAVAS---RIGLPA 355
Query: 247 LSDCPE-PIQLCYFLPETFN-RFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPH 303
++ + LCY + P + F+ A++ + N F ID + L + P
Sbjct: 356 VNGSAALELDLCYNASSMAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPS 415
Query: 304 DDLVALIGSQQQRDTRFVYDLNIDLLSF 331
+++G+ Q T +YD++ L+F
Sbjct: 416 QG-GSVLGTLLQTGTNMIYDVDAGRLTF 442
>gi|359476191|ref|XP_003631801.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 439
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 93/349 (26%), Positives = 149/349 (42%), Gaps = 51/349 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYAIFDPRKSSSFQKINCDHPDCTYFKCVNEQCVYTM 60
+V + GTP + LILDTGS++ + T K + Y M
Sbjct: 129 LVDVAFGTPPQNFTLILDTGSSITW---------------------TQCKACTVENNYNM 167
Query: 61 KYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSR 120
Y D S + G +T+++ E +F FG +N G D G + G+LGL +
Sbjct: 168 TYGDDSTSVGNYGCDTMTL----EPSDVFQKFQFGRGRNNKG---DFGSG-VDGMLGLGQ 219
Query: 121 VTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHP--- 177
+S +SQ S K FSYCL P + S L FG + S + T +N P
Sbjct: 220 GQLSTVSQTASKFNKVFSYCL----PEEDSIGSLL-FGEKATSQSSSLKFTSLVNGPGTL 274
Query: 178 --NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKF 235
+ +Y+++L DIS+ NER+N P F G IIDS +V+T Y L F
Sbjct: 275 QESGYYFVNLSDISVGNERLNIPSSVF-----ASPGTIIDSRTVITRLPQRAYSALKAAF 329
Query: 236 VSYFERFQLAQ-LSDCPEPIQLCYFLPETFN-RFPSMAFYF-EDANLRIDGENVFIIDYE 292
++ L+ + + CY L + P + +F A++R++G N+ E
Sbjct: 330 KKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGGGADVRLNGTNIVWGSDE 389
Query: 293 NHFFLL----AVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+ L + + + + +IG++QQ +YD+ + F CS
Sbjct: 390 SRLCLAFAGNSKSTMNPELTIIGNRQQLSLTVLYDIQGGRIGFRSNGCS 438
>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
Length = 442
Score = 90.5 bits (223), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 85/323 (26%), Positives = 139/323 (43%), Gaps = 53/323 (16%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI----------------FDPRKSSSFQKINCDHPD 45
V L +GTP + V ++LDTGS L + + F PR S +F + CD
Sbjct: 68 VSLAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASVPCDSAQ 127
Query: 46 C------TYFKC--VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
C + C ++QC ++ YAD S + G A E +V G+G + A FGC
Sbjct: 128 CRSRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVFTV---GQGPPLR--AAFGCM 182
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
FD A AG+LG++R +SF+SQ + +RFSYC+ + + L
Sbjct: 183 AT--AFDTSPDGVATAGLLGMNRGALSFVSQAST---RRFSYCI-----SDRDDAGVLLL 232
Query: 158 G-TDMGYR----RPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEGG 211
G +D+ + P Q + + + Y + L I + + + P +G G
Sbjct: 233 GHSDLPFLPLNYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQ 292
Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDC----PEPIQLCYFLPE---TF 264
++DSG+ T+ D Y L +F S + L L+D E C+ +P+
Sbjct: 293 TMVDSGTQFTFLLGDAYSALKAEF-SRQTKPWLPALNDPNFAFQEAFDTCFRVPQGRAPP 351
Query: 265 NRFPSMAFYFEDANLRIDGENVF 287
R P++ F A + + G+ +
Sbjct: 352 ARLPAVTLLFNGAQMTVAGDRLL 374
>gi|326501496|dbj|BAK02537.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 96/392 (24%), Positives = 151/392 (38%), Gaps = 68/392 (17%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA------------------IFDPRKSSSFQKINCDH 43
VR +GTP++ LL+ DTGS L + F P S ++ I+C
Sbjct: 96 VRFRVGTPAQPFLLVADTGSDLTWVKCRRPAANSSESGSGSGRAFRPEDSRTWAPISCAS 155
Query: 44 PDCT------YFKCVN--EQCVYTMKYADQSVTKGFAAHE--TISVIGKG--EGKAIFHG 91
CT C C Y +Y D S +G E TI++ G+G E KA G
Sbjct: 156 DTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSGRGREERKAKLKG 215
Query: 92 ALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT 151
+ GC++ G + DG VL L +SF S S RFSYCLV L +
Sbjct: 216 LVLGCTSSYTGPSFEVSDG----VLSLGYSDVSFASHAASRFAGRFSYCLVDHL-SPRNA 270
Query: 152 SSYLKFGTD---------------------MGYRRPSTQATKFINHP-NNFYYLSLKDIS 189
+SYL FG + R + Q ++ FY +++K +S
Sbjct: 271 TSYLTFGPNPAVASSSSPSSPAPASCTAAAPRPRPRARQTPLLLDRRMRPFYDVAVKAVS 330
Query: 190 IDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSD 249
+ + + P +D+ GG I+DSG+ LT Y + +
Sbjct: 331 VAGQFLKIPRAVWDVDAG--GGVILDSGTSLTVLAKPAYRAVVAALSEGLAGLPRVTM-- 386
Query: 250 CPEPIQLCYFL--PETFNRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAV--APHDD 305
+P + CY P P MA +F A ++ID + + P
Sbjct: 387 --DPFEYCYNWTSPSGDVTLPKMAVHFAGAARLEPPGKSYVIDAAPGVKCIGLQEGPWPG 444
Query: 306 LVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+++IG+ Q++ + +D+ L F + C+
Sbjct: 445 -ISVIGNILQQEHLWEFDIKNRRLKFQRSRCT 475
>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
Length = 478
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 90/379 (23%), Positives = 160/379 (42%), Gaps = 67/379 (17%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
++ +G+P K + +DTGS +++ +++D + SS+ + + C+
Sbjct: 76 TKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKNVGCE 135
Query: 43 HPDCTYF----KC-VNEQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGALF 94
C++ C + C Y + Y D S + G + I+ V G + +F
Sbjct: 136 DDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQEVVF 195
Query: 95 GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTS 152
GC + G D A+ G++G + S ISQL G K+ FS+CL G +
Sbjct: 196 GCGKNQSG-QLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNGGGIFAV 254
Query: 153 SYLKFGTDMGYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEGG 211
++ P + T + PN +Y + LK + +D + ++ PP + +G+GG
Sbjct: 255 GEVE--------SPVVKTTPIV--PNQVHYNVILKGMDVDGDPIDLPPSL--ASTNGDGG 302
Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMA 271
IIDSG+ L Y ++Y L EK + Q +L E F T FP +
Sbjct: 303 TIIDSGTTLAYLPQNLYNSLIEKITAK----QQVKLHMVQETFACFSFTSNTDKAFPVVN 358
Query: 272 FYFEDA------------NLRIDGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDT 318
+FED+ +LR E+++ +++ + D V L+G +
Sbjct: 359 LHFEDSLKLSVYPHDYLFSLR---EDMYCFGWQSG----GMTTQDGADVILLGDLVLSNK 411
Query: 319 RFVYDLNIDLLSFVKENCS 337
VYDL +++ + NCS
Sbjct: 412 LVVYDLENEVIGWADHNCS 430
>gi|115472519|ref|NP_001059858.1| Os07g0533800 [Oryza sativa Japonica Group]
gi|113611394|dbj|BAF21772.1| Os07g0533800 [Oryza sativa Japonica Group]
Length = 458
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 99/375 (26%), Positives = 158/375 (42%), Gaps = 54/375 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
++ L IGTP + I DTGS L++ +++P S +F+ + C
Sbjct: 98 IMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSS-- 155
Query: 46 CTYFKCVNEQ------------CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL 93
C E C Y Y T G ET + + G
Sbjct: 156 -ALNLCAAEARLAGATPPPGCACRYNQTYGT-GWTSGLQGSETFTFGSSPADQVRVPGIA 213
Query: 94 FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSS 153
FGCSN + D +G+ AG++GL R +S +SQL + + FSYCL P + + S+
Sbjct: 214 FGCSNAS----SDDWNGS-AGLVGLGRGGLSLVSQLAAGM---FSYCLT-PFQDTKSKST 264
Query: 154 YL--KFGTDMGYRRPSTQATKFINHPN-----NFYYLSLKDISIDNERMNFPPDTFDITV 206
L ++T F+ P+ +YYL+L IS+ + PP F +
Sbjct: 265 LLLGPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRA 324
Query: 207 SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF-- 264
G GG IIDSG+ +T D +K V + + S+ + LC+ LP +
Sbjct: 325 DGTGGLIIDSGTTITSL-VDAAYKRVRAAVRSLVKLPVTDGSNA-TGLDLCFALPSSSAP 382
Query: 265 -NRFPSMAFYF-EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVY 322
PSM +F A++ + EN I+D + L + D ++ +G+ QQ++ +Y
Sbjct: 383 PATLPSMTLHFGGGADMVLPVENYMILD-GGMWCLAMRSQTDGELSTLGNYQQQNLHILY 441
Query: 323 DLNIDLLSFVKENCS 337
D+ + LSF CS
Sbjct: 442 DVQKETLSFAPAKCS 456
>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 482
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 90/379 (23%), Positives = 160/379 (42%), Gaps = 67/379 (17%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
++ +G+P K + +DTGS +++ +++D + SS+ + + C+
Sbjct: 80 TKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKNVGCE 139
Query: 43 HPDCTYF----KC-VNEQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGALF 94
C++ C + C Y + Y D S + G + I+ V G + +F
Sbjct: 140 DDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQEVVF 199
Query: 95 GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTS 152
GC + G D A+ G++G + S ISQL G K+ FS+CL G +
Sbjct: 200 GCGKNQSG-QLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNGGGIFAV 258
Query: 153 SYLKFGTDMGYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEGG 211
++ P + T + PN +Y + LK + +D + ++ PP + +G+GG
Sbjct: 259 GEVE--------SPVVKTTPIV--PNQVHYNVILKGMDVDGDPIDLPPSL--ASTNGDGG 306
Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMA 271
IIDSG+ L Y ++Y L EK + Q +L E F T FP +
Sbjct: 307 TIIDSGTTLAYLPQNLYNSLIEKITAK----QQVKLHMVQETFACFSFTSNTDKAFPVVN 362
Query: 272 FYFEDA------------NLRIDGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDT 318
+FED+ +LR E+++ +++ + D V L+G +
Sbjct: 363 LHFEDSLKLSVYPHDYLFSLR---EDMYCFGWQSG----GMTTQDGADVILLGDLVLSNK 415
Query: 319 RFVYDLNIDLLSFVKENCS 337
VYDL +++ + NCS
Sbjct: 416 LVVYDLENEVIGWADHNCS 434
>gi|22831049|dbj|BAC15912.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508281|dbj|BAD32130.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 453
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 99/375 (26%), Positives = 158/375 (42%), Gaps = 54/375 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
++ L IGTP + I DTGS L++ +++P S +F+ + C
Sbjct: 93 IMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSS-- 150
Query: 46 CTYFKCVNEQ------------CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL 93
C E C Y Y T G ET + + G
Sbjct: 151 -ALNLCAAEARLAGATPPPGCACRYNQTYG-TGWTSGLQGSETFTFGSSPADQVRVPGIA 208
Query: 94 FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSS 153
FGCSN + D +G+ AG++GL R +S +SQL + + FSYCL P + + S+
Sbjct: 209 FGCSNAS----SDDWNGS-AGLVGLGRGGLSLVSQLAAGM---FSYCLT-PFQDTKSKST 259
Query: 154 YL--KFGTDMGYRRPSTQATKFINHPN-----NFYYLSLKDISIDNERMNFPPDTFDITV 206
L ++T F+ P+ +YYL+L IS+ + PP F +
Sbjct: 260 LLLGPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRA 319
Query: 207 SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF-- 264
G GG IIDSG+ +T D +K V + + S+ + LC+ LP +
Sbjct: 320 DGTGGLIIDSGTTITSL-VDAAYKRVRAAVRSLVKLPVTDGSNA-TGLDLCFALPSSSAP 377
Query: 265 -NRFPSMAFYF-EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVY 322
PSM +F A++ + EN I+D + L + D ++ +G+ QQ++ +Y
Sbjct: 378 PATLPSMTLHFGGGADMVLPVENYMILD-GGMWCLAMRSQTDGELSTLGNYQQQNLHILY 436
Query: 323 DLNIDLLSFVKENCS 337
D+ + LSF CS
Sbjct: 437 DVQKETLSFAPAKCS 451
>gi|125558632|gb|EAZ04168.1| hypothetical protein OsI_26310 [Oryza sativa Indica Group]
gi|125600539|gb|EAZ40115.1| hypothetical protein OsJ_24558 [Oryza sativa Japonica Group]
Length = 453
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 99/375 (26%), Positives = 158/375 (42%), Gaps = 54/375 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
++ L IGTP + I DTGS L++ +++P S +F+ + C
Sbjct: 93 IMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSS-- 150
Query: 46 CTYFKCVNEQ------------CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL 93
C E C Y Y T G ET + + G
Sbjct: 151 -ALNLCAAEARLAGATPPPGCACRYNQTYG-TGWTSGLQGSETFTFGSSPADQVRVPGIA 208
Query: 94 FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSS 153
FGCSN + D +G+ AG++GL R +S +SQL + + FSYCL P + + S+
Sbjct: 209 FGCSNAS----SDDWNGS-AGLVGLGRGGLSLVSQLAAGM---FSYCLT-PFQDTKSKST 259
Query: 154 YL--KFGTDMGYRRPSTQATKFINHPN-----NFYYLSLKDISIDNERMNFPPDTFDITV 206
L ++T F+ P+ +YYL+L IS+ + PP F +
Sbjct: 260 LLLGPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGAAALPIPPGAFALRA 319
Query: 207 SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF-- 264
G GG IIDSG+ +T D +K V + + S+ + LC+ LP +
Sbjct: 320 DGTGGLIIDSGTTITSL-VDAAYKRVRAAVRSLVKLPVTDGSNA-TGLDLCFALPSSSAP 377
Query: 265 -NRFPSMAFYF-EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVY 322
PSM +F A++ + EN I+D + L + D ++ +G+ QQ++ +Y
Sbjct: 378 PATLPSMTLHFGGGADMVLPVENYMILD-GGMWCLAMRSQTDGELSTLGNYQQQNLHILY 436
Query: 323 DLNIDLLSFVKENCS 337
D+ + LSF CS
Sbjct: 437 DVQKETLSFAPAKCS 451
>gi|195627138|gb|ACG35399.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 431
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 97/373 (26%), Positives = 148/373 (39%), Gaps = 59/373 (15%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA------------IFDPRKSSSFQKINCDHPDCTY 48
+VR +GTP + +LL LDT + ++ F P SSS+ + C C
Sbjct: 80 VVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSRFIPASSSSYASLPCASDWCPL 139
Query: 49 FK---CVNEQ--------CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
F+ C Q C ++ +AD S + +T+ + GK G FGC
Sbjct: 140 FEGQPCPANQDASAPLPACAFSKPFADTSFQASLGS-DTLRL-----GKDAIAGYAFGCV 193
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
G + G+LGL R +S +SQ GS FSYCL P Y S L+
Sbjct: 194 GAVAGPTTNLPK---QGLLGLGRGPMSLLSQTGSTYNGVFSYCL--PSYRSYYFSGSLRL 248
Query: 158 GTDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
G R + + T + +P+ + YY+++ +S+ + P +F + G +ID
Sbjct: 249 GAAGQPR--NVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVID 306
Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE 275
SG+V+T + + VY L E+F R Q+A S L F FN A
Sbjct: 307 SGTVITRWTAPVYAALREEF-----RRQVAAPSGY---TSLGAFD-TCFNTDEVAAGGAP 357
Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQ------------QQRDTRFVYD 323
L +DG + EN + P L Q QQ++ R V D
Sbjct: 358 PVTLHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVD 417
Query: 324 LNIDLLSFVKENC 336
+ + F +E C
Sbjct: 418 VAGSRVGFAREPC 430
>gi|359476197|ref|XP_003631803.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 414
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 86/296 (29%), Positives = 135/296 (45%), Gaps = 35/296 (11%)
Query: 57 VYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVL 116
Y M Y D+S + G +T+++ E +F FGC +N G D GA G+L
Sbjct: 138 TYNMTYGDKSTSVGNYGCDTMTL----EPSDVFPKFQFGCGRNNEG---DFGSGA-DGML 189
Query: 117 GLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINH 176
GL + +S +SQ S KK FSYC LP + S L FG + + S + T +N
Sbjct: 190 GLGQGQLSTVSQTASKFKKVFSYC----LPEEDSIGSLL-FG-EKATSQSSLKFTSLVNG 243
Query: 177 P-------NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYW 229
P + +Y++ L DIS+ N+R+N P F G IIDSG+V+T Y
Sbjct: 244 PGTSGLEESGYYFVKLLDISVGNKRLNVPSSVF-----ASPGTIIDSGTVITCLPQRAYS 298
Query: 230 KLHEKFVSYFERFQLAQ-LSDCPEPIQLCYFLPETFN-RFPSMAFYF-EDANLRIDGENV 286
L F ++ L+ + + CY L + P + +F E A++R++G+ V
Sbjct: 299 ALTAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGEGADVRLNGKRV 358
Query: 287 FIIDYENHFFLLAVAPH-----DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
I + LA A + + + +IG++QQ +YD+ + F CS
Sbjct: 359 -IWGNDASRLCLAFAGNSKSTMNSELTIIGNRQQVSLTVLYDIQGGRIGFGGNGCS 413
>gi|357116104|ref|XP_003559824.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 489
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 91/379 (24%), Positives = 148/379 (39%), Gaps = 70/379 (18%)
Query: 27 IFDPRKSSSFQKINCDHPDCTYF---KCVNEQCVYTMKYADQSVTKGFAAHETISV-IGK 82
IFDP+ S ++ + D P C + +C + +++ +++ G+ + + G
Sbjct: 117 IFDPKTSHRYKNVGHDDPLCKAPFTPRPTEHRCGFNIRFRAEAMATGYLGKDEFAFGAGS 176
Query: 83 GEGKAIFHGALFGCSNDNHGFDED---------------------------ARDG----- 110
G G +FGC++ +G++ A DG
Sbjct: 177 GSRTTNVDGLVFGCAHRINGWNNKDVLAGIPSLNRRPTSFVRQLSTHGGGGAVDGLVFGC 236
Query: 111 -----------ALAGVLGLSRVTISFISQL---GSIIKKRFSYCLV--IPLPNGEYTSSY 154
LAG+L L+R SF+ QL G RFSYCLV PN +
Sbjct: 237 AHAINGWKNQDVLAGILSLNRRPTSFVRQLSVHGGGTTPRFSYCLVDHKKYPN---KHGF 293
Query: 155 LKFGTDMGYRRPSTQATKFINHPNN---FYYLSLKDISIDNERM-NFPPDTFDI-TVSGE 209
L+FG D+ + P+ YY+ L +S+ ++ P F S
Sbjct: 294 LRFGADVPDHSHAQSTALLYGEPDGGFGMYYVRLVGVSVAGRKLTGITPKMFQRDRRSRL 353
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP-IQLCY--FLPETFNR 266
GGC +D G+ T F Y L ++ L + P P +LC PE +
Sbjct: 354 GGCYVDVGNPTTRFAEAPYDILEAGVAAHMASHGLHR---TPVPGHRLCVRGTSPEVMPK 410
Query: 267 FPSMAFYF---EDANLRIDGENVF-IIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVY 322
PS+ +F E A L I +F + + ++ + + +IG QQ DTRF +
Sbjct: 411 LPSITLHFAEDEAAGLEIKSRLLFATVKHAGADYVCFIVQRAPVTTVIGGHQQVDTRFTF 470
Query: 323 DLNIDLLSFVKENCSDDSA 341
DL + L F E+C D++
Sbjct: 471 DLEENRLFFAPEDCHGDAS 489
>gi|20160862|dbj|BAB89801.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 488
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 94/348 (27%), Positives = 142/348 (40%), Gaps = 44/348 (12%)
Query: 6 IGTPSKGVLLILDTGSALIY------AIFDPRKSSSFQKINCDHPDCTYFK-----CVNE 54
IGTP + V LD S L++ A F+P +S++ + C C F
Sbjct: 106 IGTPPQQVSGALDISSDLVWTACGATAPFNPVRSTTVADVPCTDDACQQFAPQTCGAGAS 165
Query: 55 QCVYTMKYADQSVTKGFAAHETISVIGKGE---GKAIFHGALFGCSNDNHG-FDEDARDG 110
+C YT Y G A T ++G G G +FGC N G F
Sbjct: 166 ECAYTYMY-------GGGAANTTGLLGTEAFTFGDTRIDGVVFGCGLKNVGDFS------ 212
Query: 111 ALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY-TSSYLKFGTDMGYRRPSTQ 169
++GV+GL R +S +SQL RFSY P+ T S++ FG D + T
Sbjct: 213 GVSGVIGLGRGNLSLVSQL---QVDRFSYHFA---PDDSVDTQSFILFGDDATPQTSHTL 266
Query: 170 ATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDI-TVSGEGGCIIDSGSVLTYFHSD 226
+T+ + N YY+ L I +D + + P TFD+ G GG + ++T
Sbjct: 267 STRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGGVFLSITDLVTVLEEA 326
Query: 227 VYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFE-DANLRIDGE 284
Y L + S + L ++ + LCY + PSMA F A + ++
Sbjct: 327 AYKPLRQAVAS---KIGLPAVNGSALGLDLCYTGESLAKAKVPSMALVFAGGAVMELELG 383
Query: 285 NVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLNIDLLSF 331
N F +D L + P +++GS Q T +YD+N L F
Sbjct: 384 NYFYMDSTTGLACLTILPSSAGDGSVLGSLIQVGTHMMYDINGSKLVF 431
>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 458
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 92/338 (27%), Positives = 144/338 (42%), Gaps = 56/338 (16%)
Query: 1 MVRLFIGTPSKGVLLILDTGSAL----------------IYAIFDPRKSSSFQKINCDHP 44
+V +G P L I+DTGS+L I+ +F+P SS+F + +CD
Sbjct: 97 LVNFSVGQPPVPQLTIMDTGSSLLWIQCQPCKHCSSDHMIHPVFNPALSSTFVECSCDDR 156
Query: 45 DCTYF---KC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
C Y C + +CVY Y + +KG A E ++ + FGC +N
Sbjct: 157 FCRYAPNGHCGSSNKCVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQPIAFGCGYEN 216
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
+ + G+LGL S QLGS +FSYC + L N Y + L G D
Sbjct: 217 G----EQLESHFTGILGLGAKPTSLAVQLGS----KFSYC-IGDLANKNYGYNQLVLGED 267
Query: 161 ---MGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
+G P T+ N+ YY++L+ IS+ + ++N P F G I+DSG
Sbjct: 268 ADILGDPTPIEFETE-----NSIYYMNLEGISVGDTQLNIEPVVFK-RRGPRTGVILDSG 321
Query: 218 SVLTYFHSDVYWKLHEKFVSY----FERFQLAQLSDCPEPIQLCYF--LPETFNRFPSMA 271
++ T+ Y +L+ + S ERF LCY + E FP +
Sbjct: 322 TLYTWLADIAYRELYNEIKSILDPKLERFWFRDF--------LCYHGRVSEELIGFPVVT 373
Query: 272 FYFE-DANLRIDGENVFI-IDYENHF--FLLAVAPHDD 305
F+F A L ++ ++F + N F F ++V P +
Sbjct: 374 FHFAGGAELAMEATSMFYPLSEPNTFNVFCMSVKPTKE 411
>gi|296089645|emb|CBI39464.3| unnamed protein product [Vitis vinifera]
Length = 477
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 98/392 (25%), Positives = 165/392 (42%), Gaps = 94/392 (23%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCD 42
++ IGTP++ + +DTGS +++ ++D ++S + + ++CD
Sbjct: 100 AKIGIGTPARDYYVQVDTGSDIMWVNCIQCNECPKKSSLGMELTLYDIKESLTGKLVSCD 159
Query: 43 HPDCTYFK------CV-NEQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGA 92
C C+ N C YT YAD S + G+ + + V G E +
Sbjct: 160 QDFCYAINGGPPSYCIANMSCSYTEIYADGSSSFGYFVRDIVQYDQVSGDLETTSANGSV 219
Query: 93 LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEY 150
+FGCS G + + + AL G+LG + S ISQL S ++K F++CL G +
Sbjct: 220 IFGCSATQSG--DLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLNGGGIF 277
Query: 151 TSSYLKFGTDMGYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGE 209
++ +P T + PN +Y +++K + + +N P D FD V +
Sbjct: 278 AIGHI--------VQPKVNTTPLV--PNQTHYNVNMKAVEVGGYFLNLPTDVFD--VGDK 325
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL-CYFLPETF-NRF 267
G IIDSG+ L Y VY +L K S+ ++ + D Q C+ E+ + F
Sbjct: 326 KGTIIDSGTTLAYLPEVVYDQLLSKIFSWQSDLKVHTIHD-----QFTCFQYSESLDDGF 380
Query: 268 PSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHD-----DLVALIGSQ----QQRDT 318
P++ F+FE++ L V PH+ D + IG Q Q RD
Sbjct: 381 PAVTFHFENS------------------LYLKVHPHEYLFSYDGLWCIGWQNSGMQSRDR 422
Query: 319 R--------------FVYDLNIDLLSFVKENC 336
R +YDL ++ + + NC
Sbjct: 423 RNITLLGDLALSNKLVLYDLENQVIGWTEYNC 454
>gi|190896584|gb|ACE96805.1| aspartyl protease [Populus tremula]
gi|190896586|gb|ACE96806.1| aspartyl protease [Populus tremula]
gi|190896588|gb|ACE96807.1| aspartyl protease [Populus tremula]
gi|190896590|gb|ACE96808.1| aspartyl protease [Populus tremula]
gi|190896592|gb|ACE96809.1| aspartyl protease [Populus tremula]
gi|190896594|gb|ACE96810.1| aspartyl protease [Populus tremula]
gi|190896596|gb|ACE96811.1| aspartyl protease [Populus tremula]
gi|190896598|gb|ACE96812.1| aspartyl protease [Populus tremula]
gi|190896600|gb|ACE96813.1| aspartyl protease [Populus tremula]
gi|190896602|gb|ACE96814.1| aspartyl protease [Populus tremula]
gi|190896604|gb|ACE96815.1| aspartyl protease [Populus tremula]
gi|190896606|gb|ACE96816.1| aspartyl protease [Populus tremula]
gi|190896610|gb|ACE96818.1| aspartyl protease [Populus tremula]
gi|190896612|gb|ACE96819.1| aspartyl protease [Populus tremula]
gi|190896614|gb|ACE96820.1| aspartyl protease [Populus tremula]
gi|190896616|gb|ACE96821.1| aspartyl protease [Populus tremula]
gi|190896618|gb|ACE96822.1| aspartyl protease [Populus tremula]
gi|190896620|gb|ACE96823.1| aspartyl protease [Populus tremula]
gi|190896622|gb|ACE96824.1| aspartyl protease [Populus tremula]
gi|190896624|gb|ACE96825.1| aspartyl protease [Populus tremula]
gi|190896626|gb|ACE96826.1| aspartyl protease [Populus tremula]
gi|190896628|gb|ACE96827.1| aspartyl protease [Populus tremula]
gi|190896630|gb|ACE96828.1| aspartyl protease [Populus tremula]
gi|190896632|gb|ACE96829.1| aspartyl protease [Populus tremula]
gi|190896634|gb|ACE96830.1| aspartyl protease [Populus tremula]
gi|190896636|gb|ACE96831.1| aspartyl protease [Populus tremula]
gi|190896638|gb|ACE96832.1| aspartyl protease [Populus tremula]
gi|190896640|gb|ACE96833.1| aspartyl protease [Populus tremula]
gi|190896642|gb|ACE96834.1| aspartyl protease [Populus tremula]
gi|190896644|gb|ACE96835.1| aspartyl protease [Populus tremula]
gi|190896646|gb|ACE96836.1| aspartyl protease [Populus tremula]
gi|190896648|gb|ACE96837.1| aspartyl protease [Populus tremula]
gi|190896650|gb|ACE96838.1| aspartyl protease [Populus tremula]
gi|190896652|gb|ACE96839.1| aspartyl protease [Populus tremula]
gi|190896654|gb|ACE96840.1| aspartyl protease [Populus tremula]
gi|190896656|gb|ACE96841.1| aspartyl protease [Populus tremula]
gi|190896658|gb|ACE96842.1| aspartyl protease [Populus tremula]
Length = 339
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 80/307 (26%), Positives = 131/307 (42%), Gaps = 39/307 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY-----------AIFDPRKSSSFQKINCDHPDCTY- 48
+VR+ +GTP + + ++LDT + + F P S++ ++C C+
Sbjct: 46 VVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGCSSTTFLPNASTTLGSLDCSEAQCSQV 105
Query: 49 --FKCV---NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
F C + C++ Y S + I++ + G FGC N G
Sbjct: 106 RGFSCPATGSSACLFNQSYGGDSSLAATLVQDAITL-----ANDVIPGFTFGCINAVSGG 160
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
+ G+LGL R IS ISQ G++ FSYCL P Y S LK G +G
Sbjct: 161 SIPPQ-----GLLGLGRGPISLISQAGAMYSGVFSYCL--PSFKSYYFSGSLKLG-PVGQ 212
Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
+ S + T + +P+ + YY++L +S+ ++ P + + G IIDSG+V+T
Sbjct: 213 PK-SIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVIT 271
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRI 281
F VY+ + ++F R Q+ C F P++ +FE NL +
Sbjct: 272 RFVQPVYFAIRDEF-----RKQVNGPISSLGAFDTC-FAATNEAEAPAVTLHFEGLNLVL 325
Query: 282 DGENVFI 288
EN I
Sbjct: 326 PMENSLI 332
>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
Length = 436
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 96/363 (26%), Positives = 148/363 (40%), Gaps = 61/363 (16%)
Query: 11 KGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYF------- 49
K + LI+DTGS L + ++DP SSS++ + C+ C
Sbjct: 96 KNMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNS 155
Query: 50 -------KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
V C Y + Y D S T+G A E+I + G +FGC +N G
Sbjct: 156 GPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILL-----GDTKLENFVFGCGRNNKG 210
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
+ R ++S +SQ FSYCL L +G S L FG D
Sbjct: 211 LFGGSSGLMGL-----GRSSVSLVSQTLKTFNGVFSYCLP-SLEDG--ASGSLSFGNDSS 262
Query: 163 YRRPSTQA--TKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
ST T + +P +FY L+L SI + + S G +IDSG+
Sbjct: 263 VYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGGVELK--------SSSFGRGILIDSGT 314
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFE-D 276
V+T +Y + +F+ F F A + C+ L + P + F+ +
Sbjct: 315 VITRLPPSIYKAVKIEFLKQFSGFPTAPGYSI---LDTCFNLTSYEDISIPIIKMIFQGN 371
Query: 277 ANLRIDGENVF-IIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
A L +D VF + + LA+A +++ V +IG+ QQ++ R +YD + L V
Sbjct: 372 AELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIVG 431
Query: 334 ENC 336
ENC
Sbjct: 432 ENC 434
>gi|414878073|tpg|DAA55204.1| TPA: hypothetical protein ZEAMMB73_344109 [Zea mays]
Length = 440
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 94/375 (25%), Positives = 154/375 (41%), Gaps = 54/375 (14%)
Query: 5 FIGTPSKGVLLILDTGSALIY----------------AIFDPRKSSSFQKINCDHPDC-- 46
IG P + I+DTGS LI+ + +DP +S + + + C+ C
Sbjct: 76 LIGDPPQQAEAIIDTGSNLIWTQCSTCQPAGCFSQNLSFYDPSRSRTARPVACNDTACAL 135
Query: 47 -TYFKCV--NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
+ +C N+ C Y V G E + + E ++ FGC
Sbjct: 136 GSETRCARDNKACAVLTAYG-AGVIGGVLGTEAFTFQPQSENVSL----AFGCIAATR-L 189
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
+ DGA +G++GL R +S +SQLG +FSYCL P + +S L G G
Sbjct: 190 TPGSLDGA-SGIIGLGRGNLSLVSQLG---DNKFSYCLT-PYFSQSTNTSRLFVGASAGL 244
Query: 164 RRPSTQATK--FINHPN-----NFYYLSLKDISIDNERMNFPPDTFDITVSGEG---GCI 213
AT F+ +P+ FYYL L I++ + ++ P FD+ G G +
Sbjct: 245 SSGGAPATSVPFLKNPDVDPFSTFYYLPLTGITVGDAKLAVPEAAFDLRQVATGLWAGTL 304
Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP--ETFNRFPSMA 271
IDSGS T Y L ++ V + + E + LC + + P +
Sbjct: 305 IDSGSPFTSLVDVAYQALRDELVQQLGASIVPPPAGA-EGLDLCAAVAHGDVGKLVPPLV 363
Query: 272 FYFED--ANLRIDGENVF-IIDYENHFFLL--AVAPHDDL----VALIGSQQQRDTRFVY 322
+F ++ + EN + +D ++ + P+ L +IG+ Q+D +Y
Sbjct: 364 LHFGSGGGDVAVPPENYWGPVDDSTACMVVFSSGGPNSTLPMNETTIIGNYMQQDMHLLY 423
Query: 323 DLNIDLLSFVKENCS 337
DL +LSF +CS
Sbjct: 424 DLEKGMLSFQPADCS 438
>gi|242117573|dbj|BAH80056.1| hypothetical protein [Oryza sativa Indica Group]
Length = 469
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 81/328 (24%), Positives = 143/328 (43%), Gaps = 43/328 (13%)
Query: 28 FDPRKSSSFQKINCDHPDCTYFKCVNEQCV----------------YTMKYADQSV-TKG 70
F P S++F + C C + E C Y++ Y + T G
Sbjct: 134 FRPNGSATFSPLPCSSDMC--LPVLRETCGRAGAAANATAGARCDSYSLTYGGSAANTSG 191
Query: 71 FAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLG 130
+ A +T + G G +FGCS+ ++G A +GV+G+ R +S ISQL
Sbjct: 192 YLATDTFTF-----GATAVPGVVFGCSDASYGDFAGA-----SGVIGIGRGNLSLISQL- 240
Query: 131 SIIKKRFSYCLVIPLPNGEYTS-SYLKFGTDMGYRRPSTQATKFINHP--NNFYYLSLKD 187
+FSY L+ P + ++ S ++FG D + ++T ++ +FYY++L
Sbjct: 241 --QFGKFSYQLLAPEATDDGSADSVIRFGDDAVPKTKRGRSTPLLSSTLYPDFYYVNLTG 298
Query: 188 ISIDNERMN-FPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQ 246
+ +D R++ P TFD+ +G GG I+ S + +TY Y + S R L
Sbjct: 299 VRVDGNRLDAIPAGTFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAAVAS---RIGLPA 355
Query: 247 LSDCPE-PIQLCYFLPETFN-RFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPH 303
++ + LCY + P + F+ A++ + N F ID + L + P
Sbjct: 356 VNGSAALELDLCYNASSMAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPS 415
Query: 304 DDLVALIGSQQQRDTRFVYDLNIDLLSF 331
+++G+ Q T +YD++ L+F
Sbjct: 416 QG-GSVLGTLLQTGTNMIYDVDAGRLTF 442
>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 488
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 97/374 (25%), Positives = 164/374 (43%), Gaps = 63/374 (16%)
Query: 6 IGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDHPDC 46
IGTP K + +DTGS +++ ++DP+ SSS ++CD C
Sbjct: 89 IGTPPKQYHVQVDTGSDILWVNCISCNKCPRKSDLGIDLRLYDPKGSSSGSTVSCDQKFC 148
Query: 47 --TYFK----CV-NEQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGALFGC 96
TY C N C Y++ Y D S T G+ +++ V G G+ + +FGC
Sbjct: 149 AATYGGKLPGCAKNIPCEYSVMYGDGSSTTGYFVSDSLQYNQVSGDGQTRHANASVIFGC 208
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYTSSY 154
G D + + AL G++G + S +SQL + +KK FS+CL G +
Sbjct: 209 GA-QQGGDLGSTNQALDGIIGFGQSNTSMLSQLAAAGEVKKIFSHCLDTIKGGGIFAIGD 267
Query: 155 LKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEG-GC 212
+ +P ++T + + P+ Y ++L+ I++ + P F+ +GE G
Sbjct: 268 V--------VQPKVKSTPLVPDMPH--YNVNLESINVGGTTLQLPSHMFE---TGEKKGT 314
Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAF 272
IIDSG+ LTY VY + + F + + + + YF + FP + F
Sbjct: 315 IIDSGTTLTYLPELVY---KDVLAAVFAKHPDTTFHSVQDFLCIQYF-QSVDDGFPKITF 370
Query: 273 YFE-DANLRI--------DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYD 323
+FE D L + +G+N++ ++N L D+V L+G + VYD
Sbjct: 371 HFEDDLGLNVYPHDYFFQNGDNLYCFGFQNGG--LQSKDGKDMV-LLGDLVLSNKVVVYD 427
Query: 324 LNIDLLSFVKENCS 337
L ++ + NCS
Sbjct: 428 LENQVVGWTDYNCS 441
>gi|125528511|gb|EAY76625.1| hypothetical protein OsI_04577 [Oryza sativa Indica Group]
Length = 492
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 94/352 (26%), Positives = 143/352 (40%), Gaps = 48/352 (13%)
Query: 6 IGTPSKGVLLILDTGSALIY------AIFDPRKSSSFQKINCDHPDCTYFK--------- 50
IGTP + V LD S L++ A F+P +S++ + C C F
Sbjct: 106 IGTPPQQVSGALDISSDLVWTACGATAPFNPVRSTTVADVPCTDDACQQFAPQTCGAGAG 165
Query: 51 CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGE---GKAIFHGALFGCSNDNHG-FDED 106
+ +C YT Y G A T ++G G G +FGC N G F
Sbjct: 166 AGSSECAYTYMY-------GGGAANTTGLLGTEAFTFGDTRIDGVVFGCGLQNVGDFS-- 216
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY-TSSYLKFGTDMGYRR 165
++GV+GL R +S +SQL RFSY P+ T S++ FG D +
Sbjct: 217 ----GVSGVIGLGRGNLSLVSQL---QVDRFSYHFA---PDDSVDTQSFILFGDDATPQT 266
Query: 166 PSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDI-TVSGEGGCIIDSGSVLTY 222
T +T+ + N YY+ L I +D + + P TFD+ G GG + ++T
Sbjct: 267 SHTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGGVFLSITDLVTV 326
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFE-DANLR 280
Y L + S + L ++ + LCY + PSMA F A +
Sbjct: 327 LEEAAYKPLRQAVAS---KIGLPAVNGSALGLDLCYTGESLAKAKVPSMALVFAGGAVME 383
Query: 281 IDGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLNIDLLSF 331
++ N F +D L + P +++GS Q T +YD+N L F
Sbjct: 384 LELGNYFYMDSTTGLACLTILPSSAGDGSVLGSLIQVGTHMMYDINGSKLVF 435
>gi|190896608|gb|ACE96817.1| aspartyl protease [Populus tremula]
Length = 339
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 80/307 (26%), Positives = 131/307 (42%), Gaps = 39/307 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY-----------AIFDPRKSSSFQKINCDHPDCTY- 48
+VR+ +GTP + + ++LDT + + F P S++ ++C C+
Sbjct: 46 VVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGCSSTTFLPNASTTLGSLDCSEAQCSQV 105
Query: 49 --FKCV---NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
F C + C++ Y S + I++ + G FGC N G
Sbjct: 106 RGFSCPATGSSACLFNQSYGGDSSLAATLVQDAITL-----ANDVIPGFTFGCINAVSGG 160
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
+ G+LGL R IS ISQ G++ FSYCL P Y S LK G +G
Sbjct: 161 SIPPQ-----GLLGLGRGPISLISQAGAMYSGVFSYCL--PSFKSYYFSGSLKLG-PVGQ 212
Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
+ S + T + +P+ + YY++L +S+ ++ P + + G IIDSG+V+T
Sbjct: 213 PK-SIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVIT 271
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRI 281
F VY+ + ++F R Q+ C F P++ +FE NL +
Sbjct: 272 RFVQPVYFAIRDEF-----RKQVNGPISSLGAFDTC-FAETNEAEAPAVTLHFEGLNLVL 325
Query: 282 DGENVFI 288
EN I
Sbjct: 326 PMENSLI 332
>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 521
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 96/374 (25%), Positives = 145/374 (38%), Gaps = 62/374 (16%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY----------------AIFDPRKSSSFQKINCDHP 44
+V L IGTP+ +++DTGS L + +FDP SSS+ + CD
Sbjct: 172 VVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKDPLFDPSSSSSYASVPCDSD 231
Query: 45 DCTYFK-------CVNEQ------CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHG 91
C C C Y ++Y +++ T G + ET++ + G A F
Sbjct: 232 ACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYSTETLT-LKPGVVVADFG- 289
Query: 92 ALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT 151
FGC + HG E G+LGL S +SQ S FSYC LP
Sbjct: 290 --FGCGDHQHGPYEK-----FDGLLGLGGAPESLVSQTSSQFGGPFSYC----LPPTSGG 338
Query: 152 SSYLKFGTDMGYRRPSTQATKFINHP-------NNFYYLSLKDISIDNERMNFPPDTFDI 204
+ +L G ST A+ P FY ++L IS+ + PP F
Sbjct: 339 AGFLTLGAPPNSSS-STAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIPPSAF-- 395
Query: 205 TVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPET 263
G +IDSG+V+T + Y L F S ++L S+ + CY F
Sbjct: 396 ----SSGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNG-GVLDTCYDFTGHA 450
Query: 264 FNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVY 322
P+++ F A + + ++D A A D+ + +IG+ QR +Y
Sbjct: 451 NVTVPTISLTFSGGATIDLAAPAGVLVD---GCLAFAGAGTDNAIGIIGNVNQRTFEVLY 507
Query: 323 DLNIDLLSFVKENC 336
D + F C
Sbjct: 508 DSGKGTVGFRAGAC 521
>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
Length = 441
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 96/374 (25%), Positives = 145/374 (38%), Gaps = 62/374 (16%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
+V L IGTP+ +++DTGS L + +FDP SSS+ + CD
Sbjct: 92 VVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKDPLFDPSSSSSYASVPCDSD 151
Query: 45 DCTYFK-------CVNEQ------CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHG 91
C C C Y ++Y +++ T G + ET++ + G A F
Sbjct: 152 ACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYSTETLT-LKPGVVVADFG- 209
Query: 92 ALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT 151
FGC + HG E G+LGL S +SQ S FSYC LP
Sbjct: 210 --FGCGDHQHGPYEK-----FDGLLGLGGAPESLVSQTSSQFGGPFSYC----LPPTSGG 258
Query: 152 SSYLKFGTDMGYRRPSTQATKFINHP-------NNFYYLSLKDISIDNERMNFPPDTFDI 204
+ +L G ST A+ P FY ++L IS+ + PP F
Sbjct: 259 AGFLTLGAPPNSSS-STAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIPPSAF-- 315
Query: 205 TVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPET 263
G +IDSG+V+T + Y L F S ++L S+ + CY F
Sbjct: 316 ----SSGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNG-GVLDTCYDFTGHA 370
Query: 264 FNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVY 322
P+++ F A + + ++D A A D+ + +IG+ QR +Y
Sbjct: 371 NVTVPTISLTFSGGATIDLAAPAGVLVD---GCLAFAGAGTDNAIGIIGNVNQRTFEVLY 427
Query: 323 DLNIDLLSFVKENC 336
D + F C
Sbjct: 428 DSGKGTVGFRAGAC 441
>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
Length = 435
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 92/379 (24%), Positives = 156/379 (41%), Gaps = 61/379 (16%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI-------------FDPRKSSSFQKINC------- 41
V L +GTP + V ++LDTGS L + + F PR S++F + C
Sbjct: 63 VSLAVGTPPQNVTMVLDTGSELSWLLCATGRAAAAAADSFRPRASATFAAVPCGSARCSS 122
Query: 42 -DHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
D P + +C ++ YAD S + G A + +V G A + FGC +
Sbjct: 123 RDLPAPPSCDAASRRCRVSLSYADGSASDGALATDVFAV-----GDAPPLRSAFGCMSAA 177
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG-T 159
+ DA A AG+LG++R +SF++Q + +RFSYC+ + + L G +
Sbjct: 178 YDSSPDAV--ATAGLLGMNRGALSFVTQAST---RRFSYCI-----SDRDDAGVLLLGHS 227
Query: 160 DMGYR----RPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
D+ + P Q T + + + Y + L I + + + PP +G G ++
Sbjct: 228 DLPFLPLNYTPLYQPTPPLPYFDRVAYSVQLLGIRVGGKPLPIPPSVLAPDHTGAGQTMV 287
Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP----EPIQLCYFLPE----TFNR 266
DSG+ T+ D Y + +F+ + L L D E C+ +P+ R
Sbjct: 288 DSGTQFTFLLGDAYSAVKAEFLKQTKPL-LPALEDPSFAFQEAFDTCFRVPKGRPPPSAR 346
Query: 267 FPSMAFYFEDANLRIDGEN-VFIIDYENH----FFLLAVAPHDDLVAL----IGSQQQRD 317
P + F A + + G+ ++ + E + L + D+V L IG Q +
Sbjct: 347 LPPVTLLFNGAQMSVAGDRLLYKVPGERRGADGVWCLTFG-NADMVPLTAYVIGHHHQMN 405
Query: 318 TRFVYDLNIDLLSFVKENC 336
YDL + C
Sbjct: 406 LWVEYDLERGRVGLAPVKC 424
>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 641
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 91/368 (24%), Positives = 148/368 (40%), Gaps = 56/368 (15%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
RLFIGTP + LI+DTGS + Y F P SS+++ + C+ P C
Sbjct: 90 TRLFIGTPPQEFALIVDTGSTVTYVPCSTCEQCGKHQDPRFQPESSSTYKPMQCN-PSC- 147
Query: 48 YFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG--F 103
C +E QC Y +YA+ S + G A + +S E + A+FGC G F
Sbjct: 148 --NCDDEGKQCTYERRYAEMSSSSGLLAEDVLSF--GNESELTPQRAIFGCETVETGELF 203
Query: 104 DEDARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCL-VIPLPNGEYTSSYLKFGTD 160
+ A G++GL R +S + QL ++ FS C + + G + D
Sbjct: 204 SQRAD-----GIMGLGRGPLSVVDQLVIKEVVGNSFSLCYGGMDVVGGAMVLGNIPPPPD 258
Query: 161 M--GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
M + P A +Y + LK++ + +R+ P FD G+ G ++DSG+
Sbjct: 259 MVFAHSDPYRSA---------YYNIELKELHVAGKRLKLNPRVFD----GKHGTVLDSGT 305
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF-----LPETFNRFPSMAFY 273
Y + + + + + + D P +C+ + + FP +
Sbjct: 306 TYAYLPEEAFVAFKDAIIKEIKFLKQIHGPD-PSYNDICFSGAGRDVSQLSKIFPEVNMV 364
Query: 274 FEDAN-LRIDGENVFI--IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
F + L + EN + L D L+G R+T YD + D +
Sbjct: 365 FGNGQKLSLSPENYLFRHTKVSGAYCLGIFQNGKDPTTLLGGIVVRNTLVTYDRDNDKIG 424
Query: 331 FVKENCSD 338
F K NCS+
Sbjct: 425 FWKTNCSE 432
>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
gi|194700872|gb|ACF84520.1| unknown [Zea mays]
Length = 351
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 80/325 (24%), Positives = 130/325 (40%), Gaps = 49/325 (15%)
Query: 28 FDPRKSSSFQKINCDHPDCTYFK-----CVNEQCVYTMKYADQSVTKGFAAHETISVIGK 82
+DP +S + +C P CT C N QC Y ++Y D S T G + +++
Sbjct: 60 YDPSRSPTSAAFSCSSPTCTALGPYANGCANNQCQYLVRYPDGSSTSGAYIADLLTL--- 116
Query: 83 GEGKAIFHGALFGCSNDNHG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL 141
G A+ G FGCS+ G FD A AG++ L S +SQ S FSYC
Sbjct: 117 DAGNAV-SGFKFGCSHAEQGSFDARA-----AGIMALGGGPESLLSQTASRYGNAFSYC- 169
Query: 142 VIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHP-------NNFYYLSLKDISIDNER 194
+P S + G P +++++ P FY + L+ I++ +R
Sbjct: 170 ---IPATASDSGFFTLGV------PRRASSRYVVTPMVRFRQAATFYGVLLRTITVGGQR 220
Query: 195 MNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP- 253
+ P F G ++DS + +T Y L F S ++ A P+
Sbjct: 221 LGVAPAVF------AAGSVLDSRTAITRLPPTAYQALRAAFRSSMTMYRSAP----PKGY 270
Query: 254 IQLCYFLPETFN-RFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIG 311
+ CY N R P ++ F+ +A L +D + N D + ++G
Sbjct: 271 LDTCYDFTGVVNIRLPKISLVFDRNAVLPLDPSGILF----NDCLAFTSNADDRMPGVLG 326
Query: 312 SQQQRDTRFVYDLNIDLLSFVKENC 336
S QQ+ +YD+ + F + C
Sbjct: 327 SVQQQTIEVLYDVGGGAVGFRQGAC 351
>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 439
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 97/360 (26%), Positives = 167/360 (46%), Gaps = 37/360 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
++R +G+P VL I+DTGS +++ IFDP KS +++ + C C
Sbjct: 92 LMRYSVGSPPFQVLGIVDTGSDILWLQCEPCEDCYKQTTPIFDPSKSKTYKTLPCSSNTC 151
Query: 47 TYFK---CVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAI-FHGALFGCSNDNH 101
+ C ++ C Y++ Y D S + G + ET++ +G +G ++ F + GC ++N
Sbjct: 152 ESLRNTACSSDNVCEYSIDYGDGSHSDGDLSVETLT-LGSTDGSSVHFPKTVIGCGHNNG 210
Query: 102 G-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
G F E+ G +S ++ S G +FSYCL P+ + +SS L FG
Sbjct: 211 GTFQEEGSGIVGLGGGPVSLISQLSSSIGG-----KFSYCLA-PIFSESNSSSKLNFGDA 264
Query: 161 MGYRRPSTQATKFINHPNN---FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
T +T P N FY+L+L+ S+ + R+ F + + SG+G IIDSG
Sbjct: 265 AVVSGRGTVSTPL--DPLNGQVFYFLTLEAFSVGDNRIEFSGSSSSGSGSGDGNIIIDSG 322
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDA 277
+ LT + Y L E VS + +L + D + + LCY P + +F+ A
Sbjct: 323 TTLTLLPQEDYLNL-ESAVS--DVIKLERARDPSKLLSLCYKTTSDELDLPVITAHFKGA 379
Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
++ ++ + F + E A + A+ G+ Q++ YDL +SF +C+
Sbjct: 380 DVELNPISTF-VPVEKGVVCFAFI-SSKIGAIFGNLAQQNLLVGYDLVKKTVSFKPTDCT 437
>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
Length = 481
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 80/324 (24%), Positives = 128/324 (39%), Gaps = 47/324 (14%)
Query: 28 FDPRKSSSFQKINCDHPDCTYFK-----CVNEQCVYTMKYADQSVTKGFAAHETISVIGK 82
+DP +S S +C P CT C N QC Y ++Y D S T G + +++
Sbjct: 190 YDPSRSPSSAPFSCSSPTCTALGPYANGCANNQCQYLVRYPDGSSTSGAYIADLLTL--- 246
Query: 83 GEGKAIFHGALFGCSNDNHG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL 141
G A+ G FGCS+ G FD A AG++ L S +SQ S FSYC
Sbjct: 247 DAGNAV-SGFKFGCSHAEQGSFDARA-----AGIMALGGGPESLLSQTASRYGNAFSYC- 299
Query: 142 VIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHP-------NNFYYLSLKDISIDNER 194
+P S + G P +++++ P FY + L+ I++ +R
Sbjct: 300 ---IPATASDSGFFTLGV------PRRASSRYVVTPMVRFRQAATFYGVLLRTITVGGQR 350
Query: 195 MNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPI 254
+ P F G ++DS + +T Y L F S ++ A +
Sbjct: 351 LGVAPAVF------AAGSVLDSRTAITRLPPTAYQALRSAFRSSMTMYRSAPPKGY---L 401
Query: 255 QLCYFLPETFN-RFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGS 312
CY N R P ++ F+ +A L +D + N D + ++GS
Sbjct: 402 DTCYDFTGVVNIRLPKISLVFDRNAVLPLDPSGILF----NDCLAFTSNADDRMPGVLGS 457
Query: 313 QQQRDTRFVYDLNIDLLSFVKENC 336
QQ+ +YD+ + F + C
Sbjct: 458 VQQQTIEVLYDVGGGAVGFRQGAC 481
>gi|413946455|gb|AFW79104.1| hypothetical protein ZEAMMB73_209101 [Zea mays]
Length = 480
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 96/378 (25%), Positives = 152/378 (40%), Gaps = 54/378 (14%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPDC 46
VR +GTP++ +L+ DTGS L + +F S S+ I C C
Sbjct: 114 VRFRVGTPAQPFVLVADTGSDLTWVKCSGAGDGTGDAPRRVFRAAASRSWAPIACSSDTC 173
Query: 47 TYF------KCVN--EQCVYTMKYADQSVTKGFAAHE--TISVIGK-----GEGKAIFHG 91
T + C + C Y +Y D S +G + TI++ G G +A G
Sbjct: 174 TSYVPFSLANCSSPASPCAYDYRYNDGSAARGVVGTDSATIALSGSESRDGGGRRAKLQG 233
Query: 92 ALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT 151
+ GC+ G + DG VL L ISF S+ + RFSYCLV L T
Sbjct: 234 VVLGCTASYDGQSFQSSDG----VLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRNAT 289
Query: 152 SSYLKF---GTDMGYRRPSTQATKFINHP-------NNFYYLSLKDISIDNERMNFPPDT 201
SYL F G + G S+ ++ P + FY +++ + + E ++ P D
Sbjct: 290 -SYLTFGPPGPEGGAAASSSSSSAAARTPLLLDRRMSPFYAVAVDAVHVAGEALDIPADV 348
Query: 202 FDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQ-LAQLSDCPEPIQLCYFL 260
+D+ GG I+DSG+ LT + Y + ER L ++S +P + CY
Sbjct: 349 WDVARG--GGAILDSGTSLTVLATPAY---RAVVAALSERLAGLPRVSM--DPFEYCYNW 401
Query: 261 PETFNRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHD-DLVALIGSQQQRDTR 319
P + F + +++D + V V++IG+ Q+D
Sbjct: 402 TAAALEIPGLEVRFAGSARLQPPAKSYVVDAAPGVKCIGVQEGAWPGVSVIGNILQQDHL 461
Query: 320 FVYDLNIDLLSFVKENCS 337
+ +DL L F C+
Sbjct: 462 WEFDLRDRWLRFKHTRCA 479
>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 478
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 90/367 (24%), Positives = 141/367 (38%), Gaps = 60/367 (16%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA-----------------IFDPRKSSSFQKINCDH 43
+V +GTP + +DTGS L + +FDP +SSS+ + C
Sbjct: 141 VVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFDPAQSSSYAAVPCGG 200
Query: 44 PDCTYFKCVNEQCV------YTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
P C Y + Y D S T G + +T+++ + G FGC
Sbjct: 201 PVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTL----SASSAVQGFFFGCG 256
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
+ G + G+LGL R S + Q FSYC LP T+ YL
Sbjct: 257 HAQSGLFN-----GVDGLLGLGREQPSLVEQTAGTYGGVFSYC----LPTKPSTAGYLTL 307
Query: 158 G-TDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
G P T+ + PN +Y + L IS+ ++++ P F GG ++
Sbjct: 308 GLGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAF------AGGTVV 361
Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFY 273
D+G+V+T Y L F S + + + CY F P++A
Sbjct: 362 DTGTVITRLPPTAYAALRSAFRSGMASYGY-PTAPSNGILDTCYNFAGYGTVTLPNVALT 420
Query: 274 FEDANLRIDGENVFIIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFVYDLNIDLLS- 330
F + G + + F LA AP D +A++G+ QQR +++ ID S
Sbjct: 421 FGSGATVMLGADGIL-----SFGCLAFAPSGSDGGMAILGNVQQRS----FEVRIDGTSV 471
Query: 331 -FVKENC 336
F +C
Sbjct: 472 GFKPSSC 478
>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 484
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 96/363 (26%), Positives = 148/363 (40%), Gaps = 61/363 (16%)
Query: 11 KGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYF------- 49
K + LI+DTGS L + ++DP SSS++ + C+ C
Sbjct: 144 KNMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNS 203
Query: 50 -------KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
V C Y + Y D S T+G A E+I + G +FGC +N G
Sbjct: 204 GPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILL-----GDTKLENFVFGCGRNNKG 258
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
+ R ++S +SQ FSYCL L +G S L FG D
Sbjct: 259 LFGGSSGLMGL-----GRSSVSLVSQTLKTFNGVFSYCLP-SLEDG--ASGSLSFGNDSS 310
Query: 163 YRRPSTQA--TKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
ST T + +P +FY L+L SI + + S G +IDSG+
Sbjct: 311 VYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGGVELK--------SSSFGRGILIDSGT 362
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFE-D 276
V+T +Y + +F+ F F A + C+ L + P + F+ +
Sbjct: 363 VITRLPPSIYKAVKIEFLKQFSGFPTAPGYSI---LDTCFNLTSYEDISIPIIKMIFQGN 419
Query: 277 ANLRIDGENVF-IIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
A L +D VF + + LA+A +++ V +IG+ QQ++ R +YD + L V
Sbjct: 420 AELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIVG 479
Query: 334 ENC 336
ENC
Sbjct: 480 ENC 482
>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 634
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 95/373 (25%), Positives = 150/373 (40%), Gaps = 66/373 (17%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
RL+IGTP + LI+DTGS + Y F P SS++Q + C DC
Sbjct: 86 TRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPESSSTYQPVKCTI-DC- 143
Query: 48 YFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
C ++ QCVY +YA+ S + G + IS + E A+FGC N G
Sbjct: 144 --NCDSDRMQCVYERQYAEMSTSSGVLGEDLISFGNQSELAP--QRAVFGCENVETG--- 196
Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM-- 161
D G++GL R +S + QL ++I FS C Y + G +
Sbjct: 197 DLYSQHADGIMGLGRGDLSIMDQLVDKNVISDSFSLC---------YGGMDVGGGAMVLG 247
Query: 162 GYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
G PS A + + + YY + LK+I + +R+ + FD G+ G ++DSG+
Sbjct: 248 GISPPSDMAFAYSDPVRSPYYNIDLKEIHVAGKRLPLNANVFD----GKHGTVLDSGTTY 303
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF-----LPETFNRFPSMAFYFE 275
Y + + V + + D P +C+ + + FP + FE
Sbjct: 304 AYLPEAAFLAFKDAIVKELQSLKKISGPD-PNYNDICFSGAGIDVSQLSKSFPVVDMVFE 362
Query: 276 DANLRIDGENVFIIDYENHFF----------LLAVAPHDDLVALIGSQQQRDTRFVYDLN 325
+ + + EN+ F L +D L+G R+T VYD
Sbjct: 363 NG-------QKYTLSPENYMFRHSKVRGAYCLGVFQNGNDQTTLLGGIIVRNTLVVYDRE 415
Query: 326 IDLLSFVKENCSD 338
+ F K NC++
Sbjct: 416 QTKIGFWKTNCAE 428
>gi|356539555|ref|XP_003538263.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 438
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 83/358 (23%), Positives = 147/358 (41%), Gaps = 39/358 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY-----------AIFDPRKSSSFQKINCDHPDCTYF 49
+VR IGTP + +LL +DT + + +F P KS++F+ ++C P+C
Sbjct: 98 IVRAKIGTPPQTLLLAIDTSNDAAWIPCTACDGCTSTLFAPEKSTTFKNVSCGSPECNKV 157
Query: 50 ---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
C C + + Y S+ +T+++ G FGC G
Sbjct: 158 PSPSCGTSACTFNLTYGSSSIAANVV-QDTVTL-----ATDPIPGYTFGCVAKTTGPSTP 211
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
+ R +S +SQ ++ + FSYCL P S L+ G R
Sbjct: 212 PQGLLGL-----GRGPLSLLSQTQNLYQSTFSYCL--PSFKSLNFSGSLRLGPVAQPIR- 263
Query: 167 STQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
+ T + +P ++ YY++L I + + ++ PP + G + DSG+V T
Sbjct: 264 -IKYTPLLKNPRRSSLYYVNLFAIRVGRKIVDIPPAALAFNAATGAGTVFDSGTVFTRLV 322
Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCP-EPIQLCYFLPETFNRFPSMAFYFEDANLRIDG 283
+ VY + ++F A L+ CY +P P++ F F N+ +
Sbjct: 323 APVYTAVRDEFRRRVAMAAKANLTVTSLGGFDTCYTVPIV---APTITFMFSGMNVTLPQ 379
Query: 284 ENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+N+ I LA+A D ++ +I + QQ++ R +YD+ L +E C+
Sbjct: 380 DNILIHSTAGSTSCLAMASAPDNVNSVLNVIANMQQQNHRVLYDVPNSRLGVARELCT 437
>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 639
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 96/368 (26%), Positives = 150/368 (40%), Gaps = 56/368 (15%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
RL+IG+P + LI+DTGS + Y F P SS++Q + C+ DC
Sbjct: 91 TRLWIGSPPQEFALIVDTGSTVTYVPCSNCVQCGNHQDPRFQPELSSTYQPVKCN-ADC- 148
Query: 48 YFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
C QC Y +YA+ S + G A + +S GK E + + A+FGC G
Sbjct: 149 --NCDENGVQCTYERRYAEMSTSSGVLAEDVMS-FGK-ESELVPQRAVFGCETMESG--- 201
Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM-- 161
D G++GL R T+S + QL ++ FS C Y + G +
Sbjct: 202 DLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLC---------YGGMDVGGGAMVLG 252
Query: 162 GYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
G P + + YY + LK+I + + + P TFD G+ G I+DSG+
Sbjct: 253 GISSPPGMVFSHSDPSRSPYYNIELKEIHVAGKPLKLNPRTFD----GKYGAILDSGTTY 308
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP--IQLCYF-----LPETFNRFPSMAFY 273
YF Y+ + + L Q+S P+P +C+ + E FP +
Sbjct: 309 AYFPEKAYYAFKDAIMKKISF--LKQISG-PDPNFKDICFSGAGRDVTELPKVFPEVDMV 365
Query: 274 FEDAN-LRIDGENVFI--IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
F + + + EN + L +D L+G R+T Y+ +
Sbjct: 366 FANGQKISLSPENYLFRHTKVSGAYCLGIFKNGNDQTTLLGGIIVRNTLVTYNRENSTIG 425
Query: 331 FVKENCSD 338
F K NCS+
Sbjct: 426 FWKTNCSE 433
>gi|194698750|gb|ACF83459.1| unknown [Zea mays]
gi|194703964|gb|ACF86066.1| unknown [Zea mays]
gi|219886221|gb|ACL53485.1| unknown [Zea mays]
gi|219886359|gb|ACL53554.1| unknown [Zea mays]
gi|223950085|gb|ACN29126.1| unknown [Zea mays]
gi|414865218|tpg|DAA43775.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 431
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 97/373 (26%), Positives = 148/373 (39%), Gaps = 59/373 (15%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA------------IFDPRKSSSFQKINCDHPDCTY 48
+VR +GTP + +LL LDT + ++ F P SSS+ + C C
Sbjct: 80 VVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSRFIPASSSSYASLPCASDWCPL 139
Query: 49 FK---CVNEQ--------CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
F+ C Q C ++ +AD S + +T+ + GK G FGC
Sbjct: 140 FEGQPCPANQDASAPLPACAFSKPFADTSFQASLGS-DTLRL-----GKDAIAGYAFGCV 193
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
G + G+LGL R +S +SQ GS FSYCL P Y S L+
Sbjct: 194 GAVAGPTTNLPK---QGLLGLGRGPMSLLSQTGSRYNGVFSYCL--PSYRSYYFSGSLRL 248
Query: 158 GTDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
G R + + T + +P+ + YY+++ +S+ + P +F + G +ID
Sbjct: 249 GAAGQPR--NVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVID 306
Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE 275
SG+V+T + + VY L E+F R Q+A S L F FN A
Sbjct: 307 SGTVITRWTAPVYAALREEF-----RRQVAAPSGY---TSLGAFD-TCFNTDEVAAGGAP 357
Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQ------------QQRDTRFVYD 323
L +DG + EN + P L Q QQ++ R V D
Sbjct: 358 PVTLHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVD 417
Query: 324 LNIDLLSFVKENC 336
+ + F +E C
Sbjct: 418 VAGSRVGFAREPC 430
>gi|212722554|ref|NP_001131154.1| uncharacterized protein LOC100192462 precursor [Zea mays]
gi|194690728|gb|ACF79448.1| unknown [Zea mays]
Length = 431
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 97/373 (26%), Positives = 148/373 (39%), Gaps = 59/373 (15%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA------------IFDPRKSSSFQKINCDHPDCTY 48
+VR +GTP + +LL LDT + ++ F P SSS+ + C C
Sbjct: 80 VVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSRFIPASSSSYASLPCASDWCPL 139
Query: 49 FK---CVNEQ--------CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
F+ C Q C ++ +AD S + +T+ + GK G FGC
Sbjct: 140 FEGQPCPANQDASAPLPACAFSKPFADTSFQASLGS-DTLRL-----GKDAIAGYAFGCV 193
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
G + G+LGL R +S +SQ GS FSYCL P Y S L+
Sbjct: 194 GAVAGPTTNLPK---QGLLGLGRGPMSLLSQTGSRYNGVFSYCL--PSYRSYYFSGSLRL 248
Query: 158 GTDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
G R + + T + +P+ + YY+++ +S+ + P +F + G +ID
Sbjct: 249 GAAGQPR--NVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVID 306
Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE 275
SG+V+T + + VY L E+F R Q+A S L F FN A
Sbjct: 307 SGTVITRWTAPVYAALREEF-----RRQVAAPSGY---TSLGAFD-TCFNTDEVAAGGAP 357
Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQ------------QQRDTRFVYD 323
L +DG + EN + P L Q QQ++ R V D
Sbjct: 358 PVTLHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVD 417
Query: 324 LNIDLLSFVKENC 336
+ + F +E C
Sbjct: 418 VAGSRVGFAREPC 430
>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
Length = 475
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 88/368 (23%), Positives = 143/368 (38%), Gaps = 67/368 (18%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
+V + +GTP+ L +DTGS + + +FDP +SSS+ + C
Sbjct: 143 VVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSSYSAVPCAAA 202
Query: 45 DCTYFK-----CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
C+ C QC Y + Y D S T G + +T+++ G G LFGC +
Sbjct: 203 SCSQLALYSNGCSGGQCGYVVSYGDGSTTTGVYSSDTLTLTGSNA----LKGFLFGCGHA 258
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
G + G+LGL R S +SQ S FSYC LP + + Y+ G
Sbjct: 259 QQGLFA-----GVDGLLGLGRQGQSLVSQASSTYGGVFSYC----LPPTQNSVGYISLGG 309
Query: 160 DMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
ST ++ +Y + L IS+ + ++ F G ++D+G+V
Sbjct: 310 PSSTAGFSTTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVF------ASGAVVDTGTV 363
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP----IQLCYFLPETFNRF-----PSM 270
+T Y L F R +A P + CY F R+ P++
Sbjct: 364 VTRLPPTAYSALRSAF-----RAAMAPYGYPSAPATGILDTCY----DFTRYGTVTLPTI 414
Query: 271 AFYFEDANLRIDGENVFIIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFVYDLNIDL 328
+ F G + + LA AP D +++G+ QQR +D +
Sbjct: 415 SIAFGGGAAMDLGTSGILTS-----GCLAFAPTGGDSQASILGNVQQRSFEVRFDGST-- 467
Query: 329 LSFVKENC 336
+ F+ +C
Sbjct: 468 VGFMPASC 475
>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
Length = 333
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 80/359 (22%), Positives = 150/359 (41%), Gaps = 51/359 (14%)
Query: 4 LFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPDCTY 48
+ +GTP+ ++++DTGS+L + +F+P+ SS++ + C C+
Sbjct: 1 MGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCSAQQCSD 60
Query: 49 F--------KCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
C + C+Y Y D S + G+ + +T+S G +GC D
Sbjct: 61 LPSATLNPSACSSSNVCIYQASYGDSSFSVGYLSKDTVSF-----GSTSLPNFYYGCGQD 115
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
N G G AG++GL+R +S + QL + F+YC LP+ +
Sbjct: 116 NEGLF-----GRSAGLIGLARNKLSLLYQLAPSLGYSFTYC----LPSSSSSGYLSLGSY 166
Query: 160 DMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
+ G + + ++ ++ Y++ L +++ P + + IIDSG+V
Sbjct: 167 NPGQYSYTPMVSSSLD--DSLYFIKLSGMTVAGN-----PLSVSSSAYSSLPTIIDSGTV 219
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE-DAN 278
+T + VY L + + + A + C+ + P++ F A
Sbjct: 220 ITRLPTSVYSALSKAVAAAMKGTSRASAYSI---LDTCFKGQASRVSAPAVTMSFAGGAA 276
Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
L++ +N ++D ++ LA AP A+IG+ QQ+ VYD+ + F CS
Sbjct: 277 LKLSAQN-LLVDVDDSTTCLAFAPARS-AAIIGNTQQQTFSVVYDVKSSRIGFAAGGCS 333
>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
Length = 373
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 101/380 (26%), Positives = 154/380 (40%), Gaps = 57/380 (15%)
Query: 6 IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYFKC 51
IGTP + VLL++DT S L + F+P SSSF C C
Sbjct: 5 IGTPPREVLLLVDTASELTWVQGTSCTNCSPTKVPPFNPGLSSSFISEPCTSSVCLGRSK 64
Query: 52 VNEQ---------CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
+ Q C + + Y D S G A E S+ + +FGC++ +
Sbjct: 65 LGFQSACNRSTGSCSFQVAYLDGSEAYGVIAREIFSLQSWDGAASTLGDVIFGCASKDLQ 124
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKK----RFSYCLVIPLPN-GEY--TSSYL 155
D +G LGL+R + SF +Q+GS K RFSYC PN E+ +S +
Sbjct: 125 RPVDFS----SGTLGLNRGSFSFPAQIGSRSKSGLSDRFSYCF----PNRAEHLNSSGVI 176
Query: 156 KFGTDMGYRRPSTQATKFINHPN-----NFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
FG D G Q P +FYY+ L+ IS+ E ++ P F I G G
Sbjct: 177 IFG-DSGIPAHHFQYLSLEQEPPIASIVDFYYVGLQGISVGGELLHIPRSAFKIDRLGNG 235
Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPS- 269
G DSG+ +++ + L E F SD + +LCY + R P+
Sbjct: 236 GTYFDSGTTVSFLVEPAHTALVEAFGRRVLHLNRTSGSDFTK--ELCYDVAAGDARLPTA 293
Query: 270 --MAFYFE-DANLRIDGENVFIIDYENH-------FFLLAVAPHDDLVALIGSQQQRDTR 319
+ +F+ + ++ + +V++ F+ A A V +IG+ QQ+D
Sbjct: 294 PLVTLHFKNNVDMELREASVWVPLARTPQVVTICLAFVNAGAVAQGGVNVIGNYQQQDYL 353
Query: 320 FVYDLNIDLLSFVKENCSDD 339
+DL + F NC D
Sbjct: 354 IEHDLERSRIGFAPANCVMD 373
>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 645
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 95/366 (25%), Positives = 151/366 (41%), Gaps = 52/366 (14%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
RL+IGTP + LI+DTGS + Y F P S ++Q + C C
Sbjct: 95 ARLWIGTPPQRFALIVDTGSTVTYVPCSTCRHCGSHQDPKFRPEDSETYQPVKCTW-QC- 152
Query: 48 YFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
C N+ QC Y +YA+ S + G + +S + E A+FGC ND G
Sbjct: 153 --NCDNDRKQCTYERRYAEMSTSSGALGEDVVSFGNQTELSP--QRAIFGCENDETG--- 205
Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTS-SYLKFGTDMG 162
D + G++GL R +S + QL +I FS C G + DM
Sbjct: 206 DIYNQRADGIMGLGRGDLSIMDQLVEKKVISDSFSLCYGGMGVGGGAMVLGGISPPADMV 265
Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
+ R + + P +Y + LK+I + +R++ P FD G+ G ++DSG+ Y
Sbjct: 266 FTR-----SDPVRSP--YYNIDLKEIHVAGKRLHLNPKVFD----GKHGTVLDSGTTYAY 314
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPI--QLCYF-----LPETFNRFPSMAFYFE 275
+ L K E L ++S P+P +C+ + + FP + F
Sbjct: 315 LPESAF--LAFKHAIMKETHSLKRISG-PDPRYNDICFSGAEIDVSQISKSFPVVEMVFG 371
Query: 276 DAN-LRIDGENVFI--IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
+ + L + EN + L + +D L+G R+T +YD + F
Sbjct: 372 NGHKLSLSPENYLFRHSKVRGAYCLGVFSNGNDPTTLLGGIVVRNTLVMYDREHTKIGFW 431
Query: 333 KENCSD 338
K NCS+
Sbjct: 432 KTNCSE 437
>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 640
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 91/369 (24%), Positives = 148/369 (40%), Gaps = 58/369 (15%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
RL+IGTP + LI+DTGS + Y F P S ++Q + C
Sbjct: 95 TRLWIGTPPQRFALIVDTGSTVTYVPCSTCKHCGSHQDPKFRPEASETYQPVKCT----- 149
Query: 48 YFKCV----NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
++C +QC Y +YA+ S + G + +S + E A+FGC ND G
Sbjct: 150 -WQCNCDDDRKQCTYERRYAEMSTSSGVLGEDVVSFGNQSELSP--QRAIFGCENDETG- 205
Query: 104 DEDARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
D + G++GL R +S + QL +I FS C +
Sbjct: 206 --DIYNQRADGIMGLGRGDLSIMDQLVEKKVISDAFSLCYG--------GMGVGGGAMVL 255
Query: 162 GYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
G P + P + +Y + LK+I + +R++ P FD G+ G ++DSG+
Sbjct: 256 GGISPPADMVFTHSDPVRSPYYNIDLKEIHVAGKRLHLNPKVFD----GKHGTVLDSGTT 311
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPI--QLCYF-----LPETFNRFPSMAF 272
Y + L K E L ++S P+P +C+ + + FP +
Sbjct: 312 YAYLPESAF--LAFKHAIMKETHSLKRISG-PDPHYNDICFSGAEINVSQLSKSFPVVEM 368
Query: 273 YFEDAN-LRIDGENVFI--IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
F + + L + EN + L + +D L+G R+T +YD +
Sbjct: 369 VFGNGHKLSLSPENYLFRHSKVRGAYCLGVFSNGNDPTTLLGGIVVRNTLVMYDREHSKI 428
Query: 330 SFVKENCSD 338
F K NCS+
Sbjct: 429 GFWKTNCSE 437
>gi|242086034|ref|XP_002443442.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
gi|241944135|gb|EES17280.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
Length = 443
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 91/369 (24%), Positives = 149/369 (40%), Gaps = 46/369 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
+ +G P + ++DTGS+LI+ F+ S SF + C
Sbjct: 87 IAEYMVGDPPQRAEALIDTGSSLIWTQCTACLRKVCVRQDLPYFNASSSGSFAPVPCQDK 146
Query: 45 DCT----YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
C +F ++ C + + Y + GF + + G A FGC +
Sbjct: 147 ACAGNYLHFCALDGTCTFRVTYGAGGII-GFLGTDAFTFQSGGATLA------FGCVSFT 199
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT- 159
D GA +G++GL R +S SQ G+ KRFSYCL P + SS+L G
Sbjct: 200 RFAAPDVLHGA-SGLIGLGRGRLSLASQTGA---KRFSYCLT-PYFHNNGASSHLFVGAA 254
Query: 160 -DMGYRRPSTQATKFINHPNN-----FYYLSLKDISIDNERMNFPPDTFDITVS----GE 209
+ + + F+ P + FYYL L I++ ++ P FD+ E
Sbjct: 255 ASLSGGGGAVMSMAFVESPKDYPYSTFYYLPLVGITVGETKLAIPSTAFDLQEVEEGFWE 314
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPS 269
GG IIDSGS T D Y L + + + + LC + P+
Sbjct: 315 GGVIIDSGSPFTSLVEDAYEPLMGELARQLNGSLVPPPGEDDGGMALCVARGDLDRVVPT 374
Query: 270 MAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDL 328
+ +F A++ + EN + E +A+ L ++IG+ QQ++ ++D+
Sbjct: 375 LVLHFSGGADMALPPEN-YWAPLEKSTACMAIV-RGYLQSIIGNFQQQNMHILFDVGGGR 432
Query: 329 LSFVKENCS 337
LSF +CS
Sbjct: 433 LSFQNADCS 441
>gi|238011160|gb|ACR36615.1| unknown [Zea mays]
Length = 461
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 95/414 (22%), Positives = 149/414 (35%), Gaps = 90/414 (21%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA----------------------------------- 26
VR +GTP++ LL+ DTGS L +
Sbjct: 57 VRFRVGTPARPFLLVADTGSDLTWVKCRRHAAPAPAPAPAPGYNYGYGAPASNDSSSVSA 116
Query: 27 -------IFDPRKSSSFQKINCDHPDCTYF--------KCVNEQCVYTMKYADQSVTKGF 71
+F P +S ++ I C CT C Y +Y D S +G
Sbjct: 117 AASSPARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTPGSPCAYEYRYKDGSAARGT 176
Query: 72 AAHETISVI------GKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISF 125
++ ++ GK + +A G + GC+ G A DG VL L +SF
Sbjct: 177 VGTDSATIALSGRRAGKKQRRAKLRGVVLGCTTSYTGESFLASDG----VLSLGYSNVSF 232
Query: 126 ISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATK------------- 172
S+ + RFSYCLV L T SYL FG + S T
Sbjct: 233 ASRAAARFGGRFSYCLVDHLAPRNAT-SYLTFGPNPAVSSASASRTACAGSAAAPGARQT 291
Query: 173 --FINHP-NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYW 229
++H FY +++ +S+D E + P +D V GG I+DSG+ LT S Y
Sbjct: 292 PLLLDHRMRPFYAVAVNGVSVDGELLRIPRLVWD--VQKGGGAILDSGTSLTVLVSPAY- 348
Query: 230 KLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN------RFPSMAFYFEDANLRIDG 283
V+ + + +P CY P++A +F +
Sbjct: 349 ---RAVVAALGKKLVGLPRVAMDPFDYCYNWTSPLTGEDLAVAVPALAVHFAGSARLQPP 405
Query: 284 ENVFIIDYENHFFLLAVAPHD-DLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
++ID + + D V++IG+ Q++ + +DL L F + C
Sbjct: 406 PKSYVIDAAPGVKCIGLQEGDWPGVSVIGNILQQEHLWEFDLKNRRLRFKRSRC 459
>gi|115473845|ref|NP_001060521.1| Os07g0658600 [Oryza sativa Japonica Group]
gi|22775625|dbj|BAC15479.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|50510141|dbj|BAD31106.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113612057|dbj|BAF22435.1| Os07g0658600 [Oryza sativa Japonica Group]
Length = 449
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 86/358 (24%), Positives = 152/358 (42%), Gaps = 39/358 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY------------AIFDPRKSSSFQKINCDHPDCTY 48
+VR +GTP++ +LL +DT + + + F+P S+S++ + C P C
Sbjct: 108 VVRARLGTPAQQLLLAVDTSNDAAWIPCSGCAGCPTSSPFNPAASASYRPVPCGSPQCVL 167
Query: 49 F---KCV--NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
C + C +++ YAD S+ + +T++V G + FGC G
Sbjct: 168 APNPSCSPNAKSCGFSLSYADSSLQAALS-QDTLAVAGD-----VVKAYTFGCLQRATGT 221
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
+ +LGL R +SF+SQ + FSYCL P S L+ G +
Sbjct: 222 AAPPQG-----LLGLGRGPLSFLSQTKDMYGATFSYCL--PSFKSLNFSGTLRLGRNGQP 274
Query: 164 RRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
RR T H ++ YY+++ I + + ++ P + G ++DSG++ T
Sbjct: 275 RRIKTTPLLANPHRSSLYYVNMTGIRVGKKVVSIPASALAFDPATGAGTVLDSGTMFTRL 334
Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDG 283
+ VY L ++ A S CY T +P + F+ + +
Sbjct: 335 VAPVYLALRDEVRRRVGAGAAAVSSL--GGFDTCY---NTTVAWPPVTLLFDGMQVTLPE 389
Query: 284 ENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
ENV I LA+A D ++ +I S QQ++ R ++D+ + F +E+C+
Sbjct: 390 ENVVIHTTYGTTSCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARESCT 447
>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 463
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 88/372 (23%), Positives = 154/372 (41%), Gaps = 61/372 (16%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINC----- 41
V++ +GTP+K +I+DTGS+L + IF P S +++ + C
Sbjct: 115 VKIGLGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSTSKTYKALPCSSSQC 174
Query: 42 --------DHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL 93
+ P C+ CVY Y D S + G+ + + +++ +A G +
Sbjct: 175 SSLKSSTLNAPGCSN---ATGACVYKASYGDTSFSIGYLSQDVLTLT---PSEAPSSGFV 228
Query: 94 FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSS 153
+GC DN G G +G++GL+ IS + QL FSYCL P SS
Sbjct: 229 YGCGQDNQGLF-----GRSSGIIGLANDKISMLGQLSKKYGNAFSYCL--PSSFSAPNSS 281
Query: 154 YLKFGTDMGYRRPSTQATKFINHPNN-----FYYLSLKDISIDNERMNFPPDTFDITVSG 208
L +G ++ KF N Y+L L I++ + + ++++
Sbjct: 282 SLSGFLSIGASSLTSSPYKFTPLVKNQKIPSLYFLDLTTITVAGKPLGVSASSYNVPT-- 339
Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLP-ETFN 265
IIDSG+V+T VY L + FV + + + P + C+ + +
Sbjct: 340 ----IIDSGTVITRLPVAVYNALKKSFVLIMSK----KYAQAPGFSILDTCFKGSVKEMS 391
Query: 266 RFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDL 324
P + F A L + N +++ E LA+A + +++IG+ QQ+ + YD+
Sbjct: 392 TVPEIQIIFRGGAGLELKAHNS-LVEIEKGTTCLAIAASSNPISIIGNYQQQTFKVAYDV 450
Query: 325 NIDLLSFVKENC 336
+ F C
Sbjct: 451 ANFKIGFAPGGC 462
>gi|302802500|ref|XP_002983004.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
gi|300149157|gb|EFJ15813.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
Length = 332
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 85/345 (24%), Positives = 159/345 (46%), Gaps = 35/345 (10%)
Query: 6 IGTPSKGVLLILDTGSALIYAIFDPRK---SSSFQKINCDHPDCTY--FKCVNEQCVYTM 60
+G+P K L++DTGS L + DP SS+F ++ + TY C ++ Y+
Sbjct: 9 LGSPPKDFSLVMDTGSDLTWVRCDPCSPDCSSTFDRLASN----TYKALTCADD---YSY 61
Query: 61 KYADQSVTKGFAAHETISVIGKGEGK-AIFHGALFGCSNDNHGFDEDARDGALAGVLGLS 119
Y D S T+G + +T+ + G + F G +FGC + G G+L LS
Sbjct: 62 GYGDGSFTQGDLSVDTLKMAGAASDELEEFPGFVFGCGSLLKGLISGE-----VGILALS 116
Query: 120 RVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG-TDMGYRRPST---QATKF-- 173
++SF SQ+G +FSYCL+ S + FG + + P + Q ++
Sbjct: 117 PGSLSFPSQIGEKYGNKFSYCLLRQTAQNSLKKSPMVFGEAAVELKEPGSGKLQELQYTP 176
Query: 174 INHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGG-CIIDSGSVLTYFHSDVYWKLH 232
I + +Y + L IS+ N+R++ P F ++G+ I DSG+ LT V +
Sbjct: 177 IGESSIYYTVRLDGISVGNQRLDLSPSAF---LNGQDKPTIFDSGTTLTMLPPGVCDSIK 233
Query: 233 EKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-FPSMAFYFEDANLRIDGENVFIIDY 291
+ S + + + + C+ +P + + P + F+F + + ++ID
Sbjct: 234 QSLASMVSGAEFVAI----KGLDACFRVPPSSGQGLPDITFHFNGGADFVTRPSNYVIDL 289
Query: 292 ENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ L+ V ++ V++ G+ QQ+D ++D++ + F + +C
Sbjct: 290 GSLQCLIFVPTNE--VSIFGNLQQQDFFVLHDMDNRRIGFKETDC 332
>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 609
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 96/368 (26%), Positives = 150/368 (40%), Gaps = 56/368 (15%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
RL+IG+P + LI+DTGS + Y F P SS++Q + C+ DC
Sbjct: 91 TRLWIGSPPQEFALIVDTGSTVTYVPCSNCVQCGNHQDPRFQPELSSTYQPVKCN-ADC- 148
Query: 48 YFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
C QC Y +YA+ S + G A + +S GK E + + A+FGC G
Sbjct: 149 --NCDENGVQCTYERRYAEMSTSSGVLAEDVMS-FGK-ESELVPQRAVFGCETMESG--- 201
Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM-- 161
D G++GL R T+S + QL ++ FS C Y + G +
Sbjct: 202 DLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLC---------YGGMDVGGGAMVLG 252
Query: 162 GYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
G P + + YY + LK+I + + + P TFD G+ G I+DSG+
Sbjct: 253 GISSPPGMVFSHSDPSRSPYYNIELKEIHVAGKPLKLNPRTFD----GKYGAILDSGTTY 308
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP--IQLCYF-----LPETFNRFPSMAFY 273
YF Y+ + + L Q+S P+P +C+ + E FP +
Sbjct: 309 AYFPEKAYYAFKDAIMKKISF--LKQISG-PDPNFKDICFSGAGRDVTELPKVFPEVDMV 365
Query: 274 FEDAN-LRIDGENVFI--IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
F + + + EN + L +D L+G R+T Y+ +
Sbjct: 366 FANGQKISLSPENYLFRHTKVSGAYCLGIFKNGNDQTTLLGGIIVRNTLVTYNRENSTIG 425
Query: 331 FVKENCSD 338
F K NCS+
Sbjct: 426 FWKTNCSE 433
>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
Length = 469
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 77/321 (23%), Positives = 129/321 (40%), Gaps = 36/321 (11%)
Query: 27 IFDPRKSSSFQKINCDHPDCTYF-----KCV-NEQCVYTMKYADQSVTKGFAAHETISVI 80
++DP KSSS +C+ P CT C N QC Y ++Y D + T G + +++
Sbjct: 174 LYDPTKSSSSGVFSCNSPTCTQLGPYANGCTNNNQCQYRVRYPDGTSTAGTYISDLLTIT 233
Query: 81 GKGEGKAIFHGALFGCSNDNHG---FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRF 137
++ FGCS+ G F A AG++ L S +SQ + + F
Sbjct: 234 PATAVRSF----QFGCSHGVQGSFSFGSSA-----AGIMALGGGPESLVSQTAATYGRVF 284
Query: 138 SYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNF 197
S+C P G +T L +R T K P FY + L+ I++ +R+
Sbjct: 285 SHCFPPPTRRGFFT---LGVPRVAAWRYVLTPMLKNPAIPPTFYMVRLEAIAVAGQRIAV 341
Query: 198 PPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLC 257
PP F G +DS + +T Y L + F +R + Q + P+ C
Sbjct: 342 PPTVF------AAGAALDSRTAITRLPPTAYQALRQAF---RDRMAMYQPAPPKGPLDTC 392
Query: 258 YFLPETFN-RFPSMAFYF-EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQ 315
Y + + P + F ++A + +D V P+D + +IG+ Q
Sbjct: 393 YDMAGVRSFALPRITLVFDKNAAVELDPSGVLF----QGCLAFTAGPNDQVPGIIGNIQL 448
Query: 316 RDTRFVYDLNIDLLSFVKENC 336
+ +Y++ L+ F C
Sbjct: 449 QTLEVLYNIPAALVGFRHAAC 469
>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 494
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 77/321 (23%), Positives = 129/321 (40%), Gaps = 36/321 (11%)
Query: 27 IFDPRKSSSFQKINCDHPDCTYF-----KCV-NEQCVYTMKYADQSVTKGFAAHETISVI 80
++DP KSSS +C+ P CT C N QC Y ++Y D + T G + +++
Sbjct: 199 LYDPTKSSSSGVFSCNSPTCTQLGPYANGCTNNNQCQYRVRYPDGTSTAGTYISDLLTIT 258
Query: 81 GKGEGKAIFHGALFGCSNDNHG---FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRF 137
++ FGCS+ G F A AG++ L S +SQ + + F
Sbjct: 259 PATAVRSF----QFGCSHGVQGSFSFGSSA-----AGIMALGGGPESLVSQTAATYGRVF 309
Query: 138 SYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNF 197
S+C P G +T L +R T K P FY + L+ I++ +R+
Sbjct: 310 SHCFPPPTRRGFFT---LGVPRVAAWRYVLTPMLKNPAIPPTFYMVRLEAIAVAGQRIAV 366
Query: 198 PPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLC 257
PP F G +DS + +T Y L + F +R + Q + P+ C
Sbjct: 367 PPTVF------AAGAALDSRTAITRLPPTAYQALRQAF---RDRMAMYQPAPPKGPLDTC 417
Query: 258 YFLPETFN-RFPSMAFYF-EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQ 315
Y + + P + F ++A + +D V P+D + +IG+ Q
Sbjct: 418 YDMAGVRSFALPRITLVFDKNAAVELDPSGVLF----QGCLAFTAGPNDQVPGIIGNIQL 473
Query: 316 RDTRFVYDLNIDLLSFVKENC 336
+ +Y++ L+ F C
Sbjct: 474 QTLEVLYNIPAALVGFRHAAC 494
>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 457
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 86/372 (23%), Positives = 154/372 (41%), Gaps = 61/372 (16%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKI------- 39
V++ +GTP+K +I+DTGS+L + IF P S +++ +
Sbjct: 109 VKIGVGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSVSKTYKALSCSSSQC 168
Query: 40 ------NCDHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL 93
+ P C+ CVY Y D S + G+ + + +++ A G +
Sbjct: 169 SSLKSSTLNAPGCSN---ATGACVYKASYGDTSFSIGYLSQDVLTLT---PSAAPSSGFV 222
Query: 94 FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPL---PNGEY 150
+GC DN G G AG++GL+ +S + QL + FSYCL PN
Sbjct: 223 YGCGQDNQGLF-----GRSAGIIGLANDKLSMLGQLSNKYGNAFSYCLPSSFSAQPNSS- 276
Query: 151 TSSYLKFGTDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSG 208
S +L G P + T + +P + Y+L L I++ + + ++++
Sbjct: 277 VSGFLSIGASSLSSSP-YKFTPLVKNPKIPSLYFLGLTTITVAGKPLGVSASSYNVPT-- 333
Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLP-ETFN 265
IIDSG+V+T +Y L + FV + + + P + C+ + +
Sbjct: 334 ----IIDSGTVITRLPVAIYNALKKSFVMIMSK----KYAQAPGFSILDTCFKGSVKEMS 385
Query: 266 RFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDL 324
P + F A L + N +++ E LA+A + +++IG+ QQ+ YD+
Sbjct: 386 TVPEIRIIFRGGAGLELKVHNS-LVEIEKGTTCLAIAASSNPISIIGNYQQQTFTVAYDV 444
Query: 325 NIDLLSFVKENC 336
+ F C
Sbjct: 445 ANSKIGFAPGGC 456
>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 494
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 98/382 (25%), Positives = 154/382 (40%), Gaps = 69/382 (18%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
++ IGTPSKG + +DTGS +++ ++DP S+S + + C
Sbjct: 91 TQIGIGTPSKGYYVQVDTGSDILWVNCISCDSCPRKSGLGIDLTLYDPTASASSKTVTCG 150
Query: 43 HPDCTYFK--------CVNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHG 91
C N C Y++ Y D S T GF + + V G G+
Sbjct: 151 QEFCATATNGGVPPSCAANSPCQYSITYGDGSSTTGFFVADFLQYDQVSGDGQTNLANAS 210
Query: 92 ALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGE 149
FGC G + + AL G+LG + S +SQL S + K FS+CL G
Sbjct: 211 VTFGCGAKIGGA-LGSSNVALDGILGFGQANSSMLSQLTSAGKVTKIFSHCLDTVNGGGI 269
Query: 150 YTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSG 208
+ + +P + T + P+ Y + LK I + + P + FDI G
Sbjct: 270 FAIGNV--------VQPKVKTTPLVPGMPH--YNVVLKTIDVGGSTLQLPTNIFDIG-GG 318
Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRF 267
G IIDSG+ L Y VY + S L + D LC+ + N F
Sbjct: 319 SRGTIIDSGTTLAYLPEVVYKAVLSAVFSNHPDVTLKNVQDF-----LCFQYSGSVDNGF 373
Query: 268 PSMAFYFEDANLRI----------DGENVFIIDYENHFFLLAVAPHD--DLVALIGSQQQ 315
P + F+F D +L + + E+V+ + +++ V D D+V L+G
Sbjct: 374 PEVTFHF-DGDLPLVVYPHDYLFQNTEDVYCVGFQSG----GVQSKDGKDMV-LLGDLAL 427
Query: 316 RDTRFVYDLNIDLLSFVKENCS 337
+ VYDL ++ + NCS
Sbjct: 428 SNKLVVYDLENQVIGWTNYNCS 449
>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 484
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 96/363 (26%), Positives = 148/363 (40%), Gaps = 61/363 (16%)
Query: 11 KGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYF------- 49
K + LI+DTGS L + ++DP SSS++ + C+ C
Sbjct: 144 KNMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNS 203
Query: 50 -------KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
V C Y + Y D S T+G A E+I + G +FGC +N G
Sbjct: 204 GPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILL-----GDTKLENFVFGCGRNNKG 258
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
+ R ++S +SQ FSYCL L +G S L FG D
Sbjct: 259 LFGGSSGLMGL-----GRSSVSLVSQTLKTFNGVFSYCLP-SLEDG--ASGSLSFGNDSS 310
Query: 163 YRRPSTQA--TKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
ST T + +P +FY L+L SI + + S G +IDSG+
Sbjct: 311 VYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGGVELK--------SSSFGRGILIDSGT 362
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFE-D 276
V+T +Y + +F+ F F A + C+ L + P + F+ +
Sbjct: 363 VITRLPPSIYKAVKIEFLKQFSGFPTAPGYSI---LDTCFNLTSYEDISIPIIKMIFQGN 419
Query: 277 ANLRIDGENVF-IIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
A L +D VF + + LA+A +++ V +IG+ QQ++ R +YD + L V
Sbjct: 420 AELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDSTQERLGIVG 479
Query: 334 ENC 336
ENC
Sbjct: 480 ENC 482
>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
Length = 452
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 89/362 (24%), Positives = 144/362 (39%), Gaps = 70/362 (19%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY-----------------AIFDPRKSSSFQKINCDH 43
++ + +G+P+ +++DTGS + + A+FDP SS++ NC
Sbjct: 109 VISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFNCSA 168
Query: 44 PDCTYFKCVNE--------QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFG 95
C E +C Y +KY D S T G + + +++ G + G FG
Sbjct: 169 AACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYSSDVLTL----SGSDVVRGFQFG 224
Query: 96 CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL-VIPLPNGEYTSSY 154
CS+ G D + G++GL S +SQ + K F YCL P +G T
Sbjct: 225 CSHAELGAGMDDK---TDGLIGLGGDAQSPVSQTAARYGKSFFYCLPATPASSGFLTLGA 281
Query: 155 LKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
G G R +T +Y+ +L+DI++ +++ P F G ++
Sbjct: 282 PASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLSPSVF------AAGSLV 335
Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-------RF 267
DSG+V+T Y L F + R+ A EP+ + L FN
Sbjct: 336 DSGTVITRLPPAAYAALSSAFRAGMTRYARA------EPLGI---LDTCFNFTGLDKVSI 386
Query: 268 PSMAFYFEDANLRIDGENVFIIDYENHFFL----LAVAP--HDDLVALIGSQQQRDTRFV 321
P++A F ++D + H + LA AP D IG+ QQR +
Sbjct: 387 PTVALVFAGGA---------VVDLDAHGIVSGGCLAFAPTRDDKAFGTIGNVQQRTFEVL 437
Query: 322 YD 323
YD
Sbjct: 438 YD 439
>gi|293333354|ref|NP_001169607.1| uncharacterized protein LOC100383488 [Zea mays]
gi|224030351|gb|ACN34251.1| unknown [Zea mays]
Length = 342
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 82/344 (23%), Positives = 150/344 (43%), Gaps = 55/344 (15%)
Query: 27 IFDPRKSSSFQKINCDHPDCTYF---KCVNE---QCVYTMKYADQSVTKGFAAHETISVI 80
+F+P+ SSS+ + C C +C + C YT KY+ VTKG A + +++
Sbjct: 16 VFNPKLSSSYAVVPCTSDTCAQLDGHRCHEDDDGACQYTYKYSGHGVTKGTLAIDKLAI- 74
Query: 81 GKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYC 140
G +FH +FGCS+ + G +G++GL R +S +SQL RF YC
Sbjct: 75 ----GGDVFHAVVFGCSDSSVG----GPAAQASGLVGLGRGPLSLVSQLS---VHRFMYC 123
Query: 141 LVIPLPNGEYTSSYLKFGTDM-GYRRPSTQATKFINHPN---NFYYLSLKDISIDNE--- 193
L P+ TS L G R S + T ++ ++YYL+L +++ ++
Sbjct: 124 LPPPM---SRTSGKLVLGAGADAVRNMSDRVTVTMSSSTRYPSYYYLNLDGLAVGDQTPG 180
Query: 194 ---RMNFPPDTFDITVSGEG-------------GCIIDSGSVLTYFHSDVYWKLHEKFVS 237
PP G G G I+D S +++ + +Y +L +
Sbjct: 181 TTRNATSPPSGGAGGGGGGGGGGIVGAGGANAYGMIVDVASTISFLETSLYDELADDLE- 239
Query: 238 YFERFQLAQLSDCPE-PIQLCYFLPETFNR----FPSMAFYFEDANLRIDGENVFIIDYE 292
E +L + + + LC+ LPE P+++ F+ L +D + +F+ D
Sbjct: 240 --EEIRLPRATPSLRLGLDLCFILPEGVGMDRVYVPTVSLSFDGRWLELDRDRLFVTDGR 297
Query: 293 NHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
++ V+++G+ Q ++ R +++L ++F K +C
Sbjct: 298 MMCLMIG---RTSGVSILGNFQLQNMRVLFNLRRGKITFAKASC 338
>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
Length = 496
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 92/382 (24%), Positives = 162/382 (42%), Gaps = 59/382 (15%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------IFDPRKSSSFQKINCDHPDCTYFK--- 50
++L IG+ K + I+DTGS + +FDP S S++++ C C +
Sbjct: 102 MQLGIGSLQKNLSAIIDTGSEAVLVQCGSRSRPVFDPAASQSYRQVPCISQLCLAVQQQT 161
Query: 51 -------CVNEQ--CVYTMKYADQSVTKGFAAHETISVIG-KGEGKAI-FHGALFGCSND 99
CVN C Y++ Y D + G + + I + G+A+ F FGC++
Sbjct: 162 SNGSSQPCVNSSATCTYSLSYGDSRNSTGDFSQDVIFLNSTNSSGQAVQFRDVAFGCAHS 221
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIK-KRFSYCLVIPLPNGEYTSSYLKFG 158
GF D G+L G++G +R +S SQL + +FSYC P + ++ + F
Sbjct: 222 PQGFLVDL--GSL-GIVGFNRGNLSLPSQLKDRLGGSKFSYCF--PSQPWQPRATGVIFL 276
Query: 159 TDMGYRRPSTQATKFINHP-----NNFYYLSLKDISIDNERMNFPPDTFDITVS-GEGGC 212
D G + T +++P + YY+ L IS+D + + P F + S G+GG
Sbjct: 277 GDSGLSKSKVGYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPSTGDGGT 336
Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP--ETFNRFPSM 270
++DSG+ T D Y F + R L + CY + + P +
Sbjct: 337 VLDSGTTFTRVVDDAYTAFRNAFAAS-NRSGLRKKVGAAAGFDDCYNISAGSSLPGVPEV 395
Query: 271 AFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVAL---------------IGSQQQ 315
++ N+R++ + +E+ F ++ A ++ V L +G+ QQ
Sbjct: 396 RLSLQN-NVRLE------LRFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKINVLGNYQQ 448
Query: 316 RDTRFVYDLNIDLLSFVKENCS 337
+ YD + F + +CS
Sbjct: 449 SNYLVEYDNERSRVGFERADCS 470
>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 629
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 91/364 (25%), Positives = 145/364 (39%), Gaps = 48/364 (13%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
RL+IGTP + LI+D+GS + Y F P SS++ + C DCT
Sbjct: 87 TRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKCSA-DCT 145
Query: 48 YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG--FDE 105
QC Y +YA+ S + G + +S + E K A+FGC N G F +
Sbjct: 146 -CDSDKSQCTYERQYAEMSSSSGVLGEDIVSFGTESELKP--QRAVFGCENSETGDLFSQ 202
Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCL-VIPLPNGEYTSSYLKFGTDMG 162
A G++GL R +S + QL +I FS C + + G + DM
Sbjct: 203 HAD-----GIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLGAMPAPPDMV 257
Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
+ R + + P +Y + LK+I + + + P FD + G ++DSG+ Y
Sbjct: 258 FSR-----SDPVRSP--YYNIELKEIHVAGKALRLDPRIFD----SKHGTVLDSGTTYAY 306
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF-----LPETFNRFPSMAFYFEDA 277
+ + S + + D P +C+ + + FP + F D
Sbjct: 307 LPEQAFVAFKDAVTSKVRPLKKIRGPD-PNYKDICFAGAGRNVSQLSQAFPDVDMVFGDG 365
Query: 278 N-LRIDGENVFI--IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
L + EN E + L D L+G R+T YD + + + F K
Sbjct: 366 QKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKT 425
Query: 335 NCSD 338
NCS+
Sbjct: 426 NCSE 429
>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 473
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 87/372 (23%), Positives = 156/372 (41%), Gaps = 58/372 (15%)
Query: 3 RLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCDH 43
++ +G+P K + +DTGS +++ ++FD SS+ +K+ CD
Sbjct: 77 KIKLGSPPKEYHVQVDTGSDILWVNCKPCPECPSKTNLNFHLSLFDVNASSTSKKVGCDD 136
Query: 44 PDCTYFKCVNE-----QCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGALFG 95
C++ + C Y + YAD+S ++G + ++ V G + + +FG
Sbjct: 137 DFCSFISQSDSCQPAVGCSYHIVYADESTSEGNFIRDKLTLEQVTGDLQTGPLGQEVVFG 196
Query: 96 CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYTSS 153
C +D G D A+ GV+G + S +SQL + K+ FS+CL G +
Sbjct: 197 CGSDQSG-QLGKSDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKGGGIFAVG 255
Query: 154 YLKFGTDMGYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEGGC 212
+ P + T + PN +Y + L + +D ++ PP ++ GG
Sbjct: 256 VVD--------SPKVKTTPMV--PNQMHYNVMLMGMDVDGTALDLPP-----SIMRNGGT 300
Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMA 271
I+DSG+ L YF +Y L E ++ Q +L + Q C+ E + FP ++
Sbjct: 301 IVDSGTTLAYFPKVLYDSLIETILAR----QPVKLHIVEDTFQ-CFSFSENVDVAFPPVS 355
Query: 272 FYFEDANLRIDGENVFIIDYENHFFLLA------VAPHDDLVALIGSQQQRDTRFVYDLN 325
F FED+ + ++ E + V L+G + VYDL
Sbjct: 356 FEFEDSVKLTVYPHDYLFTLEKELYCFGWQAGGLTTGERTEVILLGDLVLSNKLVVYDLE 415
Query: 326 IDLLSFVKENCS 337
+++ + NCS
Sbjct: 416 NEVIGWADHNCS 427
>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
Length = 464
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 88/368 (23%), Positives = 143/368 (38%), Gaps = 67/368 (18%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
+V + +GTP+ L +DTGS + + +FDP +SSS+ + C
Sbjct: 132 VVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSSYSAVPCAAA 191
Query: 45 DCTYFK-----CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
C+ C QC Y + Y D S T G + +T+++ G G LFGC +
Sbjct: 192 SCSQLALYSNGCSGGQCGYVVSYGDGSTTTGVYSSDTLTLTGSNA----LKGFLFGCGHA 247
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
G + G+LGL R S +SQ S FSYC LP + + Y+ G
Sbjct: 248 QQGLFA-----GVDGLLGLGRQGQSLVSQASSTYGGVFSYC----LPPTQNSVGYISLGG 298
Query: 160 DMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
ST ++ +Y + L IS+ + ++ F G ++D+G+V
Sbjct: 299 PSSTAGFSTTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVF------ASGAVVDTGTV 352
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP----IQLCYFLPETFNRF-----PSM 270
+T Y L F R +A P + CY F R+ P++
Sbjct: 353 VTRLPPTAYSALRSAF-----RAAMAPYGYPSAPATGILDTCY----DFTRYGTVTLPTI 403
Query: 271 AFYFEDANLRIDGENVFIIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFVYDLNIDL 328
+ F G + + LA AP D +++G+ QQR +D +
Sbjct: 404 SIAFGGGAAMDLGTSGILTS-----GCLAFAPTGGDSQASILGNVQQRSFEVRFDGST-- 456
Query: 329 LSFVKENC 336
+ F+ +C
Sbjct: 457 VGFMPASC 464
>gi|222637611|gb|EEE67743.1| hypothetical protein OsJ_25435 [Oryza sativa Japonica Group]
Length = 396
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 82/358 (22%), Positives = 148/358 (41%), Gaps = 39/358 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY------------AIFDPRKSSSFQKINCDHPDCTY 48
+VR +GTP++ +LL +DT + + + F+P S+S++ + C P C
Sbjct: 55 VVRARLGTPAQQLLLAVDTSNDAAWIPCSGCAGCPTSSPFNPAASASYRPVPCGSPQCVL 114
Query: 49 FKCVN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
+ + C +++ YAD S+ + +T++V G + FGC G
Sbjct: 115 APNPSCSPNAKSCGFSLSYADSSLQAALS-QDTLAVAGD-----VVKAYTFGCLQRATGT 168
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
+ R +SF+SQ + FSYCL P S L+ G +
Sbjct: 169 AAPPQGLLGL-----GRGPLSFLSQTKDMYGATFSYCL--PSFKSLNFSGTLRLGRNGQP 221
Query: 164 RRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
RR T H ++ YY+++ I + + ++ P + G ++DSG++ T
Sbjct: 222 RRIKTTPLLANPHRSSLYYVNMTGIRVGKKVVSIPASALAFDPATGAGTVLDSGTMFTRL 281
Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDG 283
+ VY L ++ A S CY T +P + F+ + +
Sbjct: 282 VAPVYLALRDEVRRRVGAGAAAVSSL--GGFDTCY---NTTVAWPPVTLLFDGMQVTLPE 336
Query: 284 ENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
ENV I LA+A D ++ +I S QQ++ R ++D+ + F +E+C+
Sbjct: 337 ENVVIHTTYGTTSCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARESCT 394
>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
Length = 441
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 84/323 (26%), Positives = 137/323 (42%), Gaps = 53/323 (16%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI----------------FDPRKSSSFQKINC---- 41
V L +GTP + V ++LDTGS L + + F PR S +F + C
Sbjct: 67 VSLAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASVPCGSAQ 126
Query: 42 ----DHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
D P ++QC ++ YAD S + G A E +V G+G + A FGC
Sbjct: 127 CRSRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVFTV---GQGPPLR--AAFGCM 181
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
FD A AG+LG++R +SF+SQ + +RFSYC+ + + L
Sbjct: 182 AT--AFDTSPDGVATAGLLGMNRGALSFVSQAST---RRFSYCI-----SDRDDAGVLLL 231
Query: 158 G-TDMGYR----RPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEGG 211
G +D+ + P Q + + + Y + L I + + + P +G G
Sbjct: 232 GHSDLPFLPLNYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQ 291
Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDC----PEPIQLCYFLPE---TF 264
++DSG+ T+ D Y L +F S + L L+D E C+ +P+
Sbjct: 292 TMVDSGTQFTFLLGDAYSALKAEF-SRQTKPWLPALNDPNFAFQEAFDTCFRVPQGRAPP 350
Query: 265 NRFPSMAFYFEDANLRIDGENVF 287
R P++ F A + + G+ +
Sbjct: 351 ARLPAVTLLFNGAQMTVAGDRLL 373
>gi|357456413|ref|XP_003598487.1| Peptidase A1, putative [Medicago truncatula]
gi|355487535|gb|AES68738.1| Peptidase A1, putative [Medicago truncatula]
Length = 414
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 82/358 (22%), Positives = 149/358 (41%), Gaps = 43/358 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY-----------AIFDPRKSSSFQKINCDHPDCTYF 49
+VR IGTP + +LL +DT + + +F P KS++F+ ++C P+C
Sbjct: 79 IVRAKIGTPPQTLLLAMDTSNDAAWIPCTACDGCASTLFAPEKSTTFKNVSCAAPECKQV 138
Query: 50 K---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
C C + + Y S+ +TI++ FGC + G
Sbjct: 139 PNPGCGVSSCNFNLTYGSSSIAANLV-QDTITL-----ATDPVPSYTFGCVSKTTGTSAP 192
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
+ R +S +SQ ++ + FSYCL P S L+ G +R
Sbjct: 193 PQGLLGL-----GRGPLSLLSQTQNLYQSTFSYCL--PSFKSLNFSGSLRLGPVAQPKR- 244
Query: 167 STQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
+ T + +P ++ YY++L+ I + + ++ PP + G I DSG+V T
Sbjct: 245 -IKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGAGTIFDSGTVFTRLV 303
Query: 225 SDVYWKLHEKFVSYF-ERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDG 283
+ VY + ++F + + L CY +P P++ F F N+ +
Sbjct: 304 APVYVAVRDEFRRRVGPKLTVTSLGG----FDTCYNVPIV---VPTITFIFTGMNVTLPQ 356
Query: 284 ENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+N+ I LA+A D ++ +I + QQ++ R +YD+ + +E C+
Sbjct: 357 DNILIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLYDVPNSRVGVARELCT 414
>gi|118482048|gb|ABK92955.1| unknown [Populus trichocarpa]
Length = 425
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 82/357 (22%), Positives = 143/357 (40%), Gaps = 42/357 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY-----------AIFDPRKSSSFQKINCDHPDCTYF 49
+V+ +GTP++ L+ LDT + + +F+ S++F+ + CD P C
Sbjct: 91 IVKANVGTPAQTFLMALDTSNDAAWIPCNGCVGCSSTVFNSVTSTTFKTLGCDAPQCKQV 150
Query: 50 ---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
C C + Y ++ +TI++ I G FGC G
Sbjct: 151 PNPTCGGSTCTWNTTYGGSTILSNLT-RDTIAL-----STDIVPGYTFGCIQKTTGSSVP 204
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
+ R +SF+SQ + K FSYCL P S L+ G R
Sbjct: 205 PQGLLGL-----GRGPLSFLSQTQDLYKSTFSYCL--PSFRTLNFSGTLRLGPAGQPLR- 256
Query: 167 STQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
+ T + +P ++ YY++L I + + ++ P + G I DSG+V T
Sbjct: 257 -IKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFNPTTGAGTIFDSGTVFTRLV 315
Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDGE 284
+ VY + ++F ++ L CY P P+M F F N+ + +
Sbjct: 316 APVYTAVRDEFRKRVGNAIVSSLGG----FDTCYTGPIV---APTMTFMFSGMNVTLPTD 368
Query: 285 NVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
N+ I LA+A D ++ +I + QQ++ R ++D+ + +E CS
Sbjct: 369 NLLIRSTAGSTSCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRIGVAREPCS 425
>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
Length = 469
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 96/368 (26%), Positives = 143/368 (38%), Gaps = 53/368 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDH- 43
+V L GTP+ +L++DTGS L + +FDP SS++ + C
Sbjct: 123 VVTLGFGTPAVPQVLLIDTGSDLSWVQCQPCNSSTCYPQKDPVFDPSASSTYAPVPCGSE 182
Query: 44 ------PDCTYFKCVNEQ-----CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGA 92
PD C N C Y ++Y + T G + ET+++ E + +
Sbjct: 183 ACRDLDPDSYANGCTNSSSGASLCQYGIQYGNGDTTVGVYSTETLTL--SPEAATVVNNF 240
Query: 93 LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS 152
FGC G + G+LGL S +SQ FSYC LP G T+
Sbjct: 241 SFGC-----GLVQKGVFDLFDGLLGLGGAPESLVSQTTGTYGGAFSYC----LPAGNSTA 291
Query: 153 SYLKFGTDM--GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
+L G G Q T FY + L IS+ ++++ P F G
Sbjct: 292 GFLALGAPATGGNNTAGFQFTPLQVVETTFYLVKLTGISVGGKQLDIEPTVF------AG 345
Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPS 269
G IIDSG+++T Y L F S + L +D E + CY F T P+
Sbjct: 346 GMIIDSGTIVTGLPETAYSALRTAFRSAMSAYPLLPPND-DEDLDTCYDFTGNTNVTVPT 404
Query: 270 MAFYFEDA-NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDL 328
+A FE + +D + ++D F A D +IG+ QR +YD
Sbjct: 405 VALTFEGGVTIDLDVPSGVLLDGCLAFVAGA---SDGDTGIIGNVNQRTFEVLYDSARGH 461
Query: 329 LSFVKENC 336
+ F C
Sbjct: 462 VGFRAGAC 469
>gi|224057272|ref|XP_002299201.1| predicted protein [Populus trichocarpa]
gi|118483775|gb|ABK93780.1| unknown [Populus trichocarpa]
gi|222846459|gb|EEE84006.1| predicted protein [Populus trichocarpa]
Length = 425
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 82/357 (22%), Positives = 143/357 (40%), Gaps = 42/357 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY-----------AIFDPRKSSSFQKINCDHPDCTYF 49
+V+ +GTP++ L+ LDT + + +F+ S++F+ + CD P C
Sbjct: 91 IVKANVGTPAQTFLMALDTSNDAAWIPCNGCVGCSSTVFNSVTSTTFKTLGCDAPQCKQV 150
Query: 50 ---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
C C + Y ++ +TI++ I G FGC G
Sbjct: 151 PNPTCGGSTCTWNTTYGGSTILSNLT-RDTIAL-----STDIVPGYTFGCIQKTTGSSVP 204
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
+ R +SF+SQ + K FSYCL P S L+ G R
Sbjct: 205 PQGLLGL-----GRGPLSFLSQTQDLYKSTFSYCL--PSFRTLNFSGTLRLGPAGQPLR- 256
Query: 167 STQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
+ T + +P ++ YY++L I + + ++ P + G I DSG+V T
Sbjct: 257 -IKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFNPTTGAGTIFDSGTVFTRLV 315
Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDGE 284
+ VY + ++F ++ L CY P P+M F F N+ + +
Sbjct: 316 APVYTAVRDEFRKRVGNAIVSSLGG----FDTCYTGPIV---APTMTFMFSGMNVTLPPD 368
Query: 285 NVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
N+ I LA+A D ++ +I + QQ++ R ++D+ + +E CS
Sbjct: 369 NLLIRSTAGSTSCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRIGVAREPCS 425
>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
Length = 633
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 95/366 (25%), Positives = 149/366 (40%), Gaps = 52/366 (14%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
R++IGTP + LI+DTGS L Y F P SS++Q + C +CT
Sbjct: 94 TRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCGKHQDPNFQPDWSSTYQPLKCSM-ECT 152
Query: 48 YFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
C +E CVY +YA+ S + G + +S + E K +FGC N G
Sbjct: 153 ---CDSEMMHCVYDRQYAEMSSSSGVLGEDIVSFGKQSELKP--QRTVFGCENVETG--- 204
Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM-- 161
D G++GL R +S + QL +I FS C Y + G +
Sbjct: 205 DIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLC---------YGGMDVGGGAMVLG 255
Query: 162 GYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
G P+ + + YY + LK+I I +++ P FD G+ G I+DSG+
Sbjct: 256 GISPPAGMVFTHSDPARSAYYNIDLKEIHIAGKQLPINPMVFD----GKYGTILDSGTTY 311
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF-----LPETFNRFPSMAFYFE 275
Y + + + +L Q D +C+ + + FP++ F
Sbjct: 312 AYLPEPAFKAFKDAIMKELNSLKLIQGPD-RNYNDICFSGVGSDVSQLSKTFPAVDLVFS 370
Query: 276 DAN-LRIDGENVFIIDYENH--FFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
+ N L + EN + H + L +D L+G R+T +YD + F
Sbjct: 371 NGNRLSLSPENYLFQHSKAHGAYCLGIFQNENDQTTLLGGIIVRNTLVMYDREHLKIGFW 430
Query: 333 KENCSD 338
K NCS+
Sbjct: 431 KTNCSE 436
>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 634
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 95/366 (25%), Positives = 149/366 (40%), Gaps = 52/366 (14%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
R++IGTP + LI+DTGS L Y F P SS++Q + C +CT
Sbjct: 94 TRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCGKHQDPNFQPDWSSTYQPLKCSM-ECT 152
Query: 48 YFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
C +E CVY +YA+ S + G + +S + E K +FGC N G
Sbjct: 153 ---CDSEMMHCVYDRQYAEMSSSSGVLGEDIVSFGKQSELKP--QRTVFGCENVETG--- 204
Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM-- 161
D G++GL R +S + QL +I FS C Y + G +
Sbjct: 205 DIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLC---------YGGMDVGGGAMVLG 255
Query: 162 GYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
G P+ + + YY + LK+I I +++ P FD G+ G I+DSG+
Sbjct: 256 GISPPAGMVFTHSDPARSAYYNIDLKEIHIAGKQLPINPMVFD----GKYGTILDSGTTY 311
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF-----LPETFNRFPSMAFYFE 275
Y + + + +L Q D +C+ + + FP++ F
Sbjct: 312 AYLPEPAFKAFKDAIMKELNSLKLIQGPD-RNYNDICFSGVGSDVSQLSKTFPAVDLVFS 370
Query: 276 DAN-LRIDGENVFIIDYENH--FFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
+ N L + EN + H + L +D L+G R+T +YD + F
Sbjct: 371 NGNRLSLSPENYLFQHSKAHGAYCLGIFQNENDQTTLLGGIIVRNTLVMYDREHLKIGFW 430
Query: 333 KENCSD 338
K NCS+
Sbjct: 431 KTNCSE 436
>gi|326515366|dbj|BAK03596.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 452
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 88/336 (26%), Positives = 145/336 (43%), Gaps = 27/336 (8%)
Query: 26 AIFDPRKSSSFQKINCDHPDCT--YFKCVNEQCVYTMKYA-DQSVTKGFAAHETISVIGK 82
++F+ S + I P C Y + +C + +K+ S +G + G
Sbjct: 121 SVFNTAASPHYHHIASTDPRCMAPYTRAGQGRCTFDVKFQYGDSRARGVLGSDDFVFDGS 180
Query: 83 GEGKAI--FHGALFGCSNDNHGF-DEDARDGALAGVLGLSRVTISFISQLGS--IIKKRF 137
G G I +G +FGC+++ H F + D AGV+ L+R SFI QL + + RF
Sbjct: 181 GPGSPISSVNGLVFGCAHNTHDFYNHDL----WAGVMSLNRHPTSFIRQLSARGLAAPRF 236
Query: 138 SYCLVIPLPNGEYTSSYLKFGTDM---GYRRPSTQATKFINHPNNFYYLSLKDISIDNER 194
SYCL +L+FG D+ + R + + YY+ + +S+ R
Sbjct: 237 SYCLASR--QHRDRRGFLRFGADIPDQSHARSTPLLHGDLAQGGGMYYVGVVGVSLGGRR 294
Query: 195 MN-FPPDTFDITV-SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE 252
+ P F++ S GGCIID G+ LT + Y L + +++ + P
Sbjct: 295 LTAITPVMFELNRRSLRGGCIIDVGTSLTLMATAPYHVLVAELIAHMRSRGVQHAIFSPG 354
Query: 253 PIQLCYFLPETFNR-FPSMAFYF----EDANLRIDGENVFI--IDYENHFFLLAVAPHDD 305
E+ +R PS+ +F E L I E +F+ + LA+ P+ +
Sbjct: 355 QKHCFRGKWESIHRHLPSVTLHFQFHPESVALFIRPELLFVAMTGERTDYVCLAIVPYAE 414
Query: 306 LVALIGSQQQRDTRFVYDLNIDLLSFVKENCSDDSA 341
+IG+ Q DTRF +DL + L F E C D++
Sbjct: 415 RT-IIGAGQMLDTRFTFDLQQNRLFFAPEQCHLDTS 449
>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 517
Score = 87.4 bits (215), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 87/364 (23%), Positives = 143/364 (39%), Gaps = 53/364 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
+V + +GTP+ ++ DTGS + +FDP +SS++ ++C P
Sbjct: 179 VVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPVRSSTYANVSCAAPA 238
Query: 46 CT---YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C+ C C+Y ++Y D S + GF A +T+++ G FGC N G
Sbjct: 239 CSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTL----SSYDAVKGFRFGCGERNEG 294
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
+A AG+LGL R S Q F++C LP + YL FG
Sbjct: 295 LFGEA-----AGLLGLGRGKTSLPVQTYDKYGGVFAHC----LPARSTGTGYLDFGAGSP 345
Query: 163 YRRPSTQATKFI--NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
+ T + N P FYY+ + I + + ++ P F G I+DSG+V+
Sbjct: 346 AAASARLTTPMLTDNGP-TFYYIGMTGIRVGGQLLSIPQSVFATA-----GTIVDSGTVI 399
Query: 221 TYFHSDVYWKLH-----EKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYF 274
T Y L +++ L D CY F + P+++ F
Sbjct: 400 TRLPPPAYSSLRYAFAAAMAARGYKKAPAVSLLD------TCYDFTGMSQVAIPTVSLLF 453
Query: 275 E-DANLRIDGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLNIDLLSFV 332
+ A L +D + + L A D V ++G+ Q + YD+ ++ F
Sbjct: 454 QGGARLDVDASGIMYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFY 513
Query: 333 KENC 336
C
Sbjct: 514 PGVC 517
>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
Length = 418
Score = 87.4 bits (215), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 99/369 (26%), Positives = 149/369 (40%), Gaps = 75/369 (20%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
+V L GTP + V L LDTGS + + +FDP SSSF + C P
Sbjct: 89 LVHLAAGTPPQEVQLTLDTGSDITWTQCKRCPASACFNQTLPLFDPSASSSFASLPCSSP 148
Query: 45 DC-TYFKC------VNEQCVYTMKYADQSVTKGFAAHETIS-VIGKGEG-KAIFHGALFG 95
C T C + C Y++ Y D SV++G E + G GEG A G +FG
Sbjct: 149 ACETTPPCGGGNDATSRPCNYSISYGDGSVSRGEIGREVFTFASGTGEGSSAAVPGLVFG 208
Query: 96 CSNDNHG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY 154
C + N G F + G+ G R ++S SQL FS+C G TS+
Sbjct: 209 CGHANRGVFTSNE-----TGIAGFGRGSLSLPSQL---KVGNFSHCFTT--ITGSKTSAV 258
Query: 155 LKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
L G G PS A+ ++ S S
Sbjct: 259 L-LGLP-GVAPPS--ASPLGRRRGSYRCRSTPRSS------------------------- 289
Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR--FPSMAF 272
+SG+ +T Y + E+F + + + + +P C+ P + P+MA
Sbjct: 290 NSGTSITSLPPRTYRAVREEFAAQVKLPVVP--GNATDPF-TCFSAPLRGPKPDVPTMAL 346
Query: 273 YFEDANLRIDGEN-VFII----DYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNID 327
+FE A +R+ EN VF + D N ++ +A + ++G+ QQ++ +YDL
Sbjct: 347 HFEGATMRLPQENYVFEVVDDDDAGNSSRIICLAVIEGGEIILGNIQQQNMHVLYDLQNS 406
Query: 328 LLSFVKENC 336
LSFV C
Sbjct: 407 KLSFVPAQC 415
>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 87.4 bits (215), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 82/318 (25%), Positives = 128/318 (40%), Gaps = 43/318 (13%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINC------ 41
V L +GTP + V ++LDTGS L + + F PR SS+F + C
Sbjct: 87 VSLAVGTPPQNVTMVLDTGSELSWLLCAPAGARNKFSAMSFRPRASSTFAAVPCASAQCR 146
Query: 42 --DHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
D P + +C ++ YAD S + G A + +V G G + A FGC +
Sbjct: 147 SRDLPSPPACDGASSRCSVSLSYADGSSSDGALATDVFAV---GSGPPLR--AAFGCMSS 201
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
FD A AG+LG++R +SF+SQ + +RFSYC+ G + T
Sbjct: 202 --AFDSSPDGVASAGLLGMNRGALSFVSQAST---RRFSYCISDRDDAGVLLLGHSDLPT 256
Query: 160 --DMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
+ Y A Y + L I + + + P +G G ++DSG
Sbjct: 257 FLPLNYTPMYQPALPLPYFDRVAYSVQLLGIRVGGKHLPIPASVLAPDHTGAGQTMVDSG 316
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP----EPIQLCYFLPETFN----RFPS 269
+ T+ D Y L +F R L L D E C+ +P+ + R P
Sbjct: 317 TQFTFLLGDAYSALKAEFTRQ-ARPLLPALDDPSFAFQEAFDTCFRVPQGRSPPTARLPG 375
Query: 270 MAFYFEDANLRIDGENVF 287
+ F A + + G+ +
Sbjct: 376 VTLLFNGAEMAVAGDRLL 393
>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 87.4 bits (215), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 77/271 (28%), Positives = 120/271 (44%), Gaps = 55/271 (20%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
+V L IGTP + +ILDTGS L + ++FDP SSSF + C+HP C
Sbjct: 83 LVSLPIGTPPQTQQMILDTGSQLSWIQCHKKVPRKPPPSSVFDPSLSSSFSVLPCNHPLC 142
Query: 47 TY--------FKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
C N C Y+ YAD ++ +G E I+ + + GC+
Sbjct: 143 KPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKITFSRSQSTPPL----ILGCA 198
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
++ DA+ G+LG++ +SF SQ +FSYC+ P + +
Sbjct: 199 EES----SDAK-----GILGMNLGRLSFASQAK---LTKFSYCV----PTRQVRPGFTPT 242
Query: 158 GTDMGYRRPSTQATKFIN---------HPNN---FYYLSLKDISIDNERMNFPPDTFDIT 205
G+ P++ ++IN PN Y ++++ I I N+++N P F
Sbjct: 243 GSFYLGENPNSGGFRYINLLTFSQSQRMPNLDPLAYTVAMQGIRIGNQKLNIPISAFRPD 302
Query: 206 VSGEGGCIIDSGSVLTYFHSDVYWKLHEKFV 236
SG G +IDSGS TY + Y K+ E+ V
Sbjct: 303 PSGAGQTMIDSGSEFTYLVDEAYNKVREEVV 333
>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
Length = 519
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 87/364 (23%), Positives = 142/364 (39%), Gaps = 53/364 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
+V + +GTP ++ DTGS + +FDP +SS++ ++C P
Sbjct: 181 VVTVGLGTPVSRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTYANVSCAAPA 240
Query: 46 CTYFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C+ C C+Y ++Y D S + GF A +T+++ G FGC N G
Sbjct: 241 CSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTL----SSYDAVKGFRFGCGERNEG 296
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
+A AG+LGL R S Q F++C LP + YL FG
Sbjct: 297 LFGEA-----AGLLGLGRGKTSLPVQTYDKYGGVFAHC----LPARSTGTGYLDFGAGSL 347
Query: 163 YRRPSTQATKFI--NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
+ T + N P FYY+ + I + + ++ P F G I+DSG+V+
Sbjct: 348 AAASARLTTPMLTDNGP-TFYYVGMTGIRVGGQLLSIPQSVFATA-----GTIVDSGTVI 401
Query: 221 TYFHSDVYWKLH-----EKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYF 274
T Y L +++ L D CY F + P+++ F
Sbjct: 402 TRLPPAAYSSLRYAFAAAMAARGYKKAPAVSLLD------TCYDFTGMSQVAIPTVSLLF 455
Query: 275 E-DANLRIDGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLNIDLLSFV 332
+ A L +D + + L A D V ++G+ Q + YD+ ++ F
Sbjct: 456 QGGARLDVDASGIMYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFY 515
Query: 333 KENC 336
C
Sbjct: 516 PGAC 519
>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
Length = 444
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 93/387 (24%), Positives = 156/387 (40%), Gaps = 70/387 (18%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI--------------------FDPRKSSSFQKINC 41
V L +GTP + V ++LDTGS L + + F PR S++F + C
Sbjct: 65 VSLAVGTPPQNVTMVLDTGSELSWLLCATGRQGSAAAGAAAAMGESFRPRASATFAAVPC 124
Query: 42 DHPDCTYF------KC--VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL 93
C+ C + QC ++ YAD S + G A + +V G+A +
Sbjct: 125 GSTQCSSRDLPAPPSCDGASRQCHVSLSYADGSASDGALATDVFAV-----GEAPPLRSA 179
Query: 94 FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSS 153
FGC + +D A AG+LG++R T+SF++Q + +RFSYC+ + +
Sbjct: 180 FGCMST--AYDSSPDGVATAGLLGMNRGTLSFVTQAST---RRFSYCI-----SDRDDAG 229
Query: 154 YLKFG-TDMGYR----RPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVS 207
L G +D+ + P Q T + + + Y + L I + + + P +
Sbjct: 230 VLLLGHSDLPFLPLNYTPLYQPTLPLPYFDRVAYSVQLLGIRVGGKALPIPASVLAPDHT 289
Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP-----EPIQLCYFL-- 260
G G ++DSG+ T+ D Y L +F+ + L + D P E + C+ +
Sbjct: 290 GAGQTMVDSGTQFTFLLGDAYSALKAEFLKQTK--PLLRALDDPSFAFQEALDTCFRVPA 347
Query: 261 --PETFNRFPSMAFYFEDANLRIDGENVFIIDYENH-----FFLLAVAPHDDLVAL---- 309
P R P + F A + + G+ + H + L + D+V L
Sbjct: 348 GRPPPSARLPPVTLLFNGAEMSVAGDRLLYKVPGEHRGADGVWCLTFG-NADMVPLTAYV 406
Query: 310 IGSQQQRDTRFVYDLNIDLLSFVKENC 336
IG Q + YDL + C
Sbjct: 407 IGHHHQMNLWVEYDLERGRVGLAPVKC 433
>gi|358345197|ref|XP_003636668.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355502603|gb|AES83806.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 294
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 67/220 (30%), Positives = 100/220 (45%), Gaps = 29/220 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
++ L IGTP + DTGS LI+ +FD + SS+F I C C
Sbjct: 60 LMELSIGTPPVKIYAQADTGSDLIWLQCIPCTNCYKQLNPMFDSQSSSTFSNIACGSESC 119
Query: 47 TYF---KCVNEQ--CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
+ C +Q C Y Y D S T+G A ET+++ F G +FGC ++N+
Sbjct: 120 SKLYSTSCSPDQINCKYNYSYVDGSETQGVLAQETLTLTSTTGEPVAFKGVIFGCGHNNN 179
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSII-KKRFSYCLVIPLPNGEYTSSYLKFGTD 160
G D G++GL R +S +SQ+GS + FS CLV P SS + FG
Sbjct: 180 GAFNDKE----MGIIGLGRGPLSLVSQIGSSLGGNMFSQCLV-PFNTNPSISSPMSFGKG 234
Query: 161 MGYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFP 198
+T ++ +FY+++L IS+ E +N P
Sbjct: 235 SEVLGNGVVSTPLVSKTTYQSFYFVTLLGISV--EDINLP 272
>gi|326521034|dbj|BAJ92880.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 448
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 79/358 (22%), Positives = 144/358 (40%), Gaps = 41/358 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY------------AIFDPRKSSSFQKINCDHPDCTY 48
+VR +GTP + +LL +DT + + F+P S S++ + C P C+
Sbjct: 109 VVRARLGTPPQQLLLAVDTSNDAAWIPCSGCAGCPTTTPFNPAASKSYRAVPCGSPACSR 168
Query: 49 F---KCV--NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
C + C +++ YAD S+ + ++++V + FGC G
Sbjct: 169 APNPSCSLNTKSCGFSLTYADSSLEAALS-QDSLAVAND-----VVKSYTFGCLQKATGT 222
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
+ R +SF+SQ + + FSYCL P S L+ G
Sbjct: 223 ATPPQGLLGL-----GRGPLSFLSQTKDMYEGTFSYCL--PSFKSLNFSGTLRLGRKGQP 275
Query: 164 RRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
R T H ++ YY+S+ I + + + PP + G ++DSG++ T
Sbjct: 276 LRIKTTPLLVNPHRSSLYYVSMTGIRVGKKVVPIPPAALAFDPATGAGTVLDSGTMFTRL 335
Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDG 283
+ Y + ++ L+ L CY T ++P + F F + +
Sbjct: 336 VAPAYVAVRDEVRRRIRGAPLSSLGG----FDTCY---NTTVKWPPVTFMFTGMQVTLPA 388
Query: 284 ENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+N+ I LA+A D ++ +I S QQ++ R ++D+ + F +E C+
Sbjct: 389 DNLVIHSTYGTTSCLAMAAAPDGVNTVLNVIASMQQQNHRILFDVPNGRVGFAREQCT 446
>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
gi|224030089|gb|ACN34120.1| unknown [Zea mays]
gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
Length = 491
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 94/377 (24%), Positives = 161/377 (42%), Gaps = 69/377 (18%)
Query: 6 IGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCDHPDC 46
+GTP K + +DTGS +++ ++DP+ SS+ + CD C
Sbjct: 92 LGTPPKHYYVQVDTGSDILWVNCITCEQCPHKSGLGLDLTLYDPKASSTGSMVMCDQAFC 151
Query: 47 TYF------KC-VNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGALFGC 96
KC N C Y++ Y D S T G + + V G+ + +FGC
Sbjct: 152 AATFGGKLPKCGANVPCEYSVTYGDGSSTIGSFVTDALQFDQVTRDGQTQPANASVIFGC 211
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYTSSY 154
G D + + AL G+LG S +SQL + +KK F++CL G ++
Sbjct: 212 GA-QQGGDLGSSNQALDGILGFGEANTSMLSQLTTAGKVKKIFAHCLDTIKGGGIFSIGD 270
Query: 155 LKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE-GGC 212
+ +P + T + + P+ Y ++LK I + + P F+ GE G
Sbjct: 271 V--------VQPKVKTTPLVADKPH--YNVNLKTIDVGGTTLQLPAHIFE---PGEKKGT 317
Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQ--LCYFLPETF-NRFPS 269
IIDSG+ LTY V+ E ++ F + Q D +Q LC+ P + + FP+
Sbjct: 318 IIDSGTTLTYLPELVF---KEVMLAVFNKHQDITFHD----VQGFLCFQYPGSVDDGFPT 370
Query: 270 MAFYFE-DANLRI--------DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRF 320
+ F+FE D L + +G +V+ + ++N + + + L+G +
Sbjct: 371 ITFHFEDDLALHVYPHEYFFANGNDVYCVGFQNG---ASQSKDGKDIVLMGDLVLSNKLV 427
Query: 321 VYDLNIDLLSFVKENCS 337
+YDL ++ + NCS
Sbjct: 428 IYDLENRVIGWTDYNCS 444
>gi|222624645|gb|EEE58777.1| hypothetical protein OsJ_10300 [Oryza sativa Japonica Group]
Length = 431
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 94/386 (24%), Positives = 153/386 (39%), Gaps = 73/386 (18%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAIFDPRKSSSFQKIN--------------CDHPDCT 47
V + +GTP + V ++LDTGS L + + + + + + CD P
Sbjct: 57 VPVAVGTPPQNVTMVLDTGSELSWLLCNGSYAPPLTRRSTRRWRGRDLPVPPFCDTPP-- 114
Query: 48 YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC--------SND 99
+ C ++ YAD S G A +T + G A+ GA FGC + +
Sbjct: 115 -----SNACRVSLSYADASSADGVLATDTFLLTGGAPPVAV--GAYFGCITSYSSTTATN 167
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
++G D + A G+LG++R T+SF++Q G+ +RF+YC+ GE L G
Sbjct: 168 SNGTGTDVSEAA-TGLLGMNRGTLSFVTQTGT---RRFAYCIA----PGE-GPGVLLLGD 218
Query: 160 DMGYRRPSTQATKF-INHPNNF-----YYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
D G P I+ P + Y + L+ I + + P +G G +
Sbjct: 219 DGGVAPPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVGCALLPIPKSVLTPDHTGAGQTM 278
Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP-------IQLCYFLPE---- 262
+DSG+ T+ +D Y L +F S R LA L EP C+ PE
Sbjct: 279 VDSGTQFTFLLADAYAALKAEFTSQ-ARLLLAPLG---EPGFVFQGAFDACFRGPEARVA 334
Query: 263 -TFNRFPSMAFYFEDANLRIDGEN-VFIIDYENHFFLLAVA------PHDDLVAL----I 310
P + A + + GE ++++ E A A + D+ + I
Sbjct: 335 AASGLLPEVGLVLRGAEVAVSGEKLLYMVPGERRGEGGAEAVWCLTFGNSDMAGMSAYVI 394
Query: 311 GSQQQRDTRFVYDLNIDLLSFVKENC 336
G Q++ YDL + F C
Sbjct: 395 GHHHQQNVWVEYDLQNGRVGFAPARC 420
>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
Length = 462
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 90/358 (25%), Positives = 141/358 (39%), Gaps = 70/358 (19%)
Query: 6 IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYFKC 51
+GTP LL+LDTGS +++ +FDPR+S S+ + C P C
Sbjct: 148 VGTPPTPALLVLDTGSDVVWLQCAPCRQCYAQSGRVFDPRRSRSYAAVRCGAPPCRGLDA 207
Query: 52 VNE--------QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
C+Y + Y D SVT G A ET+ G + A+ GC +DN G
Sbjct: 208 GGGGGCDRRRGTCLYQVAYGDGSVTAGDLATETLWF---ARGARVPRVAV-GCGHDNEGL 263
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
A R +S +Q +RFSYC G+D+ +
Sbjct: 264 FVAAAGLLGL-----GRGRLSLPTQTARRYGRRFSYCFQ---------------GSDLDH 303
Query: 164 RRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
R I + + + + + P T G GG I+DSG+ +T
Sbjct: 304 R--------TIIRTVHQHVGGARVRGVGERSLRLDPST------GRGGVILDSGTSVTRL 349
Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL---CYFL-PETFNRFPSMAFYFED-AN 278
VY + E F + +LA P L CY L + P+++ + A
Sbjct: 350 ARPVYVAVREAFRAAAGGLRLA-----PGGFSLFDTCYDLRGRRVVKVPTVSVHLAGGAE 404
Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ + EN I F LA+A D V+++G+ QQ+ R V+D + ++ V ++C
Sbjct: 405 VALPPENYLIPVDTRGTFCLALAGTDGGVSIVGNIQQQGFRVVFDGDRQRVALVPKSC 462
>gi|222632750|gb|EEE64882.1| hypothetical protein OsJ_19741 [Oryza sativa Japonica Group]
Length = 456
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 102/359 (28%), Positives = 155/359 (43%), Gaps = 58/359 (16%)
Query: 6 IGTPSKGVLLILDTGSALIYA---IFDP-----RKSSSF-------QKINCDHPDCTYFK 50
+GTP+ L++LDTGS +++A P R+ SS + NC P C
Sbjct: 128 VGTPATTALMVLDTGSDVVWAPVRALPPLLRAVRQGSSTGAAPAPTPRWNCVAPICRRLD 187
Query: 51 CVN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
C+Y + Y D SVT G A ET++ +G A GC +DN G
Sbjct: 188 SAGCDRRRNSCLYQVAYGDGSVTAGDFASETLT-FARG---ARVQRVAIGCGHDNEGLFI 243
Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
A +G+LGL R +SF SQ+ + FSYCLV + S GT R
Sbjct: 244 -----AASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSRRARPSRRWGGTP----R 294
Query: 166 PSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITV---SGEGGCIIDSGSVLTY 222
+T FYY+ L S+ R+ + D+ + +G GG I+DSG+ +T
Sbjct: 295 MAT-----------FYYVHLLGFSVGGARVKGVSQS-DLRLNPTTGRGGVILDSGTSVTR 342
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL---CYFLP-ETFNRFPSMAFYFE-DA 277
VY + + F R L P L CY L + P+++ + A
Sbjct: 343 LARPVYEAVRDAF-----RAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMHLAGGA 397
Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
++ + EN I + F A+A D V++IG+ QQ+ R V+D + + FV ++C
Sbjct: 398 SVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGFVPKSC 456
>gi|255565759|ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 447
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 92/385 (23%), Positives = 155/385 (40%), Gaps = 67/385 (17%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY------------------AIFDPRKSSSFQKINCDH 43
+ L GTP + + ++DTGS+ ++ + F P+ SSS + I C +
Sbjct: 79 ISLSFGTPPQTLSFVMDTGSSFVWFPCTLRYLCNNCSFTSRISPFLPKHSSSSKIIGCKN 138
Query: 44 PDCTYFKCVNEQCV---------------YTMKYADQSVTKGFAAHETISVIGKGEGKAI 88
P C++ + +C Y + Y T G A ET+ + G I
Sbjct: 139 PKCSWIHQTDLRCTDCDNNSRNCSQICPPYLILYGS-GTTGGVALSETLHLHG-----LI 192
Query: 89 FHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPL--P 146
L GCS F AG+ G R S SQLG +FSYCL+
Sbjct: 193 VPNFLVGCSV----FSSRQP----AGIAGFGRGPSSLPSQLG---LTKFSYCLLSHKFDD 241
Query: 147 NGEYTSSYLKFGTDMGYRRPSTQATKFINHP--------NNFYYLSLKDISIDNERMNFP 198
E +S L +D + + T + +P + +YY+SL+ ISI + P
Sbjct: 242 TQESSSLVLDSQSDSDKKTAALMYTPLVKNPKVQDKPAFSVYYYVSLRRISIGGRSVKIP 301
Query: 199 PDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY 258
G GG IIDSG+ TY ++ + L +F+S + ++ A + + ++ C+
Sbjct: 302 YKYLSPDKDGNGGTIIDSGTTFTYMSTEAFEILSNEFISQVKNYERALMVEALSGLKPCF 361
Query: 259 FLPETFN-RFPSMAFYFE---DANLRIDGENVFIIDYENHFFLLAVAPHDDLVA---LIG 311
+ P + +F+ D L ++ F+ E F + + ++G
Sbjct: 362 NVSGAKELELPQLRLHFKGGADVELPLENYFAFLGSREVACFTVVTDGAEKASGPGMILG 421
Query: 312 SQQQRDTRFVYDLNIDLLSFVKENC 336
+ Q ++ YDL + L F KE+C
Sbjct: 422 NFQMQNFYVEYDLQNERLGFKKESC 446
>gi|388508518|gb|AFK42325.1| unknown [Lotus japonicus]
Length = 204
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 58/207 (28%), Positives = 104/207 (50%), Gaps = 12/207 (5%)
Query: 134 KKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPN--NFYYLSLKDISID 191
+ +FSYCL + +S L G+ + +T + +P+ +FYYLSL+ I +
Sbjct: 3 EAKFSYCLT---SMDDSKASVLLLGS-LAKATKDAISTPLLTNPSQPSFYYLSLEGIPVG 58
Query: 192 NERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP 251
+++ FD++ G GG IIDSG+ +TY V+ L ++F+S QL + S
Sbjct: 59 GTQLSIEQSIFDVSDDGSGGVIIDSGTTITYLEKSVFDTLKKEFISQ-SNLQLDKSSST- 116
Query: 252 EPIQLCYFLPE--TFNRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVAL 309
+ +C+ LP T P + F+F+ +L + E+ I D + LA+ + + ++
Sbjct: 117 -GLDVCFSLPSETTQVEVPKLVFHFKGGDLELPAESYMIADSKLGVACLAMGASNGM-SI 174
Query: 310 IGSQQQRDTRFVYDLNIDLLSFVKENC 336
G+ QQ++ +DL + +SFV C
Sbjct: 175 FGNVQQQNILVNHDLEKETISFVPTQC 201
>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
Length = 494
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 94/381 (24%), Positives = 155/381 (40%), Gaps = 69/381 (18%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
R+ IGTP+K + +DTGS +++ ++DPR S S + + CD
Sbjct: 92 TRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELVTCD 151
Query: 43 H-----------PDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAI 88
P CT C Y++ Y D S T GF + + V G G+
Sbjct: 152 QQFCVANYGGVLPSCTS----TSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTPA 207
Query: 89 FHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLP 146
FGC G D + + AL G+LG + S +SQL + ++K F++CL
Sbjct: 208 NASVSFGCGA-KLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTVNG 266
Query: 147 NGEYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITV 206
G + + +P + T ++ + Y + LK I + + P + FD
Sbjct: 267 GGIFAIGNV--------VQPKVKTTPLVSDMPH-YNVILKGIDVGGTALGLPTNIFDSGN 317
Query: 207 SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFN 265
S G IIDSG+ L Y VY L + + L D C+ + +
Sbjct: 318 S--KGTIIDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTLQDFS-----CFQYSGSVDD 370
Query: 266 RFPSMAFYFE-DANLRI--------DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQR 316
FP + F+FE D +L + +G+N++ + ++N + D+V L+G
Sbjct: 371 GFPEVTFHFEGDVSLIVSPHDYLFQNGKNLYCMGFQNGG--VQTKDGKDMV-LLGDLVLS 427
Query: 317 DTRFVYDLNIDLLSFVKENCS 337
+ +YDL + + NCS
Sbjct: 428 NKLVLYDLENQAIGWADYNCS 448
>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
Length = 332
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 80/355 (22%), Positives = 151/355 (42%), Gaps = 57/355 (16%)
Query: 15 LILDTGSALIY---------------AIFDPRKSSSFQKINCDHPDCTYFKCV------- 52
+ILDTGS+L + ++DP S +++K++C +C+ K
Sbjct: 1 MILDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKTYKKLSCASVECSRLKAATLNDPLC 60
Query: 53 ---NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARD 109
+ C+YT Y D S + G+ + + +++ +GC DN G
Sbjct: 61 ETDSNACLYTASYGDTSFSIGYLSQDLLTL----TSSQTLPQFTYGCGQDNQGLF----- 111
Query: 110 GALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQ 169
G AG++GL+R +S ++QL + FSYC LP SS F + S +
Sbjct: 112 GRAAGIIGLARDKLSMLAQLSTKYGHAFSYC----LPTANSGSSGGGFLSIGSISPTSYK 167
Query: 170 ATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDV 227
T + N Y+L L I++ ++ + + +IDSG+V+T +
Sbjct: 168 FTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAMYRVPT------LIDSGTVITRLPMSM 221
Query: 228 YWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLP-ETFNRFPSMAFYFE-DANLRIDG 283
Y L + FV + + + P + C+ ++ + P + F+ A+L +
Sbjct: 222 YAALRQAFV----KIMSTKYAKAPAYSILDTCFKGSLKSISAVPEIKMIFQGGADLTLRA 277
Query: 284 ENVFIIDYENHFFLLAVAPHD--DLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
++ +I+ + LA A + +A+IG++QQ+ YD++ + F +C
Sbjct: 278 PSI-LIEADKGITCLAFAGSSGTNQIAIIGNRQQQTYNIAYDVSTSRIGFAPGSC 331
>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
Length = 631
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 94/369 (25%), Positives = 150/369 (40%), Gaps = 58/369 (15%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
RL+IGTPS+ LI+D+GS + Y F P SS++ + C+ DCT
Sbjct: 93 TRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQDPRFQPDLSSTYSPVKCNV-DCT 151
Query: 48 YFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG--F 103
C NE QC Y +YA+ S + G + +S + E K A+FGC N G F
Sbjct: 152 ---CDNERSQCTYERQYAEMSSSSGVLGEDIMSFGKESELKP--QRAVFGCENTETGDLF 206
Query: 104 DEDARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
+ A G++GL R +S + QL +I FS C Y + GT +
Sbjct: 207 SQHAD-----GIMGLGRGQLSIMDQLVEKGVISDSFSLC---------YGGMDVGGGTMV 252
Query: 162 GYRRPSTQATKFINHPN----NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
P+ F +H N +Y + LK+I + + + P F+ + G ++DSG
Sbjct: 253 LGGMPAPPDMVF-SHSNPVRSPYYNIELKEIHVAGKALRLDPKIFN----SKHGTVLDSG 307
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF-----LPETFNRFPSMAF 272
+ Y + + + + + D P +C+ + + FP +
Sbjct: 308 TTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPD-PNYKDICFAGAGRNVSQLSEVFPDVDM 366
Query: 273 YFEDAN-LRIDGENVFI--IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
F + L + EN E + L D L+G R+T YD + + +
Sbjct: 367 VFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKI 426
Query: 330 SFVKENCSD 338
F K NCS+
Sbjct: 427 GFWKTNCSE 435
>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
Length = 494
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 95/382 (24%), Positives = 156/382 (40%), Gaps = 71/382 (18%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
R+ IGTP+K + +DTGS +++ ++DPR S S + + CD
Sbjct: 92 TRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELVTCD 151
Query: 43 H-----------PDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAI 88
P CT C Y++ Y D S T GF + + V G G+
Sbjct: 152 QQFCVANYGGVLPSCTS----TSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTPA 207
Query: 89 FHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLP 146
FGC G D + + AL G+LG + S +SQL + ++K F++CL
Sbjct: 208 NASVSFGCGA-KLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTVNG 266
Query: 147 NGEYTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDIT 205
G + + +P + T + + P+ Y + LK I + + P + FD
Sbjct: 267 GGIFAIGNV--------VQPKVKTTPLVPDMPH--YNVILKGIDVGGTALGLPTNIFDSG 316
Query: 206 VSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETF 264
S G IIDSG+ L Y VY L + + L D C+ +
Sbjct: 317 NS--KGTIIDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTLQDFS-----CFQYSGSVD 369
Query: 265 NRFPSMAFYFE-DANLRI--------DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQ 315
+ FP + F+FE D +L + +G+N++ + ++N + D+V L+G
Sbjct: 370 DGFPEVTFHFEGDVSLIVSPHDYLFQNGKNLYCMGFQNGG--VQTKDGKDMV-LLGDLVL 426
Query: 316 RDTRFVYDLNIDLLSFVKENCS 337
+ +YDL + + NCS
Sbjct: 427 SNKLVLYDLENQAIGWADYNCS 448
>gi|326513976|dbj|BAJ92138.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 342
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 73/267 (27%), Positives = 119/267 (44%), Gaps = 43/267 (16%)
Query: 94 FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSS 153
FGC + G A +G++GLS T+S ISQL RFSYCL P E +S
Sbjct: 96 FGCGALSAGSLVGA-----SGLMGLSPGTMSLISQLSV---PRFSYCLT---PFAERKTS 144
Query: 154 YLKFGTDMGYRRPST----QATKFINHPNN---FYYLSLKDISIDNERMNFPPDTFDITV 206
+ FG R+ +T Q T + +P +YY+ L +S+ +R+ P + I
Sbjct: 145 PMLFGAMADLRKYNTTGPIQTTAILRNPAMDTFYYYVPLVGLSLGTKRLRVPAASLAINP 204
Query: 207 SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN- 265
G GG I+DSGS + + + + + + E +L + E +LC+ +P
Sbjct: 205 DGTGGTIVDSGSTMAHLAGKAFDAVKK---AVLEAVKLPVFNGTVEDYELCFAVPSGVAM 261
Query: 266 ---RFPSMAFYFEDANLRIDGENVFIIDYENHF-------FLLAVA--PHD--DLVALIG 311
+ P + +F DG + +N+F LAVA P D +++IG
Sbjct: 262 AAVKTPPLVLHF-------DGGAAMALPRDNYFQEPRAGLMCLAVARSPEDLGAPISIIG 314
Query: 312 SQQQRDTRFVYDLNIDLLSFVKENCSD 338
+ QQ++ ++D++ SF C D
Sbjct: 315 NVQQQNMHVLFDVHNQKFSFAPTKCHD 341
>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 387
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 103/368 (27%), Positives = 158/368 (42%), Gaps = 57/368 (15%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
+V++ +GTP + L LDTGS + + FDPRKSSS++ ++C
Sbjct: 46 LVKMALGTPKLSLSLALDTGSDITWTQCEPCVGSCYRQAQTKFDPRKSSSYKNVSCSSSS 105
Query: 46 CTYFK-------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSN 98
C CV+ C+Y ++Y D S + GF A E +++ + LFGC
Sbjct: 106 CRIITDSGGARGCVSSTCIYKVQYGDGSYSVGFFATEKLTI----SPSDVISNFLFGCGQ 161
Query: 99 DNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
N G R G +AG+LGL R +S Q F+YCL ++SS
Sbjct: 162 QNAG-----RFGRIAGLLGLGRGKLSLALQTSEKYNNLFTYCLP------SFSSSSTGHL 210
Query: 159 TDMGYRRPSTQAT----KFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
T G S + T F N P FY + +K +S+ + D +V G II
Sbjct: 211 TLGGQVPKSVKFTPLSPAFKNTP--FYGIDIKGLSVGGHVL-----PIDASVFSNAGAII 263
Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFY 273
DSG+V+T VY L KF + + +D + CY F P ++F+
Sbjct: 264 DSGTVITRLQPTVYSALSSKFQQLMKDY---PKTDGFSILDTCYDFSGNESISVPRISFF 320
Query: 274 FEDANLRIDGENVFIIDYENHF--FLLAVAPHDD--LVALIGSQQQRDTRFVYDLNIDLL 329
F+ + +D + I+ N + LA AP+DD + G+ QQ+ V+DL +
Sbjct: 321 FK-GGVEVDIKFFGILTVINAWDKVCLAFAPNDDDGDFVVFGNSQQQTYDVVHDLAKGRI 379
Query: 330 SFVKENCS 337
F C+
Sbjct: 380 GFAPSGCN 387
>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
Length = 506
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 98/385 (25%), Positives = 154/385 (40%), Gaps = 69/385 (17%)
Query: 6 IGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCDH--- 43
+GTP K + +DTGS +++ +DP+ SSS ++CD
Sbjct: 93 LGTPPKRYYVQVDTGSDILWVNCISCSKCPRKSGLGLDLTFYDPKASSSGSTVSCDQGFC 152
Query: 44 --------PDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGA 92
P CT N C Y++ Y D S T GF + + V G G+ +
Sbjct: 153 AATYGGKLPGCT----ANVPCEYSVMYGDGSSTTGFFITDALQFDQVTGDGQTQPGNATI 208
Query: 93 LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCL-------VI 143
FGC G D + AL G+LG + S +SQL + KK F++CL +
Sbjct: 209 TFGCGAQQGG-DLGNSNQALDGILGFGQANTSMLSQLAAAGKAKKIFAHCLDTIKGGGIF 267
Query: 144 PLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFD 203
+ N Y F G I Y ++LK I + + P F+
Sbjct: 268 AIGNVVQPKCYFVFFFAHGLLNIPLFLLVMILLSRPHYNVNLKSIDVGGTTLQLPAHVFE 327
Query: 204 ITVSGE-GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLP 261
+GE G IIDSG+ LTY V+ ++ + S L D LC+ +
Sbjct: 328 ---TGEKKGTIIDSGTTLTYLPELVFKQVMDVVFSKHRDIAFHNLQDF-----LCFQYSG 379
Query: 262 ETFNRFPSMAFYFE-DANLRI--------DGENVFIIDYENHFFLLAVAPHDDLVALIGS 312
+ FP++ F+FE D L + +G +++ + ++N L D+V L+G
Sbjct: 380 SVDDGFPTITFHFEDDLALHVYPHEYFFPNGNDIYCVGFQNG--ALQSKDGKDIV-LMGD 436
Query: 313 QQQRDTRFVYDLNIDLLSFVKENCS 337
+ VYDL ++ + NCS
Sbjct: 437 LVLSNKLVVYDLENQVIGWTDYNCS 461
>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 394
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 82/311 (26%), Positives = 132/311 (42%), Gaps = 50/311 (16%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
R++IGTP + LI+DTGS + Y F+P SS++Q ++C+ DCT
Sbjct: 92 TRIWIGTPPQTFALIVDTGSTVTYVPCSTCEQCGRHQDPKFEPELSSTYQPVSCNI-DCT 150
Query: 48 YFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
C NE QCVY +YA+ S + G + IS + E + A+FGC N G
Sbjct: 151 ---CDNERKQCVYERQYAEMSSSSGVLGEDIISFGNQSE--LVPQRAIFGCENQETG--- 202
Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD-MG 162
D G++GL R +S + QL +I FS C Y + G +G
Sbjct: 203 DLYSQRADGIMGLGRGDLSIVDQLVEKGVISDSFSLC---------YGGMDIGGGAMILG 253
Query: 163 YRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
P + + P + +Y + LK I + ++++ P FD G+ G ++DSG+
Sbjct: 254 GISPPSGMVFAESDPVRSQYYNIDLKAIHVAGKQLHLDPSIFD----GKHGTVLDSGTTY 309
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET-----FNRFPSMAFYFE 275
Y + + + + D P +C+ E+ N FP++ F
Sbjct: 310 AYLPEAAFTAFKDAMMKELTSLKQIHGPD-PNYNDICFSGAESDVSQLSNTFPAVEMVFS 368
Query: 276 DAN-LRIDGEN 285
+ L + EN
Sbjct: 369 NGQKLSLSPEN 379
>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
Length = 475
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 86/372 (23%), Positives = 153/372 (41%), Gaps = 56/372 (15%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
++ +G+P K + +DTGS +++ ++FD SS+ +K+ CD
Sbjct: 76 TKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVGCD 135
Query: 43 HPDCTYFKCVNE-----QCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGALF 94
C++ + C Y + YAD+S + G + ++ V G + + +F
Sbjct: 136 DDFCSFISQSDSCQPALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEVVF 195
Query: 95 GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYTS 152
GC +D G + D A+ GV+G + S +SQL + K+ FS+CL G +
Sbjct: 196 GCGSDQSGQLGNG-DSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKGGGIFAV 254
Query: 153 SYLKFGTDMGYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEGG 211
+ P + T + PN +Y + L + +D ++ P ++ GG
Sbjct: 255 GVVD--------SPKVKTTPMV--PNQMHYNVMLMGMDVDGTSLDLPR-----SIVRNGG 299
Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMA 271
I+DSG+ L YF +Y L E ++ Q +L E Q F FP ++
Sbjct: 300 TIVDSGTTLAYFPKVLYDSLIETILAR----QPVKLHIVEETFQCFSFSTNVDEAFPPVS 355
Query: 272 FYFEDANLRIDGENVFIIDYENHFFLLA------VAPHDDLVALIGSQQQRDTRFVYDLN 325
F FED+ + ++ E + V L+G + VYDL+
Sbjct: 356 FEFEDSVKLTVYPHDYLFTLEEELYCFGWQAGGLTTDERSEVILLGDLVLSNKLVVYDLD 415
Query: 326 IDLLSFVKENCS 337
+++ + NCS
Sbjct: 416 NEVIGWADHNCS 427
>gi|357502759|ref|XP_003621668.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355496683|gb|AES77886.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 481
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 93/378 (24%), Positives = 159/378 (42%), Gaps = 60/378 (15%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
++ IGTPSK L +DTG+ +++ +++ ++SSS + + CD
Sbjct: 75 AKIGIGTPSKDYYLQVDTGTDMMWVNCIQCKECPTRSNLGMDLTLYNIKESSSGKLVPCD 134
Query: 43 HPDCTYFK------CV---NEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFH 90
C C N+ C Y Y D S T G+ + + V G + +
Sbjct: 135 QELCKEINGGLLTGCTSKTNDSCPYLEIYGDGSSTAGYFVKDVVLFDQVSGDLKTASANG 194
Query: 91 GALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNG 148
+FGC G + + AL G+LG + S ISQL S +KK F++CL NG
Sbjct: 195 SVIFGCGARQSGDLSYSNEEALDGILGFGKANYSMISQLSSSGKVKKMFAHCL-----NG 249
Query: 149 EYTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVS 207
G + +P+ T + + P+ Y +++ I + + +N D + S
Sbjct: 250 VNGGGIFAIGHVV---QPTVNTTPLLPDQPH--YSVNMTAIQVGHTFLNLSTDASEQRDS 304
Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRF 267
G IIDSG+ L Y +Y L K +S ++ L D Q + + F
Sbjct: 305 --KGTIIDSGTTLAYLPDGIYQPLVYKILSQQPNLKVQTLHDEYTCFQ---YSGSVDDGF 359
Query: 268 PSMAFYFEDA-NLRI-------DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTR 319
P++ FYFE+ +L++ EN++ I ++N A + + L+G +
Sbjct: 360 PNVTFYFENGLSLKVYPHDYLFLSENLWCIGWQNSG---AQSRDSKNMTLLGDLVLSNKL 416
Query: 320 FVYDLNIDLLSFVKENCS 337
YDL ++ + + NCS
Sbjct: 417 VFYDLENQVIGWTEYNCS 434
>gi|302141829|emb|CBI19032.3| unnamed protein product [Vitis vinifera]
Length = 382
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 76/252 (30%), Positives = 117/252 (46%), Gaps = 25/252 (9%)
Query: 94 FGCSNDNH--GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT 151
FGC +N G D+ A L R +S +SQLG+ ++FSYCL E
Sbjct: 144 FGCGVNNRATGMDQTAGLLGLG------RGVLSLVSQLGT---QKFSYCLT---SIHENK 191
Query: 152 SSYLKFGTDM--GYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVS 207
+S L FG+ + T I +P ++YYL+LK I++ + P F +
Sbjct: 192 TSSLLFGSLAYSNFNPGKIPRTPLIQNPFLPSYYYLALKGITVGYTLLPIPEFAFQLGKD 251
Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP---ETF 264
G GG I+DSG+ +TY D + L F+S E Q+A S + LC+ LP
Sbjct: 252 GSGGMILDSGTTITYLQEDAFDVLKNAFISQTE-LQVANSSTT--GLDLCFHLPVKNAAE 308
Query: 265 NRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDL 324
+ P + F+F+ +L + EN + D E LA+ L ++ G+ QQ++ ++DL
Sbjct: 309 VKVPKLIFHFKGLDLALPVENYMVSDPEMGLICLAIDATGSL-SIFGNIQQQNMLVLHDL 367
Query: 325 NIDLLSFVKENC 336
LS V C
Sbjct: 368 KKSTLSLVPTQC 379
>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 80/271 (29%), Positives = 119/271 (43%), Gaps = 55/271 (20%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
+V L IGTP + +ILDTGS L + +FDP SSSF + C+HP C
Sbjct: 78 LVSLPIGTPPQSQQMILDTGSQLSWIQCHKKVPRKPPPSTVFDPSLSSSFSVLPCNHPLC 137
Query: 47 TY--------FKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
C +N C Y+ YAD ++ +G E I+ + + GC+
Sbjct: 138 KPRIPDFTLPTSCDLNRLCHYSYFYADGTLAEGNLVREKITFSTSQSTPPL----ILGCA 193
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
EDA D G+LG++ +SF SQ I K FSYC+ P + +
Sbjct: 194 -------EDASDDK--GILGMNLGRLSFASQ-AKITK--FSYCV----PTRQVRPGFTPT 237
Query: 158 GTDMGYRRPSTQATKFIN---------HPNN---FYYLSLKDISIDNERMNFPPDTFDIT 205
G+ P++ ++I+ PN + ++L+ I I N+++N P F
Sbjct: 238 GSFYLGENPNSAGFQYISLLTFSQSQRMPNLDPLAHTVALQGIRIGNKKLNIPVSAFRAD 297
Query: 206 VSGEGGCIIDSGSVLTYFHSDVYWKLHEKFV 236
SG G +IDSGS TY Y K+ E+ V
Sbjct: 298 PSGAGQSMIDSGSEFTYLVDVAYNKVREEVV 328
>gi|159464048|ref|XP_001690254.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158284242|gb|EDP09992.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 485
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 90/367 (24%), Positives = 148/367 (40%), Gaps = 54/367 (14%)
Query: 4 LFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKI-------NCD 42
L +GTP + +I+DTGS + Y FDP KS++ +K+ NC
Sbjct: 17 LKLGTPERTFSVIIDTGSTITYIPCKDCSHCGKHTAEWFDPDKSTTAKKLACGDPLCNCG 76
Query: 43 HPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
P CT C N++C Y+ YA++S ++G+ +T + +FGC N G
Sbjct: 77 TPSCT---CNNDRCYYSRTYAERSSSEGWMIEDTFGFPDSDSPVRL----VFGCENGETG 129
Query: 103 FDEDARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
E R A G++G+ +F SQL +I+ FS C P L G
Sbjct: 130 --EIYRQMA-DGIMGMGNNHNAFQSQLVQRKVIEDVFSLCFGYP------KDGILLLGDV 180
Query: 161 MGYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
+T T + H + YY + + I+++ + + F FD G ++DSG+
Sbjct: 181 TLPEGANTVYTPLLTHLHLHYYNVKMDGITVNGQTLAFDASVFDRGY----GTVLDSGTT 236
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDC-PEPIQLCYFLPETFNRFPSMAFYFEDAN 278
TY +D + + + Y E+ L P+ +C+ ++F + YF A
Sbjct: 237 FTYLPTDAFKAMAKAVGDYVEKKGLQSTPGADPQYNDICW--KGAPDQFKDLDKYFPPAE 294
Query: 279 LRIDGENVFIIDYENHFFL-------LAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSF 331
G + + FL L + + + AL+G RD YD + F
Sbjct: 295 FVFGGGAKLTLPPLRYLFLSKPAEYCLGIFDNGNSGALVGGVSVRDVVVTYDRRNSKVGF 354
Query: 332 VKENCSD 338
C+D
Sbjct: 355 TTMACAD 361
>gi|242053991|ref|XP_002456141.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
gi|241928116|gb|EES01261.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
Length = 519
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 105/422 (24%), Positives = 158/422 (37%), Gaps = 98/422 (23%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA----------------------------------- 26
VR +GTP++ LL+ DTGS L +
Sbjct: 109 VRFRVGTPARPFLLVADTGSDLTWVKCHRHDHDAPAPGYGYAAPASNDSSTSSLSAAAAS 168
Query: 27 ------IFDPRKSSSFQKINCDHPDCTYF--------KCVNEQCVYTMKYADQSVTKGFA 72
+F P +S ++ I C CT C Y +Y D S +G
Sbjct: 169 SSSHARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTPGSPCAYDYRYKDGSAARGTV 228
Query: 73 AHE--TISVIGKGEGK----AIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFI 126
+ TI++ G+G K A G + GC+ G A DG VL L ISF
Sbjct: 229 GTDSATIALSGRGAKKKQRQAKLRGVVLGCTTSYTGDSFLASDG----VLSLGYSNISFA 284
Query: 127 SQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD--MGYRRPST---------------- 168
S+ + RFSYCLV L T SYL FG + + PS
Sbjct: 285 SRAAARFGGRFSYCLVDHLAPRNAT-SYLTFGPNPAVSSSPPSKTACAGGGSPAAAPPGP 343
Query: 169 ----QATKFINHP-NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
Q ++H FY +++ IS+D E + P +D V+ GG I+DSG+ LT
Sbjct: 344 GGARQTPLLLDHRMRPFYAVTVNGISVDGELLRIPRLVWD--VAKGGGAILDSGTSLTVL 401
Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCP-EPIQLCYFL--PETFN----RFPSMAFYFED 276
S Y V+ + +LA L +P CY P T P +A +F
Sbjct: 402 VSPAY----RAVVAALNK-KLAGLPRVTMDPFDYCYNWTSPSTGEDLTVAMPELAVHFAG 456
Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHD-DLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
+ ++ID + + + V++IG+ Q++ + +DL L F +
Sbjct: 457 SARLQPPAKSYVIDAAPGVKCIGLQEGEWPGVSVIGNILQQEHLWEFDLKNRRLRFKRSR 516
Query: 336 CS 337
C+
Sbjct: 517 CT 518
>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
Length = 368
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 89/381 (23%), Positives = 157/381 (41%), Gaps = 59/381 (15%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------IFDPRKSSSFQKINCDHPDCTYFK--- 50
++L IG+ K + I+DTGS + +FDP S S++++ C C +
Sbjct: 1 MQLGIGSLQKNLSAIIDTGSEAVLVQCGSRSRPVFDPAASQSYRQVPCISQLCLAVQQQT 60
Query: 51 -------CVNEQ--CVYTMKYADQSVTKGFAAHETI--SVIGKGEGKAIFHGALFGCSND 99
CVN C Y++ Y D + G + + I + F FGC++
Sbjct: 61 SNGSSQPCVNSSAACTYSLSYGDSRNSTGDFSQDVIFLNSTNSSSQAVQFRDVAFGCAHS 120
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIK-KRFSYCLVIPLPNGEYTSSYLKFG 158
GF D G+L G++G +R +S SQL + +FSYC P + ++ + F
Sbjct: 121 PQGFLVDL--GSL-GIVGFNRGNLSLPSQLKDRLGGSKFSYCF--PSQPWQPRATGVIFL 175
Query: 159 TDMGYRRPSTQATKFINHP-----NNFYYLSLKDISIDNERMNFPPDTFDITVS-GEGGC 212
D G + T +++P + YY+ L IS+D + + P F + S G+GG
Sbjct: 176 GDSGLSKSKVSYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPSTGDGGT 235
Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP--ETFNRFPSM 270
++DSG+ T D Y F + R L + CY + + P +
Sbjct: 236 VLDSGTTFTRVVDDAYTAFRNAFAAS-NRSGLRKKVGAAAGFDDCYNISAGSSLPGVPEV 294
Query: 271 AFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVAL---------------IGSQQQ 315
++ N+R++ + +E+ F ++ A ++ V L +G+ QQ
Sbjct: 295 RLSLQN-NVRLE------LRFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKINVLGNYQQ 347
Query: 316 RDTRFVYDLNIDLLSFVKENC 336
+ YD + F + +C
Sbjct: 348 SNYLVEYDNERSRVGFERADC 368
>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
vinifera]
Length = 560
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 93/376 (24%), Positives = 162/376 (43%), Gaps = 60/376 (15%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
++ IGTPSK + +DTGS +++ ++D + S++ + CD
Sbjct: 157 AKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVGCD 216
Query: 43 HPDCTYFK-----CV-NEQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGAL 93
C+ + C QC+Y++ Y D S T G+ + + + G + +
Sbjct: 217 DNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGTVV 276
Query: 94 FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYT 151
FGC N G + + AL G+LG + S +SQL S +KK FS+CL G +
Sbjct: 277 FGCGNKQSG-ELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDGGGIFA 335
Query: 152 SSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE-G 210
+ P T + + + Y + +K+I + + ++ P D F+ SG+
Sbjct: 336 IGEVV--------EPKVNITPLVQNQAH-YNVVMKEIEVGGDPLDVPSDAFE---SGDRK 383
Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPS 269
G IIDSG+ L YF +VY L EK +S +L + E C+ + + FP+
Sbjct: 384 GTIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTV----EQAFTCFDYTGNVDDGFPT 439
Query: 270 MAFYFEDA-NLRI-------DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFV 321
+ +F+ + +L + E + I ++N A + L+G + V
Sbjct: 440 VTLHFDKSISLTVYPHEYLFQHEFEWCIGWQNSG---AQTKDGKDLTLLGDLVLSNKLVV 496
Query: 322 YDLNIDLLSFVKENCS 337
YDL + +V+ NCS
Sbjct: 497 YDLEKQGIGWVEYNCS 512
>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 488
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 86/377 (22%), Positives = 157/377 (41%), Gaps = 64/377 (16%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA-----IFDPRKS-------------SSFQKINCDH 43
++ +GTPS+ + +DTGS +++ I PRKS S+ + ++C
Sbjct: 87 AKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTPYDADASSTAKSVSCSD 146
Query: 44 PDCTYFKCVNE-----QCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGALFG 95
C+Y +E C Y + Y D S T G+ + + V G + + +FG
Sbjct: 147 NFCSYVNQRSECHSGSTCQYVILYGDGSSTNGYLVRDVVHLDLVTGNRQTGSTNGTIIFG 206
Query: 96 CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEYTSS 153
C + G +++ A+ G++G + SFISQL S +K+ F++CL +
Sbjct: 207 CGSKQSGQLGESQ-AAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCL-----DNNNGGG 260
Query: 154 YLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE-GGC 212
G + P + T ++ + Y ++L I + N + D FD SG+ G
Sbjct: 261 IFAIGEVV---SPKVKTTPMLSKSAH-YSVNLNAIEVGNSVLQLSSDAFD---SGDDKGV 313
Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAF 272
IIDSG+ L Y VY L + ++ + L + D C+ + +RFP++ F
Sbjct: 314 IIDSGTTLVYLPDAVYNPLMNQILASHQELNLHTVQDSFT----CFHYIDRLDRFPTVTF 369
Query: 273 YFEDA------------NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRF 320
F+ + +R E+ + ++N + ++G +
Sbjct: 370 QFDKSVSLAVYPQEYLFQVR---EDTWCFGWQNGGLQTKGGAS---LTILGDMALSNKLV 423
Query: 321 VYDLNIDLLSFVKENCS 337
VYD+ ++ + NCS
Sbjct: 424 VYDIENQVIGWTNHNCS 440
>gi|77555282|gb|ABA98078.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 409
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 77/294 (26%), Positives = 135/294 (45%), Gaps = 28/294 (9%)
Query: 46 CTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
+YF V QC + T G+ A +T + G G +FGCS+ ++G
Sbjct: 109 TSYF--VWAQCAPLTYGGSAANTSGYLATDTFTF-----GATAVPGVVFGCSDASYGDFA 161
Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS-SYLKFGTDMGYR 164
A +GV+G+ R +S ISQL +FSY L+ P + ++ S ++FG D +
Sbjct: 162 GA-----SGVIGIGRGNLSLISQL---QFGKFSYQLLAPEATDDGSADSVIRFGDDAVPK 213
Query: 165 RPSTQATKFIN---HPNNFYYLSLKDISIDNERMN-FPPDTFDITVSGEGGCIIDSGSVL 220
++T ++ +P+ FYY++L + +D R++ P TFD+ +G GG I+ S + +
Sbjct: 214 TKRGRSTPLLSSTLYPD-FYYVNLTGVRVDGNRLDAIPAGTFDLRANGTGGVILSSTTPV 272
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-PIQLCYFLPETFN-RFPSMAFYFE-DA 277
TY Y + S R L ++ + LCY + P + F+ A
Sbjct: 273 TYLEQAAYDVVRAAVAS---RIGLPAVNGSAALELDLCYNASSMAKVKVPKLTLVFDGGA 329
Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSF 331
++ + N F ID + L + P +++G+ Q T +YD++ L+F
Sbjct: 330 DMDLSAANYFYIDNDTGLECLTMLPSQG-GSVLGTLLQTGTNMIYDVDAGRLTF 382
>gi|242094478|ref|XP_002437729.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
gi|241915952|gb|EER89096.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
Length = 486
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 90/358 (25%), Positives = 143/358 (39%), Gaps = 56/358 (15%)
Query: 7 GTPSKGVLLILDTGSALIY-----------AIFDPRKSSSFQKINCDHPDCTYFK----- 50
G+ S V ++LDT + + A +DP +SS++ C+ C
Sbjct: 157 GSSSPPVTVVLDTAGDVPWMRCVPCTFAQCADYDPTRSSTYSAFPCNSSACKQLGRYANG 216
Query: 51 C-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARD 109
C N QC Y + A S T + I G+ G FGCS + G E+ D
Sbjct: 217 CDANGQCQYMVVTAGDSFTTSGTYSSDVLTINSGD---RVEGFRFGCSQNEQGSFENQAD 273
Query: 110 GALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG--YRRPS 167
G ++ L R S ++Q S FSYC LP E T + + G +G YR +
Sbjct: 274 G----IMALGRGVQSLMAQTSSTYGDAFSYC----LPPTETTKGFFQIGVPIGASYRFVT 325
Query: 168 TQATKFINHPN----NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
T K + Y L I++D + +N P + F G ++DS +++T
Sbjct: 326 TPMLKERGGASAAAATLYRALLLAITVDGKELNVPAEVF------AAGTVMDSRTIITRL 379
Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET-FNRFPSMAFYFEDANLRID 282
Y L F + R+++A E + CY L + R P +A F D
Sbjct: 380 PVTAYGALRAAFRNRM-RYRVAPPQ---EELDTCYDLTGVRYPRLPRIALVF-------D 428
Query: 283 GENVFIIDYENHFF--LLAVAPHDD--LVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
G V +D LA A +DD +++G+ QQ+ + ++D+ + F C
Sbjct: 429 GNAVVEMDRSGILLNGCLAFASNDDDSSPSILGNVQQQTIQVLHDVGGGRIGFRSAAC 486
>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 484
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 93/377 (24%), Positives = 159/377 (42%), Gaps = 60/377 (15%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCD 42
++ IGTP+K + +DTGS +++ +++ +S S + ++CD
Sbjct: 82 AKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLVSCD 141
Query: 43 HPDCTYFK------C-VNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGA 92
C C N C Y Y D S T G+ + + SV G + +
Sbjct: 142 DDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTANGSV 201
Query: 93 LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEY 150
+FGC G + + + AL G+LG + S ISQL S +KK F++CL +G
Sbjct: 202 IFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCL-----DGRN 256
Query: 151 TSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
G + +P T + N P+ Y +++ + + E +N P D F
Sbjct: 257 GGGIFAIGRVV---QPKVNMTPLVPNQPH--YNVNMTAVQVGQEFLNIPADLFQ--PGDR 309
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPS 269
G IIDSG+ L Y +Y L +K S ++ + + Q + E FP+
Sbjct: 310 KGAIIDSGTTLAYLPEIIYEPLVKKITSQEPALKVHIVDKDYKCFQYSGRVDEG---FPN 366
Query: 270 MAFYFEDAN-LRI-------DGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRF 320
+ F+FE++ LR+ E ++ I ++N A+ D + L+G +
Sbjct: 367 VTFHFENSVFLRVYPHDYLFPYEGMWCIGWQNS----AMQSRDRRNMTLLGDLVLSNKLV 422
Query: 321 VYDLNIDLLSFVKENCS 337
+YDL L+ + + NCS
Sbjct: 423 LYDLENQLIGWTEYNCS 439
>gi|359476206|ref|XP_002262837.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 462
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 95/353 (26%), Positives = 143/353 (40%), Gaps = 48/353 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYAIFDPRKSSSFQKINCDHPDCTYFK---------- 50
+V + G P + + LI+DTGS + + +S NC + F
Sbjct: 130 LVNVGFGKPQQNLNLIIDTGSDTTWI-----RCNSCSLGNCHNKKIPTFNPSLSSSYSNR 184
Query: 51 -CV-NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDAR 108
C+ + + YTM Y D S +KG + +++ +F G
Sbjct: 185 SCIPSTKTNYTMNYEDNSYSKGVFVCDEVTL----------KPDVFPKFQFGCGDSGGGD 234
Query: 109 DGALAGVLGLSR-VTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPS 167
G+ +GVLGL++ S ISQ S KK+FSYC P+ E T L FG PS
Sbjct: 235 FGSASGVLGLAQGEQYSLISQTASKFKKKFSYC----FPHNENTRGSLLFGEKAISASPS 290
Query: 168 TQATKFIN-HPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSD 226
+ T+ +N + Y++ L IS+ +R+N F G IIDSG+V+T+ +
Sbjct: 291 LKFTRLLNPSSGSVYFVELIGISVAKKRLNVSSSLF-----ASPGTIIDSGTVITHLPTA 345
Query: 227 VYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCYFLPETFNR---FPSMAFYF---EDAN 278
Y L F E +S P+ P+ CY L R P + +F D +
Sbjct: 346 AYEALRTAFQQ--EMLHCPSVSPPPQEKPLDTCYNLKGCGGRNIKLPEIVLHFVGEVDVS 403
Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSF 331
L G D A H V +IG++QQ + VYD+ L F
Sbjct: 404 LHPSGILWANGDLTQACLAFARKSHPSHVTIIGNRQQVSLKVVYDIEGGRLGF 456
>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
Length = 480
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 93/377 (24%), Positives = 162/377 (42%), Gaps = 61/377 (16%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
++ IGTPSK + +DTGS +++ ++D + S++ + CD
Sbjct: 76 AKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVGCD 135
Query: 43 HPDCTYFK-----CV-NEQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGAL 93
C+ + C QC+Y++ Y D S T G+ + + + G + +
Sbjct: 136 DNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGTVV 195
Query: 94 FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYT 151
FGC N G + + AL G+LG + S +SQL S +KK FS+CL G +
Sbjct: 196 FGCGNKQSG-ELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDGGGIFA 254
Query: 152 SSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE-G 210
+ P T + + + Y + +K+I + + ++ P D F+ SG+
Sbjct: 255 IGEV--------VEPKVNITPLVQNQAH-YNVVMKEIEVGGDPLDVPSDAFE---SGDRK 302
Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPS 269
G IIDSG+ L YF +VY L EK +S +L + E C+ + + FP+
Sbjct: 303 GTIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTV----EQAFTCFDYTGNVDDGFPT 358
Query: 270 MAFYFEDA-NLRI--------DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRF 320
+ +F+ + +L + E + I ++N A + L+G +
Sbjct: 359 VTLHFDKSISLTVYPHEYLFQVKEFEWCIGWQNSG---AQTKDGKDLTLLGDLVLSNKLV 415
Query: 321 VYDLNIDLLSFVKENCS 337
VYDL + +V+ NCS
Sbjct: 416 VYDLEKQGIGWVEYNCS 432
>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 449
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 84/332 (25%), Positives = 142/332 (42%), Gaps = 71/332 (21%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY-------------AIFDPRKSSSFQKINCDHPDCT- 47
V L +GTP + V +++DTGS L + + F+P SSS+ I C CT
Sbjct: 75 VSLTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSSTFNPVWSSSYSPIPCSSSTCTD 134
Query: 48 -------YFKCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
C + Q C T+ YAD S ++G A +T + G + +FGC +
Sbjct: 135 QTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYI-----GSSGIPNVVFGCMDS 189
Query: 100 --NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
+ +ED+++ G++G++R ++SF+SQ+G +FSYC+ EY S L
Sbjct: 190 IFSSNSEEDSKN---TGLMGMNRGSLSFVSQMG---FPKFSYCI------SEYDFSGLLL 237
Query: 158 GTD--------MGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
D + Y +T Y + L+ I + ++ + P F+ +G
Sbjct: 238 LGDANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGA 297
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVS-------YFER----FQLAQLSDCPEPIQLCY 258
G ++DSG+ T+ Y L + F++ +E FQ A + LCY
Sbjct: 298 GQTMVDSGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGA--------MDLCY 349
Query: 259 FLPETFNR---FPSMAFYFEDANLRIDGENVF 287
+P R PS+ F A + + G+ +
Sbjct: 350 RVPTNQTRLPPLPSVTLVFRGAEMTVTGDRIL 381
>gi|356508308|ref|XP_003522900.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 439
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 80/358 (22%), Positives = 145/358 (40%), Gaps = 39/358 (10%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY-----------AIFDPRKSSSFQKINCDHPDCTYF 49
+VR IG+P + +LL +DT + + +F P KS++F+ ++C P C
Sbjct: 99 IVRAKIGSPPQTLLLAMDTSNDAAWIPCTACDGCTSTLFAPEKSTTFKNVSCGSPQCNQV 158
Query: 50 ---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
C C + + Y S+ +T+++ FGC G
Sbjct: 159 PNPSCGTSACTFNLTYGSSSIAANVV-QDTVTL-----ATDPIPDYTFGCVAKTTGASAP 212
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
+ R +S +SQ ++ + FSYCL P S L+ G R
Sbjct: 213 PQGLLGL-----GRGPLSLLSQTQNLYQSTFSYCL--PSFKSLNFSGSLRLGPVAQPIR- 264
Query: 167 STQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
+ T + +P ++ YY++L I + + ++ PP+ + G + DSG+V T
Sbjct: 265 -IKYTPLLKNPRRSSLYYVNLVAIRVGRKVVDIPPEALAFNAATGAGTVFDSGTVFTRLV 323
Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCP-EPIQLCYFLPETFNRFPSMAFYFEDANLRIDG 283
+ Y + ++F A L+ CY +P P++ F F N+ +
Sbjct: 324 APAYTAVRDEFQRRVAIAAKANLTVTSLGGFDTCYTVPIV---APTITFMFSGMNVTLPE 380
Query: 284 ENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+N+ I LA+A D ++ +I + QQ++ R +YD+ L +E C+
Sbjct: 381 DNILIHSTAGSTTCLAMASAPDNVNSVLNVIANMQQQNHRVLYDVPNSRLGVARELCT 438
>gi|326499093|dbj|BAK06037.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 471
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 94/382 (24%), Positives = 154/382 (40%), Gaps = 68/382 (17%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY------------------------AIFDPRKSSSF 36
++ + IGTP ++ I DTGS LI+ FDP KS++F
Sbjct: 101 LMAVNIGTPPTRMVAIADTGSDLIWLNCSYGGDGPGLAAARDADAQPPGVQFDPSKSTTF 160
Query: 37 QKINCDHPDCTYF---KC-VNEQCVYTMKYADQSVTKGFAAHETISVI----GKGEGKAI 88
+ ++CD C+ C + +C Y+ Y D S T G + ET + +G+G
Sbjct: 161 RLVDCDSVACSELPEASCGADSKCRYSYSYGDGSHTSGVLSTETFTFADAPGARGDGTTT 220
Query: 89 FHGAL-FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLG--SIIKKRFSYCLVIPL 145
+ FGCS G G++GL +S +SQLG + + +RFSYCLV
Sbjct: 221 RVANVNFGCSTTFVG------SSVGDGLVGLGGGDLSLVSQLGADTSLGRRFSYCLV--- 271
Query: 146 PNGEYTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDI 204
P SS L FG P T I + +Y + L+ + + N+ P
Sbjct: 272 PYSVKASSALNFGPRAAVTDPGAVTTPLIPSQVKAYYIVELRSVKVGNKTFEAP------ 325
Query: 205 TVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF 264
I+DSG+ LT+ + L ++ R +L + LC+ + +
Sbjct: 326 ---DRSPLIVDSGTTLTFLPEALVDPLVKELTG---RIKLPPAQSPERLLPLCFDV--SG 377
Query: 265 NRFPSMAFYFEDANLRIDGENVFIIDYENHF-------FLLAVAPHDDL--VALIGSQQQ 315
R +A D + + G + EN F LAV+ + ++IG+ Q
Sbjct: 378 VREGQVAAMIPDVTVGLGGGAAVTLKAENTFVEVQEGTLCLAVSAMSEQFPASIIGNIAQ 437
Query: 316 RDTRFVYDLNIDLLSFVKENCS 337
++ YDL+ ++F C+
Sbjct: 438 QNMHVGYDLDKGTVTFAPAACA 459
>gi|357128280|ref|XP_003565802.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 530
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 85/341 (24%), Positives = 138/341 (40%), Gaps = 40/341 (11%)
Query: 28 FDPRKSSSFQKINCDHPD-CTYF---KCV----NEQCVYTMKYADQSVTKGFAAHET--- 76
+ P SSS+++ C D C F C NE C Y Y D +VT+G ET
Sbjct: 175 YRPSLSSSWRRYRCSQKDACGSFPHNTCRSPNHNESCSYEQMYEDGTVTRGIYGRETATV 234
Query: 77 -ISVIGKGEGKA--IFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSII 133
+SV G GEG+ + G + GCS G DA D GVL L +SF + +
Sbjct: 235 PVSVSGAGEGQTAVLLPGLVLGCSTFEAGATVDAHD----GVLTLGNHAVSFGTVAAARF 290
Query: 134 KKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNN--FYYLSLKDISID 191
RFS+CL+ + +G T SYL FG + + + T + P+ + + + +D
Sbjct: 291 GGRFSFCLLHTM-SGRDTFSYLTFGPNPALNGGAMEETNLVYSPDGEPAFGAGVTGVFVD 349
Query: 192 NERM-NFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDC 250
ER+ PP+ +D V G G +D+G+ LT + + Q ++
Sbjct: 350 GERLAGIPPEVWDPAVLG-GALNLDTGTSLTGLVEPAFEAVRAAVDRRLGHLQKEDVAG- 407
Query: 251 PEPIQLCYFL------------PETFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFL 297
+CY P P +AF FE A L + + +
Sbjct: 408 ---FDICYKWAFGAGAGDEGVDPAHNVTVPKVAFEFEGGARLEPVARGIVLPEVVPGVAC 464
Query: 298 LAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCSD 338
L + +++G+ ++ + +D L F K+ C++
Sbjct: 465 LGFRRREVGPSVLGNVHMQEHVWEFDHMAGKLRFRKDKCTN 505
>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 444
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 95/360 (26%), Positives = 165/360 (45%), Gaps = 41/360 (11%)
Query: 6 IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYFKC 51
+GTP +L I+DTGS +I+ IFDP +S +++ + C C +
Sbjct: 100 VGTPPFQILGIVDTGSDIIWLQCQPCEDCYNQTTPIFDPSQSKTYKTLPCSSNICQSVQS 159
Query: 52 V------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAI-FHGALFGCSNDNHG-F 103
N++C YT+ Y D S ++G + ET++ +G +G ++ F + GC ++N G F
Sbjct: 160 AASCSSNNDECEYTITYGDNSHSQGDLSVETLT-LGSTDGSSVQFPKTVIGCGHNNKGTF 218
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
+ G +S ++ S G +FSYCL PL + +SS L FG +
Sbjct: 219 QREGSGIVGLGGGPVSLISQLSSSIGG-----KFSYCLA-PLFSQSNSSSKLNFGDEAVV 272
Query: 164 RRPSTQATKFINHPNN---FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
T +T + P N FY+L+L+ S+ + R+ F + + GEG IIDSG+ L
Sbjct: 273 SGRGTVSTPIV--PKNGLGFYFLTLEAFSVGDNRIEF-GSSSFESSGGEGNIIIDSGTTL 329
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFEDANL 279
T D Y L E L ++ D + ++LCY + P + +F+ A++
Sbjct: 330 TILPEDDYLNLESAVADAIE---LERVEDPSKFLRLCYRTTSSDELNVPVITAHFKGADV 386
Query: 280 RIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCSDD 339
++ + F I+ + A + + G+ Q++ YDL +SF +C+ +
Sbjct: 387 ELNPISTF-IEVDEGVVCFAFR-SSKIGPIFGNLAQQNLLVGYDLVKQTVSFKPTDCTQE 444
>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 632
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 94/367 (25%), Positives = 151/367 (41%), Gaps = 53/367 (14%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
RL+IGTP + LI+D+GS + Y F P SS++Q + C+ DC
Sbjct: 95 TRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQCGKHQDPKFQPEMSSTYQPVKCNM-DC- 152
Query: 48 YFKCVN--EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
C + EQCVY +YA+ S +KG + IS E + A+FGC G
Sbjct: 153 --NCDDDREQCVYEREYAEHSSSKGVLGEDLISF--GNESQLTPQRAVFGCETVETG--- 205
Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCL-VIPLPNGEYTSSYLKFGTDMG 162
D G++GL + +S + QL +I F C + + G + +DM
Sbjct: 206 DLYSQRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVGGGSMILGGFDYPSDMV 265
Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
+ + +Y + L I + ++++ FD GE G ++DSG+ Y
Sbjct: 266 FTDSDPDRSP-------YYNIDLTGIRVAGKQLSLHSRVFD----GEHGAVLDSGTTYAY 314
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEP--------IQLCYFLPETFNRFPSMAFYF 274
+ E + E L Q+ D P+P + ++ E FPS+ F
Sbjct: 315 LPDAAFAAFEEAVMR--EVSTLKQI-DGPDPNFKDTCFQVAASNYVSELSKIFPSVEMVF 371
Query: 275 EDA-NLRIDGENVFIIDYENH-FFLLAVAPH-DDLVALIGSQQQRDTRFVYDLNIDLLSF 331
+ + + EN + H + L V P+ D L+G R+T VYD + F
Sbjct: 372 KSGQSWLLSPENYMFRHSKVHGAYCLGVFPNGKDHTTLLGGIVVRNTLVVYDRENSKVGF 431
Query: 332 VKENCSD 338
+ NCS+
Sbjct: 432 WRTNCSE 438
>gi|388495448|gb|AFK35790.1| unknown [Medicago truncatula]
Length = 434
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 84/359 (23%), Positives = 143/359 (39%), Gaps = 44/359 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY------------AIFDPRKSSSFQKINCDHPDCTY 48
+V+ GTP + +LL LDT S + F P KS+SF+ ++C P C
Sbjct: 98 IVKAKFGTPPQTLLLALDTSSDAAWIPCSGCVGCSTSKPFAPIKSTSFRNVSCGSPHCKQ 157
Query: 49 FK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
C C + Y S+ +T+++ G FGC N G
Sbjct: 158 VPNPTCGGSACAFNFTYGSSSIAAS-VVQDTLTLAADP-----IPGYTFGCVNKTTGSSA 211
Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
+ R +S +SQ ++ K FSYCL P S L+ G +R
Sbjct: 212 PQQGLLGL-----GRGPLSLLSQSQNLYKSTFSYCL--PSFKSINFSGSLRLGPVYQPKR 264
Query: 166 PSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
+ T + +P ++ YY++L I + + ++ PP + G I DSG+V T
Sbjct: 265 --IKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTTGAGTIFDSGTVFTRL 322
Query: 224 HSDVYWKLHEKFVSYF-ERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRID 282
VY + +F + + L CY +P P++ F F N+ +
Sbjct: 323 AEPVYTAVRNEFRRRVGPKLPVTTLGG----FDTCYNVPIV---VPTITFLFSGMNVALP 375
Query: 283 GENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+N+ I LA+A D ++ +I + QQ++ R ++D+ + +E C+
Sbjct: 376 PDNIVIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLFDVPNSRIGIARELCT 434
>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 663
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 92/368 (25%), Positives = 148/368 (40%), Gaps = 56/368 (15%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
RL+IGTP + LI+DTGS + Y F P SS++Q + C DC
Sbjct: 114 TRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPESSSTYQPVKCTI-DC- 171
Query: 48 YFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
C + QCVY +YA+ S + G + IS + E A+FGC N G
Sbjct: 172 --NCDGDRMQCVYERQYAEMSTSSGVLGEDVISFGNQSELAP--QRAVFGCENVETG--- 224
Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD-MG 162
D G++GL R +S + QL +I FS C Y + G +G
Sbjct: 225 DLYSQHADGIMGLGRGDLSIMDQLVDKKVISDSFSLC---------YGGMDVGGGAMVLG 275
Query: 163 YRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
P + T + P+ +Y + LK++ + +R+ + FD G+ G ++DSG+
Sbjct: 276 GISPPSDMTFAYSDPDRSPYYNIDLKEMHVAGKRLPLNANVFD----GKHGTVLDSGTTY 331
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLR 280
Y + + V + L Q+S P+P N ++ F ++
Sbjct: 332 AYLPEAAFLAFKDAIVKELQ--SLKQISG-PDPNYNDICFSGAGNDVSQLSKSFPVVDMV 388
Query: 281 IDGENVFIIDYENHFF----------LLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
+ + + EN+ F L +D L+G R+T +YD +
Sbjct: 389 FGNGHKYSLSPENYMFRHSKVRGAYCLGIFQNGNDQTTLLGGIIVRNTLVMYDREQTKIG 448
Query: 331 FVKENCSD 338
F K NC++
Sbjct: 449 FWKTNCAE 456
>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
Length = 482
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 88/325 (27%), Positives = 135/325 (41%), Gaps = 43/325 (13%)
Query: 27 IFDPRKSSSFQKINCDHPDCTYFK----CVN--EQCVYTMKYADQSVTKGFAAHETISVI 80
+++ KSSS + C P C CV +C Y ++Y D S + G ET++
Sbjct: 171 VYNRLKSSSASDVGCYAPACRALGSSGGCVQFLNECQYKVEYGDGSSSAGDFGVETLTF- 229
Query: 81 GKGEGKAIFHGALFGCSNDNHG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSY 139
G GC +DN G F A AG+LGL R ++SF SQ+ + FSY
Sbjct: 230 ---PPGVRVPGVAIGCGSDNQGLFPAPA-----AGILGLGRGSLSFPSQIAGRYGRSFSY 281
Query: 140 CLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNN-----FYYLSLKDISIDNER 194
CL G SS L FG+ +T F N FYY+ L IS+ R
Sbjct: 282 CLAGQGTGGR--SSTLTFGSGASATTTTTTPPSFTPMLTNSRMYTFYYVGLVGISVGGVR 339
Query: 195 MNFPPDTFDITV---SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP 251
+ ++ D+ + +G GG I+DSG+ +T Y + F R + P
Sbjct: 340 VRGVTES-DLRLDPSTGHGGVIVDSGTAVTRLSGPAYAAFRDAF-----RVAAVKELGWP 393
Query: 252 EP------IQLCY--FLPETFNRFPSMAFYFEDA-NLRIDGENVFI-IDYENHFFLLAVA 301
P CY + P+++ +F +++ +N I +D A A
Sbjct: 394 SPGGPFAFFDTCYSSVRGRVMKKVPAVSMHFAGGVEVKLPPQNYLIPVDSNKGTMCFAFA 453
Query: 302 PHDDL-VALIGSQQQRDTRFVYDLN 325
D V++IG+ Q + R VYD++
Sbjct: 454 GSGDRGVSIIGNIQLQGFRVVYDVD 478
>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 93/370 (25%), Positives = 142/370 (38%), Gaps = 62/370 (16%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYAI---------------FDPRKSSSFQKINCDHPD 45
+V + +GTP K LI DTGS + + +P S+S++ I+C
Sbjct: 120 VVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNISCSSAL 179
Query: 46 CTYF--------KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
C C + C+Y ++Y D S + GF A ET+++ +F LFGC
Sbjct: 180 CKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTL----SSSNVFKNFLFGCG 235
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
N+G A R ++ SQ KK FSYC LP + YL
Sbjct: 236 QQNNGLFGGAAGLLGL-----GRTKLALPSQTAKTYKKLFSYC----LPASSSSKGYLSL 286
Query: 158 GTDMGYRRPSTQ-ATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
G + T + F + P FY L + +S+ +++ F G +IDS
Sbjct: 287 GGQVSKSVKFTPLSADFDSTP--FYGLDITGLSVGGRKLSIDESAF------SAGTVIDS 338
Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP-----IQLCY-FLPETFNRFPSM 270
G+V+T Y +L F + ++D P CY F R P +
Sbjct: 339 GTVITRLSPTAYSELSSAFQNL--------MTDYPSTSGYSIFDTCYDFSKYDTVRIPKV 390
Query: 271 AFYFEDA-NLRIDGENVFIIDYENHFFLLAVAPHDD--LVALIGSQQQRDTRFVYDLNID 327
F+ + ID + LA A +DD ++ G+ QQR + VYD
Sbjct: 391 GVTFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKG 450
Query: 328 LLSFVKENCS 337
+ F CS
Sbjct: 451 RVGFAPGGCS 460
>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
Length = 472
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 93/370 (25%), Positives = 142/370 (38%), Gaps = 62/370 (16%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYAI---------------FDPRKSSSFQKINCDHPD 45
+V + +GTP K LI DTGS + + +P S+S++ I+C
Sbjct: 132 VVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNISCSSAL 191
Query: 46 CTYF--------KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
C C + C+Y ++Y D S + GF A ET+++ +F LFGC
Sbjct: 192 CKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTL----SSSNVFKNFLFGCG 247
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
N+G A R ++ SQ KK FSYC LP + YL
Sbjct: 248 QQNNGLFGGAAGLLGL-----GRTKLALPSQTAKTYKKLFSYC----LPASSSSKGYLSL 298
Query: 158 GTDMGYRRPSTQ-ATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
G + T + F + P FY L + +S+ +++ F G +IDS
Sbjct: 299 GGQVSKSVKFTPLSADFDSTP--FYGLDITGLSVGGRKLSIDESAF------SAGTVIDS 350
Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP-----IQLCY-FLPETFNRFPSM 270
G+V+T Y +L F + ++D P CY F R P +
Sbjct: 351 GTVITRLSPTAYSELSSAFQNL--------MTDYPSTSGYSIFDTCYDFSKYDTVRIPKV 402
Query: 271 AFYFEDA-NLRIDGENVFIIDYENHFFLLAVAPHDD--LVALIGSQQQRDTRFVYDLNID 327
F+ + ID + LA A +DD ++ G+ QQR + VYD
Sbjct: 403 GVTFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKG 462
Query: 328 LLSFVKENCS 337
+ F CS
Sbjct: 463 RVGFAPGGCS 472
>gi|356546376|ref|XP_003541602.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 450
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 93/360 (25%), Positives = 166/360 (46%), Gaps = 38/360 (10%)
Query: 6 IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYF-- 49
+GTP +L ++DTGS + + IFDP KS +++ + C C
Sbjct: 103 VGTPPFEILGVVDTGSGITWMQCQRCEDCYEQTTPIFDPSKSKTYKTLPCSSNMCQSVIS 162
Query: 50 --KCVNEQ--CVYTMKYADQSVTKGFAAHETISVIGKGEGKAI-FHGALFGCSNDNHGFD 104
C +++ C YT+KY D S ++G + ET++ +G G ++ F + GC ++N G
Sbjct: 163 TPSCSSDKIGCKYTIKYGDGSHSQGDLSVETLT-LGSTNGSSVQFPNTVIGCGHNNKGTF 221
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
+ G + G + S +G +FSYCL P+ + +SS L FG
Sbjct: 222 QGEGSGVVGLGGGPVSLISQLSSSIGG----KFSYCLA-PMFSQSNSSSKLNFGDAAVVS 276
Query: 165 RPSTQATKFINHPNN--FYYLSLKDISIDNERMNF-PPDTFDITVSGEGGCIIDSGSVLT 221
+T ++ + FYYL+L+ S+ ++R+ F + + +GEG IIDSG+ LT
Sbjct: 277 GLGAVSTPLVSKTGSEVFYYLTLEAFSVGDKRIEFVGGSSSSGSSNGEGNIIIDSGTTLT 336
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFEDANLR 280
+ Y L + Q ++SD + LCY P P + +F+ A++
Sbjct: 337 LLPQEDYSNLESAVA---DAIQANRVSDPSNFLSLCYQTTPSGQLDVPVITAHFKGADVE 393
Query: 281 IDGENVFIIDYENHFFLLAVAPH-DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCSDD 339
++ + F+ E ++ A H ++V++ G+ Q + YDL +SF +C+ +
Sbjct: 394 LNPISTFVQVAEG---VVCFAFHSSEVVSIFGNLAQLNLLVGYDLMEQTVSFKPTDCTQE 450
>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
vinifera]
Length = 561
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 93/377 (24%), Positives = 162/377 (42%), Gaps = 61/377 (16%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
++ IGTPSK + +DTGS +++ ++D + S++ + CD
Sbjct: 157 AKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVGCD 216
Query: 43 HPDCTYFK-----CV-NEQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGAL 93
C+ + C QC+Y++ Y D S T G+ + + + G + +
Sbjct: 217 DNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGTVV 276
Query: 94 FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYT 151
FGC N G + + AL G+LG + S +SQL S +KK FS+CL G +
Sbjct: 277 FGCGNKQSG-ELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDGGGIFA 335
Query: 152 SSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE-G 210
+ P T + + + Y + +K+I + + ++ P D F+ SG+
Sbjct: 336 IGEVV--------EPKVNITPLVQNQAH-YNVVMKEIEVGGDPLDVPSDAFE---SGDRK 383
Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPS 269
G IIDSG+ L YF +VY L EK +S +L + E C+ + + FP+
Sbjct: 384 GTIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTV----EQAFTCFDYTGNVDDGFPT 439
Query: 270 MAFYFEDA-NLRI--------DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRF 320
+ +F+ + +L + E + I ++N A + L+G +
Sbjct: 440 VTLHFDKSISLTVYPHEYLFQVKEFEWCIGWQNSG---AQTKDGKDLTLLGDLVLSNKLV 496
Query: 321 VYDLNIDLLSFVKENCS 337
VYDL + +V+ NCS
Sbjct: 497 VYDLEKQGIGWVEYNCS 513
>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 93/370 (25%), Positives = 142/370 (38%), Gaps = 62/370 (16%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYAI---------------FDPRKSSSFQKINCDHPD 45
+V + +GTP K LI DTGS + + +P S+S++ I+C
Sbjct: 72 VVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNISCSSAL 131
Query: 46 CTYF--------KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
C C + C+Y ++Y D S + GF A ET+++ +F LFGC
Sbjct: 132 CKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTL----SSSNVFKNFLFGCG 187
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
N+G A R ++ SQ KK FSYC LP + YL
Sbjct: 188 QQNNGLFGGAAGLLGL-----GRTKLALPSQTAKTYKKLFSYC----LPASSSSKGYLSL 238
Query: 158 GTDMGYRRPSTQ-ATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
G + T + F + P FY L + +S+ +++ F G +IDS
Sbjct: 239 GGQVSKSVKFTPLSADFDSTP--FYGLDITGLSVGGRQLSIDESAF------SAGTVIDS 290
Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP-----IQLCY-FLPETFNRFPSM 270
G+V+T Y +L F + ++D P CY F R P +
Sbjct: 291 GTVITRLSPTAYSELSSAFQNL--------MTDYPSTSGYSIFDTCYDFSKYDTVRIPKV 342
Query: 271 AFYFEDA-NLRIDGENVFIIDYENHFFLLAVAPHDD--LVALIGSQQQRDTRFVYDLNID 327
F+ + ID + LA A +DD ++ G+ QQR + VYD
Sbjct: 343 GVTFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKG 402
Query: 328 LLSFVKENCS 337
+ F CS
Sbjct: 403 RVGFAPGGCS 412
>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 476
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 93/378 (24%), Positives = 156/378 (41%), Gaps = 62/378 (16%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
++ +G+P+K + +DTGS +++ ++DP S + + C
Sbjct: 74 TKVGLGSPAKEFYVQVDTGSDILWVNCAGCTACPKKSGLGMDLTLYDPNGSKTSNAVPCG 133
Query: 43 HPDCT------YFKCVNE-QCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGA 92
CT C + C Y++ Y D S T G +++++ V G K
Sbjct: 134 DGFCTDTYSGPISGCKQDMSCPYSITYGDGSTTSGSFVNDSLTFDEVSGNLHTKPDNSSV 193
Query: 93 LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEY 150
+FGC G D AL G++G + S +SQL + +K+ FS+CL + +
Sbjct: 194 IFGCGAKQSGSLSSNSDEALDGIIGFGQANSSVLSQLAASGKVKRIFSHCL-----DSHH 248
Query: 151 TSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
G M + +T + H Y + LKD+ +D E + P FD SG G
Sbjct: 249 GGGIFSIGQVMEPKFNTTPLVPRMAH----YNVILKDMDVDGEPILLPLYLFD---SGSG 301
Query: 211 -GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL-CYFLPETFNR-F 267
G IIDSG+ L Y +Y +L K + +L + D Q C+ + + F
Sbjct: 302 RGTIIDSGTTLAYLPLSIYNQLLPKVLGRQPGLKLMIVED-----QFTCFHYSDKLDEGF 356
Query: 268 PSMAFYFEDANLRID--------GENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTR 319
P + F+FE +L + E+++ I ++ DL+ LIG +
Sbjct: 357 PVVKFHFEGLSLTVHPHDYLFLYKEDIYCIGWQKSS--TQTKEGRDLI-LIGDLVLSNKL 413
Query: 320 FVYDLNIDLLSFVKENCS 337
VYDL ++ + NCS
Sbjct: 414 VVYDLENMVIGWTNFNCS 431
>gi|302764208|ref|XP_002965525.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
gi|300166339|gb|EFJ32945.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
Length = 464
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 83/350 (23%), Positives = 156/350 (44%), Gaps = 33/350 (9%)
Query: 6 IGTPSKGVLLILDTGSALIYAIFDPRK---SSSFQKINCDHPDCTY--FKCVNE-QCVYT 59
+G+P K L++DTGS L + DP SS+F ++ + TY C ++ +
Sbjct: 130 LGSPPKDFSLVMDTGSDLTWVRCDPCSPDCSSTFDRLASN----TYKALTCADDLRLPVL 185
Query: 60 MKYADQSVTKGFAAHETISVIGKGEGK-AIFHGALFGCSNDNHGFDEDARDGALAGVLGL 118
++ + G + +T+ + G + F G +FGC + G G+L L
Sbjct: 186 LRLWRRLFHSGRSLRDTLKMAGAASDELEEFPGFVFGCGSLLKGLISGE-----VGILAL 240
Query: 119 SRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG------TDMGYRRPSTQATK 172
S ++SF SQ+G +FSYCL+ S + FG + G +P
Sbjct: 241 SPGSLSFPSQIGEKYGNKFSYCLLRQTAQNSLKKSPMVFGEAAVELKEPGSGKPQELQYT 300
Query: 173 FINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGG-CIIDSGSVLTYFHSDVYWKL 231
I + +Y + L IS+ N+R++ P TF ++G+ I DSG+ LT S V +
Sbjct: 301 PIGESSIYYTVRLDGISVGNQRLDLSPSTF---LNGQDKPTIFDSGTTLTMLPSGVCDSI 357
Query: 232 HEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-FPSMAFYFEDANLRIDGENVFIID 290
+ S + + + + C+ +P + + P + F+F + + ++ID
Sbjct: 358 KQSLASMVSGAEFVAI----KGLDACFRVPPSSGQGLPDITFHFNGGADFVTRPSNYVID 413
Query: 291 YENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCSDDS 340
+ L+ V ++ V++ G+ QQ+D ++D++ + F + +C S
Sbjct: 414 LGSLQCLIFVPTNE--VSIFGNLQQQDFFVLHDMDNRRIGFKETDCGAHS 461
>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 633
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 93/367 (25%), Positives = 149/367 (40%), Gaps = 53/367 (14%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
RL+IGTP + LI+D+GS + Y F P SS++Q + C+ DC
Sbjct: 96 TRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQCGKHQDPKFQPELSSTYQPVKCNM-DCN 154
Query: 48 YFKCVN--EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
C + EQCVY +YA+ S +KG + IS E + A+FGC G
Sbjct: 155 ---CDDDKEQCVYEREYAEHSSSKGVLGEDLISF--GNESQLTPQRAVFGCETVETG--- 206
Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCL-VIPLPNGEYTSSYLKFGTDMG 162
D G++GL + +S + QL +I F C + + G + +DM
Sbjct: 207 DLYSQRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVGGGSMILGGFDYPSDMI 266
Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
+ + +Y + L I + ++++ FD GE G ++DSG+ Y
Sbjct: 267 FTDSDPDRSP-------YYNIDLTGIRVAGKKLSLNSRVFD----GEHGAVLDSGTTYAY 315
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEP-IQLCYFLPETFNRFPSMAFYFEDANLRI 281
+ E + E L Q+ D P+P + FL N ++ F +
Sbjct: 316 LPDAAFAAFEEAVMR--EVSPLKQI-DGPDPNFKDTCFLVAASNDVSELSKIFPSVEMIF 372
Query: 282 DGENVFIIDYENHFF---------LLAVAPH-DDLVALIGSQQQRDTRFVYDLNIDLLSF 331
+++ EN+ F L V P+ D L+G R+T VYD + F
Sbjct: 373 KSGQSWLLSPENYMFRHSKVHGAYCLGVFPNGKDHTTLLGGIVVRNTLVVYDRENSKVGF 432
Query: 332 VKENCSD 338
+ NCS+
Sbjct: 433 WRTNCSE 439
>gi|388505490|gb|AFK40811.1| unknown [Medicago truncatula]
Length = 193
Score = 84.0 bits (206), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 50/181 (27%), Positives = 94/181 (51%), Gaps = 13/181 (7%)
Query: 166 PSTQATKFINHP-------NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
P+ ATK + P +FYY+SL+ IS+ + +++ TF+++ G GG IIDSG+
Sbjct: 15 PNVNATKQVTTPLITNPLQPSFYYISLEVISVGDTKLSIEQSTFEVSDDGSGGVIIDSGT 74
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP--ETFNRFPSMAFYFED 276
+TY + + L ++F S + +L + +C+ LP +T P + F+F+
Sbjct: 75 TITYIEENAFDSLKKEFTS---QTKLPVDKSGSTGLDVCFSLPSGKTEVEIPKLVFHFKG 131
Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+L + GEN I D LA+ + + ++ G+ QQ++ +DL + ++F+ C
Sbjct: 132 GDLELPGENYMIADSSLGVACLAMGASNGM-SIFGNIQQQNILVNHDLQKETITFIPTQC 190
Query: 337 S 337
+
Sbjct: 191 N 191
>gi|357515189|ref|XP_003627883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521905|gb|AET02359.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 84.0 bits (206), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 84/359 (23%), Positives = 143/359 (39%), Gaps = 44/359 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY------------AIFDPRKSSSFQKINCDHPDCTY 48
+V+ GTP + +LL LDT S + F P KS+SF+ ++C P C
Sbjct: 98 IVKAKFGTPPQTLLLALDTSSDAAWIPCSGCVGCSTSKPFAPIKSTSFRNVSCGSPHCKQ 157
Query: 49 FK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
C C + Y S+ +T+++ G FGC N G
Sbjct: 158 VPNPTCGGSACAFNFTYGSSSIAAS-VVQDTLTL-----ATDPIPGYTFGCVNKTTGSSA 211
Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
+ R +S +SQ ++ K FSYCL P S L+ G +R
Sbjct: 212 PQQGLLGL-----GRGPLSLLSQSQNLYKSTFSYCL--PSFKSINFSGSLRLGPVYQPKR 264
Query: 166 PSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
+ T + +P ++ YY++L I + + ++ PP + G I DSG+V T
Sbjct: 265 --IKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTTGAGTIFDSGTVFTRL 322
Query: 224 HSDVYWKLHEKFVSYF-ERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRID 282
VY + +F + + L CY +P P++ F F N+ +
Sbjct: 323 AEPVYTAVRNEFRRRVGPKLPVTTLGG----FDTCYNVPIV---VPTITFLFSGMNVTLP 375
Query: 283 GENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+N+ I LA+A D ++ +I + QQ++ R ++D+ + +E C+
Sbjct: 376 PDNIVIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLFDVPNSRIGIARELCT 434
>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
Length = 564
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 91/366 (24%), Positives = 147/366 (40%), Gaps = 52/366 (14%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
RL+IGTP + LI+DTGS++ Y F P SS++Q + C+ DC
Sbjct: 15 TRLWIGTPPQRFALIVDTGSSVTYVPCSSCEQCGRHQDPKFQPDLSSTYQSVKCNI-DC- 72
Query: 48 YFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIF-HGALFGCSNDNHGFD 104
C +E QCVY +YA+ S + G + IS G A+ A+FGC N G
Sbjct: 73 --NCDDEKQQCVYERQYAEMSTSSGVLGEDIISF---GNLSALAPQRAVFGCENMETG-- 125
Query: 105 EDARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
D G++G+ R +S + L +I FS C +G
Sbjct: 126 -DLYSQHADGIMGMGRGDLSIVDHLVDKGVINDSFSLCYG--------GMGIGGGAMVLG 176
Query: 163 YRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
P + + P + +Y + LK+I + + + P FD G+ G I+DSG+
Sbjct: 177 GISPPSNMVFSQSDPVRSPYYNIDLKEIHVAGKPLPLNPTVFD----GKHGTILDSGTTY 232
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF-----LPETFNRFPSMAFYFE 275
Y + + + + + D P +C+ + + + FP++ F
Sbjct: 233 AYLPEAAFVSFKDAIMKELHSLKPIRGPD-PNYNDICFSGAGSDISQLSSSFPAVEMVFG 291
Query: 276 DAN-LRIDGENVFIIDYENH--FFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
+ L + EN + H + L D L+G R+T +YD + F
Sbjct: 292 NGQKLLLSPENYLFRHSKVHGAYCLGIFQNGKDPTTLLGGIVVRNTLVLYDRENSKIGFW 351
Query: 333 KENCSD 338
K NCS+
Sbjct: 352 KTNCSE 357
>gi|449442281|ref|XP_004138910.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449506266|ref|XP_004162699.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 482
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 92/374 (24%), Positives = 156/374 (41%), Gaps = 54/374 (14%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAIFD-----PRKS--------------SSFQKINCD 42
++ +GTP + + +DTGS +++ P+KS S+ ++ C+
Sbjct: 76 AKIGLGTPVQDYYVQVDTGSDILWVNCAGCTNCPKKSDLGIELSLYSPSSSSTSNRVTCN 135
Query: 43 HPDCTYF------KCVNEQ-CVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGA 92
CT C E C Y + Y D S T G+ + + V G + +
Sbjct: 136 QDFCTSTYDGPIPGCTPELLCEYRVAYGDGSSTAGYFVRDHVVLDRVTGNFQTTSTNGSI 195
Query: 93 LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEY 150
+FGC G A AL G+LG + S ISQL S +K+ F++CL G +
Sbjct: 196 VFGCGAQQSG-QLGATSAALDGILGFGQANSSMISQLASSGKVKRVFAHCLDNINGGGIF 254
Query: 151 TSSYLKFGTDMGYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGE 209
+ +P + T + P +Y + +K I +DNE +N P D FD +
Sbjct: 255 AIGEVV--------QPKVRTTPLV--PQQAHYNVFMKAIEVDNEVLNLPTDVFDTDL--R 302
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPS 269
G IIDSG+ L YF +Y L K F R +L E + + FP+
Sbjct: 303 KGTIIDSGTTLAYFPDVIYEPLISKI---FARQSTLKLHTVEEQFTCFEYDGNVDDGFPT 359
Query: 270 MAFYFEDANLRIDGENVFIIDYENHFFLL------AVAPHDDLVALIGSQQQRDTRFVYD 323
+ F+FED+ + ++ D +++ + + A + + L+G ++ +YD
Sbjct: 360 VTFHFEDSLSLTVYPHEYLFDIDSNKWCVGWQNSGAQSRDGKDMILLGDLVLQNRLVMYD 419
Query: 324 LNIDLLSFVKENCS 337
L + + + NCS
Sbjct: 420 LENQTIGWTEYNCS 433
>gi|115457772|ref|NP_001052486.1| Os04g0334700 [Oryza sativa Japonica Group]
gi|113564057|dbj|BAF14400.1| Os04g0334700 [Oryza sativa Japonica Group]
Length = 482
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/364 (22%), Positives = 151/364 (41%), Gaps = 49/364 (13%)
Query: 6 IGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDHPDC 46
IGTP+ + LDTGS + +DPR S S +++ CD C
Sbjct: 89 IGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEVKCDDTIC 148
Query: 47 TYFKCVNE--QCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGALFGCSNDNH 101
T N +C Y YAD +T G + + + G G+ + FGC
Sbjct: 149 TSRPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVTFGCGLQQS 208
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYTSSYLKFGT 159
G ++ A+ G++G + +SQL + KK FS+CL G + +
Sbjct: 209 GSLNNSAV-AIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGGGIFAIGEV---- 263
Query: 160 DMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
P + T + + ++ ++LK I++ + P + F T + G IDSGS
Sbjct: 264 ----VEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKT--KGTFIDSGST 317
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANL 279
L Y +Y +L + + + + Q +FL ++FP + F+FE+ +L
Sbjct: 318 LVYLPEIIYSELILAVFAKHPDITMGAMYN----FQCFHFLGSVDDKFPKITFHFEN-DL 372
Query: 280 RIDGENV-FIIDYENHFFLL-----AVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
+D ++++YE + + + + D++ ++G + VYD+ + + +
Sbjct: 373 TLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMI-ILGDMVISNKVVVYDMEKQAIGWTE 431
Query: 334 ENCS 337
NCS
Sbjct: 432 HNCS 435
>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
gi|224033441|gb|ACN35796.1| unknown [Zea mays]
gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
Length = 456
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 81/295 (27%), Positives = 117/295 (39%), Gaps = 59/295 (20%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+V L +GTP + V L LDTGS L++ + DP SS++ + C P C
Sbjct: 87 LVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFDQGIPLLDPAASSTYAALPCGAPRC 146
Query: 47 ---TYFKCVNEQCVYTMKYADQSVTKGFAAHE--TISVIGKGEGKAIFHGA---LFGCSN 98
+ C CVY Y D+SVT G A + T G+ G FGC +
Sbjct: 147 RALPFTSCGGRSCVYVYHYGDKSVTVGKIATDRFTFGDNGRRNGDGSLPATRRLTFGCGH 206
Query: 99 DNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
N G + G+ G R S SQL + FSYC + SS + G
Sbjct: 207 FNKGVFQSNET----GIAGFGRGRWSLPSQLNAT---SFSYCFTSMF---DSKSSIVTLG 256
Query: 159 TDMG-----YRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGG 211
+ T +P+ + Y+LSLK IS+ R+ P F T
Sbjct: 257 GAPAALYSHAHSGEVRTTPLFKNPSQPSLYFLSLKGISVGKTRLPVPETKFRST------ 310
Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQ-----LCYFLP 261
IIDSG+ +T +VY + +F AQ+ P ++ +C+ LP
Sbjct: 311 -IIDSGASITTLPEEVYEAVKAEFA--------AQVGLPPSGVEGSALDVCFALP 356
>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
Length = 423
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 92/378 (24%), Positives = 153/378 (40%), Gaps = 58/378 (15%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI-------------------FDPRKSSSFQKINCD 42
R+ +G P+K + +DTGS +++ F+P SS+ +I C
Sbjct: 7 TRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRITCS 66
Query: 43 HPDCT--------YFKCVNEQ---CVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAI 88
CT + N Q C YT Y D S T G+ +T+ +V+G +
Sbjct: 67 DDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQTANS 126
Query: 89 FHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLP 146
+FGCSN G D D A+ G+ G + +S ISQL S + K FS+CL
Sbjct: 127 SASIVFGCSNSQSG-DLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL----K 181
Query: 147 NGEYTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDIT 205
+ L G + P T + + P+ Y L+L+ I+++ +++ P D+ T
Sbjct: 182 GSDNGGGILVLGEIV---EPGLVYTPLVPSQPH--YNLNLESIAVNGQKL--PIDSSLFT 234
Query: 206 VSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN 265
S G I+DSG+ L Y Y + FVS + C+ + +
Sbjct: 235 TSNTQGTIVDSGTTLAYLADGAY----DPFVSAIAAAVSPSVRSLVSKGSQCFITSSSVD 290
Query: 266 -RFPSMAFYFEDA-NLRIDGENVFI----IDYENHFFLLAVAPHDDLVALIGSQQQRDTR 319
FP++ YF + + EN + +D + + + ++G +D
Sbjct: 291 SSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKI 350
Query: 320 FVYDLNIDLLSFVKENCS 337
FVYDL + + +CS
Sbjct: 351 FVYDLANMRMGWADYDCS 368
>gi|356528675|ref|XP_003532925.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Glycine max]
Length = 342
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/343 (24%), Positives = 140/343 (40%), Gaps = 87/343 (25%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYAIFDPRKSSSFQKINCDHPDCTYFKCVNEQCVYTM 60
++RL+IGTP L+I DTGS I+ P C N QCVY
Sbjct: 79 LMRLYIGTPPVERLVIADTGSDFIWVQCSP--------------------CQNCQCVYLN 118
Query: 61 KYADQSVTKGFAAHETISVIGKGEGKAI-FHGALFGC-SNDNHGFDEDARDGALAGVLGL 118
YA++S T ET+S G + + F ++FGC +N+N F + G++GL
Sbjct: 119 IYANKSFTIEVVGTETLSFDSTGGAQTVSFPNSIFGCGANNNLTFRSSDKA---TGLVGL 175
Query: 119 SRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPN 178
+S +SQLG+ I +F SYLKFG++ +T I P+
Sbjct: 176 VAGQLSLVSQLGAQIGYKF---------------SYLKFGSEAIITTNGVVSTPLIIKPS 220
Query: 179 -NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVS 237
Y+L+L+ ++I + + P +T +
Sbjct: 221 LPLYFLNLEVVTIGQKVV--PTETLGV--------------------------------- 245
Query: 238 YFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDGENVFIIDYENHFFL 297
+ D P P + C+ + P++AF F A++ + +N+ I + +
Sbjct: 246 -------ESVQDLPFPFKFCFPYRDNMT-VPAIAFQFTGASVALRPKNLLIKLQDRNMLX 297
Query: 298 LAVAPHD---DLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
LAV P ++++ G Q D + +YDL+ +S +C+
Sbjct: 298 LAVVPSASSLSVISIFGIIAQFDFQVLYDLDGKKVSVAPTDCT 340
>gi|255566006|ref|XP_002523991.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536718|gb|EEF38359.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 455
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 98/383 (25%), Positives = 161/383 (42%), Gaps = 74/383 (19%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
+++L IGTP + +DTGS +I+ +IF+P SS++Q CD C
Sbjct: 99 LMKLLIGTPPTEIHAAIDTGSNVIWIPCINCKDCFNQSSSIFNPLASSTYQDAPCDSYQC 158
Query: 47 --TYFKCVNEQ-CVYTMKYADQ-SVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
T C ++ C+Y+ Q + G A +T+++ + F C N +
Sbjct: 159 ETTSSSCQSDNVCLYSCDEKHQLNCPNGRIAVDTMTLTSSDGRPFPLPYSDFVCGNSIY- 217
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS---SYLKFG- 158
+ A GV+GL R +S S+L + +FSYCL +Y S S + FG
Sbjct: 218 -----KTFAGVGVIGLGRGALSLTSKLYHLSDGKFSYCL------ADYYSKQPSKINFGL 266
Query: 159 -------------TDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMN--FPPDTFD 203
T +G+ R H N YY++L+ IS+ +R + + D F
Sbjct: 267 QSFISDDDLEVVSTTLGHHR----------HSGN-YYVTLEGISVGEKRQDLYYVDDPFA 315
Query: 204 ITVSGEGGCIIDSGSVLT--------YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQ 255
V G +IDSG++ T Y S V + + E ++ + D +
Sbjct: 316 PPV---GNMLIDSGTMFTLLPKDFYDYLWSTVSYAIPENPQNHPHNSRFPFSMDNTLKLS 372
Query: 256 LC-YFLPETFNRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQ 314
C ++ PE +FP + +F DA++ + +N FI E+ A + GS Q
Sbjct: 373 PCFWYYPEL--KFPKITIHFTDADVELSDDNSFIRVAEDVVCFAFAATQPGQSTVYGSWQ 430
Query: 315 QRDTRFVYDLNIDLLSFVKENCS 337
Q + YDL +SF + +CS
Sbjct: 431 QMNFILGYDLKRGTVSFKRTDCS 453
>gi|356540369|ref|XP_003538662.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Glycine max]
Length = 364
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 94/341 (27%), Positives = 153/341 (44%), Gaps = 26/341 (7%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYAIFDPRKSSSFQKINCDHP--DCTYF---KCVNEQ 55
+++L +GTP V ++DT S L++A P + QK P +C F C E+
Sbjct: 32 LMKLTLGTPPVDVYGLVDTDSDLVWAQCTPCQGCYKQKNPMFDPLKECNSFFDHSCSPEK 91
Query: 56 -CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG-FDEDARDGALA 113
C Y YAD S TKG A E I+ +GK I +FGC ++N G F+E+
Sbjct: 92 ACDYVYAYADDSATKGMLAKE-IATFSSTDGKPIVESIIFGCGHNNTGVFNENDMGLIGL 150
Query: 114 GVLGLSRVTISFISQLGSII-KKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATK 172
+S +SQ+G++ KRFS CLV P +TS + G T
Sbjct: 151 -----GGGPLSLVSQMGNLYGSKRFSQCLV-PFHADPHTSGTISLGEASDVSGEGVVTTP 204
Query: 173 FINHPNNFYYL-SLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKL 231
++ YL +L+ IS+ + + F + +G +IDSG+ TY + Y +L
Sbjct: 205 LVSEEGQTPYLVTLEGISVGDTFVPF----NSSEMLSKGNIMIDSGTPETYLPQEFYDRL 260
Query: 232 HEKFVSYFERFQLAQLSDCPE-PIQLCYFLPETFNRFPSMAFYFEDANLRIDGENVFIID 290
E+ + L + P+ QLCY ET P + +FE A++++ FI
Sbjct: 261 VEELKV---QINLPPIHVDPDLGTQLCY-KSETNLEGPILTAHFEGADVKLLPLQTFIPP 316
Query: 291 YENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSF 331
++ F A+ D + + G+ Q + +DL+ ++ F
Sbjct: 317 -KDGVFCFAMTGTTDGLYIFGNFAQSNVLIGFDLDKRIVFF 356
>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
Length = 454
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 97/390 (24%), Positives = 154/390 (39%), Gaps = 82/390 (21%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCD 42
R+ +GTP + + +DTGS +++ FDPR SS+ ++C
Sbjct: 43 TRIELGTPPRPFYVQIDTGSDILWVNCKPCNACPLTSGLGVALNFFDPRGSSTASPLSCI 102
Query: 43 HPDCTYFKCVNEQ-------CVYTMKYADQSVTKGFAAHETIS--------VIGKGEGKA 87
C ++E C Y+ +Y D S T G+ + V K
Sbjct: 103 DSKCVSSNQISESVCTTDRYCGYSFEYGDGSGTLGYYVSDEFDYNQYVNQYVTNNASAKI 162
Query: 88 IFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPL 145
FGCS + G D D A+ G+ G + +S +SQL S + K FS+CL
Sbjct: 163 T-----FGCSYNQSG-DLTKPDRAVDGIFGFGQNDLSVVSQLNSQGLAPKIFSHCLEGAD 216
Query: 146 PNGEYTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDI 204
P G L G P T + + P+ Y L+L+ I+++ ++++ P F
Sbjct: 217 PGG----GILVLGE---ITEPGMVYTPIVPSQPH--YNLNLQGIAVNGQQLSIDPQVFAT 267
Query: 205 TVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL----CYFL 260
T G IID G+ L Y + Y E FV+ +A +S +P L C+
Sbjct: 268 T--NTRGTIIDCGTTLAYLAEEAY----EPFVNTI----IAAVSQSTQPFMLKGNPCFLT 317
Query: 261 PETFNR-FPSMAFYFEDA------------NLRIDGENVFIIDYENHFFLLAVAPHDDLV 307
+ + FPS+ YFE A L D V+ I ++ A +
Sbjct: 318 VHSIDEIFPSVTLYFEGAPMDLKPKDYLIQQLSPDSSPVWCIGWQKSG---QQATDSSKM 374
Query: 308 ALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
++G +D FVYDL + + +CS
Sbjct: 375 TILGDLVLKDKVFVYDLENQRIGWTSFDCS 404
>gi|168063189|ref|XP_001783556.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664943|gb|EDQ51645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 414
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 94/377 (24%), Positives = 142/377 (37%), Gaps = 61/377 (16%)
Query: 4 LFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPDCTY 48
+ IG P+K L +DTGS L + ++DP+++ + ++C P C
Sbjct: 35 MRIGNPAKLYYLDMDTGSDLTWLQCDAPCRSCAVGPHGLYDPKRA---RVVDCRRPTCAQ 91
Query: 49 ------FKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
F C + QC Y + Y D S T G +TI+++ G A+ GC D
Sbjct: 92 VQRGGQFTCSGDVRQCDYEVDYVDGSSTMGILVEDTITLV-LTNGTRFQTRAVIGCGYDQ 150
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEYTSSYLKFG 158
G A GV+GLS IS SQL + I +CL G YL FG
Sbjct: 151 QGTLAKA-PAVTDGVIGLSSSKISLPSQLAAKGIANNVIGHCLA----GGSNGGGYLFFG 205
Query: 159 TDMGYRRPSTQATKFINHP-NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
D T I P Y L+ I E + T D+ GG + DSG
Sbjct: 206 -DTLVPALGMTWTPMIGRPLVEGYQARLRSIKYGGEVLELEGTTDDV-----GGAMFDSG 259
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQL-SDCPEPIQLCYFLPETFNRFPSMAFYFED 276
+ TY + Y + V +R L ++ +D P C+ P F ++ YF+
Sbjct: 260 TSFTYLVPNAYTAVLSAVVRQAQRSGLERIKTDTTLP--FCWRGPSPFESVADVSAYFKT 317
Query: 277 ANLRIDG--------------ENVFIIDYENHF---FLLAVAPHDDLVALIGSQQQRDTR 319
L G E I+ + + L A ++ ++G R
Sbjct: 318 VTLDFGGSTWWSSGKLLELSPEGYLIVSTQGNVCLGVLDASVASLEVTNILGDISMRGYL 377
Query: 320 FVYDLNIDLLSFVKENC 336
VYD + + +V+ NC
Sbjct: 378 VVYDNMREQIGWVRRNC 394
>gi|357116170|ref|XP_003559856.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 460
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 84/374 (22%), Positives = 151/374 (40%), Gaps = 49/374 (13%)
Query: 1 MVRLFIGTPSKGVLLILDT-------------GSALIYAIFDPRKSSSFQKINCD----- 42
+VR +GTP + +LL +DT G F+P S++F+ + C
Sbjct: 95 LVRASLGTPPQRLLLAVDTSNDAAWVPCAGCHGCPTTAPSFNPASSATFRPVPCGAPPCS 154
Query: 43 ---HPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
+P CT C +++ Y D S+ + + ++V G + G FGC
Sbjct: 155 QAPNPSCTSLAKSKNSCGFSLSYGDSSLDATLS-QDNLAVTANG---GVIKGYTFGCLTK 210
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
++G A+ +LGL R + F++Q I + FSYCL + S L G
Sbjct: 211 SNGSAAPAQG-----LLGLGRGPLGFVAQTKGIYEGTFSYCLPSYYRSAANFSGSLTLGR 265
Query: 160 DMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
+ T + P+ + YY+++ + I + + PP + G ++DSG
Sbjct: 266 KGQPAPEKMKTTPLLASPHRPSLYYVAMTGVRIGKKSVPIPPSALAFDAATGAGTVLDSG 325
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL--------CYFLPETFNRFPS 269
++ Y + ++ V L + + + CY + +P+
Sbjct: 326 TMFARLAQPAYAAVRDE-VRRRVAGSLRRRGGGGASVSVSSLGGFDTCYNVSTV--AWPA 382
Query: 270 MAFYFEDA-NLRIDGENVFIIDYENHFFLLAVA--PHDDLVA---LIGSQQQRDTRFVYD 323
+ F +R+ ENV I LA+A P D + A +IGS QQ++ R ++D
Sbjct: 383 VTLVFGGGMEVRLPEENVVIRSTYGSTSCLAMAASPADGVNAALNVIGSLQQQNHRVLFD 442
Query: 324 LNIDLLSFVKENCS 337
+ + F +E C+
Sbjct: 443 VPNARVGFARERCT 456
>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 92/377 (24%), Positives = 158/377 (41%), Gaps = 60/377 (15%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
++ IGTP+K + +DTGS +++ +++ +S S + ++CD
Sbjct: 82 AKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLVSCD 141
Query: 43 HPDCTYFK------C-VNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGA 92
C C N C Y Y D S T G+ + + SV G + +
Sbjct: 142 DDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTANGSV 201
Query: 93 LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEY 150
+FGC G + + + AL G+LG + S ISQL S +KK F++CL +G
Sbjct: 202 IFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCL-----DGRN 256
Query: 151 TSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
G + +P T + N P+ Y +++ + + E + P D F
Sbjct: 257 GGGIFAIGRVV---QPKVNMTPLVPNQPH--YNVNMTAVQVGQEFLTIPADLFQ--PGDR 309
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPS 269
G IIDSG+ L Y +Y L +K S ++ + + Q + E FP+
Sbjct: 310 KGAIIDSGTTLAYLPEIIYEPLVKKITSQEPALKVHIVDKDYKCFQYSGRVDEG---FPN 366
Query: 270 MAFYFEDAN-LRI-------DGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRF 320
+ F+FE++ LR+ E ++ I ++N A+ D + L+G +
Sbjct: 367 VTFHFENSVFLRVYPHDYLFPHEGMWCIGWQNS----AMQSRDRRNMTLLGDLVLSNKLV 422
Query: 321 VYDLNIDLLSFVKENCS 337
+YDL L+ + + NCS
Sbjct: 423 LYDLENQLIGWTEYNCS 439
>gi|218195474|gb|EEC77901.1| hypothetical protein OsI_17222 [Oryza sativa Indica Group]
Length = 467
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 96/381 (25%), Positives = 154/381 (40%), Gaps = 64/381 (16%)
Query: 1 MVRLFIGTPSKGV---LLILDTGSALI---------------YAIFDPRKSSSFQKINCD 42
+V+L IGTP+ + ++ DTGS L Y DP KS +F++++C
Sbjct: 103 LVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRRLSCF 162
Query: 43 HPDCTYFKCV------NEQCVYTMKYADQSVTKGFAAHET--ISVIGKGEGKAIFHGALF 94
P C V + C++ +Y D G + G G G + F
Sbjct: 163 DPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQLERDVAF 222
Query: 95 GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT--- 151
GC+ H D A G G+L L SF++QLG RFSYC +P E T
Sbjct: 223 GCA---HVEDSKAVRGYSTGILALGIGKPSFVTQLGV---DRFSYC----IPASEITDDD 272
Query: 152 --------SSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFD 203
+S+L+FG+ + R + + F + Y + LK + +
Sbjct: 273 DDDDEERSASFLRFGS---HARMTGKRAPF-KQDGSGYAVRLKSVVYQHGGRLNQQQPVP 328
Query: 204 ITVSGEGGC-----IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY 258
+ V+GE ++DSG+ L + V++ L + E L + D P CY
Sbjct: 329 VYVAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIE---EDISLTRRYDLTHPSLYCY 385
Query: 259 FLPETFNRFPSMAFYF-EDANLRIDGENVFIID--YENHFFLLAVAPHDDLVALIGSQQQ 315
T S+ F A+L + G ++F D + LAVA + A++G Q
Sbjct: 386 LGNMTDVEAVSVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAGNR--AILGVYPQ 443
Query: 316 RDTRFVYDLNIDLLSFVKENC 336
R+ YDL+ ++F ++ C
Sbjct: 444 RNINVGYDLSTMEIAFDRDQC 464
>gi|359476199|ref|XP_003631804.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 421
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 74/246 (30%), Positives = 109/246 (44%), Gaps = 43/246 (17%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYAIFDPRKSSSFQKINCDHPDCTYFKCVNEQ----- 55
+V + GTP + +LILDTGS++ + + +NC YF
Sbjct: 129 LVDVAFGTPPQNFMLILDTGSSITW-------TQCKACVNCLQDSHRYFNWSASSTYSSG 181
Query: 56 -CV-------YTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDA 107
C+ Y M Y D S + G +T+++ E +F FGC +N G D
Sbjct: 182 SCIPGTVENNYNMTYGDDSTSVGNYGCDTMTL----EPSDVFQKFQFGCGRNNKG---DF 234
Query: 108 RDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPS 167
G + G+LGL + +S +SQ S K FSYCL P + S L FG + S
Sbjct: 235 GSG-VDGMLGLGQGQLSTVSQTASKFNKVFSYCL----PEEDSIGSLL-FGEKATSQSSS 288
Query: 168 TQATKFINHP-----NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
+ T +N P + +Y+++L DIS+ NER+N P F G IIDS +V+T
Sbjct: 289 LKFTSLVNGPGTLQESGYYFVNLSDISVGNERLNIPSSVF-----ASPGTIIDSRTVITR 343
Query: 223 FHSDVY 228
Y
Sbjct: 344 LPQRAY 349
>gi|116311058|emb|CAH67989.1| OSIGBa0142I02-OSIGBa0101B20.32 [Oryza sativa Indica Group]
Length = 488
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 96/381 (25%), Positives = 154/381 (40%), Gaps = 64/381 (16%)
Query: 1 MVRLFIGTPSKGV---LLILDTGSALI---------------YAIFDPRKSSSFQKINCD 42
+V+L IGTP+ + ++ DTGS L Y DP KS +F++++C
Sbjct: 124 LVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRRLSCF 183
Query: 43 HPDCTYFKCV------NEQCVYTMKYADQSVTKGFAAHET--ISVIGKGEGKAIFHGALF 94
P C V + C++ +Y D G + G G G + F
Sbjct: 184 DPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQLERDVAF 243
Query: 95 GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT--- 151
GC+ H D A G G+L L SF++QLG RFSYC +P E T
Sbjct: 244 GCA---HVEDSKAVRGYSTGILALGIGKPSFVTQLG---VDRFSYC----IPASEITDDD 293
Query: 152 --------SSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFD 203
+S+L+FG+ + R + + F + Y + LK + +
Sbjct: 294 DDDDEERSASFLRFGS---HARMTGKRAPF-KQDGSGYAVRLKSVVYQHGGRLNQQQPVP 349
Query: 204 ITVSGEGGC-----IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY 258
+ V+GE ++DSG+ L + V++ L + E L + D P CY
Sbjct: 350 VYVAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIE---EDISLTRRYDLTHPSLYCY 406
Query: 259 FLPETFNRFPSMAFYF-EDANLRIDGENVFIID--YENHFFLLAVAPHDDLVALIGSQQQ 315
T S+ F A+L + G ++F D + LAVA + A++G Q
Sbjct: 407 LGNMTDVEAVSVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAGNR--AILGVYPQ 464
Query: 316 RDTRFVYDLNIDLLSFVKENC 336
R+ YDL+ ++F ++ C
Sbjct: 465 RNINVGYDLSTMEIAFDRDQC 485
>gi|18409620|ref|NP_566966.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|13430562|gb|AAK25903.1|AF360193_1 unknown protein [Arabidopsis thaliana]
gi|4886277|emb|CAB43423.1| putative protein [Arabidopsis thaliana]
gi|14532764|gb|AAK64083.1| unknown protein [Arabidopsis thaliana]
gi|15450892|gb|AAK96717.1| Unknown protein [Arabidopsis thaliana]
gi|30387567|gb|AAP31949.1| At3g52500 [Arabidopsis thaliana]
gi|332645431|gb|AEE78952.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 469
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 95/389 (24%), Positives = 147/389 (37%), Gaps = 65/389 (16%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY----------------------AIFDPRKSSSFQKI 39
V L GTPS+ + + DTGS+L++ F P+ SSS + I
Sbjct: 92 VSLSFGTPSQTIPFVFDTGSSLVWLPCTSRYLCSGCDFSGLDPTLIPRFIPKNSSSSKII 151
Query: 40 NCDHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALF----- 94
C P C + N QC ++ T G + +G G I F
Sbjct: 152 GCQSPKCQFLYGPNVQC-RGCDPNTRNCTVGCPPYILQYGLGSTAGVLITEKLDFPDLTV 210
Query: 95 -----GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGE 149
GCS AG+ G R +S SQ+ KRFS+CLV +
Sbjct: 211 PDFVVGCS--------IISTRQPAGIAGFGRGPVSLPSQMN---LKRFSHCLVSRRFDDT 259
Query: 150 YTSSYLKF----GTDMGYRRPSTQATKFINHPN-------NFYYLSLKDISIDNERMNFP 198
++ L G + G + P T F +PN +YYL+L+ I + + + P
Sbjct: 260 NVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGRKHVKIP 319
Query: 199 PDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY 258
+G+GG I+DSGS T+ V+ + E+F S + + + + C+
Sbjct: 320 YKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKETGLGPCF 379
Query: 259 FLPETFN-RFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDL--------VA 308
+ + P + F F+ A L + N F L V +
Sbjct: 380 NISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDKTVNPSGGTGPAI 439
Query: 309 LIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
++GS QQ++ YDL D F K+ CS
Sbjct: 440 ILGSFQQQNYLVEYDLENDRFGFAKKKCS 468
>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
Length = 458
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 86/365 (23%), Positives = 152/365 (41%), Gaps = 58/365 (15%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
++ + IG+P+ + +DTGS + + ++FDP SS++ +C C
Sbjct: 123 VITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVDSLFDPSSSSTYSPFSCSSAPC 182
Query: 47 TYFK-------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
C++ QC Y + Y D S T G + +T+++ G + FGCS
Sbjct: 183 AQLSQSQEGNGCMSSQCQYIVNYGDSSSTTGTYSSDTLTL-----GSSAMTDFQFGCSQS 237
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
G D D G++GL S SQ FSYC LP +S +L GT
Sbjct: 238 ESGGFNDQTD----GLMGLGGGAQSLASQTAGTFGTAFSYC----LPPTSGSSGFLTLGT 289
Query: 160 DMG--YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
+ P ++T+ +Y + L+ I + ++++N P F G ++DSG
Sbjct: 290 GSSGFVKTPMLRSTQI----PTYYVVLLESIKVGSQQLNLPTSVF------SAGSLMDSG 339
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFED 276
+++T Y L F + +++ A S + C+ F ++ P++ F
Sbjct: 340 TIITRLPPTAYSALSSAFKAGMQQYPPATPSGI---LDTCFDFSGQSSISIPTVTLVFSG 396
Query: 277 A---NLRIDGENVFIIDYENHFFLLAVAPH--DDLVALIGSQQQRDTRFVYDLNIDLLSF 331
+L DG +++ + LA P+ D + +IG+ QQR +YD+ + F
Sbjct: 397 GAAVDLAFDG---IMLEISSSIRCLAFTPNGDDSSLGIIGNVQQRTFEVLYDVGGGAVGF 453
Query: 332 VKENC 336
C
Sbjct: 454 KAGAC 458
>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 507
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 92/378 (24%), Positives = 153/378 (40%), Gaps = 58/378 (15%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI-------------------FDPRKSSSFQKINCD 42
R+ +G P+K + +DTGS +++ F+P SS+ +I C
Sbjct: 91 TRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRITCS 150
Query: 43 HPDCT--------YFKCVNEQ---CVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAI 88
CT + N Q C YT Y D S T G+ +T+ +V+G +
Sbjct: 151 DDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQTANS 210
Query: 89 FHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLP 146
+FGCSN G D D A+ G+ G + +S ISQL S + K FS+CL
Sbjct: 211 SASIVFGCSNSQSG-DLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL----K 265
Query: 147 NGEYTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDIT 205
+ L G + P T + + P+ Y L+L+ I+++ +++ P D+ T
Sbjct: 266 GSDNGGGILVLGEIV---EPGLVYTPLVPSQPH--YNLNLESIAVNGQKL--PIDSSLFT 318
Query: 206 VSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN 265
S G I+DSG+ L Y Y + FVS + C+ + +
Sbjct: 319 TSNTQGTIVDSGTTLAYLADGAY----DPFVSAIAAAVSPSVRSLVSKGSQCFITSSSVD 374
Query: 266 -RFPSMAFYFEDA-NLRIDGENVFI----IDYENHFFLLAVAPHDDLVALIGSQQQRDTR 319
FP++ YF + + EN + +D + + + ++G +D
Sbjct: 375 SSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKI 434
Query: 320 FVYDLNIDLLSFVKENCS 337
FVYDL + + +CS
Sbjct: 435 FVYDLANMRMGWADYDCS 452
>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 509
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 92/378 (24%), Positives = 153/378 (40%), Gaps = 58/378 (15%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI-------------------FDPRKSSSFQKINCD 42
R+ +G P+K + +DTGS +++ F+P SS+ +I C
Sbjct: 93 TRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRITCS 152
Query: 43 HPDCT--------YFKCVNEQ---CVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAI 88
CT + N Q C YT Y D S T G+ +T+ +V+G +
Sbjct: 153 DDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQTANS 212
Query: 89 FHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLP 146
+FGCSN G D D A+ G+ G + +S ISQL S + K FS+CL
Sbjct: 213 SASIVFGCSNSQSG-DLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL----K 267
Query: 147 NGEYTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDIT 205
+ L G + P T + + P+ Y L+L+ I+++ +++ P D+ T
Sbjct: 268 GSDNGGGILVLGEIV---EPGLVYTPLVPSQPH--YNLNLESIAVNGQKL--PIDSSLFT 320
Query: 206 VSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN 265
S G I+DSG+ L Y Y + FVS + C+ + +
Sbjct: 321 TSNTQGTIVDSGTTLAYLADGAY----DPFVSAIAAAVSPSVRSLVSKGSQCFITSSSVD 376
Query: 266 -RFPSMAFYFEDA-NLRIDGENVFI----IDYENHFFLLAVAPHDDLVALIGSQQQRDTR 319
FP++ YF + + EN + +D + + + ++G +D
Sbjct: 377 SSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKI 436
Query: 320 FVYDLNIDLLSFVKENCS 337
FVYDL + + +CS
Sbjct: 437 FVYDLANMRMGWADYDCS 454
>gi|326490656|dbj|BAJ89995.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 84/359 (23%), Positives = 147/359 (40%), Gaps = 53/359 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
++ + +G+P+ +++DTGS + + ++FDP SS++ +C C
Sbjct: 128 LITVGMGSPAVAQTMLIDTGSDVSWVQCKPCSQCHSQADSLFDPSSSSTYSAFSCTSAAC 187
Query: 47 TYFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG- 102
+ C + QC YT+KY D S G + +T+++ G + FGCS G
Sbjct: 188 AQLRQRGCSSSQCQYTVKYGDGSTGSGTYSSDTLAL-----GSSTVENFQFGCSQSESGN 242
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
+D G + L S +Q K FSYCL P P +S +L G
Sbjct: 243 LLQDQTAGLMG----LGGGAESLATQTAGTFGKAFSYCLP-PTPG---SSGFLTLGASTS 294
Query: 163 ---YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
+ P ++T+ ++Y + L+ I + ++N P F G I+DSG++
Sbjct: 295 GFVVKTPMLRSTQV----PSYYGVLLQAIRVGGRQLNIPASAF------SAGSIMDSGTI 344
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPI-QLCY-FLPETFNRFPSMAFYFEDA 277
+T Y L F + +++ AQ P I C+ F ++ P++A F
Sbjct: 345 ITRLPRTAYSALSSAFKAGMKQYPPAQ----PMGIFDTCFDFSGQSSVSIPTVALVFSGG 400
Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ + I+ A D + +IG+ QQR +YD+ + F C
Sbjct: 401 AVVDLASDGIIL---GSCLAFAANSDDTSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 456
>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 90/364 (24%), Positives = 145/364 (39%), Gaps = 48/364 (13%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
RL IGTP + LI+D+GS + Y F P SS++ + C+ DCT
Sbjct: 90 TRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKCNV-DCT 148
Query: 48 YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG--FDE 105
N QC Y +YA+ S + G + +S + E K A+FGC N G F +
Sbjct: 149 CDSDKN-QCTYERQYAEMSSSSGVLGEDIVSFGTESELKP--QRAVFGCENSETGDLFSQ 205
Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCL-VIPLPNGEYTSSYLKFGTDMG 162
A G++GL R +S + QL +I FS C + + G + M
Sbjct: 206 HAD-----GIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLGAMPAPPGMI 260
Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
Y + + P +Y + LK++ + + + P FD G+ G ++DSG+ Y
Sbjct: 261 YTH-----SNAVRSP--YYNIELKEMHVAGKALRVDPRIFD----GKHGTVLDSGTTYAY 309
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF-----LPETFNRFPSMAFYFEDA 277
+ + S + + D P +C+ + + FP + F +
Sbjct: 310 LPEQAFVAFKDAVSSQVHPLKKIRGPD-PNYKDICFAGAGRNVSQLSEVFPKVDMVFGNG 368
Query: 278 N-LRIDGENVFI--IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
L + EN E + L D L+G R+T YD + + + F K
Sbjct: 369 QKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKT 428
Query: 335 NCSD 338
NCS+
Sbjct: 429 NCSE 432
>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 475
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 89/374 (23%), Positives = 157/374 (41%), Gaps = 55/374 (14%)
Query: 3 RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
+L +G+P K + +DTGS +++ ++DP+ S + + I+CD
Sbjct: 73 KLGLGSPPKDYYVQVDTGSDILWVNCVKCSRCPRKSDLGIDLTLYDPKGSETSELISCDQ 132
Query: 44 PDCTYF------KCVNE-QCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGAL 93
C+ C +E C Y++ Y D S T G+ + ++ V +
Sbjct: 133 EFCSATYDGPIPGCKSEIPCPYSITYGDGSATTGYYVQDYLTYNHVNDNLRTAPQNSSII 192
Query: 94 FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYT 151
FGC G + + AL G++G + S +SQL + +KK FS+CL +
Sbjct: 193 FGCGAVQSGTLSSSSEEALDGIIGFGQSNSSVLSQLAASGKVKKIFSHCL-----DNIRG 247
Query: 152 SSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEG- 210
G + + +T + H Y + LK I +D + + P D FD SG G
Sbjct: 248 GGIFAIGEVVEPKVSTTPLVPRMAH----YNVVLKSIEVDTDILQLPSDIFD---SGNGK 300
Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-FPS 269
G IIDSG+ L Y + VY +L K ++ R +L + E C+ +R FP
Sbjct: 301 GTIIDSGTTLAYLPAIVYDELIPKVMARQPRLKLYLV----EQQFSCFQYTGNVDRGFPV 356
Query: 270 MAFYFEDANLRIDGENVFIIDYENHFFLL------AVAPHDDLVALIGSQQQRDTRFVYD 323
+ +FED+ + ++ +++ + + A + + L+G + +YD
Sbjct: 357 VKLHFEDSLSLTVYPHDYLFQFKDGIWCIGWQKSVAQTKNGKDMTLLGDLVLSNKLVIYD 416
Query: 324 LNIDLLSFVKENCS 337
L + + NCS
Sbjct: 417 LENMAIGWTDYNCS 430
>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
Length = 504
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 92/376 (24%), Positives = 152/376 (40%), Gaps = 56/376 (14%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI-------------------FDPRKSSSFQKINCD 42
R+ +G+P K + +DTGS +++ F+P SS+ KI C
Sbjct: 93 TRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIPCS 152
Query: 43 HPDCTYFKCVNEQ---------CVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFH 90
CT +E C YT Y D S T G+ +T+ SV+G +
Sbjct: 153 DDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDSVMGNEQTANSSA 212
Query: 91 GALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNG 148
+FGCSN G D D A+ G+ G + +S +SQL S + K FS+CL
Sbjct: 213 SIVFGCSNSQSG-DLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL----KGS 267
Query: 149 EYTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVS 207
+ L G + P T + + P+ Y L+L+ I ++ +++ P D+ T S
Sbjct: 268 DNGGGILVLGEIV---EPGLVYTPLVPSQPH--YNLNLESIVVNGQKL--PIDSSLFTTS 320
Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-R 266
G I+DSG+ L Y Y + FV+ + C+ + +
Sbjct: 321 NTQGTIVDSGTTLAYLADGAY----DPFVNAITAAVSPSVRSLVSKGNQCFVTSSSVDSS 376
Query: 267 FPSMAFYFEDA-NLRIDGENVFI----IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFV 321
FP+++ YF + + EN + ID + + + ++G +D FV
Sbjct: 377 FPTVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFV 436
Query: 322 YDLNIDLLSFVKENCS 337
YDL + + +CS
Sbjct: 437 YDLANMRMGWTDYDCS 452
>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 494
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 80/334 (23%), Positives = 129/334 (38%), Gaps = 62/334 (18%)
Query: 27 IFDPRKSSSFQKINCDHPDCTYFKC--------VNEQCVYTMKYADQSVTKGFAAHETIS 78
++DP KSS+F I C P C ++C Y + Y D T G +T++
Sbjct: 199 LYDPAKSSTFAPIPCGSPACKELGSSYGNGCSPTTDECKYIVNYGDGKATTGTYVTDTLT 258
Query: 79 VIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFS 138
+ + FGCS+ G + AG+L L S + Q FS
Sbjct: 259 M----SPTIVVKDFRFGCSHAVRGSFSNQN----AGILALGGGRGSLLEQTADAYGNAFS 310
Query: 139 YCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFI-------NHPNNFYYLSLKDISID 191
YC IP P+ ++ +L G P + KF H FY + L+ I +
Sbjct: 311 YC--IPKPS---SAGFLSLG------GPVEASLKFSYTPLIKNKHAPTFYIVHLEAIIVA 359
Query: 192 NERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP 251
+++ PP F G ++DSG+V+T VY L F R +A
Sbjct: 360 GKQLAVPPTAFAT------GAVMDSGAVVTQLPPQVYAALRAAF-----RSAMAAYGPLA 408
Query: 252 EPIQ---LCYFLPETFNRFPSMA------FYFEDANLRIDGENVFIIDYENHFFLLAVAP 302
P++ CY F RFP + + A L ++ ++ + + A P
Sbjct: 409 APVRNLDTCY----DFTRFPDVKVPKVSLVFAGGATLDLEPASIIL----DGCLAFAATP 460
Query: 303 HDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
++ V IG+ QQ+ +YD+ + F + C
Sbjct: 461 GEESVGFIGNVQQQTYEVLYDVGGGKVGFRRGAC 494
>gi|242085924|ref|XP_002443387.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
gi|241944080|gb|EES17225.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
Length = 460
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 90/382 (23%), Positives = 155/382 (40%), Gaps = 53/382 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
+ IG P + I+DTGS LI+ +DP +S + + + C+
Sbjct: 85 IAEYLIGDPPQQAAAIIDTGSNLIWTQCSTCRANGCFGQDLTFYDPSRSRTAKPVACNDT 144
Query: 45 DC---TYFKCV--NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
C + +C + C Y ++ GF E + G G+ FGC
Sbjct: 145 ACLLGSETRCARDGKACAVLTAYGAGAI-GGFLGTEVFT-FGHGQSSENNVSLAFGCITA 202
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSS-YLKFG 158
+ + DGA +G++GL R +S SQLG +FSYCL + TS+ ++
Sbjct: 203 SR-LTPGSLDGA-SGIIGLGRGKLSLPSQLG---DNKFSYCLTPYFSDAANTSTLFVGAS 257
Query: 159 TDMGYRRPSTQATKFINHPNN-----FYYLSLKDISIDNERMNFPPDTFD---ITVSGEG 210
+ + F+ +P++ FYYL L I++ +++ P FD + + G
Sbjct: 258 AGLSGGGAPATSVPFLKNPDDDPFDSFYYLPLTGITVGTAKLDVPAAAFDLREVAPAKWG 317
Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY--FLPETFNRF- 267
G +IDSGS T Y L ++ V + + E + LC P +
Sbjct: 318 GTLIDSGSPFTSLIDVAYQALRDELVRQLGASVVPPPAGA-EGLDLCVGGVAPGDAGKLV 376
Query: 268 PSMAFYF-----EDANLRIDGENVF-IIDYENHFFLL--AVAPHDDL----VALIGSQQQ 315
P + +F ++ + EN + +D ++ + P+ L +IG+ Q
Sbjct: 377 PPLVLHFGSGGGGGGDVVVPPENYWGPVDDSTACMVVFSSGGPNSTLPLNETTIIGNYMQ 436
Query: 316 RDTRFVYDLNIDLLSFVKENCS 337
+D +YDL +LSF +CS
Sbjct: 437 QDMHLLYDLGQGVLSFQPADCS 458
>gi|242065058|ref|XP_002453818.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
gi|241933649|gb|EES06794.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
Length = 490
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 99/385 (25%), Positives = 160/385 (41%), Gaps = 77/385 (20%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
++ IG+PSKG + +DTGS +++ +DP S + + CD
Sbjct: 87 TQIEIGSPSKGYYVQVDTGSDILWVNCIRCDGCPTTSGLGIELTQYDPAGSGT--TVGCD 144
Query: 43 HPDCTYFK---------CVNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFH 90
C + C + + Y D S T GF +++ V G G+
Sbjct: 145 QEFCVANSPNGLPPACPSTSSPCQFRIAYGDGSSTTGFYVSDSVQYNQVSGNGQTTPSNA 204
Query: 91 GALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNG 148
FGC G D + AL G+LG + S +SQL + ++K F++CL +
Sbjct: 205 SITFGCGA-QLGGDLGSSSQALDGILGFGQADSSMLSQLAAARKVRKIFAHCL-----DT 258
Query: 149 EYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSG 208
+ G + + +T + + H Y ++L+ IS+ + P TFD SG
Sbjct: 259 VHGGGIFAIGNVVQPKVKTTPLVQNVTH----YNVNLQGISVGGATLQLPSSTFD---SG 311
Query: 209 EG-GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNR 266
+ G IIDSG+ L Y +VY L + F+++Q L + + + C+ F +
Sbjct: 312 DSKGTIIDSGTTLAYLPREVYRTL---LTAVFDKYQDLALHNYQDFV--CFQFSGSIDDG 366
Query: 267 FPSMAFYFEDANLRIDGE---NVFIIDY----ENHFFLL-----AVAPHD--DLVALIGS 312
FP + F FE GE NV+ DY EN + + V D D+V L+G
Sbjct: 367 FPVVTFSFE-------GEITLNVYPHDYLFQNENDLYCMGFLDGGVQTKDGKDMV-LLGD 418
Query: 313 QQQRDTRFVYDLNIDLLSFVKENCS 337
+ VYDL ++ + NCS
Sbjct: 419 LVLSNKLVVYDLEKQVIGWADYNCS 443
>gi|307103543|gb|EFN51802.1| hypothetical protein CHLNCDRAFT_59135 [Chlorella variabilis]
Length = 746
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 80/324 (24%), Positives = 136/324 (41%), Gaps = 51/324 (15%)
Query: 4 LFIGTPSKGVLLILDTGSALIY----------------AIFDPRKSSSFQKINCDHPDCT 47
L++GTP+K +I+DTGS + Y A FDP SS+ +I+C P C+
Sbjct: 82 LYLGTPAKKFAVIVDTGSTMTYVPCSSCGSGCGPNHQDAAFDPEASSTASRISCTSPKCS 141
Query: 48 ----YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG- 102
C +QC YT YA+QS + G + +++ G I +FGC G
Sbjct: 142 CGSPRCGCSTQQCTYTRSYAEQSSSSGILLEDVLALHDGLPGAPI----IFGCETRETGE 197
Query: 103 -FDEDARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
F + A G+ GL S ++QL +I FS C + +G L G
Sbjct: 198 IFRQRAD-----GLFGLGNSDASVVNQLVKAGVIDDVFSLCFGMVEGDGA-----LLLGD 247
Query: 160 DMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEG-GCIIDSGS 218
S Q T + + +Y ++K +S+ E P ++ +G G ++DSG+
Sbjct: 248 AEVPGSISLQYTPLLTSTTHPFYYNVKMLSLAVEGQLLP---VSQSLFDQGYGTVLDSGT 304
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQ-LSDCPEP----IQLCYFLPETFNRFPSMAFY 273
TY S V+ + F E++ L+ L P P +C+ + + +++
Sbjct: 305 TFTYMPSPVF----KAFAGAVEKYALSHGLKRVPGPDPQFDDICFGQAPSHDDLEALSSV 360
Query: 274 FEDANLRIDGENVFIIDYENHFFL 297
F ++ D ++ N+ F+
Sbjct: 361 FPSMEVQFDQGTSLVLGPLNYLFV 384
>gi|32489096|emb|CAE03928.1| OSJNba0093F12.2 [Oryza sativa Japonica Group]
gi|58532027|emb|CAD41565.3| OSJNBa0006A01.20 [Oryza sativa Japonica Group]
Length = 489
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 96/383 (25%), Positives = 154/383 (40%), Gaps = 66/383 (17%)
Query: 1 MVRLFIGTPSKGV---LLILDTGSALI---------------YAIFDPRKSSSFQKINCD 42
+V+L IGTP+ + ++ DTGS L Y DP KS +F++++C
Sbjct: 123 LVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRRLSCF 182
Query: 43 HPDCTYFKCV------NEQCVYTMKYADQSVTKGFAAHET--ISVIGKGEGKAIFHGALF 94
P C V + C++ +Y D G + G G G + F
Sbjct: 183 DPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQLERDVAF 242
Query: 95 GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT--- 151
GC+ H D A G G+L L SF++QLG RFSYC +P E T
Sbjct: 243 GCA---HVEDSKAVRGYSTGILALGIGKPSFVTQLG---VDRFSYC----IPASEITDDD 292
Query: 152 ----------SSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDT 201
+S+L+FG+ + R + + F + Y + LK + +
Sbjct: 293 DDDDDDEERSASFLRFGS---HARMTGKRAPF-KQDGSGYAVRLKSVVYQHGGRLNQQQP 348
Query: 202 FDITVSGEGGC-----IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL 256
+ V+GE ++DSG+ L + V++ L + E L + D P
Sbjct: 349 VPVYVAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIE---EDISLTRRYDLTHPSLY 405
Query: 257 CYFLPETFNRFPSMAFYF-EDANLRIDGENVFIID--YENHFFLLAVAPHDDLVALIGSQ 313
CY T S+ F A+L + G ++F D + LAVA + A++G
Sbjct: 406 CYLGNMTDVEAVSVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAGNR--AILGVY 463
Query: 314 QQRDTRFVYDLNIDLLSFVKENC 336
QR+ YDL+ ++F ++ C
Sbjct: 464 PQRNINVGYDLSTMEIAFDRDQC 486
>gi|224142001|ref|XP_002324349.1| predicted protein [Populus trichocarpa]
gi|222865783|gb|EEF02914.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 91/364 (25%), Positives = 141/364 (38%), Gaps = 50/364 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
+V + +GTP K + LI DTGS + + IFDP +S+S+ I+C
Sbjct: 150 IVTVGLGTPKKDLSLIFDTGSDITWTQCQPCARSCYKQKEQIFDPSQSTSYTNISCSSSI 209
Query: 46 CTYFK--------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
C C + CVY ++Y D S + GF E +++ F+ FGC
Sbjct: 210 CNSLTSATGNTPGCASSACVYGIQYGDSSFSVGFFGTEKLTLTSTDA----FNNIYFGCG 265
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
+N G + R +S +SQ K FSYC LP+ ++ +L F
Sbjct: 266 QNNQGLFGGSAGLLGL-----GRDKLSVVSQTAQKYNKIFSYC----LPSSSSSTGFLTF 316
Query: 158 GTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
G T + P +FY L IS+ +++ F G IIDSG
Sbjct: 317 GGSASKNAKFTPLSTISAGP-SFYGLDFTGISVGGKKLAISASVFST-----AGAIIDSG 370
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFED 276
+V+T Y L F + ++ + + + CY F T P + F F
Sbjct: 371 TVITRLPPAAYSALRASFRNLMSKYPMTKALSI---LDTCYDFSSYTTISVPKIGFSFSS 427
Query: 277 A-NLRIDGENVFIIDYENHFFLLAVAPHDDL--VALIGSQQQRDTRFVYDLNIDLLSFVK 333
+ ID + + LA A + D V + G+ QQ+ YD + + F
Sbjct: 428 GIEVDIDATGILYASSLSQ-VCLAFAGNSDATDVFIFGNVQQKTLEVFYDGSAGKVGFAP 486
Query: 334 ENCS 337
CS
Sbjct: 487 GGCS 490
>gi|222629462|gb|EEE61594.1| hypothetical protein OsJ_16002 [Oryza sativa Japonica Group]
Length = 468
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 96/383 (25%), Positives = 154/383 (40%), Gaps = 66/383 (17%)
Query: 1 MVRLFIGTPSKGV---LLILDTGSALI---------------YAIFDPRKSSSFQKINCD 42
+V+L IGTP+ + ++ DTGS L Y DP KS +F++++C
Sbjct: 102 LVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRRLSCF 161
Query: 43 HPDCTYFKCV------NEQCVYTMKYADQSVTKGFAAHET--ISVIGKGEGKAIFHGALF 94
P C V + C++ +Y D G + G G G + F
Sbjct: 162 DPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQLERDVAF 221
Query: 95 GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT--- 151
GC+ H D A G G+L L SF++QLG RFSYC +P E T
Sbjct: 222 GCA---HVEDSKAVRGYSTGILALGIGKPSFVTQLGV---DRFSYC----IPASEITDDD 271
Query: 152 ----------SSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDT 201
+S+L+FG+ + R + + F + Y + LK + +
Sbjct: 272 DDDDDDEERSASFLRFGS---HARMTGKRAPF-KQDGSGYAVRLKSVVYQHGGRLNQQQP 327
Query: 202 FDITVSGEGGC-----IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL 256
+ V+GE ++DSG+ L + V++ L + E L + D P
Sbjct: 328 VPVYVAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIE---EDISLTRRYDLTHPSLY 384
Query: 257 CYFLPETFNRFPSMAFYF-EDANLRIDGENVFIID--YENHFFLLAVAPHDDLVALIGSQ 313
CY T S+ F A+L + G ++F D + LAVA + A++G
Sbjct: 385 CYLGNMTDVEAVSVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAGNR--AILGVY 442
Query: 314 QQRDTRFVYDLNIDLLSFVKENC 336
QR+ YDL+ ++F ++ C
Sbjct: 443 PQRNINVGYDLSTMEIAFDRDQC 465
>gi|115460260|ref|NP_001053730.1| Os04g0595000 [Oryza sativa Japonica Group]
gi|113565301|dbj|BAF15644.1| Os04g0595000, partial [Oryza sativa Japonica Group]
Length = 471
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 96/383 (25%), Positives = 154/383 (40%), Gaps = 66/383 (17%)
Query: 1 MVRLFIGTPSKGV---LLILDTGSALI---------------YAIFDPRKSSSFQKINCD 42
+V+L IGTP+ + ++ DTGS L Y DP KS +F++++C
Sbjct: 105 LVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRRLSCF 164
Query: 43 HPDCTYFKCV------NEQCVYTMKYADQSVTKGFAAHET--ISVIGKGEGKAIFHGALF 94
P C V + C++ +Y D G + G G G + F
Sbjct: 165 DPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQLERDVAF 224
Query: 95 GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT--- 151
GC+ H D A G G+L L SF++QLG RFSYC +P E T
Sbjct: 225 GCA---HVEDSKAVRGYSTGILALGIGKPSFVTQLGV---DRFSYC----IPASEITDDD 274
Query: 152 ----------SSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDT 201
+S+L+FG+ + R + + F + Y + LK + +
Sbjct: 275 DDDDDDEERSASFLRFGS---HARMTGKRAPF-KQDGSGYAVRLKSVVYQHGGRLNQQQP 330
Query: 202 FDITVSGEGGC-----IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL 256
+ V+GE ++DSG+ L + V++ L + E L + D P
Sbjct: 331 VPVYVAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIE---EDISLTRRYDLTHPSLY 387
Query: 257 CYFLPETFNRFPSMAFYF-EDANLRIDGENVFIID--YENHFFLLAVAPHDDLVALIGSQ 313
CY T S+ F A+L + G ++F D + LAVA + A++G
Sbjct: 388 CYLGNMTDVEAVSVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAGNR--AILGVY 445
Query: 314 QQRDTRFVYDLNIDLLSFVKENC 336
QR+ YDL+ ++F ++ C
Sbjct: 446 PQRNINVGYDLSTMEIAFDRDQC 468
>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
Length = 641
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 94/379 (24%), Positives = 150/379 (39%), Gaps = 68/379 (17%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI------------------------FDPRKSSSFQ 37
RL+IGTPS+ LI+D+GS + Y F P SS++
Sbjct: 93 TRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLSSTYS 152
Query: 38 KINCDHPDCTYFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFG 95
+ C+ DCT C NE QC Y +YA+ S + G + +S + E K A+FG
Sbjct: 153 PVKCNV-DCT---CDNERSQCTYERQYAEMSSSSGVLGEDIMSFGKESELKP--QRAVFG 206
Query: 96 CSNDNHG--FDEDARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYT 151
C N G F + A G++GL R +S + QL +I FS C Y
Sbjct: 207 CENTETGDLFSQHAD-----GIMGLGRGQLSIMDQLVEKGVISDSFSLC---------YG 252
Query: 152 SSYLKFGTDMGYRRPSTQATKFINHPN----NFYYLSLKDISIDNERMNFPPDTFDITVS 207
+ GT + P+ F +H N +Y + LK+I + + + P F+
Sbjct: 253 GMDVGGGTMVLGGMPAPPDMVF-SHSNPVRSPYYNIELKEIHVAGKALRLDPKIFN---- 307
Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF-----LPE 262
+ G ++DSG+ Y + + + + + D P +C+ + +
Sbjct: 308 SKHGTVLDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPD-PNYKDICFAGAGRNVSQ 366
Query: 263 TFNRFPSMAFYFEDAN-LRIDGENVFI--IDYENHFFLLAVAPHDDLVALIGSQQQRDTR 319
FP + F + L + EN E + L D L+G R+T
Sbjct: 367 LSEVFPDVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTL 426
Query: 320 FVYDLNIDLLSFVKENCSD 338
YD + + + F K NCS+
Sbjct: 427 VTYDRHNEKIGFWKTNCSE 445
>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
Length = 642
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 94/379 (24%), Positives = 150/379 (39%), Gaps = 68/379 (17%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI------------------------FDPRKSSSFQ 37
RL+IGTPS+ LI+D+GS + Y F P SS++
Sbjct: 94 TRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLSSTYS 153
Query: 38 KINCDHPDCTYFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFG 95
+ C+ DCT C NE QC Y +YA+ S + G + +S + E K A+FG
Sbjct: 154 PVKCNV-DCT---CDNERSQCTYERQYAEMSSSSGVLGEDIMSFGKESELKP--QRAVFG 207
Query: 96 CSNDNHG--FDEDARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYT 151
C N G F + A G++GL R +S + QL +I FS C Y
Sbjct: 208 CENTETGDLFSQHAD-----GIMGLGRGQLSIMDQLVEKGVISDSFSLC---------YG 253
Query: 152 SSYLKFGTDMGYRRPSTQATKFINHPN----NFYYLSLKDISIDNERMNFPPDTFDITVS 207
+ GT + P+ F +H N +Y + LK+I + + + P F+
Sbjct: 254 GMDVGGGTMVLGGMPAPPDMVF-SHSNPVRSPYYNIELKEIHVAGKALRLDPKIFN---- 308
Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF-----LPE 262
+ G ++DSG+ Y + + + + + D P +C+ + +
Sbjct: 309 SKHGTVLDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPD-PNYKDICFAGAGRNVSQ 367
Query: 263 TFNRFPSMAFYFEDAN-LRIDGENVFI--IDYENHFFLLAVAPHDDLVALIGSQQQRDTR 319
FP + F + L + EN E + L D L+G R+T
Sbjct: 368 LSEVFPDVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTL 427
Query: 320 FVYDLNIDLLSFVKENCSD 338
YD + + + F K NCS+
Sbjct: 428 VTYDRHNEKIGFWKTNCSE 446
>gi|168030587|ref|XP_001767804.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680886|gb|EDQ67318.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 97/388 (25%), Positives = 142/388 (36%), Gaps = 84/388 (21%)
Query: 3 RLFIGTPSKGVLLILDTGSALIYAI-------------------------FDPRKSSSFQ 37
R+FIGTP LI+DTGS + Y F P SSS+Q
Sbjct: 43 RVFIGTPPNEFALIVDTGSTVTYVPCSSCTHCGHHQASFSTHRLFCRDPRFKPENSSSYQ 102
Query: 38 KINCDHPDCTYFKC--VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL-F 94
KI C DC C + QC Y YA+ S +KG + + G + L F
Sbjct: 103 KIGCRSSDCITGLCDSNSHQCKYERMYAEMSTSKGVLGKDLLDF---GPASRLQSQLLSF 159
Query: 95 GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCL----------- 141
GC G D G++GL R +S + QL I+ FS C
Sbjct: 160 GCETAESG---DLYLQVADGIMGLGRGPLSIVDQLVGNGAIEDSFSLCYGGMDEGGGSMV 216
Query: 142 --VIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPP 199
IP P+G + F R +N+Y L L +I + +
Sbjct: 217 LGAIPAPSG------MVFAKSDPRR-------------SNYYNLELTEIQVQGASLKLDS 257
Query: 200 DTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP--IQLC 257
+ F+ G+ G I+DSG+ Y + + V+ Q D P+P +C
Sbjct: 258 NVFN----GKFGTILDSGTTYAYLPDRAFEAFTDAVVAQLGSLQAV---DGPDPNYPDIC 310
Query: 258 YF-----LPETFNRFPSMAFYF-EDANLRIDGEN-VFIIDYENHFFLLAVAPHDDLVALI 310
Y E FP + F F E+ + + EN +F + L + D L+
Sbjct: 311 YAGAGTDTKELGKHFPLVDFVFAENQKVSLAPENYLFKHTKVPGAYCLGFFKNQDATTLL 370
Query: 311 GSQQQRDTRFVYDLNIDLLSFVKENCSD 338
G R+ YD + F+K NC++
Sbjct: 371 GGIIVRNMLVTYDRYNHQIGFLKTNCTE 398
>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 504
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 91/376 (24%), Positives = 152/376 (40%), Gaps = 56/376 (14%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI-------------------FDPRKSSSFQKINCD 42
R+ +G+P K + +DTGS +++ F+P SS+ KI C
Sbjct: 93 TRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIPCS 152
Query: 43 HPDCTYFKCVNEQ---------CVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFH 90
CT +E C YT Y D S T G+ +T+ +V+G +
Sbjct: 153 DDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTANSSA 212
Query: 91 GALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNG 148
+FGCSN G D D A+ G+ G + +S +SQL S + K FS+CL
Sbjct: 213 SIVFGCSNSQSG-DLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL----KGS 267
Query: 149 EYTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVS 207
+ L G + P T + + P+ Y L+L+ I ++ +++ P D+ T S
Sbjct: 268 DNGGGILVLGEIV---EPGLVYTPLVPSQPH--YNLNLESIVVNGQKL--PIDSSLFTTS 320
Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-R 266
G I+DSG+ L Y Y + FV+ + C+ + +
Sbjct: 321 NTQGTIVDSGTTLAYLADGAY----DPFVNAITAAVSPSVRSLVSKGNQCFVTSSSVDSS 376
Query: 267 FPSMAFYFEDA-NLRIDGENVFI----IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFV 321
FP+++ YF + + EN + ID + + + ++G +D FV
Sbjct: 377 FPTVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFV 436
Query: 322 YDLNIDLLSFVKENCS 337
YDL + + +CS
Sbjct: 437 YDLANMRMGWTDYDCS 452
>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 97/392 (24%), Positives = 153/392 (39%), Gaps = 90/392 (22%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
++ IGTP K L +DTGS +++ ++D ++SSS + + CD
Sbjct: 87 AKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRSNLGMDLTLYDIKESSSGKFVPCD 146
Query: 43 HPDCTYFK-------CVNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGA 92
C N C Y Y D S T G+ + + V G + +
Sbjct: 147 QEFCKEINGGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSGDLKTDSANGSI 206
Query: 93 LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEY 150
+FGC G + + AL G+LG + S ISQL S +KK F++CL NG
Sbjct: 207 VFGCGARQSGDLSSSNEEALGGILGFGKANSSMISQLASSGKVKKMFAHCL-----NGVN 261
Query: 151 TSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
G + +P T + + P+ Y +++ + + + ++ DT T
Sbjct: 262 GGGIFAIGHVV---QPKVNMTPLLPDQPH--YSVNMTAVQVGHAFLSLSTDTS--TQGDR 314
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF-NRFP 268
G IIDSG+ L Y +Y L K +S ++ L D C+ E+ + FP
Sbjct: 315 KGTIIDSGTTLAYLPEGIYEPLVYKIISQHPDLKVRTLHD----EYTCFQYSESVDDGFP 370
Query: 269 SMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVA-----LIGSQ----QQRDTR 319
++ FYFE+ L V PHD L IG Q Q RD++
Sbjct: 371 AVTFYFENG------------------LSLKVYPHDYLFPSGDFWCIGWQNSGTQSRDSK 412
Query: 320 FV--------------YDLNIDLLSFVKENCS 337
+ YDL ++ + + NCS
Sbjct: 413 NMTLLGDLVLSNKLVFYDLENQVIGWTEYNCS 444
>gi|326529233|dbj|BAK01010.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 441
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 90/374 (24%), Positives = 151/374 (40%), Gaps = 64/374 (17%)
Query: 5 FIGTPSKGVLLILDTGSALIYA-----------------IFDPRKSSSFQKINCD----- 42
IG P + ++DTGS LI+ ++ +SS+F + C
Sbjct: 89 LIGDPPQRAAALIDTGSNLIWTQCGTTCGLKACAKQDLPYYNLSRSSTFAAVPCADSAKL 148
Query: 43 -HPDCTYFKCVNEQCVYTMKYADQSV-----TKGFAAHETISVIGKGEGKAIFHGALFGC 96
+ + ++ C + Y SV T+ F + +G FGC
Sbjct: 149 CAANGVHLCGLDGSCTFAASYGAGSVFGSLGTEAFTFQSGAAKLG------------FGC 196
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
+ + A +GA +G++GL R +S +SQ G+ +FSYCL L N SS+L
Sbjct: 197 VSLTR-ITKGALNGA-SGLIGLGRGRLSLVSQTGA---TKFSYCLTPYLRN-HGASSHLF 250
Query: 157 FGTDMGYRRPSTQATK--FINHP-----NNFYYLSLKDISIDNERMNFPPDTFDI--TVS 207
G T F+ P + FYYL L IS+ ++ P F++ +
Sbjct: 251 VGASASLSGGGGAVTSIPFVKSPEDYPYSTFYYLPLVGISVGETKLPIPSAAFELRRVAA 310
Query: 208 G--EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN 265
G GG IID+GS +T Y L ++ R + +D + LC +
Sbjct: 311 GYWSGGVIIDTGSPVTSLAEAAYSALSDEVARQLNRSLVQPPAD--TGLDLCVARQDVDK 368
Query: 266 RFPSMAFYFED-ANLRIDGENVF-IIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYD 323
P + F+F A++ + + + +D L+ ++ + IG+ QQ+D +YD
Sbjct: 369 VVPVLVFHFGGGADMAVSAGSYWGPVDKSTACMLIEEGGYETV---IGNFQQQDVHLLYD 425
Query: 324 LNIDLLSFVKENCS 337
+ LSF +CS
Sbjct: 426 IGKGELSFQTADCS 439
>gi|413951280|gb|AFW83929.1| hypothetical protein ZEAMMB73_279135 [Zea mays]
Length = 451
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 79/363 (21%), Positives = 149/363 (41%), Gaps = 46/363 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYAI------------FDPRKSSSFQKINCDHPDCTY 48
+ R +GTP++ +L+ +D + + FDP +SS+++ + C P C+
Sbjct: 108 VARARLGTPAQALLVAIDPSNDAAWVPCAACAGCARAPSFDPTRSSTYRPVRCGAPQCSQ 167
Query: 49 FKC------VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
+ C + + YA S + + +++ + A + FGC + G
Sbjct: 168 APAPSCPGGLGSSCAFNLSYA-ASTFQALLGQDALALHDDVDAVAAYT---FGCLHVVTG 223
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
+ G++G R +SF SQ + FSYCL P S L+ G
Sbjct: 224 GSVPPQ-----GLVGFGRGPLSFPSQTKDVYGSVFSYCL--PSYKSSNFSGTLRLGPAGQ 276
Query: 163 YRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
+R + T +++P+ + YY+++ I + + P + G I+D+G++
Sbjct: 277 PKR--IKTTPLLSNPHRPSLYYVNMVGIRVGGRPVPVPASALAFDPTSGRGTIVDAGTMF 334
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFED-ANL 279
T + VY + + F S L CY P++ F F+ ++
Sbjct: 335 TRLSAPVYAAVRDVFRSRVRAPVAGPLGG----FDTCY---NVTISVPTVTFSFDGRVSV 387
Query: 280 RIDGENVFIIDYENHFFLLAVA--PHDDLVA---LIGSQQQRDTRFVYDLNIDLLSFVKE 334
+ ENV I LA+A P D + A ++ S QQ++ R ++D+ + F +E
Sbjct: 388 TLPEENVVIRSSSGGIACLAMAAGPPDGVDAALNVLASMQQQNHRVLFDVANGRVGFSRE 447
Query: 335 NCS 337
C+
Sbjct: 448 LCT 450
>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
Length = 426
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 84/364 (23%), Positives = 143/364 (39%), Gaps = 49/364 (13%)
Query: 1 MVRLFIGTPSKGVLLILD-------------TGSALIYAIFDPRKSSSFQKINCDHPDCT 47
+ R +GTP++ +L+ +D G A F P +SS+++ + C P C
Sbjct: 84 IARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGCAASSPSFSPTQSSTYRTVPCGSPQCA 143
Query: 48 YFKC------VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
V C + + YA S + +++++ E + FGC
Sbjct: 144 QVPSPSCPAGVGSSCGFNLTYA-ASTFQAVLGQDSLAL----ENNVVVS-YTFGCLRVVS 197
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT--SSYLKFGT 159
G + G++G R +SF+SQ FSYCL PN + S LK G
Sbjct: 198 GNSVPPQ-----GLIGFGRGPLSFLSQTKDTYGSVFSYCL----PNYRSSNFSGTLKLGP 248
Query: 160 DMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
+R T + H + YY+++ I + ++ + P G IID+G++
Sbjct: 249 IGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTM 308
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDA-N 278
T + VY + + F L CY P++ F F A
Sbjct: 309 FTRLAAPVYAAVRDAFRGRVRTPVAPPLGG----FDTCY---NVTVSVPTVTFMFAGAVA 361
Query: 279 LRIDGENVFIIDYENHFFLLAVA--PHDDLVA---LIGSQQQRDTRFVYDLNIDLLSFVK 333
+ + ENV I LA+A P D + A ++ S QQ++ R ++D+ + F +
Sbjct: 362 VTLPEENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSR 421
Query: 334 ENCS 337
E C+
Sbjct: 422 ELCT 425
>gi|357127505|ref|XP_003565420.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 466
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 85/337 (25%), Positives = 139/337 (41%), Gaps = 38/337 (11%)
Query: 22 ALIYAIFDPRKSSSFQKINCDHPDC----TYFKCVNEQ--CVYTMKYADQSVTKGFAAHE 75
A++Y F+P SSS+ ++ CD P C T C + C + Y D + G A +
Sbjct: 139 AVVY--FNPFDSSSYSRVGCDGPSCLALATNASCNGDSHACDFRYSYRDGASATGLLAAD 196
Query: 76 TISVIGKGEGKAIFHGAL-FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIK 134
T + G ++ FGC+ G R+ G++GL +S SQLG
Sbjct: 197 TFTFGGNINNDTTSTASIDFGCATGTAG-----REFQADGMVGLGAGPLSLASQLG---- 247
Query: 135 KRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDN 192
++FS+CL + + SS L FG P T I +N YY ISID+
Sbjct: 248 RKFSFCLTA--YDIDDASSILNFGARAVVSDPGAATTPLIASSSNAAAYYA----ISIDS 301
Query: 193 ERMNFPPDTFDITVSGEGGCIIDSGSVLTYF-HSDVYWKLHEKFVSYFERFQLAQLSDCP 251
++ P +VS I+D+G+VLT+ + + L E + L +
Sbjct: 302 LKVAGQPVPGTTSVS---KVIVDTGTVLTFLDRAALLAPLTESLARVMDGAGLPRAPPPD 358
Query: 252 EPIQLCY---FLPETFNRFPSMAFYF---EDANLRIDGENVFIIDYENHFFLLAVAPHDD 305
E ++LCY + + P + +R+ GE F++ E L V +
Sbjct: 359 ETLELCYDVSRVKDVDGVIPDVTLVLGGGGGGEVRLTGEGTFVLVKEGVLCLAVVTTSPE 418
Query: 306 L--VALIGSQQQRDTRFVYDLNIDLLSFVKENCSDDS 340
L ++++G+ +D DL+ +F NC S
Sbjct: 419 LQPLSVLGNVALQDLHVGIDLDARTATFATANCDSSS 455
>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
Length = 530
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 91/376 (24%), Positives = 152/376 (40%), Gaps = 56/376 (14%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI-------------------FDPRKSSSFQKINCD 42
R+ +G+P K + +DTGS +++ F+P SS+ KI C
Sbjct: 119 TRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIPCS 178
Query: 43 HPDCTYFKCVNEQ---------CVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFH 90
CT +E C YT Y D S T G+ +T+ +V+G +
Sbjct: 179 DDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTANSSA 238
Query: 91 GALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNG 148
+FGCSN G D D A+ G+ G + +S +SQL S + K FS+CL
Sbjct: 239 SIVFGCSNSQSG-DLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL----KGS 293
Query: 149 EYTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVS 207
+ L G + P T + + P+ Y L+L+ I ++ +++ P D+ T S
Sbjct: 294 DNGGGILVLGEIV---EPGLVYTPLVPSQPH--YNLNLESIVVNGQKL--PIDSSLFTTS 346
Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-R 266
G I+DSG+ L Y Y + FV+ + C+ + +
Sbjct: 347 NTQGTIVDSGTTLAYLADGAY----DPFVNAITAAVSPSVRSLVSKGNQCFVTSSSVDSS 402
Query: 267 FPSMAFYFEDA-NLRIDGENVFI----IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFV 321
FP+++ YF + + EN + ID + + + ++G +D FV
Sbjct: 403 FPTVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFV 462
Query: 322 YDLNIDLLSFVKENCS 337
YDL + + +CS
Sbjct: 463 YDLANMRMGWTDYDCS 478
>gi|16209647|gb|AAL14384.1| AT3g52500/F22O6_120 [Arabidopsis thaliana]
Length = 469
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 95/389 (24%), Positives = 146/389 (37%), Gaps = 65/389 (16%)
Query: 2 VRLFIGTPSKGVLLILDTGSALI----------------------YAIFDPRKSSSFQKI 39
V L GTPS+ + + DTGS+L+ F P+ SSS + I
Sbjct: 92 VSLSFGTPSQTIPFVFDTGSSLVCLPCTSRYLCSGCDFSGLDPTLIPRFIPKNSSSSKII 151
Query: 40 NCDHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALF----- 94
C P C + N QC ++ T G + +G G I F
Sbjct: 152 GCQSPKCQFLYGPNVQC-RGCDPNTRNCTVGCPPYILQYGLGSTAGVLITEKLDFPDLTV 210
Query: 95 -----GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGE 149
GCS AG+ G R +S SQ+ KRFS+CLV +
Sbjct: 211 PDFVVGCS--------IISTRQPAGIAGFGRGPVSLPSQMN---LKRFSHCLVSRRFDDT 259
Query: 150 YTSSYLKF----GTDMGYRRPSTQATKFINHPN-------NFYYLSLKDISIDNERMNFP 198
++ L G + G + P T F +PN +YYL+L+ I + + + P
Sbjct: 260 NVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGRKHVKIP 319
Query: 199 PDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY 258
+G+GG I+DSGS T+ V+ + E+F S + + + + C+
Sbjct: 320 YKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKETGLGPCF 379
Query: 259 FLPETFN-RFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDL--------VA 308
+ + P + F F+ A L + N F L V +
Sbjct: 380 NISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDKTVNPSGGTGPAI 439
Query: 309 LIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
++GS QQ++ YDL D F K+ CS
Sbjct: 440 ILGSFQQQNYLVEYDLENDRFGFAKKKCS 468
>gi|56202144|dbj|BAD73477.1| chloroplast nucleoid DNA binding protein-like [Oryza sativa
Japonica Group]
gi|125571574|gb|EAZ13089.1| hypothetical protein OsJ_03009 [Oryza sativa Japonica Group]
Length = 316
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 85/316 (26%), Positives = 126/316 (39%), Gaps = 48/316 (15%)
Query: 56 CVYTMKYADQSVTKGFAA--HETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALA 113
C +Y D S +G TI++ G+ KA G + GC+ +G A DG
Sbjct: 12 CSAARRYKDGSAARGTVGVDSATIALSGRAARKAKLRGVVLGCTTSYNGQSFLASDG--- 68
Query: 114 GVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY--RRPS---- 167
VL L ISF S+ S RFSYCLV L T SYL FG + + RRPS
Sbjct: 69 -VLSLGYSNISFASRAASRFGGRFSYCLVDHLAPRNAT-SYLTFGPNPAFSSRRPSEGTA 126
Query: 168 ------------------TQATKFINHPNN-FYYLSLKDISIDNERMNFPPDTFDITVSG 208
Q ++H FY +++K +S+ E + P +D V
Sbjct: 127 SCKPAPAPTPAPAGAPGARQTPLVLDHRTRPFYAVTVKGVSVAGELLKIPRAVWD--VEQ 184
Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP-EPIQLCYFL-----PE 262
GG I+DSG+ LT Y V+ + +LA L +P CY +
Sbjct: 185 GGGAILDSGTSLTMLAKPAY----RAVVAALSK-RLAGLPRVTMDPFDYCYNWTSPSGSD 239
Query: 263 TFNRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAV--APHDDLVALIGSQQQRDTRF 320
P +A +F + ++ID + + P L ++IG+ Q++ +
Sbjct: 240 VAAPLPMLAVHFAGSARLEPPAKSYVIDAAPGVKCIGLQEGPWPGL-SVIGNILQQEHLW 298
Query: 321 VYDLNIDLLSFVKENC 336
YDL L F + C
Sbjct: 299 EYDLKNRRLRFKRSRC 314
>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family
[Arabidopsis thaliana]
Length = 449
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 84/370 (22%), Positives = 151/370 (40%), Gaps = 56/370 (15%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
++ +G+P K + +DTGS +++ ++FD SS+ +K+ CD
Sbjct: 76 TKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVGCD 135
Query: 43 HPDCTYFKCVNE-----QCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGALF 94
C++ + C Y + YAD+S + G + ++ V G + + +F
Sbjct: 136 DDFCSFISQSDSCQPALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEVVF 195
Query: 95 GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYTS 152
GC +D G + D A+ GV+G + S +SQL + K+ FS+CL G +
Sbjct: 196 GCGSDQSGQLGNG-DSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKGGGIFAV 254
Query: 153 SYLKFGTDMGYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEGG 211
+ P + T + PN +Y + L + +D ++ P ++ GG
Sbjct: 255 GVVD--------SPKVKTTPMV--PNQMHYNVMLMGMDVDGTSLDLPR-----SIVRNGG 299
Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMA 271
I+DSG+ L YF +Y L E ++ Q +L E Q F FP ++
Sbjct: 300 TIVDSGTTLAYFPKVLYDSLIETILAR----QPVKLHIVEETFQCFSFSTNVDEAFPPVS 355
Query: 272 FYFEDANLRIDGENVFIIDYENHFFLLA------VAPHDDLVALIGSQQQRDTRFVYDLN 325
F FED+ + ++ E + V L+G + VYDL+
Sbjct: 356 FEFEDSVKLTVYPHDYLFTLEEELYCFGWQAGGLTTDERSEVILLGDLVLSNKLVVYDLD 415
Query: 326 IDLLSFVKEN 335
+++ + N
Sbjct: 416 NEVIGWADHN 425
>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 445
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 84/364 (23%), Positives = 143/364 (39%), Gaps = 49/364 (13%)
Query: 1 MVRLFIGTPSKGVLLILD-------------TGSALIYAIFDPRKSSSFQKINCDHPDCT 47
+ R +GTP++ +L+ +D G A F P +SS+++ + C P C
Sbjct: 103 IARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGCAASSPSFSPTQSSTYRTVPCGSPQCA 162
Query: 48 YFKC------VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
V C + + YA S + +++++ E + FGC
Sbjct: 163 QVPSPSCPAGVGSSCGFNLTYA-ASTFQAVLGQDSLAL----ENNVVVS-YTFGCLRVVS 216
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT--SSYLKFGT 159
G + G++G R +SF+SQ FSYCL PN + S LK G
Sbjct: 217 GNSVPPQ-----GLIGFGRGPLSFLSQTKDTYGSVFSYCL----PNYRSSNFSGTLKLGP 267
Query: 160 DMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
+R T + H + YY+++ I + ++ + P G IID+G++
Sbjct: 268 IGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTM 327
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDA-N 278
T + VY + + F L CY P++ F F A
Sbjct: 328 FTRLAAPVYAAVRDAFRGRVRTPVAPPLGG----FDTCY---NVTVSVPTVTFMFAGAVA 380
Query: 279 LRIDGENVFIIDYENHFFLLAVA--PHDDLVA---LIGSQQQRDTRFVYDLNIDLLSFVK 333
+ + ENV I LA+A P D + A ++ S QQ++ R ++D+ + F +
Sbjct: 381 VTLPEENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSR 440
Query: 334 ENCS 337
E C+
Sbjct: 441 ELCT 444
>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
Length = 469
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 94/376 (25%), Positives = 143/376 (38%), Gaps = 86/376 (22%)
Query: 6 IGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHPDCTYF 49
+GTP+ LILDTGS+L + +FDP SSS+ + CD +C
Sbjct: 135 LGTPAVPQTLILDTGSSLTWVQCKPCNSSQCYPQRLPLFDPNTSSSYSPVPCDSQECRAL 194
Query: 50 K-------CVNE---QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
C ++ C Y + Y + G + + ++ +G G FH FGC +
Sbjct: 195 AAGIDGDGCTSDGDWGCAYEIHYGSGATPAGEYSTDALT-LGPGAIVKRFH---FGCGHH 250
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKR----FSYCLVIPLPNGEYTSSYL 155
D D GVLGL R+ S Q + +R FS+C LP ++ +L
Sbjct: 251 QQRGKFDMAD----GVLGLGRLPQSLAWQASA---RRGGGVFSHC----LPPTGVSTGFL 299
Query: 156 KFGTDMGYRRPSTQATKFINHP-------NNFYYLSLKDISIDNERMNFPPDTFDITVSG 208
G + F+ P FY L IS+ + ++ PP F
Sbjct: 300 ALGAPH-------DTSAFVFTPLLTMDDQPWFYQLMPTAISVAGQLLDIPPAVF------ 346
Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN--- 265
G I DSG+VL+ Y L F S + LA P+ L FN
Sbjct: 347 REGVITDSGTVLSALQETAYTALRTAFRSAMAEYPLA------PPVG---HLDTCFNFTG 397
Query: 266 ----RFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRF 320
P+++ F A + +D + ++D F+ + D+ LIGS QR
Sbjct: 398 YDNVTVPTVSLTFRGGATVHLDASSGVLMDGCLAFW----SSGDEYTGLIGSVSQRTIEV 453
Query: 321 VYDLNIDLLSFVKENC 336
+YD+ + F C
Sbjct: 454 LYDMPGRKVGFRTGAC 469
>gi|357507805|ref|XP_003624191.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499206|gb|AES80409.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 406
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 87/342 (25%), Positives = 142/342 (41%), Gaps = 43/342 (12%)
Query: 19 TGSALIYAIFDPRKSSSFQKINCDHPDCT------YFKCVNEQ-CVYTMKYADQSVTKGF 71
+G + ++DP S + + C CT C + C Y++ Y D S T G
Sbjct: 40 SGLGMDLTLYDPNGSKTSNAVPCGDGFCTDTYSGPISGCKQDMSCPYSITYGDGSTTSGS 99
Query: 72 AAHETIS---VIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQ 128
+++++ V G K +FGC G D AL G++G + S +SQ
Sbjct: 100 FVNDSLTFDEVSGNLHTKPDNSSVIFGCGAKQSGSLSSNSDEALDGIIGFGQANSSVLSQ 159
Query: 129 LGSI--IKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLK 186
L + +K+ FS+CL + + G M + +T + H Y + LK
Sbjct: 160 LAASGKVKRIFSHCL-----DSHHGGGIFSIGQVMEPKFNTTPLVPRMAH----YNVILK 210
Query: 187 DISIDNERMNFPPDTFDITVSGEG-GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLA 245
D+ +D E + P FD SG G G IIDSG+ L Y +Y +L K + +L
Sbjct: 211 DMDVDGEPILLPLYLFD---SGSGRGTIIDSGTTLAYLPLSIYNQLLPKVLGRQPGLKLM 267
Query: 246 QLSDCPEPIQL-CYFLPETFNR-FPSMAFYFEDANLRID--------GENVFIIDYENHF 295
+ D Q C+ + + FP + F+FE +L + E+++ I ++
Sbjct: 268 IVED-----QFTCFHYSDKLDEGFPVVKFHFEGLSLTVHPHDYLFLYKEDIYCIGWQKSS 322
Query: 296 FLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
DL+ LIG + VYDL ++ + NCS
Sbjct: 323 --TQTKEGRDLI-LIGDLVLSNKLVVYDLENMVIGWTNFNCS 361
>gi|359806832|ref|NP_001241567.1| uncharacterized protein LOC100819698 precursor [Glycine max]
gi|255638149|gb|ACU19388.1| unknown [Glycine max]
Length = 437
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 79/361 (21%), Positives = 144/361 (39%), Gaps = 46/361 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY------------AIFDPRKSSSFQKINCDHPDCTY 48
+V+ IGTP++ +LL +DT + + F P KS++F+K+ C C
Sbjct: 99 IVKAKIGTPAQTLLLAMDTSNDASWVPCTACVGCSTTTPFAPAKSTTFKKVGCGASQCKQ 158
Query: 49 FK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
+ C C + Y SV +T+++ FGC G
Sbjct: 159 VRNPTCDGSACAFNFTYGTSSVAASLV-QDTVTL-----ATDPVPAYAFGCIQKVTGSSV 212
Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
+ R +S ++Q + + FSYCL P S L+ G +R
Sbjct: 213 PPQGLLGL-----GRGPLSLLAQTQKLYQSTFSYCL--PSFKTLNFSGSLRLGPVAQPKR 265
Query: 166 PSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
+ T + +P ++ YY++L I + ++ PP+ + G + DSG+V T
Sbjct: 266 --IKFTPLLKNPRRSSLYYVNLVAIRVGRRIVDIPPEALAFNANTGAGTVFDSGTVFTRL 323
Query: 224 HSDVYWKLHEKF---VSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLR 280
Y + +F ++ ++ + L CY P P++ F F N+
Sbjct: 324 VEPAYNAVRNEFRRRIAVHKKLTVTSLGG----FDTCYTAPIV---APTITFMFSGMNVT 376
Query: 281 IDGENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ +N+ I LA+AP D ++ +I + QQ++ R ++D+ L +E C
Sbjct: 377 LPPDNILIHSTAGSVTCLAMAPAPDNVNSVLNVIANMQQQNHRVLFDVPNSRLGVARELC 436
Query: 337 S 337
+
Sbjct: 437 T 437
>gi|449451076|ref|XP_004143288.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 488
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 97/371 (26%), Positives = 156/371 (42%), Gaps = 54/371 (14%)
Query: 3 RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
++ +G P++ + +DTGS +++ +FD KSSS + + C
Sbjct: 87 KVKLGNPAREFNVQIDTGSDILWVTCSPCDGCPDSSGLGIELNLFDTTKSSSARVLPCTD 146
Query: 44 PDCTYFKCVNEQCV-------YTMKYADQSVTKGFAAHETISV-IGKGEGKAIFHGA--L 93
P C +QC+ Y+ Y D+S T GF +++ I GE A +
Sbjct: 147 PICAAVSTTTDQCLTQTDHCSYSFHYRDRSGTSGFYVTDSMHFDILLGESTIANSSATIV 206
Query: 94 FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEYT 151
FGCS +G D AL G+ G + S ISQL S I K FS+C L GE
Sbjct: 207 FGCSIYQYG-DLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHC----LKGGENG 261
Query: 152 SSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFP-PDTFDITVSGE 209
L G + PS + I + P+ Y L L+ I++ + FP P F I+ +GE
Sbjct: 262 GGILVLGEIL---EPSIVYSPLIPSQPH--YTLKLQSIALSGQL--FPNPTMFPISNAGE 314
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPS 269
IIDSG+ L Y +VY + S + +S + ++ + + FP
Sbjct: 315 --TIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQCFRVSMSVADI---FPV 369
Query: 270 MAFYFED-ANLRIDGENVFIID---YENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLN 325
+ F FE A++ + E D E + + +D + ++G +D VYDL
Sbjct: 370 LRFNFEGIASMVVTPEEYLQFDSIVREPALWCIGFQKAEDGLNILGDLVLKDKIIVYDLA 429
Query: 326 IDLLSFVKENC 336
+ + +C
Sbjct: 430 RQRIGWANYDC 440
>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 488
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 84/377 (22%), Positives = 156/377 (41%), Gaps = 64/377 (16%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA-----IFDPRKS-------------SSFQKINCDH 43
++ +GTPS+ + +DTGS +++ I PRKS S+ + ++C
Sbjct: 87 AKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTPYDVDASSTAKSVSCSD 146
Query: 44 PDCTYFKCVNE-----QCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGALFG 95
C+Y +E C Y + Y D S T G+ + + V G + + +FG
Sbjct: 147 NFCSYVNQRSECHSGSTCQYVIMYGDGSSTNGYLVKDVVHLDLVTGNRQTGSTNGTIIFG 206
Query: 96 CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEYTSS 153
C + G +++ A+ G++G + SFISQL S +K+ F++CL +
Sbjct: 207 CGSKQSGQLGESQ-AAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCL-----DNNNGGG 260
Query: 154 YLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSG-EGGC 212
G + P + T ++ + Y ++L I + N + + FD SG + G
Sbjct: 261 IFAIGEVV---SPKVKTTPMLSKSAH-YSVNLNAIEVGNSVLELSSNAFD---SGDDKGV 313
Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAF 272
IIDSG+ L Y VY L + ++ L + + C+ + +RFP++ F
Sbjct: 314 IIDSGTTLVYLPDAVYNPLLNEILASHPELTLHTVQESFT----CFHYTDKLDRFPTVTF 369
Query: 273 YFEDA------------NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRF 320
F+ + +R E+ + ++N + ++G +
Sbjct: 370 QFDKSVSLAVYPREYLFQVR---EDTWCFGWQNGGLQTKGGAS---LTILGDMALSNKLV 423
Query: 321 VYDLNIDLLSFVKENCS 337
VYD+ ++ + NCS
Sbjct: 424 VYDIENQVIGWTNHNCS 440
>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 511
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 95/398 (23%), Positives = 153/398 (38%), Gaps = 82/398 (20%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY----------------------AIFDPRKSSSFQKI 39
V L GTP + + I DTGS+L++ + F P+ SSS + +
Sbjct: 134 VSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRCSFPYVDPATISKFVPKLSSSVKVV 193
Query: 40 NCDHPDCTYF-----------------KCVNEQCVYTMKYADQSVTKGFAAHETISVIGK 82
C +P C + KC + Y ++Y T G ET+ +
Sbjct: 194 GCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYGLQYGS-GATAGILLSETLDL--- 249
Query: 83 GEGKAIFHGALFGCS-NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL 141
E K + L GCS H AG+ G R S SQ+ KRFS+CL
Sbjct: 250 -ENKRV-PDFLVGCSVMSVH---------QPAGIAGFGRGPESLPSQMR---LKRFSHCL 295
Query: 142 VIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHP------------NNFYYLSLKDIS 189
V + SS L D G ++ FI P +YYLSL+ I
Sbjct: 296 VSRGFDDSPVSSPLVL--DSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLRRIL 353
Query: 190 IDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSD 249
I + + FP +G GG IIDSGS T+ ++ + ++ ++ A+ +
Sbjct: 354 IGGKPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRAKDVE 413
Query: 250 CPEPIQLCYFLP--ETFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDL 306
++ C+ +P E FP + F+ L + EN + + L + + +
Sbjct: 414 AQSGLRPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLTMMTDEAV 473
Query: 307 -------VALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
++G+ QQ++ YDL + F K+ C+
Sbjct: 474 VGGGGGPAIILGAFQQQNVLVEYDLAKQRIGFRKQKCT 511
>gi|225440729|ref|XP_002275391.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147789748|emb|CAN67404.1| hypothetical protein VITISV_025615 [Vitis vinifera]
Length = 450
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 94/386 (24%), Positives = 156/386 (40%), Gaps = 66/386 (17%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA----------------------IFDPRKSSSFQKI 39
+ L GTP + + ++DTGS +++A IFDP+ SSS + +
Sbjct: 80 ISLSFGTPPQKLSFLVDTGSDVVWAPCTTDYTCTNCSFSAADPKKVPIFDPKLSSSSKIL 139
Query: 40 NCDHPDC--TYFKCVN----------EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKA 87
+C +P C TYF V+ + C Y Y+ Q T + + + + K K
Sbjct: 140 DCRNPKCVSTYFPYVHLGCPRCNGNSKHCSYACPYSTQYGTGASSGYFLLENL-KFPRKT 198
Query: 88 IFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPN 147
I L GC+ AR+ + + G R S Q+G K+F+YCL +
Sbjct: 199 I-RNFLLGCTT------SAARELSSDALAGFGRSMFSLPIQMGV---KKFAYCLN----S 244
Query: 148 GEYTSSYLKFGTDMGYRRPSTQA---TKFINHPNN---FYYLSLKDISIDNERMNFPPDT 201
+Y + + YR T+ T F+ P +Y+L +KDI I N+ + P
Sbjct: 245 HDYDDTRNSGKLILDYRDGKTKGLSYTPFLKSPPASAFYYHLGVKDIKIGNKLLRIPSKY 304
Query: 202 FDITVSGEGGCIIDSG-SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-F 259
G G IIDSG Y V+ + + +++ + ++ + CY F
Sbjct: 305 LAPGSDGRSGVIIDSGYGGAGYMTGPVFKIVTNELKKQMSKYRRSLEAETQTGLTPCYNF 364
Query: 260 LPETFNRFPSMAFYFE-DANLRIDGENVFIIDYENHF--FLL------AVAPHDDLVALI 310
+ P + + F AN+ + G+N F I + FL+ A+ D ++
Sbjct: 365 TGHKSIKIPPLIYQFRGGANMVVPGKNYFGISPQESLACFLMDTNGTNALEITPDPSIIL 424
Query: 311 GSQQQRDTRFVYDLNIDLLSFVKENC 336
G+ Q D YDL D F ++ C
Sbjct: 425 GNSQHVDYYVEYDLKNDRFGFRRQTC 450
>gi|413925432|gb|AFW65364.1| hypothetical protein ZEAMMB73_378208 [Zea mays]
Length = 418
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 96/364 (26%), Positives = 144/364 (39%), Gaps = 68/364 (18%)
Query: 6 IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYF-- 49
+GTP + + + DTGS LI+A + P KSSSF K+ C C
Sbjct: 87 MGTPPQTLSALADTGSDLIWAKCGACKRCAPRGSASYYPTKSSSFSKLPCSSALCRTLES 146
Query: 50 ---------KCVNEQCVYTMKYADQS----VTKGFAAHETISVIGKGEGKAIFHGALFGC 96
+ C Y Y S T+G+ ET ++ G G FGC
Sbjct: 147 QSLATCGGTRARGAVCSYRYSYGLSSNPHHYTQGYMGSETFTL-----GSDAVQGIGFGC 201
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
+ + G R +S + QL FSYCL + TSS L
Sbjct: 202 TTMSEGGYGSGSGLVGL-----GRGKLSLVRQL---KVGAFSYCLT----SDPSTSSPLL 249
Query: 157 FGTDMGYRRPSTQATKFIN-HPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
FG P Q+T +N + FY ++L ISI + P T G G I D
Sbjct: 250 FGAG-ALTGPGVQSTPLVNLKTSTFYTVNLDSISIGAAKT---PGT------GRHGIIFD 299
Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPETFNRFPSMAFY 273
SG+ LT+ Y +S Q L+ P + ++C F FPSM +
Sbjct: 300 SGTTLTFLAEPAYTLAEAGLLS-----QTTNLTRVPGTDGYEVC-FQTSGGAVFPSMVLH 353
Query: 274 FEDANLRIDGENVF-IIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
F+ ++ + EN F ++ +L+ +P + ++++G+ Q D YDL+ +LSF
Sbjct: 354 FDGGDMALKTENYFGAVNDSVSCWLVQKSPSE--MSIVGNIMQMDYHIRYDLDKSVLSFQ 411
Query: 333 KENC 336
NC
Sbjct: 412 PTNC 415
>gi|224128838|ref|XP_002320434.1| predicted protein [Populus trichocarpa]
gi|222861207|gb|EEE98749.1| predicted protein [Populus trichocarpa]
Length = 485
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 91/381 (23%), Positives = 158/381 (41%), Gaps = 70/381 (18%)
Query: 3 RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
++ IGTP+K + +DTGS +++ +++ +S + + + CD
Sbjct: 81 KIGIGTPTKDYYVQVDTGSDIMWVNCIQCRECPKTSSLGIDLTLYNINESDTGKLVPCDQ 140
Query: 44 -----------PDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIF 89
P CT N C Y Y D S T G+ + + V G + A
Sbjct: 141 EFCYEINGGQLPGCT----ANMSCPYLEIYGDGSSTAGYFVKDVVQYARVSGDLKTTAAN 196
Query: 90 HGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPN 147
+FGC G + + AL G+LG + S ISQL +KK F++CL +
Sbjct: 197 GSVIFGCGARQSGDLGSSNEEALDGILGFGKSNSSMISQLAVTGKVKKIFAHCL-----D 251
Query: 148 GEYTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITV 206
G G + +P T I N P+ Y +++ + + +E ++ P D F+
Sbjct: 252 GTNGGGIFVIGHVV---QPKVNMTPLIPNQPH--YNVNMTAVQVGHEFLSLPTDVFE--A 304
Query: 207 SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF-N 265
G IIDSG+ L Y VY L K +S ++ + D C+ ++ +
Sbjct: 305 GDRKGAIIDSGTTLAYLPEMVYKPLVSKIISQQPDLKVHTVRD----EYTCFQYSDSLDD 360
Query: 266 RFPSMAFYFEDAN-LRI-------DGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQR 316
FP++ F+FE++ L++ E ++ I ++N V D + L+G
Sbjct: 361 GFPNVTFHFENSVILKVYPHEYLFPFEGLWCIGWQNS----GVQSRDRRNMTLLGDLVLS 416
Query: 317 DTRFVYDLNIDLLSFVKENCS 337
+ +YDL + + + NCS
Sbjct: 417 NKLVLYDLENQAIGWTEYNCS 437
>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 499
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 87/374 (23%), Positives = 155/374 (41%), Gaps = 55/374 (14%)
Query: 3 RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
++ +G+P+K + +DTGS +++ FD SS+ ++C
Sbjct: 86 KVKLGSPAKDFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAALVSCAD 145
Query: 44 PDCTYF------KCVNE--QCVYTMKYADQSVTKGFAAHETI----SVIGKGEGKAIFHG 91
P C+Y C ++ QC YT +Y D S T G+ +T+ ++G+
Sbjct: 146 PICSYAVQTATSGCSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSMVANSSST 205
Query: 92 ALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGE 149
+FGCS G D D A+ G+ G +S ISQL S + K FS+C L GE
Sbjct: 206 IVFGCSTYQSG-DLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHC----LKGGE 260
Query: 150 YTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSG 208
L G + PS + + + P+ Y L+L+ I+++ + + P D+ +
Sbjct: 261 NGGGVLVLGEIL---EPSIVYSPLVPSLPH--YNLNLQSIAVNGQLL--PIDSNVFATTN 313
Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF-NRF 267
G I+DSG+ L Y + Y + + +F +S + CY + + + F
Sbjct: 314 NQGTIVDSGTTLAYLVQEAYNPFVDAITAAVSQFSKPIISKGNQ----CYLVSNSVGDIF 369
Query: 268 PSMAFYFEDANLRIDGENVFIIDY----ENHFFLLAVAPHDDLVALIGSQQQRDTRFVYD 323
P ++ F + +++ Y + + + ++G +D FVYD
Sbjct: 370 PQVSLNFMGGASMVLNPEHYLMHYGFLDSAAMWCIGFQKVERGFTILGDLVLKDKIFVYD 429
Query: 324 LNIDLLSFVKENCS 337
L + + NCS
Sbjct: 430 LANQRIGWADYNCS 443
>gi|115457778|ref|NP_001052489.1| Os04g0337000 [Oryza sativa Japonica Group]
gi|113564060|dbj|BAF14403.1| Os04g0337000, partial [Oryza sativa Japonica Group]
Length = 321
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 79/303 (26%), Positives = 130/303 (42%), Gaps = 58/303 (19%)
Query: 6 IGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCDH--- 43
IGTP+K + +DTGS +++ ++DP+ SS+ K++CD
Sbjct: 39 IGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKVSCDQGFC 98
Query: 44 --------PDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGA 92
P CT + C Y++ Y D S T G+ + + V G G+ +
Sbjct: 99 AATYGGLLPGCT----TSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRPANSTV 154
Query: 93 LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEY 150
FGC + G D + + AL G++G + S +SQL + +KK F++CL G +
Sbjct: 155 TFGCGSQQGG-DLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDTINGGGIF 213
Query: 151 TSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
+ +P + T + N P+ Y ++LK I + + P FD +GE
Sbjct: 214 AIGNVV--------QPKVKTTPLVPNMPH--YNVNLKSIDVGGTALKLPSHMFD---TGE 260
Query: 210 -GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFP 268
G IIDSG+ LTY VY E ++ F + + + E + Y T P
Sbjct: 261 KKGTIIDSGTTLTYLPEIVY---KEIMLAVFAKHKDITFHNVQEFLCFQYVGRYTLQHTP 317
Query: 269 SMA 271
S++
Sbjct: 318 SVS 320
>gi|218197468|gb|EEC79895.1| hypothetical protein OsI_21423 [Oryza sativa Indica Group]
Length = 471
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 77/325 (23%), Positives = 124/325 (38%), Gaps = 42/325 (12%)
Query: 26 AIFDPRKSSSFQKINCDHPDCTYFK-----CVNEQCVYTMKYADQSVTKGFAAHETISVI 80
A+FDPR+S + + C C C N QC Y + Y D T G + +++
Sbjct: 175 ALFDPRRSRTSAAVPCGSAACGELGRYGAGCSNNQCQYFVDYGDGRATSGTYMVDALTL- 233
Query: 81 GKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYC 140
+ FGCS+ G + G ++ LG R S +SQ + FSYC
Sbjct: 234 ---NPSTVVMNFRFGCSHAVRGNFSASTSGTMS--LGGGRQ--SLLSQTAATFGNAFSYC 286
Query: 141 LVIPLPNG-------EYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNE 193
+ P +G +F R PS T Y + L+ I +
Sbjct: 287 VPDPSSSGFLSLGGPADGGGAGRFARTPLVRNPSIIPT--------LYLVRLRGIEVGGR 338
Query: 194 RMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP 253
R+N PP F GG ++DS ++T Y L F S + +++
Sbjct: 339 RLNVPPVVF------AGGAVMDSSVIITQLPPTAYRALRLAFRSAMAAYP--RVAGGRAG 390
Query: 254 IQLCY-FLPETFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIG 311
+ CY F+ T P+++ F+ A +R+D V + P D + IG
Sbjct: 391 LDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV----EGCLAFVPTPGDFALGFIG 446
Query: 312 SQQQRDTRFVYDLNIDLLSFVKENC 336
+ QQ+ +YD+ + F + C
Sbjct: 447 NVQQQTHEVLYDVGGGSVGFRRGAC 471
>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
Length = 452
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 78/362 (21%), Positives = 147/362 (40%), Gaps = 47/362 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
+VR +GTP + +LL +DT + + FDP S+S++ + C P C
Sbjct: 111 VVRARLGTPPQQLLLAVDTSNDAAWIPCAGCAGCPTSSAPPFDPAASTSYRSVPCGSPLC 170
Query: 47 TYFKCV-----NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
+ C +++ YAD S+ + ++++V G FGC
Sbjct: 171 AQAPNAACPPGGKACGFSLTYADSSLQAALS-QDSLAVAGDA-----VKTYTFGCLQKAT 224
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
G + R +SF+SQ + + FSYCL P S L+ G +
Sbjct: 225 GTAAPPQGLLGL-----GRGPLSFLSQTRDMYQGTFSYCL--PSFKSLNFSGTLRLGRN- 276
Query: 162 GYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
+ P + T + +P ++ YY+++ I + + + PP + G ++DSG++
Sbjct: 277 -GQPPRIKTTPLLANPHRSSLYYVNMTGIRVGRKVVPIPPPALAFDPATGAGTVLDSGTM 335
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANL 279
T + Y + ++ R ++ C+ T +P + F+ +
Sbjct: 336 FTRLVAPAYVAVRDEV-----RRRVGAPVSSLGGFDTCFN--TTAVAWPPVTLLFDGMQV 388
Query: 280 RIDGENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
+ ENV I LA+A D ++ +I S QQ++ R ++D+ + F +E
Sbjct: 389 TLPEENVVIHSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARER 448
Query: 336 CS 337
C+
Sbjct: 449 CT 450
>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 478
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 89/381 (23%), Positives = 159/381 (41%), Gaps = 70/381 (18%)
Query: 3 RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
R+ IG+P + +DTGS +++ +++P+ SS+ I CD
Sbjct: 76 RIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSSTSTLITCDQ 135
Query: 44 PDC--TYFKCV-----NEQCVYTMKYADQSVTKGFAAHETISV---IGKGEGKAIFHGAL 93
P C TY + + C Y + Y D S T G+ ++ I + +G + +
Sbjct: 136 PFCSATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHKTSETNGSIV 195
Query: 94 FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYT 151
FGC G + + AL G+LG + S ISQL + +KK F++CL G +
Sbjct: 196 FGCGAKQSG-ELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCLDSISGGGIFA 254
Query: 152 SSYLKFGTDMGYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEG 210
+ P + T + PN +Y + L + + + ++ P F+ S +
Sbjct: 255 IGEV--------VEPKLKTTPVV--PNQAHYNVVLNGVKVGDTALDLPLGLFE--TSYKR 302
Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL--PETFNRFP 268
G IIDSG+ L Y +Y L EK + +L + D Q F+ + FP
Sbjct: 303 GAIIDSGTTLAYLPDSIYLPLMEKILGAQPDLKLRTVDD-----QFTCFVFDKNVDDGFP 357
Query: 269 SMAFYFEDA------------NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQR 316
++ F FE++ +R ++V+ + ++N A + + V L+G +
Sbjct: 358 TVTFKFEESLILTIYPHEYLFQIR---DDVWCVGWQNSG---AQSKDGNEVTLLGDLVLQ 411
Query: 317 DTRFVYDLNIDLLSFVKENCS 337
+ Y+L + + + NCS
Sbjct: 412 NKLVYYNLENQTIGWTEYNCS 432
>gi|225464832|ref|XP_002272243.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 467
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 92/401 (22%), Positives = 152/401 (37%), Gaps = 92/401 (22%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY---------------------AIFDPRKSSSFQKIN 40
+ L GTP + + LI+DTGS L++ IF P+ SSS + +
Sbjct: 92 IPLSFGTPPQTLPLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSSNIFIPKSSSSSKVLG 151
Query: 41 CDHPDCTYF-------KC---------VNEQCVYTMKYADQSVTKGFAAHETISVIGKGE 84
C +P C + +C + C + + +T G ET+ + GKG
Sbjct: 152 CVNPKCGWIHGSKVQSRCRDCEPTSPNCTQICPPYLVFYGSGITGGIMLSETLDLPGKGV 211
Query: 85 GKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIP 144
I ++ S AG+ G R S SQLG K+FSYCL+
Sbjct: 212 PNFIVGCSVLSTSQP-------------AGISGFGRGPPSLPSQLG---LKKFSYCLLSR 255
Query: 145 L--PNGEYTSSYLKFGTDMGYRRPSTQATKFINHP--------NNFYYLSLKDISIDNER 194
E +S L +D G + T F+ +P + +YYL L+ I++ +
Sbjct: 256 RYDDTTESSSLVLDGESDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITVGGKH 315
Query: 195 MNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPI 254
+ P G+GG IIDSG+ TY +++ + +F + + + +
Sbjct: 316 VKIPYKYLIPGADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQSKRATE-------V 368
Query: 255 QLCYFLPETFNRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVAL----- 309
+ L FN F + L+ G + N+ L DD+V L
Sbjct: 369 EGITGLRPCFNISGLNTPSFPELTLKFRGGAEMELPLANYVAFLG---GDDVVCLTIVTD 425
Query: 310 --------------IGSQQQRDTRFVYDLNIDLLSFVKENC 336
+G+ QQ++ YDL + L F +++C
Sbjct: 426 GAAGKEFSGGPAIILGNFQQQNFYVEYDLRNERLGFRQQSC 466
>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 500
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 86/374 (22%), Positives = 155/374 (41%), Gaps = 55/374 (14%)
Query: 3 RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
++ +G+P+K + +DTGS +++ FD SS+ ++C
Sbjct: 86 KVKLGSPAKEFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAALVSCGD 145
Query: 44 PDCTYF------KCVNE--QCVYTMKYADQSVTKGFAAHETI----SVIGKGEGKAIFHG 91
P C+Y +C ++ QC YT +Y D S T G+ +T+ ++G+
Sbjct: 146 PICSYAVQTATSECSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSVVANSSST 205
Query: 92 ALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGE 149
+FGCS G D D A+ G+ G +S ISQL S + K FS+C L GE
Sbjct: 206 IIFGCSTYQSG-DLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHC----LKGGE 260
Query: 150 YTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSG 208
L G + PS + + + P+ Y L+L+ I+++ + + P D+ +
Sbjct: 261 NGGGVLVLGEIL---EPSIVYSPLVPSQPH--YNLNLQSIAVNGQLL--PIDSNVFATTN 313
Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-F 267
G I+DSG+ L Y + Y + + +F +S + CY + + F
Sbjct: 314 NQGTIVDSGTTLAYLVQEAYNPFVKAITAAVSQFSKPIISKGNQ----CYLVSNSVGDIF 369
Query: 268 PSMAFYFEDANLRIDGENVFIIDY----ENHFFLLAVAPHDDLVALIGSQQQRDTRFVYD 323
P ++ F + +++ Y + + + ++G +D FVYD
Sbjct: 370 PQVSLNFMGGASMVLNPEHYLMHYGFLDGAAMWCIGFQKVEQGFTILGDLVLKDKIFVYD 429
Query: 324 LNIDLLSFVKENCS 337
L + + +CS
Sbjct: 430 LANQRIGWADYDCS 443
>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
Length = 494
Score = 80.9 bits (198), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 85/338 (25%), Positives = 139/338 (41%), Gaps = 68/338 (20%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
R+ IGTP+K + +DTGS +++ ++DPR S S + + CD
Sbjct: 92 TRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELVTCD 151
Query: 43 H-----------PDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAI 88
P CT C Y++ Y D S T GF + + V G G+
Sbjct: 152 QQFCVANYGGVLPSCTS----TSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTPA 207
Query: 89 FHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLP 146
FGC G D + + AL G+LG + S +SQL + ++K F++CL
Sbjct: 208 NASVSFGCGA-KLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTVNG 266
Query: 147 NGEYTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDIT 205
G + + +P + T + + P+ Y + LK I + + P + FD
Sbjct: 267 GGIFAIGNV--------VQPKVKTTPLVPDMPH--YNVILKGIDVGGTALGLPTNIFDSG 316
Query: 206 VSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETF 264
S G IIDSG+ L Y VY L + + L D C+ +
Sbjct: 317 NS--KGTIIDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTLQDFS-----CFQYSGSVD 369
Query: 265 NRFPSMAFYFE-DANLRI--------DGENVFIIDYEN 293
+ FP + F+FE D +L + +G+N++ + ++N
Sbjct: 370 DGFPEVTFHFEGDVSLIVSPHDYLFQNGKNLYCMGFQN 407
>gi|297605079|ref|NP_001056639.2| Os06g0121800 [Oryza sativa Japonica Group]
gi|255676668|dbj|BAF18553.2| Os06g0121800 [Oryza sativa Japonica Group]
Length = 487
Score = 80.5 bits (197), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 77/325 (23%), Positives = 124/325 (38%), Gaps = 42/325 (12%)
Query: 26 AIFDPRKSSSFQKINCDHPDCTYFK-----CVNEQCVYTMKYADQSVTKGFAAHETISVI 80
A+FDPR+S + + C C C N QC Y + Y D T G + +++
Sbjct: 191 ALFDPRRSRTSAAVPCGSAACGELGRYGAGCSNNQCQYFVDYGDGRATSGTYMVDALTL- 249
Query: 81 GKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYC 140
+ FGCS+ G + G ++ LG R S +SQ + FSYC
Sbjct: 250 ---NPSTVVMNFRFGCSHAVRGNFSASTSGTMS--LGGGRQ--SLLSQTAATFGNAFSYC 302
Query: 141 LVIPLPNG-------EYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNE 193
+ P +G +F R PS T Y + L+ I +
Sbjct: 303 VPDPSSSGFLSLGGPADGGGAGRFARTPLVRNPSIIPT--------LYLVRLRGIEVGGR 354
Query: 194 RMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP 253
R+N PP F GG ++DS ++T Y L F S + +++
Sbjct: 355 RLNVPPVVF------AGGAVMDSSVIITQLPPTAYRALRLAFRSAMAAYP--RVAGGRAG 406
Query: 254 IQLCY-FLPETFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIG 311
+ CY F+ T P+++ F+ A +R+D V + P D + IG
Sbjct: 407 LDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV----EGCLAFVPTPGDFALGFIG 462
Query: 312 SQQQRDTRFVYDLNIDLLSFVKENC 336
+ QQ+ +YD+ + F + C
Sbjct: 463 NVQQQTHEVLYDVGGGSVGFRRGAC 487
>gi|125553832|gb|EAY99437.1| hypothetical protein OsI_21406 [Oryza sativa Indica Group]
Length = 409
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 82/336 (24%), Positives = 136/336 (40%), Gaps = 63/336 (18%)
Query: 27 IFDPRKSSSFQKINCDHPDCTYFK-----CV-NEQCVYTMKYADQSVTKGFAAHETISVI 80
+FDP S+++ + C C C+ N QC + + YA+ + G + + ++ +
Sbjct: 111 LFDPATSTTYAAVPCSSAACARLGPYRRGCLANSQCQFGITYANGATATGTYSSDDLT-L 169
Query: 81 GKGEGKAIFHGALFGCSNDNHG--FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFS 138
G + + G LFGC++ + G F D +AG L L + SF+ Q S + FS
Sbjct: 170 GPYD---VVRGFLFGCAHADQGSTFSYD-----VAGTLALGGGSQSFVQQTASQYSRVFS 221
Query: 139 YCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQAT---KFINHP--------NNFYYLSLKD 187
YC +P + ++ FG P +A F++ P FY + L+
Sbjct: 222 YC----VPPSTSSFGFIMFGV------PPQRAALVPTFVSTPLLSSSTMSPTFYRVLLRS 271
Query: 188 ISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQL 247
I + + PP F + +IDS +V++ Y L F S ++ A
Sbjct: 272 IIVAGRPLPVPPTVFSAS------SVIDSATVISRIPPTAYQALRAAFRSAMTMYRPA-- 323
Query: 248 SDCPEPIQL---CY-FLPETFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAP 302
P+ + CY F PS+A F+ A + +D + + LA AP
Sbjct: 324 ----PPVSILDTCYDFSGVRSITLPSIALVFDGGATVNLDAAGILLQG------CLAFAP 373
Query: 303 --HDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
D + IG+ QQR VYD+ + F C
Sbjct: 374 TASDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 409
>gi|363808270|ref|NP_001242239.1| uncharacterized protein LOC100801883 [Glycine max]
gi|255641727|gb|ACU21134.1| unknown [Glycine max]
Length = 475
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 83/373 (22%), Positives = 159/373 (42%), Gaps = 53/373 (14%)
Query: 3 RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
+L +G+P + + +DTGS +++ ++DP+ S + ++CD
Sbjct: 73 KLGLGSPPRDYYVQVDTGSDILWVNCVECSRCPRKSDLGIDLTLYDPKGSETSDVVSCDQ 132
Query: 44 PDCTYF------KCVNE-QCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGAL 93
C+ C +E C Y++ Y D S T G+ + ++ + G +
Sbjct: 133 DFCSATFDGPIPGCKSEIPCPYSITYGDGSATTGYYVQDYLTYNRINGNLRTSPQNSSII 192
Query: 94 FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYT 151
FGC G + + AL G++G + S +SQL + +KK FS+CL +
Sbjct: 193 FGCGAVQSGTLGSSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCL-----DNVRG 247
Query: 152 SSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGG 211
G + + +T + H Y + LK I +D + + P D FD +V+G+ G
Sbjct: 248 GGIFAIGEVVEPKVSTTPLVPRMAH----YNVVLKSIEVDTDILQLPSDIFD-SVNGK-G 301
Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-FPSM 270
+IDSG+ L Y VY +L +K ++ +L + E C+ +R FP +
Sbjct: 302 TVIDSGTTLAYLPDIVYDELIQKVLARQPGLKLYLV----EQQFRCFLYTGNVDRGFPVV 357
Query: 271 AFYFEDANLRIDGENVFIIDYENHFFLL------AVAPHDDLVALIGSQQQRDTRFVYDL 324
+F+D+ + ++ +++ + + A + + L+G + +YDL
Sbjct: 358 KLHFKDSLSLTVYPHDYLFQFKDGIWCIGWQRSVAQTKNGKDMTLLGDLVLSNKLVIYDL 417
Query: 325 NIDLLSFVKENCS 337
++ + NCS
Sbjct: 418 ENMVIGWTDYNCS 430
>gi|388502484|gb|AFK39308.1| unknown [Medicago truncatula]
Length = 425
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 79/345 (22%), Positives = 143/345 (41%), Gaps = 43/345 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY-----------AIFDPRKSSSFQKINCDHPDCTYF 49
+VR IGTP + +LL +DT + + +F P KS++F+ ++C P+C
Sbjct: 94 IVRAKIGTPPQTLLLAMDTSNDAAWIPCTACDGCASTLFAPEKSTTFKNVSCAAPECKQV 153
Query: 50 K---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
C + + Y S+ +TI++ FGC + G
Sbjct: 154 PNPGCGVSSRNFNLTYGSSSIAANLV-QDTITL-----ATDPVPSYTFGCVSKTTGTSAP 207
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
+ R +S +SQ ++ + FSYCL P S L+ G +R
Sbjct: 208 PQGLLGL-----GRGPLSLLSQTQNLYQSTFSYCL--PSFKSLNFSGSLRLGPVAQPKR- 259
Query: 167 STQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
+ T + +P ++ YY++L+ I + + ++ PP + G I DSG+V T
Sbjct: 260 -IKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGAGTIFDSGTVFTRLV 318
Query: 225 SDVYWKLHEKFVSYF-ERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDG 283
+ VY + ++F + + L CY +P P++ F F N+ +
Sbjct: 319 APVYVAVRDEFRRRVGPKLTVTSLGG----FDTCYNVPIV---VPTITFIFTGMNVTLPQ 371
Query: 284 ENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDL 324
+N+ I LA+A D ++ +I + QQ++ R +YD+
Sbjct: 372 DNILIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLYDV 416
>gi|356531884|ref|XP_003534506.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 482
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 82/342 (23%), Positives = 144/342 (42%), Gaps = 40/342 (11%)
Query: 19 TGSALIYAIFDPRKSSSFQKINCDHPDCTYF------KCVNE-QCVYTMKYADQSVTKGF 71
+G + ++DP S + + + CD CT C + C Y++ Y D S T G
Sbjct: 113 SGLGMELTLYDPNSSKTSKVVPCDDEFCTSTYDGPISGCKKDMSCPYSITYGDGSTTSGS 172
Query: 72 AAHETIS---VIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQ 128
+ ++ V+G +FGC + G D +L G++G + S +SQ
Sbjct: 173 YIKDDLTFDRVVGDLRTVPDNTSVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQ 232
Query: 129 LGSI--IKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLK 186
L + +K+ FS+CL G + + +P + T + + Y + LK
Sbjct: 233 LAAAGKVKRVFSHCLDTVNGGGIFAIGEV--------VQPKVKTTPLVPRMAH-YNVVLK 283
Query: 187 DISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQ 246
DI + + + P D FD T SG G IIDSG+ L Y +Y +L EK ++ +L
Sbjct: 284 DIEVAGDPIQLPTDIFDST-SGR-GTIIDSGTTLAYLPVSIYDQLLEKTLAQRSGMELYL 341
Query: 247 LSDCPEPIQL-CYFLPETF---NRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLA--- 299
+ D Q C+ + + FP++ F FE+ + ++ ++ + +
Sbjct: 342 VED-----QFTCFHYSDEKSLDDAFPTVKFTFEEGLTLTAYPHDYLFPFKEDMWCIGWQK 396
Query: 300 ----VAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
DL+ L+G + F+YDL+ + + NCS
Sbjct: 397 STAQTKDGKDLI-LLGDLVLTNKLFIYDLDNMSIGWTDYNCS 437
>gi|195638734|gb|ACG38835.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 465
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 82/361 (22%), Positives = 146/361 (40%), Gaps = 47/361 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
+ R+ +GTP+K ++++DTGS+L + +F+P+ SSS+ ++C
Sbjct: 128 VTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYASVSCSAQQ 187
Query: 46 CTYFKCV---------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
C+ + C+Y Y D S + G+ + +T+S G +GC
Sbjct: 188 CSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSF-----GSTSVPNFYYGC 242
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
DN G G AG++GL+R +S + QL + FSYCL P + +
Sbjct: 243 GQDNEGLF-----GQSAGLIGLARNKLSLLYQLAPSMGYSFSYCL--PTSSSSSSGYLSI 295
Query: 157 FGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
+ G + A+ ++ ++ Y++ + I + + P + + IIDS
Sbjct: 296 GSYNPGQYSYTPMASSSLD--DSLYFIKMTGIKVAGK-----PLSVSSSAYSSLPTIIDS 348
Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFED 276
G+V+T + VY L + + A + C+ R P + F
Sbjct: 349 GTVITRLPTGVYSALSKAVAGAMKGTPRASAFSI---LDTCFQGQAARLRVPEVTMAFAG 405
Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
++D ++ LA AP A+IG+ QQ+ VYD+ + F C
Sbjct: 406 GAALKLAARNLLVDVDSATTCLAFAPARS-AAIIGNTQQQTFSVVYDVKNSKIGFAAAGC 464
Query: 337 S 337
S
Sbjct: 465 S 465
>gi|449446233|ref|XP_004140876.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 498
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 89/373 (23%), Positives = 150/373 (40%), Gaps = 52/373 (13%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
++ IGTPSK + +DTGS +++ +D +S++ + ++CD
Sbjct: 89 AKIGIGTPSKDYYVQVDTGSDIVWVNCIQCRECPRTSSLGMELTPYDLEESTTGKLVSCD 148
Query: 43 HPDCTYFK-------CVNEQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGA 92
C N C Y Y D S T G+ + + V G E A
Sbjct: 149 EQFCLEVNGGPLSGCTTNMSCPYLQIYGDGSSTAGYFVKDYVQYNRVSGDLETTAANGSI 208
Query: 93 LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEY 150
FGC G + + AL G+LG + S ISQL S +KK F++CL +G
Sbjct: 209 KFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCL-----DGTN 263
Query: 151 TSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
G + +P T + N P+ Y +++ + + + +N D F+
Sbjct: 264 GGGIFAMGHVV---QPKVNMTPLVPNQPH--YNVNMTGVQVGHIILNISADVFE--AGDR 316
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPS 269
G IIDSG+ L Y +Y L K +S ++ + + Q + + FP
Sbjct: 317 KGTIIDSGTTLAYLPELIYEPLVAKILSQQHNLEVQTIHGEYKCFQ---YSERVDDGFPP 373
Query: 270 MAFYFEDANLRIDGENVFIIDYENHFFL----LAVAPHDDL-VALIGSQQQRDTRFVYDL 324
+ F+FE++ L + ++ YEN + + + D V L G + +YDL
Sbjct: 374 VIFHFENSLLLKVYPHEYLFQYENLWCIGWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDL 433
Query: 325 NIDLLSFVKENCS 337
+ + + NCS
Sbjct: 434 ENQTIGWTEYNCS 446
>gi|32479948|emb|CAE01594.1| OSJNBa0008A08.2 [Oryza sativa Japonica Group]
gi|38347627|emb|CAE05222.2| OSJNBa0011K22.4 [Oryza sativa Japonica Group]
gi|38567678|emb|CAE75961.1| B1159F04.24 [Oryza sativa Japonica Group]
gi|116309512|emb|CAH66578.1| OSIGBa0137O04.4 [Oryza sativa Indica Group]
Length = 431
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 81/367 (22%), Positives = 152/367 (41%), Gaps = 49/367 (13%)
Query: 6 IGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCDHPDC 46
IGTP+ + LDTGS + +DPR S S +++ CD C
Sbjct: 65 IGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEVKCDDTIC 124
Query: 47 TYFKCVNE--QCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGALFGCSNDNH 101
T N +C Y YAD +T G + + + G G+ + FGC
Sbjct: 125 TSRPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVTFGCGLQQS 184
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYTSSYLKFGT 159
G ++ A+ G++G + +SQL + KK FS+CL G + +
Sbjct: 185 GSLNNSAV-AIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGGGIFAIGEV---- 239
Query: 160 DMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
P + T + + ++ ++LK I++ + P + F T + G IDSGS
Sbjct: 240 ----VEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKT--KGTFIDSGST 293
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANL 279
L Y +Y +L + + + + Q +FL ++FP + F+FE+ +L
Sbjct: 294 LVYLPEIIYSELILAVFAKHPDITMGAMYN----FQCFHFLGSVDDKFPKITFHFEN-DL 348
Query: 280 RIDGENV-FIIDYENHFFLL-----AVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
+D ++++YE + + + + D++ ++G + VYD+ + + +
Sbjct: 349 TLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMI-ILGDMVISNKVVVYDMEKQAIGWTE 407
Query: 334 ENCSDDS 340
N +++
Sbjct: 408 HNSVEEA 414
>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
Length = 428
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 91/360 (25%), Positives = 151/360 (41%), Gaps = 53/360 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYAIFD-------PR-----KSSSFQKINCDHPDCTY 48
++ + +GTP+K ++ +DTGS+ + + PR +S++ K++C C
Sbjct: 83 VISVGLGTPAKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQSRSTTCAKVSCGTSMCLL 142
Query: 49 F----KCVNEQ----CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
C + + C + + Y D S + G +T++ G FGC+ D+
Sbjct: 143 GGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF----SDVQKIPGFSFGCNMDS 198
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY-----TSSYL 155
G +E G + G+LG+ +S + Q S FSYCL PL E T+ Y
Sbjct: 199 FGANEF---GNVDGLLGMGAGPMSVLKQ-SSPTFDCFSYCL--PLQKSERGFFSKTTGYF 252
Query: 156 KFGTDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
G R + TK + N +++ L IS+D ER+ P F G +
Sbjct: 253 SLGKVA--TRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFS-----RKGVV 305
Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET-FNRFPSMAF 272
DSGS L+Y L ++ + A+ E + CY + P+++
Sbjct: 306 FDSGSELSYIPDRALSVLSQRIRELLLKRGAAE----EESERNCYDMRSVDEGDMPAISL 361
Query: 273 YFED-ANLRIDGENVFIID--YENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
+F+D A + VF+ E + LA AP + V++IGS Q VYDL L+
Sbjct: 362 HFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTES-VSIIGSLMQTSKEVVYDLKRQLI 420
>gi|294463081|gb|ADE77078.1| unknown [Picea sitchensis]
Length = 370
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 96/384 (25%), Positives = 153/384 (39%), Gaps = 82/384 (21%)
Query: 17 LDTGSALIY-------------------AIFDPRKSSSFQKINCDHPDCTYF-------- 49
+DTGS L++ +F PR SSS + C +C
Sbjct: 1 MDTGSDLVWVPCTRNYSCINCPEDSASNGVFLPRMSSSLHLVTCADSNCKTLYGNNTELL 60
Query: 50 --------KCVNEQCV-YTMKYADQSVTKGFAAHETISV-IGKGEG-KAIFHGALFGCSN 98
K +E C Y ++Y S T G ET+++ + GEG +AI H A+ GCS
Sbjct: 61 CQSCAGSLKNCSETCPPYGIQYGRGS-TAGLLLTETLNLPLENGEGARAITHFAV-GCS- 117
Query: 99 DNHGFDEDARDGALAGVLGLSRVTISFISQLGS-IIKKRFSYCLVIPLPNGEYTSSYLKF 157
+G+ G R +S SQLG I K RF+YCL + E S +
Sbjct: 118 -------IVSSQQPSGIAGFGRGALSMPSQLGEHIGKDRFAYCLQSHRFDEENKKSLMVL 170
Query: 158 GTDMGYRRPSTQATKFINH----PNN----FYYLSLKDISIDNERM-NFPPDTFDITVSG 208
G T F+ + P++ +YY+ L+ +SI +R+ P G
Sbjct: 171 GDKALPNNIPLNYTPFLTNSRAPPSSQYGVYYYIGLRGVSIGGKRLKQLPSKLLRFDTKG 230
Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RF 267
GG IIDSG+ T F +++ + F S + ++ D + LCY + N
Sbjct: 231 NGGTIIDSGTTFTVFSDEIFKHIAAGFASQIGYRRAGEVED-KTGMGLCYDVTGLENIVL 289
Query: 268 PSMAFYFEDANLRIDGENVFIIDYENHF--------FLLAVAPHDDLV-------ALIGS 312
P AF+F+ G + ++ N+F L + L+ ++G+
Sbjct: 290 PEFAFHFK-------GGSDMVLPVANYFSYFSSFDSICLTMISSRGLLEVDSGPAVILGN 342
Query: 313 QQQRDTRFVYDLNIDLLSFVKENC 336
QQ+D +YD + L F ++ C
Sbjct: 343 DQQQDFYLLYDREKNRLGFTQQTC 366
>gi|300681439|emb|CBH32531.1| hypothetical protein TAA_ctg0091b.00060.1 [Triticum aestivum]
Length = 426
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 84/322 (26%), Positives = 136/322 (42%), Gaps = 33/322 (10%)
Query: 33 SSSFQKINCDHPDCTYF-----KCVNEQ---CVYTMKYADQSVTKGFAAHETISVIGKGE 84
SS F ++ C C C N C Y +Y T G+ + E ++ +G
Sbjct: 107 SSDFTEVFCFSQTCQLALDEEDACGNSTSFTCPYAYQYGPGISTTGYISAEEVTAVGT-- 164
Query: 85 GKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIP 144
I ALFGCS + DG +GVLG SR S +SQL RFSY ++
Sbjct: 165 --HITGRALFGCSLAS----TVPLDGE-SGVLGFSRGPYSLLSQLK---ISRFSYFMLPD 214
Query: 145 LPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMN-FPPDT 201
+ + S L G D + S+++T + + + YY+ L I +D++ ++ P T
Sbjct: 215 DADKPDSESVLLLGDDAVPQTNSSRSTPLLRNEAYPDLYYVKLTGIKVDDKSLSGIPAGT 274
Query: 202 FDITVSG-EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL 260
FD+ +G GG ++ + S +TY Y L S + + +D ++LCY +
Sbjct: 275 FDLAANGCSGGVVMSTLSPITYLQPAAYNALTRALASKIKSQPVRPKADDVADLRLCYNI 334
Query: 261 PETFN-RFPSMAFYF-----EDANLRIDGENVFIIDYENHFFLLAVAPH---DDLVALIG 311
N FP + F A + + + FI + L + P + +++G
Sbjct: 335 QSVANLTFPKITLVFHGVDGRPAPMELTTAHYFIRENSTGLQCLTMLPTPAGSPVSSVLG 394
Query: 312 SQQQRDTRFVYDLNIDLLSFVK 333
S Q T +YDL L+F K
Sbjct: 395 SLLQTGTHMIYDLRGGSLTFEK 416
>gi|219886223|gb|ACL53486.1| unknown [Zea mays]
gi|238015146|gb|ACR38608.1| unknown [Zea mays]
gi|413938611|gb|AFW73162.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938612|gb|AFW73163.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938613|gb|AFW73164.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938614|gb|AFW73165.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
Length = 467
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 82/361 (22%), Positives = 146/361 (40%), Gaps = 47/361 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
+ R+ +GTP+K ++++DTGS+L + +F+P+ SSS+ ++C
Sbjct: 130 VTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYTSVSCSAQQ 189
Query: 46 CTYFKCV---------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
C+ + C+Y Y D S + G+ + +T+S G +GC
Sbjct: 190 CSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSF-----GSTSVPNFYYGC 244
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
DN G G AG++GL+R +S + QL + FSYCL P + +
Sbjct: 245 GQDNEGLF-----GQSAGLIGLARNKLSLLYQLAPSMGYSFSYCL--PTSSSSSSGYLSI 297
Query: 157 FGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
+ G + A+ ++ ++ Y++ + I + + P + + IIDS
Sbjct: 298 GSYNPGQYSYTPMASSSLD--DSLYFIKMTGIKVAGK-----PLSVSSSAYSSLPTIIDS 350
Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFED 276
G+V+T + VY L + + A + C+ R P + F
Sbjct: 351 GTVITRLPTGVYSALSKAVAGAMKGTPRASAFSI---LDTCFQGQAARLRVPEVTMAFAG 407
Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
++D ++ LA AP A+IG+ QQ+ VYD+ + F C
Sbjct: 408 GAALKLAARNLLVDVDSATTCLAFAPARS-AAIIGNTQQQTFSVVYDVKNSKIGFAAGGC 466
Query: 337 S 337
S
Sbjct: 467 S 467
>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 89/364 (24%), Positives = 144/364 (39%), Gaps = 48/364 (13%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
RL IGTP + LI+D+GS + Y F P SS++ + C+ DCT
Sbjct: 90 TRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKCNV-DCT 148
Query: 48 YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG--FDE 105
N QC Y +YA+ S + G + +S + E K A+FGC N G F +
Sbjct: 149 CDSDKN-QCTYERQYAEMSSSSGVLGEDIVSFGTESELKP--QRAVFGCENSETGDLFSQ 205
Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCL-VIPLPNGEYTSSYLKFGTDMG 162
A G++GL R +S + QL +I FS C + + G + M
Sbjct: 206 HAD-----GIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLGAMPAPPGMI 260
Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
Y + + P +Y + LK++ + + + P FD G+ G ++DSG+ Y
Sbjct: 261 YTH-----SNAVRSP--YYNIELKEMHVAGKALRVDPRIFD----GKHGTVLDSGTTYAY 309
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF-----LPETFNRFPSMAFYFEDA 277
+ + S + + D +C+ + + FP + F +
Sbjct: 310 LPEQAFVAFKDAVSSQVHPLKKIRGPDS-NYKDICFAGAGRNVSQLSEVFPKVDMVFGNG 368
Query: 278 N-LRIDGENVFI--IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
L + EN E + L D L+G R+T YD + + + F K
Sbjct: 369 QKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKT 428
Query: 335 NCSD 338
NCS+
Sbjct: 429 NCSE 432
>gi|115465837|ref|NP_001056518.1| Os05g0596000 [Oryza sativa Japonica Group]
gi|55733881|gb|AAV59388.1| unknown protein [Oryza sativa Japonica Group]
gi|57900669|gb|AAW57794.1| unknown protein [Oryza sativa Japonica Group]
gi|113580069|dbj|BAF18432.1| Os05g0596000 [Oryza sativa Japonica Group]
gi|215697162|dbj|BAG91156.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215768162|dbj|BAH00391.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 535
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 79/336 (23%), Positives = 130/336 (38%), Gaps = 30/336 (8%)
Query: 21 SALIYAIFDPRKSSSFQKINCDHPDCT---YFKC----VNEQCVYTMKYADQSVTKGFAA 73
+ +I + P KSSS+++ C C Y C N C Y D ++T G
Sbjct: 180 TKVIMNWYRPAKSSSWRRFRCSQRACMDLPYNTCESPDQNTSCTYYQVMKDSTITSGIYG 239
Query: 74 HETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSII 133
E +V G + GCS HG ++ DG +L L SF
Sbjct: 240 QEKATVAVSDGTMKKLPGLVIGCSTFEHGGAVNSHDG----ILSLGNSPSSFGIAAARRF 295
Query: 134 KKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNE 193
R S+CL+ +G SSYL FG + + P T T + + + Y + I + +
Sbjct: 296 GGRLSFCLLATT-SGRNASSYLTFGANPAVQAPGTMETPLL-YRDVAYGAHVTGILVGGQ 353
Query: 194 RMNFPPDTFDITVSG----EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSD 249
++ PP+ +D G E G I+D+G+ +TY S VY + S+ A++
Sbjct: 354 PLDIPPEVWDEGPLGNDNPEAGIILDTGTSITYLVSAVYDPVTAALDSHLAHLPKAEIKG 413
Query: 250 CPEPIQLCYFL--------PETFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAV 300
+ CY P PS + DA L D +++ + + L
Sbjct: 414 ----FEYCYNWTFAGDGVDPAHNVTIPSFSIEMAGDARLAADAKSIVVPEVVPGVVCLGF 469
Query: 301 APHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
++IG+ ++ + D +L F K+ C
Sbjct: 470 NRISQGPSIIGNVLMQEHIWEIDHMSTVLRFRKDKC 505
>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 488
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 92/381 (24%), Positives = 155/381 (40%), Gaps = 68/381 (17%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
++ IGTP K L +DTGS +++ ++D ++SSS + + CD
Sbjct: 85 AKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRSSLGMDLTLYDIKESSSGKLVPCD 144
Query: 43 HPDCTYFK-------CVNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGA 92
C N C Y Y D S T G+ + + V G + +
Sbjct: 145 QEFCKEINGGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSGDLKTDSANGSI 204
Query: 93 LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEY 150
+FGC G + + AL G+LG + S ISQL S +KK F++CL NG
Sbjct: 205 VFGCGARQSGDLSSSNEEALDGILGFGKANSSMISQLASSGKVKKMFAHCL-----NGVN 259
Query: 151 TSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
G + +P T + + P+ Y +++ + + + ++ DT S +
Sbjct: 260 GGGIFAIGHVV---QPKVNMTPLLPDQPH--YSVNMTAVQVGHTFLSLSTDT-----SAQ 309
Query: 210 G---GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF-N 265
G G IIDSG+ L Y +Y L K +S ++ L D C+ E+ +
Sbjct: 310 GDRKGTIIDSGTTLAYLPEGIYEPLVYKMISQHPDLKVQTLHD----EYTCFQYSESVDD 365
Query: 266 RFPSMAFYFEDA-NLRI-------DGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQR 316
FP++ F+FE+ +L++ N + I ++N D + L+G
Sbjct: 366 GFPAVTFFFENGLSLKVYPHDYLFPSVNFWCIGWQNS----GTQSRDSKNMTLLGDLVLS 421
Query: 317 DTRFVYDLNIDLLSFVKENCS 337
+ YDL + + + NCS
Sbjct: 422 NKLVFYDLENQAIGWAEYNCS 442
>gi|212275300|ref|NP_001130675.1| uncharacterized protein LOC100191778 precursor [Zea mays]
gi|194706308|gb|ACF87238.1| unknown [Zea mays]
Length = 467
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 82/361 (22%), Positives = 146/361 (40%), Gaps = 47/361 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
+ R+ +GTP+K ++++DTGS+L + +F+P+ SSS+ ++C
Sbjct: 130 VTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYTSVSCSAQQ 189
Query: 46 CTYFKCV---------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
C+ + C+Y Y D S + G+ + +T+S G +GC
Sbjct: 190 CSDLTTATLSPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSF-----GSTSVPNFYYGC 244
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
DN G G AG++GL+R +S + QL + FSYCL P + +
Sbjct: 245 GQDNEGLF-----GQSAGLIGLARNKLSLLYQLAPSMGYSFSYCL--PTSSSSSSGYLSI 297
Query: 157 FGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
+ G + A+ ++ ++ Y++ + I + + P + + IIDS
Sbjct: 298 GSYNPGQYSYTPMASSSLD--DSLYFIKMTGIKVAGK-----PLSVSSSAYSSLPTIIDS 350
Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFED 276
G+V+T + VY L + + A + C+ R P + F
Sbjct: 351 GTVITRLPTGVYSALSKAVAGAMKGTPRASAFSI---LDTCFQGQAARLRVPEVTMAFAG 407
Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
++D ++ LA AP A+IG+ QQ+ VYD+ + F C
Sbjct: 408 GAALKLAARNLLVDVDSATTCLAFAPARS-AAIIGNTQQQTFSVVYDVKNSKIGFAAGGC 466
Query: 337 S 337
S
Sbjct: 467 S 467
>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
2-like [Cucumis sativus]
Length = 478
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 89/381 (23%), Positives = 158/381 (41%), Gaps = 70/381 (18%)
Query: 3 RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
R+ IG+P + +DTGS +++ +++P+ SS+ I CD
Sbjct: 76 RIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSSTSTLITCDQ 135
Query: 44 PDC--TYFKCV-----NEQCVYTMKYADQSVTKGFAAHETISV---IGKGEGKAIFHGAL 93
P C TY + + C Y + Y D S T G+ ++ I + +G + +
Sbjct: 136 PFCSATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHKTSETNGSIV 195
Query: 94 FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYT 151
FGC G + + AL G+LG + S ISQL + +KK F++CL G +
Sbjct: 196 FGCGAKQSG-ELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCLDSISGGGIFA 254
Query: 152 SSYLKFGTDMGYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEG 210
+ P T + PN +Y + L + + + ++ P F+ S +
Sbjct: 255 IGEV--------VEPKLXNTPVV--PNQAHYNVVLNGVKVGDTALDLPLGLFE--TSYKR 302
Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL--PETFNRFP 268
G IIDSG+ L Y +Y L EK + +L + D Q F+ + FP
Sbjct: 303 GAIIDSGTTLAYLPESIYLPLMEKILGAQPDLKLRTVDD-----QFTCFVFDKNVDDGFP 357
Query: 269 SMAFYFEDA------------NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQR 316
++ F FE++ +R ++V+ + ++N A + + V L+G +
Sbjct: 358 TVTFKFEESLILTIYPHEYLFQIR---DDVWCVGWQNSG---AQSKDGNEVTLLGDLVLQ 411
Query: 317 DTRFVYDLNIDLLSFVKENCS 337
+ Y+L + + + NCS
Sbjct: 412 NKLVYYNLENQTIGWTEYNCS 432
>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 494
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 94/378 (24%), Positives = 156/378 (41%), Gaps = 64/378 (16%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI-------------------FDPRKSSSFQKINCD 42
R+ +GTP + + +DTGS +++ FD SS+ + + C
Sbjct: 83 TRVKLGTPPREFNVQIDTGSDVLWVTCSSCSNCPQTSGLGIQLNYFDTTSSSTARLVPCS 142
Query: 43 HPDC------TYFKC--VNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHG 91
HP C T +C + QC Y +Y D S T G+ +T +V+G+
Sbjct: 143 HPICTSQIQTTATQCPPQSNQCSYAFQYGDGSGTSGYYVSDTFYFDAVLGESLIANSSAA 202
Query: 92 ALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGE 149
+FGCS G D D A+ G+ G + +S ISQL S I + FS+CL GE
Sbjct: 203 IVFGCSTYQSG-DLTKTDKAVDGIFGFGQGELSVISQLSSHGITPRVFSHCL-----KGE 256
Query: 150 YT-SSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVS 207
+ L G + P + + + P+ Y L L+ I++ + + P F S
Sbjct: 257 DSGGGILVLGEIL---EPGIVYSPLVPSQPH--YNLDLQSIAVSGQLLPIDPAAF--ATS 309
Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPI----QLCYFLPET 263
G IID+G+ L Y + Y + FVS A +S P CY + +
Sbjct: 310 SNRGTIIDTGTTLAYLVEEAY----DPFVSAIT----AAVSQLATPTINKGNQCYLVSNS 361
Query: 264 FNR-FPSMAFYFE-DANLRIDGEN--VFIIDYEN-HFFLLAVAPHDDLVALIGSQQQRDT 318
+ FP ++F F A + + E +++ +Y + + + ++G +D
Sbjct: 362 VSEVFPPVSFNFAGGATMLLKPEEYLMYLTNYAGAALWCIGFQKIQGGITILGDLVLKDK 421
Query: 319 RFVYDLNIDLLSFVKENC 336
FVYDL + + +C
Sbjct: 422 IFVYDLAHQRIGWANYDC 439
>gi|125558627|gb|EAZ04163.1| hypothetical protein OsI_26305 [Oryza sativa Indica Group]
Length = 404
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 65/212 (30%), Positives = 96/212 (45%), Gaps = 41/212 (19%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
+ L IGTP ++ DTGS+LI+ F P SS+F K+ C C
Sbjct: 92 MNLSIGTPPVTFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSSTFSKLPCASSLCQ 151
Query: 48 -----YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
Y C CVY Y T G+ A ET+ V G A F G FGCS +N
Sbjct: 152 FLTSPYRTCNATGCVYYYPYG-MGFTAGYLATETLHV-----GGASFPGVTFGCSTENGV 205
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
+ +G++GL R +S +SQ+G RFSYCL N + S + FG+
Sbjct: 206 GNSS------SGIVGLGRSPLSLVSQVG---VARFSYCL---RSNADAGDSPILFGSLAK 253
Query: 163 YRRPSTQATKFINHP----NNFYYLSLKDISI 190
+ Q+T + +P +++YY++L I++
Sbjct: 254 VTGGNVQSTPLLENPEMPSSSYYYVNLTGITV 285
>gi|223975883|gb|ACN32129.1| unknown [Zea mays]
gi|223975971|gb|ACN32173.1| unknown [Zea mays]
gi|224034191|gb|ACN36171.1| unknown [Zea mays]
gi|413938623|gb|AFW73174.1| aspartic proteinase nepenthesin-1 isoform 1 [Zea mays]
gi|413938624|gb|AFW73175.1| aspartic proteinase nepenthesin-1 isoform 2 [Zea mays]
Length = 465
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 82/361 (22%), Positives = 146/361 (40%), Gaps = 47/361 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
+ R+ +GTP+K ++++DTGS+L + +F+P+ SSS+ ++C
Sbjct: 128 VTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYASVSCSAQQ 187
Query: 46 CTYFKCV---------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
C+ + C+Y Y D S + G+ + +T+S G +GC
Sbjct: 188 CSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSF-----GSTSVPNFYYGC 242
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
DN G G AG++GL+R +S + QL + FSYCL P + +
Sbjct: 243 GQDNEGLF-----GQSAGLIGLARNKLSLLYQLAPSMGYSFSYCL--PTSSSSSSGYLSI 295
Query: 157 FGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
+ G + A+ ++ ++ Y++ + I + + P + + IIDS
Sbjct: 296 GSYNPGQYSYTPMASSSLD--DSLYFIKMTGIKVAGK-----PLSVSSSAYSSLPTIIDS 348
Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFED 276
G+V+T + VY L + + A + C+ R P + F
Sbjct: 349 GTVITRLPTGVYSALSKAVAGAMKGTPRASAFSI---LDTCFQGQAARLRVPEVTMAFAG 405
Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
++D ++ LA AP A+IG+ QQ+ VYD+ + F C
Sbjct: 406 GAALKLAARNLLVDVDSATTCLAFAPARS-AAIIGNTQQQTFSVVYDVKNSKIGFAAGGC 464
Query: 337 S 337
S
Sbjct: 465 S 465
>gi|56542455|gb|AAV92892.1| Avr9/Cf-9 rapidly elicited protein 36, partial [Nicotiana tabacum]
Length = 191
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 52/168 (30%), Positives = 83/168 (49%), Gaps = 8/168 (4%)
Query: 175 NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEK 234
NH FYY+ +K + + E +N P +T++++ G GG IIDSG+ L+YF Y + +
Sbjct: 27 NHLETFYYVQIKSVIVGGEVLNIPEETWNLSTEGVGGTIIDSGTTLSYFAEPAYEIIKQA 86
Query: 235 FVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFEDANL-RIDGENVFIIDYE 292
FV+ +R+ + L D P ++ CY + PS F D + EN FI
Sbjct: 87 FVNKVKRYPI--LDDFP-ILKPCYNVSGVEKLELPSFGIVFGDGAIWTFPVENYFIKLEP 143
Query: 293 NHFFLLAV--APHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCSD 338
LA+ PH + ++IG+ QQ++ +YD L F C+D
Sbjct: 144 EDIVCLAILGTPHSAM-SIIGNYQQQNFHILYDTKRSRLGFAPRRCAD 190
>gi|255550723|ref|XP_002516410.1| pepsin A, putative [Ricinus communis]
gi|223544445|gb|EEF45965.1| pepsin A, putative [Ricinus communis]
Length = 416
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 102/417 (24%), Positives = 159/417 (38%), Gaps = 104/417 (24%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYAIFDPRKSSSFQKINCD------------------ 42
++ L IGTP + + + +DTGS L + P + SF ++CD
Sbjct: 13 LISLNIGTPPQVIQVYMDTGSDLTWV---PCGNLSFDCMDCDDYRNSKLMSAFSPSHSSS 69
Query: 43 -------HPDCT--------YFKCVNEQCV---------------YTMKYADQSVTKGFA 72
P CT + C C + Y V G
Sbjct: 70 SYRDSCASPYCTDIHSSDNSFDPCTVAGCSLSTLIKATCARPCPSFAYTYGAGGVVTGTL 129
Query: 73 AHETISVIGKGEGKAIFHGAL----FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQ 128
+T+ V EG A + FGC + G+ G R T+SF SQ
Sbjct: 130 TRDTLRV---HEGPARVTKDIPKFCFGCVGSTYH--------EPIGIAGFVRGTLSFPSQ 178
Query: 129 LGSIIKKRFSYC-LVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHP--NNFYYLSL 185
LG ++KK FS+C L N SS L G + + Q T + P N+YY+ L
Sbjct: 179 LG-LLKKGFSHCFLAFKYANNPNISSPLVIGDTALSSKDNMQFTPMLKSPMYPNYYYIGL 237
Query: 186 KDISIDN-ERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQL 244
+ I++ N P + + G GG +IDSG+ T+ Y +L F + +
Sbjct: 238 EAITVGNVSATTVPLNLREFDSQGNGGMLIDSGTTYTHLPEPFYSQLLSIFKAIIT-YPR 296
Query: 245 AQLSDCPEPIQLCYFLPETFNR-------FPSMAFYFEDANLRIDGENV-FIIDYENHFF 296
A + LCY +P NR FPS+ F+F + NV F++ NHF+
Sbjct: 297 ATEVEMRAGFDLCYKVPCPNNRLTDDDNLFPSITFHFLN--------NVSFVLPQGNHFY 348
Query: 297 LLAVAPHDDLV----------------ALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
++ + +V + GS QQ++ + VYDL + + F +C+
Sbjct: 349 AMSAPSNSTVVKCLLFQSMADSDYGPAGVFGSFQQQNVQIVYDLEKERIGFQPMDCA 405
>gi|125553570|gb|EAY99279.1| hypothetical protein OsI_21243 [Oryza sativa Indica Group]
gi|125605796|gb|EAZ44832.1| hypothetical protein OsJ_29469 [Oryza sativa Japonica Group]
Length = 534
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 79/336 (23%), Positives = 130/336 (38%), Gaps = 30/336 (8%)
Query: 21 SALIYAIFDPRKSSSFQKINCDHPDCT---YFKC----VNEQCVYTMKYADQSVTKGFAA 73
+ +I + P KSSS+++ C C Y C N C Y D ++T G
Sbjct: 179 TKVIMNWYRPAKSSSWRRFRCSQRACMDLPYNTCESPDQNTSCTYYQVMKDSTITSGIYG 238
Query: 74 HETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSII 133
E +V G + GCS HG ++ DG +L L SF
Sbjct: 239 QEKATVAVSDGTMKKLPGLVIGCSTFEHGGAVNSHDG----ILSLGNSPSSFGIAAARRF 294
Query: 134 KKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNE 193
R S+CL+ +G SSYL FG + + P T T + + + Y + I + +
Sbjct: 295 GGRLSFCLLATT-SGRNASSYLTFGANPAVQAPGTMETPLL-YRDVAYGAHVTGILVGGQ 352
Query: 194 RMNFPPDTFDITVSG----EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSD 249
++ PP+ +D G E G I+D+G+ +TY S VY + S+ A++
Sbjct: 353 PLDIPPEVWDEGPLGNDNPEAGIILDTGTSITYLVSAVYDPVTAALDSHLAHLPKAEIKG 412
Query: 250 CPEPIQLCYFL--------PETFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAV 300
+ CY P PS + DA L D +++ + + L
Sbjct: 413 ----FEYCYNWTFAGDGVDPAHNVTIPSFSIEMAGDARLAADAKSIVVPEVVPGVVCLGF 468
Query: 301 APHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
++IG+ ++ + D +L F K+ C
Sbjct: 469 NRISQGPSIIGNVLMQEHIWEIDHMSTVLRFRKDKC 504
>gi|449527515|ref|XP_004170756.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Cucumis
sativus]
Length = 364
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 82/360 (22%), Positives = 142/360 (39%), Gaps = 44/360 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY------------AIFDPRKSSSFQKINCDHPDCTY 48
+VR IGTP++ +LL LDT + + +F KSSSF+ + C P C
Sbjct: 27 VVRAKIGTPAQTLLLALDTSNDAAWIPCSGCIGCPSTTVFSSDKSSSFRPLPCQSPQCNQ 86
Query: 49 F---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
C C + + Y +V + +++ FGC G
Sbjct: 87 VPNPSCSGSACGFNLTYGSSTVAADLV-QDNLTLATDS-----VPSYTFGCIRKATGSSV 140
Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
+ R +S + Q S+ + FSYCL P S L+ G R
Sbjct: 141 PPQGLLGL-----GRGPLSLLGQSQSLYQSTFSYCL--PSFKSVNFSGSLRLGPVAQPIR 193
Query: 166 PSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
+ T + +P ++ YY++L I + + ++ PP + G +IDSG+ T
Sbjct: 194 --IKYTPLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNSATGAGTVIDSGTTFTRL 251
Query: 224 HSDVYWKLHEKFVSYFER-FQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRID 282
+ Y + ++F R ++ L CY +P P++ F F N+ +
Sbjct: 252 VAPAYTAVRDEFRRRVGRNVTVSSLGG----FDTCYTVPII---SPTITFMFAGMNVTLP 304
Query: 283 GENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENCSD 338
+N I LA+A D ++ +I S QQ++ R ++D+ + +E+CS
Sbjct: 305 PDNFLIHSTSGSTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDIPNSRVGVARESCSS 364
>gi|326500408|dbj|BAK06293.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 475
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 79/333 (23%), Positives = 125/333 (37%), Gaps = 58/333 (17%)
Query: 27 IFDPRKSSSFQKINCDHPDCTYF---------KCVNEQCVYTMKYADQSVTKGFAAHETI 77
+FDP SS+ + C P C + N +C Y ++Y+D T G +T+
Sbjct: 178 LFDPTTSSTAAAVRCRSPACRSLGPYGNGCSNRSANAECRYLIEYSDDRATAGTYMTDTL 237
Query: 78 SVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRF 137
++ G FGCS+ G D AG + L S ++Q + F
Sbjct: 238 TI----SGTTAVRNFRFGCSHAVRGRFSD----LTAGTMSLGGGAQSLLAQTARSLGNAF 289
Query: 138 SYCLVIPLPNGEYTSSYLKFGTDMGYRRPST--QATKFINHP-------NNFYYLSLKDI 188
SYC +P S +L G P+T T F P + Y + L+ I
Sbjct: 290 SYC----VPQAS-ASGFLSIGG------PATTNSTTVFATTPLVRSAINPSLYLVRLQGI 338
Query: 189 SIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLS 248
+ R+ PP F G ++DS +V+T Y L F + + + +
Sbjct: 339 VVAGRRLGIPPVAF------SAGAVMDSSAVITQLPPTAYRALRRAFRNAMRAYPRSGAT 392
Query: 249 DCPEPIQLCY-FLPETFNRFPSMAFYFEDANLRIDGENVFIIDYENHFF---LLAVAPHD 304
+ CY FL T R P+++ F G V ++D L A
Sbjct: 393 GT---LDTCYDFLGLTNVRVPAVSLVF-------GGGAVVVLDPPAVMIGGCLAFTATSS 442
Query: 305 DL-VALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
DL + IG+ QQ+ +YD+ + F + C
Sbjct: 443 DLALGFIGNVQQQTHEVLYDVAAGGVGFRRGAC 475
>gi|115489316|ref|NP_001067145.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|77556903|gb|ABA99699.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113649652|dbj|BAF30164.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|125537189|gb|EAY83677.1| hypothetical protein OsI_38901 [Oryza sativa Indica Group]
Length = 446
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 92/361 (25%), Positives = 144/361 (39%), Gaps = 39/361 (10%)
Query: 5 FIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHPDCTY 48
IG P + ++DTGS L++ ++ SS+F + C C
Sbjct: 95 LIGDPPQRAEALIDTGSDLVWTQCSTCLRKVCARQALPYYNSSASSTFAPVPCAARICA- 153
Query: 49 FKCVNEQCVYTMKYADQ-SVTKGFAAHETISVIGKGEGKAIFHGAL---FGCSNDNHGFD 104
N+ ++ A SV G+ A +G E A G FGC
Sbjct: 154 ---ANDDIIHFCDLAAGCSVIAGYGAGVVAGTLGT-EAFAFQSGTAELAFGCVTFTR-IV 208
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
+ A GA +G++GL R +S +SQ G+ +FSYCL N T +
Sbjct: 209 QGALHGA-SGLIGLGRGRLSLVSQTGA---TKFSYCLTPYFHNNGATGHLFVGASASLGG 264
Query: 165 RPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSG----EGGCIIDSGS 218
T+F+ P FYYL L +++ R+ P FD+ GG IIDSGS
Sbjct: 265 HGDVMTTQFVKGPKGSPFYYLPLIGLTVGETRLPIPATVFDLREVAPGLFSGGVIIDSGS 324
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFED-A 277
T D Y L + + +A D + LC + P++ F+F A
Sbjct: 325 PFTSLVHDAYDALASELAARLNGSLVAPPPDADDG-ALCVARRDVGRVVPAVVFHFRGGA 383
Query: 278 NLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
++ + E+ + +D +A A ++IG+ QQ++ R +YDL SF +C
Sbjct: 384 DMAVPAESYWAPVDKAAACMAIASAGPYRRQSVIGNYQQQNMRVLYDLANGDFSFQPADC 443
Query: 337 S 337
S
Sbjct: 444 S 444
>gi|225440722|ref|XP_002275223.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
gi|147841923|emb|CAN65212.1| hypothetical protein VITISV_039022 [Vitis vinifera]
Length = 458
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 91/389 (23%), Positives = 153/389 (39%), Gaps = 73/389 (18%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------------------IFDPRKSSSFQKINC 41
+ L GTP + + ++DTGS +++A IF+P SSS + + C
Sbjct: 89 IPLSFGTPPQKLSFLVDTGSHVVWAPCTTHYTCTNCSFSNPKKVPIFNPELSSSDKILGC 148
Query: 42 DHPDCTYF-----------------KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGE 84
P C KC + YT++Y + + GF E + GK
Sbjct: 149 RDPKCANTSSPDVHLGCPRCNGNSKKCSHACPQYTLQYGTGAAS-GFFLLENLDFPGK-- 205
Query: 85 GKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIP 144
H L GC+ D + ALAG R S Q+G K+F+YCL
Sbjct: 206 ---TIHKFLVGCTTSA---DREPSSDALAG---FGRTMFSLPMQMGV---KKFAYCLN-- 251
Query: 145 LPNGEYTSSYLKFGTDMGYRRPSTQA---TKFINHPNNF---YYLSLKDISIDNERMNFP 198
+ +Y + + Y TQ F+ +P ++ YYL +KD+ I N+ + P
Sbjct: 252 --SHDYDDTRNSGKLILDYSDGETQGLSYAPFLKNPPDYPFYYYLGVKDMKIGNKLLRIP 309
Query: 199 PDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY 258
GG +IDSG Y V+ + + +++ + ++ + CY
Sbjct: 310 GKYLTPGSDSRGGVMIDSGFAYGYMTLPVFKIVTNELKKQMSKYRRSLEAETQSGLTPCY 369
Query: 259 -FLPETFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAV---APHDDL------V 307
F + P + + F AN+ + G N F++ E V +P ++L
Sbjct: 370 NFTGHKSIKIPDLIYQFTGGANMVVPGMNYFLLFSEASLGCFPVTTDSPTNNLEFTPGPS 429
Query: 308 ALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
++G+ QQ D +DL + L F ++ C
Sbjct: 430 IILGNYQQVDHYVEFDLKNERLGFRQQTC 458
>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
Length = 434
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 78/304 (25%), Positives = 122/304 (40%), Gaps = 54/304 (17%)
Query: 6 IGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDHPDC 46
+GTP + L +DTGS L++ +D + S+S K+ C P C
Sbjct: 42 LGTPPRTYNLQVDTGSDLLWVNCHPCIGCPAFSDLKIPIVPYDVKASASSSKVPCSDPSC 101
Query: 47 TYFKCVNE-------QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
T ++E QC Y+ +Y D S T G+ + + + I FGC
Sbjct: 102 TLITQISESGCNDQNQCGYSFQYGDGSGTLGYLVEDVLHYMVNATATVI-----FGCGFK 156
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKR--FSYCLVIPLPNGEYTSSYLKF 157
G D + AL G++G +SF SQL K F++C L GE L
Sbjct: 157 QSG-DLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHC----LDGGERGGGILVL 211
Query: 158 GTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
G + P Q T + + ++ Y + L+ IS++N + P F V G I DSG
Sbjct: 212 GNVI---EPDIQYTPLVPYMSH-YNVVLQSISVNNANLTIDPKLFSNDV--MQGTIFDSG 265
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDA 277
+ L Y + Y + F L +L F+ + FP++ YFE A
Sbjct: 266 TTLAYLPDEAYQAFTQAVSLVVAPFLLCD-------TRLSRFI---YKLFPNVVLYFEGA 315
Query: 278 NLRI 281
++ +
Sbjct: 316 SMTL 319
>gi|125548492|gb|EAY94314.1| hypothetical protein OsI_16081 [Oryza sativa Indica Group]
Length = 417
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 80/310 (25%), Positives = 127/310 (40%), Gaps = 66/310 (21%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+V+L IGTP +DT S LI+ +F+PR SS++ + C C
Sbjct: 90 LVKLGIGTPPYKFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTC 149
Query: 47 TYF---KCV---NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
+C +E C YT Y+ + T+G A + + + G+ F G FGCS +
Sbjct: 150 DELDVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLVI-----GEDAFRGVAFGCSTSS 204
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
G A +GV+GL R +S +SQL +RF+YCL P L G D
Sbjct: 205 TG---GAPPPQASGVVGLGRGPLSLVSQLSV---RRFAYCLP---PPASRIPGKLVLGAD 255
Query: 161 MGYRRPSTQ--ATKFINHPN--NFYYLSLKDISIDNERMNF------------------- 197
R +T A P ++YYL+L + I + M+
Sbjct: 256 ADAARNATNRIAVPMRRDPRYPSYYYLNLDGLLIGDRTMSLPPTTTTTATATATATAPAP 315
Query: 198 --PPDTFDITVSGEG--GCIIDSGSVLTYFHSDVYWKLHEKFVSYFE-RFQLAQLSDCPE 252
P+ + V G IID S +T+ + +Y ++ V+ E +L + +
Sbjct: 316 TPSPNATAVAVGDANRYGMIIDIASTITFLEASLY----DELVNDLEVEIRLPRGTGSSL 371
Query: 253 PIQLCYFLPE 262
+ LC+ LP+
Sbjct: 372 GLDLCFILPD 381
>gi|125571060|gb|EAZ12575.1| hypothetical protein OsJ_02481 [Oryza sativa Japonica Group]
Length = 501
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 104/377 (27%), Positives = 154/377 (40%), Gaps = 66/377 (17%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
++ +GTP L++LDTGS +++ +FDPR S S+ ++C P C
Sbjct: 149 TKIGVGTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDPRASHSYGAVDCAAPLCR 208
Query: 48 YFKC-----VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
+ C+Y + Y D SVT G A ET++ G + AL GC +DN G
Sbjct: 209 RLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFA---SGARVPRVAL-GCGHDNEG 264
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV---IPLPNGEYTSSYLKFGT 159
A AG+LGL R ++SF SQ+ + FSYCLV + SS + FG+
Sbjct: 265 LFV-----AAAGLLGLGRGSLSFPSQISRRFGRSFSYCLVDRTSSSASATSRSSTVTFGS 319
Query: 160 DMGYRRPSTQATKFINHPNN-------------FYYLSLKDISIDNERMNFPPDTFDITV 206
G R + + HP+ + + R+ PPD
Sbjct: 320 --GAR---GALGRRVLHPDGEEPQDGDVLLRAAHGHQRRRRARPGRGRVRPPPD----PS 370
Query: 207 SGEGGCIIDSGSVLTYFHSDVYWKLHEKF--VSYFERFQLAQLSDCPEPIQL---CYFLP 261
+G GG I+DSG W + + R A L P L CY L
Sbjct: 371 TGRGGVIVDSG------RPSPAWARAGRTPPCATRSRAAAAGLRLSPGGFSLFDTCYDLS 424
Query: 262 E-TFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTR 319
+ P+++ +F A + EN I F A A D V++IG+ QQ+ R
Sbjct: 425 GLKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFR 484
Query: 320 FVYDLNIDLLSFVKENC 336
V+D + L FV + C
Sbjct: 485 VVFDGDGQRLGFVPKGC 501
>gi|195626958|gb|ACG35309.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 83/360 (23%), Positives = 149/360 (41%), Gaps = 47/360 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
+VR +GTP + +LL +DT + + A FDP S+S++ + C P C
Sbjct: 113 VVRASLGTPPQQLLLAVDTSNDASWIPCAGCAGCPTSSAAPFDPAASASYRTVPCGSPLC 172
Query: 47 TYFKCV-----NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
+ C +++ YAD S+ + ++++V G FGC
Sbjct: 173 AQAPNAACPPGGKACGFSLTYADSSLQAALS-QDSLAVAGNA-----VKAYTFGCLQRAT 226
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
G + +LGL R +SF+SQ + + FSYCL P S L+ G +
Sbjct: 227 GTAAPPQG-----LLGLGRGPLSFLSQTKDMYEATFSYCL--PSFKSLNFSGTLRLGRNG 279
Query: 162 GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
+R T H ++ YY+++ + + + + P FD G ++DSG++ T
Sbjct: 280 QPQRIKTTPLLANPHRSSLYYVNMTGVRVGRKVVPIP--AFDPATG--AGTVLDSGTMFT 335
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRI 281
+ Y + ++ R ++ C+ T +P M F+ + +
Sbjct: 336 RLVAPAYVAVRDEV-----RRRVGAPVSSLGGFDTCFN--TTAVAWPPMTLLFDGMQVTL 388
Query: 282 DGENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
ENV I LA+A D ++ +I S QQ++ R ++D+ + F +E C+
Sbjct: 389 PEENVVIHSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARERCT 448
>gi|218187618|gb|EEC70045.1| hypothetical protein OsI_00635 [Oryza sativa Indica Group]
Length = 570
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 84/358 (23%), Positives = 137/358 (38%), Gaps = 70/358 (19%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY-----------------AIFDPRKSSSFQKINCDH 43
++ + +G+P + +L I DTGS L++ FDP +SS++ +++C
Sbjct: 102 LMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDPSRSSTYGRVSCQT 161
Query: 44 PDCTYFKCVN----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAI----FHGALFG 95
C C Y Y D S T G + ET + G G++ G FG
Sbjct: 162 DACEALGRATCDDGSNCAYLYAYGDGSNTTGVLSTETFTFDDGGAGRSPRQVRIGGVKFG 221
Query: 96 CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYTSS 153
CS G ++GL +S ++QLG + +RFSYCLV P+ SS
Sbjct: 222 CSTATAGSFPADG------LVGLGGGAVSLVTQLGGATSLGRRFSYCLV---PHSVNASS 272
Query: 154 YLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
L FG P +T + + S + I
Sbjct: 273 ALNFGALADVTEPGAASTPLVGNKTVASAASSR-------------------------II 307
Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP----ETFNRFPS 269
+DSG+ LT+ + + ++ R L + +QLCY + E P
Sbjct: 308 VDSGTTLTFLDPSLLGPIVDEL---SRRITLPPVQSPDGLLQLCYNVAGREVEAGESIPD 364
Query: 270 MAFYF-EDANLRIDGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLN 325
+ F A + + EN F+ E L VA + V+++G+ Q++ YDL+
Sbjct: 365 LTLEFGGGAAVALKPENAFVAVQEGTLCLAIVATTEQQPVSILGNLAQQNIHVGYDLD 422
>gi|449449334|ref|XP_004142420.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 441
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 82/360 (22%), Positives = 142/360 (39%), Gaps = 44/360 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY------------AIFDPRKSSSFQKINCDHPDCTY 48
+VR IGTP++ +LL LDT + + +F KSSSF+ + C P C
Sbjct: 104 VVRAKIGTPAQTLLLALDTSNDAAWIPCSGCIGCPSTTVFSSDKSSSFRPLPCQSPQCNQ 163
Query: 49 F---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
C C + + Y +V + +++ FGC G
Sbjct: 164 VPNPSCSGSACGFNLTYGSSTVAADLV-QDNLTLATDS-----VPSYTFGCIRKATGSSV 217
Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
+ G +S + Q S+ + FSYCL P S L+ G R
Sbjct: 218 PPQGLLGLGR-----GPLSLLGQSQSLYQSTFSYCL--PSFKSVNFSGSLRLGPVAQPIR 270
Query: 166 PSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
+ T + +P ++ YY++L I + + ++ PP + G +IDSG+ T
Sbjct: 271 --IKYTPLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNSATGAGTVIDSGTTFTRL 328
Query: 224 HSDVYWKLHEKFVSYFER-FQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRID 282
+ Y + ++F R ++ L CY +P P++ F F N+ +
Sbjct: 329 VAPAYTAVRDEFRRRVGRNVTVSSLGG----FDTCYTVPII---SPTITFMFAGMNVTLP 381
Query: 283 GENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENCSD 338
+N I LA+A D ++ +I S QQ++ R ++D+ + +E+CS
Sbjct: 382 PDNFLIHSTAGSTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDIPNSRVGVARESCSS 441
>gi|215766660|dbj|BAG98888.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 433
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 81/362 (22%), Positives = 149/362 (41%), Gaps = 49/362 (13%)
Query: 6 IGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCDHPDC 46
IGTP+ + LDTGS + +DPR S S +++ CD C
Sbjct: 89 IGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEVKCDDTIC 148
Query: 47 TYFKCVNE--QCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGALFGCSNDNH 101
T N +C Y YAD +T G + + + G G+ + FGC
Sbjct: 149 TSRPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVTFGCGLQQS 208
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYTSSYLKFGT 159
G ++ A+ G++G + +SQL + KK FS+CL G + +
Sbjct: 209 GSLNNSAV-AIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGGGIFAIGEV---- 263
Query: 160 DMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
P + T + + ++ ++LK I++ + P + F T + G IDSGS
Sbjct: 264 ----VEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKT--KGTFIDSGST 317
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANL 279
L Y +Y +L + + + + Q +FL ++FP + F+FE+ +L
Sbjct: 318 LVYLPEIIYSELILAVFAKHPDITMGAMYN----FQCFHFLGSVDDKFPKITFHFEN-DL 372
Query: 280 RIDGENV-FIIDYENHFFLL-----AVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
+D ++++YE + + + + D++ ++G + VYD+ + + +
Sbjct: 373 TLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMI-ILGDMVISNKVVVYDMEKQAIGWTE 431
Query: 334 EN 335
N
Sbjct: 432 HN 433
>gi|218194598|gb|EEC77025.1| hypothetical protein OsI_15381 [Oryza sativa Indica Group]
Length = 422
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 81/362 (22%), Positives = 149/362 (41%), Gaps = 49/362 (13%)
Query: 6 IGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCDHPDC 46
IGTP+ + LDTGS + +DPR S S +++ CD C
Sbjct: 65 IGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEVKCDDTIC 124
Query: 47 TYFKCVNE--QCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGALFGCSNDNH 101
T N +C Y YAD +T G + + + G G+ + FGC
Sbjct: 125 TSRPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVTFGCGLQQS 184
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYTSSYLKFGT 159
G ++ A+ G++G + +SQL + KK FS+CL G + +
Sbjct: 185 GSLNNSAV-AIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGGGIFAIGEV---- 239
Query: 160 DMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
P + T + + ++ ++LK I++ + P + F T + G IDSGS
Sbjct: 240 ----VEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKT--KGTFIDSGST 293
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANL 279
L Y +Y +L + + + + Q +FL ++FP + F+FE+ +L
Sbjct: 294 LVYLPEIIYSELILAVFAKHPDITMGAMYN----FQCFHFLGSVDDKFPKITFHFEN-DL 348
Query: 280 RIDGENV-FIIDYENHFFLL-----AVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
+D ++++YE + + + + D++ ++G + VYD+ + + +
Sbjct: 349 TLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMI-ILGDMVISNKVVVYDMEKQAIGWTE 407
Query: 334 EN 335
N
Sbjct: 408 HN 409
>gi|223942467|gb|ACN25317.1| unknown [Zea mays]
gi|413936886|gb|AFW71437.1| pepsin A [Zea mays]
Length = 491
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 97/382 (25%), Positives = 159/382 (41%), Gaps = 71/382 (18%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
R+ IG+P KG + +DTGS +++ +DP S + + C+
Sbjct: 86 TRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPTRSGLGIELTQYDPAGSGT--TVGCE 143
Query: 43 HPDCTYFKC---------VNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFH 90
C + C + + Y D S T GF + + V G G+
Sbjct: 144 QEFCVANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQYNQVSGNGQTTTSNA 203
Query: 91 GALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNG 148
FGC G D + + AL G+LG + S +SQL + ++K F++CL G
Sbjct: 204 SITFGCGA-QLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLDTVRGGG 262
Query: 149 EYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVS 207
+ + +P + T + PN +Y ++L+ IS+ + P TFD S
Sbjct: 263 IFAIGNVV--------QPKVKTTPLV--PNVTHYNVNLQGISVGGATLQLPTSTFD---S 309
Query: 208 GEG-GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFN 265
G+ G IIDSG+ L Y +VY L + F+++Q L + + + C+ F +
Sbjct: 310 GDSKGTIIDSGTTLAYLPREVYRTL---LAAVFDKYQDLPLHNYQDFV--CFQFSGSIDD 364
Query: 266 RFPSMAFYFEDANLRIDGENVFIIDY--ENHFFLLAVAPHDDLVA--------LIGSQQQ 315
FP + F FE +L + NV+ DY +N L + D V L+G
Sbjct: 365 GFPVITFSFE-GDLTL---NVYPDDYLFQNRNDLYCMGFLDGGVQTKDGKDMLLLGDLVL 420
Query: 316 RDTRFVYDLNIDLLSFVKENCS 337
+ VYDL +++ + NCS
Sbjct: 421 SNKLVVYDLEKEVIGWTDYNCS 442
>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
Length = 388
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 79/305 (25%), Positives = 122/305 (40%), Gaps = 56/305 (18%)
Query: 6 IGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDHPDC 46
+GTP + L +DTGS L++ +D + S+S K+ C P C
Sbjct: 42 LGTPPRTYNLQVDTGSDLLWVNCHPCIGCPAFSDLKIPIVPYDVKASASSSKVPCSDPSC 101
Query: 47 TYFKCVNE-------QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
T ++E QC Y+ +Y D S T G+ + + + I FGC
Sbjct: 102 TLITQISESGCNDQNQCGYSFQYGDGSGTLGYLVEDVLHYMVNATATVI-----FGCGFK 156
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKR--FSYCLVIPLPNGEYTSSYLKF 157
G D + AL G++G +SF SQL K F++C L GE L
Sbjct: 157 QSG-DLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHC----LDGGERGGGILVL 211
Query: 158 GTDMGYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
G + P Q T + P ++Y + L+ IS++N + P F V G I DS
Sbjct: 212 GNVI---EPDIQYTPLV--PYMYHYNVVLQSISVNNANLTIDPKLFSNDV--MQGTIFDS 264
Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFED 276
G+ L Y + Y + F L +L F+ + FP++ YFE
Sbjct: 265 GTTLAYLPDEAYQAFTQAVSLVVAPFLLCD-------TRLSRFI---YKLFPNVVLYFEG 314
Query: 277 ANLRI 281
A++ +
Sbjct: 315 ASMTL 319
>gi|242095592|ref|XP_002438286.1| hypothetical protein SORBIDRAFT_10g011130 [Sorghum bicolor]
gi|241916509|gb|EER89653.1| hypothetical protein SORBIDRAFT_10g011130 [Sorghum bicolor]
Length = 495
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 85/334 (25%), Positives = 129/334 (38%), Gaps = 51/334 (15%)
Query: 28 FDPRKSSSFQKINCDHPDCTYFKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGK 86
FDP SSSF+ + C PDC C C +T++ + G +T+++
Sbjct: 185 FDPSMSSSFRSVLCGSPDCGGHSCSAGGSCTFTLQNSTFVFGNGTIVMDTLTL----SPS 240
Query: 87 AIFHGALFGCSN-DNHGFDEDARDGALAGVLGLSRVTISFISQL---GSIIKKRFSYCLV 142
A F GC DN F DG G + LS S +++ FSYC
Sbjct: 241 ATFENFAVGCMQLDNDLF----TDGVAVGNIDLSLSRHSLATRVLNSSPPGMAAFSYC-- 294
Query: 143 IPLPNGEYTSSYLKFGTDMG--YRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFP 198
LP T +L + + + +P NFYY+ L I+I+ E + P
Sbjct: 295 --LPADTDTHGFLTIAPALSDYSDHAGVKYVPLVTNPTGPNFYYVDLVAIAINGEDLPIP 352
Query: 199 PDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY 258
P F G +IDS S TY + +Y L ++F ++Q P+
Sbjct: 353 PALFT-----GNGTMIDSQSAFTYLNPPIYAALRDEFRKAMLQYQ---------PVPAFG 398
Query: 259 FLPETFNRFPSMAFYFEDANLRI-DGENVFIIDYENHFFL--------------LAVAPH 303
L +N + Y D LR +GE + + D + +F A AP
Sbjct: 399 GLDTCYNFTLAENIYLPDITLRFSNGETMDLDDRQFMYFFREHLTDGFPFGCLAFAAAPD 458
Query: 304 DDLV-ALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ +GSQ QR VYD+ +++FV C
Sbjct: 459 QNFPWNYLGSQVQRTKEIVYDVRGGMVAFVPSRC 492
>gi|413936472|gb|AFW71023.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
Length = 289
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 74/290 (25%), Positives = 118/290 (40%), Gaps = 40/290 (13%)
Query: 53 NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGAL 112
+QC + + YAD + T G + + +++ AI FGC + H A G
Sbjct: 34 GKQCGFAISYADGTSTVGAYSQDKLTL----APGAIVQNFYFGCGHGKH-----AVRGLF 84
Query: 113 AGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPS----T 168
GVLGL R+ S ++ G + FSYCL P+ +L G + PS T
Sbjct: 85 DGVLGLGRLRESLGARYGGV----FSYCL----PSVSSKPGFLALGAG---KNPSGFVFT 133
Query: 169 QATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVY 228
P F ++L I++ ++++ P F GG I+DSG+V+T S Y
Sbjct: 134 PMGTVPGQPT-FSTVTLAGINVGGKKLDLRPSAF------SGGMIVDSGTVITGLQSTAY 186
Query: 229 WKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFE-DANLRIDGENV 286
L F E ++L D + CY L N P +A F A + +D N
Sbjct: 187 RALRSAFRKAMEAYRLLPNGD----LDTCYNLTGYKNVVVPKIALTFTGGATINLDVPNG 242
Query: 287 FIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
++ N A + D ++G+ QR ++D + F + C
Sbjct: 243 ILV---NGCLAFAESGPDGSAGVLGNVNQRAFEVLFDTSTSKFGFRAKAC 289
>gi|356515690|ref|XP_003526531.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 439
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 87/361 (24%), Positives = 151/361 (41%), Gaps = 45/361 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY-----------AIFDPRKSSSFQKINCDHPDCTYF 49
+VR+ +GTP + + ++LDT + + F P+ S+S+ ++C P C
Sbjct: 100 VVRVKLGTPGQLLFMVLDTSTDEAFVPCSGCTGCSDTTFSPKASTSYGPLDCSVPQCGQV 159
Query: 50 KCVN------EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
+ ++ C + YA S F+A + ++ FGC N G
Sbjct: 160 RGLSCPATGTGACSFNQSYAGSS----FSATLVQDALRLATDVIPYYS--FGCVNAITGA 213
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
A+ R +S +SQ GS FSYCL P Y S LK G +G
Sbjct: 214 SVPAQGLLGL-----GRGPLSLLSQSGSNYSGIFSYCL--PSFKSYYFSGSLKLG-PVGQ 265
Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
+ S + T + P+ + YY++ IS+ + FP + + G IIDSG+V+T
Sbjct: 266 PK-SIRTTPLLRSPHRPSLYYVNFTGISVGRVLVPFPSEYLGFNPNTGSGTIIDSGTVIT 324
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRF-PSMAFYFEDANLR 280
F VY + E+F + C+ +T+ P + +FE +L+
Sbjct: 325 RFVEPVYNAVREEFRKQVGGTTFTSIGA----FDTCFV--KTYETLAPPITLHFEGLDLK 378
Query: 281 IDGENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ EN I LA+A D ++ +I + QQ++ R ++D+ + + +E C
Sbjct: 379 LPLENSLIHSSAGSLACLAMAAAPDNVNSVLNVIANFQQQNLRILFDIVNNKVGIAREVC 438
Query: 337 S 337
+
Sbjct: 439 N 439
>gi|356553263|ref|XP_003544977.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 445
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 71/264 (26%), Positives = 110/264 (41%), Gaps = 44/264 (16%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY----------AIFDPRKSSSFQKINCDHPDCTY-- 48
+V L IGTP + ++LDTGS L + A FDP SSSF + C HP C
Sbjct: 89 VVTLPIGTPPQPQQMVLDTGSQLSWIQCHNKTPPTASFDPSLSSSFYVLPCTHPLCKPRV 148
Query: 49 ------FKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
C N C Y+ YAD + +G E ++ + + GCS+++
Sbjct: 149 PDFTLPTTCDQNRLCHYSYFYADGTYAEGNLVREKLAFSPSQTTPPL----ILGCSSESR 204
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVI--PLPNGEYTSSYLKFGT 159
DAR G+LG++ +SF Q +FSYC+ P N + + G
Sbjct: 205 ----DAR-----GILGMNLGRLSFPFQAKVT---KFSYCVPTRQPANNNNFPTGSFYLGN 252
Query: 160 DMGYRR-------PSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
+ R Q+ + N Y + ++ I I ++N PP F G G
Sbjct: 253 NPNSARFRYVSMLTFPQSQRMPNLDPLAYTVPMQGIRIGGRKLNIPPSVFRPNAGGSGQT 312
Query: 213 IIDSGSVLTYFHSDVYWKLHEKFV 236
++DSGS T+ Y ++ E+ +
Sbjct: 313 MVDSGSEFTFLVDVAYDRVREEII 336
>gi|125532788|gb|EAY79353.1| hypothetical protein OsI_34482 [Oryza sativa Indica Group]
Length = 394
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 82/361 (22%), Positives = 142/361 (39%), Gaps = 55/361 (15%)
Query: 6 IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYF-- 49
IGTP + ++D L++ +FDP S++++ C P C
Sbjct: 57 IGTPPQPASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPCGTPLCESIPS 116
Query: 50 ---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
C C Y + T G +T +V G KA FGC + D D
Sbjct: 117 DSRNCSGNVCAY-QASTNAGDTGGKVGTDTFAV---GTAKASLA---FGCVVAS---DID 166
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM--GYR 164
G +G++GL R S ++Q G FSYCL P G+ ++ +L + G +
Sbjct: 167 TMGGP-SGIVGLGRTPWSLVTQTG---VAAFSYCLA-PHDAGKNSALFLGSSAKLAGGGK 221
Query: 165 RPSTQATKFINHPN---NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
ST + N N+Y + L+ + + + PP + ++D+ S ++
Sbjct: 222 AASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGSTV--------LLDTFSPIS 273
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRI 281
+ Y + + +A EP LC+ P + F F
Sbjct: 274 FLVDGAYQAVKKAVTVAVGAPPMATPV---EPFDLCFPKSGASGAAPDLVFTFRGGAAMT 330
Query: 282 DGENVFIIDYENHFFLLAVAPHDDL-----VALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ +++DY+N LA+ L ++L+GS QQ + F++DL+ + LSF +C
Sbjct: 331 VAASNYLLDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPADC 390
Query: 337 S 337
+
Sbjct: 391 T 391
>gi|115483168|ref|NP_001065177.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|21717168|gb|AAM76361.1|AC074196_19 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433289|gb|AAP54827.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639786|dbj|BAF27091.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|215686408|dbj|BAG87693.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 84/362 (23%), Positives = 145/362 (40%), Gaps = 57/362 (15%)
Query: 6 IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYF-- 49
IGTP + ++D L++ +FDP S++++ C P C
Sbjct: 57 IGTPPQPASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPCGTPLCESIPS 116
Query: 50 ---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
C C Y + T G +T +V G KA FGC + D D
Sbjct: 117 DSRNCSGNVCAY-QASTNAGDTGGKVGTDTFAV---GTAKASLA---FGCVVAS---DID 166
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM--GYR 164
G +G++GL R S ++Q G FSYCL P G ++ +L + G +
Sbjct: 167 TMGGP-SGIVGLGRTPWSLVTQTG---VAAFSYCLA-PHDAGRNSALFLGSSAKLAGGGK 221
Query: 165 RPSTQATKFINHPN---NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
ST + N N+Y + L+ + + + PP + ++D+ S ++
Sbjct: 222 AASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGSTV--------LLDTFSPIS 273
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFED-ANLR 280
+ Y + + + +A EP LC+ P + F F A +
Sbjct: 274 FLVDGAYQAVKKAVTAAVGAPPMATPV---EPFDLCFPKSGASGAAPDLVFTFRGGAAMT 330
Query: 281 IDGENVFIIDYENHFFLLAVAPHDDL-----VALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
+ N +++DY+N LA+ L ++L+GS QQ + F++DL+ + LSF +
Sbjct: 331 VPATN-YLLDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPAD 389
Query: 336 CS 337
C+
Sbjct: 390 CT 391
>gi|259490398|ref|NP_001159203.1| uncharacterized protein LOC100304289 [Zea mays]
gi|223942623|gb|ACN25395.1| unknown [Zea mays]
Length = 378
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 88/373 (23%), Positives = 140/373 (37%), Gaps = 48/373 (12%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI----------------FDPRKSSSFQKINCDHPD 45
VR +GTP++ +L+ DTGS L + F +S S+ + C
Sbjct: 16 VRFRVGTPAQPFVLVADTGSDLTWVKCRGAAGPPASDPPAREFRASESRSWAPLACSSDT 75
Query: 46 CTYF------KCVN--EQCVYTMKYADQSVTKGFAAHETISVI----------GKGEGKA 87
CT + C + C Y +Y D S +G + ++ G G +A
Sbjct: 76 CTSYVPFSLANCSSPASPCAYDYRYKDGSAARGVVGTDAATIALSGSGSEDGSGGGGRRA 135
Query: 88 IFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPN 147
G + GC+ G + DG VL L ISF S+ + RFSYCLV L
Sbjct: 136 KLQGVVLGCTATYDGQSFQSSDG----VLSLGNSNISFASRAAARFGGRFSYCLVDHLAP 191
Query: 148 GEYTSSYLKFGTDMGYRRPSTQATKFI--NHPNNFYYLSLKDISIDNERMNFPPDTFDIT 205
SSYL FG T + + FY +++ + + E ++ P D +D
Sbjct: 192 -RNASSYLTFGPGPEGGGAPAARTPLVLDRRVSPFYAVAVDAVYVAGEALDIPADVWD-- 248
Query: 206 VSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN 265
V GG I+DSG+ LT + Y + + +P + CY
Sbjct: 249 VGRGGGAILDSGTSLTVLATPAYRAVVAALGGRLAALPRVAM----DPFEYCYNWTAGAP 304
Query: 266 RFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHD-DLVALIGSQQQRDTRFVYDL 324
P + F + ++ID + V V++IG+ Q++ + +DL
Sbjct: 305 EIPKLEVSFAGSARLEPPAKSYVIDAAPGVKCIGVQEGAWPGVSVIGNILQQEHLWEFDL 364
Query: 325 NIDLLSFVKENCS 337
L F C+
Sbjct: 365 RDRWLRFKHTRCA 377
>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
Length = 472
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 90/372 (24%), Positives = 144/372 (38%), Gaps = 63/372 (16%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDH- 43
+V L IGTP+ +++DTGS L + ++DP SS++ + CD
Sbjct: 128 VVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNSSSCYPQKDPLYDPTASSTYAPVPCDSK 187
Query: 44 ------PDCTYFKCVNEQ----CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL 93
PD C N C Y ++Y ++ T G + ET+++ + K
Sbjct: 188 ACKDLVPDAYDHGCTNSSGTSLCQYGIEYGNRDTTVGVYSTETLTLSPQVSVKDFG---- 243
Query: 94 FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSS 153
FGC G + G+LGL S +SQ FSYC LP G T+
Sbjct: 244 FGC-----GLVQQGTFDLFDGLLGLGGAPESLVSQTAETYGGAFSYC----LPPGNSTTG 294
Query: 154 YLKFGTDMGYRRPSTQATKFINHP-------NNFYYLSLKDISIDNERMNFPPDTFDITV 206
+L G + F+ P FY ++L +S+ + ++ PP
Sbjct: 295 FLALGAPTN----NNDTAGFLFTPLHSLPEQATFYLVNLTGVSVGGKPLDIPPTVL---- 346
Query: 207 SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN- 265
GG IIDSG+++T Y L F + + L ++ + + CY N
Sbjct: 347 --SGGMIIDSGTIITGLPDTAYSALRTAFRTAMSAYPLLPPNN-DDVLDTCYNFTGIANV 403
Query: 266 RFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDL 324
P++A F+ A + +D + +I A D V +IG+ QR +YD
Sbjct: 404 TVPTVALTFDGGATIDLDVPSGVLI---QDCLAFAGGASDGDVGIIGNVNQRTFEVLYDS 460
Query: 325 NIDLLSFVKENC 336
+ F C
Sbjct: 461 GRGHVGFRPGAC 472
>gi|356556809|ref|XP_003546713.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like [Glycine max]
Length = 444
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 78/362 (21%), Positives = 140/362 (38%), Gaps = 49/362 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY------------AIFDPRKSSSFQKINCDHPDCTY 48
+VR GTP++ +LL +DT + + F P KS++F+K+ C C
Sbjct: 107 IVRAKFGTPAQTLLLAMDTSNDAAWVPCTACVGCSTTTPFAPPKSTTFKKVGCGASQCKQ 166
Query: 49 FK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
+ C C + Y SV +T+++ FGC G
Sbjct: 167 VRNPTCDGSACAFNFTYGTSSVAASLV-QDTVTL-----ATDPVPAYTFGCIQKATGSSL 220
Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM-GYR 164
+ R +S ++Q + + FSYCL + + D+
Sbjct: 221 PPQGLLGL-----GRGPLSLLAQTQKLYQSTFSYCL------PSFKTLNFSGHXDLXPVA 269
Query: 165 RPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
+P Q +P ++ YY++L I + ++ PP+ G + DSG+V T
Sbjct: 270 QPRDQVYPSFKNPRRSSLYYVNLVAIRVGRRIVDIPPEALAFNPXTGAGTVFDSGTVFTR 329
Query: 223 FHSDVYWKLHEKF---VSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANL 279
Y + +F VS ++ + L CY +P P++ F F N+
Sbjct: 330 LVEPAYTAVRNEFRRRVSVHKKLTVTSLGG----FDTCYTVPIV---APTITFMFSGMNV 382
Query: 280 RIDGENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
+ +N+ I LA+AP D ++ +I + QQ++ R ++D+ L +E
Sbjct: 383 TLPPDNILIHSTAGSVTCLAMAPAPDNVNSVLNVIANMQQQNHRVLFDVPNSRLGVAREL 442
Query: 336 CS 337
C+
Sbjct: 443 CT 444
>gi|125555046|gb|EAZ00652.1| hypothetical protein OsI_22673 [Oryza sativa Indica Group]
Length = 340
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 73/256 (28%), Positives = 111/256 (43%), Gaps = 40/256 (15%)
Query: 20 GSALIYAIFDPRKSSSFQKINCDHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISV 79
G A FDP +SSSF I C P+C +C C +T+++ + +V G +T+++
Sbjct: 25 GGAPCDVAFDPSRSSSFAAIPCGSPECA-VECTGASCPFTIQFGNVTVANGTLVRDTLTL 83
Query: 80 IGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS-----IIK 134
A F G FGC G D D DGA+ G++ LSR + S S++ S
Sbjct: 84 ----SPSATFAGFTFGC--IEVGADADTFDGAV-GLIDLSRSSHSLASRVISNGATTTTT 136
Query: 135 KRFSYCLVIPLPNGEYTSSYLKFGT--------DMGYRRPSTQATKFINHPNNFYYLSLK 186
FSYCL P + + +L G D+ Y S+ NHPN+ Y++ L
Sbjct: 137 AAFSYCL--PSLSSTRSRGFLSIGASRPEYSGGDIKYAPMSSNP----NHPNS-YFVDLV 189
Query: 187 DISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQ 246
IS+ E + PP V G ++++ + T+ Y L + F R +AQ
Sbjct: 190 GISVGGEDLPVPP-----AVLAAHGTLLEAATEFTFLAPAAYAALRDAF-----RNDMAQ 239
Query: 247 LSDCP--EPIQLCYFL 260
P + CY L
Sbjct: 240 YPAAPPFRVLDTCYNL 255
>gi|414589629|tpg|DAA40200.1| TPA: hypothetical protein ZEAMMB73_727364, partial [Zea mays]
Length = 201
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 47/178 (26%), Positives = 79/178 (44%), Gaps = 13/178 (7%)
Query: 169 QATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSD 226
Q T + P N FYY+ +++ R+ P F + G GG I+DSG+ LT +
Sbjct: 27 QTTPLLQSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALTLLPAA 86
Query: 227 VYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR--------FPSMAFYFEDAN 278
V L E ++ ++ +L + +C+ +P + R P M +F+ A+
Sbjct: 87 V---LAEVVRAFRQQLRLPFANGGNPEDGVCFLVPAAWRRSSSTSQMPVPRMVLHFQGAD 143
Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
L + N + D+ L +A D + IG+ Q+D R +YDL + LS C
Sbjct: 144 LDLPRRNYVLDDHRRGRLCLLLADSGDDGSTIGNLVQQDMRVLYDLEAETLSIAPARC 201
>gi|226531872|ref|NP_001147022.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195606574|gb|ACG25117.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 491
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 91/402 (22%), Positives = 162/402 (40%), Gaps = 85/402 (21%)
Query: 6 IGTPSKGVLLILDTGSALIY--------------------AIFDPRKSSSFQKINCDHPD 45
+GTP + + ++LDTGS L + +F P+ SSS + + C +P
Sbjct: 105 LGTPPQPLPVLLDTGSHLTWVPCTSSYECRNCSSPSASAVPVFHPKNSSSSRLVGCRNPS 164
Query: 46 CTYF--------KCVNEQC----------------VYTMKYADQSVTKGFAAHETISVIG 81
C + KC C Y + Y S T G +T+
Sbjct: 165 CQWVHSAANLATKCRRAPCSPGAANCPAAASNVCPPYAVVYGSGS-TAGLLIADTL---- 219
Query: 82 KGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL 141
+ G+A+ G + GCS + +G+ G R S +QLG +FSYCL
Sbjct: 220 RAPGRAV-PGFVLGCS-------LVSVHQPPSGLAGFGRGAPSVPAQLG---LPKFSYCL 268
Query: 142 VI------PLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNN-FYYLSLKDISIDNER 194
+ +G G M Y P ++ P +YYL+L+ +++ +
Sbjct: 269 LSRRFDDNAAVSGSLVLGGTGGGEGMQYV-PLVKSAAGDKLPYGVYYYLALRGVTVGGKA 327
Query: 195 MNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYF-ERFQLAQLSDCPEP 253
+ P F +G GG I+DSG+ TY V+ + + V+ R++ ++ ++
Sbjct: 328 VRLPARAFAGNAAGSGGTIVDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDAEDGLG 387
Query: 254 IQLCYFLPETFNR--FPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVA-- 308
+ C+ LP+ P ++F+FE A +++ EN F++ + +A D
Sbjct: 388 LHPCFALPQGARSMALPELSFHFEGGAVMQLPVENYFVVAGRGAVEAICLAVVTDFGGGS 447
Query: 309 -----------LIGSQQQRDTRFVYDLNIDLLSFVKENCSDD 339
++GS QQ++ YDL + L F +++C+
Sbjct: 448 GAGNEGSGPAIILGSFQQQNYLVEYDLEKERLGFRRQSCTSS 489
>gi|413948408|gb|AFW81057.1| hypothetical protein ZEAMMB73_038743 [Zea mays]
Length = 469
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 87/373 (23%), Positives = 140/373 (37%), Gaps = 48/373 (12%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI----------------FDPRKSSSFQKINCDHPD 45
VR +GTP++ +L+ DTGS L + F +S S+ + C
Sbjct: 107 VRFRVGTPAQPFVLVADTGSDLTWVKCRGAAGPPASDPPAREFRASESRSWAPLACSSDT 166
Query: 46 CTYF------KCVN--EQCVYTMKYADQSVTKGFAAHETISVI----------GKGEGKA 87
CT + C + C Y +Y D S +G + ++ G G +A
Sbjct: 167 CTSYVPFSLANCSSPASPCAYDYRYKDGSAARGVVGTDAATIALSGSGSEDGSGGGGRRA 226
Query: 88 IFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPN 147
G + GC+ G + DG VL L ISF S+ + RFSYCLV L
Sbjct: 227 KLQGVVLGCTATYDGQSFQSSDG----VLSLGNSNISFASRAAARFGGRFSYCLVDHLAP 282
Query: 148 GEYTSSYLKFGTDMGYRRPSTQATKFI--NHPNNFYYLSLKDISIDNERMNFPPDTFDIT 205
SSYL FG T + + FY +++ + + E ++ P D +D+
Sbjct: 283 -RNASSYLTFGPGPEGGGAPAARTPLVLDRRVSPFYAVAVDAVYVAGEALDIPADVWDVG 341
Query: 206 VSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN 265
GG I+DSG+ LT + Y + + +P + CY
Sbjct: 342 RG--GGAILDSGTSLTVLATPAYRAVVAALGGRLAALPRVAM----DPFEYCYNWTAGAP 395
Query: 266 RFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHD-DLVALIGSQQQRDTRFVYDL 324
P + F + ++ID + V V++IG+ Q++ + +DL
Sbjct: 396 EIPKLEVSFAGSARLEPPAKSYVIDAAPGVKCIGVQEGAWPGVSVIGNILQQEHLWEFDL 455
Query: 325 NIDLLSFVKENCS 337
L F C+
Sbjct: 456 RDRWLRFKHTRCA 468
>gi|125533812|gb|EAY80360.1| hypothetical protein OsI_35532 [Oryza sativa Indica Group]
Length = 428
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 91/360 (25%), Positives = 152/360 (42%), Gaps = 53/360 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYAIFD-------PR-----KSSSFQKINCDHPDCTY 48
++ + +GTP+K ++ +DTGS+ + + PR +S++ K++C C
Sbjct: 83 VISVGLGTPAKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQSRSTTCAKVSCGTSMCLL 142
Query: 49 F----KCVNEQ----CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
C + + C + + Y D S + G +T++ + + FGC+ D+
Sbjct: 143 GGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSF----TFGCNLDS 198
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY-----TSSYL 155
G +E G + G+LG+ +S + Q S FSYCL PL E T+ Y
Sbjct: 199 FGANEF---GNVDGLLGMGAGPMSVLKQ-SSPRFDGFSYCL--PLQKSERGFFSKTTGYF 252
Query: 156 KFGTDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
G R + TK + N +++ L IS+D ER+ P F G +
Sbjct: 253 SLGKVA--TRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFS-----RKGVV 305
Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET-FNRFPSMAF 272
DSGS L+Y L ++ R A+ E + CY + P+++
Sbjct: 306 FDSGSELSYIPDRALSVLSQRIRELLLRRGAAE----EESERNCYDMRSVDEGDMPAISL 361
Query: 273 YFED-ANLRIDGENVFIID--YENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
+F+D A + VF+ E + LA AP + V++IGS Q VYDL L+
Sbjct: 362 HFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTES-VSIIGSLMQTSKEVVYDLKRQLI 420
>gi|380719867|gb|AFD63134.1| aspartyl protease [Vitis quinquangularis]
Length = 458
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 91/389 (23%), Positives = 151/389 (38%), Gaps = 73/389 (18%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA--------------------IFDPRKSSSFQKINC 41
+ L GTP + + ++DTGS +++A IF+P SSS + + C
Sbjct: 89 IPLSFGTPPQKLSFLMDTGSHVVWAPCTTHYTCTNCSFSNPKKVPIFNPELSSSDKILGC 148
Query: 42 DHPDCT-----------------YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGE 84
P C KC + YT++Y + + GF E + GK
Sbjct: 149 RDPKCADTSSPBVHLGXPRCNGNSKKCSHACPQYTLQYGTGAAS-GFFLLENLDFPGK-- 205
Query: 85 GKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIP 144
H L GC+ D + ALAG R S Q+G K+F+YCL
Sbjct: 206 ---TIHKFLVGCTTSA---DREPSSDALAG---FGRTMFSLPMQMGV---KKFAYCLN-- 251
Query: 145 LPNGEYTSSYLKFGTDMGYRRPSTQA---TKFINHPNNF---YYLSLKDISIDNERMNFP 198
+ +Y + + Y TQ F +P ++ YYL +KD+ I N+ + P
Sbjct: 252 --SHDYDDTRNSGKLILDYSDGETQGLSYAPFXKNPPDYPIYYYLGVKDMKIGNKVLRIP 309
Query: 199 PDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY 258
GG +IDSG +Y V+ + + +++ + + + CY
Sbjct: 310 GKYLTPGSDSRGGVVIDSGFAYSYMTLPVFKIVTNELKKQMSKYRRSLELEAQTGVTPCY 369
Query: 259 -FLPETFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAV---APHDDL------V 307
F + P + + F AN+ + G N F++ E V +P +L
Sbjct: 370 NFTGHKSIKIPDLIYQFTGGANMVVPGMNYFLLFSEASLGCFPVTTDSPTSNLEFTPGPS 429
Query: 308 ALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
++G+ QQ D +DL + L F ++ C
Sbjct: 430 IILGNYQQVDHYVEFDLKNERLGFRQQTC 458
>gi|297806153|ref|XP_002870960.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
gi|297316797|gb|EFH47219.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
Length = 453
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 89/374 (23%), Positives = 153/374 (40%), Gaps = 59/374 (15%)
Query: 9 PSKGVLLILDTGSALIYAI------------FDPRKSSSFQKINCDHPDCTY-------- 48
P + + +++DTGS L + FDP +SSS+ I C P C
Sbjct: 82 PPQNISMVIDTGSELSWLRCNRSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRDFLIP 141
Query: 49 FKCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDA 107
C +++ C T+ YAD S ++G A E I G + +FGC G D +
Sbjct: 142 ASCDSDKLCHATLSYADASSSEGNLAAE-IFHFGNSTNDS---NLIFGCMGSVSGSDPE- 196
Query: 108 RDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV----IP--LPNGEYTSSYLKFGTDM 161
D G+LG++R ++SFISQ+G +FSYC+ P L G+ ++L T +
Sbjct: 197 EDTKTTGLLGMNRGSLSFISQMG---FPKFSYCISGTDDFPGFLLLGDSNFTWL---TPL 250
Query: 162 GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
Y +T Y + L I ++ + + P +G G ++DSG+ T
Sbjct: 251 NYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLLPDHTGAGQTMVDSGTQFT 310
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-----PIQLCYFLPE------TFNRFPSM 270
+ VY L F++ + + + + PE + LCY + +R P++
Sbjct: 311 FLLGPVYTALRSDFLN--QTNGILTVYEDPEFVFQGTMDLCYRISPFRIRTGILHRLPTV 368
Query: 271 AFYFEDANLRIDGENVFI----IDYENHFFLLAVAPHDDLVAL----IGSQQQRDTRFVY 322
+ FE A + + G+ + + N + DL+ + IG Q++ +
Sbjct: 369 SLVFEGAEIAVSGQPLLYRVPHLTAGNDSVYCFTFGNSDLMGMEAYVIGHHHQQNMWIEF 428
Query: 323 DLNIDLLSFVKENC 336
DL + C
Sbjct: 429 DLQRSRIGLAPVQC 442
>gi|242063796|ref|XP_002453187.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
gi|241933018|gb|EES06163.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
Length = 493
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 86/366 (23%), Positives = 140/366 (38%), Gaps = 60/366 (16%)
Query: 9 PSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPDCTYF---- 49
P K +++DTGS + + +FDP SS++ +C C
Sbjct: 150 PGKSQTMLIDTGSDISWVRCKPCWQQCRPQVDPLFDPSLSSTYSPFSCSSAACAQLFQEG 209
Query: 50 ---KCVNE-QCVYTMKYADQSV-TKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
C + QC Y Y D SV T G + +T++ +G + FGCS+ G
Sbjct: 210 NANGCSSSGQCQYIAMYGDGSVGTTGTYSSDTLA-LGSNSNTVVVSKFRFGCSHAETGIT 268
Query: 105 EDARDGALAGVLGLSRVTISFISQL-GSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
S +SQ G+ FSYCL P +S +L G
Sbjct: 269 GLTAGLMGL-----GGGAQSLVSQTAGTFGTTAFSYCL----PPTPSSSGFLTLGAA--- 316
Query: 164 RRPSTQATKFINHP-------NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
T + F+ P FY + L+ I + +++ P F G I+DS
Sbjct: 317 ---GTSSAGFVKTPMLRSSQVPAFYGVRLEAIRVGGRQLSIPTTVF------SAGMIMDS 367
Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP-ETFNRFPSMAFYFE 275
G+V+T Y L F + +++ A S + C+ + ++ P++A F
Sbjct: 368 GTVVTRLPPTAYSSLSSAFKAGMKQYPPAPSSAGGGFLDTCFDMSGQSSVSMPTVALVFS 427
Query: 276 DAN---LRIDGENVFIIDYENHFFLLA-VAPHDD-LVALIGSQQQRDTRFVYDLNIDLLS 330
A + +D + + + F LA VA DD +IG+ QQR + +YD+ +
Sbjct: 428 GAGGAVVNLDASGILLQMETSSIFCLAFVATSDDGSTGIIGNVQQRTFQVLYDVAGGAVG 487
Query: 331 FVKENC 336
F C
Sbjct: 488 FKAGAC 493
>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 488
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 92/378 (24%), Positives = 151/378 (39%), Gaps = 64/378 (16%)
Query: 3 RLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCDH 43
++ +G P K + +DTGS +++ ++DP+ S+S +I CD
Sbjct: 85 KIGLGNPPKDYYVQVDTGSDILWVNCANCDKCPTKSDLGVKLTLYDPQSSTSATRIYCDD 144
Query: 44 PDC--TYFK----CVNE-QCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGAL 93
C TY C + C Y++ Y D S T GF + + V G + + +
Sbjct: 145 DFCAATYNGVLQGCTKDLPCQYSVVYGDGSSTAGFFVKDNLQFDRVTGNLQTSSANGSVI 204
Query: 94 FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYT 151
FGC G + AL G+LG + S ISQL + +K+ F++CL G +
Sbjct: 205 FGCGAKQSG-ELGTSSEALDGILGFGQANSSMISQLAAAGKVKRVFAHCLDNVKGGGIFA 263
Query: 152 SSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
+ P T + N P+ Y + +K+I + + P D FD
Sbjct: 264 IGEVV--------SPKVNTTPMVPNQPH--YNVVMKEIEVGGNVLELPTDIFD--TGDRR 311
Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-FPS 269
G IIDSG+ L Y VY + K VS +L + E C+ N FP
Sbjct: 312 GTIIDSGTTLAYLPEVVYESMMTKIVSEQPGLKLHTV----EEQFTCFQYTGNVNEGFPV 367
Query: 270 MAFYFEDA-NLRID--------GENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTR 319
+ F+F + +L ++ E V+ ++N + D + L+G +
Sbjct: 368 VKFHFNGSLSLTVNPHDYLFQIHEEVWCFGWQNS----GMQSKDGRDMTLLGDLVLSNKL 423
Query: 320 FVYDLNIDLLSFVKENCS 337
+YDL + + NCS
Sbjct: 424 VLYDLENQAIGWTDYNCS 441
>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
Length = 448
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 79/321 (24%), Positives = 134/321 (41%), Gaps = 58/321 (18%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
+++ IG P + +DTGS L++ ++DP +S S K+ C C
Sbjct: 88 IMQFSIGEPPLLIWAEVDTGSDLMWVKCSPCNGCNPPPSPLYDPARSRSSGKLPCSSQLC 147
Query: 47 TYF---KCVNEQCV---------YTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALF 94
+ +++QC Y ++ T+G ET + G+G + + F
Sbjct: 148 QALGRGRIISDQCSDDPPLCGYHYAYGHSGDHSTQGVLGTETFTF---GDGY-VANNVSF 203
Query: 95 GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT--- 151
G S+ G ++ G AG++GL R +S +SQLG+ RF+YCL PN T
Sbjct: 204 GRSDTIDG----SQFGGTAGLVGLGRGHLSLVSQLGA---GRFAYCLAAD-PNVYSTILF 255
Query: 152 SSYLKFGTDMGYRRPSTQATKFINHP----NNFYYLSLKDISIDNERMNFPPDTFDITVS 207
S T G +T + +P + YY++L+ IS+ R+ TF I
Sbjct: 256 GSLAALDTSAG----DVSSTPLVTNPKPDRDTHYYVNLQGISVGGSRLPIKDGTFAINSD 311
Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP--ETFN 265
G GG DSG++ T Y + + S +R D C+ +
Sbjct: 312 GSGGVFFDSGAIDTSLKDAAYQVVRQAITSEIQRLGYDAGDDT------CFVAANQQAVA 365
Query: 266 RFPSMAFYFED-ANLRIDGEN 285
+ P + +F+D A++ ++G N
Sbjct: 366 QMPPLVLHFDDGADMSLNGRN 386
>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 457
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 78/272 (28%), Positives = 116/272 (42%), Gaps = 57/272 (20%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
+V L IGTP + ++LDTGS L + A FDP SS+F + C HP C
Sbjct: 98 IVDLPIGTPPQVQPMVLDTGSQLSWIQCHKKAPAKPPPTASFDPSLSSTFSTLPCTHPVC 157
Query: 47 TY--------FKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL-FGC 96
C N C Y+ YAD + +G E + +++F L GC
Sbjct: 158 KPRIPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTF-----SRSLFTPPLILGC 212
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
+ ++ D R G+LG++R +SF SQ S I K FSYC+ P Y
Sbjct: 213 ATES----TDPR-----GILGMNRGRLSFASQ--SKITK-FSYCV----PTRVTRPGYTP 256
Query: 157 FGTDMGYRRPSTQATKFI---------NHPNN---FYYLSLKDISIDNERMNFPPDTFDI 204
G+ P++ ++I PN Y ++L+ I I ++N P F
Sbjct: 257 TGSFYLGHNPNSNTFRYIEMLTFARSQRMPNLDPLAYTVALQGIRIGGRKLNISPAVFRA 316
Query: 205 TVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFV 236
G G ++DSGS TY ++ Y K+ + V
Sbjct: 317 DAGGSGQTMLDSGSEFTYLVNEAYDKVRAEVV 348
>gi|88174563|gb|ABD39356.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
Length = 323
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 84/340 (24%), Positives = 144/340 (42%), Gaps = 47/340 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYAIFD-------PR-----KSSSFQKINCDHPDCTY 48
++ + +GTPSK +L +DTGS+ + + PR +S++ K++C C
Sbjct: 2 VISVGLGTPSKTQILEIDTGSSTSWVFCECDGCHTNPRTFLQSRSTTCAKVSCGTSMCLL 61
Query: 49 F----KCVNEQ----CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
C + + C + + Y D S + G +T++ G FGC+ D+
Sbjct: 62 GGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF----SDVQKIPGFSFGCNMDS 117
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY---TSSYLKF 157
G +E G + G+LG+ +S + Q S FSYCL + + + T+ Y
Sbjct: 118 FGANE---FGNVDGLLGMGAGPMSVLKQ-SSPTFDGFSYCLPLQMSERGFFSKTTGYFSL 173
Query: 158 GTDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
G + R + TK + N +++ L IS+D ER+ P F G + D
Sbjct: 174 GGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFS-----RKGVVFD 228
Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET-FNRFPSMAFYF 274
SGS L+Y L ++ R A+ E + CY + P+++ +F
Sbjct: 229 SGSELSYIPDRALSVLSQRIRELLLRRGAAE----EESERNCYDMRSVDEGDMPAISLHF 284
Query: 275 ED-ANLRIDGENVFIID--YENHFFLLAVAPHDDLVALIG 311
+D A + VF+ E + LA AP + V++IG
Sbjct: 285 DDGARFDLGSHGVFVERSVQEQDVWCLAFAPTES-VSIIG 323
>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 506
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 88/376 (23%), Positives = 149/376 (39%), Gaps = 56/376 (14%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI-------------------FDPRKSSSFQKINCD 42
R+ +G P+K + +DTGS +++ F+P SS+ +I C
Sbjct: 91 TRVKLGNPAKEYFVQIDTGSDILWVACSPCTGCPTSSGLNIQLEFFNPDSSSTSSRIPCS 150
Query: 43 HPDCTYFKCVNEQ-----------CVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAI 88
CT E C YT Y D S T GF +T+ +V+G +
Sbjct: 151 DDRCTAALQTGEAVCQSSDSPSSPCGYTFTYGDGSGTSGFYVSDTMYFDTVMGNEQTANS 210
Query: 89 FHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLP 146
+FGCSN G D D A+ G+ G + +S +SQL S + K FS+C L
Sbjct: 211 SASVVFGCSNSQSG-DLMKTDRAVDGIFGFGQHQLSVVSQLYSLGVSPKTFSHC----LK 265
Query: 147 NGEYTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDIT 205
+ L G + P T + + P+ Y L+L+ I++ +++ P D+
Sbjct: 266 GSDNGGGILVLGEIV---EPGLVFTPLVPSQPH--YNLNLESIAVSGQKL--PIDSSLFA 318
Query: 206 VSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN 265
S G I+DSG+ L Y Y + F++ + C+ + +
Sbjct: 319 TSNTQGTIVDSGTTLVYLVDGAY----DPFINAIAAAVSPSVRSVVSKGIQCFVTTSSVD 374
Query: 266 -RFPSMAFYFEDA-NLRIDGENVFIID--YENHFFLLAVAPHDDLVALIGSQQQRDTRFV 321
FP+ YF+ ++ + EN + +N+ + ++G +D FV
Sbjct: 375 SSFPTATLYFKGGVSMTVKPENYLLQQGSVDNNVLWCIGWQRSQGITILGDLVLKDKIFV 434
Query: 322 YDLNIDLLSFVKENCS 337
YDL + + +CS
Sbjct: 435 YDLANMRMGWADYDCS 450
>gi|224140036|ref|XP_002323393.1| predicted protein [Populus trichocarpa]
gi|222868023|gb|EEF05154.1| predicted protein [Populus trichocarpa]
Length = 459
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 97/391 (24%), Positives = 153/391 (39%), Gaps = 71/391 (18%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY----------------------AIFDPRKSSSFQKI 39
+ L GTP + ++DTGS+L++ F P+ SSS + I
Sbjct: 85 ISLNFGTPPQTTKFVMDTGSSLVWFPCTSRYLCSECNFPNIKKTGIPTFLPKLSSSSKLI 144
Query: 40 NCDHPDCTYF-------KC-----VNEQCV-----YTMKYADQSVTKGFAAHETISVIGK 82
C +P C+ KC + C Y ++Y S T G ET+
Sbjct: 145 GCKNPRCSMIFGPEIQSKCQECDSTAQNCTQTCPPYVIQYGSGS-TAGLLLSETLDF--- 200
Query: 83 GEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV 142
K L GCS + E G+ G R S SQLG K+FSYCLV
Sbjct: 201 -PNKKTIPDFLVGCSIFSIKQPE--------GIAGFGRSPESLPSQLG---LKKFSYCLV 248
Query: 143 IPLPNGEYTSSYLKFGTDMGYRRPSTQA---TKFINHPN----NFYYLSLKDISIDNERM 195
+ TSS L T G T T F+ +P ++YY+ L++I I + +
Sbjct: 249 SHAFDDTPTSSDLVLDTGSGSGVTKTAGLSHTPFLKNPTTAFRDYYYVLLRNIVIGDTHV 308
Query: 196 NFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQ 255
P G GG I+DSG+ T+ + VY + ++F + +A ++
Sbjct: 309 KVPYKFLVPGTDGNGGTIVDSGTTFTFMENPVYELVAKEFEKQMAHYTVATEIQNLTGLR 368
Query: 256 LCYFLP-ETFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPH-------DDL 306
CY + E P + F F+ A + + N F I L V+ +
Sbjct: 369 PCYNISGEKSLSVPDLIFQFKGGAKMALPLSNYFSIVDSGVICLTIVSDNVAGPGLGGGP 428
Query: 307 VALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
++G+ QQR+ +DL + F +++C+
Sbjct: 429 AIILGNYQQRNFYVEFDLENEKFGFKQQSCA 459
>gi|212275143|ref|NP_001130306.1| uncharacterized protein LOC100191400 precursor [Zea mays]
gi|194688798|gb|ACF78483.1| unknown [Zea mays]
gi|194703430|gb|ACF85799.1| unknown [Zea mays]
gi|194707192|gb|ACF87680.1| unknown [Zea mays]
gi|223944599|gb|ACN26383.1| unknown [Zea mays]
gi|223948667|gb|ACN28417.1| unknown [Zea mays]
gi|414887962|tpg|DAA63976.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 450
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 80/360 (22%), Positives = 145/360 (40%), Gaps = 47/360 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
+VR +GTP + +LL +DT + + A FDP S+S++ + C P C
Sbjct: 113 VVRASLGTPPQQLLLAVDTSNDASWIPCAGCAGCPTSSAAPFDPASSASYRTVPCGSPLC 172
Query: 47 TYFKCV-----NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
+ C +++ YAD S+ + ++++V G FGC
Sbjct: 173 AQAPNAACPPGGKACGFSLTYADSSLQAALS-QDSLAVAGNA-----VKAYTFGCLQRAT 226
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
G + R +SF+SQ + + FSYCL P S L+ G +
Sbjct: 227 GTAAPPQGLLGL-----GRGPLSFLSQTKDMYEATFSYCL--PSFKSLNFSGTLRLGRNG 279
Query: 162 GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
+R T H ++ YY+++ I + + + P FD G ++DSG++ T
Sbjct: 280 QPQRIKTTPLLANPHRSSLYYVNMTGIRVGRKVVPIP--AFDPATG--AGTVLDSGTMFT 335
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRI 281
+ Y + ++ R ++ C+ T +P + F+ + +
Sbjct: 336 RLVAPAYVAVRDEV-----RRRVGAPVSSLGGFDTCFN--TTAVAWPPVTLLFDGMQVTL 388
Query: 282 DGENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
ENV I LA+A D ++ +I S QQ++ R ++D+ + F +E C+
Sbjct: 389 PEENVVIHSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARERCT 448
>gi|226492150|ref|NP_001146362.1| hypothetical protein precursor [Zea mays]
gi|219886805|gb|ACL53777.1| unknown [Zea mays]
gi|414878074|tpg|DAA55205.1| TPA: hypothetical protein ZEAMMB73_415404 [Zea mays]
Length = 440
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 93/393 (23%), Positives = 145/393 (36%), Gaps = 88/393 (22%)
Query: 5 FIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPDCTY- 48
IG P + I+DTGS LI+ +DP +S + + + C+ C
Sbjct: 76 LIGDPPQRAEAIIDTGSNLIWTQCSRCRPTCFRQNLPYYDPSRSRAARAVGCNDAACALG 135
Query: 49 --FKCV--NEQCVYTMKYADQSVTKGFAAH------ETISVIGKGEGKAIFHGALFGCSN 98
+C+ N+ C Y ++ A ET+S++ FGC
Sbjct: 136 SETQCLSDNKTCAVVTGYGAGNIAGTLATENLTFQSETVSLV-------------FGCIV 182
Query: 99 DNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
+ +GA +G++GL R +S SQLG RFSYCL P S++ G
Sbjct: 183 VTK-LSPGSLNGA-SGIIGLGRGKLSLPSQLG---DTRFSYCLT-PYFEDTIEPSHMVVG 236
Query: 159 TDMGYRRPSTQATK-----FINHPNN-----FYYLSLKDISIDNERMNFPPDTFDITVSG 208
G S +T F+ P++ FYYL L I+ ++ P FD+
Sbjct: 237 ASAGLINGSASSTPVTTVPFVRSPSDDPFSTFYYLPLTGITAGKVKLAVPSAAFDLRQVA 296
Query: 209 EG---GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN 265
G G IDSG+ LT Y L + + L+ LC L +
Sbjct: 297 PGMWTGTFIDSGAPLTSLVDVAYQALRAELARQLGAALVQPLAGT-TGFDLCVALKDAER 355
Query: 266 RFPSMAFYFEDANLRIDGENV-FIIDYENHFFLLAVAPHDDLVA---------------- 308
P + +F + G ++ N++ AP D A
Sbjct: 356 LVPPLVLHFGGGS----GTGTDLVVPPANYW-----APVDSATACMVVFSSVDRKSLPMN 406
Query: 309 ---LIGSQQQRDTRFVYDLNIDLLSFVKENCSD 338
+IG+ Q++ +YDL +LSF +CS
Sbjct: 407 ETTVIGNYMQQNMHVLYDLAGGVLSFQPADCSS 439
>gi|115483166|ref|NP_001065176.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|21717159|gb|AAM76352.1|AC074196_10 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433285|gb|AAP54823.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639785|dbj|BAF27090.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|215692411|dbj|BAG87831.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 84/362 (23%), Positives = 145/362 (40%), Gaps = 57/362 (15%)
Query: 6 IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYF-- 49
IGTP + ++D L++ +FDP S++++ C P C
Sbjct: 57 IGTPPQPASAVIDLAGELVWTQCKQCGRCFEQGTPLFDPTASNTYRAEPCGTPLCESIPS 116
Query: 50 ---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
C C Y + T G +T +V G KA FGC + D D
Sbjct: 117 DVRNCSGNVCAYEAS-TNAGDTGGKVGTDTFAV---GTAKASLA---FGCVVAS---DID 166
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM--GYR 164
G +G++GL R S ++Q G FSYCL P G+ ++ +L + G +
Sbjct: 167 TMGGP-SGIVGLGRTPWSLVTQTG---VAAFSYCLA-PHDAGKNSALFLGSSAKLAGGGK 221
Query: 165 RPSTQATKFINHPN---NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
ST + N N+Y + L+ + + + PP + ++D+ S ++
Sbjct: 222 AASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGSTV--------LLDTFSPIS 273
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFED-ANLR 280
+ Y + + +A EP LC+ P + F F A +
Sbjct: 274 FLVDGAYQAVKKAVTVAVGAPPMATPV---EPFDLCFPKSGASGAAPDLVFTFRGGAAMT 330
Query: 281 IDGENVFIIDYENHFFLLAVAPHDDL-----VALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
+ N +++DY+N LA+ L ++L+GS QQ + F++DL+ + LSF +
Sbjct: 331 VPATN-YLLDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPAD 389
Query: 336 CS 337
C+
Sbjct: 390 CT 391
>gi|383165471|gb|AFG65613.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
Length = 136
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 47/138 (34%), Positives = 77/138 (55%), Gaps = 13/138 (9%)
Query: 27 IFDPRKSSSFQKINCDHPDCTY---FKCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGK 82
I+DP +SS++ K++C C F+C + C Y Y D S+T G ++ET+++ K
Sbjct: 6 IYDPARSSTYSKVSCKSLLCNALPDFECKSAAGCEYQYTYGDFSITVGILSYETLTLTSK 65
Query: 83 GEGKAIFHGALFGCS--NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYC 140
+ + FGC N+ +GFD+ AG++GL R +S ISQL + + K+FSYC
Sbjct: 66 SGAEQLIPNFAFGCGQNNEGNGFDQG------AGIVGLGRGPLSLISQLSASMPKKFSYC 119
Query: 141 LVIPLPNGEYTSSYLKFG 158
L+ + + + +S L FG
Sbjct: 120 LMT-IDDSQSKTSPLMFG 136
>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
Length = 492
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 93/366 (25%), Positives = 151/366 (41%), Gaps = 55/366 (15%)
Query: 3 RLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCTY 48
R+ IGTP LI+DTGS + Y F P SSS++ + C +C+
Sbjct: 38 RVKIGTPPHEFSLIVDTGSTVTYVPCSSCTHCGNHQDPRFSPALSSSYKPLECGS-ECST 96
Query: 49 FKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGE--GKAIFHGALFGCSNDNHGFDED 106
C + Y +YA++S + G + I + G+ + +FGC G D
Sbjct: 97 GFCDGSR-KYQRQYAEKSTSSGVLGKDVIGFSNSSDLGGQRL----VFGCETAETG---D 148
Query: 107 ARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
D G++GL R +S I QL + ++ FS C + + G G++
Sbjct: 149 LYDQTADGIIGLGRGPLSIIDQLVEKNAMEDVFSLC----YGGMDEGGGAMILG---GFQ 201
Query: 165 RPSTQA-TKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
P T H + +Y L LK I + + P+ FD G+ G ++DSG+ YF
Sbjct: 202 PPKDMVFTASDPHRSPYYNLMLKGIRVGGSPLRLKPEVFD----GKYGTVLDSGTTYAYF 257
Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQ----LCYFLPET-----FNRFPSMAFYF 274
+ + F S + Q+ L + P P + +CY T FPS+ F F
Sbjct: 258 PGAAF----QAFKSAVKE-QVGSLKEVPGPDEKFKDICYAGAGTNVSNLSQFFPSVDFVF 312
Query: 275 EDA-NLRIDGEN-VFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
D ++ + EN +F + + L V + D L+G R+ Y+ + F+
Sbjct: 313 GDGQSVTLSPENYLFRHTKISGAYCLGVFENGDPTTLLGGIIVRNMLVTYNRGKASIGFL 372
Query: 333 KENCSD 338
K C+D
Sbjct: 373 KTKCND 378
>gi|225440720|ref|XP_002275202.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 92/388 (23%), Positives = 155/388 (39%), Gaps = 69/388 (17%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA----------------------IFDPRKSSSFQKI 39
+ L GTP + + ++DTGS +++A IF+P+ SSS + +
Sbjct: 89 IPLSFGTPPQKLSFLVDTGSHVVWAPCTTHYTCTNCSFSDAEPKKVPIFNPKLSSSSKIL 148
Query: 40 NCDHPDCTYFK-----------------CVNEQCVYTMKYADQSVTKGFAAHETISVIGK 82
C +P C C + Y+++Y + + F E ++ GK
Sbjct: 149 GCRNPKCVNTSSPDVHLGCPPCNGNSKNCSHACPPYSLQYGTGASSGDFLL-ENLNFPGK 207
Query: 83 GEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV 142
H L GC+ G + ALAG R S Q+G K+F+YCL
Sbjct: 208 -----TIHEFLVGCTTSAVG---EVTSAALAG---FGRSMFSLPMQMGV---KKFAYCLN 253
Query: 143 IPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNNF---YYLSLKDISIDNERMNFPP 199
+ SS L G + + A F+ +P +F YYL +KDI I N+ + P
Sbjct: 254 SHDYDDTRNSSKLILDYSDGETKGLSYA-PFLKNPPDFPIYYYLGVKDIKIGNKLLRIPS 312
Query: 200 DTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY- 258
G GG +IDSG Y V+ K+ + +++ + ++ + CY
Sbjct: 313 KYLAPGSDGRGGLMIDSGFAYGYMTGPVFKKVTNELKKRMSKYRRSLEAEAEIGVTPCYN 372
Query: 259 FLPETFNRFPSMAFYFE-DANLRIDGENVFIIDYE---NHFFLLAVAPHDDL------VA 308
F + + P + + F A + + G+N F++ E F L A + L
Sbjct: 373 FTGQKSIKIPDLIYQFRGGATMVVPGKNYFVLIPEISLACFPLTTDAGTNTLEFTPGPSI 432
Query: 309 LIGSQQQRDTRFVYDLNIDLLSFVKENC 336
++G+ Q D +DL + L F ++ C
Sbjct: 433 ILGNSQHVDYYVEFDLKNERLGFRQQTC 460
>gi|361068027|gb|AEW08325.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165459|gb|AFG65601.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165460|gb|AFG65602.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165461|gb|AFG65603.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165462|gb|AFG65604.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165463|gb|AFG65605.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165465|gb|AFG65607.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165466|gb|AFG65608.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165467|gb|AFG65609.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165468|gb|AFG65610.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165469|gb|AFG65611.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165472|gb|AFG65614.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165473|gb|AFG65615.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165474|gb|AFG65616.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165475|gb|AFG65617.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165476|gb|AFG65618.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
Length = 136
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 47/138 (34%), Positives = 77/138 (55%), Gaps = 13/138 (9%)
Query: 27 IFDPRKSSSFQKINCDHPDCTY---FKCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGK 82
I+DP +SS++ K++C C F+C + C Y Y D S+T G ++ET+++ K
Sbjct: 6 IYDPARSSTYSKVSCKSLLCNALPDFECKSTAGCEYQYTYGDFSITVGILSYETLTLTSK 65
Query: 83 GEGKAIFHGALFGCS--NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYC 140
+ + FGC N+ +GFD+ AG++GL R +S ISQL + + K+FSYC
Sbjct: 66 SGAEQLIPNFAFGCGQNNEGNGFDQG------AGIVGLGRGPLSLISQLSASMPKKFSYC 119
Query: 141 LVIPLPNGEYTSSYLKFG 158
L+ + + + +S L FG
Sbjct: 120 LMT-IDDSQSKTSPLMFG 136
>gi|224091907|ref|XP_002309394.1| predicted protein [Populus trichocarpa]
gi|222855370|gb|EEE92917.1| predicted protein [Populus trichocarpa]
Length = 469
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 96/393 (24%), Positives = 152/393 (38%), Gaps = 77/393 (19%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY----------------------AIFDPRKSSSFQKI 39
+ L GTP + ++DTGS+L++ F P++SSS I
Sbjct: 94 ISLNFGTPPQTTKFVMDTGSSLVWFPCTSRYLCSRCDFPNIEVTGIPTFIPKQSSSSNLI 153
Query: 40 NCDHPDCTYF-------KCVNEQCVYTMKYADQSV-----------TKGFAAHETISVIG 81
C + C++ KC ++C T + QS T G ET+
Sbjct: 154 GCKNHKCSWLFGPKVQSKC--QECDPTTQNCTQSCPPYVIQYGLGSTAGLLLSETLDF-- 209
Query: 82 KGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL 141
K G L GCS + E G+ G R S SQLG K+FSYCL
Sbjct: 210 --PHKKTIPGFLVGCSLFSIRQPE--------GIAGFGRSPESLPSQLG---LKKFSYCL 256
Query: 142 VIPLPNGEYTSSYLKFGTDMG---YRRPSTQATKFINHPN----NFYYLSLKDISIDNER 194
V + SS L T G + P T F +P ++YY+ L++I I +
Sbjct: 257 VSHAFDDTPASSDLVLDTGSGSDDTKTPGLSYTPFQKNPTAAFRDYYYVLLRNIVIGDTH 316
Query: 195 MNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPI 254
+ P G GG I+DSG+ T+ VY + ++F + +A +
Sbjct: 317 VKVPYKFLVPGSDGNGGTIVDSGTTFTFMEKPVYELVAKEFEKQVAHYTVATEVQNQTGL 376
Query: 255 QLCYFLP-ETFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVA---- 308
+ C+ + E P F+F+ A + + N F L V+ D++
Sbjct: 377 RPCFNISGEKSVSVPEFIFHFKGGAKMALPLANYFSFVDSGVICLTIVS--DNMSGSGIG 434
Query: 309 -----LIGSQQQRDTRFVYDLNIDLLSFVKENC 336
++G+ QQR+ +DL + F ++NC
Sbjct: 435 GGPAIILGNYQQRNFHVEFDLKNERFGFKQQNC 467
>gi|449482385|ref|XP_004156266.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 491
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 97/374 (25%), Positives = 157/374 (41%), Gaps = 57/374 (15%)
Query: 3 RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
++ +G P++ + +DTGS +++ +FD KSSS + + C
Sbjct: 87 KVKLGNPAREFNVQIDTGSDILWVTCSPCDGCPDSSGLGIELNLFDTTKSSSARVLPCTD 146
Query: 44 PDCTYFKCVNEQCV-------YTMKYADQSVTKGFAAHETISV-IGKGEGKAIFHGA--L 93
P C +QC+ Y+ Y D+S T GF +++ I GE A +
Sbjct: 147 PICAAVSTTTDQCLTQTDHCSYSFHYRDRSGTSGFYVTDSMHFDILLGESTIANSSATIV 206
Query: 94 FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEYT 151
FGCS +G D AL G+ G + S ISQL S I K FS+C L GE
Sbjct: 207 FGCSIYQYG-DLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHC----LKGGENG 261
Query: 152 SSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFP-PDTFDITVSGE 209
L G + PS + I + P+ Y L L+ I++ + FP P F I+ +GE
Sbjct: 262 GGILVLGEIL---EPSIVYSPLIPSQPH--YTLKLQSIALSGQL--FPNPTMFPISNAGE 314
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPS 269
IIDSG+ L Y +VY + S + +S + ++ + + FP
Sbjct: 315 --TIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQCFRVSMSVADI---FPV 369
Query: 270 MAFYFED-ANLRIDGENVFIID-----YE-NHFFLLAVAPHDDLVALIGSQQQRDTRFVY 322
+ F FE A++ + E D Y+ + + +D + ++G +D VY
Sbjct: 370 LRFNFEGIASMVVTPEEYLQFDSIVSCYKFASLWCIGFQKAEDGLNILGDLVLKDKIIVY 429
Query: 323 DLNIDLLSFVKENC 336
DL + + +C
Sbjct: 430 DLAQQRIGWANYDC 443
>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
gi|223973065|gb|ACN30720.1| unknown [Zea mays]
Length = 631
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 90/364 (24%), Positives = 149/364 (40%), Gaps = 48/364 (13%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
RL+IGTP + LI+D+GS + Y F P SSS+ + C+ DCT
Sbjct: 90 TRLYIGTPPQEFALIVDSGSTVTYVPCSSCEQCGNHQDPRFQPDLSSSYSPVKCNV-DCT 148
Query: 48 YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG--FDE 105
+QC Y +YA+ S + G + +S + E K A+FGC N G F +
Sbjct: 149 -CDSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRESELKP--QHAIFGCENSETGDLFSQ 205
Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCL-VIPLPNGEYTSSYLKFGTDMG 162
A G++GL R +S + QL +I FS C + + G + DM
Sbjct: 206 HA-----DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLGGMLAPPDMI 260
Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
+ + + P +Y + LK+I + + + F+ + G ++DSG+ Y
Sbjct: 261 FSN-----SDPLRSP--YYNIELKEIHVAGKALRVESRIFN----SKHGTVLDSGTTYAY 309
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF-----LPETFNRFPSMAFYFEDA 277
+ E S + + D P +C+ + + FP + F +
Sbjct: 310 LPEQAFVAFKEAVTSKVHSLKKIRGPD-PSYKDICFAGAGRNVSKLHEVFPDVDMVFGNG 368
Query: 278 N-LRIDGEN-VFIIDYENHFFLLAVAPH-DDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
L + EN +F + + L V + D L+G R+T YD + + + F K
Sbjct: 369 QKLSLTPENYLFRHSKVDGAYCLGVFQNGKDPTTLLGGIIVRNTLVTYDRHNEKIGFWKT 428
Query: 335 NCSD 338
NCS+
Sbjct: 429 NCSE 432
>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
Length = 388
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 87/387 (22%), Positives = 158/387 (40%), Gaps = 74/387 (19%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
++ +G P K ++ +DTGS +++ ++DPR+SS+ ++C
Sbjct: 4 TQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSLVSCS 63
Query: 43 HPDCTYFKCVNE--------QCVYTMKYADQSVTKGFAAHETI--SVIGKGEGKAIFHGA 92
P C + E C Y Y D S ++G+ + + +VI
Sbjct: 64 DPLCVRGRRFAEAQCSQATNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANTTSQV 123
Query: 93 LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEY 150
LFGCS G D A+ G++G ++ +S +QL + I + FS+CL E
Sbjct: 124 LFGCSIRQTG-DLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCL-------EG 175
Query: 151 TSSYLKFGTDMGYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGE 209
G P T + P++ +Y + L+ IS+++ R+ P D D + + +
Sbjct: 176 EKRGGGILVIGGIAEPGMTYTPLV--PDSVHYNVVLRGISVNSNRL--PIDAEDFSSTND 231
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQ----LCYFLPETF- 264
G I+DSG+ L YF S Y + F + S P +Q C+ +
Sbjct: 232 TGVIMDSGTTLAYFPSGAY--------NVFVQAIREATSATPVRVQGMDTQCFLVSGRLS 283
Query: 265 NRFPSMAFYFEDANLRIDGEN--------------VFIIDYENHFFLLAVAPHD-DLVAL 309
+ FP++ FE + + +N V+ I +++ + P D + +
Sbjct: 284 DLFPNVTLNFEGGAMELQPDNYLMWGGTAPTGTTDVWCIGWQSSSS--SAGPKDGSQLTI 341
Query: 310 IGSQQQRDTRFVYDLNIDLLSFVKENC 336
+G +D VYDL+ + ++ NC
Sbjct: 342 LGDIVLKDKLVVYDLDNSRIGWMSYNC 368
>gi|356507997|ref|XP_003522749.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 440
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 85/360 (23%), Positives = 149/360 (41%), Gaps = 45/360 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY-----------AIFDPRKSSSFQKINCDHPDCTYF 49
+VR+ +GTP + + ++LDT + + F P+ S+S+ ++C P C
Sbjct: 101 VVRVKLGTPGQLLFMVLDTSTDEAFVPCSGCTGCSDTTFSPKASTSYGPLDCSVPQCGQV 160
Query: 50 KCVN------EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
+ ++ C + YA S + +++ + + FGC N G
Sbjct: 161 RGLSCPATGTGACSFNQSYAGSSFSATLV-QDSLRL-----ATDVIPNYSFGCVNAITGA 214
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
A+ R +S +SQ GS FSYCL P Y S LK G +G
Sbjct: 215 SVPAQGLLGL-----GRGPLSLLSQSGSNYSGIFSYCL--PSFKSYYFSGSLKLG-PVGQ 266
Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
+ S + T + P+ + YY++ IS+ + FP + + G IIDSG+V+T
Sbjct: 267 PK-SIRTTPLLRSPHRPSLYYVNFTGISVGRVLVPFPSEYLGFNPNTGSGTIIDSGTVIT 325
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRF-PSMAFYFEDANLR 280
F VY + E+F + C+ +T+ P + +FE +L+
Sbjct: 326 RFVEPVYNAVREEFRKQVGGTTFTSIGA----FDTCFV--KTYETLAPPITLHFEGLDLK 379
Query: 281 IDGENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ EN I LA+A D ++ +I + QQ++ R ++D + + +E C
Sbjct: 380 LPLENSLIHSSAGSLACLAMAAAPDNVNSVLNVIANFQQQNLRILFDTVNNKVGIAREVC 439
>gi|383165464|gb|AFG65606.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165470|gb|AFG65612.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
Length = 136
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 47/138 (34%), Positives = 78/138 (56%), Gaps = 13/138 (9%)
Query: 27 IFDPRKSSSFQKINCDHPDCTY---FKCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGK 82
I+DP +SS++ K++C C F+C + C Y Y D S+T G ++ET+++ K
Sbjct: 6 IYDPARSSTYSKVSCKSLLCNALPDFECKSTAGCEYQYTYGDFSITVGILSYETLTLTSK 65
Query: 83 GEGKAIFHGALFGC--SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYC 140
+ + FGC +N+ +GFD+ A G++GL R +S ISQL + + K+FSYC
Sbjct: 66 SGAEQLIPKFAFGCGQNNEGNGFDQGA------GIVGLGRGPLSLISQLSASMPKKFSYC 119
Query: 141 LVIPLPNGEYTSSYLKFG 158
L+ + + + +S L FG
Sbjct: 120 LMT-IDDSQSKTSPLMFG 136
>gi|21450872|gb|AAK44106.2|AF370291_1 unknown protein [Arabidopsis thaliana]
Length = 375
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 85/368 (23%), Positives = 151/368 (41%), Gaps = 56/368 (15%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY-------------AIFDPRKSSSFQKINCDHPDCT 47
+VR +GTP + + ++LDT + ++ F+ SS++ ++C CT
Sbjct: 31 VVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGCSNASTSFNTNSSSTYSTVSCSTAQCT 90
Query: 48 YFK---CVNEQ-----CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
+ C + C + Y S +T+++ + FGC N
Sbjct: 91 QARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTLTL-----APDVIPNFSFGCINS 145
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
G + G++GL R +S +SQ S+ FSYCL P Y S LK G
Sbjct: 146 ASGNSLPPQ-----GLMGLGRGPMSLVSQTTSLYSGVFSYCL--PSFRSFYFSGSLKLGL 198
Query: 160 DMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
+G + S + T + +P + YY++L +S+ + ++ P + G IIDSG
Sbjct: 199 -LGQPK-SIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDANSGAGTIIDSG 256
Query: 218 SVLTYFHSDVYWKLHEKF-----VSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAF 272
+V+T F VY + ++F VS F L C F + N P +
Sbjct: 257 TVITRFAQPVYEAIRDEFRKQVNVSSFS--TLGAFDTC--------FSADNENVAPKITL 306
Query: 273 YFEDANLRIDGENVFIIDYENHFFLLAVAP----HDDLVALIGSQQQRDTRFVYDLNIDL 328
+ +L++ EN I L++A + ++ +I + QQ++ R ++D+
Sbjct: 307 HMTSLDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSR 366
Query: 329 LSFVKENC 336
+ E C
Sbjct: 367 IGIAPEPC 374
>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
Length = 632
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 89/364 (24%), Positives = 150/364 (41%), Gaps = 48/364 (13%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
RL+IGTP + LI+D+GS + Y F P SSS+ + C+ DCT
Sbjct: 91 TRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSSYSPVKCNV-DCT 149
Query: 48 YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG--FDE 105
+QC Y +YA+ S + G + +S + E K A+FGC N G F +
Sbjct: 150 -CDSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRESELKP--QRAVFGCENSETGDLFSQ 206
Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCL-VIPLPNGEYTSSYLKFGTDMG 162
A G++GL R +S + QL +I FS C + + G + +DM
Sbjct: 207 HAD-----GIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLGGVPAPSDMV 261
Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
+ + + P +Y + LK+I + + + F+ + G ++DSG+ Y
Sbjct: 262 FSH-----SDPLRSP--YYNIELKEIHVAGKALRVDSRVFN----SKHGTVLDSGTTYAY 310
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF-----LPETFNRFPSMAFYFEDA 277
+ + S + + D P +C+ + + FP + F +
Sbjct: 311 LPEQAFVAFKDAVTSKVHSLKKIRGPD-PNYKDICFAGAGRNVSKLHEVFPDVDMVFGNG 369
Query: 278 N-LRIDGEN-VFIIDYENHFFLLAVAPH-DDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
L + EN +F + + L V + D L+G R+T YD + + + F K
Sbjct: 370 QKLSLTPENYLFRHSKVDGAYCLGVFQNGKDPTTLLGGIIVRNTLVTYDRHNEKIGFWKT 429
Query: 335 NCSD 338
NCS+
Sbjct: 430 NCSE 433
>gi|226492633|ref|NP_001149953.1| LOC100283580 precursor [Zea mays]
gi|195635701|gb|ACG37319.1| pepsin A [Zea mays]
Length = 491
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 95/382 (24%), Positives = 159/382 (41%), Gaps = 71/382 (18%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
R+ IG+P KG + +DTGS +++ +DP S + + C+
Sbjct: 86 TRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPTRSGLGIELTQYDPAGSGT--TVGCE 143
Query: 43 HPDCTYFKC---------VNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFH 90
C + C + + Y D S T GF + + V G G+
Sbjct: 144 QEFCVANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQYNQVSGNGQTTTSNA 203
Query: 91 GALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNG 148
FGC G D + + AL G+LG + S +SQL + ++K F++CL G
Sbjct: 204 SITFGCGA-QLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLDTVRGGG 262
Query: 149 EYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVS 207
+ + +P + T + PN +Y ++L+ IS+ + P TFD S
Sbjct: 263 IFAIGNVV--------QPKVKTTPLV--PNVTHYNVNLQGISVGGATLQLPTSTFD---S 309
Query: 208 GEG-GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFN 265
G+ G IIDSG+ L Y +VY L + F+++Q L + + + C+ F +
Sbjct: 310 GDSKGTIIDSGTTLAYLPREVYRTL---LAAVFDKYQDLPLHNYQDFV--CFQFSGSIDD 364
Query: 266 RFPSMAFYFEDANLRIDGENVFIIDY----ENHFFLL-----AVAPHDDL-VALIGSQQQ 315
FP + F F+ +L + NV+ DY N + + V D + L+G
Sbjct: 365 GFPVITFSFK-GDLTL---NVYPDDYLFQNRNDLYCMGFLDGGVQTKDGKDMLLLGDLVL 420
Query: 316 RDTRFVYDLNIDLLSFVKENCS 337
+ VYDL +++ + NCS
Sbjct: 421 SNKLVVYDLEKEVIGWTDYNCS 442
>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
Length = 395
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 86/383 (22%), Positives = 154/383 (40%), Gaps = 66/383 (17%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
++ +G P K ++ +DTGS +++ ++DPR+SS+ ++C
Sbjct: 31 TQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSLVSCS 90
Query: 43 HPDCTYFKCVNE--------QCVYTMKYADQSVTKGFAAHETI--SVIGKGEGKAIFHGA 92
P C + E C Y Y D S ++G+ + + +VI
Sbjct: 91 DPLCVRGRRFAEAQCSQTTNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANTTSQV 150
Query: 93 LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEY 150
LFGCS G D A+ G++G ++ +S +QL + I + FS+CL E
Sbjct: 151 LFGCSIRQTG-DLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCL-------EG 202
Query: 151 TSSYLKFGTDMGYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGE 209
G P T + P++ +Y + L+ IS+++ R+ P D D + + +
Sbjct: 203 EKRGGGILVIGGIAEPGMTYTPLV--PDSVHYNVVLRGISVNSNRL--PIDAEDFSSTND 258
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF-NRFP 268
G I+DSG+ L YF S Y FV A C+ + + FP
Sbjct: 259 TGVIMDSGTTLAYFPSGAY----NVFVQAIREATSATPVRVQGMDTQCFLVSGRLSDLFP 314
Query: 269 SMAFYFEDANLRIDGEN--------------VFIIDYENHFFLLAVAPHD-DLVALIGSQ 313
++ FE + + +N V+ I +++ + P D + ++G
Sbjct: 315 NVTLNFEGGAMELQPDNYLMWGGTAPTGTTDVWCIGWQSSSS--SAGPKDGSQLTILGDI 372
Query: 314 QQRDTRFVYDLNIDLLSFVKENC 336
+D VYDL+ + ++ NC
Sbjct: 373 VLKDKLVVYDLDNSRIGWMSYNC 395
>gi|224102847|ref|XP_002312826.1| predicted protein [Populus trichocarpa]
gi|222849234|gb|EEE86781.1| predicted protein [Populus trichocarpa]
Length = 445
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 96/397 (24%), Positives = 157/397 (39%), Gaps = 82/397 (20%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY-----------------------AIFDPRKSSSFQK 38
V L GTP + + I+DTGS +++ F P++SSS +
Sbjct: 69 VSLSFGTPPQTLSFIMDTGSDIVWFPCTSHYLCKHCSFSSSSPSSRIQPFIPKESSSSKL 128
Query: 39 INCDHPDCTYF--------------KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGE 84
+ C +P C++ C+N+ C M + T G A ET+ +
Sbjct: 129 LGCKNPKCSWIHHSNINCDQDCSIKSCLNQTCPPYMIFYGSGTTGGVALSETLHL--HSL 186
Query: 85 GKAIFHGALFGCSN-DNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVI 143
K F L GCS +H AG+ G R S SQLG +FSYCL+
Sbjct: 187 SKPNF---LVGCSVFSSH---------QPAGIAGFGRGLSSLPSQLG---LGKFSYCLLS 231
Query: 144 -PLPNGEYTSSYLKFGTDMGYRRPSTQA---TKFINHP--------NNFYYLSLKDISID 191
+ SS L + T A T F+ +P + +YYL L+ I++
Sbjct: 232 HRFDDDTKKSSSLVLDMEQLDSDKKTNALVYTPFVKNPKVDNKSSFSVYYYLGLRRITVG 291
Query: 192 NERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP 251
+ P G GG IIDSG+ T+ + + L ++F+ + ++ + +
Sbjct: 292 GHHVKVPYKYLSPGEDGNGGVIIDSGTTFTFMAREAFEPLSDEFIRQIKDYRRVKEIEDA 351
Query: 252 EPIQLCYFLPETFN-RFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVA- 308
++ C+ + + FP + YF+ A++ + EN F L V D VA
Sbjct: 352 IGLRPCFNVSDAKTVSFPELRLYFKGGADVALPVENYFAFVGGEVACLTVVT---DGVAG 408
Query: 309 ---------LIGSQQQRDTRFVYDLNIDLLSFVKENC 336
++G+ Q ++ YDL + L F +E C
Sbjct: 409 PERVGGPGMILGNFQMQNFYVEYDLRNERLGFKQEKC 445
>gi|147819672|emb|CAN76394.1| hypothetical protein VITISV_020864 [Vitis vinifera]
Length = 507
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 79/315 (25%), Positives = 139/315 (44%), Gaps = 55/315 (17%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
++ IGTPSK + +DTGS +++ ++D + S++ + CD
Sbjct: 80 AKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVGCD 139
Query: 43 HPDCTYFK-----CV-NEQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGAL 93
C+ + C QC+Y++ Y D S T G+ + + + G + +
Sbjct: 140 DNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGTVV 199
Query: 94 FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCL-------VIP 144
FGC N G + + AL G+LG + S +SQL S +KK FS+CL +
Sbjct: 200 FGCGNKQSG-ELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDGGGIFA 258
Query: 145 LPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDI 204
+ GE ++F F++ + Y + +K+I + + ++ P D F+
Sbjct: 259 I--GEVVEPKVRF----LLMNSVMIVVLFLSRAH--YNVVMKEIEVGGDPLDVPSDAFE- 309
Query: 205 TVSGE-GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPE 262
SG+ G IIDSG+ L YF +VY L EK +S +L + E C+ +
Sbjct: 310 --SGDRKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTV----EQAFTCFDYTGN 363
Query: 263 TFNRFPSMAFYFEDA 277
+ FP++ +F+ +
Sbjct: 364 VDDGFPTVTLHFDKS 378
>gi|224029721|gb|ACN33936.1| unknown [Zea mays]
gi|413946782|gb|AFW79431.1| pepsin A [Zea mays]
Length = 534
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 60/210 (28%), Positives = 90/210 (42%), Gaps = 14/210 (6%)
Query: 28 FDPRKSSSFQKINCDHPDCT---YFKCVN----EQCVYTMKYADQSVTKGFAAHETISVI 80
+ P KSSS+++I C +C Y C + E C Y K D +VT G E +V
Sbjct: 187 YRPAKSSSWRRIRCSQKECAVLPYNTCQSPSKAESCSYFQKTQDGTVTIGIYGKEKATVT 246
Query: 81 GKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYC 140
A G + GCS G DA D GVL L +SF +RFS+C
Sbjct: 247 VSDGRMAKLPGLILGCSVLEAGGSVDAHD----GVLSLGNGDMSFAVHAAKRFGQRFSFC 302
Query: 141 LVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFP 198
L + + SSYL FG + P T T + + + Y + + + ER++ P
Sbjct: 303 L-LSANSSRDASSYLTFGPNPAVMGPGTMETDILYNVDVKPAYGAQVTGVLVGGERLDIP 361
Query: 199 PDTFDITVSGEGGCIIDSGSVLTYFHSDVY 228
+ +D GG I+D+ + +T + Y
Sbjct: 362 DEVWDAERFVGGGVILDTSTSVTSLVPEAY 391
>gi|18391062|ref|NP_563851.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|2160166|gb|AAB60729.1| F21M12.13 gene product [Arabidopsis thaliana]
gi|21593996|gb|AAM65914.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|26983826|gb|AAN86165.1| unknown protein [Arabidopsis thaliana]
gi|332190367|gb|AEE28488.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 449
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 85/368 (23%), Positives = 151/368 (41%), Gaps = 56/368 (15%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY-------------AIFDPRKSSSFQKINCDHPDCT 47
+VR +GTP + + ++LDT + ++ F+ SS++ ++C CT
Sbjct: 105 VVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGCSNASTSFNTNSSSTYSTVSCSTAQCT 164
Query: 48 YFK---CVNEQ-----CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
+ C + C + Y S +T+++ + FGC N
Sbjct: 165 QARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTLTL-----APDVIPNFSFGCINS 219
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
G + G++GL R +S +SQ S+ FSYCL P Y S LK G
Sbjct: 220 ASGNSLPPQ-----GLMGLGRGPMSLVSQTTSLYSGVFSYCL--PSFRSFYFSGSLKLGL 272
Query: 160 DMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
+G + S + T + +P + YY++L +S+ + ++ P + G IIDSG
Sbjct: 273 -LGQPK-SIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDANSGAGTIIDSG 330
Query: 218 SVLTYFHSDVYWKLHEKF-----VSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAF 272
+V+T F VY + ++F VS F L C F + N P +
Sbjct: 331 TVITRFAQPVYEAIRDEFRKQVNVSSFS--TLGAFDTC--------FSADNENVAPKITL 380
Query: 273 YFEDANLRIDGENVFIIDYENHFFLLAVAP----HDDLVALIGSQQQRDTRFVYDLNIDL 328
+ +L++ EN I L++A + ++ +I + QQ++ R ++D+
Sbjct: 381 HMTSLDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSR 440
Query: 329 LSFVKENC 336
+ E C
Sbjct: 441 IGIAPEPC 448
>gi|226491620|ref|NP_001149154.1| pepsin A precursor [Zea mays]
gi|195625132|gb|ACG34396.1| pepsin A [Zea mays]
Length = 537
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 60/210 (28%), Positives = 90/210 (42%), Gaps = 14/210 (6%)
Query: 28 FDPRKSSSFQKINCDHPDCT---YFKCVN----EQCVYTMKYADQSVTKGFAAHETISVI 80
+ P KSSS+++I C +C Y C + E C Y K D +VT G E +V
Sbjct: 190 YRPAKSSSWRRIRCSQKECAVLPYNTCQSPSKAESCSYFQKTQDGTVTIGIYGKEKATVT 249
Query: 81 GKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYC 140
A G + GCS G DA D GVL L +SF +RFS+C
Sbjct: 250 VSDGRMAKLPGLILGCSVLEAGGSVDAHD----GVLSLGNGDMSFAVHAAKRFGQRFSFC 305
Query: 141 LVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFP 198
L + + SSYL FG + P T T + + + Y + + + ER++ P
Sbjct: 306 L-LSANSSRDASSYLTFGPNPAVMGPGTMETDILYNVDVKPAYGAKVTGVLVGGERLDIP 364
Query: 199 PDTFDITVSGEGGCIIDSGSVLTYFHSDVY 228
+ +D GG I+D+ + +T + Y
Sbjct: 365 DEVWDAERFVGGGVILDTSTSVTSLVPEAY 394
>gi|88174565|gb|ABD39357.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
Length = 323
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 83/340 (24%), Positives = 144/340 (42%), Gaps = 47/340 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYAIFD-------PR-----KSSSFQKINCDHPDCTY 48
++ + +GTPSK ++ +DTGS+ + + PR +S++ K++C C
Sbjct: 2 VISVGLGTPSKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQSRSTTCAKVSCGTSMCLL 61
Query: 49 F----KCVNEQ----CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
C + + C + + Y D S + G +T++ G FGC+ D+
Sbjct: 62 GGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF----SDVQKIPGFTFGCNMDS 117
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY---TSSYLKF 157
G +E G + G+LG+ +S + Q S FSYCL + + + T+ Y
Sbjct: 118 FGANE---FGNVDGLLGMGAGQMSVLKQ-SSPTFDGFSYCLPLQMSERGFFSKTTGYFSL 173
Query: 158 GTDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
G + R + TK + N +++ L IS+D ER+ P F G + D
Sbjct: 174 GGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFS-----RKGVVFD 228
Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET-FNRFPSMAFYF 274
SGS L+Y L ++ R A+ E + CY + P+++ +F
Sbjct: 229 SGSELSYIPDRALSVLSQRIRELLLRRGAAE----EESERNCYDMRSVDEGDMPAISLHF 284
Query: 275 ED-ANLRIDGENVFIID--YENHFFLLAVAPHDDLVALIG 311
+D A + VF+ E + LA AP + V++IG
Sbjct: 285 DDGARFDLGRHGVFVERSVQEQDVWCLAFAPTES-VSIIG 323
>gi|255571584|ref|XP_002526738.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223533927|gb|EEF35652.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 457
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 88/370 (23%), Positives = 152/370 (41%), Gaps = 59/370 (15%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY----------------AIFDPRKSSSFQKINCDHP 44
+++ IG+P+ I D+GS+L++ +F+P KS ++ K C+
Sbjct: 102 VMKFSIGSPAVDTYAIPDSGSSLVWLQCGTPYCRNCYRQKIPLFNPSKSVTYMKRLCNTA 161
Query: 45 DC------TYFKCV--NEQCVYTMKYADQSVTKG------FAAHETISVIGKGEGKAIFH 90
+C Y++C N+ C Y Y D S T+G F E IS G + IF
Sbjct: 162 ECRVALGDEYWRCKKPNQICKYHEDYLDDSYTEGVISTDIFTFPEHISGFGNYTLRIIF- 220
Query: 91 GALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY 150
GC +N D + G++GL+ S + Q+ +FSYC+ I
Sbjct: 221 ----GCGYNN----SDPQHFYPPGLVGLTNNKASLVGQMDV---DQFSYCVSIDTEQNLK 269
Query: 151 TSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDN-ERMNFPPDTFDITVSGE 209
S ++FG STQ N + + ++ I ++ E +P F T G+
Sbjct: 270 GSMEIRFGLAASISGHSTQLVP--NSDGWYIFKNVDGIYVNEFEVEGYPAWVFKYTEGGQ 327
Query: 210 GGCIIDSGSVLTYFHSDVY---WKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN- 265
GG +D+G+ T H+ V KL E+ ++ + +LCYF +
Sbjct: 328 GGLTMDTGTTYTELHNSVMDPLIKLLEEHITIVPEKDYSN-----SGFELCYFSDDFLGA 382
Query: 266 RFPSMAFYF---EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVY 322
P + F +D + N + + + L + +++IG Q RD + Y
Sbjct: 383 TLPDIELRFTDNKDTYFSFNTRNAWTPNGRSQMCLAMFRTNG--MSIIGMHQLRDIKIGY 440
Query: 323 DLNIDLLSFV 332
DL+ +++SF
Sbjct: 441 DLHHNIVSFT 450
>gi|15241713|ref|NP_195839.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75181297|sp|Q9LZL3.1|PCS1L_ARATH RecName: Full=Aspartic proteinase PCS1; AltName: Full=Aspartic
protease 38; Short=AtASP38; AltName: Full=Protein EMBRYO
DEFECTIVE 24; AltName: Full=Protein PROMOTION OF CELL
SURVIVAL 1; Flags: Precursor
gi|7340693|emb|CAB82992.1| putative protein [Arabidopsis thaliana]
gi|50897174|gb|AAT85726.1| At5g02190 [Arabidopsis thaliana]
gi|53828617|gb|AAU94418.1| At5g02190 [Arabidopsis thaliana]
gi|110742159|dbj|BAE99007.1| hypothetical protein [Arabidopsis thaliana]
gi|332003059|gb|AED90442.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 453
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 88/374 (23%), Positives = 152/374 (40%), Gaps = 59/374 (15%)
Query: 9 PSKGVLLILDTGSALIYAI------------FDPRKSSSFQKINCDHPDCTY-------- 48
P + + +++DTGS L + FDP +SSS+ I C P C
Sbjct: 82 PPQNISMVIDTGSELSWLRCNRSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRDFLIP 141
Query: 49 FKCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDA 107
C +++ C T+ YAD S ++G A E I G + +FGC G D +
Sbjct: 142 ASCDSDKLCHATLSYADASSSEGNLAAE-IFHFGNSTNDS---NLIFGCMGSVSGSDPE- 196
Query: 108 RDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV----IP--LPNGEYTSSYLKFGTDM 161
D G+LG++R ++SFISQ+G +FSYC+ P L G+ ++L T +
Sbjct: 197 EDTKTTGLLGMNRGSLSFISQMG---FPKFSYCISGTDDFPGFLLLGDSNFTWL---TPL 250
Query: 162 GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
Y +T Y + L I ++ + + P +G G ++DSG+ T
Sbjct: 251 NYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQTMVDSGTQFT 310
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-----PIQLCYFLPET------FNRFPSM 270
+ VY L F++ + + + P+ + LCY + +R P++
Sbjct: 311 FLLGPVYTALRSHFLNRTN--GILTVYEDPDFVFQGTMDLCYRISPVRIRSGILHRLPTV 368
Query: 271 AFYFEDANLRIDGENVFI----IDYENHFFLLAVAPHDDLVAL----IGSQQQRDTRFVY 322
+ FE A + + G+ + + N + DL+ + IG Q++ +
Sbjct: 369 SLVFEGAEIAVSGQPLLYRVPHLTVGNDSVYCFTFGNSDLMGMEAYVIGHHHQQNMWIEF 428
Query: 323 DLNIDLLSFVKENC 336
DL + C
Sbjct: 429 DLQRSRIGLAPVEC 442
>gi|357460823|ref|XP_003600693.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355489741|gb|AES70944.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 94/379 (24%), Positives = 152/379 (40%), Gaps = 69/379 (18%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY----------AIFDPRKSSSFQKINCDHPDCTY-- 48
++ L IGTP + ++LDTGS L + A FDP SS+F + C HP C
Sbjct: 76 IINLPIGTPPQTQPMVLDTGSQLSWIQCHKKQPPTASFDPSLSSTFSILPCTHPLCKPRI 135
Query: 49 ------FKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
C N C Y+ YAD + +G E + + + GC+ ++
Sbjct: 136 PDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTFSRSVSTPPL----ILGCATES- 190
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
D R G+LG++ +SF Q S I K FSYC+ P + + G+
Sbjct: 191 ---TDPR-----GILGMNLGRLSFAKQ--SKITK-FSYCV----PPRQTRPGFTPTGSFY 235
Query: 162 GYRRPSTQATKFI-------NHPNNF----YYLSLKDISIDNERMNFPPDTFDITVSGEG 210
PS++ K++ NF Y + + I I +++N P F G G
Sbjct: 236 LGNNPSSKGFKYVGMMTSSRQRMPNFDPLAYTIPMVGIRIAGKKLNISPAVFRADAGGSG 295
Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFV-SYFERFQLAQLS--------DCPEPIQLCYFLP 261
+IDSGS TY S+ Y K+ + V + R + + D + +++ +
Sbjct: 296 QTMIDSGSEFTYLVSEAYDKVRAQVVRAVGPRLKKGYVYGGVADMCFDSVKAVEIGRLIG 355
Query: 262 ETFNRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVA---LIGSQQQRDT 318
E M F FE + + + D + + D L A +IG+ Q++
Sbjct: 356 E-------MVFEFERGVEVVIPKERVLADVGGGVHCVGIGSSDKLGAASNIIGNFHQQNL 408
Query: 319 RFVYDLNIDLLSFVKENCS 337
+DL + F K +CS
Sbjct: 409 WVEFDLVRRRVGFGKADCS 427
>gi|297819968|ref|XP_002877867.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323705|gb|EFH54126.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 469
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 93/394 (23%), Positives = 147/394 (37%), Gaps = 75/394 (19%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY----------------------AIFDPRKSSSFQKI 39
V L GTPS+ + + DTGS+L++ F P+ SSS + I
Sbjct: 92 VSLSFGTPSQTIPFVFDTGSSLVWFPCTSRYLCSDCNFSGLDPTQIPRFIPKNSSSSRVI 151
Query: 40 NCDHPDCTYFKCVNEQC---------------VYTMKYADQSVTKGFAAHETISVIGKGE 84
C +P C + N QC Y ++Y S T G E +
Sbjct: 152 GCQNPKCQFLFGANVQCRGCDPNTRNCTVPCPPYILQYGLGS-TAGILISEKLDFP---- 206
Query: 85 GKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIP 144
+ GCS AG+ G R S SQ+ K FS+CLV
Sbjct: 207 -DLTVPDFVVGCS--------VISTRTPAGIAGFGRGPESLPSQMK---LKSFSHCLVSR 254
Query: 145 LPNGEYTSSYLKFGTDMGYRR----PSTQATKFINHPN-------NFYYLSLKDISIDNE 193
+ ++ L T G++ P T F +PN +YYL+L+ I + ++
Sbjct: 255 RFDDTNVTTDLGLDTGSGHKSGSKTPGLSYTPFRKNPNVSNTAFLEYYYLNLRRIYVGSK 314
Query: 194 RMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP 253
+ P +G GG I+DSGS T+ V+ + E+F + + + +
Sbjct: 315 HVKIPYKFLAPGTNGNGGSIVDSGSTFTFMERPVFELVAEEFATQMSNYTREKDLEKVSG 374
Query: 254 IQLCYFLPETFN-RFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDL----- 306
I C+ + + P + F F+ A + + N F L V + +
Sbjct: 375 IAPCFNISGKGDVTVPELIFEFKGGAKMELPLSNYFSFVGNADTVCLTVVSDNTVNPGGG 434
Query: 307 ---VALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
++GS QQ++ YDL D F K+ CS
Sbjct: 435 TGPAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 468
>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 481
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 81/341 (23%), Positives = 143/341 (41%), Gaps = 38/341 (11%)
Query: 19 TGSALIYAIFDPRKSSSFQKINCDHPDCT------YFKCVNE-QCVYTMKYADQSVTKGF 71
+G + ++DP S + + + CD CT C C Y++ Y D S T G
Sbjct: 112 SGLGMDLTLYDPNLSKTSKAVPCDDEFCTSTYDGQISGCTKGMSCPYSITYGDGSTTSGS 171
Query: 72 AAHETIS---VIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQ 128
+ ++ V+G +FGC + G D +L G++G + S +SQ
Sbjct: 172 YIKDDLTFDRVVGDLRTVPDNTSVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQ 231
Query: 129 LGSI--IKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLK 186
L + +K+ FS+CL G + + +P + T + + Y + LK
Sbjct: 232 LAAAGKVKRIFSHCLDSISGGGIFAIGEV--------VQPKVKTTPLLQGMAH-YNVVLK 282
Query: 187 DISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQ 246
DI + + + P D D + SG G IIDSG+ L Y +Y +L EK ++ +L
Sbjct: 283 DIEVAGDPIQLPSDILD-SSSGR-GTIIDSGTTLAYLPVSIYDQLLEKILAQRSGMKLYL 340
Query: 247 LSDCPEPIQLCYFLPETFNR-FPSMAFYFEDA---------NLRIDGENVFIIDYENHFF 296
+ D + Y E+ + FP++ F FE+ L + E+++ + ++
Sbjct: 341 VED--QFTCFHYSDEESVDDLFPTVKFTFEEGLTLTTYPRDYLFLFKEDMWCVGWQKS-- 396
Query: 297 LLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+A + L+G + VYDL+ + + NCS
Sbjct: 397 -MAQTKDGKELILLGDLVLANKLVVYDLDNMAIGWADYNCS 436
>gi|88174567|gb|ABD39358.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
Length = 323
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 82/340 (24%), Positives = 144/340 (42%), Gaps = 47/340 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYAIFD-------PR-----KSSSFQKINCDHPDCTY 48
++ + +GTP+K ++ +DTGS+ + + PR +S++ K++C C
Sbjct: 2 VISVGLGTPAKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQSRSTTCAKVSCGTSMCLL 61
Query: 49 F----KCVNEQ----CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
C + + C + + Y D S + G +T++ G FGC+ D+
Sbjct: 62 GGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF----SDVQKIPGFTFGCNMDS 117
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY---TSSYLKF 157
G +E G + G+LG+ +S + Q S FSYCL + + + T+ Y
Sbjct: 118 FGANEF---GNVDGLLGMGAGQMSVLKQ-SSPTFDGFSYCLPLQMSERGFFSKTTGYFSL 173
Query: 158 GTDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
G + R + TK + N +++ L IS+D ER+ P F G + D
Sbjct: 174 GGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFS-----RKGVVFD 228
Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET-FNRFPSMAFYF 274
SGS L+Y L ++ R A+ E + CY + P+++ +F
Sbjct: 229 SGSELSYIPDRALSVLSQRIRELLLRRGAAE----EESERNCYDMRSVDEGDMPAISLHF 284
Query: 275 ED-ANLRIDGENVFIID--YENHFFLLAVAPHDDLVALIG 311
+D A + VF+ E + LA AP + V++IG
Sbjct: 285 DDGARFDLGSHGVFVERSVQEQDVWCLAFAPTES-VSIIG 323
>gi|356498711|ref|XP_003518193.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 466
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 94/398 (23%), Positives = 159/398 (39%), Gaps = 80/398 (20%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY------------------AIFDPRKSSSFQKINCDH 43
+ L GTPS+ +LDTGS L++ F P+ SSS + + C +
Sbjct: 88 IDLEFGTPSQTFPFVLDTGSTLVWLPCSSHYLCSKCNSFSNTPKFIPKNSSSSKFVGCTN 147
Query: 44 PDCTY-----------------FKCVNEQC-VYTMKYADQSVTKGFAAHETISVIGKGEG 85
P C + F ++ C YT++Y S T GF E ++ K
Sbjct: 148 PKCAWVFGPDVKSHCCRQDKAAFNNCSQTCPAYTVQYGLGS-TAGFLLSENLNFPTKK-- 204
Query: 86 KAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPL 145
+ L GCS AG+ G R S SQ+ RFSYCL+
Sbjct: 205 ---YSDFLLGCS--------VVSVYQPAGIAGFGRGEESLPSQMN---LTRFSYCLLSHQ 250
Query: 146 PNGEYTSS---YLKFGTDMGYRRPSTQATKFINHPNN--------FYYLSLKDISIDNER 194
+ T + L+ + + T F+ +P +YY++LK I + +R
Sbjct: 251 FDDSATITSNLVLETASSRDGKTNGVSYTPFLKNPTTKKNPAFGAYYYITLKRIVVGEKR 310
Query: 195 MNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPI 254
+ P + V G+GG I+DSGS T+ ++ + ++F + A+ ++ +
Sbjct: 311 VRVPRRLLEPNVDGDGGFIVDSGSTFTFMERPIFDLVAQEFAKQVS-YTRAREAEKQFGL 369
Query: 255 QLCYFL---PETFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVA-- 308
C+ L ET + FP + F F A +R+ N F + + L + DD+
Sbjct: 370 SPCFVLAGGAETAS-FPELRFEFRGGAKMRLPVANYFSLVGKGDVACLTIV-SDDVAGSG 427
Query: 309 -------LIGSQQQRDTRFVYDLNIDLLSFVKENCSDD 339
++G+ QQ++ YDL + F ++C +
Sbjct: 428 GTVGPAVILGNYQQQNFYVEYDLENERFGFRSQSCQTN 465
>gi|115466068|ref|NP_001056633.1| Os06g0119600 [Oryza sativa Japonica Group]
gi|55296446|dbj|BAD68569.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|55296924|dbj|BAD68375.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594673|dbj|BAF18547.1| Os06g0119600 [Oryza sativa Japonica Group]
gi|215694767|dbj|BAG89958.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737752|dbj|BAG96882.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 495
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 78/329 (23%), Positives = 131/329 (39%), Gaps = 50/329 (15%)
Query: 27 IFDPRKSSSFQKINCDHPDCTYFK------CVNEQCVYTMKYADQSVTKGFAAHETISVI 80
+FDP S+++ + C C N QC + + Y D S G + + ++ +
Sbjct: 198 LFDPAMSTTYAAVPCTSAACAQLGPYRRGCSANAQCQFGINYGDGSTATGTYSFDDLT-L 256
Query: 81 GKGEGKAIFHGALFGCSNDNHG--FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFS 138
G + + G FGC++ + G FD D +AG L L + S + Q + + FS
Sbjct: 257 GPYD---VIRGFRFGCAHADRGSAFDYD-----VAGSLALGGGSQSLVQQTATRYGRVFS 308
Query: 139 YCLVIPLPNGEYTSSYLKFGT--DMGYRRPSTQATKFINH--PNNFYYLSLKDISIDNER 194
YC LP + +L G + PS +T ++ FY + L+ I +
Sbjct: 309 YC----LPPTASSLGFLVLGVPPERAQLIPSFVSTPLLSSSMAPTFYRVLLRAIIVAGRP 364
Query: 195 MNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPI 254
+ PP F + +IDS ++++ Y L F S ++ A P+
Sbjct: 365 LAVPPAVFSAS------SVIDSSTIISRLPPTAYQALRAAFRSAMTMYRAA------PPV 412
Query: 255 QL---CY-FLPETFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAP--HDDLV 307
+ CY F PS+A F+ A + +D + + LA AP D +
Sbjct: 413 SILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILLGS------CLAFAPTASDRMP 466
Query: 308 ALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
IG+ QQ+ VYD+ + F C
Sbjct: 467 GFIGNVQQKTLEVVYDVPAKAMRFRTAAC 495
>gi|225217039|gb|ACN85323.1| aspartic proteinase nepenthesin-1 precursor [Oryza brachyantha]
Length = 287
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 75/305 (24%), Positives = 116/305 (38%), Gaps = 40/305 (13%)
Query: 45 DCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
D T C C+Y ++Y D S T GF A +T+++ G FGC N G
Sbjct: 10 DXTTRGCSGGHCLYGVQYGDGSYTIGFFAMDTLTL----SSHDAIKGFRFGCGERNEGLF 65
Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
+A AG+LGL R S Q F++C P + YL+FG
Sbjct: 66 GEA-----AGLLGLGRGKTSLPVQTYDKYGGVFAHC----FPARSSGTGYLEFGPG---S 113
Query: 165 RPSTQAT-----KFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
P+ A I+ FYY+ + I + + + P F G I+DSG+V
Sbjct: 114 SPAVSAKLSTTPMLIDTGPTFYYVGMTGIRVGGKLLPIPQSVFAAA-----GTIVDSGTV 168
Query: 220 LTYFHSDVYWKLHEKFVSY-----FERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFY 273
+T Y L F + ++R L D CY L P+++
Sbjct: 169 ITRLPPAAYSSLRSAFAASMAARGYKRAPALSLLD------TCYDLTGASEVAIPTVSLL 222
Query: 274 FEDA-NLRIDGEN-VFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSF 331
F+ +L +D ++ A D VA++G+ Q + VYD+ ++ F
Sbjct: 223 FQGGVSLDVDASGIIYAASVSQACLGFAGNEAADDVAIVGNTQLKTFGVVYDIASKVVGF 282
Query: 332 VKENC 336
C
Sbjct: 283 CPGAC 287
>gi|297723027|ref|NP_001173877.1| Os04g0336942 [Oryza sativa Japonica Group]
gi|255675342|dbj|BAH92605.1| Os04g0336942 [Oryza sativa Japonica Group]
Length = 388
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 76/312 (24%), Positives = 128/312 (41%), Gaps = 45/312 (14%)
Query: 6 IGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCDHPDC 46
IGTP+ + LDTGS + +DPR S S +++ CD C
Sbjct: 89 IGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEVKCDDTIC 148
Query: 47 TYFKCVNE--QCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGALFGCSNDNH 101
T N +C Y YAD +T G + + + G G+ + FGC
Sbjct: 149 TSRPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVTFGCGLQQS 208
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYTSSYLKFGT 159
G ++ A+ G++G + +SQL + KK FS+CL G + +
Sbjct: 209 GSLNNSAV-AIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGGGIFAIGEV---- 263
Query: 160 DMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
P + T + + ++ ++LK I++ + P + F T + G IDSGS
Sbjct: 264 ----VEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKT--KGTFIDSGST 317
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANL 279
L Y +Y +L + + + + Q +FL ++FP + F+FE+ +L
Sbjct: 318 LVYLPEIIYSELILAVFAKHPDITMGAMYN----FQCFHFLGSVDDKFPKITFHFEN-DL 372
Query: 280 RIDGENVFIIDY 291
+D V+ DY
Sbjct: 373 TLD---VYPYDY 381
>gi|302783200|ref|XP_002973373.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
gi|300159126|gb|EFJ25747.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
Length = 389
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 80/337 (23%), Positives = 139/337 (41%), Gaps = 43/337 (12%)
Query: 26 AIFDPRKSSSFQKINCDHPDCTYFKCV------NEQCVYTMKYADQSVTKGFAAHETISV 79
++F P S+S K+ C P C+ F V + C Y Y + G + I+
Sbjct: 39 SLFQPGLSTSHTKLPCGSPSCSAFSAVSTSCGPSSSCSYNTSYGTNFSSAGDLVSD-IAT 97
Query: 80 IGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI-IKKRFS 138
+ + + GC D+ G E +G +G + +SF+ QL ++ + +F
Sbjct: 98 MDSVRNRKVAANLSLGCGRDSGGLLELLDT---SGFVGFDKGNVSFMGQLSALGYRSKFI 154
Query: 139 YCLVI-----PLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPN--NFYYLSLKDISID 191
YCL L G Y + M Y T I +P Y+++L ISID
Sbjct: 155 YCLPSDTFRGKLVIGNYKLRNASISSSMAY-------TPMITNPQAAELYFINLSTISID 207
Query: 192 NERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP 251
+ P F +G GG +ID+ + L+Y SD Y +L + +Y L ++S
Sbjct: 208 KNKFQVPIQGF--LSNGTGGTVIDTTTFLSYLTSDFYTQLVQAIKNYTTN--LVEVSSSV 263
Query: 252 E---PIQLCYFLPETFNRFP---SMAFYFEDANLRIDGENVFIIDYE---NHFFLLAVAP 302
++LCY + + FP ++ ++F ++ F++D N+ +A+
Sbjct: 264 ADALGVELCYNISAN-SDFPPPATLTYHFL-GGAGVEVSTWFLLDDSDSVNNTICMAIGR 321
Query: 303 HDDL---VALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ + + +IG+ QQ D YDL F + C
Sbjct: 322 SESVGPNLNVIGTYQQLDLTVEYDLEQMRYGFGAQGC 358
>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 478
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 100/384 (26%), Positives = 154/384 (40%), Gaps = 57/384 (14%)
Query: 1 MVRLFIGTP-SKGVLLILDTGSALIYA-------------IFDPRKSSSFQKINCDHPDC 46
++ L IGTP + V L LDTGS L++ FD S + + C P C
Sbjct: 101 LIHLSIGTPRPQRVALTLDTGSDLVWTQCACHVCFAQPFPTFDALASQTTLAVPCSDPIC 160
Query: 47 TYFK-----CV--NEQCVYTMKYADQSVTKGFAAHETISVIG-KGEGKAIFHGAL----- 93
T K C + C Y YAD+S+T G +T + +G + H +
Sbjct: 161 TSGKYPLSGCTFNDNTCFYLYDYADKSITSGRIVEDTFTFRSPQGNNGSKAHAGVAVPNV 220
Query: 94 -FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS 152
FGC N G + +G+ G SR +S SQL RFS+C + + +
Sbjct: 221 RFGCGQYNKGIFKSNE----SGIAGFSRGPMSLPSQLKV---ARFSHCFTA-IADARTSP 272
Query: 153 SYL--KFGTDM--GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTF--DITV 206
+L G D + Q+T F N + YYL+LK I++ R+ F T
Sbjct: 273 VFLGGAPGPDNLGAHATGPVQSTPFANSNGSLYYLTLKGITVGKTRLPLNALAFAGKGTG 332
Query: 207 SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET--- 263
SG GG IIDSG+ + +Y L FV+ + +A S LC+ +
Sbjct: 333 SGSGGTIIDSGTGIRTLPGPMYRSLRAAFVARV-KLPVANESAADAESTLCFEAARSASL 391
Query: 264 -----FNRFPSMAFYFEDANLRIDGENVFIIDYENH------FFLLAVAPHDDLVALIGS 312
P + + A+ + E+ + E+ L+ + D + +IG+
Sbjct: 392 PPEAPAPALPKVVLHVAGADWDLPRESYVLDLLEDEDGSGSGLCLVMNSAGDSDLTIIGN 451
Query: 313 QQQRDTRFVYDLNIDLLSFVKENC 336
QQ++ YDL + L FV C
Sbjct: 452 FQQQNMHVAYDLEKNKLVFVPARC 475
>gi|326524762|dbj|BAK04317.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 533
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 79/328 (24%), Positives = 136/328 (41%), Gaps = 29/328 (8%)
Query: 28 FDPRKSSSFQKINCDHPD-CTYFKCV-------NEQCVYTMKYADQSVTKGFAAHETISV 79
+ P +SSS+++ C D C F V NE C Y D +VT+G ET +V
Sbjct: 193 YRPARSSSWRRYRCSQRDTCGNFPYVACKTPDHNESCSYKQMLQDGTVTRGIFGRETATV 252
Query: 80 IGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSY 139
G +A G + GCS G DA D GVL L +SF + G + FS+
Sbjct: 253 SVSGGRQARLPGLVLGCSTYEAGGTVDAHD----GVLTLGNQHVSFGNIAGQSFQGLFSF 308
Query: 140 CLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLK--DISIDNERM-N 196
CL + +G SSYL FG + I + N + ++ + ++ +R+ N
Sbjct: 309 CL-LATHSGRDASSYLTFGPNPAIETGGVAGETDIIYVTNMPTMGVQVTGVLVNGQRLDN 367
Query: 197 FPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL 256
PP+ ++ V GG +D+G+ ++ Y + + + +L ++SD E +
Sbjct: 368 IPPEVWNYRV--HGGLNLDTGTSVSSLVEPAYGIVTRALARHLDP-KLEKVSDVIE-FEH 423
Query: 257 CYFL------PETFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVAL 309
CY PET P + + A + V + + L + ++
Sbjct: 424 CYKWDGVKPAPETI--VPKLELVLQGGARMEPSLTGVLMPEVVPGVACLGFWRRELGPSV 481
Query: 310 IGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+G+ ++ + +D L F K+ C+
Sbjct: 482 LGNVHMQEHIWEFDSVKGKLRFKKDKCT 509
>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 72/305 (23%), Positives = 129/305 (42%), Gaps = 48/305 (15%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY--------------------AIFDPRKSSSFQKINC 41
++++GTP G + +DTGS + + +DP +SS+ ++C
Sbjct: 39 TKIYLGTPPVGYYVQVDTGSDVTWLNCAPCTSCVTETQLPSIKLTTYDPSRSSTDGALSC 98
Query: 42 DHPDCTYFKCVNE-------QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGA-- 92
+C NE C Y+ Y D S T+G+ + ++ + A
Sbjct: 99 RDSNCGAALGSNEVSCTSAGYCAYSTTYGDGSSTQGYFIQDVMTFQEIHNNTQVNGTASV 158
Query: 93 LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEY 150
FGC G + AL G++G + +S SQL S+ + RF++CL G
Sbjct: 159 YFGCGTTQSG-NLLMSSRALDGLIGFGQAAVSIPSQLASMGKVGNRFAHCLQGDNQGG-- 215
Query: 151 TSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
+ G+ P+ T ++ N Y + +++I++ N R P +FD T + G
Sbjct: 216 --GTIVIGS---VSEPNISYTPIVSR--NHYAVGMQNIAV-NGRNVTTPASFDTTSTSAG 267
Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSM 270
G I+DSG+ L Y Y +FV+ F+ + S + +QL + + FP++
Sbjct: 268 GVIMDSGTTLAYLVDPAY----TQFVNAVSTFESSMFSSHSQCLQLAWCSLQA--DFPTV 321
Query: 271 AFYFE 275
+F+
Sbjct: 322 KLFFD 326
>gi|238007638|gb|ACR34854.1| unknown [Zea mays]
gi|413948713|gb|AFW81362.1| pepsin A [Zea mays]
Length = 538
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 59/210 (28%), Positives = 90/210 (42%), Gaps = 14/210 (6%)
Query: 28 FDPRKSSSFQKINCDHPDCT---YFKCVN----EQCVYTMKYADQSVTKGFAAHETISVI 80
+ P KSSS+++I C +C Y C + E C Y + D ++T G E +V
Sbjct: 189 YRPAKSSSWRRIRCSQKECALLPYNTCQSPSKAESCSYYQQMQDGTLTMGIYGKEKATVT 248
Query: 81 GKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYC 140
A G + GCS G DA D GVL L +SF +RFS+C
Sbjct: 249 VSDGRMAKLPGLILGCSVLEAGGSVDAHD----GVLSLGNGEMSFAVHAAKRFGQRFSFC 304
Query: 141 LVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFP 198
L + + SSYL FG + P T T + + + Y + I + ER++ P
Sbjct: 305 L-LSANSSRDASSYLTFGPNPAVMGPGTMETDIVYNVDVKPAYGPLVTGIFVGGERLDIP 363
Query: 199 PDTFDITVSGEGGCIIDSGSVLTYFHSDVY 228
+ +D GG I+D+ + +T + Y
Sbjct: 364 QEIWDAEKVVGGGVILDTSTSVTSLVPEAY 393
>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 358
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 64/259 (24%), Positives = 107/259 (41%), Gaps = 48/259 (18%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPDC 46
V++ G+P++ +I+DTGS+L + +FDP S +++ ++C C
Sbjct: 120 VKVGFGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSCTSSQC 179
Query: 47 TYF----------KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
+ + + CVYT Y D S + G+ + + +++ G ++GC
Sbjct: 180 SSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTL----APSQTLPGFVYGC 235
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
G D D G AG+LGL R +S + Q+ S FSYCL P G +L
Sbjct: 236 -----GQDSDGLFGRAAGILGLGRNKLSMLGQVSSKFGYAFSYCL--PTRGG---GGFLS 285
Query: 157 FGTDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
G + + T P N Y+L L I++ + + + II
Sbjct: 286 IG-KASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPT------II 338
Query: 215 DSGSVLTYFHSDVYWKLHE 233
DSG+V+T VY +
Sbjct: 339 DSGTVITRLPMSVYTPFQQ 357
>gi|226492334|ref|NP_001147965.1| pepsin A precursor [Zea mays]
gi|195614874|gb|ACG29267.1| pepsin A [Zea mays]
Length = 538
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 59/210 (28%), Positives = 90/210 (42%), Gaps = 14/210 (6%)
Query: 28 FDPRKSSSFQKINCDHPDCT---YFKCVN----EQCVYTMKYADQSVTKGFAAHETISVI 80
+ P KSSS+++I C +C Y C + E C Y + D ++T G E +V
Sbjct: 189 YRPAKSSSWRRIRCSQKECALLPYNTCQSPSKAESCSYYQQMQDGTLTMGIYGKEKATVT 248
Query: 81 GKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYC 140
A G + GCS G DA D GVL L +SF +RFS+C
Sbjct: 249 VSDGRMAKLPGLILGCSVLEAGGSVDAHD----GVLSLGNGEMSFAVHAAKRFGQRFSFC 304
Query: 141 LVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFP 198
L + + SSYL FG + P T T + + + Y + I + ER++ P
Sbjct: 305 L-LSANSSRDASSYLTFGPNPAVMGPGTMETDIVYNVDVKPAYGPLVTGIFVGGERLDIP 363
Query: 199 PDTFDITVSGEGGCIIDSGSVLTYFHSDVY 228
+ +D GG I+D+ + +T + Y
Sbjct: 364 QEIWDAEKVVGGGVILDTSTSVTSLVPEAY 393
>gi|219886219|gb|ACL53484.1| unknown [Zea mays]
gi|219888509|gb|ACL54629.1| unknown [Zea mays]
gi|414588374|tpg|DAA38945.1| TPA: nucellin-like aspartic protease [Zea mays]
Length = 415
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 95/374 (25%), Positives = 146/374 (39%), Gaps = 62/374 (16%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAIFD-PRKS-----------SSFQKINCDHPDCTYF 49
V + IG P+K L +DTGS L + D P +S ++ + + C + CT
Sbjct: 55 VTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPTANRLVPCANALCTAL 114
Query: 50 --------KCVN-EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
KC + +QC Y +KY D + ++G +++ S+ + I G FGC D
Sbjct: 115 HSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMRSSN--IRPGLTFGCGYDQ 172
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLG--SIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
A A+ G+LGL R ++S +SQL I K +CL NG +L FG
Sbjct: 173 QVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLST---NG---GGFLFFG 226
Query: 159 TDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
D+ T N+Y + D + P + DSGS
Sbjct: 227 DDVVPSSRVTWVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEV----------VFDSGS 276
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF-------NRFPSMA 271
TYF + Y + + L Q+SD P LC+ + F N F SM
Sbjct: 277 TYTYFTAQPYQAVVSALKGGLSK-SLKQVSDPTLP--LCWKGQKAFKSVFDVKNEFKSMF 333
Query: 272 FYF---EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVA-----LIGSQQQRDTRFVYD 323
F ++A + I EN I+ + L + D A +IG +D +YD
Sbjct: 334 LSFASAKNAAMEIPPENYLIVTKNGNVCLGIL---DGTAAKLSFNVIGDITMQDQMVIYD 390
Query: 324 LNIDLLSFVKENCS 337
L + + C+
Sbjct: 391 NEKSQLGWARGACT 404
>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 498
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 93/384 (24%), Positives = 161/384 (41%), Gaps = 74/384 (19%)
Query: 3 RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
++ +GTP + + +DTGS +++ FD SS+ + C
Sbjct: 87 KVKMGTPPREFTVQIDTGSDILWINCNTCSNCPKSSGLGIELNFFDTVGSSTAALVPCSD 146
Query: 44 PDCT------YFKCVNE--QCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGA 92
P C +C + QC YT +Y D S T G + + ++G+ + A
Sbjct: 147 PMCASAIQGAAAQCSPQVNQCSYTFQYEDGSGTSGVYVSDAMYFDMILGQSTPANVASSA 206
Query: 93 --LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNG 148
+FGCS G D D A+ G+LG +S +SQL S I K FS+CL G
Sbjct: 207 TIVFGCSTYQSG-DLTKTDKAVDGILGFGPGELSVVSQLSSRGITPKVFSHCLKGDGNGG 265
Query: 149 EYTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVS 207
L G + PS + + + P+ Y L+L+ I+++ + ++ P F S
Sbjct: 266 ----GILVLGEIL---EPSIVYSPLVPSQPH--YNLNLQSIAVNGQVLSINPAVF--ATS 314
Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF-NR 266
+ G IIDSG+ L+Y + Y L + +F + +S + CY + + +
Sbjct: 315 DKRGTIIDSGTTLSYLVQEAYDPLVNAVDTAVSQFATSFISKGSQ----CYLVLTSIDDS 370
Query: 267 FPSMAFYFE-DANLRI------------DGENVFIIDYENHFFLLAVAPHDDLVALIGSQ 313
FP+++F FE A++ + DG ++ I ++ + V ++G
Sbjct: 371 FPTVSFNFEGGASMDLKPSQYLLNRGFQDGAKMWCIGFQK---------VQEGVTILGDL 421
Query: 314 QQRDTRFVYDLNIDLLSFVKENCS 337
+D VYDL + + +CS
Sbjct: 422 VLKDKIVVYDLARQQIGWTNYDCS 445
>gi|195645150|gb|ACG42043.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 415
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 95/374 (25%), Positives = 146/374 (39%), Gaps = 62/374 (16%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAIFD-PRKS-----------SSFQKINCDHPDCTYF 49
V + IG P+K L +DTGS L + D P +S ++ + + C + CT
Sbjct: 55 VTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPTANRLVPCANALCTAL 114
Query: 50 --------KCVN-EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
KC + +QC Y +KY D + ++G +++ S+ + I G FGC D
Sbjct: 115 HSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMRSSN--IRPGLTFGCGYDQ 172
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLG--SIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
A A+ G+LGL R ++S +SQL I K +CL NG +L FG
Sbjct: 173 QVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLST---NG---GGFLFFG 226
Query: 159 TDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
D+ T N+Y + D + P + DSGS
Sbjct: 227 DDVVPSSRVTWVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEV----------VFDSGS 276
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF-------NRFPSMA 271
TYF + Y + + L Q+SD P LC+ + F N F SM
Sbjct: 277 TYTYFTAQPYQAVVSALKGGLSK-SLKQVSDPTLP--LCWKGQKAFKSVFDVKNEFKSMF 333
Query: 272 FYF---EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVA-----LIGSQQQRDTRFVYD 323
F ++A + I EN I+ + L + D A +IG +D +YD
Sbjct: 334 LSFSSAKNAAMEIPPENYLIVTKNGNVCLGIL---DGTAAKLSFNVIGDITMQDQMVIYD 390
Query: 324 LNIDLLSFVKENCS 337
L + + C+
Sbjct: 391 NEKSQLGWARGACT 404
>gi|168060150|ref|XP_001782061.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666472|gb|EDQ53125.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 423
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 91/378 (24%), Positives = 147/378 (38%), Gaps = 63/378 (16%)
Query: 4 LFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPDCTY 48
L +G+P K L +DTGS L +A +++P+K+ + ++C P C
Sbjct: 44 LLLGSPPKLYFLDMDTGSDLTWAQCDAPCRNCAIGPHGLYNPKKA---KVVDCHLPVCAQ 100
Query: 49 ------FKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
++C ++ QC Y ++YAD S T G +T++V G I A+ GC D
Sbjct: 101 IQQGGSYECNSDVKQCDYEVEYADGSSTMGVLVEDTLTVR-LTNGTLIQTKAIIGCGYDQ 159
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLG--SIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
G + + GV+GLS ++ +QL IIK +CL +G YL FG
Sbjct: 160 QGTLAKS-PASTDGVIGLSSSKVALPAQLAEKGIIKNVLGHCLA----DGSNGGGYLFFG 214
Query: 159 TDMGYRRPS--TQATKFINHPNNF-YYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
++ PS T + P Y L+ I + + D D+T S + D
Sbjct: 215 DEL---VPSWGMTWTPMMGKPEMLGYQARLQSIRYGGDSLVLNNDE-DLTRS-TSSVMFD 269
Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE 275
SG+ TY Y + ++ + P C+ P F + YF+
Sbjct: 270 SGTSFTYLVPQAYASVLSAVTKQSGLLRVKSDTTLP----YCWRGPSPFQSITDVHQYFK 325
Query: 276 DANLRIDGENVF----IIDYENHFFLL-------------AVAPHDDLVALIGSQQQRDT 318
L G N F +D +L+ A ++ +IG R
Sbjct: 326 TLTLDFGGRNWFATDSTLDLSPQGYLIVSTQGNVCLGILDASGASLEVTNIIGDVSMRGY 385
Query: 319 RFVYDLNIDLLSFVKENC 336
VYD D + +++ NC
Sbjct: 386 LVVYDNVRDRIGWIRRNC 403
>gi|449518783|ref|XP_004166415.1| PREDICTED: aspartic proteinase-like protein 2-like, partial
[Cucumis sativus]
Length = 420
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 79/324 (24%), Positives = 132/324 (40%), Gaps = 47/324 (14%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
++ IGTPSK + +DTGS +++ +D +S++ + ++CD
Sbjct: 89 AKIGIGTPSKDYYVQVDTGSDIVWVNCIQCRECPRTSSLGMELTPYDLEESTTGKLVSCD 148
Query: 43 HPDCTYFK-------CVNEQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGA 92
C N C Y Y D S T G+ + + V G E A
Sbjct: 149 EQFCLEVNGGPLSGCTTNMSCPYLQIYGDGSSTAGYFVKDYVQYNRVSGDLETTAANGSI 208
Query: 93 LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEY 150
FGC G + + AL G+LG + S ISQL S +KK F++CL +G
Sbjct: 209 KFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCL-----DGTN 263
Query: 151 TSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
G + +P T + N P+ Y +++ + + + +N D F+
Sbjct: 264 GGGIFAMGHVV---QPKVNMTPLVPNQPH--YNVNMTGVQVGHIILNISADVFE--AGDR 316
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPS 269
G IIDSG+ L Y +Y L K +S ++ + + Q + + FP
Sbjct: 317 KGTIIDSGTTLAYLPELIYEPLVAKILSQQHNLEVQTIHGEYKCFQ---YSERVDDGFPP 373
Query: 270 MAFYFEDANLRIDGENVFIIDYEN 293
+ F+FE++ L + ++ YEN
Sbjct: 374 VIFHFENSLLLKVYPHEYLFQYEN 397
>gi|413922067|gb|AFW61999.1| hypothetical protein ZEAMMB73_694403, partial [Zea mays]
Length = 328
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 61/222 (27%), Positives = 95/222 (42%), Gaps = 43/222 (19%)
Query: 7 GTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC------ 46
G+P+ + +I+DTGS L + +FDP S+++ + C+ C
Sbjct: 103 GSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRCNASACADSLRA 162
Query: 47 ---TYFKCVN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSN 98
T C + E+C Y + Y D S ++G A +T+++ G A G +FGC
Sbjct: 163 ATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVAL-----GGASLGGFVFGCGL 217
Query: 99 DNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
N G G AG++GL R +S +SQ S FSYCL S L G
Sbjct: 218 SNRGLF-----GGTAGLMGLGRTELSLVSQTASRYGGVFSYCLPAATSGDASGSLSLGGG 272
Query: 159 TDMG--YRRPSTQA-TKFINHPNN--FYYLSLKDISIDNERM 195
D YR + A T+ I P FY+L++ ++ +
Sbjct: 273 DDAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTAL 314
>gi|147771308|emb|CAN69536.1| hypothetical protein VITISV_043237 [Vitis vinifera]
Length = 372
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 81/358 (22%), Positives = 147/358 (41%), Gaps = 43/358 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY-----------AIFDPRKSSSFQKINCDHPDCTYF 49
+VR IGTP++ +L+ +DT S + + +F+ S++++ + C C
Sbjct: 37 IVRAKIGTPAQTMLMAMDTSSDVAWIPCNGCLGCSSTLFNSPASTTYKSLGCQAAQCKQV 96
Query: 50 K---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
C C + + Y S+ + +TI++ G FGC G
Sbjct: 97 PKPTCGGGVCSFNLTYGGSSLAANLS-QDTITLATDA-----VPGYSFGCIQKATGGSLP 150
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
A+ R +S +SQ ++ + FSYCL P S L+ G +R
Sbjct: 151 AQGLLGL-----GRGPLSLLSQTQNLYQSTFSYCL--PSFKSLNFSGSLRLGPVGQPKR- 202
Query: 167 STQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
+ T + +P + Y+++L + + ++ PP +F S G I DSG+V T
Sbjct: 203 -IKYTPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGSFTFNPSTGAGTIFDSGTVFTRLV 261
Query: 225 SDVYWKLHEKFVSYFER-FQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDG 283
+ Y + + F + R + L CY +P P++ F F N+ +
Sbjct: 262 TPAYIAVRDAFRNRVGRNLTVTSLGG----FDTCYTVPIA---APTITFMFTGMNVTLPP 314
Query: 284 ENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+N+ I LA+A D ++ +I + QQ++ R +YD+ L +E C+
Sbjct: 315 DNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDVPNSRLGVARELCT 372
>gi|242089103|ref|XP_002440384.1| hypothetical protein SORBIDRAFT_09g030880 [Sorghum bicolor]
gi|241945669|gb|EES18814.1| hypothetical protein SORBIDRAFT_09g030880 [Sorghum bicolor]
Length = 555
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 65/229 (28%), Positives = 99/229 (43%), Gaps = 19/229 (8%)
Query: 28 FDPRKSSSFQKINCDHPDCT---YFKCVN----EQCVYTMKYADQSVTKGFAAHETISVI 80
+ P KSSS+++I C C Y C + E C Y K D +VT G +E +V
Sbjct: 208 YRPAKSSSWRRIRCSEQQCAHLPYNTCQSPSKLESCSYYQKTQDGTVTIGIYGNEKATVT 267
Query: 81 GKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYC 140
A G + GCS G DA DG L+ G I + + G RFS+C
Sbjct: 268 VSDGRMAKLPGLVLGCSVLEAGASVDAHDGVLSLGNGHMSFAIHAVLRFGG----RFSFC 323
Query: 141 LVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFP 198
L + + SSYL FG + P T T+ + + + Y + + + ER++ P
Sbjct: 324 L-LSANSSRDASSYLTFGPNPAVMGPGTMETEILYNVDVKAAYGPRVTAVLVGGERLDIP 382
Query: 199 PDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQL 247
D ++I G I+D+ + +T + Y E V+ +R LA L
Sbjct: 383 DDVWNIDKGLGSGVILDTSTSVTSLVPEAY----EPLVAALDR-HLAHL 426
>gi|414866064|tpg|DAA44621.1| TPA: putative aspartic protease family protein [Zea mays]
Length = 454
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 89/392 (22%), Positives = 150/392 (38%), Gaps = 71/392 (18%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY-----------------AIFDPRKSSSFQKINCDHP 44
V + +G P + V ++LDTGS L + A F+ SS++ +C P
Sbjct: 64 VPVAVGAPPQNVTMVLDTGSELSWLRCNGSRVPSTPPPQAPAAFNGSASSTYAAAHCSSP 123
Query: 45 DCTYFK--------CV---NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL 93
+C + C + C ++ YAD S G A +T + G +A L
Sbjct: 124 ECQWRGRDLPVPPFCAGPPSNSCRVSLSYADASSADGILAADTFLLGGAPPVRA-----L 178
Query: 94 FGC--SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT 151
FGC S + + A G+LG++R ++SF++Q ++ RF+YC+ G+
Sbjct: 179 FGCVTSYSSATATNSSDSEAATGLLGMNRGSLSFVTQTATL---RFAYCIA----PGD-G 230
Query: 152 SSYLKFGTDMGYRRPSTQATKFI--NHPNNF-----YYLSLKDISIDNERMNFPPDTFDI 204
L G D P T I + P + Y + L+ I + + P
Sbjct: 231 PGLLVLGGDGAALAPQLNYTPLIQISRPLPYFDRVAYSVQLEGIRVGAALLPIPKSVLAP 290
Query: 205 TVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQ----LCYFL 260
+G G ++DSG+ T+ +D Y L +F++ LA L + Q C+
Sbjct: 291 DHTGAGQTMVDSGTQFTFLLADAYAPLKGEFLNQTSAL-LAPLGESDFVFQGAFDACFRA 349
Query: 261 PE-----TFNRFPSMAFYFEDANLRIDGEN-VFIIDYENHFFLLAVA------PHDDLVA 308
E P + A + + GE ++ + E A A + D+
Sbjct: 350 SEARVAAASQMLPEVGLVLRGAEVAVGGEKLLYRVPGERRGEGGAEAVWCLTFGNSDMAG 409
Query: 309 L----IGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ IG Q++ YDL + F C
Sbjct: 410 MSAYVIGHHHQQNVWVEYDLQNGRVGFAPARC 441
>gi|225465837|ref|XP_002264626.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 1 [Vitis
vinifera]
Length = 437
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 81/358 (22%), Positives = 147/358 (41%), Gaps = 43/358 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY-----------AIFDPRKSSSFQKINCDHPDCTYF 49
+VR IGTP++ +L+ +DT S + + +F+ S++++ + C C
Sbjct: 102 IVRAKIGTPAQTMLMAMDTSSDVAWIPCNGCLGCSSTLFNSPASTTYKSLGCQAAQCKQV 161
Query: 50 K---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
C C + + Y S+ + +TI++ G FGC G
Sbjct: 162 PKPTCGGGVCSFNLTYGGSSLAANLS-QDTITLATDA-----VPGYSFGCIQKATGGSLP 215
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
A+ R +S +SQ ++ + FSYCL P S L+ G +R
Sbjct: 216 AQGLLGL-----GRGPLSLLSQTQNLYQSTFSYCL--PSFKSLNFSGSLRLGPVGQPKR- 267
Query: 167 STQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
+ T + +P + Y+++L + + ++ PP +F S G I DSG+V T
Sbjct: 268 -IKYTPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGSFTFNPSTGAGTIFDSGTVFTRLV 326
Query: 225 SDVYWKLHEKFVSYFER-FQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDG 283
+ Y + + F + R + L CY +P P++ F F N+ +
Sbjct: 327 TPAYIAVRDAFRNRVGRNLTVTSLGG----FDTCYTVPIA---APTITFMFTGMNVTLPP 379
Query: 284 ENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+N+ I LA+A D ++ +I + QQ++ R +YD+ L +E C+
Sbjct: 380 DNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDVPNSRLGVARELCT 437
>gi|226530102|ref|NP_001152414.1| PCS1 precursor [Zea mays]
gi|195656033|gb|ACG47484.1| PCS1 [Zea mays]
Length = 452
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 68/272 (25%), Positives = 112/272 (41%), Gaps = 50/272 (18%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY-----------------AIFDPRKSSSFQKINCDHP 44
V + +G P + V ++LDTGS L + A F+ SS++ +C P
Sbjct: 62 VPVAVGAPPQNVTMVLDTGSELSWLRCNGSRVPSTPPPQAPAAFNGSASSTYAAAHCSSP 121
Query: 45 DCTYFK--------CVNE---QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL 93
+C + C C ++ YAD S G A +T + G A AL
Sbjct: 122 ECQWRGRDLPVPPFCAGPPSXSCRVSLSYADASSADGILAADTFLL-----GGAPPVXAL 176
Query: 94 FGC--SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT 151
FGC S + + A G+LG++R ++SF++Q ++ RF+YC+ G+
Sbjct: 177 FGCVTSYSSATATNSSDSEAATGLLGMNRGSLSFVTQTATL---RFAYCIA----PGD-G 228
Query: 152 SSYLKFGTDMGYRRPSTQATKFI--NHPNNF-----YYLSLKDISIDNERMNFPPDTFDI 204
L G D P T I + P + Y + L+ I + + P
Sbjct: 229 PGLLVLGGDGAALAPQLNYTPLIQISRPLPYFDRVAYSVQLEGIRVGAALLPIPKSVLAP 288
Query: 205 TVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFV 236
+G G ++DSG+ T+ +D Y L +F+
Sbjct: 289 DHTGAGQTMVDSGTQFTFLLADAYAPLKGEFL 320
>gi|308081797|ref|NP_001182920.1| uncharacterized protein LOC100501208 [Zea mays]
gi|238008190|gb|ACR35130.1| unknown [Zea mays]
gi|413922182|gb|AFW62114.1| hypothetical protein ZEAMMB73_927324 [Zea mays]
Length = 269
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 66/256 (25%), Positives = 119/256 (46%), Gaps = 28/256 (10%)
Query: 94 FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSS 153
FGC +G A +G++G+S +S + QL SI K FSYCL P ++ +S
Sbjct: 26 FGCGKLTNGTIAGA-----SGIMGVSPGPLSVLKQL-SITK--FSYCLT---PFTDHKTS 74
Query: 154 YLKFGT--DMGYRRPS--TQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVS 207
+ FG D+G + + Q + +P + +YY+ + ISI ++R++ P +
Sbjct: 75 PVMFGAMADLGKYKTTGKVQTIPLLKNPVEDIYYYVPMVGISIGSKRLDVPEAILALRPD 134
Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-- 265
G GG ++DS + L Y + +L + + E +L + + +C+ LP +
Sbjct: 135 GTGGTVLDSATTLAYLVEPAFKELKK---AVMEGMKLPAANRSIDDYPVCFELPRGMSME 191
Query: 266 --RFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAV--APHDDLVALIGSQQQRDTRF 320
+ P + +F DA + + ++ F + LAV AP + +IG+ QQ++
Sbjct: 192 GVQVPPLVLHFAGDAEMSLPRDSYF-QEPSPGMMCLAVMQAPFEGAPNVIGNVQQQNMHV 250
Query: 321 VYDLNIDLLSFVKENC 336
+YDL S+ C
Sbjct: 251 LYDLGNRKFSYAPTKC 266
>gi|357159298|ref|XP_003578403.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 442
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 97/378 (25%), Positives = 158/378 (41%), Gaps = 73/378 (19%)
Query: 5 FIGTPSKGVLLILDTGSALIYAIFDPRKSSSFQKINCDHPDCTYFKCVNEQCVYTMKYAD 64
IG+P + ++DTGS LI+ + +++ +C Y+ + AD
Sbjct: 91 LIGSPPQRTEALIDTGSDLIWT----QCATTCLPKSCAKQGLPYYNLSQSSTFVPVPCAD 146
Query: 65 QSVTKGFAAHETISVIGK----------GEGKAIFHGAL---------------FGC--- 96
++ GF A + + G G G+ I G+L FGC
Sbjct: 147 KA---GFCAANGVHLCGLDGSCTFIASYGAGRVI--GSLGTESFAFESGTTSLAFGCVSL 201
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
+ G DA +G++GL R +S +SQ+G+ RFSYCL P + SS+L
Sbjct: 202 TRITSGALNDA-----SGLIGLGRGRLSLVSQIGAT---RFSYCLT-PYFHSSGASSHL- 251
Query: 157 FGTDMGYRRPSTQATKFINHPNN-----FYYLSLKDISIDNERM-NFPPDTFDITV---- 206
F + F+ P + FYYL L+ I++ R+ TF +
Sbjct: 252 FVGASASLGGGGASMPFVKSPKDYPYSTFYYLPLEGITVGKTRLPAVNSTTFQLRQLFKG 311
Query: 207 SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP----IQLCYFLPE 262
GG IID+GS LT S Y L E+ + QL S P P ++LC E
Sbjct: 312 YWAGGVIIDTGSPLTQLASHAYEALKEEVAA-----QLGNGSLVPAPEDSGLELC-VARE 365
Query: 263 TFNRF-PSMAFYFED-ANLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTR 319
F + P++ F+F A++ + + + +D ++ +D ++IG+ QQ+D
Sbjct: 366 GFQKVVPALVFHFGGGADMAVPAASYWAPVDKAAACMMILEGGYD---SIIGNFQQQDMH 422
Query: 320 FVYDLNIDLLSFVKENCS 337
+YDL SF +C+
Sbjct: 423 LLYDLRRGRFSFQTADCT 440
>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
Length = 451
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 90/373 (24%), Positives = 144/373 (38%), Gaps = 55/373 (14%)
Query: 3 RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
RL +G+P + + +DTGS +++ FDP S + I+C
Sbjct: 93 RLQLGSPPRDFYVQIDTGSDVLWVSCSSCNGCPVSSGLHIPLNFFDPGSSPTASLISCSD 152
Query: 44 PDCTY--------FKCVNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGA 92
C+ N QC YT +Y D S T G+ + + +++G K
Sbjct: 153 QRCSLGLQSSDSVCAAQNNQCGYTFQYGDGSGTSGYYVSDLLHFDTILGGSVMKNSSAPI 212
Query: 93 LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEY 150
+FGCS G D D A+ G+ G + +S ISQL S I + FS+C L +
Sbjct: 213 VFGCSTLQTG-DLTKPDRAVDGIFGFGQQDMSVISQLASQGITPRVFSHC----LKGDDS 267
Query: 151 TSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
L G + P+ T + + P+ Y L+L+ I ++ + + P F S
Sbjct: 268 GGGILVLGEIV---EPNIVYTPLVPSQPH--YNLNLQSIYVNGQTLAIDPSVF--ATSSN 320
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-FP 268
G IIDSG+ L Y Y + F+S +S CY + N FP
Sbjct: 321 QGTIIDSGTTLAYLTEAAY----DPFISAITSTVSPSVSPYLSKGNQCYLTSSSINDVFP 376
Query: 269 SMAFYFEDANLRIDGENVFIIDYE--NHFFLLAVA---PHDDLVALIGSQQQRDTRFVYD 323
++ F I ++I N L V + ++G +D FVYD
Sbjct: 377 QVSLNFAGGTSMILIPQDYLIQQSSINGAALWCVGFQKIQGQEITILGDLVLKDKIFVYD 436
Query: 324 LNIDLLSFVKENC 336
+ + + +C
Sbjct: 437 IAGQRIGWANYDC 449
>gi|147801191|emb|CAN68822.1| hypothetical protein VITISV_007106 [Vitis vinifera]
Length = 443
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 64/212 (30%), Positives = 99/212 (46%), Gaps = 39/212 (18%)
Query: 6 IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYFKC 51
+G PS V I DTGS LI+ IFDP +S +++ ++ D P C +
Sbjct: 63 LGVPSTLVYGIADTGSELIWLQCLPCTHCYNQTPPIFDPAESYTYETVSSDSPICNAVRR 122
Query: 52 V-----NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
+ ++ C Y Y D + TKG + + + FGCS+D
Sbjct: 123 ISCREGDKSCCYQHTYGDGTTTKGTLSTDVFAFEDPTRTIVEVGYLTFGCSHDT-----K 177
Query: 107 AR-DGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD---MG 162
AR G AGV+GL+R S +SQL K+FSYC+VIP +G + S + FG+ +G
Sbjct: 178 ARLKGHQAGVVGLNRHPNSLVSQLKV---KKFSYCMVIPDDHG--SGSRMYFGSRAVILG 232
Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNER 194
+ P + + Y+++LK IS+ E+
Sbjct: 233 GKTP------LLKGDYSHYFVTLKGISVGEEK 258
>gi|357465299|ref|XP_003602931.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355491979|gb|AES73182.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 438
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 87/360 (24%), Positives = 154/360 (42%), Gaps = 45/360 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGS--ALIYA---------IFDPRKSSSFQKINCDHPDCTYF 49
+VR+ IGTP + + ++LDT + A I + F P S+S+ + C P C+
Sbjct: 99 IVRVKIGTPGQLLFMVLDTSTDEAFIPSSGCIGCSATTFSPNASTSYVPLECSVPQCSQV 158
Query: 50 KCVN------EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
+ ++ C + YA + + +++ + + FG N G
Sbjct: 159 RGLSCPATGSGACSFNKSYAGSTYSATLV-QDSLRL-----ATDVIPSYSFGSINAISGS 212
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
A+ R +S +SQ GS+ FSYCL P Y S LK G +G
Sbjct: 213 SIPAQGLLGL-----GRGPLSLLSQTGSLYSGVFSYCL--PSFKSYYFSGSLKLG-PVGQ 264
Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
+ S + T + +P + Y+++L I++ + FP + V+ G IIDSG+V+T
Sbjct: 265 PK-SIRTTPLLRNPRRPSLYFVNLTGITVGKVNVPFPKELLAFDVNTGSGTIIDSGTVIT 323
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRI 281
F VY + ++F R Q+ C F+ P++ +F D +L++
Sbjct: 324 RFVEPVYNAVRDEF-----RKQVTGPFSSLGAFDTC-FVKNYETLAPAITLHFTDLDLKL 377
Query: 282 DGENVFIIDYENHFFLLAVA--PHD---DLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
EN I LA+A P + ++ +I + QQ++ R ++D + + +E C
Sbjct: 378 PLENSLIHSSSGSLACLAMASTPKNVNYTVLNVIANYQQQNLRVLFDTVNNKVGIARELC 437
>gi|223973231|gb|ACN30803.1| unknown [Zea mays]
Length = 459
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 91/402 (22%), Positives = 162/402 (40%), Gaps = 85/402 (21%)
Query: 6 IGTPSKGVLLILDTGSALIY--------------------AIFDPRKSSSFQKINCDHPD 45
+GTP + + ++LDTGS L + +F P+ SSS + + C +P
Sbjct: 73 LGTPPQPLPVLLDTGSHLTWVPCTSSYECRNCSSPSASAVPVFHPKNSSSSRLVGCRNPS 132
Query: 46 CTYF--------KCVNEQC----------------VYTMKYADQSVTKGFAAHETISVIG 81
C + KC C Y + Y S T G +T+
Sbjct: 133 CQWVHSAANLATKCRRAPCSPGAANCPAAASNVCPPYAVVYGSGS-TAGLLIADTL---- 187
Query: 82 KGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL 141
+ G+A+ G + GCS + +G+ G R S +QLG +FSYCL
Sbjct: 188 RAPGRAV-PGFVLGCS-------LVSVHQPPSGLAGFGRGAPSVPAQLG---LPKFSYCL 236
Query: 142 VI------PLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNN-FYYLSLKDISIDNER 194
+ +G G M Y P ++ P +YYL+L+ +++ +
Sbjct: 237 LSRRFDDNAAVSGSLVLGGTGGGEGMQYV-PLVKSAAGDKLPYGVYYYLALRGVTVGGKA 295
Query: 195 MNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYF-ERFQLAQLSDCPEP 253
+ P F +G GG I+DSG+ TY V+ + + V+ R++ ++ ++
Sbjct: 296 VRLPARAFAANAAGSGGTIVDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDAEDELG 355
Query: 254 IQLCYFLPETFNR--FPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVA-- 308
+ C+ LP+ P ++F+FE A +++ EN F++ + +A D
Sbjct: 356 LHPCFALPQGARSMALPELSFHFEGGAVMQLPVENYFVVAGRGAVEAICLAVVTDFSGGS 415
Query: 309 -----------LIGSQQQRDTRFVYDLNIDLLSFVKENCSDD 339
++GS QQ++ YDL + L F +++C+
Sbjct: 416 GAGNEGSGPAIILGSFQQQNYLVEYDLEKERLGFRRQSCTSS 457
>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
Length = 469
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 90/361 (24%), Positives = 145/361 (40%), Gaps = 63/361 (17%)
Query: 6 IGTPSKGVLLILDTGSALIYAIFD--------------PRKSSSFQKINCDHPDCTYFK- 50
IGTP + + + DTGS LI+ D P SS+F ++ C C +
Sbjct: 106 IGTPPQKLTALADTGSDLIWTKCDAGGGAAWGGSSSYHPNASSTFTRLPCSDRLCAALRS 165
Query: 51 -----CV--NEQCVYTMKYA---DQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
C +C Y Y D T+GF ET ++ G G FGC+
Sbjct: 166 YSLARCAAGGAECDYKYAYGLGDDPDFTQGFLGSETFTLGGDA-----VPGVGFGCTTAL 220
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT- 159
G D +GA G++GL R +S +SQL + F YCL +S L FG
Sbjct: 221 EG---DYGEGA--GLVGLGRGPLSLVSQLDA---GTFMYCLTADASK----ASPLLFGAL 268
Query: 160 -DMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
M Q+T + FY ++L+ I+I + + DSG+
Sbjct: 269 ATMTGAGAGVQSTGLLAS-TTFYAVNLRSITIGSATTAGVGGPGGVV--------FDSGT 319
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCYFLPETFNRFPSMAFYFE- 275
LTY Y + F+S Q L+ + CY P++ P+M +F+
Sbjct: 320 TLTYLAEPAYTEAKAAFLS-----QTTSLTPVEGRYGFEACYEKPDSARLIPAMVLHFDG 374
Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
A++ + N ++++ ++ V L ++IG+ Q + ++D+ +LSF N
Sbjct: 375 GADMALPVAN-YVVEVDDGVVCWVVQRSPSL-SIIGNIMQMNYLVLHDVRKSVLSFQPAN 432
Query: 336 C 336
C
Sbjct: 433 C 433
>gi|224140237|ref|XP_002323490.1| predicted protein [Populus trichocarpa]
gi|222868120|gb|EEF05251.1| predicted protein [Populus trichocarpa]
Length = 478
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 90/382 (23%), Positives = 158/382 (41%), Gaps = 72/382 (18%)
Query: 3 RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
++ +G+P + + +DTGS +++ FD SS+ ++C
Sbjct: 69 KVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGLVHCSD 128
Query: 44 PDCT------YFKCVNE--QCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGA 92
P CT +C + QC YT +Y D S T G+ +T+ +++ GE + A
Sbjct: 129 PICTSAVQTTVTQCSPQTNQCSYTFQYEDGSGTSGYYVSDTLYFDAIL--GESLVVNSSA 186
Query: 93 L--FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNG 148
L FGCS G D D A+ G+ G + +S ISQL + I + FS+CL G
Sbjct: 187 LIVFGCSTFQSG-DLTMTDKAVDGIFGFGQGELSVISQLSTHGITPRVFSHCL-----KG 240
Query: 149 EYTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVS 207
E + ++ P + + + P+ Y L+L+ I+++ + + P F S
Sbjct: 241 EGIGGGILVLGEI--LEPGMVYSPLVPSQPH--YNLNLQSIAVNGKLLPIDPSVF--ATS 294
Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR- 266
G I+DSG+ L Y ++ Y + FVS ++ CY + + ++
Sbjct: 295 NSQGTIVDSGTTLAYLVAEAY----DPFVSAVNVIVSPSVTPIISKGNQCYLVSTSVSQM 350
Query: 267 FPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQ-----------Q 315
FP +F F G ++ E++ + ++ IG Q+
Sbjct: 351 FPLASFNFA-------GGASMVLKPEDYLIPFGPSQGGSVMWCIGFQKVQGVTILGDLVL 403
Query: 316 RDTRFVYDLNIDLLSFVKENCS 337
+D FVYDL + + +CS
Sbjct: 404 KDKIFVYDLVRQRIGWANYDCS 425
>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
Length = 499
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 89/375 (23%), Positives = 152/375 (40%), Gaps = 58/375 (15%)
Query: 3 RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
R+ +G+P K + +DTGS +++ FDP SS+ I+C
Sbjct: 86 RVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASLISCSD 145
Query: 44 PDCTY------FKCVNE--QCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGA 92
C+ C ++ QC+YT +Y D S T G+ + + +++G +
Sbjct: 146 QRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTNS-SASI 204
Query: 93 LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEY 150
+FGCS G D D A+ G+ G + +S ISQ+ S I K FS+CL G
Sbjct: 205 VFGCSISQTG-DLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDGGGGGI 263
Query: 151 TSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
D+ Y + + P+ Y L+L+ IS++ + + P+ F S
Sbjct: 264 LVLGEIVEEDIVY------SPLVPSQPH--YNLNLQSISVNGKSLAIDPEVF--ATSTNR 313
Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-FPS 269
G I+DSG+ L Y + Y + FVS + CY + + FP+
Sbjct: 314 GTIVDSGTTLAYLAEEAY----DPFVSAITEAVSQSVRPLLSKGTQCYLITSSVKGIFPT 369
Query: 270 MAFYFEDA-NLRIDGENVFIIDYENHFFLLAV------APHDDLVALIGSQQQRDTRFVY 322
++ F ++ + E+ + +N AV + ++G +D FVY
Sbjct: 370 VSLNFAGGVSMNLKPEDYLL--QQNSIGDAAVWCIGFQKIQGQGITILGDLVLKDKIFVY 427
Query: 323 DLNIDLLSFVKENCS 337
DL + + +CS
Sbjct: 428 DLAGQRIGWANYDCS 442
>gi|357127503|ref|XP_003565419.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 486
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 96/390 (24%), Positives = 149/390 (38%), Gaps = 77/390 (19%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY-----------------AIFDPRKSSSFQKINCDH 43
++ + +GTP VL I DTGS L++ F P SS++ ++ CD
Sbjct: 111 LMAIEVGTPPVRVLAIADTGSDLVWVKCKGKDNDNNSTAPPSVYFVPSASSTYGRVGCDT 170
Query: 44 PDCTYFKCV-----NEQCVYTMKYADQSVTKGFAAHE--TISVIGKGEGKAIFHGAL--- 93
C + C Y Y D S G + E T S I
Sbjct: 171 KACRALSSAASCSPDGSCEYLYSYGDGSRASGQLSTETFTFSTIADSSKTNSHGNNNNNS 230
Query: 94 ------------FGCSNDNHG-FDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFS 138
FGCS G F D ++GL +S SQLG+ + ++FS
Sbjct: 231 SSHGQVEIAKLDFGCSTTTTGTFRADG-------LVGLGGGPVSLASQLGATTSLGRKFS 283
Query: 139 YCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFIN-HPNNFYYLSLKDISIDNERMNF 197
YCL P N SS L FG+ P +T I +Y ++L I++ +
Sbjct: 284 YCLA-PYANTN-ASSALNFGSRAVVSEPGAASTPLITGEVETYYTIALDSINVAGTKRP- 340
Query: 198 PPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPI-QL 256
T + + I+DSG+ LTY S + L + R +L + ++ PE I L
Sbjct: 341 -------TTAAQAHIIVDSGTTLTYLDSALLTPLVKDLT---RRIKLPR-AESPEKILDL 389
Query: 257 CYFLPETFNRFPSMAFYFEDANLRIDG--------ENVFIIDYENHFFLLAVAPHD-DLV 307
CY + A D L + G +N F++ E L VA + V
Sbjct: 390 CYDISGVRGE---DALGIPDVTLVLGGGGEVTLKPDNTFVVVQEGVLCLALVATSERQSV 446
Query: 308 ALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+++G+ Q++ YDL ++F +C+
Sbjct: 447 SILGNIAQQNLHVGYDLEKGTVTFAAADCA 476
>gi|357155293|ref|XP_003577072.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Brachypodium distachyon]
Length = 429
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 90/373 (24%), Positives = 142/373 (38%), Gaps = 66/373 (17%)
Query: 6 IGTPSKGVLLILDTGSALIY------------------AIFDPRKSSSFQKINCDHPDC- 46
+GTP L+ +DTGS L + ++FDP KS++++ + C DC
Sbjct: 81 LGTPPVANLVTVDTGSTLSWVVCQRCQISCHTTAPEAGSVFDPDKSTTYELVGCSSRDCA 140
Query: 47 -------TYFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
F C+ E C+Y+++Y + A + +I G +FGCS
Sbjct: 141 DVQRSLVAPFGCIEETDTCLYSLRYGSGPSGQYSAGRLGTDKLTLASSSSIIDGFIFGCS 200
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKR-FSYCLVIPLPNGEYTSSYLK 156
D D+ G +GV+G SF +Q+ R FSYC P +L
Sbjct: 201 GD------DSFKGYESGVIGFGGANFSFFNQVARQTNYRAFSYC----FPGDHTAEGFLS 250
Query: 157 FGTDMGYRRPSTQATKFINH--PNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
G Y + T I H + Y L D+ +D R+ + + ++
Sbjct: 251 IGA---YPKDELVYTNLIPHFGDRSVYSLQQIDMMVDGNRLQVDQSEYTKRM-----MVV 302
Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-----FPS 269
DSG+V T+ V+ + S + LSD + C F P + P+
Sbjct: 303 DSGTVDTFLLGPVFDAFSKAMASAMQAKGF--LSDTVG-TETC-FRPNGGDSVDSGDLPT 358
Query: 270 MAFYFEDANLRIDGENVFIIDYENH-FFLLAVAPHDDL-----VALIGSQQQRDTRFVYD 323
+ F L++ ENVF +H LA P D+ V ++G++ R VYD
Sbjct: 359 VEMRFIGTTLKLPPENVFHDLLPSHDKICLAFKP--DVAGVRNVQILGNKATXSFRVVYD 416
Query: 324 LNIDLLSFVKENC 336
L F C
Sbjct: 417 LQAMYFGFQAGAC 429
>gi|21805926|gb|AAM76716.1| nucellin-like aspartic protease [Zea mays]
Length = 357
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 94/370 (25%), Positives = 144/370 (38%), Gaps = 62/370 (16%)
Query: 6 IGTPSKGVLLILDTGSALIYAIFD-PRKS-----------SSFQKINCDHPDCTYF---- 49
IG P+K L +DTGS L + D P +S ++ + + C + CT
Sbjct: 1 IGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPTANRLVPCANALCTALHSGQ 60
Query: 50 ----KCVN-EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
KC + +QC Y +KY D + ++G +++ S+ + I G FGC D
Sbjct: 61 GSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMRSSN--IRPGLTFGCGYDQQVGK 118
Query: 105 EDARDGALAGVLGLSRVTISFISQLG--SIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
A A+ G+LGL R ++S +SQL I K +CL NG +L FG D+
Sbjct: 119 NGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLST---NG---GGFLFFGDDVV 172
Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
T N+Y + D + P + DSGS TY
Sbjct: 173 PSSRVTWVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEV----------VFDSGSTYTY 222
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF-------NRFPSMAFYF- 274
F + Y + + L Q+SD P LC+ + F N F SM F
Sbjct: 223 FTAQPYQAVVSALKGGLSK-SLKQVSDPTLP--LCWKGQKAFKSVFDVKNEFKSMFLSFA 279
Query: 275 --EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVA-----LIGSQQQRDTRFVYDLNID 327
++A + I EN I+ + L + D A +IG +D +YD
Sbjct: 280 SAKNAAMEIPPENYLIVTKNGNVCLGIL---DGTAAKLSFNVIGDITMQDQMVIYDNEKS 336
Query: 328 LLSFVKENCS 337
L + + C+
Sbjct: 337 QLGWARGACT 346
>gi|357481195|ref|XP_003610883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512218|gb|AES93841.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 315
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 79/309 (25%), Positives = 134/309 (43%), Gaps = 26/309 (8%)
Query: 40 NCDHPDC----TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFG 95
+CD P C T ++C YT Y D S+TKG A +T + LFG
Sbjct: 20 SCDSPLCHKLDTGVCSPEKRCNYTYGYGDNSLTKGVLAQDTATFTSNTGKLVSLSRFLFG 79
Query: 96 CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSII-KKRFSYCLVIPLPNGEYTSSY 154
C ++N G D G++GL S ISQ+G + K+FS CLV P SS
Sbjct: 80 CGHNNTGGFNDHE----MGLIGLGGGPTSLISQIGPLFGGKKFSQCLV-PFLTDIKISSR 134
Query: 155 LKFGTDMGYRRPSTQATKFINHPNNF--YYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
+ FG T + + Y+++L IS+++ + + T+ +G
Sbjct: 135 MSFGKGSQVLGDGVVTTPLVQREQDMTSYFVTLLGISVEDTYL-----PMNSTIE-KGNM 188
Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-PIQLCYFLPETFNRFPSMA 271
++DSG+ +Y ++ +V L +++ P QLCY +T + P++
Sbjct: 189 LVDSGTPPNILPQQLYDRV---YVEVKNNVPLELITNDPSLGPQLCYRT-QTNLKGPTLT 244
Query: 272 FYFEDANLRIDGENVFI--IDYENHFFLLAVAPHDDLVALI-GSQQQRDTRFVYDLNIDL 328
++FE ANL + FI F LA+ + + + G+ Q + +DL+ +
Sbjct: 245 YHFEGANLLLTPIQTFIPPTPETKGVFCLAINNYTNSNGGVYGNFAQSNYLIGFDLDRQV 304
Query: 329 LSFVKENCS 337
+SF +C+
Sbjct: 305 VSFKATDCT 313
>gi|359488213|ref|XP_002263620.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 434
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 100/375 (26%), Positives = 154/375 (41%), Gaps = 60/375 (16%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY-----------AIFDPRKSSSFQKINCDH------ 43
+V L IGTP + ++LDTGS L + FDP SSSF + C+H
Sbjct: 79 IVSLPIGTPPQTQQMVLDTGSQLSWIQCKVPPKTPPTAFDPLLSSSFSVLPCNHSLCKPR 138
Query: 44 -PDCTYFKCV--NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
PD T N C Y+ YAD + +G E + + + GC+ D+
Sbjct: 139 VPDYTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTFSSSQTTPPL----ILGCATDS 194
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPL-------PNGEY--- 150
D + G+LG++ +SF S L I K FSYC V P P G +
Sbjct: 195 ----SDTQ-----GILGMNLGRLSF-SSLAKISK--FSYC-VPPRRSQSGSSPTGSFYLG 241
Query: 151 ---TSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVS 207
+S+ K+ M YR Q+ + N Y L + I I+ +++N F S
Sbjct: 242 PNPSSAGFKYVNLMTYR----QSQRMPNLDPLAYTLPMLGIRINGKKLNISTSAFRADPS 297
Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF--N 265
G G +IDSG+ T+ + Y K+ E+ V +L + + +C+
Sbjct: 298 GAGQTLIDSGTWFTFLVDEAYSKVKEEIVK-LAGPKLKKGYVYGGSLDMCFDGDAMVIGR 356
Query: 266 RFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDL-VA--LIGSQQQRDTRFVY 322
+MAF FE+ + + D L + D L VA +IG+ Q+D +
Sbjct: 357 MIGNMAFEFENGVEIVVEREKMLADVGGGVQCLGIGRSDLLGVASNIIGNFHQQDLWVEF 416
Query: 323 DLNIDLLSFVKENCS 337
DL + F + +CS
Sbjct: 417 DLVGRRVGFGRTDCS 431
>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 89/375 (23%), Positives = 152/375 (40%), Gaps = 58/375 (15%)
Query: 3 RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
R+ +G+P K + +DTGS +++ FDP SS+ I+C
Sbjct: 71 RVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASLISCSD 130
Query: 44 PDCTY------FKCVNE--QCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGA 92
C+ C ++ QC+YT +Y D S T G+ + + +++G +
Sbjct: 131 QRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTNS-SASI 189
Query: 93 LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEY 150
+FGCS G D D A+ G+ G + +S ISQ+ S I K FS+CL G
Sbjct: 190 VFGCSISQTG-DLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDGGGGGI 248
Query: 151 TSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
D+ Y + + P+ Y L+L+ IS++ + + P+ F S
Sbjct: 249 LVLGEIVEEDIVY------SPLVPSQPH--YNLNLQSISVNGKSLAIDPEVF--ATSTNR 298
Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-FPS 269
G I+DSG+ L Y + Y + FVS + CY + + FP+
Sbjct: 299 GTIVDSGTTLAYLAEEAY----DPFVSAITEAVSQSVRPLLSKGTQCYLITSSVKGIFPT 354
Query: 270 MAFYFEDA-NLRIDGENVFIIDYENHFFLLAV------APHDDLVALIGSQQQRDTRFVY 322
++ F ++ + E+ + +N AV + ++G +D FVY
Sbjct: 355 VSLNFAGGVSMNLKPEDYLL--QQNSIGDAAVWCIGFQKIQGQGITILGDLVLKDKIFVY 412
Query: 323 DLNIDLLSFVKENCS 337
DL + + +CS
Sbjct: 413 DLAGQRIGWANYDCS 427
>gi|255583547|ref|XP_002532530.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223527742|gb|EEF29846.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 440
Score = 74.3 bits (181), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 102/384 (26%), Positives = 154/384 (40%), Gaps = 74/384 (19%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
+V L IGTP + ++LDTGS L + FDP SSSF + C+HP
Sbjct: 81 IVSLPIGTPPQTQQMVLDTGSQLSWIQCHKKSVPKKPPPTTSFDPSLSSSFSVLPCNHPL 140
Query: 46 CTY--------FKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
C C N C Y+ YAD + +G E I+ + + GC
Sbjct: 141 CKPRIPDFTLPTTCDQNRLCHYSYFYADGTYAEGSLVREKITFSSSQSTPPL----ILGC 196
Query: 97 SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
+ + DE G+LG++ SF SQ + I K FSYC+ P + +
Sbjct: 197 AEAST--DEK-------GILGMNLGRRSFASQ--AKISK-FSYCV----PTRQARAGLSS 240
Query: 157 FGTDMGYRRPSTQATKFIN---------HPNN---FYYLSLKDISIDNERMNFPPDTFDI 204
G+ P++ ++IN PN Y + ++ I + N R+N F
Sbjct: 241 TGSFYLGNNPNSGRFQYINLLTFTPSQRSPNLDPLAYTIPMQGIRMGNARLNISATLFRP 300
Query: 205 TVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYF-----ERFQLAQLSDCPEPIQLCY- 258
SG G IIDSGS TY + Y K+ E+ V + + +SD +C+
Sbjct: 301 DPSGAGQTIIDSGSEFTYLVDEAYNKVREEVVRLVGPKLKKGYVYGGVSD------MCFD 354
Query: 259 FLPETFNRF-PSMAFYFEDA-NLRIDGENVFIIDYENHFFLLAVAPHDDLVA---LIGSQ 313
P R +M F FE + ID V + D + + + L A +IG+
Sbjct: 355 GNPMEIGRLIGNMVFEFEKGVEIVIDKWRV-LADVGGGVHCIGIGRSEMLGAASNIIGNF 413
Query: 314 QQRDTRFVYDLNIDLLSFVKENCS 337
Q++ YDL + K +CS
Sbjct: 414 HQQNLWVEYDLANRRIGLGKADCS 437
>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 493
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 89/376 (23%), Positives = 155/376 (41%), Gaps = 59/376 (15%)
Query: 3 RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
+L +GTP + + +DTGS +++ FDP S + I+C
Sbjct: 84 KLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPISCSD 143
Query: 44 PDCTY--------FKCVNEQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGA 92
C++ N C YT +Y D S T GF + + ++G
Sbjct: 144 QRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPV 203
Query: 93 LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEY 150
+FGCS G D D A+ G+ G + +S ISQL S I + FS+CL GE
Sbjct: 204 VFGCSTSQTG-DLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCL-----KGEN 257
Query: 151 TSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
+ ++ P+ T + + P+ Y ++L IS++ + + P F T +G+
Sbjct: 258 GGGGILVLGEI--VEPNMVFTPLVPSQPH--YNVNLLSISVNGQALPINPSVFS-TSNGQ 312
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF-NRFP 268
G IID+G+ L Y Y E + + +S + CY + + + FP
Sbjct: 313 -GTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQ----CYVITTSVGDIFP 367
Query: 269 SMAFYFE-DANLRIDGENVFIIDYENHFFLLAV------APHDDLVALIGSQQQRDTRFV 321
++ F A++ ++ ++ I +N+ AV + + ++G +D FV
Sbjct: 368 PVSLNFAGGASMFLNPQDYLI--QQNNVGGTAVWCIGFQRIQNQGITILGDLVLKDKIFV 425
Query: 322 YDLNIDLLSFVKENCS 337
YDL + + +CS
Sbjct: 426 YDLVGQRIGWANYDCS 441
>gi|125579874|gb|EAZ21020.1| hypothetical protein OsJ_36669 [Oryza sativa Japonica Group]
Length = 382
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 66/239 (27%), Positives = 103/239 (43%), Gaps = 12/239 (5%)
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
AR A +G++GL R +S +SQ G+ +FSYCL N T +
Sbjct: 146 ARSMAPSGLMGLGRGRLSLVSQTGA---TKFSYCLTPYFHNNGATGHLFVGASASLGGHG 202
Query: 167 STQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSG----EGGCIIDSGSVL 220
T+F+ P FYYL L +++ R+ P FD+ GG IIDSGS
Sbjct: 203 DVMTTQFVKGPKGSPFYYLPLIGLTVGETRLPIPATVFDLREVAPGLFSGGVIIDSGSPF 262
Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFED-ANL 279
T D Y L + + +A D + LC + P++ F+F A++
Sbjct: 263 TSLVHDAYDALASELAARLNGSLVAPPPDADDG-ALCVARRDVGRVVPAVVFHFRGGADM 321
Query: 280 RIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+ E+ + +D +A A ++IG+ QQ++ R +YDL SF +CS
Sbjct: 322 AVPAESYWAPVDKAAACMAIASAGPYRRQSVIGNYQQQNMRVLYDLANGDFSFQPADCS 380
>gi|449485448|ref|XP_004157171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 430
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 102/392 (26%), Positives = 161/392 (41%), Gaps = 86/392 (21%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY-------------AIFDPRKSSSFQKIN------- 40
+V L IGTP + L+LDTGS L + + P+ +S ++
Sbjct: 67 VVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKIKKRLPPLPKPKTTSFDPSLSSSFSLLP 126
Query: 41 CDHPDCTY--------FKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHG 91
C+HP C C N C Y+ YAD ++ +G E + +
Sbjct: 127 CNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLSTPPV--- 183
Query: 92 ALFGC---SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPL--- 145
+ GC S +N G +LG++R +SFISQ + I K FSYC+
Sbjct: 184 -ILGCAQASTENRG------------ILGMNRGRLSFISQ--AKISK-FSYCVPSRTGSN 227
Query: 146 PNGEY------TSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPP 199
P G + SS K+ T + + P +Q++ ++ Y L +K I I +R+N PP
Sbjct: 228 PTGLFYLGDNPNSSKFKYVTMLTF--PESQSSPNLDP--LAYTLPMKAIKIAGKRLNVPP 283
Query: 200 DTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYF-----ERFQLAQLSDCPEPI 254
F G G +IDSGS LTY + Y K+ E+ V + + A ++D
Sbjct: 284 AAFKPDAGGSGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYADVAD----- 338
Query: 255 QLCY---FLPETFNRFPSMAFYFEDANLRI---DGENVFIIDYENHFFLLAVAPHDDL-- 306
+C+ E R ++F F D + I GE V + + E + + + L
Sbjct: 339 -MCFDAGVTAEVGRRIGGISFEF-DNGVEIFVGRGEGV-LTEVEKGVKCVGIGRSERLGI 395
Query: 307 -VALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+IG+ Q++ YDL + F CS
Sbjct: 396 GSNIIGTVHQQNMWVEYDLANKRVGFGGAECS 427
>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
Length = 462
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 90/375 (24%), Positives = 161/375 (42%), Gaps = 59/375 (15%)
Query: 6 IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC----- 46
+G+P + +LI+DTGS L + I+D +S+S++ + C++
Sbjct: 106 LGSPGQEAILIVDTGSELTWLQCLPCKVCAPSVDTIYDAARSASYRPVTCNNSQLCSNSS 165
Query: 47 --TYFKCV-NEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAI-FHGALFGCSND 99
TY C QC + Y D S + G + +T+ +V+G GK + FGC+
Sbjct: 166 QGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVG---GKPVTVQDFAFGCA-- 220
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF-- 157
G E GA +G+LGL+ ++ QLG +FS+C P + S+ + F
Sbjct: 221 -QGDLELVPTGA-SGILGLNAGKMALPMQLGQRFGWKFSHCF--PDRSSHLNSTGVVFFG 276
Query: 158 GTDMGYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
++ + + + N FY+++LK +SI++ + F P + I+D
Sbjct: 277 NAELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVFLPRGSVV--------ILD 328
Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP-----ETFNRFPSM 270
SGS + F + +L E F+ + D + C+ + E PS+
Sbjct: 329 SGSSFSSFVRPFHSQLREAFLKHRPPSLKHLEGDSFGDLGTCFKVSNDDIDELHRTLPSL 388
Query: 271 AFYFEDA-NLRIDGENVF--IIDYENHFFLLAVAPHD---DLVALIGSQQQRDTRFVYDL 324
+ FED + I V + ++NH + A D + V +IG+ QQ++ YD+
Sbjct: 389 SLVFEDGVTIGIPSIGVLLPVARFQNH-VKMCFAFEDGGPNPVNVIGNYQQQNLWVEYDI 447
Query: 325 NIDLLSFVKENCSDD 339
+ F + +C D
Sbjct: 448 QRSRVGFARASCVID 462
>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
Length = 539
Score = 73.9 bits (180), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 89/376 (23%), Positives = 155/376 (41%), Gaps = 59/376 (15%)
Query: 3 RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
+L +GTP + + +DTGS +++ FDP S + I+C
Sbjct: 84 KLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPISCSD 143
Query: 44 PDCTY--------FKCVNEQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGA 92
C++ N C YT +Y D S T GF + + ++G
Sbjct: 144 QRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPV 203
Query: 93 LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEY 150
+FGCS G D D A+ G+ G + +S ISQL S I + FS+CL GE
Sbjct: 204 VFGCSTSQTG-DLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCL-----KGEN 257
Query: 151 TSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
+ ++ P+ T + + P+ Y ++L IS++ + + P F T +G+
Sbjct: 258 GGGGILVLGEI--VEPNMVFTPLVPSQPH--YNVNLLSISVNGQALPINPSVFS-TSNGQ 312
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF-NRFP 268
G IID+G+ L Y Y E + + +S + CY + + + FP
Sbjct: 313 -GTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQ----CYVITTSVGDIFP 367
Query: 269 SMAFYFE-DANLRIDGENVFIIDYENHFFLLAV------APHDDLVALIGSQQQRDTRFV 321
++ F A++ ++ ++ I +N+ AV + + ++G +D FV
Sbjct: 368 PVSLNFAGGASMFLNPQDYLI--QQNNVGGTAVWCIGFQRIQNQGITILGDLVLKDKIFV 425
Query: 322 YDLNIDLLSFVKENCS 337
YDL + + +CS
Sbjct: 426 YDLVGQRIGWANYDCS 441
>gi|125575542|gb|EAZ16826.1| hypothetical protein OsJ_32298 [Oryza sativa Japonica Group]
Length = 396
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 92/369 (24%), Positives = 141/369 (38%), Gaps = 61/369 (16%)
Query: 6 IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYF-- 49
IGTP + I+D L++ +F P SS+F+ C C
Sbjct: 49 IGTPPQPASAIIDVAGELVWTQCSRCSRCFKQDLPLFIPNASSTFRPEPCGTDACKSTPT 108
Query: 50 -KCVNEQCVY---TMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
C + C Y T D+ T G ET ++ G A FGC + D
Sbjct: 109 SNCSGDVCTYESTTNIRLDRHTTLGIVGTETFAI---GTATASLA---FGCVVAS---DI 159
Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
D DG +G +GL R S ++Q+ +FSYCL P G SS L G+
Sbjct: 160 DTMDGT-SGFIGLGRTPRSLVAQMK---LTKFSYCLS---PRGTGKSSRLFLGSSAKLAG 212
Query: 166 -PSTQATKFI-----NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
ST FI + +++Y LSL I N + T G ++ + S
Sbjct: 213 GESTSTAPFIKTSPDDDSHHYYLLSLDAIRAGNTTI--------ATAQSGGILVMHTVSP 264
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRF--PSMAFYFEDA 277
+ Y + ++ P+P LC+ F+R P + F F+ A
Sbjct: 265 FSLLVDSAYRAFKKAVTEAVGGAAEQPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGA 324
Query: 278 NLRIDGENVFIIDYENH-----FFLLAVAPHD----DLVALIGSQQQRDTRFVYDLNIDL 328
++ID +L++A + + V+++GS QQ D F+YDL +
Sbjct: 325 AALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKET 384
Query: 329 LSFVKENCS 337
LSF +CS
Sbjct: 385 LSFEPADCS 393
>gi|56784779|dbj|BAD82000.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 486
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 92/359 (25%), Positives = 142/359 (39%), Gaps = 56/359 (15%)
Query: 6 IGTPSKGVLLILDTGSALIYAIFDPRKSSSFQKINCDHPDCTY--------------FKC 51
+GTP + V +LD S ++ + S+ D P T +C
Sbjct: 103 VGTPPQVVTGVLDITSDFVW-----MQCSACATCGADAPAATSAPPFYAFLSSTIREVRC 157
Query: 52 VNEQC---VYTMKYADQS------VTKGFAAHETISVIGKGE---GKAIFHGALFGCSND 99
N C V AD S V G AA+ T ++ G +FGC+
Sbjct: 158 ANRGCQRLVPQTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAFATVRADGVIFGCAV- 216
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY-TSSYLKFG 158
A +G + GV+GL R +S +SQL RFSY L P+ S++ F
Sbjct: 217 -------ATEGDIGGVIGLGRGELSPVSQLQ---IGRFSYYLA---PDDAVDVGSFILFL 263
Query: 159 TDMGYRRPSTQATKFI--NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
D R +T + + YY+ L I +D E + P TFD+ G GG ++
Sbjct: 264 DDAKPRTSRAVSTPLVASRASRSLYYVELAGIRVDGEDLAIPRGTFDLQADGSGGVVLSI 323
Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPE-TFNRFPSMAFYFE 275
+T+ + Y + + S E + A S+ + LCY + PSMA F
Sbjct: 324 TIPVTFLDAGAYKVVRQAMASKIE-LRAADGSEL--GLDLCYTSESLATAKVPSMALVFA 380
Query: 276 -DANLRIDGENVFIIDYENHFFLLAV--APHDDLVALIGSQQQRDTRFVYDLNIDLLSF 331
A + ++ N F +D L + +P D +L+GS Q T +YD++ L F
Sbjct: 381 GGAVMELEMGNYFYMDSTTGLECLTILPSPAGD-GSLLGSLIQVGTHMIYDISGSRLVF 438
>gi|357491933|ref|XP_003616254.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517589|gb|AES99212.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 442
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 74/274 (27%), Positives = 114/274 (41%), Gaps = 61/274 (22%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYAIFDPRKSSSFQK-----------------INCDH 43
+V L IGTP + ++LDTGS L + +K+ ++ + C+H
Sbjct: 83 VVTLPIGTPPQLQQMVLDTGSQLSWIQCHNKKTPQKKQPPTTSSFDPSLSSSFFVLPCNH 142
Query: 44 PDCTY--------FKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALF 94
P C C N C Y+ YAD + +G E I+ I +
Sbjct: 143 PLCKPRVPDFSLPTDCDANSLCHYSYFYADGTYAEGNLVREKIAFSPSQTTPPI----IL 198
Query: 95 GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY 154
GC+ + +DAR G+LG++ + F SQ I K FSYC +P + S
Sbjct: 199 GCATQS----DDAR-----GILGMNLGRLGFPSQ-AKITK--FSYC--VPTKQAQPASGS 244
Query: 155 LKFGTDMGYRRPSTQATKFIN---------HPNN---FYYLSLKDISIDNERMNFPPDTF 202
G + P++ + +++N PN Y L L+ ISI +++N PP F
Sbjct: 245 FYLGNN-----PASSSFRYVNLLTFGQSQRMPNLDPLAYTLPLQGISIGGKKLNIPPSVF 299
Query: 203 DITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFV 236
G G +IDSGS TY + Y + E+ V
Sbjct: 300 KPNAGGSGQTMIDSGSEFTYLVDEAYNVIREELV 333
>gi|449495082|ref|XP_004159729.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
1-like [Cucumis sativus]
Length = 524
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 90/370 (24%), Positives = 141/370 (38%), Gaps = 69/370 (18%)
Query: 6 IGTPSKGVLLILDTGSALIYA---------------------IFDPRKSSSFQKINCDHP 44
+GTP L+ LDTGS L + I+ P SS+ +++ C
Sbjct: 113 VGTPGVPYLVALDTGSDLFWLPCDCVNCITGLNTTQGPVNFNIYSPNNSSTSKEVQCSSS 172
Query: 45 DCTYF-KCVN--EQCVYTMKY-ADQSVTKGFAAHETISVIGKG-EGKAIFHGALFGCSND 99
C++ +C + + C Y + Y +D + + G+ + + + + K + GC D
Sbjct: 173 LCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHLTTNDVQSKPVNARITLGCGKD 232
Query: 100 NHG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
G F A L G LG+ V++ I +I FS C G ++FG
Sbjct: 233 QSGAFLSSAAPNGLFG-LGIENVSVPSILANAGLISNSFSLCF------GPARMGRIEFG 285
Query: 159 TDMGYRRPSTQATKF---INHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
D G P T F HP Y +S+ I + + D+ V I D
Sbjct: 286 -DKG--SPGQNETPFNLGRRHPT--YNVSITQIGVGGHISDL-----DVAV------IFD 329
Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE 275
SG+ TY + Y +KF S E Q SD P + CY L F +
Sbjct: 330 SGTSFTYLNDPAYSLFADKFASMVEEKQFTMNSDI--PFENCYELSPN-----QTTFTYP 382
Query: 276 DANLRIDGENVFIIDY--------ENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNID 327
NL + G F+I++ F LA+A D + +IG V+D
Sbjct: 383 LMNLTMKGGGHFVINHPIVLISTESKRLFCLAIA-RSDSINIIGQNFMTGYHIVFDREKM 441
Query: 328 LLSFVKENCS 337
+L + + NC+
Sbjct: 442 VLGWKESNCT 451
>gi|298204765|emb|CBI25263.3| unnamed protein product [Vitis vinifera]
Length = 359
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 73/291 (25%), Positives = 110/291 (37%), Gaps = 75/291 (25%)
Query: 56 CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGV 115
C Y + Y D S T+G HE + G + +FGC +N G G ++G+
Sbjct: 133 CNYAINYGDGSFTRGELGHEKLKF-----GTILVKDFIFGCGRNNKGLF-----GGVSGL 182
Query: 116 LGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFIN 175
+GL R +S ISQ E Y
Sbjct: 183 MGLGRSDLSLISQ------------------TSENPQLY--------------------- 203
Query: 176 HPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKF 235
NFY+++L ISI + P G ++DSG+V+T +Y L +F
Sbjct: 204 ---NFYFINLTGISIGGVALQAPS-------VGPSRILVDSGTVITRLPPTIYKALKAEF 253
Query: 236 VSYFERFQLAQLSDCPEP----IQLCYFLPETFN-RFPSMAFYFE-DANLRIDGENVFII 289
+ F F P P + C+ L P++ +FE +A L +D VF
Sbjct: 254 LKQFTGFP-------PAPAFSILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYF 306
Query: 290 ---DYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
D LA + D VA++G+ QQ++ R +YD + F E CS
Sbjct: 307 VKSDASQVCLALASLEYQDEVAILGNYQQKNLRVIYDTKETKVGFALETCS 357
>gi|255587337|ref|XP_002534234.1| pepsin A, putative [Ricinus communis]
gi|223525662|gb|EEF28148.1| pepsin A, putative [Ricinus communis]
Length = 468
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 103/404 (25%), Positives = 154/404 (38%), Gaps = 89/404 (22%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY----------------------AIFDPRKSSSFQKI 39
+ L +GTPS+ V LI+DTGS+L++ F PR SSS + I
Sbjct: 86 MSLSLGTPSQTVKLIMDTGSSLVWFPCTSRYVCASCNFPNTDITKIPKFMPRLSSSSKLI 145
Query: 40 NCDHPDCTYF-------KCVN-----EQCV-----YTMKYADQSVTKGFAAHETISVIGK 82
C +P C + KC N + C Y ++Y S T G ETI+ K
Sbjct: 146 GCKNPKCAWVFGSSVQSKCHNCNPQAQNCTQACPPYIIQYGLGS-TAGLLLSETINFPNK 204
Query: 83 GEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV 142
L GCS + E G+ G R S QLG K+FSYCLV
Sbjct: 205 -----TISDFLAGCSLLSTRQPE--------GIAGFGRSQESLPLQLG---LKKFSYCLV 248
Query: 143 IPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHP-------------NNFYYLSLKDIS 189
+ SS L DMG ++ T P +YY+ L+ I
Sbjct: 249 SRRFDDSPVSSDLIL--DMGPSTSDSKTTGLSYTPFQKNLASQSNPAFQEYYYVMLRKII 306
Query: 190 IDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSD 249
+ + P G GG I+DSGS T+ V+ L ++F + +A
Sbjct: 307 VGKTHVKVPYSFLVPGSDGNGGTIVDSGSTFTFVEGHVFELLAKEFEKQMANYTVATNVQ 366
Query: 250 CPEPIQLCYFLP-ETFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLV 307
++ C+ + E P + F F+ A +++ N F L V+ D+
Sbjct: 367 KLTGLRPCFDISGEKSVVIPDLTFQFKGGAKMQLPLSNYFAFVDMGVVCLTIVS--DNAA 424
Query: 308 AL--------------IGSQQQRDTRFVYDLNIDLLSFVKENCS 337
AL +G+ QQ++ YDL D F +++C+
Sbjct: 425 ALGGDGGVRSSGPAIILGNFQQQNFYIEYDLENDRFGFKEQSCA 468
>gi|357160409|ref|XP_003578755.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 373
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 78/370 (21%), Positives = 150/370 (40%), Gaps = 64/370 (17%)
Query: 6 IGTPSKGVLLILDTGSALIYA------------------IFDPRKSSSFQKINCDHPDCT 47
+GTP+ L+ +DTGS + + F+ SS+++++ C C
Sbjct: 29 LGTPAVFNLVTIDTGSTISWVQCQYCIVHCYTQDQRAGPTFNTSSSSTYRRVGCSAQVCH 88
Query: 48 YFK--------CVNEQ--CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
CV E+ C+Y+++YA + G+ + + +++ + +FGC
Sbjct: 89 DMHVSQNIPSGCVEEEDSCIYSLRYASGEYSAGYLSQDRLTLANSYSIQKF----IFGCG 144
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIK-KRFSYCLVIPLPNGEYTSSYLK 156
+DN +G AG++G + SF +Q+ + FSYC P+ + +L
Sbjct: 145 SDNR------YNGHSAGIIGFGNKSYSFFNQIAQLTNYSAFSYC----FPSNQENEGFLS 194
Query: 157 FGTDMGYRRPSTQ--ATKFINHPNNF--YYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
G Y R S + T+ ++ + Y L D+ ++ R+ P + ++
Sbjct: 195 IGP---YVRDSNKLILTQLFDYGAHLPVYALQQFDMMVNGMRLQVDPPVYTTRMT----- 246
Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL---PETFNRFPS 269
++DSG+V T+ S V+ L + SD E +C+ +++ P
Sbjct: 247 VVDSGTVETFVLSPVFRALDRALTKAMVAEGYVRGSDSKE---ICFHSNGDSVDWSKLPV 303
Query: 270 MAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHD---DLVALIGSQQQRDTRFVYDLNI 326
+ F + L++ ENVF + + P D V ++G++ R R V+D+
Sbjct: 304 VEIKFSRSILKLPAENVFYYETSDGSICSTFQPDDAGVPGVQILGNRATRSFRVVFDIQQ 363
Query: 327 DLLSFVKENC 336
F C
Sbjct: 364 RNFGFEAGAC 373
>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 493
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 93/381 (24%), Positives = 145/381 (38%), Gaps = 69/381 (18%)
Query: 3 RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
++ +GTP + +DTGS +++ FDP SS+ I C
Sbjct: 81 KVQLGTPPVEFNVQIDTGSDVLWVSCNSCNGCPQTSGLQIQLNFFDPGSSSTSSMIACSD 140
Query: 44 PDCTYFK--------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL-- 93
C K N QC YT +Y D S T G+ + + + IF G++
Sbjct: 141 QRCNNGKQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHL------NTIFEGSMTT 194
Query: 94 -------FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIP 144
FGCSN G D D A+ G+ G + +S ISQL S I + FS+CL
Sbjct: 195 NSTAPVVFGCSNQQTG-DLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRIFSHCLKGD 253
Query: 145 LPNGEYTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFD 203
G L G + P+ T + P+ Y L+L+ IS++ + + F
Sbjct: 254 SSGG----GILVLGEIV---EPNIVYTSLVPAQPH--YNLNLQSISVNGQTLQIDSSVF- 303
Query: 204 ITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET 263
S G I+DSG+ L Y + Y + FVS + CY + +
Sbjct: 304 -ATSNSRGTIVDSGTTLAYLAEEAY----DPFVSAITAAIPQSVRTVVSRGNQCYLITSS 358
Query: 264 F-NRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAV------APHDDLVALIGSQQQR 316
+ FP ++ F I ++I +N AV + ++G +
Sbjct: 359 VTDVFPQVSLNFAGGASMILRPQDYLIQ-QNSIGGAAVWCIGFQKIQGQGITILGDLVLK 417
Query: 317 DTRFVYDLNIDLLSFVKENCS 337
D VYDL + + +CS
Sbjct: 418 DKIVVYDLAGQRIGWANYDCS 438
>gi|414887402|tpg|DAA63416.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
Length = 407
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 68/255 (26%), Positives = 109/255 (42%), Gaps = 39/255 (15%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
RL+IGTP + LI+D+GS + Y F P SSS+ + C+ DCT
Sbjct: 91 TRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSSYSPVKCNV-DCT 149
Query: 48 YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG--FDE 105
+QC Y +YA+ S + G + +S + E KA A+FGC N G F +
Sbjct: 150 -CDSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRESELKA--QRAVFGCENSETGDLFSQ 206
Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCL-VIPLPNGEYTSSYLKFGTDMG 162
A G++GL R +S + QL +I FS C + + G + +DM
Sbjct: 207 HA-----DGIMGLGRGQLSIMDQLVEKGVINDSFSLCYGGMDIGGGAMVLGGVPTPSDMV 261
Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
+ R + + P +Y + LK+I + + + FD + G ++DSG+ Y
Sbjct: 262 FSR-----SDPLRSP--YYNIELKEIHVAGKALRVDSRIFD----SKHGTVLDSGTTYAY 310
Query: 223 FHSDVYWKLHEKFVS 237
+ + S
Sbjct: 311 LPEQAFMAFKDAVTS 325
>gi|297843774|ref|XP_002889768.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
gi|297335610|gb|EFH66027.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 82/365 (22%), Positives = 148/365 (40%), Gaps = 51/365 (13%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY-------------AIFDPRKSSSFQKINCDHPDCT 47
+VR +GTP + + ++LDT + ++ F+ SS++ ++C CT
Sbjct: 106 VVRARLGTPPQLMFMVLDTSNDAVWLPCSGCSGCSNASTSFNTNSSSTYSTVSCSTTQCT 165
Query: 48 YFK---CVNEQ-----CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
+ C + C + Y S +T+++ + FGC N
Sbjct: 166 QARGLTCPSSTPQPSICSFNQSYGGDSSFSANLVQDTLTL-----SPDVIPNFSFGCINS 220
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
G + G++GL R +S +SQ S+ FSYCL P Y S LK G
Sbjct: 221 ASGNSLPPQ-----GLMGLGRGPMSLVSQTTSLYSGVFSYCL--PSFRSFYFSGSLKLGL 273
Query: 160 DMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
+G + S + T + +P + YY++L +S+ + ++ P + G IIDSG
Sbjct: 274 -LGQPK-SIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDSNSGAGTIIDSG 331
Query: 218 SVLTYFHSDVYWKLHEKFVSYFER--FQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE 275
+V+T F VY + ++F L C F + N P + +
Sbjct: 332 TVITRFAQPVYEAIRDEFRKQVNGSFSTLGAFDTC--------FSADNENVTPKITLHMT 383
Query: 276 DANLRIDGENVFIIDYENHFFLLAVAP----HDDLVALIGSQQQRDTRFVYDLNIDLLSF 331
+L++ EN I L++A + ++ +I + QQ++ R ++D+ +
Sbjct: 384 SLDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGI 443
Query: 332 VKENC 336
E C
Sbjct: 444 APEPC 448
>gi|255549236|ref|XP_002515672.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223545215|gb|EEF46724.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 492
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 87/376 (23%), Positives = 153/376 (40%), Gaps = 60/376 (15%)
Query: 3 RLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCDH 43
++ IGTPSK + +DTGS +++ +++ + S S + + CD
Sbjct: 89 KVGIGTPSKDYYVQVDTGSDIMWVNCIQCRECPRTSSLGMELTLYNIKDSVSGKLVPCDE 148
Query: 44 PDCTYFK-------CVNEQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGAL 93
C N C Y Y D S T G+ + + V G + + +
Sbjct: 149 EFCYEVNGGPLSGCTANMSCPYLEIYGDGSSTAGYFVKDVVQYDRVSGDLQTTSSNGSVI 208
Query: 94 FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYT 151
FGC G + AL G+LG + S ISQL + +KK F++CL +G
Sbjct: 209 FGCGARQSGDLGPTSEEALDGILGFGKSNSSMISQLAATRKVKKIFAHCL-----DGING 263
Query: 152 SSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
G + +P T I N P+ Y +++ + + + ++ P + F+
Sbjct: 264 GGIFAIGHVV---QPKVNMTPLIPNQPH--YNVNMTAVQVGEDFLHLPTEEFE--AGDRK 316
Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSM 270
G IIDSG+ L Y VY L K +S ++ + D Q + + FP++
Sbjct: 317 GAIIDSGTTLAYLPEIVYEPLVSKIISQQPDLKVHIVRDEYTCFQ---YSGSVDDGFPNV 373
Query: 271 AFYFEDAN-LRIDG-------ENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFV 321
F+FE++ L++ E ++ I ++N + D + L+G + +
Sbjct: 374 TFHFENSVFLKVHPHEYLFPFEGLWCIGWQNS----GMQSRDRRNMTLLGDLVLSNKLVL 429
Query: 322 YDLNIDLLSFVKENCS 337
YDL + + + NCS
Sbjct: 430 YDLENQAIGWTEYNCS 445
>gi|115484503|ref|NP_001065913.1| Os11g0183900 [Oryza sativa Japonica Group]
gi|62954888|gb|AAY23257.1| nucellin-like aspartic protease [Oryza sativa Japonica Group]
gi|77549017|gb|ABA91814.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644617|dbj|BAF27758.1| Os11g0183900 [Oryza sativa Japonica Group]
gi|222615638|gb|EEE51770.1| hypothetical protein OsJ_33210 [Oryza sativa Japonica Group]
Length = 418
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 90/372 (24%), Positives = 143/372 (38%), Gaps = 59/372 (15%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPDC 46
V + IG P+K L +DTGS L + ++ P K+ + + C + C
Sbjct: 59 VTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPLYRPTKN---KLVPCANSIC 115
Query: 47 TYF--------KCV-NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
T KC +QC Y +KY D++ + G ++ S+ + + + FGC
Sbjct: 116 TALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVMDSFSLPLRNKSN-VRPSLSFGCG 174
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLG--SIIKKRFSYCLVIPLPNGEYTSSYL 155
D A G+LGL R ++S +SQL I K +CL +L
Sbjct: 175 YDQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCL------STSGGGFL 228
Query: 156 KFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
FG DM T + + N+Y + D ++ P + D
Sbjct: 229 FFGDDMVPTSRVTWVSMVRSTSGNYYSPGSATLYFDRRSLSTKPMEV----------VFD 278
Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-------FP 268
SGS TYF + Y + L Q+SD P LC+ + F F
Sbjct: 279 SGSTYTYFSAQPYQATISAIKGSLSK-SLKQVSDPSLP--LCWKGQKAFKSVSDVKKDFK 335
Query: 269 SMAFYF-EDANLRIDGENVFIIDYENHFFL--LAVAPHDDLVALIGSQQQRDTRFVYDLN 325
S+ F F ++A + I EN II + L L + ++IG +D +YD
Sbjct: 336 SLQFIFGKNAVMDIPPENYLIITKNGNVCLGILDGSAAKLSFSIIGDITMQDQMVIYDNE 395
Query: 326 IDLLSFVKENCS 337
L +++ +CS
Sbjct: 396 KAQLGWIRGSCS 407
>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
Length = 410
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 88/355 (24%), Positives = 141/355 (39%), Gaps = 59/355 (16%)
Query: 6 IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYF-- 49
IGTP + + + DTGS LI+A + P KSSSF K+ C C+
Sbjct: 88 IGTPPQELSALADTGSDLIWAKCGACTRCVPQGSPSYYPNKSSSFSKLPCSGSLCSDLPS 147
Query: 50 -KCV--NEQCVYTMKYADQS----VTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
+C +C Y Y S T+G+ ET ++ G G FGC+ + G
Sbjct: 148 SQCSAGGAECDYKYSYGLASDPHHYTQGYLGSETFTL-----GSDAVPGIGFGCTTMSEG 202
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
R +S +SQL FSYCL + +S L FG+
Sbjct: 203 GYGSGSGLVGL-----GRGPLSLVSQLN---VGAFSYCLT----SDAAKTSPLLFGSG-A 249
Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
Q+T + +Y ++L+ ISI +G G I DSG+ + +
Sbjct: 250 LTGAGVQSTPLLRTSTYYYTVNLESISIGAAT---------TAGTGSSGIIFDSGTTVAF 300
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRID 282
Y E +S +A D E +C+ + FPSM +F+ ++ +
Sbjct: 301 LAEPAYTLAKEAVLSQTTNLTMASGRDGYE---VCF--QTSGAVFPSMVLHFDGGDMDLP 355
Query: 283 GENVF-IIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
EN F +D +++ +P ++++G+ Q + YD+ +LSF NC
Sbjct: 356 TENYFGAVDDSVSCWIVQKSPS---LSIVGNIMQMNYHIRYDVEKSMLSFQPANC 407
>gi|218189440|gb|EEC71867.1| hypothetical protein OsI_04576 [Oryza sativa Indica Group]
Length = 508
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 90/359 (25%), Positives = 140/359 (38%), Gaps = 56/359 (15%)
Query: 6 IGTPSKGVLLILDTGSALIYAIFDPRKSSSFQKINCDHPDCTY--------------FKC 51
+GTP + V +LD S ++ + S+ D P T +C
Sbjct: 103 VGTPPQVVTGVLDITSDFVW-----MQCSACATCGADAPAATSAPPFYAFLSSTIREVRC 157
Query: 52 VNEQC---VYTMKYADQS------VTKGFAAHETISVIGKGE---GKAIFHGALFGCSND 99
N C V AD S V G AA+ T ++ G +FGC+
Sbjct: 158 ANRGCQRLVPQTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAFATVRADGVIFGCAV- 216
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY-TSSYLKFG 158
A +G + GV+GL R +S +SQL RFSY L P+ S++ F
Sbjct: 217 -------ATEGDIGGVIGLGRGELSLVSQL---QIGRFSYYLA---PDDAVDVGSFILFL 263
Query: 159 TDMGYRRPSTQATKFINH--PNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
D R +T + + + YY+ L I +D E + P TFD+ G GG ++
Sbjct: 264 DDAKPRTSRAVSTPLVANRASRSLYYVELAGIRVDGEDLAIPRGTFDLQADGSGGVVLSI 323
Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPE-TFNRFPSMAFYFE 275
+T+ + Y + + S + L + LCY + PSMA F
Sbjct: 324 TIPVTFLDAGAYKVVRQAMAS---KIGLRAADGSELGLDLCYTSESLATAKVPSMALVFA 380
Query: 276 -DANLRIDGENVFIIDYENHFFLLAV--APHDDLVALIGSQQQRDTRFVYDLNIDLLSF 331
A + ++ N F +D L + +P D +L+GS Q T +YD++ L F
Sbjct: 381 GGAVMELEMGNYFYMDSTTGLECLTILPSPAGD-GSLLGSLIQVGTHMIYDISGSRLVF 438
>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 486
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 93/384 (24%), Positives = 155/384 (40%), Gaps = 75/384 (19%)
Query: 3 RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
++ +GTP K + +DTGS +++ FD SS+ I C
Sbjct: 81 KVKMGTPPKEFNVQIDTGSDILWVNCNTCSNCPQSSQLGIELNFFDTVGSSTAALIPCSD 140
Query: 44 PDCT------YFKC---VNEQCVYTMKYADQSVTKGFAAHETI--SVIGKGEGKAIFHGA 92
P CT +C VN QC YT +Y D S T G+ + + S+I G+ A+ A
Sbjct: 141 PICTSRVQGAAAECSPRVN-QCSYTFQYGDGSGTSGYYVSDAMYFSLI-MGQPPAVNSSA 198
Query: 93 --LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNG 148
+FGCS G D D A+ G+ G +S +SQL S I K FS+CL
Sbjct: 199 TIVFGCSISQSG-DLTKTDKAVDGIFGFGPGPLSVVSQLSSRGITPKVFSHCL------- 250
Query: 149 EYTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVS 207
+ PS + + + P+ Y L+L+ I+++ + + P F I+ +
Sbjct: 251 KGDGDGGGVLVLGEILEPSIVYSPLVPSQPH--YNLNLQSIAVNGQLLPINPAVFSIS-N 307
Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF-NR 266
GG I+D G+ L Y + Y L + + S + CY + + +
Sbjct: 308 NRGGTIVDCGTTLAYLIQEAYDPLVTAINTAVSQSARQTNSKGNQ----CYLVSTSIGDI 363
Query: 267 FPSMAFYFEDA-------------NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQ 313
FPS++ FE N +DG ++ I ++ + +++G
Sbjct: 364 FPSVSLNFEGGASMVLKPEQYLMHNGYLDGAEMWCIGFQK---------FQEGASILGDL 414
Query: 314 QQRDTRFVYDLNIDLLSFVKENCS 337
+D VYD+ + + +CS
Sbjct: 415 VLKDKIVVYDIAQQRIGWANYDCS 438
>gi|449456843|ref|XP_004146158.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 547
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 90/370 (24%), Positives = 141/370 (38%), Gaps = 69/370 (18%)
Query: 6 IGTPSKGVLLILDTGSALIYA---------------------IFDPRKSSSFQKINCDHP 44
+GTP L+ LDTGS L + I+ P SS+ +++ C
Sbjct: 136 VGTPGVPYLVALDTGSDLFWLPCDCVNCITGLNTTQGPVNFNIYSPNNSSTSKEVQCSSS 195
Query: 45 DCTYF-KCVN--EQCVYTMKY-ADQSVTKGFAAHETISVIGKG-EGKAIFHGALFGCSND 99
C++ +C + + C Y + Y +D + + G+ + + + + K + GC D
Sbjct: 196 LCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHLTTNDVQSKPVNARITLGCGKD 255
Query: 100 NHG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
G F A L G LG+ V++ I +I FS C G ++FG
Sbjct: 256 QSGAFLSSAAPNGLFG-LGIENVSVPSILANAGLISNSFSLCF------GPARMGRIEFG 308
Query: 159 TDMGYRRPSTQATKF---INHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
D G P T F HP Y +S+ I + + D+ V I D
Sbjct: 309 -DKG--SPGQNETPFNLGRRHPT--YNVSITQIGVGGHISDL-----DVAV------IFD 352
Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE 275
SG+ TY + Y +KF S E Q SD P + CY L F +
Sbjct: 353 SGTSFTYLNDPAYSLFADKFASMVEEKQFTMNSDI--PFENCYELSPN-----QTTFTYP 405
Query: 276 DANLRIDGENVFIIDY--------ENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNID 327
NL + G F+I++ F LA+A D + +IG V+D
Sbjct: 406 LMNLTMKGGGHFVINHPIVLISTESKRLFCLAIA-RSDSINIIGQNFMTGYHIVFDREKM 464
Query: 328 LLSFVKENCS 337
+L + + NC+
Sbjct: 465 VLGWKESNCT 474
>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 87/379 (22%), Positives = 154/379 (40%), Gaps = 67/379 (17%)
Query: 3 RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
++ +G+P + + +DTGS +++ FD SS+ ++ C
Sbjct: 69 KVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGQVRCSD 128
Query: 44 PDCT------YFKCVNE--QCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGA 92
P CT +C ++ QC YT +Y D S T G+ +T+ +++G+
Sbjct: 129 PICTSAVQTTATQCSSQTDQCSYTFQYGDGSGTSGYYVSDTLYFDAILGQSLIDNSSALI 188
Query: 93 LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEY 150
+FGCS G D D A+ G+ G + +S ISQL + I + FS+CL G
Sbjct: 189 VFGCSAYQSG-DLTKTDKAVDGIFGFGQGELSVISQLSTRGITPRVFSHCLKGDGSGG-- 245
Query: 151 TSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
L G + P + + + P+ Y L+L I+++ + + P F S
Sbjct: 246 --GILVLGEIL---EPGIVYSPLVPSQPH--YNLNLLSIAVNGQLLPIDPAAF--ATSNS 296
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-FP 268
G I+DSG+ L Y ++ Y + FVS ++ CY + + ++ FP
Sbjct: 297 QGTIVDSGTTLAYLVAEAY----DPFVSAVNAIVSPSVTPITSKGNQCYLVSTSVSQMFP 352
Query: 269 SMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDL----------VALIGSQQQRDT 318
+F F G ++ E++ + + V ++G +D
Sbjct: 353 LASFNFA-------GGASMVLKPEDYLIPFGSSGGSAMWCIGFQKVQGVTILGDLVLKDK 405
Query: 319 RFVYDLNIDLLSFVKENCS 337
FVYDL + + +CS
Sbjct: 406 IFVYDLVRQRIGWANYDCS 424
>gi|84453222|dbj|BAE71208.1| hypothetical protein [Trifolium pratense]
gi|84453226|dbj|BAE71210.1| hypothetical protein [Trifolium pratense]
Length = 437
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 85/359 (23%), Positives = 151/359 (42%), Gaps = 44/359 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY-----------AIFDPRKSSSFQKINCDHPDCTYF 49
+VR+ IGTP + + ++LDT + + F P S+SF ++C P C
Sbjct: 99 VVRVKIGTPGQLLFMVLDTSTDEAFVPSSGCIGCSATTFYPNVSTSFVPLDCSVPQCGQV 158
Query: 50 KCVN------EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
+ ++ C + YA + + +++ + + FG N G
Sbjct: 159 RGLSCPATGSGACSFNQSYAGSTFSATLV-QDSLRL-----ATDVIPSYSFGSINAISGS 212
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
A+ R +S +SQ G+I FSYCL P Y S LK G +G
Sbjct: 213 SVPAQGLLGL-----GRGPLSLLSQSGAIYSGVFSYCL--PSFKSYYFSGSLKLG-PVGQ 264
Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
+ S + T +++P+ + YY++L IS+ + P + S G IIDSG+V+T
Sbjct: 265 PK-SIRTTPLLHNPHRPSLYYVNLTAISVGRVYVPLPSELLAFNPSTGAGTIIDSGTVIT 323
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRI 281
F +Y + ++F R Q+ C F+ P++ +F D +L++
Sbjct: 324 RFVEPIYNAVRDEF-----RKQVTGPFSSLGAFDTC-FVKNYETLAPAITLHFTDLDLKL 377
Query: 282 DGENVFIIDYENHFFLLAVAPH----DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
EN I LA+A + ++ +I + QQ++ R ++D + + +E C
Sbjct: 378 PLENSLIHSSSGSLACLAMAAAPSNVNSVLNVIANFQQQNLRVLFDTVNNKVGIARELC 436
>gi|297720449|ref|NP_001172586.1| Os01g0776900 [Oryza sativa Japonica Group]
gi|255673740|dbj|BAH91316.1| Os01g0776900 [Oryza sativa Japonica Group]
Length = 381
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 71/261 (27%), Positives = 111/261 (42%), Gaps = 46/261 (17%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAI-------------------FDPRKSSSFQKINCD 42
R+ +G+P K + +DTGS +++ F+P SS+ KI C
Sbjct: 93 TRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIPCS 152
Query: 43 HPDCTYFKCVNEQ---------CVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFH 90
CT +E C YT Y D S T G+ +T+ +V+G +
Sbjct: 153 DDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTANSSA 212
Query: 91 GALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNG 148
+FGCSN G D D A+ G+ G + +S +SQL S + K FS+CL
Sbjct: 213 SIVFGCSNSQSG-DLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL----KGS 267
Query: 149 EYTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVS 207
+ L G + P T + + P+ Y L+L+ I ++ +++ P D+ T S
Sbjct: 268 DNGGGILVLGEIV---EPGLVYTPLVPSQPH--YNLNLESIVVNGQKL--PIDSSLFTTS 320
Query: 208 GEGGCIIDSGSVLTYFHSDVY 228
G I+DSG+ L Y Y
Sbjct: 321 NTQGTIVDSGTTLAYLADGAY 341
>gi|296082172|emb|CBI21177.3| unnamed protein product [Vitis vinifera]
Length = 376
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 46/151 (30%), Positives = 72/151 (47%), Gaps = 32/151 (21%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
+V + +GTP + + I DTGS L + IF+P KS+S+ I+C P
Sbjct: 139 VVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARYCYHQQEPIFNPSKSTSYTNISCSSPT 198
Query: 46 CTYFK--------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
C K C CVY ++Y DQS + GF A + +++ +F+ LFGC
Sbjct: 199 CDELKSGTGNSPSCSASTCVYGIQYGDQSYSVGFFAQDKLALT----STDVFNNFLFGCG 254
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQ 128
+N G +AG++GL R +S +S+
Sbjct: 255 QNNRGLFV-----GVAGLIGLGRNALSLMSK 280
>gi|413944596|gb|AFW77245.1| hypothetical protein ZEAMMB73_545774 [Zea mays]
gi|414876929|tpg|DAA54060.1| TPA: hypothetical protein ZEAMMB73_875469 [Zea mays]
Length = 459
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 88/366 (24%), Positives = 146/366 (39%), Gaps = 70/366 (19%)
Query: 6 IGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHPDCTYF 49
+GTP + + + DTGS LI+A + P SS+F K+ C C+
Sbjct: 97 MGTPPQKLTALADTGSDLIWAKCGGACTTSCEPQGSPSYLPNASSTFAKLPCSDRLCSLL 156
Query: 50 K--------CVNEQCVYTMKYA----DQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
+ +C Y Y D T+GF A ET ++ G FGC+
Sbjct: 157 RSDSVAWCAAAGAECDYRYSYGLGDDDHHYTQGFLARETFTL-----GADAVPSVRFGCT 211
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
+ G R +S +SQL + F YCL + +S L F
Sbjct: 212 TASEGGYGSGSGLVGL-----GRGPLSLVSQLNA---STFMYCLT----SDASKASPLLF 259
Query: 158 GTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEG---GCII 214
G+ Q+T + FY ++L+ ISI + T G G G +
Sbjct: 260 GSLASLTGAQVQSTGLLAS-TTFYAVNLRSISIGSA-----------TTPGVGEPEGVVF 307
Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPE----TFNRFPSM 270
DSG+ LTY Y + F+S + L Q+ D + + C+ P + P+M
Sbjct: 308 DSGTTLTYLAEPAYSEAKAAFLS---QTSLDQVEDT-DGFEACFQKPANGRLSNAAVPTM 363
Query: 271 AFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
+F+ A++ + N ++++ E+ V L ++IG+ Q + ++D++ +LS
Sbjct: 364 VLHFDGADMALPVAN-YVVEVEDGVVCWIVQRSPSL-SIIGNIMQVNYLVLHDVHRSVLS 421
Query: 331 FVKENC 336
F NC
Sbjct: 422 FQPANC 427
>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 491
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 75/335 (22%), Positives = 125/335 (37%), Gaps = 54/335 (16%)
Query: 27 IFDPRKSSSFQKINCDHPDCTYF-------KCVNEQCVYTMKYADQSVTKGFAAHETISV 79
++DP KSSS C P C +QC Y ++Y D S + G + +++
Sbjct: 186 LYDPSKSSSSAAFPCSSPACRNLGPYANGCTPAGDQCQYRVQYPDGSASAGTYISDVLTL 245
Query: 80 IGKGEGKAIFHGALFGCSNDNHGFDEDAR-DGALAGVLGLSRVTISFISQLGSIIKKRFS 138
AI FGCS H + +G++ L R S +Q + FS
Sbjct: 246 NPAKPASAISE-FRFGCS---HALLQPGSFSNKTSGIMALGRGAQSLPTQTKATYGDVFS 301
Query: 139 YCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHP-------NNFYYLSLKDISID 191
YC LP S + G P A+++ P Y + L I +
Sbjct: 302 YC----LPPTPVHSGFFILGV------PRVAASRYAVTPMLRSKAAPMLYLVRLIAIEVA 351
Query: 192 NERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP 251
+R+ PP F G ++DS +++T Y L FV+ ++ A
Sbjct: 352 GKRLPVPPAVF------AAGAVMDSRTIVTRLPPTAYMALRAAFVAEMRAYRAAAPK--- 402
Query: 252 EPIQLCY------FLPETFNRFPSMAFYFEDAN--LRIDGENVFIIDYENHFFLLAVAPH 303
E + CY + P + F+ N + +D V + LA AP+
Sbjct: 403 EHLDTCYDFSGAAPGGGGGVKLPKITLVFDGPNGAVELDPSGVLLDG------CLAFAPN 456
Query: 304 --DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
D + +IG+ QQ+ +Y+++ + F + C
Sbjct: 457 TDDQMTGIIGNVQQQALEVLYNVDGATVGFRRGAC 491
>gi|356503843|ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 474
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 97/396 (24%), Positives = 157/396 (39%), Gaps = 80/396 (20%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY----------------------AIFDPRKSSSFQKI 39
+ L +GTP + +LDTGS+L++ F P+ SS+ + +
Sbjct: 94 IDLNLGTPPQTSPFVLDTGSSLVWFPCTSRYLCSHCNFPNIDTTKIPTFIPKNSSTAKLL 153
Query: 40 NCDHPDCTY-------FKCVNEQC------------VYTMKYADQSVTKGFAAHETISVI 80
C +P C Y F+C QC Y ++Y S T GF + ++
Sbjct: 154 GCRNPKCGYIFGSDVQFRC--PQCKPESQNCSLTCPAYIIQYGLGS-TAGFLLLDNLNFP 210
Query: 81 GKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYC 140
GK L GCS +G+ G R S SQ+ KRFSYC
Sbjct: 211 GK-----TVPQFLVGCS--------ILSIRQPSGIAGFGRGQESLPSQMN---LKRFSYC 254
Query: 141 LVIPLPNGEYTSS--YLKFGTDMGYRRPSTQATKFINHP--NN-----FYYLSLKDISID 191
LV + SS L+ + + T F ++P NN +YYL+L+ + +
Sbjct: 255 LVSHRFDDTPQSSDLVLQISSTGDTKTNGLSYTPFRSNPSTNNPAFKEYYYLTLRKVIVG 314
Query: 192 NERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFER-FQLAQLSDC 250
+ + P + G GG I+DSGS T+ VY + ++FV E+ + A+ ++
Sbjct: 315 GKDVKIPYTFLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFVKQLEKNYSRAEDAET 374
Query: 251 PEPIQLCYFLPETFN-RFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAV-------A 301
+ C+ + FP + F F+ A + +N F + + L V
Sbjct: 375 QSGLSPCFNISGVKTVTFPELTFKFKGGAKMTQPLQNYFSLVGDAEVVCLTVVSDGGAGP 434
Query: 302 PHDDLVALI-GSQQQRDTRFVYDLNIDLLSFVKENC 336
P A+I G+ QQ++ YDL + F +C
Sbjct: 435 PKTTGPAIILGNYQQQNFYIEYDLENERFGFGPRSC 470
>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 485
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 91/374 (24%), Positives = 146/374 (39%), Gaps = 56/374 (14%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
++ IGTP+K + +DTGS +++ ++DP SSS + C
Sbjct: 83 TQIGIGTPAKSYYVQVDTGSDILWVNCVFCDTCPRKSGLGIELTLYDPSGSSSGTGVTCG 142
Query: 43 HPDCTYF------KCVNEQ-CVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGA 92
C CV C Y++ Y D S T GF + + V G +
Sbjct: 143 QDFCVATHGGVIPSCVPAAPCQYSISYGDGSSTTGFFVTDFLQYNQVSGNSQTTLANTSI 202
Query: 93 LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEY 150
FGC G D + AL G+LG + S +SQL + ++K F++CL G +
Sbjct: 203 TFGCGAKIGG-DLGSSSQALDGILGFGQSNSSMLSQLAAAGKVRKVFAHCLDTINGGGIF 261
Query: 151 TSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
+ +P T + P+ Y ++L+ I + ++ P + FDI S
Sbjct: 262 AIGDVV--------QPKVSTTPLVPGMPH--YNVNLEAIDVGGVKLQLPTNIFDIGES-- 309
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPS 269
G IIDSG+ L Y VY + K + + L D Q + + FP
Sbjct: 310 KGTIIDSGTTLAYLPGVVYNAIMSKVFAQYGDMPLKNDQD----FQCFRYSGSVDDGFPI 365
Query: 270 MAFYFEDA-NLRIDGENVFIIDYENHFF-----LLAVAPHDDLVALIGSQQQRDTRFVYD 323
+ F+FE L I + + E + L D+V L+G + +YD
Sbjct: 366 ITFHFEGGLPLNIHPHDYLFQNGELYCMGFQTGGLQTKDGKDMV-LLGDLAFSNRLVLYD 424
Query: 324 LNIDLLSFVKENCS 337
L ++ + NCS
Sbjct: 425 LENQVIGWTDYNCS 438
>gi|255571588|ref|XP_002526740.1| aspartic-type endopeptidase, putative [Ricinus communis]
gi|223533929|gb|EEF35654.1| aspartic-type endopeptidase, putative [Ricinus communis]
Length = 471
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 94/376 (25%), Positives = 156/376 (41%), Gaps = 61/376 (16%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY----------------AIFDPRKSSSFQKINCDHP 44
+++ IG+P I DTGS +++ +F+P KSS++ C H
Sbjct: 109 VMKFNIGSPPVETYAIPDTGSNIVWIQCGSPICTNCYKQKIPLFNPTKSSTYAIRLCGHR 168
Query: 45 DCT--------YFKCVN--EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHG--- 91
+C Y C + + C Y + Y D S ++G + + I+ E A F
Sbjct: 169 ECKQALWGLGEYLGCKSSVQVCRYHISYEDHSFSEGTISTDIITF---PEHIAEFGNYSL 225
Query: 92 -ALFGCS-NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIP---LP 146
FGC N++ +D GV+GL S + QL +FSYC+ P P
Sbjct: 226 RMFFGCGYNNSETPGQDPNSFTAPGVVGLGNEMASLVGQL---TLGQFSYCISTPDVQKP 282
Query: 147 NGEYTSSYLKFGTDMGYRRPSTQATKFINHPNNFY-YLSLKDISIDNERMN-FPPDTFDI 204
NG ++FG S +T N+ +Y + ++ I +D+ ++ +P F
Sbjct: 283 NGTIE---IRFGLAASI---SGHSTALANNLEGWYIFQNVDGIYVDDTKVKGYPEWVFQF 336
Query: 205 TVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYF-ERFQLAQLSD--CPEPIQLCYFLP 261
G GG I+DSG+ T ++Y+ + + E+ +LA + LCY
Sbjct: 337 AEGGIGGLIMDSGTTYT----ELYFSALDALIGELKEQIELAPDTQDHSNSNYSLCYNAA 392
Query: 262 E-TFNRFPSMAFYFED---ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRD 317
P++ F D A N +I D N + LA+ +++IG Q RD
Sbjct: 393 NFLLTYVPAIELKFTDNKEAYFPFTLRNAWI-DNGNDQYCLAMFGTSG-ISIIGIYQHRD 450
Query: 318 TRFVYDLNIDLLSFVK 333
+ YDL +L+SF +
Sbjct: 451 IKIGYDLKYNLVSFTE 466
>gi|226532674|ref|NP_001151415.1| pepsin A precursor [Zea mays]
gi|195646632|gb|ACG42784.1| pepsin A [Zea mays]
Length = 492
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 95/384 (24%), Positives = 152/384 (39%), Gaps = 73/384 (19%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
R+ IG+P KG + +DTGS +++ +DP S + + C+
Sbjct: 87 TRIEIGSPPKGYYVQVDTGSDILWVNGISCDGCPTRSGLGIELTQYDPAGSGT--TVGCE 144
Query: 43 HPDCTYFKCVN----------EQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIF 89
C + C + + Y D S T GF + + V G G+
Sbjct: 145 QEFCVANSAASGVPPACPSAASPCQFRITYGDGSSTTGFYVTDFVQYNQVSGNGQTTPSN 204
Query: 90 HGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPN 147
FGC G D + AL G+LG + S +SQL + ++K F++CL
Sbjct: 205 VSITFGCGA-QLGGDLGSSSQALDGILGFGQSDASMLSQLAAARKVRKIFAHCLDTVRGG 263
Query: 148 GEYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITV 206
G + + + P + T + PN +Y ++L+ IS+ + P TFD
Sbjct: 264 GIFAIGNV-------VQPPIVKTTPLV--PNATHYNVNLQGISVGGATLQLPTSTFD--- 311
Query: 207 SGEG-GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN 265
SG+ G IIDSG+ L Y +VY L + D +C+ + +
Sbjct: 312 SGDSKGTIIDSGTTLAYLPREVYRTLLTAVFDKHPDLAVRNYEDF-----ICFQFSGSLD 366
Query: 266 -RFPSMAFYFEDANLRIDGENVFIIDY----ENHFFLL-----AVAPHD--DLVALIGSQ 313
FP + F FE +L + NV+ DY N + + V D D+V L+G
Sbjct: 367 EEFPVITFSFE-GDLTL---NVYPHDYLFQNGNDLYCMGFLDGGVQTKDGKDMV-LLGDL 421
Query: 314 QQRDTRFVYDLNIDLLSFVKENCS 337
+ VYDL ++ + NCS
Sbjct: 422 VLSNKLVVYDLEKQVIGWTDYNCS 445
>gi|218185380|gb|EEC67807.1| hypothetical protein OsI_35373 [Oryza sativa Indica Group]
Length = 418
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 89/372 (23%), Positives = 142/372 (38%), Gaps = 59/372 (15%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPDC 46
V + IG P+K L +DTGS L + ++ P K+ + + C + C
Sbjct: 59 VTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPLYRPTKN---KLVPCANSIC 115
Query: 47 TYF--------KCV-NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
T KC +QC Y +KY D++ + G ++ S+ + + + FGC
Sbjct: 116 TALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVTDSFSLPLRNKSN-VRPSLSFGCG 174
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLG--SIIKKRFSYCLVIPLPNGEYTSSYL 155
D A G+LGL R ++S +SQL I K +CL +L
Sbjct: 175 YDQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCL------STSGGGFL 228
Query: 156 KFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
FG DM T + N+Y + D ++ P + D
Sbjct: 229 FFGDDMVPTSRVTWVPMVRSTSGNYYSPGSATLYFDRRSLSTKPMEV----------VFD 278
Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-------FP 268
SGS TYF + Y + L Q+SD P LC+ + F F
Sbjct: 279 SGSTYTYFSAQPYQATISAIKGSLSK-SLKQVSDPSLP--LCWKGQKAFKSVSDVKKDFK 335
Query: 269 SMAFYF-EDANLRIDGENVFIIDYENHFFL--LAVAPHDDLVALIGSQQQRDTRFVYDLN 325
S+ F F ++A + I EN I+ + L L + ++IG +D +YD
Sbjct: 336 SLQFIFGKNAVMEIPPENYLIVTKNGNVCLGILDGSAAKLSFSIIGDITMQDQMVIYDNE 395
Query: 326 IDLLSFVKENCS 337
L +++ +CS
Sbjct: 396 KAQLGWIRGSCS 407
>gi|21717160|gb|AAM76353.1|AC074196_11 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433304|gb|AAP54833.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125575544|gb|EAZ16828.1| hypothetical protein OsJ_32300 [Oryza sativa Japonica Group]
Length = 419
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 90/372 (24%), Positives = 146/372 (39%), Gaps = 63/372 (16%)
Query: 6 IGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHPDCTYF 49
IGTP + V I+D L++ +FDP S++++ C P C
Sbjct: 68 IGTPPQAVSGIVDLSGELVWTQCAACRSSGCFKQELPVFDPSASNTYRAEQCGSPLCKSI 127
Query: 50 KCVNEQCVYTMKYADQSV---TKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
N Y S+ T G A+ + I+ IG EG+ F GC + G +
Sbjct: 128 PTRNCSGDGECGYEAPSMFGDTFGIASTDAIA-IGNAEGRLAF-----GCVVASDGSIDG 181
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
A DG +G +GL R S + Q FSYCL + P G+ ++ +L +
Sbjct: 182 AMDGP-SGFVGLGRTPWSLVGQSN---VTAFSYCLALHGP-GKKSALFLGASAKLAGAGK 236
Query: 167 STQATKFIN-HPNN--------FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI---- 213
S T + H +N +Y + L+ I D S GG I
Sbjct: 237 SNPPTPLLGQHASNTSDDGSDPYYTVQLEGIKAG--------DVAVAAASSGGGAITVLQ 288
Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFY 273
+++ L+Y Y L EK V+ +++ PEP LC F + P + F
Sbjct: 289 LETFRPLSYLPDAAYQAL-EKVVT--AALGSPSMANPPEPFDLC-FQNAAVSGVPDLVFT 344
Query: 274 FEDANLRIDGENVFIIDYENHFFLLAVA--------PHDDLVALIGSQQQRDTRFVYDLN 325
F+ + +++ N + ++ DD V+++GS Q + F++DL
Sbjct: 345 FQGGATLTAQPSKYLLGDGNGNGTVCLSILSSTRLDSADDGVSILGSLLQENVHFLFDLE 404
Query: 326 IDLLSFVKENCS 337
+ LSF +CS
Sbjct: 405 KETLSFEPADCS 416
>gi|358346726|ref|XP_003637416.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
gi|355503351|gb|AES84554.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
Length = 165
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 81/161 (50%), Gaps = 5/161 (3%)
Query: 178 NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVS 237
+ +YY+ L IS+ E + P +F++ +G GG I+DSG+ +T SDVY + + FV
Sbjct: 8 DTYYYVGLVGISVGGELLAIPETSFEVDSAGNGGIIVDSGTAVTRLQSDVYNVVRDAFVK 67
Query: 238 YFERFQLAQLSDCPEPIQLCYFL-PETFNRFPSMAFYF-EDANLRIDGENVFIIDYENHF 295
+ LA ++ CY L +T P++AF+F E L + +N +
Sbjct: 68 GTKDL-LA--TNEVSLFDTCYDLSSKTSVEVPTVAFHFGEGKVLVLPAKNYLVPVDSVGT 124
Query: 296 FLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
F A AP +++IG+ QQ+ TR +DL L+ F C
Sbjct: 125 FCFAFAPTMSSLSIIGNIQQQGTRVSFDLANSLVGFSPNRC 165
>gi|388515789|gb|AFK45956.1| unknown [Medicago truncatula]
Length = 225
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 65/233 (27%), Positives = 99/233 (42%), Gaps = 22/233 (9%)
Query: 113 AGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATK 172
AG+LGL +SF+ QLG FSYCLV G +S L+FG +
Sbjct: 6 AGLLGLGSGPMSFVGQLGGQAGGTFSYCLV---SRGTESSGSLEFGRES--VPVGASWVS 60
Query: 173 FINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWK 230
I++P +FYY+ L + + R+ D F + GEGG ++D+G+ +T + Y
Sbjct: 61 LIHNPRAPSFYYIGLSGLGVGGLRVPISEDIFRLNELGEGGVVMDTGTAVTRLPAAAYNA 120
Query: 231 LHEKFVSYFERFQLAQLSDCPEPIQL-----CYFLPETFN-RFPSMAFYFEDAN-LRIDG 283
+ FV AQ ++ P+ + CY L R P+++FYF L +
Sbjct: 121 FRDAFV--------AQTTNLPKTSGVSIFDTCYDLNGFVTVRVPTISFYFLGGPILTLPA 172
Query: 284 ENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
N I F A AP +++IG+ QQ D + F C
Sbjct: 173 RNFLIPVDSVGTFCFAFAPSSSGLSIIGNIQQEGIEISVDGANGYIGFGPNIC 225
>gi|326490597|dbj|BAJ89966.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 450
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 87/386 (22%), Positives = 152/386 (39%), Gaps = 64/386 (16%)
Query: 2 VRLFIGTPSKGVLLILDTGSAL-----------IYAIFDPRKSSSFQKINCDHPDCTYF- 49
V + +GTP + V ++LDTGS L A F+ S ++ ++C P C +
Sbjct: 67 VSVVVGTPPQNVTMVLDTGSELSGLLCNGSSLSPPAPFNASASLTYSAVDCSSPACVWRG 126
Query: 50 ----------KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS-- 97
+ C ++ YAD S G +T ++G +A+ ALFGC
Sbjct: 127 RDLPVRPFCDAPPSTSCRVSISYADASSADGHLVADTF-ILGT---QAV--PALFGCITS 180
Query: 98 -NDNHGFDEDARDG--ALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY 154
+ + + A D A G+LG++R ++SF++Q ++ RF+YC+ G
Sbjct: 181 YSSSTAINSSATDPSEAATGLLGMNRGSLSFVTQTATL---RFAYCIAPGQGPGILLLGG 237
Query: 155 LKFGTDMGYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
P + ++ + + + Y + L+ I + + + P +G G +
Sbjct: 238 DGGAAPPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVGSALLQIPKSVLTPDHTGAGQTM 297
Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP-------IQLCYFLPE---- 262
+DSG+ T+ +D Y L +F++ R LA L EP C+ PE
Sbjct: 298 VDSGTQFTFLLADAYAALKAEFLNQ-ARSLLAPLG---EPGFVFQGAFDACFRGPEERVS 353
Query: 263 -TFNRFPSMAFYFEDANLRIDGEN-VFIIDYENHFFLLAVA------PHDDLVAL----I 310
P + A + + GE ++ + E A A + D+ + I
Sbjct: 354 AASRLLPEVGLVLRGAEVAVAGEKLLYSVPGERRGEEGAEAVWCLTFGNSDMAGMSAYVI 413
Query: 311 GSQQQRDTRFVYDLNIDLLSFVKENC 336
G Q+D YDL + F C
Sbjct: 414 GHHHQQDVWVEYDLQNGRVGFAPARC 439
>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
Length = 462
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 90/375 (24%), Positives = 159/375 (42%), Gaps = 59/375 (15%)
Query: 6 IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC----- 46
+G+P + +LI+DTGS L + I+D +S S++ + C++
Sbjct: 106 LGSPGQEAILIVDTGSELTWLKCLPCKVCAPSVDTIYDAARSVSYKPVTCNNSQLCSNSS 165
Query: 47 --TYFKCV-NEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAI-FHGALFGCSND 99
TY C QC + Y D S + G + +T+ +V+G GK + FGC+
Sbjct: 166 QGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVG---GKPVTVQDFAFGCA-- 220
Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF-- 157
G E GA +G+LGL+ ++ QLG +FS+C P + S+ + F
Sbjct: 221 -QGDLELVPTGA-SGILGLNAGKMALPMQLGQRFGWKFSHCF--PDRSSHLNSTGVVFFG 276
Query: 158 GTDMGYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
++ + + + N FY+++LK +SI++ + P + I+D
Sbjct: 277 NAELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVLLPRGSVV--------ILD 328
Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP-----ETFNRFPSM 270
SGS + F + +L E F+ + D + C+ + E PS+
Sbjct: 329 SGSSFSSFVRPFHSQLREAFLKHRPPSLKHLEGDSFGDLGTCFKVSNDDIDELHRTLPSL 388
Query: 271 AFYFEDA-NLRIDGENVF--IIDYENHFFLLAVAPHD---DLVALIGSQQQRDTRFVYDL 324
+ FED + I V + Y+NH + A D + V +IG+ QQ++ YD+
Sbjct: 389 SLVFEDGVTIGIPSIGVLLPVARYQNH-VKMCFAFEDGGPNPVNVIGNYQQQNLWVEYDI 447
Query: 325 NIDLLSFVKENCSDD 339
+ F + +C D
Sbjct: 448 QRSRVGFARASCVID 462
>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
Length = 445
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 89/360 (24%), Positives = 142/360 (39%), Gaps = 63/360 (17%)
Query: 7 GTPSKGVLLILDTGSALIY----------------AIFDPRKSSSFQKINCDHPDCTYFK 50
GTP+ ++++DTGS L + +FDP SS++ + C +C
Sbjct: 119 GTPAVPQVVVIDTGSDLTWLQCKPCSSGQCSPQKDPLFDPSHSSTYSAVPCASGECKKLA 178
Query: 51 -------CVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
C N Q C + + Y D + T G + +++ AI FGC G
Sbjct: 179 ADAYGSGCSNGQPCGFAISYVDGTSTVGVYGKDKLTL----APGAIVKDFYFGC-----G 229
Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
+ + G G+LGL R++ S +Q FSYCL P +L FG
Sbjct: 230 HSKSSLPGLFDGLLGLGRLSESLGAQY--GGGGGFSYCL----PAVNSKPGFLAFGAG-- 281
Query: 163 YRRPS----TQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
R PS T + P F ++L I++ ++++ P F GG I+DSG+
Sbjct: 282 -RNPSGFVFTPMGRVPGQPT-FSTVTLAGITVGGKKLDLRPSAF------SGGMIVDSGT 333
Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFE-D 276
V+T S VY L F + ++L + CY L N P +A F
Sbjct: 334 VVTVLQSTVYRALRAAFREAMKAYRLVH-----GDLDTCYDLTGYKNVVVPKIALTFSGG 388
Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
A + +D N ++ N A D ++G+ QR ++D + F + C
Sbjct: 389 ATINLDVPNGILV---NGCLAFAETGKDGTAGVLGNVNQRTFEVLFDTSASKFGFRAKAC 445
>gi|388516465|gb|AFK46294.1| unknown [Medicago truncatula]
Length = 434
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 85/347 (24%), Positives = 149/347 (42%), Gaps = 45/347 (12%)
Query: 1 MVRLFIGTPSKGVLLILDTGS--ALIYA---------IFDPRKSSSFQKINCDHPDCTYF 49
+VR+ IGTP + + ++LDT + A I + F P S+S+ + C P C+
Sbjct: 99 IVRVKIGTPGQLLFMVLDTSTDEAFIPSSGCIGCSATTFSPNASTSYVPLECSVPQCSQV 158
Query: 50 KCVN------EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
+ ++ C + YA + + +++ + + FG N G
Sbjct: 159 RGLSCPATGSGACSFNKSYAGSTYSATLV-QDSLRL-----ATDVIPSYSFGSINAISGS 212
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
A+ R +S +SQ GS+ FSYCL P Y S LK G +G
Sbjct: 213 SIPAQGLLGL-----GRGPLSLLSQTGSLYSGVFSYCL--PSFKSYYFSGSLKLG-PVGQ 264
Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
+ S + T + +P + Y+++L I++ + FP + V+ G IIDSG+V+T
Sbjct: 265 PK-SIRTTPLLRNPRRPSLYFVNLTGITVGKVNVPFPKELLAFDVNTGSGTIIDSGTVIT 323
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRI 281
F VY + ++F R Q+ C F+ P++ +F D +L++
Sbjct: 324 RFVEPVYNAVRDEF-----RKQVTGPFSSLGAFDTC-FVKNYETLAPAITLHFTDLDLKL 377
Query: 282 DGENVFIIDYENHFFLLAVA--PHD---DLVALIGSQQQRDTRFVYD 323
EN I LA+A P + ++ +I + QQ++ R ++D
Sbjct: 378 PLENSLIHSSSGSLACLAMASTPKNVNYTVLNVIANYQQQNLRVLFD 424
>gi|296085344|emb|CBI29076.3| unnamed protein product [Vitis vinifera]
Length = 434
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 68/256 (26%), Positives = 107/256 (41%), Gaps = 38/256 (14%)
Query: 114 GVLGLSRVTISFISQLGSIIKKRFSYC-LVIPLPNGEYTSSYLKFGTDMGYRRPSTQATK 172
G+ G R +S SQLG ++K FS+C L N SS L G Q T
Sbjct: 181 GIAGFGRGVLSLPSQLG-FLQKGFSHCFLGFKFANNPNISSPLVIGDLAISSNDHLQFTS 239
Query: 173 FINHP--NNFYYLSLKDISIDNER-MNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYW 229
+ +P N+YY+ L+ I++ N + P + G GG IIDSG+ T+ Y
Sbjct: 240 LLKNPMYPNYYYIGLEAITVGNATAIQVPSSLREFDSHGNGGMIIDSGTTYTHLPGPFYT 299
Query: 230 KLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-------RFPSMAFYFEDANLRID 282
+L S + AQ + LCY +P N PS++F+F +
Sbjct: 300 QLLSMLQSIIT-YPRAQEQEARTGFDLCYRIPCPNNVVTDHDHLLPSISFHFSN------ 352
Query: 283 GENV-FIIDYENHFFLLAVAPHDDLV----------------ALIGSQQQRDTRFVYDLN 325
NV ++ NHF+ + + +V + GS QQ++ + VYDL
Sbjct: 353 --NVSLVLPQGNHFYAMGAPSNSTVVKCLLLQNMDDSDSGPAGVFGSFQQQNVKVVYDLE 410
Query: 326 IDLLSFVKENCSDDSA 341
+ + F +C+ +A
Sbjct: 411 KERIGFQPMDCASAAA 426
>gi|359484086|ref|XP_002263357.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 417
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 68/256 (26%), Positives = 107/256 (41%), Gaps = 38/256 (14%)
Query: 114 GVLGLSRVTISFISQLGSIIKKRFSYC-LVIPLPNGEYTSSYLKFGTDMGYRRPSTQATK 172
G+ G R +S SQLG ++K FS+C L N SS L G Q T
Sbjct: 164 GIAGFGRGVLSLPSQLG-FLQKGFSHCFLGFKFANNPNISSPLVIGDLAISSNDHLQFTS 222
Query: 173 FINHP--NNFYYLSLKDISIDNER-MNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYW 229
+ +P N+YY+ L+ I++ N + P + G GG IIDSG+ T+ Y
Sbjct: 223 LLKNPMYPNYYYIGLEAITVGNATAIQVPSSLREFDSHGNGGMIIDSGTTYTHLPGPFYT 282
Query: 230 KLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-------RFPSMAFYFEDANLRID 282
+L S + AQ + LCY +P N PS++F+F +
Sbjct: 283 QLLSMLQSIIT-YPRAQEQEARTGFDLCYRIPCPNNVVTDHDHLLPSISFHFSN------ 335
Query: 283 GENV-FIIDYENHFFLLAVAPHDDLV----------------ALIGSQQQRDTRFVYDLN 325
NV ++ NHF+ + + +V + GS QQ++ + VYDL
Sbjct: 336 --NVSLVLPQGNHFYAMGAPSNSTVVKCLLLQNMDDSDSGPAGVFGSFQQQNVKVVYDLE 393
Query: 326 IDLLSFVKENCSDDSA 341
+ + F +C+ +A
Sbjct: 394 KERIGFQPMDCASAAA 409
>gi|18421660|ref|NP_568551.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|10177438|dbj|BAB10671.1| unnamed protein product [Arabidopsis thaliana]
gi|15809850|gb|AAL06853.1| AT5g37540/mpa22_p_70 [Arabidopsis thaliana]
gi|20260182|gb|AAM12989.1| unknown protein [Arabidopsis thaliana]
gi|23197748|gb|AAN15401.1| unknown protein [Arabidopsis thaliana]
gi|332006821|gb|AED94204.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 76/273 (27%), Positives = 113/273 (41%), Gaps = 57/273 (20%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY----------------AIFDPRKSSSFQKINCDHP 44
++ L IGTPS+ L+LDTGS L + FDP SSSF + C HP
Sbjct: 81 ILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHP 140
Query: 45 DCTY--------FKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFG 95
C C N C Y+ YAD + +G E + + + G
Sbjct: 141 LCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQTTPPL----ILG 196
Query: 96 CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYL 155
C+ ++ DE G+LG++ +SFISQ + I K FSYC+ P
Sbjct: 197 CAKEST--DEK-------GILGMNLGRLSFISQ--AKISK-FSYCI----PTRSNRPGLA 240
Query: 156 KFGTDMGYRRPSTQATKFIN---------HPNN---FYYLSLKDISIDNERMNFPPDTFD 203
G+ P+++ K+++ PN Y + L+ I I +R+N P F
Sbjct: 241 STGSFYLGDNPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFR 300
Query: 204 ITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFV 236
G G ++DSGS T+ Y K+ E+ V
Sbjct: 301 PDAGGSGQTMVDSGSEFTHLVDVAYDKVKEEIV 333
>gi|413950927|gb|AFW83576.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 316
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 74/304 (24%), Positives = 118/304 (38%), Gaps = 40/304 (13%)
Query: 62 YADQSVTKGFAAHETISVI------GKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGV 115
Y D S +G ++ ++ GK + +A G + GC+ G A DG V
Sbjct: 22 YKDGSAARGTVGTDSATIALSGRRAGKKQRRAKLRGVVLGCTTSYTGESFLASDG----V 77
Query: 116 LGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATK--- 172
L L +SF S+ + RFSYCLV L T SYL FG + S T
Sbjct: 78 LSLGYSNVSFASRAAARFGGRFSYCLVDHLAPRNAT-SYLTFGPNPAVSSASASRTACAG 136
Query: 173 ------------FINHP-NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
++H FY +++ +S+D E + P +D V GG I+DSG+
Sbjct: 137 SAAAPGARQTPLLLDHRMRPFYAVAVNGVSVDGELLRIPRLVWD--VQKGGGAILDSGTS 194
Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN------RFPSMAFY 273
LT S Y V+ + + +P CY P++A +
Sbjct: 195 LTVLVSPAY----RAVVAALGKKLVGLPRVAMDPFDYCYNWTSPLTGEDLAVAVPALAVH 250
Query: 274 FEDANLRIDGENVFIIDYENHFFLLAVAPHD-DLVALIGSQQQRDTRFVYDLNIDLLSFV 332
F + ++ID + + D V++IG+ Q++ + +DL L F
Sbjct: 251 FAGSARLQPPPKSYVIDAAPGVKCIGLQEGDWPGVSVIGNILQQEHLWEFDLKNRRLRFK 310
Query: 333 KENC 336
+ C
Sbjct: 311 RSRC 314
>gi|6850312|gb|AAF29389.1|AC009999_9 Contains similarity to nucellin from Hordeum vulgare gb|U87148.
ESTs gb|T22068, gb|F14251, gb|F14237, gb|F14242 come
from this gene [Arabidopsis thaliana]
Length = 388
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 68/265 (25%), Positives = 111/265 (41%), Gaps = 44/265 (16%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCD 42
++ IGTP+K + +DTGS +++ +++ +S S + ++CD
Sbjct: 82 AKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLVSCD 141
Query: 43 HPDCTYFK------C-VNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGA 92
C C N C Y Y D S T G+ + + SV G + +
Sbjct: 142 DDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTANGSV 201
Query: 93 LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEY 150
+FGC G + + + AL G+LG + S ISQL S +KK F++CL +G
Sbjct: 202 IFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCL-----DGRN 256
Query: 151 TSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
G + +P T + N P+ Y +++ + + E + P D F
Sbjct: 257 GGGIFAIGRVV---QPKVNMTPLVPNQPH--YNVNMTAVQVGQEFLTIPADLFQ--PGDR 309
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEK 234
G IIDSG+ L Y +Y L +K
Sbjct: 310 KGAIIDSGTTLAYLPEIIYEPLVKK 334
>gi|147859621|emb|CAN83119.1| hypothetical protein VITISV_043393 [Vitis vinifera]
Length = 431
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 76/293 (25%), Positives = 131/293 (44%), Gaps = 43/293 (14%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAIFDPRKSSSFQKINCDHPDCTYFK------CV-NE 54
++ IGTP++ + ++ ++D ++S + + ++CD C C+ N
Sbjct: 100 AKIGIGTPARDYYVQMEL------TLYDIKESLTGKLVSCDQDFCYAINGGPPSYCIANM 153
Query: 55 QCVYTMKYADQSVT-----KGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARD 109
C YT YAD S + KG+ + I + L CS G + + +
Sbjct: 154 SCSYTEIYADGSSSFGYFVKGYCTASKYNSIPHLNNNPLLEVPLR-CSATQSG--DLSSE 210
Query: 110 GALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPS 167
AL G+LG + S ISQL S ++K F++CL +G G + +P
Sbjct: 211 EALDGILGFGKSNTSMISQLASSGKVRKMFAHCL-----DGLNGGGIFAIGHIV---QPK 262
Query: 168 TQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSD 226
T + PN +Y +++K + + +N P D FD V + G IIDSG+ L Y
Sbjct: 263 VNTTPLV--PNQTHYNVNMKAVEVGGYFLNLPTDVFD--VGDKKGTIIDSGTTLAYLPEV 318
Query: 227 VYWKLHEKFVSYFERFQLAQLSDCPEPIQL-CYFLPETF-NRFPSMAFYFEDA 277
VY +L K S+ ++ + D Q C+ E+ + FP++ F+FE++
Sbjct: 319 VYDQLLSKIFSWQSDLKVHTIHD-----QFTCFQYSESLDDGFPAVTFHFENS 366
>gi|242041431|ref|XP_002468110.1| hypothetical protein SORBIDRAFT_01g039750 [Sorghum bicolor]
gi|241921964|gb|EER95108.1| hypothetical protein SORBIDRAFT_01g039750 [Sorghum bicolor]
Length = 467
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 89/410 (21%), Positives = 159/410 (38%), Gaps = 89/410 (21%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIY------------------AIFDPRKSSSFQKINCDH 43
V + +G P + V ++LDTGS L + A F+ SS++ +C
Sbjct: 61 VPVAVGAPPQNVTMVLDTGSELSWLLCNGSRVPSTPPQPQAPAAFNGSASSTYAAAHCSS 120
Query: 44 -PDCTYFK--------CV---NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHG 91
P+C + C + C ++ YAD S G A +T + G A
Sbjct: 121 SPECQWRGRDLPVPPFCAGPPSNSCRVSLSYADASSADGVLAADTFLL-----GGAPPVR 175
Query: 92 ALFGC--------SNDNHGFDEDAR----DGALAGVLGLSRVTISFISQLGSIIKKRFSY 139
ALFGC + D +G DA A G+LG++R ++SF++Q G++ RF+Y
Sbjct: 176 ALFGCITSYSSSSTADGNGNGNDASATNSSEAATGLLGMNRGSLSFVTQTGTL---RFAY 232
Query: 140 CLV------IPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYY-LSLKDISIDN 192
C+ + + G+ + L + Y P + ++ + + + Y + L+ I +
Sbjct: 233 CIAPGDGPGLLVLGGDGDGAALSAAPQLNYT-PLIEMSQPLPYFDRVAYSVQLEGIRVGA 291
Query: 193 ERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE 252
+ P +G G ++DSG+ T+ +D Y L +F++ A L+ E
Sbjct: 292 ALLPIPKSVLAPDHTGAGQTMVDSGTQFTFLLADAYAPLKGEFLNQTS----ALLAPLGE 347
Query: 253 P-------IQLCYFLPE-------TFNRFPSMAFYFEDANLRIDGEN-VFIIDYENH--- 294
P C+ E P + A + + GE ++++ E
Sbjct: 348 PDFVFQGAFDACFRASEARVAAATASQLLPEVGLVLRGAEVAVGGEKLLYMVPGERRGEG 407
Query: 295 ----FFLLAVAPHDDLVAL----IGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ L + D+ + IG Q++ YDL + F C
Sbjct: 408 GSEAVWCLTFG-NSDMAGMSAYVIGHHHQQNVWVEYDLQNSRVGFAPARC 456
>gi|224033419|gb|ACN35785.1| unknown [Zea mays]
gi|413934980|gb|AFW69531.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
Length = 543
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 78/295 (26%), Positives = 114/295 (38%), Gaps = 60/295 (20%)
Query: 6 IGTPSKGVLLILDTGSALIYAIFD------------------------PRKSSSFQKINC 41
+GTP+ L+ LDTGS L + D PR+SS+ +++ C
Sbjct: 114 LGTPNATFLVALDTGSDLFWVPCDCRQCATIPSANGTGQDAPSLRPYSPRRSSTSKQVAC 173
Query: 42 DHPDCTY----FKCVNEQCVYTMKYADQSVTKGF-----AAHETISVIGKG-EGKAIFHG 91
D+P C N C Y ++Y + + H T G G G+A+
Sbjct: 174 DNPLCGQRNGCSAATNGSCPYEVQYVSANTSSSGVLVQDVLHLTRERPGPGAAGEALQAP 233
Query: 92 ALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQL---GSIIKKRFSYCLVIPLPNG 148
+FGC G D GA+ G++GL +S S L G + FS C G
Sbjct: 234 VVFGCGQVQTGAFLDGGGGAVDGLMGLGMGKVSVPSALAASGLVASDSFSMCF------G 287
Query: 149 EYTSSYLKFGTDMGYRRPSTQATKF-INHPNNFYYLSLKDISIDNERMNFPPDTFDITVS 207
+ + FG D G R T F + N Y +S I + +E +V+
Sbjct: 288 DDGVGRVNFG-DAGSR--GQAETPFTVRSLNPTYNVSFTSIGVGSE-----------SVA 333
Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYF--ERFQLAQLSDCPEPIQLCYFL 260
E ++DSG+ TY Y +L KF S R + S P P + CY L
Sbjct: 334 AEFAAVMDSGTSFTYLSDPEYTQLATKFNSQVSERRVNFSSGSADPFPFEYCYRL 388
>gi|255545932|ref|XP_002514026.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547112|gb|EEF48609.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 437
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 84/359 (23%), Positives = 145/359 (40%), Gaps = 43/359 (11%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA-----------IFDPRKSSSFQKINCDHPDCTY- 48
+VR+ +GTP + + ++LDT + + F SS++ ++C CT
Sbjct: 98 VVRVKLGTPGQFMFMVLDTSNDAAWVPCSGCTGCSSTTFSTNTSSTYGSLDCSMAQCTQV 157
Query: 49 --FKCV---NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
F C + CV+ Y S +++ ++ + FGC N G
Sbjct: 158 RGFSCPATGSSSCVFNQSYGGDSSFSATLVEDSLRLVND-----VIPNFAFGCINSISGG 212
Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
+ R +S I+Q GS+ FSYCL P Y S LK G
Sbjct: 213 SVPPQGLLGL-----GRGPLSLIAQSGSLYSGLFSYCL--PSFKSYYFSGSLKLGP--AG 263
Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
+ S + T + +P+ + YY++L +S+ + P+ + G IIDSG+V+T
Sbjct: 264 QPKSIRYTPLLRNPHRPSLYYVNLTGVSVGRTLVPIAPELLAFNPNTGAGTIIDSGTVIT 323
Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRI 281
F +Y + ++F R Q+A C F P++ +F NL +
Sbjct: 324 RFVQPIYTAIRDEF-----RKQVAGPFSSLGAFDTC-FAATNEAVAPAVTLHFTGLNLVL 377
Query: 282 DGENVFIIDYENHFFLLAVAPH----DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
EN I LA+A + ++ +I + QQ++ R ++D+ L +E C
Sbjct: 378 PMENSLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNLRLLFDVPNSRLGIARELC 436
>gi|125532793|gb|EAY79358.1| hypothetical protein OsI_34487 [Oryza sativa Indica Group]
Length = 419
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 90/372 (24%), Positives = 145/372 (38%), Gaps = 63/372 (16%)
Query: 6 IGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHPDCTYF 49
IGTP + V I+D L++ +FDP S++++ C P C
Sbjct: 68 IGTPPQAVSGIVDLSGELVWTQCAACRSSGCFKQELPVFDPSASNTYRAEQCGSPLCKSI 127
Query: 50 KCVNEQCVYTMKYADQSV---TKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
N Y S+ T G A+ + I+ IG EG+ F GC + G +
Sbjct: 128 PTRNCSGDGECGYEAPSMFGDTFGIASTDAIA-IGNAEGRLAF-----GCVVASDGSIDG 181
Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
A DG +G +GL R S + Q FSYCL P G+ ++ +L +
Sbjct: 182 AMDGP-SGFVGLGRTPWSLVGQSN---VTAFSYCLA-PHGPGKKSALFLGASAKLAGAGK 236
Query: 167 STQATKFIN-HPNN--------FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI---- 213
S T + H +N +Y + L+ I D S GG I
Sbjct: 237 SNPPTPLLGQHASNTSDDGSDPYYTVQLEGIKAG--------DVAVAAASSGGGAITILQ 288
Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFY 273
+++ L+Y Y L EK V+ +++ PEP LC F + P + F
Sbjct: 289 LETFRPLSYLPDAAYQAL-EKVVT--AALGSPSMANPPEPFDLC-FQNAAVSGVPDLVFT 344
Query: 274 FEDANLRIDGENVFIIDYENHFFLLAVA--------PHDDLVALIGSQQQRDTRFVYDLN 325
F+ + +++ N + ++ DD V+++GS Q + F++DL
Sbjct: 345 FQGGATLTAPPSKYLLGDGNGNGTVCLSILSSTRLDSADDGVSILGSLLQENVHFLFDLE 404
Query: 326 IDLLSFVKENCS 337
+ LSF +CS
Sbjct: 405 KETLSFEPADCS 416
>gi|226499286|ref|NP_001147826.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
gi|195613980|gb|ACG28820.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
Length = 545
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 94/383 (24%), Positives = 142/383 (37%), Gaps = 77/383 (20%)
Query: 6 IGTPSKGVLLILDTGSALIYAIFD------------------------PRKSSSFQKINC 41
+GTP+ L+ LDTGS L + D PR+SS+ +++ C
Sbjct: 116 LGTPNATFLVALDTGSDLFWVPCDCRQCATIPSANATGPDAPPLRPYSPRRSSTSEQVAC 175
Query: 42 DHPDCTYFK----CVNEQCVYTMKYADQSVTKGF-----AAHETISVIGKG-EGKAIFHG 91
D+P C N C Y ++Y + + H T G G G+A+
Sbjct: 176 DNPLCGRRNGCSAATNGSCPYEVQYVSANTSSSGVLVQDVLHLTRERPGPGAAGEALQAP 235
Query: 92 ALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQL---GSIIKKRFSYCLVIPLPNG 148
+FGC G D GA+ G++GL +S S L G + FS C G
Sbjct: 236 VVFGCGQVQTGAFLDDGGGAVDGLMGLGMGKVSVPSALAASGLVASDSFSMCF------G 289
Query: 149 EYTSSYLKFGTDMGYRRPSTQATKF-INHPNNFYYLSLKDISIDNERMNFPPDTFDITVS 207
+ + FG D G R T F + N Y +S I I +E +V+
Sbjct: 290 DDGVGRVNFG-DAGSR--GQAETPFTVRSLNPTYNVSFTSIGIGSE-----------SVA 335
Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYF--ERFQLAQLSDCPEPIQLCYFLPETFN 265
E ++DSG+ TY Y +L KF S R + S P P + CY L
Sbjct: 336 AEFAAVMDSGTSFTYLSDPEYTQLATKFNSQVSERRVNFSSGSADPFPFEYCYRLSPN-- 393
Query: 266 RFPSMAFYFEDANLRIDGENVFII--------DYENHF--FLLAVAPHDDLVA--LIGSQ 313
D +L G +F + D + LA+ +D + +IG
Sbjct: 394 ---QTEVAMPDVSLTAKGGALFPVTQPFIPVGDTTGRAIGYCLAIMRNDMAIGIDIIGQN 450
Query: 314 QQRDTRFVYDLNIDLLSFVKENC 336
+ V+D +L + K +C
Sbjct: 451 FMTGLKVVFDRERSVLGWEKFDC 473
>gi|224136436|ref|XP_002322329.1| predicted protein [Populus trichocarpa]
gi|222869325|gb|EEF06456.1| predicted protein [Populus trichocarpa]
Length = 486
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 101/419 (24%), Positives = 157/419 (37%), Gaps = 100/419 (23%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYAIFDPRKSSSFQKINCD------------------ 42
++ L IGTP + + +++DTGS L + P + SF + CD
Sbjct: 83 LISLNIGTPPQVIQVLMDTGSDLTWV---PCGNLSFDCMECDDYRNNKLMATFSPSYSSS 139
Query: 43 ------------------HP--DCTYFKC-----VNEQCV-----YTMKYADQSVTKGFA 72
+P CT C V C + Y V G
Sbjct: 140 SYRASCASPFCIDIHSSDNPLDTCTVAGCSLSTLVKATCSRPCPSFAYTYGAGGVVTGIL 199
Query: 73 AHETISVIGKGEGKAI-FHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS 131
+T+ V G G A FGC + R+ G+ G R T+S +SQLG
Sbjct: 200 TRDTLRVNGSSPGVAKEIPKFCFGCVGSAY------REPI--GIAGFGRGTLSMVSQLG- 250
Query: 132 IIKKRFSYC-LVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHP--NNFYYLSLKDI 188
++K FS+C L N SS L G + Q T +N P NFYY+ L+ I
Sbjct: 251 FLQKGFSHCFLAFKYANNPNISSPLVVGDIALTSKDDMQFTPMLNSPMYPNFYYVGLEAI 310
Query: 189 SIDNERMNFPPDTF-DITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQL 247
++ N P + + G GG IDSG+ T+ Y ++ S + +
Sbjct: 311 TVGNVSATEVPSSLREFDSLGNGGMKIDSGTTYTHLPEPFYSQVLSILQSTINYPRDTGM 370
Query: 248 SDCPEPIQLCYFLPETFNR-------FPSMAFYFEDANLRIDGENV-FIIDYENHFFLLA 299
+ LCY +P N PS+ F+F + NV ++ NHF+ ++
Sbjct: 371 -EMQTGFDLCYKVPRPNNNTLTSDDLLPSITFHFLN--------NVSLVLPQGNHFYPVS 421
Query: 300 VAPHDDLV-----------------ALIGSQQQRDTRFVYDLNIDLLSFVKENCSDDSA 341
AP + V + GS QQ++ VYDL + + F +C+ ++
Sbjct: 422 -APGNPAVVKCLMFQSTDDGDDGPAGVFGSFQQQNVEVVYDLEKERIGFQPMDCASAAS 479
>gi|88174554|gb|ABD39352.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
Length = 321
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 86/342 (25%), Positives = 143/342 (41%), Gaps = 53/342 (15%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYAIFD-------PR-----KSSSFQKINCDHPDCTY 48
++ + +GTPSK ++ +DTGS+ + + PR +S++ K++C C
Sbjct: 2 VISVGLGTPSKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQSRSTTCAKVSCGTSMCLL 61
Query: 49 F----KCVNEQ----CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
C + + C + + Y D S + G +T++ G FGC+ D+
Sbjct: 62 GGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF----SDVQKIPGFSFGCNMDS 117
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY-----TSSYL 155
G +E G + G+LG+ +S + Q S FSYCL PL E T+ Y
Sbjct: 118 FGANE---FGNVDGLLGMGAGAMSVLKQ-SSPTFDCFSYCL--PLQKSERGFFSKTTGYF 171
Query: 156 KFGTDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
G R + TK + N +++ L IS+D ER+ P F G +
Sbjct: 172 SLGKVA--TRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFS-----RKGVV 224
Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET-FNRFPSMAF 272
DSGS L+Y L ++ R A+ E + CY + P+++
Sbjct: 225 FDSGSELSYIPDRALSVLSQRIRELLLRRGAAE----EESERNCYDMRSVDEGDMPAISL 280
Query: 273 YFED-ANLRIDGENVFIID--YENHFFLLAVAPHDDLVALIG 311
+F+D A + VF+ E + LA AP + V++IG
Sbjct: 281 HFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTES-VSIIG 321
>gi|326495450|dbj|BAJ85821.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 491
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 79/330 (23%), Positives = 125/330 (37%), Gaps = 45/330 (13%)
Query: 26 AIFDPRKSSSFQKINCDHPDCTYF-----KCVNEQ----CVYTMKYADQSVTKGFAAHET 76
A FDPR+SS+ + C C C C+Y ++Y+D +T G +T
Sbjct: 188 AFFDPRRSSTGAPVRCGSRACRTLGGYANGCSKPNSTGDCLYRIEYSDHRLTLGTYMTDT 247
Query: 77 ISVIGKGEGKAIFHGALFGCSNDNHG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKK 135
+++ F FGCS+ G F A +G + L S +SQ
Sbjct: 248 LTI----SPSTTFLNFRFGCSHAVRGKFSAQA-----SGTMSLGGGPQSLLSQTARAYGN 298
Query: 136 RFSYCLVIPLPNGEYTSSYLKFGTDMG-----YRRPSTQATKFINHPNNFYYLSLKDISI 190
FSYC+ P G + G D G P ++ IN Y + L+ I +
Sbjct: 299 AFSYCVPGPSAAGFLSIGGPVNGDDGGGSGAFATTPLVRSANVIN--PTIYVVRLQGIEV 356
Query: 191 DNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKF----VSYFERFQLAQ 246
R+N PP F GG ++DS +V+T Y L F +Y R
Sbjct: 357 AGRRLNVPPVVF------SGGTVMDSSAVITQLPPTAYRALRLAFRNAMRAYKTRAPTGN 410
Query: 247 LSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDL 306
L C + F+ + P+++ F+ + G ++D F +A D
Sbjct: 411 LDTCFD------FVGVSKVTVPTVSLVFDGGAVIELGLLSVLLDSCLAFAPMAA---DFA 461
Query: 307 VALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ IG+ QQ+ +YD+ + F C
Sbjct: 462 LGFIGNVQQQTHEVLYDVAGGAVGFRHGAC 491
>gi|168027607|ref|XP_001766321.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682535|gb|EDQ68953.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 381
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 89/377 (23%), Positives = 142/377 (37%), Gaps = 70/377 (18%)
Query: 4 LFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPDCTY 48
+ IG P+K L +DTGS L + ++DP+K+ + ++C P C
Sbjct: 27 MLIGAPAKLYYLDMDTGSDLTWLQCDAPCRSCASGPHGLYDPKKA---RLVDCRVPLCAL 83
Query: 49 ------FKCVN--EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
+ C QC Y ++YAD S T G +TI+++ G A+ GC D
Sbjct: 84 VQQGGSYACGGPVRQCDYDVEYADGSSTMGVLMEDTITLL-LTNGTRSKTTAIIGCGYDQ 142
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLG--SIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
G + GV+GLS IS SQL I++ +CL G YL FG
Sbjct: 143 QGTLAQT-PASTDGVMGLSSAKISLPSQLAKKGIVRNVIGHCLA----GGSNGGGYLFFG 197
Query: 159 TDMGYRRPSTQATKFINHPNNFYYLSLKDIS--IDNERMNFPPDTFDITVSGEGGCIIDS 216
+ P+ T + + K I+ I + + T DI GG + DS
Sbjct: 198 DSL---VPALGMT--------WTPIMGKSITGNIGGKSGDADDKTGDI-----GGVMFDS 241
Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFED 276
G+ TY + Y + E+ L ++ + C+ P F + YF+
Sbjct: 242 GTSFTYLVPEAYNAVLSAMEMQVEKSGLVRIKT-DNTLPFCWRGPSPFESVADVQRYFKT 300
Query: 277 AN--------------LRIDGENVFIIDYENHF---FLLAVAPHDDLVALIGSQQQRDTR 319
L + E I+ + + L A ++ +IG R
Sbjct: 301 VTLDFGKRNWYSASRVLELSPEGYLIVSTQGNVCLGILDASGASLEVTNIIGDVSMRGYL 360
Query: 320 FVYDLNIDLLSFVKENC 336
VYD + + +V+ NC
Sbjct: 361 VVYDNARNQIGWVRRNC 377
>gi|359482287|ref|XP_002263129.2| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
gi|297740017|emb|CBI30199.3| unnamed protein product [Vitis vinifera]
Length = 502
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 92/378 (24%), Positives = 157/378 (41%), Gaps = 63/378 (16%)
Query: 3 RLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCDH 43
++ +G+P + + +DTGS +++ + FDP SS+ ++C H
Sbjct: 89 KVKLGSPPREFNVQIDTGSDILWVTCNSCNDCPRTSGLGIELSFFDPSSSSTTSLVSCSH 148
Query: 44 PDCTYF------KCV--NEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGA 92
P CT +C + QC Y+ Y D S T G+ + + +V+G
Sbjct: 149 PICTSLVQTTAAECSPQSNQCSYSFHYGDGSGTTGYYVSDMLYFDTVLGDSLIANSSASI 208
Query: 93 LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEY 150
+FGCS G D D A+ G+ G + +S +SQL S I K FS+CL GE
Sbjct: 209 VFGCSTYQSG-DLTKVDKAIDGIFGFGQQDLSVVSQLSSLGITPKVFSHCL-----KGEG 262
Query: 151 -TSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
L G + P+ + + + Y L+L+ IS++ + + P F S
Sbjct: 263 DGGGKLVLGEIL---EPNIIYSPLVPS-QSHYNLNLQSISVNGQLLPIDPAVF--ATSNN 316
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPI----QLCYFLPETFN 265
G I+DSG+ LTY Y + FVS A +S P+ CY + + +
Sbjct: 317 QGTIVDSGTTLTYLVETAY----DPFVSAIT----ATVSSSTTPVLSKGNQCYLVSTSVD 368
Query: 266 R-FPSMAFYFEDANLRI--DGENVFIIDYENHFFLLAVA---PHDDLVALIGSQQQRDTR 319
FP ++ F + GE + + + + + + + + ++G +D
Sbjct: 369 EIFPPVSLNFAGGASMVLKPGEYLMHLGFSDGAAMWCIGFQKVAEPGITILGDLVLKDKI 428
Query: 320 FVYDLNIDLLSFVKENCS 337
FVYDL + + +CS
Sbjct: 429 FVYDLAHQRIGWANYDCS 446
>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 90/375 (24%), Positives = 142/375 (37%), Gaps = 57/375 (15%)
Query: 3 RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
++ +GTP + +DTGS +++ FDP SS+ I C
Sbjct: 78 KVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSSMIACSD 137
Query: 44 PDC--------TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGA--- 92
C N QC YT +Y D S T G+ + + + EG +
Sbjct: 138 QRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTTNSTAPV 197
Query: 93 LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEY 150
+FGCSN G D D A+ G+ G + +S ISQL S I + FS+CL G
Sbjct: 198 VFGCSNQQTG-DLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCLKGDSSGG-- 254
Query: 151 TSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
L G + P+ T + P+ Y L+L+ I+++ + + F S
Sbjct: 255 --GILVLGEIV---EPNIVYTSLVPAQPH--YNLNLQSIAVNGQTLQIDSSVF--ATSNS 305
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-FP 268
G I+DSG+ L Y + Y + FVS + CY + + FP
Sbjct: 306 RGTIVDSGTTLAYLAEEAY----DPFVSAITASIPQSVHTVVSRGNQCYLITSSVTEVFP 361
Query: 269 SMAFYFEDANLRIDGENVFIIDYENHFFLLAV------APHDDLVALIGSQQQRDTRFVY 322
++ F I ++I +N AV + ++G +D VY
Sbjct: 362 QVSLNFAGGASMILRPQDYLIQ-QNSIGGAAVWCIGFQKIQGQGITILGDLVLKDKIVVY 420
Query: 323 DLNIDLLSFVKENCS 337
DL + + +CS
Sbjct: 421 DLAGQRIGWANYDCS 435
>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
Length = 509
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 79/354 (22%), Positives = 138/354 (38%), Gaps = 58/354 (16%)
Query: 14 LLILDTGSALIYA----------------IFDPRKSSSFQKINCDHPDCTYFK-----CV 52
L++LDT S + + ++DP KS S + C P C C
Sbjct: 183 LMLLDTASDVAWVQCFPCPASQCYAQTDVLYDPSKSRSSESFACSSPTCRQLGPYANGCS 242
Query: 53 NE-----QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDA 107
+ QC Y ++Y D S T G + +S+ + FGCS+ G +
Sbjct: 243 SSSNSAGQCQYRVRYPDGSTTSGTLVADQLSLSPTSQVPKFE----FGCSHAARGSFSRS 298
Query: 108 RDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPS 167
+ AG++ L R S +SQ + + FSYC T+S+ F RR S
Sbjct: 299 K---TAGIMALGRGVQSLVSQTSTKYGQVFSYCF-------PPTASHKGFFVLGVPRRSS 348
Query: 168 TQ--ATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHS 225
++ T + P Y + L+ I++ +R++ PP F G +DS +V+T
Sbjct: 349 SRYAVTPMLKTP-MLYQVRLEAIAVAGQRLDVPPTVF------AAGAALDSRTVITRLPP 401
Query: 226 DVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE--DANLRID 282
Y L F ++ A + + CY F + P+++ F+ A +++D
Sbjct: 402 TAYQALRSAFRDKMSMYRPAAANG---QLDTCYDFTGVSSIMLPTISLVFDRTGAGVQLD 458
Query: 283 GENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
V + + + A D +IG Q + +Y++ + F + C
Sbjct: 459 PSGVL---FGSCLAFASTAGDDRATGIIGFLQLQTIEVLYNVAGGSVGFRRGAC 509
>gi|449445943|ref|XP_004140731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 430
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 101/392 (25%), Positives = 160/392 (40%), Gaps = 86/392 (21%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY-------------AIFDPRKSSSFQKIN------- 40
+V L IGTP + L+LDTGS L + + P+ +S ++
Sbjct: 67 VVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTASFDPSLSSSFSLLP 126
Query: 41 CDHPDCTY--------FKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHG 91
C+HP C C N C Y+ YAD ++ +G E + +
Sbjct: 127 CNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLSTPPV--- 183
Query: 92 ALFGC---SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPL--- 145
+ GC S +N G +LG++ +SFISQ + I K FSYC+
Sbjct: 184 -ILGCAQASTENRG------------ILGMNHGRLSFISQ--AKISK-FSYCVPSRTGSN 227
Query: 146 PNGEY------TSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPP 199
P G + SS K+ T + + P +Q++ ++ Y L +K I I +R+N PP
Sbjct: 228 PTGLFYLGDNPNSSKFKYVTMLTF--PESQSSPNLDP--LAYTLPMKAIKIAGKRLNIPP 283
Query: 200 DTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYF-----ERFQLAQLSDCPEPI 254
F G G +IDSGS LTY + Y K+ E+ V + + A ++D
Sbjct: 284 AAFKPDAGGSGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYADVAD----- 338
Query: 255 QLCY---FLPETFNRFPSMAFYFEDANLRI---DGENVFIIDYENHFFLLAVAPHDDL-- 306
+C+ E R ++F F D + I GE V + + E + + + L
Sbjct: 339 -MCFDAGVTAEVGRRIGGISFEF-DNGVEIFVGRGEGV-LTEVEKGVKCVGIGRSERLGI 395
Query: 307 -VALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
+IG+ Q++ YDL + F CS
Sbjct: 396 GSNIIGTVHQQNMWVEYDLANKRVGFGGAECS 427
>gi|297801286|ref|XP_002868527.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314363|gb|EFH44786.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 444
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 74/273 (27%), Positives = 111/273 (40%), Gaps = 57/273 (20%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY----------------AIFDPRKSSSFQKINCDHP 44
++ L IGTPS+ L+LDTGS L + FDP SSSF + C HP
Sbjct: 82 ILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHP 141
Query: 45 DCTY--------FKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFG 95
C C N C Y+ YAD + +G E + + + G
Sbjct: 142 LCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQTTPPL----ILG 197
Query: 96 CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYL 155
C+ ++ + G+LG++ +SFISQ + I K FSYC+ P
Sbjct: 198 CAKESTD---------VKGILGMNLGRLSFISQ--AKISK-FSYCI----PTRSNRPGLA 241
Query: 156 KFGTDMGYRRPSTQATKFIN---------HPNN---FYYLSLKDISIDNERMNFPPDTFD 203
G+ P+++ K+++ PN Y + L I I +R+N P F
Sbjct: 242 STGSFYLGENPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLLGIRIGQKRLNIPSSVFR 301
Query: 204 ITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFV 236
G G ++DSGS T+ Y K+ E+ V
Sbjct: 302 PDAGGSGQTMVDSGSEFTHLVDVAYDKVKEEIV 334
>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
Length = 447
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 87/377 (23%), Positives = 149/377 (39%), Gaps = 60/377 (15%)
Query: 6 IGTPSKGVLLILDTGSALIYA-------------------------IFDPRKSSSFQKIN 40
+GTP + V L+LDTGS+L++ I+ KSS+ Q +
Sbjct: 80 LGTPPQKVSLVLDTGSSLVWTPCTIPTATYTCQNCTFSGVDPTKIPIYARNKSSTVQSLP 139
Query: 41 CDHPDCTYFKCVNEQCVYTMK--YADQSVTKGFAAHETIS-VIGKGEGKAIFHGALFGCS 97
C P C + + C T + Y G + +S V+G + I LFGCS
Sbjct: 140 CRSPKCNWVFGSDLNCSTTKRCPYYGLEYGLGSTTGQLVSDVLGLSKLNRI-PDFLFGCS 198
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
++ E G+ G R S +QLG +FSYCLV + S L
Sbjct: 199 LVSNRQPE--------GIAGFGRGLASIPAQLGLT---KFSYCLVSHRFDDTPQSGDLVL 247
Query: 158 GTDMGYRRPSTQA-----TKFINHP-----NNFYYLSLKDISIDNERMNFPPDTFDITVS 207
G R A F P + +YY+SL I + + + PP +
Sbjct: 248 --HRGRRHADAAANGVAYAPFTKSPALSPYSEYYYISLSKILVGGKDVPIPPRYLVPSKE 305
Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP-ETFNR 266
G+GG I+DSGS T+ ++ + + + +++ A+ + + CY + ++
Sbjct: 306 GDGGMIVDSGSTFTFMERIIFDPVARELEKHMTKYKRAKEIEDSSGLGPCYNITGQSEVD 365
Query: 267 FPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDD------LVALIGSQQQRDTR 319
P + F F+ AN+ + + F + + + + D+ ++G+ QQ++
Sbjct: 366 VPKLTFSFKGGANMDLPLTDYFSLVTDGVVCMTVLTDPDEPGSTTGPAIILGNYQQQNFY 425
Query: 320 FVYDLNIDLLSFVKENC 336
YDL F + C
Sbjct: 426 IEYDLKKQRFGFKPQQC 442
>gi|296090179|emb|CBI39998.3| unnamed protein product [Vitis vinifera]
Length = 334
Score = 70.9 bits (172), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 82/355 (23%), Positives = 150/355 (42%), Gaps = 66/355 (18%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
++++ IGTP V I DTGS L++ +FDP KS+SF++++C+
Sbjct: 25 LMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPSKSTSFKEVSCE---- 80
Query: 47 TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG-FDE 105
++QC + D + +FGC ++N G F+E
Sbjct: 81 ------SQQC----RLLDTPTS--------------------ILNIVFGCGHNNSGTFNE 110
Query: 106 DARDGALAGVLGLSRVTISFISQLGSII--KKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
+ G+ G +S SQ+ S + ++FS CLV P +S + FG +
Sbjct: 111 NE-----MGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLV-PFRTDPSITSKIIFGPEAEV 164
Query: 164 RRPSTQATKFINHPN-NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
+T + + +Y+++L IS+ ++ F + ++ +G ID+G+ T
Sbjct: 165 SGSDVVSTPLVTKDDPTYYFVTLDGISVGDKLFPF---SSSSPMATKGNVFIDAGTPPTL 221
Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRID 282
D Y +L + E + + D QLCY T P + +F+ A++++
Sbjct: 222 LPRDFYNRLVQGVK---EAIPMEPVQDPDLQPQLCY-RSATLIDGPILTAHFDGADVQLK 277
Query: 283 GENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
N FI E + A+ P D + G+ Q + +DL+ +SF +C+
Sbjct: 278 PLNTFISPKEG-VYCFAMQPIDGDTGIFGNFVQMNFLIGFDLDGKKVSFKAVDCT 331
>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 500
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 90/376 (23%), Positives = 141/376 (37%), Gaps = 59/376 (15%)
Query: 3 RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
R+ +G P K + +DTGS +++ FDP S++ ++C
Sbjct: 86 RVQLGNPPKDFYVQIDTGSDVLWVSCNSCNGCPATSGLQIPLNFFDPGSSTTASLVSCSD 145
Query: 44 PDCTY------FKCVNE--QCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGA 92
C C + QC Y +Y D S T G+ + I VI
Sbjct: 146 QICALGVQSSDSACFGQSNQCAYVFQYGDGSGTSGYYVMDMIHLDVVIDSSVTSNSSASV 205
Query: 93 LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEY 150
+FGCS G D D A+ G+ G + +S ISQL S I K FS+C L +
Sbjct: 206 VFGCSTSQTG-DLTKSDRAVDGIFGFGQQDLSVISQLSSRGIAPKVFSHC----LKGDDS 260
Query: 151 TSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
L G + P+ T + + P+ Y L+L+ IS++ + + P F S
Sbjct: 261 GGGILVLGEIV---EPNVVYTPLVPSQPH--YNLNLQSISVNGQVLPISPAVF--ATSSS 313
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-FP 268
G IIDSG+ L Y + Y FV CY + + FP
Sbjct: 314 QGTIIDSGTTLAYLAEEAY----NAFVVAVTNIVSQSTQSVVLKGNRCYVTSSSVSDIFP 369
Query: 269 SMAFYFEDANLRIDGENVFIIDYENHFFLLAV-------APHDDLVALIGSQQQRDTRFV 321
++ F + G ++I +N V P + ++G +D F+
Sbjct: 370 QVSLNFAGGASLVLGAQDYLIQ-QNSVGGTTVWCIGFQKIPGQG-ITILGDLVLKDKIFI 427
Query: 322 YDLNIDLLSFVKENCS 337
YDL + + +CS
Sbjct: 428 YDLANQRIGWTNYDCS 443
>gi|15235526|ref|NP_193028.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|5123933|emb|CAB45491.1| putative protein [Arabidopsis thaliana]
gi|7267994|emb|CAB78334.1| putative protein [Arabidopsis thaliana]
gi|332657803|gb|AEE83203.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 389
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 89/364 (24%), Positives = 137/364 (37%), Gaps = 63/364 (17%)
Query: 1 MVRLFIGTPSKGVLLILDTGSAL---------------IYAIFDPRKSSSFQKINCD--H 43
M + G+P K L +DTGS+L IY + P S +++ C+ H
Sbjct: 59 MAEIHFGSPQKKQFLHMDTGSSLTWTQCFPCSDCYAQKIYPKYRPAASITYRDAMCEDSH 118
Query: 44 PDCT---YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
P F + C Y Y D++ KG A E I+V G HG FGC+
Sbjct: 119 PKSNPHFAFDPLTRICTYQQHYLDETNIKGTLAQEMITVDTHDGGFKRVHGVYFGCNT-- 176
Query: 101 HGFDEDARDGAL---AGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
DG+ G+LGL S I + GS +FS+CL GE +
Sbjct: 177 ------LSDGSYFTGTGILGLGVGKYSIIGEFGS----KFSFCL------GEISEPKASH 220
Query: 158 GTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
+G IN L+ I + E IT+ +D+G
Sbjct: 221 NLILGDGANVQGHPTVINITEGHTIFQLESIIVGEE----------ITLDDPVQVFVDTG 270
Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE-- 275
S L++ +++Y+K + F L+ EP LCY +T R M F+
Sbjct: 271 STLSHLSTNLYYKFVDAFDDLIGSRPLSY-----EPT-LCY-KADTIERLEKMDVGFKFD 323
Query: 276 -DANLRIDGENVFIIDYENHFFLLAVAPHDDLVA--LIGSQQQRDTRFVYDLNIDLLSFV 332
A L ++ N+FI LA+ + + + +IG + YDL+
Sbjct: 324 VGAELSVNIHNIFIQQGPPEIRCLAIQNNKESFSHVIIGVIAMQGYNVGYDLSAKTAYIN 383
Query: 333 KENC 336
K++C
Sbjct: 384 KQDC 387
>gi|88174556|gb|ABD39353.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
Length = 321
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 86/342 (25%), Positives = 143/342 (41%), Gaps = 53/342 (15%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYAIFD-------PR-----KSSSFQKINCDHPDCTY 48
++ + +GTPSK ++ +DTGS+ + + PR +S++ K++C C
Sbjct: 2 VISVGLGTPSKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQSRSTTCAKVSCGTSMCLL 61
Query: 49 F----KCVNEQ----CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
C + + C + + Y D S + G +T++ G FGC+ D+
Sbjct: 62 GGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF----SDVQKIPGFSFGCNMDS 117
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY-----TSSYL 155
G +E G + G+LG+ +S + Q S FSYCL PL E T+ Y
Sbjct: 118 FGANE---FGNVDGLLGMGAGAMSVLKQ-SSPTFDCFSYCL--PLQKSERGFFSKTTGYF 171
Query: 156 KFGTDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
G R + TK + N +++ L IS+D ER+ P F G +
Sbjct: 172 SLGKVA--TRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFS-----RKGVV 224
Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET-FNRFPSMAF 272
DSGS L+Y L ++ R A+ E + CY + P+++
Sbjct: 225 FDSGSELSYIPDRALSVLSQRIRELLLRRGAAE----EESERNCYDMRSVDEGDMPAISL 280
Query: 273 YFED-ANLRIDGENVFIID--YENHFFLLAVAPHDDLVALIG 311
+F+D A + VF+ E + LA AP + V++IG
Sbjct: 281 HFDDGARFDLGRGGVFVERSVQEQDVWCLAFAPTES-VSIIG 321
>gi|302774304|ref|XP_002970569.1| hypothetical protein SELMODRAFT_93861 [Selaginella moellendorffii]
gi|300162085|gb|EFJ28699.1| hypothetical protein SELMODRAFT_93861 [Selaginella moellendorffii]
Length = 490
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 91/364 (25%), Positives = 150/364 (41%), Gaps = 53/364 (14%)
Query: 3 RLFIGTPSKGVLLILDTGSALIYAI------------FDPRKSSSFQKINCDHPDCTYFK 50
R+ IGTP LI+D S + F P SSS++ + C + +C+
Sbjct: 38 RVKIGTPPHEFSLIVDRSSFVSPKTMFCSFFFLQDPRFSPALSSSYKPLECGN-ECSTGF 96
Query: 51 CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGE--GKAIFHGALFGCSNDNHGFDEDAR 108
C + Y +YA++S + G + IS + G+ + +FGC G D
Sbjct: 97 CDGSR-KYQRQYAEKSTSSGVLGKDVISFSNSSDLGGQRL----VFGCETAETG---DLY 148
Query: 109 DGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
D G++GL R +S I QL + ++ FS C + + G G++ P
Sbjct: 149 DQTADGIIGLGRGPLSIIDQLVEKNAMEDVFSLCY----GGMDEGGGAMILG---GFQPP 201
Query: 167 STQA-TKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHS 225
T H + +Y L LK I + + P+ FD G+ G ++DSG+ YF
Sbjct: 202 KDMVFTSSDPHRSPYYNLMLKGIRVGGSPLRLKPEVFD----GKYGTVLDSGTTYAYFPG 257
Query: 226 DVYWKLHEKFVSYFERFQLAQLSDCPEPIQ----LCYFLPET-----FNRFPSMAFYFED 276
+ + F S + Q+ L + P P + +CY T FPS+ F F D
Sbjct: 258 AAF----QAFKSAVKE-QVGSLKEVPGPDEKFKDICYAGAGTNVSNLSQFFPSVDFVFGD 312
Query: 277 A-NLRIDGEN-VFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
++ + EN +F + + L V + D L+G R+ Y+ + F+K
Sbjct: 313 GQSVTLSPENYLFRHTKISGAYCLGVFENGDPTTLLGGIIVRNMLVTYNRGKASIGFLKT 372
Query: 335 NCSD 338
C+D
Sbjct: 373 KCND 376
>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 85/376 (22%), Positives = 155/376 (41%), Gaps = 59/376 (15%)
Query: 3 RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
++ +G+P + + +DTGS +++ FDP S + ++C
Sbjct: 84 KIRLGSPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTATPVSCSD 143
Query: 44 PDCTY--------FKCVNEQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGA 92
C++ N C YT +Y D S T GF + + ++G
Sbjct: 144 QRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPV 203
Query: 93 LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEY 150
+FGCS G D D A+ G+ G + +S ISQL S + + FS+CL GE
Sbjct: 204 VFGCSTSQTG-DLVKSDRAVDGIFGFGQQGMSVISQLASQGLAPRVFSHCL-----KGEN 257
Query: 151 TSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
+ ++ P+ T + + P+ Y ++L IS++ + + P F T +G+
Sbjct: 258 GGGGILVLGEI--VEPNMVFTPLVPSQPH--YNVNLLSISVNGQALPINPSVFS-TSNGQ 312
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF-NRFP 268
G IID+G+ L Y Y E + + +S + CY + + + FP
Sbjct: 313 -GTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQ----CYVIATSVADIFP 367
Query: 269 SMAFYFE-DANLRIDGENVFIIDYENHFFLLAV------APHDDLVALIGSQQQRDTRFV 321
++ F A++ ++ ++ I +N+ AV + + ++G +D FV
Sbjct: 368 PVSLNFAGGASMFLNPQDYLI--QQNNVGGTAVWCIGFQRIQNQGITILGDLVLKDKIFV 425
Query: 322 YDLNIDLLSFVKENCS 337
YDL + + +CS
Sbjct: 426 YDLVGQRIGWANYDCS 441
>gi|242067689|ref|XP_002449121.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
gi|241934964|gb|EES08109.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
Length = 358
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 86/323 (26%), Positives = 130/323 (40%), Gaps = 59/323 (18%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYAIFD-PRKS-----------SSFQKINCDHPDCTYF 49
V + IG P+K L +DTGS L + D P +S ++ + C + CT
Sbjct: 56 VTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPTANSLVPCANALCTAL 115
Query: 50 --------KCVN-EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
KC + +QC Y +KY D + ++G ++ S+ + I G FGC D
Sbjct: 116 HSGHGSNNKCPSPKQCDYQIKYTDSASSQGVLINDNFSLPMRSSN--IRPGLTFGCGYDQ 173
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLG--SIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
A A G+LGL R ++S +SQL I K +CL NG +L FG
Sbjct: 174 QVGKNGAVQAATDGMLGLGRGSVSLVSQLKQQGITKNVLGHCLST---NG---GGFLFFG 227
Query: 159 TDMGYRRPSTQAT--KFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
D+ P+++ T N+Y + D + P + DS
Sbjct: 228 DDI---VPTSRVTWVPMAKISGNYYSPGSGTLYFDRRSLGVKPMEV----------VFDS 274
Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF-------NRFPS 269
GS TYF + Y + S + L Q+SD P LC+ P+ F F S
Sbjct: 275 GSTYTYFTAQPYQAVVSALKSGLSK-SLKQVSDPSLP--LCWKGPKAFKSVFDVKKEFKS 331
Query: 270 MAFYF---EDANLRIDGENVFII 289
+ F ++A + I EN I+
Sbjct: 332 LFLSFASAKNAVMEIPPENYLIV 354
>gi|296086208|emb|CBI31649.3| unnamed protein product [Vitis vinifera]
Length = 761
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 79/370 (21%), Positives = 137/370 (37%), Gaps = 105/370 (28%)
Query: 2 VRLFIGTPSKGVLLILDTGSAL----------IYAIFDPRKSSSFQKINCDHPDCTYFKC 51
V L +G+P + V ++LDTGS L ++++FDP +SSS+ I C P C
Sbjct: 377 VSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHSVFDPLRSSSYSPIPCTSPTC----- 431
Query: 52 VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGA 111
+ +T +IG G
Sbjct: 432 -----------------RTRTHSKTTGLIGMNRG-------------------------- 448
Query: 112 LAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQAT 171
++SF++Q+G ++FSYC+ +G+ +S L FG + + T
Sbjct: 449 ----------SLSFVTQMG---LQKFSYCI-----SGQDSSGILLFGESSFSWLKALKYT 490
Query: 172 KF--INHPNNF-----YYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
I+ P + Y + L+ I + N + P + +G G ++DSG+ T+
Sbjct: 491 PLVQISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGAGQTMVDSGTQFTFLL 550
Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCPEP-------IQLCYFLP---ETFNRFPSMAFYF 274
VY L +FV R A L +P + LCY +P T P++ F
Sbjct: 551 GPVYTALKNEFV----RQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRTLPPLPTVTLMF 606
Query: 275 EDANLRIDGENVF-----IIDYENHFFLLAVAPHDDLVA---LIGSQQQRDTRFVYDLNI 326
A + + E + +I + + + L +IG Q++ +DL
Sbjct: 607 RGAEMSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGVESYIIGHHHQQNVWMEFDLAK 666
Query: 327 DLLSFVKENC 336
+ F + C
Sbjct: 667 SRVGFAEVRC 676
>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 95/391 (24%), Positives = 150/391 (38%), Gaps = 75/391 (19%)
Query: 4 LFIGTPSKGVLLILDTGSALIY----------------------AIFDPRKSSSFQKINC 41
L GTP + + LI DTGS+L++ F P+ SSS + + C
Sbjct: 85 LSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSSSKLVGC 144
Query: 42 DHPDCTYF--KCVNEQC---------------VYTMKYADQSVTKGFAAHETISVIGKGE 84
+P C++ V QC Y ++Y S T G ET+ K
Sbjct: 145 QNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGS-TAGLLLSETLDFPDK-- 201
Query: 85 GKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIP 144
+ GCS F + +G+ G R + S SQ+G K+F+YCL
Sbjct: 202 ---XIPNFVVGCS-----FLSIHQP---SGIAGFGRGSESLPSQMG---LKKFAYCLASR 247
Query: 145 LPNGEYTSSYLKFGTDMGYRRPSTQATKFINHP---NN----FYYLSLKDISIDNERMNF 197
+ S L + G + T F +P NN +YYL+++ I + N+ +
Sbjct: 248 KFDDSPHSGQLILDS-TGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVKV 306
Query: 198 PPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLC 257
P G GG IIDSGS T+ V + +F + A + ++ C
Sbjct: 307 PYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETLTGLRPC 366
Query: 258 YFL-PETFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHD---------DL 306
+ + E +FP + F F+ A + N F + + L V H
Sbjct: 367 FDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGGGGGGGP 426
Query: 307 VALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
++G+ QQ++ YDL L F ++ CS
Sbjct: 427 SVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 457
>gi|357162717|ref|XP_003579500.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 488
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 98/411 (23%), Positives = 153/411 (37%), Gaps = 95/411 (23%)
Query: 6 IGTPSKGVLLILDTGSALIYA----------------------IFDPRKSSSFQKINCDH 43
+GTP + + ++LDTGS L + +F P+ SSS + + C +
Sbjct: 97 LGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNCSSSPSAMSAMAVFHPKNSSSSRLVGCRN 156
Query: 44 PDCTYFKCVNEQCV--------------YTMKYADQSVTKGFAAHETISV--IGKGEGKA 87
P C + + Y + Y S T G +T+ + A
Sbjct: 157 PACRWIHSKSPSTCGSTGNNGNGDVCPPYLVVYGSGS-TSGLLISDTLRLSPSSSSSAPA 215
Query: 88 IFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVI-PLP 146
F GCS + +G+ G R S SQL +FSYCL+
Sbjct: 216 PFRNFAIGCS-------IVSVHQPPSGLAGFGRGAPSVPSQLKV---PKFSYCLLSRRFD 265
Query: 147 NGEYTSSYLKFGTDM---GYRRPSTQATKFINHPNN------FYYLSLKDISIDNERMNF 197
+ S L G M G ++ + Q +N+ + +YYL+L IS+ + +N
Sbjct: 266 DNSAVSGELVLGDAMVPAGKKKTTMQYVPLLNNAASKPPYSVYYYLALTGISVGGKPVNL 325
Query: 198 PPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYF-ERFQLAQLSDCPEPIQL 256
P F S GG IIDSG+ TY V+ + S R+ ++ + ++
Sbjct: 326 PSRAF--VPSSGGGAIIDSGTTFTYLDPTVFKPVAAAMESAVGGRYNRSRPVEDALGLRP 383
Query: 257 CYFLPETFNRFPSMAFYFEDANLRIDGENVFIIDYENHF---------------FLLAVA 301
C+ LP P A D L+ G V + EN+F LAV
Sbjct: 384 CFALPPG----PGGAMELPDLELKFKGGAVMRLPVENYFVAAGPAGGPAAGPVAICLAVV 439
Query: 302 PHDDLVA------------LIGSQQQRDTRFVYDLNIDLLSFVKENCSDDS 340
DL A ++GS QQ++ YDL + L F ++ C+ S
Sbjct: 440 --SDLPASGGDGAAAGPAIILGSFQQQNYHIEYDLGKERLGFRQQPCAPKS 488
>gi|357137832|ref|XP_003570503.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 564
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 82/314 (26%), Positives = 125/314 (39%), Gaps = 52/314 (16%)
Query: 4 LFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSF-------QKINC 41
+F+G P + L +DTGS L + ++ P K Q++
Sbjct: 198 IFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKIVPPRDLLCQELQG 257
Query: 42 DHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
D C K QC Y ++YAD+S + G A + + +I G+ +FGC+ D
Sbjct: 258 DQNYCATCK----QCDYEIEYADRSSSMGVLAKDDMHMIATNGGREKLD-FVFGCAYDQQ 312
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
G G+LGLS IS SQL S II F +C + PNG Y+ G
Sbjct: 313 G-QLLTSPAKTDGILGLSSAAISLPSQLASQGIISNVFGHC-ITKEPNG---GGYMFLGD 367
Query: 160 DMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGG----CIID 215
D R T A P+N Y+ + ++ ++++ + G+ G I D
Sbjct: 368 DYVPRWGMTWA-PIRGGPDNLYHTEAQKVNYGDQQLR---------MHGQAGSSIQVIFD 417
Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE 275
SGS TY ++Y KL + F + SD P LC+ + +F+
Sbjct: 418 SGSSYTYLPDEIYKKLVTAIKYDYPSF-VQDTSDTTLP--LCWKADFDVRYLEDVKQFFK 474
Query: 276 DANLRIDGENVFII 289
NL G F+I
Sbjct: 475 PLNLHF-GNRWFVI 487
>gi|88174561|gb|ABD39355.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
Length = 321
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 83/340 (24%), Positives = 144/340 (42%), Gaps = 49/340 (14%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYAIFD-------PR-----KSSSFQKINCDHPDCTY 48
++ + +GTPSK +L +DTGS+ + + PR +S++ K++C C
Sbjct: 2 VISVGLGTPSKTQILEIDTGSSTSWVFCECDGCHTNPRTFLQSRSTTCAKVSCGTSMCLL 61
Query: 49 F----KCVNEQ----CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
C + + C + + Y D S + G +T++ + + FGC+ D+
Sbjct: 62 GGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSF----SFGCNMDS 117
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY---TSSYLKF 157
G +E G + G+LG+ +S + Q S FSYCL + + + T+ Y
Sbjct: 118 FGANE---FGNVDGLLGMGAGPMSVLKQ-SSPTFDGFSYCLPLQMSERGFFSKTTGYFSL 173
Query: 158 GTDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
G R + TK + N +++ L IS+D ER+ P F G + D
Sbjct: 174 GKVA--TRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFS-----RKGVVFD 226
Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET-FNRFPSMAFYF 274
SGS L+Y L ++ R A+ E + CY + P+++ +F
Sbjct: 227 SGSELSYIPDRALSVLSQRIRELLLRRGAAE----EESERNCYDMRSVDEGDMPAISLHF 282
Query: 275 ED-ANLRIDGENVFIID--YENHFFLLAVAPHDDLVALIG 311
+D A + VF+ E + LA AP + V++IG
Sbjct: 283 DDGARFDLGSHGVFVERSVQEQDVWCLAFAPTES-VSIIG 321
>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
Length = 468
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 86/383 (22%), Positives = 143/383 (37%), Gaps = 71/383 (18%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCD 42
RL +GTP + + +DTGS +++ FDP S + I+C
Sbjct: 54 TRLQLGTPPRDFYVQIDTGSDVLWVSCGSCNGCPVNSGLHIPLNFFDPGSSPTASLISCS 113
Query: 43 HPDCTY--------FKCVNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHG 91
C+ N C Y +Y D S T G+ + + +V+G
Sbjct: 114 DQRCSLGLQSSDSVCSAQNNLCGYNFQYGDGSGTSGYYVSDLLHFDTVLGGSVMNNSSAP 173
Query: 92 ALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGE 149
+FGCS G D D A+ G+ G + +S +SQL S I + FS+C L +
Sbjct: 174 IVFGCSALQTG-DLTKSDRAVDGIFGFGQQDMSVVSQLASQGISPRAFSHC----LKGDD 228
Query: 150 YTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSG 208
L G + P+ T + + P+ Y L+++ IS++ + + P F S
Sbjct: 229 SGGGILVLGEIV---EPNIVYTPLVPSQPH--YNLNMQSISVNGQTLAIDPSVFG--TSS 281
Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-F 267
G IIDSG+ L Y Y + F+S + CY + + N F
Sbjct: 282 SQGTIIDSGTTLAYLAEAAY----DPFISAITSIVSPSVRPYLSKGNHCYLISSSINDIF 337
Query: 268 PSMAFYFEDA-------------NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQ 314
P ++ F I G ++ I ++ + ++G
Sbjct: 338 PQVSLNFAGGASMILIPQDYLIQQSSIGGAALWCIGFQKI--------QGQGITILGDLV 389
Query: 315 QRDTRFVYDLNIDLLSFVKENCS 337
+D FVYD+ + + +CS
Sbjct: 390 LKDKIFVYDIANQRIGWANYDCS 412
>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 95/391 (24%), Positives = 150/391 (38%), Gaps = 75/391 (19%)
Query: 4 LFIGTPSKGVLLILDTGSALIY----------------------AIFDPRKSSSFQKINC 41
L GTP + + LI DTGS+L++ F P+ SSS + + C
Sbjct: 85 LSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSSSKLVGC 144
Query: 42 DHPDCTYF--KCVNEQC---------------VYTMKYADQSVTKGFAAHETISVIGKGE 84
+P C++ V QC Y ++Y S T G ET+ K
Sbjct: 145 QNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGS-TAGLLLSETLDFPDKK- 202
Query: 85 GKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIP 144
+ GCS F + +G+ G R + S SQ+G K+F+YCL
Sbjct: 203 ----IPNFVVGCS-----FLSIHQP---SGIAGFGRGSESLPSQMG---LKKFAYCLASR 247
Query: 145 LPNGEYTSSYLKFGTDMGYRRPSTQATKFINHP---NN----FYYLSLKDISIDNERMNF 197
+ S L + G + T F +P NN +YYL+++ I + N+ +
Sbjct: 248 KFDDSPHSGQLILDS-TGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVKV 306
Query: 198 PPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLC 257
P G GG IIDSGS T+ V + +F + A + ++ C
Sbjct: 307 PYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETLTGLRPC 366
Query: 258 YFL-PETFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHD---------DL 306
+ + E +FP + F F+ A + N F + + L V H
Sbjct: 367 FDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGGGGGGGP 426
Query: 307 VALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
++G+ QQ++ YDL L F ++ CS
Sbjct: 427 SVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 457
>gi|42407406|dbj|BAD09564.1| nucleoid DNA-binding protein-like [Oryza sativa Japonica Group]
Length = 205
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 80/175 (45%), Gaps = 15/175 (8%)
Query: 115 VLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPST----QA 170
++GL R +S +SQLG RFSYCL L ++ F T G S+ Q+
Sbjct: 1 MVGLGRGLLSLVSQLG---PSRFSYCLTSFLSPEPSRLNFGVFATLNGTNASSSGLPVQS 57
Query: 171 TKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVY 228
T + + + Y++SLK IS+ +R+ P F I G GG IDSG+ LT+ DVY
Sbjct: 58 TPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGVFIDSGTSLTWLQQDVY 117
Query: 229 WKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDG 283
+ + VS L +D ++ C+ P P++ D L DG
Sbjct: 118 DAVRRELVSVLR--PLPPANDTEIGLETCFPWPPP----PTVTMTVPDMELHFDG 166
>gi|125561847|gb|EAZ07295.1| hypothetical protein OsI_29543 [Oryza sativa Indica Group]
Length = 205
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 80/175 (45%), Gaps = 15/175 (8%)
Query: 115 VLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPST----QA 170
++GL R +S +SQLG RFSYCL L ++ F T G S+ Q+
Sbjct: 1 MVGLGRGLLSLVSQLG---PSRFSYCLTSFLSPEPSRLNFGVFATLNGTNASSSGLPVQS 57
Query: 171 TKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVY 228
T + + + Y++SLK IS+ +R+ P F I G GG IDSG+ LT+ DVY
Sbjct: 58 TPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGVFIDSGTSLTWLQQDVY 117
Query: 229 WKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDG 283
+ + VS A +D ++ C+ P P++ D L DG
Sbjct: 118 DAVRRELVSVLRPLPPA--NDTEIGLETCFPWPPP----PTVTMTVPDMELHFDG 166
>gi|413938615|gb|AFW73166.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 386
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 90/372 (24%), Positives = 140/372 (37%), Gaps = 70/372 (18%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYA-----------------IFDPRKSSSFQKINCDH 43
+V +GTP + +DTGS L + +FDP +SSS+ + C
Sbjct: 49 VVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPCGG 108
Query: 44 PDCTYFKCVNEQCV------YTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
P C Y + Y D S T G + +T+++ + G FGC
Sbjct: 109 PVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTL----SASSAVQGFFFGCG 164
Query: 98 NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
+ G + G+LGL R S + Q FSYC LP T+ YL
Sbjct: 165 HAQSGLFN-----GVDGLLGLGREQPSLVEQTAGTYGGVFSYC----LPTKPSTAGYLTL 215
Query: 158 GTDM-GYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
G P T+ + PN +Y + L IS+ ++++ P F +
Sbjct: 216 GVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTV------V 269
Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP----IQLCY-FLPETFNRFPS 269
D+G+V+T Y L F R +A P + CY F P+
Sbjct: 270 DTGTVVTRLPPTAYAALRSAF-----RSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPN 324
Query: 270 MAFYF-EDANLRIDGENVFIIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFVYDLNI 326
+A F A + + + + F LA AP D +A++G+ QQR +++ I
Sbjct: 325 VALTFGSGATVTLGADGIL------SFGCLAFAPSGSDGGMAILGNVQQRS----FEVRI 374
Query: 327 DLLS--FVKENC 336
D S F +C
Sbjct: 375 DGTSVGFKPSSC 386
>gi|225465839|ref|XP_002264668.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 2 [Vitis
vinifera]
Length = 451
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 81/372 (21%), Positives = 147/372 (39%), Gaps = 57/372 (15%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIY-----------AIFDPRKSSSFQKINCDHPDCTYF 49
+VR IGTP++ +L+ +DT S + + +F+ S++++ + C C
Sbjct: 102 IVRAKIGTPAQTMLMAMDTSSDVAWIPCNGCLGCSSTLFNSPASTTYKSLGCQAAQCKQV 161
Query: 50 -----------------KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGA 92
C C + + Y S+ + +TI++ G
Sbjct: 162 LHLLSPLLTSPSVVPKPTCGGGVCSFNLTYGGSSLAANLS-QDTITLATDA-----VPGY 215
Query: 93 LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS 152
FGC G A+ R +S +SQ ++ + FSYCL P S
Sbjct: 216 SFGCIQKATGGSLPAQGLLGL-----GRGPLSLLSQTQNLYQSTFSYCL--PSFKSLNFS 268
Query: 153 SYLKFGTDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
L+ G +R + T + +P + Y+++L + + ++ PP +F S
Sbjct: 269 GSLRLGPVGQPKR--IKYTPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGSFTFNPSTGA 326
Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFER-FQLAQLSDCPEPIQLCYFLPETFNRFPS 269
G I DSG+V T + Y + + F + R + L CY +P P+
Sbjct: 327 GTIFDSGTVFTRLVTPAYIAVRDAFRNRVGRNLTVTSLGG----FDTCYTVPIA---APT 379
Query: 270 MAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLN 325
+ F F N+ + +N+ I LA+A D ++ +I + QQ++ R +YD+
Sbjct: 380 ITFMFTGMNVTLPPDNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDVP 439
Query: 326 IDLLSFVKENCS 337
L +E C+
Sbjct: 440 NSRLGVARELCT 451
>gi|224031303|gb|ACN34727.1| unknown [Zea mays]
gi|413923868|gb|AFW63800.1| hypothetical protein ZEAMMB73_012138 [Zea mays]
Length = 557
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 70/252 (27%), Positives = 105/252 (41%), Gaps = 40/252 (15%)
Query: 4 LFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSF-------QKINC 41
+FIG P + L +DTGS L + ++ P K Q++
Sbjct: 191 IFIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKIVPPRDLLCQELQG 250
Query: 42 DHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
+ C K QC Y ++YADQS + G A + + +I G+ +FGC+ D
Sbjct: 251 NQNYCETCK----QCDYEIEYADQSSSMGVLARDDMHMIATNGGREKLD-FVFGCAYDQQ 305
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
G + G+LGLS ISF SQL S II F +C+ G Y+ G
Sbjct: 306 G-QLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCITREQGGG----GYMFLGD 360
Query: 160 DMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
D R T T + P+N Y+ + ++++ P+ TV I DSGS
Sbjct: 361 DYVPRWGVTW-TSIRSGPDNLYHTQAHHVKYGDQQLRR-PEQAGSTVQ----VIFDSGSS 414
Query: 220 LTYFHSDVYWKL 231
TY +++Y L
Sbjct: 415 YTYLPNEIYENL 426
>gi|226495667|ref|NP_001146721.1| uncharacterized protein LOC100280323 [Zea mays]
gi|219888491|gb|ACL54620.1| unknown [Zea mays]
Length = 557
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 70/252 (27%), Positives = 105/252 (41%), Gaps = 40/252 (15%)
Query: 4 LFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSF-------QKINC 41
+FIG P + L +DTGS L + ++ P K Q++
Sbjct: 191 IFIGNPPRPYFLDVDTGSDLTWIQCDAPCTNFAKGPHPLYKPAKEKIVPPRDLLCQELQG 250
Query: 42 DHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
+ C K QC Y ++YADQS + G A + + +I G+ +FGC+ D
Sbjct: 251 NQNYCETCK----QCDYEIEYADQSSSMGVLARDDMHMIATNGGREKLD-FVFGCAYDQQ 305
Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
G + G+LGLS ISF SQL S II F +C+ G Y+ G
Sbjct: 306 G-QLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCITREQGGG----GYMFLGD 360
Query: 160 DMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
D R T T + P+N Y+ + ++++ P+ TV I DSGS
Sbjct: 361 DYVPRWGVTW-TSIRSGPDNLYHTQAHHVKYGDQQLRR-PEQAGSTVQ----VIFDSGSS 414
Query: 220 LTYFHSDVYWKL 231
TY +++Y L
Sbjct: 415 YTYLPNEIYENL 426
>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 492
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 83/333 (24%), Positives = 133/333 (39%), Gaps = 57/333 (17%)
Query: 29 DPRKSSSFQKINCDHPDCTYFKCVNE--QCVYTMKYADQSVTKGFAAHETI---SVIGKG 83
DP +S+FQ T +C+ + QC YT +Y D S T G+ E++ V+G+
Sbjct: 141 DPICNSAFQT--------TATQCLTQSNQCSYTFQYGDGSGTSGYYVSESMYFDMVMGQS 192
Query: 84 EGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCL 141
+FGCS G D D A+ G+ G +S ISQL + I K FS+CL
Sbjct: 193 MIANSSASVVFGCSTYQSG-DLTKSDHAIDGIFGFGPGDLSVISQLSARGITPKVFSHCL 251
Query: 142 VIPLPNGEYT-SSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPP 199
GE L G + P + + + P+ Y L L+ IS++ + + P
Sbjct: 252 -----KGEGNGGGILVLGEVL---EPGIVYSPLVPSQPH--YNLYLQSISVNGQTLPIDP 301
Query: 200 DTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF 259
F ++ G IIDSG+ L Y + Y FVS ++ CY
Sbjct: 302 SVFATSI--NRGTIIDSGTTLAYLVEEAY----TPFVSAITAAVSQSVTPTISKGNQCYL 355
Query: 260 LPETFNR-FPSMAFYFEDANLRI-------------DGENVFIIDYENHFFLLAVAPHDD 305
+ + FP ++ F + + DG ++ I ++ +
Sbjct: 356 VSTSVGEIFPLVSLNFAGSASMVLKPEEYLMHLGFYDGAALWCIGFQK---------VQE 406
Query: 306 LVALIGSQQQRDTRFVYDLNIDLLSFVKENCSD 338
V ++G +D FVYDL + + +CS
Sbjct: 407 GVTILGDLVMKDKIFVYDLARQRIGWASYDCSQ 439
>gi|125527370|gb|EAY75484.1| hypothetical protein OsI_03384 [Oryza sativa Indica Group]
Length = 453
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 97/366 (26%), Positives = 142/366 (38%), Gaps = 68/366 (18%)
Query: 6 IGTPSKGVLLILDTGSALIY------AIFDPRKSSSFQK--------INCDHPDCTYFK- 50
IGTP+ G+ DTGS LI+ A PR S S+ + C C
Sbjct: 98 IGTPATGLSGEADTGSDLIWTKCGACARCSPRGSPSYYPTSSSSAAFVACGDRTCGELPR 157
Query: 51 --CVN--------EQCVYTMKYADQS----VTKGFAAHETISVIGKGEGKAIFHGALFGC 96
C N C Y Y + T+G ET + G+ A F G FGC
Sbjct: 158 PLCSNVAGGGSGSGNCSYHYAYGNARDTHHYTEGILMTETFTF---GDDAAAFPGIAFGC 214
Query: 97 S-NDNHGFDEDARDGALAGVLGLSRVTISFISQLG-SIIKKRFSYCLVIPLPNGEYTSSY 154
+ GF G +G++GL R +S ++QL R S L P P S+
Sbjct: 215 TLRSEGGF------GTGSGLVGLGRGKLSLVTQLNVEAFGYRLSSDLSAPSP-----ISF 263
Query: 155 LKFGTDMGYRRPSTQATKFINHPN----NFYYLSLKDISIDNERMNFPPDTFDITVS-GE 209
G S +T + +P FYY+ L IS+ + + P TF S G
Sbjct: 264 GSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQIPSGTFSFDRSTGA 323
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQ-----LCYFLPETF 264
GG I DSG+ LT Y + ++ +S FQ P P +C+ +
Sbjct: 324 GGVIFDSGTTLTMLPDPAYTLVRDELLSQMG-FQK------PPPAANDDDLICFTGGSST 376
Query: 265 NRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLA----VAPHDDLVALIGSQQQRDTR 319
FPSM +F+ A++ + EN ++ + A V + +IG+ Q D
Sbjct: 377 TTFPSMVLHFDGGADMDLSTEN-YLPQMQGQNGETARCWSVVKSSQALTIIGNIMQMDFH 435
Query: 320 FVYDLN 325
V+DL+
Sbjct: 436 VVFDLS 441
>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
Length = 426
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 69/272 (25%), Positives = 112/272 (41%), Gaps = 45/272 (16%)
Query: 3 RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
+L +GTP + + +DTGS +++ FDP S + I+C
Sbjct: 84 KLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPISCSD 143
Query: 44 PDCTY--------FKCVNEQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGA 92
C++ N C YT +Y D S T GF + + ++G
Sbjct: 144 QRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPV 203
Query: 93 LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEY 150
+FGCS G D D A+ G+ G + +S ISQL S I + FS+CL GE
Sbjct: 204 VFGCSTSQTG-DLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCL-----KGEN 257
Query: 151 TSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
+ ++ P+ T + + P+ Y ++L IS++ + + P F T +G+
Sbjct: 258 GGGGILVLGEI--VEPNMVFTPLVPSQPH--YNVNLLSISVNGQALPINPSVFS-TSNGQ 312
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFER 241
G IID+G+ L Y Y E + +
Sbjct: 313 -GTIIDTGTTLAYLSEAAYVPFVEAITNAVSQ 343
>gi|125571687|gb|EAZ13202.1| hypothetical protein OsJ_03122 [Oryza sativa Japonica Group]
Length = 453
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 97/366 (26%), Positives = 142/366 (38%), Gaps = 68/366 (18%)
Query: 6 IGTPSKGVLLILDTGSALIY------AIFDPRKSSSFQK--------INCDHPDCTYFK- 50
IGTP+ G+ DTGS LI+ A PR S S+ + C C
Sbjct: 98 IGTPATGLSGEADTGSDLIWTKCGACARCSPRGSPSYYPTSSSSAAFVACGDRTCGELPR 157
Query: 51 --CVN--------EQCVYTMKYADQS----VTKGFAAHETISVIGKGEGKAIFHGALFGC 96
C N C Y Y + T+G ET + G+ A F G FGC
Sbjct: 158 PLCSNVAGGGSGSGNCSYHYAYGNARDTHHYTEGILMTETFTF---GDDAAAFPGIAFGC 214
Query: 97 S-NDNHGFDEDARDGALAGVLGLSRVTISFISQLG-SIIKKRFSYCLVIPLPNGEYTSSY 154
+ GF G +G++GL R +S ++QL R S L P P S+
Sbjct: 215 TLRSEGGF------GTGSGLVGLGRGKLSLVTQLNVEAFGYRLSSDLSAPSP-----ISF 263
Query: 155 LKFGTDMGYRRPSTQATKFINHPN----NFYYLSLKDISIDNERMNFPPDTFDITVS-GE 209
G S +T + +P FYY+ L IS+ + + P TF S G
Sbjct: 264 GSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQIPSGTFSFDRSTGA 323
Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQ-----LCYFLPETF 264
GG I DSG+ LT Y + ++ +S FQ P P +C+ +
Sbjct: 324 GGVIFDSGTTLTMLPDPAYTLVRDELLSQMG-FQK------PPPAANDDDLICFTGGSST 376
Query: 265 NRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLA----VAPHDDLVALIGSQQQRDTR 319
FPSM +F+ A++ + EN ++ + A V + +IG+ Q D
Sbjct: 377 TTFPSMVLHFDGGADMDLSTEN-YLPQMQGQNGETARCWSVVKSSQALTIIGNIMQMDFH 435
Query: 320 FVYDLN 325
V+DL+
Sbjct: 436 VVFDLS 441
>gi|194708432|gb|ACF88300.1| unknown [Zea mays]
Length = 452
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 84/367 (22%), Positives = 149/367 (40%), Gaps = 65/367 (17%)
Query: 21 SALIYAIFDPRKSSSFQKINCDHPDCTYF--------KCVNEQCV--------------- 57
SA +F P+ SSS + + C +P C + KC C
Sbjct: 101 SASAVPVFHPKNSSSSRLVGCRNPSCQWVHSAANLATKCRRAPCSPGAANCPAAASNVCP 160
Query: 58 -YTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVL 116
Y + Y S T G +T+ + G+A+ G + GCS + +G+
Sbjct: 161 PYAVVYGSGS-TAGLLIADTL----RAPGRAV-PGFVLGCS-------LVSVHQPPSGLA 207
Query: 117 GLSRVTISFISQLGSIIKKRFSYCLVI------PLPNGEYTSSYLKFGTDMGYRRPSTQA 170
G R S +QLG +FSYCL+ +G G M Y P ++
Sbjct: 208 GFGRGAPSVPAQLG---LPKFSYCLLSRRFDDNAAVSGSLVLGGTGGGEGMQYV-PLVKS 263
Query: 171 TKFINHPNN-FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYW 229
P +YYL+L+ +++ + + P F +G GG I+DSG+ TY V+
Sbjct: 264 AAGDKLPYGVYYYLALRGVTVGGKAVRLPARAFAANAAGSGGTIVDSGTTFTYLDPTVFQ 323
Query: 230 KLHEKFVSYF-ERFQLAQLSDCPEPIQLCYFLPETFNR--FPSMAFYFE-DANLRIDGEN 285
+ + V+ R++ ++ ++ + C+ LP+ P ++F+FE A +++ EN
Sbjct: 324 PVADAVVAAVGGRYKRSKDAEDELGLHPCFALPQGARSMALPELSFHFEGGAVMQLPVEN 383
Query: 286 VFIIDYENHFFLLAVAPHDDLVA-------------LIGSQQQRDTRFVYDLNIDLLSFV 332
F++ + +A D ++GS QQ++ YDL + L F
Sbjct: 384 YFVVAGRGAVEAICLAVVTDFSGGSGAGNEGSGPAIILGSFQQQNYLVEYDLEKERLGFR 443
Query: 333 KENCSDD 339
+++C+
Sbjct: 444 RQSCTSS 450
>gi|125554848|gb|EAZ00454.1| hypothetical protein OsI_22475 [Oryza sativa Indica Group]
Length = 538
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 68/248 (27%), Positives = 103/248 (41%), Gaps = 32/248 (12%)
Query: 4 LFIGTPSKGVLLILDTGSALIYAIFD---------PRKSSSFQKINCDHPDCTY------ 48
++IG P + L +DTGS L + D P +K N P +Y
Sbjct: 163 MYIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPEKPNVVPPRDSYCQELQG 222
Query: 49 ---FKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
+ ++QC Y + YAD+S + G A + + +I +G+ +FGC D G +
Sbjct: 223 NQNYGDTSKQCDYEITYADRSSSMGILARDNMQLI-TADGERENLDFVFGCGYDQQG-NL 280
Query: 106 DARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
+ G+LGLS IS +QL S II F +C+ NG Y+ G D
Sbjct: 281 LSSPANTDGILGLSNAAISLPTQLASQGIISNVFGHCIAADPSNG----GYMFLGDDYVP 336
Query: 164 RRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
R T N P N Y ++ ++ ++++N +T I DSGS TY
Sbjct: 337 RWGMTW-MPIRNGPENLYSTEVQKVNYGDQQLNVRRKAGKLTQ-----VIFDSGSSYTYL 390
Query: 224 HSDVYWKL 231
D Y L
Sbjct: 391 PHDDYTNL 398
>gi|115467508|ref|NP_001057353.1| Os06g0268700 [Oryza sativa Japonica Group]
gi|53791766|dbj|BAD53531.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|53793187|dbj|BAD54393.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|113595393|dbj|BAF19267.1| Os06g0268700 [Oryza sativa Japonica Group]
gi|125596798|gb|EAZ36578.1| hypothetical protein OsJ_20919 [Oryza sativa Japonica Group]
gi|215767941|dbj|BAH00170.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 538
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 68/248 (27%), Positives = 103/248 (41%), Gaps = 32/248 (12%)
Query: 4 LFIGTPSKGVLLILDTGSALIYAIFD---------PRKSSSFQKINCDHPDCTY------ 48
++IG P + L +DTGS L + D P +K N P +Y
Sbjct: 163 MYIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPEKPNVVPPRDSYCQELQG 222
Query: 49 ---FKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
+ ++QC Y + YAD+S + G A + + +I +G+ +FGC D G +
Sbjct: 223 NQNYGDTSKQCDYEITYADRSSSMGILARDNMQLI-TADGERENLDFVFGCGYDQQG-NL 280
Query: 106 DARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
+ G+LGLS IS +QL S II F +C+ NG Y+ G D
Sbjct: 281 LSSPANTDGILGLSNAAISLPTQLASQGIISNVFGHCIAADPSNG----GYMFLGDDYVP 336
Query: 164 RRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
R T N P N Y ++ ++ ++++N +T I DSGS TY
Sbjct: 337 RWGMTW-MPIRNGPENLYSTEVQKVNYGDQQLNVRRKAGKLTQ-----VIFDSGSSYTYL 390
Query: 224 HSDVYWKL 231
D Y L
Sbjct: 391 PHDDYTNL 398
>gi|218184944|gb|EEC67371.1| hypothetical protein OsI_34484 [Oryza sativa Indica Group]
Length = 396
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 90/365 (24%), Positives = 142/365 (38%), Gaps = 55/365 (15%)
Query: 6 IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYF-- 49
IGTP + I+D L++ +F P SS+F+ C C
Sbjct: 51 IGTPPQPASAIVDVAGELVWTQCSACRRCFKQDLPVFVPNASSTFKPEPCGTAVCESIPT 110
Query: 50 -KCVNEQCVYTMKYAD-QSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDA 107
C + C Y + T GFAA +T ++ G A A FGC + D D
Sbjct: 111 RSCSGDVCSYKGPPTQLRGNTSGFAATDTFAI-----GTATVRLA-FGCVVAS---DIDT 161
Query: 108 RDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPS 167
DG +G +GL R S ++Q+ RFSYCL P G+ + +L + +
Sbjct: 162 MDGP-SGFIGLGRTPWSLVAQMK---LTRFSYCLS-PRNTGKSSRLFLGSSAKLAGSEST 216
Query: 168 TQATKFINHPN----NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
+ A P+ N+Y LSL I N + T G ++ + S +
Sbjct: 217 STAPFIKTSPDDDGSNYYLLSLDAIRAGNTTI--------ATAQSGGILVMHTVSPFSLL 268
Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRF--PSMAFYFEDANLRI 281
Y + ++ P+P LC+ F+R P + F F+ A
Sbjct: 269 VDSAYKAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGAAALT 328
Query: 282 DGENVFIIDYENH-----FFLLAVAPHD----DLVALIGSQQQRDTRFVYDLNIDLLSFV 332
++ID +L++A + + V+++GS QQ D F+YDL + LSF
Sbjct: 329 VPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKETLSFE 388
Query: 333 KENCS 337
+CS
Sbjct: 389 PADCS 393
>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 459
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 82/325 (25%), Positives = 131/325 (40%), Gaps = 63/325 (19%)
Query: 2 VRLFIGTPSKGVLLILDTGS-------------------ALIYAIFDPRKSSSFQKINCD 42
R+++GTP + + +DTGS AL +IFDP KS+S I+C
Sbjct: 50 TRIYLGTPPQQFYVHVDTGSDVAWVNCVPCTNCKRASNVALPISIFDPEKSTSKTSISCT 109
Query: 43 HPDC---TYFKCV--NEQCVYTMKYADQSVTKGFAAHETISV--IGKGEGKAIFHGA--L 93
+C + KC + C Y+ Y D S T G+ ++ +S + G A A
Sbjct: 110 DEECYLASNSKCSFNSMSCPYSTLYGDGSSTAGYLINDVLSFNQVPSGNSTATSGTARLT 169
Query: 94 FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLG--SIIKKRFSYCLVIPLPNGEYT 151
FGC ++ G G++G + +S SQL ++ F++C L
Sbjct: 170 FGCGSNQTG------TWLTDGLVGFGQAEVSLPSQLSKQNVSVNIFAHC----LQGDNKG 219
Query: 152 SSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGG 211
S L G R P T + + Y + L +I + + P FD+ S GG
Sbjct: 220 SGTLVIGH---IREPGLVYTPIVPK-QSHYNVELLNIGVSGTNVT-TPTAFDL--SNSGG 272
Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMA 271
I+DSG+ LTY Y ++FQ A++ DC LP F F ++
Sbjct: 273 VIMDSGTTLTYLVQPAY-----------DQFQ-AKVRDC----MRSGVLPVAFQFFCTIE 316
Query: 272 FYFEDANLRIDGENVFIIDYENHFF 296
YF + L G ++ ++ +
Sbjct: 317 GYFPNVTLYFAGGAAMLLSPSSYLY 341
>gi|297597434|ref|NP_001043968.2| Os01g0696800 [Oryza sativa Japonica Group]
gi|255673588|dbj|BAF05882.2| Os01g0696800 [Oryza sativa Japonica Group]
Length = 334
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 87/336 (25%), Positives = 130/336 (38%), Gaps = 54/336 (16%)
Query: 22 ALIYAIFDPRKSSSFQKINCDHPDCTYFK---CVN--------EQCVYTMKYAD----QS 66
AL+ + P SSS + C C C N C Y Y +
Sbjct: 9 ALMLPLLYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGGSGSGNCSYHYAYGNARDTHH 68
Query: 67 VTKGFAAHETISVIGKGEGKAIFHGALFGCS-NDNHGFDEDARDGALAGVLGLSRVTISF 125
T+G ET + G+ A F G FGC+ GF G +G++GL R +S
Sbjct: 69 YTEGILMTETFTF---GDDAAAFPGIAFGCTLRSEGGF------GTGSGLVGLGRGKLSL 119
Query: 126 ISQLG-SIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPN----NF 180
++QL R S L P P S+ G S +T + +P F
Sbjct: 120 VTQLNVEAFGYRLSSDLSAPSP-----ISFGSLADVTGGNGDSFMSTPLLTNPVVQDLPF 174
Query: 181 YYLSLKDISIDNERMNFPPDTFDITVS-GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYF 239
YY+ L IS+ + + P TF S G GG I DSG+ LT Y + ++ +S
Sbjct: 175 YYVGLTGISVGGKLVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQM 234
Query: 240 ERFQLAQLSDCPEPIQ-----LCYFLPETFNRFPSMAFYFE-DANLRIDGENVFIIDYEN 293
FQ P P +C+ + FPSM +F+ A++ + EN ++ +
Sbjct: 235 G-FQK------PPPAANDDDLICFTGGSSTTTFPSMVLHFDGGADMDLSTEN-YLPQMQG 286
Query: 294 HFFLLA----VAPHDDLVALIGSQQQRDTRFVYDLN 325
A V + +IG+ Q D V+DL+
Sbjct: 287 QNGETARCWSVVKSSQALTIIGNIMQMDFHVVFDLS 322
>gi|88174587|gb|ABD39368.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 85/342 (24%), Positives = 144/342 (42%), Gaps = 53/342 (15%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYAIFD-------PR-----KSSSFQKINCDHPDCTY 48
++ + +GTPSK ++ +DTGS+ + + PR +S++ K++C C
Sbjct: 2 VISVGLGTPSKTQIVEIDTGSSASWVFCECDGCHTNPRTFLQSRSTTCAKVSCGTSMCLL 61
Query: 49 F----KCVNEQ----CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
C + + C + + Y D S + G +T++ + + FGC+ D+
Sbjct: 62 GGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSF----TFGCNLDS 117
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY-----TSSYL 155
G +E G + G+LG+ +S + Q S FSYCL PL E T+ Y
Sbjct: 118 FGANE---FGNVDGLLGMGAGPMSVLKQ-SSPTFDGFSYCL--PLQKSERGFFSKTTGYF 171
Query: 156 KFGTDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
G R + TK + N +++ L IS+D ER+ P F G +
Sbjct: 172 SLGKVA--TRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFS-----RKGVV 224
Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET-FNRFPSMAF 272
DSGS L+Y L ++ R A+ E + CY + P+++
Sbjct: 225 FDSGSELSYIPDRALSVLSQRIRELLLRRGAAE----EESERNCYDMRSVDEGDMPAISL 280
Query: 273 YFED-ANLRIDGENVFIID--YENHFFLLAVAPHDDLVALIG 311
+F+D A + VF+ E + LA AP + V++IG
Sbjct: 281 HFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTES-VSIIG 321
>gi|357492303|ref|XP_003616440.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517775|gb|AES99398.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 521
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 85/360 (23%), Positives = 136/360 (37%), Gaps = 55/360 (15%)
Query: 2 VRLFIGTPSKGVLLILDTGSALIYA----------IFDPRKSSSFQKINCDHPDCTYFKC 51
V L +G+P + V ++LDTGS L + IF+P SSS+ C P C
Sbjct: 38 VSLTVGSPPQRVTMVLDTGSELSWLHCKKLPNLNFIFNPLVSSSYTPTPCTSPIC----- 92
Query: 52 VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGA 111
T + D A++ +I G G +FGC + G D
Sbjct: 93 -------TTQTRDLINPVSCDANKLCHIITFFVGGPAQRGMVFGCMDT--GTSSGDEDSK 143
Query: 112 LAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQAT 171
G++G+ ++SF +Q+ +FSYC+ N + T + R T
Sbjct: 144 TTGLMGMDLGSLSFSNQMR---LPKFSYCIS----NKDSTGVLVLENIANPPRLGPLHYT 196
Query: 172 KFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKL 231
+ Y + ++ F PD +G G ++DS + T+ VY L
Sbjct: 197 PLVKKTTPLPYFNRNCCLF--QKSAFLPDH-----TGAGQTMVDSATQFTFLRQPVYTAL 249
Query: 232 HEKFVSYFERFQLAQLSDCPE-----PIQLCYFLP--ETFNRFPSMAFYFEDANLRIDGE 284
+F + L L D P+ + LC+ +P T P + F+ A LR+ GE
Sbjct: 250 KNEFAIQTKNI-LTPLGD-PKFVFQGVMDLCFRVPIGSTLPVLPVVTLMFDGAELRVTGE 307
Query: 285 NVFI----IDYENHFFLLAVAPHDDLVA----LIGSQQQRDTRFVYDLNIDLLSFVKENC 336
+ + N + + DL+ +IG QR+ YDL + F NC
Sbjct: 308 RLLYKVSNVAKSNSWIYCFTFGNSDLLGIEAFIIGHHHQRNVWMEYDLANSRIGFSDTNC 367
>gi|384252236|gb|EIE25712.1| acid protease [Coccomyxa subellipsoidea C-169]
Length = 599
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 66/267 (24%), Positives = 115/267 (43%), Gaps = 56/267 (20%)
Query: 4 LFIGTPSKGVLLILDTGSALIY----------------AIFDPRKSSSFQKINCDHPDCT 47
L +GTP++ +I+DTGS + Y A FDP SSS I CD C
Sbjct: 66 LHLGTPARQFAVIVDTGSTITYVPCASCGRNCGPHHKDAAFDPASSSSSAVIGCDSDKCI 125
Query: 48 YFK----CVNE-QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGA---LFGCSND 99
+ C + +C Y YA+QS + G + + + GA +FGC
Sbjct: 126 CGRPPCGCSEKRECTYQRTYAEQSSSAGLLVSDQLQ---------LRDGAVEVVFGCETK 176
Query: 100 NHG--FDEDARDGALAGVLGLSRVTISFISQLG--SIIKKRFSYCLVIPLPNGEYTSSYL 155
G ++++A G+LGL +S ++QL +I F+ C + E + +
Sbjct: 177 ETGEIYNQEAD-----GILGLGNSEVSLVNQLAGSGVIDDVFALC----FGSVEGDGALM 227
Query: 156 KFGTDMGYRRPSTQATKFIN---HPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEG-G 211
D + Q T ++ HP ++Y + L+ + + +++ P+ ++ EG G
Sbjct: 228 LGDVDAAEYDVALQYTALLSSLAHP-HYYSVQLEALWVGGQQLPVKPERYE-----EGYG 281
Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSY 238
++DSG+ TY S+ + E +Y
Sbjct: 282 TVLDSGTTFTYLPSEAFQLFKEAVSAY 308
>gi|88174579|gb|ABD39364.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
gi|88174585|gb|ABD39367.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
gi|88174595|gb|ABD39372.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174599|gb|ABD39374.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174607|gb|ABD39378.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 84/342 (24%), Positives = 143/342 (41%), Gaps = 53/342 (15%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYAIFD-------PR-----KSSSFQKINCDHPDCTY 48
++ + +GTP+K ++ +DTGS+ + + PR +S++ K++C C
Sbjct: 2 VISVGLGTPAKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQSRSTTCAKVSCGTSMCLL 61
Query: 49 F----KCVNEQ----CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
C + + C + + Y D S + G +T++ G FGC+ D+
Sbjct: 62 GGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF----SDVQKIPGFSFGCNMDS 117
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY-----TSSYL 155
G +E G + G+LG+ +S + Q S FSYCL PL E T+ Y
Sbjct: 118 FGANEF---GNVDGLLGMGAGPMSVLKQ-SSPTFDCFSYCL--PLQKSERGFFSKTTGYF 171
Query: 156 KFGTDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
G R + TK + N +++ L IS+D ER+ P F G +
Sbjct: 172 SLGKVA--TRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFS-----RKGVV 224
Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET-FNRFPSMAF 272
DSGS L+Y L ++ + A+ E + CY + P+++
Sbjct: 225 FDSGSELSYIPDRALSVLSQRIRELLLKRGAAE----EESERNCYDMRSVDEGDMPAISL 280
Query: 273 YFED-ANLRIDGENVFIID--YENHFFLLAVAPHDDLVALIG 311
+F+D A + VF+ E + LA AP + V++IG
Sbjct: 281 HFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTES-VSIIG 321
>gi|88174569|gb|ABD39359.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
Length = 321
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 84/342 (24%), Positives = 144/342 (42%), Gaps = 53/342 (15%)
Query: 1 MVRLFIGTPSKGVLLILDTGSALIYAIFD-------PR-----KSSSFQKINCDHPDCTY 48
++ + +GTP+K ++ +DTGS+ + + PR +S++ K++C C
Sbjct: 2 VISVGLGTPAKTQIVEIDTGSSTTWVFCECDGCHTNPRTFLQSRSTTCAKVSCGTSMCLL 61
Query: 49 F----KCVNEQ----CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
C + + C + + Y D S + G +T++ + + FGC+ D+
Sbjct: 62 GGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSF----TFGCNLDS 117
Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY-----TSSYL 155
G +E G + G+LG+ +S + Q S FSYCL PL E T+ Y
Sbjct: 118 FGANEF---GNVDGLLGMGAGPMSVLKQ-SSPTFDGFSYCL--PLQKSERGFFSKTTGYF 171
Query: 156 KFGTDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
G R + TK + N +++ L IS+D ER+ P F G +
Sbjct: 172 SLGKVA--TRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFS-----RKGVV 224
Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET-FNRFPSMAF 272
DSGS L+Y L ++ R A+ E + CY + P+++
Sbjct: 225 FDSGSELSYIPDRALSVLSQRIRELLLRRGAAE----EESERNCYDMRSVDEGDMPAISL 280
Query: 273 YFED-ANLRIDGENVFIID--YENHFFLLAVAPHDDLVALIG 311
+F+D A + VF+ E + LA AP + V++IG
Sbjct: 281 HFDDGARFDLGSRGVFVERSVQEQDVWCLAFAPTES-VSIIG 321
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.324 0.141 0.432
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,728,849,627
Number of Sequences: 23463169
Number of extensions: 251234616
Number of successful extensions: 437664
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 333
Number of HSP's successfully gapped in prelim test: 1354
Number of HSP's that attempted gapping in prelim test: 432420
Number of HSP's gapped (non-prelim): 1940
length of query: 341
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 198
effective length of database: 9,003,962,200
effective search space: 1782784515600
effective search space used: 1782784515600
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 77 (34.3 bits)