BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 038667
         (411 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 448

 Score =  357 bits (916), Expect = 6e-96,   Method: Compositional matrix adjust.
 Identities = 187/417 (44%), Positives = 260/417 (62%), Gaps = 13/417 (3%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRS---HNTYQAQILPSNDIS 57
           LIH  SILSPY NPN + A R +R +  S  R+AYL  +I+     N ++  +LPS    
Sbjct: 38  LIHWGSILSPYFNPNASVAERAERIVKTSATRIAYLYAQIKGDIHMNDFELNLLPST--Y 95

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
             LF VN S+GQP  PQ   MDTGS++LWV C PC+ C+ Q G +  PS+SS+YA +PC 
Sbjct: 96  EPLFLVNFSMGQPATPQLAIMDTGSNILWVRCAPCKRCTQQNGPLLDPSKSSTYASLPCT 155

Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
           +  C Y P A C+   ++C Y   Y  G  ++  ++TEQL F ++DE    V  VVFGC 
Sbjct: 156 NTMCHYAPSAYCNRL-NQCGYNLSYATGLSSAGVLATEQLIFHSSDEGVNAVPSVVFGCS 214

Query: 178 FSTN--RNFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
                 ++ +F+G+FGLG G +S V+++ S FSYC+G++ DP Y +N+L+ G+ A   EG
Sbjct: 215 HENGDYKDRRFTGVFGLGKGITSFVTRMGSKFSYCLGNIADPHYGYNQLVFGEKANF-EG 273

Query: 236 DATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEA 295
            +TPL+ ++GHYY+TLE ISV  + LDI+   F    +    +IDSGT +TWL + A+ A
Sbjct: 274 YSTPLKVVNGHYYVTLEGISVGEKRLDIDSTAFSMKGNEKSALIDSGTALTWLAESAFRA 333

Query: 296 LRDEVMIRLEGEQMRSYSWPDKL-CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQ 354
           L +EV   L+G  M    W     CY G +S DL GFP V FHF GGA L L+ +SMFYQ
Sbjct: 334 LDNEVRQLLDGVLMP--FWRGSFACYKGTVSQDLIGFPVVTFHFSGGADLDLDTESMFYQ 391

Query: 355 PRPDAFCMAVNPASI-NNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVLDD 410
             PD  C+AV  AS   N + +FS+IG+MAQQ+YN+ YD+   +  FQR+DC++L D
Sbjct: 392 ATPDILCIAVRQASAYGNDFKSFSVIGLMAQQYYNMAYDLNSNKLFFQRIDCQLLVD 448


>gi|224091849|ref|XP_002309371.1| predicted protein [Populus trichocarpa]
 gi|222855347|gb|EEE92894.1| predicted protein [Populus trichocarpa]
          Length = 438

 Score =  330 bits (847), Expect = 6e-88,   Method: Compositional matrix adjust.
 Identities = 185/418 (44%), Positives = 254/418 (60%), Gaps = 28/418 (6%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSH---NTYQAQILPSNDIS 57
           LIHRDSI+SPY   N+  A R +R +  S+ARL+YL  KI      N     + PS   S
Sbjct: 41  LIHRDSIVSPYYRSNDTVADRTERTMKASLARLSYLYAKIERDFDINDLWLNLHPS--AS 98

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ-LGTIFYPSRSSSYAYVPC 116
             LF VN S+GQPPVPQ   MDTGSSLLW+ C PC+ CS Q +G +F PS SS+Y  + C
Sbjct: 99  EPLFLVNFSMGQPPVPQLAIMDTGSSLLWIQCAPCKSCSQQIIGPMFDPSISSTYDSLSC 158

Query: 117 DSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
            +  CRY P   C +   +C+Y Q Y+ G  +   ++TEQL F ++DE    V +V+FGC
Sbjct: 159 KNIICRYAPSGECDS-SSQCVYNQTYVEGLPSVGVIATEQLIFGSSDEGRNAVNNVLFGC 217

Query: 177 GFSTNRNFK---FSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIID 233
               N N+K   F+G+FGLG G +S+V+Q+ S FSYCIG++ DPDY +N+L+L +G  + 
Sbjct: 218 SHR-NGNYKDRRFTGVFGLGSGITSVVNQMGSKFSYCIGNIADPDYSYNQLVLSEGVNM- 275

Query: 234 EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAY 293
           EG +TPL  +DGHY + LE ISV    L I+P+ FKR +    V+IDSGT  TWL +  Y
Sbjct: 276 EGYSTPLDVVDGHYQVILEGISVGETRLVIDPSAFKRTEKQRRVIIDSGTAPTWLAENEY 335

Query: 294 EALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFY 353
            AL  EV   L+   +  +     LCY G +  DL GFP V FHF  GA L ++ +    
Sbjct: 336 RALEREVRNLLD-RFLTPFMRESFLCYKGKVGQDLVGFPAVTFHFAEGADLVVDTE---- 390

Query: 354 QPRPDAFCMAVNPASINNR-YVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVLDD 410
                     +  AS+  + + +FS+IG+MAQQ+YNV YD+ + +  FQR+DCE+LD+
Sbjct: 391 ----------MRQASVYGKDFKDFSVIGLMAQQYYNVAYDLNKHKLFFQRIDCELLDE 438


>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 430

 Score =  330 bits (845), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 180/427 (42%), Positives = 258/427 (60%), Gaps = 23/427 (5%)

Query: 1   LIHRDSILSPYNNPNE----NPAHRVQRGINISIARLAYLQEKI-RSHNTYQAQILPSND 55
           LI R+S++   +NP+      P   +Q   +IS AR  YLQ  I +   +   Q+     
Sbjct: 5   LIRRESVVR--HNPDARVPVTPEDHIQHMTDISSARFKYLQNSIVKELGSSDFQVDVHQA 62

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ--LGTIFYPSRSSSYAY 113
           I  SLF+VN S+GQPPVPQFT MDTGSSLLW+ C+PC+ CS    +  +F P+ SS++  
Sbjct: 63  IKTSLFFVNFSVGQPPVPQFTIMDTGSSLLWIQCHPCKHCSSNHMIHPVFNPALSSTFVE 122

Query: 114 VPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
             CD   CRY P   CS+  ++C+Y Q+Y+ G  +   ++ E+LTF   + +T+  Q + 
Sbjct: 123 CSCDDRFCRYAPNGHCSS--NKCVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQPIA 180

Query: 174 FGCGFSTNRNF--KFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAI 231
           FGCG         +F+GI GLG   +SL  QL S FSYCIG L + +Y +N+L+LG+ A 
Sbjct: 181 FGCGHENGEQLESEFTGILGLGAKPTSLAVQLGSKFSYCIGDLANKNYGYNQLVLGEDAD 240

Query: 232 IDEGDATPLQF--IDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLV 289
           I  GD TP++F   +G YY+ LE ISV  + L+I P +FKR  S  GV++D+GT  TWL 
Sbjct: 241 I-LGDPTPIEFETENGIYYMNLEGISVGDKQLNIEPVVFKRRGSRTGVILDTGTLYTWLA 299

Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKD 349
             AY  L +E+   L+  ++  + + D LCYHG ++ +L GFP V FHF GGA+LA+E  
Sbjct: 300 DIAYRELYNEIKSILD-PKLERFWFRDFLCYHGRVNEELIGFPVVTFHFAGGAELAMEAT 358

Query: 350 SMFYQPRP-----DAFCMAVNPASIN-NRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
           SMFY         + FCM+V P + +   Y +F+ IG+MAQQ+YN+ YD+  +    QR+
Sbjct: 359 SMFYPMTESDTYHNVFCMSVRPTTEHGGEYKDFTAIGLMAQQYYNIAYDLKERNIYLQRI 418

Query: 404 DCEVLDD 410
           DC +LDD
Sbjct: 419 DCVLLDD 425


>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
          Length = 408

 Score =  327 bits (838), Expect = 7e-87,   Method: Compositional matrix adjust.
 Identities = 185/417 (44%), Positives = 253/417 (60%), Gaps = 28/417 (6%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           LIH+DSILS Y + + N   R +        R A++ ++I      QA ++   D     
Sbjct: 13  LIHQDSILSSYQSLDRNNVERRR------TRRAAFITDEI------QANMVA--DDRGQA 58

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           F VN S+G+PPVPQ   +DTGS LLWV C PC DC  Q   IF PS+SS+Y  +  DS  
Sbjct: 59  FLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDLSYDSPI 118

Query: 121 CRYFPYARCSAYKH--RCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
           C   P  +   Y H  +CIY   Y  G  +S  ++TE + F+ +D+ T+ V  VVFGCG 
Sbjct: 119 CPNSPQKK---YNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCGH 175

Query: 179 STNRNF--KFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGD 236
           S    F  + SGI GL  G  S+VS+L S FSYCIG L DP Y HN+L+LGDG  + EG 
Sbjct: 176 SNRGRFDGQQSGILGLSAGDQSIVSRLGSRFSYCIGDLFDPHYTHNQLVLGDGVKM-EGS 234

Query: 237 ATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYEA 295
           +TP    +G YY+TLE ISV    LDINP +F+R +SG GGV++DSGT  T+L K+ ++ 
Sbjct: 235 STPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTATFLAKDGFDP 294

Query: 296 LRDEV--MIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFY 353
           L +E+  ++R   +Q+   + P  LCY G ++ DL+GFP + FHF  GA L L+ +S+F 
Sbjct: 295 LSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHFAEGADLVLDANSLFV 354

Query: 354 QPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVLDD 410
           Q   D FC+AV  +++ N     S+IG+MAQQ YNV YD+  K+  FQR DCE+L+D
Sbjct: 355 QKNQDVFCLAVLESNLKNIG---SVIGIMAQQHYNVAYDLIGKRVYFQRTDCELLED 408


>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 440

 Score =  327 bits (838), Expect = 7e-87,   Method: Compositional matrix adjust.
 Identities = 185/417 (44%), Positives = 253/417 (60%), Gaps = 28/417 (6%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           LIH+DSILS Y + + N   R +        R A++ ++I      QA ++   D     
Sbjct: 45  LIHQDSILSSYQSLDRNNVERRR------TRRAAFITDEI------QANMVA--DDRGQA 90

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           F VN S+G+PPVPQ   +DTGS LLWV C PC DC  Q   IF PS+SS+Y  +  DS  
Sbjct: 91  FLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDLSYDSPI 150

Query: 121 CRYFPYARCSAYKH--RCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
           C   P  +   Y H  +CIY   Y  G  +S  ++TE + F+ +D+ T+ V  VVFGCG 
Sbjct: 151 CPNSPQKK---YNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCGH 207

Query: 179 STNRNF--KFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGD 236
           S    F  + SGI GL  G  S+VS+L S FSYCIG L DP Y HN+L+LGDG  + EG 
Sbjct: 208 SNRGRFDGQQSGILGLSAGDQSIVSRLGSRFSYCIGDLFDPHYTHNQLVLGDGVKM-EGS 266

Query: 237 ATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYEA 295
           +TP    +G YY+TLE ISV    LDINP +F+R +SG GGV++DSGT  T+L K+ ++ 
Sbjct: 267 STPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTATFLAKDGFDP 326

Query: 296 LRDEV--MIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFY 353
           L +E+  ++R   +Q+   + P  LCY G ++ DL+GFP + FHF  GA L L+ +S+F 
Sbjct: 327 LSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHFAEGADLVLDANSLFV 386

Query: 354 QPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVLDD 410
           Q   D FC+AV  +++ N     S+IG+MAQQ YNV YD+  K+  FQR DCE+L+D
Sbjct: 387 QKNQDVFCLAVLESNLKNIG---SVIGIMAQQHYNVAYDLIGKRVYFQRTDCELLED 440


>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
          Length = 408

 Score =  327 bits (837), Expect = 9e-87,   Method: Compositional matrix adjust.
 Identities = 185/417 (44%), Positives = 253/417 (60%), Gaps = 28/417 (6%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           LIH+DSILS Y + + N   R +        R A++ ++I      QA ++   D     
Sbjct: 13  LIHQDSILSSYQSLDRNNVERRR------TRRAAFIXDEI------QANMVA--DDRGQA 58

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           F VN S+G+PPVPQ   +DTGS LLWV C PC DC  Q   IF PS+SS+Y  +  DS  
Sbjct: 59  FLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDLSYDSPI 118

Query: 121 CRYFPYARCSAYKH--RCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
           C   P  +   Y H  +CIY   Y  G  +S  ++TE + F+ +D+ T+ V  VVFGCG 
Sbjct: 119 CPNSPQKK---YNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCGH 175

Query: 179 STNRNF--KFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGD 236
           S    F  + SGI GL  G  S+VS+L S FSYCIG L DP Y HN+L+LGDG  + EG 
Sbjct: 176 SNRGRFDGQQSGILGLSAGDQSIVSRLGSRFSYCIGDLFDPHYTHNQLVLGDGVKM-EGS 234

Query: 237 ATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYEA 295
           +TP    +G YY+TLE ISV    LDINP +F+R +SG GGV++DSGT  T+L K+ ++ 
Sbjct: 235 STPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTATFLAKDGFDP 294

Query: 296 LRDEV--MIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFY 353
           L +E+  ++R   +Q+   + P  LCY G ++ DL+GFP + FHF  GA L L+ +S+F 
Sbjct: 295 LSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHFAEGADLVLDANSLFV 354

Query: 354 QPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVLDD 410
           Q   D FC+AV  +++ N     S+IG+MAQQ YNV YD+  K+  FQR DCE+L+D
Sbjct: 355 QKNQDVFCLAVLESNLKNIG---SVIGIMAQQHYNVAYDLIGKRVYFQRTDCELLED 408


>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
 gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 458

 Score =  327 bits (837), Expect = 9e-87,   Method: Compositional matrix adjust.
 Identities = 181/424 (42%), Positives = 251/424 (59%), Gaps = 17/424 (4%)

Query: 1   LIHRDSI--LSPYNNPNENPAHRVQRGINISIARLAYLQEKI-RSHNTYQAQILPSNDIS 57
           LIHR+S+  L+P       P   ++   +IS AR  YLQ  I +   +   Q+     I 
Sbjct: 33  LIHRESVARLNPNARVPITPEDHIKHLTDISSARFKYLQNSIDKELGSSNFQVDVEQAIK 92

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSP--QLGTIFYPSRSSSYAYVP 115
            SLF VN S+GQPPVPQ T MDTGSSLLW+ C PC+ CS    +  +F P+ SS++    
Sbjct: 93  TSLFLVNFSVGQPPVPQLTIMDTGSSLLWIQCQPCKHCSSDHMIHPVFNPALSSTFVECS 152

Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
           CD   CRY P   C +  ++C+Y Q+Y+ G  +   ++ E+LTF   + +T+  Q + FG
Sbjct: 153 CDDRFCRYAPNGHCGS-SNKCVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQPIAFG 211

Query: 176 CGFSTNRNFK--FSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIID 233
           CG+      +  F+GI GLG   +SL  QL S FSYCIG L + +Y +N+L+LG+ A I 
Sbjct: 212 CGYENGEQLESHFTGILGLGAKPTSLAVQLGSKFSYCIGDLANKNYGYNQLVLGEDADI- 270

Query: 234 EGDATPLQFIDGH--YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKE 291
            GD TP++F   +  YY+ LE ISV    L+I P +FKR     GV++DSGT  TWL   
Sbjct: 271 LGDPTPIEFETENSIYYMNLEGISVGDTQLNIEPVVFKRRGPRTGVILDSGTLYTWLADI 330

Query: 292 AYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSM 351
           AY  L +E+   L+  ++  + + D LCYHG +S +L GFP V FHF GGA+LA+E  SM
Sbjct: 331 AYRELYNEIKSILD-PKLERFWFRDFLCYHGRVSEELIGFPVVTFHFAGGAELAMEATSM 389

Query: 352 FY---QPRP-DAFCMAVNPASIN-NRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
           FY   +P   + FCM+V P   +   Y  F+ IG+MAQQ+YN+GYD+  K    QR+DC 
Sbjct: 390 FYPLSEPNTFNVFCMSVKPTKEHGGEYKEFTAIGLMAQQYYNIGYDLKEKNIYLQRIDCV 449

Query: 407 VLDD 410
            LDD
Sbjct: 450 QLDD 453


>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 456

 Score =  312 bits (800), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 174/421 (41%), Positives = 252/421 (59%), Gaps = 17/421 (4%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRS----HNTYQAQILPSNDI 56
           LIHR+S L P  + NE    R +R    SI R  +L+ KI+      N  ++ ++P N  
Sbjct: 42  LIHRNSYLHPLYDQNETVEDRSKREQTSSIERFDFLESKIKELKSVGNEARSSLIPFN-- 99

Query: 57  SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPC 116
             S F VN+SIG PPV Q   +DTGSSLLWV C PC +C  Q  + F P +S S+  + C
Sbjct: 100 RGSGFLVNLSIGSPPVTQLVVVDTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTLGC 159

Query: 117 DSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
                 Y    +C+ + ++  Y   YL G  +   ++ E L F+  DE  I   ++ FGC
Sbjct: 160 GFPGYNYINGYKCNRF-NQAEYKLRYLGGDSSQGILAKESLLFETLDEGKIKKSNITFGC 218

Query: 177 G---FSTNRNFKFSGIFGLGI-GRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAII 232
           G     TN +  ++G+FGLG     ++ +QL + FSYCIG +++P Y HN L+LG G+ I
Sbjct: 219 GHMNIKTNNDDAYNGVFGLGAYPHITMATQLGNKFSYCIGDINNPLYTHNHLVLGQGSYI 278

Query: 233 DEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKE 291
            EGD+TPLQ   GHYY+TL++ISV  + L I+PN FK   D  GGV+IDSG   T L   
Sbjct: 279 -EGDSTPLQIHFGHYYVTLQSISVGSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANG 337

Query: 292 AYEALRDEVMIRLEG--EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKD 349
            +E L DE++  ++G  E++ +    + LC+ G++S DL GFP V FHF GGA L LE  
Sbjct: 338 GFELLYDEIVDLMKGLLERIPTQRKFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESG 397

Query: 350 SMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVLD 409
           S+F Q   D FC+A+ P+  N+  +N S+IG++AQQ YNVG+D+ + +  F+R+DC++LD
Sbjct: 398 SLFRQHGGDRFCLAILPS--NSELLNLSVIGILAQQNYNVGFDLEQMKVFFRRIDCQLLD 455

Query: 410 D 410
           +
Sbjct: 456 E 456


>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 446

 Score =  312 bits (799), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 182/421 (43%), Positives = 245/421 (58%), Gaps = 34/421 (8%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           LIH +S LSPYN+ +    H   + +  + +            N Y + ++PS      +
Sbjct: 47  LIHHESSLSPYNSKDTIWDHYSHKILKQTFS------------NDYISNLVPSPRYV--V 92

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           F +N SIG+PP+PQ   MDTGSSL WV C+PC  CS Q   IF PS+SS+Y+ + C    
Sbjct: 93  FLMNFSIGEPPIPQLAVMDTGSSLTWVMCHPCSSCSQQSVPIFDPSKSSTYSNLSC--SE 150

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG--F 178
           C      +C      C Y+  Y+    +    + EQLT +  DES I V  ++FGCG  F
Sbjct: 151 CN-----KCDVVNGECPYSVEYVGSGSSQGIYAREQLTLETIDESIIKVPSLIFGCGRKF 205

Query: 179 STNRNF----KFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
           S + N       +G+FGLG GR SL+      FSYCIG+L + +Y  N+L+LGD A + +
Sbjct: 206 SISSNGYPYQGINGVFGLGSGRFSLLPSFGKKFSYCIGNLRNTNYKFNRLVLGDKANM-Q 264

Query: 235 GDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRD--DSGGGVMIDSGTDVTWLVKEA 292
           GD+T L  I+G YY+ LEAIS+ GR LDI+P +F+R   D+  GV+IDSG D TWL K  
Sbjct: 265 GDSTTLNVINGLYYVNLEAISIGGRKLDIDPTLFERSITDNNSGVIIDSGADHTWLTKYG 324

Query: 293 YEALRDEVMIRLEGEQMRSYS---WPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKD 349
           +E L  EV   LEG  + +      P  LCY G++S DL GFP V FHF  GA L L+  
Sbjct: 325 FEVLSFEVENLLEGVLVLAQQDKHNPYTLCYSGVVSQDLSGFPLVTFHFAEGAVLDLDVT 384

Query: 350 SMFYQPRPDAFCMAVNPAS-INNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
           SMF Q   + FCMA+ P +   + Y +FS IGM+AQQ YNVGYD+ R +  FQR+DCE+L
Sbjct: 385 SMFIQTTENEFCMAMLPGNYFGDDYESFSSIGMLAQQNYNVGYDLNRMRVYFQRIDCELL 444

Query: 409 D 409
           D
Sbjct: 445 D 445


>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 446

 Score =  310 bits (795), Expect = 7e-82,   Method: Compositional matrix adjust.
 Identities = 180/420 (42%), Positives = 251/420 (59%), Gaps = 26/420 (6%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRS----HNTYQAQILPSNDI 56
           LIH  S+  P+  PNE    R++  I  S ARLAY+Q +I      +N Y A + PS  +
Sbjct: 39  LIHPGSVHHPHYKPNETAKDRMELDIEHSAARLAYIQARIEGSLVYNNDYTASVSPS--L 96

Query: 57  SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYA---Y 113
           +     VN+SIGQP +PQ   MDTGS +LW+ C PC +C   LG +F PS SS+++    
Sbjct: 97  TGRTILVNLSIGQPSIPQLVVMDTGSDILWIMCNPCTNCDNHLGLLFDPSMSSTFSPLCK 156

Query: 114 VPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
            PC  + C+  P            +T  Y+     S     + L F+ TDE T  + DV+
Sbjct: 157 TPCGFKGCKCDPIP----------FTISYVDNSSASGTFGRDILVFETTDEGTSQISDVI 206

Query: 174 FGCGFST--NRNFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAI 231
            GCG +   N +  ++GI GL  G +SL +Q+   FSYCIG+L DP Y +N+L LG+GA 
Sbjct: 207 IGCGHNIGFNSDPGYNGILGLNNGPNSLATQIGRKFSYCIGNLADPYYNYNQLRLGEGAD 266

Query: 232 IDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVK 290
           + EG +TP +   G YY+T+E ISV  + LDI    F+   +G GGV++DSGT +T+LV 
Sbjct: 267 L-EGYSTPFEVYHGFYYVTMEGISVGEKRLDIALETFEMKRNGTGGVILDSGTTITYLVD 325

Query: 291 EAYEALRDEV--MIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEK 348
            A++ L +EV  +++    Q+   + P KLCY+GI+S DL GFP V FHF  GA LAL+ 
Sbjct: 326 SAHKLLYNEVRNLLKWSFRQVIFENAPWKLCYYGIISRDLVGFPVVTFHFVDGADLALDT 385

Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
            S F Q R D FCM V+PASI N  ++ S+IG++AQQ YNVGYD+  +   FQR+DCE+L
Sbjct: 386 GSFFSQ-RDDIFCMTVSPASILNTTISPSVIGLLAQQSYNVGYDLVNQFVYFQRIDCELL 444


>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 447

 Score =  306 bits (783), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 176/420 (41%), Positives = 243/420 (57%), Gaps = 25/420 (5%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIR----SHNTYQAQILPSNDI 56
           LIH  S+  P+  PNE    R++  I  S ARLA +Q +I     S+N Y+A++ PS  +
Sbjct: 39  LIHPGSVHHPHYKPNETAKDRMELDIQHSAARLANIQARIEGSLVSNNDYKARVSPS--L 96

Query: 57  SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYA---Y 113
           +      NISIGQPP+PQ   MDTGS +LWV C PC +C   LG +F PS+SS+++    
Sbjct: 97  TGRTIMANISIGQPPIPQLVVMDTGSDILWVMCTPCTNCDNDLGLLFDPSKSSTFSPLCK 156

Query: 114 VPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
            PCD E CR  P            +T  Y      S     + + F+ TDE T  + DV+
Sbjct: 157 TPCDFEGCRCDPIP----------FTVTYADNSTASGTFGRDTVVFETTDEGTSRISDVL 206

Query: 174 FGCGFSTNRNFK--FSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAI 231
           FGCG +   +     +GI GL  G  SLV++L   FSYCIG+L DP Y +++LILG+GA 
Sbjct: 207 FGCGHNIGHDTDPGHNGILGLNNGPDSLVTKLGQKFSYCIGNLADPYYNYHQLILGEGAD 266

Query: 232 IDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVK 290
           + EG +TP +  +G YY+T+E ISV  + LDI P  F+ +++  GGV+ID+G+ +T+LV 
Sbjct: 267 L-EGYSTPFEVYNGFYYVTMEGISVGEKRLDIAPETFEMKENRAGGVIIDTGSTITFLVD 325

Query: 291 EAYEALRDEV--MIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEK 348
             ++ L  EV  ++     Q      P   C++G +S DL GFP V FHF  GA LAL+ 
Sbjct: 326 SVHKLLSKEVRNLLGWSFRQATIEKSPWMQCFYGSISRDLVGFPVVTFHFSDGADLALDS 385

Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
            S F Q   + FCM V P S  N     SLIG++AQQ YNVGYD+  +   FQR+DCE+L
Sbjct: 386 GSFFNQLNDNVFCMTVGPVSSLNIKSKPSLIGLLAQQSYNVGYDLVNQFVYFQRIDCELL 445


>gi|449457263|ref|XP_004146368.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 469

 Score =  305 bits (781), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 175/434 (40%), Positives = 253/434 (58%), Gaps = 30/434 (6%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRS----HNTYQAQILPSNDI 56
           LIHR+S L P  + NE    R +R    SI R  +L+ KI+      N  ++ ++P N  
Sbjct: 42  LIHRNSYLHPLYDQNETVEDRSKREQTSSIERFDFLESKIKELKSVGNEARSSLIPFN-- 99

Query: 57  SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPC 116
             S F VN+SIG PPV Q   +DTGSSLLWV C PC +C  Q  + F P +S S+  + C
Sbjct: 100 RGSGFLVNLSIGSPPVTQLVVVDTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTLGC 159

Query: 117 DSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDE------------ 164
                 Y    +C+ + ++  Y   YL G  +   ++ E L F+  DE            
Sbjct: 160 GFPGYNYINGYKCNRF-NQAEYKLRYLGGDSSQGILAKESLLFETLDEGRVFQYNAISTQ 218

Query: 165 -STIHVQDVVFGCG---FSTNRNFKFSGIFGLGI-GRSSLVSQLNSSFSYCIGSLHDPDY 219
            S I   ++ FGCG     TN +  ++G+FGLG     ++ +QL + FSYCIG +++P Y
Sbjct: 219 ISKIKKSNITFGCGHMNIKTNNDDAYNGVFGLGAYPHITMATQLGNKFSYCIGDINNPLY 278

Query: 220 LHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVM 278
            HN L+LG G+ I EGD+TPLQ   GHYY+TL++ISV  + L I+PN FK   D  GGV+
Sbjct: 279 THNHLVLGQGSYI-EGDSTPLQIHFGHYYVTLQSISVGSKTLKIDPNAFKISSDGSGGVL 337

Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEG--EQMRSYSWPDKLCYHGIMSSDLKGFPTVRF 336
           IDSG   T L    +E L DE++  ++G  E++ +    + LC+ G++S DL GFP V F
Sbjct: 338 IDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQRKFEGLCFKGVVSRDLVGFPAVTF 397

Query: 337 HFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
           HF GGA L LE  S+F Q   D FC+A+ P+  N+  +N S+IG++AQQ YNVG+D+ + 
Sbjct: 398 HFAGGADLVLESGSLFRQHGGDRFCLAILPS--NSELLNLSVIGILAQQNYNVGFDLEQM 455

Query: 397 QKTFQRMDCEVLDD 410
           +  F+R+DC++LD+
Sbjct: 456 KVFFRRIDCQLLDE 469


>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 449

 Score =  297 bits (761), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 172/420 (40%), Positives = 240/420 (57%), Gaps = 24/420 (5%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIR----SHNTYQAQILPSNDI 56
           LIH  S+  P+  PNE    R++  I  S AR AY+Q +I     S+N Y+A++ PS  +
Sbjct: 39  LIHPGSVHHPHYKPNETAKDRMELDIQHSAARFAYIQARIEGSLVSNNEYKARVSPS--L 96

Query: 57  SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYA---Y 113
           +      NISIGQPP+PQ   MDTGS +LWV C PC +C   LG +F PS SS+++    
Sbjct: 97  TGRTIMANISIGQPPIPQLVVMDTGSDILWVMCTPCTNCDNHLGLLFDPSMSSTFSPLCK 156

Query: 114 VPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
            PCD + C     +RC        +T  Y      S     + + F+ TDE T  + DV+
Sbjct: 157 TPCDFKGC-----SRCDPIP----FTVTYADNSTASGMFGRDTVVFETTDEGTSRIPDVL 207

Query: 174 FGCGFSTNRNFK--FSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAI 231
           FGCG +  ++     +GI GL  G  SL +++   FSYCIG L DP Y +++LILG+GA 
Sbjct: 208 FGCGHNIGQDTDPGHNGILGLNNGPDSLATKIGQKFSYCIGDLADPYYNYHQLILGEGAD 267

Query: 232 IDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVK 290
           + EG +TP +  +G YY+T+E ISV  + LDI P  F+ + +  GGV+ID+G+ +T+LV 
Sbjct: 268 L-EGYSTPFEVHNGFYYVTMEGISVGEKRLDIAPETFEMKKNRTGGVIIDTGSTITFLVD 326

Query: 291 EAYEALRDEV--MIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEK 348
             +  L  EV  ++     Q      P   C++G +S DL GFP V FHF  GA LAL+ 
Sbjct: 327 SVHRLLSKEVRNLLGWSFRQTTIEKSPWMQCFYGSISRDLVGFPVVTFHFADGADLALDS 386

Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
            S F Q   + FCM V P S  N     SLIG++AQQ Y+VGYD+  +   FQR+DCE+L
Sbjct: 387 GSFFNQLNDNVFCMTVGPVSSLNLKSKPSLIGLLAQQSYSVGYDLVNQFVYFQRIDCELL 446


>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 434

 Score =  281 bits (718), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 160/357 (44%), Positives = 210/357 (58%), Gaps = 19/357 (5%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           F  NISIG PPVPQ   +DTGS L W+ C PC+ C PQ    F+PSRSS+Y    C+S  
Sbjct: 88  FLANISIGDPPVPQLLLIDTGSDLTWIQCLPCK-CYPQTIPFFHPSRSSTYRNASCESA- 145

Query: 121 CRYFPYARCSAYKHR----CIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
               P+A    ++      C Y   Y     T   ++ E+LTF+ +DE  I   ++VFGC
Sbjct: 146 ----PHAMPQIFRDEKTGNCRYHLRYRDFSNTRGILAKEKLTFQTSDEGLISKPNIVFGC 201

Query: 177 GFSTNRNFKFSGIFGLGIGRSSLVSQ-LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
           G   +   ++SG+ GLG G  S+V++   S FSYC GSL DP Y HN LILG+GA I EG
Sbjct: 202 GQDNSGFTQYSGVLGLGPGTFSIVTRNFGSKFSYCFGSLIDPTYPHNFLILGNGARI-EG 260

Query: 236 DATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEA 295
           D TPLQ     YY+ L+AIS+  ++LDI P IF+R  S GG +ID+G   T L +EAYE 
Sbjct: 261 DPTPLQIFQDRYYLDLQAISLGEKLLDIEPGIFQRYRSKGGTVIDTGCSPTILAREAYET 320

Query: 296 LRDEVMIRLEGEQMRSYSWPD--KLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFY 353
           L +E+   L     R   W      CY G +  DL GFP V FHF GGA+LAL+ +S+F 
Sbjct: 321 LSEEIDFLLGEVLRRVKDWEQYTNHCYEGNLKLDLYGFPVVTFHFAGGAELALDVESLFV 380

Query: 354 QPRP-DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVLD 409
                D+FC+A+      N + + S+IG MAQQ YNVGY++   +  FQR DCE+LD
Sbjct: 381 SSESGDSFCLAMT----MNTFDDMSVIGAMAQQNYNVGYNLRTMKVYFQRTDCEILD 433


>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
 gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
 gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 424

 Score =  280 bits (716), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 161/358 (44%), Positives = 213/358 (59%), Gaps = 21/358 (5%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           F  NISIG PPVPQ   +DTGS L W+HC PC+ C PQ    F+PSRSS+Y    C S  
Sbjct: 78  FLANISIGNPPVPQLLLIDTGSDLTWIHCLPCK-CYPQTIPFFHPSRSSTYRNASCVSA- 135

Query: 121 CRYFPYARCSAYKHR----CIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
               P+A    ++      C Y   Y     T   ++ E+LTF+ +D+  I  Q++VFGC
Sbjct: 136 ----PHAMPQIFRDEKTGNCQYHLRYRDFSNTRGILAEEKLTFETSDDGLISKQNIVFGC 191

Query: 177 GFSTNRNFKFSGIFGLGIGRSSLVSQ-LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
           G   +   K+SG+ GLG G  S+V++   S FSYC GSL +P Y HN LILG+GA I EG
Sbjct: 192 GQDNSGFTKYSGVLGLGPGTFSIVTRNFGSKFSYCFGSLTNPTYPHNILILGNGAKI-EG 250

Query: 236 DATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEA 295
           D TPLQ     YY+ L+AIS   ++LDI P  F+R  S GG +ID+G   T L +EAYE 
Sbjct: 251 DPTPLQIFQDRYYLDLQAISFGEKLLDIEPGTFQRYRSQGGTVIDTGCSPTILAREAYET 310

Query: 296 LRDEVMIRLEGEQMRSYSWPDKL---CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMF 352
           L +E+   L GE +R     D+    CY G +  DL GFP V FHF GGA+LAL+ +S+F
Sbjct: 311 LSEEIDFLL-GEVLRRVKDWDQYTTPCYEGNLKLDLYGFPVVTFHFAGGAELALDVESLF 369

Query: 353 YQPRP-DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVLD 409
                 D+FC+A+      N + + S+IG MAQQ YNVGY++   +  FQR DCE++D
Sbjct: 370 VSSESGDSFCLAMT----MNTFDDMSVIGAMAQQNYNVGYNLRTMKVYFQRTDCEIID 423


>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
 gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
          Length = 438

 Score =  273 bits (699), Expect = 9e-71,   Method: Compositional matrix adjust.
 Identities = 161/393 (40%), Positives = 221/393 (56%), Gaps = 21/393 (5%)

Query: 29  SIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVH 88
           S+ RL YL  K ++     A + P+  I    F VNISIG PPV Q   MDT S LLW+ 
Sbjct: 55  SVERLEYL--KAKATGDIIAHLSPNVPIIPQAFLVNISIGSPPVTQLLHMDTASDLLWLQ 112

Query: 89  CYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPET 148
           C PC +C  Q   IF PSRS ++    C +      P  R +A    C Y+  Y+ G  +
Sbjct: 113 CRPCINCYAQSLPIFDPSRSYTHRNESCRTSQYS-MPSLRFNAKTRSCEYSMRYMDGTGS 171

Query: 149 SVFVSTEQLTFKNT--DESTIHVQDVVFGCGFST-NRNFKFSGIFGLGIGRSSLVSQLNS 205
              ++ E L F     + S+  + DVVFGCG          +GI GLG G  SLV +  +
Sbjct: 172 KGILAKEMLMFNTIYDESSSAALHDVVFGCGHDNYGEPLVGTGILGLGYGEFSLVHRFGT 231

Query: 206 SFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINP 265
            FSYC GSL DP Y HN L+LGD      GD TPL+  +G YY+T+EAISVDG +L I+P
Sbjct: 232 KFSYCFGSLDDPSYPHNVLVLGDDGANILGDTTPLEIYNGFYYVTIEAISVDGIILPIDP 291

Query: 266 NIFKRDDSG--GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL----C 319
            +F R+     GG +ID+G  +T LV+EAY+ L++++    EG    +    D +    C
Sbjct: 292 WVFNRNHQTGLGGTIIDTGNSLTSLVEEAYKPLKNKIEDYFEGRFTAADVNQDDMFKVEC 351

Query: 320 YHGIMSSDL--KGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFS 377
           Y+G +  DL   GFP V FHF  GA+L+L+  S+F +  P+ FC+AV P ++N+      
Sbjct: 352 YNGNLERDLVESGFPIVTFHFSDGAELSLDVKSVFMKLSPNVFCLAVTPGNMNS------ 405

Query: 378 LIGMMAQQFYNVGYDIGRKQKTFQRMDCEVLDD 410
            IG  AQQ YN+GYD+  K+ +F+R+DC VL D
Sbjct: 406 -IGATAQQSYNIGYDLEAKKISFERIDCGVLFD 437


>gi|255537017|ref|XP_002509575.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223549474|gb|EEF50962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 459

 Score =  261 bits (666), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 164/434 (37%), Positives = 237/434 (54%), Gaps = 37/434 (8%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQ---------------EKIRSHNT 45
           LIHRDSI SP  NPN++   R +R +  S AR  Y+Q               +   + + 
Sbjct: 39  LIHRDSIFSPAYNPNDSIKDRAKRMLKNSNARFDYVQAISKRNSAVVDYDGGDTSAADDA 98

Query: 46  YQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYP 105
           Y+A +L         F VN SIGQPPVPQ+  MDTGSSL W+ C PC +C  Q G ++ P
Sbjct: 99  YEASLLSEL----CTFLVNFSIGQPPVPQYAVMDTGSSLTWIQCEPCINCHQQKGPLYNP 154

Query: 106 SRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES 165
           S SS+Y            F     + +   C Y+Q Y     T    + EQL F+  D+ 
Sbjct: 155 SSSSTYVSCSDFDRTDTTF----TATHGSDCNYSQTYADKTTTRGTYAREQLLFETPDDG 210

Query: 166 TIHVQDVVFGCGFSTNR----NFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLH 221
              + DV+FGCG +  +        SG+FGLG   SS++S+L   FSYCIG++ DP Y  
Sbjct: 211 ITIMHDVIFGCGHNNTQLPGPTGYASGVFGLGDSGSSIISKLGFGFSYCIGNIGDPLYGF 270

Query: 222 NKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG---GVM 278
           ++L LG+   I EG +TPL    G YYITL  IS+    LDI+P +F+R D  G    ++
Sbjct: 271 HRLTLGNKLKI-EGYSTPL-VPRGLYYITLVGISIGQERLDIDPIVFQRVDLNGISSRIV 328

Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEG--EQMRSYSWPDKLCYHGIMSSDLKGFPTVRF 336
           IDSG  ++++ ++AY  +RD+V   L G   + R  +    LCY G ++ DL+GFP   F
Sbjct: 329 IDSGATLSYIPRQAYNVVRDKVSSILSGFLSRYRYIARHLSLCYIGKLNQDLQGFPDATF 388

Query: 337 HFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
           H   GA L  + + +F+Q   +  C+A+ P   +       LIG++AQQ+YNV YD+ ++
Sbjct: 389 HLADGADLVFQVEGLFFQYTDNVLCLALVPTESDEETC---LIGLLAQQYYNVAYDLKQQ 445

Query: 397 QKTFQRMDCEVLDD 410
           +  FQR++CE+LDD
Sbjct: 446 KLYFQRIECELLDD 459


>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
 gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
 gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 427

 Score =  253 bits (647), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 151/383 (39%), Positives = 208/383 (54%), Gaps = 21/383 (5%)

Query: 29  SIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVH 88
           S+ RL YL  K ++     A + P+  I    F VNISIG PP+ Q   MDT S LLW+ 
Sbjct: 55  SVERLEYL--KAKTTGDIIAHLSPNVPIIPQAFLVNISIGSPPITQLLHMDTASDLLWIQ 112

Query: 89  CYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPET 148
           C PC +C  Q   IF PSRS ++    C +      P  + +A    C Y+  Y+    +
Sbjct: 113 CLPCINCYAQSLPIFDPSRSYTHRNETCRTSQYS-MPSLKFNANTRSCEYSMRYVDDTGS 171

Query: 149 SVFVSTEQLTFKNT--DESTIHVQDVVFGCGFST-NRNFKFSGIFGLGIGRSSLVSQLNS 205
              ++ E L F     + S+  + DVVFGCG          +GI GLG G  SLV +   
Sbjct: 172 KGILAREMLLFNTIYDESSSAALHDVVFGCGHDNYGEPLVGTGILGLGYGEFSLVHRFGK 231

Query: 206 SFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINP 265
            FSYC GSL DP Y HN L+LGD      GD TPL+  +G YY+T+EAISVDG +L I+P
Sbjct: 232 KFSYCFGSLDDPSYPHNVLVLGDDGANILGDTTPLEIHNGFYYVTIEAISVDGIILPIDP 291

Query: 266 NIFKRDDSG--GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL----C 319
            +F R+     GG +ID+G  +T LV+EAY+ L++ +    EG    +    D +    C
Sbjct: 292 RVFNRNHQTGLGGTIIDTGNSLTSLVEEAYKPLKNRIEDIFEGRFTAADVSQDDMIKMEC 351

Query: 320 YHGIMSSDL--KGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFS 377
           Y+G    DL   GFP V FHF  GA+L+L+  S+F +  P+ FC+AV P ++N+      
Sbjct: 352 YNGNFERDLVESGFPIVTFHFSEGAELSLDVKSLFMKLSPNVFCLAVTPGNLNS------ 405

Query: 378 LIGMMAQQFYNVGYDIGRKQKTF 400
            IG  AQQ YN+GYD+   + +F
Sbjct: 406 -IGATAQQSYNIGYDLEAMEVSF 427


>gi|356558304|ref|XP_003547447.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-1-like [Glycine max]
          Length = 336

 Score =  226 bits (577), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 140/360 (38%), Positives = 208/360 (57%), Gaps = 32/360 (8%)

Query: 55  DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYA-- 112
           D  N  ++  +SIGQPP+PQ   MDT S +LW+ C         +G +F PS+SS+++  
Sbjct: 3   DERNKPYWSILSIGQPPIPQLVIMDTSSDILWIMC-------NHVGLLFDPSKSSTFSPL 55

Query: 113 -YVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
              PC  + C+  P     +Y  +            TS    ++ + F+ TDE    + D
Sbjct: 56  CKTPCGFKGCKCDPIPFNISYVDK----------SSTSGTFGSDTVVFETTDEGHSQIFD 105

Query: 172 VVFGCGFST--NRNFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDG 229
           V+  CG +   N +  ++GI GL  G +SL +++   FSYC+G+L DP Y +N+LIL +G
Sbjct: 106 VLVRCGHNIGFNTDPGYNGIRGLNNGPNSLATKIGQKFSYCVGNLADPYYNYNQLILCEG 165

Query: 230 AIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWL 288
           A + EG +TP +   G YY+TL+ I V  + LDI P  F+ + ++ GGV+ DSGT +T+L
Sbjct: 166 ADL-EGYSTPFEVHHGFYYVTLKGIIVGEKRLDIAPITFEIKGNNTGGVIRDSGTTITYL 224

Query: 289 VKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEK 348
           V   ++ L +EV        + S+S+  +LC++GI+S DL GFP V FHF  GA LAL+ 
Sbjct: 225 VDSVHKLLYNEV------RNLLSWSF-RQLCHYGIISRDLVGFPVVTFHFADGADLALDT 277

Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
            S F Q      CM V+PASI N  ++ S+I ++AQQ YNVGYD+      FQR+DCE+L
Sbjct: 278 GSFFNQLN-SILCMTVSPASILNTTISPSVIELLAQQSYNVGYDLLTNFVYFQRIDCELL 336


>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 447

 Score =  190 bits (483), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 142/427 (33%), Positives = 218/427 (51%), Gaps = 36/427 (8%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           LI RDS LSP+ NP+E    R+Q+  + SI+R  + +    S N+ Q+ ++ +N      
Sbjct: 39  LISRDSPLSPFYNPSETQFDRLQKAFHRSISRANHFRANGVSTNSIQSPVISNN----GE 94

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + +NIS+G PPV      DTGS LLW  C PC  C  Q+  IF P++S +Y  + C+ + 
Sbjct: 95  YLMNISLGTPPVSMHGIADTGSDLLWRQCKPCDSCYEQIEPIFDPAKSKTYQILSCEGKS 154

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C         +  + CIY+  Y  G  TS  ++ + LT  +T    + V  VVFGCG + 
Sbjct: 155 CSNLGGQGGCSDDNTCIYSYSYGDGSHTSGDLAVDTLTIGSTTGRPVSVPKVVFGCGHNN 214

Query: 181 NRNFK--FSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
              F+   SG+ GLG G  S++SQL       FSYC+  L +   + +K+  G   I+  
Sbjct: 215 GGTFELHGSGLVGLGGGPLSMISQLRPLIGGRFSYCLVPLGNDPSVSSKMHFGSRGIVSG 274

Query: 235 GDA--TPL--QFIDGHYYITLEAISVDGRML------DINPNIFKRDDSGGGVMIDSGTD 284
             A  TPL  +  D  YY+TLE++SV  + L       +   +   D+  G ++IDSGT 
Sbjct: 275 AGAVSTPLASRQPDTFYYLTLESMSVGSKKLAYKGFSKVGSPLADADE--GNIIIDSGTT 332

Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF--PTVRFHFRGGA 342
           +T L ++ Y  L   V+  + G+ +R  +    LCY     S+L G   PT+  HF  GA
Sbjct: 333 LTLLPQDFYGTLESNVVSAIGGKPVRDPNNVFSLCY-----SNLSGLRIPTITAHFV-GA 386

Query: 343 KLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
            L L+  + F Q + D FC A+ P S      + ++ G +AQ  + VGYD+  +  +F+ 
Sbjct: 387 DLELKPLNTFVQVQEDLFCFAMIPVS------DLAIFGNLAQMNFLVGYDLKSRTVSFKP 440

Query: 403 MDCEVLD 409
            DC  +D
Sbjct: 441 TDCTKID 447


>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 439

 Score =  189 bits (479), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 128/415 (30%), Positives = 208/415 (50%), Gaps = 24/415 (5%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           LIHRDS  SP+ +P++  A R+      S++R+   +    + +  Q++I+PS       
Sbjct: 36  LIHRDSPHSPFFDPSKTQAERLTDAFRRSVSRVGRFRPTAMTSDGIQSRIVPSA----GE 91

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + +N+ IG PPVP    +DTGS L W  C PC  C  Q+  +F P  SS+Y    C +  
Sbjct: 92  YLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPLFDPKNSSTYRDSSCGTSF 151

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C      R  + + +C +   Y  G  T   +++E LT  +T    +      FGCG S+
Sbjct: 152 CLALGKDRSCSKEKKCTFRYSYADGSFTGGNLASETLTVDSTAGKPVSFPGFAFGCGHSS 211

Query: 181 NRNF--KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILG-DGAIID 233
              F    SGI GLG G  SL+SQL S+    FSYC+  +     + +++  G  G +  
Sbjct: 212 GGIFDKSSSGIVGLGGGELSLISQLKSTINGLFSYCLLPVSTDSSISSRINFGASGRVSG 271

Query: 234 EGD-ATPL--QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVK 290
            G  +TPL  +  D  YY+TLE ISV  + L       K +   G +++DSGT  T+L +
Sbjct: 272 YGTVSTPLVQKSPDTFYYLTLEGISVGKKRLPYKGYSKKTEVEEGNIIVDSGTTYTFLPQ 331

Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS 350
           E Y  L   V   ++G+++R  +    LCY+   ++++   P +  HF+  A + L+  +
Sbjct: 332 EFYSKLEKSVANSIKGKRVRDPNGIFSLCYN--TTAEINA-PIITAHFK-DANVELQPLN 387

Query: 351 MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
            F + + D  C  V P S      +  ++G +AQ  + VG+D+ +K+ +F+  DC
Sbjct: 388 TFMRMQEDLVCFTVAPTS------DIGVLGNLAQVNFLVGFDLRKKRVSFKAADC 436


>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 440

 Score =  187 bits (475), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 145/426 (34%), Positives = 212/426 (49%), Gaps = 38/426 (8%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAY---LQEKIRSHNTYQAQILPSNDIS 57
           LIHRDS  SP+ NP E  + R++  I+ S++R+ +   + +K  S N  Q  +      S
Sbjct: 35  LIHRDSPKSPFYNPTETSSQRLRNAIHRSVSRVFHFTDISQKDASDNAPQIDL-----TS 89

Query: 58  NSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPC 116
           NS  Y+ NIS+G PP P     DTGS LLW  C PC DC  Q+  +F P  SS+Y  V C
Sbjct: 90  NSGEYLMNISLGTPPFPIMAIADTGSDLLWTQCKPCDDCYTQVDPLFDPKASSTYKDVSC 149

Query: 117 DSEHCRYFP-YARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
            S  C      A CS   + C Y+  Y     T   ++ + LT  +TD   + +++++ G
Sbjct: 150 SSSQCTALENQASCSTEDNTCSYSTSYGDRSYTKGNIAVDTLTLGSTDTRPVQLKNIIIG 209

Query: 176 CGFSTNRNF--KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDG 229
           CG +    F  K SGI GLG G  SL++QL  S    FSYC+  L   +   +K+  G  
Sbjct: 210 CGHNNAGTFNKKGSGIVGLGGGAVSLITQLGDSIDGKFSYCLVPLTSENDRTSKINFGTN 269

Query: 230 AIIDEGD--ATPL--QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG---VMIDSG 282
           A++      +TPL  +  +  YY+TL++ISV  + +      +   DSG G   ++IDSG
Sbjct: 270 AVVSGTGVVSTPLIAKSQETFYYLTLKSISVGSKEVQ-----YPGSDSGSGEGNIIIDSG 324

Query: 283 TDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGA 342
           T +T L  E Y  L D V   ++ E+ +       LCY    + DLK  P +  HF  GA
Sbjct: 325 TTLTLLPTEFYSELEDAVASSIDAEKKQDPQTGLSLCYSA--TGDLK-VPAITMHFD-GA 380

Query: 343 KLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
            + L+  + F Q   D  C A   +       +FS+ G +AQ  + VGYD   K  +F+ 
Sbjct: 381 DVNLKPSNCFVQISEDLVCFAFRGSP------SFSIYGNVAQMNFLVGYDTVSKTVSFKP 434

Query: 403 MDCEVL 408
            DC  +
Sbjct: 435 TDCAKM 440


>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 439

 Score =  186 bits (472), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 139/423 (32%), Positives = 206/423 (48%), Gaps = 29/423 (6%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIR-SHNTY-QAQILPSNDISN 58
           LIH DS  SP+ N  E    R+   +  SI R  YL      SHN   +  I+P    + 
Sbjct: 31  LIHPDSSRSPFYNIRETQLQRISNVVTHSIKRAHYLNHVFSLSHNDLPKPTIIP---YAG 87

Query: 59  SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
           S + ++ SIG PP   +  +DTGS  +W  C PC+ C  Q   IF PS+SS+Y  + C S
Sbjct: 88  SYYVMSYSIGTPPFQLYGVVDTGSDGIWFQCKPCKPCLNQTSPIFNPSKSSTYKNIRCSS 147

Query: 119 EHCRYFPYARCSA-YKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
             C+     RCS+  K +C Y   YL    +   +S + LT  + D S I    +V GCG
Sbjct: 148 PICKRGEKTRCSSNRKRKCEYEITYLDRSGSQGDISKDTLTLNSNDGSPISFPKIVIGCG 207

Query: 178 FSTNRNFK--FSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAI 231
              +   +   SGI G G G  S+VSQL SS    FSYC+ SL     + +KL  GD A+
Sbjct: 208 HKNSLTTEGLASGIIGFGRGNFSIVSQLGSSIGGKFSYCLASLFSKANISSKLYFGDMAV 267

Query: 232 IDEGD--ATPL--QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
           +      +TPL   F  G+Y+  LEA SV   ++ +  +    D+ G  V IDSG+ +T 
Sbjct: 268 VSGHGVVSTPLIQSFYVGNYFTNLEAFSVGDHIIKLKDSSLIPDNEGNAV-IDSGSTITQ 326

Query: 288 LVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF--PTVRFHFRGGAKLA 345
           L  + Y  L   V+  ++ ++++  +    LCY     + LK +  P +  HFR GA + 
Sbjct: 327 LPNDVYSQLETAVISMVKLKRVKDPTQQLSLCYK----TTLKKYEVPIITAHFR-GADVK 381

Query: 346 LEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           L   + F Q   +  C A N ++       + + G +AQQ + VGYD  +   +F+  +C
Sbjct: 382 LNAFNTFIQMNHEVMCFAFNSSAF-----PWVVYGNIAQQNFLVGYDTLKNIISFKPTNC 436

Query: 406 EVL 408
             L
Sbjct: 437 TKL 439


>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 434

 Score =  185 bits (470), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 134/420 (31%), Positives = 204/420 (48%), Gaps = 36/420 (8%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           +IHRDS  SP+  P E    RV   ++ S+ R  +     ++H   +A I   ND     
Sbjct: 33  MIHRDSSRSPFFRPTETQFQRVANAVHRSVNRANHFH---KAHKAAKATI-TQND---GE 85

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + ++ S+G PP   +  +DTGS ++W+ C PC  C  Q   IF PS+S++Y  +P  S  
Sbjct: 86  YLISYSVGIPPFQLYGIIDTGSDMIWLQCKPCEKCYNQTTRIFDPSKSNTYKILPFSSTT 145

Query: 121 CRYFPYARCSAYKHR-CIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
           C+      CS+   + C YT  Y  G  +   +S E LT  +T+ S++  +  V GCG +
Sbjct: 146 CQSVEDTSCSSDNRKMCEYTIYYGDGSYSQGDLSVETLTLGSTNGSSVKFRRTVIGCGRN 205

Query: 180 TNRNF--KFSGIFGLGIGRSSLVSQL-------NSSFSYCIGSLHDPDYLHNKLILGDGA 230
              +F  K SGI GLG G  SL++QL          FSYC+ S+ +   + +KL  GD A
Sbjct: 206 NTVSFEGKSSGIVGLGNGPVSLINQLRRRSSSIGRKFSYCLASMSN---ISSKLNFGDAA 262

Query: 231 IIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDV 285
           ++  GD T    I  H     YY+TLEA SV    ++   + F+  +  G ++IDSGT +
Sbjct: 263 VV-SGDGTVSTPIVTHDPKVFYYLTLEAFSVGNNRIEFTSSSFRFGEK-GNIIIDSGTTL 320

Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLA 345
           T L  + Y  L   V   +E ++++       LCY      D    P +  HF  GA + 
Sbjct: 321 TLLPNDIYSKLESAVADLVELDRVKDPLKQLSLCYRSTF--DELNAPVIMAHF-SGADVK 377

Query: 346 LEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           L   + F +      C+A   + I        + G MAQQ + VGYD+ +K  +F+  DC
Sbjct: 378 LNAVNTFIEVEQGVTCLAFISSKIG------PIFGNMAQQNFLVGYDLQKKIVSFKPTDC 431


>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 436

 Score =  184 bits (467), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 140/424 (33%), Positives = 211/424 (49%), Gaps = 41/424 (9%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           LIHRDS+ SP   P +N           SI R  +   K    N  Q+ ++P  DI   L
Sbjct: 32  LIHRDSLKSPLYKPTQNKYQYFVDAARRSINRANHFY-KYSLANIPQSTVIP--DIGEYL 88

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
             +  S+G PP   +  +DTGS ++W+ C PC++C  Q   +F PS+SSSY  +PC S+ 
Sbjct: 89  --MTYSVGTPPFKLYGIVDTGSDIVWLQCEPCQECYNQTTPMFNPSKSSSYKNIPCPSKL 146

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C+      C+  K+ C Y+  Y     +   +S + LT ++T+  T+   ++V GCG  T
Sbjct: 147 CQSMEDTSCND-KNYCEYSTYYGDNSHSGGDLSVDTLTLESTNGLTVSFPNIVIGCG--T 203

Query: 181 NRNFKF----SGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLH----NKLILGD 228
           N    +    SGI G G G +S ++QL SS    FSYC+  L     +     +KL  GD
Sbjct: 204 NNILSYEGASSGIVGFGSGPASFITQLGSSTGGKFSYCLTPLFSVTNIQSNATSKLNFGD 263

Query: 229 GAIIDEGDA---TPLQFIDGH--YYITLEAISVDGRMLDIN--PNIFKRDDSGGGVMIDS 281
            A +  GD    TP+   D    YY+TLEA SV  R ++I   PN     D+ G ++IDS
Sbjct: 264 AATV-SGDGVVTTPILKKDPETFYYLTLEAFSVGNRRVEIGGVPN----GDNEGNIIIDS 318

Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
           GT +T L K+ Y  L   V+  ++ E++   +    LCY   + ++   FP +  HF+ G
Sbjct: 319 GTTLTSLTKDDYSFLESAVVDLVKLERVDDPTQTLNLCYS--VKAEGYDFPIITMHFK-G 375

Query: 342 AKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQ 401
           A + L   S F       FC+A   +       + ++ G +AQQ   VGYD+ +K  +F+
Sbjct: 376 ADVDLHPISTFVSVADGVFCLAFESSQ------DHAIFGNLAQQNLMVGYDLQQKIVSFK 429

Query: 402 RMDC 405
             DC
Sbjct: 430 PSDC 433


>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
           CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
 gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
 gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
 gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 437

 Score =  183 bits (465), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 144/421 (34%), Positives = 205/421 (48%), Gaps = 31/421 (7%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQI-LPSNDISNS 59
           LIHRDS  SP+ NP E  + R++  I+ S+ R+ +  EK    NT Q QI L SN   + 
Sbjct: 35  LIHRDSPKSPFYNPMETSSQRLRNAIHRSVNRVFHFTEK---DNTPQPQIDLTSN---SG 88

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSE 119
            + +N+SIG PP P     DTGS LLW  C PC DC  Q+  +F P  SS+Y  V C S 
Sbjct: 89  EYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCSSS 148

Query: 120 HCRYFP-YARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
            C      A CS   + C Y+  Y     T   ++ + LT  ++D   + +++++ GCG 
Sbjct: 149 QCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIGCGH 208

Query: 179 STNRNF--KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAII 232
           +    F  K SGI GLG G  SL+ QL  S    FSYC+  L       +K+  G  AI+
Sbjct: 209 NNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFGTNAIV 268

Query: 233 DEGDATPLQFI-----DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
                     I     +  YY+TL++ISV  +   I  +    + S G ++IDSGT +T 
Sbjct: 269 SGSGVVSTPLIAKASQETFYYLTLKSISVGSKQ--IQYSGSDSESSEGNIIIDSGTTLTL 326

Query: 288 LVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALE 347
           L  E Y  L D V   ++ E+ +       LCY    + DLK  P +  HF  GA + L+
Sbjct: 327 LPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSA--TGDLK-VPVITMHFD-GADVKLD 382

Query: 348 KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
             + F Q   D  C A   +       +FS+ G +AQ  + VGYD   K  +F+  DC  
Sbjct: 383 SSNAFVQVSEDLVCFAFRGSP------SFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCAK 436

Query: 408 L 408
           +
Sbjct: 437 M 437


>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
          Length = 438

 Score =  183 bits (465), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 144/421 (34%), Positives = 205/421 (48%), Gaps = 31/421 (7%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQI-LPSNDISNS 59
           LIHRDS  SP+ NP E  + R++  I+ S+ R+ +  EK    NT Q QI L SN   + 
Sbjct: 35  LIHRDSPKSPFYNPMETSSQRLRNAIHRSVNRVFHFTEK---DNTPQPQIDLTSN---SG 88

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSE 119
            + +N+SIG PP P     DTGS LLW  C PC DC  Q+  +F P  SS+Y  V C S 
Sbjct: 89  EYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCSSS 148

Query: 120 HCRYFP-YARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
            C      A CS   + C Y+  Y     T   ++ + LT  ++D   + +++++ GCG 
Sbjct: 149 QCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIGCGH 208

Query: 179 STNRNF--KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAII 232
           +    F  K SGI GLG G  SL+ QL  S    FSYC+  L       +K+  G  AI+
Sbjct: 209 NNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFGTNAIV 268

Query: 233 DEGDATPLQFI-----DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
                     I     +  YY+TL++ISV  +   I  +    + S G ++IDSGT +T 
Sbjct: 269 SGSGVVSTPLIAKASQETFYYLTLKSISVGSKQ--IQYSGSDSESSEGNIIIDSGTTLTL 326

Query: 288 LVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALE 347
           L  E Y  L D V   ++ E+ +       LCY    + DLK  P +  HF  GA + L+
Sbjct: 327 LPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSA--TGDLK-VPVITMHFD-GADVKLD 382

Query: 348 KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
             + F Q   D  C A   +       +FS+ G +AQ  + VGYD   K  +F+  DC  
Sbjct: 383 SSNAFVQVSEDLVCFAFRGSP------SFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCAK 436

Query: 408 L 408
           +
Sbjct: 437 M 437


>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
          Length = 460

 Score =  181 bits (459), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 136/420 (32%), Positives = 196/420 (46%), Gaps = 30/420 (7%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           L+  DS LSP++  N +   R +R I  S  RL  LQ  +      +A +       N  
Sbjct: 59  LVRTDSPLSPFSPGNISSTERFKRAIKRSQDRLEKLQMSVDEVKAVEAPVY----AGNGE 114

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           F + ++IG P +     +DTGS L W  C PC DC PQ   I+ PS+SS+Y+ VPC S  
Sbjct: 115 FLMKMAIGTPSLSFSAILDTGSDLTWTQCKPCTDCYPQPTPIYDPSQSSTYSKVPCSSSM 174

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C+  P   CS     C Y  LY  G ++S        +F  T +S  H   + FGCG   
Sbjct: 175 CQALPMYSCSG--ANCEY--LYSYGDQSSTQGILSYESFTLTSQSLPH---IAFGCGQEN 227

Query: 181 NRNFKFSGIFGLGIGRS--SLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDE 234
                  G   +G GR   SL+SQL  S    FSYC+ S+ D     + L +G  A ++ 
Sbjct: 228 EGGGFSQGGGLVGFGRGPLSLISQLGQSLGNKFSYCLVSITDSPSKTSPLFIGKTASLNA 287

Query: 235 GDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRD-DSGGGVMIDSGTDVTWL 288
              +    +        YY++LE ISV G++LDI    F    D  GGV+IDSGT VT+L
Sbjct: 288 KTVSSTPLVQSRSRPTFYYLSLEGISVGGQLLDIADGTFDLQLDGTGGVIIDSGTTVTYL 347

Query: 289 VKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEK 348
            +  Y+ ++  V+  +   Q+   +    LC+     S    FPT+ FHF  GA   L K
Sbjct: 348 EQSGYDVVKKAVISSINLPQVDGSNIGLDLCFEPQSGSSTSHFPTITFHFE-GADFNLPK 406

Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
           ++  Y       C+A+ P++        S+ G + QQ Y + YD  R   +F    C+ L
Sbjct: 407 ENYIYTDSSGIACLAMLPSN------GMSIFGNIQQQNYQILYDNERNVLSFAPTVCDTL 460


>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 433

 Score =  180 bits (456), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 133/419 (31%), Positives = 201/419 (47%), Gaps = 35/419 (8%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           +IHRDS  SP+ +P E    RV   ++ SI R  +L +   S N+      P   + ++L
Sbjct: 33  MIHRDSSRSPFFSPTETQFQRVANAVHRSINRANHLNQSFVSPNS------PETTVISAL 86

Query: 61  --FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
             + ++ S+G P +  F  +DTGS ++W+ C PC+ C  Q   IF  S+S +Y  +PC S
Sbjct: 87  GEYLISYSVGTPSLQVFGILDTGSDIIWLQCQPCKKCYEQTTPIFDSSKSQTYKTLPCPS 146

Query: 119 EHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
             C+      CS+ KH C+Y+  Y+ G ++   +S E LT  +T+ S +     V GCG 
Sbjct: 147 NTCQSVQGTFCSSRKH-CLYSIHYVDGSQSLGDLSVETLTLGSTNGSPVQFPGTVIGCGR 205

Query: 179 --STNRNFKFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAII 232
             +     K SGI GLG G  SL++QL+ S    FSYC+  +       +KL  G+ A++
Sbjct: 206 YNAIGIEEKNSGIVGLGRGPMSLITQLSPSTGGKFSYCL--VPGLSTASSKLNFGNAAVV 263

Query: 233 DEGD--ATPLQFIDG--HYYITLEAISVDGRMLDINPNIFKRDDSG--GGVMIDSGTDVT 286
                 +TPL   +G   Y++TLEA SV    ++     F    SG  G ++IDSGT +T
Sbjct: 264 SGRGTVSTPLFSKNGLVFYFLTLEAFSVGRNRIE-----FGSPGSGGKGNIIIDSGTTLT 318

Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
            L    Y  L   V   +  +++R  +    LCY           P +  HF  GA + L
Sbjct: 319 ALPNGVYSKLEAAVAKTVILQRVRDPNQVLGLCYKVTPDKLDASVPVITAHFS-GADVTL 377

Query: 347 EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
              + F Q   D  C A  P          ++ G +AQQ   VGYD+     +F+  DC
Sbjct: 378 NAINTFVQVADDVVCFAFQPTETG------AVFGNLAQQNLLVGYDLQMNTVSFKHTDC 430


>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 439

 Score =  178 bits (451), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 131/416 (31%), Positives = 196/416 (47%), Gaps = 25/416 (6%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           +IHRDS  SP   P E P  RV   +  SI R  + ++   S ++ ++ ++ S       
Sbjct: 35  MIHRDSSRSPLYRPTETPFQRVANAVRRSINRGNHFKKAFVSTDSAESTVVASQ----GE 90

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + +  S+G PP      +DTGS +LW+ C PC DC  Q   IF PS+S +Y  +PC S  
Sbjct: 91  YLMRYSVGSPPFQVLGIVDTGSDILWLQCEPCEDCYKQTTPIFDPSKSKTYKTLPCSSNT 150

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C       CS+  + C Y+  Y  G  +   +S E LT  +TD S++H    V GCG + 
Sbjct: 151 CESLRNTACSS-DNVCEYSIDYGDGSHSDGDLSVETLTLGSTDGSSVHFPKTVIGCGHNN 209

Query: 181 NRNFKFSGIFGLGIGRSSL------VSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
              F+  G   +G+G   +       S +   FSYC+  +       +KL  GD A++  
Sbjct: 210 GGTFQEEGSGIVGLGGGPVSLISQLSSSIGGKFSYCLAPIFSESNSSSKLNFGDAAVVSG 269

Query: 235 GD--ATPLQFIDGH--YYITLEAISV-DGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLV 289
               +TPL  ++G   Y++TLEA SV D R+     +        G ++IDSGT +T L 
Sbjct: 270 RGTVSTPLDPLNGQVFYFLTLEAFSVGDNRIEFSGSSSSGSGSGDGNIIIDSGTTLTLLP 329

Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKD 349
           +E Y  L   V   ++ E+ R  S    LCY    +SD    P +  HF+ GA + L   
Sbjct: 330 QEDYLNLESAVSDVIKLERARDPSKLLSLCYK--TTSDELDLPVITAHFK-GADVELNPI 386

Query: 350 SMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           S F        C A   + I       ++ G +AQQ   VGYD+ +K  +F+  DC
Sbjct: 387 STFVPVEKGVVCFAFISSKIG------AIFGNLAQQNLLVGYDLVKKTVSFKPTDC 436


>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 441

 Score =  177 bits (450), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 123/418 (29%), Positives = 200/418 (47%), Gaps = 24/418 (5%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           LIHRDS  SP+ +P++    R+    + S +R+   ++   + +  Q++++PS       
Sbjct: 36  LIHRDSPHSPFFDPSKTRTERLTDAFHRSASRVGRFRQSAMTSDGIQSRLVPSA----GE 91

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + +N+SIG PPVP    +DTGS L W  C PC  C  Q+   F P  SS+Y    C +  
Sbjct: 92  YIMNLSIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPFFDPKNSSTYRDSSCGTSF 151

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C      R      +C +   Y  G  T   ++ E LT  +T    +      FGC   +
Sbjct: 152 CLALGNDRSCRNGKKCTFMYSYADGSFTGGNLAVETLTVASTAGKPVSFPGFAFGCVHRS 211

Query: 181 NRNF--KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDE 234
              F    SGI GLG+   S++SQL S+    FSYC+  +     + +++  G   I+  
Sbjct: 212 GGIFDEHSSGIVGLGVAELSMISQLKSTINGRFSYCLLPVFTDSSMSSRINFGRSGIVSG 271

Query: 235 GD--ATPLQFI--DGHYY-ITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLV 289
               +TPL     D +YY ITLE  SV  + L       K +   G +++DSGT  T+L 
Sbjct: 272 AGTVSTPLVMKGPDTYYYLITLEGFSVGKKRLSYKGFSKKAEVEEGNIIVDSGTTYTYLP 331

Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKD 349
            E Y  L + V   ++G+++R  +    LCY+   + D    P +  HF+  A + L+  
Sbjct: 332 LEFYVKLEESVAHSIKGKRVRDPNGISSLCYN--TTVDQIDAPIITAHFK-DANVELQPW 388

Query: 350 SMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
           + F + + D  C  V P S      +  ++G +AQ  + VG+D+ +K+ +F+  DC +
Sbjct: 389 NTFLRMQEDLVCFTVLPTS------DIGILGNLAQVNFLVGFDLRKKRVSFKAADCTL 440


>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 435

 Score =  177 bits (449), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 130/420 (30%), Positives = 201/420 (47%), Gaps = 35/420 (8%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           LIHRDS  SP+ NP+  P+ R+      S++RL  +   +  +   ++ ++P        
Sbjct: 33  LIHRDSPSSPFYNPSLTPSERIINAALRSMSRLQRVSHFLDENKLPESLLIPDKGEYLMR 92

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           FY    IG PPV +   +DTGSSL+W+ C PC +C PQ   +F P +SS+Y Y  CDS+ 
Sbjct: 93  FY----IGSPPVERLAMVDTGSSLIWLQCSPCHNCFPQETPLFEPLKSSTYKYATCDSQP 148

Query: 121 CRYF-PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES-TIHVQDVVFGCGF 178
           C    P  R      +CIY  +Y     +   + TE L+F +T  + T+   + +FGCG 
Sbjct: 149 CTLLQPSQRDCGKLGQCIYGIMYGDKSFSVGILGTETLSFGSTGGAQTVSFPNTIFGCGV 208

Query: 179 STNRNF----KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILGDGA 230
             N       K  GI GLG G  SLVSQL +     FSYC+  L       +KL  G  A
Sbjct: 209 DNNFTIYTSNKVMGIAGLGAGPLSLVSQLGAQIGHKFSYCL--LPYDSTSTSKLKFGSEA 266

Query: 231 IIDEGD--ATPLQF---IDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDV 285
           II      +TPL     +  +Y++ LEA+++  +++           + G ++IDSGT +
Sbjct: 267 IITTNGVVSTPLIIKPSLPTYYFLNLEAVTIGQKVVSTG-------QTDGNIVIDSGTPL 319

Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLA 345
           T+L    Y      +   L  + ++    P K C+     ++L   P + F F G +   
Sbjct: 320 TYLENTFYNNFVASLQETLGVKLLQDLPSPLKTCFPN--RANL-AIPDIAFQFTGASVAL 376

Query: 346 LEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
             K+ +      +  C+AV P+S     +  SL G +AQ  + V YD+  K+ +F   DC
Sbjct: 377 RPKNVLIPLTDSNILCLAVVPSS----GIGISLFGSIAQYDFQVEYDLEGKKVSFAPTDC 432


>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 440

 Score =  177 bits (448), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 127/417 (30%), Positives = 196/417 (47%), Gaps = 31/417 (7%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           LIHRDS  SP  NP+E PA R+ R       R     E   S NT +    P    +N  
Sbjct: 39  LIHRDSPKSPLYNPSETPAERLDRFFR----RFMSFSEASISPNTPE----PPVSSNNGE 90

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + + ISIG PP   +   DTGS L+W  C PC  C  Q   +F PS+S+S+  V C+S+ 
Sbjct: 91  YLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPSKSTSFKEVSCESQQ 150

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           CR      CS  +  C ++  Y  G      ++TE LT  +       + ++VFGCG + 
Sbjct: 151 CRLLDTVSCSQPQKLCDFSYGYGDGSLAQGVIATETLTLNSNSGQPTSILNIVFGCGHNN 210

Query: 181 NRNFKFS--GIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAII 232
           +  F  +  G+FG G    SL SQ+ S+      FS C+        + +K+I G  A +
Sbjct: 211 SGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKIIFGPEAEV 270

Query: 233 DEGD--ATPLQFID--GHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
              D  +TPL   D   +Y++TL+ ISV  ++   + +      + G V ID+GT  T L
Sbjct: 271 SGSDVVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSS--SPMATKGNVFIDAGTPPTLL 328

Query: 289 VKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEK 348
            ++ Y  L   V   +  E ++      +LCY    S+ L   P +  HF  GA + L+ 
Sbjct: 329 PRDFYNRLVQGVKEAIPMEPVQDPDLQPQLCYR---SATLIDGPILTAHFD-GADVQLKP 384

Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
            + F  P+   +C A+ P   +       + G   Q  + +G+D+  K+ +F+ +DC
Sbjct: 385 LNTFISPKEGVYCFAMQPIDGDT-----GIFGNFVQMNFLIGFDLDGKKVSFKAVDC 436


>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 435

 Score =  176 bits (445), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 135/421 (32%), Positives = 206/421 (48%), Gaps = 39/421 (9%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAY---LQEKIRSHNTYQAQILPSNDIS 57
           LIHRDS  SP+ NP E P+ R++  I+ S  R+++   L E   S N+ Q  I P     
Sbjct: 35  LIHRDSPKSPFYNPAETPSQRIRNAIHRSFNRVSHFTDLSEMDASLNSPQTDITPCG--- 91

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
              + +N+S+G PP P     DTGS+L+W  C PC DC  Q+  +F P  SS+Y  V C 
Sbjct: 92  -GEYLMNLSLGTPPSPIMAVADTGSNLIWTQCKPCDDCYTQVDPLFDPKASSTYKDVSCS 150

Query: 118 SEHCRYFP-YARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
           S  C      A CS     C Y   Y  G  T    + + LT  +TD   + +++++ GC
Sbjct: 151 SSQCTALENQASCSTEDKTCSYLVSYADGSYTMGKFAVDTLTLGSTDNRPVQLKNIIIGC 210

Query: 177 GFSTNRNF--KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDP----DYLHNKLIL 226
           G +    F  K SG+ GLG G  SL+ QL  S    FSYC+   +D     ++  N ++ 
Sbjct: 211 GQNNAVTFRNKSSGVVGLGGGAVSLIKQLGDSIDGKFSYCLVPENDQTSKINFGTNAVVS 270

Query: 227 GDGAIIDEGDATPL--QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
           G G +     +TPL  +  D  YY+TL++ISV  + +    +  K     G ++IDSGT 
Sbjct: 271 GPGTV-----STPLVVKSRDTFYYLTLKSISVGSKNMQTPDSNIK-----GNMVIDSGTT 320

Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKL 344
           +T L  + Y  + + V   +  ++ +       LCY+   ++DL   P +  HF  GA +
Sbjct: 321 LTLLPVKYYIEIENAVASLINADKSKDERIGSSLCYNA--TADL-NIPVITMHFE-GADV 376

Query: 345 ALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
            L   + F++   D  C+A   +   N      + G +AQ+ + VGYD   K  +F+  D
Sbjct: 377 KLYPYNSFFKVTEDLVCLAFGMSFYRN-----GIYGNVAQKNFLVGYDTASKTMSFKPTD 431

Query: 405 C 405
           C
Sbjct: 432 C 432


>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
          Length = 440

 Score =  175 bits (443), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 126/417 (30%), Positives = 195/417 (46%), Gaps = 31/417 (7%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           LIHRDS  SP  NP+E PA R+ R       R     E   S NT +    P    +N  
Sbjct: 39  LIHRDSPKSPLYNPSETPAERLDRFFR----RFMSFSEASISPNTPE----PPVSSNNGE 90

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + + ISIG PP   +   DTGS L+W  C PC  C  Q   +F PS+S+S+  V C+S+ 
Sbjct: 91  YLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPSKSTSFKEVSCESQQ 150

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           CR      CS  +  C ++  Y  G      ++TE LT  +       + ++VFGCG + 
Sbjct: 151 CRLLDTVSCSQPQKLCDFSYGYGDGSLAQGVIATETLTLNSNSGQPXSIXNIVFGCGHNN 210

Query: 181 NRNFKFS--GIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAII 232
           +  F  +  G+FG G    SL SQ+ S+      FS C+        + +K+I G  A +
Sbjct: 211 SGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKIIFGPEAEV 270

Query: 233 DEGD--ATPLQFID--GHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
                 +TPL   D   +Y++TL+ ISV  ++   + +      + G V ID+GT  T L
Sbjct: 271 SGSXVVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSS--SPMATKGNVFIDAGTPPTLL 328

Query: 289 VKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEK 348
            ++ Y  L   V   +  E ++      +LCY    S+ L   P +  HF  GA + L+ 
Sbjct: 329 PRDFYNRLVQGVKEAIPMEPVQDPDLQPQLCYR---SATLIDGPILTAHFD-GADVQLKP 384

Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
            + F  P+   +C A+ P   +       + G   Q  + +G+D+  K+ +F+ +DC
Sbjct: 385 LNTFISPKEGVYCFAMQPIDGDT-----GIFGNFVQMNFLIGFDLDGKKVSFKAVDC 436


>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
 gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
          Length = 443

 Score =  175 bits (443), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 133/417 (31%), Positives = 205/417 (49%), Gaps = 26/417 (6%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           LIHRDS LSP  NPN     R++   + SI+R+   + K    N++Q  ++P+       
Sbjct: 38  LIHRDSPLSPLYNPNHTDFDRLRNAFSRSISRVNVFKTKAVDINSFQNDLVPNG----GE 93

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           +++ +SIG P V      DTGS L WV C PC  C  Q   +F PSRSSSY ++ C S  
Sbjct: 94  YFMKMSIGTPLVEVIVIADTGSDLTWVQCLPCDPCYRQKSPLFDPSRSSSYRHMLCGSRF 153

Query: 121 CRYFPYAR--CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
           C     +   C+   + C Y   Y     T+  ++TE+ T  +T    +H+  +VFGCG 
Sbjct: 154 CNALDVSEQACTMDTNICEYHYSYGDKSYTNGNLATEKFTIGSTSSRPVHLSPIVFGCGT 213

Query: 179 STNRNFK--FSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILGDGAII 232
                F    SGI GLG G  SLVSQL+S     FSYC+  L +   + +K+  G  ++I
Sbjct: 214 GNGGTFDELGSGIVGLGGGALSLVSQLSSIIKGKFSYCLVPLSEQSNVTSKIKFGTDSVI 273

Query: 233 D--EGDATPL--QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
              +  +TPL  +  D +YY+TLEAISV  + L     +   +   G V+IDSGT +T+L
Sbjct: 274 SGPQVVSTPLVSKQPDTYYYVTLEAISVGNKRLPYTNGLLNGNVEKGNVIIDSGTTLTFL 333

Query: 289 VKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEK 348
             E +  L   +   ++ E++        +C+      DL   P +  HF   A + L+ 
Sbjct: 334 DSEFFTELERVLEETVKAERVSDPRGLFSVCFRSAGDIDL---PVIAVHFN-DADVKLQP 389

Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
            + F +   D  C  +  ++         + G +AQ  + VGYD+ ++  +F+  DC
Sbjct: 390 LNTFVKADEDLLCFTMISSN------QIGIFGNLAQMDFLVGYDLEKRTVSFKPTDC 440


>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 415

 Score =  174 bits (440), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 133/424 (31%), Positives = 197/424 (46%), Gaps = 64/424 (15%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISN-- 58
           LIHRDS  SP+  P +N   R+   +  SI R+ +  +       Y     P + +++  
Sbjct: 33  LIHRDSSKSPFYQPTQNKYERIANAVRRSINRVNHFYK-------YSLTSTPQSTVNSDK 85

Query: 59  SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
             + ++ SIG PP   F  +DTGS L+W+ C PC+ C PQ+  IF PS SSSY  +PC S
Sbjct: 86  GEYLMSYSIGTPPFKVFGFVDTGSDLVWLQCEPCKQCYPQITPIFDPSLSSSYQNIPCLS 145

Query: 119 EHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
           + C       C                 +   ++S E LT  +T   ++     + GCG+
Sbjct: 146 DTCHSMRTTSC-----------------DVRGYLSVETLTLDSTTGYSVSFPKTMIGCGY 188

Query: 179 STNRNFK--FSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHN---KLILGDG 229
                F    SGI GLG G  SL SQL +S    FSYC+G      +L N   KL  GD 
Sbjct: 189 RNTGTFHGPSSGIVGLGSGPMSLPSQLGTSIGGKFSYCLG-----PWLPNSTSKLNFGDA 243

Query: 230 AII--DEGDATPLQFIDGH--YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDV 285
           AI+  D    TP+   D    YY+TLEA SV  ++++     +  ++  G ++IDSGT  
Sbjct: 244 AIVYGDGAMTTPIVKKDAQSGYYLTLEAFSVGNKLIEFGGPTYGGNE--GNILIDSGTTF 301

Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLC----YHGIMSSDLKGFPTVRFHFRGG 341
           T+L  + Y      V   +  E +   +   KLC    YHG  +      P +  HF+ G
Sbjct: 302 TFLPYDVYYRFESAVAEYINLEHVEDPNGTFKLCYNVAYHGFEA------PLITAHFK-G 354

Query: 342 AKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQ 401
           A + L   S F +      C+A  P+         ++ G +AQQ   VGY++ +   TF+
Sbjct: 355 ADIKLYYISTFIKVSDGIACLAFIPSQT-------AIFGNVAQQNLLVGYNLVQNTVTFK 407

Query: 402 RMDC 405
            +DC
Sbjct: 408 PVDC 411


>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 431

 Score =  174 bits (440), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 129/415 (31%), Positives = 191/415 (46%), Gaps = 28/415 (6%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           LIHRDS  SP   P +N    V      SI R   L +   S NT ++ +     ++   
Sbjct: 32  LIHRDSSKSPLYKPAQNKFQHVVNAARRSINRANRLFKDSLS-NTPESTVY----VNGGE 86

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + +  S+G PP   +  +DTGS ++W+ C PC  C  Q   IF PS+SSSY  +PC S  
Sbjct: 87  YLMTYSVGTPPFNVYGVVDTGSDIVWLQCKPCEQCYKQTTPIFNPSKSSSYKNIPCSSNL 146

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C+   Y  C+  ++ C YT  +     +   +S E LT  +T   ++     V GCG + 
Sbjct: 147 CQSVRYTSCNK-QNSCEYTINFSDQSYSQGELSVETLTLDSTTGHSVSFPKTVIGCGHNN 205

Query: 181 NRNFK--FSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDE 234
              F+   SGI GLGIG  SL +QL SS    FSYC+  L       +KL  GD A++  
Sbjct: 206 RGMFQGETSGIVGLGIGPVSLTTQLKSSIGGKFSYCLLPLLVDSNKTSKLNFGDAAVVSG 265

Query: 235 GDATPLQFI----DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVK 290
                  F+       YY+TLEA SV  + ++        D   G +++DSGT +T L  
Sbjct: 266 DGVVSTPFVKKDPQAFYYLTLEAFSVGNKRIEFE---VLDDSEEGNIILDSGTTLTLLPS 322

Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS 350
             Y  L   V   ++ +++   +    LCY   ++SD   FP +  HF+ GA + L   S
Sbjct: 323 HVYTNLESAVAQLVKLDRVDDPNQLLNLCYS--ITSDQYDFPIITAHFK-GADIKLNPIS 379

Query: 351 MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
            F        C+A   +          + G +AQ    VGYD+ +   +F+  DC
Sbjct: 380 TFAHVADGVVCLAFTSSQTG------PIFGNLAQLNLLVGYDLQQNIVSFKPSDC 428


>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
 gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 431

 Score =  173 bits (438), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 132/418 (31%), Positives = 191/418 (45%), Gaps = 26/418 (6%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           LIHRDS  SP+ N  E  + R++  I  S            S N+ Q+ I  +       
Sbjct: 30  LIHRDSPKSPFYNSAETSSQRMRNAIRRSARSTLQFSNDDASPNSPQSFITSNR----GE 85

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + +NISIG PPVP     DTGS L+W  C PC DC  Q   +F P  SS+Y  V C S  
Sbjct: 86  YLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSPLFDPKESSTYRKVSCSSSQ 145

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           CR    A CS  ++ C YT  Y     T   V+ + +T  ++    + +++++ GCG   
Sbjct: 146 CRALEDASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGSSGRRPVSLRNMIIGCGHEN 205

Query: 181 NRNF--KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
              F    SGI GLG G +SLVSQL    N  FSYC+        L +K+  G   I+  
Sbjct: 206 TGTFDPAGSGIIGLGGGSTSLVSQLRKSINGKFSYCLVPFTSETGLTSKINFGTNGIVSG 265

Query: 235 GDATPLQFIDG----HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVK 290
                   +      +Y++ LEAISV  + +     IF   +  G ++IDSGT +T L  
Sbjct: 266 DGVVSTSMVKKDPATYYFLNLEAISVGSKKIQFTSTIFGTGE--GNIVIDSGTTLTLLPS 323

Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS 350
             Y  L   V   ++ E+++       LCY    SS  K  P +  HF+GG  + L   +
Sbjct: 324 NFYYELESVVASTIKAERVQDPDGILSLCYRD--SSSFK-VPDITVHFKGG-DVKLGNLN 379

Query: 351 MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
            F     D  C A    + N +    ++ G +AQ  + VGYD      +F++ DC  +
Sbjct: 380 TFVAVSEDVSCFAF---AANEQ---LTIFGNLAQMNFLVGYDTVSGTVSFKKTDCSQM 431


>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 430

 Score =  172 bits (437), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 130/420 (30%), Positives = 209/420 (49%), Gaps = 37/420 (8%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNT--YQAQILPSNDISN 58
           L HRDS+LSP    + +   R+      S++R A L  +  +      Q+ I P +    
Sbjct: 34  LFHRDSLLSPLEFSSLSHYDRLANAFRRSLSRSAALLNRAATSGAVGLQSSIGPGS---- 89

Query: 59  SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
             + +++SIG PPV      DTGS L W  C PC  C  QL  IF P +S+S+++VPC++
Sbjct: 90  GEYLMSVSIGTPPVDYLGIADTGSDLTWAQCLPCLKCYQQLRPIFNPLKSTSFSHVPCNT 149

Query: 119 EHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
           + C       C   +  C Y+  Y  G  T    S   L F+     +  V+ V+ GCG 
Sbjct: 150 QTCHAVDDGHC-GVQGVCDYSYTY--GDRT---YSKGDLGFEKITIGSSSVKSVI-GCGH 202

Query: 179 STNRNFKF-SGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAI 231
           +++  F F SG+ GLG G+ SLVSQ++ +      FSYC+ +L    + + K+  G+ A+
Sbjct: 203 ASSGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLL--SHANGKINFGENAV 260

Query: 232 IDEGD--ATPL--QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
           +      +TPL  +    +YYITLEAIS+           F +    G V+IDSGT +T 
Sbjct: 261 VSGPGVVSTPLISKNTVTYYYITLEAISIGNE----RHMAFAKQ---GNVIIDSGTTLTI 313

Query: 288 LVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCY-HGIMSSDLKGFPTVRFHFRGGAKLAL 346
           L KE Y+ +   ++  ++ ++++       LC+  GI ++   G P +  HF GGA + L
Sbjct: 314 LPKELYDGVVSSLLKVVKAKRVKDPHGSLDLCFDDGINAAASLGIPVITAHFSGGANVNL 373

Query: 347 EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
              + F +   +  C+ +  AS       F +IG +AQ  + +GYD+  K+ +F+   C 
Sbjct: 374 LPINTFRKVADNVNCLTLKAASPTTE---FGIIGNLAQANFLIGYDLEAKRLSFKPTVCA 430


>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  172 bits (435), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 132/416 (31%), Positives = 200/416 (48%), Gaps = 27/416 (6%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           LIHRDS  SPY  P EN           SI R  +   K    +T ++ ++P        
Sbjct: 32  LIHRDSPKSPYYKPTENKYQHFVDAARRSINRANHF-FKDSDTSTPESTVIPDR----GG 86

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + +  S+G PP   +   DTGS ++W+ C PC  C  Q   IF PS+SSSY  +PC S+ 
Sbjct: 87  YLMTYSVGTPPTKIYGIADTGSDIVWLQCEPCEQCYNQTTPIFNPSKSSSYKNIPCSSKL 146

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C       CS  ++ C Y   Y     +   +S + L+ ++T  S +    +V GCG   
Sbjct: 147 CHSVRDTSCSD-QNSCQYKISYGDSSHSQGDLSVDTLSLESTSGSPVSFPKIVIGCGTDN 205

Query: 181 NRNF--KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLI-LGDGAIID 233
              F    SGI GLG G  SL++QL SS    FSYC+  L + +   + ++  GD A++ 
Sbjct: 206 AGTFGGASSGIVGLGGGPVSLITQLGSSIGGKFSYCLVPLLNKESNASSILSFGDAAVV- 264

Query: 234 EGD---ATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLV 289
            GD   +TPL   D   Y++TL+A SV  + ++   +    DD  G ++IDSGT +T + 
Sbjct: 265 SGDGVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGGSSEGGDDE-GNIIIDSGTTLTLIP 323

Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKD 349
            + Y  L   V+  ++ +++   +    LCY   + S+   FP +  HF+ GA + L   
Sbjct: 324 SDVYTNLESAVVDLVKLDRVDDPNQQFSLCYS--LKSNEYDFPIITVHFK-GADVELHSI 380

Query: 350 SMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           S F        C A  P+         S+ G +AQQ   VGYD+ +K  +F+  DC
Sbjct: 381 STFVPITDGIVCFAFQPSPQLG-----SIFGNLAQQNLLVGYDLQQKTVSFKPTDC 431


>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
          Length = 448

 Score =  171 bits (434), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 122/374 (32%), Positives = 187/374 (50%), Gaps = 43/374 (11%)

Query: 61  FYVNISIGQPPVPQFTAM-DTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSE 119
           + ++++IG PP+ ++TAM DTGS L+W  C PC  C+ Q    F P+RS++Y  VPC S 
Sbjct: 92  YLMDLAIGTPPL-RYTAMVDTGSDLIWTQCAPCVLCADQPTPYFRPARSATYRLVPCRSP 150

Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG-F 178
            C   PY  C   +  C+Y   Y     T+  +++E  TF   + S + V DV FGCG  
Sbjct: 151 LCAALPYPAC-FQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVSDVAFGCGNI 209

Query: 179 STNRNFKFSGIFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNKLILGDGAIIDEGDA 237
           ++ +    SG+ GLG G  SLVSQL  S FSYC+ S   P+   ++L  G  A ++  +A
Sbjct: 210 NSGQLANSSGMVGLGRGPLSLVSQLGPSRFSYCLTSFLSPE--PSRLNFGVFATLNGTNA 267

Query: 238 ---------TPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTD 284
                    TPL     +   Y+++L+ IS+  + L I+P +F  +D G GGV IDSGT 
Sbjct: 268 SSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGVFIDSGTS 327

Query: 285 VTWLVKEAYEALRDEVMIRL---------EGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
           +TWL ++AY+A+R E++  L         E      + WP          S     P + 
Sbjct: 328 LTWLQQDAYDAVRHELVSVLRPLPPTNDTEIGLETCFPWPPP-------PSVAVTVPDME 380

Query: 336 FHFRGGAKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
            HF GGA + +  ++ M         C+A+       R  + ++IG   QQ  ++ YDI 
Sbjct: 381 LHFDGGANMTVPPENYMLIDGATGFLCLAM------IRSGDATIIGNYQQQNMHILYDIA 434

Query: 395 RKQKTFQRMDCEVL 408
               +F    C ++
Sbjct: 435 NSLLSFVPAPCNIV 448


>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
 gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
 gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
          Length = 448

 Score =  171 bits (434), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 122/374 (32%), Positives = 187/374 (50%), Gaps = 43/374 (11%)

Query: 61  FYVNISIGQPPVPQFTAM-DTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSE 119
           + ++++IG PP+ ++TAM DTGS L+W  C PC  C+ Q    F P+RS++Y  VPC S 
Sbjct: 92  YLMDLAIGTPPL-RYTAMVDTGSDLIWTQCAPCVLCADQPTPYFRPARSATYRLVPCRSP 150

Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG-F 178
            C   PY  C   +  C+Y   Y     T+  +++E  TF   + S + V DV FGCG  
Sbjct: 151 LCAALPYPAC-FQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVSDVAFGCGNI 209

Query: 179 STNRNFKFSGIFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNKLILGDGAIIDEGDA 237
           ++ +    SG+ GLG G  SLVSQL  S FSYC+ S   P+   ++L  G  A ++  +A
Sbjct: 210 NSGQLANSSGMVGLGRGPLSLVSQLGPSRFSYCLTSFLSPE--PSRLNFGVFATLNGTNA 267

Query: 238 ---------TPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTD 284
                    TPL     +   Y+++L+ IS+  + L I+P +F  +D G GGV IDSGT 
Sbjct: 268 SSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGVFIDSGTS 327

Query: 285 VTWLVKEAYEALRDEVMIRL---------EGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
           +TWL ++AY+A+R E++  L         E      + WP          S     P + 
Sbjct: 328 LTWLQQDAYDAVRRELVSVLRPLPPTNDTEIGLETCFPWPPP-------PSVAVTVPDME 380

Query: 336 FHFRGGAKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
            HF GGA + +  ++ M         C+A+       R  + ++IG   QQ  ++ YDI 
Sbjct: 381 LHFDGGANMTVPPENYMLIDGATGFLCLAM------IRSGDATIIGNYQQQNMHILYDIA 434

Query: 395 RKQKTFQRMDCEVL 408
               +F    C ++
Sbjct: 435 NSLLSFVPAPCNIV 448


>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
          Length = 437

 Score =  171 bits (433), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 132/424 (31%), Positives = 204/424 (48%), Gaps = 41/424 (9%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           LIHRDS LSP+ NP+  P+ R+      SI+RL  +   +  +N     +L    + N  
Sbjct: 33  LIHRDSPLSPFYNPSLTPSQRIINAALRSISRLNRVSNLLDQNNKLPQSVL---ILHNGE 89

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + +   IG PPV +    DTGS L+WV C PC  C PQ   +F P +SS++    C S+ 
Sbjct: 90  YLMRFYIGTPPVERLATADTGSDLIWVQCSPCASCFPQSTPLFQPLKSSTFMPTTCRSQP 149

Query: 121 CR-YFPYARCSAYKHRCIYTQLYLIGPETSV---FVSTEQLTFKNTDE-STIHVQDVVFG 175
           C    P  +       CIYT  Y  G + S     +STE L F +     T+   +  FG
Sbjct: 150 CTLLLPEQKGCGKSGECIYT--YKYGDQYSFSEGLLSTETLRFDSQGGVQTVAFPNSFFG 207

Query: 176 CGFSTN----RNFKFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILG 227
           CG   N     ++K +GI GLG G  SLVSQ+       FSYC+  L       +KL  G
Sbjct: 208 CGLYNNITVFPSYKLTGIMGLGAGPLSLVSQIGDQIGHKFSYCLLPLGSTS--TSKLKFG 265

Query: 228 DGAIID-EG-DATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSG 282
           + +II  EG  +TP+    ++  +Y++ LEA++V  + +           + G V+IDSG
Sbjct: 266 NESIITGEGVVSTPMIIKPWLPTYYFLNLEAVTVAQKTVPTG-------STDGNVIIDSG 318

Query: 283 TDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGA 342
           T +T+L +  Y      +   L  E ++    P   C+      D   FP + F F  GA
Sbjct: 319 TLLTYLGESFYYNFAASLQESLAVELVQDVLSPLPFCFP---YRDNFVFPEIAFQFT-GA 374

Query: 343 KLALEKDSMFYQPRP-DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQ 401
           +++L+  ++F      +  C+ + P+S++      S+ G  +Q  + V YD+  K+ +FQ
Sbjct: 375 RVSLKPANLFVMTEDRNTVCLMIAPSSVS----GISIFGSFSQIDFQVEYDLEGKKVSFQ 430

Query: 402 RMDC 405
             DC
Sbjct: 431 PTDC 434


>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
 gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
 gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 454

 Score =  170 bits (430), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 126/368 (34%), Positives = 177/368 (48%), Gaps = 32/368 (8%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
           N  F +++SIG P +     +DTGS L+W  C PC DC  Q   +F PS SS+YA VPC 
Sbjct: 102 NGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCS 161

Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
           S  C   P ++C++   +C YT  Y     T   ++TE  T   +      +  VVFGCG
Sbjct: 162 SASCSDLPTSKCTS-ASKCGYTYTYGDSSSTQGVLATETFTLAKS-----KLPGVVFGCG 215

Query: 178 FSTNRNFKFS---GIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIID 233
             TN    FS   G+ GLG G  SLVSQL    FSYC+ SL D +  ++ L+LG  A I 
Sbjct: 216 -DTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCLTSLDDTN--NSPLLLGSLAGIS 272

Query: 234 EG-------DATPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSG 282
           E          TPL         YY++L+AI+V    + +  + F  +DD  GGV++DSG
Sbjct: 273 EASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSG 332

Query: 283 TDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS-DLKGFPTVRFHFRGG 341
           T +T+L  + Y AL+     ++             LC+       D    P + FHF GG
Sbjct: 333 TSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGG 392

Query: 342 AKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
           A L L  ++ M       A C+ V    + +R    S+IG   QQ +   YD+G    +F
Sbjct: 393 ADLDLPAENYMVLDGGSGALCLTV----MGSR--GLSIIGNFQQQNFQFVYDVGHDTLSF 446

Query: 401 QRMDCEVL 408
             + C  L
Sbjct: 447 APVQCNKL 454


>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
 gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
 gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
          Length = 444

 Score =  169 bits (429), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 126/368 (34%), Positives = 177/368 (48%), Gaps = 32/368 (8%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
           N  F +++SIG P +     +DTGS L+W  C PC DC  Q   +F PS SS+YA VPC 
Sbjct: 92  NGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCS 151

Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
           S  C   P ++C++   +C YT  Y     T   ++TE  T   +      +  VVFGCG
Sbjct: 152 SASCSDLPTSKCTS-ASKCGYTYTYGDSSSTQGVLATETFTLAKS-----KLPGVVFGCG 205

Query: 178 FSTNRNFKFS---GIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIID 233
             TN    FS   G+ GLG G  SLVSQL    FSYC+ SL D +  ++ L+LG  A I 
Sbjct: 206 -DTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCLTSLDDTN--NSPLLLGSLAGIS 262

Query: 234 EG-------DATPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSG 282
           E          TPL         YY++L+AI+V    + +  + F  +DD  GGV++DSG
Sbjct: 263 EASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSG 322

Query: 283 TDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS-DLKGFPTVRFHFRGG 341
           T +T+L  + Y AL+     ++             LC+       D    P + FHF GG
Sbjct: 323 TSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGG 382

Query: 342 AKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
           A L L  ++ M       A C+ V    + +R    S+IG   QQ +   YD+G    +F
Sbjct: 383 ADLDLPAENYMVLDGGSGALCLTV----MGSR--GLSIIGNFQQQNFQFVYDVGHDTLSF 436

Query: 401 QRMDCEVL 408
             + C  L
Sbjct: 437 APVQCNKL 444


>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  169 bits (429), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 132/416 (31%), Positives = 199/416 (47%), Gaps = 27/416 (6%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           LIHRDS  SPY  P EN           SI R  +   K    +T ++ ++P        
Sbjct: 32  LIHRDSPKSPYYKPTENKYQHFVDAARRSINRANHF-FKDSDTSTPESTVIPDR----GG 86

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + +  S+G PP   +   DTGS ++W+ C PC  C  Q   IF PS+SSSY  +PC S+ 
Sbjct: 87  YLMTYSVGTPPTKIYGIADTGSDIVWLQCEPCEQCYNQTTPIFNPSKSSSYKNIPCLSKL 146

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C       CS  ++ C Y   Y     +   +S + L+ ++T  S +     V GCG   
Sbjct: 147 CHSVRDTSCSD-QNSCQYKISYGDSSHSQGDLSVDTLSLESTSGSPVSFPKTVIGCGTDN 205

Query: 181 NRNF--KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLI-LGDGAIID 233
              F    SGI GLG G  SL++QL SS    FSYC+  L + +   + ++  GD A++ 
Sbjct: 206 AGTFGGASSGIVGLGGGPVSLITQLGSSIGGKFSYCLVPLLNKESNASSILSFGDAAVV- 264

Query: 234 EGD---ATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLV 289
            GD   +TPL   D   Y++TL+A SV  + ++   +    DD  G ++IDSGT +T + 
Sbjct: 265 SGDGVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGGSSEGGDDE-GNIIIDSGTTLTLIP 323

Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKD 349
            + Y  L   V+  ++ +++   +    LCY   + S+   FP +  HF+ GA + L   
Sbjct: 324 SDVYTNLESAVVDLVKLDRVDDPNQQFSLCYS--LKSNEYDFPIITAHFK-GADIELHSI 380

Query: 350 SMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           S F        C A  P+         S+ G +AQQ   VGYD+ +K  +F+  DC
Sbjct: 381 STFVPITDGIVCFAFQPSPQLG-----SIFGNLAQQNLLVGYDLQQKTVSFKPTDC 431


>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
          Length = 423

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 126/368 (34%), Positives = 177/368 (48%), Gaps = 32/368 (8%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
           N  F +++SIG P +     +DTGS L+W  C PC DC  Q   +F PS SS+YA VPC 
Sbjct: 71  NGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCS 130

Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
           S  C   P ++C++   +C YT  Y     T   ++TE  T   +      +  VVFGCG
Sbjct: 131 SASCSDLPTSKCTS-ASKCGYTYTYGDSSSTQGVLATETFTLAKS-----KLPGVVFGCG 184

Query: 178 FSTNRNFKFS---GIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIID 233
             TN    FS   G+ GLG G  SLVSQL    FSYC+ SL D +  ++ L+LG  A I 
Sbjct: 185 -DTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCLTSLDDTN--NSPLLLGSLAGIS 241

Query: 234 EG-------DATPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSG 282
           E          TPL         YY++L+AI+V    + +  + F  +DD  GGV++DSG
Sbjct: 242 EASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSG 301

Query: 283 TDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS-DLKGFPTVRFHFRGG 341
           T +T+L  + Y AL+     ++             LC+       D    P + FHF GG
Sbjct: 302 TSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGG 361

Query: 342 AKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
           A L L  ++ M       A C+ V    + +R    S+IG   QQ +   YD+G    +F
Sbjct: 362 ADLDLPAENYMVLDGGSGALCLTV----MGSR--GLSIIGNFQQQNFQFVYDVGHDTLSF 415

Query: 401 QRMDCEVL 408
             + C  L
Sbjct: 416 APVQCNKL 423


>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 412

 Score =  169 bits (427), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 118/413 (28%), Positives = 202/413 (48%), Gaps = 42/413 (10%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           LIH  S  SP+ NP E    R+   +N SI R+ YL   + S +  + Q +P +    + 
Sbjct: 31  LIHPISSRSPFYNPKETQIQRISSILNYSINRVRYLNH-VFSFSPNKIQDVPLSSFMGAG 89

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + ++ SIG PP   ++ +DTG+  +W  C PC+ C  Q   +F+PS+SS+Y  +PC S  
Sbjct: 90  YVMSYSIGTPPFQLYSLIDTGNDNIWFQCKPCKPCLNQTSPMFHPSKSSTYKTIPCTSPI 149

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C+       +A  H                ++  + LT  + + + I  +++V GCG   
Sbjct: 150 CK-------NADGH----------------YLGVDTLTLNSNNGTPISFKNIVIGCGHRN 186

Query: 181 NRNFK--FSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIID- 233
               +   SG  GL  G  S +SQLNSS    FSYC+  L   + + +KL  GD + +  
Sbjct: 187 QGPLEGYVSGNIGLARGPLSFISQLNSSIGGKFSYCLVPLFSKENVSSKLHFGDKSTVSG 246

Query: 234 -EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEA 292
               +TP++  +G Y+++LEA SV   ++ +     +  D+ G  +IDSGT +T L K+ 
Sbjct: 247 LGTVSTPIKEENG-YFVSLEAFSVGDHIIKL-----ENSDNRGNSIIDSGTTMTILPKDV 300

Query: 293 YEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMF 352
           Y  L   V+  ++ ++++  S    LCY    ++ L     +  HF  G+++ L   + F
Sbjct: 301 YSRLESVVLDMVKLKRVKDPSQQFNLCYQTTSTTLLTKVLIITAHF-SGSEVHLNALNTF 359

Query: 353 YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           Y    +  C A         + + ++ G + QQ + VG+D+ +K  +F+  DC
Sbjct: 360 YPITDEVICFAFVSGG---NFSSLAIFGNVVQQNFLVGFDLNKKTISFKPTDC 409


>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
          Length = 437

 Score =  169 bits (427), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 123/358 (34%), Positives = 174/358 (48%), Gaps = 31/358 (8%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + +N+SIG P  P    MDTGS L+W  C PC  C  Q   IF P  SSS++ +PC S+ 
Sbjct: 95  YLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQL 154

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C+      CS   + C YT  Y  G ET   + TE LTF      ++ + ++ FGCG   
Sbjct: 155 CQALQSPTCS--NNSCQYTYGYGDGSETQGSMGTETLTFG-----SVSIPNITFGCG-EN 206

Query: 181 NRNF---KFSGIFGLGIGRSSLVSQLN-SSFSYC---IGSLHDPDYLHNKLILGDGAIID 233
           N+ F     +G+ G+G G  SL SQL+ + FSYC   IGS +    L     L +     
Sbjct: 207 NQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTPIGSSNSSTLLLGS--LANSVTAG 264

Query: 234 EGDATPLQF--IDGHYYITLEAISVDGRMLDINPNIFK--RDDSGGGVMIDSGTDVTWLV 289
             + T +Q   I   YYITL  +SV    L I+P++FK   ++  GG++IDSGT +T+ V
Sbjct: 265 SPNTTLIQSSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGIIIDSGTTLTYFV 324

Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF--PTVRFHFRGGAKLALE 347
             AY+A+R   + ++    +   S    LC+   M SD      PT   HF GG  L L 
Sbjct: 325 DNAYQAVRQAFISQMNLSVVNGSSSGFDLCFQ--MPSDQSNLQIPTFVMHFDGG-DLVLP 381

Query: 348 KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
            ++ F  P     C+A+  +S        S+ G + QQ   V YD G    +F    C
Sbjct: 382 SENYFISPSNGLICLAMGSSS-----QGMSIFGNIQQQNLLVVYDTGNSVVSFLSAQC 434


>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 461

 Score =  168 bits (425), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 124/405 (30%), Positives = 195/405 (48%), Gaps = 27/405 (6%)

Query: 20  HRVQRGINISIARLAYLQEKI--RSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTA 77
            R++RG+     RL  L   +   ++ T   Q+       N  F + ++IG PP      
Sbjct: 68  ERLRRGVARGKNRLHRLNAMVLAAANATVGDQVKAPVVAGNGEFLMKLAIGSPPRSFSAI 127

Query: 78  MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCI 137
           MDTGS L+W  C PC+ C  Q   IF P +SSS+  + C SE C   P + CS+    C 
Sbjct: 128 MDTGSDLIWTQCKPCQQCFDQSTPIFDPKQSSSFYKISCSSELCGALPTSTCSS--DGCE 185

Query: 138 YTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRN--FKFSGIFGLGIG 195
           Y   Y     T   ++ E  TF ++ E  I +  + FGCG   N +   + +G+ GLG G
Sbjct: 186 YLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLGFGCGNDNNGDGFSQGAGLVGLGRG 245

Query: 196 RSSLVSQLNS-SFSYCIGSLHDPDYLHNKLILGDGAII------DEGDATPL---QFIDG 245
             SLVSQL    F+YC+ ++ D     + L+LG  A I      DE   TPL        
Sbjct: 246 PLSLVSQLKEQKFAYCLTAIDDSK--PSSLLLGSLANITPKTSKDEMKTTPLIKNPSQPS 303

Query: 246 HYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRL 304
            YY++L+ ISV G  L I  + F+  DD  GGV+IDSGT +T++   A+ +L++E + ++
Sbjct: 304 FYYLSLQGISVGGTQLSIPKSTFELHDDGSGGVIIDSGTTITYVENSAFTSLKNEFIAQM 363

Query: 305 EGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS-MFYQPRPDAFCMA 363
                 S +    LC++    ++    P + FHF+ GA L L  ++ M    +    C+A
Sbjct: 364 NLPVDDSGTGGLDLCFNLPAGTNQVEVPKLTFHFK-GADLELPGENYMIGDSKAGLLCLA 422

Query: 364 VNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
           +  +         S+ G + QQ + V +D+  +  +F    C+ +
Sbjct: 423 IGSSR------GMSIFGNLQQQNFMVVHDLQEETLSFLPTQCDSI 461


>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 440

 Score =  168 bits (425), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 132/425 (31%), Positives = 192/425 (45%), Gaps = 40/425 (9%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIAR----LAYLQEKIRSHNTYQAQILPSNDI 56
           LIHR+S LSP+ NP+  P+ R++  +  S AR    L   Q   RS  T     +P   I
Sbjct: 33  LIHRESPLSPFYNPSLTPSERIKNTVLRSFARSKRRLRLSQNDDRSPGTIT---IPDEPI 89

Query: 57  SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPC 116
           +  L  +   IG PPV +F   DTGS L+WV C PC  C PQ   +F P +SS++  VPC
Sbjct: 90  TEYL--MRFYIGTPPVERFAIADTGSDLIWVQCAPCEKCVPQNAPLFDPRKSSTFKTVPC 147

Query: 117 DSEHCRYFPYAR--CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
           DS+ C   P ++  C     +C Y  +Y      S  +  E + F + + + I    + F
Sbjct: 148 DSQPCTLLPPSQRACVGKSGQCYYQYIYGDHTLVSGILGFESINFGSKNNA-IKFPKLTF 206

Query: 175 GCGFSTNRNFKFS----GIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLIL 226
           GC FS N     S    G+ GLG+G  SL+SQL       FSYC   L       +K+  
Sbjct: 207 GCTFSNNDTVDESKRNMGLVGLGVGPLSLISQLGYQIGRKFSYCFPPLSSNS--TSKMRF 264

Query: 227 GDGAIIDEGD---ATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
           G+ AI+ +     +TPL        +YY+ LE +S+  + +  +        + G ++ID
Sbjct: 265 GNDAIVKQIKGVVSTPLIIKSIGPSYYYLNLEGVSIGNKKVKTS-----ESQTDGNILID 319

Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRG 340
           SGT  T L +  Y      V      E ++        C+        K FP V F F  
Sbjct: 320 SGTSFTILKQSFYNKFVALVKEVYGVEAVKIPPLVYNFCFEN--KGKRKRFPDVVFLFT- 376

Query: 341 GAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
           GAK+ ++  ++F     +  CM   P S  +     S+ G  AQ  Y V YD+     +F
Sbjct: 377 GAKVRVDASNLFEAEDNNLLCMVALPTSDEDD----SIFGNHAQIGYQVEYDLQGGMVSF 432

Query: 401 QRMDC 405
              DC
Sbjct: 433 APADC 437


>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
 gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 459

 Score =  167 bits (424), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 124/398 (31%), Positives = 186/398 (46%), Gaps = 42/398 (10%)

Query: 41  RSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ-L 99
           R + T ++ ++      +  ++V+I +G PP       DTGS L+WV C  CR+CS    
Sbjct: 68  RPNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPP 127

Query: 100 GTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQL---------YLIGPETSV 150
            + F P  SSS++   C   HCR  P+A      H C +T+L         Y  G  +S 
Sbjct: 128 SSAFLPRHSSSFSPFHCFDPHCRLLPHAP----HHLCNHTRLHSPCRFLYSYADGSLSSG 183

Query: 151 FVSTEQLTFKNTDESTIHVQDVVFGCGFSTN----RNFKFS---GIFGLGIGRSSLVSQL 203
           F S E  T K+   S IH++ + FGCGF  +       +F+   G+ GLG G  S  SQL
Sbjct: 184 FFSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQL 243

Query: 204 ----NSSFSYCIG--SLHDPDYLHNKLILGDG------AIIDEGDATPLQ---FIDGHYY 248
                + FSYC+   +L  P    + L++G G          +   TPLQ        YY
Sbjct: 244 GRRFGNKFSYCLMDYTLSPPPT--SFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYY 301

Query: 249 ITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGE 307
           IT+ +I++DG  L INP +++ D+ G GG ++DSGT +T+L K AYE +   V  R++  
Sbjct: 302 ITIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLP 361

Query: 308 QMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPA 367
                +    LC +    S     P +RF   GGA  A    + F +      C+A+   
Sbjct: 362 NAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAV 421

Query: 368 SINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
              N    FS+IG + QQ + + +D    +  F R  C
Sbjct: 422 ESGN---GFSVIGNLMQQGFLLEFDKEESRLGFTRRGC 456


>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 462

 Score =  167 bits (424), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 127/417 (30%), Positives = 198/417 (47%), Gaps = 35/417 (8%)

Query: 15  NENPAHRVQRGINISIARLAYLQE----KIRSHNTYQAQILPSNDISNSLFYVNISIGQP 70
           N     ++QRGIN    RL  L       + S+      I       +  F + +SIG P
Sbjct: 58  NLTKIQKIQRGINRGFHRLNRLGAVAVLAVASNPDDTNNIKAPTHGGSGEFLMELSIGNP 117

Query: 71  PVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCS 130
            V     +DTGS L+W  C PC +C  Q   IF P +SSSY+ V C S  C   P + C+
Sbjct: 118 AVKYAAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCNALPRSNCN 177

Query: 131 AYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRN--FKFSG 188
             K  C Y   Y     T   ++TE  TF+  DE++I    + FGCG     +   + SG
Sbjct: 178 EDKDSCEYLYTYGDYSSTRGLLATETFTFE--DENSI--SGIGFGCGVENEGDGFSQGSG 233

Query: 189 IFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDY--------LHNKLILGDGAIIDEGDATP 239
           + GLG G  SL+SQL  + FSYC+ S+ D +         L + ++   GA +D G+ T 
Sbjct: 234 LVGLGRGPLSLISQLKETKFSYCLTSIEDSEASSSLFIGSLASGIVNKTGANLD-GEVTK 292

Query: 240 LQFI------DGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEA 292
              +         YY+ L+ I+V  + L +  + F+  +D  GG++IDSGT +T+L + A
Sbjct: 293 TMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELSEDGTGGMIIDSGTTITYLEETA 352

Query: 293 YEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS-M 351
           ++ L++E   R+      S S    LC+    ++     P + FHF+ GA L L  ++ M
Sbjct: 353 FKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPNAAKNIAVPKLIFHFK-GADLELPGENYM 411

Query: 352 FYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
                    C+A+  ++        S+ G + QQ +NV +D+ ++  TF   +C  L
Sbjct: 412 VADSSTGVLCLAMGSSN------GMSIFGNVQQQNFNVLHDLEKETVTFVPTECGKL 462


>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 444

 Score =  167 bits (424), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 128/418 (30%), Positives = 193/418 (46%), Gaps = 25/418 (5%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEK--IRSHNTYQAQILPSNDISN 58
           +IHRDS  SPY  P E    RV   +  SI R  +  +   + S NT ++ ++ S     
Sbjct: 36  IIHRDSSRSPYYRPTETQFQRVANALRRSINRANHFNKPNLVASTNTAESTVIASQ---- 91

Query: 59  SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
             + ++ S+G PP      +DTGS ++W+ C PC DC  Q   IF PS+S +Y  +PC S
Sbjct: 92  GEYLMSYSVGTPPFQILGIVDTGSDIIWLQCQPCEDCYNQTTPIFDPSQSKTYKTLPCSS 151

Query: 119 EHCRYF-PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
             C+     A CS+    C YT  Y     +   +S E LT  +TD S++     V GCG
Sbjct: 152 NICQSVQSAASCSSNNDECEYTITYGDNSHSQGDLSVETLTLGSTDGSSVQFPKTVIGCG 211

Query: 178 FSTNRNFKFSGIFGLGIGRSSL------VSQLNSSFSYCIGSLHDPDYLHNKLILGDGAI 231
            +    F+  G   +G+G   +       S +   FSYC+  L       +KL  GD A+
Sbjct: 212 HNNKGTFQREGSGIVGLGGGPVSLISQLSSSIGGKFSYCLAPLFSQSNSSSKLNFGDEAV 271

Query: 232 IDEGDATPLQFID----GHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
           +          +     G Y++TLEA SV    ++   + F+     G ++IDSGT +T 
Sbjct: 272 VSGRGTVSTPIVPKNGLGFYFLTLEAFSVGDNRIEFGSSSFESSGGEGNIIIDSGTTLTI 331

Query: 288 LVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALE 347
           L ++ Y  L   V   +E E++   S   +LCY    SSD    P +  HF+ GA + L 
Sbjct: 332 LPEDDYLNLESAVADAIELERVEDPSKFLRLCYR-TTSSDELNVPVITAHFK-GADVELN 389

Query: 348 KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
             S F +      C A   + I        + G +AQQ   VGYD+ ++  +F+  DC
Sbjct: 390 PISTFIEVDEGVVCFAFRSSKIG------PIFGNLAQQNLLVGYDLVKQTVSFKPTDC 441


>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-1-like, partial [Cucumis sativus]
          Length = 716

 Score =  167 bits (423), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 124/405 (30%), Positives = 195/405 (48%), Gaps = 27/405 (6%)

Query: 20  HRVQRGINISIARLAYLQEKI--RSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTA 77
            R++RG+     RL  L   +   ++ T   Q+       N  F + ++IG PP      
Sbjct: 323 ERLRRGVARGKNRLHRLNAMVLAAANATVGDQVKAPVVAGNGEFLMKLAIGSPPRSFSAI 382

Query: 78  MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCI 137
           MDTGS L+W  C PC+ C  Q   IF P +SSS+  + C SE C   P + CS+    C 
Sbjct: 383 MDTGSDLIWTQCKPCQQCFDQSTPIFDPKQSSSFYKISCSSELCGALPTSTCSS--DGCE 440

Query: 138 YTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRN--FKFSGIFGLGIG 195
           Y   Y     T   ++ E  TF ++ E  I +  + FGCG   N +   + +G+ GLG G
Sbjct: 441 YLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLGFGCGNDNNGDGFSQGAGLVGLGRG 500

Query: 196 RSSLVSQLNS-SFSYCIGSLHDPDYLHNKLILGDGAII------DEGDATPL---QFIDG 245
             SLVSQL    F+YC+ ++ D     + L+LG  A I      DE   TPL        
Sbjct: 501 PLSLVSQLKEQKFAYCLTAIDDSK--PSSLLLGSLANITPKTSKDEMKTTPLIKNPSQPS 558

Query: 246 HYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRL 304
            YY++L+ ISV G  L I  + F+  DD  GGV+IDSGT +T++   A+ +L++E + ++
Sbjct: 559 FYYLSLQGISVGGTQLSIPKSTFELHDDGSGGVIIDSGTTITYVENSAFTSLKNEFIAQM 618

Query: 305 EGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS-MFYQPRPDAFCMA 363
                 S +    LC++    ++    P + FHF+ GA L L  ++ M    +    C+A
Sbjct: 619 NLPVDDSGTGGLDLCFNLPAGTNQVEVPKLTFHFK-GADLELPGENYMIGDSKAGLLCLA 677

Query: 364 VNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
           +  +         S+ G + QQ + V +D+  +  +F    C+ +
Sbjct: 678 IGSSR------GMSIFGNLQQQNFMVVHDLQEETLSFLPTQCDSI 716


>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
 gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score =  167 bits (423), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 129/418 (30%), Positives = 200/418 (47%), Gaps = 29/418 (6%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           LIHRDS LSP+ N  E    R+   +  SI+R+ +      +  + +A     +D++++ 
Sbjct: 36  LIHRDSPLSPFYNSEETDLQRINNALRRSISRVHHFDPIAAASVSPKAA---ESDVTSNR 92

Query: 61  --FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
             + +++S+G PP       DTGS L+W  C PC  C  Q+  +F P  S +Y    CD+
Sbjct: 93  GEYLMSLSLGTPPFKIMGIADTGSDLIWTQCKPCERCYKQVDPLFDPKSSKTYRDFSCDA 152

Query: 119 EHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
             C     + CS   + C Y   Y     T   V+++ +T  +T  S +     V GCG 
Sbjct: 153 RQCSLLDQSTCSG--NICQYQYSYGDRSYTMGNVASDTITLDSTTGSPVSFPKTVIGCGH 210

Query: 179 STNRNF--KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAII 232
             +  F  K SGI GLG G  SL+SQ+ SS    FSYC+  L       +KL  G  A++
Sbjct: 211 ENDGTFSDKGSGIVGLGAGPLSLISQMGSSVGGKFSYCLVPLSSRAGNSSKLNFGSNAVV 270

Query: 233 DEG--DATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
                 +TPL   + +   Y++TLEA+SV    +    +     +  G ++IDSGT +T 
Sbjct: 271 SGPGVQSTPLLSSETMSSFYFLTLEAMSVGNERIKFGDSSLGTGE--GNIIIDSGTTLTI 328

Query: 288 LVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALE 347
           +  + +  L   V  ++EG +    S    +CY    +SDLK  P +  HF  GA + L+
Sbjct: 329 VPDDFFSNLSTAVGNQVEGRRAEDPSGFLSVCYSA--TSDLK-VPAITAHFT-GADVKLK 384

Query: 348 KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
             + F Q   D  C+A   AS  +     S+ G +AQ  + V Y+I  K  +F+  DC
Sbjct: 385 PINTFVQVSDDVVCLAF--ASTTS---GISIYGNVAQMNFLVEYNIQGKSLSFKPTDC 437


>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 407

 Score =  167 bits (423), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 123/382 (32%), Positives = 188/382 (49%), Gaps = 23/382 (6%)

Query: 36  LQEKIRSHNTYQAQILPSN-DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD 94
           L  +  SH++Y+   + S     +  + + +SIG PP+  +   DTGS L+W  C PC  
Sbjct: 34  LIRRNSSHDSYKPSTIQSPVSAYDCEYLMELSIGTPPIKIYAEADTGSDLVWFQCIPCTK 93

Query: 95  CSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVST 154
           C  Q   +F P  SSSY  + C +E C     + CS  +  C YT  Y     T   ++ 
Sbjct: 94  CYKQQNPMFDPRSSSSYTNITCGTESCNKLDSSLCSTDQKTCNYTYSYADNSITQGVLAQ 153

Query: 155 EQLTFKNTDESTIHVQDVVFGCGFSTNR-NFKFSGIFGLGIGRSSLVSQLNSS------- 206
           E LT  +T    +  Q ++FGCG + +  N +  G+ GLG G  SL+SQ+ SS       
Sbjct: 154 ETLTLTSTTGEPVAFQGIIFGCGHNNSGFNDREMGLIGLGRGPLSLISQIGSSLGAGGNM 213

Query: 207 FSYCIGSLHDPDYLHNKLILGDGA-IIDEGD-ATPLQFIDGH-YYITLEAISVDGRMLDI 263
           FS C+   +    + +++  G G+ ++  G  +TPL   DG  Y+ TL  ISV+   L  
Sbjct: 214 FSQCLVPFNTDPSITSQMNFGKGSEVLGNGTVSTPLISKDGTGYFATLLGISVEDINLPF 273

Query: 264 NPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGI 323
           +        + G ++IDSGT +T+L +E Y  L ++V  ++  E  R   +  +LCY   
Sbjct: 274 SNGSSLGTITKGNILIDSGTTITYLPEEFYHRLIEQVRNKVALEPFRIDGY--ELCYQ-- 329

Query: 324 MSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMA 383
             ++L G PT+  HF GG  L L    MF   + D FC AV     N  YV +   G  A
Sbjct: 330 TPTNLNG-PTLTIHFEGGDVL-LTPAQMFIPVQDDNFCFAV--FDTNEEYVTY---GNYA 382

Query: 384 QQFYNVGYDIGRKQKTFQRMDC 405
           Q  Y +G+D+ R+  +F+  DC
Sbjct: 383 QSNYLIGFDLERQVVSFKATDC 404


>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
          Length = 445

 Score =  167 bits (422), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 133/422 (31%), Positives = 209/422 (49%), Gaps = 34/422 (8%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARL-AYLQEKIRSHNTYQAQILPSNDISNS 59
           LIHRDS +SP  NP      R+Q   + SI+R   +    + +  T +  I+P       
Sbjct: 37  LIHRDSPISPLYNPKNTYFDRLQSSFHRSISRANRFTPNSVSAAKTLEYDIIPGG----G 92

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSE 119
            +++ ISIG PP+      DTGS L+WV C PC++C  Q   IF P +SS+Y  V C++ 
Sbjct: 93  EYFMRISIGTPPIEVLVIADTGSDLIWVQCQPCQECYKQKSPIFNPKQSSTYRRVLCETR 152

Query: 120 HCRYF--PYARCSAYK--HRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
           +C         CSA+     C Y+  Y     T  +++TE+    +T+ S   +Q++ FG
Sbjct: 153 YCNALNSDMRACSAHGFFKACGYSYSYGDHSFTMGYLATERFIIGSTNNS---IQELAFG 209

Query: 176 CGFSTNRNF--KFSGIFGLGIGRSSLVSQL----NSSFSYC-IGSLHDPDYLHNKLILGD 228
           CG S   NF    SGI GLG G  SL+SQL    ++ FSYC +  L   ++   K++ GD
Sbjct: 210 CGNSNGGNFDEVGSGIVGLGGGSLSLISQLGTKIDNKFSYCLVPILEKSNFSLGKIVFGD 269

Query: 229 GAIIDEGD---ATPL--QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGT 283
            + I   D   +TPL  +  +  YY+TLEAISV    L    +    +   G ++IDSGT
Sbjct: 270 NSFISGSDTYVSTPLVSKEPETFYYLTLEAISVGNERLAYENSRNDGNVEKGNIIIDSGT 329

Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAK 343
            +T+L  + Y  L   +   +EGE++   +    +C+   +  +L   P +  HF   A 
Sbjct: 330 TLTFLDSKLYNKLELVLEKAVEGERVSDPNGIFSICFRDKIGIEL---PIITVHFT-DAD 385

Query: 344 LALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
           + L+  + F +   D  C  + P++        ++ G +AQ  + VGYD+ +   +F   
Sbjct: 386 VELKPINTFAKAEEDLLCFTMIPSN------GIAIFGNLAQMNFLVGYDLDKNCVSFMPT 439

Query: 404 DC 405
           DC
Sbjct: 440 DC 441


>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
 gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 461

 Score =  165 bits (418), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 128/419 (30%), Positives = 202/419 (48%), Gaps = 39/419 (9%)

Query: 15  NENPAHRVQRGINISIARL------AYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIG 68
           N     ++QRGIN    RL      A L    +  +T   +  P++  S   F + +SIG
Sbjct: 57  NLTKIQKIQRGINRGFHRLNRLGAVAVLAVASKPDDTNNIKA-PTHGGSGE-FLMELSIG 114

Query: 69  QPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYAR 128
            P V     +DTGS L+W  C PC +C  Q   IF P +SSSY+ V C S  C   P + 
Sbjct: 115 NPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCNALPRSN 174

Query: 129 CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRN--FKF 186
           C+  K  C Y   Y     T   ++TE  TF+  DE++I    + FGCG     +   + 
Sbjct: 175 CNEDKDACEYLYTYGDYSSTRGLLATETFTFE--DENSI--SGIGFGCGVENEGDGFSQG 230

Query: 187 SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDY--------LHNKLILGDGAIIDEGDA 237
           SG+ GLG G  SL+SQL  + FSYC+ S+ D +         L + ++   GA +D G+ 
Sbjct: 231 SGLVGLGRGPLSLISQLKETKFSYCLTSIEDSEASSSLFIGSLASGIVNKTGASLD-GEV 289

Query: 238 TPLQFI------DGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVK 290
           T    +         YY+ L+ I+V  + L +  + F+  +D  GG++IDSGT +T+L +
Sbjct: 290 TKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGMIIDSGTTITYLEE 349

Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS 350
            A++ L++E   R+      S S    LC+    ++     P + FHF+ GA L L  ++
Sbjct: 350 TAFKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPDAAKNIAVPKMIFHFK-GADLELPGEN 408

Query: 351 -MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
            M         C+A+  ++        S+ G + QQ +NV +D+ ++  +F   +C  L
Sbjct: 409 YMVADSSTGVLCLAMGSSN------GMSIFGNVQQQNFNVLHDLEKETVSFVPTECGKL 461


>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
           Full=Nepenthesin-I; Flags: Precursor
 gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
          Length = 437

 Score =  164 bits (416), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 119/356 (33%), Positives = 170/356 (47%), Gaps = 27/356 (7%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + +N+SIG P  P    MDTGS L+W  C PC  C  Q   IF P  SSS++ +PC S+ 
Sbjct: 95  YLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQL 154

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C+      CS   + C YT  Y  G ET   + TE LTF      ++ + ++ FGCG   
Sbjct: 155 CQALSSPTCS--NNFCQYTYGYGDGSETQGSMGTETLTFG-----SVSIPNITFGCG-EN 206

Query: 181 NRNF---KFSGIFGLGIGRSSLVSQLN-SSFSYC---IGSLHDPDYLHNKLILGDGAIID 233
           N+ F     +G+ G+G G  SL SQL+ + FSYC   IGS    + L     L +     
Sbjct: 207 NQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTPIGSSTPSNLLLGS--LANSVTAG 264

Query: 234 EGDATPLQF--IDGHYYITLEAISVDGRMLDINPNIF--KRDDSGGGVMIDSGTDVTWLV 289
             + T +Q   I   YYITL  +SV    L I+P+ F    ++  GG++IDSGT +T+ V
Sbjct: 265 SPNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFV 324

Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKD 349
             AY+++R E + ++    +   S    LC+           PT   HF GG  L L  +
Sbjct: 325 NNAYQSVRQEFISQINLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGG-DLELPSE 383

Query: 350 SMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           + F  P     C+A+  +S        S+ G + QQ   V YD G    +F    C
Sbjct: 384 NYFISPSNGLICLAMGSSS-----QGMSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434


>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
          Length = 437

 Score =  164 bits (415), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 119/358 (33%), Positives = 172/358 (48%), Gaps = 31/358 (8%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + +N+SIG P  P    MDTGS L+W  C PC  C  Q   IF P  SSS++ +PC S+ 
Sbjct: 95  YLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQL 154

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C+      CS   + C YT  Y  G ET   + TE LTF      ++ + ++ FGCG   
Sbjct: 155 CQALQSPTCS--NNSCQYTYGYGDGSETQGSMGTETLTFG-----SVSIPNITFGCG-EN 206

Query: 181 NRNF---KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGD 236
           N+ F     +G+ G+G G  SL SQL+ + FSYC+  +       + L+LG  A      
Sbjct: 207 NQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTPIGSST--SSTLLLGSLANSVTAG 264

Query: 237 ATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFK--RDDSGGGVMIDSGTDVTWLV 289
           +     I+       YYITL  +SV    L I+P++FK   ++  GG++IDSGT +T+  
Sbjct: 265 SPNTTLIESSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGIIIDSGTTLTYFA 324

Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF--PTVRFHFRGGAKLALE 347
             AY+A+R   + ++    +   S    LC+   M SD      PT   HF GG  L L 
Sbjct: 325 DNAYQAVRQAFISQMNLSVVNGSSSGFDLCFQ--MPSDQSNLQIPTFVMHFDGG-DLVLP 381

Query: 348 KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
            ++ F  P     C+A+  +S        S+ G + QQ   V YD G    +F    C
Sbjct: 382 SENYFISPSNGLICLAMGSSS-----QGMSIFGNIQQQNLLVVYDTGNSVVSFLFAQC 434


>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 447

 Score =  164 bits (415), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 137/430 (31%), Positives = 201/430 (46%), Gaps = 37/430 (8%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           LIHRDS LSP  NP      R+      SI+R   L   I S    Q+ ++ ++      
Sbjct: 30  LIHRDSPLSPLYNPKNTVTDRLNAAFLRSISRSRRLNN-ILSQTDLQSGLIGAD----GE 84

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           F+++I+IG PP+  F   DTGS L WV C PC+ C  + G IF   +SS+Y   PCDS +
Sbjct: 85  FFMSITIGTPPMKVFAIADTGSDLTWVQCKPCQQCYKENGPIFDKKKSSTYKSEPCDSRN 144

Query: 121 CRYFPYAR--CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
           C     +   C   K+ C Y   Y     +   V+TE ++  +   S +     VFGCG+
Sbjct: 145 CHALSSSERGCDESKNVCKYRYSYGDQSFSKGDVATETISIDSASGSPVSFPGTVFGCGY 204

Query: 179 STNRNFKFSGIFGLGIGRS--SLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAI- 231
           +    F  +G   +G+G    SL+SQL SS    FSYC+          + + LG  +I 
Sbjct: 205 NNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCLSHKSATTNGTSVINLGTNSIP 264

Query: 232 ----IDEGD-ATPL--QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG------GGVM 278
                D G  +TPL  +    +YY+TLEAISV  + +    + +  +D G      G ++
Sbjct: 265 SSLSKDSGVISTPLVDKEPRTYYYLTLEAISVGKKKIPYTGSSYNPNDGGIFSETSGNII 324

Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLK-GFPTVRFH 337
           IDSGT +T L    ++     V   + G   +  S P  L  H   S   + G P +  H
Sbjct: 325 IDSGTTLTLLDSGFFDKFGAAVEELVTG--AKRVSDPQGLLSHCFKSGSAEIGLPEITVH 382

Query: 338 FRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
           F  GA + L   + F +   D  C+++ P +    Y NF      AQ  + VGYD+  + 
Sbjct: 383 FT-GADVRLSPINAFVKVSEDMVCLSMVPTTEVAIYGNF------AQMDFLVGYDLETRT 435

Query: 398 KTFQRMDCEV 407
            +FQRMDC  
Sbjct: 436 VSFQRMDCSA 445


>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
 gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
          Length = 441

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 124/426 (29%), Positives = 193/426 (45%), Gaps = 34/426 (7%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDI---S 57
           L H D+  S Y  P       + R I  S AR+A LQ    S       I  +  +   S
Sbjct: 32  LTHVDAGTS-YTKP-----QLLSRAIARSKARVAALQSAAVSPAPVADPITAARVLVTAS 85

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
           +  + V+++IG PP+     MDTGS L+W  C PC  C+ Q    F   RS++Y  +PC 
Sbjct: 86  SGEYLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCAAQPTPYFDVKRSATYRALPCR 145

Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
           S  C       C  +K  C+Y   Y     T+  ++ E  TF     + +   ++ FGCG
Sbjct: 146 SSRCAALSSPSC--FKKMCVYQYYYGDTASTAGVLANETFTFGAASSTKVRAANISFGCG 203

Query: 178 -FSTNRNFKFSGIFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
             +       SG+ G G G  SLVSQL  S FSYC+ S   P    ++L  G  A ++  
Sbjct: 204 SLNAGELANSSGMVGFGRGPLSLVSQLGPSRFSYCLTSYLSPT--PSRLYFGVFANLNST 261

Query: 236 D---ATPLQ--------FIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGT 283
           +    +P+Q         +   Y+++++ IS+  + L I+P +F  +D G GGV+IDSGT
Sbjct: 262 NTSSGSPVQSTPFVINPALPNMYFLSVKGISLGTKRLPIDPLVFAINDDGTGGVIIDSGT 321

Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLK-GFPTVRFHFRGGA 342
            +TWL ++AYEA+R  +   +    M         C+      ++    P   FHF G  
Sbjct: 322 SITWLQQDAYEAVRRGLASTIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDFVFHFDGAN 381

Query: 343 KLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
                ++ M         C+A+ P S+       ++IG   QQ  ++ YDI     +F  
Sbjct: 382 MTLPPENYMLIASTTGYLCLAMAPTSVG------TIIGNYQQQNLHLLYDIANSFLSFVP 435

Query: 403 MDCEVL 408
             C+++
Sbjct: 436 APCDII 441


>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
          Length = 436

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 125/398 (31%), Positives = 186/398 (46%), Gaps = 29/398 (7%)

Query: 20  HRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMD 79
            R+QR +     RL  L  K  S   ++  +       N  F +N++IG P       MD
Sbjct: 59  ERLQRAVKRGRLRLQRLSAKTAS---FEPSVEAPVHAGNGEFLMNLAIGTPAETYSAIMD 115

Query: 80  TGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYT 139
           TGS L+W  C PC+ C  Q   IF P +SSS++ +PC S+ C   P + CS     C Y 
Sbjct: 116 TGSDLIWTQCKPCKVCFDQPTPIFDPEKSSSFSKLPCSSDLCVALPISSCS---DGCEYR 172

Query: 140 QLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFS---GIFGLGIGR 196
             Y     T   ++TE  TF +       V  + FGCG   NR   +S   G+ GLG G 
Sbjct: 173 YSYGDHSSTQGVLATETFTFGDA-----SVSKIGFGCG-EDNRGRAYSQGAGLVGLGRGP 226

Query: 197 SSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLE 252
            SL+SQL    FSYC+ S+ D   + + L++G  A +     TPL         YY++LE
Sbjct: 227 LSLISQLGVPKFSYCLTSIDDSKGI-STLLVGSEATVKSAIPTPLIQNPSRPSFYYLSLE 285

Query: 253 AISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS 311
            ISV   +L I  + F  +DD  GG++IDSGT +T+L   A+ AL+ E + +++ +   S
Sbjct: 286 GISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTITYLKDSAFAALKKEFISQMKLDVDAS 345

Query: 312 YSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP-DAFCMAVNPASIN 370
            S   +LC+           P + FHF  G  L L K++   +       C+ +  +S  
Sbjct: 346 GSTELELCFTLPPDGSPVDVPQLVFHFE-GVDLKLPKENYIIEDSALRVICLTMGSSS-- 402

Query: 371 NRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
                 S+ G   QQ   V +D+ ++  +F    C  L
Sbjct: 403 ----GMSIFGNFQQQNIVVLHDLEKETISFAPAQCNQL 436


>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
          Length = 436

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 125/398 (31%), Positives = 186/398 (46%), Gaps = 29/398 (7%)

Query: 20  HRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMD 79
            R+QR +     RL  L  K  S   ++  +       N  F +N++IG P       MD
Sbjct: 59  ERLQRAVKRGRLRLQRLSAKTAS---FEPSVEAPVHAGNGEFLMNLAIGTPAETYSAIMD 115

Query: 80  TGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYT 139
           TGS L+W  C PC+ C  Q   IF P +SSS++ +PC S+ C   P + CS     C Y 
Sbjct: 116 TGSDLIWTQCKPCKVCFDQPTPIFDPEKSSSFSKLPCSSDLCVALPISSCS---DGCEYR 172

Query: 140 QLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFS---GIFGLGIGR 196
             Y     T   ++TE  TF +       V  + FGCG   NR   +S   G+ GLG G 
Sbjct: 173 YSYGDHSSTQGVLATETFTFGDAS-----VSKIGFGCG-EDNRGRAYSQGAGLVGLGRGP 226

Query: 197 SSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLE 252
            SL+SQL    FSYC+ S+ D   + + L++G  A +     TPL         YY++LE
Sbjct: 227 LSLISQLGVPKFSYCLTSIDDSKGI-STLLVGSEATVKSAIPTPLIQNPSRPSFYYLSLE 285

Query: 253 AISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS 311
            ISV   +L I  + F  +DD  GG++IDSGT +T+L   A+ AL+ E + +++ +   S
Sbjct: 286 GISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTITYLKDNAFAALKKEFISQMKLDVDAS 345

Query: 312 YSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP-DAFCMAVNPASIN 370
            S   +LC+           P + FHF  G  L L K++   +       C+ +  +S  
Sbjct: 346 GSTELELCFTLPPDGSPVEVPQLVFHFE-GVDLKLPKENYIIEDSALRVICLTMGSSS-- 402

Query: 371 NRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
                 S+ G   QQ   V +D+ ++  +F    C  L
Sbjct: 403 ----GMSIFGNFQQQNIVVLHDLEKETISFAPAQCNQL 436


>gi|356557010|ref|XP_003546811.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 437

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 127/421 (30%), Positives = 201/421 (47%), Gaps = 38/421 (9%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           LIHRDS LSP+ +P+  P+ R+      S +RL  +   +  +N  ++ ++P N      
Sbjct: 36  LIHRDSPLSPFYDPSLTPSERITNAAFRSSSRLNRVSHFLDENNLPESLLIPEN----GE 91

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + + + IG PPV +    DTGS L+WV C PC++C PQ   +F P +SS++    CDS+ 
Sbjct: 92  YLMTLYIGTPPVERLAIADTGSDLIWVQCSPCQNCFPQDTPLFEPLKSSTFKAATCDSQP 151

Query: 121 CRYFPYARCSAYK-HRCIYTQLYLIGPETSVFVSTEQLTFKNT-DESTIHVQDVVFGCGF 178
           C   P ++    K  +CIY+  Y     T   V TE L+F +T D  T+     +FGCG 
Sbjct: 152 CTSVPPSQRQCGKVGQCIYSYSYGDKSFTVGVVGTETLSFGSTGDAQTVSFPSSIFGCGV 211

Query: 179 STNRNFKFS--------GIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGA 230
             N  F  S           G     S L  Q+   FSYC+  L       +KL  G  A
Sbjct: 212 YNNFTFHTSDKVTGLVGLGGGPLSLVSQLGPQIGYKFSYCL--LPFSSNSTSKLKFGSEA 269

Query: 231 IIDEGD--ATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDV 285
           I+      +TPL         Y++ LEA+++  +++        R D  G ++IDSGT +
Sbjct: 270 IVTTNGVVSTPLIIKPLFPSFYFLNLEAVTIGQKVVPTG-----RTD--GNIIIDSGTVL 322

Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLA 345
           T+L +  Y      +   L  E  +   +P K C+      D+   P + F F  GA +A
Sbjct: 323 TYLEQTFYNNFVASLQEVLSVESAQDLPFPFKFCFP---YRDMT-IPVIAFQFT-GASVA 377

Query: 346 LEKDSMFYQPRP-DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
           L+  ++  + +  +  C+AV P+S++      S+ G +AQ  + V YD+  K+ +F   D
Sbjct: 378 LQPKNLLIKLQDRNMLCLAVVPSSLS----GISIFGNVAQFDFQVVYDLEGKKVSFAPTD 433

Query: 405 C 405
           C
Sbjct: 434 C 434


>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 443

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 128/413 (30%), Positives = 200/413 (48%), Gaps = 39/413 (9%)

Query: 28  ISIARLAYLQEKIRSHN-TYQAQILPSNDISNSLF----------------YVNISIGQP 70
           ++I  L ++   I +HN  +  +++P N  S  LF                 + +SIG P
Sbjct: 10  LAILLLVFIFPSIEAHNGRFTVKLIPRNS-SQVLFNRITAQTPVSVHHYDYLMELSIGTP 68

Query: 71  PVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCS 130
           PV  +  +DTGS L+W+ C PC +C  QL  +F P  SS+Y+ +   SE C       CS
Sbjct: 69  PVKTYAQVDTGSDLIWLQCIPCTNCYKQLNPMFDPQSSSTYSNIAYGSESCSKLYSTSCS 128

Query: 131 AYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF--KFSG 188
             ++ C YT  Y     T   ++ E LT  +T    + ++ V+FGCG + N  F  K  G
Sbjct: 129 PDQNNCNYTYSYEDDSITEGVLAQETLTLTSTTGKPVALKGVIFGCGHNNNGVFNDKEMG 188

Query: 189 IFGLGIGRSSLVSQLNSS-----FSYCIGSLHDPDYLHNKLILGDGA-IIDEG-DATPLQ 241
           I GLG G  SLVSQ+ SS     FS C+   H    + + +  G G+ ++  G  +TPL 
Sbjct: 189 IIGLGRGPLSLVSQIGSSFGGKMFSQCLVPFHTNPSITSPMSFGKGSEVLGNGVVSTPLV 248

Query: 242 FIDGH---YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRD 298
             + H   Y++TL  ISV+   L  N        + G ++IDSGT  T L ++ Y  L +
Sbjct: 249 SKNTHQAFYFVTLLGISVEDINLPFNDGSSLEPITKGNMVIDSGTPTTLLPEDFYHRLVE 308

Query: 299 EVMIRLEGEQMR-SYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP 357
           EV  ++  + +    +   +LCY     ++LKG  T+  HF  GA + L    +F   + 
Sbjct: 309 EVRNKVALDPIPIDPTLGYQLCYR--TPTNLKG-TTLTAHFE-GADVLLTPTQIFIPVQD 364

Query: 358 DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVLDD 410
             FC A   ++ +N Y    + G  AQ  Y +G+D+ ++  +F+  DC  L D
Sbjct: 365 GIFCFAFT-STFSNEY---GIYGNHAQSNYLIGFDLEKQLVSFKATDCTNLQD 413


>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 416

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 128/420 (30%), Positives = 195/420 (46%), Gaps = 51/420 (12%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           LIHRDS  SP   P +N    +      SI R  +   K    NT Q+ ++P +      
Sbjct: 32  LIHRDSSKSPLYQPTQNKYQHIVNAARRSINRANHFY-KTALTNTPQSTVIPDH----GE 86

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + +  S+G PP   +   DTGS ++W+ C PC++C  Q    F PS+SS+Y  +PC S+ 
Sbjct: 87  YLMTYSVGTPPFKLYGIADTGSDIVWLQCEPCKECYNQTTPKFKPSKSSTYKNIPCSSDL 146

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C+                             +S + LT +++    I     V GCG   
Sbjct: 147 CKSGQQGN-----------------------LSVDTLTLESSTGHPISFPKTVIGCGTDN 183

Query: 181 NRNFK--FSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDE 234
             +F+   SGI GLG G +SL++QL SS    FSYC+          +KL  GD A++  
Sbjct: 184 TVSFEGASSGIVGLGGGPASLITQLGSSIDAKFSYCLLPNPVESNTTSKLNFGDTAVV-S 242

Query: 235 GD---ATPLQFIDG--HYYITLEAISVDGRMLDINPNIFKRDDSG---GGVMIDSGTDVT 286
           GD   +TP+   D    YY+TLEA SV  + ++     F+   +G   G ++IDSGT +T
Sbjct: 243 GDGVVSTPIVKKDPIVFYYLTLEAFSVGNKRIE-----FEGSSNGGHEGNIIIDSGTTLT 297

Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
            +  + Y  L   V+  ++ +++   +    LCY   ++SD   FP +  HF+ GA + L
Sbjct: 298 VIPTDVYNNLESAVLELVKLKRVNDPTRLFNLCYS--VTSDGYDFPIITTHFK-GADVKL 354

Query: 347 EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
              S F        C+A    S        S+ G +AQQ   VGYD+ +K  +F+  DC 
Sbjct: 355 HPISTFVDVADGIVCLAFATTSAFIPSDVVSIFGNLAQQNLLVGYDLQQKIVSFKPTDCS 414


>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
          Length = 542

 Score =  162 bits (410), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 117/414 (28%), Positives = 195/414 (47%), Gaps = 41/414 (9%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           LIHRDS  SP+ +P++  A R+      S++R+   +    + +  Q++I+PS       
Sbjct: 36  LIHRDSPHSPFFDPSKTQAERLTDAFRRSVSRVGRFRPTAMTSDGIQSRIVPSA----GE 91

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + +N+ IG PPVP    +DTGS L W  C PC  C  Q+  +F P  SS+Y    C +  
Sbjct: 92  YLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPLFDPKNSSTYRDSSCGTSF 151

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C      R  + + +C +   Y  G  T   +++E LT  +T    +      FGCG S+
Sbjct: 152 CLALGKDRSCSKEKKCTFRYSYADGSFTGGNLASETLTVDSTAGKPVSFPGFAFGCGHSS 211

Query: 181 NRNF--KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILG-DGAIID 233
              F    SGI GLG G  SL+SQL S+    FSYC+  +     + +++  G  G +  
Sbjct: 212 GGIFDKSSSGIVGLGGGELSLISQLKSTINGLFSYCLLPVSTDSSISSRINFGASGRVSG 271

Query: 234 EGD-ATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEA 292
            G  +TPL+     Y    E                      G +++DSGT  T+L +E 
Sbjct: 272 YGTVSTPLRLPYKGYSKKTEV-------------------EEGNIIVDSGTTYTFLPQEF 312

Query: 293 YEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMF 352
           Y  L   V   ++G+++R  +    LCY+   ++++   P +  HF+  A + L+  + F
Sbjct: 313 YSKLEKSVANSIKGKRVRDPNGIFSLCYN--TTAEINA-PIITAHFK-DANVELQPLNTF 368

Query: 353 YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
            + + D  C  V P S      +  ++G +AQ  + VG+D+ +K+   ++ + E
Sbjct: 369 MRMQEDLVCFTVAPTS------DIGVLGNLAQVNFLVGFDLRKKRGFSKKAEVE 416



 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 39/139 (28%), Positives = 72/139 (51%), Gaps = 9/139 (6%)

Query: 269 KRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDL 328
           K +   G +++DSGT  T+L  E Y  L + V   ++G+++R  +    LCY+   + D 
Sbjct: 412 KAEVEEGNIIVDSGTTYTYLPLEFYVKLEESVAHSIKGKRVRDPNGISSLCYN--TTVDQ 469

Query: 329 KGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYN 388
              P +  HF+  A + L+  + F + + D  C  V P S      +  ++G +AQ  + 
Sbjct: 470 IDAPIITAHFK-DANVELQPWNTFLRMQEDLVCFTVLPTS------DIGILGNLAQVNFL 522

Query: 389 VGYDIGRKQKTFQRMDCEV 407
           VG+D+ +K+ +F+  DC +
Sbjct: 523 VGFDLRKKRVSFKAADCTL 541


>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
          Length = 449

 Score =  162 bits (410), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 132/438 (30%), Positives = 208/438 (47%), Gaps = 55/438 (12%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           LIHRDS LSP + PN   + R+Q     +I+R +        H  +Q  +LPS       
Sbjct: 31  LIHRDSPLSPLHTPNLTFSDRLQASFLRAISRQS-------RHVDFQTDLLPSG----GE 79

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + +N+SIG PP P     DTGS L W+   PC  C PQ G IF PS S+++  +PC +  
Sbjct: 80  YMMNLSIGTPPFPILAIADTGSDLTWLQSKPCDQCYPQKGPIFDPSNSTTFHKLPCTTAP 139

Query: 121 CRYF-PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
           C      AR       C YT  Y     T+ +++++ +T  N   +++ +++V FGCG  
Sbjct: 140 CNALDESARSCTDPTTCGYTYSYGDHSYTTGYLASDTVTVGN---ASVQIRNVAFGCGTR 196

Query: 180 TNRNF--KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDP-------DYLHNKLIL 226
              NF  + SGI GLG G  S VSQL  +    FSYC+  L +            ++++ 
Sbjct: 197 NGGNFDEQGSGIVGLGGGNLSFVSQLGDTIGKKFSYCLLPLENEISSQPSDSPATSRIVF 256

Query: 227 GDGAIIDEGDATPLQFI---------DGHYYITLEAISVDGRMLDINPNIFKRD--DSG- 274
           GD  +        + F            +YY+T+EAI+V  + L  + +  K    DSG 
Sbjct: 257 GDNPVFSSSSTNGVVFATTPLVNKEPSTYYYLTIEAITVGRKKLLYSSSSSKTASYDSGS 316

Query: 275 ------GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD-KLCYHGIMSSD 327
                 G ++IDSGT +T+L +E Y AL   ++  ++ E++         LC+      +
Sbjct: 317 KSSVEEGNIIIDSGTTLTFLEEEFYGALEAALVEEIKMERVNDVKNSMFSLCFKS--GKE 374

Query: 328 LKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFY 387
               P ++ HFRGGA + L+  + F +      C  + P +      +  + G +AQ  +
Sbjct: 375 EVELPLMKVHFRGGADVELKPVNTFVRAEEGLVCFTMLPTN------DVGIYGNLAQMNF 428

Query: 388 NVGYDIGRKQKTFQRMDC 405
            VGYD+G++  +F   DC
Sbjct: 429 VVGYDLGKRTVSFLPADC 446


>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
          Length = 516

 Score =  162 bits (409), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 123/359 (34%), Positives = 171/359 (47%), Gaps = 32/359 (8%)

Query: 67  IGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPY 126
           IG P +     +DTGS L+W  C PC DC  Q   +F PS SS+YA VPC S  C   P 
Sbjct: 173 IGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDLPT 232

Query: 127 ARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF 186
           ++C++   +C YT  Y     T   ++TE  T   +      +  VVFGCG  TN    F
Sbjct: 233 SKCTS-ASKCGYTYTYGDSSSTQGVLATETFTLAKS-----KLPGVVFGCG-DTNEGDGF 285

Query: 187 S---GIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEG------- 235
           S   G+ GLG G  SLVSQL    FSYC+ SL D +  ++ L+LG  A I E        
Sbjct: 286 SQGAGLVGLGRGPLSLVSQLGLDKFSYCLTSLDDTN--NSPLLLGSLAGISEASAAASSV 343

Query: 236 DATPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKE 291
             TPL         YY++L+AI+V    + +  + F  +DD  GGV++DSGT +T+L  +
Sbjct: 344 QTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSITYLEVQ 403

Query: 292 AYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS-DLKGFPTVRFHFRGGAKLALEKDS 350
            Y AL+     ++             LC+       D    P + FHF GGA L L  ++
Sbjct: 404 GYRALKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGADLDLPAEN 463

Query: 351 -MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
            M       A C+ V    + +R    S+IG   QQ +   YD+G    +F  + C  L
Sbjct: 464 YMVLDGGSGALCLTV----MGSR--GLSIIGNFQQQNFQFVYDVGHDTLSFAPVQCNKL 516


>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 468

 Score =  161 bits (408), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 118/368 (32%), Positives = 171/368 (46%), Gaps = 31/368 (8%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
           N  F +++SIG P +     +DTGS L+W  C PC +C  Q   +F PS SS+Y+ +PC 
Sbjct: 115 NGEFLMDMSIGTPALAYAAIVDTGSDLVWTQCKPCVECFNQSTPVFDPSSSSTYSTLPCS 174

Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
           S  C   P + C++    C YT  Y     T   ++ E  T   T      +  V FGCG
Sbjct: 175 SSLCSDLPTSTCTSAAKDCGYTYTYGDASSTQGVLAAETFTLAKT-----KLPGVAFGCG 229

Query: 178 FSTNRNFKFS---GIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAII- 232
             TN    F+   G+ GLG G  SLVSQL    FSYC+ SL D     + L+LG  A I 
Sbjct: 230 -DTNEGDGFTQGAGLVGLGRGPLSLVSQLGLGKFSYCLTSLDDTS--KSPLLLGSLAAIS 286

Query: 233 -DEGDATPLQFI--------DGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSG 282
            D   A  +Q             YY+TL+A++V    + +  + F  +DD  GGV++DSG
Sbjct: 287 TDTASAAAIQTTPLIKNPSQPSFYYVTLKALTVGSTRIPLPGSAFAVQDDGTGGVIVDSG 346

Query: 283 TDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS-DLKGFPTVRFHFRGG 341
           T +T+L  + Y  L+     +++       +    LC+    S  D    P +  HF GG
Sbjct: 347 TSITYLELQGYRPLKKAFAAQMKLPVADGSAVGLDLCFKAPASGVDDVEVPKLVLHFDGG 406

Query: 342 AKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
           A L L  ++ M       A C+ V    + +R    S+IG   QQ     YD+ +   +F
Sbjct: 407 ADLDLPAENYMVLDSASGALCLTV----MGSR--GLSIIGNFQQQNIQFVYDVDKDTLSF 460

Query: 401 QRMDCEVL 408
             + C  L
Sbjct: 461 APVQCAKL 468


>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 431

 Score =  161 bits (407), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 127/418 (30%), Positives = 198/418 (47%), Gaps = 33/418 (7%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           +IHRDS  SP+    E    RV   +  S+ R  +  +     N  ++   P   + +  
Sbjct: 31  IIHRDSSRSPFYRATETQFQRVTNAVRRSMNRANHFNQISVYSNAVES---PVTLLDDGD 87

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + ++ S+G PP P +  +DT S ++WV C  C  C      +F PS S +Y  +PC S  
Sbjct: 88  YLMSYSLGTPPFPVYGIVDTASDIIWVQCQLCETCYNDTSPMFDPSYSKTYKNLPCSSTT 147

Query: 121 CRYFPYARCSAYKHR-CIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
           C+      CS+ + + C +T  Y  G  +   +  E +T  + ++  +H    V GC  +
Sbjct: 148 CKSVQGTSCSSDERKICEHTVNYKDGSHSQGDLIVETVTLGSYNDPFVHFPRTVIGCIRN 207

Query: 180 TNRNFKFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAII--D 233
           TN +F   GI GLG G  SLV QL+SS    FSYC+  + D     +KL  GD A++  D
Sbjct: 208 TNVSFDSIGIVGLGGGPVSLVPQLSSSISKKFSYCLAPISDRS---SKLKFGDAAMVSGD 264

Query: 234 EGDATPLQFID--GHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKE 291
              +T + F D    YY+TLEA SV    ++   +   R    G ++IDSGT  T L  +
Sbjct: 265 GTVSTRIVFKDWKKFYYLTLEAFSVGNNRIEFRSSS-SRSSGKGNIIIDSGTTFTVLPDD 323

Query: 292 AYEALRDEV--MIRLEGEQ--MRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALE 347
            Y  L   V  +++LE  +  ++ +S    LCY    + D    P +  HF  GA + L 
Sbjct: 324 VYSKLESAVADVVKLERAEDPLKQFS----LCYKS--TYDKVDVPVITAHF-SGADVKLN 376

Query: 348 KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
             + F        C+A   +       + ++ G +AQQ + VGYD+ RK  +F+  DC
Sbjct: 377 ALNTFIVASHRVVCLAFLSSQ------SGAIFGNLAQQNFLVGYDLQRKIVSFKPTDC 428


>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
 gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
          Length = 452

 Score =  161 bits (407), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 118/368 (32%), Positives = 170/368 (46%), Gaps = 29/368 (7%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
           N  F ++++IG P +     +DTGS L+W  C PC DC  Q   +F PS SS+YA VPC 
Sbjct: 97  NGEFLMDVAIGTPALSYAAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCS 156

Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
           S  C   P + C++   +C YT  Y     T   +++E  T     +    +  V FGCG
Sbjct: 157 SALCSDLPTSTCTS-ASKCGYTYTYGDASSTQGVLASETFTLGKEKK---KLPGVAFGCG 212

Query: 178 FSTNRNFKFS---GIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIID 233
             TN    F+   G+ GLG G  SLVSQL    FSYC+ SL D D   + L+LG  A   
Sbjct: 213 -DTNEGDGFTQGAGLVGLGRGPLSLVSQLGLDKFSYCLTSLDDGDG-KSPLLLGGSAAAI 270

Query: 234 EG-------DATPLQFIDGH---YYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSG 282
                      TPL         YY++L  ++V    + +  + F  +DD  GGV++DSG
Sbjct: 271 SESAATAPVQTTPLVKNPSQPSFYYVSLTGLTVGSTRITLPASAFAIQDDGTGGVIVDSG 330

Query: 283 TDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS-DLKGFPTVRFHFRGG 341
           T +T+L  + Y AL+   + ++    +        LC+ G     D    P +  HF GG
Sbjct: 331 TSITYLELQGYRALKKAFVAQMALPTVDGSEIGLDLCFQGPAKGVDEVQVPKLVLHFDGG 390

Query: 342 AKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
           A L L  ++ M       A C+ V P+         S+IG   QQ +   YD+     +F
Sbjct: 391 ADLDLPAENYMVLDSASGALCLTVAPSR------GLSIIGNFQQQNFQFVYDVAGDTLSF 444

Query: 401 QRMDCEVL 408
             + C  L
Sbjct: 445 APVQCNKL 452


>gi|297741705|emb|CBI32837.3| unnamed protein product [Vitis vinifera]
          Length = 455

 Score =  161 bits (407), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 136/410 (33%), Positives = 196/410 (47%), Gaps = 33/410 (8%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQ-AQILPSNDISN- 58
           LIH  S  SPY N       +    +  +++R AYL  + R     Q A  +P   I + 
Sbjct: 47  LIHIHSPSSPYKNVKAESLAK-DTALESTLSRHAYL--RARQQKALQPADFVPPPLIRDK 103

Query: 59  SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
           S F  N+SIG PP   +  +DTGS L W+ C PC  C  Q   I+  ++S SY  + C+ 
Sbjct: 104 SAFLANLSIGNPPTNVYVVLDTGSDLFWIQCEPCDVCYKQKDPIYNRTKSDSYTEMLCNE 163

Query: 119 EHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
             C         +    C+Y   Y  G  TS  +S E++ F +          V FGCG 
Sbjct: 164 PPCLSLGREGQCSDSGSCLYQTSYADGSRTSGLLSYEKVAFTSHYSDEDKTAQVGFGCGL 223

Query: 179 STNRNFKFSG----IFGLGIGRSSLVSQLNS------SFSYCIGSLHDPDYLHNKLILGD 228
             N NF  S     + GLG G  SLVSQL++      SF+YC G+L +P+     L+ GD
Sbjct: 224 Q-NLNFVTSSRDGGVLGLGPGLVSLVSQLSAIGKVSKSFAYCFGNLSNPN-AGGFLVFGD 281

Query: 229 GAIIDEGDATPLQFIDGHYYITLEAI--SVDGRMLDINPNIFKRD-DSGGGVMIDSGTDV 285
              ++ GD TP+  I   YY+ L  I   V+   LDIN + F+R  D  GGV+IDSG+ +
Sbjct: 282 ATYLN-GDMTPM-VIAEFYYVNLLGIGLGVEEPRLDINSSSFERKPDGSGGVIIDSGSTL 339

Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRS--YSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAK 343
           +    E YE +R+ V+ +L+     S   S PD  C+ G +  DL  FPT+  +      
Sbjct: 340 SIFPPEVYEVVRNAVVDKLKKGYNISPLTSSPD--CFEGKIGRDLPLFPTLVLYLESTGI 397

Query: 344 LALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
           L  ++ S+F Q   + FC+              S+IG +AQQ Y  GY++
Sbjct: 398 LN-DRWSIFLQRYDELFCLGFTSGE------GLSIIGTLAQQSYKFGYNL 440


>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score =  161 bits (407), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 131/408 (32%), Positives = 195/408 (47%), Gaps = 27/408 (6%)

Query: 15  NENPAHRVQRGINISIARLAYLQEKIRSHNTY---QAQILPSNDISNSLFYVNISIGQPP 71
           N     RVQ GI    +RL  L   + + ++    + Q+       N  + + ++IG PP
Sbjct: 59  NLTKLERVQHGIKRGKSRLQKLNAMVLAASSTPDSEDQLEAPIHAGNGEYLIELAIGTPP 118

Query: 72  VPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSA 131
           V     +DTGS L+W  C PC  C  Q   IF P +SSS++ V C S  C   P + CS 
Sbjct: 119 VSYPAVLDTGSDLIWTQCKPCTRCYKQPTPIFDPKKSSSFSKVSCGSSLCSALPSSTCS- 177

Query: 132 YKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF---SG 188
               C Y   Y     T   ++TE  TF  + ++ + V ++ FGCG   N    F   SG
Sbjct: 178 --DGCEYVYSYGDYSMTQGVLATETFTFGKS-KNKVSVHNIGFGCG-EDNEGDGFEQASG 233

Query: 189 IFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNKLILGD-GAIIDEGD--ATPL---Q 241
           + GLG G  SLVSQL    FSYC+  + D     + L+LG  G + D  +   TPL    
Sbjct: 234 LVGLGRGPLSLVSQLKEQRFSYCLTPIDDTK--ESVLLLGSLGKVKDAKEVVTTPLLKNP 291

Query: 242 FIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEV 300
                YY++LEAISV    L I  + F+  DD  GGV+IDSGT +T++ ++AYEAL+ E 
Sbjct: 292 LQPSFYYLSLEAISVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTITYVQQKAYEALKKEF 351

Query: 301 MIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAF 360
           + + +    ++ S    LC+     S     P + FHF+GG      ++ M         
Sbjct: 352 ISQTKLALDKTSSTGLDLCFSLPSGSTQVEIPKLVFHFKGGDLELPAENYMIGDSNLGVA 411

Query: 361 CMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
           C+A+  +S        S+ G + QQ   V +D+ ++  +F    C+ L
Sbjct: 412 CLAMGASS------GMSIFGNVQQQNILVNHDLEKETISFVPTSCDQL 453


>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 436

 Score =  160 bits (406), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 124/398 (31%), Positives = 185/398 (46%), Gaps = 29/398 (7%)

Query: 20  HRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMD 79
            R+QR +     RL  L  K  S   +++ +       N  F + ++IG P       MD
Sbjct: 59  ERLQRAMKRGKLRLQRLSAKTAS---FESSVEAPVHAGNGEFLMKLAIGTPAETYSAIMD 115

Query: 80  TGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYT 139
           TGS L+W  C PC+DC  Q   IF P +SSS++ +PC S+ C   P + CS     C Y 
Sbjct: 116 TGSDLIWTQCKPCKDCFDQPTPIFDPKKSSSFSKLPCSSDLCAALPISSCS---DGCEYL 172

Query: 140 QLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFS---GIFGLGIGR 196
             Y     T   ++TE   F +       V  + FGCG   N    FS   G+ GLG G 
Sbjct: 173 YSYGDYSSTQGVLATETFAFGDA-----SVSKIGFGCG-EDNDGSGFSQGAGLVGLGRGP 226

Query: 197 SSLVSQLNS-SFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLE 252
            SL+SQL    FSYC+ S+ D   + + L++G  A +     TPL         YY++LE
Sbjct: 227 LSLISQLGEPKFSYCLTSMDDSKGI-SSLLVGSEATMKNAITTPLIQNPSQPSFYYLSLE 285

Query: 253 AISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS 311
            ISV   +L I  + F  ++D  GG++IDSGT +T+L   A+ AL+ E + +L+ +   S
Sbjct: 286 GISVGDTLLPIEKSTFSIQNDGSGGLIIDSGTTITYLEDSAFAALKKEFISQLKLDVDES 345

Query: 312 YSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS-MFYQPRPDAFCMAVNPASIN 370
            S    LC+     +     P + FHF  GA L L  ++ +         C+ +  +S  
Sbjct: 346 GSTGLDLCFTLPPDASTVDVPQLVFHFE-GADLKLPAENYIIADSGLGVICLTMGSSS-- 402

Query: 371 NRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
                 S+ G   QQ   V +D+ ++  +F    C  L
Sbjct: 403 ----GMSIFGNFQQQNIVVLHDLEKETISFAPAQCNQL 436


>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 437

 Score =  160 bits (406), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 124/422 (29%), Positives = 196/422 (46%), Gaps = 34/422 (8%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           LIH  S  SP+ N  E+   R+   +  S  R+ YL               P N + N +
Sbjct: 30  LIHPISSKSPFYNTAESHFQRMSNNMKHSTNRVHYLNHVFS---------FPPNKVPNIV 80

Query: 61  --------FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYA 112
                   + ++  IG PP   +  MDT +  +W  C PC+ C      +F PS+SS+Y 
Sbjct: 81  VSPFMGDGYIISFLIGTPPFQLYGVMDTANDNIWFQCNPCKPCFNTTSPMFDPSKSSTYK 140

Query: 113 YVPCDSEHCRYFPYARCSA-YKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
            +PC S  C+      CS+  K  C Y+  Y     +   +S + LT  + +++ I  ++
Sbjct: 141 TIPCSSPKCKNVENTHCSSDDKKVCEYSFTYGGEAYSQGDLSIDTLTLNSNNDTPISFKN 200

Query: 172 VVFGCGFSTNRNFK--FSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLI 225
           +V GCG       +   SG  GLG G  S +SQLNSS    FSYC+  L   + +  KL 
Sbjct: 201 IVIGCGHRNKGPLEGYVSGNIGLGRGPLSFISQLNSSIGGKFSYCLVPLFSNEGISGKLH 260

Query: 226 LGDGAIIDEGD--ATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGT 283
            GD +++      +TP+   +  Y  TL A+SV   ++    N   ++D+ G  +IDSGT
Sbjct: 261 FGDKSVVSGVGTVSTPITAGEIGYSTTLNALSVGDHIIKFE-NSTSKNDNLGNTIIDSGT 319

Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAK 343
            +T L +  Y  L   V   ++ E+ +S +   KLCY   + +     P +  HF  GA 
Sbjct: 320 TLTILPENVYSRLESIVTSMVKLERAKSPNQQFKLCYKATLKN--LDVPIITAHFN-GAD 376

Query: 344 LALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
           + L   + FY    +  C A    S+ N     ++IG +AQQ + VG+D+ +   +F+  
Sbjct: 377 VHLNSLNTFYPIDHEVVCFAF--VSVGN--FPGTIIGNIAQQNFLVGFDLQKNIISFKPT 432

Query: 404 DC 405
           DC
Sbjct: 433 DC 434


>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 441

 Score =  160 bits (405), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 116/405 (28%), Positives = 189/405 (46%), Gaps = 29/405 (7%)

Query: 22  VQRGINISIARLAYLQEKIRSHNTY----QAQILPSNDISNSLFYVNISIGQPPVPQFTA 77
           + R I  S AR+A LQ              A++L +   S+  + V+++IG PP+     
Sbjct: 48  LSRAIARSKARVAALQSAAVLPPVVDPITAARVLVT--ASSGEYLVDLAIGTPPLYYTAI 105

Query: 78  MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCI 137
           MDTGS L+W  C PC  C+ Q    F   +S++Y  +PC S  C       C  +K  C+
Sbjct: 106 MDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRCASLSSPSC--FKKMCV 163

Query: 138 YTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG-FSTNRNFKFSGIFGLGIGR 196
           Y   Y     T+  ++ E  TF   + + +   ++ FGCG  +       SG+ G G G 
Sbjct: 164 YQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLNAGDLANSSGMVGFGRGP 223

Query: 197 SSLVSQLNSS-FSYCIGSL--HDPDYLHNKLILGDGAIIDEGDATPLQ--------FIDG 245
            SLVSQL  S FSYC+ S     P  L+   +  + +  +    +P+Q         +  
Sbjct: 224 LSLVSQLGPSRFSYCLTSYLSATPSRLYFG-VYANLSSTNTSSGSPVQSTPFVINPALPN 282

Query: 246 HYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRL 304
            Y+++L+AIS+  ++L I+P +F  +D G GGV+IDSGT +TWL ++AYEA+R  ++  +
Sbjct: 283 MYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVSAI 342

Query: 305 EGEQMRSYSWPDKLCYHGIMSSDLK-GFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMA 363
               M         C+      ++    P + FHF       L ++ M         C+ 
Sbjct: 343 PLPAMNDTDIGLDTCFQWPPPPNVTVTVPDLVFHFDSANMTLLPENYMLIASTTGYLCLV 402

Query: 364 VNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
           + P  +       ++IG   QQ  ++ YDIG    +F    C+++
Sbjct: 403 MAPTGVG------TIIGNYQQQNLHLLYDIGNSFLSFVPAPCDII 441


>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 444

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 116/421 (27%), Positives = 188/421 (44%), Gaps = 32/421 (7%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
            I RDS  SP+ NP+E    R+Q+    SI R  + +    S N  Q+ ++         
Sbjct: 38  FISRDSPHSPFYNPSETKYQRLQKAFRRSILRGNHFRAMRASPNDIQSDVISGG----GA 93

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + +NIS+G PPVP     DTGS L+W  C PC +C  Q+  +F P  S +Y  + CD+E 
Sbjct: 94  YLMNISLGTPPVPMLGIADTGSDLIWRQCLPCPNCYEQVEPLFDPKESETYKTLDCDNEF 153

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C+           + C Y+  Y     T   +S++ LT  +T+        + FGCG   
Sbjct: 154 CQDLGQQGSCDDDNTCTYSYSYGDRSYTRGDLSSDTLTIGSTEGDPASFPGIAFGCGHDN 213

Query: 181 NRNFKFSGIFGLGIGRS------SLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
              F       +G+G         L S++   FSYC+  L     + +K+  G   ++  
Sbjct: 214 GGTFNEKDGGLIGLGGGPLSLVMQLSSEVGGQFSYCLVPLSSDSTVSSKINFGKSGVVSG 273

Query: 235 GDATPLQFIDG----HYYITLEAISVDGRMLDINPNIFKRDDSG------GGVMIDSGTD 284
                   I G     YY+TLE +SV    +      F  + S       G ++IDSGT 
Sbjct: 274 SGTVSTPLIKGTPDTFYYLTLEGLSVGSETVAFKG--FSENKSSPAAVEEGNIIIDSGTT 331

Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKL 344
           +T L ++ Y  +   +   + G+     +    LCY  + + ++   PT+  HF  GA +
Sbjct: 332 LTLLPQDFYTDVESALTNAIGGQTTTDPNGIFSLCYSSVNNLEI---PTITAHFT-GADV 387

Query: 345 ALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
            L   + F Q + D  C ++ P+S      N ++ G +AQ  + VGYD+   + +F++ D
Sbjct: 388 QLPPLNTFVQVQEDLVCFSMIPSS------NLAIFGNLAQINFLVGYDLKNNKVSFKQTD 441

Query: 405 C 405
           C
Sbjct: 442 C 442


>gi|356546376|ref|XP_003541602.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 450

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 128/421 (30%), Positives = 189/421 (44%), Gaps = 28/421 (6%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEK--IRSHNTYQAQILPSNDISN 58
           +IHRDS  SP     E P  RV   +  SI R  +  +K  + S NT ++ +      S 
Sbjct: 39  MIHRDSSRSPLYRHTETPFQRVANAMRRSINRANHFNKKSFVASTNTAESTV----KASQ 94

Query: 59  SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
             + ++ S+G PP      +DTGS + W+ C  C DC  Q   IF PS+S +Y  +PC S
Sbjct: 95  GEYLMSYSVGTPPFEILGVVDTGSGITWMQCQRCEDCYEQTTPIFDPSKSKTYKTLPCSS 154

Query: 119 EHCR-YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
             C+       CS+ K  C YT  Y  G  +   +S E LT  +T+ S++   + V GCG
Sbjct: 155 NMCQSVISTPSCSSDKIGCKYTIKYGDGSHSQGDLSVETLTLGSTNGSSVQFPNTVIGCG 214

Query: 178 FSTNRNFK------FSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAI 231
            +    F+           G     S L S +   FSYC+  +       +KL  GD A+
Sbjct: 215 HNNKGTFQGEGSGVVGLGGGPVSLISQLSSSIGGKFSYCLAPMFSQSNSSSKLNFGDAAV 274

Query: 232 IDEGDA--TPLQFIDGH---YYITLEAISVDGRMLDI--NPNIFKRDDSGGGVMIDSGTD 284
           +    A  TPL    G    YY+TLEA SV  + ++     +     +  G ++IDSGT 
Sbjct: 275 VSGLGAVSTPLVSKTGSEVFYYLTLEAFSVGDKRIEFVGGSSSSGSSNGEGNIIIDSGTT 334

Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKL 344
           +T L +E Y  L   V   ++  ++   S    LCY    S  L   P +  HF+ GA +
Sbjct: 335 LTLLPQEDYSNLESAVADAIQANRVSDPSNFLSLCYQTTPSGQLD-VPVITAHFK-GADV 392

Query: 345 ALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
            L   S F Q      C A + + +       S+ G +AQ    VGYD+  +  +F+  D
Sbjct: 393 ELNPISTFVQVAEGVVCFAFHSSEV------VSIFGNLAQLNLLVGYDLMEQTVSFKPTD 446

Query: 405 C 405
           C
Sbjct: 447 C 447


>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score =  160 bits (404), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 130/407 (31%), Positives = 194/407 (47%), Gaps = 26/407 (6%)

Query: 15  NENPAHRVQRGINISIARLAYLQEKIRSHNTYQA--QILPSNDISNSLFYVNISIGQPPV 72
           N     RVQ GI    +RL  L   + + +T  +  Q+       N  + + ++IG PPV
Sbjct: 60  NLTKLERVQHGIKRGKSRLQRLNAMVLAASTLDSEDQLEAPIHAGNGEYLMELAIGTPPV 119

Query: 73  PQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAY 132
                +DTGS L+W  C PC  C  Q   IF P +SSS++ V C S  C   P + CS  
Sbjct: 120 SYPAVLDTGSDLIWTQCKPCTQCYKQPTPIFDPKKSSSFSKVSCGSSLCSAVPSSTCS-- 177

Query: 133 KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF---SGI 189
              C Y   Y     T   ++TE  TF  + ++ + V ++ FGCG   N    F   SG+
Sbjct: 178 -DGCEYVYSYGDYSMTQGVLATETFTFGKS-KNKVSVHNIGFGCG-EDNEGDGFEQASGL 234

Query: 190 FGLGIGRSSLVSQLNS-SFSYCIGSLHDPDYLHNKLILGD-GAIIDEGD--ATPL---QF 242
            GLG G  SLVSQL    FSYC+  + D     + L+LG  G + D  +   TPL     
Sbjct: 235 VGLGRGPLSLVSQLKEPRFSYCLTPMDDTK--ESILLLGSLGKVKDAKEVVTTPLLKNPL 292

Query: 243 IDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVM 301
               YY++LE ISV    L I  + F+  DD  GGV+IDSGT +T++ ++A+EAL+ E +
Sbjct: 293 QPSFYYLSLEGISVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTITYIEQKAFEALKKEFI 352

Query: 302 IRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFC 361
            + +    ++ S    LC+     S     P + FHF+GG      ++ M         C
Sbjct: 353 SQTKLPLDKTSSTGLDLCFSLPSGSTQVEIPKIVFHFKGGDLELPAENYMIGDSNLGVAC 412

Query: 362 MAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
           +A+  +S        S+ G + QQ   V +D+ ++  +F    C+ L
Sbjct: 413 LAMGASS------GMSIFGNVQQQNILVNHDLEKETISFVPTSCDQL 453


>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 436

 Score =  160 bits (404), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 135/432 (31%), Positives = 200/432 (46%), Gaps = 56/432 (12%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISN-S 59
           LIHRDS LSP+  P+  P+ R+   IN ++ R  Y   +    +  + + L    I N  
Sbjct: 33  LIHRDSPLSPFYKPSLTPSDRI---INTAL-RSIYQLNRASHSDLNEKKTLERVRIPNHG 88

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSE 119
            + +   IG PPV +    DT S L+WV C PC  C PQ   +F P +SS++A + CDS+
Sbjct: 89  EYLMRFYIGTPPVERLAIADTASDLIWVQCSPCETCFPQDTPLFEPHKSSTFANLSCDSQ 148

Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
            C       C    + C+YT  Y  G  T   + TE + F +    T+     +FGCG  
Sbjct: 149 PCTSSNIYYCPLVGNLCLYTNTYGDGSSTKGVLCTESIHFGS---QTVTFPKTIFGCG-- 203

Query: 180 TNRNF------KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKL----- 224
           +N +F      K +GI GLG G  SLVSQL       FSYC+        +  K      
Sbjct: 204 SNNDFMHQISNKVTGIVGLGAGPLSLVSQLGDQIGHKFSYCLLPFTSTSTIKLKFGNDTT 263

Query: 225 ILGDGAIIDEGDATPLQFIDGH----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
           I G+G +     +TPL  ID H    Y++ L  I++  +ML +       D + G ++ID
Sbjct: 264 ITGNGVV-----STPL-IIDPHYPSYYFLHLVGITIGQKMLQVR----TTDHTNGNIIID 313

Query: 281 SGTDVTWLVKEAYE----ALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRF 336
            GT +T+L    Y      LR+ + I    E      +P   C+    +     FP + F
Sbjct: 314 LGTVLTYLEVNFYHNFVTLLREALGI---SETKDDIPYPFDFCFPNQANIT---FPKIVF 367

Query: 337 HFRGGAKLALEKDSMFYQ-PRPDAFCMAVNPASINNRYVN-FSLIGMMAQQFYNVGYDIG 394
            F  GAK+ L   ++F++    +  C+AV P    + Y   FS+ G +AQ  + V YD  
Sbjct: 368 QFT-GAKVFLSPKNLFFRFDDLNMICLAVLP----DFYAKGFSVFGNLAQVDFQVEYDRK 422

Query: 395 RKQKTFQRMDCE 406
            K+ +F   DC 
Sbjct: 423 GKKVSFAPADCS 434


>gi|147858841|emb|CAN78694.1| hypothetical protein VITISV_037475 [Vitis vinifera]
          Length = 442

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 135/410 (32%), Positives = 195/410 (47%), Gaps = 33/410 (8%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQ-AQILPSNDISN- 58
           LIH  S  SPY N       +    +  +++R AYL  + R     Q A  +P   I + 
Sbjct: 34  LIHIHSPSSPYKNVKAESLAK-DTALESTLSRHAYL--RARQQKALQPADFVPPPLIRDK 90

Query: 59  SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
           S F  N+SIG PP   +  +DTGS L W+ C PC  C  Q   I+  ++S SY  + C+ 
Sbjct: 91  SAFLANLSIGNPPTNVYVVLDTGSDLFWIQCEPCDVCYKQKDPIYNRTKSDSYTEMLCNE 150

Query: 119 EHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
             C         +    C+Y   Y  G  TS  +S E++ F +          V FGCG 
Sbjct: 151 PPCVSLGREGQCSDSGSCLYQTAYADGARTSGLLSYEKVAFTSHYSDEDKTAQVGFGCGL 210

Query: 179 STNRNFKFSG----IFGLGIGRSSLVSQLNS------SFSYCIGSLHDPDYLHNKLILGD 228
             N NF  S     + GLG G  SLVSQL++      SF+YC G++ +P+     L+ GD
Sbjct: 211 Q-NLNFITSNRDGGVLGLGPGLVSLVSQLSAIGKVSKSFAYCFGNISNPN-AGGFLVFGD 268

Query: 229 GAIIDEGDATPLQFIDGHYYITLEAI--SVDGRMLDINPNIFKRD-DSGGGVMIDSGTDV 285
              ++ GD TP+  I   YY+ L  I   V    LDIN + F+R  D  GGV+IDSG+ +
Sbjct: 269 ATYLN-GDMTPM-VIAEFYYVNLLGIGLGVGEPRLDINSSSFERKPDGSGGVIIDSGSTL 326

Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRS--YSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAK 343
           +    E YE +R+ V+ +L+     S   S PD  C+ G +  DL  FPT+  +      
Sbjct: 327 SVFPPEVYEVVRNAVVDKLKKGYNISPLTSSPD--CFEGKIERDLPLFPTLVLYLESTGI 384

Query: 344 LALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
           L  ++ S+F Q   + FC+              S+IG +AQQ Y  GY++
Sbjct: 385 LN-DRWSIFLQRYDELFCLGFTSGE------GLSIIGTLAQQSYKFGYNL 427


>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 418

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 125/417 (29%), Positives = 200/417 (47%), Gaps = 45/417 (10%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           L HRDS+LSP    + +   R+      S++R A L  +  +      Q           
Sbjct: 34  LFHRDSLLSPLEFSSLSHYDRLANAFRRSLSRSAALLNRAATSGAVGLQ----------- 82

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
              +  IG PPV      DTGS L W  C PC  C  QL  IF P +S+S+++VPC+++ 
Sbjct: 83  ---SSIIGTPPVDYLGIADTGSDLTWAQCLPCLKCYQQLRPIFNPLKSTSFSHVPCNTQT 139

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C       C   +  C Y+  Y  G  T    S   L F+     +  V+ V+ GCG ++
Sbjct: 140 CHAVDDGHC-GVQGVCDYSYTY--GDRT---YSKGDLGFEKITIGSSSVKSVI-GCGHAS 192

Query: 181 NRNFKF-SGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAIID 233
           +  F F SG+ GLG G+ SLVSQ++ +      FSYC+ +L    + + K+  G  A++ 
Sbjct: 193 SGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLL--SHANGKINFGQNAVVS 250

Query: 234 EGD--ATPL--QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLV 289
                +TPL  +    +YYITLEAIS+           F +    G V+IDSGT +++L 
Sbjct: 251 GPGVVSTPLISKNTVTYYYITLEAISIGNE----RHMAFAKQ---GNVIIDSGTTLSFLP 303

Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPDKLCY-HGIMSSDLKGFPTVRFHFRGGAKLALEK 348
           KE Y+ +   ++  ++ ++++       LC+  GI  +   G P +   F GGA + L  
Sbjct: 304 KELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPIITAQFSGGANVNLLP 363

Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
            + F +   +  C+ + PAS  +    F +IG +A   + +GYD+  K+ +F+   C
Sbjct: 364 VNTFQKVANNVNCLTLTPASPTDE---FGIIGNLALANFLIGYDLEAKRLSFKPTVC 417


>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 430

 Score =  159 bits (402), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 122/419 (29%), Positives = 203/419 (48%), Gaps = 37/419 (8%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTY--QAQILPSNDISN 58
           L HRDS+LSP    + +   R+      S++R A L  +  ++     QA + P +    
Sbjct: 34  LFHRDSLLSPLEFSSLSHYDRLTNAFRRSLSRSATLLNRAATNGALDLQAPLTPGS---- 89

Query: 59  SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
             + +++SIG PPV      DTGS L+W  C PC  C  Q   IF P +S+S+++VPC+S
Sbjct: 90  GEYLMSVSIGTPPVDYIGMADTGSDLMWAQCLPCLKCYKQSRPIFDPLKSTSFSHVPCNS 149

Query: 119 EHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
           ++C+    + C A +  C Y+  Y  G +T    +   L F+     +  V+ V+ GCG 
Sbjct: 150 QNCKAIDDSHCGA-QGVCDYSYTY--GDQT---YTKGDLGFEKITIGSSSVKSVI-GCGH 202

Query: 179 S-TNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAI 231
                    SG+ GLG G+ SLVSQ++ +      FSYC+ +L    + + K+  G  A+
Sbjct: 203 ESGGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLL--SHANGKINFGQNAV 260

Query: 232 IDEGDATPLQFIDG----HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
           +          I      +YY+TLEAIS+                  G V+IDSGT +++
Sbjct: 261 VSGPGVVSTPLISKNPVTYYYVTLEAISIGNER-------HMASAKQGNVIIDSGTTLSF 313

Query: 288 LVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCY-HGIMSSDLKGFPTVRFHFRGGAKLAL 346
           L KE Y+ +   ++  ++ ++++       LC+  GI  +   G P +   F GGA + L
Sbjct: 314 LPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPIITAQFSGGANVNL 373

Query: 347 EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
              + F +   +  C+ + PAS  +    F +IG +A   + +GYD+  K+ +F+   C
Sbjct: 374 LPVNTFQKVANNVNCLTLTPASPTDE---FGIIGNLALANFLIGYDLEAKRLSFKPTVC 429


>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 439

 Score =  159 bits (402), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 137/421 (32%), Positives = 203/421 (48%), Gaps = 33/421 (7%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSH---NTYQAQILPSNDIS 57
           LI+RDS  SP+ NP E P  R+   +  S++R+ +      S    +T Q+++     IS
Sbjct: 33  LINRDSPKSPFYNPRETPTQRIVSAVRRSMSRVHHFSPTKNSDIFTDTAQSEM-----IS 87

Query: 58  NSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPC 116
           N   Y+   S+G P        DTGS L+W  C PC  C  Q   +F P  SS+Y  + C
Sbjct: 88  NQGEYLMKFSLGTPAFDILAIADTGSDLIWTQCKPCDQCYEQDAPLFDPKSSSTYRDISC 147

Query: 117 DSEHCRYFPY-ARCSAYKHR-CIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
            ++ C      A CS   ++ C Y+  Y     TS  V+ + +T  +T    + +   + 
Sbjct: 148 STKQCDLLKEGASCSGEGNKTCHYSYSYGDRSFTSGNVAADTITLGSTSGRPVLLPKAII 207

Query: 175 GCGFSTNRNF--KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGD 228
           GCG +   +F  K SGI GLG G  SL+SQL S+    FSYC+  L       +KL  G 
Sbjct: 208 GCGHNNGGSFTEKGSGIVGLGGGPISLISQLGSTIDGKFSYCLVPLSSNATNSSKLNFGS 267

Query: 229 GAIIDEG--DATPL--QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
             I+  G   +TPL  +  D  Y++TLEA+SV    +    + F    S G ++IDSGT 
Sbjct: 268 NGIVSGGGVQSTPLISKDPDTFYFLTLEAVSVGSERIKFPGSSFGT--SEGNIIIDSGTT 325

Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKL 344
           +T   ++ +  L   V   + G  +   S    LCY   + +DLK FP++  HF  GA +
Sbjct: 326 LTLFPEDFFSELSSAVQDAVAGTPVEDPSGILSLCYS--IDADLK-FPSITAHFD-GADV 381

Query: 345 ALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
            L   + F Q      C A NP  IN+     ++ G +AQ  + VGYD+  K  +F+  D
Sbjct: 382 KLNPLNTFVQVSDTVLCFAFNP--INSG----AIFGNLAQMNFLVGYDLEGKTVSFKPTD 435

Query: 405 C 405
           C
Sbjct: 436 C 436


>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
 gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
          Length = 464

 Score =  159 bits (401), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 128/427 (29%), Positives = 195/427 (45%), Gaps = 51/427 (11%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNT------YQAQILPSN 54
           L+HRD+I S    P+    H+V   +    AR+ +L++++ +  +        ++++P  
Sbjct: 67  LVHRDAI-SGATYPSRR--HQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVVPGV 123

Query: 55  DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYV 114
           D  +  ++V + +G PP  Q+  +D+GS ++WV C PC  C  Q   +F P+ SSS++ V
Sbjct: 124 DDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGV 183

Query: 115 PCDSEHCRYFP--YARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
            C S  CR              +C Y+  Y  G  T   ++ E LT   T      VQ V
Sbjct: 184 SCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGT-----AVQGV 238

Query: 173 VFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILG 227
             GCG   +  F   +G+ GLG G  SLV QL  +    FSYC+ S          L+LG
Sbjct: 239 AIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASRGAGG--AGSLVLG 296

Query: 228 DGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVT 286
               +  G     +     YY+ L  I V G  L +  ++F+  +D  GGV++D+GT VT
Sbjct: 297 RTEAVPRG-----RRASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAVT 351

Query: 287 WLVKEAYEALR---DEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----PTVRFHF 338
            L +EAY ALR   D  M  L   +  + S  D  CY      DL G+     PTV F+F
Sbjct: 352 RLPREAYAALRGAFDGAMGAL--PRSPAVSLLDT-CY------DLSGYASVRVPTVSFYF 402

Query: 339 RGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQK 398
             GA L L   ++  +     FC+A  P+S        S++G + Q+   +  D      
Sbjct: 403 DQGAVLTLPARNLLVEVGGAVFCLAFAPSS-----SGISILGNIQQEGIQITVDSANGYV 457

Query: 399 TFQRMDC 405
            F    C
Sbjct: 458 GFGPNTC 464


>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
 gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
          Length = 488

 Score =  159 bits (401), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 118/361 (32%), Positives = 166/361 (45%), Gaps = 34/361 (9%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++  + +G P    +  +DTGS ++W+ C PC  C  Q   +F P++S S+A +PC S  
Sbjct: 145 YFTRLGVGTPARYVYMVLDTGSDIVWIQCAPCIKCYSQTDPVFDPTKSRSFANIPCGSPL 204

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           CR   Y  CS  K  C+Y   Y  G  T    STE LTF+ T      V  VV GCG   
Sbjct: 205 CRRLDYPGCSTKKQICLYQVSYGDGSFTVGEFSTETLTFRGT-----RVGRVVLGCGHDN 259

Query: 181 NRNFKFSGI----------FGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGA 230
              F  +            F   IGR     + NS FSYC+G         + ++ GD A
Sbjct: 260 EGLFVGAAGLLGLGRGRLSFPSQIGR-----RFNSKFSYCLGD-RSASSRPSSIVFGDSA 313

Query: 231 IIDEGDATPL---QFIDGHYYITLEAISVDG-RMLDINPNIFKRDDSG-GGVMIDSGTDV 285
           I      TPL     +D  YY+ L  ISV G R+  I+ ++FK D +G GGV+IDSGT V
Sbjct: 314 ISRTTRFTPLLSNPKLDTFYYVELLGISVGGTRVSGISASLFKLDSTGNGGVIIDSGTSV 373

Query: 286 TWLVKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKL 344
           T L + AY ALRD  ++     ++   +S  D  C+     +++K  PTV  HFRG    
Sbjct: 374 TRLTRAAYVALRDAFLVGASNLKRAPEFSLFDT-CFDLSGKTEVK-VPTVVLHFRGADVP 431

Query: 345 ALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
               + +       +FC A             S+IG + QQ + V YD+   +  F    
Sbjct: 432 LPASNYLIPVDNSGSFCFA-----FAGTASGLSIIGNIQQQGFRVVYDLATSRVGFAPRG 486

Query: 405 C 405
           C
Sbjct: 487 C 487


>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags:
           Precursor
 gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 447

 Score =  158 bits (400), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 132/434 (30%), Positives = 197/434 (45%), Gaps = 45/434 (10%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           LIHRDS LSP  NP      R+      S++R      ++ S    Q+ ++ ++      
Sbjct: 30  LIHRDSPLSPIYNPQITVTDRLNAAFLRSVSRSRRFNHQL-SQTDLQSGLIGAD----GE 84

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           F+++I+IG PP+  F   DTGS L WV C PC+ C  + G IF   +SS+Y   PCDS +
Sbjct: 85  FFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYKENGPIFDKKKSSTYKSEPCDSRN 144

Query: 121 CRYFPYAR--CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
           C+        C    + C Y   Y     +   V+TE ++  +   S +     VFGCG+
Sbjct: 145 CQALSSTERGCDESNNICKYRYSYGDQSFSKGDVATETVSIDSASGSPVSFPGTVFGCGY 204

Query: 179 STNRNFKFSGIFGLGIGRS--SLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAI- 231
           +    F  +G   +G+G    SL+SQL SS    FSYC+          + + LG  +I 
Sbjct: 205 NNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCLSHKSATTNGTSVINLGTNSIP 264

Query: 232 ----IDEG-------DATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG------ 274
                D G       D  PL +    YY+TLEAISV  + +    + +  +D G      
Sbjct: 265 SSLSKDSGVVSTPLVDKEPLTY----YYLTLEAISVGKKKIPYTGSSYNPNDDGILSETS 320

Query: 275 GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLK-GFPT 333
           G ++IDSGT +T L    ++     V   + G   +  S P  L  H   S   + G P 
Sbjct: 321 GNIIIDSGTTLTLLEAGFFDKFSSAVEESVTG--AKRVSDPQGLLSHCFKSGSAEIGLPE 378

Query: 334 VRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
           +  HF  GA + L   + F +   D  C+++ P +    Y NF      AQ  + VGYD+
Sbjct: 379 ITVHFT-GADVRLSPINAFVKLSEDMVCLSMVPTTEVAIYGNF------AQMDFLVGYDL 431

Query: 394 GRKQKTFQRMDCEV 407
             +  +FQ MDC  
Sbjct: 432 ETRTVSFQHMDCSA 445


>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 374

 Score =  158 bits (400), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 117/368 (31%), Positives = 171/368 (46%), Gaps = 45/368 (12%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + + +SIG PP   +   DTGS L W  C PC  C  Q   IF P +S+SY  + CDS+ 
Sbjct: 25  YLMEVSIGTPPFKIYGIADTGSDLTWTSCVPCNKCYKQRNPIFDPQKSTSYRNISCDSKL 84

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C       CS  KH C YT  Y     T   ++ E +T  +T   ++ ++ +VFGCG + 
Sbjct: 85  CHKLDTGVCSPQKH-CNYTYAYASAAITQGVLAQETITLSSTKGESVPLKGIVFGCGHNN 143

Query: 181 NRNF--KFSGIFGLGIGRSSLVSQLNSS-----FSYCIGSLHDPDYLHNKLILGDGA--- 230
              F  +  GI GLG G  S +SQ+ SS     FS C+   H    + +K+ LG G+   
Sbjct: 144 TGGFNDREMGIIGLGGGPVSFISQIGSSFGGKRFSQCLVPFHTDVSVSSKMSLGKGSEVS 203

Query: 231 --------IIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSG 282
                   ++ + D TP       Y++TL  ISV    L  N +  +  +  G V +DSG
Sbjct: 204 GKGVVSTPLVAKQDKTP-------YFVTLLGISVGNTYLHFNGSSSQSVEK-GNVFLDSG 255

Query: 283 TDVTWLVKEAYEAL----RDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHF 338
           T  T L  + Y+ L    R EV ++             +LCY     ++L+G P +  HF
Sbjct: 256 TPPTILPTQLYDRLVAQVRSEVAMK---PVTNDLDLGPQLCYR--TKNNLRG-PVLTAHF 309

Query: 339 RGGAKLALEKDSMFYQPRPDAFCMA-VNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
            GG  + L     F  P+   FC+   N +S    Y NF      AQ  Y +G+D+ R+ 
Sbjct: 310 EGG-DVKLLPTQTFVSPKDGVFCLGFTNTSSDGGVYGNF------AQSNYLIGFDLDRQV 362

Query: 398 KTFQRMDC 405
            +F+ MDC
Sbjct: 363 VSFKPMDC 370


>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
          Length = 447

 Score =  158 bits (400), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 116/362 (32%), Positives = 171/362 (47%), Gaps = 21/362 (5%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + + ++IG PPVP     DTGS L W  C PC+ C PQ   I+  + SSS++ VPC S  
Sbjct: 93  YLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPIYDTAVSSSFSPVPCASAT 152

Query: 121 CRYFPYAR-CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF- 178
           C     +R C+A    C Y   Y  G  ++  + TE LTF       + V  + FGCG  
Sbjct: 153 CLPIWSSRNCTASSSPCRYRYAYGDGAYSAGVLGTETLTFPGAPG--VSVGGIAFGCGVD 210

Query: 179 STNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCI-----GSLHDPDYLHNKLILGDGAII 232
           +   ++  +G  GLG G  SLV+QL    FSYC+      SL  P        L   +  
Sbjct: 211 NGGLSYNSTGTVGLGRGSLSLVAQLGVGKFSYCLTDFFNTSLGSPVLFGALAELAAPSTG 270

Query: 233 DEGDATPL---QFIDGHYYITLEAISV-DGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
               +TPL    ++   YY++LE IS+ D R+   N     RDD  GG+++DSGT  T+L
Sbjct: 271 AAVQSTPLVQSPYVPTWYYVSLEGISLGDARLPIPNGTFDLRDDGSGGMIVDSGTTFTFL 330

Query: 289 VKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS-DLKGFPTVRFHFRGGAKLALE 347
           V+ A+  + D V   L    + + S  D  C+        L   P +  HF GGA + L 
Sbjct: 331 VESAFRVVVDHVAGVLRQPVVNASSL-DSPCFPAATGEQQLPAMPDMVLHFAGGADMRLH 389

Query: 348 KDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
           +D+ M +     +FC+ +      +   + S++G   QQ   + +DI   Q +F   DC 
Sbjct: 390 RDNYMSFNQEESSFCLNI----AGSPSADVSILGNFQQQNIQMLFDITVGQLSFMPTDCG 445

Query: 407 VL 408
            L
Sbjct: 446 KL 447


>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 443

 Score =  158 bits (399), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 130/419 (31%), Positives = 196/419 (46%), Gaps = 50/419 (11%)

Query: 19  AHRVQRGINISIARLAYLQE---KIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQF 75
           A  + R +  S AR+A LQ       +     A+IL     S   + +++ IG PP    
Sbjct: 46  AQLLSRAVRRSKARVAALQSLATTTAADAITVARIL--VLASEGEYLMSMGIGTPPRYYS 103

Query: 76  TAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHR 135
             +DTGS L+W  C PC  C  Q    F P++S SYA +PC+S  C    Y  C  Y++ 
Sbjct: 104 AILDTGSDLIWTQCAPCMLCVDQPTPFFDPAQSPSYAKLPCNSPMCNALYYPLC--YRNV 161

Query: 136 CIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG-FSTNRNFKFSGIFGLGI 194
           C+Y   Y     T+  +S E  TF  T+++ + V  + FGCG  +    F  SG+ G G 
Sbjct: 162 CVYQYFYGDSANTAGVLSNETFTF-GTNDTRVTVPRIAFGCGNLNAGSLFNGSGMVGFGR 220

Query: 195 GRSSLVSQLNSS-FSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT---PLQ---FI---- 243
           G  SLVSQL S  FSYC+ S   P  + ++L  G  A ++   A+   P+Q   FI    
Sbjct: 221 GPLSLVSQLGSPRFSYCLTSFMSP--VPSRLYFGAYATLNSTSASTGEPVQSTPFIVNPG 278

Query: 244 -DGHYYITLEAISVDGRMLDINPNIFKRDDS--GGGVMIDSGTDVTWLVKEAY----EAL 296
               YY+ +  ISV G +L I+P++F  +D+   GGV+IDSG+ +T+L + AY    +A 
Sbjct: 279 LPTMYYLNMTGISVGGELLPIDPSVFAINDADGTGGVIIDSGSTITYLARAAYDMVHQAF 338

Query: 297 RDEVMIRLEGEQMRS------YSWPDKLCYHGIMSSDLKGFPTVRFHFRGG-AKLALEKD 349
            D+V + L      +      + WP            +   P + FHF G   +L LE +
Sbjct: 339 ADQVGLPLTNATSLADVLDTCFVWPPP-------PRKIVTMPELAFHFEGANMELPLE-N 390

Query: 350 SMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
            M         C+A+  +       + S+IG    Q ++V YD      +F    C V+
Sbjct: 391 YMLIDGDTGNLCLAIAASD------DGSIIGSFQHQNFHVLYDNENSLLSFTPATCNVM 443


>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 396

 Score =  158 bits (399), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 123/385 (31%), Positives = 186/385 (48%), Gaps = 29/385 (7%)

Query: 35  YLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD 94
           Y  +++  H      +      +N  + + +++G PPV  +  +DTGS L+W  C PC+ 
Sbjct: 24  YKSDELHMHRLGSNGVFTRVTSNNGDYLMKLTLGTPPVDVYGLVDTGSDLVWAQCTPCQG 83

Query: 95  CSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVST 154
           C  Q   +F P RS++Y  +PCDSE C       CS  K  C Y+  Y     T   ++ 
Sbjct: 84  CYRQKSPMFEPLRSNTYTPIPCDSEECNSLFGHSCSPQK-LCAYSYAYADSSVTKGVLAR 142

Query: 155 EQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGI--FGLGIGRSSLVSQL-----NSSF 207
           E +TF +TD   + V D+VFGCG S +  F  + +   GLG G  SLVSQ      +  F
Sbjct: 143 ETVTFSSTDGEPVVVGDIVFGCGHSNSGTFNENDMGIIGLGGGPLSLVSQFGNLYGSKRF 202

Query: 208 SYCIGSLHDPDYLHNKLILGDGA-IIDEG-DATPLQFIDGH--YYITLEAISVDGRMLDI 263
           S C+   H   +    +  GD + +  EG  ATPL   +G   Y +TLE ISV    +  
Sbjct: 203 SQCLVPFHADPHTLGTISFGDASDVSGEGVAATPLVSEEGQTPYLVTLEGISVGDTFVSF 262

Query: 264 NPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD---KLCY 320
           N +      S G +MIDSGT  T+L +E Y+ L  E  ++++   +     PD   +LCY
Sbjct: 263 NSSEML---SKGNIMIDSGTPATYLPQEFYDRLVKE--LKVQSNMLPIDDDPDLGTQLCY 317

Query: 321 HGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIG 380
                ++L+G P +  HF  GA + L     F  P+   FC A+   + +  Y+     G
Sbjct: 318 RS--ETNLEG-PILIAHFE-GADVQLMPIQTFIPPKDGVFCFAM-AGTTDGEYI----FG 368

Query: 381 MMAQQFYNVGYDIGRKQKTFQRMDC 405
             AQ    +G+D+ RK  +F+  DC
Sbjct: 369 NFAQSNVLIGFDLDRKTVSFKATDC 393


>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
          Length = 353

 Score =  158 bits (399), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 114/365 (31%), Positives = 181/365 (49%), Gaps = 31/365 (8%)

Query: 63  VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR 122
           + +SIG P V     +DTGS L+W  C PC +C  Q   IF P +SSSY+ V C S  C 
Sbjct: 1   MELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCN 60

Query: 123 YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNR 182
             P + C+  K  C Y   Y     T   ++TE  TF+  DE++I    + FGCG     
Sbjct: 61  ALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFE--DENSIS--GIGFGCGVENEG 116

Query: 183 N--FKFSGIFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDY--------LHNKLILGDGAI 231
           +   + SG+ GLG G  SL+SQL  + FSYC+ S+ D +         L + ++   GA 
Sbjct: 117 DGFSQGSGLVGLGRGPLSLISQLKETKFSYCLTSIEDSEASSSLFIGSLASGIVNKTGAS 176

Query: 232 IDEGDATPLQFI------DGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTD 284
           +D G+ T    +         YY+ L+ I+V  + L +  + F+  +D  GG++IDSGT 
Sbjct: 177 LD-GEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGMIIDSGTT 235

Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKL 344
           +T+L + A++ L++E   R+      S S    LC+    ++     P + FHF+ GA L
Sbjct: 236 ITYLEETAFKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPDAAKNIAVPKMIFHFK-GADL 294

Query: 345 ALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
            L  ++ M         C+A+  ++        S+ G + QQ +NV +D+ ++  +F   
Sbjct: 295 ELPGENYMVADSSTGVLCLAMGSSN------GMSIFGNVQQQNFNVLHDLEKETVSFVPT 348

Query: 404 DCEVL 408
           +C  L
Sbjct: 349 ECGKL 353


>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 491

 Score =  157 bits (398), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 124/352 (35%), Positives = 174/352 (49%), Gaps = 22/352 (6%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++  + +GQP  P +  +DTGS + W+ C PC DC  Q   IF P  SSS+A +PC+S+ 
Sbjct: 155 YFSRVGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRSSSSFASLPCESQQ 214

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C+    + C A K  C+Y   Y  G  T     TE LTF N+      + DV  GCG   
Sbjct: 215 CQALETSGCRASK--CLYQVSYGDGSFTVGEFVTETLTFGNSG----MINDVAVGCGHDN 268

Query: 181 NRNF-KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
              F   +G+ GLG G  SL SQ+  SSFSYC+  +       + L     A  D  +A 
Sbjct: 269 EGLFVGSAGLLGLGGGPLSLTSQMKASSFSYCL--VDRDSSSSSDLEFNSAAPSDSVNAP 326

Query: 239 PLQF--IDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYEA 295
            L+   +D  YY+ L  +SV G++L I PN+F+ DDSG GG+++DSGT +T L  +AY  
Sbjct: 327 LLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYGGIIVDSGTAITRLQTQAYNT 386

Query: 296 LRDEVMIRLE-GEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL-EKDSMFY 353
           LRD  + R    ++   ++  D  CY  + S      PTV F F GG  L L  K+ +  
Sbjct: 387 LRDAFVSRTPYLKKTNGFALFDT-CYD-LSSQSRVTIPTVSFEFAGGKSLQLPPKNYLIP 444

Query: 354 QPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
                 FC A  P +      + S+IG + QQ   V YD+      F    C
Sbjct: 445 VDSVGTFCFAFAPTT-----SSLSIIGNVQQQGTRVHYDLANSVVGFSPHKC 491


>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
          Length = 473

 Score =  157 bits (398), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 130/431 (30%), Positives = 196/431 (45%), Gaps = 50/431 (11%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNT------YQAQILPSN 54
           L+HRD+I S    P+    H+V   +    AR+ +L++++ +  +        ++++P  
Sbjct: 67  LVHRDAI-SGATYPSRR--HQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVVPGV 123

Query: 55  DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYV 114
           D  +  ++V + +G PP  Q+  +D+GS ++WV C PC  C  Q   +F P+ SSS++ V
Sbjct: 124 DDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGV 183

Query: 115 PCDSEHCRYFP--YARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
            C S  CR              +C Y+  Y  G  T   ++ E LT   T      VQ V
Sbjct: 184 SCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGT-----AVQGV 238

Query: 173 VFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILG 227
             GCG   +  F   +G+ GLG G  SLV QL  +    FSYC+ S          L+LG
Sbjct: 239 AIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASRGAGG--AGSLVLG 296

Query: 228 DGAIIDEGDA-TPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSG 282
               +  G    PL         YY+ L  I V G  L +  ++F+  +D  GGV++D+G
Sbjct: 297 RTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTG 356

Query: 283 TDVTWLVKEAYEALR---DEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----PTV 334
           T VT L +EAY ALR   D  M  L   +  + S  D  CY      DL G+     PTV
Sbjct: 357 TAVTRLPREAYAALRGAFDGAMGAL--PRSPAVSLLDT-CY------DLSGYASVRVPTV 407

Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
            F+F  GA L L   ++  +     FC+A  P+S        S++G + Q+   +  D  
Sbjct: 408 SFYFDQGAVLTLPARNLLVEVGGAVFCLAFAPSS-----SGISILGNIQQEGIQITVDSA 462

Query: 395 RKQKTFQRMDC 405
                F    C
Sbjct: 463 NGYVGFGPNTC 473


>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 427

 Score =  157 bits (398), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 128/419 (30%), Positives = 206/419 (49%), Gaps = 42/419 (10%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           LIH++S  SP+   N    H+ +      + + +++Q+   +  T           +N  
Sbjct: 34  LIHKNSPNSPFYKSNN--FHKNKLRSFYQVPKKSFVQKSPYTRVTS----------NNGD 81

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + + +++G PPV  +  +DTGS L+W  C PC  C  Q   +F P RS +Y+ +PC+SE 
Sbjct: 82  YLMKLTLGSPPVDIYGLVDTGSDLVWAQCTPCGGCYRQKSPMFEPLRSKTYSPIPCESEQ 141

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C +F Y+ CS  K  C Y+  Y     T   ++ E +TF +TD   + V D++FGCG S 
Sbjct: 142 CSFFGYS-CSPQK-MCAYSYSYADSSVTKGVLAREAITFSSTDGDPVVVGDIIFGCGHSN 199

Query: 181 NRNFKFSGIFGLGIGRS--SLVSQLNS-----SFSYCIGSLHDPDYLHNKLILGDGA-II 232
           +  F  + +  +G+G    SLVSQ+ +      FS C+   H   +    +  G+ + + 
Sbjct: 200 SGTFNENDMGIIGMGGGPLSLVSQIGTLYGSKRFSQCLVPFHTDAHTSGTINFGEESDVS 259

Query: 233 DEG-DATPLQFIDGH--YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLV 289
            EG   TPL   +G   Y +TLE ISV    +  N +      S G +MIDSGT  T++ 
Sbjct: 260 GEGVVTTPLASEEGQTSYLVTLEGISVGDTFVRFNSS---ETLSKGNIMIDSGTPATYIP 316

Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPD---KLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
           +E YE L +E  ++++   +     PD   +LCY     ++L+G P +  HF  GA + L
Sbjct: 317 QEFYERLVEE--LKVQSSLLPIEDDPDLGTQLCYRS--ETNLEG-PILTAHFE-GADVQL 370

Query: 347 EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
                F  P+   FC A+   S +  Y+     G  AQ    +G+D+ RK  +F+  DC
Sbjct: 371 LPIQTFIPPKDGVFCFAM-AGSTDGDYI----FGNFAQSNILMGFDLDRKTISFKPTDC 424


>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
 gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
 gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 441

 Score =  157 bits (397), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 118/366 (32%), Positives = 173/366 (47%), Gaps = 26/366 (7%)

Query: 57  SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC-YPCRDCSPQLGTIFYPSRSSSYAYVP 115
           S + + V+I+IG PP+P    +DTGS L+W  C  PCR C PQ   ++ P+RS++YA V 
Sbjct: 88  STATYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVS 147

Query: 116 CDSEHCRYF--PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
           C S  C+    P++RCS     C Y   Y  G  T   ++TE  T      S   V+ V 
Sbjct: 148 CRSPMCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLG----SDTAVRGVA 203

Query: 174 FGCGFST-NRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCI----GSLHDPDYLHNKLILG 227
           FGCG          SG+ G+G G  SLVSQL  + FSYC      +   P +L +   L 
Sbjct: 204 FGCGTENLGSTDNSSGLVGMGRGPLSLVSQLGVTRFSYCFTPFNATAASPLFLGSSARLS 263

Query: 228 DGAIIDEGDATP---LQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGT 283
             A       +P    +    +YY++LE I+V   +L I+P +F+    G GGV+IDSGT
Sbjct: 264 SAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRLTPMGDGGVIIDSGT 323

Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAK 343
             T L + A+ AL   +  R+             LC+    S +    P +  HF  GA 
Sbjct: 324 TFTALEERAFVALARALASRVRLPLASGAHLGLSLCFAA-ASPEAVEVPRLVLHFD-GAD 381

Query: 344 LALEKDSMFYQPRPDAF-CMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
           + L ++S   + R     C+ +  A         S++G M QQ  ++ YD+ R   +F+ 
Sbjct: 382 MELRRESYVVEDRSAGVACLGMVSAR------GMSVLGSMQQQNTHILYDLERGILSFEP 435

Query: 403 MDCEVL 408
             C  L
Sbjct: 436 AKCGEL 441


>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
          Length = 441

 Score =  157 bits (397), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 118/366 (32%), Positives = 173/366 (47%), Gaps = 26/366 (7%)

Query: 57  SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC-YPCRDCSPQLGTIFYPSRSSSYAYVP 115
           S + + V+I+IG PP+P    +DTGS L+W  C  PCR C PQ   ++ P+RS++YA V 
Sbjct: 88  STATYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVS 147

Query: 116 CDSEHCRYF--PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
           C S  C+    P++RCS     C Y   Y  G  T   ++TE  T      S   V+ V 
Sbjct: 148 CRSPMCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLG----SDTAVRGVA 203

Query: 174 FGCGFST-NRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCI----GSLHDPDYLHNKLILG 227
           FGCG          SG+ G+G G  SLVSQL  + FSYC      +   P +L +   L 
Sbjct: 204 FGCGTENLGSTDNSSGLVGMGRGPLSLVSQLGVTRFSYCFTPFNATAASPLFLGSSARLS 263

Query: 228 DGAIIDEGDATP---LQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGT 283
             A       +P    +    +YY++LE I+V   +L I+P +F+    G GGV+IDSGT
Sbjct: 264 SAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRLTPMGDGGVIIDSGT 323

Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAK 343
             T L + A+ AL   +  R+             LC+    S +    P +  HF  GA 
Sbjct: 324 TFTALEESAFVALARALASRVRLPLASGAHLGLSLCFAA-ASPEAVEVPRLVLHFD-GAD 381

Query: 344 LALEKDSMFYQPRPDAF-CMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
           + L ++S   + R     C+ +  A         S++G M QQ  ++ YD+ R   +F+ 
Sbjct: 382 MELRRESYVVEDRSAGVACLGMVSAR------GMSVLGSMQQQNTHILYDLERGILSFEP 435

Query: 403 MDCEVL 408
             C  L
Sbjct: 436 AKCGEL 441


>gi|15235526|ref|NP_193028.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|5123933|emb|CAB45491.1| putative protein [Arabidopsis thaliana]
 gi|7267994|emb|CAB78334.1| putative protein [Arabidopsis thaliana]
 gi|332657803|gb|AEE83203.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 389

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 111/350 (31%), Positives = 162/350 (46%), Gaps = 21/350 (6%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ-LGTIFYPSRSSSYAYVPCDSE 119
           F   I  G P   QF  MDTGSSL W  C+PC DC  Q +   + P+ S +Y    C+  
Sbjct: 58  FMAEIHFGSPQKKQFLHMDTGSSLTWTQCFPCSDCYAQKIYPKYRPAASITYRDAMCEDS 117

Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG-F 178
           H +  P+         C Y Q YL        ++ E +T    D     V  V FGC   
Sbjct: 118 HPKSNPHFAFDPLTRICTYQQHYLDETNIKGTLAQEMITVDTHDGGFKRVHGVYFGCNTL 177

Query: 179 STNRNFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
           S    F  +GI GLG+G+ S++ +  S FS+C+G + +P   HN LILGDGA + +G  T
Sbjct: 178 SDGSYFTGTGILGLGVGKYSIIGEFGSKFSFCLGEISEPKASHN-LILGDGANV-QGHPT 235

Query: 239 PLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRD 298
            +   +GH    LE+I V   +   +P           V +D+G+ ++ L    Y    D
Sbjct: 236 VINITEGHTIFQLESIIVGEEITLDDPV---------QVFVDTGSTLSHLSTNLYYKFVD 286

Query: 299 EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPR-P 357
                L G   R  S+   LCY       L+    V F F  GA+L++   ++F Q   P
Sbjct: 287 -AFDDLIGS--RPLSYEPTLCYKADTIERLEKM-DVGFKFDVGAELSVNIHNIFIQQGPP 342

Query: 358 DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
           +  C+A+     N    +  +IG++A Q YNVGYD+  K     + DC++
Sbjct: 343 EIRCLAIQN---NKESFSHVIIGVIAMQGYNVGYDLSAKTAYINKQDCDM 389


>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 435

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 125/423 (29%), Positives = 196/423 (46%), Gaps = 36/423 (8%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           LIHRDS  SP  NP EN  HRV   +  SI+    L       NT +A I  +       
Sbjct: 34  LIHRDSPKSPMYNPLENHYHRVADTLRRSISHNTGLVT-----NTVEAPIYNNR----GE 84

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + + +S+G PP P     DTGS ++W  C PC +C  Q   +F PS+S++Y  V C S  
Sbjct: 85  YLMKLSVGTPPFPIIAVADTGSDIIWTQCVPCTNCYQQDLPMFNPSKSTTYRKVSCSSPV 144

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C +       ++K  C Y+  Y     +    + + LT  +T    +       GCG   
Sbjct: 145 CSFTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDTLTMGSTSGRVVAFPRTAIGCGHDN 204

Query: 181 NRNF--KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDE 234
             +F    SGI GLG+G +SL+ Q+ S+    FSYC+  + + D   NKL  G  A +  
Sbjct: 205 AGSFDANVSGIVGLGLGPASLIKQMGSAVGGKFSYCLTPIGNDDGGSNKLNFGSNANVSG 264

Query: 235 GDA--TPLQFID---GHYYITLEAISVDGRMLDINPNIFKRDDS--GG--GVMIDSGTDV 285
             A  TP+   D     Y + L+A+SV GR    N   +   +S  GG   ++IDSGT +
Sbjct: 265 SGAVSTPIYISDKFKSFYSLKLKAVSV-GR----NNTFYSTANSILGGKANIIIDSGTTL 319

Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLA 345
           T L  + Y      +   +  ++    +   + C+    ++D    P +  HF  GA L 
Sbjct: 320 TLLPVDLYHNFAKAISNSINLQRTDDPNQFLEYCFE--TTTDDYKVPFIAMHFE-GANLR 376

Query: 346 LEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           L+++++  +   +  C+A   A  N    + S+ G +AQ  + VGYD+     +F+ M+C
Sbjct: 377 LQRENVLIRVSDNVICLAFAGAQDN----DISIYGNIAQINFLVGYDVTNMSLSFKPMNC 432

Query: 406 EVL 408
             +
Sbjct: 433 VAM 435


>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
 gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
          Length = 473

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 129/431 (29%), Positives = 195/431 (45%), Gaps = 50/431 (11%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNT------YQAQILPSN 54
           L+HRD+I S    P+    H+V   +    AR+ +L++++ +  +        ++++P  
Sbjct: 67  LVHRDAI-SGATYPSRR--HQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVVPGV 123

Query: 55  DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYV 114
           D  +  ++V + +G PP  Q+  +D+GS ++WV C PC  C  Q   +F P+ SSS++ V
Sbjct: 124 DDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGV 183

Query: 115 PCDSEHCRYFP--YARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
            C S  CR              +C Y+  Y  G  T   ++ E LT   T      VQ V
Sbjct: 184 SCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGT-----AVQGV 238

Query: 173 VFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILG 227
             GCG   +  F   +G+ GLG G  SL+ QL  +    FSYC+ S          L+LG
Sbjct: 239 AIGCGHRNSGLFVGAAGLLGLGWGAMSLIGQLGGAAGGVFSYCLASRGAGG--AGSLVLG 296

Query: 228 DGAIIDEGDA-TPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSG 282
               +  G    PL         YY+ L  I V G  L +   +F+  +D  GGV++D+G
Sbjct: 297 RTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDGLFQLTEDGAGGVVMDTG 356

Query: 283 TDVTWLVKEAYEALR---DEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----PTV 334
           T VT L +EAY ALR   D  M  L   +  + S  D  CY      DL G+     PTV
Sbjct: 357 TAVTRLPREAYAALRGAFDGAMGAL--PRSPAVSLLDT-CY------DLSGYASVRVPTV 407

Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
            F+F  GA L L   ++  +     FC+A  P+S        S++G + Q+   +  D  
Sbjct: 408 SFYFDQGAVLTLPARNLLVEVGGAVFCLAFAPSS-----SGISILGNIQQEGIQITVDSA 462

Query: 395 RKQKTFQRMDC 405
                F    C
Sbjct: 463 NGYVGFGPNTC 473


>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 435

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 125/423 (29%), Positives = 196/423 (46%), Gaps = 36/423 (8%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           LIHRDS  SP  NP EN  HRV   +  SI+    L       NT +A I  +       
Sbjct: 34  LIHRDSPKSPMYNPLENHYHRVADTLRRSISHNTGLVT-----NTVEAPIYNNR----GE 84

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + + +S+G PP P     DTGS ++W  C PC +C  Q   +F PS+S++Y  V C S  
Sbjct: 85  YLMKLSVGTPPFPIIAVADTGSDIIWTQCEPCTNCYQQDLPMFNPSKSTTYRKVSCSSPV 144

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C +       ++K  C Y+  Y     +    + + LT  +T    +       GCG   
Sbjct: 145 CSFTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDTLTMGSTSGRVVAFPRTAIGCGHDN 204

Query: 181 NRNF--KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDE 234
             +F    SGI GLG+G +SL+ Q+ S+    FSYC+  + + D   NKL  G  A +  
Sbjct: 205 AGSFDANVSGIVGLGLGPASLIKQMGSAVGGKFSYCLTPIGNDDGGSNKLNFGSNANVSG 264

Query: 235 GDA--TPLQFID---GHYYITLEAISVDGRMLDINPNIFKRDDS--GG--GVMIDSGTDV 285
             A  TP+   D     Y + L+A+SV GR    N   +   +S  GG   ++IDSGT +
Sbjct: 265 SGAVSTPIYISDKFKSFYSLKLKAVSV-GR----NNTFYSTANSILGGKANIIIDSGTTL 319

Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLA 345
           T L  + Y      +   +  ++    +   + C+    ++D    P +  HF  GA L 
Sbjct: 320 TLLPVDLYHNFAKAISNSINLQRTDDPNQFLEYCFE--TTTDDYKVPFIAMHFE-GANLR 376

Query: 346 LEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           L+++++  +   +  C+A   A  N    + S+ G +AQ  + VGYD+     +F+ M+C
Sbjct: 377 LQRENVLIRVSDNVICLAFAGAQDN----DISIYGNIAQINFLVGYDVTNMSLSFKPMNC 432

Query: 406 EVL 408
             +
Sbjct: 433 VAM 435


>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 413

 Score =  156 bits (395), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 120/362 (33%), Positives = 180/362 (49%), Gaps = 32/362 (8%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + + + IG PP+     +DTGS L+WV C PC  C  Q+  +F P +SS+Y  + CDS  
Sbjct: 64  YLMELYIGTPPIKISGTVDTGSDLIWVQCVPCLGCYNQINPMFDPLKSSTYTNISCDSPL 123

Query: 121 CRYFPY-ARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
           C Y PY   CS  K RC YT  Y     T   ++ E +T  +     I +Q ++FGCG +
Sbjct: 124 C-YKPYIGECSPEK-RCDYTYGYADSSLTKGVLAQETVTLTSNTGKPISLQGILFGCGHN 181

Query: 180 TNRNFK--FSGIFGLGIGRSSLVSQL-----NSSFSYCIGSLHDPDYLHNKLILGDGA-I 231
              NF     G+ GLG G +SLVSQ+        FS C+        + +++  G G+ +
Sbjct: 182 NTGNFNDHEMGLIGLGGGPTSLVSQIGPLFGGKKFSQCLVPFLTDITISSQMSFGKGSEV 241

Query: 232 IDEG-DATPLQFID---GHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
           + EG   TPL   +     YY+TL  ISV+   L +N  I K     G +++DSGT    
Sbjct: 242 LGEGVVTTPLVQREQDMTSYYVTLLGISVEDTYLPMNSTIEK-----GNMLVDSGTPPNI 296

Query: 288 LVKEAYEALRDEVMIRLEGEQMR-SYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
           L ++ Y+ +  EV  ++  E +    S   +LCY     ++LKG PT+ +HF  GA L L
Sbjct: 297 LPQQLYDRVYVEVKNKVPLEPITDDPSLGPQLCYR--TQTNLKG-PTLTYHFE-GANLLL 352

Query: 347 EKDSMFYQPRPDA---FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
                F  P P+    FC+A+     N    +  + G  AQ  Y +G+D+ R+  +F+  
Sbjct: 353 TPIQTFIPPTPETKGVFCLAIT----NCANSDPGIYGNFAQTNYLIGFDLDRQIVSFKPT 408

Query: 404 DC 405
           DC
Sbjct: 409 DC 410


>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
 gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score =  156 bits (395), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 129/424 (30%), Positives = 199/424 (46%), Gaps = 34/424 (8%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           L+HRDS  SP  N  +    R  + +  S++R+ + Q   R+  T   + + S  I+N  
Sbjct: 35  LVHRDSPKSPLYNSQQTHLQRWNKAMRRSVSRVHHFQ---RTAATVSPKEVESEIIANGG 91

Query: 61  FYV-NISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSE 119
            Y+ ++S+G PP       DTGS L+W  C PC  C  Q+  +F P  S +Y  + CD+ 
Sbjct: 92  EYLMSLSLGTPPFEILAIADTGSDLIWTQCTPCDKCYKQIAPLFDPKSSKTYRDLSCDTR 151

Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
            C+    +   + +  C Y+  Y     T+  ++ + +T  +T+   ++    V GCG  
Sbjct: 152 QCQNLGESSSCSSEQLCQYSYYYGDRSFTNGNLAVDTVTLPSTNGGPVYFPKTVIGCGRR 211

Query: 180 TNRNF--KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLH-NKLILGDGAII 232
            N  F  K SGI GLG G  SL+SQ+ SS    FSYC+         + +KL  G  A++
Sbjct: 212 NNGTFDKKDSGIIGLGGGPMSLISQMGSSVGGKFSYCLVPFSSESAGNSSKLHFGRNAVV 271

Query: 233 DEG--DATPL--QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW- 287
                 +TPL  +  D  YY+TLEA+SV  +   I         S G ++IDSGT +T  
Sbjct: 272 SGSGVQSTPLISKNPDTFYYLTLEAMSVGDK--KIEFGGSSFGGSEGNIIIDSGTSLTLF 329

Query: 288 ---LVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKL 344
                 E   A+ + V   + GE+ +  S     CY    + DLK  P +  HF  GA +
Sbjct: 330 PVNFFTEFATAVENAV---INGERTQDASGLLSHCYR--PTPDLK-VPVITAHFN-GADV 382

Query: 345 ALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
            L+  + F     D  C+A N         + ++ G +AQ  + +GYDI  K  +F+  D
Sbjct: 383 VLQTLNTFILISDDVLCLAFNSTQ------SGAIFGNVAQMNFLIGYDIQGKSVSFKPTD 436

Query: 405 CEVL 408
           C  L
Sbjct: 437 CTQL 440


>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 455

 Score =  155 bits (393), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 120/390 (30%), Positives = 184/390 (47%), Gaps = 48/390 (12%)

Query: 54  NDISNSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHCYPCRD--------CSPQLGTIFY 104
            D+ N   Y+  +SIG PP+      DTGS L+W  C PC D        C  Q G ++ 
Sbjct: 79  KDLRNGGEYIMTLSIGTPPLSYRAIADTGSDLIWTQCAPCGDTVTDTDNQCFKQSGCLYN 138

Query: 105 PSRSSSYAYVPCDSEHCRYFPYARCSAYKH-------RCIYTQLYLIGPETSVFVSTEQL 157
           PS S+++  +PC+S      P + C+A           C+Y Q Y  G  T+   S E  
Sbjct: 139 PSSSTTFGVLPCNS------PLSMCAAMAGPSPPPGCACMYNQTYGTG-WTAGVQSVETF 191

Query: 158 TF-KNTDESTIHVQDVVFGCGFSTNRNFKFS-GIFGLGIGRSSLVSQLNS-SFSYCIGSL 214
           TF  ++    + V ++ FGC  +++ ++  S G+ GLG G  SLVSQL + +FSYC+   
Sbjct: 192 TFGSSSTPPAVRVPNIAFGCSNASSNDWNGSAGLVGLGRGSMSLVSQLGAGAFSYCLTPF 251

Query: 215 HDPDYLHNKLILGDGAIIDEGDATPLQ---FIDG--------HYYITLEAISVDGRMLDI 263
            D +   + L+LG  A        P++   F+ G        +YY+ L  ISV    L I
Sbjct: 252 QDANS-TSTLLLGPSAAAALKGTGPVRSTPFVAGPSKAPMSTYYYLNLTGISVGETALAI 310

Query: 264 NPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYE----ALRDEVMIRLEGEQMRSYSWPDKL 318
            P+ F  R D  GG++IDSGT +T LV  AY+    A+R  ++ RL       +S    L
Sbjct: 311 PPDAFSLRADGTGGLIIDSGTTITTLVDSAYQQVRAAVRSLLVTRLPLAHGPDHSTGLDL 370

Query: 319 CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSL 378
           C+    S+     P++  HF GGA + L  ++         +C+A+     N      S+
Sbjct: 371 CFALKASTPPPAMPSMTLHFEGGADMVLPVENYMILGS-GVWCLAMR----NQTVGAMSM 425

Query: 379 IGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
           +G   QQ  +V YD+ ++  +F    C  L
Sbjct: 426 VGNYQQQNIHVLYDVRKETLSFAPAVCSSL 455


>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 449

 Score =  155 bits (392), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 121/368 (32%), Positives = 171/368 (46%), Gaps = 34/368 (9%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
           N  F +++SIG P V     +DTGS L+W  C PC +C  Q   +F PS SS+YA +PC 
Sbjct: 99  NGEFLMDMSIGTPAVAYAAIIDTGSDLVWTQCKPCVECFNQSTPVFDPSSSSTYAALPCS 158

Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
           S  C   P ++C++ K  C YT  Y     T   ++ E  T   T      + DV FGCG
Sbjct: 159 STLCSDLPSSKCTSAK--CGYTYTYGDSSSTQGVLAAETFTLAKT-----KLPDVAFGCG 211

Query: 178 FSTNRNFKFS---GIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIID 233
             TN    F+   G+ GLG G  SLVSQL  + FSYC+ SL D     + L+LG  A I 
Sbjct: 212 -DTNEGDGFTQGAGLVGLGRGPLSLVSQLGLNKFSYCLTSLDDTS--KSPLLLGSLATIS 268

Query: 234 EG-------DATPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSG 282
           E          TPL         YY+ L+ ++V    + +  + F  +DD  GGV++DSG
Sbjct: 269 ESAAAASSVQTTPLIRNPSQPSFYYVNLKGLTVGSTHITLPSSAFAVQDDGTGGVIVDSG 328

Query: 283 TDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS-DLKGFPTVRFHFRGG 341
           T +T+L  + Y AL+     +++             C+    S  D    P + FH   G
Sbjct: 329 TSITYLELQGYRALKKAFAAQMKLPAADGSGIGLDTCFEAPASGVDQVEVPKLVFHLD-G 387

Query: 342 AKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
           A L L  ++ M       A C+ V    + +R    S+IG   QQ     YD+G    +F
Sbjct: 388 ADLDLPAENYMVLDSGSGALCLTV----MGSR--GLSIIGNFQQQNIQFVYDVGENTLSF 441

Query: 401 QRMDCEVL 408
             + C  L
Sbjct: 442 APVQCAKL 449


>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
           Full=Nepenthesin-II; Flags: Precursor
 gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
          Length = 438

 Score =  155 bits (391), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 112/355 (31%), Positives = 169/355 (47%), Gaps = 25/355 (7%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + +N++IG P       MDTGS L+W  C PC  C  Q   IF P  SSS++ +PC+S++
Sbjct: 96  YLMNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQY 155

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C+  P   C+   + C YT  Y  G  T  +++TE  TF+     T  V ++ FGCG   
Sbjct: 156 CQDLPSETCN--NNECQYTYGYGDGSTTQGYMATETFTFE-----TSSVPNIAFGCG-ED 207

Query: 181 NRNF---KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAI-IDEG 235
           N+ F     +G+ G+G G  SL SQL    FSYC+ S        + L LG  A  + EG
Sbjct: 208 NQGFGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCMTSYGSSS--PSTLALGSAASGVPEG 265

Query: 236 DATPL----QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVK 290
             +           +YYITL+ I+V G  L I  + F+ +DD  GG++IDSGT +T+L +
Sbjct: 266 SPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQ 325

Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS 350
           +AY A+      ++    +   S     C+           P +   F GG  L L + +
Sbjct: 326 DAYNAVAQAFTDQINLPTVDESSSGLSTCFQQPSDGSTVQVPEISMQFDGGV-LNLGEQN 384

Query: 351 MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           +   P     C+A+  +S     +  S+ G + QQ   V YD+     +F    C
Sbjct: 385 ILISPAEGVICLAMGSSS----QLGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 435


>gi|255553149|ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223543249|gb|EEF44781.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 449

 Score =  155 bits (391), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 137/426 (32%), Positives = 205/426 (48%), Gaps = 36/426 (8%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQ-EKIRSHNTYQAQILPSNDISNS 59
           LIHRDS +SP  NP +    R++   + SI+R    +   I +    Q+ I+P       
Sbjct: 36  LIHRDSSVSPLYNPRDTYFDRLRNSFHRSISRANRFKPNSISARALVQSDIVPGG----G 91

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSE 119
            + + ISIG P V      DTGS L+WV C PC  C  Q   IF P RSSSY  V C +E
Sbjct: 92  EYLMRISIGNPQVEILAIADTGSDLIWVQCQPCEMCYKQNSPIFDPRRSSSYRNVLCGNE 151

Query: 120 HCRYFP-YAR-CSA--YKHRCIYTQLYLIGPETSVFVSTEQL----TFKNTDESTIHVQD 171
            C      AR C A  +   C YT  Y     +   ++ E+     T  NT  +  + Q+
Sbjct: 152 FCNKLDGEARSCDARGFVKTCGYTYSYGDQSFSDGHLAIERFGIGSTNSNTSAAIAYFQE 211

Query: 172 VVFGCGFSTNRNF--KFSGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLI 225
           V FGCG      F    SGI GLG G  SLVSQ    L+  FSYC+    +     +K+ 
Sbjct: 212 VAFGCGTKNGGTFDELGSGIIGLGGGSMSLVSQLGPKLSGKFSYCLVPTSEQSNYTSKIN 271

Query: 226 LGDGAIIDEGD----ATPL--QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
            G+   I   +    +TPL  +  + +YY+TLEAISV+ + L    N++  +   G ++I
Sbjct: 272 FGNDINISGSNYNVVSTPLLPKKPETYYYLTLEAISVENKRLPYT-NLWNGEVEKGNIII 330

Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFR 339
           DSGT +T+L  E +  L   V   ++GE++        +C+    + +L   P +  HF 
Sbjct: 331 DSGTTLTFLDSEFFNNLDSAVEEAVKGERVSDPHGLFNICFKDEKAIEL---PIITAHFT 387

Query: 340 GGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKT 399
            GA + L+  + F +   D  C  + P++      + ++ G +AQ  + VGYD+ +K  +
Sbjct: 388 -GADVELQPVNTFAKVEEDLLCFTMIPSN------DIAIFGNLAQMNFLVGYDLEKKAVS 440

Query: 400 FQRMDC 405
           F   DC
Sbjct: 441 FLPTDC 446


>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
          Length = 383

 Score =  155 bits (391), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 128/398 (32%), Positives = 196/398 (49%), Gaps = 26/398 (6%)

Query: 22  VQRGINISIARLAYLQ--EKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMD 79
           ++R I  S  RL  LQ    + +H     +   + DI +  + + ++IG P +     MD
Sbjct: 1   MKRAIQRSQERLEKLQITSAVNTHQMKDIETPVTPDIGSGEYLIQMAIGTPALSLSAIMD 60

Query: 80  TGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYT 139
           TGS L+W  C PC DCS    +I+ PS SS+Y+ V C S  C+      C+     C Y 
Sbjct: 61  TGSDLVWTKCNPCTDCSTS--SIYDPSSSSTYSKVLCQSSLCQPPSIFSCNN-DGDCEYV 117

Query: 140 QLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSL 199
             Y     TS  +S E  TF  + +S   + ++ FGCG       K  G+ G G G  SL
Sbjct: 118 YPYGDRSSTSGILSDE--TFSISSQS---LPNITFGCGHDNQGFDKVGGLVGFGRGSLSL 172

Query: 200 VSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEGD--ATPL--QFIDGHYYITL 251
           VSQL  S    FSYC+ S  D     + L +G+ A ++     +TPL       HYY++L
Sbjct: 173 VSQLGPSMGNKFSYCLVSRTDSSK-TSPLFIGNTASLEATTVGSTPLVQSSSTNHYYLSL 231

Query: 252 EAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMR 310
           E ISV G+ L I    F  + D  GG++IDSGT +T+L + AY+A+++ ++  +   Q  
Sbjct: 232 EGISVGGQSLAIPTGTFDIQSDGSGGLIIDSGTTLTFLQQTAYDAVKEAMVSSINLPQAD 291

Query: 311 SYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASIN 370
                  LC++   SS+  GFP++ FHF+G      +++ +F     D  C+A+ P   N
Sbjct: 292 GQL---DLCFNQQGSSN-PGFPSMTFHFKGADYDVPKENYLFPDSTSDIVCLAMMPT--N 345

Query: 371 NRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
           +   N ++ G + QQ Y + YD      +F    C+ L
Sbjct: 346 SNLGNMAIFGNVQQQNYQILYDNENNVLSFAPTACDTL 383


>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 453

 Score =  155 bits (391), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 122/358 (34%), Positives = 170/358 (47%), Gaps = 28/358 (7%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++  + +G PP   +  +DTGS ++W+ C PCR C  Q   IF P +S S+A +PC S  
Sbjct: 110 YFTRLGVGTPPRYLYMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPYKSKSFAGIPCSSPL 169

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           CR    + CS  +H C+Y   Y  G  T+   +TE LTF+        +  V  GCG   
Sbjct: 170 CRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNK-----IAKVALGCGHHN 224

Query: 181 NRNF-KFSGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
              F   +G+ GLG GR S  SQ     N  FSYC+          + ++ GD AI    
Sbjct: 225 EGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVD-RSASSKPSSMVFGDAAISRLA 283

Query: 236 DATPL---QFIDGHYYITLEAISVDG-RMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVK 290
             TPL     +D  YY+ L  ISV G R+  ++P++FK D +G GGV+IDSGT VT L +
Sbjct: 284 RFTPLIRNPKLDTFYYVGLIGISVGGVRVRGVSPSLFKLDSAGNGGVIIDSGTSVTRLTR 343

Query: 291 EAYEALRDEVMIRLEGEQMR---SYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALE 347
            AY ALRD    R+    ++    +S  D  CY     S +K  PTV  HFRG       
Sbjct: 344 PAYTALRDA--FRVGARHLKRGPEFSLFDT-CYDLSGQSSVK-VPTVVLHFRGADMALPA 399

Query: 348 KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
            + +       +FC A             S+IG + QQ + V YD+   +  F    C
Sbjct: 400 TNYLIPVDENGSFCFA-----FAGTISGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC 452


>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
          Length = 451

 Score =  155 bits (391), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 125/427 (29%), Positives = 189/427 (44%), Gaps = 64/427 (14%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNT------YQAQILPSN 54
           L+HRD+I S    P+    H+V   +    AR+ +L++++ +  +        ++++P  
Sbjct: 67  LVHRDAI-SGATYPSRR--HQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVVPGV 123

Query: 55  DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYV 114
           D  +  ++V + +G PP  Q+  +D+GS ++WV C PC  C  Q   +F P+ SSS++ V
Sbjct: 124 DDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGV 183

Query: 115 PCDSEHCRYFP--YARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
            C S  CR              +C Y+  Y  G  T   ++ E LT   T      VQ V
Sbjct: 184 SCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGT-----AVQGV 238

Query: 173 VFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILG 227
             GCG   +  F   +G+ GLG G  SLV QL  +    FSYC+ S              
Sbjct: 239 AIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLAS-------------- 284

Query: 228 DGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVT 286
                  G           YY+ L  I V G  L +  ++F+  +D  GGV++D+GT VT
Sbjct: 285 ------RGAGGAGSLASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAVT 338

Query: 287 WLVKEAYEALR---DEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----PTVRFHF 338
            L +EAY ALR   D  M  L   +  + S  D  CY      DL G+     PTV F+F
Sbjct: 339 RLPREAYAALRGAFDGAMGAL--PRSPAVSLLDT-CY------DLSGYASVRVPTVSFYF 389

Query: 339 RGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQK 398
             GA L L   ++  +     FC+A  P+S        S++G + Q+   +  D      
Sbjct: 390 DQGAVLTLPARNLLVEVGGAVFCLAFAPSS-----SGISILGNIQQEGIQITVDSANGYV 444

Query: 399 TFQRMDC 405
            F    C
Sbjct: 445 GFGPNTC 451


>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
          Length = 456

 Score =  154 bits (389), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 128/431 (29%), Positives = 194/431 (45%), Gaps = 41/431 (9%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQI------LPSN 54
            IHRDS  SP+  P+  P H           R A L   +   +     +      + S 
Sbjct: 34  FIHRDSARSPFAQPSL-PPHARALAAARRSLRGAALGRYVGGASPAPGPVPEADGGVESK 92

Query: 55  DISNS---LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYP--CRDCSPQLGTIFYPSRSS 109
            I+ S   L YVN+  G PP       DTGS L+WV+C        +     +F+PSRS+
Sbjct: 93  IITRSFEYLMYVNV--GTPPAQMLAIADTGSDLVWVNCSSNGGGGGASDGAVVFHPSRST 150

Query: 110 SYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDEST--- 166
           +Y+ + C S  C+    A C A    C Y   Y  G  T   +STE  +F          
Sbjct: 151 TYSLLSCQSAACQALSQASCDA-DSECQYQYAYGDGSRTIGVLSTETFSFAAAGGGGEGQ 209

Query: 167 IHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYL 220
           + V  V FGC   +  +F+  G+ GLG G  SLVSQL ++      FSYC+   +     
Sbjct: 210 VRVPRVSFGCSTGSAGSFRSDGLVGLGAGALSLVSQLGAAARIARRFSYCLVPPYAAANS 269

Query: 221 HNKLILGDGAII-DEGDA-TPL--QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
            + L  G  A++ D G A TPL    +D +Y + LE+++V G+  D+      R      
Sbjct: 270 SSTLSFGARAVVSDPGAASTPLVPSEVDSYYTVALESVAVAGQ--DVASANSSR------ 321

Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCY--HGIMSSDLKGFPTV 334
           +++DSGT +T+L       L  E+  R+   + +      +LCY   G   ++  G P V
Sbjct: 322 IIVDSGTTLTFLDPALLRPLVAELERRIRLPRAQPPEQLLQLCYDVQGKSQAEDFGIPDV 381

Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
              F GGA + L  ++ F        C+ + P S +      S++G +AQQ ++VGYD+ 
Sbjct: 382 TLRFGGGASVTLRPENTFSLLEEGTLCLVLVPVSESQ---PVSILGNIAQQNFHVGYDLD 438

Query: 395 RKQKTFQRMDC 405
            +  TF  +DC
Sbjct: 439 ARTVTFAAVDC 449


>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 451

 Score =  154 bits (389), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 124/429 (28%), Positives = 198/429 (46%), Gaps = 47/429 (10%)

Query: 9   SPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIG 68
           SP+ +P +         + +   RL +L  + +     ++ ++      +  ++V++ IG
Sbjct: 39  SPFPSPTQ--------ALALDTRRLHFLSLRRKPVPFVKSPVVSGASSGSGQYFVDLRIG 90

Query: 69  QPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ-LGTIFYPSRSSSYAYVPCDSEHCRYFP-- 125
           QPP       DTGS L+WV C  CR+CS     T+F+P  SS+++   C    CR  P  
Sbjct: 91  QPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDPVCRLVPKP 150

Query: 126 --YARCSAYK--HRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS-- 179
               RC+  +    C Y   Y  G  TS   + E  + K +      ++ V FGCGF   
Sbjct: 151 GRAPRCNHTRIHSTCPYEYGYADGSLTSGLFARETTSLKTSSGKEAKLKSVAFGCGFRIS 210

Query: 180 ----TNRNFKFS-GIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGA 230
               +  +F  + G+ GLG G  S  SQL     + FSYC+          + LI+GDG 
Sbjct: 211 GQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPPTSYLIIGDG- 269

Query: 231 IIDEGDA------TPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMID 280
               GDA      TPL         YY+ L+++ V+G  L I+P+I++ DDSG GG ++D
Sbjct: 270 ----GDAVSKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVMD 325

Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYH--GIMSSDLKGFPTVRFHF 338
           SGT + +L   AY  +   V  R++       +    LC +  G+   + K  P ++F F
Sbjct: 326 SGTTLAFLADPAYRLVIAAVKQRIKLPNADELTPGFDLCVNVSGVTKPE-KILPRLKFEF 384

Query: 339 RGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQK 398
            GGA       + F +      C+A+   S++ + V FS+IG + QQ +   +D  R + 
Sbjct: 385 SGGAVFVPPPRNYFIETEEQIQCLAIQ--SVDPK-VGFSVIGNLMQQGFLFEFDRDRSRL 441

Query: 399 TFQRMDCEV 407
            F R  C +
Sbjct: 442 GFSRRGCAL 450


>gi|225427556|ref|XP_002266575.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 445

 Score =  154 bits (389), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 122/421 (28%), Positives = 192/421 (45%), Gaps = 32/421 (7%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
            I RDS  SP+ NP+E    R+Q+    SI R  + +    S N  Q+ ++         
Sbjct: 38  FISRDSPRSPFYNPSETKYQRLQKAFRRSILRGNHFRAIRASPNDIQSNVISGG----GS 93

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + +NIS+G PPV      DTGS L+W  C PC DC  Q+  +F P +S +Y  + C+++ 
Sbjct: 94  YLMNISLGTPPVSMLGIADTGSDLIWRQCLPCDDCYKQVEPLFDPKKSKTYKTLGCNNDF 153

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C+           + C  +  Y     T   +S+E  T  +T+        + FGCG S 
Sbjct: 154 CQDLGQQGSCGDDNTCTSSYSYGDQSYTRRDLSSETFTIGSTEGDPASFPGLAFGCGHSN 213

Query: 181 NRNF--KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDE 234
              F  K SG+ GLG G  SLV QL+S     FSYC+  L       +K+  G  A++  
Sbjct: 214 GGTFNEKDSGLIGLGGGPLSLVMQLSSKVGGQFSYCLVPLSSDSTASSKINFGKSAVVSG 273

Query: 235 GDATPLQFIDG----HYYITLEAISVDGRMLDINPNIFKRDDSG------GGVMIDSGTD 284
                   I G     YY+TLE +S+    +      F ++ S         ++IDSGT 
Sbjct: 274 SGTVSTPLIKGTPDTFYYLTLEGMSLGSEKVAFKG--FSKNKSSPAAAEESNIIIDSGTT 331

Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKL 344
           +T L ++ Y  +   +   + G+          LCY G+   ++   PT+  HF  GA +
Sbjct: 332 LTLLPRDFYTDMESALTKVIGGQTTTDPRGTFSLCYSGVKKLEI---PTITAHFI-GADV 387

Query: 345 ALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
            L   + F Q + D  C ++ P+S      N ++ G ++Q  + VGYD+   + +F+  D
Sbjct: 388 QLPPLNTFVQAQEDLVCFSMIPSS------NLAIFGNLSQMNFLVGYDLKNNKVSFKPTD 441

Query: 405 C 405
           C
Sbjct: 442 C 442


>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 491

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 122/352 (34%), Positives = 173/352 (49%), Gaps = 22/352 (6%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++  + +GQP  P +  +DTGS + W+ C PC DC  Q   IF P  SSS+A +PC+S+ 
Sbjct: 155 YFSRVGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRSSSSFASLPCESQQ 214

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C+    + C A K  C+Y   Y  G  T      E LTF N+      + +V  GCG   
Sbjct: 215 CQALETSGCRASK--CLYQVSYGDGSFTVGEFVIETLTFGNSG----MINNVAVGCGHDN 268

Query: 181 NRNF-KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
              F   +G+ GLG G  SL SQ+  SSFSYC+  +       + L     A  D  +A 
Sbjct: 269 EGLFVGSAGLLGLGGGSLSLTSQMKASSFSYCL--VDRDSSSSSDLEFNSAAPSDSVNAP 326

Query: 239 PLQF--IDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYEA 295
            L+   +D  YY+ L  +SV G++L I PN+F+ DDSG GG+++DSGT +T L  +AY  
Sbjct: 327 LLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYGGIIVDSGTAITRLQTQAYNT 386

Query: 296 LRDEVMIRLE-GEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL-EKDSMFY 353
           LRD  + R    ++   ++  D  CY  + S      PTV F F GG  L L  K+ +  
Sbjct: 387 LRDAFVSRTPYLKKTNGFALFDT-CYD-LSSQSRVTIPTVSFEFAGGKSLQLPPKNYLIP 444

Query: 354 QPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
                 FC A  P +      + S+IG + QQ   V YD+      F    C
Sbjct: 445 VDSVGTFCFAFAPTT-----SSLSIIGNVQQQGTRVHYDLANSVVGFSPHKC 491


>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
 gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
          Length = 457

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 129/431 (29%), Positives = 196/431 (45%), Gaps = 43/431 (9%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQI------LPSN 54
            IHRDS  SPY +P  +P H           R   L       +   A +      + S 
Sbjct: 37  FIHRDSARSPYRHPALSP-HARALAAARRSLRGEVLGRSYSGASPAAAPVSAADGGVESK 95

Query: 55  DISNS---LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC----RDCSPQLGTIFYPSR 107
            I+ S   L YVN+  G PP       DTGS L+WV+C        D       +F P+R
Sbjct: 96  IITRSFEYLMYVNV--GTPPTQLLAIADTGSDLVWVNCSSSGGGLADADAGGNVVFQPTR 153

Query: 108 SSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTF-KNTDEST 166
           SS+Y+ + C S  C+    A C A    C Y   Y  G  T   +STE  +F     +  
Sbjct: 154 SSTYSQLSCQSNACQALSQASCDA-DSECQYQYSYGDGSRTIGVLSTETFSFVDGGGKGQ 212

Query: 167 IHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYL 220
           + V  V FGC  ++   F+  G+ GLG G  SLVSQL ++       SYC+   +D +  
Sbjct: 213 VRVPRVNFGCSTASAGTFRSDGLVGLGAGAFSLVSQLGATTHIDRKLSYCLIPSYDANS- 271

Query: 221 HNKLILGDGAIIDEGDA--TPL--QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
            + L  G  A++ E  A  TPL    +D +Y + LE+++V G+       +   D     
Sbjct: 272 SSTLNFGSRAVVSEPGAASTPLVPSDVDSYYTVALESVAVGGQ------EVATHDSR--- 322

Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCY--HGIMSSDLKGFPTV 334
           +++DSGT +T+L       L  E+  R++ ++++      +LCY   G   +D  G P V
Sbjct: 323 IIVDSGTTLTFLDPALLGPLVTELERRIKLQRVQPPEQLLQLCYDVQGKSETDNFGIPDV 382

Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
              F GGA + L  ++ F   +    C+ + P S +      S++G +AQQ ++VGYD+ 
Sbjct: 383 TLRFGGGAAVTLRPENTFSLLQEGTLCLVLVPVSESQ---PVSILGNIAQQNFHVGYDLD 439

Query: 395 RKQKTFQRMDC 405
            +  TF   DC
Sbjct: 440 ARTVTFAAADC 450


>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
 gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
          Length = 485

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 126/439 (28%), Positives = 191/439 (43%), Gaps = 50/439 (11%)

Query: 1   LIHRDSILSPYNNPNE-NPAHRVQRGINISIARLAYLQEKIR------------------ 41
           L+HRD++    N  NE + A R+Q+ +    AR+A +  ++                   
Sbjct: 63  LVHRDAMKGNSNKNNELSYAERMQQRLKRDAARVAAINSRLELAVNGIKRSSLKPDSSSS 122

Query: 42  ---SHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ 98
              + + +Q+ ++   D  +  ++  I +G P   Q   +DTGS + W+ C PC DC  Q
Sbjct: 123 FTMAESDFQSPVVSGMDQGSGEYFSRIGVGAPRRDQLMVLDTGSDVTWIQCEPCSDCYQQ 182

Query: 99  LGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLT 158
              I+ P+ SSSY  V C +  C+    + CS     C+Y   Y  G  T    +TE LT
Sbjct: 183 SDPIYNPALSSSYKLVGCQANLCQQLDVSGCS-RNGSCLYQVSYGDGSYTQGNFATETLT 241

Query: 159 FKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGR-----SSLVSQLNSSFSYCIGS 213
                     +Q+V  GCG      F  +       G      S L  +    FSYC+  
Sbjct: 242 LGGA-----PLQNVAIGCGHDNEGLFVGAAGLLGLGGGSLSFPSQLTDENGKIFSYCL-- 294

Query: 214 LHDPDYLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKR 270
           +       + L  G  A+ +     P+     +D  YY++L  ISV G+ML I+ ++F  
Sbjct: 295 VDRDSESSSTLQFGRAAVPNGAVLAPMLKNSRLDTFYYVSLSGISVGGKMLSISDSVFGI 354

Query: 271 DDSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD--KLCYHGIMSSD 327
           D SG GGV++DSGT VT L   AY++LRD    R   + + S         CY  + S +
Sbjct: 355 DASGNGGVIVDSGTAVTRLQTAAYDSLRD--AFRAGTKNLPSTDGVSLFDTCYD-LSSKE 411

Query: 328 LKGFPTVRFHFRGGAKLAL-EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQF 386
               PTV FHF GG  ++L  K+ +        FC A  P S      + S++G + QQ 
Sbjct: 412 SVDVPTVVFHFSGGGSMSLPAKNYLVPVDSMGTFCFAFAPTS-----SSLSIVGNIQQQG 466

Query: 387 YNVGYDIGRKQKTFQRMDC 405
             V +D    Q  F    C
Sbjct: 467 IRVSFDRANNQVGFAVNKC 485


>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 420

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 114/367 (31%), Positives = 172/367 (46%), Gaps = 44/367 (11%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + + +SIG PP   +   DTGS L W  C PC +C  Q   +F P +S++Y  + CDS+ 
Sbjct: 72  YLMELSIGTPPFKIYGIADTGSDLTWTSCVPCNNCYKQRNPMFDPQKSTTYRNISCDSKL 131

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C       CS  K RC YT  Y     T   ++ E +T  +T   ++ ++ +VFGCG + 
Sbjct: 132 CHKLDTGVCSPQK-RCNYTYAYASAAITRGVLAQETITLSSTKGKSVPLKGIVFGCGHNN 190

Query: 181 NRNFK--FSGIFGLGIGRSSLVSQLNSS-----FSYCIGSLHDPDYLHNKLILGDGA--- 230
              F     GI GLG G  SL+SQ+ SS     FS C+   H    + +K+  G G+   
Sbjct: 191 TGGFNDHEMGIIGLGGGPVSLISQMGSSFGGKRFSQCLVPFHTDVSVSSKMSFGKGSKVS 250

Query: 231 --------IIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSG 282
                   ++ + D TP       Y++TL  ISV+   L  N +   ++   G + +DSG
Sbjct: 251 GKGVVSTPLVAKQDKTP-------YFVTLLGISVENTYLHFNGS--SQNVEKGNMFLDSG 301

Query: 283 TDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD---KLCYHGIMSSDLKGFPTVRFHFR 339
           T  T L  + Y+ +  +V  R E         PD   +LCY     ++L+G P +  HF 
Sbjct: 302 TPPTILPTQLYDQVVAQV--RSEVAMKPVTDDPDLGPQLCYR--TKNNLRG-PVLTAHFE 356

Query: 340 GGAKLALEKDSMFYQPRPDAFCMA-VNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQK 398
            GA + L     F  P+   FC+   N +S    Y NF      AQ  Y +G+D+ R+  
Sbjct: 357 -GADVKLSPTQTFISPKDGVFCLGFTNTSSDGGVYGNF------AQSNYLIGFDLDRQVV 409

Query: 399 TFQRMDC 405
           +F+  DC
Sbjct: 410 SFKPKDC 416


>gi|357449529|ref|XP_003595041.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355484089|gb|AES65292.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 210

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 86/214 (40%), Positives = 125/214 (58%), Gaps = 25/214 (11%)

Query: 198 SLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVD 257
           SL +Q++  FSYC+GSL D DY +N+LILG+ A +  GD TP Q  +G  ++T+E IS+ 
Sbjct: 17  SLATQISKKFSYCMGSLTDKDYDYNQLILGEEAYL-AGDTTPFQVYNGVNHVTMEGISIG 75

Query: 258 GRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEV---MIRLEGEQMRSYSW 314
            + LDI P  FK  ++               V + YE L  EV     RL+ +++R    
Sbjct: 76  QKSLDIAPGTFKMKNN---------------VNDVYELLCKEVRNLFQRLKFQEVRLQGS 120

Query: 315 PDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYV 374
           P  LCY G +S DLKGFP V F+F GGA + L+  + F Q + D FCM+V+P+       
Sbjct: 121 PWALCYFGSVSRDLKGFPVVTFYFAGGAVIGLDTLNFFVQAKDDVFCMSVHPSH------ 174

Query: 375 NFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
           + S+IG++AQQ YNVGYD  +     + +DC++L
Sbjct: 175 DLSVIGLLAQQSYNVGYDKDKGLIYIESIDCQLL 208


>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
 gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
          Length = 490

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 128/430 (29%), Positives = 189/430 (43%), Gaps = 48/430 (11%)

Query: 2   IHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILP-SNDISNSL 60
           +H    LS    P +    R+ R  +  +  L  L   + S N  +A+    S+ +++ L
Sbjct: 82  LHHLDALSSDETPQDLFNSRLARDAS-RVKSLTSLAAAVGSTNRTRARGPGFSSSVTSGL 140

Query: 61  ------FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYV 114
                 ++  + +G P    F  +DTGS ++W+ C PC+ C  Q   +F P++S S+A +
Sbjct: 141 AQGSGEYFTRLGVGTPARYVFMVLDTGSDVVWIQCAPCKKCYSQTDPVFNPTKSRSFANI 200

Query: 115 PCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
           PC S  CR      CS  KH C+Y   Y  G  T    STE LTF+ T      V  V  
Sbjct: 201 PCGSPLCRRLDSPGCSTKKHICLYQVSYGDGSFTYGEFSTETLTFRGT-----RVGRVAL 255

Query: 175 GCGFSTNRNF----------KFSGIFGLGIGRSSLVSQLNSSFSYCI---GSLHDPDYLH 221
           GCG      F          +    F   IGR     + +  FSYC+    +   P Y  
Sbjct: 256 GCGHDNEGLFIGAAGLLGLGRGRLSFPSQIGR-----RFSRKFSYCLVDRSASSKPSY-- 308

Query: 222 NKLILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDG-RMLDINPNIFKRDDSG-GG 276
             ++ GD AI      TPL     +D  YY+ L  +SV G R+  I  ++FK D +G GG
Sbjct: 309 --MVFGDSAISRTARFTPLVSNPKLDTFYYVELLGVSVGGTRVPGITASLFKLDSTGNGG 366

Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
           V+IDSGT VT L + AY ALRD   +     ++   +S  D  C+     +++K  PTV 
Sbjct: 367 VIIDSGTSVTRLTRPAYVALRDAFRVGASNLKRAPEFSLFDT-CFDLSGKTEVK-VPTVV 424

Query: 336 FHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGR 395
            HFRG        + +       +FC A             S++G + QQ + V YD+  
Sbjct: 425 LHFRGADVSLPASNYLIPVDNSGSFCFA-----FAGTMSGLSIVGNIQQQGFRVVYDLAA 479

Query: 396 KQKTFQRMDC 405
            +  F    C
Sbjct: 480 SRVGFAPRGC 489


>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
          Length = 464

 Score =  152 bits (383), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 127/429 (29%), Positives = 190/429 (44%), Gaps = 51/429 (11%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQ--------AQILP 52
           L+ RD++      P+    H V   +    AR  YL  ++ S   YQ        ++++ 
Sbjct: 63  LVRRDAVTGS-TYPSRR--HAVLDLVARDNARAEYLASRL-SPAAYQPTGFSGSESKVVS 118

Query: 53  SNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYA 112
             D  +  ++V + IG PP  Q+  +D+GS ++WV C PC +C  Q   +F P+ S++++
Sbjct: 119 GLDEGSGEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLECYAQADPLFDPATSATFS 178

Query: 113 YVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
            VPC S  CR    + C      C Y   Y  G  T   ++ E LT   T      V+ V
Sbjct: 179 AVPCGSAVCRTLRTSGCGD-SGGCDYEVSYGDGSYTKGALALETLTLGGT-----AVEGV 232

Query: 173 VFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILG 227
             GCG      F   +G+ GLG G  SLV QL      +FSYC+ S          L+LG
Sbjct: 233 AIGCGHRNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRG-----AGSLVLG 287

Query: 228 DGAIIDEGDA-TPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSG 282
               + EG    PL         YY+ L  I V    L +  ++F+  +D  GGV++D+G
Sbjct: 288 RSEAVPEGAVWVPLVRNPQAPSFYYVGLSGIGVGDERLPLQEDLFQLTEDGAGGVVMDTG 347

Query: 283 TDVTWLVKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGF-----PTVRF 336
           T VT L +EAY ALRD  +  +    +    S  D  CY      DL G+     PTV F
Sbjct: 348 TAVTRLPQEAYAALRDAFVAAVGALPRAPGVSLLDT-CY------DLSGYTSVRVPTVSF 400

Query: 337 HFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
           +F G A L L   ++  +     +C+A  P+S        S++G + Q+   +  D    
Sbjct: 401 YFDGAATLTLPARNLLLEVDGGIYCLAFAPSSSGP-----SILGNIQQEGIQITVDSANG 455

Query: 397 QKTFQRMDC 405
              F    C
Sbjct: 456 YIGFGPTTC 464


>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
          Length = 524

 Score =  152 bits (383), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 120/405 (29%), Positives = 178/405 (43%), Gaps = 46/405 (11%)

Query: 31  ARLAYLQEKIR------SHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSL 84
           AR  YL  ++         +  +++++   D  +  + V +S+G PP  Q+  +D+GS +
Sbjct: 135 ARAEYLATRLSPAYQPPGFSGSESKVVSGLDEGSGEYLVRVSVGSPPTEQYLVVDSGSDV 194

Query: 85  LWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCS-AYKHRCIYTQLYL 143
           +WV C PC +C  Q   +F P+ S++++ V C S  CR  P + C       C Y   Y 
Sbjct: 195 MWVQCKPCLECYVQADPLFDPATSATFSGVSCGSAICRILPTSACGDGELGGCEYEVSYA 254

Query: 144 IGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQ 202
            G  T   ++ E LT   T      V+ VV GCG      F   +G+ GLG G  SLV Q
Sbjct: 255 DGSYTKGALALETLTLGGT-----AVEGVVIGCGHRNRGLFVGAAGLMGLGWGPMSLVGQ 309

Query: 203 L----NSSFSYCIGSLHD-----PDYLHNKLILGDGAIIDEGDA-TPL---QFIDGHYYI 249
           L      +FSYC+ S         D     L+LG    + EG    PL         YY+
Sbjct: 310 LGGEVGGAFSYCLASRGGYGSGAADDDAGWLVLGRSEAVPEGAVWVPLVRNPRAPSFYYV 369

Query: 250 TLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQ 308
            L  I V    L +   +F+  +D  G V++D+GT VT L +EAY ALRD  +  L G  
Sbjct: 370 GLSGIEVGDERLPLQAGLFQLTEDGAGDVVMDTGTTVTRLPQEAYAALRDAFVGALAGAV 429

Query: 309 MRSYSWPDKL---CYHGIMSSDLKGF-----PTVRFHFRGGAKLALEKDSMFYQPRPDAF 360
            R+      +   CY      DL G+     PTV F F G A+L L   ++  +     +
Sbjct: 430 PRAQGVSSSVLDTCY------DLSGYASVRVPTVSFCFDGDARLILAARNVLLEVDMGIY 483

Query: 361 CMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           C+A  P+S        S++G   Q    +  D       F   +C
Sbjct: 484 CLAFAPSS-----SGLSIMGNTQQAGIQITVDSANGYIGFGPANC 523


>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 438

 Score =  152 bits (383), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 125/419 (29%), Positives = 205/419 (48%), Gaps = 30/419 (7%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           LIHRDS  SP+  P +N    V   ++ SI R+ +  +   +       I    D     
Sbjct: 32  LIHRDSSKSPFYKPTQNKYQHVVDAVHRSINRVNHSNKNSLASTPESTVISYEGD----- 86

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + ++ S+G PP+  +  +DTGS ++W+ C PC  C  Q    F PS+SSSY  + C S+ 
Sbjct: 87  YIMSYSVGTPPIKSYGIVDTGSDIVWLQCEPCEQCYNQTTPKFNPSKSSSYKNISCSSKL 146

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C+      C+  K  C Y+  Y     +   +S E LT ++T    +     V GCG + 
Sbjct: 147 CQSVRDTSCND-KKNCEYSINYGNQSHSQGDLSLETLTLESTTGRPVSFPKTVIGCGTNN 205

Query: 181 NRNFK--FSGIFGLGIGRSSLVSQLNSS----FSYCIG----SLHDPDYLHNKLILGDGA 230
             +FK   SG+ GLG G +SL++QL  S    FSYC+     +L +     +KL  GD A
Sbjct: 206 IGSFKRVSSGVVGLGGGPASLITQLGPSIGGKFSYCLVRMSITLKNMSMGSSKLNFGDVA 265

Query: 231 IIDEGD--ATPLQFIDGH--YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVT 286
           I+   +  +TP+   D    YY+T+EA SV  + ++   +   +    G ++IDS T VT
Sbjct: 266 IVSGHNVLSTPIVKKDHSFFYYLTIEAFSVGDKRVEFAGS--SKGVEEGNIIIDSSTIVT 323

Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
           ++  + Y  L   ++  +  E++   +    LCY+ + S +   FP +  HF+ GA + L
Sbjct: 324 FVPSDVYTKLNSAIVDLVTLERVDDPNQQFSLCYN-VSSDEEYDFPYMTAHFK-GADILL 381

Query: 347 EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
              + F +   D  C A  P++        ++ G  +QQ + VGYD+ +K  +F+ +DC
Sbjct: 382 YATNTFVEVARDVLCFAFAPSN------GGAIFGSFSQQDFMVGYDLQQKTVSFKSVDC 434


>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
          Length = 437

 Score =  151 bits (382), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 111/355 (31%), Positives = 167/355 (47%), Gaps = 26/355 (7%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + +N++IG P       MDTGS L+W  C PC  C  Q   IF P  SSS++ +PC+S++
Sbjct: 96  YLMNVAIGTPASSLSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQY 155

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C+  P   C    + C YT  Y  G  T  +++TE  TF+     T  V ++ FGCG   
Sbjct: 156 CQDLPSESC---YNDCQYTYGYGDGSSTQGYMATETFTFE-----TSSVPNIAFGCG-ED 206

Query: 181 NRNF---KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAI-IDEG 235
           N+ F     +G+ G+G G  SL SQL    FSYC+          + L LG  A  + EG
Sbjct: 207 NQGFGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCM--TSSGSSSPSTLALGSAASGVPEG 264

Query: 236 DATPL----QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVK 290
             +           +YYITL+ I+V G  L I  + F+ +DD  GG++IDSGT +T+L +
Sbjct: 265 SPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQ 324

Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS 350
           +AY A+      ++    +   S     C+           P +   F GG  L L +++
Sbjct: 325 DAYNAVAQAFTDQINLSPVDESSSGLSTCFQLPSDGSTVQVPEISMQFDGGV-LNLGEEN 383

Query: 351 MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           +   P     C+A+  +S        S+ G + QQ   V YD+     +F    C
Sbjct: 384 VLISPAEGVICLAMGSSSQQ----GISIFGNIQQQETQVLYDLQNLAVSFVPTQC 434


>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
 gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
          Length = 471

 Score =  151 bits (381), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 128/434 (29%), Positives = 191/434 (44%), Gaps = 53/434 (12%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQ--------AQILP 52
           L+ RD++         +P H V   ++   AR  YL  ++     YQ        ++++ 
Sbjct: 62  LVRRDAVT---GATYPSPRHAVLDLVSRDNARAEYLASRLSP--AYQPTDFFGSESKVVS 116

Query: 53  SNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYA 112
             D  +  ++V + IG PP  Q+  +D+GS ++WV C PC +C  Q   +F P+ S++++
Sbjct: 117 GLDEGSGEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLECYAQADPLFDPASSATFS 176

Query: 113 YVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
            V C S  CR    + C      C Y   Y  G  T   ++ E LT   T      V+ V
Sbjct: 177 AVSCGSAICRTLRTSGCGD-SGGCEYEVSYGDGSYTKGTLALETLTLGGT-----AVEGV 230

Query: 173 VFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCI----GSLHDPDYLHNK 223
             GCG      F   +G+ GLG G  SLV QL      +FSYC+    GS          
Sbjct: 231 AIGCGHRNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGGSGSGAADAAGS 290

Query: 224 LILGDGAIIDEGDA-TPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVM 278
           L+LG    + EG    PL         YY+ +  I V    L +   +F+  +D GGGV+
Sbjct: 291 LVLGRSEAVPEGAVWVPLVRNPQAPSFYYVGVSGIGVGDERLPLQDGLFQLTEDGGGGVV 350

Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS--YSWPDKLCYHGIMSSDLKGF----- 331
           +D+GT VT L +EAY ALRD   +   G   R+   S  D  CY      DL G+     
Sbjct: 351 MDTGTAVTRLPQEAYAALRD-AFVGAVGALPRAPGVSLLDT-CY------DLSGYTSVRV 402

Query: 332 PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
           PTV F+F G A L L   ++  +     +C+A  P+S        S++G + Q+   +  
Sbjct: 403 PTVSFYFDGAATLTLPARNLLLEVDGGIYCLAFAPSS-----SGLSILGNIQQEGIQITV 457

Query: 392 DIGRKQKTFQRMDC 405
           D       F    C
Sbjct: 458 DSANGYIGFGPATC 471


>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
           protein [Arabidopsis thaliana]
 gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
 gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 452

 Score =  150 bits (380), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 121/424 (28%), Positives = 198/424 (46%), Gaps = 37/424 (8%)

Query: 9   SPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIG 68
           SP+ +P +         + +   RL +L  + +     ++ ++      +  ++V++ IG
Sbjct: 40  SPFPSPTQ--------ALALDTRRLHFLSLRRKPIPFVKSPVVSGAASGSGQYFVDLRIG 91

Query: 69  QPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ-LGTIFYPSRSSSYAYVPCDSEHCRYFPYA 127
           QPP       DTGS L+WV C  CR+CS     T+F+P  SS+++   C    CR  P  
Sbjct: 92  QPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDPVCRLVPKP 151

Query: 128 -RCSAYKH-----RCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS-- 179
            R     H      C Y   Y  G  TS   + E  + K +      ++ V FGCGF   
Sbjct: 152 DRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTSSGKEARLKSVAFGCGFRIS 211

Query: 180 ----TNRNFKFS-GIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGA 230
               +  +F  + G+ GLG G  S  SQL     + FSYC+          + LI+G+G 
Sbjct: 212 GQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPPTSYLIIGNGG 271

Query: 231 -IIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDV 285
             I +   TPL         YY+ L+++ V+G  L I+P+I++ DDSG GG ++DSGT +
Sbjct: 272 DGISKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGTTL 331

Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYH--GIMSSDLKGFPTVRFHFRGGAK 343
            +L + AY ++   V  R++     + +    LC +  G+   + K  P ++F F GGA 
Sbjct: 332 AFLAEPAYRSVIAAVRRRVKLPIADALTPGFDLCVNVSGVTKPE-KILPRLKFEFSGGAV 390

Query: 344 LALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
                 + F +      C+A+   S++ + V FS+IG + QQ +   +D  R +  F R 
Sbjct: 391 FVPPPRNYFIETEEQIQCLAIQ--SVDPK-VGFSVIGNLMQQGFLFEFDRDRSRLGFSRR 447

Query: 404 DCEV 407
            C +
Sbjct: 448 GCAL 451


>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 479

 Score =  150 bits (380), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 118/366 (32%), Positives = 174/366 (47%), Gaps = 23/366 (6%)

Query: 47  QAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPS 106
           Q  I+      +  ++  + IG+P  P +  +DTGS + W+ C PC DC  Q   IF P+
Sbjct: 130 QGPIISGTSQGSGEYFSRVGIGKPSSPVYMVLDTGSDVNWIQCAPCADCYHQADPIFEPA 189

Query: 107 RSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDEST 166
            S+SY+ + CD++ C+    + C    + C+Y   Y  G  T     TE +T       +
Sbjct: 190 SSTSYSPLSCDTKQCQSLDVSEC--RNNTCLYEVSYGDGSYTVGDFVTETITL-----GS 242

Query: 167 IHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKL 224
             V +V  GCG +    F   +G+ GLG G+ S  SQ+N SSFSYC   L D D      
Sbjct: 243 ASVDNVAIGCGHNNEGLFIGAAGLLGLGGGKLSFPSQINASSFSYC---LVDRDSDSAST 299

Query: 225 ILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMID 280
           +  + A++      PL   + +D  YY+ +  +SV G +L I  ++F+ D+SG GG++ID
Sbjct: 300 LEFNSALLPHAITAPLLRNRELDTFYYVGMTGLSVGGELLSIPESMFEMDESGNGGIIID 359

Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRG 340
           SGT VT L   AY ALRD  +   +   + S       CY     + ++  PTV FH  G
Sbjct: 360 SGTAVTRLQTAAYNALRDAFVKGTKDLPVTSEVALFDTCYDLSRKTSVE-VPTVTFHLAG 418

Query: 341 GAKLALEKDSMFYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKT 399
           G  L L   +       D  FC A  P S        S+IG + QQ   VG+D+      
Sbjct: 419 GKVLPLPATNYLIPVDSDGTFCFAFAPTS-----SALSIIGNVQQQGTRVGFDLANSLVG 473

Query: 400 FQRMDC 405
           F+   C
Sbjct: 474 FEPRQC 479


>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
          Length = 336

 Score =  149 bits (375), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 100/345 (28%), Positives = 162/345 (46%), Gaps = 23/345 (6%)

Query: 78  MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCI 137
           MDTGS L+W  C PC  C+ Q    F   +S++Y  +PC S  C       C  +K  C+
Sbjct: 1   MDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRCASLSSPSC--FKKMCV 58

Query: 138 YTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG-FSTNRNFKFSGIFGLGIGR 196
           Y   Y     T+  ++ E  TF   + + +   ++ FGCG  +       SG+ G G G 
Sbjct: 59  YQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLNAGDLANSSGMVGFGRGP 118

Query: 197 SSLVSQLNSS-FSYCIGSL--HDPDYLHNKLILGDGAIIDEGDATPLQ--------FIDG 245
            SLVSQL  S FSYC+ S     P  L+   +  + +  +    +P+Q         +  
Sbjct: 119 LSLVSQLGPSRFSYCLTSYLSATPSRLYFG-VYANLSSTNTSSGSPVQSTPFVINPALPN 177

Query: 246 HYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRL 304
            Y+++L+AIS+  ++L I+P +F  +D G GGV+IDSGT +TWL ++AYEA+R  ++  +
Sbjct: 178 MYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVSAI 237

Query: 305 EGEQMRSYSWPDKLCYHGIMSSDLK-GFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMA 363
               M         C+      ++    P + FHF       L ++ M         C+ 
Sbjct: 238 PLPAMNDTDIGLDTCFQWPPPPNVTVTVPDLVFHFDSANMTLLPENYMLIASTTGYLCLV 297

Query: 364 VNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
           + P  +       ++IG   QQ  ++ YDIG    +F    C+++
Sbjct: 298 MAPTGVG------TIIGNYQQQNLHLLYDIGNSFLSFVPAPCDII 336


>gi|15222357|ref|NP_174430.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12322538|gb|AAG51267.1|AC027135_8 chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
 gi|67633408|gb|AAY78629.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332193236|gb|AEE31357.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 445

 Score =  147 bits (372), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 134/427 (31%), Positives = 192/427 (44%), Gaps = 40/427 (9%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNS- 59
           LIHRDS  SP  NP+   + R+           A+L+   RS        L S  ISN  
Sbjct: 33  LIHRDSPHSPLYNPHHTVSDRLNA---------AFLRSISRSRRFTTKTDLQSGLISNGG 83

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSE 119
            ++++ISIG PP   F   DTGS L WV C PC+ C  Q   +F   +SS+Y    CDS+
Sbjct: 84  EYFMSISIGTPPSKVFAIADTGSDLTWVQCKPCQQCYKQNSPLFDKKKSSTYKTESCDSK 143

Query: 120 HCRYFPYAR--CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
            C+        C   K  C Y   Y     T   V+TE ++  ++  S++     VFGCG
Sbjct: 144 TCQALSEHEEGCDESKDICKYRYSYGDNSFTKGDVATETISIDSSSGSSVSFPGTVFGCG 203

Query: 178 FSTNRNFKFSGIFGLGIGRS--SLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAI 231
           ++    F+ +G   +G+G    SLVSQL SS    FSYC+          + + LG  +I
Sbjct: 204 YNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIGKKFSYCLSHTAATTNGTSVINLGTNSI 263

Query: 232 IDEGD------ATPLQFID--GHYYITLEAISVDGRMLDINPNIF----KRDDSGGGVMI 279
                       TPL   D   +Y++TLEA++V    L      +    K     G ++I
Sbjct: 264 PSNPSKDSATLTTPLIQKDPETYYFLTLEAVTVGKTKLPYTGGGYGLNGKSSKRTGNIII 323

Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLK-GFPTVRFHF 338
           DSGT +T L    Y+     V   + G   +  S P  L  H   S D + G P +  HF
Sbjct: 324 DSGTTLTLLDSGFYDDFGTAVEESVTG--AKRVSDPQGLLTHCFKSGDKEIGLPAITMHF 381

Query: 339 RGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQK 398
              A + L   + F +   D  C+++ P +        ++ G M Q  + VGYD+  K  
Sbjct: 382 T-NADVKLSPINAFVKLNEDTVCLSMIPTT------EVAIYGNMVQMDFLVGYDLETKTV 434

Query: 399 TFQRMDC 405
           +FQRMDC
Sbjct: 435 SFQRMDC 441


>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 417

 Score =  147 bits (371), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 118/364 (32%), Positives = 169/364 (46%), Gaps = 28/364 (7%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + + ++IG PPVP     DTGS L W  C PC+ C PQ   ++ PS SS+++ VPC S  
Sbjct: 66  YLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPVPCSSAT 125

Query: 121 CRYFPYAR---CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT-DESTIHVQDVVFGC 176
           C   P  R   CS     C Y   Y  G  +   + TE LT  ++    T+ V  V FGC
Sbjct: 126 C--LPTWRSRNCSNPSSPCRYIYSYSDGAYSVGILGTETLTIGSSVPGQTVSVGSVAFGC 183

Query: 177 GFSTNRN-FKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
           G     +    +G  GLG G  SL++QL    FSYC+    +   + +   LG  A +  
Sbjct: 184 GTDNGGDSLNSTGTVGLGRGTLSLLAQLGVGKFSYCLTDFFN-STMDSPFFLGTLAELAP 242

Query: 235 G----DATPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVT 286
           G     +TPL         Y++ L+ IS+    L I    F  R D  GG+M+DSGT  T
Sbjct: 243 GPGTVQSTPLLQSPLNPSRYFVNLQGISLGDVRLPIPNGTFDLRADGNGGMMVDSGTTFT 302

Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-PTVRFHFRGGAKLA 345
            L K  +  + D V  +L G+   + S  D  C+    S D + F P +  HF GGA + 
Sbjct: 303 ILAKSGFREVVDRVA-QLLGQPPVNASSLDSPCFP---SPDGEPFMPDLVLHFAGGADMR 358

Query: 346 LEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
           L +D+ M Y     +FC+     +I      +S +G   QQ   + +D+   Q +F   D
Sbjct: 359 LHRDNYMSYNEDDSSFCL-----NIVGSPSTWSRLGNFQQQNIQMLFDMTVGQLSFLPTD 413

Query: 405 CEVL 408
           C  L
Sbjct: 414 CSKL 417


>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 543

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 118/420 (28%), Positives = 199/420 (47%), Gaps = 46/420 (10%)

Query: 22  VQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL----FYVNISIGQPPVPQFTA 77
           +Q+  N++ A +A L+    S   +   I+ + +   SL    +++++ +G PP   +  
Sbjct: 131 IQQQNNLANAFVASLES---SKGEFSGNIMATLESGASLGTGEYFLDMFVGTPPKHVWLI 187

Query: 78  MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYF----PYARCSAYK 133
           +DTGS L W+ C PC DC  Q G+ +YP  SS+Y  + C    C+      P   C A  
Sbjct: 188 LDTGSDLSWIQCDPCYDCFEQNGSHYYPKDSSTYRNISCYDPRCQLVSSSDPLQHCKAEN 247

Query: 134 HRCIYTQLYLIGPETSVFVSTE----QLTFKNTDESTIHVQDVVFGCGFSTNRNFKF--S 187
             C Y   Y  G  T+   ++E     LT+ N  E    V DV+FGCG   N+ F +  S
Sbjct: 248 QTCPYFYDYADGSNTTGDFASETFTVNLTWPNGKEKFKQVVDVMFGCG-HWNKGFFYGAS 306

Query: 188 GIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILG-DGAIIDEGDATPLQF 242
           G+ GLG G  S  SQ+ S    SFSYC+  L     + +KLI G D  +++  +      
Sbjct: 307 GLLGLGRGPISFPSQIQSIYGHSFSYCLTDLFSNTSVSSKLIFGEDKELLNNHNLNFTTL 366

Query: 243 IDGH-------YYITLEAISVDGRMLDINPNIFKRDDS------GGGVMIDSGTDVTWLV 289
           + G        YY+ +++I V G +LDI+   +           GGG +IDSG+ +T+  
Sbjct: 367 LAGEETPDETFYYLQIKSIMVGGEVLDISEQTWHWSSEGAAADAGGGTIIDSGSTLTFFP 426

Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPDKLCYH---GIMSSDLKGFPTVRFHFRGGAKLAL 346
             AY+ +++    +++ +Q+ +  +    CY+    +M  +L   P    HF  G     
Sbjct: 427 DSAYDIIKEAFEKKIKLQQIAADDFVMSPCYNVSGAMMQVEL---PDFGIHFADGGVWNF 483

Query: 347 EKDSMFYQPRPD-AFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
             ++ FYQ  PD   C+A+        + + ++IG + QQ +++ YD+ R +  +    C
Sbjct: 484 PAENYFYQYEPDEVICLAIMKTP---NHSHLTIIGNLLQQNFHILYDVKRSRLGYSPRRC 540


>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
 gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
 gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
 gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 485

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 124/433 (28%), Positives = 187/433 (43%), Gaps = 52/433 (12%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNT--------YQAQILP 52
           L H D+ LS    P+E  + R+QR  +  +  +A L  +I   N         + + ++ 
Sbjct: 76  LDHIDA-LSSNKTPDELFSSRLQRD-SRRVKSIATLAAQIPGRNVTHAPRPGGFSSSVVS 133

Query: 53  SNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYA 112
                +  ++  + +G P    +  +DTGS ++W+ C PCR C  Q   IF P +S +YA
Sbjct: 134 GLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYA 193

Query: 113 YVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
            +PC S HCR    A C+  +  C+Y   Y  G  T    STE LTF+        V+ V
Sbjct: 194 TIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRN-----RVKGV 248

Query: 173 VFGCGFSTNRNF--------------KFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPD 218
             GCG      F               F G  G          + N  FSYC+       
Sbjct: 249 ALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTG---------HRFNQKFSYCLVD-RSAS 298

Query: 219 YLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDG-RMLDINPNIFKRDDSG 274
              + ++ G+ A+      TPL     +D  YY+ L  ISV G R+  +  ++FK D  G
Sbjct: 299 SKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIG 358

Query: 275 -GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFP 332
            GGV+IDSGT VT L++ AY A+RD   +  +  ++   +S  D  C+     +++K  P
Sbjct: 359 NGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDT-CFDLSNMNEVK-VP 416

Query: 333 TVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
           TV  HFRG        + +        FC A             S+IG + QQ + V YD
Sbjct: 417 TVVLHFRGADVSLPATNYLIPVDTNGKFCFA-----FAGTMGGLSIIGNIQQQGFRVVYD 471

Query: 393 IGRKQKTFQRMDC 405
           +   +  F    C
Sbjct: 472 LASSRVGFAPGGC 484


>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
          Length = 459

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 118/372 (31%), Positives = 170/372 (45%), Gaps = 31/372 (8%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + + ++IG PPVP     DTGS L W  C PC+ C PQ   I+  + S+S++ VPC S  
Sbjct: 95  YLMELAIGTPPVPFVALADTGSDLTWTQCKPCKLCFPQDTPIYDTAASASFSPVPCASAT 154

Query: 121 CRYFPYAR----CSAYKHR-CIYTQLYLIGPETSVFVSTEQLTFKNTDEST----IHVQD 171
           C   P  R    C+A     C Y   Y  G  ++  + TE LTF  +        + V  
Sbjct: 155 C--LPIWRSSRNCTATTTSPCRYRYAYDDGAYSAGVLGTETLTFAGSSPGAPGPGVSVGG 212

Query: 172 VVFGCGF-STNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCI-----GSLHDPDYLHNKL 224
           V FGCG  +   ++  +G  GLG G  SLV+QL    FSYC+      SL  P    +  
Sbjct: 213 VAFGCGVDNGGLSYNSTGTVGLGRGSLSLVAQLGVGKFSYCLTDFFNTSLGSPVLFGSLA 272

Query: 225 ILGDGAIIDEGDATPLQFIDG-----HYYITLEAISV-DGRMLDINPNIFKRDDSGGGVM 278
            L   + I          + G      YY++LE IS+ D R+   N     RDD  GG++
Sbjct: 273 ELAAPSTIGGAAVQSTPLVQGPYNPSRYYVSLEGISLGDARLPIPNGTFDLRDDGSGGMI 332

Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS-DLKGFPTVRFH 337
           +DSGT  T LV+ A+  + + V   L    + + S  D  C+        L   P +  H
Sbjct: 333 VDSGTIFTVLVESAFRVVVNHVAGVLNQPVVNASSL-DSPCFPATAGEQQLPDMPDMLLH 391

Query: 338 FRGGAKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
           F GGA + L +D+ M +     +FC+  N A   + Y   S++G   QQ   + +DI   
Sbjct: 392 FAGGADMRLHRDNYMSFNQESSSFCL--NIAGAPSAY--GSILGNFQQQNIQMLFDITVG 447

Query: 397 QKTFQRMDCEVL 408
           Q +F   DC  L
Sbjct: 448 QLSFVPTDCSKL 459


>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 127/404 (31%), Positives = 193/404 (47%), Gaps = 36/404 (8%)

Query: 20  HRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMD 79
            R+Q GI  +  RL  L   + + ++  A+I       N  F +N++IG PP      MD
Sbjct: 60  QRIQHGIKRANHRLERLNAMVLAASS-NAEINSPVLSGNGEFLMNLAIGTPPETYSAIMD 118

Query: 80  TGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYT 139
           TGS L+W  C PC  C  Q   IF P +SSS++ + C S+ C+  P + CS     C Y 
Sbjct: 119 TGSDLIWTQCKPCTQCFDQPSPIFDPKKSSSFSKLSCSSQLCKALPQSSCS---DSCEYL 175

Query: 140 QLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF---SGIFGLGIGR 196
             Y     T   ++TE  TF       + + +V FGCG   N    F   SG+ GLG G 
Sbjct: 176 YTYGDYSSTQGTMATETFTFGK-----VSIPNVGFGCG-EDNEGDGFTQGSGLVGLGRGP 229

Query: 197 SSLVSQLNSS-FSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT---------PLQFIDGH 246
            SLVSQL  + FSYC+ S+ D     + L++G  A ++   A          PLQ     
Sbjct: 230 LSLVSQLKEAKFSYCLTSIDDTK--TSTLLMGSLASVNGTSAAIRTTPLIQNPLQ--PSF 285

Query: 247 YYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLE 305
           YY++LE ISV G  L I  + F+ +DD  GG++IDSGT +T+L + A++ ++ E   ++ 
Sbjct: 286 YYLSLEGISVGGTRLPIKESTFQLQDDGTGGLIIDSGTTITYLEESAFDLVKKEFTSQMG 345

Query: 306 GEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS-MFYQPRPDAFCMAV 364
                S +   +LCY+    +     P +  HF  GA L L  ++ M         C+A+
Sbjct: 346 LPVDNSGATGLELCYNLPSDTSELEVPKLVLHFT-GADLELPGENYMIADSSMGVICLAM 404

Query: 365 NPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
             +         S+ G + QQ   V +D+ ++  +F   +C  L
Sbjct: 405 GSSG------GMSIFGNVQQQNMFVSHDLEKETLSFLPTNCGQL 442


>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 485

 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 124/433 (28%), Positives = 186/433 (42%), Gaps = 52/433 (12%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNT--------YQAQILP 52
           L H D+ LS    P E  + R+QR  +  +  +A L  +I   N         + + ++ 
Sbjct: 76  LDHIDA-LSSNKTPQELFSSRLQRD-SRRVKSIATLAAQIPGRNVTHAPRTGGFSSSVVS 133

Query: 53  SNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYA 112
                +  ++  + +G P    +  +DTGS ++W+ C PCR C  Q   IF P +S +YA
Sbjct: 134 GLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYA 193

Query: 113 YVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
            +PC S HCR    A C+  +  C+Y   Y  G  T    STE LTF+        V+ V
Sbjct: 194 TIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRN-----RVKGV 248

Query: 173 VFGCGFSTNRNF--------------KFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPD 218
             GCG      F               F G  G          + N  FSYC+       
Sbjct: 249 ALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTG---------HRFNQKFSYCLVD-RSAS 298

Query: 219 YLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDG-RMLDINPNIFKRDDSG 274
              + ++ G+ A+      TPL     +D  YY+ L  ISV G R+  +  ++FK D  G
Sbjct: 299 SKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVELLGISVGGTRVPGVAASLFKLDQIG 358

Query: 275 -GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFP 332
            GGV+IDSGT VT L++ AY A+RD   +  +  ++   +S  D  C+     +++K  P
Sbjct: 359 NGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKALKRAPDFSLFDT-CFDLSNMNEVK-VP 416

Query: 333 TVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
           TV  HFRG        + +        FC A             S+IG + QQ + V YD
Sbjct: 417 TVVLHFRGADVSLPATNYLIPVDTNGKFCFA-----FAGTMGGLSIIGNIQQQGFRVVYD 471

Query: 393 IGRKQKTFQRMDC 405
           +   +  F    C
Sbjct: 472 LASSRVGFAPGGC 484


>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
 gi|194702684|gb|ACF85426.1| unknown [Zea mays]
 gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
          Length = 439

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 114/369 (30%), Positives = 174/369 (47%), Gaps = 35/369 (9%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYVPCDSE 119
           F + ++IG PP+P     DTGS L+W  C PC R C  Q   ++ PS S++++ +PC+S 
Sbjct: 85  FLMTLAIGTPPLPFLAIADTGSDLIWTQCAPCSRQCFQQPTPLYNPSSSTTFSALPCNSS 144

Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTF-KNTDESTIHVQDVVFGC-- 176
                    C A    C+Y   Y  G  T VF  TE  TF  +T    + V  + FGC  
Sbjct: 145 ------LGLC-APACACMYNMTYGSG-WTYVFQGTETFTFGSSTPADQVRVPGIAFGCSN 196

Query: 177 ---GFSTNRNFKFSGIFGLGIGRSSLVSQLNS-SFSYCIGSLHDPDYLHNKLILGDGAII 232
              GF+ +     SG+ GLG G  SLVSQL +  FSYC+    D +     L+    ++ 
Sbjct: 197 ASSGFNASSA---SGLVGLGRGSLSLVSQLGAPKFSYCLTPYQDTNSTSTLLLGPSASLN 253

Query: 233 DEGDATPLQFIDG----HYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTW 287
           D G  +   F+      +YY+ L  IS+    L I PN F  + D  GG++IDSGT +T 
Sbjct: 254 DTGVVSSTPFVASPSSIYYYLNLTGISLGTTALPIPPNAFSLKADGTGGLIIDSGTTITM 313

Query: 288 LVKEAYEALRDEV--MIRLEGEQMRSYSWPDKLCYHGIMSSDL-KGFPTVRFHFRGGAKL 344
           L   AY+ +R  V  ++ L      + +  D LC+    S+      P++  HF  GA +
Sbjct: 314 LGNTAYQQVRAAVLSLVTLPTTDGSAATGLD-LCFELPSSTSAPPSMPSMTLHFD-GADM 371

Query: 345 ALEKDSMFY-----QPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKT 399
            L  D+              +C+A+   +  +  V  S++G   QQ  ++ YD+G++  +
Sbjct: 372 VLPADNYMMSLSDPDSDSSLWCLAMQNQTDTDGVV-VSILGNYQQQNMHILYDVGKETLS 430

Query: 400 FQRMDCEVL 408
           F    C  L
Sbjct: 431 FAPAKCSTL 439


>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
          Length = 451

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 116/379 (30%), Positives = 177/379 (46%), Gaps = 40/379 (10%)

Query: 55  DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYV 114
           D S   + +N+SIG PPV      DTGSSL+W  C PC +C+ +    F P+ SS+++ +
Sbjct: 84  DNSAGAYNMNLSIGTPPVTFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSSTFSKL 143

Query: 115 PCDSEHCRYF--PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
           PC S  C++   PY  C+A    C+Y   Y +G  T+ +++TE L              V
Sbjct: 144 PCASSLCQFLTSPYLTCNATG--CVYYYPYGMG-FTAGYLATETLHVGGAS-----FPGV 195

Query: 173 VFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAI 231
            FGC          SGI GLG    SLVSQ+    FSYC+ S  D D   + ++ G  A 
Sbjct: 196 AFGCSTENGVGNSSSGIVGLGRSPLSLVSQVGVGRFSYCLRS--DADAGDSPILFGSLAK 253

Query: 232 IDEGD--ATPL-----QFIDGHYYITLEAISVDGRMLDINPNIF---KRDDSG--GGVMI 279
           +  G+  +TPL          +YY+ L  I+V    L +    F   +   +G  GG ++
Sbjct: 254 VTGGNVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPVTSTTFGFTRGAGAGLVGGTIV 313

Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK----LCYHGIMSSDLKG--FPT 333
           DSGT +T+LVKE Y  ++   + ++    + +     +    LC+    +    G   PT
Sbjct: 314 DSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFDLCFDATAAGGGSGVPVPT 373

Query: 334 VRFHFRGGAKLALEKDSMF------YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFY 387
           +   F GGA+ A+ + S         Q R    C+ V PAS     ++ S+IG + Q   
Sbjct: 374 LVLRFAGGAEYAVRRRSYVGVVAVDSQGRAAVECLLVLPAS---EKLSISIIGNVMQMDL 430

Query: 388 NVGYDIGRKQKTFQRMDCE 406
           +V YD+     +F   DC 
Sbjct: 431 HVLYDLDGGMFSFAPADCA 449


>gi|297846526|ref|XP_002891144.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297336986|gb|EFH67403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 445

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 131/427 (30%), Positives = 192/427 (44%), Gaps = 40/427 (9%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNS- 59
           LIHRDS  SP  NP    + R+           A+L+   RS        L S  ISN  
Sbjct: 33  LIHRDSPHSPLYNPQHTVSDRLNA---------AFLRSISRSRRFSTKTDLQSGLISNGG 83

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSE 119
            ++++ISIG PP       DTGS L WV C PC+ C  Q   +F   +SS+Y    CDS 
Sbjct: 84  EYFMSISIGTPPSKFLAIADTGSDLTWVQCKPCQQCYKQNTPLFDKKKSSTYKTESCDSI 143

Query: 120 HCRYFPYAR--CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
            C         C   ++ C Y   Y     T   V+TE ++  ++  S +      FGCG
Sbjct: 144 TCNALSEHEEGCDESRNACKYRYSYGDESFTKGEVATETISIDSSSGSPVSFPGTAFGCG 203

Query: 178 FSTNRNFKFSGIFGLGIGRS--SLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAI 231
           ++    F+ +G   +G+G    SLVSQL SS    FSYC+          + + LG  ++
Sbjct: 204 YNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIGKKFSYCLSHTSATTNGTSVINLGTNSM 263

Query: 232 IDEGD------ATPLQFID--GHYYITLEAISVDGRMLDINP----NIFKRDDSGGGVMI 279
             +         TPL   D   +Y++TLEAI+V    L        ++ ++    G ++I
Sbjct: 264 TSKPSKDSAILTTPLIQKDPETYYFLTLEAITVGKTKLPYTGGGGYSLNRKSKKTGNIII 323

Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLK-GFPTVRFHF 338
           DSGT +T L    Y+     V   + G   +  S P  +  H   S D + G PT+  HF
Sbjct: 324 DSGTTLTLLDSGFYDDFGAVVEESVTG--AKRVSDPQGILTHCFKSGDKEIGLPTITMHF 381

Query: 339 RGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQK 398
             GA + L   + F +   D  C+++ P +        ++ G M Q  + VGYD+  K  
Sbjct: 382 T-GADVKLSPINSFVKLSEDIVCLSMIPTT------EVAIYGNMVQMDFLVGYDLETKTV 434

Query: 399 TFQRMDC 405
           +FQRMDC
Sbjct: 435 SFQRMDC 441


>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 442

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 124/420 (29%), Positives = 187/420 (44%), Gaps = 36/420 (8%)

Query: 11  YNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYV-NISIGQ 69
           ++NP+ +    V+  +   + R A    ++ S            D+ N   Y+  ++IG 
Sbjct: 37  HSNPDVSATEFVRDALRRDMHRHARFTRELASSGDRTVAAPTRKDLPNGGEYIMTLAIGT 96

Query: 70  PPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYVPCDS--EHCRYF-- 124
           PP+      DTGS L+W  C PC   C  Q G  + PS S+++  +PC+S    C     
Sbjct: 97  PPLSYPAIADTGSDLIWTQCAPCGSQCFKQAGQPYNPSSSTTFGVLPCNSSVSMCAALAG 156

Query: 125 --PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNR 182
             P   CS     C+Y Q Y  G  T+   S E  TF +T      V  + FGC  +++ 
Sbjct: 157 PSPPPGCS-----CMYNQTYGTG-WTAGIQSVETFTFGSTPADQTRVPGIAFGCSNASSD 210

Query: 183 NFKFS-GIFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPL 240
           ++  S G+ GLG G  SLVSQL +  FSYC+    D +   + L+LG  A ++       
Sbjct: 211 DWNGSAGLVGLGRGSMSLVSQLGAGMFSYCLTPFQDANS-TSTLLLGPSAALNGTGVLTT 269

Query: 241 QFI--------DGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKE 291
            F+          +YY+ L  IS+    L I PN F  R D  GG++IDSGT +T LV  
Sbjct: 270 PFVASPSKAPMSTYYYLNLTGISIGTTALSIPPNAFALRTDGTGGLIIDSGTTITSLVDA 329

Query: 292 AYEALRD--EVMIRLEGEQMRSYSWPDKLCYHGIM-SSDLKGFPTVRFHFRGGAKLALEK 348
           AY+ +R   E ++ L        +  D LC+     +S     P++ FHF  GA + L  
Sbjct: 330 AYQQVRAAIESLVTLPVADGSDSTGLD-LCFALTSETSTPPSMPSMTFHFD-GADMVLPV 387

Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
           D+         +C+A+     N      S  G   QQ  ++ YDI  +  +F    C  L
Sbjct: 388 DNYMILGS-GVWCLAMR----NQTVGAMSTFGNYQQQNVHLLYDIHEETLSFAPAKCSTL 442


>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 365

 Score =  145 bits (366), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 106/361 (29%), Positives = 156/361 (43%), Gaps = 28/361 (7%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           +   + +G P       +DTGS L WV C PC  C  Q   +F P+ S+S+  + C S  
Sbjct: 13  YLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGKCYSQNDALFLPNTSTSFTKLACGSAL 72

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C   P+  C+  +  C+Y   Y  G  T+     + +T    +     V +  FGCG   
Sbjct: 73  CNGLPFPMCN--QTTCVYWYSYGDGSLTTGDFVYDTITMDGINGQKQQVPNFAFGCGHDN 130

Query: 181 NRNFKFS-GIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
             +F  + GI GLG G  S  SQL    N  FSYC+     P    + L+ GD A+    
Sbjct: 131 EGSFAGADGILGLGQGPLSFHSQLKSVYNGKFSYCLVDWLAPPTQTSPLLFGDAAVPILP 190

Query: 236 DATPLQF-----IDGHYYITLEAISVDGRMLDINPNIFKRDDSGG-GVMIDSGTDVTWLV 289
           D   L       +  +YY+ L  ISV   +L+I+  +F  D  GG G + DSGT VT L 
Sbjct: 191 DVKYLPILANPKVPTYYYVKLNGISVGDNLLNISSTVFDIDSVGGAGTIFDSGTTVTQLA 250

Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPD-----KLCYHGIMSSDLKGFPTVRFHFRGGAKL 344
           + AY+    EV+  +    M      D      LC  G     L   P + FHF GG  +
Sbjct: 251 EAAYK----EVLAAMNASTMAYSRKIDDISRLDLCLSGFPKDQLPTVPAMTFHFEGGDMV 306

Query: 345 ALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
               +   Y     ++C A+  +       + ++IG + QQ + V YD   ++  F   D
Sbjct: 307 LPPSNYFIYLESSQSYCFAMTSSP------DVNIIGSVQQQNFQVYYDTAGRKLGFVPKD 360

Query: 405 C 405
           C
Sbjct: 361 C 361


>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 536

 Score =  145 bits (365), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 123/417 (29%), Positives = 198/417 (47%), Gaps = 46/417 (11%)

Query: 22  VQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL----FYVNISIGQPPVPQFTA 77
           +Q+  N++ A +A L+    S + +   I+ + +   SL    +++++ +G PP   +  
Sbjct: 130 IQQQNNLANAVVASLKS---SKDEFSGNIMATLESGASLGTGEYFIDMFVGTPPKHVWLI 186

Query: 78  MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYF----PYARCSAYK 133
           +DTGS L W+ C PC DC  Q G  + P+ SSSY  + C    C+      P   C    
Sbjct: 187 LDTGSDLSWIQCDPCYDCFEQNGPHYNPNESSSYRNISCYDPRCQLVSSPDPLQHCKTEN 246

Query: 134 HRCIYTQLYLIGPETS----VFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF--KFS 187
             C Y   Y  G  T+    +   T  LT+ N  E   HV DV+FGCG   N+ F     
Sbjct: 247 QTCPYFYDYADGSNTTGDFALETFTVNLTWPNGKEKFKHVVDVMFGCG-HWNKGFFHGAG 305

Query: 188 GIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILG-DGAIIDE-------- 234
           G+ GLG G  S  SQL S    SFSYC+  L     + +KLI G D  +++         
Sbjct: 306 GLLGLGRGPLSFPSQLQSIYGHSFSYCLTDLFSNTSVSSKLIFGEDKELLNHHNLNFTKL 365

Query: 235 --GDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKE 291
             G+ TP    D  YY+ +++I V G +LDI    +     G GG +IDSG+ +T+    
Sbjct: 366 LAGEETP---DDTFYYLQIKSIVVGGEVLDIPEKTWHWSSEGVGGTIIDSGSTLTFFPDS 422

Query: 292 AYEALRDEVMIRLEGEQMRSYSWPDKLCYH--GIMSSDLKGFPTVRFHFRGGAKLALEKD 349
           AY+ +++    +++ +Q+ +  +    CY+  G M  +L   P    HF  GA      +
Sbjct: 423 AYDVIKEAFEKKIKLQQIAADDFIMSPCYNVSGAMQVEL---PDYGIHFADGAVWNFPAE 479

Query: 350 SMFYQPRPD-AFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           + FYQ  PD   C+A+        + + ++IG + QQ +++ YD+ R +  +    C
Sbjct: 480 NYFYQYEPDEVICLAILKTP---NHSHLTIIGNLLQQNFHILYDVKRSRLGYSPRRC 533


>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 455

 Score =  145 bits (365), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 117/437 (26%), Positives = 202/437 (46%), Gaps = 45/437 (10%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           L+H+    +P+ +P+E  A  + R +++      +  ++    N++++ ++      +  
Sbjct: 33  LLHK----TPFTSPSEALAFDINRRLSL---LHHHRHQQQHKQNSFRSPVISGASSGSGQ 85

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ-LGTIFYPSRSSSYAYVPCDSE 119
           ++V++ IG PP       DTGS L+WV C PCR+CS +  G+ F+   S++Y+ + C S 
Sbjct: 86  YFVSLRIGTPPQTLLLVADTGSDLIWVKCSPCRNCSHRSPGSAFFARHSTTYSAIHCYSP 145

Query: 120 HCRYFPYAR---CSAYK--HRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
            C+  P+     C+  +    C Y   Y     T+ F S E LT   +      +  + F
Sbjct: 146 QCQLVPHPHPNPCNRTRLHSPCRYQYTYADSSTTTGFFSKEALTLNTSTGKVKKLNGLSF 205

Query: 175 GCGFS------TNRNFKFS-GIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNK 223
           GCGF       T  +F+ + G+ GLG    S  SQL     S FSYC+          + 
Sbjct: 206 GCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGRRFGSKFSYCLMDYTLSPPPTSF 265

Query: 224 LILGDG---AIIDEG--DATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG- 274
           L +G     A+  +G    TPL         YYI ++ + V+G  L INP+++  DD G 
Sbjct: 266 LTIGGAQNVAVSKKGIMSFTPLLINPLSPTFYYIAIKGVYVNGVKLPINPSVWSIDDLGN 325

Query: 275 GGVMIDSGTDVTWLVKEAY----EALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG 330
           GG +IDSGT +T++ + AY    +A +  V +    E    +     LC + +       
Sbjct: 326 GGTIIDSGTTLTFITEPAYTEILKAFKKRVKLPSPAEPTPGF----DLCMN-VSGVTRPA 380

Query: 331 FPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
            P + F+  GG+  +    + F +      C+AV P S +     FS++G + QQ + + 
Sbjct: 381 LPRMSFNLAGGSVFSPPPRNYFIETGDQIKCLAVQPVSQDG---GFSVLGNLMQQGFLLE 437

Query: 391 YDIGRKQKTFQRMDCEV 407
           +D  + +  F R  C +
Sbjct: 438 FDRDKSRLGFTRRGCAL 454


>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
          Length = 451

 Score =  145 bits (365), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 118/430 (27%), Positives = 197/430 (45%), Gaps = 48/430 (11%)

Query: 10  PYNNPNENPAHRVQRGINISIARLAYLQEKIRS-HNTYQAQILPSN----DISNSLFYVN 64
           PY   + +    V+ G   S  R A+L  K+    +  +  + P++     +S+    + 
Sbjct: 35  PYAGSSLSRHDVVRHGARASKTRAAWLTAKLAGVLSNRRGGVSPADVRLSPLSDQGHSLT 94

Query: 65  ISIGQPPVPQFTAMDTGSSLLWVHC-------YPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
           + IG PP P+   +DTGS L+W  C          R  SP    ++ P  SS++A++PC 
Sbjct: 95  VGIGTPPQPRKLIVDTGSDLIWTQCKLSSSTAVAARHGSPP---VYDPGESSTFAFLPCS 151

Query: 118 SEHCR--YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
              C+   F +  C++ K+RC+Y  +Y       V  S E  TF      ++ +    FG
Sbjct: 152 DRLCQEGQFSFKNCTS-KNRCVYEDVYGSAAAVGVLAS-ETFTFGARRAVSLRLG---FG 206

Query: 176 CG-FSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIID 233
           CG  S       +GI GL     SL++QL    FSYC+    D     + L+ G  A + 
Sbjct: 207 CGALSAGSLIGATGILGLSPESLSLITQLKIQRFSYCLTPFADKKT--SPLLFGAMADLS 264

Query: 234 EGDAT-PLQFI--------DGHYYITLEAISVDGRMLDI-NPNIFKRDDSGGGVMIDSGT 283
               T P+Q            +YY+ L  IS+  + L +   ++  R D GGG ++DSG+
Sbjct: 265 RHKTTRPIQTTAIVSNPVKTVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDSGS 324

Query: 284 DVTWLVKEAYEALRDEVM--IRLEGEQMRSYSWPDKLCY-----HGIMSSDLKGFPTVRF 336
            V +LV+ A+EA+++ VM  +RL         +  +LC+         + +    P +  
Sbjct: 325 TVAYLVEAAFEAVKEAVMDVVRLPVANRTVEDY--ELCFVLPRRTAAAAMEAVQVPPLVL 382

Query: 337 HFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
           HF GGA + L +D+ F +PR    C+AV   +  +     S+IG + QQ  +V +D+   
Sbjct: 383 HFDGGAAMVLPRDNYFQEPRAGLMCLAVGKTTDGS---GVSIIGNVQQQNMHVLFDVQHH 439

Query: 397 QKTFQRMDCE 406
           + +F    C+
Sbjct: 440 KFSFAPTQCD 449


>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 431

 Score =  144 bits (364), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 116/393 (29%), Positives = 181/393 (46%), Gaps = 22/393 (5%)

Query: 30  IARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC 89
           + R A  + ++R+ + Y A   P        + + ++IG PPVP     DTGS L W  C
Sbjct: 47  LMRRAAHRSRLRALSGYDANS-PRLHSVQVEYLMELAIGTPPVPFVALADTGSDLTWTQC 105

Query: 90  YPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYAR-CSAYKHRCIYTQLYLIGPET 148
            PC+ C PQ   ++ PS SS+++ VPC S  C     +R CS     C Y   Y  G  +
Sbjct: 106 QPCKLCFPQDTPVYDPSASSTFSPVPCSSATCLPVLRSRNCSTPSSLCRYGYSYSDGAYS 165

Query: 149 SVFVSTEQLTFKNT-DESTIHVQDVVFGCGFSTNRN-FKFSGIFGLGIGRSSLVSQLN-S 205
           +  + TE LT  ++     + V DV FGCG     +    +G  GLG G  SL++QL   
Sbjct: 166 AGILGTETLTLGSSVPGQAVSVSDVAFGCGTDNGGDSLNSTGTVGLGRGTLSLLAQLGVG 225

Query: 206 SFSYCIGSLHDPDYLHNKLILGDGAIIDEG----DATPL---QFIDGHYYITLEAISVDG 258
            FSYC+    +   L +  +LG  A +  G     +TPL         Y ++L+ I++  
Sbjct: 226 KFSYCLTDFFN-STLDSPFLLGTLAELAPGPGAVQSTPLLQSPLNPSRYVVSLQGITLGD 284

Query: 259 RMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK 317
             L I    F    +S GG+++DSGT  + L +  +  + D V  ++ G+   + S  D 
Sbjct: 285 VRLPIPNKTFDLHANSTGGMVVDSGTTFSILPESGFRVVVDHV-AQVLGQPPVNASSLDS 343

Query: 318 LCYHGIMSS-DLKGFPTVRFHFRGGAKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVN 375
            C+        L   P +  HF GGA + L +D+ M Y     +FC+     +I      
Sbjct: 344 PCFPAPAGERQLPFMPDLVLHFAGGADMRLHRDNYMSYNQEDSSFCL-----NIVGTTST 398

Query: 376 FSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
           +S++G   QQ   + +D+   Q +F   DC  L
Sbjct: 399 WSMLGNFQQQNIQMLFDMTVGQLSFLPTDCSKL 431


>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 419

 Score =  144 bits (364), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 117/419 (27%), Positives = 192/419 (45%), Gaps = 32/419 (7%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDIS-NS 59
           LIHR+   SP  +   N +         ++ R A  + ++  H   + ++  +   S N 
Sbjct: 22  LIHREHPSSPLRS---NTSKTTTEIFLAAVKRGAERRAQLSKHILAEGRLFSTPVASGNG 78

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSE 119
            + ++IS G PP      +DTGS L+W  C PC  C+     IF P +SS+Y  V C S 
Sbjct: 79  EYLIDISFGSPPQKASVIVDTGSDLIWTQCLPCETCNAAASVIFDPVKSSTYDTVSCASN 138

Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
            C   P+  C+     C Y  +Y  G  TS       L+ +     T  + +V FGCG +
Sbjct: 139 FCSSLPFQSCTT---SCKYDYMYGDGSSTS-----GALSTETVTVGTGTIPNVAFGCGHT 190

Query: 180 TNRNFK-FSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
              +F   +GI GLG G  SL+SQ +S     FSYC+  L       + +++GD A    
Sbjct: 191 NLGSFAGAAGIVGLGQGPLSLISQASSITSKKFSYCLVPLGSTK--TSPMLIGDSAAAGG 248

Query: 235 GDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVK 290
              T L     +   YY  L  ISV G+ +      F  D SG GG ++DSGT +T+L  
Sbjct: 249 VAYTALLTNTANPTFYYADLTGISVSGKAVTYPVGTFSIDASGQGGFILDSGTTLTYLET 308

Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS 350
            A+ AL   +   +   +     +    C+     ++   +PT+ FHF+ GA   L  ++
Sbjct: 309 GAFNALVAALKAEVPFPEADGSLYGLDYCFSTAGVAN-PTYPTMTFHFK-GADYELPPEN 366

Query: 351 MFYQ-PRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
           +F       + C+A+  ++       FS++G + QQ + + +D+  ++  F+  +CE +
Sbjct: 367 VFVALDTGGSICLAMAAST------GFSIMGNIQQQNHLIVHDLVNQRVGFKEANCETI 419


>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
 gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
          Length = 459

 Score =  144 bits (364), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 104/358 (29%), Positives = 168/358 (46%), Gaps = 23/358 (6%)

Query: 63  VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR 122
           + + +G PP P    +D GS LLW  C      + QL  +F  +RSSS++ +PCDS+ C 
Sbjct: 109 LTVGVGTPPQPSKVILDLGSDLLWTQCSLVGPTAKQLEPVFDAARSSSFSVLPCDSKLCE 168

Query: 123 YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNR 182
              +   +    +C Y   Y I   T V ++TE  TF      +    ++ FGCG   N 
Sbjct: 169 AGTFTNKTCTDRKCAYENDYGIMTATGV-LATETFTFGAHHGVS---ANLTFGCGKLANG 224

Query: 183 NF-KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHD----PDYLHNKLILGDGAIIDEGD 236
              + SGI GL  G  S++ QL  + FSYC+    D    P        LG      +  
Sbjct: 225 TIAEASGILGLSPGPLSMLKQLAITKFSYCLTPFADRKTSPVMFGAMADLGKYKTTGKVQ 284

Query: 237 ATPL---QFIDGHYYITLEAISVDGRMLDI-NPNIFKRDDSGGGVMIDSGTDVTWLVKEA 292
             PL      D +YY+ +  +SV  + LD+    +  + D  GG ++DS T + +LV+ A
Sbjct: 285 TIPLLKNPVEDIYYYVPMVGMSVGSKRLDVPQETLAIKPDGTGGTVLDSATTLAYLVEPA 344

Query: 293 YEALRDEVM--IRLEGEQMRSYSWPDKLCYHGIMSSDLKG--FPTVRFHFRGGAKLALEK 348
           +  L+  VM  I+L         +P  +C+       ++G   P +  HF G A+++L +
Sbjct: 345 FTELKKAVMEGIKLPVANRSVDDYP--VCFELPRGMSMEGVQVPPLVLHFDGDAEMSLPR 402

Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
           D+ F +P P   C+AV  A         ++IG + QQ  +V YD+G ++ ++    C+
Sbjct: 403 DNYFQEPSPGMMCLAVMQAPFEGAP---NVIGNVQQQNMHVLYDVGNRKFSYAPTKCD 457


>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 479

 Score =  144 bits (363), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 127/433 (29%), Positives = 191/433 (44%), Gaps = 54/433 (12%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQ---AQILPSNDIS 57
           L+HRD++ S    P+    H +        AR+ YLQ ++          ++++      
Sbjct: 73  LLHRDAV-SGRTYPSTR--HAMLGLAARDGARVEYLQRRLSPTTMTTEVGSEVVSGISEG 129

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
           +  ++V + +G PP  Q+  +D+GS ++W+ C PC +C  Q   +F P+ S+S+  VPCD
Sbjct: 130 SGEYFVRVGVGSPPTEQYLVVDSGSDVIWIQCRPCAECYQQADPLFDPAASASFTAVPCD 189

Query: 118 SEHCRYFPYARCS-AYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
           S  CR  P      A    C Y   Y  G  T   ++ E LTF ++      VQ V  GC
Sbjct: 190 SGVCRTLPGGSSGCADSGACRYQVSYGDGSYTQGVLAMETLTFGDSTP----VQGVAIGC 245

Query: 177 GFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAI 231
           G      F   +G+ GLG G  SLV QL      +FSYC+ S    D     L+ G    
Sbjct: 246 GHRNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLAS-RGADAGAGSLVFG---- 300

Query: 232 IDEGDATPLQFI----------DGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMID 280
               DA P+  +             YY+ L  + V G  L +   +F   +D GGGV++D
Sbjct: 301 --RDDAMPVGAVWVPLLRNAQQPSFYYVGLTGLGVGGERLPLQDGLFDLTEDGGGGVVMD 358

Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRS--YSWPDKLCYHGIMSSDLKGF-----PT 333
           +GT VT L  +AY ALRD     + G+  R+   S  D  CY      DL G+     PT
Sbjct: 359 TGTAVTRLPPDAYAALRDAFASTIGGDLPRAPGVSLLDT-CY------DLSGYASVRVPT 411

Query: 334 VRFHF-RGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
           V  +F R GA L L   ++  +     +C+A   ++        S++G + QQ   +  D
Sbjct: 412 VALYFGRDGAALTLPARNLLVEMGGGVYCLAFAASA-----SGLSILGNIQQQGIQITVD 466

Query: 393 IGRKQKTFQRMDC 405
                  F    C
Sbjct: 467 SANGYVGFGPSTC 479


>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 434

 Score =  144 bits (363), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 118/420 (28%), Positives = 184/420 (43%), Gaps = 31/420 (7%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNT--YQAQILPSNDISN 58
           L  RDS LSP +NP+ +    +      S +R A L   + S +T   ++ I+P +    
Sbjct: 32  LFRRDSPLSPLHNPSLSRYDSLIDAFRRSFSRSATLLTHLTSVSTACIRSPIIPDS---- 87

Query: 59  SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
             F ++I IG PPV      DTGS L W  C PCR+C  Q   IF P RSSSY  V C S
Sbjct: 88  GEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSCAS 147

Query: 119 EHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
           + CR      C      C Y   Y     T   ++++Q+T       +  +   V GCG 
Sbjct: 148 DTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGDLASDQITI-----GSFKLPKTVIGCGH 202

Query: 179 STNRNF--------KFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGA 230
                F           G     + +   ++ +   FSYC+ +      +   +  G  A
Sbjct: 203 QNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPTFFSNANITGTISFGRKA 262

Query: 231 IID--EGDATPL--QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVT 286
           ++   +  +TPL  +  D  Y++TLEAISV  +       I    +  G ++IDSGT +T
Sbjct: 263 VVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAANGISAMTNH-GNIIIDSGTTLT 321

Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
            L +  Y  +   +   ++ +++   S   +LCY      DL   P +  HF GGA + L
Sbjct: 322 LLPRSLYYGVFSTLARVIKAKRVDDPSGILELCYSAGQVDDLN-IPIITAHFAGGADVKL 380

Query: 347 EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
              + F     +  C+   PA+        ++ G +AQ  + VGYD+G K+ +F+   C 
Sbjct: 381 LPVNTFAPVADNVTCLTFAPAT------QVAIFGNLAQINFEVGYDLGNKRLSFEPKLCA 434


>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
 gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
 gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 441

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 126/434 (29%), Positives = 196/434 (45%), Gaps = 55/434 (12%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQE-KIRSHNTYQAQILPSNDISNS 59
           L H D+      N     A  + R +  S AR+A LQ     +     A+IL     S  
Sbjct: 35  LTHVDA------NAGYTKAQLLSRAVARSRARVAALQSLATAADAITAARIL--LRFSEG 86

Query: 60  LFYVNISIGQPPVPQFTAM-DTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
            + +++ IG PP   F+AM DTGS L+W  C PC  C  Q    F P++S+SYA +PC S
Sbjct: 87  EYLMDVGIGSPPR-YFSAMIDTGSDLIWTQCAPCLLCVEQPTPYFEPAKSTSYASLPCSS 145

Query: 119 EHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG- 177
             C       C  +++ C+Y   Y     ++  ++ E  TF  T+ + + V  V FGCG 
Sbjct: 146 AMCNALYSPLC--FQNACVYQAFYGDSASSAGVLANETFTF-GTNSTRVAVPRVSFGCGN 202

Query: 178 FSTNRNFKFSGIFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNKLILGDGAIIDEGD 236
            +    F  SG+ G G G  SLVSQL S  FSYC+ S   P    ++L  G  A ++  +
Sbjct: 203 MNAGTLFNGSGMVGFGRGALSLVSQLGSPRFSYCLTSFMSPA--TSRLYFGAYATLNSTN 260

Query: 237 AT---PLQ--------FIDGHYYITLEAISVDGRMLDINPNIFK--RDDSGGGVMIDSGT 283
            +   P+Q         +   Y++ +  ISV G +L I+P++F     D  GGV+IDSGT
Sbjct: 261 TSSSGPVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTGGVIIDSGT 320

Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRS---------YSWPDKLCYHGIMSSDLKGFPTV 334
            VT+L + AY  ++   +  +   +  +         + WP            +   P +
Sbjct: 321 TVTFLAQPAYAMVQGAFVAWVGLPRANATPSDTFDTCFKWPPP-------PRRMVTLPEM 373

Query: 335 RFHFRGG-AKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
             HF G   +L LE + M         C+A+ P+       + S+IG    Q +++ YD+
Sbjct: 374 VLHFDGADMELPLE-NYMVMDGGTGNLCLAMLPSD------DGSIIGSFQHQNFHMLYDL 426

Query: 394 GRKQKTFQRMDCEV 407
                +F    C +
Sbjct: 427 ENSLLSFVPAPCNL 440


>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
 gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
          Length = 481

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 127/429 (29%), Positives = 192/429 (44%), Gaps = 38/429 (8%)

Query: 1   LIHRDSILSPYNNPNENPAH-----RVQRGINISIARLAYLQEKIRSHNTYQAQI---LP 52
           ++HR    SP       P+H     R Q  ++ SI RLA  +    + +   A     LP
Sbjct: 68  VVHRHGPCSPLQARGGEPSHAEILDRDQDRVD-SIHRLAAARPSSTADDPSSASKGVSLP 126

Query: 53  SND---ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSS 109
           +     +  + + V++ +G P        DTGS L WV C PC  C  Q   +F PS+S+
Sbjct: 127 ARRGVPLGTANYIVSVGLGTPKRDLLVVFDTGSDLSWVQCKPCDGCYQQHDPLFDPSQST 186

Query: 110 SYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI-- 167
           +Y+ VPC ++ CR      CS+ K  C Y  +Y    +T   ++ + LT   +  S+   
Sbjct: 187 TYSAVPCGAQECRRLDSGSCSSGK--CRYEVVYGDMSQTDGNLARDTLTLGPSSSSSSSD 244

Query: 168 HVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHN 222
            +Q+ VFGCG      F K  G+FGLG  R SL SQ      + FSYC+ S         
Sbjct: 245 QLQEFVFGCGDDDTGLFGKADGLFGLGRDRVSLASQAAAKYGAGFSYCLPS---SSTAEG 301

Query: 223 KLILGDGAIIDEGDATPLQFID--GHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
            L LG  A  +      +   D    YY+ L  I V GR + ++P +F+      G +ID
Sbjct: 302 YLSLGSAAPPNARFTAMVTRSDTPSFYYLNLVGIKVAGRTVRVSPAVFRTP----GTVID 357

Query: 281 SGTDVTWLVKEAYEALRDE---VMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFH 337
           SGT +T L   AY ALR     +M R   ++  + S  D  CY     + ++  P+V   
Sbjct: 358 SGTVITRLPSRAYAALRSSFAGLMRRYSYKRAPALSILDT-CYDFTGRNKVQ-IPSVALL 415

Query: 338 FRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
           F GGA L L    + Y       C+A    + N    + +++G M Q+ + V YD+  ++
Sbjct: 416 FDGGATLNLGFGEVLYVANKSQACLAF---ASNGDDTSIAILGNMQQKTFAVVYDVANQK 472

Query: 398 KTFQRMDCE 406
             F    C 
Sbjct: 473 IGFGAKGCS 481


>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
 gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
          Length = 438

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 126/434 (29%), Positives = 196/434 (45%), Gaps = 55/434 (12%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQE-KIRSHNTYQAQILPSNDISNS 59
           L H D+      N     A  + R +  S AR+A LQ     +     A+IL     S  
Sbjct: 32  LTHVDA------NAGYTKAQLLSRAVARSRARVAALQSLATAADAITAARIL--LRFSEG 83

Query: 60  LFYVNISIGQPPVPQFTAM-DTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
            + +++ IG PP   F+AM DTGS L+W  C PC  C  Q    F P++S+SYA +PC S
Sbjct: 84  EYLMDVGIGSPPR-YFSAMIDTGSDLIWTQCAPCLLCVEQPTPYFEPAKSTSYASLPCSS 142

Query: 119 EHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG- 177
             C       C  +++ C+Y   Y     ++  ++ E  TF  T+ + + V  V FGCG 
Sbjct: 143 AMCNALYSPLC--FQNACVYQAFYGDSASSAGVLANETFTF-GTNSTRVAVPRVSFGCGN 199

Query: 178 FSTNRNFKFSGIFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNKLILGDGAIIDEGD 236
            +    F  SG+ G G G  SLVSQL S  FSYC+ S   P    ++L  G  A ++  +
Sbjct: 200 MNAGTLFNGSGMVGFGRGALSLVSQLGSPRFSYCLTSFMSPA--TSRLYFGAYATLNSTN 257

Query: 237 AT---PLQ--------FIDGHYYITLEAISVDGRMLDINPNIFK--RDDSGGGVMIDSGT 283
            +   P+Q         +   Y++ +  ISV G +L I+P++F     D  GGV+IDSGT
Sbjct: 258 TSSSGPVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTGGVIIDSGT 317

Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRS---------YSWPDKLCYHGIMSSDLKGFPTV 334
            VT+L + AY  ++   +  +   +  +         + WP            +   P +
Sbjct: 318 TVTFLAQPAYAMVQGAFVAWVGLPRANATPSDTFDTCFKWPPP-------PRRMVTLPEM 370

Query: 335 RFHFRGG-AKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
             HF G   +L LE + M         C+A+ P+       + S+IG    Q +++ YD+
Sbjct: 371 VLHFDGADMELPLE-NYMVMDGGTGNLCLAMLPSD------DGSIIGSFQHQNFHMLYDL 423

Query: 394 GRKQKTFQRMDCEV 407
                +F    C +
Sbjct: 424 ENSLLSFVPAPCNL 437


>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
 gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
          Length = 490

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 127/432 (29%), Positives = 191/432 (44%), Gaps = 57/432 (13%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNT------------YQA 48
           L+HRD   +     N  PA  + R +   + R A++  K  ++ T            + A
Sbjct: 72  LLHRDRFAA-----NATPAQLLARRLQRDVLRAAWIISKAAANGTPPPVAGLSSARGFVA 126

Query: 49  QILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRS 108
            ++ S   ++  +   I++G P V    A+DT S L W+ C PCR C PQ G +F P  S
Sbjct: 127 PVV-SRAPTSGEYIAKIAVGTPGVEALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHS 185

Query: 109 SSYAYVPCDSEHCRYFPYARCS-AYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI 167
           +SY  +  ++  C+    +    A +  C+YT  Y  G  T      E LTF       +
Sbjct: 186 TSYREMSFNAADCQALGRSGGGDAKRGTCVYTVGYGDGSTTVGDFIEETLTFAG----GV 241

Query: 168 HVQDVVFGCGFSTNRNF--KFSGIFGLGIGRSSLVSQL--NSSFSYC-IGSLHDPDYLHN 222
            +  +  GCG      F    +GI GLG G  S  +Q+  N +FSYC +  L  P  L +
Sbjct: 242 RLPRISIGCGHDNKGLFGAPAAGILGLGRGLMSFPNQIDHNGTFSYCLVDFLSGPGSLSS 301

Query: 223 KLILGDGAIIDEGDATPLQF--------IDGHYYITLEAISVDGRMLDINPNIFKRD--- 271
            L  G GA+     + P+ F        +   YY+ L  ISV G  +   P + +RD   
Sbjct: 302 TLTFGAGAV---DTSPPVSFTPTVLNLNMPTFYYVRLTGISVGGVRV---PGVTERDLQL 355

Query: 272 ---DSGGGVMIDSGTDVTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMS 325
                 GGV++DSGT VT L + AY A RD    V + L    +   S     CY  +  
Sbjct: 356 DPYTGRGGVIVDSGTAVTRLARPAYTAFRDAFRAVAVDLGQVSIGGPSGFFDTCYT-VGG 414

Query: 326 SDLKGFPTVRFHFRGGAKLALE-KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQ 384
             +K  PTV  HF G  ++ L+ K+ +         C A   A+  +  V  S+IG + Q
Sbjct: 415 RGMKKVPTVSMHFAGSVEVKLQPKNYLIPVDSMGTVCFAF--AATGDHSV--SIIGNIQQ 470

Query: 385 QFYNVGYDIGRK 396
           Q + + YDIG +
Sbjct: 471 QGFRIVYDIGGR 482


>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
          Length = 379

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 99/319 (31%), Positives = 156/319 (48%), Gaps = 30/319 (9%)

Query: 22  VQRGINISIARLAYLQEKIRSHNTY----QAQILPSNDISNSLFYVNISIGQPPVPQFTA 77
           + R I  S AR+A LQ              A++L +   S+  + V+++IG PP+     
Sbjct: 48  LSRAIARSKARVAALQSAAVLPPVVDPITAARVLVT--ASSGEYLVDLAIGTPPLYYTAI 105

Query: 78  MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCI 137
           MDTGS L+W  C PC  C+ Q    F   +S++Y  +PC S  C       C  +K  C+
Sbjct: 106 MDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRCASLSSPSC--FKKMCV 163

Query: 138 YTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG-FSTNRNFKFSGIFGLGIGR 196
           Y   Y     T+  ++ E  TF   + + +   ++ FGCG  +       SG+ G G G 
Sbjct: 164 YQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLNAGDLANSSGMVGFGRGP 223

Query: 197 SSLVSQLNSS-FSYCIGSL--HDPDYLHNKLILGDGAIIDEGDATPLQ--------FIDG 245
            SLVSQL  S FSYC+ S     P  L+   +  + +  +    +P+Q         +  
Sbjct: 224 LSLVSQLGPSRFSYCLTSYLSATPSRLYFG-VYANLSSTNTSSGSPVQSTPFVINPALPN 282

Query: 246 HYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRL 304
            Y+++L+AIS+  ++L I+P +F  +D G GGV+IDSGT +TWL ++AYEA+R  ++  +
Sbjct: 283 MYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVSAI 342

Query: 305 EGEQMRS--------YSWP 315
               M          + WP
Sbjct: 343 PLTAMNDTDIGLDTCFQWP 361


>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
          Length = 485

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 123/433 (28%), Positives = 186/433 (42%), Gaps = 52/433 (12%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNT--------YQAQILP 52
           L H D+ LS    P E  + R+QR  +  +  +A L  +I   N         + + ++ 
Sbjct: 76  LDHIDA-LSSNKTPQELFSSRLQRD-SRRVRSIATLAAQIPGRNVTHAPRPGGFSSSVVS 133

Query: 53  SNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYA 112
                +  ++  + +G P    +  +DTGS ++W+ C PCR C  Q   IF P +S +YA
Sbjct: 134 GLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYA 193

Query: 113 YVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
            +PC S HCR    A C+  +  C+Y   Y  G  T    STE LTF+        V+ V
Sbjct: 194 TIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRN-----RVKGV 248

Query: 173 VFGCGFSTNRNF--------------KFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPD 218
             GCG      F               F G  G          + N  FSYC+       
Sbjct: 249 ALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTG---------HRFNQKFSYCLVD-RSAS 298

Query: 219 YLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDG-RMLDINPNIFKRDDSG 274
              + ++ G+ A+      TPL     +D  YY+ L  ISV G R+  +  ++FK D  G
Sbjct: 299 SKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIG 358

Query: 275 -GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFP 332
            GGV+IDSGT VT L++ AY A+RD   +  +  ++  ++S  D  C+     +++K  P
Sbjct: 359 NGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPNFSLFDT-CFDLSNMNEVK-VP 416

Query: 333 TVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
           TV  HFR         + +        FC A             S+IG + QQ + V YD
Sbjct: 417 TVVLHFRRADVSLPATNYLIPVDTNGKFCFA-----FAGTMGGLSIIGNIQQQGFRVVYD 471

Query: 393 IGRKQKTFQRMDC 405
           +   +  F    C
Sbjct: 472 LASSRVGFAPGGC 484


>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
 gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
          Length = 439

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 127/402 (31%), Positives = 191/402 (47%), Gaps = 36/402 (8%)

Query: 20  HRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMD 79
           H V+RG N  + RL  +     S +  +A +LP N      F + ++IG PP      +D
Sbjct: 61  HGVKRGRN-RLQRLQAMALVASSSSEIEAPVLPGN----GEFLMKLAIGTPPETYSAILD 115

Query: 80  TGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYT 139
           TGS L+W  C PC  C  Q   IF P +SSS++ + C S+ C   P + C+   + C Y 
Sbjct: 116 TGSDLIWTQCKPCTQCFHQSTPIFDPKKSSSFSKLSCSSQLCEALPQSSCN---NGCEYL 172

Query: 140 QLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFS---GIFGLGIGR 196
             Y     T   +++E LTF         V +V FGCG + N    FS   G+ GLG G 
Sbjct: 173 YSYGDYSSTQGILASETLTFGKAS-----VPNVAFGCG-ADNEGSGFSQGAGLVGLGRGP 226

Query: 197 SSLVSQLNS-SFSYCIGSLHDPDYLHNKLILGDGAIIDEGDA----TPLQFIDGH---YY 248
            SLVSQL    FSYC+ ++ D     + L++G  A ++   +    TPL     H   YY
Sbjct: 227 LSLVSQLKEPKFSYCLTTVDDTKT--STLLMGSLASVNASSSAIKTTPLIHSPAHPSFYY 284

Query: 249 ITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGE 307
           ++LE ISV    L I  + F  +DD  GG++IDSGT +T+L + A+  +  E   ++   
Sbjct: 285 LSLEGISVGDTRLPIKKSTFSLQDDGSGGLIIDSGTTITYLEESAFNLVAKEFTAKINLP 344

Query: 308 QMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS-MFYQPRPDAFCMAVNP 366
              S S    +C+     S     P + FHF  GA L L  ++ M         C+A+  
Sbjct: 345 VDSSGSTGLDVCFTLPSGSTNIEVPKLVFHFD-GADLELPAENYMIGDSSMGVACLAMGS 403

Query: 367 ASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
           +S        S+ G + QQ   V +D+ ++  +F    C++L
Sbjct: 404 SS------GMSIFGNVQQQNMLVLHDLEKETLSFLPTQCDLL 439


>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
          Length = 475

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 118/375 (31%), Positives = 170/375 (45%), Gaps = 36/375 (9%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
           N  F +++S+G P +P    +DTGS L+W  C PC +C  Q   +F P+ SS+YA +PC 
Sbjct: 113 NGEFLMDLSVGTPALPYAAIVDTGSDLVWTQCKPCVECFNQTTPVFDPAASSTYAALPCS 172

Query: 118 SEHCRYFPYARCSAYKHRCI------YTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
           S  C   P + C++            YT  Y     T   ++TE  T          V  
Sbjct: 173 SALCADLPTSTCASSSSSSSASSPCGYTYTYGDASSTQGVLATETFTLARQ-----KVPG 227

Query: 172 VVFGCGFSTNRNFKFS---GIFGLGIGRSSLVSQLN-SSFSYCIGSLHD-----PDYLHN 222
           V FGCG  TN    F+   G+ GLG G  SLVSQL    FSYC+ SL D     P  L +
Sbjct: 228 VAFGCG-DTNEGDGFTQGAGLVGLGRGPLSLVSQLGIDRFSYCLTSLDDAAGRSPLLLGS 286

Query: 223 KLILGDGAIIDEGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFK-RDDSGGGVM 278
              +   A       TPL         YY++L  ++V    L +  + F  +DD  GGV+
Sbjct: 287 AAGISASAATAPAQTTPLVKNPSQPSFYYVSLTGLTVGSTRLALPSSAFAIQDDGTGGVI 346

Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYH---GIMSSDLK-GFPTV 334
           +DSGT +T+L   AY ALR   +  +    + +      LC+    G +  D++   P +
Sbjct: 347 VDSGTSITYLELRAYRALRKAFVAHMSLPTVDASEIGLDLCFQGPAGAVDQDVQVQVPKL 406

Query: 335 RFHFRGGAKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
             HF GGA L L  ++ M       A C+ V    + +R    S+IG   QQ +   YD+
Sbjct: 407 VLHFDGGADLDLPAENYMVLDSASGALCLTV----MASR--GLSIIGNFQQQNFQFVYDV 460

Query: 394 GRKQKTFQRMDCEVL 408
                +F   +C  L
Sbjct: 461 AGDTLSFAPAECNKL 475


>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 473

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 131/414 (31%), Positives = 187/414 (45%), Gaps = 45/414 (10%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHN-TYQAQILPSNDIS-- 57
           L+HRD +  P  N + +   R    +     R+A L+  + +   TY  +   S+ +S  
Sbjct: 70  LVHRDKV--PTFNTSHDHRTRFNARMQRDTKRVAALRRHLAAGKPTYAEEAFGSDVVSGM 127

Query: 58  ---NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYV 114
              +  ++V I +G PP  Q+  +D+GS ++WV C PC  C  Q   +F P+ SSSYA V
Sbjct: 128 EQGSGEYFVRIGVGSPPRNQYVVIDSGSDIIWVQCEPCTQCYHQSDPVFNPADSSSYAGV 187

Query: 115 PCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
            C S  C +   A C  ++ RC Y   Y  G  T   ++ E LTF  T      +++V  
Sbjct: 188 SCASTVCSHVDNAGC--HEGRCRYEVSYGDGSYTKGTLALETLTFGRT-----LIRNVAI 240

Query: 175 GCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDG 229
           GCG      F   +G+ GLG G  S V QL      +FSYC+ S          L  G  
Sbjct: 241 GCGHHNQGMFVGAAGLLGLGSGPMSFVGQLGGQAGGTFSYCLVSRGIQS--SGLLQFGRE 298

Query: 230 AIIDEGDATPLQF---IDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDV 285
           A+       PL         YY+ L  + V G  + I+ ++FK  + G GGV++D+GT V
Sbjct: 299 AVPVGAAWVPLIHNPRAQSFYYVGLSGLGVGGLRVPISEDVFKLSELGDGGVVMDTGTAV 358

Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----PTVRFHFRG 340
           T L   AYEA RD  + +       S       CY      DL GF     PTV F+F G
Sbjct: 359 TRLPTAAYEAFRDAFIAQTTNLPRASGVSIFDTCY------DLFGFVSVRVPTVSFYFSG 412

Query: 341 GAKLALEKDSMFYQPRPD--AFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
           G  L L   + F  P  D  +FC A  P+S        S+IG + Q+   +  D
Sbjct: 413 GPILTLPARN-FLIPVDDVGSFCFAFAPSS-----SGLSIIGNIQQEGIEISVD 460


>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
 gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
          Length = 460

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 118/378 (31%), Positives = 173/378 (45%), Gaps = 39/378 (10%)

Query: 57  SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC-YPCRDCSPQLGTIFYPSRSSSYAYVP 115
           S + + V+ +IG PP+     +DTGS L+W  C  PCR C PQ   ++ P+RS +YA V 
Sbjct: 96  STATYLVDFAIGTPPLALSAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSVTYANVS 155

Query: 116 CDSEHCRYFPYARCSAY-----------KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDE 164
           C S  C   P  R S+            +  C Y   Y  G  T   ++TE  TF     
Sbjct: 156 CGSRLCDALPSLRPSSRCSASASAPAPERGGCTYYYSYGDGSSTDGVLATETFTFGA--G 213

Query: 165 STIHVQDVVFGCGF-STNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHN 222
           +T+H  D+ FGCG  +       SG+ G+G G  SLVSQL  + FSYC    +D     +
Sbjct: 214 TTVH--DLAFGCGTDNLGGTDNSSGLVGMGRGPLSLVSQLGVTKFSYCFTPFND-TTTSS 270

Query: 223 KLILGDGAIIDEGDATPLQFI--------DGHYYITLEAISVDGRMLDINPNIFKRDDSG 274
            L LG  A +    A    F+          +YY++LE I+V   +L I+P +F+   SG
Sbjct: 271 PLFLGSSASLSPA-AKSTPFVPSPSGPRRSSYYYLSLEGITVGDTLLPIDPAVFRLTASG 329

Query: 275 -GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG--F 331
            GG++IDSGT  T L + A+  L   V  R+             +C+        +    
Sbjct: 330 RGGLIIDSGTTFTALEERAFVVLARAVAARVALPLASGAHLGLSVCFAAPQGRGPEAVDV 389

Query: 332 PTVRFHFRGGAKLALEKDSMFYQPR-PDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
           P +  HF  GA + L + S   + R     C+ +  A         S++G M QQ  +V 
Sbjct: 390 PRLVLHFD-GADMELPRSSAVVEDRVAGVACLGIVSAR------GMSVLGSMQQQNMHVR 442

Query: 391 YDIGRKQKTFQRMDCEVL 408
           YD+GR   +F+  +C  L
Sbjct: 443 YDVGRDVLSFEPANCGEL 460


>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 500

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 117/347 (33%), Positives = 170/347 (48%), Gaps = 28/347 (8%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++  I +G P    +  +DTGS + W+ C PC DC  Q   +F P+ SS+Y  + C +  
Sbjct: 162 YFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCSDCYQQSDPVFNPTSSSTYKSLTCSAPQ 221

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C     + C +  ++C+Y   Y  G  T   ++T+ +TF N+ +    + DV  GCG   
Sbjct: 222 CSLLETSACRS--NKCLYQVSYGDGSFTVGELATDTVTFGNSGK----INDVALGCGHDN 275

Query: 181 NRNFK-FSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
              F   +G+ GLG G  S+ +Q+  +SFSYC   L D D   +  +  +   +  GDAT
Sbjct: 276 EGLFTGAAGLLGLGGGALSITNQMKATSFSYC---LVDRDSGKSSSLDFNSVQLGSGDAT 332

Query: 239 -PL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAY 293
            PL   Q ID  YY+ L   SV G+ + +   IF  D SG GGV++D GT VT L  +AY
Sbjct: 333 APLLRNQKIDTFYYVGLSGFSVGGQKVMMPDAIFDVDASGSGGVILDCGTAVTRLQTQAY 392

Query: 294 EALRDEVMIRLEGEQMRSYSWPD--KLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSM 351
            +LRD   ++L     +  S       CY     S +K  PTV FHF GG  L L   + 
Sbjct: 393 NSLRD-AFLKLTTNLKKGTSSISLFDTCYDFSSLSSVK-VPTVAFHFTGGKSLDLPAKN- 449

Query: 352 FYQPRPD--AFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
           +  P  D   FC A  P S      + S+IG + QQ   + YD+  K
Sbjct: 450 YLIPVDDNGTFCFAFAPTS-----SSLSIIGNVQQQGTRITYDLANK 491


>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
          Length = 459

 Score =  142 bits (357), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 127/429 (29%), Positives = 192/429 (44%), Gaps = 56/429 (13%)

Query: 1   LIHRDSILSPYNNPNENPA--HRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISN 58
           L+HR    +P    ++ P+   R++R    S AR  Y+  +    N      L    + +
Sbjct: 63  LVHRHGPCAPSTRSSDEPSLSERLRR----SRARSKYIMSRASKSNVSIPTHL-GGSVDS 117

Query: 59  SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD--CSPQLGTIFYPSRSSSYAYVPC 116
             + V + +G P V Q   +DTGS L WV C PC    C PQ   +F PSRSS+YA +PC
Sbjct: 118 LEYVVTVGLGTPAVSQVLLIDTGSDLSWVQCAPCNSTTCYPQKDPLFDPSRSSTYAPIPC 177

Query: 117 DSEHCRYFPY----ARC---SAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHV 169
           +++ CR        + C   S    +C Y   Y  G +T+   S E LT        + V
Sbjct: 178 NTDACRDLTRDGYGSDCTSGSGGGAQCGYAITYGDGSQTTGVYSNETLTMA----PGVTV 233

Query: 170 QDVVFGCGFSTNR-NFKFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKL 224
           +D  FGCG   +  N K+ G+ GLG    SLV Q +S    +FSYC+ + +D        
Sbjct: 234 KDFHFGCGHDQDGPNDKYDGLLGLGGAPESLVVQTSSVYGGAFSYCLPAAND-----QAG 288

Query: 225 ILGDGAIIDEGDA---TPLQFIDGHYYIT-LEAISVDGRMLDINPNIFKRDDSGGGVMID 280
            L  GA +++      TP+      +Y+  +  I+V G  +D+ P+ F      GG++ID
Sbjct: 289 FLALGAPVNDASGFVFTPMVREQQTFYVVNMTGITVGGEPIDVPPSAFS-----GGMIID 343

Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRG 340
           SGT VT L   AY AL+      +    +      D  CY+    S++   P V   F G
Sbjct: 344 SGTVVTELQHTAYAALQAAFRKAMAAYPLLPNGELDT-CYNFTGHSNVT-VPRVALTFSG 401

Query: 341 GAKLALEKDSMFYQPRPDAF----CMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
           GA + L+         PD      C+A   A  +N+     ++G + Q+   V YD+G  
Sbjct: 402 GATVDLDV--------PDGILLDNCLAFQEAGPDNQP---GILGNVNQRTLEVLYDVGHG 450

Query: 397 QKTFQRMDC 405
           +  F    C
Sbjct: 451 RVGFGADAC 459


>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
 gi|194708650|gb|ACF88409.1| unknown [Zea mays]
          Length = 392

 Score =  142 bits (357), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 114/374 (30%), Positives = 173/374 (46%), Gaps = 39/374 (10%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSYAYVPCDSE 119
           + + ++IG PP+P     DTGS L+W  C PC   C  Q   ++ PS S+++A +PC+S 
Sbjct: 32  YLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPCNSS 91

Query: 120 ---------HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
                         P   C+     C Y   Y  G  TSVF  +E  TF +T      V 
Sbjct: 92  LSVCAAALAGTGTAPPPGCA-----CTYNVTYGSG-WTSVFQGSETFTFGSTPAGHARVP 145

Query: 171 DVVFGCGFSTN--RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILG 227
            + FGC  +++       SG+ GLG GR SLVSQL    FSYC+    D +   + L+LG
Sbjct: 146 GIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVPKFSYCLTPYQDTNS-TSTLLLG 204

Query: 228 DGAIIDEG---DATPL------QFIDGHYYITLEAISVDGRMLDINPNIFKRD-DSGGGV 277
             A ++      +TP         ++  YY+ L  IS+    L I P+ F  + D  GG+
Sbjct: 205 PSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFSLNADGTGGL 264

Query: 278 MIDSGTDVTWLVKEAYEALRDEV--MIRLEGEQMRSYSWPDKLCYHGIMSSDL-KGFPTV 334
           +IDSGT +T L   AY+ +R  V  ++ L      + +  D LC+    S+      P++
Sbjct: 265 IIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDGSADTGLD-LCFMLPSSTSAPPAMPSM 323

Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
             HF  GA + L  DS         +C+A+     N      +++G   QQ  ++ YDIG
Sbjct: 324 TLHFN-GADMVLPADSYMMSDDSGLWCLAMQ----NQTDGEVNILGNYQQQNMHILYDIG 378

Query: 395 RKQKTFQRMDCEVL 408
           ++  +F    C  L
Sbjct: 379 QETLSFAPAKCSAL 392


>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
 gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
 gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
 gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
 gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 483

 Score =  142 bits (357), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 119/353 (33%), Positives = 166/353 (47%), Gaps = 25/353 (7%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++  + IG+P    +  +DTGS + W+ C PC DC  Q   IF PS SSSY  + CD+  
Sbjct: 148 YFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSSSYEPLSCDTPQ 207

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C     + C      C+Y   Y  G  T    +TE LT  +T      VQ+V  GCG S 
Sbjct: 208 CNALEVSECR--NATCLYEVSYGDGSYTVGDFATETLTIGST-----LVQNVAVGCGHSN 260

Query: 181 NRNFKFSGIFGLGIGRSSLV-SQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
              F  +       G    + SQLN +SFSYC   L D D      +    ++  +    
Sbjct: 261 EGLFVGAAGLLGLGGGLLALPSQLNTTSFSYC---LVDRDSDSASTVDFGTSLSPDAVVA 317

Query: 239 PL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYE 294
           PL     +D  YY+ L  ISV G +L I  + F+ D+SG GG++IDSGT VT L  E Y 
Sbjct: 318 PLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQTEIYN 377

Query: 295 ALRDE-VMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL-EKDSMF 352
           +LRD  V   L+ E+    +  D  CY+    + ++  PTV FHF GG  LAL  K+ M 
Sbjct: 378 SLRDSFVKGTLDLEKAAGVAMFDT-CYNLSAKTTVE-VPTVAFHFPGGKMLALPAKNYMI 435

Query: 353 YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
                  FC+A  P +      + ++IG + QQ   V +D+      F    C
Sbjct: 436 PVDSVGTFCLAFAPTA-----SSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 483


>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
 gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 450

 Score =  142 bits (357), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 114/374 (30%), Positives = 173/374 (46%), Gaps = 39/374 (10%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSYAYVPCDSE 119
           + + ++IG PP+P     DTGS L+W  C PC   C  Q   ++ PS S+++A +PC+S 
Sbjct: 90  YLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPCNSS 149

Query: 120 ---------HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
                         P   C+     C Y   Y  G  TSVF  +E  TF +T      V 
Sbjct: 150 LSVCAAALAGTGTAPPPGCA-----CTYNVTYGSG-WTSVFQGSETFTFGSTPAGQSRVP 203

Query: 171 DVVFGCGFSTN--RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILG 227
            + FGC  +++       SG+ GLG GR SLVSQL    FSYC+    D +   + L+LG
Sbjct: 204 GIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVPKFSYCLTPYQDTNS-TSTLLLG 262

Query: 228 DGAIIDEG---DATPL------QFIDGHYYITLEAISVDGRMLDINPNIFKRD-DSGGGV 277
             A ++      +TP         ++  YY+ L  IS+    L I P+ F  + D  GG+
Sbjct: 263 PSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFLLNADGTGGL 322

Query: 278 MIDSGTDVTWLVKEAYEALRDEV--MIRLEGEQMRSYSWPDKLCYHGIMSSDL-KGFPTV 334
           +IDSGT +T L   AY+ +R  V  ++ L      + +  D LC+    S+      P++
Sbjct: 323 IIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDGSAATGLD-LCFMLPSSTSAPPAMPSM 381

Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
             HF  GA + L  DS         +C+A+     N      +++G   QQ  ++ YDIG
Sbjct: 382 TLHFN-GADMVLPADSYMMSDDSGLWCLAMQ----NQTDGEVNILGNYQQQNMHILYDIG 436

Query: 395 RKQKTFQRMDCEVL 408
           ++  +F    C  L
Sbjct: 437 QETLSFAPAKCSAL 450


>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 452

 Score =  141 bits (356), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 114/374 (30%), Positives = 173/374 (46%), Gaps = 39/374 (10%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSYAYVPCDSE 119
           + + ++IG PP+P     DTGS L+W  C PC   C  Q   ++ PS S+++A +PC+S 
Sbjct: 92  YLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPCNSS 151

Query: 120 ---------HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
                         P   C+     C Y   Y  G  TSVF  +E  TF +T      V 
Sbjct: 152 LSVCAAALAGTGTAPPPGCA-----CTYNVTYGSG-WTSVFQGSETFTFGSTPAGHARVP 205

Query: 171 DVVFGCGFSTN--RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILG 227
            + FGC  +++       SG+ GLG GR SLVSQL    FSYC+    D +   + L+LG
Sbjct: 206 GIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVPKFSYCLTPYQDTNS-TSTLLLG 264

Query: 228 DGAIIDEG---DATPL------QFIDGHYYITLEAISVDGRMLDINPNIFKRD-DSGGGV 277
             A ++      +TP         ++  YY+ L  IS+    L I P+ F  + D  GG+
Sbjct: 265 PSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFSLNADGTGGL 324

Query: 278 MIDSGTDVTWLVKEAYEALRDEV--MIRLEGEQMRSYSWPDKLCYHGIMSSDL-KGFPTV 334
           +IDSGT +T L   AY+ +R  V  ++ L      + +  D LC+    S+      P++
Sbjct: 325 IIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDGSADTGLD-LCFMLPSSTSAPPAMPSM 383

Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
             HF  GA + L  DS         +C+A+     N      +++G   QQ  ++ YDIG
Sbjct: 384 TLHFN-GADMVLPADSYMMSDDSGLWCLAMQ----NQTDGEVNILGNYQQQNMHILYDIG 438

Query: 395 RKQKTFQRMDCEVL 408
           ++  +F    C  L
Sbjct: 439 QETLSFAPAKCSAL 452


>gi|255566006|ref|XP_002523991.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536718|gb|EEF38359.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 455

 Score =  141 bits (356), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 130/437 (29%), Positives = 188/437 (43%), Gaps = 52/437 (11%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           LIH DS  SP+ N +E   HR+ + +  S  R+A L     S     A I   +      
Sbjct: 42  LIHIDSPNSPFFNASETTTHRLAKALQRSANRVARLNPLSNSDEGVHASIFSGD----GN 97

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + + + IG PP     A+DTGS+++W+ C  C+DC  Q  +IF P  SS+Y   PCDS  
Sbjct: 98  YLMKLLIGTPPTEIHAAIDTGSNVIWIPCINCKDCFNQSSSIFNPLASSTYQDAPCDSYQ 157

Query: 121 CRYFPYARCSAYKHRCIYT---QLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
           C     + C +  + C+Y+   +  L  P   + V T  LT  +     +   D V  CG
Sbjct: 158 CET-TSSSCQS-DNVCLYSCDEKHQLNCPNGRIAVDTMTLTSSDGRPFPLPYSDFV--CG 213

Query: 178 FSTNRNFKFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAIID 233
            S  + F   G+ GLG G  SL S+L    +  FSYC+   +      +K+  G  + I 
Sbjct: 214 NSIYKTFAGVGVIGLGRGALSLTSKLYHLSDGKFSYCLADYYSKQ--PSKINFGLQSFIS 271

Query: 234 EGDATPLQFIDGH------YYITLEAISVDGRMLDINPNIFKRDD----SGGGVMIDSGT 283
           + D   +    GH      YY+TLE ISV  +  D    ++  DD      G ++IDSGT
Sbjct: 272 DDDLEVVSTTLGHHRHSGNYYVTLEGISVGEKRQD----LYYVDDPFAPPVGNMLIDSGT 327

Query: 284 DVTWLVKEAYEALRDEVMIRL-EGEQMRSYSWPDKLCYHGIMSSDLK-----------GF 331
             T L K+ Y+ L   V   + E  Q      P    +   M + LK            F
Sbjct: 328 MFTLLPKDFYDYLWSTVSYAIPENPQNH----PHNSRFPFSMDNTLKLSPCFWYYPELKF 383

Query: 332 PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
           P +  HF   A + L  D+ F +   D  C A          V     G   Q  + +GY
Sbjct: 384 PKITIHFT-DADVELSDDNSFIRVAEDVVCFAFAATQPGQSTV----YGSWQQMNFILGY 438

Query: 392 DIGRKQKTFQRMDCEVL 408
           D+ R   +F+R DC  L
Sbjct: 439 DLKRGTVSFKRTDCSKL 455


>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 412

 Score =  141 bits (356), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 131/416 (31%), Positives = 196/416 (47%), Gaps = 50/416 (12%)

Query: 20  HRVQRGINISIARLAYLQEKIR----SHN--TYQAQILPSNDISNSLFYVNISIGQPPVP 73
            R+Q+ +     R+  +Q +IR    SHN    Q QI  S+ I+       +++G     
Sbjct: 16  RRLQKQLISDDLRVRSMQNRIRRVVSSHNVEASQTQIPLSSGINLQTLNYIVTMGLGSTN 75

Query: 74  QFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYAR----- 128
               +DTGS L WV C PC  C  Q G IF PS SSSY  V C+S  C+   +A      
Sbjct: 76  MTVIIDTGSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVSCNSSTCQSLQFATGNTGA 135

Query: 129 CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFK--F 186
           C +    C Y   Y  G  T+  +  EQL+F       + V D VFGCG    RN K  F
Sbjct: 136 CGSNPSTCNYVVNYGDGSYTNGELGVEQLSFGG-----VSVSDFVFGCG----RNNKGLF 186

Query: 187 SGIFGL-GIGRS--SLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEGDATP 239
            G+ GL G+GRS  SLVSQ N++    FSYC+ +          L++G+ + + + + TP
Sbjct: 187 GGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTTESG--ASGSLVMGNESSVFK-NVTP 243

Query: 240 LQF--------IDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKE 291
           + +        +   Y + L  I VDG  L + P+        GGV+IDSGT +T L   
Sbjct: 244 ITYTRMLPNPQLSNFYILNLTGIDVDGVALQV-PSF-----GNGGVLIDSGTVITRLPSS 297

Query: 292 AYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS 350
            Y+AL+   + +  G      +S  D  C++ +   D    PT+  HF G A+L ++   
Sbjct: 298 VYKALKALFLKQFTGFPSAPGFSILDT-CFN-LTGYDEVSIPTISMHFEGNAELKVDATG 355

Query: 351 MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
            FY  + DA  + +  AS+++ Y + ++IG   Q+   V YD  + +  F    C 
Sbjct: 356 TFYVVKEDASQVCLALASLSDAY-DTAIIGNYQQRNQRVIYDTKQSKVGFAEESCS 410


>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 469

 Score =  141 bits (355), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 128/421 (30%), Positives = 185/421 (43%), Gaps = 33/421 (7%)

Query: 2   IHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKI----RSHNTYQAQILPSNDIS 57
           +H    LS  + P      R+QR     +  ++YL E      R    + + ++      
Sbjct: 64  LHHVDALSFNSTPETLFTTRLQRDA-ARVEAISYLAETAGTGKRVGTGFSSSVISGLAQG 122

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
           +  ++  I +G PP   +  +DTGS ++W+ C PC+ C  Q   +F P +S S+A + C 
Sbjct: 123 SGEYFTRIGVGTPPRYVYMVLDTGSDIVWIQCAPCKRCYAQSDPVFDPRKSRSFASIACR 182

Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
           S  C       C+  K  C+Y   Y  G  T    STE LTF+ T      V  V  GCG
Sbjct: 183 SPLCHRLDSPGCNTQKQTCMYQVSYGDGSFTFGDFSTETLTFRRT-----RVARVALGCG 237

Query: 178 FSTNRNF-KFSGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAII 232
                 F   +G+ GLG GR S  SQ     N  FSYC+          + ++ GD A+ 
Sbjct: 238 HDNEGLFVGAAGLLGLGRGRLSFPSQTGRRFNHKFSYCLVD-RSASSKPSSMVFGDSAVS 296

Query: 233 DEGDATPL---QFIDGHYYITLEAISVDG-RMLDINPNIFKRDDSG-GGVMIDSGTDVTW 287
                TPL     +D  YY+ L  ISV G R+  I  ++FK D +G GGV+IDSGT VT 
Sbjct: 297 RTARFTPLVSNPKLDTFYYVELLGISVGGTRVPGITASLFKLDQTGNGGVIIDSGTSVTR 356

Query: 288 LVKEAYEALRDEVMIRLEGEQMR---SYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKL 344
           L + AY A RD    R     ++    +S  D  C+     +++K  PTV  HFRG    
Sbjct: 357 LTRPAYIAFRDA--FRAGASNLKRAPQFSLFDT-CFDLSGKTEVK-VPTVVLHFRGADVS 412

Query: 345 ALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
               + +        FC+A             S+IG + QQ + V YD+   +  F    
Sbjct: 413 LPASNYLIPVDTSGNFCLA-----FAGTMGGLSIIGNIQQQGFRVVYDLAGSRVGFAPHG 467

Query: 405 C 405
           C
Sbjct: 468 C 468


>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  141 bits (355), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 116/421 (27%), Positives = 195/421 (46%), Gaps = 35/421 (8%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           +IHRD   SP  +P      R    ++ SI R+ Y  ++  S N  Q    P + ++  L
Sbjct: 32  MIHRDFSKSPLYHPTVTKFQRAYNVVHRSINRVNYFTKEF-SLNKNQ----PVSTLTPEL 86

Query: 61  --FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
             + ++ S+G PP   +  MDTGS+++W+ C PC  C  Q   IF PS+SSSY  +PC S
Sbjct: 87  GEYLISYSVGTPPFKVYGFMDTGSNIVWLQCQPCNTCFNQTSPIFNPSKSSSYKNIPCTS 146

Query: 119 EHCRYF--PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
             C+     +  CS     C Y+  Y    ++   +S + LT  +T  S++   ++V GC
Sbjct: 147 STCKDTNDTHISCSNGGDVCEYSITYGGDAKSQGDLSNDSLTLDSTSGSSVLFPNIVIGC 206

Query: 177 GFST--NRNFKFSGIFGLGIGRSSLVSQLNSS-----FSYCIGSLHDPDYLHNKLILGDG 229
           G       N + SG+ G+G G  SL+ Q+ SS     FSYC+   +      +KLI G+ 
Sbjct: 207 GHINVLQDNSQSSGVVGMGRGPMSLIKQVGSSSVGSKFSYCLIPYNSDSNSSSKLIFGED 266

Query: 230 AIIDEGD---ATPLQFIDG---HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGT 283
            ++  G+   +TP+  ++G   +Y++TLEA SV    ++      + + S   ++IDSGT
Sbjct: 267 VVV-SGEIVVSTPMVKVNGQENYYFLTLEAFSVGNNRIEYGE---RSNASTQNILIDSGT 322

Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAK 343
            +T L       L   V   ++  ++        LCY+   +      P +  HF  GA 
Sbjct: 323 PLTMLPNLFLSKLVSYVAQEVKLPRIEPPDHHLSLCYN--TTGKQLNVPDITAHFN-GAD 379

Query: 344 LALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
           + L  +  F+       C     ++         + G +AQ    + YD+ ++  +F+  
Sbjct: 380 VKLNSNGTFFPFEDGIMCFGFISSN------GLEIFGNIAQNNLLIDYDLEKEIISFKPT 433

Query: 404 D 404
           D
Sbjct: 434 D 434


>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 416

 Score =  141 bits (355), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 109/358 (30%), Positives = 171/358 (47%), Gaps = 29/358 (8%)

Query: 63  VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR 122
           + I IG PP+     +DTGS L+W+ C PC  C  Q+  +F P +SS+Y  + CDS  C 
Sbjct: 70  MEIYIGTPPIKITGLVDTGSDLIWIQCAPCLGCYKQIKPMFDPLKSSTYNNISCDSPLCH 129

Query: 123 YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNR 182
                 CS  K RC YT  Y     T   ++ +  TF +     + +   +FGCG +   
Sbjct: 130 KLDTGVCSPEK-RCNYTYGYGDNSLTKGVLAQDTATFTSNTGKPVSLSRFLFGCGHNNTG 188

Query: 183 NFK--FSGIFGLGIGRSSLVSQL-----NSSFSYCIGSLHDPDYLHNKLILGDGA-IIDE 234
            F     G+ GLG G +SL+SQ+        FS C+        + +++  G G+ ++  
Sbjct: 189 GFNDHEMGLIGLGGGPTSLISQIGPLFGGKKFSQCLVPFLTDIKISSRMSFGKGSQVLGN 248

Query: 235 G-DATPL--QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKE 291
           G   TPL  +  D  Y++TL  ISV+     +N  I K +     +++DSGT    L ++
Sbjct: 249 GVVTTPLVPREKDTSYFVTLLGISVEDTYFPMNSTIGKAN-----MLVDSGTPPILLPQQ 303

Query: 292 AYEALRDEVMIRLEGEQMR-SYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS 350
            Y+ +  EV  ++  + +    S   +LCY     ++LKG PT+ FHF  GA + L    
Sbjct: 304 LYDKVFAEVRNKVALKPITDDPSLGTQLCYR--TQTNLKG-PTLTFHFV-GANVLLTPIQ 359

Query: 351 MFYQPRPDA---FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
            F  P P     FC+A+     N    +  + G  AQ  Y +G+D+ R+  +F+  DC
Sbjct: 360 TFIPPTPQTKGIFCLAI----YNRTNSDPGVYGNFAQSNYLIGFDLDRQVVSFKPTDC 413


>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 496

 Score =  141 bits (355), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 113/354 (31%), Positives = 168/354 (47%), Gaps = 26/354 (7%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           +++ + IG+P    +  +DTGS + W+ C PC DC  Q+  IF P+ SSS++ + C +  
Sbjct: 160 YFLRVGIGRPSKTFYMVIDTGSDVNWLQCKPCDDCYQQVDPIFDPASSSSFSRLGCQTPQ 219

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           CR      C      C+Y   Y  G  T    +TE ++F N+      V  V  GCG   
Sbjct: 220 CRNLDVFAC--RNDSCLYQVSYGDGSYTVGDFATETVSFGNSGS----VDKVAIGCGHDN 273

Query: 181 NRNFKFSGIFGLGIGRS-SLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
              F  +       G   SL SQ+  SSFSYC   L + D + +  +  + A   +    
Sbjct: 274 EGLFVGAAGLIGLGGGPLSLTSQIKASSFSYC---LVNRDSVDSSTLEFNSAKPSDSVTA 330

Query: 239 PL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYE 294
           P+     +D  YY+ +  +SV G  L I P+IF+ D SG GG+++D GT VT L  +AY 
Sbjct: 331 PIFKNSKVDTFYYVGITGMSVGGEKLAIPPSIFEVDGSGKGGIIVDCGTAVTRLQTQAYN 390

Query: 295 ALRDE-VMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFY 353
           ALRD  V +  +      ++  D  CY+    + ++  PTV F F GG  L L   S + 
Sbjct: 391 ALRDTFVKLTKDLPSTSGFALFDT-CYNLSSRTSVR-VPTVAFLFDGGKSLPLPP-SNYL 447

Query: 354 QPRPDA--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
            P   A  FC+A  P +      + S+IG + QQ   V YD+   Q +F    C
Sbjct: 448 IPVDSAGTFCLAFAPTT-----ASLSIIGNVQQQGTRVTYDLANSQVSFSSRKC 496


>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
          Length = 496

 Score =  141 bits (355), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 126/438 (28%), Positives = 187/438 (42%), Gaps = 48/438 (10%)

Query: 1   LIHRDSIL-SPYNNPNENPAHRVQRGINISIARLAYLQEKIRSH---------------- 43
           L+HRDS+L     N   +   R++  +    AR+  L+++I                   
Sbjct: 75  LVHRDSLLFKGAANATASYERRLEEKLRREAARVRALEQRIERKLKLKKDPAGSYENVAG 134

Query: 44  --NTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGT 101
               + ++++   +  +  ++  I IG P   Q+  +DTGS ++W+ C PCR+C  Q   
Sbjct: 135 VTAEFGSEVVSGMEQGSGEYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQADP 194

Query: 102 IFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKN 161
           IF PS S S++ V CDS  C       C  +   C+Y   Y  G  T    +TE LTF  
Sbjct: 195 IFNPSSSVSFSTVGCDSAVCSQLDANDC--HGGGCLYEVSYGDGSYTVGSYATETLTFGT 252

Query: 162 TDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRS-----SLVSQLNSSFSYCIGSLHD 216
           T      +Q+V  GCG      F  +         S      L +Q   +FSYC   L D
Sbjct: 253 TS-----IQNVAIGCGHDNVGLFVGAAGLLGLGAGSLSFPAQLGTQTGRAFSYC---LVD 304

Query: 217 PDYLHN-KLILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPN-IFKRD 271
            D   +  L  G  ++      TPL    F+   YY+++ AISV G +LD  P+  F+ D
Sbjct: 305 RDSESSGTLEFGPESVPIGSIFTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRID 364

Query: 272 DSG--GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLK 329
           ++   GG++IDSGT VT L   AY+ALRD  +   +             CY  + +    
Sbjct: 365 ETTGRGGIIIDSGTAVTRLQTSAYDALRDAFIAGTQHLPRADGISIFDTCYD-LSALQSV 423

Query: 330 GFPTVRFHFRGGAKLAL-EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYN 388
             P V FHF  GA   L  K+ +        FC A  PA       N S++G + QQ   
Sbjct: 424 SIPAVGFHFSNGAGFILPAKNCLIPMDSMGTFCFAFAPAD-----SNLSIMGNIQQQGIR 478

Query: 389 VGYDIGRKQKTFQRMDCE 406
           V +D       F    C+
Sbjct: 479 VSFDSANSLVGFAIDQCQ 496


>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 434

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 119/426 (27%), Positives = 187/426 (43%), Gaps = 40/426 (9%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           LIHRDS  SP  N +E    R+   +  S  R   + E     +T +A I  +       
Sbjct: 31  LIHRDSPKSPMYNSSETHFDRIVNALRRSSHRNTVVLES----DTAEAPIFNNG----GE 82

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + V IS+G PP       DTGS ++W  C PC +C  Q   +F PS+S++Y  V C S  
Sbjct: 83  YLVEISVGTPPFSIVAVADTGSDVIWTQCKPCSNCYQQNAPMFDPSKSTTYKNVACSSPV 142

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C Y       +    C+Y+  Y     +   ++ + +T ++T    +     V GCG   
Sbjct: 143 CSYSGDGSSCSDDSECLYSIAYGDDSHSQGNLAVDTVTMQSTSGRPVAFPRTVIGCGHDN 202

Query: 181 NRNF--KFSGIFGLGIGRSSLVSQLNSS----FSYCI-----GSLHDPDYLH---NKLIL 226
              F    SGI GLG G +SLV+QL  +    FSYC+     GS +D   L+   N  + 
Sbjct: 203 AGTFNANVSGIVGLGRGPASLVTQLGPATGGKFSYCLIPIGTGSTNDSTKLNFGSNANVS 262

Query: 227 GDGAIIDEGDATPL----QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSG 282
           G G +     +TP+    Q+    Y + LEA+SV     +  P    +      ++IDSG
Sbjct: 263 GSGTV-----STPIYSSAQY-KTFYSLKLEAVSVGDTKFNF-PEGASKLGGESNIIIDSG 315

Query: 283 TDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGA 342
           T +T+L      +    +   +     +  S     C+    ++D    P V  HF  GA
Sbjct: 316 TTLTYLPSALLNSFGSAISQSMSLPHAQDPSEFLDYCF--ATTTDDYEMPPVTMHFE-GA 372

Query: 343 KLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
            + L+++++F +   D  C+A      +N ++     G +AQ  + VGYDI     +FQ 
Sbjct: 373 DVPLQRENLFVRLSDDTICLAFGSFPDDNIFI----YGNIAQSNFLVGYDIKNLAVSFQP 428

Query: 403 MDCEVL 408
             C  +
Sbjct: 429 AHCGAV 434


>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 372

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 110/359 (30%), Positives = 163/359 (45%), Gaps = 22/359 (6%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           F V I +G PP      +DTGS L W+   PCR C  Q   IF PS+SS+Y  + C S  
Sbjct: 25  FLVPIYLGTPPQKAVVIIDTGSDLTWIQSEPCRACFEQADPIFDPSKSSTYNKIACSSSA 84

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C      +  +    CIY   Y  G  T  + S E +T  +T    +     V+  G  T
Sbjct: 85  CADLLGTQTCSAAANCIYAYGYGDGSVTRGYFSKETITATDTAGEEVKFGASVYNTG--T 142

Query: 181 NRNFKFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILGDGAIID-EG 235
             +    GI GLG G  S+ SQL S     FSYC+          + +  GD A+   E 
Sbjct: 143 FGDTGGEGILGLGQGPVSMPSQLGSVLGNKFSYCLVDWLSAGSETSTMYFGDAAVPSGEV 202

Query: 236 DATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKE 291
             TP+     H   YYI ++ ISV G +LDI+ ++++ D  G GG +IDSGT +T+L +E
Sbjct: 203 QYTPIVPNADHPTYYYIAVQGISVGGSLLDIDQSVYEIDSGGSGGTIIDSGTTITYLQQE 262

Query: 292 AYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSM 351
            + AL      ++      S +  D LC++    +    FP +  H   G  L L   + 
Sbjct: 263 VFNALVAAYTSQVRYPTTTSATGLD-LCFN-TRGTGSPVFPAMTIHLD-GVHLELPTANT 319

Query: 352 FYQPRPDAFCMAVNPASINNRYVNF--SLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
           F     +  C+A   A      ++F  ++ G + QQ +++ YD+   +  F   DC  L
Sbjct: 320 FISLETNIICLAFASA------LDFPIAIFGNIQQQNFDIVYDLDNMRIGFAPADCASL 372


>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
 gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
          Length = 749

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 118/396 (29%), Positives = 182/396 (45%), Gaps = 51/396 (12%)

Query: 46  YQAQILPSNDISNSL----FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGT 101
           Y +Q++ + +   SL    +++++ IG PP      +DTGS L W+ C PC  C  Q G 
Sbjct: 173 YSSQLVATLESGVSLGSGEYFMDVFIGTPPKHYSLILDTGSDLNWIQCVPCIACFEQSGP 232

Query: 102 IFYPSRSSSYAYVPCDSEHCRYF----PYARCSAYKHRCIYTQLYLIGPETS----VFVS 153
            + P  SSS+  + C    C+      P   C      C Y   Y     T+    +   
Sbjct: 233 YYDPKESSSFENITCHDPRCKLVSSPDPPKPCKDENQTCPYFYWYGDSSNTTGDFALETF 292

Query: 154 TEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIF-------GLGIGRSSLVSQLNS- 205
           T  LT  N      HV++V+FGCG   NR     G+F       GLG G  S  SQL S 
Sbjct: 293 TVNLTTPNGKSEQKHVENVMFGCG-HWNR-----GLFHGAAGLLGLGRGPLSFASQLQSI 346

Query: 206 ---SFSYCIGSLHDPDYLHNKLILG-DGAIIDEGDATPLQFIDGH-------YYITLEAI 254
              SFSYC+   +    + +KLI G D  ++   +     F+ G        YY+ +++I
Sbjct: 347 YGHSFSYCLVDRNSDTSVSSKLIFGEDKELLSHPNLNFTSFVGGEENSVDTFYYVGIKSI 406

Query: 255 SVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYS 313
            VDG +L I    +    + GGG +IDSGT +T+  + AYE +++  M +++G ++    
Sbjct: 407 MVDGEVLKIPEETWHLSKEGGGGTIIDSGTTLTYFAEPAYEIIKEAFMKKIKGYELVEGF 466

Query: 314 WPDKLCYH--GIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAV--NPASI 369
            P K CY+  GI   +L  F  +   F  GA      ++ F Q  PD  C+A+   P S 
Sbjct: 467 PPLKPCYNVSGIEKMELPDFGIL---FSDGAMWDFPVENYFIQIEPDLVCLAILGTPKSA 523

Query: 370 NNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
                  S+IG   QQ +++ YD+ + +  +  M C
Sbjct: 524 ------LSIIGNYQQQNFHILYDMKKSRLGYAPMKC 553


>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 495

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 126/415 (30%), Positives = 185/415 (44%), Gaps = 36/415 (8%)

Query: 2   IHRDSILSPYNNPNENPAHRVQRGIN-ISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           +HRDS      +  +    R+Q  +N +S + L  LQ +I+  +     +       +  
Sbjct: 106 LHRDS------SRVQAITTRLQLILNGVSKSDLKPLQTEIQPQD-LSTPVSSGTSQGSGE 158

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++  + +G P    +  +DTGS + W+ C PC DC  Q   IF P+ SSSY+ + CDS+ 
Sbjct: 159 YFTRVGVGNPAKSYYMVLDTGSDINWIQCQPCSDCYQQSDPIFTPAASSSYSPLTCDSQQ 218

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C     + C     +C Y   Y  G  T     TE ++F  +      V  +  GCG   
Sbjct: 219 CNSLQMSSC--RNGQCRYQVNYGDGSFTFGDFVTETMSFGGSGT----VNSIALGCGHDN 272

Query: 181 NRNFKFSGIFGLGIGRS-SLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
              F  +       G   SL SQL  +SFSYC   L + D   +  +  + A + +    
Sbjct: 273 EGLFVGAAGLLGLGGGPLSLTSQLKATSFSYC---LVNRDSAASSTLDFNSAPVGDSVIA 329

Query: 239 PL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYE 294
           PL     ID  YY+ L  +SV G +L I   +FK DDSG GGV++D GT +T L  EAY 
Sbjct: 330 PLLKSSKIDTFYYVGLSGMSVGGELLRIPQEVFKLDDSGDGGVIVDCGTAITRLQSEAYN 389

Query: 295 ALRDEVMIRLEGEQMRSYSWPD--KLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMF 352
           +LRD  +       +RS S       CY     S +K  PTV FHF GG    L   + +
Sbjct: 390 SLRDSFVSM--SRHLRSTSGVALFDTCYDLSGQSSVK-VPTVSFHFDGGKSWDLPA-ANY 445

Query: 353 YQPRPDA--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
             P   A  +C A  P +      + S+IG + QQ   V +D+   +  F    C
Sbjct: 446 LIPVDSAGTYCFAFAPTT-----SSLSIIGNVQQQGTRVSFDLANNRVGFSTNKC 495


>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
 gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
          Length = 448

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 107/366 (29%), Positives = 167/366 (45%), Gaps = 27/366 (7%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR--DCSPQLGTIFYPSRSSSYAYVPCDS 118
           + + +SIG PP+      DTGS L+W  C PC    C  Q   ++ P+ S+++  +PC+S
Sbjct: 92  YLMTLSIGTPPLSYPAIADTGSDLIWTQCAPCSGDQCFAQPAPLYNPASSTTFGVLPCNS 151

Query: 119 --EHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
               C      +       C+Y Q Y  G  T+    +E  TF +       V  + FGC
Sbjct: 152 SLSMCAGVLAGKAPPPGCACMYNQTYGTG-WTAGVQGSETFTFGSAAADQARVPGIAFGC 210

Query: 177 GFSTNRNFKFS-GIFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNKLILGDGAIIDE 234
             +++ ++  S G+ GLG G  SLVSQL +  FSYC+    D +   + L+LG  A ++ 
Sbjct: 211 SNASSSDWNGSAGLVGLGRGSLSLVSQLGAGRFSYCLTPFQDTNS-TSTLLLGPSAALNG 269

Query: 235 GDATPLQFI--------DGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDV 285
                  F+          +YY+ L  IS+  + L I+P+ F  + D  GG++IDSGT +
Sbjct: 270 TGVRSTPFVASPAKAPMSTYYYLNLTGISLGAKALSISPDAFSLKADGTGGLIIDSGTTI 329

Query: 286 TWLVKEAYEALRDEV--MIRLEGEQMRSYSWPDKLCYH-GIMSSDLKGFPTVRFHFRGGA 342
           T LV  AY+ +R  V  ++ L        +  D LCY     +S     P++  HF  GA
Sbjct: 330 TSLVNAAYQQVRAAVQSLVTLPAIDGSDSTGLD-LCYALPTPTSAPPAMPSMTLHFD-GA 387

Query: 343 KLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
            + L  DS         +C+A+     N      S  G   QQ  ++ YD+  +  +F  
Sbjct: 388 DMVLPADSYMISGS-GVWCLAMR----NQTDGAMSTFGNYQQQNMHILYDVRNEMLSFAP 442

Query: 403 MDCEVL 408
             C  L
Sbjct: 443 AKCSTL 448


>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 420

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 113/390 (28%), Positives = 179/390 (45%), Gaps = 21/390 (5%)

Query: 30  IARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC 89
           + R A  + ++R+ + Y A   P        + + ++IG+PPVP     DTGS L W  C
Sbjct: 41  LMRRAVHRSRLRALSGYDATS-PRLHSVQVEYLMELAIGKPPVPFVALADTGSDLTWTQC 99

Query: 90  YPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETS 149
            PC+ C PQ   ++ PS SS+++ +PC S  C    ++R       C Y   Y  G  ++
Sbjct: 100 QPCKLCFPQDTPVYDPSASSTFSPLPCSSATCLPI-WSRNCTPSSLCRYRYAYGDGAYSA 158

Query: 150 VFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRN-FKFSGIFGLGIGRSSLVSQLN-SSF 207
             + TE LT      + + V  V FGCG     +    +G  GLG G  SL++QL    F
Sbjct: 159 GILGTETLTL-GPSSAPVSVGGVAFGCGTDNGGDSLNSTGTVGLGRGTLSLLAQLGVGKF 217

Query: 208 SYCIGSLHDPDYLHNKLILGDGAIIDEG----DATPLQFI---DGHYYITLEAISVDGRM 260
           SYC+    +   L +  +LG  A +  G     +TPL         Y+++L+ IS+    
Sbjct: 218 SYCLTDFFN-SALDSPFLLGTLAELAPGPSTVQSTPLLQSPQNPSRYFVSLQGISLGDVR 276

Query: 261 LDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLC 319
           L I    F  R D  GG+++DSGT  T L +  +  +   V  R+ G+   + S  D  C
Sbjct: 277 LPIPNGTFDLRGDGTGGMIVDSGTTFTILAESGFREVVGRVA-RVLGQPPVNASSLDAPC 335

Query: 320 YHGIMSSDLKGFPTVRFHFRGGAKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSL 378
           +    + +    P +  HF GGA + L +D+ M Y     +FC+ +   +  +     S+
Sbjct: 336 FPA-PAGEPPYMPDLVLHFAGGADMRLYRDNYMSYNEEDSSFCLNIAGTTPEST----SV 390

Query: 379 IGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
           +G   QQ   + +D    Q +F   DC  L
Sbjct: 391 LGNFQQQNIQMLFDTTVGQLSFLPTDCSKL 420


>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
 gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
          Length = 533

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 111/376 (29%), Positives = 178/376 (47%), Gaps = 41/376 (10%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           +++++ +G PP      +DTGS L W+ C PC+ C  Q G +F PS+S+S+  +PC++  
Sbjct: 171 YFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPCKACFDQSGPVFDPSQSTSFKIIPCNAAA 230

Query: 121 CRYFPYARCSAYKHR-----CIYTQLYLIGPETSVFVSTEQLTFKNTDE-STIHVQDVVF 174
           C    +  C     +     C Y   Y     TS  ++ E L+   +D  S++ ++D+V 
Sbjct: 231 CDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLEIRDMVI 290

Query: 175 GCGFSTN-RNFKFSGIFGLGIGRSSLVSQLNS-----SFSYCIGSLHDPDYLHNKLILGD 228
           GCG S         G+ GLG G  S  SQL S     SFSYC+    +   + + +  G 
Sbjct: 291 GCGHSNKGLFQGAGGLLGLGQGALSFPSQLRSSPIGQSFSYCLVDRTNNLSVSSAISFGA 350

Query: 229 GAII----DEGDATPL----QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMI 279
           G  +    D+   TP       ++  YY+ ++ I +D  +L I    F    +G GG +I
Sbjct: 351 GFALSRHFDQMRFTPFVRTNNSVETFYYLGIQGIKIDQELLPIPAERFAIAPNGSGGTII 410

Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK-----LCYHGIMSSDLKGFPTV 334
           DSGT +T+L ++AY A+    + R+      SY   D      +CY+    + +  FPT+
Sbjct: 411 DSGTTLTYLNRDAYRAVESAFLARI------SYPRADPFDILGICYNATGRTAVP-FPTL 463

Query: 335 RFHFRGGAKLALEKDSMFYQPRPD--AFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
              F+ GA+L L +++ F QP P     C+A+ P          S+IG   QQ  +  YD
Sbjct: 464 SIVFQNGAELDLPQENYFIQPDPQEAKHCLAILPTD------GMSIIGNFQQQNIHFLYD 517

Query: 393 IGRKQKTFQRMDCEVL 408
           +   +  F   DC  L
Sbjct: 518 VQHARLGFANTDCSAL 533


>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 499

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 109/361 (30%), Positives = 164/361 (45%), Gaps = 30/361 (8%)

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYV 114
           +S   + V + +G P        DTGS   WV C PC   C  Q G +F P++SS+YA V
Sbjct: 158 VSTGNYVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKGPLFDPAKSSTYANV 217

Query: 115 PCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
            C    C       C+     C+Y   Y  G  T  F + + LT  +       ++   F
Sbjct: 218 SCTDSACADLDTNGCTG--GHCLYAVQYGDGSYTVGFFAQDTLTIAHD-----AIKGFRF 270

Query: 175 GCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDG 229
           GCG   N  F K +G+ GLG G++SL  Q       +F+YC+ +L         L  G G
Sbjct: 271 GCGEKNNGLFGKTAGLMGLGRGKTSLTVQAYNKYGGAFAYCLPALTTGT---GYLDFGPG 327

Query: 230 AIIDEGDATPLQFIDGH--YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
           +  +    TP+    G   YY+ +  I V G+ + +  ++F    S  G ++DSGT +T 
Sbjct: 328 SAGNNARLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVF----STAGTLVDSGTVITR 383

Query: 288 LVKEAYEALR---DEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKL 344
           L   AY AL    D+VM+    ++   YS  D  CY     SD++  PTV   F+GGA L
Sbjct: 384 LPATAYTALSSAFDKVMLARGYKKAPGYSILDT-CYDFTGLSDVE-LPTVSLVFQGGACL 441

Query: 345 ALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
            ++   + Y       C+A    + N    + +++G   Q+ Y V YD+G+K   F    
Sbjct: 442 DVDVSGIVYAISEAQVCLAF---ASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGS 498

Query: 405 C 405
           C
Sbjct: 499 C 499


>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 430

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 123/423 (29%), Positives = 177/423 (41%), Gaps = 37/423 (8%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           LI R S +SP  N        V+     SI R   +    +        I P  D  +  
Sbjct: 30  LIPRHSPISPLYNSQMTQTELVKSAALRSITRSKRVNFIGQISPPLSPIITPIPD--HGE 87

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + +  S+G P V +    DTGS L W+ C PC+ C PQ   +F P++SS+Y  VPC+S+ 
Sbjct: 88  YLMRFSLGTPSVERLAIFDTGSDLSWLQCTPCKTCYPQEAPLFDPTQSSTYVDVPCESQP 147

Query: 121 CRYFPYAR--CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT--DESTIHVQDVVFGC 176
           C  FP  +  C + K +CIY   Y     T   +  + ++F +T   +        VFGC
Sbjct: 148 CTLFPQNQRECGSSK-QCIYLHQYGTDSFTIGRLGYDTISFSSTGMGQGGATFPKSVFGC 206

Query: 177 GFSTNRNFKFS----GIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILGD 228
            F +N  FK S    G  GLG G  SL SQL       FSYC+           KL  G 
Sbjct: 207 AFYSNFTFKISTKANGFVGLGPGPLSLASQLGDQIGHKFSYCMVPFSSTS--TGKLKFGS 264

Query: 229 GAIIDEGDATPLQF---IDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDV 285
            A  +E  +TP         +Y + LE I+V  + +            GG ++IDS   +
Sbjct: 265 MAPTNEVVSTPFMINPSYPSYYVLNLEGITVGQKKV-------LTGQIGGNIIIDSVPIL 317

Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLA 345
           T L +  Y      V   +  E       P + C     + +   FP   FHF  GA + 
Sbjct: 318 THLEQGIYTDFISSVKEAINVEVAEDAPTPFEYCVRNPTNLN---FPEFVFHFT-GADVV 373

Query: 346 LEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           L   +MF     +  CM V P+         S+ G  AQ  + V YD+G K+ +F   +C
Sbjct: 374 LGPKNMFIALDNNLVCMTVVPSK------GISIFGNWAQVNFQVEYDLGEKKVSFAPTNC 427

Query: 406 EVL 408
             +
Sbjct: 428 STI 430


>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
          Length = 497

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 116/364 (31%), Positives = 161/364 (44%), Gaps = 40/364 (10%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++  + +G PP   +  +DTGS ++W+ C PC  C  Q   +F P+ SS+Y  VPC +  
Sbjct: 153 YFTRLGVGTPPRYTYMVLDTGSDIMWIQCLPCAKCYGQTDPLFNPAASSTYRKVPCATPL 212

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C+    + C   K  C Y   Y  G  T    STE LTF+        ++ V  GCG   
Sbjct: 213 CKKLDISGCR-NKRYCEYQVSYGDGSFTVGDFSTETLTFRGQ-----VIRRVALGCGHDN 266

Query: 181 NRNFKFSGIFGLGIGR-----SSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
              F  +              S   +Q +  FSYC+          + LI G  AI    
Sbjct: 267 EGLFIGAAGLLGLGRGSLSFPSQTGAQFSKRFSYCLVD-RSASGTASSLIFGKAAIPKSA 325

Query: 236 DATPLQF---IDGHYYITLEAISVDGRML-DINPNIFKRDDSG-GGVMIDSGTDVTWLVK 290
             TPL     +D  YY+ L  ISV GR L  I  ++F+ D +G GGV+IDSGT VT LV 
Sbjct: 326 IFTPLLSNPKLDTFYYVELVGISVGGRRLTSIPASVFRMDATGNGGVIIDSGTSVTRLVD 385

Query: 291 EAYEALRDEVMIRLEGEQMRS---YSWPDKLCYHGIMSSDLKGF-----PTVRFHFRGGA 342
            AY  +RD    R+    ++S   +S  D  CY      DL G      PT+ FHF+GGA
Sbjct: 386 SAYSTMRDA--FRVGTGNLKSAGGFSLFDT-CY------DLSGLKTVKVPTLVFHFQGGA 436

Query: 343 KLALEKDSMFYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQ 401
            ++L   +        A FC A             S+IG + QQ Y V +D    +  F+
Sbjct: 437 HISLPATNYLIPVDSSATFCFA-----FAGNTGGLSIIGNIQQQGYRVVFDSLANRVGFK 491

Query: 402 RMDC 405
              C
Sbjct: 492 AGSC 495


>gi|124359514|gb|ABD28633.2| Peptidase aspartic, catalytic [Medicago truncatula]
          Length = 181

 Score =  139 bits (350), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 78/198 (39%), Positives = 117/198 (59%), Gaps = 19/198 (9%)

Query: 211 IGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKR 270
           +GSL D DY +N+LILG+ A +  GD TP Q  +G  ++T+E IS+  + LDI P  FK 
Sbjct: 1   MGSLTDKDYDYNQLILGEEAYL-AGDTTPFQVYNGVNHVTMEGISIGQKSLDIAPGTFKM 59

Query: 271 DDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG 330
            ++G G     G  +T  V+  ++        RL+ +++R    P  LCY G +S DLKG
Sbjct: 60  KNNGTG----GGLSLTQEVRNLFQ--------RLKFQEVRLQGSPWALCYFGSVSRDLKG 107

Query: 331 FPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
           FP V F+F GGA + L+  + F Q + D FCM+V+P+       + S+IG++AQQ YNVG
Sbjct: 108 FPVVTFYFAGGAVIGLDTLNFFVQAKDDVFCMSVHPSH------DLSVIGLLAQQSYNVG 161

Query: 391 YDIGRKQKTFQRMDCEVL 408
           YD  +     + +DC++L
Sbjct: 162 YDKDKGLIYIESIDCQLL 179


>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 469

 Score =  139 bits (349), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 106/366 (28%), Positives = 166/366 (45%), Gaps = 26/366 (7%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYVPCDS- 118
           + + ++IG PP+P     DTGS L+W  C PC   C  Q   ++ P+ S++++ +PC+S 
Sbjct: 112 YLMTLAIGTPPLPYAAVADTGSDLIWTQCAPCGTQCFEQPAPLYNPASSTTFSVLPCNSS 171

Query: 119 -EHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
              C              C+Y Q Y  G  T+    +E  TF ++      V  V FGC 
Sbjct: 172 LSMCAGALAGAAPPPGCACMYNQTYGTG-WTAGVQGSETFTFGSSAADQARVPGVAFGCS 230

Query: 178 FSTNRNFKFS-GIFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
            +++ ++  S G+ GLG G  SLVSQL +  FSYC+    D +   + L+LG  A ++  
Sbjct: 231 NASSSDWNGSAGLVGLGRGSLSLVSQLGAGRFSYCLTPFQDTNS-TSTLLLGPSAALNGT 289

Query: 236 DATPLQFI--------DGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVT 286
                 F+          +YY+ L  IS+  + L I+P  F  + D  GG++IDSGT +T
Sbjct: 290 GVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTGGLIIDSGTTIT 349

Query: 287 WLVKEAYEALRDEV--MIRLEGEQMRSYSWPDKLCYH--GIMSSDLKGFPTVRFHFRGGA 342
            L   AY+ +R  V  ++        S S    LC+      S+     P++  HF  GA
Sbjct: 350 SLANAAYQQVRAAVKSLVTTLPTVDGSDSTGLDLCFALPAPTSAPPAVLPSMTLHFD-GA 408

Query: 343 KLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
            + L  DS         +C+A+     N      S  G   QQ  ++ YD+  +  +F  
Sbjct: 409 DMVLPADSYMISGS-GVWCLAMR----NQTDGAMSTFGNYQQQNMHILYDVREETLSFAP 463

Query: 403 MDCEVL 408
             C  L
Sbjct: 464 AKCSTL 469


>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
           [Brachypodium distachyon]
          Length = 540

 Score =  139 bits (349), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 112/357 (31%), Positives = 155/357 (43%), Gaps = 24/357 (6%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++  I IG P    +  +DTGS + W+ C PC DC  Q   +F P+ SSSYA VPCDS H
Sbjct: 196 YFSRIGIGSPARQLYMVLDTGSDVTWLQCAPCADCYAQSDPLFDPALSSSYATVPCDSPH 255

Query: 121 CRYFPYARC----SAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
           CR    + C    +     C+Y   Y  G  T    +TE LT      + +H  DV  GC
Sbjct: 256 CRALDASACHNNAANGNSSCVYEVAYGDGSYTVGDFATETLTLGGDGSAAVH--DVAIGC 313

Query: 177 GFSTNRNFKFSGIFGLGIGRS-SLVSQLNSS-FSYCIGSLHDPDYLHNKLILGDGAIIDE 234
           G      F  +       G   S  SQ++++ FSYC+     P     +    D + +  
Sbjct: 314 GHDNEGLFVGAAGLLALGGGPLSFPSQISATEFSYCLVDRDSPSASTLQFGASDSSTV-- 371

Query: 235 GDATPLQ---FIDGHYYITLEAISVDGRML-DINPNIFKRDDSG-GGVMIDSGTDVTWLV 289
               PL      +  YY+ L  ISV G  L DI P  F  D+ G GGV++DSGT VT L 
Sbjct: 372 --TAPLMRSPRSNTFYYVALNGISVGGETLSDIPPAAFAMDEQGSGGVIVDSGTAVTRLQ 429

Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL-EK 348
             AY ALRD  +   +     S       CY     S ++  P V   F GG +L L  K
Sbjct: 430 SSAYSALRDAFVRGTQALPRASGVSLFDTCYDLAGRSSVQ-VPAVSLRFEGGGELKLPAK 488

Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           + +        +C+A   A+        S++G + QQ   V +D  +    F    C
Sbjct: 489 NYLIPVDGAGTYCLAF--AATGGA---VSIVGNVQQQGIRVSFDTAKNTVGFSPNKC 540


>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
 gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
          Length = 444

 Score =  139 bits (349), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 105/375 (28%), Positives = 169/375 (45%), Gaps = 41/375 (10%)

Query: 57  SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPC 116
           S+  + + + IG P       +DTGS L+W  C PC  C  Q    F P+ SS+Y  + C
Sbjct: 88  SDGEYLMEMGIGTPARFYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPANSSTYRSLGC 147

Query: 117 DSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
            +  C    Y  C  Y+  C+Y   Y     T+  ++ E  TF  T+++ + +  + FGC
Sbjct: 148 SAPACNALYYPLC--YQKTCVYQYFYGDSASTAGVLANETFTF-GTNDTRVTLPRISFGC 204

Query: 177 G-FSTNRNFKFSGIFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNKLILGDGAIIDE 234
           G  +       SG+ G G G  SLVSQL S  FSYC+ S   P  + ++L  G  A ++ 
Sbjct: 205 GNLNAGSLANGSGMVGFGRGSLSLVSQLGSPRFSYCLTSFLSP--VRSRLYFGAYATLNS 262

Query: 235 GDATPLQ--------FIDGHYYITLEAISVDGRMLDINPNIFKRDDS--GGGVMIDSGTD 284
            +A+ +Q         +   Y++ +  ISV G  L I+P +   +D+   GG +IDSGT 
Sbjct: 263 TNASTVQSTPFIINPALPTMYFLNMTGISVGGNRLPIDPAVLAINDTDGTGGTIIDSGTT 322

Query: 285 VTWLVKEAYEALRDEVMIRL----------EGEQMRS-YSWPDKLCYHGIMSSDLKGFPT 333
           +T+L + AY A+R+  ++ L          E   + + + WP                P 
Sbjct: 323 ITYLAEPAYYAVREAFVLYLNSTLPLLDVTETSVLDTCFQWPPP-------PRQSVTLPQ 375

Query: 334 VRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
           +  HF G       ++ M   P     C+A+  +S      + S+IG    Q +NV YD+
Sbjct: 376 LVLHFDGADWELPLQNYMLVDPSTGGLCLAMATSS------DGSIIGSYQHQNFNVLYDL 429

Query: 394 GRKQKTFQRMDCEVL 408
                +F    C ++
Sbjct: 430 ENSLLSFVPAPCNLM 444


>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
 gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 472

 Score =  139 bits (349), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 112/389 (28%), Positives = 175/389 (44%), Gaps = 32/389 (8%)

Query: 41  RSHNTYQAQILPSNDISNSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQ 98
           R+  T  A+     D+ N   Y+  ++IG PP+P     DTGS L+W  C PC   C  Q
Sbjct: 95  RTSTTVSART--RKDLPNGGEYLMTLAIGTPPLPYAAVADTGSDLIWTQCAPCGTQCFEQ 152

Query: 99  LGTIFYPSRSSSYAYVPCDS--EHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQ 156
              ++ P+ S++++ +PC+S    C              C+Y Q Y  G  T+    +E 
Sbjct: 153 PAPLYNPASSTTFSVLPCNSSLSMCAGALAGAAPPPGCACMYYQTYGTG-WTAGVQGSET 211

Query: 157 LTFKNTDESTIHVQDVVFGCGFSTNRNFKFS-GIFGLGIGRSSLVSQLNSS-FSYCIGSL 214
            TF ++      V  V FGC  +++ ++  S G+ GLG G  SLVSQL +  FSYC+   
Sbjct: 212 FTFGSSAADQARVPGVAFGCSNASSSDWNGSAGLVGLGRGSLSLVSQLGAGRFSYCLTPF 271

Query: 215 HDPDYLHNKLILGDGAIIDEGDATPLQFI--------DGHYYITLEAISVDGRMLDINPN 266
            D +   + L+LG  A ++        F+          +YY+ L  IS+  + L I+P 
Sbjct: 272 QDTNS-TSTLLLGPSAALNGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPG 330

Query: 267 IFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK----LCYH 321
            F  + D  GG++IDSGT +T L   AY+ +R  V  +L    + +    D     LC+ 
Sbjct: 331 AFSLKPDGTGGLIIDSGTTITSLANAAYQQVRAAVKSQLV-TTLPTVDGSDSTGLDLCFA 389

Query: 322 --GIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLI 379
                S+     P++  HF  GA + L  DS         +C+A+     N      S  
Sbjct: 390 LPAPTSAPPAVLPSMTLHFD-GADMVLPADSYMISGS-GVWCLAMR----NQTDGAMSTF 443

Query: 380 GMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
           G   QQ  ++ YD+  +  +F    C  L
Sbjct: 444 GNYQQQNMHILYDVREETLSFAPAKCSTL 472


>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 385

 Score =  138 bits (348), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 118/386 (30%), Positives = 181/386 (46%), Gaps = 30/386 (7%)

Query: 37  QEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCS 96
           Q K+ S + +QA ++    + +  +++ +S+G PP   +  MDTGS +LW+ C PC  C 
Sbjct: 14  QTKVPSQD-FQAPVISGLSLGSGEYFIRVSVGTPPRGMYLVMDTGSDILWLQCAPCVSCY 72

Query: 97  PQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQ 156
            Q   +F P +SS+Y+ + C+S  C       C    ++C+Y   Y  G  ++   +T+ 
Sbjct: 73  HQCDEVFDPYKSSTYSTLGCNSRQCLNLDVGGC--VGNKCLYQVDYGDGSFSTGEFATDA 130

Query: 157 LTFKNTD-ESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNSS----FSYC 210
           ++  +T     + +  +  GCG      F   +G+ GLG G  S  +Q+NS     FSYC
Sbjct: 131 VSLNSTSGGGQVVLNKIPLGCGHDNEGYFVGAAGLLGLGKGPLSFPNQINSENGGRFSYC 190

Query: 211 IGSLHDPDYLHNKLILGDGAIIDEG-----DATPLQFIDGHYYITLEAISVDGRMLDINP 265
           +          + LI GD A+   G      A+ L+ +   YY+ +  ISV G +L I  
Sbjct: 191 LTGRDTDSTERSSLIFGDAAVPPAGVRFTPQASNLR-VSTFYYLKMTGISVGGSILTIPT 249

Query: 266 NIFKRDDSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIM 324
           + F+ D  G GGV+IDSGT VT L   AY +LR+          + +       CY+   
Sbjct: 250 SAFQLDSLGNGGVIIDSGTSVTRLQNAAYASLREAFRAGTSDLVLTTEFSLFDTCYN--- 306

Query: 325 SSDLKG--FPTVRFHFRGGAKLALEKDSMFYQP--RPDAFCMAVNPASINNRYVNFSLIG 380
            SDL     PTV  HF+GGA L L   S +  P      FC+A    +        S+IG
Sbjct: 307 LSDLSSVDVPTVTLHFQGGADLKLPA-SNYLVPVDNSSTFCLAFAGTT------GPSIIG 359

Query: 381 MMAQQFYNVGYDIGRKQKTFQRMDCE 406
            + QQ + V YD    Q  F    C+
Sbjct: 360 NIQQQGFRVIYDNLHNQVGFVPSQCD 385


>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
 gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
          Length = 445

 Score =  138 bits (348), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 118/405 (29%), Positives = 177/405 (43%), Gaps = 42/405 (10%)

Query: 36  LQEKIRSHNTYQAQILPSNDISNSL----------FYVNISIGQPPVPQFTAMDTGSSLL 85
           L+  +  HN  Q     SN  + S           + + ++IG PPV      DTGS L+
Sbjct: 51  LRRDMHRHNARQLAASSSNGTTVSAPTQISPTAGEYLMTLAIGTPPVSYQAIADTGSDLI 110

Query: 86  WVHCYPCRD-CSPQLGTIFYPSRSSSYAYVPCDS--EHCRYFPYARCSAYKHRCIYTQLY 142
           W  C PC   C  Q   ++ PS S+++A +PC+S    C              C+Y   Y
Sbjct: 111 WTQCAPCSSQCFQQPTPLYNPSSSTTFAVLPCNSSLSMCAAALAGTTPPPGCTCMYNMTY 170

Query: 143 LIGPETSVFVSTEQLTF-KNTDESTIHVQDVVFGC-----GFSTNRNFKFSGIFGLGIGR 196
             G  TSV+  +E  TF  +T  +   V  + FGC     GF+T+     SG+ GLG G 
Sbjct: 171 GSG-WTSVYQGSETFTFGSSTPANQTGVPGIAFGCSNASGGFNTS---SASGLVGLGRGS 226

Query: 197 SSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFI--------DGHY 247
            SLVSQL    FSYC+    D +     L+    ++ D G  +   F+          +Y
Sbjct: 227 LSLVSQLGVPKFSYCLTPYQDTNSTSTLLLGPSASLNDTGGVSSTPFVASPSDAPMSTYY 286

Query: 248 YITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEV--MIRL 304
           Y+ L  IS+    L I       + D  GG +IDSGT +T L   AY+ +R  V  ++ L
Sbjct: 287 YLNLTGISLGTTALSIPTTALSLKADGTGGFIIDSGTTITLLGNTAYQQVRAAVVSLVTL 346

Query: 305 EGEQMRSYSWPDKLCYHGIMSSDL-KGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMA 363
                 S +    LC+    S+      P++  HF  GA + L  DS +     + +C+A
Sbjct: 347 PTTDGGSAATGLDLCFELPSSTSAPPTMPSMTLHFD-GADMVLPADS-YMMLDSNLWCLA 404

Query: 364 VNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
           +     N      S++G   QQ  ++ YD+G++  TF    C  L
Sbjct: 405 MQ----NQTDGGVSILGNYQQQNMHILYDVGQETLTFAPAKCSTL 445


>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 391

 Score =  138 bits (348), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 113/379 (29%), Positives = 172/379 (45%), Gaps = 39/379 (10%)

Query: 54  NDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAY 113
           N +  + + V+++IG PP P    +DTGS L+W  C PC  C  Q    F PS SS+ + 
Sbjct: 28  NGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSL 87

Query: 114 VPCDSEHCRYFPYARCSAYK----HRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHV 169
             CDS  C+  P A C + K      C+YT  Y     T+ F+  ++ TF     S   V
Sbjct: 88  TSCDSTLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGAS---V 144

Query: 170 QDVVFGCGFSTNRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDP-------DY 219
             V FGCG   N  FK   +GI G G G  SL SQL   +FS+C  ++          D 
Sbjct: 145 PGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTTITGAIPSTVLLDL 204

Query: 220 LHNKLILGDGAIIDEGDATPL-QFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDS 273
             +    G GA+      TPL Q+         YY++L+ I+V    L +  + F   + 
Sbjct: 205 PADLFSNGQGAV----QTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALTNG 260

Query: 274 GGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPT 333
            GG +IDSGT +T L  + Y+ +RDE   +++   +   +     C+    S      P 
Sbjct: 261 TGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHYTCFSA-PSQAKPDVPK 319

Query: 334 VRFHFRGGAKLALEKDSMFYQPRPDA----FCMAVNPASINNRYVNFSLIGMMAQQFYNV 389
           +  HF  GA + L +++  ++   DA     C+A+      N+    ++IG   QQ  +V
Sbjct: 320 LVLHFE-GATMDLPRENYVFEVPDDAGNSIICLAI------NKGDETTIIGNFQQQNMHV 372

Query: 390 GYDIGRKQKTFQRMDCEVL 408
            YD+     +F    C+ L
Sbjct: 373 LYDLQNNMLSFVAAQCDKL 391


>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
 gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
          Length = 449

 Score =  138 bits (348), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 110/376 (29%), Positives = 177/376 (47%), Gaps = 41/376 (10%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           +++++ +G PP      +DTGS L W+ C PC+ C  Q G +F PS+S+S+  +PC++  
Sbjct: 87  YFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPCKACFDQSGPVFDPSQSTSFKIIPCNAAA 146

Query: 121 CRYFPYARCSAYKHR-----CIYTQLYLIGPETSVFVSTEQLTFKNTDE-STIHVQDVVF 174
           C    +  C     +     C Y   Y     TS  ++ E L+   +D  S++ ++D+V 
Sbjct: 147 CDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLEIRDMVI 206

Query: 175 GCGFSTN-RNFKFSGIFGLGIGRSSLVSQLNS-----SFSYCIGSLHDPDYLHNKLILGD 228
           GCG S         G+ GLG G  S  SQL S     SFSYC+    +   + + +  G 
Sbjct: 207 GCGHSNKGLFQGAGGLLGLGQGALSFPSQLRSSPIGQSFSYCLVDRTNNLSVSSAISFGA 266

Query: 229 GAII----DEGDATPL----QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMI 279
           G  +    D+   TP       ++  YY+ ++ I +D  +L I    F    +G GG +I
Sbjct: 267 GFALSRHFDQMKFTPFVRTNNSVETFYYLGIQGIKIDQELLPIPAERFAIATNGSGGTII 326

Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK-----LCYHGIMSSDLKGFPTV 334
           DSGT +T+L ++AY A+    + R+      SY   D      +CY+    + +  FP +
Sbjct: 327 DSGTTLTYLNRDAYRAVESAFLARI------SYPRADPFDILGICYNATGRAAVP-FPAL 379

Query: 335 RFHFRGGAKLALEKDSMFYQPRPD--AFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
              F+ GA+L L +++ F QP P     C+A+ P          S+IG   QQ  +  YD
Sbjct: 380 SIVFQNGAELDLPQENYFIQPDPQEAKHCLAILPTD------GMSIIGNFQQQNIHFLYD 433

Query: 393 IGRKQKTFQRMDCEVL 408
           +   +  F   DC  L
Sbjct: 434 VQHARLGFANTDCSAL 449


>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
          Length = 455

 Score =  138 bits (347), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 121/379 (31%), Positives = 175/379 (46%), Gaps = 45/379 (11%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLGTIFYPSRSSSYAYVPCDS 118
           + +NIS+G PP+     +DTGS+L+W  C PC  C   P    +  P+RSS+++ +PC+ 
Sbjct: 91  YNMNISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQPARSSTFSRLPCNG 150

Query: 119 EHCRYFPYA---RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
             C+Y P +   R       C Y   Y  G  T+ +++TE LT  +          V FG
Sbjct: 151 SFCQYLPTSSRPRTCNATAACAYNYTYGSG-YTAGYLATETLTVGDGT-----FPKVAFG 204

Query: 176 CGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
           C  + N     SGI GLG G  SLVSQL    FSYC+ S    D   + ++ G  A + E
Sbjct: 205 CS-TENGVDNSSGIVGLGRGPLSLVSQLAVGRFSYCLRS-DMADGGASPILFGSLAKLTE 262

Query: 235 GD---ATPL---QFI--DGHYYITLEAISVDGRMLDINPNI--FKRDDSGGGVMIDSGTD 284
           G    +TPL    ++    HYY+ L  I+VD   L +  +   F +   GGG ++DSGT 
Sbjct: 263 GSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLGGGTIVDSGTT 322

Query: 285 VTWLVKEAYEALRDEV---MIRLEGEQMRSYSWPD-KLCYHGIMSSDLKG--FPTVRFHF 338
           +T+L K+ Y  ++      M  L      S +  D  LCY        K    P +   F
Sbjct: 323 LTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGGGGKAVRVPRLALRF 382

Query: 339 RGGAK---------LALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNV 389
            GGAK           +E DS   Q R    C+ V PA+ +   +  S+IG + Q   ++
Sbjct: 383 AGGAKYNVPVQNYFAGVEADS---QGRVTVACLLVLPATDD---LPISIIGNLMQMDMHL 436

Query: 390 GYDIGRKQKTFQRMDCEVL 408
            YDI     +F   DC  L
Sbjct: 437 LYDIDGGMFSFAPADCAKL 455


>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
          Length = 440

 Score =  138 bits (347), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 122/426 (28%), Positives = 193/426 (45%), Gaps = 39/426 (9%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDI--SN 58
           LIH DS  SP+ N +   +  ++     SI+R   L   +        +  P   I  +N
Sbjct: 34  LIHHDSPPSPFYNSSMTRSQLIRNAAMRSISRANQLSLSLSHSLNQLKESSPEPIIIPNN 93

Query: 59  SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD--CSPQLGTIFYPSRSSSYAYVPC 116
             + + I IG P V +    DTGS L WV C PC +  C  Q   ++ P  SS++  +PC
Sbjct: 94  GNYLMRIYIGTPSVERLAIADTGSDLTWVQCSPCDNTKCFAQNTPLYDPLNSSTFTLLPC 153

Query: 117 DSEHCRYFPYAR--CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
           DS+ C   PY++  CS Y   CIY   Y  G  +  +      + +       +   + F
Sbjct: 154 DSQPCTQLPYSQYVCSDYGD-CIYAYTY--GDNSYSYGGLSSDSIRLMLLQLHYNSKICF 210

Query: 175 GCG----FSTNRNFKFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLIL 226
           GCG    F+ +++ K +GI GLG G  SLVSQL       FSYC+  L      ++KL  
Sbjct: 211 GCGFQNKFTADKSGKTTGIVGLGAGPLSLVSQLGDEIGHKFSYCL--LPFSSNSNSKLKF 268

Query: 227 GDGAIIDEGD---ATPLQFIDG--HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDS 281
           G+ AI+ +G+   +TPL        YY+ LE I+V  + +       K   + G ++IDS
Sbjct: 269 GEAAIV-QGNGVVSTPLIIKPDLPFYYLNLEGITVGAKTV-------KTGQTDGNIIIDS 320

Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
           G+ +T+L +  Y      V   +  E+ +   +P   C+       +   P V FHF GG
Sbjct: 321 GSTLTYLEESFYNEFVSLVKETVAVEEDQYIPYPFDFCF--TYKEGMSTPPDVVFHFTGG 378

Query: 342 AKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQ 401
             + L+  +       +  C  V P+     +   ++ G + Q  ++VGYDI   + +F 
Sbjct: 379 -DVVLKPMNTLVLIEDNLICSTVVPS----HFDGIAIFGNLGQIDFHVGYDIQGGKVSFA 433

Query: 402 RMDCEV 407
             DC +
Sbjct: 434 PTDCSL 439


>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
 gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
          Length = 439

 Score =  138 bits (347), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 128/406 (31%), Positives = 190/406 (46%), Gaps = 36/406 (8%)

Query: 16  ENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQF 75
           E   H V+RG +  + R   +     S++   A +LP N      F + ++IG PP    
Sbjct: 57  ERIQHGVKRGRH-RLQRFKAMALVASSNSEIDAPVLPGN----GEFLMKLAIGTPPETYS 111

Query: 76  TAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHR 135
             MDTGS L+W  C PC  C  Q   IF P +SSS++ + C S+ C   P + CS     
Sbjct: 112 AIMDTGSDLIWTQCKPCTQCFDQPTPIFDPKKSSSFSKLSCSSKLCEALPQSTCS---DG 168

Query: 136 CIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF---SGIFGL 192
           C Y   Y     T   +++E LTF       + V +V FGCG   N    F   SG+ GL
Sbjct: 169 CEYLYGYGDYSSTQGMLASETLTFGK-----VSVPEVAFGCG-EDNEGSGFSQGSGLVGL 222

Query: 193 GIGRSSLVSQLNS-SFSYCIGSLHDPDYLHNKLILGDGAIIDEGDA----TPL---QFID 244
           G G  SLVSQL    FSYC+ S+ D     + L++G  A +   D+    TPL       
Sbjct: 223 GRGPLSLVSQLKEPKFSYCLTSVDDTK--ASTLLMGSLASVKASDSEIKTTPLIQNSAQP 280

Query: 245 GHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIR 303
             YY++LE ISV    L I  + F  ++D  GG++IDSGT +T+L + A++ +  E   +
Sbjct: 281 SFYYLSLEGISVGDTSLPIKKSTFSLQEDGSGGLIIDSGTTITYLEQSAFDLVAKEFTSQ 340

Query: 304 LEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS-MFYQPRPDAFCM 362
           +      S S   ++C+     S     P + FHF  GA L L  ++ M         C+
Sbjct: 341 INLPVDNSGSTGLEVCFTLPSGSTDIEVPKLVFHFD-GADLELPAENYMIADASMGVACL 399

Query: 363 AVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
           A+  +S        S+ G + QQ   V +D+ ++  +F    C+ L
Sbjct: 400 AMGSSS------GMSIFGNIQQQNMLVLHDLEKETLSFLPTQCDEL 439


>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
 gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
          Length = 449

 Score =  137 bits (346), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 116/405 (28%), Positives = 186/405 (45%), Gaps = 46/405 (11%)

Query: 31  ARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC- 89
           ARLA      R      A  +P   +S+    + + IG PP P+   +DTGS L+W  C 
Sbjct: 60  ARLA------RVLGNLSAADVPVAPLSDQGHSLTVGIGTPPQPRTLIVDTGSDLIWTQCS 113

Query: 90  ------YPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR--YFPYARCSAYKHRCIYTQL 141
                       S Q   ++ P RSSS+AY+PC    C+   F Y  C A  +RC+Y +L
Sbjct: 114 MLSRRTRTAASASRQREPLYEPRRSSSFAYLPCSDRLCQEGQFSYKNC-ARNNRCMYDEL 172

Query: 142 YLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG-FSTNRNFKFSGIFGLGIGRSSLV 200
           Y    E    +++E  TF    + ++ +    FGCG  S       SG+ GL  G  SLV
Sbjct: 173 Y-GSAEAGGVLASETFTFGVNAKVSLPLG---FGCGALSAGDLVGASGLMGLSPGIMSLV 228

Query: 201 SQLN-SSFSYCIGSLHDPDYLHNKLILGDGAII----DEGDATPLQFI------DGHYYI 249
           SQL+   FSYC+    +     + L+ G  A +      G       +        +YY+
Sbjct: 229 SQLSVPRFSYCLTPFAERK--TSPLLFGAMADLRRYRTTGTVQTTSILRNPAMETAYYYV 286

Query: 250 TLEAISVDGRMLDINPNIFK--RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVM--IRLE 305
            L  +S+  + LD+        + D  GG ++DSG+ +++L + A+ A++  V+  +RL 
Sbjct: 287 PLVGLSLGTKRLDVPATSLGMIKPDGSGGTIVDSGSTMSYLEETAFRAVKKAVVEAVRLP 346

Query: 306 GEQMRSYSWPD-KLCYH---GIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFC 361
                   + D +LC+    G+    +K  P V  HF GGA + L +D+ F +PR    C
Sbjct: 347 VANGTDEDYDDYELCFALPTGVAMEAVKTPPLV-LHFDGGAAMTLPRDNYFQEPRAGLMC 405

Query: 362 MAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
           +AV  +         S+IG + QQ  +V +D+  ++ +F    C+
Sbjct: 406 LAVGTSPDG---FGVSIIGNVQQQNMHVLFDVRNQKFSFAPTKCD 447


>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
          Length = 506

 Score =  137 bits (346), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 113/353 (32%), Positives = 151/353 (42%), Gaps = 20/353 (5%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++  + IG P    +  +DTGS + WV C PC DC  Q   +F PS S+SYA V CDS+ 
Sbjct: 166 YFSRVGIGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSQR 225

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           CR    A C      C+Y   Y  G  T    +TE LT  +    +  V +V  GCG   
Sbjct: 226 CRDLDTAACRNATGACLYEVAYGDGSYTVGDFATETLTLGD----STPVGNVAIGCGHDN 281

Query: 181 NRNFKFSGIFGLGIGRS-SLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
              F  +       G   S  SQ++ S+FSYC+     P    + L  GDGA        
Sbjct: 282 EGLFVGAAGLLALGGGPLSFPSQISASTFSYCLVDRDSP--AASTLQFGDGAAEAGTVTA 339

Query: 239 PL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDS--GGGVMIDSGTDVTWLVKEAY 293
           PL         YY+ L  ISV G+ L I  + F  D +   GGV++DSGT VT L   AY
Sbjct: 340 PLVRSPRTSTFYYVALSGISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQSAAY 399

Query: 294 EALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL-EKDSMF 352
            ALRD  +         S       CY     + ++  P V   F GG  L L  K+ + 
Sbjct: 400 AALRDAFVQGAPSLPRTSGVSLFDTCYDLSDRTSVE-VPAVSLRFEGGGALRLPAKNYLI 458

Query: 353 YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
                  +C+A  P +        S+IG + QQ   V +D  R    F    C
Sbjct: 459 PVDGAGTYCLAFAPTN-----AAVSIIGNVQQQGTRVSFDTARGAVGFTPNKC 506


>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
          Length = 502

 Score =  137 bits (346), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 117/346 (33%), Positives = 171/346 (49%), Gaps = 30/346 (8%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++  I +G P    +  +DTGS + W+ C PC +C  Q   IF P+ SS++  + C    
Sbjct: 164 YFSRIGVGTPAKEMYVVLDTGSDVNWIQCLPCSECYQQSDPIFDPTSSSTFKSLTCSDPK 223

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C     + C +  ++C+Y   Y  G  T    +T+ +TF  + +    V DV  GCG   
Sbjct: 224 CASLDVSACRS--NKCLYQVSYGDGSFTVGNYATDTVTFGESGK----VNDVALGCGHDN 277

Query: 181 NRNFK-FSGIFGLGIGRSSLVSQLNS-SFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
              F   +G+ GLG G  S+ +Q+ + SFSYC   L D D   +  +  +   I  GDAT
Sbjct: 278 EGLFTGAAGLLGLGGGALSMTNQIKAKSFSYC---LVDRDSAKSSSLDFNSVQIGAGDAT 334

Query: 239 -PL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAY 293
            PL     +D  YY+ L   SV G+ + I  ++F+ D SG GGV++D GT VT L  +AY
Sbjct: 335 APLLRNSKMDTFYYVGLSGFSVGGQQVSIPSSLFEVDASGAGGVILDCGTAVTRLQTQAY 394

Query: 294 EALRDEVMIRLEGEQMRSYSWPDKL---CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS 350
            +LRD   ++L  +  +  S P  L   CY     S +K  PTV FHF GG  L L   +
Sbjct: 395 NSLRD-AFVKLTTDFKKGTS-PISLFDTCYDFSSLSTVK-VPTVTFHFTGGKSLNLPAKN 451

Query: 351 MFYQPRPDA--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
            +  P  DA  FC A  P S      + S+IG + QQ   + YD+ 
Sbjct: 452 -YLIPIDDAGTFCFAFAPTS-----SSLSIIGNVQQQGTRITYDLA 491


>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
          Length = 500

 Score =  137 bits (345), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 111/366 (30%), Positives = 158/366 (43%), Gaps = 21/366 (5%)

Query: 47  QAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPS 106
           Q  ++    + +  ++  + +G P    +  +DTGS + WV C PC DC  Q   +F PS
Sbjct: 149 QGPVVSGVGLGSGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPS 208

Query: 107 RSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDEST 166
            S+SYA V CD+  C     A C      C+Y   Y  G  T    +TE LT  +    +
Sbjct: 209 LSTSYASVACDNPRCHDLDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTLGD----S 264

Query: 167 IHVQDVVFGCGFSTNRNFKFSGIFGLGIGRS-SLVSQLN-SSFSYCIGSLHDPDYLHNKL 224
             V  V  GCG      F  +       G   S  SQ++ ++FSYC+     P    + L
Sbjct: 265 APVSSVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISATTFSYCLVDRDSPS--SSTL 322

Query: 225 ILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMID 280
             GD A  D     PL         YY+ L  ISV G++L I P+ F  D +G GGV++D
Sbjct: 323 QFGDAA--DAEVTAPLIRSPRTSTFYYVGLSGISVGGQILSIPPSAFAMDGTGAGGVIVD 380

Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRG 340
           SGT VT L   AY ALRD  +   +     S       CY     + ++  P V   F G
Sbjct: 381 SGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVE-VPAVSLRFAG 439

Query: 341 GAKLAL-EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKT 399
           G +L L  K+ +        +C+A  P +        S+IG + QQ   V +D  +    
Sbjct: 440 GGELRLPAKNYLIPVDGAGTYCLAFAPTN-----AAVSIIGNVQQQGTRVSFDTAKSTVG 494

Query: 400 FQRMDC 405
           F    C
Sbjct: 495 FTSNKC 500


>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 484

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 123/428 (28%), Positives = 182/428 (42%), Gaps = 42/428 (9%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNT--------YQAQILP 52
           ++HR    SP       P H     +N   AR+  +  KI +  +         +   LP
Sbjct: 77  VVHRQGPCSPLQARGAPPPH--AELLNDDQARVDSIHRKIAAAASPVLDQARGKKGVTLP 134

Query: 53  SN---DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSS 109
           +     +    + V++ +G P        DTGS L WV C PC DC  Q   +F P+RSS
Sbjct: 135 AQRGISLGTGNYVVSMGLGTPARDMTVVFDTGSDLSWVQCTPCSDCYEQKDPLFDPARSS 194

Query: 110 SYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHV 169
           +Y+ VPC S  C+      CS  K +C Y  +Y    +T   ++ + LT   +D     +
Sbjct: 195 TYSAVPCASPECQGLDSRSCSRDK-KCRYEVVYGDQSQTDGALARDTLTLTQSDV----L 249

Query: 170 QDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKL 224
              VFGCG      F +  G+ GLG  + SL SQ  S     FSYC+ S          L
Sbjct: 250 PGFVFGCGEQDTGLFGRADGLVGLGREKVSLSSQAASKYGAGFSYCLPSSPS---AAGYL 306

Query: 225 ILGDGAIIDEGDATPLQF---IDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDS 281
            LG G        T ++        YY+ L  + V GR + ++P +F    S  G +IDS
Sbjct: 307 SLG-GPAPANARFTAMETRHDSPSFYYVRLVGVKVAGRTVRVSPIVF----SAAGTVIDS 361

Query: 282 GTDVTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHF 338
           GT +T L    Y ALR      M R   ++  + S  D  CY     + ++  P+V   F
Sbjct: 362 GTVITRLPPRVYAALRSAFARSMGRYGYKRAPALSILDT-CYDFTGHTTVR-IPSVALVF 419

Query: 339 RGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQK 398
            GGA + L+   + Y  +    C+A  P   N    +  +IG   Q+   V YD+ R++ 
Sbjct: 420 AGGAAVGLDFSGVLYVAKVSQACLAFAP---NGDGADAGIIGNTQQKTLAVVYDVARQKI 476

Query: 399 TFQRMDCE 406
            F    C 
Sbjct: 477 GFGANGCS 484


>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 494

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 126/428 (29%), Positives = 184/428 (42%), Gaps = 36/428 (8%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRG---INISIARLAYLQEKIRSHNTYQAQILPSNDIS 57
           ++HRD  +       E   HR+QR         A         R+ +   A ++      
Sbjct: 80  VVHRDDFVV-NATAAELLGHRLQRDGKRAARISAAAGAANGTRRTGSGVVAPVVSGLAQG 138

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
           +  ++  I +G P  P    +DTGS ++W+ C PCR C  Q G +F P RS SY  V C 
Sbjct: 139 SGEYFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYDQSGQVFDPRRSRSYGAVGCS 198

Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
           +  CR      C   +  C+Y   Y  G  T+   +TE LTF         V  +  GCG
Sbjct: 199 APLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFAG----GARVARIALGCG 254

Query: 178 FSTNRNFKFSGIFGLGIGRSSL------VSQLNSSFSYCI---GSLHDPDYLHNKLILGD 228
              N     +    LG+GR SL        +   SFSYC+    S  +P    + +  G 
Sbjct: 255 HD-NEGLFVAAAGLLGLGRGSLSFPAQISRRYGRSFSYCLVDRTSSANPASHSSTVTFGS 313

Query: 229 GAIIDEGDA--TPL---QFIDGHYYITLEAISVDG-RMLDINPNIFKRDDSG--GGVMID 280
           GA+     A  TP+     ++  YY+ L  ISV G R+  +  +  + D S   GGV++D
Sbjct: 314 GAVGSTVAASFTPMVKNPRMETFYYVQLVGISVGGARVSGVADSDLRLDPSSGRGGVIVD 373

Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMR--SYSWPDKLCYHGIMSSDLKGFPTVRFHF 338
           SGT VT L + AY ALRD       G ++    +S  D  CY  +    +   PTV  HF
Sbjct: 374 SGTSVTRLARPAYSALRDAFRAAAAGLRLSPGGFSLFDT-CYD-LSGRKVVKVPTVSMHF 431

Query: 339 RGGAKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
            GGA+ AL  ++ +        FC A   A  +      S+IG + QQ + V +D   ++
Sbjct: 432 AGGAEAALPPENYLIPVDSKGTFCFAF--AGTDG---GVSIIGNIQQQGFRVVFDGDGQR 486

Query: 398 KTFQRMDC 405
             F    C
Sbjct: 487 VGFVPKGC 494


>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
          Length = 498

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 123/442 (27%), Positives = 189/442 (42%), Gaps = 58/442 (13%)

Query: 1   LIHRDSIL-SPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNT-------------- 45
           ++HRD++L     N   +   R++  +     R+  L+ +I    T              
Sbjct: 78  VVHRDALLLKNAANATASYERRLKEKLRREAVRVRGLERQIERTLTLNKDPVNRYENVAE 137

Query: 46  ----YQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGT 101
               +  +++   +  +  ++  I +G P   Q+  +DTGS + W+ C PCR+C  Q   
Sbjct: 138 VDADFGGEVVSGMEQGSGEYFTRIGVGTPTREQYMVLDTGSDVAWIQCEPCRECYSQADP 197

Query: 102 IFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKN 161
           IF PS S+S++ V CDS  C       C  +   C+Y   Y  G  ++   +TE LTF  
Sbjct: 198 IFNPSYSASFSTVGCDSAVCSQLDAYDC--HSGGCLYEASYGDGSYSTGSFATETLTFGT 255

Query: 162 TDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGR-----SSLVSQLNSSFSYCIGSLHD 216
           T      V +V  GCG      F  +              + + +Q   +FSYC+     
Sbjct: 256 TS-----VANVAIGCGHKNVGLFIGAAGLLGLGAGALSFPNQIGTQTGHTFSYCLVDRES 310

Query: 217 PDYLHNKLILGDGAIIDEGDATPLQ---FIDGHYYITLEAISVDGRMLD-INPNIFKRDD 272
                  L  G  ++      TPL+    +   YY+++ AISV G +LD I P +F+ D+
Sbjct: 311 DS--SGPLQFGPKSVPVGSIFTPLEKNPHLPTFYYLSVTAISVGGALLDSIPPEVFRIDE 368

Query: 273 SG--GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD-KLCYHGIMSSDLK 329
           +   GG +IDSGT VT LV  AY+A+RD   +   G+  R+ +      CY      DL 
Sbjct: 369 TSGHGGFIIDSGTVVTRLVTSAYDAVRD-AFVAGTGQLPRTDAVSIFDTCY------DLS 421

Query: 330 GF-----PTVRFHFRGGAKLAL-EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMA 383
           G      PTV FHF  GA L L  K+ +        FC A  PA+      + S++G   
Sbjct: 422 GLQFVSVPTVGFHFSNGASLILPAKNYLIPMDTVGTFCFAFAPAA-----SSVSIMGNTQ 476

Query: 384 QQFYNVGYDIGRKQKTFQRMDC 405
           QQ   V +D       F    C
Sbjct: 477 QQHIRVSFDSANSLVGFAFDQC 498


>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
 gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
          Length = 398

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 107/364 (29%), Positives = 161/364 (44%), Gaps = 26/364 (7%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           +   IS+G P        DTGS L+W+ C PC+ C  Q   IF P  SSSY  + C    
Sbjct: 40  YVTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKDPIFDPEGSSSYTTMSCGDTL 99

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C   P   CS     C Y+  Y  G  T   +S+E +T  +T    +  +++ FGCG   
Sbjct: 100 CDSLPRKSCSP---NCDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAAKNIAFGCGHLN 156

Query: 181 NRNFK-FSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
             +F   SG+ GLG G  S VSQL       FSYC+    D     + +  GD +     
Sbjct: 157 RGSFNDASGLVGLGRGNLSFVSQLGDLFGHKFSYCLVPWRDAPSKTSPMFFGDESSSHSS 216

Query: 236 DA------TPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDV 285
                   TP+     ++  YY+ L+ IS+ GR L I    F  + D  GG++ DSGT +
Sbjct: 217 GKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDIKPDGSGGMIFDSGTTL 276

Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCY--HGIMSSDLKGFPTVRFHFRGGA- 342
           T L    Y+ +   +  ++   ++   S    LCY   G  +S  K  P + FHF G   
Sbjct: 277 TLLPDAPYQIVLRALRSKVSFPEIDGSSAGLDLCYDVSGSKASYKKKIPAMVFHFEGADH 336

Query: 343 KLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
           +L +E   +         C+A+  +++     +  + G M QQ + V YDIG  +  +  
Sbjct: 337 QLPVENYFIAANDAGTIVCLAMVSSNM-----DIGIYGNMMQQNFRVMYDIGSSKIGWAP 391

Query: 403 MDCE 406
             C+
Sbjct: 392 SQCD 395


>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
 gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
          Length = 774

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 114/382 (29%), Positives = 178/382 (46%), Gaps = 40/382 (10%)

Query: 53  SNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYA 112
           +N + ++ + V+++IG PP P    +DTGS L+W  C PC  C  +      PS SS++ 
Sbjct: 407 ANGVPDTEYLVHLAIGTPPQPVQLILDTGSDLVWTQCRPCPVCFSRALGPLDPSNSSTFD 466

Query: 113 YVPCDSEHCRYFPYARCSAY---KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES-TIH 168
            +PC S  C    ++ C  +      C+Y   Y  G  T+  +  E  TF   D +    
Sbjct: 467 VLPCSSPVCDNLTWSSCGKHNWGNQTCVYVYAYADGSITTGHLDAETFTFAAADGTGQAT 526

Query: 169 VQDVVFGCGFSTNRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLI 225
           V D+ FGCG   N  F    +GI G G G  SL SQL   +FS+C  ++   +   + ++
Sbjct: 527 VPDLAFGCGLFNNGIFTSNETGIAGFGRGALSLPSQLKVDNFSHCFTAITGSE--PSSVL 584

Query: 226 LG---------DGAIIDEGDATPL-QFIDG--HYYITLEAISVDGRMLDINPNIFK-RDD 272
           LG         DGA+     +TPL Q       YY++L+ I+V    L I  + F  + D
Sbjct: 585 LGLPANLYSDADGAV----QSTPLVQNFSSLRAYYLSLKGITVGSTRLPIPESTFALKQD 640

Query: 273 SGGGVMIDSGTDVTWLVKEAYEALRDE--VMIRLEGEQMRSYSWPDKLCYHGIMSSDLK- 329
             GG +IDSGT +T L ++AY+ + D     +RL  +   S S   +LC+   +    K 
Sbjct: 641 GTGGTIIDSGTGMTTLPQDAYKLVHDAFTAQVRLPVDNATSSSL-SRLCFSFSVPRRAKP 699

Query: 330 GFPTVRFHFRGGAKLALEKDSMFYQ---PRPDAFCMAVNPASINNRYVNFSLIGMMAQQF 386
             P +  HF  GA L L +++  ++         C+A+N         + ++IG   QQ 
Sbjct: 700 DVPKLVLHFE-GATLDLPRENYMFEFEDAGGSVTCLAINAGD------DLTIIGNYQQQN 752

Query: 387 YNVGYDIGRKQKTFQRMDCEVL 408
            +V YD+ R   +F    C  L
Sbjct: 753 LHVLYDLVRNMLSFVPAQCNRL 774


>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 355

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 102/356 (28%), Positives = 157/356 (44%), Gaps = 35/356 (9%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           +   + +G P       +DTGS L WV C PC  C  Q  ++F P+ S+S+  + C +E 
Sbjct: 3   YLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGTCYSQNDSLFIPNTSTSFTKLACGTEL 62

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C   PY  C+  +  C+Y   Y  G  ++     + +T    +     V +  FGCG   
Sbjct: 63  CNGLPYPMCN--QTTCVYWYSYGDGSLSTGDFVYDTITMDGINGQKQQVPNFAFGCGHDN 120

Query: 181 NRNFKFS-GIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
             +F  + GI GLG G  S  SQL    N  FSYC+     P    + L+ GD A+    
Sbjct: 121 EGSFAGADGILGLGQGPLSFPSQLKTVFNGKFSYCLVDWLAPPTQTSPLLFGDAAVPTFP 180

Query: 236 DATPLQFIDG-----HYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLV 289
               +  +       +YY+ L  ISV G++L+I+   F  D  G  G + DSGT VT L 
Sbjct: 181 GVKYISLLTNPKVPTYYYVKLNGISVGGKLLNISSTAFDIDSVGRAGTIFDSGTTVTQLA 240

Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPDK--------LCYHGIMSSDLKGFPTVRFHFRGG 341
            E ++    EV+  +    M    +P K        LC  G     L   P++ FHF GG
Sbjct: 241 GEVHQ----EVLAAMNASTM---DYPRKSDDSSGLDLCLGGFAEGQLPTVPSMTFHFEGG 293

Query: 342 AKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD-IGRK 396
                  +   +     ++C ++  +       + ++IG + QQ + V YD +GRK
Sbjct: 294 DMELPPSNYFIFLESSQSYCFSMVSSP------DVTIIGSIQQQNFQVYYDTVGRK 343


>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 429

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 122/423 (28%), Positives = 196/423 (46%), Gaps = 40/423 (9%)

Query: 1   LIHRDSILSPY-----NNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSND 55
           LI+R+   SP        P+E     V+RG      R A L + + + +      + S  
Sbjct: 32  LIYREHQSSPLRSETLKTPSEIFIAAVKRGHE----RRARLAKHVLAGDQLFETPVASG- 86

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
             N  + ++IS G PP      +DTGS L WV C PC+ C   L   F PS+S+SY  + 
Sbjct: 87  --NGEYLIDISYGNPPQKSTAIVDTGSDLNWVQCLPCKSCYETLSAKFDPSKSASYKTLG 144

Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
           C S  C+  P+  C+A    C Y  +Y  G  TS  +ST+ +T       T  + +V FG
Sbjct: 145 CGSNFCQDLPFQSCAA---SCQYDYMYGDGSSTSGALSTDDVTI-----GTGKIPNVAFG 196

Query: 176 CGFSTNRNFKFSGIFGLGIGRS-SLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGA 230
           CG S    F  +G          SLVSQL  +    FSYC+  L       + L +GD  
Sbjct: 197 CGNSNLGTFAGAGGLVGLGKGPLSLVSQLGGTATKKFSYCLVPLGSTK--TSPLYIGDST 254

Query: 231 IIDEGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVT 286
           +      TP+   + +   YY  L+ ISV+G+ ++   N F    +G GG+++DSGT +T
Sbjct: 255 LAGGVAYTPMLTNNNYPTFYYAELQGISVEGKAVNYPANTFDIAATGRGGLILDSGTTLT 314

Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
           +L  +A+  +   +   L   +     +  + C+     ++   +PTV FHF  GA +AL
Sbjct: 315 YLDVDAFNPMVAALKAALPYPEADGSFYGLEYCFSTAGVAN-PTYPTVVFHFN-GADVAL 372

Query: 347 EKDSMFYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
             D+ F     +   C+A+  ++       FS+ G + Q  + + +D+  K+  F+  +C
Sbjct: 373 APDNTFIALDFEGTTCLAMASST------GFSIFGNIQQLNHVIVHDLVNKRIGFKSANC 426

Query: 406 EVL 408
           E +
Sbjct: 427 ETI 429


>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
          Length = 499

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 118/419 (28%), Positives = 191/419 (45%), Gaps = 41/419 (9%)

Query: 2   IHRDSILSPYNNPNENPAHRVQRGI-NISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           +HRD++   +N+       R+Q  + +IS + L  L+ +I+  +     +       +  
Sbjct: 108 LHRDTVR--FNSLTA----RLQLALEDISKSDLKPLETEIKPED-LSTPVTSGTSQGSGE 160

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++  + +G P    +  +DTGS + W+ C PC DC  Q   IF P+ SS+YA V C S+ 
Sbjct: 161 YFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPTASSTYAPVTCQSQQ 220

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C     + C +   +C+Y   Y  G  T    +TE ++F N+      V++V  GCG   
Sbjct: 221 CSSLEMSSCRS--GQCLYQVNYGDGSYTFGDFATESVSFGNSGS----VKNVALGCGHDN 274

Query: 181 NRNFKFSGIFGLGIGRS-SLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
              F  +       G   SL +QL  +SFSYC+  ++      + L      +  +    
Sbjct: 275 EGLFVGAAGLLGLGGGPLSLTNQLKATSFSYCL--VNRDSAGSSTLDFNSAQLGVDSVTA 332

Query: 239 PL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYE 294
           PL   + ID  YY+ L  +SV G+M+ I  + F+ D+SG GG+++D GT +T L  +AY 
Sbjct: 333 PLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIVDCGTAITRLQTQAYN 392

Query: 295 ALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG-----FPTVRFHFRGGAKLALEKD 349
            LRD  +   +  ++ S       CY      DL G      PTV FHF  G    L   
Sbjct: 393 PLRDAFVRMTQNLKLTSAVALFDTCY------DLSGQASVRVPTVSFHFADGKSWNLPA- 445

Query: 350 SMFYQPRPDA--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
           + +  P   A  +C A  P +      + S+IG + QQ   V +D+   +  F    C+
Sbjct: 446 ANYLIPVDSAGTYCFAFAPTT-----SSLSIIGNVQQQGTRVTFDLANNRMGFSPNKCQ 499


>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 396

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 103/360 (28%), Positives = 162/360 (45%), Gaps = 36/360 (10%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
           NS++ + + +G PP      +DTGS + W  C PC  C  Q   IF PS+SS++    CD
Sbjct: 62  NSVYLMKLQVGTPPFEIQAIIDTGSEITWTQCLPCVHCYEQNAPIFDPSKSSTFKEKRCD 121

Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
              C Y        + H       Y +G      ++TE +T  +T      + + + GCG
Sbjct: 122 GHSCPY----EVDYFDHT------YTMGT-----LATETITLHSTSGEPFVMPETIIGCG 166

Query: 178 FSTNRNFK--FSGIFGLGIGRSSLVSQLNSSF----SYCIG--SLHDPDYLHNKLILGDG 229
            + N  FK  FSG+ GL  G SSL++Q+   +    SYC         ++  N ++ GDG
Sbjct: 167 HN-NSWFKPSFSGMVGLNWGPSSLITQMGGEYPGLMSYCFSGQGTSKINFGANAIVAGDG 225

Query: 230 AIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLV 289
            +      T  +   G YY+ L+A+SV    ++     F   +  G ++IDSGT +T+  
Sbjct: 226 VVSTTMFMTTAK--PGFYYLNLDAVSVGNTRIETMGTTFHALE--GNIVIDSGTTLTYFP 281

Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKD 349
                 +R  V   +   +    +  D LCY+   S  +  FP +  HF GG  L L+K 
Sbjct: 282 VSYCNLVRQAVEHVVTAVRAADPTGNDMLCYN---SDTIDIFPVITMHFSGGVDLVLDKY 338

Query: 350 SMFYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
           +M+ +      FC+A+    I N     ++ G  AQ  + VGYD      +F   +C  L
Sbjct: 339 NMYMESNNGGVFCLAI----ICNSPTQEAIFGNRAQNNFLVGYDSSSLLVSFSPTNCSAL 394


>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
 gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
 gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
 gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 504

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 110/366 (30%), Positives = 158/366 (43%), Gaps = 21/366 (5%)

Query: 47  QAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPS 106
           Q  ++    + +  ++  + +G P    +  +DTGS + WV C PC DC  Q   +F PS
Sbjct: 153 QGPVVSGVGLGSGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPS 212

Query: 107 RSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDEST 166
            S+SYA V CD+  C     A C      C+Y   Y  G  T    +TE LT  +    +
Sbjct: 213 LSTSYASVACDNPRCHDLDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTLGD----S 268

Query: 167 IHVQDVVFGCGFSTNRNFKFSGIFGLGIGRS-SLVSQLN-SSFSYCIGSLHDPDYLHNKL 224
             V  V  GCG      F  +       G   S  SQ++ ++FSYC+     P    + L
Sbjct: 269 APVSSVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISATTFSYCLVDRDSPS--SSTL 326

Query: 225 ILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMID 280
             GD A  D     PL         YY+ L  +SV G++L I P+ F  D +G GGV++D
Sbjct: 327 QFGDAA--DAEVTAPLIRSPRTSTFYYVGLSGLSVGGQILSIPPSAFAMDSTGAGGVIVD 384

Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRG 340
           SGT VT L   AY ALRD  +   +     S       CY     + ++  P V   F G
Sbjct: 385 SGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVE-VPAVSLRFAG 443

Query: 341 GAKLAL-EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKT 399
           G +L L  K+ +        +C+A  P +        S+IG + QQ   V +D  +    
Sbjct: 444 GGELRLPAKNYLIPVDGAGTYCLAFAPTN-----AAVSIIGNVQQQGTRVSFDTAKSTVG 498

Query: 400 FQRMDC 405
           F    C
Sbjct: 499 FTTNKC 504


>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
           Short=AtASPG1; Flags: Precursor
 gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
           thaliana]
 gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
 gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 500

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 111/346 (32%), Positives = 167/346 (48%), Gaps = 28/346 (8%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++  I +G P    +  +DTGS + W+ C PC DC  Q   +F P+ SS+Y  + C +  
Sbjct: 162 YFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAPQ 221

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C     + C +  ++C+Y   Y  G  T   ++T+ +TF N+ +    + +V  GCG   
Sbjct: 222 CSLLETSACRS--NKCLYQVSYGDGSFTVGELATDTVTFGNSGK----INNVALGCGHDN 275

Query: 181 NRNFKFSGIFGLGIGRS-SLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
              F  +       G   S+ +Q+  +SFSYC   L D D   +  +  +   +  GDAT
Sbjct: 276 EGLFTGAAGLLGLGGGVLSITNQMKATSFSYC---LVDRDSGKSSSLDFNSVQLGGGDAT 332

Query: 239 -PL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAY 293
            PL   + ID  YY+ L   SV G  + +   IF  D SG GGV++D GT VT L  +AY
Sbjct: 333 APLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAY 392

Query: 294 EALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL-EKD 349
            +LRD   ++ + L+ +   S S  D  CY     S +K  PTV FHF GG  L L  K+
Sbjct: 393 NSLRDAFLKLTVNLK-KGSSSISLFDT-CYDFSSLSTVK-VPTVAFHFTGGKSLDLPAKN 449

Query: 350 SMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGR 395
            +        FC A  P S      + S+IG + QQ   + YD+ +
Sbjct: 450 YLIPVDDSGTFCFAFAPTS-----SSLSIIGNVQQQGTRITYDLSK 490


>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 114/352 (32%), Positives = 160/352 (45%), Gaps = 23/352 (6%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++  + IG P    +  +DTGS + W+ C PC DC  Q   IF PS SSSY  + CD+  
Sbjct: 151 YFTRVGIGNPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSSSYEPLSCDTPQ 210

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C     + C      C+Y   Y  G  T    +TE LT  +T      VQ+V  GCG S 
Sbjct: 211 CNALEVSECR--NATCLYEVSYGDGSYTVGDFATETLTIGST-----LVQNVAVGCGHSN 263

Query: 181 NRNFKFSGIFGLGIGRSSLV-SQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
              F  +       G    + SQLN +SFSYC   L D D      +    ++  +    
Sbjct: 264 EGLFVGAAGLLGLGGGLLALPSQLNTTSFSYC---LVDRDSDSASTVEFGTSLPPDAVVA 320

Query: 239 PL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYE 294
           PL     +D  YY+ L  ISV G +L I  + F+ D+SG GG++IDSGT VT L    Y 
Sbjct: 321 PLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQTGIYN 380

Query: 295 ALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL-EKDSMFY 353
           +LRD  +      +  +       CY+    + ++  PTV FHF GG  LAL  K+ M  
Sbjct: 381 SLRDSFLKGTSDLEKAAGVAMFDTCYNLSAKTTIE-VPTVAFHFPGGKMLALPAKNYMIP 439

Query: 354 QPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
                 FC+A  P +      + ++IG + QQ   V +D+      F    C
Sbjct: 440 VDSVGTFCLAFAPTA-----SSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 486


>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
 gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
          Length = 357

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 114/357 (31%), Positives = 163/357 (45%), Gaps = 25/357 (7%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++V + IG P   Q+  MDTGS + W+ C PC+ C  Q   +F P  SSS+  + C +  
Sbjct: 14  YFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRASSSFRRLSCSTPQ 73

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C+      C++  +RC+Y   Y  G  T   ++++  +      S      VVFGCG   
Sbjct: 74  CKLLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSFSVSRGRTS-----PVVFGCGHDN 128

Query: 181 NRNF-KFSGIFGLGIGRSSLVSQLNS-SFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
              F   +G+ GLG G+ S  SQL+S  FSYC+ S  +     + L+ GD A+       
Sbjct: 129 EGLFVGAAGLLGLGAGKLSFPSQLSSRKFSYCLVSRDNGVRASSALLFGDSALPTSASFA 188

Query: 239 PLQF-----IDGHYYITLEAISVDGRMLDINPNIFKRDDSG--GGVMIDSGTDVTWLVKE 291
             Q      +D  YY  L  IS+ G +L I    FK   S   GGV+IDSGT VT L   
Sbjct: 189 YTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVIIDSGTSVTRLPTY 248

Query: 292 AYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS 350
           AY  +RD      +   +   +S  D  CY     + +   PTV FHF GGA + L   S
Sbjct: 249 AYTVMRDAFRSATQKLPRAADFSLFDT-CYDFSALTSVT-IPTVSFHFEGGASVQLPP-S 305

Query: 351 MFYQP--RPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
            +  P      FC A +  S+     + S+IG + QQ   V  D+   +  F    C
Sbjct: 306 NYLVPVDTSGTFCFAFSKTSL-----DLSIIGNIQQQTMRVAIDLDSSRVGFAPRQC 357


>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 499

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 108/361 (29%), Positives = 163/361 (45%), Gaps = 30/361 (8%)

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYV 114
           +S   + V + +G P        DTGS   WV C PC   C  Q   +F P++SS+YA V
Sbjct: 158 VSTGNYVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKEPLFDPAKSSTYANV 217

Query: 115 PCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
            C    C       C+     C+Y   Y  G  T  F + + LT  +       ++   F
Sbjct: 218 SCTDSACADLDTNGCTG--GHCLYAVQYGDGSYTVGFFAQDTLTIAHD-----AIKGFRF 270

Query: 175 GCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDG 229
           GCG   N  F K +G+ GLG G++SL  Q       +F+YC+ +L         L  G G
Sbjct: 271 GCGEKNNGLFGKTAGLMGLGRGKTSLTVQAYNKYGGAFAYCLPALTTGT---GYLDFGPG 327

Query: 230 AIIDEGDATPLQFIDGH--YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
           +  +    TP+    G   YY+ +  I V G+ + +  ++F    S  G ++DSGT +T 
Sbjct: 328 SAGNNARLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVF----STAGTLVDSGTVITR 383

Query: 288 LVKEAYEALR---DEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKL 344
           L   AY AL    D+VM+    ++   YS  D  CY     SD++  PTV   F+GGA L
Sbjct: 384 LPATAYTALSSAFDKVMLARGYKKAPGYSILDT-CYDFTGLSDVE-LPTVSLVFQGGACL 441

Query: 345 ALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
            ++   + Y       C+A    + N    + +++G   Q+ Y V YD+G+K   F    
Sbjct: 442 DVDVSGIVYAISEAQVCLAF---ASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGS 498

Query: 405 C 405
           C
Sbjct: 499 C 499


>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 418

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 108/376 (28%), Positives = 171/376 (45%), Gaps = 24/376 (6%)

Query: 46  YQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYP 105
           +Q+ ++  + + +  ++V+  +G PP      +D+GS LLWV C PCR C  Q   ++ P
Sbjct: 49  FQSPVVSGSTLGSGQYFVDFFLGTPPQKFSLIVDSGSDLLWVQCSPCRQCYAQDSPLYVP 108

Query: 106 SRSSSYAYVPCDSEHCRYFPYAR---CS-AYKHRCIYTQLYLIGPETSVFVSTEQLTFKN 161
           S SS+++ VPC S  C   P      C   Y   C Y  LY    +TS   S     +++
Sbjct: 109 SNSSTFSPVPCLSSDCLLIPATEGFPCDFRYPGACAYEYLYA---DTS--SSKGVFAYES 163

Query: 162 TDESTIHVQDVVFGCGFSTNRNFKFS-GIFGLGIGRSSLVSQL----NSSFSYCIGSLHD 216
                + +  V FGCG     +F  + G+ GLG G  S  SQ+     + F+YC+ +  D
Sbjct: 164 ATVDGVRIDKVAFGCGSDNQGSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYLD 223

Query: 217 PDYLHNKLILGDGAI--IDEGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRD 271
           P  + + LI GD  I  I +   TP+         YY+ +E ++V G+ L I+ + ++ D
Sbjct: 224 PTSVSSSLIFGDELISTIHDMQYTPIVSNPKSPTLYYVQIEKVTVGGKSLPISDSAWEID 283

Query: 272 DSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG 330
             G GG + DSGT +T+    AY  +       +   +  S    D LC   +   D   
Sbjct: 284 LLGNGGSIFDSGTTLTYWFPSAYSHILAAFDSGVHYPRAESVQGLD-LCVE-LTGVDQPS 341

Query: 331 FPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
           FP+    F  GA    E ++ F    P+  C+A+  A + +    F+ IG + QQ + V 
Sbjct: 342 FPSFTIEFDDGAVFQPEAENYFVDVAPNVRCLAM--AGLASPLGGFNTIGNLLQQNFFVQ 399

Query: 391 YDIGRKQKTFQRMDCE 406
           YD       F    C 
Sbjct: 400 YDREENLIGFAPAKCS 415


>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
          Length = 500

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 111/346 (32%), Positives = 167/346 (48%), Gaps = 28/346 (8%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++  I +G P    +  +DTGS + W+ C PC DC  Q   +F P+ SS+Y  + C +  
Sbjct: 162 YFSRIGVGTPAKDMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAPQ 221

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C     + C +  ++C+Y   Y  G  T   ++T+ +TF N+ +    + +V  GCG   
Sbjct: 222 CSLLETSACRS--NKCLYQVSYGDGSFTVGELATDTVTFGNSGK----INNVALGCGHDN 275

Query: 181 NRNFKFSGIFGLGIGRS-SLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
              F  +       G   S+ +Q+  +SFSYC   L D D   +  +  +   +  GDAT
Sbjct: 276 EGLFTGAAGLLGLGGGVLSITNQMKATSFSYC---LVDRDSGKSSSLDFNSVQLGGGDAT 332

Query: 239 -PL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAY 293
            PL   + ID  YY+ L   SV G  + +   IF  D SG GGV++D GT VT L  +AY
Sbjct: 333 APLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAY 392

Query: 294 EALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL-EKD 349
            +LRD   ++ + L+ +   S S  D  CY     S +K  PTV FHF GG  L L  K+
Sbjct: 393 NSLRDAFLKLTVNLK-KGSSSISLFDT-CYDFSSLSTVK-VPTVAFHFTGGKSLDLPAKN 449

Query: 350 SMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGR 395
            +        FC A  P S      + S+IG + QQ   + YD+ +
Sbjct: 450 YLIPVDDSGTFCFAFAPTS-----SSLSIIGNVQQQGTRITYDLSK 490


>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 481

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 121/418 (28%), Positives = 184/418 (44%), Gaps = 50/418 (11%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNT--------YQAQILP 52
           L+HRD I + +N  + + +H     I     R+A L  ++   +         + A+++ 
Sbjct: 75  LVHRDKI-TAFNKSSYDHSHNFHARIQRDKKRVATLIRRLSPRDATSSYSVEEFGAEVVS 133

Query: 53  SNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYA 112
             +  +  +++ I +G PP  Q+  +D+GS ++WV C PC  C  Q   +F P+ S+S+ 
Sbjct: 134 GMNQGSGEYFIRIGVGSPPREQYVVIDSGSDIVWVQCQPCTQCYHQTDPVFDPADSASFM 193

Query: 113 YVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
            VPC S  C     A C A    C Y  +Y  G  T   ++ E LTF  T      V++V
Sbjct: 194 GVPCSSSVCERIENAGCHA--GGCRYEVMYGDGSYTKGTLALETLTFGRT-----VVRNV 246

Query: 173 VFGCGFSTNRNFKFSGIFGLGIGRS-SLVSQL----NSSFSYCIGSLHDPDYLHNKLILG 227
             GCG      F  +       G S SLV QL      +FSYC+ S          L  G
Sbjct: 247 AIGCGHRNRGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTDS--AGSLEFG 304

Query: 228 DGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGT 283
            GA+       PL         YYI L  + V G  + I+ ++F+ ++ G GGV++D+GT
Sbjct: 305 RGAMPVGAAWIPLIRNPRAPSFYYIRLSGVGVGGMKVPISEDVFQLNEMGNGGVVMDTGT 364

Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----PTVRFHF 338
            VT +   AY A RD  + +       S       CY      +L GF     PTV F+F
Sbjct: 365 AVTRIPTVAYVAFRDAFIGQTGNLPRASGVSIFDTCY------NLNGFVSVRVPTVSFYF 418

Query: 339 RGGAKLALEKDSMFYQPRPDA----FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
            GG  L L   + F  P  D     F  A +P+ +       S+IG + Q+   + +D
Sbjct: 419 AGGPILTLPARN-FLIPVDDVGTFCFAFAASPSGL-------SIIGNIQQEGIQISFD 468


>gi|242069057|ref|XP_002449805.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
 gi|241935648|gb|EES08793.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
          Length = 430

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 109/366 (29%), Positives = 167/366 (45%), Gaps = 36/366 (9%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + + ++IG PPVP     DTGS L W  C PC+ C  Q   I+  + SSS++ +PC S  
Sbjct: 83  YLMELAIGTPPVPFIALADTGSDLTWTQCKPCKLCFGQDTPIYDTTTSSSFSPLPCSSAT 142

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF-S 179
           C     +RCS     C Y   Y  G               + + + I V  + FGCG  +
Sbjct: 143 CLPIWSSRCSTPSATCRYRYAYDDGA-------------YSPECAGISVGGIAFGCGVDN 189

Query: 180 TNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCI-----GSLHDPDYLHNKLILGDGAIID 233
              ++  +G  GLG G  SLV+QL    FSYC+      SL  P +  +   L   +   
Sbjct: 190 GGLSYNSTGTVGLGRGSLSLVAQLGVGKFSYCLTDFFNTSLSSPVFFGSLAELAASSASA 249

Query: 234 EG---DATPL---QFIDGHYYITLEAISVDGRMLDINPNIF--KRDDSGGGVMIDSGTDV 285
           +     +TPL    +    YY++LE IS+    L I    F    DD  GG+++DSGT  
Sbjct: 250 DAAVVQSTPLVQSPYNPSRYYVSLEGISLGDARLPIPNGTFDLNDDDGSGGMIVDSGTIF 309

Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS--DLKGFPTVRFHFRGGAK 343
           T LV+  +  + D V   L G+ + + S  D+ C+    +   +L   P +  HF GGA 
Sbjct: 310 TILVETGFRVVVDHVAGVL-GQPVVNASSLDRPCFPAPAAGVQELPDMPDMVLHFAGGAD 368

Query: 344 LALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
           + L +D+ M +     +FC+ +    +     + S++G   QQ   + +DI   Q +F  
Sbjct: 369 MRLHRDNYMSFNEEESSFCLNI----VGTESASGSVLGNFQQQNIQMLFDITVGQLSFMP 424

Query: 403 MDCEVL 408
            DC  L
Sbjct: 425 TDCSKL 430


>gi|326529233|dbj|BAK01010.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 441

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 136/428 (31%), Positives = 188/428 (43%), Gaps = 70/428 (16%)

Query: 20  HRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL------FYVNISIGQPPVP 73
            RV+R + +S  RLAY Q+        Q Q+  S D+S  +      +     IG PP  
Sbjct: 45  ERVRRAVAVSRERLAYTQQ--------QQQLRASGDVSAPVHLATRQYIAEYLIGDPPQR 96

Query: 74  QFTAMDTGSSLLWVHC-YPC--RDCSPQLGTIFYPSRSSSYAYVPC-DSEHCRYFPYARC 129
               +DTGS+L+W  C   C  + C+ Q    +  SRSS++A VPC DS           
Sbjct: 97  AAALIDTGSNLIWTQCGTTCGLKACAKQDLPYYNLSRSSTFAAVPCADSAKLCAANGVHL 156

Query: 130 SAYKHRCIYTQLYLIGPETSVF--VSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFK-- 185
                 C +   Y  G   SVF  + TE  TF++          + FGC  S  R  K  
Sbjct: 157 CGLDGSCTFAASYGAG---SVFGSLGTEAFTFQS------GAAKLGFGC-VSLTRITKGA 206

Query: 186 ---FSGIFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNKLILGDGAIIDEGDA--TP 239
               SG+ GLG GR SLVSQ  ++ FSYC+          + L +G  A +  G    T 
Sbjct: 207 LNGASGLIGLGRGRLSLVSQTGATKFSYCLTPYLRNHGASSHLFVGASASLSGGGGAVTS 266

Query: 240 LQFIDG--------HYYITLEAISVDGRMLDINPNIF--KRDDSG---GGVMIDSGTDVT 286
           + F+           YY+ L  ISV    L I    F  +R  +G   GGV+ID+G+ VT
Sbjct: 267 IPFVKSPEDYPYSTFYYLPLVGISVGETKLPIPSAAFELRRVAAGYWSGGVIIDTGSPVT 326

Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDK-----LCYHGIMSSDL-KGFPTVRFHFRG 340
            L + AY AL DEV  +L     RS   P       LC   +   D+ K  P + FHF G
Sbjct: 327 SLAEAAYSALSDEVARQLN----RSLVQPPADTGLDLC---VARQDVDKVVPVLVFHFGG 379

Query: 341 GAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
           GA +A+   S +        CM +            ++IG   QQ  ++ YDIG+ + +F
Sbjct: 380 GADMAVSAGSYWGPVDKSTACMLIEEGGYE------TVIGNFQQQDVHLLYDIGKGELSF 433

Query: 401 QRMDCEVL 408
           Q  DC VL
Sbjct: 434 QTADCSVL 441


>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
          Length = 484

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 116/388 (29%), Positives = 171/388 (44%), Gaps = 42/388 (10%)

Query: 41  RSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG 100
           RS   +   ++      +  +++ + +G P    +  +DTGS ++W+ C PC+ C  Q  
Sbjct: 116 RSAGGFSGVVISGLSQGSGEYFMRLGVGTPATNMYMVLDTGSDVVWLQCSPCKVCYNQSD 175

Query: 101 TIFYPSRSSSYAYVPCDSEHCRYF-PYARCSAYKHR-CIYTQLYLIGPETSVFVSTEQLT 158
            +F P++S ++A VPC S  CR     + C + + + C+Y   Y  G  T    STE LT
Sbjct: 176 PVFNPAKSKTFATVPCGSRLCRRLDDSSECVSRRSKACLYQVSYGDGSFTVGDFSTETLT 235

Query: 159 FKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGR-----SSLVSQLNSSFSYCI-- 211
           F         V  V  GCG      F  +              S   ++ N  FSYC+  
Sbjct: 236 FHGA-----RVDHVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVD 290

Query: 212 -GSLHDPDYLHNKLILGDGAIIDEGDATPLQF---IDGHYYITLEAISVDG-RMLDINPN 266
             S        + ++ G+GA+      TPL     +D  YY+ L  ISV G R+  ++ +
Sbjct: 291 RTSSGSSSKPPSTIVFGNGAVPKTAVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSES 350

Query: 267 IFKRDDSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMR---SYSWPDKLCYHG 322
            FK D +G GGV+IDSGT VT L + AY ALRD    RL   +++   SYS  D  C+  
Sbjct: 351 QFKLDATGNGGVIIDSGTSVTRLTQSAYVALRDA--FRLGATRLKRAPSYSLFDT-CF-- 405

Query: 323 IMSSDLKGF-----PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFS 377
               DL G      PTV FHF GG       + +        FC A           + S
Sbjct: 406 ----DLSGMTTVKVPTVVFHFTGGEVSLPASNYLIPVNNQGRFCFA-----FAGTMGSLS 456

Query: 378 LIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           +IG + QQ + V YD+   +  F    C
Sbjct: 457 IIGNIQQQGFRVAYDLVGSRVGFLSRAC 484


>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
          Length = 452

 Score =  135 bits (340), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 111/434 (25%), Positives = 184/434 (42%), Gaps = 45/434 (10%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           ++HRD++  P       P     R      A+L  L     + +  ++ ++      +  
Sbjct: 34  VVHRDAVFPPRRG--APPGSFRCRHAAPHTAQLESLHSATAAADLLRSPVMSGVPFDSGE 91

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++  I +G PP      +DTGS L+W+ C PCR C  Q+  ++ P  S ++  +PC S  
Sbjct: 92  YFAVIGVGDPPTHALVVIDTGSDLIWLQCLPCRRCYRQVTPLYDPRNSKTHRRIPCASPQ 151

Query: 121 CR-YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
           CR    Y  C A    C+Y  +Y  G  +S  ++T+ L     D++ +H  +V  GCG  
Sbjct: 152 CRGVLRYPGCDARTGGCVYMVVYGDGSASSGDLATDTLVLP--DDTRVH--NVTLGCGHD 207

Query: 180 TNRNF-KFSGIFGLGIGRSSLVSQLNSS----FSYCIGS-LHDPDYLHNKLILGDGAIID 233
                   +G+ G G G+ S  +QL  +    FSYC+G  +       + L+ G    + 
Sbjct: 208 NEGLLASAAGLLGAGRGQLSFPTQLAPAYGHVFSYCLGDRMSRARNSSSYLVFGRTPELP 267

Query: 234 EGDATPLQFIDGH---YYITLEAISVDGRM--------LDINPNIFKRDDSGGGVMIDSG 282
               TPL+        YY+ +   SV G          L +NP   +     GGV++DSG
Sbjct: 268 STAFTPLRTNPRRPSLYYVDMVGFSVGGERVAGFSNASLALNPATGR-----GGVVVDSG 322

Query: 283 TDVTWLVKEAYEALRDEVMIRLEGEQMR----SYSWPDKLCY--HGIMSSDLKGFPTVRF 336
           T ++   ++AY A+RD  +       MR     +S  D  CY  HG         P++  
Sbjct: 323 TAISRFTRDAYAAVRDAFVSHAAAAGMRRLRNKFSVFDT-CYDVHGNGPGTGVRVPSIVL 381

Query: 337 HFRGGAKLALEKDSMFYQ----PRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
           HF   A +AL + +         R   FC+ +  A         +++G + QQ + V +D
Sbjct: 382 HFAAAADMALPQANYLIPVVGGDRRTYFCLGLQAADD-----GLNVLGNVQQQGFGVVFD 436

Query: 393 IGRKQKTFQRMDCE 406
           + R +  F    C 
Sbjct: 437 VERGRIGFTPNGCS 450


>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 461

 Score =  135 bits (340), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 109/363 (30%), Positives = 159/363 (43%), Gaps = 38/363 (10%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++  I +G P    +  +DTGS ++W+ C PCR C  Q   +F P++S +YA +PC +  
Sbjct: 118 YFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQTDHVFDPTKSRTYAGIPCGAPL 177

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           CR      CS     C Y   Y  G  T    STE LTF+        V  V  GCG   
Sbjct: 178 CRRLDSPGCSNKNKVCQYQVSYGDGSFTFGDFSTETLTFRRN-----RVTRVALGCGHDN 232

Query: 181 NRNFKFSGI----------FGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGA 230
              F  +            F +  GR     + N  FSYC+          + +I GD A
Sbjct: 233 EGLFTGAAGLLGLGRGRLSFPVQTGR-----RFNHKFSYCLVD-RSASAKPSSVIFGDSA 286

Query: 231 IIDEGDATPL---QFIDGHYYITLEAISVDGR-MLDINPNIFKRDDSG-GGVMIDSGTDV 285
           +      TPL     +D  YY+ L  ISV G  +  ++ ++F+ D +G GGV+IDSGT V
Sbjct: 287 VSRTAHFTPLIKNPKLDTFYYLELLGISVGGAPVRGLSASLFRLDAAGNGGVIIDSGTSV 346

Query: 286 TWLVKEAYEALRDEVMIRLEGEQMR---SYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGA 342
           T L + AY ALRD    R+    ++    +S  D  C+     +++K  PTV  HFRG  
Sbjct: 347 TRLTRPAYIALRDA--FRIGASHLKRAPEFSLFDT-CFDLSGLTEVK-VPTVVLHFRGAD 402

Query: 343 KLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
                 + +       +FC A             S+IG + QQ + + YD+   +  F  
Sbjct: 403 VSLPATNYLIPVDNSGSFCFA-----FAGTMSGLSIIGNIQQQGFRISYDLTGSRVGFAP 457

Query: 403 MDC 405
             C
Sbjct: 458 RGC 460


>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 406

 Score =  135 bits (340), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 120/402 (29%), Positives = 187/402 (46%), Gaps = 29/402 (7%)

Query: 21  RVQRGIN-ISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMD 79
           R+ + +N ++ +R    Q K+ S + +QA ++    + +  +++ IS+G PP   +  MD
Sbjct: 18  RINQTVNGLTRSRSRDRQTKVPSQD-FQAPVVSGLSLGSGEYFIRISVGTPPRRMYLVMD 76

Query: 80  TGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYT 139
           TGS +LW+ C PC +C  Q   IF P +SS+Y+ + C +  C       C A  ++C+Y 
Sbjct: 77  TGSDILWLQCAPCVNCYHQSDAIFDPYKSSTYSTLGCSTRQCLNLDIGTCQA--NKCLYQ 134

Query: 140 QLYLIGPETSVFVSTEQLTFKNTDE-STIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRS 197
             Y  G  T+    T+ ++  +T     + +  +  GCG      F   +G+ GLG G  
Sbjct: 135 VDYGDGSFTTGEFGTDDVSLNSTSGVGQVVLNKIPLGCGHDNEGYFVGAAGLLGLGKGPL 194

Query: 198 SLVSQLN----SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDA-TPLQF---IDGHYYI 249
           S  +Q++      FSYC+          + L+ G+ A+   G   TP      +   YY+
Sbjct: 195 SFPNQVDPQNGGRFSYCLTDRETDSTEGSSLVFGEAAVPPAGARFTPQDSNMRVPTFYYL 254

Query: 250 TLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEG-E 307
            +  ISV G +L I  + F+ D  G GGV+IDSGT VT L   AY +LRD          
Sbjct: 255 KMTGISVGGTILTIPTSAFQLDSLGNGGVIIDSGTSVTRLQNAAYASLRDAFRAGTSDLA 314

Query: 308 QMRSYSWPDKLCY--HGIMSSDLKGFPTVRFHFRGGAKLALEKDS-MFYQPRPDAFCMAV 364
               +S  D  CY   G+ S D+   PTV  HF+GG  L L   + +      + FC+A 
Sbjct: 315 PTAGFSLFDT-CYDLSGLASVDV---PTVTLHFQGGTDLKLPASNYLIPVDNSNTFCLAF 370

Query: 365 NPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
              +        S+IG + QQ + V YD    Q  F    C 
Sbjct: 371 AGTT------GPSIIGNIQQQGFRVIYDNLHNQVGFVPSQCN 406


>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
 gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
          Length = 357

 Score =  135 bits (340), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 114/357 (31%), Positives = 162/357 (45%), Gaps = 25/357 (7%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++V + IG P   Q+  MDTGS + W+ C PC+ C  Q   +F P  SSS+  + C +  
Sbjct: 14  YFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRASSSFRRLSCSTPQ 73

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C+      C++  +RC+Y   Y  G  T   ++++         S      VVFGCG   
Sbjct: 74  CKLLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSFLVSRGRTS-----PVVFGCGHDN 128

Query: 181 NRNF-KFSGIFGLGIGRSSLVSQLNS-SFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
              F   +G+ GLG G+ S  SQL+S  FSYC+ S  +     + L+ GD A+       
Sbjct: 129 EGLFVGAAGLLGLGAGKLSFPSQLSSRKFSYCLVSRDNGVRASSALLFGDSALPTSASFA 188

Query: 239 PLQF-----IDGHYYITLEAISVDGRMLDINPNIFKRDDSG--GGVMIDSGTDVTWLVKE 291
             Q      +D  YY  L  IS+ G +L I    FK   S   GGV+IDSGT VT L   
Sbjct: 189 YTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVIIDSGTSVTRLPTY 248

Query: 292 AYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS 350
           AY  +RD      +   +   +S  D  CY     + +   PTV FHF GGA + L   S
Sbjct: 249 AYTVMRDAFRSATQKLPRAADFSLFDT-CYDFSALTSVT-IPTVSFHFEGGASVQLPP-S 305

Query: 351 MFYQP--RPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
            +  P      FC A +  S+     + S+IG + QQ   V  D+   +  F    C
Sbjct: 306 NYLVPVDTSGTFCFAFSKTSL-----DLSIIGNIQQQTMRVAIDLDSSRVGFAPRQC 357


>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 472

 Score =  135 bits (339), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 107/361 (29%), Positives = 159/361 (44%), Gaps = 34/361 (9%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++  I +G P    +  +DTGS ++W+ C PCR C  Q   +F P++S +YA +PC +  
Sbjct: 129 YFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQADPVFDPTKSRTYAGIPCGAPL 188

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           CR      C+     C Y   Y  G  T    STE LTF+ T      V  V  GCG   
Sbjct: 189 CRRLDSPGCNNKNKVCQYQVSYGDGSFTFGDFSTETLTFRRT-----RVTRVALGCGHDN 243

Query: 181 NRNF----------KFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGA 230
              F          +    F +  GR     + N  FSYC+          + ++ GD A
Sbjct: 244 EGLFIGAAGLLGLGRGRLSFPVQTGR-----RFNQKFSYCLVD-RSASAKPSSVVFGDSA 297

Query: 231 IIDEGDATPL---QFIDGHYYITLEAISVDGR-MLDINPNIFKRDDSG-GGVMIDSGTDV 285
           +      TPL     +D  YY+ L  ISV G  +  ++ ++F+ D +G GGV+IDSGT V
Sbjct: 298 VSRTARFTPLIKNPKLDTFYYLELLGISVGGSPVRGLSASLFRLDAAGNGGVIIDSGTSV 357

Query: 286 TWLVKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKL 344
           T L + AY ALRD   +     ++   +S  D  C+     +++K  PTV  HFRG    
Sbjct: 358 TRLTRPAYIALRDAFRVGASHLKRAAEFSLFDT-CFDLSGLTEVK-VPTVVLHFRGADVS 415

Query: 345 ALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
               + +       +FC A             S+IG + QQ + V +D+   +  F    
Sbjct: 416 LPATNYLIPVDNSGSFCFA-----FAGTMSGLSIIGNIQQQGFRVSFDLAGSRVGFAPRG 470

Query: 405 C 405
           C
Sbjct: 471 C 471


>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 473

 Score =  135 bits (339), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 109/358 (30%), Positives = 158/358 (44%), Gaps = 28/358 (7%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++  + +G PP   +  +DTGS ++W+ C PC  C  Q   IF PS+S S+A +PC S  
Sbjct: 130 YFTRLGVGTPPKYLYMVLDTGSDVVWLQCKPCTKCYSQTDQIFDPSKSKSFAGIPCYSPL 189

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           CR      CS   + C Y   Y  G  T    STE LTF+        V  V  GCG   
Sbjct: 190 CRRLDSPGCSLKNNLCQYQVSYGDGSFTFGDFSTETLTFRRA-----AVPRVAIGCGHDN 244

Query: 181 NRNFKFSGIFGLGIGR-----SSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
              F  +              +   ++ N+ FSYC+          + ++ GD A+    
Sbjct: 245 EGLFVGAAGLLGLGRGGLSFPTQTGTRFNNKFSYCLTD-RTASAKPSSIVFGDSAVSRTA 303

Query: 236 DATPL---QFIDGHYYITLEAISVDGR-MLDINPNIFKRDDSG-GGVMIDSGTDVTWLVK 290
             TPL     +D  YY+ L  ISV G  +  I+ + F+ D +G GGV+IDSGT VT L +
Sbjct: 304 RFTPLVKNPKLDTFYYVELLGISVGGAPVRGISASFFRLDSTGNGGVIIDSGTSVTRLTR 363

Query: 291 EAYEALRDEVMIRLEGEQMR---SYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALE 347
            AY +LRD    R+    ++    +S  D  CY     S++K  PTV  HFRG       
Sbjct: 364 PAYVSLRDA--FRVGASHLKRAPEFSLFDT-CYDLSGLSEVK-VPTVVLHFRGADVSLPA 419

Query: 348 KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
            + +       +FC A             S+IG + QQ + V +D+   +  F    C
Sbjct: 420 ANYLVPVDNSGSFCFA-----FAGTMSGLSIIGNIQQQGFRVVFDLAGSRVGFAPRGC 472


>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
 gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
          Length = 469

 Score =  135 bits (339), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 129/437 (29%), Positives = 194/437 (44%), Gaps = 58/437 (13%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDIS--- 57
           L+HR    +P    N  P   +   +  S AR  Y+  +            P +D +   
Sbjct: 59  LVHRYGPCAPSQYSNV-PTPSISETLRRSRARTNYIMSQASKSMGMGMASTPDDDDAAVT 117

Query: 58  ---------NSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHCYPCRD--CSPQLGTIFYP 105
                    +SL YV  +  G P VPQ   MDTGS + WV C PC    C PQ   +F P
Sbjct: 118 IPTRLGGFVDSLEYVVTLGFGTPSVPQVLLMDTGSDVSWVQCTPCNSTKCYPQKDPLFDP 177

Query: 106 SRSSSYAYVPCDSEHCRYFP---YARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT 162
           S+SS+YA + C+++ CR      +  C++   +C Y+  Y  G  +    S E LT    
Sbjct: 178 SKSSTYAPIACNTDACRKLGDHYHNGCTSGGTQCGYSVEYADGSHSRGVYSNETLTLA-- 235

Query: 163 DESTIHVQDVVFGCGFST-NRNFKFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDP 217
               I V+D  FGCG      + K+ G+ GLG    SLV Q +S    +FSYC+ +L+  
Sbjct: 236 --PGITVEDFHFGCGRDQRGPSDKYDGLLGLGGAPVSLVVQTSSVYGGAFSYCLPALNSE 293

Query: 218 DYLHNKLILGDGAIIDEGD--ATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDD 272
                 L+LG     ++     TP++ + G+   Y +T+  ISV G+ L I  + F+   
Sbjct: 294 AGF---LVLGSPPSGNKSAFVFTPMRHLPGYATFYMVTMTGISVGGKPLHIPQSAFR--- 347

Query: 273 SGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFP 332
             GG++IDSGT  T L + AY AL   +   L+   +      D  CY+    S++   P
Sbjct: 348 --GGMIIDSGTVDTELPETAYNALEAALRKALKAYPLVPSDDFDT-CYNFTGYSNIT-VP 403

Query: 333 TVRFHFRGGAKLALEKDSMFYQPRPDAF----CMAVNPASINNRYVNFSLIGMMAQQFYN 388
            V F F GGA + L+         P+      C+A   +  ++      +IG + Q+   
Sbjct: 404 RVAFTFSGGATIDLDV--------PNGILVNDCLAFQESGPDD---GLGIIGNVNQRTLE 452

Query: 389 VGYDIGRKQKTFQRMDC 405
           V YD GR    F+   C
Sbjct: 453 VLYDAGRGNVGFRAGAC 469


>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
          Length = 455

 Score =  134 bits (338), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 120/379 (31%), Positives = 174/379 (45%), Gaps = 45/379 (11%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLGTIFYPSRSSSYAYVPCDS 118
           + +NIS+G PP+     +DTGS+L+W  C PC  C   P    +  P+RSS+++ +PC+ 
Sbjct: 91  YNMNISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQPARSSTFSRLPCNG 150

Query: 119 EHCRYFPYA---RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
             C+Y P +   R       C Y   Y  G  T+ +++TE LT  +          V FG
Sbjct: 151 SFCQYLPTSSRPRTCNATAACAYNYTYGSG-YTAGYLATETLTVGDGT-----FPKVAFG 204

Query: 176 CGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
           C  + N     SGI GLG G  SLVSQL    FSYC+ S    D   + ++ G  A + E
Sbjct: 205 CS-TENGVDNSSGIVGLGRGPLSLVSQLAVGRFSYCLRS-DMADGGASPILFGSLAKLTE 262

Query: 235 GD---ATPL---QFI--DGHYYITLEAISVDGRMLDINPNI--FKRDDSGGGVMIDSGTD 284
                +TPL    ++    HYY+ L  I+VD   L +  +   F +   GGG ++DSGT 
Sbjct: 263 RSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLGGGTIVDSGTT 322

Query: 285 VTWLVKEAYEALRDEV---MIRLEGEQMRSYSWPD-KLCYHGIMSSDLKG--FPTVRFHF 338
           +T+L K+ Y  ++      M  L      S +  D  LCY        K    P +   F
Sbjct: 323 LTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGGGGKAVRVPRLALRF 382

Query: 339 RGGAK---------LALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNV 389
            GGAK           +E DS   Q R    C+ V PA+ +   +  S+IG + Q   ++
Sbjct: 383 AGGAKYNVPVQNYFAGVEADS---QGRVTVACLLVLPATDD---LPISIIGNLMQMDMHL 436

Query: 390 GYDIGRKQKTFQRMDCEVL 408
            YDI     +F   DC  L
Sbjct: 437 LYDIDGGMFSFAPADCAKL 455


>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
          Length = 441

 Score =  134 bits (338), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 141/450 (31%), Positives = 186/450 (41%), Gaps = 74/450 (16%)

Query: 1   LIHRDSILSPYNNPNENP--AHRVQRGINISIARLAYLQEKIRSHNTYQAQI-------- 50
           L+HR    +P       P  A R++R      AR  Y+  K     T    +        
Sbjct: 21  LVHRHGPCAPSAASGGKPSLAERLRR----DRARTNYIVTKATGGRTAATALSDAAGGGT 76

Query: 51  -LPS--NDISNSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHCYPC--RDCSPQLGTIFY 104
            +P+   D  NSL YV  + IG P V Q   +DTGS L WV C PC   +C  Q   +F 
Sbjct: 77  SIPTFLGDSVNSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKDPLFD 136

Query: 105 PSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCI-----------YTQLYLIGPETSVFVS 153
           PS SSSYA VPCDS+ CR        AY H C            Y   Y     T+   S
Sbjct: 137 PSSSSSYASVPCDSDACRKL---AAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYS 193

Query: 154 TEQLTFKNTDESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNSS----FS 208
           TE LT K      + V D  FGCG   +  + KF G+ GLG    SLVSQ +S     FS
Sbjct: 194 TETLTLK----PGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFS 249

Query: 209 YCIGSLHDPDYLHNKLILGDGAIIDEGDATP-----------LQFIDGHYYITLEAISVD 257
           YC+     P        L  GA  +   +T            L  +   Y +TL  ISV 
Sbjct: 250 YCL-----PPTSGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVG 304

Query: 258 GRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK 317
           G  L I P+ F       G++IDSGT +T L   AY ALR      +   ++   S    
Sbjct: 305 GAPLAIPPSAFSS-----GMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGV 359

Query: 318 L--CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVN 375
           L  CY     +++   PT+   F GGA + L   +          C+A   A  +N    
Sbjct: 360 LDTCYDFTGHANVT-VPTISLTFSGGATIDLAAPAGVLVDG----CLAFAGAGTDNA--- 411

Query: 376 FSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
             +IG + Q+ + V YD G+    F+   C
Sbjct: 412 IGIIGNVNQRTFEVLYDSGKGTVGFRAGAC 441


>gi|326499093|dbj|BAK06037.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 471

 Score =  134 bits (338), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 130/436 (29%), Positives = 192/436 (44%), Gaps = 47/436 (10%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
            IHRDS  SP+++P+     RV      S  R A L       +   A    S   S   
Sbjct: 39  FIHRDSARSPFHDPSLTAPARVLEAARRSTVRAAALSRSYVRVDAPSADGFVSELTSTPF 98

Query: 61  FYV-NISIGQPPVPQFTAMDTGSSLLWVHC---------YPCRDCSPQ-LGTIFYPSRSS 109
            Y+  ++IG PP       DTGS L+W++C            RD   Q  G  F PS+S+
Sbjct: 99  EYLMAVNIGTPPTRMVAIADTGSDLIWLNCSYGGDGPGLAAARDADAQPPGVQFDPSKST 158

Query: 110 SYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT-----DE 164
           ++  V CDS  C   P A C A   +C Y+  Y  G  TS  +STE  TF +      D 
Sbjct: 159 TFRLVDCDSVACSELPEASCGA-DSKCRYSYSYGDGSHTSGVLSTETFTFADAPGARGDG 217

Query: 165 STIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPD 218
           +T  V +V FGC  +   +    G+ GLG G  SLVSQL +       FSYC+     P 
Sbjct: 218 TTTRVANVNFGCSTTFVGSSVGDGLVGLGGGDLSLVSQLGADTSLGRRFSYCL----VPY 273

Query: 219 YLHNKLILGDG---AIIDEGD-ATPL--QFIDGHYYITLEAISVDGRMLDINPNIFKRDD 272
            +     L  G   A+ D G   TPL    +  +Y + L ++ V  +        F+  D
Sbjct: 274 SVKASSALNFGPRAAVTDPGAVTTPLIPSQVKAYYIVELRSVKVGNKT-------FEAPD 326

Query: 273 SGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCY--HGIMSSDLKG 330
               +++DSGT +T+L +   + L  E+  R++    +S      LC+   G+    +  
Sbjct: 327 R-SPLIVDSGTTLTFLPEALVDPLVKELTGRIKLPPAQSPERLLPLCFDVSGVREGQVAA 385

Query: 331 F-PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNV 389
             P V     GGA + L+ ++ F + +    C+AV   S  +     S+IG +AQQ  +V
Sbjct: 386 MIPDVTVGLGGGAAVTLKAENTFVEVQEGTLCLAV---SAMSEQFPASIIGNIAQQNMHV 442

Query: 390 GYDIGRKQKTFQRMDC 405
           GYD+ +   TF    C
Sbjct: 443 GYDLDKGTVTFAPAAC 458


>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 470

 Score =  134 bits (337), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 105/345 (30%), Positives = 162/345 (46%), Gaps = 31/345 (8%)

Query: 78  MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCS---AYKH 134
           +DTGS L WV C PCR C  Q G +F PS S SY  + C+S  C+      C    +   
Sbjct: 137 VDTGSDLTWVQCEPCRSCYNQNGPLFKPSTSPSYQPILCNSTTCQSLELGACGSDPSTSA 196

Query: 135 RCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF-KFSGIFGLG 193
            C Y   Y  G  TS  +  E+L F       I V + VFGCG +    F   SG+ GLG
Sbjct: 197 TCDYVVNYGDGSYTSGELGIEKLGFGG-----ISVSNFVFGCGRNNKGLFGGASGLMGLG 251

Query: 194 IGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQF------- 242
               S++SQ N++    FSYC+ S  D       L++G+ + + + + TP+ +       
Sbjct: 252 RSELSMISQTNATFGGVFSYCLPS-TDQAGASGSLVMGNQSGVFK-NVTPIAYTRMLPNL 309

Query: 243 -IDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVM 301
            +   Y + L  I V G  L +  + F      GGV++DSGT ++ L    Y+AL+ + +
Sbjct: 310 QLSNFYILNLTGIDVGGVSLHVQASSFGN----GGVILDSGTVISRLAPSVYKALKAKFL 365

Query: 302 IRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAF 360
            +  G      +S  D  C++ +   D    PT+  +F G A+L ++   +FY  + DA 
Sbjct: 366 EQFSGFPSAPGFSILDT-CFN-LTGYDQVNIPTISMYFEGNAELNVDATGIFYLVKEDAS 423

Query: 361 CMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
            + +  AS+++ Y    +IG   Q+   V YD    Q  F +  C
Sbjct: 424 RVCLALASLSDEY-EMGIIGNYQQRNQRVLYDAKLSQVGFAKEPC 467


>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 448

 Score =  134 bits (337), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 107/363 (29%), Positives = 161/363 (44%), Gaps = 31/363 (8%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++ ++ +G PP P    +DTGS ++W+ C PC  C  QL  ++ P  SS+YA  PC    
Sbjct: 99  YFASVGVGTPPTPALLVIDTGSDVVWLQCKPCVHCYRQLSPLYDPRGSSTYAQTPCSPPQ 158

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           CR      C      C Y  +Y     TS  ++T++L F N D S   V +V  GCG   
Sbjct: 159 CRN--PQTCDGTTGGCGYRIVYGDASSTSGNLATDRLVFSN-DTS---VGNVTLGCGHDN 212

Query: 181 NRNF-KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
              F   +G+ G+  G +S  +Q+  S    F+YC+G         + L+ G  A     
Sbjct: 213 EGLFGSAAGLLGVARGNNSFATQVADSYGRYFAYCLGDRTRSGSSSSYLVFGRTAPEPPS 272

Query: 236 DA-TPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSG---GGVMIDSGTDVTWL 288
              TPL+        YY+ +   SV G  +    N     D     GGV++DSGT +T  
Sbjct: 273 SVFTPLRSNPRRPSLYYVDMVGFSVGGEPVTGFSNASLSLDPATGRGGVVVDSGTSITRF 332

Query: 289 VKEAYEALRDEVMIRLEGEQMRSYSWPDKL---CY--HGIMSSDLKGFPTVRFHFRGGAK 343
            ++AY ALRD    R     MR       +   CY   G+  +D    P V  HF GGA 
Sbjct: 333 ARDAYGALRDAFDARAAKVGMRKVGRGISVFDACYDLRGVAVADA---PGVVLHFAGGAD 389

Query: 344 LALEKDSMFYQPRPDAF-CMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
           +AL  ++         + C A+  A     +   S+IG + QQ + V +D+  ++  F+ 
Sbjct: 390 VALPPENYLVPEESGRYHCFALEAAG----HDGLSVIGNVLQQRFRVVFDVENERVGFEP 445

Query: 403 MDC 405
             C
Sbjct: 446 NGC 448


>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
          Length = 472

 Score =  134 bits (337), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 111/363 (30%), Positives = 166/363 (45%), Gaps = 22/363 (6%)

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
           IS+S + + +  G PP   +T +DTGS++ W+ C PC  CS +    F PS+SS+Y Y+ 
Sbjct: 119 ISSSNYIIKLGFGTPPQSFYTVLDTGSNIAWIPCNPCSGCSSK-QQPFEPSKSSTYNYLT 177

Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
           C S+ C+       S     C  TQ Y    E    +S+E L+  +       V++ VFG
Sbjct: 178 CASQQCQLLRVCTKSDNSVNCSLTQRYGDQSEVDEILSSETLSVGSQ-----QVENFVFG 232

Query: 176 CGFSTNRNFKFS-GIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGA 230
           C  +     + +  + G G    S VSQ     +S+FSYC+ SL    +    L+LG  A
Sbjct: 233 CSNAARGLIQRTPSLVGFGRNPLSFVSQTATLYDSTFSYCLPSLFSSAF-TGSLLLGKEA 291

Query: 231 IIDEG-DATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG-GVMIDSGTDV 285
           +  +G   TPL         YY+ L  ISV   ++ I       D+S G G +IDSGT +
Sbjct: 292 LSAQGLKFTPLLSNSRYPSFYYVGLNGISVGEELVSIPAGTLSLDESTGRGTIIDSGTVI 351

Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLA 345
           T LV+ AY A+RD    +L    M S +     CY+   S D++ FP +  HF     L 
Sbjct: 352 TRLVEPAYNAMRDSFRSQLSNLTMASPTDLFDTCYNR-PSGDVE-FPLITLHFDDNLDLT 409

Query: 346 LEKDSMFYQPRPDA--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
           L  D++ Y    D    C+A          V  S  G   QQ   + +D+   +      
Sbjct: 410 LPLDNILYPGNDDGSVLCLAFGLPPGGGDDV-LSTFGNYQQQKLRIVHDVAESRLGIASE 468

Query: 404 DCE 406
           +C+
Sbjct: 469 NCD 471


>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 521

 Score =  134 bits (337), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 142/450 (31%), Positives = 187/450 (41%), Gaps = 74/450 (16%)

Query: 1   LIHRDSILSPYNNPNENP--AHRVQRGINISIARLAYLQEKIRSHNTYQAQI-------- 50
           L+HR    +P       P  A R++R      AR  Y+  K     T    +        
Sbjct: 101 LVHRHGPCAPSAASGGKPSLAERLRR----DRARTNYIVTKATGGRTAATALSDAAGGGT 156

Query: 51  -LPS--NDISNSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHCYPC--RDCSPQLGTIFY 104
            +P+   D  NSL YV  + IG P V Q   +DTGS L WV C PC   +C  Q   +F 
Sbjct: 157 SIPTFLGDSVNSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKDPLFD 216

Query: 105 PSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCI-----------YTQLYLIGPETSVFVS 153
           PS SSSYA VPCDS+ CR        AY H C            Y   Y     T+   S
Sbjct: 217 PSSSSSYASVPCDSDACRKL---AAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYS 273

Query: 154 TEQLTFKNTDESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNSS----FS 208
           TE LT K      + V D  FGCG   +  + KF G+ GLG    SLVSQ +S     FS
Sbjct: 274 TETLTLK----PGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFS 329

Query: 209 YCIGSLHDPDYLHNKLILGDGAIIDEGDATP-----------LQFIDGHYYITLEAISVD 257
           YC+     P        L  GA  +   +T            L  +   Y +TL  ISV 
Sbjct: 330 YCL-----PPTSGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVG 384

Query: 258 GRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK 317
           G  L I P+ F       G++IDSGT +T L   AY ALR      +   ++   S    
Sbjct: 385 GAPLAIPPSAFSS-----GMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGV 439

Query: 318 L--CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVN 375
           L  CY     +++   PT+   F GGA + L   +       D  C+A   A  +N    
Sbjct: 440 LDTCYDFTGHANVT-VPTISLTFSGGATIDLAAPAGVLV---DG-CLAFAGAGTDNA--- 491

Query: 376 FSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
             +IG + Q+ + V YD G+    F+   C
Sbjct: 492 IGIIGNVNQRTFEVLYDSGKGTVGFRAGAC 521


>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 559

 Score =  134 bits (337), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 108/369 (29%), Positives = 171/369 (46%), Gaps = 31/369 (8%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           +++++ +G PP      +DTGS L W+ C PC  C  Q G  + P  SSS+  + C    
Sbjct: 195 YFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKDSSSFRNISCHDPR 254

Query: 121 CRYF----PYARCSAYKHRCIYTQLYLIGPETS----VFVSTEQLTFKNTDESTIHVQDV 172
           C+      P   C A    C Y   Y  G  T+    +   T  LT  N      HV++V
Sbjct: 255 CQLVSSPDPPNPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGKSELKHVENV 314

Query: 173 VFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILG 227
           +FGCG      F   +G+ GLG G  S  SQ+ S    SFSYC+   +    + +KLI G
Sbjct: 315 MFGCGHWNRGLFHGAAGLLGLGKGPLSFASQMQSLYGQSFSYCLVDRNSNASVSSKLIFG 374

Query: 228 -DGAIIDEGDATPLQF-------IDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVM 278
            D  ++   +     F       +D  YY+ + ++ VD  +L I    +     G GG +
Sbjct: 375 EDKELLSHPNLNFTSFGGGKDGSVDTFYYVQINSVMVDDEVLKIPEETWHLSSEGAGGTI 434

Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYH--GIMSSDLKGFPTVRF 336
           IDSGT +T+  + AYE +++  + +++G ++     P K CY+  GI   +L  F  +  
Sbjct: 435 IDSGTTLTYFAEPAYEIIKEAFVRKIKGYELVEGLPPLKPCYNVSGIEKMELPDFGIL-- 492

Query: 337 HFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
            F  GA      ++ F Q  PD  C+A+    + N     S+IG   QQ +++ YD+ + 
Sbjct: 493 -FADGAVWNFPVENYFIQIDPDVVCLAI----LGNPRSALSIIGNYQQQNFHILYDMKKS 547

Query: 397 QKTFQRMDC 405
           +  +  M C
Sbjct: 548 RLGYAPMKC 556


>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 489

 Score =  134 bits (337), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 115/409 (28%), Positives = 177/409 (43%), Gaps = 46/409 (11%)

Query: 28  ISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAM-DTGSSLLW 86
           IS  R    ++     +T Q  I    D   S ++V+I IG P   +F  + DTGS L W
Sbjct: 86  ISSLRHGTRRKAFEVSHTAQIPIHSGADSGQSQYFVSIRIGTPRPQKFILVTDTGSDLTW 145

Query: 87  VHC-YPCRDC---SPQLGTIFYPSRSSSYAYVPCDSEHCR-----YFPYARCSAYKHRCI 137
           ++C Y C+ C   +P  G +F  + SSS+  +PC S+ C+     YF    C      C+
Sbjct: 146 MNCEYWCKSCPKPNPHPGRVFRANDSSSFRTIPCSSDDCKIELQDYFSLTECPNPNAPCL 205

Query: 138 YTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF-SGIFGLGIGR 196
           +   YL GP      + E +T    D   I + DV+ GC  S N    F  G+ GLG  +
Sbjct: 206 FDYRYLNGPRAIGVFANETVTVGLNDHKKIRLFDVLIGCTESFNETNGFPDGVMGLGYRK 265

Query: 197 SSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQ-------FIDG 245
            SL  +L     + FSYC+          N L  GD   I E     +Q       +I+ 
Sbjct: 266 HSLALRLAEIFGNKFSYCLVDHLSSSNHKNFLSFGD---IPEMKLPKMQHTELLLGYINA 322

Query: 246 HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEV----- 300
            Y + +  ISV G ML I+ +I+      GG+++DSGT +T L  EAY+ + D +     
Sbjct: 323 FYPVNVSGISVGGSMLSISSDIWNVTGV-GGMIVDSGTSLTMLAGEAYDKVVDALKPIFD 381

Query: 301 ----MIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPR 356
               ++ +E  ++ ++ + DK         D    P +  HF  GA       S      
Sbjct: 382 KHKKVVPIELPELNNFCFEDK-------GFDRAAVPRLLIHFADGAIFKPPVKSYIIDVA 434

Query: 357 PDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
               C+ +    I   +   S++G + QQ +   YD+GR +  F    C
Sbjct: 435 EGIKCLGI----IKADFPGSSILGNVMQQNHLWEYDLGRGKLGFGPSSC 479


>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 491

 Score =  134 bits (337), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 116/352 (32%), Positives = 162/352 (46%), Gaps = 22/352 (6%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++  + IG PP   +  +DTGS + WV C PC DC  Q   IF PS SSSYA + C++  
Sbjct: 155 YFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPCADCYQQADPIFEPSFSSSYAPLTCETHQ 214

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C+    + C      C+Y   Y  G  T    +TE +T     + +  + +V  GCG   
Sbjct: 215 CKSLDVSECR--NDSCLYEVSYGDGSYTVGDFATETITL----DGSASLNNVAIGCGHDN 268

Query: 181 NRNF-KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
              F   +G+ GLG G  S  SQ+N SSFSYC   L + D      +  +  I       
Sbjct: 269 EGLFVGAAGLLGLGGGSLSFPSQINASSFSYC---LVNRDTDSASTLEFNSPIPSHSVTA 325

Query: 239 PL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYE 294
           PL     +D  YY+ +  I V G+ML I  + F+ D+SG GG+++DSGT VT L  + Y 
Sbjct: 326 PLLRNNQLDTFYYLGMTGIGVGGQMLSIPRSSFEVDESGNGGIIVDSGTAVTRLQSDVYN 385

Query: 295 ALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL-EKDSMFY 353
           +LRD  +   +     S       CY     S ++  PTV FHF  G  LAL  K+ +  
Sbjct: 386 SLRDSFVRGTQHLPSTSGVALFDTCYDLSSRSSVE-VPTVSFHFPDGKYLALPAKNYLIP 444

Query: 354 QPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
                 FC A  P +        S+IG + QQ   V YD+      F    C
Sbjct: 445 VDSAGTFCFAFAPTT-----SALSIIGNVQQQGTRVSYDLSNSLVGFSPNGC 491


>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
          Length = 523

 Score =  134 bits (337), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 123/427 (28%), Positives = 182/427 (42%), Gaps = 45/427 (10%)

Query: 1   LIHRDSILSPYNNPNENPAH-----RVQRGINISIARLAYLQEKIRSHNTYQAQILPSND 55
           ++HR    SP       P+H     R Q  ++ SI R+          +  +   LP++ 
Sbjct: 121 VVHRHGPCSPLLARGGEPSHAEILDRDQDRVD-SIHRMTAGPWTAGQSSASKGVSLPAHR 179

Query: 56  ---ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYA 112
              +  + + V++ +G P        DTGS L WV C PC +C  Q   +F PS+S++Y+
Sbjct: 180 GLRLGTANYIVSVGLGTPRRDLLVVFDTGSDLSWVQCKPCNNCYKQHDPLFDPSQSTTYS 239

Query: 113 YVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
            VPC ++ C       CS+ K  C Y  +Y    +T   ++ + LT      S+  +Q  
Sbjct: 240 AVPCGAQEC--LDSGTCSSGK--CRYEVVYGDMSQTDGNLARDTLTL---GPSSDQLQGF 292

Query: 173 VFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILG 227
           VFGCG      F +  G+FGLG  R SL SQ      + FSYC+ S    +     L LG
Sbjct: 293 VFGCGDDDTGLFGRADGLFGLGRDRVSLASQAAARYGAGFSYCLPSSWRAE---GYLSLG 349

Query: 228 DGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
             A       T +         YY+ L  I V GR + + P +FK      G +IDSGT 
Sbjct: 350 SAAAPPHAQFTAMVTRSDTPSFYYLDLVGIKVAGRTVRVAPAVFKAP----GTVIDSGTV 405

Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL-----CYHGIMSSDLKGFPTVRFHFR 339
           +T L   AY ALR           MR Y     L     CY     + ++  P+V   F 
Sbjct: 406 ITRLPSRAYSALRSSFA-----GFMRRYKRAPALSILDTCYDFTGRTKVQ-IPSVALLFD 459

Query: 340 GGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKT 399
           GGA L L    + Y       C+A    + N    +  ++G M Q+ + V YD+  ++  
Sbjct: 460 GGATLNLGFGGVLYVANRSQACLAF---ASNGDDTSVGILGNMQQKTFAVVYDLANQKIG 516

Query: 400 FQRMDCE 406
           F    C 
Sbjct: 517 FGAKGCS 523


>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
 gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
          Length = 398

 Score =  134 bits (337), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 107/363 (29%), Positives = 158/363 (43%), Gaps = 24/363 (6%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           +   IS+G P        DTGS L+W+ C PC+ C  Q   IF P  SSSY  + C    
Sbjct: 40  YVTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKDPIFDPEGSSSYTTMSCGDTL 99

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C   P   CS     C Y+  Y  G  T   +S+E +T  +T    +  +++ FGCG   
Sbjct: 100 CDSLPRKSCSP---DCDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAAKNIAFGCGHLN 156

Query: 181 NRNFK-FSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
             +F   SG+ GLG G  S VSQL       FSYC+    D     + +  GD +     
Sbjct: 157 RGSFNDASGLVGLGRGNLSFVSQLGDLFGHKFSYCLVPWRDAPSKTSPMFFGDESSSHSS 216

Query: 236 DA------TPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDV 285
                   TP+     ++  YY+ L+ IS+ GR L I    F  + D  GG++ DSGT +
Sbjct: 217 GKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDIKPDGSGGMIFDSGTTL 276

Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCY--HGIMSSDLKGFPTVRFHFRGGAK 343
           T L    Y+ +   +  ++   ++   S    LCY   G  +S     P + FHF  GA 
Sbjct: 277 TLLPDAPYQIVLRALRSKISFPKIDGSSAGLDLCYDVSGSKASYKMKIPAMVFHFE-GAD 335

Query: 344 LALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
             L  ++ F         + +   S N   ++  + G M QQ + V YDIG  +  +   
Sbjct: 336 YQLPVENYFIAANDAGTIVCLAMVSSN---MDIGIYGNMMQQNFRVMYDIGSSKIGWAPS 392

Query: 404 DCE 406
            C+
Sbjct: 393 QCD 395


>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
 gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
          Length = 357

 Score =  134 bits (336), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 105/357 (29%), Positives = 162/357 (45%), Gaps = 31/357 (8%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++  + +G P    +  +DTGS + W+ C PC DC  Q   IF P+ SS+YA V C S+ 
Sbjct: 20  YFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPTASSTYAPVTCQSQQ 79

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C     + C +   +C+Y   Y  G  T    +TE ++F N+      V++V  GCG   
Sbjct: 80  CSSLEMSSCRS--GQCLYQVNYGDGSYTFGDFATESVSFGNSGS----VKNVALGCGHDN 133

Query: 181 NRNFKFSGIFGLGIGRS-SLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
              F  +       G   SL +QL  +SFSYC+  ++      + L      +  +    
Sbjct: 134 EGLFVGAAGLLGLGGGPLSLTNQLKATSFSYCL--VNRDSAGSSTLDFNSAQLGVDSVTA 191

Query: 239 PL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYE 294
           PL   + ID  YY+ L  +SV G+M+ I  + F+ D+SG GG+++D GT +T L  +AY 
Sbjct: 192 PLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIVDCGTAITRLQTQAYN 251

Query: 295 ALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG-----FPTVRFHFRGGAKLAL-EK 348
            LRD  +   +  ++ S       CY      DL G      PTV FHF  G    L   
Sbjct: 252 PLRDAFVRMTQNLKLTSAVALFDTCY------DLSGQASVRVPTVSFHFADGKSWNLPAA 305

Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           + +        +C A  P +      + S+IG + QQ   V +D+   +  F    C
Sbjct: 306 NYLIPVDSAGTYCFAFAPTT-----SSLSIIGNVQQQGTRVTFDLANNRMGFSPNKC 357


>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 486

 Score =  134 bits (336), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 111/367 (30%), Positives = 163/367 (44%), Gaps = 23/367 (6%)

Query: 46  YQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYP 105
           +++ I+      +  ++  + IG+PP P +  +DTGS + WV C PC +C  Q   IF P
Sbjct: 136 FESPIVSGASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPIFEP 195

Query: 106 SRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES 165
           + S+S+  + C++E C+    + C      C+Y   Y  G  T     TE +T  +T   
Sbjct: 196 TSSASFTSLSCETEQCKSLDVSEC--RNGTCLYEVSYGDGSYTVGDFVTETVTLGST--- 250

Query: 166 TIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNK 223
              + ++  GCG +    F   +G+ GLG G  S  SQLN SSFSYC   L D D     
Sbjct: 251 --SLGNIAIGCGHNNEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYC---LVDRDSDSTS 305

Query: 224 LILGDGAIIDEGDATPLQF---IDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMI 279
            +  +  I  +    PL     +D  +Y+ L  +SV G +L I    F+  +D  GG+++
Sbjct: 306 TLDFNSPITPDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGIIV 365

Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFR 339
           DSGT VT L    Y  LRD  +      Q          CY  + S      PTV FHF 
Sbjct: 366 DSGTAVTRLQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYD-LSSKSRVEVPTVSFHFA 424

Query: 340 GGAKLAL-EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQK 398
            G +L L  K+ +        FC A  P          S++G   QQ   VG+D+     
Sbjct: 425 NGNELPLPAKNYLIPVDSEGTFCFAFAPTD-----STLSILGNAQQQGTRVGFDLANSLV 479

Query: 399 TFQRMDC 405
            F    C
Sbjct: 480 GFSPNKC 486


>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score =  134 bits (336), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 118/389 (30%), Positives = 172/389 (44%), Gaps = 44/389 (11%)

Query: 41  RSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG 100
           RS   +   ++      +  +++ + +G P    +  +DTGS ++W+ C PC+ C  Q  
Sbjct: 118 RSAGGFSGAVISGLSQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQSD 177

Query: 101 TIFYPSRSSSYAYVPCDSEHCRYF-PYARCSAYKHR-CIYTQLYLIGPETSVFVSTEQLT 158
            IF P +S ++A VPC S  CR     + C   + + C+Y   Y  G  T    STE LT
Sbjct: 178 VIFDPKKSKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLT 237

Query: 159 FKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGR-----SSLVSQLNSSFSYCI-- 211
           F         V  V  GCG      F  +              S   S+ N  FSYC+  
Sbjct: 238 FHGA-----RVDHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKSRYNGKFSYCLVD 292

Query: 212 -GSLHDPDYLHNKLILGDGAIIDEGDATPLQF---IDGHYYITLEAISVDG-RMLDINPN 266
             S        + ++ G+ A+      TPL     +D  YY+ L  ISV G R+  ++ +
Sbjct: 293 RTSSGSSSKPPSTIVFGNDAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSES 352

Query: 267 IFKRDDSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMR---SYSWPDKLCYHG 322
            FK D +G GGV+IDSGT VT L + AY ALRD    RL   +++   SYS  D  C+  
Sbjct: 353 QFKLDATGNGGVIIDSGTSVTRLTQSAYVALRDA--FRLGATKLKRAPSYSLFDT-CF-- 407

Query: 323 IMSSDLKGF-----PTVRFHFRGGAKLALEKDSMFYQPRPDA-FCMAVNPASINNRYVNF 376
               DL G      PTV FHF GG +++L   +       +  FC A           + 
Sbjct: 408 ----DLSGMTTVKVPTVVFHF-GGGEVSLPASNYLIPVNTEGRFCFA-----FAGTMGSL 457

Query: 377 SLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           S+IG + QQ + V YD+   +  F    C
Sbjct: 458 SIIGNIQQQGFRVAYDLVGSRVGFLSRAC 486


>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 479

 Score =  133 bits (335), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 120/415 (28%), Positives = 181/415 (43%), Gaps = 47/415 (11%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSH-------NTYQAQILPS 53
           ++HRD +   + N +++  HR+   +     R+A L  ++ S        + +   ++  
Sbjct: 76  VVHRDQL--SFGNSDDH-RHRLDGRLKRDAKRVASLIRRLSSGGGGSYRVDDFGTDVISG 132

Query: 54  NDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAY 113
            +  +  ++V I +G PP  Q+  +D+GS ++WV C PC  C  Q   +F P+ S+S+  
Sbjct: 133 MEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADSASFTG 192

Query: 114 VPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
           V C S  C     A C A   RC Y   Y  G  T   ++ E LTF  T      V+ V 
Sbjct: 193 VSCSSSVCDRLENAGCHA--GRCRYEVSYGDGSYTKGTLALETLTFGRT-----MVRSVA 245

Query: 174 FGCGFSTNRNFKFSGIFGLGIGRS-SLVSQL----NSSFSYCIGSLHDPDYLHNKLILGD 228
            GCG      F  +       G S S V QL      +FSYC+ S          L+ G 
Sbjct: 246 IGCGHRNRGMFVGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTDS--SGSLVFGR 303

Query: 229 GAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTD 284
            A+       PL         YYI L  + V G  + I+  +F+  + G GGV++D+GT 
Sbjct: 304 EALPAGAAWVPLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMDTGTA 363

Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----PTVRFHFR 339
           VT L   AY+A RD  + +       +       CY      DL GF     PTV F+F 
Sbjct: 364 VTRLPTLAYQAFRDAFLAQTANLPRATGVAIFDTCY------DLLGFVSVRVPTVSFYFS 417

Query: 340 GGAKLALEKDSMFYQPRPDA--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
           GG  L L   + F  P  DA  FC A  P++        S++G + Q+   + +D
Sbjct: 418 GGPILTLPARN-FLIPMDDAGTFCFAFAPST-----SGLSILGNIQQEGIQISFD 466


>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
 gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
          Length = 484

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 110/354 (31%), Positives = 167/354 (47%), Gaps = 23/354 (6%)

Query: 47  QAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPS 106
           Q+ I+      +  ++  + IG+PP   +  +DTGS + WV C PC DC  Q   IF P+
Sbjct: 135 QSPIISGTSQGSGEYFSRVGIGKPPSQAYLILDTGSDVNWVQCAPCADCYQQADPIFEPA 194

Query: 107 RSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDEST 166
            S+S++ + C++  CR    + C      C+Y   Y  G  T     TE +T       +
Sbjct: 195 SSASFSTLSCNTRQCRSLDVSEC--RNDTCLYEVSYGDGSYTVGDFVTETITL-----GS 247

Query: 167 IHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKL 224
             V +V  GCG +    F   +G+ GLG G  S  SQ+N +SFSYC   L D D      
Sbjct: 248 APVDNVAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINATSFSYC---LVDRDSESAST 304

Query: 225 ILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMID 280
           +  +  +     + PL     +D  YY+ L  +SV G ++ I  + F+ D+SG GGV++D
Sbjct: 305 LEFNSTLPPNAVSAPLLRNHHLDTFYYVGLTGLSVGGELVSIPESAFQIDESGNGGVIVD 364

Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRG 340
           SGT +T L  + Y +LRD  + R       +       CY      +++  PTV FHF  
Sbjct: 365 SGTAITRLQTDVYNSLRDAFVKRTRDLPSTNGIALFDTCYDLSSKGNVE-VPTVSFHFPD 423

Query: 341 GAKLAL-EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
           G +L L  K+ +        FC A  P +      + S+IG + QQ   V YD+
Sbjct: 424 GKELPLPAKNYLVPLDSEGTFCFAFAPTA-----SSLSIIGNVQQQGTRVVYDL 472


>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
 gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
          Length = 423

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 126/431 (29%), Positives = 187/431 (43%), Gaps = 62/431 (14%)

Query: 2   IHRDSILSPYNNPN-------ENPAHR-------VQRGINISIARL--AYLQEKIRSHNT 45
           +HRDS  SPY   N        N  HR       +   I++ +A +  + L   +++ N 
Sbjct: 1   MHRDSADSPYRPANATVHGLVRNRLHRDELRLLSISSRISLGVAGIPKSSLTNPLKNTNP 60

Query: 46  YQAQILPS------NDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQL 99
           +  Q   +      +D S   ++V++ +G PP       DTGS +LW+ C PC+ C  Q 
Sbjct: 61  FLQQDFETPLRSGLSDGSGE-YFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCYGQT 119

Query: 100 GTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTF 159
             +F PS SS++  + C S  C+      C   +++C+Y   Y  G  T    STE L+F
Sbjct: 120 DPLFNPSFSSTFQSITCGSSLCQQLLIRGC--RRNQCLYQVSYGDGSFTVGEFSTETLSF 177

Query: 160 KNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRS----SLVSQL-NSSFSYCIGSL 214
            +       V  V  GCG +    F  +              S V QL  S FSYC+ + 
Sbjct: 178 GSN-----AVNSVAIGCGHNNQGLFTGAAGLLGLGKGLLSFPSQVGQLYGSVFSYCLPTR 232

Query: 215 HDPDYLHNKLILGDGAIIDEGDATPLQF---IDGHYYITLEAISVDGRMLDINPNIFKRD 271
                +   LI G+ A+      T L     +D  YY+ +  I V G  ++I       D
Sbjct: 233 ESTGSV--PLIFGNQAVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVNIPAGSLSLD 290

Query: 272 DS--GGGVMIDSGTDVTWLVKEAYEALRDEVMIRL--EGEQMRSYSWPDKLCYHGIMSSD 327
            S   GGV++DSGT VT LV  AY  +RD     +  + +    +S  D  CY      D
Sbjct: 291 SSTGNGGVILDSGTAVTRLVTSAYNPMRDAFRAGMPSDAKMTSGFSLFDT-CY------D 343

Query: 328 LKG-----FPTVRFHFRGGAKLAL-EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGM 381
           L G      P V F F GGA +AL  ++ M        +C+A  P S      NFS+IG 
Sbjct: 344 LSGRSSIMLPAVSFVFNGGATMALPAQNIMVPVDNSGTYCLAFAPNS-----ENFSIIGN 398

Query: 382 MAQQFYNVGYD 392
           + QQ + + +D
Sbjct: 399 IQQQSFRMSFD 409


>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
          Length = 350

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 114/359 (31%), Positives = 158/359 (44%), Gaps = 29/359 (8%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++  I IG P   Q+  +DTGS ++W+ C PCR+C  Q   IF PS S S++ V CDS  
Sbjct: 8   YFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQADPIFNPSSSVSFSTVGCDSAV 67

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C       C  +   C+Y   Y  G  T    +TE LTF  T      +Q+V  GCG   
Sbjct: 68  CSQLDANDC--HGGGCLYEVSYGDGSYTVGSYATETLTFGTT-----SIQNVAIGCGHDN 120

Query: 181 NRNFKFSGIFGLGIGRS-----SLVSQLNSSFSYCIGSLHDPDYLHN-KLILGDGAIIDE 234
              F  +         S      L +Q   +FSYC   L D D   +  L  G  ++   
Sbjct: 121 VGLFVGAAGLLGLGAGSLSFPAQLGTQTGRAFSYC---LVDRDSESSGTLEFGPESVPIG 177

Query: 235 GDATPL---QFIDGHYYITLEAISVDGRMLDINPN-IFKRDDSG--GGVMIDSGTDVTWL 288
              TPL    F+   YY+++ AISV G +LD  P+  F+ D++   GG++IDSGT VT L
Sbjct: 178 SIFTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGIIIDSGTAVTRL 237

Query: 289 VKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL-E 347
              AY+ALRD  +   +             CY  + +      P V FHF  GA   L  
Sbjct: 238 QTSAYDALRDAFIAGTQHLPRADGISIFDTCYD-LSALQSVSIPAVGFHFSNGAGFILPA 296

Query: 348 KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
           K+ +        FC A  PA       N S++G + QQ   V +D       F    C+
Sbjct: 297 KNCLIPMDSMGTFCFAFAPAD-----SNLSIMGNIQQQGIRVSFDSANSLVGFAIDQCQ 350


>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
          Length = 435

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 109/397 (27%), Positives = 178/397 (44%), Gaps = 38/397 (9%)

Query: 32  RLAYLQEKIRS------HNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLL 85
           R+A+L +   +      +++   Q L  N +    + +NIS+G P +      DTGS L+
Sbjct: 53  RIAFLSDATAAGKATTTNSSVSFQALLENGVGG--YNMNISVGTPLLTFPVVADTGSDLI 110

Query: 86  WVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIG 145
           W  C PC  C  Q    F P+ SS+++ +PC S  C++ P +  +     C+Y   Y  G
Sbjct: 111 WTQCAPCTKCFQQPAPPFQPASSSTFSKLPCTSSFCQFLPNSIRTCNATGCVYNYKYGSG 170

Query: 146 PETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN- 204
             T+ +++TE  T K  D S      V FGC          SGI GLG G  SL+ QL  
Sbjct: 171 -YTAGYLATE--TLKVGDAS---FPSVAFGCSTENGVGNSTSGIAGLGRGALSLIPQLGV 224

Query: 205 SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFID------GHYYITLEAISVDG 258
             FSYC+ S        + ++ G  A + +G+     F++       +YY+ L  I+V  
Sbjct: 225 GRFSYCLRSGSAAG--ASPILFGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGE 282

Query: 259 RMLDINPNI--FKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD 316
             L +  +   F ++  GGG ++DSGT +T+L K+ YE ++   + +       + +   
Sbjct: 283 TDLPVTTSTFGFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTANVTTVNGTRGL 342

Query: 317 KLCYHGIMSSDLKGFPTVRFHFRGGAKLA-------LEKDSMFYQPRPDAFCMAVNPASI 369
            LC+           P++   F GGA+ A       +E DS   Q      C+ + PA  
Sbjct: 343 DLCFKSTGGGGGIAVPSLVLRFDGGAEYAVPTYFAGVETDS---QGSVTVACLMMLPAKG 399

Query: 370 NNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
           +      S+IG + Q   ++ YD+     +F   DC 
Sbjct: 400 DQP---MSVIGNVMQMDMHLLYDLDGGIFSFSPADCA 433


>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
 gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
          Length = 489

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 109/367 (29%), Positives = 156/367 (42%), Gaps = 47/367 (12%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++  + +G PP   +  +DTGS ++W+ C PCR C  Q   +F P +S S++ + C S  
Sbjct: 147 YFTRLGVGTPPKYVYMVLDTGSDVVWIQCAPCRKCYSQTDPVFDPKKSGSFSSISCRSPL 206

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C       C++ +  C+Y   Y  G  T    STE LTF+ T      V  V  GCG   
Sbjct: 207 CLRLDSPGCNS-RQSCLYQVAYGDGSFTFGEFSTETLTFRGT-----RVPKVALGCGHDN 260

Query: 181 NRNF--------------KFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLIL 226
              F               F    GL  GR          FSYC+          + ++ 
Sbjct: 261 EGLFVGAAGLLGLGRGRLSFPTQTGLRFGR---------KFSYCLVD-RSASSKPSSVVF 310

Query: 227 GDGAIIDEGDATPL---QFIDGHYYITLEAISVDG-RMLDINPNIFKRDDSG-GGVMIDS 281
           G  A+      TPL     +D  YY+ L  ISV G R+  I  ++FK D +G GGV+IDS
Sbjct: 311 GQSAVSRTAVFTPLITNPKLDTFYYLELTGISVGGARVAGITASLFKLDTAGNGGVIIDS 370

Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMR---SYSWPDKLCYHGIMSSDLKGFPTVRFHF 338
           GT VT L + AY +LRD    R     ++    YS  D  C+     +++K  PTV  HF
Sbjct: 371 GTSVTRLTRRAYVSLRDA--FRAGAADLKRAPDYSLFDT-CFDLSGKTEVK-VPTVVMHF 426

Query: 339 RGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQK 398
           RG        + +        FC A             S+IG + QQ + V +D+   + 
Sbjct: 427 RGADVSLPATNYLIPVDTNGVFCFA-----FAGTMSGLSIIGNIQQQGFRVVFDVAASRI 481

Query: 399 TFQRMDC 405
            F    C
Sbjct: 482 GFAARGC 488


>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
 gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
          Length = 423

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 126/431 (29%), Positives = 186/431 (43%), Gaps = 62/431 (14%)

Query: 2   IHRDSILSPYNNPN-------ENPAHR-------VQRGINISIARL--AYLQEKIRSHNT 45
           +HRDS  SPY   N        N  HR       +   I++ +A +  + L   +++ N 
Sbjct: 1   MHRDSADSPYRPANATVHGLVRNRLHRDELRLLSISSRISLGVAGIPKSSLTNPLKNTNP 60

Query: 46  YQAQILPS------NDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQL 99
           +  Q   +      +D S   ++V++ +G PP       DTGS +LW+ C PC+ C  Q 
Sbjct: 61  FLQQDFETPLRSGLSDGSGE-YFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCYGQT 119

Query: 100 GTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTF 159
             +F PS SS++  + C S  C+      C   +++C+Y   Y  G  T    STE L+F
Sbjct: 120 DPLFNPSFSSTFQSITCGSSLCQQLLIRGC--RRNQCLYQVSYGDGSFTVGEFSTETLSF 177

Query: 160 KNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRS----SLVSQL-NSSFSYCIGSL 214
            +       V  V  GCG +    F  +              S V QL  S FSYC+ + 
Sbjct: 178 GSN-----AVNSVAIGCGHNNQGLFTGAAGLLGLGKGLLSFPSQVGQLYGSVFSYCLPTR 232

Query: 215 HDPDYLHNKLILGDGAIIDEGDATPLQF---IDGHYYITLEAISVDGRMLDINPNIFKRD 271
                +   LI G+ A+      T L     +D  YY+ +  I V G  + I       D
Sbjct: 233 ESTGSV--PLIFGNQAVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVSIPAGSLSLD 290

Query: 272 DS--GGGVMIDSGTDVTWLVKEAYEALRDEVMIRL--EGEQMRSYSWPDKLCYHGIMSSD 327
            S   GGV++DSGT VT LV  AY  +RD     +  + +    +S  D  CY      D
Sbjct: 291 SSTGNGGVILDSGTAVTRLVTSAYNPMRDAFRAGMPSDAKMTSGFSLFDT-CY------D 343

Query: 328 LKG-----FPTVRFHFRGGAKLAL-EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGM 381
           L G      P V F F GGA +AL  ++ M        +C+A  P S      NFS+IG 
Sbjct: 344 LSGRSSIMLPAVSFVFNGGATMALPAQNIMVPVDNSGTYCLAFAPNS-----ENFSIIGN 398

Query: 382 MAQQFYNVGYD 392
           + QQ + + +D
Sbjct: 399 IQQQSFRMSFD 409


>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
 gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 395

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 105/363 (28%), Positives = 164/363 (45%), Gaps = 49/363 (13%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + + + IG PP      +DTGS  +W  C PC  C  Q   IF PS+SS++  +      
Sbjct: 65  YLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFKEI------ 118

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
                  RC  + H C Y  +Y     T   + TE +T  +T      + + + GCG   
Sbjct: 119 -------RCDTHDHSCPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVMPETIIGCG-RN 170

Query: 181 NRNFK--FSGIFGLGIGRSSLVSQLNSSF----SYCIG--SLHDPDYLHNKLILGDGAI- 231
           N  FK  F+G+ GL  G  SL++Q+   +    SYC         ++  N ++ GDG + 
Sbjct: 171 NSGFKPGFAGVVGLDRGPKSLITQMGGEYPGLMSYCFAGKGTSKINFGANAIVAGDGVVS 230

Query: 232 --IDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLV 289
             +    A P     G YY+ L+A+SV    ++     F      G ++IDSG+ +T+  
Sbjct: 231 TTVFVKTAKP-----GFYYLNLDAVSVGNTRIETVGTPFHALK--GNIVIDSGSTLTYF- 282

Query: 290 KEAYEALRDEVMIRLEGEQMRS---YSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
            E+Y  L     +R   EQ+ +   +   D LCY+   S  +  FP +  HF GGA L L
Sbjct: 283 PESYCNL-----VRKAVEQVVTAVRFPRSDILCYY---SKTIDIFPVITMHFSGGADLVL 334

Query: 347 EKDSMFYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           +K +M+        FC+A+    I N  +  ++ G  AQ  + VGYD      +F+  +C
Sbjct: 335 DKYNMYVASNTGGVFCLAI----ICNSPIEEAIFGNRAQNNFLVGYDSSSLLVSFKPTNC 390

Query: 406 EVL 408
             L
Sbjct: 391 SAL 393


>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
          Length = 443

 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 114/420 (27%), Positives = 183/420 (43%), Gaps = 59/420 (14%)

Query: 22  VQRGINISIARLAYLQEKIRSHNTYQAQILPSNDI---------SNSLFYVNISIGQPPV 72
           + R +  S AR+A LQ          A + P + I         S+  + + + IG P  
Sbjct: 50  LSRALRRSSARVATLQSL--------AALAPGDAITAARILVLASDGEYLMEMGIGTPTR 101

Query: 73  PQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAY 132
                +DTGS L+W  C PC  C  Q    F P+RS++Y  + C S  C    Y  C  Y
Sbjct: 102 YYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATYRSLGCASPACNALYYPLC--Y 159

Query: 133 KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG-FSTNRNFKFSGIFG 191
           +  C+Y   Y     T+  ++ E  TF  T+E+ + +  + FGCG  +       SG+ G
Sbjct: 160 QKVCVYQYFYGDSASTAGVLANETFTF-GTNETRVSLPGISFGCGNLNAGSLANGSGMVG 218

Query: 192 LGIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT--PLQ------- 241
            G G  SLVSQL S  FSYC+ S   P  + ++L  G  A ++  +A+  P+Q       
Sbjct: 219 FGRGSLSLVSQLGSPRFSYCLTSFLSP--VPSRLYFGVYATLNSTNASSEPVQSTPFVVN 276

Query: 242 -FIDGHYYITLEAISVDGRMLDINPNIFKRDDS--GGGVMIDSGTDVTWLVKEAYEALRD 298
             +   Y++ +  ISV G +L I+P +F  +D+   GG +IDSGT +T+L + AY+A+R 
Sbjct: 277 PALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDSGTTITYLAEPAYDAVRA 336

Query: 299 EVMIRLEGEQMR---------SYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKD 349
               ++    +           + WP                P +  HF G       ++
Sbjct: 337 AFASQITLPLLNVTDASVLDTCFQWPPP-------PRQSVTLPQLVLHFDGADWELPLQN 389

Query: 350 SMFYQPRP-DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
            M   P      C+A+  +S  +   ++        Q +NV YD+     +F    C ++
Sbjct: 390 YMLVDPSTGGGLCLAMASSSDGSIIGSYQ------HQNFNVLYDLENSLMSFVPAPCHLM 443


>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
          Length = 389

 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 105/363 (28%), Positives = 164/363 (45%), Gaps = 49/363 (13%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + + + IG PP      +DTGS  +W  C PC  C  Q   IF PS+SS++  +      
Sbjct: 59  YLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFKEI------ 112

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
                  RC  + H C Y  +Y     T   + TE +T  +T      + + + GCG   
Sbjct: 113 -------RCDTHDHSCPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVMPETIIGCG-RN 164

Query: 181 NRNFK--FSGIFGLGIGRSSLVSQLNSSF----SYCIG--SLHDPDYLHNKLILGDGAI- 231
           N  FK  F+G+ GL  G  SL++Q+   +    SYC         ++  N ++ GDG + 
Sbjct: 165 NSGFKPGFAGVVGLDRGPKSLITQMGGEYPGLMSYCFAGKGTSKINFGANAIVAGDGVVS 224

Query: 232 --IDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLV 289
             +    A P     G YY+ L+A+SV    ++     F      G ++IDSG+ +T+  
Sbjct: 225 TTVFVKTAKP-----GFYYLNLDAVSVGNTRIETVGTPFHALK--GNIVIDSGSTLTYF- 276

Query: 290 KEAYEALRDEVMIRLEGEQMRS---YSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
            E+Y  L     +R   EQ+ +   +   D LCY+   S  +  FP +  HF GGA L L
Sbjct: 277 PESYCNL-----VRKAVEQVVTAVRFPRSDILCYY---SKTIDIFPVITMHFSGGADLVL 328

Query: 347 EKDSMFYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           +K +M+        FC+A+    I N  +  ++ G  AQ  + VGYD      +F+  +C
Sbjct: 329 DKYNMYVASNTGGVFCLAI----ICNSPIEEAIFGNRAQNNFLVGYDSSSLLVSFKPTNC 384

Query: 406 EVL 408
             L
Sbjct: 385 SAL 387


>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
          Length = 538

 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 122/437 (27%), Positives = 187/437 (42%), Gaps = 48/437 (10%)

Query: 1   LIHRDSIL-SPYNNPNENPAHRVQRGINISIARLAYLQEKIR-----------SHNT--- 45
           ++HRDS+L     N   +   R++  +     R+  L+++I            SH     
Sbjct: 118 VVHRDSLLVKDAANATASYERRLEETLRRDARRVRGLEQRIEKRLRLNKDPAGSHENVAE 177

Query: 46  ----YQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGT 101
               +  +++      +  ++  I +G P   Q+  +DTGS ++W+ C PC  C  Q+  
Sbjct: 178 VAAEFGGEVVSGMAQGSGEYFTRIGVGTPMREQYMVLDTGSDVVWIQCEPCSKCYSQVDP 237

Query: 102 IFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKN 161
           IF PS S+S++ + C+S  C Y     C  +   C+Y   Y  G  T    +TE LTF  
Sbjct: 238 IFNPSLSASFSTLGCNSAVCSYLDAYNC--HGGGCLYKVSYGDGSYTIGSFATEMLTFGT 295

Query: 162 TDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGR-----SSLVSQLNSSFSYCIGSLHD 216
           T      V++V  GCG      F  +              S L +Q   +FSYC+     
Sbjct: 296 TS-----VRNVAIGCGHDNAGLFVGAAGLLGLGAGLLSFPSQLGTQTGRAFSYCLVDRFS 350

Query: 217 PDYLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLD-INPNIFKRDD 272
                  L  G  ++      TPL     +   YY+ L +ISV G +LD + P++F+ D+
Sbjct: 351 ES--SGTLEFGPESVPLGSILTPLLTNPSLPTFYYVPLISISVGGALLDSVPPDVFRIDE 408

Query: 273 SG--GGVMIDSGTDVTWLVKEAYEALRDE-VMIRLEGEQMRSYSWPDKLCYHGIMSSDLK 329
           +   GG ++DSGT VT L    Y+A+RD  V    +  +    S  D  CY  +    L 
Sbjct: 409 TSGRGGFIVDSGTAVTRLQTPVYDAVRDAFVAGTRQLPKAEGVSIFDT-CYD-LSGLPLV 466

Query: 330 GFPTVRFHFRGGAKLAL-EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYN 388
             PTV FHF  GA L L  K+ M        FC A  PA+      + S++G + QQ   
Sbjct: 467 NVPTVVFHFSNGASLILPAKNYMIPMDFMGTFCFAFAPAT-----SDLSIMGNIQQQGIR 521

Query: 389 VGYDIGRKQKTFQRMDC 405
           V +D       F    C
Sbjct: 522 VSFDTANSLVGFALRQC 538


>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
 gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 483

 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 116/389 (29%), Positives = 172/389 (44%), Gaps = 44/389 (11%)

Query: 41  RSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG 100
           R+   +   ++      +  +++ + +G P    +  +DTGS ++W+ C PC+ C  Q  
Sbjct: 115 RTAGGFSGAVISGLSQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTD 174

Query: 101 TIFYPSRSSSYAYVPCDSEHCRYF-PYARCSAYKHR-CIYTQLYLIGPETSVFVSTEQLT 158
            IF P +S ++A VPC S  CR     + C   + + C+Y   Y  G  T    STE LT
Sbjct: 175 AIFDPKKSKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLT 234

Query: 159 FKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGR-----SSLVSQLNSSFSYCI-- 211
           F         V  V  GCG      F  +              S   ++ N  FSYC+  
Sbjct: 235 FHGA-----RVDHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVD 289

Query: 212 -GSLHDPDYLHNKLILGDGAIIDEGDATPLQF---IDGHYYITLEAISVDG-RMLDINPN 266
             S        + ++ G+ A+      TPL     +D  YY+ L  ISV G R+  ++ +
Sbjct: 290 RTSSGSSSKPPSTIVFGNAAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSES 349

Query: 267 IFKRDDSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMR---SYSWPDKLCYHG 322
            FK D +G GGV+IDSGT VT L + AY ALRD    RL   +++   SYS  D  C+  
Sbjct: 350 QFKLDATGNGGVIIDSGTSVTRLTQPAYVALRDA--FRLGATKLKRAPSYSLFDT-CF-- 404

Query: 323 IMSSDLKGF-----PTVRFHFRGGAKLALEKDSMFYQPRPDA-FCMAVNPASINNRYVNF 376
               DL G      PTV FHF GG +++L   +       +  FC A           + 
Sbjct: 405 ----DLSGMTTVKVPTVVFHF-GGGEVSLPASNYLIPVNTEGRFCFA-----FAGTMGSL 454

Query: 377 SLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           S+IG + QQ + V YD+   +  F    C
Sbjct: 455 SIIGNIQQQGFRVAYDLVGSRVGFLSRAC 483


>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 392

 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 101/360 (28%), Positives = 164/360 (45%), Gaps = 38/360 (10%)

Query: 59  SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
           +++ + + +G PP      +DTGS L+W  C PC +C  Q   IF PS SS++    C+ 
Sbjct: 59  NIYLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFKEKRCNG 118

Query: 119 EHCRY-FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
             C Y   YA  +  K                  ++TE +T  +T      + +   GCG
Sbjct: 119 NSCHYKIIYADTTYSKGT----------------LATETVTIHSTSGEPFVMPETTIGCG 162

Query: 178 FSTNRNFK--FSGIFGLGIGRSSLVSQLNSSF----SYCIGS--LHDPDYLHNKLILGDG 229
            +++  FK  FSG+ GL  G SSL++Q+   +    SYC  S      ++  N ++ GDG
Sbjct: 163 HNSSW-FKPTFSGMVGLSWGPSSLITQMGGEYPGLMSYCFASQGTSKINFGTNAIVAGDG 221

Query: 230 AIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLV 289
            +      T  +   G YY+ L+A+SV    ++     F   +  G ++IDSGT +T+  
Sbjct: 222 VVSTTMFLTTAK--PGLYYLNLDAVSVGDTHVETMGTTFHALE--GNIIIDSGTTLTYFP 277

Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKD 349
                 +R+ V   +   +    +  D LCY+   +  +  FP +  HF GGA L L+K 
Sbjct: 278 VSYCNLVREAVDHYVTAVRTADPTGNDMLCYY---TDTIDIFPVITMHFSGGADLVLDKY 334

Query: 350 SMFYQP-RPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
           +M+ +      FC+A+    I N     ++ G  AQ  + VGYD      +F   +C  L
Sbjct: 335 NMYIETITRGTFCLAI----ICNNPPQDAIFGNRAQNNFLVGYDSSSLLVSFSPTNCSAL 390


>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
          Length = 412

 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 109/360 (30%), Positives = 157/360 (43%), Gaps = 43/360 (11%)

Query: 57  SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC-YPCRDCSPQLGTIFYPSRSSSYAYVP 115
           S + + V+I+IG PP+P    +DTGS L+W  C  PCR C PQ   ++ P+RS++YA V 
Sbjct: 88  STATYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVS 147

Query: 116 CDSEHCRYF--PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
           C S  C+    P++RCS     C Y   Y  G  T   ++TE  T      S   V+ V 
Sbjct: 148 CRSPMCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLG----SDTAVRGVA 203

Query: 174 FGCGFST-NRNFKFSGIFGLGIGRSSLVSQLNSSFSY--CIGSLHDPDYLHNKLILGDGA 230
           FGCG          SG+ G+G G  SLVSQL  +     C                    
Sbjct: 204 FGCGTENLGSTDNSSGLVGMGRGPLSLVSQLGVTRPRRSCRARAAARGGGAPT------- 256

Query: 231 IIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLV 289
                  +P           LE I+V   +L I+P +F+    G GGV+IDSGT  T L 
Sbjct: 257 -----TTSP-----------LEGITVGDTLLPIDPAVFRLTPMGDGGVIIDSGTTFTALE 300

Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKD 349
           + A+ AL   +  R+             LC+    S +    P +  HF  GA + L ++
Sbjct: 301 ERAFVALARALASRVRLPLASGAHLGLSLCFAA-ASPEAVEVPRLVLHFD-GADMELRRE 358

Query: 350 SMFYQPRPDAF-CMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
           S   + R     C+ +  A         S++G M QQ  ++ YD+ R   +F+   C  L
Sbjct: 359 SYVVEDRSAGVACLGMVSAR------GMSVLGSMQQQNTHILYDLERGILSFEPAKCGEL 412


>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 471

 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 127/417 (30%), Positives = 184/417 (44%), Gaps = 36/417 (8%)

Query: 8   LSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNT------YQAQILPSNDISNSLF 61
           LS    P E    R+QR   I + +L+ L    R+ +       + + ++      +  +
Sbjct: 71  LSFNRTPEELFHLRLQRDA-IRVKKLSSLGATSRNLSKPGGTTGFSSSVISGLAQGSGEY 129

Query: 62  YVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHC 121
           +  I +G PP   +  +DTGS ++W+ C PC++C  Q   +F P +S S+A V C +  C
Sbjct: 130 FTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCRTPLC 189

Query: 122 RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTN 181
           R      C+  +  C+Y   Y  G  T+    TE LTF+ T      V+ V  GCG    
Sbjct: 190 RRLESPGCNQ-RQTCLYQVSYGDGSYTTGEFVTETLTFRRT-----KVEQVALGCGHDNE 243

Query: 182 RNF-KFSGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGD 236
             F   +G+ GLG G  S  SQ     N  FSYC+          + ++ G+ A+     
Sbjct: 244 GLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCLVD-RSASSKPSSVVFGNSAVSRTAR 302

Query: 237 ATPLQF---IDGHYYITLEAISVDGRMLD-INPNIFKRDDSG-GGVMIDSGTDVTWLVKE 291
            TPL     +D  YY+ L  ISV G  +  I  + FK D +G GGV+ID GT VT L K 
Sbjct: 303 FTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTSVTRLNKP 362

Query: 292 AYEALRDEVMIRLEGEQMRS---YSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEK 348
           AY ALRD    R     ++S   +S  D  CY     + +K  PTV  HFRG        
Sbjct: 363 AYIALRDA--FRAGASSLKSAPEFSLFDT-CYDLSGKTTVK-VPTVVLHFRGADVSLPAS 418

Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           + +        FC A             S+IG + QQ + V YD+   +  F    C
Sbjct: 419 NYLIPVDGSGRFCFA-----FAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGC 470


>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
 gi|224034427|gb|ACN36289.1| unknown [Zea mays]
          Length = 443

 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 116/423 (27%), Positives = 184/423 (43%), Gaps = 65/423 (15%)

Query: 22  VQRGINISIARLAYLQEKIRSHNTYQAQILPSNDI---------SNSLFYVNISIGQPPV 72
           + R +  S AR+A LQ          A + P + I         S+  + + + IG P  
Sbjct: 50  LSRALRRSSARVATLQSL--------AALAPGDAITAARILVLASDGEYLMEMGIGTPTR 101

Query: 73  PQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAY 132
                +DTGS L+W  C PC  C  Q    F P+RS++Y  + C S  C    Y  C  Y
Sbjct: 102 YYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATYRSLGCASPACNALYYPLC--Y 159

Query: 133 KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF----SG 188
           +  C+Y   Y     T+  ++ E  TF  T+E+ + +  + FGCG   N N       SG
Sbjct: 160 QKVCVYQYFYGDSASTAGVLANETFTF-GTNETRVSLPGISFGCG---NLNAGLLANGSG 215

Query: 189 IFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT--PLQ---- 241
           + G G G  SLVSQL S  FSYC+ S   P  + ++L  G  A ++  +A+  P+Q    
Sbjct: 216 MVGFGRGSLSLVSQLGSPRFSYCLTSFLSP--VPSRLYFGVYATLNSTNASSEPVQSTPF 273

Query: 242 ----FIDGHYYITLEAISVDGRMLDINPNIFKRDDS--GGGVMIDSGTDVTWLVKEAYEA 295
                +   Y++ +  ISV G +L I+P +F  +D+   GG +IDSGT +T+L + AY+A
Sbjct: 274 VVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDSGTTITYLAEPAYDA 333

Query: 296 LRDEVMIRLEGEQMR---------SYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
           +R     ++    +           + WP                P +  HF G      
Sbjct: 334 VRAAFASQITLPLLNVTDASVLDTCFQWPPP-------PRQSVTLPQLVLHFDGADWELP 386

Query: 347 EKDSMFYQPRP-DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
            ++ M   P      C+A+  +S  +   ++        Q +NV YD+     +F    C
Sbjct: 387 LQNYMLVDPSTGGGLCLAMASSSDGSIIGSYQ------HQNFNVLYDLENSLMSFVPAPC 440

Query: 406 EVL 408
            ++
Sbjct: 441 HLM 443


>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 492

 Score =  132 bits (332), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 114/365 (31%), Positives = 162/365 (44%), Gaps = 32/365 (8%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++  I +G P  P    +DTGS ++W+ C PCR C  Q G +F P RS SY  V C +  
Sbjct: 140 YFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYEQSGQVFDPRRSRSYNAVGCAAPL 199

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           CR      C   +  C+Y   Y  G  T+   +TE LTF         V  V  GCG   
Sbjct: 200 CRRLDSGGCDLRRSACLYQVAYGDGSVTAGDFATETLTFAG----GARVARVALGCGHD- 254

Query: 181 NRNFKFSGIFGLGIGRSSL------VSQLNSSFSYCI---GSLHDPDYLHNKLILGDGAI 231
           N     +    LG+GR SL        +   SFSYC+    S  +     + +  G GA+
Sbjct: 255 NEGLFVAAAGLLGLGRGSLSFPTQISRRYGRSFSYCLVDRTSSANTASRSSTVTFGSGAV 314

Query: 232 ID--EGDATPL---QFIDGHYYITLEAISVDG-RMLDINPNIFKRDDSG--GGVMIDSGT 283
                   TP+     ++  YY+ L  ISV G R+  +  +  + D S   GGV++DSGT
Sbjct: 315 GSTVASSFTPMVKNPRMETFYYVQLIGISVGGARVPGVANSDLRLDPSSGRGGVIVDSGT 374

Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMR--SYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
            VT L + AY ALRD       G ++    +S  D  CY  +    +   PTV  HF GG
Sbjct: 375 SVTRLARPAYSALRDAFRGAAAGLRLSPGGFSLFDT-CYD-LSGRKVVKVPTVSMHFAGG 432

Query: 342 AKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
           A+ AL  ++ +        FC A   A  +      S+IG + QQ + V +D   ++  F
Sbjct: 433 AEAALPPENYLIPVDSKGTFCFAF--AGTDG---GVSIIGNIQQQGFRVVFDGDGQRVAF 487

Query: 401 QRMDC 405
               C
Sbjct: 488 TPKGC 492


>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 498

 Score =  132 bits (332), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 112/377 (29%), Positives = 177/377 (46%), Gaps = 42/377 (11%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
           L+   + +G PP      +DTGS +LW++C  C +C    G       F    SS+ A V
Sbjct: 83  LYTTKVKMGTPPREFTVQIDTGSDILWINCNTCSNCPKSSGLGIELNFFDTVGSSTAALV 142

Query: 115 PCDSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETS-VFVST----EQLTFKNTDEST 166
           PC    C        A+CS   ++C YT  Y  G  TS V+VS     + +  ++T  + 
Sbjct: 143 PCSDPMCASAIQGAAAQCSPQVNQCSYTFQYEDGSGTSGVYVSDAMYFDMILGQSTPANV 202

Query: 167 IHVQDVVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLH 215
                +VFGC     G  T  +    GI G G G  S+VSQL+S       FS+C+    
Sbjct: 203 ASSATIVFGCSTYQSGDLTKTDKAVDGILGFGPGELSVVSQLSSRGITPKVFSHCLKGDG 262

Query: 216 DPDYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG 274
           +   +   L+LG+  I++     +PL     HY + L++I+V+G++L INP +F   D  
Sbjct: 263 NGGGI---LVLGE--ILEPSIVYSPLVPSQPHYNLNLQSIAVNGQVLSINPAVFATSDK- 316

Query: 275 GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTV 334
            G +IDSGT +++LV+EAY+ L + V   +  +   S+      CY  + S D   FPTV
Sbjct: 317 RGTIIDSGTTLSYLVQEAYDPLVNAVDTAVS-QFATSFISKGSQCYLVLTSID-DSFPTV 374

Query: 335 RFHFRGGAKLALEKDSMF----YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
            F+F GGA + L+         +Q     +C+              +++G +  +   V 
Sbjct: 375 SFNFEGGASMDLKPSQYLLNRGFQDGAKMWCIGFQKVQ-----EGVTILGDLVLKDKIVV 429

Query: 391 YDIGRKQKTFQRMDCEV 407
           YD+ R+Q  +   DC +
Sbjct: 430 YDLARQQIGWTNYDCSM 446


>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
 gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
          Length = 469

 Score =  132 bits (331), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 124/366 (33%), Positives = 162/366 (44%), Gaps = 39/366 (10%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD--CSPQLGTIFYPSRSSSYAYVPCDS 118
           + V +  G P VPQ   +DTGS L WV C PC    C PQ   +F PS SS+YA VPC S
Sbjct: 122 YVVTLGFGTPAVPQVLLIDTGSDLSWVQCQPCNSSTCYPQKDPVFDPSASSTYAPVPCGS 181

Query: 119 EHCRYF---PYA----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
           E CR      YA      S+    C Y   Y  G  T    STE LT   + E+   V +
Sbjct: 182 EACRDLDPDSYANGCTNSSSGASLCQYGIQYGNGDTTVGVYSTETLTL--SPEAATVVNN 239

Query: 172 VVFGCGF-STNRNFKFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLIL 226
             FGCG         F G+ GLG    SLVSQ   +    FSYC   L   +     L L
Sbjct: 240 FSFGCGLVQKGVFDLFDGLLGLGGAPESLVSQTTGTYGGAFSYC---LPAGNSTAGFLAL 296

Query: 227 GDGAIIDEGDA----TPLQFIDGHYY-ITLEAISVDGRMLDINPNIFKRDDSGGGVMIDS 281
           G  A      A    TPLQ ++  +Y + L  ISV G+ LDI P +F      GG++IDS
Sbjct: 297 GAPATGGNNTAGFQFTPLQVVETTFYLVKLTGISVGGKQLDIEPTVFA-----GGMIIDS 351

Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL--CYHGIMSSDLKGFPTVRFHFR 339
           GT VT L + AY ALR      +    +   +  + L  CY    ++++   PTV   F 
Sbjct: 352 GTIVTGLPETAYSALRTAFRSAMSAYPLLPPNDDEDLDTCYDFTGNTNVT-VPTVALTFE 410

Query: 340 GGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKT 399
           GG  + L+  S        AF    +         +  +IG + Q+ + V YD  R    
Sbjct: 411 GGVTIDLDVPSGVLLDGCLAFVAGASDG-------DTGIIGNVNQRTFEVLYDSARGHVG 463

Query: 400 FQRMDC 405
           F+   C
Sbjct: 464 FRAGAC 469


>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 472

 Score =  132 bits (331), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 118/373 (31%), Positives = 163/373 (43%), Gaps = 53/373 (14%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR--DCSPQLGTIFYPSRSSSYAYVPCDS 118
           + V + IG P V Q   +DTGS L WV C PC   DC PQ   +F PS+SS++A +PC S
Sbjct: 125 YVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNASDCYPQKDPLFDPSKSSTFATIPCAS 184

Query: 119 EHCRYFPY--------ARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
           + C+  P            S    +C Y   Y  G  T    STE L       S+  V+
Sbjct: 185 DACKQLPVDGYDNGCTNNTSGMPPQCGYAIEYGNGAITEGVYSTETLALG----SSAVVK 240

Query: 171 DVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLI 225
              FGCG   +  + KF G+ GLG    SLVSQ  S    +FSYC+  L+        L 
Sbjct: 241 SFRFGCGSDQHGPYDKFDGLLGLGGAPESLVSQTASVYGGAFSYCLPPLNSGAGF---LT 297

Query: 226 LGDGAIIDEGDA----TPLQF----IDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
           LG     +  ++    TP+      I   Y +TL  ISV G+ LDI P +F +     G 
Sbjct: 298 LGAPNSTNNSNSGFVFTPMHAFSPKIATFYVVTLTGISVGGKALDIPPAVFAK-----GN 352

Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRL-EGEQMRSYSWPDKLCY----HGIMSSDLKGFP 332
           ++DSGT +T +   AY+ALR      + E   +         CY    HG ++      P
Sbjct: 353 IVDSGTVITGIPTTAYKALRTAFRSAMAEYPLLPPADSALDTCYNFTGHGTVT-----VP 407

Query: 333 TVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
            V   F GGA + L+  S          C+A   A       +F +IG +  +   V YD
Sbjct: 408 KVALTFVGGATVDLDVPSGVLVED----CLAFADAGDG----SFGIIGNVNTRTIEVLYD 459

Query: 393 IGRKQKTFQRMDC 405
            G+    F+   C
Sbjct: 460 SGKGHLGFRAGAC 472


>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
 gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
          Length = 509

 Score =  131 bits (330), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 114/354 (32%), Positives = 152/354 (42%), Gaps = 22/354 (6%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++  + IG P    +  +DTGS + WV C PC DC  Q   +F PS S+SYA V CDS  
Sbjct: 169 YFSRVGIGSPARELYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSPR 228

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           CR    A C      C+Y   Y  G  T    +TE LT  +    +  V +V  GCG   
Sbjct: 229 CRDLDTAACRNATGACLYEVAYGDGSYTVGDFATETLTLGD----STPVTNVAIGCGHDN 284

Query: 181 NRNFKFSGIFGLGIGRS-SLVSQLN-SSFSYCIGSLHDPDYLHNKLILG-DGAIIDEGDA 237
              F  +       G   S  SQ++ S+FSYC+     P    + L  G DGA  D   A
Sbjct: 285 EGLFVGAAGLLALGGGPLSFPSQISASTFSYCLVDRDSP--AASTLQFGADGAEADTVTA 342

Query: 238 TPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDS--GGGVMIDSGTDVTWLVKEA 292
            PL         YY+ L  ISV G+ L I  + F  D +   GGV++DSGT VT L   A
Sbjct: 343 -PLVRSPRTGTFYYVALSGISVGGQALSIPSSAFAMDATSGSGGVIVDSGTAVTRLQSSA 401

Query: 293 YEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL-EKDSM 351
           Y ALRD  +         S       CY     + ++  P V   F GG  L L  K+ +
Sbjct: 402 YAALRDAFVRGTPSLPRTSGVSLFDTCYDLSDRTSVE-VPAVSLRFEGGGALRLPAKNYL 460

Query: 352 FYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
                   +C+A  P +        S+IG + QQ   V +D  +    F    C
Sbjct: 461 IPVDGAGTYCLAFAPTN-----AAVSIIGNVQQQGTRVSFDTAKGVVGFTPNKC 509


>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
           CELL 1-like [Cucumis sativus]
          Length = 486

 Score =  131 bits (330), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 110/367 (29%), Positives = 162/367 (44%), Gaps = 23/367 (6%)

Query: 46  YQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYP 105
           +++ I+      +  ++  + IG+PP P +  +DTGS + WV C PC +C  Q    F P
Sbjct: 136 FESPIVSGASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPXFEP 195

Query: 106 SRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES 165
           + S+S+  + C++E C+    + C      C+Y   Y  G  T     TE +T  +T   
Sbjct: 196 TSSASFTSLSCETEQCKSLDVSEC--RNGTCLYEVSYGDGSYTVGDFVTETVTLGST--- 250

Query: 166 TIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNK 223
              + ++  GCG +    F   +G+ GLG G  S  SQLN SSFSYC   L D D     
Sbjct: 251 --SLGNIAIGCGHNNEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYC---LVDRDSDSTS 305

Query: 224 LILGDGAIIDEGDATPLQF---IDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMI 279
            +  +  I  +    PL     +D  +Y+ L  +SV G +L I    F+  +D  GG+++
Sbjct: 306 TLDFNSPITPDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGIIV 365

Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFR 339
           DSGT VT L    Y  LRD  +      Q          CY  + S      PTV FHF 
Sbjct: 366 DSGTAVTRLQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYD-LSSKSRVEVPTVSFHFA 424

Query: 340 GGAKLAL-EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQK 398
            G +L L  K+ +        FC A  P          S++G   QQ   VG+D+     
Sbjct: 425 NGNELPLPAKNYLIPVDSEGTFCFAFAPTD-----STLSILGNAQQQGTRVGFDLANSLV 479

Query: 399 TFQRMDC 405
            F    C
Sbjct: 480 GFSPNKC 486


>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
           [Cucumis sativus]
          Length = 384

 Score =  131 bits (330), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 116/358 (32%), Positives = 163/358 (45%), Gaps = 29/358 (8%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++  I +G PP   +  +DTGS ++W+ C PC++C  Q   +F P +S S+A V C +  
Sbjct: 42  YFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCRTPL 101

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           CR      C+  +  C+Y   Y  G  T+    TE LTF+ T      V+ V  GCG   
Sbjct: 102 CRRLESPGCNQ-RQTCLYQVSYGDGSYTTGEFVTETLTFRRT-----KVEQVALGCGHDN 155

Query: 181 NRNF-KFSGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
              F   +G+ GLG G  S  SQ     N  FSYC+          + ++ G+ A+    
Sbjct: 156 EGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCLVD-RSASSKPSSVVFGNSAVSRTA 214

Query: 236 DATPLQF---IDGHYYITLEAISVDGRMLD-INPNIFKRDDSG-GGVMIDSGTDVTWLVK 290
             TPL     +D  YY+ L  ISV G  +  I  + FK D +G GGV+ID GT VT L K
Sbjct: 215 RFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTSVTRLNK 274

Query: 291 EAYEALRDEVMIRLEGEQMRS---YSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALE 347
            AY ALRD    R     ++S   +S  D  CY     + +K  PTV  HFRG       
Sbjct: 275 PAYIALRDA--FRAGASSLKSAPEFSLFDT-CYDLSGKTTVK-VPTVVLHFRGADVSLPA 330

Query: 348 KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
            + +        FC A    +        S+IG + QQ + V YD+   +  F    C
Sbjct: 331 SNYLIPVDGSGRFCFAFAGTT-----SGLSIIGNIQQQGFRVVYDLASSRVGFSPRGC 383


>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
          Length = 479

 Score =  131 bits (330), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 109/375 (29%), Positives = 161/375 (42%), Gaps = 43/375 (11%)

Query: 52  PSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSY 111
           P + +    + V    G P       +DTGS + W+ C PC DC  Q+  IF P +SSSY
Sbjct: 129 PGSKVGTGNYIVTAGFGTPAKNSLLIIDTGSDVTWIQCKPCSDCYSQVDPIFEPQQSSSY 188

Query: 112 AYVPCDSEHCRYFPYARCSAYKH----RCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI 167
            ++ C S  C        +   H     C+Y   Y  G  +    S E LT  +      
Sbjct: 189 KHLSCLSSAC-----TELTTMNHCRLGGCVYEINYGDGSRSQGDFSQETLTLGSDS---- 239

Query: 168 HVQDVVFGCGFSTNRNFKFS-GIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHN 222
                 FGCG +    FK S G+ GLG    S  SQ  S     FSYC+     PD++ +
Sbjct: 240 -FPSFAFGCGHTNTGLFKGSAGLLGLGRTALSFPSQTKSKYGGQFSYCL-----PDFVSS 293

Query: 223 ----KLILGDGAIIDEGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSGG 275
                  +G G+I       PL     +   Y++ L  ISV G  L I P +  R    G
Sbjct: 294 TSTGSFSVGQGSIPATATFVPLVSNSNYPSFYFVGLNGISVGGERLSIPPAVLGR----G 349

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTV 334
           G ++DSGT +T LV +AY+AL+     +       + +S  D  CY     S ++  PT+
Sbjct: 350 GTIVDSGTVITRLVPQAYDALKTSFRSKTRNLPSAKPFSILDT-CYDLSSYSQVR-IPTI 407

Query: 335 RFHFRGGAKLALEKDSMFYQPRPDA--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
            FHF+  A +A+    + +  + D    C+A   AS   + ++ ++IG   QQ   V +D
Sbjct: 408 TFHFQNNADVAVSAVGILFTIQSDGSQVCLAFASAS---QSISTNIIGNFQQQRMRVAFD 464

Query: 393 IGRKQKTFQRMDCEV 407
            G  +  F    C  
Sbjct: 465 TGAGRIGFAPGSCAT 479


>gi|15226358|ref|NP_180389.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4803959|gb|AAD29831.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252998|gb|AEC08092.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 756

 Score =  131 bits (330), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 103/374 (27%), Positives = 165/374 (44%), Gaps = 62/374 (16%)

Query: 59  SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
           S++ + + +G PP      +DTGS ++W  C PC +C  Q   IF PS+SS++    C+ 
Sbjct: 419 SIYLMKLQVGTPPFEIVAEIDTGSDIIWTQCMPCPNCYSQFAPIFDPSKSSTFREQRCNG 478

Query: 119 EHCRYFPYARCSAYKHRCIYT-QLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
             C Y             IY  + Y  G      ++TE +T  +T      + +   GCG
Sbjct: 479 NSCHY-----------EIIYADKTYSKG-----ILATETVTIPSTSGEPFVMAETKIGCG 522

Query: 178 FSTNRNFKF-------SGIFGLGIGRSSLVSQLNSSF----SYCIG--SLHDPDYLHNKL 224
              N N ++       SGI GL +G  SL+SQ++  +    SYC         ++  N +
Sbjct: 523 LD-NTNLQYSGFASSSSGIVGLNMGPLSLISQMDLPYPGLISYCFSGQGTSKINFGTNAI 581

Query: 225 ILGDGAIIDEGDATPLQFI---DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDS 281
           + GDG +  +       FI   +  YY+ L+A+SV+  ++      F  +D  G + IDS
Sbjct: 582 VAGDGTVAAD------MFIKKDNPFYYLNLDAVSVEDNLIATLGTPFHAED--GNIFIDS 633

Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
           GT +T+        +R+ V   +   ++      + LCY+   S  +  FP +  HF GG
Sbjct: 634 GTTLTYFPMSYCNLVREAVEQVVTAVKVPDMGSDNLLCYY---SDTIDIFPVITMHFSGG 690

Query: 342 AKLALEKDSMFYQPRPDA-FCMAVN------PASINNRYVNFSLIGMMAQQFYNVGYDIG 394
           A L L+K +M+ +      FC+A+       PA   NR          AQ  + VGYD  
Sbjct: 691 ADLVLDKYNMYLETITGGIFCLAIGCNDPSMPAVFGNR----------AQNNFLVGYDPS 740

Query: 395 RKQKTFQRMDCEVL 408
               +F   +C  L
Sbjct: 741 SNVISFSPTNCSAL 754



 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 107/387 (27%), Positives = 171/387 (44%), Gaps = 53/387 (13%)

Query: 22  VQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTG 81
           +QR  N S  RL+  + +++  + Y   +   N     ++ + + +G PP      +DTG
Sbjct: 50  IQRRSNSSSFRLS--KNQLQGASPYADTLFDYN-----IYLMKLQVGTPPFEIAAEIDTG 102

Query: 82  SSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQL 141
           S L+W  C PC DC  Q   IF PS+SS++    C  + C Y      + Y         
Sbjct: 103 SDLIWTQCMPCPDCYSQFDPIFDPSKSSTFNEQRCHGKSCHYEIIYEDNTYSKG------ 156

Query: 142 YLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST----NRNF--KFSGIFGLGIG 195
                     ++TE +T  +T      + +   GCG       N  F    SGI GL +G
Sbjct: 157 ---------ILATETVTIHSTSGEPFVMAETTIGCGLHNTDLDNSGFASSSSGIVGLNMG 207

Query: 196 RSSLVSQLNSSF----SYCIG--SLHDPDYLHNKLILGDGAIIDEGDATPLQFI---DGH 246
             SL+SQ++  +    SYC         ++  N ++ GDG +  +       FI   +  
Sbjct: 208 PRSLISQMDLPYPGLISYCFSGQGTSKINFGTNAIVAGDGTVAAD------MFIKKDNPF 261

Query: 247 YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEG 306
           YY+ L+A+SV+   ++     F  +D  G ++IDSG+ VT+        +R  V   +  
Sbjct: 262 YYLNLDAVSVEDNRIETLGTPFHAED--GNIVIDSGSTVTYFPVSYCNLVRKAVEQVVTA 319

Query: 307 EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDA-FCMAVN 365
            ++   S  D LCY    S  +  FP +  HF GGA L L+K +M+ +      FC+A+ 
Sbjct: 320 VRVPDPSGNDMLCY---FSETIDIFPVITMHFSGGADLVLDKYNMYMESNSGGLFCLAI- 375

Query: 366 PASINNRYVNFSLIGMMAQQFYNVGYD 392
              I N     ++ G  AQ  + VGYD
Sbjct: 376 ---ICNSPTQEAIFGNRAQNNFLVGYD 399


>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 458

 Score =  131 bits (330), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 107/434 (24%), Positives = 190/434 (43%), Gaps = 54/434 (12%)

Query: 8   LSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISI 67
           + P+  P++         ++    RL++    + +  + ++ ++      +  ++V++ +
Sbjct: 44  IKPFTTPSQ--------ALSFDSHRLSFFFSALHTPQSLKSPVVSGASTGSGQYFVDLRL 95

Query: 68  GQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQL-GTIFYPSRSSSYAYVPCDSEHCRYFPY 126
           G PP       DTGS L+WV C  CR+C+    G+ F    S++++   C    C+  P 
Sbjct: 96  GTPPQKLLLVADTGSDLVWVKCSACRNCTRHTPGSAFLARHSTTFSPNHCYDSACQLVPL 155

Query: 127 ARCSAYKHRCIYTQL---------YLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
            +     HRC + +L         Y  G +TS F S E  T   +      ++ + FGC 
Sbjct: 156 PK----HHRCNHARLHSPCRYEYSYGDGSKTSGFFSKETTTLNTSSGREAKLKGIAFGCA 211

Query: 178 FS------TNRNFKFS-GIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLIL 226
           F       +  +F  + G+ GLG G  SL SQL     + FSYC+          + L++
Sbjct: 212 FRISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHRFGNKFSYCLMDHDISPSPTSYLLI 271

Query: 227 GDGAIIDEGDATP----LQFIDGH--------YYITLEAISVDGRMLDINPNIFKRDDSG 274
           G      + D  P    ++F   H        YYI +E++SVDG  L INP+++  D+ G
Sbjct: 272 GS----TQNDVAPGKRRMRFTPLHINPLSPTFYYIGIESVSVDGIKLPINPSVWALDELG 327

Query: 275 -GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPT 333
            GG ++DSGT +T+L + AY  +   +  R+        +    LC + +   +    P 
Sbjct: 328 NGGTIVDSGTTLTFLPEPAYLQILTVIKRRVRLPSPAEPTPGFDLCVN-VSEIEHPRLPK 386

Query: 334 VRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
           + F   G +  +    + F     D  C+A+      +    FS+IG + QQ + + +D 
Sbjct: 387 LSFKLGGDSVFSPPPRNYFVDTDEDVKCLALQAVMTPS---GFSVIGNLMQQGFLLEFDK 443

Query: 394 GRKQKTFQRMDCEV 407
            R +  F R  C +
Sbjct: 444 DRTRLGFSRHGCAL 457


>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
 gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
 gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
          Length = 454

 Score =  131 bits (330), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 107/385 (27%), Positives = 170/385 (44%), Gaps = 33/385 (8%)

Query: 47  QAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ-LGTIFYP 105
           +A +     I  + + +++S+G PP P    +DTGS L+W  C PC DC  Q    +  P
Sbjct: 76  RAGLGAGGGIVTNEYLMHVSVGTPPRPVALTLDTGSDLVWTQCAPCLDCFEQGAAPVLDP 135

Query: 106 SRSSSYAYVPCDSEHCRYFPYARCSAYK---HRCIYTQLYLIGPETSVFVSTEQLTFKNT 162
           + SS++A +PCD+  CR  P+  C         C+Y   Y     T   ++T+  TF   
Sbjct: 136 AASSTHAALPCDAPLCRALPFTSCGGRSWGDRSCVYVYHYGDRSLTVGQLATDSFTFGGD 195

Query: 163 DES-TIHVQDVVFGCGFSTNRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPD 218
           D +  +  + V FGCG      F+   +GI G G GR SL SQLN +SFSYC  S+ D  
Sbjct: 196 DNAGGLAARRVTFGCGHINKGIFQANETGIAGFGRGRWSLPSQLNVTSFSYCFTSMFDTK 255

Query: 219 YLHNKLILGDGA--------IIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINP 265
              + + LG  A            GD    + I        Y++ L  ISV G  + +  
Sbjct: 256 S-SSVVTLGAAAAELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGISVGGARVAVPE 314

Query: 266 NIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMS 325
           +  +        +IDSG  +T L ++ YEA++ E + ++      + S    LC+   ++
Sbjct: 315 SRLRSS-----TIIDSGASITTLPEDVYEAVKAEFVSQVGLPAAAAGSAALDLCFALPVA 369

Query: 326 SDLK--GFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMA 383
           +  +    P +  H  GGA   L + +  ++         V  A+   + V    IG   
Sbjct: 370 ALWRRPAVPALTLHLDGGADWELPRGNYVFEDYAARVLCVVLDAAAGEQVV----IGNYQ 425

Query: 384 QQFYNVGYDIGRKQKTFQRMDCEVL 408
           QQ  +V YD+     +F    C+ L
Sbjct: 426 QQNTHVVYDLENDVLSFAPARCDKL 450


>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
          Length = 474

 Score =  131 bits (329), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 112/402 (27%), Positives = 178/402 (44%), Gaps = 35/402 (8%)

Query: 31  ARLAYLQEKIRSHNTYQAQILP---SNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWV 87
           AR      ++ S     A++ P   ++ + ++ + V+++IG PP P    +DTGS L W 
Sbjct: 78  ARSKARSARLLSGRAASARVDPGSYTDGVPDTEYLVHMAIGTPPQPVQLILDTGSDLTWT 137

Query: 88  HCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARC---SAYKHRCIYTQLYLI 144
            C PC  C  Q    F PSRS +++ +PCD   CR   ++ C   S     C+Y   Y  
Sbjct: 138 QCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLRICRDLTWSSCGEQSWGNGICVYAYAYAD 197

Query: 145 GPETSVFVSTEQLTFKNTDEST--IHVQDVVFGCGFSTNRNF--KFSGIFGLGIGRSSLV 200
              T+  + ++  +F + D +     V D+ FGCG   N  F    +GI G   G  S+ 
Sbjct: 198 HSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNNGIFVSNETGIAGFSRGALSMP 257

Query: 201 SQLN-SSFSYCI----GSLHDPDYLHNKLIL-GDGAIIDEGDATPLQFIDGH------YY 248
           +QL   +FSYC     GS   P +L     L  D A    G       I  H      YY
Sbjct: 258 AQLKVDNFSYCFTAITGSEPSPVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYY 317

Query: 249 ITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGE 307
           I+L+ ++V    L I  ++F  ++D  GG ++DSGT +T L +  Y  + D  + + +  
Sbjct: 318 ISLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLT 377

Query: 308 QMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDA----FCMA 363
              S S   +LC+  +        P +  HF  GA L L +++  ++          C+A
Sbjct: 378 VHNSTSSLSQLCF-SVPPGAKPDVPALVLHFE-GATLDLPRENYMFEIEEAGGIRLTCLA 435

Query: 364 VNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           +N         + S+IG   QQ  +V YD+     +F    C
Sbjct: 436 INAGE------DLSVIGNFQQQNMHVLYDLANDMLSFVPARC 471


>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 414

 Score =  131 bits (329), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 129/417 (30%), Positives = 195/417 (46%), Gaps = 50/417 (11%)

Query: 20  HRVQRGINISIARLAYLQEKIR----SHN--TYQAQILPSNDISNSLFYVNISIGQPPVP 73
            R+Q+ + +   R+  +Q +IR    +HN    Q QI  S+ I+       +++G     
Sbjct: 16  RRLQKQLILDDLRVRSMQNRIRRVASTHNVEASQTQIPLSSGINLQTLNYIVTMGLGSKN 75

Query: 74  QFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYAR----- 128
               +DTGS L WV C PC  C  Q G IF PS SSSY  V C+S  C+   +A      
Sbjct: 76  MTVIIDTGSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVSCNSSTCQSLQFATGNTGA 135

Query: 129 C-SAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFK-- 185
           C S+    C Y   Y  G  T+  +  E L+F       + V D VFGCG    RN K  
Sbjct: 136 CGSSNPSTCNYVVNYGDGSYTNGELGVEALSFGG-----VSVSDFVFGCG----RNNKGL 186

Query: 186 FSGIFGL-GIGRS--SLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
           F G+ GL G+GRS  SLVSQ N++    FSYC+ +          L++G+ + + + +A 
Sbjct: 187 FGGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTTEAGS--SGSLVMGNESSVFK-NAN 243

Query: 239 PLQF--------IDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVK 290
           P+ +        +   Y + L  I V G  L   P  F      GG++IDSGT +T L  
Sbjct: 244 PITYTRMLSNPQLSNFYILNLTGIDVGGVALKA-PLSFGN----GGILIDSGTVITRLPS 298

Query: 291 EAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKD 349
             Y+AL+ E + +  G      +S  D  C++ +   D    PT+   F G A+L ++  
Sbjct: 299 SVYKALKAEFLKKFTGFPSAPGFSILDT-CFN-LTGYDEVSIPTISLRFEGNAQLNVDAT 356

Query: 350 SMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
             FY  + DA  + +  AS+++ Y + ++IG   Q+   V YD  + +  F    C 
Sbjct: 357 GTFYVVKEDASQVCLALASLSDAY-DTAIIGNYQQRNQRVIYDTKQSKVGFAEEPCS 412


>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
          Length = 443

 Score =  131 bits (329), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 104/367 (28%), Positives = 165/367 (44%), Gaps = 30/367 (8%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + V +++G P  P    +DTGS L+W  C PCRDC  Q   +  P+ SS+YA +PC +  
Sbjct: 84  YLVRLAVGTPRRPVALTLDTGSDLVWTQCAPCRDCFDQDLPVLDPAASSTYAALPCGAAR 143

Query: 121 CRYFPYARCSAY---KHR-CIYTQLYLIGPETSVFVSTEQLTFKNTDES--TIHVQDVVF 174
           CR  P+  C       HR CIY   Y     T   ++T++ TF ++  S  ++H + + F
Sbjct: 144 CRALPFTSCGVRTLGNHRSCIYAYHYGDKSLTVGEIATDRFTFGDSGGSGESLHTRRLTF 203

Query: 175 GCGFSTNRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAI 231
           GCG      F+   +GI G G GR SL SQLN +SFSYC  S+ +       L     A+
Sbjct: 204 GCGHLNKGVFQSNETGIAGFGRGRWSLPSQLNVTSFSYCFTSMFESKSSLVTLGGSPAAL 263

Query: 232 ID-----EGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGT 283
                  E   TP+         Y+++L+ ISV    L +    F+        +IDSG 
Sbjct: 264 YSHAHSGEVRTTPILKNPSQPSLYFLSLKGISVGKTRLPVPETKFR------STIIDSGA 317

Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLK--GFPTVRFHFRGG 341
            +T L +E YEA++ E   ++             LC+   +++  +    P++  H  G 
Sbjct: 318 SITTLPEEVYEAVKAEFAAQVGLPPSGVEGSALDLCFALPVTALWRRPAVPSLTLHLEGA 377

Query: 342 AKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQ 401
                  + +F        C+ ++ A         ++IG   QQ  +V YD+   + +F 
Sbjct: 378 DWELPRSNYVFEDLGARVMCIVLDAAPGEQ-----TVIGNFQQQNTHVVYDLENDRLSFA 432

Query: 402 RMDCEVL 408
              C+ L
Sbjct: 433 PARCDRL 439


>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
 gi|223949441|gb|ACN28804.1| unknown [Zea mays]
          Length = 326

 Score =  131 bits (329), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 112/336 (33%), Positives = 145/336 (43%), Gaps = 20/336 (5%)

Query: 78  MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCI 137
           +DTGS + WV C PC DC  Q   +F PS S+SYA V CDS+ CR    A C      C+
Sbjct: 3   LDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSQRCRDLDTAACRNATGACL 62

Query: 138 YTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRS 197
           Y   Y  G  T    +TE LT  +    +  V +V  GCG      F  +       G  
Sbjct: 63  YEVAYGDGSYTVGDFATETLTLGD----STPVGNVAIGCGHDNEGLFVGAAGLLALGGGP 118

Query: 198 -SLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLE 252
            S  SQ++ S+FSYC+     P    + L  GDGA        PL         YY+ L 
Sbjct: 119 LSFPSQISASTFSYCLVDRDSP--AASTLQFGDGAAEAGTVTAPLVRSPRTSTFYYVALS 176

Query: 253 AISVDGRMLDINPNIFKRDDS--GGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMR 310
            ISV G+ L I  + F  D +   GGV++DSGT VT L   AY ALRD  +         
Sbjct: 177 GISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQSAAYAALRDAFVQGAPSLPRT 236

Query: 311 SYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL-EKDSMFYQPRPDAFCMAVNPASI 369
           S       CY     + ++  P V   F GG  L L  K+ +        +C+A  P   
Sbjct: 237 SGVSLFDTCYDLSDRTSVE-VPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCLAFAP--- 292

Query: 370 NNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
            N  V  S+IG + QQ   V +D  R    F    C
Sbjct: 293 TNAAV--SIIGNVQQQGTRVSFDTARGAVGFTPNKC 326


>gi|357127503|ref|XP_003565419.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 486

 Score =  130 bits (328), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 124/445 (27%), Positives = 190/445 (42%), Gaps = 53/445 (11%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNT------YQAQILPSN 54
            IHRDS+ SP+++P   P  R       S AR A L   +   ++        A ++   
Sbjct: 44  FIHRDSVKSPFHDPALTPHGRALAAARRSAARAAELHHLLARRSSGAPSPGTGAGVVAEV 103

Query: 55  DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC---RDCSPQLGTIFYPSRSSSY 111
                 + + I +G PPV      DTGS L+WV C       + +      F PS SS+Y
Sbjct: 104 VSRQFEYLMAIEVGTPPVRVLAIADTGSDLVWVKCKGKDNDNNSTAPPSVYFVPSASSTY 163

Query: 112 AYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDEST----- 166
             V CD++ CR    A   +    C Y   Y  G   S  +STE  TF    +S+     
Sbjct: 164 GRVGCDTKACRALSSAASCSPDGSCEYLYSYGDGSRASGQLSTETFTFSTIADSSKTNSH 223

Query: 167 ------------IHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FS 208
                       + +  + FGC  +T   F+  G+ GLG G  SL SQL ++      FS
Sbjct: 224 GNNNNNSSSHGQVEIAKLDFGCSTTTTGTFRADGLVGLGGGPVSLASQLGATTSLGRKFS 283

Query: 209 YCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDG----HYYITLEAISVDGRMLDIN 264
           YC+    + +   + L  G  A++ E  A     I G    +Y I L++I+V G      
Sbjct: 284 YCLAPYANTN-ASSALNFGSRAVVSEPGAASTPLITGEVETYYTIALDSINVAGT----- 337

Query: 265 PNIFKRDDSGGG--VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCY-- 320
               KR  +     +++DSGT +T+L       L  ++  R++  +  S      LCY  
Sbjct: 338 ----KRPTTAAQAHIIVDSGTTLTYLDSALLTPLVKDLTRRIKLPRAESPEKILDLCYDI 393

Query: 321 HGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIG 380
            G+   D  G P V     GG ++ L+ D+ F   +    C+A+   S      + S++G
Sbjct: 394 SGVRGEDALGIPDVTLVLGGGGEVTLKPDNTFVVVQEGVLCLALVATSERQ---SVSILG 450

Query: 381 MMAQQFYNVGYDIGRKQKTFQRMDC 405
            +AQQ  +VGYD+ +   TF   DC
Sbjct: 451 NIAQQNLHVGYDLEKGTVTFAAADC 475


>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
 gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
          Length = 507

 Score =  130 bits (328), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 135/450 (30%), Positives = 191/450 (42%), Gaps = 61/450 (13%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILP-------- 52
           L+HRDS        N   A  + R +     R A++     ++ T    ++         
Sbjct: 74  LLHRDSFAV-----NATGAELLARRLQRDELRAAWIISTAAANGTPPPDVVGLSTGRGLV 128

Query: 53  ----SNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRS 108
               S   ++  +   I++G P V    A+DT S L W+ C PCR C PQ G +F P  S
Sbjct: 129 APVVSRAPTSGDYIAKIAVGTPAVEALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHS 188

Query: 109 SSYAYVPCDSEHCRYFPYARCS-AYKHRCIYTQLYLIG---PETSVFVS---TEQLTFKN 161
           +SY  +  D+  C+    +    A +  CIYT LY  G     TS  V     E LTF  
Sbjct: 189 TSYGEMNYDAPDCQALGRSGGGDAKRGTCIYTVLYGDGDGHGSTSTSVGDLVEETLTFAG 248

Query: 162 TDESTIHVQDVVFGCGFSTNRNFK--FSGIFGLGIGRSSLVSQL-----NSSFSYC-IGS 213
                +    +  GCG      F    +GI GL  G+ S+  Q+     N+SFSYC +  
Sbjct: 249 ----GVRQAYLSIGCGHDNKGLFGAPAAGILGLSRGQISIPHQIAFLGYNASFSYCLVDF 304

Query: 214 LHDPDYLHNKLILGDGAIIDEGDA--TPL---QFIDGHYYITLEAISVDGRMLDINPNIF 268
           +  P    + L  G GA+     A  TP    Q +   YY+ L  +SV G  +   P + 
Sbjct: 305 ISGPGSPSSTLTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRV---PGVT 361

Query: 269 KRD------DSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL---C 319
           +RD         GGV++DSGT VT L + AY A RD       G    S   P  L   C
Sbjct: 362 ERDLQLDPYTGHGGVILDSGTTVTRLARPAYTAFRDAFRAAATGLGQVSTGGPSGLFDTC 421

Query: 320 YHGIMSSDLK---GFPTVRFHFRGGAKLALE-KDSMFYQPRPDAFCMAVNPASINNRYVN 375
           Y     + L+     P V  HF GG +L+L+ K+ +         C A   A   +R V 
Sbjct: 422 YTVGGRAGLRHCVKVPAVSMHFAGGVELSLQPKNYLITVDSRGTVCFAF--AGTGDRSV- 478

Query: 376 FSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
            S+IG + QQ + V YDIG ++  F    C
Sbjct: 479 -SVIGNILQQGFRVVYDIGGQRVGFAPNSC 507


>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
           Japonica Group]
          Length = 446

 Score =  130 bits (328), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 116/437 (26%), Positives = 188/437 (43%), Gaps = 55/437 (12%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           + HRD++  P   P       +++ +    AR A L   + +     + +       +  
Sbjct: 31  VFHRDALFPP--PPGAKRGSLLRQRLAADAARYASL---VDATGRLHSPVFSGIPFESGE 85

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++  + +G P       +DTGS L+W+ C PCR C  Q G +F P RSS+Y  VPC S  
Sbjct: 86  YFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRSSTYRRVPCSSPQ 145

Query: 121 CRYFPYARC---SAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
           CR   +  C    A    C Y   Y  G  ++  ++T++L F N      +V +V  GCG
Sbjct: 146 CRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGDLATDKLAFAN----DTYVNNVTLGCG 201

Query: 178 FSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAII 232
                 F   +G+ G+G G+ S+ +Q+     S F YC+G         + L+ G     
Sbjct: 202 RDNEGLFDSAAGLLGVGRGKISISTQVAPAYGSVFEYCLGDRTSRSTRSSYLVFGR---T 258

Query: 233 DEGDATPLQFIDGH------YYITLEAISVDGRMLDINPNIFKRDDSG---GGVMIDSGT 283
            E  +T    +  +      YY+ +   SV G  +    N     D+    GGV++DSGT
Sbjct: 259 PEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLALDTATGRGGVVVDSGT 318

Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL---CYHGIMSSDLKGFPT-----VR 335
            ++   ++AY ALRD    R     MR  +    +   CY      DL+G P      + 
Sbjct: 319 AISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACY------DLRGRPAASAPLIV 372

Query: 336 FHFRGGAKLALEKDSMFY-----QPRPDAF--CMAVNPASINNRYVNFSLIGMMAQQFYN 388
            HF GGA +AL  ++ F      + R  ++  C+    A         S+IG + QQ + 
Sbjct: 373 LHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADD-----GLSVIGNVQQQGFR 427

Query: 389 VGYDIGRKQKTFQRMDC 405
           V +D+ +++  F    C
Sbjct: 428 VVFDVEKERIGFAPKGC 444


>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 392

 Score =  130 bits (328), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 101/360 (28%), Positives = 163/360 (45%), Gaps = 38/360 (10%)

Query: 59  SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
           +++ + + +G PP      +DTGS L+W  C PC +C  Q   IF PS SS++    C+ 
Sbjct: 59  NIYLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFKEKRCNG 118

Query: 119 EHCRY-FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
             C Y   YA  +  K                  ++TE +T  +T      + +   GCG
Sbjct: 119 NSCHYKIIYADTTYSKGT----------------LATETVTIHSTSGEPFVMPETTIGCG 162

Query: 178 FSTNRNFK--FSGIFGLGIGRSSLVSQLNSSF----SYCIGS--LHDPDYLHNKLILGDG 229
            +++  FK  FSG+ GL  G SSL++Q+   +    SYC  S      ++  N ++ GDG
Sbjct: 163 HNSSW-FKPTFSGMVGLSWGPSSLITQMGGEYPGLMSYCFASQGTSKINFGTNAIVAGDG 221

Query: 230 AIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLV 289
            +      T  +   G YY+ L+A+SV    ++     F   +  G ++IDSGT +T+  
Sbjct: 222 VVSTTMFLTTAK--PGLYYLNLDAVSVGDTHVETMGTTFHALE--GNIIIDSGTTLTYFP 277

Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKD 349
                 +R+ V   +   +    +  D LCY+   +  +  FP +  HF GGA L L+K 
Sbjct: 278 VSYCNLVREAVDHYVTAVRTADPTGNDMLCYY---TDTIDIFPVITMHFSGGADLVLDKY 334

Query: 350 SMFYQP-RPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
           +M+ +      FC+A+    I N     ++ G  AQ  + VGYD       F   +C  L
Sbjct: 335 NMYIETITRGTFCLAI----ICNNPPQDAIFGNRAQNNFLVGYDSSSLLVFFSPTNCSAL 390


>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
 gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
          Length = 390

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 111/400 (27%), Positives = 170/400 (42%), Gaps = 40/400 (10%)

Query: 31  ARLAYLQEKIRSHN---------TYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTG 81
           ARL ++  +I+S +            AQ+     + +  ++  + IG P    +  +DTG
Sbjct: 6   ARLRWIHHRIQSSDHRHRRGRSLLQTAQVSSGLSLGSGEYFARMGIGSPQRSYYLELDTG 65

Query: 82  SSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQL 141
           S + W+ C PC  C  Q+  I+ PS SSSY  V C S  C+   Y+ C      C Y  +
Sbjct: 66  SDVTWIQCAPCSSCYSQVDPIYDPSNSSSYRRVYCGSALCQALDYSACQGMG--CSYRVV 123

Query: 142 YLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGR----- 196
           Y     +S  +  E  +F     S+  ++++ FGCG S +  F+         G      
Sbjct: 124 YGDSSASSGDLGIE--SFYLGPNSSTAMRNIAFGCGHSNSGLFRGEAGLLGMGGGTLSFF 181

Query: 197 SSLVSQLNSSFSYCIGSLHDP-DYLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLE 252
           S + + +  +FSYC+   +       + LI G  AI      TPL     ID  YY  L 
Sbjct: 182 SQIAASIGPAFSYCLVDRYSQLQSRSSPLIFGRTAIPFAARFTPLLKNPRIDTFYYAILT 241

Query: 253 AISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS 311
            ISV G  L I P  F    +G GG ++DSGT VT +V  AY  LRD             
Sbjct: 242 GISVGGTALPIPPAQFALTGNGTGGAILDSGTSVTRVVPAAYAVLRDAYRAASRNLPPAP 301

Query: 312 YSWPDKLCYHGIMSSDLKGFPTVR-----FHFRGGAKLALEKDSMFYQ-PRPDAFCMAVN 365
             +    C+      + +G PTV+      HF     + L   ++     R   FC+A  
Sbjct: 302 GVYLLDTCF------NFQGLPTVQIPSLVLHFDNDVDMVLPGGNILIPVDRSGTFCLAFA 355

Query: 366 PASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           P+S+       S+IG + QQ + +G+D+ R        +C
Sbjct: 356 PSSM-----PISVIGNVQQQTFRIGFDLQRSLIAIAPREC 390


>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
 gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
          Length = 448

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 112/402 (27%), Positives = 178/402 (44%), Gaps = 35/402 (8%)

Query: 31  ARLAYLQEKIRSHNTYQAQILP---SNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWV 87
           AR      ++ S     A++ P   ++ + ++ + V+++IG PP P    +DTGS L W 
Sbjct: 52  ARSKARSARLLSGRAASARMDPGSYTDGVPDTEYLVHMAIGTPPQPVQLILDTGSDLTWT 111

Query: 88  HCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARC---SAYKHRCIYTQLYLI 144
            C PC  C  Q    F PSRS +++ +PCD   CR   ++ C   S     C+Y   Y  
Sbjct: 112 QCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLRICRDLTWSSCGEQSWGNGICVYAYAYAD 171

Query: 145 GPETSVFVSTEQLTFKNTDEST--IHVQDVVFGCGFSTNRNF--KFSGIFGLGIGRSSLV 200
              T+  + ++  +F + D +     V D+ FGCG   N  F    +GI G   G  S+ 
Sbjct: 172 HSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNNGIFVSNETGIAGFSRGALSMP 231

Query: 201 SQLN-SSFSYCI----GSLHDPDYLHNKLIL-GDGAIIDEGDATPLQFIDGH------YY 248
           +QL   +FSYC     GS   P +L     L  D A    G       I  H      YY
Sbjct: 232 AQLKVDNFSYCFTAITGSEPSPVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYY 291

Query: 249 ITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGE 307
           I+L+ ++V    L I  ++F  ++D  GG ++DSGT +T L +  Y  + D  + + +  
Sbjct: 292 ISLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLT 351

Query: 308 QMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDA----FCMA 363
              S S   +LC+  +        P +  HF  GA L L +++  ++          C+A
Sbjct: 352 VHNSTSSLSQLCFS-VPPGAKPDVPALVLHFE-GATLDLPRENYMFEIEEAGGIRLTCLA 409

Query: 364 VNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           +N         + S+IG   QQ  +V YD+     +F    C
Sbjct: 410 INAGE------DLSVIGNFQQQNMHVLYDLANDMLSFVPARC 445


>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
 gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
          Length = 458

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 106/378 (28%), Positives = 168/378 (44%), Gaps = 34/378 (8%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DCSPQ-LGTIFYPSRSSSYAYVPCDS 118
           ++V+I +G PP       DTGS L WV C  C+ +CS    G+ F    S++++   C S
Sbjct: 83  YFVSIRLGSPPQTLLLVADTGSDLTWVRCSACKTNCSIHPPGSTFLARHSTTFSPTHCFS 142

Query: 119 EHCRYFPYARCSAYKH-----RCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
             C+  P    +   H      C Y  +Y  G +TS F S E  T   +    + ++ + 
Sbjct: 143 SLCQLVPQPNPNPCNHTRLHSTCRYEYVYSDGSKTSGFFSKETTTLNTSSGREMKLKSIA 202

Query: 174 FGCGFSTN------RNFK-FSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHN 222
           FGCGF  +       +F   SG+ GLG G  S  SQL      SFSYC+          +
Sbjct: 203 FGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQLGRRFGRSFSYCLLDYTLSPPPTS 262

Query: 223 KLILGDGAIIDEGDATPLQFI--------DGHYYITLEAISVDGRMLDINPNIFKRDDSG 274
            L++GD     + + + + F            YYI+++ + VDG  L I+P+++  D+ G
Sbjct: 263 YLMIGDVVSTKKDNKSMMSFTPLLINPEAPTFYYISIKGVFVDGVKLHIDPSVWSLDELG 322

Query: 275 -GGVMIDSGTDVTWLVKEAYE----ALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLK 329
            GG +IDSGT +T+L + AY     A + EV +        S      LC + +      
Sbjct: 323 NGGTVIDSGTTLTFLTEPAYREILSAFKREVKLPSPTPGGASTRSGFDLCVN-VTGVSRP 381

Query: 330 GFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNV 389
            FP +     G +  +    + F        C+A+ P  +      FS+IG + QQ + +
Sbjct: 382 RFPRLSLELGGESLYSPPPRNYFIDISEGIKCLAIQP--VEAESGRFSVIGNLMQQGFLL 439

Query: 390 GYDIGRKQKTFQRMDCEV 407
            +D G+ +  F R  C V
Sbjct: 440 EFDRGKSRLGFSRRGCAV 457


>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
 gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
          Length = 373

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 103/362 (28%), Positives = 168/362 (46%), Gaps = 43/362 (11%)

Query: 73  PQFTAMDTGSSLLWVHC-------YPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR--Y 123
           P+   +DTGS L+W  C          R  SP    ++ P  SS++A++PC    C+   
Sbjct: 25  PRKLIVDTGSDLIWTQCKLSSSTAAAARHGSPP---VYDPGESSTFAFLPCSDRLCQEGQ 81

Query: 124 FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG-FSTNR 182
           F +  C++ K+RC+Y  +Y       V  S E  TF      ++ +    FGCG  S   
Sbjct: 82  FSFKNCTS-KNRCVYEDVYGSAAAVGVLAS-ETFTFGARRAVSLRLG---FGCGALSAGS 136

Query: 183 NFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT-PL 240
               +GI GL     SL++QL    FSYC+    D     + L+ G  A +     T P+
Sbjct: 137 LIGATGILGLSPESLSLITQLKIQRFSYCLTPFADKKT--SPLLFGAMADLSRHKTTRPI 194

Query: 241 QFI--------DGHYYITLEAISVDGRMLDI-NPNIFKRDDSGGGVMIDSGTDVTWLVKE 291
           Q            +YY+ L  IS+  + L +   ++  R D GGG ++DSG+ V +LV+ 
Sbjct: 195 QTTAIVSNPVETVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEA 254

Query: 292 AYEALRDEVM--IRLEGEQMRSYSWPDKLCY-----HGIMSSDLKGFPTVRFHFRGGAKL 344
           A+EA+++ VM  +RL         +  +LC+         + +    P +  HF GGA +
Sbjct: 255 AFEAVKEAVMDVVRLPVANRTVEDY--ELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAAM 312

Query: 345 ALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
            L +D+ F +PR    C+AV   +  +     S+IG + QQ  +V +D+   + +F    
Sbjct: 313 VLPRDNYFQEPRAGLMCLAVGKTTDGS---GVSIIGNVQQQNMHVLFDVQHHKFSFAPTQ 369

Query: 405 CE 406
           C+
Sbjct: 370 CD 371


>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 500

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 109/354 (30%), Positives = 153/354 (43%), Gaps = 25/354 (7%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++  + +G+P    +  +DTGS + W+ C PC DC  Q   ++ PS S+SYA V CDS  
Sbjct: 163 YFSRVGVGRPARQLYMVLDTGSDVTWLQCQPCADCYAQSDPVYDPSVSTSYATVGCDSPR 222

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           CR    A C      C+Y   Y  G  T    +TE LT  +    +  V +V  GCG   
Sbjct: 223 CRDLDAAACRNSTGSCLYEVAYGDGSYTVGDFATETLTLGD----SAPVSNVAIGCGHDN 278

Query: 181 NRNFKFSGIFGLGIGRS-SLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
              F  +       G   S  SQ++ ++FSYC+     P    + L  GD     E  A 
Sbjct: 279 EGLFVGAAGLLALGGGPLSFPSQISATTFSYCLVDRDSPS--SSTLQFGD----SEQPAV 332

Query: 239 PLQFI-----DGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEA 292
               I     +  YY+ L  ISV G  L I  + F  DD+G GGV++DSGT VT L   A
Sbjct: 333 TAPLIRSPRTNTFYYVALSGISVGGEALSIPSSAFAMDDAGSGGVIVDSGTAVTRLQSGA 392

Query: 293 YEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL-EKDSM 351
           Y ALR+  +   +     S       CY     S ++  P V   F GG +L L  K+ +
Sbjct: 393 YGALREAFVQGTQSLPRASGVSLFDTCYDLAGRSSVQ-VPAVALWFEGGGELKLPAKNYL 451

Query: 352 FYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
                   +C+A    S        S+IG + QQ   V +D  +    F    C
Sbjct: 452 IPVDAAGTYCLAFAGTS-----GPVSIIGNVQQQGVRVSFDTAKNTVGFTADKC 500


>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
 gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
          Length = 410

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 113/398 (28%), Positives = 177/398 (44%), Gaps = 35/398 (8%)

Query: 17  NPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFT 76
            PA  + R  + S  RL+ L  ++    +  AQ     D     + +  SIG PP     
Sbjct: 38  EPAINLTRAAHKSHQRLSMLAARLDDAASGSAQTPLQLDSGGGAYDMTFSIGTPPQELSA 97

Query: 77  AMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRC 136
             DTGS L+W  C  C  C PQ    +YP++SSS++ +PC    C   P ++CSA    C
Sbjct: 98  LADTGSDLIWAKCGACTRCVPQGSPSYYPNKSSSFSKLPCSGSLCSDLPSSQCSAGGAEC 157

Query: 137 IYTQLYLIGPE----TSVFVSTEQLTFKNTDESTIHVQDVVFGC-GFSTNRNFKFSGIFG 191
            Y   Y +  +    T  ++ +E  T  +       V  + FGC   S       SG+ G
Sbjct: 158 DYKYSYGLASDPHHYTQGYLGSETFTLGSD-----AVPGIGFGCTTMSEGGYGSGSGLVG 212

Query: 192 LGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEG-DATPLQFIDGHYY- 248
           LG G  SLVSQLN  +FSYC+ S        + L+ G GA+   G  +TPL     +YY 
Sbjct: 213 LGRGPLSLVSQLNVGAFSYCLTSDAAKT---SPLLFGSGALTGAGVQSTPLLRTSTYYYT 269

Query: 249 ITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQ 308
           + LE+IS+                   G++ DSGT V +L + AY   ++ V+ +     
Sbjct: 270 VNLESISIGAAT--------TAGTGSSGIIFDSGTTVAFLAEPAYTLAKEAVLSQTTNLT 321

Query: 309 MRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPAS 368
           M S     ++C+     +    FP++  HF GG  + L  ++ F        C  V    
Sbjct: 322 MASGRDGYEVCFQ----TSGAVFPSMVLHFDGG-DMDLPTENYFGAVDDSVSCWIV---- 372

Query: 369 INNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
              +  + S++G + Q  Y++ YD+ +   +FQ  +C+
Sbjct: 373 --QKSPSLSIVGNIMQMNYHIRYDVEKSMLSFQPANCD 408


>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
          Length = 474

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 112/402 (27%), Positives = 178/402 (44%), Gaps = 35/402 (8%)

Query: 31  ARLAYLQEKIRSHNTYQAQILP---SNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWV 87
           AR      ++ S     A++ P   ++ + ++ + V+++IG PP P    +DTGS L W 
Sbjct: 78  ARSKARSARLLSGRAASARMDPGSYTDGVPDTEYLVHMAIGTPPQPVQLILDTGSDLTWT 137

Query: 88  HCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARC---SAYKHRCIYTQLYLI 144
            C PC  C  Q    F PSRS +++ +PCD   CR   ++ C   S     C+Y   Y  
Sbjct: 138 QCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLRICRDLTWSSCGEQSWGNGICVYAYAYAD 197

Query: 145 GPETSVFVSTEQLTFKNTDEST--IHVQDVVFGCGFSTNRNF--KFSGIFGLGIGRSSLV 200
              T+  + ++  +F + D +     V D+ FGCG   N  F    +GI G   G  S+ 
Sbjct: 198 HSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNNGIFVSNETGIAGFSRGALSMP 257

Query: 201 SQLN-SSFSYCI----GSLHDPDYLHNKLIL-GDGAIIDEGDATPLQFIDGH------YY 248
           +QL   +FSYC     GS   P +L     L  D A    G       I  H      YY
Sbjct: 258 AQLKVDNFSYCFTAITGSEPSPVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYY 317

Query: 249 ITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGE 307
           I+L+ ++V    L I  ++F  ++D  GG ++DSGT +T L +  Y  + D  + + +  
Sbjct: 318 ISLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLT 377

Query: 308 QMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDA----FCMA 363
              S S   +LC+  +        P +  HF  GA L L +++  ++          C+A
Sbjct: 378 VHNSTSSLSQLCFS-VPPGAKPDVPALVLHFE-GATLDLPRENYMFEIEEAGGIRLTCLA 435

Query: 364 VNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           +N         + S+IG   QQ  +V YD+     +F    C
Sbjct: 436 INAGE------DLSVIGNFQQQNMHVLYDLANDMLSFVPARC 471


>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 486

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 106/378 (28%), Positives = 176/378 (46%), Gaps = 39/378 (10%)

Query: 57  SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLG---TIFYPSRSSSY 111
           S  L+Y  + +G PP      +DTGS +LWV+C  C +C  S QLG     F    SS+ 
Sbjct: 74  SVGLYYTKVKMGTPPKEFNVQIDTGSDILWVNCNTCSNCPQSSQLGIELNFFDTVGSSTA 133

Query: 112 AYVPCDSEHCR---YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFK---NTDES 165
           A +PC    C        A CS   ++C YT  Y  G  TS +  ++ + F        +
Sbjct: 134 ALIPCSDPICTSRVQGAAAECSPRVNQCSYTFQYGDGSGTSGYYVSDAMYFSLIMGQPPA 193

Query: 166 TIHVQDVVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSL 214
                 +VFGC  S     T  +    GIFG G G  S+VSQL+S       FS+C+   
Sbjct: 194 VNSSATIVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSRGITPKVFSHCLKGD 253

Query: 215 HDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG 274
            D   +     + + +I+     +PL     HY + L++I+V+G++L INP +F   ++ 
Sbjct: 254 GDGGGVLVLGEILEPSIV----YSPLVPSQPHYNLNLQSIAVNGQLLPINPAVFSISNNR 309

Query: 275 GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG-FPT 333
           GG ++D GT + +L++EAY+ L   +   +  +  R  +     CY  ++S+ +   FP+
Sbjct: 310 GGTIVDCGTTLAYLIQEAYDPLVTAINTAVS-QSARQTNSKGNQCY--LVSTSIGDIFPS 366

Query: 334 VRFHFRGGAKLALEKDSMF----YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNV 389
           V  +F GGA + L+ +       Y    + +C+              S++G +  +   V
Sbjct: 367 VSLNFEGGASMVLKPEQYLMHNGYLDGAEMWCIG-----FQKFQEGASILGDLVLKDKIV 421

Query: 390 GYDIGRKQKTFQRMDCEV 407
            YDI +++  +   DC +
Sbjct: 422 VYDIAQQRIGWANYDCSL 439


>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
 gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
          Length = 436

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 109/398 (27%), Positives = 178/398 (44%), Gaps = 39/398 (9%)

Query: 32  RLAYLQEKIRS------HNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLL 85
           R+A+L +   +      +++   Q L  N +    + +NIS+G P +      DTGS L+
Sbjct: 53  RIAFLSDATAAGKATTTNSSVSFQALLENGVGG--YNMNISVGTPLLTFSVVADTGSDLI 110

Query: 86  WVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIG 145
           W  C PC  C  Q    F P+ SS+++ +PC S  C++ P +  +     C+Y   Y  G
Sbjct: 111 WTQCAPCTKCFQQPAPPFQPASSSTFSKLPCTSSFCQFLPNSIRTCNATGCVYNYKYGSG 170

Query: 146 PETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN- 204
             T+ +++TE  T K  D S      V FGC          SGI GLG G  SL+ QL  
Sbjct: 171 -YTAGYLATE--TLKVGDAS---FPSVAFGCSTENGVGNSTSGIAGLGRGALSLIPQLGV 224

Query: 205 SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFID------GHYYITLEAISVDG 258
             FSYC+ S        + ++ G  A + +G+     F++       +YY+ L  I+V  
Sbjct: 225 GRFSYCLRSGSAAG--ASPILFGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGE 282

Query: 259 RMLDINPNI--FKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD 316
             L +  +   F ++  GGG ++DSGT +T+L K+ YE ++   + +       + +   
Sbjct: 283 TDLPVTTSTFGFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTADVTTVNGTRGL 342

Query: 317 KLCYHGIMSSDLK-GFPTVRFHFRGGAKLA-------LEKDSMFYQPRPDAFCMAVNPAS 368
            LC+            P++   F GGA+ A       +E DS   Q      C+ + PA 
Sbjct: 343 DLCFKSTGGGGGGIAVPSLVLRFDGGAEYAVPTYFAGVETDS---QGSVTVACLMMLPAK 399

Query: 369 INNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
            +      S+IG + Q   ++ YD+     +F   DC 
Sbjct: 400 GDQP---MSVIGNVMQMDMHLLYDLDGGIFSFAPADCA 434


>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 455

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 110/374 (29%), Positives = 164/374 (43%), Gaps = 41/374 (10%)

Query: 50  ILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DCSPQLGTIFYPSRS 108
           + P   +    +   + +G P  P    +DTGSSL W+ C PCR  C  Q G +F P  S
Sbjct: 106 LTPGTSVGVGNYVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRVSCHRQSGPVFDPKTS 165

Query: 109 SSYAYVPCDSEHCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD 163
           SSYA V C S  C     A      CS   + CIY   Y     +  ++S + ++F    
Sbjct: 166 SSYAAVSCSSPQCDGLSTATLNPAVCSP-SNVCIYQASYGDSSFSVGYLSKDTVSFGANS 224

Query: 164 ESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPD 218
                V +  +GCG      F + +G+ GL   + SL+ QL      SFSYC+ S     
Sbjct: 225 -----VPNFYYGCGQDNEGLFGRSAGLMGLARNKLSLLYQLAPTLGYSFSYCLPSTSSSG 279

Query: 219 YLHNKLILGDGAIIDEGDA-TPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG 274
           YL        G+    G + TP+      D  Y+I+L  ++V G+ L ++ + +    + 
Sbjct: 280 YLS------IGSYNPGGYSYTPMVSNTLDDSLYFISLSGMTVAGKPLAVSSSEYTSLPT- 332

Query: 275 GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMR--SYSWPDKLCYHGIMSSDLKGFP 332
              +IDSGT +T L    Y AL   V   ++G   R  +YS  D  C+ G  +S L+  P
Sbjct: 333 ---IIDSGTVITRLPTSVYTALSKAVAAAMKGSTKRAAAYSILDT-CFEG-QASKLRAVP 387

Query: 333 TVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
            V   F GGA L L   ++         C+A  PA       + ++IG   QQ ++V YD
Sbjct: 388 AVSMAFSGGATLKLSAGNLLVDVDGATTCLAFAPAR------SAAIIGNTQQQTFSVVYD 441

Query: 393 IGRKQKTFQRMDCE 406
           +   +  F    C 
Sbjct: 442 VKSNRIGFAAAGCS 455


>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
 gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
          Length = 359

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 109/366 (29%), Positives = 170/366 (46%), Gaps = 37/366 (10%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLGTIFYPSRSSSYAYVPCDS 118
           + + +SIG PP      +DTGS L+W+ C  C  C       TIF+   SSSY  +PC+S
Sbjct: 5   YMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKLPCNS 64

Query: 119 EHCRYFPYA----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIH---VQD 171
            HC     A    RC   +  C Y   Y  G  TS  V +++++F++      H      
Sbjct: 65  THCSGMSSAGIGPRC---EETCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSFFDG 121

Query: 172 VVFGCGFSTNRNFKFS-GIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLIL 226
            +FGCG     ++ F+ G+ GLG    SL+ QL       FSYC+ S   P    + L L
Sbjct: 122 FLFGCGRKLKGDWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKSFLFL 181

Query: 227 GDGAIIDEGDATPLQFIDGH------YYITLEAISVDGRMLDINPNIFKRDDSGG----- 275
           G  A +   D      + G       YY+ L++I+V G  + +       + S G     
Sbjct: 182 GSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITVGGVPVVVYDKESGHNTSVGPFLAN 241

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLK-GFPTV 334
             +IDSGT  T L    YEA+R  +  ++    + + +  D LC++   S D   GFP+V
Sbjct: 242 KTVIDSGTTYTLLTPPVYEAMRKSIEEQVILPTLGNSAGLD-LCFNS--SGDTSYGFPSV 298

Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
            F+F    +L L  +++F     D  C+     S+++   + S+IG M QQ +++ YD+ 
Sbjct: 299 TFYFANQVQLVLPFENIFQVTSRDVVCL-----SMDSSGGDLSIIGNMQQQNFHILYDLV 353

Query: 395 RKQKTF 400
             Q +F
Sbjct: 354 ASQISF 359


>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
 gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
          Length = 448

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 103/436 (23%), Positives = 186/436 (42%), Gaps = 45/436 (10%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           ++HR ++             R +     +    ++        +  ++ ++      +  
Sbjct: 28  VVHRGAVFPSRRGAPPGSLRRCRHAAPFTAQVASFHSIAADDDDRLRSPVMSGVPFDSGE 87

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++  I++G PP      +DTGS L+W+ C PCR C  Q+  ++ P  SS++  +PC S  
Sbjct: 88  YFAVINVGDPPTRALVVIDTGSDLIWLQCVPCRHCYRQVTPLYDPRSSSTHRRIPCASPR 147

Query: 121 CR-YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
           CR    Y  C A    C+Y  +Y  G  +S  ++T++L F +      HV +V  GCG  
Sbjct: 148 CRDVLRYPGCDARTGGCVYMVVYGDGSASSGDLATDRLVFPD----DTHVHNVTLGCGHD 203

Query: 180 TNRNFK-FSGIFGLGIGRSSLVSQLNSS----FSYCIGS-LHDPDYLHNKLILGDGAIID 233
                +  +G+ G+G G+ S  +QL  +    FSYC+G  L       + L+ G      
Sbjct: 204 NVGLLESAAGLLGVGRGQLSFPTQLAPAYGHVFSYCLGDRLSRAQNGSSYLVFGRTPEPP 263

Query: 234 EGDATPLQFIDGH---YYITLEAISVDGRM--------LDINPNIFKRDDSGGGVMIDSG 282
               TPL+        YY+ +   SV G          L +NP   +     GG+++DSG
Sbjct: 264 STAFTPLRTNPRRPSLYYVDMVGFSVGGERVTGFSNASLALNPATGR-----GGIVVDSG 318

Query: 283 TDVTWLVKEAYEALRDEV--------MIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTV 334
           T ++   ++AY A+RD           +R    +   +     L  +G  ++ ++  P++
Sbjct: 319 TAISRFARDAYAAVRDAFDSHAAAAGTMRKLATKFSVFDACYDLRGNGAPAAAVR-VPSI 377

Query: 335 RFHFRGGAKLALEKDSMFYQ----PRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
             HF GGA +AL + +         R   FC+ +  A         +++G + QQ + + 
Sbjct: 378 VLHFAGGADMALPQANYLIPVQGGDRRTYFCLGLQAADD-----GLNVLGNVQQQGFGLV 432

Query: 391 YDIGRKQKTFQRMDCE 406
           +D+ R +  F    C 
Sbjct: 433 FDVERGRIGFTPNGCS 448


>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
          Length = 390

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 105/379 (27%), Positives = 167/379 (44%), Gaps = 40/379 (10%)

Query: 54  NDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAY 113
           N +  + + V+++IG PP P    +DTGS L+W  C PC  C  Q    F  SRSS+ A 
Sbjct: 28  NGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCKPCVSCFDQPLPYFDTSRSSTNAL 87

Query: 114 VPCDSEHCRYFPY----ARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHV 169
           +PC+S  C+  P      + +     C Y   Y     T   ++ ++ TF     +   +
Sbjct: 88  LPCESTQCKLDPTVTVCVKLNQTVQTCAYYTSYGDNSVTIGLLAADKFTFV----AGTSL 143

Query: 170 QDVVFGCGFSTNRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDP-------DY 219
             V FGCG +    F    +GI G G G  SL SQL   +FS+C  ++          D 
Sbjct: 144 PGVTFGCGLNNTGVFNSNETGIAGFGRGPLSLPSQLKVGNFSHCFTTITGAIPSTVLLDL 203

Query: 220 LHNKLILGDGAIIDEGDATPL-QFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDS 273
             +    G GA+      TPL Q+         YY++L+ I+V    L +  + F   + 
Sbjct: 204 PADLFSNGQGAV----QTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALTNG 259

Query: 274 GGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPT 333
            GG +IDSGT +T L  + Y+ +RDE   +++   +   +     C+    S      P 
Sbjct: 260 TGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHYTCFSA-PSQAKPDVPK 318

Query: 334 VRFHFRGGAKLALEKDSMFYQPRPDA----FCMAVNPASINNRYVNFSLIGMMAQQFYNV 389
           +  HF  GA + L +++  ++   DA     C+A+N           ++IG   QQ  +V
Sbjct: 319 LVLHFE-GATMDLPRENYVFEVPDDAGNSIICLAINKGD------ETTIIGNFQQQNMHV 371

Query: 390 GYDIGRKQKTFQRMDCEVL 408
            YD+     +F    C+ L
Sbjct: 372 LYDLQNNMLSFVAAQCDKL 390


>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 526

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 105/353 (29%), Positives = 161/353 (45%), Gaps = 20/353 (5%)

Query: 48  AQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSR 107
           A + P      S F V I +G PP   +   D  +   W+ C PC  C  Q  +IF PS+
Sbjct: 174 ASLNPGITTGTSNFLVQIGVGGPPQKFYMIFDLQTDFTWLQCQPCIKCYDQPDSIFDPSQ 233

Query: 108 SSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI 167
           SSSY  + C+++HC   P + CS   + C Y   Y  G  T   +  E ++F    ES+ 
Sbjct: 234 SSSYTLLSCETKHCNLLPNSSCSDDGY-CRYNITYKDGTNTEGVLINETVSF----ESSG 288

Query: 168 HVQDVVFGCGFSTNRNFKFS-GIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLI 225
            V  V  GC       F  S G FGLG G  S  S++N SS SYC+    D  Y  + L 
Sbjct: 289 WVDRVSLGCSNKNQGPFVGSDGTFGLGRGSLSFPSRINASSMSYCLVESKD-GYSSSTLE 347

Query: 226 LGDGAIIDEGDATPLQ--FIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSG 282
                      A  LQ    +  YY+ L+ I V G  +D+  + F  D  G GG+++ S 
Sbjct: 348 FNSPPCSGSVKAKLLQNPKAENLYYVGLKGIKVGGEKIDVPNSTFTIDPYGNGGMIVSSS 407

Query: 283 TDVTWLVKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
           + +T L  + Y  +RD  + + +  E+++++   D  CY+ + S++    P + F    G
Sbjct: 408 SLITMLENDTYNVVRDAFVAKTQHLERLKAFLQFDT-CYN-LSSNNTVELPILEFEVNDG 465

Query: 342 AKLALEKDSMFYQ-PRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
               L K+S  Y   +   FC A  P+       +FS++G + Q    V +D+
Sbjct: 466 KSWLLPKESYLYAVDKNGTFCFAFAPSK-----GSFSILGTLQQYGTRVTFDL 513


>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 419

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 112/407 (27%), Positives = 180/407 (44%), Gaps = 28/407 (6%)

Query: 15  NENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQ 74
           N+N AH       I+    A ++      + +Q+ ++  + + +  ++V+  +G PP   
Sbjct: 23  NDNGAHNSANPPVIT----AVIEGPPSHDHDFQSPVVSGSTLGSGQYFVDFFLGTPPQKF 78

Query: 75  FTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYAR---CS- 130
              +D+GS LLWV C PC  C  Q   ++ PS SS++  VPC S  C   P      C  
Sbjct: 79  SLIVDSGSDLLWVQCAPCLQCYAQDTPLYAPSNSSTFNPVPCLSPECLLIPATEGFPCDF 138

Query: 131 AYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFS-GI 189
            Y   C Y   Y    +TS  +S     +++     + +  V FGCG     +F  + G+
Sbjct: 139 HYPGACAYEYRYA---DTS--LSKGVFAYESATVDDVRIDKVAFGCGRDNQGSFAAAGGV 193

Query: 190 FGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAI--IDEGDATPLQFI 243
            GLG G  S  SQ+     + F+YC+ +  DP  + + LI GD  I  I +   TP+   
Sbjct: 194 LGLGQGPLSFGSQVGYAYGNKFAYCLVNYLDPTSVSSWLIFGDELISTIHDLQFTPIVSN 253

Query: 244 DGH---YYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYEALRDE 299
             +   YY+ +E + V G  L I+ + +  D  G GG + DSGT VT+ +  AY  +   
Sbjct: 254 SRNPTLYYVQIEKVMVGGESLPISHSAWSLDFLGNGGSIFDSGTTVTYWLPPAYRNILAA 313

Query: 300 VMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDA 359
               +   +  S    D LC   +   D   FP+      GGA    ++ + F    P+ 
Sbjct: 314 FDKNVRYPRAASVQGLD-LCVD-VTGVDQPSFPSFTIVLGGGAVFQPQQGNYFVDVAPNV 371

Query: 360 FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
            C+A+  A + +    F+ IG + QQ + V YD    +  F    C 
Sbjct: 372 QCLAM--AGLPSSVGGFNTIGNLLQQNFLVQYDREENRIGFAPAKCS 416


>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 561

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 106/369 (28%), Positives = 170/369 (46%), Gaps = 31/369 (8%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           +++++ +G PP      +DTGS L W+ C PC  C  Q G  + P  SSS+  + C    
Sbjct: 197 YFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKDSSSFRNISCHDPR 256

Query: 121 CRYF----PYARCSAYKHRCIYTQLYLIGPETS----VFVSTEQLTFKNTDESTIHVQDV 172
           C+      P   C A    C Y   Y  G  T+    +   T  LT  N      HV++V
Sbjct: 257 CQLVSAPDPPKPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGTSELKHVENV 316

Query: 173 VFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILG 227
           +FGCG      F   +G+ GLG G  S  SQ+ S    SFSYC+   +    + +KLI G
Sbjct: 317 MFGCGHWNRGLFHGAAGLLGLGKGPLSFASQMQSLYGQSFSYCLVDRNSNASVSSKLIFG 376

Query: 228 -DGAIIDEGDATPLQF-------IDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVM 278
            D  ++   +     F       +D  YY+ ++++ VD  +L I    +     G GG +
Sbjct: 377 EDKELLSHPNLNFTSFGGGKDGSVDTFYYVQIKSVMVDDEVLKIPEETWHLSSEGAGGTI 436

Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYH--GIMSSDLKGFPTVRF 336
           IDSGT +T+  + AYE +++  + +++G Q+     P K CY+  GI   +L  F  +  
Sbjct: 437 IDSGTTLTYFAEPAYEIIKEAFVRKIKGYQLVEGLPPLKPCYNVSGIEKMELPDFGIL-- 494

Query: 337 HFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
            F   A      ++ F    P+  C+A+    + N     S+IG   QQ +++ YD+ + 
Sbjct: 495 -FADEAVWNFPVENYFIWIDPEVVCLAI----LGNPRSALSIIGNYQQQNFHILYDMKKS 549

Query: 397 QKTFQRMDC 405
           +  +  M C
Sbjct: 550 RLGYAPMKC 558


>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
          Length = 446

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 115/437 (26%), Positives = 187/437 (42%), Gaps = 55/437 (12%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           + HRD++  P   P       +++ +    AR A L   + +     + +       +  
Sbjct: 31  VFHRDALFPP--PPGAKRGSLLRQRLAADAARYASL---VDATGRLHSPVFSGIPFESGE 85

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++  + +G P       +DTGS L+W+ C PCR C  Q G +F P RSS+Y  VPC S  
Sbjct: 86  YFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRSSTYRRVPCSSPQ 145

Query: 121 CRYFPYARC---SAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
           CR   +  C    A    C Y   Y  G  ++  ++T++L F N      +V +V  GCG
Sbjct: 146 CRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGELATDKLAFAN----DTYVNNVTLGCG 201

Query: 178 FSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAII 232
                 F   +G+ G+  G+ S+ +Q+     S F YC+G         + L+ G     
Sbjct: 202 RDNEGLFDSAAGLLGVARGKISISTQVAPAYGSVFEYCLGDRTSRSTRSSYLVFGR---T 258

Query: 233 DEGDATPLQFIDGH------YYITLEAISVDGRMLDINPNIFKRDDSG---GGVMIDSGT 283
            E  +T    +  +      YY+ +   SV G  +    N     D+    GGV++DSGT
Sbjct: 259 PEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLALDTATGRGGVVVDSGT 318

Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL---CYHGIMSSDLKGFPT-----VR 335
            ++   ++AY ALRD    R     MR  +    +   CY      DL+G P      + 
Sbjct: 319 AISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACY------DLRGRPAASAPLIV 372

Query: 336 FHFRGGAKLALEKDSMFY-----QPRPDAF--CMAVNPASINNRYVNFSLIGMMAQQFYN 388
            HF GGA +AL  ++ F      + R  ++  C+    A         S+IG + QQ + 
Sbjct: 373 LHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADD-----GLSVIGNVQQQGFR 427

Query: 389 VGYDIGRKQKTFQRMDC 405
           V +D+ +++  F    C
Sbjct: 428 VVFDVEKERIGFAPKGC 444


>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
 gi|224033441|gb|ACN35796.1| unknown [Zea mays]
 gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
          Length = 456

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 88/266 (33%), Positives = 129/266 (48%), Gaps = 24/266 (9%)

Query: 50  ILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSS 109
           +  +  I+ + + V++++G PP P    +DTGS L+W  C PCRDC  Q   +  P+ SS
Sbjct: 75  VAAAGGIATNEYLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFDQGIPLLDPAASS 134

Query: 110 SYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTF-----KNTDE 164
           +YA +PC +  CR  P+  C      C+Y   Y     T   ++T++ TF     +N D 
Sbjct: 135 TYAALPCGAPRCRALPFTSCGG--RSCVYVYHYGDKSVTVGKIATDRFTFGDNGRRNGDG 192

Query: 165 STIHVQDVVFGCGFSTNRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLH 221
           S    + + FGCG      F+   +GI G G GR SL SQLN +SFSYC  S+ D     
Sbjct: 193 SLPATRRLTFGCGHFNKGVFQSNETGIAGFGRGRWSLPSQLNATSFSYCFTSMFDSKSSI 252

Query: 222 NKLILGDGAIID-----EGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDS 273
             L     A+       E   TPL         Y+++L+ ISV    L +    F+    
Sbjct: 253 VTLGGAPAALYSHAHSGEVRTTPLFKNPSQPSLYFLSLKGISVGKTRLPVPETKFR---- 308

Query: 274 GGGVMIDSGTDVTWLVKEAYEALRDE 299
               +IDSG  +T L +E YEA++ E
Sbjct: 309 --STIIDSGASITTLPEEVYEAVKAE 332


>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
 gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
           Group]
 gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
 gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
          Length = 500

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 116/370 (31%), Positives = 161/370 (43%), Gaps = 41/370 (11%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++  I +G P  P    +DTGS ++W+ C PCR C  Q G +F P  S SY  V C +  
Sbjct: 147 YFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDPRASHSYGAVDCAAPL 206

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           CR      C   +  C+Y   Y  G  T+   +TE LTF     S   V  V  GCG   
Sbjct: 207 CRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFA----SGARVPRVALGCGHDN 262

Query: 181 NRNFKFSGIFGLGI-GRSSLVSQLN----SSFSYCI----GSLHDPDYLHNKLILGDGAI 231
              F  +        G  S  SQ++     SFSYC+     S        + +  G GA+
Sbjct: 263 EGLFVAAAGLLGLGRGSLSFPSQISRRFGRSFSYCLVDRTSSSASATSRSSTVTFGSGAV 322

Query: 232 IDEGDA--TPL---QFIDGHYYITLEAISVDG-RMLDINPNIFKRDDSG--GGVMIDSGT 283
                A  TP+     ++  YY+ L  ISV G R+  +  +  + D S   GGV++DSGT
Sbjct: 323 GPSAAASFTPMVKNPRMETFYYVQLMGISVGGARVPGVAVSDLRLDPSTGRGGVIVDSGT 382

Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMR--SYSWPDKLCYHGIMSSDLKGF-----PTVRF 336
            VT L + AY ALRD       G ++    +S  D  CY      DL G      PTV  
Sbjct: 383 SVTRLARPAYAALRDAFRAAAAGLRLSPGGFSLFDT-CY------DLSGLKVVKVPTVSM 435

Query: 337 HFRGGAKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGR 395
           HF GGA+ AL  ++ +        FC A   A  +      S+IG + QQ + V +D   
Sbjct: 436 HFAGGAEAALPPENYLIPVDSRGTFCFAF--AGTDG---GVSIIGNIQQQGFRVVFDGDG 490

Query: 396 KQKTFQRMDC 405
           ++  F    C
Sbjct: 491 QRLGFVPKGC 500


>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
 gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
          Length = 494

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 133/444 (29%), Positives = 193/444 (43%), Gaps = 56/444 (12%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSH---------NTYQAQIL 51
           L+HRDS        N   A  + R +     R A++  K  ++         +T +  + 
Sbjct: 68  LLHRDSFAV-----NATAAELLARRLQRDELRAAWIISKAAANGTPPPVVGLSTGRGLVA 122

Query: 52  P--SNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSS 109
           P  S   ++  +   I++G P V    A+DT S L W+ C PCR C PQ G +F P  S+
Sbjct: 123 PVVSRAPTSGEYMAKIAVGTPAVQALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHST 182

Query: 110 SYAYVPCDSEHCRYFPYARCS-AYKHRCIYTQLYLIG-PETSVFVS---TEQLTFKNTDE 164
           SY  +  D+  C+    +    A +  CIYT  Y  G   TS  V     E LTF     
Sbjct: 183 SYGEMNYDAPDCQALGRSGGGDAKRGTCIYTVQYGDGHGSTSTSVGDLVEETLTFAG--- 239

Query: 165 STIHVQDVVFGCGFSTNRNF--KFSGIFGLGIGRSSLVSQL-----NSSFSYC-IGSLHD 216
             +    +  GCG      F    +GI GLG G+ S+  Q+     N+SFSYC +  +  
Sbjct: 240 -GVRQAYLSIGCGHDNKGLFGAPAAGILGLGRGQISIPHQIAFLGYNASFSYCLVDFISG 298

Query: 217 PDYLHNKLILGDGAIIDEGDA--TPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRD 271
           P    + L  G GA+     A  TP    Q +   YY+ L  +SV G  +   P + +RD
Sbjct: 299 PGSPSSTLTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRV---PGVTERD 355

Query: 272 ------DSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL---CYHG 322
                    GGV++DSGT VT L + AY A RD            S   P  L   CY  
Sbjct: 356 LQLDPYTGRGGVILDSGTTVTRLARPAYVAFRDAFRAAATSLGQVSTGGPSGLFDTCYTV 415

Query: 323 IMSSDLKGFPTVRFHFRGGAKLALE-KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGM 381
              + +K  P V  HF GG +++L+ K+ +         C A   A   +R V  S+IG 
Sbjct: 416 GGRAGVK-VPAVSMHFAGGVEVSLQPKNYLIPVDSRGTVCFAF--AGTGDRSV--SVIGN 470

Query: 382 MAQQFYNVGYDIGRKQKTFQRMDC 405
           + QQ + V YD+  ++  F   +C
Sbjct: 471 ILQQGFRVVYDLAGQRVGFAPNNC 494


>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 384

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 110/362 (30%), Positives = 165/362 (45%), Gaps = 27/362 (7%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
           N  + + +++G PP      +DTGS L WV C PCR C  Q G  F PS+S S+    C 
Sbjct: 36  NGEYLMTLTLGSPPQSFDVIVDTGSDLNWVQCLPCRVCYQQPGPKFDPSKSRSFRKAACT 95

Query: 118 SEHCRY--FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
              C     P   C+A  + C Y   Y     T+  ++ E ++  N    T  V +  FG
Sbjct: 96  DNLCNVSALPLKACAA--NVCQYQYTYGDQSNTNGDLAFETISLNN-GAGTQSVPNFAFG 152

Query: 176 CGFSTNRNFK-FSGIFGLGIGRSSLVSQLN----SSFSYCIGSLHDPDYLHNKLILGDGA 230
           CG      F   +G+ GLG G  SL SQL+    + FSYC+ SL+      + L  G  A
Sbjct: 153 CGTQNLGTFAGAAGLVGLGQGPLSLNSQLSHTFANKFSYCLVSLNSLS--ASPLTFGSIA 210

Query: 231 IIDEGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSG--GGVMIDSGTDV 285
                  T +     H   YY+ L +I V G+ L++ P++F  D S   GG +IDSGT +
Sbjct: 211 AAANIQYTSIVVNARHPTYYYVQLNSIEVGGQPLNLAPSVFAIDQSTGRGGTIIDSGTTI 270

Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLA 345
           T L   AY A+       +   ++   ++   LC++ I        P + F F+ GA   
Sbjct: 271 TMLTLPAYSAVLRAYESFVNYPRLDGSAYGLDLCFN-IAGVSNPSVPDMVFKFQ-GADFQ 328

Query: 346 LEKDSMFYQPRPDA--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
           +  +++F      A   C+A+  +        FS+IG + QQ + V YD+  K+  F   
Sbjct: 329 MRGENLFVLVDTSATTLCLAMGGSQ------GFSIIGNIQQQNHLVVYDLEAKKIGFATA 382

Query: 404 DC 405
           DC
Sbjct: 383 DC 384


>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 475

 Score =  128 bits (322), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 131/436 (30%), Positives = 186/436 (42%), Gaps = 55/436 (12%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARL----AYLQEKIRSHNTYQAQI--LPSN 54
           + HR    S  NN        V+  + +  AR+    + L +K+ +++  Q+Q   LP+ 
Sbjct: 65  VTHRHGTCSRLNNGKATSPDHVEI-LRLDQARVNSIHSKLSKKLTTNHVSQSQSTDLPAK 123

Query: 55  D---ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSS 110
           D   + +  + V + +G P        DTGS L W  C PC R C  Q   IF PS+S+S
Sbjct: 124 DGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTS 183

Query: 111 YAYVPCDSEHCRYFPYA-----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES 165
           Y  V C S  C     A      CSA    CIY   Y     +  F++ ++ T  ++D  
Sbjct: 184 YYNVSCSSAACGSLSSATGNAGSCSA--SNCIYGIQYGDQSFSVGFLAKDKFTLTSSDV- 240

Query: 166 TIHVQDVVFGCGFSTNRNFKFSGIFG-LGIGRSSL------VSQLNSSFSYCIGSLHDPD 218
                 V FGCG   N    F+G+ G LG+GR  L       +  N  FSYC+ S     
Sbjct: 241 ---FDGVYFGCG--ENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCLPS--SAS 293

Query: 219 YLHNKLILGDGAIIDEGDATPLQFI-DGH--YYITLEAISVDGRMLDINPNIFKRDDSGG 275
           Y  + L  G   I      TP+  I DG   Y + + AI+V G+ L I   +F    S  
Sbjct: 294 YTGH-LTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVF----STP 348

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF---- 331
           G +IDSGT +T L  +AY ALR     ++      S       C+      DL GF    
Sbjct: 349 GALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCF------DLSGFKTVT 402

Query: 332 -PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
            P V F F GGA + L    +FY  +    C+A    + N+   N ++ G + QQ   V 
Sbjct: 403 IPKVAFSFSGGAVVELGSKGIFYAFKISQVCLAF---AGNSDDSNAAIFGNVQQQTLEVV 459

Query: 391 YDIGRKQKTFQRMDCE 406
           YD    +  F    C 
Sbjct: 460 YDGAGGRVGFAPNGCS 475


>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
 gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
          Length = 487

 Score =  128 bits (322), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 113/381 (29%), Positives = 154/381 (40%), Gaps = 50/381 (13%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD--CSPQLGTIFYPSRSSSYAYVPCDS 118
           + V I IG PP       DTGS L WV C PC D  C PQ   +F PS+SS+Y  VPC +
Sbjct: 122 YVVTIGIGTPPRNFTVLFDTGSDLTWVQCLPCPDSSCYPQQEPLFDPSKSSTYVDVPCSA 181

Query: 119 EHCRY--FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
             C        RC A    C Y+  Y    ET   ++ E  T             VVFGC
Sbjct: 182 PECHIGGVQQTRCGATS--CEYSVKYGDESETHGSLAEETFTLSPPSPLAPAATGVVFGC 239

Query: 177 GFS-----TNRNFKFSGIFGLGIGRSSLVSQLNSS-------FSYCIGSLHDPDYLHNKL 224
                    +     +G+ GLG G SS++SQ   S       FSYC   L         L
Sbjct: 240 SHEYISVFNDTGMGVAGLLGLGRGDSSILSQTRRSINSGGGVFSYC---LPPRGSSTGYL 296

Query: 225 ILGDGAIIDEGDATPLQF---------IDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
            +G GA   +   + L F         +   Y + L  +SV+G  +DI  + F       
Sbjct: 297 TIGGGAAAPQQQYSNLSFTPLITTISQLRSAYVVNLAGVSVNGAAVDIPASAFSL----- 351

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL--CYHGIMSSDLKGFPT 333
           G +IDSGT VT +   AY  LRDE  + +   +M        L  CY  +   D+   P 
Sbjct: 352 GAVIDSGTVVTHMPAAAYYPLRDEFRLHMGSYKMLPEGSMKLLDTCYD-VTGQDVVTAPR 410

Query: 334 VRFHFRGGAKLALEKDS-MFYQPRPDA-------FCMAVNPASINNRYVNFSLIGMMAQQ 385
           V   F GGA++ ++    +   P  D         C+A  P           ++G M Q+
Sbjct: 411 VALEFGGGARIDVDASGILLVLPAEDGSGQSLTLACLAFLP----TNSAGLVIVGNMQQR 466

Query: 386 FYNVGYDIGRKQKTFQRMDCE 406
            YNV +D+   +  F    C 
Sbjct: 467 AYNVVFDVDGGRIGFGPNGCS 487


>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
          Length = 485

 Score =  128 bits (322), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 131/438 (29%), Positives = 188/438 (42%), Gaps = 54/438 (12%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRS-----HNTYQAQILPSND 55
           ++HRD+  +      E   HR+QR       R A + E   +          A ++    
Sbjct: 69  VVHRDT-FAVNATAGELLKHRLQRDKR----RAARISEAAGAGGGNGRKGVAAPVVSGLA 123

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
             +  ++  I +G P       +DTGS ++WV C PCR C  Q G +F P RSSSY  V 
Sbjct: 124 QGSGEYFTKIGVGTPATQALMVLDTGSDVVWVQCAPCRRCYEQSGPVFDPRRSSSYGAVG 183

Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
           C +  CR      C   +  C+Y   Y  G  T+    TE LTF         V  V  G
Sbjct: 184 CGAALCRRLDSGGCDLRRGACMYQVAYGDGSVTAGDFVTETLTFAG----GARVARVALG 239

Query: 176 CGFSTNRNF-KFSGIFGLGIGRSSLVSQLN----SSFSYCI------GSLHDP-DYLHNK 223
           CG      F   +G+ GLG G  S  +Q++     SFSYC+      G+   P  +  + 
Sbjct: 240 CGHDNEGLFVAAAGLLGLGRGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSST 299

Query: 224 LILGDGAI-IDEGDATPL---QFIDGHYYITLEAISVDG-RMLDINPNIFKRDDSG--GG 276
           +  G G++       TP+     ++  YY+ L  ISV G R+  +  +  + D S   GG
Sbjct: 300 VSFGAGSVGASSASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGG 359

Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS---YSWPDKLCYHGIMSSDLKG--- 330
           V++DSGT VT L + +Y ALRD       G    S   +S  D  CY      DL G   
Sbjct: 360 VIVDSGTSVTRLARASYSALRDAFRAAAAGGLRLSPGGFSLFDT-CY------DLGGRRV 412

Query: 331 --FPTVRFHFRGGAKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFY 387
              PTV  HF GGA+ AL  ++ +        FC A   A  +      S+IG + QQ +
Sbjct: 413 VKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAF--AGTDG---GVSIIGNIQQQGF 467

Query: 388 NVGYDIGRKQKTFQRMDC 405
            V +D   ++  F    C
Sbjct: 468 RVVFDGDGQRVGFAPKGC 485


>gi|242051593|ref|XP_002454942.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
 gi|241926917|gb|EES00062.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
          Length = 431

 Score =  128 bits (321), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 117/418 (27%), Positives = 191/418 (45%), Gaps = 42/418 (10%)

Query: 4   RDSILSPYNNPNENPAHRV-QRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFY 62
           R  +  PY   +  P H + +R    S AR+A L+ ++    +     +P   IS+  + 
Sbjct: 39  RAELHHPYAG-SSLPVHDMWRRSARASKARVARLEARLTGDMS-----VPLARISDEGYT 92

Query: 63  VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR 122
           V I IG PP       DT S L W  C    D + Q+  +F P++SSS+A+V C S+ C 
Sbjct: 93  VTIGIGTPPQLHTLIADTASDLTWTQCNLFNDTAKQVEPLFDPAKSSSFAFVTCSSKLCT 152

Query: 123 Y--FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHV-QDVVFGCGFS 179
                  RCS    R +Y  + +   E +  ++ E  T  + ++   H+     FGCG  
Sbjct: 153 EDNPGTKRCSNKTCRYVYPYVSV---EAAGVLAYESFTLSDNNQ---HICMSFGFGCGAL 206

Query: 180 TNRN-FKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDA 237
           T+ N    SGI G+     S+VSQL    FSYC+    D     + L  G  A +     
Sbjct: 207 TDGNLLGASGILGMSPAILSMVSQLAIPKFSYCLTPYTDRK--SSPLFFGAWADLGRYKT 264

Query: 238 T-PLQ-FIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEA 295
           T P+Q  +  +YY+ L  +S+  R LD+    F      GG ++D G  V  L + A+ A
Sbjct: 265 TGPIQKSLTFYYYVPLVGLSLGTRRLDVPAATFALKQ--GGTVVDLGCTVGQLAEPAFTA 322

Query: 296 LRDEVM----IRLEGEQMRSYSWPDKLCY---HGIMSSDLKGFPTVRFHFRGGAKLALEK 348
           L++ V+    + L    ++ Y    K+C+    G+    ++  P V  +F GGA + L +
Sbjct: 323 LKEAVLHTLNLPLTNRTVKDY----KVCFALPSGVAMGAVQTPPLV-LYFDGGADMVLPR 377

Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
           D+ F +P     C+A+ P          S+IG + QQ +++ +D+   +  F    C+
Sbjct: 378 DNYFQEPTAGLMCLALVPGG------GMSIIGNVQQQNFHLLFDVHDSKFLFAPTICD 429


>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
           thaliana]
          Length = 446

 Score =  128 bits (321), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 131/436 (30%), Positives = 184/436 (42%), Gaps = 55/436 (12%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARL----AYLQEKIRSHNTYQAQI--LPSN 54
           + HR    S  NN        V+  + +  AR+    + L +K+ + +  +++   LP+ 
Sbjct: 36  VTHRHGTCSRLNNGKATSPDHVEI-LRLDQARVNSIHSKLSKKLATDHVSESKSTDLPAK 94

Query: 55  D---ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSS 110
           D   + +  + V + +G P        DTGS L W  C PC R C  Q   IF PS+S+S
Sbjct: 95  DGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTS 154

Query: 111 YAYVPCDSEHCRYFPYA-----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES 165
           Y  V C S  C     A      CSA    CIY   Y     +  F++ E+ T  N+D  
Sbjct: 155 YYNVSCSSAACGSLSSATGNAGSCSA--SNCIYGIQYGDQSFSVGFLAKEKFTLTNSDV- 211

Query: 166 TIHVQDVVFGCGFSTNRNFKFSGIFG-LGIGRSSL------VSQLNSSFSYCIGSLHDPD 218
                 V FGCG   N    F+G+ G LG+GR  L       +  N  FSYC+ S     
Sbjct: 212 ---FDGVYFGCG--ENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCLPS--SAS 264

Query: 219 YLHNKLILGDGAIIDEGDATPLQFI-DG--HYYITLEAISVDGRMLDINPNIFKRDDSGG 275
           Y    L  G   I      TP+  I DG   Y + + AI+V G+ L I   +F    S  
Sbjct: 265 YT-GHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVF----STP 319

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF---- 331
           G +IDSGT +T L  +AY ALR     ++      S       C+      DL GF    
Sbjct: 320 GALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCF------DLSGFKTVT 373

Query: 332 -PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
            P V F F GGA + L    +FY  +    C+A    + N+   N ++ G + QQ   V 
Sbjct: 374 IPKVAFSFSGGAVVELGSKGIFYVFKISQVCLAF---AGNSDDSNAAIFGNVQQQTLEVV 430

Query: 391 YDIGRKQKTFQRMDCE 406
           YD    +  F    C 
Sbjct: 431 YDGAGGRVGFAPNGCS 446


>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 484

 Score =  128 bits (321), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 117/370 (31%), Positives = 178/370 (48%), Gaps = 25/370 (6%)

Query: 44  NTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIF 103
           N  Q  ++      +  +++ + IG+PP   +  +DTGS + W+ C PC +C  Q   IF
Sbjct: 132 NALQGPVVSGTSQGSGEYFLRVGIGKPPSQAYVVLDTGSDVSWIQCAPCSECYQQSDPIF 191

Query: 104 YPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD 163
            P  S+SY+ + CD+  C+    + C      C+Y   Y  G  T    +TE +T     
Sbjct: 192 DPVSSNSYSPIRCDAPQCKSLDLSEC--RNGTCLYEVSYGDGSYTVGEFATETVTL---- 245

Query: 164 ESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLH 221
             T  V++V  GCG +    F   +G+ GLG G+ S  +Q+N +SFSYC+ +  D D + 
Sbjct: 246 -GTAAVENVAIGCGHNNEGLFVGAAGLLGLGGGKLSFPAQVNATSFSYCLVN-RDSDAVS 303

Query: 222 NKLILGDGAIIDEGDATPLQF---IDGHYYITLEAISVDGRMLDINPNIFKRD-DSGGGV 277
              +  +  +       PL+    +D  YY+ L+ ISV G  L I  +IF+ D   GGG+
Sbjct: 304 T--LEFNSPLPRNVVTAPLRRNPELDTFYYLGLKGISVGGEALPIPESIFEVDAIGGGGI 361

Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRF 336
           +IDSGT VT L  E Y+ALRD  +   +G  +    S  D  CY  + S +    PTV F
Sbjct: 362 IIDSGTAVTRLRSEVYDALRDAFVKGAKGIPKANGVSLFDT-CYD-LSSRESVQVPTVSF 419

Query: 337 HFRGGAKLAL-EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGR 395
           HF  G +L L  ++ +        FC A  P +      + S++G + QQ   VG+DI  
Sbjct: 420 HFPEGRELPLPARNYLIPVDSVGTFCFAFAPTT-----SSLSIMGNVQQQGTRVGFDIAN 474

Query: 396 KQKTFQRMDC 405
               F    C
Sbjct: 475 SLVGFSADSC 484


>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
 gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
 gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 474

 Score =  127 bits (320), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 130/436 (29%), Positives = 184/436 (42%), Gaps = 55/436 (12%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARL----AYLQEKIRSHNTYQAQI--LPSN 54
           + HR    S  NN        V+  + +  AR+    + L +K+ + +  +++   LP+ 
Sbjct: 64  VTHRHGTCSRLNNGKATSPDHVEI-LRLDQARVNSIHSKLSKKLATDHVSESKSTDLPAK 122

Query: 55  D---ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSS 110
           D   + +  + V + +G P        DTGS L W  C PC R C  Q   IF PS+S+S
Sbjct: 123 DGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTS 182

Query: 111 YAYVPCDSEHCRYFPYA-----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES 165
           Y  V C S  C     A      CSA    CIY   Y     +  F++ E+ T  N+D  
Sbjct: 183 YYNVSCSSAACGSLSSATGNAGSCSA--SNCIYGIQYGDQSFSVGFLAKEKFTLTNSDV- 239

Query: 166 TIHVQDVVFGCGFSTNRNFKFSGIFG-LGIGRSSL------VSQLNSSFSYCIGSLHDPD 218
                 V FGCG   N    F+G+ G LG+GR  L       +  N  FSYC+ S     
Sbjct: 240 ---FDGVYFGCG--ENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCLPS--SAS 292

Query: 219 YLHNKLILGDGAIIDEGDATPLQFI-DGH--YYITLEAISVDGRMLDINPNIFKRDDSGG 275
           Y  + L  G   I      TP+  I DG   Y + + AI+V G+ L I   +F       
Sbjct: 293 YTGH-LTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTP---- 347

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF---- 331
           G +IDSGT +T L  +AY ALR     ++      S       C+      DL GF    
Sbjct: 348 GALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCF------DLSGFKTVT 401

Query: 332 -PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
            P V F F GGA + L    +FY  +    C+A    + N+   N ++ G + QQ   V 
Sbjct: 402 IPKVAFSFSGGAVVELGSKGIFYVFKISQVCLAF---AGNSDDSNAAIFGNVQQQTLEVV 458

Query: 391 YDIGRKQKTFQRMDCE 406
           YD    +  F    C 
Sbjct: 459 YDGAGGRVGFAPNGCS 474


>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 711

 Score =  127 bits (320), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 103/363 (28%), Positives = 160/363 (44%), Gaps = 42/363 (11%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
           NS++ + + +G PP      +DTGS + W  C PC  C  Q   IF PS+SS+       
Sbjct: 377 NSVYLMKLQVGTPPFEIEAVIDTGSEITWTQCLPCVHCYKQNAPIFDPSKSST------- 429

Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
                 F   RC  + H C Y   Y     T   ++T+ +T  +T      + + + GCG
Sbjct: 430 ------FKEKRC--HDHSCPYEVDYFDKTYTKGTLATDTVTIHSTSGEPFVMAETIIGCG 481

Query: 178 FSTNRNFK--FSGIFGLGIGRSSLVSQLNSSF----SYCIGSLHDPDYLHNKLILGDGAI 231
              N  F+  F G  GL  G  SL++Q+   +    SYC           +K+  G  AI
Sbjct: 482 -RNNSWFRPSFEGFVGLNWGPLSLITQMGGEYPGLMSYCFAGNGT-----SKINFGTNAI 535

Query: 232 IDEGD-ATPLQFID----GHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVT 286
           +  G   +   F+     G YY+ L+A+SV    ++     F   +  G ++IDSGT +T
Sbjct: 536 VGGGGVVSTTMFVTTARPGFYYLNLDAVSVGDTRIETLGTPFHALE--GNIVIDSGTTLT 593

Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
           +  +     +R  V   +        +  D LCY+   S+  + FP +  HF GGA L L
Sbjct: 594 YFPESYCNLVRQAVEHVVPAVPAADPTGNDLLCYY---SNTTEIFPVITMHFSGGADLVL 650

Query: 347 EKDSMFYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           +K +MF +      FC+A+    I N     ++ G  AQ  + VGYD      +F+  +C
Sbjct: 651 DKYNMFMESYSGGLFCLAI----ICNNPTQEAIFGNRAQNNFLVGYDSSSLLVSFKPTNC 706

Query: 406 EVL 408
             L
Sbjct: 707 SAL 709



 Score =  124 bits (311), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 98/337 (29%), Positives = 149/337 (44%), Gaps = 48/337 (14%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + + + IG PP      +DTGS L+W  C PC  C  Q   IF PS+SS+          
Sbjct: 65  YLMKLQIGTPPFEVEAVLDTGSELIWTQCLPCLHCYDQKAPIFDPSKSST---------- 114

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
              F   RC+   H C Y  +Y     T   ++TE +T  +T      + + + GC    
Sbjct: 115 ---FKETRCNTPDHSCPYKLVYDDKSYTQGTLATETVTIHSTSGVPFVMPETIIGCS-RN 170

Query: 181 NRNFKF----SGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGD 236
           N    F    SGI GL  G  SL+SQ+  ++                   GDG +     
Sbjct: 171 NSGSGFRPSSSGIVGLSRGSLSLISQMGGAYP------------------GDGVVSTTMF 212

Query: 237 ATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEAL 296
           A   +   G YY+ L+A+SV    ++     F   +  G ++IDSGT +T+        +
Sbjct: 213 AKTAK--RGQYYLNLDAVSVGDTRIETVGTPFHALN--GNIVIDSGTPLTYFPVSYCNLV 268

Query: 297 RDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQ-P 355
           R  V   +  +++   S  D LCY+   S+ ++ FP +  HF GGA L L+K +M+ +  
Sbjct: 269 RKAVERVVTADRVVDPSRNDMLCYY---SNTIEIFPVITVHFSGGADLVLDKYNMYMELN 325

Query: 356 RPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
           R   FC+A+    I N     ++ G  AQ  + VGYD
Sbjct: 326 RGGVFCLAI----ICNNPTQVAIFGNRAQNNFLVGYD 358


>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
          Length = 448

 Score =  127 bits (320), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 98/330 (29%), Positives = 146/330 (44%), Gaps = 36/330 (10%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + +  SIG+PP+  +  +DTGS L+WV C PC  C+P    ++ P+RS S   +PC S+ 
Sbjct: 87  YIMQFSIGEPPLLIWAEVDTGSDLMWVKCSPCNGCNPPPSPLYDPARSRSSGKLPCSSQL 146

Query: 121 CRYFPYAR-----CSAYKHRCIYTQLYLIGPE--TSVFVSTEQLTFKN---TDESTIHVQ 170
           C+     R     CS     C Y   Y    +  T   + TE  TF +    +  +    
Sbjct: 147 CQALGRGRIISDQCSDDPPLCGYHYAYGHSGDHSTQGVLGTETFTFGDGYVANNVSFGRS 206

Query: 171 DVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNKLILGDG 229
           D + G  F        +G+ GLG G  SLVSQL +  F+YC+ +  DP+ +++ ++ G  
Sbjct: 207 DTIDGSQFGGT-----AGLVGLGRGHLSLVSQLGAGRFAYCLAA--DPN-VYSTILFGSL 258

Query: 230 AIID--EGDATPLQFI-------DGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMI 279
           A +D   GD +    +       D HYY+ L+ ISV G  L I    F    D  GGV  
Sbjct: 259 AALDTSAGDVSSTPLVTNPKPDRDTHYYVNLQGISVGGSRLPIKDGTFAINSDGSGGVFF 318

Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFR 339
           DSG   T L   AY+ +R  +   +   Q   Y   D  C+       +   P +  HF 
Sbjct: 319 DSGAIDTSLKDAAYQVVRQAITSEI---QRLGYDAGDDTCFVAANQQAVAQMPPLVLHFD 375

Query: 340 GGAKLALEKDSMFYQ----PRPDAFCMAVN 365
            GA ++L   +        P     CMA+ 
Sbjct: 376 DGADMSLNGRNYLKTSTKGPSEVLVCMAIK 405


>gi|20160862|dbj|BAB89801.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
          Length = 488

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 114/366 (31%), Positives = 162/366 (44%), Gaps = 45/366 (12%)

Query: 57  SNSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
           +N+  YV +  IG PP     A+D  S L+W  C      +P     F P RS++ A VP
Sbjct: 95  TNAGMYVFSYGIGTPPQQVSGALDISSDLVWTAC---GATAP-----FNPVRSTTVADVP 146

Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPE-TSVFVSTEQLTFKNTDESTIHVQDVVF 174
           C  + C+ F    C A    C YT +Y  G   T+  + TE  TF +T      +  VVF
Sbjct: 147 CTDDACQQFAPQTCGAGASECAYTYMYGGGAANTTGLLGTEAFTFGDT-----RIDGVVF 201

Query: 175 GCGFSTNRNFK-FSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAII 232
           GCG     +F   SG+ GLG G  SLVSQL    FSY        D   + ++ GD    
Sbjct: 202 GCGLKNVGDFSGVSGVIGLGRGNLSLVSQLQVDRFSYHFAPDDSVD-TQSFILFGD---- 256

Query: 233 DEGDATP---------LQFIDGH---YYITLEAISVDGRMLDINPNIF--KRDDSGGGVM 278
              DATP         L   D +   YY+ L  I VDG+ L I    F  +  D  GGV 
Sbjct: 257 ---DATPQTSHTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGGVF 313

Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHF 338
           +     VT L + AY+ LR  V  ++    +   +    LCY G   +  K  P++   F
Sbjct: 314 LSITDLVTVLEEAAYKPLRQAVASKIGLPAVNGSALGLDLCYTGESLAKAK-VPSMALVF 372

Query: 339 RGGAKLALEKDSMFYQPRPDAF-CMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
            GGA + LE  + FY        C+ + P+S  +     S++G + Q   ++ YDI   +
Sbjct: 373 AGGAVMELELGNYFYMDSTTGLACLTILPSSAGDG----SVLGSLIQVGTHMMYDINGSK 428

Query: 398 KTFQRM 403
             F+ +
Sbjct: 429 LVFESL 434


>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
          Length = 521

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 116/412 (28%), Positives = 176/412 (42%), Gaps = 60/412 (14%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSH-------NTYQAQILPS 53
           ++HRD +   + N +++  HR+   +     R+A L  ++ S        + +   ++  
Sbjct: 137 VVHRDQL--SFGNSDDH-RHRLDGRLKRDAKRVASLIRRLSSGGGGSYRVDDFGTDVISG 193

Query: 54  NDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAY 113
            +  +  ++V I +G PP  Q+  +D+GS ++WV C PC  C  Q   +F P+ S+S+  
Sbjct: 194 MEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADSASFTG 253

Query: 114 VPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
           V C S  C     A C A   RC Y   Y  G  T   ++ E LTF  T      V+ V 
Sbjct: 254 VSCSSSVCDRLENAGCHA--GRCRYEVSYGDGSYTKGTLALETLTFGRT-----MVRSVA 306

Query: 174 FGCGFSTNRNFKFSGIFGLGIGRS-SLVSQL----NSSFSYCIGSLHDPDYLHNKLILGD 228
            GCG      F  +       G S S V QL      +FSYC+ S      + N      
Sbjct: 307 IGCGHRNRGMFVGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSAAWVPLVRNPR---- 362

Query: 229 GAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTW 287
                             YYI L  + V G  + I+  +F+  + G GGV++D+GT VT 
Sbjct: 363 --------------APSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMDTGTAVTR 408

Query: 288 LVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----PTVRFHFRGGA 342
           L   AY+A RD  + +       +       CY      DL GF     PTV F+F GG 
Sbjct: 409 LPTLAYQAFRDAFLAQTANLPRATGVAIFDTCY------DLLGFVSVRVPTVSFYFSGGP 462

Query: 343 KLALEKDSMFYQPRPDA--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
            L L   + F  P  DA  FC A  P++        S++G + Q+   + +D
Sbjct: 463 ILTLPARN-FLIPMDDAGTFCFAFAPST-----SGLSILGNIQQEGIQISFD 508


>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
          Length = 465

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 141/446 (31%), Positives = 187/446 (41%), Gaps = 68/446 (15%)

Query: 1   LIHRDSILSPYNNPNENP--AHRVQRGINISIARLAYLQEKIRSHNTYQAQI-------- 50
           L+HR    +P       P  A R++R      AR  Y+  K     T    +        
Sbjct: 47  LVHRHGPCAPSAASGGKPSLAERLRR----DRARANYIVTKAAGGRTAATAVSDAVGGGG 102

Query: 51  --LPS--NDISNSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHCYPC--RDCSPQLGTIF 103
             +P+   D  +SL YV  + IG P V Q   +DTGS L WV C PC   +C  Q   +F
Sbjct: 103 TSIPTFLGDSVDSLEYVVTLGIGTPAVQQIVLIDTGSDLSWVQCKPCGAGECYAQKDPLF 162

Query: 104 YPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLI--GPE------TSVFVSTE 155
            PS SSSYA VPCDS+ CR        AY H C      L   G E      T+   STE
Sbjct: 163 DPSSSSSYASVPCDSDACRKL---AAGAYGHGCTSGAAALCEYGIEYGNRATTTGVYSTE 219

Query: 156 QLTFKNTDESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNSS----FSYC 210
            LT K      + V D  FGCG   +  + KF G+ GLG    SLVSQ +S     FSYC
Sbjct: 220 TLTLK----PGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFSYC 275

Query: 211 IGSLHDPDYLHNKLILGDGAIIDEGDA------TPLQFIDG---HYYITLEAISVDGRML 261
              L         L LG         A      TP++ I      Y +TL  ISV G  L
Sbjct: 276 ---LPPTSGGAGFLALGAPNSSSSSTAAAGFLFTPMRRIPSVPTFYVVTLTGISVGGAPL 332

Query: 262 DINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL--C 319
            + P+ F       G++IDSGT +T L   AY ALR      +   ++   S    L  C
Sbjct: 333 AVPPSAFSS-----GMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGAVLDTC 387

Query: 320 YHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLI 379
           Y     +++   PT+   F GGA + L   +          C+A   A  ++      +I
Sbjct: 388 YDFTGHTNVT-VPTIALTFSGGATIDLATPAGVLVDG----CLAFAGAGTDD---TIGII 439

Query: 380 GMMAQQFYNVGYDIGRKQKTFQRMDC 405
           G + Q+ + V YD G+    F+   C
Sbjct: 440 GNVNQRTFEVLYDSGKGTVGFRAGAC 465


>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
 gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
          Length = 465

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 114/367 (31%), Positives = 162/367 (44%), Gaps = 48/367 (13%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR--DCSPQLGTIFYPSRSSSYAYVPCDS 118
           + V +  G P VPQ   MDTGS + WV C PC   +C PQ   +F PS+SS+YA + C +
Sbjct: 125 YMVTLGFGTPSVPQVLLMDTGSDVSWVQCAPCNSTECYPQKDPLFDPSKSSTYAPIACGA 184

Query: 119 EHCRYFP---YARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
           + C          C++   +C Y   Y  G  T    S E +TF       I V+D  FG
Sbjct: 185 DACNKLGDHYRNGCTSGGTQCGYRVEYGDGSSTRGVYSNETITFA----PGITVKDFHFG 240

Query: 176 CGFST-NRNFKFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLH-DPDYLHNKLILGDG 229
           CG      + KF G+ GLG    SLV Q  S    +FSYC+ +L+ +  +L   +     
Sbjct: 241 CGHDQRGPSDKFDGLLGLGGAPESLVVQTASVYGGAFSYCLPALNSEAGFLALGVRPSAA 300

Query: 230 AIIDEGDATPLQFID---GHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVT 286
                   TP+  +      Y + +  ISV G+ LDI  + F+     GG++IDSGT VT
Sbjct: 301 TNTSAFVFTPMWHLPMDATSYMVNMTGISVGGKPLDIPRSAFR-----GGMLIDSGTIVT 355

Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
            L + AY AL   +        M +    D  CY+    S++   P V   F GGA + L
Sbjct: 356 ELPETAYNALNAALRKAFAAYPMVASEDFDT-CYNFTGYSNVT-VPRVALTFSGGATIDL 413

Query: 347 E-------KDSM-FYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQK 398
           +       KD + F +  PD               V   +IG + Q+   V YD G  + 
Sbjct: 414 DVPNGILVKDCLAFRESGPD---------------VGLGIIGNVNQRTLEVLYDAGHGKV 458

Query: 399 TFQRMDC 405
            F+   C
Sbjct: 459 GFRAGAC 465


>gi|21717166|gb|AAM76359.1|AC074196_17 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433306|gb|AAP54835.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|125575546|gb|EAZ16830.1| hypothetical protein OsJ_32301 [Oryza sativa Japonica Group]
          Length = 373

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 100/366 (27%), Positives = 159/366 (43%), Gaps = 31/366 (8%)

Query: 57  SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPC 116
           S  L+  N++IG PP P    +      +W  C PCR C  Q   +F  S SS+Y   PC
Sbjct: 24  SQPLYMANLTIGTPPQPASAIIHLAGEFVWTQCSPCRRCFKQDLPLFNRSASSTYRPEPC 83

Query: 117 DSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
            +  C   P + CS     C Y    + G +TS    T+         S      + FGC
Sbjct: 84  GTALCESVPASTCSG-DGVCSYEVETMFG-DTSGIGGTDTFAIGTATAS------LAFGC 135

Query: 177 GFSTN--RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIID 233
              +N  +    SG+ GLG    SLV Q+N ++FSYC+   H      + L+LG  A + 
Sbjct: 136 AMDSNIKQLLGASGVVGLGRTPWSLVGQMNATAFSYCLAP-HGAAGKKSALLLGASAKLA 194

Query: 234 EGDA---TPLQFI---DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
            G +   TPL         Y I LE I     ++   PN       G  V++D+   V++
Sbjct: 195 GGKSAATTPLVNTSDDSSDYMIHLEGIKFGDVIIAPPPN-------GSVVLVDTIFGVSF 247

Query: 288 LVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCY-----HGIMSSDLKGFPTVRFHFRGGA 342
           LV  A++A++  V + +    M + + P  LC+         +S L   P V   F+G A
Sbjct: 248 LVDAAFQAIKKAVTVAVGAAPMATPTKPFDLCFPKAAAAAGANSSLP-LPDVVLTFQGAA 306

Query: 343 KLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
            L +      Y       C+A+  +++ N     S++G + Q+  +  +D+ ++  +F+ 
Sbjct: 307 ALTVPPSKYMYDAGNGTVCLAMMSSAMLNLTTELSILGRLHQENIHFLFDLDKETLSFEP 366

Query: 403 MDCEVL 408
            DC  L
Sbjct: 367 ADCSSL 372


>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
 gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
          Length = 357

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 104/361 (28%), Positives = 156/361 (43%), Gaps = 31/361 (8%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++  + IG P    +  +DTGS + W+ C PC  C  Q+  I+ PS SSSY  V C S  
Sbjct: 12  YFARMGIGNPQRSYYLELDTGSDVTWIQCAPCSSCYSQVDPIYDPSNSSSYRRVYCGSAL 71

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C+   Y+ C      C Y  +Y     +S  +  E  +F     S+  ++++ FGCG S 
Sbjct: 72  CQALDYSACQGMG--CSYRVVYGDSSASSGDLGIE--SFYLGPNSSTAMRNIAFGCGHSN 127

Query: 181 NRNFKFSGIFGLGIGR-----SSLVSQLNSSFSYCIGSLHDP-DYLHNKLILGDGAIIDE 234
           +  F+         G      S + + +  +FSYC+   +       + LI G  AI   
Sbjct: 128 SGLFRGEAGLLGMGGGTLSFFSQIAASIGPAFSYCLVDRYSQLQSRSSPLIFGRTAIPFA 187

Query: 235 GDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVK 290
              TPL     I+  YY  L  ISV G  L I P  F    +G GG ++DSGT VT +V 
Sbjct: 188 ARFTPLLKNPRINTFYYAVLTGISVGGTPLPIPPAQFALTGNGTGGAILDSGTSVTRVVP 247

Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR-----FHFRGGAKLA 345
            AY  LRD               +    C+      + +G PTV+      HF  G  + 
Sbjct: 248 PAYAVLRDAYRAASRNLPPAPGVYLLDTCF------NFQGLPTVQIPSLVLHFDNGVDMV 301

Query: 346 LEKDSMFYQ-PRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
           L   ++     R   FC+A  P+S+       S+IG + QQ + +G+D+ R        +
Sbjct: 302 LPGGNILIPVDRSGTFCLAFAPSSM-----PISVIGNVQQQTFRIGFDLQRSLIAIAPRE 356

Query: 405 C 405
           C
Sbjct: 357 C 357


>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
 gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
          Length = 359

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 107/366 (29%), Positives = 169/366 (46%), Gaps = 37/366 (10%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLGTIFYPSRSSSYAYVPCDS 118
           + + +SIG PP      +DTGS L+W+ C  C  C       TIF+   SSSY  +PC+S
Sbjct: 5   YMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKLPCNS 64

Query: 119 EHCRYFPYA----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIH---VQD 171
            HC     A    RC   +  C Y   Y  G  TS  V +++++F++      H      
Sbjct: 65  THCSGMSSAGIGPRC---EETCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSFFDG 121

Query: 172 VVFGCGFSTNRNFKFS-GIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLIL 226
            +FGC      ++ F+ G+ GLG    SL+ QL       FSYC+ S   P    + L L
Sbjct: 122 FLFGCARKLKGDWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKSFLFL 181

Query: 227 GDGAIIDEGDATPLQFIDGH------YYITLEAISVDGRMLDINPNIFKRDDSGG----- 275
           G  A +   D      + G       YY+ L++I++ G  + +       + S G     
Sbjct: 182 GSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITIGGVPVVVYDKESGHNTSVGPFLAN 241

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLK-GFPTV 334
             +IDSGT  T L    YEA+R  +  ++    + + +  D LC++   S D   GFP+V
Sbjct: 242 KTVIDSGTTYTLLTPPVYEAMRKSIEEQVILPTLGNSAGLD-LCFNS--SGDTSYGFPSV 298

Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
            F+F    +L L  +++F     D  C+     S+++   + S+IG M QQ +++ YD+ 
Sbjct: 299 TFYFANQVQLVLPFENIFQVTSRDVVCL-----SMDSSGGDLSIIGNMQQQNFHILYDLV 353

Query: 395 RKQKTF 400
             Q +F
Sbjct: 354 ASQISF 359


>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
 gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
          Length = 516

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 103/364 (28%), Positives = 154/364 (42%), Gaps = 27/364 (7%)

Query: 52  PSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSS 110
           P   +    + V + +G P        DTGS   WV C PC   C  Q   +F P+RSS+
Sbjct: 170 PGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSST 229

Query: 111 YAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
           YA V C +  C       CS     C+Y   Y  G  +  F + + LT  + D     V+
Sbjct: 230 YANVSCAAPACSDLDTRGCSG--GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDA----VK 283

Query: 171 DVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLI 225
              FGCG      F + +G+ GLG G++SL  Q        F++C+ +          L 
Sbjct: 284 GFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTGT---GYLD 340

Query: 226 LGDGAIIDEGDATPLQFIDG--HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGT 283
            G G+       TP+   +G   YY+ L  I V GR+L I  ++F       G ++DSGT
Sbjct: 341 FGAGSPAARLTTTPMLVDNGPTFYYVGLTGIRVGGRLLYIPQSVFAT----AGTIVDSGT 396

Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL--CYHGIMSSDLKGFPTVRFHFRGG 341
            +T L   AY +LR      +     +       L  CY     S +   PTV   F+GG
Sbjct: 397 VITRLPPAAYSSLRSAFAAAMSARGYKKAPAVSLLDTCYDFAGMSQVA-IPTVSLLFQGG 455

Query: 342 AKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQ 401
           A+L ++   + Y       C+A    + N    +  ++G    + + V YDIG+K  +F 
Sbjct: 456 ARLDVDASGIMYAASASQVCLAF---AANEDGGDVGIVGNTQLKTFGVAYDIGKKVVSFS 512

Query: 402 RMDC 405
              C
Sbjct: 513 PGAC 516


>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
 gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
          Length = 373

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 112/373 (30%), Positives = 164/373 (43%), Gaps = 34/373 (9%)

Query: 67  IGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHC----R 122
           IG PP      +DT S L WV    C +CSP     F P  SSS+   PC S  C    +
Sbjct: 5   IGTPPREVLLLVDTASELTWVQGTSCTNCSPTKVPPFNPGLSSSFISEPCTSSVCLGRSK 64

Query: 123 YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS-TN 181
               + C+     C +   YL G E    ++ E  + ++ D +   + DV+FGC      
Sbjct: 65  LGFQSACNRSTGSCSFQVAYLDGSEAYGVIAREIFSLQSWDGAASTLGDVIFGCASKDLQ 124

Query: 182 RNFKF-SGIFGLGIGRSSLVSQLNS--------SFSYCIGSLHDPDYLHNKLILGDGAI- 231
           R   F SG  GL  G  S  +Q+ S         FSYC  +  +       +I GD  I 
Sbjct: 125 RPVDFSSGTLGLNRGSFSFPAQIGSRSKSGLSDRFSYCFPNRAEHLNSSGVIIFGDSGIP 184

Query: 232 ------IDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTD 284
                 +      P+  I   YY+ L+ ISV G +L I  + FK D  G GG   DSGT 
Sbjct: 185 AHHFQYLSLEQEPPIASIVDFYYVGLQGISVGGELLHIPRSAFKIDRLGNGGTYFDSGTT 244

Query: 285 VTWLVKEAYEALRDEVMIR-LEGEQMRSYSWPDKLCYHGIMSSD--LKGFPTVRFHFRGG 341
           V++LV+ A+ AL +    R L   +     +  +LCY  + + D  L   P V  HF+  
Sbjct: 245 VSFLVEPAHTALVEAFGRRVLHLNRTSGSDFTKELCYD-VAAGDARLPTAPLVTLHFKNN 303

Query: 342 AKLALEKDSMFY----QPRPDAFCMA-VNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
             + L + S++      P+    C+A VN  ++    VN  +IG   QQ Y + +D+ R 
Sbjct: 304 VDMELREASVWVPLARTPQVVTICLAFVNAGAVAQGGVN--VIGNYQQQDYLIEHDLERS 361

Query: 397 QKTFQRMDCEVLD 409
           +  F   +C V+D
Sbjct: 362 RIGFAPANC-VMD 373


>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 492

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 109/352 (30%), Positives = 160/352 (45%), Gaps = 23/352 (6%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++  + +GQP  P +  +DTGS + W+ C PC DC  Q   IF P+ SSSY  + CD++ 
Sbjct: 157 YFSRVGVGQPSKPFYMVLDTGSDVNWLQCKPCSDCYQQSDPIFDPTASSSYNPLTCDAQQ 216

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C+    + C     +C+Y   Y  G  T     TE ++F         V  V  GCG   
Sbjct: 217 CQDLEMSACR--NGKCLYQVSYGDGSFTVGEYVTETVSF-----GAGSVNRVAIGCGHDN 269

Query: 181 NRNF-KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
              F   +G+ GLG G  SL SQ+  +SFSYC   L D D   +  +  +     +    
Sbjct: 270 EGLFVGSAGLLGLGGGPLSLTSQIKATSFSYC---LVDRDSGKSSTLEFNSPRPGDSVVA 326

Query: 239 PL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYE 294
           PL   Q ++  YY+ L  +SV G ++ + P  F  D SG GGV++DSGT +T L  +AY 
Sbjct: 327 PLLKNQKVNTFYYVELTGVSVGGEIVTVPPETFAVDQSGAGGVIVDSGTAITRLRTQAYN 386

Query: 295 ALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL-EKDSMFY 353
           ++RD    +    +          CY  + S      PTV FHF G    AL  K+ +  
Sbjct: 387 SVRDAFKRKTSNLRPAEGVALFDTCYD-LSSLQSVRVPTVSFHFSGDRAWALPAKNYLIP 445

Query: 354 QPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
                 +C A  P +      + S+IG + QQ   V +D+      F    C
Sbjct: 446 VDGAGTYCFAFAPTT-----SSMSIIGNVQQQGTRVSFDLANSLVGFSPNKC 492


>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
 gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
          Length = 466

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 124/427 (29%), Positives = 189/427 (44%), Gaps = 47/427 (11%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHN-----TYQAQILPSN- 54
           L HR    SP  + N+ PA   +R +     R AY++ K             A  +P+  
Sbjct: 65  LHHRHGPCSPVPS-NKMPASLEER-LQRDQLRAAYIKRKFSGAKGGDVEQSDAATVPTTL 122

Query: 55  --DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYA 112
              +S   + + + IG P V Q  +MDTGS + WV C PC  C  ++ ++F PS SS+Y+
Sbjct: 123 GTSLSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVDSLFDPSASSTYS 182

Query: 113 YVPCDSEHCRYFPYAR----CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIH 168
              C S  C     ++    CS+   +C Y   Y+ G  T+   S++ LT  +       
Sbjct: 183 PFSCSSAACVQLSQSQQGNGCSS--SQCQYIVSYVDGSSTTGTYSSDTLTLGSN-----A 235

Query: 169 VQDVVFGCGFSTNRNF--KFSGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYLHN 222
           ++   FGC  S +  F  +  G+ GLG    SLVSQ       +FSYC+     P    +
Sbjct: 236 IKGFQFGCSQSESGGFSDQTDGLMGLGGDAQSLVSQTAGTFGKAFSYCL-----PPTPGS 290

Query: 223 KLILGDGAIIDEG-DATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVM 278
              L  GA    G   TP+     I  +Y + LEAI V G+ L+I  ++F    S G VM
Sbjct: 291 SGFLTLGAASRSGFVKTPMLRSTQIPTYYGVLLEAIRVGGQQLNIPTSVF----SAGSVM 346

Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHF 338
            DSGT +T L   AY AL       ++       S     C+     S +   P+V   F
Sbjct: 347 -DSGTVITRLPPTAYSALSSAFKAGMKKYPPAQPSGILDTCFDFSGQSSVS-IPSVALVF 404

Query: 339 RGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQK 398
            GGA + L+ + +  +   D +C+A    + N+   +   IG + Q+ + V YD+G    
Sbjct: 405 SGGAVVNLDFNGIMLE--LDNWCLAF---AANSDDSSLGFIGNVQQRTFEVLYDVGGGAV 459

Query: 399 TFQRMDC 405
            F+   C
Sbjct: 460 GFRAGAC 466


>gi|296090179|emb|CBI39998.3| unnamed protein product [Vitis vinifera]
          Length = 334

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 98/366 (26%), Positives = 160/366 (43%), Gaps = 62/366 (16%)

Query: 52  PSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSY 111
           P    +N  + + ISIG PP   +   DTGS L+W  C PC  C  Q   +F PS+S+S+
Sbjct: 15  PPVSSNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPSKSTSF 74

Query: 112 AYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
             V C+S+ CR                                         ++   + +
Sbjct: 75  KEVSCESQQCRLL---------------------------------------DTPTSILN 95

Query: 172 VVFGCGFSTNRNFKFS--GIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYLHNK 223
           +VFGCG + +  F  +  G+FG G    SL SQ+ S+      FS C+        + +K
Sbjct: 96  IVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSK 155

Query: 224 LILGDGAIIDEGD--ATPLQFID--GHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
           +I G  A +   D  +TPL   D   +Y++TL+ ISV  ++   + +      + G V I
Sbjct: 156 IIFGPEAEVSGSDVVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSS--SPMATKGNVFI 213

Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFR 339
           D+GT  T L ++ Y  L   V   +  E ++      +LCY    S+ L   P +  HF 
Sbjct: 214 DAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQLCYR---SATLIDGPILTAHFD 270

Query: 340 GGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKT 399
            GA + L+  + F  P+   +C A+ P   +       + G   Q  + +G+D+  K+ +
Sbjct: 271 -GADVQLKPLNTFISPKEGVYCFAMQPIDGDT-----GIFGNFVQMNFLIGFDLDGKKVS 324

Query: 400 FQRMDC 405
           F+ +DC
Sbjct: 325 FKAVDC 330


>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
 gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
          Length = 353

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 110/347 (31%), Positives = 157/347 (45%), Gaps = 25/347 (7%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++  I +G P    +   DTGS + W+ C PCR C  Q   IF PS SSS+  + C S  
Sbjct: 14  YFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDPIFNPSLSSSFKPLACASSI 73

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C       CS  K++C+Y   Y  G  T    STE L+F         V+ V  GCG + 
Sbjct: 74  CGKLKIKGCS-RKNKCMYQVSYGDGSFTVGDFSTETLSFGEH-----AVRSVAMGCGRNN 127

Query: 181 NRNF-KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
              F   +G+ GLG G  S  SQ  +S    FSYC+        +   L+ G  A+ ++ 
Sbjct: 128 QGLFHGAAGLLGLGRGPLSFPSQTGTSYASVFSYCLPRRE--SAIAASLVFGPSAVPEKA 185

Query: 236 DAT---PLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKE 291
             T   P + +D +YY+ L  I V G  ++I P+ F     G GGV++DSGT ++ L   
Sbjct: 186 RFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGGVIVDSGTAISRLTTP 245

Query: 292 AYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSM 351
           AY ALRD     +        S  D  CY  + S      P V   F GGA + L  D +
Sbjct: 246 AYTALRDAFRSLVTFPSAPGISLFDT-CYD-LSSMKTATLPAVVLDFDGGASMPLPADGI 303

Query: 352 FYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
                 +  +C+A  P         FS+IG + QQ + +  D  ++Q
Sbjct: 304 LVNVDDEGTYCLAFAP-----EEEAFSIIGNVQQQTFRISIDNQKEQ 345


>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
 gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
          Length = 466

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 106/363 (29%), Positives = 163/363 (44%), Gaps = 32/363 (8%)

Query: 61  FYVNISIGQPPVPQFTAM-DTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSE 119
           ++V + +G P V +FT + DTGS L WV C      SP  G +F P  S S+A +PC S+
Sbjct: 116 YFVKLRVGTP-VQEFTLVADTGSDLTWVKCA---GASPP-GRVFRPKTSRSWAPIPCSSD 170

Query: 120 HCRY---FPYARCSAYKHRCIYTQLYLIGPETSV-FVSTEQLTFKNTDESTIHVQDVVFG 175
            C+    F  A CS+    C Y   Y  G   +   V TE  T          ++DVV G
Sbjct: 171 TCKLDVPFTLANCSSPASPCTYDYRYKEGSAGARGIVGTESATIALPGGKVAQLKDVVLG 230

Query: 176 CGFSTN-RNFKFS-GIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDG 229
           C  S + ++F+ + G+  LG  + S  +Q       SFSYC+     P      L  G G
Sbjct: 231 CSSSHDGQSFRSADGVLSLGNAKISFATQAAARFGGSFSYCLVDHLAPRNATGYLAFGPG 290

Query: 230 AIIDEGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVT 286
            +          F+D     Y + ++AI V G+ LDI   ++  D   GGV++DSG  +T
Sbjct: 291 QVPRTPATQTKLFLDPEMPFYGVKVDAIHVAGKALDIPAEVW--DAKSGGVILDSGNTLT 348

Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG----FPTVRFHFRGGA 342
            L   AY+A+   +   L+G    S+  P + CY+   ++   G     P +   F G A
Sbjct: 349 VLAAPAYKAVVAALSKHLDGVPKVSFP-PFEHCYN--WTARRPGAPEIIPKLAVQFAGSA 405

Query: 343 KLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
           +L     S     +P   C+ V        +   S+IG + QQ +   +D+   Q  F++
Sbjct: 406 RLEPPAKSYVIDVKPGVKCIGVQ----EGEWPGLSVIGNIMQQEHLWEFDLKNMQVRFKQ 461

Query: 403 MDC 405
            +C
Sbjct: 462 SNC 464


>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
 gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
          Length = 434

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 112/382 (29%), Positives = 172/382 (45%), Gaps = 49/382 (12%)

Query: 54  NDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAY 113
           N +  + + V+++IG PP P    +DTGS L+W  C PC  C  Q    F PS SS+ + 
Sbjct: 75  NGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSL 134

Query: 114 VPCDSEHCRYFPYARCSAYK----HRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHV 169
             CDS  C+  P A C + K      C+YT  Y     T+ F+  ++ TF     S   V
Sbjct: 135 TSCDSTLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGAS---V 191

Query: 170 QDVVFGCGFSTNRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLIL 226
             V FGCG   N  FK   +GI G G G  SL SQL   +FS+C  +++    L    +L
Sbjct: 192 PGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVNG---LKPSTVL 248

Query: 227 ----------GDGAIIDEGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDS 273
                     G GA+     +TPL     +   YY++L+ I+V    L +  + F   + 
Sbjct: 249 LDLPADLYKSGRGAV----QSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNG 304

Query: 274 GGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG--- 330
            GG +IDSGT +T L    Y  +RD    +++   +   +     C    +S+ L+    
Sbjct: 305 TGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFC----LSAPLRAKPY 360

Query: 331 FPTVRFHFRGGAKLALEKDSMFYQPRPDA----FCMAVNPASINNRYVNFSLIGMMAQQF 386
            P +  HF  GA + L +++  ++   DA     C+A+            + IG   QQ 
Sbjct: 361 VPKLVLHFE-GATMDLPRENYVFEVE-DAGSSILCLAIIEGG------EVTTIGNFQQQN 412

Query: 387 YNVGYDIGRKQKTFQRMDCEVL 408
            +V YD+   + +F    C+ L
Sbjct: 413 MHVLYDLQNSKLSFVPAQCDKL 434


>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
          Length = 482

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 110/365 (30%), Positives = 154/365 (42%), Gaps = 37/365 (10%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + V    G P       +DTGS L W+ C PC DC  Q+  IF P +SSSY  +PC S  
Sbjct: 137 YIVTAGFGTPAKNSLLIIDTGSDLTWIQCKPCADCYSQVDAIFEPKQSSSYKTLPCLSAT 196

Query: 121 CRYFPYARCSAYK---HRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
           C     +  +        C+Y   Y  G  +    S E LT  +        Q+  FGCG
Sbjct: 197 CTELITSESNPTPCLLGGCVYEINYGDGSSSQGDFSQETLTLGSDS-----FQNFAFGCG 251

Query: 178 FSTNRNFK-FSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPD----YLHNKLILGD 228
            +    FK  SG+ GLG    S  SQ  S     F+YC+     PD           +G 
Sbjct: 252 HTNTGLFKGSSGLLGLGQNSLSFPSQSKSKYGGQFAYCL-----PDFGSSTSTGSFSVGK 306

Query: 229 GAIIDEGDATPL--QFI-DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDV 285
           G+I      TPL   F+    Y++ L  ISV G  L I P +  R    G  ++DSGT +
Sbjct: 307 GSIPASAVFTPLVSNFMYPTFYFVGLNGISVGGDRLSIPPAVLGR----GSTIVDSGTVI 362

Query: 286 TWLVKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKL 344
           T L+ +AY AL+     +       + +S  D  CY     S ++  PT+ FHF+  A +
Sbjct: 363 TRLLPQAYNALKTSFRSKTRDLPSAKPFSILDT-CYDLSRHSQVR-IPTITFHFQNNADV 420

Query: 345 ALEKDSMF--YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
           A+    +    Q      C+A   AS   +   F++IG   QQ   V +D G  +  F  
Sbjct: 421 AVSDVGILVPVQNGGSQVCLAFASAS---QMDGFNIIGNFQQQRMRVAFDTGAGRIGFAS 477

Query: 403 MDCEV 407
             C  
Sbjct: 478 GSCAA 482


>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
 gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
          Length = 464

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 114/432 (26%), Positives = 187/432 (43%), Gaps = 56/432 (12%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDIS--- 57
           L HR    SP  +  E P+H  +  +     R AY+Q K+ S     A+ L  + ++   
Sbjct: 62  LSHRHGPCSPVIS-KEKPSH--EETLRRDQLRAAYIQAKVSSRYNNVAKELQQSAVTIPT 118

Query: 58  -------NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC--RDCSPQLGTIFYPSRS 108
                   + + + ++IG P V Q  ++DTGS + WV C PC  + CS Q   +F P+ S
Sbjct: 119 SSGYSLGTTEYVITVTIGTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKDKLFDPAMS 178

Query: 109 SSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIH 168
           ++Y+   C S  C           K +C Y   Y  G  T+    ++ L+  ++D     
Sbjct: 179 ATYSAFSCGSAQCAQLGDEGNGCLKSQCQYIVKYGDGSNTAGTYGSDTLSLTSSDA---- 234

Query: 169 VQDVVFGCGFSTNRNFKF----SGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYL 220
           V+   FGC   ++R   F     G+ GLG    SLVSQ  ++    FSYC+         
Sbjct: 235 VKSFQFGC---SHRAAGFVGELDGLMGLGGDTESLVSQTAATYGKAFSYCLPPPSSSGGG 291

Query: 221 HNKLILGDGAIIDEGDATPL-QF-IDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVM 278
              L    GA       TP+ +F +   Y + L+ I+V G ML++  ++F      G  +
Sbjct: 292 FLTLGAAGGASSSRYSHTPMVRFSVPTFYGVFLQGITVAGTMLNVPASVFS-----GASV 346

Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----PT 333
           +DSGT +T L   AY+ALR      ++     +       C+      D  GF     PT
Sbjct: 347 VDSGTVITQLPPTAYQALRTAFKKEMKAYPSAAPVGSLDTCF------DFSGFNTITVPT 400

Query: 334 VRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
           V   F  GA + L+   + Y     A C+A    + +    +  ++G + Q+ + + +D+
Sbjct: 401 VTLTFSRGAAMDLDISGILY-----AGCLAFTATAHDG---DTGILGNVQQRTFEMLFDV 452

Query: 394 GRKQKTFQRMDC 405
           G +   F+   C
Sbjct: 453 GGRTIGFRSGAC 464


>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
 gi|194696366|gb|ACF82267.1| unknown [Zea mays]
 gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
          Length = 411

 Score =  125 bits (314), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 97/302 (32%), Positives = 139/302 (46%), Gaps = 35/302 (11%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD--CSPQLGTIFYPSRSSSYAYVPCDS 118
           + V +S G P VPQ   +DTGS + W+ C PC    C PQ   ++ PS SS+Y+ VPC S
Sbjct: 79  YVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPSHSSTYSAVPCAS 138

Query: 119 EHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
           + C+      Y        +C +   Y  G  T    S ++LT          VQ+  FG
Sbjct: 139 DVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDKLTL----APGAIVQNFYFG 194

Query: 176 CGFSTNR-NFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHD-PDYLHNKLILGDGAIID 233
           CG   +     F G+ GLG  R SL ++    FSYC+ S+   P +L     LG G    
Sbjct: 195 CGHGKHAVRGLFDGVLGLGRLRESLGARYGGVFSYCLPSVSSKPGFLA----LGAGKNPS 250

Query: 234 EGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVK 290
               TP+  + G      +TL  I+V G+ LD+ P+ F      GG+++DSGT +T L  
Sbjct: 251 GFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFS-----GGMIVDSGTVITGLQS 305

Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----PTVRFHFRGGAKLA 345
            AY ALR      +E  ++      D  CY      +L G+     P +   F GGA + 
Sbjct: 306 TAYRALRSAFRKAMEAYRLLPNGDLDT-CY------NLTGYKNVVVPKIALTFTGGATIN 358

Query: 346 LE 347
           L+
Sbjct: 359 LD 360


>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
          Length = 434

 Score =  125 bits (314), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 112/382 (29%), Positives = 172/382 (45%), Gaps = 49/382 (12%)

Query: 54  NDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAY 113
           N +  + + V+++IG PP P    +DTGS L+W  C PC  C  Q    F PS SS+ + 
Sbjct: 75  NGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSL 134

Query: 114 VPCDSEHCRYFPYARCSAYK----HRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHV 169
             CDS  C+  P A C + K      C+YT  Y     T+ F+  ++ TF     S   V
Sbjct: 135 TSCDSTLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGAS---V 191

Query: 170 QDVVFGCGFSTNRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLIL 226
             V FGCG   N  FK   +GI G G G  SL SQL   +FS+C  +++    L    +L
Sbjct: 192 PGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVNG---LKPSTVL 248

Query: 227 ----------GDGAIIDEGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDS 273
                     G GA+     +TPL     +   YY++L+ I+V    L +  + F   + 
Sbjct: 249 LDLPADLYKSGRGAV----QSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFTLKNG 304

Query: 274 GGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG--- 330
            GG +IDSGT +T L    Y  +RD    +++   +   +     C    +S+ L+    
Sbjct: 305 TGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFC----LSAPLRAKPY 360

Query: 331 FPTVRFHFRGGAKLALEKDSMFYQPRPDA----FCMAVNPASINNRYVNFSLIGMMAQQF 386
            P +  HF  GA + L +++  ++   DA     C+A+            + IG   QQ 
Sbjct: 361 VPKLVLHFE-GATMDLPRENYVFEVE-DAGSSILCLAIIEGG------EVTTIGNFQQQN 412

Query: 387 YNVGYDIGRKQKTFQRMDCEVL 408
            +V YD+   + +F    C+ L
Sbjct: 413 MHVLYDLQNSKLSFVPAQCDKL 434


>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
 gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
          Length = 420

 Score =  125 bits (314), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 110/347 (31%), Positives = 156/347 (44%), Gaps = 25/347 (7%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++  I +G P    +   DTGS + W+ C PCR C  Q   IF PS SSS+  + C S  
Sbjct: 81  YFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDPIFNPSLSSSFKPLACASSI 140

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C       CS  K+ C+Y   Y  G  T    STE L+F         V+ V  GCG + 
Sbjct: 141 CGKLKIKGCS-RKNECMYQVSYGDGSFTVGDFSTETLSFGEH-----AVRSVAMGCGRNN 194

Query: 181 NRNF-KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
              F   +G+ GLG G  S  SQ  +S    FSYC+        +   L+ G  A+ ++ 
Sbjct: 195 QGLFHGAAGLLGLGRGPLSFPSQTGTSYASVFSYCLPRRE--SAIAASLVFGPSAVPEKA 252

Query: 236 DAT---PLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKE 291
             T   P + +D +YY+ L  I V G  ++I P+ F     G GGV++DSGT ++ L   
Sbjct: 253 RFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGGVIVDSGTAISRLTTP 312

Query: 292 AYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSM 351
           AY ALRD     +        S  D  CY  + S      P V   F GGA + L  D +
Sbjct: 313 AYTALRDAFRSLVTFPSAPGISLFDT-CYD-LSSMKTATLPAVVLDFDGGASMPLPADGI 370

Query: 352 FYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
                 +  +C+A  P         FS+IG + QQ + +  D  ++Q
Sbjct: 371 LVNVDDEGTYCLAFAP-----EEEAFSIIGNVQQQTFRISIDNQKEQ 412


>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 475

 Score =  125 bits (314), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 128/416 (30%), Positives = 183/416 (43%), Gaps = 49/416 (11%)

Query: 1   LIHRDSI--LSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHN-TYQAQILPSNDIS 57
           L+HRD +   + Y++       R+QR       R A L  ++ +   TY A+   S+ +S
Sbjct: 72  LVHRDKVPTFNTYHDHRTRFNARMQR----DTKRAASLLRRLAAGKPTYAAEAFGSDVVS 127

Query: 58  -----NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYA 112
                +  ++V I +G PP  Q+  MD+GS ++WV C PC  C  Q   +F P+ SSS++
Sbjct: 128 GMEQGSGEYFVRIGVGSPPRNQYVVMDSGSDIIWVQCEPCTQCYHQSDPVFNPADSSSFS 187

Query: 113 YVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
            V C S  C +   A C  ++ RC Y   Y  G  T   ++ E +TF  T      +++V
Sbjct: 188 GVSCASTVCSHVDNAAC--HEGRCRYEVSYGDGSYTKGTLALETITFGRT-----LIRNV 240

Query: 173 VFGCGFSTNRNFKFSGIFGLGIGRS-SLVSQL----NSSFSYCIGSLHDPDYLHNKLILG 227
             GCG      F  +       G   S V QL      +FSYC+ S          L  G
Sbjct: 241 AIGCGHHNQGMFVGAAGLLGLGGGPMSFVGQLGGQTGGAFSYCLVSRGIES--SGLLEFG 298

Query: 228 DGAIIDEGDATPLQF---IDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGT 283
             A+       PL         YYI L  + V G  + I+ ++FK  + G GGV++D+GT
Sbjct: 299 REAMPVGAAWVPLIHNPRAQSFYYIGLSGLGVGGLRVSISEDVFKLSELGDGGVVMDTGT 358

Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----PTVRFHF 338
            VT L   AYEA RD  + +       S       CY      DL GF     PTV F+F
Sbjct: 359 AVTRLPTVAYEAFRDGFIAQTTNLPRASGVSIFDTCY------DLFGFVSVRVPTVSFYF 412

Query: 339 RGGAKLALEKDSMFYQPRPD--AFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
            GG  L L   + F  P  D   FC A  P+S        S+IG + Q+   +  D
Sbjct: 413 SGGPILTLPARN-FLIPVDDVGTFCFAFAPSS-----SGLSIIGNIQQEGIQISVD 462


>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 464

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 124/435 (28%), Positives = 189/435 (43%), Gaps = 62/435 (14%)

Query: 11  YNNPNENPAHRVQRGINISIARLAYLQEKI------RSHNTYQAQILPSNDIS------- 57
           +++  ++ A      +    AR++ LQ +I      RS +   A  L    ++       
Sbjct: 51  FSSGGKSRAEEAHAVLASDAARVSSLQRRIGSYGLIRSSDAASASKLAQVPVTSGARLRT 110

Query: 58  -NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPC 116
            N +  V I  G+  V     +DT S L WV C PC  C  Q   +F PS S SYA VPC
Sbjct: 111 LNYVATVGIGGGEATV----IVDTASELTWVQCEPCDACHDQQEPLFDPSSSPSYAAVPC 166

Query: 117 DSEHCRYFPYAR------CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
           +S  C     A       C      C YT  Y  G  +   ++ ++L+    D     +Q
Sbjct: 167 NSSSCDALRVATGMSGQACDDQPAACSYTLSYRDGSYSRGVLAHDRLSLAGED-----IQ 221

Query: 171 DVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVS----QLNSSFSYCIGSLHDPDYLHNKLI 225
             VFGCG S    F   SG+ GLG  + SL+S    Q    FSYC+            L+
Sbjct: 222 GFVFGCGTSNQGPFGGTSGLMGLGRSQLSLISQTMDQFGGVFSYCLPPKESGS--SGSLV 279

Query: 226 LGDGAIIDEGDATPLQF-------IDGHYYIT-LEAISVDGRMLDINPNIFKRDDSGGGV 277
           LGD A +   ++TP+ +       + G +Y+  L  I+V G   D+    F     GG  
Sbjct: 280 LGDDASVYR-NSTPIVYTAMVSDPLQGPFYLANLTGITVGGE--DVQSPGFSA-GGGGKA 335

Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRL-EGEQMRSYSWPDKLCYHGIMSSDLKGF----- 331
           ++DSGT +T LV   Y A+R E + +L E  Q   +S  D  C+      DL G      
Sbjct: 336 IVDSGTIITSLVPSVYAAVRAEFVSQLAEYPQAAPFSILDT-CF------DLTGLREVQV 388

Query: 332 PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
           P+++  F GGA++ ++   + Y    DA  + +  AS+ + Y +  +IG   Q+   V +
Sbjct: 389 PSLKLVFDGGAEVEVDSKGVLYVVTGDASQVCLALASLKSEY-DTPIIGNYQQKNLRVIF 447

Query: 392 DIGRKQKTFQRMDCE 406
           D    Q  F +  C+
Sbjct: 448 DTVGSQIGFAQETCD 462


>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
 gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
          Length = 483

 Score =  125 bits (313), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 122/440 (27%), Positives = 182/440 (41%), Gaps = 50/440 (11%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           LIHR+S+L             +   +     R+ +++ K +     + +   S D++  +
Sbjct: 60  LIHRNSLLREAKEKLHTHEQLLLETLQRDEQRVRWIESKAQLAGKKKDEA-SSTDLNGPV 118

Query: 61  ----------FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSS 110
                     ++V + +G P    F  +DTGS L W+ C PC+ C  Q   IF P  SSS
Sbjct: 119 TSGLLYGSGEYFVRLGVGTPARSLFMVVDTGSDLPWLQCQPCKSCYKQADPIFDPRNSSS 178

Query: 111 YAYVPCDSEHCRYFPYARCSAYK---HRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI 167
           +  +PC S  C+      CS  +    RC Y   Y  G  +    S++  T     ++  
Sbjct: 179 FQRIPCLSPLCKALEIHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLGTGSKA-- 236

Query: 168 HVQDVVFGCGFSTNRNFKFSGI----------FGLGIGRSSLVSQLNSSFSYCIGSLHDP 217
               V FGCGF     F  +            F   I  SS  S   +SFSYC+    +P
Sbjct: 237 --MSVAFGCGFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDRSNP 294

Query: 218 -DYLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDS 273
                + LI G  AI      +PL     +D  YY  +  +SV G  L I+    +   S
Sbjct: 295 MTRSSSSLIFGAAAIPSTAALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQLSQS 354

Query: 274 G-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS---YSWPDKLCYH--GIMSSD 327
           G GGV+IDSGT VT      Y  +RD    R     + S   YS  D  CY+  G  S D
Sbjct: 355 GSGGVIIDSGTSVTRFPTSVYATIRDA--FRNATTNLPSAPRYSLFDT-CYNFSGKASVD 411

Query: 328 LKGFPTVRFHFRGGAKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQF 386
           +   P +  HF  GA L L   + +       +FC+A  P S+        +IG + QQ 
Sbjct: 412 V---PALVLHFENGADLQLPPTNYLIPINTAGSFCLAFAPTSM-----ELGIIGNIQQQS 463

Query: 387 YNVGYDIGRKQKTFQRMDCE 406
           + +G+D+ +    F    C+
Sbjct: 464 FRIGFDLQKSHLAFAPQQCK 483


>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
 gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
 gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
          Length = 445

 Score =  125 bits (313), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 97/302 (32%), Positives = 139/302 (46%), Gaps = 35/302 (11%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD--CSPQLGTIFYPSRSSSYAYVPCDS 118
           + V +S G P VPQ   +DTGS + W+ C PC    C PQ   ++ PS SS+Y+ VPC S
Sbjct: 113 YVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPSHSSTYSAVPCAS 172

Query: 119 EHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
           + C+      Y        +C +   Y  G  T    S ++LT          VQ+  FG
Sbjct: 173 DVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDKLTL----APGAIVQNFYFG 228

Query: 176 CGFSTNR-NFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHD-PDYLHNKLILGDGAIID 233
           CG   +     F G+ GLG  R SL ++    FSYC+ S+   P +    L LG G    
Sbjct: 229 CGHGKHAVRGLFDGVLGLGRLRESLGARYGGVFSYCLPSVSSKPGF----LALGAGKNPS 284

Query: 234 EGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVK 290
               TP+  + G      +TL  I+V G+ LD+ P+ F      GG+++DSGT +T L  
Sbjct: 285 GFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFS-----GGMIVDSGTVITGLQS 339

Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----PTVRFHFRGGAKLA 345
            AY ALR      +E  ++      D  CY      +L G+     P +   F GGA + 
Sbjct: 340 TAYRALRSAFRKAMEAYRLLPNGDLDT-CY------NLTGYKNVVVPKIALTFTGGATIN 392

Query: 346 LE 347
           L+
Sbjct: 393 LD 394


>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 351

 Score =  125 bits (313), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 108/359 (30%), Positives = 165/359 (45%), Gaps = 28/359 (7%)

Query: 61  FYVNISIGQPPVPQFTAM-DTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSE 119
           + + IS+G PP  QF+A+ DTGS L WV C PC  C  Q   +F P  SSSY+   C   
Sbjct: 8   YVLQISLGTPP-QQFSAIVDTGSDLCWVQCAPCARCFEQPDPLFIPLASSSYSNASCTDS 66

Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
            C   P   CS  ++ C Y+  Y  G  T    + E +T   +  + I      FGCG +
Sbjct: 67  LCDALPRPTCS-MRNTCTYSYSYGDGSNTRGDFAFETVTLNGSTLARIG-----FGCGHN 120

Query: 180 TNRNFKFS-GIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDE 234
               F  + G+ GLG G  SL SQLNSS    FSYC+          + +  G+ A    
Sbjct: 121 QEGTFAGADGLIGLGQGPLSLPSQLNSSFTHIFSYCLVD-QSTTGTFSPITFGNAAENSR 179

Query: 235 GDATP-LQFID--GHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVK 290
              TP LQ  D   +YY+ +E+ISV  R +   P+ F+ D +G GGV++DSGT +T+   
Sbjct: 180 ASFTPLLQNEDNPSYYYVGVESISVGNRRVPTPPSAFRIDANGVGGVILDSGTTITYWRL 239

Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKLCY--HGIMSSDLKGFPTVRFHFRG-GAKLALE 347
            A+  +  E+  ++   +     +   LCY    + +S L   P++  H      ++ + 
Sbjct: 240 AAFIPILAELRRQISYPEADPTPYGLNLCYDISSVSASSLT-LPSMTVHLTNVDFEIPVS 298

Query: 348 KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
              +      +  C A++ +        FS+IG + QQ   +  D+   +  F   DC 
Sbjct: 299 NLWVLVDNFGETVCTAMSTSD------QFSIIGNVQQQNNLIVTDVANSRVGFLATDCS 351


>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 336

 Score =  124 bits (312), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 113/351 (32%), Positives = 161/351 (45%), Gaps = 29/351 (8%)

Query: 67  IGQPPVPQFTAMDTGSSLLWVHCYPC---RDCSPQLGTIFYPSRSSSYAYVPCDSEHCRY 123
           +GQP  P F  +DTGS + W+ C PC     C  Q+  IF P  SSSY  V CDSE C+ 
Sbjct: 3   VGQPQQPSFFVLDTGSDVTWLQCLPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQCQL 62

Query: 124 FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRN 183
              A C+   + CIY   Y  G  T   ++TE LTF +++     + ++  GCG      
Sbjct: 63  LDEAGCNV--NSCIYKVEYGDGSFTIGELATETLTFVHSN----SIPNISIGCGHDNEGL 116

Query: 184 F-KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYL---HNKLILGDGAI--IDEGD 236
           F    G+ GLG G  S+ SQL  SSFSYC+  +  P +     N     D  I  + + D
Sbjct: 117 FVGADGLIGLGGGAISISSQLKASSFSYCLVDIDSPSFSTLDFNTDPPSDSLISPLVKND 176

Query: 237 ATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYEA 295
             P        Y+ +  +SV G+ L I+ + F+ D+SG GG+++DSGT +T L  + YE 
Sbjct: 177 RFP-----SFRYVKVIGMSVGGKPLPISSSRFEIDESGLGGIIVDSGTTITQLPSDVYEV 231

Query: 296 LRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL-EKDSMFYQ 354
           LR+  +             P   CY     S+++  PT+ F   G   L L  K+ +   
Sbjct: 232 LREAFLGLTTNLPPAPEISPFDTCYDLSSQSNVE-VPTIAFILPGENSLQLPAKNCLIQV 290

Query: 355 PRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
                FC+A   A+        S+IG   QQ   V YD+      F    C
Sbjct: 291 DSAGTFCLAFVSATF-----PLSIIGNFQQQGIRVSYDLTNSLVGFSTNKC 336


>gi|449433371|ref|XP_004134471.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449495479|ref|XP_004159853.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 424

 Score =  124 bits (312), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 120/425 (28%), Positives = 195/425 (45%), Gaps = 35/425 (8%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIR-SHNTYQAQILPSNDISNS 59
           LIH DS LSP+ N       R++  ++ S +RL YL    + S N     +  S  + N 
Sbjct: 12  LIHHDSPLSPFYNHTMTDTARIEATVHRSRSRLNYLYYINKLSENALDNDVSLSPTLVNE 71

Query: 60  --LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQ---LGTIFYPSRSSSYAY 113
              + ++ +IG P       +DT + L+WV C  C   C P+   L T F  S+S +Y  
Sbjct: 72  GGEYLMSFNIGNPSSQVMGFLDTSNGLIWVQCSNCNSQCEPEKRGLTTKFLSSKSFTYEM 131

Query: 114 VPCDSEHCRYFP-YARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
            PC S  C     +  C++    C Y  +Y     TS  +S++   F  +D   + V  +
Sbjct: 132 EPCGSNFCNSLTGFQTCNSSDKWCKYRLVYGDNKATSGILSSDSFGFDTSDGMLVDVGFL 191

Query: 173 VFGCGFS--TNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDG 229
            FGC  +  T     ++G  GL     SL+SQL    FSYC+   ++     +K+  G  
Sbjct: 192 NFGCSEAPLTGDEQSYTGNVGLNQTPLSLISQLGIKKFSYCLVPFNNLGST-SKMYFGSL 250

Query: 230 AIIDEGDATPLQFIDGH-YYITLEAISV--DGRMLDINPNIFKRDDSGGGVMIDSGTDVT 286
            +   G  TPL + +   YY+ +  IS+  D    D   ++++  D   G +ID+G   +
Sbjct: 251 PVT-SGGQTPLLYPNSDAYYVKVLGISIGNDEPHFDGVFDVYEVRD---GWIIDTGITYS 306

Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPD-KLCYHGIMSSDLKGFPTVRFHFRGGAKLA 345
            L  +A+++L  + +   +  Q +       +LC+    ++DL+ FP V  HF  GA L 
Sbjct: 307 SLETDAFDSLLAKFLTLKDFPQRKDDPKERFELCFELQNANDLESFPDVTVHFD-GADLI 365

Query: 346 LEKDSMFYQPRPDA-FCMAV----NPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
           L  +S F +   D  FC+A+    +P SI         +G    Q Y+VGYD+  +  +F
Sbjct: 366 LNVESTFVKIEDDGIFCLALLRSGSPVSI---------LGNFQLQNYHVGYDLEAQVISF 416

Query: 401 QRMDC 405
             +DC
Sbjct: 417 APVDC 421


>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 481

 Score =  124 bits (312), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 120/409 (29%), Positives = 165/409 (40%), Gaps = 48/409 (11%)

Query: 26  INISIARLAYLQEKI-------RSHNTYQAQILPSND---ISNSLFYVNISIGQPPVPQF 75
           +N+   R+ Y+Q ++        S     +  LP+     I ++ ++V + +G P     
Sbjct: 91  MNLDNERVKYIQSRLSKNLGRENSVKELDSTTLPAKSGSLIGSANYFVVVGLGTPKRDLS 150

Query: 76  TAMDTGSSLLWVHCYPCR-DCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYA----RCS 130
              DTGS L W  C PC   C  Q   IF PS+SSSY  + C S  C     A    RCS
Sbjct: 151 LVFDTGSDLTWTQCEPCAGSCYKQQDAIFDPSKSSSYINITCTSSLCTQLTSAGIKSRCS 210

Query: 131 AYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFS-GI 189
           +    CIY   Y     +  F+S E+LT   TD     V D +FGCG      F  S G+
Sbjct: 211 SSTTACIYGIQYGDKSTSVGFLSQERLTITATD----IVDDFLFGCGQDNEGLFSGSAGL 266

Query: 190 FGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEG-DATPLQFID 244
            GLG    S V Q     N  FSYC+ S          L  G  A  +     TPL  I 
Sbjct: 267 IGLGRHPISFVQQTSSIYNKIFSYCLPSTSSS---LGHLTFGASAATNANLKYTPLSTIS 323

Query: 245 GH---YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVM 301
           G    Y + +  ISV G  L   P +     S GG +IDSGT +T L   AY ALR    
Sbjct: 324 GDNTFYGLDIVGISVGGTKL---PAVSSSTFSAGGSIIDSGTVITRLAPTAYAALRSAFR 380

Query: 302 IRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----PTVRFHFRGGAKLALEKDSMFYQPR 356
             +E   + +       CY      D  G+     P + F F GG  + L    +     
Sbjct: 381 QGMEKYPVANEDGLFDTCY------DFSGYKEISVPKIDFEFAGGVTVELPLVGILIGRS 434

Query: 357 PDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
               C+A      +N   + ++ G + Q+   V YD+   +  F    C
Sbjct: 435 AQQVCLAFAANGNDN---DITIFGNVQQKTLEVVYDVEGGRIGFGAAGC 480


>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
 gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
          Length = 493

 Score =  124 bits (311), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 110/366 (30%), Positives = 156/366 (42%), Gaps = 33/366 (9%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++  I +G P  P    +DTGS ++W+ C PCR C  Q G +F P RSSSY  V C +  
Sbjct: 140 YFTKIGVGTPSTPALMVLDTGSDVVWLQCAPCRRCYDQSGPVFDPRRSSSYGAVDCAAPL 199

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           CR      C   +  C+Y   Y  G  T+   +TE LTF         V  V  GCG   
Sbjct: 200 CRRLDSGGCDLRRRACLYQVAYGDGSVTAGDFATETLTFAG----GARVARVALGCGHD- 254

Query: 181 NRNFKFSGIFGLGIGRSSL------VSQLNSSFSYCIGSLHDPDYLHNKLILGDGAII-- 232
           N     +    LG+GR SL        +   SFSYC+                   +   
Sbjct: 255 NEGLFVAAAGLLGLGRGSLSFPTQISRRYGKSFSYCLVDRTSSSSSGAASRSRSSTVTFG 314

Query: 233 ----DEGDATPL---QFIDGHYYITLEAISVDG-RMLDINPNIFKRDDSG--GGVMIDSG 282
                    TP+     ++  YY+ L  ISV G R+  +  +  + D S   GGV++DSG
Sbjct: 315 PPSASAASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGVIVDSG 374

Query: 283 TDVTWLVKEAYEALRDEVMIRLEGEQMR--SYSWPDKLCYHGIMSSDLKGFPTVRFHFRG 340
           T VT L + +Y ALRD       G ++    +S  D  CY  +    +   PTV  HF G
Sbjct: 375 TSVTRLARPSYSALRDAFRAAAAGLRLSPGGFSLFDT-CYD-LGGRKVVKVPTVSMHFAG 432

Query: 341 GAKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKT 399
           GA+ AL  ++ +        FC A   A  +      S+IG + QQ + V +D   ++  
Sbjct: 433 GAEAALPPENYLIPVDSRGTFCFAF--AGTDG---GVSIIGNIQQQGFRVVFDGDGQRVG 487

Query: 400 FQRMDC 405
           F    C
Sbjct: 488 FAPKGC 493


>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 482

 Score =  124 bits (311), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 123/431 (28%), Positives = 174/431 (40%), Gaps = 44/431 (10%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGI-NISIARLAYLQEKIRSH-------NTYQAQILP 52
           ++H+    S  N+  +  A      I N+   R+ Y+Q ++  +           +  LP
Sbjct: 69  VVHKHGPCSQLNHSGKAEATISHNDIMNLDNERVKYIQSRLSKNLGGENRVKELDSTTLP 128

Query: 53  SND---ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DCSPQLGTIFYPSRS 108
           +     I ++ +YV + +G P        DTGS L W  C PC   C  Q   IF PS+S
Sbjct: 129 AKSGRLIGSADYYVVVGLGTPKRDLSLIFDTGSYLTWTQCEPCAGSCYKQQDPIFDPSKS 188

Query: 109 SSYAYVPCDSEHCRYFPYARCSAYK-HRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI 167
           SSY  + C S  C  F  A CS+     CIY   Y     +  F+S E+LT   TD    
Sbjct: 189 SSYTNIKCTSSLCTQFRSAGCSSSTDASCIYDVKYGDNSISRGFLSQERLTITATD---- 244

Query: 168 HVQDVVFGCGFSTNRNFK-FSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHN 222
            V D +FGCG      F+  +G+ GL     S V Q     N  FSYC+ S   P  L +
Sbjct: 245 IVHDFLFGCGQDNEGLFRGTAGLMGLSRHPISFVQQTSSIYNKIFSYCLPS--TPSSLGH 302

Query: 223 KLILGDGAIIDEGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
                  A       TP   I G    Y + +  ISV G  L   P +     S GG +I
Sbjct: 303 LTFGASAATNANLKYTPFSTISGENSFYGLDIVGISVGGTKL---PAVSSSTFSAGGSII 359

Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----PTV 334
           DSGT +T L   AY ALR      +    +   +     CY      D  G+     P +
Sbjct: 360 DSGTVITRLPPTAYAALRSAFRQFMMKYPVAYGTRLLDTCY------DFSGYKEISVPRI 413

Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
            F F GG K+ L    + Y       C+A    + N    + ++ G + Q+   V YD+ 
Sbjct: 414 DFEFAGGVKVELPLVGILYGESAQQLCLAF---AANGNGNDITIFGNVQQKTLEVVYDVE 470

Query: 395 RKQKTFQRMDC 405
             +  F    C
Sbjct: 471 GGRIGFGAAGC 481


>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
 gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
          Length = 503

 Score =  124 bits (311), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 107/363 (29%), Positives = 158/363 (43%), Gaps = 42/363 (11%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSYAYVPCDSE 119
           + V I +G P        DTGS   WV C PC   C  Q   +F P++S++YA + C S 
Sbjct: 165 YVVPIRLGTPAARFTVVFDTGSDTTWVQCQPCVAYCYQQKEPLFTPTKSATYANISCTSS 224

Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
           +C       CS     C+Y   Y  G  T  F + + LT          V+D  FGCG  
Sbjct: 225 YCSDLDTRGCSG--GHCLYAVQYGDGSYTVGFYAQDTLTLGYDT-----VKDFRFGCG-E 276

Query: 180 TNRNF--KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAIID 233
            NR    K +G+ GLG G++S+  Q     +  F+YCI +        +           
Sbjct: 277 KNRGLFGKAAGLMGLGRGKTSVPVQAYDKYSGVFAYCIPATSSGTGFLD--FGPGAPAAA 334

Query: 234 EGDATPLQFIDG--HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKE 291
               TP+   +G   YY+ +  I V G +L I   +F    S  G ++DSGT +T L   
Sbjct: 335 NARLTPMLVDNGPTFYYVGMTGIKVGGHLLSIPATVF----SDAGALVDSGTVITRLPPS 390

Query: 292 AYEALRDEVMIRLEG---EQMRSYSWPDKLCYHGIMSSDLKGF------PTVRFHFRGGA 342
           AYE LR      +EG   +   ++S  D  CY      DL G+      P V   F+GGA
Sbjct: 391 AYEPLRSAFAKGMEGLGYKTAPAFSILDT-CY------DLTGYQGSIALPAVSLVFQGGA 443

Query: 343 KLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
            L ++   + Y       C+A    + N+   + +++G   Q+ Y+V YD+G+K   F  
Sbjct: 444 CLDVDASGILYVADVSQACLAF---AANDDDTDMTIVGNTQQKTYSVLYDLGKKVVGFAP 500

Query: 403 MDC 405
             C
Sbjct: 501 GAC 503


>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
 gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
          Length = 515

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 101/374 (27%), Positives = 154/374 (41%), Gaps = 48/374 (12%)

Query: 52  PSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSS 110
           P   +    + V + +G P        DTGS   WV C PC   C  Q   +F P+ SS+
Sbjct: 170 PGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSST 229

Query: 111 YAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
           YA V C +  C     + CS     C+Y   Y  G  +  F + + LT  + D     V+
Sbjct: 230 YANVSCAAPACSDLDVSGCSG--GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDA----VK 283

Query: 171 DVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCI------------GS 213
              FGCG   +  F + +G+ GLG G++SL  Q        F++C+            G+
Sbjct: 284 GFRFGCGERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPARSTGTGYLDFGA 343

Query: 214 LHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDS 273
              P      ++ G+G                 YY+ +  I V GR+L I P++F    +
Sbjct: 344 GSPPATTTTPMLTGNGPTF--------------YYVGMTGIRVGGRLLPIAPSVF----A 385

Query: 274 GGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL--CYHGIMSSDLKGF 331
             G ++DSGT +T L   AY +LR      +     R  +    L  CY     S +   
Sbjct: 386 AAGTIVDSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVA-I 444

Query: 332 PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
           PTV   F+GGA L ++   + Y       C+A    + N    +  ++G    + + V Y
Sbjct: 445 PTVSLLFQGGAALDVDASGIMYTVSASQVCLAF---AGNEDGGDVGIVGNTQLKTFGVAY 501

Query: 392 DIGRKQKTFQRMDC 405
           DIG+K   F    C
Sbjct: 502 DIGKKVVGFSPGAC 515


>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
          Length = 516

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 101/374 (27%), Positives = 154/374 (41%), Gaps = 48/374 (12%)

Query: 52  PSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSS 110
           P   +    + V + +G P        DTGS   WV C PC   C  Q   +F P+ SS+
Sbjct: 171 PGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSST 230

Query: 111 YAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
           YA V C +  C     + CS     C+Y   Y  G  +  F + + LT  + D     V+
Sbjct: 231 YANVSCAAPACSDLDVSGCSG--GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDA----VK 284

Query: 171 DVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCI------------GS 213
              FGCG   +  F + +G+ GLG G++SL  Q        F++C+            G+
Sbjct: 285 GFRFGCGERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPPRSTGTGYLDFGA 344

Query: 214 LHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDS 273
              P      ++ G+G                 YY+ +  I V GR+L I P++F    +
Sbjct: 345 GSPPATTTTPMLTGNGPTF--------------YYVGMTGIRVGGRLLPIAPSVF----A 386

Query: 274 GGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL--CYHGIMSSDLKGF 331
             G ++DSGT +T L   AY +LR      +     R  +    L  CY     S +   
Sbjct: 387 AAGTIVDSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVA-I 445

Query: 332 PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
           PTV   F+GGA L ++   + Y       C+A    + N    +  ++G    + + V Y
Sbjct: 446 PTVSLLFQGGAALDVDASGIMYTVSASQVCLAF---AGNEDGGDVGIVGNTQLKTFGVAY 502

Query: 392 DIGRKQKTFQRMDC 405
           DIG+K   F    C
Sbjct: 503 DIGKKVVGFSPGAC 516


>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
          Length = 440

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 104/363 (28%), Positives = 160/363 (44%), Gaps = 28/363 (7%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + ++++IG PP P    +DTGS L+W  C PC  C  Q    +  SRSS++A   CDS  
Sbjct: 91  YLLHLAIGTPPQPVQLTLDTGSVLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDSTQ 150

Query: 121 CRYFPYARCSAYK--HRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
           C+  P       +    C Y+  Y     T  F+  E ++F     +   V  VVFGCG 
Sbjct: 151 CKLDPSVTMCVNQTVQTCAYSYSYGDKSATIGFLDVETVSF----VAGASVPGVVFGCGL 206

Query: 179 STNRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSL--HDPDYLHNKLIL-----GD 228
           +    F+   +GI G G G  SL SQL   +FS+C  ++    P  +   L       G 
Sbjct: 207 NNTGIFRSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVSGRKPSTVLFDLPADLYKNGR 266

Query: 229 GAIIDEGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDV 285
           G +      TPL     H   YY++L+ I+V    L +  + F   +  GG +IDSGT  
Sbjct: 267 GTV----QTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALKNGTGGTIIDSGTAF 322

Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLA 345
           T L    Y  + DE    ++   + S      LC+           P +  HF  GA + 
Sbjct: 323 TSLPPRVYRLVHDEFAAHVKLPVVPSNETGPLLCFSAPPLGKAPHVPKLVLHFE-GATMH 381

Query: 346 LEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           L +++  ++ +    C ++  A I       ++IG   QQ  +V YD+   + +F R  C
Sbjct: 382 LPRENYVFEAKDGGNC-SICLAIIEGE---MTIIGNFQQQNMHVLYDLKNSKLSFVRAKC 437

Query: 406 EVL 408
           + L
Sbjct: 438 DKL 440


>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
          Length = 475

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 112/394 (28%), Positives = 169/394 (42%), Gaps = 63/394 (15%)

Query: 32  RLAYLQEKIRSHNTY---------QAQILPSN---DISNSLFYVNISIGQPPVPQFTAMD 79
           R  Y+Q ++               +A  +P+N    I    + V +S+G P V Q   +D
Sbjct: 101 RAEYIQRRVSGAAAAAPGMQLAGSKAATVPANLGFSIGTLQYVVTVSLGTPAVAQTLEVD 160

Query: 80  TGSSLLWVHCYPCRD--CSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCI 137
           TGS + WV C PC    C  Q   +F P+RSSSY+ VPC +  C             +C 
Sbjct: 161 TGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSSYSAVPCAAASCSQLALYSNGCSGGQCG 220

Query: 138 YTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFK-FSGIFGLGIGR 196
           Y   Y  G  T+   S++ LT   ++     ++  +FGCG +    F    G+ GLG   
Sbjct: 221 YVVSYGDGSTTTGVYSSDTLTLTGSNA----LKGFLFGCGHAQQGLFAGVDGLLGLGRQG 276

Query: 197 SSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFI--DGHYYIT 250
            SLVSQ +S+    FSYC+    +       + LG  +       TPL     D  YYI 
Sbjct: 277 QSLVSQASSTYGGVFSYCLPPTQNS---VGYISLGGPSSTAGFSTTPLLTASNDPTYYIV 333

Query: 251 -LEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQM 309
            L  ISV G+ L I+ ++F       G ++D+GT VT L   AY ALR           M
Sbjct: 334 MLAGISVGGQPLSIDASVFAS-----GAVVDTGTVVTRLPPTAYSALRSAFR-----AAM 383

Query: 310 RSYSWPDK-------LCY----HGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPD 358
             Y +P          CY    +G ++      PT+   F GGA + L    +       
Sbjct: 384 APYGYPSAPATGILDTCYDFTRYGTVT-----LPTISIAFGGGAAMDLGTSGIL-----T 433

Query: 359 AFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
           + C+A  P   +++    S++G + Q+ + V +D
Sbjct: 434 SGCLAFAPTGGDSQA---SILGNVQQRSFEVRFD 464


>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
 gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
 gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
          Length = 519

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 101/374 (27%), Positives = 154/374 (41%), Gaps = 48/374 (12%)

Query: 52  PSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSS 110
           P   +    + V + +G P        DTGS   WV C PC   C  Q   +F P+ SS+
Sbjct: 174 PGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSST 233

Query: 111 YAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
           YA V C +  C     + CS     C+Y   Y  G  +  F + + LT  + D     V+
Sbjct: 234 YANVSCAAPACSDLDVSGCSG--GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDA----VK 287

Query: 171 DVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCI------------GS 213
              FGCG   +  F + +G+ GLG G++SL  Q        F++C+            G+
Sbjct: 288 GFRFGCGERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPARSTGTGYLDFGA 347

Query: 214 LHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDS 273
              P      ++ G+G                 YY+ +  I V GR+L I P++F    +
Sbjct: 348 GSPPATTTTPMLTGNGPTF--------------YYVGMTGIRVGGRLLPIAPSVF----A 389

Query: 274 GGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL--CYHGIMSSDLKGF 331
             G ++DSGT +T L   AY +LR      +     R  +    L  CY     S +   
Sbjct: 390 AAGTIVDSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVA-I 448

Query: 332 PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
           PTV   F+GGA L ++   + Y       C+A    + N    +  ++G    + + V Y
Sbjct: 449 PTVSLLFQGGAALDVDASGIMYTVSASQVCLAF---AGNEDGGDVGIVGNTQLKTFGVAY 505

Query: 392 DIGRKQKTFQRMDC 405
           DIG+K   F    C
Sbjct: 506 DIGKKVVGFSPGAC 519


>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
 gi|194705620|gb|ACF86894.1| unknown [Zea mays]
 gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
          Length = 477

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 118/434 (27%), Positives = 182/434 (41%), Gaps = 53/434 (12%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSN------ 54
           ++H     SP  +    P+H    G +    R+  ++ K+ +  T  +   P        
Sbjct: 67  VVHGHGPCSPQESRRGAPSHTEILGRDQD--RVDAIRRKVAAVTTAASSSKPKGVPLQVG 124

Query: 55  -----DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSS 109
                D +N  ++ ++ +G P       +DTGS   W+ C PC DC  Q   +F PS+SS
Sbjct: 125 WGKYLDTTN--YFTSLRLGTPATDLLVELDTGSDQSWIQCKPCPDCYEQHEALFDPSKSS 182

Query: 110 SYAYVPCDSEHCRYFPYAR---CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDEST 166
           +Y+ + C S  C+    +    CS+ K +C Y   Y     T   ++ + LT   TD   
Sbjct: 183 TYSDITCSSRECQELGSSHKHNCSSDK-KCPYEITYADDSYTVGNLARDTLTLSPTDA-- 239

Query: 167 IHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLH 221
             V   VFGCG +   +F +  G+ GLG G++SL SQ+     + FSYC+ S   P    
Sbjct: 240 --VPGFVFGCGHNNAGSFGEIDGLLGLGRGKASLSSQVAARYGAGFSYCLPS--SPSAT- 294

Query: 222 NKLILGDGAIIDEGDATPLQFIDGH----YYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
             L     A     +A   + + G     YY+ L  I+V GR + + P++F    +  G 
Sbjct: 295 GYLSFSGAAAAAPTNAQFTEMVAGQHPSFYYLNLTGITVAGRAIKVPPSVFA---TAAGT 351

Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFH 337
           +IDSGT  + L   AY ALR  V   +   +    S     CY      DL G  TVR  
Sbjct: 352 IIDSGTAFSCLPPSAYAALRSSVRSAMGRYKRAPSSTIFDTCY------DLTGHETVRIP 405

Query: 338 -----FRGGAKLALEKDSMFYQ-PRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
                F  GA + L    + Y        C+A  P   N    +  ++G   Q+   V Y
Sbjct: 406 SVALVFADGATVHLHPSGVLYTWSNVSQTCLAFLP---NPDDTSLGVLGNTQQRTLAVIY 462

Query: 392 DIGRKQKTFQRMDC 405
           D+  ++  F    C
Sbjct: 463 DVDNQKVGFGANGC 476


>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 384

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 104/363 (28%), Positives = 160/363 (44%), Gaps = 28/363 (7%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + ++++IG PP P    +DTGS L+W  C PC  C  Q    +  SRSS++A   CDS  
Sbjct: 35  YLLHLAIGTPPQPVQLTLDTGSVLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDSTQ 94

Query: 121 CRYFPYARCSAYK--HRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
           C+  P       +    C Y+  Y     T  F+  E ++F     +   V  VVFGCG 
Sbjct: 95  CKLDPSVTMCVNQTVQTCAYSYSYGDKSATIGFLDVETVSF----VAGASVPGVVFGCGL 150

Query: 179 STNRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSL--HDPDYLHNKLIL-----GD 228
           +    F+   +GI G G G  SL SQL   +FS+C  ++    P  +   L       G 
Sbjct: 151 NNTGIFRSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVSGRKPSTVLFDLPADLYKNGR 210

Query: 229 GAIIDEGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDV 285
           G +      TPL     H   YY++L+ I+V    L +  + F   +  GG +IDSGT  
Sbjct: 211 GTV----QTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALKNGTGGTIIDSGTAF 266

Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLA 345
           T L    Y  + DE    ++   + S      LC+           P +  HF  GA + 
Sbjct: 267 TSLPPRVYRLVHDEFAAHVKLPVVPSNETGPLLCFSAPPLGKAPHVPKLVLHFE-GATMH 325

Query: 346 LEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           L +++  ++ +    C ++  A I       ++IG   QQ  +V YD+   + +F R  C
Sbjct: 326 LPRENYVFEAKDGGNC-SICLAIIEGE---MTIIGNFQQQNMHVLYDLKNSKLSFVRAKC 381

Query: 406 EVL 408
           + L
Sbjct: 382 DKL 384


>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 367

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 108/375 (28%), Positives = 167/375 (44%), Gaps = 39/375 (10%)

Query: 46  YQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYP 105
           ++A I          ++  + +G P    +  +DTGS + W+ C PC +C  Q   +F P
Sbjct: 1   FEAPIFSGLAFGTGEYFAVVGVGTPRRDMYLVVDTGSDITWLQCAPCTNCYKQKDALFNP 60

Query: 106 SRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT-DE 164
           S SSS+  + C S  C       C    ++C+Y   Y  G  T   + T+ +   +    
Sbjct: 61  SSSSSFKVLDCSSSLCLNLDVMGC--LSNKCLYQADYGDGSFTMGELVTDNVVLDDAFGP 118

Query: 165 STIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLH-DPD 218
             + + ++  GCG      F   +GI GLG G  S  + L++S    FSYC+     DP+
Sbjct: 119 GQVVLTNIPLGCGHDNEGTFGTAAGILGLGRGPLSFPNNLDASTRNIFSYCLPDRESDPN 178

Query: 219 YLHNKLILGDGAIIDEGDATPLQFI--------DGHYYITLEAISVDGRML-DINPNIFK 269
           +  + L+ GD AI      + ++FI          +YY+ +  ISV G +L +I  ++F+
Sbjct: 179 H-KSTLVFGDAAIPHTATGS-VKFIPQLRNPRVATYYYVQITGISVGGNLLTNIPASVFQ 236

Query: 270 RDDSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD----KLCYHGIM 324
            D  G GG + DSGT +T L   AY A+RD          M   S  D      CY    
Sbjct: 237 LDSHGNGGTIFDSGTTITRLEARAYTAVRDA----FRAATMHLTSAADFKIFDTCYDFTG 292

Query: 325 SSDLKGFPTVRFHFRGGAKLALEKDSMFYQP--RPDAFCMAVNPASINNRYVNFSLIGMM 382
            + +   PTV FHF+G   + L   S +  P    + FC A   +      +  S+IG +
Sbjct: 293 MNSIS-VPTVTFHFQGDVDMRLPP-SNYIVPVSNNNIFCFAFAAS------MGPSVIGNV 344

Query: 383 AQQFYNVGYDIGRKQ 397
            QQ + V YD   KQ
Sbjct: 345 QQQSFRVIYDNVHKQ 359


>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
          Length = 464

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 112/394 (28%), Positives = 169/394 (42%), Gaps = 63/394 (15%)

Query: 32  RLAYLQEKIRSHNTY---------QAQILPSN---DISNSLFYVNISIGQPPVPQFTAMD 79
           R  Y+Q ++               +A  +P+N    I    + V +S+G P V Q   +D
Sbjct: 90  RAEYIQRRVSGAAAAAPGMQLAGSKAATVPANLGFSIGTLQYVVTVSLGTPAVAQTLEVD 149

Query: 80  TGSSLLWVHCYPCRD--CSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCI 137
           TGS + WV C PC    C  Q   +F P+RSSSY+ VPC +  C             +C 
Sbjct: 150 TGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSSYSAVPCAAASCSQLALYSNGCSGGQCG 209

Query: 138 YTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFK-FSGIFGLGIGR 196
           Y   Y  G  T+   S++ LT   ++     ++  +FGCG +    F    G+ GLG   
Sbjct: 210 YVVSYGDGSTTTGVYSSDTLTLTGSNA----LKGFLFGCGHAQQGLFAGVDGLLGLGRQG 265

Query: 197 SSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFI--DGHYYIT 250
            SLVSQ +S+    FSYC+    +       + LG  +       TPL     D  YYI 
Sbjct: 266 QSLVSQASSTYGGVFSYCLPPTQNS---VGYISLGGPSSTAGFSTTPLLTASNDPTYYIV 322

Query: 251 -LEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQM 309
            L  ISV G+ L I+ ++F       G ++D+GT VT L   AY ALR           M
Sbjct: 323 MLAGISVGGQPLSIDASVFAS-----GAVVDTGTVVTRLPPTAYSALRSAFR-----AAM 372

Query: 310 RSYSWPDK-------LCY----HGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPD 358
             Y +P          CY    +G ++      PT+   F GGA + L    +       
Sbjct: 373 APYGYPSAPATGILDTCYDFTRYGTVT-----LPTISIAFGGGAAMDLGTSGIL-----T 422

Query: 359 AFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
           + C+A  P   +++    S++G + Q+ + V +D
Sbjct: 423 SGCLAFAPTGGDSQA---SILGNVQQRSFEVRFD 453


>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
          Length = 476

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 101/356 (28%), Positives = 150/356 (42%), Gaps = 25/356 (7%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSYAYVPCDSE 119
           F V +  G P        DTGS + W+ C PC   C  Q   IF P++S++Y+ VPC   
Sbjct: 135 FVVTVGFGTPAQTYTVIFDTGSDVSWIQCLPCSGHCYKQHDPIFDPTKSATYSVVPCGHP 194

Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
            C     ++CS     C+Y   Y  G  ++  +S E L+      ST  +    FGCG +
Sbjct: 195 QCAAADGSKCS--NGTCLYKVEYGDGSSSAGVLSHETLSLT----STRALPGFAFGCGQT 248

Query: 180 TNRNF-KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDE 234
              +F    G+ GLG G+ SL SQ  +S    FSYC+ S    +  H  L +G       
Sbjct: 249 NLGDFGDVDGLIGLGRGQLSLSSQAAASFGGTFSYCLPS---DNTTHGYLTIGPTTPASN 305

Query: 235 GDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLV 289
            D      +        Y++ L +I + G +L + P +F  D    G  +DSGT +T+L 
Sbjct: 306 DDVQYTAMVQKQDYPSFYFVELVSIDIGGYILPVPPTLFTDD----GTFLDSGTILTYLP 361

Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKD 349
            EAY ALRD     +   +      P   CY     S +   P V F F  G+   L   
Sbjct: 362 PEAYTALRDRFKFTMTQYKPAPAYDPFDTCYDFTGQSAIF-IPAVSFKFSDGSVFDLSFF 420

Query: 350 SMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
            +   P   A  +           + F+++G M Q+   V YD+  ++  F    C
Sbjct: 421 GILIFPDDTAPAIGCLGFVARPSAMPFTIVGNMQQRNTEVIYDVAAEKIGFASASC 476


>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
          Length = 504

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 96/319 (30%), Positives = 149/319 (46%), Gaps = 32/319 (10%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
           L++  + +G PP   F  +DTGS +LWV C PC  C    G       F P  SS+ + +
Sbjct: 90  LYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKI 149

Query: 115 PCDSEHCRYF---PYARCSAYKHR-CIYTQLYLIGPETSVFVSTEQLTFKNT---DESTI 167
           PC  + C        A C    +  C YT  Y  G  TS +  ++ + F +    +++  
Sbjct: 150 PCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDSVMGNEQTAN 209

Query: 168 HVQDVVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNS------SFSYCIGSLHD 216
               +VFGC  S     T  +    GIFG G  + S+VSQLNS       FS+C   L  
Sbjct: 210 SSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHC---LKG 266

Query: 217 PDYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
            D     L+LG+  I++ G   TPL     HY + LE+I V+G+ L I+ ++F   ++  
Sbjct: 267 SDNGGGILVLGE--IVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNT-Q 323

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
           G ++DSGT + +L   AY+   + +   +    +RS       C+    S D   FPTV 
Sbjct: 324 GTIVDSGTTLAYLADGAYDPFVNAITAAVS-PSVRSLVSKGNQCFVTSSSVD-SSFPTVS 381

Query: 336 FHFRGGAKLALEKDSMFYQ 354
            +F GG  + ++ ++   Q
Sbjct: 382 LYFMGGVAMTVKPENYLLQ 400


>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
          Length = 472

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 122/372 (32%), Positives = 161/372 (43%), Gaps = 53/372 (14%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR--DCSPQLGTIFYPSRSSSYAYVPCDS 118
           + V + IG P V Q   +DTGS L WV C PC    C PQ   ++ P+ SS+YA VPCDS
Sbjct: 127 YVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNSSSCYPQKDPLYDPTASSTYAPVPCDS 186

Query: 119 EHCR-YFPYARCSAYKHRCIY---TQLYLIGPE-----TSVFV-STEQLTFKNTDESTIH 168
           + C+   P     AY H C     T L   G E     T+V V STE LT        + 
Sbjct: 187 KACKDLVP----DAYDHGCTNSSGTSLCQYGIEYGNRDTTVGVYSTETLTL----SPQVS 238

Query: 169 VQDVVFGCGF-STNRNFKFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNK 223
           V+D  FGCG         F G+ GLG    SLVSQ       +FSYC+     P      
Sbjct: 239 VKDFGFGCGLVQQGTFDLFDGLLGLGGAPESLVSQTAETYGGAFSYCL-----PPGNSTT 293

Query: 224 LILGDGAIIDEGDATPLQFIDGH--------YYITLEAISVDGRMLDINPNIFKRDDSGG 275
             L  GA  +  D     F   H        Y + L  +SV G+ LDI P +       G
Sbjct: 294 GFLALGAPTNNNDTAGFLFTPLHSLPEQATFYLVNLTGVSVGGKPLDIPPTVLS-----G 348

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL--CYHGIMSSDLKGFPT 333
           G++IDSGT +T L   AY ALR      +    +   +  D L  CY+    +++   PT
Sbjct: 349 GMIIDSGTIITGLPDTAYSALRTAFRTAMSAYPLLPPNNDDVLDTCYNFTGIANVT-VPT 407

Query: 334 VRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
           V   F GGA + L+  S        AF    +   +        +IG + Q+ + V YD 
Sbjct: 408 VALTFDGGATIDLDVPSGVLIQDCLAFAGGASDGDVG-------IIGNVNQRTFEVLYDS 460

Query: 394 GRKQKTFQRMDC 405
           GR    F+   C
Sbjct: 461 GRGHVGFRPGAC 472


>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 464

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 118/426 (27%), Positives = 186/426 (43%), Gaps = 55/426 (12%)

Query: 22  VQRGINISIARLAYLQEKIRSHNTYQAQILP----SNDISNSLFYVNISIGQP-PVPQFT 76
           ++R +  S ARLA L    RS     A   P     +D+ +S + +++ IG P P     
Sbjct: 55  LRRMVARSKARLASL----RSSACDTALTAPVDHGGSDVGSSEYLIHLGIGTPRPQRVVL 110

Query: 77  AMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSE---HCRYFPYARCSAYK 133
            +DTGS L+W  C  C  C  Q   +F  S S +++ VPC      H  Y P + C+A  
Sbjct: 111 HLDTGSDLVWTQCA-CTVCFDQPVPVFRASVSHTFSRVPCSDPLCGHAVYLPLSGCAARD 169

Query: 134 HRCIYTQLYLIGPETSVFVSTEQLTFKNTD--ESTIHVQDVVFGCG------FSTNRNFK 185
             C Y   Y+    T+  ++ +  TFK  D  ++   V ++ FGCG      F+ N+   
Sbjct: 170 RSCFYAYGYMDHSITTGKMAEDTFTFKAPDRADTAAAVPNIRFGCGMMNYGLFTPNQ--- 226

Query: 186 FSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT-PLQ-- 241
            SGI G G G  SL SQL    FSYC  ++ +     + +ILG      E  AT P+Q  
Sbjct: 227 -SGIAGFGTGPLSLPSQLKVRRFSYCFTAMEESRV--SPVILGGEPENIEAHATGPIQST 283

Query: 242 -FIDG----------HYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLV 289
            F  G           Y+++L  ++V    L  N + F  + D  GG  IDSGT +T+  
Sbjct: 284 PFAPGPAGAPVGSQPFYFLSLRGVTVGETRLPFNASTFALKGDGSGGTFIDSGTAITFFP 343

Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPDK-LCYHGIMSSDLKGFPTVRFHFRGGAKLALEK 348
           +  + +LR+  + ++     + Y+ PD  LC+           P +  H   GA   L +
Sbjct: 344 QAVFRSLREAFVAQVPLPVAKGYTDPDNLLCFSVPAKKKAPAVPKLILHLE-GADWELPR 402

Query: 349 DSMFYQPRPDA------FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
           ++       D        C+ +  A  +N     ++IG   QQ  ++ YD+   +  F  
Sbjct: 403 ENYVLDNDDDGSGAGRKLCVVILSAGNSNG----TIIGNFQQQNMHIVYDLESNKMVFAP 458

Query: 403 MDCEVL 408
             C+ L
Sbjct: 459 ARCDKL 464


>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 494

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 102/372 (27%), Positives = 159/372 (42%), Gaps = 35/372 (9%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
           L++  I IG P    +  +DTGS +LWV+C  C  C  + G     T++ P+ S+S   V
Sbjct: 88  LYFTQIGIGTPSKGYYVQVDTGSDILWVNCISCDSCPRKSGLGIDLTLYDPTASASSKTV 147

Query: 115 PCDSEHCRYFPYA----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD---ESTI 167
            C  E C           C+A    C Y+  Y  G  T+ F   + L +       ++ +
Sbjct: 148 TCGQEFCATATNGGVPPSCAA-NSPCQYSITYGDGSSTTGFFVADFLQYDQVSGDGQTNL 206

Query: 168 HVQDVVFGCGFST-----NRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHD 216
               V FGCG        + N    GI G G   SS++SQL S+      FS+C+     
Sbjct: 207 ANASVTFGCGAKIGGALGSSNVALDGILGFGQANSSMLSQLTSAGKVTKIFSHCL----- 261

Query: 217 PDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
            D ++   I   G ++  +   TPL     HY + L+ I V G  L +  NIF       
Sbjct: 262 -DTVNGGGIFAIGNVVQPKVKTTPLVPGMPHYNVVLKTIDVGGSTLQLPTNIFDIGGGSR 320

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
           G +IDSGT + +L +  Y+A+   V        +++    D LC+    S D  GFP V 
Sbjct: 321 GTIIDSGTTLAYLPEVVYKAVLSAVFSNHPDVTLKNVQ--DFLCFQYSGSVD-NGFPEVT 377

Query: 336 FHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR-YVNFSLIGMMAQQFYNVGYDIG 394
           FHF G   L +      +Q   D +C+      + ++   +  L+G +A     V YD+ 
Sbjct: 378 FHFDGDLPLVVYPHDYLFQNTEDVYCVGFQSGGVQSKDGKDMVLLGDLALSNKLVVYDLE 437

Query: 395 RKQKTFQRMDCE 406
            +   +   +C 
Sbjct: 438 NQVIGWTNYNCS 449


>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
          Length = 504

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 96/319 (30%), Positives = 148/319 (46%), Gaps = 32/319 (10%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
           L++  + +G PP   F  +DTGS +LWV C PC  C    G       F P  SS+ + +
Sbjct: 90  LYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKI 149

Query: 115 PCDSEHCRY---FPYARCSAYKHR-CIYTQLYLIGPETSVFVSTEQLTFKNT---DESTI 167
           PC  + C        A C    +  C YT  Y  G  TS +  ++ + F      +++  
Sbjct: 150 PCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTAN 209

Query: 168 HVQDVVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNS------SFSYCIGSLHD 216
               +VFGC  S     T  +    GIFG G  + S+VSQLNS       FS+C   L  
Sbjct: 210 SSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHC---LKG 266

Query: 217 PDYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
            D     L+LG+  I++ G   TPL     HY + LE+I V+G+ L I+ ++F   ++  
Sbjct: 267 SDNGGGILVLGE--IVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNT-Q 323

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
           G ++DSGT + +L   AY+   + +   +    +RS       C+    S D   FPTV 
Sbjct: 324 GTIVDSGTTLAYLADGAYDPFVNAITAAVS-PSVRSLVSKGNQCFVTSSSVD-SSFPTVS 381

Query: 336 FHFRGGAKLALEKDSMFYQ 354
            +F GG  + ++ ++   Q
Sbjct: 382 LYFMGGVAMTVKPENYLLQ 400


>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
 gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
          Length = 407

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 112/369 (30%), Positives = 156/369 (42%), Gaps = 39/369 (10%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++V + +G P    F  +DTGS L W+ C PC+ C  Q   IF P  SSS+  +PC S  
Sbjct: 54  YFVRLGLGTPARSLFMVVDTGSDLPWLQCQPCKSCYKQADPIFDPRNSSSFQRIPCLSPL 113

Query: 121 CRYFPYARCSAYK---HRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
           C+      CS  +    RC Y   Y  G  +    S++  T     ++      V FGCG
Sbjct: 114 CKALEVHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLGTGSKA----MSVAFGCG 169

Query: 178 FSTNRNFKFSGI----------FGLGIGRSSLVSQLNSSFSYCIGSLHDP-DYLHNKLIL 226
           F     F  +            F   I  SS  S   +SFSYC+    +P     + LI 
Sbjct: 170 FDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDRSNPMTRSSSSLIF 229

Query: 227 GDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSG 282
           G  AI      +PL     +D  YY  +  +SV G  L I+    +   SG GGV+IDSG
Sbjct: 230 GVAAIPSTAALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQLSQSGSGGVIIDSG 289

Query: 283 TDVTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYH--GIMSSDLKGFPTVRFH 337
           T VT      Y  +RD      I L       YS  D  CY+  G  S D+   P +  H
Sbjct: 290 TSVTRFPTSVYATIRDAFRNATINLPSAPR--YSLFDT-CYNFSGKASVDV---PALVLH 343

Query: 338 FRGGAKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
           F  GA L L   + +       +FC+A  P S+        +IG + QQ + +G+D+ + 
Sbjct: 344 FENGADLQLPPTNYLIPINTAGSFCLAFAPTSM-----ELGIIGNIQQQSFRIGFDLQKS 398

Query: 397 QKTFQRMDC 405
              F    C
Sbjct: 399 HLAFAPQQC 407


>gi|357127505|ref|XP_003565420.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 466

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 121/447 (27%), Positives = 194/447 (43%), Gaps = 66/447 (14%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           LIHRDSI SP+++P      R                      +  +A  L ++D+S+ L
Sbjct: 31  LIHRDSIKSPFHDPKLTRHDRFL---------------AAARRSRARAAALLASDVSSDL 75

Query: 61  FY------VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-------------------C 95
           FY        +++G PPV      DTGS L+W+ C   ++                    
Sbjct: 76  FYGDFEYLAAVNVGTPPVRFLAVADTGSDLVWLKCNTTQNNNGIVSSDSGNNSNSSPPPP 135

Query: 96  SPQLGTIFYPSRSSSYAYVPCDSEHCRYFPY-ARCSAYKHRCIYTQLYLIGPETSVFVST 154
            P+    F P  SSSY+ V CD   C      A C+   H C +   Y  G   +  ++ 
Sbjct: 136 PPEAVVYFNPFDSSSYSRVGCDGPSCLALATNASCNGDSHACDFRYSYRDGASATGLLAA 195

Query: 155 EQLTFK-NTDESTIHVQDVVFGCGFST-NRNFKFSGIFGLGIGRSSLVSQLNSSFSYCIG 212
           +  TF  N +  T     + FGC   T  R F+  G+ GLG G  SL SQL   FS+C+ 
Sbjct: 196 DTFTFGGNINNDTTSTASIDFGCATGTAGREFQADGMVGLGAGPLSLASQLGRKFSFCL- 254

Query: 213 SLHDPDYLHNKLILGDGAII-DEGDA-TPL----QFIDGHYYITLEAISVDGRMLDINPN 266
           + +D D   + L  G  A++ D G A TPL         +Y I+++++ V G+ +    +
Sbjct: 255 TAYDIDDASSILNFGARAVVSDPGAATTPLIASSSNAAAYYAISIDSLKVAGQPVPGTTS 314

Query: 267 IFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIR-LEGEQMRSYSWPD---KLCYHG 322
           + K       V++D+GT +T+L + A  A   E + R ++G  +     PD   +LCY  
Sbjct: 315 VSK-------VIVDTGTVLTFLDRAALLAPLTESLARVMDGAGLPRAPPPDETLELCYDV 367

Query: 323 IMSSDLKGF---PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLI 379
               D+ G     T+     GG ++ L  +  F   +    C+AV   + +      S++
Sbjct: 368 SRVKDVDGVIPDVTLVLGGGGGGEVRLTGEGTFVLVKEGVLCLAV--VTTSPELQPLSVL 425

Query: 380 GMMAQQFYNVGYDIGRKQKTFQRMDCE 406
           G +A Q  +VG D+  +  TF   +C+
Sbjct: 426 GNVALQDLHVGIDLDARTATFATANCD 452


>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
          Length = 440

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 103/363 (28%), Positives = 160/363 (44%), Gaps = 28/363 (7%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + ++++IG PP P    +DTGS L+W  C PC  C  Q    +  SRSS++A   CDS  
Sbjct: 91  YLLHLAIGTPPQPVQLTLDTGSDLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDSTQ 150

Query: 121 CRYFPYARCSAYK--HRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
           C+  P       +    C ++  Y     T  F+  E ++F     +   V  VVFGCG 
Sbjct: 151 CKLDPSVTMCVNQTVQTCAFSYSYGDKSATIGFLDVETVSF----VAGASVPGVVFGCGL 206

Query: 179 STNRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSL--HDPDYLHNKLIL-----GD 228
           +    F+   +GI G G G  SL SQL   +FS+C  ++    P  +   L       G 
Sbjct: 207 NNTGIFRSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVSGRKPSTVLFDLPADLYKNGR 266

Query: 229 GAIIDEGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDV 285
           G +      TPL     H   YY++L+ I+V    L +  + F   +  GG +IDSGT  
Sbjct: 267 GTV----QTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALKNGTGGTIIDSGTAF 322

Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLA 345
           T L    Y  + DE    ++   + S      LC+           P +  HF  GA + 
Sbjct: 323 TSLPPRVYRLVHDEFAAHVKLPVVPSNETGPLLCFSAPPLGKAPHVPKLVLHFE-GATMH 381

Query: 346 LEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           L +++  ++ +    C ++  A I       ++IG   QQ  +V YD+   + +F R  C
Sbjct: 382 LPRENYVFEAKDGGNC-SICLAIIEGE---MTIIGNFQQQNMHVLYDLKNSKLSFVRAKC 437

Query: 406 EVL 408
           + L
Sbjct: 438 DKL 440


>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
 gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
          Length = 511

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 106/379 (27%), Positives = 164/379 (43%), Gaps = 43/379 (11%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           +YV + +G P V     MDTGS + W+ C PC+DC P L   F P  SSS+  +PC S  
Sbjct: 139 YYVPLQVGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFNPRHSSSFFKLPCASST 198

Query: 121 CRYF-----PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT----DESTIHVQD 171
           C        P+  CS     C+++  Y  G  +S  ++ E +   NT    D   + + +
Sbjct: 199 CTNVYQGVKPF--CSPSGRTCLFSIQYGDGSLSSGLLAMETIA-GNTPNFGDGEPVKLSN 255

Query: 172 VVFGCGFSTNRNFK--FSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHN--- 222
           +  GC            SG+ G+     S  SQL+S     FS+C      PD + +   
Sbjct: 256 ITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCF-----PDKIAHLNS 310

Query: 223 --KLILGDGAIID---------EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRD 271
              +  G+  II          +  A P   +D +YY+ L  ISVD   L ++   F  D
Sbjct: 311 SGLVFFGESDIISPYLRYTPLVQNPAVPSASLD-YYYVGLVGISVDESRLPLSHKNFDID 369

Query: 272 D--SGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYH---GIMSS 326
                GG +IDSGT  T+L K A++A+R E + R         +     CY+   G  + 
Sbjct: 370 KVTGSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFTPCYNITSGTAAL 429

Query: 327 DLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQF 386
           +    P++  HFRGG  + L K+S+            +  A + +  + F++IG   QQ 
Sbjct: 430 ESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFLMSGDIPFNIIGNYQQQN 489

Query: 387 YNVGYDIGRKQKTFQRMDC 405
             V YD+ + +       C
Sbjct: 490 LWVEYDLEKLRLGIAPAQC 508


>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
 gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
          Length = 468

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 116/433 (26%), Positives = 187/433 (43%), Gaps = 52/433 (12%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKI-RSHNTYQAQILPSNDISNS 59
           L+HR    +P    ++ P+    R +  + AR  Y+  ++ +      A +     +  S
Sbjct: 60  LVHRHGPCAPTQLSSDKPSSFTDR-LRRNRARSKYIMSRVSKGMMGDDADVSIPTHLGGS 118

Query: 60  L----FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD--CSPQLGTIFYPSRSSSYAY 113
           +    + V + +G P V Q   +DTGS L WV C PC    C PQ   +F PS+SS+YA 
Sbjct: 119 VDSLEYVVTVGLGTPSVSQVLLIDTGSDLSWVQCQPCNSTTCYPQKDPLFDPSKSSTYAP 178

Query: 114 VPCDSEHCR------YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI 167
           +PC+++ CR      Y           +C +   Y  G +T    S E L         +
Sbjct: 179 IPCNTDACRDLTDDGYGGGCASGDGAAQCGFAITYGDGSQTRGVYSNETLALA----PGV 234

Query: 168 HVQDVVFGCGFSTN-RNFKFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHD------ 216
            V+D  FGCG   +  N K+ G+ GLG    SLV Q  S    +FSYC+ +L++      
Sbjct: 235 AVKDFRFGCGHDQDGANDKYDGLLGLGGAPESLVVQTASVYGGAFSYCLPALNNQVGFLA 294

Query: 217 ---PDYLHNKLILGDGAIIDEGDATPL-QFIDGHYYITLEAISVDGRMLDINPNIFKRDD 272
                     ++   G +      TP+ +  +  Y + +  I+V G  +D+ P+ F    
Sbjct: 295 LGGGGAPSGGVVNTSGFVF-----TPMIREEETFYVVNMTGITVGGEPIDVPPSAFS--- 346

Query: 273 SGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFP 332
             GG++IDSGT VT L   AY AL+      +    +      D  CY     S++   P
Sbjct: 347 --GGMIIDSGTVVTELQHTAYNALQAAFRKAMAAYPLVRNGELDT-CYDFSGYSNVT-LP 402

Query: 333 TVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
            V   F GGA + L+  +          C+A   +  +++     ++G + Q+   V YD
Sbjct: 403 KVALTFSGGATIDLDVPNGILLDD----CLAFQESGPDDQP---GILGNVNQRTLEVLYD 455

Query: 393 IGRKQKTFQRMDC 405
            GR +  F+   C
Sbjct: 456 AGRGRVGFRAAVC 468


>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 484

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 115/370 (31%), Positives = 174/370 (47%), Gaps = 25/370 (6%)

Query: 44  NTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIF 103
           N  Q  ++      +  +++ + IG+PP   +  +DTGS + W+ C PC +C  Q   IF
Sbjct: 132 NALQGPVVSGTSQGSGEYFLRVGIGKPPSQAYVVLDTGSDVSWIQCAPCSECYQQSDPIF 191

Query: 104 YPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD 163
            P  S+SY+ + CD   C+    + C      C+Y   Y  G  T    +TE +T     
Sbjct: 192 DPISSNSYSPIRCDEPQCKSLDLSEC--RNGTCLYEVSYGDGSYTVGEFATETVTL---- 245

Query: 164 ESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLH 221
             +  V++V  GCG +    F   +G+ GLG G+ S  +Q+N +SFSYC+ +  D D + 
Sbjct: 246 -GSAAVENVAIGCGHNNEGLFVGAAGLLGLGGGKLSFPAQVNATSFSYCLVN-RDSDAVS 303

Query: 222 NKLILGDGAIIDEGDATPLQF---IDGHYYITLEAISVDGRMLDINPNIFKRD-DSGGGV 277
              +  +  +       PL     +D  YY+ L+ ISV G  L I  + F+ D   GGG+
Sbjct: 304 T--LEFNSPLPRNAATAPLMRNPELDTFYYLGLKGISVGGEALPIPESSFEVDAIGGGGI 361

Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRF 336
           +IDSGT VT L  E Y+ALRD  +   +G  +    S  D  CY  + S +    PTV F
Sbjct: 362 IIDSGTAVTRLRSEVYDALRDAFVKGAKGIPKANGVSLFDT-CYD-LSSRESVEIPTVSF 419

Query: 337 HFRGGAKLAL-EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGR 395
            F  G +L L  ++ +        FC A  P +      + S+IG + QQ   VG+DI  
Sbjct: 420 RFPEGRELPLPARNYLIPVDSVGTFCFAFAPTT-----SSLSIIGNVQQQGTRVGFDIAN 474

Query: 396 KQKTFQRMDC 405
               F    C
Sbjct: 475 SLVGFSVDSC 484


>gi|115434870|ref|NP_001042193.1| Os01g0178600 [Oryza sativa Japonica Group]
 gi|55296112|dbj|BAD67831.1| putative CDR1 [Oryza sativa Japonica Group]
 gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa Japonica Group]
 gi|113531724|dbj|BAF04107.1| Os01g0178600 [Oryza sativa Japonica Group]
 gi|125569253|gb|EAZ10768.1| hypothetical protein OsJ_00604 [Oryza sativa Japonica Group]
          Length = 454

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 110/375 (29%), Positives = 174/375 (46%), Gaps = 38/375 (10%)

Query: 53  SNDISNSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG---TIFYPSRS 108
           S  +S S  Y+  +++G PP       DTGS L+WV C    + +       T F PSRS
Sbjct: 92  SKVVSRSFEYLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDPSRS 151

Query: 109 SSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES--- 165
           S+Y  V C ++ C     A C    + C Y   Y  G  T+  +STE  TF +       
Sbjct: 152 STYGRVSCQTDACEALGRATCDDGSN-CAYLYAYGDGSNTTGVLSTETFTFDDGGSGRSP 210

Query: 166 -TIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPD 218
             + V  V FGC  +T  +F   G+ GLG G  SLV+QL  +      FSYC+     P 
Sbjct: 211 RQVRVGGVKFGCSTATAGSFPADGLVGLGGGAVSLVTQLGGATSLGRRFSYCL----VPH 266

Query: 219 YLHNKLILGDGAIIDEGD----ATPLQF--IDGHYYITLEAISVDGRMLDINPNIFKRDD 272
            ++    L  GA+ D  +    +TPL    +D +Y + L+++ V  + +           
Sbjct: 267 SVNASSALNFGALADVTEPGAASTPLVAGDVDTYYTVVLDSVKVGNKTV--------ASA 318

Query: 273 SGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYH--GIMSSDLKG 330
           +   +++DSGT +T+L       + DE+  R+    ++S     +LCY+  G      + 
Sbjct: 319 ASSRIIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQSPDGLLQLCYNVAGREVEAGES 378

Query: 331 FPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
            P +   F GGA +AL+ ++ F   +    C+A+  A+   + V  S++G +AQQ  +VG
Sbjct: 379 IPDLTLEFGGGAAVALKPENAFVAVQEGTLCLAIV-ATTEQQPV--SILGNLAQQNIHVG 435

Query: 391 YDIGRKQKTFQRMDC 405
           YD+     TF   DC
Sbjct: 436 YDLDAGTVTFAGADC 450


>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
          Length = 428

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 120/416 (28%), Positives = 179/416 (43%), Gaps = 35/416 (8%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           + H +S  SP+  PN       +  +    ARL YL    +  +     I     I  S 
Sbjct: 36  VFHVNSPCSPFKQPN---TVSWESTLLKDKARLQYLSSLAKKPS---VPIASGRAIVQSP 89

Query: 61  FY-VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSE 119
            Y V  +IG P  P   A+DT +   WV C  C  C+  +  +F PS+SSS   + CD+ 
Sbjct: 90  TYIVRANIGTPAQPMLVALDTSNDAAWVPCSGCVGCASSV--LFDPSKSSSSRNLQCDAP 147

Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC-GF 178
            C+  P   C+A K  C +   Y  G      ++ + LT  N       ++   FGC   
Sbjct: 148 QCKQAPNPTCTAGKS-CGFNMTY-GGSTIEASLTQDTLTLAND-----VIKSYTFGCISK 200

Query: 179 STNRNFKFSGIFGLGIGRSSLVSQLN----SSFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
           +T  +    G+ GLG G  SL+SQ      S+FSYC+ +    ++    L LG       
Sbjct: 201 ATGTSLPAQGLMGLGRGPLSLISQTQNLYMSTFSYCLPNSKSSNF-SGSLRLGPKYQPVR 259

Query: 235 GDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDS-GGGVMIDSGTDVTWLVK 290
              TPL         YY+ L  I V  +++DI  +    D S G G + DSGT  T LV+
Sbjct: 260 IKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDASTGAGTIFDSGTVFTRLVE 319

Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS 350
            AY A+R+E   R++     S    D  CY G +      +P+V F F  G  + L  D+
Sbjct: 320 PAYVAVRNEFRRRIKNANATSLGGFDT-CYSGSVV-----YPSVTFMF-AGMNVTLPPDN 372

Query: 351 MF-YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           +  +       C+A+  A+ NN     ++I  M QQ + V  D+   +    R  C
Sbjct: 373 LLIHSSSGSTSCLAM-AAAPNNVNSVLNVIASMQQQNHRVLIDLPNSRLGISRETC 427


>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 474

 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 133/447 (29%), Positives = 203/447 (45%), Gaps = 64/447 (14%)

Query: 3   HRDSILSPY-NNPNENPAHRVQRGINIS-IARLAYLQEKIRSHNTYQ----------AQI 50
           H  S  SP  N P++     V  G+  S  AR++ LQ +I S+ +            A  
Sbjct: 47  HISSSFSPGPNRPSKTSRGEVDGGVLSSDAARVSSLQRRIESYRSSSEGEEEEASKLALQ 106

Query: 51  LPSNDISN--SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRS 108
           +P    +N  +L YV  ++G         +DT S L WV C PC  C  Q   +F PS S
Sbjct: 107 VPITSGANLRTLNYV-ATVGLGAAEATVVVDTASELTWVQCQPCESCHDQQDPLFDPSSS 165

Query: 109 SSYAYVPCDSEHCRYF---------PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTF 159
            SYA VPC+S  C            P A  +  +  C Y   Y  G  +   ++ ++L  
Sbjct: 166 PSYAAVPCNSSSCDALRVAMAAGTSPCADDNEQQPACSYALSYRDGSYSRGVLARDKLRL 225

Query: 160 KNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGL-GIGRS--SLVS----QLNSSFSYCI- 211
              D     ++  VFGCG ++N+   F G  GL G+GRS  SLVS    Q    FSYC+ 
Sbjct: 226 AGQD-----IEGFVFGCG-TSNQGAPFGGTSGLMGLGRSHVSLVSQTMDQFGGVFSYCLP 279

Query: 212 -------GSLHDPD----YLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRM 260
                  GSL   D    Y ++  I+    + D G   PLQ     Y++ L  I+V G+ 
Sbjct: 280 MRESGSSGSLVLGDDSSAYRNSTPIVYTAMVSDSG---PLQ--GPFYFLNLTGITVGGQ- 333

Query: 261 LDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRL-EGEQMRSYSWPDKLC 319
            ++    F    S G V+IDSGT +T LV   Y A+R E + +L E  Q  ++S  D  C
Sbjct: 334 -EVESPWF----SAGRVIIDSGTIITTLVPSVYNAVRAEFLSQLAEYPQAPAFSILDT-C 387

Query: 320 YHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLI 379
           ++     +++  P+++F F G  ++ ++   + Y    DA  + +  AS+ + Y + S+I
Sbjct: 388 FNLTGLKEVQ-VPSLKFVFEGSVEVEVDSKGVLYFVSSDASQVCLALASLKSEY-DTSII 445

Query: 380 GMMAQQFYNVGYDIGRKQKTFQRMDCE 406
           G   Q+   V +D    Q  F +  C+
Sbjct: 446 GNYQQKNLRVIFDTLGSQIGFAQETCD 472


>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 493

 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 102/384 (26%), Positives = 168/384 (43%), Gaps = 57/384 (14%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
           L+Y  + +G PPV     +DTGS +LWV C  C  C    G       F P  SS+ + +
Sbjct: 77  LYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCNGCPQTSGLQIQLNFFDPGSSSTSSMI 136

Query: 115 PCDSEHC---RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
            C  + C   +    A CS+  ++C YT  Y  G  TS +  ++ +      E ++    
Sbjct: 137 ACSDQRCNNGKQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSMTTNS 196

Query: 172 ---VVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYC------- 210
              VVFGC     G  T  +    GIFG G    S++SQL+S       FS+C       
Sbjct: 197 TAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRIFSHCLKGDSSG 256

Query: 211 -----IGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINP 265
                +G + +P+ ++  L+           A P      HY + L++ISV+G+ L I+ 
Sbjct: 257 GGILVLGEIVEPNIVYTSLV----------PAQP------HYNLNLQSISVNGQTLQIDS 300

Query: 266 NIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMS 325
           ++F   +S  G ++DSGT + +L +EAY+     +   +  + +R+       CY  I S
Sbjct: 301 SVFATSNS-RGTIVDSGTTLAYLAEEAYDPFVSAITAAIP-QSVRTVVSRGNQCYL-ITS 357

Query: 326 SDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP--DAFCMAVNPASINNRYVNFSLIGMMA 383
           S    FP V  +F GGA + L       Q      A    +    I  + +  +++G + 
Sbjct: 358 SVTDVFPQVSLNFAGGASMILRPQDYLIQQNSIGGAAVWCIGFQKIQGQGI--TILGDLV 415

Query: 384 QQFYNVGYDIGRKQKTFQRMDCEV 407
            +   V YD+  ++  +   DC +
Sbjct: 416 LKDKIVVYDLAGQRIGWANYDCSL 439


>gi|125532788|gb|EAY79353.1| hypothetical protein OsI_34482 [Oryza sativa Indica Group]
          Length = 394

 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 95/361 (26%), Positives = 160/361 (44%), Gaps = 31/361 (8%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           +  N +IG PP P    +D    L+W  C  C  C  Q   +F P+ S++Y   PC +  
Sbjct: 51  YVANFTIGTPPQPASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPCGTPL 110

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C   P    +   + C Y Q      +T   V T+         S      + FGC  ++
Sbjct: 111 CESIPSDSRNCSGNVCAY-QASTNAGDTGGKVGTDTFAVGTAKAS------LAFGCVVAS 163

Query: 181 NRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGD- 236
           + +     SGI GLG    SLV+Q   ++FSYC+   HD    ++ L LG  A +  G  
Sbjct: 164 DIDTMGGPSGIVGLGRTPWSLVTQTGVAAFSYCLAP-HDAGK-NSALFLGSSAKLAGGGK 221

Query: 237 --ATPLQFIDG-------HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
             +TP   I G       +Y + LE +     M+ + P       SG  V++D+ + +++
Sbjct: 222 AASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPP-------SGSTVLLDTFSPISF 274

Query: 288 LVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALE 347
           LV  AY+A++  V + +    M +   P  LC+    +S     P + F FRGGA + + 
Sbjct: 275 LVDGAYQAVKKAVTVAVGAPPMATPVEPFDLCFPKSGASGAA--PDLVFTFRGGAAMTVA 332

Query: 348 KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
             +     +    C+A+  ++  N     SL+G + Q+  +  +D+ ++  +F+  DC  
Sbjct: 333 ASNYLLDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPADCTK 392

Query: 408 L 408
           L
Sbjct: 393 L 393


>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
 gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
          Length = 436

 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 99/367 (26%), Positives = 168/367 (45%), Gaps = 31/367 (8%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSE 119
           +  V++ IG PP  Q   +DTGS L W+ C+      P   T+F PS SSS++ +PC+  
Sbjct: 76  ILLVSLPIGTPPQSQQMILDTGSQLSWIQCHKKVPRKPPPSTVFDPSLSSSFSVLPCNHP 135

Query: 120 HCRY----FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
            C+     F           C Y+  Y  G      +  E++TF +T +ST     ++ G
Sbjct: 136 LCKPRIPDFTLPTSCDLNRLCHYSYFYADGTLAEGNLVREKITF-STSQST---PPLILG 191

Query: 176 CGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLH-DPDYL-HNKLILGDG--- 229
           C    + +    GI G+ +GR S  SQ   + FSYC+ +    P +       LG+    
Sbjct: 192 CAEDASDD---KGILGMNLGRLSFASQAKITKFSYCVPTRQVRPGFTPTGSFYLGENPNS 248

Query: 230 ------AIIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMIDS 281
                 +++    +  +  +D   + + L+ I +  + L+I  + F+ D SG G  MIDS
Sbjct: 249 AGFQYISLLTFSQSQRMPNLDPLAHTVALQGIRIGNKKLNIPVSAFRADPSGAGQSMIDS 308

Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMRS---YSWPDKLCYHGIMSSDLKGFPTVRFHF 338
           G++ T+LV  AY  +R+EV +RL G +++    YS    +C+ G      +    + F F
Sbjct: 309 GSEFTYLVDVAYNKVREEV-VRLAGPRLKKGYVYSGVSDMCFDGNAMEIGRLIGNMVFEF 367

Query: 339 RGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQK 398
             G ++ +EK  +         C+ +  + +     N  +IG   QQ   V +DI  ++ 
Sbjct: 368 DKGVEIVIEKGRVLADVGGGVHCVGIGRSEMLGAASN--IIGNFHQQNLWVEFDIANRRV 425

Query: 399 TFQRMDC 405
            F + DC
Sbjct: 426 GFGKADC 432


>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 414

 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 125/433 (28%), Positives = 192/433 (44%), Gaps = 52/433 (12%)

Query: 3   HRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSH------NTYQAQILPSNDI 56
           HRD   S   + + N   ++Q+ + +   R+  LQ +I+S       +   +QI  S+ +
Sbjct: 3   HRDFCNSSGKSTDWN--KKLQKSLILDDFRVRSLQSRIKSIFSGNNIDALDSQIPLSSGV 60

Query: 57  S-NSLFY-VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYV 114
              +L Y V + IG   +     +DTGS L WV C PCR C  Q   +F PS S SY  +
Sbjct: 61  RLQTLNYIVTVEIGGRNMT--VIVDTGSDLTWVQCQPCRLCYNQQDPLFNPSGSPSYQTI 118

Query: 115 PCDSEHCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHV 169
            C+S  C+   YA      C +    C Y   Y  G  T   +  EQL        T HV
Sbjct: 119 LCNSSTCQSLQYATGNLGVCGSNTPTCNYVVNYGDGSYTRGDLGMEQLNL-----GTTHV 173

Query: 170 QDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKL 224
            + +FGCG +    F   SG+ GLG    SLVSQ ++     FSYC+ +          L
Sbjct: 174 SNFIFGCGRNNKGLFGGASGLMGLGKSDLSLVSQTSAIFEGVFSYCLPTTAAD--ASGSL 231

Query: 225 ILGDGAIIDEGDATPLQF--------IDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
           ILG  + + + + TP+ +        +   Y++ L  IS+ G  L   PN  +      G
Sbjct: 232 ILGGNSSVYK-NTTPISYTRMIANPQLPTFYFLNLTGISIGGVALQA-PNYRQS-----G 284

Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
           ++IDSGT +T L    Y  L+ E + +  G      +S  D  C++ +   D    PT+R
Sbjct: 285 ILIDSGTVITRLPPPVYRDLKAEFLKQFSGFPSAPPFSILDT-CFN-LNGYDEVDIPTIR 342

Query: 336 FHFRGGAKLALEKDSMFYQPRPDA--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
             F G A+L ++   +FY  + DA   C+A+   S ++      +IG   Q+   V Y+ 
Sbjct: 343 MQFEGNAELTVDVTGIFYFVKTDASQVCLALASLSFDDE---IPIIGNYQQRNQRVIYNT 399

Query: 394 GRKQKTFQRMDCE 406
              +  F    C 
Sbjct: 400 KESKLGFAAEACS 412


>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
 gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
          Length = 474

 Score =  122 bits (305), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 112/401 (27%), Positives = 169/401 (42%), Gaps = 59/401 (14%)

Query: 32  RLAYLQEKIRSHNTYQ---------AQILPSN---DISNSLFYVNISIGQPPVPQFTAMD 79
           R  Y+  ++    T Q            +P+N   +I    + V +S+G P V Q   +D
Sbjct: 99  RAEYILRRVSGRGTPQLWDSKAEAATATVPANWGFNIGTLNYVVTVSLGTPGVAQTLEVD 158

Query: 80  TGSSLLWVHCYPCR--DCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCI 137
           TGS L WV C PC    C  Q   +F P++SSSYA VPC    C        S    +C 
Sbjct: 159 TGSDLSWVQCTPCAAPACYSQKDPLFDPAQSSSYAAVPCGGPVCGGLGIYASSCSAAQCG 218

Query: 138 YTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRS 197
           Y   Y  G +T+   S++ LT    D     V+   FGCG + +      G+ GLG   +
Sbjct: 219 YVVSYGDGSKTTGVYSSDTLTLSPNDA----VRGFFFGCGHAQSGFTGNDGLLGLGREEA 274

Query: 198 SLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDG-----HYY 248
           SLV Q   +    FSYC+ +   P       + G       G +T  Q +       +Y 
Sbjct: 275 SLVEQTAGTYGGVFSYCLPT--RPSTTGYLTLGGPSGAAPPGFST-TQLLSSPNAATYYV 331

Query: 249 ITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQ 308
           + L  ISV G+ L +  ++F      GG ++D+GT +T L   AY ALR           
Sbjct: 332 VMLTGISVGGQQLSVPSSVFA-----GGTVVDTGTVITRLPPTAYAALRSAFR-----SG 381

Query: 309 MRSYSWPDKLCYHGIMSS--DLKGF-----PTVRFHFRGGAKLALEKDSMFYQPRPDAFC 361
           M SY +P      GI+ +  +  G+     P V   F GGA + L  D +         C
Sbjct: 382 MASYGYPSAPA-TGILDTCYNFSGYGTVTLPNVALTFSGGATVTLGADGIL-----SFGC 435

Query: 362 MAVNPASINNRYVNFSLIGMMAQQFYNV---GYDIGRKQKT 399
           +A  P+  +      +++G + Q+ + V   G  +G K  +
Sbjct: 436 LAFAPSGSDG---GMAILGNVQQRSFEVRIDGTSVGFKPSS 473


>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 459

 Score =  121 bits (304), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 102/373 (27%), Positives = 167/373 (44%), Gaps = 37/373 (9%)

Query: 54  NDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRS 108
           +  +  L+Y  I +G PP   +  +DTGS + WV+C PC +C          +IF P +S
Sbjct: 41  DTFTTGLYYTRIYLGTPPQQFYVHVDTGSDVAWVNCVPCTNCKRASNVALPISIFDPEKS 100

Query: 109 SSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT----DE 164
           +S   + C  E C     ++CS     C Y+ LY  G  T+ ++  + L+F         
Sbjct: 101 TSKTSISCTDEECYLASNSKCSFNSMSCPYSTLYGDGSSTAGYLINDVLSFNQVPSGNST 160

Query: 165 STIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPD 218
           +T     + FGCG +    +   G+ G G    SL SQL+        F++C   L   +
Sbjct: 161 ATSGTARLTFGCGSNQTGTWLTDGLVGFGQAEVSLPSQLSKQNVSVNIFAHC---LQGDN 217

Query: 219 YLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
                L++G   I + G   TP+     HY + L  I V G  +   P  F   +S GGV
Sbjct: 218 KGSGTLVIGH--IREPGLVYTPIVPKQSHYNVELLNIGVSGTNV-TTPTAFDLSNS-GGV 273

Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG-FPTVRF 336
           ++DSGT +T+LV+ AY+  + +V      + MRS   P    +       ++G FP V  
Sbjct: 274 IMDSGTTLTYLVQPAYDQFQAKVR-----DCMRSGVLPVAFQFF----CTIEGYFPNVTL 324

Query: 337 HFRGGAKLALEKDSMFYQPR----PDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
           +F GGA + L   S  Y+        A+C +   ++    Y+++++ G    +   V YD
Sbjct: 325 YFAGGAAMLLSPSSYLYKEMLTTGLSAYCFSWLESTSVYGYLSYTIFGDNVLKDQLVVYD 384

Query: 393 IGRKQKTFQRMDC 405
               +  ++  DC
Sbjct: 385 NVNNRIGWKNFDC 397


>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
 gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
          Length = 382

 Score =  121 bits (304), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 112/383 (29%), Positives = 172/383 (44%), Gaps = 38/383 (9%)

Query: 30  IARLAYLQEKIRSHNT-------YQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGS 82
           + R+  L  ++ S +T       + ++++   D  +  ++V I +G PP  Q+  +D+GS
Sbjct: 5   VKRVVSLIRRVSSGSTASYGVEDFGSEVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGS 64

Query: 83  SLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLY 142
            ++WV C PC  C  Q   +F P+ S+S+  V C S  C     A C++   RC Y   Y
Sbjct: 65  DIVWVQCKPCTQCYHQTDPLFDPADSASFMGVSCSSAVCDQVDNAGCNS--GRCRYEVSY 122

Query: 143 LIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRS-SLVS 201
             G  T   ++ E LT   T      VQ+V  GCG      F  +       G S S V 
Sbjct: 123 GDGSSTKGTLALETLTLGRT-----VVQNVAIGCGHMNQGMFVGAAGLLGLGGGSMSFVG 177

Query: 202 QLN----SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLEAI 254
           QL+    ++FSYC+  +      +  L  G  A+       PL        +YYI L  +
Sbjct: 178 QLSRERGNAFSYCL--VSRVTNSNGFLEFGSEAMPVGAAWIPLIRNPHSPSYYYIGLSGL 235

Query: 255 SVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYS 313
            V    + I+ +IF+  + G GGV++D+GT VT     AYEA RD  + +       S  
Sbjct: 236 GVGDMKVPISEDIFELTELGNGGVVMDTGTAVTRFPTVAYEAFRDAFIDQTGNLPRASGV 295

Query: 314 WPDKLCYH--GIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDA--FCMAVNPASI 369
                CY+  G +S  +   PTV F+F GG  L L  ++ F  P  DA  FC A  P+  
Sbjct: 296 SIFDTCYNLFGFLSVRV---PTVSFYFSGGPILTLPANN-FLIPVDDAGTFCFAFAPSP- 350

Query: 370 NNRYVNFSLIGMMAQQFYNVGYD 392
                  S++G + Q+   +  D
Sbjct: 351 ----SGLSILGNIQQEGIQISVD 369


>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Glycine max]
          Length = 392

 Score =  121 bits (304), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 118/411 (28%), Positives = 172/411 (41%), Gaps = 49/411 (11%)

Query: 26  INISIARLAYLQEKIRSH----NTYQ---AQILPSND---ISNSLFYVNISIGQPPVPQF 75
           +N+   R+ Y+Q ++  +    NT +   +  LP+     I ++ + V + +G P     
Sbjct: 1   MNLDNERVKYIQSRLSKNLGRENTVKDLDSTTLPAESGSLIGSANYVVVVGLGTPKRDLS 60

Query: 76  TAMDTGSSLLWVHCYPCR-DCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFP----YARCS 130
              DTGS L W  C PC   C  Q   IF PS+SSSY  + C S  C         + CS
Sbjct: 61  LVFDTGSDLTWTQCEPCAGSCYKQQDAIFDPSKSSSYTNITCTSSLCTQLTSDGIKSECS 120

Query: 131 AYK-HRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFS-G 188
           +     CIY   Y     +  F+S E+LT   TD     V D +FGCG      F  S G
Sbjct: 121 SSTDASCIYDAKYGDNSTSVGFLSQERLTITATD----IVDDFLFGCGQDNEGLFNGSAG 176

Query: 189 IFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEGDA-TPLQFI 243
           + GLG    S+V Q +S+    FSYC+ +          L  G  A  +     TPL  I
Sbjct: 177 LMGLGRHPISIVQQTSSNYNKIFSYCLPATSSS---LGHLTFGASAATNASLIYTPLSTI 233

Query: 244 DGH---YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEV 300
            G    Y + + +ISV G  L   P +     S GG +IDSGT +T L    Y ALR   
Sbjct: 234 SGDNSFYGLDIVSISVGGTKL---PAVSSSTFSAGGSIIDSGTVITRLAPTVYAALRSAF 290

Query: 301 MIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----PTVRFHFRGGAKLALEKDSMFYQP 355
              +E   + + +     CY      DL G+     P + F F GG  + L    +    
Sbjct: 291 RRXMEKYPVANEAGLLDTCY------DLSGYKEISVPRIDFEFSGGVTVELXHRGILXVE 344

Query: 356 RPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
                C+A      +N   + ++ G + Q+   V YD+   +  F    C+
Sbjct: 345 SEQQVCLAFAANGSDN---DITVFGNVQQKTLEVVYDVKGGRIGFGAAGCK 392


>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
 gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
          Length = 510

 Score =  121 bits (304), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 107/383 (27%), Positives = 164/383 (42%), Gaps = 51/383 (13%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           +YV + +G P V     MDTGS + W+ C PC+DC P L   F P  SSS+  +PC S  
Sbjct: 138 YYVPLQLGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFNPRHSSSFFKLPCASST 197

Query: 121 CRYF-----PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT----DESTIHVQD 171
           C        P+  CS     C+++  Y  G  +S  ++ E +   NT    D   + + +
Sbjct: 198 CTNVYQGVKPF--CSPSGRTCLFSIQYGDGSLSSGLLAMETIA-GNTPNFGDGEPVKLSN 254

Query: 172 VVFGCGFSTNRNFK--FSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHN--- 222
           +  GC            SG+ G+     S  SQL+S     FS+C      PD + +   
Sbjct: 255 ITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCF-----PDKIAHLNS 309

Query: 223 --KLILGDGAIID---------EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRD 271
              +  G+  II          +  A P   +D +YY+ L  ISVD   L ++   F  D
Sbjct: 310 SGLVFFGESDIISPYLRYTPLVQNPAVPSASLD-YYYVGLVGISVDESRLPLSHKNFDID 368

Query: 272 D--SGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYH---GIMSS 326
                GG +IDSGT  T+L K A++A+R E + R         +     CY+   G  + 
Sbjct: 369 KVTGSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFTPCYNITSGTAAL 428

Query: 327 DLKGFPTVRFHFRGGAKLALEKDSMFY----QPRPDAFCMAVNPASINNRYVNFSLIGMM 382
           +    P++  HFRGG  + L K+S+             C+A       +  + F++IG  
Sbjct: 429 ESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFQ----MSGDIPFNIIGNY 484

Query: 383 AQQFYNVGYDIGRKQKTFQRMDC 405
            QQ   V YD+ + +       C
Sbjct: 485 QQQNLWVEYDLEKLRLGIAPAQC 507


>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
          Length = 484

 Score =  121 bits (304), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 99/372 (26%), Positives = 169/372 (45%), Gaps = 34/372 (9%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
           L++  + +G PP   +  +DTGS +LWV C  C  C    G       F P  SS+ + +
Sbjct: 67  LYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASLI 126

Query: 115 PCDSEHCRYFPY---ARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI--HV 169
            C  + C        A CS+  ++CIYT  Y  G  TS +  ++ L F     S++    
Sbjct: 127 SCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTNSS 186

Query: 170 QDVVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPD 218
             +VFGC  S     T  +    GIFG G    S++SQ++S       FS+C+       
Sbjct: 187 ASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDGGGG 246

Query: 219 YLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVM 278
            +     + +  I+     +PL     HY + L++ISV+G+ L I+P +F    +  G +
Sbjct: 247 GILVLGEIVEEDIV----YSPLVPSQPHYNLNLQSISVNGKSLAIDPEVFAT-STNRGTI 301

Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG-FPTVRFH 337
           +DSGT + +L +EAY+     +   +  + +R        CY  +++S +KG FPTV  +
Sbjct: 302 VDSGTTLAYLAEEAYDPFVSAITEAVS-QSVRPLLSKGTQCY--LITSSVKGIFPTVSLN 358

Query: 338 FRGGAKLALEKDSMFYQPRP--DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGR 395
           F GG  + L+ +    Q     DA    +    I  +    +++G +  +     YD+  
Sbjct: 359 FAGGVSMNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQ--GITILGDLVLKDKIFVYDLAG 416

Query: 396 KQKTFQRMDCEV 407
           ++  +   DC +
Sbjct: 417 QRIGWANYDCSM 428


>gi|125528511|gb|EAY76625.1| hypothetical protein OsI_04577 [Oryza sativa Indica Group]
          Length = 492

 Score =  121 bits (304), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 114/370 (30%), Positives = 162/370 (43%), Gaps = 49/370 (13%)

Query: 57  SNSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
           +N+  YV +  IG PP     A+D  S L+W  C      +P     F P RS++ A VP
Sbjct: 95  TNAGMYVFSYGIGTPPQQVSGALDISSDLVWTAC---GATAP-----FNPVRSTTVADVP 146

Query: 116 CDSEHCRYFPYARC----SAYKHRCIYTQLYLIG-PETSVFVSTEQLTFKNTDESTIHVQ 170
           C  + C+ F    C     A    C YT +Y  G   T+  + TE  TF +T      + 
Sbjct: 147 CTDDACQQFAPQTCGAGAGAGSSECAYTYMYGGGAANTTGLLGTEAFTFGDT-----RID 201

Query: 171 DVVFGCGFSTNRNFK-FSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGD 228
            VVFGCG     +F   SG+ GLG G  SLVSQL    FSY        D   + ++ GD
Sbjct: 202 GVVFGCGLQNVGDFSGVSGVIGLGRGNLSLVSQLQVDRFSYHFAPDDSVD-TQSFILFGD 260

Query: 229 GAIIDEGDATP---------LQFIDGH---YYITLEAISVDGRMLDINPNIF--KRDDSG 274
                  DATP         L   D +   YY+ L  I VDG+ L I    F  +  D  
Sbjct: 261 -------DATPQTSHTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGS 313

Query: 275 GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTV 334
           GGV +     VT L + AY+ LR  V  ++    +   +    LCY G   +  K  P++
Sbjct: 314 GGVFLSITDLVTVLEEAAYKPLRQAVASKIGLPAVNGSALGLDLCYTGESLAKAK-VPSM 372

Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAF-CMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
              F GGA + LE  + FY        C+ + P+S  +     S++G + Q   ++ YDI
Sbjct: 373 ALVFAGGAVMELELGNYFYMDSTTGLACLTILPSSAGDG----SVLGSLIQVGTHMMYDI 428

Query: 394 GRKQKTFQRM 403
              +  F+ +
Sbjct: 429 NGSKLVFESL 438


>gi|356539352|ref|XP_003538162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 489

 Score =  121 bits (304), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 99/319 (31%), Positives = 150/319 (47%), Gaps = 32/319 (10%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTI------FYPSRSSSYAY 113
           L+Y  + +G PP   +  +DTGS +LWV C  C  C PQ   +      F P  SS+ + 
Sbjct: 76  LYYTKVKLGTPPRELYVQIDTGSDVLWVSCGSCNGC-PQTSGLQIQLNYFDPGSSSTSSL 134

Query: 114 VPCDSEHCR---YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
           + C    CR       A CS   ++C YT  Y  G  TS +  ++ + F +  E T+   
Sbjct: 135 ISCLDRRCRSGVQTSDASCSGRNNQCTYTFQYGDGSGTSGYYVSDLMHFASIFEGTLTTN 194

Query: 171 ---DVVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHD 216
               VVFGC     G  T       GIFG G    S++SQL+S       FS+C   L  
Sbjct: 195 SSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIAPRVFSHC---LKG 251

Query: 217 PDYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
            +     L+LG+  I++     +PL     HY + L++ISV+G+++ I P++F   ++  
Sbjct: 252 DNSGGGVLVLGE--IVEPNIVYSPLVPSQPHYNLNLQSISVNGQIVRIAPSVFATSNN-R 308

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
           G ++DSGT + +L +EAY      +   +  + +RS       CY    SS++  FP V 
Sbjct: 309 GTIVDSGTTLAYLAEEAYNPFVIAIAAVIP-QSVRSVLSRGNQCYLITTSSNVDIFPQVS 367

Query: 336 FHFRGGAKLALEKDSMFYQ 354
            +F GGA L L       Q
Sbjct: 368 LNFAGGASLVLRPQDYLMQ 386


>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
          Length = 325

 Score =  121 bits (304), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 99/344 (28%), Positives = 152/344 (44%), Gaps = 34/344 (9%)

Query: 75  FTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKH 134
           F  +DTGS + W+ C PC  C  Q  ++F P+ S++Y  +PC+S  C+       S    
Sbjct: 2   FLLIDTGSDITWIQCDPCPQCYKQQDSLFQPAGSATYKPLPCNSTMCQQLQSFSHSCLNS 61

Query: 135 RCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGL-G 193
            C Y   Y     T    + E LT ++ D   + V +  FGCG + N+   F+G  GL G
Sbjct: 62  SCNYMVSYGDKSTTRGDFALETLTLRSDDTILVSVPNFAFGCGHA-NKGL-FNGAAGLMG 119

Query: 194 IGRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFID-- 244
           +G+SS+     +S      FSYC+ S+         L  G+ A++D +   TPL  +D  
Sbjct: 120 LGKSSIGFPAQTSVAFGKVFSYCLPSVSS-TIPSGILHFGEAAMLDYDVRFTPL--VDSS 176

Query: 245 ---GHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVM 301
                Y++++  I+V   +L I+            VM+DSGT ++   + AYE LRD   
Sbjct: 177 SGPSQYFVSMTGINVGDELLPISAT----------VMVDSGTVISRFEQSAYERLRDAFT 226

Query: 302 IRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFC 361
             L G Q      P   C+  + + D    P +  HFR  A+L L    + Y       C
Sbjct: 227 QILPGLQTAVSVAPFDTCFR-VSTVDDINIPLITLHFRDDAELRLSPVHILYPVDDGVMC 285

Query: 362 MAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
            A  P+S        S++G   QQ     YDI + +      +C
Sbjct: 286 FAFAPSSSGR-----SVLGNFQQQNLRFVYDIPKSRLGISAFEC 324


>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
           vinifera]
          Length = 499

 Score =  121 bits (304), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 99/372 (26%), Positives = 169/372 (45%), Gaps = 34/372 (9%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
           L++  + +G PP   +  +DTGS +LWV C  C  C    G       F P  SS+ + +
Sbjct: 82  LYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASLI 141

Query: 115 PCDSEHCRYFPY---ARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI--HV 169
            C  + C        A CS+  ++CIYT  Y  G  TS +  ++ L F     S++    
Sbjct: 142 SCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTNSS 201

Query: 170 QDVVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPD 218
             +VFGC  S     T  +    GIFG G    S++SQ++S       FS+C+       
Sbjct: 202 ASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDGGGG 261

Query: 219 YLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVM 278
            +     + +  I+     +PL     HY + L++ISV+G+ L I+P +F    +  G +
Sbjct: 262 GILVLGEIVEEDIV----YSPLVPSQPHYNLNLQSISVNGKSLAIDPEVFAT-STNRGTI 316

Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG-FPTVRFH 337
           +DSGT + +L +EAY+     +   +  + +R        CY  +++S +KG FPTV  +
Sbjct: 317 VDSGTTLAYLAEEAYDPFVSAITEAVS-QSVRPLLSKGTQCY--LITSSVKGIFPTVSLN 373

Query: 338 FRGGAKLALEKDSMFYQPRP--DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGR 395
           F GG  + L+ +    Q     DA    +    I  +    +++G +  +     YD+  
Sbjct: 374 FAGGVSMNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQ--GITILGDLVLKDKIFVYDLAG 431

Query: 396 KQKTFQRMDCEV 407
           ++  +   DC +
Sbjct: 432 QRIGWANYDCSM 443


>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
          Length = 443

 Score =  121 bits (303), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 121/442 (27%), Positives = 181/442 (40%), Gaps = 56/442 (12%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQ--ILPSN---D 55
           ++HR    SP   P++ P+      +    AR+  +   I +      Q   LP+     
Sbjct: 22  VMHRHGPCSPLQTPDDAPSDADL--LEHDQARVDSIHRMIANETAVVGQDVSLPAERGIS 79

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD--CSPQLGTIFYPSRSSSYAY 113
           +    + V++ +G P        DTGS L WV C PC    C  Q   +F PS SS+++ 
Sbjct: 80  VGTGNYVVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYHQQDPLFAPSSSSTFSA 139

Query: 114 VPCDSEHCRYFPYAR--CSAYK--HRCIYTQLYLIGPETSVFVSTEQLTFKNT------D 163
           V C    C   P AR  CS+     RC Y  +Y     T   +  + LT   T      +
Sbjct: 140 VRCGEPEC---PRARQSCSSSPGDDRCPYEVVYGDKSRTVGHLGNDTLTLGTTPSTNASE 196

Query: 164 ESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPD 218
            ++  +   VFGCG +    F K  G+FGLG G+ SL SQ        FSYC+ S     
Sbjct: 197 NNSNKLPGFVFGCGENNTGLFGKADGLFGLGRGKVSLSSQAAGKYGEGFSYCLPS--SSS 254

Query: 219 YLHNKLILGDGAIID-EGDATPL---QFIDGHYYITLEAISVDGRMLDIN--PNIFKRDD 272
             H  L LG  A        TP+         YY+ L  I V GR + ++  P ++    
Sbjct: 255 NAHGYLSLGTPAPAPAHARFTPMLNRSNTPSFYYVKLVGIRVAGRAIKVSSRPALWP--- 311

Query: 273 SGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK-------LCYHGIMS 325
              G+++DSGT +T L   AY ALR   +       M  Y +           CY     
Sbjct: 312 --AGLIVDSGTVITRLAPRAYSALRTAFL-----SAMGKYGYKRAPRLSILDTCYDFTAH 364

Query: 326 SDLK-GFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQ 384
           ++     P V   F GGA ++++   + Y  +    C+A  P   N    +  ++G   Q
Sbjct: 365 ANATVSIPAVALVFAGGATISVDFSGVLYVAKVAQACLAFAP---NGNGRSAGILGNTQQ 421

Query: 385 QFYNVGYDIGRKQKTFQRMDCE 406
           +   V YD+GR++  F    C 
Sbjct: 422 RTVAVVYDVGRQKIGFAAKGCS 443


>gi|115489316|ref|NP_001067145.1| Os12g0583300 [Oryza sativa Japonica Group]
 gi|77556903|gb|ABA99699.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113649652|dbj|BAF30164.1| Os12g0583300 [Oryza sativa Japonica Group]
 gi|125537189|gb|EAY83677.1| hypothetical protein OsI_38901 [Oryza sativa Indica Group]
          Length = 446

 Score =  121 bits (303), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 129/425 (30%), Positives = 178/425 (41%), Gaps = 54/425 (12%)

Query: 15  NENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQ 74
           N      V+R +     RLA+L   +           P    +   +     IG PP   
Sbjct: 45  NYTAEELVRRAVAAGKQRLAFLDAAMAGGGDGGGVGAPVR-WATLQYVAEYLIGDPPQRA 103

Query: 75  FTAMDTGSSLLWVHCYPC--RDCSPQLGTIFYPSRSSSYAYVPCDSEHCR------YF-- 124
              +DTGS L+W  C  C  + C+ Q    +  S SS++A VPC +  C       +F  
Sbjct: 104 EALIDTGSDLVWTQCSTCLRKVCARQALPYYNSSASSTFAPVPCAARICAANDDIIHFCD 163

Query: 125 PYARCSAYKHRCIYTQLYLIGPETSVFVS-TEQLTFKNTDESTIHVQDVVFGCGFSTNRN 183
             A CS             +G E   F S T +L F     + I VQ  + G        
Sbjct: 164 LAAGCSVIAGYGAGVVAGTLGTEAFAFQSGTAELAFGCVTFTRI-VQGALHGA------- 215

Query: 184 FKFSGIFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNK-----LILGDGAIID-EGD 236
              SG+ GLG GR SLVSQ  ++ FSYC+       Y HN      L +G  A +   GD
Sbjct: 216 ---SGLIGLGRGRLSLVSQTGATKFSYCL-----TPYFHNNGATGHLFVGASASLGGHGD 267

Query: 237 ATPLQFIDG-----HYYITLEAISVDGRMLDINPNIFKRDDSG-----GGVMIDSGTDVT 286
               QF+ G      YY+ L  ++V    L I   +F   +       GGV+IDSG+  T
Sbjct: 268 VMTTQFVKGPKGSPFYYLPLIGLTVGETRLPIPATVFDLREVAPGLFSGGVIIDSGSPFT 327

Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPD--KLCYHGIMSSDL-KGFPTVRFHFRGGAK 343
            LV +AY+AL  E+  RL G  +      D   LC   +   D+ +  P V FHFRGGA 
Sbjct: 328 SLVHDAYDALASELAARLNGSLVAPPPDADDGALC---VARRDVGRVVPAVVFHFRGGAD 384

Query: 344 LALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
           +A+  +S ++ P   A   A    +    Y   S+IG   QQ   V YD+     +FQ  
Sbjct: 385 MAVPAES-YWAPVDKA--AACMAIASAGPYRRQSVIGNYQQQNMRVLYDLANGDFSFQPA 441

Query: 404 DCEVL 408
           DC  L
Sbjct: 442 DCSAL 446


>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 560

 Score =  121 bits (303), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 105/370 (28%), Positives = 167/370 (45%), Gaps = 32/370 (8%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           +++++ +G PP      +DTGS L W+ C PC  C  Q G  + P  SSS+  + C    
Sbjct: 195 YFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCYACFEQNGPYYDPKDSSSFKNITCHDPR 254

Query: 121 CRYF----PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFK-NTDESTIH---VQDV 172
           C+      P   C      C Y   Y     T+   + E  T    T E       V++V
Sbjct: 255 CQLVSSPDPPQPCKGETQSCPYFYWYGDSSNTTGDFALETFTVNLTTPEGKPELKIVENV 314

Query: 173 VFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILG 227
           +FGCG      F   +G+ GLG G  S  +QL S    SFSYC+   +    + +KLI G
Sbjct: 315 MFGCGHWNRGLFHGAAGLLGLGRGPLSFATQLQSLYGHSFSYCLVDRNSNSSVSSKLIFG 374

Query: 228 -DGAIIDEGDATPLQFIDGH-------YYITLEAISVDGRMLDINPNIFK-RDDSGGGVM 278
            D  ++   +     F+ G        YY+ +++I V G +L I    +      GGG +
Sbjct: 375 EDKELLSHPNLNFTSFVGGKENPVDTFYYVLIKSIMVGGEVLKIPEETWHLSAQGGGGTI 434

Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYH--GIMSSDLKGFPTVRF 336
           IDSGT +T+  + AYE +++  M +++G  +     P K CY+  G+   +L  F  +  
Sbjct: 435 IDSGTTLTYFAEPAYEIIKEAFMRKIKGFPLVETFPPLKPCYNVSGVEKMELPEFAIL-- 492

Query: 337 HFRGGAKLALEKDSMFYQPRP-DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGR 395
            F  GA      ++ F Q  P D  C+A+    +       S+IG   QQ +++ YD+ +
Sbjct: 493 -FADGAMWDFPVENYFIQIEPEDVVCLAI----LGTPRSALSIIGNYQQQNFHILYDLKK 547

Query: 396 KQKTFQRMDC 405
            +  +  M C
Sbjct: 548 SRLGYAPMKC 557


>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 489

 Score =  121 bits (303), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 121/420 (28%), Positives = 188/420 (44%), Gaps = 54/420 (12%)

Query: 19  AHRVQRGINISIARLAYLQEKIRS-------HNTYQAQILPSNDIS-NSLFY-VNISIGQ 69
             +++R + +   R+  LQ +I++        +  + QI  ++ I   +L Y V + +G 
Sbjct: 87  GKKMRRALLLDNIRVQSLQLRIKAMTSSTTEQSVSETQIPLTSGIKLETLNYIVTVELGG 146

Query: 70  PPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYAR- 128
             +     +DTGS L WV C PCR C  Q G ++ PS SSSY  V C+S  C+    A  
Sbjct: 147 KNMSLI--VDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATG 204

Query: 129 ----CSAY----KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
               C  +    K  C Y   Y  G  T   +++E +   +T      ++++VFGCG + 
Sbjct: 205 NSGPCGGFNGVVKTTCEYVVSYGDGSYTRGDLASESIVLGDT-----KLENLVFGCGRNN 259

Query: 181 NRNF-KFSGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
              F   SG+ GLG    SLVSQ     N  FSYC+ SL D       L  G+   + + 
Sbjct: 260 KGLFGGASGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGA--SGTLSFGNDFSVYKN 317

Query: 236 DA----TPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
                 TPL     +   Y + L   S+ G  L       K    G G++IDSGT +T L
Sbjct: 318 STSVFYTPLVQNPQLRSFYILNLTGASIGGVEL-------KTLSFGRGILIDSGTVITRL 370

Query: 289 VKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALE 347
               Y+A++ E + +  G      YS  D  C++     D+   PT++  F G A+L ++
Sbjct: 371 PPSIYKAVKTEFLKQFSGFPSAPGYSILDT-CFNLTSYEDIS-IPTIKMIFEGNAELEVD 428

Query: 348 KDSMFYQPRPDA--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
              +FY  +PDA   C+A+   S  N      +IG   Q+   V YD  +++      +C
Sbjct: 429 VTGVFYFVKPDASLVCLALASLSYENE---VGIIGNYQQKNQRVIYDTTQERLGIAGENC 485


>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
          Length = 530

 Score =  120 bits (302), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 95/318 (29%), Positives = 147/318 (46%), Gaps = 32/318 (10%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYVP 115
           ++  + +G PP   F  +DTGS +LWV C PC  C    G       F P  SS+ + +P
Sbjct: 117 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 176

Query: 116 CDSEHCRYF---PYARCSAYKHR-CIYTQLYLIGPETSVFVSTEQLTFKNT---DESTIH 168
           C  + C        A C    +  C YT  Y  G  TS +  ++ + F      +++   
Sbjct: 177 CSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTANS 236

Query: 169 VQDVVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNS------SFSYCIGSLHDP 217
              +VFGC  S     T  +    GIFG G  + S+VSQLNS       FS+C   L   
Sbjct: 237 SASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHC---LKGS 293

Query: 218 DYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
           D     L+LG+  I++ G   TPL     HY + LE+I V+G+ L I+ ++F   ++  G
Sbjct: 294 DNGGGILVLGE--IVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNT-QG 350

Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRF 336
            ++DSGT + +L   AY+   + +   +    +RS       C+    S D   FPTV  
Sbjct: 351 TIVDSGTTLAYLADGAYDPFVNAITAAVS-PSVRSLVSKGNQCFVTSSSVD-SSFPTVSL 408

Query: 337 HFRGGAKLALEKDSMFYQ 354
           +F GG  + ++ ++   Q
Sbjct: 409 YFMGGVAMTVKPENYLLQ 426


>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 392

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 107/382 (28%), Positives = 163/382 (42%), Gaps = 30/382 (7%)

Query: 46  YQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYP 105
           ++  ++    + +  ++V+ S+G P       +DTGS L +V C PC  C  Q G ++ P
Sbjct: 19  FRTPLVSGTTLGSGQYFVDFSLGTPEQKFHLIVDTGSDLAFVQCAPCDLCYEQDGPLYQP 78

Query: 106 SRSSSYAYVPCDSEHCRYFPY---ARCSAY------KHRCIYTQLYLIGPETSVFVSTEQ 156
           S SS++  VPCDS  C   P    A CS+       +  C Y   Y     T    + E 
Sbjct: 79  SNSSTFTPVPCDSAECLLIPAPVGAPCSSSYPESPPQGACSYEYRYGDNSSTVGVFAYET 138

Query: 157 LTFKNTDESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQ----LNSSFSYCI 211
            T        I V  V FGCG     +F    G+ GLG G  S  SQ      + F+YC+
Sbjct: 139 ATVGG-----IRVNHVAFGCGNRNQGSFVSAGGVLGLGQGALSFTSQAGYAFENKFAYCL 193

Query: 212 GSLHDPDYLHNKLILGDG--AIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPN 266
            S   P  + + LI GD   + I +   TPL         YY+ +  I   G  L I  +
Sbjct: 194 TSYLSPTSVFSSLIFGDDMMSTIHDLQFTPLVSNPLNPSVYYVQIVRICFGGETLLIPDS 253

Query: 267 IFKRDDSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMS 325
            +K D  G GG + DSGT VT+   +AY  +       +   +         LC + +  
Sbjct: 254 AWKIDSVGNGGTIFDSGTTVTYWSPQAYARIIAAFEKSVPYPRAPPSPQGLPLCVN-VSG 312

Query: 326 SDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQ 385
            D   +P+    F  GA     + + F +  P+  C+A+  +S +     F++IG + QQ
Sbjct: 313 IDHPIYPSFTIEFDQGATYRPNQGNYFIEVSPNIDCLAMLESSSD----GFNVIGNIIQQ 368

Query: 386 FYNVGYDIGRKQKTFQRMDCEV 407
            Y V YD    +  F   +C+ 
Sbjct: 369 NYLVQYDREEHRIGFAHANCDA 390


>gi|115483166|ref|NP_001065176.1| Os10g0537800 [Oryza sativa Japonica Group]
 gi|21717159|gb|AAM76352.1|AC074196_10 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433285|gb|AAP54823.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113639785|dbj|BAF27090.1| Os10g0537800 [Oryza sativa Japonica Group]
 gi|215692411|dbj|BAG87831.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 394

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 95/361 (26%), Positives = 160/361 (44%), Gaps = 31/361 (8%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           +  N +IG PP P    +D    L+W  C  C  C  Q   +F P+ S++Y   PC +  
Sbjct: 51  YVANFTIGTPPQPASAVIDLAGELVWTQCKQCGRCFEQGTPLFDPTASNTYRAEPCGTPL 110

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C   P    +   + C Y      G +T   V T+         S      + FGC  ++
Sbjct: 111 CESIPSDVRNCSGNVCAYEASTNAG-DTGGKVGTDTFAVGTAKAS------LAFGCVVAS 163

Query: 181 NRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGD- 236
           + +     SGI GLG    SLV+Q   ++FSYC+   HD    ++ L LG  A +  G  
Sbjct: 164 DIDTMGGPSGIVGLGRTPWSLVTQTGVAAFSYCLAP-HDAGK-NSALFLGSSAKLAGGGK 221

Query: 237 --ATPLQFIDG-------HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
             +TP   I G       +Y + LE +     M+ + P       SG  V++D+ + +++
Sbjct: 222 AASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPP-------SGSTVLLDTFSPISF 274

Query: 288 LVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALE 347
           LV  AY+A++  V + +    M +   P  LC+    +S     P + F FRGGA + + 
Sbjct: 275 LVDGAYQAVKKAVTVAVGAPPMATPVEPFDLCFPKSGASGAA--PDLVFTFRGGAAMTVP 332

Query: 348 KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
             +     +    C+A+  ++  N     SL+G + Q+  +  +D+ ++  +F+  DC  
Sbjct: 333 ATNYLLDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPADCTK 392

Query: 408 L 408
           L
Sbjct: 393 L 393


>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
          Length = 350

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 110/362 (30%), Positives = 159/362 (43%), Gaps = 35/362 (9%)

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYV 114
           I  + + + +  G P   Q    DTGS++ W+ C PC   C PQ   +F P+ SS+Y  +
Sbjct: 11  IGTANYVITVGFGTPKKNQTVIFDTGSNVNWIQCKPCVVSCYPQQEPLFDPTLSSTYRNI 70

Query: 115 PCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
            C S  C       CS     C+Y   Y  G  T  F++TE  T    +       + +F
Sbjct: 71  SCTSAACTGLSSRGCSG--STCVYGVTYGDGSSTVGFLATETFTLAAGNV----FNNFIF 124

Query: 175 GCGFSTNRNFKFSGIFGL-GIGRS--SLVSQLNSS----FSYCIGSLHDPD-YLH--NKL 224
           GCG   N    F+G  GL G+GRS  SL SQL +S    FSYC+ S      YL+  N L
Sbjct: 125 GCG--QNNQGLFTGAAGLIGLGRSPYSLNSQLATSLGNIFSYCLPSTSSATGYLNIGNPL 182

Query: 225 -ILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGT 283
              G  A++    A  L FID      L  ISV G  L ++  +F+      G +IDSGT
Sbjct: 183 RTPGYTAMLTNSRAPTLYFID------LIGISVGGTRLALSSTVFQSV----GTIIDSGT 232

Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAK 343
            +T L   AY ALR      +      + +     CY    ++ +  FPT++ H+  G  
Sbjct: 233 VITRLPPTAYGALRTAFRAAMTQYTRAAAASILDTCYDFSRTTTVT-FPTIKLHYT-GLD 290

Query: 344 LALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
           + +    +FY       C+A    + N+      +IG + Q+   V YD   K+  F   
Sbjct: 291 VTIPGAGVFYVISSSQVCLAF---AGNSDSTQIGIIGNVQQRTMEVTYDNALKRIGFAAG 347

Query: 404 DC 405
            C
Sbjct: 348 AC 349


>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 520

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 108/421 (25%), Positives = 176/421 (41%), Gaps = 42/421 (9%)

Query: 15  NENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQ 74
           N+N   + Q+  N  +                 A +     + +  +++++ +G PP   
Sbjct: 109 NQNTVSQKQKKKNKEVVTTPVASSVEEQAGQLVATLESGMTLGSGEYFMDVLVGSPPKHF 168

Query: 75  FTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYF----PYARCS 130
              +DTGS L W+ C PC DC  Q G  + P  S+SY  + C+   C       P   C 
Sbjct: 169 SLILDTGSDLNWIQCLPCHDCFQQNGAFYDPKASASYKNITCNDPRCNLVSPPDPPKPCK 228

Query: 131 AYKHRCIYTQLYLIGPETS----VFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF 186
           +    C Y   Y     T+    V   T  LT         +V++++FGCG   NR    
Sbjct: 229 SDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTSGGSSELYNVENMMFGCGH-WNR---- 283

Query: 187 SGIF-------GLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILG-DGAIIDE 234
            G+F       GLG G  S  SQL S    SFSYC+   +    + +KLI G D  ++  
Sbjct: 284 -GLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLSH 342

Query: 235 GD-------ATPLQFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVT 286
            +       A     +D  YY+ +++I V G +L+I    +    D  GG +IDSGT ++
Sbjct: 343 PNLNFTSFVARKENLVDTFYYVQIKSIIVAGEVLNIPEETWNISSDGAGGTIIDSGTTLS 402

Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
           +  + AYE +++++  + +G+      +P       +   D    P +   F  GA    
Sbjct: 403 YFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGIDSIQLPELGIAFADGAVWNF 462

Query: 347 EKDSMFYQPRPDAFCMAV--NPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
             ++ F     D  C+A+   P S       FS+IG   QQ +++ YD  R +  +    
Sbjct: 463 PTENSFIWLNEDLVCLAILGTPKSA------FSIIGNYQQQNFHILYDTKRSRLGYAPTK 516

Query: 405 C 405
           C
Sbjct: 517 C 517


>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
          Length = 471

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 104/380 (27%), Positives = 171/380 (45%), Gaps = 33/380 (8%)

Query: 44  NTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTI 102
           N+    + P   I +  +YV + +G PP      +DTGSSL W+ C PC   C  Q   +
Sbjct: 108 NSASIPLNPGLSIGSGNYYVKLGLGTPPKYYAMILDTGSSLSWLQCQPCAVYCHAQADPL 167

Query: 103 FYPSRSSSYAYVPCDSEHCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQL 157
           + PS S +Y  + C S  C     A      C    + C+YT  Y     +  ++S + L
Sbjct: 168 YDPSVSKTYKKLSCASVECSRLKAATLNDPLCETDSNACLYTASYGDTSFSIGYLSQDLL 227

Query: 158 TFKNTDESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIG 212
           T  ++      +    +GCG      F + +GI GL   + S+++QL++    +FSYC+ 
Sbjct: 228 TLTSSQT----LPQFTYGCGQDNQGLFGRAAGIIGLARDKLSMLAQLSTKYGHAFSYCLP 283

Query: 213 SLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH----YYITLEAISVDGRMLDINPNIF 268
           + +        L +G  +       TP+   D      Y++ L AI+V GR LD+   ++
Sbjct: 284 TANSGSSGGGFLSIGSISPTSY-KFTPM-LTDSKNPSLYFLRLTAITVSGRPLDLAAAMY 341

Query: 269 KRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMR--SYSWPDKLCYHGIMSS 326
           +        +IDSGT +T L    Y ALR   +  +  +  +  +YS  D  C+ G + S
Sbjct: 342 RVP-----TLIDSGTVITRLPMSMYAALRQAFVKIMSTKYAKAPAYSILDT-CFKGSLKS 395

Query: 327 DLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQF 386
            +   P ++  F+GGA L L   S+  +      C+A   +S  N+    ++IG   QQ 
Sbjct: 396 -ISAVPEIKMIFQGGADLTLRAPSILIEADKGITCLAFAGSSGTNQ---IAIIGNRQQQT 451

Query: 387 YNVGYDIGRKQKTFQRMDCE 406
           YN+ YD+   +  F    C 
Sbjct: 452 YNIAYDVSTSRIGFAPGSCH 471


>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
 gi|223948487|gb|ACN28327.1| unknown [Zea mays]
          Length = 434

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 107/364 (29%), Positives = 157/364 (43%), Gaps = 44/364 (12%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSYAYVPCDSE 119
           + V + +G P        DTGS   WV C PC   C  Q   +F P++S++YA + C S 
Sbjct: 96  YVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKEPLFDPTKSATYANISCSSS 155

Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
           +C     + CS     C+Y   Y  G  T  F + + LT          +++  FGCG  
Sbjct: 156 YCSDLYVSGCSG--GHCLYGIQYGDGSYTIGFYAQDTLTLAYDT-----IKNFRFGCG-E 207

Query: 180 TNRNF--KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAIID 233
            NR    + +G+ GLG G++SL  Q        F+YC+ +          L LG GA   
Sbjct: 208 KNRGLFGRAAGLLGLGRGKTSLPVQAYDKYGGVFAYCLPATSAGTGF---LDLGPGAPAA 264

Query: 234 EGDATPLQFIDG--HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKE 291
               TP+    G   YY+ +  I V G +L I  ++F    S  G ++DSGT +T L   
Sbjct: 265 NARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVF----STAGTLVDSGTVITRLPPS 320

Query: 292 AYEALRDEVMIRLEG---EQMRSYSWPDKLCYHGIMSSDLKG-------FPTVRFHFRGG 341
           AY  LR      ++G       ++S  D  CY      DL G        P V   F+GG
Sbjct: 321 AYAPLRSAFSKAMQGLGYSAAPAFSILDT-CY------DLTGHKGGSIALPAVSLVFQGG 373

Query: 342 AKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQ 401
           A L ++   + Y       C+A  P   N    + +++G   Q+ + V YDIG+K   F 
Sbjct: 374 ACLDVDASGILYVADVSQACLAFAP---NADDTDVAIVGNTQQKTHGVLYDIGKKIVGFA 430

Query: 402 RMDC 405
              C
Sbjct: 431 PGAC 434


>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 477

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 108/385 (28%), Positives = 174/385 (45%), Gaps = 38/385 (9%)

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGT-IFYPSRSSSYAYV 114
           I  + + V++S+G PP P    +DTGS L+W  C PC +C  Q    +  P+ SS++A V
Sbjct: 89  IVTNEYLVHLSVGTPPRPVALTLDTGSDLVWTQCAPCLNCFDQGAIPVLDPAASSTHAAV 148

Query: 115 PCDSEHCRYFPYARC----SAYKHR-CIYTQLYLIGPETSVFVSTEQLTF---KNTDEST 166
            CD+  CR  P+  C    S++  R C+Y   Y     T   +++++ TF    N D   
Sbjct: 149 RCDAPVCRALPFTSCGRGGSSWGERSCVYVYHYGDKSITVGKLASDRFTFGPGDNADGGG 208

Query: 167 IHVQDVVFGCGFSTNRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNK 223
           +  + + FGCG      F+   +GI G G GR SL SQL  +SFSYC  S+ +       
Sbjct: 209 VSERRLTFGCGHFNKGIFQANETGIAGFGRGRWSLPSQLGVTSFSYCFTSMFESTSSLVT 268

Query: 224 LILGDGAIIDEG--DATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSGGGVM 278
           L +    +   G   +TPL         Y+++L+AI+V    + I P   +R      + 
Sbjct: 269 LGVAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKAITVGATRIPI-PERRQRLREASAI- 326

Query: 279 IDSGTDVTWLVKEAYEALRDEVMIR-------LEGEQMR-SYSWPDKLCYHGIMSSDLKG 330
           IDSG  +T L ++ YEA++ E + +       +EG  +   ++ P             +G
Sbjct: 327 IDSGASITTLPEDVYEAVKAEFVAQVGLPVSAVEGSALDLCFALPSAAAPKSAFGWRWRG 386

Query: 331 --------FPTVRFHFRGGAKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGM 381
                    P + FH  GGA   L +++ +F        C+ ++ A+         +IG 
Sbjct: 387 RGRAMPVRVPRLVFHLGGGADWELPRENYVFEDYGARVMCLVLDAATGGGDQT--VVIGN 444

Query: 382 MAQQFYNVGYDIGRKQKTFQRMDCE 406
             QQ  +V YD+     +F    CE
Sbjct: 445 YQQQNTHVVYDLENDVLSFAPARCE 469


>gi|242086034|ref|XP_002443442.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
 gi|241944135|gb|EES17280.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
          Length = 443

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 115/413 (27%), Positives = 174/413 (42%), Gaps = 34/413 (8%)

Query: 15  NENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQ 74
           N     RV+R I +S       Q  + S       +      +   +     +G PP   
Sbjct: 46  NYTAPERVRRAIALS------RQINLASTRAEGGGVSAPVHWATRQYIAEYMVGDPPQRA 99

Query: 75  FTAMDTGSSLLWVHCYPC--RDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAY 132
              +DTGSSL+W  C  C  + C  Q    F  S S S+A VPC  + C    Y    A 
Sbjct: 100 EALIDTGSSLIWTQCTACLRKVCVRQDLPYFNASSSGSFAPVPCQDKACAG-NYLHFCAL 158

Query: 133 KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGL 192
              C +   Y  G     F+ T+  TF+ +  +T+    V F    + +     SG+ GL
Sbjct: 159 DGTCTFRVTYGAGGIIG-FLGTDAFTFQ-SGGATLAFGCVSFTRFAAPDVLHGASGLIGL 216

Query: 193 GIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT--PLQFIDG---- 245
           G GR SL SQ  +  FSYC+      +   + L +G  A +  G      + F++     
Sbjct: 217 GRGRLSLASQTGAKRFSYCLTPYFHNNGASSHLFVGAAASLSGGGGAVMSMAFVESPKDY 276

Query: 246 ----HYYITLEAISVDGRMLDINPNIF--KRDDSG---GGVMIDSGTDVTWLVKEAYEAL 296
                YY+ L  I+V    L I    F  +  + G   GGV+IDSG+  T LV++AYE L
Sbjct: 277 PYSTFYYLPLVGITVGETKLAIPSTAFDLQEVEEGFWEGGVIIDSGSPFTSLVEDAYEPL 336

Query: 297 RDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDL-KGFPTVRFHFRGGAKLALEKDSMFYQP 355
             E+  +L G  +      D      +   DL +  PT+  HF GGA +AL  ++ +   
Sbjct: 337 MGELARQLNGSLVPPPGEDDGGMALCVARGDLDRVVPTLVLHFSGGADMALPPENYWAPL 396

Query: 356 RPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
                CMA+    +       S+IG   QQ  ++ +D+G  + +FQ  DC  +
Sbjct: 397 EKSTACMAIVRGYLQ------SIIGNFQQQNMHILFDVGGGRLSFQNADCSTI 443


>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
 gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
          Length = 499

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 109/365 (29%), Positives = 161/365 (44%), Gaps = 46/365 (12%)

Query: 61  FYVNISIGQPPVPQFTAM-DTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSYAYVPCDS 118
           + V + +G P   +FT + DTGS   WV C PC   C  Q   +F P++S++YA + C S
Sbjct: 161 YVVPVRLGTP-AERFTVVFDTGSDTTWVQCQPCVAYCYRQKEPLFDPTKSATYANISCSS 219

Query: 119 EHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
            +C     + CS     C+Y   Y  G  T  F + + LT          +++  FGCG 
Sbjct: 220 SYCSDLYVSGCSG--GHCLYGIQYGDGSYTIGFYAQDTLTLAYDT-----IKNFRFGCG- 271

Query: 179 STNRNF--KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAII 232
             NR    + +G+ GLG G++SL  Q        F+YC+ +          L LG GA  
Sbjct: 272 EKNRGLFGRAAGLLGLGRGKTSLPVQAYDKYGGVFAYCLPATSAGTGF---LDLGPGAPA 328

Query: 233 DEGDATPLQFIDG--HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVK 290
                TP+    G   YY+ +  I V G +L I  ++F    S  G ++DSGT +T L  
Sbjct: 329 ANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVF----STAGTLVDSGTVITRLPP 384

Query: 291 EAYEALRDEVMIRLEG---EQMRSYSWPDKLCYHGIMSSDLKG-------FPTVRFHFRG 340
            AY  LR      ++G       ++S  D  CY      DL G        P V   F+G
Sbjct: 385 SAYAPLRSAFSKAMQGLGYSAAPAFSILDT-CY------DLTGHKGGSIALPAVSLVFQG 437

Query: 341 GAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
           GA L ++   + Y       C+A  P   N    + +++G   Q+ + V YDIG+K   F
Sbjct: 438 GACLDVDASGILYVADVSQACLAFAP---NADDTDVAIVGNTQQKTHGVLYDIGKKIVGF 494

Query: 401 QRMDC 405
               C
Sbjct: 495 APGAC 499


>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
          Length = 354

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 96/319 (30%), Positives = 148/319 (46%), Gaps = 33/319 (10%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTI------FYPSRSSSYAY 113
           L+Y  + +G PPV     +DTGS +LWV C  C  C PQ   +      F P  SS+ + 
Sbjct: 24  LYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGC-PQTSGLQIQLNFFDPGSSSTSSM 82

Query: 114 VPCDSEHCR---YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
           + C  + C        A CS+  ++C YT  Y  G  TS +  ++ +      E ++   
Sbjct: 83  IACSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTTN 142

Query: 171 D---VVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHD 216
               VVFGC     G  T  +    GIFG G    S++SQL+S       FS+C   L  
Sbjct: 143 STAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHC---LKG 199

Query: 217 PDYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
                  L+LG+  I++     T L     HY + L++I+V+G+ L I+ ++F   +S  
Sbjct: 200 DSSGGGILVLGE--IVEPNIVYTSLVPAQPHYNLNLQSIAVNGQTLQIDSSVFATSNS-R 256

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
           G ++DSGT + +L +EAY+     +   +      + S  ++ CY  I SS  + FP V 
Sbjct: 257 GTIVDSGTTLAYLAEEAYDPFVSAITASIPQSVHTAVSRGNQ-CYL-ITSSVTEVFPQVS 314

Query: 336 FHFRGGAKLALEKDSMFYQ 354
            +F GGA + L       Q
Sbjct: 315 LNFAGGASMILRPQDYLIQ 333


>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
 gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
          Length = 367

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 104/370 (28%), Positives = 162/370 (43%), Gaps = 31/370 (8%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + + I +G PP      +DTGS L+W+ C PC  C  Q   I+ PS SS++A   C +  
Sbjct: 4   YTMEIELGSPPKKFNAIVDTGSDLVWIQCKPCSQCYSQSDPIYDPSASSTFAKTSCSTSS 63

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C+  P + CS+    CIY   Y     T    + E LT +++  S+    +  FGCG   
Sbjct: 64  CQSLPASGCSSSAKTCIYGYQYGDSSSTQGDFALETLTLRSSGGSSKAFPNFQFGCGRLN 123

Query: 181 NRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
           + +F   +GI GLG G+ SL +QL    N+ FSYC+    D     + LI G  A    G
Sbjct: 124 SGSFGGAAGIVGLGQGKISLSTQLGSAINNKFSYCLVDFDDDSSKTSPLIFGSSASTGSG 183

Query: 236 D-ATPLQFIDG---HYYITLEAISVDGRMLDINP--------------NIFKRDDSGGGV 277
             +TP+    G   +Y++ LE ISV G+ L +                 +   + + GG 
Sbjct: 184 AISTPIIPNSGRSTYYFVGLEGISVGGKQLSLATRAIDFLSVRSKKKLRVRALEVNSGGT 243

Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFH 337
           + DSGT +T L    Y  ++      +    + + S    LCY    S + K FP +   
Sbjct: 244 IFDSGTTLTLLDDAVYSKVKSAFASSVSLPTVDASSSGFDLCYDVSKSKNFK-FPALTLA 302

Query: 338 FRGGAKLALEKDSMF--YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGR 395
           F+ G K +  + + F          C+A+  +      +  +L+    QQ Y+V YD G 
Sbjct: 303 FK-GTKFSPPQKNYFVIVDTAETVACLAMGGSGSLGLGIIGNLM----QQNYHVVYDRGT 357

Query: 396 KQKTFQRMDC 405
              +     C
Sbjct: 358 STISMSPAQC 367


>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 494

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 112/354 (31%), Positives = 153/354 (43%), Gaps = 40/354 (11%)

Query: 69  QPPVPQFTAMDTGSSLLWVHCYPC--RDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPY 126
           Q  V Q   +DT S + WV C PC    C  Q   ++ P++SS++A +PC S  C+    
Sbjct: 164 QDAVSQTVVVDTSSDIPWVQCLPCPIPQCHLQKDPLYDPAKSSTFAPIPCGSPACKELGS 223

Query: 127 A---RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRN 183
           +    CS     C Y   Y  G  T+    T+ LT       TI V+D  FGC  +   +
Sbjct: 224 SYGNGCSPTTDECKYIVNYGDGKATTGTYVTDTLTM----SPTIVVKDFRFGCSHAVRGS 279

Query: 184 F--KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDA 237
           F  + +GI  LG GR SL+ Q      ++FSYCI       +L    + G      +   
Sbjct: 280 FSNQNAGILALGGGRGSLLEQTADAYGNAFSYCIPKPSSAGFLS---LGGPVEASLKFSY 336

Query: 238 TPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYE 294
           TPL   +     Y + LEAI V G+ L + P  F       G ++DSG  VT L  + Y 
Sbjct: 337 TPLIKNKHAPTFYIVHLEAIIVAGKQLAVPPTAFAT-----GAVMDSGAVVTQLPPQVYA 391

Query: 295 ALRDEVMIRLEGEQMRSYSWPDK---LCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSM 351
           ALR     R         + P +    CY      D+K  P V   F GGA L LE  S+
Sbjct: 392 ALR--AAFRSAMAAYGPLAAPVRNLDTCYDFTRFPDVK-VPKVSLVFAGGATLDLEPASI 448

Query: 352 FYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
                    C+A   A+     V F  IG + QQ Y V YD+G  +  F+R  C
Sbjct: 449 ILD-----GCLAFA-ATPGEESVGF--IGNVQQQTYEVLYDVGGGKVGFRRGAC 494


>gi|115483168|ref|NP_001065177.1| Os10g0538200 [Oryza sativa Japonica Group]
 gi|21717168|gb|AAM76361.1|AC074196_19 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433289|gb|AAP54827.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113639786|dbj|BAF27091.1| Os10g0538200 [Oryza sativa Japonica Group]
 gi|215686408|dbj|BAG87693.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 394

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 95/361 (26%), Positives = 159/361 (44%), Gaps = 31/361 (8%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           +  N +IG PP P    +D    L+W  C  C  C  Q   +F P+ S++Y   PC +  
Sbjct: 51  YVANFTIGTPPQPASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPCGTPL 110

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C   P    +   + C Y Q      +T   V T+         S      + FGC  ++
Sbjct: 111 CESIPSDSRNCSGNVCAY-QASTNAGDTGGKVGTDTFAVGTAKAS------LAFGCVVAS 163

Query: 181 NRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGD- 236
           + +     SGI GLG    SLV+Q   ++FSYC+   HD    ++ L LG  A +  G  
Sbjct: 164 DIDTMGGPSGIVGLGRTPWSLVTQTGVAAFSYCLAP-HDAGR-NSALFLGSSAKLAGGGK 221

Query: 237 --ATPLQFIDG-------HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
             +TP   I G       +Y + LE +     M+ + P       SG  V++D+ + +++
Sbjct: 222 AASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPP-------SGSTVLLDTFSPISF 274

Query: 288 LVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALE 347
           LV  AY+A++  V   +    M +   P  LC+    +S     P + F FRGGA + + 
Sbjct: 275 LVDGAYQAVKKAVTAAVGAPPMATPVEPFDLCFPKSGASGAA--PDLVFTFRGGAAMTVP 332

Query: 348 KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
             +     +    C+A+  ++  N     SL+G + Q+  +  +D+ ++  +F+  DC  
Sbjct: 333 ATNYLLDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPADCTK 392

Query: 408 L 408
           L
Sbjct: 393 L 393


>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 460

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 109/387 (28%), Positives = 171/387 (44%), Gaps = 53/387 (13%)

Query: 44  NTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTI 102
           N+    + P   I +  +Y+ + +G PP      +DTGSSL W+ C PC   C  Q+  +
Sbjct: 103 NSANIPLNPGLSIGSGNYYLKLGLGSPPKYYTMILDTGSSLSWLQCKPCVVYCHSQVDPL 162

Query: 103 FYPSRSSSYAYVPCDSEHCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQL 157
           F PS S++Y  + C S  C     A      C+A    C+YT  Y     +  ++S + L
Sbjct: 163 FEPSASNTYRPLYCSSSECSLLKAATLNDPLCTA-SGVCVYTASYGDASYSMGYLSRDLL 221

Query: 158 TFKNTDESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIG 212
           T   +      +    +GCG      F K +GI GL   + S+++QL+     +FSYC+ 
Sbjct: 222 TLTPSQT----LPSFTYGCGQDNEGLFGKAAGIVGLARDKLSMLAQLSPKYGYAFSYCLP 277

Query: 213 SLHDPDYLHNKLILGDGAIIDEGDATPLQFI----------DGHYYITLEAISVDGRMLD 262
           +               G  +  G  +P  +              Y++ L AI+V GR + 
Sbjct: 278 TSTS----------SGGGFLSIGKISPSSYKFTPMIRNSQNPSLYFLRLAAITVAGRPVG 327

Query: 263 INPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLC 319
           +    ++        +IDSGT VT L    Y ALR+   ++M R   EQ  +YS  D  C
Sbjct: 328 VAAAGYQVP-----TIIDSGTVVTRLPISIYAALREAFVKIMSR-RYEQAPAYSILDT-C 380

Query: 320 YHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLI 379
           + G + S + G P +R  F+GGA L+L   ++  +      C+A   AS N      ++I
Sbjct: 381 FKGSLKS-MSGAPEIRMIFQGGADLSLRAPNILIEADKGIACLAF--ASSN----QIAII 433

Query: 380 GMMAQQFYNVGYDIGRKQKTFQRMDCE 406
           G   QQ YN+ YD+   +  F    C 
Sbjct: 434 GNHQQQTYNIAYDVSASKIGFAPGGCR 460


>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
 gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
          Length = 441

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 94/367 (25%), Positives = 166/367 (45%), Gaps = 31/367 (8%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSE 119
           +  V++ IG PP  Q   +DTGS L W+ C+      P   ++F PS SSS++ +PC+  
Sbjct: 81  ILLVSLPIGTPPQTQQMILDTGSQLSWIQCHKKVPRKPPPSSVFDPSLSSSFSVLPCNHP 140

Query: 120 HCRY----FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
            C+     F           C Y+  Y  G      +  E++TF  +  +      ++ G
Sbjct: 141 LCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKITFSRSQSTP----PLILG 196

Query: 176 CGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLH-DPDYL-HNKLILGDGA-- 230
           C   ++      GI G+ +GR S  SQ   + FSYC+ +    P +       LG+    
Sbjct: 197 CAEESS---DAKGILGMNLGRLSFASQAKLTKFSYCVPTRQVRPGFTPTGSFYLGENPNS 253

Query: 231 -------IIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMIDS 281
                  ++    +  +  +D   Y + ++ I +  + L+I  + F+ D SG G  MIDS
Sbjct: 254 GGFRYINLLTFSQSQRMPNLDPLAYTVAMQGIRIGNQKLNIPISAFRPDPSGAGQTMIDS 313

Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMRS---YSWPDKLCYHGIMSSDLKGFPTVRFHF 338
           G++ T+LV EAY  +R+EV +RL G +++    Y     +C++G      +    + F F
Sbjct: 314 GSEFTYLVDEAYNKVREEV-VRLVGARLKKGYVYGGVSDMCFNGNAIEIGRLIGNMVFEF 372

Query: 339 RGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQK 398
             G ++ +EK+ +         C+ +  + +     N  +IG   QQ   V +D+  ++ 
Sbjct: 373 DKGVEIVVEKERVLADVGGGVHCVGIGRSEMLGAASN--IIGNFHQQNIWVEFDLANRRV 430

Query: 399 TFQRMDC 405
            F + DC
Sbjct: 431 GFGKADC 437


>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
 gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
          Length = 481

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 118/392 (30%), Positives = 170/392 (43%), Gaps = 41/392 (10%)

Query: 39  KIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ 98
           + R    + A +L      +  ++  + +G P       +DTGS ++W+ C PCR C  Q
Sbjct: 106 RPRRRGGFAAPLLSGLPQGSGEYFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRHCYAQ 165

Query: 99  LGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLT 158
            G +F P RS SYA V C +  CR    A C   ++ C+Y   Y  G  T+   ++E LT
Sbjct: 166 SGRVFDPRRSRSYAAVDCVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLT 225

Query: 159 FKNTDESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCI-- 211
           F         VQ V  GCG      F   SG+ GLG GR S  SQ+      SFSYC+  
Sbjct: 226 FAR----GARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLVD 281

Query: 212 -GSLHDPDYLHNKLIL---GDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRM---- 260
             S   P    +  +    G  A       TP+     +   YY+ L   SV G      
Sbjct: 282 RTSSVRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGV 341

Query: 261 ----LDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMR--SYSW 314
               L +NP   +     GGV++DSGT VT L +  YEA+RD       G ++    +S 
Sbjct: 342 SQSDLRLNPTTGR-----GGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSL 396

Query: 315 PDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS-MFYQPRPDAFCMAVNPASINNRY 373
            D  CY+ +    +   PTV  H  GGA +AL  ++ +        FC A+  A  +   
Sbjct: 397 FDT-CYN-LSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAM--AGTDG-- 450

Query: 374 VNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
              S+IG + QQ + V +D   ++  F    C
Sbjct: 451 -GVSIIGNIQQQGFRVVFDGDAQRVGFVPKSC 481


>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
          Length = 469

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 116/365 (31%), Positives = 159/365 (43%), Gaps = 44/365 (12%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD--CSPQLGTIFYPSRSSSYAYVPCDS 118
           +   + +G P VPQ   +DTGSSL WV C PC    C PQ   +F P+ SSSY+ VPCDS
Sbjct: 129 YVATVGLGTPAVPQTLILDTGSSLTWVQCKPCNSSQCYPQRLPLFDPNTSSSYSPVPCDS 188

Query: 119 EHCRYFPYA----RCSA-YKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
           + CR          C++     C Y   Y  G   +   ST+ LT          V+   
Sbjct: 189 QECRALAAGIDGDGCTSDGDWGCAYEIHYGSGATPAGEYSTDALTLG----PGAIVKRFH 244

Query: 174 FGCGFSTNRNFKF---SGIFGLGIGRSSLVSQLNSS-----FSYCIGSLHDPDYLHNKLI 225
           FGCG    R  KF    G+ GLG    SL  Q ++      FS+C+     P    +   
Sbjct: 245 FGCGHHQQRG-KFDMADGVLGLGRLPQSLAWQASARRGGGVFSHCL-----PPTGVSTGF 298

Query: 226 LGDGAIIDEGD--ATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
           L  GA  D      TPL  +D     Y +   AISV G++LDI P +F+      GV+ D
Sbjct: 299 LALGAPHDTSAFVFTPLLTMDDQPWFYQLMPTAISVAGQLLDIPPAVFRE-----GVITD 353

Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRG 340
           SGT ++ L + AY ALR      +    +         C++     D    PTV   FRG
Sbjct: 354 SGTVLSALQETAYTALRTAFRSAMAEYPLAPPVGHLDTCFN-FTGYDNVTVPTVSLTFRG 412

Query: 341 GAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
           GA + L+  S        AF       S  + Y    LIG ++Q+   V YD+  ++  F
Sbjct: 413 GATVHLDASSGVLMDGCLAFW------SSGDEYTG--LIGSVSQRTIEVLYDMPGRKVGF 464

Query: 401 QRMDC 405
           +   C
Sbjct: 465 RTGAC 469


>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
          Length = 475

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 118/392 (30%), Positives = 170/392 (43%), Gaps = 41/392 (10%)

Query: 39  KIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ 98
           + R    + A +L      +  ++  + +G P       +DTGS ++W+ C PCR C  Q
Sbjct: 100 RPRRRGGFAAPLLSGLPQGSGEYFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRHCYAQ 159

Query: 99  LGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLT 158
            G +F P RS SYA V C +  CR    A C   ++ C+Y   Y  G  T+   ++E LT
Sbjct: 160 SGRVFDPRRSRSYAAVDCVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLT 219

Query: 159 FKNTDESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCI-- 211
           F         VQ V  GCG      F   SG+ GLG GR S  SQ+      SFSYC+  
Sbjct: 220 FAR----GARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLVD 275

Query: 212 -GSLHDPDYLHNKLIL---GDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRM---- 260
             S   P    +  +    G  A       TP+     +   YY+ L   SV G      
Sbjct: 276 RTSSVRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGV 335

Query: 261 ----LDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMR--SYSW 314
               L +NP   +     GGV++DSGT VT L +  YEA+RD       G ++    +S 
Sbjct: 336 SQSDLRLNPTTGR-----GGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSL 390

Query: 315 PDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS-MFYQPRPDAFCMAVNPASINNRY 373
            D  CY+ +    +   PTV  H  GGA +AL  ++ +        FC A+  A  +   
Sbjct: 391 FDT-CYN-LSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAM--AGTDG-- 444

Query: 374 VNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
              S+IG + QQ + V +D   ++  F    C
Sbjct: 445 -GVSIIGNIQQQGFRVVFDGDAQRVGFVPKSC 475


>gi|356537015|ref|XP_003537027.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 476

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 103/358 (28%), Positives = 166/358 (46%), Gaps = 41/358 (11%)

Query: 78  MDTGSSLLWVHCYPCRDC--SPQLG---TIFYPSRSSSYAYVPCDSEHCR---YFPYARC 129
           +DTGS +LWV+C  C +C  S QLG     F    SS+ A +PC    C        A C
Sbjct: 85  IDTGSDILWVNCNTCSNCPQSSQLGIELNFFDTVGSSTAALIPCSDLICTSGVQGAAAEC 144

Query: 130 SAYKHRCIYTQLYLIGPETSVFVSTEQLTFK---NTDESTIHVQDVVFGCGFS-----TN 181
           S   ++C YT  Y  G  TS +  ++ + F        +      +VFGC  S     T 
Sbjct: 145 SPRVNQCSYTFQYGDGSGTSGYYVSDAMYFNLIMGQPPAVNSTATIVFGCSISQSGDLTK 204

Query: 182 RNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
            +    GIFG G G  S+VSQL+S       FS+C   L         L+LG+  I++  
Sbjct: 205 TDKAVDGIFGFGPGPLSVVSQLSSQGITPKVFSHC---LKGDGNGGGILVLGE--ILEPS 259

Query: 236 DA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYE 294
              +PL     HY + L++I+V+G+ L INP +F   ++ GG ++D GT + +L++EAY+
Sbjct: 260 IVYSPLVPSQPHYNLNLQSIAVNGQPLPINPAVFSISNNRGGTIVDCGTTLAYLIQEAYD 319

Query: 295 ALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG-FPTVRFHFRGGAKLALEKDSMF- 352
            L   +   +  +  R  +     CY  ++S+ +   FP V  +F GGA + L+ +    
Sbjct: 320 PLVTAINTAVS-QSARQTNSKGNQCY--LVSTSIGDIFPLVSLNFEGGASMVLKPEQYLM 376

Query: 353 ---YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
              Y    + +C+              S++G +  +   V YDI +++  +   DC +
Sbjct: 377 HNGYLDGAEMWCVG-----FQKLQEGASILGDLVLKDKIVVYDIAQQRIGWANYDCSL 429


>gi|125532796|gb|EAY79361.1| hypothetical protein OsI_34489 [Oryza sativa Indica Group]
          Length = 405

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 97/373 (26%), Positives = 159/373 (42%), Gaps = 42/373 (11%)

Query: 57  SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPC 116
           S  L+  N +IG PP P    +D    L+W  C PC+ C  Q   +F P++SS++  +PC
Sbjct: 53  SQGLYVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPC 112

Query: 117 DSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
            S  C   P +  +     CIY      G +T     T+        E+      + FGC
Sbjct: 113 GSHLCESIPESSRNCTSDVCIYEAPTKAG-DTGGMAGTDTFAIGAAKET------LGFGC 165

Query: 177 GFSTNRNFKF----SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAI 231
              T++  K     SGI GLG    SLV+Q+N ++FSYC+            L LG  A 
Sbjct: 166 VVMTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVTAFSYCLAGKS-----SGALFLGATAK 220

Query: 232 -IDEGDATPLQFI------------DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVM 278
            +  G  +   F+            + +Y + L  I   G  L           SG  V+
Sbjct: 221 QLAGGKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKAGGAPLQ------AASSSGSTVL 274

Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHF 338
           +D+ +  ++L   AY+AL+  +   +  + + S   P  LC+   ++ D    P + F F
Sbjct: 275 LDTVSRASYLADGAYKALKKALTAAVGVQPVASPPKPYDLCFSKAVAGDA---PELVFTF 331

Query: 339 RGGAKLALEKDSMFYQPRPDAFCMAV-NPASIN--NRYVNFSLIGMMAQQFYNVGYDIGR 395
            GGA L +   +          C+ + + AS+N        S++G + Q+  +V +D+  
Sbjct: 332 DGGAALTVPPANYLLASGNGTVCLTIGSSASLNLTGELEGASILGSLQQENVHVLFDLKE 391

Query: 396 KQKTFQRMDCEVL 408
           +  +F+  DC  L
Sbjct: 392 ETLSFKPADCSSL 404


>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
 gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
          Length = 452

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 105/368 (28%), Positives = 169/368 (45%), Gaps = 31/368 (8%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSYAYVPCDSE 119
           +++ +S+G PP+     +DTGS L W  C PC   C  Q   ++ P+RSS+++ +PC S 
Sbjct: 96  YHMILSVGTPPLAFPAIIDTGSDLTWTQCAPCTTACFAQPTPLYDPARSSTFSKLPCASP 155

Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTF---KNTDESTIHVQDVVFGC 176
            C+  P A  +     C+Y   Y +G  T+ +++ + L         +++     V FGC
Sbjct: 156 LCQALPSAFRACNATGCVYDYRYAVG-FTAGYLAADTLAIGDGDGDGDASSSFAGVAFGC 214

Query: 177 GFSTNRNF-KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAII-- 232
             +   +    SGI GLG    SL+SQ+    FSYC+ S  D D   + ++ G  A +  
Sbjct: 215 STANGGDMDGASGIVGLGRSALSLLSQIGVGRFSYCLRS--DADAGASPILFGALANVTG 272

Query: 233 DEGDATPL-------QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTD 284
           D+  +T L       +    +YY+ L  I+V    L +  + F    +G GGV++DSGT 
Sbjct: 273 DKVQSTALLRNPVAARRRAPYYYVNLTGIAVGSTDLPVTSSTFGFTAAGAGGVIVDSGTT 332

Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMR--SYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGA 342
            T+L +  Y  LR   + +  G   R     +   LC+    ++D    P + F F GGA
Sbjct: 333 FTYLAEAGYTMLRQAFLSQTAGLLTRVSGAQFDFDLCFEA-GAADTP-VPRLVFRFAGGA 390

Query: 343 KLALEKDSMF--YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
           + A+ + S F          C+ V P          S+IG + Q   +V YD+     +F
Sbjct: 391 EYAVPRQSYFDAVDEGGRVACLLVLPTR------GVSVIGNVMQMDLHVLYDLDGATFSF 444

Query: 401 QRMDCEVL 408
              DC  L
Sbjct: 445 APADCASL 452


>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 399

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 97/374 (25%), Positives = 171/374 (45%), Gaps = 39/374 (10%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG------TIFYPSRSSSYAY 113
           L+Y  I +G PPV  +  +DTGS + W++C PC  C  +        T + PSRSS+   
Sbjct: 36  LYYTKIYLGTPPVGYYVQVDTGSDVTWLNCAPCTSCVTETQLPSIKLTTYDPSRSSTDGA 95

Query: 114 VPCDSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI--H 168
           + C   +C          C++  + C Y+  Y  G  T  +   + +TF+    +T    
Sbjct: 96  LSCRDSNCGAALGSNEVSCTSAGY-CAYSTTYGDGSSTQGYFIQDVMTFQEIHNNTQVNG 154

Query: 169 VQDVVFGCGFSTNRNFKFS-----GIFGLGIGRSSLVSQLNS------SFSYCIGSLHDP 217
              V FGCG + + N   S     G+ G G    S+ SQL S       F++C   L   
Sbjct: 155 TASVYFGCGTTQSGNLLMSSRALDGLIGFGQAAVSIPSQLASMGKVGNRFAHC---LQGD 211

Query: 218 DYLHNKLILGDGAIIDEGDATPLQFID-GHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
           +     +++G    + E + +    +   HY + ++ I+V+GR +    +      S GG
Sbjct: 212 NQGGGTIVIGS---VSEPNISYTPIVSRNHYAVGMQNIAVNGRNVTTPASFDTTSTSAGG 268

Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRF 336
           V++DSGT + +LV  AY    + V    E     S+S   +L +  + +     FPTV+ 
Sbjct: 269 VIMDSGTTLAYLVDPAYTQFVNAVST-FESSMFSSHSQCLQLAWCSLQAD----FPTVKL 323

Query: 337 HFRGGAKLALE-KDSMFYQPRPD---AFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
            F  GA + L  ++ ++ QP  +   A+CM    ++    Y+++S++G +  + + V YD
Sbjct: 324 FFDAGAVMNLTPRNYLYSQPLQNGQAAYCMGWQKSTTKAGYLSYSILGDIVLKDHLVVYD 383

Query: 393 IGRKQKTFQRMDCE 406
              +   ++  DC+
Sbjct: 384 NDNRVVGWKSFDCK 397


>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
          Length = 475

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 112/397 (28%), Positives = 163/397 (41%), Gaps = 51/397 (12%)

Query: 39  KIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ 98
           + R    + A +L      +  ++  + +G P       +DTGS ++W+ C PCR C  Q
Sbjct: 100 RPRRRGGFAAPLLSGLPQGSGEYFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRHCYAQ 159

Query: 99  LGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLT 158
            G +F P RS SYA V C +  CR    A C   ++ C+Y   Y  G  T+   ++E LT
Sbjct: 160 SGRVFDPRRSRSYAAVDCVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLT 219

Query: 159 FKNTDESTIHVQDVVFGCGFSTNRNFKFSGI----------FGLGIGRSSLVSQLNSSFS 208
           F         VQ V  GCG      F  +            F   I RS        SFS
Sbjct: 220 FAR----GARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPTQIARS-----FGRSFS 270

Query: 209 YCI---GSLHDPDYLHNKLIL---GDGAIIDEGDATPL---QFIDGHYYITLEAISVDGR 259
           YC+    S   P    +  +    G  A       TP+     +   YY+ L   SV G 
Sbjct: 271 YCLVDRTSSVRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGA 330

Query: 260 M--------LDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMR- 310
                    L +NP   +     GGV++DSGT VT L +  YEA+RD       G ++  
Sbjct: 331 RVKGVSQSDLRLNPTTGR-----GGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSP 385

Query: 311 -SYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS-MFYQPRPDAFCMAVNPAS 368
             +S  D  CY+ +    +   PTV  H  GGA +AL  ++ +        FC A+  A 
Sbjct: 386 GGFSLFDT-CYN-LSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAM--AG 441

Query: 369 INNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
            +      S+IG + QQ + V +D   ++  F    C
Sbjct: 442 TDG---GVSIIGNIQQQGFRVVFDGDAQRVGFVPKSC 475


>gi|226492633|ref|NP_001149953.1| LOC100283580 precursor [Zea mays]
 gi|195635701|gb|ACG37319.1| pepsin A [Zea mays]
          Length = 491

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 103/381 (27%), Positives = 165/381 (43%), Gaps = 40/381 (10%)

Query: 51  LPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYP 105
           LP++     L+Y  I IG PP   +  +DTGS +LWV+C  C  C  + G     T + P
Sbjct: 77  LPTD---TGLYYTRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPTRSGLGIELTQYDP 133

Query: 106 SRSSSYAYVPCDSEHCRYFPYA----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKN 161
           + S +   V C+ E C           C +    C +   Y  G  T+ F  T+ + +  
Sbjct: 134 AGSGT--TVGCEQEFCVANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQYNQ 191

Query: 162 TD---ESTIHVQDVVFGCGFST-----NRNFKFSGIFGLGIGRSSLVSQLNSS------F 207
                ++T     + FGCG        + N    GI G G   SS++SQL ++      F
Sbjct: 192 VSGNGQTTTSNASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIF 251

Query: 208 SYCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPN 266
           ++C+      D +    I   G ++  +   TPL     HY + L+ ISV G  L +  +
Sbjct: 252 AHCL------DTVRGGGIFAIGNVVQPKVKTTPLVPNVTHYNVNLQGISVGGATLQLPTS 305

Query: 267 IFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS 326
            F   DS  G +IDSGT + +L +E Y  L   V  + +   + +Y   D +C+    S 
Sbjct: 306 TFDSGDS-KGTIIDSGTTLAYLPREVYRTLLAAVFDKYQDLPLHNYQ--DFVCFQFSGSI 362

Query: 327 DLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR-YVNFSLIGMMAQQ 385
           D  GFP + F F+G   L +  D   +Q R D +CM      +  +   +  L+G +   
Sbjct: 363 D-DGFPVITFSFKGDLTLNVYPDDYLFQNRNDLYCMGFLDGGVQTKDGKDMLLLGDLVLS 421

Query: 386 FYNVGYDIGRKQKTFQRMDCE 406
              V YD+ ++   +   +C 
Sbjct: 422 NKLVVYDLEKEVIGWTDYNCS 442


>gi|18405138|ref|NP_565911.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
 gi|13877759|gb|AAK43957.1|AF370142_1 unknown protein [Arabidopsis thaliana]
 gi|15293231|gb|AAK93726.1| unknown protein [Arabidopsis thaliana]
 gi|20196976|gb|AAB87120.2| expressed protein [Arabidopsis thaliana]
 gi|20197046|gb|AAM14894.1| expressed protein [Arabidopsis thaliana]
 gi|330254616|gb|AEC09710.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
          Length = 442

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 110/380 (28%), Positives = 166/380 (43%), Gaps = 53/380 (13%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
           N    V +++G PP      +DTGS L W+HC      SP LG++F P  SS+Y+ VPC 
Sbjct: 62  NVTLTVTLAVGDPPQNISMVLDTGSELSWLHCKK----SPNLGSVFNPVSSSTYSPVPCS 117

Query: 118 SEHCRY----FPY-ARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
           S  CR      P  A C    H C     Y     TS+  +    TF      ++     
Sbjct: 118 SPICRTRTRDLPIPASCDPKTHLCHVAISY--ADATSIEGNLAHETFV---IGSVTRPGT 172

Query: 173 VFGC---GFSTN--RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLIL 226
           +FGC   G S+N   + K +G+ G+  G  S V+QL  S FSYCI       +    L+L
Sbjct: 173 LFGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFSKFSYCISGSDSSGF----LLL 228

Query: 227 GDGAIIDEG---------DATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
           GD +    G          +TPL + D   Y + LE I V  ++L +  ++F  D +G G
Sbjct: 229 GDASYSWLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAG 288

Query: 277 -VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD-------KLCYH--GIMSS 326
             M+DSGT  T+L+   Y AL++E + + +   +R    PD        LCY        
Sbjct: 289 QTMVDSGTQFTFLMGPVYTALKNEFITQTK-SVLRLVDDPDFVFQGTMDLCYKVGSTTRP 347

Query: 327 DLKGFPTVRFHFRG------GAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIG 380
           +  G P V   FRG      G KL    +    + + + +C     + +    +   +IG
Sbjct: 348 NFSGLPMVSLMFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLG--IEAFVIG 405

Query: 381 MMAQQFYNVGYDIGRKQKTF 400
              QQ   + +D+ + +  F
Sbjct: 406 HHHQQNVWMEFDLAKSRVGF 425


>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 459

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 115/407 (28%), Positives = 168/407 (41%), Gaps = 38/407 (9%)

Query: 24  RGINISIARLAYLQEKIRSHNTYQAQ----ILPSNDISNSLFYVNISIGQPPVPQFTAMD 79
           R   +S+ARL    +  R  +  Q Q    + PS D+    + V++++G PP P    +D
Sbjct: 66  RAAALSVARLGGSNKGARQQDQNQQQPGLPVRPSGDLE---YLVDLAVGTPPQPVSALLD 122

Query: 80  TGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYT 139
           TGS L+W  C PC  C PQ   IF P  SSSY  + C  E C    +  C      C Y 
Sbjct: 123 TGSDLIWTQCAPCASCLPQPDPIFSPGASSSYEPMRCAGELCNDILHHSCQ-RPDTCTYR 181

Query: 140 QLYLIGPETSVFVSTEQLTF---KNTDESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIG 195
             Y  G  T    +TE+ TF    +  E+T     + FGCG     +    SGI G G  
Sbjct: 182 YSYGDGTTTRGVYATERFTFSSSSSGGETTKLSAPLGFGCGTMNKGSLNNGSGIVGFGRA 241

Query: 196 RSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGD--GAIIDEGDAT--PLQFIDGH---- 246
             SLVSQL    FSYC+          + L+ G   G + D   AT    + +       
Sbjct: 242 PLSLVSQLAIRRFSYCLTPYASGR--KSTLLFGSLRGGVYDAATATVQTTRLLRSRQNPT 299

Query: 247 -YYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTW----LVKEAYEALRDEV 300
            YY+    ++V  R L I  + F  R D  GG ++DSGT +T     ++ E   A R + 
Sbjct: 300 FYYVPFTGVTVGARRLRIPISAFALRPDGSGGAIVDSGTALTLFPAPVLAEVVRAFRSQ- 358

Query: 301 MIRLEGEQMRSYSWPDKLCYHGIMSSDLKG--FPTVRFHFRGGAKLALEKDSMFYQPRPD 358
            +RL      S    D +C+    S   +    P + FH +G       ++ +    R  
Sbjct: 359 -LRLPFAANGSSGPDDGVCFAAAASRVPRPAVVPRMVFHLQGADLDLPRRNYVLDDQRKG 417

Query: 359 AFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
             C+ +  +  +      + IG   QQ   V YD+     +F    C
Sbjct: 418 NLCLLLADSGDSG-----TTIGNFVQQDMRVLYDLEADTLSFAPAQC 459


>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
          Length = 452

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 109/417 (26%), Positives = 172/417 (41%), Gaps = 36/417 (8%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           LIH  S  SP+  PN      +   I     RL +L+   RS        +P    S   
Sbjct: 56  LIHIYSECSPFRPPNRTWESLMSEKIRGDANRLRFLKRTSRSSKEDANANVPVRSGSGE- 114

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + + +  G P    +T +DTGS + W+ C  C+ C      IF P++SSSY    CDS+ 
Sbjct: 115 YIIQVDFGTPKQSMYTLIDTGSDVAWIPCKQCQGCH-STAPIFDPAKSSSYKPFACDSQP 173

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG--- 177
           C+      C     +C +  LY  G +    ++++ +T  +      ++ +  FGC    
Sbjct: 174 CQEI-SGNCGG-NSKCQFEVLYGDGTQVDGTLASDAITLGSQ-----YLPNFSFGCAESL 226

Query: 178 ----FSTNRNFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIID 233
               +S+       G     + ++        +FSYC   L         L+LG  A + 
Sbjct: 227 SEDTYSSPGLMGLGGGSLSLLTQAPTAELFGGTFSYC---LPSSSTSSGSLVLGKEAAVS 283

Query: 234 EGDATPLQFID-----GHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
                    I        Y++TL+AISV    + +         SGGG +IDSGT +T+L
Sbjct: 284 SSSLKFTTLIKDPSFPTFYFVTLKAISVGNTRISVPATNIA---SGGGTIIDSGTTITYL 340

Query: 289 VKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEK 348
           V  AY+ LRD    +L   Q       D  CY   +SS     PT+  H      L L K
Sbjct: 341 VPSAYKDLRDAFRQQLSSLQPTPVEDMDT-CYD--LSSSSVDVPTITLHLDRNVDLVLPK 397

Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           +++         C+A   +S ++R    S+IG + QQ + + +D+   Q  F +  C
Sbjct: 398 ENILITQESGLSCLAF--SSTDSR----SIIGNVQQQNWRIVFDVPNSQVGFAQEQC 448


>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 476

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 110/388 (28%), Positives = 167/388 (43%), Gaps = 53/388 (13%)

Query: 51  LPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYP 105
           LPS   S  L+Y  + +G P    +  +DTGS +LWV+C  C  C  + G     T++ P
Sbjct: 65  LPS---STGLYYTKVGLGSPAKEFYVQVDTGSDILWVNCAGCTACPKKSGLGMDLTLYDP 121

Query: 106 SRSSSYAYVPCDSEHCRYFPYARCSAYKH--RCIYTQLYLIGPETSVFVSTEQLTFKNTD 163
           + S +   VPC    C        S  K    C Y+  Y  G  TS     + LTF    
Sbjct: 122 NGSKTSNAVPCGDGFCTDTYSGPISGCKQDMSCPYSITYGDGSTTSGSFVNDSLTFDEV- 180

Query: 164 ESTIHVQ----DVVFGCG------FSTNRNFKFSGIFGLGIGRSSLVSQLNSS------F 207
              +H +     V+FGCG       S+N +    GI G G   SS++SQL +S      F
Sbjct: 181 SGNLHTKPDNSSVIFGCGAKQSGSLSSNSDEALDGIIGFGQANSSVLSQLAASGKVKRIF 240

Query: 208 SYCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPN 266
           S+C+      D  H   I   G +++ + + TPL     HY + L+ + VDG  + +   
Sbjct: 241 SHCL------DSHHGGGIFSIGQVMEPKFNTTPLVPRMAHYNVILKDMDVDGEPILLPLY 294

Query: 267 IFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEG-------EQMRSYSWPDKLC 319
           +F    SG G +IDSGT + +L    Y  L  +V+ R  G       +Q   + + DKL 
Sbjct: 295 LFDS-GSGRGTIIDSGTTLAYLPLSIYNQLLPKVLGRQPGLKLMIVEDQFTCFHYSDKLD 353

Query: 320 YHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRY-VNFSL 378
                    +GFP V+FHF G +      D +F   + D +C+    +S   +   +  L
Sbjct: 354 ---------EGFPVVKFHFEGLSLTVHPHDYLFLY-KEDIYCIGWQKSSTQTKEGRDLIL 403

Query: 379 IGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
           IG +      V YD+      +   +C 
Sbjct: 404 IGDLVLSNKLVVYDLENMVIGWTNFNCS 431


>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 488

 Score =  119 bits (297), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 94/372 (25%), Positives = 170/372 (45%), Gaps = 38/372 (10%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ--LG---TIFYPSRSSSYAYV 114
           L++  I +G PP   +  +DTGS +LWV+C  C  C  +  LG   T++ P  S+S   +
Sbjct: 81  LYFAKIGLGNPPKDYYVQVDTGSDILWVNCANCDKCPTKSDLGVKLTLYDPQSSTSATRI 140

Query: 115 PCDSEHC--RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ-- 170
            CD + C   Y    +       C Y+ +Y  G  T+ F   + L F   D  T ++Q  
Sbjct: 141 YCDDDFCAATYNGVLQGCTKDLPCQYSVVYGDGSSTAGFFVKDNLQF---DRVTGNLQTS 197

Query: 171 ----DVVFGCGFSTNRNFKFS-----GIFGLGIGRSSLVSQLNSS------FSYCIGSLH 215
                V+FGCG   +     S     GI G G   SS++SQL ++      F++C+    
Sbjct: 198 SANGSVIFGCGAKQSGELGTSSEALDGILGFGQANSSMISQLAAAGKVKRVFAHCL---- 253

Query: 216 DPDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG 274
             D +    I   G ++  + + TP+     HY + ++ I V G +L++  +IF   D  
Sbjct: 254 --DNVKGGGIFAIGEVVSPKVNTTPMVPNQPHYNVVMKEIEVGGNVLELPTDIFDTGDR- 310

Query: 275 GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTV 334
            G +IDSGT + +L +  YE++  +++    G  ++ ++  ++        +  +GFP V
Sbjct: 311 RGTIIDSGTTLAYLPEVVYESMMTKIVSEQPG--LKLHTVEEQFTCFQYTGNVNEGFPVV 368

Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR-YVNFSLIGMMAQQFYNVGYDI 393
           +FHF G   L +      +Q   + +C     + + ++   + +L+G +      V YD+
Sbjct: 369 KFHFNGSLSLTVNPHDYLFQIHEEVWCFGWQNSGMQSKDGRDMTLLGDLVLSNKLVLYDL 428

Query: 394 GRKQKTFQRMDC 405
             +   +   +C
Sbjct: 429 ENQAIGWTDYNC 440


>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 490

 Score =  119 bits (297), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 102/373 (27%), Positives = 166/373 (44%), Gaps = 35/373 (9%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
           L+Y  + +G PPV     +DTGS +LWV C  C  C    G       F P  SS+ + +
Sbjct: 74  LYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSSMI 133

Query: 115 PCDSEHCR---YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
            C  + C        A CS+  ++C YT  Y  G  TS +  ++ +      E ++    
Sbjct: 134 ACSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTTNS 193

Query: 172 ---VVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDP 217
              VVFGC     G  T  +    GIFG G    S++SQL+S       FS+C   L   
Sbjct: 194 TAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHC---LKGD 250

Query: 218 DYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
                 L+LG+  I++     T L     HY + L++I+V+G+ L I+ ++F   +S  G
Sbjct: 251 SSGGGILVLGE--IVEPNIVYTSLVPAQPHYNLNLQSIAVNGQTLQIDSSVFATSNS-RG 307

Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRF 336
            ++DSGT + +L +EAY+     +   +  + + +       CY  I SS  + FP V  
Sbjct: 308 TIVDSGTTLAYLAEEAYDPFVSAITASIP-QSVHTVVSRGNQCYL-ITSSVTEVFPQVSL 365

Query: 337 HFRGGAKLALEKDSMFYQPRP--DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
           +F GGA + L       Q      A    +    I  + +  +++G +  +   V YD+ 
Sbjct: 366 NFAGGASMILRPQDYLIQQNSIGGAAVWCIGFQKIQGQGI--TILGDLVLKDKIVVYDLA 423

Query: 395 RKQKTFQRMDCEV 407
            ++  +   DC +
Sbjct: 424 GQRIGWANYDCSL 436


>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
 gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
          Length = 474

 Score =  118 bits (296), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 115/402 (28%), Positives = 166/402 (41%), Gaps = 59/402 (14%)

Query: 31  ARLAYLQEKIRSHNTYQAQI--LPSND---ISNSLFYVNISIGQPPVPQFTAMDTGSSLL 85
           ARL+    KI  H  ++  +  LP+     I    + V + +G P        DTGS + 
Sbjct: 104 ARLS----KISGHGIFEEMVTKLPAQSGIAIGTGNYVVTVGLGTPKEDFTLVFDTGSGIT 159

Query: 86  WVHCYPCR-DCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYAR--CSAYKHRCIYTQLY 142
           W  C PC   C PQ    F P++S+SY  V C S  C   P +   CSA    C+Y  +Y
Sbjct: 160 WTQCQPCLGSCYPQKEQKFDPTKSTSYNNVSCSSASCNLLPTSERGCSASNSTCLYQIIY 219

Query: 143 LIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSS---- 198
                +  F +TE LT  ++D  T    + +FGCG S N      G+FG   G       
Sbjct: 220 GDQSYSQGFFATETLTISSSDVFT----NFLFGCGQSNN------GLFGQAAGLLGLSSS 269

Query: 199 -------LVSQLNSSFSYCIGSL-HDPDYLHNKLILGDGAIIDEGDATPLQ-FIDGHYYI 249
                     +    FSYC+ S      YL+       G +      TP+       Y I
Sbjct: 270 SVSLPSQTAEKYQKQFSYCLPSTPSSTGYLNF-----GGKVSQTAGFTPISPAFSSFYGI 324

Query: 250 TLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQM 309
            +  ISV G  L I+P+IF       G +IDSGT +T L   AY+AL++        E+M
Sbjct: 325 DIVGISVAGSQLPIDPSIFTTS----GAIIDSGTVITRLPPTAYKALKEAF-----DEKM 375

Query: 310 RSY--SWPDKL---CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP-DAFCMA 363
            +Y  +  D+L   CY    +     FP V   F+GG ++ ++   + Y        C+A
Sbjct: 376 SNYPKTNGDELLDTCYD-FSNYTTVSFPKVSVSFKGGVEVDIDASGILYLVNGVKMVCLA 434

Query: 364 VNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
               + N     F + G   Q+ Y V YD  +    F    C
Sbjct: 435 F---AANKDDSEFGIFGNHQQKTYEVVYDGAKGMIGFAAGAC 473


>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
          Length = 425

 Score =  118 bits (296), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 101/355 (28%), Positives = 160/355 (45%), Gaps = 28/355 (7%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + V  +IG P  P   A+DT +   W+ C  C  CS  +  +F PS+SSS   + C++  
Sbjct: 88  YIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGCSSSV--LFDPSKSSSSRTLQCEAPQ 145

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC-GFS 179
           C+  P   C+  K  C +   Y  G     +++ + LT      ++  + +  FGC   +
Sbjct: 146 CKQAPNPSCTVSKS-CGFNMTYG-GSTIEAYLTQDTLTL-----ASDVIPNYTFGCINKA 198

Query: 180 TNRNFKFSGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
           +  +    G+ GLG G  SL+SQ      S+FSYC+ +    ++    L LG        
Sbjct: 199 SGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNF-SGSLRLGPKNQPIRI 257

Query: 236 DATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRD-DSGGGVMIDSGTDVTWLVKE 291
             TPL         YY+ L  I V  +++DI  +    D  +G G + DSGT  T LV+ 
Sbjct: 258 KTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGTVYTRLVEP 317

Query: 292 AYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSM 351
           AY A+R+E   R++     S    D  CY G +      FP+V F F  G  + L  D++
Sbjct: 318 AYVAVRNEFRRRVKNANATSLGGFDT-CYSGSVV-----FPSVTFMF-AGMNVTLPPDNL 370

Query: 352 F-YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
             +    +  C+A+  A +N   V  ++I  M QQ + V  D+   +    R  C
Sbjct: 371 LIHSSAGNLSCLAMAAAPVNVNSV-LNVIASMQQQNHRVLIDVPNSRLGISRETC 424


>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 461

 Score =  118 bits (296), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 115/429 (26%), Positives = 177/429 (41%), Gaps = 65/429 (15%)

Query: 22  VQRGINISIARLAYLQ--------------EKIRSHNTYQAQILPSNDISNSLFYVNISI 67
           ++R +  S AR A L               ++   H      + PS D+    + ++++I
Sbjct: 53  IRRAMQRSKARAAALSVARSGSGRVPGKSAQQGEQHQQPGVPVRPSGDLE---YLIDLAI 109

Query: 68  GQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYA 127
           G PP P    +DTGS L+W  C PC  C  Q   +F P+ SSSY  + C  + C    + 
Sbjct: 110 GTPPQPVSALLDTGSDLIWTQCAPCASCLAQPDPLFAPAASSSYVPMRCSGQLCNDILHH 169

Query: 128 RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG-FSTNRNFKF 186
            C      C Y   Y  G  T    +TE+ TF ++    + V  + FGCG  +       
Sbjct: 170 SCQ-RPDTCTYRYNYGDGTTTLGVYATERFTFASSSGEKLSV-PLGFGCGTMNVGSLNNG 227

Query: 187 SGIFGLGIGRSSLVSQLN-SSFSYCI-------------GSLHDPDYLHNKLILGDGAII 232
           SGI G G    SLVSQL+   FSYC+             GSL D       +  GD A  
Sbjct: 228 SGIVGFGRDPLSLVSQLSIRRFSYCLTPYTSTRKSTLMFGSLSD------GVFEGDDAAT 281

Query: 233 DEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTW- 287
            +   T L   +     YY+    ++V  R L I  + F  R D  GGV++DSGT +T  
Sbjct: 282 GQVQTTRLLQSRQNPTFYYVPFTGVTVGTRRLRIPLSAFALRPDGSGGVIVDSGTALTLF 341

Query: 288 ---LVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIM--------SSDLKGFPTVRF 336
              ++ E   A R ++ +        S S  D +C+   M        ++ +   P + F
Sbjct: 342 PAAVLTEVLRAFRAQLRLPFTS----SSSPDDGVCFATPMAAGGRRASAATVVSVPRMAF 397

Query: 337 HFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
           HF+G       ++ +   PR  + C+ +  +  +      + IG   QQ   V YD+  +
Sbjct: 398 HFQGADLELPRRNYVLDDPRRGSLCILLADSGDSG-----ATIGNFVQQDMRVLYDLEAE 452

Query: 397 QKTFQRMDC 405
             +F    C
Sbjct: 453 TLSFAPAQC 461


>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
 gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
          Length = 461

 Score =  118 bits (296), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 108/406 (26%), Positives = 171/406 (42%), Gaps = 60/406 (14%)

Query: 42  SHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGT 101
           SH            I  + + V++++G PP P    +DTGS L+W  C PCRDC  Q   
Sbjct: 73  SHAVRAGLGAGGGGIVTNEYLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFHQGLP 132

Query: 102 IFYPSRSSSYAYVPCDSEHCRYFPYARCSA--------YKHRCIYTQLYLIGPETSVFVS 153
           +  P+ SS+YA +PC +  CR  P+  C              C Y   Y     T   ++
Sbjct: 133 LLDPAASSTYAALPCGAPRCRALPFTSCGGGGRSSWGNGNRSCAYIYHYGDKSVTVGEIA 192

Query: 154 TEQLTFKNTD---ESTIHVQDVVFGCGFSTNRNFKF--SGIFGLGIGRSSLVSQLN-SSF 207
           T++ TF   +   +S +  + + FGCG      F+   +GI G G GR SL SQLN ++F
Sbjct: 193 TDRFTFGGDNGDGDSRLPTRRLTFGCGHFNKGVFQSNETGIAGFGRGRWSLPSQLNVTTF 252

Query: 208 SYCIGSLHDPDYLHNKLILGDGA------------IIDEGDATPLQFIDGH---YYITLE 252
           SYC  S+ +     + L+   GA            I  E   TPL         Y+++L+
Sbjct: 253 SYCFTSMFES---KSSLVTLGGAPAAALLYSHAAHISGEVRTTPLLKNPSQPSLYFLSLK 309

Query: 253 AISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIR--------L 304
            ISV    L + P    R       +IDSG  +T L +  YEA++ E   +        +
Sbjct: 310 GISVGKTRLAV-PEAKLRS-----TIIDSGASITTLPEAVYEAVKAEFAAQVGLPPTGVV 363

Query: 305 EGEQMRSYSWPDKLCYHGIMSSDLK--GFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCM 362
           EG  +        LC+   +++  +    P++  H  G        + +F        C+
Sbjct: 364 EGSAL-------DLCFALPVTALWRRPPVPSLTLHLDGADWELPRGNYVFEDLAARVMCV 416

Query: 363 AVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
            ++ A  +      ++IG   QQ  +V YD+     +F    C+ L
Sbjct: 417 VLDAAPGDQ-----TVIGNFQQQNTHVVYDLENDWLSFAPARCDSL 457


>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
 gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
 gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 425

 Score =  118 bits (296), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 101/355 (28%), Positives = 160/355 (45%), Gaps = 28/355 (7%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + V  +IG P  P   A+DT +   W+ C  C  CS  +  +F PS+SSS   + C++  
Sbjct: 88  YIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGCSSSV--LFDPSKSSSSRTLQCEAPQ 145

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC-GFS 179
           C+  P   C+  K  C +   Y  G     +++ + LT      ++  + +  FGC   +
Sbjct: 146 CKQAPNPSCTVSKS-CGFNMTYG-GSTIEAYLTQDTLTL-----ASDVIPNYTFGCINKA 198

Query: 180 TNRNFKFSGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
           +  +    G+ GLG G  SL+SQ      S+FSYC+ +    ++    L LG        
Sbjct: 199 SGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNF-SGSLRLGPKNQPIRI 257

Query: 236 DATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRD-DSGGGVMIDSGTDVTWLVKE 291
             TPL         YY+ L  I V  +++DI  +    D  +G G + DSGT  T LV+ 
Sbjct: 258 KTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGTVYTRLVEP 317

Query: 292 AYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSM 351
           AY A+R+E   R++     S    D  CY G +      FP+V F F  G  + L  D++
Sbjct: 318 AYVAVRNEFRRRVKNANATSLGGFDT-CYSGSVV-----FPSVTFMF-AGMNVTLPPDNL 370

Query: 352 F-YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
             +    +  C+A+  A +N   V  ++I  M QQ + V  D+   +    R  C
Sbjct: 371 LIHSSAGNLSCLAMAAAPVNVNSV-LNVIASMQQQNHRVLIDVPNSRLGISRETC 424


>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
 gi|223948083|gb|ACN28125.1| unknown [Zea mays]
 gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
          Length = 466

 Score =  118 bits (296), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 115/433 (26%), Positives = 188/433 (43%), Gaps = 57/433 (13%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDIS--- 57
           L+HR    SP  +  E P+H    G +    R A +  K+ S     A+ L  + ++   
Sbjct: 63  LVHRHGPCSPVMS-KEKPSHEETLGRDQ--LRAANIHAKLSSPRNSSAKELQQSGVTIPT 119

Query: 58  -------NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC--RDCSPQLGTIFYPSRS 108
                     + + +S+G P V Q  ++DTGS + WV C PC  + CS Q   +F P++S
Sbjct: 120 SSGYSLGTPEYVITVSLGTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKDKLFDPAKS 179

Query: 109 SSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIH 168
           ++Y+   C S  C              C Y   Y+    T+    ++ L    +D     
Sbjct: 180 ATYSAFSCSSAQCAQLGGEGNGCLNSHCQYIVKYVDHSNTTGTYGSDTLGLTTSDA---- 235

Query: 169 VQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNSS----FSYCI--GSLHDPDYLH 221
           V++  FGC    N    +  G+ GLG    SLVSQ  ++    FSYC+   S     +L 
Sbjct: 236 VKNFQFGCSHRANGFVGQLDGLMGLGGDTESLVSQTAATYGKAFSYCLPPSSSSAGGFLT 295

Query: 222 NKLILGDGAIIDEGDATPL-QF-IDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
                G G        TPL +F +   Y + L+AI+V G  L++  ++F      G  ++
Sbjct: 296 LGAAAG-GTSSSRYSRTPLVRFNVPTFYGVFLQAITVAGTKLNVPASVFS-----GASVV 349

Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS--DLKGFPTVR-- 335
           DSGT +T L   AY+ALR         ++M++Y     +   GI+ +  D  G  TVR  
Sbjct: 350 DSGTVITQLPPTAYQALRTAFK-----KEMKAYPSAAPV---GILDTCFDFSGIKTVRVP 401

Query: 336 ---FHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
                F  GA + L+   +FY     A C+A    + +    +  ++G + Q+ + + +D
Sbjct: 402 VVTLTFSRGAVMDLDVSGIFY-----AGCLAFTATAQDG---DTGILGNVQQRTFEMLFD 453

Query: 393 IGRKQKTFQRMDC 405
           +G     F+   C
Sbjct: 454 VGGSTLGFRPGAC 466


>gi|223942467|gb|ACN25317.1| unknown [Zea mays]
 gi|413936886|gb|AFW71437.1| pepsin A [Zea mays]
          Length = 491

 Score =  118 bits (295), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 103/381 (27%), Positives = 164/381 (43%), Gaps = 40/381 (10%)

Query: 51  LPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYP 105
           LP++     L+Y  I IG PP   +  +DTGS +LWV+C  C  C  + G     T + P
Sbjct: 77  LPTD---TGLYYTRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPTRSGLGIELTQYDP 133

Query: 106 SRSSSYAYVPCDSEHCRYFPYA----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKN 161
           + S +   V C+ E C           C +    C +   Y  G  T+ F  T+ + +  
Sbjct: 134 AGSGT--TVGCEQEFCVANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQYNQ 191

Query: 162 TD---ESTIHVQDVVFGCGFST-----NRNFKFSGIFGLGIGRSSLVSQLNSS------F 207
                ++T     + FGCG        + N    GI G G   SS++SQL ++      F
Sbjct: 192 VSGNGQTTTSNASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIF 251

Query: 208 SYCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPN 266
           ++C+      D +    I   G ++  +   TPL     HY + L+ ISV G  L +  +
Sbjct: 252 AHCL------DTVRGGGIFAIGNVVQPKVKTTPLVPNVTHYNVNLQGISVGGATLQLPTS 305

Query: 267 IFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS 326
            F   DS  G +IDSGT + +L +E Y  L   V  + +   + +Y   D +C+    S 
Sbjct: 306 TFDSGDS-KGTIIDSGTTLAYLPREVYRTLLAAVFDKYQDLPLHNYQ--DFVCFQFSGSI 362

Query: 327 DLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR-YVNFSLIGMMAQQ 385
           D  GFP + F F G   L +  D   +Q R D +CM      +  +   +  L+G +   
Sbjct: 363 D-DGFPVITFSFEGDLTLNVYPDDYLFQNRNDLYCMGFLDGGVQTKDGKDMLLLGDLVLS 421

Query: 386 FYNVGYDIGRKQKTFQRMDCE 406
              V YD+ ++   +   +C 
Sbjct: 422 NKLVVYDLEKEVIGWTDYNCS 442


>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
 gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
          Length = 445

 Score =  118 bits (295), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 101/365 (27%), Positives = 170/365 (46%), Gaps = 35/365 (9%)

Query: 63  VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR 122
           + +SIG PP P+   +DTGS L+W  C        +   ++ P++SSS+A  PCD   C 
Sbjct: 91  LTVSIGTPPQPRTLILDTGSDLIWTQCKLFDTRQHREKPLYDPAKSSSFAAAPCDGRLCE 150

Query: 123 YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNR 182
              +   +  +++CIYT  Y     T   +++E  TF      ++ +    FGCG  T+ 
Sbjct: 151 TGSFNTKNCSRNKCIYTYNY-GSATTKGELASETFTFGEHRRVSVSLD---FGCGKLTSG 206

Query: 183 NFK-FSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT-P 239
           +    SGI G+   R SLVSQL    FSYC+    D +   + +  G  A + +   T P
Sbjct: 207 SLPGASGILGISPDRLSLVSQLQIPRFSYCLTPFLDRNTT-SHIFFGAMADLSKYRTTGP 265

Query: 240 LQFI------DG---HYYITLEAISVDGRMLDINPNIFK--RDDSGGGVMIDSGTDVTWL 288
           +Q        DG   +YY+ L  ISV  + L++  + F   RD S GG  +DSG     L
Sbjct: 266 IQTTSLVTNPDGSNYYYYVPLIGISVGTKRLNVPVSSFAIGRDGS-GGTFVDSGDTTGML 324

Query: 289 VKEAYEALRDEVM--IRLEGEQMRSYSWPDKLCYH------GIMSSDLKGFPTVRFHFRG 340
                EAL++ ++  ++L       + +  +LC+       G + + ++  P + +HF G
Sbjct: 325 PSVVMEALKEAMVEAVKLPVVNATDHGYEYELCFQLPRNGGGAVETAVQ-VPPLVYHFDG 383

Query: 341 GAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
           GA + L +DS   +      C+ ++  +        ++IG   QQ  +V +D+   + +F
Sbjct: 384 GAAMLLRRDSYMVEVSAGRMCLVISSGARG------AIIGNYQQQNMHVLFDVENHEFSF 437

Query: 401 QRMDC 405
               C
Sbjct: 438 APTQC 442


>gi|26451756|dbj|BAC42973.1| unknown protein [Arabidopsis thaliana]
          Length = 442

 Score =  118 bits (295), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 110/380 (28%), Positives = 166/380 (43%), Gaps = 53/380 (13%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
           N    V +++G PP      +DTGS L W+HC      SP LG++F P  SS+Y+ VPC 
Sbjct: 62  NVTLTVTLAVGDPPQNISMVLDTGSELSWLHCKK----SPNLGSVFNPVSSSTYSPVPCS 117

Query: 118 SEHCRY----FPY-ARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
           S  CR      P  A C    H C     Y     TS+  +    TF      ++     
Sbjct: 118 SPICRTRTRDLPIPASCDPKTHLCHVAISY--ADATSIEGNLAHETFV---IGSVTRPGT 172

Query: 173 VFGC---GFSTN--RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLIL 226
           +FGC   G S+N   + K +G+ G+  G  S V+QL  S FSYCI       +    L+L
Sbjct: 173 LFGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFSKFSYCISGSDSSVF----LLL 228

Query: 227 GDGAIIDEG---------DATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
           GD +    G          +TPL + D   Y + LE I V  ++L +  ++F  D +G G
Sbjct: 229 GDASYSWLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAG 288

Query: 277 -VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD-------KLCYH--GIMSS 326
             M+DSGT  T+L+   Y AL++E + + +   +R    PD        LCY        
Sbjct: 289 QTMVDSGTQFTFLMGPVYTALKNEFITQTK-SVLRLVDDPDFVFQGTMDLCYKVGSTTRP 347

Query: 327 DLKGFPTVRFHFRG------GAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIG 380
           +  G P V   FRG      G KL    +    + + + +C     + +    +   +IG
Sbjct: 348 NFSGLPMVSLMFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLG--IEAFVIG 405

Query: 381 MMAQQFYNVGYDIGRKQKTF 400
              QQ   + +D+ + +  F
Sbjct: 406 HHHQQNVWMEFDLAKSRVGF 425


>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 480

 Score =  118 bits (295), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 90/355 (25%), Positives = 159/355 (44%), Gaps = 40/355 (11%)

Query: 39  KIRSHNTYQ-AQILPSNDI---------SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVH 88
           +++SH++++ A++L + D+         S  L++  I +G PP   +  +DTGS +LWV+
Sbjct: 45  ELKSHDSFRHARMLANIDLPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVN 104

Query: 89  CYPCRDC--SPQLG---TIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYL 143
           C PC  C     LG   +++    SS+   V C+   C +   +     K  C Y  +Y 
Sbjct: 105 CAPCPKCPVKTDLGIPLSLYDSKASSTSKNVGCEDAFCSFIMQSETCGAKKPCSYHVVYG 164

Query: 144 IGPETSVFVSTEQLTFKNTD---ESTIHVQDVVFGCGFSTNRNF-----KFSGIFGLGIG 195
            G  +      + +T         +    Q+VVFGCG + +           GI G G  
Sbjct: 165 DGSTSDGDFVKDNITLDQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTESAVDGIMGFGQS 224

Query: 196 RSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAIIDE-GDATPLQFIDGHYY 248
            +S++SQL +       FS+C+      D ++   I   G +       TPL     HY 
Sbjct: 225 NTSVISQLAAGGSVKRIFSHCL------DNMNGGGIFAIGEVESPVVKTTPLVPNQVHYN 278

Query: 249 ITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQ 308
           + L+ + VDG  +D+ P++    +  GG +IDSGT + +L +  Y +L +++  +   +Q
Sbjct: 279 VILKGMDVDGEPIDLPPSL-ASTNGDGGTIIDSGTTLAYLPQNLYNSLIEKITAK---QQ 334

Query: 309 MRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMA 363
           ++ +   +        S+  K FP V  HF    KL++      +  R D +C  
Sbjct: 335 VKLHMVQETFACFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYLFSLREDMYCFG 389


>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
 gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
 gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
 gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
 gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 535

 Score =  118 bits (295), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 107/379 (28%), Positives = 171/379 (45%), Gaps = 50/379 (13%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           +++++ +G PP      +DTGS L W+ C PC DC  Q G  + P  S+SY  + C+ + 
Sbjct: 170 YFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQNGAFYDPKASASYKNITCNDQR 229

Query: 121 CRYF----PYARCSAYKHRCIYTQLYLIGPETS----VFVSTEQLTFKNTDESTIHVQDV 172
           C       P   C +    C Y   Y     T+    V   T  LT         +V+++
Sbjct: 230 CNLVSSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSELYNVENM 289

Query: 173 VFGCGFSTNRNFKFSGIF-------GLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLH 221
           +FGCG   NR     G+F       GLG G  S  SQL S    SFSYC+   +    + 
Sbjct: 290 MFGCG-HWNR-----GLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVS 343

Query: 222 NKLILG-DGAIIDEGDATPLQFIDGH-------YYITLEAISVDGRMLDINPNIFK-RDD 272
           +KLI G D  ++   +     F+ G        YY+ +++I V G +L+I    +    D
Sbjct: 344 SKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETWNISSD 403

Query: 273 SGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQ--MRSYSWPDKLCYH--GIMSSDL 328
             GG +IDSGT +++  + AYE +++++  + +G+    R +   D  C++  GI +  L
Sbjct: 404 GAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDP-CFNVSGIHNVQL 462

Query: 329 KGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAV--NPASINNRYVNFSLIGMMAQQF 386
              P +   F  GA      ++ F     D  C+A+   P S       FS+IG   QQ 
Sbjct: 463 ---PELGIAFADGAVWNFPTENSFIWLNEDLVCLAMLGTPKSA------FSIIGNYQQQN 513

Query: 387 YNVGYDIGRKQKTFQRMDC 405
           +++ YD  R +  +    C
Sbjct: 514 FHILYDTKRSRLGYAPTKC 532


>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 523

 Score =  118 bits (295), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 115/374 (30%), Positives = 173/374 (46%), Gaps = 25/374 (6%)

Query: 42  SHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC---RDCSPQ 98
           S N+  A +          ++  I +GQP    F   DTGS + W+ C PC     C  Q
Sbjct: 165 STNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQ 224

Query: 99  LGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLT 158
           +G IF P  SSSY+ + CDSE C     A C A  + CIY   Y  G  T   ++TE  +
Sbjct: 225 IGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDA--NSCIYEVEYGDGSFTVGELATETFS 282

Query: 159 FKNTDESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHD 216
           F++++     + ++  GCG      F   +G+ GLG G  SL SQL  +SFSYC   L D
Sbjct: 283 FRHSNS----IPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLEATSFSYC---LVD 335

Query: 217 PDYLHNKLILGDGAIIDEGDATPLQFID---GHYYITLEAISVDGRMLDINPNIFKRDDS 273
            D   +  +  +     +   +PL   D      Y+ +  +SV G+ L I+ + F+ D+S
Sbjct: 336 LDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDES 395

Query: 274 G-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFP 332
           G GG+++DSGT +T +  + Y+ LRD  +   +         P   CY     S+++  P
Sbjct: 396 GSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVE-VP 454

Query: 333 TVRFHFRGGAKLAL-EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
           T+ F   G   L L  K+ +F       FC+A  P++        S+IG + QQ   V Y
Sbjct: 455 TIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTF-----PLSIIGNVQQQGIRVSY 509

Query: 392 DIGRKQKTFQRMDC 405
           D+      F    C
Sbjct: 510 DLANSLVGFSTDKC 523


>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
          Length = 452

 Score =  118 bits (295), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 114/421 (27%), Positives = 178/421 (42%), Gaps = 44/421 (10%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           LIH  S  SP+  PN      +   I     RL +L+   RS        +P    S   
Sbjct: 56  LIHIYSECSPFRPPNRTWESLMSEKIRGDANRLRFLKRTSRSSKQDANANVPVRSGSGE- 114

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + + +  G P    +T +DTGS + W+ C  C+ C      IF P++SSSY    CDS+ 
Sbjct: 115 YIIQVDFGTPKQSMYTLIDTGSDVAWIPCKQCQGCH-STAPIFDPAKSSSYKPFACDSQP 173

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C+      C     +C +   Y  G +    ++++ +T  +      ++ +  FGC  S 
Sbjct: 174 CQEI-SGNCGG-NSKCQFEVSYGDGTQVDGTLASDAITLGSQ-----YLPNFSFGCAESL 226

Query: 181 NRNFKFS-------GIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIID 233
           + +   S       G     + ++        +FSYC   L         L+LG  A + 
Sbjct: 227 SEDTSPSPGLMGLGGGSLSLLTQAPTAELFGGTFSYC---LPSSSTSSGSLVLGKEAAVS 283

Query: 234 EGDATPLQF--------IDGHYYITLEAISVDGRMLDI-NPNIFKRDDSGGGVMIDSGTD 284
              ++ L+F        I   Y++TL+AISV    + +   NI     SGGG +IDSGT 
Sbjct: 284 ---SSSLKFTTLIKDPSIPTFYFVTLKAISVGNTRISVPGTNI----ASGGGTIIDSGTT 336

Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKL 344
           +T LV  AY ALRD    +L   Q       D  CY   +SS     PT+  H      L
Sbjct: 337 ITHLVPSAYTALRDAFRQQLSSLQPTPVEDMDT-CYD--LSSSSVDVPTITLHLDRNVDL 393

Query: 345 ALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
            L K+++         C+A   +S ++R    S+IG + QQ + + +D+   Q  F +  
Sbjct: 394 VLPKENILITQESGLACLAF--SSTDSR----SIIGNVQQQNWRIVFDVPNSQVGFAQEQ 447

Query: 405 C 405
           C
Sbjct: 448 C 448


>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
          Length = 401

 Score =  118 bits (295), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 98/323 (30%), Positives = 149/323 (46%), Gaps = 38/323 (11%)

Query: 54  NDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAY 113
           N +  + + V+++IG PP P    +DTGS L+W  C PC  C  Q    F PS SS+ + 
Sbjct: 75  NGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSL 134

Query: 114 VPCDSEHCRYFPYARCSAYK----HRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHV 169
             CDS  C+  P A C + K      C+YT  Y     T+ F+  ++ TF     S   V
Sbjct: 135 TSCDSTLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGAS---V 191

Query: 170 QDVVFGCGFSTNRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLIL 226
             V FGCG   N  FK   +GI G G G  SL SQL   +FS+C  +++    L    +L
Sbjct: 192 PGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVNG---LKPSTVL 248

Query: 227 ----------GDGAIIDEGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDS 273
                     G GA+     +TPL     +   YY++L+ I+V    L +  + F   + 
Sbjct: 249 LDLPADLYKSGRGAV----QSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNG 304

Query: 274 GGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG--- 330
            GG +IDSGT +T L    Y  +RD    +++   +   +     C    +S+ L+    
Sbjct: 305 TGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFC----LSAPLRAKPY 360

Query: 331 FPTVRFHFRGGAKLALEKDSMFY 353
            P +  HF  GA + L +++  +
Sbjct: 361 VPKLVLHFE-GATMDLPRENYVW 382


>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
          Length = 478

 Score =  118 bits (295), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 89/355 (25%), Positives = 161/355 (45%), Gaps = 40/355 (11%)

Query: 39  KIRSHNTYQ-AQILPSNDI---------SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVH 88
           +++SH++++ A++L + D+         S  L++  I +G PP   +  +DTGS +LWV+
Sbjct: 42  ELKSHDSFRHARMLANIDLPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVN 101

Query: 89  CYPCRDC--SPQLG---TIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYL 143
           C PC  C     LG   +++    SS+   V C+ + C +   +     K  C Y  +Y 
Sbjct: 102 CAPCPKCPVKTDLGIPLSLYDSKTSSTSKNVGCEDDFCSFIMQSETCGAKKPCSYHVVYG 161

Query: 144 IGPETSVFVSTEQLTFKNTD---ESTIHVQDVVFGCGFSTNRNF-----KFSGIFGLGIG 195
            G  +      + +T +       +    Q+VVFGCG + +           GI G G  
Sbjct: 162 DGSTSDGDFIKDNITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQS 221

Query: 196 RSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAIIDE-GDATPLQFIDGHYY 248
            +S++SQL +       FS+C+      D ++   I   G +       TP+     HY 
Sbjct: 222 NTSIISQLAAGGSTKRIFSHCL------DNMNGGGIFAVGEVESPVVKTTPIVPNQVHYN 275

Query: 249 ITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQ 308
           + L+ + VDG  +D+ P++    +  GG +IDSGT + +L +  Y +L +++  +   +Q
Sbjct: 276 VILKGMDVDGDPIDLPPSL-ASTNGDGGTIIDSGTTLAYLPQNLYNSLIEKITAK---QQ 331

Query: 309 MRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMA 363
           ++ +   +        S+  K FP V  HF    KL++      +  R D +C  
Sbjct: 332 VKLHMVQETFACFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYLFSLREDMYCFG 386


>gi|357439021|ref|XP_003589787.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355478835|gb|AES60038.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 456

 Score =  118 bits (295), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 115/423 (27%), Positives = 171/423 (40%), Gaps = 72/423 (17%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHN-----------TYQAQ 49
           L HRD+I    N        R    IN  I R+ +L  ++  +            ++ + 
Sbjct: 62  LFHRDNI----NLKKTTHKTRFISRINRDIKRVTFLLNRLNKNTQEQQTTTATEASFGSD 117

Query: 50  ILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSS 109
           ++   +  +  ++V I IG P + Q+  +D+GS ++W+ C PC  C  Q   IF P+ S+
Sbjct: 118 VVSGTEEGSGEYFVRIGIGSPAIYQYMVIDSGSDIVWIQCEPCDQCYNQTDPIFNPATSA 177

Query: 110 SYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHV 169
           S+  V C S  C        +  K RC Y   Y  G  T   ++ E +T   T      +
Sbjct: 178 SFIGVACSSNVCNQLD-DDVACRKGRCGYQVAYGDGSYTKGTLALETITIGRT-----VI 231

Query: 170 QDVVFGCGFSTNRNFKFSGIFGLGIGRS-SLVSQLNS----SFSYC-------IGSLHDP 217
           QD   GCG      F  +       G   S V QL +    +F YC       +G++  P
Sbjct: 232 QDTAIGCGHWNEGMFVGAAGLLGLGGGPMSFVGQLGAQTGGAFGYCLVSRAMPVGAMWVP 291

Query: 218 DYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GG 276
             +HN                   F    YY++L  ++V G  + I+  IF+  D G GG
Sbjct: 292 -LIHNP------------------FYPSFYYVSLSGLAVGGIRVPISEQIFQLTDIGTGG 332

Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF----- 331
           V++D+GT +T L   AY A RD  + +               CY      DL GF     
Sbjct: 333 VVMDTGTAITRLPTVAYNAFRDAFIAQTTNLPRAPGVSIFDTCY------DLNGFVTVRV 386

Query: 332 PTVRFHFRGGAKLALEKDSMFYQPRPD--AFCMAVNPASINNRYVNFSLIGMMAQQFYNV 389
           PTV F+F GG  L     + F  P  D   FC A  P+         S+IG + Q+   V
Sbjct: 387 PTVSFYFSGGQILTFPARN-FLIPADDVGTFCFAFAPSP-----SGLSIIGNIQQEGIQV 440

Query: 390 GYD 392
             D
Sbjct: 441 SID 443


>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
 gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 482

 Score =  118 bits (295), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 89/355 (25%), Positives = 161/355 (45%), Gaps = 40/355 (11%)

Query: 39  KIRSHNTYQ-AQILPSNDI---------SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVH 88
           +++SH++++ A++L + D+         S  L++  I +G PP   +  +DTGS +LWV+
Sbjct: 46  ELKSHDSFRHARMLANIDLPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVN 105

Query: 89  CYPCRDC--SPQLG---TIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYL 143
           C PC  C     LG   +++    SS+   V C+ + C +   +     K  C Y  +Y 
Sbjct: 106 CAPCPKCPVKTDLGIPLSLYDSKTSSTSKNVGCEDDFCSFIMQSETCGAKKPCSYHVVYG 165

Query: 144 IGPETSVFVSTEQLTFKNTD---ESTIHVQDVVFGCGFSTNRNF-----KFSGIFGLGIG 195
            G  +      + +T +       +    Q+VVFGCG + +           GI G G  
Sbjct: 166 DGSTSDGDFIKDNITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQS 225

Query: 196 RSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAIIDE-GDATPLQFIDGHYY 248
            +S++SQL +       FS+C+      D ++   I   G +       TP+     HY 
Sbjct: 226 NTSIISQLAAGGSTKRIFSHCL------DNMNGGGIFAVGEVESPVVKTTPIVPNQVHYN 279

Query: 249 ITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQ 308
           + L+ + VDG  +D+ P++    +  GG +IDSGT + +L +  Y +L +++  +   +Q
Sbjct: 280 VILKGMDVDGDPIDLPPSL-ASTNGDGGTIIDSGTTLAYLPQNLYNSLIEKITAK---QQ 335

Query: 309 MRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMA 363
           ++ +   +        S+  K FP V  HF    KL++      +  R D +C  
Sbjct: 336 VKLHMVQETFACFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYLFSLREDMYCFG 390


>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
 gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
 gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
          Length = 485

 Score =  117 bits (294), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 107/360 (29%), Positives = 147/360 (40%), Gaps = 37/360 (10%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + V++ +G P        DTGS L WV C PC DC  Q   +F PS SS+YA V C +  
Sbjct: 149 YVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPCADCYEQQDPLFDPSLSSTYAAVACGAPE 208

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C+    + CS+   RC Y   Y    +T   +  + LT   +D     +   VFGCG   
Sbjct: 209 CQELDASGCSS-DSRCRYEVQYGDQSQTDGNLVRDTLTLSASDT----LPGFVFGCGDQN 263

Query: 181 NRNF-KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
              F +  G+FGLG  + SL SQ   S    F+YC+     P     +  L  G      
Sbjct: 264 AGLFGQVDGLFGLGREKVSLPSQGAPSYGPGFTYCL-----PSSSSGRGYLSLGG-APPA 317

Query: 236 DATPLQFIDGH----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKE 291
           +A      DG     YYI L  I V GR + I    F         +IDSGT +T L   
Sbjct: 318 NAQFTALADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGG---TVIDSGTVITRLPPR 374

Query: 292 AYEALRDEVMIRLEGEQMRSYSWPDKL-----CYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
           AY  LR           M  Y     L     CY           PTV   F GGA ++L
Sbjct: 375 AYAPLRAAFA-----RSMAQYKKAPALSILDTCYD-FTGHRTAQIPTVELAFAGGATVSL 428

Query: 347 EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
           +   + Y  +    C+A  P   N    + +++G   Q+ + V YD+  ++  F    C 
Sbjct: 429 DFTGVLYVSKVSQACLAFAP---NADDSSIAILGNTQQKTFAVAYDVANQRIGFGAKGCS 485


>gi|358345197|ref|XP_003636668.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355502603|gb|AES83806.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 294

 Score =  117 bits (294), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 83/265 (31%), Positives = 125/265 (47%), Gaps = 28/265 (10%)

Query: 28  ISIARLAYLQEKIRSHNT-YQAQILPSNDISNSL---------------FYVNISIGQPP 71
           ISI    ++   I +HN  +  +++P N   +                 + + +SIG PP
Sbjct: 10  ISILLFVFIFPHIEAHNGGFTGKLIPRNSSKDFFNRNTIQSPVSANHYDYLMELSIGTPP 69

Query: 72  VPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSA 131
           V  +   DTGS L+W+ C PC +C  QL  +F    SS+++ + C SE C       CS 
Sbjct: 70  VKIYAQADTGSDLIWLQCIPCTNCYKQLNPMFDSQSSSTFSNIACGSESCSKLYSTSCSP 129

Query: 132 YKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF--KFSGI 189
            +  C Y   Y+ G ET   ++ E LT  +T    +  + V+FGCG + N  F  K  GI
Sbjct: 130 DQINCKYNYSYVDGSETQGVLAQETLTLTSTTGEPVAFKGVIFGCGHNNNGAFNDKEMGI 189

Query: 190 FGLGIGRSSLVSQLNSS-----FSYCIGSLHDPDYLHNKLILGDGA-IIDEG-DATPL-- 240
            GLG G  SLVSQ+ SS     FS C+   +    + + +  G G+ ++  G  +TPL  
Sbjct: 190 IGLGRGPLSLVSQIGSSLGGNMFSQCLVPFNTNPSISSPMSFGKGSEVLGNGVVSTPLVS 249

Query: 241 -QFIDGHYYITLEAISVDGRMLDIN 264
                  Y++TL  ISV+   L  N
Sbjct: 250 KTTYQSFYFVTLLGISVEDINLPFN 274


>gi|22165126|gb|AAM93742.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
           Japonica Group]
 gi|31433307|gb|AAP54836.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
 gi|125575547|gb|EAZ16831.1| hypothetical protein OsJ_32302 [Oryza sativa Japonica Group]
          Length = 405

 Score =  117 bits (294), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 97/373 (26%), Positives = 159/373 (42%), Gaps = 42/373 (11%)

Query: 57  SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPC 116
           S  L+  N +IG PP P    +D    L+W  C PC+ C  Q   +F P++SS++  +PC
Sbjct: 53  SQGLYVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPC 112

Query: 117 DSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
            S  C   P +  +     CIY      G +T     T+        E+      + FGC
Sbjct: 113 GSHLCESIPESSRNCTSDVCIYEAPTKAG-DTGGKAGTDTFAIGAAKET------LGFGC 165

Query: 177 GFSTNRNFKF----SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAI 231
              T++  K     SGI GLG    SLV+Q+N ++FSYC+            L LG  A 
Sbjct: 166 VVMTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVTAFSYCLAGKS-----SGALFLGATAK 220

Query: 232 -IDEGDATPLQFI------------DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVM 278
            +  G  +   F+            + +Y + L  I   G  L           SG  V+
Sbjct: 221 QLAGGKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKTGGAPLQ------AASSSGSTVL 274

Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHF 338
           +D+ +  ++L   AY+AL+  +   +  + + S   P  LC+   ++ D    P + F F
Sbjct: 275 LDTVSRASYLADGAYKALKKALTAAVGVQPVASPPKPYDLCFPKAVAGDA---PELVFTF 331

Query: 339 RGGAKLALEKDSMFYQPRPDAFCMAV-NPASIN--NRYVNFSLIGMMAQQFYNVGYDIGR 395
            GGA L +   +          C+ + + AS+N        S++G + Q+  +V +D+  
Sbjct: 332 DGGAALTVPPANYLLASGNGTVCLTIGSSASLNLTGELEGASILGSLQQENVHVLFDLKE 391

Query: 396 KQKTFQRMDCEVL 408
           +  +F+  DC  L
Sbjct: 392 ETLSFKPADCSSL 404


>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
          Length = 463

 Score =  117 bits (294), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 102/360 (28%), Positives = 154/360 (42%), Gaps = 38/360 (10%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD--CSPQLGTIFYPSRSSSYAYVPCDS 118
           + +++ +G P V Q   +DTGS + WV C PC +  C  Q G +F P++SS+Y  V C +
Sbjct: 127 YVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPCHAQTGALFDPAKSSTYRAVSCAA 186

Query: 119 EHCRYFPYA--RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
             C         C A  + C Y   Y  G  T+   S + LT     ++   V+   FGC
Sbjct: 187 AECAQLEQQGNGCGATNYECQYGVQYGDGSTTNGTYSRDTLTLSGASDA---VKGFQFGC 243

Query: 177 -----GFSTNRNFKFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILG 227
                GFS   +    G+ GLG G  SLVSQ      +SFSYC+               G
Sbjct: 244 SHLESGFSDQTD----GLMGLGGGAQSLVSQTAAAYGNSFSYCLPPTSGSSGFLTLGGGG 299

Query: 228 DGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
             +          + I   Y   L+ I+V G+ L ++P++F       G ++DSGT +T 
Sbjct: 300 GASGFVTTRMLRSKQIPTFYGARLQDIAVGGKQLGLSPSVFA-----AGSVVDSGTIITR 354

Query: 288 LVKEAYEALRDEVMIRLEGEQMRSYSWPDKL--CYHGIMSSDLKGFPTVRFHFRGGAKLA 345
           L   AY AL       +  +Q RS      L  C+     + +   PTV   F GGA + 
Sbjct: 355 LPPTAYSALSSAFKAGM--KQYRSAPARSILDTCFDFAGQTQIS-IPTVALVFSGGAAID 411

Query: 346 LEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           L+ + + Y       C+A      +       +IG + Q+ + V YD+G     F+   C
Sbjct: 412 LDPNGIMY-----GNCLAFAATGDDG---TTGIIGNVQQRTFEVLYDVGSSTLGFRSGAC 463


>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 493

 Score =  117 bits (293), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 102/320 (31%), Positives = 143/320 (44%), Gaps = 38/320 (11%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
           L+Y  I +G PP   +  +DTGS +LWV C  C  C    G       F P  S +   V
Sbjct: 80  LYYTKIRLGSPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTATPV 139

Query: 115 PCDSEHCRYFPYAR---CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI---H 168
            C  + C +   +    CS   + C YT  Y  G  TS F  ++ L F     S++    
Sbjct: 140 SCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNS 199

Query: 169 VQDVVFGCGFSTNRNFKFS-----GIFGLGIGRSSLVSQLNSS------FSYCIGSLHDP 217
              VVFGC  S   +   S     GIFG G    S++SQL S       FS+C   L   
Sbjct: 200 TAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGLAPRVFSHC---LKGE 256

Query: 218 DYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
           +     L+LG+  I++     TPL     HY + L +ISV+G+ L INP++F   + G G
Sbjct: 257 NGGGGILVLGE--IVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSN-GQG 313

Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRF 336
            +ID+GT + +L + AY     E +     + +R        CY  I +S    FP V  
Sbjct: 314 TIIDTGTTLAYLSEAAYVPFV-EAITNAVSQSVRPVVSKGNQCYV-IATSVADIFPPVSL 371

Query: 337 HFRGGAKLALEKDSMFYQPR 356
           +F GGA       SMF  P+
Sbjct: 372 NFAGGA-------SMFLNPQ 384


>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
          Length = 519

 Score =  117 bits (293), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 99/357 (27%), Positives = 151/357 (42%), Gaps = 29/357 (8%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYVPCDSE 119
           + V + +G P        DTGS   WV C PC   C  Q   +F P+RSS+YA + C + 
Sbjct: 180 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTYANISCAAP 239

Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
            C       CS     C+Y   Y  G  +  F + + LT  + D     V+   FGCG  
Sbjct: 240 ACSDLDTRGCSG--GNCLYGVQYGDGSYSIGFFAMDTLTLSSYDA----VKGFRFGCGER 293

Query: 180 TNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
               F + +G+ GLG G++SL  Q        F++C+ +          L  G G+    
Sbjct: 294 NEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSSGT---GYLDFGPGSPAAA 350

Query: 235 GD--ATPLQFIDG--HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVK 290
           G    TP+   +G   YY+ +  I V G++L I  ++F       G ++DSGT +T L  
Sbjct: 351 GARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFTT----AGTIVDSGTVITRLPP 406

Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKL--CYHGIMSSDLKGFPTVRFHFRGGAKLALEK 348
            AY +LR      +     +       L  CY     S +   PTV   F+GGA+L ++ 
Sbjct: 407 AAYSSLRSAFASAMAARGYKKAPAVSLLDTCYDFTGMSQVA-IPTVSLLFQGGARLDVDA 465

Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
             + Y       C+     + N    +  ++G    + + V YDIG+K   F    C
Sbjct: 466 SGIMYAASVSQVCLGF---AANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 519


>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
 gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
           [Oryza sativa Japonica Group]
 gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
           [Oryza sativa Japonica Group]
 gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
 gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
 gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 463

 Score =  117 bits (293), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 102/360 (28%), Positives = 154/360 (42%), Gaps = 38/360 (10%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD--CSPQLGTIFYPSRSSSYAYVPCDS 118
           + +++ +G P V Q   +DTGS + WV C PC +  C  Q G +F P++SS+Y  V C +
Sbjct: 127 YVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPCYAQTGALFDPAKSSTYRAVSCAA 186

Query: 119 EHCRYFPYA--RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
             C         C A  + C Y   Y  G  T+   S + LT     ++   V+   FGC
Sbjct: 187 AECAQLEQQGNGCGATNYECQYGVQYGDGSTTNGTYSRDTLTLSGASDA---VKGFQFGC 243

Query: 177 -----GFSTNRNFKFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILG 227
                GFS   +    G+ GLG G  SLVSQ      +SFSYC+               G
Sbjct: 244 SHVESGFSDQTD----GLMGLGGGAQSLVSQTAAAYGNSFSYCLPPTSGSSGFLTLGGGG 299

Query: 228 DGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
             +          + I   Y   L+ I+V G+ L ++P++F       G ++DSGT +T 
Sbjct: 300 GVSGFVTTRMLRSRQIPTFYGARLQDIAVGGKQLGLSPSVFA-----AGSVVDSGTIITR 354

Query: 288 LVKEAYEALRDEVMIRLEGEQMRSYSWPDKL--CYHGIMSSDLKGFPTVRFHFRGGAKLA 345
           L   AY AL       +  +Q RS      L  C+     + +   PTV   F GGA + 
Sbjct: 355 LPPTAYSALSSAFKAGM--KQYRSAPARSILDTCFDFAGQTQIS-IPTVALVFSGGAAID 411

Query: 346 LEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           L+ + + Y       C+A      +       +IG + Q+ + V YD+G     F+   C
Sbjct: 412 LDPNGIMY-----GNCLAFAATGDDG---TTGIIGNVQQRTFEVLYDVGSSTLGFRSGAC 463


>gi|255566008|ref|XP_002523992.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536719|gb|EEF38360.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 121/421 (28%), Positives = 180/421 (42%), Gaps = 38/421 (9%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           LIHRDS  SP  N +E    R+   +  S  R+    + I   N+  A   PS  + N  
Sbjct: 41  LIHRDSPNSPLFNASETTDIRLANAVERSADRVNRFNDLIS--NSITAAEFPS-ILDNGD 97

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFY-PSRSSSYAYVPCDSE 119
           F + ISIG PP      + TGS L+W+ C   + C+      F+ P  SS+Y  VPCDS 
Sbjct: 98  FLMKISIGIPPTELLVNVATGSDLVWIPCLSFKPCTHNCDLRFFDPMESSTYKNVPCDSY 157

Query: 120 HCRYFPYARCSAYKHRCIYT---QLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
            C+    A C      C Y+   +     P+  + + T  LT  +T   +  + +  F C
Sbjct: 158 RCQITNAATCQF--SDCFYSCDPRHQDSCPDGDLAMDT--LTLNSTTGKSFMLPNTGFIC 213

Query: 177 GFSTNRNFKFSGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAII 232
           G     ++   GI GLG G  SL+++    ++  FS+CI          +KL  GD A++
Sbjct: 214 GNRIGGDYPGVGILGLGHGSLSLLNRISHLIDGKFSHCIVPYSSNQ--TSKLSFGDKAVV 271

Query: 233 DEGDA---TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLV 289
             G A   T L    G Y  TL    +      I+      D    G+ +DSGT  T+  
Sbjct: 272 -SGSAMFSTRLDMTGGPYSYTLSFYGISVGNKSISAGGIGSDYYMNGLGMDSGTMFTYFP 330

Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPD-----KLCYHGIMSSDLKGFPTVRFHFRGGAKL 344
           +  Y  L  +V   ++ E +    +PD     +LCY    S D    PT+  HF GG+ +
Sbjct: 331 EYFYSQLEYDVRYAIQQEPL----YPDPTRRLRLCYR--YSPDFSP-PTITMHFEGGS-V 382

Query: 345 ALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
            L   + F +   D  C+A   +S     V     G   Q    +GYD+     +F + D
Sbjct: 383 ELSSSNSFIRMTEDIVCLAFATSSSEQDAV----FGYWQQTNLLIGYDLDAGFLSFLKTD 438

Query: 405 C 405
           C
Sbjct: 439 C 439


>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
 gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
          Length = 382

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 109/383 (28%), Positives = 173/383 (45%), Gaps = 38/383 (9%)

Query: 30  IARLAYLQEKIRSHNT--YQAQILPSNDIS-----NSLFYVNISIGQPPVPQFTAMDTGS 82
           + R+A L  ++ S +   Y+ +   S+ +S     +  ++V I +G PP  Q+  +D+GS
Sbjct: 5   VKRVASLIHRLSSGSAAKYEVEDFGSDVVSGMNQGSGEYFVRIGLGSPPRSQYMVIDSGS 64

Query: 83  SLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLY 142
            ++WV C PC  C  Q   +F P+ S+S+  V C S  C     A C++   RC Y   Y
Sbjct: 65  DIVWVQCKPCTQCYHQTDPLFDPADSASFMGVSCSSAVCDRVENAGCNS--GRCRYEVSY 122

Query: 143 LIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRS----- 197
             G  T   ++ E LTF  T      V++V  GCG S    F  +       G S     
Sbjct: 123 GDGSYTKGTLALETLTFGRT-----VVRNVAIGCGHSNRGMFVGAAGLLGLGGGSMSFMG 177

Query: 198 SLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLEAI 254
            L  Q  ++FSYC+  +      +  L  G  A+       PL         YYI L  +
Sbjct: 178 QLSGQTGNAFSYCL--VSRGTNTNGFLEFGSEAMPVGAAWIPLVRNPRAPSFYYIRLLGL 235

Query: 255 SVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYS 313
            V    + ++ ++F+ ++ G GGV++D+GT VT     AYEA R+  + + +     S  
Sbjct: 236 GVGDTRVPVSEDVFQLNELGSGGVVMDTGTAVTRFPTVAYEAFRNAFIEQTQNLPRASGV 295

Query: 314 WPDKLCYH--GIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDA--FCMAVNPASI 369
                CY+  G +S  +   PTV F+F GG  L +  ++ F  P  DA  FC A  P+  
Sbjct: 296 SIFDTCYNLFGFLSVRV---PTVSFYFSGGPILTIPANN-FLIPVDDAGTFCFAFAPSP- 350

Query: 370 NNRYVNFSLIGMMAQQFYNVGYD 392
                  S++G + Q+   +  D
Sbjct: 351 ----SGLSILGNIQQEGIQISVD 369


>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 397

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 107/375 (28%), Positives = 164/375 (43%), Gaps = 63/375 (16%)

Query: 59  SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
           S++ + + +G PP      +DTGS L+W  C PC +C  Q   IF PS+SS+        
Sbjct: 59  SIYLMRLQLGTPPFEIVAEIDTGSDLIWTQCMPCPNCYTQFAPIFDPSKSST-------- 110

Query: 119 EHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
                F   RC  + + C Y  +Y     ++  ++TE +T ++T      + +   GCG 
Sbjct: 111 -----FKEKRC--HGNSCPYEIIYADESYSTGILATETVTIQSTSGEPFVMAETSIGCGL 163

Query: 179 STNRNF-------KFSGIFGLGIGRSSLVSQLN----SSFSYCIGS--LHDPDYLHNKLI 225
           + N N          SGI GL +G SSL+SQ++       SYC  S      ++  N ++
Sbjct: 164 N-NSNLMTPGYAASSSGIVGLNMGPSSLISQMDLPIPGLISYCFSSQGTSKINFGTNAVV 222

Query: 226 LGDGAIIDEGDATPLQFIDG---HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSG 282
            GDG +  +       FI      YY+ L+A+SV  + ++     F   D  G + IDSG
Sbjct: 223 AGDGTVAAD------MFIKKDQPFYYLNLDAVSVGDKRIETLGTPFHAQD--GNIFIDSG 274

Query: 283 TDVTWLVKEAYEALRDEVMIRLEGEQMRS--YSWPDKLCYHGIMSSDLKGFPTVRFHFRG 340
           T  T+L   +Y  L  E +        +    S  + LCY+      ++ FP +  HF G
Sbjct: 275 TTYTYL-PTSYCNLVREAVAASVVAANQVPDPSSENLLCYNW---DTMEIFPVITLHFAG 330

Query: 341 GAKLALEKDSMFYQP-RPDAFCMAVN------PASINNRYVNFSLIGMMAQQFYNVGYDI 393
           GA L L+K +M+ +      FC+A+       PA   NR  N  L          VGYD 
Sbjct: 331 GADLVLDKYNMYVETITGGTFCLAIGCVDPSMPAIFGNRAHNNLL----------VGYDS 380

Query: 394 GRKQKTFQRMDCEVL 408
                +F   +C  L
Sbjct: 381 STLVISFSPTNCSAL 395


>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
 gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
          Length = 456

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 108/393 (27%), Positives = 164/393 (41%), Gaps = 29/393 (7%)

Query: 31  ARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCY 90
           AR +   +  R+       + PS D+    + V+++IG PP P    +DTGS L+W  C 
Sbjct: 75  ARFSGKNDDQRTTPPTGVSVRPSGDLE---YVVDLAIGTPPQPVSALLDTGSDLIWTQCA 131

Query: 91  PCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSV 150
           PC  C  Q   +F P  S+SY  + C  + C    +  C      C Y   Y  G  T  
Sbjct: 132 PCASCLAQPDPLFAPGESASYEPMRCAGQLCSDILHHGCE-MPDTCTYRYNYGDGTMTMG 190

Query: 151 FVSTEQLTFKNTDESTIHVQDVVFGCG-FSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFS 208
             +TE+ TF ++    +    + FGCG  +       SGI G G    SLVSQL+   FS
Sbjct: 191 VYATERFTFTSSGGDRLMTVPLGFGCGSMNVGSLNNGSGIVGFGRNPLSLVSQLSIRRFS 250

Query: 209 YCIGSLHDPDYLHNKLILGDGAIIDEGDAT-PLQFI--------DGHYYITLEAISVDGR 259
           YC+ S        + L+ G  +    GDAT P+Q             YY+ L  ++V  R
Sbjct: 251 YCLTSYGSGR--KSTLLFGSLSGGVYGDATGPVQTTPLLQSLQNPTFYYVHLAGLTVGAR 308

Query: 260 MLDINPNIFK-RDDSGGGVMIDSGTDVTWL----VKEAYEALRDEVMIRLE--GEQMRSY 312
            L I  + F  R D  GGV++DSGT +T L    + E   A R ++ +     G      
Sbjct: 309 RLRIPESAFALRPDGSGGVIVDSGTALTLLPGAVLAEVVRAFRQQLRLPFANGGNPEDGV 368

Query: 313 SWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR 372
            +     +    S+     P + FHF+        ++ +    R    C+ +  +  +  
Sbjct: 369 CFLVPAAWRRSSSTSQVPVPRMVFHFQDADLDLPRRNYVLDDHRKGRLCLLLADSGDDG- 427

Query: 373 YVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
               S IG + QQ   V YD+  +  +F    C
Sbjct: 428 ----STIGNLVQQDMRVLYDLEAETLSFAPAQC 456


>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
 gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
          Length = 455

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 106/374 (28%), Positives = 171/374 (45%), Gaps = 40/374 (10%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           +++++ IG PP      +DTGS L W+ C PC DC  Q G  + P  SSS+  + C    
Sbjct: 90  YFMDVFIGTPPKHYSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKESSSFRNIGCHDPR 149

Query: 121 CRYF----PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDEST----IHVQDV 172
           C       P   C A    C Y   Y     T+   +TE  T   T  +       V++V
Sbjct: 150 CHLVSSPDPPLPCKAENQTCPYFYWYGDSSNTTGDFATETFTVNLTSPTGKSEFKRVENV 209

Query: 173 VFGCGFSTNRNFK-FSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILG 227
           +FGCG      F   SG+ GLG G  S  SQL S    SFSYC+   +    + +KLI G
Sbjct: 210 MFGCGHWNRGLFHGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLIFG 269

Query: 228 DG-----------AIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-G 275
           +              +  G   P   +D  YY+ +++I V G +L+I  + +     G G
Sbjct: 270 EDKDLLNHPELNFTTLVGGKENP---VDTFYYVQIKSIMVGGEVLNIPESTWNMTSDGVG 326

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQM-RSYSWPDKLCYH--GIMSSDLKGFP 332
           G ++DSGT +++  + AY+ ++D  + +++G  + + +   D  CY+  G+   DL  F 
Sbjct: 327 GTIVDSGTTLSYFTEPAYQIIKDAFVKKVKGYPIVQDFPILDP-CYNVSGVEKIDLPDFG 385

Query: 333 TVRFHFRGGAKLALEKDSMFYQPRP-DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
            +   F  GA      ++ F +  P +  C+A+    +       S+IG   QQ ++V Y
Sbjct: 386 IL---FADGAVWNFPVENYFIRLDPEEVVCLAI----LGTPRSALSIIGNYQQQNFHVLY 438

Query: 392 DIGRKQKTFQRMDC 405
           D  + +  +  M+C
Sbjct: 439 DTKKSRLGYAPMNC 452


>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
 gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
          Length = 471

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 106/359 (29%), Positives = 161/359 (44%), Gaps = 33/359 (9%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DCSPQLGTIFYPSRSSSYAYVPCDSE 119
           + V + +G P        DTGS L W  C PC   C PQ    F P++S+SY  + C SE
Sbjct: 132 YAVTVGLGTPKKDFSLLFDTGSDLTWTQCEPCSGGCFPQNDEKFDPTKSTSYKNLSCSSE 191

Query: 120 HCRYF--PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
            C+      A+  +  + C+Y   Y  G  T  F++TE LT   +D      ++ V GCG
Sbjct: 192 PCKSIGKESAQGCSSSNSCLYGVKYGTG-YTVGFLATETLTITPSDV----FENFVIGCG 246

Query: 178 FSTNRN-FKFSGIFG-LGIGRS--SLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDG 229
               RN  +FSG  G LG+GRS  +L SQ +S+    FSYC   L         L  G G
Sbjct: 247 ---ERNGGRFSGTAGLLGLGRSPVALPSQTSSTYKNLFSYC---LPASSSSTGHLSFG-G 299

Query: 230 AIIDEGDATPLQF-IDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
            +      TP+   I   Y + +  ISV GR L I+P++F+      G +IDSGT +T+L
Sbjct: 300 GVSQAAKFTPITSKIPELYGLDVSGISVGGRKLPIDPSVFRT----AGTIIDSGTTLTYL 355

Query: 289 VKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYH-GIMSSDLKGFPTVRFHFRGGAKLALE 347
              A+ AL       +    +   +   + CY     ++D    P +   F GG ++ ++
Sbjct: 356 PSTAHSALSSAFQEMMTNYTLTKGTSGLQPCYDFSKHANDNITIPQISIFFEGGVEVDID 415

Query: 348 KDSMFYQPRP-DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
              +F      +  C+A      N    + ++ G + Q+ Y V YD+ +    F    C
Sbjct: 416 DSGIFIAANGLEEVCLAFKD---NGNDTDVAIFGNVQQKTYEVVYDVAKGMVGFAPGGC 471


>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 475

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 100/373 (26%), Positives = 167/373 (44%), Gaps = 41/373 (10%)

Query: 18  PAHRVQRGINISIARLAYLQEKIRSHNTYQ--AQILPSNDISNSLFYVNISIGQPPVPQF 75
           P  R +R +N   A  A  + +I S          LP+      L++  + +G PP   +
Sbjct: 28  PVERRKRSLNAVKAHDARRRGRILSAVDLNLGGNGLPTE---TGLYFTKLGLGSPPKDYY 84

Query: 76  TAMDTGSSLLWVHCYPCRDC--SPQLG---TIFYPSRSSSYAYVPCDSEHCRYF---PYA 127
             +DTGS +LWV+C  C  C     LG   T++ P  S +   + CD E C      P  
Sbjct: 85  VQVDTGSDILWVNCVKCSRCPRKSDLGIDLTLYDPKGSETSELISCDQEFCSATYDGPIP 144

Query: 128 RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDE---STIHVQDVVFGCG------F 178
            C + +  C Y+  Y  G  T+ +   + LT+ + ++   +      ++FGCG       
Sbjct: 145 GCKS-EIPCPYSITYGDGSATTGYYVQDYLTYNHVNDNLRTAPQNSSIIFGCGAVQSGTL 203

Query: 179 STNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAII 232
           S++      GI G G   SS++SQL +S      FS+C+      D +    I   G ++
Sbjct: 204 SSSSEEALDGIIGFGQSNSSVLSQLAASGKVKKIFSHCL------DNIRGGGIFAIGEVV 257

Query: 233 D-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKE 291
           + +   TPL     HY + L++I VD  +L +  +IF   + G G +IDSGT + +L   
Sbjct: 258 EPKVSTTPLVPRMAHYNVVLKSIEVDTDILQLPSDIFDSGN-GKGTIIDSGTTLAYLPAI 316

Query: 292 AYEALRDEVMIRLEGEQMRSYSWPDKL-CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS 350
            Y+ L  +VM R    +++ Y    +  C+    + D +GFP V+ HF     L +    
Sbjct: 317 VYDELIPKVMAR--QPRLKLYLVEQQFSCFQYTGNVD-RGFPVVKLHFEDSLSLTVYPHD 373

Query: 351 MFYQPRPDAFCMA 363
             +Q +   +C+ 
Sbjct: 374 YLFQFKDGIWCIG 386


>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
          Length = 485

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 107/360 (29%), Positives = 147/360 (40%), Gaps = 37/360 (10%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + V++ +G P        DTGS L WV C PC DC  Q   +F PS SS+YA V C +  
Sbjct: 149 YVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPCADCYEQQDPLFDPSLSSTYAAVACGAPE 208

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C+    + CS+   RC Y   Y    +T   +  + LT   +D     +   VFGCG   
Sbjct: 209 CQELDASGCSS-DSRCRYEVQYGDQSQTDGNLVRDTLTLSASDT----LPGFVFGCGDQN 263

Query: 181 NRNF-KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
              F +  G+FGLG  + SL SQ   S    F+YC+     P     +  L  G      
Sbjct: 264 AGLFGQVDGLFGLGREKVSLPSQGAPSYGPGFTYCL-----PSSSSGRGYLSLGG-APPA 317

Query: 236 DATPLQFIDGH----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKE 291
           +A      DG     YYI L  I V GR + I    F         +IDSGT +T L   
Sbjct: 318 NAQFTALADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGG---TVIDSGTVITRLPPR 374

Query: 292 AYEALRDEVMIRLEGEQMRSYSWPDKL-----CYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
           AY  LR           M  Y     L     CY           PTV   F GGA ++L
Sbjct: 375 AYAPLRAAFA-----RSMAQYKKAPALSILDTCYD-FTGHRTAQIPTVELAFAGGATVSL 428

Query: 347 EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
           +   + Y  +    C+A  P   N    + +++G   Q+ + V YD+  ++  F    C 
Sbjct: 429 DFTGVLYVSKVSQACLAFAP---NADDSSIAILGNTQQKTFAVTYDVANQRIGFGAKGCS 485


>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
          Length = 420

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 105/397 (26%), Positives = 174/397 (43%), Gaps = 53/397 (13%)

Query: 32  RLAYLQEKIRS------HNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLL 85
           R+A+L +   +      +++   Q L  N +    + +NIS+G P +      DTGS L+
Sbjct: 53  RIAFLSDATAAGKATTTNSSVSFQALLENGVGG--YNMNISVGTPLLTFSVVADTGSDLI 110

Query: 86  WVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIG 145
           W  C PC  C  Q    F P+ SS+++ +PC S  C++ P +  +     C+Y   Y  G
Sbjct: 111 WTQCAPCTKCFQQPAPPFQPASSSTFSKLPCTSSFCQFLPNSIRTCNATGCVYNYKYGSG 170

Query: 146 PETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLNS 205
             T+ +++TE  T K  D S      V FGC  ST       G   LG+GR         
Sbjct: 171 -YTAGYLATE--TLKVGDAS---FPSVAFGC--STENGL---GQLDLGVGR--------- 210

Query: 206 SFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFID------GHYYITLEAISVDGR 259
            FSYC+ S        + ++ G  A + +G+     F++       +YY+ L  I+V   
Sbjct: 211 -FSYCLRSGSAAG--ASPILFGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGET 267

Query: 260 MLDINPNI--FKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK 317
            L +  +   F ++  GGG ++DSGT +T+L K+ YE ++   + +       + +    
Sbjct: 268 DLPVTTSTFGFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTADVTTVNGTRGLD 327

Query: 318 LCYHGIMSSDLK-GFPTVRFHFRGGAKLA-------LEKDSMFYQPRPDAFCMAVNPASI 369
           LC+            P++   F GGA+ A       +E DS   Q      C+ + PA  
Sbjct: 328 LCFKSTGGGGGGIAVPSLVLRFDGGAEYAVPTYFAGVETDS---QGSVTVACLMMLPAKG 384

Query: 370 NNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
           +      S+IG + Q   ++ YD+     +F   DC 
Sbjct: 385 DQP---MSVIGNVMQMDMHLLYDLDGGIFSFAPADCA 418


>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
 gi|194692214|gb|ACF80191.1| unknown [Zea mays]
 gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
          Length = 441

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 102/360 (28%), Positives = 155/360 (43%), Gaps = 26/360 (7%)

Query: 61  FYVNISIGQPPVPQFTAM-DTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSE 119
           ++V + +G P   +FT + DTGS L WV C      SP  G +F P  S S+A VPC S+
Sbjct: 91  YFVKVLVGTP-AQEFTLVADTGSELTWVKC--AGGASPP-GLVFRPEASKSWAPVPCSSD 146

Query: 120 HCRY---FPYARCSAYKHRCIYTQLYLIGPETSV-FVSTEQLTFKNTDESTIHVQDVVFG 175
            C+    F  A CS+    C Y   Y  G   ++  V T+  T          +QDVV G
Sbjct: 147 TCKLDVPFSLANCSSSASPCSYDYRYKEGSAGALGVVGTDSATIALPGGKVAQLQDVVLG 206

Query: 176 CGFSTN-RNFK-FSGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDG 229
           C  + + ++FK   G+  LG  + S  S+       SFSYC+     P      L  G G
Sbjct: 207 CSSTHDGQSFKSVDGVLSLGNAKISFASRAAARFGGSFSYCLVDHLAPRNATGYLAFGPG 266

Query: 230 AIIDEGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVT 286
            +          F+D     Y + ++A+ V G+ LDI   ++  D   GGV++DSGT +T
Sbjct: 267 QVPRTPATQTKLFLDPAMPFYGVKVDAVHVAGQALDIPAEVW--DPKSGGVILDSGTTLT 324

Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMS-SDLKGFPTVRFHFRGGAKLA 345
            L   AY+A+   +   L G     +  P + CY+           P +   F G A+L 
Sbjct: 325 VLATPAYKAVVAALTKLLAGVPKVDFP-PFEHCYNWTAPRPGAPEIPKLAVQFTGCARLE 383

Query: 346 LEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
               S     +P   C+ +        +   S+IG + QQ +   +D+   +  F    C
Sbjct: 384 PPAKSYVIDVKPGVKCIGLQ----EGEWPGVSVIGNIMQQEHLWEFDLKNMEVRFMPSTC 439


>gi|356542694|ref|XP_003539801.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 489

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 99/319 (31%), Positives = 149/319 (46%), Gaps = 32/319 (10%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTI------FYPSRSSSYAY 113
           L+Y  + +G PP   +  +DTGS +LWV C  C  C PQ   +      F P  SS+ + 
Sbjct: 76  LYYTKVKLGTPPREFYVQIDTGSDVLWVSCGSCNGC-PQTSGLQIQLNYFDPRSSSTSSL 134

Query: 114 VPCDSEHCR---YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
           + C    CR       A CS+  ++C YT  Y  G  TS +  ++ + F    E T+   
Sbjct: 135 ISCSDRRCRSGVQTSDASCSSQNNQCTYTFQYGDGSGTSGYYVSDLMHFAGIFEGTLTTN 194

Query: 171 ---DVVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLN------SSFSYCIGSLHD 216
               VVFGC     G  T       GIFG G    S++SQL+        FS+C   L  
Sbjct: 195 SSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSLQGIAPRVFSHC---LKG 251

Query: 217 PDYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
            +     L+LG+  I++     +PL     HY + L++ISV+G+++ I P +F   ++  
Sbjct: 252 DNSGGGVLVLGE--IVEPNIVYSPLVQSQPHYNLNLQSISVNGQIVPIAPAVFATSNN-R 308

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
           G ++DSGT + +L +EAY    + +   L  + +RS       CY    SS++  FP V 
Sbjct: 309 GTIVDSGTTLAYLAEEAYNPFVNAIT-ALVPQSVRSVLSRGNQCYLITTSSNVDIFPQVS 367

Query: 336 FHFRGGAKLALEKDSMFYQ 354
            +F GGA L L       Q
Sbjct: 368 LNFAGGASLVLRPQDYLMQ 386


>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
 gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
          Length = 505

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 101/367 (27%), Positives = 152/367 (41%), Gaps = 44/367 (11%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSYAYVPCDSE 119
           F V +  G P      ++DTGS + W+ C PC   C  Q   +F P++S++Y+ VPC   
Sbjct: 161 FVVTVGFGSPAQNYTLSIDTGSDVSWIQCLPCSGHCYKQHDPVFDPTKSATYSAVPCGHP 220

Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
            C      +CS     C+Y   Y  G  T+  +S E L+  +T +    +    FGCG +
Sbjct: 221 QCAA-AGGKCSN-SGTCLYKVTYGDGSSTAGVLSHETLSLSSTRD----LPGFAFGCGQT 274

Query: 180 TNRNFKFSGIFGLGI-GRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
               F           G  SL SQ      ++FSYC+ S    D  H  L +G       
Sbjct: 275 NLGEFGGVDGLVGLGRGALSLPSQAAATFGATFSYCLPSY---DTTHGYLTMGSTTPAAS 331

Query: 235 GDATPLQFI--------DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVT 286
            D   +Q+            Y++ + +I + G +L + P +F RD    G + DSGT +T
Sbjct: 332 NDDDDVQYTAMIQKEDYPSLYFVEVVSIDIGGYILPVPPTVFTRD----GTLFDSGTILT 387

Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG-----FPTVRFHFRGG 341
           +L  EAY +LRD     +   +      P   CY      D  G      P V F F  G
Sbjct: 388 YLPPEAYASLRDRFKFTMTQYKPAPAYDPFDTCY------DFTGHNAIFMPAVAFKFSDG 441

Query: 342 AKLALEKDSMFYQP---RPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQK 398
           A   L   ++   P    P   C+A  P       + F++IG   Q+   V YD+  ++ 
Sbjct: 442 AVFDLSPVAILIYPDDTAPATGCLAFVP---RPSTMPFNIIGNTQQRGTEVIYDVAAEKI 498

Query: 399 TFQRMDC 405
            F +  C
Sbjct: 499 GFGQFTC 505


>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
 gi|238011188|gb|ACR36629.1| unknown [Zea mays]
          Length = 342

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 116/356 (32%), Positives = 160/356 (44%), Gaps = 44/356 (12%)

Query: 78  MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCI 137
           +DTGS ++WV C PCR C  Q G +F P RSSSY  V C +  CR      C   +  C+
Sbjct: 3   LDTGSDVVWVQCAPCRRCYEQSGPVFDPRRSSSYGAVGCGAALCRRLDSGGCDLRRGACM 62

Query: 138 YTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGR 196
           Y   Y  G  T+    TE LTF         V  V  GCG      F   +G+ GLG G 
Sbjct: 63  YQVAYGDGSVTAGDFVTETLTFAG----GARVARVALGCGHDNEGLFVAAAGLLGLGRGG 118

Query: 197 SSLVSQLN----SSFSYCI------GSLHDP-DYLHNKLILGDGAI-IDEGDATPL---Q 241
            S  +Q++     SFSYC+      G+   P  +  + +  G G++       TP+    
Sbjct: 119 LSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFGAGSVGASSASFTPMVRNP 178

Query: 242 FIDGHYYITLEAISVDG-RMLDINPNIFKRDDSG--GGVMIDSGTDVTWLVKEAYEALRD 298
            ++  YY+ L  ISV G R+  +  +  + D S   GGV++DSGT VT L + +Y ALRD
Sbjct: 179 RMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLARASYSALRD 238

Query: 299 EVMIRLEGEQMRS---YSWPDKLCYHGIMSSDLKG-----FPTVRFHFRGGAKLALEKDS 350
                  G    S   +S  D  CY      DL G      PTV  HF GGA+ AL  ++
Sbjct: 239 AFRAAAAGGLRLSPGGFSLFDT-CY------DLGGRRVVKVPTVSMHFAGGAEAALPPEN 291

Query: 351 -MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
            +        FC A   A  +      S+IG + QQ + V +D   ++  F    C
Sbjct: 292 YLIPVDSRGTFCFAF--AGTDG---GVSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 342


>gi|357133002|ref|XP_003568117.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 497

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 102/371 (27%), Positives = 165/371 (44%), Gaps = 37/371 (9%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYVP 115
           +Y  I IG PP P    +DTGS +LWV+C  C  C  + G      ++ P  SSS + V 
Sbjct: 87  YYTKIEIGTPPKPFHVQVDTGSDILWVNCVSCDKCPTKSGLGIDLALYDPKGSSSGSAVS 146

Query: 116 CDSEHCRYF-----PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFK--NTDESTIH 168
           CD++ C            C+A K  C Y   Y  G  T+    ++ L +   + +  T H
Sbjct: 147 CDNKFCAATYGSGEKLPGCTAGK-PCEYRAEYGDGSSTAGSFVSDSLQYNQLSGNAQTRH 205

Query: 169 VQ-DVVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHD 216
            + +V+FGCG        + N    GI G G   +S +SQL S+      FS+C+     
Sbjct: 206 AKANVIFGCGAQQGGDLESTNQALDGIIGFGQSNTSTLSQLASAGEVKKIFSHCL----- 260

Query: 217 PDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
            D +    I   G ++  +  +TPL     HY + L++I V G  L + P+IF+  +   
Sbjct: 261 -DTIKGGGIFAIGEVVQPKVKSTPLLPNMSHYNVNLQSIDVAGNALQLPPHIFETSEK-R 318

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
           G +IDSGT +T+L +  Y+ +   V  + +    R+      LC+    S D  GFP + 
Sbjct: 319 GTIIDSGTTLTYLPELVYKDILAAVFQKHQDITFRTIQ--GFLCFEYSESVD-DGFPKIT 375

Query: 336 FHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYV-NFSLIGMMAQQFYNVGYDIG 394
           FHF     L +     F+Q   + +C+         +   +  L+G +      V YD+ 
Sbjct: 376 FHFEDDLGLNVYPHDYFFQNGDNLYCLGFQNGGFQPKDAKDMVLLGDLVLSNKVVVYDLE 435

Query: 395 RKQKTFQRMDC 405
           ++   +   +C
Sbjct: 436 KQVIGWTDYNC 446


>gi|242065058|ref|XP_002453818.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
 gi|241933649|gb|EES06794.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
          Length = 490

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 102/376 (27%), Positives = 160/376 (42%), Gaps = 39/376 (10%)

Query: 57  SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSY 111
           +  L+Y  I IG P    +  +DTGS +LWV+C  C  C    G     T + P+ S + 
Sbjct: 81  ATGLYYTQIEIGSPSKGYYVQVDTGSDILWVNCIRCDGCPTTSGLGIELTQYDPAGSGT- 139

Query: 112 AYVPCDSEHC-----RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD--- 163
             V CD E C        P A C +    C +   Y  G  T+ F  ++ + +       
Sbjct: 140 -TVGCDQEFCVANSPNGLPPA-CPSTSSPCQFRIAYGDGSSTTGFYVSDSVQYNQVSGNG 197

Query: 164 ESTIHVQDVVFGCGFSTNRNFKFS-----GIFGLGIGRSSLVSQLNSS------FSYCIG 212
           ++T     + FGCG     +   S     GI G G   SS++SQL ++      F++C+ 
Sbjct: 198 QTTPSNASITFGCGAQLGGDLGSSSQALDGILGFGQADSSMLSQLAAARKVRKIFAHCL- 256

Query: 213 SLHDPDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRD 271
                D +H   I   G ++  +   TPL     HY + L+ ISV G  L +  + F   
Sbjct: 257 -----DTVHGGGIFAIGNVVQPKVKTTPLVQNVTHYNVNLQGISVGGATLQLPSSTFDSG 311

Query: 272 DSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF 331
           DS  G +IDSGT + +L +E Y  L   V  + +   + +Y   D +C+    S D  GF
Sbjct: 312 DS-KGTIIDSGTTLAYLPREVYRTLLTAVFDKYQDLALHNYQ--DFVCFQFSGSID-DGF 367

Query: 332 PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR-YVNFSLIGMMAQQFYNVG 390
           P V F F G   L +      +Q   D +CM      +  +   +  L+G +      V 
Sbjct: 368 PVVTFSFEGEITLNVYPHDYLFQNENDLYCMGFLDGGVQTKDGKDMVLLGDLVLSNKLVV 427

Query: 391 YDIGRKQKTFQRMDCE 406
           YD+ ++   +   +C 
Sbjct: 428 YDLEKQVIGWADYNCS 443


>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
          Length = 634

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 106/395 (26%), Positives = 173/395 (43%), Gaps = 55/395 (13%)

Query: 38  EKIRSHNTYQAQILPSNDISNSLFYVN-ISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCS 96
           ++  SH+T  A++   +D+    +Y   I IG PP      +DTGS+L +V C  C  C 
Sbjct: 68  QRSESHSTATARMPLYDDLIPYGYYTTRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCG 127

Query: 97  PQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQ 156
                 F P  SS+Y  + C  E         C +    C+Y + Y     +S  +  + 
Sbjct: 128 KHQDPNFQPDWSSTYQPLKCSME-------CTCDSEMMHCVYDRQYAEMSSSSGVLGEDI 180

Query: 157 LTFKNTDESTIHVQDVVFGCGFSTNRNF---KFSGIFGLGIGRSSLVSQL------NSSF 207
           ++F    +S +  Q  VFGC      +    +  GI GLG G  S+V QL       +SF
Sbjct: 181 VSFGK--QSELKPQRTVFGCENVETGDIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSF 238

Query: 208 SYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH--------YYITLEAISVDGR 259
           S C G +           +G GA++  G + P   +  H        Y I L+ I + G+
Sbjct: 239 SLCYGGMD----------VGGGAMVLGGISPPAGMVFTHSDPARSAYYNIDLKEIHIAGK 288

Query: 260 MLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK-- 317
            L INP +F   D   G ++DSGT   +L + A++A +D +M  L    ++    PD+  
Sbjct: 289 QLPINPMVF---DGKYGTILDSGTTYAYLPEPAFKAFKDAIMKEL--NSLKLIQGPDRNY 343

Query: 318 --LCYHGI---MSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP--DAFCMAVNPASIN 370
             +C+ G+   +S   K FP V   F  G +L+L  ++  +Q      A+C+ +      
Sbjct: 344 NDICFSGVGSDVSQLSKTFPAVDLVFSNGNRLSLSPENYLFQHSKAHGAYCLGI----FQ 399

Query: 371 NRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           N     +L+G +  +   V YD    +  F + +C
Sbjct: 400 NENDQTTLLGGIIVRNTLVMYDREHLKIGFWKTNC 434


>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 490

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 110/362 (30%), Positives = 151/362 (41%), Gaps = 49/362 (13%)

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYV 114
           I +  ++V + +G P        DTGS L W  C PC R C  Q   IF PS+S+SY+ +
Sbjct: 141 IGSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDVIFDPSKSTSYSNI 200

Query: 115 PCDSEHCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHV 169
            C S  C     A      CSA    CIY   Y     +  + S E+LT   TD     V
Sbjct: 201 TCTSALCTQLSTATGNDPGCSASTKACIYGIQYGDSSFSVGYFSRERLTVTATDV----V 256

Query: 170 QDVVFGCGFSTNRNFKFS-GIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKL 224
            + +FGCG +    F  S G+ GLG    S V Q  +     FSYC+ S          L
Sbjct: 257 DNFLFGCGQNNQGLFGGSAGLIGLGRHPISFVQQTAAKYRKIFSYCLPSTSSST---GHL 313

Query: 225 ILGDGAIIDEGDATPLQFI---DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDS 281
             G  A       TP   I      Y + + AI+V G  L ++ + F    S GG +IDS
Sbjct: 314 SFGPAATGRYLKYTPFSTISRGSSFYGLDITAIAVGGVKLPVSSSTF----STGGAIIDS 369

Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL-----CYHGIMSSDLKGF----- 331
           GT +T L   AY ALR         + M  Y    +L     CY      DL G+     
Sbjct: 370 GTVITRLPPTAYGALRSAFR-----QGMSKYPSAGELSILDTCY------DLSGYKVFSI 418

Query: 332 PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
           PT+ F F GG  + L    + +       C+A    + N    + ++ G + Q+   V Y
Sbjct: 419 PTIEFSFAGGVTVKLPPQGILFVASTKQVCLAF---AANGDDSDVTIYGNVQQRTIEVVY 475

Query: 392 DI 393
           D+
Sbjct: 476 DV 477


>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
          Length = 633

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 106/395 (26%), Positives = 173/395 (43%), Gaps = 55/395 (13%)

Query: 38  EKIRSHNTYQAQILPSNDISNSLFYVN-ISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCS 96
           ++  SH+T  A++   +D+    +Y   I IG PP      +DTGS+L +V C  C  C 
Sbjct: 68  QRSESHSTATARMPLYDDLIPYGYYTTRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCG 127

Query: 97  PQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQ 156
                 F P  SS+Y  + C  E         C +    C+Y + Y     +S  +  + 
Sbjct: 128 KHQDPNFQPDWSSTYQPLKCSME-------CTCDSEMMHCVYDRQYAEMSSSSGVLGEDI 180

Query: 157 LTFKNTDESTIHVQDVVFGCGFSTNRNF---KFSGIFGLGIGRSSLVSQL------NSSF 207
           ++F    +S +  Q  VFGC      +    +  GI GLG G  S+V QL       +SF
Sbjct: 181 VSFGK--QSELKPQRTVFGCENVETGDIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSF 238

Query: 208 SYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH--------YYITLEAISVDGR 259
           S C G +           +G GA++  G + P   +  H        Y I L+ I + G+
Sbjct: 239 SLCYGGMD----------VGGGAMVLGGISPPAGMVFTHSDPARSAYYNIDLKEIHIAGK 288

Query: 260 MLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK-- 317
            L INP +F   D   G ++DSGT   +L + A++A +D +M  L    ++    PD+  
Sbjct: 289 QLPINPMVF---DGKYGTILDSGTTYAYLPEPAFKAFKDAIMKEL--NSLKLIQGPDRNY 343

Query: 318 --LCYHGI---MSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP--DAFCMAVNPASIN 370
             +C+ G+   +S   K FP V   F  G +L+L  ++  +Q      A+C+ +      
Sbjct: 344 NDICFSGVGSDVSQLSKTFPAVDLVFSNGNRLSLSPENYLFQHSKAHGAYCLGI----FQ 399

Query: 371 NRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           N     +L+G +  +   V YD    +  F + +C
Sbjct: 400 NENDQTTLLGGIIVRNTLVMYDREHLKIGFWKTNC 434


>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
          Length = 482

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 115/374 (30%), Positives = 158/374 (42%), Gaps = 45/374 (12%)

Query: 61  FYVNISIGQPPVPQFTAM-DTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSYAYVPCDS 118
           + V I IG P    FT + DTGS L WV C PC D C  Q   +F PS+SS+Y  VPC +
Sbjct: 126 YVVTIGIGTP-ARNFTVLFDTGSDLTWVQCKPCTDSCYQQQEPLFDPSKSSTYVDVPCGT 184

Query: 119 EHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
             C+       +     C Y+  Y     T   ++ E  T      S      VVFGC  
Sbjct: 185 PQCKIGGGQDLTCGGTTCEYSVKYGDQSVTRGNLAQEAFTLS---PSAPPAAGVVFGCSH 241

Query: 179 STNRNFK-------FSGIFGLGIGRSSLVSQL---NSS--FSYCIGSLHDPDYLHNKLIL 226
             +   K        +G+ GLG G SS++SQ    NS   FSYC   L         L +
Sbjct: 242 EYSSGVKGAEEEMSVAGLLGLGRGDSSILSQTRRGNSGDVFSYC---LPPRGSSAGYLTI 298

Query: 227 GDGAIIDEGDA-TPL----QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDS 281
           G  A      + TPL      +   Y + L  ISV G  L I+ + F       G +IDS
Sbjct: 299 GAAAPPQSNLSFTPLVTDNSQLSSVYVVNLVGISVSGAALPIDASAFYI-----GTVIDS 353

Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL--CYHGIMSSDLKGFPTVRFHFR 339
           GT +T +   AY  LRDE    + G  M      + L  CY  +   D+   P V   F 
Sbjct: 354 GTVITHMPAAAYYVLRDEFRRHMGGYTMLPEGHVESLDTCYD-VTGHDVVTAPPVALEFG 412

Query: 340 GGAKLALEKDSMFYQPRPDA-------FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
           GGA++ ++   +      DA        C+A  P ++      F +IG M Q+ YNV +D
Sbjct: 413 GGARIDVDASGILLVFAVDASGQSLTLACLAFVPTNLP----GFVIIGNMQQRAYNVVFD 468

Query: 393 IGRKQKTFQRMDCE 406
           +  ++  F    C 
Sbjct: 469 VEGRRIGFGANGCS 482


>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
 gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
          Length = 466

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 110/398 (27%), Positives = 162/398 (40%), Gaps = 45/398 (11%)

Query: 32  RLAYLQEKI----------------RSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQF 75
           R AY+Q K                 +SH T    +  S D     + + + +G P   Q 
Sbjct: 90  RAAYIQRKFSGGGVNGSRGGAGDVQQSHATVPTTLGTSLDTLE--YLITVRLGSPGKSQT 147

Query: 76  TAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHR 135
             +DTGS + WV C PC  C  Q   +F PS SS+Y+   C S  C             +
Sbjct: 148 MLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCSSAACAQLGQEGNGCSSSQ 207

Query: 136 CIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG-FSTNRNFKFSGIFGLGI 194
           C YT  Y  G  T+   S++ L   +       V+   FGC    +  N +  G+ GLG 
Sbjct: 208 CQYTVTYGDGSSTTGTYSSDTLALGSN-----AVRKFQFGCSNVESGFNDQTDGLMGLGG 262

Query: 195 GRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPL---QFIDGHY 247
           G  SLVSQ      ++FSYC+     P    +   L  GA       TP+     +   Y
Sbjct: 263 GAQSLVSQTAGTFGAAFSYCL-----PATSSSSGFLTLGAGTSGFVKTPMLRSSQVPTFY 317

Query: 248 YITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGE 307
            + ++AI V GR L I  ++F       G ++DSGT +T L   AY AL       ++  
Sbjct: 318 GVRIQAIRVGGRQLSIPTSVFS-----AGTIMDSGTVLTRLPPTAYSALSSAFKAGMKQY 372

Query: 308 QMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPA 367
                S     C+     S +   PTV   F GGA + +  D +  Q      C+A    
Sbjct: 373 PSAPPSGILDTCFDFSGQSSVS-IPTVALVFSGGAVVDIASDGIMLQTSNSILCLAF--- 428

Query: 368 SINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           + N+   +  +IG + Q+ + V YD+G     F+   C
Sbjct: 429 AANSDDSSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 466


>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 523

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 114/374 (30%), Positives = 171/374 (45%), Gaps = 25/374 (6%)

Query: 42  SHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC---RDCSPQ 98
           S N+  A +          ++  I +GQP    F   DTGS + W+ C PC     C  Q
Sbjct: 165 STNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQ 224

Query: 99  LGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLT 158
           +G IF P  SSSY+ + CDSE C     A C A  + CIY   Y  G  T   ++TE  +
Sbjct: 225 IGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDA--NSCIYEVEYGDGSFTVGELATETFS 282

Query: 159 FKNTDESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHD 216
           F++++     + ++  GCG      F    G+ GLG G  SL SQL  +SFSYC   L D
Sbjct: 283 FRHSNS----IPNLPIGCGHDNEGLFVGADGLIGLGGGAISLSSQLEATSFSYC---LVD 335

Query: 217 PDYLHNKLILGDGAIIDEGDATPLQFID---GHYYITLEAISVDGRMLDINPNIFKRDDS 273
            D   +  +  +     +   +PL   D      Y+ +  +SV G+ L I+ + F+ D+S
Sbjct: 336 LDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDES 395

Query: 274 G-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFP 332
           G GG+++DSGT +T +  + Y+ LRD  +   +         P   CY     S+++  P
Sbjct: 396 GSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVE-VP 454

Query: 333 TVRFHFRGGAKLAL-EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
           T+ F   G   L L  K+ +        FC+A  P++        S+IG + QQ   V Y
Sbjct: 455 TIAFILPGENSLQLPAKNCLIQVDSAGTFCLAFLPSTF-----PLSIIGNVQQQGIRVSY 509

Query: 392 DIGRKQKTFQRMDC 405
           D+      F    C
Sbjct: 510 DLANSLVGFSTDKC 523


>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 506

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 106/370 (28%), Positives = 162/370 (43%), Gaps = 30/370 (8%)

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
           + +  + V++ +G PP      MDTGS L W+ C PC DC  Q G IF P+ S SY  V 
Sbjct: 144 VGSGEYLVDVYLGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQSGPIFDPAASISYRNVT 203

Query: 116 CDSEHCRYF-PYARCSAYKHR------CIYTQLYLIGPETSVFVSTEQLTFKNTDESTIH 168
           C  + CR   P A  +  + R      C Y   Y     T+  ++ E  T   T   T  
Sbjct: 204 CGDDRCRLVSPPAESAPRECRRPRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTQSGTRR 263

Query: 169 VQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS-----SFSYCIGSLHDPDYLHN 222
           V  V FGCG      F   +G+ GLG G  S  SQL       +FSYC+  +       +
Sbjct: 264 VDGVAFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRGVYGGHAFSYCL--VEHGSAAGS 321

Query: 223 KLILG-DGAIIDEGDA-----TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
           K+I G D A++           P    D  YY+ L++I V G  ++I+ +      S GG
Sbjct: 322 KIIFGHDDALLAHPQLNYTAFAPTTDADTFYYLQLKSILVGGEAVNISSDTL----SAGG 377

Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRF 336
            +IDSGT +++  + AY+A+R   + R+         +P     + +  ++    P +  
Sbjct: 378 TIIDSGTTLSYFPEPAYQAIRQAFIDRMSPSYPLILGFPVLSPCYNVSGAEKVEVPELSL 437

Query: 337 HFRGGAKLALEKDSMFYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGR 395
            F  GA      ++ F +  P+   C+AV    +       S+IG   QQ ++V YD+  
Sbjct: 438 VFADGAAWEFPAENYFIRLEPEGIMCLAV----LGTPRSGMSIIGNYQQQNFHVLYDLEH 493

Query: 396 KQKTFQRMDC 405
            +  F    C
Sbjct: 494 NRLGFAPRRC 503


>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 425

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 101/355 (28%), Positives = 158/355 (44%), Gaps = 28/355 (7%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + V  +IG P      A+DT +   W+ C  C  CS  +  +F PS+SSS   + C++  
Sbjct: 88  YIVRANIGTPAQAMLVALDTSNDAAWIPCSGCVGCSSSV--LFDPSKSSSSRTLQCEAPQ 145

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC-GFS 179
           C+  P   C+  K  C +   Y  G     +++ + LT      +T  + +  FGC   +
Sbjct: 146 CKQAPNPSCTVSKS-CGFNMTYG-GSAIEAYLTQDTLTL-----ATDVIPNYTFGCINKA 198

Query: 180 TNRNFKFSGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
           +  +    G+ GLG G  SL+SQ      S+FSYC+ +    ++    L LG        
Sbjct: 199 SGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNF-SGSLRLGPKNQPIRI 257

Query: 236 DATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRD-DSGGGVMIDSGTDVTWLVKE 291
             TPL         YY+ L  I V  +++DI  +    D  +G G + DSGT  T LV+ 
Sbjct: 258 KTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGTVYTRLVEP 317

Query: 292 AYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSM 351
           AY A+R+E   R++     S    D  CY G +      FP+V F F  G  + L  D++
Sbjct: 318 AYVAMRNEFRRRVKNANATSLGGFDT-CYSGSVV-----FPSVTFMF-AGMNVTLPPDNL 370

Query: 352 F-YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
             +    +  C+A+  A  N   V  ++I  M QQ + V  D+   +    R  C
Sbjct: 371 LIHSSAGNLSCLAMAAAPTNVNSV-LNVIASMQQQNHRVLIDVPNSRLGISRETC 424


>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 509

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 94/322 (29%), Positives = 150/322 (46%), Gaps = 36/322 (11%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
           L++  + +G P    F  +DTGS +LWV C PC  C    G       F P  SS+ + +
Sbjct: 90  LYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRI 149

Query: 115 PCDSEHCRY---FPYARCSAYKHR---CIYTQLYLIGPETSVFVSTEQLTFKNT---DES 165
            C  + C        A C     +   C YT  Y  G  TS +  ++ + F+     +++
Sbjct: 150 TCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQT 209

Query: 166 TIHVQDVVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNS------SFSYCIGSL 214
                 +VFGC  S     T  +    GIFG G  + S++SQLNS       FS+C   L
Sbjct: 210 ANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHC---L 266

Query: 215 HDPDYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDS 273
              D     L+LG+  I++ G   TPL     HY + LE+I+V+G+ L I+ ++F   ++
Sbjct: 267 KGSDNGGGILVLGE--IVEPGLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNT 324

Query: 274 GGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDL-KGFP 332
             G ++DSGT + +L   AY+     +   +    +RS       C+  I SS +   FP
Sbjct: 325 -QGTIVDSGTTLAYLADGAYDPFVSAIAAAVS-PSVRSLVSKGSQCF--ITSSSVDSSFP 380

Query: 333 TVRFHFRGGAKLALEKDSMFYQ 354
           TV  +F GG  ++++ ++   Q
Sbjct: 381 TVTLYFMGGVAMSVKPENYLLQ 402


>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
 gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
          Length = 493

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 118/394 (29%), Positives = 166/394 (42%), Gaps = 41/394 (10%)

Query: 40  IRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQL 99
           + S   + A ++     ++  +   I++G P V    AMDTGS + W+ C PCR C PQ 
Sbjct: 113 LSSGGAFVAPVVSRAPTTSGEYMAKIAVGTPAVEALLAMDTGSDITWLQCQPCRRCYPQS 172

Query: 100 GTIFYPSRSSSYAYVPCDSEHCRYFPYARCS-AYKHRCIYTQLYLIGPETSV--FVSTEQ 156
           G +F P  S+SY  +  D+  C+    +    A +  C+Y   Y     T+V  F+  E 
Sbjct: 173 GPVFDPRHSTSYREMGYDAPDCQALGRSGGGDAKRMTCVYAVGYGDDGSTTVGDFIE-ET 231

Query: 157 LTFKNTDESTIHVQDVVFGCGFSTNRNFK--FSGIFGLGIGRSSLVSQLN------SSFS 208
           LTF       + V  +  GCG      F    +GI GLG G+ S  SQ+       +SFS
Sbjct: 232 LTFAG----GVQVPHMSIGCGHDNKGLFAAPAAGILGLGRGQISCPSQIAALGYNVTSFS 287

Query: 209 YCIGS--LHDPDY-LHNKLILGDGAIIDEGDATP--------LQFIDGHYYITLEAISVD 257
           YC+    L  P   + + L +GDGA    G   P        L     +Y   +      
Sbjct: 288 YCLADFFLSSPGRSVSSTLTIGDGAA--AGSPPPSFTPTVQNLNMATFYYVRLVGVSVGG 345

Query: 258 GRMLDINPNIFKRD--DSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWP 315
            R+  +  +  K D     GGV++DSGT VT L + AY A RD            S   P
Sbjct: 346 VRVPGVTEDDLKLDPYTGRGGVILDSGTAVTRLARRAYIAFRDAFRAAAVDLGQVSIGGP 405

Query: 316 DKL---CYHGIMSSDLKGFPTVRFHFRGGAKLAL-EKDSMFYQPRPDAFCMAVNPASINN 371
                 CY   M       PTV  HF GG +L L  K+ +         C A   A   +
Sbjct: 406 SGFFDTCY--TMGGRAMKVPTVSMHFAGGVELTLPPKNYLIPVDSMGTVCFAF--AGTGD 461

Query: 372 RYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           R V  S+IG + QQ + V Y+IG  +  F    C
Sbjct: 462 RSV--SIIGNIQQQGFRVVYNIGGGRVGFAPNSC 493


>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
 gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 507

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 94/322 (29%), Positives = 150/322 (46%), Gaps = 36/322 (11%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
           L++  + +G P    F  +DTGS +LWV C PC  C    G       F P  SS+ + +
Sbjct: 88  LYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRI 147

Query: 115 PCDSEHCRY---FPYARCSAYKHR---CIYTQLYLIGPETSVFVSTEQLTFKNT---DES 165
            C  + C        A C     +   C YT  Y  G  TS +  ++ + F+     +++
Sbjct: 148 TCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQT 207

Query: 166 TIHVQDVVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNS------SFSYCIGSL 214
                 +VFGC  S     T  +    GIFG G  + S++SQLNS       FS+C   L
Sbjct: 208 ANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHC---L 264

Query: 215 HDPDYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDS 273
              D     L+LG+  I++ G   TPL     HY + LE+I+V+G+ L I+ ++F   ++
Sbjct: 265 KGSDNGGGILVLGE--IVEPGLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNT 322

Query: 274 GGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDL-KGFP 332
             G ++DSGT + +L   AY+     +   +    +RS       C+  I SS +   FP
Sbjct: 323 -QGTIVDSGTTLAYLADGAYDPFVSAIAAAVS-PSVRSLVSKGSQCF--ITSSSVDSSFP 378

Query: 333 TVRFHFRGGAKLALEKDSMFYQ 354
           TV  +F GG  ++++ ++   Q
Sbjct: 379 TVTLYFMGGVAMSVKPENYLLQ 400


>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
          Length = 423

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 94/322 (29%), Positives = 150/322 (46%), Gaps = 36/322 (11%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
           L++  + +G P    F  +DTGS +LWV C PC  C    G       F P  SS+ + +
Sbjct: 4   LYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRI 63

Query: 115 PCDSEHCRY---FPYARCSAYKHR---CIYTQLYLIGPETSVFVSTEQLTFKNT---DES 165
            C  + C        A C     +   C YT  Y  G  TS +  ++ + F+     +++
Sbjct: 64  TCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQT 123

Query: 166 TIHVQDVVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNS------SFSYCIGSL 214
                 +VFGC  S     T  +    GIFG G  + S++SQLNS       FS+C   L
Sbjct: 124 ANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHC---L 180

Query: 215 HDPDYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDS 273
              D     L+LG+  I++ G   TPL     HY + LE+I+V+G+ L I+ ++F   ++
Sbjct: 181 KGSDNGGGILVLGE--IVEPGLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNT 238

Query: 274 GGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDL-KGFP 332
             G ++DSGT + +L   AY+     +   +    +RS       C+  I SS +   FP
Sbjct: 239 -QGTIVDSGTTLAYLADGAYDPFVSAIAAAVS-PSVRSLVSKGSQCF--ITSSSVDSSFP 294

Query: 333 TVRFHFRGGAKLALEKDSMFYQ 354
           TV  +F GG  ++++ ++   Q
Sbjct: 295 TVTLYFMGGVAMSVKPENYLLQ 316


>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
 gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 482

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 113/416 (27%), Positives = 171/416 (41%), Gaps = 46/416 (11%)

Query: 1   LIHRDSI------LSPYNNPNENPAHRVQRGIN-ISIARLAYLQEKIRSHNTYQAQILPS 53
           L+HRD +         +N+  +  A RV   +  +S    A +++       +   ++  
Sbjct: 76  LLHRDKLSHVHGHRRGFNDRMKRDAIRVATLVRRLSHGAPAAVKDSRYKVANFATDVISG 135

Query: 54  NDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAY 113
            +  +  ++V I +G PP  Q+  +D+GS ++WV C PC  C  Q   +F P+ SSS+A 
Sbjct: 136 MEAGSGEYFVRIGVGSPPRNQYMVIDSGSDIVWVQCKPCSRCYQQSDPVFDPADSSSFAG 195

Query: 114 VPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
           V C S+ C       C+A   RC Y   Y  G  T   ++ E LT        + ++DV 
Sbjct: 196 VSCGSDVCDRLENTGCNA--GRCRYEVSYGDGSYTKGTLALETLTVGQ-----VMIRDVA 248

Query: 174 FGCGFSTNRNFKFSGIFGLGIGRS-----SLVSQLNSSFSYCIGSLHDPDYLHNKLILGD 228
            GCG +    F  +       G S      L  Q   +FSYC+ S          L  G 
Sbjct: 249 IGCGHTNQGMFIGAAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGS--TGALEFGR 306

Query: 229 GAIIDEGDATPLQFID-----GHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSG 282
           GA+     AT +  I        YYI L  I V G  + +    F+  + G  GV++D+G
Sbjct: 307 GAL--PVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEYGTNGVVMDTG 364

Query: 283 TDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----PTVRFH 337
           T VT     AY A RD    +               CY      DL GF     PTV F+
Sbjct: 365 TAVTRFPTAAYVAFRDSFTAQTSNLPRAPGVSIFDTCY------DLNGFESVRVPTVSFY 418

Query: 338 FRGGAKLAL-EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
           F  G  L L  ++ +        FC+A  P+         S+IG + Q+   + +D
Sbjct: 419 FSDGPVLTLPARNFLIPVDGGGTFCLAFAPSP-----SGLSIIGNIQQEGIQISFD 469


>gi|359482097|ref|XP_002271077.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 458

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 109/386 (28%), Positives = 175/386 (45%), Gaps = 54/386 (13%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
           N    V++++G PP      +DTGS L W+ C    + +    T F P+RSSSY+ VPC 
Sbjct: 82  NVSLTVSLTVGTPPQNVSMVLDTGSELSWLRC----NKTQTFQTTFDPNRSSSYSPVPCS 137

Query: 118 SEHC----RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
           S  C    R FP          C     Y     +   ++++     N+D     +   +
Sbjct: 138 SLTCTDRTRDFPIPASCDSNQLCHAILSYADASSSEGNLASDTFYIGNSD-----MPGTI 192

Query: 174 FGC---GFSTN--RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILG 227
           FGC    FSTN   + K +G+ G+  G  S VSQ++   FSYCI    D D+    L+LG
Sbjct: 193 FGCMDSSFSTNTEEDSKNTGLMGMNRGSLSFVSQMDFPKFSYCIS---DSDF-SGVLLLG 248

Query: 228 DG-----------AIIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGG 275
           D             +I    +TPL + D   Y + LE I V  ++L +  ++F  D +G 
Sbjct: 249 DANFSWLMPLNYTPLIQI--STPLPYFDRVAYTVQLEGIKVSSKLLPLPKSVFVPDHTGA 306

Query: 276 G-VMIDSGTDVTWLVKEAYEALRDEVM------IRLEGEQMRSYSWPDKLCYHGIMS-SD 327
           G  M+DSGT  T+L+   Y ALR+E +      +R+  +    +     LCY   +S + 
Sbjct: 307 GQTMVDSGTQFTFLLGPVYSALRNEFLNQTSQILRVLEDPNYVFQGGMDLCYRVPLSQTS 366

Query: 328 LKGFPTVRFHFRGGAKLALEKDSMFYQPRPD------AFCMAVNPASINNRYVNFSLIGM 381
           L   PTV   FR GA++ +  D + Y+   +       +C     + +    V   +IG 
Sbjct: 367 LPWLPTVSLMFR-GAEMKVSGDRLLYRVPGEVRGSDSVYCFTFGNSDL--LAVEAYVIGH 423

Query: 382 MAQQFYNVGYDIGRKQKTFQRMDCEV 407
             QQ   + +D+ + +  F ++ C++
Sbjct: 424 HHQQNVWMEFDLEKSRIGFAQVQCDL 449


>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 457

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 95/371 (25%), Positives = 159/371 (42%), Gaps = 39/371 (10%)

Query: 63  VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR 122
           V++ IG PP  Q   +DTGS L W+ C+      P     F PS SS+++ +PC    C+
Sbjct: 99  VDLPIGTPPQVQPMVLDTGSQLSWIQCHKKAPAKPPPTASFDPSLSSTFSTLPCTHPVCK 158

Query: 123 Y----FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
                F           C Y+  Y  G      +  E+ TF  +    +    ++ GC  
Sbjct: 159 PRIPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTFSRS----LFTPPLILGCAT 214

Query: 179 STNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGS-LHDPDYLHNKLILGDGAIIDEGD 236
            +       GI G+  GR S  SQ   + FSYC+ + +  P Y       G   +    +
Sbjct: 215 ESTDP---RGILGMNRGRLSFASQSKITKFSYCVPTRVTRPGYTPT----GSFYLGHNPN 267

Query: 237 ATPLQFIDG---------------HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMID 280
           +   ++I+                 Y + L+ I + GR L+I+P +F+ D  G G  M+D
Sbjct: 268 SNTFRYIEMLTFARSQRMPNLDPLAYTVALQGIRIGGRKLNISPAVFRADAGGSGQTMLD 327

Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRS---YSWPDKLCYHGIMSSDLKGFPTVRFH 337
           SG++ T+LV EAY+ +R EV +R  G +M+    Y     +C+ G      +    + F 
Sbjct: 328 SGSEFTYLVNEAYDKVRAEV-VRAVGPRMKKGYVYGGVADMCFDGNAIEIGRLIGDMVFE 386

Query: 338 FRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
           F  G ++ + K+ +         C+ +  A+ +      ++IG   QQ   V +D+  ++
Sbjct: 387 FEKGVQIVVPKERVLATVEGGVHCIGI--ANSDKLGAASNIIGNFHQQNLWVEFDLVNRR 444

Query: 398 KTFQRMDCEVL 408
             F   DC  L
Sbjct: 445 MGFGTADCSRL 455


>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
          Length = 518

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 100/357 (28%), Positives = 151/357 (42%), Gaps = 29/357 (8%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYVPCDSE 119
           + V + +G P        DTGS   WV C PC   C  Q   +F P+RSS+YA V C + 
Sbjct: 179 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPARSSTYANVSCAAP 238

Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
            C       CS     C+Y   Y  G  +  F + + LT  + D     V+   FGCG  
Sbjct: 239 ACFDLDTRGCSG--GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDA----VKGFRFGCGER 292

Query: 180 TNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
               F + +G+ GLG G++SL  Q        F++C+ +          L  G G+    
Sbjct: 293 NEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSSGT---GYLDFGPGSPAAA 349

Query: 235 GD--ATPLQFIDG--HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVK 290
           G    TP+   +G   YY+ +  I V G++L I  ++F       G ++DSGT +T L  
Sbjct: 350 GARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFAT----AGTIVDSGTVITRLPP 405

Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKL--CYHGIMSSDLKGFPTVRFHFRGGAKLALEK 348
            AY +LR   +  +     +       L  CY     S +   PTV   F+GGA L ++ 
Sbjct: 406 PAYSSLRSAFVSAMAARGYKKAPAVSLLDTCYDFTGMSQVA-IPTVSLLFQGGAILDVDA 464

Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
             + Y       C+     + N    +  ++G    + + V YDIG+K   F    C
Sbjct: 465 SGIMYAASVSQVCLGF---AANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 518


>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
           melo]
          Length = 412

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 106/391 (27%), Positives = 169/391 (43%), Gaps = 50/391 (12%)

Query: 52  PSNDIS---NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRS 108
           PSN +S   N    V++++G PP      +DTGS L W+HC      SP L ++F P  S
Sbjct: 28  PSNKLSFHHNVTLTVSLTVGSPPQQVTMVLDTGSELSWLHCKK----SPNLTSVFNPLSS 83

Query: 109 SSYAYVPCDSEHCRY----FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDE 164
           SSY+ +PC S  CR      P       K  C     Y         ++++     ++  
Sbjct: 84  SSYSPIPCSSPVCRTRTRDLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRIGSS-- 141

Query: 165 STIHVQDVVFGC---GFSTN--RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPD 218
               +   +FGC   GFS+N   + K +G+ G+  G  S V+QL    FSYCI       
Sbjct: 142 ---ALPGTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSYCISGRDS-- 196

Query: 219 YLHNKLILGDGAIIDEGD---------ATPLQFIDG-HYYITLEAISVDGRMLDINPNIF 268
                L+ GD  +   G+         +TPL + D   Y + L+ I V  ++L +  +IF
Sbjct: 197 --SGVLLFGDSHLSWLGNLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIF 254

Query: 269 KRDDSGGG-VMIDSGTDVTWLVKEAYEALRDEVMIRLE------GEQMRSYSWPDKLCYH 321
             D +G G  M+DSGT  T+L+   Y ALR+E + + +      G+    +     LCY 
Sbjct: 255 APDHTGAGQTMVDSGTQFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYR 314

Query: 322 GIMSSDLKGFPTVRFHFRG-----GAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNF 376
                 L   P V   FRG     G ++ L K     + +   +C+    + +    +  
Sbjct: 315 VPAGGKLPELPAVSLMFRGAEMVVGGEVLLYKVPGMMKGKEWVYCLTFGNSDLLG--IEA 372

Query: 377 SLIGMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
            +IG   QQ   + +D+ + +  F    C++
Sbjct: 373 FVIGHHHQQNVWMEFDLVKSRVGFVETRCDL 403


>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
          Length = 454

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 93/323 (28%), Positives = 150/323 (46%), Gaps = 32/323 (9%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
           L+Y  I +G PP P +  +DTGS +LWV+C PC  C    G       F P  SS+ + +
Sbjct: 40  LYYTRIELGTPPRPFYVQIDTGSDILWVNCKPCNACPLTSGLGVALNFFDPRGSSTASPL 99

Query: 115 PCDSEHC---RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFK---NTDESTIH 168
            C    C        + C+  ++ C Y+  Y  G  T  +  +++  +    N   +   
Sbjct: 100 SCIDSKCVSSNQISESVCTTDRY-CGYSFEYGDGSGTLGYYVSDEFDYNQYVNQYVTNNA 158

Query: 169 VQDVVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDP 217
              + FGC ++     T  +    GIFG G    S+VSQLNS       FS+C   L   
Sbjct: 159 SAKITFGCSYNQSGDLTKPDRAVDGIFGFGQNDLSVVSQLNSQGLAPKIFSHC---LEGA 215

Query: 218 DYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
           D     L+LG+  I + G   TP+     HY + L+ I+V+G+ L I+P +F   ++  G
Sbjct: 216 DPGGGILVLGE--ITEPGMVYTPIVPSQPHYNLNLQGIAVNGQQLSIDPQVFATTNT-RG 272

Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRF 336
            +ID GT + +L +EAYE   + ++  +  +  + +      C+  + S D + FP+V  
Sbjct: 273 TIIDCGTTLAYLAEEAYEPFVNTIIAAVS-QSTQPFMLKGNPCFLTVHSID-EIFPSVTL 330

Query: 337 HFRGGAKLALEKDSMFYQPRPDA 359
           +F G       KD +  Q  PD+
Sbjct: 331 YFEGAPMDLKPKDYLIQQLSPDS 353


>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
 gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
          Length = 430

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 110/380 (28%), Positives = 164/380 (43%), Gaps = 41/380 (10%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCY---------PCRDCSPQLGTIFYPSRSSSY 111
           + V+++ G PP       DTGS L+W+ C          P + CS +    F  S+S++ 
Sbjct: 54  YLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRR--PAFVASKSATL 111

Query: 112 AYVPCDSEHCRYFPYAR-----CS-AYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES 165
           + VPC +  C   P  R     CS A    C Y   Y  G  T+ F++ +  T  N    
Sbjct: 112 SVVPCSAAQCLLVPAPRGHGPSCSPAAPVPCGYAYDYADGSSTTGFLARDTATISNGTSG 171

Query: 166 TIHVQDVVFGCGFSTNRNFKFS---GIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPD 218
              V+ V FGCG + N+   FS   G+ GLG G+ S  +Q  S    +FSYC+  L    
Sbjct: 172 GAAVRGVAFGCG-TRNQGGSFSGTGGVIGLGQGQLSFPAQSGSLFAQTFSYCLLDLEGGR 230

Query: 219 YLHNK--LILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDS 273
              +   L LG          TPL         YY+ + AI V  R+L +  + +  D  
Sbjct: 231 RGRSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWAIDVL 290

Query: 274 G-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS---YSWPDKLCYHGIMSSDLK 329
           G GG +IDSG+ +T+L   AY  L       +   ++ S   +    +LCY+   SS L 
Sbjct: 291 GNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQGLELCYNVSSSSSLA 350

Query: 330 ----GFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQ 385
               GFP +   F  G  L L   +       D  C+A+ P         F+++G + QQ
Sbjct: 351 PANGGFPRLTIDFAQGLSLELPTGNYLVDVADDVKCLAIRP---TLSPFAFNVLGNLMQQ 407

Query: 386 FYNVGYDIGRKQKTFQRMDC 405
            Y+V +D    +  F R +C
Sbjct: 408 GYHVEFDRASARIGFARTEC 427


>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
 gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
          Length = 659

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 103/379 (27%), Positives = 165/379 (43%), Gaps = 55/379 (14%)

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
           +SN  +   + IG PP      +DTGS++ +V C  C  C       F P  SS+Y  V 
Sbjct: 83  LSNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSDCEHCGKHQDPRFQPDESSTYHPVK 142

Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
           C+ +         C      C+Y + Y     +S  +  + ++F N  +S +  Q  VFG
Sbjct: 143 CNMD-------CNCDHDGVNCVYERRYAEMSSSSGVLGEDIISFGN--QSEVVPQRAVFG 193

Query: 176 CGFSTNRNF---KFSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLIL 226
           C      +    +  GI GLG G+ S+V QL      N SFS C G +H          +
Sbjct: 194 CENVETGDLYSQRADGIMGLGRGQLSIVDQLVDKNVINDSFSLCYGGMH----------V 243

Query: 227 GDGAIIDEGDATPLQFI--------DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVM 278
           G GA++  G   P   +          +Y I L+ I V G+ L ++P+ F R     G +
Sbjct: 244 GGGAMVLGGIPPPPDMVFSRSDPYRSPYYNIELKEIHVAGKPLKLSPSTFDRKH---GTV 300

Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK----LCYHGI---MSSDLKGF 331
           +DSGT   +L +EA+ A RD ++ +     ++    PD     +C+ G    +S   K F
Sbjct: 301 LDSGTTYAYLPEEAFVAFRDAIIKK--SHNLKQIHGPDPNYNDICFSGAGRDVSQLSKAF 358

Query: 332 PTVRFHFRGGAKLALEKDSMFYQPRP--DAFCMAVNPASINNRYVNFSLIGMMAQQFYNV 389
           P V   F  G KL+L  ++  +Q      A+C+      I     + +L+G +  +   V
Sbjct: 359 PEVDMVFSNGQKLSLTPENYLFQHTKVHGAYCLG-----IFRNGDSTTLLGGIIVRNTLV 413

Query: 390 GYDIGRKQKTFQRMDCEVL 408
            YD   ++  F + +C  L
Sbjct: 414 TYDRENEKIGFWKTNCSEL 432


>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
 gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
          Length = 426

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 102/339 (30%), Positives = 149/339 (43%), Gaps = 33/339 (9%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
           L+Y  + +G PP   +  +DTGS +LWV C  C  C    G       F P  S + + +
Sbjct: 80  LYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPI 139

Query: 115 PCDSEHCRYFPYAR---CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI---H 168
            C  + C +   +    CS   + C YT  Y  G  TS F  ++ L F     S++    
Sbjct: 140 SCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNS 199

Query: 169 VQDVVFGCGFSTNRNFKFS-----GIFGLGIGRSSLVSQLNSS------FSYCIGSLHDP 217
              VVFGC  S   +   S     GIFG G    S++SQL S       FS+C   L   
Sbjct: 200 TAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHC---LKGE 256

Query: 218 DYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
           +     L+LG+  I++     TPL     HY + L +ISV+G+ L INP++F   + G G
Sbjct: 257 NGGGGILVLGE--IVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSN-GQG 313

Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRF 336
            +ID+GT + +L + AY     E +     + +R        CY  I +S    FP V  
Sbjct: 314 TIIDTGTTLAYLSEAAYVPFV-EAITNAVSQSVRPVVSKGNQCYV-ITTSVGDIFPPVSL 371

Query: 337 HFRGGAKLALEKDSMFYQPR--PDAFCMAVNPASINNRY 373
           +F GGA + L       Q      A C      S+ +R+
Sbjct: 372 NFAGGASMFLNPQDYLIQQNNVASALCFLGRYCSVVHRF 410


>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 488

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 104/379 (27%), Positives = 165/379 (43%), Gaps = 36/379 (9%)

Query: 51  LPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLGT---IFYP 105
           LP++     L+Y  I IG PP      +DTGS +LWV+C  C  C     LG    ++ P
Sbjct: 76  LPTD---TGLYYTEIEIGTPPKQYHVQVDTGSDILWVNCISCNKCPRKSDLGIDLRLYDP 132

Query: 106 SRSSSYAYVPCDSEHC--RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT- 162
             SSS + V CD + C   Y       A    C Y+ +Y  G  T+ +  ++ L +    
Sbjct: 133 KGSSSGSTVSCDQKFCAATYGGKLPGCAKNIPCEYSVMYGDGSSTTGYFVSDSLQYNQVS 192

Query: 163 -DESTIHVQ-DVVFGCGFST-----NRNFKFSGIFGLGIGRSSLVSQLNSS------FSY 209
            D  T H    V+FGCG        + N    GI G G   +S++SQL ++      FS+
Sbjct: 193 GDGQTRHANASVIFGCGAQQGGDLGSTNQALDGIIGFGQSNTSMLSQLAAAGEVKKIFSH 252

Query: 210 CIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIF 268
           C+      D +    I   G ++  +  +TPL     HY + LE+I+V G  L +  ++F
Sbjct: 253 CL------DTIKGGGIFAIGDVVQPKVKSTPLVPDMPHYNVNLESINVGGTTLQLPSHMF 306

Query: 269 KRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDL 328
           +  +   G +IDSGT +T+L +  Y+ +   V  +       S    D LC     S D 
Sbjct: 307 ETGEK-KGTIIDSGTTLTYLPELVYKDVLAAVFAKHPDTTFHSVQ--DFLCIQYFQSVD- 362

Query: 329 KGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR-YVNFSLIGMMAQQFY 387
            GFP + FHF     L +     F+Q   + +C       + ++   +  L+G +     
Sbjct: 363 DGFPKITFHFEDDLGLNVYPHDYFFQNGDNLYCFGFQNGGLQSKDGKDMVLLGDLVLSNK 422

Query: 388 NVGYDIGRKQKTFQRMDCE 406
            V YD+  +   +   +C 
Sbjct: 423 VVVYDLENQVVGWTDYNCS 441


>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
 gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 493

 Score =  115 bits (287), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 100/320 (31%), Positives = 144/320 (45%), Gaps = 38/320 (11%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
           L+Y  + +G PP   +  +DTGS +LWV C  C  C    G       F P  S + + +
Sbjct: 80  LYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPI 139

Query: 115 PCDSEHCRYFPYAR---CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI---H 168
            C  + C +   +    CS   + C YT  Y  G  TS F  ++ L F     S++    
Sbjct: 140 SCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNS 199

Query: 169 VQDVVFGCGFSTNRNFKFS-----GIFGLGIGRSSLVSQLNSS------FSYCIGSLHDP 217
              VVFGC  S   +   S     GIFG G    S++SQL S       FS+C   L   
Sbjct: 200 TAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHC---LKGE 256

Query: 218 DYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
           +     L+LG+  I++     TPL     HY + L +ISV+G+ L INP++F   + G G
Sbjct: 257 NGGGGILVLGE--IVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSN-GQG 313

Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRF 336
            +ID+GT + +L + AY     E +     + +R        CY  I +S    FP V  
Sbjct: 314 TIIDTGTTLAYLSEAAYVPFV-EAITNAVSQSVRPVVSKGNQCYV-ITTSVGDIFPPVSL 371

Query: 337 HFRGGAKLALEKDSMFYQPR 356
           +F GGA       SMF  P+
Sbjct: 372 NFAGGA-------SMFLNPQ 384


>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
 gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
          Length = 626

 Score =  115 bits (287), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 110/415 (26%), Positives = 174/415 (41%), Gaps = 63/415 (15%)

Query: 27  NISIARLAYLQEKIRSHNTYQAQILPS-------NDISNSLFYVNISIGQPPVPQFTAMD 79
           NIS  R+ +     R H   Q   LP+       + +SN  +   + IG PP      +D
Sbjct: 38  NISAHRMPFDGHYSRRH--LQNSELPNARMRLFDDLLSNGYYTTRLFIGTPPQEFALIVD 95

Query: 80  TGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYT 139
           TGS++ +V C  C  C       F P  SS+Y  V C+       P   C     +C Y 
Sbjct: 96  TGSTVTYVPCSSCEQCGKHQDPRFQPDLSSTYRPVKCN-------PSCNCDDEGKQCTYE 148

Query: 140 QLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF---KFSGIFGLGIGR 196
           + Y     +S  ++ + ++F N  ES +  Q  VFGC      +    +  GI GLG GR
Sbjct: 149 RRYAEMSSSSGVIAEDVVSFGN--ESELKPQRAVFGCENVETGDLYSQRADGIMGLGRGR 206

Query: 197 SSLVSQL------NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH---- 246
            S+V QL        SFS C G +           +G GA++    + P   +  H    
Sbjct: 207 LSVVDQLVDKGVIGDSFSLCYGGMD----------VGGGAMVLGQISPPPNMVFSHSNPY 256

Query: 247 ----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMI 302
               Y I L+ + V G+ L + P +F   D   G ++DSGT   +  + A+ AL+D +M 
Sbjct: 257 RSPYYNIELKELHVAGKPLKLKPKVF---DEKHGTVLDSGTTYAYFPEAAFHALKDAIMK 313

Query: 303 RLEGEQMRSYSWPDK----LCYHGI---MSSDLKGFPTVRFHFRGGAKLALEKDSMFYQ- 354
            +    ++    PD     +C+ G    +S   K FP V   F  G KL+L  ++  ++ 
Sbjct: 314 EI--RHLKQIPGPDPNYHDICFSGAGREVSHLSKVFPEVNMVFGSGQKLSLSPENYLFRH 371

Query: 355 -PRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
                A+C+ +      N     +L+G +  +   V YD    +  F + +C  L
Sbjct: 372 TKVSGAYCLGI----FQNGNDLTTLLGGIVVRNTLVTYDRENDKIGFWKTNCSEL 422


>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
          Length = 521

 Score =  115 bits (287), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 104/359 (28%), Positives = 154/359 (42%), Gaps = 33/359 (9%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYVPCDSE 119
           + V I +G P        DTGS   WV C PC   C  Q   +F P+RSS+YA V C + 
Sbjct: 182 YVVTIGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYKQQEKLFDPARSSTYANVSCAAP 241

Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
            C    Y R  +  H C+Y+  Y  G  +  F + + LT  + D     V+   FGCG  
Sbjct: 242 ACSDL-YTRGCSGGH-CLYSVQYGDGSYSIGFFAMDTLTLSSYDA----VKGFRFGCGER 295

Query: 180 TNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHD-PDYLHNKLILGDGAIID 233
               F + +G+ GLG G++SL  Q        F++C+ +      YL      G  A + 
Sbjct: 296 NEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSSGTGYL--DFGPGSPAAVG 353

Query: 234 EGDATPLQFIDG--HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKE 291
               TP+   +G   YY+ +  I V G++L I  ++F    S  G ++DSGT +T L   
Sbjct: 354 ARQTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVF----STAGTIVDSGTVITRLPPA 409

Query: 292 AYEALRDEVMIRLEGEQMRSYSWPDKL-----CYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
           AY +LR      +     R Y     L     CY     S++   P V   F+GGA L +
Sbjct: 410 AYSSLRSAFASAMA---ARGYKKAPALSLLDTCYDFTGMSEVA-IPKVSLLFQGGAYLDV 465

Query: 347 EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
               + Y       C+     + N    +  ++G    + + V YDIG+K   F    C
Sbjct: 466 NASGIMYAASLSQVCLGF---AANEDDDDVGIVGNTQLKTFGVVYDIGKKTVGFSPGAC 521


>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
           vinifera]
          Length = 354

 Score =  115 bits (287), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 102/372 (27%), Positives = 165/372 (44%), Gaps = 38/372 (10%)

Query: 52  PSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSS 110
           P   I +  +YV + +G P       +DTGSSL W+ C PC   C  Q   +F PS S +
Sbjct: 4   PGASIGSGNYYVKVGLGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKT 63

Query: 111 YAYVPCDSEHCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES 165
           Y  + C S  C     A      C    + C+YT  Y     +  ++S + LT   +   
Sbjct: 64  YKSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQT- 122

Query: 166 TIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYL 220
              +   V+GCG  +   F + +GI GLG  + S++ Q++S    +FSYC+ +     +L
Sbjct: 123 ---LPGFVYGCGQDSEGLFGRAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTRGGGGFL 179

Query: 221 HNKLILGDGAIIDEG-DATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSGGG 276
                +G  ++       TP+    G+   Y++ L AI+V GR L +    ++       
Sbjct: 180 S----IGKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVP----- 230

Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS--YSWPDKLCYHGIMSSDLKGFPTV 334
            +IDSGT +T L    Y   +   +  +  +  R+  +S  D  C+ G +  D++  P V
Sbjct: 231 TIIDSGTVITRLPMSVYTPFQQAFVKIMSSKYARAPGFSILDT-CFKGNL-KDMQSVPEV 288

Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
           R  F+GGA L L   ++  Q      C+A    + NN     ++IG   QQ + V +DI 
Sbjct: 289 RLIFQGGADLNLRPVNVLLQVDEGLTCLAF---AGNN---GVAIIGNHQQQTFKVAHDIS 342

Query: 395 RKQKTFQRMDCE 406
             +  F    C 
Sbjct: 343 TARIGFATGGCN 354


>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 490

 Score =  115 bits (287), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 102/375 (27%), Positives = 172/375 (45%), Gaps = 44/375 (11%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ--LG---TIFYPSRSSSYAYV 114
           L+Y  I IG PP   +  +DTGS ++WV+C  C++C  +  LG   T++    SSS  +V
Sbjct: 84  LYYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRSNLGMDLTLYDIKESSSGKFV 143

Query: 115 PCDSEHCRYFP---YARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFK------NTDES 165
           PCD E C+         C+A    C Y ++Y  G  T+ +   + + +        TD +
Sbjct: 144 PCDQEFCKEINGGLLTGCTA-NISCPYLEIYGDGSSTAGYFVKDIVLYDQVSGDLKTDSA 202

Query: 166 TIHVQDVVFGCG------FSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGS 213
                 +VFGCG       S++      GI G G   SS++SQL SS      F++C+  
Sbjct: 203 N---GSIVFGCGARQSGDLSSSNEEALGGILGFGKANSSMISQLASSGKVKKMFAHCLNG 259

Query: 214 LHDPDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDD 272
           ++         I   G ++  + + TPL     HY + + A+ V    L ++ +   + D
Sbjct: 260 VNGGG------IFAIGHVVQPKVNMTPLLPDQPHYSVNMTAVQVGHAFLSLSTDTSTQGD 313

Query: 273 SGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFP 332
              G +IDSGT + +L +  YE L  +++ +    ++R+    +  C+    S D  GFP
Sbjct: 314 R-KGTIIDSGTTLAYLPEGIYEPLVYKIISQHPDLKVRTLH-DEYTCFQYSESVD-DGFP 370

Query: 333 TVRFHFRGGAKLAL-EKDSMFYQPRPDAFCMAVNPASINNR-YVNFSLIGMMAQQFYNVG 390
            V F+F  G  L +   D +F  P  D +C+    +   +R   N +L+G +      V 
Sbjct: 371 AVTFYFENGLSLKVYPHDYLF--PSGDFWCIGWQNSGTQSRDSKNMTLLGDLVLSNKLVF 428

Query: 391 YDIGRKQKTFQRMDC 405
           YD+  +   +   +C
Sbjct: 429 YDLENQVIGWTEYNC 443


>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
          Length = 494

 Score =  115 bits (287), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 95/337 (28%), Positives = 145/337 (43%), Gaps = 32/337 (9%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLG---TIFYPSRSSSYA 112
             L++  I IG P    +  +DTGS +LWV+C  C  C     LG   T++ P  S S  
Sbjct: 87  TGLYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGE 146

Query: 113 YVPCDSEHC--RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD---ESTI 167
            V CD + C   Y            C Y+  Y  G  T+ F  T+ L +       ++T 
Sbjct: 147 LVTCDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTP 206

Query: 168 HVQDVVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHD 216
               V FGCG        + N    GI G G   SS++SQL ++      F++C+     
Sbjct: 207 ANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCL----- 261

Query: 217 PDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
            D ++   I   G ++  +   TPL     HY + L+ I V G  L +  NIF   +S  
Sbjct: 262 -DTVNGGGIFAIGNVVQPKVKTTPLVSDMPHYNVILKGIDVGGTALGLPTNIFDSGNS-K 319

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
           G +IDSGT + ++ +  Y+AL    M+  + + +   +  D  C+    S D  GFP V 
Sbjct: 320 GTIIDSGTTLAYVPEGVYKALF--AMVFDKHQDISVQTLQDFSCFQYSGSVD-DGFPEVT 376

Query: 336 FHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR 372
           FHF G   L +      +Q   + +CM      +  +
Sbjct: 377 FHFEGDVSLIVSPHDYLFQNGKNLYCMGFQNGGVQTK 413


>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 546

 Score =  115 bits (287), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 104/375 (27%), Positives = 168/375 (44%), Gaps = 42/375 (11%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           +++++ +G PP      +DTGS L W+ C PC +C  Q G  + P +SSSY  + C    
Sbjct: 181 YFIDVFVGTPPKHFSLILDTGSDLNWIQCVPCYECFEQNGPHYDPGQSSSYRNIGCHDSR 240

Query: 121 CRYF----PYARCSAYKHRCIYTQLYLIGPETS----VFVSTEQLTFKNTDESTIHVQDV 172
           C       P   C A    C Y   Y     T+    +   T  LT  +       V++V
Sbjct: 241 CHLVSSPDPPQPCKAENQTCPYYYWYGDSSNTTGDFALETFTVNLTMSSGKPELRRVENV 300

Query: 173 VFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILG 227
           +FGCG      F   +G+ GLG G  S  SQL S    SFSYC+   +    + +KLI G
Sbjct: 301 MFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDANVSSKLIFG 360

Query: 228 DG-----------AIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGG 275
           +              +  G   P   +D  YY+ +++I V G +++I    ++   D  G
Sbjct: 361 EDKDLLSHPELNFTTLVAGKENP---VDTFYYVQIKSIVVGGEVVNIPEEKWQIATDGSG 417

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYH--GIMSSDLKGFPT 333
           G +IDSGT +++  + AY+ +++  M +++G  +       + CY+  G+   DL  F  
Sbjct: 418 GTIIDSGTTLSYFAEPAYQVIKEAFMAKVKGYPVVKDFPVLEPCYNVTGVEQPDLPDFGI 477

Query: 334 VRFHFRGGAKLALEKDSMFYQPRP-DAFCMAV--NPASINNRYVNFSLIGMMAQQFYNVG 390
           V   F  GA      ++ F +  P +  C+A+   P S        S+IG   QQ +++ 
Sbjct: 478 V---FSDGAVWNFPVENYFIEIEPREVVCLAILGTPPSA------LSIIGNYQQQNFHIL 528

Query: 391 YDIGRKQKTFQRMDC 405
           YD  + +  F    C
Sbjct: 529 YDTKKSRLGFAPTKC 543


>gi|255583547|ref|XP_002532530.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223527742|gb|EEF29846.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 440

 Score =  114 bits (286), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 98/365 (26%), Positives = 159/365 (43%), Gaps = 32/365 (8%)

Query: 63  VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTI-FYPSRSSSYAYVPCDSEHC 121
           V++ IG PP  Q   +DTGS L W+ C+          T  F PS SSS++ +PC+   C
Sbjct: 82  VSLPIGTPPQTQQMVLDTGSQLSWIQCHKKSVPKKPPPTTSFDPSLSSSFSVLPCNHPLC 141

Query: 122 RY----FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
           +     F           C Y+  Y  G      +  E++TF ++  +      ++ GC 
Sbjct: 142 KPRIPDFTLPTTCDQNRLCHYSYFYADGTYAEGSLVREKITFSSSQST----PPLILGCA 197

Query: 178 FSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPD--------YLHNKLILGD 228
            ++       GI G+ +GR S  SQ   S FSYC+ +             YL N    G 
Sbjct: 198 EASTDE---KGILGMNLGRRSFASQAKISKFSYCVPTRQARAGLSSTGSFYLGNNPNSGR 254

Query: 229 GAIIDEGDATPLQFIDG----HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMIDSGT 283
              I+    TP Q         Y I ++ I +    L+I+  +F+ D SG G  +IDSG+
Sbjct: 255 FQYINLLTFTPSQRSPNLDPLAYTIPMQGIRMGNARLNISATLFRPDPSGAGQTIIDSGS 314

Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRS---YSWPDKLCYHGIMSSDLKGFPTVRFHFRG 340
           + T+LV EAY  +R+EV +RL G +++    Y     +C+ G      +    + F F  
Sbjct: 315 EFTYLVDEAYNKVREEV-VRLVGPKLKKGYVYGGVSDMCFDGNPMEIGRLIGNMVFEFEK 373

Query: 341 GAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
           G ++ ++K  +         C+ +  + +     N  +IG   QQ   V YD+  ++   
Sbjct: 374 GVEIVIDKWRVLADVGGGVHCIGIGRSEMLGAASN--IIGNFHQQNLWVEYDLANRRIGL 431

Query: 401 QRMDC 405
            + DC
Sbjct: 432 GKADC 436


>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 481

 Score =  114 bits (286), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 112/381 (29%), Positives = 163/381 (42%), Gaps = 49/381 (12%)

Query: 51  LPSND---ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPS 106
           LPS     I    + V + +G P        DTGS L W  C PC R C  Q   IF PS
Sbjct: 125 LPSKSGSTIGTGNYVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARYCYHQQEPIFNPS 184

Query: 107 RSSSYAYVPCDSEHCRYFPYA-----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKN 161
           +S+SY  + C S  C            CSA    C+Y   Y     +  F + ++L   +
Sbjct: 185 KSTSYTNISCSSPTCDELKSGTGNSPSCSA--STCVYGIQYGDQSYSVGFFAQDKLALTS 242

Query: 162 TDESTIHVQDVVFGCGFSTNRNFKFSGIFGL-GIGRS--SLVSQLNSS----FSYCIGSL 214
           TD       + +FGCG   NR   F G+ GL G+GR+  SLVSQ        FSYC+ S 
Sbjct: 243 TDV----FNNFLFGCG-QNNRGL-FVGVAGLIGLGRNALSLVSQTAQKYGKLFSYCLPST 296

Query: 215 HDPDYLHNKLILGDGAIIDEGDATPLQFIDGH----YYITLEAISVDGRMLDINPNIFKR 270
                    L  G G    +        ++      Y++ L AISV GR L  + ++F  
Sbjct: 297 SSST---GYLTFGSGGGTSKAVKFTPSLVNSQGPSFYFLNLIAISVGGRKLSTSASVF-- 351

Query: 271 DDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSY--SWPDKL---CYHGIMS 325
             S  G +IDSGT ++ L   AY  LR         +QM  Y  + P  +   CY     
Sbjct: 352 --STAGTIIDSGTVISRLPPTAYSDLRASFQ-----QQMSKYPKAAPASILDTCYD-FSQ 403

Query: 326 SDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQ 385
            D    P +  +F  GA++ L+   +FY       C+A    + N+   + +++G + Q+
Sbjct: 404 YDTVDVPKINLYFSDGAEMDLDPSGIFYILNISQVCLAF---AGNSDATDIAILGNVQQK 460

Query: 386 FYNVGYDIGRKQKTFQRMDCE 406
            ++V YD+   +  F    CE
Sbjct: 461 TFDVVYDVAGGRIGFAPGGCE 481


>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
          Length = 539

 Score =  114 bits (286), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 98/318 (30%), Positives = 142/318 (44%), Gaps = 31/318 (9%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
           L+Y  + +G PP   +  +DTGS +LWV C  C  C    G       F P  S + + +
Sbjct: 80  LYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPI 139

Query: 115 PCDSEHCRYFPYAR---CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI---H 168
            C  + C +   +    CS   + C YT  Y  G  TS F  ++ L F     S++    
Sbjct: 140 SCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNS 199

Query: 169 VQDVVFGCGFSTNRNFKFS-----GIFGLGIGRSSLVSQLNSS------FSYCIGSLHDP 217
              VVFGC  S   +   S     GIFG G    S++SQL S       FS+C   L   
Sbjct: 200 TAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHC---LKGE 256

Query: 218 DYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
           +     L+LG+  I++     TPL     HY + L +ISV+G+ L INP++F   + G G
Sbjct: 257 NGGGGILVLGE--IVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSN-GQG 313

Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRF 336
            +ID+GT + +L + AY     E +     + +R        CY  I +S    FP V  
Sbjct: 314 TIIDTGTTLAYLSEAAYVPFV-EAITNAVSQSVRPVVSKGNQCYV-ITTSVGDIFPPVSL 371

Query: 337 HFRGGAKLALEKDSMFYQ 354
           +F GGA + L       Q
Sbjct: 372 NFAGGASMFLNPQDYLIQ 389


>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 527

 Score =  114 bits (286), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 100/377 (26%), Positives = 164/377 (43%), Gaps = 44/377 (11%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           +++++ +G PP      +DTGS L W+ C PC DC  Q G  + P  S+S+  + C+   
Sbjct: 160 YFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNGMFYDPKTSASFKNITCNDPR 219

Query: 121 CRYF----PYARCSAYKHRCIYTQLYLIGPETS----VFVSTEQLTFKNTDESTIHVQDV 172
           C       P  +C +    C Y   Y     T+    V   T  LT      S   V ++
Sbjct: 220 CSLISSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGGSSEYKVGNM 279

Query: 173 VFGCGFSTNRNFKFSGIFGLGIG-----------RSSLVSQLNSSFSYCIGSLHDPDYLH 221
           +FGCG   NR     G+F    G            S L S    SFSYC+   +    + 
Sbjct: 280 MFGCG-HWNR-----GLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSNTNVS 333

Query: 222 NKLILG-DGAIIDEGDATPLQFIDGH-------YYITLEAISVDGRMLDINPNIFK-RDD 272
           +KLI G D  +++  +     F++G        YYI +++I V G+ LDI    +    D
Sbjct: 334 SKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGKALDIPEETWNISSD 393

Query: 273 SGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQ--MRSYSWPDKLCYH--GIMSSDL 328
             GG +IDSGT +++  + AYE ++++   +++      R +   D  C++  GI  +++
Sbjct: 394 GDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDFPVLDP-CFNVSGIEENNI 452

Query: 329 KGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYN 388
              P +   F  G       ++ F     D  C+A+    +      FS+IG   QQ ++
Sbjct: 453 H-LPELGIAFVDGTVWNFPAENSFIWLSEDLVCLAI----LGTPKSTFSIIGNYQQQNFH 507

Query: 389 VGYDIGRKQKTFQRMDC 405
           + YD  R +  F    C
Sbjct: 508 ILYDTKRSRLGFTPTKC 524


>gi|356531884|ref|XP_003534506.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 482

 Score =  114 bits (286), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 99/342 (28%), Positives = 149/342 (43%), Gaps = 38/342 (11%)

Query: 57  SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSY 111
           S  L+Y  I +G  P   +  +DTGS  LWV+C  C  C  + G     T++ P+ S + 
Sbjct: 73  STGLYYTKIGLG--PNDYYVQVDTGSDTLWVNCVGCTTCPKKSGLGMELTLYDPNSSKTS 130

Query: 112 AYVPCDSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIH 168
             VPCD E C      P + C      C Y+  Y  G  TS     + LTF         
Sbjct: 131 KVVPCDDEFCTSTYDGPISGCKK-DMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLRT 189

Query: 169 VQD---VVFGCG------FSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGS 213
           V D   V+FGCG       S+  +    GI G G   SS++SQL ++      FS+C+  
Sbjct: 190 VPDNTSVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRVFSHCL-- 247

Query: 214 LHDPDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDD 272
               D ++   I   G ++  +   TPL     HY + L+ I V G  + +  +IF    
Sbjct: 248 ----DTVNGGGIFAIGEVVQPKVKTTPLVPRMAHYNVVLKDIEVAGDPIQLPTDIFDS-T 302

Query: 273 SGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL-CYHGIMSSDL-KG 330
           SG G +IDSGT + +L    Y+ L ++ + +  G  M  Y   D+  C+H      L   
Sbjct: 303 SGRGTIIDSGTTLAYLPVSIYDQLLEKTLAQRSG--MELYLVEDQFTCFHYSDEKSLDDA 360

Query: 331 FPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR 372
           FPTV+F F  G  L        +  + D +C+    ++   +
Sbjct: 361 FPTVKFTFEEGLTLTAYPHDYLFPFKEDMWCIGWQKSTAQTK 402


>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
 gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
           Group]
 gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
           Group]
 gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
          Length = 494

 Score =  114 bits (286), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 101/372 (27%), Positives = 158/372 (42%), Gaps = 33/372 (8%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLG---TIFYPSRSSSYA 112
             L++  I IG P    +  +DTGS +LWV+C  C  C     LG   T++ P  S S  
Sbjct: 87  TGLYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGE 146

Query: 113 YVPCDSEHC--RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD---ESTI 167
            V CD + C   Y            C Y+  Y  G  T+ F  T+ L +       ++T 
Sbjct: 147 LVTCDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTP 206

Query: 168 HVQDVVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHD 216
               V FGCG        + N    GI G G   SS++SQL ++      F++C+     
Sbjct: 207 ANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCL----- 261

Query: 217 PDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
            D ++   I   G ++  +   TPL     HY + L+ I V G  L +  NIF   +S  
Sbjct: 262 -DTVNGGGIFAIGNVVQPKVKTTPLVPDMPHYNVILKGIDVGGTALGLPTNIFDSGNS-K 319

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
           G +IDSGT + ++ +  Y+AL    M+  + + +   +  D  C+    S D  GFP V 
Sbjct: 320 GTIIDSGTTLAYVPEGVYKALF--AMVFDKHQDISVQTLQDFSCFQYSGSVD-DGFPEVT 376

Query: 336 FHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR-YVNFSLIGMMAQQFYNVGYDIG 394
           FHF G   L +      +Q   + +CM      +  +   +  L+G +      V YD+ 
Sbjct: 377 FHFEGDVSLIVSPHDYLFQNGKNLYCMGFQNGGVQTKDGKDMVLLGDLVLSNKLVLYDLE 436

Query: 395 RKQKTFQRMDCE 406
            +   +   +C 
Sbjct: 437 NQAIGWADYNCS 448


>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
           Short=AtASPG2; Flags: Precursor
 gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
           thaliana]
 gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
 gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 470

 Score =  114 bits (286), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 120/418 (28%), Positives = 177/418 (42%), Gaps = 49/418 (11%)

Query: 1   LIHRDSILS-PYNNPNENPAHRVQRGINISIARLAYLQEKIRSH-------NTYQAQILP 52
           L+HRD   S  Y N +     R++R  +   A L  +  K+          N + + I+ 
Sbjct: 63  LLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPSSDSRYEVNDFGSDIVS 122

Query: 53  SNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYA 112
             D  +  ++V I +G PP  Q+  +D+GS ++WV C PC+ C  Q   +F P++S SY 
Sbjct: 123 GMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYT 182

Query: 113 YVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
            V C S  C     + C  +   C Y  +Y  G  T   ++ E LTF  T      V++V
Sbjct: 183 GVSCGSSVCDRIENSGC--HSGGCRYEVMYGDGSYTKGTLALETLTFAKT-----VVRNV 235

Query: 173 VFGCGFSTNRNFKFSGIFGLGIGRS-SLVSQLN----SSFSYCIGSLHDPDYLHNKLILG 227
             GCG      F  +       G S S V QL+     +F YC+ S          L+ G
Sbjct: 236 AMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDS--TGSLVFG 293

Query: 228 DGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGT 283
             A+       PL         YY+ L+ + V G  + +   +F   ++G GGV++D+GT
Sbjct: 294 REALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGT 353

Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----PTVRFHF 338
            VT L   AY A RD    +       S       CY      DL GF     PTV F+F
Sbjct: 354 AVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCY------DLSGFVSVRVPTVSFYF 407

Query: 339 RGGAKLALEKDSMFYQPRPDA----FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
             G  L L   + F  P  D+    F  A +P  +       S+IG + Q+   V +D
Sbjct: 408 TEGPVLTLPARN-FLMPVDDSGTYCFAFAASPTGL-------SIIGNIQQEGIQVSFD 457


>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
 gi|194704078|gb|ACF86123.1| unknown [Zea mays]
 gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 471

 Score =  114 bits (286), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 105/370 (28%), Positives = 164/370 (44%), Gaps = 38/370 (10%)

Query: 52  PSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSS 110
           P   +    +   + +G P       +DTGSSL W+ C PC   C  Q+G +F P  SS+
Sbjct: 125 PGTSVGVGNYVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDPRASST 184

Query: 111 YAYVPCDSEHCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES 165
           Y  V C +  C     A      CSA  + CIY   Y     +  ++ST+ ++F +T   
Sbjct: 185 YTSVRCSASQCDELQAATLNPSACSA-SNVCIYQASYGDSSFSVGYLSTDTVSFGSTSYP 243

Query: 166 TIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYL 220
           + +     +GCG      F + +G+ GL   + SL+ QL      SFSYC+ +     YL
Sbjct: 244 SFY-----YGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPTAASTGYL 298

Query: 221 HNKLILGDGAIIDEGDATPL--QFIDGH-YYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
                +G          TP+    +D   Y+ITL  +SV G  L ++P+ +    +    
Sbjct: 299 S----IGPYNTGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSSLPT---- 350

Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQ-MRSYSWPDKLCYHGIMSSDLKGFPTVRF 336
           +IDSGT +T L    + AL   V   + G Q   ++S  D  C+ G  +S L+  PTV  
Sbjct: 351 IIDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSILDT-CFEG-QASQLR-VPTVVM 407

Query: 337 HFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
            F GGA + L   ++         C+A  P        + ++IG   QQ ++V YD+ + 
Sbjct: 408 AFAGGASMKLTTRNVLIDVDDSTTCLAFAPTD------STAIIGNTQQQTFSVIYDVAQS 461

Query: 397 QKTFQRMDCE 406
           +  F    C 
Sbjct: 462 RIGFSAGGCS 471


>gi|18408451|ref|NP_564867.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12322615|gb|AAG51309.1|AC026480_16 unknown protein [Arabidopsis thaliana]
 gi|14334808|gb|AAK59582.1| unknown protein [Arabidopsis thaliana]
 gi|15293195|gb|AAK93708.1| unknown protein [Arabidopsis thaliana]
 gi|332196351|gb|AEE34472.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 430

 Score =  114 bits (286), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 96/365 (26%), Positives = 169/365 (46%), Gaps = 32/365 (8%)

Query: 63  VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR 122
           +++ IG PP  Q   +DTGS L W+ C+  +   P+  T F PS SSS++ +PC    C+
Sbjct: 74  ISLPIGTPPQAQQMVLDTGSQLSWIQCH-RKKLPPKPKTSFDPSLSSSFSTLPCSHPLCK 132

Query: 123 Y----FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
                F           C Y+  Y  G      +  E++TF NT+ +      ++ GC  
Sbjct: 133 PRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTEIT----PPLILGCAT 188

Query: 179 STNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHD-PDYL-HNKLILGDG------ 229
            ++ +    GI G+  GR S VSQ   S FSYCI    + P +       LGD       
Sbjct: 189 ESSDD---RGILGMNRGRLSFVSQAKISKFSYCIPPKSNRPGFTPTGSFYLGDNPNSHGF 245

Query: 230 ---AIIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMIDSGTD 284
              +++   ++  +  +D   Y + +  I    + L+I+ ++F+ D  G G  M+DSG++
Sbjct: 246 KYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVFRPDAGGSGQTMVDSGSE 305

Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRS---YSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
            T LV  AY+ +R E+M R+ G +++    Y     +C+ G ++   +    + F F  G
Sbjct: 306 FTHLVDAAYDKVRAEIMTRV-GRRLKKGYVYGGTADMCFDGNVAMIPRLIGDLVFVFTRG 364

Query: 342 AKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQ 401
            ++ + K+ +         C+ +  +S+     N  +IG + QQ   V +D+  ++  F 
Sbjct: 365 VEILVPKERVLVNVGGGIHCVGIGRSSMLGAASN--IIGNVHQQNLWVEFDVTNRRVGFA 422

Query: 402 RMDCE 406
           + DC 
Sbjct: 423 KADCS 427


>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
          Length = 491

 Score =  114 bits (286), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 107/366 (29%), Positives = 152/366 (41%), Gaps = 44/366 (12%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD---CSPQLGTIFYPSRSSSYAYVPCD 117
           F V + +G P  P     DTGS L WV C PC     C PQ   +F PS+SS+YA V C 
Sbjct: 149 FVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAAVHCG 208

Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
              C       CS     C+Y   Y  G  T+  +S + L       S+  +    FGCG
Sbjct: 209 EPQCAA-AGGLCSEDNTTCLYLVHYGDGSSTTGVLSRDTLALT----SSRALAGFPFGCG 263

Query: 178 FSTNRNFKFSGIFG-----------LGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLIL 226
               RN    G FG                S   +   + FSYC+ S +        L +
Sbjct: 264 ---TRNL---GDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCLPSSNS---TTGYLTI 314

Query: 227 GDGAIIDEGDATPLQFI-----DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDS 281
           G     D G A     +        Y++ L +I + G +L + P +F R    GG ++DS
Sbjct: 315 GATPATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYILPVPPAVFTR----GGTLLDS 370

Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL--CYHGIMSSDLKGFPTVRFHFR 339
           GT +T+L  +AYE LRD    RL  E+       D L  CY     S++   P V F F 
Sbjct: 371 GTVLTYLPAQAYELLRDR--FRLTMERYTPAPPNDVLDACYDFAGESEVI-VPAVSFRFG 427

Query: 340 GGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKT 399
            GA   L+   +      +  C+A   A+++   +  S+IG   Q+   V YD+  ++  
Sbjct: 428 DGAVFELDFFGVMIFLDENVGCLAF--AAMDAGGLPLSIIGNTQQRSAEVIYDVAAEKIG 485

Query: 400 FQRMDC 405
           F    C
Sbjct: 486 FVPASC 491


>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 473

 Score =  114 bits (285), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 104/361 (28%), Positives = 160/361 (44%), Gaps = 39/361 (10%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DCSPQLGTIFYPSRSSSYAYVPCDSE 119
           +   + +G P  P    +DTGSSL W+ C PCR  C  Q G +F P  SSSYA V C + 
Sbjct: 137 YVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRVSCHRQSGPVFDPKTSSSYAAVSCSTP 196

Query: 120 HCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
            C     A      CS+    CIY   Y     +  ++S + ++F +       V +  +
Sbjct: 197 QCNDLSTATLNPAACSS-SDVCIYQASYGDSSFSVGYLSKDTVSFGSNS-----VPNFYY 250

Query: 175 GCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILGDG 229
           GCG      F + +G+ GL   + SL+ QL      SFSYC+     P    +  +    
Sbjct: 251 GCGQDNEGLFGRSAGLMGLARNKLSLLYQLAPTLGYSFSYCL-----PSSSSSGYLSIGS 305

Query: 230 AIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVT 286
               +   TP+      D  Y+I L  ++V G+ L ++ + +    S    +IDSGT +T
Sbjct: 306 YNPGQYSYTPMVSSTLDDSLYFIKLSGMTVAGKPLAVSSSEY----SSLPTIIDSGTVIT 361

Query: 287 WLVKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLA 345
            L    Y+AL   V   ++G ++  +YS  D  C+ G  +S L+  P V   F GGA L 
Sbjct: 362 RLPTTVYDALSKAVAGAMKGTKRADAYSILDT-CFVG-QASSLR-VPAVSMAFSGGAALK 418

Query: 346 LEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           L   ++         C+A  PA       + ++IG   QQ ++V YD+   +  F    C
Sbjct: 419 LSAQNLLVDVDSSTTCLAFAPAR------SAAIIGNTQQQTFSVVYDVKSNRIGFAAGGC 472

Query: 406 E 406
            
Sbjct: 473 T 473


>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 557

 Score =  114 bits (285), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 102/371 (27%), Positives = 168/371 (45%), Gaps = 34/371 (9%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           +++++ IG PP      +DTGS L W+ C PC DC  Q G  + P  SSS+  + C    
Sbjct: 192 YFMDVFIGTPPRHFSLILDTGSDLNWIQCVPCYDCFVQNGPYYDPKESSSFKNIGCHDPR 251

Query: 121 CRYF----PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDEST----IHVQDV 172
           C       P   C A    C Y   Y     T+   + E  T   T  +       V++V
Sbjct: 252 CHLVSSPDPPQPCKAENQTCPYFYWYGDSSNTTGDFALETFTVNLTSPAGKSEFKRVENV 311

Query: 173 VFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILG 227
           +FGCG      F   +G+ GLG G  S  SQL S    SFSYC+   +    + +KLI G
Sbjct: 312 MFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLIFG 371

Query: 228 -DGAIIDEGDATPLQFIDGH-------YYITLEAISVDGRMLDINPNIFKRDDSG-GGVM 278
            D  +++  +      + G        YY+ +++I V G +L I    +     G GG +
Sbjct: 372 EDKDLLNHPEVNFTSLVAGKENPVDTFYYVQIKSIMVGGEVLKIPEETWHLSPEGAGGTI 431

Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYH--GIMSSDLKGFPTVR 335
           +DSGT +++  + +YE ++D  + +++G   ++ +   D  CY+  G+   +L   P  R
Sbjct: 432 VDSGTTLSYFAEPSYEIIKDAFVKKVKGYPVIKDFPILDP-CYNVSGVEKMEL---PEFR 487

Query: 336 FHFRGGAKLALEKDSMFYQPRPDAF-CMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
             F  GA      ++ F +  P+   C+A+    +       S+IG   QQ +++ YD  
Sbjct: 488 ILFEDGAVWNFPVENYFIKLEPEEIVCLAI----LGTPRSALSIIGNYQQQNFHILYDTK 543

Query: 395 RKQKTFQRMDC 405
           + +  +  M C
Sbjct: 544 KSRLGYAPMKC 554


>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
 gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
          Length = 445

 Score =  114 bits (285), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 107/366 (29%), Positives = 155/366 (42%), Gaps = 48/366 (13%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           L+HR    +P  + +  P+  +      S ARL+Y    I S             + +  
Sbjct: 58  LLHRHGPCAPSLSTDTPPS--MSEMFRRSHARLSY----IVSGKKVSVPAHLGTSVKSLE 111

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD--CSPQLGTIFYPSRSSSYAYVPCDS 118
           +   +S G P VPQ   +DTGS L W+ C PC    CSPQ   +F PS SS+Y+ VPC S
Sbjct: 112 YVATVSFGTPAVPQVVVIDTGSDLTWLQCKPCSSGQCSPQKDPLFDPSHSSTYSAVPCAS 171

Query: 119 EHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
             C+      Y    +    C +   Y+ G  T      ++LT          V+D  FG
Sbjct: 172 GECKKLAADAYGSGCSNGQPCGFAISYVDGTSTVGVYGKDKLTL----APGAIVKDFYFG 227

Query: 176 CGFSTNRNFKFSGIFGLGIGRS-SLVSQ--LNSSFSYCIGSLHD-PDYLHNKLILGDGAI 231
           CG S +               S SL +Q      FSYC+ +++  P +L      G G  
Sbjct: 228 CGHSKSSLPGLFDGLLGLGRLSESLGAQYGGGGGFSYCLPAVNSKPGFLA----FGAGRN 283

Query: 232 IDEGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
                 TP+  + G      +TL  I+V G+ LD+ P+ F      GG+++DSGT VT L
Sbjct: 284 PSGFVFTPMGRVPGQPTFSTVTLAGITVGGKKLDLRPSAFS-----GGMIVDSGTVVTVL 338

Query: 289 VKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS--DLKGF-----PTVRFHFRGG 341
               Y ALR         E M++Y        HG + +  DL G+     P +   F GG
Sbjct: 339 QSTVYRALRAAFR-----EAMKAYRL-----VHGDLDTCYDLTGYKNVVVPKIALTFSGG 388

Query: 342 AKLALE 347
           A + L+
Sbjct: 389 ATINLD 394


>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 639

 Score =  114 bits (285), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 103/403 (25%), Positives = 178/403 (44%), Gaps = 54/403 (13%)

Query: 32  RLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYP 91
           RL +LQ  ++ H++     L  + ++N  +   + IG PP      +DTGS++ +V C  
Sbjct: 60  RLRHLQNLVKPHSSNARMRLHDDLLTNGYYTTRLWIGSPPQEFALIVDTGSTVTYVPCSN 119

Query: 92  CRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVF 151
           C  C       F P  SS+Y  V C+++         C     +C Y + Y     +S  
Sbjct: 120 CVQCGNHQDPRFQPELSSTYQPVKCNAD-------CNCDENGVQCTYERRYAEMSTSSGV 172

Query: 152 VSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF---KFSGIFGLGIGRSSLVSQL----- 203
           ++ + ++F    ES +  Q  VFGC    + +    +  GI GLG G  S++ QL     
Sbjct: 173 LAEDVMSFGK--ESELVPQRAVFGCETMESGDLYTQRADGIMGLGRGTLSVMDQLVGKGV 230

Query: 204 -NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH--------YYITLEAI 254
            ++SFS C G +           +G GA++  G ++P   +  H        Y I L+ I
Sbjct: 231 VSNSFSLCYGGMD----------VGGGAMVLGGISSPPGMVFSHSDPSRSPYYNIELKEI 280

Query: 255 SVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSW 314
            V G+ L +NP  F   D   G ++DSGT   +  ++AY A +D +M ++    ++  S 
Sbjct: 281 HVAGKPLKLNPRTF---DGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKI--SFLKQISG 335

Query: 315 PD----KLCYHGI---MSSDLKGFPTVRFHFRGGAKLALEKDSMFYQ--PRPDAFCMAVN 365
           PD     +C+ G    ++   K FP V   F  G K++L  ++  ++      A+C+ + 
Sbjct: 336 PDPNFKDICFSGAGRDVTELPKVFPEVDMVFANGQKISLSPENYLFRHTKVSGAYCLGI- 394

Query: 366 PASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
                N     +L+G +  +   V Y+       F + +C  L
Sbjct: 395 ---FKNGNDQTTLLGGIIVRNTLVTYNRENSTIGFWKTNCSEL 434


>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 471

 Score =  114 bits (285), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 106/370 (28%), Positives = 164/370 (44%), Gaps = 38/370 (10%)

Query: 52  PSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSS 110
           P   +    +   + +G P       +DTGSSL W+ C PC   C  Q+G +F P  SS+
Sbjct: 125 PGTSVGVGNYVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDPRASST 184

Query: 111 YAYVPCDSEHCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES 165
           YA V C +  C     A      CSA  + CIY   Y     +   +ST+ ++F +T   
Sbjct: 185 YASVRCSASQCDELQAATLNPSACSA-SNVCIYQASYGDSSFSVGSLSTDTVSFGSTRYP 243

Query: 166 TIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYL 220
           + +     +GCG      F + +G+ GL   + SL+ QL      SFSYC+ +     YL
Sbjct: 244 SFY-----YGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPTAASTGYL 298

Query: 221 HNKLILGDGAIIDEGDATPL--QFIDGH-YYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
                +G          TP+    +D   Y+ITL  +SV G  L ++P+ +    +    
Sbjct: 299 S----IGPYNTGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSSLPT---- 350

Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQ-MRSYSWPDKLCYHGIMSSDLKGFPTVRF 336
           +IDSGT +T L    + AL   V   + G Q   ++S  D  C+ G  +S L+  PTV  
Sbjct: 351 IIDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSILDT-CFEG-QASQLR-VPTVAM 407

Query: 337 HFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
            F GGA + L   ++         C+A  P        + ++IG   QQ ++V YD+ + 
Sbjct: 408 AFAGGASMKLTTRNVLIDVDDSTTCLAFAPTD------STAIIGNTQQQTFSVIYDVAQS 461

Query: 397 QKTFQRMDCE 406
           +  F    C 
Sbjct: 462 RIGFSAGGCS 471


>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
 gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
          Length = 470

 Score =  114 bits (285), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 107/409 (26%), Positives = 174/409 (42%), Gaps = 42/409 (10%)

Query: 12  NNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPP 71
           N P+  P   +++    + A    L + + S       + P   +    +   + +G P 
Sbjct: 90  NAPSRRPTTSLRKPKAAAGASGGPLDDSLAS-----VPLTPGTSVGVGNYVTELGLGTPA 144

Query: 72  VPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYAR-- 128
                 +DTGSSL W+ C PC   C  Q+G ++ P  SS+YA VPC +  C     A   
Sbjct: 145 TSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLYDPRASSTYATVPCSASQCDELQAATLN 204

Query: 129 ---CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF- 184
              CS  ++ CIY   Y     +  ++S + ++F +      +     +GCG      F 
Sbjct: 205 PSACSV-RNVCIYQASYGDSSFSVGYLSRDTVSFGSGSYPNFY-----YGCGQDNEGLFG 258

Query: 185 KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLH-NKLILGDGAIIDEGDATP 239
           + +G+ GL   + SL+ QL      SFSYC+ +     YL       G  +      ++ 
Sbjct: 259 RSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPTPASTGYLSIGPYTSGHYSYTPMASSS- 317

Query: 240 LQFIDGH-YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRD 298
              +D   Y++TL  +SV G  L ++P  +    +    +IDSGT +T L    Y AL  
Sbjct: 318 ---LDASLYFVTLSGMSVGGSPLAVSPAEYSSLPT----IIDSGTVITRLPTAVYTALSK 370

Query: 299 EVMIRLEGEQ-MRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP 357
            V   + G Q   ++S  D  C+ G  +S L+  P V   F GGA L L   ++      
Sbjct: 371 AVAAAMVGVQSAPAFSILDT-CFQG-QASQLR-VPAVAMAFAGGATLKLATQNVLIDVDD 427

Query: 358 DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
              C+A  P        + ++IG   QQ ++V YD+ + +  F    C 
Sbjct: 428 STTCLAFAPTD------STTIIGNTQQQTFSVVYDVAQSRIGFAAGGCS 470


>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
          Length = 451

 Score =  114 bits (285), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 122/419 (29%), Positives = 186/419 (44%), Gaps = 46/419 (10%)

Query: 20  HRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTA-- 77
            R+ R    S AR A L ++      Y   +  +   S+  + ++ +IG P  PQ  A  
Sbjct: 49  ERLSRMAVRSRARAASLYQR---GGHYGQPVTATAVPSSGEYLIHFNIGTP-RPQRVALT 104

Query: 78  MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRY---FPYARCSAYKH 134
           MDTGS L+W  C PC  C  Q   +F PS SS++  V C    CR       + C+    
Sbjct: 105 MDTGSDLVWTQCTPCPVCFDQPFPLFDPSVSSTFRAVACPDPICRPSSGLSVSACALKTF 164

Query: 135 RCIYTQLYLIGPETSVFVSTEQLTFKNTD---ESTIHVQDVVFGCG------FSTNRNFK 185
           RC Y   Y     T+ ++  +  TF + +      + V  + FGCG      F++N    
Sbjct: 165 RCFYLCSYGDKSITAGYIFKDTFTFMSPNGEGAPPVAVSGLAFGCGDYNTGVFASNE--- 221

Query: 186 FSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNK---LILG---DGAIIDEGD-- 236
            SGI G G G  SL SQL    FSYC+ S HD +   NK   + LG   +G         
Sbjct: 222 -SGIAGFGRGPLSLPSQLRVGRFSYCLTS-HD-ETESNKTSAVFLGTPPNGLRAHSSGPF 278

Query: 237 -ATPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKE 291
            +TP+         YY++LE I+V    L ++ ++F  + D  GG +IDSGT VT     
Sbjct: 279 RSTPIIHSPSFPTFYYLSLEGITVGKTRLPVDSSVFALKKDGSGGTVIDSGTGVTTFPAA 338

Query: 292 AYEALRDEVMIRLEGEQMRSYS-WPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS 350
            +E L++E + +L   +  + S   + LC+           P + FH    A + L +++
Sbjct: 339 VFEQLKNEFVAQLPLPRYDNTSEVGNLLCFQRPKGGKQVPVPKLIFHL-ASADMDLPREN 397

Query: 351 MFYQPR-PDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
             Y P   D+  M +    IN   V+  LIG   QQ  ++ YD+   +  F    C+ +
Sbjct: 398 --YIPEDTDSGVMCL---MINGAEVDMVLIGNFQQQNMHIVYDVENSKLLFASAQCDKM 451


>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
 gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
          Length = 482

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 115/424 (27%), Positives = 170/424 (40%), Gaps = 47/424 (11%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           L+HRDS        N + A  + R +   + R A++  K  +    +   + +   ++  
Sbjct: 70  LVHRDSFAV-----NASAADLLARRLQRDMRRAAWIITKAATPADPENGTVVTGAPTSGE 124

Query: 61  FYVNISIGQP-----PVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
           +   I++G P           + D GS + W+ C PC  C  Q G ++   +SSS + V 
Sbjct: 125 YIAKITVGTPYENDSSFEALLSPDMGSDVTWLQCMPCFRCYHQPGPVYNRLKSSSASDVG 184

Query: 116 CDSEHCRYF-PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
           C +  CR       C  + + C Y   Y  G  ++     E LTF       + V  V  
Sbjct: 185 CYAPACRALGSSGGCVQFLNECQYKVEYGDGSSSAGDFGVETLTFP----PGVRVPGVAI 240

Query: 175 GCGFSTNRNF--KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGD 228
           GCG      F    +GI GLG G  S  SQ+      SFSYC+          + L  G 
Sbjct: 241 GCGSDNQGLFPAPAAGILGLGRGSLSFPSQIAGRYGRSFSYCLAG-QGTGGRSSTLTFGS 299

Query: 229 GAIIDEGDATPLQF--------IDGHYYITLEAISVDG-RMLDINPNIFKRDDSG--GGV 277
           GA       TP  F        +   YY+ L  ISV G R+  +  +  + D S   GGV
Sbjct: 300 GASATTTTTTPPSFTPMLTNSRMYTFYYVGLVGISVGGVRVRGVTESDLRLDPSTGHGGV 359

Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK--------LCYHGIMSSDLK 329
           ++DSGT VT L   AY A RD   +      ++   WP           CY  +    +K
Sbjct: 360 IVDSGTAVTRLSGPAYAAFRDAFRV----AAVKELGWPSPGGPFAFFDTCYSSVRGRVMK 415

Query: 330 GFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNV 389
             P V  HF GG ++ L   +       +   M    A   +R V  S+IG +  Q + V
Sbjct: 416 KVPAVSMHFAGGVEVKLPPQNYLIPVDSNKGTMCFAFAGSGDRGV--SIIGNIQLQGFRV 473

Query: 390 GYDI 393
            YD+
Sbjct: 474 VYDV 477


>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 473

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 100/368 (27%), Positives = 154/368 (41%), Gaps = 29/368 (7%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGT-----IFYPSRSSSYAYVP 115
           ++V   +G P  P     DTGS L WV C   R  SP         +F P+ S S+A +P
Sbjct: 110 YFVQFRVGTPAQPFVLVADTGSDLTWVKCRGRRASSPDASPLASPRVFRPANSKSWAPIP 169

Query: 116 CDSEHCR-YFPY--ARCSAYK---HRCIYTQLYLIGPETSVFVSTEQLTFK---NTDEST 166
           C S+ C+ Y P+  A CSA       C Y   Y         V T+  T     +  +  
Sbjct: 170 CSSDTCKSYVPFSLANCSAGTTPPAPCGYDYRYKDKSSARGVVGTDAATIALSGSGSDRK 229

Query: 167 IHVQDVVFGCGFSTN-RNFKFS-GIFGLGIGRSSLVS----QLNSSFSYCIGSLHDPDYL 220
             +Q+VV GC  S + ++F+ S G+  LG    S  S    +    FSYC+     P   
Sbjct: 230 AKLQEVVLGCTTSYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRNA 289

Query: 221 HNKLILGDGAIIDEGDATPLQF---IDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
            + L  G          TPL     +   Y +T++A+SV G+ L+I   ++    +GG +
Sbjct: 290 TSYLTFGPVGAAHSPSRTPLLLDAQVAPFYAVTVDAVSVAGKALNIPAEVWDVKKNGGAI 349

Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFH 337
           + DSGT +T L   AY+A+   +  +L     R    P + CY+   +      P +   
Sbjct: 350 L-DSGTSLTILATPAYKAVVAALSKQL-ARVPRVTMDPFEYCYNWTATRRPPAVPRLEVR 407

Query: 338 FRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
           F G A+L     S      P   C+ +        +   S+IG + QQ +   +D+  + 
Sbjct: 408 FAGSARLRPPTKSYVIDAAPGVKCIGLQ----EGVWPGVSVIGNILQQEHLWEFDLANRW 463

Query: 398 KTFQRMDC 405
             FQ   C
Sbjct: 464 LRFQESRC 471


>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 609

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 103/403 (25%), Positives = 178/403 (44%), Gaps = 54/403 (13%)

Query: 32  RLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYP 91
           RL +LQ  ++ H++     L  + ++N  +   + IG PP      +DTGS++ +V C  
Sbjct: 60  RLRHLQNLVKPHSSNARMRLHDDLLTNGYYTTRLWIGSPPQEFALIVDTGSTVTYVPCSN 119

Query: 92  CRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVF 151
           C  C       F P  SS+Y  V C+++         C     +C Y + Y     +S  
Sbjct: 120 CVQCGNHQDPRFQPELSSTYQPVKCNAD-------CNCDENGVQCTYERRYAEMSTSSGV 172

Query: 152 VSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF---KFSGIFGLGIGRSSLVSQL----- 203
           ++ + ++F    ES +  Q  VFGC    + +    +  GI GLG G  S++ QL     
Sbjct: 173 LAEDVMSFGK--ESELVPQRAVFGCETMESGDLYTQRADGIMGLGRGTLSVMDQLVGKGV 230

Query: 204 -NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH--------YYITLEAI 254
            ++SFS C G +           +G GA++  G ++P   +  H        Y I L+ I
Sbjct: 231 VSNSFSLCYGGMD----------VGGGAMVLGGISSPPGMVFSHSDPSRSPYYNIELKEI 280

Query: 255 SVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSW 314
            V G+ L +NP  F   D   G ++DSGT   +  ++AY A +D +M ++    ++  S 
Sbjct: 281 HVAGKPLKLNPRTF---DGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKI--SFLKQISG 335

Query: 315 PD----KLCYHGI---MSSDLKGFPTVRFHFRGGAKLALEKDSMFYQ--PRPDAFCMAVN 365
           PD     +C+ G    ++   K FP V   F  G K++L  ++  ++      A+C+ + 
Sbjct: 336 PDPNFKDICFSGAGRDVTELPKVFPEVDMVFANGQKISLSPENYLFRHTKVSGAYCLGI- 394

Query: 366 PASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
                N     +L+G +  +   V Y+       F + +C  L
Sbjct: 395 ---FKNGNDQTTLLGGIIVRNTLVTYNRENSTIGFWKTNCSEL 434


>gi|21618176|gb|AAM67226.1| unknown [Arabidopsis thaliana]
          Length = 430

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 96/365 (26%), Positives = 169/365 (46%), Gaps = 32/365 (8%)

Query: 63  VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR 122
           +++ IG PP  Q   +DTGS L W+ C+  +   P+  T F PS SSS++ +PC    C+
Sbjct: 74  ISLPIGTPPQAQQMVLDTGSQLSWIQCH-RKKLPPKPKTSFDPSLSSSFSTLPCSHPLCK 132

Query: 123 Y----FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
                F           C Y+  Y  G      +  E++TF NT+ +      ++ GC  
Sbjct: 133 PRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTEIT----PPLILGCAT 188

Query: 179 STNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHD-PDYL-HNKLILGDG------ 229
            ++ +    GI G+  GR S VSQ   S FSYCI    + P +       LGD       
Sbjct: 189 ESSDD---RGILGMNRGRLSFVSQAKISKFSYCIPPKSNRPGFTPTGSFYLGDNPNSHGF 245

Query: 230 ---AIIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMIDSGTD 284
              +++   ++  +  +D   Y + +  I    + L+I+ ++F+ D  G G  M+DSG++
Sbjct: 246 KYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVFRPDAGGSGQTMVDSGSE 305

Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRS---YSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
            T LV  AY+ +R E+M R+ G +++    Y     +C+ G ++   +    + F F  G
Sbjct: 306 FTHLVDAAYDKVRAEIMTRV-GRRLKKGYVYGGTADMCFDGNVAMIPRLIGDLVFVFTRG 364

Query: 342 AKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQ 401
            ++ + K+ +         C+ +  +S+     N  +IG + QQ   V +D+  ++  F 
Sbjct: 365 VEIFVPKERVLVNVGGGIHCVGIGRSSMLGAASN--IIGNVHQQNLWVEFDVTNRRVGFA 422

Query: 402 RMDCE 406
           + DC 
Sbjct: 423 KADCS 427


>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
          Length = 519

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 100/357 (28%), Positives = 152/357 (42%), Gaps = 29/357 (8%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYVPCDSE 119
           + V + +G P        DTGS   WV C PC   C  Q   +F P+RSS+YA V C + 
Sbjct: 180 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTYANVSCAAP 239

Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
            C       CS     C+Y   Y  G  +  F + + LT  + D     V+   FGCG  
Sbjct: 240 ACSDLNIHGCSG--GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDA----VKGFRFGCGER 293

Query: 180 TNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
               F + +G+ GLG G++SL  Q        F++C+ +          L  G G++   
Sbjct: 294 NEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTGT---GYLDFGAGSLAAA 350

Query: 235 GD--ATPLQFIDG--HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVK 290
                TP+   +G   YY+ +  I V G++L I  ++F       G ++DSGT +T L  
Sbjct: 351 RARLTTPMLTENGPTFYYVGMTGIRVGGQLLSIPQSVFAT----AGTIVDSGTVITRLPP 406

Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKL--CYHGIMSSDLKGFPTVRFHFRGGAKLALEK 348
            AY +LR      +     +       L  CY     S +   PTV   F+GGA+L ++ 
Sbjct: 407 AAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVA-IPTVSLLFQGGARLDVDA 465

Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
             + Y       C+A    + N    +  ++G    + + V YDIG+K   F    C
Sbjct: 466 SGIMYAASASQVCLAF---AANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYPGAC 519


>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 641

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 102/372 (27%), Positives = 166/372 (44%), Gaps = 40/372 (10%)

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
           +SN  +   + IG PP      +DTGS++ +V C  C  C       F P  SS+Y  + 
Sbjct: 83  LSNGYYTTRLFIGTPPQEFALIVDTGSTVTYVPCSTCEQCGKHQDPRFQPESSSTYKPMQ 142

Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
           C+       P   C     +C Y + Y     +S  ++ + L+F N  ES +  Q  +FG
Sbjct: 143 CN-------PSCNCDDEGKQCTYERRYAEMSSSSGLLAEDVLSFGN--ESELTPQRAIFG 193

Query: 176 C-GFSTNRNF--KFSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLIL 226
           C    T   F  +  GI GLG G  S+V QL       +SFS C G +   D +   ++L
Sbjct: 194 CETVETGELFSQRADGIMGLGRGPLSVVDQLVIKEVVGNSFSLCYGGM---DVVGGAMVL 250

Query: 227 GD-GAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDV 285
           G+     D   A    +   +Y I L+ + V G+ L +NP +F   D   G ++DSGT  
Sbjct: 251 GNIPPPPDMVFAHSDPYRSAYYNIELKELHVAGKRLKLNPRVF---DGKHGTVLDSGTTY 307

Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK----LCYHGI---MSSDLKGFPTVRFHF 338
            +L +EA+ A +D ++  ++   ++    PD     +C+ G    +S   K FP V   F
Sbjct: 308 AYLPEEAFVAFKDAIIKEIKF--LKQIHGPDPSYNDICFSGAGRDVSQLSKIFPEVNMVF 365

Query: 339 RGGAKLALEKDSMFYQ--PRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
             G KL+L  ++  ++      A+C+ +      N     +L+G +  +   V YD    
Sbjct: 366 GNGQKLSLSPENYLFRHTKVSGAYCLGI----FQNGKDPTTLLGGIVVRNTLVTYDRDND 421

Query: 397 QKTFQRMDCEVL 408
           +  F + +C  L
Sbjct: 422 KIGFWKTNCSEL 433


>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
 gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
          Length = 468

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 95/311 (30%), Positives = 143/311 (45%), Gaps = 33/311 (10%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
           L+Y  + +G PP   +  +DTGS +LWV C  C  C    G       F P  S + + +
Sbjct: 51  LYYTRLQLGTPPRDFYVQIDTGSDVLWVSCGSCNGCPVNSGLHIPLNFFDPGSSPTASLI 110

Query: 115 PCDSEHCRYFPYAR---CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI---H 168
            C  + C     +    CSA  + C Y   Y  G  TS +  ++ L F      ++    
Sbjct: 111 SCSDQRCSLGLQSSDSVCSAQNNLCGYNFQYGDGSGTSGYYVSDLLHFDTVLGGSVMNNS 170

Query: 169 VQDVVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNS------SFSYCIGSLHDP 217
              +VFGC     G  T  +    GIFG G    S+VSQL S      +FS+C   L   
Sbjct: 171 SAPIVFGCSALQTGDLTKSDRAVDGIFGFGQQDMSVVSQLASQGISPRAFSHC---LKGD 227

Query: 218 DYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
           D     L+LG+  I++     TPL     HY + +++ISV+G+ L I+P++F    S  G
Sbjct: 228 DSGGGILVLGE--IVEPNIVYTPLVPSQPHYNLNMQSISVNGQTLAIDPSVFGTSSS-QG 284

Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG-FPTVR 335
            +IDSGT + +L + AY+     +   +    +R Y      CY  ++SS +   FP V 
Sbjct: 285 TIIDSGTTLAYLAEAAYDPFISAIT-SIVSPSVRPYLSKGNHCY--LISSSINDIFPQVS 341

Query: 336 FHFRGGAKLAL 346
            +F GGA + L
Sbjct: 342 LNFAGGASMIL 352


>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
 gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
          Length = 471

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 119/419 (28%), Positives = 177/419 (42%), Gaps = 50/419 (11%)

Query: 1   LIHRDSILS-PYNNPNENPAHRVQRGINISIARLAYLQEKIRSH--------NTYQAQIL 51
           L+HRD   S  Y N +     R++R  +   A L  +  K+           N + + ++
Sbjct: 63  LLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVVVASSDSRYEVNDFGSDVV 122

Query: 52  PSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSY 111
              D  +  ++V I +G PP  Q+  +D+GS ++WV C PC+ C  Q   +F P++S SY
Sbjct: 123 SGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSY 182

Query: 112 AYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
             V C S  C     + C  +   C Y  +Y  G  T   ++ E LTF  T      V++
Sbjct: 183 TGVSCGSSVCDRIENSGC--HSGGCRYEVMYGDGSYTKGTLALETLTFAKT-----VVRN 235

Query: 172 VVFGCGFSTNRNFKFSGIFGLGIGRS-SLVSQLN----SSFSYCIGSLHDPDYLHNKLIL 226
           V  GCG      F  +       G S S V QL+     +F YC+ S          L+ 
Sbjct: 236 VAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDS--TGSLVF 293

Query: 227 GDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSG 282
           G  A+       PL         YY+ L+ + V G  + +   +F   ++G GGV++D+G
Sbjct: 294 GREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTG 353

Query: 283 TDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----PTVRFH 337
           T VT L   AY A RD    +       S       CY      DL GF     PTV F+
Sbjct: 354 TAVTRLPTGAYAAFRDGFKSQTANLPRASGVSIFDTCY------DLSGFVSVRVPTVSFY 407

Query: 338 FRGGAKLALEKDSMFYQPRPDA----FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
           F  G  L L   + F  P  D+    F  A +P  +       S+IG + Q+   V +D
Sbjct: 408 FTEGPVLTLPARN-FLMPVDDSGTYCFAFAASPTGL-------SIIGNIQQEGIQVSFD 458


>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 485

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 104/375 (27%), Positives = 163/375 (43%), Gaps = 40/375 (10%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYA 112
             L++  I IG P    +  +DTGS +LWV+C  C  C  + G     T++ PS SSS  
Sbjct: 78  TGLYFTQIGIGTPAKSYYVQVDTGSDILWVNCVFCDTCPRKSGLGIELTLYDPSGSSSGT 137

Query: 113 YVPCDSEHC-----RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT---DE 164
            V C  + C        P    +A    C Y+  Y  G  T+ F  T+ L +       +
Sbjct: 138 GVTCGQDFCVATHGGVIPSCVPAA---PCQYSISYGDGSSTTGFFVTDFLQYNQVSGNSQ 194

Query: 165 STIHVQDVVFGCGFSTNRNFKFS-----GIFGLGIGRSSLVSQLNSS------FSYCIGS 213
           +T+    + FGCG     +   S     GI G G   SS++SQL ++      F++C+  
Sbjct: 195 TTLANTSITFGCGAKIGGDLGSSSQALDGILGFGQSNSSMLSQLAAAGKVRKVFAHCL-- 252

Query: 214 LHDPDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDD 272
               D ++   I   G ++  +   TPL     HY + LEAI V G  L +  NIF   +
Sbjct: 253 ----DTINGGGIFAIGDVVQPKVSTTPLVPGMPHYNVNLEAIDVGGVKLQLPTNIFDIGE 308

Query: 273 SGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFP 332
           S  G +IDSGT + +L    Y A+  +V  +     +++    D  C+    S D  GFP
Sbjct: 309 S-KGTIIDSGTTLAYLPGVVYNAIMSKVFAQYGDMPLKNDQ--DFQCFRYSGSVD-DGFP 364

Query: 333 TVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR-YVNFSLIGMMAQQFYNVGY 391
            + FHF GG  L +      +Q   + +CM      +  +   +  L+G +A     V Y
Sbjct: 365 IITFHFEGGLPLNIHPHDYLFQ-NGELYCMGFQTGGLQTKDGKDMVLLGDLAFSNRLVLY 423

Query: 392 DIGRKQKTFQRMDCE 406
           D+  +   +   +C 
Sbjct: 424 DLENQVIGWTDYNCS 438


>gi|413925432|gb|AFW65364.1| hypothetical protein ZEAMMB73_378208 [Zea mays]
          Length = 418

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 110/405 (27%), Positives = 180/405 (44%), Gaps = 40/405 (9%)

Query: 17  NPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFT 76
            P     R  + S  RL+ L  ++ + +   AQ     D     + +  S+G PP     
Sbjct: 37  EPTINFTRAAHRSRERLSILATRLGAASAGSAQSPLQMDSGGGAYDMTFSMGTPPQTLSA 96

Query: 77  AMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYF---PYARCSAYK 133
             DTGS L+W  C  C+ C+P+    +YP++SSS++ +PC S  CR       A C   +
Sbjct: 97  LADTGSDLIWAKCGACKRCAPRGSASYYPTKSSSFSKLPCSSALCRTLESQSLATCGGTR 156

Query: 134 HR---CIYTQLYLIGPE----TSVFVSTEQLTFKNTDESTIHVQDVVFGC-GFSTNRNFK 185
            R   C Y   Y +       T  ++ +E  T  +       VQ + FGC   S      
Sbjct: 157 ARGAVCSYRYSYGLSSNPHHYTQGYMGSETFTLGSD-----AVQGIGFGCTTMSEGGYGS 211

Query: 186 FSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEG-DATPLQFI 243
            SG+ GLG G+ SLV QL   +FSYC+ S  DP    + L+ G GA+   G  +TPL  +
Sbjct: 212 GSGLVGLGRGKLSLVRQLKVGAFSYCLTS--DPS-TSSPLLFGAGALTGPGVQSTPLVNL 268

Query: 244 DGH--YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVM 301
                Y + L++IS+        P   +      G++ DSGT +T+L + AY      ++
Sbjct: 269 KTSTFYTVNLDSISIGAAK---TPGTGRH-----GIIFDSGTTLTFLAEPAYTLAEAGLL 320

Query: 302 IRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFC 361
            +         +   ++C+    +S    FP++  HF GG  +AL+ ++ F        C
Sbjct: 321 SQTTNLTRVPGTDGYEVCFQ---TSGGAVFPSMVLHFDGG-DMALKTENYFGAVNDSVSC 376

Query: 362 MAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
             V  +         S++G + Q  Y++ YD+ +   +FQ  +C+
Sbjct: 377 WLVQKSP-----SEMSIVGNIMQMDYHIRYDLDKSVLSFQPTNCD 416


>gi|414878073|tpg|DAA55204.1| TPA: hypothetical protein ZEAMMB73_344109 [Zea mays]
          Length = 440

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 118/422 (27%), Positives = 181/422 (42%), Gaps = 45/422 (10%)

Query: 15  NENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQ 74
           N +   R++R    +  RLA + E     +  ++Q           +     IG PP   
Sbjct: 36  NCSTEERMRRATERTHRRLASMGEASAPVHWAESQ-----------YIAEYLIGDPPQQA 84

Query: 75  FTAMDTGSSLLWVHCYPCR--DCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAY 132
              +DTGS+L+W  C  C+   C  Q  + + PSRS +   V C+   C      RC+  
Sbjct: 85  EAIIDTGSNLIWTQCSTCQPAGCFSQNLSFYDPSRSRTARPVACNDTACALGSETRCARD 144

Query: 133 KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFK----FSG 188
              C     Y  G    V + TE  TF+   E+      + FGC  +T          SG
Sbjct: 145 NKACAVLTAYGAGVIGGV-LGTEAFTFQPQSENV----SLAFGCIAATRLTPGSLDGASG 199

Query: 189 IFGLGIGRSSLVSQL-NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDA--TPLQFIDG 245
           I GLG G  SLVSQL ++ FSYC+          ++L +G  A +  G A  T + F+  
Sbjct: 200 IIGLGRGNLSLVSQLGDNKFSYCLTPYFSQSTNTSRLFVGASAGLSSGGAPATSVPFLKN 259

Query: 246 --------HYYITLEAISVDGRMLDINPNIFK-RDDSGG---GVMIDSGTDVTWLVKEAY 293
                    YY+ L  I+V    L +    F  R  + G   G +IDSG+  T LV  AY
Sbjct: 260 PDVDPFSTFYYLPLTGITVGDAKLAVPEAAFDLRQVATGLWAGTLIDSGSPFTSLVDVAY 319

Query: 294 EALRDEVMIRLEGEQMRSYSWPD--KLCYHGIMSSDLKGFPTVRFHF-RGGAKLALEKDS 350
           +ALRDE++ +L    +   +  +   LC         K  P +  HF  GG  +A+  ++
Sbjct: 320 QALRDELVQQLGASIVPPPAGAEGLDLCAAVAHGDVGKLVPPLVLHFGSGGGDVAVPPEN 379

Query: 351 MFYQPRPDA-FCMAVNPASINNRYVNF---SLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
            ++ P  D+  CM V  +   N  +     ++IG   QQ  ++ YD+ +   +FQ  DC 
Sbjct: 380 -YWGPVDDSTACMVVFSSGGPNSTLPMNETTIIGNYMQQDMHLLYDLEKGMLSFQPADCS 438

Query: 407 VL 408
            +
Sbjct: 439 SM 440


>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 641

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 103/377 (27%), Positives = 163/377 (43%), Gaps = 54/377 (14%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
           N  +   + IG PP      +DTGS++ +V C  C  C       F P  SS+Y  + C+
Sbjct: 80  NGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESSSTYKPIKCN 139

Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
            +         C +   +C+Y + Y     +S  +  + ++F N  +S +  Q  VFGC 
Sbjct: 140 ID-------CICDSDGVQCVYERQYAEMSTSSGVLGEDVISFGN--QSELIPQRAVFGCE 190

Query: 177 GFSTNRNF--KFSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLILGD 228
              T   F  +  GI GLG G  SLV QL      N SFS C G +           +G 
Sbjct: 191 NMETGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMD----------IGG 240

Query: 229 GAIIDEGDATPLQFI--------DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
           GA++  G + P   I          +Y + L+ I V G+ L ++  IF   D   G ++D
Sbjct: 241 GAMVLGGISPPSDMIFTYSDPVRSPYYNVDLKEIHVAGKKLPLSSGIF---DGRYGAVLD 297

Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD----KLCYHGIMSSDLK---GFPT 333
           SGT   +L  EA+ A +D +M  +    ++    PD     +C+ G  S   +    FPT
Sbjct: 298 SGTTYAYLPAEAFSAFKDAIMDEI--HSLKKIDGPDPNFKDICFSGAGSDAAELSNKFPT 355

Query: 334 VRFHFRGGAKLALEKDSMFYQPRP--DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
           V   F  G KL+L  ++ F++      A+C+ +      N     +L+G +  +   V Y
Sbjct: 356 VDMVFENGQKLSLTPENYFFRHSKVHGAYCLGI----FENGNDQTTLLGGIVVRNTLVMY 411

Query: 392 DIGRKQKTFQRMDCEVL 408
           D    +  F + +C  L
Sbjct: 412 DRANSKIGFWKTNCSEL 428


>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 529

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 98/376 (26%), Positives = 163/376 (43%), Gaps = 42/376 (11%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           +++++ +G PP      +DTGS L W+ C PC DC  Q    + P  S+S+  + C+   
Sbjct: 162 YFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNEAFYDPKTSASFKNITCNDPR 221

Query: 121 CRYF----PYARCSAYKHRCIYTQLYLIGPETS----VFVSTEQLTFKNTDESTIHVQDV 172
           C       P  +C +    C Y   Y     T+    V   T  LT      S   V+++
Sbjct: 222 CSLISSPEPPVQCKSDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGRSSEYKVENM 281

Query: 173 VFGCGFSTNRNFKFSGIFGLGIG-----------RSSLVSQLNSSFSYCIGSLHDPDYLH 221
           +FGCG   NR     G+F    G            S L S    SFSYC+   +    + 
Sbjct: 282 MFGCG-HWNR-----GLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVS 335

Query: 222 NKLILG-DGAIIDEGDATPLQFIDGH-------YYITLEAISVDGRMLDINPNIFK-RDD 272
           +KLI G D  +++  +     F++G        YYI +++I V G  LDI    +    D
Sbjct: 336 SKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGEALDIPEETWNISPD 395

Query: 273 SGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD-KLCYH--GIMSSDLK 329
             GG +IDSGT +++  + AYE ++++   +++   +    +P    C++  GI  +++ 
Sbjct: 396 GAGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYLVFRDFPVLDPCFNVSGIEENNIH 455

Query: 330 GFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNV 389
             P +   F  GA      ++ F     D  C+A+    +      FS+IG   QQ +++
Sbjct: 456 -LPELGIAFADGAVWNFPAENSFIWLSEDLVCLAI----LGTPKSTFSIIGNYQQQNFHI 510

Query: 390 GYDIGRKQKTFQRMDC 405
            YD    +  F    C
Sbjct: 511 LYDTKMSRLGFTPTKC 526


>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 641

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 103/377 (27%), Positives = 163/377 (43%), Gaps = 54/377 (14%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
           N  +   + IG PP      +DTGS++ +V C  C  C       F P  SS+Y  + C+
Sbjct: 80  NGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESSSTYKPIKCN 139

Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
            +         C +   +C+Y + Y     +S  +  + ++F N  +S +  Q  VFGC 
Sbjct: 140 ID-------CICDSDGVQCVYERQYAEMSTSSGVLGEDVISFGN--QSELIPQRAVFGCE 190

Query: 177 GFSTNRNF--KFSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLILGD 228
              T   F  +  GI GLG G  SLV QL      N SFS C G +           +G 
Sbjct: 191 NMETGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMD----------IGG 240

Query: 229 GAIIDEGDATPLQFI--------DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
           GA++  G + P   I          +Y + L+ I V G+ L ++  IF   D   G ++D
Sbjct: 241 GAMVLGGISPPSDMIFTYSDPVRSPYYNVDLKEIHVAGKKLPLSSGIF---DGRYGAVLD 297

Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD----KLCYHGIMSSDLK---GFPT 333
           SGT   +L  EA+ A +D +M  +    ++    PD     +C+ G  S   +    FPT
Sbjct: 298 SGTTYAYLPAEAFSAFKDAIMDEI--HSLKKIDGPDPNFKDICFSGAGSDAAELSNKFPT 355

Query: 334 VRFHFRGGAKLALEKDSMFYQPRP--DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
           V   F  G KL+L  ++ F++      A+C+ +      N     +L+G +  +   V Y
Sbjct: 356 VDMVFENGQKLSLTPENYFFRHSKVHGAYCLGI----FENGNDQTTLLGGIVVRNTLVMY 411

Query: 392 DIGRKQKTFQRMDCEVL 408
           D    +  F + +C  L
Sbjct: 412 DRANSKIGFWKTNCSEL 428


>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
 gi|224030089|gb|ACN34120.1| unknown [Zea mays]
 gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
          Length = 491

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 106/403 (26%), Positives = 175/403 (43%), Gaps = 49/403 (12%)

Query: 40  IRSHN-TYQAQILPSNDI---------SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC 89
           +R+H+ T   ++L + D+            L+Y  I +G PP   +  +DTGS +LWV+C
Sbjct: 55  LRAHDGTRHGRLLAAADLPLGGLGLPTDTGLYYTEIKLGTPPKHYYVQVDTGSDILWVNC 114

Query: 90  YPCRDCSPQLG-----TIFYPSRSSSYAYVPCDSEHCRYF---PYARCSAYKHRCIYTQL 141
             C  C  + G     T++ P  SS+ + V CD   C         +C A    C Y+  
Sbjct: 115 ITCEQCPHKSGLGLDLTLYDPKASSTGSMVMCDQAFCAATFGGKLPKCGA-NVPCEYSVT 173

Query: 142 YLIGPETSVFVSTEQLTFKNT---DESTIHVQDVVFGCGFST-----NRNFKFSGIFGLG 193
           Y  G  T     T+ L F       ++      V+FGCG        + N    GI G G
Sbjct: 174 YGDGSSTIGSFVTDALQFDQVTRDGQTQPANASVIFGCGAQQGGDLGSSNQALDGILGFG 233

Query: 194 IGRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFIDGH 246
              +S++SQL ++      F++C+      D +    I   G ++  +   TPL     H
Sbjct: 234 EANTSMLSQLTTAGKVKKIFAHCL------DTIKGGGIFSIGDVVQPKVKTTPLVADKPH 287

Query: 247 YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRL-- 304
           Y + L+ I V G  L +  +IF+  +   G +IDSGT +T+L +  ++    EVM+ +  
Sbjct: 288 YNVNLKTIDVGGTTLQLPAHIFEPGEK-KGTIIDSGTTLTYLPELVFK----EVMLAVFN 342

Query: 305 EGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAV 364
           + + +  +     LC+    S D  GFPT+ FHF     L +     F+    D +C+  
Sbjct: 343 KHQDITFHDVQGFLCFQYPGSVD-DGFPTITFHFEDDLALHVYPHEYFFANGNDVYCVGF 401

Query: 365 -NPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
            N AS +    +  L+G +      V YD+  +   +   +C 
Sbjct: 402 QNGASQSKDGKDIVLMGDLVLSNKLVIYDLENRVIGWTDYNCS 444


>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
          Length = 453

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 120/422 (28%), Positives = 180/422 (42%), Gaps = 55/422 (13%)

Query: 22  VQRGINISIARLAYLQEKIRSHNTYQAQILPSND----------ISNSLFYV-NISIGQP 70
           ++R +  S AR A L   +R+   +   I  + +           S  L YV ++++G P
Sbjct: 49  IRRAMQRSKARAAAL-SVVRNGGGFYGSIAQAREREREPGMAVRASGDLEYVLDLAVGTP 107

Query: 71  PVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCS 130
           P P    +DTGS L+W  C  C  C  Q   +F P  SSSY  + C  + C    +  C 
Sbjct: 108 PQPITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMSSSYEPMRCAGQLCGDILHHSC- 166

Query: 131 AYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG-FSTNRNFKFSGI 189
                C Y   Y  G  T  + +TE+ TF ++   T  V  + FGCG  +       SGI
Sbjct: 167 VRPDTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVP-LGFGCGTMNVGSLNNASGI 225

Query: 190 FGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEG---DAT-PLQFI- 243
            G G    SLVSQL+   FSYC+     P     K  L  G++ D G   DAT P+Q   
Sbjct: 226 VGFGRDPLSLVSQLSIRRFSYCL----TPYASSRKSTLQFGSLADVGLYDDATGPVQTTP 281

Query: 244 -------DGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTW----LVKE 291
                     YY+    ++V  R L I  + F  R D  GGV+IDSGT +T     ++ E
Sbjct: 282 ILQSAQNPTFYYVAFTGVTVGARRLRIPASAFALRPDGSGGVIIDSGTALTLFPAAVLAE 341

Query: 292 AYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLK-------GFPTVRFHFRGGAKL 344
              A R ++ +          S  D +C+     +            P + FHF+ GA L
Sbjct: 342 VVRAFRSQLRLPFA----NGSSPDDGVCFAAPAVAAGGGRMARQVAVPRMVFHFQ-GADL 396

Query: 345 ALEKDSMFYQP-RPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
            L +++   +  R    C+ +  +  +      + IG   QQ   V YD+ R+  +F  +
Sbjct: 397 DLPRENYVLEDHRRGHLCVLLGDSGDDG-----ATIGNFVQQDMRVVYDLERETLSFAPV 451

Query: 404 DC 405
           +C
Sbjct: 452 EC 453


>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 438

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 107/379 (28%), Positives = 165/379 (43%), Gaps = 51/379 (13%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
           N    V +++G PP      +DTGS L W+HC      SP LG++F P  SS+Y+ VPC 
Sbjct: 58  NVTLTVTLAVGSPPQNISMVLDTGSELSWLHCKK----SPNLGSVFNPVSSSTYSPVPCS 113

Query: 118 SEHCRY----FPY-ARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
           S  CR      P  A C    H C     Y     TS+  +    TF      ++     
Sbjct: 114 SPICRTRTRDLPIPASCDPKTHFCHVAISY--ADATSIEGNLAHDTFV---IGSVTRPGT 168

Query: 173 VFGC---GFSTN--RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLIL 226
           +FGC   G S++   + K +G+ G+  G  S V+QL  S FSYCI            L+L
Sbjct: 169 LFGCMDSGLSSDSEEDAKSTGLMGMNRGSLSFVNQLGFSKFSYCISGSDS----SGILLL 224

Query: 227 GDGAIIDEG---------DATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
           GD +    G           TPL + D   Y + LE I V  ++L +  ++F  D +G G
Sbjct: 225 GDASYSWLGPIQYTPLVLQTTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAG 284

Query: 277 -VMIDSGTDVTWLVKEAYEALRDEVM------IRLEGEQMRSYSWPDKLCYHGIMSS--D 327
             M+DSGT  T+L+   Y AL++E +      +R+  +    +     LCY    S+  +
Sbjct: 285 QTMVDSGTQFTFLMGPVYTALKNEFIAQTKSVLRIVDDPNFVFQGTMDLCYRVGSSTRPN 344

Query: 328 LKGFPTVRFHFRG------GAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGM 381
             G P +   FRG      G KL    +    + + + +C     + +    +   +IG 
Sbjct: 345 FTGLPVISLMFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLG--IEAFVIGH 402

Query: 382 MAQQFYNVGYDIGRKQKTF 400
             QQ   + +D+ + +  F
Sbjct: 403 HHQQNVWMEFDLAKSRVGF 421


>gi|125558632|gb|EAZ04168.1| hypothetical protein OsI_26310 [Oryza sativa Indica Group]
 gi|125600539|gb|EAZ40115.1| hypothetical protein OsJ_24558 [Oryza sativa Japonica Group]
          Length = 453

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 111/378 (29%), Positives = 171/378 (45%), Gaps = 31/378 (8%)

Query: 54  NDISNSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSY 111
            D+ N   Y+  ++IG PP       DTGS L+W  C PC + C  Q   ++ PS S ++
Sbjct: 84  KDLPNGGEYIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTF 143

Query: 112 AYVPCDSEHCRYFPYARCSAYKH----RCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI 167
             +PC S        AR +         C Y Q Y  G  TS    +E  TF ++    +
Sbjct: 144 RVLPCSSALNLCAAEARLAGATPPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQV 202

Query: 168 HVQDVVFGCGFSTNRNFKFSG-IFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNKLI 225
            V  + FGC  +++ ++  S  + GLG G  SLVSQL +  FSYC+    D     + L+
Sbjct: 203 RVPGIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCLTPFQDTKS-KSTLL 261

Query: 226 LGDGAIIDEGDATPLQF-----------IDGHYYITLEAISVDGRMLDINPNIFK-RDDS 273
           LG  A     + T ++            +  +YY+ L  ISV    L I P  F  R D 
Sbjct: 262 LGPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGAAALPIPPGAFALRADG 321

Query: 274 GGGVMIDSGTDVTWLVKEAYEALRDEV--MIRLEGEQMRSYSWPDKLCYHGIMSSDLKG- 330
            GG++IDSGT +T LV  AY+ +R  V  +++L      + +  D LC+    SS     
Sbjct: 322 TGGLIIDSGTTITSLVDAAYKRVRAAVRSLVKLPVTDGSNATGLD-LCFALPSSSAPPAT 380

Query: 331 FPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
            P++  HF GGA + L  ++         +C+A+     +      S +G   QQ  ++ 
Sbjct: 381 LPSMTLHFGGGADMVLPVENYMIL-DGGMWCLAMR----SQTDGELSTLGNYQQQNLHIL 435

Query: 391 YDIGRKQKTFQRMDCEVL 408
           YD+ ++  +F    C  L
Sbjct: 436 YDVQKETLSFAPAKCSTL 453


>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
 gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
 gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
 gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 453

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 120/422 (28%), Positives = 180/422 (42%), Gaps = 55/422 (13%)

Query: 22  VQRGINISIARLAYLQEKIRSHNTYQAQILPSND----------ISNSLFYV-NISIGQP 70
           ++R +  S AR A L   +R+   +   I  + +           S  L YV ++++G P
Sbjct: 49  IRRAMQRSKARAAAL-SVVRNGGGFYGSIAQAREREREPGMAVRASGDLEYVLDLAVGTP 107

Query: 71  PVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCS 130
           P P    +DTGS L+W  C  C  C  Q   +F P  SSSY  + C  + C    +  C 
Sbjct: 108 PQPITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMSSSYEPMRCAGQLCGDILHHSC- 166

Query: 131 AYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG-FSTNRNFKFSGI 189
                C Y   Y  G  T  + +TE+ TF ++   T  V  + FGCG  +       SGI
Sbjct: 167 VRPDTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVP-LGFGCGTMNVGSLNNASGI 225

Query: 190 FGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEG---DAT-PLQFI- 243
            G G    SLVSQL+   FSYC+     P     K  L  G++ D G   DAT P+Q   
Sbjct: 226 VGFGRDPLSLVSQLSIRRFSYCL----TPYASSRKSTLQFGSLADVGLYDDATGPVQTTP 281

Query: 244 -------DGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTW----LVKE 291
                     YY+    ++V  R L I  + F  R D  GGV+IDSGT +T     ++ E
Sbjct: 282 ILQSAQNPTFYYVAFTGVTVGARRLRIPASAFALRPDGSGGVIIDSGTALTLFPVAVLAE 341

Query: 292 AYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLK-------GFPTVRFHFRGGAKL 344
              A R ++ +          S  D +C+     +            P + FHF+ GA L
Sbjct: 342 VVRAFRSQLRLPFA----NGSSPDDGVCFAAPAVAAGGGRMARQVAVPRMVFHFQ-GADL 396

Query: 345 ALEKDSMFYQP-RPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
            L +++   +  R    C+ +  +  +      + IG   QQ   V YD+ R+  +F  +
Sbjct: 397 DLPRENYVLEDHRRGHLCVLLGDSGDDG-----ATIGNFVQQDMRVVYDLERETLSFAPV 451

Query: 404 DC 405
           +C
Sbjct: 452 EC 453


>gi|357113696|ref|XP_003558637.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 432

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 105/399 (26%), Positives = 164/399 (41%), Gaps = 38/399 (9%)

Query: 31  ARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCY 90
           ARL +L  K  S     A +  ++  S   + V   +G P  P   A+DT +   W HC 
Sbjct: 49  ARLLFLSSKAASTGVSSAPV--ASGQSPPSYVVRAGLGSPAQPILLALDTSADATWAHCS 106

Query: 91  PCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARC--------SAYKHRCIYTQLY 142
           PC  C P  G++F P+ S+SYA +PC S  C       C        SA    C +T+ +
Sbjct: 107 PCGTC-PSSGSLFAPANSTSYAPLPCSSTMCTVLQGQPCPAQDPYDSSAPLPMCAFTKPF 165

Query: 143 LIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTN---RNFKFSGIFGLGIGRSSL 199
                 +   S      K+       + +  FGC  + +    N    G+ GLG G  +L
Sbjct: 166 ADASFQASLASDWLHLGKDA------IPNYAFGCVSAVSGPTANLPKQGLLGLGRGPMAL 219

Query: 200 VSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH---YYITLE 252
           +SQ+    N  FSYC+ S +   Y    L LG          TP+         YY+ + 
Sbjct: 220 LSQVGNMYNGVFSYCLPS-YKSYYFSGSLRLGAAGQPRGVRYTPMLKNPNRSSLYYVNVT 278

Query: 253 AISVDGRMLDINPNIFKRDD-SGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS 311
            +SV    + +    F  D  +G G ++DSGT +T      Y ALR+E    +      +
Sbjct: 279 GLSVGRAPVKVPAGSFAFDPATGAGTVVDSGTVITRWTPPVYAALREEFRRHVAAPSGYT 338

Query: 312 YSWPDKLCYHGIMSSDLKG--FPTVRFHFRGGAKLALE-KDSMFYQPRPDAFCMAVNPAS 368
                  C++   + ++     P V  H  GG  LAL  ++++ +       C+A+  A 
Sbjct: 339 SLGAFDTCFN---TDEVAAGVAPAVTVHMDGGLDLALPMENTLIHSSATPLACLAMAEAP 395

Query: 369 IN-NRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
            N N  VN  ++  + QQ   V +D+   +  F R  C 
Sbjct: 396 QNVNAVVN--VLANLQQQNLRVVFDVANSRVGFARESCN 432


>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
          Length = 2819

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 97/333 (29%), Positives = 150/333 (45%), Gaps = 44/333 (13%)

Query: 52   PSNDIS---NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRS 108
            PSN +S   N    V++++G PP      +DTGS L W+HC      SP L ++F P  S
Sbjct: 988  PSNKLSFHHNVTLTVSLTVGSPPQQVTMVLDTGSELSWLHCKK----SPNLTSVFNPLSS 1043

Query: 109  SSYAYVPCDSEHCRY----FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDE 164
            SSY+ +PC S  CR      P       K  C     Y         ++++     ++  
Sbjct: 1044 SSYSPIPCSSPICRTRTRDLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRIGSSA- 1102

Query: 165  STIHVQDVVFGC---GFSTN--RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPD 218
                +   +FGC   GFS+N   + K +G+ G+  G  S V+QL    FSYCI       
Sbjct: 1103 ----LPGTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSYCISGRDSSG 1158

Query: 219  YLHNKLILGDGAIIDEGD---------ATPLQFIDG-HYYITLEAISVDGRMLDINPNIF 268
             L    + GD  +   G+         +TPL + D   Y + L+ I V  ++L +  +IF
Sbjct: 1159 VL----LFGDLHLSWLGNLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIF 1214

Query: 269  KRDDSGGG-VMIDSGTDVTWLVKEAYEALRDEVMIRLE------GEQMRSYSWPDKLCYH 321
              D +G G  M+DSGT  T+L+   Y ALR+E + + +      G+    +     LCY 
Sbjct: 1215 APDHTGAGQTMVDSGTQFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYS 1274

Query: 322  GIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQ 354
                  L   P+V   FR GA++ +  + + Y+
Sbjct: 1275 VAAGGKLPTLPSVSLMFR-GAEMVVGGEVLLYR 1306


>gi|224140237|ref|XP_002323490.1| predicted protein [Populus trichocarpa]
 gi|222868120|gb|EEF05251.1| predicted protein [Populus trichocarpa]
          Length = 478

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 94/313 (30%), Positives = 147/313 (46%), Gaps = 31/313 (9%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
           L++  + +G PP      +DTGS +LWV C  C +C    G       F  S SS+   V
Sbjct: 65  LYFTKVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGLV 124

Query: 115 PCDSEHCR---YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
            C    C         +CS   ++C YT  Y  G  TS +  ++ L F      ++ V  
Sbjct: 125 HCSDPICTSAVQTTVTQCSPQTNQCSYTFQYEDGSGTSGYYVSDTLYFDAILGESLVVNS 184

Query: 172 ---VVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNS------SFSYCI-GSLHD 216
              +VFGC     G  T  +    GIFG G G  S++SQL++       FS+C+ G    
Sbjct: 185 SALIVFGCSTFQSGDLTMTDKAVDGIFGFGQGELSVISQLSTHGITPRVFSHCLKGEGIG 244

Query: 217 PDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
              L    IL  G +      +PL     HY + L++I+V+G++L I+P++F   +S  G
Sbjct: 245 GGILVLGEILEPGMVY-----SPLVPSQPHYNLNLQSIAVNGKLLPIDPSVFATSNS-QG 298

Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRF 336
            ++DSGT + +LV EAY+     V + +        S  ++ CY  + +S  + FP   F
Sbjct: 299 TIVDSGTTLAYLVAEAYDPFVSAVNVIVSPSVTPIISKGNQ-CYL-VSTSVSQMFPLASF 356

Query: 337 HFRGGAKLALEKD 349
           +F GGA + L+ +
Sbjct: 357 NFAGGASMVLKPE 369


>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
          Length = 458

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 109/391 (27%), Positives = 169/391 (43%), Gaps = 36/391 (9%)

Query: 32  RLAYLQEKIRSHNTYQ---AQILPSN---DISNSLFYVNISIGQPPVPQFTAMDTGSSLL 85
           R AY++ K       +   A  +P+     +S   + + + IG P V Q  +MDTGS + 
Sbjct: 87  RAAYIKRKFSGAGDIEQSDAATVPTTLGTSLSTLEYVITVGIGSPAVTQTMSMDTGSDVS 146

Query: 86  WVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYAR--CSAYKHRCIYTQLYL 143
           WV C PC  C  ++ ++F PS SS+Y+   C S  C     ++        +C Y   Y 
Sbjct: 147 WVQCKPCSQCHSEVDSLFDPSSSSTYSPFSCSSAPCAQLSQSQEGNGCMSSQCQYIVNYG 206

Query: 144 IGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF--KFSGIFGLGIGRSSLVS 201
               T+   S++ LT  ++      + D  FGC  S +  F  +  G+ GLG G  SL S
Sbjct: 207 DSSSTTGTYSSDTLTLGSS-----AMTDFQFGCSQSESGGFNDQTDGLMGLGGGAQSLAS 261

Query: 202 Q----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLEAI 254
           Q      ++FSYC   L         L LG G+       TP+     I  +Y + LE+I
Sbjct: 262 QTAGTFGTAFSYC---LPPTSGSSGFLTLGTGS--SGFVKTPMLRSTQIPTYYVVLLESI 316

Query: 255 SVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSW 314
            V  + L++  ++F       G ++DSGT +T L   AY AL       ++     + S 
Sbjct: 317 KVGSQQLNLPTSVFS-----AGSLMDSGTIITRLPPTAYSALSSAFKAGMQQYPPATPSG 371

Query: 315 PDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYV 374
               C+     S +   PTV   F GGA + L  D +  +      C+A  P   N    
Sbjct: 372 ILDTCFDFSGQSSIS-IPTVTLVFSGGAAVDLAFDGIMLEISSSIRCLAFTP---NGDDS 427

Query: 375 NFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           +  +IG + Q+ + V YD+G     F+   C
Sbjct: 428 SLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 458


>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 481

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 110/389 (28%), Positives = 166/389 (42%), Gaps = 56/389 (14%)

Query: 13  NPNENPAHRVQRGINISIARLAYLQEKIRSHNT-YQAQILPSNDI---------SNSLFY 62
           N N N    V R     +  LA     I++H+   + + L   D+         SN L+Y
Sbjct: 22  NANANLVFPVVRKFKGPVENLA----AIKAHDAGRRGRFLSVVDVALGGNGRPTSNGLYY 77

Query: 63  VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYVPCD 117
             I +G  P   +  +DTGS  LWV+C  C  C  + G     T++ P+ S +   VPCD
Sbjct: 78  TKIGLG--PKDYYVQVDTGSDTLWVNCVGCTACPKKSGLGMDLTLYDPNLSKTSKAVPCD 135

Query: 118 SEHCRYFPYARCSAYKH--RCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD---V 172
            E C      + S       C Y+  Y  G  TS     + LTF         V D   V
Sbjct: 136 DEFCTSTYDGQISGCTKGMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLRTVPDNTSV 195

Query: 173 VFGCG------FSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYL 220
           +FGCG       S+  +    GI G G   SS++SQL ++      FS+C+      D +
Sbjct: 196 IFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRIFSHCL------DSI 249

Query: 221 HNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
               I   G ++  +   TPL     HY + L+ I V G  + +  +I     SG G +I
Sbjct: 250 SGGGIFAIGEVVQPKVKTTPLLQGMAHYNVVLKDIEVAGDPIQLPSDILDS-SSGRGTII 308

Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL-CYHGIMSSDLKG----FPTV 334
           DSGT + +L    Y+ L ++++ +  G  M+ Y   D+  C+H    SD +     FPTV
Sbjct: 309 DSGTTLAYLPVSIYDQLLEKILAQRSG--MKLYLVEDQFTCFH---YSDEESVDDLFPTV 363

Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMA 363
           +F F  G  L        +  + D +C+ 
Sbjct: 364 KFTFEEGLTLTTYPRDYLFLFKEDMWCVG 392


>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
          Length = 494

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 95/326 (29%), Positives = 143/326 (43%), Gaps = 32/326 (9%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLG---TIFYPSRSSSYAYV 114
           L++  I IG P    +  +DTGS +LWV+C  C  C     LG   T++ P  S S   V
Sbjct: 89  LYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELV 148

Query: 115 PCDSEHC--RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD---ESTIHV 169
            CD + C   Y            C Y+  Y  G  T+ F  T+ L +       ++T   
Sbjct: 149 TCDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTPAN 208

Query: 170 QDVVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPD 218
             V FGCG        + N    GI G G   SS++SQL ++      F++C+      D
Sbjct: 209 ASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCL------D 262

Query: 219 YLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
            ++   I   G ++  +   TPL     HY + L+ I V G  L +  NIF   +S  G 
Sbjct: 263 TVNGGGIFAIGNVVQPKVKTTPLVPDMPHYNVILKGIDVGGTALGLPTNIFDSGNS-KGT 321

Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFH 337
           +IDSGT + ++ +  Y+AL    M+  + + +   +  D  C+    S D  GFP V FH
Sbjct: 322 IIDSGTTLAYVPEGVYKALF--AMVFDKHQDISVQTLQDFSCFQYSGSVD-DGFPEVTFH 378

Query: 338 FRGGAKLALEKDSMFYQPRPDAFCMA 363
           F G   L +      +Q   + +CM 
Sbjct: 379 FEGDVSLIVSPHDYLFQNGKNLYCMG 404


>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
          Length = 525

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 101/359 (28%), Positives = 149/359 (41%), Gaps = 33/359 (9%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYVPCDSE 119
           + V I +G P        DTGS   WV C PC   C  Q   +F P+RSS+ A + C + 
Sbjct: 186 YVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYEQQEKLFDPARSSTDANISCAAP 245

Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
            C       CS     C+Y   Y  G  +  F + + LT  + D     ++   FGCG  
Sbjct: 246 ACSDLYTKGCSG--GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDA----IKGFRFGCGER 299

Query: 180 TNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHD-PDYLHNKLILGDGAIID 233
               F + +G+ GLG G++SL  Q        F++C  +      YL      G    + 
Sbjct: 300 NEGLFGEAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFPARSSGTGYL--DFGPGSSPAVS 357

Query: 234 EGDATPLQFIDG--HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKE 291
               TP+   +G   YY+ L  I V G++L I P++F       G ++DSGT +T L   
Sbjct: 358 TKLTTPMLVDNGLTFYYVGLTGIRVGGKLLSIPPSVFTT----AGTIVDSGTVITRLPPA 413

Query: 292 AYEALRDEVMIRLEGEQMRSYSWPDKL-----CYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
           AY +LR      +     R Y     L     CY     S +   PTV   F+GGA L +
Sbjct: 414 AYSSLRSAFASAIA---ARGYKKAPALSLLDTCYDFTGMSQVA-IPTVSLLFQGGASLDV 469

Query: 347 EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           +   + Y       C+     + N    +  ++G    + + V YDIG+K   F    C
Sbjct: 470 DASGIIYAASVSQACLGF---AANEEDDDVGIVGNTQLKTFGVVYDIGKKVVGFSPGAC 525


>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 394

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 95/318 (29%), Positives = 143/318 (44%), Gaps = 42/318 (13%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
           N  +   I IG PP      +DTGS++ +V C  C  C       F P  SS+Y  V C+
Sbjct: 87  NGYYTTRIWIGTPPQTFALIVDTGSTVTYVPCSTCEQCGRHQDPKFEPELSSTYQPVSCN 146

Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
            +         C   + +C+Y + Y     +S  +  + ++F N  +S +  Q  +FGC 
Sbjct: 147 ID-------CTCDNERKQCVYERQYAEMSSSSGVLGEDIISFGN--QSELVPQRAIFGCE 197

Query: 178 FSTNRNF---KFSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLILG- 227
                +    +  GI GLG G  S+V QL      + SFS C G +   D     +ILG 
Sbjct: 198 NQETGDLYSQRADGIMGLGRGDLSIVDQLVEKGVISDSFSLCYGGM---DIGGGAMILGG 254

Query: 228 ----DGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGT 283
                G +  E D    Q+    Y I L+AI V G+ L ++P+IF   D   G ++DSGT
Sbjct: 255 ISPPSGMVFAESDPVRSQY----YNIDLKAIHVAGKQLHLDPSIF---DGKHGTVLDSGT 307

Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD----KLCYHGIMS--SDLKG-FPTVRF 336
              +L + A+ A +D +M  L    ++    PD     +C+ G  S  S L   FP V  
Sbjct: 308 TYAYLPEAAFTAFKDAMMKEL--TSLKQIHGPDPNYNDICFSGAESDVSQLSNTFPAVEM 365

Query: 337 HFRGGAKLALEKDSMFYQ 354
            F  G KL+L  ++  +Q
Sbjct: 366 VFSNGQKLSLSPENYLFQ 383


>gi|242085924|ref|XP_002443387.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
 gi|241944080|gb|EES17225.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
          Length = 460

 Score =  112 bits (281), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 112/371 (30%), Positives = 163/371 (43%), Gaps = 33/371 (8%)

Query: 67  IGQPPVPQFTAMDTGSSLLWVHCYPCR--DCSPQLGTIFYPSRSSSYAYVPCDSEHCRYF 124
           IG PP      +DTGS+L+W  C  CR   C  Q  T + PSRS +   V C+   C   
Sbjct: 90  IGDPPQQAAAIIDTGSNLIWTQCSTCRANGCFGQDLTFYDPSRSRTAKPVACNDTACLLG 149

Query: 125 PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF 184
              RC+     C     Y  G     F+ TE  TF +   S  +V  + FGC  ++    
Sbjct: 150 SETRCARDGKACAVLTAYGAG-AIGGFLGTEVFTFGHGQSSENNVS-LAFGCITASRLTP 207

Query: 185 K----FSGIFGLGIGRSSLVSQL-NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGD--A 237
                 SGI GLG G+ SL SQL ++ FSYC+          + L +G  A +  G   A
Sbjct: 208 GSLDGASGIIGLGRGKLSLPSQLGDNKFSYCLTPYFSDAANTSTLFVGASAGLSGGGAPA 267

Query: 238 TPLQFI--------DGHYYITLEAISVDGRMLDINPNIFKRDDSG----GGVMIDSGTDV 285
           T + F+        D  YY+ L  I+V    LD+    F   +      GG +IDSG+  
Sbjct: 268 TSVPFLKNPDDDPFDSFYYLPLTGITVGTAKLDVPAAAFDLREVAPAKWGGTLIDSGSPF 327

Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPD--KLCYHGIMSSDL-KGFPTVRFHFRGGA 342
           T L+  AY+ALRDE++ +L    +   +  +   LC  G+   D  K  P +  HF  G 
Sbjct: 328 TSLIDVAYQALRDELVRQLGASVVPPPAGAEGLDLCVGGVAPGDAGKLVPPLVLHFGSGG 387

Query: 343 KLALE---KDSMFYQPRPDA-FCMAVNPASINNRYVNF---SLIGMMAQQFYNVGYDIGR 395
               +       ++ P  D+  CM V  +   N  +     ++IG   QQ  ++ YD+G+
Sbjct: 388 GGGGDVVVPPENYWGPVDDSTACMVVFSSGGPNSTLPLNETTIIGNYMQQDMHLLYDLGQ 447

Query: 396 KQKTFQRMDCE 406
              +FQ  DC 
Sbjct: 448 GVLSFQPADCS 458


>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
 gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
          Length = 489

 Score =  112 bits (281), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 94/347 (27%), Positives = 158/347 (45%), Gaps = 41/347 (11%)

Query: 51  LPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYP 105
           LP++     L++  I +G PP   +  +DTGS +LWV+C  C  C  + G     T + P
Sbjct: 77  LPTD---TGLYFTEIKLGTPPKRYYVQVDTGSDILWVNCISCEKCPRKSGLGLDLTFYDP 133

Query: 106 SRSSSYAYVPCDSEHCRYFPYAR---CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT 162
             SSS + V CD   C      +   C+A    C Y+ +Y  G  T+ F  T+ L F   
Sbjct: 134 KASSSGSTVSCDQGFCAATYGGKLPGCTA-NVPCEYSVMYGDGSSTTGFFVTDALQFDQV 192

Query: 163 ---DESTIHVQDVVFGCGFST-----NRNFKFSGIFGLGIGRSSLVSQLNSS------FS 208
               ++      V FGCG        + N    GI G G   +S++SQL ++      F+
Sbjct: 193 TGDGQTQPGNATVTFGCGAQQGGDLGSSNQALDGILGFGQANTSMLSQLAAAGKVKKIFA 252

Query: 209 YCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNI 267
           +C+      D +    I   G ++  +   TPL     HY + L++I V G  L +  ++
Sbjct: 253 HCL------DTIKGGGIFAIGNVVQPKVKTTPLVADMPHYNVNLKSIDVGGTTLQLPAHV 306

Query: 268 FKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRL--EGEQMRSYSWPDKLCYHGIMS 325
           F+  +   G +IDSGT +T+L +  ++    EVM  +  + + +  ++  D +C+    S
Sbjct: 307 FETGER-KGTIIDSGTTLTYLPELVFK----EVMAAIFNKHQDIVFHNVQDFMCFQYPGS 361

Query: 326 SDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR 372
            D  GFPT+ FHF     L +     F+    D +C+     ++ ++
Sbjct: 362 VD-DGFPTITFHFEDDLALHVYPHEYFFPNGNDMYCVGFQNGALQSK 407


>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 506

 Score =  112 bits (281), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 94/321 (29%), Positives = 147/321 (45%), Gaps = 34/321 (10%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
           L++  + +G P    F  +DTGS +LWV C PC  C    G       F P  SS+ + +
Sbjct: 88  LYFTRVKLGNPAKEYFVQIDTGSDILWVACSPCTGCPTSSGLNIQLEFFNPDSSSTSSRI 147

Query: 115 PCDSEHCRYF---PYARCSAY---KHRCIYTQLYLIGPETSVFVSTEQLTFKNT---DES 165
           PC  + C        A C +       C YT  Y  G  TS F  ++ + F      +++
Sbjct: 148 PCSDDRCTAALQTGEAVCQSSDSPSSPCGYTFTYGDGSGTSGFYVSDTMYFDTVMGNEQT 207

Query: 166 TIHVQDVVFGCGFSTNRNF-----KFSGIFGLGIGRSSLVSQLNS------SFSYCIGSL 214
                 VVFGC  S + +         GIFG G  + S+VSQL S      +FS+C   L
Sbjct: 208 ANSSASVVFGCSNSQSGDLMKTDRAVDGIFGFGQHQLSVVSQLYSLGVSPKTFSHC---L 264

Query: 215 HDPDYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDS 273
              D     L+LG+  I++ G   TPL     HY + LE+I+V G+ L I+ ++F   ++
Sbjct: 265 KGSDNGGGILVLGE--IVEPGLVFTPLVPSQPHYNLNLESIAVSGQKLPIDSSLFATSNT 322

Query: 274 GGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPT 333
             G ++DSGT + +LV  AY+   + +   +        S   + C+    S D   FPT
Sbjct: 323 -QGTIVDSGTTLVYLVDGAYDPFINAIAAAVSPSVRSVVSKGIQ-CFVTTSSVD-SSFPT 379

Query: 334 VRFHFRGGAKLALEKDSMFYQ 354
              +F+GG  + ++ ++   Q
Sbjct: 380 ATLYFKGGVSMTVKPENYLLQ 400


>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
 gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
          Length = 463

 Score =  112 bits (281), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 113/427 (26%), Positives = 190/427 (44%), Gaps = 49/427 (11%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHN-TYQAQILPSN----- 54
           L+HR    +P+   +  PA      +     R+  + +  RS N T   + + S+     
Sbjct: 65  LVHRFGPCNPHRT-STAPASSFNEILRRDKLRVDSIIQARRSMNLTSSVEHMKSSVPFYG 123

Query: 55  --DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYA 112
              I+ S + VN+ IG P        DTGS L+W  C PC+ C P++  +F P++S+S+ 
Sbjct: 124 LSKITASDYIVNVGIGTPKKEMPLIFDTGSGLIWTQCKPCKACYPKV-PVFDPTKSASFK 182

Query: 113 YVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
            +PC S+ C+     R      +C Y   Y+    ++  ++TE ++F +        +++
Sbjct: 183 GLPCSSKLCQSI---RQGCSSPKCTYLTAYVDNSSSTGTLATETISFSHLK---YDFKNI 236

Query: 173 VFGCGFS-TNRNFKFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILG 227
           + GC    +  +   SGI GL     SL SQ     +  FSYCI S          L  G
Sbjct: 237 LIGCSDQVSGESLGESGIMGLNRSPISLASQTANIYDKLFSYCIPSTPGS---TGHLTFG 293

Query: 228 DGAIIDEGDATPLQFI--DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDV 285
            G + ++   +P+        Y I +  ISV GR L I+ + FK   +     IDSG  +
Sbjct: 294 -GKVPNDVRFSPVSKTAPSSDYDIKMTGISVGGRKLLIDASAFKIAST-----IDSGAVL 347

Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK-----LCYHGIMSSDLKGFPTVRFHFRG 340
           T L  +AY ALR   + R   E M+ Y   D+      CY     S +   P++   F G
Sbjct: 348 TRLPPKAYSALRS--VFR---EMMKGYPLLDQDDFLDTCYDFSNYSTV-AIPSISVFFEG 401

Query: 341 GAKLALEKDSMFYQ-PRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKT 399
           G ++ ++   + +Q P    +C+A   A +++     S+ G   Q+ Y V +D  +++  
Sbjct: 402 GVEMDIDVSGIMWQVPGSKVYCLAF--AELDDE---VSIFGNFQQKTYTVVFDGAKERIG 456

Query: 400 FQRMDCE 406
           F    C+
Sbjct: 457 FAPGGCD 463


>gi|414887402|tpg|DAA63416.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
          Length = 407

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 102/373 (27%), Positives = 166/373 (44%), Gaps = 61/373 (16%)

Query: 19  AHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAM 78
           A R+   +   +   A+   ++R H+           ++N  +   + IG PP      +
Sbjct: 56  ASRLAASLRRGLGDGAHPNARMRLHDDL---------LTNGYYTTRLYIGTPPQEFALIV 106

Query: 79  DTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIY 138
           D+GS++ +V C  C  C       F P  SSSY+ V C+ +         C + K +C Y
Sbjct: 107 DSGSTVTYVPCASCEQCGNHQDPRFQPDLSSSYSPVKCNVD-------CTCDSDKKQCTY 159

Query: 139 TQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFS----GIFGLGI 194
            + Y     +S  +  + ++F    ES +  Q  VFGC  S   +  FS    GI GLG 
Sbjct: 160 ERQYAEMSSSSGVLGEDIVSFGR--ESELKAQRAVFGCENSETGDL-FSQHADGIMGLGR 216

Query: 195 GRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFI----- 243
           G+ S++ QL      N SFS C G +           +G GA++  G  TP   +     
Sbjct: 217 GQLSIMDQLVEKGVINDSFSLCYGGMD----------IGGGAMVLGGVPTPSDMVFSRSD 266

Query: 244 ---DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEV 300
                +Y I L+ I V G+ L ++  IF   DS  G ++DSGT   +L ++A+ A +D V
Sbjct: 267 PLRSPYYNIELKEIHVAGKALRVDSRIF---DSKHGTVLDSGTTYAYLPEQAFMAFKDAV 323

Query: 301 MIRLEGEQMRSYSWPD----KLCYHGI---MSSDLKGFPTVRFHFRGGAKLALEKDS-MF 352
             ++    ++    PD     +C+ G    +S   + FP V   F  G KL+L  ++ +F
Sbjct: 324 TSKV--HSLKKIRGPDPSYKDICFAGARRNVSKLHEVFPDVDMVFGNGQKLSLTPENYLF 381

Query: 353 YQPRPD-AFCMAV 364
              + D A+C+ V
Sbjct: 382 RHSKVDGAYCLGV 394


>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
          Length = 477

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 107/385 (27%), Positives = 151/385 (39%), Gaps = 46/385 (11%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCY-PCR--------DCSPQLGTIFYPSRSSSY 111
           ++V   +G P  P     DTGS L WV C  P          D  P  G  F P  S ++
Sbjct: 97  YFVRFRVGTPAQPFLLVADTGSDLTWVKCRRPASANSSLSPADSGPGPGRAFRPEDSRTW 156

Query: 112 AYVPCDSEHCRY---FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLT--FKNTDEST 166
           A + C S+ C     F  A C      C Y   Y  G      V TE  T      +E  
Sbjct: 157 APISCASDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSGREERK 216

Query: 167 IHVQDVVFGCGFS-TNRNFKFS-GIFGLGIG----RSSLVSQLNSSFSYCIGSLHDPDYL 220
             ++ +V GC  S T  +F+ S G+  LG       S   S+    FSYC+     P   
Sbjct: 217 AKLKGLVLGCSSSYTGPSFEASDGVLSLGYSGISFASHAASRFGGRFSYCLVDHLSPRNA 276

Query: 221 HNKLILGDGAIID--------------EGDATPL---QFIDGHYYITLEAISVDGRMLDI 263
            + L  G    +                   TPL   + +   Y ++L+AISV G  L I
Sbjct: 277 TSYLTFGPNPAVSSPRASPSSCAAAAPRARQTPLLLDRRMRPFYDVSLKAISVAGEFLKI 336

Query: 264 NPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGI 323
            P      ++GGGV++DSGT +T L K AY A+   +   L G    +   P + CY+  
Sbjct: 337 -PRAVWDVEAGGGVILDSGTSLTVLAKPAYRAVVAALSKGLAGLPRVTMD-PFEYCYNWT 394

Query: 324 MSSDLK---GFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIG 380
             S        P +  HF G A+L     S      P   C+ +        +   S+IG
Sbjct: 395 SPSGKDADVAVPKMAVHFAGAARLEPPGKSYVIDAAPGVKCIGLQ----EGPWPGISVIG 450

Query: 381 MMAQQFYNVGYDIGRKQKTFQRMDC 405
            + QQ +   +DI  ++  FQR  C
Sbjct: 451 NILQQEHLWEFDIKNRRLKFQRSRC 475


>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
          Length = 514

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 104/371 (28%), Positives = 161/371 (43%), Gaps = 27/371 (7%)

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
           + +  + V++ +G PP      MDTGS L W+ C PC DC  Q G +F P+ S SY  V 
Sbjct: 147 VGSGEYLVDLYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPATSLSYRNVT 206

Query: 116 CDSEHCRYF-----PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDE-STIHV 169
           C    C        P A    +   C Y   Y     T+  ++ E  T   T   ++  V
Sbjct: 207 CGDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAPGASRRV 266

Query: 170 QDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKL 224
            DVVFGCG S    F   +G+ GLG G  S  SQL +    +FSYC+  +     + +K+
Sbjct: 267 DDVVFGCGHSNRGLFHGAAGLLGLGRGALSFASQLRAVYGHAFSYCL--VDHGSSVGSKI 324

Query: 225 ILGDGAII--------DEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGG 275
           + GD   +             +     D  YY+ L+ + V G  L+I+P+ +    D  G
Sbjct: 325 VFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVGKDGSG 384

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
           G +IDSGT +++  + AYE +R   + R++        +P     + +   +    P   
Sbjct: 385 GTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSPCYNVSGVERVEVPEFS 444

Query: 336 FHFRGGAKLALEKDSMFYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
             F  GA      ++ F +  PD   C+AV    +       S+IG   QQ ++V YD+ 
Sbjct: 445 LLFADGAVWDFPAENYFVRLDPDGIMCLAV----LGTPRSAMSIIGNFQQQNFHVLYDLQ 500

Query: 395 RKQKTFQRMDC 405
             +  F    C
Sbjct: 501 NNRLGFAPRRC 511


>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
 gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
           Japonica Group]
 gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
 gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
 gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 514

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 104/371 (28%), Positives = 161/371 (43%), Gaps = 27/371 (7%)

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
           + +  + V++ +G PP      MDTGS L W+ C PC DC  Q G +F P+ S SY  V 
Sbjct: 147 VGSGEYLVDLYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASLSYRNVT 206

Query: 116 CDSEHCRYF-----PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDE-STIHV 169
           C    C        P A    +   C Y   Y     T+  ++ E  T   T   ++  V
Sbjct: 207 CGDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAPGASRRV 266

Query: 170 QDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKL 224
            DVVFGCG S    F   +G+ GLG G  S  SQL +    +FSYC+  +     + +K+
Sbjct: 267 DDVVFGCGHSNRGLFHGAAGLLGLGRGALSFASQLRAVYGHAFSYCL--VDHGSSVGSKI 324

Query: 225 ILGDGAII--------DEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGG 275
           + GD   +             +     D  YY+ L+ + V G  L+I+P+ +    D  G
Sbjct: 325 VFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVGKDGSG 384

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
           G +IDSGT +++  + AYE +R   + R++        +P     + +   +    P   
Sbjct: 385 GTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSPCYNVSGVERVEVPEFS 444

Query: 336 FHFRGGAKLALEKDSMFYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
             F  GA      ++ F +  PD   C+AV    +       S+IG   QQ ++V YD+ 
Sbjct: 445 LLFADGAVWDFPAENYFVRLDPDGIMCLAV----LGTPRSAMSIIGNFQQQNFHVLYDLQ 500

Query: 395 RKQKTFQRMDC 405
             +  F    C
Sbjct: 501 NNRLGFAPRRC 511


>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
 gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
          Length = 462

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 119/421 (28%), Positives = 179/421 (42%), Gaps = 56/421 (13%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           L HR++  +P     +  AHR+ R    + A     +   R+   + A ++      +  
Sbjct: 82  LAHREAFAAPNATAAQLLAHRLARDAARAEAISVSARNVTRAGGGFSAPVVSGLAQGSGE 141

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++ ++ +G PP P    +DTGS ++W+ C PCR C  Q G +F P RS SYA V C +  
Sbjct: 142 YFASVGVGTPPTPALLVLDTGSDVVWLQCAPCRQCYAQSGRVFDPRRSRSYAAVRCGAPP 201

Query: 121 CRYFPYARCSAYKHR---CIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
           CR            R   C+Y   Y  G  T+  ++TE L F         V  V  GCG
Sbjct: 202 CRGLDAGGGGGCDRRRGTCLYQVAYGDGSVTAGDLATETLWFAR----GARVPRVAVGCG 257

Query: 178 FSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAII 232
                 F   +G+ GLG GR SL +Q        FSYC       D  H  +I       
Sbjct: 258 HDNEGLFVAAAGLLGLGRGRLSLPTQTARRYGRRFSYC---FQGSDLDHRTIIR------ 308

Query: 233 DEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEA 292
                T  Q + G          V  R L ++P+  +     GGV++DSGT VT L +  
Sbjct: 309 -----TVHQHVGGA-----RVRGVGERSLRLDPSTGR-----GGVILDSGTSVTRLARPV 353

Query: 293 YEALRDEVMIRLEGEQMR--SYSWPDKLCYHGIMSSDLKG-----FPTVRFHFRGGAKLA 345
           Y A+R+       G ++    +S  D  CY      DL+G      PTV  H  GGA++A
Sbjct: 354 YVAVREAFRAAAGGLRLAPGGFSLFDT-CY------DLRGRRVVKVPTVSVHLAGGAEVA 406

Query: 346 LEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
           L  ++ +        FC+A+  A  +      S++G + QQ + V +D  R++       
Sbjct: 407 LPPENYLIPVDTRGTFCLAL--AGTDG---GVSIVGNIQQQGFRVVFDGDRQRVALVPKS 461

Query: 405 C 405
           C
Sbjct: 462 C 462


>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 510

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 103/369 (27%), Positives = 160/369 (43%), Gaps = 25/369 (6%)

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
           + +  + V + +G PP      MDTGS L W+ C PC DC  Q G +F P  S+SY  V 
Sbjct: 145 VGSGEYLVEVYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFDQRGPVFDPMASTSYRNVT 204

Query: 116 CDSEHCRYF-----PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
           C    C        P    S+    C Y   Y     T+  ++ E  T   T  S+  V 
Sbjct: 205 CGDTRCGLVSPPAAPRTCRSSRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTASSSRRVD 264

Query: 171 DVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLI 225
            VV GCG      F   +G+ GLG G  S  SQL +    +FSYC+  +     + +K++
Sbjct: 265 GVVLGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHAFSYCL--VDHGSAVGSKIV 322

Query: 226 LGDGAI------IDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIF--KRDDSGGGV 277
            GD  +      ++     P    +  YY+ L+ I V G MLDI  N +   ++D  GG 
Sbjct: 323 FGDDNVLLSHPQLNYTAFAPSAAENTFYYVQLKGILVGGEMLDIPSNTWGVSKEDGSGGT 382

Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFH 337
           +IDSGT +++  + AY+A+R   + R++        +P     + +   +    P     
Sbjct: 383 IIDSGTTLSYFPEPAYKAIRQAFVDRMDKAYPLIADFPVLSPCYNVSGVERVEVPEFSLL 442

Query: 338 FRGGAKLALEKDSMFYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
           F  GA      ++ F +   +   C+AV    +       S+IG   QQ ++V YD+   
Sbjct: 443 FADGAVWDFPAENYFIRLDTEGIMCLAV----LGTPRSAMSIIGNYQQQNFHVLYDLHHN 498

Query: 397 QKTFQRMDC 405
           +  F    C
Sbjct: 499 RLGFAPRRC 507


>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 486

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 106/366 (28%), Positives = 151/366 (41%), Gaps = 44/366 (12%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD---CSPQLGTIFYPSRSSSYAYVPCD 117
           F V + +G P  P     DTGS L WV C PC     C PQ   +F PS+SS+YA V C 
Sbjct: 144 FVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAAVHCG 203

Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
              C       CS     C+Y   Y  G  T+  +S + L   ++   T       FGCG
Sbjct: 204 EPQCAAA-GDLCSEDNTTCLYLVRYGDGSSTTGVLSRDTLALTSSRALT----GFPFGCG 258

Query: 178 FSTNRNFKFSGIFG-----------LGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLIL 226
               RN    G FG                S   +   + FSYC+ S +        L +
Sbjct: 259 ---TRNL---GDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCLPSSNS---TTGYLTI 309

Query: 227 GDGAIIDEGDATPLQFI-----DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDS 281
           G     D G A     +        Y++ L +I + G +L + P +F R    GG ++DS
Sbjct: 310 GATPATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYVLPVPPAVFTR----GGTLLDS 365

Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL--CYHGIMSSDLKGFPTVRFHFR 339
           GT +T+L  +AY  LRD    RL  E+       D L  CY     S++   P V F F 
Sbjct: 366 GTVLTYLPAQAYALLRDR--FRLTMERYTPAPPNDVLDACYDFAGESEVV-VPAVSFRFG 422

Query: 340 GGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKT 399
            GA   L+   +      +  C+A   A+++   +  S+IG   Q+   V YD+  ++  
Sbjct: 423 DGAVFELDFFGVMIFLDENVGCLAF--AAMDTGGLPLSIIGNTQQRSAEVIYDVAAEKIG 480

Query: 400 FQRMDC 405
           F    C
Sbjct: 481 FVPASC 486


>gi|255563741|ref|XP_002522872.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223537956|gb|EEF39570.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 448

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 122/440 (27%), Positives = 183/440 (41%), Gaps = 61/440 (13%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           ++HR S  SP+   N     R+ R + +S         KIR+HN     I  S+  S   
Sbjct: 32  IVHRYSRESPFYPGNITDYERITRLVELS---------KIRAHNL---AITTSSGFSPEA 79

Query: 61  FYVNIS-----------IGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSS 109
           F + IS           IG P VP +   DTGS L W  C PC     QL  IF  + S 
Sbjct: 80  FRLRISQDDTCYLVKVIIGSPGVPLYLVPDTGSGLFWTQCEPCTRRFRQLPPIFNSTASR 139

Query: 110 SYAYVPCDSEHCRYFPYA-RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIH 168
           +Y  +PC  + C       +C     +C+Y   Y  G  T+   + + L     D    +
Sbjct: 140 TYRDLPCQHQFCTNNQNVFQCR--DDKCVYRIAYAGGSATAGVAAQDILQSAENDRIPFY 197

Query: 169 VQDVVFGCGFSTNRNFK-------FSGIFGLGIGRSSLVSQLN----SSFSYCIG--SLH 215
                FGC    N+NF          GI GL +   SL+ Q+N    + FSYC+    L 
Sbjct: 198 -----FGCS-RDNQNFSTFESSGKGGGIIGLNMSPVSLLQQMNHITKNRFSYCLNLFDLS 251

Query: 216 DPDYLHNKLILGDGAIIDEGDATPLQFID----GHYYITLEAISVDGRMLDINPNIFK-R 270
            P +  + L  G+             F+      +Y++ L  +SV G  + I P  F  +
Sbjct: 252 SPSHATSLLRFGNDIRKSRRKYLSTPFVSPRGMPNYFLNLIDVSVAGNRMQIPPGTFALK 311

Query: 271 DDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLE--GEQMRSYSWPDKLCYHGIMSSDL 328
            D  GG +IDSGT VT++ + AY  +        +  G Q  +      +CY        
Sbjct: 312 PDGTGGTIIDSGTAVTYISQTAYFPVITAFKNYFDQHGFQRVNIQLSGYICYKQ-QGHTF 370

Query: 329 KGFPTVRFHFRGGAKLALEKDSMFY--QPRPDAFCMAVNPASINNRYVNFSLIGMMAQQF 386
             +P++ FHF+ GA   +E + ++   Q R  AFC+A+ P S   R    ++IG + Q  
Sbjct: 371 HNYPSMAFHFQ-GADFFVEPEYVYLTVQDR-GAFCVALQPISPQQR----TIIGALNQAN 424

Query: 387 YNVGYDIGRKQKTFQRMDCE 406
               YD   +Q  F   +C+
Sbjct: 425 TQFIYDAANRQLLFTPENCQ 444


>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 436

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 107/406 (26%), Positives = 176/406 (43%), Gaps = 62/406 (15%)

Query: 47  QAQILPSNDIS----------NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCS 96
           + Q+LPS  +           N    V++++G PP      +DTGS L W+HC      +
Sbjct: 39  KTQVLPSGSVPRPSSKLSFHHNVSLTVSLTVGSPPQTVTMVLDTGSELSWLHCKK----A 94

Query: 97  PQLGTIFYPSRSSSYAYVPCDSEHCRY----FPYARCSAYKHRCIYTQLYLIGPETSVFV 152
           P L ++F P RSSSY+ +PC S  CR     F        K  C     Y         +
Sbjct: 95  PNLHSVFDPLRSSSYSPIPCTSPTCRTRTRDFSIPVSCDKKKLCHAIISYADASSIEGNL 154

Query: 153 STEQLTFKNTDESTIHVQDVVFGC---GFSTN--RNFKFSGIFGLGIGRSSLVSQLN-SS 206
           +++     N+      +   +FGC   GFS+N   + K +G+ G+  G  S V+Q+    
Sbjct: 155 ASDTFHIGNS-----AIPATIFGCMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQMGLQK 209

Query: 207 FSYCIGSLHDPDYLHNKLILGDGAIIDEGD---------ATPLQFIDG-HYYITLEAISV 256
           FSYCI S  D   +   L+ G+ +               +TPL + D   Y + LE I V
Sbjct: 210 FSYCI-SGQDSSGI---LLFGESSFSWLKALKYTPLVQISTPLPYFDRVAYTVQLEGIKV 265

Query: 257 DGRMLDINPNIFKRDDSGGG-VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWP 315
              ML +  +++  D +G G  M+DSGT  T+L+   Y AL++E  +R     ++    P
Sbjct: 266 ANSMLQLPKSVYAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNE-FVRQTKASLKVLEDP 324

Query: 316 D-------KLCYH-GIMSSDLKGFPTVRFHFRGGAKLALEKDSMFY------QPRPDAFC 361
           +        LCY   +    L   PTV   FR GA++++  + + Y      +     +C
Sbjct: 325 NFVFQGAMDLCYRVPLTRRTLPPLPTVTLMFR-GAEMSVSAERLMYRVPGVIRGSDSVYC 383

Query: 362 MAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
                + +    V   +IG   QQ   + +D+ + +  F  + C++
Sbjct: 384 FTFGNSELLG--VESYIIGHHHQQNVWMEFDLAKSRVGFAEVRCDL 427


>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
          Length = 500

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 109/428 (25%), Positives = 176/428 (41%), Gaps = 41/428 (9%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQI-------LPS 53
           ++HR    SP  + ++      +  +     R   +Q ++ +  T            LP+
Sbjct: 91  IVHRHGPCSPLADAHDGKLPSHEEILAADQNRAKSIQRRVSTTTTVSRGKPKRNRPSLPA 150

Query: 54  ND---ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSS 109
           +    +    + V I +G P        DTGS   WV C PC   C  Q   +F P+RSS
Sbjct: 151 SSGSALGTGNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYKQQEKLFDPARSS 210

Query: 110 SYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHV 169
           +YA + C +  C       CS     C+Y   Y  G  +  F + + LT  + D     +
Sbjct: 211 TYANISCAAPACSDLYIKGCSG--GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDA----I 264

Query: 170 QDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKL 224
           +   FGCG      + + +G+ GLG G++SL  Q        F++C  +          L
Sbjct: 265 KGFRFGCGERNEGLYGEAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFPARSSGT---GYL 321

Query: 225 ILGDGAI--IDEGDATPLQFIDG--HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
             G G++  +     TP+   +G   YY+ L  I V G++L I  ++F       G ++D
Sbjct: 322 DFGPGSLPAVSAKLTTPMLVDNGPTFYYVGLTGIRVGGKLLSIPQSVFTTS----GTIVD 377

Query: 281 SGTDVTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFH 337
           SGT +T L   AY +LR      M     ++  + S  D  CY     S++   PTV   
Sbjct: 378 SGTVITRLPPAAYSSLRSAFASAMAERGYKKAPALSLLDT-CYDFTGMSEVA-IPTVSLL 435

Query: 338 FRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
           F+GGA L +    + Y       C+     + N    +  ++G    + + V YDIG+K 
Sbjct: 436 FQGGASLDVHASGIIYAASVSQACLGF---AGNKEDDDVGIVGNTQLKTFGVVYDIGKKV 492

Query: 398 KTFQRMDC 405
             F    C
Sbjct: 493 VGFCPGAC 500


>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 665

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 106/411 (25%), Positives = 175/411 (42%), Gaps = 57/411 (13%)

Query: 28  ISIARLAYLQEKIRSHNTYQAQI------LPSNDISNSLFYVNISIGQPPVPQFTAMDTG 81
           +S + L    E  R    +Q+Q+      L  + +SN  +   + IG PP      +DTG
Sbjct: 41  LSYSSLPPRVEDFRRRRLHQSQLPNAHMKLYDDLLSNGYYTTRLWIGTPPQEFALIVDTG 100

Query: 82  SSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQL 141
           S++ +V C  C+ C       F P  SSSY  + C+       P   C      C+Y + 
Sbjct: 101 STVTYVPCSTCKQCGKHQDPKFQPELSSSYKALKCN-------PDCNCDDEGKLCVYERR 153

Query: 142 YLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC-GFSTNRNF--KFSGIFGLGIGRSS 198
           Y     +S  +S + ++F N  ES +  Q  VFGC    T   F  +  GI GLG G+ S
Sbjct: 154 YAEMSSSSGVLSEDLISFGN--ESQLTPQRAVFGCENVETGDLFSQRADGIMGLGRGKLS 211

Query: 199 LVSQL------NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH------ 246
           +V QL         FS C G +           +G GA++    + P   +  H      
Sbjct: 212 VVDQLVDKGVIEDVFSLCYGGME----------VGGGAMVLGKISPPAGMVFSHSDPFRS 261

Query: 247 --YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRL 304
             Y I L+ + V G+ L +NP +F   +   G ++DSGT   +  KEA+ A++D ++  +
Sbjct: 262 PYYNIDLKQMHVAGKSLKLNPKVF---NGKHGTVLDSGTTYAYFPKEAFIAIKDAIIKEI 318

Query: 305 EGEQMRSYSWP--DKLCYHGI---MSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP-- 357
              +      P  D +C+ G    ++     FP +   F  G KL L  ++  ++     
Sbjct: 319 PSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIDMEFGNGQKLILSPENYLFRHTKVR 378

Query: 358 DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
            A+C+ + P   +      +L+G +  +   V YD    +  F + +C  L
Sbjct: 379 GAYCLGIFPDRDST-----TLLGGIVVRNTLVTYDRENDKLGFLKTNCSDL 424


>gi|21717175|gb|AAM76368.1|AC074196_26 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433292|gb|AAP54830.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 418

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 99/366 (27%), Positives = 162/366 (44%), Gaps = 38/366 (10%)

Query: 63  VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR 122
            N +IG PP P    +D    L+W  C  C  C  Q   +F P+ SS++   PC ++ C+
Sbjct: 69  ANFTIGTPPQPASAIIDVAGELVWTQCSMCSRCFKQDLPLFVPNASSTFRPEPCGTDACK 128

Query: 123 YFPYARCSAYKHRCIY--TQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
             P + CS+  + C Y  T    +G  T   V+T+         S      + FGC  ++
Sbjct: 129 SIPTSNCSS--NMCTYEGTINSKLGGHTLGIVATDTFAIGTATAS------LGFGCVVAS 180

Query: 181 NRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGA-IIDEGD 236
             +     SG+ GLG   SSLVSQ+N + FSYC+ + HD    +++L+LG  A +   G+
Sbjct: 181 GIDTMGGPSGLIGLGRAPSSLVSQMNITKFSYCL-TPHDSGK-NSRLLLGSSAKLAGGGN 238

Query: 237 ATPLQFIDG--------HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
           +T   F+          +Y I L+ I      + + P       SG  V++ +   +++L
Sbjct: 239 STTTPFVKTSPGDDMSQYYPIQLDGIKAGDAAIALPP-------SGNTVLVQTLAPMSFL 291

Query: 289 VKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHF-RGGAKLALE 347
           V  AY+AL+ EV   +      +   P  LC+     S+    P + F F +G A L + 
Sbjct: 292 VDSAYQALKKEVTKAVGAAPTATPLQPFDLCFPKAGLSNASA-PDLVFTFQQGAAALTVP 350

Query: 348 KDSMFYQ--PRPDAFCMAVNPASINNRYV---NFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
                          CMA+   S  N      N +++G + Q+  +   D+ +K  +F+ 
Sbjct: 351 PPKYLIDVGEEKGTVCMAILSTSWLNTTALDENLNILGSLQQENTHFLLDLEKKTLSFEP 410

Query: 403 MDCEVL 408
            DC  L
Sbjct: 411 ADCSSL 416


>gi|22831049|dbj|BAC15912.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50508281|dbj|BAD32130.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 453

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 111/378 (29%), Positives = 171/378 (45%), Gaps = 31/378 (8%)

Query: 54  NDISNSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSY 111
            D+ N   Y+  ++IG PP       DTGS L+W  C PC + C  Q   ++ PS S ++
Sbjct: 84  KDLPNGGEYIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTF 143

Query: 112 AYVPCDSEHCRYFPYARCSAYKH----RCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI 167
             +PC S        AR +         C Y Q Y  G  TS    +E  TF ++    +
Sbjct: 144 RVLPCSSALNLCAAEARLAGATPPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQV 202

Query: 168 HVQDVVFGCGFSTNRNFKFSG-IFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNKLI 225
            V  + FGC  +++ ++  S  + GLG G  SLVSQL +  FSYC+    D     + L+
Sbjct: 203 RVPGIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCLTPFQDTKS-KSTLL 261

Query: 226 LGDGAIIDEGDATPLQF-----------IDGHYYITLEAISVDGRMLDINPNIFK-RDDS 273
           LG  A     + T ++            +  +YY+ L  ISV    L I P  F  R D 
Sbjct: 262 LGPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRADG 321

Query: 274 GGGVMIDSGTDVTWLVKEAYEALRDEV--MIRLEGEQMRSYSWPDKLCYHGIMSSDLKG- 330
            GG++IDSGT +T LV  AY+ +R  V  +++L      + +  D LC+    SS     
Sbjct: 322 TGGLIIDSGTTITSLVDAAYKRVRAAVRSLVKLPVTDGSNATGLD-LCFALPSSSAPPAT 380

Query: 331 FPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
            P++  HF GGA + L  ++         +C+A+     +      S +G   QQ  ++ 
Sbjct: 381 LPSMTLHFGGGADMVLPVENYMIL-DGGMWCLAMR----SQTDGELSTLGNYQQQNLHIL 435

Query: 391 YDIGRKQKTFQRMDCEVL 408
           YD+ ++  +F    C  L
Sbjct: 436 YDVQKETLSFAPAKCSTL 453


>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
          Length = 480

 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 114/422 (27%), Positives = 186/422 (44%), Gaps = 50/422 (11%)

Query: 17  NPAHRVQRGINISIARLAYLQEKIRS----HNTYQA----QILPSNDISNSLFYVNISIG 68
           N   ++Q+ +     R+  +Q +IR+    HN+ +     QI  ++ I+       ++IG
Sbjct: 79  NWNRKLQKQLIFDDLRVRSMQNRIRAKVSGHNSSEQSSEIQIPLASGINLETLNYIVTIG 138

Query: 69  QPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYAR 128
                    +DTGS L WV C PC  C  Q G +F PS SSSY  + C+S  C+   +  
Sbjct: 139 LGNQNMTVIIDTGSDLTWVQCDPCMSCYSQQGPVFNPSNSSSYNSLLCNSSTCQNLQFTT 198

Query: 129 -----CSAYK-HRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNR 182
                C +     C +T  Y  G  T   +  E L+F       I V + VFGCG +   
Sbjct: 199 GNTEACESNNPSSCNHTVSYGDGSFTDGELGVEHLSFGG-----ISVSNFVFGCGRNNKG 253

Query: 183 NF-KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEGDA 237
            F   SGI GLG    S++SQ N++    FSYC+ +          L++G+ + + + + 
Sbjct: 254 LFGGVSGIMGLGRSNLSMISQTNTTFGGVFSYCLPTTDSG--ASGSLVIGNESSLFK-NL 310

Query: 238 TPLQF--------IDGHYYITLEAISVDGRMLDINPNIFKRDDS--GGGVMIDSGTDVTW 287
           TP+ +        +   Y + L  I V G  +        +D S   GG++IDSGT +T 
Sbjct: 311 TPIAYTSMVSNPQLSNFYVLNLTGIDVGGVAI--------QDTSFGNGGILIDSGTVITR 362

Query: 288 LVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALE 347
           L    Y AL+ E + +  G  +         C++ +   +    PT+  HF     L ++
Sbjct: 363 LAPSLYNALKAEFLKQFSGYPIAPALSILDTCFN-LTGIEEVSIPTLSMHFENNVDLNVD 421

Query: 348 KDSMFYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
              + Y P+  +  C+A+   S  N   + ++IG   Q+   V YD  + +  F R DC 
Sbjct: 422 AVGILYMPKDGSQVCLALASLSDEN---DMAIIGNYQQRNQRVIYDAKQSKIGFAREDCS 478

Query: 407 VL 408
            +
Sbjct: 479 FI 480


>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
 gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
          Length = 509

 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 105/354 (29%), Positives = 154/354 (43%), Gaps = 38/354 (10%)

Query: 69  QPPVPQFTAMDTGSSLLWVHCYPC--RDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYF-P 125
           +P V Q   +DT S + WV C+PC    C  Q   ++ PS+S S     C S  CR   P
Sbjct: 177 RPGVRQLMLLDTASDVAWVQCFPCPASQCYAQTDVLYDPSKSRSSESFACSSPTCRQLGP 236

Query: 126 YAR-CSAYKH---RCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTN 181
           YA  CS+  +   +C Y   Y  G  TS  +  +QL+   T +    V    FGC  +  
Sbjct: 237 YANGCSSSSNSAGQCQYRVRYPDGSTTSGTLVADQLSLSPTSQ----VPKFEFGCSHAAR 292

Query: 182 RNF---KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNK-LILGDGAIID 233
            +F   K +GI  LG G  SLVSQ ++     FSYC      P   H    +LG      
Sbjct: 293 GSFSRSKTAGIMALGRGVQSLVSQTSTKYGQVFSYCF----PPTASHKGFFVLGVPRRSS 348

Query: 234 EGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEA 292
              A TP+      Y + LEAI+V G+ LD+ P +F       G  +DS T +T L   A
Sbjct: 349 SRYAVTPMLKTPMLYQVRLEAIAVAGQRLDVPPTVF-----AAGAALDSRTVITRLPPTA 403

Query: 293 YEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHF-RGGAKLALEKDSM 351
           Y+ALR     ++   +  + +     CY     S +   PT+   F R GA + L+   +
Sbjct: 404 YQALRSAFRDKMSMYRPAAANGQLDTCYDFTGVSSIM-LPTISLVFDRTGAGVQLDPSGV 462

Query: 352 FYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
            +       C+A    + ++R     +IG +  Q   V Y++      F+R  C
Sbjct: 463 LF-----GSCLAFASTAGDDRATG--IIGFLQLQTIEVLYNVAGGSVGFRRGAC 509


>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
 gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
          Length = 482

 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 103/349 (29%), Positives = 153/349 (43%), Gaps = 40/349 (11%)

Query: 78  MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYAR-----CSAY 132
           +DTGS L WV C PC+ C  Q   +F PS S SY  V C S  C+    A      C + 
Sbjct: 150 VDTGSDLSWVQCQPCKRCYNQQDPVFNPSTSPSYRTVLCSSPTCQSLQSATGNLGVCGSN 209

Query: 133 KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGL 192
              C Y   Y  G  T   + TE L   N+      V + +FGCG   N    F G  GL
Sbjct: 210 PPSCNYVVNYGDGSYTRGELGTEHLDLGNSTA----VNNFIFGCG--RNNQGLFGGASGL 263

Query: 193 -GIGRS--SLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFID- 244
            G+GRS  SL+SQ ++     FSYC+            L++G  + + + + TP+ +   
Sbjct: 264 VGLGRSSLSLISQTSAMFGGVFSYCLPITETE--ASGSLVMGGNSSVYK-NTTPISYTRM 320

Query: 245 ------GHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRD 298
                   Y++ L  I+V    + +    F +D    G+MIDSGT +T L    Y+AL+D
Sbjct: 321 IPNPQLPFYFLNLTGITVGS--VAVQAPSFGKD----GMMIDSGTVITRLPPSIYQALKD 374

Query: 299 EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPD 358
           E + +  G            C++     +++  P ++ HF G A+L ++   +FY  + D
Sbjct: 375 EFVKQFSGFPSAPAFMILDTCFNLSGYQEVE-IPNIKMHFEGNAELNVDVTGVFYFVKTD 433

Query: 359 A--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           A   C+A+   S  N      +IG   Q+   V YD       F    C
Sbjct: 434 ASQVCLAIASLSYENE---VGIIGNYQQKNQRVIYDTKGSMLGFAAEAC 479


>gi|357460823|ref|XP_003600693.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355489741|gb|AES70944.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 431

 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 93/368 (25%), Positives = 161/368 (43%), Gaps = 37/368 (10%)

Query: 63  VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR 122
           +N+ IG PP  Q   +DTGS L W+ C+  +  +      F PS SS+++ +PC    C+
Sbjct: 77  INLPIGTPPQTQPMVLDTGSQLSWIQCHKKQPPTAS----FDPSLSSTFSILPCTHPLCK 132

Query: 123 Y----FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
                F           C Y+  Y  G      +  E+ TF      ++    ++ GC  
Sbjct: 133 PRIPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTFSR----SVSTPPLILGCAT 188

Query: 179 STNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCI------------GSLHDPDYLHNKLI 225
            +       GI G+ +GR S   Q   + FSYC+            GS +  +   +K  
Sbjct: 189 ESTDP---RGILGMNLGRLSFAKQSKITKFSYCVPPRQTRPGFTPTGSFYLGNNPSSKGF 245

Query: 226 LGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMIDSGTD 284
              G +       P  F    Y I +  I + G+ L+I+P +F+ D  G G  MIDSG++
Sbjct: 246 KYVGMMTSSRQRMP-NFDPLAYTIPMVGIRIAGKKLNISPAVFRADAGGSGQTMIDSGSE 304

Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRS---YSWPDKLCYHGIMSSDL-KGFPTVRFHFRG 340
            T+LV EAY+ +R +V +R  G +++    Y     +C+  + + ++ +    + F F  
Sbjct: 305 FTYLVSEAYDKVRAQV-VRAVGPRLKKGYVYGGVADMCFDSVKAVEIGRLIGEMVFEFER 363

Query: 341 GAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
           G ++ + K+ +         C+ +   S +      ++IG   QQ   V +D+ R++  F
Sbjct: 364 GVEVVIPKERVLADVGGGVHCVGI--GSSDKLGAASNIIGNFHQQNLWVEFDLVRRRVGF 421

Query: 401 QRMDCEVL 408
            + DC  L
Sbjct: 422 GKADCSRL 429


>gi|115472519|ref|NP_001059858.1| Os07g0533800 [Oryza sativa Japonica Group]
 gi|113611394|dbj|BAF21772.1| Os07g0533800 [Oryza sativa Japonica Group]
          Length = 458

 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 111/378 (29%), Positives = 171/378 (45%), Gaps = 31/378 (8%)

Query: 54  NDISNSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSY 111
            D+ N   Y+  ++IG PP       DTGS L+W  C PC + C  Q   ++ PS S ++
Sbjct: 89  KDLPNGGEYIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTF 148

Query: 112 AYVPCDSEHCRYFPYARCSAYKH----RCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI 167
             +PC S        AR +         C Y Q Y  G  TS    +E  TF ++    +
Sbjct: 149 RVLPCSSALNLCAAEARLAGATPPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQV 207

Query: 168 HVQDVVFGCGFSTNRNFKFSG-IFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNKLI 225
            V  + FGC  +++ ++  S  + GLG G  SLVSQL +  FSYC+    D     + L+
Sbjct: 208 RVPGIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCLTPFQDTKS-KSTLL 266

Query: 226 LGDGAIIDEGDATPLQF-----------IDGHYYITLEAISVDGRMLDINPNIFK-RDDS 273
           LG  A     + T ++            +  +YY+ L  ISV    L I P  F  R D 
Sbjct: 267 LGPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRADG 326

Query: 274 GGGVMIDSGTDVTWLVKEAYEALRDEV--MIRLEGEQMRSYSWPDKLCYHGIMSSDLKG- 330
            GG++IDSGT +T LV  AY+ +R  V  +++L      + +  D LC+    SS     
Sbjct: 327 TGGLIIDSGTTITSLVDAAYKRVRAAVRSLVKLPVTDGSNATGLD-LCFALPSSSAPPAT 385

Query: 331 FPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
            P++  HF GGA + L  ++         +C+A+     +      S +G   QQ  ++ 
Sbjct: 386 LPSMTLHFGGGADMVLPVENYMIL-DGGMWCLAMR----SQTDGELSTLGNYQQQNLHIL 440

Query: 391 YDIGRKQKTFQRMDCEVL 408
           YD+ ++  +F    C  L
Sbjct: 441 YDVQKETLSFAPAKCSTL 458


>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
          Length = 429

 Score =  111 bits (278), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 107/406 (26%), Positives = 175/406 (43%), Gaps = 62/406 (15%)

Query: 47  QAQILPSNDIS----------NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCS 96
           + Q+LPS  +           N    V++++G PP      +DTGS L W+HC      +
Sbjct: 32  KTQVLPSGSVPRPSSKLSFHHNVSLTVSLTVGSPPQTVTMVLDTGSELSWLHCKK----A 87

Query: 97  PQLGTIFYPSRSSSYAYVPCDSEHCRY----FPYARCSAYKHRCIYTQLYLIGPETSVFV 152
           P L ++F P RSSSY+ +PC S  CR     F        K  C     Y         +
Sbjct: 88  PNLHSVFDPLRSSSYSPIPCTSPTCRTRTRDFSIPVSCDKKKLCHAIISYADASSIEGNL 147

Query: 153 STEQLTFKNTDESTIHVQDVVFGC---GFSTN--RNFKFSGIFGLGIGRSSLVSQLN-SS 206
           +++     N+      +   +FGC   GFS+N   + K +G+ G+  G  S V+Q+    
Sbjct: 148 ASDTFHIGNSA-----IPATIFGCMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQMGLQK 202

Query: 207 FSYCIGSLHDPDYLHNKLILGDGAIIDEGD---------ATPLQFIDG-HYYITLEAISV 256
           FSYCI S  D   +   L+ G+ +               +TPL + D   Y + LE I V
Sbjct: 203 FSYCI-SGQDSSGI---LLFGESSFSWLKALKYTPLVQISTPLPYFDRVAYTVQLEGIKV 258

Query: 257 DGRMLDINPNIFKRDDSGGG-VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWP 315
              ML +  +++  D +G G  M+DSGT  T+L+   Y AL++E  +R     ++    P
Sbjct: 259 ANSMLQLPKSVYAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNE-FVRQTKASLKVLEDP 317

Query: 316 D-------KLCYH-GIMSSDLKGFPTVRFHFRGGAKLALEKDSMFY------QPRPDAFC 361
           +        LCY   +    L   PTV   FR GA++++  + + Y      +     +C
Sbjct: 318 NFVFQGAMDLCYRVPLTRRTLPPLPTVTLMFR-GAEMSVSAERLMYRVPGVIRGSDSVYC 376

Query: 362 MAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
                + +    V   +IG   QQ   + +D+ + +  F  + C +
Sbjct: 377 FTFGNSELLG--VESYIIGHHHQQNVWMEFDLAKSRVGFAEVRCXL 420


>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 431

 Score =  111 bits (278), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 99/360 (27%), Positives = 153/360 (42%), Gaps = 30/360 (8%)

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
           + +  + V   IG P      AMDT +   W+ C  C  CS    T+F   +S+++  V 
Sbjct: 91  VQSPTYIVRAKIGTPAQTMLLAMDTSNDAAWIPCSGCVGCS---STVFNNVKSTTFKTVG 147

Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
           C++  C+  P ++C      C +   Y  G  +     ++ +    TD     +    FG
Sbjct: 148 CEAPQCKQVPNSKCGG--SACAFNMTY--GSSSIAANLSQDVVTLATDS----IPSYTFG 199

Query: 176 C-GFSTNRNFKFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGA 230
           C   +T  +    G+ GLG G  SL+SQ      S+FSYC+ S    ++    L LG   
Sbjct: 200 CLTEATGSSIPPQGLLGLGRGPMSLLSQTQNLYQSTFSYCLPSFRSLNF-SGSLRLGPVG 258

Query: 231 IIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVT 286
                  TPL         YY+ L AI V  R++DI P+       +G G + DSGT  T
Sbjct: 259 QPKRIKTTPLLKNPRRSSLYYVNLMAIRVGRRVVDIPPSALAFNPTTGAGTIFDSGTVFT 318

Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
            LV  AY A+RD    R+    + S    D  CY   + +     PT+ F F  G  + L
Sbjct: 319 RLVAPAYTAVRDAFRKRVGNATVTSLGGFDT-CYTSPIVA-----PTITFMF-SGMNVTL 371

Query: 347 EKDSMFYQPRPDAF-CMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
             D++       +  C+A+  A  N   V  ++I  M QQ + + +D+   +    R  C
Sbjct: 372 PPDNLLIHSTASSITCLAMAAAPDNVNSV-LNVIANMQQQNHRILFDVPNSRLGVAREPC 430


>gi|125571687|gb|EAZ13202.1| hypothetical protein OsJ_03122 [Oryza sativa Japonica Group]
          Length = 453

 Score =  111 bits (278), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 122/412 (29%), Positives = 179/412 (43%), Gaps = 49/412 (11%)

Query: 24  RGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL------FYVNISIGQPPVPQFTA 77
           R +  S +RL+ L  +  S+    A   P       L      + ++  IG P       
Sbjct: 53  RAVQRSRSRLSMLAARAVSN----AGAAPGESAQTPLKKGSGDYAMSFGIGTPATGLSGE 108

Query: 78  MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCS------A 131
            DTGS L+W  C  C  CSP+    +YP+ SSS A+V C    C   P   CS      +
Sbjct: 109 ADTGSDLIWTKCGACARCSPRGSPSYYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGGS 168

Query: 132 YKHRCIYTQLYLIGPETSVFVS----TEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF- 186
               C Y   Y    +T  +      TE  TF    +       + FGC   +   F   
Sbjct: 169 GSGNCSYHYAYGNARDTHHYTEGILMTETFTFG---DDAAAFPGIAFGCTLRSEGGFGTG 225

Query: 187 SGIFGLGIGRSSLVSQLN-SSFSYCIGS-LHDPDYLH-NKLILGDGAIIDEGDATPL--- 240
           SG+ GLG G+ SLV+QLN  +F Y + S L  P  +    L    G   D   +TPL   
Sbjct: 226 SGLVGLGRGKLSLVTQLNVEAFGYRLSSDLSAPSPISFGSLADVTGGNGDSFMSTPLLTN 285

Query: 241 ---QFIDGHYYITLEAISVDGRMLDINPNIFK--RDDSGGGVMIDSGTDVTWLVKEAYEA 295
              Q +   YY+ L  ISV G+++ I    F   R    GGV+ DSGT +T L   AY  
Sbjct: 286 PVVQDLP-FYYVGLTGISVGGKLVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTL 344

Query: 296 LRDEVMIRLEGEQMRSYSWPDKL-CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS---- 350
           +RDE++ ++  ++    +  D L C+ G   S    FP++  HF GGA + L  ++    
Sbjct: 345 VRDELLSQMGFQKPPPAANDDDLICFTG--GSSTTTFPSMVLHFDGGADMDLSTENYLPQ 402

Query: 351 MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI-GRKQKTFQ 401
           M  Q    A C +V  +S        ++IG + Q  ++V +D+ G  +  FQ
Sbjct: 403 MQGQNGETARCWSVVKSS-----QALTIIGNIMQMDFHVVFDLSGNARMLFQ 449


>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 458

 Score =  111 bits (278), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 118/431 (27%), Positives = 183/431 (42%), Gaps = 55/431 (12%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHN--------TYQAQILP 52
           L HR    SP  +  E     + R   +   R  Y+Q K+  ++           A  LP
Sbjct: 57  LSHRHGPCSPAPSTVEPTMAELLRRDQL---RAKYIQAKLSVNSGSGTDGVQQSAAITLP 113

Query: 53  SNDIS--NSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSS 109
           +   S  ++L YV  +SIG P + Q   +DTGS + WVHC+        L   F P +SS
Sbjct: 114 TTLGSALDTLAYVITVSIGTPAMTQAVMIDTGSDVSWVHCHARAGAGSSL--FFDPGKSS 171

Query: 110 SYAYVPCDSEHC-RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIH 168
           +Y    C S  C R        +    C YT  Y  G  T+    ++ L   +T++    
Sbjct: 172 TYTPFSCSSAACTRLEGRDNGCSLNSTCQYTVRYGDGSNTTGTYGSDTLALNSTEK---- 227

Query: 169 VQDVVFGCGFSTNRNFKF-----SGIFGLGIGRSSLVSQL----NSSFSYCI-GSLHDPD 218
           V++  FGC  +++           G+ GLG G  SLVSQ      S+FSYC+  +     
Sbjct: 228 VENFQFGCSETSDPGEGLDEDQTDGLMGLGGGAPSLVSQTAATYGSAFSYCLPATTRSSG 287

Query: 219 YLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
           +L      G    +     TP+   +     Y++ L+ I+V G  + I+P +F       
Sbjct: 288 FLTLGASTGTSGFV----TTPMFRSRRAPTFYFVILQGINVGGDPVAISPTVFA-----A 338

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTV 334
           G ++DSGT +T L   AY AL       +    + R++S  D  C+      D    P V
Sbjct: 339 GSIMDSGTIITRLPPRAYSALSAAFRAGMRRYPRARAFSILDT-CFD-FTGQDNVSIPAV 396

Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
              F GGA + L+ D + Y       C+A  PA+        S+IG + Q+ + V +D+G
Sbjct: 397 ELVFSGGAVVDLDADGIMY-----GSCLAFAPATGGIG----SIIGNVQQRTFEVLHDVG 447

Query: 395 RKQKTFQRMDC 405
           +    F+   C
Sbjct: 448 QSVLGFRPGAC 458


>gi|125527370|gb|EAY75484.1| hypothetical protein OsI_03384 [Oryza sativa Indica Group]
          Length = 453

 Score =  111 bits (278), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 122/412 (29%), Positives = 179/412 (43%), Gaps = 49/412 (11%)

Query: 24  RGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL------FYVNISIGQPPVPQFTA 77
           R +  S +RL+ L  +  S+    A   P       L      + ++  IG P       
Sbjct: 53  RAVQRSRSRLSMLAARAVSN----AGAAPGESAQTPLKKGSGDYAMSFGIGTPATGLSGE 108

Query: 78  MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCS------A 131
            DTGS L+W  C  C  CSP+    +YP+ SSS A+V C    C   P   CS      +
Sbjct: 109 ADTGSDLIWTKCGACARCSPRGSPSYYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGGS 168

Query: 132 YKHRCIYTQLYLIGPETSVFVS----TEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF- 186
               C Y   Y    +T  +      TE  TF    +       + FGC   +   F   
Sbjct: 169 GSGNCSYHYAYGNARDTHHYTEGILMTETFTFG---DDAAAFPGIAFGCTLRSEGGFGTG 225

Query: 187 SGIFGLGIGRSSLVSQLN-SSFSYCIGS-LHDPDYLH-NKLILGDGAIIDEGDATPL--- 240
           SG+ GLG G+ SLV+QLN  +F Y + S L  P  +    L    G   D   +TPL   
Sbjct: 226 SGLVGLGRGKLSLVTQLNVEAFGYRLSSDLSAPSPISFGSLADVTGGNGDSFMSTPLLTN 285

Query: 241 ---QFIDGHYYITLEAISVDGRMLDINPNIFK--RDDSGGGVMIDSGTDVTWLVKEAYEA 295
              Q +   YY+ L  ISV G+++ I    F   R    GGV+ DSGT +T L   AY  
Sbjct: 286 PVVQDLP-FYYVGLTGISVGGKLVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTL 344

Query: 296 LRDEVMIRLEGEQMRSYSWPDKL-CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS---- 350
           +RDE++ ++  ++    +  D L C+ G   S    FP++  HF GGA + L  ++    
Sbjct: 345 VRDELLSQMGFQKPPPAANDDDLICFTG--GSSTTTFPSMVLHFDGGADMDLSTENYLPQ 402

Query: 351 MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI-GRKQKTFQ 401
           M  Q    A C +V  +S        ++IG + Q  ++V +D+ G  +  FQ
Sbjct: 403 MQGQNGETARCWSVVKSS-----QALTIIGNIMQMDFHVVFDLSGNARMLFQ 449


>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
          Length = 586

 Score =  111 bits (278), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 101/398 (25%), Positives = 169/398 (42%), Gaps = 57/398 (14%)

Query: 38  EKIRSHNTYQAQI------LPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYP 91
           E  R    +Q+Q+      L  + +SN  +   + IG PP      +DTGS++ +V C  
Sbjct: 47  EDFRRRRLHQSQLPNAHMKLYDDLLSNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCST 106

Query: 92  CRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVF 151
           C+ C       F P  S+SY  + C+       P   C      C+Y + Y     +S  
Sbjct: 107 CKQCGKHQDPKFQPELSTSYQALKCN-------PDCNCDDEGKLCVYERRYAEMSSSSGV 159

Query: 152 VSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF---KFSGIFGLGIGRSSLVSQL----- 203
           +S + ++F N  ES +  Q  VFGC      +    +  GI GLG G+ S+V QL     
Sbjct: 160 LSEDLISFGN--ESQLSPQRAVFGCENEETGDLFSQRADGIMGLGRGKLSVVDQLVDKGV 217

Query: 204 -NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH--------YYITLEAI 254
               FS C G +           +G GA++    + P   +  H        Y I L+ +
Sbjct: 218 IEDVFSLCYGGME----------VGGGAMVLGKISPPPGMVFSHSDPFRSPYYNIDLKQM 267

Query: 255 SVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSW 314
            V G+ L +NP +F   +   G ++DSGT   +  KEA+ A++D V+  +   +      
Sbjct: 268 HVAGKSLKLNPKVF---NGKHGTVLDSGTTYAYFPKEAFIAIKDAVIKEIPSLKRIHGPD 324

Query: 315 P--DKLCYHGI---MSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP--DAFCMAVNPA 367
           P  D +C+ G    ++     FP +   F  G KL L  ++  ++      A+C+ + P 
Sbjct: 325 PNYDDVCFSGAGRDVAEIHNFFPEIAMEFGNGQKLILSPENYLFRHTKVRGAYCLGIFPD 384

Query: 368 SINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
             +      +L+G +  +   V YD    +  F + +C
Sbjct: 385 RDST-----TLLGGIVVRNTLVTYDRENDKLGFLKTNC 417


>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 631

 Score =  111 bits (278), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 101/398 (25%), Positives = 169/398 (42%), Gaps = 57/398 (14%)

Query: 38  EKIRSHNTYQAQI------LPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYP 91
           E  R    +Q+Q+      L  + +SN  +   + IG PP      +DTGS++ +V C  
Sbjct: 47  EDFRRRRLHQSQLPNAHMKLYDDLLSNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCST 106

Query: 92  CRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVF 151
           C+ C       F P  S+SY  + C+       P   C      C+Y + Y     +S  
Sbjct: 107 CKQCGKHQDPKFQPELSTSYQALKCN-------PDCNCDDEGKLCVYERRYAEMSSSSGV 159

Query: 152 VSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF---KFSGIFGLGIGRSSLVSQL----- 203
           +S + ++F N  ES +  Q  VFGC      +    +  GI GLG G+ S+V QL     
Sbjct: 160 LSEDLISFGN--ESQLSPQRAVFGCENEETGDLFSQRADGIMGLGRGKLSVVDQLVDKGV 217

Query: 204 -NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH--------YYITLEAI 254
               FS C G +           +G GA++    + P   +  H        Y I L+ +
Sbjct: 218 IEDVFSLCYGGME----------VGGGAMVLGKISPPPGMVFSHSDPFRSPYYNIDLKQM 267

Query: 255 SVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSW 314
            V G+ L +NP +F   +   G ++DSGT   +  KEA+ A++D V+  +   +      
Sbjct: 268 HVAGKSLKLNPKVF---NGKHGTVLDSGTTYAYFPKEAFIAIKDAVIKEIPSLKRIHGPD 324

Query: 315 P--DKLCYHGI---MSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP--DAFCMAVNPA 367
           P  D +C+ G    ++     FP +   F  G KL L  ++  ++      A+C+ + P 
Sbjct: 325 PNYDDVCFSGAGRDVAEIHNFFPEIAMEFGNGQKLILSPENYLFRHTKVRGAYCLGIFPD 384

Query: 368 SINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
             +      +L+G +  +   V YD    +  F + +C
Sbjct: 385 RDST-----TLLGGIVVRNTLVTYDRENDKLGFLKTNC 417


>gi|357118074|ref|XP_003560784.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial
           [Brachypodium distachyon]
          Length = 452

 Score =  111 bits (277), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 105/371 (28%), Positives = 152/371 (40%), Gaps = 42/371 (11%)

Query: 54  NDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSYA 112
            ++    F V +  G P     T  DTGS L W+ C PC   C  Q   +F P++SSSYA
Sbjct: 105 TNLKTPEFVVVVGFGSPAQTSATMFDTGSDLSWIQCQPCSGHCYKQHDPVFDPAKSSSYA 164

Query: 113 YVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
            VPC +  C     A        C+Y   Y  G  T+  ++ E LTF ++ E T      
Sbjct: 165 VVPCGTTECA---AAGGECNGTTCVYGVEYGDGSSTTGVLARETLTFSSSSEFT----GF 217

Query: 173 VFGCGFSTNRNF-----KFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILG 227
           +FGCG +   +F           G     S         FSYC+     P Y      L 
Sbjct: 218 IFGCGETNLGDFGEVDGLLGLGRGSLSLSSQAAPAFGGIFSYCL-----PSYNTTPGYLS 272

Query: 228 DGAIIDEGDATPLQFI--------DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
            GA    G   P+Q+            Y+I L +I++ G +L + P+ F +     G ++
Sbjct: 273 IGATPVTGQ-IPVQYTAMVNKPDYPSFYFIELVSINIGGYVLPVPPSEFTKT----GTLL 327

Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFR 339
           DSGT +T+L   AY ALRD     ++G +          CY     S +   P V F+F 
Sbjct: 328 DSGTILTYLPPPAYTALRDRFKFTMQGSKPAPPYDELDTCYDFTGQSGIL-IPGVSFNFS 386

Query: 340 GGAKLALEKDSMFYQP---RPDAFCMAV--NPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
            GA   L    +   P   +P   C+A    PA +      FS++G   Q+   V YD+ 
Sbjct: 387 DGAVFNLNFFGIMTFPDDTKPAVGCLAFVSRPADM-----PFSVVGSTTQRSAEVIYDVP 441

Query: 395 RKQKTFQRMDC 405
            ++  F    C
Sbjct: 442 AQKIGFIPASC 452


>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 488

 Score =  111 bits (277), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 118/429 (27%), Positives = 172/429 (40%), Gaps = 62/429 (14%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRG--INISIARLAYLQEKIR-------SHNTYQAQIL 51
           ++H+    S  NN +     +      +N    R+ Y+  +I        S +   +  L
Sbjct: 73  VVHKHGPCSQLNNHDGKAKSKTPHSEILNQDKERVKYINSRISKNLGQDSSVSELDSVTL 132

Query: 52  PSND---ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSR 107
           P+     I +  ++V + +G P        DTGS L W  C PC R C  Q   IF PS+
Sbjct: 133 PAKSGSLIGSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDAIFDPSK 192

Query: 108 SSSYAYVPCDSEHCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT 162
           S+SY+ + C S  C     A      CSA    CIY   Y     +  + S E+L+   T
Sbjct: 193 STSYSNITCTSTLCTQLSTATGNEPGCSASTKACIYGIQYGDSSFSVGYFSRERLSVTAT 252

Query: 163 DESTIHVQDVVFGCGFSTNRNFKFS-GIFGLGIGRSSLVSQLNSS----FSYCIGSLHDP 217
           D     V + +FGCG +    F  S G+ GLG    S V Q  +     FSYC+ +    
Sbjct: 253 D----IVDNFLFGCGQNNQGLFGGSAGLIGLGRHPISFVQQTAAVYRKIFSYCLPATSSS 308

Query: 218 DYLHNKLILGDGAIIDEGDATPLQFI---DGHYYITLEAISVDGRMLDINPNIFKRDDSG 274
                +L  G          TP   I      Y + +  ISV G  L ++ + F    S 
Sbjct: 309 T---GRLSFGT-TTTSYVKYTPFSTISRGSSFYGLDITGISVGGAKLPVSSSTF----ST 360

Query: 275 GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL-----CYHGIMSSDLK 329
           GG +IDSGT +T L   AY ALR         + M  Y    +L     CY      DL 
Sbjct: 361 GGAIIDSGTVITRLPPTAYTALRSAFR-----QGMSKYPSAGELSILDTCY------DLS 409

Query: 330 GF-----PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQ 384
           G+     P + F F GG  + L    + Y       C+A    + N    + ++ G + Q
Sbjct: 410 GYEVFSIPKIDFSFAGGVTVQLPPQGILYVASAKQVCLAF---AANGDDSDVTIYGNVQQ 466

Query: 385 QFYNVGYDI 393
           +   V YD+
Sbjct: 467 KTIEVVYDV 475


>gi|363808270|ref|NP_001242239.1| uncharacterized protein LOC100801883 [Glycine max]
 gi|255641727|gb|ACU21134.1| unknown [Glycine max]
          Length = 475

 Score =  111 bits (277), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 93/358 (25%), Positives = 163/358 (45%), Gaps = 44/358 (12%)

Query: 40  IRSHNTYQ-AQILPSNDIS---------NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC 89
           +R+H+  +  +IL + D++           L++  + +G PP   +  +DTGS +LWV+C
Sbjct: 39  VRAHDVRRRGRILSAVDLNLGGNGLPTETGLYFTKLGLGSPPRDYYVQVDTGSDILWVNC 98

Query: 90  YPCRDC--SPQLG---TIFYPSRSSSYAYVPCDSEHCRYF---PYARCSAYKHRCIYTQL 141
             C  C     LG   T++ P  S +   V CD + C      P   C + +  C Y+  
Sbjct: 99  VECSRCPRKSDLGIDLTLYDPKGSETSDVVSCDQDFCSATFDGPIPGCKS-EIPCPYSIT 157

Query: 142 YLIGPETSVFVSTEQLTFKNTD---ESTIHVQDVVFGCG------FSTNRNFKFSGIFGL 192
           Y  G  T+ +   + LT+   +    ++     ++FGCG        ++      GI G 
Sbjct: 158 YGDGSATTGYYVQDYLTYNRINGNLRTSPQNSSIIFGCGAVQSGTLGSSSEEALDGIIGF 217

Query: 193 GIGRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFIDG 245
           G   SS++SQL +S      FS+C+      D +    I   G +++ +   TPL     
Sbjct: 218 GQANSSVLSQLAASGKVKKIFSHCL------DNVRGGGIFAIGEVVEPKVSTTPLVPRMA 271

Query: 246 HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLE 305
           HY + L++I VD  +L +  +IF   + G G +IDSGT + +L    Y+ L  +V+ R  
Sbjct: 272 HYNVVLKSIEVDTDILQLPSDIFDSVN-GKGTVIDSGTTLAYLPDIVYDELIQKVLARQP 330

Query: 306 GEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMA 363
           G ++       + C+    + D +GFP V+ HF+    L +      +Q +   +C+ 
Sbjct: 331 GLKLYLVEQQFR-CFLYTGNVD-RGFPVVKLHFKDSLSLTVYPHDYLFQFKDGIWCIG 386


>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 479

 Score =  111 bits (277), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 104/388 (26%), Positives = 156/388 (40%), Gaps = 49/388 (12%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGT-----------IFYPSRSS 109
           ++V   +G P  P     DTGS L WV C P +  +    +            F P +S 
Sbjct: 95  YFVRFRVGTPAQPFLLVADTGSDLTWVKCRPAKAAAASTNSSSSASASSPRRAFRPEKSK 154

Query: 110 SYAYVPCDSEHCRY---FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFK------ 160
           ++A +PC S+ C     F  + C      C Y   Y  G      V TE  T        
Sbjct: 155 TWAPIPCASDTCSKSLPFSLSTCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSSSSS 214

Query: 161 --NTDESTIHVQDVVFGC-GFSTNRNFKFS-GIFGLGIGRSSLVSQLNSSF----SYCIG 212
                     +Q +V GC G  T  +F+ S G+  LG    S  S   S F    SYC+ 
Sbjct: 215 SSKNKVKKAKLQGLVLGCTGSYTGPSFEASDGVLSLGYSNVSFASHAASRFGGRFSYCLV 274

Query: 213 SLHDPDYLHNKLILGDGAIIDE---------GDATPLQF---IDGHYYITLEAISVDGRM 260
               P    + L  G  + +              TPL     +   Y ++++AISVDG +
Sbjct: 275 DHLSPRNATSYLTFGPNSALSGPCPAAAGPGARQTPLVLDSRMRPFYDVSIKAISVDGEL 334

Query: 261 LDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCY 320
           L I  ++++ D  GGGV++DSGT +T L K AY A+   +  +L     R    P + CY
Sbjct: 335 LKIPRDVWEVD-GGGGVIVDSGTSLTVLAKPAYRAVVAALGKKL-ARFPRVAMDPFEYCY 392

Query: 321 HGIMSS---DLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFS 377
           +    S   +    P +  HF G A+L     S      P   C+ V        +   S
Sbjct: 393 NWTSPSRKDEGDDLPKLAVHFAGSARLEPPSKSYVIDAAPGVKCIGVQ----EGPWPGIS 448

Query: 378 LIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           +IG + QQ +   +D+  ++  F+R  C
Sbjct: 449 VIGNILQQEHLWEFDLKNRRLRFKRSRC 476


>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 494

 Score =  111 bits (277), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 96/314 (30%), Positives = 149/314 (47%), Gaps = 33/314 (10%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTI------FYPSRSSSYAY 113
           L++  + +G PP      +DTGS +LWV C  C +C PQ   +      F  + SS+   
Sbjct: 80  LYFTRVKLGTPPREFNVQIDTGSDVLWVTCSSCSNC-PQTSGLGIQLNYFDTTSSSTARL 138

Query: 114 VPCDSEHCR---YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT-DESTI-- 167
           VPC    C         +C    ++C Y   Y  G  TS +  ++   F     ES I  
Sbjct: 139 VPCSHPICTSQIQTTATQCPPQSNQCSYAFQYGDGSGTSGYYVSDTFYFDAVLGESLIAN 198

Query: 168 HVQDVVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHD 216
               +VFGC     G  T  +    GIFG G G  S++SQL+S       FS+C   L  
Sbjct: 199 SSAAIVFGCSTYQSGDLTKTDKAVDGIFGFGQGELSVISQLSSHGITPRVFSHC---LKG 255

Query: 217 PDYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
            D     L+LG+  I++ G   +PL     HY + L++I+V G++L I+P  F    S  
Sbjct: 256 EDSGGGILVLGE--ILEPGIVYSPLVPSQPHYNLDLQSIAVSGQLLPIDPAAFAT-SSNR 312

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
           G +ID+GT + +LV+EAY+     +   +      + +  ++ CY  + +S  + FP V 
Sbjct: 313 GTIIDTGTTLAYLVEEAYDPFVSAITAAVSQLATPTINKGNQ-CYL-VSNSVSEVFPPVS 370

Query: 336 FHFRGGAKLALEKD 349
           F+F GGA + L+ +
Sbjct: 371 FNFAGGATMLLKPE 384


>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score =  110 bits (276), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 107/400 (26%), Positives = 166/400 (41%), Gaps = 53/400 (13%)

Query: 15  NENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQ 74
           +E     +QR ++   AR A    +  +  +  A +  +  +    + V +S+G P V Q
Sbjct: 97  DEQRVEYIQRRVSGGGARGAKGALQQLATGSRSATVPTTMGVGTFQYVVTVSLGTPGVSQ 156

Query: 75  FTAMDTGSSLLWVHCYPCR--DCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAY 132
              +DTGS + WV C PC    C+ Q   +F P++SS+Y+ VPC ++ C           
Sbjct: 157 TVEVDTGSDVSWVQCKPCSAPACNSQRDQLFDPAKSSTYSAVPCGADACSELRIYEAGCS 216

Query: 133 KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFK-FSGIFG 191
             +C Y   Y  G  T+    ++ L     +     V   +FGCG +    F    G+  
Sbjct: 217 GSQCGYVVSYGDGSNTTGVYGSDTLALAPGNT----VGTFLFGCGHAQAGMFAGIDGLLA 272

Query: 192 LGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFID--- 244
           LG    SL SQ   +    FSYC+ S          L LG G     G AT         
Sbjct: 273 LGRQSMSLKSQAAGAYGGVFSYCLPSKQS---AAGYLTLG-GPTSASGFATTGLLTAWAA 328

Query: 245 -GHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIR 303
              Y + L  ISV G+ + +  + F      GG ++D+GT +T L   AY ALR      
Sbjct: 329 PTFYMVMLTGISVGGQQVAVPASAFA-----GGTVVDTGTVITRLPPTAYAALRSAFR-- 381

Query: 304 LEGEQMRSYSWPDK-------LCY----HGIMSSDLKGFPTVRFHFRGGAKLALEKDSMF 352
                +  Y +P          CY    +G+++      PTV   F GGA LALE   + 
Sbjct: 382 ---GAIAPYGYPSAPANGILDTCYDFSRYGVVT-----LPTVALTFSGGATLALEAPGIL 433

Query: 353 YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
                 + C+A  P   N    + +++G + Q+ + V +D
Sbjct: 434 -----SSGCLAFAP---NGGDGDAAILGNVQQRSFAVRFD 465


>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
 gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
 gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
          Length = 473

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 102/355 (28%), Positives = 163/355 (45%), Gaps = 51/355 (14%)

Query: 78  MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAY----- 132
           +DT S L WV C PC  C  Q G +F P+ S SYA +PC+S  C     A  SA      
Sbjct: 142 VDTASELTWVQCAPCASCHDQQGPLFDPASSPSYAVLPCNSSSCDALQVATGSAAGACGG 201

Query: 133 --KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF-KFSGI 189
             +  C YT  Y  G  +   ++ ++L+          +   VFGCG S    F   SG+
Sbjct: 202 GEQPSCSYTLSYRDGSYSQGVLAHDKLSLAGE-----VIDGFVFGCGTSNQGPFGGTSGL 256

Query: 190 FGLGIGRSSLVS----QLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQF--- 242
            GLG  + SL+S    Q    FSYC+  L + +     L+LGD   +   ++TP+ +   
Sbjct: 257 MGLGRSQLSLISQTMDQFGGVFSYCL-PLKESES-SGSLVLGDDTSVYR-NSTPIVYTTM 313

Query: 243 ----IDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALR 297
               + G  Y++ L  I++ G+ +         + S G V++DSGT +T LV   Y A++
Sbjct: 314 VSDPVQGPFYFVNLTGITIGGQEV---------ESSAGKVIVDSGTIITSLVPSVYNAVK 364

Query: 298 DEVMIRL-EGEQMRSYSWPDKLCYHGIMSSDLKGF-----PTVRFHFRGGAKLALEKDSM 351
            E + +  E  Q   +S  D  C+      +L GF     P+++F F G  ++ ++   +
Sbjct: 365 AEFLSQFAEYPQAPGFSILDT-CF------NLTGFREVQIPSLKFVFEGNVEVEVDSSGV 417

Query: 352 FYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
            Y    D+  + +  AS+ + Y   S+IG   Q+   V +D    Q  F +  C+
Sbjct: 418 LYFVSSDSSQVCLALASLKSEY-ETSIIGNYQQKNLRVIFDTLGSQIGFAQETCD 471


>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 499

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 94/316 (29%), Positives = 153/316 (48%), Gaps = 36/316 (11%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLGT---IFYPSRSSSYAYV 114
           L++  + +G P    +  +DTGS +LW++C  C +C  S  LG     F  + SS+ A V
Sbjct: 82  LYFTKVKLGSPAKDFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAALV 141

Query: 115 PCDSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
            C    C Y      + CS+  ++C YT  Y  G  T+ +  ++ + F         V +
Sbjct: 142 SCADPICSYAVQTATSGCSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSMVAN 201

Query: 172 ----VVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHD 216
               +VFGC     G  T  +    GIFG G G  S++SQL+S       FS+C   L  
Sbjct: 202 SSSTIVFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHC---LKG 258

Query: 217 PDYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
            +     L+LG+  I++     +PL     HY + L++I+V+G++L I+ N+F   ++  
Sbjct: 259 GENGGGVLVLGE--ILEPSIVYSPLVPSLPHYNLNLQSIAVNGQLLPIDSNVFATTNN-Q 315

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG--FPT 333
           G ++DSGT + +LV+EAY    D +   +        S  ++ CY   + S+  G  FP 
Sbjct: 316 GTIVDSGTTLAYLVQEAYNPFVDAITAAVSQFSKPIISKGNQ-CY---LVSNSVGDIFPQ 371

Query: 334 VRFHFRGGAKLALEKD 349
           V  +F GGA + L  +
Sbjct: 372 VSLNFMGGASMVLNPE 387


>gi|255566002|ref|XP_002523989.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536716|gb|EEF38357.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 448

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 114/437 (26%), Positives = 180/437 (41%), Gaps = 60/437 (13%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           LI RDS  SP+ N  E  A R         A++        S+   Q+++    + S   
Sbjct: 41  LIRRDSPNSPFYNALEAAATRSTNASQHYDAQIGRFNLMSDSYYASQSEL----NFSKGN 96

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + + IS+G PP       D    L W+ C  C+DC+   G  F+PS SS+Y    C+S  
Sbjct: 97  YLIKISVGTPPAEILALADITGDLTWLPCKTCQDCTKD-GFTFFPSESSTYTSAACESYQ 155

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGP--------ETSVFVSTEQLTFKNTDESTIHVQDV 172
           C+    A C      CI    YL GP             V+ + ++F ++    +   + 
Sbjct: 156 CQITNGAVCQT--KMCI----YLCGPLPQQRSSCTNKGLVAMDTISFHSSSGQALSYPNT 209

Query: 173 VFGCG-FSTNRNFKFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDP-----DYLHN 222
            F CG F  N ++  +GI GLG G  S+ SQ+    N +FS C+           ++   
Sbjct: 210 NFICGTFIDNWHYIGAGIVGLGRGLFSMTSQMKHLINGTFSQCLVPYSSKQSSKINFGLK 269

Query: 223 KLILGDGA----IIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVM 278
            ++ G+G     I D+G++       G Y++ LEA+SV G    +  N +    S   + 
Sbjct: 270 GVVSGEGVVSTPIADDGES-------GAYFLFLEAMSVGGNR--VANNFYSAPKS--NIY 318

Query: 279 IDSGTDVTWLVKEAYEALRDEVM-------IRLEGEQMRSYSWPDKLCYHGIMSSDLKGF 331
           ID  T  T L  + YE +  EV        I    E+  S      LCY      D    
Sbjct: 319 IDWRTTFTSLPHDFYENVEAEVRKAINLTPINYNNERKLS------LCYKSESDHDFDA- 371

Query: 332 PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASIN-NRYVNFSLIGMMAQQFYNVG 390
           P +  HF   A + L   + F +   +  C A    + N  + +  ++ G   Q  + VG
Sbjct: 372 PPITMHFT-NADVQLSPLNTFVRMDWNVVCFAFLDGTFNATKRITHAVYGSWQQMNFIVG 430

Query: 391 YDIGRKQKTFQRMDCEV 407
           YD+     +F++ DC +
Sbjct: 431 YDLKSSTVSFKQADCTL 447


>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 500

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 97/319 (30%), Positives = 144/319 (45%), Gaps = 33/319 (10%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
           L+Y  + +G PP   +  +DTGS +LWV C  C  C    G       F P  S++ + V
Sbjct: 82  LYYTRVQLGNPPKDFYVQIDTGSDVLWVSCNSCNGCPATSGLQIPLNFFDPGSSTTASLV 141

Query: 115 PCDSEHCRYFPYAR---CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFK---NTDESTIH 168
            C  + C     +    C    ++C Y   Y  G  TS +   + +      ++  ++  
Sbjct: 142 SCSDQICALGVQSSDSACFGQSNQCAYVFQYGDGSGTSGYYVMDMIHLDVVIDSSVTSNS 201

Query: 169 VQDVVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDP 217
              VVFGC  S     T  +    GIFG G    S++SQL+S       FS+C   L   
Sbjct: 202 SASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQDLSVISQLSSRGIAPKVFSHC---LKGD 258

Query: 218 DYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
           D     L+LG+  I++     TPL     HY + L++ISV+G++L I+P +F    S  G
Sbjct: 259 DSGGGILVLGE--IVEPNVVYTPLVPSQPHYNLNLQSISVNGQVLPISPAVFATSSS-QG 315

Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG-FPTVR 335
            +IDSGT + +L +EAY A    V   +  +  +S       CY  + SS +   FP V 
Sbjct: 316 TIIDSGTTLAYLAEEAYNAFVVAVT-NIVSQSTQSVVLKGNRCY--VTSSSVSDIFPQVS 372

Query: 336 FHFRGGAKLALEKDSMFYQ 354
            +F GGA L L       Q
Sbjct: 373 LNFAGGASLVLGAQDYLIQ 391


>gi|255571588|ref|XP_002526740.1| aspartic-type endopeptidase, putative [Ricinus communis]
 gi|223533929|gb|EEF35654.1| aspartic-type endopeptidase, putative [Ricinus communis]
          Length = 471

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 115/430 (26%), Positives = 193/430 (44%), Gaps = 36/430 (8%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDIS--N 58
           LIH  S  SP+  PN  P   ++  +  S AR   ++ KIRS     ++  P + IS  +
Sbjct: 47  LIHWSSPESPFYEPNLTPGELMRASVRTSRARGDRIR-KIRSSGISNSRKYPVSRISIID 105

Query: 59  SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYP--CRDCSPQLGTIFYPSRSSSYAYVPC 116
            ++ +  +IG PPV  +   DTGS+++W+ C    C +C  Q   +F P++SS+YA   C
Sbjct: 106 KVYVMKFNIGSPPVETYAIPDTGSNIVWIQCGSPICTNCYKQKIPLFNPTKSSTYAIRLC 165

Query: 117 DSEHCRYF-----PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTF-KNTDESTIHVQ 170
               C+        Y  C +    C Y   Y     +   +ST+ +TF ++  E   +  
Sbjct: 166 GHRECKQALWGLGEYLGCKSSVQVCRYHISYEDHSFSEGTISTDIITFPEHIAEFGNYSL 225

Query: 171 DVVFGCGFSTNR-------NFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGS--LHDPDYL 220
            + FGCG++ +        +F   G+ GLG   +SLV QL    FSYCI +  +  P+  
Sbjct: 226 RMFFGCGYNNSETPGQDPNSFTAPGVVGLGNEMASLVGQLTLGQFSYCISTPDVQKPNGT 285

Query: 221 HNKLILGDGAIIDEGDATPLQFIDGHY-YITLEAISVDGRMLDINPN-IFKRDDSG-GGV 277
             ++  G  A I          ++G Y +  ++ I VD   +   P  +F+  + G GG+
Sbjct: 286 I-EIRFGLAASISGHSTALANNLEGWYIFQNVDGIYVDDTKVKGYPEWVFQFAEGGIGGL 344

Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLE-GEQMRSYSWPD-KLCYHGIMSSDLKGFPTVR 335
           ++DSGT  T L   A +AL  E+  ++E     + +S  +  LCY+   +  L   P + 
Sbjct: 345 IMDSGTTYTELYFSALDALIGELKEQIELAPDTQDHSNSNYSLCYNA-ANFLLTYVPAIE 403

Query: 336 FHFRGG--AKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
             F     A       + +     D +C+A+   S        S+IG+   +   +GYD+
Sbjct: 404 LKFTDNKEAYFPFTLRNAWIDNGNDQYCLAMFGTS------GISIIGIYQHRDIKIGYDL 457

Query: 394 GRKQKTFQRM 403
                +F  M
Sbjct: 458 KYNLVSFTEM 467


>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
 gi|224030447|gb|ACN34299.1| unknown [Zea mays]
 gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
          Length = 512

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 105/395 (26%), Positives = 173/395 (43%), Gaps = 35/395 (8%)

Query: 36  LQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC 95
           L E  R   T ++ +     + ++ + +++ +G PP      MDTGS L W+ C PC DC
Sbjct: 125 LSESERVVATVESGVA----VGSAEYLMDVYVGTPPRRFQMIMDTGSDLNWLQCAPCLDC 180

Query: 96  SPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAY-------KHRCIYTQLYLIGPET 148
             Q G +F P+ SSSY  + C    C +       A        +  C Y   Y     +
Sbjct: 181 FEQRGPVFDPAASSSYRNLTCGDPRCGHVAPPEAPAPRACRRPGEDPCPYYYWYGDQSNS 240

Query: 149 SVFVSTEQLTFKNTDE-STIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS- 205
           +  ++ E  T   T   ++  V  VVFGCG      F   +G+ GLG G  S  SQL + 
Sbjct: 241 TGDLALESFTVNLTAPGASSRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAV 300

Query: 206 ----SFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFI---------DGHYYITLE 252
               +FSYC+   H  D + +K++ G+   +       L++          D  YY+ L 
Sbjct: 301 YGGHTFSYCLVD-HGSD-VASKVVFGEDDALALAAHPRLKYTAFAPASSPADTFYYVRLT 358

Query: 253 AISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS 311
            + V G +L+I+ + +   + G GG +IDSGT +++ V+ AY+ +R   + R+ G     
Sbjct: 359 GVLVGGELLNISSDTWDASEGGSGGTIIDSGTTLSYFVEPAYQVIRRAFIDRMSGSYPPV 418

Query: 312 YSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDA-FCMAVNPASIN 370
             +P     + +   +    P +   F  GA      ++ F +  PD   C+AV    + 
Sbjct: 419 PDFPVLSPCYNVSGVERPEVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAV----LG 474

Query: 371 NRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
                 S+IG   QQ ++V YD+   +  F    C
Sbjct: 475 TPRTGMSIIGNFQQQNFHVAYDLHNNRLGFAPRRC 509


>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
 gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
          Length = 429

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 109/380 (28%), Positives = 163/380 (42%), Gaps = 41/380 (10%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCY---------PCRDCSPQLGTIFYPSRSSSY 111
           + V+++ G PP       DTGS L+W+ C          P + CS +    F  S+S++ 
Sbjct: 53  YLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRR--PAFVASKSATL 110

Query: 112 AYVPCDSEHCRYFPYAR-----CS-AYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES 165
           + VPC +  C   P  R     CS A    C Y   Y  G  T+ F++ +  T  N    
Sbjct: 111 SVVPCSAAQCLLVPAPRGHGPACSPAAPVPCGYAYDYADGSSTTGFLARDTATISNGTSG 170

Query: 166 TIHVQDVVFGCGFSTNRNFKFS---GIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPD 218
              V+ V FGCG + N+   FS   G+ GLG G+ S  +Q  S    +FSYC+  L    
Sbjct: 171 GAAVRGVAFGCG-TRNQGGSFSGTGGVIGLGQGQLSFPAQSGSLFAQTFSYCLLDLEGGR 229

Query: 219 YLHNK--LILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDS 273
              +   L LG          TPL         YY+ + AI V  R+L +  + +  D  
Sbjct: 230 RGRSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWAIDVL 289

Query: 274 G-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS---YSWPDKLCYHGIMSSDLK 329
           G GG +IDSG+ +T+L   AY  L       +   ++ S   +    +LCY+   SS   
Sbjct: 290 GNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQGLELCYNVSSSSSSA 349

Query: 330 ----GFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQ 385
               GFP +   F  G  L L   +       D  C+A+ P         F+++G + QQ
Sbjct: 350 PANGGFPRLTIDFAQGLSLELPTGNYLVDVADDVKCLAIRP---TLSPFAFNVLGNLMQQ 406

Query: 386 FYNVGYDIGRKQKTFQRMDC 405
            Y+V +D    +  F R +C
Sbjct: 407 GYHVEFDRASARIGFARTEC 426


>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 509

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 112/438 (25%), Positives = 173/438 (39%), Gaps = 51/438 (11%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQ--ILPSN---D 55
           ++HR    SP   P + P+      ++   AR+  +   I +  +       LP+     
Sbjct: 91  VMHRHGPCSPLQTPGDAPSDADL--LDQDQARVDSILGMITNETSAVGPGVSLPAERGIS 148

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD--CSPQLGTIFYPSRSSSYAY 113
           +    + V++ +G P        DTGS L WV C PC    C  Q   +F PS SS+++ 
Sbjct: 149 VGTGNYVVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYKQQDPLFAPSDSSTFSA 208

Query: 114 VPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD-- 171
           V C +  CR       S    RC Y  +Y     T   +  + LT      +    ++  
Sbjct: 209 VRCGARECRARQSCGGSPGDDRCPYEVVYGDKSRTQGHLGNDTLTLGTMAPANASAENDN 268

Query: 172 ----VVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQ----LNSSFSYCI--GSLHDPDYL 220
                VFGCG +    F +  G+FGLG G+ SL SQ        FSYC+   S   P YL
Sbjct: 269 KLPGFVFGCGENNTGLFGQADGLFGLGRGKVSLSSQAAGKFGEGFSYCLPSSSSSAPGYL 328

Query: 221 HNKLILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDI-NPNIFKRDDSGGG 276
                +   A       TP+         YY+ L  I V GR + + +P +         
Sbjct: 329 SLGTPVPAPA---HAQFTPMLNRTTTPSFYYVKLVGIRVAGRAIRVSSPRVALP------ 379

Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK-------LCYHGIMSSDLK 329
           +++DSGT +T L   AY ALR   +       M  Y +           CY     ++  
Sbjct: 380 LIVDSGTVITRLAPRAYRALRAAFL-----SAMGKYGYKRAPRLSILDTCYDFTAHANAT 434

Query: 330 -GFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYN 388
              P V   F GGA ++++   + Y  +    C+A  P   N    +  ++G   Q+   
Sbjct: 435 VSIPAVALVFAGGATISVDFSGVLYVAKVAQACLAFAP---NGDGRSAGILGNTQQRTLA 491

Query: 389 VGYDIGRKQKTFQRMDCE 406
           V YD+ R++  F    C 
Sbjct: 492 VVYDVARQKIGFAAKGCS 509


>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
 gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 476

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 125/417 (29%), Positives = 186/417 (44%), Gaps = 53/417 (12%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           L HRD +  P N   ++P  R +  I+    R++ L   + S +  Q     S+ +S + 
Sbjct: 75  LFHRDKL--PLNFDPDHP-RRFKERISRDSKRVSSLLRLLSSGSDEQVTDFGSDVVSGTE 131

Query: 61  -----FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
                ++V I +G PP  Q+  +D+GS ++WV C PC +C  Q   +F P+ S++YA + 
Sbjct: 132 QGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVFDPAGSATYAGIS 191

Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
           CDS  C     A C+    RC Y   Y  G  T   ++ E LTF       + ++++  G
Sbjct: 192 CDSSVCDRLDNAGCN--DGRCRYEVSYGDGSYTRGTLALETLTFGR-----VLIRNIAIG 244

Query: 176 CGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGA 230
           CG      F   +G+ GLG G  S V QL      +FSYC+ S          L  G GA
Sbjct: 245 CGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTES--TGTLEFGRGA 302

Query: 231 IIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVT 286
           +       PL         YY+ L  + V G  + I   IF+  D G GGV++D+GT VT
Sbjct: 303 MPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGTAVT 362

Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL-----CYHGIMSSDLKGF-----PTVRF 336
            L   AYEA RD  +      Q  +    D++     CY+      L GF     PTV F
Sbjct: 363 RLPAPAYEAFRDTFI-----GQTANLPRSDRVSIFDTCYN------LNGFVSVRVPTVSF 411

Query: 337 HFRGGAKLAL-EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
           +F GG  L L  ++ +        FC A   ++        S+IG + Q+   +  D
Sbjct: 412 YFSGGPILTLPARNFLIPVDGEGTFCFAFAASA-----SGLSIIGNIQQEGIQISID 463


>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
          Length = 472

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 102/355 (28%), Positives = 163/355 (45%), Gaps = 51/355 (14%)

Query: 78  MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAY----- 132
           +DT S L WV C PC  C  Q G +F P+ S SYA +PC+S  C     A  SA      
Sbjct: 141 VDTASELTWVQCAPCASCHDQQGPLFDPASSPSYAVLPCNSSSCDALQVATGSAAGACGG 200

Query: 133 --KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF-KFSGI 189
             +  C YT  Y  G  +   ++ ++L+          +   VFGCG S    F   SG+
Sbjct: 201 GEQPSCSYTLSYRDGSYSQGVLAHDKLSLAGE-----VIDGFVFGCGTSNQGPFGGTSGL 255

Query: 190 FGLGIGRSSLVS----QLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQF--- 242
            GLG  + SL+S    Q    FSYC+  L + +     L+LGD   +   ++TP+ +   
Sbjct: 256 MGLGRSQLSLISQTMDQFGGVFSYCL-PLKESES-SGSLVLGDDTSVYR-NSTPIVYTTM 312

Query: 243 ----IDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALR 297
               + G  Y++ L  I++ G+ +         + S G V++DSGT +T LV   Y A++
Sbjct: 313 VSDPVQGPFYFVNLTGITIGGQEV---------ESSAGKVIVDSGTIITSLVPSVYNAVK 363

Query: 298 DEVMIRL-EGEQMRSYSWPDKLCYHGIMSSDLKGF-----PTVRFHFRGGAKLALEKDSM 351
            E + +  E  Q   +S  D  C+      +L GF     P+++F F G  ++ ++   +
Sbjct: 364 AEFLSQFAEYPQAPGFSILDT-CF------NLTGFREVQIPSLKFVFEGNVEVEVDSSGV 416

Query: 352 FYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
            Y    D+  + +  AS+ + Y   S+IG   Q+   V +D    Q  F +  C+
Sbjct: 417 LYFVSSDSSQVCLALASLKSEY-ETSIIGNYQQKNLRVIFDTLGSQIGFAQETCD 470


>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 507

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 96/323 (29%), Positives = 144/323 (44%), Gaps = 36/323 (11%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
           L++  + +G PP   +  +DTGS +LWV C  C  C    G     T F P  S++ A V
Sbjct: 83  LYFTRVQLGSPPKDFYVQIDTGSDVLWVSCSSCNGCPVTSGLQIPLTFFDPGSSTTAALV 142

Query: 115 PCDSEHCRYFPYAR---CSAYKHRCIYTQLYLIGPETS------------VFVSTEQLT- 158
            C  + C     +    CS+  ++C YT  Y  G  TS            + +S+ +L+ 
Sbjct: 143 SCSDQRCTAGIQSSDSLCSSRTNQCGYTFQYGDGSGTSGYYVADLMHLDTLLLSSGELSQ 202

Query: 159 FKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIG 212
              T +S++         G  T  +    GIFG G    S++SQL S       FS+C  
Sbjct: 203 ICQTYDSSVSFMCSTLQTGDLTKSDRAVDGIFGFGQQEMSVISQLASQGITPRVFSHC-- 260

Query: 213 SLHDPDYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRD 271
            L   D     L+LG+  I++     TPL     HY + L++ISV G+ L I+P++F   
Sbjct: 261 -LKGDDSGGGVLVLGE--IVEPNIVYTPLVPSQPHYNLYLQSISVAGQTLAIDPSVFGA- 316

Query: 272 DSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF 331
            S  G ++DSGT + +L + AY+     +   +     R+Y      CY  + SS    F
Sbjct: 317 SSNQGTIVDSGTTLAYLAEGAYDPFVSAIT-SVVSLNARTYLSKGNQCYL-VTSSVNDVF 374

Query: 332 PTVRFHFRGGAKLALEKDSMFYQ 354
           P V  +F GGA L L       Q
Sbjct: 375 PQVSLNFAGGASLILNPQDYLLQ 397


>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
 gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
          Length = 436

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 121/423 (28%), Positives = 188/423 (44%), Gaps = 56/423 (13%)

Query: 19  AHRVQRGINISIARLAYLQEKIRS-------HNTYQAQILPSNDIS-NSLFY-VNISIGQ 69
             +++R + +   R+  LQ KI++        +  + QI  ++ I   SL Y V + +G 
Sbjct: 36  GKKMRRALVLDNIRVQSLQLKIKAMTSSTTEQSVSETQIPLTSGIKLESLNYIVTVELGG 95

Query: 70  PPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARC 129
             +     +DTGS L WV C PCR C  Q G ++ PS SSSY  V C+S  C+    A  
Sbjct: 96  KNMSLI--VDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATS 153

Query: 130 SA---------YKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           ++          K  C Y   Y  G  T   +++E +   +T      +++ VFGCG   
Sbjct: 154 NSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDT-----KLENFVFGCG-RN 207

Query: 181 NRNFKFSGIFGLGIGRS--SLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
           N+         +G+GRS  SLVSQ     N  FSYC+ SL D       L  G+ + +  
Sbjct: 208 NKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDG--ASGSLSFGNDSSVYT 265

Query: 235 GDA----TPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
                  TPL     +   Y + L   S+ G  L       K    G G++IDSGT +T 
Sbjct: 266 NSTSVSYTPLVQNPQLRSFYILNLTGASIGGVEL-------KSSSFGRGILIDSGTVITR 318

Query: 288 LVKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
           L    Y+A++ E + +  G      YS  D  C++     D+   P ++  F+G A+L +
Sbjct: 319 LPPSIYKAVKIEFLKQFSGFPTAPGYSILDT-CFNLTSYEDIS-IPIIKMIFQGNAELEV 376

Query: 347 EKDSMFYQPRPDA--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
           +   +FY  +PDA   C+A+   S  N      +IG   Q+   V YD  +++      +
Sbjct: 377 DVTGVFYFVKPDASLVCLALASLSYENE---VGIIGNYQQKNQRVIYDTTQERLGIVGEN 433

Query: 405 CEV 407
           C V
Sbjct: 434 CRV 436


>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 487

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 112/361 (31%), Positives = 175/361 (48%), Gaps = 29/361 (8%)

Query: 57  SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC---RDCSPQLGTIFYPSRSSSYAY 113
           S + +   I +GQP    +   DTGS + W+ C PC     C  Q   IF P  SSSY+ 
Sbjct: 144 SGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSP 203

Query: 114 VPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
           + C+S+ C+    A C++    CIY   Y  G  T+  ++TE L+F N++     + ++ 
Sbjct: 204 LSCNSQQCKLLDKANCNS--DTCIYQVHYGDGSFTTGELATETLSFGNSNS----IPNLP 257

Query: 174 FGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAI 231
            GCG      F   +G+ GLG G  SL SQL  SSFSYC+ +L   D   +  +  +  +
Sbjct: 258 IGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNL---DSDSSSTLEFNSNM 314

Query: 232 IDEGDATPLQFID---GHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTW 287
             +   +PL   D    + Y+ +  ISV G+ L I+P  F+ D+SG GG+++DSGT ++ 
Sbjct: 315 PSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDSGTIISR 374

Query: 288 LVKEAYEALRDEVMIRLEGEQMRS--YSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLA 345
           L  + YE+LR E  ++L      +   S  D  CY+    S+++  PT+ F    G  L 
Sbjct: 375 LPSDVYESLR-EAFVKLTSSLSPAPGISVFDT-CYNFSGQSNVE-VPTIAFVLSEGTSLR 431

Query: 346 L-EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
           L  ++ +        +C+A     I  +  + S+IG   QQ   V YD+      F    
Sbjct: 432 LPARNYLIMLDTAGTYCLAF----IKTKS-SLSIIGSFQQQGIRVSYDLTNSLVGFSTNK 486

Query: 405 C 405
           C
Sbjct: 487 C 487


>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
 gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
          Length = 632

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 103/380 (27%), Positives = 170/380 (44%), Gaps = 56/380 (14%)

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
           ++N  +   + IG PP      +D+GS++ +V C  C  C       F P  SSSY+ V 
Sbjct: 84  LTNGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSSYSPVK 143

Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
           C+ +         C + K +C Y + Y     +S  +  + ++F    ES +  Q  VFG
Sbjct: 144 CNVD-------CTCDSDKKQCTYERQYAEMSSSSGVLGEDIVSFGR--ESELKPQRAVFG 194

Query: 176 CGFSTNRNFKFS----GIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLI 225
           C  S   +  FS    GI GLG G+ S++ QL      + SFS C G +           
Sbjct: 195 CENSETGDL-FSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMD---------- 243

Query: 226 LGDGAIIDEGDATPLQFIDGH--------YYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
           +G GA++  G   P   +  H        Y I L+ I V G+ L ++  +F   +S  G 
Sbjct: 244 IGGGAMVLGGVPAPSDMVFSHSDPLRSPYYNIELKEIHVAGKALRVDSRVF---NSKHGT 300

Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK----LCYHGI---MSSDLKG 330
           ++DSGT   +L ++A+ A +D V  ++    ++    PD     +C+ G    +S   + 
Sbjct: 301 VLDSGTTYAYLPEQAFVAFKDAVTSKV--HSLKKIRGPDPNYKDICFAGAGRNVSKLHEV 358

Query: 331 FPTVRFHFRGGAKLALEKDS-MFYQPRPD-AFCMAVNPASINNRYVNFSLIGMMAQQFYN 388
           FP V   F  G KL+L  ++ +F   + D A+C+ V      N     +L+G +  +   
Sbjct: 359 FPDVDMVFGNGQKLSLTPENYLFRHSKVDGAYCLGV----FQNGKDPTTLLGGIIVRNTL 414

Query: 389 VGYDIGRKQKTFQRMDCEVL 408
           V YD   ++  F + +C  L
Sbjct: 415 VTYDRHNEKIGFWKTNCSEL 434


>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 500

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 101/376 (26%), Positives = 174/376 (46%), Gaps = 41/376 (10%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLGT---IFYPSRSSSYAYV 114
           L++  + +G P    +  +DTGS +LW++C  C +C  S  LG     F  + SS+ A V
Sbjct: 82  LYFTKVKLGSPAKEFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAALV 141

Query: 115 PCDSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT--DESTI-- 167
            C    C Y      + CS+  ++C YT  Y  G  T+ +  ++ + F      +S +  
Sbjct: 142 SCGDPICSYAVQTATSECSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSVVAN 201

Query: 168 HVQDVVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHD 216
               ++FGC     G  T  +    GIFG G G  S++SQL+S       FS+C   L  
Sbjct: 202 SSSTIIFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHC---LKG 258

Query: 217 PDYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
            +     L+LG+  I++     +PL     HY + L++I+V+G++L I+ N+F   ++  
Sbjct: 259 GENGGGVLVLGE--ILEPSIVYSPLVPSQPHYNLNLQSIAVNGQLLPIDSNVFATTNN-Q 315

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG--FPT 333
           G ++DSGT + +LV+EAY      +   +        S  ++ CY   + S+  G  FP 
Sbjct: 316 GTIVDSGTTLAYLVQEAYNPFVKAITAAVSQFSKPIISKGNQ-CY---LVSNSVGDIFPQ 371

Query: 334 VRFHFRGGAKLALEKDS--MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
           V  +F GGA + L  +   M Y     A    +    +      F+++G +  +     Y
Sbjct: 372 VSLNFMGGASMVLNPEHYLMHYGFLDGAAMWCIGFQKVEQ---GFTILGDLVLKDKIFVY 428

Query: 392 DIGRKQKTFQRMDCEV 407
           D+  ++  +   DC +
Sbjct: 429 DLANQRIGWADYDCSL 444


>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
          Length = 480

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 102/401 (25%), Positives = 178/401 (44%), Gaps = 43/401 (10%)

Query: 38  EKIRSHNTYQ-AQILPSNDI---------SNSLFYVNISIGQPPVPQFTAMDTGSSLLWV 87
           + +R+H+T +  +IL + D+            L++  I IG P    +  +DTGS +LWV
Sbjct: 41  DALRAHDTRRHGRILSAVDLPLGGNGHPSEAGLYFAKIGIGTPSKDYYVQVDTGSDILWV 100

Query: 88  HCYPCRDCSPQ--LG---TIFYPSRSSSYAYVPCDSEHCRYF--PYARCSAYKHRCIYTQ 140
           +C  C  C  +  LG   T++    S++   V CD   C  +  P   C     +C+Y+ 
Sbjct: 101 NCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVGCDDNFCSLYDGPLPGCKP-GLQCLYSV 159

Query: 141 LYLIGPETSVFVSTEQLTFKNTD---ESTIHVQDVVFGCGFSTNRNFKFS-----GIFGL 192
           LY  G  T+ +   + + +       ++T     VVFGCG   +     S     GI G 
Sbjct: 160 LYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGF 219

Query: 193 GIGRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFIDG 245
           G   SS++SQL SS      FS+C+      D +    I   G +++ + + TPL     
Sbjct: 220 GQANSSMLSQLASSGKVKKVFSHCL------DNVDGGGIFAIGEVVEPKVNITPLVQNQA 273

Query: 246 HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLE 305
           HY + ++ I V G  LD+  + F+  D   G +IDSGT + +  +E Y  L ++++ +  
Sbjct: 274 HYNVVMKEIEVGGDPLDVPSDAFESGDR-KGTIIDSGTTLAYFPQEVYVPLIEKILSQQP 332

Query: 306 GEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAV- 364
             ++ +       C+    + D  GFPTV  HF     L +      +Q +   +C+   
Sbjct: 333 DLRLHTVEQA-FTCFDYTGNVD-DGFPTVTLHFDKSISLTVYPHEYLFQVKEFEWCIGWQ 390

Query: 365 NPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           N  +      + +L+G +      V YD+ ++   +   +C
Sbjct: 391 NSGAQTKDGKDLTLLGDLVLSNKLVVYDLEKQGIGWVEYNC 431


>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
 gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
          Length = 493

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 98/368 (26%), Positives = 164/368 (44%), Gaps = 48/368 (13%)

Query: 40  IRSHN-TYQAQILPSNDI---------SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC 89
           +R+H+ T   ++L + D+            L+Y  + +G PP   +  +DTGS +LWV+C
Sbjct: 57  LRAHDGTRHGRLLATADLPLGGLGLPTDTGLYYTEVRLGTPPKRFYVQVDTGSDILWVNC 116

Query: 90  YPCRDCSPQLG-----TIFYPSRSSSYAYVPCDSEHCRYFPYAR---CSAYKHRCIYTQL 141
             C  C  + G     T++ P  SS+ + V CD   C      R   CSA    C Y+  
Sbjct: 117 ITCDQCPHKSGLGLDLTLYDPKASSTGSTVMCDQGFCADTFGGRLPKCSA-NVPCEYSVT 175

Query: 142 YLIGPETSVFVSTEQLTFKNT---DESTIHVQDVVFGCGFSTNRNFKFS-----GIFGLG 193
           Y  G  T      + L F       ++      V+FGCG     +   S     GI G G
Sbjct: 176 YGDGSSTVGSFVNDALQFDQVTGDGQTQPANASVIFGCGAQQGGDLGSSSQALDGILGFG 235

Query: 194 IGRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFIDGH 246
              +S++SQL ++      F++C+      D +    I   G ++  +   TPL     H
Sbjct: 236 EANTSMLSQLATAGKVKKIFAHCL------DTIKGGGIFAIGDVVQPKVKTTPLVADKPH 289

Query: 247 YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRL-- 304
           Y + L+ I V G  L++  +IFK  +   G +IDSGT +T+L +  ++    +VM+ +  
Sbjct: 290 YNVNLKTIDVGGTTLELPADIFKPGEK-RGTIIDSGTTLTYLPELVFK----KVMLAVFN 344

Query: 305 EGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAV 364
           + + +  +   D LC+    S D  GFPT+ FHF     L +     F+    D +C+  
Sbjct: 345 KHQDITFHDVQDFLCFEYSGSVD-DGFPTLTFHFEDDLALHVYPHEYFFPNGNDVYCVGF 403

Query: 365 NPASINNR 372
              ++ ++
Sbjct: 404 QNGALQSK 411


>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
          Length = 478

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 88/326 (26%), Positives = 151/326 (46%), Gaps = 36/326 (11%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLG---TIFYPSRSSSYAYV 114
           L++  I +G P    +  +DTGS +LWV+C  C  C     +G   T++ P RS +  +V
Sbjct: 68  LYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFV 127

Query: 115 PCDSEHCRYFPYAR---CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD---ESTIH 168
            C+   C      R   C A ++ C Y+  Y  G  T+ +   + LTF   +    +   
Sbjct: 128 SCEHNFCSSTYEGRILGCKA-ENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNPHTATQ 186

Query: 169 VQDVVFGCG------FSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHD 216
              ++FGCG      F+++      GI G G   SS++SQL +S      FS+C+     
Sbjct: 187 NSSIIFGCGAAQSGTFASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCL----- 241

Query: 217 PDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
            D      I   G +++ +   TPL     HY + L+ I VDG +L +  + F  ++ G 
Sbjct: 242 -DTNVGGGIFSIGEVVEPKVKTTPLVPNMAHYNVILKNIEVDGDILQLPSDTFDSEN-GK 299

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL-CYHGIMSSDLKGFPTV 334
           G +IDSGT + +L +  Y+ L  +V+ +    +++ Y   ++  C+    + D  GFP V
Sbjct: 300 GTVIDSGTTLAYLPRIVYDQLMSKVLAKQ--PRLKVYLVEEQYSCFQYTGNVD-SGFPIV 356

Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAF 360
           + HF     L +      +  + D++
Sbjct: 357 KLHFEDSLSLTVYPHDYLFNYKGDSY 382


>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
          Length = 339

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 101/366 (27%), Positives = 155/366 (42%), Gaps = 51/366 (13%)

Query: 67  IGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPY 126
           +G PP P    ++ G+ L+W H  P  +C  Q    F P   S            R  P+
Sbjct: 1   MGTPPNPVKLKLENGNELIWNHSNPSPECFEQAFPYFEPLTFS------------RGLPF 48

Query: 127 ARCSAYK----HRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNR 182
           A C + K      C+YT  Y     T+ F+  ++ TF     S   V  V FGCG   N 
Sbjct: 49  ASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGAS---VPGVAFGCGLFNNG 105

Query: 183 NFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDP-------DYLHNKLILGDGAII 232
            FK   +GI G G G  SL SQL   +FS+C  ++          D   +    G GA+ 
Sbjct: 106 VFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTTITGAIPSTVLLDLPADLFSNGQGAV- 164

Query: 233 DEGDATPL-QFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVT 286
                TPL Q+         YY++L+ I+V    L +  + F   +  GG +IDSGT +T
Sbjct: 165 ---QTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALTNGTGGTIIDSGTSIT 221

Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
            L  + Y+ +RDE   +++   +   +     C+    S      P +  HF  GA + L
Sbjct: 222 SLPPQVYQVVRDEFAAQIKLPVVPGNATGHYTCFSA-PSQAKPDVPKLVLHFE-GATMDL 279

Query: 347 EKDSMFYQPRPDA----FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
            +++  ++   DA     C+A+N           ++IG   QQ  +V YD+     +F  
Sbjct: 280 PRENYVFEVPDDAGNSIICLAINKGD------ETTIIGNFQQQNMHVLYDLQNNMLSFVA 333

Query: 403 MDCEVL 408
             C+ L
Sbjct: 334 AQCDKL 339


>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 629

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 101/380 (26%), Positives = 167/380 (43%), Gaps = 56/380 (14%)

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
           ++N  +   + IG PP      +D+GS++ +V C  C  C       F P  SS+Y+ V 
Sbjct: 80  LTNGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVK 139

Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
           C ++         C + K +C Y + Y     +S  +  + ++F    ES +  Q  VFG
Sbjct: 140 CSAD-------CTCDSDKSQCTYERQYAEMSSSSGVLGEDIVSFGT--ESELKPQRAVFG 190

Query: 176 CGFSTNRNFKFS----GIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLI 225
           C  S   +  FS    GI GLG G+ S++ QL        SFS C G +           
Sbjct: 191 CENSETGDL-FSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMD---------- 239

Query: 226 LGDGAIIDEGDATPLQFI--------DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
           +G GA++      P   +          +Y I L+ I V G+ L ++P IF   DS  G 
Sbjct: 240 IGGGAMVLGAMPAPPDMVFSRSDPVRSPYYNIELKEIHVAGKALRLDPRIF---DSKHGT 296

Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK----LCYHGI---MSSDLKG 330
           ++DSGT   +L ++A+ A +D V  ++    ++    PD     +C+ G    +S   + 
Sbjct: 297 VLDSGTTYAYLPEQAFVAFKDAVTSKV--RPLKKIRGPDPNYKDICFAGAGRNVSQLSQA 354

Query: 331 FPTVRFHFRGGAKLALEKDSMFYQ--PRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYN 388
           FP V   F  G KL+L  ++  ++      A+C+ V      N     +L+G +  +   
Sbjct: 355 FPDVDMVFGDGQKLSLSPENYLFRHSKVEGAYCLGV----FQNGKDPTTLLGGIVVRNTL 410

Query: 389 VGYDIGRKQKTFQRMDCEVL 408
           V YD   ++  F + +C  L
Sbjct: 411 VTYDRHNEKIGFWKTNCSEL 430


>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 452

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 103/366 (28%), Positives = 162/366 (44%), Gaps = 36/366 (9%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSYAYVPCDSE 119
           +YV + +G P       +DTGSS  W+ C PC   C  Q   +F PS S +Y  VPC S 
Sbjct: 103 YYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCTIYCHIQEDPVFNPSASKTYKTVPCSSS 162

Query: 120 HCRYFPYA-----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
            C     A      CS   + C+Y   Y     +  ++S + LT   +      +   V+
Sbjct: 163 QCSSLKSATLNEPTCSKQSNACVYKASYGDSSFSLGYLSQDVLTLTPSQT----LSSFVY 218

Query: 175 GCGFSTNRNF-KFSGIFGLGIGRSSLVSQLN----SSFSYCI-GSLHDPDYLHNKLI-LG 227
           GCG      F +  GI GL     S++SQL+    ++FSYC+  S   P+      + +G
Sbjct: 219 GCGQDNQGLFGRTDGIIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSPKEGFLSIG 278

Query: 228 DGAIIDEGDA--TPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSG 282
             ++        TPL     +   Y+I LE+I+V GR L +  + +K        +IDSG
Sbjct: 279 TSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVP-----TIIDSG 333

Query: 283 TDVTWLVKEAYEALRDEVMIRLEG--EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRG 340
           T +T L    Y  L++  +  L    +Q    S  D  C+ G ++   +  P +R  F+G
Sbjct: 334 TVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDT-CFKGSLAGISEVAPDIRIIFKG 392

Query: 341 GAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
           GA L L+  +   +      C+A+  +S      + ++IG   QQ   V YD+G  +  F
Sbjct: 393 GADLQLKGHNSLVELETGITCLAMAGSS------SIAIIGNYQQQTVKVAYDVGNSRVGF 446

Query: 401 QRMDCE 406
               C+
Sbjct: 447 APGGCQ 452


>gi|356553263|ref|XP_003544977.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 445

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 93/372 (25%), Positives = 164/372 (44%), Gaps = 40/372 (10%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
             V + IG PP PQ   +DTGS L W+ C+   + +P   + F PS SSS+  +PC    
Sbjct: 88  LVVTLPIGTPPQPQQMVLDTGSQLSWIQCH---NKTPPTAS-FDPSLSSSFYVLPCTHPL 143

Query: 121 CRY----FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
           C+     F           C Y+  Y  G      +  E+L F  +  +      ++ GC
Sbjct: 144 CKPRVPDFTLPTTCDQNRLCHYSYFYADGTYAEGNLVREKLAFSPSQTT----PPLILGC 199

Query: 177 GFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
             S +R+ +  GI G+ +GR S   Q   + FSYC+ +    +  +N    G   + +  
Sbjct: 200 S-SESRDAR--GILGMNLGRLSFPFQAKVTKFSYCVPTRQPAN--NNNFPTGSFYLGNNP 254

Query: 236 DATPLQFIDG---------------HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMI 279
           ++   +++                  Y + ++ I + GR L+I P++F+ +  G G  M+
Sbjct: 255 NSARFRYVSMLTFPQSQRMPNLDPLAYTVPMQGIRIGGRKLNIPPSVFRPNAGGSGQTMV 314

Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS---YSWPDKLCYHGIMSSDLKGFPTVRF 336
           DSG++ T+LV  AY+ +R+E+ IR+ G +++    Y     +C+ G      +    V F
Sbjct: 315 DSGSEFTFLVDVAYDRVREEI-IRVLGPRVKKGYVYGGVADMCFDGNAMEIGRLLGDVAF 373

Query: 337 HFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
            F  G ++ + K+ +         C+ +  +       N  +IG   QQ   V +D+  +
Sbjct: 374 EFEKGVEIVVPKERVLADVGGGVHCVGIGRSERLGAASN--IIGNFHQQNLWVEFDLANR 431

Query: 397 QKTFQRMDCEVL 408
           +  F   DC  L
Sbjct: 432 RIGFGVADCSRL 443


>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
 gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
 gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
 gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 484

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 121/423 (28%), Positives = 188/423 (44%), Gaps = 56/423 (13%)

Query: 19  AHRVQRGINISIARLAYLQEKIRS-------HNTYQAQILPSNDIS-NSLFY-VNISIGQ 69
             +++R + +   R+  LQ KI++        +  + QI  ++ I   SL Y V + +G 
Sbjct: 84  GKKMRRALVLDNIRVQSLQLKIKAMTSSTTEQSVSETQIPLTSGIKLESLNYIVTVELGG 143

Query: 70  PPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARC 129
             +     +DTGS L WV C PCR C  Q G ++ PS SSSY  V C+S  C+    A  
Sbjct: 144 KNMSLI--VDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATS 201

Query: 130 SA---------YKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           ++          K  C Y   Y  G  T   +++E +   +T      +++ VFGCG   
Sbjct: 202 NSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDT-----KLENFVFGCG-RN 255

Query: 181 NRNFKFSGIFGLGIGRS--SLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
           N+         +G+GRS  SLVSQ     N  FSYC+ SL D       L  G+ + +  
Sbjct: 256 NKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDG--ASGSLSFGNDSSVYT 313

Query: 235 GDA----TPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
                  TPL     +   Y + L   S+ G  L       K    G G++IDSGT +T 
Sbjct: 314 NSTSVSYTPLVQNPQLRSFYILNLTGASIGGVEL-------KSSSFGRGILIDSGTVITR 366

Query: 288 LVKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
           L    Y+A++ E + +  G      YS  D  C++     D+   P ++  F+G A+L +
Sbjct: 367 LPPSIYKAVKIEFLKQFSGFPTAPGYSILDT-CFNLTSYEDIS-IPIIKMIFQGNAELEV 424

Query: 347 EKDSMFYQPRPDA--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
           +   +FY  +PDA   C+A+   S  N      +IG   Q+   V YD  +++      +
Sbjct: 425 DVTGVFYFVKPDASLVCLALASLSYENE---VGIIGNYQQKNQRVIYDTTQERLGIVGEN 481

Query: 405 CEV 407
           C V
Sbjct: 482 CRV 484


>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
          Length = 479

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 106/369 (28%), Positives = 158/369 (42%), Gaps = 48/369 (13%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSP---QLGTIFYPSRSSSYAYVPCD 117
           + +++ +G P + Q   +DTGS + WV C PC   SP     G +F P+ SS+YA   C 
Sbjct: 135 YVISVGLGSPAMTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFNCS 194

Query: 118 SEHCRYFPYA----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
           +  C     +     C A K RC Y   Y  G  T+   S++ LT   +D     V+   
Sbjct: 195 AAACAQLGDSGEANGCDA-KSRCQYIVKYGDGSNTTGTYSSDVLTLSGSDV----VRGFQ 249

Query: 174 FGCG---FSTNRNFKFSGIFGLGIGRSSLVSQLNS----SFSYCI-GSLHDPDYLH-NKL 224
           FGC         + K  G+ GLG    SLVSQ  +    SFSYC+  +     +L     
Sbjct: 250 FGCSHAELGAGMDDKTDGLIGLGGDAQSLVSQTAARYGKSFSYCLPATPASSGFLTLGAP 309

Query: 225 ILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDS 281
             G G        TP+   + +  +Y+  LE I+V G+ L ++P++F       G ++DS
Sbjct: 310 ASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLSPSVFA-----AGSLVDS 364

Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL-----CYHGIMSSDLKGFPTVRF 336
           GT +T L   AY AL            M  Y+  + L     C++     D    PTV  
Sbjct: 365 GTVITRLPPAAYAALSSAFR-----AGMTRYARAEPLGILDTCFN-FTGLDKVSIPTVAL 418

Query: 337 HFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
            F GGA + L+   +         C+A  P   +     F  IG + Q+ + V YD+G  
Sbjct: 419 VFAGGAVVDLDAHGIV-----SGGCLAFAPTRDDK---AFGTIGNVQQRTFEVLYDVGGG 470

Query: 397 QKTFQRMDC 405
              F+   C
Sbjct: 471 VFGFRAGAC 479


>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
 gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
          Length = 452

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 103/366 (28%), Positives = 162/366 (44%), Gaps = 36/366 (9%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSYAYVPCDSE 119
           +YV + +G P       +DTGSS  W+ C PC   C  Q   +F PS S +Y  VPC S 
Sbjct: 103 YYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCTIYCHIQEDPVFNPSASKTYKTVPCSSS 162

Query: 120 HCRYFPYA-----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
            C     A      CS   + C+Y   Y     +  ++S + LT   +      +   V+
Sbjct: 163 QCSSLKSATLNEPTCSKQSNACVYKASYGDSSFSLGYLSQDVLTLTPSQT----LSSFVY 218

Query: 175 GCGFSTNRNF-KFSGIFGLGIGRSSLVSQLN----SSFSYCI-GSLHDPDYLHNKLI-LG 227
           GCG      F +  GI GL     S++SQL+    ++FSYC+  S   P+      + +G
Sbjct: 219 GCGQDNQGLFGRTDGIIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSPKEGFLSIG 278

Query: 228 DGAIIDEGDA--TPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSG 282
             ++        TPL     +   Y+I LE+I+V GR L +  + +K        +IDSG
Sbjct: 279 TSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVP-----TIIDSG 333

Query: 283 TDVTWLVKEAYEALRDEVMIRLEG--EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRG 340
           T +T L    Y  L++  +  L    +Q    S  D  C+ G ++   +  P +R  F+G
Sbjct: 334 TVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDT-CFKGSLAGISEVAPDIRIIFKG 392

Query: 341 GAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
           GA L L+  +   +      C+A+  +S      + ++IG   QQ   V YD+G  +  F
Sbjct: 393 GADLQLKGHNSLVELETGITCLAMAGSS------SIAIIGNYQQQTVKVAYDVGNSRVGF 446

Query: 401 QRMDCE 406
               C+
Sbjct: 447 APGGCQ 452


>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
 gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 449

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 101/324 (31%), Positives = 152/324 (46%), Gaps = 47/324 (14%)

Query: 63  VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHC- 121
           V++++G PP      +DTGS L W+HC   ++ S    T F P  SSSY+ +PC S  C 
Sbjct: 75  VSLTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSST-FNPVWSSSYSPIPCSSSTCT 133

Query: 122 ---RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG- 177
              R FP          C  T  Y     +   ++T+     ++      + +VVFGC  
Sbjct: 134 DQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSSG-----IPNVVFGCMD 188

Query: 178 --FSTN--RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDG--- 229
             FS+N   + K +G+ G+  G  S VSQ+    FSYCI S +D       L+LGD    
Sbjct: 189 SIFSSNSEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCI-SEYD---FSGLLLLGDANFS 244

Query: 230 --------AIIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMI 279
                    +I+   +TPL + D   Y + LE I V  ++L I  ++F+ D +G G  M+
Sbjct: 245 WLAPLNYTPLIEM--STPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAGQTMV 302

Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIM---------SSDLKG 330
           DSGT  T+L+  AY ALRD  + +  G  +R Y       + G M          + L  
Sbjct: 303 DSGTQFTFLLGPAYTALRDHFLNKTAGS-LRVYE-DSNFVFQGAMDLCYRVPTNQTRLPP 360

Query: 331 FPTVRFHFRGGAKLALEKDSMFYQ 354
            P+V   FR GA++ +  D + Y+
Sbjct: 361 LPSVTLVFR-GAEMTVTGDRILYR 383


>gi|125533812|gb|EAY80360.1| hypothetical protein OsI_35532 [Oryza sativa Indica Group]
          Length = 428

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 99/384 (25%), Positives = 162/384 (42%), Gaps = 43/384 (11%)

Query: 35  YLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD 94
           Y+  K    +T   Q+     +  SL+ +++ +G P   Q   +DTGSS  WV C  C  
Sbjct: 56  YISNKTSRLSTQAVQVGWDRGLQTSLYVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDG 114

Query: 95  CSPQLGTIFYPSRSSSYAYVPCDSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVF 151
           C     T F  SRS++ A V C +  C      P+ + S     C +   Y  G  +   
Sbjct: 115 CHTNPRT-FLQSRSTTCAKVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGI 173

Query: 152 VSTEQLTFKNTDESTIHVQDVVFGC---GFSTNRNFKFSGIFGLGIGRSSLVSQLN---S 205
           +  + LTF +  +    +    FGC    F  N      G+ G+G G  S++ Q +    
Sbjct: 174 LYQDTLTFSDVQK----IPSFTFGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFD 229

Query: 206 SFSYCIGSLHDPDYLHNKLI--LGDGAIIDEGDATPLQFIDGH-----YYITLEAISVDG 258
            FSYC+          +K       G +    D    + +        +++ L AISVDG
Sbjct: 230 GFSYCLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDG 289

Query: 259 RMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRD---EVMIRLEGEQMRSYSWP 315
             L ++P+IF R     GV+ DSG++++++   A   L     E+++R    +  S    
Sbjct: 290 ERLGLSPSIFSRK----GVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEES---- 341

Query: 316 DKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQ---PRPDAFCMAVNPASINNR 372
           ++ CY  + S D    P +  HF  GA+  L    +F +      D +C+A  P      
Sbjct: 342 ERNCYD-MRSVDEGDMPAISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTE---- 396

Query: 373 YVNFSLIGMMAQQFYNVGYDIGRK 396
             + S+IG + Q    V YD+ R+
Sbjct: 397 --SVSIIGSLMQTSKEVVYDLKRQ 418


>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 487

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 112/361 (31%), Positives = 175/361 (48%), Gaps = 29/361 (8%)

Query: 57  SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC---RDCSPQLGTIFYPSRSSSYAY 113
           S + +   I +GQP    +   DTGS + W+ C PC     C  Q   IF P  SSSY+ 
Sbjct: 144 SGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSP 203

Query: 114 VPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
           + C+S+ C+    A C++    CIY   Y  G  T+  ++TE L+F N++     + ++ 
Sbjct: 204 LSCNSQQCKLLDKANCNS--DTCIYQVHYGDGSFTTGELATETLSFGNSNS----IPNLP 257

Query: 174 FGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAI 231
            GCG      F   +G+ GLG G  SL SQL  SSFSYC+ +L   D   +  +  +  +
Sbjct: 258 IGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNL---DSDSSSTLEFNSYM 314

Query: 232 IDEGDATPLQFID---GHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTW 287
             +   +PL   D    + Y+ +  ISV G+ L I+P  F+ D+SG GG+++DSGT ++ 
Sbjct: 315 PSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDSGTIISR 374

Query: 288 LVKEAYEALRDEVMIRLEGEQMRS--YSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLA 345
           L  + YE+LR E  ++L      +   S  D  CY+    S+++  PT+ F    G  L 
Sbjct: 375 LPSDVYESLR-EAFVKLTSSLSPAPGISVFDT-CYNFSGQSNVE-VPTIAFVLSEGTSLR 431

Query: 346 L-EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
           L  ++ +        +C+A     I  +  + S+IG   QQ   V YD+      F    
Sbjct: 432 LPARNYLIMLDTAGTYCLAF----IKTK-SSLSIIGSFQQQGIRVSYDLTNSIVGFSTNK 486

Query: 405 C 405
           C
Sbjct: 487 C 487


>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
          Length = 452

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 112/411 (27%), Positives = 169/411 (41%), Gaps = 35/411 (8%)

Query: 22  VQRGINISIARLAYL-----QEKIRSHNTYQ--AQILPSNDISNSLFYVNISIGQPPVPQ 74
           ++R +  S AR A L     + +    N  Q  A +LP     +  + V+++IG PP P 
Sbjct: 50  IRRAMRRSKARAAALSAVRNRARFSGKNEQQTPAGVLPVRPSGDLEYVVDLAIGTPPQPV 109

Query: 75  FTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKH 134
              +DTGS L+W  C PC  C  Q   +F P +S+SY  + C    C    +  C     
Sbjct: 110 SALLDTGSDLIWTQCAPCASCLSQPDPLFAPGQSASYEPMRCAGTLCSDILHHSCE-RPD 168

Query: 135 RCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV--FGCG-FSTNRNFKFSGIFG 191
            C Y   Y  G  T    +TE+ TF ++    +    V   FGCG  +       SGI G
Sbjct: 169 TCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVPLGFGCGSVNVGSLNNGSGIVG 228

Query: 192 LGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDA------TPLQFID 244
            G    SLVSQL+   FSYC+ S        + L+ G  +    GDA      TPL    
Sbjct: 229 FGRNPLSLVSQLSIRRFSYCLTSYA--SRRQSTLLFGSLSDGVYGDATGRVQTTPLLQSP 286

Query: 245 GH---YYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWL----VKEAYEAL 296
            +   YY+    ++V  R L I  + F  R D  GGV++DSGT +T L    + E   A 
Sbjct: 287 QNPTFYYVHFTGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALTLLPAAVLAEVVRAF 346

Query: 297 RDEVMIRLE--GEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQ 354
           R ++ +     G       +     +    S+     P +  HF+G       ++ +   
Sbjct: 347 RQQLRLPFANGGNPEDGVCFLVPAAWRRSSSTSQMPVPRMVLHFQGADLDLPRRNYVLDD 406

Query: 355 PRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
            R    C+ +  +  +      S IG + QQ   V YD+  +  +     C
Sbjct: 407 HRRGRLCLLLADSGDDG-----STIGNLVQQDMRVLYDLEAETLSIAPARC 452


>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
          Length = 470

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 103/328 (31%), Positives = 141/328 (42%), Gaps = 44/328 (13%)

Query: 48  AQILPSN---DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR--DCSPQLGTI 102
           A  +P+N   DI  S + V  S+G P + Q   +DTGS L WV C PC    C  Q   +
Sbjct: 121 AATVPANWGYDIGTSNYVVTASLGTPGMAQTLEVDTGSDLSWVQCKPCAAPSCYRQKDPL 180

Query: 103 FYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT 162
           F P++SSSYA VPC    C        +    +C Y   Y  G  T+   S++ LT    
Sbjct: 181 FDPAQSSSYAAVPCGRSACAGLGIYASACSAAQCGYVVSYGDGSNTTGVYSSDTLTL--- 237

Query: 163 DESTIHVQDVVFGCGFSTNRNFKFSGIFG-LGIGRS--SLVSQLNSS----FSYCIGSLH 215
             +   VQ  +FGCG + +    F+GI G LG GR   SLV Q   +    FSYC   L 
Sbjct: 238 -AANATVQGFLFGCGHAQSGGL-FTGIDGLLGFGREQPSLVQQTAGAYGGVFSYC---LP 292

Query: 216 DPDYLHNKLILGDGAIIDEGDAT----PLQFIDGHYYITLEAISVDGRMLDINPNIFKRD 271
                   L LG  + +  G +T    P      +Y + L  ISV G+ L +  + F   
Sbjct: 293 TKSSTTGYLTLGGPSGVAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQPLSVPASAFAA- 351

Query: 272 DSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS--DLK 329
               G ++D+GT +T L   AY ALR           M SY     +   GI+ +     
Sbjct: 352 ----GTVVDTGTVITRLPPAAYAALRSAFR-----SGMASYPSAPPI---GILDTCYSFA 399

Query: 330 GFPTVR-----FHFRGGAKLALEKDSMF 352
           G+ TV        F  GA + L  D + 
Sbjct: 400 GYGTVNLTSVALTFSSGATMTLGADGIM 427


>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
 gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 99/348 (28%), Positives = 155/348 (44%), Gaps = 39/348 (11%)

Query: 78  MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYAR-----CSAY 132
           +DTGS L WV C PC  C  Q   +F PS+S SY  V C+S  CR    A      C + 
Sbjct: 81  VDTGSDLSWVQCQPCNRCYNQQDPVFNPSKSPSYRTVLCNSLTCRSLQLATGNSGVCGSN 140

Query: 133 KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF-KFSGIFG 191
              C Y   Y  G  TS  V  E L   NT      V + +FGCG      F   SG+ G
Sbjct: 141 PPTCNYVVNYGDGSYTSGEVGMEHLNLGNTT-----VNNFIFGCGRKNQGLFGGASGLVG 195

Query: 192 LGIGRSSLVSQLN----SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQF----- 242
           LG    SL+SQ++      FSYC+ +          L++G  + + + + TP+ +     
Sbjct: 196 LGRTDLSLISQISPMFGGVFSYCLPTTEAE--ASGSLVMGGNSSVYK-NTTPISYTRMIH 252

Query: 243 --IDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEV 300
             +   Y++ L  I+V G  +++    F +D     ++IDSGT ++ L    Y+AL+ E 
Sbjct: 253 NPLLPFYFLNLTGITVGG--VEVQAPSFGKDR----MIIDSGTVISRLPPSIYQALKAEF 306

Query: 301 MIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDA 359
           + +  G     S+   D  C++     ++K  P ++ +F G A+L ++   +FY  + DA
Sbjct: 307 VKQFSGYPSAPSFMILDS-CFNLSGYQEVK-IPDIKMYFEGSAELNVDVTGVFYSVKTDA 364

Query: 360 --FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
              C+A+      +      +IG   Q+   + YD       F    C
Sbjct: 365 SQVCLAIASLPYEDE---VGIIGNYQQKNQRIIYDTKGSMLGFAEEAC 409


>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
           vinifera]
          Length = 561

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 101/401 (25%), Positives = 176/401 (43%), Gaps = 43/401 (10%)

Query: 38  EKIRSHNTYQ-AQILPSNDI---------SNSLFYVNISIGQPPVPQFTAMDTGSSLLWV 87
           + +R+H+T +  +IL + D+            L++  I IG P    +  +DTGS +LWV
Sbjct: 122 DALRAHDTRRHGRILSAVDLPLGGNGHPSEAGLYFAKIGIGTPSKDYYVQVDTGSDILWV 181

Query: 88  HCYPCRDCSPQ--LG---TIFYPSRSSSYAYVPCDSEHCRYF--PYARCSAYKHRCIYTQ 140
           +C  C  C  +  LG   T++    S++   V CD   C  +  P   C     +C+Y+ 
Sbjct: 182 NCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVGCDDNFCSLYDGPLPGCKP-GLQCLYSV 240

Query: 141 LYLIGPETSVFVSTEQLTFKNTD---ESTIHVQDVVFGCGFSTNRNFKFS-----GIFGL 192
           LY  G  T+ +   + + +       ++T     VVFGCG   +     S     GI G 
Sbjct: 241 LYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGF 300

Query: 193 GIGRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFIDG 245
           G   SS++SQL SS      FS+C+      D +    I   G +++ + + TPL     
Sbjct: 301 GQANSSMLSQLASSGKVKKVFSHCL------DNVDGGGIFAIGEVVEPKVNITPLVQNQA 354

Query: 246 HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLE 305
           HY + ++ I V G  LD+  + F+  D   G +IDSGT + +  +E Y  L ++++   +
Sbjct: 355 HYNVVMKEIEVGGDPLDVPSDAFESGDR-KGTIIDSGTTLAYFPQEVYVPLIEKILS--Q 411

Query: 306 GEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMA-V 364
              +R ++            +   GFPTV  HF     L +      +Q +   +C+   
Sbjct: 412 QPDLRLHTVEQAFTCFDYTGNVDDGFPTVTLHFDKSISLTVYPHEYLFQVKEFEWCIGWQ 471

Query: 365 NPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           N  +      + +L+G +      V YD+ ++   +   +C
Sbjct: 472 NSGAQTKDGKDLTLLGDLVLSNKLVVYDLEKQGIGWVEYNC 512


>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
 gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
          Length = 393

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 109/400 (27%), Positives = 171/400 (42%), Gaps = 49/400 (12%)

Query: 31  ARLAYLQEKIRSHNTYQAQILPSNDISNSL------FYVNISIGQPPVPQFTAMDTGSSL 84
           AR+ ++    R++++  + +  + D+ + L      + ++IS+G P        DTGS L
Sbjct: 21  ARVRWMAA--RANSSSWSSMAGTTDVESPLHPDGGGYVMDISVGTPGKRFRAIADTGSDL 78

Query: 85  LWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLI 144
           +WV   PC  CS   GTIF P +SS++  + C S+ C   P   C      C Y+  Y  
Sbjct: 79  VWVQSEPCTGCSG--GTIFDPRQSSTFREMDCSSQLCAELP-GSCEPGSSTCSYSYEYGS 135

Query: 145 GPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQL- 203
           G ET    + + ++   T + +        GCG   +      G+ GLG G  SL SQL 
Sbjct: 136 G-ETEGEFARDTISLGTTSDGSQKFPSFAVGCGMVNSGFDGVDGLVGLGQGPVSLTSQLS 194

Query: 204 ---NSSFSYCI-----GSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAIS 255
              +S FSYC+      S   P        L    I       P      +Y +T+  I+
Sbjct: 195 AAIDSKFSYCLVDINSQSESSPLLFGPSAALHGTGIQSTKITPPSDTYPTYYLLTVNGIA 254

Query: 256 VDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEAL--RDEVMI---RLEGEQMR 310
           V G+ +           S G  +IDSGT +T++    Y  +  R E M+   R++G  M 
Sbjct: 255 VAGQTM----------GSPGTTIIDSGTTLTYVPSGVYGRVLSRMESMVTLPRVDGSSMG 304

Query: 311 SYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFY--QPRPDAFCMAVNPAS 368
                  LCY    + + K FP +      GA +     + F       D  C+A+  AS
Sbjct: 305 L-----DLCYDRSSNRNYK-FPALTIRL-AGATMTPPSSNYFLVVDDSGDTVCLAMGSAS 357

Query: 369 INNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
                +  S+IG + QQ Y++ YD G  + +F +  CE L
Sbjct: 358 ----GLPVSIIGNVMQQGYHILYDRGSSELSFVQAKCESL 393


>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
          Length = 502

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 105/376 (27%), Positives = 152/376 (40%), Gaps = 57/376 (15%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYVPCDSE 119
           + VN+ +G P        DTGS L W  C PC + C  Q   IF PS S +Y+ + C S 
Sbjct: 154 YIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSTSKTYSNISCTSA 213

Query: 120 HCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
            C     A      CS+    C+Y   Y     T  F + ++LT    D         +F
Sbjct: 214 ACSSLKSATGNSPGCSS--SNCVYGIQYGDSSFTIGFFAKDKLTLTQNDV----FDGFMF 267

Query: 175 GCGFSTNRNF-KFSGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDG 229
           GCG +    F K +G+ GLG    S+V Q        FSYC+ +    +     L  G+G
Sbjct: 268 GCGQNNKGLFGKTAGLIGLGRDPLSIVQQTAQKFGKYFSYCLPTSRGSN---GHLTFGNG 324

Query: 230 AIIDEGDA-------TPLQFIDG--HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
             +    A       TP     G  +Y+I +  ISV G+ L I+P +F+      G +ID
Sbjct: 325 NGVKASKAVKNGITFTPFASSQGTAYYFIDVLGISVGGKALSISPMLFQN----AGTIID 380

Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL-----CYHGIMSSDLKGF---- 331
           SGT +T L   AY +L+         + M  Y     L     CY      DL  +    
Sbjct: 381 SGTVITRLPSTAYGSLKSAFK-----QFMSKYPTAPALSLLDTCY------DLSNYTSIS 429

Query: 332 -PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
            P + F+F G A + L+ + +         C+A    + N    +  + G + QQ   V 
Sbjct: 430 IPKISFNFNGNANVELDPNGILITNGASQVCLAF---AGNGDDDSIGIFGNIQQQTLEVV 486

Query: 391 YDIGRKQKTFQRMDCE 406
           YD+   Q  F    C 
Sbjct: 487 YDVAGGQLGFGYKGCS 502


>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
 gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
          Length = 332

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 96/345 (27%), Positives = 157/345 (45%), Gaps = 33/345 (9%)

Query: 78  MDTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYAR-----CSA 131
           +DTGSSL W+ C PC   C  Q   ++ PS S +Y  + C S  C     A      C  
Sbjct: 3   LDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKTYKKLSCASVECSRLKAATLNDPLCET 62

Query: 132 YKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF-KFSGIF 190
             + C+YT  Y     +  ++S + LT  ++      +    +GCG      F + +GI 
Sbjct: 63  DSNACLYTASYGDTSFSIGYLSQDLLTLTSSQT----LPQFTYGCGQDNQGLFGRAAGII 118

Query: 191 GLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH 246
           GL   + S+++QL++    +FSYC+ + +        L +G  +       TP+   D  
Sbjct: 119 GLARDKLSMLAQLSTKYGHAFSYCLPTANSGSSGGGFLSIGSISPTSY-KFTPM-LTDSK 176

Query: 247 ----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMI 302
               Y++ L AI+V GR LD+   +++        +IDSGT +T L    Y ALR   + 
Sbjct: 177 NPSLYFLRLTAITVSGRPLDLAAAMYRVP-----TLIDSGTVITRLPMSMYAALRQAFVK 231

Query: 303 RLEGEQMR--SYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAF 360
            +  +  +  +YS  D  C+ G + S +   P ++  F+GGA L L   S+  +      
Sbjct: 232 IMSTKYAKAPAYSILDT-CFKGSLKS-ISAVPEIKMIFQGGADLTLRAPSILIEADKGIT 289

Query: 361 CMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           C+A   +S  N+    ++IG   QQ YN+ YD+   +  F    C
Sbjct: 290 CLAFAGSSGTNQ---IAIIGNRQQQTYNIAYDVSTSRIGFAPGSC 331


>gi|195627138|gb|ACG35399.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 431

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 113/420 (26%), Positives = 170/420 (40%), Gaps = 35/420 (8%)

Query: 8   LSPYNN---PNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVN 64
           LS Y+N   P+ +P   +        ARL +L  K  S     +  + S     S + V 
Sbjct: 24  LSVYHNVHPPSPSPLESIIALARADDARLLFLSSKAASSGGVTSAPVASGQTPPS-YVVR 82

Query: 65  ISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYF 124
             +G P      A+DT +   W HC PC  C    G+ F P+ SSSYA +PC S+ C  F
Sbjct: 83  AGLGTPVQQLLLALDTSADATWSHCAPCDTC--PAGSRFIPASSSSYASLPCASDWCPLF 140

Query: 125 PYARC------SAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
               C      SA    C +++ +    +TS   S    T +   ++   +    FGC  
Sbjct: 141 EGQPCPANQDASAPLPACAFSKPFA---DTSFQASLGSDTLRLGKDA---IAGYAFGCVG 194

Query: 179 ST---NRNFKFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAI 231
           +      N    G+ GLG G  SL+SQ  S+    FSYC+ S     Y    L LG    
Sbjct: 195 AVAGPTTNLPKQGLLGLGRGPMSLLSQTGSTYNGVFSYCLPSYRS-YYFSGSLRLGAAGQ 253

Query: 232 IDEGDATPLQFIDGH----YYITLEAISVDGRMLDINPNIFKRDD-SGGGVMIDSGTDVT 286
                 TPL   + H    YY+ +  +SV    + +    F  D  +G G +IDSGT +T
Sbjct: 254 PRNVRYTPL-LTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTVIT 312

Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
                 Y ALR+E   ++      +       C++        G P V  H  GG  L L
Sbjct: 313 RWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFN-TDEVAAGGAPPVTLHMDGGVDLTL 371

Query: 347 E-KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
             ++++ +       C+A+  A   N     +++  + QQ   V  D+   +  F R  C
Sbjct: 372 PMENTLIHSSATPLACLAMAEAP-QNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 430


>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
          Length = 484

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 121/423 (28%), Positives = 188/423 (44%), Gaps = 56/423 (13%)

Query: 19  AHRVQRGINISIARLAYLQEKIRS-------HNTYQAQILPSNDIS-NSLFY-VNISIGQ 69
             +++R + +   R+  LQ KI++        +  + QI  ++ I   SL Y V + +G 
Sbjct: 84  GKKMRRALVLDNIRVQSLQLKIKAMTSSTTEQSVSETQIPLTSGIKLESLNYIVTVELGG 143

Query: 70  PPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARC 129
             +     +DTGS L WV C PCR C  Q G ++ PS SSSY  V C+S  C+    A  
Sbjct: 144 KNMSLI--VDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATS 201

Query: 130 SA---------YKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           ++          K  C Y   Y  G  T   +++E +   +T      +++ VFGCG   
Sbjct: 202 NSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDT-----KLENFVFGCG-RN 255

Query: 181 NRNFKFSGIFGLGIGRS--SLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
           N+         +G+GRS  SLVSQ     N  FSYC+ SL D       L  G+ + +  
Sbjct: 256 NKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDG--ASGSLSFGNDSSVYT 313

Query: 235 GDA----TPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
                  TPL     +   Y + L   S+ G  L       K    G G++IDSGT +T 
Sbjct: 314 NSTSVSYTPLVQNPQLRSFYILNLTGASIGGVEL-------KSSSFGRGILIDSGTVITR 366

Query: 288 LVKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
           L    Y+A++ E + +  G      YS  D  C++     D+   P ++  F+G A+L +
Sbjct: 367 LPPSIYKAVKIEFLKQFSGFPTAPGYSILDT-CFNLTSYEDIS-IPIIKMIFQGNAELEV 424

Query: 347 EKDSMFYQPRPDA--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
           +   +FY  +PDA   C+A+   S  N      +IG   Q+   V YD  +++      +
Sbjct: 425 DVTGVFYFVKPDASLVCLALASLSYENE---VGIIGNYQQKNQRVIYDSTQERLGIVGEN 481

Query: 405 CEV 407
           C V
Sbjct: 482 CRV 484


>gi|168022164|ref|XP_001763610.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685103|gb|EDQ71500.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 308

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 84/290 (28%), Positives = 131/290 (45%), Gaps = 47/290 (16%)

Query: 54  NDI-SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCS-----PQLGTIFYPSR 107
           NDI +  L+Y  IS+G PP   +  +DTGS++ WV C PC  C      P   + F P +
Sbjct: 33  NDIFAMGLYYTRISLGTPPQQFYVDVDTGSNVAWVKCAPCTGCEHSGDVPVPMSTFDPRK 92

Query: 108 SSSYAYVPCDSEHCRYF-PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKN--TDE 164
           S++   + C    C       +CS  +  C Y+ LY  G  T+ +   +  TF    +D 
Sbjct: 93  STTKISISCTDAECGVLNKKLQCSPERLSCPYSLLYGDGSSTAGYYLNDVFTFNQVPSDN 152

Query: 165 STIH--VQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYC------ 210
           ST       +VFGCG +   ++   G+ G G    SL +QL         F++C      
Sbjct: 153 STAKSGTARLVFGCGGTQTGSWSVDGLLGFGPTTVSLPNQLAQQNISVNIFAHCLQGDVS 212

Query: 211 ------IGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDIN 264
                 IG++ +PD ++                TP+ F + HY + L  I + GR +   
Sbjct: 213 GRGSLVIGTIREPDLVY----------------TPMVFGEDHYNVQLLNIGISGRNV-TT 255

Query: 265 PNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSW 314
           P  F  + + GGV+IDSGT +T+LV+ AY+  R  V +  +   +    W
Sbjct: 256 PASFDLEYT-GGVIIDSGTTLTYLVQPAYDEFRRGVSVFKQSSDLAVAFW 304


>gi|356499109|ref|XP_003518386.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 428

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 103/392 (26%), Positives = 168/392 (42%), Gaps = 56/392 (14%)

Query: 52  PSNDIS---NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRS 108
           PS  +S   N    V++++G PP      +DTGS L W+HC       P L + F P  S
Sbjct: 48  PSRKLSFHHNVTLTVSLTVGSPPQNVTMVLDTGSELSWLHCKKL----PNLNSTFNPLLS 103

Query: 109 SSYAYVPCDSEHCR------YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT 162
           SSY   PC+S  C         P A C      C     Y         ++ E  +    
Sbjct: 104 SSYTPTPCNSSICTTRTRDLTIP-ASCDPNNKLCHVIVSYADASSAEGTLAAETFSLAGA 162

Query: 163 DESTIHVQDVVFGC----GFST--NRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLH 215
            +        +FGC    G+++  N + K +G+ G+  G  SLV+Q++   FSYCI    
Sbjct: 163 AQP-----GTLFGCMDSAGYTSDINEDSKTTGLMGMNRGSLSLVTQMSLPKFSYCISG-- 215

Query: 216 DPDYLHNKLILGDGA----------IIDEGDATPLQFIDGHYYITLEAISVDGRMLDINP 265
             D L   L+LGDG           ++    ++P  F    Y + LE I V  ++L +  
Sbjct: 216 -EDAL-GVLLLGDGTDAPSPLQYTPLVTATTSSPY-FNRVAYTVQLEGIKVSEKLLQLPK 272

Query: 266 NIFKRDDSGGG-VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMR------SYSWPDKL 318
           ++F  D +G G  M+DSGT  T+L+   Y +L+DE + + +G   R       +     L
Sbjct: 273 SVFVPDHTGAGQTMVDSGTQFTFLLGSVYSSLKDEFLEQTKGVLTRIEDPNFVFEGAMDL 332

Query: 319 CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDA---FCMAVNPASINNRYVN 375
           CYH   S      P V   F  GA++ +  + + Y+    +   +C     + +    + 
Sbjct: 333 CYHAPAS--FAAVPAVTLVFS-GAEMRVSGERLLYRVSKGSDWVYCFTFGNSDLLG--IE 387

Query: 376 FSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
             +IG   QQ   + +D+ + +  F +  C++
Sbjct: 388 AYVIGHHHQQNVWMEFDLLKSRVGFTQTTCDL 419


>gi|18421660|ref|NP_568551.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|10177438|dbj|BAB10671.1| unnamed protein product [Arabidopsis thaliana]
 gi|15809850|gb|AAL06853.1| AT5g37540/mpa22_p_70 [Arabidopsis thaliana]
 gi|20260182|gb|AAM12989.1| unknown protein [Arabidopsis thaliana]
 gi|23197748|gb|AAN15401.1| unknown protein [Arabidopsis thaliana]
 gi|332006821|gb|AED94204.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 442

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 97/370 (26%), Positives = 171/370 (46%), Gaps = 34/370 (9%)

Query: 63  VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG--TIFYPSRSSSYAYVPCDSEH 120
           +++ IG P   Q   +DTGS L W+ C+P +   P     T F PS SSS++ +PC    
Sbjct: 82  LSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHPL 141

Query: 121 CRY----FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
           C+     F           C Y+  Y  G      +  E+ TF N+  +      ++ GC
Sbjct: 142 CKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQTT----PPLILGC 197

Query: 177 GFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHN--KLILGDG---- 229
              +       GI G+ +GR S +SQ   S FSYCI +  +   L +     LGD     
Sbjct: 198 AKESTDE---KGILGMNLGRLSFISQAKISKFSYCIPTRSNRPGLASTGSFYLGDNPNSR 254

Query: 230 -----AIIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMIDSG 282
                +++    +  +  +D   Y + L+ I +  + L+I  ++F+ D  G G  M+DSG
Sbjct: 255 GFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFRPDAGGSGQTMVDSG 314

Query: 283 TDVTWLVKEAYEALRDEVMIRLEGEQMRS---YSWPDKLCYHGIMSSDL-KGFPTVRFHF 338
           ++ T LV  AY+ +++E+ +RL G +++    Y     +C+ G  S ++ +    + F F
Sbjct: 315 SEFTHLVDVAYDKVKEEI-VRLVGSRLKKGYVYGSTADMCFDGNHSMEIGRLIGDLVFEF 373

Query: 339 RGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQK 398
             G ++ +EK S+         C+ +  +S+     N  +IG + QQ   V +D+  ++ 
Sbjct: 374 GRGVEILVEKQSLLVNVGGGIHCVGIGRSSMLGAASN--IIGNVHQQNLWVEFDVTNRRV 431

Query: 399 TFQRMDCEVL 408
            F + +C +L
Sbjct: 432 GFSKAECRLL 441


>gi|296089645|emb|CBI39464.3| unnamed protein product [Vitis vinifera]
          Length = 477

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 105/409 (25%), Positives = 178/409 (43%), Gaps = 62/409 (15%)

Query: 40  IRSH-NTYQAQILPSNDI---------SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC 89
           +++H N+ Q +IL   D+         +  L+Y  I IG P    +  +DTGS ++WV+C
Sbjct: 67  LKAHDNSRQLRILAGVDLPLGGTGRPEAVGLYYAKIGIGTPARDYYVQVDTGSDIMWVNC 126

Query: 90  YPCRDCSPQ--LG---TIFYPSRSSSYAYVPCDSEHCRYF---PYARCSAYKHRCIYTQL 141
             C +C  +  LG   T++    S +   V CD + C      P + C A    C YT++
Sbjct: 127 IQCNECPKKSSLGMELTLYDIKESLTGKLVSCDQDFCYAINGGPPSYCIA-NMSCSYTEI 185

Query: 142 YLIGPETSVFVSTEQLTFKNTD---ESTIHVQDVVFGCGFSTNRNFK----FSGIFGLGI 194
           Y  G  +  +   + + +       E+T     V+FGC  + + +        GI G G 
Sbjct: 186 YADGSSSFGYFVRDIVQYDQVSGDLETTSANGSVIFGCSATQSGDLSSEEALDGILGFGK 245

Query: 195 GRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFIDGHY 247
             +S++SQL SS      F++C+      D L+   I   G I+  + + TPL     HY
Sbjct: 246 SNTSMISQLASSGKVRKMFAHCL------DGLNGGGIFAIGHIVQPKVNTTPLVPNQTHY 299

Query: 248 YITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGE 307
            + ++A+ V G  L++  ++F   D   G +IDSGT + +L +  Y+ L  ++       
Sbjct: 300 NVNMKAVEVGGYFLNLPTDVFDVGDK-KGTIIDSGTTLAYLPEVVYDQLLSKI------- 351

Query: 308 QMRSYSWPDKLCYHGI--------MSSDL-KGFPTVRFHFRGGAKLALEKDSMFYQPRPD 358
               +SW   L  H I         S  L  GFP V FHF     L +      +     
Sbjct: 352 ----FSWQSDLKVHTIHDQFTCFQYSESLDDGFPAVTFHFENSLYLKVHPHEYLFS-YDG 406

Query: 359 AFCMAVNPASINNR-YVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
            +C+    + + +R   N +L+G +A     V YD+  +   +   +C+
Sbjct: 407 LWCIGWQNSGMQSRDRRNITLLGDLALSNKLVLYDLENQVIGWTEYNCK 455


>gi|357166728|ref|XP_003580821.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 479

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 109/361 (30%), Positives = 161/361 (44%), Gaps = 45/361 (12%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDIS--- 57
           L+HRD++ S   +P+    H V    +   AR+AYLQ ++    +  +     +  +   
Sbjct: 61  LLHRDTV-SGTKHPSRR--HAVLALASRDTARVAYLQRRLSPSPSPSSTSSVESGGTIVS 117

Query: 58  --NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
             +  + V + IG PP+ Q    DTGS ++WV C PC DC  Q   +F P+ S+S++ VP
Sbjct: 118 HGSGEYLVRVGIGSPPLEQHLVADTGSDVIWVQCSPCSDCYAQGDPLFDPANSASFSPVP 177

Query: 116 CDSEHCRY---FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
           C+S  CR    +  + C      C Y   Y     T+  ++ E LT     E    VQ V
Sbjct: 178 CNSGVCRAAARYSSSSCGGGGGECEYKVSYGDKSYTNGVLALETLTLDGGTE----VQGV 233

Query: 173 VFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILG 227
             GCG      F + +G+ GLG G  SLV QL      +FSYC+   +  +   +  +  
Sbjct: 234 AMGCGHENRGLFAEAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLAGYYSGEGSGSGSL-- 291

Query: 228 DGAIIDEGDATPLQFI----------DGHYYITLEAISVDGRMLDIN-PNIFKRDDSGGG 276
              ++   DA P   +             YY+ +  + V G  L +        DD GGG
Sbjct: 292 ---VLGREDAAPTGAVWVPLVRNPDAPSFYYVGVNGLGVAGERLQLQDGLFDLGDDGGGG 348

Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS--YSWPDKLCYHGIMSSDLKGFPTV 334
           V++D+GT VT L  EAY ALR       E    R+   S  D  CY      DL G+ +V
Sbjct: 349 VVMDTGTAVTRLPAEAYAALRGAFAGAFEEGAPRAPGVSLFDT-CY------DLSGYASV 401

Query: 335 R 335
           R
Sbjct: 402 R 402


>gi|359478045|ref|XP_002267046.2| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
          Length = 502

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 105/408 (25%), Positives = 177/408 (43%), Gaps = 62/408 (15%)

Query: 40  IRSH-NTYQAQILPSNDI---------SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC 89
           +++H N+ Q +IL   D+         +  L+Y  I IG P    +  +DTGS ++WV+C
Sbjct: 67  LKAHDNSRQLRILAGVDLPLGGTGRPEAVGLYYAKIGIGTPARDYYVQVDTGSDIMWVNC 126

Query: 90  YPCRDCSPQ--LG---TIFYPSRSSSYAYVPCDSEHCRYF---PYARCSAYKHRCIYTQL 141
             C +C  +  LG   T++    S +   V CD + C      P + C A    C YT++
Sbjct: 127 IQCNECPKKSSLGMELTLYDIKESLTGKLVSCDQDFCYAINGGPPSYCIA-NMSCSYTEI 185

Query: 142 YLIGPETSVFVSTEQLTFKNTD---ESTIHVQDVVFGCGFSTNRNFK----FSGIFGLGI 194
           Y  G  +  +   + + +       E+T     V+FGC  + + +        GI G G 
Sbjct: 186 YADGSSSFGYFVRDIVQYDQVSGDLETTSANGSVIFGCSATQSGDLSSEEALDGILGFGK 245

Query: 195 GRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFIDGHY 247
             +S++SQL SS      F++C+      D L+   I   G I+  + + TPL     HY
Sbjct: 246 SNTSMISQLASSGKVRKMFAHCL------DGLNGGGIFAIGHIVQPKVNTTPLVPNQTHY 299

Query: 248 YITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGE 307
            + ++A+ V G  L++  ++F   D   G +IDSGT + +L +  Y+ L  ++       
Sbjct: 300 NVNMKAVEVGGYFLNLPTDVFDVGDK-KGTIIDSGTTLAYLPEVVYDQLLSKI------- 351

Query: 308 QMRSYSWPDKLCYHGI--------MSSDL-KGFPTVRFHFRGGAKLALEKDSMFYQPRPD 358
               +SW   L  H I         S  L  GFP V FHF     L +      +     
Sbjct: 352 ----FSWQSDLKVHTIHDQFTCFQYSESLDDGFPAVTFHFENSLYLKVHPHEYLFS-YDG 406

Query: 359 AFCMAVNPASINNR-YVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
            +C+    + + +R   N +L+G +A     V YD+  +   +   +C
Sbjct: 407 LWCIGWQNSGMQSRDRRNITLLGDLALSNKLVLYDLENQVIGWTEYNC 454


>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 468

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 101/371 (27%), Positives = 162/371 (43%), Gaps = 34/371 (9%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGT-----IFYPSRSSSYAYVP 115
           ++V + +G P  P     DTGS L WV C      S          +F P+ S S++ +P
Sbjct: 104 YFVRLRVGTPAQPFVLVADTGSDLTWVKCSSPSSSSSSPAASPPQRVFRPAGSKSWSPLP 163

Query: 116 CDSEHCR-YFPY--ARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFK---NTDESTIHV 169
           CDS+ C+ Y P+  A CS+    C Y   Y         V  +  T     N       +
Sbjct: 164 CDSDTCKSYVPFSLANCSSPPDPCSYDYRYKDNSSARGVVGLDSATVSLSGNDGTRKAKL 223

Query: 170 QDVVFGCGFSTN-RNFKFS-GIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNK 223
           Q+VV GC  S + ++FK S G+  LG    S  S+  S     FSYC+     P    + 
Sbjct: 224 QEVVLGCTTSYDGQSFKSSDGVLSLGNSNISFASRAASRFGGRFSYCLVDHLAPRNATSF 283

Query: 224 LILGDGAIIDEGDA----TPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSG 274
           L  G+G      D+    TPL  ++       Y+++++A++V G  L+I P+++    +G
Sbjct: 284 LTFGNGDSSPGDDSSSRRTPLVLLEDARTRPFYFVSVDAVTVAGERLEILPDVWDFRKNG 343

Query: 275 GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTV 334
           G ++ DSGT +T L   AY+A+   +  +  G    +   P + CY+    S     P +
Sbjct: 344 GAIL-DSGTSLTILATPAYDAVVKAISKQFAGVPRVNMD-PFEYCYNWTGVS--AEIPRM 399

Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
              F G A LA    S      P   C+ V    +   +   S+IG + QQ +   +D+ 
Sbjct: 400 ELRFAGAATLAPPGKSYVIDTAPGVKCIGV----VEGAWPGVSVIGNILQQEHLWEFDLA 455

Query: 395 RKQKTFQRMDC 405
            +   F++  C
Sbjct: 456 NRWLRFKQSRC 466


>gi|297720449|ref|NP_001172586.1| Os01g0776900 [Oryza sativa Japonica Group]
 gi|255673740|dbj|BAH91316.1| Os01g0776900 [Oryza sativa Japonica Group]
          Length = 381

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 86/285 (30%), Positives = 132/285 (46%), Gaps = 31/285 (10%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
           L++  + +G PP   F  +DTGS +LWV C PC  C    G       F P  SS+ + +
Sbjct: 90  LYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKI 149

Query: 115 PCDSEHCRYF---PYARCSAYKHR-CIYTQLYLIGPETSVFVSTEQLTFKNT---DESTI 167
           PC  + C        A C    +  C YT  Y  G  TS +  ++ + F      +++  
Sbjct: 150 PCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTAN 209

Query: 168 HVQDVVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNS------SFSYCIGSLHD 216
               +VFGC  S     T  +    GIFG G  + S+VSQLNS       FS+C   L  
Sbjct: 210 SSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHC---LKG 266

Query: 217 PDYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
            D     L+LG+  I++ G   TPL     HY + LE+I V+G+ L I+ ++F   ++  
Sbjct: 267 SDNGGGILVLGE--IVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNT-Q 323

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCY 320
           G ++DSGT + +L   AY+   + +   +    +RS       C+
Sbjct: 324 GTIVDSGTTLAYLADGAYDPFVNAITAAVS-PSVRSLVSKGNQCF 367


>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 478

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 111/369 (30%), Positives = 160/369 (43%), Gaps = 49/369 (13%)

Query: 55  DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR---DCSPQLGTIFYPSRSSSY 111
           DI    + V  S+G P V Q   +DTGS L WV C PC     C  Q   +F P++SSSY
Sbjct: 134 DIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFDPAQSSSY 193

Query: 112 AYVPCDSEHCRYFP-YARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
           A VPC    C     YA  +    +C Y   Y  G  T+   S++ LT   +      VQ
Sbjct: 194 AAVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASSA----VQ 249

Query: 171 DVVFGCGFSTNRNFK-FSGIFGLGIGRSSLVSQLNSS----FSYCIGSL-HDPDYLHNKL 224
              FGCG + +  F    G+ GLG  + SLV Q   +    FSYC+ +      YL   L
Sbjct: 250 GFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYL--TL 307

Query: 225 ILGDGAIIDEGDAT----PLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
            LG  +    G +T    P      +Y + L  ISV G+ L +  + F      GG ++D
Sbjct: 308 GLGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFA-----GGTVVD 362

Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS--DLKGF-----PT 333
           +GT +T L   AY ALR           M SY +P     +GI+ +  +  G+     P 
Sbjct: 363 TGTVITRLPPTAYAALRSAFR-----SGMASYGYPTAPS-NGILDTCYNFAGYGTVTLPN 416

Query: 334 VRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNV---G 390
           V   F  GA + L  D +         C+A  P+  +      +++G + Q+ + V   G
Sbjct: 417 VALTFGSGATVMLGADGIL-----SFGCLAFAPSGSDG---GMAILGNVQQRSFEVRIDG 468

Query: 391 YDIGRKQKT 399
             +G K  +
Sbjct: 469 TSVGFKPSS 477


>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
          Length = 519

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 103/357 (28%), Positives = 156/357 (43%), Gaps = 29/357 (8%)

Query: 61  FYVNISIGQPPVPQFTAM-DTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYVPCDS 118
           + V + +G P V ++T + DTGS   WV C PC   C  Q   +F P+RSS+YA V C +
Sbjct: 180 YVVTVGLGTP-VSRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTYANVSCAA 238

Query: 119 EHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
             C       CS     C+Y   Y  G  +  F + + LT  + D     V+   FGCG 
Sbjct: 239 PACSDLNIHGCSG--GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDA----VKGFRFGCGE 292

Query: 179 STNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLH-DPDYLHNKLILGDGAII 232
                F + +G+ GLG G++SL  Q        F++C+ +      YL      G  A  
Sbjct: 293 RNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTGTGYL--DFGAGSLAAA 350

Query: 233 DEGDATPLQFIDG--HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVK 290
                TP+   +G   YY+ +  I V G++L I  ++F       G ++DSGT +T L  
Sbjct: 351 SARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFAT----AGTIVDSGTVITRLPP 406

Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKL--CYHGIMSSDLKGFPTVRFHFRGGAKLALEK 348
            AY +LR      +     +       L  CY     S +   PTV   F+GGA+L ++ 
Sbjct: 407 AAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVA-IPTVSLLFQGGARLDVDA 465

Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
             + Y       C+A    + N    +  ++G    + + V YDIG+K   F    C
Sbjct: 466 SGIMYAASASQVCLAF---AANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYPGAC 519


>gi|255549236|ref|XP_002515672.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223545215|gb|EEF46724.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 492

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 96/372 (25%), Positives = 166/372 (44%), Gaps = 38/372 (10%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLG---TIFYPSRSSSYAYV 114
           L+Y  + IG P    +  +DTGS ++WV+C  CR+C  +  LG   T++    S S   V
Sbjct: 85  LYYAKVGIGTPSKDYYVQVDTGSDIMWVNCIQCRECPRTSSLGMELTLYNIKDSVSGKLV 144

Query: 115 PCDSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD---ESTIH 168
           PCD E C      P + C+A    C Y ++Y  G  T+ +   + + +       ++T  
Sbjct: 145 PCDEEFCYEVNGGPLSGCTA-NMSCPYLEIYGDGSSTAGYFVKDVVQYDRVSGDLQTTSS 203

Query: 169 VQDVVFGCGFSTNRNF------KFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHD 216
              V+FGCG   + +          GI G G   SS++SQL ++      F++C+     
Sbjct: 204 NGSVIFGCGARQSGDLGPTSEEALDGILGFGKSNSSMISQLAATRKVKKIFAHCL----- 258

Query: 217 PDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
            D ++   I   G ++  + + TPL     HY + + A+ V    L +    F+  D  G
Sbjct: 259 -DGINGGGIFAIGHVVQPKVNMTPLIPNQPHYNVNMTAVQVGEDFLHLPTEEFEAGDRKG 317

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK-LCYHGIMSSDLKGFPTV 334
            + IDSGT + +L +  YE L  +++   +   ++ +   D+  C+    S D  GFP V
Sbjct: 318 AI-IDSGTTLAYLPEIVYEPLVSKIIS--QQPDLKVHIVRDEYTCFQYSGSVD-DGFPNV 373

Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR-YVNFSLIGMMAQQFYNVGYDI 393
            FHF     L +      + P    +C+    + + +R   N +L+G +      V YD+
Sbjct: 374 TFHFENSVFLKVHPHEYLF-PFEGLWCIGWQNSGMQSRDRRNMTLLGDLVLSNKLVLYDL 432

Query: 394 GRKQKTFQRMDC 405
             +   +   +C
Sbjct: 433 ENQAIGWTEYNC 444


>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 488

 Score =  108 bits (270), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 100/375 (26%), Positives = 170/375 (45%), Gaps = 44/375 (11%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ--LG---TIFYPSRSSSYAYV 114
           L+Y  I IG PP   +  +DTGS ++WV+C  C++C  +  LG   T++    SSS   V
Sbjct: 82  LYYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRSSLGMDLTLYDIKESSSGKLV 141

Query: 115 PCDSEHCRYFP---YARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFK------NTDES 165
           PCD E C+         C+A    C Y ++Y  G  T+ +   + + +        TD +
Sbjct: 142 PCDQEFCKEINGGLLTGCTA-NISCPYLEIYGDGSSTAGYFVKDIVLYDQVSGDLKTDSA 200

Query: 166 TIHVQDVVFGCG------FSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGS 213
                 +VFGCG       S++      GI G G   SS++SQL SS      F++C+  
Sbjct: 201 N---GSIVFGCGARQSGDLSSSNEEALDGILGFGKANSSMISQLASSGKVKKMFAHCLNG 257

Query: 214 LHDPDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDD 272
           ++         I   G ++  + + TPL     HY + + A+ V    L ++ +   + D
Sbjct: 258 VNGGG------IFAIGHVVQPKVNMTPLLPDQPHYSVNMTAVQVGHTFLSLSTDTSAQGD 311

Query: 273 SGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFP 332
              G +IDSGT + +L +  YE L  +++ +    ++++    +  C+    S D  GFP
Sbjct: 312 R-KGTIIDSGTTLAYLPEGIYEPLVYKMISQHPDLKVQTLH-DEYTCFQYSESVD-DGFP 368

Query: 333 TVRFHFRGGAKLAL-EKDSMFYQPRPDAFCMAVNPASINNR-YVNFSLIGMMAQQFYNVG 390
            V F F  G  L +   D +F  P  + +C+    +   +R   N +L+G +      V 
Sbjct: 369 AVTFFFENGLSLKVYPHDYLF--PSVNFWCIGWQNSGTQSRDSKNMTLLGDLVLSNKLVF 426

Query: 391 YDIGRKQKTFQRMDC 405
           YD+  +   +   +C
Sbjct: 427 YDLENQAIGWAEYNC 441


>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
 gi|194690124|gb|ACF79146.1| unknown [Zea mays]
 gi|194708040|gb|ACF88104.1| unknown [Zea mays]
 gi|223950469|gb|ACN29318.1| unknown [Zea mays]
 gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
          Length = 500

 Score =  108 bits (270), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 106/360 (29%), Positives = 160/360 (44%), Gaps = 52/360 (14%)

Query: 78  MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYAR--------- 128
           +DT S L WV C PC  C  Q G +F PS S SYA VPCDS  C                
Sbjct: 158 VDTASELTWVQCAPCESCHDQQGPLFDPSSSPSYAAVPCDSPSCDALQQQLATGAGAGAP 217

Query: 129 -CSAYK-HRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF 186
            C A +   C Y   Y  G  +   ++ ++L+          +   VFGCG ++N+   F
Sbjct: 218 PCDAGRPAACSYALSYRDGSYSRGVLAHDRLSLAGE-----VIDGFVFGCG-TSNQGPPF 271

Query: 187 SGIFGL-GIGRSSL------VSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATP 239
            G  GL G+GRS L      V Q    FSYC+    + D     L+LGD       ++TP
Sbjct: 272 GGTSGLMGLGRSQLSLVSQTVDQFGGVFSYCLPLSRESD-ASGSLVLGDDPSAYR-NSTP 329

Query: 240 LQF----------IDGHYY-ITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
           + +          + G +Y + L  I+V G+  ++    F         ++DSGT +T L
Sbjct: 330 VVYTSMVSNSDPLLQGPFYLVNLTGITVGGQ--EVESTGFSAR-----AIVDSGTVITSL 382

Query: 289 VKEAYEALRDEVMIRL-EGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALE 347
           V   Y A+R E M +L E  Q   +S  D  C++     +++  P++   F GGA++ ++
Sbjct: 383 VPSVYNAVRAEFMSQLAEYPQAPGFSILDT-CFNMTGLKEVQ-VPSLTLVFDGGAEVEVD 440

Query: 348 KDSMFYQPRPDA--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
              + Y    D+   C+AV  AS+ +     S+IG   Q+   V +D    Q  F +  C
Sbjct: 441 SGGVLYFVSSDSSQVCLAV--ASLKSED-ETSIIGNYQQKNLRVVFDTSASQVGFAQETC 497


>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 427

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 100/383 (26%), Positives = 162/383 (42%), Gaps = 53/383 (13%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
           N    ++++IG PP      +DTGS L W+HC       P L + F P  SSSY   PC+
Sbjct: 56  NVTLTISLTIGSPPQNVTMVLDTGSELSWLHCKKL----PNLNSTFNPLLSSSYTPTPCN 111

Query: 118 SEHCR------YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
           S  C         P A C      C     Y         ++ E  +     +       
Sbjct: 112 SSVCMTRTRDLTIP-ASCDPNNKLCHVIVSYADASSAEGTLAAETFSLAGAAQP-----G 165

Query: 172 VVFGC----GFST--NRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKL 224
            +FGC    G+++  N + K +G+ G+  G  SLV+Q+    FSYCI      +     L
Sbjct: 166 TLFGCMDSAGYTSDINEDAKTTGLMGMNRGSLSLVTQMVLPKFSYCISG----EDAFGVL 221

Query: 225 ILGDG----------AIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG 274
           +LGDG           ++    ++P  F    Y + LE I V  ++L +  ++F  D +G
Sbjct: 222 LLGDGPSAPSPLQYTPLVTATTSSPY-FDRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTG 280

Query: 275 GG-VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMR------SYSWPDKLCYHGIMSSD 327
            G  M+DSGT  T+L+   Y +L+DE + + +G   R       +     LCYH   S  
Sbjct: 281 AGQTMVDSGTQFTFLLGPVYNSLKDEFLEQTKGVLTRIEDPNFVFEGAMDLCYHAPAS-- 338

Query: 328 LKGFPTVRFHFRGGAKLALEKDSMFY---QPRPDAFCMAVNPASINNRYVNFSLIGMMAQ 384
           L   P V   F  GA++ +  + + Y   + R   +C     + +    +   +IG   Q
Sbjct: 339 LAAVPAVTLVF-SGAEMRVSGERLLYRVSKGRDWVYCFTFGNSDLLG--IEAYVIGHHHQ 395

Query: 385 QFYNVGYDIGRKQKTFQRMDCEV 407
           Q   + +D+ + +  F    C++
Sbjct: 396 QNVWMEFDLVKSRVGFTETTCDL 418


>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
          Length = 462

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 101/360 (28%), Positives = 151/360 (41%), Gaps = 32/360 (8%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSYAYVPCDSE 119
           F V +  G P        DTGS + W+ C PC   C  Q   IF P++S++Y+ VPC   
Sbjct: 120 FVVTVGFGTPAQTYTLMFDTGSDVSWIQCLPCSGHCYKQHDPIFDPTKSATYSAVPCGHP 179

Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
            C      +CS+    C+Y   Y  G  T+  +S E L+      S   +    FGCG +
Sbjct: 180 QCAAA-GGKCSS-NGTCLYKVQYGDGSSTAGVLSHETLSLT----SARALPGFAFGCGET 233

Query: 180 TNRNF-KFSGIFGLGIGRSSL----VSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
              +F    G+ GLG G+ SL     +   ++FSYC+ S +     H  L +G       
Sbjct: 234 NLGDFGDVDGLIGLGRGQLSLSSQAAASFGAAFSYCLPSYNT---SHGYLTIGTTTPASG 290

Query: 235 GDAT------PLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
            D          Q     Y++ L +I V G +L + P +F RD    G ++DSGT +T+L
Sbjct: 291 SDGVRYTAMIQKQDYPSFYFVDLVSIVVGGFVLPVPPILFTRD----GTLLDSGTVLTYL 346

Query: 289 VKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEK 348
             EAY ALRD     +   +      P   CY      +    P V F F  G+   L  
Sbjct: 347 PPEAYTALRDRFKFTMTQYKPAPAYDPFDTCYD-FAGQNAIFMPLVSFKFSDGSSFDLSP 405

Query: 349 DSMFYQP---RPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
             +   P    P   C+A  P       + F+++G   Q+   + YD+  ++  F    C
Sbjct: 406 FGVLIFPDDTAPATGCLAFVP---RPSTMPFTIVGNTQQRNTEMIYDVAAEKIGFVSGSC 462


>gi|255565759|ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 447

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 121/429 (28%), Positives = 183/429 (42%), Gaps = 54/429 (12%)

Query: 13  NPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPV 72
           NP+++   ++   ++ S+AR         +H+    Q  P    S   + +++S G PP 
Sbjct: 38  NPSQDHLQKLNYLVSTSLAR---------AHHLKNPQTTPVFSHSYGGYSISLSFGTPPQ 88

Query: 73  PQFTAMDTGSSLLWVHC---YPCRDCS-PQLGTIFYPSRSSSYAYVPCDSEHCRY----- 123
                MDTGSS +W  C   Y C +CS     + F P  SSS   + C +  C +     
Sbjct: 89  TLSFVMDTGSSFVWFPCTLRYLCNNCSFTSRISPFLPKHSSSSKIIGCKNPKCSWIHQTD 148

Query: 124 FPYARCSAYKHRCI-----YTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
                C      C      Y  LY  G    V +S E L         + V + + GC  
Sbjct: 149 LRCTDCDNNSRNCSQICPPYLILYGSGTTGGVALS-ETLHLHG-----LIVPNFLVGCSV 202

Query: 179 STNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLH-DPDYLHNKLILGDGAIIDEGD 236
            ++R  + +GI G G G SSL SQL  + FSYC+ S   D     + L+L   +  D+  
Sbjct: 203 FSSR--QPAGIAGFGRGPSSLPSQLGLTKFSYCLLSHKFDDTQESSSLVLDSQSDSDKKT 260

Query: 237 A----TPL---------QFIDGHYYITLEAISVDGRMLDINPNIFKRD-DSGGGVMIDSG 282
           A    TPL              +YY++L  IS+ GR + I       D D  GG +IDSG
Sbjct: 261 AALMYTPLVKNPKVQDKPAFSVYYYVSLRRISIGGRSVKIPYKYLSPDKDGNGGTIIDSG 320

Query: 283 TDVTWLVKEAYEALRDEVMIRL---EGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFR 339
           T  T++  EA+E L +E + ++   E   M       K C++   + +L+  P +R HF+
Sbjct: 321 TTFTYMSTEAFEILSNEFISQVKNYERALMVEALSGLKPCFNVSGAKELE-LPQLRLHFK 379

Query: 340 GGA--KLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
           GGA  +L LE    F   R  A    V   +         L     Q FY V YD+  ++
Sbjct: 380 GGADVELPLENYFAFLGSREVACFTVVTDGAEKASGPGMILGNFQMQNFY-VEYDLQNER 438

Query: 398 KTFQRMDCE 406
             F++  C+
Sbjct: 439 LGFKKESCK 447


>gi|115451209|ref|NP_001049205.1| Os03g0186900 [Oryza sativa Japonica Group]
 gi|49532749|dbj|BAD26705.1| Radc1 [Oryza sativa Japonica Group]
 gi|108706569|gb|ABF94364.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113547676|dbj|BAF11119.1| Os03g0186900 [Oryza sativa Japonica Group]
 gi|215692805|dbj|BAG88249.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767626|dbj|BAG99854.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 438

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 102/402 (25%), Positives = 158/402 (39%), Gaps = 40/402 (9%)

Query: 31  ARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCY 90
           ARL +L  K  +     A +  ++  +   + V   +G P      A+DT +   W HC 
Sbjct: 51  ARLLFLSSKAATAGVSSAPV--ASGQAPPSYVVRAGLGSPSQQLLLALDTSADATWAHCS 108

Query: 91  PCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKH------------RCIY 138
           PC  C     ++F P+ SSSYA +PC S  C  F    C A +              C +
Sbjct: 109 PCGTCPSS--SLFAPANSSSYASLPCSSSWCPLFQGQACPAPQGGGDAAPPPATLPTCAF 166

Query: 139 TQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTN---RNFKFSGIFGLGIG 195
           ++ +      +   S      K+       + +  FGC  S      N    G+ GLG G
Sbjct: 167 SKPFADASFQAALASDTLRLGKDA------IPNYTFGCVSSVTGPTTNMPRQGLLGLGRG 220

Query: 196 RSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH----Y 247
             +L+SQ     N  FSYC+ S     Y    L LG G              + H    Y
Sbjct: 221 PMALLSQAGSLYNGVFSYCLPSYRS-YYFSGSLRLGAGGGQPRSVRYTPMLRNPHRSSLY 279

Query: 248 YITLEAISVDGRMLDINPNIFKRDDS-GGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEG 306
           Y+ +  +SV    + +    F  D + G G ++DSGT +T      Y ALR+E   ++  
Sbjct: 280 YVNVTGLSVGHAWVKVPAGSFAFDAATGAGTVVDSGTVITRWTAPVYAALREEFRRQVAA 339

Query: 307 EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALE-KDSMFYQPRPDAFCMAVN 365
               +       C++        G P V  H  GG  LAL  ++++ +       C+A+ 
Sbjct: 340 PSGYTSLGAFDTCFN-TDEVAAGGAPAVTVHMDGGVDLALPMENTLIHSSATPLACLAMA 398

Query: 366 PASIN-NRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
            A  N N  VN  +I  + QQ   V +D+   +  F +  C 
Sbjct: 399 EAPQNVNSVVN--VIANLQQQNIRVVFDVANSRVGFAKESCN 438


>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 631

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 101/380 (26%), Positives = 167/380 (43%), Gaps = 56/380 (14%)

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
           ++N  +   + IG PP      +D+GS++ +V C  C  C       F P  SS+Y+ V 
Sbjct: 83  LTNGYYTTRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVK 142

Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
           C+ +         C + K++C Y + Y     +S  +  + ++F    ES +  Q  VFG
Sbjct: 143 CNVD-------CTCDSDKNQCTYERQYAEMSSSSGVLGEDIVSFGT--ESELKPQRAVFG 193

Query: 176 CGFSTNRNFKFS----GIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLI 225
           C  S   +  FS    GI GLG G+ S++ QL        SFS C G +           
Sbjct: 194 CENSETGDL-FSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMD---------- 242

Query: 226 LGDGAIIDEGDATPLQFIDGH--------YYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
           +G GA++      P   I  H        Y I L+ + V G+ L ++P IF   D   G 
Sbjct: 243 IGGGAMVLGAMPAPPGMIYTHSNAVRSPYYNIELKEMHVAGKALRVDPRIF---DGKHGT 299

Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK----LCYHGI---MSSDLKG 330
           ++DSGT   +L ++A+ A +D V  ++    ++    PD     +C+ G    +S   + 
Sbjct: 300 VLDSGTTYAYLPEQAFVAFKDAVSSQV--HPLKKIRGPDSNYKDICFAGAGRNVSQLSEV 357

Query: 331 FPTVRFHFRGGAKLALEKDSMFYQPRP--DAFCMAVNPASINNRYVNFSLIGMMAQQFYN 388
           FP V   F  G KL+L  ++  ++      A+C+ V      N     +L+G +  +   
Sbjct: 358 FPKVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGV----FQNGKDPTTLLGGIVVRNTL 413

Query: 389 VGYDIGRKQKTFQRMDCEVL 408
           V YD   ++  F + +C  L
Sbjct: 414 VTYDRHNEKIGFWKTNCSEL 433


>gi|15241713|ref|NP_195839.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75181297|sp|Q9LZL3.1|PCS1L_ARATH RecName: Full=Aspartic proteinase PCS1; AltName: Full=Aspartic
           protease 38; Short=AtASP38; AltName: Full=Protein EMBRYO
           DEFECTIVE 24; AltName: Full=Protein PROMOTION OF CELL
           SURVIVAL 1; Flags: Precursor
 gi|7340693|emb|CAB82992.1| putative protein [Arabidopsis thaliana]
 gi|50897174|gb|AAT85726.1| At5g02190 [Arabidopsis thaliana]
 gi|53828617|gb|AAU94418.1| At5g02190 [Arabidopsis thaliana]
 gi|110742159|dbj|BAE99007.1| hypothetical protein [Arabidopsis thaliana]
 gi|332003059|gb|AED90442.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 453

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 112/385 (29%), Positives = 165/385 (42%), Gaps = 69/385 (17%)

Query: 70  PPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR-----YF 124
           PP      +DTGS L W+ C   R  +P     F P+RSSSY+ +PC S  CR     + 
Sbjct: 82  PPQNISMVIDTGSELSWLRCN--RSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRDFL 139

Query: 125 PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC-----GFS 179
             A C + K  C  T  Y     +   ++ E   F N+   +    +++FGC     G  
Sbjct: 140 IPASCDSDK-LCHATLSYADASSSEGNLAAEIFHFGNSTNDS----NLIFGCMGSVSGSD 194

Query: 180 TNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHD-PDYLHNKLILGDGAIIDEGD- 236
              + K +G+ G+  G  S +SQ+    FSYCI    D P +    L+LGD         
Sbjct: 195 PEEDTKTTGLLGMNRGSLSFISQMGFPKFSYCISGTDDFPGF----LLLGDSNFTWLTPL 250

Query: 237 --------ATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMIDSGTDVT 286
                   +TPL + D   Y + L  I V+G++L I  ++   D +G G  M+DSGT  T
Sbjct: 251 NYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQTMVDSGTQFT 310

Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPD-------KLCYH----GIMSSDLKGFPTVR 335
           +L+   Y ALR   + R  G  +  Y  PD        LCY      I S  L   PTV 
Sbjct: 311 FLLGPVYTALRSHFLNRTNG-ILTVYEDPDFVFQGTMDLCYRISPVRIRSGILHRLPTVS 369

Query: 336 FHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYV------NFSLIGMMA------ 383
             F  GA++A+    + Y+         V   ++ N  V      N  L+GM A      
Sbjct: 370 LVFE-GAEIAVSGQPLLYR---------VPHLTVGNDSVYCFTFGNSDLMGMEAYVIGHH 419

Query: 384 -QQFYNVGYDIGRKQKTFQRMDCEV 407
            QQ   + +D+ R +     ++C+V
Sbjct: 420 HQQNMWIEFDLQRSRIGLAPVECDV 444


>gi|212722554|ref|NP_001131154.1| uncharacterized protein LOC100192462 precursor [Zea mays]
 gi|194690728|gb|ACF79448.1| unknown [Zea mays]
          Length = 431

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 113/420 (26%), Positives = 169/420 (40%), Gaps = 35/420 (8%)

Query: 8   LSPYNN---PNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVN 64
           LS Y+N   P+ +P   +        ARL +L  K  S     +  + S     S + V 
Sbjct: 24  LSVYHNVHPPSPSPLESIIALARADDARLLFLSSKAASSGGITSAPVASGQTPPS-YVVR 82

Query: 65  ISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYF 124
             +G P      A+DT +   W HC PC  C    G+ F P+ SSSYA +PC S+ C  F
Sbjct: 83  AGLGTPVQQLLLALDTSADATWSHCAPCDTC--PAGSRFIPASSSSYASLPCASDWCPLF 140

Query: 125 PYARC------SAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
               C      SA    C +++ +    +TS   S    T +   ++   +    FGC  
Sbjct: 141 EGQPCPANQDASAPLPACAFSKPFA---DTSFQASLGSDTLRLGKDA---IAGYAFGCVG 194

Query: 179 ST---NRNFKFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAI 231
           +      N    G+ GLG G  SL+SQ     N  FSYC+ S     Y    L LG    
Sbjct: 195 AVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSYCLPSYRS-YYFSGSLRLGAAGQ 253

Query: 232 IDEGDATPLQFIDGH----YYITLEAISVDGRMLDINPNIFKRDD-SGGGVMIDSGTDVT 286
                 TPL   + H    YY+ +  +SV    + +    F  D  +G G +IDSGT +T
Sbjct: 254 PRNVRYTPL-LTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTVIT 312

Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
                 Y ALR+E   ++      +       C++        G P V  H  GG  L L
Sbjct: 313 RWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFN-TDEVAAGGAPPVTLHMDGGVDLTL 371

Query: 347 E-KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
             ++++ +       C+A+  A   N     +++  + QQ   V  D+   +  F R  C
Sbjct: 372 PMENTLIHSSATPLACLAMAEAP-QNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 430


>gi|194698750|gb|ACF83459.1| unknown [Zea mays]
 gi|194703964|gb|ACF86066.1| unknown [Zea mays]
 gi|219886221|gb|ACL53485.1| unknown [Zea mays]
 gi|219886359|gb|ACL53554.1| unknown [Zea mays]
 gi|223950085|gb|ACN29126.1| unknown [Zea mays]
 gi|414865218|tpg|DAA43775.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 431

 Score =  108 bits (269), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 113/420 (26%), Positives = 169/420 (40%), Gaps = 35/420 (8%)

Query: 8   LSPYNN---PNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVN 64
           LS Y+N   P+ +P   +        ARL +L  K  S     +  + S     S + V 
Sbjct: 24  LSVYHNVHPPSPSPLESIIALARADDARLLFLSSKAASSGGVTSAPVASGQTPPS-YVVR 82

Query: 65  ISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYF 124
             +G P      A+DT +   W HC PC  C    G+ F P+ SSSYA +PC S+ C  F
Sbjct: 83  AGLGTPVQQLLLALDTSADATWSHCAPCDTC--PAGSRFIPASSSSYASLPCASDWCPLF 140

Query: 125 PYARC------SAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
               C      SA    C +++ +    +TS   S    T +   ++   +    FGC  
Sbjct: 141 EGQPCPANQDASAPLPACAFSKPFA---DTSFQASLGSDTLRLGKDA---IAGYAFGCVG 194

Query: 179 ST---NRNFKFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAI 231
           +      N    G+ GLG G  SL+SQ     N  FSYC+ S     Y    L LG    
Sbjct: 195 AVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSYCLPSYRS-YYFSGSLRLGAAGQ 253

Query: 232 IDEGDATPLQFIDGH----YYITLEAISVDGRMLDINPNIFKRDD-SGGGVMIDSGTDVT 286
                 TPL   + H    YY+ +  +SV    + +    F  D  +G G +IDSGT +T
Sbjct: 254 PRNVRYTPL-LTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTVIT 312

Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
                 Y ALR+E   ++      +       C++        G P V  H  GG  L L
Sbjct: 313 RWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFN-TDEVAAGGAPPVTLHMDGGVDLTL 371

Query: 347 E-KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
             ++++ +       C+A+  A   N     +++  + QQ   V  D+   +  F R  C
Sbjct: 372 PMENTLIHSSATPLACLAMAEAP-QNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 430


>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 494

 Score =  108 bits (269), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 114/380 (30%), Positives = 156/380 (41%), Gaps = 43/380 (11%)

Query: 48  AQILPSND---ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIF 103
           A  LP+ D   I +  ++V + +G P        DTGS L W  C PC + C  Q   IF
Sbjct: 137 ATTLPAKDGSIIGSGNYFVTVGLGTPKKDFSLIFDTGSDLTWTQCEPCVKSCYNQKEAIF 196

Query: 104 YPSRSSSYAYVPCDSEHCRYFPYARCSAY---KHRCIYTQLYLIGPETSVFVSTEQLTFK 160
            PS+S+SYA + C S  C     A  + +      C+Y   Y     +  F   E+L+  
Sbjct: 197 NPSQSTSYANISCGSTLCDSLASATGNIFNCASSTCVYGIQYGDSSFSIGFFGKEKLSLT 256

Query: 161 NTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRS--SLVSQL----NSSFSYCIGSL 214
            TD       D  FGCG   N+         LG+GR   SLVSQ     N  FSYC   L
Sbjct: 257 ATDV----FNDFYFGCG-QNNKGLFGGAAGLLGLGRDKLSLVSQTAQRYNKIFSYC---L 308

Query: 215 HDPDYLHNKLILGDGAIIDEGDATPLQFIDG---HYYITLEAISVDGRMLDINPNIFKRD 271
                    L  G G+       TPL  I G    Y + L  ISV GR L I+P++F   
Sbjct: 309 PSSSSSTGFLTFG-GSTSKSASFTPLATISGGSSFYGLDLTGISVGGRKLAISPSVF--- 364

Query: 272 DSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL-----CYHGIMSS 326
            S  G +IDSGT +T L   AY AL          + M  Y     L     C+    + 
Sbjct: 365 -STAGTIIDSGTVITRLPPAAYSALSSTFR-----KLMSQYPAAPALSILDTCFD-FSNH 417

Query: 327 DLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQF 386
           D    P +   F GG  + ++K  +FY       C+A    + N+   + ++ G + Q+ 
Sbjct: 418 DTISVPKIGLFFSGGVVVDIDKTGIFYVNDLTQVCLAF---AGNSDASDVAIFGNVQQKT 474

Query: 387 YNVGYDIGRKQKTFQRMDCE 406
             V YD    +  F    C 
Sbjct: 475 LEVVYDGAAGRVGFAPAGCS 494


>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
          Length = 506

 Score =  108 bits (269), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 100/391 (25%), Positives = 161/391 (41%), Gaps = 57/391 (14%)

Query: 25  GINISIARLAYLQEKIRSHNTYQAQILPSNDIS----NSLFYVNISIGQPPVPQFTAMDT 80
           G NIS  R     +  R      A  LP   +       L++  I +G PP   +  +DT
Sbjct: 50  GANISALRA---HDGRRHGRLLAAADLPLGGLGLPTDTGLYFTEIKLGTPPKRYYVQVDT 106

Query: 81  GSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYVPCDSEHCRYFPYAR---CSAY 132
           GS +LWV+C  C  C  + G     T + P  SSS + V CD   C      +   C+A 
Sbjct: 107 GSDILWVNCISCSKCPRKSGLGLDLTFYDPKASSSGSTVSCDQGFCAATYGGKLPGCTA- 165

Query: 133 KHRCIYTQLYLIGPETSVFVSTEQLTFKNT---DESTIHVQDVVFGCGFST-----NRNF 184
              C Y+ +Y  G  T+ F  T+ L F       ++      + FGCG        N N 
Sbjct: 166 NVPCEYSVMYGDGSSTTGFFITDALQFDQVTGDGQTQPGNATITFGCGAQQGGDLGNSNQ 225

Query: 185 KFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
              GI G G   +S++SQL ++      F++C+      D +    I   G ++      
Sbjct: 226 ALDGILGFGQANTSMLSQLAAAGKAKKIFAHCL------DTIKGGGIFAIGNVVQPKCYF 279

Query: 239 PLQFIDG-----------------HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDS 281
              F  G                 HY + L++I V G  L +  ++F+  +   G +IDS
Sbjct: 280 VFFFAHGLLNIPLFLLVMILLSRPHYNVNLKSIDVGGTTLQLPAHVFETGEK-KGTIIDS 338

Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
           GT +T+L +  ++ + D V  +     +  ++  D LC+    S D  GFPT+ FHF   
Sbjct: 339 GTTLTYLPELVFKQVMDVVFSKH--RDIAFHNLQDFLCFQYSGSVD-DGFPTITFHFEDD 395

Query: 342 AKLALEKDSMFYQPRPDAFCMAVNPASINNR 372
             L +     F+    D +C+     ++ ++
Sbjct: 396 LALHVYPHEYFFPNGNDIYCVGFQNGALQSK 426


>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 467

 Score =  108 bits (269), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 112/434 (25%), Positives = 179/434 (41%), Gaps = 65/434 (14%)

Query: 22  VQRGINISIARLAYLQEKIRSHNTYQAQILPSNDI--SNSLFYVNISIGQPPVPQFTAMD 79
           ++R I  S  RLA +  ++   ++    ++    +  +   + V + +G P      A+D
Sbjct: 47  LRRAIQRSRDRLASIAPRLLPTSSRNKVVVAEAPVLSAGGEYLVKLGLGTPQHCFTAAID 106

Query: 80  TGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARC-----SAYKH 134
           T S L+W  C PC  C  QL  +F P  S+SYA VPC+S+ C      RC     S  + 
Sbjct: 107 TASDLIWTQCQPCVKCYKQLDPVFNPVASTSYAVVPCNSDTCDELDTHRCARDGDSDDED 166

Query: 135 RCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST--NRNFKFSGIFGL 192
            C YT  Y     T   ++ ++L   +        + VVFGC  S+      + SG+ GL
Sbjct: 167 ACQYTYSYGGNATTRGILAVDRLAIGDD-----VFRGVVFGCSSSSVGGPPPQVSGVVGL 221

Query: 193 GIGRSSLVSQLN-SSFSYCIGSLHDP-DYLHNKLILGDGAIIDEGDATPLQFI------- 243
           G G  SLVSQL+   F YC   L  P      +L+LG  A     +A+    +       
Sbjct: 222 GRGALSLVSQLSVRRFMYC---LPPPVSRSAGRLVLGADAAATVRNASERVVVPMSTGSR 278

Query: 244 -DGHYYITLEAISVDGRMLDINP----NIFKRDDSGG----------------------G 276
              +YY+ L+ IS+  R +        N      + G                      G
Sbjct: 279 YPSYYYLNLDGISIGDRAMSFRSRNRMNATTPGTAAGAPASPVSGSGDGDGSGTGPDAYG 338

Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCY---HGIMSSDLKGFPT 333
           ++ID  + +T+L +  YE + D++   +   +         LC+    G+  S +   P 
Sbjct: 339 MIIDIASTITFLEESLYEEMVDDLEEEIRLPRGSGSDLGLDLCFILPEGVPMSRVYA-PP 397

Query: 334 VRFHFRGGAKLALEKDSMFYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
           V   F  G  L L+K+ MF + R     C+ V       +    S++G   QQ   V Y+
Sbjct: 398 VSLAFE-GVWLRLDKEQMFVEDRASGMMCLMV------GKTDGVSILGNYQQQNMQVMYN 450

Query: 393 IGRKQKTFQRMDCE 406
           + R + TF +  CE
Sbjct: 451 LRRGRITFIKTACE 464


>gi|125542690|gb|EAY88829.1| hypothetical protein OsI_10302 [Oryza sativa Indica Group]
          Length = 440

 Score =  108 bits (269), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 102/401 (25%), Positives = 158/401 (39%), Gaps = 40/401 (9%)

Query: 31  ARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCY 90
           ARL +L  K  +     A +  ++  +   + V   +G P      A+DT +   W HC 
Sbjct: 53  ARLLFLSSKAATAGVSSAPV--ASGQAPPSYVVRAGLGSPSQQLLLALDTSADATWAHCS 110

Query: 91  PCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKH------------RCIY 138
           PC  C     ++F P+ SSSYA +PC S  C  F    C A +              C +
Sbjct: 111 PCGTCPSS--SLFAPANSSSYASLPCSSSWCPLFQGQACPAPQGGGDAAPPPATLPTCAF 168

Query: 139 TQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTN---RNFKFSGIFGLGIG 195
           ++ +      +   S      K+       + +  FGC  S      N    G+ GLG G
Sbjct: 169 SKPFADASFQAALASDTLRLGKDA------IPNYTFGCVSSVTGPTTNMPRQGLLGLGRG 222

Query: 196 RSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH----Y 247
             +L+SQ     N  FSYC+ S     Y    L LG G              + H    Y
Sbjct: 223 PMALLSQAGSLYNGVFSYCLPSYRS-YYFSGSLRLGAGGGQPRSVRYTPMLRNPHRSSLY 281

Query: 248 YITLEAISVDGRMLDINPNIFKRDD-SGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEG 306
           Y+ +  +SV    + +    F  D  +G G ++DSGT +T      Y ALR+E   ++  
Sbjct: 282 YVNVTGLSVGRAWVKVPAGSFAFDAATGAGTVVDSGTVITRWTAPVYAALREEFRRQVAA 341

Query: 307 EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALE-KDSMFYQPRPDAFCMAVN 365
               +       C++        G P V  H  GG  LAL  ++++ +       C+A+ 
Sbjct: 342 PSGYTSLGAFDTCFN-TDEVAAGGAPAVTVHMDGGVDLALPMENTLIHSSATPLACLAMA 400

Query: 366 PASIN-NRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
            A  N N  VN  +I  + QQ   V +D+   +  F +  C
Sbjct: 401 EAPQNVNSVVN--VIANLQQQNIRVVFDVANSRIGFAKESC 439


>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 463

 Score =  108 bits (269), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 106/372 (28%), Positives = 168/372 (45%), Gaps = 35/372 (9%)

Query: 55  DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSYAY 113
            I +  +YV I +G P       +DTGSSL W+ C PC   C  Q+  IF PS S +Y  
Sbjct: 107 SIGSGNYYVKIGLGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSTSKTYKA 166

Query: 114 VPCDSEHCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIH 168
           +PC S  C     +      CS     C+Y   Y     +  ++S + LT   ++  +  
Sbjct: 167 LPCSSSQCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYLSQDVLTLTPSEAPS-- 224

Query: 169 VQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLN----SSFSYCI---GSLHDPDYL 220
               V+GCG      F + SGI GL   + S++ QL+    ++FSYC+    S  +   L
Sbjct: 225 -SGFVYGCGQDNQGLFGRSSGIIGLANDKISMLGQLSKKYGNAFSYCLPSSFSAPNSSSL 283

Query: 221 HNKLILGDGAIIDEG-DATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
              L +G  ++       TPL   Q I   Y++ L  I+V G+ L ++ + +        
Sbjct: 284 SGFLSIGASSLTSSPYKFTPLVKNQKIPSLYFLDLTTITVAGKPLGVSASSYNVP----- 338

Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGE--QMRSYSWPDKLCYHGIMSSDLKGFPTV 334
            +IDSGT +T L    Y AL+   ++ +  +  Q   +S  D  C+ G +  ++   P +
Sbjct: 339 TIIDSGTVITRLPVAVYNALKKSFVLIMSKKYAQAPGFSILDT-CFKGSV-KEMSTVPEI 396

Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
           +  FRGGA L L+  +   +      C+A+  AS N      S+IG   QQ + V YD+ 
Sbjct: 397 QIIFRGGAGLELKAHNSLVEIEKGTTCLAI-AASSN----PISIIGNYQQQTFKVAYDVA 451

Query: 395 RKQKTFQRMDCE 406
             +  F    C+
Sbjct: 452 NFKIGFAPGGCQ 463


>gi|449445943|ref|XP_004140731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 430

 Score =  107 bits (268), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 102/382 (26%), Positives = 171/382 (44%), Gaps = 44/382 (11%)

Query: 57  SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP- 115
           S++   V++ IG PP P    +DTGS L W+ C+  +    +L  +  P  +S    +  
Sbjct: 62  SSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHD-KKVKKRLPPLPKPKTASFDPSLSS 120

Query: 116 ------CDSEHCRY----FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES 165
                 C+   C+     F           C Y+  Y  G      +  E+ TF  +   
Sbjct: 121 SFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKS--- 177

Query: 166 TIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSL--HDPDYLHN 222
            +    V+ GC  ++  N    GI G+  GR S +SQ   S FSYC+ S    +P  L  
Sbjct: 178 -LSTPPVILGCAQASTEN---RGILGMNHGRLSFISQAKISKFSYCVPSRTGSNPTGL-- 231

Query: 223 KLILGDGAIIDEGD-ATPLQFIDGH---------YYITLEAISVDGRMLDINPNIFKRDD 272
              LGD     +    T L F +           Y + ++AI + G+ L+I P  FK D 
Sbjct: 232 -FYLGDNPNSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDA 290

Query: 273 SGGG-VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS-YSWPD--KLCYHGIMSSDL 328
            G G  MIDSG+D+T+LV EAYE +++EV +RL G  M+  Y + D   +C+   +++++
Sbjct: 291 GGSGQTMIDSGSDLTYLVDEAYEKVKEEV-VRLVGAMMKKGYVYADVADMCFDAGVTAEV 349

Query: 329 -KGFPTVRFHFRGGAKLALEK-DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQF 386
            +    + F F  G ++ + + + +  +      C+ +  +      +  ++IG + QQ 
Sbjct: 350 GRRIGGISFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRS--ERLGIGSNIIGTVHQQN 407

Query: 387 YNVGYDIGRKQKTFQRMDCEVL 408
             V YD+  K+  F   +C  L
Sbjct: 408 MWVEYDLANKRVGFGGAECSRL 429


>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
          Length = 517

 Score =  107 bits (268), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 101/351 (28%), Positives = 149/351 (42%), Gaps = 27/351 (7%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYVPCDSE 119
           + V + +G P        DTGS   WV C PC   C  Q   +F P RSS+YA V C + 
Sbjct: 178 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPVRSSTYANVSCAAP 237

Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
            C       CS     C+Y   Y  G  +  F + + LT  + D     V+   FGCG  
Sbjct: 238 ACSDLNIHGCSG--GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDA----VKGFRFGCGER 291

Query: 180 TNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLH-DPDYLHNKLILGDGAIID 233
               F + +G+ GLG G++SL  Q        F++C+ +      YL      G  A   
Sbjct: 292 NEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTGTGYL--DFGAGSPAAAS 349

Query: 234 EGDATPLQFIDG--HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKE 291
               TP+   +G   YYI +  I V G++L I  ++F       G ++DSGT +T L   
Sbjct: 350 ARLTTPMLTDNGPTFYYIGMTGIRVGGQLLSIPQSVFAT----AGTIVDSGTVITRLPPP 405

Query: 292 AYEALRDEVMIRLEGEQMRSYSWPDKL--CYHGIMSSDLKGFPTVRFHFRGGAKLALEKD 349
           AY +LR      +     +       L  CY     S +   PTV   F+GGA+L ++  
Sbjct: 406 AYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVA-IPTVSLLFQGGARLDVDAS 464

Query: 350 SMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
            + Y       C+A    + N    +  ++G    + + V YDIG+K   F
Sbjct: 465 GIMYAASASQVCLAF---AANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGF 512


>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
 gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
 gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
 gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 458

 Score =  107 bits (268), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 100/370 (27%), Positives = 158/370 (42%), Gaps = 39/370 (10%)

Query: 52  PSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DCSPQLGTIFYPSRSSS 110
           P   +    +   + +G P       +DTGSSL W+ C PC   C  Q G +F P  SS+
Sbjct: 113 PGASVGVGNYVTRMGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSST 172

Query: 111 YAYVPCDSEHCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES 165
           YA V C ++ C   P A      CS+  + CIY   Y     +  ++S + ++F +T   
Sbjct: 173 YASVGCSAQQCSDLPSATLNPSACSS-SNVCIYQASYGDSSFSVGYLSKDTVSFGSTS-- 229

Query: 166 TIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYL 220
              + +  +GCG      F + +G+ GL   + SL+ QL      SF+YC+     P   
Sbjct: 230 ---LPNFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFTYCL-----PSSS 281

Query: 221 HNKLILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
            +  +        +   TP+      D  Y+I L  ++V G  L ++ + +    +    
Sbjct: 282 SSGYLSLGSYNPGQYSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLPT---- 337

Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRF 336
           +IDSGT +T L    Y AL   V   ++G  +  +YS  D  C+ G  S      P V  
Sbjct: 338 IIDSGTVITRLPTSVYSALSKAVAAAMKGTSRASAYSILDT-CFKGQASR--VSAPAVTM 394

Query: 337 HFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
            F GGA L L   ++         C+A  PA       + ++IG   QQ ++V YD+   
Sbjct: 395 SFAGGAALKLSAQNLLVDVDDSTTCLAFAPAR------SAAIIGNTQQQTFSVVYDVKSS 448

Query: 397 QKTFQRMDCE 406
           +  F    C 
Sbjct: 449 RIGFAAGGCS 458


>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 640

 Score =  107 bits (268), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 100/377 (26%), Positives = 160/377 (42%), Gaps = 54/377 (14%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
           N  +   + IG PP      +DTGS++ +V C  C  C       F P  S +Y  V C 
Sbjct: 86  NGYYTTRLWIGTPPQRFALIVDTGSTVTYVPCSTCEHCGRHQDPKFQPDLSETYQPVKCT 145

Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
                  P   C    ++C+Y + Y     +S  +  + ++F N  E  +  Q  VFGC 
Sbjct: 146 -------PDCNCDGDTNQCMYDRQYAEMSSSSGVLGEDVVSFGNLSE--LAPQRAVFGCE 196

Query: 178 FSTNRNF---KFSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLILGD 228
                +    +  GI GLG G  S++ QL      + SFS C G +           +G 
Sbjct: 197 NDETGDLYSQRADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMD----------VGG 246

Query: 229 GAIIDEGDATPLQFIDGH--------YYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
           GA+I  G + P   +  H        Y I L+ + V G+ L +NP +F   D   G ++D
Sbjct: 247 GAMILGGISPPEDMVFTHSDPDRSPYYNINLKEMHVAGKKLQLNPKVF---DGKHGTVLD 303

Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD----KLCYHGI---MSSDLKGFPT 333
           SGT   +L + A+ A +  +M   E   ++  + PD     +C+ G    +S   K FP 
Sbjct: 304 SGTTYAYLPETAFLAFKRAIM--KERNSLKQINGPDPNYKDICFTGAGIDVSQLAKSFPV 361

Query: 334 VRFHFRGGAKLALEKDSMFYQPRP--DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
           V   F  G KL+L  ++  ++      A+C+ V     +N     +L+G +  +   V Y
Sbjct: 362 VDMVFENGHKLSLSPENYLFRHSKVRGAYCLGV----FSNGRDPTTLLGGIFVRNTLVMY 417

Query: 392 DIGRKQKTFQRMDCEVL 408
           D    +  F + +C  L
Sbjct: 418 DRENSKIGFWKTNCSEL 434


>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 631

 Score =  107 bits (268), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 101/380 (26%), Positives = 167/380 (43%), Gaps = 56/380 (14%)

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
           ++N  +   + IG PP      +D+GS++ +V C  C  C       F P  SS+Y+ V 
Sbjct: 83  LTNGYYTTRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVK 142

Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
           C+ +         C + K++C Y + Y     +S  +  + ++F    ES +  Q  VFG
Sbjct: 143 CNVD-------CTCDSDKNQCTYERQYAEMSSSSGVLGEDIVSFGT--ESELKPQRAVFG 193

Query: 176 CGFSTNRNFKFS----GIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLI 225
           C  S   +  FS    GI GLG G+ S++ QL        SFS C G +           
Sbjct: 194 CENSETGDL-FSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMD---------- 242

Query: 226 LGDGAIIDEGDATPLQFIDGH--------YYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
           +G GA++      P   I  H        Y I L+ + V G+ L ++P IF   D   G 
Sbjct: 243 IGGGAMVLGAMPAPPGMIYTHSNAVRSPYYNIELKEMHVAGKALRVDPRIF---DGKHGT 299

Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK----LCYHGI---MSSDLKG 330
           ++DSGT   +L ++A+ A +D V  ++    ++    PD     +C+ G    +S   + 
Sbjct: 300 VLDSGTTYAYLPEQAFVAFKDAVSSQV--HPLKKIRGPDPNYKDICFAGAGRNVSQLSEV 357

Query: 331 FPTVRFHFRGGAKLALEKDSMFYQPRP--DAFCMAVNPASINNRYVNFSLIGMMAQQFYN 388
           FP V   F  G KL+L  ++  ++      A+C+ V      N     +L+G +  +   
Sbjct: 358 FPKVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGV----FQNGKDPTTLLGGIVVRNTL 413

Query: 389 VGYDIGRKQKTFQRMDCEVL 408
           V YD   ++  F + +C  L
Sbjct: 414 VTYDRHNEKIGFWKTNCSEL 433


>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
 gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
          Length = 510

 Score =  107 bits (268), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 114/411 (27%), Positives = 176/411 (42%), Gaps = 41/411 (9%)

Query: 23  QRGINISIARLAYLQEKIRSHNTYQAQILPSN-DISNSLFYVNISIGQPPVPQFTAMDTG 81
           +R     +AR+       R+ +      + S   + +  + +++ +G PP      MDTG
Sbjct: 110 RRAARSGVARMPASSSPRRALSERMVATVESGVAVGSGEYLIDVYVGTPPRRFRMIMDTG 169

Query: 82  SSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYF-----PYARCSAYKHRC 136
           S L W+ C PC DC  Q G +F P+ SSSY  V C  + C        P A     +  C
Sbjct: 170 SDLNWLQCAPCLDCFEQRGPVFDPAASSSYRNVTCGDQRCGLVAPPEAPRACRRPAEDSC 229

Query: 137 IYTQLYLIGPETSVFVSTEQLTFKNTDE-STIHVQDVVFGCGFSTNRNF-KFSGIFGLGI 194
            Y   Y     T+  ++ E  T   T   ++  V  VVFGCG      F   +G+ GLG 
Sbjct: 230 PYYYWYGDQSNTTGDLALESFTVNLTAPGASRRVDGVVFGCGHRNRGLFHGAAGLLGLGR 289

Query: 195 GRSSLVSQLNS----SFSYCIGSLHDPD------YLHNKLILGDGAIIDEGDATPLQFID 244
           G  S  SQL +    +FSYC+   H  D      +  + L+L    +     A      D
Sbjct: 290 GPLSFASQLRAVYGHTFSYCLVE-HGSDAGSKVVFGEDYLVLAHPQLKYTAFAPTSSPAD 348

Query: 245 GHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRD---EV 300
             YY+ L+ + V G +L+I+ + +    D  GG +IDSGT +++ V+ AY+ +R    ++
Sbjct: 349 TFYYVKLKGVLVGGDLLNISSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRQAFVDL 408

Query: 301 MIRLEGEQMRSYSW-PD----KLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQP 355
           M RL       Y   PD      CY+ +   +    P +   F  GA      ++ F + 
Sbjct: 409 MSRL-------YPLIPDFPVLNPCYN-VSGVERPEVPELSLLFADGAVWDFPAENYFVRL 460

Query: 356 RPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
            PD   C+AV            S+IG   QQ ++V YD+   +  F    C
Sbjct: 461 DPDGIMCLAVR----GTPRTGMSIIGNFQQQNFHVVYDLQNNRLGFAPRRC 507


>gi|224128838|ref|XP_002320434.1| predicted protein [Populus trichocarpa]
 gi|222861207|gb|EEE98749.1| predicted protein [Populus trichocarpa]
          Length = 485

 Score =  107 bits (268), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 94/372 (25%), Positives = 167/372 (44%), Gaps = 38/372 (10%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLG---TIFYPSRSSSYAYV 114
           L+Y  I IG P    +  +DTGS ++WV+C  CR+C  +  LG   T++  + S +   V
Sbjct: 77  LYYAKIGIGTPTKDYYVQVDTGSDIMWVNCIQCRECPKTSSLGIDLTLYNINESDTGKLV 136

Query: 115 PCDSEHCRYFPYAR---CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD---ESTIH 168
           PCD E C      +   C+A    C Y ++Y  G  T+ +   + + +       ++T  
Sbjct: 137 PCDQEFCYEINGGQLPGCTA-NMSCPYLEIYGDGSSTAGYFVKDVVQYARVSGDLKTTAA 195

Query: 169 VQDVVFGCG------FSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHD 216
              V+FGCG        ++      GI G G   SS++SQL  +      F++C+     
Sbjct: 196 NGSVIFGCGARQSGDLGSSNEEALDGILGFGKSNSSMISQLAVTGKVKKIFAHCL----- 250

Query: 217 PDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
            D  +   I   G ++  + + TPL     HY + + A+ V    L +  ++F+  D  G
Sbjct: 251 -DGTNGGGIFVIGHVVQPKVNMTPLIPNQPHYNVNMTAVQVGHEFLSLPTDVFEAGDRKG 309

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK-LCYHGIMSSDLKGFPTV 334
            + IDSGT + +L +  Y+ L  +++   +   ++ ++  D+  C+    S D  GFP V
Sbjct: 310 AI-IDSGTTLAYLPEMVYKPLVSKIIS--QQPDLKVHTVRDEYTCFQYSDSLD-DGFPNV 365

Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR-YVNFSLIGMMAQQFYNVGYDI 393
            FHF     L +      + P    +C+    + + +R   N +L+G +      V YD+
Sbjct: 366 TFHFENSVILKVYPHEYLF-PFEGLWCIGWQNSGVQSRDRRNMTLLGDLVLSNKLVLYDL 424

Query: 394 GRKQKTFQRMDC 405
             +   +   +C
Sbjct: 425 ENQAIGWTEYNC 436


>gi|222637181|gb|EEE67313.1| hypothetical protein OsJ_24553 [Oryza sativa Japonica Group]
          Length = 414

 Score =  107 bits (268), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 95/339 (28%), Positives = 153/339 (45%), Gaps = 39/339 (11%)

Query: 94  DCSPQLGTIFYPSRSSSYAYVPCDSEHCRYF--PYARCSAYKHRCIYTQLYLIGPETSVF 151
           +C+ +    F P+ SS+++ +PC S  C++   PY  C+A    C+Y   Y +G  T+ +
Sbjct: 87  ECAARPAPPFQPASSSTFSKLPCASSLCQFLTSPYLTCNATG--CVYYYPYGMG-FTAGY 143

Query: 152 VSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYC 210
           ++TE L              V FGC          SGI GLG    SLVSQ+    FSYC
Sbjct: 144 LATETLHVGGAS-----FPGVAFGCSTENGVGNSSSGIVGLGRSPLSLVSQVGVGRFSYC 198

Query: 211 IGSLHDPDYLHNKLILGDGAIIDEGDATPLQFID------GHYYITLEAISVDGRMLDIN 264
           + S  D D   + ++ G  A +  G ++P    +       +YY+ L  I+V    L + 
Sbjct: 199 LRS--DADAGDSPILFGSLAKVTGGKSSPAILENPEMPSSSYYYVNLTGITVGATDLPVT 256

Query: 265 PNIF---KRDDSG--GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK-- 317
              F   +   +G  GG ++DSGT +T+LVKE Y  ++   + ++    + +     +  
Sbjct: 257 STTFGFTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFG 316

Query: 318 --LCYHGIMSSDLKG--FPTVRFHFRGGAKLALEKDSMF------YQPRPDAFCMAVNPA 367
             LC+    +    G   PT+   F GGA+ A+ + S         Q R    C+ V PA
Sbjct: 317 FDLCFDANAAGGGSGVPVPTLVLRFAGGAEYAVRRRSYVGVVEVDSQGRAAVECLLVLPA 376

Query: 368 SINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
           S     ++ S+IG + Q   +V YD+     +F   DC 
Sbjct: 377 S---EKLSISIIGNVMQMDLHVLYDLDGGMFSFAPADCA 412


>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score =  107 bits (268), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 107/395 (27%), Positives = 165/395 (41%), Gaps = 43/395 (10%)

Query: 15  NENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQ 74
           +E     +QR ++   AR A    +  +  +  A +  +  +    + V +S+G P V Q
Sbjct: 97  DEQRVEYIQRRVSGGGARGAKGALQQLATGSRSATVPTTMGVGTFQYVVTVSLGTPGVSQ 156

Query: 75  FTAMDTGSSLLWVHCYPCR--DCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAY 132
              +DTGS + WV C PC    C+ Q   +F P++SS+Y+ VPC ++ C           
Sbjct: 157 TVEVDTGSDVSWVQCKPCSAPACNSQRDQLFDPAKSSTYSAVPCGADACSELRIYEAGCS 216

Query: 133 KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFK-FSGIFG 191
             +C Y   Y  G  T+    ++ L     +     V   +FGCG +    F    G+  
Sbjct: 217 GSQCGYVVSYGDGSNTTGVYGSDTLALAPGNT----VGTFLFGCGHAQAGMFAGIDGLLA 272

Query: 192 LGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFID--- 244
           LG    SL SQ   +    FSYC+ S          L LG G     G AT         
Sbjct: 273 LGRQSMSLKSQAAGAYGGVFSYCLPSKQS---AAGYLTLG-GPSSASGFATTGLLTAWAA 328

Query: 245 -GHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEV--M 301
              Y + L  ISV G+ + +  + F      GG ++D+GT +T L   AY ALR      
Sbjct: 329 PTFYMVMLTGISVGGQQVAVPASAFA-----GGTVVDTGTVITRLPPTAYAALRSAFRGA 383

Query: 302 IRLEGEQMRSYSWPDKLCY----HGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP 357
           I   G      +     CY    +G+++      PTV   F GGA LALE   +      
Sbjct: 384 IAPCGYPSAPANGILDTCYDFSRYGVVT-----LPTVALTFSGGATLALEAPGIL----- 433

Query: 358 DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
            + C+A  P   N    + +++G + Q+ + V +D
Sbjct: 434 SSGCLAFAP---NGGDGDAAILGNVQQRSFAVRFD 465


>gi|449485448|ref|XP_004157171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 430

 Score =  107 bits (268), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 101/382 (26%), Positives = 171/382 (44%), Gaps = 44/382 (11%)

Query: 57  SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP- 115
           S++   V++ IG PP P    +DTGS L W+ C+  +    +L  +  P  +S    +  
Sbjct: 62  SSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHD-KKIKKRLPPLPKPKTTSFDPSLSS 120

Query: 116 ------CDSEHCRY----FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES 165
                 C+   C+     F           C Y+  Y  G      +  E+ TF  +   
Sbjct: 121 SFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKS--- 177

Query: 166 TIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSL--HDPDYLHN 222
            +    V+ GC  ++  N    GI G+  GR S +SQ   S FSYC+ S    +P  L  
Sbjct: 178 -LSTPPVILGCAQASTEN---RGILGMNRGRLSFISQAKISKFSYCVPSRTGSNPTGL-- 231

Query: 223 KLILGDGAIIDEGD-ATPLQFIDGH---------YYITLEAISVDGRMLDINPNIFKRDD 272
              LGD     +    T L F +           Y + ++AI + G+ L++ P  FK D 
Sbjct: 232 -FYLGDNPNSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNVPPAAFKPDA 290

Query: 273 SGGG-VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS-YSWPD--KLCYHGIMSSDL 328
            G G  MIDSG+D+T+LV EAYE +++EV +RL G  M+  Y + D   +C+   +++++
Sbjct: 291 GGSGQTMIDSGSDLTYLVDEAYEKVKEEV-VRLVGAMMKKGYVYADVADMCFDAGVTAEV 349

Query: 329 -KGFPTVRFHFRGGAKLALEK-DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQF 386
            +    + F F  G ++ + + + +  +      C+ +  +      +  ++IG + QQ 
Sbjct: 350 GRRIGGISFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRS--ERLGIGSNIIGTVHQQN 407

Query: 387 YNVGYDIGRKQKTFQRMDCEVL 408
             V YD+  K+  F   +C  L
Sbjct: 408 MWVEYDLANKRVGFGGAECSRL 429


>gi|326521034|dbj|BAJ92880.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 448

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 107/390 (27%), Positives = 162/390 (41%), Gaps = 33/390 (8%)

Query: 31  ARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCY 90
           +RL YL     +   Y         +    + V   +G PP     A+DT +   W+ C 
Sbjct: 78  SRLLYLDSLAVAGRAYAPIASGRQLLQTPTYVVRARLGTPPQQLLLAVDTSNDAAWIPCS 137

Query: 91  PCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSV 150
            C  C     T F P+ S SY  VPC S  C   P   CS     C ++  Y    ++S+
Sbjct: 138 GCAGC--PTTTPFNPAASKSYRAVPCGSPACSRAPNPSCSLNTKSCGFSLTYA---DSSL 192

Query: 151 FVSTEQLTFKNTDESTIHVQDVVFGC-GFSTNRNFKFSGIFGLGIGRSSLVSQLN----S 205
             +  Q +    ++    V+   FGC   +T       G+ GLG G  S +SQ       
Sbjct: 193 EAALSQDSLAVANDV---VKSYTFGCLQKATGTATPPQGLLGLGRGPLSFLSQTKDMYEG 249

Query: 206 SFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH----YYITLEAISVDGRML 261
           +FSYC+ S    ++    L LG          TPL  ++ H    YY+++  I V  +++
Sbjct: 250 TFSYCLPSFKSLNF-SGTLRLGRKGQPLRIKTTPL-LVNPHRSSLYYVSMTGIRVGKKVV 307

Query: 262 DINPNIFKRDD-SGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCY 320
            I P     D  +G G ++DSGT  T LV  AY A+RDEV  R+ G  + S    D  CY
Sbjct: 308 PIPPAALAFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEVRRRIRGAPLSSLGGFDT-CY 366

Query: 321 HGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQP---RPDAFCMAVNPASINNRYVNFS 377
           +  +      +P V F F  G ++ L  D++             MA  P  +N      +
Sbjct: 367 NTTVK-----WPPVTFMFT-GMQVTLPADNLVIHSTYGTTSCLAMAAAPDGVNTV---LN 417

Query: 378 LIGMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
           +I  M QQ + + +D+   +  F R  C  
Sbjct: 418 VIASMQQQNHRILFDVPNGRVGFAREQCTA 447


>gi|356540369|ref|XP_003538662.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
           At2g35615-like [Glycine max]
          Length = 364

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 107/362 (29%), Positives = 168/362 (46%), Gaps = 42/362 (11%)

Query: 57  SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPC 116
           +N  + + +++G PPV  +  +DT S L+W  C PC+ C  Q   +F P +         
Sbjct: 27  NNGDYLMKLTLGTPPVDVYGLVDTDSDLVWAQCTPCQGCYKQKNPMFDPLK--------- 77

Query: 117 DSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
               C  F    CS  K  C Y   Y     T   ++ E  TF +TD   I V+ ++FGC
Sbjct: 78  ---ECNSFFDHSCSPEK-ACDYVYAYADDSATKGMLAKEIATFSSTDGKPI-VESIIFGC 132

Query: 177 GFSTNRNFKFSGI--FGLGIGRSSLVSQL-----NSSFSYCIGSLHDPDYLHNKLILGDG 229
           G +    F  + +   GLG G  SLVSQ+     +  FS C+   H   +    + LG+ 
Sbjct: 133 GHNNTGVFNENDMGLIGLGGGPLSLVSQMGNLYGSKRFSQCLVPFHADPHTSGTISLGEA 192

Query: 230 A-IIDEG-DATPLQFIDGH--YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDV 285
           + +  EG   TPL   +G   Y +TLE ISV    +  N +      S G +MIDSGT  
Sbjct: 193 SDVSGEGVVTTPLVSEEGQTPYLVTLEGISVGDTFVPFNSSEML---SKGNIMIDSGTPE 249

Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPD---KLCYHGIMSSDLKGFPTVRFHFRGGA 342
           T+L +E Y+ L +E+ +++    +  +  PD   +LCY     ++L+G P +  HF  GA
Sbjct: 250 TYLPQEFYDRLVEELKVQINLPPI--HVDPDLGTQLCYKS--ETNLEG-PILTAHFE-GA 303

Query: 343 KLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
            + L     F  P+   FC A+   + +  Y+     G  AQ    +G+D+ ++   F+ 
Sbjct: 304 DVKLLPLQTFIPPKDGVFCFAMT-GTTDGLYI----FGNFAQSNVLIGFDLDKRIVFFKP 358

Query: 403 MD 404
            D
Sbjct: 359 TD 360


>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 439

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 113/426 (26%), Positives = 171/426 (40%), Gaps = 43/426 (10%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISI-------ARLAYLQEKIRSHNTYQAQILPS 53
           +IH     SP+N       H+    +N  I       AR+ YL   + S       I   
Sbjct: 37  VIHVYGQCSPFNQ------HKAGSWVNTVINMASKDPARVTYLSSLVASPKATSVPIASG 90

Query: 54  NDISN-SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYA 112
             + N   + V + +G P    F  +DT     WV C  C  CS      F P+ SS+YA
Sbjct: 91  QQVLNIGNYVVRVKLGTPGQLMFMVLDTSRDAAWVPCADCAGCS---SPTFSPNTSSTYA 147

Query: 113 YVPCDSEHCRYFPYARC-SAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
            + C    C       C +     C + Q Y      S  +S + L       +   +  
Sbjct: 148 SLQCSVPQCTQVRGLSCPTTGTAACFFNQTYGGDSSFSAMLSQDSLGL-----AVDTLPS 202

Query: 172 VVFGC-GFSTNRNFKFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLIL 226
             FGC    +       G+ GLG G  SL+SQ  S     FSYC  S     Y    L L
Sbjct: 203 YSFGCVNAVSGSTLPPQGLLGLGRGPMSLLSQSGSLYSGVFSYCFPSFKS-YYFSGSLRL 261

Query: 227 GDGAIIDEGDATPLQFIDGH----YYITLEAISVDGRMLDINPNIFKRD-DSGGGVMIDS 281
           G          TPL   + H    YY+ L  +SV   ++ + P +   D ++G G +IDS
Sbjct: 262 GPLGQPKNIRTTPL-LRNPHRPTLYYVNLTGVSVGRVLVPVAPELLAFDPNTGAGTIIDS 320

Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRG- 340
           GT +T  V+  Y A+RDE   +++G      ++    C+    + D+   P V FHF G 
Sbjct: 321 GTVITRFVEPVYAAIRDEFRKQVKGPFATIGAF--DTCFAAT-NEDIA--PPVTFHFTGM 375

Query: 341 GAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
             KL LE +++ +       C+A+  A+ NN     ++I  + QQ   + +D+   +   
Sbjct: 376 DLKLPLE-NTLIHSSAGSLACLAMA-AAPNNVNSVLNVIANLQQQNLRIMFDVTNSRLGI 433

Query: 401 QRMDCE 406
            R  C 
Sbjct: 434 ARELCN 439


>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
 gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
          Length = 451

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 102/373 (27%), Positives = 165/373 (44%), Gaps = 37/373 (9%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
           L+Y  + +G PP   +  +DTGS +LWV C  C  C    G       F P  S + + +
Sbjct: 89  LYYTRLQLGSPPRDFYVQIDTGSDVLWVSCSSCNGCPVSSGLHIPLNFFDPGSSPTASLI 148

Query: 115 PCDSEHCRYFPYAR---CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
            C  + C     +    C+A  ++C YT  Y  G  TS +  ++ L F      ++    
Sbjct: 149 SCSDQRCSLGLQSSDSVCAAQNNQCGYTFQYGDGSGTSGYYVSDLLHFDTILGGSVMKNS 208

Query: 172 ---VVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDP 217
              +VFGC     G  T  +    GIFG G    S++SQL S       FS+C   L   
Sbjct: 209 SAPIVFGCSTLQTGDLTKPDRAVDGIFGFGQQDMSVISQLASQGITPRVFSHC---LKGD 265

Query: 218 DYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
           D     L+LG+  I++     TPL     HY + L++I V+G+ L I+P++F    S  G
Sbjct: 266 DSGGGILVLGE--IVEPNIVYTPLVPSQPHYNLNLQSIYVNGQTLAIDPSVFAT-SSNQG 322

Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG-FPTVR 335
            +IDSGT + +L + AY+     +   +    +  Y      CY  + SS +   FP V 
Sbjct: 323 TIIDSGTTLAYLTEAAYDPFISAITSTVS-PSVSPYLSKGNQCY--LTSSSINDVFPQVS 379

Query: 336 FHFRGGAKLAL-EKDSMFYQPRPDAFCM-AVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
            +F GG  + L  +D +  Q   +   +  V    I  + +  +++G +  +     YDI
Sbjct: 380 LNFAGGTSMILIPQDYLIQQSSINGAALWCVGFQKIQGQEI--TILGDLVLKDKIFVYDI 437

Query: 394 GRKQKTFQRMDCE 406
             ++  +   DC+
Sbjct: 438 AGQRIGWANYDCK 450


>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
 gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
 gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 488

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 100/397 (25%), Positives = 165/397 (41%), Gaps = 42/397 (10%)

Query: 40  IRSHNTYQ-AQILPSNDI---------SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC 89
           +R+H+ ++ +++L + DI         S  L++  I +G P       +DTGS +LWV+C
Sbjct: 54  LRAHDVHRHSRLLSAIDIPLGGDSQPESIGLYFAKIGLGTPSRDFHVQVDTGSDILWVNC 113

Query: 90  YPCRDCSPQLGTI----FYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIG 145
             C  C  +   +    +    SS+   V C    C Y            C Y  +Y  G
Sbjct: 114 AGCIRCPRKSDLVELTPYDVDASSTAKSVSCSDNFCSYVNQRSECHSGSTCQYVIMYGDG 173

Query: 146 PETSVFVSTE----QLTFKNTDESTIHVQDVVFGCGFSTNRNFKFS-----GIFGLGIGR 196
             T+ ++  +     L   N    + +   ++FGCG   +     S     GI G G   
Sbjct: 174 SSTNGYLVKDVVHLDLVTGNRQTGSTN-GTIIFGCGSKQSGQLGESQAAVDGIMGFGQSN 232

Query: 197 SSLVSQLNS------SFSYCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFIDGHYYI 249
           SS +SQL S      SF++C+      D  +   I   G ++  +   TP+     HY +
Sbjct: 233 SSFISQLASQGKVKRSFAHCL------DNNNGGGIFAIGEVVSPKVKTTPMLSKSAHYSV 286

Query: 250 TLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQM 309
            L AI V   +L+++ N F   D   GV+IDSGT + +L    Y  L +E++       +
Sbjct: 287 NLNAIEVGNSVLELSSNAFDSGDD-KGVIIDSGTTLVYLPDAVYNPLLNEILASHPELTL 345

Query: 310 RSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASI 369
            +       C+H   +  L  FPTV F F     LA+      +Q R D +C       +
Sbjct: 346 HTVQ-ESFTCFH--YTDKLDRFPTVTFQFDKSVSLAVYPREYLFQVREDTWCFGWQNGGL 402

Query: 370 NNR-YVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
             +   + +++G MA     V YDI  +   +   +C
Sbjct: 403 QTKGGASLTILGDMALSNKLVVYDIENQVIGWTNHNC 439


>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
 gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 95/313 (30%), Positives = 145/313 (46%), Gaps = 31/313 (9%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
           L++  + +G PP      +DTGS +LWV C  C +C    G       F  S SS+   V
Sbjct: 65  LYFTKVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGQV 124

Query: 115 PCDSEHCR---YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT-DESTIHVQ 170
            C    C         +CS+   +C YT  Y  G  TS +  ++ L F     +S I   
Sbjct: 125 RCSDPICTSAVQTTATQCSSQTDQCSYTFQYGDGSGTSGYYVSDTLYFDAILGQSLIDNS 184

Query: 171 D--VVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDP 217
              +VFGC     G  T  +    GIFG G G  S++SQL++       FS+C   L   
Sbjct: 185 SALIVFGCSAYQSGDLTKTDKAVDGIFGFGQGELSVISQLSTRGITPRVFSHC---LKGD 241

Query: 218 DYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
                 L+LG+  I++ G   +PL     HY + L +I+V+G++L I+P  F   +S  G
Sbjct: 242 GSGGGILVLGE--ILEPGIVYSPLVPSQPHYNLNLLSIAVNGQLLPIDPAAFATSNS-QG 298

Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRF 336
            ++DSGT + +LV EAY+     V   +    +   +     CY  + +S  + FP   F
Sbjct: 299 TIVDSGTTLAYLVAEAYDPFVSAVN-AIVSPSVTPITSKGNQCYL-VSTSVSQMFPLASF 356

Query: 337 HFRGGAKLALEKD 349
           +F GGA + L+ +
Sbjct: 357 NFAGGASMVLKPE 369


>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
 gi|241946781|gb|EES19926.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
          Length = 466

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 102/376 (27%), Positives = 157/376 (41%), Gaps = 39/376 (10%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHC------YPCRDCSPQLGTIFYPSRSSSYAYV 114
           ++V   +G P  P     DTGS L WV C            SP    +F  + S S+A +
Sbjct: 101 YFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAGAAAGTGAGSP--ARVFRTAASKSWAPI 158

Query: 115 PCDSEHC-RYFPY--ARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFK----------- 160
            C S+ C  Y P+  A CS+    C Y   Y  G      V T+  T             
Sbjct: 159 ACSSDTCTSYVPFSLANCSSPASPCAYDYRYRDGSAARGVVGTDSATIALSSGSGRGGGD 218

Query: 161 NTDESTIHVQDVVFGCGFSTN-RNFKFS-GIFGLGIGRSSLVS----QLNSSFSYCIGSL 214
           ++      +Q VV GC  + + ++F+ S G+  LG    S  S    +    FSYC+   
Sbjct: 219 SSGGRRAKLQGVVLGCAATYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDH 278

Query: 215 HDPDYLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRD 271
             P    + L  G GA       TPL   + +   Y +T++A+ V G  LDI  +++  D
Sbjct: 279 LAPRNATSYLTFGPGATAPAAQ-TPLLLDRRMTPFYAVTVDAVYVAGEALDIPADVWDVD 337

Query: 272 DSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF 331
            +GG ++ DSGT +T L   AY A+   +   L G    +   P + CY+   +  L+  
Sbjct: 338 RNGGAIL-DSGTSLTILATPAYRAVVTALSKHLAGLPRVTMD-PFEYCYNWTDAGALE-I 394

Query: 332 PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
           P +  HF G A+L     S      P   C+ V   S    +   S+IG + QQ +   +
Sbjct: 395 PKMEVHFAGSARLEPPAKSYVIDAAPGVKCIGVQEGS----WPGVSVIGNILQQEHLWEF 450

Query: 392 DIGRKQKTFQRMDCEV 407
           D+  +   F+   C +
Sbjct: 451 DLRDRWLRFKHTRCAL 466


>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
 gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
          Length = 517

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 112/380 (29%), Positives = 174/380 (45%), Gaps = 41/380 (10%)

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
           + +  + +++ +G PP      MDTGS L W+ C PC DC  Q+G +F P+ SSSY  V 
Sbjct: 146 VGSGEYLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFDQVGPVFDPAASSSYRNVT 205

Query: 116 CDSEHCRYF-----PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDE-STIHV 169
           C  + C        P A     +  C Y   Y     T+  ++ E  T   T   ++  V
Sbjct: 206 CGDQRCGLVAPPEPPRACRRPGEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRV 265

Query: 170 QDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKL 224
            DVVFGCG      F   +G+ GLG G  S  SQL +    +FSYC+   H  D + +K+
Sbjct: 266 DDVVFGCGHWNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHTFSYCLVD-HGSD-VASKV 323

Query: 225 ILGDGAIIDEGDATP-LQFI---------DGHYYITLEAISVDGRMLDINPNIF---KRD 271
           + G+   +    A P L +          D  YY+ L+ + V G +L+I+ + +   + +
Sbjct: 324 VFGEDDALALAAAHPQLNYTAFAPASSPADTFYYVKLKGVLVGGELLNISSDTWGVGEGE 383

Query: 272 DSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSW-PD----KLCYHGIMSS 326
              GG +IDSGT +++ V+ AY+ +R   + R+     RSY   PD      CY+ +   
Sbjct: 384 GGSGGTIIDSGTTLSYFVEPAYQVIRQAFIDRMG----RSYPLIPDFPVLSPCYN-VSGV 438

Query: 327 DLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQ 385
           D    P +   F  GA      ++ F +  PD   C+AV    +       S+IG   QQ
Sbjct: 439 DRPEVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAV----LGTPRTGMSIIGNFQQQ 494

Query: 386 FYNVGYDIGRKQKTFQRMDC 405
            ++V YD+   +  F    C
Sbjct: 495 NFHVVYDLKNNRLGFAPRRC 514


>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 463

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 115/427 (26%), Positives = 174/427 (40%), Gaps = 53/427 (12%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQ---AQILPSND-- 55
           ++H+    S  N  N N  + V+  +    +R+  +  K+  H+  +   A  LP+    
Sbjct: 69  VVHKHGPCSQLNQQNGNAPNLVEILLE-DQSRVDSIHAKLSDHSGVKETDAAKLPTKSGM 127

Query: 56  -ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYV 114
            +    + V+I +G P        DTGS L W  C             F P++S+SYA V
Sbjct: 128 SLGTGNYIVSIGLGSPKKDLMLIFDTGSDLTWARCSAAE--------TFDPTKSTSYANV 179

Query: 115 PCDSEHCRYFPYA-----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHV 169
            C +  C     A     RC+A    C+Y   Y  G  +  F+  E+LT  +TD      
Sbjct: 180 SCSTPLCSSVISATGNPSRCAA--STCVYGIQYGDGSYSIGFLGKERLTIGSTD----IF 233

Query: 170 QDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKL 224
            +  FGCG   +  F K +G+ GLG  + S+VSQ     N  FSYC+ S     +L    
Sbjct: 234 NNFYFGCGQDVDGLFGKAAGLLGLGRDKLSVVSQTAPKYNQLFSYCLPSSSSTGFLSFGS 293

Query: 225 ILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
                A      + P  F    Y + L  I+V G+ L I  ++F    S  G +IDSGT 
Sbjct: 294 SQSKSAKFTPLSSGPSSF----YNLDLTGITVGGQKLAIPLSVF----STAGTIIDSGTV 345

Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL-----CYHGIMSSDLKGFPTVRFHFR 339
           VT L   AY ALR         + M SY     L     CY       +K  P +   F 
Sbjct: 346 VTRLPPAAYSALRSAFR-----KAMASYPMGKPLSILDTCYDFSKYKTIK-VPKIVISFS 399

Query: 340 GGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKT 399
           GG  + +++  +F        C+A    + N    + ++ G   Q+ + V YD+   +  
Sbjct: 400 GGVDVDVDQAGIFVANGLKQVCLAF---AGNTGARDTAIFGNTQQRNFEVVYDVSGGKVG 456

Query: 400 FQRMDCE 406
           F    C 
Sbjct: 457 FAPASCS 463


>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
 gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
          Length = 444

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 103/383 (26%), Positives = 165/383 (43%), Gaps = 48/383 (12%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
           N     +++IG PP      +DTGS L W+ C       P   +IF P  S +Y  +PC 
Sbjct: 64  NVTLTASLTIGTPPQNITMVLDTGSELSWLRCKK----EPNFTSIFNPLASKTYTKIPCS 119

Query: 118 SEHCRYFPYARCS--AYKHRCIYTQL-YLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
           S+ C+     R S       C   +L + I            L F+     ++     VF
Sbjct: 120 SQTCK----TRTSDLTLPVTCDPAKLCHFIISYADASSVEGHLAFETFRFGSLTRPATVF 175

Query: 175 GC---GFSTN--RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGD 228
           GC   G S+N   + K +G+ G+  G  S V+Q+    FSYCI  L    +L    +LG+
Sbjct: 176 GCMDSGSSSNTEEDAKTTGLMGMNRGSLSFVNQMGFRKFSYCISGLDSTGFL----LLGE 231

Query: 229 G--AIIDEGDATPLQFIDG--------HYYITLEAISVDGRMLDINPNIFKRDDSGGG-V 277
              + +   + TPL  I           Y + LE I V+ ++L +  ++F  D +G G  
Sbjct: 232 ARYSWLKPLNYTPLVQISTPLPYFDRVAYSVQLEGIKVNNKVLPLPKSVFVPDHTGAGQT 291

Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEG------EQMRSYSWPDKLCYH-GIMSSDLKG 330
           M+DSGT  T+L+   Y ALR E +++  G      E    +     LCY     SS L  
Sbjct: 292 MVDSGTQFTFLLGPVYSALRKEFLLQTAGVLRVLNEPQYVFQGAMDLCYLIDSTSSTLPN 351

Query: 331 FPTVRFHFRGGAKLALEKDSMFY------QPRPDAFCMAVNPASINNRYVNFSLIGMMAQ 384
            P V+  FR GA++++    + Y      + +   +C     +  +   ++  LIG   Q
Sbjct: 352 LPVVKLMFR-GAEMSVSGQRLLYRVPGEVRGKDSVWCFTFGNS--DELGISSFLIGHHQQ 408

Query: 385 QFYNVGYDIGRKQKTFQRMDCEV 407
           Q   + YD+   +  F  + C++
Sbjct: 409 QNVWMEYDLENSRIGFAELRCDL 431


>gi|159464048|ref|XP_001690254.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
 gi|158284242|gb|EDP09992.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
          Length = 485

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 97/368 (26%), Positives = 163/368 (44%), Gaps = 38/368 (10%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
           +S FY  + +G P       +DTGS++ ++ C  C  C       F P +S++   + C 
Sbjct: 10  HSYFYTTLKLGTPERTFSVIIDTGSTITYIPCKDCSHCGKHTAEWFDPDKSTTAKKLACG 69

Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
              C       C+    RC Y++ Y     +  ++  +   F ++D        +VFGC 
Sbjct: 70  DPLCN-CGTPSCTCNNDRCYYSRTYAERSSSEGWMIEDTFGFPDSDSPV----RLVFGCE 124

Query: 177 GFSTNRNFK--FSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLILGD 228
              T   ++    GI G+G   ++  SQL         FS C G   D       L+LGD
Sbjct: 125 NGETGEIYRQMADGIMGMGNNHNAFQSQLVQRKVIEDVFSLCFGYPKD-----GILLLGD 179

Query: 229 GAIIDEGDA--TP-LQFIDGHYY-ITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
             + +  +   TP L  +  HYY + ++ I+V+G+ L  + ++F R   G G ++DSGT 
Sbjct: 180 VTLPEGANTVYTPLLTHLHLHYYNVKMDGITVNGQTLAFDASVFDR---GYGTVLDSGTT 236

Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD----KLCYHGIMS--SDL-KGFPTVRFH 337
            T+L  +A++A+   V   +E + ++S    D     +C+ G      DL K FP   F 
Sbjct: 237 FTYLPTDAFKAMAKAVGDYVEKKGLQSTPGADPQYNDICWKGAPDQFKDLDKYFPPAEFV 296

Query: 338 FRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
           F GGAKL L      +  +P  +C+      I +   + +L+G ++ +   V YD    +
Sbjct: 297 FGGGAKLTLPPLRYLFLSKPAEYCLG-----IFDNGNSGALVGGVSVRDVVVTYDRRNSK 351

Query: 398 KTFQRMDC 405
             F  M C
Sbjct: 352 VGFTTMAC 359


>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 477

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 103/369 (27%), Positives = 151/369 (40%), Gaps = 52/369 (14%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSYAYVPCDSE 119
           F V +  G P       +DTGS L W+ C PC   C  Q    F P++SSSYA VPC + 
Sbjct: 137 FVVVVGFGTPAQTAAIILDTGSDLSWIQCKPCSGHCYRQHDPDFDPAKSSSYAAVPCGTP 196

Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
            C     A        C+Y   Y  G  T+  +S + LTF ++ + T       FGCG  
Sbjct: 197 VCAA---AGGMCNGTTCLYGVQYGDGSSTTGVLSRDTLTFNSSSKFT----GFTFGCGEK 249

Query: 180 TNRNF-----KFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
              +F           G     S         FSYC+     P Y      L  GA    
Sbjct: 250 NIGDFGEVDGLLGLGRGKLSLPSQAAPSFGGVFSYCL-----PSYNTTPGYLNIGAT-KP 303

Query: 235 GDATPLQFI--------DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVT 286
               P+Q+            Y+I L +I++ G +L + P++F +     G ++DSGT +T
Sbjct: 304 TSTVPVQYTAMIKKPQYPSFYFIELVSINIGGYILPVPPSVFTKT----GTLLDSGTILT 359

Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG-----FPTVRFHFRGG 341
           +L   AY +LRD     ++G +      P   CY      D  G      P V F+F  G
Sbjct: 360 YLPPPAYTSLRDRFKFTMQGNKPAPPYEPLDTCY------DFTGQGAIVIPAVSFNFSDG 413

Query: 342 AKLALEKDSMFYQP---RPDAFCMAV--NPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
           A   L+   +   P   +P   C+A    PA++      FS++G   Q+   V YD+  +
Sbjct: 414 AVFDLDFYGIMIFPDDAKPLIGCLAFVSRPAAM-----PFSIVGNTQQRAAEVIYDVPSQ 468

Query: 397 QKTFQRMDC 405
           +  F  + C
Sbjct: 469 KIGFIPISC 477


>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 634

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 101/377 (26%), Positives = 162/377 (42%), Gaps = 54/377 (14%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
           N  +   + IG PP      +DTGS++ +V C  C  C       F P  SS+Y  V C 
Sbjct: 81  NGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPESSSTYQPVKCT 140

Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
            +         C + + +C+Y + Y     +S  +  + ++F N  +S +  Q  VFGC 
Sbjct: 141 ID-------CNCDSDRMQCVYERQYAEMSTSSGVLGEDLISFGN--QSELAPQRAVFGCE 191

Query: 178 FSTNRNF---KFSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLILGD 228
                +       GI GLG G  S++ QL      + SFS C G +           +G 
Sbjct: 192 NVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVISDSFSLCYGGMD----------VGG 241

Query: 229 GAIIDEGDATPLQFI--------DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
           GA++  G + P              +Y I L+ I V G+ L +N N+F   D   G ++D
Sbjct: 242 GAMVLGGISPPSDMAFAYSDPVRSPYYNIDLKEIHVAGKRLPLNANVF---DGKHGTVLD 298

Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD----KLCYHGI---MSSDLKGFPT 333
           SGT   +L + A+ A +D ++  L  + ++  S PD     +C+ G    +S   K FP 
Sbjct: 299 SGTTYAYLPEAAFLAFKDAIVKEL--QSLKKISGPDPNYNDICFSGAGIDVSQLSKSFPV 356

Query: 334 VRFHFRGGAKLALEKDS-MFYQPR-PDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
           V   F  G K  L  ++ MF   +   A+C+ V      N     +L+G +  +   V Y
Sbjct: 357 VDMVFENGQKYTLSPENYMFRHSKVRGAYCLGV----FQNGNDQTTLLGGIIVRNTLVVY 412

Query: 392 DIGRKQKTFQRMDCEVL 408
           D  + +  F + +C  L
Sbjct: 413 DREQTKIGFWKTNCAEL 429


>gi|449446233|ref|XP_004140876.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 498

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 94/371 (25%), Positives = 164/371 (44%), Gaps = 36/371 (9%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLGTIFYP---SRSSSYAYV 114
           L+Y  I IG P    +  +DTGS ++WV+C  CR+C  +  LG    P     S++   V
Sbjct: 86  LYYAKIGIGTPSKDYYVQVDTGSDIVWVNCIQCRECPRTSSLGMELTPYDLEESTTGKLV 145

Query: 115 PCDSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD---ESTIH 168
            CD + C      P + C+     C Y Q+Y  G  T+ +   + + +       E+T  
Sbjct: 146 SCDEQFCLEVNGGPLSGCTT-NMSCPYLQIYGDGSSTAGYFVKDYVQYNRVSGDLETTAA 204

Query: 169 VQDVVFGCG------FSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHD 216
              + FGCG        ++      GI G G   SS++SQL S+      F++C+     
Sbjct: 205 NGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCL----- 259

Query: 217 PDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
            D  +   I   G ++  + + TPL     HY + +  + V   +L+I+ ++F+  D   
Sbjct: 260 -DGTNGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVGHIILNISADVFEAGDR-K 317

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
           G +IDSGT + +L +  YE L  +++ +    ++++     K C+      D  GFP V 
Sbjct: 318 GTIIDSGTTLAYLPELIYEPLVAKILSQQHNLEVQTIHGEYK-CFQYSERVD-DGFPPVI 375

Query: 336 FHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR-YVNFSLIGMMAQQFYNVGYDIG 394
           FHF     L +      +Q   + +C+    + + +R   N +L G +      V YD+ 
Sbjct: 376 FHFENSLLLKVYPHEYLFQ-YENLWCIGWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDLE 434

Query: 395 RKQKTFQRMDC 405
            +   +   +C
Sbjct: 435 NQTIGWTEYNC 445


>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 752

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 105/384 (27%), Positives = 164/384 (42%), Gaps = 51/384 (13%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           +++++ IG PP      +DTGS L W+ C PC DC  Q G  + P  S S+  + C+   
Sbjct: 196 YFIDVFIGSPPKHFSLILDTGSDLNWIQCVPCFDCFEQNGPYYDPKDSISFRNITCNDPR 255

Query: 121 CRYF----PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDEST-----IHVQD 171
           C+      P   C      C Y   Y     T+   + E  T   T  +T       V++
Sbjct: 256 CQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRVEN 315

Query: 172 VVFGCGFSTNRNFKFSGIF-------GLGIGRSSLVSQLNS----SFSYCIGSLHDPDYL 220
           V+FGCG   NR     G+F       GLG G  S  SQL S    SFSYC+        +
Sbjct: 316 VMFGCG-HWNR-----GLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRDSDTSV 369

Query: 221 HNKLILGDG-----------AIIDEGDATPLQFIDGHYYITLEAISVDGRMLDI-NPNIF 268
            +KLI G+              +  G   P   +D  YY+ +++I V G  L I   N  
Sbjct: 370 SSKLIFGEDKDLLTHPELNFTSLIAGKENP---VDTFYYLQIKSIFVGGEKLQIPEENWN 426

Query: 269 KRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDL 328
              D  GG +IDSGT +++    AY  +++  + +++G ++         CY+ +  +D 
Sbjct: 427 LSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYN-VSGTDE 485

Query: 329 KGFPTVRFHFRGGAKLALEKDSMFYQ-PRPDAFCMAV--NPASINNRYVNFSLIGMMAQQ 385
             FP     F  GA      ++ F +  + D  C+A+   P S        S+IG   QQ
Sbjct: 486 LNFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSA------LSIIGNYQQQ 539

Query: 386 FYNVGYDIGRKQKTFQRMDCEVLD 409
            +++ YD    +  +  M C  ++
Sbjct: 540 NFHILYDTKNSRLGYAPMRCAEIE 563


>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
 gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
          Length = 428

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 97/384 (25%), Positives = 163/384 (42%), Gaps = 43/384 (11%)

Query: 35  YLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD 94
           Y+  K    +T   Q+     +  SL+ +++ +G P   Q   +DTGSS  WV C  C  
Sbjct: 56  YITNKTSRLSTKAVQVGWDRGLQTSLYVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDG 114

Query: 95  CSPQLGTIFYPSRSSSYAYVPCDSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVF 151
           C     T F  SRS++ A V C +  C      P+ + S     C +   Y  G  +   
Sbjct: 115 CHTNPRT-FLQSRSTTCAKVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGI 173

Query: 152 VSTEQLTFKNTDESTIHVQDVVFGC---GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS-- 206
           +  + LTF +  +    +    FGC    F  N      G+ G+G G  S++ Q + +  
Sbjct: 174 LYQDTLTFSDVQK----IPGFSFGCNMDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFD 229

Query: 207 -FSYCIGSLHDPDYLHNKLI--LGDGAIIDEGDATPLQFIDGH-----YYITLEAISVDG 258
            FSYC+          +K       G +    D    + +        +++ L AISVDG
Sbjct: 230 CFSYCLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDG 289

Query: 259 RMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRD---EVMIRLEGEQMRSYSWP 315
             L ++P++F R     GV+ DSG++++++   A   L     E++++    +  S    
Sbjct: 290 ERLGLSPSVFSRK----GVVFDSGSELSYIPDRALSVLSQRIRELLLKRGAAEEES---- 341

Query: 316 DKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQ---PRPDAFCMAVNPASINNR 372
           ++ CY  + S D    P +  HF  GA+  L    +F +      D +C+A  P      
Sbjct: 342 ERNCYD-MRSVDEGDMPAISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTE---- 396

Query: 373 YVNFSLIGMMAQQFYNVGYDIGRK 396
             + S+IG + Q    V YD+ R+
Sbjct: 397 --SVSIIGSLMQTSKEVVYDLKRQ 418


>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
 gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
          Length = 393

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 108/400 (27%), Positives = 169/400 (42%), Gaps = 49/400 (12%)

Query: 31  ARLAYLQEKIRSHNTYQAQILPSNDISNSL------FYVNISIGQPPVPQFTAMDTGSSL 84
           AR+ ++    R++++  + +  + D+ + L      + ++IS+G P        DTGS L
Sbjct: 21  ARVRWMAA--RANSSSWSSMAGTTDVESPLHPDGGGYVMDISVGTPGKRFRAIADTGSDL 78

Query: 85  LWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLI 144
           +WV   PC  CS   GTIF P +SS++  + C S+ C   P   C      C Y+  Y  
Sbjct: 79  VWVQSEPCTGCSG--GTIFDPRQSSTFREMDCSSQLCTELP-GSCEPGSSACSYSYEYGS 135

Query: 145 GPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQL- 203
           G ET    + + ++   T   +        GCG   +      G+ GLG G  SL SQL 
Sbjct: 136 G-ETEGEFARDTISLGTTSGGSQKFPSFAVGCGMVNSGFDGVDGLVGLGQGPVSLTSQLS 194

Query: 204 ---NSSFSYCI-----GSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAIS 255
              +S FSYC+      S   P        L    I       P      +Y +T+  I+
Sbjct: 195 AAIDSKFSYCLVDINSQSESSPLLFGPSAALHGTGIQSTKITPPSDTYPTYYLLTVNGIA 254

Query: 256 VDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEAL--RDEVMI---RLEGEQMR 310
           V G+ +           S G  +IDSGT +T++    Y  +  R E M+   R++G  M 
Sbjct: 255 VAGQTM----------GSPGTTIIDSGTTLTYVPSGVYGRVLSRMESMVTLPRVDGSSMG 304

Query: 311 SYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFY--QPRPDAFCMAVNPAS 368
                  LCY    + + K FP +      GA +     + F       D  C+A+  A 
Sbjct: 305 L-----DLCYDRSSNRNYK-FPALTIRL-AGATMTPPSSNYFLVVDDSGDTVCLAMGSAG 357

Query: 369 INNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
                +  S+IG + QQ Y++ YD G  + +F +  CE L
Sbjct: 358 ----GLPVSIIGNVMQQGYHILYDRGSSELSFVQAKCESL 393


>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 488

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 100/398 (25%), Positives = 168/398 (42%), Gaps = 44/398 (11%)

Query: 40  IRSHNTYQ-AQILPSNDI---------SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC 89
           +R+H+ ++ +++L + D+         S  L++  I +G P       +DTGS +LWV+C
Sbjct: 54  LRAHDVHRHSRLLSAIDLPLGGDSQPESIGLYFAKIGLGTPSRDFHVQVDTGSDILWVNC 113

Query: 90  YPCRDCSPQLGTI----FYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIG 145
             C  C  +   +    +    SS+   V C    C Y            C Y  LY  G
Sbjct: 114 AGCIRCPRKSDLVELTPYDADASSTAKSVSCSDNFCSYVNQRSECHSGSTCQYVILYGDG 173

Query: 146 PETSVFVSTE----QLTFKNTDESTIHVQDVVFGCGFSTNRNFKFS-----GIFGLGIGR 196
             T+ ++  +     L   N    + +   ++FGCG   +     S     GI G G   
Sbjct: 174 SSTNGYLVRDVVHLDLVTGNRQTGSTN-GTIIFGCGSKQSGQLGESQAAVDGIMGFGQSN 232

Query: 197 SSLVSQLNS------SFSYCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFIDGHYYI 249
           SS +SQL S      SF++C+      D  +   I   G ++  +   TP+     HY +
Sbjct: 233 SSFISQLASQGKVKRSFAHCL------DNNNGGGIFAIGEVVSPKVKTTPMLSKSAHYSV 286

Query: 250 TLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQM 309
            L AI V   +L ++ + F   D   GV+IDSGT + +L    Y  L ++++     +++
Sbjct: 287 NLNAIEVGNSVLQLSSDAFDSGDD-KGVIIDSGTTLVYLPDAVYNPLMNQILA--SHQEL 343

Query: 310 RSYSWPDKL-CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPAS 368
             ++  D   C+H I    L  FPTV F F     LA+      +Q R D +C       
Sbjct: 344 NLHTVQDSFTCFHYI--DRLDRFPTVTFQFDKSVSLAVYPQEYLFQVREDTWCFGWQNGG 401

Query: 369 INNR-YVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           +  +   + +++G MA     V YDI  +   +   +C
Sbjct: 402 LQTKGGASLTILGDMALSNKLVVYDIENQVIGWTNHNC 439


>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
           CELL 1-like [Cucumis sativus]
          Length = 757

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 105/384 (27%), Positives = 164/384 (42%), Gaps = 51/384 (13%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           +++++ IG PP      +DTGS L W+ C PC DC  Q G  + P  S S+  + C+   
Sbjct: 196 YFIDVFIGSPPKHFSLILDTGSDLNWIQCVPCFDCFEQNGPYYDPKDSISFRNITCNDPR 255

Query: 121 CRYF----PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDEST-----IHVQD 171
           C+      P   C      C Y   Y     T+   + E  T   T  +T       V++
Sbjct: 256 CQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRVEN 315

Query: 172 VVFGCGFSTNRNFKFSGIF-------GLGIGRSSLVSQLNS----SFSYCIGSLHDPDYL 220
           V+FGCG   NR     G+F       GLG G  S  SQL S    SFSYC+        +
Sbjct: 316 VMFGCG-HWNR-----GLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRDSDTSV 369

Query: 221 HNKLILGDG-----------AIIDEGDATPLQFIDGHYYITLEAISVDGRMLDI-NPNIF 268
            +KLI G+              +  G   P   +D  YY+ +++I V G  L I   N  
Sbjct: 370 SSKLIFGEDKDLLTHPELNFTSLIAGKENP---VDTFYYLQIKSIFVGGEKLQIPEENWN 426

Query: 269 KRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDL 328
              D  GG +IDSGT +++    AY  +++  + +++G ++         CY+ +  +D 
Sbjct: 427 LSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYN-VSGTDE 485

Query: 329 KGFPTVRFHFRGGAKLALEKDSMFYQ-PRPDAFCMAV--NPASINNRYVNFSLIGMMAQQ 385
             FP     F  GA      ++ F +  + D  C+A+   P S        S+IG   QQ
Sbjct: 486 LNFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSA------LSIIGNYQQQ 539

Query: 386 FYNVGYDIGRKQKTFQRMDCEVLD 409
            +++ YD    +  +  M C  ++
Sbjct: 540 NFHILYDTKNSRLGYAPMRCAEIE 563


>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
          Length = 584

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 104/370 (28%), Positives = 155/370 (41%), Gaps = 58/370 (15%)

Query: 67  IGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPY 126
           IG PP      +DTGS++ +V C  C  C       F P  S +Y  V C+       P 
Sbjct: 2   IGTPPQEFALIVDTGSTVTYVPCNSCDQCGNHQDPKFQPDLSDTYHPVKCN-------PD 54

Query: 127 ARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF 186
             C     +C Y + Y     +S  +  + ++F N  E  +  Q  VFGC  +   +  F
Sbjct: 55  CTCDTENDQCTYERQYAEMSSSSGILGEDLVSFGNMSE--LKPQRAVFGCENAETGDL-F 111

Query: 187 S----GIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGD 236
           S    GI GLG G  S+V QL      N SFS C G +           +G GA++    
Sbjct: 112 SQHADGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGME----------VGGGAMVLGQI 161

Query: 237 ATPLQFIDGH--------YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
           + P   +  H        Y I L  + V G+ LDINP +F   D   G ++DSGT   +L
Sbjct: 162 SPPSDMVFSHSDPDRSPYYNIELRGLHVAGKKLDINPQVF---DGKHGTILDSGTTYAYL 218

Query: 289 VKEAYEALRDEVMIRLEG-EQMRSYSWPDK----LCYHGIMSSD---LKGFPTVRFHFRG 340
            + A+      +   L G +Q+R    PD     +C+ G  S      K FP+V   F  
Sbjct: 219 PEAAFLPFIQAITSELHGLKQIRG---PDPNYNDVCFSGAGSEIPELYKTFPSVDMVFDN 275

Query: 341 GAKLALEKDSMFYQPRP--DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQK 398
           G K +L  ++  ++      A+C+ V      N     +L+G +  +   V YD    + 
Sbjct: 276 GEKYSLSPENYLFKHSKVHGAYCLGV----FQNGKDPTTLLGGIVVRNTLVTYDREHSKV 331

Query: 399 TFQRMDCEVL 408
            F + +C VL
Sbjct: 332 GFWKTNCSVL 341


>gi|343172996|gb|AEL99201.1| aspartyl protease family protein, partial [Silene latifolia]
          Length = 584

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 104/370 (28%), Positives = 155/370 (41%), Gaps = 58/370 (15%)

Query: 67  IGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPY 126
           IG PP      +DTGS++ +V C  C  C       F P  S +Y  V C+       P 
Sbjct: 2   IGTPPQEFALIVDTGSTVTYVPCNSCDQCGNHQDPKFQPDLSDTYHPVKCN-------PD 54

Query: 127 ARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF 186
             C     +C Y + Y     +S  +  + ++F N  E  +  Q  VFGC  +   +  F
Sbjct: 55  CTCDTENDQCTYERQYAEMSSSSGILGEDLVSFGNMSE--LKPQRAVFGCENAETGDL-F 111

Query: 187 S----GIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGD 236
           S    GI GLG G  S+V QL      N SFS C G +           +G GA++    
Sbjct: 112 SQHADGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGME----------VGGGAMVLGQI 161

Query: 237 ATPLQFIDGH--------YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
           + P   +  H        Y I L  + V G+ LDINP +F   D   G ++DSGT   +L
Sbjct: 162 SPPSDMVFSHSDPDRSPYYNIELRGLHVAGKKLDINPQVF---DGKHGTILDSGTTYAYL 218

Query: 289 VKEAYEALRDEVMIRLEG-EQMRSYSWPDK----LCYHGIMSSD---LKGFPTVRFHFRG 340
            + A+      +   L G +Q+R    PD     +C+ G  S      K FP+V   F  
Sbjct: 219 PEAAFLPFIQAITSELHGLKQIRG---PDPNYNDVCFSGAGSEIPELYKTFPSVDMVFDN 275

Query: 341 GAKLALEKDSMFYQPRP--DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQK 398
           G K +L  ++  ++      A+C+ V      N     +L+G +  +   V YD    + 
Sbjct: 276 GEKYSLSPENYLFKHSKVHGAYCLGV----FQNGKDPTTLLGGIVVRNTLVTYDREHSKV 331

Query: 399 TFQRMDCEVL 408
            F + +C VL
Sbjct: 332 GFWKTNCSVL 341


>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
 gi|223973065|gb|ACN30720.1| unknown [Zea mays]
          Length = 631

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 102/380 (26%), Positives = 169/380 (44%), Gaps = 56/380 (14%)

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
           ++N  +   + IG PP      +D+GS++ +V C  C  C       F P  SSSY+ V 
Sbjct: 83  LTNGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCSSCEQCGNHQDPRFQPDLSSSYSPVK 142

Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
           C+ +         C + K +C Y + Y     +S  +  + ++F    ES +  Q  +FG
Sbjct: 143 CNVD-------CTCDSDKKQCTYERQYAEMSSSSGVLGEDIVSFGR--ESELKPQHAIFG 193

Query: 176 CGFSTNRNFKFS----GIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLI 225
           C  S   +  FS    GI GLG G+ S++ QL      + SFS C G +           
Sbjct: 194 CENSETGDL-FSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMD---------- 242

Query: 226 LGDGAIIDEGDATPLQFI--------DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
           +G GA++  G   P   I          +Y I L+ I V G+ L +   IF   +S  G 
Sbjct: 243 IGGGAMVLGGMLAPPDMIFSNSDPLRSPYYNIELKEIHVAGKALRVESRIF---NSKHGT 299

Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK----LCYHGI---MSSDLKG 330
           ++DSGT   +L ++A+ A ++ V  ++    ++    PD     +C+ G    +S   + 
Sbjct: 300 VLDSGTTYAYLPEQAFVAFKEAVTSKV--HSLKKIRGPDPSYKDICFAGAGRNVSKLHEV 357

Query: 331 FPTVRFHFRGGAKLALEKDS-MFYQPRPD-AFCMAVNPASINNRYVNFSLIGMMAQQFYN 388
           FP V   F  G KL+L  ++ +F   + D A+C+ V      N     +L+G +  +   
Sbjct: 358 FPDVDMVFGNGQKLSLTPENYLFRHSKVDGAYCLGV----FQNGKDPTTLLGGIIVRNTL 413

Query: 389 VGYDIGRKQKTFQRMDCEVL 408
           V YD   ++  F + +C  L
Sbjct: 414 VTYDRHNEKIGFWKTNCSEL 433


>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
           sylvestris]
          Length = 502

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 106/377 (28%), Positives = 148/377 (39%), Gaps = 59/377 (15%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYVPCDSE 119
           + VN+ +G P        DTGS L W  C PC + C  Q   IF PS S +Y+ + C S 
Sbjct: 154 YIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSASKTYSNISCTST 213

Query: 120 HCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
            C     A      CS+    C+Y   Y     T  F + + LT    D         +F
Sbjct: 214 ACSGLKSATGNSPGCSS--SNCVYGIQYGDSSFTVGFFAKDTLTLTQNDV----FDGFMF 267

Query: 175 GCGFSTNRNF--KFSGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGD 228
           GCG   NR    K +G+ GLG    S+V Q        FSYC+ +    +     L  G+
Sbjct: 268 GCG-QNNRGLFGKTAGLIGLGRDPLSIVQQTAQKFGKYFSYCLPTSRGSN---GHLTFGN 323

Query: 229 GAIIDEGDA-------TPLQFIDGH--YYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
           G  +    A       TP     G   Y+I +  ISV G+ L I+P +F+      G +I
Sbjct: 324 GNGVKTSKAVKNGITFTPFASSQGATFYFIDVLGISVGGKALSISPMLFQN----AGTII 379

Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL-----CYHGIMSSDLKGF--- 331
           DSGT +T L    Y +L+         + M  Y     L     CY      DL  +   
Sbjct: 380 DSGTVITRLPSTVYGSLKSTFK-----QFMSKYPTAPALSLLDTCY------DLSNYTSI 428

Query: 332 --PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNV 389
             P + F+F G A + LE + +         C+A    + N       + G + QQ   V
Sbjct: 429 SIPKISFNFNGNANVDLEPNGILITNGASQVCLAF---AGNGDDDTIGIFGNIQQQTLEV 485

Query: 390 GYDIGRKQKTFQRMDCE 406
            YD+   Q  F    C 
Sbjct: 486 VYDVAGGQLGFGYKGCS 502


>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
          Length = 333

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 99/357 (27%), Positives = 155/357 (43%), Gaps = 39/357 (10%)

Query: 65  ISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DCSPQLGTIFYPSRSSSYAYVPCDSEHCRY 123
           + +G P       +DTGSSL W+ C PC   C  Q G +F P  SS+YA V C ++ C  
Sbjct: 1   MGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCSAQQCSD 60

Query: 124 FPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
            P A      CS+  + CIY   Y     +  ++S + ++F +T      + +  +GCG 
Sbjct: 61  LPSATLNPSACSS-SNVCIYQASYGDSSFSVGYLSKDTVSFGSTS-----LPNFYYGCGQ 114

Query: 179 STNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILGDGAIID 233
                F + +G+ GL   + SL+ QL      SF+YC+     P    +  +        
Sbjct: 115 DNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFTYCL-----PSSSSSGYLSLGSYNPG 169

Query: 234 EGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVK 290
           +   TP+      D  Y+I L  ++V G  L ++ + +    +    +IDSGT +T L  
Sbjct: 170 QYSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLPT----IIDSGTVITRLPT 225

Query: 291 EAYEALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKD 349
             Y AL   V   ++G  +  +YS  D  C+ G  S      P V   F GGA L L   
Sbjct: 226 SVYSALSKAVAAAMKGTSRASAYSILDT-CFKGQASR--VSAPAVTMSFAGGAALKLSAQ 282

Query: 350 SMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
           ++         C+A  PA       + ++IG   QQ ++V YD+   +  F    C 
Sbjct: 283 NLLVDVDDSTTCLAFAPAR------SAAIIGNTQQQTFSVVYDVKSSRIGFAAGGCS 333


>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
 gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
          Length = 631

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 108/425 (25%), Positives = 178/425 (41%), Gaps = 69/425 (16%)

Query: 11  YNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQP 70
           Y N    PA   +RG+                HN      L  + ++N  +   + IG P
Sbjct: 54  YPNATRLPASSARRGLG-------------DGHNPNARMRLHDDLLTNGYYTTRLYIGTP 100

Query: 71  PVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCS 130
                  +D+GS++ +V C  C  C       F P  SS+Y+ V C+ +         C 
Sbjct: 101 SQEFALIVDSGSTVTYVPCATCEQCGNHQDPRFQPDLSSTYSPVKCNVD-------CTCD 153

Query: 131 AYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFS--- 187
             + +C Y + Y     +S  +  + ++F    ES +  Q  VFGC  +T     FS   
Sbjct: 154 NERSQCTYERQYAEMSSSSGVLGEDIMSFGK--ESELKPQRAVFGCE-NTETGDLFSQHA 210

Query: 188 -GIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPL 240
            GI GLG G+ S++ QL      + SFS C G +           +G G ++  G   P 
Sbjct: 211 DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMD----------VGGGTMVLGGMPAPP 260

Query: 241 QFIDGH--------YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEA 292
             +  H        Y I L+ I V G+ L ++P IF   +S  G ++DSGT   +L ++A
Sbjct: 261 DMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIF---NSKHGTVLDSGTTYAYLPEQA 317

Query: 293 YEALRDEVMIRLEGEQMRSYSWPDK----LCYHGI---MSSDLKGFPTVRFHFRGGAKLA 345
           + A +D V  ++    ++    PD     +C+ G    +S   + FP V   F  G KL+
Sbjct: 318 FVAFKDAVTNKV--NSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVDMVFGNGQKLS 375

Query: 346 LEKDSMFYQ--PRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
           L  ++  ++      A+C+ V      N     +L+G +  +   V YD   ++  F + 
Sbjct: 376 LSPENYLFRHSKVEGAYCLGV----FQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKT 431

Query: 404 DCEVL 408
           +C  L
Sbjct: 432 NCSEL 436


>gi|224090744|ref|XP_002309070.1| predicted protein [Populus trichocarpa]
 gi|222855046|gb|EEE92593.1| predicted protein [Populus trichocarpa]
          Length = 404

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 102/352 (28%), Positives = 153/352 (43%), Gaps = 64/352 (18%)

Query: 47  QAQILPSNDIS----------NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC-----YP 91
           + Q++PS  +           N    V++++G PP      +DTGS L W+HC     YP
Sbjct: 7   KTQVIPSGSVPRSPNKPPFHHNVSLIVSLTVGTPPQNVSMVIDTGSELSWLHCNKTLSYP 66

Query: 92  CRDCSPQLGTIFYPSRSSSYAYVPCDSEHC----RYFPYARCSAYKHRCIYTQLYLIGPE 147
                    T F P+RS+SY  +PC S  C    + FP        + C  T  Y     
Sbjct: 67  ---------TTFDPTRSTSYQTIPCSSPTCTNRTQDFPIPASCDSNNLCHATLSYADASS 117

Query: 148 TSVFVSTEQLTFKNTDESTIHVQDVVFGCG---FSTN--RNFKFSGIFGLGIGRSSLVSQ 202
           +   ++++     ++D     +  +VFGC    FS+N   + K +G+ G+  G  S VSQ
Sbjct: 118 SDGNLASDVFHIGSSD-----ISGLVFGCMDSVFSSNSDEDSKSTGLMGMNRGSLSFVSQ 172

Query: 203 LN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGD---------ATPLQFIDG-HYYITL 251
           L    FSYCI            L+LG+  +              +TPL + D   Y + L
Sbjct: 173 LGFPKFSYCISGTD----FSGLLLLGESNLTWSVPLNYTPLIQISTPLPYFDRVAYTVQL 228

Query: 252 EAISVDGRMLDINPNIFKRDDSGGG-VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMR 310
           E I V  ++L I  + F+ D +G G  M+DSGT  T+L+   Y ALR    +      +R
Sbjct: 229 EGIKVLDKLLPIPKSTFEPDHTGAGQTMVDSGTQFTFLLGPVYNALR-SAFLNQTSSVLR 287

Query: 311 SYSWPD-------KLCYHGIMSSD-LKGFPTVRFHFRGGAKLALEKDSMFYQ 354
               PD        LCY   +S   L   PTV   FR GA++ +  D + Y+
Sbjct: 288 VLEDPDFVFQGAMDLCYLVPLSQRVLPLLPTVTLVFR-GAEMTVSGDRVLYR 338


>gi|357502759|ref|XP_003621668.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355496683|gb|AES77886.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 481

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 98/376 (26%), Positives = 172/376 (45%), Gaps = 38/376 (10%)

Query: 57  SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ--LG---TIFYPSRSSSY 111
           S  L+Y  I IG P    +  +DTG+ ++WV+C  C++C  +  LG   T++    SSS 
Sbjct: 69  SVGLYYAKIGIGTPSKDYYLQVDTGTDMMWVNCIQCKECPTRSNLGMDLTLYNIKESSSG 128

Query: 112 AYVPCDSEHCRYFP---YARCSAYKH-RCIYTQLYLIGPETSVFVSTEQLTFKNT--DES 165
             VPCD E C+         C++  +  C Y ++Y  G  T+ +   + + F     D  
Sbjct: 129 KLVPCDQELCKEINGGLLTGCTSKTNDSCPYLEIYGDGSSTAGYFVKDVVLFDQVSGDLK 188

Query: 166 TIHVQ-DVVFGCGFSTNRNFKFS------GIFGLGIGRSSLVSQLNSS------FSYCIG 212
           T      V+FGCG   + +  +S      GI G G    S++SQL+SS      F++C+ 
Sbjct: 189 TASANGSVIFGCGARQSGDLSYSNEEALDGILGFGKANYSMISQLSSSGKVKKMFAHCLN 248

Query: 213 SLHDPDYLHNKLILGDGAIIDEG-DATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRD 271
            ++         I   G ++    + TPL     HY + + AI V    L+++ +  ++ 
Sbjct: 249 GVNGGG------IFAIGHVVQPTVNTTPLLPDQPHYSVNMTAIQVGHTFLNLSTDASEQR 302

Query: 272 DSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF 331
           DS  G +IDSGT + +L    Y+ L  +++ +    ++++    +  C+    S D  GF
Sbjct: 303 DS-KGTIIDSGTTLAYLPDGIYQPLVYKILSQQPNLKVQTLH-DEYTCFQYSGSVD-DGF 359

Query: 332 PTVRFHFRGGAKLAL-EKDSMFYQPRPDAFCMAVNPASINNR-YVNFSLIGMMAQQFYNV 389
           P V F+F  G  L +   D +F     + +C+    +   +R   N +L+G +      V
Sbjct: 360 PNVTFYFENGLSLKVYPHDYLFLS--ENLWCIGWQNSGAQSRDSKNMTLLGDLVLSNKLV 417

Query: 390 GYDIGRKQKTFQRMDC 405
            YD+  +   +   +C
Sbjct: 418 FYDLENQVIGWTEYNC 433


>gi|125553832|gb|EAY99437.1| hypothetical protein OsI_21406 [Oryza sativa Indica Group]
          Length = 409

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 102/358 (28%), Positives = 154/358 (43%), Gaps = 43/358 (12%)

Query: 68  GQPPVPQFTAMDTGSSLLWVHCYPCR--DCSPQLGTIFYPSRSSSYAYVPCDSEHC-RYF 124
           G   V Q   +D+GS + WV C PC    C PQ   +F P+ S++YA VPC S  C R  
Sbjct: 75  GTSAVSQTVIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYAAVPCSSAACARLG 134

Query: 125 PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS---TN 181
           PY R      +C +   Y  G   +   S++ LT    D     V+  +FGC  +   + 
Sbjct: 135 PYRRGCLANSQCQFGITYANGATATGTYSSDDLTLGPYDV----VRGFLFGCAHADQGST 190

Query: 182 RNFKFSGIFGLGIGRSSLVSQLNSS----FSYCI-GSLHDPDYLHNKLILGDGAIIDEGD 236
            ++  +G   LG G  S V Q  S     FSYC+  S     ++   +     A++    
Sbjct: 191 FSYDVAGTLALGGGSQSFVQQTASQYSRVFSYCVPPSTSSFGFIMFGVPPQRAALVPTFV 250

Query: 237 ATPL----QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEA 292
           +TPL          Y + L +I V GR L + P +F         +IDS T ++ +   A
Sbjct: 251 STPLLSSSTMSPTFYRVLLRSIIVAGRPLPVPPTVFSASS-----VIDSATVISRIPPTA 305

Query: 293 YEALRDEVMIRLEGEQMRSYSWPDKL---CY--HGIMSSDLKGFPTVRFHFRGGAKLALE 347
           Y+ALR      +    M   + P  +   CY   G+ S  L   P++   F GGA + L+
Sbjct: 306 YQALRAAFRSAMT---MYRPAPPVSILDTCYDFSGVRSITL---PSIALVFDGGATVNLD 359

Query: 348 KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
              +  Q      C+A  P + ++R   F  IG + Q+   V YD+  K   F+   C
Sbjct: 360 AAGILLQ-----GCLAFAPTA-SDRMPGF--IGNVQQRTLEVVYDVPGKAIRFRSAAC 409


>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 473

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 107/389 (27%), Positives = 166/389 (42%), Gaps = 44/389 (11%)

Query: 38  EKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCS 96
           +++R     +        I +  + V++ +G P        DTGS L W  C PC R C 
Sbjct: 108 DRLRGSKATKIPAKSGATIGSGNYIVSVGLGTPKKYLSLIFDTGSDLTWTQCQPCARYCY 167

Query: 97  PQLGTIFYPSRSSSYAYVPCDSEHCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVF 151
            Q   +F PS+S++Y+ + C S  C            CSA +  CIY   Y     +  +
Sbjct: 168 NQKDPVFVPSQSTTYSNISCSSPDCSQLESGTGNQPGCSAAR-ACIYGIQYGDQSFSVGY 226

Query: 152 VSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF--KFSGIFGLGIGRSSLVSQLNSS--- 206
            + E LT  +TD     +++ +FGCG   NR      +G+ GLG  + S+V Q       
Sbjct: 227 FAKETLTLTSTDV----IENFLFGCG-QNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQ 281

Query: 207 -FSYCI-GSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDG---HYYITLEAISVDGRML 261
            FSYC+  +     YL      G GA+      TP+    G    Y + +  + V G  +
Sbjct: 282 VFSYCLPKTSSSTGYLTFGGGGGGGAL----KYTPITKAHGVANFYGVDIVGMKVGGTQI 337

Query: 262 DINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL--- 318
            I+ ++F    S  G +IDSGT +T L  +AY AL+         + M  Y    +L   
Sbjct: 338 PISSSVF----STSGAIIDSGTVITRLPPDAYSALKSAFE-----KGMAKYPKAPELSIL 388

Query: 319 --CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNF 376
             CY     S ++  P V F F+GG +L L+   + Y       C+A    + N      
Sbjct: 389 DTCYDLSKYSTIQ-IPKVGFVFKGGEELDLDGIGIMYGASTSQVCLAF---AGNQDPSTV 444

Query: 377 SLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           ++IG + Q+   V YD+G  +  F    C
Sbjct: 445 AIIGNVQQKTLQVVYDVGGGKIGFGYNGC 473


>gi|115465373|ref|NP_001056286.1| Os05g0557100 [Oryza sativa Japonica Group]
 gi|113579837|dbj|BAF18200.1| Os05g0557100 [Oryza sativa Japonica Group]
 gi|125553268|gb|EAY98977.1| hypothetical protein OsI_20935 [Oryza sativa Indica Group]
          Length = 494

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 99/389 (25%), Positives = 153/389 (39%), Gaps = 50/389 (12%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCS---------------PQLGTIFYP 105
           ++V   +G P  P     DTGS L WV C      S                    +F P
Sbjct: 110 YFVRFRVGTPAQPFVLIADTGSDLTWVKCRGAASPSHATATASPAAAPSPAVAPPRVFRP 169

Query: 106 SRSSSYAYVPCDSEHCRY---FPYARCSAYKHRCIYTQLY--------LIGPETSVFVST 154
             S +++ +PC SE C+    F  A CS+    C Y   Y        ++G +++    +
Sbjct: 170 GDSKTWSPIPCSSETCKSTIPFSLANCSSSTAACSYDYRYNDNSAARGVVGTDSATVALS 229

Query: 155 EQLTFKNTDESTIHVQDVVFGCGFS-TNRNFKFS-GIFGLGIGRSSLVSQLNS----SFS 208
                    +    +Q VV GC  +   + F+ S G+  LG    S  S+  S     FS
Sbjct: 230 GGRGGGGGGDRKAKLQGVVLGCTTAHAGQGFEASDGVLSLGYSNISFASRAASRFGGRFS 289

Query: 209 YCIGSLHDPDYLHNKLILGDG------AIIDEGDATPLQF---IDGHYYITLEAISVDGR 259
           YC+     P    + L  G G      +    G  TPL     +   Y + ++++SVDG 
Sbjct: 290 YCLVDHLAPRNATSYLTFGAGPDAASSSAPAPGSRTPLLLDARVRPFYAVAVDSVSVDGV 349

Query: 260 MLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLC 319
            LDI   ++    S GG +IDSGT +T L   AY+A+   +  +L G    +   P   C
Sbjct: 350 ALDIPAEVWDV-GSNGGTIIDSGTSLTVLATPAYKAVVAALSEQLAGLPRVAMD-PFDYC 407

Query: 320 YHGIMSSDLKG---FPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNF 376
           Y+     D  G    P +   F G A+L     S      P   C+ V        +   
Sbjct: 408 YNWTARGDGGGDLAVPKLAVQFAGSARLEPPAKSYVIDAAPGVKCIGVQ----EGAWPGV 463

Query: 377 SLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           S+IG + QQ +   +D+  +   F++  C
Sbjct: 464 SVIGNILQQEHLWEFDLNNRWLRFRQTSC 492


>gi|359482287|ref|XP_002263129.2| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
           vinifera]
 gi|297740017|emb|CBI30199.3| unnamed protein product [Vitis vinifera]
          Length = 502

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 97/312 (31%), Positives = 149/312 (47%), Gaps = 33/312 (10%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
           L++  + +G PP      +DTGS +LWV C  C DC    G     + F PS SS+ + V
Sbjct: 85  LYFTKVKLGSPPREFNVQIDTGSDILWVTCNSCNDCPRTSGLGIELSFFDPSSSSTTSLV 144

Query: 115 PCDSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ- 170
            C    C        A CS   ++C Y+  Y  G  T+ +  ++ L F      ++    
Sbjct: 145 SCSHPICTSLVQTTAAECSPQSNQCSYSFHYGDGSGTTGYYVSDMLYFDTVLGDSLIANS 204

Query: 171 --DVVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNS------SFSYCIGSLHDP 217
              +VFGC     G  T  +    GIFG G    S+VSQL+S       FS+C+    D 
Sbjct: 205 SASIVFGCSTYQSGDLTKVDKAIDGIFGFGQQDLSVVSQLSSLGITPKVFSHCLKGEGDG 264

Query: 218 DYLHNKLILGDGAIIDEGDA--TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
                KL+LG+   I E +   +PL     HY + L++ISV+G++L I+P +F   ++  
Sbjct: 265 G---GKLVLGE---ILEPNIIYSPLVPSQSHYNLNLQSISVNGQLLPIDPAVFATSNN-Q 317

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
           G ++DSGT +T+LV+ AY+     +   +        S  ++ CY    S D + FP V 
Sbjct: 318 GTIVDSGTTLTYLVETAYDPFVSAITATVSSSTTPVLSKGNQ-CYLVSTSVD-EIFPPVS 375

Query: 336 FHFRGGAKLALE 347
            +F GGA + L+
Sbjct: 376 LNFAGGASMVLK 387


>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 485

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 97/372 (26%), Positives = 164/372 (44%), Gaps = 38/372 (10%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ--LG---TIFYPSRSSSYAYV 114
           L+Y  I IG P    +  +DTGS ++WV+C  C+ C  +  LG   T++    S S   V
Sbjct: 79  LYYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLV 138

Query: 115 PCDSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD---ESTIH 168
            CD + C      P + C A    C Y ++Y  G  T+ +   + + + +     ++   
Sbjct: 139 SCDDDFCYQISGGPLSGCKA-NMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTA 197

Query: 169 VQDVVFGCG------FSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHD 216
              V+FGCG        ++      GI G G   SS++SQL SS      F++C+     
Sbjct: 198 NGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCL----- 252

Query: 217 PDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
            D  +   I   G ++  + + TPL     HY + + A+ V    L I  ++F+  D  G
Sbjct: 253 -DGRNGGGIFAIGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLFQPGDRKG 311

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
            + IDSGT + +L +  YE L  ++  +    ++      D  C+      D +GFP V 
Sbjct: 312 AI-IDSGTTLAYLPEIIYEPLVKKITSQEPALKVHIVD-KDYKCFQYSGRVD-EGFPNVT 368

Query: 336 FHFRGGAKLAL-EKDSMFYQPRPDAFCMAVNPASINNR-YVNFSLIGMMAQQFYNVGYDI 393
           FHF     L +   D +F  P    +C+    +++ +R   N +L+G +      V YD+
Sbjct: 369 FHFENSVFLRVYPHDYLF--PHEGMWCIGWQNSAMQSRDRRNMTLLGDLVLSNKLVLYDL 426

Query: 394 GRKQKTFQRMDC 405
             +   +   +C
Sbjct: 427 ENQLIGWTEYNC 438


>gi|359488213|ref|XP_002263620.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 434

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 95/370 (25%), Positives = 163/370 (44%), Gaps = 44/370 (11%)

Query: 63  VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR 122
           V++ IG PP  Q   +DTGS L W+ C       P   T F P  SSS++ +PC+   C+
Sbjct: 80  VSLPIGTPPQTQQMVLDTGSQLSWIQCKVPPKTPP---TAFDPLLSSSFSVLPCNHSLCK 136

Query: 123 -----YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
                Y     C      C Y+  Y  G      +  E+ TF ++  +      ++ GC 
Sbjct: 137 PRVPDYTLPTSCDQ-NRLCHYSYFYADGTYAEGNLVREKFTFSSSQTT----PPLILGCA 191

Query: 178 FSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCI------------GSLH---DPDYLH 221
             ++      GI G+ +GR S  S    S FSYC+            GS +   +P    
Sbjct: 192 TDSSDT---QGILGMNLGRLSFSSLAKISKFSYCVPPRRSQSGSSPTGSFYLGPNPSSAG 248

Query: 222 NKLILGDGAIIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMI 279
            K +     ++    +  +  +D   Y + +  I ++G+ L+I+ + F+ D SG G  +I
Sbjct: 249 FKYV----NLMTYRQSQRMPNLDPLAYTLPMLGIRINGKKLNISTSAFRADPSGAGQTLI 304

Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS---YSWPDKLCYHGIMSSDLKGFPTVRF 336
           DSGT  T+LV EAY  +++E+ ++L G +++    Y     +C+ G      +    + F
Sbjct: 305 DSGTWFTFLVDEAYSKVKEEI-VKLAGPKLKKGYVYGGSLDMCFDGDAMVIGRMIGNMAF 363

Query: 337 HFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
            F  G ++ +E++ M         C+ +  + +     N  +IG   QQ   V +D+  +
Sbjct: 364 EFENGVEIVVEREKMLADVGGGVQCLGIGRSDLLGVASN--IIGNFHQQDLWVEFDLVGR 421

Query: 397 QKTFQRMDCE 406
           +  F R DC 
Sbjct: 422 RVGFGRTDCS 431


>gi|125527257|gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group]
          Length = 484

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 114/438 (26%), Positives = 169/438 (38%), Gaps = 72/438 (16%)

Query: 32  RLAYLQEK--IRSHNTYQAQILPSND---ISNSLFYVNISIGQPPVPQFTAMDTGSSLLW 86
           R+A++  +   R+  T  A  +P +         ++V   +G P  P     DTGS L W
Sbjct: 53  RMAFISSRGRRRAAETASAFAMPLSSGAYTGTGQYFVRFRVGTPAQPFLLVADTGSDLTW 112

Query: 87  VHCY----------------PC-RDCSPQLGTIFYPSRSSSYAYVPCDSEHCRY---FPY 126
           V C+                P     SP+    F P +S ++A +PC S  CR    F  
Sbjct: 113 VKCHRAAAAASASPRNASSLPAPAPASPR--RTFRPDKSRTWAPIPCSSATCRESLPFSL 170

Query: 127 ARCSAYKHRCIYTQLYLIGPET--SVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTN-RN 183
           A C+   + C Y   Y  G     +V V +  +           ++ VV GC  S N ++
Sbjct: 171 AACATPANPCAYDYRYKDGSAARGTVGVDSATIALSGRAARKAKLRGVVLGCTTSYNGQS 230

Query: 184 FKFS-GIFGLGIGRSSLVSQLNSSF----SYCIGSLHDPDYLHNKLILGDGAIID----- 233
           F  S G+  LG    S  S+  S F    SYC+     P    + L  G           
Sbjct: 231 FLASDGVLSLGYSNISFASRAASRFGGRFSYCLVDHLAPRNATSYLTFGPNPAFSSRRPS 290

Query: 234 EGDA--------------------TPLQF---IDGHYYITLEAISVDGRMLDINPNIFKR 270
           EG A                    TPL         Y +T++ +SV G +L I P     
Sbjct: 291 EGIASCKPAPAPTPAPAGAPGARQTPLVLDHRTRPFYAVTVKGVSVAGELLKI-PRAVWD 349

Query: 271 DDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGI--MSSDL 328
            + GGG ++DSGT +T L K AY A+   +  RL G    +   P   CY+      SD+
Sbjct: 350 VEQGGGAILDSGTSLTMLAKPAYRAVVAALSKRLAGLPRVTMD-PFDYCYNWTSPSGSDV 408

Query: 329 KG-FPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFY 387
               P +  HF G A+L     S      P   C+ +        +   S+IG + QQ +
Sbjct: 409 AAPLPMLAVHFAGSARLEPPAKSYVIDAAPGVKCIGLQ----EGPWPGLSVIGNILQQEH 464

Query: 388 NVGYDIGRKQKTFQRMDC 405
              YD+  ++  F+R  C
Sbjct: 465 LWEYDLKNRRLRFKRSRC 482


>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 473

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 97/400 (24%), Positives = 170/400 (42%), Gaps = 46/400 (11%)

Query: 38  EKIRSHNTYQ-AQILPSNDI---------SNSLFYVNISIGQPPVPQFTAMDTGSSLLWV 87
           E  +SH+T + +++L S D+         S  L++  I +G PP      +DTGS +LWV
Sbjct: 41  EHFKSHDTRRHSRMLASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILWV 100

Query: 88  HCYPCRDCSPQLGTIFYPS-----RSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLY 142
           +C PC +C  +    F+ S      SS+   V CD + C +   +        C Y  +Y
Sbjct: 101 NCKPCPECPSKTNLNFHLSLFDVNASSTSKKVGCDDDFCSFISQSDSCQPAVGCSYHIVY 160

Query: 143 LIGPETSVFVSTEQLTFKNTD---ESTIHVQDVVFGCGFSTNRNFKFS-----GIFGLGI 194
                +      ++LT +      ++    Q+VVFGCG   +     S     G+ G G 
Sbjct: 161 ADESTSEGNFIRDKLTLEQVTGDLQTGPLGQEVVFGCGSDQSGQLGKSDSAVDGVMGFGQ 220

Query: 195 GRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFIDGHY 247
             +S++SQL ++      FS+C+      D +    I   G +   +   TP+     HY
Sbjct: 221 SNTSVLSQLAATGDAKRVFSHCL------DNVKGGGIFAVGVVDSPKVKTTPMVPNQMHY 274

Query: 248 YITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGE 307
            + L  + VDG  LD+ P+I +     GG ++DSGT + +  K  Y++L + ++ R   +
Sbjct: 275 NVMLMGMDVDGTALDLPPSIMRN----GGTIVDSGTTLAYFPKVLYDSLIETILAR---Q 327

Query: 308 QMRSYSWPDKL-CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNP 366
            ++ +   D   C+    + D+  FP V F F    KL +      +    + +C     
Sbjct: 328 PVKLHIVEDTFQCFSFSENVDV-AFPPVSFEFEDSVKLTVYPHDYLFTLEKELYCFGWQA 386

Query: 367 ASINN-RYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
             +         L+G +      V YD+  +   +   +C
Sbjct: 387 GGLTTGERTEVILLGDLVLSNKLVVYDLENEVIGWADHNC 426


>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
          Length = 415

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 105/375 (28%), Positives = 160/375 (42%), Gaps = 61/375 (16%)

Query: 54  NDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAY 113
           N +  + + V+++IG PP P    +DTGS L+W  C PC  C  Q    F PS SS+ + 
Sbjct: 82  NGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSL 141

Query: 114 VPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
             CDS  C+  P A                          +++ TF     S   V  V 
Sbjct: 142 TSCDSTLCQGLPVAS----------------------LPRSDKFTFVGAGAS---VPGVA 176

Query: 174 FGCGFSTNRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDP-------DYLHNK 223
           FGCG   N  FK   +GI G G G  SL SQL   +FS+C  ++          D   + 
Sbjct: 177 FGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTTITGAIPSTVLLDLPADL 236

Query: 224 LILGDGAIIDEGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
              G GA+      TPL     +   YY++L+ I+V    L +  + F   +  GG +ID
Sbjct: 237 FSNGQGAV----QTTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGTGGTIID 292

Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG---FPTVRFH 337
           SGT +T L    Y  +RD    +++   +   +     C    +S+ L+     P +  H
Sbjct: 293 SGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFC----LSAPLRAKPYVPKLVLH 348

Query: 338 FRGGAKLALEKDSMFYQPRPDA----FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
           F  GA + L +++  ++   DA     C+A+    I    V  + IG   QQ  +V YD+
Sbjct: 349 FE-GATMDLPRENYVFEVE-DAGSSILCLAI----IEGGEV--TTIGNFQQQNMHVLYDL 400

Query: 394 GRKQKTFQRMDCEVL 408
              + +F    C+ L
Sbjct: 401 QNSKLSFVPAQCDKL 415


>gi|226532674|ref|NP_001151415.1| pepsin A precursor [Zea mays]
 gi|195646632|gb|ACG42784.1| pepsin A [Zea mays]
          Length = 492

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 98/377 (25%), Positives = 158/377 (41%), Gaps = 39/377 (10%)

Query: 57  SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSY 111
           +  L+Y  I IG PP   +  +DTGS +LWV+   C  C  + G     T + P+ S + 
Sbjct: 81  ATGLYYTRIEIGSPPKGYYVQVDTGSDILWVNGISCDGCPTRSGLGIELTQYDPAGSGT- 139

Query: 112 AYVPCDSEHCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD--- 163
             V C+ E C     A      C +    C +   Y  G  T+ F  T+ + +       
Sbjct: 140 -TVGCEQEFCVANSAASGVPPACPSAASPCQFRITYGDGSSTTGFYVTDFVQYNQVSGNG 198

Query: 164 ESTIHVQDVVFGCGFSTNRNFKFS-----GIFGLGIGRSSLVSQLNSS------FSYCIG 212
           ++T     + FGCG     +   S     GI G G   +S++SQL ++      F++C+ 
Sbjct: 199 QTTPSNVSITFGCGAQLGGDLGSSSQALDGILGFGQSDASMLSQLAAARKVRKIFAHCL- 257

Query: 213 SLHDPDYLHNKLILGDGAIIDEG--DATPLQFIDGHYYITLEAISVDGRMLDINPNIFKR 270
                D +    I   G ++       TPL     HY + L+ ISV G  L +  + F  
Sbjct: 258 -----DTVRGGGIFAIGNVVQPPIVKTTPLVPNATHYNVNLQGISVGGATLQLPTSTFDS 312

Query: 271 DDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG 330
            DS  G +IDSGT + +L +E Y  L   V  +     +R+Y   D +C+    S D + 
Sbjct: 313 GDS-KGTIIDSGTTLAYLPREVYRTLLTAVFDKHPDLAVRNYE--DFICFQFSGSLDEE- 368

Query: 331 FPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR-YVNFSLIGMMAQQFYNV 389
           FP + F F G   L +      +Q   D +CM      +  +   +  L+G +      V
Sbjct: 369 FPVITFSFEGDLTLNVYPHDYLFQNGNDLYCMGFLDGGVQTKDGKDMVLLGDLVLSNKLV 428

Query: 390 GYDIGRKQKTFQRMDCE 406
            YD+ ++   +   +C 
Sbjct: 429 VYDLEKQVIGWTDYNCS 445


>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 484

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 97/372 (26%), Positives = 165/372 (44%), Gaps = 38/372 (10%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ--LG---TIFYPSRSSSYAYV 114
           L+Y  I IG P    +  +DTGS ++WV+C  C+ C  +  LG   T++    S S   V
Sbjct: 79  LYYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLV 138

Query: 115 PCDSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD---ESTIH 168
            CD + C      P + C A    C Y ++Y  G  T+ +   + + + +     ++   
Sbjct: 139 SCDDDFCYQISGGPLSGCKA-NMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTA 197

Query: 169 VQDVVFGCG------FSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHD 216
              V+FGCG        ++      GI G G   SS++SQL SS      F++C+     
Sbjct: 198 NGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCL----- 252

Query: 217 PDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
            D  +   I   G ++  + + TPL     HY + + A+ V    L+I  ++F+  D  G
Sbjct: 253 -DGRNGGGIFAIGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLNIPADLFQPGDRKG 311

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
            + IDSGT + +L +  YE L  ++  +    ++      D  C+      D +GFP V 
Sbjct: 312 AI-IDSGTTLAYLPEIIYEPLVKKITSQEPALKVHIVD-KDYKCFQYSGRVD-EGFPNVT 368

Query: 336 FHFRGGAKLAL-EKDSMFYQPRPDAFCMAVNPASINNR-YVNFSLIGMMAQQFYNVGYDI 393
           FHF     L +   D +F  P    +C+    +++ +R   N +L+G +      V YD+
Sbjct: 369 FHFENSVFLRVYPHDYLF--PYEGMWCIGWQNSAMQSRDRRNMTLLGDLVLSNKLVLYDL 426

Query: 394 GRKQKTFQRMDC 405
             +   +   +C
Sbjct: 427 ENQLIGWTEYNC 438


>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
 gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
          Length = 525

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 108/388 (27%), Positives = 168/388 (43%), Gaps = 49/388 (12%)

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
           + +  + +++ +G PP      MDTGS L W+ C PC DC  Q G +F P+ SSSY  V 
Sbjct: 146 VGSGEYLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASSSYRNVT 205

Query: 116 CDSEHCRYFPYARCSAY----------KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDE- 164
           C    C +                   +  C Y   Y     T+  ++ E  T   T   
Sbjct: 206 CGDHRCGHVAPPPEPEASSPRTCRRPGEDPCPYYYWYGDQSNTTGDLALESFTVNLTAPG 265

Query: 165 STIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDY 219
           ++  V  VVFGCG      F   +G+ GLG G  S  SQL +    +FSYC+   H  D 
Sbjct: 266 ASRRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHTFSYCLVD-HGSD- 323

Query: 220 LHNKLILGDGAIIDEGDATP-LQFI------------DGHYYITLEAISVDGRMLDINPN 266
           + +K++ G+        A P L++             D  YY+ L+ + V G +L+I+ +
Sbjct: 324 VGSKVVFGEDDDALALAAHPQLKYTAFAPASSSSSPADTFYYVKLKGVLVGGELLNISSD 383

Query: 267 IFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL-----CY 320
            +    D  GG +IDSGT +++ V+ AY+ +R   M R+     RSY    +      CY
Sbjct: 384 TWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRHAFMDRMS----RSYPLVPEFPVLSPCY 439

Query: 321 HGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDA---FCMAVNPASINNRYVNFS 377
           + +   +    P +   F  GA      ++ F +  PD     C+AV    +       S
Sbjct: 440 N-VSGVERPEVPELSLLFADGAVWDFPAENYFIRLDPDGGSIMCLAV----LGTPRTGMS 494

Query: 378 LIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           +IG   QQ ++V YD+   +  F    C
Sbjct: 495 IIGNFQQQNFHVVYDLQNNRLGFAPRRC 522


>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
 gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
          Length = 464

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 110/429 (25%), Positives = 178/429 (41%), Gaps = 55/429 (12%)

Query: 22  VQRGINISIARLAYLQEKIRSHNTYQAQILPSNDI--SNSLFYVNISIGQPPVPQFTAMD 79
           ++R I  S  RLA +        + +  ++    I  +   + V + IG PP     A+D
Sbjct: 48  LRRAIQRSRYRLAGIGMARGEAASARKAVVAETPIMPAGGEYLVKLGIGTPPYKFTAAID 107

Query: 80  TGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCS-AYKHRCIY 138
           T S L+W  C PC  C  Q+  +F P  SS+YA +PC S+ C      RC       C Y
Sbjct: 108 TASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHDDDESCQY 167

Query: 139 TQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF---KFSGIFGLGIG 195
           T  Y     T   ++ ++L            + V FGC  S+       + SG+ GLG G
Sbjct: 168 TYTYSGNATTEGTLAVDKLVIGED-----AFRGVAFGCSTSSTGGAPPPQASGVVGLGRG 222

Query: 196 RSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGD----ATPLQ---FIDGHY 247
             SLVSQL+   F+YC+        +  KL+LG  A          A P++       +Y
Sbjct: 223 PLSLVSQLSVRRFAYCLPP--PASRIPGKLVLGADADAARNATNRIAVPMRRDPRYPSYY 280

Query: 248 YITLEAISVDGRMLDI---------------------NPN---IFKRDDSGGGVMIDSGT 283
           Y+ L+ + +  R + +                     +PN   +   D +  G++ID  +
Sbjct: 281 YLNLDGLLIGDRTMSLPPTTTTTATATATAPAPAPTPSPNATAVAVGDANRYGMIIDIAS 340

Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCY---HGIMSSDLKGFPTVRFHFRG 340
            +T+L    Y+ L +++ + +   +    S    LC+    G+ + D    P V   F  
Sbjct: 341 TITFLEASLYDELVNDLEVEIRLPRGTGSSLGLDLCFILPDGV-AFDRVYVPAVALAF-D 398

Query: 341 GAKLALEKDSMFYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKT 399
           G  L L+K  +F + R     C+ V  A       + S++G   QQ   V Y++ R + T
Sbjct: 399 GRWLRLDKARLFAEDRESGMMCLMVGRAEAG----SVSILGNFQQQNMQVLYNLRRGRVT 454

Query: 400 FQRMDCEVL 408
           F +  C  L
Sbjct: 455 FVQSPCGAL 463


>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 442

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 95/321 (29%), Positives = 146/321 (45%), Gaps = 41/321 (12%)

Query: 63  VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHC- 121
           ++I++G PP      +DTGS L W+HC      +      F P+ SSSY  + C S  C 
Sbjct: 68  ISITVGTPPQNMSMVIDTGSELSWLHCNTNTTATIPY-PFFNPNISSSYTPISCSSPTCT 126

Query: 122 ---RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC-- 176
              R FP        + C  T  Y     +   ++++   F ++    I     VFGC  
Sbjct: 127 TRTRDFPIPASCDSNNLCHATLSYADASSSEGNLASDTFGFGSSFNPGI-----VFGCMN 181

Query: 177 -GFSTN--RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAII 232
             +STN   +   +G+ G+ +G  SLVSQL    FSYCI            L+LG+    
Sbjct: 182 SSYSTNSESDSNTTGLMGMNLGSLSLVSQLKIPKFSYCISG----SDFSGILLLGESNFS 237

Query: 233 DEGD---------ATPLQFID-GHYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMIDS 281
             G          +TPL + D   Y + LE I +  ++L+I+ N+F  D +G G  M D 
Sbjct: 238 WGGSLNYTPLVQISTPLPYFDRSAYTVRLEGIKISDKLLNISGNLFVPDHTGAGQTMFDL 297

Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD-------KLCYH-GIMSSDLKGFPT 333
           GT  ++L+   Y ALRDE + +  G  +R+   P+        LCY   +  S+L   P+
Sbjct: 298 GTQFSYLLGPVYNALRDEFLNQTNG-TLRALDDPNFVFQIAMDLCYRVPVNQSELPELPS 356

Query: 334 VRFHFRGGAKLALEKDSMFYQ 354
           V   F  GA++ +  D + Y+
Sbjct: 357 VSLVFE-GAEMRVFGDQLLYR 376


>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
 gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
 gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
          Length = 464

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 110/429 (25%), Positives = 178/429 (41%), Gaps = 55/429 (12%)

Query: 22  VQRGINISIARLAYLQEKIRSHNTYQAQILPSNDI--SNSLFYVNISIGQPPVPQFTAMD 79
           ++R I  S  RLA +        + +  ++    I  +   + V + IG PP     A+D
Sbjct: 48  LRRAIQRSRYRLAGIGMARGEAASARKAVVAETPIMPAGGEYLVKLGIGTPPYKFTAAID 107

Query: 80  TGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCS-AYKHRCIY 138
           T S L+W  C PC  C  Q+  +F P  SS+YA +PC S+ C      RC       C Y
Sbjct: 108 TASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHDDDESCQY 167

Query: 139 TQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF---KFSGIFGLGIG 195
           T  Y     T   ++ ++L            + V FGC  S+       + SG+ GLG G
Sbjct: 168 TYTYSGNATTEGTLAVDKLVIGED-----AFRGVAFGCSTSSTGGAPPPQASGVVGLGRG 222

Query: 196 RSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGD----ATPLQ---FIDGHY 247
             SLVSQL+   F+YC+        +  KL+LG  A          A P++       +Y
Sbjct: 223 PLSLVSQLSVRRFAYCLPP--PASRIPGKLVLGADADAARNATNRIAVPMRRDPRYPSYY 280

Query: 248 YITLEAISVDGRMLDI---------------------NPN---IFKRDDSGGGVMIDSGT 283
           Y+ L+ + +  R + +                     +PN   +   D +  G++ID  +
Sbjct: 281 YLNLDGLLIGDRAMSLPPTTTTTATATATAPAPAPTPSPNATAVAVGDANRYGMIIDIAS 340

Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCY---HGIMSSDLKGFPTVRFHFRG 340
            +T+L    Y+ L +++ + +   +    S    LC+    G+ + D    P V   F  
Sbjct: 341 TITFLEASLYDELVNDLEVEIRLPRGTGSSLGLDLCFILPDGV-AFDRVYVPAVALAF-D 398

Query: 341 GAKLALEKDSMFYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKT 399
           G  L L+K  +F + R     C+ V  A       + S++G   QQ   V Y++ R + T
Sbjct: 399 GRWLRLDKARLFAEDRESGMMCLMVGRAEAG----SVSILGNFQQQNMQVLYNLRRGRVT 454

Query: 400 FQRMDCEVL 408
           F +  C  L
Sbjct: 455 FVQSPCGAL 463


>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
           vinifera]
          Length = 560

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 100/400 (25%), Positives = 173/400 (43%), Gaps = 42/400 (10%)

Query: 38  EKIRSHNTYQ-AQILPSNDI---------SNSLFYVNISIGQPPVPQFTAMDTGSSLLWV 87
           + +R+H+T +  +IL + D+            L++  I IG P    +  +DTGS +LWV
Sbjct: 122 DALRAHDTRRHGRILSAVDLPLGGNGHPSEAGLYFAKIGIGTPSKDYYVQVDTGSDILWV 181

Query: 88  HCYPCRDCSPQ--LG---TIFYPSRSSSYAYVPCDSEHCRYF--PYARCSAYKHRCIYTQ 140
           +C  C  C  +  LG   T++    S++   V CD   C  +  P   C     +C+Y+ 
Sbjct: 182 NCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVGCDDNFCSLYDGPLPGCKP-GLQCLYSV 240

Query: 141 LYLIGPETSVFVSTEQLTFKNTD---ESTIHVQDVVFGCGFSTNRNFKFS-----GIFGL 192
           LY  G  T+ +   + + +       ++T     VVFGCG   +     S     GI G 
Sbjct: 241 LYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGF 300

Query: 193 GIGRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFIDG 245
           G   SS++SQL SS      FS+C+      D +    I   G +++ + + TPL     
Sbjct: 301 GQANSSMLSQLASSGKVKKVFSHCL------DNVDGGGIFAIGEVVEPKVNITPLVQNQA 354

Query: 246 HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLE 305
           HY + ++ I V G  LD+  + F+  D   G +IDSGT + +  +E Y  L ++++   +
Sbjct: 355 HYNVVMKEIEVGGDPLDVPSDAFESGDR-KGTIIDSGTTLAYFPQEVYVPLIEKILS--Q 411

Query: 306 GEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVN 365
              +R ++            +   GFPTV  HF     L +      +Q   +      N
Sbjct: 412 QPDLRLHTVEQAFTCFDYTGNVDDGFPTVTLHFDKSISLTVYPHEYLFQHEFEWCIGWQN 471

Query: 366 PASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
             +      + +L+G +      V YD+ ++   +   +C
Sbjct: 472 SGAQTKDGKDLTLLGDLVLSNKLVVYDLEKQGIGWVEYNC 511


>gi|297806153|ref|XP_002870960.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297316797|gb|EFH47219.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 453

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 108/379 (28%), Positives = 164/379 (43%), Gaps = 57/379 (15%)

Query: 70  PPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR-----YF 124
           PP      +DTGS L W+ C   R  +P     F P+RSSSY+ +PC S  CR     + 
Sbjct: 82  PPQNISMVIDTGSELSWLRCN--RSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRDFL 139

Query: 125 PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC-----GFS 179
             A C + K  C  T  Y     +   ++ E   F N+   +    +++FGC     G  
Sbjct: 140 IPASCDSDK-LCHATLSYADASSSEGNLAAEIFHFGNSTNDS----NLIFGCMGSVSGSD 194

Query: 180 TNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHD-PDYLHNKLILGDGAIIDEGD- 236
              + K +G+ G+  G  S +SQ+    FSYCI    D P +    L+LGD         
Sbjct: 195 PEEDTKTTGLLGMNRGSLSFISQMGFPKFSYCISGTDDFPGF----LLLGDSNFTWLTPL 250

Query: 237 --------ATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMIDSGTDVT 286
                   +TPL + D   Y + L  I V+G++L I  ++   D +G G  M+DSGT  T
Sbjct: 251 NYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLLPDHTGAGQTMVDSGTQFT 310

Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPD-------KLCYH----GIMSSDLKGFPTVR 335
           +L+   Y ALR + + +  G  +  Y  P+        LCY      I +  L   PTV 
Sbjct: 311 FLLGPVYTALRSDFLNQTNG-ILTVYEDPEFVFQGTMDLCYRISPFRIRTGILHRLPTVS 369

Query: 336 FHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMA-------QQFYN 388
             F  GA++A+    + Y+        A N +     + N  L+GM A       QQ   
Sbjct: 370 LVFE-GAEIAVSGQPLLYR---VPHLTAGNDSVYCFTFGNSDLMGMEAYVIGHHHQQNMW 425

Query: 389 VGYDIGRKQKTFQRMDCEV 407
           + +D+ R +     + C+V
Sbjct: 426 IEFDLQRSRIGLAPVQCDV 444


>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
 gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
          Length = 460

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 111/432 (25%), Positives = 175/432 (40%), Gaps = 59/432 (13%)

Query: 1   LIHRDSILSPYNNPN----ENPAHRVQRGINISIARLAYLQEKIR---------SHNTYQ 47
           L HR    SP         E+  HR Q        R AY++ K           +    Q
Sbjct: 61  LHHRHGPCSPLPTKKMPSLEDRLHRDQL-------RAAYIKRKFSGDVKKDGQGAGGVEQ 113

Query: 48  AQILPSNDISNSL----FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIF 103
           + +     +  SL    + + + +G P   Q   +D+GS + WV C PC  C  Q+  +F
Sbjct: 114 SHVTVPTTLGTSLNTLEYLITVRLGSPAKTQTVLIDSGSDVSWVQCKPCLQCHSQVDPLF 173

Query: 104 YPSRSSSYAYVPCDSEHCRYFPY--ARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKN 161
            PS SS+Y+   C S  C         CS+   +C Y   Y  G  T+   S++ L   +
Sbjct: 174 DPSLSSTYSPFSCSSAACAQLGQDGNGCSS-SSQCQYIVRYADGSSTTGTYSSDTLALGS 232

Query: 162 TDESTIHVQDVVFGCG-FSTNRNFKFSGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHD 216
                  + +  FGC    +  N    G+ GLG G  SL SQ      ++FSYC+     
Sbjct: 233 NT-----ISNFQFGCSHVESGFNDLTDGLMGLGGGAPSLASQTAGTFGTAFSYCL----- 282

Query: 217 PDYLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDS 273
           P    +   L  GA       TP+     +   Y + LEAI V G  L I  ++F     
Sbjct: 283 PPTPSSSGFLTLGAGTSGFVKTPMLRSSPVPTFYGVRLEAIRVGGTQLSIPTSVFS---- 338

Query: 274 GGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPT 333
             G+++DSGT +T L + AY AL       ++  +          C+     S ++  P+
Sbjct: 339 -AGMVMDSGTIITRLPRTAYSALSSAFKAGMKQYRPAPPRSIMDTCFDFSGQSSVR-LPS 396

Query: 334 VRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
           V   F GGA + L+ + +         C+A    + N+   +  ++G + Q+ + V YD+
Sbjct: 397 VALVFSGGAVVNLDANGIIL-----GNCLAF---AANSDDSSPGIVGNVQQRTFEVLYDV 448

Query: 394 GRKQKTFQRMDC 405
           G     F+   C
Sbjct: 449 GGGAVGFKAGAC 460


>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
           thaliana]
 gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 461

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 107/426 (25%), Positives = 169/426 (39%), Gaps = 39/426 (9%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           L HRD++L         P  R++  I     R + +  K  S    +  +    D   + 
Sbjct: 53  LAHRDTLL-------PKPLSRIEDVIGADQKRHSLISRKRNSTVGVKMDLGSGIDYGTAQ 105

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHC-YPCRDCSPQLGTIFYPSRSSSYAYVPCDSE 119
           ++  I +G P       +DTGS L WV+C Y  R    +   +F    S S+  V C ++
Sbjct: 106 YFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRARGKDNR--RVFRADESKSFKTVGCLTQ 163

Query: 120 HCR-----YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
            C+      F    C      C Y   Y  G       + E +T   T+     +   + 
Sbjct: 164 TCKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRMARLPGHLI 223

Query: 175 GCGFS-TNRNFKFS-GIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILGD 228
           GC  S T ++F+ + G+ GL     S  S   S     FSYC+        + N LI G 
Sbjct: 224 GCSSSFTGQSFQGADGVLGLAFSDFSFTSTATSLYGAKFSYCLVDHLSNKNVSNYLIFGS 283

Query: 229 GAIIDEG--DATPLQF--IDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
                      TPL    I   Y I +  IS+   MLDI   ++    SGGG ++DSGT 
Sbjct: 284 SRSTKTAFRRTTPLDLTRIPPFYAINVIGISLGYDMLDIPSQVWDA-TSGGGTILDSGTS 342

Query: 285 VTWLVKEAYEALRDEVMIRL-EGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAK 343
           +T L   AY+ +   +   L E ++++    P + C+      ++   P + FH +GGA+
Sbjct: 343 LTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCFSFTSGFNVSKLPQLTFHLKGGAR 402

Query: 344 LALEKDSMFYQPRPDAFCM----AVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKT 399
               + S      P   C+    A  PA+        ++IG + QQ Y   +D+     +
Sbjct: 403 FEPHRKSYLVDAAPGVKCLGFVSAGTPAT--------NVIGNIMQQNYLWEFDLMASTLS 454

Query: 400 FQRMDC 405
           F    C
Sbjct: 455 FAPSAC 460


>gi|297605079|ref|NP_001056639.2| Os06g0121800 [Oryza sativa Japonica Group]
 gi|255676668|dbj|BAF18553.2| Os06g0121800 [Oryza sativa Japonica Group]
          Length = 487

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 103/353 (29%), Positives = 147/353 (41%), Gaps = 32/353 (9%)

Query: 66  SIGQPPVPQFTAMDTGSSLLWVHCYPCR--DCSPQLGTIFYPSRSSSYAYVPCDSEHCRY 123
           +I  P + Q  ++DT   L W+ C PC   +C PQ   +F P RS + A VPC S  C  
Sbjct: 154 AIDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGE 213

Query: 124 FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRN 183
                     ++C Y   Y  G  TS     + LT    + ST+ V +  FGC  +   N
Sbjct: 214 LGRYGAGCSNNQCQYFVDYGDGRATSGTYMVDALTL---NPSTV-VMNFRFGCSHAVRGN 269

Query: 184 FKF--SGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDA 237
           F    SG   LG GR SL+SQ      ++FSYC+       +L +     DG        
Sbjct: 270 FSASTSGTMSLGGGRQSLLSQTAATFGNAFSYCVPDPSSSGFL-SLGGPADGGGAGRFAR 328

Query: 238 TPL----QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAY 293
           TPL      I   Y + L  I V GR L++ P +F      GG ++DS   +T L   AY
Sbjct: 329 TPLVRNPSIIPTLYLVRLRGIEVGGRRLNVPPVVFA-----GGAVMDSSVIITQLPPTAY 383

Query: 294 EALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMF 352
            ALR      +    ++         CY  +  + +   P V   F GGA + L+   + 
Sbjct: 384 RALRLAFRSAMAAYPRVAGGRAGLDTCYDFVRFTSVT-VPAVSLVFDGGAVVRLDAMGVM 442

Query: 353 YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
            +      C+A  P   +        IG + QQ + V YD+G     F+R  C
Sbjct: 443 VE-----GCLAFVPTPGD---FALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 487


>gi|449451076|ref|XP_004143288.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 488

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 105/375 (28%), Positives = 173/375 (46%), Gaps = 46/375 (12%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLG---TIFYPSRSSSYAYV 114
           L++  + +G P       +DTGS +LWV C PC  C  S  LG    +F  ++SSS   +
Sbjct: 83  LYFTKVKLGNPAREFNVQIDTGSDILWVTCSPCDGCPDSSGLGIELNLFDTTKSSSARVL 142

Query: 115 PCDSEHCRYFPYA--RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKN-TDESTI--HV 169
           PC    C        +C      C Y+  Y     TS F  T+ + F     ESTI    
Sbjct: 143 PCTDPICAAVSTTTDQCLTQTDHCSYSFHYRDRSGTSGFYVTDSMHFDILLGESTIANSS 202

Query: 170 QDVVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPD 218
             +VFGC     G  T       GIFG G G  S++SQL+S       FS+C+    +  
Sbjct: 203 ATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHCLKGGENGG 262

Query: 219 YLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
            +   L+LG+  I++     +PL     HY + L++I++ G++   NP +F   ++ G  
Sbjct: 263 GI---LVLGE--ILEPSIVYSPLIPSQPHYTLKLQSIALSGQLFP-NPTMFPISNA-GET 315

Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMS-SDLKGFPTVRF 336
           +IDSGT + +LV+E Y+ +   +   +      + S   + C+   MS +D+  FP +RF
Sbjct: 316 IIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQ-CFRVSMSVADI--FPVLRF 372

Query: 337 HFRGGAKLA------LEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
           +F G A +       L+ DS+  +P    +C+    A         +++G +  +   + 
Sbjct: 373 NFEGIASMVVTPEEYLQFDSIVREPA--LWCIGFQKAE-----DGLNILGDLVLKDKIIV 425

Query: 391 YDIGRKQKTFQRMDC 405
           YD+ R++  +   DC
Sbjct: 426 YDLARQRIGWANYDC 440


>gi|357155293|ref|XP_003577072.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
           At2g35615-like [Brachypodium distachyon]
          Length = 429

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 103/394 (26%), Positives = 175/394 (44%), Gaps = 66/394 (16%)

Query: 50  ILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC----YPCRDCSPQLGTIFYP 105
           ++ +++I    F+++IS+G PPV     +DTGS+L WV C      C   +P+ G++F P
Sbjct: 64  VVGNHEIHEGKFFMDISLGTPPVANLVTVDTGSTLSWVVCQRCQISCHTTAPEAGSVFDP 123

Query: 106 SRSSSYAYVPCDSEHCR------YFPYARCSAYKHRCIYTQLYLIGPE---TSVFVSTEQ 156
            +S++Y  V C S  C         P+  C      C+Y+  Y  GP    ++  + T++
Sbjct: 124 DKSTTYELVGCSSRDCADVQRSLVAPFG-CIEETDTCLYSLRYGSGPSGQYSAGRLGTDK 182

Query: 157 LTFKNTDESTIHVQDVVFGCGFSTNRNFKF--SGIFGLGIGRSSLVSQLN-----SSFSY 209
           LT  +   S+  +   +FGC  S + +FK   SG+ G G    S  +Q+       +FSY
Sbjct: 183 LTLAS---SSSIIDGFIFGC--SGDDSFKGYESGVIGFGGANFSFFNQVARQTNYRAFSY 237

Query: 210 C------------IGSLHDPDYLHNKLI--LGDGAIIDEGDATPLQFIDGHYYITLEAIS 255
           C            IG+    + ++  LI   GD ++        LQ ID         + 
Sbjct: 238 CFPGDHTAEGFLSIGAYPKDELVYTNLIPHFGDRSVYS------LQQID---------MM 282

Query: 256 VDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWP 315
           VDG  L ++ + + +      +++DSGT  T+L+   ++A    +   ++ +   S +  
Sbjct: 283 VDGNRLQVDQSEYTKR----MMVVDSGTVDTFLLGPVFDAFSKAMASAMQAKGFLSDTVG 338

Query: 316 DKLCY--HGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQ--PRPDAFCMAVNPASINN 371
            + C+  +G  S D    PTV   F  G  L L  +++F+   P  D  C+A  P     
Sbjct: 339 TETCFRPNGGDSVDSGDLPTVEMRFI-GTTLKLPPENVFHDLLPSHDKICLAFKPDVAGV 397

Query: 372 RYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           R  N  ++G  A   + V YD+      FQ   C
Sbjct: 398 R--NVQILGNKATXSFRVVYDLQAMYFGFQAGAC 429


>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
 gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
          Length = 434

 Score =  105 bits (262), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 98/369 (26%), Positives = 156/369 (42%), Gaps = 42/369 (11%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYP------SRSSSYAY 113
           L++  + +G PP      +DTGS LLWV+C+PC  C P    +  P        S+S + 
Sbjct: 35  LYFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPCIGC-PAFSDLKIPIVPYDVKASASSSK 93

Query: 114 VPCDSEHCRYFPYARCSAY--KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
           VPC    C        S    +++C Y+  Y  G  T  ++  + L +     +T     
Sbjct: 94  VPCSDPSCTLITQISESGCNDQNQCGYSFQYGDGSGTLGYLVEDVLHYMVNATAT----- 148

Query: 172 VVFGCGFSTNRNFKFS-----GIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYL 220
           V+FGCGF  + +   S     GI G G    S  SQL         F++C   L   +  
Sbjct: 149 VIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHC---LDGGERG 205

Query: 221 HNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
              L+LG+  I  +   TPL     HY + L++ISV+   L I+P +F  +D   G + D
Sbjct: 206 GGILVLGN-VIEPDIQYTPLVPYMSHYNVVLQSISVNNANLTIDPKLFS-NDVMQGTIFD 263

Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRG 340
           SGT + +L  EAY+A    V + +          P  LC   +     K FP V  +F G
Sbjct: 264 SGTTLAYLPDEAYQAFTQAVSLVVA---------PFLLCDTRLSRFIYKLFPNVVLYFEG 314

Query: 341 GAKLALEKDSMFYQ---PRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
            +      + +  Q        +CM           + +++ G +  +   V YD+ R +
Sbjct: 315 ASMTLTPAEYLIRQASAANAPIWCMGWQSMGSAESELQYTIFGDLVLKNKLVVYDLERGR 374

Query: 398 KTFQRMDCE 406
             ++  DC+
Sbjct: 375 IGWRPFDCK 383


>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
 gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
          Length = 388

 Score =  105 bits (261), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 99/371 (26%), Positives = 157/371 (42%), Gaps = 42/371 (11%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYP------SRSSSYAY 113
           L++  + +G PP      +DTGS LLWV+C+PC  C P    +  P        S+S + 
Sbjct: 35  LYFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPCIGC-PAFSDLKIPIVPYDVKASASSSK 93

Query: 114 VPCDSEHCRYFPYARCSAY--KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
           VPC    C        S    +++C Y+  Y  G  T  ++  + L +     +T     
Sbjct: 94  VPCSDPSCTLITQISESGCNDQNQCGYSFQYGDGSGTLGYLVEDVLHYMVNATAT----- 148

Query: 172 VVFGCGFSTNRNFKFS-----GIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYL 220
           V+FGCGF  + +   S     GI G G    S  SQL         F++C   L   +  
Sbjct: 149 VIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHC---LDGGERG 205

Query: 221 HNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
              L+LG+  I  +   TPL     HY + L++ISV+   L I+P +F  +D   G + D
Sbjct: 206 GGILVLGN-VIEPDIQYTPLVPYMYHYNVVLQSISVNNANLTIDPKLFS-NDVMQGTIFD 263

Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRG 340
           SGT + +L  EAY+A    V + +          P  LC   +     K FP V  +F G
Sbjct: 264 SGTTLAYLPDEAYQAFTQAVSLVVA---------PFLLCDTRLSRFIYKLFPNVVLYFEG 314

Query: 341 GAKLALEKDSMFYQ---PRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
            +      + +  Q        +CM           + +++ G +  +   V YD+ R +
Sbjct: 315 ASMTLTPAEYLIRQASAANAPIWCMGWQSMGSAESELQYTIFGDLVLKNKLVVYDLERGR 374

Query: 398 KTFQRMDCEVL 408
             ++  DC+ L
Sbjct: 375 IGWRPFDCKFL 385


>gi|356498711|ref|XP_003518193.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 466

 Score =  105 bits (261), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 108/450 (24%), Positives = 195/450 (43%), Gaps = 60/450 (13%)

Query: 2   IHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLF 61
           +H   + + + + + +P H ++  ++ SI R  +L    ++H   ++   P +  +   +
Sbjct: 31  LHLSPLFTNHPSSSSHPFHTLKLAVSTSITRAHHL----KNHKPNKSLETPVHPKTYGGY 86

Query: 62  YVNISIGQPPVPQFTAMDTGSSLLWVHC---YPCRDCSPQLGT-IFYPSRSSSYAYVPCD 117
            +++  G P       +DTGS+L+W+ C   Y C  C+    T  F P  SSS  +V C 
Sbjct: 87  SIDLEFGTPSQTFPFVLDTGSTLVWLPCSSHYLCSKCNSFSNTPKFIPKNSSSSKFVGCT 146

Query: 118 SEHCRYF---------------PYARCSAYKHRC-IYTQLYLIGPETSVFVSTEQLTFKN 161
           +  C +                 +  CS     C  YT  Y +G  T+ F+ +E L F  
Sbjct: 147 NPKCAWVFGPDVKSHCCRQDKAAFNNCS---QTCPAYTVQYGLG-STAGFLLSENLNFP- 201

Query: 162 TDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGS--LHDPD 218
               T    D + GC  S    ++ +GI G G G  SL SQ+N + FSYC+ S    D  
Sbjct: 202 ----TKKYSDFLLGC--SVVSVYQPAGIAGFGRGEESLPSQMNLTRFSYCLLSHQFDDSA 255

Query: 219 YLHNKLILGDGAIIDEGDATPLQF--------------IDGHYYITLEAISVDGRMLDIN 264
            + + L+L + A   +G    + +                 +YYITL+ I V  + + + 
Sbjct: 256 TITSNLVL-ETASSRDGKTNGVSYTPFLKNPTTKKNPAFGAYYYITLKRIVVGEKRVRVP 314

Query: 265 PNIFKRD-DSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL--CYH 321
             + + + D  GG ++DSG+  T++ +  ++ +  E   ++   + R       L  C+ 
Sbjct: 315 RRLLEPNVDGDGGFIVDSGSTFTFMERPIFDLVAQEFAKQVSYTRAREAEKQFGLSPCFV 374

Query: 322 GIMSSDLKGFPTVRFHFRGGAKLALEKDSMF-YQPRPDAFCMAV---NPASINNRYVNFS 377
               ++   FP +RF FRGGAK+ L   + F    + D  C+ +   + A          
Sbjct: 375 LAGGAETASFPELRFEFRGGAKMRLPVANYFSLVGKGDVACLTIVSDDVAGSGGTVGPAV 434

Query: 378 LIGMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
           ++G   QQ + V YD+  ++  F+   C+ 
Sbjct: 435 ILGNYQQQNFYVEYDLENERFGFRSQSCQT 464


>gi|18409620|ref|NP_566966.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|13430562|gb|AAK25903.1|AF360193_1 unknown protein [Arabidopsis thaliana]
 gi|4886277|emb|CAB43423.1| putative protein [Arabidopsis thaliana]
 gi|14532764|gb|AAK64083.1| unknown protein [Arabidopsis thaliana]
 gi|15450892|gb|AAK96717.1| Unknown protein [Arabidopsis thaliana]
 gi|30387567|gb|AAP31949.1| At3g52500 [Arabidopsis thaliana]
 gi|332645431|gb|AEE78952.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 469

 Score =  105 bits (261), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 125/464 (26%), Positives = 197/464 (42%), Gaps = 85/464 (18%)

Query: 8   LSPYNNPNENPAH---RVQRGINISIARLAYL---------QEKIRSHNTYQAQIL--PS 53
           LSP+++ +++P      ++R    SIAR   L         ++ + S  T  A ++  P 
Sbjct: 23  LSPFSHSDQSPKDPYLSLRRLAESSIARAHKLKHGTSIKPDEDALSSTTTASATVVKSPL 82

Query: 54  NDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC---YPCRDC-----SPQLGTIFYP 105
           +  S   + V++S G P        DTGSSL+W+ C   Y C  C      P L   F P
Sbjct: 83  SAKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVWLPCTSRYLCSGCDFSGLDPTLIPRFIP 142

Query: 106 SRSSSYAYVPCDSEHCRYF--PYARCSA---YKHRCI-----YTQLYLIGPETSVFVSTE 155
             SSS   + C S  C++   P  +C         C      Y   Y +G    V + TE
Sbjct: 143 KNSSSSKIIGCQSPKCQFLYGPNVQCRGCDPNTRNCTVGCPPYILQYGLGSTAGVLI-TE 201

Query: 156 QLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSL 214
           +L F +     + V D V GC   + R  + +GI G G G  SL SQ+N   FS+C+ S 
Sbjct: 202 KLDFPD-----LTVPDFVVGCSIISTR--QPAGIAGFGRGPVSLPSQMNLKRFSHCLVSR 254

Query: 215 H-DPDYLHNKLILGDGAIIDEGDATP---------------LQFIDGHYYITLEAISVDG 258
             D   +   L L  G+  + G  TP                 F++ +YY+ L  I V  
Sbjct: 255 RFDDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLE-YYYLNLRRIYVGR 313

Query: 259 RMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK 317
           + + I         +G GG ++DSG+  T++ +  +E + +E        QM +Y+    
Sbjct: 314 KHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEF-----ASQMSNYTREKD 368

Query: 318 L--------CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMF-YQPRPDAFCMAVNPAS 368
           L        C++     D+   P + F F+GGAKL L   + F +    D  C+ V    
Sbjct: 369 LEKETGLGPCFNISGKGDVT-VPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTV---- 423

Query: 369 INNRYVNFS-------LIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           ++++ VN S       ++G   QQ Y V YD+   +  F +  C
Sbjct: 424 VSDKTVNPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKC 467


>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
          Length = 439

 Score =  105 bits (261), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 106/422 (25%), Positives = 165/422 (39%), Gaps = 31/422 (7%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           L HRD++L         P  R++  I     R + +  K  S    +  +    D   + 
Sbjct: 31  LAHRDTLL-------PKPLSRIEDVIGADQKRHSLISRKRNSTVGVKMDLGSGIDYGTAQ 83

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHC-YPCRDCSPQLGTIFYPSRSSSYAYVPCDSE 119
           ++  I +G P       +DTGS L WV+C Y  R    +   +F    S S+  V C ++
Sbjct: 84  YFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRARGKDNR--RVFRADESKSFKTVGCLTQ 141

Query: 120 HCR-----YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
            C+      F    C      C Y   Y  G       + E +T   T+     +   + 
Sbjct: 142 TCKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRMARLPGHLI 201

Query: 175 GCGFS-TNRNFKFS-GIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILGD 228
           GC  S T ++F+ + G+ GL     S  S   S     FSYC+        + N LI G 
Sbjct: 202 GCSSSFTGQSFQGADGVLGLAFSDFSFTSTATSLYGAKFSYCLVDHLSNKNVSNYLIFGS 261

Query: 229 GAIIDEG--DATPLQF--IDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
                      TPL    I   Y I +  IS+   MLDI   ++    SGGG ++DSGT 
Sbjct: 262 SRSTKTAFRRTTPLDLTRIPPFYAINVIGISLGYDMLDIPSQVWDA-TSGGGTILDSGTS 320

Query: 285 VTWLVKEAYEALRDEVMIRL-EGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAK 343
           +T L   AY+ +   +   L E ++++    P + C+      ++   P + FH +GGA+
Sbjct: 321 LTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCFSFTSGFNVSKLPQLTFHLKGGAR 380

Query: 344 LALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
               + S      P   C+    A      V    IG + QQ Y   +D+     +F   
Sbjct: 381 FEPHRKSYLVDAAPGVKCLGFVSAGTPATNV----IGNIMQQNYLWEFDLMASTLSFAPS 436

Query: 404 DC 405
            C
Sbjct: 437 AC 438


>gi|77553049|gb|ABA95845.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 372

 Score =  105 bits (261), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 99/378 (26%), Positives = 169/378 (44%), Gaps = 37/378 (9%)

Query: 48  AQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR----DCSPQLGTIF 103
           + ++  + +  + +++ IS+G PPV     +DTGS+L WV C  C+    D + + G IF
Sbjct: 12  STVIGDDSMRKNKYFMGISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIF 71

Query: 104 YPSRSSSYAYVPCDSEHCR------YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQL 157
            P  SS+Y+ V C +E C          Y  C      CIY+  Y  G  +  ++  ++L
Sbjct: 72  NPYNSSTYSKVGCSTEACNGMHMDLAVEYG-CVEEDDTCIYSLRYGSGEYSVGYLGKDRL 130

Query: 158 TFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQL-----NSSFSYCIG 212
           T      S   + + +FGCG     N   +GI G G    S  +Q+      ++FSYC  
Sbjct: 131 TLA----SNRSIDNFIFGCGEDNLYNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCFP 186

Query: 213 SLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH--YYITLEAISVDGRMLDINPNIFKR 270
             H+ +     L +G  A       T L + D    Y I    + V+G  L+I+P I+  
Sbjct: 187 RDHENE---GSLTIGPYARDINLMWTKLIYYDHKPAYAIQQLDMMVNGIRLEIDPYIYIS 243

Query: 271 DDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD-KLCY-HGIMSSDL 328
             +    ++DSGT  T+++   ++AL D+ M +    +  +  W + ++C+     S++ 
Sbjct: 244 KMT----IVDSGTADTYILSPVFDAL-DKAMTKEMQAKGYTRGWDERRICFISNSGSANW 298

Query: 329 KGFPTVRFHF-RGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFY 387
             FPTV     R   KL +E  + FY+   +  C    P     R V   ++G  A + +
Sbjct: 299 NDFPTVEMKLIRSTLKLPVE--NAFYESSNNVICSTFLPDDAGVRGV--QMLGNRAVRSF 354

Query: 388 NVGYDIGRKQKTFQRMDC 405
            + +DI      F+   C
Sbjct: 355 KLVFDIQAMNFGFKARAC 372


>gi|218197468|gb|EEC79895.1| hypothetical protein OsI_21423 [Oryza sativa Indica Group]
          Length = 471

 Score =  105 bits (261), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 103/353 (29%), Positives = 147/353 (41%), Gaps = 32/353 (9%)

Query: 66  SIGQPPVPQFTAMDTGSSLLWVHCYPCR--DCSPQLGTIFYPSRSSSYAYVPCDSEHCRY 123
           +I  P + Q  ++DT   L W+ C PC   +C PQ   +F P RS + A VPC S  C  
Sbjct: 138 AIDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGE 197

Query: 124 FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRN 183
                     ++C Y   Y  G  TS     + LT    + ST+ V +  FGC  +   N
Sbjct: 198 LGRYGAGCSNNQCQYFVDYGDGRATSGTYMVDALTL---NPSTV-VMNFRFGCSHAVRGN 253

Query: 184 FKF--SGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDA 237
           F    SG   LG GR SL+SQ      ++FSYC+       +L +     DG        
Sbjct: 254 FSASTSGTMSLGGGRQSLLSQTAATFGNAFSYCVPDPSSSGFL-SLGGPADGGGAGRFAR 312

Query: 238 TPL----QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAY 293
           TPL      I   Y + L  I V GR L++ P +F      GG ++DS   +T L   AY
Sbjct: 313 TPLVRNPSIIPTLYLVRLRGIEVGGRRLNVPPVVFA-----GGAVMDSSVIITQLPPTAY 367

Query: 294 EALRDEVMIRLEG-EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMF 352
            ALR      +    ++         CY  +  + +   P V   F GGA + L+   + 
Sbjct: 368 RALRLAFRSAMAAYPRVAGGRAGLDTCYDFVRFTSVT-VPAVSLVFDGGAVVRLDAMGVM 426

Query: 353 YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
            +      C+A  P   +        IG + QQ + V YD+G     F+R  C
Sbjct: 427 VE-----GCLAFVPTPGD---FALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 471


>gi|218187618|gb|EEC70045.1| hypothetical protein OsI_00635 [Oryza sativa Indica Group]
          Length = 570

 Score =  104 bits (260), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 104/361 (28%), Positives = 162/361 (44%), Gaps = 51/361 (14%)

Query: 53  SNDISNSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG---TIFYPSRS 108
           S  +S S  Y+  +++G PP       DTGS L+WV C    + +       T F PSRS
Sbjct: 92  SKVVSRSFEYLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDPSRS 151

Query: 109 SSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES--- 165
           S+Y  V C ++ C     A C    + C Y   Y  G  T+  +STE  TF +       
Sbjct: 152 STYGRVSCQTDACEALGRATCDDGSN-CAYLYAYGDGSNTTGVLSTETFTFDDGGAGRSP 210

Query: 166 -TIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPD 218
             + +  V FGC  +T  +F   G+ GLG G  SLV+QL  +      FSYC+     P 
Sbjct: 211 RQVRIGGVKFGCSTATAGSFPADGLVGLGGGAVSLVTQLGGATSLGRRFSYCL----VPH 266

Query: 219 YLHNKLILGDGAIIDEGD----ATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG 274
            ++    L  GA+ D  +    +TPL    G+  +   A S                   
Sbjct: 267 SVNASSALNFGALADVTEPGAASTPLV---GNKTVASAASSR------------------ 305

Query: 275 GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYH--GIMSSDLKGFP 332
             +++DSGT +T+L       + DE+  R+    ++S     +LCY+  G      +  P
Sbjct: 306 --IIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQSPDGLLQLCYNVAGREVEAGESIP 363

Query: 333 TVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
            +   F GGA +AL+ ++ F   +    C+A+  A+   + V  S++G +AQQ  +VGYD
Sbjct: 364 DLTLEFGGGAAVALKPENAFVAVQEGTLCLAIV-ATTEQQPV--SILGNLAQQNIHVGYD 420

Query: 393 I 393
           +
Sbjct: 421 L 421



 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 39/131 (29%), Positives = 67/131 (51%), Gaps = 5/131 (3%)

Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYH--GIMSSDLKGFPTV 334
           +++DSGT +T+L       + DE+  R+    ++S     +LCY+  G      +  P +
Sbjct: 439 IIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQSPDGLLQLCYNVAGREVEAGESIPDL 498

Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
              F GGA +AL+ ++ F   +    C+A+  A+   + V  S++G +AQQ  +VGYD+ 
Sbjct: 499 TLEFGGGAAVALKPENAFVAVQEGTLCLAIV-ATTEQQPV--SILGNLAQQNIHVGYDLD 555

Query: 395 RKQKTFQRMDC 405
               TF   DC
Sbjct: 556 AGTVTFAVADC 566


>gi|21717169|gb|AAM76362.1|AC074196_20 putative nucleoid DNA binding protein, 3'-partial [Oryza sativa
           Japonica Group]
          Length = 377

 Score =  104 bits (260), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 88/333 (26%), Positives = 140/333 (42%), Gaps = 40/333 (12%)

Query: 57  SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPC 116
           S  L+  N +IG PP P    +D    L+W  C PC+ C  Q   +F P++SS++  +PC
Sbjct: 53  SQGLYVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPC 112

Query: 117 DSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
            S  C   P +  +     CIY      G +T     T+        E+      + FGC
Sbjct: 113 GSHLCESIPESSRNCTSDVCIYEAPTKAG-DTGGKAGTDTFAIGAAKET------LGFGC 165

Query: 177 GFSTNRNFKF----SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAI 231
              T++  K     SGI GLG    SLV+Q+N ++FSYC+            L LG  A 
Sbjct: 166 VVMTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVTAFSYCLAGKS-----SGALFLGATAK 220

Query: 232 -IDEGDATPLQFI------------DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVM 278
            +  G  +   F+            + +Y + L  I   G  L           SG  V+
Sbjct: 221 QLAGGKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKTGGAPLQ------AASSSGSTVL 274

Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHF 338
           +D+ +  ++L   AY+AL+  +   +  + + S   P  LC+   ++ D    P + F F
Sbjct: 275 LDTVSRASYLADGAYKALKKALTAAVGVQPVASPPKPYDLCFPKAVAGDA---PELVFTF 331

Query: 339 RGGAKLALEKDSMFYQPRPDAFCMAV-NPASIN 370
            GGA L +   +          C+ + + AS+N
Sbjct: 332 DGGAALTVPPANYLLASGNGTVCLTIGSSASLN 364


>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 496

 Score =  104 bits (260), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 98/351 (27%), Positives = 152/351 (43%), Gaps = 39/351 (11%)

Query: 78  MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYF-PYARCSAY---- 132
           +DTGS L WV C PCR C  Q   +F PS SSS+  +PC+S  C    P A  S      
Sbjct: 160 VDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNK 219

Query: 133 -KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF-KFSGIF 190
               C Y   Y  G  +   +  E+LT   T+     + + +FGCG +    F   SG+ 
Sbjct: 220 NSTSCDYQIDYGDGSYSRGELGFEKLTLGKTE-----IDNFIFGCGRNNKGLFGGASGLM 274

Query: 191 GLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQF---- 242
           GL     SLVSQ      S FSYC+ +          L LG     +  + +P+ +    
Sbjct: 275 GLARSELSLVSQTSSLFGSVFSYCLPTTGVGS--SGSLTLGGADFSNFKNISPISYTRMI 332

Query: 243 ----IDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV--MIDSGTDVTWLVKEAYEAL 296
               +   Y++ L  IS+ G  L++      R  S  GV  ++DSGT +T L    Y+A 
Sbjct: 333 QNPQMSNFYFLNLTGISIGGVNLNV-----PRLSSNEGVLSLLDSGTVITRLSPSIYKAF 387

Query: 297 RDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPR 356
           + E   +  G +          C++ +   +    PTV+F F G A++ ++ + +FY  +
Sbjct: 388 KAEFEKQFSGYRTTPGFSILNTCFN-LTGYEEVNIPTVKFIFEGNAEMIVDVEGVFYFVK 446

Query: 357 PDA--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
            DA   C+A       ++ +   +IG   Q+   V Y+    +  F    C
Sbjct: 447 SDASQICLAFASLGYEDQTM---IIGNYQQKNQRVIYNSKESKVGFAGEPC 494


>gi|357160409|ref|XP_003578755.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 373

 Score =  104 bits (260), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 99/391 (25%), Positives = 169/391 (43%), Gaps = 44/391 (11%)

Query: 40  IRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQ 98
           +++ N   + ++  + I  + F++ IS+G P V     +DTGS++ WV C  C   C  Q
Sbjct: 2   VQAANIPDSAVIGDDSIRKNQFFMGISLGTPAVFNLVTIDTGSTISWVQCQYCIVHCYTQ 61

Query: 99  ---LGTIFYPSRSSSYAYVPCDSEHCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSV 150
               G  F  S SS+Y  V C ++ C     ++     C   +  CIY+  Y  G  ++ 
Sbjct: 62  DQRAGPTFNTSSSSTYRRVGCSAQVCHDMHVSQNIPSGCVEEEDSCIYSLRYASGEYSAG 121

Query: 151 FVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-----S 205
           ++S ++LT  N    +  +Q  +FGCG     N   +GI G G    S  +Q+      S
Sbjct: 122 YLSQDRLTLAN----SYSIQKFIFGCGSDNRYNGHSAGIIGFGNKSYSFFNQIAQLTNYS 177

Query: 206 SFSYCIGSLHDPDYL---------HNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISV 256
           +FSYC  S  + +            NKLIL    + D G   P+      Y +    + V
Sbjct: 178 AFSYCFPSNQENEGFLSIGPYVRDSNKLILTQ--LFDYGAHLPV------YALQQFDMMV 229

Query: 257 DGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD 316
           +G  L ++P ++    +    ++DSGT  T+++   + AL   +   +  E     S   
Sbjct: 230 NGMRLQVDPPVYTTRMT----VVDSGTVETFVLSPVFRALDRALTKAMVAEGYVRGSDSK 285

Query: 317 KLCYHGIMSS-DLKGFPTVRFHFRGGAKLALEKDSMFYQPRPD-AFCMAVNPASINNRYV 374
           ++C+H    S D    P V   F   + L L  +++FY    D + C    P       V
Sbjct: 286 EICFHSNGDSVDWSKLPVVEIKF-SRSILKLPAENVFYYETSDGSICSTFQPDDAGVPGV 344

Query: 375 NFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
              ++G  A + + V +DI ++   F+   C
Sbjct: 345 Q--ILGNRATRSFRVVFDIQQRNFGFEAGAC 373


>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 417

 Score =  104 bits (260), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 91/309 (29%), Positives = 138/309 (44%), Gaps = 36/309 (11%)

Query: 78  MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYF-PYARCSAY---- 132
           +DTGS L WV C PCR C  Q   +F PS SSS+  +PC+S  C    P A  S      
Sbjct: 81  VDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNK 140

Query: 133 -KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF-KFSGIF 190
               C Y   Y  G  +   +  E+LT   T+     + + +FGCG +    F   SG+ 
Sbjct: 141 NSTSCDYQIDYGDGSYSRGELGFEKLTLGKTE-----IDNFIFGCGRNNKGLFGGASGLM 195

Query: 191 GLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQF---- 242
           GL     SLVSQ      S FSYC+ +          L LG     +  + +P+ +    
Sbjct: 196 GLARSELSLVSQTSSLFGSVFSYCLPTTGVGS--SGSLTLGGADFSNFKNISPISYTRMI 253

Query: 243 ----IDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV--MIDSGTDVTWLVKEAYEAL 296
               +   Y++ L  IS+ G  L++      R  S  GV  ++DSGT +T L    Y+A 
Sbjct: 254 QNPQMSNFYFLNLTGISIGGVNLNV-----PRLSSNEGVLSLLDSGTVITRLSPSIYKAF 308

Query: 297 RDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPR 356
           + E   +  G +          C++ +   +    PTV+F F G A++ ++ + +FY  +
Sbjct: 309 KAEFEKQFSGYRTTPGFSILNTCFN-LTGYEEVNIPTVKFIFEGNAEMIVDVEGVFYFVK 367

Query: 357 PDA--FCMA 363
            DA   C+A
Sbjct: 368 SDASQICLA 376


>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 683

 Score =  104 bits (259), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 101/377 (26%), Positives = 163/377 (43%), Gaps = 54/377 (14%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
           N  +   + IG PP      +DTGS++ +V C  C  C       F P  SS+Y  V C 
Sbjct: 78  NGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPDLSSTYQPVKCT 137

Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
            +         C   + +C+Y + Y     +S  +  + ++F N  +S +  Q  VFGC 
Sbjct: 138 LD-------CNCDNDRMQCVYERQYAEMSTSSGVLGEDVVSFGN--QSELAPQRAVFGCE 188

Query: 178 FSTNRNF---KFSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLILGD 228
                +       GI GLG G  S++ QL      + SFS C G +           +G 
Sbjct: 189 NVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMD----------VGG 238

Query: 229 GAIIDEGDATPLQFI--------DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
           GA++  G + P   +          +Y I L+ I V G+ L +NP++F   D   G ++D
Sbjct: 239 GAMVLGGISPPSDMVFAQSDPVRSPYYNIDLKEIHVAGKRLPLNPSVF---DGKHGSVLD 295

Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD----KLCYHGI---MSSDLKGFPT 333
           SGT   +L +EA+ A ++ ++  L  +     S PD     LC+ G    +S   K FP 
Sbjct: 296 SGTTYAYLPEEAFLAFKEAIVKEL--QSFSQISGPDPNYNDLCFSGAGIDVSQLSKTFPV 353

Query: 334 VRFHFRGGAKLALEKDS-MFYQPR-PDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
           V   F  G K +L  ++ MF   +   A+C+ +      N     +L+G +  +   V Y
Sbjct: 354 VDMIFGNGHKYSLSPENYMFRHSKVRGAYCLGI----FQNGKDPTTLLGGIVVRNTLVLY 409

Query: 392 DIGRKQKTFQRMDCEVL 408
           D  + +  F + +C  L
Sbjct: 410 DREQTKIGFWKTNCAEL 426


>gi|356537161|ref|XP_003537098.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 601

 Score =  104 bits (259), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 110/445 (24%), Positives = 188/445 (42%), Gaps = 74/445 (16%)

Query: 15  NENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQ 74
           N +P H +Q  ++ SI R  +L    ++HN   +     +  +   + +++  G PP   
Sbjct: 174 NSHPFHTLQLAVSTSITRAHHL----KNHNNPSSLKTLVHPKTYGGYSIDLKFGTPPQTF 229

Query: 75  FTAMDTGSSLLWVHCYP---CRDC---SPQLGTIFYPSRSSSYAYVPCDSEHCRYF---- 124
              +DTGSSL+W+ CY    C  C   S      F P  S S  +V C +  C +     
Sbjct: 230 PFVLDTGSSLVWLPCYSHYLCSKCNSFSNNNTPKFIPKDSFSSKFVGCRNPKCAWVFGSD 289

Query: 125 -----------PYARCSAYKHRC-IYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
                       ++  +     C  YT  Y +G  T+ F+ +E L F   +     V D 
Sbjct: 290 VTSHCCKLAKAAFSNNNNCSQTCPAYTVQYGLG-STAGFLLSENLNFPAKN-----VSDF 343

Query: 173 VFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAI 231
           + GC  S    ++  GI G G G  SL +Q+N + FSYC+ S    +   N  ++ +   
Sbjct: 344 LVGC--SVVSVYQPGGIAGFGRGEESLPAQMNLTRFSYCLLSHQFDESPENSDLVMEATN 401

Query: 232 IDEGD--------------ATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GG 276
             EG               +T       +YYITL  I V  + + +   + + D +G GG
Sbjct: 402 SGEGKKTNGVSYTAFLKNPSTKKPAFGAYYYITLRKIVVGEKRVRVPRRMLEPDVNGDGG 461

Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL--CYHGIMSSDLKGFPTV 334
            ++DSG+ +T++ +  ++ + +E + ++   + R       L  C+     ++   FP +
Sbjct: 462 FIVDSGSTLTFMERPIFDLVAEEFVKQVNYTRARELEKQFGLSPCFVLAGGAETASFPEM 521

Query: 335 RFHFRGGAKLALEKDSMFYQ-PRPDAFCM------------AVNPASINNRYVNFSLIGM 381
           RF FRGGAK+ L   + F +  + D  C+            AV PA I         +G 
Sbjct: 522 RFEFRGGAKMRLPVANYFSRVGKGDVACLTIVSDDVAGQGGAVGPAVI---------LGN 572

Query: 382 MAQQFYNVGYDIGRKQKTFQRMDCE 406
             QQ + V  D+  ++  F+   C+
Sbjct: 573 YQQQNFYVECDLENERFGFRSQSCQ 597


>gi|296085499|emb|CBI29231.3| unnamed protein product [Vitis vinifera]
          Length = 308

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 100/379 (26%), Positives = 161/379 (42%), Gaps = 89/379 (23%)

Query: 40  IRSHNTYQAQILPSNDISNSL------FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR 93
           I+++NT   Q+   NDI +++      + +NIS+G PPV      DTGS L+W  C PC 
Sbjct: 3   IQTNNTGN-QLASPNDIQSNVISGGGSYLMNISLGTPPVSMLGIADTGSDLIWRQCLPCD 61

Query: 94  DCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVS 153
           DC  Q+  +F P +S +Y                                   +T  ++S
Sbjct: 62  DCYKQVEPLFDPKKSKTY-----------------------------------KTLGYLS 86

Query: 154 TEQLTFKNTDESTIHVQDVVFGCGFSTNRNF--KFSGIFGLGIGRSSLVSQLNSS----F 207
           +E  T  +T+        + FGCG S    F  K SG+ GLG G  SLV QL+S     F
Sbjct: 87  SETFTIGSTEGDPASFPGLAFGCGHSNGGTFNEKDSGLIGLGGGPLSLVMQLSSKVGGQF 146

Query: 208 SYCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPN 266
           SYC+  L       +K+  G  A++   G ++P    + +                    
Sbjct: 147 SYCLVPLSSDSTASSKINFGKSAVVSGSGTSSPAAAEESN-------------------- 186

Query: 267 IFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS 326
                     ++IDSGT +T L ++ Y  +   +   + G+          LCY G+   
Sbjct: 187 ----------IIIDSGTTLTLLPRDFYTDMESALTKVIGGQTTTDPRGTFSLCYSGVKKL 236

Query: 327 DLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQF 386
           ++   PT+  HF  GA + L   + F Q + D  C ++ P+S      N ++ G ++Q  
Sbjct: 237 EI---PTITAHFI-GADVQLPPLNTFVQAQEDLVCFSMIPSS------NLAIFGNLSQMN 286

Query: 387 YNVGYDIGRKQKTFQRMDC 405
           + VGYD+   + +F+  DC
Sbjct: 287 FLVGYDLKNNKVSFKPTDC 305


>gi|449518783|ref|XP_004166415.1| PREDICTED: aspartic proteinase-like protein 2-like, partial
           [Cucumis sativus]
          Length = 420

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 90/346 (26%), Positives = 155/346 (44%), Gaps = 36/346 (10%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLGTIFYP---SRSSSYAYV 114
           L+Y  I IG P    +  +DTGS ++WV+C  CR+C  +  LG    P     S++   V
Sbjct: 86  LYYAKIGIGTPSKDYYVQVDTGSDIVWVNCIQCRECPRTSSLGMELTPYDLEESTTGKLV 145

Query: 115 PCDSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD---ESTIH 168
            CD + C      P + C+     C Y Q+Y  G  T+ +   + + +       E+T  
Sbjct: 146 SCDEQFCLEVNGGPLSGCTT-NMSCPYLQIYGDGSSTAGYFVKDYVQYNRVSGDLETTAA 204

Query: 169 VQDVVFGCG------FSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHD 216
              + FGCG        ++      GI G G   SS++SQL S+      F++C+     
Sbjct: 205 NGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCL----- 259

Query: 217 PDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
            D  +   I   G ++  + + TPL     HY + +  + V   +L+I+ ++F+  D   
Sbjct: 260 -DGTNGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVGHIILNISADVFEAGDR-K 317

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
           G +IDSGT + +L +  YE L  +++ +    ++++     K C+      D  GFP V 
Sbjct: 318 GTIIDSGTTLAYLPELIYEPLVAKILSQQHNLEVQTIHGEYK-CFQYSERVD-DGFPPVI 375

Query: 336 FHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR-YVNFSLIG 380
           FHF     L +      +Q   + +C+    + + +R   N +L G
Sbjct: 376 FHFENSLLLKVYPHEYLFQ-YENLWCIGWQNSGMQSRDRKNVTLFG 420


>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 663

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 99/377 (26%), Positives = 162/377 (42%), Gaps = 54/377 (14%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
           N  +   + IG PP      +DTGS++ +V C  C  C       F P  SS+Y  V C 
Sbjct: 109 NGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPESSSTYQPVKCT 168

Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
            +         C   + +C+Y + Y     +S  +  + ++F N  +S +  Q  VFGC 
Sbjct: 169 ID-------CNCDGDRMQCVYERQYAEMSTSSGVLGEDVISFGN--QSELAPQRAVFGCE 219

Query: 178 FSTNRNF---KFSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLILGD 228
                +       GI GLG G  S++ QL      + SFS C G +           +G 
Sbjct: 220 NVETGDLYSQHADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMD----------VGG 269

Query: 229 GAIIDEGDATPLQFI--------DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
           GA++  G + P              +Y I L+ + V G+ L +N N+F   D   G ++D
Sbjct: 270 GAMVLGGISPPSDMTFAYSDPDRSPYYNIDLKEMHVAGKRLPLNANVF---DGKHGTVLD 326

Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK----LCYHGI---MSSDLKGFPT 333
           SGT   +L + A+ A +D ++  L  + ++  S PD     +C+ G    +S   K FP 
Sbjct: 327 SGTTYAYLPEAAFLAFKDAIVKEL--QSLKQISGPDPNYNDICFSGAGNDVSQLSKSFPV 384

Query: 334 VRFHFRGGAKLALEKDS-MFYQPR-PDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
           V   F  G K +L  ++ MF   +   A+C+ +      N     +L+G +  +   V Y
Sbjct: 385 VDMVFGNGHKYSLSPENYMFRHSKVRGAYCLGI----FQNGNDQTTLLGGIIVRNTLVMY 440

Query: 392 DIGRKQKTFQRMDCEVL 408
           D  + +  F + +C  L
Sbjct: 441 DREQTKIGFWKTNCAEL 457


>gi|56784779|dbj|BAD82000.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
          Length = 486

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 97/361 (26%), Positives = 152/361 (42%), Gaps = 26/361 (7%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC---SPQLGTI--FYPSRSSSYA 112
             ++ ++ S+G PP      +D  S  +W+ C  C  C   +P   +   FY   SS+  
Sbjct: 94  TGMYVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATCGADAPAATSAPPFYAFLSSTIR 153

Query: 113 YVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPE--TSVFVSTEQLTFKNTDESTIHVQ 170
            V C +  C+      CSA    C Y+ +Y  G    T+  ++ +   F     +T+   
Sbjct: 154 EVRCANRGCQRLVPQTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAF-----ATVRAD 208

Query: 171 DVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDG 229
            V+FGC  +T  +    G+ GLG G  S VSQL    FSY +      D     L L D 
Sbjct: 209 GVIFGCAVATEGD--IGGVIGLGRGELSPVSQLQIGRFSYYLAPDDAVDVGSFILFLDDA 266

Query: 230 AI-IDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTD 284
                   +TPL   +     YY+ L  I VDG  L I    F  + D  GGV++     
Sbjct: 267 KPRTSRAVSTPLVASRASRSLYYVELAGIRVDGEDLAIPRGTFDLQADGSGGVVLSITIP 326

Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKL 344
           VT+L   AY+ +R  +  ++E            LCY     +  K  P++   F GGA +
Sbjct: 327 VTFLDAGAYKVVRQAMASKIELRAADGSELGLDLCYTSESLATAK-VPSMALVFAGGAVM 385

Query: 345 ALEKDSMFYQPRPDAF-CMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
            LE  + FY        C+ + P+   +     SL+G + Q   ++ YDI   +  F+ +
Sbjct: 386 ELEMGNYFYMDSTTGLECLTILPSPAGDG----SLLGSLIQVGTHMIYDISGSRLVFESL 441

Query: 404 D 404
           +
Sbjct: 442 E 442


>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
          Length = 469

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 107/383 (27%), Positives = 171/383 (44%), Gaps = 59/383 (15%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLGT---IFYPSRSSSYAYV 114
           L++  + +G PP      +DTGS +LWV C  C +C  S  LG     F    S +   V
Sbjct: 99  LYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSV 158

Query: 115 PCDSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
            C    C        A+CS   ++C Y+  Y  G  TS +  T+   F      ++    
Sbjct: 159 TCSDPICSSVFQTTAAQCSE-NNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANS 217

Query: 172 ---VVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDP 217
              +VFGC     G  T  +    GIFG G G+ S+VSQL+S       FS+C   L   
Sbjct: 218 SAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHC---LKGD 274

Query: 218 DYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
                  +LG+  I+  G   +PL     HY + L +I V+G+ML ++  +F+  ++  G
Sbjct: 275 GSGGGVFVLGE--ILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNT-RG 331

Query: 277 VMIDSGTDVTWLVKEAYEALRDEVM---------IRLEGEQMRSYSWPDKLCYHGIMSSD 327
            ++D+GT +T+LVKEAY+   + +          I   GEQ          CY  + +S 
Sbjct: 332 TIVDTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNGEQ----------CYL-VSTSI 380

Query: 328 LKGFPTVRFHFRGGAKLALE-KDSMFYQPRPDA---FCMAVNPASINNRYVNFSLIGMMA 383
              FP+V  +F GGA + L  +D +F+    D    +C+    A         +++G + 
Sbjct: 381 SDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQ-----TILGDLV 435

Query: 384 QQFYNVGYDIGRKQKTFQRMDCE 406
            +     YD+ R++  +   DC+
Sbjct: 436 LKDKVFVYDLARQRIGWASYDCK 458


>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 512

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 107/384 (27%), Positives = 171/384 (44%), Gaps = 59/384 (15%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLGT---IFYPSRSSSYAYV 114
           L++  + +G PP      +DTGS +LWV C  C +C  S  LG     F    S +   V
Sbjct: 104 LYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSV 163

Query: 115 PCDSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
            C    C        A+CS   ++C Y+  Y  G  TS +  T+   F      ++    
Sbjct: 164 TCSDPICSSVFQTTAAQCSE-NNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANS 222

Query: 172 ---VVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDP 217
              +VFGC     G  T  +    GIFG G G+ S+VSQL+S       FS+C   L   
Sbjct: 223 SAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHC---LKGD 279

Query: 218 DYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
                  +LG+  I+  G   +PL     HY + L +I V+G+ML ++  +F+  ++  G
Sbjct: 280 GSGGGVFVLGE--ILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNT-RG 336

Query: 277 VMIDSGTDVTWLVKEAYEALRDEVM---------IRLEGEQMRSYSWPDKLCYHGIMSSD 327
            ++D+GT +T+LVKEAY+   + +          I   GEQ          CY  + +S 
Sbjct: 337 TIVDTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNGEQ----------CYL-VSTSI 385

Query: 328 LKGFPTVRFHFRGGAKLALE-KDSMFYQPRPDA---FCMAVNPASINNRYVNFSLIGMMA 383
              FP+V  +F GGA + L  +D +F+    D    +C+    A         +++G + 
Sbjct: 386 SDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQ-----TILGDLV 440

Query: 384 QQFYNVGYDIGRKQKTFQRMDCEV 407
            +     YD+ R++  +   DC +
Sbjct: 441 LKDKVFVYDLARQRIGWASYDCSM 464


>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
 gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
          Length = 473

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 109/394 (27%), Positives = 169/394 (42%), Gaps = 40/394 (10%)

Query: 32  RLAYLQEKIRSHNTYQAQ--ILP---SNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLW 86
           R+  +  ++ SH  +Q +   LP      I +  + V + +G P        DTGS L W
Sbjct: 99  RVDSIHARLSSHGVFQEKQATLPVQSGASIGSGDYAVTVGLGTPKKEFTLIFDTGSDLTW 158

Query: 87  VHCYPC-RDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPY---ARCSAYKHRCIYTQLY 142
             C PC + C  Q      P++S+SY  + C S  C+         CS+    C+Y   Y
Sbjct: 159 TQCEPCAKTCYKQKEPRLDPTKSTSYKNISCSSAFCKLLDTEGGESCSS--PTCLYQVQY 216

Query: 143 LIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFK-FSGIFGLGIGRSSLVS 201
             G  +  F +TE LT  +++      ++ +FGCG   +  F+  +G+ GLG  + SL S
Sbjct: 217 GDGSYSIGFFATETLTLSSSNV----FKNFLFGCGQQNSGLFRGAAGLLGLGRTKLSLPS 272

Query: 202 QLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEG-DATPLQ--FIDGHYY-ITLEA 253
           Q        FSYC+     P    +K  L  G  + +    TPL   F    +Y + +  
Sbjct: 273 QTAQKYKKLFSYCL-----PASSSSKGYLSFGGQVSKTVKFTPLSEDFKSTPFYGLDITE 327

Query: 254 ISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVM-IRLEGEQMRSY 312
           +SV G  L I+ +IF    S  G +IDSGT +T L   AY AL      +  +      Y
Sbjct: 328 LSVGGNKLSIDASIF----STSGTVIDSGTVITRLPSTAYSALSSAFQKLMTDYPSTDGY 383

Query: 313 SWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP-DAFCMAVNPASINN 371
           S  D  CY    +  +K  P V   F+GG ++ ++   + Y        C+A    + N 
Sbjct: 384 SIFDT-CYDFSKNETIK-IPKVGVSFKGGVEMDIDVSGILYPVNGLKKVCLAF---AGNG 438

Query: 372 RYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
             V  ++ G   Q+ Y V YD  + +  F    C
Sbjct: 439 DDVKAAIFGNTQQKTYQVVYDDAKGRVGFAPSGC 472


>gi|218186446|gb|EEC68873.1| hypothetical protein OsI_37494 [Oryza sativa Indica Group]
          Length = 353

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 99/365 (27%), Positives = 163/365 (44%), Gaps = 37/365 (10%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR----DCSPQLGTIFYPSRSSSYAYVPC 116
           +++ IS+G PPV     +DTGS+L WV C  C+    D + + G IF P  SS+Y+ V C
Sbjct: 6   YFMGISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSKVGC 65

Query: 117 DSEHCR------YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
            +E C          Y  C      CIY+  Y  G  +  ++  ++LT      S   + 
Sbjct: 66  STEACNGMHMDLAVEYG-CVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLA----SNRSID 120

Query: 171 DVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQL-----NSSFSYCIGSLHDPDYLHNKLI 225
           + +FGCG     N   +GI G G    S  +Q+      ++FSYC    H+ +     L 
Sbjct: 121 NFIFGCGEDNLYNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPRDHENE---GSLT 177

Query: 226 LGDGAIIDEGDATPLQFIDGH--YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGT 283
           +G  A       T L + D    Y I    + V+G  L+I+P I+    +    ++DSGT
Sbjct: 178 IGPYARDINLMWTKLIYYDHKPAYAIQQLDMMVNGIRLEIDPYIYISKMT----IVDSGT 233

Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD-KLCY-HGIMSSDLKGFPTVRFHF-RG 340
             T+++   ++AL D+ M +    +  +  W + ++C+     S++   FPTV     R 
Sbjct: 234 ADTYILSPVFDAL-DKAMTKEMQAKGYTRGWDERRICFISNSGSANWNDFPTVEMKLIRS 292

Query: 341 GAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
             KL +E  + FY+   +  C    P     R V   ++G  A + + + +DI      F
Sbjct: 293 TLKLPVE--NAFYESSNNVICSTFLPDDAGVRGV--QMLGNRAVRSFKLVFDIQAMNFGF 348

Query: 401 QRMDC 405
           +   C
Sbjct: 349 KARAC 353


>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 461

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 118/428 (27%), Positives = 173/428 (40%), Gaps = 51/428 (11%)

Query: 1   LIHRDSILSPYNNPN----ENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDI 56
           L HR    SP         E   HR Q        R AY+Q K          +  S+  
Sbjct: 62  LHHRHGPCSPLPTKKMPTLEETLHRDQL-------RAAYIQRKFSGGGGAGGDVQRSDAT 114

Query: 57  S--------NSLFY-VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSR 107
                    N+L Y + + +G P   Q   +DTGS + WV C PC  C  Q   +F PS 
Sbjct: 115 VPTALGTSLNTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSS 174

Query: 108 SSSYAYVPCDSEHCRYFPYA--RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES 165
           SS+Y+   C S  C         CS+   +C Y   Y  G  T+   S++ L   ++   
Sbjct: 175 SSTYSPFSCGSAACAQLGQEGNGCSS-SSQCQYIVTYGDGSSTTGTYSSDTLALGSS--- 230

Query: 166 TIHVQDVVFGCG-FSTNRNFKFSGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYL 220
              V+   FGC    +  N +  G+ GLG G  SLVSQ    L  +FSYC+         
Sbjct: 231 --AVKSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGF 288

Query: 221 HNKLILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
              L    G+       TP+     +   Y + L+AI V GR L I  ++F       G 
Sbjct: 289 LT-LGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFS-----AGT 342

Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFH 337
           ++DSGT +T L   AY AL       ++       S     C+     S +   P+V   
Sbjct: 343 VMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVS-IPSVALV 401

Query: 338 FRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
           F GGA ++L+   +       + C+A    + N+   +  +IG + Q+ + V YD+GR  
Sbjct: 402 FSGGAVVSLDASGIIL-----SNCLAF---AANSDDSSLGIIGNVQQRTFEVLYDVGRGV 453

Query: 398 KTFQRMDC 405
             F+   C
Sbjct: 454 VGFRAGAC 461


>gi|326501496|dbj|BAK02537.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 105/387 (27%), Positives = 151/387 (39%), Gaps = 48/387 (12%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCY-PCRDCSPQLGT---IFYPSRSSSYAYVPC 116
           ++V   +G P  P     DTGS L WV C  P  + S         F P  S ++A + C
Sbjct: 94  YFVRFRVGTPAQPFLLVADTGSDLTWVKCRRPAANSSESGSGSGRAFRPEDSRTWAPISC 153

Query: 117 DSEHCRY---FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTF----KNTDESTIHV 169
            S+ C     F  A C      C Y   Y  G      V TE  T     +  +E    +
Sbjct: 154 ASDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSGRGREERKAKL 213

Query: 170 QDVVFGCGFS-TNRNFKFS-GIFGLGIGRSSLVSQLNSSF----SYCIGSLHDPDYLHNK 223
           + +V GC  S T  +F+ S G+  LG    S  S   S F    SYC+     P    + 
Sbjct: 214 KGLVLGCTSSYTGPSFEVSDGVLSLGYSDVSFASHAASRFAGRFSYCLVDHLSPRNATSY 273

Query: 224 LILGDGAIIDEGDA----------------------TPLQF---IDGHYYITLEAISVDG 258
           L  G    +    +                      TPL     +   Y + ++A+SV G
Sbjct: 274 LTFGPNPAVASSSSPSSPAPASCTAAAPRPRPRARQTPLLLDRRMRPFYDVAVKAVSVAG 333

Query: 259 RMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL 318
           + L I P      D+GGGV++DSGT +T L K AY A+   +   L G    +   P + 
Sbjct: 334 QFLKI-PRAVWDVDAGGGVILDSGTSLTVLAKPAYRAVVAALSEGLAGLPRVTMD-PFEY 391

Query: 319 CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSL 378
           CY+    S     P +  HF G A+L     S      P   C+ +        +   S+
Sbjct: 392 CYNWTSPSGDVTLPKMAVHFAGAARLEPPGKSYVIDAAPGVKCIGLQ----EGPWPGISV 447

Query: 379 IGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           IG + QQ +   +DI  ++  FQR  C
Sbjct: 448 IGNILQQEHLWEFDIKNRRLKFQRSRC 474


>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 458

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 89/359 (24%), Positives = 148/359 (41%), Gaps = 23/359 (6%)

Query: 67  IGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTI-FYPSRSSSYAYVPCDSEHCRYFP 125
           +G PP     A+D  +   WV C  C  C+P   +  F P++SS+Y  V C +  C   P
Sbjct: 106 LGTPPQTLLVAIDPSNDAAWVPCSACLGCAPGASSPSFDPTQSSTYRPVRCGAPQCAQVP 165

Query: 126 YA--RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC---GFST 180
            A   C A         L          +  + L+  +++ + +      FGC      +
Sbjct: 166 PATPSCPAGPGASCAFNLSYASSTLHAVLGQDALSLSDSNGAAVPDDHYTFGCLRVVTGS 225

Query: 181 NRNFKFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEGD 236
             +    G+ G G G  S +SQ  ++    FSYC+ S    ++    L LG         
Sbjct: 226 GGSVPPQGLVGFGRGPLSFLSQTKATYGSIFSYCLPSYKSSNF-SGTLRLGPAGQPRRIK 284

Query: 237 ATPLQFIDGH----YYITLEAISVDGRMLDINPNIFKRDDSG--GGVMIDSGTDVTWLVK 290
            TPL   + H    YY+ +  + V+G+ + I  +    D +   GG ++D+GT  T L  
Sbjct: 285 TTPL-LSNPHRPSLYYVAMVGVRVNGKAVPIPASALALDAATGRGGTIVDAGTMFTRLSP 343

Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL-EKD 349
            AY ALR+     +      +    D  CY+    +  K  P V F F GGA++ L E++
Sbjct: 344 PAYAALRNAFRRGVSAPAAPALGGFDT-CYY---VNGTKSVPAVAFVFAGGARVTLPEEN 399

Query: 350 SMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
            +         C+A+     +      +++  M QQ + V +D+G  +  F R  C  +
Sbjct: 400 VVISSTSGGVACLAMAAGPSDGVNAGLNVLASMQQQNHRVVFDVGNGRVGFSRELCTAV 458


>gi|357481195|ref|XP_003610883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512218|gb|AES93841.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 315

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 93/310 (30%), Positives = 143/310 (46%), Gaps = 30/310 (9%)

Query: 112 AYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
           A+  CDS  C       CS  K RC YT  Y     T   ++ +  TF +     + +  
Sbjct: 17  AHNSCDSPLCHKLDTGVCSPEK-RCNYTYGYGDNSLTKGVLAQDTATFTSNTGKLVSLSR 75

Query: 172 VVFGCGFSTNRNFK--FSGIFGLGIGRSSLVSQL-----NSSFSYCIGSLHDPDYLHNKL 224
            +FGCG +    F     G+ GLG G +SL+SQ+        FS C+        + +++
Sbjct: 76  FLFGCGHNNTGGFNDHEMGLIGLGGGPTSLISQIGPLFGGKKFSQCLVPFLTDIKISSRM 135

Query: 225 ILGDGAII--DEGDATPLQFID---GHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
             G G+ +  D    TPL   +     Y++TL  ISV+   L +N  I K     G +++
Sbjct: 136 SFGKGSQVLGDGVVTTPLVQREQDMTSYFVTLLGISVEDTYLPMNSTIEK-----GNMLV 190

Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS-YSWPDKLCYHGIMSSDLKGFPTVRFHF 338
           DSGT    L ++ Y+ +  EV   +  E + +  S   +LCY     ++LKG PT+ +HF
Sbjct: 191 DSGTPPNILPQQLYDRVYVEVKNNVPLELITNDPSLGPQLCYR--TQTNLKG-PTLTYHF 247

Query: 339 RGGAKLALEKDSMFYQPRPDA---FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGR 395
             GA L L     F  P P+    FC+A+N    N    N  + G  AQ  Y +G+D+ R
Sbjct: 248 E-GANLLLTPIQTFIPPTPETKGVFCLAIN----NYTNSNGGVYGNFAQSNYLIGFDLDR 302

Query: 396 KQKTFQRMDC 405
           +  +F+  DC
Sbjct: 303 QVVSFKATDC 312


>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 493

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 105/375 (28%), Positives = 168/375 (44%), Gaps = 42/375 (11%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSP------QLGTIFYPSRSSSYAY 113
           L+Y  + +G PP      +DTGS +LWV C  C  C        QL + F P  SSS + 
Sbjct: 83  LYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQL-SFFDPGVSSSASL 141

Query: 114 VPCDSEHC--RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
           V C    C   +   + CS   + C Y+  Y  G  TS F  ++ ++F     ST+ +  
Sbjct: 142 VSCSDRRCYSNFQTESGCSP-NNLCSYSFKYGDGSGTSGFYISDFMSFDTVITSTLAINS 200

Query: 172 ---VVFGCGFSTNRNFK-----FSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDP 217
               VFGC      + +       GIFGLG G  S++SQL         FS+C   L   
Sbjct: 201 SAPFVFGCSNLQTGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHC---LKGD 257

Query: 218 DYLHNKLILGDGAIIDEGDA--TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
                 ++LG    I   D   TPL     HY + L++I+V+G++L I+P++F    +G 
Sbjct: 258 KSGGGIMVLGQ---IKRPDTVYTPLVPSQPHYNVNLQSIAVNGQILPIDPSVFTI-ATGD 313

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
           G +ID+GT + +L  EAY      +   +  +  R  ++    C+  I + D+  FP V 
Sbjct: 314 GTIIDTGTTLAYLPDEAYSPFIQAIANAVS-QYGRPITYESYQCFE-ITAGDVDVFPEVS 371

Query: 336 FHFRGGAKLALEKDS---MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
             F GGA + L   +   +F       +C+     S    +   +++G +  +   V YD
Sbjct: 372 LSFAGGASMVLRPHAYLQIFSSSGSSIWCIGFQRMS----HRRITILGDLVLKDKVVVYD 427

Query: 393 IGRKQKTFQRMDCEV 407
           + R++  +   DC +
Sbjct: 428 LVRQRIGWAEYDCSL 442


>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
 gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 507

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 107/384 (27%), Positives = 171/384 (44%), Gaps = 59/384 (15%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLGT---IFYPSRSSSYAYV 114
           L++  + +G PP      +DTGS +LWV C  C +C  S  LG     F    S +   V
Sbjct: 99  LYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSV 158

Query: 115 PCDSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
            C    C        A+CS   ++C Y+  Y  G  TS +  T+   F      ++    
Sbjct: 159 TCSDPICSSVFQTTAAQCSE-NNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANS 217

Query: 172 ---VVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDP 217
              +VFGC     G  T  +    GIFG G G+ S+VSQL+S       FS+C   L   
Sbjct: 218 SAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHC---LKGD 274

Query: 218 DYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
                  +LG+  I+  G   +PL     HY + L +I V+G+ML ++  +F+  ++  G
Sbjct: 275 GSGGGVFVLGE--ILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNT-RG 331

Query: 277 VMIDSGTDVTWLVKEAYEALRDEVM---------IRLEGEQMRSYSWPDKLCYHGIMSSD 327
            ++D+GT +T+LVKEAY+   + +          I   GEQ          CY  + +S 
Sbjct: 332 TIVDTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNGEQ----------CYL-VSTSI 380

Query: 328 LKGFPTVRFHFRGGAKLALE-KDSMFYQPRPDA---FCMAVNPASINNRYVNFSLIGMMA 383
              FP+V  +F GGA + L  +D +F+    D    +C+    A         +++G + 
Sbjct: 381 SDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQ-----TILGDLV 435

Query: 384 QQFYNVGYDIGRKQKTFQRMDCEV 407
            +     YD+ R++  +   DC +
Sbjct: 436 LKDKVFVYDLARQRIGWASYDCSM 459


>gi|357127507|ref|XP_003565421.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 438

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 111/429 (25%), Positives = 187/429 (43%), Gaps = 62/429 (14%)

Query: 2   IHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQE--------KIRSHNTYQAQILPS 53
           IHRDS+ S +++P   P  R+++    S+AR A+                +   A ++  
Sbjct: 9   IHRDSVKSLFHDPTLTPEARLRQAARRSMARHAHAARINNSAAAAGASGSDDSDADVVSP 68

Query: 54  NDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAY 113
               N  + + + +  PPV      DTGSSL+W+ C        +L     P+ SSSYA 
Sbjct: 69  MVPQNFEYLMALDVSTPPVRMLALADTGSSLVWLKC--------KLPAAHTPA-SSSYAR 119

Query: 114 VPCDSEHCRYF-PYARCSAY---KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHV 169
           +PCD+  C+     A C A     + C+Y   +  G  T+  V+ +  TF    +     
Sbjct: 120 LPCDAFACKALGDAASCRATGSGNNICVYRYAFADGSCTAGPVTVDAFTFSTRLD----- 174

Query: 170 QDVVFGCGFSTN-RNFKFSGIFGLGIGRSSLVSQLNS------SFSYCIGSLHDPDYLHN 222
               FGC   T   +    G+ GL  G  SLVSQL++       FSYC+      + + +
Sbjct: 175 ----FGCATRTEGLSVPDDGLVGLANGPISLVSQLSAKTPFAHKFSYCLVPYSSSETVSS 230

Query: 223 KLILGDGAIIDE---GDATPLQFIDGH----YYITLEAISVDGRMLDINPNIFKRDDSGG 275
            L  G  AI+        TPL  + G     Y I L++I V G+ + +     K      
Sbjct: 231 SLNFGSHAIVSSSPGAATTPL--VAGRNKSFYTIALDSIKVAGKPVPLQTTTTK------ 282

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCY--HGIMSSDL-KGFP 332
            +++DSGT +T+L K   + L   +   ++  +++S      +CY        D+ K  P
Sbjct: 283 -LIVDSGTMLTYLPKAVLDPLVAALTAAIKLPRVKSPETLYAVCYDVRRRAPEDVGKSIP 341

Query: 333 TVRFHFRGGAKLALEKDSMF-YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
            V     GG ++ L   + F  + +    C+A+    + +    F ++G +AQQ  +VG+
Sbjct: 342 DVTLVLGGGGEVRLPWGNTFVVENKGTTVCLAL----VESHLPEF-ILGNVAQQNLHVGF 396

Query: 392 DIGRKQKTF 400
           D+ R+  +F
Sbjct: 397 DLERRTVSF 405


>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 507

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 107/384 (27%), Positives = 171/384 (44%), Gaps = 59/384 (15%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLGT---IFYPSRSSSYAYV 114
           L++  + +G PP      +DTGS +LWV C  C +C  S  LG     F    S +   V
Sbjct: 99  LYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSFTAGSV 158

Query: 115 PCDSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
            C    C        A+CS   ++C Y+  Y  G  TS +  T+   F      ++    
Sbjct: 159 TCSDPICSSVFQTTAAQCSE-NNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANS 217

Query: 172 ---VVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDP 217
              +VFGC     G  T  +    GIFG G G+ S+VSQL+S       FS+C   L   
Sbjct: 218 SAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHC---LKGD 274

Query: 218 DYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
                  +LG+  I+  G   +PL     HY + L +I V+G++L I+  +F+  ++  G
Sbjct: 275 GSGGGVFVLGE--ILVPGMVYSPLLPSQPHYNLNLLSIGVNGQILPIDAAVFEASNT-RG 331

Query: 277 VMIDSGTDVTWLVKEAYEALRDEV---------MIRLEGEQMRSYSWPDKLCYHGIMSSD 327
            ++D+GT +T+LVKEAY+   + +         +I   GEQ          CY  + +S 
Sbjct: 332 TIVDTGTTLTYLVKEAYDPFLNAISNSVSQLVTLIISNGEQ----------CYL-VSTSI 380

Query: 328 LKGFPTVRFHFRGGAKLALE-KDSMFYQPRPDA---FCMAVNPASINNRYVNFSLIGMMA 383
              FP V  +F GGA + L  +D +F+    D    +C+    A         +++G + 
Sbjct: 381 SDMFPPVSLNFAGGASMMLRPQDYLFHYGFYDGASMWCIGFQKAPEEQ-----TILGDLV 435

Query: 384 QQFYNVGYDIGRKQKTFQRMDCEV 407
            +     YD+ R++  +   DC +
Sbjct: 436 LKDKVFVYDLARQRIGWANYDCSM 459


>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
 gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
          Length = 447

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 111/419 (26%), Positives = 184/419 (43%), Gaps = 51/419 (12%)

Query: 27  NISIARLAYLQEK--IRSHNTYQAQI-LPSNDISNSLFYVNISIGQPPVPQFTAMDTGSS 83
           +I++A L+ L     ++   T   ++ LP+   S   + V  S+G PP      +DTGSS
Sbjct: 37  SINLAALSSLSRARHLKRPPTLTGKVTLPAYPRSYGGYSVIFSLGTPPQKVSLVLDTGSS 96

Query: 84  LLWVHC------YPCRDCS-----PQLGTIFYPSRSSSYAYVPCDSEHCRYF--PYARCS 130
           L+W  C      Y C++C+     P    I+  ++SS+   +PC S  C +       CS
Sbjct: 97  LVWTPCTIPTATYTCQNCTFSGVDPTKIPIYARNKSSTVQSLPCRSPKCNWVFGSDLNCS 156

Query: 131 AYKHRCIYTQL-YLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGI 189
             K RC Y  L Y +G  T   VS + L     +     + D +FGC   +NR  +  GI
Sbjct: 157 TTK-RCPYYGLEYGLGSTTGQLVS-DVLGLSKLN----RIPDFLFGCSLVSNRQPE--GI 208

Query: 190 FGLGIGRSSLVSQLN-SSFSYCIGSLH-DPDYLHNKLILGDGAIIDEGDATPLQFI---- 243
            G G G +S+ +QL  + FSYC+ S   D       L+L  G    +  A  + +     
Sbjct: 209 AGFGRGLASIPAQLGLTKFSYCLVSHRFDDTPQSGDLVLHRGRRHADAAANGVAYAPFTK 268

Query: 244 -------DGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYEA 295
                    +YYI+L  I V G+ + I P        G GG+++DSG+  T++ +  ++ 
Sbjct: 269 SPALSPYSEYYYISLSKILVGGKDVPIPPRYLVPSKEGDGGMIVDSGSTFTFMERIIFDP 328

Query: 296 LRDEV---MIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMF 352
           +  E+   M + +  +    S     CY+    S++   P + F F+GGA + L     F
Sbjct: 329 VARELEKHMTKYKRAKEIEDSSGLGPCYNITGQSEVD-VPKLTFSFKGGANMDLPLTDYF 387

Query: 353 YQPRPDAFCMAV-----NPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
                   CM V      P S     +   ++G   QQ + + YD+ +++  F+   C+
Sbjct: 388 SLVTDGVVCMTVLTDPDEPGSTTGPAI---ILGNYQQQNFYIEYDLKKQRFGFKPQQCD 443


>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
 gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
          Length = 496

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 110/384 (28%), Positives = 159/384 (41%), Gaps = 49/384 (12%)

Query: 59  SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
           +LF + + IG         +DTGS  + V C        +   +F P+ S SY  VPC S
Sbjct: 98  ALFSMQLGIGSLQKNLSAIIDTGSEAVLVQC------GSRSRPVFDPAASQSYRQVPCIS 151

Query: 119 EHCRYFPYAR-------CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ- 170
           + C              C      C Y+  Y     ++   S + +   +T+ S   VQ 
Sbjct: 152 QLCLAVQQQTSNGSSQPCVNSSATCTYSLSYGDSRNSTGDFSQDVIFLNSTNSSGQAVQF 211

Query: 171 -DVVFGCGFSTNR---NFKFSGIFGLGIGRSSLVSQLN-----SSFSYCIGSLHDPDYLH 221
            DV FGC  S      +    GI G   G  SL SQL      S FSYC  S        
Sbjct: 212 RDVAFGCAHSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQPWQPRAT 271

Query: 222 NKLILGDGAI---------IDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDD 272
             + LGD  +         + +   TP +     YY+ L +ISVDG+ L I  + FK D 
Sbjct: 272 GVIFLGDSGLSKSKVGYTPLLDNPVTPAR--SQLYYVGLTSISVDGKTLAIPESAFKLDP 329

Query: 273 SG--GGVMIDSGTDVTWLVKEAYEALRDEVMIR----LEGEQMRSYSWPDKLCYHGIMSS 326
           S   GG ++DSGT  T +V +AY A R+         L  +   +  + D  CY+    S
Sbjct: 330 STGDGGTVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKVGAAAGFDD--CYNISAGS 387

Query: 327 DLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDA-----FCMAVNPASINNRYVNFSLIGM 381
            L G P VR   +   +L L  + +F  P   A      C+A+  +S  + +   +++G 
Sbjct: 388 SLPGVPEVRLSLQNNVRLELRFEHLFV-PVSAAGNEVTVCLAI-LSSQKSGFGKINVLGN 445

Query: 382 MAQQFYNVGYDIGRKQKTFQRMDC 405
             Q  Y V YD  R +  F+R DC
Sbjct: 446 YQQSNYLVEYDNERSRVGFERADC 469


>gi|218189440|gb|EEC71867.1| hypothetical protein OsI_04576 [Oryza sativa Indica Group]
          Length = 508

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 97/361 (26%), Positives = 152/361 (42%), Gaps = 26/361 (7%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC---SPQLGTI--FYPSRSSSYA 112
             ++ ++ S+G PP      +D  S  +W+ C  C  C   +P   +   FY   SS+  
Sbjct: 94  TGMYVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATCGADAPAATSAPPFYAFLSSTIR 153

Query: 113 YVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPE--TSVFVSTEQLTFKNTDESTIHVQ 170
            V C +  C+      CSA    C Y+ +Y  G    T+  ++ +   F     +T+   
Sbjct: 154 EVRCANRGCQRLVPQTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAF-----ATVRAD 208

Query: 171 DVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDG 229
            V+FGC  +T  +    G+ GLG G  SLVSQL    FSY +      D     L L D 
Sbjct: 209 GVIFGCAVATEGD--IGGVIGLGRGELSLVSQLQIGRFSYYLAPDDAVDVGSFILFLDDA 266

Query: 230 AI-IDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTD 284
                   +TPL   +     YY+ L  I VDG  L I    F  + D  GGV++     
Sbjct: 267 KPRTSRAVSTPLVANRASRSLYYVELAGIRVDGEDLAIPRGTFDLQADGSGGVVLSITIP 326

Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKL 344
           VT+L   AY+ +R  +  ++             LCY     +  K  P++   F GGA +
Sbjct: 327 VTFLDAGAYKVVRQAMASKIGLRAADGSELGLDLCYTSESLATAK-VPSMALVFAGGAVM 385

Query: 345 ALEKDSMFYQPRPDAF-CMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
            LE  + FY        C+ + P+   +     SL+G + Q   ++ YDI   +  F+ +
Sbjct: 386 ELEMGNYFYMDSTTGLECLTILPSPAGDG----SLLGSLIQVGTHMIYDISGSRLVFESL 441

Query: 404 D 404
           +
Sbjct: 442 E 442


>gi|125575542|gb|EAZ16826.1| hypothetical protein OsJ_32298 [Oryza sativa Japonica Group]
          Length = 396

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 97/375 (25%), Positives = 153/375 (40%), Gaps = 53/375 (14%)

Query: 63  VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR 122
            N +IG PP P    +D    L+W  C  C  C  Q   +F P+ SS++   PC ++ C+
Sbjct: 45  ANFTIGTPPQPASAIIDVAGELVWTQCSRCSRCFKQDLPLFIPNASSTFRPEPCGTDACK 104

Query: 123 YFPYARCSAYKHRCIY-----------TQLYLIGPET-SVFVSTEQLTFKNTDESTIHVQ 170
             P + CS     C Y           T L ++G ET ++  +T  L F     S I   
Sbjct: 105 STPTSNCSG--DVCTYESTTNIRLDRHTTLGIVGTETFAIGTATASLAFGCVVASDIDTM 162

Query: 171 DVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDG 229
           D               SG  GLG    SLV+Q+  + FSYC+          ++L LG  
Sbjct: 163 DGT-------------SGFIGLGRTPRSLVAQMKLTKFSYCLSPRGTGK--SSRLFLGSS 207

Query: 230 AIIDEGDATPLQ-FI------DGHYY--ITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
           A +  G++T    FI      D H+Y  ++L+AI      +           SGG +++ 
Sbjct: 208 AKLAGGESTSTAPFIKTSPDDDSHHYYLLSLDAIRAGNTTIATA-------QSGGILVMH 260

Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEG---EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFH 337
           + +  + LV  AY A +  V   + G   + M +   P  LC+           P + F 
Sbjct: 261 TVSPFSLLVDSAYRAFKKAVTEAVGGAAEQPMATPPQPFDLCFKKAAGFSRATAPDLVFT 320

Query: 338 FRGGAKLALEKDSMFYQ--PRPDAFCMAVNPASINNR--YVNFSLIGMMAQQFYNVGYDI 393
           F+G A L +             D  C A+   +  NR      S++G + Q+  +  YD+
Sbjct: 321 FQGAAALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDL 380

Query: 394 GRKQKTFQRMDCEVL 408
            ++  +F+  DC  L
Sbjct: 381 KKETLSFEPADCSSL 395


>gi|226492150|ref|NP_001146362.1| hypothetical protein precursor [Zea mays]
 gi|219886805|gb|ACL53777.1| unknown [Zea mays]
 gi|414878074|tpg|DAA55205.1| TPA: hypothetical protein ZEAMMB73_415404 [Zea mays]
          Length = 440

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 102/369 (27%), Positives = 157/369 (42%), Gaps = 36/369 (9%)

Query: 67  IGQPPVPQFTAMDTGSSLLWVHCYPCR-DCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFP 125
           IG PP      +DTGS+L+W  C  CR  C  Q    + PSRS +   V C+   C    
Sbjct: 77  IGDPPQRAEAIIDTGSNLIWTQCSRCRPTCFRQNLPYYDPSRSRAARAVGCNDAACALGS 136

Query: 126 YARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC----GFSTN 181
             +C +    C     Y  G   +  ++TE LTF++   S      +VFGC      S  
Sbjct: 137 ETQCLSDNKTCAVVTGYGAG-NIAGTLATENLTFQSETVS------LVFGCIVVTKLSPG 189

Query: 182 RNFKFSGIFGLGIGRSSLVSQL-NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDA--T 238
                SGI GLG G+ SL SQL ++ FSYC+    +     + +++G  A +  G A  T
Sbjct: 190 SLNGASGIIGLGRGKLSLPSQLGDTRFSYCLTPYFEDTIEPSHMVVGASAGLINGSASST 249

Query: 239 PLQFI-----------DGHYYITLEAISVDGRMLDINPNIFK-RDDSGG---GVMIDSGT 283
           P+  +              YY+ L  I+     L +    F  R  + G   G  IDSG 
Sbjct: 250 PVTTVPFVRSPSDDPFSTFYYLPLTGITAGKVKLAVPSAAFDLRQVAPGMWTGTFIDSGA 309

Query: 284 DVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAK 343
            +T LV  AY+ALR E+  +L    ++  +          +    +  P +  HF GG+ 
Sbjct: 310 PLTSLVDVAYQALRAELARQLGAALVQPLAGTTGFDLCVALKDAERLVPPLVLHFGGGSG 369

Query: 344 LALE---KDSMFYQPRPDAFCMAVNPASINNRYVNF---SLIGMMAQQFYNVGYDIGRKQ 397
              +     + ++ P   A    V  +S++ + +     ++IG   QQ  +V YD+    
Sbjct: 370 TGTDLVVPPANYWAPVDSATACMVVFSSVDRKSLPMNETTVIGNYMQQNMHVLYDLAGGV 429

Query: 398 KTFQRMDCE 406
            +FQ  DC 
Sbjct: 430 LSFQPADCS 438


>gi|297810815|ref|XP_002873291.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319128|gb|EFH49550.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 439

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 111/424 (26%), Positives = 169/424 (39%), Gaps = 41/424 (9%)

Query: 1   LIHRDSILSPYNNPNE-NPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSND---- 55
           + H DS  SP+ +P+  +   RV + +    ARL YL   +   +     ++P       
Sbjct: 39  IFHIDSPCSPFKSPSPLSWEARVLQTLAQDQARLQYLSSLVAGRS-----VVPIASGRQM 93

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
           + ++ + V + IG P  P   AMDT S + W+ C  C  C     T F P++S+S+  V 
Sbjct: 94  LQSTTYIVKVLIGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSN--TAFSPAKSTSFKNVS 151

Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
           C +  C+  P   C A    C +   Y     +S+  +  Q T +   +    ++   FG
Sbjct: 152 CSAPQCKQVPNPACGA--RACSFNLTY---GSSSIAANLSQDTIRLAAD---PIKAFTFG 203

Query: 176 C-------GFSTNRNFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGD 228
           C       G             G     S   S   S+FSYC+ S     +    L LG 
Sbjct: 204 CVNKVAGGGTIPPPQGLLGLGRGPLSLMSQAQSVYKSTFSYCLPSFRSLTF-SGSLRLGP 262

Query: 229 GAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDS-GGGVMIDSGTD 284
            +       T L         YY+ L AI V  +++D+ P     + S G G + DSGT 
Sbjct: 263 TSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTV 322

Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD-KLCYHGIMSSDLKGFPTVRFHFRGGAK 343
            T L K  YEA+R+E   R++       S      CY G +       PT+ F F+G   
Sbjct: 323 YTRLAKPVYEAVRNEFRKRVKPPTAVVTSLGGFDTCYSGQVK-----VPTITFMFKGVNM 377

Query: 344 LALEKDSMFYQPRPDAFCMAVNPASIN-NRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
                + M +       C+A+  A  N N  VN  +I  M QQ + V  D+   +    R
Sbjct: 378 TMPADNLMLHSTAGSTSCLAMASAPENVNSVVN--VIASMQQQNHRVLIDVPNGRLGLAR 435

Query: 403 MDCE 406
             C 
Sbjct: 436 ERCS 439


>gi|297801286|ref|XP_002868527.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297314363|gb|EFH44786.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 444

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 96/373 (25%), Positives = 168/373 (45%), Gaps = 38/373 (10%)

Query: 63  VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG--TIFYPSRSSSYAYVPCDSEH 120
           +++ IG P   Q   +DTGS L W+ C+P +   P     T F PS SSS++ +PC    
Sbjct: 83  LSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHPL 142

Query: 121 CRY----FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
           C+     F           C Y+  Y  G      +  E+ TF N+  +      ++ GC
Sbjct: 143 CKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQTT----PPLILGC 198

Query: 177 GFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHN--KLILGDG---- 229
                 +    GI G+ +GR S +SQ   S FSYCI +  +   L +     LG+     
Sbjct: 199 A---KESTDVKGILGMNLGRLSFISQAKISKFSYCIPTRSNRPGLASTGSFYLGENPNSR 255

Query: 230 -----AIIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMIDSG 282
                +++    +  +  +D   Y + L  I +  + L+I  ++F+ D  G G  M+DSG
Sbjct: 256 GFKYVSLLTFPQSQRMPNLDPLAYTVPLLGIRIGQKRLNIPSSVFRPDAGGSGQTMVDSG 315

Query: 283 TDVTWLVKEAYEALRDEVMIRLEGEQMRS---YSWPDKLCY---HGIMSSDLKGFPTVRF 336
           ++ T LV  AY+ +++E+ +RL G +++    Y     +C+   H ++   L G   + F
Sbjct: 316 SEFTHLVDVAYDKVKEEI-VRLVGSRLKKGYVYGSTADMCFDGNHQMVIGRLIG--DLVF 372

Query: 337 HFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
            F  G ++ +EK  +         C+ +  +S+     N  +IG + QQ   V +D+  +
Sbjct: 373 EFGRGVEILVEKQRLLVNVGGGIHCVGIGRSSMLGAASN--IIGNVHQQNLWVEFDVANR 430

Query: 397 QKTFQRMDCEVLD 409
           +  F + +C  L 
Sbjct: 431 RVGFSKAECSRLS 443


>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
 gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
 gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
 gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 492

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 105/375 (28%), Positives = 169/375 (45%), Gaps = 42/375 (11%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSP------QLGTIFYPSRSSSYAY 113
           L+Y  + +G PP      +DTGS +LWV C  C  C        QL + F P  SSS + 
Sbjct: 83  LYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQL-SFFDPGVSSSASL 141

Query: 114 VPCDSEHC--RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
           V C    C   +   + CS   + C Y+  Y  G  TS +  ++ ++F     ST+ +  
Sbjct: 142 VSCSDRRCYSNFQTESGCSP-NNLCSYSFKYGDGSGTSGYYISDFMSFDTVITSTLAINS 200

Query: 172 ---VVFGCGFSTNRNFK-----FSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDP 217
               VFGC    + + +       GIFGLG G  S++SQL         FS+C   L   
Sbjct: 201 SAPFVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHC---LKGD 257

Query: 218 DYLHNKLILGDGAIIDEGDA--TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
                 ++LG    I   D   TPL     HY + L++I+V+G++L I+P++F    +G 
Sbjct: 258 KSGGGIMVLGQ---IKRPDTVYTPLVPSQPHYNVNLQSIAVNGQILPIDPSVFTI-ATGD 313

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
           G +ID+GT + +L  EAY      V   +  +  R  ++    C+  I + D+  FP V 
Sbjct: 314 GTIIDTGTTLAYLPDEAYSPFIQAVANAVS-QYGRPITYESYQCFE-ITAGDVDVFPQVS 371

Query: 336 FHFRGGAKLALEKDS---MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
             F GGA + L   +   +F       +C+     S    +   +++G +  +   V YD
Sbjct: 372 LSFAGGASMVLGPRAYLQIFSSSGSSIWCIGFQRMS----HRRITILGDLVLKDKVVVYD 427

Query: 393 IGRKQKTFQRMDCEV 407
           + R++  +   DC +
Sbjct: 428 LVRQRIGWAEYDCSL 442


>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
          Length = 461

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 118/428 (27%), Positives = 173/428 (40%), Gaps = 51/428 (11%)

Query: 1   LIHRDSILSPYNNPN----ENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDI 56
           L HR    SP         E   HR Q        R AY+Q K          +  S+  
Sbjct: 62  LHHRHGPCSPLPTKKMPTLEETLHRDQL-------RAAYIQRKFSGGGGAGGDVQRSDAT 114

Query: 57  S--------NSLFY-VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSR 107
                    N+L Y + + +G P   Q   +DTGS + WV C PC  C  Q   +F PS 
Sbjct: 115 VPTALGTSLNTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSS 174

Query: 108 SSSYAYVPCDSEHCRYFPYA--RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES 165
           SS+Y+   C S  C         CS+   +C Y   Y  G  T+   S++ L   ++   
Sbjct: 175 SSTYSPFSCGSADCAQLGQEGNGCSS-SSQCQYIVTYGDGSSTTGTYSSDTLALGSS--- 230

Query: 166 TIHVQDVVFGCG-FSTNRNFKFSGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYL 220
              V+   FGC    +  N +  G+ GLG G  SLVSQ    L  +FSYC+         
Sbjct: 231 --AVRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGF 288

Query: 221 HNKLILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
              L    G+       TP+     +   Y + L+AI V GR L I  ++F       G 
Sbjct: 289 LT-LGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFS-----AGT 342

Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFH 337
           ++DSGT +T L   AY AL       ++       S     C+     S +   P+V   
Sbjct: 343 VMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVS-IPSVALV 401

Query: 338 FRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
           F GGA ++L+   +       + C+A    + N+   +  +IG + Q+ + V YD+GR  
Sbjct: 402 FSGGAVVSLDASGIIL-----SNCLAF---AGNSDDSSLGIIGNVQQRTFEVLYDVGRGV 453

Query: 398 KTFQRMDC 405
             F+   C
Sbjct: 454 VGFRAGAC 461


>gi|255581545|ref|XP_002531578.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223528808|gb|EEF30814.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 98/378 (25%), Positives = 166/378 (43%), Gaps = 48/378 (12%)

Query: 63  VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR 122
           V++++G PP      +DTGS L W+HC   +     L ++F P  S +Y+ VPC S  C+
Sbjct: 71  VSLTVGSPPQNVTMVLDTGSELSWLHCKKTQF----LNSVFNPLSSKTYSKVPCLSPTCK 126

Query: 123 YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTE-QLTFKNTDESTIHVQDVVFGC---GF 178
                R       C  T+L  +    +   S E  L F+     ++     +FGC   GF
Sbjct: 127 --TRTRDLTIPVSCDATKLCHVIVSYADATSIEGNLAFETFRLGSLTKPATIFGCMDSGF 184

Query: 179 STN--RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
           S+N   + K +G+ G+  G  S V+Q+    FSYCI            L+LG+ +     
Sbjct: 185 SSNSEEDSKTTGLIGMNRGSLSFVNQMGYPKFSYCISGFDSAGV----LLLGNASFPWLK 240

Query: 236 D---------ATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMIDSGTD 284
                     +TPL + D   Y + LE I V  ++L +  ++F  D +G G  M+DSGT 
Sbjct: 241 PLSYTPLVQISTPLPYFDRVAYTVQLEGIKVKNKVLSLPKSVFVPDHTGAGQTMVDSGTQ 300

Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMS---------SDLKGFPTVR 335
            T+L+   Y AL++E + +  G  +      D   + G M           +L+  P V 
Sbjct: 301 FTFLLGPVYTALKNEFLSQTRG--ILKVLNDDNFVFQGAMDLCYLLDSSRPNLQNLPVVS 358

Query: 336 FHFRGGAKLALEKDSMFYQP------RPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNV 389
             F+ GA++++  + + Y+       R   +C     + +    V   +IG   QQ   +
Sbjct: 359 LMFQ-GAEMSVSGERLLYRVPGEVRGRDSVWCFTFGNSDLLG--VEAFVIGHHHQQNVWM 415

Query: 390 GYDIGRKQKTFQRMDCEV 407
            +D+ + +     + C+V
Sbjct: 416 EFDLEKSRIGLADVRCDV 433


>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
 gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
          Length = 512

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 104/358 (29%), Positives = 166/358 (46%), Gaps = 46/358 (12%)

Query: 78  MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYAR--------- 128
           +DT S L WV C PC  C  Q   +F PS S SYA VPC+S  C     A          
Sbjct: 168 VDTASELTWVQCAPCESCHDQQDPLFDPSSSPSYAAVPCNSSSCDALQLATGGTSGGAAA 227

Query: 129 CSAYKHR---CIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFK 185
           C         C YT  Y  G  +   ++ ++L+          +   VFGCG ++N+   
Sbjct: 228 CQGQDQSAAACSYTLSYRDGSYSRGVLAHDRLSLAGE-----VIDGFVFGCG-TSNQGPP 281

Query: 186 FSGIFGL-GIGRS--SLVS----QLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT 238
           F G  GL G+GRS  SLVS    Q    FSYC+  L + D     L++GD + +   ++T
Sbjct: 282 FGGTSGLMGLGRSQLSLVSQTMDQFGGVFSYCL-PLKESDS-SGSLVIGDDSSVYR-NST 338

Query: 239 PLQF-------IDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVK 290
           P+ +       + G  Y++ L  I+V G+  ++  + F     GG  +IDSGT +T LV 
Sbjct: 339 PIVYASMVSDPLQGPFYFVNLTGITVGGQ--EVESSGFSSGGGGGKAIIDSGTVITSLVP 396

Query: 291 EAYEALRDEVMIRL-EGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKD 349
             Y A++ E + +  E  Q   +S  D  C++     +++  P+++  F GG ++ ++  
Sbjct: 397 SIYNAVKAEFLSQFAEYPQAPGFSILDT-CFNMTGLREVQ-VPSLKLVFDGGVEVEVDSG 454

Query: 350 SMFYQPRPDA--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
            + Y    D+   C+A+ P  + + Y   ++IG   Q+   V +D    Q  F +  C
Sbjct: 455 GVLYFVSSDSSQVCLAMAP--LKSEY-ETNIIGNYQQKNLRVIFDTSGSQVGFAQETC 509


>gi|357162717|ref|XP_003579500.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 488

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 102/396 (25%), Positives = 168/396 (42%), Gaps = 53/396 (13%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHC---YPCRDC-----SPQLGTIFYPSRSSSYA 112
           +  ++S+G PP P    +DTGS L WV C   Y CR+C     +     +F+P  SSS  
Sbjct: 91  YAFSVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNCSSSPSAMSAMAVFHPKNSSSSR 150

Query: 113 YVPCDSEHCRYF---PYARCSAYKHR-----C-IYTQLYLIGPETSVFVS-TEQLTFKNT 162
            V C +  CR+      + C +  +      C  Y  +Y  G  + + +S T +L+  ++
Sbjct: 151 LVGCRNPACRWIHSKSPSTCGSTGNNGNGDVCPPYLVVYGSGSTSGLLISDTLRLSPSSS 210

Query: 163 DESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGS--LHDPDY 219
             +    ++   GC   +      SG+ G G G  S+ SQL    FSYC+ S    D   
Sbjct: 211 SSAPAPFRNFAIGCSIVSVHQ-PPSGLAGFGRGAPSVPSQLKVPKFSYCLLSRRFDDNSA 269

Query: 220 LHNKLILGDGAIIDEGDATPLQFI------------DGHYYITLEAISVDGRMLDINPNI 267
           +  +L+LGD  +      T +Q++              +YY+ L  ISV G+ +++    
Sbjct: 270 VSGELVLGDAMVPAGKKKTTMQYVPLLNNAASKPPYSVYYYLALTGISVGGKPVNLPSRA 329

Query: 268 FKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMS-- 325
           F    SGGG +IDSGT  T+L    ++ +   +   + G   RS    D L      +  
Sbjct: 330 FV-PSSGGGAIIDSGTTFTYLDPTVFKPVAAAMESAVGGRYNRSRPVEDALGLRPCFALP 388

Query: 326 ---SDLKGFPTVRFHFRGGAKLALEKDSMF--------YQPRPDAFCMAV-----NPASI 369
                    P +   F+GGA + L  ++ F            P A C+AV          
Sbjct: 389 PGPGGAMELPDLELKFKGGAVMRLPVENYFVAAGPAGGPAAGPVAICLAVVSDLPASGGD 448

Query: 370 NNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
                   ++G   QQ Y++ YD+G+++  F++  C
Sbjct: 449 GAAAGPAIILGSFQQQNYHIEYDLGKERLGFRQQPC 484


>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
 gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
          Length = 395

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 98/392 (25%), Positives = 175/392 (44%), Gaps = 45/392 (11%)

Query: 48  AQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYP---CRDCSPQLGTIFY 104
           ++++  + I +  ++V + +G P       +DTGS L W+ C P     + S      + 
Sbjct: 14  SRLVSGSSIGSGQYFVELRVGTPAKKFPLIIDTGSDLTWIQCNPPNTTANSSSPPAPWYD 73

Query: 105 PSRSSSYAYVPCDSEHCRYFPY---ARCSAYKHR-CIYTQLYLIGPETSVFVSTEQLTFK 160
            S SSSY  +PC  + C + P    + CS      C YT  Y     T+  ++ E ++ K
Sbjct: 74  KSSSSSYREIPCTDDECLFLPAPIGSSCSIKSPSPCDYTYGYSDQSRTTGILAYETISMK 133

Query: 161 NTDES----------TIHVQDVVFGCGF-STNRNF-KFSGIFGLGIGRSSLVSQ-----L 203
           +   S          TI +++V  GC   S   +F   SG+ GLG G  SL +Q     L
Sbjct: 134 SRKRSGKRAGNHKTRTIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTAL 193

Query: 204 NSSFSYCIGSLHDPDYLHNK-----LILGDGAIIDEGDATPL---QFIDGHYYITLEAIS 255
              FSYC+      DYL        L++G      +   TP+         YY+ +  ++
Sbjct: 194 GGIFSYCL-----VDYLRGSNASSFLVMGR-TRWRKLAHTPIVRNPAAQSFYYVNVTGVA 247

Query: 256 VDGRMLD-INPNIFKRDDSGG-GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYS 313
           VDG+ +D I  + +  D  G  G + DSGT +++L + AY  +   +   +   + +   
Sbjct: 248 VDGKPVDGIASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIP 307

Query: 314 WPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRY 373
              +LCY+  ++   KG P +   F+GGA + L  ++       +  C+A+   +  N  
Sbjct: 308 EGFELCYN--VTRMEKGMPKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQKVTTTN-- 363

Query: 374 VNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
              +++G + QQ +++ YD+ + +  F+   C
Sbjct: 364 -GSNILGNLLQQDHHIEYDLAKARIGFKWSPC 394


>gi|224144963|ref|XP_002325476.1| predicted protein [Populus trichocarpa]
 gi|222862351|gb|EEE99857.1| predicted protein [Populus trichocarpa]
          Length = 372

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 86/360 (23%), Positives = 161/360 (44%), Gaps = 30/360 (8%)

Query: 59  SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ--LG---TIFYPSRSSSYAY 113
           SL++  I +G P    +  +DTGS +LWV+C  C  C  +  LG   T++ P+ S S   
Sbjct: 25  SLYFAKIGLGNPSKDYYVQVDTGSDILWVNCIGCDKCPTKSDLGIKLTLYDPASSVSATR 84

Query: 114 VPCDSEHCRYFPYARCSAYKHR--CIYTQLYLIGPETSVFVSTEQLTFKNTD---ESTIH 168
           V CD + C           K    C Y  +Y  G  T+ +  ++ + F+      ++ + 
Sbjct: 85  VSCDDDFCTSTYNGLLPDCKKELPCQYNVVYGDGSSTAGYFVSDAVQFERVTGNLQTGLS 144

Query: 169 VQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGD 228
              V FGCG   +         GLG    +L   L  +F++C+      D ++   I   
Sbjct: 145 NGTVTFGCGAQQSG--------GLGTSGEALDGIL-GAFAHCL------DNVNGGGIFAI 189

Query: 229 GAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
           G ++  + + TP+     HY + ++ I V G +L++  ++F   D   G +IDSGT + +
Sbjct: 190 GELVSPKVNTTPMVPNQAHYNVYMKEIEVGGTVLELPTDVFDSGDR-RGTIIDSGTTLAY 248

Query: 288 LVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALE 347
           L +  Y+++ +E+  +  G  + +      +C+    + D  GFP ++FHF+    L + 
Sbjct: 249 LPEVVYDSMMNEIRSQQPGLSLHTVE-EQFICFKYSGNVD-DGFPDIKFHFKDSLTLTVY 306

Query: 348 KDSMFYQPRPDAFCMAVNPASINNR-YVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
                +Q   D +C       + ++   + +L+G +      V YDI  +   +   +C+
Sbjct: 307 PHDYLFQISEDIWCFGWQNGGMQSKDGRDMTLLGDLVLSNKLVLYDIENQAIGWTEYNCK 366


>gi|226531872|ref|NP_001147022.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195606574|gb|ACG25117.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 491

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 105/392 (26%), Positives = 163/392 (41%), Gaps = 51/392 (13%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHC---YPCRDCSPQLGT---IFYPSRSSSYAYV 114
           +    S+G PP P    +DTGS L WV C   Y CR+CS    +   +F+P  SSS   V
Sbjct: 99  YAFTASLGTPPQPLPVLLDTGSHLTWVPCTSSYECRNCSSPSASAVPVFHPKNSSSSRLV 158

Query: 115 PCDSEHCRYFPYAR----------CSAYKHRCIYTQLYLIGPETSVF--VSTEQLTFKNT 162
            C +  C++   A           CS     C      +  P   V+   ST  L   +T
Sbjct: 159 GCRNPSCQWVHSAANLATKCRRAPCSPGAANCPAAASNVCPPYAVVYGSGSTAGLLIADT 218

Query: 163 DESTIH-VQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYL 220
             +    V   V GC   +      SG+ G G G  S+ +QL    FSYC+ S    D  
Sbjct: 219 LRAPGRAVPGFVLGCSLVSVHQ-PPSGLAGFGRGAPSVPAQLGLPKFSYCLLSRRFDD-- 275

Query: 221 HNKLILGDGAIIDEGDATPLQFI-------------DGHYYITLEAISVDGRMLDINPNI 267
            N  + G   +   G    +Q++               +YY+ L  ++V G+ + +    
Sbjct: 276 -NAAVSGSLVLGGTGGGEGMQYVPLVKSAAGDKLPYGVYYYLALRGVTVGGKAVRLPARA 334

Query: 268 FKRDDSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL----CYHG 322
           F  + +G GG ++DSGT  T+L    ++ + D V+  + G   RS    D L    C+  
Sbjct: 335 FAGNAAGSGGTIVDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDAEDGLGLHPCFAL 394

Query: 323 IMSSDLKGFPTVRFHFRGGAKLALEKDSMFY---QPRPDAFCMAV------NPASINNRY 373
              +     P + FHF GGA + L  ++ F    +   +A C+AV         + N   
Sbjct: 395 PQGARSMALPELSFHFEGGAVMQLPVENYFVVAGRGAVEAICLAVVTDFGGGSGAGNEGS 454

Query: 374 VNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
               ++G   QQ Y V YD+ +++  F+R  C
Sbjct: 455 GPAIILGSFQQQNYLVEYDLEKERLGFRRQSC 486


>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
 gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 102/388 (26%), Positives = 170/388 (43%), Gaps = 58/388 (14%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
           N    V+++ G P       +DTGS L W+HC       P   +IF P  S +Y  +PC 
Sbjct: 64  NVTLTVSLTAGTPLQNITMVLDTGSELSWLHCKK----EPNFNSIFNPLASKTYTKIPCS 119

Query: 118 SEHC----RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
           S  C    R  P          C +   Y     +SV      L F+     ++     V
Sbjct: 120 SPTCETRTRDLPLPVSCDPAKLCHFIISY--ADASSV---EGNLAFETFRVGSVTGPATV 174

Query: 174 FGC---GFSTN--RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILG 227
           FGC   GFS+N   + K +G+ G+  G  S V+Q+    FSYCI    D D     L+LG
Sbjct: 175 FGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVNQMGFRKFSYCIS---DRDS-SGVLLLG 230

Query: 228 DGA-----------IIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGG 275
           + +           +++   +TPL + D   Y + LE I V  ++L +  ++F  D +G 
Sbjct: 231 EASFSWLKPLNYTPLVEM--STPLPYFDRVAYSVQLEGIRVSDKVLSLPKSVFVPDHTGA 288

Query: 276 G-VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMS--------- 325
           G  M+DSGT  T+L+   Y AL+ E +++ +G  +R  + P +  + G M          
Sbjct: 289 GQTMVDSGTQFTFLLGPVYSALKQEFLLQTKG-VLRVLNEP-RYVFQGAMDLCYLIEPTR 346

Query: 326 SDLKGFPTVRFHFRGGAKLALEKDSMFYQP------RPDAFCMAVNPASINNRYVNFSLI 379
           + L   P V   FR GA++++    + Y+       +   +C     +  ++  +   +I
Sbjct: 347 AALPNLPVVNLMFR-GAEMSVSGQRLLYRVPGEVRGKDSVWCFTFGNS--DSLGIESFVI 403

Query: 380 GMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
           G   QQ   + YD+ + +  F  + C++
Sbjct: 404 GHHQQQNVWMEYDLEKSRIGFAEVRCDL 431


>gi|222640709|gb|EEE68841.1| hypothetical protein OsJ_27628 [Oryza sativa Japonica Group]
          Length = 375

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 94/353 (26%), Positives = 154/353 (43%), Gaps = 50/353 (14%)

Query: 73  PQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAY 132
           P+   +DTGS L+W  C                S S++ A         R  P AR  A+
Sbjct: 52  PRKLIVDTGSDLIWTQCKL--------------SSSTAAAARHGSPPLSRTAP-ARTGAF 96

Query: 133 KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG-FSTNRNFKFSGIFG 191
              C  +    +G      +++E  TF      ++ +    FGCG  S       +GI G
Sbjct: 97  TRTCTASAAA-VG-----VLASETFTFGARRAVSLRLG---FGCGALSAGSLIGATGILG 147

Query: 192 LGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT-PLQFI------ 243
           L     SL++QL    FSYC+    D     + L+ G  A +     T P+Q        
Sbjct: 148 LSPESLSLITQLKIQRFSYCLTPFADKKT--SPLLFGAMADLSRHKTTRPIQTTAIVSNP 205

Query: 244 --DGHYYITLEAISVDGRMLDI-NPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEV 300
               +YY+ L  IS+  + L +   ++  R D GGG ++DSG+ V +LV+ A+EA+++ V
Sbjct: 206 VETVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAVKEAV 265

Query: 301 M--IRLEGEQMRSYSWPDKLCY-----HGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFY 353
           M  +RL         +  +LC+         + +    P +  HF GGA + L +D+ F 
Sbjct: 266 MDVVRLPVANRTVEDY--ELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDNYFQ 323

Query: 354 QPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
           +PR    C+AV   +  +     S+IG + QQ  +V +D+   + +F    C+
Sbjct: 324 EPRAGLMCLAVGKTTDGS---GVSIIGNVQQQNMHVLFDVQHHKFSFAPTQCD 373


>gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 449

 Score =  102 bits (253), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 101/426 (23%), Positives = 164/426 (38%), Gaps = 33/426 (7%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           L HRD++         NP  R++  I     R + +  K +     +  +    D   + 
Sbjct: 35  LAHRDTLW-------PNPLSRIEDIIGADQKRHSLISRKRKFKGGVKMDLGSGIDYGTAQ 87

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHC-YPCRDCSP-QLGTIFYPSRSSSYAYVPCDS 118
           ++  + +G P       +DTGS L WV+C Y  R     +   +F    S S+  V C +
Sbjct: 88  YFTEVRVGTPAKKFRVVVDTGSELTWVNCRYRGRGKGKVKNRRVFRAEESKSFKTVGCFT 147

Query: 119 EHCR-----YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
           + C+      F  + C      C Y   Y  G       + E +T   T+     ++ ++
Sbjct: 148 QTCKVDLMNLFSLSTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRKARLRGLL 207

Query: 174 FGCGFSTNRNFKFS--GIFGLGIGRSSLVSQLNSSF----SYCIGSLHDPDYLHNKLILG 227
            GC  S +        G+ GL     S  S   S F    SYC+        + N LI G
Sbjct: 208 VGCSSSFSGQSFQGADGVLGLAFSDFSFTSTATSLFGAKLSYCLVDHLSNKNISNYLIFG 267

Query: 228 -----DGAIIDEGDATP--LQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
                       G  TP  L  I   Y I +  IS+   MLDI   ++    +GGG ++D
Sbjct: 268 YSSSSTSTKTAPGRTTPLDLTLIPPFYAINIIGISIGDDMLDIPTQVWDA-TTGGGTILD 326

Query: 281 SGTDVTWLVKEAYEALRDEVMIRL-EGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFR 339
           SGT +T L + AY+ +   +   L E ++++    P + C+      +    P + FH +
Sbjct: 327 SGTSLTLLAEAAYKPVVTGLARYLVELKRVKPEGIPIEYCFSSTSGFNESKLPQLTFHLK 386

Query: 340 GGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKT 399
           GGA+    + S      P   C+    A      V    +G + QQ Y   +D+     +
Sbjct: 387 GGARFEPHRKSYLVDAAPGVKCLGFMSAGTPATNV----VGNIMQQNYLWEFDLMASTLS 442

Query: 400 FQRMDC 405
           F    C
Sbjct: 443 FAPSTC 448


>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 531

 Score =  102 bits (253), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 118/428 (27%), Positives = 173/428 (40%), Gaps = 51/428 (11%)

Query: 1   LIHRDSILSPYNNPN----ENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDI 56
           L HR    SP         E   HR Q        R AY+Q K          +  S+  
Sbjct: 132 LHHRHGPCSPLPTKKMPTLEETLHRDQ-------LRAAYIQRKFSGGGGAGGDVQRSDAT 184

Query: 57  S--------NSLFY-VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSR 107
                    N+L Y + + +G P   Q   +DTGS + WV C PC  C  Q   +F PS 
Sbjct: 185 VPTALGTSLNTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSS 244

Query: 108 SSSYAYVPCDSEHCRYFPYA--RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES 165
           SS+Y+   C S  C         CS+   +C Y   Y  G  T+   S++ L   ++   
Sbjct: 245 SSTYSPFSCGSADCAQLGQEGNGCSS-SSQCQYIVTYGDGSSTTGTYSSDTLALGSS--- 300

Query: 166 TIHVQDVVFGCG-FSTNRNFKFSGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYL 220
              V+   FGC    +  N +  G+ GLG G  SLVSQ    L  +FSYC+         
Sbjct: 301 --AVRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGF 358

Query: 221 HNKLILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
              L    G+       TP+     +   Y + L+AI V GR L I  ++F       G 
Sbjct: 359 LT-LGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFS-----AGT 412

Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFH 337
           ++DSGT +T L   AY AL       ++       S     C+     S +   P+V   
Sbjct: 413 VMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVS-IPSVALV 471

Query: 338 FRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
           F GGA ++L+   +       + C+A    + N+   +  +IG + Q+ + V YD+GR  
Sbjct: 472 FSGGAVVSLDASGIIL-----SNCLAF---AGNSDDSSLGIIGNVQQRTFEVLYDVGRGV 523

Query: 398 KTFQRMDC 405
             F+   C
Sbjct: 524 VGFRAGAC 531


>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
          Length = 469

 Score =  102 bits (253), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 97/352 (27%), Positives = 140/352 (39%), Gaps = 42/352 (11%)

Query: 72  VPQFTAMDTGSSLLWVHCYPCRD--CSPQLGTIFYPSRSSSYAYVPCDSEHCRYF-PYAR 128
           V Q   +DT S + WV C PC    C PQ   ++ P++SSS     C+S  C    PYA 
Sbjct: 142 VTQTMVLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYAN 201

Query: 129 CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF-- 186
                ++C Y   Y  G  T+    ++ LT          V+   FGC      +F F  
Sbjct: 202 GCTNNNQCQYRVRYPDGTSTAGTYISDLLTIT----PATAVRSFQFGCSHGVQGSFSFGS 257

Query: 187 --SGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIID---------EG 235
             +GI  LG G  SLVSQ  +++         P        LG   +           + 
Sbjct: 258 SAAGIMALGGGPESLVSQTAATYGRVFSHCFPPPTRRGFFTLGVPRVAAWRYVLTPMLKN 317

Query: 236 DATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEA 295
            A P  F    Y + LEAI+V G+ + + P +F       G  +DS T +T L   AY+A
Sbjct: 318 PAIPPTF----YMVRLEAIAVAGQRIAVPPTVF-----AAGAALDSRTAITRLPPTAYQA 368

Query: 296 LRDEVMIRLEGEQMRSYSWPDKLCY--HGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFY 353
           LR     R+   Q      P   CY   G+ S  L   P +   F   A + L+   + +
Sbjct: 369 LRQAFRDRMAMYQPAPPKGPLDTCYDMAGVRSFAL---PRITLVFDKNAAVELDPSGVLF 425

Query: 354 QPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           Q      C+A   A  N++     +IG +  Q   V Y+I      F+   C
Sbjct: 426 Q-----GCLAFT-AGPNDQVPG--IIGNIQLQTLEVLYNIPAALVGFRHAAC 469


>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
 gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
 gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 494

 Score =  102 bits (253), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 97/352 (27%), Positives = 140/352 (39%), Gaps = 42/352 (11%)

Query: 72  VPQFTAMDTGSSLLWVHCYPCRD--CSPQLGTIFYPSRSSSYAYVPCDSEHCRYF-PYAR 128
           V Q   +DT S + WV C PC    C PQ   ++ P++SSS     C+S  C    PYA 
Sbjct: 167 VTQTMVLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYAN 226

Query: 129 CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF-- 186
                ++C Y   Y  G  T+    ++ LT          V+   FGC      +F F  
Sbjct: 227 GCTNNNQCQYRVRYPDGTSTAGTYISDLLTIT----PATAVRSFQFGCSHGVQGSFSFGS 282

Query: 187 --SGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIID---------EG 235
             +GI  LG G  SLVSQ  +++         P        LG   +           + 
Sbjct: 283 SAAGIMALGGGPESLVSQTAATYGRVFSHCFPPPTRRGFFTLGVPRVAAWRYVLTPMLKN 342

Query: 236 DATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEA 295
            A P  F    Y + LEAI+V G+ + + P +F       G  +DS T +T L   AY+A
Sbjct: 343 PAIPPTF----YMVRLEAIAVAGQRIAVPPTVF-----AAGAALDSRTAITRLPPTAYQA 393

Query: 296 LRDEVMIRLEGEQMRSYSWPDKLCY--HGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFY 353
           LR     R+   Q      P   CY   G+ S  L   P +   F   A + L+   + +
Sbjct: 394 LRQAFRDRMAMYQPAPPKGPLDTCYDMAGVRSFAL---PRITLVFDKNAAVELDPSGVLF 450

Query: 354 QPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           Q      C+A   A  N++     +IG +  Q   V Y+I      F+   C
Sbjct: 451 Q-----GCLAFT-AGPNDQVPG--IIGNIQLQTLEVLYNIPAALVGFRHAAC 494


>gi|222616654|gb|EEE52786.1| hypothetical protein OsJ_35257 [Oryza sativa Japonica Group]
          Length = 346

 Score =  102 bits (253), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 99/363 (27%), Positives = 161/363 (44%), Gaps = 37/363 (10%)

Query: 63  VNISIGQPPVPQFTAMDTGSSLLWVHCYPCR----DCSPQLGTIFYPSRSSSYAYVPCDS 118
           + IS+G PPV     +DTGS+L WV C  C+    D + + G IF P  SS+Y+ V C +
Sbjct: 1   MGISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSKVGCST 60

Query: 119 EHCR------YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
           E C          Y  C      CIY+  Y  G  +  ++  ++LT      S   + + 
Sbjct: 61  EACNGMHMDLAVEYG-CVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLA----SNRSIDNF 115

Query: 173 VFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-----SSFSYCIGSLHDPDYLHNKLILG 227
           +FGCG     N   +GI G G    S  +Q+      ++FSYC    H+ +     L +G
Sbjct: 116 IFGCGEDNLYNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPRDHENE---GSLTIG 172

Query: 228 DGAIIDEGDATPLQFIDGH--YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDV 285
             A       T L + D    Y I    + V+G  L+I+P I+    +    ++DSGT  
Sbjct: 173 PYARDINLMWTKLIYYDHKPAYAIQQLDMMVNGIRLEIDPYIYISKMT----IVDSGTAD 228

Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPD-KLCY-HGIMSSDLKGFPTVRFHF-RGGA 342
           T+++   ++AL D+ M +    +  +  W + ++C+     S++   FPTV     R   
Sbjct: 229 TYILSPVFDAL-DKAMTKEMQAKGYTRGWDERRICFISNSGSANWNDFPTVEMKLIRSTL 287

Query: 343 KLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
           KL +E  + FY+   +  C    P     R V   ++G  A + + + +DI      F+ 
Sbjct: 288 KLPVE--NAFYESSNNVICSTFLPDDAGVRGV--QMLGNRAVRSFKLVFDIQAMNFGFKA 343

Query: 403 MDC 405
             C
Sbjct: 344 RAC 346


>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
 gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
          Length = 385

 Score =  101 bits (252), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 109/393 (27%), Positives = 164/393 (41%), Gaps = 40/393 (10%)

Query: 32  RLAYLQEKIRSHNTYQAQILPSNDIS--------NSLFY-VNISIGQPPVPQFTAMDTGS 82
           R AY+Q K          +  S+           N+L Y + + +G P   Q   +DTGS
Sbjct: 14  RAAYIQRKFSGGGGAGGDVQRSDATVPTALGTSLNTLEYLITVGLGSPATSQTMLIDTGS 73

Query: 83  SLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYA--RCSAYKHRCIYTQ 140
            + WV C PC  C  Q   +F PS SS+Y+   C S  C         CS+   +C Y  
Sbjct: 74  DVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSADCAQLGQEGNGCSS-SSQCQYIV 132

Query: 141 LYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG-FSTNRNFKFSGIFGLGIGRSSL 199
            Y  G  T+   S++ L   ++      V+   FGC    +  N +  G+ GLG G  SL
Sbjct: 133 TYGDGSSTTGTYSSDTLALGSS-----AVRSFQFGCSNVESGFNDQTDGLMGLGGGAQSL 187

Query: 200 VSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLE 252
           VSQ    L  +FSYC+            L    G+       TP+     +   Y + L+
Sbjct: 188 VSQTAGTLGRAFSYCLPPTPSSSGFLT-LGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQ 246

Query: 253 AISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSY 312
           AI V GR L I  ++F       G ++DSGT +T L   AY AL       ++       
Sbjct: 247 AIRVGGRQLSIPASVFS-----AGTVMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQP 301

Query: 313 SWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR 372
           S     C+     S +   P+V   F GGA ++L+   +       + C+A    + N+ 
Sbjct: 302 SGILDTCFDFSGQSSVS-IPSVALVFSGGAVVSLDASGIIL-----SNCLAF---AGNSD 352

Query: 373 YVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
             +  +IG + Q+ + V YD+GR    F+   C
Sbjct: 353 DSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 385


>gi|223973231|gb|ACN30803.1| unknown [Zea mays]
          Length = 459

 Score =  101 bits (252), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 113/432 (26%), Positives = 182/432 (42%), Gaps = 59/432 (13%)

Query: 29  SIARLAYLQEKIRSHNTYQAQI-LPSNDISNSLF-------YVNISIGQPPVPQFTAMDT 80
           S+AR  +L+ +  +H++ +     PS   + +L+           S+G PP P    +DT
Sbjct: 27  SLARALHLKRRDPNHHSQKGSGGHPSVPATAALYPHSYGGYAFTASLGTPPQPLPVLLDT 86

Query: 81  GSSLLWVHC---YPCRDCSPQLGT---IFYPSRSSSYAYVPCDSEHCRYFPYAR------ 128
           GS L WV C   Y CR+CS    +   +F+P  SSS   V C +  C++   A       
Sbjct: 87  GSHLTWVPCTSSYECRNCSSPSASAVPVFHPKNSSSSRLVGCRNPSCQWVHSAANLATKC 146

Query: 129 ----CSAYKHRCIYTQLYLIGPETSVF--VSTEQLTFKNTDESTIH-VQDVVFGCGFSTN 181
               CS     C      +  P   V+   ST  L   +T  +    V   V GC   + 
Sbjct: 147 RRAPCSPGAANCPAAASNVCPPYAVVYGSGSTAGLLIADTLRAPGRAVPGFVLGCSLVSV 206

Query: 182 RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPL 240
                SG+ G G G  S+ +QL    FSYC+ S    D   N  + G   +   G    +
Sbjct: 207 HQ-PPSGLAGFGRGAPSVPAQLGLPKFSYCLLSRRFDD---NAAVSGSLVLGGTGGGEGM 262

Query: 241 QFI-------------DGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDVT 286
           Q++               +YY+ L  ++V G+ + +    F  + +G GG ++DSGT  T
Sbjct: 263 QYVPLVKSAAGDKLPYGVYYYLALRGVTVGGKAVRLPARAFAANAAGSGGTIVDSGTTFT 322

Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL----CYHGIMSSDLKGFPTVRFHFRGGA 342
           +L    ++ + D V+  + G   RS    D+L    C+     +     P + FHF GGA
Sbjct: 323 YLDPTVFQPVADAVVAAVGGRYKRSKDAEDELGLHPCFALPQGARSMALPELSFHFEGGA 382

Query: 343 KLALEKDSMFY---QPRPDAFCMAV------NPASINNRYVNFSLIGMMAQQFYNVGYDI 393
            + L  ++ F    +   +A C+AV         + N       ++G   QQ Y V YD+
Sbjct: 383 VMQLPVENYFVVAGRGAVEAICLAVVTDFSGGSGAGNEGSGPAIILGSFQQQNYLVEYDL 442

Query: 394 GRKQKTFQRMDC 405
            +++  F+R  C
Sbjct: 443 EKERLGFRRQSC 454


>gi|255571584|ref|XP_002526738.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223533927|gb|EEF35652.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 457

 Score =  101 bits (252), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 119/432 (27%), Positives = 191/432 (44%), Gaps = 54/432 (12%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDIS--N 58
           L+H  S  SP+  PN   A   Q  I  S AR   ++  I S N   +   P + +S  +
Sbjct: 40  LLHWLSTESPFYEPNLTLAELTQASIRTSGARGDSIRS-IMSGNITSSMKYPISRMSYTD 98

Query: 59  SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYP--CRDCSPQLGTIFYPSRSSSYAYVPC 116
             + +  SIG P V  +   D+GSSL+W+ C    CR+C  Q   +F PS+S +Y    C
Sbjct: 99  KAYVMKFSIGSPAVDTYAIPDSGSSLVWLQCGTPYCRNCYRQKIPLFNPSKSVTYMKRLC 158

Query: 117 DSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTF-KNTDESTIHVQDV 172
           ++  CR      Y RC      C Y + YL    T   +ST+  TF ++      +   +
Sbjct: 159 NTAECRVALGDEYWRCKKPNQICKYHEDYLDDSYTEGVISTDIFTFPEHISGFGNYTLRI 218

Query: 173 VFGCGFSTN--RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLIL--G 227
           +FGCG++ +  ++F   G+ GL   ++SLV Q++   FSYC+ S+     L   + +  G
Sbjct: 219 IFGCGYNNSDPQHFYPPGLVGLTNNKASLVGQMDVDQFSYCV-SIDTEQNLKGSMEIRFG 277

Query: 228 DGAIIDEGDATPLQFIDGHY-YITLEAISVDGRMLDINPN-IFKRDDSG-GGVMIDSGTD 284
             A I       +   DG Y +  ++ I V+   ++  P  +FK  + G GG+ +D+GT 
Sbjct: 278 LAASISGHSTQLVPNSDGWYIFKNVDGIYVNEFEVEGYPAWVFKYTEGGQGGLTMDTGTT 337

Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD------KLCYHGIMSSDLKG--FPTVRF 336
            T    E + ++ D  +I+L  E +      D      +LCY    S D  G   P +  
Sbjct: 338 YT----ELHNSVMDP-LIKLLEEHITIVPEKDYSNSGFELCY---FSDDFLGATLPDIEL 389

Query: 337 HFRGGAKLALEKDSMFYQPRPDAF--------CMAVNPASINNRYVNFSLIGMMAQQFYN 388
            F         KD+ F     +A+        C+A+       R    S+IGM   +   
Sbjct: 390 RFTD------NKDTYFSFNTRNAWTPNGRSQMCLAM------FRTNGMSIIGMHQLRDIK 437

Query: 389 VGYDIGRKQKTF 400
           +GYD+     +F
Sbjct: 438 IGYDLHHNIVSF 449


>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
 gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
          Length = 368

 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 108/365 (29%), Positives = 154/365 (42%), Gaps = 49/365 (13%)

Query: 78  MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYAR-------CS 130
           +DTGS  + V C        +   +F P+ S SY  VPC S+ C              C 
Sbjct: 16  IDTGSEAVLVQC------GSRSRPVFDPAASQSYRQVPCISQLCLAVQQQTSNGSSQPCV 69

Query: 131 AYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ--DVVFGCGFSTNR---NFK 185
                C Y+  Y     ++   S + +   +T+ S+  VQ  DV FGC  S      +  
Sbjct: 70  NSSAACTYSLSYGDSRNSTGDFSQDVIFLNSTNSSSQAVQFRDVAFGCAHSPQGFLVDLG 129

Query: 186 FSGIFGLGIGRSSLVSQLN-----SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDA-TP 239
             GI G   G  SL SQL      S FSYC  S          + LGD  +     + TP
Sbjct: 130 SLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQPWQPRATGVIFLGDSGLSKSKVSYTP 189

Query: 240 LQFIDGH--------YYITLEAISVDGRMLDINPNIFKRDDSG--GGVMIDSGTDVTWLV 289
           L  +D          YY+ L +ISVDG+ L I  + FK D S   GG ++DSGT  T +V
Sbjct: 190 L--LDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPSTGDGGTVLDSGTTFTRVV 247

Query: 290 KEAYEALRDEVMIR----LEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLA 345
            +AY A R+         L  +   +  + D  CY+    S L G P VR   +   +L 
Sbjct: 248 DDAYTAFRNAFAASNRSGLRKKVGAAAGFDD--CYNISAGSSLPGVPEVRLSLQNNVRLE 305

Query: 346 LEKDSMFYQPRPDA-----FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTF 400
           L  + +F  P   A      C+A+  +S  + +   +++G   Q  Y V YD  R +  F
Sbjct: 306 LRFEHLFV-PVSAAGNEVTVCLAI-LSSQKSGFGKINVLGNYQQSNYLVEYDNERSRVGF 363

Query: 401 QRMDC 405
           +R DC
Sbjct: 364 ERADC 368


>gi|357159298|ref|XP_003578403.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 442

 Score =  101 bits (251), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 103/367 (28%), Positives = 157/367 (42%), Gaps = 41/367 (11%)

Query: 67  IGQPPVPQFTAMDTGSSLLWVHCYPC---RDCSPQLGTIFYPSRSSSYAYVPC--DSEHC 121
           IG PP      +DTGS L+W  C      + C+ Q    +  S+SS++  VPC   +  C
Sbjct: 92  IGSPPQRTEALIDTGSDLIWTQCATTCLPKSCAKQGLPYYNLSQSSTFVPVPCADKAGFC 151

Query: 122 RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC----G 177
                  C      C +   Y  G      + TE   F++   S      + FGC     
Sbjct: 152 AANGVHLC-GLDGSCTFIASYGAGRVIGS-LGTESFAFESGTTS------LAFGCVSLTR 203

Query: 178 FSTNRNFKFSGIFGLGIGRSSLVSQLNSS-FSYCIGS-LHDPDYLHNKLILGDGAIIDEG 235
            ++      SG+ GLG GR SLVSQ+ ++ FSYC+    H      +  +    ++   G
Sbjct: 204 ITSGALNDASGLIGLGRGRLSLVSQIGATRFSYCLTPYFHSSGASSHLFVGASASLGGGG 263

Query: 236 DATPLQFIDG--------HYYITLEAISV-DGRMLDINPNIFK-----RDDSGGGVMIDS 281
            + P  F+           YY+ LE I+V   R+  +N   F+     +    GGV+ID+
Sbjct: 264 ASMP--FVKSPKDYPYSTFYYLPLEGITVGKTRLPAVNSTTFQLRQLFKGYWAGGVIIDT 321

Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
           G+ +T L   AYEAL++EV  +L    +        L          K  P + FHF GG
Sbjct: 322 GSPLTQLASHAYEALKEEVAAQLGNGSLVPAPEDSGLELCVAREGFQKVVPALVFHFGGG 381

Query: 342 AKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQ 401
           A +A+   S +      A CM +     +      S+IG   QQ  ++ YD+ R + +FQ
Sbjct: 382 ADMAVPAASYWAPVDKAAACMMILEGGYD------SIIGNFQQQDMHLLYDLRRGRFSFQ 435

Query: 402 RMDCEVL 408
             DC +L
Sbjct: 436 TADCTML 442


>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 503

 Score =  101 bits (251), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 106/359 (29%), Positives = 152/359 (42%), Gaps = 32/359 (8%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYVPCDSE 119
           + V I +G PP       DTGS   WV C PC   C  Q   +F P++SS+YA V C   
Sbjct: 163 YVVPIGLGTPPSRFTVVFDTGSDTTWVQCRPCVVSCYKQKDRLFDPAKSSTYANVSCADP 222

Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
            C     + C+A    C+Y   Y  G  T  F + + L           ++   FGCG  
Sbjct: 223 ACADLDASGCNA--GHCLYGIQYGDGSYTVGFFAKDTLAVAQD-----AIKGFKFGCG-E 274

Query: 180 TNRNF--KFSGIFGLGIGRSSLVSQL----NSSFSYCI-GSLHDPDYLHNKLILGDGAII 232
            NR    + +G+ GLG G +S+  Q       SFSYC+  S     YL    +    +  
Sbjct: 275 KNRGLFGQTAGLLGLGRGPTSITVQAYEKYGGSFSYCLPASSAATGYLEFGPLSPSSSGS 334

Query: 233 DEGDATPLQFIDG--HYYITLEAISVDGRMLDINP-NIFKRDDSGGGVMIDSGTDVTWLV 289
           +    TP+    G   YY+ L  I V G+ L   P ++F    S  G ++DSGT +T L 
Sbjct: 335 NA-KTTPMLTDKGPTFYYVGLTGIRVGGKQLGAIPESVF----SNSGTLVDSGTVITRLP 389

Query: 290 KEAYEALRDEVMIRLEG---EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
             AY AL       +     ++  +YS  D  CY     S +   PTV   F+GGA L L
Sbjct: 390 DTAYAALSSAFAAAMAASGYKKAAAYSILDT-CYDFTGLSQVS-LPTVSLVFQGGACLDL 447

Query: 347 EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           +   + Y       C+     + N    +  ++G   Q+ Y V YD+ +K   F    C
Sbjct: 448 DASGIVYAISQSQVCLGF---ASNGDDESVGIVGNTQQRTYGVLYDVSKKVVGFAPGAC 503


>gi|297819968|ref|XP_002877867.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323705|gb|EFH54126.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 469

 Score =  101 bits (251), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 122/463 (26%), Positives = 190/463 (41%), Gaps = 83/463 (17%)

Query: 8   LSPYNNPNENPAH---RVQRGINISIARLAYL---------QEKIRSHNTYQAQILPSND 55
           LSP+++ +++P      ++R    SIAR   L         +E + S  T  A ++ S+ 
Sbjct: 23  LSPFSHSDQSPKDPYLSLRRLAESSIARAHKLKHGTSIKPDEEALSSTATASATVVKSHL 82

Query: 56  ISNSL--FYVNISIGQPPVPQFTAMDTGSSLLWVHC---YPCRDCS-----PQLGTIFYP 105
              S   + V++S G P        DTGSSL+W  C   Y C DC+     P     F P
Sbjct: 83  SPKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVWFPCTSRYLCSDCNFSGLDPTQIPRFIP 142

Query: 106 SRSSSYAYVPCDSEHCRYFPYAR-----CSAYKHRCI-----YTQLYLIGPETSVFVSTE 155
             SSS   + C +  C++   A      C      C      Y   Y +G    + +S E
Sbjct: 143 KNSSSSRVIGCQNPKCQFLFGANVQCRGCDPNTRNCTVPCPPYILQYGLGSTAGILIS-E 201

Query: 156 QLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSL 214
           +L F +     + V D V GC   + R    +GI G G G  SL SQ+   SFS+C+ S 
Sbjct: 202 KLDFPD-----LTVPDFVVGCSVISTRT--PAGIAGFGRGPESLPSQMKLKSFSHCLVSR 254

Query: 215 H-DPDYLHNKLILGDGAIIDEGDATP---------------LQFIDGHYYITLEAISVDG 258
             D   +   L L  G+    G  TP                 F++ +YY+ L  I V  
Sbjct: 255 RFDDTNVTTDLGLDTGSGHKSGSKTPGLSYTPFRKNPNVSNTAFLE-YYYLNLRRIYVGS 313

Query: 259 RMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK 317
           + + I         +G GG ++DSG+  T++ +  +E + +E        QM +Y+    
Sbjct: 314 KHVKIPYKFLAPGTNGNGGSIVDSGSTFTFMERPVFELVAEEF-----ATQMSNYTREKD 368

Query: 318 L--------CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMF-YQPRPDAFCM------ 362
           L        C++     D+   P + F F+GGAK+ L   + F +    D  C+      
Sbjct: 369 LEKVSGIAPCFNISGKGDVT-VPELIFEFKGGAKMELPLSNYFSFVGNADTVCLTVVSDN 427

Query: 363 AVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
            VNP       +   ++G   QQ Y V YD+   +  F +  C
Sbjct: 428 TVNPGGGTGPAI---ILGSFQQQNYLVEYDLENDRFGFAKKKC 467


>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
          Length = 641

 Score =  101 bits (251), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 110/435 (25%), Positives = 181/435 (41%), Gaps = 79/435 (18%)

Query: 11  YNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQP 70
           Y N    PA   +RG+                HN      L  + ++N  +   + IG P
Sbjct: 54  YPNATRLPASSARRGLG-------------DGHNPNARMRLHDDLLTNGYYTTRLYIGTP 100

Query: 71  PVPQFTAMDTGSSLLWVHCYPCRDC------SPQL----GTIFYPSRSSSYAYVPCDSEH 120
                  +D+GS++ +V C  C  C      SP +       F P  SS+Y+ V C+ + 
Sbjct: 101 SQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLSSTYSPVKCNVD- 159

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
                   C   + +C Y + Y     +S  +  + ++F    ES +  Q  VFGC  +T
Sbjct: 160 ------CTCDNERSQCTYERQYAEMSSSSGVLGEDIMSFGK--ESELKPQRAVFGCE-NT 210

Query: 181 NRNFKFS----GIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLILGDGA 230
                FS    GI GLG G+ S++ QL      + SFS C G +           +G G 
Sbjct: 211 ETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMD----------VGGGT 260

Query: 231 IIDEGDATPLQFIDGH--------YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSG 282
           ++  G   P   +  H        Y I L+ I V G+ L ++P IF   +S  G ++DSG
Sbjct: 261 MVLGGMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIF---NSKHGTVLDSG 317

Query: 283 TDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK----LCYHGI---MSSDLKGFPTVR 335
           T   +L ++A+ A +D V  ++    ++    PD     +C+ G    +S   + FP V 
Sbjct: 318 TTYAYLPEQAFVAFKDAVTNKV--NSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVD 375

Query: 336 FHFRGGAKLALEKDSMFYQ--PRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
             F  G KL+L  ++  ++      A+C+ V      N     +L+G +  +   V YD 
Sbjct: 376 MVFGNGQKLSLSPENYLFRHSKVEGAYCLGV----FQNGKDPTTLLGGIVVRNTLVTYDR 431

Query: 394 GRKQKTFQRMDCEVL 408
             ++  F + +C  L
Sbjct: 432 HNEKIGFWKTNCSEL 446


>gi|147819672|emb|CAN76394.1| hypothetical protein VITISV_020864 [Vitis vinifera]
          Length = 507

 Score =  101 bits (251), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 106/421 (25%), Positives = 181/421 (42%), Gaps = 60/421 (14%)

Query: 38  EKIRSHNTYQ-AQILPSNDI---------SNSLFYVNISIGQPPVPQFTAMDTGSSLLWV 87
           + +R+H+T +  +IL + D+            L++  I IG P    +  +DTGS +LWV
Sbjct: 45  DALRAHDTRRHGRILSAVDLPLGGNGHPSEAGLYFAKIGIGTPSKDYYVQVDTGSDILWV 104

Query: 88  HCYPCRDCSPQ--LG---TIFYPSRSSSYAYVPCDSEHCRYF--PYARCSAYKHRCIYTQ 140
           +C  C  C  +  LG   T++    S++   V CD   C  +  P   C     +C+Y+ 
Sbjct: 105 NCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVGCDDNFCSLYDGPLPGCKP-GLQCLYSV 163

Query: 141 LYLIGPETSVFVSTEQLTFKNTD---ESTIHVQDVVFGCGFSTNRNFKFS-----GIFGL 192
           LY  G  T+ +   + + +       ++T     VVFGCG   +     S     GI G 
Sbjct: 164 LYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGF 223

Query: 193 GIGRSSLVSQLNSS------FSYC-----------IGSLHDPDYLHNKLILGDGAIIDEG 235
           G   SS++SQL SS      FS+C           IG + +P     + +L +  +I   
Sbjct: 224 GQANSSMLSQLASSGKVKKVFSHCLDNVDGGGIFAIGEVVEPKV---RFLLMNSVMI--- 277

Query: 236 DATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEA 295
               L     HY + ++ I V G  LD+  + F+  D   G +IDSGT + +  +E Y  
Sbjct: 278 --VVLFLSRAHYNVVMKEIEVGGDPLDVPSDAFESGDR-KGTIIDSGTTLAYFPQEVYVP 334

Query: 296 LRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQP 355
           L ++++ +    ++ +       C+    + D  GFPTV  HF     L +      +Q 
Sbjct: 335 LIEKILSQQPDLRLHTVEQA-FTCFDYTGNVD-DGFPTVTLHFDKSISLTVYPHEYLFQV 392

Query: 356 RPDAFCMAV-NPASINNRYVNFSLIGMMAQQFYNVGYDIGR-----KQKTFQRMDCEVLD 409
           +   +C+   N  +      + +L+G  AQ   + G  +G+      ++   R  C V +
Sbjct: 393 KEFEWCIGWQNSGAQTKDGKDLTLLGEDAQCTCHFGSCMGQYKVFPHRELISRPHCPVEE 452

Query: 410 D 410
           D
Sbjct: 453 D 453


>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
 gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
           Group]
 gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
          Length = 452

 Score =  101 bits (251), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 102/356 (28%), Positives = 151/356 (42%), Gaps = 48/356 (13%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSP---QLGTIFYPSRSSSYAYVPCD 117
           + +++ +G P V Q   +DTGS + WV C PC   SP     G +F P+ SS+YA   C 
Sbjct: 108 YVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFNCS 167

Query: 118 SEHCRYFPYA----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
           +  C     +     C A K RC Y   Y  G  T+   S++ LT   +D     V+   
Sbjct: 168 AAACAQLGDSGEANGCDA-KSRCQYIVKYGDGSNTTGTYSSDVLTLSGSDV----VRGFQ 222

Query: 174 FGCG---FSTNRNFKFSGIFGLGIGRSSLVSQLNS----SFSYCI-GSLHDPDYLH-NKL 224
           FGC         + K  G+ GLG    S VSQ  +    SF YC+  +     +L     
Sbjct: 223 FGCSHAELGAGMDDKTDGLIGLGGDAQSPVSQTAARYGKSFFYCLPATPASSGFLTLGAP 282

Query: 225 ILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDS 281
             G G        TP+   + +  +Y+  LE I+V G+ L ++P++F       G ++DS
Sbjct: 283 ASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLSPSVFA-----AGSLVDS 337

Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL-----CYHGIMSSDLKGFPTVRF 336
           GT +T L   AY AL            M  Y+  + L     C++     D    PTV  
Sbjct: 338 GTVITRLPPAAYAALSSAFR-----AGMTRYARAEPLGILDTCFN-FTGLDKVSIPTVAL 391

Query: 337 HFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
            F GGA + L+   +         C+A  P   +     F  IG + Q+ + V YD
Sbjct: 392 VFAGGAVVDLDAHGIV-----SGGCLAFAPTRDDKA---FGTIGNVQQRTFEVLYD 439


>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
 gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
          Length = 427

 Score =  101 bits (251), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 97/392 (24%), Positives = 174/392 (44%), Gaps = 45/392 (11%)

Query: 48  AQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYP---CRDCSPQLGTIFY 104
           ++++  + I +  ++V + +G P       +DTGS L W+ C P     + S      + 
Sbjct: 46  SRLVSGSSIGSGQYFVELRVGTPAKKFPLIVDTGSDLTWIQCNPPNTTANSSSPPAPWYD 105

Query: 105 PSRSSSYAYVPCDSEHCRYFPY---ARCS-AYKHRCIYTQLYLIGPETSVFVSTEQLTFK 160
            S SSSY  +PC  + C++ P    + CS      C YT  Y     T+  ++ E ++ K
Sbjct: 106 KSSSSSYREIPCTDDECQFLPAPIGSSCSITSPSPCDYTYGYSDQSRTTGILAYETISMK 165

Query: 161 NTDES----------TIHVQDVVFGCGF-STNRNF-KFSGIFGLGIGRSSLVSQ-----L 203
           +   S           I +++V  GC   S   +F   SG+ GLG G  SL +Q     L
Sbjct: 166 SRKRSGKRAGNHKTRRIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTAL 225

Query: 204 NSSFSYCIGSLHDPDYLHNK-----LILGDGAIIDEGDATPL---QFIDGHYYITLEAIS 255
              FSYC+      DYL        L++G          TP+         YY+ +  ++
Sbjct: 226 GGIFSYCL-----VDYLRGSNASSFLVMGRTHWRKLAH-TPIVRNPAAQSFYYVNVTGVA 279

Query: 256 VDGRMLD-INPNIFKRDDSGG-GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYS 313
           VDG+ +D I  + +  D  G  G + DSGT +++L + AY  +   +   +   + +   
Sbjct: 280 VDGKPVDGIASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIP 339

Query: 314 WPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRY 373
              +LCY+  ++   KG P +   F+GGA + L  ++       +  C+A+   +  N  
Sbjct: 340 EGFELCYN--VTRMEKGMPKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQKVTTTN-- 395

Query: 374 VNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
              +++G + QQ +++ YD+ + +  F+   C
Sbjct: 396 -GSNILGNLLQQDHHIEYDLAKARIGFKWSPC 426


>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
          Length = 642

 Score =  101 bits (251), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 110/435 (25%), Positives = 181/435 (41%), Gaps = 79/435 (18%)

Query: 11  YNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQP 70
           Y N    PA   +RG+                HN      L  + ++N  +   + IG P
Sbjct: 55  YPNATRLPASSARRGLG-------------DGHNPNARMRLHDDLLTNGYYTTRLYIGTP 101

Query: 71  PVPQFTAMDTGSSLLWVHCYPCRDC------SPQL----GTIFYPSRSSSYAYVPCDSEH 120
                  +D+GS++ +V C  C  C      SP +       F P  SS+Y+ V C+ + 
Sbjct: 102 SQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLSSTYSPVKCNVD- 160

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
                   C   + +C Y + Y     +S  +  + ++F    ES +  Q  VFGC  +T
Sbjct: 161 ------CTCDNERSQCTYERQYAEMSSSSGVLGEDIMSFGK--ESELKPQRAVFGCE-NT 211

Query: 181 NRNFKFS----GIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLILGDGA 230
                FS    GI GLG G+ S++ QL      + SFS C G +           +G G 
Sbjct: 212 ETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMD----------VGGGT 261

Query: 231 IIDEGDATPLQFIDGH--------YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSG 282
           ++  G   P   +  H        Y I L+ I V G+ L ++P IF   +S  G ++DSG
Sbjct: 262 MVLGGMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIF---NSKHGTVLDSG 318

Query: 283 TDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK----LCYHGI---MSSDLKGFPTVR 335
           T   +L ++A+ A +D V  ++    ++    PD     +C+ G    +S   + FP V 
Sbjct: 319 TTYAYLPEQAFVAFKDAVTNKV--NSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVD 376

Query: 336 FHFRGGAKLALEKDSMFYQ--PRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
             F  G KL+L  ++  ++      A+C+ V      N     +L+G +  +   V YD 
Sbjct: 377 MVFGNGQKLSLSPENYLFRHSKVEGAYCLGV----FQNGKDPTTLLGGIVVRNTLVTYDR 432

Query: 394 GRKQKTFQRMDCEVL 408
             ++  F + +C  L
Sbjct: 433 HNEKIGFWKTNCSEL 447


>gi|125536523|gb|EAY83011.1| hypothetical protein OsI_38231 [Oryza sativa Indica Group]
          Length = 469

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 112/398 (28%), Positives = 178/398 (44%), Gaps = 42/398 (10%)

Query: 34  AYLQEKIRSH-NTYQAQILPSNDISNSL--FYVNISIGQPPVPQFTAM-DTGSSLLWVHC 89
            +L++++R+  N  Q Q L     S +     +NI++G P     + + D  S  +W  C
Sbjct: 58  GFLKKQLRNRGNKQQQQQLGGEAASGAAPPLVINITVGTPVAQTVSGLVDITSYFVWAQC 117

Query: 90  YPCRDCS---PQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARC--------SAYKHRCIY 138
            PC   +   P   T F P+ S++++ +PC S+ C       C        +    RC  
Sbjct: 118 APCAAAAGCLPPPATAFRPNGSATFSPLPCSSDMCLPVLRETCGRAGAAANATAGARCDS 177

Query: 139 TQLYLIG--PETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFK-FSGIFGLGIG 195
             L   G    TS +++T+  TF  T      V  VVFGC  ++  +F   SG+ G+G G
Sbjct: 178 YSLTYGGSAANTSGYLATDTFTFGAT-----AVPGVVFGCSDASYGDFAGASGVIGIGRG 232

Query: 196 RSSLVSQLN-SSFSYCIGSLHDPD--YLHNKLILGDGAI--IDEGDATPL---QFIDGHY 247
             SL+SQL    FSY + +    D     + +  GD A+     G +TPL         Y
Sbjct: 233 NLSLISQLQFGKFSYQLLAPEATDDGSADSVIRFGDDAVPKTKRGQSTPLLSSTLYPDFY 292

Query: 248 YITLEAISVDGRMLDINP-NIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLE 305
           Y+ L  + VDG  LD  P   F  R +  GGV++ S T VT+L + AY+ +R  V  R+ 
Sbjct: 293 YVNLTGVRVDGNRLDAIPAGTFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAAVASRIG 352

Query: 306 GEQMR-SYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAF-CMA 363
              +  S +    LCY+    + +K  P +   F GGA + L   + FY        C+ 
Sbjct: 353 LPAVNGSAALELDLCYNASSMAKVK-VPKLTLVFDGGADMDLSAANYFYIDNDTGLECLT 411

Query: 364 VNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQ 401
           + P+         S++G + Q   N+ YD+   + TF+
Sbjct: 412 MLPSQ------GGSVLGTLLQTGTNMIYDVDAGRLTFE 443


>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
          Length = 464

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 100/358 (27%), Positives = 138/358 (38%), Gaps = 38/358 (10%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYVPCDSE 119
           + V I IG P        DTGS L W  C PC   C  Q    F PS SS+Y  V C S 
Sbjct: 132 YIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSTYQNVSCSSP 191

Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
            C       CSA    C+Y+  Y     T  F++ E+ T  N+D     ++DV FGCG +
Sbjct: 192 MCE--DAESCSA--SNCVYSIGYGDKSFTQGFLAKEKFTLTNSDV----LEDVYFGCGEN 243

Query: 180 TNRNFKFSGIFGLGIGR-----SSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
               F                 +   +  N+ FSYC+ S       H  L  G   I + 
Sbjct: 244 NQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTSNSTGH--LTFGSAGISES 301

Query: 235 GDATPLQFIDG--HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEA 292
              TP+       +Y I +  ISV  + L I PN F  +    G +IDSGT  T L  + 
Sbjct: 302 VKFTPISSFPSAFNYGIDIIGISVGDKELAITPNSFSTE----GAIIDSGTVFTRLPTKV 357

Query: 293 YEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMF 352
           Y  LR     ++   +  S       CY      D   +PT+ F F GG  + L+   + 
Sbjct: 358 YAELRSVFKEKMSSYKSTSGYGLFDTCYD-FTGLDTVTYPTIAFSFAGGTVVELDGSGIS 416

Query: 353 YQPRPDAFCMAVN-----PASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
              +    C+A       PA          + G + Q   +V YD+   +  F    C
Sbjct: 417 LPIKISQVCLAFAGNDDLPA----------IFGNVQQTTLDVVYDVAGGRVGFAPNGC 464


>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 456

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 110/395 (27%), Positives = 160/395 (40%), Gaps = 54/395 (13%)

Query: 32  RLAYLQEKIRSHNTYQAQILP-----SNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLW 86
           R  Y+Q K+   +  Q   L       + +    + + + IG P V Q   +DTGS + W
Sbjct: 95  RAKYIQRKLSGTDGLQPLDLTVPTTLGSALDTMEYVITVGIGSPAVTQTMMIDTGSDVSW 154

Query: 87  VHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGP 146
           V C      S    T+F PS+S++YA   C S  C              C Y   Y  G 
Sbjct: 155 VRCN-----STDGLTLFDPSKSTTYAPFSCSSAACAQLGNNGDGCSNSGCQYRVQYGDGS 209

Query: 147 ETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF---KFSGIFGLGIGRSSLVSQL 203
            T+   S++ L    +D     V D  FGC      +F   K  G+ GLG    SLVSQ 
Sbjct: 210 NTTGTYSSDTLALSASDT----VTDFHFGCSHH-EEDFDGEKIDGLMGLGGDAQSLVSQT 264

Query: 204 NS----SFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQF----IDGHYYITLEAIS 255
            +    SFSYC   L   +     L  G       G  T            Y + L+ IS
Sbjct: 265 AATYGKSFSYC---LPPTNRTSGFLTFGAPNGTSGGFVTTPMLRWPKAPTLYGVLLQDIS 321

Query: 256 VDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEV---MIRLEGEQMRSY 312
           V G  L I P++        G ++DSGT +TWL + AY AL       M RL  ++    
Sbjct: 322 VGGTPLGIQPSVLSN-----GSVMDSGTVITWLPRRAYSALSSAFRSSMTRLRHQRAAPL 376

Query: 313 SWPDKLCYH--GIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASIN 370
              D  CY   G+++  +   P V     GGA + L+ + +  Q      C+A    S +
Sbjct: 377 GILDT-CYDFTGLVNVSI---PAVSLVLDGGAVVDLDGNGIMIQD-----CLAFAATSGD 427

Query: 371 NRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
                 S+IG + Q+ + V +D+G+    F+   C
Sbjct: 428 ------SIIGNVQQRTFEVLHDVGQGVFGFRSGAC 456


>gi|125554529|gb|EAZ00135.1| hypothetical protein OsI_22138 [Oryza sativa Indica Group]
          Length = 472

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 114/409 (27%), Positives = 177/409 (43%), Gaps = 66/409 (16%)

Query: 37  QEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYP----C 92
           +E+I S ++ +  ++  + I++ LF + +S+G+PPV    A+DTGS+L WV C P    C
Sbjct: 90  EEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHC 149

Query: 93  RDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPY------ARCSAYKHRCIYTQLYLIGP 146
              S + G IF P RS +   V C S  C    Y      A C   +  C Y+  Y  G 
Sbjct: 150 HTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGW 209

Query: 147 ETSV-FVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN- 204
             SV  + T+ L   ++        D++FGC      +   +GIFG G    S   QL  
Sbjct: 210 AYSVGKMVTDTLRIGDS------FMDLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAG 263

Query: 205 -------SSFSYCIGSLH-DPDYLHNKLILG--DGAIIDEGDATPLQFIDG-HYYITLEA 253
                   +FSYC+ +    P Y    +ILG  D A +D G  +  + I+   Y +T+E 
Sbjct: 264 YPDILSYKAFSYCLPTDETKPGY----MILGRYDRAAMDGGYTSLFRSINRPTYSLTMEM 319

Query: 254 ISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMI---------RL 304
           +  +G+ L           S   +++DSG   T L    + AL D+ +          R 
Sbjct: 320 LIANGQRLVT---------SSSEMIVDSGAQRTSLWPSTF-ALLDKTITQAMSSIGYHRT 369

Query: 305 EGEQMRSY----SWPDKLCYHGIMS--SDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPD 358
              +  SY    S  D   ++G ++  S+    P +   F GGA LAL   ++FY     
Sbjct: 370 SRARQESYICYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHR 429

Query: 359 AFCM--AVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
             CM  A NPA      +   ++G    + +   +DI  KQ  F+   C
Sbjct: 430 GLCMTFAQNPA------LRSQILGNRVTRSFGTTFDIQGKQFGFKYAAC 472


>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
 gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
          Length = 543

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 113/439 (25%), Positives = 170/439 (38%), Gaps = 71/439 (16%)

Query: 14  PNENPAHR---VQRGINISIARLAYLQEKIRSHNTYQAQI--------LPSNDISNSLFY 62
           P+++PA     ++R +    +R    Q +IR+     A          L S     +L Y
Sbjct: 126 PDDDPAAHDRYLRRLLAADESRANSFQLRIRNDRAAAASTQSGSAEVPLTSGIRFQTLNY 185

Query: 63  VNI------SIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPC 116
           V        S G P       +DTGS L WV C PC  C  Q   +F P+ S++YA V C
Sbjct: 186 VTTIALGGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRC 245

Query: 117 DSEHCRYFPYA------RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
           ++  C     A       C     RC Y   Y  G  +   ++T+ +           + 
Sbjct: 246 NASACAASLKAATGTPGSCGGGNERCYYALAYGDGSFSRGVLATDTVALGGAS-----LD 300

Query: 171 DVVFGCGFSTNRNFKFSGIFGL-GIGRS--SLVSQL----NSSFSYCIGSLHDPDYLHNK 223
             VFGCG S NR   F G  GL G+GR+  SLVSQ        FSYC+ +    D   + 
Sbjct: 301 GFVFGCGLS-NRGL-FGGTAGLMGLGRTELSLVSQTALRYGGVFSYCLPATTSGDASGSL 358

Query: 224 LILGDGAIIDEGDATPLQFID--------GHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
            + GD +     + TP+ +            Y++ +   +V G  L        +     
Sbjct: 359 SLGGDASSYR--NTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTAL------AAQGLGAS 410

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK-------LCYHGIMSSDL 328
            V+IDSGT +T L    Y  +R E        Q  +  +P          CY  +   D 
Sbjct: 411 NVLIDSGTVITRLAPSVYRGVRAEFT-----RQFAAAGYPTAPGFSILDTCYD-LTGHDE 464

Query: 329 KGFPTVRFHFRGGAKLALEKDSMFYQPRPDA--FCMAVNPASINNRYVNFSLIGMMAQQF 386
              P +     GGA++ ++   M +  R D    C+A+   S  ++     +IG   Q+ 
Sbjct: 465 VKVPLLTLRLEGGAEVTVDAAGMLFVVRKDGSQVCLAMASLSYEDQT---PIIGNYQQKN 521

Query: 387 YNVGYDIGRKQKTFQRMDC 405
             V YD    +  F   DC
Sbjct: 522 KRVVYDTVGSRLGFADEDC 540


>gi|242117573|dbj|BAH80056.1| hypothetical protein [Oryza sativa Indica Group]
          Length = 469

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 112/398 (28%), Positives = 178/398 (44%), Gaps = 42/398 (10%)

Query: 34  AYLQEKIRSH-NTYQAQILPSNDISNSL--FYVNISIGQPPVPQFTAM-DTGSSLLWVHC 89
            +L++++R+  N  Q Q L     S +     +NI++G P     + + D  S  +W  C
Sbjct: 58  GFLKKQLRNRGNKQQQQQLGGEAASGAAPPLVINITVGTPVAQTVSGLVDITSYFVWAQC 117

Query: 90  YPCRDCS---PQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARC--------SAYKHRCIY 138
            PC   +   P   T F P+ S++++ +PC S+ C       C        +    RC  
Sbjct: 118 APCAAAAGCLPPPATAFRPNGSATFSPLPCSSDMCLPVLRETCGRAGAAANATAGARCDS 177

Query: 139 TQLYLIG--PETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFK-FSGIFGLGIG 195
             L   G    TS +++T+  TF  T      V  VVFGC  ++  +F   SG+ G+G G
Sbjct: 178 YSLTYGGSAANTSGYLATDTFTFGAT-----AVPGVVFGCSDASYGDFAGASGVIGIGRG 232

Query: 196 RSSLVSQLN-SSFSYCIGSLHDPD--YLHNKLILGDGAI--IDEGDATPL---QFIDGHY 247
             SL+SQL    FSY + +    D     + +  GD A+     G +TPL         Y
Sbjct: 233 NLSLISQLQFGKFSYQLLAPEATDDGSADSVIRFGDDAVPKTKRGRSTPLLSSTLYPDFY 292

Query: 248 YITLEAISVDGRMLDINP-NIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLE 305
           Y+ L  + VDG  LD  P   F  R +  GGV++ S T VT+L + AY+ +R  V  R+ 
Sbjct: 293 YVNLTGVRVDGNRLDAIPAGTFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAAVASRIG 352

Query: 306 GEQMR-SYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAF-CMA 363
              +  S +    LCY+    + +K  P +   F GGA + L   + FY        C+ 
Sbjct: 353 LPAVNGSAALELDLCYNASSMAKVK-VPKLTLVFDGGADMDLSAANYFYIDNDTGLECLT 411

Query: 364 VNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQ 401
           + P+         S++G + Q   N+ YD+   + TF+
Sbjct: 412 MLPSQ------GGSVLGTLLQTGTNMIYDVDAGRLTFE 443


>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 469

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 100/380 (26%), Positives = 156/380 (41%), Gaps = 53/380 (13%)

Query: 50  ILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DCSPQLGTIFYPSRS 108
           + P   ++   +   + +G P       +DTGSSL W+ C PC   C  Q G +F P  S
Sbjct: 120 LTPGASVAVGNYVTRLGLGTPATSYVMVVDTGSSLTWLQCSPCSVSCHRQAGPVFDPRAS 179

Query: 109 SSYAYVPCDSEHCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD 163
            +YA V C S  C     A      CS   + CIY   Y     +  ++S + ++F +  
Sbjct: 180 GTYAAVQCSSSECGELQAATLNPSACSV-SNVCIYQASYGDSSYSVGYLSKDTVSFGSGS 238

Query: 164 ESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPD 218
               +     +GCG      F + +G+ GL   + SL+ QL  S    FSYC+ +     
Sbjct: 239 FPGFY-----YGCGQDNEGLFGRSAGLIGLAKNKLSLLYQLAPSLGYAFSYCLPTSS--- 290

Query: 219 YLHNKLILGDGAIIDEGDATPLQF---------IDGH-YYITLEAISVDGRMLDINPNIF 268
                        +  G   P Q+         +D   Y++TL  ISV G  L + P+ +
Sbjct: 291 --------AAAGYLSIGSYNPGQYSYTPMASSSLDASLYFVTLSGISVAGAPLAVPPSEY 342

Query: 269 KRDDSGGGVMIDSGTDVTWLVKEAYEAL--RDEVMIRLEGEQMRSYSWPDKLCYHGIMSS 326
           +   +    +IDSGT +T L    Y AL       +     +  +YS  D  C+ G  ++
Sbjct: 343 RSLPT----IIDSGTVITRLPPNVYTALSRAVAAAMASAAPRAPTYSILDT-CFRG-SAA 396

Query: 327 DLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQF 386
            L+  P V   F GGA LAL   ++         C+A  P          ++IG   QQ 
Sbjct: 397 GLR-VPRVDMAFAGGATLALSPGNVLIDVDDSTTCLAFAPTG------GTAIIGNTQQQT 449

Query: 387 YNVGYDIGRKQKTFQRMDCE 406
           ++V YD+ + +  F    C 
Sbjct: 450 FSVVYDVAQSRIGFAAGGCS 469


>gi|340810987|gb|AEK75420.1| S5 [Oryza rufipogon]
 gi|340810989|gb|AEK75421.1| S5 [Oryza rufipogon]
 gi|340810991|gb|AEK75422.1| S5 [Oryza rufipogon]
 gi|340811001|gb|AEK75427.1| S5 [Oryza rufipogon]
 gi|340811019|gb|AEK75436.1| S5 [Oryza rufipogon]
 gi|340811104|gb|AEK75478.1| S5 [Oryza rufipogon]
 gi|340811124|gb|AEK75488.1| S5 [Oryza rufipogon]
          Length = 472

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 116/410 (28%), Positives = 177/410 (43%), Gaps = 68/410 (16%)

Query: 37  QEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYP----C 92
           +E+I S ++ +  ++  + I++ LF + +S+G+PPV    A+DTGS+L WV C P    C
Sbjct: 90  EEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHC 149

Query: 93  RDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPY------ARCSAYKHRCIYTQLYLIGP 146
              S + G IF P RS +   V C S  C    Y      A C   ++ C Y+  Y  G 
Sbjct: 150 HTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKENSCTYSVTYGNGW 209

Query: 147 ETSV-FVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN- 204
             SV  + T+ L   ++        D++FGC      +   +GIFG G    S   QL  
Sbjct: 210 AYSVGKMVTDTLRIGDS------FMDLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAG 263

Query: 205 -------SSFSYCIGSLH-DPDYLHNKLILG--DGAIIDEGDATPL--QFIDGHYYITLE 252
                   +FSYC+ +    P Y    +ILG  D A +D G  TPL        Y +T+E
Sbjct: 264 YPDILSYKAFSYCLPTDETKPGY----MILGRYDRAAMD-GGYTPLFRSINRPTYSLTME 318

Query: 253 AISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMI---------R 303
            +  +G+ L           S   +++DSG   T L    + AL D+ +          R
Sbjct: 319 MLIANGQRLVT---------SSSEMIVDSGAQRTSLWPSTF-ALLDKTITQAMSSIGYHR 368

Query: 304 LEGEQMRSY----SWPDKLCYHGIMS--SDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP 357
               +  SY    S  D   ++G ++  S+    P +   F GGA LAL   ++FY    
Sbjct: 369 TSRARQESYICYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPH 428

Query: 358 DAFCM--AVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
              CM  A NPA      +   ++G    + +   +DI  KQ  F+   C
Sbjct: 429 RGLCMTFAQNPA------LRSQILGNRVTRSFGTTFDIQGKQFGFKYAAC 472


>gi|384252236|gb|EIE25712.1| acid protease [Coccomyxa subellipsoidea C-169]
          Length = 599

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 108/388 (27%), Positives = 165/388 (42%), Gaps = 66/388 (17%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQ-LGTIFYPSRSSSYAYVPCDS 118
           FY  + +G P       +DTGS++ +V C  C R+C P      F P+ SSS A + CDS
Sbjct: 62  FYATLHLGTPARQFAVIVDTGSTITYVPCASCGRNCGPHHKDAAFDPASSSSSAVIGCDS 121

Query: 119 EHCRYF-PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
           + C    P   CS  K  C Y + Y     ++  + ++QL  ++         +VVFGC 
Sbjct: 122 DKCICGRPPCGCSE-KRECTYQRTYAEQSSSAGLLVSDQLQLRDG------AVEVVFGCE 174

Query: 178 FSTNR---NFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGD 228
                   N +  GI GLG    SLV+QL  S      F+ C GS+         L+LGD
Sbjct: 175 TKETGEIYNQEADGILGLGNSEVSLVNQLAGSGVIDDVFALCFGSVEG----DGALMLGD 230

Query: 229 GAIIDEGD-ATPLQFI-------DGHYY-ITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
              +D  +    LQ+          HYY + LEA+ V G+ L + P   +R + G G ++
Sbjct: 231 ---VDAAEYDVALQYTALLSSLAHPHYYSVQLEALWVGGQQLPVKP---ERYEEGYGTVL 284

Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD----------KLCY-------HG 322
           DSGT  T+L  EA++  ++ V        + S   PD           +C+       H 
Sbjct: 285 DSGTTFTYLPSEAFQLFKEAVSAYALEHGLNSVKGPDPKEKSFAQFHDICFGGAPHAGHA 344

Query: 323 IMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPD--AFCMAV--NPASINNRYVNFSL 378
             S   K FP     F  G +L     +  +    +  A+C+ V  N AS        +L
Sbjct: 345 DQSKLEKVFPVFELQFADGVRLRTGPLNYLFMHTGEMGAYCLGVFDNGAS-------GTL 397

Query: 379 IGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
           +G ++ +   V YD   ++  F    C+
Sbjct: 398 LGGISFRNILVQYDRRNRRVGFGAASCQ 425


>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 478

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 90/372 (24%), Positives = 157/372 (42%), Gaps = 34/372 (9%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ--LGT---IFYPSRSSSYA 112
             L+Y  I IG PP      +DTGS +LWV+C  C +C  +  +G    ++ P  SS+  
Sbjct: 70  TGLYYARIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSSTST 129

Query: 113 YVPCDSEHCRY---FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFK---NTDEST 166
            + CD   C      P   C      C Y  +Y  G  T+ +   + +  +      +++
Sbjct: 130 LITCDQPFCSATYDAPIPGCKP-DLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHKTS 188

Query: 167 IHVQDVVFGCGFSTNRNFKFS-----GIFGLGIGRSSLVSQLNSS------FSYCIGSLH 215
                +VFGCG   +     S     GI G G   SS++SQL ++      F++C+    
Sbjct: 189 ETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCL---- 244

Query: 216 DPDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG 274
             D +    I   G +++ +   TP+     HY + L  + V    LD+   +F+     
Sbjct: 245 --DSISGGGIFAIGEVVEPKLKTTPVVPNQAHYNVVLNGVKVGDTALDLPLGLFETSYKR 302

Query: 275 GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTV 334
           G + IDSGT + +L    Y  L ++++      ++R+    D+        +   GFPTV
Sbjct: 303 GAI-IDSGTTLAYLPDSIYLPLMEKILGAQPDLKLRTVD--DQFTCFVFDKNVDDGFPTV 359

Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVN-FSLIGMMAQQFYNVGYDI 393
            F F     L +      +Q R D +C+    +   ++  N  +L+G +  Q   V Y++
Sbjct: 360 TFKFEESLILTIYPHEYLFQIRDDVWCVGWQNSGAQSKDGNEVTLLGDLVLQNKLVYYNL 419

Query: 394 GRKQKTFQRMDC 405
             +   +   +C
Sbjct: 420 ENQTIGWTEYNC 431


>gi|340810931|gb|AEK75392.1| S5 [Oryza sativa]
 gi|340810983|gb|AEK75418.1| S5 [Oryza nivara]
 gi|340810985|gb|AEK75419.1| S5 [Oryza nivara]
 gi|340810997|gb|AEK75425.1| S5 [Oryza nivara]
 gi|340811011|gb|AEK75432.1| S5 [Oryza nivara]
 gi|340811013|gb|AEK75433.1| S5 [Oryza nivara]
 gi|340811041|gb|AEK75447.1| S5 [Oryza nivara]
 gi|340811043|gb|AEK75448.1| S5 [Oryza nivara]
          Length = 474

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 116/410 (28%), Positives = 176/410 (42%), Gaps = 68/410 (16%)

Query: 37  QEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYP----C 92
           +E+I S ++ +  ++  + I++ LF + +S+G+PPV    A+DTGS+L WV C P    C
Sbjct: 92  EEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHC 151

Query: 93  RDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPY------ARCSAYKHRCIYTQLYLIGP 146
              S + G IF P RS +   V C S  C    Y      A C   +  C Y+  Y  G 
Sbjct: 152 HTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGW 211

Query: 147 ETSV-FVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN- 204
             SV  + T+ L   ++        D++FGC      +   +GIFG G    S   QL  
Sbjct: 212 AYSVGKMVTDTLRIGDS------FMDLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAG 265

Query: 205 -------SSFSYCIGSLH-DPDYLHNKLILG--DGAIIDEGDATPL--QFIDGHYYITLE 252
                   +FSYC+ +    P Y    +ILG  D A +D G  TPL        Y +T+E
Sbjct: 266 YPDILSYKAFSYCLPTDETKPGY----MILGRYDRAAMD-GGYTPLFRSINRPTYSLTME 320

Query: 253 AISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMI---------R 303
            +  +G+ L           S   +++DSG   T L    + AL D+ +          R
Sbjct: 321 MLIANGQRLVT---------SSSEMIVDSGAQRTSLWPSTF-ALLDKTITQAMSSIGYHR 370

Query: 304 LEGEQMRSY----SWPDKLCYHGIMS--SDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP 357
               +  SY    S  D   ++G ++  S+    P +   F GGA LAL   ++FY    
Sbjct: 371 TSRARQESYICYLSEHDYSGWNGTITPFSNWSALPPLEIGFAGGAALALSPRNVFYNDPH 430

Query: 358 DAFCM--AVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
              CM  A NPA      +   ++G    + +   +DI  KQ  F+   C
Sbjct: 431 RGLCMTFAQNPA------LRSQILGNRVTRSFGTTFDIQGKQFGFKYAAC 474


>gi|357510893|ref|XP_003625735.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355500750|gb|AES81953.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 535

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 80/264 (30%), Positives = 129/264 (48%), Gaps = 41/264 (15%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
           L++  + +G P    +  +DTGS +LW++C  C +C    G       F  + SS+ A V
Sbjct: 70  LYFTKVKMGSPAKEFYVQIDTGSDILWLNCNTCNNCPKSSGLGIDLNYFDTASSSTAALV 129

Query: 115 PCDSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVF---------VSTEQLTFKNT 162
            C    C Y      ++CS+  ++C YT  Y  G  TS +         V   Q  F N+
Sbjct: 130 SCSDPVCSYAVQTATSQCSSQANQCSYTFQYGDGSGTSGYYVYDAMYFDVIMGQSVFSNS 189

Query: 163 DESTIHVQDVVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCI 211
             +      VVFGC     G          GIFG G G  S+VSQ++S       FS+C+
Sbjct: 190 SST------VVFGCSTYQSGDLARTEKAVDGIFGFGPGALSVVSQVSSQGMAPKVFSHCL 243

Query: 212 GSLHDPDYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKR 270
                   +   L+LG+  I++     TPL  +  HY + L++I+V+G++L I+ ++F  
Sbjct: 244 KGQGSGGGI---LVLGE--ILEPNIVYTPLVPLQPHYNLNLQSIAVNGQILPIDQDVFAT 298

Query: 271 DDSGGGVMIDSGTDVTWLVKEAYE 294
            ++  G ++DSGT + +LV+EAY+
Sbjct: 299 GNN-RGTIVDSGTTLAYLVQEAYD 321


>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
           thaliana]
 gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
 gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
 gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
           thaliana]
 gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 464

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 99/358 (27%), Positives = 138/358 (38%), Gaps = 38/358 (10%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DCSPQLGTIFYPSRSSSYAYVPCDSE 119
           + V I IG P        DTGS L W  C PC   C  Q    F PS SS+Y  V C S 
Sbjct: 132 YIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSTYQNVSCSSP 191

Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
            C       CSA    C+Y+ +Y     T  F++ E+ T  N+D     ++DV FGCG +
Sbjct: 192 MCE--DAESCSA--SNCVYSIVYGDKSFTQGFLAKEKFTLTNSDV----LEDVYFGCGEN 243

Query: 180 TNRNFKFSGIFGLGIGR-----SSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDE 234
               F                 +   +  N+ FSYC+ S       H  L  G   I + 
Sbjct: 244 NQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTSNSTGH--LTFGSAGISES 301

Query: 235 GDATPLQFIDG--HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEA 292
              TP+       +Y I +  ISV  + L I PN F  +    G +IDSGT  T L  + 
Sbjct: 302 VKFTPISSFPSAFNYGIDIIGISVGDKELAITPNSFSTE----GAIIDSGTVFTRLPTKV 357

Query: 293 YEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMF 352
           Y  LR     ++   +  S       CY      D   +PT+ F F G   + L+   + 
Sbjct: 358 YAELRSVFKEKMSSYKSTSGYGLFDTCYD-FTGLDTVTYPTIAFSFAGSTVVELDGSGIS 416

Query: 353 YQPRPDAFCMAVN-----PASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
              +    C+A       PA          + G + Q   +V YD+   +  F    C
Sbjct: 417 LPIKISQVCLAFAGNDDLPA----------IFGNVQQTTLDVVYDVAGGRVGFAPNGC 464


>gi|340810907|gb|AEK75380.1| S5 [Oryza sativa]
          Length = 472

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 116/410 (28%), Positives = 176/410 (42%), Gaps = 68/410 (16%)

Query: 37  QEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYP----C 92
           +E+I S ++ +  ++  + I++ LF + +S+G+PPV    A+DTGS+L WV C P    C
Sbjct: 90  EEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHC 149

Query: 93  RDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPY------ARCSAYKHRCIYTQLYLIGP 146
              S + G IF P RS +   V C S  C    Y      A C   +  C Y+  Y  G 
Sbjct: 150 HTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGW 209

Query: 147 ETSV-FVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN- 204
             SV  + T+ L   ++        D++FGC      +   +GIFG G    S   QL  
Sbjct: 210 AYSVGKMVTDTLRIGDS------FMDLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAG 263

Query: 205 -------SSFSYCIGSLH-DPDYLHNKLILG--DGAIIDEGDATPL--QFIDGHYYITLE 252
                   +FSYC+ +    P Y    +ILG  D A +D G  TPL        Y +T+E
Sbjct: 264 YPDILSYKAFSYCLPTDETKPGY----MILGRYDRAAMD-GGYTPLFRSINRPTYSLTME 318

Query: 253 AISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMI---------R 303
            +  +G+ L           S   +++DSG   T L    + AL D+ +          R
Sbjct: 319 MLIANGQRLVT---------SSSEMIVDSGAQRTSLWPSTF-ALLDKTITQAMSSIGYHR 368

Query: 304 LEGEQMRSY----SWPDKLCYHGIMS--SDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP 357
               +  SY    S  D   ++G ++  S+    P +   F GGA LAL   ++FY    
Sbjct: 369 TSRARQESYICYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALSPRNVFYNDPH 428

Query: 358 DAFCM--AVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
              CM  A NPA      +   ++G    + +   +DI  KQ  F+   C
Sbjct: 429 RGLCMTFAQNPA------LRSQILGNRVTRSFGTTFDIQGKQFGFKYAAC 472


>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
           2-like [Cucumis sativus]
          Length = 478

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 90/372 (24%), Positives = 158/372 (42%), Gaps = 34/372 (9%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ--LGT---IFYPSRSSSYA 112
             L+Y  I IG PP      +DTGS +LWV+C  C +C  +  +G    ++ P  SS+  
Sbjct: 70  TGLYYARIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSSTST 129

Query: 113 YVPCDSEHCRY---FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFK---NTDEST 166
            + CD   C      P   C      C Y  +Y  G  T+ +   + +  +      +++
Sbjct: 130 LITCDQPFCSATYDAPIPGCKP-DLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHKTS 188

Query: 167 IHVQDVVFGCGFSTNRNFKFS-----GIFGLGIGRSSLVSQLNSS------FSYCIGSLH 215
                +VFGCG   +     S     GI G G   SS++SQL ++      F++C+    
Sbjct: 189 ETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCL---- 244

Query: 216 DPDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG 274
             D +    I   G +++ +   TP+     HY + L  + V    LD+   +F+     
Sbjct: 245 --DSISGGGIFAIGEVVEPKLXNTPVVPNQAHYNVVLNGVKVGDTALDLPLGLFETSYKR 302

Query: 275 GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTV 334
           G + IDSGT + +L +  Y  L ++++      ++R+    D+        +   GFPTV
Sbjct: 303 GAI-IDSGTTLAYLPESIYLPLMEKILGAQPDLKLRTVD--DQFTCFVFDKNVDDGFPTV 359

Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVN-FSLIGMMAQQFYNVGYDI 393
            F F     L +      +Q R D +C+    +   ++  N  +L+G +  Q   V Y++
Sbjct: 360 TFKFEESLILTIYPHEYLFQIRDDVWCVGWQNSGAQSKDGNEVTLLGDLVLQNKLVYYNL 419

Query: 394 GRKQKTFQRMDC 405
             +   +   +C
Sbjct: 420 ENQTIGWTEYNC 431


>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 490

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 118/412 (28%), Positives = 176/412 (42%), Gaps = 67/412 (16%)

Query: 31  ARLAYLQEKIR------SHNTYQAQILPSNDIS---NSLFYVNISIGQPPVPQFTAMDTG 81
           +R+A +Q ++       S+       LPS   S   +  + V + +G P        DTG
Sbjct: 108 SRVASIQSRLAKNLAGGSNLKASKATLPSKSASTLGSGNYVVTVGLGSPKRDLTFIFDTG 167

Query: 82  SSLLWVHCYPCRD-CSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYAR-----CSAYKHR 135
           S L W  C PC   C  Q   IF PS S SY+ V CDS  C     A      CS+    
Sbjct: 168 SDLTWTQCEPCVGYCYQQREHIFDPSTSLSYSNVSCDSPSCEKLESATGNSPGCSS--ST 225

Query: 136 CIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFG-LGI 194
           C+Y   Y  G  +  F + E+L+  +TD       +  FGCG   NR   F G  G LG+
Sbjct: 226 CLYGIRYGDGSYSIGFFAREKLSLTSTDV----FNNFQFGCG-QNNRGL-FGGTAGLLGL 279

Query: 195 GRS--SLVSQLNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH-- 246
            R+  SLVSQ        FSYC   L         L  G G    +GD+  ++F      
Sbjct: 280 ARNPLSLVSQTAQKYGKVFSYC---LPSSSSSTGYLSFGSG----DGDSKAVKFTPSEVN 332

Query: 247 ------YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEV 300
                 Y++ +  ISV  R L I  ++F    S  G +IDSGT ++ L    Y +++ +V
Sbjct: 333 SDYPSFYFLDMVGISVGERKLPIPKSVF----STAGTIIDSGTVISRLPPTVYSSVQ-KV 387

Query: 301 MIRLEGE--QMRSYSWPDKLCYHGIMSSDLKGFPTVR-----FHFRGGAKLALEKDSMFY 353
              L  +  +++  S  D  CY      DL  + TV+      +F GGA++ L  + + Y
Sbjct: 388 FRELMSDYPRVKGVSILDT-CY------DLSKYKTVKVPKIILYFSGGAEMDLAPEGIIY 440

Query: 354 QPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
             +    C+A    S ++     ++IG + Q+  +V YD    +  F    C
Sbjct: 441 VLKVSQVCLAFAGNSDDDE---VAIIGNVQQKTIHVVYDDAEGRVGFAPSGC 489


>gi|16209647|gb|AAL14384.1| AT3g52500/F22O6_120 [Arabidopsis thaliana]
          Length = 469

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 124/464 (26%), Positives = 196/464 (42%), Gaps = 85/464 (18%)

Query: 8   LSPYNNPNENPAH---RVQRGINISIARLAYL---------QEKIRSHNTYQAQIL--PS 53
           LSP+++ +++P      ++R    SIAR   L         ++ + S  T  A ++  P 
Sbjct: 23  LSPFSHSDQSPKDPYLSLRRLAESSIARAHKLKHGTSIKPDEDALSSTTTASATVVKSPL 82

Query: 54  NDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC---YPCRDCS-----PQLGTIFYP 105
           +  S   + V++S G P        DTGSSL+ + C   Y C  C      P L   F P
Sbjct: 83  SAKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVCLPCTSRYLCSGCDFSGLDPTLIPRFIP 142

Query: 106 SRSSSYAYVPCDSEHCRYF--PYARCSA---YKHRCI-----YTQLYLIGPETSVFVSTE 155
             SSS   + C S  C++   P  +C         C      Y   Y +G    V + TE
Sbjct: 143 KNSSSSKIIGCQSPKCQFLYGPNVQCRGCDPNTRNCTVGCPPYILQYGLGSTAGVLI-TE 201

Query: 156 QLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSL 214
           +L F +     + V D V GC   + R  + +GI G G G  SL SQ+N   FS+C+ S 
Sbjct: 202 KLDFPD-----LTVPDFVVGCSIISTR--QPAGIAGFGRGPVSLPSQMNLKRFSHCLVSR 254

Query: 215 H-DPDYLHNKLILGDGAIIDEGDATP---------------LQFIDGHYYITLEAISVDG 258
             D   +   L L  G+  + G  TP                 F++ +YY+ L  I V  
Sbjct: 255 RFDDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLE-YYYLNLRRIYVGR 313

Query: 259 RMLDINPNIFKRDDSG-GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK 317
           + + I         +G GG ++DSG+  T++ +  +E + +E        QM +Y+    
Sbjct: 314 KHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEF-----ASQMSNYTREKD 368

Query: 318 L--------CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMF-YQPRPDAFCMAVNPAS 368
           L        C++     D+   P + F F+GGAKL L   + F +    D  C+ V    
Sbjct: 369 LEKETGLGPCFNISGKGDVT-VPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTV---- 423

Query: 369 INNRYVNFS-------LIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           ++++ VN S       ++G   QQ Y V YD+   +  F +  C
Sbjct: 424 VSDKTVNPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKC 467


>gi|196212952|gb|ACG76112.1| S5 [Oryza sativa Indica Group]
 gi|338809989|gb|AEJ08560.1| S5 [Oryza barthii]
 gi|340810883|gb|AEK75368.1| S5 [Oryza sativa]
 gi|340810885|gb|AEK75369.1| S5 [Oryza sativa]
 gi|340810889|gb|AEK75371.1| S5 [Oryza sativa]
 gi|340810895|gb|AEK75374.1| S5 [Oryza sativa]
 gi|340810897|gb|AEK75375.1| S5 [Oryza sativa]
 gi|340810905|gb|AEK75379.1| S5 [Oryza sativa]
 gi|340810909|gb|AEK75381.1| S5 [Oryza sativa]
 gi|340810911|gb|AEK75382.1| S5 [Oryza sativa]
 gi|340810913|gb|AEK75383.1| S5 [Oryza sativa]
 gi|340810923|gb|AEK75388.1| S5 [Oryza sativa]
 gi|340810925|gb|AEK75389.1| S5 [Oryza sativa]
 gi|340810929|gb|AEK75391.1| S5 [Oryza sativa]
 gi|340810935|gb|AEK75394.1| S5 [Oryza sativa]
 gi|340810937|gb|AEK75395.1| S5 [Oryza sativa]
 gi|340810939|gb|AEK75396.1| S5 [Oryza sativa]
 gi|340810941|gb|AEK75397.1| S5 [Oryza sativa]
 gi|340810943|gb|AEK75398.1| S5 [Oryza sativa]
 gi|340810951|gb|AEK75402.1| S5 [Oryza sativa]
 gi|340810953|gb|AEK75403.1| S5 [Oryza sativa]
 gi|340810963|gb|AEK75408.1| S5 [Oryza sativa]
 gi|340810965|gb|AEK75409.1| S5 [Oryza sativa]
 gi|340810973|gb|AEK75413.1| S5 [Oryza nivara]
 gi|340811003|gb|AEK75428.1| S5 [Oryza rufipogon]
 gi|340811005|gb|AEK75429.1| S5 [Oryza rufipogon]
 gi|340811009|gb|AEK75431.1| S5 [Oryza rufipogon]
 gi|340811023|gb|AEK75438.1| S5 [Oryza rufipogon]
 gi|340811025|gb|AEK75439.1| S5 [Oryza nivara]
 gi|340811031|gb|AEK75442.1| S5 [Oryza rufipogon]
 gi|340811033|gb|AEK75443.1| S5 [Oryza rufipogon]
 gi|340811035|gb|AEK75444.1| S5 [Oryza nivara]
 gi|340811039|gb|AEK75446.1| S5 [Oryza rufipogon]
 gi|340811049|gb|AEK75451.1| S5 [Oryza nivara]
 gi|340811053|gb|AEK75453.1| S5 [Oryza rufipogon]
 gi|340811055|gb|AEK75454.1| S5 [Oryza nivara]
 gi|340811057|gb|AEK75455.1| S5 [Oryza rufipogon]
 gi|340811059|gb|AEK75456.1| S5 [Oryza rufipogon]
 gi|340811061|gb|AEK75457.1| S5 [Oryza rufipogon]
 gi|340811065|gb|AEK75459.1| S5 [Oryza nivara]
 gi|340811067|gb|AEK75460.1| S5 [Oryza nivara]
 gi|340811069|gb|AEK75461.1| S5 [Oryza nivara]
 gi|340811071|gb|AEK75462.1| S5 [Oryza rufipogon]
 gi|340811081|gb|AEK75467.1| S5 [Oryza nivara]
 gi|340811083|gb|AEK75468.1| S5 [Oryza nivara]
 gi|340811087|gb|AEK75470.1| S5 [Oryza nivara]
 gi|340811092|gb|AEK75472.1| S5 [Oryza nivara]
 gi|340811102|gb|AEK75477.1| S5 [Oryza rufipogon]
 gi|340811106|gb|AEK75479.1| S5 [Oryza rufipogon]
 gi|340811108|gb|AEK75480.1| S5 [Oryza rufipogon]
 gi|340811110|gb|AEK75481.1| S5 [Oryza rufipogon]
 gi|340811112|gb|AEK75482.1| S5 [Oryza rufipogon]
 gi|340811118|gb|AEK75485.1| S5 [Oryza nivara]
 gi|340811120|gb|AEK75486.1| S5 [Oryza rufipogon]
          Length = 472

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 116/410 (28%), Positives = 176/410 (42%), Gaps = 68/410 (16%)

Query: 37  QEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYP----C 92
           +E+I S ++ +  ++  + I++ LF + +S+G+PPV    A+DTGS+L WV C P    C
Sbjct: 90  EEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHC 149

Query: 93  RDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPY------ARCSAYKHRCIYTQLYLIGP 146
              S + G IF P RS +   V C S  C    Y      A C   +  C Y+  Y  G 
Sbjct: 150 HTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGW 209

Query: 147 ETSV-FVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN- 204
             SV  + T+ L   ++        D++FGC      +   +GIFG G    S   QL  
Sbjct: 210 AYSVGKMVTDTLRIGDS------FMDLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAG 263

Query: 205 -------SSFSYCIGSLH-DPDYLHNKLILG--DGAIIDEGDATPL--QFIDGHYYITLE 252
                   +FSYC+ +    P Y    +ILG  D A +D G  TPL        Y +T+E
Sbjct: 264 YPDILSYKAFSYCLPTDETKPGY----MILGRYDRAAMD-GGYTPLFRSINRPTYSLTME 318

Query: 253 AISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMI---------R 303
            +  +G+ L           S   +++DSG   T L    + AL D+ +          R
Sbjct: 319 MLIANGQRLVT---------SSSEMIVDSGAQRTSLWPSTF-ALLDKTITQAMSSIGYHR 368

Query: 304 LEGEQMRSY----SWPDKLCYHGIMS--SDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP 357
               +  SY    S  D   ++G ++  S+    P +   F GGA LAL   ++FY    
Sbjct: 369 TSRARQESYICYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPH 428

Query: 358 DAFCM--AVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
              CM  A NPA      +   ++G    + +   +DI  KQ  F+   C
Sbjct: 429 RGLCMTFAQNPA------LRSQILGNRVTRSFGTTFDIQGKQFGFKYAAC 472


>gi|308081797|ref|NP_001182920.1| uncharacterized protein LOC100501208 [Zea mays]
 gi|238008190|gb|ACR35130.1| unknown [Zea mays]
 gi|413922182|gb|AFW62114.1| hypothetical protein ZEAMMB73_927324 [Zea mays]
          Length = 269

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 79/273 (28%), Positives = 130/273 (47%), Gaps = 30/273 (10%)

Query: 152 VSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFK-FSGIFGLGIGRSSLVSQLN-SSFSY 209
           ++TE  TF      +    ++ FGCG  TN      SGI G+  G  S++ QL+ + FSY
Sbjct: 7   LATETFTFGAHQNFS---ANLTFGCGKLTNGTIAGASGIMGVSPGPLSVLKQLSITKFSY 63

Query: 210 CIGSLHDPDYLHNKLILGDGAIIDEG--------DATPL---QFIDGHYYITLEAISVDG 258
           C+    D    H    +  GA+ D G           PL      D +YY+ +  IS+  
Sbjct: 64  CLTPFTD----HKTSPVMFGAMADLGKYKTTGKVQTIPLLKNPVEDIYYYVPMVGISIGS 119

Query: 259 RMLDINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVM--IRLEGEQMRSYSWP 315
           + LD+   I   R D  GG ++DS T + +LV+ A++ L+  VM  ++L         +P
Sbjct: 120 KRLDVPEAILALRPDGTGGTVLDSATTLAYLVEPAFKELKKAVMEGMKLPAANRSIDDYP 179

Query: 316 DKLCYHGIMSSDLKGF--PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRY 373
             +C+       ++G   P +  HF G A+++L +DS F +P P   C+AV  A      
Sbjct: 180 --VCFELPRGMSMEGVQVPPLVLHFAGDAEMSLPRDSYFQEPSPGMMCLAVMQAPFEGAP 237

Query: 374 VNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
              ++IG + QQ  +V YD+G ++ ++    C+
Sbjct: 238 ---NVIGNVQQQNMHVLYDLGNRKFSYAPTKCD 267


>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
 gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
          Length = 488

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 103/372 (27%), Positives = 154/372 (41%), Gaps = 36/372 (9%)

Query: 54  NDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAY 113
             +S + +  ++ +G P       +DTGS   WV C PC DC  Q   +F P+ SS+Y+ 
Sbjct: 132 KSLSTTNYVASLRLGTPATELVVELDTGSDQSWVQCKPCADCYEQRDPVFDPTASSTYSA 191

Query: 114 VPCDSEHCRYFP-----YARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIH 168
           VPC +  C+            S     C Y   Y     T   ++ + LT   +   +  
Sbjct: 192 VPCGARECQELASSSSSRNCSSDNNKNCPYEVSYDDDSHTVGDLARDTLTLSPSPSPSPA 251

Query: 169 --VQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLH 221
             V   VFGCG S    F +  G+ GLG+G++SL SQ+     ++FSYC+     P    
Sbjct: 252 DTVPGFVFGCGHSNAGTFGEVDGLLGLGLGKASLPSQVAARYGAAFSYCL-----PSSPS 306

Query: 222 NKLILGDGAIIDEGDATPLQFIDGH----YYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
               L  G      +A   + + G     YY+ L  I V GR + +  + F    +  G 
Sbjct: 307 AAGYLSFGGAAARANAQFTEMVTGQDPTSYYLNLTGIVVAGRAIKVPASAFA---TAAGT 363

Query: 278 MIDSGTDVTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTV 334
           +IDSGT  + L   AY ALR      M R   ++  S    D  CY       ++  P V
Sbjct: 364 IIDSGTAFSRLPPSAYAALRSSFRSAMGRYRYKRAPSSPIFDT-CYDFTGHETVR-IPAV 421

Query: 335 RFHFRGGAKLALEKDSMFYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
              F  GA + L    + Y     A  C+A  P        +  ++G   Q+   V YD+
Sbjct: 422 ELVFADGATVHLHPSGVLYTWNDVAQTCLAFVPNH------DLGILGNTQQRTLAVIYDV 475

Query: 394 GRKQKTFQRMDC 405
           G ++  F R  C
Sbjct: 476 GSQRIGFGRKGC 487


>gi|125558627|gb|EAZ04163.1| hypothetical protein OsI_26305 [Oryza sativa Indica Group]
          Length = 404

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 103/370 (27%), Positives = 153/370 (41%), Gaps = 69/370 (18%)

Query: 55  DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYV 114
           D S   + +N+SIG PPV      DTGSSL+W  C PC +C+ +    F P+ SS+++ +
Sbjct: 84  DNSAGAYNMNLSIGTPPVTFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSSTFSKL 143

Query: 115 PCDSEHCRYF--PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
           PC S  C++   PY  C+A    C+Y   Y +G  T+ +++TE L              V
Sbjct: 144 PCASSLCQFLTSPYRTCNATG--CVYYYPYGMG-FTAGYLATETLHVGGAS-----FPGV 195

Query: 173 VFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAI 231
            FGC          SGI GLG    SLVSQ+  + FSYC+ S  + D   + ++ G  A 
Sbjct: 196 TFGCSTENGVGNSSSGIVGLGRSPLSLVSQVGVARFSYCLRS--NADAGDSPILFGSLAK 253

Query: 232 IDEGD--ATPL-----QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
           +  G+  +TPL          +YY+ L  I+V    L +                     
Sbjct: 254 VTGGNVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPM--------------------- 292

Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFR--GGA 342
                           M  L       + +   LC+    +    G P      R  GGA
Sbjct: 293 ---------------AMANLTTVNGTRFGF--DLCFDATAAGGGGGVPVPTLVLRFAGGA 335

Query: 343 KLALEKDSMF------YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
           + A+ + S F       Q R    C+ V PAS     ++ S+IG + Q   +V YD+   
Sbjct: 336 EYAVRRRSYFGVVEVDSQGRAAVECLLVLPAS---EKLSISIIGNVMQMDLHVLYDLDGG 392

Query: 397 QKTFQRMDCE 406
             +F   DC 
Sbjct: 393 MFSFAPADCA 402


>gi|224102847|ref|XP_002312826.1| predicted protein [Populus trichocarpa]
 gi|222849234|gb|EEE86781.1| predicted protein [Populus trichocarpa]
          Length = 445

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 114/446 (25%), Positives = 187/446 (41%), Gaps = 54/446 (12%)

Query: 6   SILSPYNNPNENPA------HRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNS 59
           SI  P  +P  N         ++   +  S+AR  +L+    +  T     L S+     
Sbjct: 8   SITIPLQHPQTNQIPFQDQYQKLNHLVTTSLARARHLKNPQTTPATTTTAPLFSHSYGG- 66

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHC---YPCRDCSPQLGTI------FYPSRSSS 110
            + V++S G PP      MDTGS ++W  C   Y C+ CS    +       F P  SSS
Sbjct: 67  -YSVSLSFGTPPQTLSFIMDTGSDIVWFPCTSHYLCKHCSFSSSSPSSRIQPFIPKESSS 125

Query: 111 YAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
              + C +  C +  ++  +  +   I + L    P   +F  +         E T+H+ 
Sbjct: 126 SKLLGCKNPKCSWIHHSNINCDQDCSIKSCLNQTCPPYMIFYGSGTTGGVALSE-TLHLH 184

Query: 171 DVV---FGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLH-DPDYLHNKLI 225
            +    F  G S   + + +GI G G G SSL SQL    FSYC+ S   D D   +  +
Sbjct: 185 SLSKPNFLVGCSVFSSHQPAGIAGFGRGLSSLPSQLGLGKFSYCLLSHRFDDDTKKSSSL 244

Query: 226 LGDGAIIDEGDATP----LQFIDG-----------HYYITLEAISVDGRMLDINPNIFKR 270
           + D   +D    T       F+             +YY+ L  I+V G  + +       
Sbjct: 245 VLDMEQLDSDKKTNALVYTPFVKNPKVDNKSSFSVYYYLGLRRITVGGHHVKVPYKYLSP 304

Query: 271 -DDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL----CYHGIMS 325
            +D  GGV+IDSGT  T++ +EA+E L DE  IR   +  R     D +    C++ +  
Sbjct: 305 GEDGNGGVIIDSGTTFTFMAREAFEPLSDE-FIRQIKDYRRVKEIEDAIGLRPCFN-VSD 362

Query: 326 SDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAV------NPASINNRYVNFSLI 379
           +    FP +R +F+GGA +AL  ++ F     +  C+ V       P  +    +   ++
Sbjct: 363 AKTVSFPELRLYFKGGADVALPVENYFAFVGGEVACLTVVTDGVAGPERVGGPGM---IL 419

Query: 380 GMMAQQFYNVGYDIGRKQKTFQRMDC 405
           G    Q + V YD+  ++  F++  C
Sbjct: 420 GNFQMQNFYVEYDLRNERLGFKQEKC 445


>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
 gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
 gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
          Length = 494

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 96/401 (23%), Positives = 174/401 (43%), Gaps = 47/401 (11%)

Query: 41  RSHN-TYQAQILPSNDI---------SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCY 90
           R+H+ + + ++L + DI            L+Y  I IG P    +  +DTGS +LWV+C 
Sbjct: 59  RAHDGSRRGRLLAAADIPLGGLGLPTDTGLYYTEIGIGTPTKRYYVQVDTGSDILWVNCI 118

Query: 91  PCRDCSPQLG-----TIFYPSRSSSYAYVPCDSEHC--RYFPYARCSAYKHRCIYTQLYL 143
            C  C  + G     T++ P  SS+ + V CD   C   Y            C Y+  Y 
Sbjct: 119 SCDRCPRKSGLGLELTLYDPKDSSTGSKVSCDQGFCAATYGGLLPGCTTSLPCEYSVTYG 178

Query: 144 IGPETSVFVSTEQLTFKNTD---ESTIHVQDVVFGCGFST-----NRNFKFSGIFGLGIG 195
            G  T+ +  ++ L F       ++      V FGCG        + N    GI G G  
Sbjct: 179 DGSSTTGYFVSDLLQFDQVSGDGQTRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQS 238

Query: 196 RSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFIDGHYY 248
            +S++SQL+++      F++C+      D ++   I   G ++  +   TPL     HY 
Sbjct: 239 NTSMLSQLSAAGKVKKIFAHCL------DTINGGGIFAIGNVVQPKVKTTPLVPNMPHYN 292

Query: 249 ITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRL--EG 306
           + L++I V G  L +  ++F   +   G +IDSGT +T+L +  Y+    E+M+ +  + 
Sbjct: 293 VNLKSIDVGGTALKLPSHMFDTGEK-KGTIIDSGTTLTYLPEIVYK----EIMLAVFAKH 347

Query: 307 EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNP 366
           + +  ++  + LC+  +   D   FP + FHF     L +     F++   + +C+    
Sbjct: 348 KDITFHNVQEFLCFQYVGRVD-DDFPKITFHFENDLPLNVYPHDYFFENGDNLYCVGFQN 406

Query: 367 ASINNR-YVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
             + ++      L+G +      V YD+  +   +   +C 
Sbjct: 407 GGLQSKDGKGMVLLGDLVLSNKLVVYDLENQVIGWTEYNCS 447


>gi|21717173|gb|AAM76366.1|AC074196_24 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433291|gb|AAP54829.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 413

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 98/369 (26%), Positives = 154/369 (41%), Gaps = 39/369 (10%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           +  N +IG PP P    +D    L+W  C  CR C  Q   +F P+ SS++   PC +  
Sbjct: 62  YVANFTIGTPPQPASAIVDVAGELVWTQCSACRRCFKQDLPVFVPNASSTFKPEPCGTAV 121

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV--VFGCGF 178
           C   P   CS     C Y      GP T +  +T    F  TD   I    V   FGC  
Sbjct: 122 CESIPTRSCSG--DVCSYK-----GPPTQLRGNTSG--FAATDTFAIGTATVRLAFGCVV 172

Query: 179 STNRNF--KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
           +++ +     SG  GLG    SLV+Q+  + FSYC+   +      ++L LG  A +  G
Sbjct: 173 ASDIDTMDGPSGFIGLGRTPWSLVAQMKLTRFSYCLSPRNTGK--SSRLFLGSSAKLAGG 230

Query: 236 DATPLQ-FI------DGHYY--ITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVT 286
           ++T    FI      D H+Y  ++L+AI      +           SGG +++ + +  +
Sbjct: 231 ESTSTAPFIKTSPDDDSHHYYLLSLDAIRAGNTTIATA-------QSGGILVMHTVSPFS 283

Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDK---LCYHGIMSSDLKGFPTVRFHFRGGAK 343
            LV  AY A +  V   + G      + P +   LC+           P + F F+G A 
Sbjct: 284 LLVDSAYRAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGAAA 343

Query: 344 LALEKDSMFYQ--PRPDAFCMAVNPASINNR--YVNFSLIGMMAQQFYNVGYDIGRKQKT 399
           L +             D  C A+   +  NR      S++G + Q+  +  YD+ ++  +
Sbjct: 344 LTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKETLS 403

Query: 400 FQRMDCEVL 408
           F+  DC  L
Sbjct: 404 FEPADCSSL 412


>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 447

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 101/375 (26%), Positives = 155/375 (41%), Gaps = 30/375 (8%)

Query: 53  SNDISNSLFYVNISIGQPPVPQFTAM--DTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSS 110
           S+ +  + + ++  IG P  PQ  A+  DTGS ++W  C PC DC  Q    F  S S +
Sbjct: 84  SHVVGYTEYLIHFGIGTP-RPQQVALEVDTGSDVVWTQCRPCFDCFTQPLPRFDTSASDT 142

Query: 111 YAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
              V C    CR      C  +   C Y   Y     T   ++ +  TF       + V 
Sbjct: 143 VHGVLCTDPICRALRPHAC--FLGGCTYQVNYGDNSVTIGQLAKDSFTFDGKGGGKVTVP 200

Query: 171 DVVFGCGFSTNRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILG 227
           D+VFGCG     NF    +GI G G G  SL  QL  SSFSYC  ++ +       + LG
Sbjct: 201 DLVFGCGQYNTGNFHSNETGIAGFGRGPLSLPRQLGVSSFSYCFTTIFESK--STPVFLG 258

Query: 228 DGAIID------EGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIF-KRDDSGGGV 277
            GA  D       G      F+  H   YY++L+ I+V    L +  + F  + D  GG 
Sbjct: 259 -GAPADGLRAHATGPILSTPFLPNHPEYYYLSLKGITVGKTRLAVPESAFVVKADGSGGT 317

Query: 278 MIDSGTDVTWLVKEAYEALRDEVM--IRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
           +IDSGT +T   +  + +L +  +  + L          P   C+      D    P  +
Sbjct: 318 IIDSGTAITAFPRAVFRSLWEAFVAQVPLPHTSYNDTGEPTLQCFSTESVPDASKVPVPK 377

Query: 336 --FHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
              H  G       ++ M   P  D  C+ V  A  ++R    ++IG   QQ  ++ +D+
Sbjct: 378 MTLHLEGADWELPRENYMAEYPDSDQLCVVVL-AGDDDR----TMIGNFQQQNMHIVHDL 432

Query: 394 GRKQKTFQRMDCEVL 408
              +   +   C+ +
Sbjct: 433 AGNKLVIEPAQCDKM 447


>gi|115475303|ref|NP_001061248.1| Os08g0207800 [Oryza sativa Japonica Group]
 gi|45735815|dbj|BAD12851.1| unknown protein [Oryza sativa Japonica Group]
 gi|113623217|dbj|BAF23162.1| Os08g0207800 [Oryza sativa Japonica Group]
 gi|125602549|gb|EAZ41874.1| hypothetical protein OsJ_26419 [Oryza sativa Japonica Group]
          Length = 449

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 110/390 (28%), Positives = 173/390 (44%), Gaps = 46/390 (11%)

Query: 50  ILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTI--FYPSR 107
           + P     + ++   + IG+    Q+  +DTGSSL+W  C  C  C   +G +  +  S+
Sbjct: 71  VTPFRIYEDVVYLAEMEIGERQQKQYLLIDTGSSLVWTQCDECPHC--HIGDVPPYGRSQ 128

Query: 108 SSSYAYVPC------DSEH--CRYFPYARCSAY-----KHRCIYTQLY-LIGPETSVFVS 153
           S ++  V C      D E     Y P A+   Y       RC++  LY L G   +V   
Sbjct: 129 SRTFQEVSCGDDDDNDKEEAIASYCP-AKPPGYITLCVNGRCMFKALYNLTGQGETVQGY 187

Query: 154 TEQLTFKNTDESTIHVQD---VVFGCGFSTNRNF----KFSGIFGLGIGRSSLVSQLN-S 205
               TF   D+     Q    +VFGC    N       + +GI GLG+G +S + Q   +
Sbjct: 188 MSMDTFHFIDDRRFDYQAKFRMVFGCAHQENIVLTAVKECTGILGLGMGDASFLRQTGIT 247

Query: 206 SFSYCIGSLHDPDY---LHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVD-GRML 261
            FSYC+     P Y    H+ L  G  A I  G   PL    G YY+ L AI+     ++
Sbjct: 248 KFSYCVPP-RMPGYSYRRHSWLRFGSHAQI-SGKKVPLVMRWGKYYLPLTAITYTYNELM 305

Query: 262 DINPNI-FKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEV--MIRLEGEQMRSYSWPDKL 318
              P I +K  +    +M+D+GT +  L    ++ L  E+  +I+ E     +  WP K 
Sbjct: 306 SPVPIIAYKSQEDYLHMMVDTGTSLLSLPTSLHDDLIKEMEAIIKSENIMEGATRWP-KH 364

Query: 319 CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQ---PRPDAFCMAVNPASINNRYVN 375
           CY   M  ++K   TV   F GG  + L   ++F +    +  A C+AVN    +++   
Sbjct: 365 CYKRTM-DEVKDI-TVTLSFDGGLDIELFTSALFIKTETTKGPAVCLAVNRVDDSSK--- 419

Query: 376 FSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
            +++GM AQ   NVGYD+  ++     + C
Sbjct: 420 -AILGMFAQTNINVGYDLLSREIAMDPIRC 448


>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 91/372 (24%), Positives = 152/372 (40%), Gaps = 59/372 (15%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++  + +G P    +  +DTGS   W++C                  S S+  V C S  
Sbjct: 113 YFAEVKVGSPGQRFWLVVDTGSEFTWLNC------------------SKSFEAVTCASRK 154

Query: 121 CR-----YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
           C+      F  + C      C+Y   Y  G     F  T+ +T   T+     + ++  G
Sbjct: 155 CKVDLSELFSLSVCPKPSDPCLYDISYADGSSAKGFFGTDSITVGLTNGKQGKLNNLTIG 214

Query: 176 CGFS----TNRNFKFSGIFGLGIGRSSLV----SQLNSSFSYCIGSLHDPDYLHNKLILG 227
           C  S     N N +  GI GLG  + S +    ++  + FSYC+        + + L +G
Sbjct: 215 CTKSMLNGVNFNEETGGILGLGFAKDSFIDKAANKYGAKFSYCLVDHLSHRSVSSNLTIG 274

Query: 228 ---DGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
              +  ++ E   T L      Y + +  IS+ G+ML I P ++  + + GG +IDSGT 
Sbjct: 275 GHHNAKLLGEIRRTELILFPPFYGVNVVGISIGGQMLKIPPQVWDFN-AEGGTLIDSGTT 333

Query: 285 VTWLVKEAYEALRDEV------MIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----PT 333
           +T L+  AYEA+ + +      + R+ GE   +     + C+      D +GF     P 
Sbjct: 334 LTSLLLPAYEAVFEALTKSLTKVKRVTGEDFDAL----EFCF------DAEGFDDSVVPR 383

Query: 334 VRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
           + FHF GGA+      S      P   C+ + P    +     S+IG + QQ +   +D+
Sbjct: 384 LVFHFAGGARFEPPVKSYIIDVAPLVKCIGIVPI---DGIGGASVIGNIMQQNHLWEFDL 440

Query: 394 GRKQKTFQRMDC 405
                 F    C
Sbjct: 441 STNTVGFAPSTC 452


>gi|242053991|ref|XP_002456141.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
 gi|241928116|gb|EES01261.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
          Length = 519

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 105/417 (25%), Positives = 151/417 (36%), Gaps = 78/417 (18%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-------------------- 100
           ++V   +G P  P     DTGS L WV C+     +P  G                    
Sbjct: 107 YFVRFRVGTPARPFLLVADTGSDLTWVKCHRHDHDAPAPGYGYAAPASNDSSTSSLSAAA 166

Query: 101 -------TIFYPSRSSSYAYVPCDSEHCRY---FPYARCSAYKHRCIYTQLYLIGPETSV 150
                   +F P RS ++A +PC S+ C     F  A C      C Y   Y  G     
Sbjct: 167 ASSSSHARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTPGSPCAYDYRYKDGSAARG 226

Query: 151 FVSTEQLTF------KNTDESTIHVQDVVFGCGFS-TNRNFKFS-GIFGLGIGRSSLVS- 201
            V T+  T           +    ++ VV GC  S T  +F  S G+  LG    S  S 
Sbjct: 227 TVGTDSATIALSGRGAKKKQRQAKLRGVVLGCTTSYTGDSFLASDGVLSLGYSNISFASR 286

Query: 202 ---QLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDA--------------------- 237
              +    FSYC+     P    + L  G    +                          
Sbjct: 287 AAARFGGRFSYCLVDHLAPRNATSYLTFGPNPAVSSSPPSKTACAGGGSPAAAPPGPGGA 346

Query: 238 --TPLQF---IDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEA 292
             TPL     +   Y +T+  ISVDG +L I P +      GGG ++DSGT +T LV  A
Sbjct: 347 RQTPLLLDHRMRPFYAVTVNGISVDGELLRI-PRLVWDVAKGGGAILDSGTSLTVLVSPA 405

Query: 293 YEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS---DLK-GFPTVRFHFRGGAKLALEK 348
           Y A+   +  +L G    +   P   CY+    S   DL    P +  HF G A+L    
Sbjct: 406 YRAVVAALNKKLAGLPRVTMD-PFDYCYNWTSPSTGEDLTVAMPELAVHFAGSARLQPPA 464

Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
            S      P   C+ +        +   S+IG + QQ +   +D+  ++  F+R  C
Sbjct: 465 KSYVIDAAPGVKCIGLQ----EGEWPGVSVIGNILQQEHLWEFDLKNRRLRFKRSRC 517


>gi|449482385|ref|XP_004156266.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 491

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 93/313 (29%), Positives = 148/313 (47%), Gaps = 33/313 (10%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLG---TIFYPSRSSSYAYV 114
           L++  + +G P       +DTGS +LWV C PC  C  S  LG    +F  ++SSS   +
Sbjct: 83  LYFTKVKLGNPAREFNVQIDTGSDILWVTCSPCDGCPDSSGLGIELNLFDTTKSSSARVL 142

Query: 115 PCDSEHCRYFPYA--RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKN-TDESTI--HV 169
           PC    C        +C      C Y+  Y     TS F  T+ + F     ESTI    
Sbjct: 143 PCTDPICAAVSTTTDQCLTQTDHCSYSFHYRDRSGTSGFYVTDSMHFDILLGESTIANSS 202

Query: 170 QDVVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPD 218
             +VFGC     G  T       GIFG G G  S++SQL+S       FS+C+    +  
Sbjct: 203 ATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHCLKGGENGG 262

Query: 219 YLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
            +   L+LG+  I++     +PL     HY + L++I++ G++   NP +F   ++ G  
Sbjct: 263 GI---LVLGE--ILEPSIVYSPLIPSQPHYTLKLQSIALSGQLFP-NPTMFPISNA-GET 315

Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMS-SDLKGFPTVRF 336
           +IDSGT + +LV+E Y+ +   +   +      + S   + C+   MS +D+  FP +RF
Sbjct: 316 IIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQ-CFRVSMSVADI--FPVLRF 372

Query: 337 HFRGGAKLALEKD 349
           +F G A + +  +
Sbjct: 373 NFEGIASMVVTPE 385


>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
 gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
 gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
          Length = 475

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 87/356 (24%), Positives = 154/356 (43%), Gaps = 43/356 (12%)

Query: 38  EKIRSHNTYQ-AQILPSNDI---------SNSLFYVNISIGQPPVPQFTAMDTGSSLLWV 87
           E  +SH+T + +++L S D+         S  L++  I +G PP      +DTGS +LW+
Sbjct: 41  EHFKSHDTRRHSRMLASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILWI 100

Query: 88  HCYPCRDCSPQLG-----TIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLY 142
           +C PC  C  +       ++F  + SS+   V CD + C +   +        C Y  +Y
Sbjct: 101 NCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVGCDDDFCSFISQSDSCQPALGCSYHIVY 160

Query: 143 LIGPETSVFVSTEQLTFKNT--DESTIHV-QDVVFGCGFST-----NRNFKFSGIFGLGI 194
                +      + LT +    D  T  + Q+VVFGCG        N +    G+ G G 
Sbjct: 161 ADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQ 220

Query: 195 GRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFIDGHY 247
             +S++SQL ++      FS+C+      D +    I   G +   +   TP+     HY
Sbjct: 221 SNTSVLSQLAATGDAKRVFSHCL------DNVKGGGIFAVGVVDSPKVKTTPMVPNQMHY 274

Query: 248 YITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGE 307
            + L  + VDG  LD+  +I +     GG ++DSGT + +  K  Y++L + ++ R   +
Sbjct: 275 NVMLMGMDVDGTSLDLPRSIVRN----GGTIVDSGTTLAYFPKVLYDSLIETILAR---Q 327

Query: 308 QMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMA 363
            ++ +   +        ++  + FP V F F    KL +      +    + +C  
Sbjct: 328 PVKLHIVEETFQCFSFSTNVDEAFPPVSFEFEDSVKLTVYPHDYLFTLEEELYCFG 383


>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
 gi|223948009|gb|ACN28088.1| unknown [Zea mays]
 gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
          Length = 507

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 103/374 (27%), Positives = 147/374 (39%), Gaps = 56/374 (14%)

Query: 66  SIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFP 125
           S G P       +DTGS L WV C PC  C  Q   +F P+ S++YA V C++  C    
Sbjct: 153 SSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRCNASACADSL 212

Query: 126 YA---------RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
            A            A   +C Y   Y  G  +   ++T+ +           +   VFGC
Sbjct: 213 RAATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVALGGA-----SLGGFVFGC 267

Query: 177 GFSTNRNFKFSGIFGL-GIGRS--SLVSQLNSS----FSYCIGSLHDPDYLHN-KLILGD 228
           G S NR   F G  GL G+GR+  SLVSQ  S     FSYC+ +    D   +  L  GD
Sbjct: 268 GLS-NRGL-FGGTAGLMGLGRTELSLVSQTASRYGGVFSYCLPAATSGDASGSLSLGGGD 325

Query: 229 GAIIDEGDATPLQFID--------GHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
            A     + TP+ +            Y++ +   +V G  L        +      V+ID
Sbjct: 326 DAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTAL------AAQGLGASNVLID 379

Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK-------LCYHGIMSSDLKGFPT 333
           SGT +T L    Y A+R E M      Q  +  +P          CY  +   D    P 
Sbjct: 380 SGTVITRLAPSVYRAVRAEFM-----RQFGAAGYPAAPGFSILDTCYD-LTGHDEVKVPL 433

Query: 334 VRFHFRGGAKLALEKDSMFYQPRPDA--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
           +     GGA + ++   M +  R D    C+A+   S  +      +IG   Q+   V Y
Sbjct: 434 LTLRLEGGADVTVDAAGMLFVVRKDGSQVCLAMASLSYED---ETPIIGNYQQKNKRVVY 490

Query: 392 DIGRKQKTFQRMDC 405
           D    +  F   DC
Sbjct: 491 DTLGSRLGFADEDC 504


>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 645

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 101/391 (25%), Positives = 165/391 (42%), Gaps = 38/391 (9%)

Query: 36  LQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC 95
           L+E    H+      L  + + N  +   + IG PP      +DTGS++ +V C  CR C
Sbjct: 68  LKESDSEHHPNARMRLYDDLLRNGYYTARLWIGTPPQRFALIVDTGSTVTYVPCSTCRHC 127

Query: 96  SPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTE 155
                  F P  S +Y  V C  +         C   + +C Y + Y     +S  +  +
Sbjct: 128 GSHQDPKFRPEDSETYQPVKCTWQ-------CNCDNDRKQCTYERRYAEMSTSSGALGED 180

Query: 156 QLTFKNTDESTIHVQDVVFGCGFSTN---RNFKFSGIFGLGIGRSSLVSQL------NSS 206
            ++F N  E  +  Q  +FGC         N +  GI GLG G  S++ QL      + S
Sbjct: 181 VVSFGNQTE--LSPQRAIFGCENDETGDIYNQRADGIMGLGRGDLSIMDQLVEKKVISDS 238

Query: 207 FSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPN 266
           FS C G +           +   A +    + P++    +Y I L+ I V G+ L +NP 
Sbjct: 239 FSLCYGGMGVGGGAMVLGGISPPADMVFTRSDPVR--SPYYNIDLKEIHVAGKRLHLNPK 296

Query: 267 IFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK----LCYHG 322
           +F   D   G ++DSGT   +L + A+ A +  +M   E   ++  S PD     +C+ G
Sbjct: 297 VF---DGKHGTVLDSGTTYAYLPESAFLAFKHAIM--KETHSLKRISGPDPRYNDICFSG 351

Query: 323 I---MSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP--DAFCMAVNPASINNRYVNFS 377
               +S   K FP V   F  G KL+L  ++  ++      A+C+ V     +N     +
Sbjct: 352 AEIDVSQISKSFPVVEMVFGNGHKLSLSPENYLFRHSKVRGAYCLGV----FSNGNDPTT 407

Query: 378 LIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
           L+G +  +   V YD    +  F + +C  L
Sbjct: 408 LLGGIVVRNTLVMYDREHTKIGFWKTNCSEL 438


>gi|218194599|gb|EEC77026.1| hypothetical protein OsI_15382 [Oryza sativa Indica Group]
          Length = 409

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 85/337 (25%), Positives = 150/337 (44%), Gaps = 36/337 (10%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
           L+Y  I IG P    +  +DTGS +LWV+C  C  C  + G     T++ P  SS+ + V
Sbjct: 3   LYYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKV 62

Query: 115 PCDSEHC--RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD---ESTIHV 169
            CD   C   Y            C Y+  Y  G  T+ +  ++ L F       ++    
Sbjct: 63  SCDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRPAN 122

Query: 170 QDVVFGCGFST-----NRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPD 218
             V FGCG        + N    GI G G   +S++SQL+++      F++C+      D
Sbjct: 123 STVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCL------D 176

Query: 219 YLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
            ++   I   G ++  +   TPL     HY + L++I V G  L +  ++F   +   G 
Sbjct: 177 TINGGGIFAIGNVVQPKVKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEK-KGT 235

Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRL--EGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
           +IDSGT +T+L +  Y+    E+M+ +  + + +  ++  + LC+  +   D   FP + 
Sbjct: 236 IIDSGTTLTYLPEIVYK----EIMLAVFAKHKDITFHNVQEFLCFQYVGRVD-DDFPKIT 290

Query: 336 FHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR 372
           FHF     L +     F++   + +C+      + ++
Sbjct: 291 FHFENDLPLNVYPHDYFFENGDNLYCVGFQNGGLQSK 327


>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 633

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 105/380 (27%), Positives = 161/380 (42%), Gaps = 55/380 (14%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
           N  +   + IG PP      +D+GS++ +V C  C  C       F P  SS+Y  V C+
Sbjct: 91  NGYYTTRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQCGKHQDPKFQPELSSTYQPVKCN 150

Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
            +         C   K +C+Y + Y     +   +  + ++F N  ES +  Q  VFGC 
Sbjct: 151 MD-------CNCDDDKEQCVYEREYAEHSSSKGVLGEDLISFGN--ESQLTPQRAVFGCE 201

Query: 178 FSTNRNF---KFSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLILGD 228
                +    +  GI GLG G  SLV QL      ++SF  C G +           +G 
Sbjct: 202 TVETGDLYSQRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMD----------VGG 251

Query: 229 GAIIDEGDATP--LQFIDG------HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
           G++I  G   P  + F D       +Y I L  I V G+ L +N  +F   D   G ++D
Sbjct: 252 GSMILGGFDYPSDMIFTDSDPDRSPYYNIDLTGIRVAGKKLSLNSRVF---DGEHGAVLD 308

Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD----KLCYHGIMSSDL----KGFP 332
           SGT   +L   A+ A  + VM   E   ++    PD      C+    S+D+    K FP
Sbjct: 309 SGTTYAYLPDAAFAAFEEAVM--REVSPLKQIDGPDPNFKDTCFLVAASNDVSELSKIFP 366

Query: 333 TVRFHFRGGAKLALEKDS-MFYQPRP-DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
           +V   F+ G    L  ++ MF   +   A+C+ V P    N   + +L+G +  +   V 
Sbjct: 367 SVEMIFKSGQSWLLSPENYMFRHSKVHGAYCLGVFP----NGKDHTTLLGGIVVRNTLVV 422

Query: 391 YDIGRKQKTFQRMDCEVLDD 410
           YD    +  F R +C  L D
Sbjct: 423 YDRENSKVGFWRTNCSELSD 442


>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
 gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
          Length = 564

 Score = 98.6 bits (244), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 98/377 (25%), Positives = 157/377 (41%), Gaps = 54/377 (14%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
           N  +   + IG PP      +DTGSS+ +V C  C  C       F P  SS+Y  V C+
Sbjct: 10  NGYYTTRLWIGTPPQRFALIVDTGSSVTYVPCSSCEQCGRHQDPKFQPDLSSTYQSVKCN 69

Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
            +         C   K +C+Y + Y     +S  +  + ++F N   S +  Q  VFGC 
Sbjct: 70  ID-------CNCDDEKQQCVYERQYAEMSTSSGVLGEDIISFGNL--SALAPQRAVFGCE 120

Query: 178 FSTNRNF---KFSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLILGD 228
                +       GI G+G G  S+V  L      N SFS C          +  + +G 
Sbjct: 121 NMETGDLYSQHADGIMGMGRGDLSIVDHLVDKGVINDSFSLC----------YGGMGIGG 170

Query: 229 GAIIDEGDATPLQFI--------DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
           GA++  G + P   +          +Y I L+ I V G+ L +NP +F   D   G ++D
Sbjct: 171 GAMVLGGISPPSNMVFSQSDPVRSPYYNIDLKEIHVAGKPLPLNPTVF---DGKHGTILD 227

Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD----KLCYHGI---MSSDLKGFPT 333
           SGT   +L + A+ + +D +M  L    ++    PD     +C+ G    +S     FP 
Sbjct: 228 SGTTYAYLPEAAFVSFKDAIMKEL--HSLKPIRGPDPNYNDICFSGAGSDISQLSSSFPA 285

Query: 334 VRFHFRGGAKLALEKDSMFYQPRP--DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
           V   F  G KL L  ++  ++      A+C+ +      N     +L+G +  +   V Y
Sbjct: 286 VEMVFGNGQKLLLSPENYLFRHSKVHGAYCLGI----FQNGKDPTTLLGGIVVRNTLVLY 341

Query: 392 DIGRKQKTFQRMDCEVL 408
           D    +  F + +C  L
Sbjct: 342 DRENSKIGFWKTNCSEL 358


>gi|224114179|ref|XP_002332420.1| predicted protein [Populus trichocarpa]
 gi|222832373|gb|EEE70850.1| predicted protein [Populus trichocarpa]
          Length = 449

 Score = 98.6 bits (244), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 109/444 (24%), Positives = 184/444 (41%), Gaps = 53/444 (11%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQIL-PSNDISNS 59
           LIH+DS  SP    N  P  ++ +      A L +    + ++     +++ P     + 
Sbjct: 18  LIHKDSPQSPLYPGNLPPGEQILQPAACPFAGLHHQTSMMSTNKAVMNRMMSPLTSYGDP 77

Query: 60  -LFYVNISIGQPPVPQ--------FTAMDTGSSLLWVHCYPCRD----CSPQLGTIFYPS 106
            LF   + +G              +  +DTG+ L W+ C  C++    C P     +  S
Sbjct: 78  FLFLAQVGVGSFQEKSHRTHFKTYYFQIDTGNELSWIQCEGCQNKGNMCFPHKDPPYTSS 137

Query: 107 RSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDEST 166
           +S SY  V C+ +H    P  +C   +  C Y   Y  G  TS  ++ E  TF +     
Sbjct: 138 QSKSYKPVSCN-QHSFCEP-NQCK--EGLCAYNVTYGPGSYTSGNLANETFTFYSNHGKH 193

Query: 167 IHVQDVVFGCGF-STNRNFKF-------SGIFGLGIGRSSLVSQLNS----SFSYCIGSL 214
             ++ + FGC   S N  + F       SG+ G+G G  S ++QL S     FSYCI + 
Sbjct: 194 TALKSISFGCSTDSRNMIYAFLLDKNPVSGVLGMGWGPRSFLAQLGSISHGKFSYCITA- 252

Query: 215 HDPDYLHNKLILGDGAIIDEGDATPLQFID----GHYYITLEAISVDGRMLDINP-NIFK 269
              +  HN  +     ++   +    + +       Y++ L  ISV+G  L+I   ++  
Sbjct: 253 ---NNTHNTYLRFGKHVVKSKNLQTTKIMQVKPSAAYHVNLLGISVNGVKLNITKTDLAV 309

Query: 270 RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS----YSWPDKLCYHGIMS 325
           R D   G +ID+GT  T LVK  ++ L   +   L   Q       +     LCY  +  
Sbjct: 310 RKDGSRGCIIDAGTLATLLVKPIFDTLHTALSNHLSSNQNLKRWVIHKLHKDLCYEQLSD 369

Query: 326 SDLKGFPTVRFHFRGGAKLALEKDSMFYQPR---PDAFCMAVNPASINNRYVNFSLIGMM 382
           +  K  P V FH    A L ++ +++F        + FC+++   S +++    ++IG  
Sbjct: 370 AGRKNLPVVTFHLE-NADLEVKPEAIFLFREFEGKNVFCLSM--LSDDSK----TIIGAY 422

Query: 383 AQQFYNVGYDIGRKQKTFQRMDCE 406
            Q      YD   +  +F   DCE
Sbjct: 423 QQMKQKFVYDTKARVLSFGPEDCE 446


>gi|6850312|gb|AAF29389.1|AC009999_9 Contains similarity to nucellin from Hordeum vulgare gb|U87148.
           ESTs gb|T22068, gb|F14251, gb|F14237, gb|F14242 come
           from this gene [Arabidopsis thaliana]
          Length = 388

 Score = 98.6 bits (244), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 85/304 (27%), Positives = 134/304 (44%), Gaps = 38/304 (12%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ--LG---TIFYPSRSSSYAYV 114
           L+Y  I IG P    +  +DTGS ++WV+C  C+ C  +  LG   T++    S S   V
Sbjct: 79  LYYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLV 138

Query: 115 PCDSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD---ESTIH 168
            CD + C      P + C A    C Y ++Y  G  T+ +   + + + +     ++   
Sbjct: 139 SCDDDFCYQISGGPLSGCKA-NMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTA 197

Query: 169 VQDVVFGCG------FSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHD 216
              V+FGCG        ++      GI G G   SS++SQL SS      F++C+     
Sbjct: 198 NGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCL----- 252

Query: 217 PDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGG 275
            D  +   I   G ++  + + TPL     HY + + A+ V    L I  ++F+  D  G
Sbjct: 253 -DGRNGGGIFAIGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLFQPGDRKG 311

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
            + IDSGT + +L +  YE L     ++ E          D  C+      D +GFP V 
Sbjct: 312 AI-IDSGTTLAYLPEIIYEPL-----VKKEPALKVHIVDKDYKCFQYSGRVD-EGFPNVT 364

Query: 336 FHFR 339
           FHF 
Sbjct: 365 FHFE 368


>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family
           [Arabidopsis thaliana]
          Length = 449

 Score = 98.6 bits (244), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 87/356 (24%), Positives = 154/356 (43%), Gaps = 43/356 (12%)

Query: 38  EKIRSHNTYQ-AQILPSNDI---------SNSLFYVNISIGQPPVPQFTAMDTGSSLLWV 87
           E  +SH+T + +++L S D+         S  L++  I +G PP      +DTGS +LW+
Sbjct: 41  EHFKSHDTRRHSRMLASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILWI 100

Query: 88  HCYPCRDCSPQLG-----TIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLY 142
           +C PC  C  +       ++F  + SS+   V CD + C +   +        C Y  +Y
Sbjct: 101 NCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVGCDDDFCSFISQSDSCQPALGCSYHIVY 160

Query: 143 LIGPETSVFVSTEQLTFKNT--DESTIHV-QDVVFGCGFST-----NRNFKFSGIFGLGI 194
                +      + LT +    D  T  + Q+VVFGCG        N +    G+ G G 
Sbjct: 161 ADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQ 220

Query: 195 GRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQFIDGHY 247
             +S++SQL ++      FS+C+      D +    I   G +   +   TP+     HY
Sbjct: 221 SNTSVLSQLAATGDAKRVFSHCL------DNVKGGGIFAVGVVDSPKVKTTPMVPNQMHY 274

Query: 248 YITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGE 307
            + L  + VDG  LD+  +I +     GG ++DSGT + +  K  Y++L + ++ R   +
Sbjct: 275 NVMLMGMDVDGTSLDLPRSIVRN----GGTIVDSGTTLAYFPKVLYDSLIETILAR---Q 327

Query: 308 QMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMA 363
            ++ +   +        ++  + FP V F F    KL +      +    + +C  
Sbjct: 328 PVKLHIVEETFQCFSFSTNVDEAFPPVSFEFEDSVKLTVYPHDYLFTLEEELYCFG 383


>gi|9759559|dbj|BAB11161.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|21553652|gb|AAM62745.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|109134179|gb|ABG25087.1| At5g07030 [Arabidopsis thaliana]
          Length = 439

 Score = 98.2 bits (243), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 110/424 (25%), Positives = 167/424 (39%), Gaps = 41/424 (9%)

Query: 1   LIHRDSILSPYNNPNE-NPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSND---- 55
           + H DS  SP+ + +  +   RV + +    ARL YL   +   +     ++P       
Sbjct: 39  IFHIDSPCSPFKSSSPLSWEARVLQTLAQDQARLQYLSSLVAGRS-----VVPIASGRQM 93

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
           + ++ + V   IG P  P   AMDT S + W+ C  C  C     T F P++S+S+  V 
Sbjct: 94  LQSTTYIVKALIGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSN--TAFSPAKSTSFKNVS 151

Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
           C +  C+  P   C A    C +   Y     +S+  +  Q T +   +    ++   FG
Sbjct: 152 CSAPQCKQVPNPTCGA--RACSFNLTY---GSSSIAANLSQDTIRLAAD---PIKAFTFG 203

Query: 176 C-------GFSTNRNFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGD 228
           C       G             G     S   S   S+FSYC+ S     +    L LG 
Sbjct: 204 CVNKVAGGGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPSFRSLTF-SGSLRLGP 262

Query: 229 GAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDS-GGGVMIDSGTD 284
            +       T L         YY+ L AI V  +++D+ P     + S G G + DSGT 
Sbjct: 263 TSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTV 322

Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD-KLCYHGIMSSDLKGFPTVRFHFRGGAK 343
            T L K  YEA+R+E   R++       S      CY G +       PT+ F F+G   
Sbjct: 323 YTRLAKPVYEAVRNEFRKRVKPTTAVVTSLGGFDTCYSGQVK-----VPTITFMFKGVNM 377

Query: 344 LALEKDSMFYQPRPDAFCMAVNPASIN-NRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
                + M +       C+A+  A  N N  VN  +I  M QQ + V  D+   +    R
Sbjct: 378 TMPADNLMLHSTAGSTSCLAMAAAPENVNSVVN--VIASMQQQNHRVLIDVPNGRLGLAR 435

Query: 403 MDCE 406
             C 
Sbjct: 436 ERCS 439


>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
 gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
          Length = 469

 Score = 98.2 bits (243), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 96/360 (26%), Positives = 155/360 (43%), Gaps = 27/360 (7%)

Query: 55  DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYV 114
           D     + +  SIG PP       DTGS L+W  C      +    + ++P+ SS++  +
Sbjct: 94  DGGGGAYDMEFSIGTPPQKLTALADTGSDLIWTKCDAGGGAAWGGSSSYHPNASSTFTRL 153

Query: 115 PCDSEHC---RYFPYARCSAYKHRCIYTQLYLIGPE---TSVFVSTEQLTFKNTDESTIH 168
           PC    C   R +  ARC+A    C Y   Y +G +   T  F+ +E  T          
Sbjct: 154 PCSDRLCAALRSYSLARCAAGGAECDYKYAYGLGDDPDFTQGFLGSETFTLGGD-----A 208

Query: 169 VQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS-SFSYCIGSLHDPDYLHNKLIL 226
           V  V FGC  +   ++ + +G+ GLG G  SLVSQL++ +F YC+ +        + L+ 
Sbjct: 209 VPGVGFGCTTALEGDYGEGAGLVGLGRGPLSLVSQLDAGTFMYCLTADASK---ASPLLF 265

Query: 227 GDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVT 286
           G  A +  G    +Q        T  A+++  R + I           GGV+ DSGT +T
Sbjct: 266 GALATM-TGAGAGVQSTGLLASTTFYAVNL--RSITIGSATTAGVGGPGGVVFDSGTTLT 322

Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
           +L + AY   +   + +          +  + CY    S+ L   P +  HF GGA +AL
Sbjct: 323 YLAEPAYTEAKAAFLSQTTSLTPVEGRYGFEACYEKPDSARL--IPAMVLHFDGGADMAL 380

Query: 347 EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
              +   +      C  V       R  + S+IG + Q  Y V +D+ +   +FQ  +C+
Sbjct: 381 PVANYVVEVDDGVVCWVV------QRSPSLSIIGNIMQMNYLVLHDVRKSVLSFQPANCD 434


>gi|79507883|ref|NP_196320.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332003717|gb|AED91100.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 455

 Score = 98.2 bits (243), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 110/424 (25%), Positives = 167/424 (39%), Gaps = 41/424 (9%)

Query: 1   LIHRDSILSPYNNPNE-NPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSND---- 55
           + H DS  SP+ + +  +   RV + +    ARL YL   +   +     ++P       
Sbjct: 55  IFHIDSPCSPFKSSSPLSWEARVLQTLAQDQARLQYLSSLVAGRS-----VVPIASGRQM 109

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
           + ++ + V   IG P  P   AMDT S + W+ C  C  C     T F P++S+S+  V 
Sbjct: 110 LQSTTYIVKALIGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSN--TAFSPAKSTSFKNVS 167

Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
           C +  C+  P   C A    C +   Y     +S+  +  Q T +   +    ++   FG
Sbjct: 168 CSAPQCKQVPNPTCGA--RACSFNLTY---GSSSIAANLSQDTIRLAAD---PIKAFTFG 219

Query: 176 C-------GFSTNRNFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGD 228
           C       G             G     S   S   S+FSYC+ S     +    L LG 
Sbjct: 220 CVNKVAGGGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPSFRSLTF-SGSLRLGP 278

Query: 229 GAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDS-GGGVMIDSGTD 284
            +       T L         YY+ L AI V  +++D+ P     + S G G + DSGT 
Sbjct: 279 TSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTV 338

Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD-KLCYHGIMSSDLKGFPTVRFHFRGGAK 343
            T L K  YEA+R+E   R++       S      CY G +       PT+ F F+G   
Sbjct: 339 YTRLAKPVYEAVRNEFRKRVKPTTAVVTSLGGFDTCYSGQVK-----VPTITFMFKGVNM 393

Query: 344 LALEKDSMFYQPRPDAFCMAVNPASIN-NRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
                + M +       C+A+  A  N N  VN  +I  M QQ + V  D+   +    R
Sbjct: 394 TMPADNLMLHSTAGSTSCLAMAAAPENVNSVVN--VIASMQQQNHRVLIDVPNGRLGLAR 451

Query: 403 MDCE 406
             C 
Sbjct: 452 ERCS 455


>gi|21717160|gb|AAM76353.1|AC074196_11 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433304|gb|AAP54833.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
 gi|125575544|gb|EAZ16828.1| hypothetical protein OsJ_32300 [Oryza sativa Japonica Group]
          Length = 419

 Score = 98.2 bits (243), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 104/425 (24%), Positives = 175/425 (41%), Gaps = 64/425 (15%)

Query: 19  AHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDI------SNSLFYVNISIGQPPV 72
           AH ++RG++         Q+ +R      A   P          S + +  N +IG PP 
Sbjct: 23  AHGLRRGLD---------QQGMRGRILADATAAPPGGAVVPLHWSGAHYVANFTIGTPPQ 73

Query: 73  PQFTAMDTGSSLLWVHCYPCRD--CSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCS 130
                +D    L+W  C  CR   C  Q   +F PS S++Y    C S  C+  P   CS
Sbjct: 74  AVSGIVDLSGELVWTQCAACRSSGCFKQELPVFDPSASNTYRAEQCGSPLCKSIPTRNCS 133

Query: 131 AYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF---- 186
                C Y    + G +T    ST+ +   N +        + FGC  +++ +       
Sbjct: 134 G-DGECGYEAPSMFG-DTFGIASTDAIAIGNAEGR------LAFGCVVASDGSIDGAMDG 185

Query: 187 -SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGA-IIDEGDATPLQFI 243
            SG  GLG    SLV Q N ++FSYC+ +LH P    + L LG  A +   G + P   +
Sbjct: 186 PSGFVGLGRTPWSLVGQSNVTAFSYCL-ALHGPGK-KSALFLGASAKLAGAGKSNPPTPL 243

Query: 244 -------------DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVM----IDSGTDVT 286
                        D +Y + LE I    +  D+         SGGG +    +++   ++
Sbjct: 244 LGQHASNTSDDGSDPYYTVQLEGI----KAGDV---AVAAASSGGGAITVLQLETFRPLS 296

Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
           +L   AY+AL   V   L    M +   P  LC+    ++ + G P + F F+GGA L  
Sbjct: 297 YLPDAAYQALEKVVTAALGSPSMANPPEPFDLCFQ---NAAVSGVPDLVFTFQGGATLTA 353

Query: 347 EKDSMFYQP--RPDAFCMAV-NPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
           +               C+++ +   +++     S++G + Q+  +  +D+ ++  +F+  
Sbjct: 354 QPSKYLLGDGNGNGTVCLSILSSTRLDSADDGVSILGSLLQENVHFLFDLEKETLSFEPA 413

Query: 404 DCEVL 408
           DC  L
Sbjct: 414 DCSSL 418


>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 492

 Score = 97.8 bits (242), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 94/315 (29%), Positives = 154/315 (48%), Gaps = 35/315 (11%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC--SPQLGTI--FYPSRSSSYAYVP 115
           L++  + +G PP+     +DTGS +LWV+C  C  C  S  LG    F+ + SSS + + 
Sbjct: 78  LYFTKVKLGTPPMEFTVQIDTGSDILWVNCNSCNGCPRSSGLGIQLNFFDASSSSSSSLV 137

Query: 116 ------CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT-DESTI- 167
                 C+S         +C    ++C YT  Y  G  TS +  +E + F     +S I 
Sbjct: 138 SCSDPICNSAFQT--TATQCLTQSNQCSYTFQYGDGSGTSGYYVSESMYFDMVMGQSMIA 195

Query: 168 -HVQDVVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLH 215
                VVFGC     G  T  +    GIFG G G  S++SQL++       FS+C+    
Sbjct: 196 NSSASVVFGCSTYQSGDLTKSDHAIDGIFGFGPGDLSVISQLSARGITPKVFSHCLKGEG 255

Query: 216 DPDYLHNKLILGDGAIIDEGDA-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG 274
           +   +   L+LG+  +++ G   +PL     HY + L++ISV+G+ L I+P++F    + 
Sbjct: 256 NGGGI---LVLGE--VLEPGIVYSPLVPSQPHYNLYLQSISVNGQTLPIDPSVFATSIN- 309

Query: 275 GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTV 334
            G +IDSGT + +LV+EAY      +   +      + S  ++ CY  + +S  + FP V
Sbjct: 310 RGTIIDSGTTLAYLVEEAYTPFVSAITAAVSQSVTPTISKGNQ-CYL-VSTSVGEIFPLV 367

Query: 335 RFHFRGGAKLALEKD 349
             +F G A + L+ +
Sbjct: 368 SLNFAGSASMVLKPE 382


>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
          Length = 351

 Score = 97.8 bits (242), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 101/363 (27%), Positives = 147/363 (40%), Gaps = 36/363 (9%)

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DCSPQLGTIFYPSRSSSYAYV 114
           I +  + + +  G P   Q    DTGS + W+ C PC   C  Q   +F PS SS+Y  V
Sbjct: 11  IGSGNYVITVGFGTPTRTQTVVFDTGSDVNWLQCKPCAVRCYAQQEPLFDPSLSSTYRNV 70

Query: 115 PCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
            C    C       CS+    C+Y   Y  G  T  F++ +  TF  T       ++ +F
Sbjct: 71  SCTEPACVGLSTRGCSS--STCLYGVFYGDGSSTIGFLAMD--TFMLTPAQ--KFKNFIF 124

Query: 175 GCGFSTNRNFKFSGIFGL-GIGRSSLVS-------QLNSSFSYCIGSLHDPDYLHN---- 222
           GCG   N    F G  GL G+GRSS  S        L + FSYC+ S        N    
Sbjct: 125 GCG--QNNTGLFQGTAGLVGLGRSSTYSLNSQVAPSLGNVFSYCLPSTSSATGYLNIGNP 182

Query: 223 KLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSG 282
           +   G  A++ +     L FID      L  ISV G  L ++  +F+      G +IDSG
Sbjct: 183 QNTPGYTAMLTDTRVPTLYFID------LIGISVGGTRLSLSSTVFQSV----GTIIDSG 232

Query: 283 TDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGA 342
           T +T L   AY AL+  V   +    +         CY    ++ +  +P +  HF  G 
Sbjct: 233 TVITRLPPTAYSALKTAVRAAMTQYTLAPAVTILDTCYDFSRTTSVV-YPVIVLHF-AGL 290

Query: 343 KLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
            + +    +F+       C+A    + N       +IG + Q    V YD   K+  F  
Sbjct: 291 DVRIPATGVFFVFNSSQVCLAF---AGNTDSTMIGIIGNVQQLTMEVTYDNELKRIGFSA 347

Query: 403 MDC 405
             C
Sbjct: 348 GAC 350


>gi|326500408|dbj|BAK06293.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 475

 Score = 97.8 bits (242), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 98/350 (28%), Positives = 142/350 (40%), Gaps = 40/350 (11%)

Query: 74  QFTAMDTGSSLLWVHCYPCR--DCSPQLGTIFYPSRSSSYAYVPCDSEHCRYF-PYAR-C 129
           Q  A+DT   + W+ C PC    C PQ   +F P+ SS+ A V C S  CR   PY   C
Sbjct: 148 QTMAIDTTVDVPWIQCAPCPIPQCYPQRDPLFDPTTSSTAAAVRCRSPACRSLGPYGNGC 207

Query: 130 S--AYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFK-- 185
           S  +    C Y   Y     T+    T+ LT   T      V++  FGC  +    F   
Sbjct: 208 SNRSANAECRYLIEYSDDRATAGTYMTDTLTISGTTA----VRNFRFGCSHAVRGRFSDL 263

Query: 186 FSGIFGLGIGRSSLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGD---AT 238
            +G   LG G  SL++Q    L ++FSYC+       +L     +G  A  +       T
Sbjct: 264 TAGTMSLGGGAQSLLAQTARSLGNAFSYCVPQASASGFLS----IGGPATTNSTTVFATT 319

Query: 239 PL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEA 295
           PL         Y + L+ I V GR L I P  F       G ++DS   +T L   AY A
Sbjct: 320 PLVRSAINPSLYLVRLQGIVVAGRRLGIPPVAFS-----AGAVMDSSAVITQLPPTAYRA 374

Query: 296 LRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQP 355
           LR      +        +     CY  +  ++++  P V   F GGA + L+  ++    
Sbjct: 375 LRRAFRNAMRAYPRSGATGTLDTCYDFLGLTNVR-VPAVSLVFGGGAVVVLDPPAVMI-- 431

Query: 356 RPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
                C+A    S +   +    IG + QQ + V YD+      F+R  C
Sbjct: 432 ---GGCLAFTATSSD---LALGFIGNVQQQTHEVLYDVAAGGVGFRRGAC 475


>gi|356508308|ref|XP_003522900.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 439

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 98/363 (26%), Positives = 142/363 (39%), Gaps = 31/363 (8%)

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
           I +  + V   IG PP     AMDT +   W+ C  C  C+    T+F P +S+++  V 
Sbjct: 93  IQSPTYIVRAKIGSPPQTLLLAMDTSNDAAWIPCTACDGCT---STLFAPEKSTTFKNVS 149

Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
           C S  C   P   C      C +   Y     +S+  +  Q T      +T  + D  FG
Sbjct: 150 CGSPQCNQVPNPSCG--TSACTFNLTY---GSSSIAANVVQDTVT---LATDPIPDYTFG 201

Query: 176 C-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGA 230
           C     G S           G     S   +   S+FSYC+ S    ++    L LG  A
Sbjct: 202 CVAKTTGASAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNF-SGSLRLGPVA 260

Query: 231 IIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVT 286
                  TPL         YY+ L AI V  +++DI P        +G G + DSGT  T
Sbjct: 261 QPIRIKYTPLLKNPRRSSLYYVNLVAIRVGRKVVDIPPEALAFNAATGAGTVFDSGTVFT 320

Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPD----KLCYHGIMSSDLKGFPTVRFHFRGGA 342
            LV  AY A+RDE   R+      + +         CY   + +     PT+ F F G  
Sbjct: 321 RLVAPAYTAVRDEFQRRVAIAAKANLTVTSLGGFDTCYTVPIVA-----PTITFMFSGMN 375

Query: 343 KLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
               E + + +       C+A+  A  N   V  ++I  M QQ + V YD+   +    R
Sbjct: 376 VTLPEDNILIHSTAGSTTCLAMASAPDNVNSV-LNVIANMQQQNHRVLYDVPNSRLGVAR 434

Query: 403 MDC 405
             C
Sbjct: 435 ELC 437


>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 511

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 110/434 (25%), Positives = 179/434 (41%), Gaps = 57/434 (13%)

Query: 17  NPAHRVQRGINISIARLAYLQEKIRSHNT--YQAQILPSNDISNSLFYVNISIGQPPVPQ 74
           +P   +   ++ S+ R  +L+      NT      + P    S   + V+++ G PP   
Sbjct: 89  DPFKTINLLLSASLNRAQHLKTPQSKSNTSIQNVSLFPR---SYGAYSVSLAFGTPPQNL 145

Query: 75  FTAMDTGSSLLWVHC---YPCRDCS-----PQLGTIFYPSRSSSYAYVPCDSEHCRYF-- 124
               DTGSSL+W  C   Y C  CS     P   + F P  SSS   V C +  C +   
Sbjct: 146 SFIFDTGSSLVWFPCTAGYRCSRCSFPYVDPATISKFVPKLSSSVKVVGCRNPKCAWIFG 205

Query: 125 PYAR-----CSAYKHRCI-----YTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
           P  +     C++   +C      Y   Y  G    + +S E L  +N       V D + 
Sbjct: 206 PNLKSRCRNCNSKSRKCSDSCPGYGLQYGSGATAGILLS-ETLDLENK-----RVPDFLV 259

Query: 175 GCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSL-HDPDYLHNKLILGDGAII 232
           GC  S     + +GI G G G  SL SQ+    FS+C+ S   D   + + L+L  G+  
Sbjct: 260 GC--SVMSVHQPAGIAGFGRGPESLPSQMRLKRFSHCLVSRGFDDSPVSSPLVLDSGSES 317

Query: 233 DEGDATPLQF-------------IDGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVM 278
           DE       +                +YY++L  I + G+ +         D +G GG +
Sbjct: 318 DESKTKSFIYAPFRENPSVSNAAFREYYYLSLRRILIGGKPVKFPYKYLVPDSTGNGGAI 377

Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRL----EGEQMRSYSWPDKLCYHGIMSSDLKGFPTV 334
           IDSG+  T+L K  +EA+ DE+  +L      + + + S   + C++     +   FP V
Sbjct: 378 IDSGSTFTFLDKPIFEAIADELEKQLVKYPRAKDVEAQSG-LRPCFNIPKEEESAEFPDV 436

Query: 335 RFHFRGGAKLALEKD---SMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
              F+GG KL+L  +   +M          M  + A +        ++G   QQ   V Y
Sbjct: 437 VLKFKGGGKLSLAAENYLAMVTDEGVVCLTMMTDEAVVGGGGGPAIILGAFQQQNVLVEY 496

Query: 392 DIGRKQKTFQRMDC 405
           D+ +++  F++  C
Sbjct: 497 DLAKQRIGFRKQKC 510


>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 640

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 99/392 (25%), Positives = 166/392 (42%), Gaps = 38/392 (9%)

Query: 35  YLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD 94
           +LQ     H+      L  + + N  +   + IG PP      +DTGS++ +V C  C+ 
Sbjct: 67  HLQGSQSEHHPNARMRLFDDLLRNGYYTTRLWIGTPPQRFALIVDTGSTVTYVPCSTCKH 126

Query: 95  CSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVST 154
           C       F P  S +Y  V C  +         C   + +C Y + Y     +S  +  
Sbjct: 127 CGSHQDPKFRPEASETYQPVKCTWQ-------CNCDDDRKQCTYERRYAEMSTSSGVLGE 179

Query: 155 EQLTFKNTDESTIHVQDVVFGCGFSTN---RNFKFSGIFGLGIGRSSLVSQL------NS 205
           + ++F N  +S +  Q  +FGC         N +  GI GLG G  S++ QL      + 
Sbjct: 180 DVVSFGN--QSELSPQRAIFGCENDETGDIYNQRADGIMGLGRGDLSIMDQLVEKKVISD 237

Query: 206 SFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINP 265
           +FS C G +           +   A +    + P++    +Y I L+ I V G+ L +NP
Sbjct: 238 AFSLCYGGMGVGGGAMVLGGISPPADMVFTHSDPVR--SPYYNIDLKEIHVAGKRLHLNP 295

Query: 266 NIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK----LCYH 321
            +F   D   G ++DSGT   +L + A+ A +  +M   E   ++  S PD     +C+ 
Sbjct: 296 KVF---DGKHGTVLDSGTTYAYLPESAFLAFKHAIM--KETHSLKRISGPDPHYNDICFS 350

Query: 322 GI---MSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP--DAFCMAVNPASINNRYVNF 376
           G    +S   K FP V   F  G KL+L  ++  ++      A+C+ V     +N     
Sbjct: 351 GAEINVSQLSKSFPVVEMVFGNGHKLSLSPENYLFRHSKVRGAYCLGV----FSNGNDPT 406

Query: 377 SLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
           +L+G +  +   V YD    +  F + +C  L
Sbjct: 407 TLLGGIVVRNTLVMYDREHSKIGFWKTNCSEL 438


>gi|356503843|ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 474

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 114/450 (25%), Positives = 189/450 (42%), Gaps = 69/450 (15%)

Query: 7   ILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNIS 66
           ++ P+++ + +P H ++   + S+ R  +L  K R++N+      P+   S   + ++++
Sbjct: 41  LIKPHSS-DSDPFHSLKFAASASLTRAHHL--KHRNNNSPSVATTPAYPKSYGGYSIDLN 97

Query: 67  IGQPPVPQFTAMDTGSSLLWVHC---YPCRDCS-PQLGT----IFYPSRSSSYAYVPCDS 118
           +G PP      +DTGSSL+W  C   Y C  C+ P + T     F P  SS+   + C +
Sbjct: 98  LGTPPQTSPFVLDTGSSLVWFPCTSRYLCSHCNFPNIDTTKIPTFIPKNSSTAKLLGCRN 157

Query: 119 EHCRY-------FPYARCSAYKHRC-----IYTQLYLIGPETSVFVSTEQLTFKNTDEST 166
             C Y       F   +C      C      Y   Y +G  T+ F+  + L F       
Sbjct: 158 PKCGYIFGSDVQFRCPQCKPESQNCSLTCPAYIIQYGLG-STAGFLLLDNLNFPGKT--- 213

Query: 167 IHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNK-L 224
             V   + GC   + R  + SGI G G G+ SL SQ+N   FSYC+ S    D   +  L
Sbjct: 214 --VPQFLVGCSILSIR--QPSGIAGFGRGQESLPSQMNLKRFSYCLVSHRFDDTPQSSDL 269

Query: 225 ILGDGAIIDEGDA-------TPLQ--------FIDGHYYITLEAISVDGRMLDINPNIFK 269
           +L    I   GD        TP +            +YY+TL  + V G+ + I P  F 
Sbjct: 270 VL---QISSTGDTKTNGLSYTPFRSNPSTNNPAFKEYYYLTLRKVIVGGKDVKI-PYTFL 325

Query: 270 R--DDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK----LCYHGI 323
               D  GG ++DSG+  T++ +  Y  +  E + +LE    R+     +     C++ I
Sbjct: 326 EPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFVKQLEKNYSRAEDAETQSGLSPCFN-I 384

Query: 324 MSSDLKGFPTVRFHFRGGAKLALEKDSMF-YQPRPDAFCMAV------NPASINNRYVNF 376
                  FP + F F+GGAK+     + F      +  C+ V       P       +  
Sbjct: 385 SGVKTVTFPELTFKFKGGAKMTQPLQNYFSLVGDAEVVCLTVVSDGGAGPPKTTGPAI-- 442

Query: 377 SLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
            ++G   QQ + + YD+  ++  F    C 
Sbjct: 443 -ILGNYQQQNFYIEYDLENERFGFGPRSCR 471


>gi|51091919|dbj|BAD35188.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|125596474|gb|EAZ36254.1| hypothetical protein OsJ_20576 [Oryza sativa Japonica Group]
 gi|196212950|gb|ACG76111.1| S5 [Oryza sativa Japonica Group]
 gi|340810891|gb|AEK75372.1| S5 [Oryza sativa]
 gi|340810893|gb|AEK75373.1| S5 [Oryza sativa]
 gi|340810899|gb|AEK75376.1| S5 [Oryza sativa]
 gi|340810901|gb|AEK75377.1| S5 [Oryza sativa]
 gi|340810933|gb|AEK75393.1| S5 [Oryza sativa]
 gi|340810947|gb|AEK75400.1| S5 [Oryza sativa]
 gi|340810949|gb|AEK75401.1| S5 [Oryza sativa]
 gi|340810967|gb|AEK75410.1| S5 [Oryza sativa]
 gi|340810969|gb|AEK75411.1| S5 [Oryza sativa]
 gi|340810999|gb|AEK75426.1| S5 [Oryza rufipogon]
 gi|340811017|gb|AEK75435.1| S5 [Oryza rufipogon]
 gi|340811029|gb|AEK75441.1| S5 [Oryza nivara]
 gi|340811051|gb|AEK75452.1| S5 [Oryza nivara]
 gi|340811075|gb|AEK75464.1| S5 [Oryza nivara]
 gi|340811077|gb|AEK75465.1| S5 [Oryza rufipogon]
 gi|340811085|gb|AEK75469.1| S5 [Oryza nivara]
 gi|340811096|gb|AEK75474.1| S5 [Oryza rufipogon]
 gi|340811100|gb|AEK75476.1| S5 [Oryza rufipogon]
 gi|340811114|gb|AEK75483.1| S5 [Oryza nivara]
          Length = 472

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 115/410 (28%), Positives = 175/410 (42%), Gaps = 68/410 (16%)

Query: 37  QEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYP----C 92
           +E+I S ++ +  ++  + I++ LF + +S+G+PPV    A+DTGS+L WV C P    C
Sbjct: 90  EEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHC 149

Query: 93  RDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPY------ARCSAYKHRCIYTQLYLIGP 146
              S + G IF P RS +   V C S  C    Y      A C   +  C Y+  Y  G 
Sbjct: 150 HTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGW 209

Query: 147 ETSV-FVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN- 204
             SV  + T+ L   ++        D++FGC      +   +GIFG G    S   QL  
Sbjct: 210 AYSVGKMVTDTLRIGDS------FMDLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAG 263

Query: 205 -------SSFSYCIGSLH-DPDYLHNKLILG--DGAIIDEGDATPL--QFIDGHYYITLE 252
                   + SYC+ +    P Y    +ILG  D A +D G  TPL        Y +T+E
Sbjct: 264 YPDILSYKALSYCLPTDETKPGY----MILGRYDRAAMD-GGYTPLFRSINRPTYSLTME 318

Query: 253 AISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMI---------R 303
            +  +G+ L           S   +++DSG   T L    + AL D+ +          R
Sbjct: 319 MLIANGQRLVT---------SSSEMIVDSGAQRTSLWPSTF-ALLDKTITQAMSSIGYHR 368

Query: 304 LEGEQMRSY----SWPDKLCYHGIMS--SDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP 357
               +  SY    S  D   ++G ++  S+    P +   F GGA LAL   ++FY    
Sbjct: 369 TSRARQESYICYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPH 428

Query: 358 DAFCM--AVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
              CM  A NPA      +   ++G    + +   +DI  KQ  F+   C
Sbjct: 429 RGLCMTFAQNPA------LRSQILGNRVTRSFGTTFDIQGKQFGFKYAVC 472


>gi|340810993|gb|AEK75423.1| S5 [Oryza rufipogon]
 gi|340811015|gb|AEK75434.1| S5 [Oryza nivara]
 gi|340811021|gb|AEK75437.1| S5 [Oryza nivara]
          Length = 474

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 115/410 (28%), Positives = 175/410 (42%), Gaps = 68/410 (16%)

Query: 37  QEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYP----C 92
           +E+I S ++ +  ++  + I++ LF + +S+G+PPV    A+DTGS+L WV C P    C
Sbjct: 92  EEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHC 151

Query: 93  RDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPY------ARCSAYKHRCIYTQLYLIGP 146
              S + G IF P RS +   V C S  C    Y      A C   +  C Y+  Y  G 
Sbjct: 152 HTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGW 211

Query: 147 ETSV-FVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN- 204
             SV  + T+ L   ++        D++FGC      +   +GIFG G    S   QL  
Sbjct: 212 AYSVGKMVTDTLRIGDS------FMDLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAG 265

Query: 205 -------SSFSYCIGSLH-DPDYLHNKLILG--DGAIIDEGDATPL--QFIDGHYYITLE 252
                   + SYC+ +    P Y    +ILG  D A +D G  TPL        Y +T+E
Sbjct: 266 YPDILSYKALSYCLPTDETKPGY----MILGRYDRAAMD-GGYTPLFRSINRPTYSLTME 320

Query: 253 AISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMI---------R 303
            +  +G+ L           S   +++DSG   T L    + AL D+ +          R
Sbjct: 321 MLIANGQRLVT---------SSSEMIVDSGAQRTSLWPSTF-ALLDKTITQAMSSIGYHR 370

Query: 304 LEGEQMRSY----SWPDKLCYHGIMS--SDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP 357
               +  SY    S  D   ++G ++  S+    P +   F GGA LAL   ++FY    
Sbjct: 371 TSRARQESYICYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPH 430

Query: 358 DAFCM--AVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
              CM  A NPA      +   ++G    + +   +DI  KQ  F+   C
Sbjct: 431 RGLCMTFAQNPA------LRSQILGNRVTRSFGTTFDIQGKQFGFKYAVC 474


>gi|168030587|ref|XP_001767804.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162680886|gb|EDQ67318.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 399

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 102/396 (25%), Positives = 164/396 (41%), Gaps = 57/396 (14%)

Query: 48  AQILPSNDISNSLFYVN-ISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTI---- 102
           A++   +D+    +Y + + IG PP      +DTGS++ +V C  C  C     +     
Sbjct: 26  ARMTLHDDLLTKGYYTSRVFIGTPPNEFALIVDTGSTVTYVPCSSCTHCGHHQASFSTHR 85

Query: 103 -------FYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTE 155
                  F P  SSSY  + C S  C       C +  H+C Y ++Y     +   +  +
Sbjct: 86  LFCRDPRFKPENSSSYQKIGCRSSDC---ITGLCDSNSHQCKYERMYAEMSTSKGVLGKD 142

Query: 156 QLTFKNTDESTIHVQDVVFGCGFSTNRNFKFS---GIFGLGIGRSSLVSQL------NSS 206
            L F     S +  Q + FGC  + + +       GI GLG G  S+V QL        S
Sbjct: 143 LLDFGPA--SRLQSQLLSFGCETAESGDLYLQVADGIMGLGRGPLSIVDQLVGNGAIEDS 200

Query: 207 FSYCIGSLHDPDYLHNKLILG-----DGAIIDEGDATPLQFIDGHYYITLEAISVDGRML 261
           FS C G +   D     ++LG      G +  + D     +    Y + L  I V G  L
Sbjct: 201 FSLCYGGM---DEGGGSMVLGAIPAPSGMVFAKSDPRRSNY----YNLELTEIQVQGASL 253

Query: 262 DINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD----K 317
            ++ N+F   +   G ++DSGT   +L   A+EA  D V+ +L    +++   PD     
Sbjct: 254 KLDSNVF---NGKFGTILDSGTTYAYLPDRAFEAFTDAVVAQL--GSLQAVDGPDPNYPD 308

Query: 318 LCYHGIMSSDL---KGFPTVRFHFRGGAKLALEKDSMFYQ--PRPDAFCMAVNPASINNR 372
           +CY G  +      K FP V F F    K++L  ++  ++    P A+C+        N+
Sbjct: 309 ICYAGAGTDTKELGKHFPLVDFVFAENQKVSLAPENYLFKHTKVPGAYCLGF----FKNQ 364

Query: 373 YVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
                L G++ +    V YD    Q  F + +C  L
Sbjct: 365 DATTLLGGIIVRNML-VTYDRYNHQIGFLKTNCTEL 399


>gi|255581508|ref|XP_002531560.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223528821|gb|EEF30826.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 407

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 105/377 (27%), Positives = 161/377 (42%), Gaps = 43/377 (11%)

Query: 63  VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHC- 121
           V++++G PP      +DTGS L W++C      +    T F  +RS SY  +PC S  C 
Sbjct: 33  VSLTVGTPPQNVSMVIDTGSELSWLYCNKTTTTT-SYPTTFNQTRSISYRPIPCSSSTCT 91

Query: 122 ---RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG- 177
              R F           C  T  Y     +   ++++      +D     +  +VFGC  
Sbjct: 92  NQTRDFSIPASCDSNSLCHATLSYADASSSEGNLASDTFHMGASD-----IPGMVFGCMD 146

Query: 178 --FSTN--RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAII 232
             FS+N   + K +G+ G+  G  S VSQ+    FSYCI            L+LG+    
Sbjct: 147 SVFSSNSDEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISGTD----FSGMLLLGESNFT 202

Query: 233 DEGD---------ATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMIDS 281
                        +TPL + D   Y + LE I V  R+L I  ++F+ D +G G  M+DS
Sbjct: 203 WAVPLNYTPLVQISTPLPYFDRIAYTVQLEGIKVSDRLLPIPKSVFEPDHTGAGQTMVDS 262

Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD-------KLCYHGIMSSD-LKGFPT 333
           GT  T+L+  AY ALR E + +  G  +R    PD        LCY   +S   L   PT
Sbjct: 263 GTQFTFLLGPAYTALRSEFLNQTTGF-LRVLEDPDFVFQGAMDLCYRVPISQRVLPRLPT 321

Query: 334 VRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR---YVNFSLIGMMAQQFYNVG 390
           V   F G      ++  ++  P       +V+  S  N     V   +IG   QQ   + 
Sbjct: 322 VSLVFNGAEMTVADERVLYRVPGEIRGNDSVHCLSFGNSDLLGVEAYVIGHHHQQNVWME 381

Query: 391 YDIGRKQKTFQRMDCEV 407
           +D+ R +    ++ C++
Sbjct: 382 FDLERSRIGLAQVRCDL 398


>gi|125532795|gb|EAY79360.1| hypothetical protein OsI_34488 [Oryza sativa Indica Group]
          Length = 342

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 87/366 (23%), Positives = 146/366 (39%), Gaps = 62/366 (16%)

Query: 57  SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPC 116
           S  L+  N++IG PP P    +      +W  C PCR C  Q   +F             
Sbjct: 24  SQPLYMANLTIGTPPQPASAIIHLAGEFVWTQCSPCRRCFKQDLPLF------------- 70

Query: 117 DSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC 176
                        + Y+   ++     IG   +  + T                 + FGC
Sbjct: 71  -------------NRYEVETMFGDTSGIGGTDTFAIGTA-------------TASLAFGC 104

Query: 177 GFSTN--RNFKFSGIFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNKLILGDGAIID 233
              +N  +    SG+ GLG    SLV Q+N++ FSYC+   H      + L+LG  A + 
Sbjct: 105 AMDSNIKQLLGASGVVGLGRTPWSLVGQMNATAFSYCLAP-HGAAGKKSALLLGASAKLA 163

Query: 234 EGDA---TPLQFID---GHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTW 287
            G +   TPL         Y I LE I     +++  PN       G  V++D+   V++
Sbjct: 164 GGKSAATTPLVNTSDDSSDYMIHLEGIKFGDVIIEPPPN-------GSVVLVDTIFGVSF 216

Query: 288 LVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCY-----HGIMSSDLKGFPTVRFHFRGGA 342
           LV  A+ A++  V + +    M + + P  LC+         +S L   P V   F+G A
Sbjct: 217 LVDAAFHAIKKAVTVAVGAAPMATPTKPFDLCFPKAAAAAGANSSLP-LPDVVLTFQGAA 275

Query: 343 KLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
            L +      Y       C+A+  +++ N     S++G + Q+  +  +D+ ++  +F+ 
Sbjct: 276 ALTVPPSKYMYDAGNGTVCLAMMSSAMLNLTTELSILGRLHQENIHFLFDLDKETLSFEP 335

Query: 403 MDCEVL 408
            DC  L
Sbjct: 336 ADCSSL 341


>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
 gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
          Length = 458

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 102/369 (27%), Positives = 157/369 (42%), Gaps = 36/369 (9%)

Query: 52  PSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DCSPQLGTIFYPSRSSS 110
           P   +    +   + +G P       +DTGSSL W+ C PC   C  Q G +F P  SSS
Sbjct: 112 PGTSVGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPRSSSS 171

Query: 111 YAYVPCDSEHCRYFPYA-----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES 165
           YA V C +  C     A      CS   + CIY   Y     +  ++S + ++F +T   
Sbjct: 172 YASVSCSAPQCDALTTATLNPSTCST-SNVCIYQASYGDSSFSVGYLSKDTVSFGSTS-- 228

Query: 166 TIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYL 220
              V +  +GCG      F + +G+ GL   + SL+ QL      SFSYC+ +       
Sbjct: 229 ---VPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSGY 285

Query: 221 HNKLILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
            +      G    +   TP+      D  Y+I +  I+V G+ L ++ + +    S    
Sbjct: 286 LSIGSYNPG----QYSYTPMAKSSLDDSLYFIKMTGITVAGKPLSVSASAY----SSLPT 337

Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFH 337
           +IDSGT +T L  + Y AL   V   ++G    S       C+ G  +S L+  P V   
Sbjct: 338 IIDSGTVITRLPTDVYSALSKAVAGAMKGTPRASAFSILDTCFQG-QASRLR-VPQVSMA 395

Query: 338 FRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
           F GGA L L+  ++         C+A  PA       + ++IG   QQ ++V YD+   +
Sbjct: 396 FAGGAALKLKATNLLVDVDSATTCLAFAPAR------SAAIIGNTQQQTFSVVYDVKNSK 449

Query: 398 KTFQRMDCE 406
             F    C 
Sbjct: 450 IGFAAGGCS 458


>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
 gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
          Length = 452

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 100/365 (27%), Positives = 153/365 (41%), Gaps = 29/365 (7%)

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
           +    + V   +G PP     A+DT +   W+ C  C  C       F P+ S+SY  VP
Sbjct: 105 LQTPTYVVRARLGTPPQQLLLAVDTSNDAAWIPCAGCAGCPTSSAPPFDPAASTSYRSVP 164

Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
           C S  C   P A C      C ++  Y    ++S+  +  Q +     ++   V+   FG
Sbjct: 165 CGSPLCAQAPNAACPPGGKACGFSLTYA---DSSLQAALSQDSLAVAGDA---VKTYTFG 218

Query: 176 C-GFSTNRNFKFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGA 230
           C   +T       G+ GLG G  S +SQ       +FSYC+ S    ++    L LG   
Sbjct: 219 CLQKATGTAAPPQGLLGLGRGPLSFLSQTRDMYQGTFSYCLPSFKSLNF-SGTLRLGRNG 277

Query: 231 IIDEGDATPLQFIDGH----YYITLEAISVDGRMLDINPNIFKRDD-SGGGVMIDSGTDV 285
                  TPL   + H    YY+ +  I V  +++ I P     D  +G G ++DSGT  
Sbjct: 278 QPPRIKTTPL-LANPHRSSLYYVNMTGIRVGRKVVPIPPPALAFDPATGAGTVLDSGTMF 336

Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLA 345
           T LV  AY A+RDEV  R+ G  + S    D  C++    +    +P V   F G     
Sbjct: 337 TRLVAPAYVAVRDEVRRRV-GAPVSSLGGFDT-CFN----TTAVAWPPVTLLFDGMQVTL 390

Query: 346 LEKDSMFYQPRPDAFC--MAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
            E++ + +       C  MA  P  +N      ++I  M QQ + V +D+   +  F R 
Sbjct: 391 PEENVVIHSTYGTISCLAMAAAPDGVNTV---LNVIASMQQQNHRVLFDVPNGRVGFARE 447

Query: 404 DCEVL 408
            C  +
Sbjct: 448 RCTAV 452


>gi|168014188|ref|XP_001759635.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689174|gb|EDQ75547.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 485

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 111/443 (25%), Positives = 180/443 (40%), Gaps = 66/443 (14%)

Query: 8   LSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQ-------------AQILPSN 54
           ++P  N N N  + V+   N   A +  L E  R  +                A+++  +
Sbjct: 32  VNPSENENGNGLNGVRIQSNCGSALVLPLVESKRHGHVVDRRFERRGRGLVEDARMVLHD 91

Query: 55  DISNSLFYVN-ISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTI---FYPSRSSS 110
           D+    +Y + + IG P       +DTGS++ +V C  C  C          F P  SSS
Sbjct: 92  DLLTKGYYTSRVFIGTPAQEFALIVDTGSTVTYVPCSSCTHCGHHQACFDPRFKPDNSSS 151

Query: 111 YAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
           Y  V C+S  C       C A  H+C Y ++Y     +   +  + L F N   S +   
Sbjct: 152 YQTVSCNSPDCI---TKMCDARVHQCKYERVYAEMSSSKGVLGKDLLGFGNG--SRLQPH 206

Query: 171 DVVFGCGFSTNRNFKFS---GIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLH 221
            ++FGC  +   +       GI GLG G  S+V QL        SFS C G + +     
Sbjct: 207 PLLFGCETAETGDLYLQHADGIMGLGRGPLSIVDQLVGTGAMEDSFSLCYGGMDE----- 261

Query: 222 NKLILGDGAIIDEGDATPLQFI--------DGHYYITLEAISVDGRMLDINPNIFKRDDS 273
                G G+++      P   +          +Y + L  I V G  L++   +F   + 
Sbjct: 262 -----GGGSMVLGAIPPPPAMVFAKSDPNRSNYYNLELSEIQVQGVSLNVPSEVF---NG 313

Query: 274 GGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQM---RSYSWPDKLCYHGIMSSDL-- 328
             G ++DSGT   +L  +A++A +D +  +L   Q       S+PD +C+ G  S     
Sbjct: 314 RLGTVLDSGTTYAYLPDKAFDAFKDAITQQLGSLQAVPGPDPSYPD-VCFAGAGSDSKAL 372

Query: 329 -KGFPTVRFHFRGGAKLALEKDSMFYQ--PRPDAFCMAVNPASINNRYVNFSLIGMMAQQ 385
            K FP V F F G  K+ L  ++  ++    P A+C+        N+    +L+G +  +
Sbjct: 373 GKHFPPVDFVFSGNQKVFLAPENYLFKHTKVPGAYCLGF----FKNQDAT-TLLGGIVVR 427

Query: 386 FYNVGYDIGRKQKTFQRMDCEVL 408
              V YD    Q  F + +C  L
Sbjct: 428 NTLVTYDRANHQIGFFKTNCTNL 450


>gi|242073262|ref|XP_002446567.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
 gi|241937750|gb|EES10895.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
          Length = 453

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 116/425 (27%), Positives = 175/425 (41%), Gaps = 64/425 (15%)

Query: 22  VQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTG 81
           ++R +  S+ R   +  + R     +A ++P        + V + IG P      A+DT 
Sbjct: 54  IRRAVQRSLDRPG-VAARNRKAVVGEAPLVPRG----GEYLVKLGIGTPQHYFSAAIDTA 108

Query: 82  SSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHR-CIYTQ 140
           S L+W+ C PC  C  QL  IF P  SSSYA VPC S+ C      RC     + C Y  
Sbjct: 109 SDLVWLQCQPCVSCYRQLDPIFNPRLSSSYAVVPCSSDTCSQLDGHRCDEDDDQACRYNY 168

Query: 141 LYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST--NRNFKFSGIFGLGIGRSS 198
            Y     T+  ++ ++L       +  H   VV GC  S+      + SG+ GL  G  S
Sbjct: 169 KYSGNAVTNGTLAIDKLAVGG---NVFHA--VVLGCSDSSVGGPPPQASGLVGLARGPLS 223

Query: 199 LVSQLN-SSFSYCIGSLHDP-DYLHNKLILGDGAIIDE----GDATPLQFIDGHYYITLE 252
           L+SQL+   F YC   L  P      KL+LG GA  D      D   +       Y +  
Sbjct: 224 LLSQLSVRRFMYC---LPPPMSRTPGKLVLGAGAGADAVRNVSDRVTVTMSSSTRYPSYY 280

Query: 253 AISVDGRML-DINPNIFKRDDS--------------------GGGVMIDSGTDVTWLVKE 291
            ++ DG  + D  P   +R  S                      G+++D  + +++L   
Sbjct: 281 YLNFDGLAVGDQTPGTIRRPTSPPATGGGVGGGGGDGGSGANAYGMIVDVASTISFLEAS 340

Query: 292 AYEALRDEVMIRLEGEQMRSYSWPDK-----LCY---HGIMSSDLKGFPTVRFHFRGGAK 343
            Y+ L D+    LE E     + P       LC+    G+   D    PTV   F  G  
Sbjct: 341 LYDELADD----LEEEIRLPRATPSTRLGLDLCFILPEGV-GIDRVYVPTVSMSF-DGRW 394

Query: 344 LALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
           L LE+D +F +   D   M +    +  R    S++G   QQ  +V Y++ R + TF + 
Sbjct: 395 LELERDRLFLE---DGRMMCL----MIGRTSGVSILGNYQQQNMHVLYNLRRGKITFAKA 447

Query: 404 DCEVL 408
            C+ L
Sbjct: 448 SCDSL 452


>gi|21717162|gb|AAM76355.1|AC074196_13 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433286|gb|AAP54824.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 397

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 85/359 (23%), Positives = 158/359 (44%), Gaps = 33/359 (9%)

Query: 64  NISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRY 123
           N +IG PP      +D    L+W  C  C  C  Q   +F P+ SS++   PC ++ C+ 
Sbjct: 57  NFTIGTPPQAASAFIDLTGELVWTQCSQCIHCFKQDLPVFVPNASSTFKPEPCGTDVCKS 116

Query: 124 FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRN 183
            P  +C++    C Y  +  +G  T   V+T+         +++      FGC  +++ +
Sbjct: 117 IPTPKCAS--DVCAYDGVTGLGGHTVGIVATDTFAIGTAAPASLG-----FGCVVASDID 169

Query: 184 FKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDA--- 237
                SG  GLG    SLV+Q+  + FSYC+   HD    +++L LG  A +  G A   
Sbjct: 170 TMGGPSGFIGLGRTPWSLVAQMKLTRFSYCLAP-HDTGK-NSRLFLGASAKLAGGGAWTP 227

Query: 238 ----TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSG-TDVTWLVKEA 292
               +P   +  +Y I LE I    +  D    + +  ++   V++ +    V+ LV   
Sbjct: 228 FVKTSPNDGMSQYYPIELEEI----KAGDATITMPRGRNT---VLVQTAVVRVSLLVDSV 280

Query: 293 YEALRDEVMIRLEGEQMRS-YSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSM 351
           Y+  +  VM  +      +    P ++C+     + + G P + F F+ GA L +   + 
Sbjct: 281 YQEFKKAVMASVGAAPTATPVGAPFEVCFP---KAGVSGAPDLVFTFQAGAALTVPPANY 337

Query: 352 FYQPRPDAFCMAVNPASINNRYV--NFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
            +    D  C++V   ++ N       +++G   Q+  ++ +D+ +   +F+  DC  L
Sbjct: 338 LFDVGNDTVCLSVMSIALLNITALDGLNILGSFQQENVHLLFDLDKDMLSFEPADCSSL 396


>gi|238011160|gb|ACR36615.1| unknown [Zea mays]
          Length = 461

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 104/413 (25%), Positives = 151/413 (36%), Gaps = 76/413 (18%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHC---------------YPCRDCSPQLGT---- 101
           ++V   +G P  P     DTGS L WV C               Y     +P        
Sbjct: 55  YFVRFRVGTPARPFLLVADTGSDLTWVKCRRHAAPAPAPAPAPGYNYGYGAPASNDSSSV 114

Query: 102 ---------IFYPSRSSSYAYVPCDSEHCRY---FPYARCSAYKHRCIYTQLYLIGPETS 149
                    +F P RS ++A +PC S+ C     F  A C      C Y   Y  G    
Sbjct: 115 SAAASSPARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTPGSPCAYEYRYKDGSAAR 174

Query: 150 VFVSTEQLTFK------NTDESTIHVQDVVFGCGFS-TNRNFKFS-GIFGLGIGRSSLVS 201
             V T+  T           +    ++ VV GC  S T  +F  S G+  LG    S  S
Sbjct: 175 GTVGTDSATIALSGRRAGKKQRRAKLRGVVLGCTTSYTGESFLASDGVLSLGYSNVSFAS 234

Query: 202 Q----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDA----------------TPLQ 241
           +        FSYC+     P    + L  G    +    A                TPL 
Sbjct: 235 RAAARFGGRFSYCLVDHLAPRNATSYLTFGPNPAVSSASASRTACAGSAAAPGARQTPLL 294

Query: 242 F---IDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRD 298
               +   Y + +  +SVDG +L I P +      GGG ++DSGT +T LV  AY A+  
Sbjct: 295 LDHRMRPFYAVAVNGVSVDGELLRI-PRLVWDVQKGGGAILDSGTSLTVLVSPAYRAVVA 353

Query: 299 EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKG------FPTVRFHFRGGAKLALEKDSMF 352
            +  +L G    +   P   CY+   +S L G       P +  HF G A+L     S  
Sbjct: 354 ALGKKLVGLPRVAMD-PFDYCYN--WTSPLTGEDLAVAVPALAVHFAGSARLQPPPKSYV 410

Query: 353 YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
               P   C+ +        +   S+IG + QQ +   +D+  ++  F+R  C
Sbjct: 411 IDAAPGVKCIGLQ----EGDWPGVSVIGNILQQEHLWEFDLKNRRLRFKRSRC 459


>gi|218184944|gb|EEC67371.1| hypothetical protein OsI_34484 [Oryza sativa Indica Group]
          Length = 396

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 97/369 (26%), Positives = 154/369 (41%), Gaps = 39/369 (10%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           +  N +IG PP P    +D    L+W  C  CR C  Q   +F P+ SS++   PC +  
Sbjct: 45  YVANFTIGTPPQPASAIVDVAGELVWTQCSACRRCFKQDLPVFVPNASSTFKPEPCGTAV 104

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV--VFGCGF 178
           C   P   CS     C Y      GP T +  +T    F  TD   I    V   FGC  
Sbjct: 105 CESIPTRSCSG--DVCSYK-----GPPTQLRGNTSG--FAATDTFAIGTATVRLAFGCVV 155

Query: 179 STNRNF--KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
           +++ +     SG  GLG    SLV+Q+  + FSYC+   +      ++L LG  A +   
Sbjct: 156 ASDIDTMDGPSGFIGLGRTPWSLVAQMKLTRFSYCLSPRNTGK--SSRLFLGSSAKLAGS 213

Query: 236 DATPLQ-FI------DG--HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVT 286
           ++T    FI      DG  +Y ++L+AI      +           SGG +++ + +  +
Sbjct: 214 ESTSTAPFIKTSPDDDGSNYYLLSLDAIRAGNTTI-------ATAQSGGILVMHTVSPFS 266

Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDK---LCYHGIMSSDLKGFPTVRFHFRGGAK 343
            LV  AY+A +  V   + G      + P +   LC+           P + F F+G A 
Sbjct: 267 LLVDSAYKAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGAAA 326

Query: 344 LALEKDSMFYQ--PRPDAFCMAVNPASINNR--YVNFSLIGMMAQQFYNVGYDIGRKQKT 399
           L +             D  C A+   +  NR      S++G + Q+  +  YD+ ++  +
Sbjct: 327 LTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKETLS 386

Query: 400 FQRMDCEVL 408
           F+  DC  L
Sbjct: 387 FEPADCSSL 395


>gi|326490656|dbj|BAJ89995.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 456

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 111/400 (27%), Positives = 168/400 (42%), Gaps = 58/400 (14%)

Query: 32  RLAYLQEKIRSHNTYQAQILPSN-----DISNSL----FYVNISIGQPPVPQFTAMDTGS 82
           R AY+  K    N     +  S+      +  SL    + + + +G P V Q   +DTGS
Sbjct: 89  RAAYITRKYSGVNGSAGDVEGSDVTVPTTLGTSLDTLEYLITVGMGSPAVAQTMLIDTGS 148

Query: 83  SLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLY 142
            + WV C PC  C  Q  ++F PS SS+Y+   C S  C       CS+   +C YT  Y
Sbjct: 149 DVSWVQCKPCSQCHSQADSLFDPSSSSTYSAFSCTSAACAQLRQRGCSS--SQCQYTVKY 206

Query: 143 LIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF---KFSGIFGLGIGRSSL 199
             G   S   S++ L   ++      V++  FGC  S + N    + +G+ GLG G  SL
Sbjct: 207 GDGSTGSGTYSSDTLALGSST-----VENFQFGCSQSESGNLLQDQTAGLMGLGGGAESL 261

Query: 200 VSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLE 252
            +Q       +FSYC+                 G ++     TP+     +  +Y + L+
Sbjct: 262 ATQTAGTFGKAFSYCLPPTPGSSGFLTLGASTSGFVVK----TPMLRSTQVPSYYGVLLQ 317

Query: 253 AISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSY 312
           AI V GR L+I  + F       G ++DSGT +T L + AY AL            M+ Y
Sbjct: 318 AIRVGGRQLNIPASAFS-----AGSIMDSGTIITRLPRTAYSALSSAFK-----AGMKQY 367

Query: 313 SWPDKLCYHGIMSS--DLKG-----FPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVN 365
                +   GI  +  D  G      PTV   F GGA + L  D +         C+A  
Sbjct: 368 PPAQPM---GIFDTCFDFSGQSSVSIPTVALVFSGGAVVDLASDGIIL-----GSCLAF- 418

Query: 366 PASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
             + N+   +  +IG + Q+ + V YD+G     F+   C
Sbjct: 419 --AANSDDTSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 456


>gi|357159746|ref|XP_003578546.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
           distachyon]
          Length = 530

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 103/375 (27%), Positives = 158/375 (42%), Gaps = 48/375 (12%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGT--------IFYPSRSSSY 111
           L Y  +++G P V    A+DTGS L WV C  C  C+P            ++ P +SS+ 
Sbjct: 98  LHYAVVALGTPNVTFLVALDTGSDLFWVPC-DCIKCAPLASPDYGDLKFDMYSPRKSSTS 156

Query: 112 AYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD--ESTIHV 169
             VPC S  C   P A CSA  + C Y+  YL    +S  V  E + +  T+  +S I  
Sbjct: 157 RKVPCSSSLCD--PQADCSAASNSCPYSIQYLSENTSSKGVLVEDVLYLTTESGQSKITQ 214

Query: 170 QDVVFGCGFSTNRNFKFS----GIFGLGIGRSSLVSQLNS------SFSYCIGSLHDPDY 219
             + FGCG   + +F  S    G+ GLG+   S+ S L S      SFS C G     + 
Sbjct: 215 APITFGCGQVQSGSFLGSAAPNGLLGLGMDSKSVPSLLASKGIAANSFSMCFG-----ED 269

Query: 220 LHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
            H ++  GD    D+ + TPL     + Y     IS+ G M+       K  D+    ++
Sbjct: 270 GHGRINFGDTGSSDQLE-TPLNIYKQNPYYN---ISITGAMVG-----GKSFDTKFSAVV 320

Query: 280 DSGTDVTWLVKEAYEALRDEVMIRL-EGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHF 338
           DSGT  T L    Y  +      ++ E  +    S P + CY  I +      P +    
Sbjct: 321 DSGTSFTALSDPMYTEITSTFNAQVKESRKHLDASMPFEYCYS-ISAQGAVNPPNISLTA 379

Query: 339 RGGAKLALEKDSMFY---QPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGR 395
           +GG+   +    +       RP A+C+A+    + +  VN  LIG        + +D  R
Sbjct: 380 KGGSIFPVNGPIITITDTSSRPIAYCLAI----MKSEGVN--LIGENFMSGLKIVFDRER 433

Query: 396 KQKTFQRMDCEVLDD 410
               ++  +C   D+
Sbjct: 434 LVLGWKTFNCYNFDN 448


>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
          Length = 449

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 89/381 (23%), Positives = 162/381 (42%), Gaps = 33/381 (8%)

Query: 52  PSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC-YPC--RDCSPQLG------TI 102
           P+ D     ++V   +G P        DTGS L W+ C Y C  R+CS +         +
Sbjct: 74  PAADYGIGQYFVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRV 133

Query: 103 FYPSRSSSYAYVPCDSEHCR-----YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQL 157
           F+ + SSS+  +PC ++ C+      F    C      C Y   Y  G     F + E +
Sbjct: 134 FHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETV 193

Query: 158 TFKNTDESTIHVQDVVFGCGFS-TNRNFKFS-GIFGLGIGRSSL----VSQLNSSFSYCI 211
           T +  +   + + +V+ GC  S   ++F+ + G+ GLG  + S       +    FSYC+
Sbjct: 194 TVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCL 253

Query: 212 GSLHDPDYLHNKLILGDG----AIIDEGDATPLQF--IDGHYYITLEAISVDGRMLDINP 265
                   + N L  G      A+++    T L    ++  Y + +  IS+ G ML I  
Sbjct: 254 VDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPS 313

Query: 266 NIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIR-LEGEQMRSYSWPDKLCYHGIM 324
            ++    +GG ++ DSG+ +T+L + AY+ +   + +  L+  ++     P + C++   
Sbjct: 314 EVWDVKGAGGTIL-DSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTG 372

Query: 325 SSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQ 384
             +    P + FHF  GA+      S          C+      ++  +   S++G + Q
Sbjct: 373 FEE-SLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGF----VSVAWPGTSVVGNIMQ 427

Query: 385 QFYNVGYDIGRKQKTFQRMDC 405
           Q +   +D+G K+  F    C
Sbjct: 428 QNHLWEFDLGLKKLGFAPSSC 448


>gi|356528675|ref|XP_003532925.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
           At2g35615-like [Glycine max]
          Length = 342

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 98/412 (23%), Positives = 159/412 (38%), Gaps = 112/412 (27%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           LIHRDS LSP+ NP+  P+ R      I+ A L+       + N     IL  N   N  
Sbjct: 33  LIHRDSPLSPFYNPSLTPSER------ITDAALS------SNENKLPESILIPN---NGE 77

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + + + IG PPV +    DTGS  +WV C PC++C                         
Sbjct: 78  YLMRLYIGTPPVERLVIADTGSDFIWVQCSPCQNC------------------------- 112

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDES-TIHVQDVVFGCGFS 179
                         +C+Y  +Y     T   V TE L+F +T  + T+   + +FGCG +
Sbjct: 113 --------------QCVYLNIYANKSFTIEVVGTETLSFDSTGGAQTVSFPNSIFGCGAN 158

Query: 180 TNRNF----KFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
            N  F    K +G+ GL  G+ SLVSQL +   Y    L    +    +I  +G +    
Sbjct: 159 NNLTFRSSDKATGLVGLVAGQLSLVSQLGAQIGYKFSYLK---FGSEAIITTNGVV---- 211

Query: 236 DATPLQFIDG--HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAY 293
            +TPL        Y++ LE +++  +++                                
Sbjct: 212 -STPLIIKPSLPLYFLNLEVVTIGQKVVPTE----------------------------- 241

Query: 294 EALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFY 353
                     L  E ++   +P K C+      D    P + F F G +     K+ +  
Sbjct: 242 ---------TLGVESVQDLPFPFKFCFP---YRDNMTVPAIAFQFTGASVALRPKNLLIK 289

Query: 354 QPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
               +   +AV P++ +   +  S+ G++AQ  + V YD+  K+ +    DC
Sbjct: 290 LQDRNMLXLAVVPSASSLSVI--SIFGIIAQFDFQVLYDLDGKKVSVAPTDC 339


>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
          Length = 525

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 102/362 (28%), Positives = 150/362 (41%), Gaps = 54/362 (14%)

Query: 78  MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYA------RCSA 131
           +DTGS L WV C PC  C  Q   +F PS S+SYA VPC++  C     A       C+ 
Sbjct: 181 VDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSCAT 240

Query: 132 Y--------KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRN 183
                      RC Y+  Y  G  +   ++T+ +           V   VFGCG S NR 
Sbjct: 241 VGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGAS-----VDGFVFGCGLS-NRG 294

Query: 184 FKFSGIFGL-GIGRS--SLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGD 236
             F G  GL G+GR+  SLVSQ        FSYC+ +    D   +  + GD +     +
Sbjct: 295 L-FGGTAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAATSGDAAGSLSLGGDTSSYR--N 351

Query: 237 ATPLQFID--------GHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
           ATP+ +            Y++ +   SV G  +               V++DSGT +T L
Sbjct: 352 ATPVSYTRMIADPAQPPFYFMNVTGASVGGAAVAAAGLGAAN------VLLDSGTVITRL 405

Query: 289 VKEAYEALRDEVMIRLEGEQMRS---YSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLA 345
               Y A+R E   +   E+  +   +S  D  CY+ +   D    P +     GGA + 
Sbjct: 406 APSVYRAVRAEFARQFGAERYPAAPPFSLLDA-CYN-LTGHDEVKVPLLTLRLEGGADMT 463

Query: 346 LEKDSMFYQPRPDA--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
           ++   M +  R D    C+A+   S  ++     +IG   Q+   V YD    +  F   
Sbjct: 464 VDAAGMLFMARKDGSQVCLAMASLSFEDQT---PIIGNYQQKNKRVVYDTVGSRLGFADE 520

Query: 404 DC 405
           DC
Sbjct: 521 DC 522


>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
 gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
          Length = 524

 Score = 95.9 bits (237), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 102/362 (28%), Positives = 150/362 (41%), Gaps = 54/362 (14%)

Query: 78  MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYA------RCSA 131
           +DTGS L WV C PC  C  Q   +F PS S+SYA VPC++  C     A       C+ 
Sbjct: 180 VDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSCAT 239

Query: 132 Y--------KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRN 183
                      RC Y+  Y  G  +   ++T+ +           V   VFGCG S NR 
Sbjct: 240 VGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGAS-----VDGFVFGCGLS-NRG 293

Query: 184 FKFSGIFGL-GIGRS--SLVSQ----LNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGD 236
             F G  GL G+GR+  SLVSQ        FSYC+ +    D   +  + GD +     +
Sbjct: 294 L-FGGTAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAATSGDAAGSLSLGGDTSSYR--N 350

Query: 237 ATPLQFID--------GHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
           ATP+ +            Y++ +   SV G  +               V++DSGT +T L
Sbjct: 351 ATPVSYTRMIADPAQPPFYFMNVTGASVGGAAVAAAGLGAAN------VLLDSGTVITRL 404

Query: 289 VKEAYEALRDEVMIRLEGEQMRS---YSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLA 345
               Y A+R E   +   E+  +   +S  D  CY+ +   D    P +     GGA + 
Sbjct: 405 APSVYRAVRAEFARQFGAERYPAAPPFSLLDA-CYN-LTGHDEVKVPLLTLRLEGGADMT 462

Query: 346 LEKDSMFYQPRPDA--FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
           ++   M +  R D    C+A+   S  ++     +IG   Q+   V YD    +  F   
Sbjct: 463 VDAAGMLFMARKDGSQVCLAMASLSFEDQT---PIIGNYQQKNKRVVYDTVGSRLGFADE 519

Query: 404 DC 405
           DC
Sbjct: 520 DC 521


>gi|297597434|ref|NP_001043968.2| Os01g0696800 [Oryza sativa Japonica Group]
 gi|255673588|dbj|BAF05882.2| Os01g0696800 [Oryza sativa Japonica Group]
          Length = 334

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 100/328 (30%), Positives = 147/328 (44%), Gaps = 39/328 (11%)

Query: 102 IFYPSRSSSYAYVPCDSEHCRYFPYARCS------AYKHRCIYTQLYLIGPETSVFVS-- 153
           + YP+ SSS A+V C    C   P   CS      +    C Y   Y    +T  +    
Sbjct: 14  LLYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGGSGSGNCSYHYAYGNARDTHHYTEGI 73

Query: 154 --TEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF-SGIFGLGIGRSSLVSQLN-SSFSY 209
             TE  TF    +       + FGC   +   F   SG+ GLG G+ SLV+QLN  +F Y
Sbjct: 74  LMTETFTFG---DDAAAFPGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQLNVEAFGY 130

Query: 210 CIGS-LHDPDYLH-NKLILGDGAIIDEGDATPL------QFIDGHYYITLEAISVDGRML 261
            + S L  P  +    L    G   D   +TPL      Q +   YY+ L  ISV G+++
Sbjct: 131 RLSSDLSAPSPISFGSLADVTGGNGDSFMSTPLLTNPVVQDLP-FYYVGLTGISVGGKLV 189

Query: 262 DINPNIFKRDDS--GGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL- 318
            I    F  D S   GGV+ DSGT +T L   AY  +RDE++ ++  ++    +  D L 
Sbjct: 190 QIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDDDLI 249

Query: 319 CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS----MFYQPRPDAFCMAVNPASINNRYV 374
           C+ G   S    FP++  HF GGA + L  ++    M  Q    A C +V  +S      
Sbjct: 250 CFTG--GSSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNGETARCWSVVKSS-----Q 302

Query: 375 NFSLIGMMAQQFYNVGYDI-GRKQKTFQ 401
             ++IG + Q  ++V +D+ G  +  FQ
Sbjct: 303 ALTIIGNIMQMDFHVVFDLSGNARMLFQ 330


>gi|125532792|gb|EAY79357.1| hypothetical protein OsI_34486 [Oryza sativa Indica Group]
          Length = 396

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 92/371 (24%), Positives = 154/371 (41%), Gaps = 41/371 (11%)

Query: 57  SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYP-CRDCSPQLGTIFYPSRSSSYAYVP 115
           S + + VN++IG PP P    +D G  L+W  C   CR C  Q   +F  + SS++   P
Sbjct: 47  SQAFYVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHCRRCFKQDLPLFDTNASSTFRPEP 106

Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
           C +  C   P   C+         +       TS   +  ++        T     + FG
Sbjct: 107 CGAAVCESIPTRSCAGDGGGACGYEA-----STSFGRTVGRIGTDAVAIGTAATARLAFG 161

Query: 176 CGFSTNRNFKFSGIFGLGIGRS--SLVSQLN-SSFSYCIGSLHDPDY-LHNKLILGDGAI 231
           C  ++  +  +     +G+GR+  SL +Q+N ++FSYC   L  PD    + L LG  A 
Sbjct: 162 CAVASEMDTMWGSSGSVGLGRTNLSLAAQMNATAFSYC---LAPPDTGKSSALFLGASAK 218

Query: 232 I---DEGDAT---------PLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
           +    +G  T         P   +   Y + LEAI      + +         SG  +M+
Sbjct: 219 LAGAGKGAGTTPFVKTSTPPHSGLSRSYLLRLEAIRAGNATIAM-------PQSGNTIMV 271

Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFR 339
            + T VT LV   Y  LR  V   +    +        LC+    +S   G P +   F+
Sbjct: 272 STATPVTALVDSVYRDLRKAVADAVGAAPVPPPVQNYDLCFPKASASG--GAPDLVLAFQ 329

Query: 340 GGAKLALEKDSMFYQPRPDAFCMAV--NPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
           GGA++ +   S  +    D  C+A+  +PA         S++G + Q   ++ +D+ ++ 
Sbjct: 330 GGAEMTVPVSSYLFDAGNDTACVAILGSPA-----LGGVSILGSLQQVNIHLLFDLDKET 384

Query: 398 KTFQRMDCEVL 408
            +F+  DC  L
Sbjct: 385 LSFEPADCSAL 395


>gi|357456413|ref|XP_003598487.1| Peptidase A1, putative [Medicago truncatula]
 gi|355487535|gb|AES68738.1| Peptidase A1, putative [Medicago truncatula]
          Length = 414

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 96/359 (26%), Positives = 142/359 (39%), Gaps = 27/359 (7%)

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
           I +  + V   IG PP     AMDT +   W+ C  C  C+    T+F P +S+++  V 
Sbjct: 73  IQSPTYIVRAKIGTPPQTLLLAMDTSNDAAWIPCTACDGCA---STLFAPEKSTTFKNVS 129

Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
           C +  C+  P   C      C +   Y     +S+  +  Q T      +T  V    FG
Sbjct: 130 CAAPECKQVPNPGCGV--SSCNFNLTY---GSSSIAANLVQDTIT---LATDPVPSYTFG 181

Query: 176 C-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGA 230
           C     G S           G     S   +   S+FSYC+ S    ++    L LG  A
Sbjct: 182 CVSKTTGTSAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNF-SGSLRLGPVA 240

Query: 231 IIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVT 286
                  TPL         YY+ LEAI V  +++DI P        +G G + DSGT  T
Sbjct: 241 QPKRIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGAGTIFDSGTVFT 300

Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
            LV   Y A+RDE   R+  +   +       CY+  +       PT+ F F G      
Sbjct: 301 RLVAPVYVAVRDEFRRRVGPKLTVTSLGGFDTCYNVPIV-----VPTITFIFTGMNVTLP 355

Query: 347 EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           + + + +       C+A+  A  N   V  ++I  M QQ + V YD+   +    R  C
Sbjct: 356 QDNILIHSTAGSTTCLAMAGAPDNVNSV-LNVIANMQQQNHRVLYDVPNSRVGVARELC 413


>gi|326495450|dbj|BAJ85821.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 491

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 104/357 (29%), Positives = 143/357 (40%), Gaps = 49/357 (13%)

Query: 74  QFTAMDTGSSLLWVHCYPC--RDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYF-PYAR-C 129
           Q  A+DT   + W+ C PC    C PQ    F P RSS+ A V C S  CR    YA  C
Sbjct: 159 QTMAIDTTEDVPWIQCLPCLIPQCYPQRNAFFDPRRSSTGAPVRCGSRACRTLGGYANGC 218

Query: 130 SAYKHR--CIYTQLYLIGPETSVFVSTEQLTFKN--TDESTIHVQDVV----FGCGFSTN 181
           S       C+Y   Y          S  +LT     TD  TI          FGC  +  
Sbjct: 219 SKPNSTGDCLYRIEY----------SDHRLTLGTYMTDTLTISPSTTFLNFRFGCSHAVR 268

Query: 182 RNF--KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLH--NKLILGDGAIID 233
             F  + SG   LG G  SL+SQ      ++FSYC+       +L     +   DG    
Sbjct: 269 GKFSAQASGTMSLGGGPQSLLSQTARAYGNAFSYCVPGPSAAGFLSIGGPVNGDDGGGSG 328

Query: 234 EGDATPL----QFIDGHYYIT-LEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
               TPL      I+   Y+  L+ I V GR L++ P +F      GG ++DS   +T L
Sbjct: 329 AFATTPLVRSANVINPTIYVVRLQGIEVAGRRLNVPPVVFS-----GGTVMDSSAVITQL 383

Query: 289 VKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEK 348
              AY ALR      +   + R+ +     C+  +  S +   PTV   F GGA + L  
Sbjct: 384 PPTAYRALRLAFRNAMRAYKTRAPTGNLDTCFDFVGVSKVT-VPTVSLVFDGGAVIELGL 442

Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
            S+         C+A  P + +        IG + QQ + V YD+      F+   C
Sbjct: 443 LSVLLDS-----CLAFAPMAAD---FALGFIGNVQQQTHEVLYDVAGGAVGFRHGAC 491


>gi|449442281|ref|XP_004138910.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449506266|ref|XP_004162699.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 482

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 92/372 (24%), Positives = 157/372 (42%), Gaps = 34/372 (9%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ--LGTIFYPSRSSSYAY-- 113
           + L++  I +G P    +  +DTGS +LWV+C  C +C  +  LG        SS +   
Sbjct: 71  SGLYFAKIGLGTPVQDYYVQVDTGSDILWVNCAGCTNCPKKSDLGIELSLYSPSSSSTSN 130

Query: 114 -VPCDSEHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD---EST 166
            V C+ + C      P   C+  +  C Y   Y  G  T+ +   + +         ++T
Sbjct: 131 RVTCNQDFCTSTYDGPIPGCTP-ELLCEYRVAYGDGSSTAGYFVRDHVVLDRVTGNFQTT 189

Query: 167 IHVQDVVFGCGFSTNRNF-----KFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLH 215
                +VFGCG   +           GI G G   SS++SQL SS      F++C+    
Sbjct: 190 STNGSIVFGCGAQQSGQLGATSAALDGILGFGQANSSMISQLASSGKVKRVFAHCL---- 245

Query: 216 DPDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG 274
             D ++   I   G ++  +   TPL     HY + ++AI VD  +L++  ++F  D   
Sbjct: 246 --DNINGGGIFAIGEVVQPKVRTTPLVPQQAHYNVFMKAIEVDNEVLNLPTDVFDTDLR- 302

Query: 275 GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTV 334
            G +IDSGT + +     YE L  ++  R    ++ +       C+    + D  GFPTV
Sbjct: 303 KGTIIDSGTTLAYFPDVIYEPLISKIFARQSTLKLHTVE-EQFTCFEYDGNVD-DGFPTV 360

Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNR-YVNFSLIGMMAQQFYNVGYDI 393
            FHF     L +      +    + +C+    +   +R   +  L+G +  Q   V YD+
Sbjct: 361 TFHFEDSLSLTVYPHEYLFDIDSNKWCVGWQNSGAQSRDGKDMILLGDLVLQNRLVMYDL 420

Query: 394 GRKQKTFQRMDC 405
             +   +   +C
Sbjct: 421 ENQTIGWTEYNC 432


>gi|357491933|ref|XP_003616254.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517589|gb|AES99212.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 442

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 92/370 (24%), Positives = 161/370 (43%), Gaps = 33/370 (8%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR---DCSPQLGTIFYPSRSSSYAYVPCD 117
             V + IG PP  Q   +DTGS L W+ C+  +      P   + F PS SSS+  +PC+
Sbjct: 82  LVVTLPIGTPPQLQQMVLDTGSQLSWIQCHNKKTPQKKQPPTTSSFDPSLSSSFFVLPCN 141

Query: 118 SEHCR-----YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
              C+     +     C A    C Y+  Y  G      +  E++ F  +  +      +
Sbjct: 142 HPLCKPRVPDFSLPTDCDA-NSLCHYSYFYADGTYAEGNLVREKIAFSPSQTTP----PI 196

Query: 173 VFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGA- 230
           + GC   ++      GI G+ +GR    SQ   + FSYC+ +            LG+   
Sbjct: 197 ILGCATQSD---DARGILGMNLGRLGFPSQAKITKFSYCVPT-KQAQPASGSFYLGNNPA 252

Query: 231 --------IIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMID 280
                   ++  G +  +  +D   Y + L+ IS+ G+ L+I P++FK +  G G  MID
Sbjct: 253 SSSFRYVNLLTFGQSQRMPNLDPLAYTLPLQGISIGGKKLNIPPSVFKPNAGGSGQTMID 312

Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD--KLCYHGIMSSDLKGFPTVRFHF 338
           SG++ T+LV EAY  +R+E++ ++  +  + Y +     +C+ G      +    + F F
Sbjct: 313 SGSEFTYLVDEAYNVIREELVKKVGPKIKKGYMYGGVADICFDGDAIEIGRLVGDMVFEF 372

Query: 339 RGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQK 398
             G ++ + K+ +         C+ +  +       N  +IG   QQ   V +D+  ++ 
Sbjct: 373 EKGVQIVIPKERVLATVDGGVHCLGMGRSERLGAGGN--IIGNFHQQNLWVEFDLANRRV 430

Query: 399 TFQRMDCEVL 408
            F   DC  L
Sbjct: 431 GFGEADCSKL 440


>gi|357117301|ref|XP_003560410.1| PREDICTED: uncharacterized protein LOC100833752 [Brachypodium
           distachyon]
          Length = 473

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 84/314 (26%), Positives = 133/314 (42%), Gaps = 60/314 (19%)

Query: 78  MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPC-DSEHCR--YFPYARCSAYKH 134
           MD  +   W+ C PC  C PQL  +F P++S ++  V   ++  CR  Y P         
Sbjct: 120 MDMAAGFSWMQCAPCHPCLPQLNPVFDPAKSPTFRPVSGHNAVLCRPPYHPL-----QDG 174

Query: 135 RCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF------SG 188
           RC +   Y  G   + +++ +  +F   D +  H+  +VFGC    NR  +F      +G
Sbjct: 175 RCGFGIAYRNGASAAGYLARDTFSFPTGDNNFQHLPGIVFGC---ANRIARFDTHGALAG 231

Query: 189 IFGLGIGR-----SSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGA--IIDEGDA 237
           + G+G+G      +  + QL       FSYC             ++ G  A   +  G+ 
Sbjct: 232 VLGMGMGAEGKPLTGFMRQLYHNGGGRFSYC------------PIVPGTTAYSFLRFGND 279

Query: 238 TPLQFIDG----------------HYYITLEAISVDG-RMLDINPNIFKRDDSG-GGVMI 279
            P Q   G                 YY+ L  ISV   R+  + P +F+RD  G GG  I
Sbjct: 280 IPSQPPAGVHRQSMAVLAPTTTSEAYYVKLAGISVGALRVPGVTPEMFERDQHGRGGCAI 339

Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD-KLCYHGIMSSDLKGFPTVRFHF 338
           D GT +T +V+ AY  +   V   L+  + R    P   LC H   + + +  P++  HF
Sbjct: 340 DIGTKMTAIVQTAYAHVEAAVRGHLQRNRARFVQSPGHHLCVHRTPAIEER-LPSMTLHF 398

Query: 339 RGGAKLALEKDSMF 352
            GG  L ++   +F
Sbjct: 399 VGGPWLRVKPQHLF 412


>gi|326524806|dbj|BAK04339.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 460

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 91/346 (26%), Positives = 146/346 (42%), Gaps = 37/346 (10%)

Query: 77  AMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRC 136
           A+D  ++LLW+ C P ++   QL   F P++S S+  +P ++  C   P       +  C
Sbjct: 102 ALDMTTNLLWMQCKPVQEPFTQLPPPFEPAKSPSFRRLPGNNAFCLPAPRGHRRTVQDPC 161

Query: 137 IYTQLYLIG-PETSVFVSTEQLTFKNTDESTIHVQDVVFGC-----GFSTNRNFKFSGIF 190
            +  + L G  +    +S E L F  + +    V  VV GC     GF+ N +   +G+ 
Sbjct: 162 KFHSIRLDGSADARGVLSNETLAFAASGQQQTEVTGVVIGCTHNSKGFNFNSHGVLAGVL 221

Query: 191 GLGIGRSSLVSQLNS---------SFSYCIGSLHDPDYLHNKLILGDGAIIDEGD--ATP 239
           GLG    SL+  L            FSYC+ S       H+  +  D  + +     +T 
Sbjct: 222 GLGRQAPSLIWTLGQHRHGTVQVHRFSYCLPSHGSSSSDHHTFLRFDDDVPNTQHMVSTK 281

Query: 240 LQFIDG-------HYYITLEAISVDGRMLDINPNIFKRDDSG----GGVMIDSGTDVTWL 288
           + ++D         Y+++L  ISV G+ L     +FKR   G     G   D+GT    +
Sbjct: 282 IMYMDSTTSRDFRAYFVSLTGISVAGKPLQDVKELFKRHVHGQVWTSGCAFDAGTPTMVM 341

Query: 289 VKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHF-RGGAKLALE 347
           +  AY  L+D V+  L+   ++  S    LC+    S   +  PTV   F    A+L L 
Sbjct: 342 IMPAYNKLKDAVVRHLKPLGLQIVSGQYHLCFRAT-SQLWQHLPTVMLQFAETEARLVLP 400

Query: 348 KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
              +F     D  C+AV       R  + ++IG M Q      YD+
Sbjct: 401 PQRLFVAVGYD-ICLAV------VRSYDITIIGAMQQVDKRFVYDV 439


>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 457

 Score = 95.1 bits (235), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 104/374 (27%), Positives = 164/374 (43%), Gaps = 39/374 (10%)

Query: 55  DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-CSPQLGTIFYPSRSSSYAY 113
            I +  +YV I +G P       +DTGSSL W+ C PC   C  Q+  IF PS S +Y  
Sbjct: 101 SIGSGNYYVKIGVGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSVSKTYKA 160

Query: 114 VPCDSEHCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIH 168
           + C S  C     +      CS     C+Y   Y     +  ++S + LT      S   
Sbjct: 161 LSCSSSQCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYLSQDVLTL---TPSAAP 217

Query: 169 VQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLN----SSFSYCIGSLHDPDYLHNK 223
               V+GCG      F + +GI GL   + S++ QL+    ++FSYC+ S        N 
Sbjct: 218 SSGFVYGCGQDNQGLFGRSAGIIGLANDKLSMLGQLSNKYGNAFSYCLPSSFSAQ--PNS 275

Query: 224 LILGDGAI-IDEGDATPLQF--------IDGHYYITLEAISVDGRMLDINPNIFKRDDSG 274
            + G  +I      ++P +F        I   Y++ L  I+V G+ L ++ + +      
Sbjct: 276 SVSGFLSIGASSLSSSPYKFTPLVKNPKIPSLYFLGLTTITVAGKPLGVSASSYNVP--- 332

Query: 275 GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGE--QMRSYSWPDKLCYHGIMSSDLKGFP 332
              +IDSGT +T L    Y AL+   ++ +  +  Q   +S  D  C+ G +  ++   P
Sbjct: 333 --TIIDSGTVITRLPVAIYNALKKSFVMIMSKKYAQAPGFSILDT-CFKGSV-KEMSTVP 388

Query: 333 TVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
            +R  FRGGA L L+  +   +      C+A+  AS N      S+IG   QQ + V YD
Sbjct: 389 EIRIIFRGGAGLELKVHNSLVEIEKGTTCLAI-AASSN----PISIIGNYQQQTFTVAYD 443

Query: 393 IGRKQKTFQRMDCE 406
           +   +  F    C+
Sbjct: 444 VANSKIGFAPGGCQ 457


>gi|449527515|ref|XP_004170756.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Cucumis
           sativus]
          Length = 364

 Score = 95.1 bits (235), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 102/361 (28%), Positives = 140/361 (38%), Gaps = 28/361 (7%)

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
           I +  F V   IG P      A+DT +   W+ C  C  C     T+F   +SSS+  +P
Sbjct: 21  IQSPTFVVRAKIGTPAQTLLLALDTSNDAAWIPCSGCIGCPST--TVFSSDKSSSFRPLP 78

Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
           C S  C   P   CS     C +   Y  G  T        L   N   +T  V    FG
Sbjct: 79  CQSPQCNQVPNPSCSG--SACGFNLTY--GSSTVA----ADLVQDNLTLATDSVPSYTFG 130

Query: 176 C-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGA 230
           C     G S           G         S   S+FSYC+ S    ++    L LG  A
Sbjct: 131 CIRKATGSSVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPSFKSVNF-SGSLRLGPVA 189

Query: 231 IIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVT 286
                  TPL         YY+ L +I V  +++DI P+       +G G +IDSGT  T
Sbjct: 190 QPIRIKYTPLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNSATGAGTVIDSGTTFT 249

Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
            LV  AY A+RDE   R+      S       CY   + S     PT+ F F  G  + L
Sbjct: 250 RLVAPAYTAVRDEFRRRVGRNVTVSSLGGFDTCYTVPIIS-----PTITFMF-AGMNVTL 303

Query: 347 EKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
             D+ + +       C+A+  A  N   V  ++I  M QQ + + +DI   +    R  C
Sbjct: 304 PPDNFLIHSTSGSTTCLAMAAAPDNVNSV-LNVIASMQQQNHRILFDIPNSRVGVARESC 362

Query: 406 E 406
            
Sbjct: 363 S 363


>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 450

 Score = 95.1 bits (235), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 94/365 (25%), Positives = 152/365 (41%), Gaps = 41/365 (11%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DCSPQLGTIFYPSRSSSYAYVPCDSE 119
           +   + +G P       +D+GSSL W+ C PC   C PQ G ++ P  SS+YA VPC + 
Sbjct: 108 YITRLGLGTPTTTYVMVVDSGSSLTWLQCAPCAVSCHPQAGPLYDPRASSTYAAVPCSAP 167

Query: 120 HCRYFPYA-----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
            C     A      CS     C Y   Y  G  +  ++S + ++  ++           +
Sbjct: 168 QCAELQAATLNPSSCSG-SGVCQYQASYGDGSFSFGYLSKDTVSLSSSGS----FPGFYY 222

Query: 175 GCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDG 229
           GCG      F + +G+ GL   + SL+SQL     +SF+YC+ +        +   L  G
Sbjct: 223 GCGQDNVGLFGRAAGLIGLARNKLSLLSQLAPSVGNSFAYCLPT----SAAASAGYLSFG 278

Query: 230 AIIDEGDATPLQF-------IDGH-YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDS 281
           +  D  +     +       +D   Y+++L  +SV G  L +  + +    +    +IDS
Sbjct: 279 SNSDNKNPGKYSYTSMVSSSLDASLYFVSLAGMSVAGSPLAVPSSEYGSLPT----IIDS 334

Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
           GT +T L    Y AL   V   L      +YS   + C+ G ++      P V   F GG
Sbjct: 335 GTVITRLPTPVYTALSKAVGAALAAPSAPAYSI-LQTCFKGQVAK--LPVPAVNMAFAGG 391

Query: 342 AKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQ 401
           A L L   ++         C+A  P        + ++IG   QQ ++V YD+   +  F 
Sbjct: 392 ATLRLTPGNVLVDVNETTTCLAFAPTD------STAIIGNTQQQTFSVVYDVKGSRIGFA 445

Query: 402 RMDCE 406
              C 
Sbjct: 446 AGGCS 450


>gi|21717154|gb|AAM76347.1|AC074196_5 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433293|gb|AAP54831.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
 gi|125532791|gb|EAY79356.1| hypothetical protein OsI_34485 [Oryza sativa Indica Group]
          Length = 397

 Score = 95.1 bits (235), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 95/376 (25%), Positives = 152/376 (40%), Gaps = 54/376 (14%)

Query: 63  VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR 122
            N +IG PP P    +D    L+W  C  C  C  Q   +F P+ SS++   PC ++ C+
Sbjct: 45  ANFTIGTPPQPASAIIDVAGELVWTQCSRCSRCFKQDLPLFIPNASSTFRPEPCGTDACK 104

Query: 123 YFPYARCSAYKHRCIY-----------TQLYLIGPET-SVFVSTEQLTFKNTDESTIHVQ 170
             P + CS     C Y           T L ++G ET ++  +T  L F     S I   
Sbjct: 105 STPTSNCSG--DVCTYESTTNIRLDRHTTLGIVGTETFAIGTATASLAFGCVVASDIDTM 162

Query: 171 DVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDG 229
           D               SG  GLG    SLV+Q+  + FSYC+          ++L LG  
Sbjct: 163 DGT-------------SGFIGLGRTPRSLVAQMKLTKFSYCLSPRGTGK--SSRLFLGSS 207

Query: 230 AIIDEGDATPLQ-FI------DGHYY--ITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
           A +  G++T    FI      D H+Y  ++L+AI      +           SGG +++ 
Sbjct: 208 AKLAGGESTSTAPFIKTSPDDDSHHYYLLSLDAIRAGNTTIATA-------QSGGILVMH 260

Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK---LCYHGIMSSDLKGFPTVRFH 337
           + +  + LV  AY A +  V   + G      + P +   LC+           P + F 
Sbjct: 261 TVSPFSLLVDSAYRAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFT 320

Query: 338 FRGGAKLALEKDSMFY---QPRPDAFCMAVNPASINNR--YVNFSLIGMMAQQFYNVGYD 392
           F+GG        + +        D  C A+   +  NR      S++G + Q+  +  YD
Sbjct: 321 FQGGGAALTVPPAKYLIDVGEEKDTACAAILSMARLNRTGLEGVSVLGSLQQENVHFLYD 380

Query: 393 IGRKQKTFQRMDCEVL 408
           + ++  +F+  DC  L
Sbjct: 381 LKKETLSFEPADCSSL 396


>gi|449449334|ref|XP_004142420.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 441

 Score = 95.1 bits (235), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 102/361 (28%), Positives = 141/361 (39%), Gaps = 28/361 (7%)

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
           I +  F V   IG P      A+DT +   W+ C  C  C     T+F   +SSS+  +P
Sbjct: 98  IQSPTFVVRAKIGTPAQTLLLALDTSNDAAWIPCSGCIGCPST--TVFSSDKSSSFRPLP 155

Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
           C S  C   P   CS     C +   Y  G  T        L   N   +T  V    FG
Sbjct: 156 CQSPQCNQVPNPSCSG--SACGFNLTY--GSSTVA----ADLVQDNLTLATDSVPSYTFG 207

Query: 176 C-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGA 230
           C     G S           G         S   S+FSYC+ S    ++    L LG  A
Sbjct: 208 CIRKATGSSVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPSFKSVNF-SGSLRLGPVA 266

Query: 231 IIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDD-SGGGVMIDSGTDVT 286
                  TPL         YY+ L +I V  +++DI P+    +  +G G +IDSGT  T
Sbjct: 267 QPIRIKYTPLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNSATGAGTVIDSGTTFT 326

Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
            LV  AY A+RDE   R+      S       CY   + S     PT+ F F  G  + L
Sbjct: 327 RLVAPAYTAVRDEFRRRVGRNVTVSSLGGFDTCYTVPIIS-----PTITFMF-AGMNVTL 380

Query: 347 EKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
             D+ + +       C+A+  A  N   V  ++I  M QQ + + +DI   +    R  C
Sbjct: 381 PPDNFLIHSTAGSTTCLAMAAAPDNVNSV-LNVIASMQQQNHRILFDIPNSRVGVARESC 439

Query: 406 E 406
            
Sbjct: 440 S 440


>gi|356537928|ref|XP_003537458.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 445

 Score = 95.1 bits (235), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 103/381 (27%), Positives = 160/381 (41%), Gaps = 42/381 (11%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
           N    V++++G PP      +DTGS L W+HC   ++    + ++F P  SSSY  +PC 
Sbjct: 67  NVTLTVSLTVGTPPQSVTMVLDTGSELSWLHCKKQQN----INSVFNPHLSSSYTPIPCM 122

Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ-DVVFG- 175
           S  C+     R       C    L  +    + F S E     +T   +   Q  ++FG 
Sbjct: 123 SPICKT--RTRDFLIPVSCDSNNLCHVTVSYADFTSLEGNLASDTFAISGSGQPGIIFGS 180

Query: 176 --CGFSTNRN--FKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGA 230
              GFS+N N   K +G+ G+  G  S V+Q+    FSYCI S  D   +   L+ GD  
Sbjct: 181 MDSGFSSNANEDSKTTGLMGMNRGSLSFVTQMGFPKFSYCI-SGKDASGV---LLFGDAT 236

Query: 231 IIDEGDA---------TPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMI 279
               G           TPL + D   Y + L  I V  + L +   IF  D +G G  M+
Sbjct: 237 FKWLGPLKYTPLVKMNTPLPYFDRVAYTVRLMGIRVGSKPLQVPKEIFAPDHTGAGQTMV 296

Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEG------EQMRSYSWPDKLCYHGIMSSDLKGFPT 333
           DSGT  T+L+   Y ALR+E + +  G      +    +     LC+       +   P 
Sbjct: 297 DSGTRFTFLLGSVYTALRNEFVAQTRGVLTLLEDPNFVFEGAMDLCFRVRRGGVVPAVPA 356

Query: 334 VRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMA-------QQF 386
           V   F  GA++++  + + Y+   D      N       + N  L+G+ A       QQ 
Sbjct: 357 VTMVFE-GAEMSVSGERLLYRVGGDGDVAKGNGDVYCLTFGNSDLLGIEAYVIGHHHQQN 415

Query: 387 YNVGYDIGRKQKTFQRMDCEV 407
             + +D+   +  F    CE+
Sbjct: 416 VWMEFDLVNSRVGFADTKCEL 436


>gi|326520109|dbj|BAK03979.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 463

 Score = 95.1 bits (235), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 104/392 (26%), Positives = 167/392 (42%), Gaps = 59/392 (15%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQ---ILPSNDIS 57
           ++HR+   +P    ++ P  R    +     R+  L  ++ S    +A    ++ +N + 
Sbjct: 64  ILHREHPCAP---ASKRPVRRSPSALQEYHTRVRRLANRLSSCPADEATASGLIFANGVP 120

Query: 58  NSLF-YVN-ISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
              + YV  + +G P       +DT SSL WV C PC +    L   F P+ SS+Y  V 
Sbjct: 121 WDYYSYVTQVQLGTPAKTHNVLVDTASSLSWVGCEPCINAC--LIPTFNPNASSTYKVVG 178

Query: 116 CDSEHCRYFPYA-----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
           C S  C   P A      C A    C Y Q Y     +   VS++ LT+       +  Q
Sbjct: 179 CGSALCNAVPSATMARKSCMAPTEGCSYRQSYHDYSLSVGVVSSDTLTYG------LGSQ 232

Query: 171 DVVFGC-GFSTNRNFKFSGIFGLGIGRSSLVSQLN-----SSFSYCI------GSLHDPD 218
             +FGC         ++SGI G+ + + SL SQ+       + SYC       G L    
Sbjct: 233 KFIFGCCNLFRGVGGRYSGILGMSVNKFSLFSQMTVGHRYRAMSYCFPHPRNQGFLQFGR 292

Query: 219 YLHNKLILGDGAIIDEGDATPLQFIDGH-YYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
           Y  +K +L           TPL +IDG+ Y++ +  + V+   LD+         SG   
Sbjct: 293 YDEHKSLL---------RFTPL-YIDGNNYFVHVSNVMVETMSLDV-------QSSGNQT 335

Query: 278 M---IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHG---IMSSDLKGF 331
           M    D+GT  T L +  + +L D V   +EG   R  +   + C+      +  DL   
Sbjct: 336 MRCFFDTGTPYTMLPQSLFVSLSDTVGNLVEG-YYRVGASTGQTCFQADGNWIEGDLY-M 393

Query: 332 PTVRFHFRGGAKLALEKDSMFYQPRPDAFCMA 363
           PTV+  F+ GA++ L  + + +   P+ FC+A
Sbjct: 394 PTVKIEFQNGARITLNSEDLMFMEEPNVFCLA 425


>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 387

 Score = 94.7 bits (234), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 100/363 (27%), Positives = 156/363 (42%), Gaps = 37/363 (10%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYVPCDSE 119
           + V +++G P +    A+DTGS + W  C PC   C  Q  T F P +SSSY  V C S 
Sbjct: 45  YLVKMALGTPKLSLSLALDTGSDITWTQCEPCVGSCYRQAQTKFDPRKSSSYKNVSCSSS 104

Query: 120 HCRYFPYARCS--AYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
            CR    +  +       CIY   Y  G  +  F +TE+LT   +D     + + +FGCG
Sbjct: 105 SCRIITDSGGARGCVSSTCIYKVQYGDGSYSVGFFATEKLTISPSDV----ISNFLFGCG 160

Query: 178 FSTNRNFKFSGIFGLGIGRSSLVS-----QLNSSFSYCIGSLHDPDYLHNKLILGDGAII 232
                 F               ++     + N+ F+YC+ S       H  L LG G + 
Sbjct: 161 QQNAGRFGRIAGLLGLGRGKLSLALQTSEKYNNLFTYCLPSFSSSSTGH--LTLG-GQVP 217

Query: 233 DEGDATPLQ--FIDGHYY-ITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLV 289
                TPL   F +  +Y I ++ +SV G +L I+ ++F    S  G +IDSGT +T L 
Sbjct: 218 KSVKFTPLSPAFKNTPFYGIDIKGLSVGGHVLPIDASVF----SNAGAIIDSGTVITRLQ 273

Query: 290 KEAYEALRDEVMIRLEGEQMRSYSWPDKL-----CYHGIMSSDLKGFPTVRFHFRGGAKL 344
              Y AL  +       + M+ Y   D       CY     ++    P + F F+GG ++
Sbjct: 274 PTVYSALSSKFQ-----QLMKDYPKTDGFSILDTCYD-FSGNESISVPRISFFFKGGVEV 327

Query: 345 ALEKDSMFYQPRP-DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
            ++   +       D  C+A  P   N+   +F + G   QQ Y+V +D+ + +  F   
Sbjct: 328 DIKFFGILTVINAWDKVCLAFAP---NDDDGDFVVFGNSQQQTYDVVHDLAKGRIGFAPS 384

Query: 404 DCE 406
            C 
Sbjct: 385 GCN 387


>gi|88174565|gb|ABD39357.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
          Length = 323

 Score = 94.7 bits (234), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 86/331 (25%), Positives = 142/331 (42%), Gaps = 39/331 (11%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + +++ +G P   Q   +DTGSS  WV C  C  C     T F  SRS++ A V C +  
Sbjct: 1   YVISVGLGTPSKTQIVEIDTGSSTSWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58

Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
           C      P+ + S     C +   Y  G  +   +  + LTF +  +    +    FGC 
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPGFTFGCN 114

Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS---FSYCIGSLHDPDYLHNKLI----LG 227
              F  N      G+ G+G G+ S++ Q + +   FSYC+          +K      LG
Sbjct: 115 MDSFGANEFGNVDGLLGMGAGQMSVLKQSSPTFDGFSYCLPLQMSERGFFSKTTGYFSLG 174

Query: 228 DGAIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSG 282
                   D    + +        +++ L AISVDG  L ++P+IF R     GV+ DSG
Sbjct: 175 GKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRK----GVVFDSG 230

Query: 283 TDVTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFR 339
           ++++++   A   L     E+++R    +  S    ++ CY  + S D    P +  HF 
Sbjct: 231 SELSYIPDRALSVLSQRIRELLLRRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFD 285

Query: 340 GGAKLALEKDSMFYQ---PRPDAFCMAVNPA 367
            GA+  L +  +F +      D +C+A  P 
Sbjct: 286 DGARFDLGRHGVFVERSVQEQDVWCLAFAPT 316


>gi|115466068|ref|NP_001056633.1| Os06g0119600 [Oryza sativa Japonica Group]
 gi|55296446|dbj|BAD68569.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|55296924|dbj|BAD68375.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113594673|dbj|BAF18547.1| Os06g0119600 [Oryza sativa Japonica Group]
 gi|215694767|dbj|BAG89958.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215737752|dbj|BAG96882.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 495

 Score = 94.7 bits (234), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 100/357 (28%), Positives = 147/357 (41%), Gaps = 42/357 (11%)

Query: 68  GQPPVPQFTAMDTGSSLLWVHCYPCR--DCSPQLGTIFYPSRSSSYAYVPCDSEHCRYF- 124
           G   V Q   +D+GS + WV C PC    C  Q   +F P+ S++YA VPC S  C    
Sbjct: 162 GTSAVTQTVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLG 221

Query: 125 PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS---TN 181
           PY R  +   +C +   Y  G   +   S + LT    D     ++   FGC  +   + 
Sbjct: 222 PYRRGCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYDV----IRGFRFGCAHADRGSA 277

Query: 182 RNFKFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILG----DGAIID 233
            ++  +G   LG G  SLV Q  +     FSYC+            L+LG       +I 
Sbjct: 278 FDYDVAGSLALGGGSQSLVQQTATRYGRVFSYCLPPTASS---LGFLVLGVPPERAQLIP 334

Query: 234 EGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVK 290
              +TPL         Y + L AI V GR L + P +F         +IDS T ++ L  
Sbjct: 335 SFVSTPLLSSSMAPTFYRVLLRAIIVAGRPLAVPPAVFSASS-----VIDSSTIISRLPP 389

Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKLCY--HGIMSSDLKGFPTVRFHFRGGAKLALEK 348
            AY+ALR      +   +          CY   G+ S  L   P++   F GGA + L+ 
Sbjct: 390 TAYQALRAAFRSAMTMYRAAPPVSILDTCYDFTGVRSITL---PSIALVFDGGATVNLDA 446

Query: 349 DSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
             +         C+A  P + ++R   F  IG + Q+   V YD+  K   F+   C
Sbjct: 447 AGILL-----GSCLAFAPTA-SDRMPGF--IGNVQQKTLEVVYDVPAKAMRFRTAAC 495


>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
 gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
 gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
 gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 632

 Score = 94.7 bits (234), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 104/380 (27%), Positives = 161/380 (42%), Gaps = 55/380 (14%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
           N  +   + IG PP      +D+GS++ +V C  C  C       F P  SS+Y  V C+
Sbjct: 90  NGYYTTRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQCGKHQDPKFQPEMSSTYQPVKCN 149

Query: 118 SEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
            +         C   + +C+Y + Y     +   +  + ++F N  ES +  Q  VFGC 
Sbjct: 150 MD-------CNCDDDREQCVYEREYAEHSSSKGVLGEDLISFGN--ESQLTPQRAVFGCE 200

Query: 178 FSTNRNF---KFSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLHNKLILGD 228
                +    +  GI GLG G  SLV QL      ++SF  C G +           +G 
Sbjct: 201 TVETGDLYSQRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMD----------VGG 250

Query: 229 GAIIDEGDATP--LQFIDG------HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMID 280
           G++I  G   P  + F D       +Y I L  I V G+ L ++  +F   D   G ++D
Sbjct: 251 GSMILGGFDYPSDMVFTDSDPDRSPYYNIDLTGIRVAGKQLSLHSRVF---DGEHGAVLD 307

Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD----KLCYHGIMS---SDL-KGFP 332
           SGT   +L   A+ A  + VM   E   ++    PD      C+    S   S+L K FP
Sbjct: 308 SGTTYAYLPDAAFAAFEEAVM--REVSTLKQIDGPDPNFKDTCFQVAASNYVSELSKIFP 365

Query: 333 TVRFHFRGGAKLALEKDS-MFYQPRP-DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
           +V   F+ G    L  ++ MF   +   A+C+ V P    N   + +L+G +  +   V 
Sbjct: 366 SVEMVFKSGQSWLLSPENYMFRHSKVHGAYCLGVFP----NGKDHTTLLGGIVVRNTLVV 421

Query: 391 YDIGRKQKTFQRMDCEVLDD 410
           YD    +  F R +C  L D
Sbjct: 422 YDRENSKVGFWRTNCSELSD 441


>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
 gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
          Length = 438

 Score = 94.7 bits (234), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 102/391 (26%), Positives = 157/391 (40%), Gaps = 37/391 (9%)

Query: 32  RLAYLQEKIRSHNTYQAQILPSNDISNSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHCY 90
           RL YL   +    T    I P   +     YV  + +G P    F  +DT +   WV C 
Sbjct: 69  RLKYL-STLADQKTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSNDAAWVPCS 127

Query: 91  PCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAY-KHRCIYTQLYLIGPETS 149
            C  CS    T F P+ S++   + C    C       C A     C++ Q Y  G ++S
Sbjct: 128 GCTGCS---STTFLPNASTTLGSLDCSGAQCSQVRGFSCPATGSSACLFNQSY--GGDSS 182

Query: 150 VFVSTEQLTFKNTDESTIHVQDVV----FGC-GFSTNRNFKFSGIFGLGIGRSSLVSQLN 204
                  LT     ++     DV+    FGC    +  +    G+ GLG G  SL+SQ  
Sbjct: 183 -------LTATLVQDAITLANDVIPGFTFGCINAVSGGSIPPQGLLGLGRGPISLISQAG 235

Query: 205 S----SFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH----YYITLEAISV 256
           +     FSYC+ S     Y    L LG          TPL   + H    YY+ L  +SV
Sbjct: 236 AMYSGVFSYCLPSFKS-YYFSGSLKLGPVGQPKSIRTTPL-LRNPHRPSLYYVNLTGVSV 293

Query: 257 DGRMLDINPNIFKRD-DSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWP 315
               + I       D ++G G +IDSGT +T  V+  Y A+RDE   ++ G  + S    
Sbjct: 294 GRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNGP-ISSLGAF 352

Query: 316 DKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVN 375
           D  C+     ++    P +  HF G   +   ++S+ +       C+++  A+ NN    
Sbjct: 353 DT-CFAATNEAEA---PAITLHFEGLNLVLPMENSLIHSSSGSLACLSMA-AAPNNVNSV 407

Query: 376 FSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
            ++I  + QQ   + +D    +    R  C 
Sbjct: 408 LNVIANLQQQNLRIMFDTTNSRLGIARELCN 438


>gi|218184943|gb|EEC67370.1| hypothetical protein OsI_34481 [Oryza sativa Indica Group]
          Length = 367

 Score = 94.4 bits (233), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 84/359 (23%), Positives = 158/359 (44%), Gaps = 33/359 (9%)

Query: 64  NISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRY 123
           N +IG PP      +D    L+W  C  C  C  Q   +F P+ SS++   PC ++ C+ 
Sbjct: 27  NFTIGTPPQAASAFIDLTGELVWTQCSQCIHCFKQDLPVFVPNASSTFKPEPCGTDVCKS 86

Query: 124 FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRN 183
            P  +C++    C +  +  +G  T   V+T+         +++      FGC  +++ +
Sbjct: 87  IPTPKCAS--DVCAFDGVTGLGGHTVGIVATDTFAIGTAAPASLG-----FGCVVASDID 139

Query: 184 FKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDA--- 237
                SG  GLG    SLV+Q+  + FSYC+   HD    +++L LG  A +  G A   
Sbjct: 140 TMGGPSGFIGLGRTPWSLVAQMKLTRFSYCLAP-HDTGK-NSRLFLGASAKLAGGGAWTP 197

Query: 238 ----TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSG-TDVTWLVKEA 292
               +P   +  +Y I LE I    +  D    + +  ++   V++ +    V+ LV   
Sbjct: 198 FVKTSPNDGMSQYYPIELEEI----KAGDATITMPRGRNT---VLVQTAVVRVSLLVDSV 250

Query: 293 YEALRDEVMIRLEGEQMRS-YSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSM 351
           Y+  +  VM  +      +    P ++C+     + + G P + F F+ GA L +   + 
Sbjct: 251 YQEFKKAVMASVGAAPTATPVGEPFEVCFP---KAGVSGAPDLVFTFQAGAALTVPPANY 307

Query: 352 FYQPRPDAFCMAVNPASINNRYV--NFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
            +    D  C++V   ++ N       +++G   Q+  ++ +D+ +   +F+  DC  L
Sbjct: 308 LFDVGNDTVCLSVMSIALLNITALDGLNILGSFQQENVHLLFDLDKDMLSFEPADCSSL 366


>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 442

 Score = 94.4 bits (233), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 96/378 (25%), Positives = 153/378 (40%), Gaps = 31/378 (8%)

Query: 48  AQILPSNDISNSLFYVNISIGQP-PVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPS 106
           A +  +N   NS + +++SIG P   P    +DTGS ++W  C PC +C  Q    F  +
Sbjct: 79  APVGRANTDVNSEYLIHLSIGAPRSQPVVLTLDTGSDVVWTQCEPCAECFTQPLPRFDTA 138

Query: 107 RSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKN-TDES 165
            S++   V C    C       C  + H C Y   Y  G  +      +  TF +     
Sbjct: 139 ASNTVRSVACSDPLCNAHSEHGC--FLHGCTYVSGYGDGSLSFGHFLRDSFTFDDGKGGG 196

Query: 166 TIHVQDVVFGCGFSTNRNF--KFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHN 222
            + V D+ FGCG      F    +GI G G G  SL SQL    FSYC  +  +     +
Sbjct: 197 KVTVPDIGFGCGMYNAGRFLQTETGIAGFGRGPLSLPSQLKVRQFSYCFTTRFEAK--SS 254

Query: 223 KLILGDGAIIDEGDATPL---QFI--------DGHYYITLEAISVDGRMLDINPNIFKRD 271
            + LG    +      P+    F+        + HY ++ + ++V    L + P I  + 
Sbjct: 255 PVFLGGAGDLKAHATGPILSTPFVRSLPPGTDNSHYVLSFKGVTVGKTRLPV-PEI--KA 311

Query: 272 DSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF 331
           D  G   IDSGTD+T      +  L+    I      +   +  D +C+           
Sbjct: 312 DGSGATFIDSGTDITTFPDAVFRQLK-SAFIAQAALPVNKTADEDDICFS-WDGKKTAAM 369

Query: 332 PTVRFHFRGGAKLALEKDSMFYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
           P + FH   GA   L +++   + R     C+AV+ +   +R    +LIG   QQ  ++ 
Sbjct: 370 PKLVFHLE-GADWDLPRENYVTEDRESGQVCVAVSTSGQMDR----TLIGNFQQQNTHIV 424

Query: 391 YDIGRKQKTFQRMDCEVL 408
           YD+   +       C+ L
Sbjct: 425 YDLAAGKLLLVPAQCDKL 442


>gi|88174581|gb|ABD39365.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
           Group]
          Length = 321

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 85/329 (25%), Positives = 140/329 (42%), Gaps = 37/329 (11%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + +++ +G P   Q   +DTGSS  WV C  C  C     T F  SRS++ A V C +  
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58

Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
           C      P+ + S     C +   Y  G  +   +  + LTF +  +    +    FGC 
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPSFTFGCN 114

Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLN---SSFSYCIGSLHDPDYLHNKLI--LGDG 229
              F  N      G+ G+G G  S++ Q +     FSYC+          +K       G
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDGFSYCLPLQKSERGFFSKTTGYFSLG 174

Query: 230 AIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
            +    D    + +        +++ L AISVDG  L ++P+IF R     GV+ DSG++
Sbjct: 175 KVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK----GVVFDSGSE 230

Query: 285 VTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
           ++++   A   L     E+++R    +  S    ++ CY  + S D    P +  HF  G
Sbjct: 231 LSYIPDRALSVLSQRIRELLLRRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFDDG 285

Query: 342 AKLALEKDSMFYQ---PRPDAFCMAVNPA 367
           A+  L +  +F +      D +C+A  P 
Sbjct: 286 ARFDLGRRGVFVERSVQEQDVWCLAFAPT 314


>gi|88174569|gb|ABD39359.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
          Length = 321

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 85/329 (25%), Positives = 140/329 (42%), Gaps = 37/329 (11%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + +++ +G P   Q   +DTGSS  WV C  C  C     T F  SRS++ A V C +  
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTTWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58

Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
           C      P+ + S     C +   Y  G  +   +  + LTF +  +    +    FGC 
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPSFTFGCN 114

Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS---FSYCIGSLHDPDYLHNKLI--LGDG 229
              F  N      G+ G+G G  S++ Q + +   FSYC+          +K       G
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSERGFFSKTTGYFSLG 174

Query: 230 AIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
            +    D    + +        +++ L AISVDG  L ++P+IF R     GV+ DSG++
Sbjct: 175 KVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK----GVVFDSGSE 230

Query: 285 VTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
           ++++   A   L     E+++R    +  S    ++ CY  + S D    P +  HF  G
Sbjct: 231 LSYIPDRALSVLSQRIRELLLRRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFDDG 285

Query: 342 AKLALEKDSMFYQ---PRPDAFCMAVNPA 367
           A+  L    +F +      D +C+A  P 
Sbjct: 286 ARFDLGSRGVFVERSVQEQDVWCLAFAPT 314


>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 449

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 89/381 (23%), Positives = 161/381 (42%), Gaps = 33/381 (8%)

Query: 52  PSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC-YPC--RDCSPQLG------TI 102
           P+ D     + V   +G P        DTGS L W+ C Y C  R+CS +         +
Sbjct: 74  PAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRV 133

Query: 103 FYPSRSSSYAYVPCDSEHCR-----YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQL 157
           F+ + SSS+  +PC ++ C+      F    C      C Y   Y  G     F + E +
Sbjct: 134 FHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETV 193

Query: 158 TFKNTDESTIHVQDVVFGCGFS-TNRNFKFS-GIFGLGIGRSSL----VSQLNSSFSYCI 211
           T +  +   + + +V+ GC  S   ++F+ + G+ GLG  + S       +    FSYC+
Sbjct: 194 TVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCL 253

Query: 212 GSLHDPDYLHNKLILGDG----AIIDEGDATPLQF--IDGHYYITLEAISVDGRMLDINP 265
                   + N L  G      A+++    T L    ++  Y + +  IS+ G ML I  
Sbjct: 254 VDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPS 313

Query: 266 NIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIR-LEGEQMRSYSWPDKLCYHGIM 324
            ++    +GG ++ DSG+ +T+L + AY+ +   + +  L+  ++     P + C++   
Sbjct: 314 EVWDVKGAGGTIL-DSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTG 372

Query: 325 SSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQ 384
             +    P + FHF  GA+      S          C+      ++  +   S++G + Q
Sbjct: 373 FEE-SLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGF----VSVAWPGTSVVGNIMQ 427

Query: 385 QFYNVGYDIGRKQKTFQRMDC 405
           Q +   +D+G K+  F    C
Sbjct: 428 QNHLWEFDLGLKKLGFAPSSC 448


>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
          Length = 378

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 91/383 (23%), Positives = 163/383 (42%), Gaps = 37/383 (9%)

Query: 52  PSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC-YPC--RDCSPQLG------TI 102
           P+ D     + V   +G P        DTGS L W+ C Y C  R+CS +         +
Sbjct: 3   PAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRV 62

Query: 103 FYPSRSSSYAYVPCDSEHCR-----YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQL 157
           F+ + SSS+  +PC ++ C+      F    C      C Y   Y  G     F + E +
Sbjct: 63  FHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETV 122

Query: 158 TFKNTDESTIHVQDVVFGCGFS-TNRNFKFS-GIFGLGIGRSSL----VSQLNSSFSYCI 211
           T +  +   + + +V+ GC  S   ++F+ + G+ GLG  + S       +    FSYC+
Sbjct: 123 TVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCL 182

Query: 212 GSLHDPDYLHNKLILGDG----AIIDEGDATPLQF--IDGHYYITLEAISVDGRMLDINP 265
                   + N L  G      A+++    T L    ++  Y + +  IS+ G ML I  
Sbjct: 183 VDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPS 242

Query: 266 NIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIR-LEGEQMRSYSWPDKLCYH--G 322
            ++    + GG ++DSG+ +T+L + AY+ +   + +  L+  ++     P + C++  G
Sbjct: 243 EVWDVKGA-GGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTG 301

Query: 323 IMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMM 382
              S +   P + FHF  GA+      S          C+      ++  +   S++G +
Sbjct: 302 FEESLV---PRLVFHFADGAEFEPPVKSYVISAADGVRCLGF----VSVAWPGTSVVGNI 354

Query: 383 AQQFYNVGYDIGRKQKTFQRMDC 405
            QQ +   +D+G K+  F    C
Sbjct: 355 MQQNHLWEFDLGLKKLGFAPSSC 377


>gi|255586856|ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223525945|gb|EEF28342.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 533

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 101/376 (26%), Positives = 163/376 (43%), Gaps = 57/376 (15%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----------TIFYPSRS 108
           L Y N+SIG P +    A+DTGS L W+ C  C +     G            I+ P+ S
Sbjct: 112 LHYANVSIGTPSLSYLVALDTGSDLFWLPC-DCTNSGCVQGLQFPSGEQIDFNIYRPNAS 170

Query: 109 SSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIH 168
           S+   +PC++  C     +RC + +  C Y   YL    +S  V  E L    TD++   
Sbjct: 171 STSQTIPCNNTLCSR--QSRCPSAQSTCPYQVQYLSNGTSSTGVLVEDLLHLTTDDAQSR 228

Query: 169 VQD--VVFGCGFSTNRNF----KFSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHD 216
             D  ++FGCG     +F      +G+FGLG+   S+ S L      ++SFS C G    
Sbjct: 229 ALDAKIIFGCGRVQTGSFLDGAAPNGLFGLGMTNISVPSTLAREGYTSNSFSMCFGR--- 285

Query: 217 PDYLHNKLILGDGAIIDEGDATPLQFIDGH--YYITLEAISVDGRMLDINPNIFKRDDSG 274
            D +  ++  GD     +G+ TP      H  Y +++  I+V GR  D+  +        
Sbjct: 286 -DGI-GRISFGDTGSSGQGE-TPFNLRQLHPTYNVSITKINVGGRDADLEFS-------- 334

Query: 275 GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYS-WPDKLCYHGIMSSDLKGF-- 331
              + DSGT  T+L   AY  + +   I  + ++  S S  P + CY   MSS+      
Sbjct: 335 --AIFDSGTSFTYLNDPAYTLISESFNIGAKEKRYSSISDIPFEYCYE--MSSNQTNLEI 390

Query: 332 PTVRFHFRGGAKLALEKD--SMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNV 389
           PTV    +GG++  +      +  Q     +C+A+    + +  VN  +IG      Y +
Sbjct: 391 PTVNLVMQGGSQFNVTDPIVIVILQGGASIYCLAI----VKSGDVN--IIGQNFMTGYRI 444

Query: 390 GYDIGRKQKTFQRMDC 405
            ++  R    ++  DC
Sbjct: 445 VFNRERNVLGWKASDC 460


>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
 gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
 gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
 gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 445

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 90/352 (25%), Positives = 140/352 (39%), Gaps = 27/352 (7%)

Query: 67  IGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPY 126
           +G P      A+D  +   WV C  C  C+    + F P++SS+Y  VPC S  C   P 
Sbjct: 108 LGTPAQTLLVAIDPSNDAAWVPCSACAGCAASSPS-FSPTQSSTYRTVPCGSPQCAQVPS 166

Query: 127 ARCSA-YKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC-GFSTNRNF 184
             C A     C +   Y      +V    + L  +N       V    FGC    +  + 
Sbjct: 167 PSCPAGVGSSCGFNLTYAASTFQAVL-GQDSLALENN-----VVVSYTFGCLRVVSGNSV 220

Query: 185 KFSGIFGLGIGRSSLVSQLN----SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPL 240
              G+ G G G  S +SQ      S FSYC+ +    ++    L LG          TPL
Sbjct: 221 PPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPNYRSSNF-SGTLKLGPIGQPKRIKTTPL 279

Query: 241 QFIDGH----YYITLEAISVDGRMLDINPNIFKRDD-SGGGVMIDSGTDVTWLVKEAYEA 295
            + + H    YY+ +  I V  +++ +  +    +  +G G +ID+GT  T L    Y A
Sbjct: 280 LY-NPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTMFTRLAAPVYAA 338

Query: 296 LRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL-EKDSMFYQ 354
           +RD    R+           D  CY+  +S      PTV F F G   + L E++ M + 
Sbjct: 339 VRDAFRGRVRTPVAPPLGGFDT-CYNVTVS-----VPTVTFMFAGAVAVTLPEENVMIHS 392

Query: 355 PRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
                 C+A+     +      +++  M QQ   V +D+   +  F R  C 
Sbjct: 393 SSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRELCT 444


>gi|125579874|gb|EAZ21020.1| hypothetical protein OsJ_36669 [Oryza sativa Japonica Group]
          Length = 382

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 84/247 (34%), Positives = 114/247 (46%), Gaps = 31/247 (12%)

Query: 182 RNFKFSGIFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDYLHNK-----LILGDGAIID-E 234
           R+   SG+ GLG GR SLVSQ  ++ FSYC+       Y HN      L +G  A +   
Sbjct: 147 RSMAPSGLMGLGRGRLSLVSQTGATKFSYCL-----TPYFHNNGATGHLFVGASASLGGH 201

Query: 235 GDATPLQFIDG-----HYYITLEAISVDGRMLDINPNIFKRDDSG-----GGVMIDSGTD 284
           GD    QF+ G      YY+ L  ++V    L I   +F   +       GGV+IDSG+ 
Sbjct: 202 GDVMTTQFVKGPKGSPFYYLPLIGLTVGETRLPIPATVFDLREVAPGLFSGGVIIDSGSP 261

Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD--KLCYHGIMSSDL-KGFPTVRFHFRGG 341
            T LV +AY+AL  E+  RL G  +      D   LC   +   D+ +  P V FHFRGG
Sbjct: 262 FTSLVHDAYDALASELAARLNGSLVAPPPDADDGALC---VARRDVGRVVPAVVFHFRGG 318

Query: 342 AKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQ 401
           A +A+  +S +    P     A    +    Y   S+IG   QQ   V YD+     +FQ
Sbjct: 319 ADMAVPAESYW---APVDKAAACMAIASAGPYRRQSVIGNYQQQNMRVLYDLANGDFSFQ 375

Query: 402 RMDCEVL 408
             DC  L
Sbjct: 376 PADCSAL 382


>gi|88174577|gb|ABD39363.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
           Group]
          Length = 321

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 85/329 (25%), Positives = 140/329 (42%), Gaps = 37/329 (11%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           +  ++ +G P   Q   +DTGSS+ WV C  C  C     T F  SRS++ A V C +  
Sbjct: 1   YVTSVGLGTPAKTQIVEIDTGSSISWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58

Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
           C      P+ + S     C +   Y  G  +   +  + LTF +  +    +    FGC 
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPSFTFGCN 114

Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS---FSYCIGSLHDPDYLHNKLI--LGDG 229
              F  N      G+ G+G G  S++ Q + +   FSYC+          +K       G
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSERGFFSKTTGYFSLG 174

Query: 230 AIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
            +    D    + +        +++ L AISVDG  L ++P+IF R     GV+ DSG++
Sbjct: 175 KVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK----GVVFDSGSE 230

Query: 285 VTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
           ++++   A   L     E+++R    +  S    ++ CY  + S D    P +  HF  G
Sbjct: 231 LSYIPDRALSVLSQRIRELLLRRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFDDG 285

Query: 342 AKLALEKDSMFYQ---PRPDAFCMAVNPA 367
           A+  L    +F +      D +C+A  P 
Sbjct: 286 ARFDLGSSGVFVERSVQEQDVWCLAFAPT 314


>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
          Length = 426

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 90/351 (25%), Positives = 140/351 (39%), Gaps = 27/351 (7%)

Query: 67  IGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPY 126
           +G P      A+D  +   WV C  C  C+    + F P++SS+Y  VPC S  C   P 
Sbjct: 89  LGTPAQTLLVAIDPSNDAAWVPCSACAGCAASSPS-FSPTQSSTYRTVPCGSPQCAQVPS 147

Query: 127 ARCSA-YKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC-GFSTNRNF 184
             C A     C +   Y      +V    + L  +N       V    FGC    +  + 
Sbjct: 148 PSCPAGVGSSCGFNLTYAASTFQAVL-GQDSLALENN-----VVVSYTFGCLRVVSGNSV 201

Query: 185 KFSGIFGLGIGRSSLVSQLN----SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPL 240
              G+ G G G  S +SQ      S FSYC+ +    ++    L LG          TPL
Sbjct: 202 PPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPNYRSSNF-SGTLKLGPIGQPKRIKTTPL 260

Query: 241 QFIDGH----YYITLEAISVDGRMLDINPNIFKRDD-SGGGVMIDSGTDVTWLVKEAYEA 295
            + + H    YY+ +  I V  +++ +  +    +  +G G +ID+GT  T L    Y A
Sbjct: 261 LY-NPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTMFTRLAAPVYAA 319

Query: 296 LRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL-EKDSMFYQ 354
           +RD    R+           D  CY+  +S      PTV F F G   + L E++ M + 
Sbjct: 320 VRDAFRGRVRTPVAPPLGGFDT-CYNVTVS-----VPTVTFMFAGAVAVTLPEENVMIHS 373

Query: 355 PRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
                 C+A+     +      +++  M QQ   V +D+   +  F R  C
Sbjct: 374 SSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRELC 424


>gi|88174567|gb|ABD39358.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
          Length = 323

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 86/331 (25%), Positives = 141/331 (42%), Gaps = 39/331 (11%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + +++ +G P   Q   +DTGSS  WV C  C  C     T F  SRS++ A V C +  
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58

Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
           C      P+ + S     C +   Y  G  +   +  + LTF +  +    +    FGC 
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPGFTFGCN 114

Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS---FSYCIGSLHDPDYLHNKLI----LG 227
              F  N      G+ G+G G+ S++ Q + +   FSYC+          +K      LG
Sbjct: 115 MDSFGANEFGNVDGLLGMGAGQMSVLKQSSPTFDGFSYCLPLQMSERGFFSKTTGYFSLG 174

Query: 228 DGAIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSG 282
                   D    + +        +++ L AISVDG  L ++P+IF R     GV+ DSG
Sbjct: 175 GKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRK----GVVFDSG 230

Query: 283 TDVTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFR 339
           ++++++   A   L     E+++R    +  S    ++ CY  + S D    P +  HF 
Sbjct: 231 SELSYIPDRALSVLSQRIRELLLRRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFD 285

Query: 340 GGAKLALEKDSMFYQ---PRPDAFCMAVNPA 367
            GA+  L    +F +      D +C+A  P 
Sbjct: 286 DGARFDLGSHGVFVERSVQEQDVWCLAFAPT 316


>gi|326517745|dbj|BAK03791.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 556

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 97/378 (25%), Positives = 151/378 (39%), Gaps = 38/378 (10%)

Query: 50  ILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DCSPQL--GTIFYPS 106
           ++ + DI+N LF + I +G PPV    A+DTG++L +V C PC   C  Q   G IF PS
Sbjct: 195 LIQNGDINNFLFLMPIKLGTPPVWNLVAVDTGATLSFVQCEPCTLRCHKQTDAGEIFDPS 254

Query: 107 RSSSYAYVPCDSEHCRYFPYA------RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFK 160
           +S S++ V C    CR    A       C   +  C+Y+  +      SV          
Sbjct: 255 KSESFSRVGCSENKCRTVQRALHLQSKACMEKEDSCLYSMTFGGTSSYSVGKLVRDRLAI 314

Query: 161 NTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-----SSFSYCIGSLH 215
                     D +FGC   T  +   +G+ G      S   Q+       +FSYC  S  
Sbjct: 315 GKYAKGYSFPDFLFGCSLDTEYHQYEAGLVGFADEPFSFFEQVAPLVNYKAFSYCFPSDR 374

Query: 216 DPDYLHNKLILGDGAIIDEGDATPLQFI--DGHYYITLEAISVDGRMLDINPNIFKRDDS 273
                   L +GD   ++    TPL        Y + L+ + V+G  L   P+       
Sbjct: 375 RKT---GYLSIGDYTRVNS-TYTPLFLARQQSRYALKLDEVLVNGMALVTTPS------- 423

Query: 274 GGGVMIDSGTDVTWLVKEAYEALRDEVM--IRLEGEQMRSYSWPDKLCY---HGIMSSDL 328
              +++DSG+  T L+ + +  L   +   +R  G     Y   D +C+   H    SD 
Sbjct: 424 --EMIVDSGSRWTILLSDTFTQLDAAITEAMRPLGYNRNYYRGSDYICFEDAHFQQFSDW 481

Query: 329 KGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCM-AVNPASINNRYVNFSLIGMMAQQFY 387
              P V   F  G K+ L+  S F+       C   +  AS+ +      L+G    +  
Sbjct: 482 AALPVVELKFDMGVKMVLQPQSSFHFNNDYGLCTYFMRDASLGS---GVQLLGNTMTRSV 538

Query: 388 NVGYDIGRKQKTFQRMDC 405
            + +DI   Q  F++ DC
Sbjct: 539 GITFDIQGGQFGFRKGDC 556


>gi|357116170|ref|XP_003559856.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 460

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 99/376 (26%), Positives = 157/376 (41%), Gaps = 41/376 (10%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + V  S+G PP     A+DT +   WV C  C  C P     F P+ S+++  VPC +  
Sbjct: 94  YLVRASLGTPPQRLLLAVDTSNDAAWVPCAGCHGC-PTTAPSFNPASSATFRPVPCGAPP 152

Query: 121 CRYFPYARCSAY---KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
           C   P   C++    K+ C ++  Y    ++S+  +  Q     T    + ++   FGC 
Sbjct: 153 CSQAPNPSCTSLAKSKNSCGFSLSYG---DSSLDATLSQDNLAVTANGGV-IKGYTFGCL 208

Query: 178 FSTNRNFKFS-GIFGLGIGRSSLVSQLNS----SFSYCIGSLH-DPDYLHNKLILG-DGA 230
             +N +   + G+ GLG G    V+Q       +FSYC+ S +         L LG  G 
Sbjct: 209 TKSNGSAAPAQGLLGLGRGPLGFVAQTKGIYEGTFSYCLPSYYRSAANFSGSLTLGRKGQ 268

Query: 231 IIDEGDATPLQFIDGH----YYITLEAISVDGRMLDINPNIFKRDD-SGGGVMIDSGTDV 285
              E   T       H    YY+ +  + +  + + I P+    D  +G G ++DSGT  
Sbjct: 269 PAPEKMKTTPLLASPHRPSLYYVAMTGVRIGKKSVPIPPSALAFDAATGAGTVLDSGTMF 328

Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----------PTV 334
             L + AY A+RDEV  R+ G   R            +  S L GF           P V
Sbjct: 329 ARLAQPAYAAVRDEVRRRVAGSLRRR-----GGGGASVSVSSLGGFDTCYNVSTVAWPAV 383

Query: 335 RFHFRGGAKLALEKDSMFYQP---RPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
              F GG ++ L ++++  +          MA +PA   N  +N  +IG + QQ + V +
Sbjct: 384 TLVFGGGMEVRLPEENVVIRSTYGSTSCLAMAASPADGVNAALN--VIGSLQQQNHRVLF 441

Query: 392 DIGRKQKTFQRMDCEV 407
           D+   +  F R  C  
Sbjct: 442 DVPNARVGFARERCTA 457


>gi|242063796|ref|XP_002453187.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
 gi|241933018|gb|EES06163.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
          Length = 493

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 104/370 (28%), Positives = 155/370 (41%), Gaps = 41/370 (11%)

Query: 61  FYVNISIGQPP-VPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIFYPSRSSSYAYVPCDS 118
           + + + +G PP   Q   +DTGS + WV C PC + C PQ+  +F PS SS+Y+   C S
Sbjct: 140 YVITVRLGSPPGKSQTMLIDTGSDISWVRCKPCWQQCRPQVDPLFDPSLSSTYSPFSCSS 199

Query: 119 EHCRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
             C        A   +   +C Y  +Y  G   +    +       ++ +T+ V    FG
Sbjct: 200 AACAQLFQEGNANGCSSSGQCQYIAMYGDGSVGTTGTYSSDTLALGSNSNTVVVSKFRFG 259

Query: 176 CGFS-TNRNFKFSGIFGLGIGRSSLVSQL-----NSSFSYCIGSLHDPDYLHNKLILGDG 229
           C  + T      +G+ GLG G  SLVSQ       ++FSYC+            L LG  
Sbjct: 260 CSHAETGITGLTAGLMGLGGGAQSLVSQTAGTFGTTAFSYCLPPTPSSSGF---LTLGAA 316

Query: 230 AIIDEG-DATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDV 285
                G   TP+     +   Y + LEAI V GR L I   +F       G+++DSGT V
Sbjct: 317 GTSSAGFVKTPMLRSSQVPAFYGVRLEAIRVGGRQLSIPTTVFS-----AGMIMDSGTVV 371

Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS--DLKG-----FPTVRFHF 338
           T L   AY +L            M+ Y         G + +  D+ G      PTV   F
Sbjct: 372 TRLPPTAYSSLSSAFK-----AGMKQYPPAPSSAGGGFLDTCFDMSGQSSVSMPTVALVF 426

Query: 339 R--GGAKLALEKDSMFYQPRPDA-FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGR 395
              GGA + L+   +  Q    + FC+A    S +    +  +IG + Q+ + V YD+  
Sbjct: 427 SGAGGAVVNLDASGILLQMETSSIFCLAFVATSDDG---STGIIGNVQQRTFQVLYDVAG 483

Query: 396 KQKTFQRMDC 405
               F+   C
Sbjct: 484 GAVGFKAGAC 493


>gi|88174556|gb|ABD39353.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
          Length = 321

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 85/329 (25%), Positives = 141/329 (42%), Gaps = 37/329 (11%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + +++ +G P   Q   +DTGSS  WV C  C  C     T F  SRS++ A V C +  
Sbjct: 1   YVISVGLGTPSKTQIVEIDTGSSTSWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58

Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
           C      P+ + S     C +   Y  G  +   +  + LTF +  +    +    FGC 
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPGFSFGCN 114

Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS---FSYCIGSLHDPDYLHNKLI--LGDG 229
              F  N      G+ G+G G  S++ Q + +   FSYC+          +K       G
Sbjct: 115 MDSFGANEFGNVDGLLGMGAGAMSVLKQSSPTFDCFSYCLPLQKSERGFFSKTTGYFSLG 174

Query: 230 AIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
            +    D    + +        +++ L AISVDG  L ++P+IF R     GV+ DSG++
Sbjct: 175 KVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSRK----GVVFDSGSE 230

Query: 285 VTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
           ++++   A   L     E+++R    +  S    ++ CY  + S D    P +  HF  G
Sbjct: 231 LSYIPDRALSVLSQRIRELLLRRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFDDG 285

Query: 342 AKLALEKDSMFYQ---PRPDAFCMAVNPA 367
           A+  L +  +F +      D +C+A  P 
Sbjct: 286 ARFDLGRGGVFVERSVQEQDVWCLAFAPT 314


>gi|88174571|gb|ABD39360.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
          Length = 321

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 85/329 (25%), Positives = 139/329 (42%), Gaps = 37/329 (11%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + +++ +G P   Q   +DTGSS  WV C  C  C     T F  SRS++ A V C +  
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58

Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
           C      P+ + S     C +   Y  G  +   +  + LTF +  +    +    FGC 
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPSFTFGCN 114

Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLN---SSFSYCIGSLHDPDYLHNKLI--LGDG 229
              F  N      G+ G+G G  S++ Q +     FSYC+          +K       G
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDGFSYCLPLQKSERGFFSKTTGYFSLG 174

Query: 230 AIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
            +    D    + +        +++ L AISVDG  L ++P+IF R     GV+ DSG++
Sbjct: 175 KVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK----GVVFDSGSE 230

Query: 285 VTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
           ++++   A   L     E+++R    +  S    ++ CY  + S D    P +  HF  G
Sbjct: 231 LSYIPDRALSVLSQRIRELLLRRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFDDG 285

Query: 342 AKLALEKDSMFYQ---PRPDAFCMAVNPA 367
           A+  L    +F +      D +C+A  P 
Sbjct: 286 ARFDLGSKGVFVERSVQEQDVWCLAFAPT 314


>gi|413944032|gb|AFW76681.1| hypothetical protein ZEAMMB73_606599 [Zea mays]
          Length = 315

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 89/330 (26%), Positives = 138/330 (41%), Gaps = 45/330 (13%)

Query: 108 SSSYAYVPCDSEHCRY---FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDE 164
           SS++  V C    CR       + C+    +C Y   Y     T+  +  +  TF + + 
Sbjct: 2   SSTFKAVACPDPICRPSSGVSVSACAMENFQCFYLCSYGDRSITAGHIFKDTFTFMSPNG 61

Query: 165 STIHVQDVVFGCG------FSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCI------ 211
             + V ++ FGCG      F +N     SGI G G G  SL SQL    FSYC+      
Sbjct: 62  VPVAVSELAFGCGDYNTGLFVSNE----SGIAGFGRGPQSLPSQLKVGRFSYCLTLVTES 117

Query: 212 -------GSLHDPDYLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRML 261
                  G+  DPD L                +TP+     I   YY++LE I+V    L
Sbjct: 118 KSSVVILGTPPDPDGLR-------AHTTGPFQSTPIIYNPLIPTFYYLSLEGITVGKTRL 170

Query: 262 DINPNIFK-RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQM-RSYSWPDKLC 319
             + ++F  + D  GG +IDSGT +T L +  +E L++E++ +    +   +    D+LC
Sbjct: 171 PFDKSVFALKKDGSGGTVIDSGTSLTTLPEAVFELLQEELVAQFPLPRYDNTPEVGDRLC 230

Query: 320 YHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFY-QPRPDAFCMAVNPASINNRYVNFSL 378
           +           P +  H   GA + L +D+ F  +P     C+ +N A          L
Sbjct: 231 FRRPKGGKQVPVPKLILHL-AGADMDLPRDNYFVEEPDSGVMCLQINGA----EDTTMVL 285

Query: 379 IGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
           IG   QQ  +V YD+   +  F    C+ L
Sbjct: 286 IGNFQQQNMHVVYDVENNKLLFAPAQCDKL 315


>gi|125532793|gb|EAY79358.1| hypothetical protein OsI_34487 [Oryza sativa Indica Group]
          Length = 419

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 102/419 (24%), Positives = 175/419 (41%), Gaps = 52/419 (12%)

Query: 19  AHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAM 78
           AH ++RG++    R   L +   +       ++P +  S + +  N +IG PP      +
Sbjct: 23  AHGLRRGLDRQGMRGRILADATAAPP--GGAVVPLH-WSGACYVANFTIGTPPQAVSGIV 79

Query: 79  DTGSSLLWVHCYPCRD--CSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRC 136
           D    L+W  C  CR   C  Q   +F PS S++Y    C S  C+  P   CS     C
Sbjct: 80  DLSGELVWTQCAACRSSGCFKQELPVFDPSASNTYRAEQCGSPLCKSIPTRNCSG-DGEC 138

Query: 137 IYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF-----SGIFG 191
            Y    + G +T    ST+ +   N +        + FGC  +++ +        SG  G
Sbjct: 139 GYEAPSMFG-DTFGIASTDAIAIGNAE------GRLAFGCVVASDGSIDGAMDGPSGFVG 191

Query: 192 LGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGA-IIDEGDATPLQFI------ 243
           LG    SLV Q N ++FSYC+ + H P    + L LG  A +   G + P   +      
Sbjct: 192 LGRTPWSLVGQSNVTAFSYCL-APHGPGK-KSALFLGASAKLAGAGKSNPPTPLLGQHAS 249

Query: 244 -------DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVM----IDSGTDVTWLVKEA 292
                  D +Y + LE I    +  D+         SGGG +    +++   +++L   A
Sbjct: 250 NTSDDGSDPYYTVQLEGI----KAGDV---AVAAASSGGGAITILQLETFRPLSYLPDAA 302

Query: 293 YEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMF 352
           Y+AL   V   L    M +   P  LC+    ++ + G P + F F+GGA L        
Sbjct: 303 YQALEKVVTAALGSPSMANPPEPFDLCFQ---NAAVSGVPDLVFTFQGGATLTAPPSKYL 359

Query: 353 YQP--RPDAFCMAV-NPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
                     C+++ +   +++     S++G + Q+  +  +D+ ++  +F+  DC  L
Sbjct: 360 LGDGNGNGTVCLSILSSTRLDSADDGVSILGSLLQENVHFLFDLEKETLSFEPADCSSL 418


>gi|388502484|gb|AFK39308.1| unknown [Medicago truncatula]
          Length = 425

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 93/347 (26%), Positives = 137/347 (39%), Gaps = 27/347 (7%)

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
           I +  + V   IG PP     AMDT +   W+ C  C  C+    T+F P +S+++  V 
Sbjct: 88  IQSPTYIVRAKIGTPPQTLLLAMDTSNDAAWIPCTACDGCA---STLFAPEKSTTFKNVS 144

Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
           C +  C+  P   C             L    +S+  +  Q T      +T  V    FG
Sbjct: 145 CAAPECKQVPNPGCGVSSR-----NFNLTYGSSSIAANLVQDTIT---LATDPVPSYTFG 196

Query: 176 C-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGA 230
           C     G S           G     S   +   S+FSYC+ S    ++    L LG  A
Sbjct: 197 CVSKTTGTSAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNF-SGSLRLGPVA 255

Query: 231 IIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVT 286
                  TPL         YY+ LEAI V  +++DI P        +G G + DSGT  T
Sbjct: 256 QPKRIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGAGTIFDSGTVFT 315

Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
            LV   Y A+RDE   R+  +   +       CY+  +       PT+ F F G      
Sbjct: 316 RLVAPVYVAVRDEFRRRVGPKLTVTSLGGFDTCYNVPIV-----VPTITFIFTGMNVTLP 370

Query: 347 EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
           + + + +       C+A+  A  N   V  ++I  M QQ + V YD+
Sbjct: 371 QDNILIHSTAGSTTCLAMAGAPDNVNSV-LNVIANMQQQNHRVLYDV 416


>gi|147801191|emb|CAN68822.1| hypothetical protein VITISV_007106 [Vitis vinifera]
          Length = 443

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 63/193 (32%), Positives = 89/193 (46%), Gaps = 4/193 (2%)

Query: 67  IGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPY 126
           +G P    +   DTGS L+W+ C PC  C  Q   IF P+ S +Y  V  DS  C     
Sbjct: 63  LGVPSTLVYGIADTGSELIWLQCLPCTHCYNQTPPIFDPAESYTYETVSSDSPICNAVRR 122

Query: 127 ARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF 186
             C      C Y   Y  G  T   +ST+   F++   + + V  + FGC   T    K 
Sbjct: 123 ISCREGDKSCCYQHTYGDGTTTKGTLSTDVFAFEDPTRTIVEVGYLTFGCSHDTKARLKG 182

Query: 187 --SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFI 243
             +G+ GL    +SLVSQL    FSYC+  + D     +++  G  A+I  G    L+  
Sbjct: 183 HQAGVVGLNRHPNSLVSQLKVKKFSYCM-VIPDDHGSGSRMYFGSRAVILGGKTPLLKGD 241

Query: 244 DGHYYITLEAISV 256
             HY++TL+ ISV
Sbjct: 242 YSHYFVTLKGISV 254



 Score = 47.0 bits (110), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 37/110 (33%), Positives = 50/110 (45%), Gaps = 3/110 (2%)

Query: 95  CSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIG-PETSVFVS 153
           C  Q   IF PS+SS+Y+ VP D+  C       C   +  C Y   Y  G   T   +S
Sbjct: 334 CFNQTPPIFDPSKSSTYSTVPWDAPTCYQAGGYACHIDEEDCCYRISYGSGSTSTEGTIS 393

Query: 154 TEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF--SGIFGLGIGRSSLVS 201
            +   F++  ++ + V  +VFGC   T   FK    GI GL     SLVS
Sbjct: 394 IDAFAFEDNRQNMVDVXHLVFGCSDYTTGTFKGYEVGIVGLNQDSLSLVS 443


>gi|195626958|gb|ACG35309.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 450

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 107/390 (27%), Positives = 163/390 (41%), Gaps = 33/390 (8%)

Query: 31  ARLAYLQE-KIRSHNTYQAQILPSNDISNSLFYV-NISIGQPPVPQFTAMDTGSSLLWVH 88
           +RL YL    +R      A I     +  +L YV   S+G PP     A+DT +   W+ 
Sbjct: 80  SRLLYLDSLAVRGRARAYAPIASGRQLLQTLTYVVRASLGTPPQQLLLAVDTSNDASWIP 139

Query: 89  CYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPET 148
           C  C  C       F P+ S+SY  VPC S  C   P A C      C ++  Y    ++
Sbjct: 140 CAGCAGCPTSSAAPFDPAASASYRTVPCGSPLCAQAPNAACPPGGKACGFSLTYA---DS 196

Query: 149 SVFVSTEQLTFKNTDESTIHVQDVVFGC-GFSTNRNFKFSGIFGLGIGRSSLVSQLN--- 204
           S+  +  Q +      +   V+   FGC   +T       G+ GLG G  S +SQ     
Sbjct: 197 SLQAALSQDSLAVAGNA---VKAYTFGCLQRATGTAAPPQGLLGLGRGPLSFLSQTKDMY 253

Query: 205 -SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH----YYITLEAISVDGR 259
            ++FSYC+ S    ++    L LG          TPL   + H    YY+ +  + V  +
Sbjct: 254 EATFSYCLPSFKSLNF-SGTLRLGRNGQPQRIKTTPL-LANPHRSSLYYVNMTGVRVGRK 311

Query: 260 MLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLC 319
           ++ I    F    +G G ++DSGT  T LV  AY A+RDEV  R+ G  + S    D  C
Sbjct: 312 VVPI--PAFD-PATGAGTVLDSGTMFTRLVAPAYVAVRDEVRRRV-GAPVSSLGGFDT-C 366

Query: 320 YHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFC--MAVNPASINNRYVNFS 377
           ++    +    +P +   F G      E++ + +       C  MA  P  +N      +
Sbjct: 367 FN----TTAVAWPPMTLLFDGMQVTLPEENVVIHSTYGTISCLAMAAAPDGVNTV---LN 419

Query: 378 LIGMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
           +I  M QQ + V +D+   +  F R  C  
Sbjct: 420 VIASMQQQNHRVLFDVPNGRVGFARERCTA 449


>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
 gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 100/379 (26%), Positives = 157/379 (41%), Gaps = 43/379 (11%)

Query: 43  HNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DCSPQLGT 101
             T  A I+P+       + V + +G P      + DTGS L W  C PC   C PQ   
Sbjct: 126 QTTIPASIVPTGGA----YVVTVGLGTPKKDFTLSFDTGSDLTWTQCEPCLGGCFPQNQP 181

Query: 102 IFYPSRSSSYAYVPCDSEHCRY-----FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQ 156
            F P+ S+SY  V C SE C+      +P   C    + C+Y   Y  G  T  F++TE 
Sbjct: 182 KFDPTTSTSYKNVSCSSEFCKLIAEGNYPAQDC--ISNTCLYGIQYGSG-YTIGFLATET 238

Query: 157 LTFKNTDESTIHVQDVVFGCGFSTNRNFK-FSGIFGLGIGRSSLVSQLNSS----FSYCI 211
           L   ++D      ++ +FGC   +   F   +G+ GLG    +L SQ  +     FSYC+
Sbjct: 239 LAIASSD----VFKNFLFGCSEESRGTFNGTTGLLGLGRSPIALPSQTTNKYKNLFSYCL 294

Query: 212 -GSLHDPDYLHNKLILGDGAIIDEGDATPLQ-FIDGHYYITLEAISVDGRMLDINPNIFK 269
             S     +L   + +   A      +TP+   +   Y +    ISV GR L IN +I +
Sbjct: 295 PASPSSTGHLSFGVEVSQAA-----KSTPISPKLKQLYGLNTVGISVRGRELPINGSISR 349

Query: 270 RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCY--HGIMSSD 327
                   +IDSGT  T+L    Y AL       +    + + +   + CY    I +  
Sbjct: 350 -------TIIDSGTTFTFLPSPTYSALGSAFREMMANYTLTNGTSSFQPCYDFSNIGNGT 402

Query: 328 LKGFPTVRFHFRGGAKLALEKDSMFYQPRP-DAFCMAVNPASINNRYVNFSLIGMMAQQF 386
           L   P +   F GG ++ ++   +          C+A      ++   +F++ G   Q+ 
Sbjct: 403 LT-IPGISIFFEGGVEVEIDVSGIMIPVNGLKEVCLAFADTGSDS---DFAIFGNYQQKT 458

Query: 387 YNVGYDIGRKQKTFQRMDC 405
           Y V YD+ +    F    C
Sbjct: 459 YEVIYDVAKGMVGFAPKGC 477


>gi|88174589|gb|ABD39369.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 85/329 (25%), Positives = 140/329 (42%), Gaps = 37/329 (11%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + +++ +G P   Q   +DTGSS  WV C  C  C     T F  SRS++ A V C +  
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSASWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58

Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
           C      P+ + S     C +   Y  G  +   +  + LTF +  +    +    FGC 
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPSFTFGCN 114

Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS---FSYCIGSLHDPDYLHNKLI--LGDG 229
              F  N      G+ G+G G  S++ Q + +   FSYC+          +K       G
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSERGFFSKTTGYFSLG 174

Query: 230 AIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
            +    D    + +        +++ L AISVDG  L ++P+IF R     GV+ DSG++
Sbjct: 175 KVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK----GVVFDSGSE 230

Query: 285 VTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
           ++++   A   L     E+++R    +  S    ++ CY  + S D    P +  HF  G
Sbjct: 231 LSYIPDRALSVLSQRIRELLLRRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFDDG 285

Query: 342 AKLALEKDSMFYQ---PRPDAFCMAVNPA 367
           A+  L    +F +      D +C+A  P 
Sbjct: 286 ARFDLGSHGVFVERSVQEQDVWCLAFAPT 314


>gi|88174558|gb|ABD39354.1| chloroplast nucleoid DNA-binding protein [Oryza meridionalis]
          Length = 321

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 84/329 (25%), Positives = 141/329 (42%), Gaps = 37/329 (11%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + +++ +G P   Q   +DTGSS  WV C  C  C     T F  SRS++ A V C +  
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58

Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
           C      P+ + S     C +   Y  G  +   +  + LTF +  +    +    FGC 
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPGFTFGCN 114

Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS---FSYCIGSLHDPDYLHNKLI--LGDG 229
              F  N      G+ G+G G  S++ Q + +   FSYC+          +K       G
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQMSERGFFSKTTGYFSLG 174

Query: 230 AIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
            +    D    + +        +++ L AISVDG  L ++P++F R     GV+ DSG++
Sbjct: 175 KVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRK----GVVFDSGSE 230

Query: 285 VTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
           ++++   A   LR    E++++    +  S    ++ CY  + S D    P +  HF  G
Sbjct: 231 LSYIPDRALSVLRQRIRELLLKRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFDDG 285

Query: 342 AKLALEKDSMFYQ---PRPDAFCMAVNPA 367
           A+  L    +F +      D +C+A  P 
Sbjct: 286 ARFDLGSHGVFVERSVQEQDVWCLAFAPT 314


>gi|88174575|gb|ABD39362.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
           Group]
          Length = 321

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 85/329 (25%), Positives = 140/329 (42%), Gaps = 37/329 (11%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           +  ++ +G P   Q   +DTGSS  WV C  C  C     T F  SRS++ A V C +  
Sbjct: 1   YVTSVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58

Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
           C      P+ + S     C +   Y  G  +   +  + LTF +  +    +    FGC 
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPSFTFGCN 114

Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLNSSF---SYCIGSLHDPDYLHNKLI--LGDG 229
              F  N      G+ G+G G  S++ Q + +F   SYC+          +K       G
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSERGFFSKTTGYFSLG 174

Query: 230 AIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
            +    D    + +        +++ L AISVDG  L ++P+IF R     GV+ DSG++
Sbjct: 175 KVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK----GVVFDSGSE 230

Query: 285 VTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
           ++++   A   L     E+++R    +  S    ++ CY  + S D    P +  HF  G
Sbjct: 231 LSYIPDRALSVLSQRIRELLLRRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFDDG 285

Query: 342 AKLALEKDSMFYQ---PRPDAFCMAVNPA 367
           A+  L +  +F +      D +C+A  P 
Sbjct: 286 ARFDLGRHGVFVERSVQEQDVWCLAFAPT 314


>gi|88174597|gb|ABD39373.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
 gi|88174601|gb|ABD39375.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
 gi|88174603|gb|ABD39376.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 85/329 (25%), Positives = 140/329 (42%), Gaps = 37/329 (11%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + +++ +G P   Q   +DTGSS  WV C  C  C     T F  SRS++ A V C +  
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58

Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
           C      P+ + S     C +   Y  G  +   +  + LTF +  +    +    FGC 
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPSFTFGCN 114

Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS---FSYCIGSLHDPDYLHNKLI--LGDG 229
              F  N      G+ G+G G  S++ Q + +   FSYC+          +K       G
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSERGFFSKTTGYFSLG 174

Query: 230 AIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
            +    D    + +        +++ L AISVDG  L ++P+IF R     GV+ DSG++
Sbjct: 175 KVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK----GVVFDSGSE 230

Query: 285 VTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
           ++++   A   L     E+++R    +  S    ++ CY  + S D    P +  HF  G
Sbjct: 231 LSYIPDRALSVLSQRIRELLLRRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFDDG 285

Query: 342 AKLALEKDSMFYQ---PRPDAFCMAVNPA 367
           A+  L    +F +      D +C+A  P 
Sbjct: 286 ARFDLGSHGVFVERSVQEQDVWCLAFAPT 314


>gi|88174587|gb|ABD39368.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
           Group]
          Length = 321

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 85/329 (25%), Positives = 140/329 (42%), Gaps = 37/329 (11%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + +++ +G P   Q   +DTGSS  WV C  C  C     T F  SRS++ A V C +  
Sbjct: 1   YVISVGLGTPSKTQIVEIDTGSSASWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58

Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
           C      P+ + S     C +   Y  G  +   +  + LTF +  +    +    FGC 
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPSFTFGCN 114

Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS---FSYCIGSLHDPDYLHNKLI--LGDG 229
              F  N      G+ G+G G  S++ Q + +   FSYC+          +K       G
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSERGFFSKTTGYFSLG 174

Query: 230 AIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
            +    D    + +        +++ L AISVDG  L ++P+IF R     GV+ DSG++
Sbjct: 175 KVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK----GVVFDSGSE 230

Query: 285 VTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
           ++++   A   L     E+++R    +  S    ++ CY  + S D    P +  HF  G
Sbjct: 231 LSYIPDRALSVLSQRIRELLLRRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFDDG 285

Query: 342 AKLALEKDSMFYQ---PRPDAFCMAVNPA 367
           A+  L    +F +      D +C+A  P 
Sbjct: 286 ARFDLGSHGVFVERSVQEQDVWCLAFAPT 314


>gi|88174605|gb|ABD39377.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 85/329 (25%), Positives = 139/329 (42%), Gaps = 37/329 (11%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + +++ +G P   Q   +DTGSS  WV C  C  C     T F  SRS++ A V C +  
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58

Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
           C      P+ + S     C +   Y  G  +   +  + LTF +  +    +    FGC 
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPSFTFGCN 114

Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLN---SSFSYCIGSLHDPDYLHNKLI--LGDG 229
              F  N      G+ G+G G  S++ Q +     FSYC+          +K       G
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDGFSYCLPLQKSERGFFSKTTGYFSLG 174

Query: 230 AIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
            +    D    + +        +++ L AISVDG  L ++P+IF R     GV+ DSG++
Sbjct: 175 KVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK----GVVFDSGSE 230

Query: 285 VTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
           ++++   A   L     E+++R    +  S    ++ CY  + S D    P +  HF  G
Sbjct: 231 LSYIPDRALSVLSQRIRELLLRRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFDDG 285

Query: 342 AKLALEKDSMFYQ---PRPDAFCMAVNPA 367
           A+  L    +F +      D +C+A  P 
Sbjct: 286 ARFDLGSHGVFVERSVQEQDVWCLAFAPT 314


>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 491

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 99/356 (27%), Positives = 144/356 (40%), Gaps = 40/356 (11%)

Query: 72  VPQFTAMDTGSSLLWVHCYPCRD--CSPQLGTIFYPSRSSSYAYVPCDSEHCRYF-PYAR 128
           V Q   +DT S + WV C PC    C  Q   ++ PS+SSS A  PC S  CR   PYA 
Sbjct: 154 VAQTMVIDTASDVPWVQCAPCPAPHCHAQTDVLYDPSKSSSSAAFPCSSPACRNLGPYAN 213

Query: 129 -CSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKF- 186
            C+    +C Y   Y  G  ++    ++ LT  N  +    + +  FGC  +  +   F 
Sbjct: 214 GCTPAGDQCQYRVQYPDGSASAGTYISDVLTL-NPAKPASAISEFRFGCSHALLQPGSFS 272

Query: 187 ---SGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNK-LILGDGAIIDEGDA- 237
              SGI  LG G  SL +Q  ++    FSYC+     P  +H+   ILG   +     A 
Sbjct: 273 NKTSGIMALGRGAQSLPTQTKATYGDVFSYCL----PPTPVHSGFFILGVPRVAASRYAV 328

Query: 238 TPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYE 294
           TP+   +     Y + L AI V G+ L + P +F       G ++DS T VT L   AY 
Sbjct: 329 TPMLRSKAAPMLYLVRLIAIEVAGKRLPVPPAVF-----AAGAVMDSRTIVTRLPPTAYM 383

Query: 295 ALRDEVMIRLEGEQMRSYSWPDKLCY----HGIMSSDLKGFPTVRFHFRG-GAKLALEKD 349
           ALR   +  +   +  +       CY               P +   F G    + L+  
Sbjct: 384 ALRAAFVAEMRAYRAAAPKEHLDTCYDFSGAAPGGGGGVKLPKITLVFDGPNGAVELDPS 443

Query: 350 SMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
            +         C+A  P   N       +IG + QQ   V Y++      F+R  C
Sbjct: 444 GVLLD-----GCLAFAP---NTDDQMTGIIGNVQQQALEVLYNVDGATVGFRRGAC 491


>gi|88174583|gb|ABD39366.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
           Group]
          Length = 321

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 85/329 (25%), Positives = 139/329 (42%), Gaps = 37/329 (11%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + +++ +G P   Q   +DTGSS  WV C  C  C     T F  SRS++ A V C +  
Sbjct: 1   YVISVGLGTPSKTQIVEIDTGSSTSWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58

Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
           C      P+ + S     C +   Y  G  +   +  + LTF +  +    +    FGC 
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPSFTFGCN 114

Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLN---SSFSYCIGSLHDPDYLHNKLI--LGDG 229
              F  N      G+ G+G G  S++ Q +     FSYC+          +K       G
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDGFSYCLPLQKSERGFFSKTTGYFSLG 174

Query: 230 AIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
            +    D    + +        +++ L AISVDG  L ++P+IF R     GV+ DSG++
Sbjct: 175 KVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK----GVVFDSGSE 230

Query: 285 VTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
           ++++   A   L     E+++R    +  S    ++ CY  + S D    P +  HF  G
Sbjct: 231 LSYIPDRALSVLSQRIRELLLRRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFDDG 285

Query: 342 AKLALEKDSMFYQ---PRPDAFCMAVNPA 367
           A+  L    +F +      D +C+A  P 
Sbjct: 286 ARFDLGSHGVFVERSVQEQDVWCLAFAPT 314


>gi|21717157|gb|AAM76350.1|AC074196_8 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433294|gb|AAP54832.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 396

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 91/371 (24%), Positives = 153/371 (41%), Gaps = 41/371 (11%)

Query: 57  SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYP-CRDCSPQLGTIFYPSRSSSYAYVP 115
           S + + VN++IG PP P    +D G  L+W  C   CR C  Q   +F  + SS++   P
Sbjct: 47  SQAFYVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHCRRCFKQDLPLFDTNASSTFRPEP 106

Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
           C +  C   P   C+         +       T   + T+ +        T     + FG
Sbjct: 107 CGAAVCESIPTRSCAGDGGGACGYEASTSFGRTVGRIGTDAVAI-----GTAATARLAFG 161

Query: 176 CGFSTNRNFKFSGIFGLGIGRS--SLVSQLN-SSFSYCIGSLHDPDY-LHNKLILGDGAI 231
           C  ++  +  +     +G+GR+  SL +Q+N ++FSYC   L  PD    + L LG  A 
Sbjct: 162 CAVASEMDTMWGSSGSVGLGRTNLSLAAQMNATAFSYC---LAPPDTGKSSALFLGASAK 218

Query: 232 I---DEGDAT---------PLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
           +    +G  T         P   +   Y + LEAI      + +         SG  + +
Sbjct: 219 LAGAGKGAGTTPFVKTSTPPNSGLSRSYLLRLEAIRAGNATIAM-------PQSGNTITV 271

Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFR 339
            + T VT LV   Y  LR  V   +    +        LC+    +S   G P +   F+
Sbjct: 272 STATPVTALVDSVYRDLRKAVADAVGAAPVPPPVQNYDLCFPKASASG--GAPDLVLAFQ 329

Query: 340 GGAKLALEKDSMFYQPRPDAFCMAV--NPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
           GGA++ +   S  +    D  C+A+  +PA         S++G + Q   ++ +D+ ++ 
Sbjct: 330 GGAEMTVPVSSYLFDAGNDTACVAILGSPA-----LGGVSILGSLQQVNIHLLFDLDKET 384

Query: 398 KTFQRMDCEVL 408
            +F+  DC  L
Sbjct: 385 LSFEPADCSAL 395


>gi|115465777|ref|NP_001056488.1| Os05g0591300 [Oryza sativa Japonica Group]
 gi|113580039|dbj|BAF18402.1| Os05g0591300 [Oryza sativa Japonica Group]
          Length = 453

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 104/411 (25%), Positives = 164/411 (39%), Gaps = 76/411 (18%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DCSPQ---LGTIFYPSRSSSYAYVPC 116
           F + + +G P V     MDTGSSL WV C PC   C  Q   +G IF PS SS++ +V C
Sbjct: 53  FLIPVKLGTPAVQYLVTMDTGSSLSWVQCRPCTIKCHVQPAKVGPIFDPSNSSTFRHVGC 112

Query: 117 DSEHCRYFPYA------RCSAYKHRCIYTQLYLIGPETSVFVS-TEQLTF--KNTDESTI 167
            +  C Y           C  ++  C+YT  Y  G   SV  + T++L      T  +T+
Sbjct: 113 STSICSYLGRTLRIQSKACMEWEDICLYTMSYGGGWAYSVGKAVTDRLVLGGGETTRTTL 172

Query: 168 HVQDVVFGCGFSTN-RNFKFSGIFGLGIGRSSL--VSQLNS--SFSYCIGSLHDPDYLHN 222
            + + VFGC   T     K +GIFGLG    S   ++ L S  +FSYC+ S    D  H 
Sbjct: 173 SLANFVFGCSMDTQYSTHKEAGIFGLGTSNYSFEQIAPLLSYKAFSYCLPS----DEAHQ 228

Query: 223 K-LILGDGAIIDEGDATPLQFIDGH----YYITLEA--ISVDGRMLDINPNIFKRDDSGG 275
             L +G     D     P     G     Y I +    ++V+G +  +            
Sbjct: 229 GYLSIGP----DSSGGVPTSMFPGTPRPVYSIGMTGLTVTVNGEVRSLVSGSGSSPSPSS 284

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLE--GEQMRSYSWPDKLCYHGIMSSDLKGF-- 331
            +++DSG  +T L+   +  L D ++  +E  G  + + +  ++LC+  +  SD + +  
Sbjct: 285 LMVVDSGAKLTLLLASTFGQLEDAIIPAMESLGYSLNTAAGQNQLCF--LTESDRQNYLQ 342

Query: 332 ------------PTVRFHFRGGAKLALEKDSMFY-------------------------Q 354
                       P     F  G  L L   + FY                         Q
Sbjct: 343 RKPPPPSNWSALPVFHISFTLGLTLTLPPKNAFYLDARYRQEFLTIAWKQIYVRSIFPFQ 402

Query: 355 PRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
              D   +  + A  +     + ++G +  +   + YDI RKQ  F+  +C
Sbjct: 403 YFSDILGLCSSFARDDYLESGYQILGNLVTKSCGITYDIPRKQFRFRTGEC 453


>gi|88174554|gb|ABD39352.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
          Length = 321

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 85/329 (25%), Positives = 140/329 (42%), Gaps = 37/329 (11%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + +++ +G P   Q   +DTGSS  WV C  C  C     T F  SRS++ A V C +  
Sbjct: 1   YVISVGLGTPSKTQIVEIDTGSSTSWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58

Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
           C      P+ + S     C +   Y  G  +   +  + LTF +  +    +    FGC 
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPGFSFGCN 114

Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS---FSYCIGSLHDPDYLHNKLI--LGDG 229
              F  N      G+ G+G G  S++ Q + +   FSYC+          +K       G
Sbjct: 115 MDSFGANEFGNVDGLLGMGAGAMSVLKQSSPTFDCFSYCLPLQKSERGFFSKTTGYFSLG 174

Query: 230 AIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
            +    D    + +        +++ L AISVDG  L ++P+IF R     GV+ DSG++
Sbjct: 175 KVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSRK----GVVFDSGSE 230

Query: 285 VTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
           ++++   A   L     E+++R    +  S    ++ CY  + S D    P +  HF  G
Sbjct: 231 LSYIPDRALSVLSQRIRELLLRRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFDDG 285

Query: 342 AKLALEKDSMFYQ---PRPDAFCMAVNPA 367
           A+  L    +F +      D +C+A  P 
Sbjct: 286 ARFDLGSHGVFVERSVQEQDVWCLAFAPT 314


>gi|212275143|ref|NP_001130306.1| uncharacterized protein LOC100191400 precursor [Zea mays]
 gi|194688798|gb|ACF78483.1| unknown [Zea mays]
 gi|194703430|gb|ACF85799.1| unknown [Zea mays]
 gi|194707192|gb|ACF87680.1| unknown [Zea mays]
 gi|223944599|gb|ACN26383.1| unknown [Zea mays]
 gi|223948667|gb|ACN28417.1| unknown [Zea mays]
 gi|414887962|tpg|DAA63976.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 450

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 108/390 (27%), Positives = 162/390 (41%), Gaps = 33/390 (8%)

Query: 31  ARLAYLQE-KIRSHNTYQAQILPSNDISNSLFYV-NISIGQPPVPQFTAMDTGSSLLWVH 88
           +RL YL    +R      A I     +  +  YV   S+G PP     A+DT +   W+ 
Sbjct: 80  SRLLYLDSLAVRGRARAYAPIASGRQLLQTPTYVVRASLGTPPQQLLLAVDTSNDASWIP 139

Query: 89  CYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPET 148
           C  C  C       F P+ S+SY  VPC S  C   P A C      C ++  Y    ++
Sbjct: 140 CAGCAGCPTSSAAPFDPASSASYRTVPCGSPLCAQAPNAACPPGGKACGFSLTYA---DS 196

Query: 149 SVFVSTEQLTFKNTDESTIHVQDVVFGC-GFSTNRNFKFSGIFGLGIGRSSLVSQLN--- 204
           S+  +  Q +      +   V+   FGC   +T       G+ GLG G  S +SQ     
Sbjct: 197 SLQAALSQDSLAVAGNA---VKAYTFGCLQRATGTAAPPQGLLGLGRGPLSFLSQTKDMY 253

Query: 205 -SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH----YYITLEAISVDGR 259
            ++FSYC+ S    ++    L LG          TPL   + H    YY+ +  I V  +
Sbjct: 254 EATFSYCLPSFKSLNF-SGTLRLGRNGQPQRIKTTPL-LANPHRSSLYYVNMTGIRVGRK 311

Query: 260 MLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLC 319
           ++ I    F    +G G ++DSGT  T LV  AY A+RDEV  R+ G  + S    D  C
Sbjct: 312 VVPI--PAFD-PATGAGTVLDSGTMFTRLVAPAYVAVRDEVRRRV-GAPVSSLGGFDT-C 366

Query: 320 YHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFC--MAVNPASINNRYVNFS 377
           ++    +    +P V   F G      E++ + +       C  MA  P  +N      +
Sbjct: 367 FN----TTAVAWPPVTLLFDGMQVTLPEENVVIHSTYGTISCLAMAAAPDGVNTV---LN 419

Query: 378 LIGMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
           +I  M QQ + V +D+   +  F R  C  
Sbjct: 420 VIASMQQQNHRVLFDVPNGRVGFARERCTA 449


>gi|356528623|ref|XP_003532899.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 507

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 98/419 (23%), Positives = 158/419 (37%), Gaps = 97/419 (23%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCY------------------------------ 90
           ++  + +G P    + A DTGS   W +C                               
Sbjct: 111 YFTEVKVGSPGQRFWLAADTGSEFTWFNCVMRNATTTATTKKTRKNKTKKKHHHHSKRNR 170

Query: 91  ---------------PCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR-----YFPYARCS 130
                          PC+        +F P RS S+  V C S+ C+      F  + C 
Sbjct: 171 TRTTRRTKKKKAKSNPCKG-------VFCPHRSKSFQAVTCASQKCKIDLSQLFSLSLCP 223

Query: 131 AYKHRCIYTQLYLIGPETSVFVSTEQLT--FKNTDESTIHVQDVVFGCGFSTNRNFKFS- 187
                C+Y   Y  G     F  T+ +T   KN  E  ++  ++  GC  S      F+ 
Sbjct: 224 KPSDPCLYDISYADGSSAKGFFGTDTITVDLKNGKEGKLN--NLTIGCTKSMENGVNFNE 281

Query: 188 ---GIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILG---DGAIIDEGDA 237
              GI GLG  + S + +      + FSYC+        + + L +G   +  ++ E   
Sbjct: 282 DTGGILGLGFAKDSFIDKAAYEYGAKFSYCLVDHLSHRNVSSYLTIGGHHNAKLLGEIKR 341

Query: 238 TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALR 297
           T L      Y + +  IS+ G+ML I P ++  + S GG +IDSGT +T L+  AYE + 
Sbjct: 342 TELILFPPFYGVNVVGISIGGQMLKIPPQVWDFN-SQGGTLIDSGTTLTALLVPAYEPVF 400

Query: 298 DEVM------IRLEGEQMRSYSWPDKLCYHGIMSSDLKGF-----PTVRFHFRGGAKLAL 346
           + ++       R+ GE   +  +    C+      D +GF     P + FHF GGA+   
Sbjct: 401 EALIKSLTKVKRVTGEDFGALDF----CF------DAEGFDDSVVPRLVFHFAGGARFEP 450

Query: 347 EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
              S      P   C+ + P    +     S+IG + QQ +   +D+      F    C
Sbjct: 451 PVKSYIIDVAPLVKCIGIVPI---DGIGGASVIGNIMQQNHLWEFDLSTNTIGFAPSIC 506


>gi|88174573|gb|ABD39361.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
           Group]
          Length = 321

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 85/329 (25%), Positives = 139/329 (42%), Gaps = 37/329 (11%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           +  ++ +G P   Q   +DTGSS  WV C  C  C     T F  SRS++ A V C +  
Sbjct: 1   YVTSVGLGTPSKTQIVEIDTGSSTSWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58

Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
           C      P+ + S     C +   Y  G  +   +  + LTF +  +    +    FGC 
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPSFTFGCN 114

Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLNSSF---SYCIGSLHDPDYLHNKLI--LGDG 229
              F  N      G+ G+G G  S++ Q + +F   SYC+          +K       G
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSERGFFSKTTGYFSLG 174

Query: 230 AIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
            +    D    + +        +++ L AISVDG  L ++P+IF R     GV+ DSG++
Sbjct: 175 KVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK----GVVFDSGSE 230

Query: 285 VTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
           ++++   A   L     E+++R    +  S    ++ CY  + S D    P +  HF  G
Sbjct: 231 LSYIPDRALSVLSQRIRELLLRRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFDDG 285

Query: 342 AKLALEKDSMFYQ---PRPDAFCMAVNPA 367
           A+  L    +F +      D +C+A  P 
Sbjct: 286 ARFDLGSRGVFVERSVQEQDVWCLAFAPT 314


>gi|115457772|ref|NP_001052486.1| Os04g0334700 [Oryza sativa Japonica Group]
 gi|113564057|dbj|BAF14400.1| Os04g0334700 [Oryza sativa Japonica Group]
          Length = 482

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 96/369 (26%), Positives = 154/369 (41%), Gaps = 37/369 (10%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTI----FYPSRSS-SYAYV 114
           L+Y +I IG P V  +  +DTGS   WV+   C+ C  +   +    FY  RSS S   V
Sbjct: 82  LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEV 141

Query: 115 PCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFK---NTDESTIHVQD 171
            CD   C   P    +    RC Y   Y  G  T   + T+ L +       ++      
Sbjct: 142 KCDDTICTSRPPCNMTL---RCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTS 198

Query: 172 VVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYL 220
           V FGCG        N      GI G G    + +SQL ++      FS+C+      D  
Sbjct: 199 VTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCL------DST 252

Query: 221 HNKLILGDGAIID-EGDATPLQFIDGHYY-ITLEAISVDGRMLDINPNIFKRDDSGGGVM 278
           +   I   G +++ +   TP+   +  Y+ + L++I+V G  L +  NIF    +  G  
Sbjct: 253 NGGGIFAIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKT-KGTF 311

Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS-YSWPDKLCYHGIMSSDLKGFPTVRFH 337
           IDSG+ + +L +  Y  L   V  +     M + Y++    C+H + S D K FP + FH
Sbjct: 312 IDSGSTLVYLPEIIYSELILAVFAKHPDITMGAMYNFQ---CFHFLGSVDDK-FPKITFH 367

Query: 338 FRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQ 397
           F     L +       +   + +C     A I+  Y +  ++G M      V YD+ ++ 
Sbjct: 368 FENDLTLDVYPYDYLLEYEGNQYCFGFQDAGIHG-YKDMIILGDMVISNKVVVYDMEKQA 426

Query: 398 KTFQRMDCE 406
             +   +C 
Sbjct: 427 IGWTEHNCS 435


>gi|125543639|gb|EAY89778.1| hypothetical protein OsI_11320 [Oryza sativa Indica Group]
          Length = 488

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 88/314 (28%), Positives = 138/314 (43%), Gaps = 36/314 (11%)

Query: 116 CDSEHCRYFPYARCSAYK----HRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
           CDS  C+    A C   K      C+YT  Y     T+  +  ++ TF     +   V  
Sbjct: 190 CDSTLCQGLLVASCGNTKFWPNQTCVYTYYYNDKSVTTGLLEVDKFTFG----AGASVPG 245

Query: 172 VVFGCGFSTNRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDP-------DYLH 221
           V FGCG   N  FK   +GI G G G  SL SQL   +FS+C  +++         D L 
Sbjct: 246 VAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVNGLKQSTVLLDLLA 305

Query: 222 NKLILGDGAIIDEGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSGGGVM 278
           +    G GA+     +TPL     +   YY++L+ I+V    L +  + F   +  GG +
Sbjct: 306 DLYKNGRGAV----QSTPLIQNSANPTLYYLSLKGITVGSTRLPVPESAFALTNGTGGTI 361

Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHF 338
           IDSGT +T L  + Y+ +RDE   +++   +   +     C+    S      P +  HF
Sbjct: 362 IDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGPYTCFSA-PSQAKPDVPKLVLHF 420

Query: 339 RGGAKLALEKDSMFYQPRPDA----FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
             GA + L +++  ++   DA     C+A+N    + R    + IG   QQ  +V YD+ 
Sbjct: 421 E-GATMDLPRENYVFEVPDDAGNSMICLAINELG-DER----ATIGNFQQQNMHVLYDLQ 474

Query: 395 RKQKTFQRMDCEVL 408
               +F    C+ L
Sbjct: 475 NNMLSFVAAQCDKL 488



 Score = 47.4 bits (111), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 34/140 (24%), Positives = 61/140 (43%), Gaps = 6/140 (4%)

Query: 253 AISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSY 312
            I+V    L +  + F   +  GG +IDSGT +T L  + Y+ +RDE   +++   +   
Sbjct: 41  GITVGSTRLPVPESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGN 100

Query: 313 SWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDA----FCMAVNPAS 368
           +     C+    S      P +  HF  GA + L +++  ++   DA     C+A+N   
Sbjct: 101 ATGPYTCFSA-PSQAKPDVPKLVLHFE-GATMDLPRENYVFEVPDDAGNSIICLAINKGD 158

Query: 369 INNRYVNFSLIGMMAQQFYN 388
                 NF    M A  +++
Sbjct: 159 ETTIIGNFQQQNMHALPYFD 178


>gi|79315693|ref|NP_001030891.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332646353|gb|AEE79874.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 499

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 98/371 (26%), Positives = 161/371 (43%), Gaps = 70/371 (18%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           +++++ +G PP      +DTGS L W+ C PC DC  Q                  D++ 
Sbjct: 170 YFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQ-----------------NDNQS 212

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           C Y+ +           Y        + +V   T  LT         +V++++FGCG   
Sbjct: 213 CPYYYW-----------YGDSSNTTGDFAVETFTVNLTTNGGSSELYNVENMMFGCG-HW 260

Query: 181 NRNFKFSGIF-------GLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILG-D 228
           NR     G+F       GLG G  S  SQL S    SFSYC+   +    + +KLI G D
Sbjct: 261 NR-----GLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLIFGED 315

Query: 229 GAIIDEGDATPLQFIDGH-------YYITLEAISVDGRMLDINPNIFK-RDDSGGGVMID 280
             ++   +     F+ G        YY+ +++I V G +L+I    +    D  GG +ID
Sbjct: 316 KDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETWNISSDGAGGTIID 375

Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQ--MRSYSWPDKLCYH--GIMSSDLKGFPTVRF 336
           SGT +++  + AYE +++++  + +G+    R +   D  C++  GI +  L   P +  
Sbjct: 376 SGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDP-CFNVSGIHNVQL---PELGI 431

Query: 337 HFRGGAKLALEKDSMFYQPRPDAFCMAV--NPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
            F  GA      ++ F     D  C+A+   P S       FS+IG   QQ +++ YD  
Sbjct: 432 AFADGAVWNFPTENSFIWLNEDLVCLAMLGTPKSA------FSIIGNYQQQNFHILYDTK 485

Query: 395 RKQKTFQRMDC 405
           R +  +    C
Sbjct: 486 RSRLGYAPTKC 496


>gi|52076082|dbj|BAD46595.1| aspartic proteinase nepenthesin II -like [Oryza sativa Japonica
           Group]
          Length = 476

 Score = 92.0 bits (227), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 103/381 (27%), Positives = 160/381 (41%), Gaps = 61/381 (16%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC----SPQLGT----IFYPSRSSSY 111
           L Y  +++G P V    A+DTGS L WV C  C  C    SP  G+    ++ P++S++ 
Sbjct: 61  LHYAVVALGTPNVTFLVALDTGSDLFWVPC-DCLKCAPFQSPNYGSLKFDVYSPAQSTTS 119

Query: 112 AYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD--ESTIHV 169
             VPC S  C       C +  + C Y+  YL    +S  V  E + +  +D  +S I  
Sbjct: 120 RKVPCSSNLCDL--QNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKIVT 177

Query: 170 QDVVFGCGFSTNRNFKFS----GIFGLGIGRSSLVSQLNS------SFSYCIGSLHDPDY 219
             ++FGCG     +F  S    G+ GLG+   S+ S L S      SFS C G     D 
Sbjct: 178 APIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFG-----DD 232

Query: 220 LHNKLILGDGAIIDEGDATPLQFI--DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
            H ++  GD    D+ + TPL     + +Y IT+  I+V  + +                
Sbjct: 233 GHGRINFGDTGSSDQKE-TPLNVYKQNPYYNITITGITVGSKSISTE----------FSA 281

Query: 278 MIDSGTDVTWLVKEAYEALRD--EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
           ++DSGT  T L    Y  +    +  IR     M   S P + CY   +S++    P V 
Sbjct: 282 IVDSGTSFTALSDPMYTQITSSFDAQIR-SSRNMLDSSMPFEFCYS--VSANGIVHPNVS 338

Query: 336 FHFRGGAKLALE------KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNV 389
              +GG+   +        D+ F    P  +C+A+    + +  VN  LIG        V
Sbjct: 339 LTAKGGSIFPVNDPIITITDNAF---NPVGYCLAI----MKSEGVN--LIGENFMSGLKV 389

Query: 390 GYDIGRKQKTFQRMDCEVLDD 410
            +D  R    ++  +C   D+
Sbjct: 390 VFDRERMVLGWKNFNCYNFDE 410


>gi|218202547|gb|EEC84974.1| hypothetical protein OsI_32231 [Oryza sativa Indica Group]
          Length = 513

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 103/381 (27%), Positives = 160/381 (41%), Gaps = 61/381 (16%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC----SPQLGT----IFYPSRSSSY 111
           L Y  +++G P V    A+DTGS L WV C  C  C    SP  G+    ++ P++S++ 
Sbjct: 98  LHYAVVALGTPNVTFLVALDTGSDLFWVPC-DCLKCAPLQSPNYGSLKFDVYSPAQSTTS 156

Query: 112 AYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD--ESTIHV 169
             VPC S  C       C +  + C Y+  YL    +S  V  E + +  +D  +S I  
Sbjct: 157 RKVPCSSNLCDL--QNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKIVT 214

Query: 170 QDVVFGCGFSTNRNFKFS----GIFGLGIGRSSLVSQLNS------SFSYCIGSLHDPDY 219
             ++FGCG     +F  S    G+ GLG+   S+ S L S      SFS C G     D 
Sbjct: 215 APIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFG-----DD 269

Query: 220 LHNKLILGDGAIIDEGDATPLQFI--DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
            H ++  GD    D+ + TPL     + +Y IT+  I+V  + +                
Sbjct: 270 GHGRINFGDTGSSDQKE-TPLNVYKQNPYYNITITGITVGSKSISTE----------FSA 318

Query: 278 MIDSGTDVTWLVKEAYEALRD--EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
           ++DSGT  T L    Y  +    +  IR     M   S P + CY   +S++    P V 
Sbjct: 319 IVDSGTSFTALSDPMYTQITSSFDAQIR-SSRNMLDSSMPFEFCYS--VSANGIVHPNVS 375

Query: 336 FHFRGGAKLALE------KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNV 389
              +GG+   +        D+ F    P  +C+A+    + +  VN  LIG        V
Sbjct: 376 LTAKGGSIFPVNDPIITITDNAF---NPVGYCLAI----MKSEGVN--LIGENFMSGLKV 426

Query: 390 GYDIGRKQKTFQRMDCEVLDD 410
            +D  R    ++  +C   D+
Sbjct: 427 VFDRERMVLGWKNFNCYNFDE 447


>gi|449495082|ref|XP_004159729.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
           1-like [Cucumis sativus]
          Length = 524

 Score = 91.7 bits (226), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 94/377 (24%), Positives = 154/377 (40%), Gaps = 51/377 (13%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGT--------IFYPSRSSSY 111
           L+Y  +++G P VP   A+DTGS L W+ C  C +C   L T        I+ P+ SS+ 
Sbjct: 106 LYYAEVTVGTPGVPYLVALDTGSDLFWLPC-DCVNCITGLNTTQGPVNFNIYSPNNSSTS 164

Query: 112 AYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTE---QLTFKNTDESTIH 168
             V C S  C +    +CS+    C Y   YL    +S     E    LT  +     ++
Sbjct: 165 KEVQCSSSLCSHL--DQCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHLTTNDVQSKPVN 222

Query: 169 VQDVVFGCGFSTNRNFKFS----GIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPD 218
            + +  GCG   +  F  S    G+FGLGI   S+ S L      ++SFS C G      
Sbjct: 223 AR-ITLGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPAR--- 278

Query: 219 YLHNKLILGDGAIIDEGDATPLQFIDGH--YYITLEAISVDGRMLDINPNIFKRDDSGGG 276
               ++  GD     + + TP      H  Y +++  I V G + D++            
Sbjct: 279 --MGRIEFGDKGSPGQNE-TPFNLGRRHPTYNVSITQIGVGGHISDLD----------VA 325

Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYS-WPDKLCYHGIMSSDLKGFPTVR 335
           V+ DSGT  T+L   AY    D+    +E +Q    S  P + CY    +     +P + 
Sbjct: 326 VIFDSGTSFTYLNDPAYSLFADKFASMVEEKQFTMNSDIPFENCYELSPNQTTFTYPLMN 385

Query: 336 FHFRGGAKLALEKDSMFYQPRPD-AFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
              +GG    +    +         FC+A+       R  + ++IG      Y++ +D  
Sbjct: 386 LTMKGGGHFVINHPIVLISTESKRLFCLAI------ARSDSINIIGQNFMTGYHIVFDRE 439

Query: 395 RKQKTFQRMDCEVLDDD 411
           +    ++  +C   +D+
Sbjct: 440 KMVLGWKESNCTGYEDE 456


>gi|449456843|ref|XP_004146158.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 547

 Score = 91.7 bits (226), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 94/377 (24%), Positives = 154/377 (40%), Gaps = 51/377 (13%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGT--------IFYPSRSSSY 111
           L+Y  +++G P VP   A+DTGS L W+ C  C +C   L T        I+ P+ SS+ 
Sbjct: 129 LYYAEVTVGTPGVPYLVALDTGSDLFWLPC-DCVNCITGLNTTQGPVNFNIYSPNNSSTS 187

Query: 112 AYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTE---QLTFKNTDESTIH 168
             V C S  C +    +CS+    C Y   YL    +S     E    LT  +     ++
Sbjct: 188 KEVQCSSSLCSHL--DQCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHLTTNDVQSKPVN 245

Query: 169 VQDVVFGCGFSTNRNFKFS----GIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPD 218
            + +  GCG   +  F  S    G+FGLGI   S+ S L      ++SFS C G      
Sbjct: 246 AR-ITLGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPAR--- 301

Query: 219 YLHNKLILGDGAIIDEGDATPLQFIDGH--YYITLEAISVDGRMLDINPNIFKRDDSGGG 276
               ++  GD     + + TP      H  Y +++  I V G + D++            
Sbjct: 302 --MGRIEFGDKGSPGQNE-TPFNLGRRHPTYNVSITQIGVGGHISDLD----------VA 348

Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYS-WPDKLCYHGIMSSDLKGFPTVR 335
           V+ DSGT  T+L   AY    D+    +E +Q    S  P + CY    +     +P + 
Sbjct: 349 VIFDSGTSFTYLNDPAYSLFADKFASMVEEKQFTMNSDIPFENCYELSPNQTTFTYPLMN 408

Query: 336 FHFRGGAKLALEKDSMFYQPRPD-AFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
              +GG    +    +         FC+A+       R  + ++IG      Y++ +D  
Sbjct: 409 LTMKGGGHFVINHPIVLISTESKRLFCLAI------ARSDSINIIGQNFMTGYHIVFDRE 462

Query: 395 RKQKTFQRMDCEVLDDD 411
           +    ++  +C   +D+
Sbjct: 463 KMVLGWKESNCTGYEDE 479


>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
 gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
          Length = 481

 Score = 91.7 bits (226), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 101/362 (27%), Positives = 151/362 (41%), Gaps = 61/362 (16%)

Query: 70  PPVPQFTAMDTGSSLLWVHCYPCR--DCSPQLGTIFYPSRSSSYAYVPCDSEHCRYF-PY 126
           P V Q   +D+ S + WV C PC    C PQ+ + + PSRS S A   C S  C    PY
Sbjct: 155 PGVIQTVVLDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPSSAPFSCSSPTCTALGPY 214

Query: 127 ARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF-- 184
           A   A  ++C Y   Y  G  TS     + LT    +     V    FGC  +   +F  
Sbjct: 215 ANGCA-NNQCQYLVRYPDGSSTSGAYIADLLTLDAGNA----VSGFKFGCSHAEQGSFDA 269

Query: 185 KFSGIFGLGIGRSSLVSQLNS----SFSYCIG---------SLHDPDYLHNKLILGDGAI 231
           + +GI  LG G  SL+SQ  S    +FSYCI          +L  P    ++ ++     
Sbjct: 270 RAAGIMALGGGPESLLSQTASRYGNAFSYCIPATASDSGFFTLGVPRRASSRYVV----- 324

Query: 232 IDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
                 TP+   +     Y + L  I+V G+ L + P +F       G ++DS T +T L
Sbjct: 325 ------TPMVRFRQAATFYGVLLRTITVGGQRLGVAPAVFA-----AGSVLDSRTAITRL 373

Query: 289 VKEAYEALRDEVMIRLEGEQMRSY-SWPDK----LCYHGIMSSDLKGFPTVRFHFRGGAK 343
              AY+ALR           M  Y S P K     CY      +++  P +   F   A 
Sbjct: 374 PPTAYQALRSAFR-----SSMTMYRSAPPKGYLDTCYDFTGVVNIR-LPKISLVFDRNAV 427

Query: 344 LALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
           L L+   + +       C+A   ++ ++R     ++G + QQ   V YD+G     F++ 
Sbjct: 428 LPLDPSGILFN-----DCLAFT-SNADDRMPG--VLGSVQQQTIEVLYDVGGGAVGFRQG 479

Query: 404 DC 405
            C
Sbjct: 480 AC 481


>gi|118486912|gb|ABK95290.1| unknown [Populus trichocarpa]
          Length = 438

 Score = 91.7 bits (226), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 102/392 (26%), Positives = 160/392 (40%), Gaps = 39/392 (9%)

Query: 32  RLAYLQEKIRSHNTYQAQILPSNDISNSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHCY 90
           RL YL   +    T    I P   +     YV  + +G P    F  +DT +   WV   
Sbjct: 69  RLKYL-STLADQKTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSNDAAWV--- 124

Query: 91  PCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAY-KHRCIYTQLYLIGPETS 149
           PC  C+    T F P+ S++   + C    C       C A     C++ Q Y  G ++S
Sbjct: 125 PCSGCTGFSSTTFLPNASTTLGSLDCSGAQCSQVRGFSCPATGSSACLFNQSY--GGDSS 182

Query: 150 VFVSTEQLTFKNTDESTIHVQDVV----FGC-GFSTNRNFKFSGIFGLGIGRSSLVSQLN 204
                  LT     ++     DV+    FGC    +  +    G+ GLG G  SL+SQ  
Sbjct: 183 -------LTATLVQDAITLANDVIPGFTFGCINAVSGGSIPPQGLLGLGRGPISLISQAG 235

Query: 205 S----SFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH----YYITLEAISV 256
           +     FSYC+ S     Y    L LG          TPL   + H    YY+ L  +SV
Sbjct: 236 AMYSGVFSYCLPSFKS-YYFSGSLKLGPVGQPKSIRTTPL-LRNPHRPSLYYVNLTGVSV 293

Query: 257 DGRMLDINPN--IFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSW 314
            GR+    P+  +    ++G G +IDSGT +T  V+  Y A+RDE   ++ G  + S   
Sbjct: 294 -GRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNGP-ISSLGA 351

Query: 315 PDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYV 374
            D  C+     ++    P +  HF G   +   ++S+ +       C+++  A+ NN   
Sbjct: 352 FDT-CFAATNEAEA---PAITLHFEGLNLVLPMENSLIHSSSGSLACLSMA-AAPNNVNS 406

Query: 375 NFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
             ++I  + QQ   + +D    +    R  C 
Sbjct: 407 VLNVIANLQQQNLRIMFDTTNSRLGIARELCN 438


>gi|125571060|gb|EAZ12575.1| hypothetical protein OsJ_02481 [Oryza sativa Japonica Group]
          Length = 501

 Score = 91.7 bits (226), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 101/378 (26%), Positives = 142/378 (37%), Gaps = 56/378 (14%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++  I +G P  P    +DTGS ++W+ C PCR C  Q G +F P  S SY  V C +  
Sbjct: 147 YFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDPRASHSYGAVDCAAPL 206

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
           CR      C   +  C+Y   Y  G  T+   +TE LTF     S   V  V  GCG   
Sbjct: 207 CRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFA----SGARVPRVALGCGHDN 262

Query: 181 NRNFKFSGIFGLGI-GRSSLVSQLN----SSFSYCI-------------------GSLHD 216
              F  +        G  S  SQ++     SFSYC+                   GS   
Sbjct: 263 EGLFVAAAGLLGLGRGSLSFPSQISRRFGRSFSYCLVDRTSSSASATSRSSTVTFGSGAR 322

Query: 217 PDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGG 276
              L  +++  DG    +GD   L+   GH                 +P+  +     GG
Sbjct: 323 -GALGRRVLHPDGEEPQDGDVL-LRAAHGHQRRRRARPGRGRVRPPPDPSTGR-----GG 375

Query: 277 VMIDSG-TDVTWLVKEAYEALRDEVMIRLEGEQMR--SYSWPDKLCYHGIMSSDLKGF-- 331
           V++DSG     W                  G ++    +S  D  CY      DL G   
Sbjct: 376 VIVDSGRPSPAWARAGRTPPCATRSRAAAAGLRLSPGGFSLFDT-CY------DLSGLKV 428

Query: 332 ---PTVRFHFRGGAKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFY 387
              PTV  HF GGA+ AL  ++ +        FC A   A  +      S+IG + QQ +
Sbjct: 429 VKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAF--AGTDG---GVSIIGNIQQQGF 483

Query: 388 NVGYDIGRKQKTFQRMDC 405
            V +D   ++  F    C
Sbjct: 484 RVVFDGDGQRLGFVPKGC 501


>gi|224072901|ref|XP_002303933.1| predicted protein [Populus trichocarpa]
 gi|222841365|gb|EEE78912.1| predicted protein [Populus trichocarpa]
          Length = 370

 Score = 91.7 bits (226), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 93/360 (25%), Positives = 143/360 (39%), Gaps = 28/360 (7%)

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
           I +  + V   +G PP     A+D      W+ C  C  CS    T+F   +S+++  + 
Sbjct: 30  IQSPSYIVKAKVGTPPQTLLMALDNSYDAAWIPCKGCVGCS---STVFNTVKSTTFKTLG 86

Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
           C +  C+  P   C      C +   Y     +S  +S   LT      S   V    FG
Sbjct: 87  CGAPQCKQVPNPICGG--STCTWNTTY----GSSTILS--NLTRDTIALSMDPVPYYAFG 138

Query: 176 C-GFSTNRNFKFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGA 230
           C   +T  +    G+ G G G  S +SQ      S+FSYC+ S    ++    L LG   
Sbjct: 139 CIQKATGSSVPPQGLLGFGRGPLSFLSQTQNLYKSTFSYCLPSFRTLNF-SGSLRLGPVG 197

Query: 231 IIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVT 286
                  TPL         YY+ L  I V  +++DI  +       +G G + DSGT  T
Sbjct: 198 QPPRIKTTPLLKNPRRSSLYYVKLNGIRVGRKIVDIPRSALAFNPTTGAGTIFDSGTVFT 257

Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
            LV  AY A+R+E   R+    + S    D  CY   +       PT+ F F G      
Sbjct: 258 RLVAPAYIAVRNEFRKRVGNATVSSLGGFDT-CYSVPIVP-----PTITFMFSGMNVTMP 311

Query: 347 EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
            ++ + +       C+A+  A  N   V  ++I  M QQ + + +D+   +    R  C 
Sbjct: 312 PENLLIHSTAGVTSCLAMAAAPDNVNSV-LNVIASMQQQNHRILFDVPNSRLGVAREQCS 370


>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 533

 Score = 91.7 bits (226), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 107/382 (28%), Positives = 154/382 (40%), Gaps = 66/382 (17%)

Query: 61  FYVNISIGQPPVPQFTAM-DTGSSLLWVHCYPC--RDCSPQLGTIFYPSRSSSYAYVPCD 117
           +   I++G       T + DTGS L WV C PC    C  Q   +F P+ S ++A VPC 
Sbjct: 180 YVTTIALGGGGAKNLTVIVDTGSDLTWVQCEPCPGSSCYAQRDPLFDPAASPTFAAVPCG 239

Query: 118 SEHCRYF---------PYARCSA-YKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI 167
           S  C              AR +   + RC Y   Y  G  +   ++ + L       +T 
Sbjct: 240 SPACAASLKDATGAPGSCARSAGNSEQRCYYALSYGDGSFSRGVLAQDTLGLG----TTT 295

Query: 168 HVQDVVFGCGFSTNRNFKFSGIFGL-GIGRS--SLVSQLNSSF----SYCI-------GS 213
            +   VFGCG S NR   F G  GL G+GR+  SLVSQ  + F    SYC+       GS
Sbjct: 296 KLDGFVFGCGLS-NRGL-FGGTAGLMGLGRTDLSLVSQTAARFGGVFSYCLPATTTSTGS 353

Query: 214 LHD--------PDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINP 265
           L          P+  + ++I          D T   F    Y+I +   +V G      P
Sbjct: 354 LSLGPGPSSSFPNMAYTRMI---------ADPTQPPF----YFINITGAAVGGGAALTAP 400

Query: 266 NIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMS 325
                    G V++DSGT +T L    Y+A+R E   R E      +S  D  CY  +  
Sbjct: 401 GF-----GAGNVLVDSGTVITRLAPSVYKAVRAEFARRFEYPAAPGFSILDA-CYD-LTG 453

Query: 326 SDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDA--FCMAVNPASINNRYVNFSLIGMMA 383
            D    P +     GGA++ ++   M +  R D    C+A+      ++     +IG   
Sbjct: 454 RDEVNVPLLTLTLEGGAQVTVDAAGMLFVVRKDGSQVCLAMASLPYEDQT---PIIGNYQ 510

Query: 384 QQFYNVGYDIGRKQKTFQRMDC 405
           Q+   V YD    +  F   DC
Sbjct: 511 QRNKRVVYDTVGSRLGFADEDC 532


>gi|88174561|gb|ABD39355.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
          Length = 321

 Score = 91.7 bits (226), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 85/329 (25%), Positives = 140/329 (42%), Gaps = 37/329 (11%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + +++ +G P   Q   +DTGSS  WV C  C  C     T F  SRS++ A V C +  
Sbjct: 1   YVISVGLGTPSKTQILEIDTGSSTSWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58

Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
           C      P+ + S     C +   Y  G  +   +  + LTF +  +    +    FGC 
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPSFSFGCN 114

Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS---FSYCIGSLHDPDYLHNKLI--LGDG 229
              F  N      G+ G+G G  S++ Q + +   FSYC+          +K       G
Sbjct: 115 MDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQMSERGFFSKTTGYFSLG 174

Query: 230 AIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
            +    D    + +        +++ L AISVDG  L ++P+IF R     GV+ DSG++
Sbjct: 175 KVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSRK----GVVFDSGSE 230

Query: 285 VTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
           ++++   A   L     E+++R    +  S    ++ CY  + S D    P +  HF  G
Sbjct: 231 LSYIPDRALSVLSQRIRELLLRRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFDDG 285

Query: 342 AKLALEKDSMFYQ---PRPDAFCMAVNPA 367
           A+  L    +F +      D +C+A  P 
Sbjct: 286 ARFDLGSHGVFVERSVQEQDVWCLAFAPT 314


>gi|115480451|ref|NP_001063819.1| Os09g0542100 [Oryza sativa Japonica Group]
 gi|113632052|dbj|BAF25733.1| Os09g0542100, partial [Oryza sativa Japonica Group]
          Length = 490

 Score = 91.7 bits (226), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 104/381 (27%), Positives = 161/381 (42%), Gaps = 61/381 (16%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC----SPQLGT----IFYPSRSSSY 111
           L Y  +++G P V    A+DTGS L WV C  C  C    SP  G+    ++ P++S++ 
Sbjct: 75  LHYAVVALGTPNVTFLVALDTGSDLFWVPC-DCLKCAPFQSPNYGSLKFDVYSPAQSTTS 133

Query: 112 AYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD--ESTIHV 169
             VPC S  C       C +  + C Y+  YL    +S  V  E + +  +D  +S I  
Sbjct: 134 RKVPCSSNLCDL--QNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKIVT 191

Query: 170 QDVVFGCGFSTNRNFKFS----GIFGLGIGRSSLVSQLNS------SFSYCIGSLHDPDY 219
             ++FGCG     +F  S    G+ GLG+   S+ S L S      SFS C G     D 
Sbjct: 192 APIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFG-----DD 246

Query: 220 LHNKLILGDGAIIDEGDATPLQFI--DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
            H ++  GD    D+ + TPL     + +Y IT+  I+V  + +      F         
Sbjct: 247 GHGRINFGDTGSSDQKE-TPLNVYKQNPYYNITITGITVGSKSISTE---FS-------A 295

Query: 278 MIDSGTDVTWLVKEAYEALRD--EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
           ++DSGT  T L    Y  +    +  IR     M   S P + CY   +S++    P V 
Sbjct: 296 IVDSGTSFTALSDPMYTQITSSFDAQIR-SSRNMLDSSMPFEFCYS--VSANGIVHPNVS 352

Query: 336 FHFRGGAKLALE------KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNV 389
              +GG+   +        D+ F    P  +C+A+    + +  VN  LIG        V
Sbjct: 353 LTAKGGSIFPVNDPIITITDNAF---NPVGYCLAI----MKSEGVN--LIGENFMSGLKV 403

Query: 390 GYDIGRKQKTFQRMDCEVLDD 410
            +D  R    ++  +C   D+
Sbjct: 404 VFDRERMVLGWKNFNCYNFDE 424


>gi|359806832|ref|NP_001241567.1| uncharacterized protein LOC100819698 precursor [Glycine max]
 gi|255638149|gb|ACU19388.1| unknown [Glycine max]
          Length = 437

 Score = 91.3 bits (225), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 106/388 (27%), Positives = 155/388 (39%), Gaps = 33/388 (8%)

Query: 31  ARLAYLQEKIRSHNTYQAQILPSNDISNSLFY-VNISIGQPPVPQFTAMDTGSSLLWVHC 89
           AR+ YL   +   +     I     I+ S  Y V   IG P      AMDT +   WV C
Sbjct: 69  ARMQYLSSLVARRSI--VPIASGRQITQSPTYIVKAKIGTPAQTLLLAMDTSNDASWVPC 126

Query: 90  YPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETS 149
             C  CS    T F P++S+++  V C +  C+      C      C +   Y     +S
Sbjct: 127 TACVGCS--TTTPFAPAKSTTFKKVGCGASQCKQVRNPTCDG--SACAFNFTY---GTSS 179

Query: 150 VFVSTEQLTFKNTDESTIHVQDVVFGC-----GFSTNRNFKFSGIFGLGIGRSSLVSQLN 204
           V  S  Q T      +T  V    FGC     G S           G     +       
Sbjct: 180 VAASLVQDTV---TLATDPVPAYAFGCIQKVTGSSVPPQGLLGLGRGPLSLLAQTQKLYQ 236

Query: 205 SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPL---QFIDGHYYITLEAISVDGRML 261
           S+FSYC+ S    ++    L LG  A       TPL         YY+ L AI V  R++
Sbjct: 237 STFSYCLPSFKTLNF-SGSLRLGPVAQPKRIKFTPLLKNPRRSSLYYVNLVAIRVGRRIV 295

Query: 262 DINPNIFKRD-DSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL-- 318
           DI P     + ++G G + DSGT  T LV+ AY A+R+E   R+   +  + +       
Sbjct: 296 DIPPEALAFNANTGAGTVFDSGTVFTRLVEPAYNAVRNEFRRRIAVHKKLTVTSLGGFDT 355

Query: 319 CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAF-CMAVNPASINNRYVNFS 377
           CY   + +     PT+ F F  G  + L  D++       +  C+A+ PA  N   V  +
Sbjct: 356 CYTAPIVA-----PTITFMF-SGMNVTLPPDNILIHSTAGSVTCLAMAPAPDNVNSV-LN 408

Query: 378 LIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           +I  M QQ + V +D+   +    R  C
Sbjct: 409 VIANMQQQNHRVLFDVPNSRLGVARELC 436


>gi|88174591|gb|ABD39370.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score = 91.3 bits (225), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 85/329 (25%), Positives = 139/329 (42%), Gaps = 37/329 (11%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           +  ++ +G P   Q   +DTGSS  WV C  C  C     T F  SRS++ A V C +  
Sbjct: 1   YVTSVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58

Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
           C      P+ + S     C +   Y  G  +   +  + LTF +  +    +    FGC 
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPSFTFGCN 114

Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLNSSF---SYCIGSLHDPDYLHNKLI--LGDG 229
              F  N      G+ G+G G  S++ Q + +F   SYC+          +K       G
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSERGFFSKTTGYFSLG 174

Query: 230 AIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
            +    D    + +        +++ L AISVDG  L ++P+IF R     GV+ DSG++
Sbjct: 175 KVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRK----GVVFDSGSE 230

Query: 285 VTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
           ++++   A   L     E+++R    +  S    ++ CY  + S D    P +  HF  G
Sbjct: 231 LSYIPDRALSVLSQRIRELLLRRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFDDG 285

Query: 342 AKLALEKDSMFYQ---PRPDAFCMAVNPA 367
           A+  L    +F +      D +C+A  P 
Sbjct: 286 ARFDLGIHGVFVERSVQEQDVWCLAFAPT 314


>gi|42565828|ref|NP_190704.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332645262|gb|AEE78783.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 488

 Score = 91.3 bits (225), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 100/374 (26%), Positives = 155/374 (41%), Gaps = 54/374 (14%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGT---------IFYPSRSSS 110
           L Y N++IG P      A+DTGS L W+ C     C   + T         I+ PS+S S
Sbjct: 88  LHYANVTIGTPAQWFLVALDTGSDLFWLPCNCNSTCVRSMETDQGERIKLNIYNPSKSKS 147

Query: 111 YAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
            + V C+S  C      RC +    C Y   YL     S  V  E +   +T+E      
Sbjct: 148 SSKVTCNSTLCAL--RNRCISPVSDCPYRIRYLSPGSKSTGVLVEDVIHMSTEEGEARDA 205

Query: 171 DVVFGCGFSTNRNFK---FSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYLH 221
            + FGC  S    FK    +GI GL I   ++ + L      + SFS C G    P+   
Sbjct: 206 RITFGCSESQLGLFKEVAVNGIMGLAIADIAVPNMLVKAGVASDSFSMCFG----PN--- 258

Query: 222 NKLILGDGAII--DEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRD----DSGG 275
                G G I   D+G +  L+        T  + ++     D++   FK      D+  
Sbjct: 259 -----GKGTISFGDKGSSDQLE--------TPLSGTISPMFYDVSITKFKVGKVTVDTEF 305

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQM-RSYSWPDKLCYHGIMSSDLKGFPTV 334
               DSGT VTWL++  Y AL     + +   ++ +S   P + CY    +SD    P+V
Sbjct: 306 TATFDSGTAVTWLIEPYYTALTTNFHLSVPDRRLSKSVDSPFEFCYIITSTSDEDKLPSV 365

Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAF---CMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
            F  +GGA   +    + +     +F   C+AV    +     +FS+IG      Y + +
Sbjct: 366 SFEMKGGAAYDVFSPILVFDTSDGSFQVYCLAV----LKQVNADFSIIGQNFMTNYRIVH 421

Query: 392 DIGRKQKTFQRMDC 405
           D  R+   +++ +C
Sbjct: 422 DRERRILGWKKSNC 435


>gi|30680102|ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|17978947|gb|AAL47439.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
 gi|22655368|gb|AAM98276.1| At2g17760/At2g17760 [Arabidopsis thaliana]
 gi|330251585|gb|AEC06679.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 513

 Score = 91.3 bits (225), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 100/372 (26%), Positives = 157/372 (42%), Gaps = 51/372 (13%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGT---------IFYPSRSSS 110
           L Y N+++G P      A+DTGS L W+ C  C +C  +L           I+ P+ SS+
Sbjct: 103 LHYANVTVGTPSDWFMVALDTGSDLFWLPC-DCTNCVRELKAPGGSSLDLNIYSPNASST 161

Query: 111 YAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQ-LTFKNTDESTIHV 169
              VPC+S  C      RC++ +  C Y   YL    +S  V  E  L   + D+S+  +
Sbjct: 162 STKVPCNSTLCTRGD--RCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAI 219

Query: 170 -QDVVFGCGFSTNRNF----KFSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPD 218
              V FGCG      F      +G+FGLG+   S+ S L       +SFS C G+     
Sbjct: 220 PARVTFGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDG--- 276

Query: 219 YLHNKLILGDGAIIDEGDATPLQFIDGH--YYITLEAISVDGRMLDINPNIFKRDDSGGG 276
               ++  GD   +D+ + TPL     H  Y IT+  ISV G   D+  +          
Sbjct: 277 --AGRISFGDKGSVDQRE-TPLNIRQPHPTYNITVTKISVGGNTGDLEFD---------- 323

Query: 277 VMIDSGTDVTWLVKEAYEALRDEV-MIRLEGE-QMRSYSWPDKLCYHGIMSSDLKGFPTV 334
            + DSGT  T+L   AY  + +    + L+   Q      P + CY    + D   +P V
Sbjct: 324 AVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALSPNKDSFQYPAV 383

Query: 335 RFHFRGGAKLALEKDSMFYQPR-PDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
               +GG+   +    +    +  D +C+A+       +  + S+IG      Y V +D 
Sbjct: 384 NLTMKGGSSYPVYHPLVVIPMKDTDVYCLAI------MKIEDISIIGQNFMTGYRVVFDR 437

Query: 394 GRKQKTFQRMDC 405
            +    ++  DC
Sbjct: 438 EKLILGWKESDC 449


>gi|225464832|ref|XP_002272243.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 467

 Score = 91.3 bits (225), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 98/382 (25%), Positives = 161/382 (42%), Gaps = 40/382 (10%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHC---YPCRDCSPQL----GTIFYPSRSSSYAY 113
           + + +S G PP      MDTGS L+W  C   Y CR+CS         IF P  SSS   
Sbjct: 90  YSIPLSFGTPPQTLPLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSSNIFIPKSSSSSKV 149

Query: 114 VPCDSEHCRYFPYARCSAYKHRCIYT--QLYLIGPETSVF----VSTEQLTFKNTDESTI 167
           + C +  C +   ++  +    C  T      I P   VF    ++   +  +  D    
Sbjct: 150 LGCVNPKCGWIHGSKVQSRCRDCEPTSPNCTQICPPYLVFYGSGITGGIMLSETLDLPGK 209

Query: 168 HVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLIL 226
            V + + GC  S     + +GI G G G  SL SQL    FSYC+ S    D   +  ++
Sbjct: 210 GVPNFIVGC--SVLSTSQPAGISGFGRGPPSLPSQLGLKKFSYCLLSRRYDDTTESSSLV 267

Query: 227 GDGAIIDEGDATP----LQFIDG-----------HYYITLEAISVDGRMLDIN-PNIFKR 270
            DG   D G+ T       F+             +YY+ L  I+V G+ + I    +   
Sbjct: 268 LDGE-SDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITVGGKHVKIPYKYLIPG 326

Query: 271 DDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL--CYHGIMSSDL 328
            D  GG +IDSGT  T++  E +E +  E   +++ ++         L  C++ I   + 
Sbjct: 327 ADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQSKRATEVEGITGLRPCFN-ISGLNT 385

Query: 329 KGFPTVRFHFRGGAKLALE-KDSMFYQPRPDAFCMAVNPASINNRYVNFS---LIGMMAQ 384
             FP +   FRGGA++ L   + + +    D  C+ +       +  +     ++G   Q
Sbjct: 386 PSFPELTLKFRGGAEMELPLANYVAFLGGDDVVCLTIVTDGAAGKEFSGGPAIILGNFQQ 445

Query: 385 QFYNVGYDIGRKQKTFQRMDCE 406
           Q + V YD+  ++  F++  C+
Sbjct: 446 QNFYVEYDLRNERLGFRQQSCK 467


>gi|51038078|gb|AAT93881.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 481

 Score = 91.3 bits (225), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 105/410 (25%), Positives = 154/410 (37%), Gaps = 68/410 (16%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR----------DCSPQLGTIFYPSRSSS 110
           +  +  IG PP P    +DTGS L+W  C  CR           C PQ    +  S S +
Sbjct: 78  YIASYGIGDPPQPAEAVVDTGSDLVWTQCSTCRLPAAAAAGGGGCFPQNLPYYNFSLSRT 137

Query: 111 YAYVPCDSEH---CRYFP-YARCS----AYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT 162
              VPCD +    C   P  A C+    +    C+    Y  G    V   T+  TF ++
Sbjct: 138 ARAVPCDDDDGALCGVAPETAGCARGGGSGDDACVVAASYGAGVALGVL-GTDAFTFPSS 196

Query: 163 DESTIHVQDVVFGC----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS-FSYCIGSLHDP 217
              T+      FGC      S       SGI GLG G  SLVSQLN++ FSYC+      
Sbjct: 197 SSVTL-----AFGCVSQTRISPGALNGASGIIGLGRGALSLVSQLNATEFSYCLTPYFRD 251

Query: 218 DYLHNKLILGDG-----------AIIDEGDATPLQFIDG--------HYYITLEAISVDG 258
               + L +GDG                   T + F            YY+ L  ++   
Sbjct: 252 TVSPSHLFVGDGELAGLSAAAGGGGGGGAPVTTVPFAKNPKDSPFSTFYYLPLVGLAAGN 311

Query: 259 RMLDINPNIFKRDDS-----GGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQM---- 309
             + +    F   ++      GG +IDSG+  T LV  A+ AL  E+  +L G       
Sbjct: 312 ATVALPAGAFDLREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGSLVPP 371

Query: 310 -RSYSWPDKLCYHGIMSSD---LKGFPTVRFHF----RGGAKLALEKDSMFYQPRPDAFC 361
                   +LC       D       P +   F     GG +L +  +  + +     +C
Sbjct: 372 PAKLGGALELCVEAGDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEKYWARVEASTWC 431

Query: 362 MAVNPASINNRYV---NFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
           MAV  ++  N  +     ++IG   QQ   V YD+     +FQ  +C  +
Sbjct: 432 MAVVSSASGNATLPTNETTIIGNFMQQDMRVLYDLANGLLSFQPANCSAV 481


>gi|88174563|gb|ABD39356.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
          Length = 323

 Score = 91.3 bits (225), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 86/331 (25%), Positives = 140/331 (42%), Gaps = 39/331 (11%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + +++ +G P   Q   +DTGSS  WV C  C  C     T F  SRS++ A V C +  
Sbjct: 1   YVISVGLGTPSKTQILEIDTGSSTSWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58

Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
           C      P+ + S     C +   Y  G  +   +  + LTF +  +    +    FGC 
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPGFSFGCN 114

Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS---FSYCIGSLHDPDYLHNKLI----LG 227
              F  N      G+ G+G G  S++ Q + +   FSYC+          +K      LG
Sbjct: 115 MDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQMSERGFFSKTTGYFSLG 174

Query: 228 DGAIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSG 282
                   D    + +        +++ L AISVDG  L ++P+IF R     GV+ DSG
Sbjct: 175 GKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRK----GVVFDSG 230

Query: 283 TDVTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFR 339
           ++++++   A   L     E+++R    +  S    ++ CY  + S D    P +  HF 
Sbjct: 231 SELSYIPDRALSVLSQRIRELLLRRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFD 285

Query: 340 GGAKLALEKDSMFYQ---PRPDAFCMAVNPA 367
            GA+  L    +F +      D +C+A  P 
Sbjct: 286 DGARFDLGSHGVFVERSVQEQDVWCLAFAPT 316


>gi|115473845|ref|NP_001060521.1| Os07g0658600 [Oryza sativa Japonica Group]
 gi|22775625|dbj|BAC15479.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
 gi|50510141|dbj|BAD31106.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
 gi|113612057|dbj|BAF22435.1| Os07g0658600 [Oryza sativa Japonica Group]
          Length = 449

 Score = 91.3 bits (225), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 104/390 (26%), Positives = 156/390 (40%), Gaps = 31/390 (7%)

Query: 31  ARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCY 90
           +RL YL         Y         +    + V   +G P      A+DT +   W+ C 
Sbjct: 77  SRLLYLDSLAVKGRAYAPIASGRQLLQTPTYVVRARLGTPAQQLLLAVDTSNDAAWIPCS 136

Query: 91  PCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSV 150
            C  C     + F P+ S+SY  VPC S  C   P   CS     C ++  Y    ++S+
Sbjct: 137 GCAGC--PTSSPFNPAASASYRPVPCGSPQCVLAPNPSCSPNAKSCGFSLSYA---DSSL 191

Query: 151 FVSTEQLTFKNTDESTIHVQDVVFGC-GFSTNRNFKFSGIFGLGIGRSSLVSQLN----S 205
             +  Q T     +    V+   FGC   +T       G+ GLG G  S +SQ      +
Sbjct: 192 QAALSQDTLAVAGDV---VKAYTFGCLQRATGTAAPPQGLLGLGRGPLSFLSQTKDMYGA 248

Query: 206 SFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH----YYITLEAISVDGRML 261
           +FSYC+ S    ++    L LG          TPL   + H    YY+ +  I V  +++
Sbjct: 249 TFSYCLPSFKSLNF-SGTLRLGRNGQPRRIKTTPL-LANPHRSSLYYVNMTGIRVGKKVV 306

Query: 262 DINPNIFKRDD-SGGGVMIDSGTDVTWLVKEAYEALRDEVMIRL-EGEQMRSYSWPDKLC 319
            I  +    D  +G G ++DSGT  T LV   Y ALRDEV  R+  G    S       C
Sbjct: 307 SIPASALAFDPATGAGTVLDSGTMFTRLVAPVYLALRDEVRRRVGAGAAAVSSLGGFDTC 366

Query: 320 YHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFC--MAVNPASINNRYVNFS 377
           Y+  ++     +P V   F G      E++ + +       C  MA  P  +N      +
Sbjct: 367 YNTTVA-----WPPVTLLFDGMQVTLPEENVVIHTTYGTTSCLAMAAAPDGVNTV---LN 418

Query: 378 LIGMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
           +I  M QQ + V +D+   +  F R  C  
Sbjct: 419 VIASMQQQNHRVLFDVPNGRVGFARESCTA 448


>gi|32479948|emb|CAE01594.1| OSJNBa0008A08.2 [Oryza sativa Japonica Group]
 gi|38347627|emb|CAE05222.2| OSJNBa0011K22.4 [Oryza sativa Japonica Group]
 gi|38567678|emb|CAE75961.1| B1159F04.24 [Oryza sativa Japonica Group]
 gi|116309512|emb|CAH66578.1| OSIGBa0137O04.4 [Oryza sativa Indica Group]
          Length = 431

 Score = 90.9 bits (224), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 95/362 (26%), Positives = 151/362 (41%), Gaps = 37/362 (10%)

Query: 57  SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTI----FYPSRSS-SY 111
              L+Y +I IG P V  +  +DTGS   WV+   C+ C  +   +    FY  RSS S 
Sbjct: 55  GTGLYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSS 114

Query: 112 AYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFK---NTDESTIH 168
             V CD   C   P    +    RC Y   Y  G  T   + T+ L +       ++   
Sbjct: 115 KEVKCDDTICTSRPPCNMTL---RCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPT 171

Query: 169 VQDVVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDP 217
              V FGCG        N      GI G G    + +SQL ++      FS+C+      
Sbjct: 172 STSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCL------ 225

Query: 218 DYLHNKLILGDGAIID-EGDATPLQFIDGHYY-ITLEAISVDGRMLDINPNIFKRDDSGG 275
           D  +   I   G +++ +   TP+   +  Y+ + L++I+V G  L +  NIF    +  
Sbjct: 226 DSTNGGGIFAIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKT-K 284

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS-YSWPDKLCYHGIMSSDLKGFPTV 334
           G  IDSG+ + +L +  Y  L   V  +     M + Y++    C+H + S D K FP +
Sbjct: 285 GTFIDSGSTLVYLPEIIYSELILAVFAKHPDITMGAMYNFQ---CFHFLGSVDDK-FPKI 340

Query: 335 RFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
            FHF     L +       +   + +C     A I+  Y +  ++G M      V YD+ 
Sbjct: 341 TFHFENDLTLDVYPYDYLLEYEGNQYCFGFQDAGIHG-YKDMIILGDMVISNKVVVYDME 399

Query: 395 RK 396
           ++
Sbjct: 400 KQ 401


>gi|356502091|ref|XP_003519855.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 519

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 98/377 (25%), Positives = 158/377 (41%), Gaps = 52/377 (13%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFY---------PSRSSS 110
           L Y  + IG P V    A+DTGS L WV C  C  C+    T F          P+ SS+
Sbjct: 99  LHYTTVQIGTPGVKFMVALDTGSDLFWVPC-DCTRCAASDSTAFASDFDLNVYNPNGSST 157

Query: 111 YAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIH-- 168
              V C++  C +   ++C      C Y   Y +  ETS      +     T E   H  
Sbjct: 158 SKKVTCNNSLCTH--RSQCLGTFSNCPYMVSY-VSAETSTSGILVEDVLHLTQEDNHHDL 214

Query: 169 -VQDVVFGCGFSTNRNF----KFSGIFGLGIGRSSLVSQLN------SSFSYCIGSLHDP 217
              +V+FGCG   + +F      +G+FGLG+ + S+ S L+       SFS C G     
Sbjct: 215 VEANVIFGCGQIQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGR---- 270

Query: 218 DYLHNKLILGDGAIIDEGDATPLQFIDGH--YYITLEAISVDGRMLDINPNIFKRDDSGG 275
           D +  ++  GD    D+ D TP      H  Y IT+  + V   ++D+            
Sbjct: 271 DGI-GRISFGDKGSFDQ-DETPFNLNPSHPTYNITVTQVRVGTTVIDVEFT--------- 319

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYS-WPDKLCYHGIMSSDLKGFPTV 334
             + DSGT  T+LV   Y  L +    +++  + RS S  P + CY     ++    P+V
Sbjct: 320 -ALFDSGTSFTYLVDPTYTRLTESFHSQVQDRRHRSDSRIPFEYCYDMSPDANTSLIPSV 378

Query: 335 RFHFRGGAKLALEKDSMFYQPRPD-AFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
                GG+  A+    +    + +  +C+AV  ++        ++IG      Y V +D 
Sbjct: 379 SLTMGGGSHFAVYDPIIIISTQSELVYCLAVVKSA------ELNIIGQNFMTGYRVVFDR 432

Query: 394 GRKQKTFQRMDCEVLDD 410
            +    +++ DC  ++D
Sbjct: 433 EKLVLGWKKFDCYDIED 449


>gi|32526671|dbj|BAC79194.1| chloroplast nucleoid DNA-binding protein -like protein [Oryza
           sativa Japonica Group]
          Length = 732

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 101/380 (26%), Positives = 160/380 (42%), Gaps = 59/380 (15%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC----SPQLGT----IFYPSRSSSY 111
           L Y  +++G P V    A+DTGS L WV C  C  C    SP  G+    ++ P++S++ 
Sbjct: 98  LHYAVVALGTPNVTFLVALDTGSDLFWVPC-DCLKCAPFQSPNYGSLKFDVYSPAQSTTS 156

Query: 112 AYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD--ESTIHV 169
             VPC S  C       C +  + C Y+  YL    +S  V  E + +  +D  +S I  
Sbjct: 157 RKVPCSSNLCDL--QNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKIVT 214

Query: 170 QDVVFGCGFSTNRNFKFS----GIFGLGIGRSSLVSQLNS------SFSYCIGSLHDPDY 219
             ++FGCG     +F  S    G+ GLG+   S+ S L S      SFS C G     D 
Sbjct: 215 APIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFG-----DD 269

Query: 220 LHNKLILGDGAIIDEGDATPLQFI--DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
            H ++  GD    D+ + TPL     + +Y IT+  I+V  + +                
Sbjct: 270 GHGRINFGDTGSSDQKE-TPLNVYKQNPYYNITITGITVGSKSISTE----------FSA 318

Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQ-MRSYSWPDKLCYHGIMSSDLKGFPTVRF 336
           ++DSGT  T L    Y  +      ++   + M   S P + CY   +S++    P V  
Sbjct: 319 IVDSGTSFTALSDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCYS--VSANGIVHPNVSL 376

Query: 337 HFRGGAKLALE------KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
             +GG+   +        D+ F    P  +C+A+    + +  VN  LIG        V 
Sbjct: 377 TAKGGSIFPVNDPIITITDNAF---NPVGYCLAI----MKSEGVN--LIGENFMSGLKVV 427

Query: 391 YDIGRKQKTFQRMDCEVLDD 410
           +D  R    ++  +C   D+
Sbjct: 428 FDRERMVLGWKNFNCYNFDE 447


>gi|218194598|gb|EEC77025.1| hypothetical protein OsI_15381 [Oryza sativa Indica Group]
          Length = 422

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 95/359 (26%), Positives = 151/359 (42%), Gaps = 37/359 (10%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTI----FYPSRSS-SYAYV 114
           L+Y +I IG P V  +  +DTGS   WV+   C+ C  +   +    FY  RSS S   V
Sbjct: 58  LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEV 117

Query: 115 PCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFK---NTDESTIHVQD 171
            CD   C   P    +    RC Y   Y  G  T   + T+ L +       ++      
Sbjct: 118 KCDDTICTSRPPCNMTL---RCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTS 174

Query: 172 VVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYL 220
           V FGCG        N      GI G G    + +SQL ++      FS+C+      D  
Sbjct: 175 VTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCL------DST 228

Query: 221 HNKLILGDGAIID-EGDATPLQFIDGHYY-ITLEAISVDGRMLDINPNIFKRDDSGGGVM 278
           +   I   G +++ +   TP+   +  Y+ + L++I+V G  L +  NIF    +  G  
Sbjct: 229 NGGGIFAIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKT-KGTF 287

Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS-YSWPDKLCYHGIMSSDLKGFPTVRFH 337
           IDSG+ + +L +  Y  L   V  +     M + Y++    C+H + S D K FP + FH
Sbjct: 288 IDSGSTLVYLPEIIYSELILAVFAKHPDITMGAMYNFQ---CFHFLGSVDDK-FPKITFH 343

Query: 338 FRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
           F     L +       +   + +C     A I+  Y +  ++G M      V YD+ ++
Sbjct: 344 FENDLTLDVYPYDYLLEYEGNQYCFGFQDAGIHG-YKDMIILGDMVISNKVVVYDMEKQ 401


>gi|297832400|ref|XP_002884082.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297329922|gb|EFH60341.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 513

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 99/372 (26%), Positives = 157/372 (42%), Gaps = 51/372 (13%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGT---------IFYPSRSSS 110
           L Y N+++G P      A+DTGS L W+ C  C +C  +L           I+ P+ SS+
Sbjct: 103 LHYANVTVGTPSDWFLVALDTGSDLFWLPC-DCTNCVRELKAPGGSSLDLNIYSPNASST 161

Query: 111 YAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQ-LTFKNTDESTIHV 169
              VPC+S  C      RC++ +  C Y   YL    +S  V  E  L   + D+S+  +
Sbjct: 162 STKVPCNSTLCTRGD--RCASPESNCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAI 219

Query: 170 -QDVVFGCGFSTNRNF----KFSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPD 218
              V  GCG      F      +G+FGLG+   S+ S L       +SFS C G+     
Sbjct: 220 PARVTLGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDG--- 276

Query: 219 YLHNKLILGDGAIIDEGDATPLQFIDGH--YYITLEAISVDGRMLDINPNIFKRDDSGGG 276
               ++  GD   +D+ + TPL     H  Y IT+  ISV+G   D+  +          
Sbjct: 277 --AGRISFGDKGSVDQRE-TPLNIRQPHPTYNITVTKISVEGNTGDLEFD---------- 323

Query: 277 VMIDSGTDVTWLVKEAYEALRDEV-MIRLEGE-QMRSYSWPDKLCYHGIMSSDLKGFPTV 334
            + DSGT  T+L   AY  + +    + L+   Q      P + CY    + D   +P V
Sbjct: 324 AVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALSPNKDSFQYPAV 383

Query: 335 RFHFRGGAKLALEKDSMFYQPR-PDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
               +GG+   +    +    +  D +C+A+       +  + S+IG      Y V +D 
Sbjct: 384 NLTMKGGSSYPVYHPLVVIPMKDTDVYCLAI------LKIEDISIIGQNFMTGYRVVFDR 437

Query: 394 GRKQKTFQRMDC 405
            +    ++  DC
Sbjct: 438 EKLILGWKESDC 449


>gi|116311058|emb|CAH67989.1| OSIGBa0142I02-OSIGBa0101B20.32 [Oryza sativa Indica Group]
          Length = 488

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 99/383 (25%), Positives = 160/383 (41%), Gaps = 50/383 (13%)

Query: 58  NSLFYVNISIGQPP---VPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFY-PSRSSSYAY 113
            S + V + IG P     P++   DTGS L W  C PC +CS       + PS+S ++  
Sbjct: 120 GSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRR 179

Query: 114 VPCDSEHCRYFPYARCSAY------KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI 167
           + C    C       C+A          C++ + Y  G   S  + ++   F    +   
Sbjct: 180 LSCFDPMCEL-----CTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGG 234

Query: 168 H--VQDVVFGCGFSTN----RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGS------- 213
           +   +DV FGC    +    R +  +GI  LGIG+ S V+QL    FSYCI +       
Sbjct: 235 YQLERDVAFGCAHVEDSKAVRGYS-TGILALGIGKPSFVTQLGVDRFSYCIPASEITDDD 293

Query: 214 -LHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVD--GRMLDINP-NIFK 269
              D +   + L  G  A +  G   P +     Y + L+++     GR+    P  ++ 
Sbjct: 294 DDDDEERSASFLRFGSHARM-TGKRAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPVYV 352

Query: 270 RDDSGGGVM---IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS 326
             +     M   +DSGT + WL    +  L+  +   +   +    + P   CY G M +
Sbjct: 353 AGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDLTHPSLYCYLGNM-T 411

Query: 327 DLKGFPTVRFHFRGGAKLALEKDSMFYQPR---PDAFCMAVNPASINNRYVNFSLIGMMA 383
           D++   +V   F GGA L L   S+F+       D  C+AV          N +++G+  
Sbjct: 412 DVEAV-SVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAG-------NRAILGVYP 463

Query: 384 QQFYNVGYDIGRKQKTFQRMDCE 406
           Q+  NVGYD+   +  F R  C+
Sbjct: 464 QRNINVGYDLSTMEIAFDRDQCD 486


>gi|222637611|gb|EEE67743.1| hypothetical protein OsJ_25435 [Oryza sativa Japonica Group]
          Length = 396

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 99/360 (27%), Positives = 149/360 (41%), Gaps = 31/360 (8%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + V   +G P      A+DT +   W+ C  C  C     + F P+ S+SY  VPC S  
Sbjct: 54  YVVRARLGTPAQQLLLAVDTSNDAAWIPCSGCAGC--PTSSPFNPAASASYRPVPCGSPQ 111

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC-GFS 179
           C   P   CS     C ++  Y    ++S+  +  Q T     +    V+   FGC   +
Sbjct: 112 CVLAPNPSCSPNAKSCGFSLSYA---DSSLQAALSQDTLAVAGDV---VKAYTFGCLQRA 165

Query: 180 TNRNFKFSGIFGLGIGRSSLVSQLN----SSFSYCIGSLHDPDYLHNKLILGDGAIIDEG 235
           T       G+ GLG G  S +SQ      ++FSYC+ S    ++    L LG        
Sbjct: 166 TGTAAPPQGLLGLGRGPLSFLSQTKDMYGATFSYCLPSFKSLNF-SGTLRLGRNGQPRRI 224

Query: 236 DATPLQFIDGH----YYITLEAISVDGRMLDINPNIFKRDD-SGGGVMIDSGTDVTWLVK 290
             TPL   + H    YY+ +  I V  +++ I  +    D  +G G ++DSGT  T LV 
Sbjct: 225 KTTPL-LANPHRSSLYYVNMTGIRVGKKVVSIPASALAFDPATGAGTVLDSGTMFTRLVA 283

Query: 291 EAYEALRDEVMIRL-EGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKD 349
             Y ALRDEV  R+  G    S       CY+  ++     +P V   F G      E++
Sbjct: 284 PVYLALRDEVRRRVGAGAAAVSSLGGFDTCYNTTVA-----WPPVTLLFDGMQVTLPEEN 338

Query: 350 SMFYQPRPDAFC--MAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
            + +       C  MA  P  +N      ++I  M QQ + V +D+   +  F R  C  
Sbjct: 339 VVIHTTYGTTSCLAMAAAPDGVNTV---LNVIASMQQQNHRVLFDVPNGRVGFARESCTA 395


>gi|356570895|ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 470

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 113/450 (25%), Positives = 187/450 (41%), Gaps = 68/450 (15%)

Query: 7   ILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNIS 66
           +L+  ++ + +P H V+   + S+ R  +L  K R++N+      P+   S   + ++++
Sbjct: 36  LLTKPHSSDSDPFHSVKLAASSSLTRAHHL--KHRNNNSPSVATTPAYPKSYGGYSIDLN 93

Query: 67  IGQPPVPQFTAMDTGSSLLWVHC---YPCRDCS-----PQLGTIFYPSRSSSYAYVPCDS 118
           +G PP      +DTGSSL+W  C   Y C  C+     P     F P  SS+   + C +
Sbjct: 94  LGTPPQTSPFVLDTGSSLVWFPCTSHYLCSHCNFPNIDPTKIPTFIPKNSSTAKLLGCRN 153

Query: 119 EHCRYF----PYARCSAYK----HRC-----IYTQLYLIGPETSVFVSTEQLTFKNTDES 165
             C Y       +RC   K      C      Y   Y +G  T+ F+  + L F      
Sbjct: 154 PKCGYLFGPDVESRCPQCKKPGSQNCSLTCPSYIIQYGLG-ATAGFLLLDNLNFPGKT-- 210

Query: 166 TIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNK- 223
              V   + GC   + R  + SGI G G G+ SL SQ+N   FSYC+ S    D   +  
Sbjct: 211 ---VPQFLVGCSILSIR--QPSGIAGFGRGQESLPSQMNLKRFSYCLVSHRFDDTPQSSD 265

Query: 224 LILGDGAIIDEGDA-------TPLQ-------FIDGHYYITLEAISVDGRMLDINPNIFK 269
           L+L    I   GD        TP +           +YY+TL  + V G  + I     +
Sbjct: 266 LVL---QISSTGDTKTNGLSYTPFRSNPSNNSVFREYYYVTLRKLIVGGVDVKIPYKFLE 322

Query: 270 R-DDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK----LCYHGIM 324
              D  GG ++DSG+  T++ +  Y  +  E + +L  +  R  +   +     C++ I 
Sbjct: 323 PGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFLRQLGKKYSREENVEAQSGLSPCFN-IS 381

Query: 325 SSDLKGFPTVRFHFRGGAKLALEKDSMF-YQPRPDAFCMAV-------NPASINNRYVNF 376
                 FP   F F+GGAK++    + F +    +  C  V        P +     +  
Sbjct: 382 GVKTISFPEFTFQFKGGAKMSQPLLNYFSFVGDAEVLCFTVVSDGGAGQPKTAGPAII-- 439

Query: 377 SLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
             +G   QQ + V YD+  ++  F   +C+
Sbjct: 440 --LGNYQQQNFYVEYDLENERFGFGPRNCK 467


>gi|215766660|dbj|BAG98888.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 433

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 95/359 (26%), Positives = 151/359 (42%), Gaps = 37/359 (10%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTI----FYPSRSS-SYAYV 114
           L+Y +I IG P V  +  +DTGS   WV+   C+ C  +   +    FY  RSS S   V
Sbjct: 82  LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEV 141

Query: 115 PCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFK---NTDESTIHVQD 171
            CD   C   P    +    RC Y   Y  G  T   + T+ L +       ++      
Sbjct: 142 KCDDTICTSRPPCNMTL---RCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTS 198

Query: 172 VVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYL 220
           V FGCG        N      GI G G    + +SQL ++      FS+C+      D  
Sbjct: 199 VTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCL------DST 252

Query: 221 HNKLILGDGAIID-EGDATPLQFIDGHYY-ITLEAISVDGRMLDINPNIFKRDDSGGGVM 278
           +   I   G +++ +   TP+   +  Y+ + L++I+V G  L +  NIF    +  G  
Sbjct: 253 NGGGIFAIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKT-KGTF 311

Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS-YSWPDKLCYHGIMSSDLKGFPTVRFH 337
           IDSG+ + +L +  Y  L   V  +     M + Y++    C+H + S D K FP + FH
Sbjct: 312 IDSGSTLVYLPEIIYSELILAVFAKHPDITMGAMYNFQ---CFHFLGSVDDK-FPKITFH 367

Query: 338 FRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
           F     L +       +   + +C     A I+  Y +  ++G M      V YD+ ++
Sbjct: 368 FENDLTLDVYPYDYLLEYEGNQYCFGFQDAGIHG-YKDMIILGDMVISNKVVVYDMEKQ 425


>gi|296086208|emb|CBI31649.3| unnamed protein product [Vitis vinifera]
          Length = 761

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 95/387 (24%), Positives = 154/387 (39%), Gaps = 85/387 (21%)

Query: 47  QAQILPSNDIS----------NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCS 96
           + Q+LPS  +           N    V++++G PP      +DTGS L W+HC      +
Sbjct: 351 KTQVLPSGSVPRPSSKLSFHHNVSLTVSLTVGSPPQTVTMVLDTGSELSWLHC----KKA 406

Query: 97  PQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQ 156
           P L ++F P RSSSY+ +PC S  CR   +++ +            LIG          Q
Sbjct: 407 PNLHSVFDPLRSSSYSPIPCTSPTCRTRTHSKTTG-----------LIGMNRGSLSFVTQ 455

Query: 157 LTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHD 216
           +  +      I  QD               SGI   G           SSFS+     + 
Sbjct: 456 MGLQKFS-YCISGQDS--------------SGILLFG----------ESSFSWLKALKYT 490

Query: 217 PDYLHNKLILGDGAIIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGG 275
           P                   +TPL + D   Y + LE I V   ML +  +++  D +G 
Sbjct: 491 PLV---------------QISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGA 535

Query: 276 G-VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD-------KLCYH-GIMSS 326
           G  M+DSGT  T+L+   Y AL++E  +R     ++    P+        LCY   +   
Sbjct: 536 GQTMVDSGTQFTFLLGPVYTALKNE-FVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTRR 594

Query: 327 DLKGFPTVRFHFRGGAKLALEKDSMFY------QPRPDAFCMAVNPASINNRYVNFSLIG 380
            L   PTV   FR GA++++  + + Y      +     +C     + +    V   +IG
Sbjct: 595 TLPPLPTVTLMFR-GAEMSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLG--VESYIIG 651

Query: 381 MMAQQFYNVGYDIGRKQKTFQRMDCEV 407
              QQ   + +D+ + +  F  + C++
Sbjct: 652 HHHQQNVWMEFDLAKSRVGFAEVRCDL 678


>gi|218195474|gb|EEC77901.1| hypothetical protein OsI_17222 [Oryza sativa Indica Group]
          Length = 467

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 99/383 (25%), Positives = 160/383 (41%), Gaps = 50/383 (13%)

Query: 58  NSLFYVNISIGQPP---VPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFY-PSRSSSYAY 113
            S + V + IG P     P++   DTGS L W  C PC +CS       + PS+S ++  
Sbjct: 99  GSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRR 158

Query: 114 VPCDSEHCRYFPYARCSAY------KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI 167
           + C    C       C+A          C++ + Y  G   S  + ++   F    +   
Sbjct: 159 LSCFDPMCEL-----CTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGG 213

Query: 168 H--VQDVVFGCGFSTN----RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGS------- 213
           +   +DV FGC    +    R +  +GI  LGIG+ S V+QL    FSYCI +       
Sbjct: 214 YQLERDVAFGCAHVEDSKAVRGYS-TGILALGIGKPSFVTQLGVDRFSYCIPASEITDDD 272

Query: 214 -LHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVD--GRMLDINP-NIFK 269
              D +   + L  G  A +  G   P +     Y + L+++     GR+    P  ++ 
Sbjct: 273 DDDDEERSASFLRFGSHARM-TGKRAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPVYV 331

Query: 270 RDDSGGGVM---IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS 326
             +     M   +DSGT + WL    +  L+  +   +   +    + P   CY G M +
Sbjct: 332 AGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDLTHPSLYCYLGNM-T 390

Query: 327 DLKGFPTVRFHFRGGAKLALEKDSMFYQPR---PDAFCMAVNPASINNRYVNFSLIGMMA 383
           D++   +V   F GGA L L   S+F+       D  C+AV          N +++G+  
Sbjct: 391 DVEAV-SVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAG-------NRAILGVYP 442

Query: 384 QQFYNVGYDIGRKQKTFQRMDCE 406
           Q+  NVGYD+   +  F R  C+
Sbjct: 443 QRNINVGYDLSTMEIAFDRDQCD 465


>gi|449461377|ref|XP_004148418.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
 gi|449518059|ref|XP_004166061.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 436

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 103/394 (26%), Positives = 165/394 (41%), Gaps = 40/394 (10%)

Query: 31  ARLAYLQEKIRSHNTYQAQILPSNDISNSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHC 89
           AR+ YL   + +  T  A I     + N   YV  + +G P    +  +DT +   W  C
Sbjct: 65  ARIRYL-SSLTAQKTVAAPIASGQQVLNVGNYVVRVQLGTPGQTMYMVLDTSNDAAWAPC 123

Query: 90  YPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKH-RCIYTQLYLIGPET 148
             C  CS    T F    SS++A + C    C       C    +  C++ Q Y  G ++
Sbjct: 124 SGCIGCSST--TTFSAQNSSTFATLDCSKPECTQARGLSCPTTGNVDCLFNQTY--GGDS 179

Query: 149 SVFVSTEQLTFKNTDESTIH-----VQDVVFGC-GFSTNRNFKFSGIFGLGIGRSSLVSQ 202
           +   +  Q         ++H     + +  FGC   ++  +    G+ GLG G  SL+SQ
Sbjct: 180 TFSATLVQ--------DSLHLGPNVIPNFSFGCISSASGSSIPPQGLMGLGRGPLSLISQ 231

Query: 203 LNSS----FSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH----YYITLEAI 254
             S     FSYC+ S     Y    L LG          TPL   + H    YY+ L  I
Sbjct: 232 SGSLYSGLFSYCLPSFKS-YYFSGSLKLGPVGQPKAIRTTPL-LHNPHRPSLYYVNLTGI 289

Query: 255 SVDGRMLDINPNIFKRD-DSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYS 313
           SV   ++ I+P +   D ++G G +IDSGT +T  V   Y A+RDE   ++ G      +
Sbjct: 290 SVGRVLVPISPELLAFDPNTGAGTIIDSGTVITRFVPAIYTAVRDEFRKQVGGSFSPLGA 349

Query: 314 WPDKLCYHGIMSSDLKGFPTVRFHFRG-GAKLALEKDSMFYQPRPDAFCMAVNPASINNR 372
           +      +  +S+     P +  H  G   KL +E +S+ +       C+A+  A+ NN 
Sbjct: 350 FDTCFATNNEVSA-----PAITLHLSGLDLKLPME-NSLIHSSAGSLACLAMA-AAPNNV 402

Query: 373 YVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
               ++I  + QQ + + +DI   +    R  C 
Sbjct: 403 NSVVNVIANLQQQNHRILFDINNSKLGIARELCN 436


>gi|88174579|gb|ABD39364.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
           Group]
 gi|88174585|gb|ABD39367.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
           Group]
 gi|88174595|gb|ABD39372.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
 gi|88174599|gb|ABD39374.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
 gi|88174607|gb|ABD39378.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 83/329 (25%), Positives = 140/329 (42%), Gaps = 37/329 (11%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + +++ +G P   Q   +DTGSS  WV C  C  C     T F  SRS++ A V C +  
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58

Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
           C      P+ + S     C +   Y  G  +   +  + LTF +  +    +    FGC 
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPGFSFGCN 114

Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS---FSYCIGSLHDPDYLHNKLI--LGDG 229
              F  N      G+ G+G G  S++ Q + +   FSYC+          +K       G
Sbjct: 115 MDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDCFSYCLPLQKSERGFFSKTTGYFSLG 174

Query: 230 AIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
            +    D    + +        +++ L AISVDG  L ++P++F R     GV+ DSG++
Sbjct: 175 KVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRK----GVVFDSGSE 230

Query: 285 VTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
           ++++   A   L     E++++    +  S    ++ CY  + S D    P +  HF  G
Sbjct: 231 LSYIPDRALSVLSQRIRELLLKRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFDDG 285

Query: 342 AKLALEKDSMFYQ---PRPDAFCMAVNPA 367
           A+  L    +F +      D +C+A  P 
Sbjct: 286 ARFDLGSHGVFVERSVQEQDVWCLAFAPT 314


>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
 gi|194700872|gb|ACF84520.1| unknown [Zea mays]
          Length = 351

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 99/359 (27%), Positives = 150/359 (41%), Gaps = 55/359 (15%)

Query: 70  PPVPQFTAMDTGSSLLWVHCYPCR--DCSPQLGTIFYPSRSSSYAYVPCDSEHCRYF-PY 126
           P V Q   +D+ S + WV C PC    C PQ+ + + PSRS + A   C S  C    PY
Sbjct: 25  PGVIQTVVLDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPTSAAFSCSSPTCTALGPY 84

Query: 127 ARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF-- 184
           A   A  ++C Y   Y  G  TS     + LT    +     V    FGC  +   +F  
Sbjct: 85  ANGCA-NNQCQYLVRYPDGSSTSGAYIADLLTLDAGNA----VSGFKFGCSHAEQGSFDA 139

Query: 185 KFSGIFGLGIGRSSLVSQLNS----SFSYCIG---------SLHDPDYLHNKLILGDGAI 231
           + +GI  LG G  SL+SQ  S    +FSYCI          +L  P    ++ ++     
Sbjct: 140 RAAGIMALGGGPESLLSQTASRYGNAFSYCIPATASDSGFFTLGVPRRASSRYVV----- 194

Query: 232 IDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
                 TP+   +     Y + L  I+V G+ L + P +F       G ++DS T +T L
Sbjct: 195 ------TPMVRFRQAATFYGVLLRTITVGGQRLGVAPAVFA-----AGSVLDSRTAITRL 243

Query: 289 VKEAYEALRDEVMIRLEGEQMRSYSWPDKL--CYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
              AY+ALR     R      RS      L  CY      +++  P +   F   A L L
Sbjct: 244 PPTAYQALR--AAFRSSMTMYRSAPPKGYLDTCYDFTGVVNIR-LPKISLVFDRNAVLPL 300

Query: 347 EKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           +   + +       C+A   ++ ++R     ++G + QQ   V YD+G     F++  C
Sbjct: 301 DPSGILFND-----CLAFT-SNADDRMPG--VLGSVQQQTIEVLYDVGGGAVGFRQGAC 351


>gi|356559244|ref|XP_003547910.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 515

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 97/377 (25%), Positives = 157/377 (41%), Gaps = 52/377 (13%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFY---------PSRSSS 110
           L Y  + IG P V    A+DTGS L WV C  C  C+    + F          P+ SS+
Sbjct: 95  LHYTTVQIGTPGVKFMVALDTGSDLFWVPC-DCTRCAATDSSAFASDFDLNVYNPNGSST 153

Query: 111 YAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIH-- 168
              V C++  C +   ++C      C Y   Y +  ETS      +     T E   H  
Sbjct: 154 SKKVTCNNSLCMH--RSQCLGTLSNCPYMVSY-VSAETSTSGILVEDVLHLTQEDNHHDL 210

Query: 169 -VQDVVFGCGFSTNRNF----KFSGIFGLGIGRSSLVSQLN------SSFSYCIGSLHDP 217
              +V+FGCG   + +F      +G+FGLG+ + S+ S L+       SFS C G     
Sbjct: 211 VEANVIFGCGQIQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGR---- 266

Query: 218 DYLHNKLILGDGAIIDEGDATPLQFIDGH--YYITLEAISVDGRMLDINPNIFKRDDSGG 275
           D +  ++  GD    D+ D TP      H  Y IT+  + V   ++D+            
Sbjct: 267 DGI-GRISFGDKGSFDQ-DETPFNLNPSHPTYNITVTQVRVGTTLIDVEFT--------- 315

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYS-WPDKLCYHGIMSSDLKGFPTV 334
             + DSGT  T+LV   Y  L +    +++  + RS S  P + CY     ++    P+V
Sbjct: 316 -ALFDSGTSFTYLVDPTYTRLTESFHSQVQDRRHRSDSRIPFEYCYDMSPDANTSLIPSV 374

Query: 335 RFHFRGGAKLALEKDSMFYQPRPD-AFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
                GG+  A+    +    + +  +C+AV       +    ++IG      Y V +D 
Sbjct: 375 SLTMGGGSHFAVYDPIIIISTQSELVYCLAV------VKTAELNIIGQNFMTGYRVVFDR 428

Query: 394 GRKQKTFQRMDCEVLDD 410
            +    +++ DC  ++D
Sbjct: 429 EKLVLGWKKFDCYDIED 445


>gi|414587000|tpg|DAA37571.1| TPA: hypothetical protein ZEAMMB73_036171 [Zea mays]
          Length = 459

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 109/389 (28%), Positives = 157/389 (40%), Gaps = 63/389 (16%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + V +  G P      A+DT S L+W+ C PC  C  QL  +F P  SSSYA VPC S+ 
Sbjct: 92  YLVKLGTGTPQHFFSAAIDTASDLVWMQCQPCVSCYRQLDPVFNPKLSSSYAVVPCTSDT 151

Query: 121 CRYFPYARCSAYKH-RCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
           C      RC       C YT  Y     T   ++ ++L          H   VVFGC  S
Sbjct: 152 CAQLDGHRCHEDDDGACQYTYKYSGHGVTKGTLAIDKLAIGG---DVFHA--VVFGCSDS 206

Query: 180 T--NRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDP-DYLHNKLILGDG--AIID 233
           +      + SG+ GLG G  SLVSQL+   F YC   L  P      KL+LG G  A+ +
Sbjct: 207 SVGGPAAQASGLVGLGRGPLSLVSQLSVHRFMYC---LPPPMSRTSGKLVLGAGADAVRN 263

Query: 234 EGDATPLQFID-----GHYYITLEAISVDGRMLDINPNIFKRDDS--------------- 273
             D   +          +YY+ L+ ++V     D  P   +   S               
Sbjct: 264 MSDRVTVTMSSSTRYPSYYYLNLDGLAVG----DQTPGTTRNATSPPSGGAGGGGGGGGG 319

Query: 274 ---------GGGVMIDSGTDVTWLVKEAYEALRD--EVMIRLEGEQMRSYSWPDKLCY-- 320
                      G+++D  + +++L    Y+ L D  E  IRL      S      LC+  
Sbjct: 320 GIVGAGGANAYGMIVDVASTISFLETSLYDELADDLEEEIRLP-RATPSLRLGLDLCFIL 378

Query: 321 -HGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLI 379
             G+   D    PTV   F  G  L L++D +F     D   M +    +  R    S++
Sbjct: 379 PEGV-GMDRVYVPTVSLSFD-GRWLELDRDRLFVT---DGRMMCL----MIGRTSGVSIL 429

Query: 380 GMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
           G    Q   V +++ R + TF +  C+ L
Sbjct: 430 GNFQLQNMRVLFNLRRGKITFAKASCDSL 458


>gi|222629809|gb|EEE61941.1| hypothetical protein OsJ_16693 [Oryza sativa Japonica Group]
          Length = 648

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 101/408 (24%), Positives = 167/408 (40%), Gaps = 72/408 (17%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHC---YPCRDCSPQLGT----IFYPSRSSSYAY 113
           +   +S+G PP P    +DTGS L WV C   Y CR+CS         +F+P  SSS   
Sbjct: 89  YAFTVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNCSSLSAASPLHVFHPKNSSSSRL 148

Query: 114 VPCDSEHCRYF---------------PYARCSAYKHRC-----IYTQLYLIGPETSVFVS 153
           + C +  C +                P A C+            Y  +Y  G    + +S
Sbjct: 149 IGCRNPSCLWIHSPDHLSDCRAASSCPGANCTPRNANANNVCPPYLVVYGSGSTAGLLIS 208

Query: 154 TEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIG 212
               T +    +   V++ V GC  ++      SG+ G G G  S+ SQL  + FSYC+ 
Sbjct: 209 D---TLRTPGRA---VRNFVIGCSLASVHQPP-SGLAGFGRGAPSVPSQLGLTKFSYCLL 261

Query: 213 S--LHDPDYLHNKLILGD------------GAIIDEGDATPLQFIDGHYYITLEAISVDG 258
           S    D   +  +LILG               +     A P   +  +YY+ L AI+V G
Sbjct: 262 SRRFDDNAAVSGELILGGAGGKDGGVGMQYAPLARSASARPPYSV--YYYLALTAITVGG 319

Query: 259 RMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL 318
           + + +    F    +GGG ++DSGT  ++  +  +E +   V+  + G   RS    + L
Sbjct: 320 KSVQLPERAFVAGGAGGGAIVDSGTTFSYFDRTVFEPVAAAVVAAVGGRYSRSKVVEEGL 379

Query: 319 ----CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP----------DAFCMAV 364
               C+     +     P +  HF+GG+ + L  ++ F    P          +A C+AV
Sbjct: 380 GLSPCFAMPPGTKTMELPEMSLHFKGGSVMNLPVENYFVVAGPAPSGGAPAMAEAICLAV 439

Query: 365 -------NPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
                  +  +  +      ++G   QQ Y + YD+ +++  F+R  C
Sbjct: 440 VSDVPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLEKERLGFRRQQC 487


>gi|125548492|gb|EAY94314.1| hypothetical protein OsI_16081 [Oryza sativa Indica Group]
          Length = 417

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 89/324 (27%), Positives = 137/324 (42%), Gaps = 49/324 (15%)

Query: 22  VQRGINISIARLAYLQEKIRSHNTYQAQILPSNDI--SNSLFYVNISIGQPPVPQFTAMD 79
           ++R I  S  RLA +        + +  ++    I  +   + V + IG PP     A+D
Sbjct: 48  LRRAIQRSRYRLAGIGMARGEAASARKAVVAETPIMPAGGEYLVKLGIGTPPYKFTAAID 107

Query: 80  TGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCS-AYKHRCIY 138
           T S L+W  C PC  C  Q+  +F P  SS+YA +PC S+ C      RC       C Y
Sbjct: 108 TASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHDDDESCQY 167

Query: 139 TQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF---KFSGIFGLGIG 195
           T  Y     T   ++ ++L            + V FGC  S+       + SG+ GLG G
Sbjct: 168 TYTYSGNATTEGTLAVDKLVIGED-----AFRGVAFGCSTSSTGGAPPPQASGVVGLGRG 222

Query: 196 RSSLVSQLN-SSFSYCIGSLHDP-DYLHNKLILGDGAIIDEGD----ATPLQF---IDGH 246
             SLVSQL+   F+YC   L  P   +  KL+LG  A          A P++       +
Sbjct: 223 PLSLVSQLSVRRFAYC---LPPPASRIPGKLVLGADADAARNATNRIAVPMRRDPRYPSY 279

Query: 247 YYITLEAISVDGRMLDI---------------------NPN---IFKRDDSGGGVMIDSG 282
           YY+ L+ + +  R + +                     +PN   +   D +  G++ID  
Sbjct: 280 YYLNLDGLLIGDRTMSLPPTTTTTATATATATAPAPTPSPNATAVAVGDANRYGMIIDIA 339

Query: 283 TDVTWLVKEAYEALRD--EVMIRL 304
           + +T+L    Y+ L +  EV IRL
Sbjct: 340 STITFLEASLYDELVNDLEVEIRL 363


>gi|449434470|ref|XP_004135019.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
 gi|449517144|ref|XP_004165606.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 508

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 100/375 (26%), Positives = 164/375 (43%), Gaps = 51/375 (13%)

Query: 59  SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQL-----GTIF---YPSRSSS 110
           +L+Y N+SIG P +    A+DTGS L W+ C  C  C   L     G  +   Y S +SS
Sbjct: 102 NLYYANVSIGTPGLYFLVALDTGSDLFWLPC-ECTKCPTYLTKRDNGKFWLNHYSSNASS 160

Query: 111 YAY-VPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHV 169
            +  VPC S  C      +CS+ K  C Y   YL    +S     + +    TD+S +  
Sbjct: 161 TSIRVPCSSSLCEL--ANQCSSNKSSCPYQTHYLSENSSSAGYLVQDILHMATDDSQLKP 218

Query: 170 QDVVFGCGFSTNRNFKFS------GIFGLGIGRSSLVSQLNS------SFSYCIGSLHDP 217
            DV    G    +  KFS      G+ GLG+G+ S+ S L S      SFS C G     
Sbjct: 219 VDVKVTLGCGKVQTGKFSNVTAPNGLIGLGMGKVSVPSFLASQGLTTDSFSMCFGY---- 274

Query: 218 DYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
            Y + ++  GD   + + + TP       Y +T+  I V  R  +++             
Sbjct: 275 -YGYGRIDFGDIGPVGQRE-TPFNPASLSYNVTILQIIVTNRPTNVHLT----------A 322

Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYS-WPDKLCYHGIMSSDLKGFPTVRF 336
           +IDSG   T+L    Y  + + +   +E E+++S S +P + CY   +++  +  P + F
Sbjct: 323 IIDSGASFTYLTDPFYSIITENMDAAMELERIKSDSDFPFEYCYRLSLATIFQQ-PNLNF 381

Query: 337 HFRGGAKLALEKD--SMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
              GG K  +     S+     P A C+A+  ++      + ++IG      Y V ++  
Sbjct: 382 TMEGGRKFDVITSYVSVDTDDGP-ALCLAIVKST------DINVIGHNFFGGYRVVFNRE 434

Query: 395 RKQKTFQRMDCEVLD 409
           +    ++ +DC+  D
Sbjct: 435 KMTLGWKEVDCDSYD 449


>gi|115460260|ref|NP_001053730.1| Os04g0595000 [Oryza sativa Japonica Group]
 gi|113565301|dbj|BAF15644.1| Os04g0595000, partial [Oryza sativa Japonica Group]
          Length = 471

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 99/385 (25%), Positives = 160/385 (41%), Gaps = 52/385 (13%)

Query: 58  NSLFYVNISIGQPP---VPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFY-PSRSSSYAY 113
            S + V + IG P     P++   DTGS L W  C PC +CS       + PS+S ++  
Sbjct: 101 GSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRR 160

Query: 114 VPCDSEHCRYFPYARCSAY------KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI 167
           + C    C       C+A          C++ + Y  G   S  + ++   F    +   
Sbjct: 161 LSCFDPMCEL-----CTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGG 215

Query: 168 H--VQDVVFGCGFSTN----RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGS------- 213
           +   +DV FGC    +    R +  +GI  LGIG+ S V+QL    FSYCI +       
Sbjct: 216 YQLERDVAFGCAHVEDSKAVRGYS-TGILALGIGKPSFVTQLGVDRFSYCIPASEITDDD 274

Query: 214 ---LHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVD--GRMLDINP-NI 267
                D +   + L  G  A +  G   P +     Y + L+++     GR+    P  +
Sbjct: 275 DDDDDDEERSASFLRFGSHARM-TGKRAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPV 333

Query: 268 FKRDDSGGGVM---IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIM 324
           +   +     M   +DSGT + WL    +  L+  +   +   +    + P   CY G M
Sbjct: 334 YVAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDLTHPSLYCYLGNM 393

Query: 325 SSDLKGFPTVRFHFRGGAKLALEKDSMFYQPR---PDAFCMAVNPASINNRYVNFSLIGM 381
            +D++   +V   F GGA L L   S+F+       D  C+AV          N +++G+
Sbjct: 394 -TDVEAV-SVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAG-------NRAILGV 444

Query: 382 MAQQFYNVGYDIGRKQKTFQRMDCE 406
             Q+  NVGYD+   +  F R  C+
Sbjct: 445 YPQRNINVGYDLSTMEIAFDRDQCD 469


>gi|226509408|ref|NP_001141440.1| uncharacterized protein LOC100273550 precursor [Zea mays]
 gi|194704586|gb|ACF86377.1| unknown [Zea mays]
 gi|413938617|gb|AFW73168.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 478

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 108/370 (29%), Positives = 152/370 (41%), Gaps = 51/370 (13%)

Query: 55  DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR---DCSPQLGTIFYPSRSSSY 111
           DI    + V  S+G P V Q   +DTGS L WV C PC     C  Q   +F P++SSSY
Sbjct: 134 DIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSY 193

Query: 112 AYVPCDSEHCRYFP-YARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
           A VPC    C     YA  +    +C Y   Y  G  T+   S++ LT   +      VQ
Sbjct: 194 AAVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASSA----VQ 249

Query: 171 DVVFGCGFSTNRNFK-FSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLI 225
              FGCG + +  F    G+ GLG  + SLV Q   +    FSYC   L         L 
Sbjct: 250 GFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYC---LPTKPSTAGYLT 306

Query: 226 LGDGAIIDEGDA------TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
           LG G               P      +Y + L  ISV G+ L +  + F         ++
Sbjct: 307 LGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGG-----TVV 361

Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS--DLKGF-----P 332
           D+GT VT L   AY ALR           M SY +P     +GI+ +  +  G+     P
Sbjct: 362 DTGTVVTRLPPTAYAALRSAFR-----SGMASYGYPTAPS-NGILDTCYNFAGYGTVTLP 415

Query: 333 TVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNV--- 389
            V   F  GA + L  D +         C+A  P+  +      +++G + Q+ + V   
Sbjct: 416 NVALTFGSGATVTLGADGIL-----SFGCLAFAPSGSDG---GMAILGNVQQRSFEVRID 467

Query: 390 GYDIGRKQKT 399
           G  +G K  +
Sbjct: 468 GTSVGFKPSS 477


>gi|32489096|emb|CAE03928.1| OSJNba0093F12.2 [Oryza sativa Japonica Group]
 gi|58532027|emb|CAD41565.3| OSJNBa0006A01.20 [Oryza sativa Japonica Group]
          Length = 489

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 99/385 (25%), Positives = 160/385 (41%), Gaps = 52/385 (13%)

Query: 58  NSLFYVNISIGQPP---VPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFY-PSRSSSYAY 113
            S + V + IG P     P++   DTGS L W  C PC +CS       + PS+S ++  
Sbjct: 119 GSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRR 178

Query: 114 VPCDSEHCRYFPYARCSAY------KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI 167
           + C    C       C+A          C++ + Y  G   S  + ++   F    +   
Sbjct: 179 LSCFDPMCEL-----CTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGG 233

Query: 168 H--VQDVVFGCGFSTN----RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGS------- 213
           +   +DV FGC    +    R +  +GI  LGIG+ S V+QL    FSYCI +       
Sbjct: 234 YQLERDVAFGCAHVEDSKAVRGYS-TGILALGIGKPSFVTQLGVDRFSYCIPASEITDDD 292

Query: 214 ---LHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVD--GRMLDINP-NI 267
                D +   + L  G  A +  G   P +     Y + L+++     GR+    P  +
Sbjct: 293 DDDDDDEERSASFLRFGSHARM-TGKRAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPV 351

Query: 268 FKRDDSGGGVM---IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIM 324
           +   +     M   +DSGT + WL    +  L+  +   +   +    + P   CY G M
Sbjct: 352 YVAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDLTHPSLYCYLGNM 411

Query: 325 SSDLKGFPTVRFHFRGGAKLALEKDSMFYQPR---PDAFCMAVNPASINNRYVNFSLIGM 381
            +D++   +V   F GGA L L   S+F+       D  C+AV          N +++G+
Sbjct: 412 -TDVEAV-SVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAG-------NRAILGV 462

Query: 382 MAQQFYNVGYDIGRKQKTFQRMDCE 406
             Q+  NVGYD+   +  F R  C+
Sbjct: 463 YPQRNINVGYDLSTMEIAFDRDQCD 487


>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 461

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 80/258 (31%), Positives = 113/258 (43%), Gaps = 22/258 (8%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
           N    V++++G PP      +DTGS L W+ C P    +      F P  SS++A VPC 
Sbjct: 82  NVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCAPAGARNKFSAMSFRPRASSTFAAVPCA 141

Query: 118 SEHCRY----FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
           S  CR      P A C     RC  +  Y  G  +   ++T+     +            
Sbjct: 142 SAQCRSRDLPSPPA-CDGASSRCSVSLSYADGSSSDGALATDVFAVGSGPP-----LRAA 195

Query: 174 FGC---GF-STNRNFKFSGIFGLGIGRSSLVSQLNS-SFSYCIGSLHDPDYL---HNKL- 224
           FGC    F S+      +G+ G+  G  S VSQ ++  FSYCI    D   L   H+ L 
Sbjct: 196 FGCMSSAFDSSPDGVASAGLLGMNRGALSFVSQASTRRFSYCISDRDDAGVLLLGHSDLP 255

Query: 225 -ILGDGAIIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMIDS 281
             L          A PL + D   Y + L  I V G+ L I  ++   D +G G  M+DS
Sbjct: 256 TFLPLNYTPMYQPALPLPYFDRVAYSVQLLGIRVGGKHLPIPASVLAPDHTGAGQTMVDS 315

Query: 282 GTDVTWLVKEAYEALRDE 299
           GT  T+L+ +AY AL+ E
Sbjct: 316 GTQFTFLLGDAYSALKAE 333


>gi|90399145|emb|CAJ86169.1| H0913C04.10 [Oryza sativa Indica Group]
 gi|125550292|gb|EAY96114.1| hypothetical protein OsI_17992 [Oryza sativa Indica Group]
          Length = 491

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 101/408 (24%), Positives = 167/408 (40%), Gaps = 72/408 (17%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHC---YPCRDCSPQLGT----IFYPSRSSSYAY 113
           +   +S+G PP P    +DTGS L WV C   Y CR+CS         +F+P  SSS   
Sbjct: 89  YAFTVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNCSSLSAASPLHVFHPKNSSSSRL 148

Query: 114 VPCDSEHCRYF---------------PYARCSAYKHRC-----IYTQLYLIGPETSVFVS 153
           + C +  C +                P A C+            Y  +Y  G    + +S
Sbjct: 149 IGCRNPSCLWIHSPDHLSDCRAASSCPGANCTPRNANANNVCPPYLVVYGSGSTAGLLIS 208

Query: 154 TEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIG 212
               T +    +   V++ V GC  ++      SG+ G G G  S+ SQL  + FSYC+ 
Sbjct: 209 D---TLRTPGRA---VRNFVIGCSLASVHQPP-SGLAGFGRGAPSVPSQLGLTKFSYCLL 261

Query: 213 S--LHDPDYLHNKLILGD------------GAIIDEGDATPLQFIDGHYYITLEAISVDG 258
           S    D   +  +LILG               +     A P   +  +YY+ L AI+V G
Sbjct: 262 SRRFDDNAAVSGELILGGAGGKDGGVGMQYAPLARSASARPPYSV--YYYLALTAITVGG 319

Query: 259 RMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL 318
           + + +    F    +GGG ++DSGT  ++  +  +E +   V+  + G   RS    + L
Sbjct: 320 KSVQLPERAFVAGGAGGGAIVDSGTTFSYFDRTVFEPVAAAVVAAVGGRYSRSKVVEEGL 379

Query: 319 ----CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP----------DAFCMAV 364
               C+     +     P +  HF+GG+ + L  ++ F    P          +A C+AV
Sbjct: 380 GLSPCFAMPPGTKTMELPEMSLHFKGGSVMNLPVENYFVVAGPAPSGGAPAMAEAICLAV 439

Query: 365 -------NPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
                  +  +  +      ++G   QQ Y + YD+ +++  F+R  C
Sbjct: 440 VSDVPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLEKERLGFRRQQC 487


>gi|413938607|gb|AFW73158.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 478

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 108/370 (29%), Positives = 152/370 (41%), Gaps = 51/370 (13%)

Query: 55  DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR---DCSPQLGTIFYPSRSSSY 111
           DI    + V  S+G P V Q   +DTGS L WV C PC     C  Q   +F P++SSSY
Sbjct: 134 DIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFDPAQSSSY 193

Query: 112 AYVPCDSEHCRYFP-YARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
           A VPC    C     YA  +    +C Y   Y  G  T+   S++ LT   +      VQ
Sbjct: 194 AAVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASSA----VQ 249

Query: 171 DVVFGCGFSTNRNFK-FSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLI 225
              FGCG + +  F    G+ GLG  + SLV Q   +    FSYC   L         L 
Sbjct: 250 GFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYC---LPTKPSTAGYLT 306

Query: 226 LGDGAIIDEGDA------TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
           LG G               P      +Y + L  ISV G+ L +  + F         ++
Sbjct: 307 LGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGG-----TVV 361

Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS--DLKGF-----P 332
           D+GT VT L   AY ALR           M SY +P     +GI+ +  +  G+     P
Sbjct: 362 DTGTVVTRLPPTAYAALRSAFR-----SGMASYGYPTAPS-NGILDTCYNFAGYGTVTLP 415

Query: 333 TVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNV--- 389
            V   F  GA + L  D +         C+A  P+  +      +++G + Q+ + V   
Sbjct: 416 NVALTFGSGATVTLGADGIL-----SFGCLAFAPSGSDG---GMAILGNVQQRSFEVRID 467

Query: 390 GYDIGRKQKT 399
           G  +G K  +
Sbjct: 468 GTSVGFKPSS 477


>gi|125575541|gb|EAZ16825.1| hypothetical protein OsJ_32297 [Oryza sativa Japonica Group]
          Length = 416

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 94/366 (25%), Positives = 156/366 (42%), Gaps = 54/366 (14%)

Query: 63  VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR 122
            N +IG PP P    +D           PC           +P+ SS++   PC ++ C+
Sbjct: 69  ANFTIGTPPQPASAIIDVAGP------APCS----------FPNASSTFRPEPCGTDACK 112

Query: 123 YFPYARCSAYKHRCIY--TQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST 180
             P + CS+  + C Y  T    +G  T   V+T+         S      + FGC  ++
Sbjct: 113 SIPTSNCSS--NMCTYEGTINSKLGGHTLGIVATDTFAIGTATAS------LGFGCVVAS 164

Query: 181 NRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGA-IIDEGD 236
             +     SG+ GLG   SSLVSQ+N + FSYC+ + HD    +++L+LG  A +   G+
Sbjct: 165 GIDTMGGPSGLIGLGRAPSSLVSQMNITKFSYCL-TPHDSGK-NSRLLLGSSAKLAGGGN 222

Query: 237 ATPLQFIDG--------HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
           +T   F+          +Y I L+ I      + + P       SG  V++ +   +++L
Sbjct: 223 STTTPFVKTSPGDDMSQYYPIQLDGIKAGDAAIALPP-------SGNTVLVQTLAPMSFL 275

Query: 289 VKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHF-RGGAKLALE 347
           V  AY+AL+ EV   +      +   P  LC+     S+    P + F F +G A L + 
Sbjct: 276 VDSAYQALKKEVTKAVGAAPTATPLQPFDLCFPKAGLSNASA-PDLVFTFQQGAAALTVP 334

Query: 348 KDSMFYQ--PRPDAFCMAVNPASINNRYV---NFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
                          CMA+   S  N      N +++G + Q+  +   D+ +K  +F+ 
Sbjct: 335 PPKYLIDVGEEKGTVCMAILSTSWLNTTALDENLNILGSLQQENTHFLLDLEKKTLSFEP 394

Query: 403 MDCEVL 408
            DC  L
Sbjct: 395 ADCAHL 400


>gi|356539555|ref|XP_003538263.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 438

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 95/363 (26%), Positives = 141/363 (38%), Gaps = 31/363 (8%)

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
           I +  + V   IG PP     A+DT +   W+ C  C  C+    T+F P +S+++  V 
Sbjct: 92  IQSPTYIVRAKIGTPPQTLLLAIDTSNDAAWIPCTACDGCT---STLFAPEKSTTFKNVS 148

Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
           C S  C   P   C      C +   Y     +S+  +  Q T      +T  +    FG
Sbjct: 149 CGSPECNKVPSPSCGT--SACTFNLTY---GSSSIAANVVQDTVT---LATDPIPGYTFG 200

Query: 176 C-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGA 230
           C     G ST          G     S   +   S+FSYC+ S    ++    L LG  A
Sbjct: 201 CVAKTTGPSTPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNF-SGSLRLGPVA 259

Query: 231 IIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVT 286
                  TPL         YY+ L AI V  +++DI P        +G G + DSGT  T
Sbjct: 260 QPIRIKYTPLLKNPRRSSLYYVNLFAIRVGRKIVDIPPAALAFNAATGAGTVFDSGTVFT 319

Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPD----KLCYHGIMSSDLKGFPTVRFHFRGGA 342
            LV   Y A+RDE   R+      + +         CY   + +     PT+ F F G  
Sbjct: 320 RLVAPVYTAVRDEFRRRVAMAAKANLTVTSLGGFDTCYTVPIVA-----PTITFMFSGMN 374

Query: 343 KLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
               + + + +       C+A+  A  N   V  ++I  M QQ + V YD+   +    R
Sbjct: 375 VTLPQDNILIHSTAGSTSCLAMASAPDNVNSV-LNVIANMQQQNHRVLYDVPNSRLGVAR 433

Query: 403 MDC 405
             C
Sbjct: 434 ELC 436


>gi|222629462|gb|EEE61594.1| hypothetical protein OsJ_16002 [Oryza sativa Japonica Group]
          Length = 468

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 99/385 (25%), Positives = 160/385 (41%), Gaps = 52/385 (13%)

Query: 58  NSLFYVNISIGQPP---VPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFY-PSRSSSYAY 113
            S + V + IG P     P++   DTGS L W  C PC +CS       + PS+S ++  
Sbjct: 98  GSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRR 157

Query: 114 VPCDSEHCRYFPYARCSAY------KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTI 167
           + C    C       C+A          C++ + Y  G   S  + ++   F    +   
Sbjct: 158 LSCFDPMCEL-----CTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGG 212

Query: 168 H--VQDVVFGCGFSTN----RNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGS------- 213
           +   +DV FGC    +    R +  +GI  LGIG+ S V+QL    FSYCI +       
Sbjct: 213 YQLERDVAFGCAHVEDSKAVRGYS-TGILALGIGKPSFVTQLGVDRFSYCIPASEITDDD 271

Query: 214 ---LHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVD--GRMLDINP-NI 267
                D +   + L  G  A +  G   P +     Y + L+++     GR+    P  +
Sbjct: 272 DDDDDDEERSASFLRFGSHARM-TGKRAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPV 330

Query: 268 FKRDDSGGGVM---IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIM 324
           +   +     M   +DSGT + WL    +  L+  +   +   +    + P   CY G M
Sbjct: 331 YVAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDLTHPSLYCYLGNM 390

Query: 325 SSDLKGFPTVRFHFRGGAKLALEKDSMFYQPR---PDAFCMAVNPASINNRYVNFSLIGM 381
            +D++   +V   F GGA L L   S+F+       D  C+AV          N +++G+
Sbjct: 391 -TDVEAV-SVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAG-------NRAILGV 441

Query: 382 MAQQFYNVGYDIGRKQKTFQRMDCE 406
             Q+  NVGYD+   +  F R  C+
Sbjct: 442 YPQRNINVGYDLSTMEIAFDRDQCD 466


>gi|222642011|gb|EEE70143.1| hypothetical protein OsJ_30189 [Oryza sativa Japonica Group]
          Length = 671

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 94/335 (28%), Positives = 144/335 (42%), Gaps = 55/335 (16%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC----SPQLGT----IFYPSRSSSY 111
           L Y  +++G P V    A+DTGS L WV C  C  C    SP  G+    ++ P++S++ 
Sbjct: 34  LHYAVVALGTPNVTFLVALDTGSDLFWVPC-DCLKCAPFQSPNYGSLKFDVYSPAQSTTS 92

Query: 112 AYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD--ESTIHV 169
             VPC S  C       C +  + C Y+  YL    +S  V  E + +  +D  +S I  
Sbjct: 93  RKVPCSSNLCDL--QNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKIVT 150

Query: 170 QDVVFGCGFSTNRNFKFS----GIFGLGIGRSSLVSQLNS------SFSYCIGSLHDPDY 219
             ++FGCG     +F  S    G+ GLG+   S+ S L S      SFS C G     D 
Sbjct: 151 APIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFG-----DD 205

Query: 220 LHNKLILGDGAIIDEGDATPLQFI--DGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
            H ++  GD    D+ + TPL     + +Y IT+  I+V  + +      F         
Sbjct: 206 GHGRINFGDTGSSDQKE-TPLNVYKQNPYYNITITGITVGSKSISTE---FS-------A 254

Query: 278 MIDSGTDVTWLVKEAYEALRD--EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
           ++DSGT  T L    Y  +    +  IR     M   S P + CY   +S++    P V 
Sbjct: 255 IVDSGTSFTALSDPMYTQITSSFDAQIR-SSRNMLDSSMPFEFCYS--VSANGIVHPNVS 311

Query: 336 FHFRGGAKLALE------KDSMFYQPRPDAFCMAV 364
              +GG+   +        D+ F    P  +C+A+
Sbjct: 312 LTAKGGSIFPVNDPIITITDNAF---NPVGYCLAI 343


>gi|108707839|gb|ABF95634.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 330

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 84/302 (27%), Positives = 132/302 (43%), Gaps = 43/302 (14%)

Query: 116 CDSEHCRYFPYARCSAYK----HRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
           CDS  C+    A C   K      C+YT  Y     T+  +  ++ TF     +   V  
Sbjct: 38  CDSTLCQGLLVASCGNTKFWPNQTCVYTYYYNDKSVTTGLIEVDKFTFG----AGASVPG 93

Query: 172 VVFGCGFSTNRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLIL-- 226
           V FGCG   N  FK   +GI G G G  SL SQL   +FS+C  +++    L    +L  
Sbjct: 94  VAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVNG---LKQSTVLLD 150

Query: 227 --------GDGAIIDEGDATPLQFIDGH---YYITLEAISVDGRMLDINPNIFKRDDSGG 275
                   G GA+     +TPL     +   YY++L+ I+V    L +  + F   +  G
Sbjct: 151 LPADLYKNGRGAV----QSTPLIQNSANPTFYYLSLKGITVGSTRLPVPESAFALTNGTG 206

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVR 335
           G +IDSGT +T L  + Y+ +RDE   +++   +   +     C+    S      P + 
Sbjct: 207 GTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGPYTCFSA-PSQAKPDVPKLV 265

Query: 336 FHFRGGAKLALEKDSMFYQPRPDA----FCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
            HF  GA + L +++  ++   DA     C+A+N           ++IG   QQ  +V Y
Sbjct: 266 LHFE-GATMDLPRENYVFEVPDDAGNSIICLAINKGD------ETTIIGNFQQQNMHVLY 318

Query: 392 DI 393
           D+
Sbjct: 319 DL 320


>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
 gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
 gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
          Length = 442

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 81/259 (31%), Positives = 114/259 (44%), Gaps = 23/259 (8%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTI--FYPSRSSSYAYVP 115
           N    V++++G PP      +DTGS L W+ C P         +   F P  S ++A VP
Sbjct: 63  NVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASVP 122

Query: 116 CDSEHCRY----FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
           CDS  CR      P A C     +C  +  Y  G  +   ++TE  T             
Sbjct: 123 CDSAQCRSRDLPSPPA-CDGASKQCRVSLSYADGSSSDGALATEVFTVGQGPP-----LR 176

Query: 172 VVFGC---GFSTNRN-FKFSGIFGLGIGRSSLVSQLNS-SFSYCIGSLHDPDYL---HNK 223
             FGC    F T+ +    +G+ G+  G  S VSQ ++  FSYCI    D   L   H+ 
Sbjct: 177 AAFGCMATAFDTSPDGVATAGLLGMNRGALSFVSQASTRRFSYCISDRDDAGVLLLGHSD 236

Query: 224 L-ILGDGAIIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMID 280
           L  L          A PL + D   Y + L  I V G+ L I  ++   D +G G  M+D
Sbjct: 237 LPFLPLNYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTMVD 296

Query: 281 SGTDVTWLVKEAYEALRDE 299
           SGT  T+L+ +AY AL+ E
Sbjct: 297 SGTQFTFLLGDAYSALKAE 315


>gi|388498308|gb|AFK37220.1| unknown [Lotus japonicus]
          Length = 363

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 70/192 (36%), Positives = 94/192 (48%), Gaps = 25/192 (13%)

Query: 36  LQEKIRSHNTYQAQI---LPSNDISNSLFY-VNISIGQPPVPQFTAMDTGSSLLWVHCYP 91
           L++ + SH+   +QI   L S     +L Y V + +G   +     +DTGS L WV C P
Sbjct: 116 LRKMVSSHSVEVSQIQIPLASGVNFQTLNYIVTMELGGQDMT--VIIDTGSDLTWVQCEP 173

Query: 92  CRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPY-----ARCSAYKHRCIYTQLYLIGP 146
           C  C  Q G +F PS SSSY  +PC+S  C+           C +    C Y   Y  G 
Sbjct: 174 CMSCYNQQGPVFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACESNPSNCSYAVNYGDGS 233

Query: 147 ETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGL-GIGRS--SLVSQL 203
            T+  +  E L+F       I V + VFGCG   N    F G+ GL G+GRS  SL+SQ 
Sbjct: 234 YTNGELGAEHLSFGG-----ISVSNFVFGCG--KNNKGLFGGVSGLMGLGRSNLSLISQT 286

Query: 204 NSS----FSYCI 211
           NS+    FSYC+
Sbjct: 287 NSTFGGVFSYCL 298


>gi|326515330|dbj|BAK03578.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 445

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 108/384 (28%), Positives = 155/384 (40%), Gaps = 64/384 (16%)

Query: 63  VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR 122
           V I  G+     F  +DT SSL W+ C  C     Q   +F PS SSSY  +   S  CR
Sbjct: 78  VTIGTGRGKSTYFLVLDTASSLPWMRCAHCLPVQRQRSPVFDPSDSSSYRPLHPTSPLCR 137

Query: 123 YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFST-- 180
             P     A   +C +   +L G E   +V T+ +   N    T+ +  V FGC  ST  
Sbjct: 138 A-PNPVLPA-GDKCSF---HLPG-EAHGYVGTDTIILGN---PTLPIHSVAFGCAQSTEG 188

Query: 181 -NRNFKFSGIFGLGIGRSSLVSQLN----SSFSYC-IGSLHDPD---------------- 218
            +    F+G  G+G   +SL+ Q+     S FSYC IG  H P                 
Sbjct: 189 FDTKGTFAGTLGMGKLPTSLIMQIKDRVGSRFSYCLIGLGHSPGRNGFIRFGADIPDPTL 248

Query: 219 YLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRML-DINPNIF-KRDDSGGG 276
            +H+++      I+      P    D  YY+ L  IS++G  +  I   +F +R D  GG
Sbjct: 249 LVHHRI-----KILPTPPHLPHGVADSAYYVKLLGISLNGTPIPGIRQAMFERRSDGSGG 303

Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD-KLCYH---GIMSSDLKGFP 332
             +D+GT VT LV  AY  + + V   ++    +    P+  LC+    GI S      P
Sbjct: 304 CFVDAGTQVTHLVPAAYAVVEEAVAHMVQQWGYKRVRDPNFSLCFREHPGIWSH----IP 359

Query: 333 TVRFHFRGGAK-----LALEKDSMFY----QPRPDAFCMAVNPASINNRYVNFSLIGMMA 383
            +   F G A      L +   ++F     QP     C  V   S  +  V    +G M 
Sbjct: 360 KLTLDFEGPASRTVAHLEIVSRNLFLKVDNQP---LVCFGVYRTSRGSPTV----VGAMQ 412

Query: 384 QQFYNVGYDIGRKQKTFQRMDCEV 407
           Q      +D+     TF R  CE 
Sbjct: 413 QVDTRFIFDLHANTITFHRESCEA 436


>gi|224091907|ref|XP_002309394.1| predicted protein [Populus trichocarpa]
 gi|222855370|gb|EEE92917.1| predicted protein [Populus trichocarpa]
          Length = 469

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 104/436 (23%), Positives = 181/436 (41%), Gaps = 62/436 (14%)

Query: 15  NENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQ 74
           ++NP   +    ++S++R  +++      +  +  + P    S   + ++++ G PP   
Sbjct: 49  SKNPWGALNHLASLSLSRAHHIKSPKTKFSLLKTPLFPR---SYGGYSISLNFGTPPQTT 105

Query: 75  FTAMDTGSSLLWVHC---YPCRDCS-PQLGT----IFYPSRSSSYAYVPCDSEHCRYF-- 124
              MDTGSSL+W  C   Y C  C  P +       F P +SSS   + C +  C +   
Sbjct: 106 KFVMDTGSSLVWFPCTSRYLCSRCDFPNIEVTGIPTFIPKQSSSSNLIGCKNHKCSWLFG 165

Query: 125 PYAR-----CSAYKHRCI-----YTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVF 174
           P  +     C      C      Y   Y +G    + +S E L F +       +   + 
Sbjct: 166 PKVQSKCQECDPTTQNCTQSCPPYVIQYGLGSTAGLLLS-ETLDFPHKKT----IPGFLV 220

Query: 175 GCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGS-LHDPDYLHNKLILGDGAII 232
           GC   + R  +  GI G G    SL SQL    FSYC+ S   D     + L+L  G+  
Sbjct: 221 GCSLFSIRQPE--GIAGFGRSPESLPSQLGLKKFSYCLVSHAFDDTPASSDLVLDTGSGS 278

Query: 233 DEGDA-----TPLQ------FIDGHYYITLEAISVDGRMLDINPN-IFKRDDSGGGVMID 280
           D+        TP Q      F D +YY+ L  I +    + +    +    D  GG ++D
Sbjct: 279 DDTKTPGLSYTPFQKNPTAAFRD-YYYVLLRNIVIGDTHVKVPYKFLVPGSDGNGGTIVD 337

Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL--------CYHGIMSSDLKGFP 332
           SGT  T++ K  YE +  E       +Q+  Y+   ++        C++ I        P
Sbjct: 338 SGTTFTFMEKPVYELVAKEFE-----KQVAHYTVATEVQNQTGLRPCFN-ISGEKSVSVP 391

Query: 333 TVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFS---LIGMMAQQFYNV 389
              FHF+GGAK+AL   + F        C+ +   +++   +      ++G   Q+ ++V
Sbjct: 392 EFIFHFKGGAKMALPLANYFSFVDSGVICLTIVSDNMSGSGIGGGPAIILGNYQQRNFHV 451

Query: 390 GYDIGRKQKTFQRMDC 405
            +D+  ++  F++ +C
Sbjct: 452 EFDLKNERFGFKQQNC 467


>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
           vinifera]
          Length = 358

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 75/278 (26%), Positives = 123/278 (44%), Gaps = 28/278 (10%)

Query: 37  QEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRD-C 95
           ++ IR   +    + P   I +  +YV +  G P       +DTGSSL W+ C PC   C
Sbjct: 94  KKDIRFPKSVSVPLNPGASIGSGNYYVKVGFGSPARYYSMIVDTGSSLSWLQCKPCVVYC 153

Query: 96  SPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSV 150
             Q   +F PS S +Y  + C S  C     A      C    + C+YT  Y     +  
Sbjct: 154 HVQADPLFDPSASKTYKSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMG 213

Query: 151 FVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQLNS---- 205
           ++S + LT   +      +   V+GCG  ++  F + +GI GLG  + S++ Q++S    
Sbjct: 214 YLSQDLLTLAPSQT----LPGFVYGCGQDSDGLFGRAAGILGLGRNKLSMLGQVSSKFGY 269

Query: 206 SFSYCIGSLHDPDYLHNKLILGDGAIIDEG-DATPLQFIDGH---YYITLEAISVDGRML 261
           +FSYC+ +     +L     +G  ++       TP+    G+   Y++ L AI+V GR L
Sbjct: 270 AFSYCLPTRGGGGFLS----IGKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRAL 325

Query: 262 DINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDE 299
            +    ++        +IDSGT +T L    Y   +  
Sbjct: 326 GVAAAQYRVP-----TIIDSGTVITRLPMSVYTPFQQA 358


>gi|125543284|gb|EAY89423.1| hypothetical protein OsI_10930 [Oryza sativa Indica Group]
          Length = 447

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 104/397 (26%), Positives = 160/397 (40%), Gaps = 57/397 (14%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
           N    V +++G PP      +DTGS L W+ C      +P L   F  S SSSY  VPC 
Sbjct: 52  NVSLTVPVAVGTPPQNVTMVLDTGSELSWLLCN--GSYAPPLTPAFNASGSSSYGAVPCP 109

Query: 118 SEHCRY------FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
           S  C +       P    +   + C  +  Y         ++T+  TF  T  +      
Sbjct: 110 STACEWRGRDLPVPPFCDTPPSNACRVSLSYADASSADGVLATD--TFLLTGGAPPVAVG 167

Query: 172 VVFGC-------------GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS-FSYCIGSLHDP 217
             FGC             G  T+ +   +G+ G+  G  S V+Q  +  F+YCI     P
Sbjct: 168 AYFGCITSYSSTTATNSNGTGTDVSEAATGLLGMNRGTLSFVTQTGTRRFAYCIAPGEGP 227

Query: 218 DYLHNKLILGDGAIIDEGDATPLQFIDG--------HYYITLEAISVDGRMLDINPNIFK 269
             L   L+  DG +    + TPL  I           Y + LE I V   +L I  ++  
Sbjct: 228 GVL---LLGDDGGVAPPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVGCALLPIPKSVLT 284

Query: 270 RDDSGGG-VMIDSGTDVTWLVKEAYEALRDE------VMIRLEGEQMRSYSWPDKLCYHG 322
            D +G G  M+DSGT  T+L+ +AY AL+ E      +++   GE    +      C+ G
Sbjct: 285 PDHTGAGQTMVDSGTQFTFLLADAYAALKAEFTSQARLLLAPLGEPGFVFQGAFDACFRG 344

Query: 323 -----IMSSDLKGFPTVRFHFRGGAKLALEKDSMFY----QPRPDAFCMAVNPASINNR- 372
                  +S L   P V    R GA++A+  + + Y    + R +    AV   +  N  
Sbjct: 345 PEARVAAASGL--LPVVGLVLR-GAEVAVSGEKLLYMVPGERRGEGGAEAVWCLTFGNSD 401

Query: 373 --YVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
              ++  +IG   QQ   V YD+   +  F    C++
Sbjct: 402 MAGMSAYVIGHHHQQNVWVEYDLQNGRVGFAPARCDL 438


>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
 gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
          Length = 492

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 104/406 (25%), Positives = 168/406 (41%), Gaps = 62/406 (15%)

Query: 36  LQEKIRSHNTYQAQILPS------NDISNSLFYVN-ISIGQPPVPQFTAMDTGSSLLWVH 88
           L+    SH     ++L S      +D+    +Y + + IG PP      +DTGS++ +V 
Sbjct: 3   LELVANSHRRRDRELLGSARMDLHDDLLTKGYYTSRVKIGTPPHEFSLIVDTGSTVTYVP 62

Query: 89  CYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPET 148
           C  C  C       F P+ SSSY  + C SE    F    C   +    Y + Y     +
Sbjct: 63  CSSCTHCGNHQDPRFSPALSSSYKPLECGSECSTGF----CDGSRK---YQRQYAEKSTS 115

Query: 149 SVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF---KFSGIFGLGIGRSSLVSQL-- 203
           S  +  + + F N+  S +  Q +VFGC  +   +       GI GLG G  S++ QL  
Sbjct: 116 SGVLGKDVIGFSNS--SDLGGQRLVFGCETAETGDLYDQTADGIIGLGRGPLSIIDQLVE 173

Query: 204 ----NSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFI----DGH----YYITL 251
                  FS C G + +          G GA+I  G   P   +    D H    Y + L
Sbjct: 174 KNAMEDVFSLCYGGMDE----------GGGAMILGGFQPPKDMVFTASDPHRSPYYNLML 223

Query: 252 EAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS 311
           + I V G  L + P +F   D   G ++DSGT   +    A++A +  V  ++    ++ 
Sbjct: 224 KGIRVGGSPLRLKPEVF---DGKYGTVLDSGTTYAYFPGAAFQAFKSAVKEQV--GSLKE 278

Query: 312 YSWPDK----LCYHGI---MSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP--DAFCM 362
              PD+    +CY G    +S+  + FP+V F F  G  + L  ++  ++      A+C+
Sbjct: 279 VPGPDEKFKDICYAGAGTNVSNLSQFFPSVDFVFGDGQSVTLSPENYLFRHTKISGAYCL 338

Query: 363 AVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEVL 408
            V      N      L G++ +    V Y+ G+    F +  C  L
Sbjct: 339 GV----FENGDPTTLLGGIIVRNML-VTYNRGKASIGFLKTKCNDL 379


>gi|413938615|gb|AFW73166.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 386

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 108/370 (29%), Positives = 152/370 (41%), Gaps = 51/370 (13%)

Query: 55  DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCR---DCSPQLGTIFYPSRSSSY 111
           DI    + V  S+G P V Q   +DTGS L WV C PC     C  Q   +F P++SSSY
Sbjct: 42  DIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSY 101

Query: 112 AYVPCDSEHCRYFP-YARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
           A VPC    C     YA  +    +C Y   Y  G  T+   S++ LT   +      VQ
Sbjct: 102 AAVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASSA----VQ 157

Query: 171 DVVFGCGFSTNRNFK-FSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLI 225
              FGCG + +  F    G+ GLG  + SLV Q   +    FSYC   L         L 
Sbjct: 158 GFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYC---LPTKPSTAGYLT 214

Query: 226 LGDGAIIDEGDA------TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
           LG G               P      +Y + L  ISV G+ L +  + F         ++
Sbjct: 215 LGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGT-----VV 269

Query: 280 DSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS--DLKGF-----P 332
           D+GT VT L   AY ALR           M SY +P     +GI+ +  +  G+     P
Sbjct: 270 DTGTVVTRLPPTAYAALRSAFR-----SGMASYGYPTAPS-NGILDTCYNFAGYGTVTLP 323

Query: 333 TVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNV--- 389
            V   F  GA + L  D +         C+A  P+  +      +++G + Q+ + V   
Sbjct: 324 NVALTFGSGATVTLGADGIL-----SFGCLAFAPSGSDG---GMAILGNVQQRSFEVRID 375

Query: 390 GYDIGRKQKT 399
           G  +G K  +
Sbjct: 376 GTSVGFKPSS 385


>gi|357507805|ref|XP_003624191.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355499206|gb|AES80409.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 406

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 98/353 (27%), Positives = 148/353 (41%), Gaps = 49/353 (13%)

Query: 84  LLWVHCYPCRDCSPQLG---TIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKH--RCIY 138
           LL + C  C   S  LG   T++ P+ S +   VPC    C        S  K    C Y
Sbjct: 28  LLQLGCTACPKKS-GLGMDLTLYDPNGSKTSNAVPCGDGFCTDTYSGPISGCKQDMSCPY 86

Query: 139 TQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ----DVVFGCG------FSTNRNFKFSG 188
           +  Y  G  TS     + LTF       +H +     V+FGCG       S+N +    G
Sbjct: 87  SITYGDGSTTSGSFVNDSLTFDEV-SGNLHTKPDNSSVIFGCGAKQSGSLSSNSDEALDG 145

Query: 189 IFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYLHNKLILGDGAIID-EGDATPLQ 241
           I G G   SS++SQL +S      FS+C+      D  H   I   G +++ + + TPL 
Sbjct: 146 IIGFGQANSSVLSQLAASGKVKRIFSHCL------DSHHGGGIFSIGQVMEPKFNTTPLV 199

Query: 242 FIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVM 301
               HY + L+ + VDG  + +   +F    SG G +IDSGT + +L    Y  L  +V+
Sbjct: 200 PRMAHYNVILKDMDVDGEPILLPLYLFDSG-SGRGTIIDSGTTLAYLPLSIYNQLLPKVL 258

Query: 302 IRLEG-------EQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQ 354
            R  G       +Q   + + DKL          +GFP V+FHF G +      D +F  
Sbjct: 259 GRQPGLKLMIVEDQFTCFHYSDKLD---------EGFPVVKFHFEGLSLTVHPHDYLFLY 309

Query: 355 PRPDAFCMAVNPASINNRY-VNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
            + D +C+    +S   +   +  LIG +      V YD+      +   +C 
Sbjct: 310 -KEDIYCIGWQKSSTQTKEGRDLILIGDLVLSNKLVVYDLENMVIGWTNFNCS 361


>gi|115452187|ref|NP_001049694.1| Os03g0271900 [Oryza sativa Japonica Group]
 gi|29893618|gb|AAP06872.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|108707424|gb|ABF95219.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|108707425|gb|ABF95220.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548165|dbj|BAF11608.1| Os03g0271900 [Oryza sativa Japonica Group]
 gi|215715205|dbj|BAG94956.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215737033|dbj|BAG95962.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215740994|dbj|BAG97489.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 447

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 104/397 (26%), Positives = 160/397 (40%), Gaps = 57/397 (14%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD 117
           N    V +++G PP      +DTGS L W+ C      +P L   F  S SSSY  VPC 
Sbjct: 52  NVSLTVPVAVGTPPQNVTMVLDTGSELSWLLCN--GSYAPPLTPAFNASGSSSYGAVPCP 109

Query: 118 SEHCRY------FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
           S  C +       P    +   + C  +  Y         ++T+  TF  T  +      
Sbjct: 110 STACEWRGRDLPVPPFCDTPPSNACRVSLSYADASSADGVLATD--TFLLTGGAPPVAVG 167

Query: 172 VVFGC-------------GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS-FSYCIGSLHDP 217
             FGC             G  T+ +   +G+ G+  G  S V+Q  +  F+YCI     P
Sbjct: 168 AYFGCITSYSSTTATNSNGTGTDVSEAATGLLGMNRGTLSFVTQTGTRRFAYCIAPGEGP 227

Query: 218 DYLHNKLILGDGAIIDEGDATPLQFIDG--------HYYITLEAISVDGRMLDINPNIFK 269
             L   L+  DG +    + TPL  I           Y + LE I V   +L I  ++  
Sbjct: 228 GVL---LLGDDGGVAPPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVGCALLPIPKSVLT 284

Query: 270 RDDSGGG-VMIDSGTDVTWLVKEAYEALRDE------VMIRLEGEQMRSYSWPDKLCYHG 322
            D +G G  M+DSGT  T+L+ +AY AL+ E      +++   GE    +      C+ G
Sbjct: 285 PDHTGAGQTMVDSGTQFTFLLADAYAALKAEFTSQARLLLAPLGEPGFVFQGAFDACFRG 344

Query: 323 -----IMSSDLKGFPTVRFHFRGGAKLALEKDSMFY----QPRPDAFCMAVNPASINNR- 372
                  +S L   P V    R GA++A+  + + Y    + R +    AV   +  N  
Sbjct: 345 PEARVAAASGL--LPEVGLVLR-GAEVAVSGEKLLYMVPGERRGEGGAEAVWCLTFGNSD 401

Query: 373 --YVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCEV 407
              ++  +IG   QQ   V YD+   +  F    C++
Sbjct: 402 MAGMSAYVIGHHHQQNVWVEYDLQNGRVGFAPARCDL 438


>gi|449467979|ref|XP_004151699.1| PREDICTED: probable aspartic protease At2g35615-like, partial
           [Cucumis sativus]
          Length = 209

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 58/186 (31%), Positives = 93/186 (50%), Gaps = 13/186 (6%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTY--QAQILPSNDISN 58
           L HRDS+LSP    + +   R+      S++R A L  +  ++     QA + P +    
Sbjct: 34  LFHRDSLLSPLEFSSLSHYDRLTNAFRRSLSRSATLLNRAATNGALDLQAPLTPGS---- 89

Query: 59  SLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
             + +++SIG PPV      DTGS L+W  C PC  C  Q   IF P +S+S+++VPC+S
Sbjct: 90  GEYLMSVSIGTPPVDYIGMADTGSDLMWAQCLPCLKCYKQSRPIFDPLKSTSFSHVPCNS 149

Query: 119 EHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF 178
           ++C+    + C A +  C Y+  Y  G +T    +   L F+     +  V+ V+ GCG 
Sbjct: 150 QNCKAIDDSHCGA-QGVCDYS--YTYGDQT---YTKGDLGFEKITIGSSSVKSVI-GCGH 202

Query: 179 STNRNF 184
            +   F
Sbjct: 203 ESGGGF 208


>gi|168063189|ref|XP_001783556.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664943|gb|EDQ51645.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 414

 Score = 88.6 bits (218), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 98/381 (25%), Positives = 157/381 (41%), Gaps = 47/381 (12%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC-YPCRDCSPQLGTIFYPSRSSSYAYVPC 116
           + L+Y+ + IG P    +  MDTGS L W+ C  PCR C+     ++ P R+     V C
Sbjct: 28  DGLYYMAMRIGNPAKLYYLDMDTGSDLTWLQCDAPCRSCAVGPHGLYDPKRAR---VVDC 84

Query: 117 DSEHCRYFPYA---RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
               C          CS    +C Y   Y+ G  T   +  + +T   T+ +    + V+
Sbjct: 85  RRPTCAQVQRGGQFTCSGDVRQCDYEVDYVDGSSTMGILVEDTITLVLTNGTRFQTRAVI 144

Query: 174 FGCGFSTNRNFKFS-----GIFGLGIGRSSLVSQL------NSSFSYCI-GSLHDPDYL- 220
            GCG+        +     G+ GL   + SL SQL      N+   +C+ G  +   YL 
Sbjct: 145 -GCGYDQQGTLAKAPAVTDGVIGLSSSKISLPSQLAAKGIANNVIGHCLAGGSNGGGYLF 203

Query: 221 -HNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
             + L+   G         PL  ++G Y   L +I   G +L++        D  GG M 
Sbjct: 204 FGDTLVPALGMTWTPMIGRPL--VEG-YQARLRSIKYGGEVLELEGTT----DDVGGAMF 256

Query: 280 DSGTDVTWLVKEAYEALRDEV--------MIRLEGEQMRSYSWPDKLCYHGIMSSDLKG- 330
           DSGT  T+LV  AY A+   V        + R++ +    + W     +  +  +D+   
Sbjct: 257 DSGTSFTYLVPNAYTAVLSAVVRQAQRSGLERIKTDTTLPFCWRGPSPFESV--ADVSAY 314

Query: 331 FPTVRFHFRG------GAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQ 384
           F TV   F G      G  L L  +           C+ V  AS+ +  V  +++G ++ 
Sbjct: 315 FKTVTLDFGGSTWWSSGKLLELSPEGYLIVSTQGNVCLGVLDASVASLEVT-NILGDISM 373

Query: 385 QFYNVGYDIGRKQKTFQRMDC 405
           + Y V YD  R+Q  + R +C
Sbjct: 374 RGYLVVYDNMREQIGWVRRNC 394


>gi|255563739|ref|XP_002522871.1| DNA binding protein, putative [Ricinus communis]
 gi|223537955|gb|EEF39569.1| DNA binding protein, putative [Ricinus communis]
          Length = 414

 Score = 88.2 bits (217), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 109/435 (25%), Positives = 170/435 (39%), Gaps = 89/435 (20%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHN---TYQAQIL-PSNDI 56
           LIHRDS  SP+       + R+ R +  S         KIR+HN    + ++   P    
Sbjct: 36  LIHRDSPESPFYPGKLTNSERISRLVEFS---------KIRAHNFDSGFSSEAFRPPVFQ 86

Query: 57  SNSLFYVNISIGQPPVPQFTAMDTGSSLLWV----HCYPCRDCSPQLGTIFYPSRSSSYA 112
             + + V + IG P +P +   DTGS+L+W     + + CR+                  
Sbjct: 87  DFTCYLVKVRIGNPGIPLYLVPDTGSALIWTVNNQNIFQCRN------------------ 128

Query: 113 YVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
                                ++C YT+ Y  G  T+   + + L  + ++    +    
Sbjct: 129 ---------------------NKCSYTRRYDDGSITTGVAAQDILQSEGSERIPFY---- 163

Query: 173 VFGCGFSTNRNF-------KFSGIFGLGIGRSSLVSQLN----SSFSYCIGSLHDPDYLH 221
            FGC    N+NF       K  G+ GL     SL+ QL+      FSYC+          
Sbjct: 164 -FGCS-RDNQNFSVFEHTGKSGGVMGLNTSPVSLLQQLSHITQRRFSYCLNPYQHGSEPP 221

Query: 222 NKLILGDGAIIDEG----DATPLQFIDG--HYYITLEAISVDGRMLDINPNIFK-RDDSG 274
              +L  G  I +G     +TPL       +Y++ L  ++V G+ L + P  F  R D  
Sbjct: 222 PSSLLRFGNDIRKGRRRFQSTPLMSSPDRPNYFLNLLDMTVAGQRLHLPPGTFALRQDGT 281

Query: 275 GGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD-KLCYHGIMSSDLKGFPT 333
           GG +IDSGT +T++ + AY  L        +    +    P+  LCY    +       +
Sbjct: 282 GGTIIDSGTGLTFITQTAYPRLISAFQNYFDHRGFQRVHIPEFDLCYSFRGNHTFHDHAS 341

Query: 334 VRFHFRGGAKLALEKDSMFYQPRPD--AFCMAVNPASINNRYVNFSLIGMMAQQFYNVGY 391
           + FHF   A   ++ D + Y P  D  AFC+A+ P     R V    IG + Q      Y
Sbjct: 342 MTFHFE-RADFTVQADYV-YLPMEDDNAFCVALQPTPPQQRTV----IGAINQGNTRFIY 395

Query: 392 DIGRKQKTFQRMDCE 406
           D    Q  F   +C 
Sbjct: 396 DAAAHQLLFIAENCR 410


>gi|413946455|gb|AFW79104.1| hypothetical protein ZEAMMB73_209101 [Zea mays]
          Length = 480

 Score = 88.2 bits (217), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 102/381 (26%), Positives = 155/381 (40%), Gaps = 46/381 (12%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC---SPQLGTIFYPSRSSSYAYVPCD 117
           ++V   +G P  P     DTGS L WV C    D    +P+   +F  + S S+A + C 
Sbjct: 112 YFVRFRVGTPAQPFVLVADTGSDLTWVKCSGAGDGTGDAPR--RVFRAAASRSWAPIACS 169

Query: 118 SEHC-RYFPY--ARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTF-------KNTDESTI 167
           S+ C  Y P+  A CS+    C Y   Y  G      V T+  T        ++      
Sbjct: 170 SDTCTSYVPFSLANCSSPASPCAYDYRYNDGSAARGVVGTDSATIALSGSESRDGGGRRA 229

Query: 168 HVQDVVFGCGFS-TNRNFKFS-GIFGLGIGRSSLVS----QLNSSFSYCIGSLHDPDYLH 221
            +Q VV GC  S   ++F+ S G+  LG    S  S    +    FSYC+     P    
Sbjct: 230 KLQGVVLGCTASYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRNAT 289

Query: 222 NKLILGDGAIIDEGDA------------TPL---QFIDGHYYITLEAISVDGRMLDINPN 266
           + L  G      EG A            TPL   + +   Y + ++A+ V G  LDI  +
Sbjct: 290 SYLTFGPPG--PEGGAAASSSSSSAAARTPLLLDRRMSPFYAVAVDAVHVAGEALDIPAD 347

Query: 267 IFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSS 326
           ++     GGG ++DSGT +T L   AY A+   +  RL G    S   P + CY+   ++
Sbjct: 348 VWDV-ARGGGAILDSGTSLTVLATPAYRAVVAALSERLAGLPRVSMD-PFEYCYN--WTA 403

Query: 327 DLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQF 386
                P +   F G A+L     S      P   C+ V        +   S+IG + QQ 
Sbjct: 404 AALEIPGLEVRFAGSARLQPPAKSYVVDAAPGVKCIGVQ----EGAWPGVSVIGNILQQD 459

Query: 387 YNVGYDIGRKQKTFQRMDCEV 407
           +   +D+  +   F+   C +
Sbjct: 460 HLWEFDLRDRWLRFKHTRCAL 480


>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
          Length = 435

 Score = 88.2 bits (217), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 83/292 (28%), Positives = 125/292 (42%), Gaps = 33/292 (11%)

Query: 24  RGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSS 83
           R   + +  L     K+R H+             N    V++++G PP      +DTGS 
Sbjct: 37  RSRQVPVGALPRPPSKLRFHH-------------NVSLTVSLAVGTPPQNVTMVLDTGSE 83

Query: 84  LLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHC--RYFPYA-RCSAYKHRCIYTQ 140
           L W+ C   R  +    + F P  S+++A VPC S  C  R  P    C A   RC  + 
Sbjct: 84  LSWLLCATGRAAAAAADS-FRPRASATFAAVPCGSARCSSRDLPAPPSCDAASRRCRVSL 142

Query: 141 LYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF----STNRNFKFSGIFGLGIGR 196
            Y  G  +   ++T+   F   D   +      FGC      S+      +G+ G+  G 
Sbjct: 143 SYADGSASDGALATD--VFAVGDAPPLRS---AFGCMSAAYDSSPDAVATAGLLGMNRGA 197

Query: 197 SSLVSQLNSS-FSYCIGSLHDPDYL---HNKL-ILGDGAIIDEGDATPLQFIDG-HYYIT 250
            S V+Q ++  FSYCI    D   L   H+ L  L            PL + D   Y + 
Sbjct: 198 LSFVTQASTRRFSYCISDRDDAGVLLLGHSDLPFLPLNYTPLYQPTPPLPYFDRVAYSVQ 257

Query: 251 LEAISVDGRMLDINPNIFKRDDSGGG-VMIDSGTDVTWLVKEAYEALRDEVM 301
           L  I V G+ L I P++   D +G G  M+DSGT  T+L+ +AY A++ E +
Sbjct: 258 LLGIRVGGKPLPIPPSVLAPDHTGAGQTMVDSGTQFTFLLGDAYSAVKAEFL 309


>gi|413944596|gb|AFW77245.1| hypothetical protein ZEAMMB73_545774 [Zea mays]
 gi|414876929|tpg|DAA54060.1| TPA: hypothetical protein ZEAMMB73_875469 [Zea mays]
          Length = 459

 Score = 87.8 bits (216), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 98/368 (26%), Positives = 152/368 (41%), Gaps = 38/368 (10%)

Query: 55  DISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC--YPCRDCSPQLGTIFYPSRSSSYA 112
           D S   + +  S+G PP       DTGS L+W  C       C PQ    + P+ SS++A
Sbjct: 85  DDSGGAYDMEFSMGTPPQKLTALADTGSDLIWAKCGGACTTSCEPQGSPSYLPNASSTFA 144

Query: 113 YVPCDSEHC---RYFPYARCSAYKHRCIYTQLYLIGPE----TSVFVSTEQLTFKNTDES 165
            +PC    C   R    A C+A    C Y   Y +G +    T  F++ E  T       
Sbjct: 145 KLPCSDRLCSLLRSDSVAWCAAAGAECDYRYSYGLGDDDHHYTQGFLARETFTLGAD--- 201

Query: 166 TIHVQDVVFGCGFSTNRNFKFSGIFGLGI-GRSSLVSQLN-SSFSYCIGSLHDPDYLHNK 223
              V  V FGC  ++   +           G  SLVSQLN S+F YC+ S        + 
Sbjct: 202 --AVPSVRFGCTTASEGGYGSGSGLVGLGRGPLSLVSQLNASTFMYCLTSDASK---ASP 256

Query: 224 LILGDGAIID--EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDS 281
           L+ G  A +   +  +T L      Y + L +IS+        P + + +    GV+ DS
Sbjct: 257 LLFGSLASLTGAQVQSTGLLASTTFYAVNLRSISIGSA---TTPGVGEPE----GVVFDS 309

Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLK--GFPTVRFHFR 339
           GT +T+L + AY   +   + +   +Q+      +  C+    +  L     PT+  HF 
Sbjct: 310 GTTLTYLAEPAYSEAKAAFLSQTSLDQVEDTDGFEA-CFQKPANGRLSNAAVPTMVLHFD 368

Query: 340 GGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKT 399
            GA +AL   +   +      C  V       R  + S+IG + Q  Y V +D+ R   +
Sbjct: 369 -GADMALPVANYVVEVEDGVVCWIV------QRSPSLSIIGNIMQVNYLVLHDVHRSVLS 421

Query: 400 FQRMDCEV 407
           FQ  +C+ 
Sbjct: 422 FQPANCDT 429


>gi|125571841|gb|EAZ13356.1| hypothetical protein OsJ_03278 [Oryza sativa Japonica Group]
          Length = 447

 Score = 87.8 bits (216), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 51/180 (28%), Positives = 81/180 (45%), Gaps = 12/180 (6%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSL 60
           + HRD++  P   P       +++ +    AR A L   + +     + +       +  
Sbjct: 31  VFHRDALFPP--PPGAKRGSLLRQRLAADAARYASL---VDATGRLHSPVFSGIPFESGE 85

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           ++  + +G P       +DTGS L+W+ C PCR C  Q G +F P RSS+Y  VPC S  
Sbjct: 86  YFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRSSTYRRVPCSSPQ 145

Query: 121 CRYFPYARC---SAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
           CR   +  C    A    C Y   Y  G  ++  ++T++L F N      +V +V  GCG
Sbjct: 146 CRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGDLATDKLAFAN----DTYVNNVTLGCG 201


>gi|222632750|gb|EEE64882.1| hypothetical protein OsJ_19741 [Oryza sativa Japonica Group]
          Length = 456

 Score = 87.8 bits (216), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 108/388 (27%), Positives = 156/388 (40%), Gaps = 52/388 (13%)

Query: 39  KIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQ 98
           + R    + A +L         ++  + +G P       +DTGS ++W    P R   P 
Sbjct: 100 RPRRRGGFAAPLLSGLPQGTGEYFAQVGVGTPATTALMVLDTGSDVVWA---PVRALPPL 156

Query: 99  LGTIFYPSRSSSYAYVP-----CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVS 153
           L  +   S S+  A  P     C +  CR    A C   ++ C+Y   Y  G  T+   +
Sbjct: 157 LRAVRQGS-STGAAPAPTPRWNCVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFA 215

Query: 154 TEQLTFKNTDESTIHVQDVVFGCGFSTNRNF-KFSGIFGLGIGRSSLVSQL----NSSFS 208
           +E LTF         VQ V  GCG      F   SG+ GLG GR S  SQ+      SFS
Sbjct: 216 SETLTFAR----GARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFS 271

Query: 209 YCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRM-------- 260
           YC+               G          TP   +   YY+ L   SV G          
Sbjct: 272 YCLVDRTSSRRARPSRRWG---------GTPR--MATFYYVHLLGFSVGGARVKGVSQSD 320

Query: 261 LDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMR--SYSWPDKL 318
           L +NP   +     GGV++DSGT VT L +  YEA+RD       G ++    +S  D  
Sbjct: 321 LRLNPTTGR-----GGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDT- 374

Query: 319 CYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS-MFYQPRPDAFCMAVNPASINNRYVNFS 377
           CY+ +    +   PTV  H  GGA +AL  ++ +        FC A+  A  +      S
Sbjct: 375 CYN-LSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAM--AGTDG---GVS 428

Query: 378 LIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
           +IG + QQ + V +D   ++  F    C
Sbjct: 429 IIGNIQQQGFRVVFDGDAQRVGFVPKSC 456


>gi|115457778|ref|NP_001052489.1| Os04g0337000 [Oryza sativa Japonica Group]
 gi|113564060|dbj|BAF14403.1| Os04g0337000, partial [Oryza sativa Japonica Group]
          Length = 321

 Score = 87.8 bits (216), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 79/302 (26%), Positives = 137/302 (45%), Gaps = 35/302 (11%)

Query: 57  SNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSY 111
           +  L+Y  I IG P    +  +DTGS +LWV+C  C  C  + G     T++ P  SS+ 
Sbjct: 29  ATRLYYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTG 88

Query: 112 AYVPCDSEHC--RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTD---EST 166
           + V CD   C   Y            C Y+  Y  G  T+ +  ++ L F       ++ 
Sbjct: 89  SKVSCDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTR 148

Query: 167 IHVQDVVFGCGFST-----NRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLH 215
                V FGCG        + N    GI G G   +S++SQL+++      F++C+    
Sbjct: 149 PANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCL---- 204

Query: 216 DPDYLHNKLILGDGAIID-EGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSG 274
             D ++   I   G ++  +   TPL     HY + L++I V G  L +  ++F   +  
Sbjct: 205 --DTINGGGIFAIGNVVQPKVKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEK- 261

Query: 275 GGVMIDSGTDVTWLVKEAYEALRDEVMIRL--EGEQMRSYSWPDKLCYHGIMSSDLKGFP 332
            G +IDSGT +T+L +  Y+    E+M+ +  + + +  ++  + LC+  +    L+  P
Sbjct: 262 KGTIIDSGTTLTYLPEIVYK----EIMLAVFAKHKDITFHNVQEFLCFQYVGRYTLQHTP 317

Query: 333 TV 334
           +V
Sbjct: 318 SV 319


>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
          Length = 418

 Score = 87.8 bits (216), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 93/365 (25%), Positives = 148/365 (40%), Gaps = 51/365 (13%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCY--PCRDCSPQLGTIFYPSRSSSYAYVPCDS 118
           + V+++ G PP      +DTGS + W  C   P   C  Q   +F PS SSS+A +PC S
Sbjct: 88  YLVHLAAGTPPQEVQLTLDTGSDITWTQCKRCPASACFNQTLPLFDPSASSSFASLPCSS 147

Query: 119 EHCRYFPYARCS--AYKHRCIYTQLYLIGPETSVFVSTEQLTFKN--TDESTIHVQDVVF 174
             C   P       A    C Y+  Y  G  +   +  E  TF +   + S+  V  +VF
Sbjct: 148 PACETTPPCGGGNDATSRPCNYSISYGDGSVSRGEIGREVFTFASGTGEGSSAAVPGLVF 207

Query: 175 GCGFSTNRNFKF--SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAI 231
           GCG +    F    +GI G G G  SL SQL   +FS+C  ++       + ++LG   +
Sbjct: 208 GCGHANRGVFTSNETGIAGFGRGSLSLPSQLKVGNFSHCFTTITGSKT--SAVLLGLPGV 265

Query: 232 IDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKE 291
                A+PL    G Y       S                        +SGT +T L   
Sbjct: 266 APP-SASPLGRRRGSYRCRSTPRSS-----------------------NSGTSITSLPPR 301

Query: 292 AYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSM 351
            Y A+R+E   +++   +   +     C+   +       PT+  HF  GA + L +++ 
Sbjct: 302 TYRAVREEFAAQVKLPVVPGNATDPFTCFSAPLRGPKPDVPTMALHFE-GATMRLPQENY 360

Query: 352 FYQPRPD--------AFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
            ++   D          C+AV    I    +   ++G + QQ  +V YD+   + +F   
Sbjct: 361 VFEVVDDDDAGNSSRIICLAV----IEGGEI---ILGNIQQQNMHVLYDLQNSKLSFVPA 413

Query: 404 DCEVL 408
            C+ L
Sbjct: 414 QCDQL 418


>gi|326533540|dbj|BAK05301.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 410

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 94/386 (24%), Positives = 160/386 (41%), Gaps = 58/386 (15%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHC----YPCRDCSPQLGTIFYPSRSSSYAYVPC 116
           FY  ++IG+P  P F  +DTGS+L W+ C    + C+ C P+    +Y     +   V C
Sbjct: 38  FYATLNIGEPAKPYFLDVDTGSNLTWLECHHPVHGCKGCHPRPPHPYYTPADGNLKVV-C 96

Query: 117 DSEHC----RYFP-YARCSAY-KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
            S  C    R  P    CS    HRC Y   Y+ G ++   ++T+ ++    D+  I   
Sbjct: 97  GSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVTG-KSEGDLATDIISVNGRDKKRI--- 152

Query: 171 DVVFGCGFSTNRNF-----KFSGIFGLGIGRSSLVSQLN-------SSFSYCIGSLHDPD 218
              FGCG+              GI GLG+G++ L +QL        +   +C+ S     
Sbjct: 153 --AFGCGYKQEEPADSPPSPVDGILGLGMGKAGLAAQLKGHKMIKENVIGHCLSSKGK-- 208

Query: 219 YLHNKLILGDGAIIDEG-DATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
                L +GD      G    P++    +Y   L  + +D + +  NP            
Sbjct: 209 ---GVLYVGDFNPPTRGVTWAPMRESLFYYSPGLAEVFIDKQPIRGNPTF--------EA 257

Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEG---EQMRSYSWPDKLCYHGI--------MSS 326
           + DSG+  T +  + Y  +  +V + L     E+++  + P  LC+ G         + +
Sbjct: 258 VFDSGSTYTHVPAQIYNEIVSKVRVTLSESSLEEVKGRALP--LCWKGKKPFGSVNDVKN 315

Query: 327 DLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINN--RYVNFSLIGMMAQ 384
             K       H RG + L +   +  +       C+A+  AS++   + +NF LIG +  
Sbjct: 316 QFKALSLKITHARGTSNLDIPPQNYLFVKEDGETCLAILDASLDPVLKELNFILIGAVTM 375

Query: 385 QFYNVGYDIGRKQKTFQRMDCEVLDD 410
           Q   V YD  +KQ  + R  C+ + +
Sbjct: 376 QDLFVIYDNEKKQLGWVRAQCDRVQE 401


>gi|225431324|ref|XP_002269880.1| PREDICTED: aspartic proteinase-like protein 1 [Vitis vinifera]
 gi|297739017|emb|CBI28369.3| unnamed protein product [Vitis vinifera]
          Length = 518

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 93/380 (24%), Positives = 156/380 (41%), Gaps = 58/380 (15%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGT---------IFYPSRSSS 110
           L Y  +S+G P      A+DTGS L WV C  C  C+P  GT         I+ P  SS+
Sbjct: 102 LHYTTVSLGTPGKKFLVALDTGSDLFWVPC-DCSRCAPTEGTTYASDFELSIYNPKGSST 160

Query: 111 YAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
              V CD+  C +    RC      C Y   Y+    ++  +  E +    T+++     
Sbjct: 161 SRKVTCDNSLCAH--RNRCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTTEDNRQEFV 218

Query: 171 D--VVFGCGFSTNRNF----KFSGIFGLGIGRSSLVSQLNS------SFSYCIGSLHDPD 218
           +  V FGCG     +F      +G+FGLG+ + S+ S L+       SFS C G    PD
Sbjct: 219 EAYVTFGCGQVQTGSFLDIAAPNGLFGLGLEKISVPSILSKEGFTADSFSMCFG----PD 274

Query: 219 YLHNKLILGDGAIIDEGDATPLQFIDGH--YYITLEAISVDGRMLDINPNIFKRDDSGGG 276
            +  ++  GD    D+ + TP      H  Y IT+  + V   ++D++            
Sbjct: 275 GI-GRISFGDKGSPDQ-EETPFNLNALHPTYNITVTQVRVGTTLIDLDFT---------- 322

Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD-----KLCYHGIMSSDLKGF 331
            + DSGT  T+LV   Y      V+     +   S   PD     + CY      +    
Sbjct: 323 ALFDSGTSFTYLVDPIYT----NVLKSFHSQAQDSRRPPDSRIPFEFCYDMSPGENTSLI 378

Query: 332 PTVRFHFRGGAKLALEKDSMFYQPRPD-AFCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
           P++    +GG++  +    +    + +  +CMAV       R    ++IG      Y + 
Sbjct: 379 PSMSLTMKGGSQFPVYDPIIIISSQSELIYCMAV------VRSAELNIIGQNFMTGYRII 432

Query: 391 YDIGRKQKTFQRMDCEVLDD 410
           +D  +    ++  +C+ +++
Sbjct: 433 FDREKLVLGWKEFECDDIEN 452


>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
 gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
          Length = 395

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 88/350 (25%), Positives = 148/350 (42%), Gaps = 49/350 (14%)

Query: 53  SNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSR 107
           ++ +S  L++  + +G P       +DTGS +LWV+C PC  C  +       T++ P  
Sbjct: 21  ADPLSGGLYFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRE 80

Query: 108 SSSYAYVPCDSEHC---RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDE 164
           SS+ + V C    C   R F  A+CS   + C Y   Y  G  +  +   + + +     
Sbjct: 81  SSTTSLVSCSDPLCVRGRRFAEAQCSQTTNNCEYIFSYGDGSTSEGYYVRDAMQYNVISS 140

Query: 165 STIH--VQDVVFGCGFSTNRNFKFS-----GIFGLGIGRSSLVSQLNSS------FSYCI 211
           + +      V+FGC      +   S     GI G G    S+ +QL +       FS+C+
Sbjct: 141 NGLANTTSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCL 200

Query: 212 GSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRD 271
                   +     + +  +      TPL     HY + L  ISV+   L I+   F   
Sbjct: 201 EGEKRGGGILVIGGIAEPGMT----YTPLVPDSVHYNVVLRGISVNSNRLPIDAEDFSST 256

Query: 272 DSGGGVMIDSGTDVTWLVKEAY----EALRDEVM---IRLEGEQMRSYSWPDKLCYHGIM 324
           +   GV++DSGT + +    AY    +A+R+      +R++G   + +    +L      
Sbjct: 257 ND-TGVIMDSGTTLAYFPSGAYNVFVQAIREATSATPVRVQGMDTQCFLVSGRL------ 309

Query: 325 SSDLKGFPTVRFHFRGGAKLALEKDS--MFYQPRP----DAFCMAVNPAS 368
            SDL  FP V  +F GGA + L+ D+  M+    P    D +C+    +S
Sbjct: 310 -SDL--FPNVTLNFEGGA-MELQPDNYLMWGGTAPTGTTDVWCIGWQSSS 355


>gi|449434468|ref|XP_004135018.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 568

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 84/310 (27%), Positives = 137/310 (44%), Gaps = 53/310 (17%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTI---------FYPSRSSS 110
           L+Y N+S+G P +    A+DTGS L W+ C  C  C   L T          + P+ S++
Sbjct: 103 LYYANVSVGTPSLDFLVALDTGSDLFWLPC-ECSSCFTYLNTSNGGKFMLNHYSPNDSTT 161

Query: 111 YAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
            + VPC S  C      RC++ ++ C Y   YL    +S+    E +    TD+S +   
Sbjct: 162 SSTVPCTSSLCN-----RCTSNQNVCPYEMRYLSANTSSIGYLVEDVLHLATDDSLLKPV 216

Query: 171 D--VVFGCGFSTNRNFKFS----GIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPD 218
           +  + FGCG      F  +    G+ GLG+ + S+ S L      ++SFS C G+     
Sbjct: 217 EAKITFGCGTVQTGIFATTAAPNGLIGLGMEKISVPSFLADQGLTSNSFSMCFGADG--- 273

Query: 219 YLHNKLILGDGAIIDEGDATPLQFIDGH--YYITLEAISVDGRMLDINPNIFKRDDSGGG 276
             + ++  GD    D+   TP   +  +  Y +T   I+V G      PN     D    
Sbjct: 274 --YGRIDFGDTGPADQ-KQTPFNTMLEYQSYNVTFNVINVGGE-----PN-----DVPFT 320

Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYS-----WPDKLCYHGIMSSDLKGF 331
            + DSGT  T+L + AY  +  ++     G +++ YS     +P + CY     +    +
Sbjct: 321 AIFDSGTSFTYLTEPAYSTITKQMD---AGMKLKRYSLFGPNFPFEYCYEIPPGAKEFQY 377

Query: 332 PTVRFHFRGG 341
            T+ F  +GG
Sbjct: 378 LTLNFTMKGG 387


>gi|194708432|gb|ACF88300.1| unknown [Zea mays]
          Length = 452

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 98/373 (26%), Positives = 155/373 (41%), Gaps = 51/373 (13%)

Query: 80  TGSSLLWVHC---YPCRDCSPQLGT---IFYPSRSSSYAYVPCDSEHCRYFPYAR----- 128
           +GS L WV C   Y CR+CS    +   +F+P  SSS   V C +  C++   A      
Sbjct: 79  SGSHLTWVPCTSSYECRNCSSPSASAVPVFHPKNSSSSRLVGCRNPSCQWVHSAANLATK 138

Query: 129 -----CSAYKHRCIYTQLYLIGPETSVFVS--TEQLTFKNTDESTIH-VQDVVFGCGFST 180
                CS     C      +  P   V+ S  T  L   +T  +    V   V GC   +
Sbjct: 139 CRRAPCSPGAANCPAAASNVCPPYAVVYGSGSTAGLLIADTLRAPGRAVPGFVLGCSLVS 198

Query: 181 NRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATP 239
                 SG+ G G G  S+ +QL    FSYC+ S    D   N  + G   +   G    
Sbjct: 199 VHQPP-SGLAGFGRGAPSVPAQLGLPKFSYCLLSRRFDD---NAAVSGSLVLGGTGGGEG 254

Query: 240 LQFI-------------DGHYYITLEAISVDGRMLDINPNIFKRDDSG-GGVMIDSGTDV 285
           +Q++               +YY+ L  ++V G+ + +    F  + +G GG ++DSGT  
Sbjct: 255 MQYVPLVKSAAGDKLPYGVYYYLALRGVTVGGKAVRLPARAFAANAAGSGGTIVDSGTTF 314

Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL----CYHGIMSSDLKGFPTVRFHFRGG 341
           T+L    ++ + D V+  + G   RS    D+L    C+     +     P + FHF GG
Sbjct: 315 TYLDPTVFQPVADAVVAAVGGRYKRSKDAEDELGLHPCFALPQGARSMALPELSFHFEGG 374

Query: 342 AKLALEKDSMFY---QPRPDAFCMAV------NPASINNRYVNFSLIGMMAQQFYNVGYD 392
           A + L  ++ F    +   +A C+AV         + N       ++G   QQ Y V YD
Sbjct: 375 AVMQLPVENYFVVAGRGAVEAICLAVVTDFSGGSGAGNEGSGPAIILGSFQQQNYLVEYD 434

Query: 393 IGRKQKTFQRMDC 405
           + +++  F+R  C
Sbjct: 435 LEKERLGFRRQSC 447


>gi|88174593|gb|ABD39371.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 82/329 (24%), Positives = 139/329 (42%), Gaps = 37/329 (11%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + +++ +G P   Q   +DTGSS  WV C  C  C     T F  SRS++ A V C +  
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGCHTNPRT-FLQSRSTTCAKVSCGTSM 58

Query: 121 CRYF---PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGC- 176
           C      P+ + S     C +   Y  G  +   +  + LTF +  +    +    FGC 
Sbjct: 59  CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK----IPGFSFGCN 114

Query: 177 --GFSTNRNFKFSGIFGLGIGRSSLVSQLNSS---FSYCIGSLHDPDYLHNKLI--LGDG 229
              F  N      G+ G+G G  S++ Q + +   FSYC+          +K       G
Sbjct: 115 MDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDCFSYCLPLQKSERGFFSKTTGYFSLG 174

Query: 230 AIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
            +    D    + +        +++ L AISVDG  L ++P++F R     GV+ DSG++
Sbjct: 175 KVATRTDVRYTKMVARKKNTELFFVDLIAISVDGERLGLSPSVFSRK----GVVFDSGSE 230

Query: 285 VTWLVKEAYEALRD---EVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGG 341
           ++++   A   L     E++++    +  S    ++ CY  + S D    P +  HF   
Sbjct: 231 LSYIPDRALSVLSQRIRELLLKRGAAEEES----ERNCYD-MRSVDEGDMPAISLHFDDA 285

Query: 342 AKLALEKDSMFYQ---PRPDAFCMAVNPA 367
           A+  L    +F +      D +C+A  P 
Sbjct: 286 ARFDLGSHGVFVERSVQEQDVWCLAFAPT 314


>gi|222618833|gb|EEE54965.1| hypothetical protein OsJ_02555 [Oryza sativa Japonica Group]
          Length = 393

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 93/347 (26%), Positives = 132/347 (38%), Gaps = 87/347 (25%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSP---QLGTIFYPSRSSSYAYVPCD 117
           + +++ +G P V Q   +DTGS + WV C PC   SP     G +F P+ SS+YA   C 
Sbjct: 106 YVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFNCS 165

Query: 118 SEHCRYFPYA----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVV 173
           +  C     +     C A K RC Y   Y  G  T+         F+             
Sbjct: 166 AAACAQLGDSGEANGCDA-KSRCQYIVKYGDGSNTT------GTGFQ------------- 205

Query: 174 FGCG---FSTNRNFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGA 230
           FGC         + K  G+ GLG    SLVSQ  +       S   P Y           
Sbjct: 206 FGCSHAELGAGMDDKTDGLIGLGGDAQSLVSQTAAR------SKKVPTY----------- 248

Query: 231 IIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVK 290
                           Y+  LE I+V G+ L ++P++F       G ++DSGT +T L  
Sbjct: 249 ----------------YFAALEDIAVGGKKLGLSPSVFA-----AGSLVDSGTVITRLPP 287

Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKL-----CYHGIMSSDLKGFPTVRFHFRGGAKLA 345
            AY AL            M  Y+  + L     C++     D    PTV   F GGA + 
Sbjct: 288 AAYAALSSAFR-----AGMTRYARAEPLGILDTCFN-FTGLDKVSIPTVALVFAGGAVVD 341

Query: 346 LEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYD 392
           L+   +         C+A  P   +     F  IG + Q+ + V YD
Sbjct: 342 LDAHGIV-----SGGCLAFAPTRDDKA---FGTIGNVQQRTFEVLYD 380


>gi|196212948|gb|ACG76110.1| S5 [Oryza sativa Japonica Group]
 gi|340810887|gb|AEK75370.1| S5 [Oryza sativa]
 gi|340810903|gb|AEK75378.1| S5 [Oryza sativa]
 gi|340810921|gb|AEK75387.1| S5 [Oryza sativa]
 gi|340810955|gb|AEK75404.1| S5 [Oryza sativa]
 gi|340811079|gb|AEK75466.1| S5 [Oryza nivara]
 gi|340811090|gb|AEK75471.1| S5 [Oryza rufipogon]
 gi|340811116|gb|AEK75484.1| S5 [Oryza nivara]
          Length = 357

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 109/383 (28%), Positives = 161/383 (42%), Gaps = 66/383 (17%)

Query: 63  VNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DC---SPQLGTIFYPSRSSSYAYVPCDS 118
           + +S+G+PPV    A+DTGS+L WV C PC   C   S + G IF P RS +   V C S
Sbjct: 1   MAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVRCSS 60

Query: 119 EHCRYFPY------ARCSAYKHRCIYTQLYLIGPETSV-FVSTEQLTFKNTDESTIHVQD 171
             C    Y      A C   +  C Y+  Y  G   SV  + T+ L   ++        D
Sbjct: 61  VKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS------FMD 114

Query: 172 VVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN--------SSFSYCIGSLH-DPDYLHN 222
           ++FGC      +   +GIFG G    S   QL          +FSYC+ +    P Y   
Sbjct: 115 LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCLPTDETKPGY--- 171

Query: 223 KLILG--DGAIIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
            +ILG  D A +D G     + I+   Y +T+E +  +G+ L           S   +++
Sbjct: 172 -MILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVT---------SSSEMIV 221

Query: 280 DSGTDVTWLVKEAYEALRDEVMI---------RLEGEQMRSY----SWPDKLCYHGIMS- 325
           DSG   T L    + AL D+ +          R    +  SY    S  D   ++G ++ 
Sbjct: 222 DSGAQRTSLWPSTF-ALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITP 280

Query: 326 -SDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCM--AVNPASINNRYVNFSLIGMM 382
            S+    P +   F GGA LAL   ++FY       CM  A NPA      +   ++G  
Sbjct: 281 FSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPA------LRSQILGNR 334

Query: 383 AQQFYNVGYDIGRKQKTFQRMDC 405
             + +   +DI  KQ  F+   C
Sbjct: 335 VTRSFGTTFDIQGKQFGFKYAAC 357


>gi|25347778|pir||B84556 hypothetical protein At2g17760 [imported] - Arabidopsis thaliana
          Length = 473

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 101/381 (26%), Positives = 158/381 (41%), Gaps = 60/381 (15%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGT---------IFYPSRSSS 110
           L Y N+++G P      A+DTGS L W+ C  C +C  +L           I+ P+ SS+
Sbjct: 54  LHYANVTVGTPSDWFMVALDTGSDLFWLPC-DCTNCVRELKAPGGSSLDLNIYSPNASST 112

Query: 111 YAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQ-LTFKNTDESTIHV 169
              VPC+S  C      RC++ +  C Y   YL    +S  V  E  L   + D+S+  +
Sbjct: 113 STKVPCNSTLCTR--GDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAI 170

Query: 170 -QDVVFGCGFSTNRNF----KFSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPD 218
              V FGCG      F      +G+FGLG+   S+ S L       +SFS C G+     
Sbjct: 171 PARVTFGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDG--- 227

Query: 219 YLHNKLILGDGAIIDEGDATPLQFIDGH--YYITLEAISVDGRMLDINPNIFKRDDSGGG 276
               ++  GD   +D+ + TPL     H  Y IT+  ISV G   D+  +          
Sbjct: 228 --AGRISFGDKGSVDQRE-TPLNIRQPHPTYNITVTKISVGGNTGDLEFD---------- 274

Query: 277 VMIDSGTDVTWLVKEAYEALRDEV-MIRLEGE-QMRSYSWPDKLCY---------HGIMS 325
            + DSGT  T+L   AY  + +    + L+   Q      P + CY         H   +
Sbjct: 275 AVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALRLPLYSGHHHPN 334

Query: 326 SDLKGFPTVRFHFRGGAKLALEKDSMFYQPR-PDAFCMAVNPASINNRYVNFSLIGMMAQ 384
            D   +P V    +GG+   +    +    +  D +C+A+       +  + S+IG    
Sbjct: 335 KDSFQYPAVNLTMKGGSSYPVYHPLVVIPMKDTDVYCLAI------MKIEDISIIGQNFM 388

Query: 385 QFYNVGYDIGRKQKTFQRMDC 405
             Y V +D  +    ++  DC
Sbjct: 389 TGYRVVFDREKLILGWKESDC 409


>gi|118482048|gb|ABK92955.1| unknown [Populus trichocarpa]
          Length = 425

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 97/363 (26%), Positives = 151/363 (41%), Gaps = 34/363 (9%)

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
           + +  + V  ++G P      A+DT +   W+ C  C  CS    T+F    S+++  + 
Sbjct: 85  VQSPTYIVKANVGTPAQTFLMALDTSNDAAWIPCNGCVGCS---STVFNSVTSTTFKTLG 141

Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
           CD+  C+  P   C      C +   Y  G  T +      LT      ST  V    FG
Sbjct: 142 CDAPQCKQVPNPTCGG--STCTWNTTY--GGSTIL----SNLTRDTIALSTDIVPGYTFG 193

Query: 176 C-GFSTNRNFKFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGA 230
           C   +T  +    G+ GLG G  S +SQ      S+FSYC+ S    ++    L LG   
Sbjct: 194 CIQKTTGSSVPPQGLLGLGRGPLSFLSQTQDLYKSTFSYCLPSFRTLNF-SGTLRLGPAG 252

Query: 231 IIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRD-DSGGGVMIDSGTDVT 286
                  TPL         YY+ L  I V  +++DI  +    +  +G G + DSGT  T
Sbjct: 253 QPLRIKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFNPTTGAGTIFDSGTVFT 312

Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
            LV   Y A+RDE   R+    + S    D  CY G + +     PT+ F F  G  + L
Sbjct: 313 RLVAPVYTAVRDEFRKRVGNAIVSSLGGFDT-CYTGPIVA-----PTMTFMF-SGMNVTL 365

Query: 347 EKDSMFYQPRPDA---FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
             D++  +    +     MA  P ++N+     ++I  M QQ + + +D+   +    R 
Sbjct: 366 PTDNLLIRSTAGSTSCLAMAAAPDNVNSV---LNVIANMQQQNHRILFDVPNSRIGVARE 422

Query: 404 DCE 406
            C 
Sbjct: 423 PCS 425


>gi|224057272|ref|XP_002299201.1| predicted protein [Populus trichocarpa]
 gi|118483775|gb|ABK93780.1| unknown [Populus trichocarpa]
 gi|222846459|gb|EEE84006.1| predicted protein [Populus trichocarpa]
          Length = 425

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 97/363 (26%), Positives = 150/363 (41%), Gaps = 34/363 (9%)

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
           + +  + V  ++G P      A+DT +   W+ C  C  CS    T+F    S+++  + 
Sbjct: 85  VQSPTYIVKANVGTPAQTFLMALDTSNDAAWIPCNGCVGCS---STVFNSVTSTTFKTLG 141

Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
           CD+  C+  P   C      C +   Y  G  T +      LT      ST  V    FG
Sbjct: 142 CDAPQCKQVPNPTCGG--STCTWNTTY--GGSTIL----SNLTRDTIALSTDIVPGYTFG 193

Query: 176 C-GFSTNRNFKFSGIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNKLILGDGA 230
           C   +T  +    G+ GLG G  S +SQ      S+FSYC+ S    ++    L LG   
Sbjct: 194 CIQKTTGSSVPPQGLLGLGRGPLSFLSQTQDLYKSTFSYCLPSFRTLNF-SGTLRLGPAG 252

Query: 231 IIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVT 286
                  TPL         YY+ L  I V  +++DI  +       +G G + DSGT  T
Sbjct: 253 QPLRIKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFNPTTGAGTIFDSGTVFT 312

Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
            LV   Y A+RDE   R+    + S    D  CY G + +     PT+ F F  G  + L
Sbjct: 313 RLVAPVYTAVRDEFRKRVGNAIVSSLGGFDT-CYTGPIVA-----PTMTFMF-SGMNVTL 365

Query: 347 EKDSMFYQPRPDA---FCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRM 403
             D++  +    +     MA  P ++N+     ++I  M QQ + + +D+   +    R 
Sbjct: 366 PPDNLLIRSTAGSTSCLAMAAAPDNVNSV---LNVIANMQQQNHRILFDVPNSRIGVARE 422

Query: 404 DCE 406
            C 
Sbjct: 423 PCS 425


>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
          Length = 441

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 80/259 (30%), Positives = 113/259 (43%), Gaps = 23/259 (8%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTI--FYPSRSSSYAYVP 115
           N    V++++G PP      +DTGS L W+ C P         +   F P  S ++A VP
Sbjct: 62  NVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASVP 121

Query: 116 CDSEHCRY----FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
           C S  CR      P A C     +C  +  Y  G  +   ++TE  T             
Sbjct: 122 CGSAQCRSRDLPSPPA-CDGASKQCRVSLSYADGSSSDGALATEVFTVGQGPP-----LR 175

Query: 172 VVFGC---GFSTNRN-FKFSGIFGLGIGRSSLVSQLNS-SFSYCIGSLHDPDYL---HNK 223
             FGC    F T+ +    +G+ G+  G  S VSQ ++  FSYCI    D   L   H+ 
Sbjct: 176 AAFGCMATAFDTSPDGVATAGLLGMNRGALSFVSQASTRRFSYCISDRDDAGVLLLGHSD 235

Query: 224 L-ILGDGAIIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMID 280
           L  L          A PL + D   Y + L  I V G+ L I  ++   D +G G  M+D
Sbjct: 236 LPFLPLNYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTMVD 295

Query: 281 SGTDVTWLVKEAYEALRDE 299
           SGT  T+L+ +AY AL+ E
Sbjct: 296 SGTQFTFLLGDAYSALKAE 314


>gi|190896584|gb|ACE96805.1| aspartyl protease [Populus tremula]
 gi|190896586|gb|ACE96806.1| aspartyl protease [Populus tremula]
 gi|190896588|gb|ACE96807.1| aspartyl protease [Populus tremula]
 gi|190896590|gb|ACE96808.1| aspartyl protease [Populus tremula]
 gi|190896592|gb|ACE96809.1| aspartyl protease [Populus tremula]
 gi|190896594|gb|ACE96810.1| aspartyl protease [Populus tremula]
 gi|190896596|gb|ACE96811.1| aspartyl protease [Populus tremula]
 gi|190896598|gb|ACE96812.1| aspartyl protease [Populus tremula]
 gi|190896600|gb|ACE96813.1| aspartyl protease [Populus tremula]
 gi|190896602|gb|ACE96814.1| aspartyl protease [Populus tremula]
 gi|190896604|gb|ACE96815.1| aspartyl protease [Populus tremula]
 gi|190896606|gb|ACE96816.1| aspartyl protease [Populus tremula]
 gi|190896610|gb|ACE96818.1| aspartyl protease [Populus tremula]
 gi|190896612|gb|ACE96819.1| aspartyl protease [Populus tremula]
 gi|190896614|gb|ACE96820.1| aspartyl protease [Populus tremula]
 gi|190896616|gb|ACE96821.1| aspartyl protease [Populus tremula]
 gi|190896618|gb|ACE96822.1| aspartyl protease [Populus tremula]
 gi|190896620|gb|ACE96823.1| aspartyl protease [Populus tremula]
 gi|190896622|gb|ACE96824.1| aspartyl protease [Populus tremula]
 gi|190896624|gb|ACE96825.1| aspartyl protease [Populus tremula]
 gi|190896626|gb|ACE96826.1| aspartyl protease [Populus tremula]
 gi|190896628|gb|ACE96827.1| aspartyl protease [Populus tremula]
 gi|190896630|gb|ACE96828.1| aspartyl protease [Populus tremula]
 gi|190896632|gb|ACE96829.1| aspartyl protease [Populus tremula]
 gi|190896634|gb|ACE96830.1| aspartyl protease [Populus tremula]
 gi|190896636|gb|ACE96831.1| aspartyl protease [Populus tremula]
 gi|190896638|gb|ACE96832.1| aspartyl protease [Populus tremula]
 gi|190896640|gb|ACE96833.1| aspartyl protease [Populus tremula]
 gi|190896642|gb|ACE96834.1| aspartyl protease [Populus tremula]
 gi|190896644|gb|ACE96835.1| aspartyl protease [Populus tremula]
 gi|190896646|gb|ACE96836.1| aspartyl protease [Populus tremula]
 gi|190896648|gb|ACE96837.1| aspartyl protease [Populus tremula]
 gi|190896650|gb|ACE96838.1| aspartyl protease [Populus tremula]
 gi|190896652|gb|ACE96839.1| aspartyl protease [Populus tremula]
 gi|190896654|gb|ACE96840.1| aspartyl protease [Populus tremula]
 gi|190896656|gb|ACE96841.1| aspartyl protease [Populus tremula]
 gi|190896658|gb|ACE96842.1| aspartyl protease [Populus tremula]
          Length = 339

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 91/323 (28%), Positives = 131/323 (40%), Gaps = 32/323 (9%)

Query: 32  RLAYLQEKIRSHNTYQAQILPSNDISNSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHCY 90
           RL YL   +    T    I P   +     YV  + +G P    F  +DT +   WV C 
Sbjct: 16  RLKYL-STLADQKTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSNDAAWVPCS 74

Query: 91  PCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAY-KHRCIYTQLYLIGPETS 149
            C  CS    T F P+ S++   + C    C       C A     C++ Q Y  G ++S
Sbjct: 75  GCTGCS---STTFLPNASTTLGSLDCSEAQCSQVRGFSCPATGSSACLFNQSY--GGDSS 129

Query: 150 VFVSTEQ--LTFKNTDESTIHVQDVVFGC-GFSTNRNFKFSGIFGLGIGRSSLVSQLNS- 205
           +  +  Q  +T  N       +    FGC    +  +    G+ GLG G  SL+SQ  + 
Sbjct: 130 LAATLVQDAITLAND-----VIPGFTFGCINAVSGGSIPPQGLLGLGRGPISLISQAGAM 184

Query: 206 ---SFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH----YYITLEAISVDG 258
               FSYC+ S     Y    L LG          TPL   + H    YY+ L  +SV  
Sbjct: 185 YSGVFSYCLPSFKS-YYFSGSLKLGPVGQPKSIRTTPL-LRNPHRPSLYYVNLTGVSVGR 242

Query: 259 RMLDINPNIFKRD-DSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK 317
             + I       D ++G G +IDSGT +T  V+  Y A+RDE   ++ G  + S    D 
Sbjct: 243 IKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNGP-ISSLGAFDT 301

Query: 318 LCYHGIMSSDLKGFPTVRFHFRG 340
            C+     ++    P V  HF G
Sbjct: 302 -CFAATNEAEA---PAVTLHFEG 320


>gi|224133616|ref|XP_002327639.1| predicted protein [Populus trichocarpa]
 gi|222836724|gb|EEE75117.1| predicted protein [Populus trichocarpa]
          Length = 484

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 84/294 (28%), Positives = 130/294 (44%), Gaps = 49/294 (16%)

Query: 57  SNSLF-----YVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQL----GT----IF 103
           SN LF     Y N+S+G P V    A+DTGS+LLW+ C  C  C   L    GT    I+
Sbjct: 53  SNGLFGYILHYANVSVGTPSVSFLVALDTGSNLLWLPC-DCSSCVHSLRSPSGTVDLNIY 111

Query: 104 YPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLI-GPETSVFVSTEQLTFKNT 162
            P+ SS+   VPC+S  C      RC + +  C Y  +YL  G  T+ ++  + L   + 
Sbjct: 112 SPNTSSTSEKVPCNSTLCSQTQRDRCPSDQSNCPYQVVYLSNGTSTTGYIVQDLLHLISD 171

Query: 163 DESTIHV-QDVVFGCGFSTNRNF----KFSGIFGLGIGRSSLVSQL------NSSFSYCI 211
           D  +  V   + FGCG     +F      +G+FGLG+   S+ S L      + SFS C 
Sbjct: 172 DSQSKAVDAKITFGCGKVQTGSFLTGGAPNGLFGLGMSNISVPSTLAHNGYTSGSFSMCF 231

Query: 212 GSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH-----YYITLEAISVDGRMLDINPN 266
                P+ +  ++  GD     +G+ +   F  G      Y I++   S+ G+  D+   
Sbjct: 232 ----SPNGI-GRISFGDKGSTGQGETS---FNQGQPRSSLYNISITQTSIGGQASDL--- 280

Query: 267 IFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCY 320
           ++         + DSGT  T+L   AY  + +     ++  +  S   P   CY
Sbjct: 281 VYS-------AIFDSGTSFTYLNDPAYTLIAESFNKLVKETRRSSTQVPFDYCY 327


>gi|148910602|gb|ABR18371.1| unknown [Picea sitchensis]
          Length = 446

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 105/430 (24%), Positives = 175/430 (40%), Gaps = 61/430 (14%)

Query: 12  NNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPP 71
           N+   + A  V R  N    RL   Q  I S        L  N +   L+YV + +G P 
Sbjct: 38  NDEEGSKASFVSRDTNRIGRRLQAHQTAIFS--------LKGNVVPYGLYYVTMLVGNPS 89

Query: 72  VPQFTAMDTGSSLLWVHC-YPCRDCSP--------QLGTIFYPSRSSSYAYVPCDSEHCR 122
            P F  +D+GS L W+ C  PC  C+         + G++  PS+    A V   S H  
Sbjct: 90  KPYFLDVDSGSELTWIQCDAPCISCAKGPHPLYKLKKGSLV-PSKDPLCAAVQAGSGH-- 146

Query: 123 YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNR 182
              Y        RC Y   Y     +  F+  + +    T++ T+   + VFGCG++   
Sbjct: 147 ---YHNHKEASQRCDYDVAYADHGYSEGFLVRDSVRALLTNK-TVLTANSVFGCGYNQRE 202

Query: 183 NFKFS-----GIFGLGIGRSSLVSQ------LNSSFSYCI-GSLHDPDYLHNKLILGDGA 230
           +   S     GI GLG G +SL SQ      + +   +CI G+  D  Y    +  GD  
Sbjct: 203 SLPVSDARTDGILGLGSGMASLPSQWAKQGLIKNVIGHCIFGAGRDGGY----MFFGD-D 257

Query: 231 IIDEGDATPLQFID----GHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVT 286
           ++     T +  +      HYY+    ++   + LD + +  K     GG++ DSG+  T
Sbjct: 258 LVSTSAMTWVPMLGRPSIKHYYVGAAQMNFGNKPLDKDGDGKKL----GGIIFDSGSTYT 313

Query: 287 WLVKEAYEALRDEVMIRLEGEQMR--------SYSWPDKLCYHGIMSSDLKGFP-TVRFH 337
           +   +AY A    V   L G+Q+         S  W  K  +  +  +     P T++F 
Sbjct: 314 YFTNQAYGAFLSVVKENLSGKQLEQDSSDSFLSLCWRRKEGFRSVAEAAAYFKPLTLKFR 373

Query: 338 FRGGAKLALEKDSMFYQPRPDAFCMAV-NPASINNRYVNFSLIGMMAQQFYNVGYDIGRK 396
                ++ +  +      +    C+ + N  +I    V+ +++G ++ Q   V YD  + 
Sbjct: 374 STKTKQMEIFPEGYLVVNKKGNVCLGILNGTAIG--IVDTNVLGDISFQGQLVVYDNEKN 431

Query: 397 QKTFQRMDCE 406
           Q  + R DC+
Sbjct: 432 QIGWARSDCQ 441


>gi|340810915|gb|AEK75384.1| S5 [Oryza sativa]
 gi|340810917|gb|AEK75385.1| S5 [Oryza sativa]
 gi|340810919|gb|AEK75386.1| S5 [Oryza sativa]
 gi|340810927|gb|AEK75390.1| S5 [Oryza sativa]
 gi|340810975|gb|AEK75414.1| S5 [Oryza nivara]
 gi|340810979|gb|AEK75416.1| S5 [Oryza nivara]
 gi|340810995|gb|AEK75424.1| S5 [Oryza nivara]
 gi|340811027|gb|AEK75440.1| S5 [Oryza nivara]
 gi|340811063|gb|AEK75458.1| S5 [Oryza nivara]
          Length = 357

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 110/383 (28%), Positives = 162/383 (42%), Gaps = 66/383 (17%)

Query: 63  VNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DC---SPQLGTIFYPSRSSSYAYVPCDS 118
           + +S+G+PPV    A+DTGS+L WV C PC   C   S + G IF P RS +   V C S
Sbjct: 1   MAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVRCSS 60

Query: 119 EHC---RY---FPYARCSAYKHRCIYTQLYLIGPETSV-FVSTEQLTFKNTDESTIHVQD 171
             C   RY      A C   +  C Y+  Y  G   SV  + T+ L   ++        D
Sbjct: 61  VKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS------FMD 114

Query: 172 VVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN--------SSFSYCIGSLH-DPDYLHN 222
           ++FGC      +   +GIFG G    S   QL          +FSYC+ +    P Y   
Sbjct: 115 LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCLPTDETKPGY--- 171

Query: 223 KLILG--DGAIIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
            +ILG  D A +D G     + I+   Y +T+E +  +G+ L           S   +++
Sbjct: 172 -MILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVT---------SSSEMIV 221

Query: 280 DSGTDVTWLVKEAYEALRDEVMI---------RLEGEQMRSY----SWPDKLCYHGIMS- 325
           DSG   T L    + AL D+ +          R    +  SY    S  D   ++G ++ 
Sbjct: 222 DSGAQRTSLWPSTF-ALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITP 280

Query: 326 -SDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCM--AVNPASINNRYVNFSLIGMM 382
            S+    P +   F GGA LAL   ++FY       CM  A NPA      +   ++G  
Sbjct: 281 FSNWSALPLLEIGFAGGAALALSPRNVFYNDPHRGLCMTFAQNPA------LRSQILGNR 334

Query: 383 AQQFYNVGYDIGRKQKTFQRMDC 405
             + +   +DI  KQ  F+   C
Sbjct: 335 VTRSFGTTFDIQGKQFGFKYAAC 357


>gi|326513976|dbj|BAJ92138.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 342

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 72/256 (28%), Positives = 127/256 (49%), Gaps = 21/256 (8%)

Query: 167 IHVQDVV-FGCG-FSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNK 223
           +HV+  + FGCG  S       SG+ GL  G  SL+SQL+   FSYC+    +     + 
Sbjct: 88  VHVRRALGFGCGALSAGSLVGASGLMGLSPGTMSLISQLSVPRFSYCLTPFAERKT--SP 145

Query: 224 LILGDGAIIDEGDAT-PLQ--------FIDG-HYYITLEAISVDGRMLDI-NPNIFKRDD 272
           ++ G  A + + + T P+Q         +D  +YY+ L  +S+  + L +   ++    D
Sbjct: 146 MLFGAMADLRKYNTTGPIQTTAILRNPAMDTFYYYVPLVGLSLGTKRLRVPAASLAINPD 205

Query: 273 SGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCY---HGIMSSDLK 329
             GG ++DSG+ +  L  +A++A++  V+  ++           +LC+    G+  + +K
Sbjct: 206 GTGGTIVDSGSTMAHLAGKAFDAVKKAVLEAVKLPVFNGTVEDYELCFAVPSGVAMAAVK 265

Query: 330 GFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNV 389
             P V  HF GGA +AL +D+ F +PR    C+AV   S  +     S+IG + QQ  +V
Sbjct: 266 TPPLV-LHFDGGAAMALPRDNYFQEPRAGLMCLAVA-RSPEDLGAPISIIGNVQQQNMHV 323

Query: 390 GYDIGRKQKTFQRMDC 405
            +D+  ++ +F    C
Sbjct: 324 LFDVHNQKFSFAPTKC 339


>gi|340811098|gb|AEK75475.1| S5 [Oryza nivara]
          Length = 357

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 109/383 (28%), Positives = 160/383 (41%), Gaps = 66/383 (17%)

Query: 63  VNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DC---SPQLGTIFYPSRSSSYAYVPCDS 118
           + +S+G+PPV    A+DTGS+L WV C PC   C   S + G IF P RS +   V C S
Sbjct: 1   MAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVRCSS 60

Query: 119 EHCRYFPY------ARCSAYKHRCIYTQLYLIGPETSV-FVSTEQLTFKNTDESTIHVQD 171
             C    Y      A C   +  C Y+  Y  G   SV  + T+ L   ++        D
Sbjct: 61  VKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS------FMD 114

Query: 172 VVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN--------SSFSYCIGSLH-DPDYLHN 222
           ++FGC      +   +GIFG G    S   QL          +FSYC+ +    P Y   
Sbjct: 115 LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCLPTDETKPGY--- 171

Query: 223 KLILG--DGAIIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
            +ILG  D A +D G     + I+   Y +T E +  +G+ L           S   +++
Sbjct: 172 -MILGRYDRAAMDGGYTPLFRSINRPTYSLTTEMLIANGQRLVT---------SSSEMIV 221

Query: 280 DSGTDVTWLVKEAYEALRDEVMI---------RLEGEQMRSY----SWPDKLCYHGIMS- 325
           DSG   T L    + AL D+ +          R    +  SY    S  D   ++G ++ 
Sbjct: 222 DSGAQRTSLWPSTF-ALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITP 280

Query: 326 -SDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCM--AVNPASINNRYVNFSLIGMM 382
            S+    P +   F GGA LAL   ++FY       CM  A NPA      +   ++G  
Sbjct: 281 FSNWSALPLLEIGFAGGAALALSPRNVFYNDPHRGLCMTFAQNPA------LRSQILGNR 334

Query: 383 AQQFYNVGYDIGRKQKTFQRMDC 405
             + +   +DI  KQ  F+   C
Sbjct: 335 VTRSFGTTFDIQGKQFGFKYAAC 357


>gi|222628951|gb|EEE61083.1| hypothetical protein OsJ_14969 [Oryza sativa Japonica Group]
          Length = 367

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 77/280 (27%), Positives = 117/280 (41%), Gaps = 28/280 (10%)

Query: 22  VQRGINISIARLAYLQEKIRSHNTYQAQILPSNDI--SNSLFYVNISIGQPPVPQFTAMD 79
           ++R I  S  RLA +        + +  ++    I  +   + V + IG PP     A+D
Sbjct: 48  LRRAIQRSRYRLAGIGMARGEAASARKAVVAETPIMPAGGEYLVKLGIGTPPYKFTAAID 107

Query: 80  TGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCS-AYKHRCIY 138
           T S L+W  C PC  C  Q+  +F P  SS+YA +PC S+ C      RC       C Y
Sbjct: 108 TASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHDDDESCQY 167

Query: 139 TQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNF---KFSGIFGLGIG 195
           T  Y     T   ++ ++L            + V FGC  S+       + SG+ GLG G
Sbjct: 168 TYTYSGNATTEGTLAVDKLVIGED-----AFRGVAFGCSTSSTGGAPPPQASGVVGLGRG 222

Query: 196 RSSLVSQLN-----------SSFSYCIGSLHDP--DYLHNKLILGDGAIIDEGDATPLQF 242
             SLVSQL+           S+ ++   SL+D   + L  ++ L  G     G       
Sbjct: 223 PLSLVSQLSVRRYGMIIDIASTITFLEASLYDELVNDLEVEIRLPRGTGSSLGLDLCFIL 282

Query: 243 IDG----HYYITLEAISVDGRMLDINPNIFKRDDSGGGVM 278
            DG      Y+   A++ DGR L ++      +D   G+M
Sbjct: 283 PDGVAFDRVYVPAVALAFDGRWLRLDKARLFAEDRESGMM 322


>gi|190896608|gb|ACE96817.1| aspartyl protease [Populus tremula]
          Length = 339

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 91/323 (28%), Positives = 131/323 (40%), Gaps = 32/323 (9%)

Query: 32  RLAYLQEKIRSHNTYQAQILPSNDISNSLFYV-NISIGQPPVPQFTAMDTGSSLLWVHCY 90
           RL YL   +    T    I P   +     YV  + +G P    F  +DT +   WV C 
Sbjct: 16  RLKYL-STLADQKTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSNDAAWVPCS 74

Query: 91  PCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAY-KHRCIYTQLYLIGPETS 149
            C  CS    T F P+ S++   + C    C       C A     C++ Q Y  G ++S
Sbjct: 75  GCTGCS---STTFLPNASTTLGSLDCSEAQCSQVRGFSCPATGSSACLFNQSY--GGDSS 129

Query: 150 VFVSTEQ--LTFKNTDESTIHVQDVVFGC-GFSTNRNFKFSGIFGLGIGRSSLVSQLNS- 205
           +  +  Q  +T  N       +    FGC    +  +    G+ GLG G  SL+SQ  + 
Sbjct: 130 LAATLVQDAITLAND-----VIPGFTFGCINAVSGGSIPPQGLLGLGRGPISLISQAGAM 184

Query: 206 ---SFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGH----YYITLEAISVDG 258
               FSYC+ S     Y    L LG          TPL   + H    YY+ L  +SV  
Sbjct: 185 YSGVFSYCLPSFKS-YYFSGSLKLGPVGQPKSIRTTPL-LRNPHRPSLYYVNLTGVSVGR 242

Query: 259 RMLDINPNIFKRD-DSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDK 317
             + I       D ++G G +IDSGT +T  V+  Y A+RDE   ++ G  + S    D 
Sbjct: 243 IKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNGP-ISSLGAFDT 301

Query: 318 LCYHGIMSSDLKGFPTVRFHFRG 340
            C+     ++    P V  HF G
Sbjct: 302 -CFAETNEAEA---PAVTLHFEG 320


>gi|340810959|gb|AEK75406.1| S5 [Oryza sativa]
 gi|340810971|gb|AEK75412.1| S5 [Oryza rufipogon]
          Length = 357

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 110/383 (28%), Positives = 162/383 (42%), Gaps = 66/383 (17%)

Query: 63  VNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DC---SPQLGTIFYPSRSSSYAYVPCDS 118
           + +S+G+PPV    A+DTGS+L WV C PC   C   S + G IF P RS +   V C S
Sbjct: 1   MAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVRCSS 60

Query: 119 EHC---RY---FPYARCSAYKHRCIYTQLYLIGPETSV-FVSTEQLTFKNTDESTIHVQD 171
             C   RY      A C   +  C Y+  Y  G   SV  + T+ L   ++        D
Sbjct: 61  VKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS------FMD 114

Query: 172 VVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN--------SSFSYCIGSLH-DPDYLHN 222
           ++FGC      +   +GIFG G    S   QL          +FSYC+ +    P Y   
Sbjct: 115 LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCLPTDETKPGY--- 171

Query: 223 KLILG--DGAIIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
            +ILG  D A +D G     + I+   Y +T+E +  +G+ L           S   +++
Sbjct: 172 -MILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVT---------SSSEMIV 221

Query: 280 DSGTDVTWLVKEAYEALRDEVMI---------RLEGEQMRSY----SWPDKLCYHGIMS- 325
           DSG   T L    + AL D+ +          R    +  SY    S  D   ++G ++ 
Sbjct: 222 DSGAQRTSLWPSTF-ALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITP 280

Query: 326 -SDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCM--AVNPASINNRYVNFSLIGMM 382
            S+    P +   F GGA LAL   ++FY       CM  A NPA      +   ++G  
Sbjct: 281 FSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPA------LRSQILGNR 334

Query: 383 AQQFYNVGYDIGRKQKTFQRMDC 405
             + +   +DI  KQ  F+   C
Sbjct: 335 VTRSFGTTFDIQGKQFGFKYAAC 357


>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
 gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
          Length = 388

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 93/386 (24%), Positives = 161/386 (41%), Gaps = 52/386 (13%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG-----TIFYPSRSSSYAYV 114
           L++  + +G P       +DTGS +LWV+C PC  C  +       T++ P  SS+ + V
Sbjct: 1   LYFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSLV 60

Query: 115 PCDSEHC---RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIH--V 169
            C    C   R F  A+CS   + C Y   Y  G  +  +   + + +     + +    
Sbjct: 61  SCSDPLCVRGRRFAEAQCSQATNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANTT 120

Query: 170 QDVVFGCGFSTNRNFKFS-----GIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPD 218
             V+FGC      +   S     GI G G    S+ +QL +       FS+C+       
Sbjct: 121 SQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLEGEKRGG 180

Query: 219 YLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVM 278
            +     + +  +      TPL     HY + L  ISV+   L I+   F   +   GV+
Sbjct: 181 GILVIGGIAEPGMT----YTPLVPDSVHYNVVLRGISVNSNRLPIDAEDFSSTND-TGVI 235

Query: 279 IDSGTDVTWLVKEAY----EALRDEVM---IRLEGEQMRSYSWPDKLCYHGIMSSDLKGF 331
           +DSGT + +    AY    +A+R+      +R++G   + +    +L       SDL  F
Sbjct: 236 MDSGTTLAYFPSGAYNVFVQAIREATSATPVRVQGMDTQCFLVSGRL-------SDL--F 286

Query: 332 PTVRFHFRGGAKLALEKDS--MFYQPRP----DAFCMAVNPASIN---NRYVNFSLIGMM 382
           P V  +F GGA + L+ D+  M+    P    D +C+    +S +         +++G +
Sbjct: 287 PNVTLNFEGGA-MELQPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDI 345

Query: 383 AQQFYNVGYDIGRKQKTFQRMDCEVL 408
             +   V YD+   +  +   +C+ L
Sbjct: 346 VLKDKLVVYDLDNSRIGWMSYNCKFL 371


>gi|226495123|ref|NP_001141522.1| uncharacterized protein LOC100273634 precursor [Zea mays]
 gi|194704920|gb|ACF86544.1| unknown [Zea mays]
 gi|223949445|gb|ACN28806.1| unknown [Zea mays]
 gi|413924531|gb|AFW64463.1| pepsin A [Zea mays]
          Length = 515

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 97/388 (25%), Positives = 160/388 (41%), Gaps = 57/388 (14%)

Query: 52  PSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG---------TI 102
           P ND+   L+Y  + +G P      A+DTGS L WV C  C  C+P  G          I
Sbjct: 88  PGNDL-GWLYYAWVDVGTPATSFLVALDTGSDLFWVPC-DCIQCAPLSGYRGNLDRDLRI 145

Query: 103 FYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT 162
           + P+ S++  ++PC  E C+  P   C+  K  C Y   Y     TS  +  E     N 
Sbjct: 146 YRPAESTTSRHLPCSHELCQSVP--GCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNY 203

Query: 163 DESTIHVQ-DVVFGCGFSTNRNF----KFSGIFGLGIGRSSLVSQL------NSSFSYCI 211
            E  + V   V+ GCG   + ++       G+ GLG+   S+ S L       +SFS C 
Sbjct: 204 REDHVPVNASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQNSFSMCF 263

Query: 212 GSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLD---INPNIF 268
                 +    ++  GD  +  +  +TP  F+  +  +   A++VD   +    +    F
Sbjct: 264 K-----EDSSGRIFFGDQGVPSQ-QSTP--FVPLYGKLQTYAVNVDKSCIGHKCLEGTSF 315

Query: 269 KRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDL 328
           K        ++DSGT  T L  + Y+A   E   ++   ++       K CY      ++
Sbjct: 316 K-------ALVDSGTSFTSLPFDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSA-SPLEM 367

Query: 329 KGFPTVRFHFRGGAKLALEKDSMFYQPRPDA---FCMAVNPASINNRYVNFSLIGMMAQQ 385
              PT+   F     L      + +  +  A   FC+AV P++          IG++AQ 
Sbjct: 368 PDVPTITLTFAADKSLQAVNPILPFNDKQGALAGFCLAVLPST--------EPIGIIAQN 419

Query: 386 F---YNVGYDIGRKQKTFQRMDCEVLDD 410
           F   Y+V +D    +  + R +C  ++D
Sbjct: 420 FLVGYHVVFDRESMKLGWYRSECRYVED 447


>gi|108862257|gb|ABA96612.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 451

 Score = 85.9 bits (211), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 99/385 (25%), Positives = 163/385 (42%), Gaps = 54/385 (14%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC-YPCRDCSPQLGTIFYPSRSSSYAYVPC 116
           + L+YV +SIG PP P F  +DTGS L W+ C  PC  CS     ++ P+++     VPC
Sbjct: 55  HGLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLYRPTKNK---LVPC 111

Query: 117 DSEHCRYFPYA-----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
             + C           +C + K +C Y   Y     +   + T+    +  + S +    
Sbjct: 112 VDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRLANSSIVR-PG 170

Query: 172 VVFGCGF-----STNRNFKFSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYL 220
           + FGCG+     S+       G+ GLG G  SL+SQL       +   +C+ +       
Sbjct: 171 LAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTRGG---- 226

Query: 221 HNKLILGDGAIIDEGDAT--PLQFIDGHYYITLEAISV--DGRMLDINPNIFKRDDSGGG 276
              L  GD  I+    AT  P+       Y +  + ++   GR L + P           
Sbjct: 227 -GFLFFGD-DIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPM---------E 275

Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEG--EQMRSYSWPDKLCYHGI--MSSDL---K 329
           V+ DSG+  T+   + Y+AL D +   L    +++  +S P  LC+ G     S L   K
Sbjct: 276 VVFDSGSSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLP--LCWKGKKPFKSVLDVKK 333

Query: 330 GFPTVRFHFRGGAKLALE---KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQF 386
            F TV   F  G K  +E   ++ +      +A    +N + +  + +N  ++G +  Q 
Sbjct: 334 EFRTVVLSFSNGKKALMEIPPENYLIVTKYGNACLGILNGSEVGLKDLN--IVGDITMQD 391

Query: 387 YNVGYDIGRKQKTFQRMDCEVLDDD 411
             V YD  R Q  + R  C+ + +D
Sbjct: 392 QMVIYDNERGQIGWIRAPCDRIPND 416


>gi|297723027|ref|NP_001173877.1| Os04g0336942 [Oryza sativa Japonica Group]
 gi|255675342|dbj|BAH92605.1| Os04g0336942 [Oryza sativa Japonica Group]
          Length = 388

 Score = 85.9 bits (211), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 85/302 (28%), Positives = 130/302 (43%), Gaps = 36/302 (11%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTI----FYPSRSS-SYAYV 114
           L+Y +I IG P V  +  +DTGS   WV+   C+ C  +   +    FY  RSS S   V
Sbjct: 82  LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEV 141

Query: 115 PCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFK---NTDESTIHVQD 171
            CD   C   P    +    RC Y   Y  G  T   + T+ L +       ++      
Sbjct: 142 KCDDTICTSRPPCNMTL---RCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTS 198

Query: 172 VVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYL 220
           V FGCG        N      GI G G    + +SQL ++      FS+C+      D  
Sbjct: 199 VTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCL------DST 252

Query: 221 HNKLILGDGAIID-EGDATPLQFIDGHYY-ITLEAISVDGRMLDINPNIFKRDDSGGGVM 278
           +   I   G +++ +   TP+   +  Y+ + L++I+V G  L +  NIF    +  G  
Sbjct: 253 NGGGIFAIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKT-KGTF 311

Query: 279 IDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRS-YSWPDKLCYHGIMSSDLKGFPTVRFH 337
           IDSG+ + +L +  Y  L   V  +     M + Y++    C+H + S D K FP + FH
Sbjct: 312 IDSGSTLVYLPEIIYSELILAVFAKHPDITMGAMYNFQ---CFHFLGSVDDK-FPKITFH 367

Query: 338 FR 339
           F 
Sbjct: 368 FE 369


>gi|242074844|ref|XP_002447358.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
 gi|241938541|gb|EES11686.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
          Length = 497

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 108/395 (27%), Positives = 162/395 (41%), Gaps = 54/395 (13%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHC---YPCRDCSPQLGT---IFYPSRSSSYAYV 114
           +    S+G PP P    +DTGS L WV C   Y CR+CS        +F+P  SSS   V
Sbjct: 103 YAFTASLGTPPQPLPVLLDTGSQLTWVPCTSNYDCRNCSSPFAAAVPVFHPKNSSSSRLV 162

Query: 115 PCDSEHCRYF----PYARCSAYKHRCI-YTQLYLIGPETSVFV---STEQLTFKNTDEST 166
            C +  C +       A+C A   R    T    + P  +V     ST  L   +T  + 
Sbjct: 163 GCRNPSCLWVHSAEHVAKCRAPCSRGANCTPASNVCPPYAVVYGSGSTAGLLIADTLRAP 222

Query: 167 IH-VQDVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGS--LHDPDYLHN 222
              V   V GC   +      SG+ G G G  S+ +QL  S FSYC+ S    D   +  
Sbjct: 223 GRAVSGFVLGCSLVSVHQ-PPSGLAGFGRGAPSVPAQLGLSKFSYCLLSRRFDDNAAVSG 281

Query: 223 KLILG---DG------AIIDEGDATPLQFIDGHYYITLEAISVDGRMLDI-NPNIFKRDD 272
            L+LG   DG           GD  P      +YY+ L  ++V G+ + +          
Sbjct: 282 SLVLGGDNDGMQYVPLVKSAAGDKQPYAV---YYYLALSGVTVGGKAVRLPARAFAANAA 338

Query: 273 SGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL----CYHGIMSSDL 328
             GG ++DSGT  T+L    ++ + D V+  + G   RS    + L    C+     +  
Sbjct: 339 GSGGAIVDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDVEEGLGLHPCFALPQGAKS 398

Query: 329 KGFPTVRFHFRGGAKLALEKDSMFY----QPRP---------DAFCMAV-----NPASIN 370
              P +  HF+GGA + L  ++ F      P P         +A C+AV        + +
Sbjct: 399 MALPELSLHFKGGAVMQLPLENYFVVAGRAPVPGAGAGAGAAEAICLAVVTDFGGSGAGD 458

Query: 371 NRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
                  ++G   QQ Y V YD+ +++  F+R  C
Sbjct: 459 EGGGPAIILGSFQQQNYLVEYDLEKERLGFRRQPC 493


>gi|224142001|ref|XP_002324349.1| predicted protein [Populus trichocarpa]
 gi|222865783|gb|EEF02914.1| predicted protein [Populus trichocarpa]
          Length = 490

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 105/377 (27%), Positives = 151/377 (40%), Gaps = 37/377 (9%)

Query: 48  AQILPSND---ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTIF 103
           +  +P+ D   + +  + V + +G P        DTGS + W  C PC R C  Q   IF
Sbjct: 133 STTIPAKDGSTVGSGNYIVTVGLGTPKKDLSLIFDTGSDITWTQCQPCARSCYKQKEQIF 192

Query: 104 YPSRSSSYAYVPCDSEHCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQLT 158
            PS+S+SY  + C S  C     A      C++    C+Y   Y     +  F  TE+LT
Sbjct: 193 DPSQSTSYTNISCSSSICNSLTSATGNTPGCAS--SACVYGIQYGDSSFSVGFFGTEKLT 250

Query: 159 FKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRS--SLVSQL----NSSFSYCIG 212
             +TD       ++ FGCG   N+         LG+GR   S+VSQ     N  FSYC  
Sbjct: 251 LTSTDA----FNNIYFGCG-QNNQGLFGGSAGLLGLGRDKLSVVSQTAQKYNKIFSYC-- 303

Query: 213 SLHDPDYLHNKLILGDGAIIDEGDATPLQFIDG---HYYITLEAISVDGRMLDINPNIFK 269
            L         L  G G+       TPL  I      Y +    ISV G+ L I+ ++F 
Sbjct: 304 -LPSSSSSTGFLTFG-GSASKNAKFTPLSTISAGPSFYGLDFTGISVGGKKLAISASVF- 360

Query: 270 RDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLK 329
              S  G +IDSGT +T L   AY ALR      +    M         CY    S    
Sbjct: 361 ---STAGAIIDSGTVITRLPPAAYSALRASFRNLMSKYPMTKALSILDTCYD-FSSYTTI 416

Query: 330 GFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNV 389
             P + F F  G ++ ++   + Y       C+A    + N+   +  + G + Q+   V
Sbjct: 417 SVPKIGFSFSSGIEVDIDATGILYASSLSQVCLAF---AGNSDATDVFIFGNVQQKTLEV 473

Query: 390 GYDIGRKQKTFQRMDCE 406
            YD    +  F    C 
Sbjct: 474 FYDGSAGKVGFAPGGCS 490


>gi|195619700|gb|ACG31680.1| pepsin A [Zea mays]
          Length = 485

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 98/390 (25%), Positives = 162/390 (41%), Gaps = 61/390 (15%)

Query: 52  PSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLG---------TI 102
           P ND+   L+Y  + +G P      A+DTGS L WV C  C  C+P  G          I
Sbjct: 58  PGNDL-GWLYYAWVDVGTPATSFLVALDTGSDLFWVPC-DCIQCAPLSGYRGNLDRDLRI 115

Query: 103 FYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT 162
           + P+ S++  ++PC  E C+  P   C+  K  C Y   Y     TS  +  E     N 
Sbjct: 116 YRPAESTTSRHLPCSHELCQSVP--GCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNY 173

Query: 163 DESTIHVQ-DVVFGCGFSTNRNF----KFSGIFGLGIGRSSLVSQL------NSSFSYCI 211
            E  + V   V+ GCG   + ++       G+ GLG+   S+ S L       +SFS C 
Sbjct: 174 REDHVPVNASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQNSFSMCF 233

Query: 212 GSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLD---INPNIF 268
                 +    ++  GD  +  +  +TP  F+  +  +   A++VD   +    +    F
Sbjct: 234 K-----EDSSGRIFFGDQGVPSQ-QSTP--FVPLYGKLQTYAVNVDKSCIGHKCLEGTSF 285

Query: 269 KRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQM--RSYSWPDKLCYHGIMSS 326
           K        ++DSGT  T L  + Y+A   E   ++   ++     +W  K CY      
Sbjct: 286 K-------ALVDSGTSFTSLPLDVYKAFTMEFDKQMNATRVPYEDTTW--KYCYSA-SPL 335

Query: 327 DLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDA---FCMAVNPASINNRYVNFSLIGMMA 383
           ++   PT+   F     L      + +  +  A   FC+AV P++          IG++A
Sbjct: 336 EMPDVPTITLTFAADKSLQAVNPILPFNDKQGALAGFCLAVLPST--------EPIGIIA 387

Query: 384 QQF---YNVGYDIGRKQKTFQRMDCEVLDD 410
           Q F   Y+V +D    +  + R +C  ++D
Sbjct: 388 QNFLVGYHVVFDRESMKLGWYRSECHDVED 417


>gi|388495448|gb|AFK35790.1| unknown [Medicago truncatula]
          Length = 434

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 95/360 (26%), Positives = 141/360 (39%), Gaps = 28/360 (7%)

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
           I +  + V    G PP     A+DT S   W+ C  C  CS      F P +S+S+  V 
Sbjct: 92  IQSPTYIVKAKFGTPPQTLLLALDTSSDAAWIPCSGCVGCS--TSKPFAPIKSTSFRNVS 149

Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
           C S HC+  P   C      C +   Y     +S+  S  Q T     +    +    FG
Sbjct: 150 CGSPHCKQVPNPTCGG--SACAFNFTYG---SSSIAASVVQDTLTLAADP---IPGYTFG 201

Query: 176 C-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGA 230
           C     G S  +        G     S   +   S+FSYC+ S    ++    L LG   
Sbjct: 202 CVNKTTGSSAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCLPSFKSINF-SGSLRLGPVY 260

Query: 231 IIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVT 286
                  TPL         YY+ L AI V  +++DI P        +G G + DSGT  T
Sbjct: 261 QPKRIKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTTGAGTIFDSGTVFT 320

Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
            L +  Y A+R+E   R+  +   +       CY+  +       PT+ F F  G  +AL
Sbjct: 321 RLAEPVYTAVRNEFRRRVGPKLPVTTLGGFDTCYNVPIV-----VPTITFLF-SGMNVAL 374

Query: 347 EKDSM-FYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
             D++  +       C+A+  A  N   V  ++I  M QQ + V +D+   +    R  C
Sbjct: 375 PPDNIVIHSTAGSTTCLAMAGAPDNVNSV-LNVIANMQQQNHRVLFDVPNSRIGIARELC 433


>gi|357492303|ref|XP_003616440.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517775|gb|AES99398.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 521

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 97/412 (23%), Positives = 162/412 (39%), Gaps = 83/412 (20%)

Query: 30  IARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC 89
           + ++++L+ K     +   + LP     N    V++++G PP      +DTGS L W+HC
Sbjct: 7   VVQISFLKVKTLPQTSLSPRKLPFQH--NVTLTVSLTVGSPPQRVTMVLDTGSELSWLHC 64

Query: 90  YPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCR-----YFPYARCSAYKHRCIYTQLYLI 144
                  P L  IF P  SSSY   PC S  C            C A K  C     ++ 
Sbjct: 65  KKL----PNLNFIFNPLVSSSYTPTPCTSPICTTQTRDLINPVSCDANK-LCHIITFFVG 119

Query: 145 GPETSVFVSTEQLTFKNTDESTIHVQDVVFGC----GFSTNRNFKFSGIFGLGIGRSSLV 200
           GP                       + +VFGC      S + + K +G+ G+ +G  S  
Sbjct: 120 GPAQ---------------------RGMVFGCMDTGTSSGDEDSKTTGLMGMDLGSLSFS 158

Query: 201 SQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAIS---- 255
           +Q+    FSYCI +               G ++ E  A P +    HY   ++  +    
Sbjct: 159 NQMRLPKFSYCISNKDS-----------TGVLVLENIANPPRLGPLHYTPLVKKTTPLPY 207

Query: 256 VDGRMLDINPNIFKRDDSGGG-VMIDSGTDVTWLVKEAYEALRDEVMIRLE------GEQ 308
            +        + F  D +G G  M+DS T  T+L +  Y AL++E  I+ +      G+ 
Sbjct: 208 FNRNCCLFQKSAFLPDHTGAGQTMVDSATQFTFLRQPVYTALKNEFAIQTKNILTPLGDP 267

Query: 309 MRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPAS 368
              +     LC+   + S L   P V   F  GA+L +  + + Y+         V+  +
Sbjct: 268 KFVFQGVMDLCFRVPIGSTLPVLPVVTLMF-DGAELRVTGERLLYK---------VSNVA 317

Query: 369 INNRYV------NFSLIGMMA-------QQFYNVGYDIGRKQKTFQRMDCEV 407
            +N ++      N  L+G+ A       Q+   + YD+   +  F   +C+V
Sbjct: 318 KSNSWIYCFTFGNSDLLGIEAFIIGHHHQRNVWMEYDLANSRIGFSDTNCDV 369


>gi|302802500|ref|XP_002983004.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
 gi|300149157|gb|EFJ15813.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
          Length = 332

 Score = 85.5 bits (210), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 91/361 (25%), Positives = 152/361 (42%), Gaps = 45/361 (12%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSE 119
           ++Y  I++G PP      MDTGS L WV C P   CSP   + F    S++Y  + C  +
Sbjct: 2   VYYSTITLGSPPKDFSLVMDTGSDLTWVRCDP---CSPDCSSTFDRLASNTYKALTCADD 58

Query: 120 HCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS 179
           +   + Y   S       +TQ         + V T ++    +DE        VFGCG  
Sbjct: 59  YS--YGYGDGS-------FTQ-------GDLSVDTLKMAGAASDELE-EFPGFVFGCGSL 101

Query: 180 TNRNFKFS-GIFGLGIGRSSLVSQL----NSSFSYCIGSLHDPDYLHNK-LILGDGAI-- 231
                    GI  L  G  S  SQ+     + FSYC+      + L    ++ G+ A+  
Sbjct: 102 LKGLISGEVGILALSPGSLSFPSQIGEKYGNKFSYCLLRQTAQNSLKKSPMVFGEAAVEL 161

Query: 232 -------IDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTD 284
                  + E   TP+     +Y + L+ ISV  + LD++P+ F  +      + DSGT 
Sbjct: 162 KEPGSGKLQELQYTPIGESSIYYTVRLDGISVGNQRLDLSPSAF-LNGQDKPTIFDSGTT 220

Query: 285 VTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKL 344
           +T L     ++++  +   + G +  +    D  C+  +  S  +G P + FHF GGA  
Sbjct: 221 LTMLPPGVCDSIKQSLASMVSGAEFVAIKGLDA-CFR-VPPSSGQGLPDITFHFNGGADF 278

Query: 345 ALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMD 404
            + + S +        C+   P +        S+ G + QQ + V +D+  ++  F+  D
Sbjct: 279 -VTRPSNYVIDLGSLQCLIFVPTN------EVSIFGNLQQQDFFVLHDMDNRRIGFKETD 331

Query: 405 C 405
           C
Sbjct: 332 C 332


>gi|326529727|dbj|BAK04810.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 488

 Score = 85.5 bits (210), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 113/447 (25%), Positives = 181/447 (40%), Gaps = 67/447 (14%)

Query: 20  HRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDI----SNSLFYVNISIGQPPVPQF 75
           H + R    S+AR      ++R H+  QA   P        S   +  ++S+G PP P  
Sbjct: 45  HPLSRLARASLAR----ASRLRGHHQGQAASSPVRAALYPHSYGGYAFSLSLGTPPQPLP 100

Query: 76  TAMDTGSSLLWVHC---YPCRDCSPQLGT--IFYPSRSSSYAYVPCDSEHCRYF------ 124
             +DTGS L WV C   Y C++CS   G+  +F+P  SSS   V C S  C +       
Sbjct: 101 VLLDTGSHLTWVPCTSNYQCQNCSAAAGSFPVFHPKSSSSSLLVSCSSPSCLWIHSKSHL 160

Query: 125 -----PYARCSAYKHRCIYTQLYLIGPETSVF--VSTEQLTFKNT---DESTIHVQDVVF 174
                  A C      C  T   +  P   V+   ST  L   +T          ++   
Sbjct: 161 SDCARDSAPCRPSTANCSATATNVCPPYLVVYGSGSTAGLLVSDTLRLSPRGAASRNFAV 220

Query: 175 GCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGS--LHDPDYLHNKLILGDGA- 230
           GC  ++      SG+ G G G  S+ +QL  + FSYC+ S    D   +  +L+LG  + 
Sbjct: 221 GCSLASVHQ-PPSGLAGFGRGAPSVPAQLGVNKFSYCLLSRRFDDDAAISGELVLGASSA 279

Query: 231 -----------IIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFK--RDDSGGGV 277
                      ++    A P   +  +YY++L  I+V G+ + +            GGG 
Sbjct: 280 GKAKAMMQYAPLLKNAGARPPYSV--YYYLSLTGIAVGGKSVALPARALAPVSGGGGGGA 337

Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL----CYHGIMSSDLKGFPT 333
           +IDSGT  T+L    ++ +   ++  + G   RS      L    C+     +     P 
Sbjct: 338 IIDSGTTFTYLDPTVFKPVAAAMVAAVGGRYNRSKDVEGALGLRPCFALPAGARTMDLPE 397

Query: 334 VRFHFRGGAKLALEKDSMFYQP------RPDAFCMAV--------NPASINNRYVNFSLI 379
           +  HF GGA++ L  ++ F          P+A C+AV          A ++       ++
Sbjct: 398 LSLHFSGGAEMRLPIENYFLAAGPASGVAPEAICLAVVSDVSSASGGAGVSGGGGPAIIL 457

Query: 380 GMMAQQFYNVGYDIGRKQKTFQRMDCE 406
           G   QQ Y V YD+ + +  F++  C 
Sbjct: 458 GSFQQQNYQVEYDLEKNRLGFRQQPCS 484


>gi|2570402|gb|AAB97155.1| EEA1 [Hordeum vulgare subsp. vulgare]
          Length = 410

 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 94/390 (24%), Positives = 162/390 (41%), Gaps = 66/390 (16%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYP----CRDCSPQLGTIFYPSRSSSYAYVPC 116
           FY  ++IG+P  P F  +DTGS+L W+ C+P    C+ C P+    +Y         V C
Sbjct: 38  FYATLNIGEPAKPYFLDVDTGSNLTWLECHPPVHGCKGCHPRPPHPYYTPADGKLKVV-C 96

Query: 117 DSEHC----RYFP-YARCSAY-KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQ 170
            S  C    R  P    CS    HRC Y   Y+ G ++   ++T+ ++    D+  I   
Sbjct: 97  GSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVTG-KSEGDLATDIISVNGRDKKRI--- 152

Query: 171 DVVFGCGFS-----TNRNFKFSGIFGLGIGRSSLVSQLN-------SSFSYCIGSLHDPD 218
              FGCG+       +     +GI GLG+G++   +QL        +   +C+ S     
Sbjct: 153 --AFGCGYKQEEPPDSPPSPVNGILGLGMGKAGFAAQLKGLKMIKENVIGHCLSSKG--- 207

Query: 219 YLHNKLILGDGAIIDEG-DATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGV 277
                L +GD      G    P++    +Y   L  + +D + +  NP            
Sbjct: 208 --KGVLYVGDFNPPTRGVTWAPMRESLFYYSPGLAEVFIDKQPIRGNPTF--------EA 257

Query: 278 MIDSGTDVTWLVKEAYEALRDEVMIRLEG-------EQMRSYSWPDKLCYHGI------- 323
           + DSG+  T +  + Y    +E++ ++ G       E+++  + P  LC+ G        
Sbjct: 258 VFDSGSTYTHVPAQIY----NEIVSKVRGTFSESSLEEVKGRALP--LCWKGKKPFGSVN 311

Query: 324 -MSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINN--RYVNFSLIG 380
            + +  K       H RG   L +   +  +       C+A+  AS++   + +NF LIG
Sbjct: 312 DVKNQFKALSLKITHARGTNNLDIPPQNYLFVKEDGETCLAILDASLDPVLKELNFILIG 371

Query: 381 MMAQQFYNVGYDIGRKQKTFQRMDCEVLDD 410
            +  Q   V YD  +KQ  + R  C+ + +
Sbjct: 372 AVTMQDLFVIYDNEKKQLGWVRAQCDRVQE 401


>gi|356567798|ref|XP_003552102.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 520

 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 109/418 (26%), Positives = 169/418 (40%), Gaps = 64/418 (15%)

Query: 22  VQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTG 81
           ++R I +  AR   L     SH +    +   ND    L Y  I IG P      A+D G
Sbjct: 63  LRRKIKVGGARYQLL---FPSHGSKTMSL--GNDF-GWLHYTWIDIGTPSTSFLVALDAG 116

Query: 82  SSLLWVHCYPCRDCSPQLGTIFY-----------PSRSSSYAYVPCDSEHCRYFPYARCS 130
           S LLW+ C  C  C+P L + +Y           PSRS S  ++ C  + C     + C 
Sbjct: 117 SDLLWIPC-DCVQCAP-LSSSYYSNLDRDLNEYSPSRSLSSKHLSCSHQLCDKG--SNCK 172

Query: 131 AYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIH--VQ-DVVFGCGFSTNRNF--- 184
           + + +C Y   YL    +S  +  E +    +  S  +  VQ  VV GCG   +  +   
Sbjct: 173 SSQQQCPYMVSYLSENTSSSGLLVEDILHLQSGGSLSNSSVQAPVVLGCGMKQSGGYLDG 232

Query: 185 -KFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYL------HNKLILGD-GAIIDEGD 236
               G+ GLG G SS+ S L  S     G +HD   L        ++  GD G  I +  
Sbjct: 233 VAPDGLLGLGPGESSVPSFLAKS-----GLIHDSFSLCFNEDDSGRIFFGDQGPTIQQST 287

Query: 237 A-TPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEA 295
           +  PL  +   Y I +E+  V    L +    FK       V +DSGT  T+L    Y A
Sbjct: 288 SFLPLDGLYSTYIIGVESCCVGNSCLKMTS--FK-------VQVDSGTSFTFLPGHVYGA 338

Query: 296 LRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQP 355
           + +E   ++ G +      P + CY    S +L   P++   F+      +      +  
Sbjct: 339 IAEEFDQQVNGSRSSFEGSPWEYCYV-PSSQELPKVPSLTLTFQQNNSFVVYDPVFVFYG 397

Query: 356 RPD--AFCMAVNPASINNRYVNFSLIGMMAQQF---YNVGYDIGRKQKTFQRMDCEVL 408
                 FC+A+ P   +        +G + Q F   Y + +D G K+  + R +C+ L
Sbjct: 398 NEGVIGFCLAIQPTEGD--------MGTIGQNFMTGYRLVFDRGNKKLAWSRSNCQDL 447


>gi|340810945|gb|AEK75399.1| S5 [Oryza sativa]
 gi|340810957|gb|AEK75405.1| S5 [Oryza sativa]
 gi|340811007|gb|AEK75430.1| S5 [Oryza nivara]
 gi|340811073|gb|AEK75463.1| S5 [Oryza rufipogon]
 gi|340811094|gb|AEK75473.1| S5 [Oryza rufipogon]
          Length = 357

 Score = 85.1 bits (209), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 110/383 (28%), Positives = 161/383 (42%), Gaps = 66/383 (17%)

Query: 63  VNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DC---SPQLGTIFYPSRSSSYAYVPCDS 118
           + +S+G+PPV    A+DTGS+L WV C PC   C   S + G IF P RS +   V C S
Sbjct: 1   MAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVRCSS 60

Query: 119 EHC---RY---FPYARCSAYKHRCIYTQLYLIGPETSV-FVSTEQLTFKNTDESTIHVQD 171
             C   RY      A C   +  C Y+  Y  G   SV  + T+ L   ++        D
Sbjct: 61  VKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS------FMD 114

Query: 172 VVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN--------SSFSYCIGSLH-DPDYLHN 222
           ++FGC      +   +GIFG G    S   QL          +FSYC+ +    P Y   
Sbjct: 115 LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCLPTDETKPGY--- 171

Query: 223 KLILG--DGAIIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
            +ILG  D A +D G     + I+   Y +T E +  +G+ L           S   +++
Sbjct: 172 -MILGRYDRAAMDGGYTPLFRSINRPTYSLTTEMLIANGQRLVT---------SSSEMIV 221

Query: 280 DSGTDVTWLVKEAYEALRDEVMI---------RLEGEQMRSY----SWPDKLCYHGIMS- 325
           DSG   T L    + AL D+ +          R    +  SY    S  D   ++G ++ 
Sbjct: 222 DSGAQRTSLWPSTF-ALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITP 280

Query: 326 -SDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCM--AVNPASINNRYVNFSLIGMM 382
            S+    P +   F GGA LAL   ++FY       CM  A NPA      +   ++G  
Sbjct: 281 FSNWSALPLLEIGFAGGAALALSPRNVFYNDPHRGLCMTFAQNPA------LRSQILGNR 334

Query: 383 AQQFYNVGYDIGRKQKTFQRMDC 405
             + +   +DI  KQ  F+   C
Sbjct: 335 VTRSFGTTFDIQGKQFGFKYAAC 357


>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
 gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
          Length = 462

 Score = 85.1 bits (209), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 97/380 (25%), Positives = 152/380 (40%), Gaps = 51/380 (13%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPC-DSE 119
           +Y +I +G P       +DTGS L W+ C PC+ C+P + TI+  +RS SY  V C +S+
Sbjct: 100 YYTSIKLGSPGQEAILIVDTGSELTWLKCLPCKVCAPSVDTIYDAARSVSYKPVTCNNSQ 159

Query: 120 HC---RYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKN-TDESTIHVQDVVFG 175
            C       YA C A   +C +   Y  G  +   +ST+ L  +       + VQD  FG
Sbjct: 160 LCSNSSQGTYAYC-ARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTVQDFAFG 218

Query: 176 C--GFSTNRNFKFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILGDG 229
           C  G         SGI GL  G+ +L  QL       FS+C             +  G+ 
Sbjct: 219 CAQGDLELVPTGASGILGLNAGKMALPMQLGQRFGWKFSHCFPDRSSHLNSTGVVFFGNA 278

Query: 230 AIIDEG------DATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGT 283
            +  E         T  +     Y++ L+ +S++   L + P        G  V++DSG+
Sbjct: 279 ELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVLLPR-------GSVVILDSGS 331

Query: 284 DVTWLV-------KEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDL----KGFP 332
             +  V       +EA+   R   +  LEG+     S+ D      + + D+    +  P
Sbjct: 332 SFSSFVRPFHSQLREAFLKHRPPSLKHLEGD-----SFGDLGTCFKVSNDDIDELHRTLP 386

Query: 333 TVRFHFRGGAKLALEKDSMF-----YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFY 387
           ++   F  G  + +    +      YQ      C A      N      ++IG   QQ  
Sbjct: 387 SLSLVFEDGVTIGIPSIGVLLPVARYQNHVK-MCFAFEDGGPN----PVNVIGNYQQQNL 441

Query: 388 NVGYDIGRKQKTFQRMDCEV 407
            V YDI R +  F R  C +
Sbjct: 442 WVEYDIQRSRVGFARASCVI 461


>gi|302789618|ref|XP_002976577.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
 gi|300155615|gb|EFJ22246.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
          Length = 437

 Score = 85.1 bits (209), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 104/435 (23%), Positives = 171/435 (39%), Gaps = 55/435 (12%)

Query: 1   LIHRDSILSPYNNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQI---LPSNDIS 57
           L HR S L   +  NE      + G+ +S   L +L E       +   I   L  N   
Sbjct: 26  LQHRYSGLEGSSKQNE------KLGLGMSKQHLQHLVEHNDRRGRFLQGISFPLKGNYSD 79

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCS------PQLGTIFYPSRSSSY 111
             L+Y  I +G P       +DTGS +LWV C PCR C       P L      + S+S 
Sbjct: 80  LGLYYTEIGLGNPVQKLKVIVDTGSDILWVKCSPCRSCLSKQDIIPPLSIYNLSASSTSS 139

Query: 112 AYVPCDSEHCRYFPYARCSAYKHRCIYTQLYL-IGPETSVFVSTEQLTFKNTDESTIHVQ 170
                D            S     C Y   Y         +V  +     +   +T    
Sbjct: 140 VSSCSDPLCTGEEVVCSRSGNNSACAYVSSYQDKSASVGAYVRDDMHYVLHGGNAT--TS 197

Query: 171 DVVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLNSS------FSYCIGSLHDPDYLHNKL 224
            + FGC  +   ++   GI G G+   ++ +Q+ +       FS+C+G      +    L
Sbjct: 198 RIFFGCATNITGSWPVDGIMGFGLISKTVPNQIATQRNMSRVFSHCLGG---EKHGGGIL 254

Query: 225 ILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIF---KRDDSGGGVMIDS 281
             G+     E   TPL  +  HY + L +ISV+ ++L I+P  F   +   +  GV+IDS
Sbjct: 255 EFGEAPNTTEMVFTPLLNVTTHYNVDLLSISVNSKVLPIDPKEFSYVRNSTNNTGVIIDS 314

Query: 282 GTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL----CYHGIMSSDL---KGFPTV 334
           GT    L  +A   L  E+      + + +     KL    C++  + S L     FP V
Sbjct: 315 GTTFVLLTTKANRMLFQEI------KSLTTAKLGPKLEGLECFY--LKSGLTMETSFPNV 366

Query: 335 RFHFRGGAKLALEKDSMF----YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVG 390
              F GG+ + L+ D+      Y+ + + +C A + A         ++ G +  +   V 
Sbjct: 367 TLTFSGGSTMKLKPDNYLVMAEYKKKRNGYCYAWSSAD------GLTIFGEIVLKDKLVF 420

Query: 391 YDIGRKQKTFQRMDC 405
           YD+  ++  ++  +C
Sbjct: 421 YDVENRRIGWKGQNC 435


>gi|357489329|ref|XP_003614952.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355516287|gb|AES97910.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 530

 Score = 85.1 bits (209), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 88/360 (24%), Positives = 149/360 (41%), Gaps = 46/360 (12%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPS-----------RS 108
           L Y  I IG P V    A+DTGS + WV C  C +C+P L   FY +            S
Sbjct: 101 LHYTWIDIGTPNVSFLVALDTGSDMFWVPC-DCIECAP-LSAAFYNALDRDLNQYSPSLS 158

Query: 109 SSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLI-GPETSVFVSTEQLTFKNTDESTI 167
           SS  ++PC  + C     + C  +K RC Y + Y      +S F+  ++L   + + +  
Sbjct: 159 SSSRHLPCGHQLCNQ--NSNCKGFKDRCPYIKEYTSDNTSSSGFLIEDKLHLASNNATKN 216

Query: 168 HVQ-DVVFGCGFSTNRNF----KFSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHD 216
            +Q  V+ GCG   +  F      +G+ GLG G  S+ + L       +S S C+     
Sbjct: 217 SIQASVILGCGRKQSGYFLEGAAPNGMLGLGPGSISVPALLAKAGLIRNSISICLN---- 272

Query: 217 PDYLHNKLILGDGAIIDEGDATPLQFIDG---HYYITLEAISVDGRMLDINPNIFKRDDS 273
            +    +++ GD     +  +TP    DG   +Y++ +E   V           F   ++
Sbjct: 273 -EKGSGRILFGDQGHATQRRSTPFLLDDGELLNYFVGVERFCVGS---------FCYKET 322

Query: 274 GGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPT 333
                ID+GT  T+L K  YE +  E   ++   ++ S    D  C +   S +   FP 
Sbjct: 323 EFKAFIDTGTSFTYLPKGVYETVVAEFEKQVHATRITSQIQSDFNCCYNASSRESNNFPP 382

Query: 334 VRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDI 393
           ++F F       ++   +         C+AV     ++  +       +A Q + +GYD+
Sbjct: 383 MKFTFSKNQSFIIQNPFISMDQEDTTICLAV--VQSDDELITIGRKYTIACQNFLMGYDM 440


>gi|357515189|ref|XP_003627883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521905|gb|AET02359.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score = 84.7 bits (208), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 95/360 (26%), Positives = 141/360 (39%), Gaps = 28/360 (7%)

Query: 56  ISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVP 115
           I +  + V    G PP     A+DT S   W+ C  C  CS      F P +S+S+  V 
Sbjct: 92  IQSPTYIVKAKFGTPPQTLLLALDTSSDAAWIPCSGCVGCS--TSKPFAPIKSTSFRNVS 149

Query: 116 CDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
           C S HC+  P   C      C +   Y     +S+  S  Q T      +T  +    FG
Sbjct: 150 CGSPHCKQVPNPTCGG--SACAFNFTYG---SSSIAASVVQDTL---TLATDPIPGYTFG 201

Query: 176 C-----GFSTNRNFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGA 230
           C     G S  +        G     S   +   S+FSYC+ S    ++    L LG   
Sbjct: 202 CVNKTTGSSAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCLPSFKSINF-SGSLRLGPVY 260

Query: 231 IIDEGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFK-RDDSGGGVMIDSGTDVT 286
                  TPL         YY+ L AI V  +++DI P        +G G + DSGT  T
Sbjct: 261 QPKRIKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTTGAGTIFDSGTVFT 320

Query: 287 WLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLAL 346
            L +  Y A+R+E   R+  +   +       CY+  +       PT+ F F  G  + L
Sbjct: 321 RLAEPVYTAVRNEFRRRVGPKLPVTTLGGFDTCYNVPIV-----VPTITFLF-SGMNVTL 374

Query: 347 EKDSM-FYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
             D++  +       C+A+  A  N   V  ++I  M QQ + V +D+   +    R  C
Sbjct: 375 PPDNIVIHSTAGSTTCLAMAGAPDNVNSV-LNVIANMQQQNHRVLFDVPNSRIGIARELC 433


>gi|77555282|gb|ABA98078.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 409

 Score = 84.7 bits (208), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 85/292 (29%), Positives = 133/292 (45%), Gaps = 32/292 (10%)

Query: 123 YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNR 182
           YF +A+C+   +             TS +++T+  TF  T      V  VVFGC  ++  
Sbjct: 111 YFVWAQCAPLTYGGS-------AANTSGYLATDTFTFGAT-----AVPGVVFGCSDASYG 158

Query: 183 NFK-FSGIFGLGIGRSSLVSQLN-SSFSYCIGS--LHDPDYLHNKLILGDGAI--IDEGD 236
           +F   SG+ G+G G  SL+SQL    FSY + +    D     + +  GD A+     G 
Sbjct: 159 DFAGASGVIGIGRGNLSLISQLQFGKFSYQLLAPEATDDGSADSVIRFGDDAVPKTKRGR 218

Query: 237 ATPL---QFIDGHYYITLEAISVDGRMLDINP-NIFK-RDDSGGGVMIDSGTDVTWLVKE 291
           +TPL         YY+ L  + VDG  LD  P   F  R +  GGV++ S T VT+L + 
Sbjct: 219 STPLLSSTLYPDFYYVNLTGVRVDGNRLDAIPAGTFDLRANGTGGVILSSTTPVTYLEQA 278

Query: 292 AYEALRDEVMIRLEGEQMR-SYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDS 350
           AY+ +R  V  R+    +  S +    LCY+    + +K  P +   F GGA + L   +
Sbjct: 279 AYDVVRAAVASRIGLPAVNGSAALELDLCYNASSMAKVK-VPKLTLVFDGGADMDLSAAN 337

Query: 351 MFYQPRPDAF-CMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQ 401
            FY        C+ + P+         S++G + Q   N+ YD+   + TF+
Sbjct: 338 YFYIDNDTGLECLTMLPSQ------GGSVLGTLLQTGTNMIYDVDAGRLTFE 383


>gi|302764208|ref|XP_002965525.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
 gi|300166339|gb|EFJ32945.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
          Length = 464

 Score = 84.7 bits (208), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 94/384 (24%), Positives = 161/384 (41%), Gaps = 44/384 (11%)

Query: 37  QEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCS 96
           +E+   H+  Q    P +  +  ++Y +I++G PP      MDTGS L WV C PC   S
Sbjct: 103 EEEEVEHDLAQT---PVSFTNGGVYYSSITLGSPPKDFSLVMDTGSDLTWVRCDPC---S 156

Query: 97  PQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQ 156
           P   + F    S++Y  + C ++  R     R        ++ +L+  G        T +
Sbjct: 157 PDCSSTFDRLASNTYKALTC-ADDLRLPVLLR--------LWRRLFHSGRS---LRDTLK 204

Query: 157 LTFKNTDESTIHVQDVVFGCGFSTNRNFKFS-GIFGLGIGRSSLVSQLN----SSFSYCI 211
           +    +DE        VFGCG           GI  L  G  S  SQ+     + FSYC+
Sbjct: 205 MAGAASDELE-EFPGFVFGCGSLLKGLISGEVGILALSPGSLSFPSQIGEKYGNKFSYCL 263

Query: 212 GSLHDPDYLHNK-LILGDGAI---------IDEGDATPLQFIDGHYYITLEAISVDGRML 261
                 + L    ++ G+ A+           E   TP+     +Y + L+ ISV  + L
Sbjct: 264 LRQTAQNSLKKSPMVFGEAAVELKEPGSGKPQELQYTPIGESSIYYTVRLDGISVGNQRL 323

Query: 262 DINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKLCYH 321
           D++P+ F  +      + DSGT +T L     ++++  +   + G +  +    D  C+ 
Sbjct: 324 DLSPSTF-LNGQDKPTIFDSGTTLTMLPSGVCDSIKQSLASMVSGAEFVAIKGLDA-CFR 381

Query: 322 GIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGM 381
            +  S  +G P + FHF GGA   + + S +        C+   P +        S+ G 
Sbjct: 382 -VPPSSGQGLPDITFHFNGGADF-VTRPSNYVIDLGSLQCLIFVPTN------EVSIFGN 433

Query: 382 MAQQFYNVGYDIGRKQKTFQRMDC 405
           + QQ + V +D+  ++  F+  DC
Sbjct: 434 LQQQDFFVLHDMDNRRIGFKETDC 457


>gi|340810981|gb|AEK75417.1| S5 [Oryza rufipogon]
          Length = 357

 Score = 84.7 bits (208), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 108/383 (28%), Positives = 160/383 (41%), Gaps = 66/383 (17%)

Query: 63  VNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DC---SPQLGTIFYPSRSSSYAYVPCDS 118
           + +S+G+PPV    A+DTGS+L WV C PC   C   S + G IF P RS +   V C S
Sbjct: 1   MAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVRCSS 60

Query: 119 EHCRYFPY------ARCSAYKHRCIYTQLYLIGPETSV-FVSTEQLTFKNTDESTIHVQD 171
             C    Y      A C   +  C Y+  Y  G   SV  + T+ L   ++        D
Sbjct: 61  VKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS------FMD 114

Query: 172 VVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN--------SSFSYCIGSLH-DPDYLHN 222
           ++FGC      +   +GIFG G    S   QL          + SYC+ +    P Y   
Sbjct: 115 LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKALSYCLPTDETKPGY--- 171

Query: 223 KLILG--DGAIIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
            +ILG  D A +D G     + I+   Y +T+E +  +G+ L           S   +++
Sbjct: 172 -MILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVT---------SSSEMIV 221

Query: 280 DSGTDVTWLVKEAYEALRDEVMI---------RLEGEQMRSY----SWPDKLCYHGIMS- 325
           DSG   T L    + AL D+ +          R    +  SY    S  D   ++G ++ 
Sbjct: 222 DSGAQRTSLWPSTF-ALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITP 280

Query: 326 -SDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCM--AVNPASINNRYVNFSLIGMM 382
            S+    P +   F GGA LAL   ++FY       CM  A NPA      +   ++G  
Sbjct: 281 FSNWSALPLLEIGFAGGAALALSPRNVFYNDPHRGLCMTFAQNPA------LRSQILGNR 334

Query: 383 AQQFYNVGYDIGRKQKTFQRMDC 405
             + +   +DI  KQ  F+   C
Sbjct: 335 VTRSFGTTFDIQGKQFGFKYAVC 357


>gi|125595861|gb|EAZ35641.1| hypothetical protein OsJ_19928 [Oryza sativa Japonica Group]
          Length = 629

 Score = 84.7 bits (208), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 96/340 (28%), Positives = 141/340 (41%), Gaps = 48/340 (14%)

Query: 68  GQPPVPQFTAMDTGSSLLWVHCYPCR--DCSPQLGTIFYPSRSSSYAYVPCDSEHCRYF- 124
           G   V Q   +D+GS + WV C PC    C  Q   +F P+ S++YA VPC S  C    
Sbjct: 71  GTSAVTQTVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLG 130

Query: 125 PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS---TN 181
           PY R  +   +C +   Y  G   +   S + LT    D     ++   FGC  +   + 
Sbjct: 131 PYRRGCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYDV----IRGFRFGCAHADRGSA 186

Query: 182 RNFKFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILG----DGAIID 233
            ++  +G   LG G  SLV Q  +     FSYC+            L+LG       +I 
Sbjct: 187 FDYDVAGSLALGGGSQSLVQQTATRYGRVFSYCLPPTASS---LGFLVLGVPPERAQLIP 243

Query: 234 EGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVK 290
              +TPL         Y + L AI V GR L + P +F         +IDS T ++ L  
Sbjct: 244 SFVSTPLLSSSMAPTFYRVLLRAIIVAGRPLAVPPAVFSASS-----VIDSSTIISRLPP 298

Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKL---CY--HGIMSSDLKGFPTVRFHFRGGAKLA 345
            AY+ALR           M   + P  +   CY   G+ S  L   P++   F GGA + 
Sbjct: 299 TAYQALRAAFR---SAMTMYRAAPPVSILDTCYDFTGVRSITL---PSIALVFDGGATVN 352

Query: 346 LEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQ 385
           L+   +         C+A  P + ++R   F  IG + Q+
Sbjct: 353 LDAAGILL-----GSCLAFAPTA-SDRMPGF--IGNVQQK 384


>gi|297740190|emb|CBI30372.3| unnamed protein product [Vitis vinifera]
          Length = 445

 Score = 84.7 bits (208), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 106/377 (28%), Positives = 154/377 (40%), Gaps = 56/377 (14%)

Query: 12  NNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPP 71
            NP+ +P   +    + S+ R  +L+ +    NT      P    S   + V++S G P 
Sbjct: 61  KNPSSDPWQLLSHLTSASLTRAHHLKHR---KNTSSVNT-PLFAHSYGGYSVSLSFGTPS 116

Query: 72  VPQFTAMDTGSSLLWVHC---YPCRDCS-----PQLGTIFYPSRSSSYAYVPCDSEHCRY 123
                 MDTGSSL+W  C   Y C  CS     P     F P  SSS   V C +  C +
Sbjct: 117 QTLSFVMDTGSSLVWFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSSAKIVGCLNPKCGF 176

Query: 124 FPYARCSAY-KHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNR 182
              +  SA     C    +      T   +  E L F    E      D V GC   ++R
Sbjct: 177 VMDSENSANCTKACPTYAIQYGLGTTVGLLLLESLVFAERTE-----PDFVVGCSILSSR 231

Query: 183 NFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLH---DPDYLHNKLILGDGAIIDEGDA- 237
             + SGI G G G SSL  Q+    FSYC+ S      P      L +G  +  D+    
Sbjct: 232 --QPSGIAGFGRGPSSLPKQMGLKKFSYCLLSHRFDDSPKSSKMTLYVGPDSKDDKTGGL 289

Query: 238 --TPLQ--------FIDGHYYITLEAISVDGRMLDINPNIF--KRDDSGGGVMIDSGTDV 285
             TP +            +YY+TL  I V  + + + P  F     D  GG ++DSG+  
Sbjct: 290 SYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDKRVKV-PYSFMVAGSDGNGGTIVDSGSTF 348

Query: 286 TWLVKEAYEALRDEVMIRLEGEQMRSYSWPD--------KLCYH--GIMSSDLKGFPTVR 335
           T++ K  +EA+  E        QM +Y+           K C++  G+ S  L   P++ 
Sbjct: 349 TFMEKPVFEAVATEF-----DRQMANYTRAADVEALSGLKPCFNLSGVGSVAL---PSLV 400

Query: 336 FHFRGGAKLALEKDSMF 352
           F F+GGAK+ L   + F
Sbjct: 401 FQFKGGAKMELPVANYF 417


>gi|358346736|ref|XP_003637421.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
 gi|355503356|gb|AES84559.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
          Length = 280

 Score = 84.7 bits (208), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 59/186 (31%), Positives = 87/186 (46%), Gaps = 12/186 (6%)

Query: 31  ARLAYLQEKIRSH---NTYQAQILPSNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWV 87
           AR+ Y+  K+  +   +     I+      +  ++  I IG+PP   +  +DTGS + WV
Sbjct: 99  ARVKYITTKLNQNFNTDKLSGPIISGTSQGSGEYFSRIGIGEPPSQAYMVLDTGSDISWV 158

Query: 88  HCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPE 147
            C PC DC  Q   IF P+ S+SYA + C++  CRY   ++C      C+Y   Y  G  
Sbjct: 159 QCAPCADCYRQADPIFEPTASASYAPLSCEAAQCRYLDQSQCR--NGNCLYQVSYGDGSY 216

Query: 148 TSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGIGRS-SLVSQLNS- 205
           T     TE +T          V++V  GCG +    F  +       G   S  +QLNS 
Sbjct: 217 TVGDFVTETVTI-----GVNKVKNVALGCGHNNEGLFVGAAGLIGLGGGPLSFPAQLNST 271

Query: 206 SFSYCI 211
           SFSYC+
Sbjct: 272 SFSYCL 277


>gi|218197465|gb|EEC79892.1| hypothetical protein OsI_21411 [Oryza sativa Indica Group]
          Length = 720

 Score = 84.7 bits (208), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 96/340 (28%), Positives = 141/340 (41%), Gaps = 48/340 (14%)

Query: 68  GQPPVPQFTAMDTGSSLLWVHCYPCR--DCSPQLGTIFYPSRSSSYAYVPCDSEHCRYF- 124
           G   V Q   +D+GS + WV C PC    C  Q   +F P+ S++YA VPC S  C    
Sbjct: 162 GTSAVTQTVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLG 221

Query: 125 PYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS---TN 181
           PY R  +   +C +   Y  G   +   S + LT    D     ++   FGC  +   + 
Sbjct: 222 PYRRGCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYDV----IRGFRFGCAHADRGSA 277

Query: 182 RNFKFSGIFGLGIGRSSLVSQLNSS----FSYCIGSLHDPDYLHNKLILG----DGAIID 233
            ++  +G   LG G  SLV Q  +     FSYC+            L+LG       +I 
Sbjct: 278 FDYDVAGSLALGGGSQSLVQQTATRYGRVFSYCLPPTAS---SLGFLVLGVPPERAQLIP 334

Query: 234 EGDATPL---QFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVK 290
              +TPL         Y + L AI V GR L + P +F         +IDS T ++ L  
Sbjct: 335 SFVSTPLLSSSMAPTFYRVLLRAIIVAGRPLAVPPAVFSASS-----VIDSSTIISRLPP 389

Query: 291 EAYEALRDEVMIRLEGEQMRSYSWPDKL---CY--HGIMSSDLKGFPTVRFHFRGGAKLA 345
            AY+ALR           M   + P  +   CY   G+ S  L   P++   F GGA + 
Sbjct: 390 TAYQALRAAFR---SAMTMYRAAPPVSILDTCYDFTGVRSITL---PSIALVFDGGATVN 443

Query: 346 LEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQ 385
           L+   +         C+A  P + ++R   F  IG + Q+
Sbjct: 444 LDAAGILL-----GSCLAFAPTA-SDRMPGF--IGNVQQK 475


>gi|224140036|ref|XP_002323393.1| predicted protein [Populus trichocarpa]
 gi|222868023|gb|EEF05154.1| predicted protein [Populus trichocarpa]
          Length = 459

 Score = 84.7 bits (208), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 106/436 (24%), Positives = 174/436 (39%), Gaps = 62/436 (14%)

Query: 15  NENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPVPQ 74
           ++ P   +    ++S++R  +++    + +  +  + P    S   + ++++ G PP   
Sbjct: 40  SKKPWGSLNHLASLSLSRAHHIKSPKTNFSLIKTPLFPR---SYGGYSISLNFGTPPQTT 96

Query: 75  FTAMDTGSSLLWVHC---YPCRDCS-PQLGT----IFYPSRSSSYAYVPCDSEHCRYF-- 124
              MDTGSSL+W  C   Y C +C+ P +       F P  SSS   + C +  C     
Sbjct: 97  KFVMDTGSSLVWFPCTSRYLCSECNFPNIKKTGIPTFLPKLSSSSKLIGCKNPRCSMIFG 156

Query: 125 -----PYARCSAYKHRCIYT-QLYLI---GPETSVFVSTEQLTFKNTDESTIHVQDVVFG 175
                    C +    C  T   Y+I      T+  + +E L F N       + D + G
Sbjct: 157 PEIQSKCQECDSTAQNCTQTCPPYVIQYGSGSTAGLLLSETLDFPNKKT----IPDFLVG 212

Query: 176 CG-FSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGS-LHDPDYLHNKLIL----GD 228
           C  FS  +     GI G G    SL SQL    FSYC+ S   D     + L+L    G 
Sbjct: 213 CSIFSIKQP---EGIAGFGRSPESLPSQLGLKKFSYCLVSHAFDDTPTSSDLVLDTGSGS 269

Query: 229 GAIIDEGDA------TPLQFIDGHYYITLEAISVDGRMLDINPNIF--KRDDSGGGVMID 280
           G     G +       P      +YY+ L  I +    + + P  F     D  GG ++D
Sbjct: 270 GVTKTAGLSHTPFLKNPTTAFRDYYYVLLRNIVIGDTHVKV-PYKFLVPGTDGNGGTIVD 328

Query: 281 SGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPDKL--------CYHGIMSSDLKGFP 332
           SGT  T++    YE +  E       +QM  Y+   ++        CY+ I        P
Sbjct: 329 SGTTFTFMENPVYELVAKEFE-----KQMAHYTVATEIQNLTGLRPCYN-ISGEKSLSVP 382

Query: 333 TVRFHFRGGAKLALEKDSMFYQPRPDAFCMAV---NPASINNRYVNFSLIGMMAQQFYNV 389
            + F F+GGAK+AL   + F        C+ +   N A          ++G   Q+ + V
Sbjct: 383 DLIFQFKGGAKMALPLSNYFSIVDSGVICLTIVSDNVAGPGLGGGPAIILGNYQQRNFYV 442

Query: 390 GYDIGRKQKTFQRMDC 405
            +D+  ++  F++  C
Sbjct: 443 EFDLENEKFGFKQQSC 458


>gi|222637182|gb|EEE67314.1| hypothetical protein OsJ_24556 [Oryza sativa Japonica Group]
          Length = 304

 Score = 84.3 bits (207), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 97/360 (26%), Positives = 148/360 (41%), Gaps = 76/360 (21%)

Query: 63  VNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC----SPQLGTIFYPSRSSSYAYVPCDS 118
           + +++G PPV    A+   S L WV C PC  C    +P  G   Y  R++S ++ P   
Sbjct: 1   MELAVGTPPV-TVQALFGISDLCWVECTPCSGCNNNAAPPAGARLY-DRANSSSFSPLAD 58

Query: 119 EHCRY-FPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCG 177
             C Y + Y      ++       Y+ G      + TE + F + D +T  VQ   FGC 
Sbjct: 59  TECGYRYVYGATDTDRN-------YVKG-----ILGTETIKFGSNDAAT--VQSFTFGCT 104

Query: 178 FSTNRNFKF---SGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIID 233
            +  RN  F   +G+ GLG  + SLV QL    FSYC+ S  +P+ + + ++ G  A +D
Sbjct: 105 NTVYRNDLFDGNTGVVGLGRSKLSLVGQLGLDRFSYCLAS--NPN-VASPVLFGSTASMD 161

Query: 234 EG--DATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKE 291
                +TPL   D +YY+ L  ISVDG  L I PN   R                  +  
Sbjct: 162 GNGVSSTPLLPDDANYYVNLLGISVDGTRLAI-PNDTAR------------------MSR 202

Query: 292 AYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSM 351
            YEA        + G  +  +   D        S ++   PT+  HF G     L  +  
Sbjct: 203 TYEA--------VNGSGLLCFLVDDA-------SKNVVTVPTMTMHFDGMDMELLFGNYF 247

Query: 352 FYQPRP------DAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
            Y  +       D  C+ +  +S  +R      IG   Q  ++V Y++     + Q  DC
Sbjct: 248 AYTGKQSGGGGGDVLCLMIGKSSTGSR------IGNYLQMDFHVLYELKNSVLSVQPADC 301


>gi|326505434|dbj|BAJ95388.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 529

 Score = 84.3 bits (207), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 104/375 (27%), Positives = 151/375 (40%), Gaps = 50/375 (13%)

Query: 60  LFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDC----SPQLGT----IFYPSRSSSY 111
           L Y  +++G P V    A+DTGS L WV C  C  C    SP  G     ++ P +SS+ 
Sbjct: 107 LHYAVVALGTPNVTFLVALDTGSDLFWVPC-DCLKCAPLSSPDYGNLKFDVYSPRKSSTS 165

Query: 112 AYVPCDSEHCRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDE--STIHV 169
             VPC S  C       CSA  + C Y   YL    +S  V  E + +  T+   S I  
Sbjct: 166 RKVPCSSNMCDL--QTECSAASNSCPYKIEYLSDNTSSKGVLVEDVMYLATESGHSKITQ 223

Query: 170 QDVVFGCGFSTNRNFKFS----GIFGLGIGRSSLVSQLNS------SFSYCIGSLHDPDY 219
             + FGCG     +F  S    G+ GLG+   S+ S L S      SFS C G     + 
Sbjct: 224 APITFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASQGVAANSFSMCFG-----ED 278

Query: 220 LHNKLILGDGAIIDEGDATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
            H ++  GD    D+ + TPL     + Y     IS+ G M        K   +    ++
Sbjct: 279 GHGRINFGDTGSADQLE-TPLNIYKHNPYYN---ISIVGAMAG-----GKTFSTKFSAVV 329

Query: 280 DSGTDVTWLVKEAYEALRDEVMIRL-EGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHF 338
           DSGT  T L    Y  +      ++ E       S P + CY  I S      P +    
Sbjct: 330 DSGTSFTALSDPMYTEITSAFDKQVKEKRNPADSSLPFEYCYT-ISSKGAVSPPNISLTA 388

Query: 339 RGGAKLALEKDSMF----YQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIG 394
           +GG+   + KD +         P  +C+A+    + +  VN  LIG        V +D  
Sbjct: 389 KGGSVFPV-KDPIITITDISSSPVGYCLAI----MKSEGVN--LIGENFMSGLKVVFDRE 441

Query: 395 RKQKTFQRMDCEVLD 409
           R    ++  +C  +D
Sbjct: 442 RLVLGWKSFNCYSVD 456


>gi|413923876|gb|AFW63808.1| hypothetical protein ZEAMMB73_793799 [Zea mays]
          Length = 415

 Score = 84.3 bits (207), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 90/363 (24%), Positives = 145/363 (39%), Gaps = 59/363 (16%)

Query: 78  MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARC-------- 129
           +DTGS++ W     C             SRS + + +PC S  C       C        
Sbjct: 73  VDTGSNIFWTTEKEC-------------SRSKTRSMLPCCSPKCEQRASCGCRRSELKAE 119

Query: 130 SAYKHRCIYTQLYL--IGPETSVFVSTEQLTFKNTDESTI----HVQDVVFGCGFSTNRN 183
           +  + +C Y   Y       T+  +  ++LT        +      ++V  GC  S    
Sbjct: 120 AEKETKCTYAIKYGGNANDSTAGVLYEDKLTIVAVASKAVPGSQSFEEVAIGCSTSATLK 179

Query: 184 FK---FSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT- 238
           FK     G+FGLG   +SL  QLN S FSYC+ S   PD L + L+L     +  G    
Sbjct: 180 FKDPSIKGVFGLGRSATSLPRQLNFSKFSYCLSSYQKPD-LPSYLLLTAAPDMATGAVGG 238

Query: 239 ----------PLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
                     P       Y++ L+ IS+ G  L   P +  +  SGG + +D+GT  T L
Sbjct: 239 AAAVATTALQPNSDYKTRYFVDLQGISIGGTRL---PAVSTK--SGGNMFVDTGTSFTRL 293

Query: 289 VKEAYEALRDEVMIRLEGEQMRSYSWPDK----LCYH--GIMSSDLKGFPTVRFHFRGGA 342
               +  L  E + R+  E+      P +    +CY      + +    P +  HF   A
Sbjct: 294 EGTVFAKLVTE-LDRIMKERKYVKEQPGRNNGQICYSPPSTAADESSKLPDMVLHFADSA 352

Query: 343 KLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
            + L  DS  ++      C+A++ ++I       S++G    Q  ++  D G ++ +F R
Sbjct: 353 NMVLPWDSYLWK-TTSKLCLAIDKSNIKG---GISVLGNFQMQNTHMLLDTGNEKLSFVR 408

Query: 403 MDC 405
            DC
Sbjct: 409 ADC 411


>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
 gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
          Length = 462

 Score = 84.3 bits (207), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 97/382 (25%), Positives = 152/382 (39%), Gaps = 55/382 (14%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCD-SE 119
           +Y +I +G P       +DTGS L W+ C PC+ C+P + TI+  +RS+SY  V C+ S+
Sbjct: 100 YYTSIKLGSPGQEAILIVDTGSELTWLQCLPCKVCAPSVDTIYDAARSASYRPVTCNNSQ 159

Query: 120 HCR---YFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNT-DESTIHVQDVVFG 175
            C       YA C A   +C +   Y  G  +   +ST+ L  +       + VQD  FG
Sbjct: 160 LCSNSSQGTYAYC-ARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTVQDFAFG 218

Query: 176 C--GFSTNRNFKFSGIFGLGIGRSSLVSQLNS----SFSYCIGSLHDPDYLHNKLILGDG 229
           C  G         SGI GL  G+ +L  QL       FS+C             +  G+ 
Sbjct: 219 CAQGDLELVPTGASGILGLNAGKMALPMQLGQRFGWKFSHCFPDRSSHLNSTGVVFFGNA 278

Query: 230 AIIDEG------DATPLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGT 283
            +  E         T  +     Y++ L+ +S++   L   P        G  V++DSG+
Sbjct: 279 ELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVFLPR-------GSVVILDSGS 331

Query: 284 DVTWLVK-------EAYEALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDL----KGFP 332
             +  V+       EA+   R   +  LEG+     S+ D      + + D+    +  P
Sbjct: 332 SFSSFVRPFHSQLREAFLKHRPPSLKHLEGD-----SFGDLGTCFKVSNDDIDELHRTLP 386

Query: 333 TVRFHFRGGAKLALEKDSMFYQPRPDA-------FCMAVNPASINNRYVNFSLIGMMAQQ 385
           ++   F  G  + +    +     P A        C A      N      ++IG   QQ
Sbjct: 387 SLSLVFEDGVTIGIPSIGVLL---PVARFQNHVKMCFAFEDGGPNP----VNVIGNYQQQ 439

Query: 386 FYNVGYDIGRKQKTFQRMDCEV 407
              V YDI R +  F R  C +
Sbjct: 440 NLWVEYDIQRSRVGFARASCVI 461


>gi|340810961|gb|AEK75407.1| S5 [Oryza sativa]
 gi|340811037|gb|AEK75445.1| S5 [Oryza rufipogon]
          Length = 357

 Score = 84.3 bits (207), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 108/383 (28%), Positives = 160/383 (41%), Gaps = 66/383 (17%)

Query: 63  VNISIGQPPVPQFTAMDTGSSLLWVHCYPCR-DC---SPQLGTIFYPSRSSSYAYVPCDS 118
           + +S+G+PPV    A+DTGS+L WV C PC   C   S + G IF P RS +   V C S
Sbjct: 1   MAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVRCSS 60

Query: 119 EHCRYFPY------ARCSAYKHRCIYTQLYLIGPETSV-FVSTEQLTFKNTDESTIHVQD 171
             C    Y      A C   +  C Y+  Y  G   SV  + T+ L   ++        D
Sbjct: 61  VKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS------FMD 114

Query: 172 VVFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN--------SSFSYCIGSLH-DPDYLHN 222
           ++FGC      +   +GIFG G    S   QL          + SYC+ +    P Y   
Sbjct: 115 LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKALSYCLPTDETKPGY--- 171

Query: 223 KLILG--DGAIIDEGDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGGVMI 279
            +ILG  D A +D G     + I+   Y +T+E +  +G+ L           S   +++
Sbjct: 172 -MILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVT---------SSSEMIV 221

Query: 280 DSGTDVTWLVKEAYEALRDEVMI---------RLEGEQMRSY----SWPDKLCYHGIMS- 325
           DSG   T L    + AL D+ +          R    +  SY    S  D   ++G ++ 
Sbjct: 222 DSGAQRTSLWPSTF-ALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITP 280

Query: 326 -SDLKGFPTVRFHFRGGAKLALEKDSMFYQPRPDAFCM--AVNPASINNRYVNFSLIGMM 382
            S+    P +   F GGA LAL   ++FY       CM  A NPA      +   ++G  
Sbjct: 281 FSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPA------LRSQILGNR 334

Query: 383 AQQFYNVGYDIGRKQKTFQRMDC 405
             + +   +DI  KQ  F+   C
Sbjct: 335 VTRSFGTTFDIQGKQFGFKYAVC 357


>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
 gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
          Length = 460

 Score = 84.3 bits (207), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 103/379 (27%), Positives = 153/379 (40%), Gaps = 41/379 (10%)

Query: 47  QAQILP---SNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTI 102
           QA  LP      I    + V + +G P        DTGS + W  C PC + C  Q    
Sbjct: 102 QATTLPVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPR 161

Query: 103 FYPSRSSSYAYVPCDSEHCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQL 157
             PS S+SY  + C S  C+     +     CS+    C+Y   Y  G  +  F +TE L
Sbjct: 162 LNPSTSTSYKNISCSSALCKLVASGKKFSQSCSS--STCLYQVQYGDGSYSIGFFATETL 219

Query: 158 TFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGI-GRSSLVSQLNSS----FSYCI- 211
           T  +++      ++ +FGCG   N  F  +         + +L SQ   +    FSYC+ 
Sbjct: 220 TLSSSNV----FKNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCLP 275

Query: 212 GSLHDPDYLHNKLILGDGAIIDEGDATPLQ--FIDGHYY-ITLEAISVDGRMLDINPNIF 268
            S     YL     LG G +      TPL   F    +Y + +  +SV GR L I+ + F
Sbjct: 276 ASSSSKGYLS----LG-GQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAF 330

Query: 269 KRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVM-IRLEGEQMRSYSWPDKLCYHGIMSSD 327
                  G +IDSGT +T L   AY  L      +  +      YS  D  CY      D
Sbjct: 331 S-----AGTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDT-CYD-FSKYD 383

Query: 328 LKGFPTVRFHFRGGAKLALEKDSMFYQPRP-DAFCMAVNPASINNRYVNFSLIGMMAQQF 386
               P V   F+GG ++ ++   + Y        C+A    + N+   + S+ G + Q+ 
Sbjct: 384 TVRIPKVGVTFKGGVEMDIDVSGILYPVNGLKKVCLAF---AGNDDDSDTSIFGNVQQRT 440

Query: 387 YNVGYDIGRKQKTFQRMDC 405
           Y V YD  + +  F    C
Sbjct: 441 YQVVYDGAKGRVGFAPGGC 459


>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
 gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score = 84.3 bits (207), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 105/398 (26%), Positives = 160/398 (40%), Gaps = 44/398 (11%)

Query: 32  RLAYLQEKIRSHNTY---QAQILP---SNDISNSLFYVNISIGQPPVPQFTAMDTGSSLL 85
           R+  +  ++ S   +   QA  LP      I    + V + +G P        DTGS + 
Sbjct: 36  RVDSIHARLSSRGMFPEKQATTLPVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDIT 95

Query: 86  WVHCYPC-RDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYAR-----CSAYKHRCIYT 139
           W  C PC + C  Q      PS S+SY  + C S  C+     +     CS+    C+Y 
Sbjct: 96  WTQCEPCVKTCYKQKEPRLNPSTSTSYKNISCSSALCKLVASGKKFSQSCSS--STCLYQ 153

Query: 140 QLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGI-GRSS 198
             Y  G  +  F +TE LT  +++      ++ +FGCG   N  F  +         + +
Sbjct: 154 VQYGDGSYSIGFFATETLTLSSSNV----FKNFLFGCGQQNNGLFGGAAGLLGLGRTKLA 209

Query: 199 LVSQLNSS----FSYCI-GSLHDPDYLHNKLILGDGAIIDEGDATPLQ--FIDGHYY-IT 250
           L SQ   +    FSYC+  S     YL     LG G +      TPL   F    +Y + 
Sbjct: 210 LPSQTAKTYKKLFSYCLPASSSSKGYLS----LG-GQVSKSVKFTPLSADFDSTPFYGLD 264

Query: 251 LEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVM-IRLEGEQM 309
           +  +SV GR L I+ + F       G +IDSGT +T L   AY  L      +  +    
Sbjct: 265 ITGLSVGGRQLSIDESAFS-----AGTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPST 319

Query: 310 RSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALEKDSMFYQPRP-DAFCMAVNPAS 368
             YS  D  CY      D    P V   F+GG ++ ++   + Y        C+A    +
Sbjct: 320 SGYSIFDT-CYD-FSKYDTVRIPKVGVTFKGGVEMDIDVSGILYPVNGLKKVCLAF---A 374

Query: 369 INNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
            N+   + S+ G + Q+ Y V YD  + +  F    C 
Sbjct: 375 GNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGFAPGGCS 412


>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
 gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
          Length = 444

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 106/432 (24%), Positives = 170/432 (39%), Gaps = 55/432 (12%)

Query: 13  NPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPPV 72
            P   P     R   +    L     K+R H+             N    V++++G PP 
Sbjct: 28  RPAAKPRAFPLRARQVPAGALPRPPSKLRFHH-------------NVSLTVSLAVGTPPQ 74

Query: 73  PQFTAMDTGSSLLWVHCYPCRDCSPQ------LGTIFYPSRSSSYAYVPCDSEHC--RYF 124
                +DTGS L W+ C   R  S        +G  F P  S+++A VPC S  C  R  
Sbjct: 75  NVTMVLDTGSELSWLLCATGRQGSAAAGAAAAMGESFRPRASATFAAVPCGSTQCSSRDL 134

Query: 125 PYA-RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGF----S 179
           P    C     +C  +  Y  G  +   ++T+   F   +   +      FGC      S
Sbjct: 135 PAPPSCDGASRQCHVSLSYADGSASDGALATD--VFAVGEAPPLRS---AFGCMSTAYDS 189

Query: 180 TNRNFKFSGIFGLGIGRSSLVSQLNSS-FSYCIGSLHDPDYL---HNKL-ILGDGAIIDE 234
           +      +G+ G+  G  S V+Q ++  FSYCI    D   L   H+ L  L        
Sbjct: 190 SPDGVATAGLLGMNRGTLSFVTQASTRRFSYCISDRDDAGVLLLGHSDLPFLPLNYTPLY 249

Query: 235 GDATPLQFIDG-HYYITLEAISVDGRMLDINPNIFKRDDSGGG-VMIDSGTDVTWLVKEA 292
               PL + D   Y + L  I V G+ L I  ++   D +G G  M+DSGT  T+L+ +A
Sbjct: 250 QPTLPLPYFDRVAYSVQLLGIRVGGKALPIPASVLAPDHTGAGQTMVDSGTQFTFLLGDA 309

Query: 293 YEALRDEVMIR----LEGEQMRSYSWPDKL--CYHGIMSSDLKG--FPTVRFHFRGGAKL 344
           Y AL+ E + +    L      S+++ + L  C+             P V   F  GA++
Sbjct: 310 YSALKAEFLKQTKPLLRALDDPSFAFQEALDTCFRVPAGRPPPSARLPPVTLLFN-GAEM 368

Query: 345 ALEKDSMFYQPRPD------AFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQK 398
           ++  D + Y+   +       +C+    A +    +   +IG   Q    V YD+ R + 
Sbjct: 369 SVAGDRLLYKVPGEHRGADGVWCLTFGNADMVP--LTAYVIGHHHQMNLWVEYDLERGRV 426

Query: 399 TFQRMDCEVLDD 410
               + C+V  +
Sbjct: 427 GLAPVKCDVASE 438


>gi|223950045|gb|ACN29106.1| unknown [Zea mays]
          Length = 392

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 90/363 (24%), Positives = 145/363 (39%), Gaps = 59/363 (16%)

Query: 78  MDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEHCRYFPYARC-------- 129
           +DTGS++ W     C             SRS + + +PC S  C       C        
Sbjct: 50  VDTGSNIFWTTEKEC-------------SRSKTRSMLPCCSPKCEQRASCGCRRSELKAE 96

Query: 130 SAYKHRCIYTQLYL--IGPETSVFVSTEQLTFKNTDESTI----HVQDVVFGCGFSTNRN 183
           +  + +C Y   Y       T+  +  ++LT        +      ++V  GC  S    
Sbjct: 97  AEKETKCTYAIKYGGNANDSTAGVLYEDKLTIVAVASKAVPGSQSFEEVAIGCSTSATLK 156

Query: 184 FK---FSGIFGLGIGRSSLVSQLN-SSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDAT- 238
           FK     G+FGLG   +SL  QLN S FSYC+ S   PD L + L+L     +  G    
Sbjct: 157 FKDPSIKGVFGLGRSATSLPRQLNFSKFSYCLSSYQKPD-LPSYLLLTAAPDMATGAVGG 215

Query: 239 ----------PLQFIDGHYYITLEAISVDGRMLDINPNIFKRDDSGGGVMIDSGTDVTWL 288
                     P       Y++ L+ IS+ G  L   P +  +  SGG + +D+GT  T L
Sbjct: 216 AAAVATTALQPNSDYKTRYFVDLQGISIGGTRL---PAVSTK--SGGNMFVDTGTSFTRL 270

Query: 289 VKEAYEALRDEVMIRLEGEQMRSYSWPDK----LCYH--GIMSSDLKGFPTVRFHFRGGA 342
               +  L  E + R+  E+      P +    +CY      + +    P +  HF   A
Sbjct: 271 EGTVFAKLVTE-LDRIMKERKYVKEQPGRNNGQICYSPPSTAADESSKLPDMVLHFADSA 329

Query: 343 KLALEKDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQR 402
            + L  DS  ++      C+A++ ++I       S++G    Q  ++  D G ++ +F R
Sbjct: 330 NMVLPWDSYLWK-TTSKLCLAIDKSNIKG---GISVLGNFQMQNTHMLLDTGNEKLSFVR 385

Query: 403 MDC 405
            DC
Sbjct: 386 ADC 388


>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
 gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
          Length = 472

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 103/379 (27%), Positives = 153/379 (40%), Gaps = 41/379 (10%)

Query: 47  QAQILP---SNDISNSLFYVNISIGQPPVPQFTAMDTGSSLLWVHCYPC-RDCSPQLGTI 102
           QA  LP      I    + V + +G P        DTGS + W  C PC + C  Q    
Sbjct: 114 QATTLPVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPR 173

Query: 103 FYPSRSSSYAYVPCDSEHCRYFPYAR-----CSAYKHRCIYTQLYLIGPETSVFVSTEQL 157
             PS S+SY  + C S  C+     +     CS+    C+Y   Y  G  +  F +TE L
Sbjct: 174 LNPSTSTSYKNISCSSALCKLVASGKKFSQSCSS--STCLYQVQYGDGSYSIGFFATETL 231

Query: 158 TFKNTDESTIHVQDVVFGCGFSTNRNFKFSGIFGLGI-GRSSLVSQLNSS----FSYCI- 211
           T  +++      ++ +FGCG   N  F  +         + +L SQ   +    FSYC+ 
Sbjct: 232 TLSSSNV----FKNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCLP 287

Query: 212 GSLHDPDYLHNKLILGDGAIIDEGDATPLQ--FIDGHYY-ITLEAISVDGRMLDINPNIF 268
            S     YL     LG G +      TPL   F    +Y + +  +SV GR L I+ + F
Sbjct: 288 ASSSSKGYLS----LG-GQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAF 342

Query: 269 KRDDSGGGVMIDSGTDVTWLVKEAYEALRDEVM-IRLEGEQMRSYSWPDKLCYHGIMSSD 327
                  G +IDSGT +T L   AY  L      +  +      YS  D  CY      D
Sbjct: 343 S-----AGTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDT-CYD-FSKYD 395

Query: 328 LKGFPTVRFHFRGGAKLALEKDSMFYQPRP-DAFCMAVNPASINNRYVNFSLIGMMAQQF 386
               P V   F+GG ++ ++   + Y        C+A    + N+   + S+ G + Q+ 
Sbjct: 396 TVRIPKVGVTFKGGVEMDIDVSGILYPVNGLKKVCLAF---AGNDDDSDTSIFGNVQQRT 452

Query: 387 YNVGYDIGRKQKTFQRMDC 405
           Y V YD  + +  F    C
Sbjct: 453 YQVVYDGAKGRVGFAPGGC 471


>gi|218186522|gb|EEC68949.1| hypothetical protein OsI_37668 [Oryza sativa Indica Group]
          Length = 421

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 98/380 (25%), Positives = 160/380 (42%), Gaps = 54/380 (14%)

Query: 58  NSLFYVNISIGQPPVPQFTAMDTGSSLLWVHC-YPCRDCSPQLGTIFYPSRSSSYAYVPC 116
           + L+YV +SIG PP P F  +DTGS L W+ C  PC  CS     ++ P+++     VPC
Sbjct: 55  HGLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLYRPTKNK---LVPC 111

Query: 117 DSEHCRYFPYA-----RCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQD 171
             + C           +C + K +C Y   Y     +   + T+    +  + S +    
Sbjct: 112 VDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRLANSSIVR-PG 170

Query: 172 VVFGCGF-----STNRNFKFSGIFGLGIGRSSLVSQL------NSSFSYCIGSLHDPDYL 220
           + FGCG+     S+       G+ GLG G  SL+SQL       +   +C+ +       
Sbjct: 171 LAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTRGG---- 226

Query: 221 HNKLILGDGAIIDEGDAT--PLQFIDGHYYITLEAISV--DGRMLDINPNIFKRDDSGGG 276
              L  GD  I+    AT  P+       Y +  + ++   GR L + P           
Sbjct: 227 -GFLFFGD-DIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPM---------E 275

Query: 277 VMIDSGTDVTWLVKEAYEALRDEVMIRLEG--EQMRSYSWPDKLCYHGI--MSSDL---K 329
           V+ DSG+  T+   + Y+AL D +   L    +++  +S P  LC+ G     S L   K
Sbjct: 276 VVFDSGSSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLP--LCWKGKKPFKSVLDVKK 333

Query: 330 GFPTVRFHFRGGAKLALE---KDSMFYQPRPDAFCMAVNPASINNRYVNFSLIGMMAQQF 386
            F TV   F  G K  +E   ++ +      +A    +N + +  + +N  ++G +  Q 
Sbjct: 334 EFKTVVLSFSNGKKALMEIPPENYLIVTKYGNACLGILNGSEVGLKDLN--IVGDITMQD 391

Query: 387 YNVGYDIGRKQKTFQRMDCE 406
             V YD  R Q  + R  C+
Sbjct: 392 QMVIYDNERGQIGWIRAPCD 411


>gi|225440731|ref|XP_002280866.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 469

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 117/449 (26%), Positives = 176/449 (39%), Gaps = 78/449 (17%)

Query: 12  NNPNENPAHRVQRGINISIARLAYLQEKIRSHNTYQAQILPSNDISNSLFYVNISIGQPP 71
            NP+ +P   +    + S+ R  +L+ +    NT      P    S   + V++S G P 
Sbjct: 45  KNPSSDPWQLLSHLTSASLTRAHHLKHR---KNTSSVNT-PLFAHSYGGYSVSLSFGTPS 100

Query: 72  VPQFTAMDTGSSLLWVHC---YPCRDCS-----PQLGTIFYPSRSSSYAYVPCDSEHCRY 123
                 MDTGSSL+W  C   Y C  CS     P     F P  SSS   V C +  C +
Sbjct: 101 QTLSFVMDTGSSLVWFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSSAKIVGCLNPKCGF 160

Query: 124 F----PYARCSAYKHR-------CIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDV 172
                   RC             C    +      T   +  E L F    E      D 
Sbjct: 161 VMDSEVRTRCPGCDQNSANCTKACPTYAIQYGLGTTVGLLLLESLVFAERTE-----PDF 215

Query: 173 VFGCGFSTNRNFKFSGIFGLGIGRSSLVSQLN-SSFSYCIGSLH---DPDYLHNKLILGD 228
           V GC   ++R  + SGI G G G SSL  Q+    FSYC+ S      P      L +G 
Sbjct: 216 VVGCSILSSR--QPSGIAGFGRGPSSLPKQMGLKKFSYCLLSHRFDDSPKSSKMTLYVGP 273

Query: 229 GAIIDEGDA---TPLQ--------FIDGHYYITLEAISVDGRMLDINPNIF--KRDDSGG 275
            +  D+      TP +            +YY+TL  I V  + + + P  F     D  G
Sbjct: 274 DSKDDKTGGLSYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDKRVKV-PYSFMVAGSDGNG 332

Query: 276 GVMIDSGTDVTWLVKEAYEALRDEVMIRLEGEQMRSYSWPD--------KLCYH--GIMS 325
           G ++DSG+  T++ K  +EA+  E        QM +Y+           K C++  G+ S
Sbjct: 333 GTIVDSGSTFTFMEKPVFEAVATEF-----DRQMANYTRAADVEALSGLKPCFNLSGVGS 387

Query: 326 SDLKGFPTVRFHFRGGAKLALEKDSMF-YQPRPDAFCMAVNPASINNRYVNFSL------ 378
             L   P++ F F+GGAK+ L   + F         C+ +    ++N  V  +L      
Sbjct: 388 VAL---PSLVFQFKGGAKMELPVANYFSLVGDLSVLCLTI----VSNEAVGSTLSSGPSI 440

Query: 379 -IGMMAQQFYNVGYDIGRKQKTFQRMDCE 406
            +G    Q +   YD+  ++  F+R  C+
Sbjct: 441 ILGNYQSQNFYTEYDLENERFGFRRQRCK 469


>gi|242041951|ref|XP_002468370.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
 gi|241922224|gb|EER95368.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
          Length = 408

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 91/352 (25%), Positives = 136/352 (38%), Gaps = 30/352 (8%)

Query: 61  FYVNISIGQPPVPQFTAMDTGSSLLWVHCYPCRDCSPQLGTIFYPSRSSSYAYVPCDSEH 120
           + V   +G P      A+DT +   W HC PC  C    G+ F P+ SSSYA +PC S+ 
Sbjct: 79  YVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTC--PAGSRFIPASSSSYASLPCASDW 136

Query: 121 CRYFPYARCSAYKHRCIYTQLYLIGPETSVFVSTEQLTFKNTDESTIHVQDVVFGCGFS- 179
           C  F        +   +  +   +G    V +   Q   +      +        CG++ 
Sbjct: 137 CPLF--------RRPAVPGEPGRVGAAADVRL--LQAASRTPRSGVLAATR----CGWAR 182

Query: 180 TNRNFKFSGIFGLGIGRSSLVSQLNSSFSYCIGSLHDPDYLHNKLILGDGAIIDEGDATP 239
           T      SG   L    S   S+ N  FSYC+ S     Y    L LG          TP
Sbjct: 183 TPSPATRSGPMSL---LSQTGSRYNGVFSYCLPSYRS-YYFSGSLRLGAAGQPRNVRYTP 238

Query: 240 LQFIDGH----YYITLEAISVDGRMLDINPNIFKRDDS-GGGVMIDSGTDVTWLVKEAYE 294
           L   + H    YY+ +  +SV   ++      F  D S G G +IDSGT +T      Y 
Sbjct: 239 L-LTNPHRPSLYYVNVTGLSVGRALVKAPAGSFAFDPSTGAGTVIDSGTVITRWTAPVYA 297

Query: 295 ALRDEVMIRLEGEQMRSYSWPDKLCYHGIMSSDLKGFPTVRFHFRGGAKLALE-KDSMFY 353
           ALRDE   ++      +       C++        G P V  H  GG  L L  ++++ +
Sbjct: 298 ALRDEFRRQVAAPSGYTSLGAFDTCFN-TDEVAAGGAPPVTLHMGGGVDLTLPMENTLIH 356

Query: 354 QPRPDAFCMAVNPASINNRYVNFSLIGMMAQQFYNVGYDIGRKQKTFQRMDC 405
                  C+A+  A   N     +++  + QQ   V  D+   +  F R  C
Sbjct: 357 SSATPLACLAMAEAP-QNVNSVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 407


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.322    0.139    0.431 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,815,809,171
Number of Sequences: 23463169
Number of extensions: 294895144
Number of successful extensions: 538252
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1207
Number of HSP's successfully gapped in prelim test: 1110
Number of HSP's that attempted gapping in prelim test: 531949
Number of HSP's gapped (non-prelim): 2711
length of query: 411
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 266
effective length of database: 8,957,035,862
effective search space: 2382571539292
effective search space used: 2382571539292
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 78 (34.7 bits)