BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 017265
(374 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 439
Score = 335 bits (860), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 195/443 (44%), Positives = 260/443 (58%), Gaps = 73/443 (16%)
Query: 1 MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
MA +S + I+ + + PI+A GF+VELI+RDSPKSPFYN ETP QR+ A+ R
Sbjct: 1 MAASVSLLAIVTLIFSGTLVPIDAAKDGFTVELINRDSPKSPFYNPRETPTQRIVSAVRR 60
Query: 61 SLNRLNHFN--QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
S++R++HF+ +NS I + A Q+++I N YL++ S+GTP + LA+ADTGSDLIWTQ
Sbjct: 61 SMSRVHHFSPTKNSDIFTDTA-QSEMISNQGEYLMKFSLGTPAFDILAIADTGSDLIWTQ 119
Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCSG---VNCQYSVSYGD 174
C+PC QCY QD+PLFDPK SSTY+ + CS+ QC L + SCSG C YS SYGD
Sbjct: 120 CKPC--DQCYEQDAPLFDPKSSSTYRDISCSTKQCDLLKEGASCSGEGNKTCHYSYSYGD 177
Query: 175 GSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSK----------------- 217
SF++GN+A +T+TLGST+G+ V LP GCG NNGG F K
Sbjct: 178 RSFTSGNVAADTITLGSTSGRPVLLPKAIIGCGHNNGGSFTEKGSGIVGLGGGPISLISQ 237
Query: 218 -----------------------------TTGIVGLGG-GDISLISQ---------MRTT 238
+ GIV GG LIS+ +
Sbjct: 238 LGSTIDGKFSYCLVPLSSNATNSSKLNFGSNGIVSGGGVQSTPLISKDPDTFYFLTLEAV 297
Query: 239 IAGNQRL-------GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL 291
G++R+ G S +I+IDSGTTLT P+ + S L S + + PV DP+G L
Sbjct: 298 SVGSERIKFPGSSFGTSEGNIIIDSGTTLTLFPEDFFSELSSAVQDAVAGTPVEDPSGIL 357
Query: 292 ELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQT 351
LCYS ++ + P +T HF GADVKL+ N FV+VS+ ++C F I NS I+GN+ Q
Sbjct: 358 SLCYSIDADLKFPSITAHFDGADVKLNPLNTFVQVSDTVLCFAFNPI-NSGAIFGNLAQM 416
Query: 352 NFLVGYDIEQQTVSFKPTDCTKQ 374
NFLVGYD+E +TVSFKPTDCT+
Sbjct: 417 NFLVGYDLEGKTVSFKPTDCTQD 439
>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 440
Score = 316 bits (809), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 196/413 (47%), Positives = 242/413 (58%), Gaps = 70/413 (16%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSK-ASQADIIP 86
GF+ +LIHRDSPKSPFYN +ET QRLR+A+ RS++R+ HF S +S A Q D+
Sbjct: 30 GFTADLIHRDSPKSPFYNPTETSSQRLRNAIHRSVSRVFHFTDISQKDASDNAPQIDLTS 89
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
N+ YL+ IS+GTPP +A+ADTGSDL+WTQC+PC CY Q PLFDPK SSTYK +
Sbjct: 90 NSGEYLMNISLGTPPFPIMAIADTGSDLLWTQCKPC--DDCYTQVDPLFDPKASSTYKDV 147
Query: 147 PCSSSQCASL-NQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
CSSSQC +L NQ SCS + C YS SYGD S++ GN+A +T+TLGST + V L I
Sbjct: 148 SCSSSQCTALENQASCSTEDNTCSYSTSYGDRSYTKGNIAVDTLTLGSTDTRPVQLKNII 207
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG------------NQRLG----- 246
GCG NN G FN K +GIVGLGGG +SLI+Q+ +I G N R
Sbjct: 208 IGCGHNNAGTFNKKGSGIVGLGGGAVSLITQLGDSIDGKFSYCLVPLTSENDRTSKINFG 267
Query: 247 ----------VSTPDIVI-------------------------DSG-----------TTL 260
VSTP I DSG TTL
Sbjct: 268 TNAVVSGTGVVSTPLIAKSQETFYYLTLKSISVGSKEVQYPGSDSGSGEGNIIIDSGTTL 327
Query: 261 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRS 320
T LP + S L ++S I+A+ DP L LCYS +VP +T+HF GADV L S
Sbjct: 328 TLLPTEFYSELEDAVASSIDAEKKQDPQTGLSLCYSATGDLKVPAITMHFDGADVNLKPS 387
Query: 321 NFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
N FV++SED+VC F+G + S IYGN+ Q NFLVGYD +TVSFKPTDC K
Sbjct: 388 NCFVQISEDLVCFAFRG-SPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCAK 439
>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 434
Score = 307 bits (786), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 176/438 (40%), Positives = 247/438 (56%), Gaps = 86/438 (19%)
Query: 9 FILFFLC--FYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLN 66
+LF+LC FY +EA GGFSVE+IHRDS +SPF+ +ET +QR+ +A+ RS+NR N
Sbjct: 11 LVLFYLCNIFY----LEAFNGGFSVEMIHRDSSRSPFFRPTETQFQRVANAVHRSVNRAN 66
Query: 67 HFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
HF++ + KA++A I N+ YLI S+G PP + + DTGSD+IW QC+PC +
Sbjct: 67 HFHK-----AHKAAKATITQNDGEYLISYSVGIPPFQLYGIIDTGSDMIWLQCKPC--EK 119
Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSVSYGDGSFSNGNLA 183
CY Q + +FDP S+TYK LP SS+ C S+ SCS N C+Y++ YGDGS+S G+L+
Sbjct: 120 CYNQTTRIFDPSKSNTYKILPFSSTTCQSVEDTSCSSDNRKMCEYTIYYGDGSYSQGDLS 179
Query: 184 TETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGG----------------- 226
ET+TLGST G +V GCG NN F K++GIVGLG
Sbjct: 180 VETLTLGSTNGSSVKFRRTVIGCGRNNTVSFEGKSSGIVGLGNGPVSLINQLRRRSSSIG 239
Query: 227 -------------------GDISLIS--------------------QMRTTIAGNQRLGV 247
GD +++S + GN R+
Sbjct: 240 RKFSYCLASMSNISSKLNFGDAAVVSGDGTVSTPIVTHDPKVFYYLTLEAFSVGNNRIEF 299
Query: 248 STP--------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY--SF 297
++ +I+IDSGTTLT LP S L S ++ ++E V DP L LCY +F
Sbjct: 300 TSSSFRFGEKGNIIIDSGTTLTLLPNDIYSKLESAVADLVELDRVKDPLKQLSLCYRSTF 359
Query: 298 NSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSV-PIYGNIMQTNFLVG 356
+ L+ P + HF GADVKL+ N F++V + + C F I++ + PI+GN+ Q NFLVG
Sbjct: 360 DELN-APVIMAHFSGADVKLNAVNTFIEVEQGVTCLAF--ISSKIGPIFGNMAQQNFLVG 416
Query: 357 YDIEQQTVSFKPTDCTKQ 374
YD++++ VSFKPTDC+KQ
Sbjct: 417 YDLQKKIVSFKPTDCSKQ 434
>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 435
Score = 304 bits (778), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 191/429 (44%), Positives = 246/429 (57%), Gaps = 67/429 (15%)
Query: 10 ILFFLCFY---VVSPIEAQTG-GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRL 65
+L LC + ++S + A+ GF+ +LIHRDSPKSPFYN +ETP QR+R+A+ RS NR+
Sbjct: 8 VLLSLCLFSSHILSNVNAKPKLGFTTDLIHRDSPKSPFYNPAETPSQRIRNAIHRSFNRV 67
Query: 66 NHFNQNSSISSSKAS-QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPP 124
+HF S + +S S Q DI P YL+ +S+GTPP+ +AVADTGS+LIWTQC+PC
Sbjct: 68 SHFTDLSEMDASLNSPQTDITPCGGEYLMNLSLGTPPSPIMAVADTGSNLIWTQCKPC-- 125
Query: 125 SQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL-NQKSCSGVN--CQYSVSYGDGSFSNGN 181
CY Q PLFDPK SSTYK + CSSSQC +L NQ SCS + C Y VSY DGS++ G
Sbjct: 126 DDCYTQVDPLFDPKASSTYKDVSCSSSQCTALENQASCSTEDKTCSYLVSYADGSYTMGK 185
Query: 182 LATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
A +T+TLGST + V L I GCG NN F +K++G+VGLGGG +SLI Q+ +I G
Sbjct: 186 FAVDTLTLGSTDNRPVQLKNIIIGCGQNNAVTFRNKSSGVVGLGGGAVSLIKQLGDSIDG 245
Query: 242 ---------NQRLG---------------VSTPDIV------------------------ 253
N + VSTP +V
Sbjct: 246 KFSYCLVPENDQTSKINFGTNAVVSGPGTVSTPLVVKSRDTFYYLTLKSISVGSKNMQTP 305
Query: 254 ---------IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVP 304
IDSGTTLT LP Y + + ++S+I A D LCY+ + +P
Sbjct: 306 DSNIKGNMVIDSGTTLTLLPVKYYIEIENAVASLINADKSKDERIGSSLCYNATADLNIP 365
Query: 305 EVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 364
+T+HF GADVKL N F KV+ED+VC F IYGN+ Q NFLVGYD +T+
Sbjct: 366 VITMHFEGADVKLYPYNSFFKVTEDLVCLAFGMSFYRNGIYGNVAQKNFLVGYDTASKTM 425
Query: 365 SFKPTDCTK 373
SFKPTDC K
Sbjct: 426 SFKPTDCAK 434
>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
Length = 542
Score = 300 bits (768), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 166/408 (40%), Positives = 242/408 (59%), Gaps = 53/408 (12%)
Query: 1 MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
+ F + V + F F ++ A+ GGFSV+LIHRDSP SPF++ S+T +RL DA R
Sbjct: 6 VKIFFNVVVVGFL--FQLLEVALARGGGFSVDLIHRDSPHSPFFDPSKTQAERLTDAFRR 63
Query: 61 SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
S++R+ F + +S Q+ I+P+ YL+ + IGTPP +A+ DTGSDL WTQC
Sbjct: 64 SVSRVGRFRPTAM--TSDGIQSRIVPSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQCR 121
Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCSG-VNCQYSVSYGDGSFS 178
PC + CY Q PLFDPK SSTY+ C +S C +L + +SCS C + SY DGSF+
Sbjct: 122 PC--THCYKQVVPLFDPKNSSTYRDSSCGTSFCLALGKDRSCSKEKKCTFRYSYADGSFT 179
Query: 179 NGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
GNLA+ET+T+ ST G+ V+ PG FGCG ++GG+F+ ++GIVGLGGG++SLISQ+++T
Sbjct: 180 GGNLASETLTVDSTAGKPVSFPGFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKST 239
Query: 239 IAG---------------NQRLG------------VSTP-----------------DIVI 254
I G + R+ VSTP +I++
Sbjct: 240 INGLFSYCLLPVSTDSSISSRINFGASGRVSGYGTVSTPLRLPYKGYSKKTEVEEGNIIV 299
Query: 255 DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGAD 314
DSGTT TFLPQ + S L +++ I+ + V DP G LCY+ + P +T HF+ A+
Sbjct: 300 DSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFSLCYNTTAEINAPIITAHFKDAN 359
Query: 315 VKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 362
V+L N F+++ ED+VC T+ + + GN+ Q NFLVG+D+ ++
Sbjct: 360 VELQPLNTFMRMQEDLVCFTV-APTSDIGVLGNLAQVNFLVGFDLRKK 406
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 53/142 (37%), Positives = 86/142 (60%), Gaps = 6/142 (4%)
Query: 234 QMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 293
+R +++ V +I++DSGTT T+LP + L ++ I+ + V DP G L
Sbjct: 402 DLRKKRGFSKKAEVEEGNIIVDSGTTYTYLPLEFYVKLEESVAHSIKGKRVRDPNGISSL 461
Query: 294 CYSFNSLSQV--PEVTIHFRGADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQ 350
CY+ ++ Q+ P +T HF+ A+V+L N F+++ ED+VC +V T+ + I GN+ Q
Sbjct: 462 CYN-TTVDQIDAPIITAHFKDANVELQPWNTFLRMQEDLVCFTVLP--TSDIGILGNLAQ 518
Query: 351 TNFLVGYDIEQQTVSFKPTDCT 372
NFLVG+D+ ++ VSFK DCT
Sbjct: 519 VNFLVGFDLRKKRVSFKAADCT 540
>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 433
Score = 297 bits (760), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 179/432 (41%), Positives = 237/432 (54%), Gaps = 75/432 (17%)
Query: 9 FILFFLC--FYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLN 66
+LF+LC FY +EA GGFSVE+IHRDS +SPF++ +ET +QR+ +A+ RS+NR N
Sbjct: 11 LVLFYLCNIFY----LEAFNGGFSVEMIHRDSSRSPFFSPTETQFQRVANAVHRSINRAN 66
Query: 67 HFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
H NQ S S + + +I YLI S+GTP + + DTGSD+IW QC+PC +
Sbjct: 67 HLNQ--SFVSPNSPETTVISALGEYLISYSVGTPSLQVFGILDTGSDIIWLQCQPC--KK 122
Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSG-VNCQYSVSYGDGSFSNGNLATE 185
CY Q +P+FD S TYK+LPC S+ C S+ CS +C YS+ Y DGS S G+L+ E
Sbjct: 123 CYEQTTPIFDSSKSQTYKTLPCPSNTCQSVQGTFCSSRKHCLYSIHYVDGSQSLGDLSVE 182
Query: 186 TVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGN--- 242
T+TLGST G V PG GCG N K +GIVGLG G +SLI+Q+ + G
Sbjct: 183 TLTLGSTNGSPVQFPGTVIGCGRYNAIGIEEKNSGIVGLGRGPMSLITQLSPSTGGKFSY 242
Query: 243 ----------------------QRLGVSTP------------------------------ 250
R VSTP
Sbjct: 243 CLVPGLSTASSKLNFGNAAVVSGRGTVSTPLFSKNGLVFYFLTLEAFSVGRNRIEFGSPG 302
Query: 251 -----DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSL-SQ 302
+I+IDSGTTLT LP G S L + ++ + Q V DP L LCY + L +
Sbjct: 303 SGGKGNIIIDSGTTLTALPNGVYSKLEAAVAKTVILQRVRDPNQVLGLCYKVTPDKLDAS 362
Query: 303 VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 362
VP +T HF GADV L+ N FV+V++D+VC F+ T + ++GN+ Q N LVGYD++
Sbjct: 363 VPVITAHFSGADVTLNAINTFVQVADDVVCFAFQP-TETGAVFGNLAQQNLLVGYDLQMN 421
Query: 363 TVSFKPTDCTKQ 374
TVSFK TDCTKQ
Sbjct: 422 TVSFKHTDCTKQ 433
>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 297 bits (760), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 172/429 (40%), Positives = 239/429 (55%), Gaps = 72/429 (16%)
Query: 11 LFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ 70
LF LCF + S A + GFSVELIHRDSPKSP+Y +E YQ DA RS+NR NHF +
Sbjct: 11 LFSLCF-IASFSHALSNGFSVELIHRDSPKSPYYKPTENKYQHFVDAARRSINRANHFFK 69
Query: 71 NSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
+S S+ +++ +IP+ YL+ S+GTPPT+ +ADTGSD++W QCEPC QCY Q
Sbjct: 70 DSDTSTPEST---VIPDRGGYLMTYSVGTPPTKIYGIADTGSDIVWLQCEPC--EQCYNQ 124
Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTL 189
+P+F+P SS+YK++PCSS C S+ SCS N CQY +SYGD S S G+L+ +T++L
Sbjct: 125 TTPIFNPSKSSSYKNIPCSSKLCHSVRDTSCSDQNSCQYKISYGDSSHSQGDLSVDTLSL 184
Query: 190 GSTTGQAVALPGITFGC-----------------------------GTNNGG-------- 212
ST+G V+ P I GC G++ GG
Sbjct: 185 ESTSGSPVSFPKIVIGCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSIGGKFSYCLVP 244
Query: 213 LFNSKTTGIVGLGGGDISLIS-------------------QMRTTIAGNQRL-------- 245
L N ++ L GD +++S ++ GN+R+
Sbjct: 245 LLNKESNASSILSFGDAAVVSGDGVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGGSSEG 304
Query: 246 GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVP 304
G +I+IDSGTTLT +P +NL S + +++ V DP LCYS S P
Sbjct: 305 GDDEGNIIIDSGTTLTLIPSDVYTNLESAVVDLVKLDRVDDPNQQFSLCYSLKSNEYDFP 364
Query: 305 EVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 364
+T+HF+GADV+L + FV +++ IVC F+ I+GN+ Q N LVGYD++Q+TV
Sbjct: 365 IITVHFKGADVELHSISTFVPITDGIVCFAFQPSPQLGSIFGNLAQQNLLVGYDLQQKTV 424
Query: 365 SFKPTDCTK 373
SFKPTDCTK
Sbjct: 425 SFKPTDCTK 433
>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 291 bits (744), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 171/439 (38%), Positives = 238/439 (54%), Gaps = 72/439 (16%)
Query: 1 MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
M T LF LCF + S A + GFSVELIHRDSPKSP+Y +E YQ DA R
Sbjct: 1 MNTLCFLTLSLFSLCF-IASFSHALSNGFSVELIHRDSPKSPYYKPTENKYQHFVDAARR 59
Query: 61 SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
S+NR NHF ++S S+ +++ +IP+ YL+ S+GTPPT+ +ADTGSD++W QCE
Sbjct: 60 SINRANHFFKDSDTSTPEST---VIPDRGGYLMTYSVGTPPTKIYGIADTGSDIVWLQCE 116
Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSN 179
PC QCY Q +P+F+P SS+YK++PC S C S+ SCS N CQY +SYGD S S
Sbjct: 117 PC--EQCYNQTTPIFNPSKSSSYKNIPCLSKLCHSVRDTSCSDQNSCQYKISYGDSSHSQ 174
Query: 180 GNLATETVTLGSTTGQAVALPGITFGC-----------------------------GTNN 210
G+L+ +T++L ST+G V+ P GC G++
Sbjct: 175 GDLSVDTLSLESTSGSPVSFPKTVIGCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSI 234
Query: 211 GG--------LFNSKTTGIVGLGGGDISLIS-------------------QMRTTIAGNQ 243
GG L N ++ L GD +++S ++ GN+
Sbjct: 235 GGKFSYCLVPLLNKESNASSILSFGDAAVVSGDGVVSTPLIKKDPVFYFLTLQAFSVGNK 294
Query: 244 RL--------GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY 295
R+ G +I+IDSGTTLT +P +NL S + +++ V DP LCY
Sbjct: 295 RVEFGGSSEGGDDEGNIIIDSGTTLTLIPSDVYTNLESAVVDLVKLDRVDDPNQQFSLCY 354
Query: 296 SFNSLS-QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFL 354
S S P +T HF+GAD++L + FV +++ IVC F+ I+GN+ Q N L
Sbjct: 355 SLKSNEYDFPIITAHFKGADIELHSISTFVPITDGIVCFAFQPSPQLGSIFGNLAQQNLL 414
Query: 355 VGYDIEQQTVSFKPTDCTK 373
VGYD++Q+TVSFKPTDCTK
Sbjct: 415 VGYDLQQKTVSFKPTDCTK 433
>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 431
Score = 280 bits (716), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 169/429 (39%), Positives = 226/429 (52%), Gaps = 71/429 (16%)
Query: 11 LFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ 70
L LC Y + EA GFSVE+IHRDS +SPFY ++ET +QR+ +A+ RS+NR NHFNQ
Sbjct: 9 LVLLCLYNICFSEALKSGFSVEIIHRDSSRSPFYRATETQFQRVTNAVRRSMNRANHFNQ 68
Query: 71 NSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
S S++ S ++ ++ +YL+ S+GTPP + DT SD+IW QC+ C CY
Sbjct: 69 ISVYSNAVESPVTLL-DDGDYLMSYSLGTPPFPVYGIVDTASDIIWVQCQLC--ETCYND 125
Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSVSYGDGSFSNGNLATETV 187
SP+FDP S TYK+LPCSS+ C S+ SCS C+++V+Y DGS S G+L ETV
Sbjct: 126 TSPMFDPSYSKTYKNLPCSSTTCKSVQGTSCSSDERKICEHTVNYKDGSHSQGDLIVETV 185
Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGG--------------------- 226
TLGS V P GC N F+S GIVGLGG
Sbjct: 186 TLGSYNDPFVHFPRTVIGCIRNTNVSFDS--IGIVGLGGGPVSLVPQLSSSISKKFSYCL 243
Query: 227 ------------GDISLIS--------------------QMRTTIAGNQRLGVSTP---- 250
GD +++S + GN R+ +
Sbjct: 244 APISDRSSKLKFGDAAMVSGDGTVSTRIVFKDWKKFYYLTLEAFSVGNNRIEFRSSSSRS 303
Query: 251 ----DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY-SFNSLSQVPE 305
+I+IDSGTT T LP S L S ++ +++ + DP LCY S VP
Sbjct: 304 SGKGNIIIDSGTTFTVLPDDVYSKLESAVADVVKLERAEDPLKQFSLCYKSTYDKVDVPV 363
Query: 306 VTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
+T HF GADVKL+ N F+ S +VC F + S I+GN+ Q NFLVGYD++++ VS
Sbjct: 364 ITAHFSGADVKLNALNTFIVASHRVVCLAFLS-SQSGAIFGNLAQQNFLVGYDLQRKIVS 422
Query: 366 FKPTDCTKQ 374
FKPTDCTKQ
Sbjct: 423 FKPTDCTKQ 431
>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 438
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 164/443 (37%), Positives = 243/443 (54%), Gaps = 81/443 (18%)
Query: 2 ATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRS 61
+FL+ F FFLCF +S +A + GFS+ELIHRDS KSPFY ++ YQ + DA+ RS
Sbjct: 4 VSFLTLSF--FFLCF-SISFSQAVSNGFSIELIHRDSSKSPFYKPTQNKYQHVVDAVHRS 60
Query: 62 LNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEP 121
+NR+NH N+NS S+ +++ +I +Y++ S+GTPP + + DTGSD++W QCEP
Sbjct: 61 INRVNHSNKNSLASTPEST---VISYEGDYIMSYSVGTPPIKSYGIVDTGSDIVWLQCEP 117
Query: 122 CPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV-NCQYSVSYGDGSFSNG 180
C QCY Q +P F+P SS+YK++ CSS C S+ SC+ NC+YS++YG+ S S G
Sbjct: 118 C--EQCYNQTTPKFNPSKSSSYKNISCSSKLCQSVRDTSCNDKKNCEYSINYGNQSHSQG 175
Query: 181 NLATETVTLGSTTGQAVALPGITFGCGTNN-----------------------------G 211
+L+ ET+TL STTG+ V+ P GCGTNN G
Sbjct: 176 DLSLETLTLESTTGRPVSFPKTVIGCGTNNIGSFKRVSSGVVGLGGGPASLITQLGPSIG 235
Query: 212 GLF-----------NSKTTGIVGLGGGDISLIS--------------------------- 233
G F + + G L GD++++S
Sbjct: 236 GKFSYCLVRMSITLKNMSMGSSKLNFGDVAIVSGHNVLSTPIVKKDHSFFYYLTIEAFSV 295
Query: 234 -QMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE 292
R AG+ + GV +I+IDS T +TF+P + L S + ++ + V DP
Sbjct: 296 GDKRVEFAGSSK-GVEEGNIIIDSSTIVTFVPSDVYTKLNSAIVDLVTLERVDDPNQQFS 354
Query: 293 LCYSFNSLSQ--VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQ 350
LCY+ +S + P +T HF+GAD+ L +N FV+V+ D++C F +N I+G+ Q
Sbjct: 355 LCYNVSSDEEYDFPYMTAHFKGADILLYATNTFVEVARDVLCFAF-APSNGGAIFGSFSQ 413
Query: 351 TNFLVGYDIEQQTVSFKPTDCTK 373
+F+VGYD++Q+TVSFK DCT+
Sbjct: 414 QDFMVGYDLQQKTVSFKSVDCTE 436
>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 436
Score = 276 bits (707), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 165/433 (38%), Positives = 231/433 (53%), Gaps = 75/433 (17%)
Query: 10 ILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFN 69
+LFF ++VS AQ GFSVELIHRDS KSP Y ++ YQ DA RS+NR NHF
Sbjct: 9 LLFFSICFIVSFSHAQKNGFSVELIHRDSLKSPLYKPTQNKYQYFVDAARRSINRANHFY 68
Query: 70 QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM 129
+ S + Q+ +IP+ YL+ S+GTPP + + DTGSD++W QCEPC +CY
Sbjct: 69 K---YSLANIPQSTVIPDIGEYLMTYSVGTPPFKLYGIVDTGSDIVWLQCEPC--QECYN 123
Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVT 188
Q +P+F+P SS+YK++PC S C S+ SC+ N C+YS YGD S S G+L+ +T+T
Sbjct: 124 QTTPMFNPSKSSSYKNIPCPSKLCQSMEDTSCNDKNYCEYSTYYGDNSHSGGDLSVDTLT 183
Query: 189 LGSTTGQAVALPGITFGCGTNN-----------------------------GGLF----- 214
L ST G V+ P I GCGTNN GG F
Sbjct: 184 LESTNGLTVSFPNIVIGCGTNNILSYEGASSGIVGFGSGPASFITQLGSSTGGKFSYCLT 243
Query: 215 ------NSKTTGIVGLGGGDISLIS--------------------QMRTTIAGNQRL--- 245
N ++ L GD + +S + GN+R+
Sbjct: 244 PLFSVTNIQSNATSKLNFGDAATVSGDGVVTTPILKKDPETFYYLTLEAFSVGNRRVEIG 303
Query: 246 ----GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS 301
G + +I+IDSGTTLT L + S L S + +++ + V DPT +L LCYS +
Sbjct: 304 GVPNGDNEGNIIIDSGTTLTSLTKDDYSFLESAVVDLVKLERVDDPTQTLNLCYSVKAEG 363
Query: 302 -QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 360
P +T+HF+GADV L + FV V++ + C F+ + I+GN+ Q N +VGYD++
Sbjct: 364 YDFPIITMHFKGADVDLHPISTFVSVADGVFCLAFESSQDHA-IFGNLAQQNLMVGYDLQ 422
Query: 361 QQTVSFKPTDCTK 373
Q+ VSFKP+DCTK
Sbjct: 423 QKIVSFKPSDCTK 435
>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
Length = 445
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 165/420 (39%), Positives = 222/420 (52%), Gaps = 82/420 (19%)
Query: 27 GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP 86
G F+ LIHRDSP SP YN T + RL+ + RS++R N F NS +S++K + DIIP
Sbjct: 31 GSFTASLIHRDSPISPLYNPKNTYFDRLQSSFHRSISRANRFTPNS-VSAAKTLEYDIIP 89
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
Y +RISIGTPP E L +ADTGSDLIW QC+PC +CY Q SP+F+PK SSTY+ +
Sbjct: 90 GGGEYFMRISIGTPPIEVLVIADTGSDLIWVQCQPC--QECYKQKSPIFNPKQSSTYRRV 147
Query: 147 PCSSSQCASLN--QKSCSGVN----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
C + C +LN ++CS C YS SYGD SF+ G LATE +GST ++
Sbjct: 148 LCETRYCNALNSDMRACSAHGFFKACGYSYSYGDHSFTMGYLATERFIIGSTNN---SIQ 204
Query: 201 GITFGCGTNNGGLFNSKTTG-----------IVGLGG----------------------- 226
+ FGCG +NGG F+ +G I LG
Sbjct: 205 ELAFGCGNSNGGNFDEVGSGIVGLGGGSLSLISQLGTKIDNKFSYCLVPILEKSNFSLGK 264
Query: 227 ---GDISLISQMRTTIA---------------------GNQRLG---------VSTPDIV 253
GD S IS T ++ GN+RL V +I+
Sbjct: 265 IVFGDNSFISGSDTYVSTPLVSKEPETFYYLTLEAISVGNERLAYENSRNDGNVEKGNII 324
Query: 254 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGA 313
IDSGTTLTFL + L V+ +E + V+DP G +C+ ++P +T+HF A
Sbjct: 325 IDSGTTLTFLDSKLYNKLELVLEKAVEGERVSDPNGIFSICFRDKIGIELPIITVHFTDA 384
Query: 314 DVKLSRSNFFVKVSEDIVCSVFKGI-TNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
DV+L N F K ED++C F I +N + I+GN+ Q NFLVGYD+++ VSF PTDC+
Sbjct: 385 DVELKPINTFAKAEEDLLC--FTMIPSNGIAIFGNLAQMNFLVGYDLDKNCVSFMPTDCS 442
>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 267 bits (683), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 163/437 (37%), Positives = 226/437 (51%), Gaps = 98/437 (22%)
Query: 8 VFILFF--LCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRL 65
+ ILF+ LCF ++S A GFSVELIHRDS KSP Y ++ YQ + +A RS+NR
Sbjct: 6 LLILFYFSLCF-IISLSHALNNGFSVELIHRDSSKSPLYQPTQNKYQHIVNAARRSINRA 64
Query: 66 NHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
NHF + + + Q+ +IP++ YL+ S+GTPP + +ADTGSD++W QCEPC
Sbjct: 65 NHFYKTAL---TNTPQSTVIPDHGEYLMTYSVGTPPFKLYGIADTGSDIVWLQCEPC--K 119
Query: 126 QCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATE 185
+CY Q +P F P SSTYK++PCSS C S Q GNL+ +
Sbjct: 120 ECYNQTTPKFKPSKSSTYKNIPCSSDLCKSGQQ---------------------GNLSVD 158
Query: 186 TVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGG------------------- 226
T+TL S+TG ++ P GCGT+N F ++GIVGLGG
Sbjct: 159 TLTLESSTGHPISFPKTVIGCGTDNTVSFEGASSGIVGLGGGPASLITQLGSSIDAKFSY 218
Query: 227 -----------------GDISLIS--------------------QMRTTIAGNQRL---- 245
GD +++S + GN+R+
Sbjct: 219 CLLPNPVESNTTSKLNFGDTAVVSGDGVVSTPIVKKDPIVFYYLTLEAFSVGNKRIEFEG 278
Query: 246 ---GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS- 301
G +I+IDSGTTLT +P +NL S + +++ + V DPT LCYS S
Sbjct: 279 SSNGGHEGNIIIDSGTTLTVIPTDVYNNLESAVLELVKLKRVNDPTRLFNLCYSVTSDGY 338
Query: 302 QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP-----IYGNIMQTNFLVG 356
P +T HF+GADVKL + FV V++ IVC F + +P I+GN+ Q N LVG
Sbjct: 339 DFPIITTHFKGADVKLHPISTFVDVADGIVCLAFATTSAFIPSDVVSIFGNLAQQNLLVG 398
Query: 357 YDIEQQTVSFKPTDCTK 373
YD++Q+ VSFKPTDC+K
Sbjct: 399 YDLQQKIVSFKPTDCSK 415
>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 166/443 (37%), Positives = 234/443 (52%), Gaps = 83/443 (18%)
Query: 1 MATFLSCVFILFFLCFYVVSPIEAQTG---GFSVELIHRDSPKSPFYNSSETPYQRLRDA 57
MA S V ++ FL V + A TG GF+VELIHRDSPKSP YN E Y R+ D
Sbjct: 1 MAPIFSLVIVIIFLISTAV--VSAATGPDYGFTVELIHRDSPKSPMYNPLENHYHRVADT 58
Query: 58 LTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWT 117
L RS++ N+ + ++ +A I N YL+++S+GTPP +AVADTGSD+IWT
Sbjct: 59 LRRSIS------HNTGLVTNTV-EAPIYNNRGEYLMKLSVGTPPFPIIAVADTGSDIIWT 111
Query: 118 QCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCS-GVNCQYSVSYGDG 175
QCEPC + CY QD P+F+P S+TY+ + CSS C+ + SCS +C YS+SYGD
Sbjct: 112 QCEPC--TNCYQQDLPMFNPSKSTTYRKVSCSSPVCSFTGEDNSCSFKPDCTYSISYGDN 169
Query: 176 SFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
S S G+ A +T+T+GST+G+ VA P GCG +N G F++ +GIVGLG G SLI QM
Sbjct: 170 SHSQGDFAVDTLTMGSTSGRVVAFPRTAIGCGHDNAGSFDANVSGIVGLGLGPASLIKQM 229
Query: 236 RTTIAGN---------------QRL------------GVSTPDIVIDS-----GTTLTFL 263
+ + G +L VSTP + D L +
Sbjct: 230 GSAVGGKFSYCLTPIGNDDGGSNKLNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAV 289
Query: 264 PQGYNSNLLSVMSSMI--EAQPVADPTGSLELC--------------------------- 294
G N+ S +S++ +A + D +L L
Sbjct: 290 SVGRNNTFYSTANSILGGKANIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQF 349
Query: 295 --YSFNSLSQ---VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGI-TNSVPIYGNI 348
Y F + + VP + +HF GA+++L R N ++VS++++C F G N + IYGNI
Sbjct: 350 LEYCFETTTDDYKVPFIAMHFEGANLRLQRENVLIRVSDNVICLAFAGAQDNDISIYGNI 409
Query: 349 MQTNFLVGYDIEQQTVSFKPTDC 371
Q NFLVGYD+ ++SFKP +C
Sbjct: 410 AQINFLVGYDVTNMSLSFKPMNC 432
>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 431
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 171/411 (41%), Positives = 224/411 (54%), Gaps = 70/411 (17%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
GF+++LIHRDSPKSPFYNS+ET QR+R+A+ RS F+ + + S + Q+ I N
Sbjct: 25 GFTIDLIHRDSPKSPFYNSAETSSQRMRNAIRRSARSTLQFSNDDA--SPNSPQSFITSN 82
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
YL+ ISIGTPP LA+ADTGSDLIWTQC PC CY Q SPLFDPK SSTY+ +
Sbjct: 83 RGEYLMNISIGTPPVPILAIADTGSDLIWTQCNPC--EDCYQQTSPLFDPKESSTYRKVS 140
Query: 148 CSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
CSSSQC +L SCS C Y+++YGD S++ G++A +TVT+GS+ + V+L + G
Sbjct: 141 CSSSQCRALEDASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGSSGRRPVSLRNMIIG 200
Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGN---------------QRLGVSTP 250
CG N G F+ +GI+GLGGG SL+SQ+R +I G ++ T
Sbjct: 201 CGHENTGTFDPAGSGIIGLGGGSTSLVSQLRKSINGKFSYCLVPFTSETGLTSKINFGTN 260
Query: 251 DIVIDSGTTLTFL----PQGY---NSNLLSVMSSMI----------EAQPVADPTGSLEL 293
IV G T + P Y N +SV S I E V D +L L
Sbjct: 261 GIVSGDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQFTSTIFGTGEGNIVIDSGTTLTL 320
Query: 294 -------------------------------CYSFNSLSQVPEVTIHFRGADVKLSRSNF 322
CY +S +VP++T+HF+G DVKL N
Sbjct: 321 LPSNFYYELESVVASTIKAERVQDPDGILSLCYRDSSSFKVPDITVHFKGGDVKLGNLNT 380
Query: 323 FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
FV VSED+ C F + I+GN+ Q NFLVGYD TVSFK TDC++
Sbjct: 381 FVAVSEDVSCFAFAA-NEQLTIFGNLAQMNFLVGYDTVSGTVSFKKTDCSQ 430
>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 165/443 (37%), Positives = 233/443 (52%), Gaps = 83/443 (18%)
Query: 1 MATFLSCVFILFFLCFYVVSPIEAQTG---GFSVELIHRDSPKSPFYNSSETPYQRLRDA 57
MA S V ++ FL V + A TG GF+VELIHRDSPKSP YN E Y R+ D
Sbjct: 1 MAPIFSLVIVIIFLISTAV--VSAATGPDYGFTVELIHRDSPKSPMYNPLENHYHRVADT 58
Query: 58 LTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWT 117
L RS++ N+ + ++ +A I N YL+++S+GTPP +AVADTGSD+IWT
Sbjct: 59 LRRSIS------HNTGLVTNTV-EAPIYNNRGEYLMKLSVGTPPFPIIAVADTGSDIIWT 111
Query: 118 QCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCS-GVNCQYSVSYGDG 175
QC PC + CY QD P+F+P S+TY+ + CSS C+ + SCS +C YS+SYGD
Sbjct: 112 QCVPC--TNCYQQDLPMFNPSKSTTYRKVSCSSPVCSFTGEDNSCSFKPDCTYSISYGDN 169
Query: 176 SFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
S S G+ A +T+T+GST+G+ VA P GCG +N G F++ +GIVGLG G SLI QM
Sbjct: 170 SHSQGDFAVDTLTMGSTSGRVVAFPRTAIGCGHDNAGSFDANVSGIVGLGLGPASLIKQM 229
Query: 236 RTTIAGN---------------QRL------------GVSTPDIVIDS-----GTTLTFL 263
+ + G +L VSTP + D L +
Sbjct: 230 GSAVGGKFSYCLTPIGNDDGGSNKLNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAV 289
Query: 264 PQGYNSNLLSVMSSMI--EAQPVADPTGSLELC--------------------------- 294
G N+ S +S++ +A + D +L L
Sbjct: 290 SVGRNNTFYSTANSILGGKANIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQF 349
Query: 295 --YSFNSLSQ---VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGI-TNSVPIYGNI 348
Y F + + VP + +HF GA+++L R N ++VS++++C F G N + IYGNI
Sbjct: 350 LEYCFETTTDDYKVPFIAMHFEGANLRLQRENVLIRVSDNVICLAFAGAQDNDISIYGNI 409
Query: 349 MQTNFLVGYDIEQQTVSFKPTDC 371
Q NFLVGYD+ ++SFKP +C
Sbjct: 410 AQINFLVGYDVTNMSLSFKPMNC 432
>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
Length = 440
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 162/441 (36%), Positives = 218/441 (49%), Gaps = 75/441 (17%)
Query: 1 MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
+ S V L F+ +S E + G FS++LIHRDSPKSP YN SETP +RL R
Sbjct: 7 LGLLFSIVIALSFVSVAHISAAEVKNGRFSIDLIHRDSPKSPLYNPSETPAERL----DR 62
Query: 61 SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
R F++ S S + + NN YL++ISIGTPP + + DTGSDL+WTQC
Sbjct: 63 FFRRFMSFSEASI--SPNTPEPPVSSNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCL 120
Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFS 178
PC CY Q +P+FDP S+++K + C S QC L+ SCS C +S YGDGS +
Sbjct: 121 PC--LSCYKQKNPMFDPSKSTSFKEVSCESQQCRLLDTVSCSQPQKLCDFSYGYGDGSLA 178
Query: 179 NGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
G +ATET+TL S +GQ ++ I FGCG NN G FN G+ G GG +SL SQ+ +T
Sbjct: 179 QGVIATETLTLNSNSGQPXSIXNIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMST 238
Query: 239 IAGNQRLG-----------------------------VSTPDIVIDSGT----TL----- 260
+ ++ VSTP + D T TL
Sbjct: 239 LGSGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSXVVSTPLVTKDDPTYYFVTLDGISV 298
Query: 261 ------------------TFLPQGYNSNLL------SVMSSMIEAQP---VADPTGSLEL 293
F+ G LL ++ + EA P V DP +L
Sbjct: 299 GDKLFPFSSSSPMATKGNVFIDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQL 358
Query: 294 CYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNF 353
CY +L P +T HF GADV+L N F+ E + C + I I+GN +Q NF
Sbjct: 359 CYRSATLIDGPILTAHFDGADVQLKPLNTFISPKEGVYCFAMQPIDGDTGIFGNFVQMNF 418
Query: 354 LVGYDIEQQTVSFKPTDCTKQ 374
L+G+D++ + VSFK DCTKQ
Sbjct: 419 LIGFDLDGKKVSFKAVDCTKQ 439
>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 440
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 162/441 (36%), Positives = 218/441 (49%), Gaps = 75/441 (17%)
Query: 1 MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
+ S V L F+ +S E + G FS++LIHRDSPKSP YN SETP +RL R
Sbjct: 7 LGLLFSIVIALSFVSVAHISAAEVKNGRFSIDLIHRDSPKSPLYNPSETPAERL----DR 62
Query: 61 SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
R F++ S S + + NN YL++ISIGTPP + + DTGSDL+WTQC
Sbjct: 63 FFRRFMSFSEASI--SPNTPEPPVSSNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCL 120
Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFS 178
PC CY Q +P+FDP S+++K + C S QC L+ SCS C +S YGDGS +
Sbjct: 121 PC--LSCYKQKNPMFDPSKSTSFKEVSCESQQCRLLDTVSCSQPQKLCDFSYGYGDGSLA 178
Query: 179 NGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
G +ATET+TL S +GQ ++ I FGCG NN G FN G+ G GG +SL SQ+ +T
Sbjct: 179 QGVIATETLTLNSNSGQPTSILNIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMST 238
Query: 239 IAGNQRLG-----------------------------VSTPDIVIDSGT----TL----- 260
+ ++ VSTP + D T TL
Sbjct: 239 LGSGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSDVVSTPLVTKDDPTYYFVTLDGISV 298
Query: 261 ------------------TFLPQGYNSNLL------SVMSSMIEAQP---VADPTGSLEL 293
F+ G LL ++ + EA P V DP +L
Sbjct: 299 GDKLFPFSSSSPMATKGNVFIDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQL 358
Query: 294 CYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNF 353
CY +L P +T HF GADV+L N F+ E + C + I I+GN +Q NF
Sbjct: 359 CYRSATLIDGPILTAHFDGADVQLKPLNTFISPKEGVYCFAMQPIDGDTGIFGNFVQMNF 418
Query: 354 LVGYDIEQQTVSFKPTDCTKQ 374
L+G+D++ + VSFK DCTKQ
Sbjct: 419 LIGFDLDGKKVSFKAVDCTKQ 439
>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 415
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 160/433 (36%), Positives = 223/433 (51%), Gaps = 86/433 (19%)
Query: 3 TFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSL 62
+FL+ +F F CF ++S A GF++ELIHRDS KSPFY ++ Y+R+ +A+ RS+
Sbjct: 5 SFLTLLFFTIF-CF-IISLSHALNNGFTLELIHRDSSKSPFYQPTQNKYERIANAVRRSI 62
Query: 63 NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
NR+NHF + S S+ Q+ + + YL+ SIGTPP + DTGSDL+W QCEPC
Sbjct: 63 NRVNHFYKYSLTST---PQSTVNSDKGEYLMSYSIGTPPFKVFGFVDTGSDLVWLQCEPC 119
Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNL 182
QCY Q +P+FDP +SS+Y+++PC S C S+ SC G L
Sbjct: 120 --KQCYPQITPIFDPSLSSSYQNIPCLSDTCHSMRTTSCD---------------VRGYL 162
Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGN 242
+ ET+TL STTG +V+ P GCG N G F+ ++GIVGLG G +SL SQ+ T+I G
Sbjct: 163 SVETLTLDSTTGYSVSFPKTMIGCGYRNTGTFHGPSSGIVGLGSGPMSLPSQLGTSIGGK 222
Query: 243 -------------QRLGVSTPDIVIDSGTTLTFLPQ-----GYNSNL--LSVMSSMIE-- 280
+L IV G T + + GY L SV + +IE
Sbjct: 223 FSYCLGPWLPNSTSKLNFGDAAIVYGDGAMTTPIVKKDAQSGYYLTLEAFSVGNKLIEFG 282
Query: 281 ---------------------------------------AQPVADPTGSLELCYSFNSLS 301
+ V DP G+ +LCY+
Sbjct: 283 GPTYGGNEGNILIDSGTTFTFLPYDVYYRFESAVAEYINLEHVEDPNGTFKLCYNVAYHG 342
Query: 302 -QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 360
+ P +T HF+GAD+KL + F+KVS+ I C F I + I+GN+ Q N LVGY++
Sbjct: 343 FEAPLITAHFKGADIKLYYISTFIKVSDGIACLAF--IPSQTAIFGNVAQQNLLVGYNLV 400
Query: 361 QQTVSFKPTDCTK 373
Q TV+FKP DCTK
Sbjct: 401 QNTVTFKPVDCTK 413
>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 124/246 (50%), Positives = 167/246 (67%), Gaps = 6/246 (2%)
Query: 1 MATF---LSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDA 57
MA F LS + LC I A+ GF+V+LIHRDSP SPFYNS ET QR+ +A
Sbjct: 1 MAAFRSPLSFALAIALLCVSGFGCIYARKVGFTVDLIHRDSPLSPFYNSEETDLQRINNA 60
Query: 58 LTRSLNRLNHFNQNSSIS-SSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIW 116
L RS++R++HF+ ++ S S KA+++D+ N YL+ +S+GTPP + + +ADTGSDLIW
Sbjct: 61 LRRSISRVHHFDPIAAASVSPKAAESDVTSNRGEYLMSLSLGTPPFKIMGIADTGSDLIW 120
Query: 117 TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGS 176
TQC+PC +CY Q PLFDPK S TY+ C + QC+ L+Q +CSG CQY SYGD S
Sbjct: 121 TQCKPC--ERCYKQVDPLFDPKSSKTYRDFSCDARQCSLLDQSTCSGNICQYQYSYGDRS 178
Query: 177 FSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
++ GN+A++T+TL STTG V+ P GCG N G F+ K +GIVGLG G +SLISQM
Sbjct: 179 YTMGNVASDTITLDSTTGSPVSFPKTVIGCGHENDGTFSDKGSGIVGLGAGPLSLISQMG 238
Query: 237 TTIAGN 242
+++ G
Sbjct: 239 SSVGGK 244
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 66/134 (49%), Positives = 92/134 (68%)
Query: 241 GNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL 300
G+ LG +I+IDSGTTLT +P + SNL + + + +E + DP+G L +CYS S
Sbjct: 307 GDSSLGTGEGNIIIDSGTTLTIVPDDFFSNLSTAVGNQVEGRRAEDPSGFLSVCYSATSD 366
Query: 301 SQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 360
+VP +T HF GADVKL N FV+VS+D+VC F T+ + IYGN+ Q NFLV Y+I+
Sbjct: 367 LKVPAITAHFTGADVKLKPINTFVQVSDDVVCLAFASTTSGISIYGNVAQMNFLVEYNIQ 426
Query: 361 QQTVSFKPTDCTKQ 374
+++SFKPTDCTK+
Sbjct: 427 GKSLSFKPTDCTKK 440
>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 128/249 (51%), Positives = 162/249 (65%), Gaps = 12/249 (4%)
Query: 1 MATFLSCVFILFF-----LCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLR 55
MATF S +L F LC I A GF+ EL+HRDSPKSP YNS +T QR
Sbjct: 1 MATFQS---VLSFASAIALCVASFGCIYAHNAGFTTELVHRDSPKSPLYNSQQTHLQRWN 57
Query: 56 DALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLI 115
A+ RS++R++HF + ++ S K +++II N YL+ +S+GTPP E LA+ADTGSDLI
Sbjct: 58 KAMRRSVSRVHHFQRTAATVSPKEVESEIIANGGEYLMSLSLGTPPFEILAIADTGSDLI 117
Query: 116 WTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCSGVN-CQYSVSYG 173
WTQC PC +CY Q +PLFDPK S TY+ L C + QC +L + SCS CQYS YG
Sbjct: 118 WTQCTPC--DKCYKQIAPLFDPKSSKTYRDLSCDTRQCQNLGESSSCSSEQLCQYSYYYG 175
Query: 174 DGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLIS 233
D SF+NGNLA +TVTL ST G V P GCG N G F+ K +GI+GLGGG +SLIS
Sbjct: 176 DRSFTNGNLAVDTVTLPSTNGGPVYFPKTVIGCGRRNNGTFDKKDSGIIGLGGGPMSLIS 235
Query: 234 QMRTTIAGN 242
QM +++ G
Sbjct: 236 QMGSSVGGK 244
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 54/124 (43%), Positives = 79/124 (63%), Gaps = 2/124 (1%)
Query: 251 DIVIDSGTTLTFLPQGYNSNLL-SVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIH 309
+I+IDSGT+LT P + + +V +++I + D +G L CY +VP +T H
Sbjct: 317 NIIIDSGTSLTLFPVNFFTEFATAVENAVINGERTQDASGLLSHCYRPTPDLKVPVITAH 376
Query: 310 FRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 369
F GADV L N F+ +S+D++C F T S I+GN+ Q NFL+GYDI+ ++VSFKPT
Sbjct: 377 FNGADVVLQTLNTFILISDDVLCLAFNS-TQSGAIFGNVAQMNFLIGYDIQGKSVSFKPT 435
Query: 370 DCTK 373
DCT+
Sbjct: 436 DCTQ 439
>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
Length = 437
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 171/437 (39%), Positives = 226/437 (51%), Gaps = 76/437 (17%)
Query: 9 FILFFLCFYVVSPI---EAQTG--GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLN 63
F+ F L FY VS + EA GF+V+LIHRDSP SPFYN S TP QR+ +A RS++
Sbjct: 4 FVFFCLAFYSVSSLFSTEANESPSGFTVDLIHRDSPLSPFYNPSLTPSQRIINAALRSIS 63
Query: 64 RLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP 123
RLN + N ++K Q+ +I +N YL+R IGTPP ERLA ADTGSDLIW QC PC
Sbjct: 64 RLNRVS-NLLDQNNKLPQSVLILHNGEYLMRFYIGTPPVERLATADTGSDLIWVQCSPC- 121
Query: 124 PSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKSC--SGVNCQYSVSYGDG-SFS 178
+ C+ Q +PLF P SST+ C S C L QK C SG C Y+ YGD SFS
Sbjct: 122 -ASCFPQSTPLFQPLKSSTFMPTTCRSQPCTLLLPEQKGCGKSG-ECIYTYKYGDQYSFS 179
Query: 179 NGNLATETVTLGSTTG-QAVALPGITFGCGT-NNGGLFNS-KTTGIVGLGGGDISLISQM 235
G L+TET+ S G Q VA P FGCG NN +F S K TGI+GLG G +SL+SQ+
Sbjct: 180 EGLLSTETLRFDSQGGVQTVAFPNSFFGCGLYNNITVFPSYKLTGIMGLGAGPLSLVSQI 239
Query: 236 RTTIA--------------------GNQRL-----GVSTP-------------------- 250
I GN+ + VSTP
Sbjct: 240 GDQIGHKFSYCLLPLGSTSTSKLKFGNESIITGEGVVSTPMIIKPWLPTYYFLNLEAVTV 299
Query: 251 ------------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN 298
+++IDSGT LT+L + + N + + + + V D L C+ +
Sbjct: 300 AQKTVPTGSTDGNVIIDSGTLLTYLGESFYYNFAASLQESLAVELVQDVLSPLPFCFPYR 359
Query: 299 SLSQVPEVTIHFRGADVKLSRSNFFVKVSE-DIVCSVFKGITNS-VPIYGNIMQTNFLVG 356
PE+ F GA V L +N FV + + VC + + S + I+G+ Q +F V
Sbjct: 360 DNFVFPEIAFQFTGARVSLKPANLFVMTEDRNTVCLMIAPSSVSGISIFGSFSQIDFQVE 419
Query: 357 YDIEQQTVSFKPTDCTK 373
YD+E + VSF+PTDC+K
Sbjct: 420 YDLEGKKVSFQPTDCSK 436
>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 435
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 157/432 (36%), Positives = 222/432 (51%), Gaps = 71/432 (16%)
Query: 9 FILFFLCFYVVSPIEAQTG--GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLN 66
IL +S EA+ G GFSV+LIHRDSP SPFYN S TP +R+ +A RS++RL
Sbjct: 7 MILALFSLSTLSSREAREGLRGFSVDLIHRDSPSSPFYNPSLTPSERIINAALRSMSRLQ 66
Query: 67 HFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
+ + +K ++ +IP+ YL+R IG+PP ERLA+ DTGS LIW QC PC
Sbjct: 67 RVSH--FLDENKLPESLLIPDKGEYLMRFYIGSPPVERLAMVDTGSSLIWLQCSPC--HN 122
Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKSCSGV-NCQYSVSYGDGSFSNGNLA 183
C+ Q++PLF+P SSTYK C S C L +Q+ C + C Y + YGD SFS G L
Sbjct: 123 CFPQETPLFEPLKSSTYKYATCDSQPCTLLQPSQRDCGKLGQCIYGIMYGDKSFSVGILG 182
Query: 184 TETVTLGSTTG-QAVALPGITFGCGT-NNGGLFNS-KTTGIVGLGGGDISLISQMRTTIA 240
TET++ GST G Q V+ P FGCG NN ++ S K GI GLG G +SL+SQ+ I
Sbjct: 183 TETLSFGSTGGAQTVSFPNTIFGCGVDNNFTIYTSNKVMGIAGLGAGPLSLVSQLGAQIG 242
Query: 241 ---------------GNQRLG----------VSTP------------------------- 250
+ G VSTP
Sbjct: 243 HKFSYCLLPYDSTSTSKLKFGSEAIITTNGVVSTPLIIKPSLPTYYFLNLEAVTIGQKVV 302
Query: 251 -------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV 303
+IVIDSGT LT+L + +N ++ + + + + D L+ C+ + +
Sbjct: 303 STGQTDGNIVIDSGTPLTYLENTFYNNFVASLQETLGVKLLQDLPSPLKTCFPNRANLAI 362
Query: 304 PEVTIHFRGADVKLSRSNFFVKVSE-DIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQ 361
P++ F GA V L N + +++ +I+C +V + ++G+I Q +F V YD+E
Sbjct: 363 PDIAFQFTGASVALRPKNVLIPLTDSNILCLAVVPSSGIGISLFGSIAQYDFQVEYDLEG 422
Query: 362 QTVSFKPTDCTK 373
+ VSF PTDC K
Sbjct: 423 KKVSFAPTDCAK 434
>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
Length = 440
Score = 240 bits (613), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 152/411 (36%), Positives = 219/411 (53%), Gaps = 69/411 (16%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD----I 84
F+++LIH DSP SPFYNSS T Q +R+A RS++R N + + S S ++ ++ I
Sbjct: 30 FTIDLIHHDSPPSPFYNSSMTRSQLIRNAAMRSISRANQLSLSLSHSLNQLKESSPEPII 89
Query: 85 IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
IPNN NYL+RI IGTP ERLA+ADTGSDL W QC PC ++C+ Q++PL+DP SST+
Sbjct: 90 IPNNGNYLMRIYIGTPSVERLAIADTGSDLTWVQCSPCDNTKCFAQNTPLYDPLNSSTFT 149
Query: 145 SLPCSSSQCASL--NQKSCSGV-NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
LPC S C L +Q CS +C Y+ +YGD S+S G L+++++ L Q
Sbjct: 150 LLPCDSQPCTQLPYSQYVCSDYGDCIYAYTYGDNSYSYGGLSSDSIRL--MLLQLHYNSK 207
Query: 202 ITFGCGTNNGGLFNS--KTTGIVGLGGGDISLISQMRTTIA-------------GNQRLG 246
I FGCG N + KTTGIVGLG G +SL+SQ+ I N +L
Sbjct: 208 ICFGCGFQNKFTADKSGKTTGIVGLGAGPLSLVSQLGDEIGHKFSYCLLPFSSNSNSKLK 267
Query: 247 ------------VSTP-------------------------------DIVIDSGTTLTFL 263
VSTP +I+IDSG+TLT+L
Sbjct: 268 FGEAAIVQGNGVVSTPLIIKPDLPFYYLNLEGITVGAKTVKTGQTDGNIIIDSGSTLTYL 327
Query: 264 PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS-LSQVPEVTIHFRGADVKLSRSNF 322
+ + + +S++ + + + C+++ +S P+V HF G DV L N
Sbjct: 328 EESFYNEFVSLVKETVAVEEDQYIPYPFDFCFTYKEGMSTPPDVVFHFTGGDVVLKPMNT 387
Query: 323 FVKVSEDIVCS-VFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
V + ++++CS V + + I+GN+ Q +F VGYDI+ VSF PTDC+
Sbjct: 388 LVLIEDNLICSTVVPSHFDGIAIFGNLGQIDFHVGYDIQGGKVSFAPTDCS 438
>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 437
Score = 240 bits (612), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 158/440 (35%), Positives = 225/440 (51%), Gaps = 76/440 (17%)
Query: 4 FLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLN 63
F S + +LF CF VS + Q GFSVELIH S KSPFYN++E+ +QR+ + + S N
Sbjct: 3 FYSSLLLLF--CFCRVSVSKTQNNGFSVELIHPISSKSPFYNTAESHFQRMSNNMKHSTN 60
Query: 64 RLNHFNQNSSISSSKASQADIIPNNAN-YLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
R+++ N S +K + P + Y+I IGTPP + V DT +D IW QC PC
Sbjct: 61 RVHYLNHVFSFPPNKVPNIVVSPFMGDGYIISFLIGTPPFQLYGVMDTANDNIWFQCNPC 120
Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSVSYGDGSFSN 179
P C+ SP+FDP SSTYK++PCSS +C ++ CS + C+YS +YG ++S
Sbjct: 121 KP--CFNTTSPMFDPSKSSTYKTIPCSSPKCKNVENTHCSSDDKKVCEYSFTYGGEAYSQ 178
Query: 180 GNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
G+L+ +T+TL S ++ I GCG N G +G +GLG G +S ISQ+ ++I
Sbjct: 179 GDLSIDTLTLNSNNDTPISFKNIVIGCGHRNKGPLEGYVSGNIGLGRGPLSFISQLNSSI 238
Query: 240 AG-----------NQRL-------------GVSTPDIVIDSG-----TTLTFLPQG---- 266
G N+ + GV T I +G TTL L G
Sbjct: 239 GGKFSYCLVPLFSNEGISGKLHFGDKSVVSGVGTVSTPITAGEIGYSTTLNALSVGDHII 298
Query: 267 -------YNSNL----------LSVM------------SSMIEAQPVADPTGSLELCY-- 295
N NL L+++ +SM++ + P +LCY
Sbjct: 299 KFENSTSKNDNLGNTIIDSGTTLTILPENVYSRLESIVTSMVKLERAKSPNQQFKLCYKA 358
Query: 296 SFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP--IYGNIMQTNF 353
+ +L VP +T HF GADV L+ N F + ++VC F + N P I GNI Q NF
Sbjct: 359 TLKNL-DVPIITAHFNGADVHLNSLNTFYPIDHEVVCFAFVSVGN-FPGTIIGNIAQQNF 416
Query: 354 LVGYDIEQQTVSFKPTDCTK 373
LVG+D+++ +SFKPTDCTK
Sbjct: 417 LVGFDLQKNIISFKPTDCTK 436
>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 155/440 (35%), Positives = 228/440 (51%), Gaps = 80/440 (18%)
Query: 1 MATFLSCVF--ILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDAL 58
MA +S F ILF + F + I G F+ L HRDS SP SS + Y RL +A
Sbjct: 1 MAATISLFFHLILFLISFSQTTIINGDNG-FTTSLFHRDSLLSPLEFSSLSHYDRLANAF 59
Query: 59 TRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
RSL+R ++ S + Q+ I P + YL+ +SIGTPP + L +ADTGSDL W Q
Sbjct: 60 RRSLSRSAALLNRAATSGAVGLQSSIGPGSGEYLMSVSIGTPPVDYLGIADTGSDLTWAQ 119
Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGS 176
C PC +CY Q P+F+P S+++ +PC++ C +++ C GV C YS +YGD +
Sbjct: 120 CLPCL--KCYQQLRPIFNPLKSTSFSHVPCNTQTCHAVDDGHC-GVQGVCDYSYTYGDRT 176
Query: 177 FSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
+S G+L E +T+GS++ ++V GCG + G F +G++GLGGG +SL+SQM
Sbjct: 177 YSKGDLGFEKITIGSSSVKSV------IGCGHASSGGFGF-ASGVIGLGGGQLSLVSQMS 229
Query: 237 TTIAGNQRLG---------------------------VSTP------------------- 250
T ++R VSTP
Sbjct: 230 QTSGISRRFSYCLPTLLSHANGKINFGENAVVSGPGVVSTPLISKNTVTYYYITLEAISI 289
Query: 251 ------------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY--S 296
+++IDSGTTLT LP+ ++S + +++A+ V DP GSL+LC+
Sbjct: 290 GNERHMAFAKQGNVIIDSGTTLTILPKELYDGVVSSLLKVVKAKRVKDPHGSLDLCFDDG 349
Query: 297 FNSLSQ--VPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGI--TNSVPIYGNIMQT 351
N+ + +P +T HF GA+V L N F KV++++ C K T I GN+ Q
Sbjct: 350 INAAASLGIPVITAHFSGGANVNLLPINTFRKVADNVNCLTLKAASPTTEFGIIGNLAQA 409
Query: 352 NFLVGYDIEQQTVSFKPTDC 371
NFL+GYD+E + +SFKPT C
Sbjct: 410 NFLIGYDLEAKRLSFKPTVC 429
>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 145/439 (33%), Positives = 228/439 (51%), Gaps = 74/439 (16%)
Query: 1 MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
M+ F I F+LC ++ A G S+E+IHRD KSP Y+ + T +QR + + R
Sbjct: 1 MSRFSVLTLIFFYLCCFIYFS-HASKKGLSIEMIHRDFSKSPLYHPTVTKFQRAYNVVHR 59
Query: 61 SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
S+NR+N+F + S++ ++ + + P YLI S+GTPP + DTGS+++W QC+
Sbjct: 60 SINRVNYFTKEFSLNKNQPV-STLTPELGEYLISYSVGTPPFKVYGFMDTGSNIVWLQCQ 118
Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK--SCS--GVNCQYSVSYGDGS 176
PC + C+ Q SP+F+P SS+YK++PC+SS C N SCS G C+YS++YG +
Sbjct: 119 PC--NTCFNQTSPIFNPSKSSSYKNIPCTSSTCKDTNDTHISCSNGGDVCEYSITYGGDA 176
Query: 177 FSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
S G+L+ +++TL ST+G +V P I GCG N NS+++G+VG+G G +SLI Q+
Sbjct: 177 KSQGDLSNDSLTLDSTSGSSVLFPNIVIGCGHINVLQDNSQSSGVVGMGRGPMSLIKQVG 236
Query: 237 TTIAGNQ----------------------------RLGVSTPDIVIDSGTTLTFL----- 263
++ G++ + VSTP + ++ FL
Sbjct: 237 SSSVGSKFSYCLIPYNSDSNSSSKLIFGEDVVVSGEIVVSTPMVKVNGQENYYFLTLEAF 296
Query: 264 -------------------------------PQGYNSNLLSVMSSMIEAQPVADPTGSLE 292
P + S L+S ++ ++ + P L
Sbjct: 297 SVGNNRIEYGERSNASTQNILIDSGTPLTMLPNLFLSKLVSYVAQEVKLPRIEPPDHHLS 356
Query: 293 LCYSFNSLS-QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQT 351
LCY+ VP++T HF GADVKL+ + F + I+C F +N + I+GNI Q
Sbjct: 357 LCYNTTGKQLNVPDITAHFNGADVKLNSNGTFFPFEDGIMCFGFIS-SNGLEIFGNIAQN 415
Query: 352 NFLVGYDIEQQTVSFKPTD 370
N L+ YD+E++ +SFKPTD
Sbjct: 416 NLLIDYDLEKEIISFKPTD 434
>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 412
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 151/429 (35%), Positives = 215/429 (50%), Gaps = 86/429 (20%)
Query: 9 FILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF 68
F+L CF +S + Q GF+VELIH S +SPFYN ET QR+ L S+NR+ +
Sbjct: 7 FVLLLFCFCRLSLTKTQNHGFNVELIHPISSRSPFYNPKETQIQRISSILNYSINRVRYL 66
Query: 69 NQNSSISSSKASQADIIP-NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
N S S +K + A Y++ SIGTPP + ++ DTG+D IW QC+PC P C
Sbjct: 67 NHVFSFSPNKIQDVPLSSFMGAGYVMSYSIGTPPFQLYSLIDTGNDNIWFQCKPCKP--C 124
Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETV 187
Q SP+F P SSTYK++PC+S C + DG + L +T+
Sbjct: 125 LNQTSPMFHPSKSSTYKTIPCTSPICKN-----------------ADGHY----LGVDTL 163
Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGN----- 242
TL S G ++ I GCG N G +G +GL G +S ISQ+ ++I G
Sbjct: 164 TLNSNNGTPISFKNIVIGCGHRNQGPLEGYVSGNIGLARGPLSFISQLNSSIGGKFSYCL 223
Query: 243 ---------------------QRLG-VSTP-----------------DIVI--------- 254
LG VSTP D +I
Sbjct: 224 VPLFSKENVSSKLHFGDKSTVSGLGTVSTPIKEENGYFVSLEAFSVGDHIIKLENSDNRG 283
Query: 255 ----DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS---LSQVPEVT 307
DSGTT+T LP+ S L SV+ M++ + V DP+ LCY S L++V +T
Sbjct: 284 NSIIDSGTTMTILPKDVYSRLESVVLDMVKLKRVKDPSQQFNLCYQTTSTTLLTKVLIIT 343
Query: 308 IHFRGADVKLSRSNFFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
HF G++V L+ N F ++++++C F G +S+ I+GN++Q NFLVG+D+ ++T+S
Sbjct: 344 AHFSGSEVHLNALNTFYPITDEVICFAFVSGGNFSSLAIFGNVVQQNFLVGFDLNKKTIS 403
Query: 366 FKPTDCTKQ 374
FKPTDCTK
Sbjct: 404 FKPTDCTKH 412
>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 443
Score = 234 bits (596), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 151/435 (34%), Positives = 209/435 (48%), Gaps = 105/435 (24%)
Query: 9 FILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF 68
+ L ++ IEA G F+V+LI R NSS+ + R+
Sbjct: 9 LLAILLLVFIFPSIEAHNGRFTVKLIPR--------NSSQVLFNRI-------------- 46
Query: 69 NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
+Q + ++ +YL+ +SIGTPP + A DTGSDLIW QC PC + CY
Sbjct: 47 ----------TAQTPVSVHHYDYLMELSIGTPPVKTYAQVDTGSDLIWLQCIPC--TNCY 94
Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATET 186
Q +P+FDP+ SSTY ++ S C+ L SCS NC Y+ SY D S + G LA ET
Sbjct: 95 KQLNPMFDPQSSSTYSNIAYGSESCSKLYSTSCSPDQNNCNYTYSYEDDSITEGVLAQET 154
Query: 187 VTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQ--- 243
+TL STTG+ VAL G+ FGCG NN G+FN K GI+GLG G +SL+SQ+ ++ G
Sbjct: 155 LTLTSTTGKPVALKGVIFGCGHNNNGVFNDKEMGIIGLGRGPLSLVSQIGSSFGGKMFSQ 214
Query: 244 ----------------------------------------------RLGVSTPDI----- 252
LG+S DI
Sbjct: 215 CLVPFHTNPSITSPMSFGKGSEVLGNGVVSTPLVSKNTHQAFYFVTLLGISVEDINLPFN 274
Query: 253 -------------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA-DPTGSLELCYSFN 298
VIDSGT T LP+ + L+ + + + P+ DPT +LCY
Sbjct: 275 DGSSLEPITKGNMVIDSGTPTTLLPEDFYHRLVEEVRNKVALDPIPIDPTLGYQLCYRTP 334
Query: 299 SLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKG-ITNSVPIYGNIMQTNFLVGY 357
+ + +T HF GADV L+ + F+ V + I C F +N IYGN Q+N+L+G+
Sbjct: 335 TNLKGTTLTAHFEGADVLLTPTQIFIPVQDGIFCFAFTSTFSNEYGIYGNHAQSNYLIGF 394
Query: 358 DIEQQTVSFKPTDCT 372
D+E+Q VSFK TDCT
Sbjct: 395 DLEKQLVSFKATDCT 409
>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 436
Score = 231 bits (590), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 154/433 (35%), Positives = 214/433 (49%), Gaps = 79/433 (18%)
Query: 14 LCFYVVSPI---EAQTG--GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF 68
L Y++S + E G GFS++LIHRDSP SPFY S TP R+ + RS+ +LN
Sbjct: 9 LALYLLSTVSSREVSEGQRGFSIDLIHRDSPLSPFYKPSLTPSDRIINTALRSIYQLNR- 67
Query: 69 NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
+S ++ K + IPN+ YL+R IGTPP ERLA+ADT SDLIW QC PC C+
Sbjct: 68 ASHSDLNEKKTLERVRIPNHGEYLMRFYIGTPPVERLAIADTASDLIWVQCSPC--ETCF 125
Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATET 186
QD+PLF+P SST+ +L C S C S N C V C Y+ +YGDGS + G L TE+
Sbjct: 126 PQDTPLFEPHKSSTFANLSCDSQPCTSSNIYYCPLVGNLCLYTNTYGDGSSTKGVLCTES 185
Query: 187 VTLGSTTGQAVALPGITFGCGTNNGGL--FNSKTTGIVGLGGGDISLISQM--------- 235
+ GS Q V P FGCG+NN + ++K TGIVGLG G +SL+SQ+
Sbjct: 186 IHFGS---QTVTFPKTIFGCGSNNDFMHQISNKVTGIVGLGAGPLSLVSQLGDQIGHKFS 242
Query: 236 ------------------RTTIAGN----------------------------QRLGVST 249
TTI GN + L V T
Sbjct: 243 YCLLPFTSTSTIKLKFGNDTTITGNGVVSTPLIIDPHYPSYYFLHLVGITIGQKMLQVRT 302
Query: 250 PD-----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTG-SLELCYSFNSLSQV 303
D I+ID GT LT+L + N ++++ + D + C+ +
Sbjct: 303 TDHTNGNIIIDLGTVLTYLEVNFYHNFVTLLREALGISETKDDIPYPFDFCFPNQANITF 362
Query: 304 PEVTIHFRGADVKLSRSNFFVKVSE-DIVC-SVFKGI-TNSVPIYGNIMQTNFLVGYDIE 360
P++ F GA V LS N F + + +++C +V ++GN+ Q +F V YD +
Sbjct: 363 PKIVFQFTGAKVFLSPKNLFFRFDDLNMICLAVLPDFYAKGFSVFGNLAQVDFQVEYDRK 422
Query: 361 QQTVSFKPTDCTK 373
+ VSF P DC+K
Sbjct: 423 GKKVSFAPADCSK 435
>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 439
Score = 231 bits (590), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 117/240 (48%), Positives = 163/240 (67%), Gaps = 8/240 (3%)
Query: 4 FLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLN 63
F + V + F F ++ A+ GGFSV+LIHRDSP SPF++ S+T +RL DA RS++
Sbjct: 9 FFNVVVVGFL--FQLLEVALARGGGFSVDLIHRDSPHSPFFDPSKTQAERLTDAFRRSVS 66
Query: 64 RLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP 123
R+ F + +S Q+ I+P+ YL+ + IGTPP +A+ DTGSDL WTQC PC
Sbjct: 67 RVGRFRPTAM--TSDGIQSRIVPSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPC- 123
Query: 124 PSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCSG-VNCQYSVSYGDGSFSNGN 181
+ CY Q PLFDPK SSTY+ C +S C +L + +SCS C + SY DGSF+ GN
Sbjct: 124 -THCYKQVVPLFDPKNSSTYRDSSCGTSFCLALGKDRSCSKEKKCTFRYSYADGSFTGGN 182
Query: 182 LATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
LA+ET+T+ ST G+ V+ PG FGCG ++GG+F+ ++GIVGLGGG++SLISQ+++TI G
Sbjct: 183 LASETLTVDSTAGKPVSFPGFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKSTING 242
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 52/133 (39%), Positives = 83/133 (62%), Gaps = 1/133 (0%)
Query: 242 NQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS 301
+++ V +I++DSGTT TFLPQ + S L +++ I+ + V DP G LCY+ +
Sbjct: 308 SKKTEVEEGNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFSLCYNTTAEI 367
Query: 302 QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQ 361
P +T HF+ A+V+L N F+++ ED+VC T+ + + GN+ Q NFLVG+D+ +
Sbjct: 368 NAPIITAHFKDANVELQPLNTFMRMQEDLVCFTV-APTSDIGVLGNLAQVNFLVGFDLRK 426
Query: 362 QTVSFKPTDCTKQ 374
+ VSFK DCT+
Sbjct: 427 KRVSFKAADCTQH 439
>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 439
Score = 227 bits (578), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 104/218 (47%), Positives = 148/218 (67%), Gaps = 5/218 (2%)
Query: 1 MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
M T + ++ C Y +S ++A GGFSVE+IHRDS +SP Y +ETP+QR+ +A+ R
Sbjct: 3 MITRYCSLALVLLWCLYNISFLKANDGGFSVEMIHRDSSRSPLYRPTETPFQRVANAVRR 62
Query: 61 SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
S+NR NHF + + S+ ++++ ++ + YL+R S+G+PP + L + DTGSD++W QCE
Sbjct: 63 SINRGNHFKK--AFVSTDSAESTVVASQGEYLMRYSVGSPPFQVLGIVDTGSDILWLQCE 120
Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSN 179
PC CY Q +P+FDP S TYK+LPCSS+ C SL +CS N C+YS+ YGDGS S+
Sbjct: 121 PC--EDCYKQTTPIFDPSKSKTYKTLPCSSNTCESLRNTACSSDNVCEYSIDYGDGSHSD 178
Query: 180 GNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSK 217
G+L+ ET+TLGST G +V P GCG NNGG F +
Sbjct: 179 GDLSVETLTLGSTDGSSVHFPKTVIGCGHNNGGTFQEE 216
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 61/125 (48%), Positives = 84/125 (67%), Gaps = 4/125 (3%)
Query: 251 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIH 309
+I+IDSGTTLT LPQ NL S +S +I+ + DP+ L LCY S +P +T H
Sbjct: 316 NIIIDSGTTLTLLPQEDYLNLESAVSDVIKLERARDPSKLLSLCYKTTSDELDLPVITAH 375
Query: 310 FRGADVKLSRSNFFVKVSEDIVCSVFKGITNSV-PIYGNIMQTNFLVGYDIEQQTVSFKP 368
F+GADV+L+ + FV V + +VC F I++ + I+GN+ Q N LVGYD+ ++TVSFKP
Sbjct: 376 FKGADVELNPISTFVPVEKGVVCFAF--ISSKIGAIFGNLAQQNLLVGYDLVKKTVSFKP 433
Query: 369 TDCTK 373
TDCTK
Sbjct: 434 TDCTK 438
>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 441
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 114/241 (47%), Positives = 159/241 (65%), Gaps = 8/241 (3%)
Query: 4 FLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLN 63
F + V + F F+++ A GGFSV+LIHRDSP SPF++ S+T +RL DA RS +
Sbjct: 9 FFNVVVVGFL--FHLLEVGLASGGGFSVDLIHRDSPHSPFFDPSKTRTERLTDAFHRSAS 66
Query: 64 RLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP 123
R+ F Q S +S Q+ ++P+ Y++ +SIGTPP +A+ DTGSDL WTQC PC
Sbjct: 67 RVGRFRQ--SAMTSDGIQSRLVPSAGEYIMNLSIGTPPVPVIAIVDTGSDLTWTQCRPC- 123
Query: 124 PSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL-NQKSC-SGVNCQYSVSYGDGSFSNGN 181
+ CY Q P FDPK SSTY+ C +S C +L N +SC +G C + SY DGSF+ GN
Sbjct: 124 -THCYKQVVPFFDPKNSSTYRDSSCGTSFCLALGNDRSCRNGKKCTFMYSYADGSFTGGN 182
Query: 182 LATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
LA ET+T+ ST G+ V+ PG FGC +GG+F+ ++GIVGLG ++S+ISQ+++TI G
Sbjct: 183 LAVETLTVASTAGKPVSFPGFAFGCVHRSGGIFDEHSSGIVGLGVAELSMISQLKSTING 242
Query: 242 N 242
Sbjct: 243 R 243
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 54/145 (37%), Positives = 89/145 (61%), Gaps = 7/145 (4%)
Query: 232 ISQMRTTIAG-NQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS 290
+ + R + G +++ V +I++DSGTT T+LP + L ++ I+ + V DP G
Sbjct: 298 VGKKRLSYKGFSKKAEVEEGNIIVDSGTTYTYLPLEFYVKLEESVAHSIKGKRVRDPNGI 357
Query: 291 LELCYSFNSLSQV--PEVTIHFRGADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGN 347
LCY+ ++ Q+ P +T HF+ A+V+L N F+++ ED+VC +V T+ + I GN
Sbjct: 358 SSLCYN-TTVDQIDAPIITAHFKDANVELQPWNTFLRMQEDLVCFTVLP--TSDIGILGN 414
Query: 348 IMQTNFLVGYDIEQQTVSFKPTDCT 372
+ Q NFLVG+D+ ++ VSFK DCT
Sbjct: 415 LAQVNFLVGFDLRKKRVSFKAADCT 439
>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 223 bits (569), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 121/241 (50%), Positives = 159/241 (65%), Gaps = 12/241 (4%)
Query: 5 LSCVFILFFLC--FYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSL 62
++ VF L FL V S + A+ GF+VELIHRDSPKSP YNSSET + R+ +AL RS
Sbjct: 1 MAPVFSLLFLISTASVFSAVTARDYGFTVELIHRDSPKSPMYNSSETHFDRIVNALRRSS 60
Query: 63 NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
+R N+ + S ++A I N YL+ IS+GTPP +AVADTGSD+IWTQC+PC
Sbjct: 61 HR------NTVVLESDTAEAPIFNNGGEYLVEISVGTPPFSIVAVADTGSDVIWTQCKPC 114
Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCA-SLNQKSCSG-VNCQYSVSYGDGSFSNG 180
S CY Q++P+FDP S+TYK++ CSS C+ S + SCS C YS++YGD S S G
Sbjct: 115 --SNCYQQNAPMFDPSKSTTYKNVACSSPVCSYSGDGSSCSDDSECLYSIAYGDDSHSQG 172
Query: 181 NLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
NLA +TVT+ ST+G+ VA P GCG +N G FN+ +GIVGLG G SL++Q+
Sbjct: 173 NLAVDTVTMQSTSGRPVAFPRTVIGCGHDNAGTFNANVSGIVGLGRGPASLVTQLGPATG 232
Query: 241 G 241
G
Sbjct: 233 G 233
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 57/133 (42%), Positives = 84/133 (63%), Gaps = 3/133 (2%)
Query: 241 GNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL 300
G +LG + +I+IDSGTTLT+LP ++ S +S + DP+ L+ C++ +
Sbjct: 300 GASKLGGES-NIIIDSGTTLTYLPSALLNSFGSAISQSMSLPHAQDPSEFLDYCFATTTD 358
Query: 301 S-QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGI-TNSVPIYGNIMQTNFLVGYD 358
++P VT+HF GADV L R N FV++S+D +C F +++ IYGNI Q+NFLVGYD
Sbjct: 359 DYEMPPVTMHFEGADVPLQRENLFVRLSDDTICLAFGSFPDDNIFIYGNIAQSNFLVGYD 418
Query: 359 IEQQTVSFKPTDC 371
I+ VSF+P C
Sbjct: 419 IKNLAVSFQPAHC 431
>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 146/427 (34%), Positives = 221/427 (51%), Gaps = 68/427 (15%)
Query: 10 ILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFN 69
+LFF +++S + FS ELIHRDS KSP Y ++ +Q + +A RS+NR N
Sbjct: 9 LLFFSLCFIISFSHSLRNSFSFELIHRDSSKSPLYKPAQNKFQHVVNAARRSINRANRLF 68
Query: 70 QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM 129
++S S ++ + N YL+ S+GTPP V DTGSD++W QC+PC QCY
Sbjct: 69 KDSL---SNTPESTVYVNGGEYLMTYSVGTPPFNVYGVVDTGSDIVWLQCKPC--EQCYK 123
Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVT 188
Q +P+F+P SS+YK++PCSS+ C S+ SC+ N C+Y++++ D S+S G L+ ET+T
Sbjct: 124 QTTPIFNPSKSSSYKNIPCSSNLCQSVRYTSCNKQNSCEYTINFSDQSYSQGELSVETLT 183
Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG------- 241
L STTG +V+ P GCG NN G+F +T+GIVGLG G +SL +Q++++I G
Sbjct: 184 LDSTTGHSVSFPKTVIGCGHNNRGMFQGETSGIVGLGIGPVSLTTQLKSSIGGKFSYCLL 243
Query: 242 ------NQRLGVSTPDIVIDSGTTLT---FL---PQGY---NSNLLSVMSSMIEAQPVAD 286
N+ ++ D + SG + F+ PQ + SV + IE + + D
Sbjct: 244 PLLVDSNKTSKLNFGDAAVVSGDGVVSTPFVKKDPQAFYYLTLEAFSVGNKRIEFEVLDD 303
Query: 287 PTGSLELCYSFNSLSQVP-EVTIHFRGAD---VKLSRS-------NFFVKVSED-----I 330
+ S +L+ +P V + A VKL R N ++ D I
Sbjct: 304 SEEGNIILDSGTTLTLLPSHVYTNLESAVAQLVKLDRVDDPNQLLNLCYSITSDQYDFPI 363
Query: 331 VCSVFKGIT---NSVPIYGNI---------------------MQTNFLVGYDIEQQTVSF 366
+ + FKG N + + ++ Q N LVGYD++Q VSF
Sbjct: 364 ITAHFKGADIKLNPISTFAHVADGVVCLAFTSSQTGPIFGNLAQLNLLVGYDLQQNIVSF 423
Query: 367 KPTDCTK 373
KP+DC K
Sbjct: 424 KPSDCIK 430
>gi|255566008|ref|XP_002523992.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536719|gb|EEF38360.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 221 bits (563), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 153/440 (34%), Positives = 219/440 (49%), Gaps = 76/440 (17%)
Query: 3 TFLSCVFILFFLCFYV--VSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
T LS + FL + S ++A+ F+ ELIHRDSP SP +N+SET RL +A+ R
Sbjct: 9 TLLSFALSIIFLTVSMSGFSLVQAEKLSFTTELIHRDSPNSPLFNASETTDIRLANAVER 68
Query: 61 SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQC- 119
S +R+N FN S S + A I+ +N ++L++ISIG PPTE L TGSDL+W C
Sbjct: 69 SADRVNRFNDLISNSITAAEFPSIL-DNGDFLMKISIGIPPTELLVNVATGSDLVWIPCL 127
Query: 120 --EPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVS-YGDGS 176
+PC C D FDP SSTYK++PC S +C N +C +C YS S
Sbjct: 128 SFKPC-THNC---DLRFFDPMESSTYKNVPCDSYRCQITNAATCQFSDCFYSCDPRHQDS 183
Query: 177 FSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
+G+LA +T+TL STTG++ LP F CG GG + GI+GLG G +SL++++
Sbjct: 184 CPDGDLAMDTLTLNSTTGKSFMLPNTGFICGNRIGG--DYPGVGILGLGHGSLSLLNRIS 241
Query: 237 TTIAG-----------NQRLGVSTPDIVIDSGTTL------------------------- 260
I G NQ +S D + SG+ +
Sbjct: 242 HLIDGKFSHCIVPYSSNQTSKLSFGDKAVVSGSAMFSTRLDMTGGPYSYTLSFYGISVGN 301
Query: 261 -------------------------TFLPQGYNSNLLSVMSSMIEAQPV-ADPTGSLELC 294
T+ P+ + S L + I+ +P+ DPT L LC
Sbjct: 302 KSISAGGIGSDYYMNGLGMDSGTMFTYFPEYFYSQLEYDVRYAIQQEPLYPDPTRRLRLC 361
Query: 295 YSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFK-GITNSVPIYGNIMQTNF 353
Y ++ P +T+HF G V+LS SN F++++EDIVC F + ++G QTN
Sbjct: 362 YRYSPDFSPPTITMHFEGGSVELSSSNSFIRMTEDIVCLAFATSSSEQDAVFGYWQQTNL 421
Query: 354 LVGYDIEQQTVSFKPTDCTK 373
L+GYD++ +SF TDCTK
Sbjct: 422 LIGYDLDAGFLSFLKTDCTK 441
>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 407
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 138/375 (36%), Positives = 192/375 (51%), Gaps = 76/375 (20%)
Query: 70 QNSSISSSKAS--QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
+NSS S K S Q+ + + YL+ +SIGTPP + A ADTGSDL+W QC PC ++C
Sbjct: 37 RNSSHDSYKPSTIQSPVSAYDCEYLMELSIGTPPIKIYAEADTGSDLVWFQCIPC--TKC 94
Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATE 185
Y Q +P+FDP+ SS+Y ++ C + C L+ CS C Y+ SY D S + G LA E
Sbjct: 95 YKQQNPMFDPRSSSSYTNITCGTESCNKLDSSLCSTDQKTCNYTYSYADNSITQGVLAQE 154
Query: 186 TVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA----- 240
T+TL STTG+ VA GI FGCG NN G FN + G++GLG G +SLISQ+ +++
Sbjct: 155 TLTLTSTTGEPVAFQGIIFGCGHNNSG-FNDREMGLIGLGRGPLSLISQIGSSLGAGGNM 213
Query: 241 ----------------------GNQRLG---VSTPDI----------------------- 252
G++ LG VSTP I
Sbjct: 214 FSQCLVPFNTDPSITSQMNFGKGSEVLGNGTVSTPLISKDGTGYFATLLGISVEDINLPF 273
Query: 253 --------------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN 298
+IDSGTT+T+LP+ + L+ + + + +P ELCY
Sbjct: 274 SNGSSLGTITKGNILIDSGTTITYLPEEFYHRLIEQVRNKVALEPFR--IDGYELCYQTP 331
Query: 299 SLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYD 358
+ P +TIHF G DV L+ + F+ V +D C YGN Q+N+L+G+D
Sbjct: 332 TNLNGPTLTIHFEGGDVLLTPAQMFIPVQDDNFCFAVFDTNEEYVTYGNYAQSNYLIGFD 391
Query: 359 IEQQTVSFKPTDCTK 373
+E+Q VSFK TDCTK
Sbjct: 392 LERQVVSFKATDCTK 406
>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 437
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 120/217 (55%), Positives = 152/217 (70%), Gaps = 8/217 (3%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
GF+ +LIHRDSPKSPFYN ET QRLR+A+ RS+NR+ HF + + + Q D+ N
Sbjct: 30 GFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNRVFHFTEKDN---TPQPQIDLTSN 86
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ YL+ +SIGTPP +A+ADTGSDL+WTQC PC CY Q PLFDPK SSTYK +
Sbjct: 87 SGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPC--DDCYTQVDPLFDPKTSSTYKDVS 144
Query: 148 CSSSQCASL-NQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
CSSSQC +L NQ SCS + C YS+SYGD S++ GN+A +T+TLGS+ + + L I
Sbjct: 145 CSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIII 204
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
GCG NN G FN K +GIVGLGGG +SLI Q+ +I G
Sbjct: 205 GCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDG 241
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 69/123 (56%), Positives = 85/123 (69%), Gaps = 1/123 (0%)
Query: 251 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHF 310
+I+IDSGTTLT LP + S L ++S I+A+ DP L LCYS +VP +T+HF
Sbjct: 315 NIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATGDLKVPVITMHF 374
Query: 311 RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
GADVKL SN FV+VSED+VC F+G + S IYGN+ Q NFLVGYD +TVSFKPTD
Sbjct: 375 DGADVKLDSSNAFVQVSEDLVCFAFRG-SPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTD 433
Query: 371 CTK 373
C K
Sbjct: 434 CAK 436
>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
Length = 438
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 120/217 (55%), Positives = 152/217 (70%), Gaps = 8/217 (3%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
GF+ +LIHRDSPKSPFYN ET QRLR+A+ RS+NR+ HF + + + Q D+ N
Sbjct: 30 GFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNRVFHFTEKDN---TPQPQIDLTSN 86
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ YL+ +SIGTPP +A+ADTGSDL+WTQC PC CY Q PLFDPK SSTYK +
Sbjct: 87 SGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPC--DDCYTQVDPLFDPKTSSTYKDVS 144
Query: 148 CSSSQCASL-NQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
CSSSQC +L NQ SCS + C YS+SYGD S++ GN+A +T+TLGS+ + + L I
Sbjct: 145 CSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIII 204
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
GCG NN G FN K +GIVGLGGG +SLI Q+ +I G
Sbjct: 205 GCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDG 241
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 69/123 (56%), Positives = 85/123 (69%), Gaps = 1/123 (0%)
Query: 251 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHF 310
+I+IDSGTTLT LP + S L ++S I+A+ DP L LCYS +VP +T+HF
Sbjct: 315 NIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATGDLKVPVITMHF 374
Query: 311 RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
GADVKL SN FV+VSED+VC F+G + S IYGN+ Q NFLVGYD +TVSFKPTD
Sbjct: 375 DGADVKLDSSNAFVQVSEDLVCFAFRG-SPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTD 433
Query: 371 CTK 373
C K
Sbjct: 434 CAK 436
>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 420
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 141/440 (32%), Positives = 212/440 (48%), Gaps = 87/440 (19%)
Query: 1 MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
M + + + +C ++ + T GFSV LI ++S ++ P +RL +
Sbjct: 1 MVVYPTSFHLATIICLMLLPLHISATEGFSVNLIRKNSS-----HAHVLPLRRLMEL--- 52
Query: 61 SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
S++ + Q+ I +YL+ +SIGTPP + +ADTGSDL WT C
Sbjct: 53 -----------SAMEKTLTPQSPIYAYLGHYLMELSIGTPPFKIYGIADTGSDLTWTSCV 101
Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCS-GVNCQYSVSYGDGSFSN 179
PC + CY Q +P+FDP+ S+TY+++ C S C L+ CS C Y+ +Y + +
Sbjct: 102 PC--NNCYKQRNPMFDPQKSTTYRNISCDSKLCHKLDTGVCSPQKRCNYTYAYASAAITR 159
Query: 180 GNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGG------------ 227
G LA ET+TL ST G++V L GI FGCG NN G FN GI+GLGGG
Sbjct: 160 GVLAQETITLSSTKGKSVPLKGIVFGCGHNNTGGFNDHEMGIIGLGGGPVSLISQMGSSF 219
Query: 228 --------------DISLISQMR--------------TTIAGNQR--------LGVSTPD 251
D+S+ S+M T + Q LG+S +
Sbjct: 220 GGKRFSQCLVPFHTDVSVSSKMSFGKGSKVSGKGVVSTPLVAKQDKTPYFVTLLGISVEN 279
Query: 252 ----------------IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVAD-PTGSLELC 294
+ +DSGT T LP +++ + S + +PV D P +LC
Sbjct: 280 TYLHFNGSSQNVEKGNMFLDSGTPPTILPTQLYDQVVAQVRSEVAMKPVTDDPDLGPQLC 339
Query: 295 YSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFL 354
Y + + P +T HF GADVKLS + F+ + + C F ++ +YGN Q+N+L
Sbjct: 340 YRTKNNLRGPVLTAHFEGADVKLSPTQTFISPKDGVFCLGFTNTSSDGGVYGNFAQSNYL 399
Query: 355 VGYDIEQQTVSFKPTDCTKQ 374
+G+D+++Q VSFKP DCTK
Sbjct: 400 IGFDLDRQVVSFKPKDCTKH 419
>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 141/430 (32%), Positives = 216/430 (50%), Gaps = 76/430 (17%)
Query: 9 FILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF 68
IL + F + I G F+ L HRDS SP SS + Y RL +A RSL+R
Sbjct: 11 LILLLISFSQTTIINGDNG-FTTSLFHRDSLLSPLEFSSLSHYDRLTNAFRRSLSRSATL 69
Query: 69 NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
++ + + QA + P + YL+ +SIGTPP + + +ADTGSDL+W QC PC +CY
Sbjct: 70 LNRAATNGALDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQCLPC--LKCY 127
Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETV 187
Q P+FDP S+++ +PC+S C +++ C C YS +YGD +++ G+L E +
Sbjct: 128 KQSRPIFDPLKSTSFSHVPCNSQNCKAIDDSHCGAQGVCDYSYTYGDQTYTKGDLGFEKI 187
Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLG- 246
T+GS++ ++V GCG + G +G++GLGGG +SL+SQM T ++R
Sbjct: 188 TIGSSSVKSV------IGCG-HESGGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSY 240
Query: 247 --------------------------VSTP------------------------------ 250
VSTP
Sbjct: 241 CLPTLLSHANGKINFGQNAVVSGPGVVSTPLISKNPVTYYYVTLEAISIGNERHMASAKQ 300
Query: 251 -DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY----SFNSLSQVPE 305
+++IDSGTTL+FLP+ ++S + +++A+ V DP +LC+ + + S +P
Sbjct: 301 GNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPI 360
Query: 306 VTIHFR-GADVKLSRSNFFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDIEQQ 362
+T F GA+V L N F KV+ ++ C T+ I GN+ NFL+GYD+E +
Sbjct: 361 ITAQFSGGANVNLLPVNTFQKVANNVNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAK 420
Query: 363 TVSFKPTDCT 372
+SFKPT CT
Sbjct: 421 RLSFKPTVCT 430
>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 447
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 121/235 (51%), Positives = 158/235 (67%), Gaps = 7/235 (2%)
Query: 10 ILFFLCFYVVSPIEA-QTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF 68
++FF+ F +S EA GGFS +LI RDSP SPFYN SET + RL+ A RS++R NHF
Sbjct: 15 VIFFIHFSGLSHTEASNKGGFSTDLISRDSPLSPFYNPSETQFDRLQKAFHRSISRANHF 74
Query: 69 NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
N S+ + Q+ +I NN YL+ IS+GTPP +ADTGSDL+W QC+PC CY
Sbjct: 75 RANGV--STNSIQSPVISNNGEYLMNISLGTPPVSMHGIADTGSDLLWRQCKPC--DSCY 130
Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLN-QKSCSGVN-CQYSVSYGDGSFSNGNLATET 186
Q P+FDP S TY+ L C C++L Q CS N C YS SYGDGS ++G+LA +T
Sbjct: 131 EQIEPIFDPAKSKTYQILSCEGKSCSNLGGQGGCSDDNTCIYSYSYGDGSHTSGDLAVDT 190
Query: 187 VTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
+T+GSTTG+ V++P + FGCG NNGG F +G+VGLGGG +S+ISQ+R I G
Sbjct: 191 LTIGSTTGRPVSVPKVVFGCGHNNGGTFELHGSGLVGLGGGPLSMISQLRPLIGG 245
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 62/123 (50%), Positives = 83/123 (67%), Gaps = 1/123 (0%)
Query: 251 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHF 310
+I+IDSGTTLT LPQ + L S + S I +PV DP LCYS S ++P +T HF
Sbjct: 324 NIIIDSGTTLTLLPQDFYGTLESNVVSAIGGKPVRDPNNVFSLCYSNLSGLRIPTITAHF 383
Query: 311 RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
GAD++L N FV+V ED+ C +++ + I+GN+ Q NFLVGYD++ +TVSFKPTD
Sbjct: 384 VGADLELKPLNTFVQVQEDLFCFAMIPVSD-LAIFGNLAQMNFLVGYDLKSRTVSFKPTD 442
Query: 371 CTK 373
CTK
Sbjct: 443 CTK 445
>gi|296085499|emb|CBI29231.3| unnamed protein product [Vitis vinifera]
Length = 308
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 136/339 (40%), Positives = 185/339 (54%), Gaps = 70/339 (20%)
Query: 69 NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
N + ++S Q+++I +YL+ IS+GTPP L +ADTGSDLIW QC PC CY
Sbjct: 7 NTGNQLASPNDIQSNVISGGGSYLMNISLGTPPVSMLGIADTGSDLIWRQCLPC--DDCY 64
Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVT 188
Q PLFDPK S TYK+L G L++ET T
Sbjct: 65 KQVEPLFDPKKSKTYKTL---------------------------------GYLSSETFT 91
Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRL--- 245
+GST G + PG+ FGCG +NGG FN K +G++GLGGG +SL+ Q+ + + G
Sbjct: 92 IGSTEGDPASFPGLAFGCGHSNGGTFNEKDSGLIGLGGGPLSLVMQLSSKVGGQFSYCLV 151
Query: 246 -----------------------GVSTP------DIVIDSGTTLTFLPQGYNSNLLSVMS 276
G S+P +I+IDSGTTLT LP+ + +++ S ++
Sbjct: 152 PLSSDSTASSKINFGKSAVVSGSGTSSPAAAEESNIIIDSGTTLTLLPRDFYTDMESALT 211
Query: 277 SMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFK 336
+I Q DP G+ LCYS ++P +T HF GADV+L N FV+ ED+VC F
Sbjct: 212 KVIGGQTTTDPRGTFSLCYSGVKKLEIPTITAHFIGADVQLPPLNTFVQAQEDLVC--FS 269
Query: 337 GITNS-VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTKQ 374
I +S + I+GN+ Q NFLVGYD++ VSFKPTDCTKQ
Sbjct: 270 MIPSSNLAIFGNLSQMNFLVGYDLKNNKVSFKPTDCTKQ 308
>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 430
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 152/430 (35%), Positives = 199/430 (46%), Gaps = 71/430 (16%)
Query: 8 VFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNH 67
V LFFL ++ GFS++LI R SP SP YNS T + ++ A RS+ R
Sbjct: 5 VLTLFFLVSTMLVDASKSLMGFSIDLIPRHSPISPLYNSQMTQTELVKSAALRSITRSKR 64
Query: 68 FNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
N IS + IP++ YL+R S+GTP ERLA+ DTGSDL W QC PC C
Sbjct: 65 VNFIGQISPPLSPIITPIPDHGEYLMRFSLGTPSVERLAIFDTGSDLSWLQCTPC--KTC 122
Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKSC-SGVNCQYSVSYGDGSFSNGNLAT 184
Y Q++PLFDP SSTY +PC S C NQ+ C S C Y YG SF+ G L
Sbjct: 123 YPQEAPLFDPTQSSTYVDVPCESQPCTLFPQNQRECGSSKQCIYLHQYGTDSFTIGRLGY 182
Query: 185 ETVTLGST-TGQAVA-LPGITFGCGTNNGGLFN--SKTTGIVGLGGGDISLISQM----- 235
+T++ ST GQ A P FGC + F +K G VGLG G +SL SQ+
Sbjct: 183 DTISFSSTGMGQGGATFPKSVFGCAFYSNFTFKISTKANGFVGLGPGPLSLASQLGDQIG 242
Query: 236 ----------RTTIAGNQRLG--------VSTP--------------------------- 250
+T G + G VSTP
Sbjct: 243 HKFSYCMVPFSSTSTGKLKFGSMAPTNEVVSTPFMINPSYPSYYVLNLEGITVGQKKVLT 302
Query: 251 -----DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPE 305
+I+IDS LT L QG ++ +S + I + D E C + PE
Sbjct: 303 GQIGGNIIIDSVPILTHLEQGIYTDFISSVKEAINVEVAEDAPTPFEYCVRNPTNLNFPE 362
Query: 306 VTIHFRGADVKLSRSNFFVKVSEDIVCSVF---KGITNSVPIYGNIMQTNFLVGYDIEQQ 362
HF GADV L N F+ + ++VC KGI+ I+GN Q NF V YD+ ++
Sbjct: 363 FVFHFTGADVVLGPKNMFIALDNNLVCMTVVPSKGIS----IFGNWAQVNFQVEYDLGEK 418
Query: 363 TVSFKPTDCT 372
VSF PT+C+
Sbjct: 419 KVSFAPTNCS 428
>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 440
Score = 213 bits (543), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 151/442 (34%), Positives = 212/442 (47%), Gaps = 83/442 (18%)
Query: 8 VFILFFLCFY-VVSPIEAQT--GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNR 64
VF LC + + S EA GFS+ LIHR+SP SPFYN S TP +R+++ + RS R
Sbjct: 5 VFCFLLLCSHSIASFAEASKTLSGFSINLIHRESPLSPFYNPSLTPSERIKNTVLRSFAR 64
Query: 65 LNH---FNQNSSISSSKASQADIIPNN--ANYLIRISIGTPPTERLAVADTGSDLIWTQC 119
+QN S + IP+ YL+R IGTPP ER A+ADTGSDLIW QC
Sbjct: 65 SKRRLRLSQNDDRSPGTIT----IPDEPITEYLMRFYIGTPPVERFAIADTGSDLIWVQC 120
Query: 120 EPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKSCSGVN--CQYSVSYGDG 175
PC +C Q++PLFDP+ SST+K++PC S C L +Q++C G + C Y YGD
Sbjct: 121 APC--EKCVPQNAPLFDPRKSSTFKTVPCDSQPCTLLPPSQRACVGKSGQCYYQYIYGDH 178
Query: 176 SFSNGNLATETVTLGSTTGQAVALPGITFGCG-TNNGGLFNSK-TTGIVGLGGGDISLIS 233
+ +G L E++ GS A+ P +TFGC +NN + SK G+VGLG G +SLIS
Sbjct: 179 TLVSGILGFESINFGSKN-NAIKFPKLTFGCTFSNNDTVDESKRNMGLVGLGVGPLSLIS 237
Query: 234 QMRTTIA---------------GNQRLG-----------VSTP----------------- 250
Q+ I R G VSTP
Sbjct: 238 QLGYQIGRKFSYCFPPLSSNSTSKMRFGNDAIVKQIKGVVSTPLIIKSIGPSYYYLNLEG 297
Query: 251 -----------------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 293
+I+IDSGT+ T L Q + + ++++ + + V P
Sbjct: 298 VSIGNKKVKTSESQTDGNILIDSGTSFTILKQSFYNKFVALVKEVYGVEAVKIPPLVYNF 357
Query: 294 CYSFN-SLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP-IYGNIMQT 351
C+ + P+V F GA V++ SN F +++C V ++ I+GN Q
Sbjct: 358 CFENKGKRKRFPDVVFLFTGAKVRVDASNLFEAEDNNLLCMVALPTSDEDDSIFGNHAQI 417
Query: 352 NFLVGYDIEQQTVSFKPTDCTK 373
+ V YD++ VSF P DC K
Sbjct: 418 GYQVEYDLQGGMVSFAPADCAK 439
>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 427
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 141/422 (33%), Positives = 202/422 (47%), Gaps = 82/422 (19%)
Query: 20 SPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSIS-SSK 78
+P EA GFS +LIH++SP SPFY S+ + + N+L F Q S K
Sbjct: 21 TPTEAYNKGFSFKLIHKNSPNSPFYKSNN--FHK---------NKLRSFYQVPKKSFVQK 69
Query: 79 ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
+ + NN +YL+++++G+PP + + DTGSDL+W QC PC CY Q SP+F+P
Sbjct: 70 SPYTRVTSNNGDYLMKLTLGSPPVDIYGLVDTGSDLVWAQCTPC--GGCYRQKSPMFEPL 127
Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
S TY +PC S QC+ C YS SY D S + G LA E +T ST G V
Sbjct: 128 RSKTYSPIPCESEQCSFFGYSCSPQKMCAYSYSYADSSVTKGVLAREAITFSSTDGDPVV 187
Query: 199 LPGITFGCGTNN-------------------------GGLFNSK--------------TT 219
+ I FGCG +N G L+ SK T+
Sbjct: 188 VGDIIFGCGHSNSGTFNENDMGIIGMGGGPLSLVSQIGTLYGSKRFSQCLVPFHTDAHTS 247
Query: 220 GIVGLGG-GDISLISQMRTTIAGNQRL--------GVSTPD---------------IVID 255
G + G D+S + T +A + G+S D I+ID
Sbjct: 248 GTINFGEESDVSGEGVVTTPLASEEGQTSYLVTLEGISVGDTFVRFNSSETLSKGNIMID 307
Query: 256 SGTTLTFLPQGYNSNL---LSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRG 312
SGT T++PQ + L L V SS++ + DP +LCY + + P +T HF G
Sbjct: 308 SGTPATYIPQEFYERLVEELKVQSSLLPIE--DDPDLGTQLCYRSETNLEGPILTAHFEG 365
Query: 313 ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
ADV+L F+ + + C G T+ I+GN Q+N L+G+D++++T+SFKPTDCT
Sbjct: 366 ADVQLLPIQTFIPPKDGVFCFAMAGSTDGDYIFGNFAQSNILMGFDLDRKTISFKPTDCT 425
Query: 373 KQ 374
Q
Sbjct: 426 NQ 427
>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 444
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 120/241 (49%), Positives = 155/241 (64%), Gaps = 7/241 (2%)
Query: 4 FLSCVF-ILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSL 62
F+ C I+ + F S EA+ GF+ + I RDSP SPFYN SET YQRL+ A RS+
Sbjct: 8 FVFCTLAIIILIHFSEHSHAEAKIDGFTTDFISRDSPHSPFYNPSETKYQRLQKAFRRSI 67
Query: 63 NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
R NHF + +S Q+D+I YL+ IS+GTPP L +ADTGSDLIW QC PC
Sbjct: 68 LRGNHFR--AMRASPNDIQSDVISGGGAYLMNISLGTPPVPMLGIADTGSDLIWRQCLPC 125
Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK-SCSGVN-CQYSVSYGDGSFSNG 180
P CY Q PLFDPK S TYK+L C + C L Q+ SC N C YS SYGD S++ G
Sbjct: 126 P--NCYEQVEPLFDPKESETYKTLDCDNEFCQDLGQQGSCDDDNTCTYSYSYGDRSYTRG 183
Query: 181 NLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
+L+++T+T+GST G + PGI FGCG +NGG FN K G++GLGGG +SL+ Q+ + +
Sbjct: 184 DLSSDTLTIGSTEGDPASFPGIAFGCGHDNGGTFNEKDGGLIGLGGGPLSLVMQLSSEVG 243
Query: 241 G 241
G
Sbjct: 244 G 244
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 62/129 (48%), Positives = 85/129 (65%), Gaps = 3/129 (2%)
Query: 246 GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPE 305
V +I+IDSGTTLT LPQ + +++ S +++ I Q DP G LCYS + ++P
Sbjct: 318 AVEEGNIIIDSGTTLTLLPQDFYTDVESALTNAIGGQTTTDPNGIFSLCYSSVNNLEIPT 377
Query: 306 VTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQTV 364
+T HF GADV+L N FV+V ED+VC F I +S + I+GN+ Q NFLVGYD++ V
Sbjct: 378 ITAHFTGADVQLPPLNTFVQVQEDLVC--FSMIPSSNLAIFGNLAQINFLVGYDLKNNKV 435
Query: 365 SFKPTDCTK 373
SFK TDCT+
Sbjct: 436 SFKQTDCTE 444
>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 444
Score = 210 bits (535), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 102/212 (48%), Positives = 145/212 (68%), Gaps = 7/212 (3%)
Query: 11 LFFLCFYV-VSPIEA-QTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF 68
+ LC Y+ +S + A GGFSVE+IHRDS +SP+Y +ET +QR+ +AL RS+NR NHF
Sbjct: 12 IVLLCLYINISFLNALDGGGFSVEIIHRDSSRSPYYRPTETQFQRVANALRRSINRANHF 71
Query: 69 NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
N+ + ++S+ +++ +I + YL+ S+GTPP + L + DTGSD+IW QC+PC CY
Sbjct: 72 NKPNLVASTNTAESTVIASQGEYLMSYSVGTPPFQILGIVDTGSDIIWLQCQPC--EDCY 129
Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASL-NQKSCSGVN--CQYSVSYGDGSFSNGNLATE 185
Q +P+FDP S TYK+LPCSS+ C S+ + SCS N C+Y+++YGD S S G+L+ E
Sbjct: 130 NQTTPIFDPSQSKTYKTLPCSSNICQSVQSAASCSSNNDECEYTITYGDNSHSQGDLSVE 189
Query: 186 TVTLGSTTGQAVALPGITFGCGTNNGGLFNSK 217
T+TLGST G +V P GCG NN G F +
Sbjct: 190 TLTLGSTDGSSVQFPKTVIGCGHNNKGTFQRE 221
Score = 120 bits (302), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 62/126 (49%), Positives = 86/126 (68%), Gaps = 3/126 (2%)
Query: 251 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTI 308
+I+IDSGTTLT LP+ NL S ++ IE + V DP+ L LCY S + VP +T
Sbjct: 320 NIIIDSGTTLTILPEDDYLNLESAVADAIELERVEDPSKFLRLCYRTTSSDELNVPVITA 379
Query: 309 HFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
HF+GADV+L+ + F++V E +VC F+ + PI+GN+ Q N LVGYD+ +QTVSFKP
Sbjct: 380 HFKGADVELNPISTFIEVDEGVVCFAFRS-SKIGPIFGNLAQQNLLVGYDLVKQTVSFKP 438
Query: 369 TDCTKQ 374
TDCT++
Sbjct: 439 TDCTQE 444
>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 418
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 146/441 (33%), Positives = 217/441 (49%), Gaps = 92/441 (20%)
Query: 1 MATFLSCVF--ILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDAL 58
MA +S F ILF + F + I G F+ L HRDS SP SS + Y RL +A
Sbjct: 1 MAATISLFFHLILFLISFSQTTIINGDNG-FTTSLFHRDSLLSPLEFSSLSHYDRLANAF 59
Query: 59 TRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
RSL+R ++ S + Q+ II GTPP + L +ADTGSDL W Q
Sbjct: 60 RRSLSRSAALLNRAATSGAVGLQSSII------------GTPPVDYLGIADTGSDLTWAQ 107
Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGS 176
C PC +CY Q P+F+P S+++ +PC++ C +++ C GV C YS +YGD +
Sbjct: 108 CLPCL--KCYQQLRPIFNPLKSTSFSHVPCNTQTCHAVDDGHC-GVQGVCDYSYTYGDRT 164
Query: 177 FSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
+S G+L E +T+GS++ ++V GCG + G F +G++GLGGG +SL+SQM
Sbjct: 165 YSKGDLGFEKITIGSSSVKSV------IGCGHASSGGFGF-ASGVIGLGGGQLSLVSQMS 217
Query: 237 TTIAGNQRLG---------------------------VSTP------------------- 250
T ++R VSTP
Sbjct: 218 QTSGISRRFSYCLPTLLSHANGKINFGQNAVVSGPGVVSTPLISKNTVTYYYITLEAISI 277
Query: 251 ------------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY--- 295
+++IDSGTTL+FLP+ ++S + +++A+ V DP +LC+
Sbjct: 278 GNERHMAFAKQGNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDG 337
Query: 296 -SFNSLSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVF--KGITNSVPIYGNIMQT 351
+ + S +P +T F GA+V L N F KV+ ++ C T+ I GN+
Sbjct: 338 INVATSSGIPIITAQFSGGANVNLLPVNTFQKVANNVNCLTLTPASPTDEFGIIGNLALA 397
Query: 352 NFLVGYDIEQQTVSFKPTDCT 372
NFL+GYD+E + +SFKPT CT
Sbjct: 398 NFLIGYDLEAKRLSFKPTVCT 418
>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 142/438 (32%), Positives = 200/438 (45%), Gaps = 106/438 (24%)
Query: 4 FLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLN 63
+L+ +F+LF + +S IEAQ GF+++L + S N
Sbjct: 18 YLAIIFLLFHVLH--LSSIEAQNDGFTIKLFRKTS------------------------N 51
Query: 64 RLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP 123
+ + QA I +L+ I IGTPP + + DTGSDLIW QC PC
Sbjct: 52 NIQNI-----------VQAPINAYIGQHLMEIYIGTPPIKITGLVDTGSDLIWIQCAPC- 99
Query: 124 PSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNL 182
CY Q P+FDP SSTY ++ C S C L+ CS C Y+ YGD S + G L
Sbjct: 100 -LGCYKQIKPMFDPLKSSTYNNISCDSPLCHKLDTGVCSPEKRCNYTYGYGDNSLTKGVL 158
Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGN 242
A +T T S TG+ V+L FGCG NN G FN G++GLGGG SLISQ+ G
Sbjct: 159 AQDTATFTSNTGKPVSLSRFLFGCGHNNTGGFNDHEMGLIGLGGGPTSLISQIGPLFGGK 218
Query: 243 Q------------------------------------------------RLGVSTPD--- 251
+ LG+S D
Sbjct: 219 KFSQCLVPFLTDIKISSRMSFGKGSQVLGNGVVTTPLVPREKDTSYFVTLLGISVEDTYF 278
Query: 252 ----------IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA-DPTGSLELCYSFNSL 300
+++DSGT LPQ + + + + + +P+ DP+ +LCY +
Sbjct: 279 PMNSTIGKANMLVDSGTPPILLPQQLYDKVFAEVRNKVALKPITDDPSLGTQLCYRTQTN 338
Query: 301 SQVPEVTIHFRGADVKLSRSNFFV---KVSEDIVCSVFKGITNSVP-IYGNIMQTNFLVG 356
+ P +T HF GA+V L+ F+ ++ I C TNS P +YGN Q+N+L+G
Sbjct: 339 LKGPTLTFHFVGANVLLTPIQTFIPPTPQTKGIFCLAIYNRTNSDPGVYGNFAQSNYLIG 398
Query: 357 YDIEQQTVSFKPTDCTKQ 374
+D+++Q VSFKPTDCTKQ
Sbjct: 399 FDLDRQVVSFKPTDCTKQ 416
>gi|225427556|ref|XP_002266575.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 445
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 116/241 (48%), Positives = 157/241 (65%), Gaps = 7/241 (2%)
Query: 4 FLSCVFILFFLCFYVV-SPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSL 62
F+ C+ + FL ++ S EA+ GF+ + I RDSP+SPFYN SET YQRL+ A RS+
Sbjct: 8 FVFCLLAIIFLIYFAKHSQAEAKVDGFTTDFISRDSPRSPFYNPSETKYQRLQKAFRRSI 67
Query: 63 NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
R NHF + +S Q+++I +YL+ IS+GTPP L +ADTGSDLIW QC PC
Sbjct: 68 LRGNHFR--AIRASPNDIQSNVISGGGSYLMNISLGTPPVSMLGIADTGSDLIWRQCLPC 125
Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK-SCSGVN-CQYSVSYGDGSFSNG 180
CY Q PLFDPK S TYK+L C++ C L Q+ SC N C S SYGD S++
Sbjct: 126 --DDCYKQVEPLFDPKKSKTYKTLGCNNDFCQDLGQQGSCGDDNTCTSSYSYGDQSYTRR 183
Query: 181 NLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
+L++ET T+GST G + PG+ FGCG +NGG FN K +G++GLGGG +SL+ Q+ + +
Sbjct: 184 DLSSETFTIGSTEGDPASFPGLAFGCGHSNGGTFNEKDSGLIGLGGGPLSLVMQLSSKVG 243
Query: 241 G 241
G
Sbjct: 244 G 244
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 62/125 (49%), Positives = 85/125 (68%), Gaps = 3/125 (2%)
Query: 251 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHF 310
+I+IDSGTTLT LP+ + +++ S ++ +I Q DP G+ LCYS ++P +T HF
Sbjct: 323 NIIIDSGTTLTLLPRDFYTDMESALTKVIGGQTTTDPRGTFSLCYSGVKKLEIPTITAHF 382
Query: 311 RGADVKLSRSNFFVKVSEDIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQTVSFKPT 369
GADV+L N FV+ ED+VC F I +S + I+GN+ Q NFLVGYD++ VSFKPT
Sbjct: 383 IGADVQLPPLNTFVQAQEDLVC--FSMIPSSNLAIFGNLSQMNFLVGYDLKNNKVSFKPT 440
Query: 370 DCTKQ 374
DCTKQ
Sbjct: 441 DCTKQ 445
>gi|356546376|ref|XP_003541602.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 450
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 95/190 (50%), Positives = 130/190 (68%), Gaps = 5/190 (2%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
GFSVE+IHRDS +SP Y +ETP+QR+ +A+ RS+NR NHFN+ S ++S+ +++ + +
Sbjct: 34 GFSVEMIHRDSSRSPLYRHTETPFQRVANAMRRSINRANHFNKKSFVASTNTAESTVKAS 93
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
YL+ S+GTPP E L V DTGS + W QC+ C CY Q +P+FDP S TYK+LP
Sbjct: 94 QGEYLMSYSVGTPPFEILGVVDTGSGITWMQCQRC--EDCYEQTTPIFDPSKSKTYKTLP 151
Query: 148 CSSSQCAS-LNQKSCSG--VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
CSS+ C S ++ SCS + C+Y++ YGDGS S G+L+ ET+TLGST G +V P
Sbjct: 152 CSSNMCQSVISTPSCSSDKIGCKYTIKYGDGSHSQGDLSVETLTLGSTNGSSVQFPNTVI 211
Query: 205 GCGTNNGGLF 214
GCG NN G F
Sbjct: 212 GCGHNNKGTF 221
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 65/125 (52%), Positives = 86/125 (68%), Gaps = 3/125 (2%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIH 309
I+IDSGTTLT LPQ SNL S ++ I+A V+DP+ L LCY Q VP +T H
Sbjct: 327 IIIDSGTTLTLLPQEDYSNLESAVADAIQANRVSDPSNFLSLCYQTTPSGQLDVPVITAH 386
Query: 310 FRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 369
F+GADV+L+ + FV+V+E +VC F + V I+GN+ Q N LVGYD+ +QTVSFKPT
Sbjct: 387 FKGADVELNPISTFVQVAEGVVCFAFHS-SEVVSIFGNLAQLNLLVGYDLMEQTVSFKPT 445
Query: 370 DCTKQ 374
DCT++
Sbjct: 446 DCTQE 450
>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 396
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 137/417 (32%), Positives = 194/417 (46%), Gaps = 89/417 (21%)
Query: 24 AQTGGFSVELIHRDSPK-SPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQA 82
A GF+++LI +SP SPFY S E RL S
Sbjct: 3 ADNSGFTIQLIRHNSPNYSPFYKSDELHMHRL---------------------GSNGVFT 41
Query: 83 DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
+ NN +YL+++++GTPP + + DTGSDL+W QC PC CY Q SP+F+P S+T
Sbjct: 42 RVTSNNGDYLMKLTLGTPPVDVYGLVDTGSDLVWAQCTPC--QGCYRQKSPMFEPLRSNT 99
Query: 143 YKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
Y +PC S +C SL SCS C YS +Y D S + G LA ETVT ST G+ V +
Sbjct: 100 YTPIPCDSEECNSLFGHSCSPQKLCAYSYAYADSSVTKGVLARETVTFSSTDGEPVVVGD 159
Query: 202 ITFGCGTNN-------------------------GGLFNSK--------------TTGIV 222
I FGCG +N G L+ SK T G +
Sbjct: 160 IVFGCGHSNSGTFNENDMGIIGLGGGPLSLVSQFGNLYGSKRFSQCLVPFHADPHTLGTI 219
Query: 223 GLG-GGDIS--------LISQMRTT---------------IAGNQRLGVSTPDIVIDSGT 258
G D+S L+S+ T ++ N +S +I+IDSGT
Sbjct: 220 SFGDASDVSGEGVAATPLVSEEGQTPYLVTLEGISVGDTFVSFNSSEMLSKGNIMIDSGT 279
Query: 259 TLTFLPQGYNSNLLSVMSSMIEAQPVAD-PTGSLELCYSFNSLSQVPEVTIHFRGADVKL 317
T+LPQ + L+ + P+ D P +LCY + + P + HF GADV+L
Sbjct: 280 PATYLPQEFYDRLVKELKVQSNMLPIDDDPDLGTQLCYRSETNLEGPILIAHFEGADVQL 339
Query: 318 SRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTKQ 374
F+ + + C G T+ I+GN Q+N L+G+D++++TVSFK TDC+ Q
Sbjct: 340 MPIQTFIPPKDGVFCFAMAGTTDGEYIFGNFAQSNVLIGFDLDRKTVSFKATDCSNQ 396
>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
Length = 443
Score = 204 bits (518), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 115/244 (47%), Positives = 157/244 (64%), Gaps = 14/244 (5%)
Query: 9 FILFFLCFYVVSPI------EAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSL 62
F+ +CF +SP + GFS+ LIHRDSP SP YN + T + RLR+A +RS+
Sbjct: 8 FVTIVICFISLSPFPLLGAAASPDPGFSLNLIHRDSPLSPLYNPNHTDFDRLRNAFSRSI 67
Query: 63 NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
+R+N F + +S Q D++PN Y +++SIGTP E + +ADTGSDL W QC PC
Sbjct: 68 SRVNVFKTKAVDINS--FQNDLVPNGGEYFMKMSIGTPLVEVIVIADTGSDLTWVQCLPC 125
Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN--QKSCSGVN--CQYSVSYGDGSFS 178
P CY Q SPLFDP SS+Y+ + C S C +L+ +++C+ C+Y SYGD S++
Sbjct: 126 DP--CYRQKSPLFDPSRSSSYRHMLCGSRFCNALDVSEQACTMDTNICEYHYSYGDKSYT 183
Query: 179 NGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
NGNLATE T+GST+ + V L I FGCGT NGG F+ +GIVGLGGG +SL+SQ+ +
Sbjct: 184 NGNLATEKFTIGSTSSRPVHLSPIVFGCGTGNGGTFDELGSGIVGLGGGALSLVSQLSSI 243
Query: 239 IAGN 242
I G
Sbjct: 244 IKGK 247
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 63/145 (43%), Positives = 89/145 (61%), Gaps = 12/145 (8%)
Query: 240 AGNQRL---------GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS 290
GN+RL V +++IDSGTTLTFL + + L V+ ++A+ V+DP G
Sbjct: 301 VGNKRLPYTNGLLNGNVEKGNVIIDSGTTLTFLDSEFFTELERVLEETVKAERVSDPRGL 360
Query: 291 LELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGI-TNSVPIYGNIM 349
+C+ +P + +HF ADVKL N FVK ED++C F I +N + I+GN+
Sbjct: 361 FSVCFRSAGDIDLPVIAVHFNDADVKLQPLNTFVKADEDLLC--FTMISSNQIGIFGNLA 418
Query: 350 QTNFLVGYDIEQQTVSFKPTDCTKQ 374
Q +FLVGYD+E++TVSFKPTDCTK
Sbjct: 419 QMDFLVGYDLEKRTVSFKPTDCTKH 443
>gi|255566006|ref|XP_002523991.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536718|gb|EEF38359.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 455
Score = 204 bits (518), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 148/449 (32%), Positives = 208/449 (46%), Gaps = 84/449 (18%)
Query: 3 TFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSL 62
+F S + IL + + I+A F+ ELIH DSP SPF+N+SET RL AL RS
Sbjct: 12 SFTSLIIILSTVFLSSFAIIQADKFSFTAELIHIDSPNSPFFNASETTTHRLAKALQRSA 71
Query: 63 NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
NR+ N S+ S + A I + NYL+++ IGTPPTE A DTGS++IW C C
Sbjct: 72 NRVARLNPLSN--SDEGVHASIFSGDGNYLMKLLIGTPPTEIHAAIDTGSNVIWIPCINC 129
Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDG-SFSNGN 181
C+ Q S +F+P SSTY+ PC S QC + + S C YS + NG
Sbjct: 130 --KDCFNQSSSIFNPLASSTYQDAPCDSYQCETTSSSCQSDNVCLYSCDEKHQLNCPNGR 187
Query: 182 LATETVTLGSTTGQAVALPGITFGC--------------GTNNGGLFNSKTTGIVGLGGG 227
+A +T+TL S+ G+ LP F C G G L S T+ + L G
Sbjct: 188 IAVDTMTLTSSDGRPFPLPYSDFVCGNSIYKTFAGVGVIGLGRGAL--SLTSKLYHLSDG 245
Query: 228 DISL-------------------------ISQMRTTIAGNQRLG---------------- 246
S + + TT+ ++ G
Sbjct: 246 KFSYCLADYYSKQPSKINFGLQSFISDDDLEVVSTTLGHHRHSGNYYVTLEGISVGEKRQ 305
Query: 247 ----VSTP------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS------ 290
V P +++IDSGT T LP+ + L S +S I P P S
Sbjct: 306 DLYYVDDPFAPPVGNMLIDSGTMFTLLPKDFYDYLWSTVSYAIPENPQNHPHNSRFPFSM 365
Query: 291 -----LELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGIT-NSVPI 344
L C+ + + P++TIHF ADV+LS N F++V+ED+VC F +
Sbjct: 366 DNTLKLSPCFWYYPELKFPKITIHFTDADVELSDDNSFIRVAEDVVCFAFAATQPGQSTV 425
Query: 345 YGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
YG+ Q NF++GYD+++ TVSFK TDC+K
Sbjct: 426 YGSWQQMNFILGYDLKRGTVSFKRTDCSK 454
>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 413
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 142/421 (33%), Positives = 191/421 (45%), Gaps = 101/421 (23%)
Query: 22 IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQ 81
IEAQ GF+V+LI + S H + N+ Q
Sbjct: 26 IEAQNDGFTVKLIRKSS----------------------------HLSSNNI---QDIVQ 54
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
A I YL+ + IGTPP + DTGSDLIW QC PC CY Q +P+FDP SS
Sbjct: 55 APINAYIGQYLMELYIGTPPIKISGTVDTGSDLIWVQCVPC--LGCYNQINPMFDPLKSS 112
Query: 142 TYKSLPCSSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
TY ++ C S C CS C Y+ Y D S + G LA ETVTL S TG+ ++L
Sbjct: 113 TYTNISCDSPLCYKPYIGECSPEKRCDYTYGYADSSLTKGVLAQETVTLTSNTGKPISLQ 172
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGN------------------ 242
GI FGCG NN G FN G++GLGGG SL+SQ+ G
Sbjct: 173 GILFGCGHNNTGNFNDHEMGLIGLGGGPTSLVSQIGPLFGGKKFSQCLVPFLTDITISSQ 232
Query: 243 --------------------QR-----------LGVSTPD-------------IVIDSGT 258
QR LG+S D +++DSGT
Sbjct: 233 MSFGKGSEVLGEGVVTTPLVQREQDMTSYYVTLLGISVEDTYLPMNSTIEKGNMLVDSGT 292
Query: 259 TLTFLPQGYNSNLLSVMSSMIEAQPVA-DPTGSLELCYSFNSLSQVPEVTIHFRGADVKL 317
LPQ + + + + +P+ DP+ +LCY + + P +T HF GA++ L
Sbjct: 293 PPNILPQQLYDRVYVEVKNKVPLEPITDDPSLGPQLCYRTQTNLKGPTLTYHFEGANLLL 352
Query: 318 SRSNFFVKVSED---IVCSVFKGITNSVP-IYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
+ F+ + + + C NS P IYGN QTN+L+G+D+++Q VSFKPTDCTK
Sbjct: 353 TPIQTFIPPTPETKGVFCLAITNCANSDPGIYGNFAQTNYLIGFDLDRQIVSFKPTDCTK 412
Query: 374 Q 374
Q
Sbjct: 413 Q 413
>gi|255553149|ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223543249|gb|EEF44781.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 449
Score = 201 bits (511), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 116/244 (47%), Positives = 155/244 (63%), Gaps = 13/244 (5%)
Query: 8 VFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNH 67
+FI F S +EA+ GFS LIHRDS SP YN +T + RLR++ RS++R N
Sbjct: 11 LFIAFISMVSAFSLVEARNAGFSANLIHRDSSVSPLYNPRDTYFDRLRNSFHRSISRANR 70
Query: 68 FNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
F NS IS+ Q+DI+P YL+RISIG P E LA+ADTGSDLIW QC+PC C
Sbjct: 71 FKPNS-ISARALVQSDIVPGGGEYLMRISIGNPQVEILAIADTGSDLIWVQCQPC--EMC 127
Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLN--QKSCSG----VNCQYSVSYGDGSFSNGN 181
Y Q+SP+FDP+ SS+Y+++ C + C L+ +SC C Y+ SYGD SFS+G+
Sbjct: 128 YKQNSPIFDPRRSSSYRNVLCGNEFCNKLDGEARSCDARGFVKTCGYTYSYGDQSFSDGH 187
Query: 182 LATETVTLGST---TGQAVA-LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRT 237
LA E +GST T A+A + FGCGT NGG F+ +GI+GLGGG +SL+SQ+
Sbjct: 188 LAIERFGIGSTNSNTSAAIAYFQEVAFGCGTKNGGTFDELGSGIIGLGGGSMSLVSQLGP 247
Query: 238 TIAG 241
++G
Sbjct: 248 KLSG 251
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 61/129 (47%), Positives = 85/129 (65%), Gaps = 3/129 (2%)
Query: 247 VSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEV 306
V +I+IDSGTTLTFL + +NL S + ++ + V+DP G +C+ ++P +
Sbjct: 323 VEKGNIIIDSGTTLTFLDSEFFNNLDSAVEEAVKGERVSDPHGLFNICFKDEKAIELPII 382
Query: 307 TIHFRGADVKLSRSNFFVKVSEDIVCSVFKGI-TNSVPIYGNIMQTNFLVGYDIEQQTVS 365
T HF GADV+L N F KV ED++C F I +N + I+GN+ Q NFLVGYD+E++ VS
Sbjct: 383 TAHFTGADVELQPVNTFAKVEEDLLC--FTMIPSNDIAIFGNLAQMNFLVGYDLEKKAVS 440
Query: 366 FKPTDCTKQ 374
F PTDCTKQ
Sbjct: 441 FLPTDCTKQ 449
>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags:
Precursor
gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 447
Score = 199 bits (505), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 113/237 (47%), Positives = 149/237 (62%), Gaps = 9/237 (3%)
Query: 8 VFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNH 67
+ + FFL F V FSVELIHRDSP SP YN T RL A RS++R
Sbjct: 5 ILLCFFLFFSVTLSSSGHPKNFSVELIHRDSPLSPIYNPQITVTDRLNAAFLRSVSRSRR 64
Query: 68 FNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
FN S + Q+ +I + + + I+IGTPP + A+ADTGSDL W QC+PC QC
Sbjct: 65 FNHQLSQTDL---QSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPC--QQC 119
Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLN--QKSCSGVN--CQYSVSYGDGSFSNGNLA 183
Y ++ P+FD K SSTYKS PC S C +L+ ++ C N C+Y SYGD SFS G++A
Sbjct: 120 YKENGPIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVA 179
Query: 184 TETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
TETV++ S +G V+ PG FGCG NNGG F+ +GI+GLGGG +SLISQ+ ++I+
Sbjct: 180 TETVSIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSIS 236
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 63/130 (48%), Positives = 84/130 (64%), Gaps = 3/130 (2%)
Query: 245 LGVSTPDIVIDSGTTLTFLPQGYNSNLLS-VMSSMIEAQPVADPTGSLELCYSFNSLS-Q 302
L ++ +I+IDSGTTLT L G+ S V S+ A+ V+DP G L C+ S
Sbjct: 316 LSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSDPQGLLSHCFKSGSAEIG 375
Query: 303 VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 362
+PE+T+HF GADV+LS N FVK+SED+VC + T V IYGN Q +FLVGYD+E +
Sbjct: 376 LPEITVHFTGADVRLSPINAFVKLSEDMVC-LSMVPTTEVAIYGNFAQMDFLVGYDLETR 434
Query: 363 TVSFKPTDCT 372
TVSF+ DC+
Sbjct: 435 TVSFQHMDCS 444
>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 439
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 141/426 (33%), Positives = 210/426 (49%), Gaps = 73/426 (17%)
Query: 18 VVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSS 77
VV+PIE+Q GFSVELIH DS +SPFYN ET QR+ + +T S+ R ++ N S+S +
Sbjct: 16 VVTPIESQNRGFSVELIHPDSSRSPFYNIRETQLQRISNVVTHSIKRAHYLNHVFSLSHN 75
Query: 78 KASQADIIPNNAN-YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
+ IIP + Y++ SIGTPP + V DTGSD IW QC+PC P C Q SP+F+
Sbjct: 76 DLPKPTIIPYAGSYYVMSYSIGTPPFQLYGVVDTGSDGIWFQCKPCKP--CLNQTSPIFN 133
Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSG---VNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
P SSTYK++ CSS C + CS C+Y ++Y D S S G+++ +T+TL S
Sbjct: 134 PSKSSTYKNIRCSSPICKRGEKTRCSSNRKRKCEYEITYLDRSGSQGDISKDTLTLNSND 193
Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGN---------QR 244
G ++ P I GCG N +GI+G G G+ S++SQ+ ++I G +
Sbjct: 194 GSPISFPKIVIGCGHKNSLTTEGLASGIIGFGRGNFSIVSQLGSSIGGKFSYCLASLFSK 253
Query: 245 LGVSTP----DIVIDSGTTL-------TFLPQGYNSNL--LSVMSSMIEAQP---VADPT 288
+S+ D+ + SG + +F Y +NL SV +I+ + + D
Sbjct: 254 ANISSKLYFGDMAVVSGHGVVSTPLIQSFYVGNYFTNLEAFSVGDHIIKLKDSSLIPDNE 313
Query: 289 GSLELCYSFNSLSQVP-EVTIHFRGA---DVKLSR------------SNFFVKVSEDIVC 332
G+ + S ++++Q+P +V A VKL R K I+
Sbjct: 314 GN-AVIDSGSTITQLPNDVYSQLETAVISMVKLKRVKDPTQQLSLCYKTTLKKYEVPIIT 372
Query: 333 SVFKGITNSVPIYGNIMQTN-------------------------FLVGYDIEQQTVSFK 367
+ F+G + + +Q N FLVGYD + +SFK
Sbjct: 373 AHFRGADVKLNAFNTFIQMNHEVMCFAFNSSAFPWVVYGNIAQQNFLVGYDTLKNIISFK 432
Query: 368 PTDCTK 373
PT+CTK
Sbjct: 433 PTNCTK 438
>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
Length = 457
Score = 191 bits (484), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 146/432 (33%), Positives = 204/432 (47%), Gaps = 89/432 (20%)
Query: 24 AQTGGFSVELIHRDSPKSPFYNSSETPYQRLR---------DALTRSLNRLNHFNQNSSI 74
A GGFSV+ IHRDS +SP+ + + +P+ R + L RS + + S
Sbjct: 28 AGEGGFSVDFIHRDSARSPYRHPALSPHARALAAARRSLRGEVLGRSYSGASPAAAPVSA 87
Query: 75 SSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP- 133
+ ++ II + YL+ +++GTPPT+ LA+ADTGSDL+W C S + D+
Sbjct: 88 ADGGV-ESKIITRSFEYLMYVNVGTPPTQLLAIADTGSDLVWVNCS---SSGGGLADADA 143
Query: 134 ----LFDPKMSSTYKSLPCSSSQCASLNQKSCSG-VNCQYSVSYGDGSFSNGNLATETVT 188
+F P SSTY L C S+ C +L+Q SC CQY SYGDGS + G L+TET +
Sbjct: 144 GGNVVFQPTRSSTYSQLSCQSNACQALSQASCDADSECQYQYSYGDGSRTIGVLSTETFS 203
Query: 189 L--GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRL- 245
G GQ V +P + FGC T + G F S G+VGLG G SL+SQ+ T +++L
Sbjct: 204 FVDGGGKGQ-VRVPRVNFGCSTASAGTFRSD--GLVGLGAGAFSLVSQLGATTHIDRKLS 260
Query: 246 ---------------------------GVSTPDIVID----------------------- 255
STP + D
Sbjct: 261 YCLIPSYDANSSSTLNFGSRAVVSEPGAASTPLVPSDVDSYYTVALESVAVGGQEVATHD 320
Query: 256 ------SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ-----VP 304
SGTTLTFL L++ + I+ Q V P L+LCY S+ +P
Sbjct: 321 SRIIVDSGTTLTFLDPALLGPLVTELERRIKLQRVQPPEQLLQLCYDVQGKSETDNFGIP 380
Query: 305 EVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQ 361
+VT+ F GA V L N F + E +C V ++ S P I GNI Q NF VGYD++
Sbjct: 381 DVTLRFGGGAAVTLRPENTFSLLQEGTLCLVLVPVSESQPVSILGNIAQQNFHVGYDLDA 440
Query: 362 QTVSFKPTDCTK 373
+TV+F DC +
Sbjct: 441 RTVTFAAADCAR 452
>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
Length = 456
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 144/428 (33%), Positives = 199/428 (46%), Gaps = 81/428 (18%)
Query: 25 QTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLN--RLNHFNQNSSISSSKASQA 82
+ GGFSV+ IHRDS +SPF S P+ R A RSL L + +S + +A
Sbjct: 26 EAGGFSVDFIHRDSARSPFAQPSLPPHARALAAARRSLRGAALGRYVGGASPAPGPVPEA 85
Query: 83 D------IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
D II + YL+ +++GTPP + LA+ADTGSDL+W C + +F
Sbjct: 86 DGGVESKIITRSFEYLMYVNVGTPPAQMLAIADTGSDLVWVNCSSNGGGGGASDGAVVFH 145
Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSG-VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
P S+TY L C S+ C +L+Q SC CQY +YGDGS + G L+TET + + G
Sbjct: 146 PSRSTTYSLLSCQSAACQALSQASCDADSECQYQYAYGDGSRTIGVLSTETFSFAAAGGG 205
Query: 196 A---VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM----------------- 235
V +P ++FGC T + G F S G+VGLG G +SL+SQ+
Sbjct: 206 GEGQVRVPRVSFGCSTGSAGSFRSD--GLVGLGAGALSLVSQLGAAARIARRFSYCLVPP 263
Query: 236 ------RTTIAGNQRLGVSTPDIV--------IDS------------------------- 256
+T++ R VS P +DS
Sbjct: 264 YAAANSSSTLSFGARAVVSDPGAASTPLVPSEVDSYYTVALESVAVAGQDVASANSSRII 323
Query: 257 ---GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ-----VPEVTI 308
GTTLTFL L++ + I P L+LCY SQ +P+VT+
Sbjct: 324 VDSGTTLTFLDPALLRPLVAELERRIRLPRAQPPEQLLQLCYDVQGKSQAEDFGIPDVTL 383
Query: 309 HF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTVS 365
F GA V L N F + E +C V ++ S P I GNI Q NF VGYD++ +TV+
Sbjct: 384 RFGGGASVTLRPENTFSLLEEGTLCLVLVPVSESQPVSILGNIAQQNFHVGYDLDARTVT 443
Query: 366 FKPTDCTK 373
F DCT+
Sbjct: 444 FAAVDCTR 451
>gi|15222357|ref|NP_174430.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322538|gb|AAG51267.1|AC027135_8 chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|67633408|gb|AAY78629.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332193236|gb|AEE31357.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 445
Score = 187 bits (476), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 112/242 (46%), Positives = 150/242 (61%), Gaps = 14/242 (5%)
Query: 3 TFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSL 62
TFL C L + F+ S A +VELIHRDSP SP YN T RL A RS+
Sbjct: 5 TFLYCS--LLAISFFFASNSSANRENLTVELIHRDSPHSPLYNPHHTVSDRLNAAFLRSI 62
Query: 63 NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
+R F + + Q+ +I N Y + ISIGTPP++ A+ADTGSDL W QC+PC
Sbjct: 63 SRSRRFTTKTDL------QSGLISNGGEYFMSISIGTPPSKVFAIADTGSDLTWVQCKPC 116
Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKSCSGVN--CQYSVSYGDGSFS 178
QCY Q+SPLFD K SSTYK+ C S C +L +++ C C+Y SYGD SF+
Sbjct: 117 --QQCYKQNSPLFDKKKSSTYKTESCDSKTCQALSEHEEGCDESKDICKYRYSYGDNSFT 174
Query: 179 NGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
G++ATET+++ S++G +V+ PG FGCG NNGG F +GI+GLGGG +SL+SQ+ ++
Sbjct: 175 KGDVATETISIDSSSGSSVSFPGTVFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSS 234
Query: 239 IA 240
I
Sbjct: 235 IG 236
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 61/133 (45%), Positives = 85/133 (63%), Gaps = 3/133 (2%)
Query: 242 NQRLGVSTPDIVIDSGTTLTFLPQGYNSNL-LSVMSSMIEAQPVADPTGSLELCY-SFNS 299
N + T +I+IDSGTTLT L G+ + +V S+ A+ V+DP G L C+ S +
Sbjct: 311 NGKSSKRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAKRVSDPQGLLTHCFKSGDK 370
Query: 300 LSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 359
+P +T+HF ADVKLS N FVK++ED VC + T V IYGN++Q +FLVGYD+
Sbjct: 371 EIGLPAITMHFTNADVKLSPINAFVKLNEDTVC-LSMIPTTEVAIYGNMVQMDFLVGYDL 429
Query: 360 EQQTVSFKPTDCT 372
E +TVSF+ DC+
Sbjct: 430 ETKTVSFQRMDCS 442
>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 447
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 105/216 (48%), Positives = 139/216 (64%), Gaps = 9/216 (4%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNN 88
SVELIHRDSP SP YN T RL A RS++R N I S Q+ +I +
Sbjct: 26 LSVELIHRDSPLSPLYNPKNTVTDRLNAAFLRSISRSRRLNN---ILSQTDLQSGLIGAD 82
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
+ + I+IGTPP + A+ADTGSDL W QC+PC QCY ++ P+FD K SSTYKS PC
Sbjct: 83 GEFFMSITIGTPPMKVFAIADTGSDLTWVQCKPC--QQCYKENGPIFDKKKSSTYKSEPC 140
Query: 149 SSSQCASLN--QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
S C +L+ ++ C C+Y SYGD SFS G++ATET+++ S +G V+ PG F
Sbjct: 141 DSRNCHALSSSERGCDESKNVCKYRYSYGDQSFSKGDVATETISIDSASGSPVSFPGTVF 200
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
GCG NNGG F+ +GI+GLGGG +SLISQ+ ++I+
Sbjct: 201 GCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSIS 236
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 60/124 (48%), Positives = 81/124 (65%), Gaps = 3/124 (2%)
Query: 251 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIE-AQPVADPTGSLELCYSFNSLS-QVPEVTI 308
+I+IDSGTTLT L G+ + + ++ A+ V+DP G L C+ S +PE+T+
Sbjct: 322 NIIIDSGTTLTLLDSGFFDKFGAAVEELVTGAKRVSDPQGLLSHCFKSGSAEIGLPEITV 381
Query: 309 HFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
HF GADV+LS N FVKVSED+VC + T V IYGN Q +FLVGYD+E +TVSF+
Sbjct: 382 HFTGADVRLSPINAFVKVSEDMVC-LSMVPTTEVAIYGNFAQMDFLVGYDLETRTVSFQR 440
Query: 369 TDCT 372
DC+
Sbjct: 441 MDCS 444
>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 711
Score = 187 bits (474), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 133/399 (33%), Positives = 196/399 (49%), Gaps = 84/399 (21%)
Query: 1 MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
+AT + +F L + +++ + + GF+++LIHR S S
Sbjct: 3 LATTMIAIF-LQIITYFLFTTTASSPHGFTIDLIHRRSNAS------------------- 42
Query: 61 SLNRLNHFNQNSSISSSKASQ--ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
+S +S+++A AD + + YL+++ IGTPP E AV DTGS+LIWTQ
Sbjct: 43 ----------SSRVSNTQAGSPYADTVFDTYEYLMKLQIGTPPFEVEAVLDTGSELIWTQ 92
Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFS 178
C PC CY Q +P+FDP SST+K C++ + C Y + Y D S++
Sbjct: 93 CLPC--LHCYDQKAPIFDPSKSSTFKETRCNTPDHS-----------CPYKLVYDDKSYT 139
Query: 179 NGNLATETVTLGSTTGQAVALPGITFGCGTNNGGL-FNSKTTGIVGLGGGDISLISQM-- 235
G LATETVT+ ST+G +P GC NN G F ++GIVGL G +SLISQM
Sbjct: 140 QGTLATETVTIHSTSGVPFVMPETIIGCSRNNSGSGFRPSSSGIVGLSRGSLSLISQMGG 199
Query: 236 ---------RTTIAGNQRLG------------------VSTP------DIVIDSGTTLTF 262
T A + G V TP +IVIDSGT LT+
Sbjct: 200 AYPGDGVVSTTMFAKTAKRGQYYLNLDAVSVGDTRIETVGTPFHALNGNIVIDSGTPLTY 259
Query: 263 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSN 321
P Y + + + ++ A V DP+ + LCY N++ P +T+HF GAD+ L + N
Sbjct: 260 FPVSYCNLVRKAVERVVTADRVVDPSRNDMLCYYSNTIEIFPVITVHFSGGADLVLDKYN 319
Query: 322 FFVKVSE-DIVC-SVFKGITNSVPIYGNIMQTNFLVGYD 358
+++++ + C ++ V I+GN Q NFLVGYD
Sbjct: 320 MYMELNRGGVFCLAIICNNPTQVAIFGNRAQNNFLVGYD 358
Score = 167 bits (424), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 120/353 (33%), Positives = 166/353 (47%), Gaps = 78/353 (22%)
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
AD + +N+ YL+++ +GTPP E AV DTGS++ WTQC PC CY Q++P+FDP SS
Sbjct: 371 ADTVFDNSVYLMKLQVGTPPFEIEAVIDTGSEITWTQCLPC--VHCYKQNAPIFDPSKSS 428
Query: 142 TYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
T+K +K C +C Y V Y D +++ G LAT+TVT+ ST+G+ +
Sbjct: 429 TFK-------------EKRCHDHSCPYEVDYFDKTYTKGTLATDTVTIHSTSGEPFVMAE 475
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ--------MRTTIAGN--QRLGVSTPD 251
GCG NN F G VGL G +SLI+Q M AGN ++ T
Sbjct: 476 TIIGCGRNN-SWFRPSFEGFVGLNWGPLSLITQMGGEYPGLMSYCFAGNGTSKINFGTNA 534
Query: 252 IVIDSGTTLTFL------PQGYNSNLLSV------------------------------- 274
IV G T + P Y NL +V
Sbjct: 535 IVGGGGVVSTTMFVTTARPGFYYLNLDAVSVGDTRIETLGTPFHALEGNIVIDSGTTLTY 594
Query: 275 ------------MSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSN 321
+ ++ A P ADPTG+ LCY N+ P +T+HF GAD+ L + N
Sbjct: 595 FPESYCNLVRQAVEHVVPAVPAADPTGNDLLCYYSNTTEIFPVITMHFSGGADLVLDKYN 654
Query: 322 FFVK-VSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
F++ S + C ++ I+GN Q NFLVGYD VSFKPT+C+
Sbjct: 655 MFMESYSGGLFCLAIICNNPTQEAIFGNRAQNNFLVGYDSSSLLVSFKPTNCS 707
>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
Length = 412
Score = 185 bits (469), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 140/426 (32%), Positives = 206/426 (48%), Gaps = 72/426 (16%)
Query: 1 MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
MA FL V+IL L + +S + G +EL H D Y +E R+R A R
Sbjct: 1 MAAFL--VWILLLLPYVAISSTASH--GVRLELTHADDRGG--YVGAE----RVRRAADR 50
Query: 61 SLNRLNHF-----------NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVAD 109
S R+N F S + + ++A + + A YL+ I+IGTPP AV D
Sbjct: 51 SHRRVNGFLGAIEGPSSTARLGSDGAGAGGAEASVHASTATYLVDIAIGTPPLPLTAVLD 110
Query: 110 TGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--KSCS--GV 164
TGSDLIWTQC+ PC +C+ Q +PL+ P S+TY ++ C S C +L CS
Sbjct: 111 TGSDLIWTQCDAPC--RRCFPQPAPLYAPARSATYANVSCRSPMCQALQSPWSRCSPPDT 168
Query: 165 NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
C Y SYGDG+ ++G LATET TLGS T A+ G+ FGCGT N G ++G+VG+
Sbjct: 169 GCAYYFSYGDGTSTDGVLATETFTLGSDT----AVRGVAFGCGTENLG-STDNSSGLVGM 223
Query: 225 GGGDISLISQMRTT---------------------------IAGNQRLGVS------TP- 250
G G +SL+SQ+ T G+ L + TP
Sbjct: 224 GRGPLSLVSQLGVTRPRRSCRARAAARGGGAPTTTSPLEGITVGDTLLPIDPAVFRLTPM 283
Query: 251 ---DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS--LSQVPE 305
++IDSGTT T L + L ++S + + L LC++ S +VP
Sbjct: 284 GDGGVIIDSGTTFTALEERAFVALARALASRVRLPLASGAHLGLSLCFAAASPEAVEVPR 343
Query: 306 VTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
+ +HF GAD++L R ++ V+ V + + + G++ Q N + YD+E+ +S
Sbjct: 344 LVLHFDGADMELRRESYVVEDRSAGVACLGMVSARGMSVLGSMQQQNTHILYDLERGILS 403
Query: 366 FKPTDC 371
F+P C
Sbjct: 404 FEPAKC 409
>gi|356557010|ref|XP_003546811.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 437
Score = 184 bits (466), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 107/222 (48%), Positives = 139/222 (62%), Gaps = 20/222 (9%)
Query: 8 VFILF-FLCFYVVSPI---EAQTG--GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRS 61
VF++F L Y S I EA G GFS++LIHRDSP SPFY+ S TP +R+ +A RS
Sbjct: 5 VFMVFMLLALYSPSSISTREAGEGLRGFSIDLIHRDSPLSPFYDPSLTPSERITNAAFRS 64
Query: 62 ---LNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
LNR++HF +++ S +IP N YL+ + IGTPP ERLA+ADTGSDLIW Q
Sbjct: 65 SSRLNRVSHFLDENNLPESL-----LIPENGEYLMTLYIGTPPVERLAIADTGSDLIWVQ 119
Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKSCSGV-NCQYSVSYGDG 175
C PC C+ QD+PLF+P SST+K+ C S C S+ +Q+ C V C YS SYGD
Sbjct: 120 CSPC--QNCFPQDTPLFEPLKSSTFKAATCDSQPCTSVPPSQRQCGKVGQCIYSYSYGDK 177
Query: 176 SFSNGNLATETVTLGST-TGQAVALPGITFGCGTNNGGLFNS 216
SF+ G + TET++ GST Q V+ P FGCG N F++
Sbjct: 178 SFTVGVVGTETLSFGSTGDAQTVSFPSSIFGCGVYNNFTFHT 219
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 73/125 (58%), Gaps = 3/125 (2%)
Query: 251 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHF 310
+I+IDSGT LT+L Q + +N ++ + ++ + D + C+ + ++ +P + F
Sbjct: 313 NIIIDSGTVLTYLEQTFYNNFVASLQEVLSVESAQDLPFPFKFCFPYRDMT-IPVIAFQF 371
Query: 311 RGADVKLSRSNFFVKVSE-DIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
GA V L N +K+ + +++C +V + + I+GN+ Q +F V YD+E + VSF P
Sbjct: 372 TGASVALQPKNLLIKLQDRNMLCLAVVPSSLSGISIFGNVAQFDFQVVYDLEGKKVSFAP 431
Query: 369 TDCTK 373
TDCTK
Sbjct: 432 TDCTK 436
>gi|297846526|ref|XP_002891144.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297336986|gb|EFH67403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 445
Score = 180 bits (457), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 110/242 (45%), Positives = 146/242 (60%), Gaps = 14/242 (5%)
Query: 3 TFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSL 62
T L C L + + S A SVELIHRDSP SP YN T RL A
Sbjct: 5 TLLYCS--LLAITIFFTSTSSAHRKNLSVELIHRDSPHSPLYNPQHTVSDRLNAAF---- 58
Query: 63 NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
L +++ S+ Q+ +I N Y + ISIGTPP++ LA+ADTGSDL W QC+PC
Sbjct: 59 --LRSISRSRRFSTKTDLQSGLISNGGEYFMSISIGTPPSKFLAIADTGSDLTWVQCKPC 116
Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKSC--SGVNCQYSVSYGDGSFS 178
QCY Q++PLFD K SSTYK+ C S C +L +++ C S C+Y SYGD SF+
Sbjct: 117 --QQCYKQNTPLFDKKKSSTYKTESCDSITCNALSEHEEGCDESRNACKYRYSYGDESFT 174
Query: 179 NGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
G +ATET+++ S++G V+ PG FGCG NNGG F +GI+GLGGG +SL+SQ+ ++
Sbjct: 175 KGEVATETISIDSSSGSPVSFPGTAFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSS 234
Query: 239 IA 240
I
Sbjct: 235 IG 236
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 70/151 (46%), Positives = 93/151 (61%), Gaps = 16/151 (10%)
Query: 225 GGGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMS-SMIEAQP 283
GGG SL R T +I+IDSGTTLT L G+ + +V+ S+ A+
Sbjct: 305 GGGGYSL-----------NRKSKKTGNIIIDSGTTLTLLDSGFYDDFGAVVEESVTGAKR 353
Query: 284 VADPTGSLELCY-SFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVC-SVFKGITNS 341
V+DP G L C+ S + +P +T+HF GADVKLS N FVK+SEDIVC S+ T
Sbjct: 354 VSDPQGILTHCFKSGDKEIGLPTITMHFTGADVKLSPINSFVKLSEDIVCLSMIP--TTE 411
Query: 342 VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
V IYGN++Q +FLVGYD+E +TVSF+ DC+
Sbjct: 412 VAIYGNMVQMDFLVGYDLETKTVSFQRMDCS 442
>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
Length = 451
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 122/399 (30%), Positives = 183/399 (45%), Gaps = 68/399 (17%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN-- 87
S+ L+HRD+ Y S ++ + R R+ H + S+S D++
Sbjct: 64 SLSLVHRDAISGATYPSRR---HQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVV 120
Query: 88 ------NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
+ Y +R+ +G+PPT++ V D+GSD+IW QC PC QCY Q PLFDP SS
Sbjct: 121 PGVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPC--EQCYAQTDPLFDPAASS 178
Query: 142 TYKSLPCSSSQCASLNQKSCSGV----NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
++ + C S+ C +L+ C G C YSV+YGDGS++ G LA ET+TLG T
Sbjct: 179 SFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGT----- 233
Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG---------------- 241
A+ G+ GCG N GLF G++GLG G +SL+ Q+ G
Sbjct: 234 AVQGVAIGCGHRNSGLF-VGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASRGAGGAGS 292
Query: 242 ----------------NQRLGVS------TPD----IVIDSGTTLTFLPQGYNSNLLSVM 275
+RL + T D +V+D+GT +T LP+ + L
Sbjct: 293 LASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAF 352
Query: 276 SSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF-RGADVKLSRSNFFVKVSEDIVC 332
+ A P + L+ CY + + +VP V+ +F +GA + L N V+V + C
Sbjct: 353 DGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFDQGAVLTLPARNLLVEVGGAVFC 412
Query: 333 SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
F ++ + I GNI Q + D V F P C
Sbjct: 413 LAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPNTC 451
>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
Length = 418
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 127/385 (32%), Positives = 177/385 (45%), Gaps = 65/385 (16%)
Query: 49 TPYQRLRDALTRSLNRLNHF-NQNSSISSSKASQADIIPNN-------ANYLIRISIGTP 100
T ++ LR RS R H + +++ A + P YL+ ++ GTP
Sbjct: 38 THWELLRRMAQRSKARATHLLSAQDQSGRGRSASAPVNPGAYDDGFPFTEYLVHLAAGTP 97
Query: 101 PTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKS 160
P E DTGSD+ WTQC+ CP S C+ Q PLFDP SS++ SLPCSS C +
Sbjct: 98 PQEVQLTLDTGSDITWTQCKRCPASACFNQTLPLFDPSASSSFASLPCSSPACET--TPP 155
Query: 161 CSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQ--AVALPGITFGCGTNNGG 212
C G N C YS+SYGDGS S G + E T S TG+ + A+PG+ FGCG N G
Sbjct: 156 CGGGNDATSRPCNYSISYGDGSVSRGEIGREVFTFASGTGEGSSAAVPGLVFGCGHANRG 215
Query: 213 LFNSKTTGIVGLGGGDISLISQMR--------TTIAGNQRLGV----------------- 247
+F S TGI G G G +SL SQ++ TTI G++ V
Sbjct: 216 VFTSNETGIAGFGRGSLSLPSQLKVGNFSHCFTTITGSKTSAVLLGLPGVAPPSASPLGR 275
Query: 248 --------STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF-- 297
STP +SGT++T LP + ++ ++ V C+S
Sbjct: 276 RRGSYRCRSTPR-SSNSGTSITSLPPRTYRAVREEFAAQVKLPVVPGNATDPFTCFSAPL 334
Query: 298 -NSLSQVPEVTIHFRGADVKLSRSNFFVKVSED--------IVCSVFKGITNSVPIYGNI 348
VP + +HF GA ++L + N+ +V +D I+C I I GNI
Sbjct: 335 RGPKPDVPTMALHFEGATMRLPQENYVFEVVDDDDAGNSSRIICLAV--IEGGEIILGNI 392
Query: 349 MQTNFLVGYDIEQQTVSFKPTDCTK 373
Q N V YD++ +SF P C +
Sbjct: 393 QQQNMHVLYDLQNSKLSFVPAQCDQ 417
>gi|15226358|ref|NP_180389.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4803959|gb|AAD29831.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252998|gb|AEC08092.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 756
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 122/375 (32%), Positives = 183/375 (48%), Gaps = 80/375 (21%)
Query: 63 NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
N L ++ +S + + AD + + + YL+++ +GTPP E +A DTGSD+IWTQC PC
Sbjct: 393 NFLVGYDSSSLLLQGASPYADTLYDYSIYLMKLQVGTPPFEIVAEIDTGSDIIWTQCMPC 452
Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNL 182
P CY Q +P+FDP SST++ ++ C+G +C Y + Y D ++S G L
Sbjct: 453 P--NCYSQFAPIFDPSKSSTFR-------------EQRCNGNSCHYEIIYADKTYSKGIL 497
Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGL----FNSKTTGIVGLGGGDISLISQM--- 235
ATETVT+ ST+G+ + GCG +N L F S ++GIVGL G +SLISQM
Sbjct: 498 ATETVTIPSTSGEPFVMAETKIGCGLDNTNLQYSGFASSSSGIVGLNMGPLSLISQMDLP 557
Query: 236 ---------------------RTTIAGNQRL------------------GVSTPDIVI-- 254
+AG+ + VS D +I
Sbjct: 558 YPGLISYCFSGQGTSKINFGTNAIVAGDGTVAADMFIKKDNPFYYLNLDAVSVEDNLIAT 617
Query: 255 --------------DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL 300
DSGTTLT+ P Y + + + ++ A V D LCY +++
Sbjct: 618 LGTPFHAEDGNIFIDSGTTLTYFPMSYCNLVREAVEQVVTAVKVPDMGSDNLLCYYSDTI 677
Query: 301 SQVPEVTIHFR-GADVKLSRSNFFVK-VSEDIVCSVFKGITNSVP-IYGNIMQTNFLVGY 357
P +T+HF GAD+ L + N +++ ++ I C S+P ++GN Q NFLVGY
Sbjct: 678 DIFPVITMHFSGGADLVLDKYNMYLETITGGIFCLAIGCNDPSMPAVFGNRAQNNFLVGY 737
Query: 358 DIEQQTVSFKPTDCT 372
D +SF PT+C+
Sbjct: 738 DPSSNVISFSPTNCS 752
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 131/423 (30%), Positives = 188/423 (44%), Gaps = 106/423 (25%)
Query: 1 MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
+AT + +F+ CF + + + G F+++LI R S S F RL
Sbjct: 18 LATTMIVLFLQIITCFLFTTTVSSPHG-FTIDLIQRRSNSSSF---------RL------ 61
Query: 61 SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
S N+L + AD + + YL+++ +GTPP E A DTGSDLIWTQC
Sbjct: 62 SKNQLQ----------GASPYADTLFDYNIYLMKLQVGTPPFEIAAEIDTGSDLIWTQCM 111
Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNG 180
PCP CY Q P+FDP SST+ N++ C G +C Y + Y D ++S G
Sbjct: 112 PCP--DCYSQFDPIFDPSKSSTF-------------NEQRCHGKSCHYEIIYEDNTYSKG 156
Query: 181 NLATETVTLGSTTGQAVALPGITFGCGTNNGGL----FNSKTTGIVGLGGGDISLISQMR 236
LATETVT+ ST+G+ + T GCG +N L F S ++GIVGL G SLISQM
Sbjct: 157 ILATETVTIHSTSGEPFVMAETTIGCGLHNTDLDNSGFASSSSGIVGLNMGPRSLISQMD 216
Query: 237 TTI----------AGNQRLGVSTPDIVIDSGT---------------------------- 258
G ++ T IV GT
Sbjct: 217 LPYPGLISYCFSGQGTSKINFGTNAIVAGDGTVAADMFIKKDNPFYYLNLDAVSVEDNRI 276
Query: 259 --------------------TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN 298
T+T+ P Y + + + ++ A V DP+G+ LCY
Sbjct: 277 ETLGTPFHAEDGNIVIDSGSTVTYFPVSYCNLVRKAVEQVVTAVRVPDPSGNDMLCYFSE 336
Query: 299 SLSQVPEVTIHFR-GADVKLSRSNFFVKV-SEDIVC-SVFKGITNSVPIYGNIMQTNFLV 355
++ P +T+HF GAD+ L + N +++ S + C ++ I+GN Q NFLV
Sbjct: 337 TIDIFPVITMHFSGGADLVLDKYNMYMESNSGGLFCLAIICNSPTQEAIFGNRAQNNFLV 396
Query: 356 GYD 358
GYD
Sbjct: 397 GYD 399
>gi|356528675|ref|XP_003532925.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Glycine max]
Length = 342
Score = 178 bits (452), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 125/370 (33%), Positives = 177/370 (47%), Gaps = 80/370 (21%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
GFS++LIHRDSP SPFYN S TP +R+ DA S N+N K ++ +IPN
Sbjct: 28 GFSIDLIHRDSPLSPFYNPSLTPSERITDAALSS-------NEN------KLPESILIPN 74
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
N YL+R+ IGTPP ERL +ADTGSD IW QC P
Sbjct: 75 NGEYLMRLYIGTPPVERLVIADTGSDFIWVQCS--------------------------P 108
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG-QAVALPGITFGC 206
C + QC LN Y + SF+ + TET++ ST G Q V+ P FGC
Sbjct: 109 CQNCQCVYLN-------------IYANKSFTIEVVGTETLSFDSTGGAQTVSFPNSIFGC 155
Query: 207 GTNNGGLFNS--KTTGIVGLGGGDISLISQMRTTIA--------GNQRL----GVSTPDI 252
G NN F S K TG+VGL G +SL+SQ+ I G++ + GV + +
Sbjct: 156 GANNNLTFRSSDKATGLVGLVAGQLSLVSQLGAQIGYKFSYLKFGSEAIITTNGVVSTPL 215
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIE-----AQPVADPTGSLELCYSFNSLSQVPEVT 307
+I L FL N ++++ ++ + V D + C+ + VP +
Sbjct: 216 IIKPSLPLYFL----NLEVVTIGQKVVPTETLGVESVQDLPFPFKFCFPYRDNMTVPAIA 271
Query: 308 IHFRGADVKLSRSNFFVKVSED--IVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQT 363
F GA V L N +K+ + + +V + + + I+G I Q +F V YD++ +
Sbjct: 272 FQFTGASVALRPKNLLIKLQDRNMLXLAVVPSASSLSVISIFGIIAQFDFQVLYDLDGKK 331
Query: 364 VSFKPTDCTK 373
VS PTDCTK
Sbjct: 332 VSVAPTDCTK 341
>gi|255566002|ref|XP_002523989.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536716|gb|EEF38357.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 146/432 (33%), Positives = 199/432 (46%), Gaps = 96/432 (22%)
Query: 22 IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ-----NSSISS 76
++A GF+ ELI RDSP SPFYN+ L A TRS N H++ N S
Sbjct: 30 VKADNFGFTAELIRRDSPNSPFYNA-------LEAAATRSTNASQHYDAQIGRFNLMSDS 82
Query: 77 SKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
ASQ+++ + NYLI+IS+GTPP E LA+AD DL W C+ C Q +D F
Sbjct: 83 YYASQSELNFSKGNYLIKISVGTPPAEILALADITGDLTWLPCKTC---QDCTKDGFTFF 139
Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVNCQY---SVSYGDGSFSN-GNLATETVTLGST 192
P SSTY S C S QC N C C Y + S +N G +A +T++ S+
Sbjct: 140 PSESSTYTSAACESYQCQITNGAVCQTKMCIYLCGPLPQQRSSCTNKGLVAMDTISFHSS 199
Query: 193 TGQAVALPGITFGCGT--NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG--NQRL--- 245
+GQA++ P F CGT +N + GIVGLG G S+ SQM+ I G +Q L
Sbjct: 200 SGQALSYPNTNFICGTFIDNWHYIGA---GIVGLGRGLFSMTSQMKHLINGTFSQCLVPY 256
Query: 246 -------------GVSTPDIVI--------DSGTTLTFL--------------------- 263
GV + + V+ +SG FL
Sbjct: 257 SSKQSSKINFGLKGVVSGEGVVSTPIADDGESGAYFLFLEAMSVGGNRVANNFYSAPKSN 316
Query: 264 ------------PQGYNSNLLSVMSSMIEAQPV-ADPTGSLELCYSFNSLS--QVPEVTI 308
P + N+ + + I P+ + L LCY S P +T+
Sbjct: 317 IYIDWRTTFTSLPHDFYENVEAEVRKAINLTPINYNNERKLSLCYKSESDHDFDAPPITM 376
Query: 309 HFRGADVKLSRSNFFVKVSEDIVCSVF--------KGITNSVPIYGNIMQTNFLVGYDIE 360
HF ADV+LS N FV++ ++VC F K IT++V YG+ Q NF+VGYD++
Sbjct: 377 HFTNADVQLSPLNTFVRMDWNVVCFAFLDGTFNATKRITHAV--YGSWQQMNFIVGYDLK 434
Query: 361 QQTVSFKPTDCT 372
TVSFK DCT
Sbjct: 435 SSTVSFKQADCT 446
>gi|358345197|ref|XP_003636668.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355502603|gb|AES83806.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 294
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 102/243 (41%), Positives = 142/243 (58%), Gaps = 37/243 (15%)
Query: 3 TFLSCVFILFFLCFYVVSP-IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRS 61
T+ + ++ L F + P IEA GGF+ +LI R+S K
Sbjct: 2 TYPRKIHLISILLFVFIFPHIEAHNGGFTGKLIPRNSSK--------------------- 40
Query: 62 LNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEP 121
+ FN+N+ Q+ + N+ +YL+ +SIGTPP + A ADTGSDLIW QC P
Sbjct: 41 ----DFFNRNTI-------QSPVSANHYDYLMELSIGTPPVKIYAQADTGSDLIWLQCIP 89
Query: 122 CPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSN 179
C + CY Q +P+FD + SST+ ++ C S C+ L SCS +NC+Y+ SY DGS +
Sbjct: 90 C--TNCYKQLNPMFDSQSSSTFSNIACGSESCSKLYSTSCSPDQINCKYNYSYVDGSETQ 147
Query: 180 GNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
G LA ET+TL STTG+ VA G+ FGCG NN G FN K GI+GLG G +SL+SQ+ +++
Sbjct: 148 GVLAQETLTLTSTTGEPVAFKGVIFGCGHNNNGAFNDKEMGIIGLGRGPLSLVSQIGSSL 207
Query: 240 AGN 242
GN
Sbjct: 208 GGN 210
>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 395
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 140/436 (32%), Positives = 200/436 (45%), Gaps = 111/436 (25%)
Query: 1 MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
+AT + +F L + +++++ + GF+++LIHR S S +R
Sbjct: 3 LATTMIAIF-LQIITYFLITTTASSPQGFTIDLIHRRSNASS----------------SR 45
Query: 61 SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
N + + AD + + YL+++ IGTPP E AV DTGS+ IWTQC
Sbjct: 46 VFN-----------TQLGSPYADTVFDTYEYLMKLQIGTPPFEIEAVLDTGSEHIWTQCL 94
Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNG 180
PC CY Q +P+FDP SST+K + C + + C Y + YG S++ G
Sbjct: 95 PC--VHCYNQTAPIFDPSKSSTFKEIRCDTHDHS-----------CPYELVYGGKSYTKG 141
Query: 181 NLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM----- 235
L TETVT+ ST+GQ +P GCG NN G F G+VGL G SLI+QM
Sbjct: 142 TLVTETVTIHSTSGQPFVMPETIIGCGRNNSG-FKPGFAGVVGLDRGPKSLITQMGGEYP 200
Query: 236 -----------------------------RTTI------------------AGNQRLG-V 247
TT+ GN R+ V
Sbjct: 201 GLMSYCFAGKGTSKINFGANAIVAGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETV 260
Query: 248 STP------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS 301
TP +IVIDSG+TLT+ P+ Y + + + ++ A V P + LCY ++
Sbjct: 261 GTPFHALKGNIVIDSGSTLTYFPESYCNLVRKAVEQVVTA--VRFPRSDI-LCYYSKTID 317
Query: 302 QVPEVTIHFR-GADVKLSRSNFFVKVSE-DIVCSVFKGITNS---VPIYGNIMQTNFLVG 356
P +T+HF GAD+ L + N +V + + C I NS I+GN Q NFLVG
Sbjct: 318 IFPVITMHFSGGADLVLDKYNMYVASNTGGVFCLAI--ICNSPIEEAIFGNRAQNNFLVG 375
Query: 357 YDIEQQTVSFKPTDCT 372
YD VSFKPT+C+
Sbjct: 376 YDSSSLLVSFKPTNCS 391
>gi|222628951|gb|EEE61083.1| hypothetical protein OsJ_14969 [Oryza sativa Japonica Group]
Length = 367
Score = 174 bits (441), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 114/339 (33%), Positives = 177/339 (52%), Gaps = 34/339 (10%)
Query: 49 TPYQRLRDALTRSLNRLNHFN--QNSSISSSKASQAD--IIPNNANYLIRISIGTPPTER 104
T ++ LR A+ RS RL + + S+ KA A+ I+P YL+++ IGTPP +
Sbjct: 43 TEHELLRRAIQRSRYRLAGIGMARGEAASARKAVVAETPIMPAGGEYLVKLGIGTPPYKF 102
Query: 105 LAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV 164
A DT SDLIWTQC+PC + CY Q P+F+P++SSTY +LPCSS C L+ C
Sbjct: 103 TAAIDTASDLIWTQCQPC--TGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHD 160
Query: 165 N---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN-GGLFNSKTTG 220
+ CQY+ +Y + + G LA + + +G A G+ FGC T++ GG + +G
Sbjct: 161 DDESCQYTYTYSGNATTEGTLAVDKLVIGED-----AFRGVAFGCSTSSTGGAPPPQASG 215
Query: 221 IVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIE 280
+VGLG G +SL+SQ L V ++ID +T+TFL L++ + I
Sbjct: 216 VVGLGRGPLSLVSQ----------LSVRRYGMIIDIASTITFLEASLYDELVNDLEVEIR 265
Query: 281 AQPVADPTGSLELCY------SFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSED-IVC- 332
+ L+LC+ +F+ + VP V + F G ++L ++ F + E ++C
Sbjct: 266 LPRGTGSSLGLDLCFILPDGVAFDRV-YVPAVALAFDGRWLRLDKARLFAEDRESGMMCL 324
Query: 333 SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
V + SV I GN Q N V Y++ + V+F + C
Sbjct: 325 MVGRAEAGSVSILGNFQQQNMQVLYNLRRGRVTFVQSPC 363
>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 389
Score = 174 bits (440), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 137/426 (32%), Positives = 194/426 (45%), Gaps = 110/426 (25%)
Query: 11 LFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ 70
L + +++++ + GF+++LIHR S S +R N
Sbjct: 6 LQIITYFLITTTASSPQGFTIDLIHRRSNASS----------------SRVFN------- 42
Query: 71 NSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
+ + AD + + YL+++ IGTPP E AV DTGS+ IWTQC PC CY Q
Sbjct: 43 ----TQLGSPYADTVFDTYEYLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPC--VHCYNQ 96
Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLG 190
+P+FDP SST+K + C + + C Y + YG S++ G L TETVT+
Sbjct: 97 TAPIFDPSKSSTFKEIRCDTHDHS-----------CPYELVYGGKSYTKGTLVTETVTIH 145
Query: 191 STTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM--------------- 235
ST+GQ +P GCG NN G F G+VGL G SLI+QM
Sbjct: 146 STSGQPFVMPETIIGCGRNNSG-FKPGFAGVVGLDRGPKSLITQMGGEYPGLMSYCFAGK 204
Query: 236 -------------------RTTI------------------AGNQRLG-VSTP------D 251
TT+ GN R+ V TP +
Sbjct: 205 GTSKINFGANAIVAGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFHALKGN 264
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR 311
IVIDSG+TLT+ P+ Y + + + ++ A V P + LCY ++ P +T+HF
Sbjct: 265 IVIDSGSTLTYFPESYCNLVRKAVEQVVTA--VRFPRSDI-LCYYSKTIDIFPVITMHFS 321
Query: 312 -GADVKLSRSNFFVKVSE-DIVCSVFKGITNS---VPIYGNIMQTNFLVGYDIEQQTVSF 366
GAD+ L + N +V + + C I NS I+GN Q NFLVGYD VSF
Sbjct: 322 GGADLVLDKYNMYVASNTGGVFCLAI--ICNSPIEEAIFGNRAQNNFLVGYDSSSLLVSF 379
Query: 367 KPTDCT 372
KPT+C+
Sbjct: 380 KPTNCS 385
>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like, partial [Cucumis sativus]
Length = 716
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 89/221 (40%), Positives = 134/221 (60%), Gaps = 14/221 (6%)
Query: 21 PIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS 80
P + + GF V L H D K+ T ++RLR + R NRL+ N ++++ A+
Sbjct: 298 PNKLPSHGFRVRLKHVDHVKN------LTRFERLRRGVARGKNRLHRLNA-MVLAAANAT 350
Query: 81 QAD-----IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
D ++ N +L++++IG+PP A+ DTGSDLIWTQC+PC QC+ Q +P+F
Sbjct: 351 VGDQVKAPVVAGNGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCKPC--QQCFDQSTPIF 408
Query: 136 DPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
DPK SS++ + CSS C +L +CS C+Y +YGD S + G LA ET T G +T
Sbjct: 409 DPKQSSSFYKISCSSELCGALPTSTCSSDGCEYLYTYGDSSSTQGVLAFETFTFGDSTED 468
Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
+++PG+ FGCG +N G S+ G+VGLG G +SL+SQ++
Sbjct: 469 QISIPGLGFGCGNDNNGDGFSQGAGLVGLGRGPLSLVSQLK 509
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 45/126 (35%), Positives = 78/126 (61%), Gaps = 9/126 (7%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ--PVADP-TGSLELCYSF---NSLSQVPE 305
++IDSGTT+T++ NS S+ + I PV D TG L+LC++ + +VP+
Sbjct: 591 VIIDSGTTITYVE---NSAFTSLKNEFIAQMNLPVDDSGTGGLDLCFNLPAGTNQVEVPK 647
Query: 306 VTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
+T HF+GAD++L N+ + S+ + + G + + I+GN+ Q NF+V +D++++T+S
Sbjct: 648 LTFHFKGADLELPGENYMIGDSKAGLLCLAIGSSRGMSIFGNLQQQNFMVVHDLQEETLS 707
Query: 366 FKPTDC 371
F PT C
Sbjct: 708 FLPTQC 713
>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 392
Score = 172 bits (435), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 120/355 (33%), Positives = 172/355 (48%), Gaps = 82/355 (23%)
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
AD + + YL+++ +GTPP E A DTGSDLIWTQC PC + CY Q +P+FDP SS
Sbjct: 52 ADTLFDYNIYLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPC--TNCYSQYAPIFDPSNSS 109
Query: 142 TYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
T+K +K C+G +C Y + Y D ++S G LATETVT+ ST+G+ +P
Sbjct: 110 TFK-------------EKRCNGNSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPE 156
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM------------------------RT 237
T GCG +N F +G+VGL G SLI+QM
Sbjct: 157 TTIGCG-HNSSWFKPTFSGMVGLSWGPSSLITQMGGEYPGLMSYCFASQGTSKINFGTNA 215
Query: 238 TIAGNQRL-------------------GVSTPDIVIDS-GTTL---------------TF 262
+AG+ + VS D +++ GTT T+
Sbjct: 216 IVAGDGVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSGTTLTY 275
Query: 263 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSN 321
P Y + + + + A ADPTG+ LCY +++ P +T+HF GAD+ L + N
Sbjct: 276 FPVSYCNLVREAVDHYVTAVRTADPTGNDMLCYYTDTIDIFPVITMHFSGGADLVLDKYN 335
Query: 322 FFVK-VSEDIVCSVFKGITNSVP---IYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
+++ ++ C I N+ P I+GN Q NFLVGYD VSF PT+C+
Sbjct: 336 MYIETITRGTFCLAI--ICNNPPQDAIFGNRAQNNFLVGYDSSSLLVSFSPTNCS 388
>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 461
Score = 172 bits (435), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 88/220 (40%), Positives = 132/220 (60%), Gaps = 12/220 (5%)
Query: 21 PIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ----NSSISS 76
P + + GF V L H D K+ T ++RLR + R NRL+ N ++ +
Sbjct: 43 PNKLPSHGFRVRLKHVDHVKN------LTRFERLRRGVARGKNRLHRLNAMVLAAANATV 96
Query: 77 SKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
+A ++ N +L++++IG+PP A+ DTGSDLIWTQC+PC QC+ Q +P+FD
Sbjct: 97 GDQVKAPVVAGNGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCKPC--QQCFDQSTPIFD 154
Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
PK SS++ + CSS C +L +CS C+Y +YGD S + G LA ET T G +T
Sbjct: 155 PKQSSSFYKISCSSELCGALPTSTCSSDGCEYLYTYGDSSSTQGVLAFETFTFGDSTEDQ 214
Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
+++PG+ FGCG +N G S+ G+VGLG G +SL+SQ++
Sbjct: 215 ISIPGLGFGCGNDNNGDGFSQGAGLVGLGRGPLSLVSQLK 254
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 45/126 (35%), Positives = 79/126 (62%), Gaps = 9/126 (7%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ--PVADP-TGSLELCYSFNSLS---QVPE 305
++IDSGTT+T++ NS S+ + I PV D TG L+LC++ + + +VP+
Sbjct: 336 VIIDSGTTITYVE---NSAFTSLKNEFIAQMNLPVDDSGTGGLDLCFNLPAGTNQVEVPK 392
Query: 306 VTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
+T HF+GAD++L N+ + S+ + + G + + I+GN+ Q NF+V +D++++T+S
Sbjct: 393 LTFHFKGADLELPGENYMIGDSKAGLLCLAIGSSRGMSIFGNLQQQNFMVVHDLQEETLS 452
Query: 366 FKPTDC 371
F PT C
Sbjct: 453 FLPTQC 458
>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
Full=Nepenthesin-II; Flags: Precursor
gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
Length = 438
Score = 170 bits (431), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 123/406 (30%), Positives = 185/406 (45%), Gaps = 73/406 (17%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
G V+L DS K+ T Y+ ++ A+ R R+ N + + SS + +
Sbjct: 41 GLRVDLEQVDSGKN------LTKYELIKRAIKRGERRMRSIN--AMLQSSSGIETPVYAG 92
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ YL+ ++IGTP + A+ DTGSDLIWTQCEPC +QC+ Q +P+F+P+ SS++ +LP
Sbjct: 93 DGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEPC--TQCFSQPTPIFNPQDSSSFSTLP 150
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTL-----------------G 190
C S C L ++C+ CQY+ YGDGS + G +ATET T G
Sbjct: 151 CESQYCQDLPSETCNNNECQYTYGYGDGSTTQGYMATETFTFETSSVPNIAFGCGEDNQG 210
Query: 191 STTGQAVALPGITFG---------------CGTNNGG------LFNSKTTGIVGLGGGDI 229
G L G+ +G C T+ G S +G V G
Sbjct: 211 FGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCMTSYGSSSPSTLALGSAASG-VPEGSPST 269
Query: 230 SLISQ----------MRTTIAGNQRLGV----------STPDIVIDSGTTLTFLPQGYNS 269
+LI ++ G LG+ T ++IDSGTTLT+LPQ +
Sbjct: 270 TLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYN 329
Query: 270 NLLSVMSSMIEAQPVADPTGSLELCY---SFNSLSQVPEVTIHFRGADVKLSRSNFFVKV 326
+ + I V + + L C+ S S QVPE+++ F G + L N +
Sbjct: 330 AVAQAFTDQINLPTVDESSSGLSTCFQQPSDGSTVQVPEISMQFDGGVLNLGEQNILISP 389
Query: 327 SEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+E ++C + + I+GNI Q V YD++ VSF PT C
Sbjct: 390 AEGVICLAMGSSSQLGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 435
>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 397
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 122/357 (34%), Positives = 173/357 (48%), Gaps = 81/357 (22%)
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
AD + + + YL+R+ +GTPP E +A DTGSDLIWTQC PCP CY Q +P+FDP SS
Sbjct: 52 ADTVFDYSIYLMRLQLGTPPFEIVAEIDTGSDLIWTQCMPCP--NCYTQFAPIFDPSKSS 109
Query: 142 TYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
T+K +K C G +C Y + Y D S+S G LATETVT+ ST+G+ +
Sbjct: 110 TFK-------------EKRCHGNSCPYEIIYADESYSTGILATETVTIQSTSGEPFVMAE 156
Query: 202 ITFGCGTNNGGL----FNSKTTGIVGLGGGDISLISQMRTTI----------AGNQRLGV 247
+ GCG NN L + + ++GIVGL G SLISQM I G ++
Sbjct: 157 TSIGCGLNNSNLMTPGYAASSSGIVGLNMGPSSLISQMDLPIPGLISYCFSSQGTSKINF 216
Query: 248 STPDIVIDSGT------------------------------------------------T 259
T +V GT T
Sbjct: 217 GTNAVVAGDGTVAADMFIKKDQPFYYLNLDAVSVGDKRIETLGTPFHAQDGNIFIDSGTT 276
Query: 260 LTFLPQGY-NSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKL 317
T+LP Y N +V +S++ A V DP+ LCY+++++ P +T+HF GAD+ L
Sbjct: 277 YTYLPTSYCNLVREAVAASVVAANQVPDPSSENLLCYNWDTMEIFPVITLHFAGGADLVL 336
Query: 318 SRSNFFVK-VSEDIVCSVFKGITNSVP-IYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
+ N +V+ ++ C + S+P I+GN N LVGYD +SF PT+C+
Sbjct: 337 DKYNMYVETITGGTFCLAIGCVDPSMPAIFGNRAHNNLLVGYDSSTLVISFSPTNCS 393
>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 392
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 119/355 (33%), Positives = 171/355 (48%), Gaps = 82/355 (23%)
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
AD + + YL+++ +GTPP E A DTGSDLIWTQC PC + CY Q +P+FDP SS
Sbjct: 52 ADTLFDYNIYLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPC--TNCYSQYAPIFDPSNSS 109
Query: 142 TYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
T+K +K C+G +C Y + Y D ++S G LATETVT+ ST+G+ +P
Sbjct: 110 TFK-------------EKRCNGNSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPE 156
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM------------------------RT 237
T GCG +N F +G+VGL G SLI+QM
Sbjct: 157 TTIGCG-HNSSWFKPTFSGMVGLSWGPSSLITQMGGEYPGLMSYCFASQGTSKINFGTNA 215
Query: 238 TIAGNQRL-------------------GVSTPDIVIDS-GTTL---------------TF 262
+AG+ + VS D +++ GTT T+
Sbjct: 216 IVAGDGVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSGTTLTY 275
Query: 263 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSN 321
P Y + + + + A ADPTG+ LCY +++ P +T+HF GAD+ L + N
Sbjct: 276 FPVSYCNLVREAVDHYVTAVRTADPTGNDMLCYYTDTIDIFPVITMHFSGGADLVLDKYN 335
Query: 322 FFVK-VSEDIVCSVFKGITNSVP---IYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
+++ ++ C I N+ P I+GN Q NFLVGYD V F PT+C+
Sbjct: 336 MYIETITRGTFCLAI--ICNNPPQDAIFGNRAQNNFLVGYDSSSLLVFFSPTNCS 388
>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 118/348 (33%), Positives = 160/348 (45%), Gaps = 70/348 (20%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ NY++ + +GTP ++ V DTGSD W QC PC +CY Q PLFDP SSTY ++
Sbjct: 159 STGNYVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCV-VKCYKQKEPLFDPAKSSTYANV 217
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C+ S CA L+ C+G +C Y+V YGDGS++ G A +T+T+ A+ G FGC
Sbjct: 218 SCTDSACADLDTNGCTGGHCLYAVQYGDGSYTVGFFAQDTLTIAHD-----AIKGFRFGC 272
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG------------------------- 241
G N GLF KT G++GLG G SL Q G
Sbjct: 273 GEKNNGLFG-KTAGLMGLGRGKTSLTVQAYNKYGGAFAYCLPALTTGTGYLDFGPGSAGN 331
Query: 242 NQRL-----------------GV--------------STPDIVIDSGTTLTFLPQGYNSN 270
N RL G+ ST ++DSGT +T LP +
Sbjct: 332 NARLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVFSTAGTLVDSGTVITRLPATAYTA 391
Query: 271 LLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFVK 325
L S ++ A+ G L+ CY F LS V P V++ F+ GA + + S
Sbjct: 392 LSSAFDKVMLARGYKKAPGYSILDTCYDFTGLSDVELPTVSLVFQGGACLDVDVSGIVYA 451
Query: 326 VSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+SE VC F G SV I GN Q + V YD+ ++TV F P C
Sbjct: 452 ISEAQVCLAFASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499
>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 118/348 (33%), Positives = 160/348 (45%), Gaps = 70/348 (20%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ NY++ + +GTP ++ V DTGSD W QC PC +CY Q PLFDP SSTY ++
Sbjct: 159 STGNYVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCV-VKCYKQKGPLFDPAKSSTYANV 217
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C+ S CA L+ C+G +C Y+V YGDGS++ G A +T+T+ A+ G FGC
Sbjct: 218 SCTDSACADLDTNGCTGGHCLYAVQYGDGSYTVGFFAQDTLTIAHD-----AIKGFRFGC 272
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG------------------------- 241
G N GLF KT G++GLG G SL Q G
Sbjct: 273 GEKNNGLFG-KTAGLMGLGRGKTSLTVQAYNKYGGAFAYCLPALTTGTGYLDFGPGSAGN 331
Query: 242 NQRL-----------------GV--------------STPDIVIDSGTTLTFLPQGYNSN 270
N RL G+ ST ++DSGT +T LP +
Sbjct: 332 NARLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVFSTAGTLVDSGTVITRLPATAYTA 391
Query: 271 LLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFVK 325
L S ++ A+ G L+ CY F LS V P V++ F+ GA + + S
Sbjct: 392 LSSAFDKVMLARGYKKAPGYSILDTCYDFTGLSDVELPTVSLVFQGGACLDVDVSGIVYA 451
Query: 326 VSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+SE VC F G SV I GN Q + V YD+ ++TV F P C
Sbjct: 452 ISEAQVCLAFASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499
>gi|218187618|gb|EEC70045.1| hypothetical protein OsI_00635 [Oryza sativa Indica Group]
Length = 570
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 133/409 (32%), Positives = 192/409 (46%), Gaps = 73/409 (17%)
Query: 24 AQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD 83
A GGFSVE IHRDSP+SPF++ + T + R A RS+ R ++S S+S AD
Sbjct: 29 ASGGGFSVEFIHRDSPRSPFHDPAFTAHGRALAAARRSVARAAAIAGSASSSASGGGAAD 88
Query: 84 -----IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE---------PCPPSQCYM 129
++ + YL+ +++G+PP LA+ADTGSDL+W +C+ P +Q
Sbjct: 89 DVVSKVVSRSFEYLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQ--- 145
Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVT 188
FDP SSTY + C + C +L + +C G NC Y +YGDGS + G L+TET T
Sbjct: 146 -----FDPSRSSTYGRVSCQTDACEALGRATCDDGSNCAYLYAYGDGSNTTGVLSTETFT 200
Query: 189 L----GSTTGQAVALPGITFGCGTNNGGLF---------NSKTTGIVGLGG--------- 226
+ + V + G+ FGC T G F + + LGG
Sbjct: 201 FDDGGAGRSPRQVRIGGVKFGCSTATAGSFPADGLVGLGGGAVSLVTQLGGATSLGRRFS 260
Query: 227 -------------------GDISLISQMRTTIAGNQRLG-VSTPDIVIDSGTTLTFLPQG 266
D++ T + GN+ + ++ I++DSGTTLTFL
Sbjct: 261 YCLVPHSVNASSALNFGALADVTEPGAASTPLVGNKTVASAASSRIIVDSGTTLTFLDPS 320
Query: 267 YNSNLLSVMSSMIEAQPVADPTGSLELCYS-----FNSLSQVPEVTIHF-RGADVKLSRS 320
++ +S I PV P G L+LCY+ + +P++T+ F GA V L
Sbjct: 321 LLGPIVDELSRRITLPPVQSPDGLLQLCYNVAGREVEAGESIPDLTLEFGGGAAVALKPE 380
Query: 321 NFFVKVSEDIVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTVSFK 367
N FV V E +C T P I GN+ Q N VGYD++ TV K
Sbjct: 381 NAFVAVQEGTLCLAIVATTEQQPVSILGNLAQQNIHVGYDLDAGTVGNK 429
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 49/128 (38%), Positives = 66/128 (51%), Gaps = 8/128 (6%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS-----FNSLSQVPEV 306
I++DSGTTLTFL ++ +S I PV P G L+LCY+ + +P++
Sbjct: 439 IIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQSPDGLLQLCYNVAGREVEAGESIPDL 498
Query: 307 TIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQT 363
T+ F GA V L N FV V E +C T P I GN+ Q N VGYD++ T
Sbjct: 499 TLEFGGGAAVALKPENAFVAVQEGTLCLAIVATTEQQPVSILGNLAQQNIHVGYDLDAGT 558
Query: 364 VSFKPTDC 371
V+F DC
Sbjct: 559 VTFAVADC 566
>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 396
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 131/434 (30%), Positives = 197/434 (45%), Gaps = 106/434 (24%)
Query: 1 MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
+AT + +F+ LCF + + + GF+++LIHR
Sbjct: 3 LATTIIVLFLQISLCF-LFTTTASPPHGFTMDLIHR------------------------ 37
Query: 61 SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
R N ++ S+ S + A+ + +N+ YL+++ +GTPP E A+ DTGS++ WTQC
Sbjct: 38 ---RSNASSRVSNTQSGSSPYANTVFDNSVYLMKLQVGTPPFEIQAIIDTGSEITWTQCL 94
Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNG 180
PC CY Q++P+FDP SST+K +K C G +C Y V Y D +++ G
Sbjct: 95 PC--VHCYEQNAPIFDPSKSSTFK-------------EKRCDGHSCPYEVDYFDHTYTMG 139
Query: 181 NLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI- 239
LATET+TL ST+G+ +P GCG NN F +G+VGL G SLI+QM
Sbjct: 140 TLATETITLHSTSGEPFVMPETIIGCGHNN-SWFKPSFSGMVGLNWGPSSLITQMGGEYP 198
Query: 240 ---------AGNQRLGVSTPDIVIDSGTTLTFL------PQGYNSNL--LSVMSSMIE-- 280
G ++ IV G T + P Y NL +SV ++ IE
Sbjct: 199 GLMSYCFSGQGTSKINFGANAIVAGDGVVSTTMFMTTAKPGFYYLNLDAVSVGNTRIETM 258
Query: 281 ---------------------------------------AQPVADPTGSLELCYSFNSLS 301
A ADPTG+ LCY+ +++
Sbjct: 259 GTTFHALEGNIVIDSGTTLTYFPVSYCNLVRQAVEHVVTAVRAADPTGNDMLCYNSDTID 318
Query: 302 QVPEVTIHFRGA-DVKLSRSNFFVKVSE-DIVC-SVFKGITNSVPIYGNIMQTNFLVGYD 358
P +T+HF G D+ L + N +++ + + C ++ I+GN Q NFLVGYD
Sbjct: 319 IFPVITMHFSGGVDLVLDKYNMYMESNNGGVFCLAIICNSPTQEAIFGNRAQNNFLVGYD 378
Query: 359 IEQQTVSFKPTDCT 372
VSF PT+C+
Sbjct: 379 SSSLLVSFSPTNCS 392
>gi|356540369|ref|XP_003538662.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Glycine max]
Length = 364
Score = 168 bits (425), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 112/352 (31%), Positives = 161/352 (45%), Gaps = 80/352 (22%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
NN +YL+++++GTPP + + DT SDL+W QC PC CY Q +P+FDP
Sbjct: 27 NNGDYLMKLTLGTPPVDVYGLVDTDSDLVWAQCTPC--QGCYKQKNPMFDPL-------- 76
Query: 147 PCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
+C S SCS C Y +Y D S + G LA E T ST G+ + + I FG
Sbjct: 77 ----KECNSFFDHSCSPEKACDYVYAYADDSATKGMLAKEIATFSSTDGKPI-VESIIFG 131
Query: 206 CGTNNGGLFNS---------------------------------------KTTGIVGLG- 225
CG NN G+FN T+G + LG
Sbjct: 132 CGHNNTGVFNENDMGLIGLGGGPLSLVSQMGNLYGSKRFSQCLVPFHADPHTSGTISLGE 191
Query: 226 GGDIS--------LISQMRTT---------------IAGNQRLGVSTPDIVIDSGTTLTF 262
D+S L+S+ T + N +S +I+IDSGT T+
Sbjct: 192 ASDVSGEGVVTTPLVSEEGQTPYLVTLEGISVGDTFVPFNSSEMLSKGNIMIDSGTPETY 251
Query: 263 LPQGYNSNLLSVMSSMIEAQPV-ADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSN 321
LPQ + L+ + I P+ DP +LCY + + P +T HF GADVKL
Sbjct: 252 LPQEFYDRLVEELKVQINLPPIHVDPDLGTQLCYKSETNLEGPILTAHFEGADVKLLPLQ 311
Query: 322 FFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
F+ + + C G T+ + I+GN Q+N L+G+D++++ V FKPTD TK
Sbjct: 312 TFIPPKDGVFCFAMTGTTDGLYIFGNFAQSNVLIGFDLDKRIVFFKPTDFTK 363
>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
Length = 464
Score = 167 bits (424), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 122/412 (29%), Positives = 182/412 (44%), Gaps = 81/412 (19%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN-- 87
S+ L+HRD+ Y S ++ + R R+ H + S+S D++
Sbjct: 64 SLSLVHRDAISGATYPSRR---HQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVV 120
Query: 88 ------NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
+ Y +R+ +G+PPT++ V D+GSD+IW QC PC QCY Q PLFDP SS
Sbjct: 121 PGVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPC--EQCYAQTDPLFDPAASS 178
Query: 142 TYKSLPCSSSQCASLNQKSCSGV----NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
++ + C S+ C +L+ C G C YSV+YGDGS++ G LA ET+TLG T
Sbjct: 179 SFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGT----- 233
Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLG---------------------------GGDIS 230
A+ G+ GCG N GLF G++GLG GG S
Sbjct: 234 AVQGVAIGCGHRNSGLF-VGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASRGAGGAGS 292
Query: 231 LISQMRTTI------------------AGNQRLGVS------TPD----IVIDSGTTLTF 262
L+ + G +RL + T D +V+D+GT +T
Sbjct: 293 LVLGRTEAVPRGRRASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAVTR 352
Query: 263 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF-RGADVKLSR 319
LP+ + L + A P + L+ CY + + +VP V+ +F +GA + L
Sbjct: 353 LPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFDQGAVLTLPA 412
Query: 320 SNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
N V+V + C F ++ + I GNI Q + D V F P C
Sbjct: 413 RNLLVEVGGAVFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPNTC 464
>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
Full=Nepenthesin-I; Flags: Precursor
gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
Length = 437
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 90/216 (41%), Positives = 128/216 (59%), Gaps = 15/216 (6%)
Query: 23 EAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQA 82
EA+ GF + L H DS K+ T +Q L A+ R RL + ++ +
Sbjct: 35 EAKVTGFQIMLEHVDSGKN------LTKFQLLERAIERGSRRLQRLE--AMLNGPSGVET 86
Query: 83 DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
+ + YL+ +SIGTP A+ DTGSDLIWTQC+PC +QC+ Q +P+F+P+ SS+
Sbjct: 87 SVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPC--TQCFNQSTPIFNPQGSSS 144
Query: 143 YKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
+ +LPCSS C +L+ +CS CQY+ YGDGS + G++ TET+T GS V++P I
Sbjct: 145 FSTLPCSSQLCQALSSPTCSNNFCQYTYGYGDGSETQGSMGTETLTFGS-----VSIPNI 199
Query: 203 TFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
TFGCG NN G G+VG+G G +SL SQ+ T
Sbjct: 200 TFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVT 235
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 60/126 (47%), Gaps = 3/126 (2%)
Query: 249 TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY---SFNSLSQVPE 305
T I+IDSGTTLT+ ++ S I V + +LC+ S S Q+P
Sbjct: 309 TGGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVNGSSSGFDLCFQTPSDPSNLQIPT 368
Query: 306 VTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
+HF G D++L N+F+ S ++C + + I+GNI Q N LV YD VS
Sbjct: 369 FVMHFDGGDLELPSENYFISPSNGLICLAMGSSSQGMSIFGNIQQQNMLVVYDTGNSVVS 428
Query: 366 FKPTDC 371
F C
Sbjct: 429 FASAQC 434
>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
Length = 481
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 125/419 (29%), Positives = 183/419 (43%), Gaps = 84/419 (20%)
Query: 33 LIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ-------NSSISSSKASQADII 85
++HR P SP P + L R +R++ ++ +++ S AS+ +
Sbjct: 68 VVHRHGPCSPLQARGGEPSHA--EILDRDQDRVDSIHRLAAARPSSTADDPSSASKGVSL 125
Query: 86 P-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
P ANY++ + +GTP + L V DTGSDL W QC+PC CY Q PLFDP
Sbjct: 126 PARRGVPLGTANYIVSVGLGTPKRDLLVVFDTGSDLSWVQCKPC--DGCYQQHDPLFDPS 183
Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
S+TY ++PC + +C L+ SCS C+Y V YGD S ++GNLA +T+TLG ++ + +
Sbjct: 184 QSTTYSAVPCGAQECRRLDSGSCSSGKCRYEVVYGDMSQTDGNLARDTLTLGPSSSSSSS 243
Query: 199 --LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR--------------TTIAGN 242
L FGCG ++ GLF K G+ GLG +SL SQ +T G
Sbjct: 244 DQLQEFVFGCGDDDTGLFG-KADGLFGLGRDRVSLASQAAAKYGAGFSYCLPSSSTAEGY 302
Query: 243 QRLGVSTP------------------------------------------DIVIDSGTTL 260
LG + P VIDSGT +
Sbjct: 303 LSLGSAAPPNARFTAMVTRSDTPSFYYLNLVGIKVAGRTVRVSPAVFRTPGTVIDSGTVI 362
Query: 261 TFLPQGYNSNLLSVMSSMIE--AQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-GADV 315
T LP + L S + ++ + A L+ CY F + Q+P V + F GA +
Sbjct: 363 TRLPSRAYAALRSSFAGLMRRYSYKRAPALSILDTCYDFTGRNKVQIPSVALLFDGGATL 422
Query: 316 KLSRSNFFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
L ++ C F G S+ I GN+ Q F V YD+ Q + F C+
Sbjct: 423 NLGFGEVLYVANKSQACLAFASNGDDTSIAILGNMQQKTFAVVYDVANQKIGFGAKGCS 481
>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
Length = 437
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 88/216 (40%), Positives = 127/216 (58%), Gaps = 15/216 (6%)
Query: 23 EAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQA 82
E + GF + L H DS K+ T ++ L A+ R RL + ++ +
Sbjct: 35 EPKVAGFQIMLEHVDSGKN------LTKFELLERAVERGSRRLQRLE--AMLNGPSGVET 86
Query: 83 DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
+ + YL+ +SIGTP A+ DTGSDLIWTQC+PC +QC+ Q +P+F+P+ SS+
Sbjct: 87 PVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPC--TQCFNQSTPIFNPQGSSS 144
Query: 143 YKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
+ +LPCSS C +L +CS +CQY+ YGDGS + G++ TET+T GS V++P I
Sbjct: 145 FSTLPCSSQLCQALQSPTCSNNSCQYTYGYGDGSETQGSMGTETLTFGS-----VSIPNI 199
Query: 203 TFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
TFGCG NN G G+VG+G G +SL SQ+ T
Sbjct: 200 TFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVT 235
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 41/126 (32%), Positives = 58/126 (46%), Gaps = 3/126 (2%)
Query: 249 TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY---SFNSLSQVPE 305
T I+IDSGTTLT+ + S + V + +LC+ S S Q+P
Sbjct: 309 TGGIIIDSGTTLTYFADNAYQAVRQAFISQMNLSVVNGSSSGFDLCFQMPSDQSNLQIPT 368
Query: 306 VTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
+HF G D+ L N+F+ S ++C + + I+GNI Q N LV YD VS
Sbjct: 369 FVMHFDGGDLVLPSENYFISPSNGLICLAMGSSSQGMSIFGNIQQQNLLVVYDTGNSVVS 428
Query: 366 FKPTDC 371
F C
Sbjct: 429 FLFAQC 434
>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
Length = 437
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 88/216 (40%), Positives = 127/216 (58%), Gaps = 15/216 (6%)
Query: 23 EAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQA 82
E + GF + L H DS K+ T ++ L A+ R RL + ++ +
Sbjct: 35 EPKVAGFQIMLEHVDSGKN------LTKFELLERAVERGSRRLQRLE--AMLNGPSGVET 86
Query: 83 DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
+ + YL+ +SIGTP A+ DTGSDLIWTQC+PC +QC+ Q +P+F+P+ SS+
Sbjct: 87 PVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPC--TQCFNQSTPIFNPQGSSS 144
Query: 143 YKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
+ +LPCSS C +L +CS +CQY+ YGDGS + G++ TET+T GS V++P I
Sbjct: 145 FSTLPCSSQLCQALQSPTCSNNSCQYTYGYGDGSETQGSMGTETLTFGS-----VSIPNI 199
Query: 203 TFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
TFGCG NN G G+VG+G G +SL SQ+ T
Sbjct: 200 TFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVT 235
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 41/126 (32%), Positives = 58/126 (46%), Gaps = 3/126 (2%)
Query: 249 TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY---SFNSLSQVPE 305
T I+IDSGTTLT+ + S + V + +LC+ S S Q+P
Sbjct: 309 TGGIIIDSGTTLTYFVDNAYQAVRQAFISQMNLSVVNGSSSGFDLCFQMPSDQSNLQIPT 368
Query: 306 VTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
+HF G D+ L N+F+ S ++C + + I+GNI Q N LV YD VS
Sbjct: 369 FVMHFDGGDLVLPSENYFISPSNGLICLAMGSSSQGMSIFGNIQQQNLLVVYDTGNSVVS 428
Query: 366 FKPTDC 371
F C
Sbjct: 429 FLSAQC 434
>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 461
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 132/427 (30%), Positives = 203/427 (47%), Gaps = 91/427 (21%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSIS-SSKASQADIIP 86
GF + L H DS K+ T Q+++ + R +RLN + ++ +SK + I
Sbjct: 44 GFRLSLRHVDSGKNL------TKIQKIQRGINRGFHRLNRLGAVAVLAVASKPDDTNNIK 97
Query: 87 -----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
+ +L+ +SIG P + A+ DTGSDLIWTQC+PC ++C+ Q +P+FDP+ SS
Sbjct: 98 APTHGGSGEFLMELSIGNPAVKYSAIVDTGSDLIWTQCKPC--TECFDQPTPIFDPEKSS 155
Query: 142 TYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTL-------GST 192
+Y + CSS C +L + +C+ C+Y +YGD S + G LATET T G
Sbjct: 156 SYSKVGCSSGLCNALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDENSISGIG 215
Query: 193 TGQAVALPGITFGCGTNNGGL---------------------------------FNSKTT 219
G V G F G+ GL S +
Sbjct: 216 FGCGVENEGDGFSQGSGLVGLGRGPLSLISQLKETKFSYCLTSIEDSEASSSLFIGSLAS 275
Query: 220 GIVGLGGGDI--------SLIS----------QMRTTIAGNQRLGVS----------TPD 251
GIV G + SL+ +++ G +RL V T
Sbjct: 276 GIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGG 335
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ---VPEV 306
++IDSGTT+T+L + L +S + + PV D +GS L+LC+ ++ VP++
Sbjct: 336 MIIDSGTTITYLEETAFKVLKEEFTSRM-SLPV-DDSGSTGLDLCFKLPDAAKNIAVPKM 393
Query: 307 TIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
HF+GAD++L N+ V S V + G +N + I+GN+ Q NF V +D+E++TVSF
Sbjct: 394 IFHFKGADLELPGENYMVADSSTGVLCLAMGSSNGMSIFGNVQQQNFNVLHDLEKETVSF 453
Query: 367 KPTDCTK 373
PT+C K
Sbjct: 454 VPTECGK 460
>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
Length = 466
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 133/425 (31%), Positives = 189/425 (44%), Gaps = 84/425 (19%)
Query: 22 IEAQTGGFSVELIHRDSPKSPF-YNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS 80
+++ TG +V L HR P SP T +RL R+ F+ S +
Sbjct: 51 VKSSTGAATVPLHHRHGPCSPLPTKKMPTLEERLHRDQLRAAYIQRKFSGGGVNGSRGGA 110
Query: 81 QADIIPNNA-------------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
D+ ++A YLI + +G+P + + DTGSD+ W QC+PC SQC
Sbjct: 111 -GDVQQSHATVPTTLGTSLDTLEYLITVRLGSPGKSQTMLIDTGSDVSWVQCKPC--SQC 167
Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKS--CSGVNCQYSVSYGDGSFSNGNLATE 185
+ Q PLFDP SSTY CSS+ CA L Q+ CS CQY+V+YGDGS + G +++
Sbjct: 168 HSQADPLFDPSSSSTYSPFSCSSAACAQLGQEGNGCSSSQCQYTVTYGDGSSTTGTYSSD 227
Query: 186 TVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA----- 240
T+ LGS A+ FGC G FN +T G++GLGGG SL+SQ T
Sbjct: 228 TLALGSN-----AVRKFQFGCSNVESG-FNDQTDGLMGLGGGAQSLVSQTAGTFGAAFSY 281
Query: 241 ---------------------------------------------GNQRLGVST----PD 251
G ++L + T
Sbjct: 282 CLPATSSSSGFLTLGAGTSGFVKTPMLRSSQVPTFYGVRIQAIRVGGRQLSIPTSVFSAG 341
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
++DSGT LT LP S L S + ++ P A P+G L+ C+ F+ S V P V +
Sbjct: 342 TIMDSGTVLTRLPPTAYSALSSAFKAGMKQYPSAPPSGILDTCFDFSGQSSVSIPTVALV 401
Query: 310 FR-GADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSF 366
F GA V ++ ++ S I+C F ++ S+ I GN+ Q F V YD+ V F
Sbjct: 402 FSGGAVVDIASDGIMLQTSNSILCLAFAANSDDSSLGIIGNVQQRTFEVLYDVGGGAVGF 461
Query: 367 KPTDC 371
K C
Sbjct: 462 KAGAC 466
>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
Length = 443
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 111/359 (30%), Positives = 162/359 (45%), Gaps = 78/359 (21%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
YL+R+++GTP DTGSDL+WTQC PC C+ QD P+ DP SSTY +LPC
Sbjct: 83 EYLVRLAVGTPRRPVALTLDTGSDLVWTQCAPC--RDCFDQDLPVLDPAASSTYAALPCG 140
Query: 150 SSQCASLNQKSCSGVN-------CQYSVSYGDGSFSNGNLATETVTLGST--TGQAVALP 200
+++C +L SC GV C Y+ YGD S + G +AT+ T G + +G+++
Sbjct: 141 AARCRALPFTSC-GVRTLGNHRSCIYAYHYGDKSLTVGEIATDRFTFGDSGGSGESLHTR 199
Query: 201 GITFGCGTNNGGLFNSKTTGIVG---------------------------------LGGG 227
+TFGCG N G+F S TGI G LGG
Sbjct: 200 RLTFGCGHLNKGVFQSNETGIAGFGRGRWSLPSQLNVTSFSYCFTSMFESKSSLVTLGGS 259
Query: 228 DISLIS-----QMRTT-------------------IAGNQRLGVSTPDI---VIDSGTTL 260
+L S ++RTT G RL V +IDSG ++
Sbjct: 260 PAALYSHAHSGEVRTTPILKNPSQPSLYFLSLKGISVGKTRLPVPETKFRSTIIDSGASI 319
Query: 261 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-----QVPEVTIHFRGADV 315
T LP+ + + ++ + P +L+LC++ + VP +T+H GAD
Sbjct: 320 TTLPEEVYEAVKAEFAAQVGLPPSGVEGSALDLCFALPVTALWRRPAVPSLTLHLEGADW 379
Query: 316 KLSRSNF-FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
+L RSN+ F + ++C V + GN Q N V YD+E +SF P C +
Sbjct: 380 ELPRSNYVFEDLGARVMCIVLDAAPGEQTVIGNFQQQNTHVVYDLENDRLSFAPARCDR 438
>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 462
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 132/429 (30%), Positives = 205/429 (47%), Gaps = 95/429 (22%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
GF + L H DS K+ T Q+++ + R +RLN + ++ AS D N
Sbjct: 45 GFRLSLRHVDSGKNL------TKIQKIQRGINRGFHRLNRLGAVAVLAV--ASNPDDTNN 96
Query: 88 --------NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
+ +L+ +SIG P + A+ DTGSDLIWTQC+PC ++C+ Q +P+FDP+
Sbjct: 97 IKAPTHGGSGEFLMELSIGNPAVKYAAIVDTGSDLIWTQCKPC--TECFDQPTPIFDPEK 154
Query: 140 SSTYKSLPCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATETVTL-------G 190
SS+Y + CSS C +L + +C+ +C+Y +YGD S + G LATET T G
Sbjct: 155 SSSYSKVGCSSGLCNALPRSNCNEDKDSCEYLYTYGDYSSTRGLLATETFTFEDENSISG 214
Query: 191 STTGQAVALPGITFGCGTNNGGL---------------------------------FNSK 217
G V G F G+ GL S
Sbjct: 215 IGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQLKETKFSYCLTSIEDSEASSSLFIGSL 274
Query: 218 TTGIVGLGGGDI--------SLIS----------QMRTTIAGNQRLGVS----------T 249
+GIV G ++ SL+ +++ G +RL V T
Sbjct: 275 ASGIVNKTGANLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELSEDGT 334
Query: 250 PDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ---VP 304
++IDSGTT+T+L + L +S + + PV D +GS L+LC+ + ++ VP
Sbjct: 335 GGMIIDSGTTITYLEETAFKVLKEEFTSRM-SLPV-DDSGSTGLDLCFKLPNAAKNIAVP 392
Query: 305 EVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 364
++ HF+GAD++L N+ V S V + G +N + I+GN+ Q NF V +D+E++TV
Sbjct: 393 KLIFHFKGADLELPGENYMVADSSTGVLCLAMGSSNGMSIFGNVQQQNFNVLHDLEKETV 452
Query: 365 SFKPTDCTK 373
+F PT+C K
Sbjct: 453 TFVPTECGK 461
>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
Length = 350
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 110/342 (32%), Positives = 156/342 (45%), Gaps = 65/342 (19%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
ANY+I + GTP + + DTGS++ W QC+PC S CY Q PLFDP +SSTY+++ C
Sbjct: 14 ANYVITVGFGTPKKNQTVIFDTGSNVNWIQCKPCVVS-CYPQQEPLFDPTLSSTYRNISC 72
Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
+S+ C L+ + CSG C Y V+YGDGS + G LATET TL + FGCG
Sbjct: 73 TSAACTGLSSRGCSGSTCVYGVTYGDGSSTVGFLATETFTLAAGN----VFNNFIFGCGQ 128
Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMRTTIA---------------------------- 240
NN GLF + G++GLG SL SQ+ T++
Sbjct: 129 NNQGLF-TGAAGLIGLGRSPYSLNSQLATSLGNIFSYCLPSTSSATGYLNIGNPLRTPGY 187
Query: 241 ----------------------GNQRLGVSTP-----DIVIDSGTTLTFLPQGYNSNLLS 273
G RL +S+ +IDSGT +T LP L +
Sbjct: 188 TAMLTNSRAPTLYFIDLIGISVGGTRLALSSTVFQSVGTIIDSGTVITRLPPTAYGALRT 247
Query: 274 VMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRGADVKLSRSNFFVKVSEDIV 331
+ + A L+ CY F+ + V P + +H+ G DV + + F +S V
Sbjct: 248 AFRAAMTQYTRAAAASILDTCYDFSRTTTVTFPTIKLHYTGLDVTIPGAGVFYVISSSQV 307
Query: 332 CSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
C F G ++S + I GN+ Q V YD + + F C
Sbjct: 308 CLAFAGNSDSTQIGIIGNVQQRTMEVTYDNALKRIGFAAGAC 349
>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 516
Score = 161 bits (407), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 114/348 (32%), Positives = 155/348 (44%), Gaps = 69/348 (19%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
NY++ + +GTP + V DTGSD W QC+PC CY Q LFDP SSTY ++
Sbjct: 175 GTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCV-VVCYEQREKLFDPARSSTYANV 233
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C++ C+ L+ + CSG +C Y V YGDGS+S G A +T+TL S A+ G FGC
Sbjct: 234 SCAAPACSDLDTRGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD----AVKGFRFGC 289
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQ--------------MRTTIAGNQRLGVSTP-- 250
G N GLF + G++GLG G SL Q R+T G G +P
Sbjct: 290 GERNEGLFG-EAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTGTGYLDFGAGSPAA 348
Query: 251 ----------------------------------------DIVIDSGTTLTFLPQGYNSN 270
++DSGT +T LP S+
Sbjct: 349 RLTTTPMLVDNGPTFYYVGLTGIRVGGRLLYIPQSVFATAGTIVDSGTVITRLPPAAYSS 408
Query: 271 LLSVMSSMIEAQ--PVADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFVK 325
L S ++ + A+ A L+ CY F +SQV P V++ F+ GA + + S
Sbjct: 409 LRSAFAAAMSARGYKKAPAVSLLDTCYDFAGMSQVAIPTVSLLFQGGARLDVDASGIMYA 468
Query: 326 VSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
S VC F + V I GN F V YDI ++ VSF P C
Sbjct: 469 ASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVSFSPGAC 516
>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 463
Score = 160 bits (406), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 126/429 (29%), Positives = 186/429 (43%), Gaps = 89/429 (20%)
Query: 22 IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFN------------ 69
I + G +V L HR P SP +S + P + + L R R H
Sbjct: 45 ISSSLSGTTVALNHRHGPCSPVPSSKKRPTEE--ELLKRDQLRAEHIQRKFAMNAAVDGA 102
Query: 70 ---QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
Q S +SSS ++ + Y+I + +GTP + DTGSD+ W QC PCP
Sbjct: 103 GDLQQSKVSSSVPTKLGSSLDTLEYVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPP 162
Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK--SCSGVN--CQYSVSYGDGSFSNGNL 182
CY Q LFDP SSTY+++ C++++CA L Q+ C N CQY V YGDGS +NG
Sbjct: 163 CYAQTGALFDPAKSSTYRAVSCAAAECAQLEQQGNGCGATNYECQYGVQYGDGSTTNGTY 222
Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ-------- 234
+ +T+TL +G + A+ G FGC G F+ +T G++GLGGG SL+SQ
Sbjct: 223 SRDTLTL---SGASDAVKGFQFGCSHVESG-FSDQTDGLMGLGGGAQSLVSQTAAAYGNS 278
Query: 235 --------------------------------------------MRTTIAGNQRLGVSTP 250
++ G ++LG+S P
Sbjct: 279 FSYCLPPTSGSSGFLTLGGGGGVSGFVTTRMLRSRQIPTFYGARLQDIAVGGKQLGLS-P 337
Query: 251 DI-----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--V 303
+ V+DSGT +T LP S L S + ++ A L+ C+ F +Q +
Sbjct: 338 SVFAAGSVVDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSILDTCFDFAGQTQISI 397
Query: 304 PEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 362
P V + F GA + L + + + G + I GN+ Q F V YD+
Sbjct: 398 PTVALVFSGGAAIDLDPNGIMYG---NCLAFAATGDDGTTGIIGNVQQRTFEVLYDVGSS 454
Query: 363 TVSFKPTDC 371
T+ F+ C
Sbjct: 455 TLGFRSGAC 463
>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 97/225 (43%), Positives = 132/225 (58%), Gaps = 9/225 (4%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
GF+ L RDSP SP +N S + Y L DA RS +R + + S+ ++ IIP+
Sbjct: 27 GFTTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRSFSRSATLLTHLTSVSTACIRSPIIPD 86
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ +L+ I IGTPP +A+ADTGSDL WTQC PC +C+ Q P+F+P+ SS+Y+ +
Sbjct: 87 SGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPC--RECFNQSQPIFNPRRSSSYRKVS 144
Query: 148 CSSSQCASLNQKSCSG--VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
C+S C SL C +C Y SYGD SF+ G+LA++ +T+GS LP G
Sbjct: 145 CASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGDLASDQITIGS-----FKLPKTVIG 199
Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTP 250
CG NGG F T+GI+GLGGG +SL+SQMRT R P
Sbjct: 200 CGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLP 244
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 55/124 (44%), Positives = 78/124 (62%), Gaps = 4/124 (3%)
Query: 251 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTI 308
+I+IDSGTTLT LP+ + S ++ +I+A+ V DP+G LELCYS + +P +T
Sbjct: 311 NIIIDSGTTLTLLPRSLYYGVFSTLARVIKAKRVDDPSGILELCYSAGQVDDLNIPIITA 370
Query: 309 HFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
HF GADVKL N F V++++ C F T V I+GN+ Q NF VGYD+ + +SF+
Sbjct: 371 HFAGGADVKLLPVNTFAPVADNVTCLTFAPATQ-VAIFGNLAQINFEVGYDLGNKRLSFE 429
Query: 368 PTDC 371
P C
Sbjct: 430 PKLC 433
>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
Length = 464
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 128/399 (32%), Positives = 173/399 (43%), Gaps = 80/399 (20%)
Query: 32 ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANY 91
E+I RD + E+ Y +L S N N ++ + S+ +++ I + NY
Sbjct: 87 EIIRRDQARV------ESIYSKL------SKNSANEVSE--AKSTELPAKSGITLGSGNY 132
Query: 92 LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
++ I IGTP + V DTGSDL WTQCEPC S CY Q P F+P SSTY+++ CSS
Sbjct: 133 IVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGS-CYSQKEPKFNPSSSSTYQNVSCSSP 191
Query: 152 QCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNG 211
C + +SCS NC YS+ YGD SF+ G LA E TL ++ L + FGCG NN
Sbjct: 192 MCE--DAESCSASNCVYSIGYGDKSFTQGFLAKEKFTLTNSD----VLEDVYFGCGENNQ 245
Query: 212 GLFN-----------------------------------SKTTGIVGLGGGDISL----- 231
GLF+ S +TG + G IS
Sbjct: 246 GLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTSNSTGHLTFGSAGISESVKFT 305
Query: 232 -ISQMRTTI----------AGNQRLGV-----STPDIVIDSGTTLTFLPQGYNSNLLSVM 275
IS + G++ L + ST +IDSGT T LP + L SV
Sbjct: 306 PISSFPSAFNYGIDIIGISVGDKELAITPNSFSTEGAIIDSGTVFTRLPTKVYAELRSVF 365
Query: 276 SSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR---GADVKLSRSNFFVKVSEDIVC 332
+ + G + CY F L V TI F G V+L S + + VC
Sbjct: 366 KEKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGGTVVELDGSGISLPIKISQVC 425
Query: 333 SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
F G + I+GN+ QT V YD+ V F P C
Sbjct: 426 LAFAGNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464
>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
Length = 463
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 125/429 (29%), Positives = 186/429 (43%), Gaps = 89/429 (20%)
Query: 22 IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFN------------ 69
I + G +V L HR P SP +S + P + + L R R H
Sbjct: 45 ISSSLSGTTVALNHRHGPCSPVPSSKKRPTEE--ELLKRDQLRAEHIQRKFAMNAAVDGA 102
Query: 70 ---QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
Q S +SSS ++ + Y+I + +GTP + DTGSD+ W QC PCP
Sbjct: 103 GDLQQSKVSSSVPTKLGSSLDTLEYVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPP 162
Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK--SCSGVN--CQYSVSYGDGSFSNGNL 182
C+ Q LFDP SSTY+++ C++++CA L Q+ C N CQY V YGDGS +NG
Sbjct: 163 CHAQTGALFDPAKSSTYRAVSCAAAECAQLEQQGNGCGATNYECQYGVQYGDGSTTNGTY 222
Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ-------- 234
+ +T+TL +G + A+ G FGC G F+ +T G++GLGGG SL+SQ
Sbjct: 223 SRDTLTL---SGASDAVKGFQFGCSHLESG-FSDQTDGLMGLGGGAQSLVSQTAAAYGNS 278
Query: 235 --------------------------------------------MRTTIAGNQRLGVSTP 250
++ G ++LG+S P
Sbjct: 279 FSYCLPPTSGSSGFLTLGGGGGASGFVTTRMLRSKQIPTFYGARLQDIAVGGKQLGLS-P 337
Query: 251 DI-----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--V 303
+ V+DSGT +T LP S L S + ++ A L+ C+ F +Q +
Sbjct: 338 SVFAAGSVVDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSILDTCFDFAGQTQISI 397
Query: 304 PEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 362
P V + F GA + L + + + G + I GN+ Q F V YD+
Sbjct: 398 PTVALVFSGGAAIDLDPNGIMYG---NCLAFAATGDDGTTGIIGNVQQRTFEVLYDVGSS 454
Query: 363 TVSFKPTDC 371
T+ F+ C
Sbjct: 455 TLGFRSGAC 463
>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
Length = 466
Score = 158 bits (400), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 131/420 (31%), Positives = 186/420 (44%), Gaps = 85/420 (20%)
Query: 26 TGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSS---KASQA 82
+GG +V L HR P SP S++ P L + L R R + + S + + S A
Sbjct: 58 SGGITVPLHHRHGPCSPV-PSNKMP-ASLEERLQRDQLRAAYIKRKFSGAKGGDVEQSDA 115
Query: 83 DIIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
+P + Y+I + IG+P + DTGSD+ W QC+PC SQC+ + LF
Sbjct: 116 ATVPTTLGTSLSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPC--SQCHSEVDSLF 173
Query: 136 DPKMSSTYKSLPCSSSQCASLNQK----SCSGVNCQYSVSYGDGSFSNGNLATETVTLGS 191
DP SSTY CSS+ C L+Q CS CQY VSY DGS + G +++T+TLGS
Sbjct: 174 DPSASSTYSPFSCSSAACVQLSQSQQGNGCSSSQCQYIVSYVDGSSTTGTYSSDTLTLGS 233
Query: 192 TTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA----------- 240
A+ G FGC + G F+ +T G++GLGG SL+SQ T
Sbjct: 234 N-----AIKGFQFGCSQSESGGFSDQTDGLMGLGGDAQSLVSQTAGTFGKAFSYCLPPTP 288
Query: 241 ----------------------------------------GNQRLGVST----PDIVIDS 256
G Q+L + T V+DS
Sbjct: 289 GSSGFLTLGAASRSGFVKTPMLRSTQIPTYYGVLLEAIRVGGQQLNIPTSVFSAGSVMDS 348
Query: 257 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR-GA 313
GT +T LP S L S + ++ P A P+G L+ C+ F+ S V P V + F GA
Sbjct: 349 GTVITRLPPTAYSALSSAFKAGMKKYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGA 408
Query: 314 DVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
V L + +++ D C F ++ S+ GN+ Q F V YD+ V F+ C
Sbjct: 409 VVNLDFNGIMLEL--DNWCLAFAANSDDSSLGFIGNVQQRTFEVLYDVGGGAVGFRAGAC 466
>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 484
Score = 158 bits (399), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 129/418 (30%), Positives = 181/418 (43%), Gaps = 82/418 (19%)
Query: 30 SVELIHRDSPKSPFYNSSETP--YQRLRDALTR--SLNRLNHFNQNSSISSSKASQADII 85
++ ++HR P SP P + L D R S++R + + ++ + +
Sbjct: 74 ALNVVHRQGPCSPLQARGAPPPHAELLNDDQARVDSIHRKIAAAASPVLDQARGKKGVTL 133
Query: 86 P-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
P NY++ + +GTP + V DTGSDL W QC PC S CY Q PLFDP
Sbjct: 134 PAQRGISLGTGNYVVSMGLGTPARDMTVVFDTGSDLSWVQCTPC--SDCYEQKDPLFDPA 191
Query: 139 MSSTYKSLPCSSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
SSTY ++PC+S +C L+ +SCS C+Y V YGD S ++G LA +T+TL Q+
Sbjct: 192 RSSTYSAVPCASPECQGLDSRSCSRDKKCRYEVVYGDQSQTDGALARDTLTL----TQSD 247
Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRT--------------TIAGNQ 243
LPG FGCG + GLF + G+VGLG +SL SQ + + AG
Sbjct: 248 VLPGFVFGCGEQDTGLFG-RADGLVGLGREKVSLSSQAASKYGAGFSYCLPSSPSAAGYL 306
Query: 244 RLGVSTP------------------------------------------DIVIDSGTTLT 261
LG P VIDSGT +T
Sbjct: 307 SLGGPAPANARFTAMETRHDSPSFYYVRLVGVKVAGRTVRVSPIVFSAAGTVIDSGTVIT 366
Query: 262 FLPQGYNSNLLSVMS-SMIEAQPVADPTGS-LELCYSF--NSLSQVPEVTIHFR-GADVK 316
LP + L S + SM P S L+ CY F ++ ++P V + F GA V
Sbjct: 367 RLPPRVYAALRSAFARSMGRYGYKRAPALSILDTCYDFTGHTTVRIPSVALVFAGGAAVG 426
Query: 317 LSRSN--FFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
L S + KVS+ + G I GN Q V YD+ +Q + F C+
Sbjct: 427 LDFSGVLYVAKVSQACLAFAPNGDGADAGIIGNTQQKTLAVVYDVARQKIGFGANGCS 484
>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
thaliana]
gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 464
Score = 158 bits (399), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 127/399 (31%), Positives = 174/399 (43%), Gaps = 80/399 (20%)
Query: 32 ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANY 91
E+I RD + E+ Y +L S N N ++ + S+ +++ I + NY
Sbjct: 87 EIIRRDQARV------ESIYSKL------SKNSANEVSE--AKSTELPAKSGITLGSGNY 132
Query: 92 LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
++ I IGTP + V DTGSDL WTQCEPC S CY Q P F+P SSTY+++ CSS
Sbjct: 133 IVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGS-CYSQKEPKFNPSSSSTYQNVSCSSP 191
Query: 152 QCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNG 211
C + +SCS NC YS+ YGD SF+ G LA E TL ++ L + FGCG NN
Sbjct: 192 MCE--DAESCSASNCVYSIVYGDKSFTQGFLAKEKFTLTNSD----VLEDVYFGCGENNQ 245
Query: 212 GLFN-----------------------------------SKTTGIVGLGGGDISL----- 231
GLF+ S +TG + G IS
Sbjct: 246 GLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTSNSTGHLTFGSAGISESVKFT 305
Query: 232 -ISQMRTTI----------AGNQRLGV-----STPDIVIDSGTTLTFLPQGYNSNLLSVM 275
IS + G++ L + ST +IDSGT T LP + L SV
Sbjct: 306 PISSFPSAFNYGIDIIGISVGDKELAITPNSFSTEGAIIDSGTVFTRLPTKVYAELRSVF 365
Query: 276 SSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRGAD-VKLSRSNFFVKVSEDIVC 332
+ + G + CY F L V P + F G+ V+L S + + VC
Sbjct: 366 KEKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGSTVVELDGSGISLPIKISQVC 425
Query: 333 SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
F G + I+GN+ QT V YD+ V F P C
Sbjct: 426 LAFAGNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464
>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 458
Score = 158 bits (399), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 134/426 (31%), Positives = 185/426 (43%), Gaps = 90/426 (21%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRL---RDALTRSLN-RLNHFNQNSSISSSKASQAD 83
G +EL H SP SP ++ P+ + DA SL RL + S + A
Sbjct: 42 GLHLELHHPRSPCSPAPVPADLPFTAVLTHDDARISSLAARLAKTPSARATSLDADADAG 101
Query: 84 IIPNNA-------------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
+ + A NY+ R+ +GTP T+ + V DTGS L W QC PC S C+ Q
Sbjct: 102 LAGSLASVPLSPGASVGVGNYVTRMGLGTPATQYVMVVDTGSSLTWLQCSPCLVS-CHRQ 160
Query: 131 DSPLFDPKMSSTYKSLPCSSSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLAT 184
P+F+PK SSTY S+ CS+ QC A+LN +CS N C Y SYGD SFS G L+
Sbjct: 161 SGPVFNPKSSSTYASVGCSAQQCSDLPSATLNPSACSSSNVCIYQASYGDSSFSVGYLSK 220
Query: 185 ETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR-------- 236
+TV+ GST+ LP +GCG +N GLF ++ G++GL +SL+ Q+
Sbjct: 221 DTVSFGSTS-----LPNFYYGCGQDNEGLFG-RSAGLIGLARNKLSLLYQLAPSLGYSFT 274
Query: 237 -------------------------------------------TTIAGN-----QRLGVS 248
T+AGN S
Sbjct: 275 YCLPSSSSSGYLSLGSYNPGQYSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSS 334
Query: 249 TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN-SLSQVPEVT 307
P I+ DSGT +T LP S L +++ ++ A L+ C+ S P VT
Sbjct: 335 LPTII-DSGTVITRLPTSVYSALSKAVAAAMKGTSRASAYSILDTCFKGQASRVSAPAVT 393
Query: 308 IHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
+ F GA +KLS N V V + C F S I GN Q F V YD++ + F
Sbjct: 394 MSFAGGAALKLSAQNLLVDVDDSTTCLAF-APARSAAIIGNTQQQTFSVVYDVKSSRIGF 452
Query: 367 KPTDCT 372
C+
Sbjct: 453 AAGGCS 458
>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 374
Score = 157 bits (398), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 76/173 (43%), Positives = 112/173 (64%), Gaps = 3/173 (1%)
Query: 72 SSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQD 131
S++ + + Q+ I +YL+ +SIGTPP + +ADTGSDL WT C PC ++CY Q
Sbjct: 6 SAMEKTVSPQSPIYAYLGHYLMEVSIGTPPFKIYGIADTGSDLTWTSCVPC--NKCYKQR 63
Query: 132 SPLFDPKMSSTYKSLPCSSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNLATETVTLG 190
+P+FDP+ S++Y+++ C S C L+ CS +C Y+ +Y + + G LA ET+TL
Sbjct: 64 NPIFDPQKSTSYRNISCDSKLCHKLDTGVCSPQKHCNYTYAYASAAITQGVLAQETITLS 123
Query: 191 STTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQ 243
ST G++V L GI FGCG NN G FN + GI+GLGGG +S ISQ+ ++ G +
Sbjct: 124 STKGESVPLKGIVFGCGHNNTGGFNDREMGIIGLGGGPVSFISQIGSSFGGKR 176
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 74/135 (54%), Gaps = 1/135 (0%)
Query: 241 GNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA-DPTGSLELCYSFNS 299
G+ V ++ +DSGT T LP L++ + S + +PV D +LCY +
Sbjct: 239 GSSSQSVEKGNVFLDSGTPPTILPTQLYDRLVAQVRSEVAMKPVTNDLDLGPQLCYRTKN 298
Query: 300 LSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 359
+ P +T HF G DVKL + FV + + C F ++ +YGN Q+N+L+G+D+
Sbjct: 299 NLRGPVLTAHFEGGDVKLLPTQTFVSPKDGVFCLGFTNTSSDGGVYGNFAQSNYLIGFDL 358
Query: 360 EQQTVSFKPTDCTKQ 374
++Q VSFKP DCTK
Sbjct: 359 DRQVVSFKPMDCTKH 373
>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
Length = 449
Score = 157 bits (398), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 92/213 (43%), Positives = 139/213 (65%), Gaps = 17/213 (7%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN 90
++LIHRDSP SP + + T RL+ + R+++R Q+ + Q D++P+
Sbjct: 29 LDLIHRDSPLSPLHTPNLTFSDRLQASFLRAISR-----QSRHVDF----QTDLLPSGGE 79
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y++ +SIGTPP LA+ADTGSDL W Q +PC QCY Q P+FDP S+T+ LPC++
Sbjct: 80 YMMNLSIGTPPFPILAIADTGSDLTWLQSKPC--DQCYPQKGPIFDPSNSTTFHKLPCTT 137
Query: 151 SQCASLNQ--KSCSG-VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
+ C +L++ +SC+ C Y+ SYGD S++ G LA++TVT+G+ +V + + FGCG
Sbjct: 138 APCNALDESARSCTDPTTCGYTYSYGDHSYTTGYLASDTVTVGNA---SVQIRNVAFGCG 194
Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
T NGG F+ + +GIVGLGGG++S +SQ+ TI
Sbjct: 195 TRNGGNFDEQGSGIVGLGGGNLSFVSQLGDTIG 227
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 58/131 (44%), Positives = 81/131 (61%), Gaps = 4/131 (3%)
Query: 247 VSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCY-SFNSLSQVP 304
V +I+IDSGTTLTFL + + L + + I+ + V D S+ LC+ S ++P
Sbjct: 320 VEEGNIIIDSGTTLTFLEEEFYGALEAALVEEIKMERVNDVKNSMFSLCFKSGKEEVELP 379
Query: 305 EVTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 363
+ +HFRG ADV+L N FV+ E +VC TN V IYGN+ Q NF+VGYD+ ++T
Sbjct: 380 LMKVHFRGGADVELKPVNTFVRAEEGLVCFTMLP-TNDVGIYGNLAQMNFVVGYDLGKRT 438
Query: 364 VSFKPTDCTKQ 374
VSF P DC+KQ
Sbjct: 439 VSFLPADCSKQ 449
>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
Length = 515
Score = 157 bits (398), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 115/347 (33%), Positives = 156/347 (44%), Gaps = 68/347 (19%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
NY++ + +GTP + V DTGSD W QC+PC + CY Q LFDP SSTY ++
Sbjct: 175 GTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVA-CYEQREKLFDPASSSTYANV 233
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C++ C+ L+ CSG +C Y V YGDGS+S G A +T+TL S A+ G FGC
Sbjct: 234 SCAAPACSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD----AVKGFRFGC 289
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQ--------------MRTTIAGNQRLGVSTP-- 250
G N GLF + G++GLG G SL Q R+T G G +P
Sbjct: 290 GERNDGLFG-EAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPARSTGTGYLDFGAGSPPA 348
Query: 251 ---------------------------------------DIVIDSGTTLTFLPQGYNSNL 271
++DSGT +T LP S+L
Sbjct: 349 TTTTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAAGTIVDSGTVITRLPPAAYSSL 408
Query: 272 LSVMSSMIEAQPV--ADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFVKV 326
S ++ + A+ A L+ CY F +SQV P V++ F+ GA + + S V
Sbjct: 409 RSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYTV 468
Query: 327 SEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
S VC F G + V I GN F V YDI ++ V F P C
Sbjct: 469 SASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 515
>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 440
Score = 157 bits (397), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 127/421 (30%), Positives = 193/421 (45%), Gaps = 91/421 (21%)
Query: 20 SPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSK 78
P+ TG LIH+DS S YQ L R+ + R R F +
Sbjct: 37 KPLRLVTG-----LIHQDSILSS--------YQSLDRNNVERRRTRRAAF-------ITD 76
Query: 79 ASQADIIPNNAN--YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
QA+++ ++ +L+ S+G PP +L DTGSDL+W QC PC + C+ Q +P+FD
Sbjct: 77 EIQANMVADDRGQAFLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPC--ADCFRQSTPIFD 134
Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
P SSTY L S C + QK + +N C Y+ SY DGS S+GNLATE + ++
Sbjct: 135 PSKSSTYVDLSYDSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQG 194
Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA--------------- 240
V + + FGCG +N G F+ + +GI+GL GD S++S++ + +
Sbjct: 195 TVTVSSVVFGCGHSNRGRFDGQQSGILGLSAGDQSIVSRLGSRFSYCIGDLFDPHYTHNQ 254
Query: 241 -----GNQRLGVSTP-------------------------------------DIVIDSGT 258
G + G STP +V+DSGT
Sbjct: 255 LVLGDGVKMEGSSTPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGT 314
Query: 259 TLTFLPQGYNSNLLSVMSSMIEA--QPVADPTGSLELCYSF---NSLSQVPEVTIHF-RG 312
T TFL + L + + ++ Q V T LCY L PE+ HF G
Sbjct: 315 TATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHFAEG 374
Query: 313 ADVKLSRSNFFVKVSEDIVC-SVFK-GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
AD+ L ++ FV+ ++D+ C +V + + N + G + Q ++ V YD+ + V F+ TD
Sbjct: 375 ADLVLDANSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGKRVYFQRTD 434
Query: 371 C 371
C
Sbjct: 435 C 435
>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
Length = 519
Score = 157 bits (397), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 115/347 (33%), Positives = 156/347 (44%), Gaps = 68/347 (19%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
NY++ + +GTP + V DTGSD W QC+PC + CY Q LFDP SSTY ++
Sbjct: 179 GTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVA-CYEQREKLFDPASSSTYANV 237
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C++ C+ L+ CSG +C Y V YGDGS+S G A +T+TL S A+ G FGC
Sbjct: 238 SCAAPACSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD----AVKGFRFGC 293
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQ--------------MRTTIAGNQRLGVSTP-- 250
G N GLF + G++GLG G SL Q R+T G G +P
Sbjct: 294 GERNDGLFG-EAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPARSTGTGYLDFGAGSPPA 352
Query: 251 ---------------------------------------DIVIDSGTTLTFLPQGYNSNL 271
++DSGT +T LP S+L
Sbjct: 353 TTTTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAAGTIVDSGTVITRLPPAAYSSL 412
Query: 272 LSVMSSMIEAQPV--ADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFVKV 326
S ++ + A+ A L+ CY F +SQV P V++ F+ GA + + S V
Sbjct: 413 RSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYTV 472
Query: 327 SEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
S VC F G + V I GN F V YDI ++ V F P C
Sbjct: 473 SASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 519
>gi|296090179|emb|CBI39998.3| unnamed protein product [Vitis vinifera]
Length = 334
Score = 157 bits (397), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 113/353 (32%), Positives = 151/353 (42%), Gaps = 104/353 (29%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
NN YL++ISIGTPP + + DTGSDL+WTQC PC CY Q +P+FDP S+++K +
Sbjct: 20 NNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPC--LSCYKQKNPMFDPSKSTSFKEV 77
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C S QC L+ T T L I FGC
Sbjct: 78 SCESQQCRLLD--------------------------TPTSILN-----------IVFGC 100
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLG-------------------- 246
G NN G FN G+ G GG +SL SQ+ +T+ ++
Sbjct: 101 GHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKIIFGP 160
Query: 247 ---------VSTPDIVIDSGT----TL-----------------------TFLPQGYNSN 270
VSTP + D T TL F+ G
Sbjct: 161 EAEVSGSDVVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSSSPMATKGNVFIDAGTPPT 220
Query: 271 LL------SVMSSMIEAQP---VADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSN 321
LL ++ + EA P V DP +LCY +L P +T HF GADV+L N
Sbjct: 221 LLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQLCYRSATLIDGPILTAHFDGADVQLKPLN 280
Query: 322 FFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTKQ 374
F+ E + C + I I+GN +Q NFL+G+D++ + VSFK DCTKQ
Sbjct: 281 TFISPKEGVYCFAMQPIDGDTGIFGNFVQMNFLIGFDLDGKKVSFKAVDCTKQ 333
>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
Length = 516
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 115/347 (33%), Positives = 156/347 (44%), Gaps = 68/347 (19%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
NY++ + +GTP + V DTGSD W QC+PC + CY Q LFDP SSTY ++
Sbjct: 176 GTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVA-CYEQREKLFDPASSSTYANV 234
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C++ C+ L+ CSG +C Y V YGDGS+S G A +T+TL S A+ G FGC
Sbjct: 235 SCAAPACSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD----AVKGFRFGC 290
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQM--------------RTTIAGNQRLGVSTP-- 250
G N GLF + G++GLG G SL Q R+T G G +P
Sbjct: 291 GERNDGLFG-EAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPPRSTGTGYLDFGAGSPPA 349
Query: 251 ---------------------------------------DIVIDSGTTLTFLPQGYNSNL 271
++DSGT +T LP S+L
Sbjct: 350 TTTTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAAGTIVDSGTVITRLPPAAYSSL 409
Query: 272 LSVMSSMIEAQPV--ADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFVKV 326
S ++ + A+ A L+ CY F +SQV P V++ F+ GA + + S V
Sbjct: 410 RSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYTV 469
Query: 327 SEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
S VC F G + V I GN F V YDI ++ V F P C
Sbjct: 470 SASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 516
>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
Length = 469
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 135/424 (31%), Positives = 195/424 (45%), Gaps = 85/424 (20%)
Query: 21 PIEAQTGGFSVELIHRDSPKSP-----FYNSSETPYQRLRDALTRSLNRLNHFNQNSSIS 75
P A++ GFS +I R + F ++ ++RL +RS ++++ Q+SS S
Sbjct: 22 PAHAESRGFSGTMIRRGRTDTTTAAINFTQAALESHRRLSFLASRS-SQVDK-PQSSSAS 79
Query: 76 SSKASQADIIP-----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
+ D +P Y + SIGTPP + A+ADTGSDLIWT+C+
Sbjct: 80 QLSNNDTDTVPLRMDGGGGAYDMEFSIGTPPQKLTALADTGSDLIWTKCDAG--GGAAWG 137
Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASLNQKSCS-----GVNCQYSVSYG---DGSFSNGNL 182
S + P SST+ LPCS CA+L S + G C Y +YG D F+ G L
Sbjct: 138 GSSSYHPNASSTFTRLPCSDRLCAALRSYSLARCAAGGAECDYKYAYGLGDDPDFTQGFL 197
Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ-------- 234
+ET TLG A+PG+ FGC T G + + G+VGLG G +SL+SQ
Sbjct: 198 GSETFTLGGD-----AVPGVGFGCTTALEGDYG-EGAGLVGLGRGPLSLVSQLDAGTFMY 251
Query: 235 -------------------MRTTIAGNQRLG---------VSTPDIVI------------ 254
M AG Q G V+ I I
Sbjct: 252 CLTADASKASPLLFGALATMTGAGAGVQSTGLLASTTFYAVNLRSITIGSATTAGVGGPG 311
Query: 255 ----DSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSF-NSLSQVPEVTI 308
DSGTTLT+L + Y + +S PV G E CY +S +P + +
Sbjct: 312 GVVFDSGTTLTYLAEPAYTEAKAAFLSQTTSLTPVEGRYG-FEACYEKPDSARLIPAMVL 370
Query: 309 HFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
HF GAD+ L +N+ V+V + +VC V + + S+ I GNIMQ N+LV +D+ + +SF+
Sbjct: 371 HFDGGADMALPVANYVVEVDDGVVCWVVQ-RSPSLSIIGNIMQMNYLVLHDVRKSVLSFQ 429
Query: 368 PTDC 371
P +C
Sbjct: 430 PANC 433
>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
Length = 408
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 127/421 (30%), Positives = 193/421 (45%), Gaps = 91/421 (21%)
Query: 20 SPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSK 78
P+ TG LIH+DS S YQ L R+ + R R F +
Sbjct: 5 KPLRLVTG-----LIHQDSILSS--------YQSLDRNNVERRRTRRAAF-------ITD 44
Query: 79 ASQADIIPNNAN--YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
QA+++ ++ +L+ S+G PP +L DTGSDL+W QC PC + C+ Q +P+FD
Sbjct: 45 EIQANMVADDRGQAFLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPC--ADCFRQSTPIFD 102
Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
P SSTY L S C + QK + +N C Y+ SY DGS S+GNLATE + ++
Sbjct: 103 PSKSSTYVDLSYDSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQG 162
Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA--------------- 240
V + + FGCG +N G F+ + +GI+GL GD S++S++ + +
Sbjct: 163 TVTVSSVVFGCGHSNRGRFDGQQSGILGLSAGDQSIVSRLGSRFSYCIGDLFDPHYTHNQ 222
Query: 241 -----GNQRLGVSTP-------------------------------------DIVIDSGT 258
G + G STP +V+DSGT
Sbjct: 223 LVLGDGVKMEGSSTPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGT 282
Query: 259 TLTFLPQGYNSNLLSVMSSMIEA--QPVADPTGSLELCYSF---NSLSQVPEVTIHF-RG 312
T TFL + L + + ++ Q V T LCY L PE+ HF G
Sbjct: 283 TATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHFAEG 342
Query: 313 ADVKLSRSNFFVKVSEDIVC-SVFK-GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
AD+ L ++ FV+ ++D+ C +V + + N + G + Q ++ V YD+ + V F+ TD
Sbjct: 343 ADLVLDANSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGKRVYFQRTD 402
Query: 371 C 371
C
Sbjct: 403 C 403
>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 91/210 (43%), Positives = 131/210 (62%), Gaps = 15/210 (7%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS-QADIIP 86
GF V L H DS K+ T +R+R + R NRL + ++SS + +A ++P
Sbjct: 39 GFRVRLKHVDSGKN------LTKLERIRHGVKRGRNRLQRLQAMALVASSSSEIEAPVLP 92
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
N +L++++IGTPP A+ DTGSDLIWTQC+PC +QC+ Q +P+FDPK SS++ L
Sbjct: 93 GNGEFLMKLAIGTPPETYSAILDTGSDLIWTQCKPC--TQCFHQSTPIFDPKKSSSFSKL 150
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
CSS C +L Q SC+ C+Y SYGD S + G LA+ET+T G ++P + FGC
Sbjct: 151 SCSSQLCEALPQSSCNN-GCEYLYSYGDYSSTQGILASETLTFGK-----ASVPNVAFGC 204
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
G +N G S+ G+VGLG G +SL+SQ++
Sbjct: 205 GADNEGSGFSQGAGLVGLGRGPLSLVSQLK 234
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 45/125 (36%), Positives = 77/125 (61%), Gaps = 7/125 (5%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLS---QVPEV 306
++IDSGTT+T+L + + + ++ I PV D +GS L++C++ S S +VP++
Sbjct: 314 LIIDSGTTITYLEESAFNLVAKEFTAKINL-PV-DSSGSTGLDVCFTLPSGSTNIEVPKL 371
Query: 307 TIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
HF GAD++L N+ + S V + G ++ + I+GN+ Q N LV +D+E++T+SF
Sbjct: 372 VFHFDGADLELPAENYMIGDSSMGVACLAMGSSSGMSIFGNVQQQNMLVLHDLEKETLSF 431
Query: 367 KPTDC 371
PT C
Sbjct: 432 LPTQC 436
>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
Length = 460
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 94/216 (43%), Positives = 131/216 (60%), Gaps = 10/216 (4%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
G ++L+ DSP SPF + + +R + A+ RS +RL S+ KA +A +
Sbjct: 54 GLRIDLVRTDSPLSPFSPGNISSTERFKRAIKRSQDRLEKLQM--SVDEVKAVEAPVYAG 111
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
N +L++++IGTP A+ DTGSDL WTQC+PC + CY Q +P++DP SSTY +P
Sbjct: 112 NGEFLMKMAIGTPSLSFSAILDTGSDLTWTQCKPC--TDCYPQPTPIYDPSQSSTYSKVP 169
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
CSSS C +L SCSG NC+Y SYGD S + G L+ E+ TL S +LP I FGCG
Sbjct: 170 CSSSMCQALPMYSCSGANCEYLYSYGDQSSTQGILSYESFTLTSQ-----SLPHIAFGCG 224
Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQ 243
N G S+ G+VG G G +SLISQ+ ++ GN+
Sbjct: 225 QENEGGGFSQGGGLVGFGRGPLSLISQLGQSL-GNK 259
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 50/127 (39%), Positives = 71/127 (55%), Gaps = 6/127 (4%)
Query: 249 TPDIVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCY---SFNSLSQVP 304
T ++IDSGTT+T+L Q GY+ +V+SS+ Q G L+LC+ S +S S P
Sbjct: 333 TGGVIIDSGTTVTYLEQSGYDVVKKAVISSINLPQVDGSNIG-LDLCFEPQSGSSTSHFP 391
Query: 305 EVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 364
+T HF GAD L + N+ S I C +N + I+GNI Q N+ + YD E+ +
Sbjct: 392 TITFHFEGADFNLPKENYIYTDSSGIACLAMLP-SNGMSIFGNIQQQNYQILYDNERNVL 450
Query: 365 SFKPTDC 371
SF PT C
Sbjct: 451 SFAPTVC 457
>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
Length = 408
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 127/421 (30%), Positives = 193/421 (45%), Gaps = 91/421 (21%)
Query: 20 SPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSK 78
P+ TG LIH+DS S YQ L R+ + R R F +
Sbjct: 5 KPLRLVTG-----LIHQDSILSS--------YQSLDRNNVERRRTRRAAFIXDEI----- 46
Query: 79 ASQADIIPNNAN--YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
QA+++ ++ +L+ S+G PP +L DTGSDL+W QC PC + C+ Q +P+FD
Sbjct: 47 --QANMVADDRGQAFLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPC--ADCFRQSTPIFD 102
Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
P SSTY L S C + QK + +N C Y+ SY DGS S+GNLATE + ++
Sbjct: 103 PSKSSTYVDLSYDSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQG 162
Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA--------------- 240
V + + FGCG +N G F+ + +GI+GL GD S++S++ + +
Sbjct: 163 TVTVSSVVFGCGHSNRGRFDGQQSGILGLSAGDQSIVSRLGSRFSYCIGDLFDPHYTHNQ 222
Query: 241 -----GNQRLGVSTP-------------------------------------DIVIDSGT 258
G + G STP +V+DSGT
Sbjct: 223 LVLGDGVKMEGSSTPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGT 282
Query: 259 TLTFLPQGYNSNLLSVMSSMIEA--QPVADPTGSLELCYSF---NSLSQVPEVTIHF-RG 312
T TFL + L + + ++ Q V T LCY L PE+ HF G
Sbjct: 283 TATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHFAEG 342
Query: 313 ADVKLSRSNFFVKVSEDIVC-SVFK-GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
AD+ L ++ FV+ ++D+ C +V + + N + G + Q ++ V YD+ + V F+ TD
Sbjct: 343 ADLVLDANSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGKRVYFQRTD 402
Query: 371 C 371
C
Sbjct: 403 C 403
>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
Length = 461
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 112/366 (30%), Positives = 159/366 (43%), Gaps = 86/366 (23%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
YL+ +++GTPP DTGSDL+WTQC PC C+ Q PL DP SSTY +LPC
Sbjct: 91 EYLVHLAVGTPPRPVALTLDTGSDLVWTQCAPC--RDCFHQGLPLLDPAASSTYAALPCG 148
Query: 150 SSQCASLNQKSCSG----------VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA- 198
+ +C +L SC G +C Y YGD S + G +AT+ T G G +
Sbjct: 149 APRCRALPFTSCGGGGRSSWGNGNRSCAYIYHYGDKSVTVGEIATDRFTFGGDNGDGDSR 208
Query: 199 LP--GITFGCGTNNGGLFNSKTTGIVG--------------------------------- 223
LP +TFGCG N G+F S TGI G
Sbjct: 209 LPTRRLTFGCGHFNKGVFQSNETGIAGFGRGRWSLPSQLNVTTFSYCFTSMFESKSSLVT 268
Query: 224 LGGGDISLI---------SQMRTTI-------------------AGNQRLGVSTPDI--- 252
LGG + + ++RTT G RL V +
Sbjct: 269 LGGAPAAALLYSHAAHISGEVRTTPLLKNPSQPSLYFLSLKGISVGKTRLAVPEAKLRST 328
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS-LELCYSF--NSLSQ---VPEV 306
+IDSG ++T LP+ + + ++ + P GS L+LC++ +L + VP +
Sbjct: 329 IIDSGASITTLPEAVYEAVKAEFAAQVGLPPTGVVEGSALDLCFALPVTALWRRPPVPSL 388
Query: 307 TIHFRGADVKLSRSNF-FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
T+H GAD +L R N+ F ++ ++C V + GN Q N V YD+E +S
Sbjct: 389 TLHLDGADWELPRGNYVFEDLAARVMCVVLDAAPGDQTVIGNFQQQNTHVVYDLENDWLS 448
Query: 366 FKPTDC 371
F P C
Sbjct: 449 FAPARC 454
>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
Length = 473
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 120/415 (28%), Positives = 184/415 (44%), Gaps = 78/415 (18%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP--- 86
S+ L+HRD+ Y S ++ + R R+ H + S+S D++
Sbjct: 64 SLSLVHRDAISGATYPSRR---HQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVV 120
Query: 87 -----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
+ Y +R+ +G+PPT++ V D+GSD+IW QC PC QCY Q PLFDP SS
Sbjct: 121 PGVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPC--EQCYAQTDPLFDPAASS 178
Query: 142 TYKSLPCSSSQCASLNQKSCSGV----NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
++ + C S+ C +L+ C G C YSV+YGDGS++ G LA ET+TLG T Q V
Sbjct: 179 SFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGTAVQGV 238
Query: 198 A----------------LPGITFGC-------GTNNGGLFN----SKTTGIVG------- 223
A L G+ +G G GG+F+ S+ G G
Sbjct: 239 AIGCGHRNSGLFVGAAGLLGLGWGAMSLIGQLGGAAGGVFSYCLASRGAGGAGSLVLGRT 298
Query: 224 ----LGGGDISLISQMRTTI----------AGNQRLGVS------TPD----IVIDSGTT 259
+G + L+ + + G +RL + T D +V+D+GT
Sbjct: 299 EAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDGLFQLTEDGAGGVVMDTGTA 358
Query: 260 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF-RGADVK 316
+T LP+ + L + A P + L+ CY + + +VP V+ +F +GA +
Sbjct: 359 VTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFDQGAVLT 418
Query: 317 LSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
L N V+V + C F ++ + I GNI Q + D V F P C
Sbjct: 419 LPARNLLVEVGGAVFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPNTC 473
>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
Length = 473
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 120/415 (28%), Positives = 184/415 (44%), Gaps = 78/415 (18%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN-- 87
S+ L+HRD+ Y S ++ + R R+ H + S+S D++
Sbjct: 64 SLSLVHRDAISGATYPSRR---HQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVV 120
Query: 88 ------NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
+ Y +R+ +G+PPT++ V D+GSD+IW QC PC QCY Q PLFDP SS
Sbjct: 121 PGVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPC--EQCYAQTDPLFDPAASS 178
Query: 142 TYKSLPCSSSQCASLNQKSCSGV----NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
++ + C S+ C +L+ C G C YSV+YGDGS++ G LA ET+TLG T Q V
Sbjct: 179 SFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGTAVQGV 238
Query: 198 A----------------LPGITFGC-------GTNNGGLFN----SKTTGIVG------- 223
A L G+ +G G GG+F+ S+ G G
Sbjct: 239 AIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASRGAGGAGSLVLGRT 298
Query: 224 ----LGGGDISLISQMRTTI----------AGNQRLGVS------TPD----IVIDSGTT 259
+G + L+ + + G +RL + T D +V+D+GT
Sbjct: 299 EAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTA 358
Query: 260 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF-RGADVK 316
+T LP+ + L + A P + L+ CY + + +VP V+ +F +GA +
Sbjct: 359 VTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFDQGAVLT 418
Query: 317 LSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
L N V+V + C F ++ + I GNI Q + D V F P C
Sbjct: 419 LPARNLLVEVGGAVFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPNTC 473
>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
Length = 502
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 113/351 (32%), Positives = 163/351 (46%), Gaps = 76/351 (21%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y RI +GTP E V DTGSD+ W QC PC S+CY Q P+FDP SST+KSL
Sbjct: 161 SGEYFSRIGVGTPAKEMYVVLDTGSDVNWIQCLPC--SECYQQSDPIFDPTSSSTFKSLT 218
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
CS +CASL+ +C C Y VSYGDGSF+ GN AT+TVT G++ + + GCG
Sbjct: 219 CSDPKCASLDVSACRSNKCLYQVSYGDGSFTVGNYATDTVTF----GESGKVNDVALGCG 274
Query: 208 TNNGGLF---------------------------------NSKTTGI----VGLGGGDIS 230
+N GLF ++K++ + V +G GD +
Sbjct: 275 HDNEGLFTGAAGLLGLGGGALSMTNQIKAKSFSYCLVDRDSAKSSSLDFNSVQIGAGDAT 334
Query: 231 --LI--SQMRT---------TIAGNQRLGVSTPD------------IVIDSGTTLTFL-P 264
L+ S+M T ++ G Q VS P +++D GT +T L
Sbjct: 335 APLLRNSKMDTFYYVGLSGFSVGGQQ---VSIPSSLFEVDASGAGGVILDCGTAVTRLQT 391
Query: 265 QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGAD-VKLSRSN 321
Q YNS + + + + P + CY F+SLS +VP VT HF G + L N
Sbjct: 392 QAYNSLRDAFVKLTTDFKKGTSPISLFDTCYDFSSLSTVKVPTVTFHFTGGKSLNLPAKN 451
Query: 322 FFVKVSE-DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+ + + + C F ++S+ I GN+ Q + YD+ + C
Sbjct: 452 YLIPIDDAGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLANNLIGLSANKC 502
>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 155 bits (392), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 92/215 (42%), Positives = 129/215 (60%), Gaps = 14/215 (6%)
Query: 26 TGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ----NSSISSSKASQ 81
T GF V L H DS K+ T +R++ + R +RL N S++ S +
Sbjct: 45 TKGFRVMLRHVDSGKN------LTKLERVQHGIKRGKSRLQRLNAMVLAASTLDSEDQLE 98
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
A I N YL+ ++IGTPP AV DTGSDLIWTQC+PC +QCY Q +P+FDPK SS
Sbjct: 99 APIHAGNGEYLMELAIGTPPVSYPAVLDTGSDLIWTQCKPC--TQCYKQPTPIFDPKKSS 156
Query: 142 TYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
++ + C SS C+++ +CS C+Y SYGD S + G LATET T G + + V++
Sbjct: 157 SFSKVSCGSSLCSAVPSSTCSD-GCEYVYSYGDYSMTQGVLATETFTFGKSKNK-VSVHN 214
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
I FGCG +N G + +G+VGLG G +SL+SQ++
Sbjct: 215 IGFGCGEDNEGDGFEQASGLVGLGRGPLSLVSQLK 249
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 48/156 (30%), Positives = 82/156 (52%), Gaps = 7/156 (4%)
Query: 221 IVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIE 280
+ G+ GD L + T G+ G ++IDSGTT+T++ Q L S +
Sbjct: 301 LEGISVGDTRLSIEKSTFEVGDDGNG----GVIIDSGTTITYIEQKAFEALKKEFISQTK 356
Query: 281 AQPVADPTGSLELCYSFNSLS---QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKG 337
+ L+LC+S S S ++P++ HF+G D++L N+ + S V + G
Sbjct: 357 LPLDKTSSTGLDLCFSLPSGSTQVEIPKIVFHFKGGDLELPAENYMIGDSNLGVACLAMG 416
Query: 338 ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
++ + I+GN+ Q N LV +D+E++T+SF PT C +
Sbjct: 417 ASSGMSIFGNVQQQNILVNHDLEKETISFVPTSCDQ 452
>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 155 bits (392), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 115/411 (27%), Positives = 178/411 (43%), Gaps = 76/411 (18%)
Query: 23 EAQTGG--FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS 80
+ + GG + ++++HRD + +S+ RL L R R+ + S +
Sbjct: 125 DHEEGGEKWMMKVVHRDQLS---FGNSDDHRHRLDGRLKRDAKRVASLIRRLSSGGGGSY 181
Query: 81 QAD---------IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQD 131
+ D + + Y +RI +G+PP + V D+GSD++W QC+PC +QCY Q
Sbjct: 182 RVDDFGTDVISGMEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC--TQCYHQS 239
Query: 132 SPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGS 191
P+FDP S+++ + CSSS C L C C+Y VSYGDGS++ G LA ET+T G
Sbjct: 240 DPVFDPADSASFTGVSCSSSVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTFGR 299
Query: 192 TTGQAVALPGITFGCGTNNGGLF---------NSKTTGIVGLGGGDIS------LISQMR 236
T ++VA+ GCG N G+F + VG GG L+S
Sbjct: 300 TMVRSVAI-----GCGHRNRGMFVGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSAAW 354
Query: 237 TTIAGNQRL--------------GVSTP--------------DIVIDSGTTLTFLP---- 264
+ N R G+ P +V+D+GT +T LP
Sbjct: 355 VPLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMDTGTAVTRLPTLAY 414
Query: 265 QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFRGADV-KLSRSN 321
Q + L+ +++ A VA + CY +VP V+ +F G + L N
Sbjct: 415 QAFRDAFLAQTANLPRATGVA----IFDTCYDLLGFVSVRVPTVSFYFSGGPILTLPARN 470
Query: 322 FFVKVSE-DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
F + + + C F T+ + I GNI Q + +D V F P C
Sbjct: 471 FLIPMDDAGTFCFAFAPSTSGLSILGNIQQEGIQISFDGANGYVGFGPNIC 521
>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 436
Score = 155 bits (392), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 134/451 (29%), Positives = 200/451 (44%), Gaps = 94/451 (20%)
Query: 1 MATFLSCVFILFFLCFYV----VSPIEAQTGG---------FSVELIHRDSPKSPFYNSS 47
MA+ S + I+ L V VSP + + G F V L H DS +
Sbjct: 1 MASSGSHMIIVILLALAVSSALVSPAASTSRGLDRRPEKTWFRVSLRHVDS------GGN 54
Query: 48 ETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAV 107
T ++RL+ A+ R RL + ++ S + +A + N +L++++IGTP A+
Sbjct: 55 YTKFERLQRAMKRGKLRLQRLSAKTA-SFESSVEAPVHAGNGEFLMKLAIGTPAETYSAI 113
Query: 108 ADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQ 167
DTGSDLIWTQC+PC C+ Q +P+FDPK SS++ LPCSS CA+L SCS C+
Sbjct: 114 MDTGSDLIWTQCKPC--KDCFDQPTPIFDPKKSSSFSKLPCSSDLCAALPISSCSD-GCE 170
Query: 168 YSVSY-----------------GDGSFSNGNLATETVTLGSTTGQAVALPGITFG----- 205
Y SY GD S S GS Q L G+ G
Sbjct: 171 YLYSYGDYSSTQGVLATETFAFGDASVSKIGFGCGEDNDGSGFSQGAGLVGLGRGPLSLI 230
Query: 206 ----------CGTN--------------NGGLFNSKTTGIV--------------GLGGG 227
C T+ + N+ TT ++ G+ G
Sbjct: 231 SQLGEPKFSYCLTSMDDSKGISSLLVGSEATMKNAITTPLIQNPSQPSFYYLSLEGISVG 290
Query: 228 DISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADP 287
D L + T N G ++IDSGTT+T+L + L S ++ D
Sbjct: 291 DTLLPIEKSTFSIQNDGSG----GLIIDSGTTITYLEDSAFAALKKEFISQLKLD--VDE 344
Query: 288 TGS--LELCYSF---NSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSV 342
+GS L+LC++ S VP++ HF GAD+KL N+ + S V + G ++ +
Sbjct: 345 SGSTGLDLCFTLPPDASTVDVPQLVFHFEGADLKLPAENYIIADSGLGVICLTMGSSSGM 404
Query: 343 PIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
I+GN Q N +V +D+E++T+SF P C +
Sbjct: 405 SIFGNFQQQNIVVLHDLEKETISFAPAQCNQ 435
>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
Length = 475
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 130/434 (29%), Positives = 181/434 (41%), Gaps = 88/434 (20%)
Query: 18 VVSPIEAQTGGFSVELIHRDSPKSPFYNSSET-----PYQRLRDALTRSLNR-------L 65
V+SP A T S+ + HR S N T RL A S++
Sbjct: 51 VLSP-RASTTKSSLHVTHRHGTCSRLNNGKATSPDHVEILRLDQARVNSIHSKLSKKLTT 109
Query: 66 NHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
NH +Q+ S + + + NY++ + +GTP + + DTGSDL WTQC+PC +
Sbjct: 110 NHVSQSQSTDLPAKDGSTL--GSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRT 167
Query: 126 QCYMQDSPLFDPKMSSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNG 180
CY Q P+F+P S++Y ++ CSS+ C SL N SCS NC Y + YGD SFS G
Sbjct: 168 -CYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVG 226
Query: 181 NLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT-- 238
LA + TL S+ G+ FGCG NN GLF + G++GLG +S SQ T
Sbjct: 227 FLAKDKFTLTSSD----VFDGVYFGCGENNQGLF-TGVAGLLGLGRDKLSFPSQTATAYN 281
Query: 239 --------------------------------------------------IAGNQRLGV- 247
G Q+L +
Sbjct: 282 KIFSYCLPSSASYTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIP 341
Query: 248 ----STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV 303
STP +IDSGT +T LP + L S + + P L+ C+ + V
Sbjct: 342 STVFSTPGALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTV 401
Query: 304 --PEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYD 358
P+V F GA V+L F VC F G ++ + I+GN+ Q V YD
Sbjct: 402 TIPKVAFSFSGGAVVELGSKGIFYAFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYD 461
Query: 359 IEQQTVSFKPTDCT 372
V F P C+
Sbjct: 462 GAGGRVGFAPNGCS 475
>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
Length = 436
Score = 155 bits (391), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 119/412 (28%), Positives = 187/412 (45%), Gaps = 77/412 (18%)
Query: 25 QTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI 84
+ GF V L H DS + T ++RL+ A+ R RL + ++ S + +A +
Sbjct: 38 EKNGFRVSLRHVDS------GGNYTKFERLQRAVKRGRLRLQRLSAKTA-SFEPSVEAPV 90
Query: 85 IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
N +L+ ++IGTP A+ DTGSDLIWTQC+PC C+ Q +P+FDP+ SS++
Sbjct: 91 HAGNGEFLMNLAIGTPAETYSAIMDTGSDLIWTQCKPC--KVCFDQPTPIFDPEKSSSFS 148
Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVS-----------------YGDGSFSNGNLATETV 187
LPCSS C +L SCS C+Y S +GD S S
Sbjct: 149 KLPCSSDLCVALPISSCSD-GCEYRYSYGDHSSTQGVLATETFTFGDASVSKIGFGCGED 207
Query: 188 TLGSTTGQAVALPGITFG---------------CGTNNGGLFNSKTTGIVGLGGGDISLI 232
G Q L G+ G C T+ + +SK + L G + ++
Sbjct: 208 NRGRAYSQGAGLVGLGRGPLSLISQLGVPKFSYCLTS---IDDSKGISTL-LVGSEATVK 263
Query: 233 SQMRTTIAGNQRL---------GVSTPD-------------------IVIDSGTTLTFLP 264
S + T + N G+S D ++IDSGTT+T+L
Sbjct: 264 SAIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTITYLK 323
Query: 265 QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF---NSLSQVPEVTIHFRGADVKLSRSN 321
+ L S ++ A + LELC++ S +VP++ HF G D+KL + N
Sbjct: 324 DNAFAALKKEFISQMKLDVDASGSTELELCFTLPPDGSPVEVPQLVFHFEGVDLKLPKEN 383
Query: 322 FFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
+ ++ S V + G ++ + I+GN Q N +V +D+E++T+SF P C +
Sbjct: 384 YIIEDSALRVICLTMGSSSGMSIFGNFQQQNIVVLHDLEKETISFAPAQCNQ 435
>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
Length = 521
Score = 154 bits (390), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 128/445 (28%), Positives = 183/445 (41%), Gaps = 103/445 (23%)
Query: 24 AQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD 83
A + G + ++HR P SP ++ P + L NR+ + S +++ +
Sbjct: 83 ASSSGTRMTIVHRHGPCSPLADAHGKPPSH-DEILAADQNRVESIHHRVSTTATVRGKPK 141
Query: 84 IIPN---------------------------------NANYLIRISIGTPPTERLAVADT 110
P+ NY++ I +GTP + V DT
Sbjct: 142 RRPSPSRRQQQPSAPAPAASLSSSTASLPASSGRALGTGNYVVTIGLGTPASRYTVVFDT 201
Query: 111 GSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSV 170
GSD W QC+PC CY Q LFDP SSTY ++ C++ C+ L + CSG +C YSV
Sbjct: 202 GSDTTWVQCQPCV-VVCYKQQEKLFDPARSSTYANVSCAAPACSDLYTRGCSGGHCLYSV 260
Query: 171 SYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLF---------------- 214
YGDGS+S G A +T+TL S A+ G FGCG N GLF
Sbjct: 261 QYGDGSYSIGFFAMDTLTLSSYD----AVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSL 316
Query: 215 ------------------NSKTTGIVGLGGGDISLISQMRTT------------------ 238
S TG + G G + + +TT
Sbjct: 317 PVQTYDKYGGVFAHCLPARSSGTGYLDFGPGSPAAVGARQTTPMLTDNGPTFYYVGMTGI 376
Query: 239 IAGNQRLGV-----STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ--PVADPTGSL 291
G Q L + ST ++DSGT +T LP S+L S +S + A+ A L
Sbjct: 377 RVGGQLLSIPQSVFSTAGTIVDSGTVITRLPPAAYSSLRSAFASAMAARGYKKAPALSLL 436
Query: 292 ELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKG--ITNSVPIYG 346
+ CY F +S+V P+V++ F+ GA + ++ S S VC F + V I G
Sbjct: 437 DTCYDFTGMSEVAIPKVSLLFQGGAYLDVNASGIMYAASLSQVCLGFAANEDDDDVGIVG 496
Query: 347 NIMQTNFLVGYDIEQQTVSFKPTDC 371
N F V YDI ++TV F P C
Sbjct: 497 NTQLKTFGVVYDIGKKTVGFSPGAC 521
>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 154 bits (390), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 89/210 (42%), Positives = 133/210 (63%), Gaps = 15/210 (7%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS-QADIIP 86
GF +L H DS K+ T ++R++ + R +RL F + ++SS + A ++P
Sbjct: 39 GFRAKLKHVDSGKNL------TKFERIQHGVKRGRHRLQRFKAMALVASSNSEIDAPVLP 92
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
N +L++++IGTPP A+ DTGSDLIWTQC+PC +QC+ Q +P+FDPK SS++ L
Sbjct: 93 GNGEFLMKLAIGTPPETYSAIMDTGSDLIWTQCKPC--TQCFDQPTPIFDPKKSSSFSKL 150
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
CSS C +L Q +CS C+Y YGD S + G LA+ET+T G V++P + FGC
Sbjct: 151 SCSSKLCEALPQSTCSD-GCEYLYGYGDYSSTQGMLASETLTFGK-----VSVPEVAFGC 204
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
G +N G S+ +G+VGLG G +SL+SQ++
Sbjct: 205 GEDNEGSGFSQGSGLVGLGRGPLSLVSQLK 234
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 48/127 (37%), Positives = 77/127 (60%), Gaps = 7/127 (5%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLS---QVPEV 306
++IDSGTT+T+L Q + +S I PV D +GS LE+C++ S S +VP++
Sbjct: 314 LIIDSGTTITYLEQSAFDLVAKEFTSQINL-PV-DNSGSTGLEVCFTLPSGSTDIEVPKL 371
Query: 307 TIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
HF GAD++L N+ + + V + G ++ + I+GNI Q N LV +D+E++T+SF
Sbjct: 372 VFHFDGADLELPAENYMIADASMGVACLAMGSSSGMSIFGNIQQQNMLVLHDLEKETLSF 431
Query: 367 KPTDCTK 373
PT C +
Sbjct: 432 LPTQCDE 438
>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 154 bits (389), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 93/216 (43%), Positives = 129/216 (59%), Gaps = 15/216 (6%)
Query: 26 TGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ-----NSSISSSKAS 80
T GF V L H DS K+ T +R++ + R +RL N +S+ S
Sbjct: 44 TNGFRVMLRHVDSGKN------LTKLERVQHGIKRGKSRLQKLNAMVLAASSTPDSEDQL 97
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
+A I N YLI ++IGTPP AV DTGSDLIWTQC+PC ++CY Q +P+FDPK S
Sbjct: 98 EAPIHAGNGEYLIELAIGTPPVSYPAVLDTGSDLIWTQCKPC--TRCYKQPTPIFDPKKS 155
Query: 141 STYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
S++ + C SS C++L +CS C+Y SYGD S + G LATET T G + + V++
Sbjct: 156 SSFSKVSCGSSLCSALPSSTCSD-GCEYVYSYGDYSMTQGVLATETFTFGKSKNK-VSVH 213
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
I FGCG +N G + +G+VGLG G +SL+SQ++
Sbjct: 214 NIGFGCGEDNEGDGFEQASGLVGLGRGPLSLVSQLK 249
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 44/126 (34%), Positives = 74/126 (58%), Gaps = 5/126 (3%)
Query: 252 IVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS---QVPEVT 307
++IDSGTT+T++ Q Y + +S A TG L+LC+S S S ++P++
Sbjct: 328 VIIDSGTTITYVQQKAYEALKKEFISQTKLALDKTSSTG-LDLCFSLPSGSTQVEIPKLV 386
Query: 308 IHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
HF+G D++L N+ + S V + G ++ + I+GN+ Q N LV +D+E++T+SF
Sbjct: 387 FHFKGGDLELPAENYMIGDSNLGVACLAMGASSGMSIFGNVQQQNILVNHDLEKETISFV 446
Query: 368 PTDCTK 373
PT C +
Sbjct: 447 PTSCDQ 452
>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 384
Score = 154 bits (389), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 116/388 (29%), Positives = 171/388 (44%), Gaps = 77/388 (19%)
Query: 56 DALTRSLNRLNHFNQNSSISS--SKASQADIIPNNANYLIRISIGTPPTERLAVADTGSD 113
+A+ RS R+ + S + S+ Q+ + N YL+ +++G+PP + DTGSD
Sbjct: 2 EAVQRSHERVAFYTLKLSPDAFGSQEFQSPVKAGNGEYLMTLTLGSPPQSFDVIVDTGSD 61
Query: 114 LIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC--ASLNQKSCSGVNCQYSVS 171
L W QC PC CY Q P FDP S +++ C+ + C ++L K+C+ CQY +
Sbjct: 62 LNWVQCLPC--RVCYQQPGPKFDPSKSRSFRKAACTDNLCNVSALPLKACAANVCQYQYT 119
Query: 172 YGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISL 231
YGD S +NG+LA ET++L + G ++P FGCGT N G F + G+VGLG G +SL
Sbjct: 120 YGDQSNTNGDLAFETISLNNGAGTQ-SVPNFAFGCGTQNLGTF-AGAAGLVGLGQGPLSL 177
Query: 232 ISQMRTTIA--------------------------------------------------- 240
SQ+ T A
Sbjct: 178 NSQLSHTFANKFSYCLVSLNSLSASPLTFGSIAAAANIQYTSIVVNARHPTYYYVQLNSI 237
Query: 241 --GNQRLGVSTPDIVIDSGT-----------TLTFLPQGYNSNLLSVMSSMIEAQPVADP 287
G Q L ++ ID T T+T L S +L S + +
Sbjct: 238 EVGGQPLNLAPSVFAIDQSTGRGGTIIDSGTTITMLTLPAYSAVLRAYESFVNYPRLDGS 297
Query: 288 TGSLELCYSFNSLSQ--VPEVTIHFRGADVKLSRSNFFVKV--SEDIVCSVFKGITNSVP 343
L+LC++ +S VP++ F+GAD ++ N FV V S +C G +
Sbjct: 298 AYGLDLCFNIAGVSNPSVPDMVFKFQGADFQMRGENLFVLVDTSATTLCLAMGG-SQGFS 356
Query: 344 IYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
I GNI Q N LV YD+E + + F DC
Sbjct: 357 IIGNIQQQNHLVVYDLEAKKIGFATADC 384
>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 154 bits (389), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 79/221 (35%), Positives = 122/221 (55%), Gaps = 18/221 (8%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRL----NHFNQNSSISSSKASQADI 84
+ ++L+HRD K P +N+S R + R R+ H + +A +D+
Sbjct: 66 YKLKLVHRD--KVPTFNTSHDHRTRFNARMQRDTKRVAALRRHLAAGKPTYAEEAFGSDV 123
Query: 85 IPN----NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
+ + Y +RI +G+PP + V D+GSD+IW QCEPC +QCY Q P+F+P S
Sbjct: 124 VSGMEQGSGEYFVRIGVGSPPRNQYVVIDSGSDIIWVQCEPC--TQCYHQSDPVFNPADS 181
Query: 141 STYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
S+Y + C+S+ C+ ++ C C+Y VSYGDGS++ G LA ET+T G T + VA+
Sbjct: 182 SSYAGVSCASTVCSHVDNAGCHEGRCRYEVSYGDGSYTKGTLALETLTFGRTLIRNVAI- 240
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
GCG +N G+F G++GLG G +S + Q+ G
Sbjct: 241 ----GCGHHNQGMF-VGAAGLLGLGSGPMSFVGQLGGQAGG 276
>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
Length = 436
Score = 154 bits (388), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 119/412 (28%), Positives = 186/412 (45%), Gaps = 77/412 (18%)
Query: 25 QTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI 84
+ GF V L H DS + T ++RL+ A+ R RL + ++ S + +A +
Sbjct: 38 EKNGFRVSLRHVDS------GGNYTKFERLQRAVKRGRLRLQRLSAKTA-SFEPSVEAPV 90
Query: 85 IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
N +L+ ++IGTP A+ DTGSDLIWTQC+PC C+ Q +P+FDP+ SS++
Sbjct: 91 HAGNGEFLMNLAIGTPAETYSAIMDTGSDLIWTQCKPC--KVCFDQPTPIFDPEKSSSFS 148
Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVS-----------------YGDGSFSNGNLATETV 187
LPCSS C +L SCS C+Y S +GD S S
Sbjct: 149 KLPCSSDLCVALPISSCSD-GCEYRYSYGDHSSTQGVLATETFTFGDASVSKIGFGCGED 207
Query: 188 TLGSTTGQAVALPGITFG---------------CGTNNGGLFNSKTTGIVGLGGGDISLI 232
G Q L G+ G C T+ + +SK + L G + ++
Sbjct: 208 NRGRAYSQGAGLVGLGRGPLSLISQLGVPKFSYCLTS---IDDSKGISTL-LVGSEATVK 263
Query: 233 SQMRTTIAGNQRL---------GVSTPD-------------------IVIDSGTTLTFLP 264
S + T + N G+S D ++IDSGTT+T+L
Sbjct: 264 SAIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTITYLK 323
Query: 265 QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF---NSLSQVPEVTIHFRGADVKLSRSN 321
+ L S ++ A + LELC++ S VP++ HF G D+KL + N
Sbjct: 324 DSAFAALKKEFISQMKLDVDASGSTELELCFTLPPDGSPVDVPQLVFHFEGVDLKLPKEN 383
Query: 322 FFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
+ ++ S V + G ++ + I+GN Q N +V +D+E++T+SF P C +
Sbjct: 384 YIIEDSALRVICLTMGSSSGMSIFGNFQQQNIVVLHDLEKETISFAPAQCNQ 435
>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
Length = 500
Score = 154 bits (388), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 127/416 (30%), Positives = 174/416 (41%), Gaps = 83/416 (19%)
Query: 33 LIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQN----SSISSSK---------A 79
++HR P SP ++ + + L NR + +++S K A
Sbjct: 91 IVHRHGPCSPLADAHDGKLPSHEEILAADQNRAKSIQRRVSTTTTVSRGKPKRNRPSLPA 150
Query: 80 SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
S + NY++ I +GTP V DTGSD W QCEPC CY Q LFDP
Sbjct: 151 SSGSAL-GTGNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCV-VVCYKQQEKLFDPAR 208
Query: 140 SSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
SSTY ++ C++ C+ L K CSG +C Y V YGDGS+S G A +T+TL S A+
Sbjct: 209 SSTYANISCAAPACSDLYIKGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD----AI 264
Query: 200 PGITFGCGTNNGGLF----------------------------------NSKTTGIVGLG 225
G FGCG N GL+ S TG + G
Sbjct: 265 KGFRFGCGERNEGLYGEAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFPARSSGTGYLDFG 324
Query: 226 GGDISLISQMRTT------------------IAGNQRLGV-----STPDIVIDSGTTLTF 262
G + +S TT G + L + +T ++DSGT +T
Sbjct: 325 PGSLPAVSAKLTTPMLVDNGPTFYYVGLTGIRVGGKLLSIPQSVFTTSGTIVDSGTVITR 384
Query: 263 LPQGYNSNLLSVM-SSMIEAQPVADPTGS-LELCYSFNSLSQV--PEVTIHFR-GADVKL 317
LP S+L S S+M E P S L+ CY F +S+V P V++ F+ GA + +
Sbjct: 385 LPPAAYSSLRSAFASAMAERGYKKAPALSLLDTCYDFTGMSEVAIPTVSLLFQGGASLDV 444
Query: 318 SRSNFFVKVSEDIVCSVFKG--ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
S S C F G + V I GN F V YDI ++ V F P C
Sbjct: 445 HASGIIYAASVSQACLGFAGNKEDDDVGIVGNTQLKTFGVVYDIGKKVVGFCPGAC 500
>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
Length = 351
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 110/344 (31%), Positives = 153/344 (44%), Gaps = 66/344 (19%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ NY+I + GTP + V DTGSD+ W QC+PC +CY Q PLFDP +SSTY+++
Sbjct: 13 SGNYVITVGFGTPTRTQTVVFDTGSDVNWLQCKPCA-VRCYAQQEPLFDPSLSSTYRNVS 71
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
C+ C L+ + CS C Y V YGDGS + G LA +T L A FGCG
Sbjct: 72 CTEPACVGLSTRGCSSSTCLYGVFYGDGSSTIGFLAMDTFML----TPAQKFKNFIFGCG 127
Query: 208 TNNGGLFNSKTTGIVGLG-GGDISLISQMRTTIA-------------------------- 240
NN GLF T G+VGLG SL SQ+ ++
Sbjct: 128 QNNTGLFQG-TAGLVGLGRSSTYSLNSQVAPSLGNVFSYCLPSTSSATGYLNIGNPQNTP 186
Query: 241 ------------------------GNQRLGVSTP-----DIVIDSGTTLTFLPQGYNSNL 271
G RL +S+ +IDSGT +T LP S L
Sbjct: 187 GYTAMLTDTRVPTLYFIDLIGISVGGTRLSLSSTVFQSVGTIIDSGTVITRLPPTAYSAL 246
Query: 272 LSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRGADVKLSRSNFFVKVSED 329
+ + + + +A L+ CY F+ + V P + +HF G DV++ + F +
Sbjct: 247 KTAVRAAMTQYTLAPAVTILDTCYDFSRTTSVVYPVIVLHFAGLDVRIPATGVFFVFNSS 306
Query: 330 IVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
VC F G T+S + I GN+ Q V YD E + + F C
Sbjct: 307 QVCLAFAGNTDSTMIGIIGNVQQLTMEVTYDNELKRIGFSAGAC 350
>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
Short=AtASPG1; Flags: Precursor
gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 500
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 127/407 (31%), Positives = 175/407 (42%), Gaps = 92/407 (22%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNN 88
F+VE + R K P YN +T YQ + LT + S ASQ +
Sbjct: 122 FAVEGVDRSDLK-PVYNE-DTRYQT--EDLTTPV-------------VSGASQG-----S 159
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y RI +GTP E V DTGSD+ W QCEPC + CY Q P+F+P SSTYKSL C
Sbjct: 160 GEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPC--ADCYQQSDPVFNPTSSSTYKSLTC 217
Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
S+ QC+ L +C C Y VSYGDGSF+ G LAT+TVT G++ + + GCG
Sbjct: 218 SAPQCSLLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSG----KINNVALGCGH 273
Query: 209 NNGGLFNS------------------KTTGI-------------------VGLGGGDISL 231
+N GLF K T V LGGGD +
Sbjct: 274 DNEGLFTGAAGLLGLGGGVLSITNQMKATSFSYCLVDRDSGKSSSLDFNSVQLGGGDATA 333
Query: 232 ISQMRTTIAGNQRLGVS----------TPD------------IVIDSGTTLTFL-PQGYN 268
I +G+S PD +++D GT +T L Q YN
Sbjct: 334 PLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYN 393
Query: 269 SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGAD-VKLSRSNFFVK 325
S + + + + + + CY F+SLS +VP V HF G + L N+ +
Sbjct: 394 SLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPAKNYLIP 453
Query: 326 VSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
V + C F ++S+ I GN+ Q + YD+ + + C
Sbjct: 454 VDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500
>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 119/382 (31%), Positives = 174/382 (45%), Gaps = 78/382 (20%)
Query: 59 TRSLNRLNHFNQNSSISSSKASQADIIPNNA----NYLIRISIGTPPTERLAVADTGSDL 114
T ++ L N ++++ S AS + P + NY+ R+ +GTP + V DTGS L
Sbjct: 102 TVTVASLYRANDDAAVDGSLAS-VPLTPGTSYGVGNYVTRMGLGTPAKPYIMVVDTGSSL 160
Query: 115 IWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-----ASLNQKSCSGVN-CQY 168
W QC PC S C+ Q P+FDPK SS+Y ++ CS+ QC A+LN +CS + C Y
Sbjct: 161 TWLQCSPCRVS-CHRQSGPVFDPKTSSSYAAVSCSTPQCNDLSTATLNPAACSSSDVCIY 219
Query: 169 SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGD 228
SYGD SFS G L+ +TV+ GS + +P +GCG +N GLF ++ G++GL
Sbjct: 220 QASYGDSSFSVGYLSKDTVSFGSNS-----VPNFYYGCGQDNEGLFG-RSAGLMGLARNK 273
Query: 229 ISLISQMR---------------------------------------------------T 237
+SL+ Q+
Sbjct: 274 LSLLYQLAPTLGYSFSYCLPSSSSSGYLSIGSYNPGQYSYTPMVSSTLDDSLYFIKLSGM 333
Query: 238 TIAGNQRLGVSTPD-----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE 292
T+AG + L VS+ + +IDSGT +T LP L ++ ++ AD L+
Sbjct: 334 TVAG-KPLAVSSSEYSSLPTIIDSGTVITRLPTTVYDALSKAVAGAMKGTKRADAYSILD 392
Query: 293 LCYSFNSLS-QVPEVTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQ 350
C+ + S +VP V++ F G A +KLS N V V C F S I GN Q
Sbjct: 393 TCFVGQASSLRVPAVSMAFSGGAALKLSAQNLLVDVDSSTTCLAF-APARSAAIIGNTQQ 451
Query: 351 TNFLVGYDIEQQTVSFKPTDCT 372
F V YD++ + F CT
Sbjct: 452 QTFSVVYDVKSNRIGFAAGGCT 473
>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 474
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 127/427 (29%), Positives = 179/427 (41%), Gaps = 83/427 (19%)
Query: 23 EAQTGGFSVELIHRDSPKSPFYNSSET-----PYQRLRDALTRSLN-RLNHFNQNSSISS 76
A T S+ + HR S N T RL A S++ +L+ +S
Sbjct: 54 RASTTKSSLHVTHRHGTCSRLNNGKATSPDHVEILRLDQARVNSIHSKLSKKLATDHVSE 113
Query: 77 SKA----SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
SK+ ++ + NY++ + +GTP + + DTGSDL WTQC+PC + CY Q
Sbjct: 114 SKSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRT-CYDQKE 172
Query: 133 PLFDPKMSSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETV 187
P+F+P S++Y ++ CSS+ C SL N SCS NC Y + YGD SFS G LA E
Sbjct: 173 PIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKEKF 232
Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT--------- 238
TL ++ G+ FGCG NN GLF + G++GLG +S SQ T
Sbjct: 233 TLTNSD----VFDGVYFGCGENNQGLF-TGVAGLLGLGRDKLSFPSQTATAYNKIFSYCL 287
Query: 239 -------------------------------------------IAGNQRLGV-----STP 250
G Q+L + STP
Sbjct: 288 PSSASYTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTP 347
Query: 251 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTI 308
+IDSGT +T LP + L S + + P L+ C+ + V P+V
Sbjct: 348 GALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKVAF 407
Query: 309 HFR-GADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVS 365
F GA V+L F VC F G ++ + I+GN+ Q V YD V
Sbjct: 408 SFSGGAVVELGSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVG 467
Query: 366 FKPTDCT 372
F P C+
Sbjct: 468 FAPNGCS 474
>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
Length = 452
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 85/214 (39%), Positives = 125/214 (58%), Gaps = 16/214 (7%)
Query: 27 GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI-I 85
GG V L H D+ + + + Q L+ A RS +R++ ++ + A D+ +
Sbjct: 38 GGLRVRLTHVDA------HGNYSRLQLLQRAARRSHHRMSRLVARATGVKAVAGGGDLQV 91
Query: 86 P---NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
P N +L+ ++IGTP A+ DTGSDL+WTQC+PC C+ Q +P+FDP SST
Sbjct: 92 PVHAGNGEFLMDVAIGTPALSYAAIVDTGSDLVWTQCKPC--VDCFKQSTPVFDPSSSST 149
Query: 143 YKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
Y ++PCSS+ C+ L +C S C Y+ +YGD S + G LA+ET TLG + LPG
Sbjct: 150 YATVPCSSALCSDLPTSTCTSASKCGYTYTYGDASSTQGVLASETFTLGK---EKKKLPG 206
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
+ FGCG N G ++ G+VGLG G +SL+SQ+
Sbjct: 207 VAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQL 240
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/132 (32%), Positives = 67/132 (50%), Gaps = 9/132 (6%)
Query: 249 TPDIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTG-SLELCYSFNSLS----Q 302
T +++DSGT++T+L QGY + + ++ M A P D + L+LC+ + Q
Sbjct: 322 TGGVIVDSGTSITYLELQGYRALKKAFVAQM--ALPTVDGSEIGLDLCFQGPAKGVDEVQ 379
Query: 303 VPEVTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQ 361
VP++ +HF G AD+ L N+ V S + + + I GN Q NF YD+
Sbjct: 380 VPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVAPSRGLSIIGNFQQQNFQFVYDVAG 439
Query: 362 QTVSFKPTDCTK 373
T+SF P C K
Sbjct: 440 DTLSFAPVQCNK 451
>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 431
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 90/239 (37%), Positives = 138/239 (57%), Gaps = 28/239 (11%)
Query: 1 MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
+ +SC+ +L L + + G+ + L H DS ++ T
Sbjct: 8 LQALMSCLVLLTSLAV-------SASSGYRLALTHVDS--------------KIGLTKTE 46
Query: 61 SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
+ R H ++ ++S A+ + YL+ ++IGTPP +A+ADTGSDL WTQC+
Sbjct: 47 LMRRAAHRSRLRALSGYDANSPRLHSVQVEYLMELAIGTPPVPFVALADTGSDLTWTQCQ 106
Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCAS-LNQKSCSGVN--CQYSVSYGDGSF 177
PC C+ QD+P++DP SST+ +PCSS+ C L ++CS + C+Y SY DG++
Sbjct: 107 PC--KLCFPQDTPVYDPSASSTFSPVPCSSATCLPVLRSRNCSTPSSLCRYGYSYSDGAY 164
Query: 178 SNGNLATETVTLGSTT-GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
S G L TET+TLGS+ GQAV++ + FGCGT+NGG + +TG VGLG G +SL++Q+
Sbjct: 165 SAGILGTETLTLGSSVPGQAVSVSDVAFGCGTDNGG-DSLNSTGTVGLGRGTLSLLAQL 222
>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
Length = 446
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 125/420 (29%), Positives = 177/420 (42%), Gaps = 83/420 (19%)
Query: 30 SVELIHRDSPKSPFYNSSET-----PYQRLRDALTRSLN-RLNHFNQNSSISSSKA---- 79
S+ + HR S N T RL A S++ +L+ +S SK+
Sbjct: 33 SLHVTHRHGTCSRLNNGKATSPDHVEILRLDQARVNSIHSKLSKKLATDHVSESKSTDLP 92
Query: 80 SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
++ + NY++ + +GTP + + DTGSDL WTQC+PC + CY Q P+F+P
Sbjct: 93 AKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRT-CYDQKEPIFNPSK 151
Query: 140 SSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
S++Y ++ CSS+ C SL N SCS NC Y + YGD SFS G LA E TL ++
Sbjct: 152 STSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKEKFTLTNSD- 210
Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT---------------- 238
G+ FGCG NN GLF + G++GLG +S SQ T
Sbjct: 211 ---VFDGVYFGCGENNQGLF-TGVAGLLGLGRDKLSFPSQTATAYNKIFSYCLPSSASYT 266
Query: 239 ------------------------------------IAGNQRLGV-----STPDIVIDSG 257
G Q+L + STP +IDSG
Sbjct: 267 GHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTPGALIDSG 326
Query: 258 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR-GAD 314
T +T LP + L S + + P L+ C+ + V P+V F GA
Sbjct: 327 TVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAV 386
Query: 315 VKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
V+L F VC F G ++ + I+GN+ Q V YD V F P C+
Sbjct: 387 VELGSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGCS 446
>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
Length = 500
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 126/407 (30%), Positives = 175/407 (42%), Gaps = 92/407 (22%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNN 88
F+VE + R K P YN +T YQ + LT + S ASQ +
Sbjct: 122 FAVEGVDRSDLK-PVYNE-DTRYQT--EDLTTPV-------------VSGASQG-----S 159
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y RI +GTP + V DTGSD+ W QCEPC + CY Q P+F+P SSTYKSL C
Sbjct: 160 GEYFSRIGVGTPAKDMYLVLDTGSDVNWIQCEPC--ADCYQQSDPVFNPTSSSTYKSLTC 217
Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
S+ QC+ L +C C Y VSYGDGSF+ G LAT+TVT G++ + + GCG
Sbjct: 218 SAPQCSLLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSG----KINNVALGCGH 273
Query: 209 NNGGLFNS------------------KTTGI-------------------VGLGGGDISL 231
+N GLF K T V LGGGD +
Sbjct: 274 DNEGLFTGAAGLLGLGGGVLSITNQMKATSFSYCLVDRDSGKSSSLDFNSVQLGGGDATA 333
Query: 232 ISQMRTTIAGNQRLGVS----------TPD------------IVIDSGTTLTFL-PQGYN 268
I +G+S PD +++D GT +T L Q YN
Sbjct: 334 PLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYN 393
Query: 269 SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGAD-VKLSRSNFFVK 325
S + + + + + + CY F+SLS +VP V HF G + L N+ +
Sbjct: 394 SLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPAKNYLIP 453
Query: 326 VSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
V + C F ++S+ I GN+ Q + YD+ + + C
Sbjct: 454 VDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500
>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 509
Score = 152 bits (383), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 126/425 (29%), Positives = 184/425 (43%), Gaps = 91/425 (21%)
Query: 33 LIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF-----NQNSSISS--SKASQADII 85
++HR P SP + P D L + R++ N+ S++ S ++ I
Sbjct: 91 VMHRHGPCSPLQTPGDAPSDA--DLLDQDQARVDSILGMITNETSAVGPGVSLPAERGIS 148
Query: 86 PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKS 145
NY++ + +GTP + V DTGSDL W QC PC CY Q PLF P SST+ +
Sbjct: 149 VGTGNYVVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYKQQDPLFAPSDSSTFSA 208
Query: 146 LPCSSSQCASLNQKSCSGV----NCQYSVSYGDGSFSNGNLATETVTLGSTT-GQAVA-- 198
+ C + +C + ++SC G C Y V YGD S + G+L +T+TLG+ A A
Sbjct: 209 VRCGARECRA--RQSCGGSPGDDRCPYEVVYGDKSRTQGHLGNDTLTLGTMAPANASAEN 266
Query: 199 ---LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM-------------------- 235
LPG FGCG NN GLF + G+ GLG G +SL SQ
Sbjct: 267 DNKLPGFVFGCGENNTGLFG-QADGLFGLGRGKVSLSSQAAGKFGEGFSYCLPSSSSSAP 325
Query: 236 ----------------------RTT-------------IAGNQRLGVSTPDI----VIDS 256
RTT +AG + + VS+P + ++DS
Sbjct: 326 GYLSLGTPVPAPAHAQFTPMLNRTTTPSFYYVKLVGIRVAG-RAIRVSSPRVALPLIVDS 384
Query: 257 GTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGS-LELCYSF----NSLSQVPEVTIHF 310
GT +T L P+ Y + + +S+M + P S L+ CY F N+ +P V + F
Sbjct: 385 GTVITRLAPRAYRALRAAFLSAMGKYGYKRAPRLSILDTCYDFTAHANATVSIPAVALVF 444
Query: 311 RGA---DVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
G V S + KV++ + G S I GN Q V YD+ +Q + F
Sbjct: 445 AGGATISVDFSGVLYVAKVAQACLAFAPNGDGRSAGILGNTQQRTLAVVYDVARQKIGFA 504
Query: 368 PTDCT 372
C+
Sbjct: 505 AKGCS 509
>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
Length = 470
Score = 152 bits (383), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 115/348 (33%), Positives = 162/348 (46%), Gaps = 75/348 (21%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY+ + +GTP T V DTGS L W QC PC S C+ Q PL+DP+ SSTY ++PCS
Sbjct: 133 NYVTELGLGTPATSYAMVVDTGSSLTWLQCSPCVVS-CHRQVGPLYDPRASSTYATVPCS 191
Query: 150 SSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
+SQC A+LN +CS N C Y SYGD SFS G L+ +TV+ GS + P
Sbjct: 192 ASQCDELQAATLNPSACSVRNVCIYQASYGDSSFSVGYLSRDTVSFGSGS-----YPNFY 246
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA----------------------- 240
+GCG +N GLF ++ G++GL +SL+ Q+ ++
Sbjct: 247 YGCGQDNEGLFG-RSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPTPASTGYLSIGPYTS 305
Query: 241 ---------------------------GNQRLGVSTPD-----IVIDSGTTLTFLPQG-Y 267
G L VS + +IDSGT +T LP Y
Sbjct: 306 GHYSYTPMASSSLDASLYFVTLSGMSVGGSPLAVSPAEYSSLPTIIDSGTVITRLPTAVY 365
Query: 268 NSNLLSVMSSMIEAQPVADPTGS-LELCYSFN-SLSQVPEVTIHFR-GADVKLSRSNFFV 324
+ +V ++M+ Q + P S L+ C+ S +VP V + F GA +KL+ N +
Sbjct: 366 TALSKAVAAAMVGVQ--SAPAFSILDTCFQGQASQLRVPAVAMAFAGGATLKLATQNVLI 423
Query: 325 KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
V + C F T+S I GN Q F V YD+ Q + F C+
Sbjct: 424 DVDDSTTCLAFA-PTDSTTIIGNTQQQTFSVVYDVAQSRIGFAAGGCS 470
>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 151 bits (381), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 119/425 (28%), Positives = 190/425 (44%), Gaps = 75/425 (17%)
Query: 18 VVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNH-FNQNSSISS 76
+V AQ +LIH S SP++N + + +R + S R+ + + Q
Sbjct: 23 IVEAYNAQPKQLVTKLIHWGSILSPYFNPNASVAERAERIVKTSATRIAYLYAQIKGDIH 82
Query: 77 SKASQADIIPNNAN--YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
+ +++P+ +L+ S+G P T +LA+ DTGS+++W +C PC +C Q+ PL
Sbjct: 83 MNDFELNLLPSTYEPLFLVNFSMGQPATPQLAIMDTGSNILWVRCAPC--KRCTQQNGPL 140
Query: 135 FDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTT 193
DP SSTY SLPC+++ C C+ +N C Y++SY G S G LATE + S+
Sbjct: 141 LDPSKSSTYASLPCTNTMCHYAPSAYCNRLNQCGYNLSYATGLSSAGVLATEQLIFHSSD 200
Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA-----------GN 242
A+P + FGC NG + + TG+ GLG G S +++M + + G
Sbjct: 201 EGVNAVPSVVFGCSHENGDYKDRRFTGVFGLGKGITSFVTRMGSKFSYCLGNIADPHYGY 260
Query: 243 QRL---------GVSTP----------------------DI--------------VIDSG 257
+L G STP DI +IDSG
Sbjct: 261 NQLVFGEKANFEGYSTPLKVVNGHYYVTLEGISVGEKRLDIDSTAFSMKGNEKSALIDSG 320
Query: 258 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ----VPEVTIHFR-G 312
T LT+L + L + + +++ + GS CY ++SQ P VT HF G
Sbjct: 321 TALTWLAESAFRALDNEVRQLLDGVLMPFWRGSFA-CYK-GTVSQDLIGFPVVTFHFSGG 378
Query: 313 ADVKLSRSNFFVKVSEDIVC------SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
AD+ L + F + + DI+C S + S + G + Q + + YD+ + F
Sbjct: 379 ADLDLDTESMFYQATPDILCIAVRQASAYGNDFKSFSVIGLMAQQYYNMAYDLNSNKLFF 438
Query: 367 KPTDC 371
+ DC
Sbjct: 439 QRIDC 443
>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
Length = 410
Score = 150 bits (380), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 116/378 (30%), Positives = 173/378 (45%), Gaps = 65/378 (17%)
Query: 50 PYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN--YLIRISIGTPPTERLAV 107
P L A +S RL+ ++S ++Q + ++ Y + SIGTPP E A+
Sbjct: 39 PAINLTRAAHKSHQRLSMLAARLDDAASGSAQTPLQLDSGGGAYDMTFSIGTPPQELSAL 98
Query: 108 ADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCS--GVN 165
ADTGSDLIW +C C ++C Q SP + P SS++ LPCS S C+ L CS G
Sbjct: 99 ADTGSDLIWAKCGAC--TRCVPQGSPSYYPNKSSSFSKLPCSGSLCSDLPSSQCSAGGAE 156
Query: 166 CQYSVSYGDGS----FSNGNLATETVTLGSTTGQAVALPGITFGCGTN------------ 209
C Y SYG S ++ G L +ET TLGS A+PGI FGC T
Sbjct: 157 CDYKYSYGLASDPHHYTQGYLGSETFTLGSD-----AVPGIGFGCTTMSEGGYGSGSGLV 211
Query: 210 -------------NGGLFN-------SKTTGIV----GLGGGDISLISQMRTT------- 238
N G F+ +KT+ ++ L G + +RT+
Sbjct: 212 GLGRGPLSLVSQLNVGAFSYCLTSDAAKTSPLLFGSGALTGAGVQSTPLLRTSTYYYTVN 271
Query: 239 -----IAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 293
I G + I+ DSGTT+ FL + + + S +A E+
Sbjct: 272 LESISIGAATTAGTGSSGIIFDSGTTVAFLAEPAYTLAKEAVLSQTTNLTMASGRDGYEV 331
Query: 294 CYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNF 353
C+ S + P + +HF G D+ L N+F V + + C + + + S+ I GNIMQ N+
Sbjct: 332 CFQ-TSGAVFPSMVLHFDGGDMDLPTENYFGAVDDSVSCWIVQK-SPSLSIVGNIMQMNY 389
Query: 354 LVGYDIEQQTVSFKPTDC 371
+ YD+E+ +SF+P +C
Sbjct: 390 HIRYDVEKSMLSFQPANC 407
>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 500
Score = 150 bits (380), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 113/350 (32%), Positives = 154/350 (44%), Gaps = 74/350 (21%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y RI +GTP E V DTGSD+ W QCEPC S CY Q P+F+P SSTYKSL
Sbjct: 159 SGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPC--SDCYQQSDPVFNPTSSSTYKSLT 216
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
CS+ QC+ L +C C Y VSYGDGSF+ G LAT+TVT G++ + + GCG
Sbjct: 217 CSAPQCSLLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSG----KINDVALGCG 272
Query: 208 TNNGGLFNS------------------KTTGI-------------------VGLGGGDIS 230
+N GLF K T V LG GD +
Sbjct: 273 HDNEGLFTGAAGLLGLGGGALSITNQMKATSFSYCLVDRDSGKSSSLDFNSVQLGSGDAT 332
Query: 231 ---LISQMRTTI---------AGNQRLGVSTPD------------IVIDSGTTLTFL-PQ 265
L +Q T G Q+ V PD +++D GT +T L Q
Sbjct: 333 APLLRNQKIDTFYYVGLSGFSVGGQK--VMMPDAIFDVDASGSGGVILDCGTAVTRLQTQ 390
Query: 266 GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGAD-VKLSRSNF 322
YNS + + + + CY F+SLS +VP V HF G + L N+
Sbjct: 391 AYNSLRDAFLKLTTNLKKGTSSISLFDTCYDFSSLSSVKVPTVAFHFTGGKSLDLPAKNY 450
Query: 323 FVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+ V ++ C F ++S+ I GN+ Q + YD+ + + C
Sbjct: 451 LIPVDDNGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLANKIIGLSGNKC 500
>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
Length = 471
Score = 150 bits (379), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 118/386 (30%), Positives = 170/386 (44%), Gaps = 77/386 (19%)
Query: 55 RDALTRSLNRLNHFNQNSSISSSKASQADIIPN---NANYLIRISIGTPPTERLAVADTG 111
RD L R H + NSS + +P Y + + +GTP + + DTG
Sbjct: 94 RDQLRVKSIRAKH-SMNSSTTGVFNEMKTRVPTTHFGGGYAVTVGLGTPKKDFSLLFDTG 152
Query: 112 SDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV----NCQ 167
SDL WTQCEPC C+ Q+ FDP S++YK+L CSS C S+ ++S G +C
Sbjct: 153 SDLTWTQCEPC-SGGCFPQNDEKFDPTKSTSYKNLSCSSEPCKSIGKESAQGCSSSNSCL 211
Query: 168 YSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGG 227
Y V YG G ++ G LATET+T+ + GCG NGG F S T G++GLG
Sbjct: 212 YGVKYGTG-YTVGFLATETLTITPSD----VFENFVIGCGERNGGRF-SGTAGLLGLGRS 265
Query: 228 DISLISQMRTTI------------------------------------------------ 239
++L SQ +T
Sbjct: 266 PVALPSQTSSTYKNLFSYCLPASSSSTGHLSFGGGVSQAAKFTPITSKIPELYGLDVSGI 325
Query: 240 -AGNQRLGVS-----TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 293
G ++L + T +IDSGTTLT+LP +S L S M+ + T L+
Sbjct: 326 SVGGRKLPIDPSVFRTAGTIIDSGTTLTYLPSTAHSALSSAFQEMMTNYTLTKGTSGLQP 385
Query: 294 CYSFNSLSQ----VPEVTIHFRGA-DVKLSRSNFFVKVSE-DIVCSVFK--GITNSVPIY 345
CY F+ + +P+++I F G +V + S F+ + + VC FK G V I+
Sbjct: 386 CYDFSKHANDNITIPQISIFFEGGVEVDIDDSGIFIAANGLEEVCLAFKDNGNDTDVAIF 445
Query: 346 GNIMQTNFLVGYDIEQQTVSFKPTDC 371
GN+ Q + V YD+ + V F P C
Sbjct: 446 GNVQQKTYEVVYDVAKGMVGFAPGGC 471
>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
Length = 479
Score = 150 bits (379), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 125/423 (29%), Positives = 172/423 (40%), Gaps = 84/423 (19%)
Query: 23 EAQTGGFSVELIHRDSPKSPFYNSSETPY-QRLRDALTRSLNRLNHFNQNSSISSSKASQ 81
EA G + L H SP + + + + + R +RLN ++ + S S
Sbjct: 65 EALKPGVKIRLDHIHGACSPLRPINSSSWIDMVSQSFDRDNDRLNTIWSKNNGTYSTMSN 124
Query: 82 ADIIPNN----ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
+ P + NY++ GTP L + DTGSD+ W QC+PC S CY Q P+F+P
Sbjct: 125 LPLQPGSKVGTGNYIVTAGFGTPAKNSLLIIDTGSDVTWIQCKPC--SDCYSQVDPIFEP 182
Query: 138 KMSSTYKSLPCSSSQCASL-NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
+ SS+YK L C SS C L C C Y ++YGDGS S G+ + ET+TLGS +
Sbjct: 183 QQSSSYKHLSCLSSACTELTTMNHCRLGGCVYEINYGDGSRSQGDFSQETLTLGSDS--- 239
Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVIDS 256
P FGCG N GLF + G++GLG +S SQ ++ G + PD V +
Sbjct: 240 --FPSFAFGCGHTNTGLFKG-SAGLLGLGRTALSFPSQTKSKYGG--QFSYCLPDFVSST 294
Query: 257 GT------------TLTFLPQGYNSN---------------------------------- 270
T T TF+P NSN
Sbjct: 295 STGSFSVGQGSIPATATFVPLVSNSNYPSFYFVGLNGISVGGERLSIPPAVLGRGGTIVD 354
Query: 271 ---------------LLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR-G 312
L + S P A P L+ CY +S SQV P +T HF+
Sbjct: 355 SGTVITRLVPQAYDALKTSFRSKTRNLPSAKPFSILDTCYDLSSYSQVRIPTITFHFQNN 414
Query: 313 ADVKLSRSNFFVKVSED--IVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTVSFKP 368
ADV +S + D VC F + S+ I GN Q V +D + F P
Sbjct: 415 ADVAVSAVGILFTIQSDGSQVCLAFASASQSISTNIIGNFQQQRMRVAFDTGAGRIGFAP 474
Query: 369 TDC 371
C
Sbjct: 475 GSC 477
>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 485
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 105/354 (29%), Positives = 154/354 (43%), Gaps = 78/354 (22%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y R+ +GTP V DTGSD++W QC PC +CY Q P+FDP+ S TY ++P
Sbjct: 139 SGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPC--RRCYSQSDPIFDPRKSKTYATIP 196
Query: 148 CSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
CSS C L+ C+ C Y VSYGDGSF+ G+ +TET+T + VAL G
Sbjct: 197 CSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVAL-----G 251
Query: 206 CGTNNGGLF-------------------------------------NSKTTGIVGLGGGD 228
CG +N GLF +SK + +V G
Sbjct: 252 CGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVV-FGNAA 310
Query: 229 ISLISQ------------------MRTTIAGNQRLGVSTPDIVID----------SGTTL 260
+S I++ + ++ G + GV+ +D SGT++
Sbjct: 311 VSRIARFTPLLSNPKLDTFYYVELLGISVGGTRVPGVAASLFKLDQIGNGGVIIDSGTSV 370
Query: 261 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRGADVKLS 318
T L + + +A A + C+ +++++ VP V +HFRGADV L
Sbjct: 371 TRLIRPAYIAMRDAFRVGAKALKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRGADVSLP 430
Query: 319 RSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+N+ + V + C F G + I GNI Q F V YD+ V F P C
Sbjct: 431 ATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484
>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
Length = 519
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 113/350 (32%), Positives = 154/350 (44%), Gaps = 71/350 (20%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
NY++ + +GTP + V DTGSD W QC+PC CY Q LFDP SSTY ++
Sbjct: 176 GTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCV-VVCYEQREKLFDPARSSTYANI 234
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C++ C+ L+ + CSG NC Y V YGDGS+S G A +T+TL S A+ G FGC
Sbjct: 235 SCAAPACSDLDTRGCSGGNCLYGVQYGDGSYSIGFFAMDTLTLSSYD----AVKGFRFGC 290
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQ--------------MRTTIAGNQRLGVSTPD- 251
G N GLF + G++GLG G SL Q R++ G G +P
Sbjct: 291 GERNEGLFG-EAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSSGTGYLDFGPGSPAA 349
Query: 252 --------IVIDSG-----------------------------------TTLTFLPQGYN 268
++ D+G T +T LP
Sbjct: 350 AGARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFTTAGTIVDSGTVITRLPPAAY 409
Query: 269 SNLLSVMSSMIEAQ--PVADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFF 323
S+L S +S + A+ A L+ CY F +SQV P V++ F+ GA + + S
Sbjct: 410 SSLRSAFASAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDASGIM 469
Query: 324 VKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
S VC F + V I GN F V YDI ++ V F P C
Sbjct: 470 YAASVSQVCLGFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 519
>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 463
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 116/414 (28%), Positives = 176/414 (42%), Gaps = 87/414 (21%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQN-SSISSSKASQADIIPNN 88
S++++H+ P S N L + L +R++ + S S K + A +P
Sbjct: 66 SLKVVHKHGPCSQL-NQQNGNAPNLVEILLEDQSRVDSIHAKLSDHSGVKETDAAKLPTK 124
Query: 89 A-------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
+ NY++ I +G+P + + + DTGSDL W +C + FDP S+
Sbjct: 125 SGMSLGTGNYIVSIGLGSPKKDLMLIFDTGSDLTWARCSA----------AETFDPTKST 174
Query: 142 TYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
+Y ++ CS+ C+S+ N C+ C Y + YGDGS+S G L E +T+GST
Sbjct: 175 SYANVSCSTPLCSSVISATGNPSRCAASTCVYGIQYGDGSYSIGFLGKERLTIGSTD--- 231
Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ---------------------- 234
FGCG + GLF K G++GLG +S++SQ
Sbjct: 232 -IFNNFYFGCGQDVDGLFG-KAAGLLGLGRDKLSVVSQTAPKYNQLFSYCLPSSSSTGFL 289
Query: 235 --------------------------MRTTIAGNQRLGV-----STPDIVIDSGTTLTFL 263
+ G Q+L + ST +IDSGT +T L
Sbjct: 290 SFGSSQSKSAKFTPLSSGPSSFYNLDLTGITVGGQKLAIPLSVFSTAGTIIDSGTVVTRL 349
Query: 264 PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGA-DVKLSRS 320
P S L S + + P+ P L+ CY F+ +VP++ I F G DV + ++
Sbjct: 350 PPAAYSALRSAFRKAMASYPMGKPLSILDTCYDFSKYKTIKVPKIVISFSGGVDVDVDQA 409
Query: 321 NFFVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
FV VC F G T + I+GN Q NF V YD+ V F P C+
Sbjct: 410 GIFVANGLKQVCLAFAGNTGARDTAIFGNTQQRNFEVVYDVSGGKVGFAPASCS 463
>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 481
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 114/420 (27%), Positives = 175/420 (41%), Gaps = 82/420 (19%)
Query: 27 GGFSVELIHRDSPKS---PFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQ-- 81
G + ++L+HRD + Y+ S + R++ R + + + SS +
Sbjct: 69 GKWKLKLVHRDKITAFNKSSYDHSHNFHARIQRDKKRVATLIRRLSPRDATSSYSVEEFG 128
Query: 82 ADIIPN----NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
A+++ + Y IRI +G+PP E+ V D+GSD++W QC+PC +QCY Q P+FDP
Sbjct: 129 AEVVSGMNQGSGEYFIRIGVGSPPREQYVVIDSGSDIVWVQCQPC--TQCYHQTDPVFDP 186
Query: 138 KMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
S+++ +PCSSS C + C C+Y V YGDGS++ G LA ET+T G T + V
Sbjct: 187 ADSASFMGVPCSSSVCERIENAGCHAGGCRYEVMYGDGSYTKGTLALETLTFGRTVVRNV 246
Query: 198 ALPGITFGCGTNN----------------------------GGLFN----SKTT------ 219
A+ GCG N GG F+ S+ T
Sbjct: 247 AI-----GCGHRNRGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTDSAGSL 301
Query: 220 ----GIVGLGGGDISLISQMRT------TIAGNQRLGVSTP--------------DIVID 255
G + +G I LI R ++G G+ P +V+D
Sbjct: 302 EFGRGAMPVGAAWIPLIRNPRAPSFYYIRLSGVGVGGMKVPISEDVFQLNEMGNGGVVMD 361
Query: 256 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGA 313
+GT +T +P P A + CY+ N +VP V+ +F G
Sbjct: 362 TGTAVTRIPTVAYVAFRDAFIGQTGNLPRASGVSIFDTCYNLNGFVSVRVPTVSFYFAGG 421
Query: 314 DV-KLSRSNFFVKVSE-DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+ L NF + V + C F + + I GNI Q + +D V F P C
Sbjct: 422 PILTLPARNFLIPVDDVGTFCFAFAASPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 481
>gi|326499093|dbj|BAK06037.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 471
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 130/434 (29%), Positives = 195/434 (44%), Gaps = 99/434 (22%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQN---SSISSSKASQADI 84
GFSVE IHRDS +SPF++ S T R+ +A RS R +++ S+ +++
Sbjct: 34 GFSVEFIHRDSARSPFHDPSLTAPARVLEAARRSTVRAAALSRSYVRVDAPSADGFVSEL 93
Query: 85 IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP----------- 133
YL+ ++IGTPPT +A+ADTGSDLIW C Y D P
Sbjct: 94 TSTPFEYLMAVNIGTPPTRMVAIADTGSDLIWLNCS-------YGGDGPGLAAARDADAQ 146
Query: 134 ----LFDPKMSSTYKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVT 188
FDP S+T++ + C S C+ L + SC + C+YS SYGDGS ++G L+TET T
Sbjct: 147 PPGVQFDPSKSTTFRLVDCDSVACSELPEASCGADSKCRYSYSYGDGSHTSGVLSTETFT 206
Query: 189 LGSTTGQ-----AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQ 243
G + + FGC T G +S G+VGLGGGD+SL+SQ+ + +
Sbjct: 207 FADAPGARGDGTTTRVANVNFGCSTTFVG--SSVGDGLVGLGGGDLSLVSQLGADTSLGR 264
Query: 244 RLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ--------------------- 282
R + + + L F P+ ++ +V + +I +Q
Sbjct: 265 RFSYCLVPYSVKASSALNFGPRAAVTDPGAVTTPLIPSQVKAYYIVELRSVKVGNKTFEA 324
Query: 283 -------------------PVADP-----TGSLEL------------CYSFNSLSQ---- 302
+ DP TG ++L C+ + + +
Sbjct: 325 PDRSPLIVDSGTTLTFLPEALVDPLVKELTGRIKLPPAQSPERLLPLCFDVSGVREGQVA 384
Query: 303 --VPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP--IYGNIMQTNFLVGY 357
+P+VT+ GA V L N FV+V E +C ++ P I GNI Q N VGY
Sbjct: 385 AMIPDVTVGLGGGAAVTLKAENTFVEVQEGTLCLAVSAMSEQFPASIIGNIAQQNMHVGY 444
Query: 358 DIEQQTVSFKPTDC 371
D+++ TV+F P C
Sbjct: 445 DLDKGTVTFAPAAC 458
>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 90/210 (42%), Positives = 131/210 (62%), Gaps = 15/210 (7%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS-QADIIP 86
GF + L H DS K+ T +QR++ + R+ +RL N +SS A + ++
Sbjct: 42 GFRITLKHVDSDKN------LTKFQRIQHGIKRANHRLERLNAMVLAASSNAEINSPVLS 95
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
N +L+ ++IGTPP A+ DTGSDLIWTQC+PC +QC+ Q SP+FDPK SS++ L
Sbjct: 96 GNGEFLMNLAIGTPPETYSAIMDTGSDLIWTQCKPC--TQCFDQPSPIFDPKKSSSFSKL 153
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
CSS C +L Q SCS +C+Y +YGD S + G +ATET T G V++P + FGC
Sbjct: 154 SCSSQLCKALPQSSCSD-SCEYLYTYGDYSSTQGTMATETFTFGK-----VSIPNVGFGC 207
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
G +N G ++ +G+VGLG G +SL+SQ++
Sbjct: 208 GEDNEGDGFTQGSGLVGLGRGPLSLVSQLK 237
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 46/128 (35%), Positives = 77/128 (60%), Gaps = 7/128 (5%)
Query: 249 TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLS---QV 303
T ++IDSGTT+T+L + + +S + PV D +G+ LELCY+ S + +V
Sbjct: 314 TGGLIIDSGTTITYLEESAFDLVKKEFTSQM-GLPV-DNSGATGLELCYNLPSDTSELEV 371
Query: 304 PEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 363
P++ +HF GAD++L N+ + S V + G + + I+GN+ Q N V +D+E++T
Sbjct: 372 PKLVLHFTGADLELPGENYMIADSSMGVICLAMGSSGGMSIFGNVQQQNMFVSHDLEKET 431
Query: 364 VSFKPTDC 371
+SF PT+C
Sbjct: 432 LSFLPTNC 439
>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
Length = 459
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 123/416 (29%), Positives = 177/416 (42%), Gaps = 90/416 (21%)
Query: 30 SVELIHRDSPKSPFYNSSETPY--QRLRDALTRSLNRLNHFNQ-NSSISSSKASQADIIP 86
SV L+HR P +P SS+ P +RLR + RS ++ ++ N SI + D +
Sbjct: 60 SVPLVHRHGPCAPSTRSSDEPSLSERLRRSRARSKYIMSRASKSNVSIPTHLGGSVDSL- 118
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
Y++ + +GTP ++ + DTGSDL W QC PC + CY Q PLFDP SSTY +
Sbjct: 119 ---EYVVTVGLGTPAVSQVLLIDTGSDLSWVQCAPCNSTTCYPQKDPLFDPSRSSTYAPI 175
Query: 147 PCSSSQCASLNQK---------SCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
PC++ C L + S G C Y+++YGDGS + G + ET+T+ V
Sbjct: 176 PCNTDACRDLTRDGYGSDCTSGSGGGAQCGYAITYGDGSQTTGVYSNETLTMAP----GV 231
Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG---------NQRLG-- 246
+ FGCG + G N K G++GLGG SL+ Q + G N + G
Sbjct: 232 TVKDFHFGCGHDQDGP-NDKYDGLLGLGGAPESLVVQTSSVYGGAFSYCLPAANDQAGFL 290
Query: 247 ------------VSTPDI--------------------------------VIDSGTTLTF 262
V TP + +IDSGT +T
Sbjct: 291 ALGAPVNDASGFVFTPMVREQQTFYVVNMTGITVGGEPIDVPPSAFSGGMIIDSGTVVTE 350
Query: 263 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRGADVKLSRS 320
L + L + + A P+ P G L+ CY+F S VP V + F G +
Sbjct: 351 LQHTAYAALQAAFRKAMAAYPLL-PNGELDTCYNFTGHSNVTVPRVALTFSGG------A 403
Query: 321 NFFVKVSEDIV---CSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+ V + I+ C F+ G N I GN+ Q V YD+ V F C
Sbjct: 404 TVDLDVPDGILLDNCLAFQEAGPDNQPGILGNVNQRTLEVLYDVGHGRVGFGADAC 459
>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 354
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 111/355 (31%), Positives = 157/355 (44%), Gaps = 80/355 (22%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ NY +++ +G+P + DTGS L W QC+PC C++Q PLFDP S TYKSL
Sbjct: 9 GSGNYYVKVGLGSPARYYSMIVDTGSSLSWLQCKPCV-VYCHVQADPLFDPSASKTYKSL 67
Query: 147 PCSSSQC-----ASLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
C+SSQC A+LN C S C Y+ SYGD S+S G L+ + +TL + L
Sbjct: 68 SCTSSQCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQ----TL 123
Query: 200 PGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA------------------- 240
PG +GCG ++ GLF + GI+GLG +S++ Q+ +
Sbjct: 124 PGFVYGCGQDSEGLFG-RAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTRGGGGFLSIG 182
Query: 241 ---------------------------------GNQRLGVSTPDI----VIDSGTTLTFL 263
G + LGV+ +IDSGT +T L
Sbjct: 183 KASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPTIIDSGTVITRL 242
Query: 264 PQG----YNSNLLSVMSSMIEAQPVADPTGSLELCYSFN--SLSQVPEVTIHFR-GADVK 316
P + + +MSS P L+ C+ N + VPEV + F+ GAD+
Sbjct: 243 PMSVYTPFQQAFVKIMSSKYARAPGFS---ILDTCFKGNLKDMQSVPEVRLIFQGGADLN 299
Query: 317 LSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
L N ++V E + C F G N V I GN Q F V +DI + F C
Sbjct: 300 LRPVNVLLQVDEGLTCLAFAG-NNGVAIIGNHQQQTFKVAHDISTARIGFATGGC 353
>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 420
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 84/209 (40%), Positives = 127/209 (60%), Gaps = 19/209 (9%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
G+ + L H DS T + +R A+ RS R ++S A+ +
Sbjct: 22 GYRLVLTHVDS------KGGYTKTELMRRAVHRSRLR--------ALSGYDATSPRLHSV 67
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
YL+ ++IG PP +A+ADTGSDL WTQC+PC C+ QD+P++DP SST+ LP
Sbjct: 68 QVEYLMELAIGKPPVPFVALADTGSDLTWTQCQPC--KLCFPQDTPVYDPSASSTFSPLP 125
Query: 148 CSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
CSS+ C + ++C+ + C+Y +YGDG++S G L TET+TLG ++ V++ G+ FGC
Sbjct: 126 CSSATCLPIWSRNCTPSSLCRYRYAYGDGAYSAGILGTETLTLGPSSA-PVSVGGVAFGC 184
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
GT+NGG + +TG VGLG G +SL++Q+
Sbjct: 185 GTDNGG-DSLNSTGTVGLGRGTLSLLAQL 212
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 39/133 (29%), Positives = 64/133 (48%), Gaps = 6/133 (4%)
Query: 246 GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLSQV 303
G T +++DSGTT T L + ++ ++ ++ QP + + C+ +
Sbjct: 288 GDGTGGMIVDSGTTFTILAESGFREVVGRVARVL-GQPPVNASSLDAPCFPAPAGEPPYM 346
Query: 304 PEVTIHFRG-ADVKLSRSNFFVKVSEDI-VCSVFKGIT-NSVPIYGNIMQTNFLVGYDIE 360
P++ +HF G AD++L R N+ ED C G T S + GN Q N + +D
Sbjct: 347 PDLVLHFAGGADMRLYRDNYMSYNEEDSSFCLNIAGTTPESTSVLGNFQQQNIQMLFDTT 406
Query: 361 QQTVSFKPTDCTK 373
+SF PTDC+K
Sbjct: 407 VGQLSFLPTDCSK 419
>gi|449467979|ref|XP_004151699.1| PREDICTED: probable aspartic protease At2g35615-like, partial
[Cucumis sativus]
Length = 209
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 78/207 (37%), Positives = 118/207 (57%), Gaps = 10/207 (4%)
Query: 9 FILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF 68
IL + F + I G F+ L HRDS SP SS + Y RL +A RSL+R
Sbjct: 11 LILLLISFSQTTIINGDNG-FTTSLFHRDSLLSPLEFSSLSHYDRLTNAFRRSLSRSATL 69
Query: 69 NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
++ + + QA + P + YL+ +SIGTPP + + +ADTGSDL+W QC PC +CY
Sbjct: 70 LNRAATNGALDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQCLPCL--KCY 127
Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETV 187
Q P+FDP S+++ +PC+S C +++ C C YS +YGD +++ G+L E +
Sbjct: 128 KQSRPIFDPLKSTSFSHVPCNSQNCKAIDDSHCGAQGVCDYSYTYGDQTYTKGDLGFEKI 187
Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLF 214
T+GS++ ++V GCG +GG F
Sbjct: 188 TIGSSSVKSV------IGCGHESGGGF 208
>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
Length = 441
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 85/214 (39%), Positives = 125/214 (58%), Gaps = 16/214 (7%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSK------ASQ 81
GF ++L H D+ +S T Q L A+ RS R+ Q++++S + A++
Sbjct: 27 GFQLKLTHVDA------GTSYTKPQLLSRAIARSKARVAAL-QSAAVSPAPVADPITAAR 79
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
+ ++ YL+ ++IGTPP A+ DTGSDLIWTQC PC C Q +P FD K S+
Sbjct: 80 VLVTASSGEYLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPC--LLCAAQPTPYFDVKRSA 137
Query: 142 TYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
TY++LPC SS+CA+L+ SC C Y YGD + + G LA ET T G+ + V
Sbjct: 138 TYRALPCRSSRCAALSSPSCFKKMCVYQYYYGDTASTAGVLANETFTFGAASSTKVRAAN 197
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
I+FGCG+ N G + ++G+VG G G +SL+SQ+
Sbjct: 198 ISFGCGSLNAGEL-ANSSGMVGFGRGPLSLVSQL 230
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 57/124 (45%), Gaps = 4/124 (3%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF----NSLSQVPEVT 307
++IDSGT++T+L Q + ++S I + D L+ C+ + N VP+
Sbjct: 315 VIIDSGTSITWLQQDAYEAVRRGLASTIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDFV 374
Query: 308 IHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
HF GA++ L N+ + S + T+ I GN Q N + YDI +SF
Sbjct: 375 FHFDGANMTLPPENYMLIASTTGYLCLAMAPTSVGTIIGNYQQQNLHLLYDIANSFLSFV 434
Query: 368 PTDC 371
P C
Sbjct: 435 PAPC 438
>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
Length = 333
Score = 148 bits (373), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 113/342 (33%), Positives = 154/342 (45%), Gaps = 73/342 (21%)
Query: 95 ISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC- 153
+ +GTP T+ + V DTGS L W QC PC S C+ Q P+F+PK SSTY S+ CS+ QC
Sbjct: 1 MGLGTPATQYVMVVDTGSSLTWLQCSPCLVS-CHRQSGPVFNPKSSSTYASVGCSAQQCS 59
Query: 154 ----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
A+LN +CS N C Y SYGD SFS G L+ +TV+ GST+ LP +GCG
Sbjct: 60 DLPSATLNPSACSSSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS-----LPNFYYGCGQ 114
Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMR-------------------------------- 236
+N GLF ++ G++GL +SL+ Q+
Sbjct: 115 DNEGLFG-RSAGLIGLARNKLSLLYQLAPSLGYSFTYCLPSSSSSGYLSLGSYNPGQYSY 173
Query: 237 -------------------TTIAGN-----QRLGVSTPDIVIDSGTTLTFLPQGYNSNLL 272
T+AGN S P I+ DSGT +T LP S L
Sbjct: 174 TPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLPTII-DSGTVITRLPTSVYSALS 232
Query: 273 SVMSSMIEAQPVADPTGSLELCYSFN-SLSQVPEVTIHFR-GADVKLSRSNFFVKVSEDI 330
+++ ++ A L+ C+ S P VT+ F GA +KLS N V V +
Sbjct: 233 KAVAAAMKGTSRASAYSILDTCFKGQASRVSAPAVTMSFAGGAALKLSAQNLLVDVDDST 292
Query: 331 VCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
C F S I GN Q F V YD++ + F C+
Sbjct: 293 TCLAF-APARSAAIIGNTQQQTFSVVYDVKSSRIGFAAGGCS 333
>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
Length = 437
Score = 147 bits (372), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 75/187 (40%), Positives = 113/187 (60%), Gaps = 10/187 (5%)
Query: 49 TPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVA 108
T Y+ ++ A+ R R+ N + + SS + + + YL+ ++IGTP + A+
Sbjct: 56 TKYELIKRAIKRGERRMRSIN--AMLQSSSGIETPVYAGSGEYLMNVAIGTPASSLSAIM 113
Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQY 168
DTGSDLIWTQCEPC +QC+ Q +P+F+P+ SS++ +LPC S C L +SC +CQY
Sbjct: 114 DTGSDLIWTQCEPC--TQCFSQPTPIFNPQDSSSFSTLPCESQYCQDLPSESCYN-DCQY 170
Query: 169 SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGD 228
+ YGDGS + G +ATET T + ++P I FGCG +N G G++G+G G
Sbjct: 171 TYGYGDGSSTQGYMATETFTF-----ETSSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGP 225
Query: 229 ISLISQM 235
+SL SQ+
Sbjct: 226 LSLPSQL 232
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 42/127 (33%), Positives = 65/127 (51%), Gaps = 4/127 (3%)
Query: 249 TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF---NSLSQVPE 305
T ++IDSGTTLT+LPQ + + + I PV + + L C+ S QVPE
Sbjct: 308 TGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLSPVDESSSGLSTCFQLPSDGSTVQVPE 367
Query: 306 VTIHFRGADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 364
+++ F G + L N + +E ++C ++ + I+GNI Q V YD++ V
Sbjct: 368 ISMQFDGGVLNLGEENVLISPAEGVICLAMGSSSQQGISIFGNIQQQETQVLYDLQNLAV 427
Query: 365 SFKPTDC 371
SF PT C
Sbjct: 428 SFVPTQC 434
>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
Length = 458
Score = 147 bits (372), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 132/412 (32%), Positives = 184/412 (44%), Gaps = 75/412 (18%)
Query: 28 GFSVELIHRDSPKSPFYNSS-ETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP 86
G +V L HR P SP + T +RLR R+ F+ I S A+
Sbjct: 54 GVTVPLHHRYDPCSPVPSKKVPTLEERLRRDQLRAAYIKRKFSGAGDIEQSDAATVPTTL 113
Query: 87 NNA----NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
+ Y+I + IG+P + DTGSD+ W QC+PC SQC+ + LFDP SST
Sbjct: 114 GTSLSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPC--SQCHSEVDSLFDPSSSST 171
Query: 143 YKSLPCSSSQCASLNQK----SCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
Y CSS+ CA L+Q C CQY V+YGD S + G +++T+TLGS+ A
Sbjct: 172 YSPFSCSSAPCAQLSQSQEGNGCMSSQCQYIVNYGDSSSTTGTYSSDTLTLGSS-----A 226
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA------------------ 240
+ FGC + G FN +T G++GLGGG SL SQ T
Sbjct: 227 MTDFQFGCSQSESGGFNDQTDGLMGLGGGAQSLASQTAGTFGTAFSYCLPPTSGSSGFLT 286
Query: 241 --------------------------------GNQRLGVST----PDIVIDSGTTLTFLP 264
G+Q+L + T ++DSGT +T LP
Sbjct: 287 LGTGSSGFVKTPMLRSTQIPTYYVVLLESIKVGSQQLNLPTSVFSAGSLMDSGTIITRLP 346
Query: 265 QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-GADVKLSRSN 321
S L S + ++ P A P+G L+ C+ F+ S +P VT+ F GA V L+
Sbjct: 347 PTAYSALSSAFKAGMQQYPPATPSGILDTCFDFSGQSSISIPTVTLVFSGGAAVDLAFDG 406
Query: 322 FFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+++S I C F G +S+ I GN+ Q F V YD+ V FK C
Sbjct: 407 IMLEISSSIRCLAFTPNGDDSSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 458
>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 353
Score = 147 bits (371), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 115/356 (32%), Positives = 172/356 (48%), Gaps = 79/356 (22%)
Query: 93 IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
+ +SIG P + A+ DTGSDLIWTQC+PC ++C+ Q +P+FDP+ SS+Y + CSS
Sbjct: 1 MELSIGNPAVKYSAIVDTGSDLIWTQCKPC--TECFDQPTPIFDPEKSSSYSKVGCSSGL 58
Query: 153 CASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTL-------GSTTGQAVALPGIT 203
C +L + +C+ C+Y +YGD S + G LATET T G G V G
Sbjct: 59 CNALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDENSISGIGFGCGVENEGDG 118
Query: 204 FGCGTNNGGL---------------------------------FNSKTTGIVGLGGGDI- 229
F G+ GL S +GIV G +
Sbjct: 119 FSQGSGLVGLGRGPLSLISQLKETKFSYCLTSIEDSEASSSLFIGSLASGIVNKTGASLD 178
Query: 230 -------SLIS----------QMRTTIAGNQRLGVS----------TPDIVIDSGTTLTF 262
SL+ +++ G +RL V T ++IDSGTT+T+
Sbjct: 179 GEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGMIIDSGTTITY 238
Query: 263 LPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ---VPEVTIHFRGADVKL 317
L + L +S + + PV D +GS L+LC+ ++ VP++ HF+GAD++L
Sbjct: 239 LEETAFKVLKEEFTSRM-SLPV-DDSGSTGLDLCFKLPDAAKNIAVPKMIFHFKGADLEL 296
Query: 318 SRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
N+ V S V + G +N + I+GN+ Q NF V +D+E++TVSF PT+C K
Sbjct: 297 PGENYMVADSSTGVLCLAMGSSNGMSIFGNVQQQNFNVLHDLEKETVSFVPTECGK 352
>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 481
Score = 147 bits (371), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 121/427 (28%), Positives = 177/427 (41%), Gaps = 93/427 (21%)
Query: 30 SVELIHRDSP---------------KSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSI 74
S+E++H+ P + N + ++ L+++L R N + S
Sbjct: 62 SLEVVHKHGPCSQLNHNGKAKTTISHTDIMNLDNERVKYIQSRLSKNLGRENSVKELDST 121
Query: 75 SSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
+ S + I +ANY + + +GTP + V DTGSDL WTQCEPC S CY Q +
Sbjct: 122 TLPAKSGSLI--GSANYFVVVGLGTPKRDLSLVFDTGSDLTWTQCEPCAGS-CYKQQDAI 178
Query: 135 FDPKMSSTYKSLPCSSSQCASLNQK------SCSGVNCQYSVSYGDGSFSNGNLATETVT 188
FDP SS+Y ++ C+SS C L S S C Y + YGD S S G L+ E +T
Sbjct: 179 FDPSKSSSYINITCTSSLCTQLTSAGIKSRCSSSTTACIYGIQYGDKSTSVGFLSQERLT 238
Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR------------ 236
+ +T + FGCG +N GLF S + G++GLG IS + Q
Sbjct: 239 ITATD----IVDDFLFGCGQDNEGLF-SGSAGLIGLGRHPISFVQQTSSIYNKIFSYCLP 293
Query: 237 --------------------------TTIAGNQRL-----------GVSTPDI------- 252
+TI+G+ G P +
Sbjct: 294 STSSSLGHLTFGASAATNANLKYTPLSTISGDNTFYGLDIVGISVGGTKLPAVSSSTFSA 353
Query: 253 ---VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVT 307
+IDSGT +T L + L S +E PVA+ G + CY F+ + VP++
Sbjct: 354 GGSIIDSGTVITRLAPTAYAALRSAFRQGMEKYPVANEDGLFDTCYDFSGYKEISVPKID 413
Query: 308 IHFRGA-DVKLSRSNFFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDIEQQTV 364
F G V+L + S VC F G N + I+GN+ Q V YD+E +
Sbjct: 414 FEFAGGVTVELPLVGILIGRSAQQVCLAFAANGNDNDITIFGNVQQKTLEVVYDVEGGRI 473
Query: 365 SFKPTDC 371
F C
Sbjct: 474 GFGAAGC 480
>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
Length = 485
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 115/429 (26%), Positives = 168/429 (39%), Gaps = 100/429 (23%)
Query: 32 ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP----- 86
++HRD+ + + T + L+ L R R ++ + + P
Sbjct: 68 RVVHRDT-----FAVNATAGELLKHRLQRDKRRAARISEAAGAGGGNGRKGVAAPVVSGL 122
Query: 87 --NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
+ Y +I +GTP T+ L V DTGSD++W QC PC +CY Q P+FDP+ SS+Y
Sbjct: 123 AQGSGEYFTKIGVGTPATQALMVLDTGSDVVWVQCAPC--RRCYEQSGPVFDPRRSSSYG 180
Query: 145 SLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
++ C ++ C L+ C C Y V+YGDGS + G+ TET+T G VA +
Sbjct: 181 AVGCGAALCRRLDSGGCDLRRGACMYQVAYGDGSVTAGDFVTETLTFAG--GARVAR--V 236
Query: 203 TFGCGTNNGGLF--------------------------------------------NSKT 218
GCG +N GLF S
Sbjct: 237 ALGCGHDNEGLFVAAAGLLGLGRGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHR 296
Query: 219 TGIVGLGGGDISLISQMRTTIAGNQRL-------------------GVSTPDIVID---- 255
+ V G G + S T + N R+ GV+ D+ +D
Sbjct: 297 SSTVSFGAGSVGASSASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTG 356
Query: 256 -------SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTG--SLELCYSFNS--LSQVP 304
SGT++T L + S L + P G + CY + +VP
Sbjct: 357 RGGVIVDSGTSVTRLARASYSALRDAFRAAAAGGLRLSPGGFSLFDTCYDLGGRRVVKVP 416
Query: 305 EVTIHFR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 362
V++HF GA+ L N+ + V S C F G V I GNI Q F V +D + Q
Sbjct: 417 TVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQ 476
Query: 363 TVSFKPTDC 371
V F P C
Sbjct: 477 RVGFAPKGC 485
>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 519
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 110/349 (31%), Positives = 146/349 (41%), Gaps = 69/349 (19%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
NY++ + +GTP + V DTGSD W QC+PC CY Q LFDP SSTY ++
Sbjct: 176 GTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCV-VVCYEQREKLFDPARSSTYANV 234
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C++ C+ LN CSG +C Y V YGDGS+S G A +T+TL S A+ G FGC
Sbjct: 235 SCAAPACSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD----AVKGFRFGC 290
Query: 207 GTNNGGLF----------------------------------NSKTTGIVGLGGGDISLI 232
G N GLF S TG + G G ++
Sbjct: 291 GERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTGTGYLDFGAGSLAAA 350
Query: 233 SQMRTT------------------IAGNQRLGV-----STPDIVIDSGTTLTFLPQGYNS 269
TT G Q L + +T ++DSGT +T LP S
Sbjct: 351 RARLTTPMLTENGPTFYYVGMTGIRVGGQLLSIPQSVFATAGTIVDSGTVITRLPPAAYS 410
Query: 270 NL--LSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRG-ADVKLSRSNFFV 324
+L + A L+ CY F +SQV P V++ F+G A + + S
Sbjct: 411 SLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDASGIMY 470
Query: 325 KVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
S VC F + V I GN F V YDI ++ V F P C
Sbjct: 471 AASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYPGAC 519
>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
Length = 464
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 83/217 (38%), Positives = 120/217 (55%), Gaps = 23/217 (10%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSK-----ASQADI 84
S L+ RD+ Y S + D + R R + S ++ + S++ +
Sbjct: 60 SFALVRRDAVTGSTYPSRR---HAVLDLVARDNARAEYLASRLSPAAYQPTGFSGSESKV 116
Query: 85 I----PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
+ + Y +R+ IG+PPTE+ V D+GSD+IW QC+PC +CY Q PLFDP S
Sbjct: 117 VSGLDEGSGEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPC--LECYAQADPLFDPATS 174
Query: 141 STYKSLPCSSSQCASLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
+T+ ++PC S+ C +L C SG C Y VSYGDGS++ G LA ET+TLG T A
Sbjct: 175 ATFSAVPCGSAVCRTLRTSGCGDSG-GCDYEVSYGDGSYTKGALALETLTLGGT-----A 228
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
+ G+ GCG N GLF G++GLG G +SL+ Q+
Sbjct: 229 VEGVAIGCGHRNRGLFVG-AAGLLGLGWGPMSLVGQL 264
>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 460
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 115/355 (32%), Positives = 159/355 (44%), Gaps = 83/355 (23%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ NY +++ +G+PP + DTGS L W QC+PC C+ Q PLF+P S+TY+ L
Sbjct: 117 SGNYYLKLGLGSPPKYYTMILDTGSSLSWLQCKPCV-VYCHSQVDPLFEPSASNTYRPLY 175
Query: 148 CSSSQC-----ASLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
CSSS+C A+LN C SGV C Y+ SYGD S+S G L+ + +TL T Q LP
Sbjct: 176 CSSSECSLLKAATLNDPLCTASGV-CVYTASYGDASYSMGYLSRDLLTL--TPSQT--LP 230
Query: 201 GITFGCGTNNGGLFNSKTTGIVGL-----------------------------GGGDISL 231
T+GCG +N GLF K GIVGL GGG +S+
Sbjct: 231 SFTYGCGQDNEGLFG-KAAGIVGLARDKLSMLAQLSPKYGYAFSYCLPTSTSSGGGFLSI 289
Query: 232 ------------------------ISQMRTTIAGN----QRLGVSTPDIVIDSGTTLTFL 263
+ T+AG G P I IDSGT +T L
Sbjct: 290 GKISPSSYKFTPMIRNSQNPSLYFLRLAAITVAGRPVGVAAAGYQVPTI-IDSGTVVTRL 348
Query: 264 P----QGYNSNLLSVMSSMIEAQPVADPTGSLELCY--SFNSLSQVPEVTIHFRG-ADVK 316
P + +MS E P L+ C+ S S+S PE+ + F+G AD+
Sbjct: 349 PISIYAALREAFVKIMSRRYEQAPAYS---ILDTCFKGSLKSMSGAPEIRMIFQGGADLS 405
Query: 317 LSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
L N ++ + I C F +N + I GN Q + + YD+ + F P C
Sbjct: 406 LRAPNILIEADKGIACLAFAS-SNQIAIIGNHQQQTYNIAYDVSASKIGFAPGGC 459
>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 468
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 90/222 (40%), Positives = 125/222 (56%), Gaps = 20/222 (9%)
Query: 21 PIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS 80
P G V L H D+ + + T Q LR A RS +R++ ++ S KA+
Sbjct: 49 PAAGLLDGLRVPLTHVDA------HGNYTKLQLLRRAARRSHHRMSRLVARTATGSVKAA 102
Query: 81 -----QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
Q + N +L+ +SIGTP A+ DTGSDL+WTQC+PC +C+ Q +P+F
Sbjct: 103 AAPDLQVPVHAGNGEFLMDMSIGTPALAYAAIVDTGSDLVWTQCKPC--VECFNQSTPVF 160
Query: 136 DPKMSSTYKSLPCSSSQCASLNQKSCSGV--NCQYSVSYGDGSFSNGNLATETVTLGSTT 193
DP SSTY +LPCSSS C+ L +C+ +C Y+ +YGD S + G LA ET TL T
Sbjct: 161 DPSSSSTYSTLPCSSSLCSDLPTSTCTSAAKDCGYTYTYGDASSTQGVLAAETFTLAKTK 220
Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
LPG+ FGCG N G ++ G+VGLG G +SL+SQ+
Sbjct: 221 -----LPGVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQL 257
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 44/133 (33%), Positives = 69/133 (51%), Gaps = 11/133 (8%)
Query: 249 TPDIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTG-SLELCYSFNSLS----Q 302
T +++DSGT++T+L QGY + + M PVAD + L+LC+ + +
Sbjct: 338 TGGVIVDSGTSITYLELQGYRPLKKAFAAQM--KLPVADGSAVGLDLCFKAPASGVDDVE 395
Query: 303 VPEVTIHFRG-ADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 360
VP++ +HF G AD+ L N+ V S +C G + + I GN Q N YD++
Sbjct: 396 VPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVMG-SRGLSIIGNFQQQNIQFVYDVD 454
Query: 361 QQTVSFKPTDCTK 373
+ T+SF P C K
Sbjct: 455 KDTLSFAPVQCAK 467
>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 101/254 (39%), Positives = 138/254 (54%), Gaps = 33/254 (12%)
Query: 1 MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
MA FL V+IL L + +S + G +EL H D Y +E R+R A R
Sbjct: 1 MAAFL--VWILLLLPYVAISSTASH--GVRLELTHADDRGG--YVGAE----RVRRAADR 50
Query: 61 SLNRLNHF-----------NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVAD 109
S R+N F S + + ++A + + A YL+ I+IGTPP AV D
Sbjct: 51 SHRRVNGFLGAIEGPSSTARLGSDGAGAGGAEASVHASTATYLVDIAIGTPPLPLTAVLD 110
Query: 110 TGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--KSCS--GV 164
TGSDLIWTQC+ PC +C+ Q +PL+ P S+TY ++ C S C +L CS
Sbjct: 111 TGSDLIWTQCDAPC--RRCFPQPAPLYAPARSATYANVSCRSPMCQALQSPWSRCSPPDT 168
Query: 165 NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
C Y SYGDG+ ++G LATET TLGS T A+ G+ FGCGT N G ++G+VG+
Sbjct: 169 GCAYYFSYGDGTSTDGVLATETFTLGSDT----AVRGVAFGCGTENLG-STDNSSGLVGM 223
Query: 225 GGGDISLISQMRTT 238
G G +SL+SQ+ T
Sbjct: 224 GRGPLSLVSQLGVT 237
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/122 (28%), Positives = 62/122 (50%), Gaps = 2/122 (1%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS--LSQVPEVTIH 309
++IDSGTT T L + L ++S + + L LC++ S +VP + +H
Sbjct: 317 VIIDSGTTFTALEERAFVALARALASRVRLPLASGAHLGLSLCFAAASPEAVEVPRLVLH 376
Query: 310 FRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 369
F GAD++L R ++ V+ V + + + G++ Q N + YD+E+ +SF+P
Sbjct: 377 FDGADMELRRESYVVEDRSAGVACLGMVSARGMSVLGSMQQQNTHILYDLERGILSFEPA 436
Query: 370 DC 371
C
Sbjct: 437 KC 438
>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 441
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 84/212 (39%), Positives = 118/212 (55%), Gaps = 13/212 (6%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS----ISSSKASQAD 83
GF ++L H D+ +S T Q L A+ RS R+ + + A++
Sbjct: 28 GFQLKLTHVDA------GTSYTKLQLLSRAIARSKARVAALQSAAVLPPVVDPITAARVL 81
Query: 84 IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
+ ++ YL+ ++IGTPP A+ DTGSDLIWTQC PC C Q +P FD K S+TY
Sbjct: 82 VTASSGEYLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPC--LLCADQPTPYFDVKKSATY 139
Query: 144 KSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
++LPC SS+CASL+ SC C Y YGD + + G LA ET T G+ V I
Sbjct: 140 RALPCRSSRCASLSSPSCFKKMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIA 199
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
FGCG+ N G + ++G+VG G G +SL+SQ+
Sbjct: 200 FGCGSLNAGDL-ANSSGMVGFGRGPLSLVSQL 230
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 55/124 (44%), Gaps = 4/124 (3%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF----NSLSQVPEVT 307
++IDSGT++T+L Q + + S I + D L+ C+ + N VP++
Sbjct: 315 VIIDSGTSITWLQQDAYEAVRRGLVSAIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDLV 374
Query: 308 IHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
HF A++ L N+ + S + T I GN Q N + YDI +SF
Sbjct: 375 FHFDSANMTLLPENYMLIASTTGYLCLVMAPTGVGTIIGNYQQQNLHLLYDIGNSFLSFV 434
Query: 368 PTDC 371
P C
Sbjct: 435 PAPC 438
>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 86/215 (40%), Positives = 119/215 (55%), Gaps = 20/215 (9%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDA-------LTRSLNRLNHFNQNSSISSSKAS 80
G V L H D+ + + T Q LR A ++R + R SS + + A
Sbjct: 38 GLRVALTHVDA------HGNYTKLQLLRRAARRSRHRMSRLVARTTGVPVMSSKAVAPAL 91
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
Q + N +L+ +SIGTP A+ DTGSDL+WTQC+PC +C+ Q +P+FDP S
Sbjct: 92 QVPVHAGNGEFLMDMSIGTPAVAYAAIIDTGSDLVWTQCKPC--VECFNQSTPVFDPSSS 149
Query: 141 STYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
STY +LPCSS+ C+ L C+ C Y+ +YGD S + G LA ET TL T LP
Sbjct: 150 STYAALPCSSTLCSDLPSSKCTSAKCGYTYTYGDSSSTQGVLAAETFTLAKTK-----LP 204
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
+ FGCG N G ++ G+VGLG G +SL+SQ+
Sbjct: 205 DVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQL 239
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 42/132 (31%), Positives = 66/132 (50%), Gaps = 10/132 (7%)
Query: 249 TPDIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTG-SLELCYSFNSLS----Q 302
T +++DSGT++T+L QGY + + + M P AD +G L+ C+ + +
Sbjct: 320 TGGVIVDSGTSITYLELQGYRALKKAFAAQM--KLPAADGSGIGLDTCFEAPASGVDQVE 377
Query: 303 VPEVTIHFRGADVKLSRSNFFVKVS-EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQ 361
VP++ H GAD+ L N+ V S +C G + + I GN Q N YD+ +
Sbjct: 378 VPKLVFHLDGADLDLPAENYMVLDSGSGALCLTVMG-SRGLSIIGNFQQQNIQFVYDVGE 436
Query: 362 QTVSFKPTDCTK 373
T+SF P C K
Sbjct: 437 NTLSFAPVQCAK 448
>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
Length = 441
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 103/254 (40%), Positives = 138/254 (54%), Gaps = 33/254 (12%)
Query: 1 MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
MA FL V+IL L + +S + G +EL H D Y +E R+R A R
Sbjct: 1 MAAFL--VWILLLLPYVAISSTASH--GVRLELTHADDRGG--YVGAE----RVRRAADR 50
Query: 61 SLNRLNHFNQNSSISSSKA-----------SQADIIPNNANYLIRISIGTPPTERLAVAD 109
S R+N F SS A ++A + + A YL+ I+IGTPP AV D
Sbjct: 51 SHRRVNGFLGAIEGPSSTARLGIDGAGAGGAEASVHASTATYLVDIAIGTPPLPLTAVLD 110
Query: 110 TGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--KSCS--GV 164
TGSDLIWTQC+ PC +C+ Q +PL+ P S+TY ++ C S C +L CS
Sbjct: 111 TGSDLIWTQCDAPC--RRCFPQPAPLYAPARSATYANVSCRSPMCQALQSPWSRCSPPDT 168
Query: 165 NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
C Y SYGDG+ ++G LATET TLGS T A+ G+ FGCGT N G ++G+VG+
Sbjct: 169 GCAYYFSYGDGTSTDGVLATETFTLGSDT----AVRGVAFGCGTENLG-STDNSSGLVGM 223
Query: 225 GGGDISLISQMRTT 238
G G +SL+SQ+ T
Sbjct: 224 GRGPLSLVSQLGVT 237
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 35/122 (28%), Positives = 62/122 (50%), Gaps = 2/122 (1%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS--LSQVPEVTIH 309
++IDSGTT T L + L ++S + + L LC++ S +VP + +H
Sbjct: 317 VIIDSGTTFTALEESAFVALARALASRVRLPLASGAHLGLSLCFAAASPEAVEVPRLVLH 376
Query: 310 FRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 369
F GAD++L R ++ V+ V + + + G++ Q N + YD+E+ +SF+P
Sbjct: 377 FDGADMELRRESYVVEDRSAGVACLGMVSARGMSVLGSMQQQNTHILYDLERGILSFEPA 436
Query: 370 DC 371
C
Sbjct: 437 KC 438
>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
Length = 463
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 121/414 (29%), Positives = 179/414 (43%), Gaps = 79/414 (19%)
Query: 27 GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ---NSSISSSKASQAD 83
G S++L+HR P +P + +S P + L R R++ Q + +++SS
Sbjct: 59 GSSSLKLVHRFGPCNP-HRTSTAPASSFNEILRRDKLRVDSIIQARRSMNLTSSVEHMKS 117
Query: 84 IIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
+P ++Y++ + IGTP E + DTGS LIWTQC+PC CY + P+FD
Sbjct: 118 SVPFYGLSKITASDYIVNVGIGTPKKEMPLIFDTGSGLIWTQCKPC--KACYPK-VPVFD 174
Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
P S+++K LPCSS C S+ Q CS C Y +Y D S S G LATET++ +
Sbjct: 175 PTKSASFKGLPCSSKLCQSIRQ-GCSSPKCTYLTAYVDNSSSTGTLATETISF---SHLK 230
Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI--------------AGN 242
I GC G + +GI+GL ISL SQ G+
Sbjct: 231 YDFKNILIGCSDQVSGE-SLGESGIMGLNRSPISLASQTANIYDKLFSYCIPSTPGSTGH 289
Query: 243 QRLGVSTPDIV----------------------------------------IDSGTTLTF 262
G P+ V IDSG LT
Sbjct: 290 LTFGGKVPNDVRFSPVSKTAPSSDYDIKMTGISVGGRKLLIDASAFKIASTIDSGAVLTR 349
Query: 263 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRGA---DVKL 317
LP S L SV M++ P+ D L+ CY F++ S V P +++ F G D+ +
Sbjct: 350 LPPKAYSALRSVFREMMKGYPLLDQDDFLDTCYDFSNYSTVAIPSISVFFEGGVEMDIDV 409
Query: 318 SRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
S + V S+ + C F + + V I+GN Q + V +D ++ + F P C
Sbjct: 410 SGIMWQVPGSK-VYCLAFAELDDEVSIFGNFQQKTYTVVFDGAKERIGFAPGGC 462
>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
Length = 471
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 83/216 (38%), Positives = 117/216 (54%), Gaps = 22/216 (10%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS--------ISSSKASQ 81
S L+ RD+ Y S P + D ++R R + S S
Sbjct: 59 SFALVRRDAVTGATYPS---PRHAVLDLVSRDNARAEYLASRLSPAYQPTDFFGSESKVV 115
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
+ + + Y +R+ IG+PPTE+ V D+GSD+IW QC+PC +CY Q PLFDP S+
Sbjct: 116 SGLDEGSGEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPC--LECYAQADPLFDPASSA 173
Query: 142 TYKSLPCSSSQCASLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
T+ ++ C S+ C +L C SG C+Y VSYGDGS++ G LA ET+TLG T A+
Sbjct: 174 TFSAVSCGSAICRTLRTSGCGDSG-GCEYEVSYGDGSYTKGTLALETLTLGGT-----AV 227
Query: 200 PGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
G+ GCG N GLF G++GLG G +SL+ Q+
Sbjct: 228 EGVAIGCGHRNRGLF-VGAAGLLGLGWGPMSLVGQL 262
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 56/123 (45%), Gaps = 3/123 (2%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
+V+D+GT +T LPQ + L + A P A L+ CY + + +VP V+ +
Sbjct: 349 VVMDTGTAVTRLPQEAYAALRDAFVGAVGALPRAPGVSLLDTCYDLSGYTSVRVPTVSFY 408
Query: 310 FRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
F G A + L N ++V I C F ++ + I GNI Q + D + F P
Sbjct: 409 FDGAATLTLPARNLLLEVDGGIYCLAFAPSSSGLSILGNIQQEGIQITVDSANGYIGFGP 468
Query: 369 TDC 371
C
Sbjct: 469 ATC 471
>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
Length = 525
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 114/349 (32%), Positives = 149/349 (42%), Gaps = 69/349 (19%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
NY++ I +GTP V DTGSD W QCEPC CY Q LFDP SST ++
Sbjct: 182 GTGNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCV-VVCYEQQEKLFDPARSSTDANI 240
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C++ C+ L K CSG +C Y V YGDGS+S G A +T+TL S A+ G FGC
Sbjct: 241 SCAAPACSDLYTKGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD----AIKGFRFGC 296
Query: 207 GTNNGGLF----------------------------------NSKTTGIVGLGGGDISLI 232
G N GLF S TG + G G +
Sbjct: 297 GERNEGLFGEAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFPARSSGTGYLDFGPGSSPAV 356
Query: 233 SQMRTT------------------IAGNQRLGV-----STPDIVIDSGTTLTFLPQGYNS 269
S TT G + L + +T ++DSGT +T LP S
Sbjct: 357 STKLTTPMLVDNGLTFYYVGLTGIRVGGKLLSIPPSVFTTAGTIVDSGTVITRLPPAAYS 416
Query: 270 NLLSVMSSMIEAQ--PVADPTGSLELCYSFNSLSQV--PEVTIHFRGA---DVKLSRSNF 322
+L S +S I A+ A L+ CY F +SQV P V++ F+G DV S +
Sbjct: 417 SLRSAFASAIAARGYKKAPALSLLDTCYDFTGMSQVAIPTVSLLFQGGASLDVDASGIIY 476
Query: 323 FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
VS+ + + V I GN F V YDI ++ V F P C
Sbjct: 477 AASVSQACLGFAANEEDDDVGIVGNTQLKTFGVVYDIGKKVVGFSPGAC 525
>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
Length = 379
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 87/213 (40%), Positives = 119/213 (55%), Gaps = 15/213 (7%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS----ISSSKASQAD 83
GF ++L H D+ +S T Q L A+ RS R+ + + A++
Sbjct: 28 GFQLKLTHVDA------GTSYTKLQLLSRAIARSKARVAALQSAAVLPPVVDPITAARVL 81
Query: 84 IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
+ ++ YL+ ++IGTPP A+ DTGSDLIWTQC PC C Q +P FD K S+TY
Sbjct: 82 VTASSGEYLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPC--LLCADQPTPYFDVKKSATY 139
Query: 144 KSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
++LPC SS+CASL+ SC C Y YGD + + G LA ET T G+ V I
Sbjct: 140 RALPCRSSRCASLSSPSCFKKMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIA 199
Query: 204 FGCGT-NNGGLFNSKTTGIVGLGGGDISLISQM 235
FGCG+ N G L NS +G+VG G G +SL+SQ+
Sbjct: 200 FGCGSLNAGDLANS--SGMVGFGRGPLSLVSQL 230
>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 417
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 78/181 (43%), Positives = 118/181 (65%), Gaps = 7/181 (3%)
Query: 59 TRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
T + R H ++ ++S A+ + YL+ ++IGTPP +A+ADTGSDL WTQ
Sbjct: 34 TELMRRAAHRSRLQALSGYDANSPRLHSVQVEYLMELAIGTPPVPFVALADTGSDLTWTQ 93
Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-ASLNQKSCSGVN--CQYSVSYGDG 175
C+PC C+ QD+P++DP SST+ +PCSS+ C + ++CS + C+Y SY DG
Sbjct: 94 CQPC--KLCFPQDTPVYDPSASSTFSPVPCSSATCLPTWRSRNCSNPSSPCRYIYSYSDG 151
Query: 176 SFSNGNLATETVTLGSTT-GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
++S G L TET+T+GS+ GQ V++ + FGCGT+NGG + +TG VGLG G +SL++Q
Sbjct: 152 AYSVGILGTETLTIGSSVPGQTVSVGSVAFGCGTDNGG-DSLNSTGTVGLGRGTLSLLAQ 210
Query: 235 M 235
+
Sbjct: 211 L 211
>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 121/444 (27%), Positives = 187/444 (42%), Gaps = 105/444 (23%)
Query: 23 EAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS----ISSSK 78
+A G + L H D+ K + + +R A+ RS R + S +
Sbjct: 28 DAFAGDVRLHLTHVDAGKQ------MSRRELIRRAMQRSKARAAALSVARSGSGRVPGKS 81
Query: 79 ASQAD--------IIPN-NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM 129
A Q + + P+ + YLI ++IGTPP A+ DTGSDLIWTQC PC + C
Sbjct: 82 AQQGEQHQQPGVPVRPSGDLEYLIDLAIGTPPQPVSALLDTGSDLIWTQCAPC--ASCLA 139
Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVT 188
Q PLF P SS+Y + CS C + SC + C Y +YGDG+ + G ATE T
Sbjct: 140 QPDPLFAPAASSSYVPMRCSGQLCNDILHHSCQRPDTCTYRYNYGDGTTTLGVYATERFT 199
Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGG---------------------- 226
S++G+ +++P + FGCGT N G N+ + GIVG G
Sbjct: 200 FASSSGEKLSVP-LGFGCGTMNVGSLNNGS-GIVGFGRDPLSLVSQLSIRRFSYCLTPYT 257
Query: 227 ------------------GDISLISQMRTT-------------------IAGNQRLGVS- 248
GD + Q++TT G +RL +
Sbjct: 258 STRKSTLMFGSLSDGVFEGDDAATGQVQTTRLLQSRQNPTFYYVPFTGVTVGTRRLRIPL 317
Query: 249 -----TPD----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS--- 296
PD +++DSGT LT P + +L + + + + +C++
Sbjct: 318 SAFALRPDGSGGVIVDSGTALTLFPAAVLTEVLRAFRAQLRLPFTSSSSPDDGVCFATPM 377
Query: 297 --------FNSLSQVPEVTIHFRGADVKLSRSNFFVK-VSEDIVCSVFKGITNSVPIYGN 347
++ VP + HF+GAD++L R N+ + +C + +S GN
Sbjct: 378 AAGGRRASAATVVSVPRMAFHFQGADLELPRRNYVLDDPRRGSLCILLADSGDSGATIGN 437
Query: 348 IMQTNFLVGYDIEQQTVSFKPTDC 371
+Q + V YD+E +T+SF P C
Sbjct: 438 FVQQDMRVLYDLEAETLSFAPAQC 461
>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 463
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 122/415 (29%), Positives = 181/415 (43%), Gaps = 86/415 (20%)
Query: 32 ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANY 91
++I +D + F +S T + +R++ T + S+ S+ ++ + + NY
Sbjct: 59 DMITKDEERVRFLHSRLTNKESVRNSATT-----DKLRGGPSLVSTTPLKSGLSIGSGNY 113
Query: 92 LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC--- 148
++I +GTP + DTGS L W QC+PC C++Q P+F P S TYK+LPC
Sbjct: 114 YVKIGLGTPAKYFSMIVDTGSSLSWLQCQPCV-IYCHVQVDPIFTPSTSKTYKALPCSSS 172
Query: 149 --SSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
SS + ++LN CS C Y SYGD SFS G L+ + +TL T G +
Sbjct: 173 QCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYLSQDVLTL---TPSEAPSSGFVY 229
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMR---------------------------- 236
GCG +N GLF +++GI+GL IS++ Q+
Sbjct: 230 GCGQDNQGLFG-RSSGIIGLANDKISMLGQLSKKYGNAFSYCLPSSFSAPNSSSLSGFLS 288
Query: 237 -------------TTIAGNQR------------------LGVST-----PDIVIDSGTTL 260
T + NQ+ LGVS P I IDSGT +
Sbjct: 289 IGASSLTSSPYKFTPLVKNQKIPSLYFLDLTTITVAGKPLGVSASSYNVPTI-IDSGTVI 347
Query: 261 TFLPQG-YNSNLLSVMSSMIEAQPVADPTGSLELCY--SFNSLSQVPEVTIHFR-GADVK 316
T LP YN+ S + M + A L+ C+ S +S VPE+ I FR GA ++
Sbjct: 348 TRLPVAVYNALKKSFVLIMSKKYAQAPGFSILDTCFKGSVKEMSTVPEIQIIFRGGAGLE 407
Query: 317 LSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
L N V++ + C +N + I GN Q F V YD+ + F P C
Sbjct: 408 LKAHNSLVEIEKGTTCLAIAASSNPISIIGNYQQQTFKVAYDVANFKIGFAPGGC 462
>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 476
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 115/416 (27%), Positives = 175/416 (42%), Gaps = 83/416 (19%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQ---ADII 85
+ ++L HRD K P + P +R ++ ++R R++ + S S + +D++
Sbjct: 71 WKLKLFHRD--KLPLNFDPDHP-RRFKERISRDSKRVSSLLRLLSSGSDEQVTDFGSDVV 127
Query: 86 PN----NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
+ Y +RI +G+PP + V D+GSD++W QC+PC S+CY Q P+FDP S+
Sbjct: 128 SGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPC--SECYQQSDPVFDPAGSA 185
Query: 142 TYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
TY + C SS C L+ C+ C+Y VSYGDGS++ G LA ET+T G V +
Sbjct: 186 TYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGR-----VLIRN 240
Query: 202 ITFGC----------------------------GTNNGGLFN-------SKTTGIVGLGG 226
I GC G GG F+ +++TG + G
Sbjct: 241 IAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFGR 300
Query: 227 GD-------ISLISQMRT--------TIAGNQRLGVSTPD------------IVIDSGTT 259
G + LI R + G + V P+ +V+D+GT
Sbjct: 301 GAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGTA 360
Query: 260 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGADV-K 316
+T LP P +D + CY+ N +VP V+ +F G +
Sbjct: 361 VTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPILT 420
Query: 317 LSRSNFFVKVS-EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
L NF + V E C F + + I GNI Q + D V F PT C
Sbjct: 421 LPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC 476
>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
Length = 523
Score = 145 bits (365), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 82/213 (38%), Positives = 121/213 (56%), Gaps = 21/213 (9%)
Query: 33 LIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS----ISSSKASQADIIPNN 88
++HR P SP P + L R +R++ ++ ++ S AS+ +P +
Sbjct: 121 VVHRHGPCSPLLARGGEPSHA--EILDRDQDRVDSIHRMTAGPWTAGQSSASKGVSLPAH 178
Query: 89 -------ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
ANY++ + +GTP + L V DTGSDL W QC+PC + CY Q PLFDP S+
Sbjct: 179 RGLRLGTANYIVSVGLGTPRRDLLVVFDTGSDLSWVQCKPC--NNCYKQHDPLFDPSQST 236
Query: 142 TYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
TY ++PC + +C L+ +CS C+Y V YGD S ++GNLA +T+TLG ++ Q L G
Sbjct: 237 TYSAVPCGAQEC--LDSGTCSSGKCRYEVVYGDMSQTDGNLARDTLTLGPSSDQ---LQG 291
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
FGCG ++ GLF + G+ GLG +SL SQ
Sbjct: 292 FVFGCGDDDTGLFG-RADGLFGLGRDRVSLASQ 323
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 41/128 (32%), Positives = 54/128 (42%), Gaps = 5/128 (3%)
Query: 250 PDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVT 307
P VIDSGT +T LP S L S + + A L+ CY F + Q+P V
Sbjct: 396 PGTVIDSGTVITRLPSRAYSALRSSFAGFMRRYKRAPALSILDTCYDFTGRTKVQIPSVA 455
Query: 308 IHFR-GADVKLSRSNFFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTV 364
+ F GA + L + C F G SV I GN+ Q F V YD+ Q +
Sbjct: 456 LLFDGGATLNLGFGGVLYVANRSQACLAFASNGDDTSVGILGNMQQKTFAVVYDLANQKI 515
Query: 365 SFKPTDCT 372
F C+
Sbjct: 516 GFGAKGCS 523
>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
Length = 445
Score = 144 bits (364), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 89/214 (41%), Positives = 125/214 (58%), Gaps = 19/214 (8%)
Query: 31 VEL--IHRDSPKSPFYNSSETPYQRLRDALTRSLNRLN--HFNQNSSISSSKASQADIIP 86
VEL IH D S T Q +RDAL R ++R N +SS ++ ++ I P
Sbjct: 30 VELTRIHADP--------SVTASQFVRDALRRDMHRHNARQLAASSSNGTTVSAPTQISP 81
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
YL+ ++IGTPP A+ADTGSDLIWTQC PC SQC+ Q +PL++P S+T+ L
Sbjct: 82 TAGEYLMTLAIGTPPVSYQAIADTGSDLIWTQCAPC-SSQCFQQPTPLYNPSSSTTFAVL 140
Query: 147 PCSS--SQCASLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTT-GQAVALPG 201
PC+S S CA+ + G C Y+++YG G +++ +ET T GS+T +PG
Sbjct: 141 PCNSSLSMCAAALAGTTPPPGCTCMYNMTYGSG-WTSVYQGSETFTFGSSTPANQTGVPG 199
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
I FGC +GG S +G+VGLG G +SL+SQ+
Sbjct: 200 IAFGCSNASGGFNTSSASGLVGLGRGSLSLVSQL 233
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 68/129 (52%), Gaps = 12/129 (9%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVAD----PTGSLELCYSFNSLSQ----VP 304
+IDSGTT+T L + + + S++ P D TG L+LC+ S + +P
Sbjct: 318 IIDSGTTITLLGNTAYQQVRAAVVSLVTL-PTTDGGSAATG-LDLCFELPSSTSAPPTMP 375
Query: 305 EVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQT 363
+T+HF GAD+ L ++ + + ++ C + T+ V I GN Q N + YD+ Q+T
Sbjct: 376 SMTLHFDGADMVLPADSYMM-LDSNLWCLAMQNQTDGGVSILGNYQQQNMHILYDVGQET 434
Query: 364 VSFKPTDCT 372
++F P C+
Sbjct: 435 LTFAPAKCS 443
>gi|296082170|emb|CBI21175.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 144 bits (363), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 118/379 (31%), Positives = 177/379 (46%), Gaps = 48/379 (12%)
Query: 30 SVELIHRDSPKS---PFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSS-KASQADII 85
S+E++H+ P S P +S + Q L +R + + +N + S+ KAS+A +
Sbjct: 18 SLEVVHKHGPCSKLRPHKANSPSHTQILAQDESRVASIQSRLAKNLAGGSNLKASKATLP 77
Query: 86 PNNA------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
+A NY++ + +G+P + + DTGSDL WTQCEPC CY Q +FDP
Sbjct: 78 SKSASTLGSGNYVVTVGLGSPKRDLTFIFDTGSDLTWTQCEPC-VGYCYQQREHIFDPST 136
Query: 140 SSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
S +Y ++ C S C L N CS C Y + YGDGS+S G A E ++L ST
Sbjct: 137 SLSYSNVSCDSPSCEKLESATGNSPGCSSSTCLYGIRYGDGSYSIGFFAREKLSLTSTD- 195
Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVI 254
FGCG NN GLF T G++GL +SL+SQ T + P
Sbjct: 196 ---VFNNFQFGCGQNNRGLFGG-TAGLLGLARNPLSLVSQ--TAQKYGKVFSYCLPSSSS 249
Query: 255 DSGTTLTF---------------LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS 299
+G L+F LP S++ V ++ P L+ CY +
Sbjct: 250 STG-YLSFGSGDGDSKAVKFTPRLPPTVYSSVQKVFRELMSDYPRVKGVSILDTCYDLSK 308
Query: 300 LS--QVPEVTIHFR-GADVKLSRSN--FFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTN 352
+VP++ ++F GA++ L+ + +KVS+ VC F G + + V I GN+ Q
Sbjct: 309 YKTVKVPKIILYFSGGAEMDLAPEGIIYVLKVSQ--VCLAFAGNSDDDEVAIIGNVQQKT 366
Query: 353 FLVGYDIEQQTVSFKPTDC 371
V YD + V F P+ C
Sbjct: 367 IHVVYDDAEGRVGFAPSGC 385
>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 482
Score = 144 bits (362), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 119/429 (27%), Positives = 178/429 (41%), Gaps = 84/429 (19%)
Query: 19 VSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRS--LNRLNHFNQNSSISS 76
VS ++ F + L+HRD + + RDA+ + + RL+H +++
Sbjct: 62 VSGYKSDNNTFKLNLLHRDKLSHVHGHRRGFNDRMKRDAIRVATLVRRLSH-GAPAAVKD 120
Query: 77 SKASQA----DII----PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
S+ A D+I + Y +RI +G+PP + V D+GSD++W QC+PC S+CY
Sbjct: 121 SRYKVANFATDVISGMEAGSGEYFVRIGVGSPPRNQYMVIDSGSDIVWVQCKPC--SRCY 178
Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVT 188
Q P+FDP SS++ + C S C L C+ C+Y VSYGDGS++ G LA ET+T
Sbjct: 179 QQSDPVFDPADSSSFAGVSCGSDVCDRLENTGCNAGRCRYEVSYGDGSYTKGTLALETLT 238
Query: 189 LGSTTGQAVALPGITFGCGTNNGGLF---------------------------------- 214
+G + VA+ GCG N G+F
Sbjct: 239 VGQVMIRDVAI-----GCGHTNQGMFIGAAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVS 293
Query: 215 -NSKTTGIVGLGGGD-------ISLISQMRT------TIAGNQRLG--VSTPD------- 251
+ +TG + G G ISLI R +AG G VS P+
Sbjct: 294 RGTGSTGALEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTE 353
Query: 252 -----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVP 304
+V+D+GT +T P ++ P A + CY N +VP
Sbjct: 354 YGTNGVVMDTGTAVTRFPTAAYVAFRDSFTAQTSNLPRAPGVSIFDTCYDLNGFESVRVP 413
Query: 305 EVTIHFR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 362
V+ +F G + L NF + V C F + + I GNI Q + +D
Sbjct: 414 TVSFYFSDGPVLTLPARNFLIPVDGGGTFCLAFAPSPSGLSIIGNIQQEGIQISFDGANG 473
Query: 363 TVSFKPTDC 371
V F P C
Sbjct: 474 FVGFGPNIC 482
>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 450
Score = 144 bits (362), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 87/226 (38%), Positives = 124/226 (54%), Gaps = 24/226 (10%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQ--------RLRDALTRSLNRLNHFNQNSSISSSKA 79
G L H SP SP SS+ P+ R+ +R + + SS+ +
Sbjct: 41 GLHQTLHHPQSPCSPAPLSSDLPFSAFITHDAARIAGLASRLATKDKDWVAASSVPLASG 100
Query: 80 SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
+ + NY+ R+ +GTP T + V D+GS L W QC PC S C+ Q PL+DP+
Sbjct: 101 ASVGV----GNYITRLGLGTPTTTYVMVVDSGSSLTWLQCAPCAVS-CHPQAGPLYDPRA 155
Query: 140 SSTYKSLPCSSSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTT 193
SSTY ++PCS+ QC A+LN SCSG CQY SYGDGSFS G L+ +TV+L S+
Sbjct: 156 SSTYAAVPCSAPQCAELQAATLNPSSCSGSGVCQYQASYGDGSFSFGYLSKDTVSLSSSG 215
Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
+ PG +GCG +N GLF + G++GL +SL+SQ+ ++
Sbjct: 216 ----SFPGFYYGCGQDNVGLFG-RAAGLIGLARNKLSLLSQLAPSV 256
Score = 45.4 bits (106), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 37/122 (30%), Positives = 56/122 (45%), Gaps = 4/122 (3%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHFR 311
+IDSGT +T LP + L + + + A + L+ C+ VP V + F
Sbjct: 331 IIDSGTVITRLPTPVYTALSKAVGAALAAPSAPAYS-ILQTCFKGQVAKLPVPAVNMAFA 389
Query: 312 G-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
G A ++L+ N V V+E C F T+S I GN Q F V YD++ + F
Sbjct: 390 GGATLRLTPGNVLVDVNETTTCLAFA-PTDSTAIIGNTQQQTFSVVYDVKGSRIGFAAGG 448
Query: 371 CT 372
C+
Sbjct: 449 CS 450
>gi|413944596|gb|AFW77245.1| hypothetical protein ZEAMMB73_545774 [Zea mays]
gi|414876929|tpg|DAA54060.1| TPA: hypothetical protein ZEAMMB73_875469 [Zea mays]
Length = 459
Score = 144 bits (362), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 116/345 (33%), Positives = 161/345 (46%), Gaps = 72/345 (20%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y + S+GTPP + A+ADTGSDLIW +C + C Q SP + P SST+ LPCS
Sbjct: 91 YDMEFSMGTPPQKLTALADTGSDLIWAKCGGACTTSCEPQGSPSYLPNASSTFAKLPCSD 150
Query: 151 SQCASLNQKS-----CSGVNCQYSVSYG----DGSFSNGNLATETVTLGSTTGQAVALPG 201
C+ L S +G C Y SYG D ++ G LA ET TLG A A+P
Sbjct: 151 RLCSLLRSDSVAWCAAAGAECDYRYSYGLGDDDHHYTQGFLARETFTLG-----ADAVPS 205
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR------------------------- 236
+ FGC T +G+VGLG G +SL+SQ+
Sbjct: 206 VRFGC-TTASEGGYGSGSGLVGLGRGPLSLVSQLNASTFMYCLTSDASKASPLLFGSLAS 264
Query: 237 ---------------TTIAGNQRL---------GVSTPD-IVIDSGTTLTFLPQGYNSNL 271
T A N R GV P+ +V DSGTTLT+L + S
Sbjct: 265 LTGAQVQSTGLLASTTFYAVNLRSISIGSATTPGVGEPEGVVFDSGTTLTYLAEPAYSEA 324
Query: 272 LSVMSSMIEAQPVADPTGSLELCYSFN-----SLSQVPEVTIHFRGADVKLSRSNFFVKV 326
+ S V D G E C+ S + VP + +HF GAD+ L +N+ V+V
Sbjct: 325 KAAFLSQTSLDQVEDTDG-FEACFQKPANGRLSNAAVPTMVLHFDGADMALPVANYVVEV 383
Query: 327 SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+ +VC + + + S+ I GNIMQ N+LV +D+ + +SF+P +C
Sbjct: 384 EDGVVCWIVQR-SPSLSIIGNIMQVNYLVLHDVHRSVLSFQPANC 427
>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 518
Score = 144 bits (362), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 114/352 (32%), Positives = 154/352 (43%), Gaps = 75/352 (21%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
NY++ + +GTP + V DTGSD W QC+PC CY Q LFDP SSTY ++
Sbjct: 175 GTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCV-VVCYEQQEKLFDPARSSTYANV 233
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C++ C L+ + CSG +C Y V YGDGS+S G A +T+TL S A+ G FGC
Sbjct: 234 SCAAPACFDLDTRGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD----AVKGFRFGC 289
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQ--------------MRTTIAGNQRLGVSTPD- 251
G N GLF + G++GLG G SL Q R++ G G +P
Sbjct: 290 GERNEGLFG-EAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSSGTGYLDFGPGSPAA 348
Query: 252 --------IVIDSGTT-----------------------------------LTFLPQGYN 268
++ D+G T +T LP
Sbjct: 349 AGARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFATAGTIVDSGTVITRLPPPAY 408
Query: 269 SNLLSVMSSMIEAQ--PVADPTGSLELCYSFNSLSQV--PEVTIHFRGA---DVKLSRSN 321
S+L S S + A+ A L+ CY F +SQV P V++ F+G DV S
Sbjct: 409 SSLRSAFVSAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAILDVDASGIM 468
Query: 322 FFVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+ VS+ VC F + V I GN F V YDI ++ V F P C
Sbjct: 469 YAASVSQ--VCLGFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 518
>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
Length = 398
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 74/155 (47%), Positives = 101/155 (65%), Gaps = 4/155 (2%)
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
++ + +Y+ IS+GTP +ADTGSDLIW QC+PC C+ Q P+FDP+ S
Sbjct: 30 ESPVASGGGDYVTTISLGTPAKVFSVIADTGSDLIWIQCKPC--QACFNQKDPIFDPEGS 87
Query: 141 STYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
S+Y ++ C + C SL +KSCS NC YS YGDGS + G L++ETVTL ST G+ +A
Sbjct: 88 SSYTTMSCGDTLCDSLPRKSCS-PNCDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAAK 146
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
I FGCG N G FN +G+VGLG G++S +SQ+
Sbjct: 147 NIAFGCGHLNRGSFN-DASGLVGLGRGNLSFVSQL 180
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 48/155 (30%), Positives = 74/155 (47%), Gaps = 14/155 (9%)
Query: 228 DISLISQMRTTIAGNQRLGVSTPD----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP 283
DIS+ + AG+ + PD ++ DSGTTLT LP +L + S +
Sbjct: 243 DISIAGRALRIPAGSFDI---KPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKVSFPE 299
Query: 284 VADPTGSLELCYSFNS-----LSQVPEVTIHFRGADVKLSRSNFFVKVSE--DIVCSVFK 336
+ + L+LCY + ++P + HF GAD +L N+F+ ++ IVC
Sbjct: 300 IDGSSAGLDLCYDVSGSKASYKKKIPAMVFHFEGADHQLPVENYFIAANDAGTIVCLAMV 359
Query: 337 GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+ IYGN+MQ NF V YDI + + P+ C
Sbjct: 360 SSNMDIGIYGNMMQQNFRVMYDIGSSKIGWAPSQC 394
>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Glycine max]
Length = 392
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 114/391 (29%), Positives = 169/391 (43%), Gaps = 79/391 (20%)
Query: 52 QRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTG 111
+ ++ L+++L R N S ++ +++ + +ANY++ + +GTP + V DTG
Sbjct: 9 KYIQSRLSKNLGRENTVKDLDS--TTLPAESGSLIGSANYVVVVGLGTPKRDLSLVFDTG 66
Query: 112 SDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN----QKSCSG---V 164
SDL WTQCEPC S CY Q +FDP SS+Y ++ C+SS C L + CS
Sbjct: 67 SDLTWTQCEPCAGS-CYKQQDAIFDPSKSSSYTNITCTSSLCTQLTSDGIKSECSSSTDA 125
Query: 165 NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
+C Y YGD S S G L+ E +T+ +T + FGCG +N GLFN + G++GL
Sbjct: 126 SCIYDAKYGDNSTSVGFLSQERLTITATD----IVDDFLFGCGQDNEGLFNG-SAGLMGL 180
Query: 225 GGGDIS--------------------------------------LISQMRTTIAGNQRL- 245
G IS LI +TI+G+
Sbjct: 181 GRHPISIVQQTSSNYNKIFSYCLPATSSSLGHLTFGASAATNASLIYTPLSTISGDNSFY 240
Query: 246 ----------GVSTPDI----------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA 285
G P + +IDSGT +T L + L S +E PVA
Sbjct: 241 GLDIVSISVGGTKLPAVSSSTFSAGGSIIDSGTVITRLAPTVYAALRSAFRRXMEKYPVA 300
Query: 286 DPTGSLELCYSFNSLSQ--VPEVTIHFRGA-DVKLSRSNFFVKVSEDIVCSVF--KGITN 340
+ G L+ CY + + VP + F G V+L SE VC F G N
Sbjct: 301 NEAGLLDTCYDLSGYKEISVPRIDFEFSGGVTVELXHRGILXVESEQQVCLAFAANGSDN 360
Query: 341 SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+ ++GN+ Q V YD++ + F C
Sbjct: 361 DITVFGNVQQKTLEVVYDVKGGRIGFGAAGC 391
>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 454
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 90/217 (41%), Positives = 124/217 (57%), Gaps = 22/217 (10%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRS---LNRLNHFNQNSSISSSKAS---- 80
G V L H D+ + + + +Q LR A RS ++RL ++SSKA+
Sbjct: 40 GLRVHLTHVDA------HGNYSRHQLLRRAARRSHHRMSRLVARATGVPMTSSKAAGGGD 93
Query: 81 -QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
Q + N +L+ +SIGTP A+ DTGSDL+WTQC+PC C+ Q +P+FDP
Sbjct: 94 LQVPVHAGNGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPC--VDCFKQSTPVFDPSS 151
Query: 140 SSTYKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
SSTY ++PCSS+ C+ L C S C Y+ +YGD S + G LATET TL +
Sbjct: 152 SSTYATVPCSSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK----- 206
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
LPG+ FGCG N G S+ G+VGLG G +SL+SQ+
Sbjct: 207 LPGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQL 243
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 45/133 (33%), Positives = 67/133 (50%), Gaps = 11/133 (8%)
Query: 249 TPDIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTG-SLELCYSFNSLS----Q 302
T +++DSGT++T+L QGY + + + M A P AD +G L+LC+ + +
Sbjct: 324 TGGVIVDSGTSITYLEVQGYRALKKAFAAQM--ALPAADGSGVGLDLCFRAPAKGVDQVE 381
Query: 303 VPEVTIHFRG-ADVKLSRSNFFV-KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 360
VP + HF G AD+ L N+ V +C G + + I GN Q NF YD+
Sbjct: 382 VPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMG-SRGLSIIGNFQQQNFQFVYDVG 440
Query: 361 QQTVSFKPTDCTK 373
T+SF P C K
Sbjct: 441 HDTLSFAPVQCNK 453
>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 481
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 118/422 (27%), Positives = 171/422 (40%), Gaps = 88/422 (20%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFN--------QNSSISSSKA-- 79
S+E+IH+ P S R + L + +R+N + SK
Sbjct: 67 SLEVIHKHGPCSKLSQDKGRSPSRTQ-MLDQDESRVNSIRSRLAKNPADGGKLKGSKVTL 125
Query: 80 -SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
S++ NY++ + +GTP + + DTGSDL WTQCEPC CY Q P+F+P
Sbjct: 126 PSKSGSTIGTGNYVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPC-ARYCYHQQEPIFNPS 184
Query: 139 MSSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
S++Y ++ CSS C L N SCS C Y + YGD S+S G A + + L ST
Sbjct: 185 KSTSYTNISCSSPTCDELKSGTGNSPSCSASTCVYGIQYGDQSYSVGFFAQDKLALTSTD 244
Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI-------------- 239
FGCG NN GLF G++GLG +SL+SQ
Sbjct: 245 ----VFNNFLFGCGQNNRGLFVG-VAGLIGLGRNALSLVSQTAQKYGKLFSYCLPSTSSS 299
Query: 240 ---------AGNQRLGVSTPDIVIDSGTTLTFL--------------------------- 263
G + TP +V G + FL
Sbjct: 300 TGYLTFGSGGGTSKAVKFTPSLVNSQGPSFYFLNLIAISVGGRKLSTSASVFSTAGTIID 359
Query: 264 ---------PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR- 311
P Y S+L + + P A P L+ CY F+ VP++ ++F
Sbjct: 360 SGTVISRLPPTAY-SDLRASFQQQMSKYPKAAPASILDTCYDFSQYDTVDVPKINLYFSD 418
Query: 312 GADVKLSRSNFFVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPT 369
GA++ L S F ++ VC F G +++ + I GN+ Q F V YD+ + F P
Sbjct: 419 GAEMDLDPSGIFYILNISQVCLAFAGNSDATDIAILGNVQQKTFDVVYDVAGGRIGFAPG 478
Query: 370 DC 371
C
Sbjct: 479 GC 480
>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
Length = 444
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 90/217 (41%), Positives = 124/217 (57%), Gaps = 22/217 (10%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRS---LNRLNHFNQNSSISSSKAS---- 80
G V L H D+ + + + +Q LR A RS ++RL ++SSKA+
Sbjct: 30 GLRVHLTHVDA------HGNYSRHQLLRRAARRSHHRMSRLVARATGVPMTSSKAAGGGD 83
Query: 81 -QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
Q + N +L+ +SIGTP A+ DTGSDL+WTQC+PC C+ Q +P+FDP
Sbjct: 84 LQVPVHAGNGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPC--VDCFKQSTPVFDPSS 141
Query: 140 SSTYKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
SSTY ++PCSS+ C+ L C S C Y+ +YGD S + G LATET TL +
Sbjct: 142 SSTYATVPCSSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK----- 196
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
LPG+ FGCG N G S+ G+VGLG G +SL+SQ+
Sbjct: 197 LPGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQL 233
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 45/133 (33%), Positives = 67/133 (50%), Gaps = 11/133 (8%)
Query: 249 TPDIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTG-SLELCYSFNSLS----Q 302
T +++DSGT++T+L QGY + + + M A P AD +G L+LC+ + +
Sbjct: 314 TGGVIVDSGTSITYLEVQGYRALKKAFAAQM--ALPAADGSGVGLDLCFRAPAKGVDQVE 371
Query: 303 VPEVTIHFRG-ADVKLSRSNFFV-KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 360
VP + HF G AD+ L N+ V +C G + + I GN Q NF YD+
Sbjct: 372 VPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMG-SRGLSIIGNFQQQNFQFVYDVG 430
Query: 361 QQTVSFKPTDCTK 373
T+SF P C K
Sbjct: 431 HDTLSFAPVQCNK 443
>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
Length = 487
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 126/436 (28%), Positives = 193/436 (44%), Gaps = 93/436 (21%)
Query: 28 GFSVELIHR------DSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQ 81
G +++++HR D P ++ +R R + RS+ R + ++ +++ ++
Sbjct: 54 GSTLQIVHRACLQTGDDIAVPDHHHYTGILRRDRHRV-RSIYRRLTAAETTTTTTTIPAR 112
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
+ + Y++ I IGTPP + DTGSDL W QC PCP S CY Q PLFDP SS
Sbjct: 113 LGLAFQSLEYVVTIGIGTPPRNFTVLFDTGSDLTWVQCLPCPDSSCYPQQEPLFDPSKSS 172
Query: 142 TYKSLPCSSSQC--ASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
TY +PCS+ +C + Q C +C+YSV YGD S ++G+LA ET TL + A A
Sbjct: 173 TYVDVPCSAPECHIGGVQQTRCGATSCEYSVKYGDESETHGSLAEETFTLSPPSPLAPAA 232
Query: 200 PGITFGCGTNNGGLFNSK---TTGIVGLGGGDISLISQMRTTIAGNQRL--------GVS 248
G+ FGC +FN G++GLG GD S++SQ R +I + G S
Sbjct: 233 TGVVFGCSHEYISVFNDTGMGVAGLLGLGRGDSSILSQTRRSINSGGGVFSYCLPPRGSS 292
Query: 249 TPDIVIDSG-----------------TTLTFLPQGYNSNLLSV---------------MS 276
T + I G TT++ L Y NL V +
Sbjct: 293 TGYLTIGGGAAAPQQQYSNLSFTPLITTISQLRSAYVVNLAGVSVNGAAVDIPASAFSLG 352
Query: 277 SMIEA------------QPVAD------------PTGSLEL---CYSFNSLSQV--PEVT 307
++I++ P+ D P GS++L CY V P V
Sbjct: 353 AVIDSGTVVTHMPAAAYYPLRDEFRLHMGSYKMLPEGSMKLLDTCYDVTGQDVVTAPRVA 412
Query: 308 IHF-RGADVKLSRSNFFVKV-SED-------IVCSVFKGITNS--VPIYGNIMQTNFLVG 356
+ F GA + + S + + +ED + C F TNS + I GN+ Q + V
Sbjct: 413 LEFGGGARIDVDASGILLVLPAEDGSGQSLTLACLAFL-PTNSAGLVIVGNMQQRAYNVV 471
Query: 357 YDIEQQTVSFKPTDCT 372
+D++ + F P C+
Sbjct: 472 FDVDGGRIGFGPNGCS 487
>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 479
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 81/213 (38%), Positives = 117/213 (54%), Gaps = 16/213 (7%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRL--RDALTRSL--NRLNHFNQNSSISSSKASQADII 85
S+ L+HRD+ Y S+ L RD RL+ + + S S I
Sbjct: 70 SLALLHRDAVSGRTYPSTRHAMLGLAARDGARVEYLQRRLSPTTMTTEVGSEVVS--GIS 127
Query: 86 PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKS 145
+ Y +R+ +G+PPTE+ V D+GSD+IW QC PC ++CY Q PLFDP S+++ +
Sbjct: 128 EGSGEYFVRVGVGSPPTEQYLVVDSGSDVIWIQCRPC--AECYQQADPLFDPAASASFTA 185
Query: 146 LPCSSSQCASLNQKSCSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
+PC S C +L S + C+Y VSYGDGS++ G LA ET+T G +T + G+
Sbjct: 186 VPCDSGVCRTLPGGSSGCADSGACRYQVSYGDGSYTQGVLAMETLTFGDST----PVQGV 241
Query: 203 TFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
GCG N GLF G++GLG G +SL+ Q+
Sbjct: 242 AIGCGHRNRGLF-VGAAGLLGLGWGPMSLVGQL 273
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 36/125 (28%), Positives = 56/125 (44%), Gaps = 5/125 (4%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ-PVADPTGSLELCYSFNSLS--QVPEVTI 308
+V+D+GT +T LP + L +S I P A L+ CY + + +VP V +
Sbjct: 355 VVMDTGTAVTRLPPDAYAALRDAFASTIGGDLPRAPGVSLLDTCYDLSGYASVRVPTVAL 414
Query: 309 HF--RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
+F GA + L N V++ + C F + + I GNI Q + D V F
Sbjct: 415 YFGRDGAALTLPARNLLVEMGGGVYCLAFAASASGLSILGNIQQQGIQITVDSANGYVGF 474
Query: 367 KPTDC 371
P+ C
Sbjct: 475 GPSTC 479
>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
Length = 447
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 75/165 (45%), Positives = 111/165 (67%), Gaps = 10/165 (6%)
Query: 75 SSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
+SS A A + A YL+ ++IGTPP +A+ADTGSDL WTQC+PC C+ QD+P+
Sbjct: 77 TSSDAGPARLRSGQAEYLMELAIGTPPVPFVALADTGSDLTWTQCQPC--KLCFPQDTPI 134
Query: 135 FDPKMSSTYKSLPCSSSQCASL-NQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGS 191
+D +SS++ +PC+S+ C + + ++C+ + C+Y +YGDG++S G L TET+T
Sbjct: 135 YDTAVSSSFSPVPCASATCLPIWSSRNCTASSSPCRYRYAYGDGAYSAGVLGTETLTFPG 194
Query: 192 TTGQAVALPGITFGCGTNNGGL-FNSKTTGIVGLGGGDISLISQM 235
G +V GI FGCG +NGGL +NS TG VGLG G +SL++Q+
Sbjct: 195 APGVSVG--GIAFGCGVDNGGLSYNS--TGTVGLGRGSLSLVAQL 235
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 66/129 (51%), Gaps = 8/129 (6%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS----LSQVPEVT 307
+++DSGTT TFL + ++ ++ ++ QPV + + C+ + L +P++
Sbjct: 319 MIVDSGTTFTFLVESAFRVVVDHVAGVLR-QPVVNASSLDSPCFPAATGEQQLPAMPDMV 377
Query: 308 IHFRG-ADVKLSRSNF--FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 364
+HF G AD++L R N+ F + ++ + V I GN Q N + +DI +
Sbjct: 378 LHFAGGADMRLHRDNYMSFNQEESSFCLNIAGSPSADVSILGNFQQQNIQMLFDITVGQL 437
Query: 365 SFKPTDCTK 373
SF PTDC K
Sbjct: 438 SFMPTDCGK 446
>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
Length = 470
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 123/418 (29%), Positives = 171/418 (40%), Gaps = 90/418 (21%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS------ISSSKASQADI 84
+ L HR P +P SS + D L R H + S + KA+ A +
Sbjct: 66 LRLTHRHGPCAPLRASSLAA-PSVADTLRADQRRAEHILRRVSGRGAPQLWDYKAAAATV 124
Query: 85 IPN------NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
N +NY++ S+GTP + DTGSDL W QC+PC CY Q PLFDP
Sbjct: 125 PANWGYDIGTSNYVVTASLGTPGMAQTLEVDTGSDLSWVQCKPCAAPSCYRQKDPLFDPA 184
Query: 139 MSSTYKSLPCSSSQCASLN--QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
SS+Y ++PC S CA L +CS C Y VSYGDGS + G +++T+TL +
Sbjct: 185 QSSSYAAVPCGRSACAGLGIYASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLAAN---- 240
Query: 197 VALPGITFGCG-TNNGGLFNSKTTGIVGLGGGDISLISQ--------------MRTTIAG 241
+ G FGCG +GGLF + G++G G SL+ Q +++ G
Sbjct: 241 ATVQGFLFGCGHAQSGGLF-TGIDGLLGFGREQPSLVQQTAGAYGGVFSYCLPTKSSTTG 299
Query: 242 NQRLG-------------------------------------VSTPDIVIDSGT------ 258
LG +S P +GT
Sbjct: 300 YLTLGGPSGVAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQPLSVPASAFAAGTVVDTGT 359
Query: 259 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHF-RGADV 315
+T LP + L S S + + P A P G L+ CYSF V V + F GA +
Sbjct: 360 VITRLPPAAYAALRSAFRSGMASYPSAPPIGILDTCYSFAGYGTVNLTSVALTFSSGATM 419
Query: 316 KLSRSNFFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
L C F G S+ I GN+ Q +F V I+ +V F+P+ C
Sbjct: 420 TLGADGIM-----SFGCLAFASSGSDGSMAILGNVQQRSFEV--RIDGSSVGFRPSSC 470
>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 490
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 122/422 (28%), Positives = 183/422 (43%), Gaps = 88/422 (20%)
Query: 30 SVELIHRDSPKS---PFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSS-KASQADII 85
S+E++H+ P S P +S + Q L +R + + +N + S+ KAS+A +
Sbjct: 76 SLEVVHKHGPCSKLRPHKANSPSHTQILAQDESRVASIQSRLAKNLAGGSNLKASKATLP 135
Query: 86 PNNA------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
+A NY++ + +G+P + + DTGSDL WTQCEPC CY Q +FDP
Sbjct: 136 SKSASTLGSGNYVVTVGLGSPKRDLTFIFDTGSDLTWTQCEPCV-GYCYQQREHIFDPST 194
Query: 140 SSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
S +Y ++ C S C L N CS C Y + YGDGS+S G A E ++L ST
Sbjct: 195 SLSYSNVSCDSPSCEKLESATGNSPGCSSSTCLYGIRYGDGSYSIGFFAREKLSLTSTD- 253
Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGL------------------------------ 224
FGCG NN GLF T G++GL
Sbjct: 254 ---VFNNFQFGCGQNNRGLFGG-TAGLLGLARNPLSLVSQTAQKYGKVFSYCLPSSSSST 309
Query: 225 -------GGGDISLIS----------------QMRTTIAGNQRLGV-----STPDIVIDS 256
G GD + M G ++L + ST +IDS
Sbjct: 310 GYLSFGSGDGDSKAVKFTPSEVNSDYPSFYFLDMVGISVGERKLPIPKSVFSTAGTIIDS 369
Query: 257 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-GA 313
GT ++ LP S++ V ++ P L+ CY + +VP++ ++F GA
Sbjct: 370 GTVISRLPPTVYSSVQKVFRELMSDYPRVKGVSILDTCYDLSKYKTVKVPKIILYFSGGA 429
Query: 314 DVKLSRSN--FFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 369
++ L+ + +KVS+ VC F G + + V I GN+ Q V YD + V F P+
Sbjct: 430 EMDLAPEGIIYVLKVSQ--VCLAFAGNSDDDEVAIIGNVQQKTIHVVYDDAEGRVGFAPS 487
Query: 370 DC 371
C
Sbjct: 488 GC 489
>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
Length = 398
Score = 142 bits (357), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 73/155 (47%), Positives = 101/155 (65%), Gaps = 4/155 (2%)
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
++ + +Y+ IS+GTP +ADTGSDLIW QC+PC C+ Q P+FDP+ S
Sbjct: 30 ESPVASGGGDYVTTISLGTPAKVFSVIADTGSDLIWIQCKPC--QACFNQKDPIFDPEGS 87
Query: 141 STYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
S+Y ++ C + C SL +KSCS +C YS YGDGS + G L++ETVTL ST G+ +A
Sbjct: 88 SSYTTMSCGDTLCDSLPRKSCS-PDCDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAAK 146
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
I FGCG N G FN +G+VGLG G++S +SQ+
Sbjct: 147 NIAFGCGHLNRGSFN-DASGLVGLGRGNLSFVSQL 180
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 49/155 (31%), Positives = 74/155 (47%), Gaps = 14/155 (9%)
Query: 228 DISLISQMRTTIAGNQRLGVSTPD----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP 283
DIS+ + AG+ + PD ++ DSGTTLT LP +L + S I
Sbjct: 243 DISIAGRALRIPAGSFDI---KPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKISFPK 299
Query: 284 VADPTGSLELCYSFNSLS-----QVPEVTIHFRGADVKLSRSNFFVKVSE--DIVCSVFK 336
+ + L+LCY + ++P + HF GAD +L N+F+ ++ IVC
Sbjct: 300 IDGSSAGLDLCYDVSGSKASYKMKIPAMVFHFEGADYQLPVENYFIAANDAGTIVCLAMV 359
Query: 337 GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+ IYGN+MQ NF V YDI + + P+ C
Sbjct: 360 SSNMDIGIYGNMMQQNFRVMYDIGSSKIGWAPSQC 394
>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
Length = 473
Score = 142 bits (357), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 121/416 (29%), Positives = 172/416 (41%), Gaps = 79/416 (18%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQ-RLRDALTRSLNRLNHFNQNSSISSSKASQADIIP- 86
S+E++HR P N + + L + +R++ + S + +P
Sbjct: 63 LSLEVVHRSGPCIQVLNQEKAANAPSNMEILLQDRHRVDSIHARLSSHGVFQEKQATLPV 122
Query: 87 ------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
+ +Y + + +GTP E + DTGSDL WTQCEPC + CY Q P DP S
Sbjct: 123 QSGASIGSGDYAVTVGLGTPKKEFTLIFDTGSDLTWTQCEPCAKT-CYKQKEPRLDPTKS 181
Query: 141 STYKSLPCSSSQCASLNQ---KSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
++YK++ CSS+ C L+ +SCS C Y V YGDGS+S G ATET+TL S+
Sbjct: 182 TSYKNISCSSAFCKLLDTEGGESCSSPTCLYQVQYGDGSYSIGFFATETLTLSSSN---- 237
Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA----------------- 240
FGCG N GLF G++GLG +SL SQ
Sbjct: 238 VFKNFLFGCGQQNSGLFRG-AAGLLGLGRTKLSLPSQTAQKYKKLFSYCLPASSSSKGYL 296
Query: 241 --GNQ-------------------------RLGVSTPDIVID------SGTTL------T 261
G Q L V + ID SGT + T
Sbjct: 297 SFGGQVSKTVKFTPLSEDFKSTPFYGLDITELSVGGNKLSIDASIFSTSGTVIDSGTVIT 356
Query: 262 FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLSQVPEVTIHFRGA-DVKLS 318
LP S L S ++ P D + CY F N ++P+V + F+G ++ +
Sbjct: 357 RLPSTAYSALSSAFQKLMTDYPSTDGYSIFDTCYDFSKNETIKIPKVGVSFKGGVEMDID 416
Query: 319 RSNFFVKVSE-DIVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
S V+ VC F G + V I+GN Q + V YD + V F P+ C
Sbjct: 417 VSGILYPVNGLKKVCLAFAGNGDDVKAAIFGNTQQKTYQVVYDDAKGRVGFAPSGC 472
>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
Length = 488
Score = 142 bits (357), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 106/354 (29%), Positives = 148/354 (41%), Gaps = 78/354 (22%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y R+ +GTP V DTGSD++W QC PC +CY Q P+FDP S ++ ++P
Sbjct: 142 SGEYFTRLGVGTPARYVYMVLDTGSDIVWIQCAPC--IKCYSQTDPVFDPTKSRSFANIP 199
Query: 148 CSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTL-GSTTGQAVALPGITF 204
C S C L+ CS C Y VSYGDGSF+ G +TET+T G+ G+ V
Sbjct: 200 CGSPLCRRLDYPGCSTKKQICLYQVSYGDGSFTVGEFSTETLTFRGTRVGRVV------L 253
Query: 205 GCGTNNGGL------------------------FNSKTTGIVG------------LGGGD 228
GCG +N GL FNSK + +G G
Sbjct: 254 GCGHDNEGLFVGAAGLLGLGRGRLSFPSQIGRRFNSKFSYCLGDRSASSRPSSIVFGDSA 313
Query: 229 IS------------------LISQMRTTIAGNQRLGVSTP----------DIVIDSGTTL 260
IS + + ++ G + G+S ++IDSGT++
Sbjct: 314 ISRTTRFTPLLSNPKLDTFYYVELLGISVGGTRVSGISASLFKLDSTGNGGVIIDSGTSV 373
Query: 261 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRGADVKLS 318
T L + L A + C+ + ++V P V +HFRGADV L
Sbjct: 374 TRLTRAAYVALRDAFLVGASNLKRAPEFSLFDTCFDLSGKTEVKVPTVVLHFRGADVPLP 433
Query: 319 RSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
SN+ + V C F G + + I GNI Q F V YD+ V F P C
Sbjct: 434 ASNYLIPVDNSGSFCFAFAGTASGLSIIGNIQQQGFRVVYDLATSRVGFAPRGC 487
>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
Length = 499
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 115/420 (27%), Positives = 167/420 (39%), Gaps = 96/420 (22%)
Query: 39 PKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSKASQADIIP----------- 86
P+ Y Y+ L L R R N ++ S++D+ P
Sbjct: 88 PRETIYKIHHKDYKSLVLSRLHRDTVRFNSLTARLQLALEDISKSDLKPLETEIKPEDLS 147
Query: 87 ---------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
+ Y R+ +G P + V DTGSD+ W QC+PC + CY Q P+FDP
Sbjct: 148 TPVTSGTSQGSGEYFTRVGVGNPARQFYMVLDTGSDINWLQCQPC--TDCYQQTDPIFDP 205
Query: 138 KMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
SSTY + C S QC+SL SC C Y V+YGDGS++ G+ ATE+V+ G++
Sbjct: 206 TASSTYAPVTCQSQQCSSLEMSSCRSGQCLYQVNYGDGSYTFGDFATESVSFGNSG---- 261
Query: 198 ALPGITFGCGTNNGGL------------------------------------------FN 215
++ + GCG +N GL FN
Sbjct: 262 SVKNVALGCGHDNEGLFVGAAGLLGLGGGPLSLTNQLKATSFSYCLVNRDSAGSSTLDFN 321
Query: 216 SKTTGI------------------VGLGGGDISLISQMRTTIAGNQRLGVS-TPDIVIDS 256
S G+ VGL G +S+ QM + RL S I++D
Sbjct: 322 SAQLGVDSVTAPLMKNRKIDTFYYVGLSG--MSVGGQMVSIPESTFRLDESGNGGIIVDC 379
Query: 257 GTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-G 312
GT +T L Q YN L M + + + CY + + +VP V+ HF G
Sbjct: 380 GTAITRLQTQAYNP-LRDAFVRMTQNLKLTSAVALFDTCYDLSGQASVRVPTVSFHFADG 438
Query: 313 ADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
L +N+ + V S C F T+S+ I GN+ Q V +D+ + F P C
Sbjct: 439 KSWNLPAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVTFDLANNRMGFSPNKC 498
>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
Length = 460
Score = 141 bits (355), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 129/426 (30%), Positives = 177/426 (41%), Gaps = 91/426 (21%)
Query: 22 IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQ 81
+ + +G +V L HR P SP + + P L D L R R + + S K Q
Sbjct: 50 VRSSSGATTVPLHHRHGPCSPL-PTKKMP--SLEDRLHRDQLRAAYIKRKFSGDVKKDGQ 106
Query: 82 AD--------IIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
+P N YLI + +G+P + + D+GSD+ W QC+PC Q
Sbjct: 107 GAGGVEQSHVTVPTTLGTSLNTLEYLITVRLGSPAKTQTVLIDSGSDVSWVQCKPC--LQ 164
Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKS---CSGVNCQYSVSYGDGSFSNGNLA 183
C+ Q PLFDP +SSTY CSS+ CA L Q S CQY V Y DGS + G +
Sbjct: 165 CHSQVDPLFDPSLSSTYSPFSCSSAACAQLGQDGNGCSSSSQCQYIVRYADGSSTTGTYS 224
Query: 184 TETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI---- 239
++T+ LGS T + FGC G FN T G++GLGGG SL SQ T
Sbjct: 225 SDTLALGSNT-----ISNFQFGCSHVESG-FNDLTDGLMGLGGGAPSLASQTAGTFGTAF 278
Query: 240 ----------AGNQRLG------VSTP--------------------------------- 250
+G LG V TP
Sbjct: 279 SYCLPPTPSSSGFLTLGAGTSGFVKTPMLRSSPVPTFYGVRLEAIRVGGTQLSIPTSVFS 338
Query: 251 -DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVT 307
+V+DSGT +T LP+ S L S + ++ A P ++ C+ F+ S V P V
Sbjct: 339 AGMVMDSGTIITRLPRTAYSALSSAFKAGMKQYRPAPPRSIMDTCFDFSGQSSVRLPSVA 398
Query: 308 IHFRGADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVS 365
+ F G V +N + + C F ++ S I GN+ Q F V YD+ V
Sbjct: 399 LVFSGGAVVNLDANGIILGN----CLAFAANSDDSSPGIVGNVQQRTFEVLYDVGGGAVG 454
Query: 366 FKPTDC 371
FK C
Sbjct: 455 FKAGAC 460
>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
gi|223948487|gb|ACN28327.1| unknown [Zea mays]
Length = 434
Score = 140 bits (354), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 110/350 (31%), Positives = 147/350 (42%), Gaps = 72/350 (20%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
NY++ + +GTP V DTGSD W QC+PC + CY Q PLFDP S+TY ++
Sbjct: 92 GTGNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCV-AYCYRQKEPLFDPTKSATYANI 150
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
CSSS C+ L CSG +C Y + YGDGS++ G A +T+TL T + FGC
Sbjct: 151 SCSSSYCSDLYVSGCSGGHCLYGIQYGDGSYTIGFYAQDTLTLAYDT-----IKNFRFGC 205
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG------------------------- 241
G N GLF + G++GLG G SL Q G
Sbjct: 206 GEKNRGLFG-RAAGLLGLGRGKTSLPVQAYDKYGGVFAYCLPATSAGTGFLDLGPGAPAA 264
Query: 242 NQRL-----------------GV--------------STPDIVIDSGTTLTFLPQGYNSN 270
N RL G+ ST ++DSGT +T LP +
Sbjct: 265 NARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFSTAGTLVDSGTVITRLPPSAYAP 324
Query: 271 LLSVMSSMIEAQPV-ADPTGS-LELCYSFNSLS----QVPEVTIHFRGA---DVKLSRSN 321
L S S ++ A P S L+ CY +P V++ F+G DV S
Sbjct: 325 LRSAFSKAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLVFQGGACLDVDASGIL 384
Query: 322 FFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+ VS+ + V I GN Q V YDI ++ V F P C
Sbjct: 385 YVADVSQACLAFAPNADDTDVAIVGNTQQKTHGVLYDIGKKIVGFAPGAC 434
>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
Length = 464
Score = 140 bits (354), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 78/195 (40%), Positives = 115/195 (58%), Gaps = 15/195 (7%)
Query: 49 TPYQRLRDALTRSLNRLNHFN--QNSSISSSKASQAD--IIPNNANYLIRISIGTPPTER 104
T ++ LR A+ RS RL + + S+ KA A+ I+P YL+++ IGTPP +
Sbjct: 43 TEHELLRRAIQRSRYRLAGIGMARGEAASARKAVVAETPIMPAGGEYLVKLGIGTPPYKF 102
Query: 105 LAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV 164
A DT SDLIWTQC+PC + CY Q P+F+P++SSTY +LPCSS C L+ C
Sbjct: 103 TAAIDTASDLIWTQCQPC--TGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHD 160
Query: 165 N---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN-GGLFNSKTTG 220
+ CQY+ +Y + + G LA + + +G A G+ FGC T++ GG + +G
Sbjct: 161 DDESCQYTYTYSGNATTEGTLAVDKLVIGED-----AFRGVAFGCSTSSTGGAPPPQASG 215
Query: 221 IVGLGGGDISLISQM 235
+VGLG G +SL+SQ+
Sbjct: 216 VVGLGRGPLSLVSQL 230
Score = 46.2 bits (108), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 34/128 (26%), Positives = 61/128 (47%), Gaps = 9/128 (7%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY------SFNSLSQVPE 305
++ID +T+TFL L++ + I + L+LC+ +F+ + VP
Sbjct: 334 MIIDIASTITFLEASLYDELVNDLEVEIRLPRGTGSSLGLDLCFILPDGVAFDRV-YVPA 392
Query: 306 VTIHFRGADVKLSRSNFFVKVSED-IVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 363
V + F G ++L ++ F + E ++C V + SV I GN Q N V Y++ +
Sbjct: 393 VALAFDGRWLRLDKARLFAEDRESGMMCLMVGRAEAGSVSILGNFQQQNMQVLYNLRRGR 452
Query: 364 VSFKPTDC 371
V+F + C
Sbjct: 453 VTFVQSPC 460
>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
Length = 499
Score = 140 bits (354), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 110/350 (31%), Positives = 147/350 (42%), Gaps = 72/350 (20%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
NY++ + +GTP V DTGSD W QC+PC + CY Q PLFDP S+TY ++
Sbjct: 157 GTGNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCV-AYCYRQKEPLFDPTKSATYANI 215
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
CSSS C+ L CSG +C Y + YGDGS++ G A +T+TL T + FGC
Sbjct: 216 SCSSSYCSDLYVSGCSGGHCLYGIQYGDGSYTIGFYAQDTLTLAYDT-----IKNFRFGC 270
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG------------------------- 241
G N GLF + G++GLG G SL Q G
Sbjct: 271 GEKNRGLFG-RAAGLLGLGRGKTSLPVQAYDKYGGVFAYCLPATSAGTGFLDLGPGAPAA 329
Query: 242 NQRL-----------------GV--------------STPDIVIDSGTTLTFLPQGYNSN 270
N RL G+ ST ++DSGT +T LP +
Sbjct: 330 NARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFSTAGTLVDSGTVITRLPPSAYAP 389
Query: 271 LLSVMSSMIEAQPV-ADPTGS-LELCYSFNSLS----QVPEVTIHFRGA---DVKLSRSN 321
L S S ++ A P S L+ CY +P V++ F+G DV S
Sbjct: 390 LRSAFSKAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLVFQGGACLDVDASGIL 449
Query: 322 FFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+ VS+ + V I GN Q V YDI ++ V F P C
Sbjct: 450 YVADVSQACLAFAPNADDTDVAIVGNTQQKTHGVLYDIGKKIVGFAPGAC 499
>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
Length = 469
Score = 140 bits (354), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 124/436 (28%), Positives = 180/436 (41%), Gaps = 100/436 (22%)
Query: 22 IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF--NQNSSISSSKA 79
+E + S+ L+HR P +P S P + + L RS R N+ + S+ A
Sbjct: 48 LEPSSATVSMSLVHRYGPCAP-SQYSNVPTPSISETLRRSRARTNYIMSQASKSMGMGMA 106
Query: 80 SQAD------IIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
S D IP ++ Y++ + GTP ++ + DTGSD+ W QC PC ++
Sbjct: 107 STPDDDDAAVTIPTRLGGFVDSLEYVVTLGFGTPSVPQVLLMDTGSDVSWVQCTPCNSTK 166
Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGN 181
CY Q PLFDP SSTY + C++ C L N + G C YSV Y DGS S G
Sbjct: 167 CYPQKDPLFDPSKSSTYAPIACNTDACRKLGDHYHNGCTSGGTQCGYSVEYADGSHSRGV 226
Query: 182 LATETVTLGSTTGQAVALPGIT-----FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
+ ET+TL PGIT FGCG + G + K G++GLGG +SL+ Q
Sbjct: 227 YSNETLTLA---------PGITVEDFHFGCGRDQRGP-SDKYDGLLGLGGAPVSLVVQTS 276
Query: 237 TTI------------------------AGNQRLGVSTP---------------------- 250
+ +GN+ V TP
Sbjct: 277 SVYGGAFSYCLPALNSEAGFLVLGSPPSGNKSAFVFTPMRHLPGYATFYMVTMTGISVGG 336
Query: 251 ------------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN 298
++IDSGT T LP+ + L + + ++A P+ P+ + CY+F
Sbjct: 337 KPLHIPQSAFRGGMIIDSGTVDTELPETAYNALEAALRKALKAYPLV-PSDDFDTCYNFT 395
Query: 299 SLSQ--VPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLV 355
S VP V F GA + L N + D + G + + I GN+ Q V
Sbjct: 396 GYSNITVPRVAFTFSGGATIDLDVPNGILV--NDCLAFQESGPDDGLGIIGNVNQRTLEV 453
Query: 356 GYDIEQQTVSFKPTDC 371
YD + V F+ C
Sbjct: 454 LYDAGRGNVGFRAGAC 469
>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
Length = 464
Score = 140 bits (354), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 78/195 (40%), Positives = 115/195 (58%), Gaps = 15/195 (7%)
Query: 49 TPYQRLRDALTRSLNRLNHFN--QNSSISSSKASQAD--IIPNNANYLIRISIGTPPTER 104
T ++ LR A+ RS RL + + S+ KA A+ I+P YL+++ IGTPP +
Sbjct: 43 TEHELLRRAIQRSRYRLAGIGMARGEAASARKAVVAETPIMPAGGEYLVKLGIGTPPYKF 102
Query: 105 LAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV 164
A DT SDLIWTQC+PC + CY Q P+F+P++SSTY +LPCSS C L+ C
Sbjct: 103 TAAIDTASDLIWTQCQPC--TGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHD 160
Query: 165 N---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN-GGLFNSKTTG 220
+ CQY+ +Y + + G LA + + +G A G+ FGC T++ GG + +G
Sbjct: 161 DDESCQYTYTYSGNATTEGTLAVDKLVIGED-----AFRGVAFGCSTSSTGGAPPPQASG 215
Query: 221 IVGLGGGDISLISQM 235
+VGLG G +SL+SQ+
Sbjct: 216 VVGLGRGPLSLVSQL 230
Score = 46.2 bits (108), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 34/128 (26%), Positives = 61/128 (47%), Gaps = 9/128 (7%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY------SFNSLSQVPE 305
++ID +T+TFL L++ + I + L+LC+ +F+ + VP
Sbjct: 334 MIIDIASTITFLEASLYDELVNDLEVEIRLPRGTGSSLGLDLCFILPDGVAFDRV-YVPA 392
Query: 306 VTIHFRGADVKLSRSNFFVKVSED-IVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 363
V + F G ++L ++ F + E ++C V + SV I GN Q N V Y++ +
Sbjct: 393 VALAFDGRWLRLDKARLFAEDRESGMMCLMVGRAEAGSVSILGNFQQQNMQVLYNLRRGR 452
Query: 364 VSFKPTDC 371
V+F + C
Sbjct: 453 VTFVQSPC 460
>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
Length = 524
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 83/227 (36%), Positives = 120/227 (52%), Gaps = 30/227 (13%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQ-------- 81
S+ L+ RD Y S LR A+ + R N + + S A Q
Sbjct: 105 SLALVRRDEVTGSTYPS-------LRHAVLDLVARDNARAEYLATRLSPAYQPPGFSGSE 157
Query: 82 ----ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
+ + + YL+R+S+G+PPTE+ V D+GSD++W QC+PC +CY+Q PLFDP
Sbjct: 158 SKVVSGLDEGSGEYLVRVSVGSPPTEQYLVVDSGSDVMWVQCKPC--LECYVQADPLFDP 215
Query: 138 KMSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTG 194
S+T+ + C S+ C L +C C+Y VSY DGS++ G LA ET+TLG T
Sbjct: 216 ATSATFSGVSCGSAICRILPTSACGDGELGGCEYEVSYADGSYTKGALALETLTLGGT-- 273
Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
A+ G+ GCG N GLF G++GLG G +SL+ Q+ + G
Sbjct: 274 ---AVEGVVIGCGHRNRGLFVG-AAGLMGLGWGPMSLVGQLGGEVGG 316
>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 114/346 (32%), Positives = 156/346 (45%), Gaps = 70/346 (20%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY+ ++ +GTP T V DTGS L W QC PC S C+ Q PLFDP+ SSTY S+ CS
Sbjct: 133 NYVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVS-CHRQVGPLFDPRASSTYASVRCS 191
Query: 150 SSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
+SQC A+LN +CS N C Y SYGD SFS G+L+T+TV+ GST P
Sbjct: 192 ASQCDELQAATLNPSACSASNVCIYQASYGDSSFSVGSLSTDTVSFGSTR-----YPSFY 246
Query: 204 FGCGT-NNG---------GLFNSK-----------------------TTGIVGLG----G 226
+GCG N G GL +K +TG + +G G
Sbjct: 247 YGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPTAASTGYLSIGPYNTG 306
Query: 227 GDISLISQMRTTI-------------AGNQRLGVSTPD-----IVIDSGTTLTFLPQGYN 268
S +++ G L VS + +IDSGT +T LP +
Sbjct: 307 HYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSSLPTIIDSGTVITRLPTAVH 366
Query: 269 SNLLSVMSSMIEAQPVADPTGSLELCYSFN-SLSQVPEVTIHFRG-ADVKLSRSNFFVKV 326
+ L ++ + A L+ C+ S +VP V + F G A +KL+ N + V
Sbjct: 367 TALSKAVAQAMAGAQRAPAFSILDTCFEGQASQLRVPTVAMAFAGGASMKLTTRNVLIDV 426
Query: 327 SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
+ C F T+S I GN Q F V YD+ Q + F C+
Sbjct: 427 DDSTTCLAFA-PTDSTAIIGNTQQQTFSVIYDVAQSRIGFSAGGCS 471
>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 475
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 69/194 (35%), Positives = 109/194 (56%), Gaps = 17/194 (8%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNR----LNHFNQNSSISSSKASQADI 84
+ ++L+HRD K P +N+ R + R R L +++A +D+
Sbjct: 68 YKLKLVHRD--KVPTFNTYHDHRTRFNARMQRDTKRAASLLRRLAAGKPTYAAEAFGSDV 125
Query: 85 IPN----NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
+ + Y +RI +G+PP + V D+GSD+IW QCEPC +QCY Q P+F+P S
Sbjct: 126 VSGMEQGSGEYFVRIGVGSPPRNQYVVMDSGSDIIWVQCEPC--TQCYHQSDPVFNPADS 183
Query: 141 STYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
S++ + C+S+ C+ ++ +C C+Y VSYGDGS++ G LA ET+T G T + VA+
Sbjct: 184 SSFSGVSCASTVCSHVDNAACHEGRCRYEVSYGDGSYTKGTLALETITFGRTLIRNVAI- 242
Query: 201 GITFGCGTNNGGLF 214
GCG +N G+F
Sbjct: 243 ----GCGHHNQGMF 252
Score = 40.4 bits (93), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 34/128 (26%), Positives = 57/128 (44%), Gaps = 12/128 (9%)
Query: 252 IVIDSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADPTGSLELCYS-FNSLS-QVPE 305
+V+D+GT +T LP + + ++ +++ P A + CY F +S +VP
Sbjct: 352 VVMDTGTAVTRLPTVAYEAFRDGFIAQTTNL----PRASGVSIFDTCYDLFGFVSVRVPT 407
Query: 306 VTIHFRGADV-KLSRSNFFVKVSE-DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 363
V+ +F G + L NF + V + C F ++ + I GNI Q + D
Sbjct: 408 VSFYFSGGPILTLPARNFLIPVDDVGTFCFAFAPSSSGLSIIGNIQQEGIQISVDGANGF 467
Query: 364 VSFKPTDC 371
V F P C
Sbjct: 468 VGFGPNVC 475
>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
gi|224034427|gb|ACN36289.1| unknown [Zea mays]
Length = 443
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 85/211 (40%), Positives = 115/211 (54%), Gaps = 13/211 (6%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD---I 84
GF L H D+ ++ T Q L AL RS R+ ++++ A A +
Sbjct: 30 GFKATLRHVDA------DAGYTEEQLLSRALRRSSARVATLQSLAALAPGDAITAARILV 83
Query: 85 IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
+ ++ YL+ + IGTP A+ DTGSDLIWTQC PC C Q +P FDP S+TY+
Sbjct: 84 LASDGEYLMEMGIGTPTRYYSAILDTGSDLIWTQCAPC--LLCVDQPTPYFDPARSATYR 141
Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
SL C+S C +L C C Y YGD + + G LA ET T G T V+LPGI+F
Sbjct: 142 SLGCASPACNALYYPLCYQKVCVYQYFYGDSASTAGVLANETFTFG-TNETRVSLPGISF 200
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
GCG N GL + +G+VG G G +SL+SQ+
Sbjct: 201 GCGNLNAGLL-ANGSGMVGFGRGSLSLVSQL 230
Score = 42.7 bits (99), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 36/126 (28%), Positives = 62/126 (49%), Gaps = 8/126 (6%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ----VPEV 306
+IDSGTT+T+L + + + +S I P+ + T + L+ C+ + + +P++
Sbjct: 316 IIDSGTTITYLAEPAYDAVRAAFASQITL-PLLNVTDASVLDTCFQWPPPPRQSVTLPQL 374
Query: 307 TIHFRGADVKLSRSNF-FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
+HF GAD +L N+ V S + ++ I G+ NF V YD+E +S
Sbjct: 375 VLHFDGADWELPLQNYMLVDPSTGGGLCLAMASSSDGSIIGSYQHQNFNVLYDLENSLMS 434
Query: 366 FKPTDC 371
F P C
Sbjct: 435 FVPAPC 440
>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
Length = 423
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 72/150 (48%), Positives = 95/150 (63%), Gaps = 8/150 (5%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
N +L+ +SIGTP A+ DTGSDL+WTQC+PC C+ Q +P+FDP SSTY ++
Sbjct: 70 GNGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPC--VDCFKQSTPVFDPSSSSTYATV 127
Query: 147 PCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
PCSS+ C+ L C S C Y+ +YGD S + G LATET TL + LPG+ FG
Sbjct: 128 PCSSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK-----LPGVVFG 182
Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
CG N G S+ G+VGLG G +SL+SQ+
Sbjct: 183 CGDTNEGDGFSQGAGLVGLGRGPLSLVSQL 212
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 45/133 (33%), Positives = 67/133 (50%), Gaps = 11/133 (8%)
Query: 249 TPDIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTG-SLELCYSFNSLS----Q 302
T +++DSGT++T+L QGY + + + M A P AD +G L+LC+ + +
Sbjct: 293 TGGVIVDSGTSITYLEVQGYRALKKAFAAQM--ALPAADGSGVGLDLCFRAPAKGVDQVE 350
Query: 303 VPEVTIHFRG-ADVKLSRSNFFV-KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 360
VP + HF G AD+ L N+ V +C G + + I GN Q NF YD+
Sbjct: 351 VPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMG-SRGLSIIGNFQQQNFQFVYDVG 409
Query: 361 QQTVSFKPTDCTK 373
T+SF P C K
Sbjct: 410 HDTLSFAPVQCNK 422
>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 125/409 (30%), Positives = 182/409 (44%), Gaps = 82/409 (20%)
Query: 32 ELIHRDSPKSPFY-NSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN 90
ELIHR+ P SP N+S+T + A+ R R +++ ++ + + N
Sbjct: 21 ELIHREHPSSPLRSNTSKTTTEIFLAAVKRGAERRAQLSKHI-LAEGRLFSTPVASGNGE 79
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
YLI IS G+PP + + DTGSDLIWTQC PC C S +FDP SSTY ++ C+S
Sbjct: 80 YLIDISFGSPPQKASVIVDTGSDLIWTQCLPC--ETCNAAASVIFDPVKSSTYDTVSCAS 137
Query: 151 SQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN 210
+ C+SL +SC+ +C+Y YGDGS ++G L+TETVT+ +P + FGCG N
Sbjct: 138 NFCSSLPFQSCT-TSCKYDYMYGDGSSTSGALSTETVTV-----GTGTIPNVAFGCGHTN 191
Query: 211 GGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQR-----LG--VSTPDIVIDS----GTT 259
G F + GIVGLG G +SLISQ + + LG ++P ++ DS G
Sbjct: 192 LGSF-AGAAGIVGLGQGPLSLISQASSITSKKFSYCLVPLGSTKTSPMLIGDSAAAGGVA 250
Query: 260 LTFL------PQGYNSNLLSVMSSMIEAQPVADPTGSLE----------------LCY-- 295
T L P Y ++L + + + V P G+ L Y
Sbjct: 251 YTALLTNTANPTFYYADLTGI---SVSGKAVTYPVGTFSIDASGQGGFILDSGTTLTYLE 307
Query: 296 --SFNSLSQVPEVTIHFRGAD-------------------------------VKLSRSNF 322
+FN+L + + F AD +L N
Sbjct: 308 TGAFNALVAALKAEVPFPEADGSLYGLDYCFSTAGVANPTYPTMTFHFKGADYELPPENV 367
Query: 323 FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
FV + + + I GNI Q N L+ +D+ Q V FK +C
Sbjct: 368 FVALDTGGSICLAMAASTGFSIMGNIQQQNHLIVHDLVNQRVGFKEANC 416
>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
gi|194704078|gb|ACF86123.1| unknown [Zea mays]
gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 471
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 114/346 (32%), Positives = 156/346 (45%), Gaps = 70/346 (20%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY+ ++ +GTP T V DTGS L W QC PC S C+ Q PLFDP+ SSTY S+ CS
Sbjct: 133 NYVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVS-CHRQVGPLFDPRASSTYTSVRCS 191
Query: 150 SSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
+SQC A+LN +CS N C Y SYGD SFS G L+T+TV+ GST+ P
Sbjct: 192 ASQCDELQAATLNPSACSASNVCIYQASYGDSSFSVGYLSTDTVSFGSTS-----YPSFY 246
Query: 204 FGCGT-NNG---------GLFNSK-----------------------TTGIVGLG----G 226
+GCG N G GL +K +TG + +G G
Sbjct: 247 YGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPTAASTGYLSIGPYNTG 306
Query: 227 GDISLISQMRTTI-------------AGNQRLGVSTPD-----IVIDSGTTLTFLPQGYN 268
S +++ G L VS + +IDSGT +T LP +
Sbjct: 307 HYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSSLPTIIDSGTVITRLPTAVH 366
Query: 269 SNLLSVMSSMIEAQPVADPTGSLELCYSFN-SLSQVPEVTIHFRG-ADVKLSRSNFFVKV 326
+ L ++ + A L+ C+ S +VP V + F G A +KL+ N + V
Sbjct: 367 TALSKAVAQAMAGAQRAPAFSILDTCFEGQASQLRVPTVVMAFAGGASMKLTTRNVLIDV 426
Query: 327 SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
+ C F T+S I GN Q F V YD+ Q + F C+
Sbjct: 427 DDSTTCLAFA-PTDSTAIIGNTQQQTFSVIYDVAQSRIGFSAGGCS 471
>gi|125548492|gb|EAY94314.1| hypothetical protein OsI_16081 [Oryza sativa Indica Group]
Length = 417
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 78/195 (40%), Positives = 115/195 (58%), Gaps = 15/195 (7%)
Query: 49 TPYQRLRDALTRSLNRLNHFN--QNSSISSSKASQAD--IIPNNANYLIRISIGTPPTER 104
T ++ LR A+ RS RL + + S+ KA A+ I+P YL+++ IGTPP +
Sbjct: 43 TEHELLRRAIQRSRYRLAGIGMARGEAASARKAVVAETPIMPAGGEYLVKLGIGTPPYKF 102
Query: 105 LAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV 164
A DT SDLIWTQC+PC + CY Q P+F+P++SSTY +LPCSS C L+ C
Sbjct: 103 TAAIDTASDLIWTQCQPC--TGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHD 160
Query: 165 N---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN-GGLFNSKTTG 220
+ CQY+ +Y + + G LA + + +G A G+ FGC T++ GG + +G
Sbjct: 161 DDESCQYTYTYSGNATTEGTLAVDKLVIGED-----AFRGVAFGCSTSSTGGAPPPQASG 215
Query: 221 IVGLGGGDISLISQM 235
+VGLG G +SL+SQ+
Sbjct: 216 VVGLGRGPLSLVSQL 230
>gi|296082172|emb|CBI21177.3| unnamed protein product [Vitis vinifera]
Length = 376
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 105/362 (29%), Positives = 155/362 (42%), Gaps = 73/362 (20%)
Query: 30 SVELIHRDSPKSPF---YNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP 86
S+E+IH+ P S S + Q L +R + + +N + +P
Sbjct: 67 SLEVIHKHGPCSKLSQDKGRSPSRTQMLDQDESRVNSIRSRLAKNPADGGKLKGSKVTLP 126
Query: 87 NNA-------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
+ + NY++ + +GTP + + DTGSDL WTQCEPC CY Q P+F+P
Sbjct: 127 SKSGSTIGTGNYVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPC-ARYCYHQQEPIFNPSK 185
Query: 140 SSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
S++Y ++ CSS C L N SCS C Y + YGD S+S G A + + L ST
Sbjct: 186 STSYTNISCSSPTCDELKSGTGNSPSCSASTCVYGIQYGDQSYSVGFFAQDKLALTSTD- 244
Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVI 254
FGCG NN GLF G++GLG +SL+S+
Sbjct: 245 ---VFNNFLFGCGQNNRGLF-VGVAGLIGLGRNALSLMSKY------------------- 281
Query: 255 DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR- 311
P A P L+ CY F+ VP++ ++F
Sbjct: 282 ----------------------------PKAAPASILDTCYDFSQYDTVDVPKINLYFSD 313
Query: 312 GADVKLSRSNFFVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPT 369
GA++ L S F ++ VC F G +++ + I GN+ Q F V YD+ + F P
Sbjct: 314 GAEMDLDPSGIFYILNISQVCLAFAGNSDATDIAILGNVQQKTFDVVYDVAGGRIGFAPG 373
Query: 370 DC 371
C
Sbjct: 374 GC 375
>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 104/349 (29%), Positives = 148/349 (42%), Gaps = 75/349 (21%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y R+ +G P + V DTGSD+ W QC+PC + CY Q P+FDP SSTY + C
Sbjct: 18 GEYFTRVGVGNPARQFYMVLDTGSDINWLQCQPC--TDCYQQTDPIFDPTASSTYAPVTC 75
Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
S QC+SL SC C Y V+YGDGS++ G+ ATE+V+ G++ ++ + GCG
Sbjct: 76 QSQQCSSLEMSSCRSGQCLYQVNYGDGSYTFGDFATESVSFGNSG----SVKNVALGCGH 131
Query: 209 NNGGL------------------------------------------FNSKTTGI----- 221
+N GL FNS G+
Sbjct: 132 DNEGLFVGAAGLLGLGGGPLSLTNQLKATSFSYCLVNRDSAGSSTLDFNSAQLGVDSVTA 191
Query: 222 -------------VGLGGGDISLISQMRTTIAGNQRLGVS-TPDIVIDSGTTLTFLP-QG 266
VGL G +S+ QM + RL S I++D GT +T L Q
Sbjct: 192 PLMKNRKIDTFYYVGLSG--MSVGGQMVSIPESTFRLDESGNGGIIVDCGTAITRLQTQA 249
Query: 267 YNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-GADVKLSRSNFF 323
YN L M + + + CY + + +VP V+ HF G L +N+
Sbjct: 250 YNP-LRDAFVRMTQNLKLTSAVALFDTCYDLSGQASVRVPTVSFHFADGKSWNLPAANYL 308
Query: 324 VKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+ V S C F T+S+ I GN+ Q V +D+ + F P C
Sbjct: 309 IPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVTFDLANNRMGFSPNKC 357
>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 506
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 107/411 (26%), Positives = 175/411 (42%), Gaps = 96/411 (23%)
Query: 56 DALTRSLNRLNHFNQNSSISSSKASQADIIPNNA------------------NYLIRISI 97
D+ + R++ ++ +++S S A++ D P A YL+ + +
Sbjct: 96 DSAEKDAVRIDTMHRRAALSGSAAARRDSAPRRALSERVVATVESGVPVGSGEYLVDVYL 155
Query: 98 GTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC---- 153
GTPP + DTGSDL W QC PC C+ Q P+FDP S +Y+++ C +C
Sbjct: 156 GTPPRRFRMIMDTGSDLNWLQCAPC--LDCFEQSGPIFDPAASISYRNVTCGDDRCRLVS 213
Query: 154 --ASLNQKSCSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
A + C C Y YGD S + G+LA E T+ T + G+ FGCG
Sbjct: 214 PPAESAPRECRRPRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTQSGTRRVDGVAFGCGH 273
Query: 209 NNGGLFN----------------SKTTGIVG-------------------LGGGDISLIS 233
N GLF+ S+ G+ G + G D +L++
Sbjct: 274 RNRGLFHGAAGLLGLGRGPLSFASQLRGVYGGHAFSYCLVEHGSAAGSKIIFGHDDALLA 333
Query: 234 ---------------------QMRTTIAGNQRLGVSTPDI-----VIDSGTTLTFLPQ-G 266
Q+++ + G + + +S+ + +IDSGTTL++ P+
Sbjct: 334 HPQLNYTAFAPTTDADTFYYLQLKSILVGGEAVNISSDTLSAGGTIIDSGTTLSYFPEPA 393
Query: 267 YNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-GADVKLSRSNFF 323
Y + + + M + P+ L CY+ + +VPE+++ F GA + N+F
Sbjct: 394 YQAIRQAFIDRMSPSYPLILGFPVLSPCYNVSGAEKVEVPELSLVFADGAAWEFPAENYF 453
Query: 324 VKVS-EDIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
+++ E I+C G S + I GN Q NF V YD+E + F P C
Sbjct: 454 IRLEPEGIMCLAVLGTPRSGMSIIGNYQQQNFHVLYDLEHNRLGFAPRRCA 504
>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 109/354 (30%), Positives = 156/354 (44%), Gaps = 71/354 (20%)
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
A I+P Y++ + +GTP + DTGSDL WTQCEPC C+ Q+ P FDP S+
Sbjct: 131 ASIVPTGGAYVVTVGLGTPKKDFTLSFDTGSDLTWTQCEPC-LGGCFPQNQPKFDPTTST 189
Query: 142 TYKSLPCSSSQCASLNQ-----KSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
+YK++ CSS C + + + C C Y + YG G ++ G LATET+ + S+
Sbjct: 190 SYKNVSCSSEFCKLIAEGNYPAQDCISNTCLYGIQYGSG-YTIGFLATETLAIASSD--- 245
Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI--------------AGN 242
FGC + G FN TTG++GLG I+L SQ G+
Sbjct: 246 -VFKNFLFGCSEESRGTFNG-TTGLLGLGRSPIALPSQTTNKYKNLFSYCLPASPSSTGH 303
Query: 243 QRLGV-------STP------------------------------DIVIDSGTTLTFLPQ 265
GV STP +IDSGTT TFLP
Sbjct: 304 LSFGVEVSQAAKSTPISPKLKQLYGLNTVGISVRGRELPINGSISRTIIDSGTTFTFLPS 363
Query: 266 GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ----VPEVTIHFRGA-DVKLSRS 320
S L S M+ + + T S + CY F+++ +P ++I F G +V++ S
Sbjct: 364 PTYSALGSAFREMMANYTLTNGTSSFQPCYDFSNIGNGTLTIPGISIFFEGGVEVEIDVS 423
Query: 321 NFFVKVSE-DIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+ V+ VC F G + I+GN Q + V YD+ + V F P C
Sbjct: 424 GIMIPVNGLKEVCLAFADTGSDSDFAIFGNYQQKTYEVIYDVAKGMVGFAPKGC 477
>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
Length = 454
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 103/363 (28%), Positives = 153/363 (42%), Gaps = 81/363 (22%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS-PLFDPKMSSTYKSLPC 148
YL+ +S+GTPP DTGSDL+WTQC PC C+ Q + P+ DP SST+ +LPC
Sbjct: 89 EYLMHVSVGTPPRPVALTLDTGSDLVWTQCAPC--LDCFEQGAAPVLDPAASSTHAALPC 146
Query: 149 SSSQCASLNQKSCSGVN-----CQYSVSYGDGSFSNGNLATETVTLG-STTGQAVALPGI 202
+ C +L SC G + C Y YGD S + G LAT++ T G +A +
Sbjct: 147 DAPLCRALPFTSCGGRSWGDRSCVYVYHYGDRSLTVGQLATDSFTFGGDDNAGGLAARRV 206
Query: 203 TFGCGTNNGG----------------------------------LFNSKTTGIVGLGGGD 228
TFGCG N G +F++K++ +V LG
Sbjct: 207 TFGCGHINKGIFQANETGIAGFGRGRWSLPSQLNVTSFSYCFTSMFDTKSSSVVTLGAAA 266
Query: 229 ISLISQMRTTIAGNQR-------------------------LGVSTPD------IVIDSG 257
L+ G+ R V+ P+ +IDSG
Sbjct: 267 AELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGISVGGARVAVPESRLRSSTIIDSG 326
Query: 258 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-----QVPEVTIHFRG 312
++T LP+ + + S + A + +L+LC++ + VP +T+H G
Sbjct: 327 ASITTLPEDVYEAVKAEFVSQVGLPAAAAGSAALDLCFALPVAALWRRPAVPALTLHLDG 386
Query: 313 -ADVKLSRSNF-FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
AD +L R N+ F + ++C V + GN Q N V YD+E +SF P
Sbjct: 387 GADWELPRGNYVFEDYAARVLCVVLDAAAGEQVVIGNYQQQNTHVVYDLENDVLSFAPAR 446
Query: 371 CTK 373
C K
Sbjct: 447 CDK 449
>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 455
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 110/348 (31%), Positives = 157/348 (45%), Gaps = 73/348 (20%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY+ R+ +GTP + V DTGS L W QC PC S C+ Q P+FDPK SS+Y ++ CS
Sbjct: 116 NYVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRVS-CHRQSGPVFDPKTSSSYAAVSCS 174
Query: 150 SSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
S QC A+LN CS N C Y SYGD SFS G L+ +TV+ G A ++P
Sbjct: 175 SPQCDGLSTATLNPAVCSPSNVCIYQASYGDSSFSVGYLSKDTVSFG-----ANSVPNFY 229
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI-------------AGNQRLGVSTP 250
+GCG +N GLF ++ G++GL +SL+ Q+ T+ +G +G P
Sbjct: 230 YGCGQDNEGLFG-RSAGLMGLARNKLSLLYQLAPTLGYSFSYCLPSTSSSGYLSIGSYNP 288
Query: 251 D-----------------IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV--------- 284
+ SG T+ P +S+ + + ++I++ V
Sbjct: 289 GGYSYTPMVSNTLDDSLYFISLSGMTVAGKPLAVSSSEYTSLPTIIDSGTVITRLPTSVY 348
Query: 285 --------ADPTGS---------LELCYS--FNSLSQVPEVTIHFR-GADVKLSRSNFFV 324
A GS L+ C+ + L VP V++ F GA +KLS N V
Sbjct: 349 TALSKAVAAAMKGSTKRAAAYSILDTCFEGQASKLRAVPAVSMAFSGGATLKLSAGNLLV 408
Query: 325 KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
V C F S I GN Q F V YD++ + F C+
Sbjct: 409 DVDGATTCLAF-APARSAAIIGNTQQQTFSVVYDVKSNRIGFAAAGCS 455
>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 488
Score = 138 bits (347), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 116/412 (28%), Positives = 175/412 (42%), Gaps = 88/412 (21%)
Query: 30 SVELIHRDSPKSPFYN-----SSETPYQRLRDALTRSLNRLN-----HFNQNSSISSSKA 79
S+E++H+ P S N S+TP+ + + + +N + Q+SS+S +
Sbjct: 70 SLEVVHKHGPCSQLNNHDGKAKSKTPHSEILNQDKERVKYINSRISKNLGQDSSVSELDS 129
Query: 80 ----SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
+++ + + NY + + +GTP + + DTGSDL WTQCEPC S CY Q +F
Sbjct: 130 VTLPAKSGSLIGSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARS-CYKQQDAIF 188
Query: 136 DPKMSSTYKSLPCSSSQCASL-----NQKSCSGVN--CQYSVSYGDGSFSNGNLATETVT 188
DP S++Y ++ C+S+ C L N+ CS C Y + YGD SFS G + E ++
Sbjct: 189 DPSKSTSYSNITCTSTLCTQLSTATGNEPGCSASTKACIYGIQYGDSSFSVGYFSRERLS 248
Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR------------ 236
+ +T + FGCG NN GLF + G++GLG IS + Q
Sbjct: 249 VTATD----IVDNFLFGCGQNNQGLFGG-SAGLIGLGRHPISFVQQTAAVYRKIFSYCLP 303
Query: 237 ------------------------TTIA---------------GNQRLGVS-----TPDI 252
+TI+ G +L VS T
Sbjct: 304 ATSSSTGRLSFGTTTTSYVKYTPFSTISRGSSFYGLDITGISVGGAKLPVSSSTFSTGGA 363
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF 310
+IDSGT +T LP + L S + P A L+ CY + +P++ F
Sbjct: 364 IIDSGTVITRLPPTAYTALRSAFRQGMSKYPSAGELSILDTCYDLSGYEVFSIPKIDFSF 423
Query: 311 RGA-DVKLSRSNFFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDI 359
G V+L S VC F G + V IYGN+ Q V YD+
Sbjct: 424 AGGVTVQLPPQGILYVASAKQVCLAFAANGDDSDVTIYGNVQQKTIEVVYDV 475
>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 503
Score = 138 bits (347), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 122/446 (27%), Positives = 171/446 (38%), Gaps = 102/446 (22%)
Query: 21 PIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSS--- 77
PI A + V ++HR P SP + + L NR+ + S +++
Sbjct: 65 PITATSSAARVPIVHRHGPCSPLAGAHAGKPPSHAEILAADQNRVESLHHRVSSTTTGLG 124
Query: 78 -KASQADIIPNN------------------------ANYLIRISIGTPPTERLAVADTGS 112
K P + ANY++ I +GTPP+ V DTGS
Sbjct: 125 GKPRTKKKTPGHSSVPASSSSSSSSVPASSGLSLGTANYVVPIGLGTPPSRFTVVFDTGS 184
Query: 113 DLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSY 172
D W QC PC S CY Q LFDP SSTY ++ C+ CA L+ C+ +C Y + Y
Sbjct: 185 DTTWVQCRPCVVS-CYKQKDRLFDPAKSSTYANVSCADPACADLDASGCNAGHCLYGIQY 243
Query: 173 GDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN---------------------- 210
GDGS++ G A +T+ + A+ G FGCG N
Sbjct: 244 GDGSYTVGFFAKDTLAVAQD-----AIKGFKFGCGEKNRGLFGQTAGLLGLGRGPTSITV 298
Query: 211 ------GGLF--------------------------NSKTTGIVGLGGGDISLISQMRTT 238
GG F N+KTT ++ G +
Sbjct: 299 QAYEKYGGSFSYCLPASSAATGYLEFGPLSPSSSGSNAKTTPMLTDKGPTFYYVGLTGIR 358
Query: 239 IAGNQRLGV------STPDIVIDSGTTLTFLPQ--GYNSNLLSVMSSMIEAQPVADPTGS 290
+ G Q LG S ++DSGT +T LP + + A
Sbjct: 359 VGGKQ-LGAIPESVFSNSGTLVDSGTVITRLPDTAYAALSSAFAAAMAASGYKKAAAYSI 417
Query: 291 LELCYSFNSLSQV--PEVTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFK--GITNSVPIY 345
L+ CY F LSQV P V++ F+G A + L S +S+ VC F G SV I
Sbjct: 418 LDTCYDFTGLSQVSLPTVSLVFQGGACLDLDASGIVYAISQSQVCLGFASNGDDESVGIV 477
Query: 346 GNIMQTNFLVGYDIEQQTVSFKPTDC 371
GN Q + V YD+ ++ V F P C
Sbjct: 478 GNTQQRTYGVLYDVSKKVVGFAPGAC 503
>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
Length = 474
Score = 137 bits (346), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 128/423 (30%), Positives = 174/423 (41%), Gaps = 97/423 (22%)
Query: 31 VELIHRDSPKSPFYNSS-ETPYQRLRDALTRSLNRLNHFNQNSS------ISSSKASQAD 83
+ L H+ P +P SS TP + D L R + + S + SKA A
Sbjct: 67 LRLTHKHGPCAPSRASSLATP--SVADTLRADQRRAEYILRRVSGRGTPQLWDSKAEAAT 124
Query: 84 I-IPNNA-------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
+P N NY++ +S+GTP + DTGSDL W QC PC CY Q PLF
Sbjct: 125 ATVPANWGFNIGTLNYVVTVSLGTPGVAQTLEVDTGSDLSWVQCTPCAAPACYSQKDPLF 184
Query: 136 DPKMSSTYKSLPCSSSQCASLN--QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
DP SS+Y ++PC C L SCS C Y VSYGDGS + G +++T+TL
Sbjct: 185 DPAQSSSYAAVPCGGPVCGGLGIYASSCSAAQCGYVVSYGDGSKTTGVYSSDTLTLSPND 244
Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ------------------- 234
A+ G FGCG G + G++GLG + SL+ Q
Sbjct: 245 ----AVRGFFFGCGHAQSGFTGND--GLLGLGREEASLVEQTAGTYGGVFSYCLPTRPST 298
Query: 235 ----------------------------------MRTTIA-GNQRLGVSTP----DIVID 255
M T I+ G Q+L V + V+D
Sbjct: 299 TGYLTLGGPSGAAPPGFSTTQLLSSPNAATYYVVMLTGISVGGQQLSVPSSVFAGGTVVD 358
Query: 256 SGTTLTFLPQGYNSNLLSVMSSMIEA--QPVADPTGSLELCYSFNSLSQV--PEVTIHFR 311
+GT +T LP + L S S + + P A TG L+ CY+F+ V P V + F
Sbjct: 359 TGTVITRLPPTAYAALRSAFRSGMASYGYPSAPATGILDTCYNFSGYGTVTLPNVALTFS 418
Query: 312 -GADVKLSRSNFFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
GA V L C F G + I GN+ Q +F V I+ +V FKP
Sbjct: 419 GGATVTLGADGIL-----SFGCLAFAPSGSDGGMAILGNVQQRSFEV--RIDGTSVGFKP 471
Query: 369 TDC 371
+ C
Sbjct: 472 SSC 474
>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 452
Score = 137 bits (346), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 93/244 (38%), Positives = 129/244 (52%), Gaps = 20/244 (8%)
Query: 2 ATFLSCVFILFFLCFYVVSPIEAQTGGFSVEL--IHRDSPKSPFYNSSETPYQRLRDALT 59
A S ++ L F ++ G VEL +H D S T Q +R AL
Sbjct: 7 AQMASLAVLIISLVFAALASDSDAAAGVRVELTRVHADP--------SVTASQFVRGALR 58
Query: 60 RSLNRLNHFNQNSSISSSKASQADI--IPNNANYLIRISIGTPPTERLAVADTGSDLIWT 117
R ++R N + SS A P YL+ ++IGTPP A+ADTGSDLIWT
Sbjct: 59 RDMHRHNARKLALAASSGATVSAPTQDSPTAGEYLMALAIGTPPLPYQAIADTGSDLIWT 118
Query: 118 QCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ--CASLNQKSCS----GVNCQYSVS 171
QC PC SQC+ Q +PL++P S+T+ LPC+SS CA+ + + G C Y+V+
Sbjct: 119 QCAPC-TSQCFRQPTPLYNPSSSTTFAVLPCNSSLSVCAAALAGTGTAPPPGCACTYNVT 177
Query: 172 YGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISL 231
YG G +++ +ET T GST +PGI FGC T + G S +G+VGLG G +SL
Sbjct: 178 YGSG-WTSVFQGSETFTFGSTPAGHARVPGIAFGCSTASSGFNASSASGLVGLGRGRLSL 236
Query: 232 ISQM 235
+SQ+
Sbjct: 237 VSQL 240
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 66/129 (51%), Gaps = 8/129 (6%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTG--SLELCYSFNSLSQ----VPE 305
++IDSGTT+T L + + + S++ P D + L+LC+ S + +P
Sbjct: 324 LIIDSGTTITLLGNTAYQQVRAAVVSLVTL-PTTDGSADTGLDLCFMLPSSTSAPPAMPS 382
Query: 306 VTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTV 364
+T+HF GAD+ L ++ + + C + T+ V I GN Q N + YDI Q+T+
Sbjct: 383 MTLHFNGADMVLPADSYMMSDDSGLWCLAMQNQTDGEVNILGNYQQQNMHILYDIGQETL 442
Query: 365 SFKPTDCTK 373
SF P C+
Sbjct: 443 SFAPAKCSA 451
>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 137 bits (345), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 113/432 (26%), Positives = 179/432 (41%), Gaps = 99/432 (22%)
Query: 23 EAQTGG--FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS 80
+ + GG + ++++HRD + +S+ RL L R R+ + S +
Sbjct: 64 DHEEGGEKWMMKVVHRDQLS---FGNSDDHRHRLDGRLKRDAKRVASLIRRLSSGGGGSY 120
Query: 81 QAD---------IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQD 131
+ D + + Y +RI +G+PP + V D+GSD++W QC+PC +QCY Q
Sbjct: 121 RVDDFGTDVISGMEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC--TQCYHQS 178
Query: 132 SPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVT--- 188
P+FDP S+++ + CSSS C L C C+Y VSYGDGS++ G LA ET+T
Sbjct: 179 DPVFDPADSASFTGVSCSSSVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTFGR 238
Query: 189 -------------------------------------LGSTTGQAVALPGITFGCGTNNG 211
LG TG A + ++ G ++
Sbjct: 239 TMVRSVAIGCGHRNRGMFVGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTDSSGS 298
Query: 212 GLFNSKT---------------------TGIVGLGGGDISL-ISQ--MRTTIAGNQRLGV 247
+F + G+ GLG G I + IS+ R T G+
Sbjct: 299 LVFGREALPAGAAWVPLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDG---- 354
Query: 248 STPDIVIDSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-- 301
+V+D+GT +T LP Q + L+ +++ A VA + CY
Sbjct: 355 ---GVVMDTGTAVTRLPTLAYQAFRDAFLAQTANLPRATGVA----IFDTCYDLLGFVSV 407
Query: 302 QVPEVTIHFRGADV-KLSRSNFFVKVSE-DIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 359
+VP V+ +F G + L NF + + + C F T+ + I GNI Q + +D
Sbjct: 408 RVPTVSFYFSGGPILTLPARNFLIPMDDAGTFCFAFAPSTSGLSILGNIQQEGIQISFDG 467
Query: 360 EQQTVSFKPTDC 371
V F P C
Sbjct: 468 ANGYVGFGPNIC 479
>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
Length = 459
Score = 137 bits (345), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 72/171 (42%), Positives = 111/171 (64%), Gaps = 14/171 (8%)
Query: 75 SSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
+SS A A + A YL+ ++IGTPP +A+ADTGSDL WTQC+PC C+ QD+P+
Sbjct: 79 TSSNAGPARLRSGQAEYLMELAIGTPPVPFVALADTGSDLTWTQCKPC--KLCFPQDTPI 136
Query: 135 FDPKMSSTYKSLPCSSSQCASL--NQKSCSGVN---CQYSVSYGDGSFSNGNLATETVTL 189
+D S+++ +PC+S+ C + + ++C+ C+Y +Y DG++S G L TET+T
Sbjct: 137 YDTAASASFSPVPCASATCLPIWRSSRNCTATTTSPCRYRYAYDDGAYSAGVLGTETLTF 196
Query: 190 GSTT----GQAVALPGITFGCGTNNGGL-FNSKTTGIVGLGGGDISLISQM 235
++ G V++ G+ FGCG +NGGL +NS TG VGLG G +SL++Q+
Sbjct: 197 AGSSPGAPGPGVSVGGVAFGCGVDNGGLSYNS--TGTVGLGRGSLSLVAQL 245
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 36/129 (27%), Positives = 66/129 (51%), Gaps = 8/129 (6%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS----LSQVPEVT 307
+++DSGT T L + +++ ++ ++ QPV + + C+ + L +P++
Sbjct: 331 MIVDSGTIFTVLVESAFRVVVNHVAGVLN-QPVVNASSLDSPCFPATAGEQQLPDMPDML 389
Query: 308 IHFRG-ADVKLSRSNF--FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 364
+HF G AD++L R N+ F + S ++ + I GN Q N + +DI +
Sbjct: 390 LHFAGGADMRLHRDNYMSFNQESSSFCLNIAGAPSAYGSILGNFQQQNIQMLFDITVGQL 449
Query: 365 SFKPTDCTK 373
SF PTDC+K
Sbjct: 450 SFVPTDCSK 458
>gi|222618833|gb|EEE54965.1| hypothetical protein OsJ_02555 [Oryza sativa Japonica Group]
Length = 393
Score = 137 bits (345), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 114/373 (30%), Positives = 163/373 (43%), Gaps = 68/373 (18%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD------ 83
SV L HR P SP +S + L R R ++ + S S+ A+ D
Sbjct: 32 SVTLSHRYGPCSPADPNSGEKRPTDEELLRRDQLRADYIRRKFSGSNGTAAGEDGQSSKV 91
Query: 84 IIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP-PSQCYMQDSPLF 135
+P + Y+I + +G+P + V DTGSD+ W QCEPCP PS C+ LF
Sbjct: 92 SVPTTLGSSLDTLEYVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALF 151
Query: 136 DPKMSSTYKSLPCSSSQCASL-NQKSCSGVN----CQYSVSYGDGSFSNGNLATETVTLG 190
DP SSTY + CS++ CA L + +G + CQY V YGDGS + G
Sbjct: 152 DPAASSTYAAFNCSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGT--------- 202
Query: 191 STTGQAVALPGITFGCGTNN-GGLFNSKTTGIVGLGGGDISLISQ--------------- 234
G FGC G + KT G++GLGG SL+SQ
Sbjct: 203 ----------GFQFGCSHAELGAGMDDKTDGLIGLGGDAQSLVSQTAARSKKVPTYYFAA 252
Query: 235 MRTTIAGNQRLGVSTPDI-----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTG 289
+ G ++LG+S P + ++DSGT +T LP + L S + + A+P G
Sbjct: 253 LEDIAVGGKKLGLS-PSVFAAGSLVDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLG 311
Query: 290 SLELCYSFNSLSQV--PEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIY 345
L+ C++F L +V P V + F G V ++ V C F + +
Sbjct: 312 ILDTCFNFTGLDKVSIPTVALVFAGGAVVDLDAHGIVSGG----CLAFAPTRDDKAFGTI 367
Query: 346 GNIMQTNFLVGYD 358
GN+ Q F V YD
Sbjct: 368 GNVQQRTFEVLYD 380
>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 494
Score = 137 bits (345), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 117/416 (28%), Positives = 181/416 (43%), Gaps = 80/416 (19%)
Query: 31 VELIHRDSPKSPFY--NSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP-- 86
++++H+ P S + +E Y L+D +R + + +++S +S KA+ A +P
Sbjct: 85 LKVVHKHGPCSDLRQGHKAEAQYILLQDQ-SRVDSIHSKLSKDSGLSDVKATAATTLPAK 143
Query: 87 -----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
+ NY + + +GTP + + DTGSDL WTQCEPC S CY Q +F+P S+
Sbjct: 144 DGSIIGSGNYFVTVGLGTPKKDFSLIFDTGSDLTWTQCEPCVKS-CYNQKEAIFNPSQST 202
Query: 142 TYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
+Y ++ C S+ C SL N +C+ C Y + YGD SFS G E ++L +T
Sbjct: 203 SYANISCGSTLCDSLASATGNIFNCASSTCVYGIQYGDSSFSIGFFGKEKLSLTATD--- 259
Query: 197 VALPGITFGCGTNN----------------------------GGLFN------SKTTGIV 222
FGCG NN +F+ S +TG +
Sbjct: 260 -VFNDFYFGCGQNNKGLFGGAAGLLGLGRDKLSLVSQTAQRYNKIFSYCLPSSSSSTGFL 318
Query: 223 GLGG---------------GDISLISQMRTTIA-GNQRLGV-----STPDIVIDSGTTLT 261
GG G S T I+ G ++L + ST +IDSGT +T
Sbjct: 319 TFGGSTSKSASFTPLATISGGSSFYGLDLTGISVGGRKLAISPSVFSTAGTIIDSGTVIT 378
Query: 262 FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGA-DVKLS 318
LP S L S ++ P A L+ C+ F++ VP++ + F G V +
Sbjct: 379 RLPPAAYSALSSTFRKLMSQYPAAPALSILDTCFDFSNHDTISVPKIGLFFSGGVVVDID 438
Query: 319 RSNFFVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
++ F VC F G +++ V I+GN+ Q V YD V F P C+
Sbjct: 439 KTGIFYVNDLTQVCLAFAGNSDASDVAIFGNVQQKTLEVVYDGAAGRVGFAPAGCS 494
>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 490
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 121/416 (29%), Positives = 172/416 (41%), Gaps = 95/416 (22%)
Query: 30 SVELIHRDSPKSPFYN-----SSETPYQRLRDALTRSLNRLNHFN--------QNSSI-- 74
S+E++H+ P S + S TP+ D L + R+ + N Q+SS+
Sbjct: 71 SLEVVHKHGPCSQLNDHDGKAKSTTPHS---DILNQDKERVKYINSRLSKNLGQDSSVEE 127
Query: 75 --SSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
S++ +++ + + NY + + +GTP + + DTGSDL WTQCEPC S CY Q
Sbjct: 128 LDSATLPAKSGSLIGSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARS-CYKQQD 186
Query: 133 PLFDPKMSSTYKSLPCSSSQCASL-----NQKSCSGVN--CQYSVSYGDGSFSNGNLATE 185
+FDP S++Y ++ C+S+ C L N CS C Y + YGD SFS G + E
Sbjct: 187 VIFDPSKSTSYSNITCTSALCTQLSTATGNDPGCSASTKACIYGIQYGDSSFSVGYFSRE 246
Query: 186 TVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR--------- 236
+T+ +T + FGCG NN GLF + G++GLG IS + Q
Sbjct: 247 RLTVTATD----VVDNFLFGCGQNNQGLFGG-SAGLIGLGRHPISFVQQTAAKYRKIFSY 301
Query: 237 ------------------------------------------TTIA-GNQRLGVS----- 248
T IA G +L VS
Sbjct: 302 CLPSTSSSTGHLSFGPAATGRYLKYTPFSTISRGSSFYGLDITAIAVGGVKLPVSSSTFS 361
Query: 249 TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEV 306
T +IDSGT +T LP L S + P A L+ CY + +P +
Sbjct: 362 TGGAIIDSGTVITRLPPTAYGALRSAFRQGMSKYPSAGELSILDTCYDLSGYKVFSIPTI 421
Query: 307 TIHFRGA-DVKLSRSNFFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDI 359
F G VKL S VC F G + V IYGN+ Q V YD+
Sbjct: 422 EFSFAGGVTVKLPPQGILFVASTKQVCLAFAANGDDSDVTIYGNVQQRTIEVVYDV 477
>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
Length = 448
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 91/240 (37%), Positives = 128/240 (53%), Gaps = 27/240 (11%)
Query: 11 LFFLCFYVVSPIEAQTGGFSVEL----IHRDSPKSPFYNSSETPYQRLRDALTRSLNRL- 65
L L F VV A +G SV + IH D P + E +RDAL R ++R
Sbjct: 11 LAVLVFLVVCATLA-SGAASVRVGLTRIHSD----PDITAPE----FVRDALRRDMHRQQ 61
Query: 66 ------NHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQC 119
++ + S ++ D+ PN YL+ +SIGTPP A+ADTGSDLIWTQC
Sbjct: 62 SRSLFGRELAESDGTTVSARTRKDL-PNGGEYLMTLSIGTPPLSYPAIADTGSDLIWTQC 120
Query: 120 EPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS--SQCASL--NQKSCSGVNCQYSVSYGDG 175
PC QC+ Q +PL++P S+T+ LPC+S S CA + + G C Y+ +YG G
Sbjct: 121 APCSGDQCFAQPAPLYNPASSTTFGVLPCNSSLSMCAGVLAGKAPPPGCACMYNQTYGTG 180
Query: 176 SFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
++ G +ET T GS +PGI FGC + +N + G+VGLG G +SL+SQ+
Sbjct: 181 -WTAGVQGSETFTFGSAAADQARVPGIAFGCSNASSSDWNG-SAGLVGLGRGSLSLVSQL 238
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 70/131 (53%), Gaps = 9/131 (6%)
Query: 249 TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV--ADPTGSLELCYSF----NSLSQ 302
T ++IDSGTT+T L + + + S++ + +D TG L+LCY+ ++
Sbjct: 318 TGGLIIDSGTTITSLVNAAYQQVRAAVQSLVTLPAIDGSDSTG-LDLCYALPTPTSAPPA 376
Query: 303 VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQ 361
+P +T+HF GAD+ L ++ + S + C + T+ ++ +GN Q N + YD+
Sbjct: 377 MPSMTLHFDGADMVLPADSYMISGS-GVWCLAMRNQTDGAMSTFGNYQQQNMHILYDVRN 435
Query: 362 QTVSFKPTDCT 372
+ +SF P C+
Sbjct: 436 EMLSFAPAKCS 446
>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
Length = 443
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 84/211 (39%), Positives = 114/211 (54%), Gaps = 13/211 (6%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD---I 84
GF L H D+ ++ T Q L AL RS R+ ++++ A A +
Sbjct: 30 GFKATLRHVDA------DAGYTEEQLLSRALRRSSARVATLQSLAALAPGDAITAARILV 83
Query: 85 IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
+ ++ YL+ + IGTP A+ DTGSDLIWTQC PC C Q +P FDP S+TY+
Sbjct: 84 LASDGEYLMEMGIGTPTRYYSAILDTGSDLIWTQCAPC--LLCVDQPTPYFDPARSATYR 141
Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
SL C+S C +L C C Y YGD + + G LA ET T G T V+LPGI+F
Sbjct: 142 SLGCASPACNALYYPLCYQKVCVYQYFYGDSASTAGVLANETFTFG-TNETRVSLPGISF 200
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
GCG N G + +G+VG G G +SL+SQ+
Sbjct: 201 GCGNLNAGSL-ANGSGMVGFGRGSLSLVSQL 230
Score = 42.7 bits (99), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 36/126 (28%), Positives = 62/126 (49%), Gaps = 8/126 (6%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ----VPEV 306
+IDSGTT+T+L + + + +S I P+ + T + L+ C+ + + +P++
Sbjct: 316 IIDSGTTITYLAEPAYDAVRAAFASQITL-PLLNVTDASVLDTCFQWPPPPRQSVTLPQL 374
Query: 307 TIHFRGADVKLSRSNF-FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
+HF GAD +L N+ V S + ++ I G+ NF V YD+E +S
Sbjct: 375 VLHFDGADWELPLQNYMLVDPSTGGGLCLAMASSSDGSIIGSYQHQNFNVLYDLENSLMS 434
Query: 366 FKPTDC 371
F P C
Sbjct: 435 FVPAPC 440
>gi|357439021|ref|XP_003589787.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355478835|gb|AES60038.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 456
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 111/411 (27%), Positives = 174/411 (42%), Gaps = 80/411 (19%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQ-RLRDALTRSLNR----LNHFNQNSSISSSKASQ-- 81
+ +L HRD+ N +T ++ R + R + R LN N+N+ + +
Sbjct: 58 WKTKLFHRDN-----INLKKTTHKTRFISRINRDIKRVTFLLNRLNKNTQEQQTTTATEA 112
Query: 82 ---ADII----PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
+D++ + Y +RI IG+P + V D+GSD++W QCEPC QCY Q P+
Sbjct: 113 SFGSDVVSGTEEGSGEYFVRIGIGSPAIYQYMVIDSGSDIVWIQCEPC--DQCYNQTDPI 170
Query: 135 FDPKMSSTYKSLPCSSSQCASLNQK-SCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
F+P S+++ + CSS+ C L+ +C C Y V+YGDGS++ G LA ET+T+G T
Sbjct: 171 FNPATSASFIGVACSSNVCNQLDDDVACRKGRCGYQVAYGDGSYTKGTLALETITIGRTV 230
Query: 194 GQAVALPGITFGCGTNNGGLF--------------------NSKTTGIVG---------L 224
Q A+ GCG N G+F ++T G G +
Sbjct: 231 IQDTAI-----GCGHWNEGMFVGAAGLLGLGGGPMSFVGQLGAQTGGAFGYCLVSRAMPV 285
Query: 225 GGGDISLISQ------------------MRTTIAGN--QRLGVSTPDIVIDSGTTLTFLP 264
G + LI +R I+ Q + T +V+D+GT +T LP
Sbjct: 286 GAMWVPLIHNPFYPSFYYVSLSGLAVGGIRVPISEQIFQLTDIGTGGVVMDTGTAITRLP 345
Query: 265 QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFRGADVKLSRSNF 322
+ + P A + CY N +VP V+ +F G + +
Sbjct: 346 TVAYNAFRDAFIAQTTNLPRAPGVSIFDTCYDLNGFVTVRVPTVSFYFSGGQILTFPARN 405
Query: 323 FVKVSEDI--VCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
F+ ++D+ C F + + I GNI Q V D V F P C
Sbjct: 406 FLIPADDVGTFCFAFAPSPSGLSIIGNIQQEGIQVSIDGTNGFVGFGPNVC 456
>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
Length = 538
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 75/207 (36%), Positives = 116/207 (56%), Gaps = 28/207 (13%)
Query: 29 FSVELIHRDSP-KSPFYNSSETPYQRLRDALTRSLNRLNHFNQ--------NSSISSSKA 79
+SV+++HRDS N++ + +RL + L R R+ Q N + S
Sbjct: 114 WSVQVVHRDSLLVKDAANATASYERRLEETLRRDARRVRGLEQRIEKRLRLNKDPAGSHE 173
Query: 80 SQADIIPN------------NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
+ A++ + Y RI +GTP E+ V DTGSD++W QCEPC S+C
Sbjct: 174 NVAEVAAEFGGEVVSGMAQGSGEYFTRIGVGTPMREQYMVLDTGSDVVWIQCEPC--SKC 231
Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETV 187
Y Q P+F+P +S+++ +L C+S+ C+ L+ +C G C Y VSYGDGS++ G+ ATE +
Sbjct: 232 YSQVDPIFNPSLSASFSTLGCNSAVCSYLDAYNCHGGGCLYKVSYGDGSYTIGSFATEML 291
Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLF 214
T G+T+ + VA+ GCG +N GLF
Sbjct: 292 TFGTTSVRNVAI-----GCGHDNAGLF 313
>gi|125602787|gb|EAZ42112.1| hypothetical protein OsJ_26672 [Oryza sativa Japonica Group]
Length = 477
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 104/305 (34%), Positives = 144/305 (47%), Gaps = 47/305 (15%)
Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-ASLNQK-----S 160
+ DTGSDL W QC+PC S CY Q PLFDP S++Y ++PC++S C ASL S
Sbjct: 179 IVDTGSDLTWVQCKPC--SVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGS 236
Query: 161 CSGV----------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN 210
C+ V C YS++YGDGSFS G LAT+TV LG + + G FGCG +N
Sbjct: 237 CATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGAS-----VDGFVFGCGLSN 291
Query: 211 GGLFNSKTTGIVGLG--GGDISL------------ISQMRTTIAGNQRLGVSTPDIVIDS 256
GLF T G++GLG G L ++ A G+ ++++DS
Sbjct: 292 RGLFGG-TAGLMGLGPDGALAGLPDGAPPPFYFMNVTGASVGGAAVAAAGLGAANVLLDS 350
Query: 257 GTTLTFL-PQGYNSNLLSVMSSM-IEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR- 311
GT +T L P Y + E P A P L+ CY+ + VP +T+
Sbjct: 351 GTVITRLAPSVYRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRLEG 410
Query: 312 GADVKLSRSNFFVKVSED--IVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
GAD+ + + +D VC ++ + PI GN Q N V YD + F
Sbjct: 411 GADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFA 470
Query: 368 PTDCT 372
DC+
Sbjct: 471 DEDCS 475
>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 533
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 76/166 (45%), Positives = 96/166 (57%), Gaps = 20/166 (12%)
Query: 90 NYLIRISIGTPPTERLAV-ADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
NY+ I++G + L V DTGSDL W QCEPCP S CY Q PLFDP S T+ ++PC
Sbjct: 179 NYVTTIALGGGGAKNLTVIVDTGSDLTWVQCEPCPGSSCYAQRDPLFDPAASPTFAAVPC 238
Query: 149 SSSQCASLNQKSCSGV-------------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
S CA+ + K +G C Y++SYGDGSFS G LA +T+ LG+TT
Sbjct: 239 GSPACAA-SLKDATGAPGSCARSAGNSEQRCYYALSYGDGSFSRGVLAQDTLGLGTTT-- 295
Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
L G FGCG +N GLF T G++GLG D+SL+SQ G
Sbjct: 296 --KLDGFVFGCGLSNRGLFGG-TAGLMGLGRTDLSLVSQTAARFGG 338
>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 89/218 (40%), Positives = 121/218 (55%), Gaps = 20/218 (9%)
Query: 28 GFSVEL--IHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI- 84
G VEL +H D S T Q +R AL R ++R N + SS A
Sbjct: 31 GVRVELTRVHADP--------SVTASQFVRGALRRDMHRHNARKLALAASSGATVSAPTQ 82
Query: 85 -IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
P YL+ ++IGTPP A+ADTGSDLIWTQC PC SQC+ Q +PL++P S+T+
Sbjct: 83 NSPTAGEYLMALAIGTPPLPYQAIADTGSDLIWTQCAPC-TSQCFRQPTPLYNPSSSTTF 141
Query: 144 KSLPCSSSQ--CASLNQKSCS----GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
LPC+SS CA+ + + G C Y+V+YG G +++ +ET T GST
Sbjct: 142 AVLPCNSSLSVCAAALAGTGTAPPPGCACTYNVTYGSG-WTSVFQGSETFTFGSTPAGQS 200
Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
+PGI FGC T + G S +G+VGLG G +SL+SQ+
Sbjct: 201 RVPGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQL 238
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 67/129 (51%), Gaps = 8/129 (6%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ----VPE 305
++IDSGTT+T L + + + S++ P D + + L+LC+ S + +P
Sbjct: 322 LIIDSGTTITLLGNTAYQQVRAAVVSLVTL-PTTDGSAATGLDLCFMLPSSTSAPPAMPS 380
Query: 306 VTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTV 364
+T+HF GAD+ L ++ + + C + T+ V I GN Q N + YDI Q+T+
Sbjct: 381 MTLHFNGADMVLPADSYMMSDDSGLWCLAMQNQTDGEVNILGNYQQQNMHILYDIGQETL 440
Query: 365 SFKPTDCTK 373
SF P C+
Sbjct: 441 SFAPAKCSA 449
>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
Length = 464
Score = 134 bits (338), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 115/416 (27%), Positives = 172/416 (41%), Gaps = 80/416 (19%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRL--RDALTRSLNRLNHFNQNSSISSSKASQADII 85
G ++ L HR P SP + + ++ RD L + + ++ ++++ A I
Sbjct: 57 GSTLALSHRHGPCSPVISKEKPSHEETLRRDQLRAAYIQAKVSSRYNNVAKELQQSAVTI 116
Query: 86 PNNA-------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
P ++ Y+I ++IGTP ++ DTGSD+ W QC PC C Q LFDP
Sbjct: 117 PTSSGYSLGTTEYVITVTIGTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKDKLFDPA 176
Query: 139 MSSTYKSLPCSSSQCASLNQK--SCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
MS+TY + C S+QCA L + C CQY V YGDGS + G ++T++L S+
Sbjct: 177 MSATYSAFSCGSAQCAQLGDEGNGCLKSQCQYIVKYGDGSNTAGTYGSDTLSLTSSD--- 233
Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI---------------AG 241
A+ FGC G F + G++GLGG SL+SQ T G
Sbjct: 234 -AVKSFQFGCSHRAAG-FVGELDGLMGLGGDTESLVSQTAATYGKAFSYCLPPPSSSGGG 291
Query: 242 NQRLGVS----------TPDIVIDSGTTLTFLPQGYN--SNLLSVMSSMIEAQPVAD--- 286
LG + TP + T QG +L+V +S+ V D
Sbjct: 292 FLTLGAAGGASSSRYSHTPMVRFSVPTFYGVFLQGITVAGTMLNVPASVFSGASVVDSGT 351
Query: 287 ----------------------------PTGSLELCYSFNSLS--QVPEVTIHF-RGADV 315
P GSL+ C+ F+ + VP VT+ F RGA +
Sbjct: 352 VITQLPPTAYQALRTAFKKEMKAYPSAAPVGSLDTCFDFSGFNTITVPTVTLTFSRGAAM 411
Query: 316 KLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
L S + G T I GN+ Q F + +D+ +T+ F+ C
Sbjct: 412 DLDISGILYAGCLAFTATAHDGDTG---ILGNVQQRTFEMLFDVGGRTIGFRSGAC 464
>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 134 bits (338), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 83/210 (39%), Positives = 117/210 (55%), Gaps = 13/210 (6%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP- 86
GF L H D+ N+ T Q L A+ RS R+ ++ + + + ++
Sbjct: 30 GFKATLTHVDA------NAGYTKAQLLSRAVARSRARVAALQSLATAADAITAARILLRF 83
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ YL+ + IG+PP A+ DTGSDLIWTQC PC C Q +P F+P S++Y SL
Sbjct: 84 SEGEYLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPC--LLCVEQPTPYFEPAKSTSYASL 141
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
PCSS+ C +L C C Y YGD + S G LA ET T G+ + + VA+P ++FGC
Sbjct: 142 PCSSAMCNALYSPLCFQNACVYQAFYGDSASSAGVLANETFTFGTNSTR-VAVPRVSFGC 200
Query: 207 GTNNGG-LFNSKTTGIVGLGGGDISLISQM 235
G N G LFN +G+VG G G +SL+SQ+
Sbjct: 201 GNMNAGTLFNG--SGMVGFGRGALSLVSQL 228
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 35/128 (27%), Positives = 62/128 (48%), Gaps = 5/128 (3%)
Query: 249 TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIE-AQPVADPTGSLELCYSF----NSLSQV 303
T ++IDSGTT+TFL Q + + + + + A P+ + + C+ + + +
Sbjct: 311 TGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVGLPRANATPSDTFDTCFKWPPPPRRMVTL 370
Query: 304 PEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 363
PE+ +HF GAD++L N+ V + ++ I G+ NF + YD+E
Sbjct: 371 PEMVLHFDGADMELPLENYMVMDGGTGNLCLAMLPSDDGSIIGSFQHQNFHMLYDLENSL 430
Query: 364 VSFKPTDC 371
+SF P C
Sbjct: 431 LSFVPAPC 438
>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
Length = 438
Score = 134 bits (338), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 83/210 (39%), Positives = 117/210 (55%), Gaps = 13/210 (6%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP- 86
GF L H D+ N+ T Q L A+ RS R+ ++ + + + ++
Sbjct: 27 GFKATLTHVDA------NAGYTKAQLLSRAVARSRARVAALQSLATAADAITAARILLRF 80
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ YL+ + IG+PP A+ DTGSDLIWTQC PC C Q +P F+P S++Y SL
Sbjct: 81 SEGEYLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPC--LLCVEQPTPYFEPAKSTSYASL 138
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
PCSS+ C +L C C Y YGD + S G LA ET T G+ + + VA+P ++FGC
Sbjct: 139 PCSSAMCNALYSPLCFQNACVYQAFYGDSASSAGVLANETFTFGTNSTR-VAVPRVSFGC 197
Query: 207 GTNNGG-LFNSKTTGIVGLGGGDISLISQM 235
G N G LFN +G+VG G G +SL+SQ+
Sbjct: 198 GNMNAGTLFNG--SGMVGFGRGALSLVSQL 225
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 35/128 (27%), Positives = 62/128 (48%), Gaps = 5/128 (3%)
Query: 249 TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIE-AQPVADPTGSLELCYSF----NSLSQV 303
T ++IDSGTT+TFL Q + + + + + A P+ + + C+ + + +
Sbjct: 308 TGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVGLPRANATPSDTFDTCFKWPPPPRRMVTL 367
Query: 304 PEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 363
PE+ +HF GAD++L N+ V + ++ I G+ NF + YD+E
Sbjct: 368 PEMVLHFDGADMELPLENYMVMDGGTGNLCLAMLPSDDGSIIGSFQHQNFHMLYDLENSL 427
Query: 364 VSFKPTDC 371
+SF P C
Sbjct: 428 LSFVPAPC 435
>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
Length = 475
Score = 134 bits (338), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 84/229 (36%), Positives = 121/229 (52%), Gaps = 24/229 (10%)
Query: 31 VELIHRDSPKSPFYNSSE--TPYQRLRDALTRSLNRLNHFNQNSSISSSKA-------SQ 81
+ L HR P +P +S +P L D L R + + S +++ A S+
Sbjct: 67 LRLTHRHGPCAPAGKASALGSPPSFL-DTLRADQRRAEYIQRRVSGAAAAAPGMQLAGSK 125
Query: 82 ADIIPNNAN-------YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
A +P N Y++ +S+GTP + DTGSD+ W QC+PCP CY Q PL
Sbjct: 126 AATVPANLGFSIGTLQYVVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPL 185
Query: 135 FDPKMSSTYKSLPCSSSQCA--SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST 192
FDP SS+Y ++PC+++ C+ +L CSG C Y VSYGDGS + G +++T+TL +
Sbjct: 186 FDPTRSSSYSAVPCAAASCSQLALYSNGCSGGQCGYVVSYGDGSTTTGVYSSDTLTLTGS 245
Query: 193 TGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
AL G FGCG GLF + G++GLG SL+SQ +T G
Sbjct: 246 N----ALKGFLFGCGHAQQGLF-AGVDGLLGLGRQGQSLVSQASSTYGG 289
>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
Length = 464
Score = 134 bits (337), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 84/229 (36%), Positives = 121/229 (52%), Gaps = 24/229 (10%)
Query: 31 VELIHRDSPKSPFYNSSE--TPYQRLRDALTRSLNRLNHFNQNSSISSSKA-------SQ 81
+ L HR P +P +S +P L D L R + + S +++ A S+
Sbjct: 56 LRLTHRHGPCAPAGKASALGSPPSFL-DTLRADQRRAEYIQRRVSGAAAAAPGMQLAGSK 114
Query: 82 ADIIPNNAN-------YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
A +P N Y++ +S+GTP + DTGSD+ W QC+PCP CY Q PL
Sbjct: 115 AATVPANLGFSIGTLQYVVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPL 174
Query: 135 FDPKMSSTYKSLPCSSSQCA--SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST 192
FDP SS+Y ++PC+++ C+ +L CSG C Y VSYGDGS + G +++T+TL +
Sbjct: 175 FDPTRSSSYSAVPCAAASCSQLALYSNGCSGGQCGYVVSYGDGSTTTGVYSSDTLTLTGS 234
Query: 193 TGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
AL G FGCG GLF + G++GLG SL+SQ +T G
Sbjct: 235 N----ALKGFLFGCGHAQQGLF-AGVDGLLGLGRQGQSLVSQASSTYGG 278
>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 134 bits (337), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 119/444 (26%), Positives = 173/444 (38%), Gaps = 89/444 (20%)
Query: 6 SCVF-ILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNR 64
SC+ LFFL P+ + T L H D + T + LR + RS R
Sbjct: 11 SCMLPYLFFLAILFAWPVTSAT--LRAHLSHVDDGRG------FTKRELLRRMVVRSRAR 62
Query: 65 LNHFNQNSSISSSKASQADIIPN---NANYLIRISIGTPPTERLAVA-DTGSDLIWTQCE 120
+ S ++ A+ N N+ YLI +SIG P ++ + + DTGSD++WTQCE
Sbjct: 63 AANLCPYSGATARPATAPVGRANTDVNSEYLIHLSIGAPRSQPVVLTLDTGSDVVWTQCE 122
Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNG 180
PC ++C+ Q P FD S+T +S+ CS C + ++ C C Y YGDGS S G
Sbjct: 123 PC--AECFTQPLPRFDTAASNTVRSVACSDPLCNAHSEHGCFLHGCTYVSGYGDGSLSFG 180
Query: 181 NLATETVTLGSTTGQA-VALPGITFGCGTNNGGLFNSKTTGIVGL--------------- 224
+ ++ T G V +P I FGCG N G F TGI G
Sbjct: 181 HFLRDSFTFDDGKGGGKVTVPDIGFGCGMYNAGRFLQTETGIAGFGRGPLSLPSQLKVRQ 240
Query: 225 -------------------GGGDIS------------------------LISQMRTTIAG 241
G GD+ + + G
Sbjct: 241 FSYCFTTRFEAKSSPVFLGGAGDLKAHATGPILSTPFVRSLPPGTDNSHYVLSFKGVTVG 300
Query: 242 NQRLGVSTPDI--------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 293
RL V P+I IDSGT +T P L S + A PV ++
Sbjct: 301 KTRLPV--PEIKADGSGATFIDSGTDITTFPDAVFRQLKSAFIAQ-AALPVNKTADEDDI 357
Query: 294 CYSFN--SLSQVPEVTIHFRGADVKLSRSNFFVKVSE--DIVCSVFKGITNSVPIYGNIM 349
C+S++ + +P++ H GAD L R N+ + E + +V + GN
Sbjct: 358 CFSWDGKKTAAMPKLVFHLEGADWDLPRENYVTEDRESGQVCVAVSTSGQMDRTLIGNFQ 417
Query: 350 QTNFLVGYDIEQQTVSFKPTDCTK 373
Q N + YD+ + P C K
Sbjct: 418 QQNTHIVYDLAAGKLLLVPAQCDK 441
>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
Length = 462
Score = 134 bits (337), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 96/331 (29%), Positives = 144/331 (43%), Gaps = 52/331 (15%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ Y + +GTPPT L V DTGSD++W QC PC QCY Q +FDP+ S +Y ++
Sbjct: 138 GSGEYFASVGVGTPPTPALLVLDTGSDVVWLQCAPC--RQCYAQSGRVFDPRRSRSYAAV 195
Query: 147 PCSSSQC-----ASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
C + C C Y V+YGDGS + G+LATET+ + +P
Sbjct: 196 RCGAPPCRGLDAGGGGGCDRRRGTCLYQVAYGDGSVTAGDLATETLWF----ARGARVPR 251
Query: 202 ITFGCGTNNGGLF----------------NSKTTGIVG------LGGGDISLISQMRTT- 238
+ GCG +N GLF ++T G G D+ + +RT
Sbjct: 252 VAVGCGHDNEGLFVAAAGLLGLGRGRLSLPTQTARRYGRRFSYCFQGSDLDHRTIIRTVH 311
Query: 239 --IAGNQRLGVSTPD-----------IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA 285
+ G + GV +++DSGT++T L + + + +A
Sbjct: 312 QHVGGARVRGVGERSLRLDPSTGRGGVILDSGTSVTRLARPVYVAVREAFRAAAGGLRLA 371
Query: 286 DPTGSL-ELCYSFNS--LSQVPEVTIHFR-GADVKLSRSNFFVKV-SEDIVCSVFKGITN 340
SL + CY + +VP V++H GA+V L N+ + V + C G
Sbjct: 372 PGGFSLFDTCYDLRGRRVVKVPTVSVHLAGGAEVALPPENYLIPVDTRGTFCLALAGTDG 431
Query: 341 SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
V I GNI Q F V +D ++Q V+ P C
Sbjct: 432 GVSIVGNIQQQGFRVVFDGDRQRVALVPKSC 462
>gi|413925432|gb|AFW65364.1| hypothetical protein ZEAMMB73_378208 [Zea mays]
Length = 418
Score = 134 bits (337), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 110/380 (28%), Positives = 172/380 (45%), Gaps = 74/380 (19%)
Query: 57 ALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN--YLIRISIGTPPTERLAVADTGSDL 114
A RS RL+ +S+ ++Q+ + ++ Y + S+GTPP A+ADTGSDL
Sbjct: 45 AAHRSRERLSILATRLGAASAGSAQSPLQMDSGGGAYDMTFSMGTPPQTLSALADTGSDL 104
Query: 115 IWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKS---CSGVN-----C 166
IW +C C +C + S + P SS++ LPCSS+ C +L +S C G C
Sbjct: 105 IWAKCGAC--KRCAPRGSASYYPTKSSSFSKLPCSSALCRTLESQSLATCGGTRARGAVC 162
Query: 167 QYSVSYGDGS----FSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIV 222
Y SYG S ++ G + +ET TLGS A+ GI FGC T +G+V
Sbjct: 163 SYRYSYGLSSNPHHYTQGYMGSETFTLGSD-----AVQGIGFGC-TTMSEGGYGSGSGLV 216
Query: 223 GLGGGDISLISQMRT--------------------------------------------- 237
GLG G +SL+ Q++
Sbjct: 217 GLGRGKLSLVRQLKVGAFSYCLTSDPSTSSPLLFGAGALTGPGVQSTPLVNLKTSTFYTV 276
Query: 238 -----TIAGNQRLGVSTPDIVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSL 291
+I + G I+ DSGTTLTFL + Y ++S V G
Sbjct: 277 NLDSISIGAAKTPGTGRHGIIFDSGTTLTFLAEPAYTLAEAGLLSQTTNLTRVPGTDG-Y 335
Query: 292 ELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQT 351
E+C+ + + P + +HF G D+ L N+F V++ + C + + + + I GNIMQ
Sbjct: 336 EVCFQTSGGAVFPSMVLHFDGGDMALKTENYFGAVNDSVSCWLVQKSPSEMSIVGNIMQM 395
Query: 352 NFLVGYDIEQQTVSFKPTDC 371
++ + YD+++ +SF+PT+C
Sbjct: 396 DYHIRYDLDKSVLSFQPTNC 415
>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
Length = 452
Score = 134 bits (336), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 108/359 (30%), Positives = 159/359 (44%), Gaps = 83/359 (23%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y + +S+GTPP A+ DTGSDL WTQC PC + C+ Q +PL+DP SST+ LPC+S
Sbjct: 96 YHMILSVGTPPLAFPAIIDTGSDLTWTQCAPC-TTACFAQPTPLYDPARSSTFSKLPCAS 154
Query: 151 SQCASLNQ--KSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA---LPGITFG 205
C +L ++C+ C Y Y G F+ G LA +T+ +G G A G+ FG
Sbjct: 155 PLCQALPSAFRACNATGCVYDYRYAVG-FTAGYLAADTLAIGDGDGDGDASSSFAGVAFG 213
Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQ------------------------------- 234
C T NGG + +GIVGLG +SL+SQ
Sbjct: 214 CSTANGGDMDGA-SGIVGLGRSALSLLSQIGVGRFSYCLRSDADAGASPILFGALANVTG 272
Query: 235 --------MRTTIAGNQR----------LGVSTPDIVIDS-----------------GTT 259
+R +A +R + V + D+ + S GTT
Sbjct: 273 DKVQSTALLRNPVAARRRAPYYYVNLTGIAVGSTDLPVTSSTFGFTAAGAGGVIVDSGTT 332
Query: 260 LTFLPQ-GY---NSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL-SQVPEVTIHFR-GA 313
T+L + GY LS + ++ V+ +LC+ + + VP + F GA
Sbjct: 333 FTYLAEAGYTMLRQAFLSQTAGLLTR--VSGAQFDFDLCFEAGAADTPVPRLVFRFAGGA 390
Query: 314 DVKLSRSNFFVKVSE-DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+ + R ++F V E V + T V + GN+MQ + V YD++ T SF P DC
Sbjct: 391 EYAVPRQSYFDAVDEGGRVACLLVLPTRGVSVIGNVMQMDLHVLYDLDGATFSFAPADC 449
>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
Length = 496
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 75/207 (36%), Positives = 113/207 (54%), Gaps = 28/207 (13%)
Query: 29 FSVELIHRDSPK-SPFYNSSETPYQRLRDALTRSLNRLNHFNQN---------------- 71
+SV+L+HRDS N++ + +RL + L R R+ Q
Sbjct: 71 WSVQLVHRDSLLFKGAANATASYERRLEEKLRREAARVRALEQRIERKLKLKKDPAGSYE 130
Query: 72 --SSISSSKASQ--ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
+ +++ S+ + + + Y RI IGTP E+ V DTGSD++W QCEPC +C
Sbjct: 131 NVAGVTAEFGSEVVSGMEQGSGEYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPC--REC 188
Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETV 187
Y Q P+F+P S ++ ++ C S+ C+ L+ C G C Y VSYGDGS++ G+ ATET+
Sbjct: 189 YSQADPIFNPSSSVSFSTVGCDSAVCSQLDANDCHGGGCLYEVSYGDGSYTVGSYATETL 248
Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLF 214
T G+T+ Q VA+ GCG +N GLF
Sbjct: 249 TFGTTSIQNVAI-----GCGHDNVGLF 270
Score = 46.2 bits (108), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 50/124 (40%), Gaps = 4/124 (3%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
I+IDSGT +T L L + + P AD + CY ++L V P V H
Sbjct: 372 IIIDSGTAVTRLQTSAYDALRDAFIAGTQHLPRADGISIFDTCYDLSALQSVSIPAVGFH 431
Query: 310 F-RGADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
F GA L N + + S C F +++ I GNI Q V +D V F
Sbjct: 432 FSNGAGFILPAKNCLIPMDSMGTFCFAFAPADSNLSIMGNIQQQGIRVSFDSANSLVGFA 491
Query: 368 PTDC 371
C
Sbjct: 492 IDQC 495
>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
gi|194702684|gb|ACF85426.1| unknown [Zea mays]
gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
Length = 439
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 82/194 (42%), Positives = 116/194 (59%), Gaps = 11/194 (5%)
Query: 45 NSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNN--ANYLIRISIGTPPT 102
+ S T Q +R AL R ++R N +S SS A + P +L+ ++IGTPP
Sbjct: 38 DPSVTASQFVRAALHRDMHRHNARKLAAS-SSDGTVSAPVSPTTVPGEFLMTLAIGTPPL 96
Query: 103 ERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCS 162
LA+ADTGSDLIWTQC PC QC+ Q +PL++P S+T+ +LPC+SS L +C+
Sbjct: 97 PFLAIADTGSDLIWTQCAPC-SRQCFQQPTPLYNPSSSTTFSALPCNSS--LGLCAPACA 153
Query: 163 GVNCQYSVSYGDGSFSNGNLATETVTLGSTT-GQAVALPGITFGCGTNNGGLFNSKTTGI 221
C Y+++YG G ++ TET T GS+T V +PGI FGC + G S +G+
Sbjct: 154 ---CMYNMTYGSG-WTYVFQGTETFTFGSSTPADQVRVPGIAFGCSNASSGFNASSASGL 209
Query: 222 VGLGGGDISLISQM 235
VGLG G +SL+SQ+
Sbjct: 210 VGLGRGSLSLVSQL 223
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 70/136 (51%), Gaps = 16/136 (11%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ----VPE 305
++IDSGTT+T L + + + S++ P D + + L+LC+ S + +P
Sbjct: 303 LIIDSGTTITMLGNTAYQQVRAAVLSLVTL-PTTDGSAATGLDLCFELPSSTSAPPSMPS 361
Query: 306 VTIHFRGADVKLSRSNFFVKVSEDIV-----CSVFKGITNS----VPIYGNIMQTNFLVG 356
+T+HF GAD+ L N+ + +S+ C + T++ V I GN Q N +
Sbjct: 362 MTLHFDGADMVLPADNYMMSLSDPDSDSSLWCLAMQNQTDTDGVVVSILGNYQQQNMHIL 421
Query: 357 YDIEQQTVSFKPTDCT 372
YD+ ++T+SF P C+
Sbjct: 422 YDVGKETLSFAPAKCS 437
>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
Length = 475
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 83/231 (35%), Positives = 121/231 (52%), Gaps = 36/231 (15%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS-------------- 73
G V L H D+ + + + Q L+ A RS +R++ ++
Sbjct: 44 GLRVRLTHVDA------HGNYSRLQLLQRAARRSHHRMSRLVARATGAASTSSSKAAAAG 97
Query: 74 -ISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
S K Q + N +L+ +S+GTP A+ DTGSDL+WTQC+PC +C+ Q +
Sbjct: 98 DGSGGKDLQVPVHAGNGEFLMDLSVGTPALPYAAIVDTGSDLVWTQCKPC--VECFNQTT 155
Query: 133 PLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQ--------YSVSYGDGSFSNGNLAT 184
P+FDP SSTY +LPCSS+ CA L +C+ + Y+ +YGD S + G LAT
Sbjct: 156 PVFDPAASSTYAALPCSSALCADLPTSTCASSSSSSSASSPCGYTYTYGDASSTQGVLAT 215
Query: 185 ETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
ET TL +PG+ FGCG N G ++ G+VGLG G +SL+SQ+
Sbjct: 216 ETFTLARQK-----VPGVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQL 261
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/132 (29%), Positives = 67/132 (50%), Gaps = 12/132 (9%)
Query: 252 IVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTG-SLELCYSFNSLS-------Q 302
+++DSGT++T+L + Y + + ++ M + P D + L+LC+ + + Q
Sbjct: 345 VIVDSGTSITYLELRAYRALRKAFVAHM--SLPTVDASEIGLDLCFQGPAGAVDQDVQVQ 402
Query: 303 VPEVTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQ 361
VP++ +HF G AD+ L N+ V S + + + I GN Q NF YD+
Sbjct: 403 VPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVMASRGLSIIGNFQQQNFQFVYDVAG 462
Query: 362 QTVSFKPTDCTK 373
T+SF P +C K
Sbjct: 463 DTLSFAPAECNK 474
>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 78/188 (41%), Positives = 111/188 (59%), Gaps = 9/188 (4%)
Query: 54 LRDALTRSLNRLNHFNQNSSISSSKASQADI---IPNNANYLIRISIGTPPTERLAVADT 110
+RDAL R ++R F + + S + A +PN Y++ ++IGTPP A+ADT
Sbjct: 48 VRDALRRDMHRHARFTRELASSGDRTVAAPTRKDLPNGGEYIMTLAIGTPPLSYPAIADT 107
Query: 111 GSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS--SQCASLNQKS-CSGVNCQ 167
GSDLIWTQC PC SQC+ Q ++P S+T+ LPC+S S CA+L S G +C
Sbjct: 108 GSDLIWTQCAPC-GSQCFKQAGQPYNPSSSTTFGVLPCNSSVSMCAALAGPSPPPGCSCM 166
Query: 168 YSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGG 227
Y+ +YG G ++ G + ET T GST +PGI FGC + +N + G+VGLG G
Sbjct: 167 YNQTYGTG-WTAGIQSVETFTFGSTPADQTRVPGIAFGCSNASSDDWNG-SAGLVGLGRG 224
Query: 228 DISLISQM 235
+SL+SQ+
Sbjct: 225 SMSLVSQL 232
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 42/131 (32%), Positives = 71/131 (54%), Gaps = 9/131 (6%)
Query: 249 TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ---- 302
T ++IDSGTT+T L + + + S++ PVAD + S L+LC++ S +
Sbjct: 312 TGGLIIDSGTTITSLVDAAYQQVRAAIESLVTL-PVADGSDSTGLDLCFALTSETSTPPS 370
Query: 303 VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGIT-NSVPIYGNIMQTNFLVGYDIEQ 361
+P +T HF GAD+ L N+ + + + C + T ++ +GN Q N + YDI +
Sbjct: 371 MPSMTFHFDGADMVLPVDNYMI-LGSGVWCLAMRNQTVGAMSTFGNYQQQNVHLLYDIHE 429
Query: 362 QTVSFKPTDCT 372
+T+SF P C+
Sbjct: 430 ETLSFAPAKCS 440
>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
Length = 476
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 104/355 (29%), Positives = 148/355 (41%), Gaps = 79/355 (22%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ +++ + GTP + DTGSD+ W QC PC CY Q P+FDP S+TY +
Sbjct: 131 DTLEFVVTVGFGTPAQTYTVIFDTGSDVSWIQCLPCS-GHCYKQHDPIFDPTKSATYSVV 189
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
PC QCA+ + CS C Y V YGDGS S G L+ ET++L ST ALPG FGC
Sbjct: 190 PCGHPQCAAADGSKCSNGTCLYKVEYGDGSSSAGVLSHETLSLTSTR----ALPGFAFGC 245
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGN--------------QRLGVSTP-- 250
G N G F G++GLG G +SL SQ + G +G +TP
Sbjct: 246 GQTNLGDFG-DVDGLIGLGRGQLSLSSQAAASFGGTFSYCLPSDNTTHGYLTIGPTTPAS 304
Query: 251 -------------------------------------DIVIDSGT------TLTFL-PQG 266
+ D GT LT+L P+
Sbjct: 305 NDDVQYTAMVQKQDYPSFYFVELVSIDIGGYILPVPPTLFTDDGTFLDSGTILTYLPPEA 364
Query: 267 YNS--NLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFF- 323
Y + + + + P DP + CY F S + + F+ +D + +FF
Sbjct: 365 YTALRDRFKFTMTQYKPAPAYDP---FDTCYDFTGQSAIFIPAVSFKFSDGSVFDLSFFG 421
Query: 324 VKVSED-----IVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+ + D I C F +++P I GN+ Q N V YD+ + + F C
Sbjct: 422 ILIFPDDTAPAIGCLGFVARPSAMPFTIVGNMQQRNTEVIYDVAAEKIGFASASC 476
>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
Length = 774
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 79/192 (41%), Positives = 107/192 (55%), Gaps = 15/192 (7%)
Query: 55 RDALTRSLNRLNHFNQNSSISSSKAS---QADIIPNNANYLIRISIGTPPTERLAVADTG 111
R+ L R RL F+ + +S++ A+ +P+ YL+ ++IGTPP + DTG
Sbjct: 378 REVLHRMAARL-LFSASGRAASARVDPGPYANGVPDT-EYLVHLAIGTPPQPVQLILDTG 435
Query: 112 SDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-----C 166
SDL+WTQC PCP C+ + DP SST+ LPCSS C +L SC N C
Sbjct: 436 SDLVWTQCRPCP--VCFSRALGPLDPSNSSTFDVLPCSSPVCDNLTWSSCGKHNWGNQTC 493
Query: 167 QYSVSYGDGSFSNGNLATETVTLGST--TGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
Y +Y DGS + G+L ET T + TGQA +P + FGCG N G+F S TGI G
Sbjct: 494 VYVYAYADGSITTGHLDAETFTFAAADGTGQAT-VPDLAFGCGLFNNGIFTSNETGIAGF 552
Query: 225 GGGDISLISQMR 236
G G +SL SQ++
Sbjct: 553 GRGALSLPSQLK 564
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 39/130 (30%), Positives = 62/130 (47%), Gaps = 11/130 (8%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL--ELCYSF----NSLSQVPEV 306
+IDSGT +T LPQ + ++ + PV + T S LC+SF + VP++
Sbjct: 646 IIDSGTGMTTLPQDAYKLVHDAFTAQVRL-PVDNATSSSLSRLCFSFSVPRRAKPDVPKL 704
Query: 307 TIHFRGADVKLSRSNFFVKVSE---DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 363
+HF GA + L R N+ + + + C + + I GN Q N V YD+ +
Sbjct: 705 VLHFEGATLDLPRENYMFEFEDAGGSVTCLAINA-GDDLTIIGNYQQQNLHVLYDLVRNM 763
Query: 364 VSFKPTDCTK 373
+SF P C +
Sbjct: 764 LSFVPAQCNR 773
>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
Length = 451
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 87/245 (35%), Positives = 123/245 (50%), Gaps = 20/245 (8%)
Query: 1 MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
M+ L+ I L +P T +L H D + T ++RL R
Sbjct: 6 MSELLAYALIFTLLFTAAATPTAGLT--MRADLTHVDKGRG------FTRWERLSRMAVR 57
Query: 61 SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVA-DTGSDLIWTQC 119
S R Q + A +P++ YLI +IGTP +R+A+ DTGSDL+WTQC
Sbjct: 58 SRARAASLYQRGG-HYGQPVTATAVPSSGEYLIHFNIGTPRPQRVALTMDTGSDLVWTQC 116
Query: 120 EPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC---ASLNQKSCS--GVNCQYSVSYGD 174
PCP C+ Q PLFDP +SST++++ C C + L+ +C+ C Y SYGD
Sbjct: 117 TPCP--VCFDQPFPLFDPSVSSTFRAVACPDPICRPSSGLSVSACALKTFRCFYLCSYGD 174
Query: 175 GSFSNGNLATETVTLGSTTGQA---VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISL 231
S + G + +T T S G+ VA+ G+ FGCG N G+F S +GI G G G +SL
Sbjct: 175 KSITAGYIFKDTFTFMSPNGEGAPPVAVSGLAFGCGDYNTGVFASNESGIAGFGRGPLSL 234
Query: 232 ISQMR 236
SQ+R
Sbjct: 235 PSQLR 239
Score = 46.6 bits (109), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 34/127 (26%), Positives = 56/127 (44%), Gaps = 7/127 (5%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE--LCYSFNSLSQ---VPEVT 307
VIDSGT +T P L + + + P D T + LC+ + VP++
Sbjct: 325 VIDSGTGVTTFPAAVFEQLKNEFVAQLPL-PRYDNTSEVGNLLCFQRPKGGKQVPVPKLI 383
Query: 308 IHFRGADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
H AD+ L R N+ + ++ ++C + G + + GN Q N + YD+E + F
Sbjct: 384 FHLASADMDLPRENYIPEDTDSGVMCLMINGAEVDMVLIGNFQQQNMHIVYDVENSKLLF 443
Query: 367 KPTDCTK 373
C K
Sbjct: 444 ASAQCDK 450
>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
Length = 325
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 92/324 (28%), Positives = 138/324 (42%), Gaps = 66/324 (20%)
Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--KSCSGVNC 166
DTGSD+ W QC+PCP QCY Q LF P S+TYK LPC+S+ C L SC +C
Sbjct: 6 DTGSDITWIQCDPCP--QCYKQQDSLFQPAGSATYKPLPCNSTMCQQLQSFSHSCLNSSC 63
Query: 167 QYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGG 226
Y VSYGD S + G+ A ET+TL S V++P FGCG N GLFN G++GLG
Sbjct: 64 NYMVSYGDKSTTRGDFALETLTLRSDDTILVSVPNFAFGCGHANKGLFNG-AAGLMGLGK 122
Query: 227 GDISLISQMRTTIAGNQRLGVSTPDI-------------------------VIDS----- 256
I +Q T++A + P + ++DS
Sbjct: 123 SSIGFPAQ--TSVAFGKVFSYCLPSVSSTIPSGILHFGEAAMLDYDVRFTPLVDSSSGPS 180
Query: 257 --------------------------GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS 290
GT ++ Q L + ++ A
Sbjct: 181 QYFVSMTGINVGDELLPISATVMVDSGTVISRFEQSAYERLRDAFTQILPGLQTAVSVAP 240
Query: 291 LELCYSFNSLSQ--VPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGN 347
+ C+ +++ +P +T+HFR A+++LS + V + ++C F ++ + GN
Sbjct: 241 FDTCFRVSTVDDINIPLITLHFRDDAELRLSPVHILYPVDDGVMCFAFAPSSSGRSVLGN 300
Query: 348 IMQTNFLVGYDIEQQTVSFKPTDC 371
Q N YDI + + +C
Sbjct: 301 FQQQNLRFVYDIPKSRLGISAFEC 324
>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
Length = 423
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 121/429 (28%), Positives = 177/429 (41%), Gaps = 98/429 (22%)
Query: 34 IHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSS---KASQADIIPN--- 87
+HRDS SP+ ++ T + +R+ L R RL + S+ + K+S + + N
Sbjct: 1 MHRDSADSPYRPANATVHGLVRNRLHRDELRLLSISSRISLGVAGIPKSSLTNPLKNTNP 60
Query: 88 -----------------NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
+ Y + + +GTPP VADTGSD++W QC PC CY Q
Sbjct: 61 FLQQDFETPLRSGLSDGSGEYFVSLGVGTPPRTVNMVADTGSDVLWLQCLPC--QSCYGQ 118
Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLG 190
PLF+P SST++S+ C SS C L + C C Y VSYGDGSF+ G +TET++ G
Sbjct: 119 TDPLFNPSFSSTFQSITCGSSLCQQLLIRGCRRNQCLYQVSYGDGSFTVGEFSTETLSFG 178
Query: 191 STTGQAVALPGITFGCGTNNGGLFNS---------------------------------K 217
S +VA+ GCG NN GLF +
Sbjct: 179 SNAVNSVAI-----GCGHNNQGLFTGAAGLLGLGKGLLSFPSQVGQLYGSVFSYCLPTRE 233
Query: 218 TTGIVGLGGGDISLISQMR-TTIAGNQRL----------------GVSTP---------- 250
+TG V L G+ ++ S + TT+ N +L VS P
Sbjct: 234 STGSVPLIFGNQAVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVSIPAGSLSLDSST 293
Query: 251 ---DIVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--P 304
+++DSGT +T L YN + + M + + CY + S + P
Sbjct: 294 GNGGVILDSGTAVTRLVTSAYNPMRDAFRAGMPSDAKMTSGFSLFDTCYDLSGRSSIMLP 353
Query: 305 EVTIHFR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 362
V+ F GA + L N V V + C F + + I GNI Q +F + +D
Sbjct: 354 AVSFVFNGGATMALPAQNIMVPVDNSGTYCLAFAPNSENFSIIGNIQQQSFRMSFDSTGN 413
Query: 363 TVSFKPTDC 371
V C
Sbjct: 414 RVGIGANQC 422
>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
Length = 490
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 75/230 (32%), Positives = 119/230 (51%), Gaps = 27/230 (11%)
Query: 22 IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS-------- 73
+ A + + L+HRD + ++ TP Q L L R + R ++
Sbjct: 61 VAASSSTLHIRLLHRDR-----FAANATPAQLLARRLQRDVLRAAWIISKAAANGTPPPV 115
Query: 74 --ISSSKASQADII---PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
+SS++ A ++ P + Y+ +I++GTP E L DT SDL W QC+PC +CY
Sbjct: 116 AGLSSARGFVAPVVSRAPTSGEYIAKIAVGTPGVEALLALDTASDLTWLQCQPC--RRCY 173
Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSVSYGDGSFSNGNLATE 185
Q P+FDP+ S++Y+ + +++ C +L + C Y+V YGDGS + G+ E
Sbjct: 174 PQSGPVFDPRHSTSYREMSFNAADCQALGRSGGGDAKRGTCVYTVGYGDGSTTVGDFIEE 233
Query: 186 TVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
T+T V LP I+ GCG +N GLF + GI+GLG G +S +Q+
Sbjct: 234 TLTFAG----GVRLPRISIGCGHDNKGLFGAPAAGILGLGRGLMSFPNQI 279
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 65/129 (50%), Gaps = 11/129 (8%)
Query: 252 IVIDSGTTLTFLPQ----GYNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLSQVPE 305
+++DSGT +T L + + +V + + + P+G + CY+ + +VP
Sbjct: 364 VIVDSGTAVTRLARPAYTAFRDAFRAVAVDLGQVS-IGGPSGFFDTCYTVGGRGMKKVPT 422
Query: 306 VTIHFRGA-DVKLSRSNFFVKV-SEDIVCSVFKGI-TNSVPIYGNIMQTNFLVGYDIEQQ 362
V++HF G+ +VKL N+ + V S VC F +SV I GNI Q F + YDI +
Sbjct: 423 VSMHFAGSVEVKLQPKNYLIPVDSMGTVCFAFAATGDHSVSIIGNIQQQGFRIVYDIGGR 482
Query: 363 TVSFKPTDC 371
V F P C
Sbjct: 483 -VGFAPNSC 490
>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
Length = 485
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 79/221 (35%), Positives = 113/221 (51%), Gaps = 26/221 (11%)
Query: 33 LIHRDSPKSPFYNSSETPYQRLRDA--LTRSLNRLNHFNQN--------SSISSSKAS-- 80
++HR P SP + A L R R++ ++ S + ++AS
Sbjct: 73 VVHRHGPCSPVQARRRGGGGAVTHAEILERDQARVDSIHRKVAGAGGAPSVVDPARASEQ 132
Query: 81 ------QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
Q I NY++ + +GTP + + DTGSDL W QC+PC + CY Q PL
Sbjct: 133 GVSLPAQRGISLGTGNYVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPC--ADCYEQQDPL 190
Query: 135 FDPKMSSTYKSLPCSSSQCASLNQKSCSG-VNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
FDP +SSTY ++ C + +C L+ CS C+Y V YGD S ++GNL +T+TL ++
Sbjct: 191 FDPSLSSTYAAVACGAPECQELDASGCSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASD 250
Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
LPG FGCG N GLF + G+ GLG +SL SQ
Sbjct: 251 ----TLPGFVFGCGDQNAGLFG-QVDGLFGLGREKVSLPSQ 286
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/125 (30%), Positives = 57/125 (45%), Gaps = 5/125 (4%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLSQVPEVTIHF 310
VIDSGT +T LP + L + + + A L+ CY F + +Q+P V + F
Sbjct: 361 VIDSGTVITRLPPRAYAPLRAAFARSMAQYKKAPALSILDTCYDFTGHRTAQIPTVELAF 420
Query: 311 RG-ADVKLSRSN--FFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
G A V L + + KVS+ + +S+ I GN Q F V YD+ Q + F
Sbjct: 421 AGGATVSLDFTGVLYVSKVSQACLAFAPNADDSSIAILGNTQQKTFAVTYDVANQRIGFG 480
Query: 368 PTDCT 372
C+
Sbjct: 481 AKGCS 485
>gi|147801191|emb|CAN68822.1| hypothetical protein VITISV_007106 [Vitis vinifera]
Length = 443
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 100/340 (29%), Positives = 159/340 (46%), Gaps = 32/340 (9%)
Query: 43 FYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQA-DIIPNNANYLIRISIGTPP 101
+Y+ + T R A RS+ LN+ +S SSS + ++P Y++ +G P
Sbjct: 8 YYDHNMTSTDRSIWAADRSIAXLNYLLSVTSSSSSLGDISSKLVPEYYEYIMMYYLGVPS 67
Query: 102 TERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC 161
T +ADTGS+LIW QC PC + CY Q P+FDP S TY+++ S C ++ + SC
Sbjct: 68 TLVYGIADTGSELIWLQCLPC--THCYNQTPPIFDPAESYTYETVSSDSPICNAVRRISC 125
Query: 162 --SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTT 219
+C Y +YGDG+ + G L+T+ T V + +TFGC +
Sbjct: 126 REGDKSCCYQHTYGDGTTTKGTLSTDVFAFEDPTRTIVEVGYLTFGCSHDTKARLKGHQA 185
Query: 220 GIVGLGGGDISLISQMRTTI-----------AGNQRLGVSTPDIVIDSGTTLTFLPQGYN 268
G+VGL SL+SQ++ R+ + +++ T L L Y+
Sbjct: 186 GVVGLNRHPNSLVSQLKVKKFSYCMVIPDDHGSGSRMYFGSRAVILGGKTPL--LKGDYS 243
Query: 269 SNLLSVMS-SMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVS 327
+++ S+ E + +D S P++T HF GAD L++ +V+V
Sbjct: 244 HYFVTLKGISVGEEKGRSDELASAG-----------PDITFHFYGADFILTKXTTYVEVE 292
Query: 328 EDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
+ + C T + I GNI Q N+ VGYD+E Q V+
Sbjct: 293 KGLWCLAMLSSNSTRKLSILGNIQQQNYHVGYDLEAQEVA 332
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 53/112 (47%), Gaps = 3/112 (2%)
Query: 125 SQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC--SGVNCQYSVSYGDGSFS-NGN 181
+QC+ Q P+FDP SSTY ++P + C +C +C Y +SYG GS S G
Sbjct: 332 AQCFNQTPPIFDPSKSSTYSTVPWDAPTCYQAGGYACHIDEEDCCYRISYGSGSTSTEGT 391
Query: 182 LATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLIS 233
++ + V + + FGC G F GIVGL +SL+S
Sbjct: 392 ISIDAFAFEDNRQNMVDVXHLVFGCSDYTTGTFKGYEVGIVGLNQDSLSLVS 443
>gi|326490656|dbj|BAJ89995.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 127/418 (30%), Positives = 180/418 (43%), Gaps = 79/418 (18%)
Query: 22 IEAQTGGFSVELIHRDSPKS--PFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKA 79
+ +G +V L HR P S P N+ RD L + + N S +
Sbjct: 50 VAPSSGVVTVPLHHRHGPCSTVPSTNAPTLEDMLRRDQLRAAYITRKYSGVNGSAGDVEG 109
Query: 80 SQADIIP------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP 133
S + + YLI + +G+P + + DTGSD+ W QC+PC SQC+ Q
Sbjct: 110 SDVTVPTTLGTSLDTLEYLITVGMGSPAVAQTMLIDTGSDVSWVQCKPC--SQCHSQADS 167
Query: 134 LFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
LFDP SSTY + C+S+ CA L Q+ CS CQY+V YGDGS +G +++T+ LGS+T
Sbjct: 168 LFDPSSSSTYSAFSCTSAACAQLRQRGCSSSQCQYTVKYGDGSTGSGTYSSDTLALGSST 227
Query: 194 GQAVALPGITFGCG-TNNGGLFNSKTTGIVGLGGGDISLISQMRTTI------------- 239
+ FGC + +G L +T G++GLGGG SL +Q T
Sbjct: 228 -----VENFQFGCSQSESGNLLQDQTAGLMGLGGGAESLATQTAGTFGKAFSYCLPPTPG 282
Query: 240 -AGNQRLGVSTPDIVI-----------------------------------------DSG 257
+G LG ST V+ DSG
Sbjct: 283 SSGFLTLGASTSGFVVKTPMLRSTQVPSYYGVLLQAIRVGGRQLNIPASAFSAGSIMDSG 342
Query: 258 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRGADV 315
T +T LP+ S L S + ++ P A P G + C+ F+ S V P V + F G V
Sbjct: 343 TIITRLPRTAYSALSSAFKAGMKQYPPAQPMGIFDTCFDFSGQSSVSIPTVALVFSGGAV 402
Query: 316 KLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
S+ + S C F ++ S+ I GN+ Q F V YD+ V FK C
Sbjct: 403 VDLASDGIILGS----CLAFAANSDDTSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 456
>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
Japonica Group]
Length = 446
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 80/213 (37%), Positives = 115/213 (53%), Gaps = 21/213 (9%)
Query: 31 VELIHRDS--PKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP-N 87
V + HRD+ P P QRL R + ++ + S S IP
Sbjct: 29 VPVFHRDALFPPPPGAKRGSLLRQRLAADAARYASLVDATGRLHSPVFSG------IPFE 82
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y + +GTP T+ + V DTGSDL+W QC PC +CY Q +FDP+ SSTY+ +P
Sbjct: 83 SGEYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPC--RRCYAQRGQVFDPRRSSTYRRVP 140
Query: 148 CSSSQCASLNQKSC-----SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
CSS QC +L C +G C+Y V+YGDGS S G+LAT+ + + T + +
Sbjct: 141 CSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGDLATDKLAFANDT----YVNNV 196
Query: 203 TFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
T GCG +N GLF+S G++G+G G IS+ +Q+
Sbjct: 197 TLGCGRDNEGLFDS-AAGLLGVGRGKISISTQV 228
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 32/134 (23%), Positives = 59/134 (44%), Gaps = 13/134 (9%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS---LELCYSFNS--LSQVPEV 306
+V+DSGT ++ + + L + A + G + CY + P +
Sbjct: 312 VVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRGRPAASAPLI 371
Query: 307 TIHFRG-ADVKLSRSNFFV-------KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYD 358
+HF G AD+ L N+F+ + + C F+ + + + GN+ Q F V +D
Sbjct: 372 VLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDGLSVIGNVQQQGFRVVFD 431
Query: 359 IEQQTVSFKPTDCT 372
+E++ + F P CT
Sbjct: 432 VEKERIGFAPKGCT 445
>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
gi|224033441|gb|ACN35796.1| unknown [Zea mays]
gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
Length = 456
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 69/154 (44%), Positives = 86/154 (55%), Gaps = 7/154 (4%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
YL+ +++GTPP DTGSDL+WTQC PC C+ Q PL DP SSTY +LPC
Sbjct: 85 EYLVHLAVGTPPRPVALTLDTGSDLVWTQCAPC--RDCFDQGIPLLDPAASSTYAALPCG 142
Query: 150 SSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST-----TGQAVALPGITF 204
+ +C +L SC G +C Y YGD S + G +AT+ T G G A +TF
Sbjct: 143 APRCRALPFTSCGGRSCVYVYHYGDKSVTVGKIATDRFTFGDNGRRNGDGSLPATRRLTF 202
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
GCG N G+F S TGI G G G SL SQ+ T
Sbjct: 203 GCGHFNKGVFQSNETGIAGFGRGRWSLPSQLNAT 236
>gi|357166728|ref|XP_003580821.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 89/216 (41%), Positives = 124/216 (57%), Gaps = 18/216 (8%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRL--RD-ALTRSLNRLNHFNQNSSISSSKASQADIIP 86
S++L+HRD+ + S L RD A L R + + S +SS S I+
Sbjct: 58 SLQLLHRDTVSGTKHPSRRHAVLALASRDTARVAYLQRRLSPSPSPSSTSSVESGGTIVS 117
Query: 87 N-NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKS 145
+ + YL+R+ IG+PP E+ VADTGSD+IW QC PC S CY Q PLFDP S+++
Sbjct: 118 HGSGEYLVRVGIGSPPLEQHLVADTGSDVIWVQCSPC--SDCYAQGDPLFDPANSASFSP 175
Query: 146 LPCSSSQCASLNQ-----KSCSGVNCQYSVSYGDGSFSNGNLATETVTL-GSTTGQAVAL 199
+PC+S C + + G C+Y VSYGD S++NG LA ET+TL G T Q VA+
Sbjct: 176 VPCNSGVCRAAARYSSSSCGGGGGECEYKVSYGDKSYTNGVLALETLTLDGGTEVQGVAM 235
Query: 200 PGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
GCG N GLF ++ G++GLG G +SL+ Q+
Sbjct: 236 -----GCGHENRGLF-AEAAGLLGLGWGPMSLVGQL 265
Score = 46.2 bits (108), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 34/131 (25%), Positives = 53/131 (40%), Gaps = 11/131 (8%)
Query: 252 IVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTI 308
+V+D+GT +T LP + Y + + + E P A + CY + + +VP V +
Sbjct: 349 VVMDTGTAVTRLPAEAYAALRGAFAGAFEEGAPRAPGVSLFDTCYDLSGYASVRVPTVAL 408
Query: 309 HF-------RGADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 360
+F A + L N V V + C F + + I GNI Q + D
Sbjct: 409 YFGGGGQGQEAASLTLPARNLLVPVDDGGTYCLAFAAVASGPSILGNIQQQGIEITVDSA 468
Query: 361 QQTVSFKPTDC 371
V F P C
Sbjct: 469 SGYVGFGPATC 479
>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
Length = 485
Score = 131 bits (330), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 79/221 (35%), Positives = 113/221 (51%), Gaps = 26/221 (11%)
Query: 33 LIHRDSPKSPFYNSSETPYQRLRDA--LTRSLNRLNHFNQN--------SSISSSKAS-- 80
++HR P SP + A L R R++ ++ S + ++AS
Sbjct: 73 VVHRHGPCSPVQARPRGGGGAVTHAEILERDQARVDSIHRKVAGAGGAPSVVDPARASEQ 132
Query: 81 ------QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
Q I NY++ + +GTP + + DTGSDL W QC+PC + CY Q PL
Sbjct: 133 GVSLPAQRGISLGTGNYVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPC--ADCYEQQDPL 190
Query: 135 FDPKMSSTYKSLPCSSSQCASLNQKSCSG-VNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
FDP +SSTY ++ C + +C L+ CS C+Y V YGD S ++GNL +T+TL ++
Sbjct: 191 FDPSLSSTYAAVACGAPECQELDASGCSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASD 250
Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
LPG FGCG N GLF + G+ GLG +SL SQ
Sbjct: 251 ----TLPGFVFGCGDQNAGLFG-QVDGLFGLGREKVSLPSQ 286
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 38/125 (30%), Positives = 57/125 (45%), Gaps = 5/125 (4%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLSQVPEVTIHF 310
VIDSGT +T LP + L + + + A L+ CY F + +Q+P V + F
Sbjct: 361 VIDSGTVITRLPPRAYAPLRAAFARSMAQYKKAPALSILDTCYDFTGHRTAQIPTVELAF 420
Query: 311 RG-ADVKLSRSN--FFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
G A V L + + KVS+ + +S+ I GN Q F V YD+ Q + F
Sbjct: 421 AGGATVSLDFTGVLYVSKVSQACLAFAPNADDSSIAILGNTQQKTFAVAYDVANQRIGFG 480
Query: 368 PTDCT 372
C+
Sbjct: 481 AKGCS 485
>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 510
Score = 131 bits (330), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 102/372 (27%), Positives = 159/372 (42%), Gaps = 81/372 (21%)
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
++ + + YL+ + +GTPP + DTGSDL W QC PC C+ Q P+FDP S
Sbjct: 140 ESGVAVGSGEYLVEVYVGTPPRRFQMIMDTGSDLNWLQCAPC--LDCFDQRGPVFDPMAS 197
Query: 141 STYKSLPCSSSQC-------ASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
++Y+++ C ++C A +S C Y YGD S + G+LA E T+ T
Sbjct: 198 TSYRNVTCGDTRCGLVSPPAAPRTCRSSRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTA 257
Query: 194 GQAVALPGITFGCGTNNGGLFN----------------SKTTGIVGLG------------ 225
+ + G+ GCG N GLF+ S+ + G
Sbjct: 258 SSSRRVDGVVLGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHAFSYCLVDHGSAV 317
Query: 226 ------GGDISLIS---------------------QMRTTIAGNQRL-------GVSTPD 251
G D L+S Q++ + G + L GVS D
Sbjct: 318 GSKIVFGDDNVLLSHPQLNYTAFAPSAAENTFYYVQLKGILVGGEMLDIPSNTWGVSKED 377
Query: 252 ----IVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVP 304
+IDSGTTL++ P+ Y + + + M +A P+ L CY+ + + +VP
Sbjct: 378 GSGGTIIDSGTTLSYFPEPAYKAIRQAFVDRMDKAYPLIADFPVLSPCYNVSGVERVEVP 437
Query: 305 EVTIHFR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQ 361
E ++ F GA N+F+++ +E I+C G S + I GN Q NF V YD+
Sbjct: 438 EFSLLFADGAVWDFPAENYFIRLDTEGIMCLAVLGTPRSAMSIIGNYQQQNFHVLYDLHH 497
Query: 362 QTVSFKPTDCTK 373
+ F P C +
Sbjct: 498 NRLGFAPRRCAE 509
>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
gi|194705620|gb|ACF86894.1| unknown [Zea mays]
gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
Length = 477
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 83/233 (35%), Positives = 121/233 (51%), Gaps = 24/233 (10%)
Query: 18 VVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSS 77
V +P +A ++ ++H P SP + P + L R +R++ + + ++
Sbjct: 52 VCTPTKAAPSSSALTVVHGHGPCSPQESRRGAPSHT--EILGRDQDRVDAIRRKVAAVTT 109
Query: 78 KASQADI--IP---------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
AS + +P + NY + +GTP T+ L DTGSD W QC+PCP
Sbjct: 110 AASSSKPKGVPLQVGWGKYLDTTNYFTSLRLGTPATDLLVELDTGSDQSWIQCKPCP--D 167
Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASL---NQKSCSG-VNCQYSVSYGDGSFSNGNL 182
CY Q LFDP SSTY + CSS +C L ++ +CS C Y ++Y D S++ GNL
Sbjct: 168 CYEQHEALFDPSKSSTYSDITCSSRECQELGSSHKHNCSSDKKCPYEITYADDSYTVGNL 227
Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
A +T+TL T A+PG FGCG NN G F + G++GLG G SL SQ+
Sbjct: 228 ARDTLTLSPTD----AVPGFVFGCGHNNAGSFG-EIDGLLGLGRGKASLSSQV 275
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 34/125 (27%), Positives = 53/125 (42%), Gaps = 6/125 (4%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLSQVPEVTIHF 310
+IDSGT + LP + L S + S + A + + CY + ++P V + F
Sbjct: 352 IIDSGTAFSCLPPSAYAALRSSVRSAMGRYKRAPSSTIFDTCYDLTGHETVRIPSVALVF 411
Query: 311 R-GADVKLSRSNF---FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
GA V L S + VS+ + + S+ + GN Q V YD++ Q V F
Sbjct: 412 ADGATVHLHPSGVLYTWSNVSQTCLAFLPNPDDTSLGVLGNTQQRTLAVIYDVDNQKVGF 471
Query: 367 KPTDC 371
C
Sbjct: 472 GANGC 476
>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
Length = 516
Score = 131 bits (329), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 69/140 (49%), Positives = 89/140 (63%), Gaps = 8/140 (5%)
Query: 97 IGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL 156
IGTP A+ DTGSDL+WTQC+PC C+ Q +P+FDP SSTY ++PCSS+ C+ L
Sbjct: 173 IGTPALAYSAIVDTGSDLVWTQCKPC--VDCFKQSTPVFDPSSSSTYATVPCSSASCSDL 230
Query: 157 NQKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFN 215
C S C Y+ +YGD S + G LATET TL + LPG+ FGCG N G
Sbjct: 231 PTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK-----LPGVVFGCGDTNEGDGF 285
Query: 216 SKTTGIVGLGGGDISLISQM 235
S+ G+VGLG G +SL+SQ+
Sbjct: 286 SQGAGLVGLGRGPLSLVSQL 305
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 45/133 (33%), Positives = 67/133 (50%), Gaps = 11/133 (8%)
Query: 249 TPDIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTG-SLELCYSFNSLS----Q 302
T +++DSGT++T+L QGY + + + M A P AD +G L+LC+ + +
Sbjct: 386 TGGVIVDSGTSITYLEVQGYRALKKAFAAQM--ALPAADGSGVGLDLCFRAPAKGVDQVE 443
Query: 303 VPEVTIHFRG-ADVKLSRSNFFV-KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 360
VP + HF G AD+ L N+ V +C G + + I GN Q NF YD+
Sbjct: 444 VPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMG-SRGLSIIGNFQQQNFQFVYDVG 502
Query: 361 QQTVSFKPTDCTK 373
T+SF P C K
Sbjct: 503 HDTLSFAPVQCNK 515
>gi|224091849|ref|XP_002309371.1| predicted protein [Populus trichocarpa]
gi|222855347|gb|EEE92894.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 131 bits (329), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 116/410 (28%), Positives = 172/410 (41%), Gaps = 86/410 (20%)
Query: 32 ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNH--------FNQNSSISSSKASQAD 83
+LIHRDS SP+Y S++T R + SL RL++ F+ N + S ++
Sbjct: 40 KLIHRDSIVSPYYRSNDTVADRTERTMKASLARLSYLYAKIERDFDINDLWLNLHPSASE 99
Query: 84 IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQD-SPLFDPKMSST 142
+ +L+ S+G PP +LA+ DTGS L+W QC PC C Q P+FDP +SST
Sbjct: 100 PL-----FLVNFSMGQPPVPQLAIMDTGSSLLWIQCAPC--KSCSQQIIGPMFDPSISST 152
Query: 143 YKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
Y SL C + C C S C Y+ +Y +G S G +ATE + GS+ A+
Sbjct: 153 YDSLSCKNIICRYAPSGECDSSSQCVYNQTYVEGLPSVGVIATEQLIFGSSDEGRNAVNN 212
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA--------------------G 241
+ FGC NG + + TG+ GLG G S+++QM + + G
Sbjct: 213 VLFGCSHRNGNYKDRRFTGVFGLGSGITSVVNQMGSKFSYCIGNIADPDYSYNQLVLSEG 272
Query: 242 NQRLGVSTP------------------------------------DIVIDSGTTLTFLPQ 265
G STP ++IDSGT T+L +
Sbjct: 273 VNMEGYSTPLDVVDGHYQVILEGISVGETRLVIDPSAFKRTEKQRRVIIDSGTAPTWLAE 332
Query: 266 GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN---SLSQVPEVTIHF-RGADVKLSRSN 321
L + ++++ S LCY L P VT HF GAD+
Sbjct: 333 NEYRALEREVRNLLDRFLTPFMRESF-LCYKGKVGQDLVGFPAVTFHFAEGADL------ 385
Query: 322 FFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
V +E SV+ + G + Q + V YD+ + + F+ DC
Sbjct: 386 --VVDTEMRQASVYGKDFKDFSVIGLMAQQYYNVAYDLNKHKLFFQRIDC 433
>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
Length = 543
Score = 131 bits (329), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 83/205 (40%), Positives = 115/205 (56%), Gaps = 32/205 (15%)
Query: 54 LRDALTRSLNRLNHF-----NQNSSISSSKASQADIIPNNA------NYLIRISIG---- 98
LR L +R N F N ++ +S+++ A++ + NY+ I++G
Sbjct: 137 LRRLLAADESRANSFQLRIRNDRAAAASTQSGSAEVPLTSGIRFQTLNYVTTIALGGGSS 196
Query: 99 -TPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-ASL 156
+P + DTGSDL W QC+PC S CY Q PLFDP S+TY ++ C++S C ASL
Sbjct: 197 GSPAANLTVIVDTGSDLTWVQCKPC--SACYAQRDPLFDPAGSATYAAVRCNASACAASL 254
Query: 157 NQK-----SCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
SC G N C Y+++YGDGSFS G LAT+TV LG + L G FGCG +
Sbjct: 255 KAATGTPGSCGGGNERCYYALAYGDGSFSRGVLATDTVALGGAS-----LDGFVFGCGLS 309
Query: 210 NGGLFNSKTTGIVGLGGGDISLISQ 234
N GLF T G++GLG ++SL+SQ
Sbjct: 310 NRGLFGG-TAGLMGLGRTELSLVSQ 333
Score = 43.9 bits (102), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 36/135 (26%), Positives = 56/135 (41%), Gaps = 9/135 (6%)
Query: 246 GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ--PVADPTGSLELCYSFNSLSQV 303
G+ +++IDSGT +T L + + + A P A L+ CY +V
Sbjct: 406 GLGASNVLIDSGTVITRLAPSVYRGVRAEFTRQFAAAGYPTAPGFSILDTCYDLTGHDEV 465
Query: 304 --PEVTIHFRG-ADVKLSRSNFFVKVSED--IVCSVFKGIT--NSVPIYGNIMQTNFLVG 356
P +T+ G A+V + + V +D VC ++ + PI GN Q N V
Sbjct: 466 KVPLLTLRLEGGAEVTVDAAGMLFVVRKDGSQVCLAMASLSYEDQTPIIGNYQQKNKRVV 525
Query: 357 YDIEQQTVSFKPTDC 371
YD + F DC
Sbjct: 526 YDTVGSRLGFADEDC 540
>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
Length = 506
Score = 130 bits (328), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 102/386 (26%), Positives = 159/386 (41%), Gaps = 78/386 (20%)
Query: 56 DALTRSLNRLNHFNQNSSISSSKASQADIIPN----NANYLIRISIGTPPTERLAVADTG 111
D +TR L+ L N ++ ++S A Q ++ + Y R+ IG+P + V DTG
Sbjct: 129 DGVTR-LD-LRPANGSAVFAASAAIQGPVVSGVGQGSGEYFSRVGIGSPARQLYMVLDTG 186
Query: 112 SDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN--CQYS 169
SD+ W QC+PC + CY Q P+FDP +S++Y ++ C S +C L+ +C C Y
Sbjct: 187 SDVTWVQCQPC--ADCYQQSDPVFDPSLSASYAAVSCDSQRCRDLDTAACRNATGACLYE 244
Query: 170 VSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLF--------------- 214
V+YGDGS++ G+ ATET+TLG +T + + GCG +N GLF
Sbjct: 245 VAYGDGSYTVGDFATETLTLGDST----PVGNVAIGCGHDNEGLFVGAAGLLALGGGPLS 300
Query: 215 -----------------NSKTTGIVGLGGGDIS-------LISQMRTTI----------A 240
+S + G G L+ RT+
Sbjct: 301 FPSQISASTFSYCLVDRDSPAASTLQFGDGAAEAGTVTAPLVRSPRTSTFYYVALSGISV 360
Query: 241 GNQRLGVSTPDIVIDS-----------GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTG 289
G Q L + +D+ GT +T L + L + P
Sbjct: 361 GGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQSAAYAALRDAFVQGAPSLPRTSGVS 420
Query: 290 SLELCYSFNSLS--QVPEVTIHFRGAD-VKLSRSNFFVKVS-EDIVCSVFKGITNSVPIY 345
+ CY + + +VP V++ F G ++L N+ + V C F +V I
Sbjct: 421 LFDTCYDLSDRTSVEVPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAAVSII 480
Query: 346 GNIMQTNFLVGYDIEQQTVSFKPTDC 371
GN+ Q V +D + V F P C
Sbjct: 481 GNVQQQGTRVSFDTARGAVGFTPNKC 506
>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 453
Score = 130 bits (328), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 88/242 (36%), Positives = 127/242 (52%), Gaps = 25/242 (10%)
Query: 8 VFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRL- 65
V I +LC V+ A G V+L H D+ K E P + L R A+ RS R
Sbjct: 9 VLIACWLCGCPVAGEAAFAGDIRVDLTHVDAGK-------ELPKRELIRRAMQRSKARAA 61
Query: 66 ------NHFNQNSSISSSKASQ-----ADIIPNNANYLIRISIGTPPTERLAVADTGSDL 114
N SI+ ++ + A + Y++ +++GTPP A+ DTGSDL
Sbjct: 62 ALSVVRNGGGFYGSIAQAREREREPGMAVRASGDLEYVLDLAVGTPPQPITALLDTGSDL 121
Query: 115 IWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYG 173
IWTQC+ C + C Q PLF P+MSS+Y+ + C+ C + SC + C Y SYG
Sbjct: 122 IWTQCDTC--TACLRQPDPLFSPRMSSSYEPMRCAGQLCGDILHHSCVRPDTCTYRYSYG 179
Query: 174 DGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLIS 233
DG+ + G ATE T S++G+ ++P + FGCGT N G N+ +GIVG G +SL+S
Sbjct: 180 DGTTTLGYYATERFTFASSSGETQSVP-LGFGCGTMNVGSLNN-ASGIVGFGRDPLSLVS 237
Query: 234 QM 235
Q+
Sbjct: 238 QL 239
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 35/132 (26%), Positives = 66/132 (50%), Gaps = 13/132 (9%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE-LCYSFNSLS--------- 301
++IDSGT LT P + ++ S + P A+ + + +C++ +++
Sbjct: 323 VIIDSGTALTLFPVAVLAEVVRAFRSQLRL-PFANGSSPDDGVCFAAPAVAAGGGRMARQ 381
Query: 302 -QVPEVTIHFRGADVKLSRSNFFVK-VSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 359
VP + HF+GAD+ L R N+ ++ +C + + GN +Q + V YD+
Sbjct: 382 VAVPRMVFHFQGADLDLPRENYVLEDHRRGHLCVLLGDSGDDGATIGNFVQQDMRVVYDL 441
Query: 360 EQQTVSFKPTDC 371
E++T+SF P +C
Sbjct: 442 ERETLSFAPVEC 453
>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
Length = 453
Score = 130 bits (328), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 88/242 (36%), Positives = 127/242 (52%), Gaps = 25/242 (10%)
Query: 8 VFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRL- 65
V I +LC V+ A G V+L H D+ K E P + L R A+ RS R
Sbjct: 9 VLIACWLCGCPVAGEAAFAGDIRVDLTHVDAGK-------ELPKRELIRRAMQRSKARAA 61
Query: 66 ------NHFNQNSSISSSKASQ-----ADIIPNNANYLIRISIGTPPTERLAVADTGSDL 114
N SI+ ++ + A + Y++ +++GTPP A+ DTGSDL
Sbjct: 62 ALSVVRNGGGFYGSIAQAREREREPGMAVRASGDLEYVLDLAVGTPPQPITALLDTGSDL 121
Query: 115 IWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYG 173
IWTQC+ C + C Q PLF P+MSS+Y+ + C+ C + SC + C Y SYG
Sbjct: 122 IWTQCDTC--TACLRQPDPLFSPRMSSSYEPMRCAGQLCGDILHHSCVRPDTCTYRYSYG 179
Query: 174 DGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLIS 233
DG+ + G ATE T S++G+ ++P + FGCGT N G N+ +GIVG G +SL+S
Sbjct: 180 DGTTTLGYYATERFTFASSSGETQSVP-LGFGCGTMNVGSLNN-ASGIVGFGRDPLSLVS 237
Query: 234 QM 235
Q+
Sbjct: 238 QL 239
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 35/132 (26%), Positives = 66/132 (50%), Gaps = 13/132 (9%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE-LCYSFNSLS--------- 301
++IDSGT LT P + ++ S + P A+ + + +C++ +++
Sbjct: 323 VIIDSGTALTLFPAAVLAEVVRAFRSQLRL-PFANGSSPDDGVCFAAPAVAAGGGRMARQ 381
Query: 302 -QVPEVTIHFRGADVKLSRSNFFVK-VSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 359
VP + HF+GAD+ L R N+ ++ +C + + GN +Q + V YD+
Sbjct: 382 VAVPRMVFHFQGADLDLPRENYVLEDHRRGHLCVLLGDSGDDGATIGNFVQQDMRVVYDL 441
Query: 360 EQQTVSFKPTDC 371
E++T+SF P +C
Sbjct: 442 ERETLSFAPVEC 453
>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 71/166 (42%), Positives = 92/166 (55%), Gaps = 11/166 (6%)
Query: 53 RLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNA----NYLIRISIGTPPTERLAVA 108
RL L R N H ++ + S A Q ++ + Y +R+ IG PP++ V
Sbjct: 107 RLDLFLKRVSNSDLHPAESKAEFESNALQGPVVSGTSQGSGEYFLRVGIGKPPSQAYVVL 166
Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQY 168
DTGSD+ W QC PC S+CY Q P+FDP S++Y + C QC SL+ C C Y
Sbjct: 167 DTGSDVSWIQCAPC--SECYQQSDPIFDPISSNSYSPIRCDEPQCKSLDLSECRNGTCLY 224
Query: 169 SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLF 214
VSYGDGS++ G ATETVTLGS + VA+ GCG NN GLF
Sbjct: 225 EVSYGDGSYTVGEFATETVTLGSAAVENVAI-----GCGHNNEGLF 265
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 50/121 (41%), Gaps = 4/121 (3%)
Query: 255 DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR--- 311
DSGT +T L L + P A+ + CY +S V T+ FR
Sbjct: 364 DSGTAVTRLRSEVYDALRDAFVKGAKGIPKANGVSLFDTCYDLSSRESVEIPTVSFRFPE 423
Query: 312 GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
G ++ L N+ + V S C F T+S+ I GN+ Q VG+DI V F
Sbjct: 424 GRELPLPARNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVGFDIANSLVGFSVDS 483
Query: 371 C 371
C
Sbjct: 484 C 484
>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
sylvestris]
Length = 502
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 68/160 (42%), Positives = 93/160 (58%), Gaps = 11/160 (6%)
Query: 80 SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
+Q+ + NY++ + +GTP + + DTGSDL WTQC+PC S CY Q P+FDP
Sbjct: 143 AQSGLPLGTGNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKS-CYAQQQPIFDPSA 201
Query: 140 SSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
S TY ++ C+S+ C+ L N CS NC Y + YGD SF+ G A +T+TL
Sbjct: 202 SKTYSNISCTSTACSGLKSATGNSPGCSSSNCVYGIQYGDSSFTVGFFAKDTLTL----T 257
Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
Q G FGCG NN GLF KT G++GLG +S++ Q
Sbjct: 258 QNDVFDGFMFGCGQNNRGLFG-KTAGLIGLGRDPLSIVQQ 296
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/125 (28%), Positives = 58/125 (46%), Gaps = 5/125 (4%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF 310
+IDSGT +T LP +L S + P A L+ CY ++ + +P+++ +F
Sbjct: 378 IIDSGTVITRLPSTVYGSLKSTFKQFMSKYPTAPALSLLDTCYDLSNYTSISIPKISFNF 437
Query: 311 RG-ADVKLSRSNFFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
G A+V L + + VC F G +++ I+GNI Q V YD+ + F
Sbjct: 438 NGNANVDLEPNGILITNGASQVCLAFAGNGDDDTIGIFGNIQQQTLEVVYDVAGGQLGFG 497
Query: 368 PTDCT 372
C+
Sbjct: 498 YKGCS 502
>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
Length = 443
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 117/425 (27%), Positives = 174/425 (40%), Gaps = 88/425 (20%)
Query: 33 LIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP------ 86
++HR P SP + P D L R++ ++ + ++ Q +P
Sbjct: 22 VMHRHGPCSPLQTPDDAPSDA--DLLEHDQARVDSIHRMIANETAVVGQDVSLPAERGIS 79
Query: 87 -NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKS 145
NY++ + +GTP + V DTGSDL W QC PC CY Q PLF P SST+ +
Sbjct: 80 VGTGNYVVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYHQQDPLFAPSSSSTFSA 139
Query: 146 LPCSSSQCASLNQKSCSGV----NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA--- 198
+ C +C Q SCS C Y V YGD S + G+L +T+TLG+T +
Sbjct: 140 VRCGEPECPRARQ-SCSSSPGDDRCPYEVVYGDKSRTVGHLGNDTLTLGTTPSTNASENN 198
Query: 199 ---LPGITFGCGTNN-------GGLF---------NSKTTGIVGLG------------GG 227
LPG FGCG NN GLF +S+ G G G G
Sbjct: 199 SNKLPGFVFGCGENNTGLFGKADGLFGLGRGKVSLSSQAAGKYGEGFSYCLPSSSSNAHG 258
Query: 228 DISL--------------------------ISQMRTTIAG-----NQRLGVSTPDIVIDS 256
+SL + + +AG + R + +++DS
Sbjct: 259 YLSLGTPAPAPAHARFTPMLNRSNTPSFYYVKLVGIRVAGRAIKVSSRPALWPAGLIVDS 318
Query: 257 GTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGS-LELCYSF----NSLSQVPEVTIHF 310
GT +T L P+ Y++ + +S+M + P S L+ CY F N+ +P V + F
Sbjct: 319 GTVITRLAPRAYSALRTAFLSAMGKYGYKRAPRLSILDTCYDFTAHANATVSIPAVALVF 378
Query: 311 RGA---DVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
G V S + KV++ + G S I GN Q V YD+ +Q + F
Sbjct: 379 AGGATISVDFSGVLYVAKVAQACLAFAPNGNGRSAGILGNTQQRTVAVVYDVGRQKIGFA 438
Query: 368 PTDCT 372
C+
Sbjct: 439 AKGCS 443
>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
Length = 503
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 68/155 (43%), Positives = 91/155 (58%), Gaps = 7/155 (4%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
N NY++ I +GTP V DTGSD W QC+PC + CY Q PLF P S+TY ++
Sbjct: 161 NTGNYVVPIRLGTPAARFTVVFDTGSDTTWVQCQPCV-AYCYQQKEPLFTPTKSATYANI 219
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C+SS C+ L+ + CSG +C Y+V YGDGS++ G A +T+TLG T + FGC
Sbjct: 220 SCTSSYCSDLDTRGCSGGHCLYAVQYGDGSYTVGFYAQDTLTLGYDT-----VKDFRFGC 274
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
G N GLF K G++GLG G S+ Q +G
Sbjct: 275 GEKNRGLFG-KAAGLMGLGRGKTSVPVQAYDKYSG 308
Score = 42.4 bits (98), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 34/127 (26%), Positives = 52/127 (40%), Gaps = 8/127 (6%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEA--QPVADPTGSLELCYSFNSLS---QVPEVT 307
++DSGT +T LP L S + +E A L+ CY +P V+
Sbjct: 377 LVDSGTVITRLPPSAYEPLRSAFAKGMEGLGYKTAPAFSILDTCYDLTGYQGSIALPAVS 436
Query: 308 IHFRGA---DVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 364
+ F+G DV S + VS+ + + I GN Q + V YD+ ++ V
Sbjct: 437 LVFQGGACLDVDASGILYVADVSQACLAFAANDDDTDMTIVGNTQQKTYSVLYDLGKKVV 496
Query: 365 SFKPTDC 371
F P C
Sbjct: 497 GFAPGAC 503
>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 69/166 (41%), Positives = 94/166 (56%), Gaps = 11/166 (6%)
Query: 53 RLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNA----NYLIRISIGTPPTERLAVA 108
RL L R N H ++++ + A Q ++ + Y +R+ IG PP++ V
Sbjct: 107 RLDLVLKRVSNSDLHPAESNAEFEANALQGPVVSGTSQGSGEYFLRVGIGKPPSQAYVVL 166
Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQY 168
DTGSD+ W QC PC S+CY Q P+FDP S++Y + C + QC SL+ C C Y
Sbjct: 167 DTGSDVSWIQCAPC--SECYQQSDPIFDPVSSNSYSPIRCDAPQCKSLDLSECRNGTCLY 224
Query: 169 SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLF 214
VSYGDGS++ G ATETVTLG+ + VA+ GCG NN GLF
Sbjct: 225 EVSYGDGSYTVGEFATETVTLGTAAVENVAI-----GCGHNNEGLF 265
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/121 (31%), Positives = 52/121 (42%), Gaps = 4/121 (3%)
Query: 255 DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF-R 311
DSGT +T L L + P A+ + CY +S QVP V+ HF
Sbjct: 364 DSGTAVTRLRSEVYDALRDAFVKGAKGIPKANGVSLFDTCYDLSSRESVQVPTVSFHFPE 423
Query: 312 GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
G ++ L N+ + V S C F T+S+ I GN+ Q VG+DI V F
Sbjct: 424 GRELPLPARNYLIPVDSVGTFCFAFAPTTSSLSIMGNVQQQGTRVGFDIANSLVGFSADS 483
Query: 371 C 371
C
Sbjct: 484 C 484
>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
Length = 485
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 79/225 (35%), Positives = 117/225 (52%), Gaps = 35/225 (15%)
Query: 18 VVSPIEAQT---GGFSVELIHRDSPKSPFYNSSETPY-QRLRDALTRSLNRLNHFNQNSS 73
VV P + +T +S+ L+HRD+ K ++E Y +R++ L R R+ N
Sbjct: 45 VVQPAKEETLEIKPWSIPLVHRDAMKGNSNKNNELSYAERMQQRLKRDAARVAAINSRLE 104
Query: 74 IS-------------------SSKASQADIIPN----NANYLIRISIGTPPTERLAVADT 110
++ + Q+ ++ + Y RI +G P ++L V DT
Sbjct: 105 LAVNGIKRSSLKPDSSSSFTMAESDFQSPVVSGMDQGSGEYFSRIGVGAPRRDQLMVLDT 164
Query: 111 GSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCS-GVNCQYS 169
GSD+ W QCEPC S CY Q P+++P +SS+YK + C ++ C L+ CS +C Y
Sbjct: 165 GSDVTWIQCEPC--SDCYQQSDPIYNPALSSSYKLVGCQANLCQQLDVSGCSRNGSCLYQ 222
Query: 170 VSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLF 214
VSYGDGS++ GN ATET+TLG Q VA+ GCG +N GLF
Sbjct: 223 VSYGDGSYTQGNFATETLTLGGAPLQNVAI-----GCGHDNEGLF 262
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 4/124 (3%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
+++DSGT +T L +L + + P D + CY +S VP V H
Sbjct: 362 VIVDSGTAVTRLQTAAYDSLRDAFRAGTKNLPSTDGVSLFDTCYDLSSKESVDVPTVVFH 421
Query: 310 FR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
F G + L N+ V V S C F ++S+ I GNI Q V +D V F
Sbjct: 422 FSGGGSMSLPAKNYLVPVDSMGTFCFAFAPTSSSLSIVGNIQQQGIRVSFDRANNQVGFA 481
Query: 368 PTDC 371
C
Sbjct: 482 VNKC 485
>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
Length = 502
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 68/160 (42%), Positives = 93/160 (58%), Gaps = 11/160 (6%)
Query: 80 SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
+Q+ + NY++ + +GTP + + DTGSDL WTQC+PC S CY Q P+FDP
Sbjct: 143 AQSGLPLGTGNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKS-CYAQQQPIFDPST 201
Query: 140 SSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
S TY ++ C+S+ C+SL N CS NC Y + YGD SF+ G A + +TL
Sbjct: 202 SKTYSNISCTSAACSSLKSATGNSPGCSSSNCVYGIQYGDSSFTIGFFAKDKLTL----T 257
Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
Q G FGCG NN GLF KT G++GLG +S++ Q
Sbjct: 258 QNDVFDGFMFGCGQNNKGLFG-KTAGLIGLGRDPLSIVQQ 296
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 36/125 (28%), Positives = 59/125 (47%), Gaps = 5/125 (4%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF 310
+IDSGT +T LP +L S + P A L+ CY ++ + +P+++ +F
Sbjct: 378 IIDSGTVITRLPSTAYGSLKSAFKQFMSKYPTAPALSLLDTCYDLSNYTSISIPKISFNF 437
Query: 311 RG-ADVKLSRSNFFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
G A+V+L + + VC F G +S+ I+GNI Q V YD+ + F
Sbjct: 438 NGNANVELDPNGILITNGASQVCLAFAGNGDDDSIGIFGNIQQQTLEVVYDVAGGQLGFG 497
Query: 368 PTDCT 372
C+
Sbjct: 498 YKGCS 502
>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
Length = 483
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 72/200 (36%), Positives = 104/200 (52%), Gaps = 22/200 (11%)
Query: 27 GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD--- 83
G S+ELIHR+S T Q L + L R R+ + ++ K +A
Sbjct: 54 GTLSLELIHRNSLLREAKEKLHTHEQLLLETLQRDEQRVRWIESKAQLAGKKKDEASSTD 113
Query: 84 --------IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
++ + Y +R+ +GTP V DTGSDL W QC+PC CY Q P+F
Sbjct: 114 LNGPVTSGLLYGSGEYFVRLGVGTPARSLFMVVDTGSDLPWLQCQPC--KSCYKQADPIF 171
Query: 136 DPKMSSTYKSLPCSSSQCASLNQKSCSG-----VNCQYSVSYGDGSFSNGNLATETVTLG 190
DP+ SS+++ +PC S C +L SCSG C Y V+YGDGSFS G+ +++ TLG
Sbjct: 172 DPRNSSSFQRIPCLSPLCKALEIHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLG 231
Query: 191 STTGQAVALPGITFGCGTNN 210
T +A++ + FGCG +N
Sbjct: 232 -TGSKAMS---VAFGCGFDN 247
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 64/124 (51%), Gaps = 4/124 (3%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
++IDSGT++T P + + + P A + CY+F+ + VP + +H
Sbjct: 359 VIIDSGTSVTRFPTSVYATIRDAFRNATTNLPSAPRYSLFDTCYNFSGKASVDVPALVLH 418
Query: 310 FR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
F GAD++L +N+ + + + C F + + I GNI Q +F +G+D+++ ++F
Sbjct: 419 FENGADLQLPPTNYLIPINTAGSFCLAFAPTSMELGIIGNIQQQSFRIGFDLQKSHLAFA 478
Query: 368 PTDC 371
P C
Sbjct: 479 PQQC 482
>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
Length = 471
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 66/197 (33%), Positives = 101/197 (51%), Gaps = 18/197 (9%)
Query: 29 FSVELIHRDS-PKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQ------ 81
+++ L+HRD P + N + R+R R L + ++SS +
Sbjct: 59 YTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVVVASSDSRYEVNDFG 118
Query: 82 ADIIPN----NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
+D++ + Y +RI +G+PP ++ V D+GSD++W QC+PC CY Q P+FDP
Sbjct: 119 SDVVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPC--KLCYKQSDPVFDP 176
Query: 138 KMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
S +Y + C SS C + C C+Y V YGDGS++ G LA ET+T T + V
Sbjct: 177 AKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFAKTVVRNV 236
Query: 198 ALPGITFGCGTNNGGLF 214
A+ GCG N G+F
Sbjct: 237 AM-----GCGHRNRGMF 248
Score = 47.8 bits (112), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 50/124 (40%), Gaps = 4/124 (3%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIH 309
+V+D+GT +T LP G + S P A + CY + +VP V+ +
Sbjct: 348 VVMDTGTAVTRLPTGAYAAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFY 407
Query: 310 F-RGADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
F G + L NF + V + C F + I GNI Q V +D V F
Sbjct: 408 FTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFG 467
Query: 368 PTDC 371
P C
Sbjct: 468 PNVC 471
>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
Length = 446
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 79/213 (37%), Positives = 113/213 (53%), Gaps = 21/213 (9%)
Query: 31 VELIHRDS--PKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP-N 87
V + HRD+ P P QRL R + ++ + S S IP
Sbjct: 29 VPVFHRDALFPPPPGAKRGSLLRQRLAADAARYASLVDATGRLHSPVFSG------IPFE 82
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y + +GTP T+ + V DTGSDL+W QC PC +CY Q +FDP+ SSTY+ +P
Sbjct: 83 SGEYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPC--RRCYAQRGQVFDPRRSSTYRRVP 140
Query: 148 CSSSQCASLNQKSC-----SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
CSS QC +L C +G C+Y V+YGDGS S G LAT+ + + T + +
Sbjct: 141 CSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGELATDKLAFANDT----YVNNV 196
Query: 203 TFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
T GCG +N GLF+S G++G+ G IS+ +Q+
Sbjct: 197 TLGCGRDNEGLFDS-AAGLLGVARGKISISTQV 228
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 32/134 (23%), Positives = 59/134 (44%), Gaps = 13/134 (9%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS---LELCYSFNS--LSQVPEV 306
+V+DSGT ++ + + L + A + G + CY + P +
Sbjct: 312 VVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRGRPAASAPLI 371
Query: 307 TIHFRG-ADVKLSRSNFFV-------KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYD 358
+HF G AD+ L N+F+ + + C F+ + + + GN+ Q F V +D
Sbjct: 372 VLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDGLSVIGNVQQQGFRVVFD 431
Query: 359 IEQQTVSFKPTDCT 372
+E++ + F P CT
Sbjct: 432 VEKERIGFAPKGCT 445
>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 473
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 82/223 (36%), Positives = 115/223 (51%), Gaps = 28/223 (12%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQN-----SSISSSKASQADI 84
S+E+IHR P +++ T + L + +R++ + S+ + S+A
Sbjct: 62 SLEVIHRHGPCGDEVSNAPTA----AEMLVKDQSRVDFIHSKIAGELESVDRLRGSKATK 117
Query: 85 IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
IP + NY++ + +GTP + DTGSDL WTQC+PC CY Q P+F P
Sbjct: 118 IPAKSGATIGSGNYIVSVGLGTPKKYLSLIFDTGSDLTWTQCQPC-ARYCYNQKDPVFVP 176
Query: 138 KMSSTYKSLPCSSSQCASL-----NQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGS 191
S+TY ++ CSS C+ L NQ CS C Y + YGD SFS G A ET+TL S
Sbjct: 177 SQSTTYSNISCSSPDCSQLESGTGNQPGCSAARACIYGIQYGDQSFSVGYFAKETLTLTS 236
Query: 192 TTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
T + FGCG NN GLF S G++GLG IS++ Q
Sbjct: 237 TD----VIENFLFGCGQNNRGLFGS-AAGLIGLGQDKISIVKQ 274
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 55/129 (42%), Gaps = 5/129 (3%)
Query: 248 STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPE 305
ST +IDSGT +T LP S L S + P A L+ CY + S Q+P+
Sbjct: 345 STSGAIIDSGTVITRLPPDAYSALKSAFEKGMAKYPKAPELSILDTCYDLSKYSTIQIPK 404
Query: 306 VTIHFRGA-DVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQ 362
V F+G ++ L S VC F G + +V I GN+ Q V YD+
Sbjct: 405 VGFVFKGGEELDLDGIGIMYGASTSQVCLAFAGNQDPSTVAIIGNVQQKTLQVVYDVGGG 464
Query: 363 TVSFKPTDC 371
+ F C
Sbjct: 465 KIGFGYNGC 473
>gi|388498308|gb|AFK37220.1| unknown [Lotus japonicus]
Length = 363
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 82/230 (35%), Positives = 120/230 (52%), Gaps = 23/230 (10%)
Query: 25 QTGGFSVELIHRD--SPKSPFYNSSETPYQRLRDALTRSL-NRLNHFNQNSSISSSKAS- 80
+ G +E+ R S K ++ L D RS+ NRL + S+ S+
Sbjct: 73 EKGAIMLEMKDRSYCSKKKVNWHRKLHNQLTLDDLHVRSMQNRLRKMVSSHSVEVSQIQI 132
Query: 81 --QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
+ + NY++ + +G + + DTGSDL W QCEPC CY Q P+F P
Sbjct: 133 PLASGVNFQTLNYIVTMELGG--QDMTVIIDTGSDLTWVQCEPC--MSCYNQQGPVFKPS 188
Query: 139 MSSTYKSLPCSSSQCASL-----NQKSCSG--VNCQYSVSYGDGSFSNGNLATETVTLGS 191
SS+Y+S+PC+SS C SL N +C NC Y+V+YGDGS++NG L E ++ G
Sbjct: 189 TSSSYQSIPCNSSTCQSLQLTTGNAGACESNPSNCSYAVNYGDGSYTNGELGAEHLSFG- 247
Query: 192 TTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
+++ FGCG NN GLF +G++GLG ++SLISQ +T G
Sbjct: 248 ----GISVSNFVFGCGKNNKGLFGG-VSGLMGLGRSNLSLISQTNSTFGG 292
>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
Short=AtASPG2; Flags: Precursor
gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
thaliana]
gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 470
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 69/196 (35%), Positives = 105/196 (53%), Gaps = 17/196 (8%)
Query: 29 FSVELIHRDS-PKSPFYNSSETPYQRLR---DALTRSLNRLNHFNQNSSISSSKASQ--A 82
+++ L+HRD P + N + R+R D ++ L R++ SS S + + +
Sbjct: 59 YTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPSSDSRYEVNDFGS 118
Query: 83 DIIPN----NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
DI+ + Y +RI +G+PP ++ V D+GSD++W QC+PC CY Q P+FDP
Sbjct: 119 DIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPC--KLCYKQSDPVFDPA 176
Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
S +Y + C SS C + C C+Y V YGDGS++ G LA ET+T T + VA
Sbjct: 177 KSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFAKTVVRNVA 236
Query: 199 LPGITFGCGTNNGGLF 214
+ GCG N G+F
Sbjct: 237 M-----GCGHRNRGMF 247
Score = 43.9 bits (102), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 48/124 (38%), Gaps = 4/124 (3%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
+V+D+GT +T LP S P A + CY + +VP V+ +
Sbjct: 347 VVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFY 406
Query: 310 F-RGADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
F G + L NF + V + C F + I GNI Q V +D V F
Sbjct: 407 FTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFG 466
Query: 368 PTDC 371
P C
Sbjct: 467 PNVC 470
>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 443
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 82/212 (38%), Positives = 109/212 (51%), Gaps = 15/212 (7%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNR---LNHFNQNSSISSSKASQADI 84
GF L H D+ + T Q L A+ RS R L ++ + ++ +
Sbjct: 29 GFQATLTHIDA------GAGYTEAQLLSRAVRRSKARVAALQSLATTTAADAITVARILV 82
Query: 85 IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
+ + YL+ + IGTPP A+ DTGSDLIWTQC PC C Q +P FDP S +Y
Sbjct: 83 LASEGEYLMSMGIGTPPRYYSAILDTGSDLIWTQCAPC--MLCVDQPTPFFDPAQSPSYA 140
Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
LPC+S C +L C C Y YGD + + G L+ ET T G T V +P I F
Sbjct: 141 KLPCNSPMCNALYYPLCYRNVCVYQYFYGDSANTAGVLSNETFTFG-TNDTRVTVPRIAF 199
Query: 205 GCGT-NNGGLFNSKTTGIVGLGGGDISLISQM 235
GCG N G LFN +G+VG G G +SL+SQ+
Sbjct: 200 GCGNLNAGSLFNG--SGMVGFGRGPLSLVSQL 229
Score = 47.8 bits (112), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 32/129 (24%), Positives = 59/129 (45%), Gaps = 12/129 (9%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMI-----EAQPVADPTGSLELCYSF----NSLSQ 302
++IDSG+T+T+L + + + + A +AD L+ C+ + +
Sbjct: 315 VIIDSGSTITYLARAAYDMVHQAFADQVGLPLTNATSLAD---VLDTCFVWPPPPRKIVT 371
Query: 303 VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 362
+PE+ HF GA+++L N+ + + + ++ I G+ NF V YD E
Sbjct: 372 MPELAFHFEGANMELPLENYMLIDGDTGNLCLAIAASDDGSIIGSFQHQNFHVLYDNENS 431
Query: 363 TVSFKPTDC 371
+SF P C
Sbjct: 432 LLSFTPATC 440
>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
Length = 519
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 106/345 (30%), Positives = 145/345 (42%), Gaps = 61/345 (17%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
NY++ + +GTP + V DTGSD W QC+PC CY Q LFDP SSTY ++
Sbjct: 176 GTGNYVVTVGLGTPVSRYTVVFDTGSDTTWVQCQPCV-VVCYEQREKLFDPARSSTYANV 234
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT------------- 193
C++ C+ LN CSG +C Y V YGDGS+S G A +T+TL S
Sbjct: 235 SCAAPACSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCGERN 294
Query: 194 ----GQAVALPGITFGCGT-------NNGGLFN----SKTTGIVGLG------------- 225
G+A L G+ G + GG+F +++TG L
Sbjct: 295 EGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTGTGYLDFGAGSLAAASARL 354
Query: 226 -------GGDISLISQMRTTIAGNQRLGV-----STPDIVIDSGTTLTFLPQGYNSNL-- 271
G M G Q L + +T ++DSGT +T LP S+L
Sbjct: 355 TTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFATAGTIVDSGTVITRLPPAAYSSLRY 414
Query: 272 LSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRG-ADVKLSRSNFFVKVSE 328
+ A L+ CY F +SQV P V++ F+G A + + S S
Sbjct: 415 AFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDASGIMYAASA 474
Query: 329 DIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
VC F + V I GN F V YDI ++ V F P C
Sbjct: 475 SQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYPGAC 519
>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 113/389 (29%), Positives = 172/389 (44%), Gaps = 89/389 (22%)
Query: 61 SLNRLNHFNQNSS--ISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
S+ RL + ++ I + + IIP +L+ ISIG+PP +L DT SDL+W Q
Sbjct: 55 SVERLEYLKAKATGDIIAHLSPNVPIIPQA--FLVNISIGSPPVTQLLHMDTASDLLWLQ 112
Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCA----SLNQKSCSGVNCQYSVSYGD 174
C PC CY Q P+FDP S T+++ C +SQ + N K+ S C+YS+ Y D
Sbjct: 113 CRPC--INCYAQSLPIFDPSRSYTHRNESCRTSQYSMPSLRFNAKTRS---CEYSMRYMD 167
Query: 175 GSFSNGNLATETVTLGSTTGQ--AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLI 232
G+ S G LA E + + + + AL + FGCG +N G TGI+GLG G+ SL+
Sbjct: 168 GTGSKGILAKEMLMFNTIYDESSSAALHDVVFGCGHDNYGE-PLVGTGILGLGYGEFSLV 226
Query: 233 SQMRTTIA---------------------GNQRLGVSTP--------------------- 250
+ T + G LG +TP
Sbjct: 227 HRFGTKFSYCFGSLDDPSYPHNVLVLGDDGANILGDTTPLEIYNGFYYVTIEAISVDGII 286
Query: 251 -----------------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 293
+ID+G +LT L + L + + E + A ++
Sbjct: 287 LPIDPWVFNRNHQTGLGGTIIDTGNSLTSLVEEAYKPLKNKIEDYFEGRFTAADVNQDDM 346
Query: 294 ----CYSFN-----SLSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVC-SVFKGITNSV 342
CY+ N S P VT HF GA++ L + F+K+S ++ C +V G NS+
Sbjct: 347 FKVECYNGNLERDLVESGFPIVTFHFSDGAELSLDVKSVFMKLSPNVFCLAVTPGNMNSI 406
Query: 343 PIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
G Q ++ +GYD+E + +SF+ DC
Sbjct: 407 ---GATAQQSYNIGYDLEAKKISFERIDC 432
>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 472
Score = 128 bits (322), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 91/246 (36%), Positives = 128/246 (52%), Gaps = 34/246 (13%)
Query: 11 LFFLCFYVVSPIEAQTGGFSVEL----IHRDSPKSPFYNSSETPYQRLRDALTRSLNR-- 64
L L F VV A +G SV + IH D T Q +RDAL R ++R
Sbjct: 27 LAVLVFLVVCATLA-SGAASVRVGLTRIHSDP--------DTTAPQFVRDALRRDMHRQR 77
Query: 65 -----------LNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSD 113
L + +S + S ++ D+ PN YL+ ++IGTPP AVADTGSD
Sbjct: 78 SRSFGRDRDRELAESDGRTSTTVSARTRKDL-PNGGEYLMTLAIGTPPLPYAAVADTGSD 136
Query: 114 LIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS--SQCASLNQKSCSGVN--CQYS 169
LIWTQC PC +QC+ Q +PL++P S+T+ LPC+S S CA + C Y
Sbjct: 137 LIWTQCAPC-GTQCFEQPAPLYNPASSTTFSVLPCNSSLSMCAGALAGAAPPPGCACMYY 195
Query: 170 VSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDI 229
+YG G ++ G +ET T GS+ +PG+ FGC + +N + G+VGLG G +
Sbjct: 196 QTYGTG-WTAGVQGSETFTFGSSAADQARVPGVAFGCSNASSSDWNG-SAGLVGLGRGSL 253
Query: 230 SLISQM 235
SL+SQ+
Sbjct: 254 SLVSQL 259
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 39/130 (30%), Positives = 70/130 (53%), Gaps = 10/130 (7%)
Query: 252 IVIDSGTTLTFLPQG-YNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ-----V 303
++IDSGTT+T L Y +V S ++ P D + S L+LC++ + + +
Sbjct: 342 LIIDSGTTITSLANAAYQQVRAAVKSQLVTTLPTVDGSDSTGLDLCFALPAPTSAPPAVL 401
Query: 304 PEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQ 362
P +T+HF GAD+ L ++ + S + C + T+ ++ +GN Q N + YD+ ++
Sbjct: 402 PSMTLHFDGADMVLPADSYMISGS-GVWCLAMRNQTDGAMSTFGNYQQQNMHILYDVREE 460
Query: 363 TVSFKPTDCT 372
T+SF P C+
Sbjct: 461 TLSFAPAKCS 470
>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
Length = 452
Score = 128 bits (321), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 110/407 (27%), Positives = 174/407 (42%), Gaps = 93/407 (22%)
Query: 54 LRDALTRSLNRLNHFN--QNSSISSSKASQ---ADIIP----NNANYLIRISIGTPPTER 104
+R A+ RS R + +N + S K Q A ++P + Y++ ++IGTPP
Sbjct: 50 IRRAMRRSKARAAALSAVRNRARFSGKNEQQTPAGVLPVRPSGDLEYVVDLAIGTPPQPV 109
Query: 105 LAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV 164
A+ DTGSDLIWTQC PC + C Q PLF P S++Y+ + C+ + C+ + SC
Sbjct: 110 SALLDTGSDLIWTQCAPC--ASCLSQPDPLFAPGQSASYEPMRCAGTLCSDILHHSCERP 167
Query: 165 N-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT--FGCGTNNGG--------- 212
+ C Y +YGDG+ + G ATE T S+ G + + FGCG+ N G
Sbjct: 168 DTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVPLGFGCGSVNVGSLNNGSGIV 227
Query: 213 ---------------------------------LFNSKTTGIVGLGGGDISLISQMRT-- 237
LF S + G+ G G + +++
Sbjct: 228 GFGRNPLSLVSQLSIRRFSYCLTSYASRRQSTLLFGSLSDGVYGDATGRVQTTPLLQSPQ 287
Query: 238 -----------TIAGNQRLGVS------TPD----IVIDSGTTLTFLPQGYNSNLLSVMS 276
G +RL + PD +++DSGT LT LP + ++
Sbjct: 288 NPTFYYVHFTGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALTLLPAAVLAEVVRAFR 347
Query: 277 SMIEAQPVADPTGSLE--LCY---------SFNSLSQVPEVTIHFRGADVKLSRSNFFV- 324
+ P A+ G+ E +C+ S S VP + +HF+GAD+ L R N+ +
Sbjct: 348 QQLRL-PFAN-GGNPEDGVCFLVPAAWRRSSSTSQMPVPRMVLHFQGADLDLPRRNYVLD 405
Query: 325 KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+C + + GN++Q + V YD+E +T+S P C
Sbjct: 406 DHRRGRLCLLLADSGDDGSTIGNLVQQDMRVLYDLEAETLSIAPARC 452
>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 456
Score = 128 bits (321), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 124/446 (27%), Positives = 186/446 (41%), Gaps = 99/446 (22%)
Query: 14 LCFYVVSPIEAQT------GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNH 67
L FY+ + I + T + +LIHR+S P Y+ +ET R + T S+ R +
Sbjct: 17 LAFYLSTAIISSTLITTKPSRLATKLIHRNSYLHPLYDQNETVEDRSKREQTSSIERFDF 76
Query: 68 FNQNSSISSSKA----SQADIIPNN--ANYLIRISIGTPPTERLAVADTGSDLIWTQCEP 121
S I K+ +++ +IP N + +L+ +SIG+PP +L V DTGS L+W QC P
Sbjct: 77 LE--SKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVVDTGSSLLWVQCLP 134
Query: 122 CPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNG 180
C C+ Q + FDP S ++K+L C +N C+ N +Y + Y G S G
Sbjct: 135 CI--NCFQQSTSWFDPLKSVSFKTLGCGFPGYNYINGYKCNRFNQAEYKLRYLGGDSSQG 192
Query: 181 NLATETVTLGSTTGQAVALPGITFGCG-----TNNGGLFNSKTTGIVGLGG--------- 226
LA E++ + + ITFGCG TNN +N G+ GLG
Sbjct: 193 ILAKESLLFETLDEGKIKKSNITFGCGHMNIKTNNDDAYN----GVFGLGAYPHITMATQ 248
Query: 227 ---------GDIS--------LISQMRTTIAGN---------------QRLGVSTPDIVI 254
GDI+ L+ + I G+ Q + V + + I
Sbjct: 249 LGNKFSYCIGDINNPLYTHNHLVLGQGSYIEGDSTPLQIHFGHYYVTLQSISVGSKTLKI 308
Query: 255 -----------------DSGTTLTFLPQG----YNSNLLSVMSSMIEAQPVADPTGSLEL 293
DSG T T L G ++ +M ++E P L
Sbjct: 309 DPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQRKFEG--L 366
Query: 294 CYS---FNSLSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNS----VPIY 345
C+ L P VT HF GAD+ L + F + D C +NS + +
Sbjct: 367 CFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILP-SNSELLNLSVI 425
Query: 346 GNIMQTNFLVGYDIEQQTVSFKPTDC 371
G + Q N+ VG+D+EQ V F+ DC
Sbjct: 426 GILAQQNYNVGFDLEQMKVFFRRIDC 451
>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 469
Score = 128 bits (321), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 86/244 (35%), Positives = 123/244 (50%), Gaps = 39/244 (15%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQ--------RLRDALTRSLNR---------LNHFNQ 70
G + L H SP SP S+ P+ R+ +R N L H ++
Sbjct: 42 GLHLTLHHPQSPCSPAPLPSDLPFSAVVTHDDARIAHLASRLANNHPTSPSSSSLLHGHR 101
Query: 71 NSSISSSKASQAD-----IIPNNA----NYLIRISIGTPPTERLAVADTGSDLIWTQCEP 121
SQA + P + NY+ R+ +GTP T + V DTGS L W QC P
Sbjct: 102 KKKAGGVGGSQASSSSVPLTPGASVAVGNYVTRLGLGTPATSYVMVVDTGSSLTWLQCSP 161
Query: 122 CPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-----ASLNQKSCSGVN-CQYSVSYGDG 175
C S C+ Q P+FDP+ S TY ++ CSSS+C A+LN +CS N C Y SYGD
Sbjct: 162 CSVS-CHRQAGPVFDPRASGTYAAVQCSSSECGELQAATLNPSACSVSNVCIYQASYGDS 220
Query: 176 SFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
S+S G L+ +TV+ GS + PG +GCG +N GLF ++ G++GL +SL+ Q+
Sbjct: 221 SYSVGYLSKDTVSFGSGS-----FPGFYYGCGQDNEGLFG-RSAGLIGLAKNKLSLLYQL 274
Query: 236 RTTI 239
++
Sbjct: 275 APSL 278
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 38/123 (30%), Positives = 58/123 (47%), Gaps = 4/123 (3%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS-LELCYSFNSLS-QVPEVTIHF 310
+IDSGT +T LP + L +++ + + PT S L+ C+ ++ +VP V + F
Sbjct: 348 IIDSGTVITRLPPNVYTALSRAVAAAMASAAPRAPTYSILDTCFRGSAAGLRVPRVDMAF 407
Query: 311 RG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 369
G A + LS N + V + C F T I GN Q F V YD+ Q + F
Sbjct: 408 AGGATLALSPGNVLIDVDDSTTCLAFA-PTGGTAIIGNTQQQTFSVVYDVAQSRIGFAAG 466
Query: 370 DCT 372
C+
Sbjct: 467 GCS 469
>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 128 bits (321), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 97/348 (27%), Positives = 147/348 (42%), Gaps = 69/348 (19%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y +RI +G+PP + V D+GSD++W QC+PC +QCY Q PLFDP S+++ +
Sbjct: 40 SGEYFVRIGLGSPPRSQYMVIDSGSDIVWVQCKPC--TQCYHQTDPLFDPADSASFMGVS 97
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST--------------- 192
CSS+ C + C+ C+Y VSYGDGS++ G LA ET+T G T
Sbjct: 98 CSSAVCDRVENAGCNSGRCRYEVSYGDGSYTKGTLALETLTFGRTVVRNVAIGCGHSNRG 157
Query: 193 -------------------------TGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGG 227
TG A + ++ G TN F S+ + G
Sbjct: 158 MFVGAAGLLGLGGGSMSFMGQLSGQTGNAFSYCLVSRGTNTNGFLEFGSEAMPV---GAA 214
Query: 228 DISLISQMRTTI----------AGNQRLGVS----------TPDIVIDSGTTLTFLPQGY 267
I L+ R G+ R+ VS + +V+D+GT +T P
Sbjct: 215 WIPLVRNPRAPSFYYIRLLGLGVGDTRVPVSEDVFQLNELGSGGVVMDTGTAVTRFPTVA 274
Query: 268 NSNLLSVMSSMIEAQPVADPTGSLELCYS-FNSLS-QVPEVTIHFRGADV-KLSRSNFFV 324
+ + P A + CY+ F LS +VP V+ +F G + + +NF +
Sbjct: 275 YEAFRNAFIEQTQNLPRASGVSIFDTCYNLFGFLSVRVPTVSFYFSGGPILTIPANNFLI 334
Query: 325 KVSE-DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
V + C F + + I GNI Q + D + V F P C
Sbjct: 335 PVDDAGTFCFAFAPSPSGLSILGNIQQEGIQISVDEANEFVGFGPNIC 382
>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 469
Score = 127 bits (320), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 88/243 (36%), Positives = 125/243 (51%), Gaps = 30/243 (12%)
Query: 11 LFFLCFYVVSPIEAQTGGFSVEL----IHRDSPKSPFYNSSETPYQRLRDALTRSLNRLN 66
L L F VV A +G SV + IH D T Q +RDAL R ++R
Sbjct: 27 LAVLVFLVVCATLA-SGAASVRVGLTRIHSDP--------DTTAPQFVRDALRRDMHRQR 77
Query: 67 ----------HFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIW 116
++ ++ A +PN YL+ ++IGTPP AVADTGSDLIW
Sbjct: 78 SRSFGRDRDRELAESDGRTTVSARTRKDLPNGGEYLMTLAIGTPPLPYAAVADTGSDLIW 137
Query: 117 TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS--SQCASLNQKSCSGVN--CQYSVSY 172
TQC PC +QC+ Q +PL++P S+T+ LPC+S S CA + C Y+ +Y
Sbjct: 138 TQCAPC-GTQCFEQPAPLYNPASSTTFSVLPCNSSLSMCAGALAGAAPPPGCACMYNQTY 196
Query: 173 GDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLI 232
G G ++ G +ET T GS+ +PG+ FGC + +N + G+VGLG G +SL+
Sbjct: 197 GTG-WTAGVQGSETFTFGSSAADQARVPGVAFGCSNASSSDWNG-SAGLVGLGRGSLSLV 254
Query: 233 SQM 235
SQ+
Sbjct: 255 SQL 257
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 37/129 (28%), Positives = 70/129 (54%), Gaps = 9/129 (6%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ-----VP 304
++IDSGTT+T L + + + S++ P D + S L+LC++ + + +P
Sbjct: 340 LIIDSGTTITSLANAAYQQVRAAVKSLVTTLPTVDGSDSTGLDLCFALPAPTSAPPAVLP 399
Query: 305 EVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQT 363
+T+HF GAD+ L ++ + S + C + T+ ++ +GN Q N + YD+ ++T
Sbjct: 400 SMTLHFDGADMVLPADSYMISGS-GVWCLAMRNQTDGAMSTFGNYQQQNMHILYDVREET 458
Query: 364 VSFKPTDCT 372
+SF P C+
Sbjct: 459 LSFAPAKCS 467
>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 127 bits (320), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 117/426 (27%), Positives = 176/426 (41%), Gaps = 95/426 (22%)
Query: 29 FSVELIHRDSPKSPFYNS---SETPYQRLRDALTRSLNRLNHFNQN-SSISSSKASQADI 84
S+E++HR P N ++ P + R NR++ + SS QA
Sbjct: 48 LSLEVVHRHGPCIGIVNQEKGADAPSNM--EIFLRDQNRVDSIHARLSSRGMFPEKQATT 105
Query: 85 IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
+P +Y++ + +GTP E + DTGSD+ WTQCEPC + CY Q P +P
Sbjct: 106 LPVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKT-CYKQKEPRLNP 164
Query: 138 KMSSTYKSLPCSSSQCASLN-----QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST 192
S++YK++ CSS+ C + +SCS C Y V YGDGS+S G ATET+TL S+
Sbjct: 165 STSTSYKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTLSSS 224
Query: 193 TGQAVALPGITFGCGTNNG----------------------------GLF------NSKT 218
FGCG N LF +S +
Sbjct: 225 N----VFKNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCLPASSSS 280
Query: 219 TGIVGLGGG----------------------DISLISQMRTTIAGNQRLGVS----TPDI 252
G + LGG DI+ +S G ++L + +
Sbjct: 281 KGYLSLGGQVSKSVKFTPLSADFDSTPFYGLDITGLS------VGGRKLSIDESAFSAGT 334
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF 310
VIDSGT +T L S L S +++ P + CY F+ ++P+V + F
Sbjct: 335 VIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTVRIPKVGVTF 394
Query: 311 RGA-DVKLSRSNFFVKVSE-DIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSF 366
+G ++ + S V+ VC F G + I+GN+ Q + V YD + V F
Sbjct: 395 KGGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGF 454
Query: 367 KPTDCT 372
P C+
Sbjct: 455 APGGCS 460
>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
Length = 472
Score = 127 bits (320), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 117/426 (27%), Positives = 176/426 (41%), Gaps = 95/426 (22%)
Query: 29 FSVELIHRDSPKSPFYNS---SETPYQRLRDALTRSLNRLNHFNQN-SSISSSKASQADI 84
S+E++HR P N ++ P + R NR++ + SS QA
Sbjct: 60 LSLEVVHRHGPCIGIVNQEKGADAPSNM--EIFLRDQNRVDSIHARLSSRGMFPEKQATT 117
Query: 85 IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
+P +Y++ + +GTP E + DTGSD+ WTQCEPC + CY Q P +P
Sbjct: 118 LPVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKT-CYKQKEPRLNP 176
Query: 138 KMSSTYKSLPCSSSQCASLN-----QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST 192
S++YK++ CSS+ C + +SCS C Y V YGDGS+S G ATET+TL S+
Sbjct: 177 STSTSYKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTLSSS 236
Query: 193 TGQAVALPGITFGCGTNNG----------------------------GLF------NSKT 218
FGCG N LF +S +
Sbjct: 237 N----VFKNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCLPASSSS 292
Query: 219 TGIVGLGGG----------------------DISLISQMRTTIAGNQRLGVS----TPDI 252
G + LGG DI+ +S G ++L + +
Sbjct: 293 KGYLSLGGQVSKSVKFTPLSADFDSTPFYGLDITGLS------VGGRKLSIDESAFSAGT 346
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF 310
VIDSGT +T L S L S +++ P + CY F+ ++P+V + F
Sbjct: 347 VIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTVRIPKVGVTF 406
Query: 311 RGA-DVKLSRSNFFVKVSE-DIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSF 366
+G ++ + S V+ VC F G + I+GN+ Q + V YD + V F
Sbjct: 407 KGGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGF 466
Query: 367 KPTDCT 372
P C+
Sbjct: 467 APGGCS 472
>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
Length = 468
Score = 127 bits (320), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 77/226 (34%), Positives = 112/226 (49%), Gaps = 19/226 (8%)
Query: 30 SVELIHRDSPKSPFYNSSETP---YQRLRDALTRS---LNRLNHFNQNSSISSSKASQAD 83
SV L+HR P +P SS+ P RLR RS ++R++ S +
Sbjct: 57 SVPLVHRHGPCAPTQLSSDKPSSFTDRLRRNRARSKYIMSRVSKGMMGDDADVSIPTHLG 116
Query: 84 IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
++ Y++ + +GTP ++ + DTGSDL W QC+PC + CY Q PLFDP SSTY
Sbjct: 117 GSVDSLEYVVTVGLGTPSVSQVLLIDTGSDLSWVQCQPCNSTTCYPQKDPLFDPSKSSTY 176
Query: 144 KSLPCSSSQCASLNQKSCSG--------VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
+PC++ C L G C ++++YGDGS + G + ET+ L
Sbjct: 177 APIPCNTDACRDLTDDGYGGGCASGDGAAQCGFAITYGDGSQTRGVYSNETLALAP---- 232
Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
VA+ FGCG + G N K G++GLGG SL+ Q + G
Sbjct: 233 GVAVKDFRFGCGHDQDGA-NDKYDGLLGLGGAPESLVVQTASVYGG 277
Score = 43.9 bits (102), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 55/123 (44%), Gaps = 6/123 (4%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
++IDSGT +T L + L + + A P+ G L+ CY F+ S V P+V +
Sbjct: 349 MIIDSGTVVTELQHTAYNALQAAFRKAMAAYPLVR-NGELDTCYDFSGYSNVTLPKVALT 407
Query: 310 FRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
F G A + L N + +D + G + I GN+ Q V YD + V F+
Sbjct: 408 FSGGATIDLDVPNGILL--DDCLAFQESGPDDQPGILGNVNQRTLEVLYDAGRGRVGFRA 465
Query: 369 TDC 371
C
Sbjct: 466 AVC 468
>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 489
Score = 127 bits (319), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 70/163 (42%), Positives = 95/163 (58%), Gaps = 21/163 (12%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY++ + +G + DTGSDL W QC+PC CY Q PL+DP +SS+YK++ C+
Sbjct: 137 NYIVTVELGGKNMS--LIVDTGSDLTWVQCQPC--RSCYNQQGPLYDPSVSSSYKTVFCN 192
Query: 150 SSQCASL-----NQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
SS C L N C G N C+Y VSYGDGS++ G+LA+E++ LG T
Sbjct: 193 SSTCQDLVAATGNSGPCGGFNGVVKTTCEYVVSYGDGSYTRGDLASESIVLGDT-----K 247
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
L + FGCG NN GLF +G++GLG +SL+SQ T G
Sbjct: 248 LENLVFGCGRNNKGLFGG-ASGLMGLGRSSVSLVSQTLKTFNG 289
Score = 46.2 bits (108), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 35/127 (27%), Positives = 55/127 (43%), Gaps = 7/127 (5%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIH 309
I+IDSGT +T LP + + P A L+ C++ S +P + +
Sbjct: 359 ILIDSGTVITRLPPSIYKAVKTEFLKQFSGFPSAPGYSILDTCFNLTSYEDISIPTIKMI 418
Query: 310 FRG---ADVKLSRSNFFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTV 364
F G +V ++ +FVK +VC ++ N V I GN Q N V YD Q+ +
Sbjct: 419 FEGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERL 478
Query: 365 SFKPTDC 371
+C
Sbjct: 479 GIAGENC 485
>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
Length = 456
Score = 127 bits (319), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 118/460 (25%), Positives = 189/460 (41%), Gaps = 106/460 (23%)
Query: 8 VFILFFLCFYVVSPIEAQT----GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLN 63
V +L Y P+ + V L H D+ K + SE +R A+ RS
Sbjct: 7 VLVLAIASLYYACPVASAAFVGDDDVRVALKHVDAGKQ--LSRSEL----IRRAMQRSKA 60
Query: 64 RLNHFN--QNSSISSSKASQAD-----------IIPN-NANYLIRISIGTPPTERLAVAD 109
R + +N + S+ + + D + P+ + Y++ ++IGTPP A+ D
Sbjct: 61 RAAALSAVRNRAASARFSGKNDDQRTTPPTGVSVRPSGDLEYVVDLAIGTPPQPVSALLD 120
Query: 110 TGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQY 168
TGSDLIWTQC PC + C Q PLF P S++Y+ + C+ C+ + C + C Y
Sbjct: 121 TGSDLIWTQCAPC--ASCLAQPDPLFAPGESASYEPMRCAGQLCSDILHHGCEMPDTCTY 178
Query: 169 SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGG---------------- 212
+YGDG+ + G ATE T S+ G + + FGCG+ N G
Sbjct: 179 RYNYGDGTMTMGVYATERFTFTSSGGDRLMTVPLGFGCGSMNVGSLNNGSGIVGFGRNPL 238
Query: 213 --------------------------LFNSKTTGIVGLGGGDISLISQMRTT-------- 238
LF S + G+ G G + +++
Sbjct: 239 SLVSQLSIRRFSYCLTSYGSGRKSTLLFGSLSGGVYGDATGPVQTTPLLQSLQNPTFYYV 298
Query: 239 -----IAGNQRLGVST------PD----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP 283
G +RL + PD +++DSGT LT LP + ++ + P
Sbjct: 299 HLAGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALTLLPGAVLAEVVRAFRQQLRL-P 357
Query: 284 VADPTGSLE--LCY---------SFNSLSQVPEVTIHFRGADVKLSRSNFFVK-VSEDIV 331
A+ G+ E +C+ S S VP + HF+ AD+ L R N+ + + +
Sbjct: 358 FAN-GGNPEDGVCFLVPAAWRRSSSTSQVPVPRMVFHFQDADLDLPRRNYVLDDHRKGRL 416
Query: 332 CSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
C + + GN++Q + V YD+E +T+SF P C
Sbjct: 417 CLLLADSGDDGSTIGNLVQQDMRVLYDLEAETLSFAPAQC 456
>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
Length = 482
Score = 127 bits (319), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 79/229 (34%), Positives = 112/229 (48%), Gaps = 21/229 (9%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP- 86
G +++++HR +S + + L R NR+ ++ + + A+ IP
Sbjct: 59 GNTIQIVHRACLQSGDRKTVPDHHPHYTGILRRDHNRVRSIHRRLTGAGDTAA---TIPA 115
Query: 87 ------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
++ Y++ I IGTP + DTGSDL W QC+PC S CY Q PLFDP S
Sbjct: 116 SLGLAFHSLEYVVTIGIGTPARNFTVLFDTGSDLTWVQCKPCTDS-CYQQQEPLFDPSKS 174
Query: 141 STYKSLPCSSSQC--ASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
STY +PC + QC +C G C+YSV YGD S + GNLA E TL + A
Sbjct: 175 STYVDVPCGTPQCKIGGGQDLTCGGTTCEYSVKYGDQSVTRGNLAQEAFTLSPSAPPAA- 233
Query: 199 LPGITFGCGTN-----NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGN 242
G+ FGC G G++GLG GD S++SQ R +G+
Sbjct: 234 --GVVFGCSHEYSSGVKGAEEEMSVAGLLGLGRGDSSILSQTRRGNSGD 280
>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 504
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 93/348 (26%), Positives = 146/348 (41%), Gaps = 69/348 (19%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ Y R+ +G+P + V DTGSD+ W QC+PC + CY Q P+FDP +S++Y S+
Sbjct: 163 GSGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPC--ADCYQQSDPVFDPSLSTSYASV 220
Query: 147 PCSSSQCASLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
C + +C L+ +C S C Y V+YGDGS++ G+ ATET+TLG + + +
Sbjct: 221 ACDNPRCHDLDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTLGDS----APVSSVAI 276
Query: 205 GCGTNNGGLF--------------------------------NSKTTGIVGLGGGDIS-- 230
GCG +N GLF +S ++ + G +
Sbjct: 277 GCGHDNEGLFVGAAGLLALGGGPLSFPSQISATTFSYCLVDRDSPSSSTLQFGDAADAEV 336
Query: 231 ---LISQMRTTI----------AGNQRLGVSTPDIVIDS----------GTTLTFLPQGY 267
LI RT+ G Q L + +DS GT +T L
Sbjct: 337 TAPLIRSPRTSTFYYVGLSGLSVGGQILSIPPSAFAMDSTGAGGVIVDSGTAVTRLQSSA 396
Query: 268 NSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-GADVKLSRSNFFV 324
+ L ++ P + CY + + +VP V++ F G +++L N+ +
Sbjct: 397 YAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRFAGGGELRLPAKNYLI 456
Query: 325 KVS-EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
V C F +V I GN+ Q V +D + TV F C
Sbjct: 457 PVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKSTVGFTTNKC 504
>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 517
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 109/345 (31%), Positives = 147/345 (42%), Gaps = 61/345 (17%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
NY++ + +GTP + V DTGSD W QC+PC CY Q LFDP SSTY ++
Sbjct: 174 GTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCV-VVCYEQQEKLFDPVRSSTYANV 232
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT------------- 193
C++ C+ LN CSG +C Y V YGDGS+S G A +T+TL S
Sbjct: 233 SCAAPACSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCGERN 292
Query: 194 ----GQAVALPGITFGCGT-------NNGGLF------NSKTTGIVGLGGGDISLISQMR 236
G+A L G+ G + GG+F S TG + G G + S
Sbjct: 293 EGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTGTGYLDFGAGSPAAASARL 352
Query: 237 TT------------------IAGNQRLGV-----STPDIVIDSGTTLTFLPQGYNSNL-- 271
TT G Q L + +T ++DSGT +T LP S+L
Sbjct: 353 TTPMLTDNGPTFYYIGMTGIRVGGQLLSIPQSVFATAGTIVDSGTVITRLPPPAYSSLRY 412
Query: 272 LSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRG-ADVKLSRSNFFVKVSE 328
+ A L+ CY F +SQV P V++ F+G A + + S S
Sbjct: 413 AFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDASGIMYAASA 472
Query: 329 DIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
VC F + V I GN F V YDI ++ V F P C
Sbjct: 473 SQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYPGVC 517
>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 472
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 80/234 (34%), Positives = 120/234 (51%), Gaps = 30/234 (12%)
Query: 30 SVELIHRDSPKSPFYNSS---ETP--YQRLRDALTRSLNRLNHFNQNSSISSSKASQADI 84
SV L HR P +P +S+ + P +RLR R+ + L + +S +
Sbjct: 55 SVPLAHRHGPCAPKGSSATDKKKPSFAERLRSDRARADHILRKASGRRMMSEGGGAS--- 111
Query: 85 IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
IP ++ Y++ + IGTP ++ + DTGSDL W QC+PC S CY Q PLFDP
Sbjct: 112 IPTYLGGFVDSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNASDCYPQKDPLFDP 171
Query: 138 KMSSTYKSLPCSSSQCASL----------NQKSCSGVNCQYSVSYGDGSFSNGNLATETV 187
SST+ ++PC+S C L N S C Y++ YG+G+ + G +TET+
Sbjct: 172 SKSSTFATIPCASDACKQLPVDGYDNGCTNNTSGMPPQCGYAIEYGNGAITEGVYSTETL 231
Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
LGS+ + FGCG++ G ++ K G++GLGG SL+SQ + G
Sbjct: 232 ALGSS----AVVKSFRFGCGSDQHGPYD-KFDGLLGLGGAPESLVSQTASVYGG 280
>gi|255571588|ref|XP_002526740.1| aspartic-type endopeptidase, putative [Ricinus communis]
gi|223533929|gb|EEF35654.1| aspartic-type endopeptidase, putative [Ricinus communis]
Length = 471
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 96/284 (33%), Positives = 141/284 (49%), Gaps = 37/284 (13%)
Query: 6 SCVFILFFLCFYVVSPI------------EAQTGGFSVELIHRDSPKSPFYNSSETPYQR 53
S +F LF L ++ P+ + + GF LIH SP+SPFY + TP +
Sbjct: 8 SAIFRLFLLILHIPFPLSSSFSLPLKELAKGKAYGFKAPLIHWSSPESPFYEPNLTPGEL 67
Query: 54 LRDALTRSLNRLNHFNQ--NSSISSSK---ASQADIIPNNANYLIRISIGTPPTERLAVA 108
+R ++ S R + + +S IS+S+ S+ II + Y+++ +IG+PP E A+
Sbjct: 68 MRASVRTSRARGDRIRKIRSSGISNSRKYPVSRISII--DKVYVMKFNIGSPPVETYAIP 125
Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCAS--------LNQKS 160
DTGS+++W QC + CY Q PLF+P SSTY C +C L KS
Sbjct: 126 DTGSNIVWIQCGSPICTNCYKQKIPLFNPTKSSTYAIRLCGHRECKQALWGLGEYLGCKS 185
Query: 161 CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP-GITFGCGTNN----GGLFN 215
V C+Y +SY D SFS G ++T+ +T + + FGCG NN G N
Sbjct: 186 SVQV-CRYHISYEDHSFSEGTISTDIITFPEHIAEFGNYSLRMFFGCGYNNSETPGQDPN 244
Query: 216 SKTT-GIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVIDSGT 258
S T G+VGLG SL+ Q+ G +STPD+ +GT
Sbjct: 245 SFTAPGVVGLGNEMASLVGQL---TLGQFSYCISTPDVQKPNGT 285
>gi|358346736|ref|XP_003637421.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
gi|355503356|gb|AES84559.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
Length = 280
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 80/200 (40%), Positives = 107/200 (53%), Gaps = 17/200 (8%)
Query: 20 SPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDA-----LTRSLNRLNHFNQNSSI 74
SP + T S++L R S S S T + RD+ +T LN+ + ++ S
Sbjct: 61 SPFTSSTSTLSLQLHSRASLSSHADYKSLTLSRLDRDSARVKYITTKLNQNFNTDKLSGP 120
Query: 75 SSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
S SQ + Y RI IG PP++ V DTGSD+ W QC PC + CY Q P+
Sbjct: 121 IISGTSQG-----SGEYFSRIGIGEPPSQAYMVLDTGSDISWVQCAPC--ADCYRQADPI 173
Query: 135 FDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
F+P S++Y L C ++QC L+Q C NC Y VSYGDGS++ G+ TETVT+G
Sbjct: 174 FEPTASASYAPLSCEAAQCRYLDQSQCRNGNCLYQVSYGDGSYTVGDFVTETVTIGVNKV 233
Query: 195 QAVALPGITFGCGTNNGGLF 214
+ VAL GCG NN GLF
Sbjct: 234 KNVAL-----GCGHNNEGLF 248
>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
Length = 423
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 76/204 (37%), Positives = 106/204 (51%), Gaps = 30/204 (14%)
Query: 34 IHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSS---KASQADIIPN--- 87
+HRDS SP+ ++ T + +R+ L R RL + S+ + K+S + + N
Sbjct: 1 MHRDSADSPYRPANATVHGLVRNRLHRDELRLLSISSRISLGVAGIPKSSLTNPLKNTNP 60
Query: 88 -----------------NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
+ Y + + +GTPP VADTGSD++W QC PC CY Q
Sbjct: 61 FLQQDFETPLRSGLSDGSGEYFVSLGVGTPPRTVNMVADTGSDVLWLQCLPC--QSCYGQ 118
Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLG 190
PLF+P SST++S+ C SS C L + C C Y VSYGDGSF+ G +TET++ G
Sbjct: 119 TDPLFNPSFSSTFQSITCGSSLCQQLLIRGCRRNQCLYQVSYGDGSFTVGEFSTETLSFG 178
Query: 191 STTGQAVALPGITFGCGTNNGGLF 214
S +VA+ GCG NN GLF
Sbjct: 179 SNAVNSVAI-----GCGHNNQGLF 197
>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
Length = 498
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 72/207 (34%), Positives = 107/207 (51%), Gaps = 28/207 (13%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRL------RDA-----LTRSLNRLNHFNQNSSISSS 77
+SVE++HRD+ ++ Y+R R+A L R + R N++
Sbjct: 74 WSVEVVHRDALLLKNAANATASYERRLKEKLRREAVRVRGLERQIERTLTLNKDPVNRYE 133
Query: 78 KASQAD----------IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
++ D + + Y RI +GTP E+ V DTGSD+ W QCEPC +C
Sbjct: 134 NVAEVDADFGGEVVSGMEQGSGEYFTRIGVGTPTREQYMVLDTGSDVAWIQCEPC--REC 191
Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETV 187
Y Q P+F+P S+++ ++ C S+ C+ L+ C C Y SYGDGS+S G+ ATET+
Sbjct: 192 YSQADPIFNPSYSASFSTVGCDSAVCSQLDAYDCHSGGCLYEASYGDGSYSTGSFATETL 251
Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLF 214
T G+T+ VA+ GCG N GLF
Sbjct: 252 TFGTTSVANVAI-----GCGHKNVGLF 273
>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
Length = 482
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 83/229 (36%), Positives = 114/229 (49%), Gaps = 18/229 (7%)
Query: 23 EAQTGGFSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSKAS- 80
EA G + L H SP + + + L + R RLN +S + S
Sbjct: 64 EALKPGVKIRLDHIHGACSPLRPINSSSWIDLVSQSFERDNARLNTIRSKNSGPYTTMSN 123
Query: 81 ---QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
Q+ NY++ GTP L + DTGSDL W QC+PC + CY Q +F+P
Sbjct: 124 LPLQSGTTVGTGNYIVTAGFGTPAKNSLLIIDTGSDLTWIQCKPC--ADCYSQVDAIFEP 181
Query: 138 KMSSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST 192
K SS+YK+LPC S+ C L N C C Y ++YGDGS S G+ + ET+TLGS
Sbjct: 182 KQSSSYKTLPCLSATCTELITSESNPTPCLLGGCVYEINYGDGSSSQGDFSQETLTLGSD 241
Query: 193 TGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
+ Q A FGCG N GLF ++G++GLG +S SQ ++ G
Sbjct: 242 SFQNFA-----FGCGHTNTGLFKG-SSGLLGLGQNSLSFPSQSKSKYGG 284
>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 385
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 64/127 (50%), Positives = 77/127 (60%), Gaps = 3/127 (2%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y IR+S+GTPP V DTGSD++W QC PC CY Q +FDP SSTY +L C
Sbjct: 35 GEYFIRVSVGTPPRGMYLVMDTGSDILWLQCAPC--VSCYHQCDEVFDPYKSSTYSTLGC 92
Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA-VALPGITFGCG 207
+S QC +L+ C G C Y V YGDGSFS G AT+ V+L ST+G V L I GCG
Sbjct: 93 NSRQCLNLDVGGCVGNKCLYQVDYGDGSFSTGEFATDAVSLNSTSGGGQVVLNKIPLGCG 152
Query: 208 TNNGGLF 214
+N G F
Sbjct: 153 HDNEGYF 159
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 44/124 (35%), Positives = 61/124 (49%), Gaps = 5/124 (4%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
++IDSGT++T L ++L + + + CY+ + LS VP VT+H
Sbjct: 262 VIIDSGTSVTRLQNAAYASLREAFRAGTSDLVLTTEFSLFDTCYNLSDLSSVDVPTVTLH 321
Query: 310 FRG-ADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
F+G AD+KL SN+ V V + C F G T I GNI Q F V YD V F
Sbjct: 322 FQGGADLKLPASNYLVPVDNSSTFCLAFAGTTGP-SIIGNIQQQGFRVIYDNLHNQVGFV 380
Query: 368 PTDC 371
P+ C
Sbjct: 381 PSQC 384
>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
Length = 390
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 73/155 (47%), Positives = 89/155 (57%), Gaps = 18/155 (11%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
YL+ ++IGTPP DTGSDLIWTQC+PC C+ Q P FD SST LPC S
Sbjct: 35 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCKPC--VSCFDQPLPYFDTSRSSTNALLPCES 92
Query: 151 SQC---------ASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
+QC LNQ + C Y SYGD S + G LA + T + T +LPG
Sbjct: 93 TQCKLDPTVTVCVKLNQTVQT---CAYYTSYGDNSVTIGLLAADKFTFVAGT----SLPG 145
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
+TFGCG NN G+FNS TGI G G G +SL SQ++
Sbjct: 146 VTFGCGLNNTGVFNSNETGIAGFGRGPLSLPSQLK 180
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 64/128 (50%), Gaps = 9/128 (7%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHF 310
+IDSGT++T LP + ++ I+ V C+S S ++ VP++ +HF
Sbjct: 264 IIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHYTCFSAPSQAKPDVPKLVLHF 323
Query: 311 RGADVKLSRSNFFVKVSED----IVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
GA + L R N+ +V +D I+C ++ KG + I GN Q N V YD++ +S
Sbjct: 324 EGATMDLPRENYVFEVPDDAGNSIICLAINKG--DETTIIGNFQQQNMHVLYDLQNNMLS 381
Query: 366 FKPTDCTK 373
F C K
Sbjct: 382 FVAAQCDK 389
>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
gi|194708650|gb|ACF88409.1| unknown [Zea mays]
Length = 392
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 71/156 (45%), Positives = 98/156 (62%), Gaps = 8/156 (5%)
Query: 86 PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKS 145
P YL+ ++IGTPP A+ADTGSDLIWTQC PC SQC+ Q +PL++P S+T+
Sbjct: 27 PTAGEYLMALAIGTPPLPYQAIADTGSDLIWTQCAPC-TSQCFRQPTPLYNPSSSTTFAV 85
Query: 146 LPCSSSQ--CASLNQKSCS----GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
LPC+SS CA+ + + G C Y+V+YG G +++ +ET T GST +
Sbjct: 86 LPCNSSLSVCAAALAGTGTAPPPGCACTYNVTYGSG-WTSVFQGSETFTFGSTPAGHARV 144
Query: 200 PGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
PGI FGC T + G S +G+VGLG G +SL+SQ+
Sbjct: 145 PGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQL 180
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 66/129 (51%), Gaps = 8/129 (6%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTG--SLELCYSFNSLSQ----VPE 305
++IDSGTT+T L + + + S++ P D + L+LC+ S + +P
Sbjct: 264 LIIDSGTTITLLGNTAYQQVRAAVVSLVTL-PTTDGSADTGLDLCFMLPSSTSAPPAMPS 322
Query: 306 VTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTV 364
+T+HF GAD+ L ++ + + C + T+ V I GN Q N + YDI Q+T+
Sbjct: 323 MTLHFNGADMVLPADSYMMSDDSGLWCLAMQNQTDGEVNILGNYQQQNMHILYDIGQETL 382
Query: 365 SFKPTDCTK 373
SF P C+
Sbjct: 383 SFAPAKCSA 391
>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
Length = 474
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 110/411 (26%), Positives = 177/411 (43%), Gaps = 73/411 (17%)
Query: 30 SVELIHRDSPKSPFYNS-SETPY---QRLR-DALTRSLNRLNHFNQNSSISSSKASQADI 84
S++++H+ P N S + +LR D++ L++++ + + +Q+ I
Sbjct: 69 SLQVLHKYGPCMQVLNDRSHVEFLLQDQLRVDSIQARLSKISGHGIFEEMVTKLPAQSGI 128
Query: 85 IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
NY++ + +GTP + V DTGS + WTQC+PC S CY Q FDP S++Y
Sbjct: 129 AIGTGNYVVTVGLGTPKEDFTLVFDTGSGITWTQCQPCLGS-CYPQKEQKFDPTKSTSYN 187
Query: 145 SLPCSSSQCASL--NQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
++ CSS+ C L +++ CS N C Y + YGD S+S G ATET+T+ S+
Sbjct: 188 NVSCSSASCNLLPTSERGCSASNSTCLYQIIYGDQSYSQGFFATETLTISSSD----VFT 243
Query: 201 GITFGCGTNNGGLFNSK----------------------------------TTGIVGLGG 226
FGCG +N GLF +TG + GG
Sbjct: 244 NFLFGCGQSNNGLFGQAAGLLGLSSSSVSLPSQTAEKYQKQFSYCLPSTPSSTGYLNFGG 303
Query: 227 -----GDISLISQMRTTIAGNQRLGVS--------------TPDIVIDSGTTLTFLPQGY 267
+ IS ++ G +G+S T +IDSGT +T LP
Sbjct: 304 KVSQTAGFTPISPAFSSFYGIDIVGISVAGSQLPIDPSIFTTSGAIIDSGTVITRLPPTA 363
Query: 268 NSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRGA-DVKLSRSNFFV 324
L + P + L+ CY F++ + V P+V++ F+G +V + S
Sbjct: 364 YKALKEAFDEKMSNYPKTNGDELLDTCYDFSNYTTVSFPKVSVSFKGGVEVDIDASGILY 423
Query: 325 KVSE-DIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
V+ +VC F + I+GN Q + V YD + + F C+
Sbjct: 424 LVNGVKMVCLAFAANKDDSEFGIFGNHQQKTYEVVYDGAKGMIGFAAGACS 474
>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
Length = 441
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 81/234 (34%), Positives = 117/234 (50%), Gaps = 27/234 (11%)
Query: 30 SVELIHRDSPKSPFYNSSETP--YQRLRDALTRS---LNRLNHFNQNSSISSSKASQADI 84
SV L+HR P +P S P +RLR R+ + + ++ S A
Sbjct: 18 SVPLVHRHGPCAPSAASGGKPSLAERLRRDRARTNYIVTKATGGRTAATALSDAAGGGTS 77
Query: 85 IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
IP N+ Y++ + IGTP ++ + DTGSDL W QC+PC +CY Q PLFDP
Sbjct: 78 IPTFLGDSVNSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKDPLFDP 137
Query: 138 KMSSTYKSLPCSSSQCASLNQKS----CSGVN------CQYSVSYGDGSFSNGNLATETV 187
SS+Y S+PC S C L + C+GV+ C+Y + YG+ + + G +TET+
Sbjct: 138 SSSSSYASVPCDSDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYSTETL 197
Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
TL V + FGCG + G + K G++GLGG SL+SQ + G
Sbjct: 198 TL----KPGVVVADFGFGCGDHQHGPYE-KFDGLLGLGGAPESLVSQTSSQFGG 246
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/125 (30%), Positives = 59/125 (47%), Gaps = 7/125 (5%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPT--GSLELCYSFNSLSQV--PEVT 307
+VIDSGT +T LP + L S S + + P+ G L+ CY F + V P ++
Sbjct: 319 MVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLDTCYDFTGHANVTVPTIS 378
Query: 308 IHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
+ F G A + L+ + + + G N++ I GN+ Q F V YD + TV F
Sbjct: 379 LTFSGGATIDLAAPAGVLV--DGCLAFAGAGTDNAIGIIGNVNQRTFEVLYDSGKGTVGF 436
Query: 367 KPTDC 371
+ C
Sbjct: 437 RAGAC 441
>gi|357481195|ref|XP_003610883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512218|gb|AES93841.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 315
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 94/295 (31%), Positives = 132/295 (44%), Gaps = 68/295 (23%)
Query: 148 CSSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C S C L+ CS C Y+ YGD S + G LA +T T S TG+ V+L FGC
Sbjct: 21 CDSPLCHKLDTGVCSPEKRCNYTYGYGDNSLTKGVLAQDTATFTSNTGKLVSLSRFLFGC 80
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGN------------------------ 242
G NN G FN G++GLGGG SLISQ+ G
Sbjct: 81 GHNNTGGFNDHEMGLIGLGGGPTSLISQIGPLFGGKKFSQCLVPFLTDIKISSRMSFGKG 140
Query: 243 --------------QR-----------LGVSTPD-------------IVIDSGTTLTFLP 264
QR LG+S D +++DSGT LP
Sbjct: 141 SQVLGDGVVTTPLVQREQDMTSYFVTLLGISVEDTYLPMNSTIEKGNMLVDSGTPPNILP 200
Query: 265 QG-YNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFF 323
Q Y+ + V +++ DP+ +LCY + + P +T HF GA++ L+ F
Sbjct: 201 QQLYDRVYVEVKNNVPLELITNDPSLGPQLCYRTQTNLKGPTLTYHFEGANLLLTPIQTF 260
Query: 324 V---KVSEDIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTKQ 374
+ ++ + C TNS +YGN Q+N+L+G+D+++Q VSFK TDCTKQ
Sbjct: 261 IPPTPETKGVFCLAINNYTNSNGGVYGNFAQSNYLIGFDLDRQVVSFKATDCTKQ 315
>gi|297744129|emb|CBI37099.3| unnamed protein product [Vitis vinifera]
Length = 299
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 98/333 (29%), Positives = 145/333 (43%), Gaps = 91/333 (27%)
Query: 25 QTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI 84
+ GF V L H DS + T ++RL+ A+ R RL + ++ S + +A +
Sbjct: 38 EKNGFRVSLRHVDS------GGNYTKFERLQRAVKRGRLRLQRLSAKTA-SFEPSVEAPV 90
Query: 85 IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
N +L+ ++IGTP A+ DTGSDLIWTQC+PC C+ Q +P+FDP+ SS++
Sbjct: 91 HAGNGEFLMNLAIGTPAETYSAIMDTGSDLIWTQCKPC--KVCFDQPTPIFDPEKSSSFS 148
Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
LPCSS L S GV LATET T G + + I F
Sbjct: 149 KLPCSS----DLYHSSTQGV-----------------LATETFTFGDAS-----VSKIGF 182
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLP 264
GCG +N G S+ G+ ISQM+ + SG+T
Sbjct: 183 GCGEDNRGRAYSQGAGL---------FISQMKLDVDA--------------SGST----- 214
Query: 265 QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF---NSLSQVPEVTIHFRGADVKLSRSN 321
LELC++ S VP++ HF G D+KL + N
Sbjct: 215 -------------------------ELELCFTLPPDGSPVDVPQLVFHFEGVDLKLPKEN 249
Query: 322 FFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFL 354
+ ++ S V + G ++ + I+GN Q N +
Sbjct: 250 YIIEDSALRVICLTMGSSSGMSIFGNFQQQNIV 282
>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
gi|223948083|gb|ACN28125.1| unknown [Zea mays]
gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
Length = 466
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 116/419 (27%), Positives = 173/419 (41%), Gaps = 85/419 (20%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRL--RDALTRSLNRLNHFNQNSSISSSKASQADI- 84
G ++ L+HR P SP + + ++ RD L R+ N + + S+ + Q+ +
Sbjct: 58 GATLPLVHRHGPCSPVMSKEKPSHEETLGRDQL-RAANIHAKLSSPRNSSAKELQQSGVT 116
Query: 85 IPNNA-------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
IP ++ Y+I +S+GTP ++ DTGSD+ W QC PC C Q LFDP
Sbjct: 117 IPTSSGYSLGTPEYVITVSLGTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKDKLFDP 176
Query: 138 KMSSTYKSLPCSSSQCASLNQKS--CSGVNCQYSVSYGD-----GSFSNGNLATET---- 186
S+TY + CSS+QCA L + C +CQY V Y D G++ + L T
Sbjct: 177 AKSATYSAFSCSSAQCAQLGGEGNGCLNSHCQYIVKYVDHSNTTGTYGSDTLGLTTSDAV 236
Query: 187 ---------------------VTLGSTTGQAVALPGITFG---------CGTNNGGLFNS 216
+ LG T V+ T+G ++ GG
Sbjct: 237 KNFQFGCSHRANGFVGQLDGLMGLGGDTESLVSQTAATYGKAFSYCLPPSSSSAGGFLT- 295
Query: 217 KTTGIVGLGGGDISLISQMRT------------------TIAG---NQRLGVSTPDIVID 255
+G G S RT T+AG N V + V+D
Sbjct: 296 -----LGAAAGGTSSSRYSRTPLVRFNVPTFYGVFLQAITVAGTKLNVPASVFSGASVVD 350
Query: 256 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF-RG 312
SGT +T LP L + ++A P A P G L+ C+ F+ + +VP VT+ F RG
Sbjct: 351 SGTVITQLPPTAYQALRTAFKKEMKAYPSAAPVGILDTCFDFSGIKTVRVPVVTLTFSRG 410
Query: 313 ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
A + L S F + G T I GN+ Q F + +D+ T+ F+P C
Sbjct: 411 AVMDLDVSGIFYAGCLAFTATAQDGDTG---ILGNVQQRTFEMLFDVGGSTLGFRPGAC 466
>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 351
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 70/153 (45%), Positives = 96/153 (62%), Gaps = 9/153 (5%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y+++IS+GTPP + A+ DTGSDL W QC PC ++C+ Q PLF P SS+Y +
Sbjct: 5 SGEYVLQISLGTPPQQFSAIVDTGSDLCWVQCAPC--ARCFEQPDPLFIPLASSSYSNAS 62
Query: 148 CSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C+ S C +L + +CS N C YS SYGDGS + G+ A ETVTL +T L I FGC
Sbjct: 63 CTDSLCDALPRPTCSMRNTCTYSYSYGDGSNTRGDFAFETVTLNGST-----LARIGFGC 117
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
G N G F + G++GLG G +SL SQ+ ++
Sbjct: 118 GHNQEGTF-AGADGLIGLGQGPLSLPSQLNSSF 149
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 41/128 (32%), Positives = 66/128 (51%), Gaps = 9/128 (7%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTG-SLELCYSFNSLSQ----VPEV 306
+++DSGTT+T+ +L+ + I + P ADPT L LCY +S+S +P +
Sbjct: 226 VILDSGTTITYWRLAAFIPILAELRRQI-SYPEADPTPYGLNLCYDISSVSASSLTLPSM 284
Query: 307 TIHFRGADVKLSRSNFFVKVSE--DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 364
T+H D ++ SN +V V + VC+ ++ I GN+ Q N L+ D+ V
Sbjct: 285 TVHLTNVDFEIPVSNLWVLVDNFGETVCTAMS-TSDQFSIIGNVQQQNNLIVTDVANSRV 343
Query: 365 SFKPTDCT 372
F TDC+
Sbjct: 344 GFLATDCS 351
>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
Length = 420
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 63/133 (47%), Positives = 83/133 (62%), Gaps = 8/133 (6%)
Query: 84 IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
I + +Y RI +GTP VADTGSD+ W QC PC +CY Q P+F+P +SS++
Sbjct: 74 IAGGSGDYFARIGVGTPARSVYMVADTGSDVSWLQCSPC--RKCYRQQDPIFNPSLSSSF 131
Query: 144 KSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
K L C+SS C L K CS N C Y VSYGDGSF+ G+ +TET++ G ++VA+
Sbjct: 132 KPLACASSICGKLKIKGCSRKNECMYQVSYGDGSFTVGDFSTETLSFGEHAVRSVAM--- 188
Query: 203 TFGCGTNNGGLFN 215
GCG NN GLF+
Sbjct: 189 --GCGRNNQGLFH 199
>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
Length = 452
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 80/219 (36%), Positives = 114/219 (52%), Gaps = 28/219 (12%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP-- 86
+ ++HRD+ P + + R R A H Q S+ S+ A+ AD++
Sbjct: 30 LHIPVVHRDAVFPPRRGAPPGSF-RCRHAAP-------HTAQLESLHSATAA-ADLLRSP 80
Query: 87 -------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
++ Y I +G PPT L V DTGSDLIW QC PC +CY Q +PL+DP+
Sbjct: 81 VMSGVPFDSGEYFAVIGVGDPPTHALVVIDTGSDLIWLQCLPC--RRCYRQVTPLYDPRN 138
Query: 140 SSTYKSLPCSSSQCAS-LNQKSCSGV--NCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
S T++ +PC+S QC L C C Y V YGDGS S+G+LAT+T+ L T
Sbjct: 139 SKTHRRIPCASPQCRGVLRYPGCDARTGGCVYMVVYGDGSASSGDLATDTLVLPDDT--- 195
Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
+ +T GCG +N GL S G++G G G +S +Q+
Sbjct: 196 -RVHNVTLGCGHDNEGLLAS-AAGLLGAGRGQLSFPTQL 232
Score = 46.6 bits (109), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 31/136 (22%), Positives = 62/136 (45%), Gaps = 13/136 (9%)
Query: 252 IVIDSGTTLT-FLPQGYNSNLLSVMSSMIEA--QPVADPTGSLELCYSFNSLS-----QV 303
+V+DSGT ++ F Y + + +S A + + + + CY + +V
Sbjct: 317 VVVDSGTAISRFTRDAYAAVRDAFVSHAAAAGMRRLRNKFSVFDTCYDVHGNGPGTGVRV 376
Query: 304 PEVTIHFRGA-DVKLSRSNFFVKV----SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYD 358
P + +HF A D+ L ++N+ + V C + + + + GN+ Q F V +D
Sbjct: 377 PSIVLHFAAAADMALPQANYLIPVVGGDRRTYFCLGLQAADDGLNVLGNVQQQGFGVVFD 436
Query: 359 IEQQTVSFKPTDCTKQ 374
+E+ + F P C+ +
Sbjct: 437 VERGRIGFTPNGCSGE 452
>gi|116789442|gb|ABK25248.1| unknown [Picea sitchensis]
Length = 366
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 72/207 (34%), Positives = 107/207 (51%), Gaps = 28/207 (13%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRL------RDA-----LTRSLNRLNHFNQNSSISSS 77
+SVE++HRD+ ++ Y+R R+A L R + R N++
Sbjct: 74 WSVEVVHRDALLLKNAANATASYERRLKEKLRREAVRVRGLERQIERTLTLNKDPVNRYE 133
Query: 78 KASQAD----------IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
++ D + + Y RI +GTP E+ V DTGSD+ W QCEPC +C
Sbjct: 134 NVAEVDADFGGEVVSGMEQGSGEYFTRIGVGTPTREQYMVLDTGSDVAWIQCEPC--REC 191
Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETV 187
Y Q P+F+P S+++ ++ C S+ C+ L+ C C Y SYGDGS+S G+ ATET+
Sbjct: 192 YSQADPIFNPSYSASFSTVGCDSAVCSQLDAYDCHSGGCLYEASYGDGSYSTGSFATETL 251
Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLF 214
T G+T+ VA+ GCG N GLF
Sbjct: 252 TFGTTSVANVAI-----GCGHKNVGLF 273
>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 406
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 70/164 (42%), Positives = 91/164 (55%), Gaps = 12/164 (7%)
Query: 56 DALTRSLNRLNHFNQNSSISSSKASQADIIPN----NANYLIRISIGTPPTERLAVADTG 111
+ LTRS +R + S+ QA ++ + Y IRIS+GTPP V DTG
Sbjct: 24 NGLTRSRSR-----DRQTKVPSQDFQAPVVSGLSLGSGEYFIRISVGTPPRRMYLVMDTG 78
Query: 112 SDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVS 171
SD++W QC PC CY Q +FDP SSTY +L CS+ QC +L+ +C C Y V
Sbjct: 79 SDILWLQCAPC--VNCYHQSDAIFDPYKSSTYSTLGCSTRQCLNLDIGTCQANKCLYQVD 136
Query: 172 YGDGSFSNGNLATETVTLGSTTGQA-VALPGITFGCGTNNGGLF 214
YGDGSF+ G T+ V+L ST+G V L I GCG +N G F
Sbjct: 137 YGDGSFTTGEFGTDDVSLNSTSGVGQVVLNKIPLGCGHDNEGYF 180
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 45/127 (35%), Positives = 62/127 (48%), Gaps = 11/127 (8%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL---CYSFNSLS--QVPEV 306
++IDSGT++T L N+ S+ + PT L CY + L+ VP V
Sbjct: 283 VIIDSGTSVTRLQ---NAAYASLRDAFRAGTSDLAPTAGFSLFDTCYDLSGLASVDVPTV 339
Query: 307 TIHFRGA-DVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 364
T+HF+G D+KL SN+ + V + + C F G T I GNI Q F V YD V
Sbjct: 340 TLHFQGGTDLKLPASNYLIPVDNSNTFCLAFAGTTGP-SIIGNIQQQGFRVIYDNLHNQV 398
Query: 365 SFKPTDC 371
F P+ C
Sbjct: 399 GFVPSQC 405
>gi|125532788|gb|EAY79353.1| hypothetical protein OsI_34482 [Oryza sativa Indica Group]
Length = 394
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 97/352 (27%), Positives = 156/352 (44%), Gaps = 77/352 (21%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY+ +IGTPP AV D +L+WTQC+ C S+C+ QD+PLFDP S+TY++ PC
Sbjct: 50 NYVANFTIGTPPQPASAVIDLAGELVWTQCKQC--SRCFEQDTPLFDPTASNTYRAEPCG 107
Query: 150 SSQCASL--NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
+ C S+ + ++CSG C Y S G + G + T+T +G+ A + FGC
Sbjct: 108 TPLCESIPSDSRNCSGNVCAYQASTNAGD-TGGKVGTDTFAVGT------AKASLAFGCV 160
Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTT----------------------------- 238
+ +GIVGLG SL++Q
Sbjct: 161 VASDIDTMGGPSGIVGLGRTPWSLVTQTGVAAFSYCLAPHDAGKNSALFLGSSAKLAGGG 220
Query: 239 ---------IAGN----------QRLGVSTPDIVI-----------DSGTTLTFLPQGYN 268
I+GN Q G+ D +I D+ + ++FL G
Sbjct: 221 KAASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGSTVLLDTFSPISFLVDGAY 280
Query: 269 SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHFR-GADVKLSRSNFFVKV 326
+ ++ + A P+A P +LC+ + S P++ FR GA + ++ SN+ +
Sbjct: 281 QAVKKAVTVAVGAPPMATPVEPFDLCFPKSGASGAAPDLVFTFRGGAAMTVAASNYLLDY 340
Query: 327 SEDIVC-----SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
VC S T + + G++ Q N +D++++T+SF+P DCTK
Sbjct: 341 KNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPADCTK 392
>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
Length = 497
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 72/210 (34%), Positives = 110/210 (52%), Gaps = 35/210 (16%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN 90
++++HRDS S +++ + L++ L R R++ N +++ S+A++ P N +
Sbjct: 70 LQVVHRDSLSSS--SNTSLVKEILQERLKRDAARVDSINARVQLAAMGVSKAEMKPLNGS 127
Query: 91 -------------------------YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
Y R+ +GTPP V DTGSD++W QC PC +
Sbjct: 128 SIDARFDAKDFSSSIISGLAQGSGEYFTRLGVGTPPRYTYMVLDTGSDIMWIQCLPC--A 185
Query: 126 QCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLAT 184
+CY Q PLF+P SSTY+ +PC++ C L+ C C+Y VSYGDGSF+ G+ +T
Sbjct: 186 KCYGQTDPLFNPAASSTYRKVPCATPLCKKLDISGCRNKRYCEYQVSYGDGSFTVGDFST 245
Query: 185 ETVTLGSTTGQAVALPGITFGCGTNNGGLF 214
ET+T + VAL GCG +N GLF
Sbjct: 246 ETLTFRGQVIRRVAL-----GCGHDNEGLF 270
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 43/133 (32%), Positives = 62/133 (46%), Gaps = 22/133 (16%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL---------ELCYSFNSLS- 301
++IDSGT++T L S S+M +A V TG+L + CY + L
Sbjct: 372 VIIDSGTSVTRLVD-------SAYSTMRDAFRVG--TGNLKSAGGFSLFDTCYDLSGLKT 422
Query: 302 -QVPEVTIHFRG-ADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYD 358
+VP + HF+G A + L +N+ + V S C F G T + I GNI Q + V +D
Sbjct: 423 VKVPTLVFHFQGGAHISLPATNYLIPVDSSATFCFAFAGNTGGLSIIGNIQQQGYRVVFD 482
Query: 359 IEQQTVSFKPTDC 371
V FK C
Sbjct: 483 SLANRVGFKAGSC 495
>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
Length = 353
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 63/133 (47%), Positives = 83/133 (62%), Gaps = 8/133 (6%)
Query: 84 IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
I + +Y RI +GTP VADTGSD+ W QC PC +CY Q P+F+P +SS++
Sbjct: 7 IAGGSGDYFARIGVGTPARSVYMVADTGSDVSWLQCSPC--RKCYRQQDPIFNPSLSSSF 64
Query: 144 KSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
K L C+SS C L K CS N C Y VSYGDGSF+ G+ +TET++ G ++VA+
Sbjct: 65 KPLACASSICGKLKIKGCSRKNKCMYQVSYGDGSFTVGDFSTETLSFGEHAVRSVAM--- 121
Query: 203 TFGCGTNNGGLFN 215
GCG NN GLF+
Sbjct: 122 --GCGRNNQGLFH 132
>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
Length = 465
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 102/360 (28%), Positives = 160/360 (44%), Gaps = 53/360 (14%)
Query: 22 IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS--ISSSKA 79
+E + SV L+HR P + S+ P + L S R N+ +S ++S+
Sbjct: 48 LEPSSATLSVPLVHRYGPCAA-SQYSDMPTPSFSETLRHSRARTNYIKSRASTGMASTPD 106
Query: 80 SQADIIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
A +P ++ Y++ + GTP ++ + DTGSD+ W QC PC ++CY Q
Sbjct: 107 DAAVTVPTRLGGFVDSLEYMVTLGFGTPSVPQVLLMDTGSDVSWVQCAPCNSTECYPQKD 166
Query: 133 PLFDPKMSSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETV 187
PLFDP SSTY + C + C L N + G C Y V YGDGS + G + ET+
Sbjct: 167 PLFDPSKSSTYAPIACGADACNKLGDHYRNGCTSGGTQCGYRVEYGDGSSTRGVYSNETI 226
Query: 188 TLGSTTGQAVALPGIT-----FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI--- 239
T PGIT FGCG + G + K G++GLGG SL+ Q +
Sbjct: 227 TFA---------PGITVKDFHFGCGHDQRGP-SDKFDGLLGLGGAPESLVVQTASVYGGA 276
Query: 240 -----------AGNQRLGVSTPDIVIDSG----TTLTFLPQGYNSNLLSVMSSMIEAQPV 284
AG LGV P ++ T + LP S ++++ + +P+
Sbjct: 277 FSYCLPALNSEAGFLALGVR-PSAATNTSAFVFTPMWHLPMDATSYMVNMTGISVGGKPL 335
Query: 285 ADPTGSLE---LCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDI-VCSVFKGITN 340
P + L S ++++PE + A ++ + + + + SED C F G +N
Sbjct: 336 DIPRSAFRGGMLIDSGTIVTELPETAYNALNAALRKAFAAYPMVASEDFDTCYNFTGYSN 395
>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 486
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 61/127 (48%), Positives = 78/127 (61%), Gaps = 7/127 (5%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y R+ IG PP+ V DTGSD+ W QC PC ++CY Q P+F+P S+++ SL
Sbjct: 148 SGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPC--AECYEQTDPIFEPTSSASFTSLS 205
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
C + QC SL+ C C Y VSYGDGS++ G+ TETVTLGST +L I GCG
Sbjct: 206 CETEQCKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETVTLGST-----SLGNIAIGCG 260
Query: 208 TNNGGLF 214
NN GLF
Sbjct: 261 HNNEGLF 267
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 54/124 (43%), Gaps = 4/124 (3%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
I++DSGT +T L + L A + CY +S S +VP V+ H
Sbjct: 363 IIVDSGTAVTRLQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYDLSSKSRVEVPTVSFH 422
Query: 310 F-RGADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
F G ++ L N+ + V SE C F +++ I GN Q VG+D+ V F
Sbjct: 423 FANGNELPLPAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVGFDLANSLVGFS 482
Query: 368 PTDC 371
P C
Sbjct: 483 PNKC 486
>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 429
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 73/187 (39%), Positives = 105/187 (56%), Gaps = 10/187 (5%)
Query: 29 FSVELIHRDSPKSPFYNSS-ETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
F ELI+R+ SP + + +TP + A+ R R ++ ++ + + +
Sbjct: 28 FRAELIYREHQSSPLRSETLKTPSEIFIAAVKRGHERRARLAKHV-LAGDQLFETPVASG 86
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
N YLI IS G PP + A+ DTGSDL W QC PC CY S FDP S++YK+L
Sbjct: 87 NGEYLIDISYGNPPQKSTAIVDTGSDLNWVQCLPC--KSCYETLSAKFDPSKSASYKTLG 144
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
C S+ C L +SC+ +CQY YGDGS ++G L+T+ VT+G TG+ +P + FGCG
Sbjct: 145 CGSNFCQDLPFQSCA-ASCQYDYMYGDGSSTSGALSTDDVTIG--TGK---IPNVAFGCG 198
Query: 208 TNNGGLF 214
+N G F
Sbjct: 199 NSNLGTF 205
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 38/126 (30%), Positives = 66/126 (52%), Gaps = 10/126 (7%)
Query: 252 IVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGS---LELCYSFNSLSQ--VPE 305
+++DSGTTLT+L +N +++++ A P + GS LE C+S ++ P
Sbjct: 305 LILDSGTTLTYLDVDAFNP----MVAALKAALPYPEADGSFYGLEYCFSTAGVANPTYPT 360
Query: 306 VTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
V HF GADV L+ N F+ + + + + I+GNI Q N ++ +D+ + +
Sbjct: 361 VVFHFNGADVALAPDNTFIALDFEGTTCLAMASSTGFSIFGNIQQLNHVIVHDLVNKRIG 420
Query: 366 FKPTDC 371
FK +C
Sbjct: 421 FKSANC 426
>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 521
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 81/234 (34%), Positives = 117/234 (50%), Gaps = 27/234 (11%)
Query: 30 SVELIHRDSPKSPFYNSSETP--YQRLRDALTRS---LNRLNHFNQNSSISSSKASQADI 84
SV L+HR P +P S P +RLR R+ + + ++ S A
Sbjct: 98 SVPLVHRHGPCAPSAASGGKPSLAERLRRDRARTNYIVTKATGGRTAATALSDAAGGGTS 157
Query: 85 IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
IP N+ Y++ + IGTP ++ + DTGSDL W QC+PC +CY Q PLFDP
Sbjct: 158 IPTFLGDSVNSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKDPLFDP 217
Query: 138 KMSSTYKSLPCSSSQCASLNQKS----CSGVN------CQYSVSYGDGSFSNGNLATETV 187
SS+Y S+PC S C L + C+GV+ C+Y + YG+ + + G +TET+
Sbjct: 218 SSSSSYASVPCDSDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYSTETL 277
Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
TL V + FGCG + G + K G++GLGG SL+SQ + G
Sbjct: 278 TL----KPGVVVADFGFGCGDHQHGPYE-KFDGLLGLGGAPESLVSQTSSQFGG 326
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/125 (30%), Positives = 59/125 (47%), Gaps = 7/125 (5%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPT--GSLELCYSFNSLSQV--PEVT 307
+VIDSGT +T LP + L S S + + P+ G L+ CY F + V P ++
Sbjct: 399 MVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLDTCYDFTGHANVTVPTIS 458
Query: 308 IHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
+ F G A + L+ + + + G N++ I GN+ Q F V YD + TV F
Sbjct: 459 LTFSGGATIDLAAPAGVLV--DGCLAFAGAGTDNAIGIIGNVNQRTFEVLYDSGKGTVGF 516
Query: 367 KPTDC 371
+ C
Sbjct: 517 RAGAC 521
>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
gi|223948009|gb|ACN28088.1| unknown [Zea mays]
gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
Length = 507
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 71/167 (42%), Positives = 93/167 (55%), Gaps = 23/167 (13%)
Query: 90 NYLIRISIG----TPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKS 145
NY+ IS+G +P + DTGSDL W QC+PC S CY Q PLFDP S+TY +
Sbjct: 143 NYVTTISLGGSSGSPAANLTVIVDTGSDLTWVQCKPC--SACYAQRDPLFDPAGSATYAA 200
Query: 146 LPCSSSQCA-----------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
+ C++S CA S C Y+++YGDGSFS G LAT+TV LG +
Sbjct: 201 VRCNASACADSLRAATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVALGGAS- 259
Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
L G FGCG +N GLF T G++GLG ++SL+SQ + G
Sbjct: 260 ----LGGFVFGCGLSNRGLFGG-TAGLMGLGRTELSLVSQTASRYGG 301
>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 446
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 121/418 (28%), Positives = 169/418 (40%), Gaps = 98/418 (23%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN 90
++LIH +S SP YNS +T + + + + S+ S P
Sbjct: 45 IKLIHHESSLSP-YNSKDTIWDHYSHKILKQ-----------TFSNDYISNLVPSPRYVV 92
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
+L+ SIG PP +LAV DTGS L W C PC S C Q P+FDP SSTY +L CS
Sbjct: 93 FLMNFSIGEPPIPQLAVMDTGSSLTWVMCHPC--SSCSQQSVPIFDPSKSSTYSNLSCSE 150
Query: 151 -SQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG-- 207
++C +N + C YSV Y S G A E +TL + + +P + FGCG
Sbjct: 151 CNKCDVVNGE------CPYSVEYVGSGSSQGIYAREQLTLETIDESIIKVPSLIFGCGRK 204
Query: 208 --TNNGGLFNSKTTGIVGLGGGDISL-----------ISQMRTTIAGNQRL--------- 245
++ G G+ GLG G SL I +R T RL
Sbjct: 205 FSISSNGYPYQGINGVFGLGSGRFSLLPSFGKKFSYCIGNLRNTNYKFNRLVLGDKANMQ 264
Query: 246 ----------------------GVSTPDI----------------VIDSGTTLTFLPQGY 267
G DI +IDSG T+L + Y
Sbjct: 265 GDSTTLNVINGLYYVNLEAISIGGRKLDIDPTLFERSITDNNSGVIIDSGADHTWLTK-Y 323
Query: 268 NSNLLSV-MSSMIEAQPV---ADPTGSLELCYS---FNSLSQVPEVTIHF-RGADVKLSR 319
+LS + +++E V D LCYS LS P VT HF GA + L
Sbjct: 324 GFEVLSFEVENLLEGVLVLAQQDKHNPYTLCYSGVVSQDLSGFPLVTFHFAEGAVLDLDV 383
Query: 320 SNFFVKVSEDIVC------SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
++ F++ +E+ C + F S G + Q N+ VGYD+ + V F+ DC
Sbjct: 384 TSMFIQTTENEFCMAMLPGNYFGDDYESFSSIGMLAQQNYNVGYDLNRMRVYFQRIDC 441
>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 457
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 111/358 (31%), Positives = 154/358 (43%), Gaps = 79/358 (22%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ NY ++I +GTP + DTGS L W QC+PC C++Q P+F P +S TYK+L
Sbjct: 104 SGNYYVKIGVGTPAKYFSMIVDTGSSLSWLQCQPCV-IYCHVQVDPIFTPSVSKTYKALS 162
Query: 148 CSSSQCA-----SLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
CSSSQC+ +LN CS C Y SYGD SFS G L+ + +TL T A
Sbjct: 163 CSSSQCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYLSQDVLTL---TPSAAPSS 219
Query: 201 GITFGCGTNNGGLF----------------------------------------NSKTTG 220
G +GCG +N GLF NS +G
Sbjct: 220 GFVYGCGQDNQGLFGRSAGIIGLANDKLSMLGQLSNKYGNAFSYCLPSSFSAQPNSSVSG 279
Query: 221 IVGLGGG-------------------DISLISQMRTTIAGNQRLGVSTPDI----VIDSG 257
+ +G + + T+AG + LGVS +IDSG
Sbjct: 280 FLSIGASSLSSSPYKFTPLVKNPKIPSLYFLGLTTITVAG-KPLGVSASSYNVPTIIDSG 338
Query: 258 TTLTFLPQG-YNSNLLSVMSSMIEAQPVADPTGSLELCY--SFNSLSQVPEVTIHFRG-A 313
T +T LP YN+ S + M + A L+ C+ S +S VPE+ I FRG A
Sbjct: 339 TVITRLPVAIYNALKKSFVMIMSKKYAQAPGFSILDTCFKGSVKEMSTVPEIRIIFRGGA 398
Query: 314 DVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
++L N V++ + C +N + I GN Q F V YD+ + F P C
Sbjct: 399 GLELKVHNSLVEIEKGTTCLAIAASSNPISIIGNYQQQTFTVAYDVANSKIGFAPGGC 456
>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 452
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 128/462 (27%), Positives = 199/462 (43%), Gaps = 106/462 (22%)
Query: 5 LSCVFILFFLCFYVV------SPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDAL 58
+S + LFF + S ++ + ++L H S KSP NS+ + +
Sbjct: 1 MSLFWFLFFSAHLAIASSLKDSGLKHKQPDMQLKLYHMTSLKSP-PNSTSLLFAYM---F 56
Query: 59 TRSLNRLNHFNQN-SSISSSKASQADIIPNNA-------------NYLIRISIGTPPTER 104
+ R+ +F+ + S + AS + P A NY +++ +G+P
Sbjct: 57 AKDEERIRYFHSRLAKNSDANASSKKVGPKLAGIPLKSGLSMGSGNYYVKMGLGSPTKYY 116
Query: 105 LAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-----ASLNQK 159
+ DTGS W QC+PC C++Q+ P+F+P S TYK++PCSSSQC A+LN+
Sbjct: 117 TMIVDTGSSFSWLQCQPCT-IYCHIQEDPVFNPSASKTYKTVPCSSSQCSSLKSATLNEP 175
Query: 160 SCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLF--- 214
+CS + C Y SYGD SFS G L+ + +TL T Q L +GCG +N GLF
Sbjct: 176 TCSKQSNACVYKASYGDSSFSLGYLSQDVLTL--TPSQT--LSSFVYGCGQDNQGLFGRT 231
Query: 215 ------------------------------------NSKTTGIVGLGGGDIS-------- 230
NS G + +G ++
Sbjct: 232 DGIIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSPKEGFLSIGTSSLTPSSSYKFT 291
Query: 231 ------------LISQMRTTIAGNQRLGVSTPDI----VIDSGTTLTFLPQGYNSNLLSV 274
I T+AG + LGV+ +IDSGT +T LP + L +
Sbjct: 292 PLLKNPNNPSLYFIDLESITVAG-RPLGVAASSYKVPTIIDSGTVITRLPTPVYTTLKNA 350
Query: 275 MSSMIEAQPVADPTGSL-ELCY--SFNSLSQV-PEVTIHFRG-ADVKLSRSNFFVKVSED 329
+++ + P SL + C+ S +S+V P++ I F+G AD++L N V++
Sbjct: 351 YVTILSKKYQQAPGISLLDTCFKGSLAGISEVAPDIRIIFKGGADLQLKGHNSLVELETG 410
Query: 330 IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
I C G ++S+ I GN Q V YD+ V F P C
Sbjct: 411 ITCLAMAG-SSSIAIIGNYQQQTVKVAYDVGNSRVGFAPGGC 451
>gi|115483168|ref|NP_001065177.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|21717168|gb|AAM76361.1|AC074196_19 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433289|gb|AAP54827.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639786|dbj|BAF27091.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|215686408|dbj|BAG87693.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 96/352 (27%), Positives = 156/352 (44%), Gaps = 77/352 (21%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY+ +IGTPP AV D +L+WTQC+ C S+C+ QD+PLFDP S+TY++ PC
Sbjct: 50 NYVANFTIGTPPQPASAVIDLAGELVWTQCKQC--SRCFEQDTPLFDPTASNTYRAEPCG 107
Query: 150 SSQCASL--NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
+ C S+ + ++CSG C Y S G + G + T+T +G+ A + FGC
Sbjct: 108 TPLCESIPSDSRNCSGNVCAYQASTNAGD-TGGKVGTDTFAVGT------AKASLAFGCV 160
Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTT----------------------------- 238
+ +GIVGLG SL++Q
Sbjct: 161 VASDIDTMGGPSGIVGLGRTPWSLVTQTGVAAFSYCLAPHDAGRNSALFLGSSAKLAGGG 220
Query: 239 ---------IAGN----------QRLGVSTPDIVI-----------DSGTTLTFLPQGYN 268
I+GN Q G+ D +I D+ + ++FL G
Sbjct: 221 KAASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGSTVLLDTFSPISFLVDGAY 280
Query: 269 SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHFR-GADVKLSRSNFFVKV 326
+ +++ + A P+A P +LC+ + S P++ FR GA + + +N+ +
Sbjct: 281 QAVKKAVTAAVGAPPMATPVEPFDLCFPKSGASGAAPDLVFTFRGGAAMTVPATNYLLDY 340
Query: 327 SEDIVC-----SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
VC S T + + G++ Q N +D++++T+SF+P DCTK
Sbjct: 341 KNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPADCTK 392
>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
gi|238011188|gb|ACR36629.1| unknown [Zea mays]
Length = 342
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 99/347 (28%), Positives = 137/347 (39%), Gaps = 88/347 (25%)
Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN- 165
V DTGSD++W QC PC +CY Q P+FDP+ SS+Y ++ C ++ C L+ C
Sbjct: 2 VLDTGSDVVWVQCAPC--RRCYEQSGPVFDPRRSSSYGAVGCGAALCRRLDSGGCDLRRG 59
Query: 166 -CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLF---------- 214
C Y V+YGDGS + G+ TET+T G VA + GCG +N GLF
Sbjct: 60 ACMYQVAYGDGSVTAGDFVTETLTF--AGGARVAR--VALGCGHDNEGLFVAAAGLLGLG 115
Query: 215 ----------------------------------NSKTTGIVGLGGGDISLISQMRTTIA 240
S + V G G + S T +
Sbjct: 116 RGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFGAGSVGASSASFTPMV 175
Query: 241 GNQRL-------------------GVSTPDIVID-----------SGTTLTFLPQGYNSN 270
N R+ GV+ D+ +D SGT++T L + S
Sbjct: 176 RNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLARASYSA 235
Query: 271 LLSVMSSMIEAQPVADPTG--SLELCYSFNS--LSQVPEVTIHFR-GADVKLSRSNFFVK 325
L + P G + CY + +VP V++HF GA+ L N+ +
Sbjct: 236 LRDAFRAAAAGGLRLSPGGFSLFDTCYDLGGRRVVKVPTVSMHFAGGAEAALPPENYLIP 295
Query: 326 V-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
V S C F G V I GNI Q F V +D + Q V F P C
Sbjct: 296 VDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 342
>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 472
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 73/199 (36%), Positives = 102/199 (51%), Gaps = 24/199 (12%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD---- 83
S+ L H D+ +S++TP Q + L R R+ ++++ S A ++
Sbjct: 61 ALSLHLHHIDA-----LSSNKTPEQLFQLRLQRDAKRVEGVVALAALNQSHARRSGSSFS 115
Query: 84 ------IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
+ + Y RI +GTP V DTGSD++W QC PC +CY Q P+FDP
Sbjct: 116 SSIISGLAQGSGEYFTRIGVGTPARYVYMVLDTGSDVVWLQCAPC--RKCYTQADPVFDP 173
Query: 138 KMSSTYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
S TY +PC + C L+ C+ N CQY VSYGDGSF+ G+ +TET+T T
Sbjct: 174 TKSRTYAGIPCGAPLCRRLDSPGCNNKNKVCQYQVSYGDGSFTFGDFSTETLTFRRTRVT 233
Query: 196 AVALPGITFGCGTNNGGLF 214
VAL GCG +N GLF
Sbjct: 234 RVAL-----GCGHDNEGLF 247
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 57/123 (46%), Gaps = 3/123 (2%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
++IDSGT++T L + L A + C+ + L++V P V +H
Sbjct: 349 VIIDSGTSVTRLTRPAYIALRDAFRVGASHLKRAAEFSLFDTCFDLSGLTEVKVPTVVLH 408
Query: 310 FRGADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
FRGADV L +N+ + V C F G + + I GNI Q F V +D+ V F P
Sbjct: 409 FRGADVSLPATNYLIPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRVSFDLAGSRVGFAP 468
Query: 369 TDC 371
C
Sbjct: 469 RGC 471
>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
Length = 448
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 69/159 (43%), Positives = 96/159 (60%), Gaps = 6/159 (3%)
Query: 79 ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
A++ + + YL+ ++IGTPP A+ DTGSDLIWTQC PC C Q +P F P
Sbjct: 80 AARILVAASQGEYLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPC--VLCADQPTPYFRPA 137
Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
S+TY+ +PC S CA+L +C + C Y YGD + + G LA+ET T G+ V
Sbjct: 138 RSATYRLVPCRSPLCAALPYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKV 197
Query: 198 ALPGITFGCG-TNNGGLFNSKTTGIVGLGGGDISLISQM 235
+ + FGCG N+G L NS +G+VGLG G +SL+SQ+
Sbjct: 198 MVSDVAFGCGNINSGQLANS--SGMVGLGRGPLSLVSQL 234
>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
Length = 350
Score = 124 bits (312), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 60/128 (46%), Positives = 82/128 (64%), Gaps = 7/128 (5%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ Y RI IGTP E+ V DTGSD++W QCEPC +CY Q P+F+P S ++ ++
Sbjct: 4 GSGEYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPC--RECYSQADPIFNPSSSVSFSTV 61
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C S+ C+ L+ C G C Y VSYGDGS++ G+ ATET+T G+T+ Q VA+ GC
Sbjct: 62 GCDSAVCSQLDANDCHGGGCLYEVSYGDGSYTVGSYATETLTFGTTSIQNVAI-----GC 116
Query: 207 GTNNGGLF 214
G +N GLF
Sbjct: 117 GHDNVGLF 124
Score = 46.2 bits (108), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 50/124 (40%), Gaps = 4/124 (3%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
I+IDSGT +T L L + + P AD + CY ++L V P V H
Sbjct: 226 IIIDSGTAVTRLQTSAYDALRDAFIAGTQHLPRADGISIFDTCYDLSALQSVSIPAVGFH 285
Query: 310 F-RGADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
F GA L N + + S C F +++ I GNI Q V +D V F
Sbjct: 286 FSNGAGFILPAKNCLIPMDSMGTFCFAFAPADSNLSIMGNIQQQGIRVSFDSANSLVGFA 345
Query: 368 PTDC 371
C
Sbjct: 346 IDQC 349
>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
Length = 448
Score = 124 bits (312), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 69/159 (43%), Positives = 96/159 (60%), Gaps = 6/159 (3%)
Query: 79 ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
A++ + + YL+ ++IGTPP A+ DTGSDLIWTQC PC C Q +P F P
Sbjct: 80 AARILVAASQGEYLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPC--VLCADQPTPYFRPA 137
Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
S+TY+ +PC S CA+L +C + C Y YGD + + G LA+ET T G+ V
Sbjct: 138 RSATYRLVPCRSPLCAALPYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKV 197
Query: 198 ALPGITFGCG-TNNGGLFNSKTTGIVGLGGGDISLISQM 235
+ + FGCG N+G L NS +G+VGLG G +SL+SQ+
Sbjct: 198 MVSDVAFGCGNINSGQLANS--SGMVGLGRGPLSLVSQL 234
>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
Length = 444
Score = 124 bits (312), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 70/157 (44%), Positives = 92/157 (58%), Gaps = 4/157 (2%)
Query: 79 ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
A++ ++ ++ YL+ + IGTP A+ DTGSDLIWTQC PC C Q +P FDP
Sbjct: 80 AARILVLASDGEYLMEMGIGTPARFYSAILDTGSDLIWTQCAPC--LLCVDQPTPYFDPA 137
Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
SSTY+SL CS+ C +L C C Y YGD + + G LA ET T G T V
Sbjct: 138 NSSTYRSLGCSAPACNALYYPLCYQKTCVYQYFYGDSASTAGVLANETFTFG-TNDTRVT 196
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
LP I+FGCG N G + +G+VG G G +SL+SQ+
Sbjct: 197 LPRISFGCGNLNAGSL-ANGSGMVGFGRGSLSLVSQL 232
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 37/127 (29%), Positives = 64/127 (50%), Gaps = 9/127 (7%)
Query: 253 VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ----VPE 305
+IDSGTT+T+L + Y + + + + P+ D T + L+ C+ + + +P+
Sbjct: 316 IIDSGTTITYLAEPAYYAVREAFVLYLNSTLPLLDVTETSVLDTCFQWPPPPRQSVTLPQ 375
Query: 306 VTIHFRGADVKLSRSNF-FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 364
+ +HF GAD +L N+ V S +C + ++ I G+ NF V YD+E +
Sbjct: 376 LVLHFDGADWELPLQNYMLVDPSTGGLC-LAMATSSDGSIIGSYQHQNFNVLYDLENSLL 434
Query: 365 SFKPTDC 371
SF P C
Sbjct: 435 SFVPAPC 441
>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 124 bits (312), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 58/127 (45%), Positives = 81/127 (63%), Gaps = 7/127 (5%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y +RI +G+PP + V D+GSD++W QC+PC +QCY Q PLFDP S+++ +
Sbjct: 40 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCKPC--TQCYHQTDPLFDPADSASFMGVS 97
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
CSS+ C ++ C+ C+Y VSYGDGS + G LA ET+TLG T Q VA+ GCG
Sbjct: 98 CSSAVCDQVDNAGCNSGRCRYEVSYGDGSSTKGTLALETLTLGRTVVQNVAI-----GCG 152
Query: 208 TNNGGLF 214
N G+F
Sbjct: 153 HMNQGMF 159
Score = 43.5 bits (101), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 45/159 (28%), Positives = 66/159 (41%), Gaps = 14/159 (8%)
Query: 220 GIVGLGGGDISL-ISQ--MRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMS 276
G+ GLG GD+ + IS+ T GN +V+D+GT +T P
Sbjct: 231 GLSGLGVGDMKVPISEDIFELTELGNG-------GVVMDTGTAVTRFPTVAYEAFRDAFI 283
Query: 277 SMIEAQPVADPTGSLELCYS-FNSLS-QVPEVTIHFRGADV-KLSRSNFFVKVSE-DIVC 332
P A + CY+ F LS +VP V+ +F G + L +NF + V + C
Sbjct: 284 DQTGNLPRASGVSIFDTCYNLFGFLSVRVPTVSFYFSGGPILTLPANNFLIPVDDAGTFC 343
Query: 333 SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
F + + I GNI Q + D + V F P C
Sbjct: 344 FAFAPSPSGLSILGNIQQEGIQISVDGANEFVGFGPNVC 382
>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 447
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 107/423 (25%), Positives = 170/423 (40%), Gaps = 83/423 (19%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRL--NHFNQNSSISSSKASQADII 85
G +L H DS + + +E + + + R+ +L + +++ AS + ++
Sbjct: 30 GLRADLTHIDSGRG--FTRNELLRRMVLRSRARAAKQLCPSRSGTPVRVTAPVASGSHVV 87
Query: 86 PNNANYLIRISIGTPPTERLAV-ADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
YLI IGTP +++A+ DTGSD++WTQC PC C+ Q P FD S T
Sbjct: 88 -GYTEYLIHFGIGTPRPQQVALEVDTGSDVVWTQCRPC--FDCFTQPLPRFDTSASDTVH 144
Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
+ C+ C +L +C C Y V+YGD S + G LA ++ T G V +P + F
Sbjct: 145 GVLCTDPICRALRPHACFLGGCTYQVNYGDNSVTIGQLAKDSFTFDGKGGGKVTVPDLVF 204
Query: 205 GCGTNNGG----------------------------------LFNSKTTGIVGLGGGDIS 230
GCG N G +F SK+T + LGG
Sbjct: 205 GCGQYNTGNFHSNETGIAGFGRGPLSLPRQLGVSSFSYCFTTIFESKSTPVF-LGGAPAD 263
Query: 231 LISQ----------------------MRTTIAGNQRLGVSTPDIV----------IDSGT 258
+ ++ G RL V V IDSGT
Sbjct: 264 GLRAHATGPILSTPFLPNHPEYYYLSLKGITVGKTRLAVPESAFVVKADGSGGTIIDSGT 323
Query: 259 TLTFLPQGYNSNLLSVMSSMIEAQPVA-DPTGSLEL-CYSFNSLSQ-----VPEVTIHFR 311
+T P+ +L + + + + TG L C+S S+ VP++T+H
Sbjct: 324 AITAFPRAVFRSLWEAFVAQVPLPHTSYNDTGEPTLQCFSTESVPDASKVPVPKMTLHLE 383
Query: 312 GADVKLSRSNFFVKVSE-DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
GAD +L R N+ + + D +C V + + GN Q N + +D+ + +P
Sbjct: 384 GADWELPRENYMAEYPDSDQLCVVVLAGDDDRTMIGNFQQQNMHIVHDLAGNKLVIEPAQ 443
Query: 371 CTK 373
C K
Sbjct: 444 CDK 446
>gi|224142001|ref|XP_002324349.1| predicted protein [Populus trichocarpa]
gi|222865783|gb|EEF02914.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 118/421 (28%), Positives = 180/421 (42%), Gaps = 83/421 (19%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSS------KASQAD 83
S++++H+ P S + + L + +R+ + S S + K + +
Sbjct: 75 SLKVVHKHGPCSKLSQDEASAAPTHTEILLQDQSRVKSIHSRLSNSKTSGGKDVKVTDST 134
Query: 84 IIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
IP + NY++ + +GTP + + DTGSD+ WTQC+PC S CY Q +FD
Sbjct: 135 TIPAKDGSTVGSGNYIVTVGLGTPKKDLSLIFDTGSDITWTQCQPCARS-CYKQKEQIFD 193
Query: 137 PKMSSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGS 191
P S++Y ++ CSSS C SL N C+ C Y + YGD SFS G TE +TL S
Sbjct: 194 PSQSTSYTNISCSSSICNSLTSATGNTPGCASSACVYGIQYGDSSFSVGFFGTEKLTLTS 253
Query: 192 TTGQAVALPGITFGCGTNN----------------------------GGLFN------SK 217
T A I FGCG NN +F+ S
Sbjct: 254 TD----AFNNIYFGCGQNNQGLFGGSAGLLGLGRDKLSVVSQTAQKYNKIFSYCLPSSSS 309
Query: 218 TTGIVGLGG--GDISLISQMRTTIAG--------------NQRLGVS-----TPDIVIDS 256
+TG + GG + + + T AG ++L +S T +IDS
Sbjct: 310 STGFLTFGGSASKNAKFTPLSTISAGPSFYGLDFTGISVGGKKLAISASVFSTAGAIIDS 369
Query: 257 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF-RGA 313
GT +T LP S L + +++ P+ L+ CY F+S + VP++ F G
Sbjct: 370 GTVITRLPPAAYSALRASFRNLMSKYPMTKALSILDTCYDFSSYTTISVPKIGFSFSSGI 429
Query: 314 DVKLSRSNFFVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+V + + S VC F G +++ V I+GN+ Q V YD V F P C
Sbjct: 430 EVDIDATGILYASSLSQVCLAFAGNSDATDVFIFGNVQQKTLEVFYDGSAGKVGFAPGGC 489
Query: 372 T 372
+
Sbjct: 490 S 490
>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 124 bits (311), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 71/207 (34%), Positives = 102/207 (49%), Gaps = 31/207 (14%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSKASQADIIP- 86
FS++L RDS +N+ Y+ L L+R +R+ + S+ ++D+ P
Sbjct: 76 FSLQLHPRDS----LHNAGHKDYKSLVLSRLSRDSSRVKSIYDRLEFALSELKRSDLEPL 131
Query: 87 -------------------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
+ Y R+ +G P V DTGSD+ W QC+PC + C
Sbjct: 132 KTEILPEDLSTPIISGTSQGSGEYFSRVGVGQPAKPFYMVLDTGSDINWLQCQPC--TDC 189
Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETV 187
Y Q P+FDP+ SS++ SLPC S QC +L C C Y VSYGDGSF+ G TET+
Sbjct: 190 YQQTDPIFDPRSSSSFASLPCESQQCQALETSGCRASKCLYQVSYGDGSFTVGEFVTETL 249
Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLF 214
T G++ + + GCG +N GLF
Sbjct: 250 TFGNSG----MINDVAVGCGHDNEGLF 272
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 41/128 (32%), Positives = 59/128 (46%), Gaps = 12/128 (9%)
Query: 252 IVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSLEL---CYSFNSLSQV--PE 305
I++DSGT +T L Q YN+ + + + P T L CY +S S+V P
Sbjct: 368 IIVDSGTAITRLQTQAYNT----LRDAFVSRTPYLKKTNGFALFDTCYDLSSQSRVTIPT 423
Query: 306 VTIHFRGAD-VKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 363
V+ F G ++L N+ + V S C F T+S+ I GN+ Q V YD+
Sbjct: 424 VSFEFAGGKSLQLPPKNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVHYDLANSV 483
Query: 364 VSFKPTDC 371
V F P C
Sbjct: 484 VGFSPHKC 491
>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 486
Score = 124 bits (310), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 61/127 (48%), Positives = 77/127 (60%), Gaps = 7/127 (5%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y R+ IG PP+ V DTGSD+ W QC PC ++CY Q P F+P S+++ SL
Sbjct: 148 SGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPC--AECYEQTDPXFEPTSSASFTSLS 205
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
C + QC SL+ C C Y VSYGDGS++ G+ TETVTLGST +L I GCG
Sbjct: 206 CETEQCKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETVTLGST-----SLGNIAIGCG 260
Query: 208 TNNGGLF 214
NN GLF
Sbjct: 261 HNNEGLF 267
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 54/124 (43%), Gaps = 4/124 (3%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
I++DSGT +T L + L A + CY +S S +VP V+ H
Sbjct: 363 IIVDSGTAVTRLQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYDLSSKSRVEVPTVSFH 422
Query: 310 F-RGADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
F G ++ L N+ + V SE C F +++ I GN Q VG+D+ V F
Sbjct: 423 FANGNELPLPAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVGFDLANSLVGFS 482
Query: 368 PTDC 371
P C
Sbjct: 483 PNKC 486
>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 445
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 105/402 (26%), Positives = 162/402 (40%), Gaps = 73/402 (18%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN 90
V L+HR P +P S T + D RS R ++ + +S ++ +
Sbjct: 56 VPLVHRHGPCAP-APSLSTDTRSFADIFRRSRARPSYIVRGKKVSVPAHLGTSVM--SLE 112
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y++R+S GTP ++ V DTGSD+ W QC+PC QC+ Q PL+DP SSTY ++PC+S
Sbjct: 113 YVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPSHSSTYSAVPCAS 172
Query: 151 SQCASLNQKS-----CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
C L + SG C +++SY DG+ + G + + +TL + FG
Sbjct: 173 DVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDKLTL----APGAIVQNFYFG 228
Query: 206 CGTNN------------------------GGLFN------SKTTGIVGLGGGD------- 228
CG GG+F+ S G + LG G
Sbjct: 229 CGHGKHAVRGLFDGVLGLGRLRESLGARYGGVFSYCLPSVSSKPGFLALGAGKNPSGFVF 288
Query: 229 -----------ISLISQMRTTIAGNQ---RLGVSTPDIVIDSGTTLTFLPQGYNSNLLSV 274
S ++ + G + R + +++DSGT +T L L S
Sbjct: 289 TPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFSGGMIVDSGTVITGLQSTAYRALRSA 348
Query: 275 MSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR-GADVKLSRSNFFVKVSEDIV 331
+EA + P G L+ CY+ VP++ + F GA + L N +
Sbjct: 349 FRKAMEAYRLL-PNGDLDTCYNLTGYKNVVVPKIALTFTGGATINLDVPNGILVNG---- 403
Query: 332 CSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
C F G S + GN+ Q F V +D F+ C
Sbjct: 404 CLAFAESGPDGSAGVLGNVNQRAFEVLFDTSTSKFGFRAKAC 445
>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
gi|194696366|gb|ACF82267.1| unknown [Zea mays]
gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 411
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 108/423 (25%), Positives = 168/423 (39%), Gaps = 73/423 (17%)
Query: 10 ILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFN 69
++ + YV E V L+HR P +P S T + D RS R ++
Sbjct: 1 MILHIYIYVSVKPEQNGSTVYVPLVHRHGPCAP-APSLSTDTRSFADIFRRSRARPSYIV 59
Query: 70 QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM 129
+ +S ++ + Y++R+S GTP ++ V DTGSD+ W QC+PC QC+
Sbjct: 60 RGKKVSVPAHLGTSVM--SLEYVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFP 117
Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKS-----CSGVNCQYSVSYGDGSFSNGNLAT 184
Q PL+DP SSTY ++PC+S C L + SG C +++SY DG+ + G +
Sbjct: 118 QKDPLYDPSHSSTYSAVPCASDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQ 177
Query: 185 ETVTLGSTTGQAVALPGITFGCGTNN------------------------GGLFN----- 215
+ +TL + FGCG GG+F+
Sbjct: 178 DKLTL----APGAIVQNFYFGCGHGKHAVRGLFDGVLGLGRLRESLGARYGGVFSYCLPS 233
Query: 216 -SKTTGIVGLGGGD------------------ISLISQMRTTIAGNQ---RLGVSTPDIV 253
S G + LG G S ++ + G + R + ++
Sbjct: 234 VSSKPGFLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFSGGMI 293
Query: 254 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR 311
+DSGT +T L L S +EA + P G L+ CY+ VP++ + F
Sbjct: 294 VDSGTVITGLQSTAYRALRSAFRKAMEAYRLL-PNGDLDTCYNLTGYKNVVVPKIALTFT 352
Query: 312 -GADVKLSRSNFFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
GA + L N + C F G S + GN+ Q F V +D F+
Sbjct: 353 GGATINLDVPNGILVNG----CLAFAESGPDGSAGVLGNVNQRAFEVLFDTSTSKFGFRA 408
Query: 369 TDC 371
C
Sbjct: 409 KAC 411
>gi|218200805|gb|EEC83232.1| hypothetical protein OsI_28526 [Oryza sativa Indica Group]
Length = 450
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 102/331 (30%), Positives = 142/331 (42%), Gaps = 72/331 (21%)
Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-ASLNQK-----S 160
+ DTGSDL W QC+PC S CY Q PLFDP S++Y ++PC++S C ASL S
Sbjct: 125 IVDTGSDLTWVQCKPC--SVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGS 182
Query: 161 CSGV----------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN 210
C+ V C YS++YGDGSFS G LAT+TV LG + + G FGCG +N
Sbjct: 183 CATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGAS-----VDGFVFGCGLSN 237
Query: 211 GGLFNSKTT---------GIVGLGGGDISL---ISQMRTTIAGNQRLGVSTP-------- 250
GL + G G G +SL S R + ++ P
Sbjct: 238 RGLRRPGSAASSPTASPPGTSGDAAGSLSLGGDTSSYRNATPVSYTRMIADPAQPPFYFM 297
Query: 251 --------------------DIVIDSGTTLTFL-PQGYNSNLLSVMSSM-IEAQPVADPT 288
++++DSGT +T L P Y + E P A P
Sbjct: 298 NVTGASVGGAAVAAAGLGAANVLLDSGTVITRLAPSVYRAVRAEFARQFGAERYPAAPPF 357
Query: 289 GSLELCYSFNSLSQ--VPEVTIHFR-GADVKLSRSNFFVKVSED--IVCSVFKGIT--NS 341
L+ CY+ + VP +T+ GAD+ + + +D VC ++ +
Sbjct: 358 SLLDACYNLTGHDEVKVPLLTLRLEAGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQ 417
Query: 342 VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
PI GN Q N V YD + F DC+
Sbjct: 418 TPIIGNYQQKNKRVVYDTVGSRLGFADEDCS 448
>gi|242069057|ref|XP_002449805.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
gi|241935648|gb|EES08793.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
Length = 430
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 71/174 (40%), Positives = 102/174 (58%), Gaps = 22/174 (12%)
Query: 65 LNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPP 124
L H++ S+ SS A + A YL+ ++IGTPP +A+ADTGSDL WTQC+PC
Sbjct: 59 LLHYSTLST--SSDPGPARLRSGQAEYLMELAIGTPPVPFIALADTGSDLTWTQCKPC-- 114
Query: 125 SQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNL 182
C+ QD+P++D SS++ LPCSS+ C + CS C+Y +Y DG++
Sbjct: 115 KLCFGQDTPIYDTTTSSSFSPLPCSSATCLPIWSSRCSTPSATCRYRYAYDDGAY----- 169
Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGL-FNSKTTGIVGLGGGDISLISQM 235
S +++ GI FGCG +NGGL +NS TG VGLG G +SL++Q+
Sbjct: 170 --------SPECAGISVGGIAFGCGVDNGGLSYNS--TGTVGLGRGSLSLVAQL 213
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 37/131 (28%), Positives = 63/131 (48%), Gaps = 11/131 (8%)
Query: 252 IVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCY-----SFNSLSQVPE 305
+++DSGT T L + G+ + V + QPV + + C+ L +P+
Sbjct: 301 MIVDSGTIFTILVETGFRVVVDHVAG--VLGQPVVNASSLDRPCFPAPAAGVQELPDMPD 358
Query: 306 VTIHFRG-ADVKLSRSNF--FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 362
+ +HF G AD++L R N+ F + ++ + S + GN Q N + +DI
Sbjct: 359 MVLHFAGGADMRLHRDNYMSFNEEESSFCLNIVGTESASGSVLGNFQQQNIQMLFDITVG 418
Query: 363 TVSFKPTDCTK 373
+SF PTDC+K
Sbjct: 419 QLSFMPTDCSK 429
>gi|297745479|emb|CBI40559.3| unnamed protein product [Vitis vinifera]
Length = 436
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 86/240 (35%), Positives = 119/240 (49%), Gaps = 39/240 (16%)
Query: 70 QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM 129
Q SSS S + + Y R+ +GTPP V DTGSD++W QC PC +CY
Sbjct: 155 QGGGFSSSVTS--GLAQGSGEYFTRLGVGTPPKYVYMVLDTGSDVVWIQCAPC--RKCYS 210
Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVT 188
Q P+FDPK S ++ S+ C S C L+ C S +C Y V+YGDGSF+ G +TET+T
Sbjct: 211 QTDPVFDPKKSGSFSSISCRSPLCLRLDSPGCNSRQSCLYQVAYGDGSFTFGEFSTETLT 270
Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL-----------GGGDISLI--SQM 235
+ +P + GCG +N GLF G++GL GG ++ I S
Sbjct: 271 F-----RGTRVPKVALGCGHDNEGLFVG-AAGLLGLGRQPRLNRPPVGGARVAGITASLF 324
Query: 236 RTTIAGNQRLGVSTPDIVIDSGTTLTFLPQ---GYNSNLLSVMSSMIEAQ-----PVADP 287
+ AGN ++IDSGT++T L + G +SN S +SS P A P
Sbjct: 325 KLDTAGNG-------GVIIDSGTSVTRLTRRAYGTSSNKGSGLSSTWRLHGSGSLPAAAP 377
>gi|413922067|gb|AFW61999.1| hypothetical protein ZEAMMB73_694403, partial [Zea mays]
Length = 328
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 71/167 (42%), Positives = 93/167 (55%), Gaps = 23/167 (13%)
Query: 90 NYLIRISIG----TPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKS 145
NY+ IS+G +P + DTGSDL W QC+PC S CY Q PLFDP S+TY +
Sbjct: 91 NYVTTISLGGSSGSPAANLTVIVDTGSDLTWVQCKPC--SACYAQRDPLFDPAGSATYAA 148
Query: 146 LPCSSSQCA-----------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
+ C++S CA S C Y+++YGDGSFS G LAT+TV LG
Sbjct: 149 VRCNASACADSLRAATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVALG---- 204
Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
+L G FGCG +N GLF T G++GLG ++SL+SQ + G
Sbjct: 205 -GASLGGFVFGCGLSNRGLFGG-TAGLMGLGRTELSLVSQTASRYGG 249
>gi|414587000|tpg|DAA37571.1| TPA: hypothetical protein ZEAMMB73_036171 [Zea mays]
Length = 459
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 68/195 (34%), Positives = 107/195 (54%), Gaps = 15/195 (7%)
Query: 49 TPYQRLRDALTRSLNRLNHFNQNSSISSSKA-----SQADIIPNNANYLIRISIGTPPTE 103
T + +R A+ RSL+R ++ ++ +A S+A ++P YL+++ GTP
Sbjct: 45 TDQELIRRAVQRSLDRPGIVARSGGGAADEAGKAVASEAPLVPGGGEYLVKLGTGTPQHF 104
Query: 104 RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSG 163
A DT SDL+W QC+PC CY Q P+F+PK+SS+Y +PC+S CA L+ C
Sbjct: 105 FSAAIDTASDLVWMQCQPC--VSCYRQLDPVFNPKLSSSYAVVPCTSDTCAQLDGHRCHE 162
Query: 164 VN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTG 220
+ CQY+ Y + G LA + + +G AV FGC ++ G ++ +G
Sbjct: 163 DDDGACQYTYKYSGHGVTKGTLAIDKLAIGGDVFHAV-----VFGCSDSSVGGPAAQASG 217
Query: 221 IVGLGGGDISLISQM 235
+VGLG G +SL+SQ+
Sbjct: 218 LVGLGRGPLSLVSQL 232
>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
Length = 469
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 118/420 (28%), Positives = 174/420 (41%), Gaps = 85/420 (20%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS-----ISSSKASQADI 84
S+ L++R P +P +++ T + L R R NH + +S + S +
Sbjct: 57 SMPLMYRHGPCAP-ASAAATNRPSPAEMLRRDRARRNHILRKASGRRITLGVSIPTSLGA 115
Query: 85 IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
++ Y++ + GTP ++ + DTGSDL W QC+PC S CY Q P+FDP SSTY
Sbjct: 116 FVDSLQYVVTLGFGTPAVPQVLLIDTGSDLSWVQCQPCNSSTCYPQKDPVFDPSASSTYA 175
Query: 145 SLPCSSSQCASLN--------QKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
+PC S C L+ S SG + CQY + YG+G + G +TET+TL +
Sbjct: 176 PVPCGSEACRDLDPDSYANGCTNSSSGASLCQYGIQYGNGDTTVGVYSTETLTL--SPEA 233
Query: 196 AVALPGITFGCGT----------------------------NNGGLFN------SKTTGI 221
A + +FGCG GG F+ + T G
Sbjct: 234 ATVVNNFSFGCGLVQKGVFDLFDGLLGLGGAPESLVSQTTGTYGGAFSYCLPAGNSTAGF 293
Query: 222 VGLG----GGD----------------ISLISQMRTTIAGNQ---RLGVSTPDIVIDSGT 258
+ LG GG+ L+ ++ G Q V ++IDSGT
Sbjct: 294 LALGAPATGGNNTAGFQFTPLQVVETTFYLVKLTGISVGGKQLDIEPTVFAGGMIIDSGT 353
Query: 259 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTG--SLELCYSF--NSLSQVPEVTIHFRGA- 313
+T LP+ S L + S + A P+ P L+ CY F N+ VP V + F G
Sbjct: 354 IVTGLPETAYSALRTAFRSAMSAYPLLPPNDDEDLDTCYDFTGNTNVTVPTVALTFEGGV 413
Query: 314 --DVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
D+ + S + V G T I GN+ Q F V YD + V F+ C
Sbjct: 414 TIDLDVP-SGVLLDGCLAFVAGASDGDTG---IIGNVNQRTFEVLYDSARGHVGFRAGAC 469
>gi|357127505|ref|XP_003565420.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 466
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 127/440 (28%), Positives = 183/440 (41%), Gaps = 108/440 (24%)
Query: 27 GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP 86
GGFSVELIHRDS KSPF++ T + R A R S +SS D+
Sbjct: 25 GGFSVELIHRDSIKSPFHDPKLTRHDRFL-AAARRSRARAAALLASDVSS------DLFY 77
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM----------------- 129
+ YL +++GTPP LAVADTGSDL+W +C + +
Sbjct: 78 GDFEYLAAVNVGTPPVRFLAVADTGSDLVWLKCNTTQNNNGIVSSDSGNNSNSSPPPPPP 137
Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASL-NQKSCSGVN--CQYSVSYGDGSFSNGNLATET 186
+ F+P SS+Y + C C +L SC+G + C + SY DG+ + G LA +T
Sbjct: 138 EAVVYFNPFDSSSYSRVGCDGPSCLALATNASCNGDSHACDFRYSYRDGASATGLLAADT 197
Query: 187 VTL-GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM---------- 235
T G+ + I FGC T G + G+VGLG G +SL SQ+
Sbjct: 198 FTFGGNINNDTTSTASIDFGCATGTAGR-EFQADGMVGLGAGPLSLASQLGRKFSFCLTA 256
Query: 236 ------RTTIAGNQRLGVSTPD------------------IVIDS--------------- 256
+ + R VS P I IDS
Sbjct: 257 YDIDDASSILNFGARAVVSDPGAATTPLIASSSNAAAYYAISIDSLKVAGQPVPGTTSVS 316
Query: 257 ------GTTLTFLPQG-----YNSNLLSVM--SSMIEAQPVADPTGSLELCYSFNSLSQV 303
GT LTFL + +L VM + + A P P +LELCY + + V
Sbjct: 317 KVIVDTGTVLTFLDRAALLAPLTESLARVMDGAGLPRAPP---PDETLELCYDVSRVKDV 373
Query: 304 ----PEVTIHF---RGADVKLSRSNFFVKVSEDIVCSVFKGITNS-----VPIYGNIMQT 351
P+VT+ G +V+L+ FV V E ++C +T S + + GN+
Sbjct: 374 DGVIPDVTLVLGGGGGGEVRLTGEGTFVLVKEGVLC--LAVVTTSPELQPLSVLGNVALQ 431
Query: 352 NFLVGYDIEQQTVSFKPTDC 371
+ VG D++ +T +F +C
Sbjct: 432 DLHVGIDLDARTATFATANC 451
>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 531
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 86/229 (37%), Positives = 118/229 (51%), Gaps = 29/229 (12%)
Query: 27 GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI-- 84
G +V L HR P SP + + P L + L R R + + S + D+
Sbjct: 126 GAATVPLHHRHGPCSPL-PTKKMP--TLEETLHRDQLRAAYIQRK--FSGGGGAGGDVQR 180
Query: 85 ----IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP 133
+P N YLI + +G+P T + + DTGSD+ W QC+PC SQC+ Q P
Sbjct: 181 SDATVPTALGTSLNTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPC--SQCHSQADP 238
Query: 134 LFDPKMSSTYKSLPCSSSQCASLNQKS---CSGVNCQYSVSYGDGSFSNGNLATETVTLG 190
LFDP SSTY C S+ CA L Q+ S CQY V+YGDGS + G +++T+ LG
Sbjct: 239 LFDPSSSSTYSPFSCGSADCAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALG 298
Query: 191 STTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
S+ A+ FGC G FN +T G++GLGGG SL+SQ T+
Sbjct: 299 SS-----AVRSFQFGCSNVESG-FNDQTDGLMGLGGGAQSLVSQTAGTL 341
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 43/124 (34%), Positives = 61/124 (49%), Gaps = 10/124 (8%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHF 310
V+DSGT +T LP S L S + ++ P A P+G L+ C+ F+ S V P V + F
Sbjct: 413 VMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVF 472
Query: 311 R-GADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFK 367
GA V L S + C F G ++ S+ I GN+ Q F V YD+ + V F+
Sbjct: 473 SGGAVVSLDASGIILS-----NCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFR 527
Query: 368 PTDC 371
C
Sbjct: 528 AGAC 531
>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
Length = 462
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 105/353 (29%), Positives = 147/353 (41%), Gaps = 80/353 (22%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
+++ + GTP + DTGSD+ W QC PC CY Q P+FDP S+TY ++PC
Sbjct: 119 EFVVTVGFGTPAQTYTLMFDTGSDVSWIQCLPCS-GHCYKQHDPIFDPTKSATYSAVPCG 177
Query: 150 SSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
QCA+ K S C Y V YGDGS + G L+ ET++L S A ALPG FGCG
Sbjct: 178 HPQCAAAGGKCSSNGTCLYKVQYGDGSSTAGVLSHETLSLTS----ARALPGFAFGCGET 233
Query: 210 NGGLFNSKTTGIVGLGGGDISLISQMRT--------------TIAGNQRLGVSTP----- 250
N G F G++GLG G +SL SQ T G +G +TP
Sbjct: 234 NLGDFG-DVDGLIGLGRGQLSLSSQAAASFGAAFSYCLPSYNTSHGYLTIGTTTPASGSD 292
Query: 251 -------------------DIV----------------------IDSGTTLTFL-PQGYN 268
D+V +DSGT LT+L P+ Y
Sbjct: 293 GVRYTAMIQKQDYPSFYFVDLVSIVVGGFVLPVPPILFTRDGTLLDSGTVLTYLPPEAYT 352
Query: 269 S--NLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGAD-VKLSRSNFFVK 325
+ + + + P DP + CY F + + + F+ +D S F V
Sbjct: 353 ALRDRFKFTMTQYKPAPAYDP---FDTCYDFAGQNAIFMPLVSFKFSDGSSFDLSPFGVL 409
Query: 326 VSEDIV-----CSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+ D C F +++P I GN Q N + YD+ + + F C
Sbjct: 410 IFPDDTAPATGCLAFVPRPSTMPFTIVGNTQQRNTEMIYDVAAEKIGFVSGSC 462
>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 455
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 89/245 (36%), Positives = 128/245 (52%), Gaps = 30/245 (12%)
Query: 10 ILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFN 69
+L L +++ A + IH D P +SE +R AL R ++R F
Sbjct: 6 VLLILACTILASDAAAAVRVGLTRIHAD----PEVTASEF----VRGALRRDMHRHARFA 57
Query: 70 QNSSISSSKAS---------QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
+ SS A+ Q D+ N Y++ +SIGTPP A+ADTGSDLIWTQC
Sbjct: 58 REQLAPSSAAAAGLTVGAPTQKDLR-NGGEYIMTLSIGTPPLSYRAIADTGSDLIWTQCA 116
Query: 121 PCPPS------QCYMQDSPLFDPKMSSTYKSLPCSS--SQCASLNQKS-CSGVNCQYSVS 171
PC + QC+ Q L++P S+T+ LPC+S S CA++ S G C Y+ +
Sbjct: 117 PCGDTVTDTDNQCFKQSGCLYNPSSSTTFGVLPCNSPLSMCAAMAGPSPPPGCACMYNQT 176
Query: 172 YGDGSFSNGNLATETVTLG-STTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDIS 230
YG G ++ G + ET T G S+T AV +P I FGC + +N + G+VGLG G +S
Sbjct: 177 YGTG-WTAGVQSVETFTFGSSSTPPAVRVPNIAFGCSNASSNDWNG-SAGLVGLGRGSMS 234
Query: 231 LISQM 235
L+SQ+
Sbjct: 235 LVSQL 239
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 41/133 (30%), Positives = 71/133 (53%), Gaps = 10/133 (7%)
Query: 249 TPDIVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVA---DPTGSLELCYSFNSLS--- 301
T ++IDSGTT+T L Y +V S ++ P+A D + L+LC++ + +
Sbjct: 322 TGGLIIDSGTTITTLVDSAYQQVRAAVRSLLVTRLPLAHGPDHSTGLDLCFALKASTPPP 381
Query: 302 QVPEVTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGIT-NSVPIYGNIMQTNFLVGYDI 359
+P +T+HF G AD+ L N+ + + + C + T ++ + GN Q N V YD+
Sbjct: 382 AMPSMTLHFEGGADMVLPVENYMI-LGSGVWCLAMRNQTVGAMSMVGNYQQQNIHVLYDV 440
Query: 360 EQQTVSFKPTDCT 372
++T+SF P C+
Sbjct: 441 RKETLSFAPAVCS 453
>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 461
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 86/230 (37%), Positives = 118/230 (51%), Gaps = 29/230 (12%)
Query: 27 GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI-- 84
G +V L HR P SP + + P L + L R R + + S + D+
Sbjct: 56 GAATVPLHHRHGPCSPL-PTKKMP--TLEETLHRDQLRAAYIQRK--FSGGGGAGGDVQR 110
Query: 85 ----IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP 133
+P N YLI + +G+P T + + DTGSD+ W QC+PC SQC+ Q P
Sbjct: 111 SDATVPTALGTSLNTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPC--SQCHSQADP 168
Query: 134 LFDPKMSSTYKSLPCSSSQCASLNQKS---CSGVNCQYSVSYGDGSFSNGNLATETVTLG 190
LFDP SSTY C S+ CA L Q+ S CQY V+YGDGS + G +++T+ LG
Sbjct: 169 LFDPSSSSTYSPFSCGSAACAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALG 228
Query: 191 STTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
S+ A+ FGC G FN +T G++GLGGG SL+SQ T+
Sbjct: 229 SS-----AVKSFQFGCSNVESG-FNDQTDGLMGLGGGAQSLVSQTAGTLG 272
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 42/124 (33%), Positives = 60/124 (48%), Gaps = 10/124 (8%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHF 310
V+DSGT +T LP S L S + ++ P A P+G L+ C+ F+ S V P V + F
Sbjct: 343 VMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVF 402
Query: 311 R-GADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFK 367
GA V L S + C F ++ S+ I GN+ Q F V YD+ + V F+
Sbjct: 403 SGGAVVSLDASGIILS-----NCLAFAANSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFR 457
Query: 368 PTDC 371
C
Sbjct: 458 AGAC 461
>gi|297811181|ref|XP_002873474.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
lyrata]
gi|297319311|gb|EFH49733.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
lyrata]
Length = 293
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 80/190 (42%), Positives = 105/190 (55%), Gaps = 12/190 (6%)
Query: 53 RLRDALTRSLNRLNHFNQNSSISSSKASQA----DIIPNNANYLIRISIGTPPTERLAVA 108
R +A S++ N +S +K+++ II + NY++ I IGTP + +
Sbjct: 92 RRDEARVESIHSKLSKNIADEVSKAKSTKLPAKNGIILGSPNYIVTIGIGTPKHDISLMF 151
Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQY 168
DTGSDL WTQCEPC S CY Q P F+P SS+Y ++ CSS C N +SCS NC Y
Sbjct: 152 DTGSDLTWTQCEPCLGS-CYSQKEPKFNPSSSSSYHNVSCSSPMCG--NPESCSASNCLY 208
Query: 169 SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGD 228
+ YGDGS + G LA E TL ++ L I FGCG NN G+F + GI+GLG G
Sbjct: 209 GIGYGDGSVTVGFLAKEKFTLTNSD----VLDDIYFGCGENNKGVFIG-SAGILGLGPGK 263
Query: 229 ISLISQMRTT 238
S Q TT
Sbjct: 264 FSFPLQTTTT 273
>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
Length = 480
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 123/435 (28%), Positives = 186/435 (42%), Gaps = 98/435 (22%)
Query: 23 EAQTGGFSVELIHRD--SPKSPFYNSSETPYQRLRDALTRSL-NRL-------NHFNQNS 72
+ G +E+ R S + +N D RS+ NR+ N Q+S
Sbjct: 57 RKEKGAIVLEMKDRGYCSERKINWNRKLQKQLIFDDLRVRSMQNRIRAKVSGHNSSEQSS 116
Query: 73 SISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
I AS ++ NY++ I +G + DTGSDL W QC+PC CY Q
Sbjct: 117 EIQIPLASGINL--ETLNYIVTIGLGNQ--NMTVIIDTGSDLTWVQCDPCMS--CYSQQG 170
Query: 133 PLFDPKMSSTYKSLPCSSSQCASL-----NQKSCSGVN---CQYSVSYGDGSFSNGNLAT 184
P+F+P SS+Y SL C+SS C +L N ++C N C ++VSYGDGSF++G L
Sbjct: 171 PVFNPSNSSSYNSLLCNSSTCQNLQFTTGNTEACESNNPSSCNHTVSYGDGSFTDGELGV 230
Query: 185 ETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR-------- 236
E ++ G +++ FGCG NN GLF +GI+GLG ++S+ISQ
Sbjct: 231 EHLSFG-----GISVSNFVFGCGRNNKGLFGG-VSGIMGLGRSNLSMISQTNTTFGGVFS 284
Query: 237 -------------------------------TTIAGNQRL--------------GVSTPD 251
T++ N +L GV+ D
Sbjct: 285 YCLPTTDSGASGSLVIGNESSLFKNLTPIAYTSMVSNPQLSNFYVLNLTGIDVGGVAIQD 344
Query: 252 -------IVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV 303
I+IDSGT +T L P YN+ L + P+A L+ C++ + +V
Sbjct: 345 TSFGNGGILIDSGTVITRLAPSLYNA-LKAEFLKQFSGYPIAPALSILDTCFNLTGIEEV 403
Query: 304 --PEVTIHFR-GADVKLSRSN-FFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGY 357
P +++HF D+ + ++ VC ++ N + I GN Q N V Y
Sbjct: 404 SIPTLSMHFENNVDLNVDAVGILYMPKDGSQVCLALASLSDENDMAIIGNYQQRNQRVIY 463
Query: 358 DIEQQTVSFKPTDCT 372
D +Q + F DC+
Sbjct: 464 DAKQSKIGFAREDCS 478
>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
Length = 367
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 76/171 (44%), Positives = 96/171 (56%), Gaps = 7/171 (4%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y + I +G+PP + A+ DTGSDL+W QC+PC SQCY Q P++DP SST+ CS+
Sbjct: 4 YTMEIELGSPPKKFNAIVDTGSDLVWIQCKPC--SQCYSQSDPIYDPSASSTFAKTSCST 61
Query: 151 SQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
S C SL CS C Y YGD S + G+ A ET+TL S+ G + A P FGCG
Sbjct: 62 SSCQSLPASGCSSSAKTCIYGYQYGDSSSTQGDFALETLTLRSSGGSSKAFPNFQFGCGR 121
Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTT 259
N G F GIVGLG G ISL +Q+ + I N + D DS T
Sbjct: 122 LNSGSFGG-AAGIVGLGQGKISLSTQLGSAI--NNKFSYCLVDFDDDSSKT 169
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 40/134 (29%), Positives = 65/134 (48%), Gaps = 5/134 (3%)
Query: 243 QRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS- 301
+ L V++ + DSGTTLT L S + S +S + V + +LCY +
Sbjct: 234 RALEVNSGGTIFDSGTTLTLLDDAVYSKVKSAFASSVSLPTVDASSSGFDLCYDVSKSKN 293
Query: 302 -QVPEVTIHFRGADVKLSRSNFFVKV--SEDIVC-SVFKGITNSVPIYGNIMQTNFLVGY 357
+ P +T+ F+G + N+FV V +E + C ++ + + I GN+MQ N+ V Y
Sbjct: 294 FKFPALTLAFKGTKFSPPQKNYFVIVDTAETVACLAMGGSGSLGLGIIGNLMQQNYHVVY 353
Query: 358 DIEQQTVSFKPTDC 371
D T+S P C
Sbjct: 354 DRGTSTISMSPAQC 367
>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 449
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 115/423 (27%), Positives = 176/423 (41%), Gaps = 100/423 (23%)
Query: 32 ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ--NSSISSSKASQADIIPN-- 87
+LIH S P Y +ET R+ + S R + S+ S+ +A + P+
Sbjct: 38 KLIHPGSVHHPHYKPNETAKDRMELDIQHSAARFAYIQARIEGSLVSNNEYKARVSPSLT 97
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL- 146
+ ISIG PP +L V DTGSD++W C PC + C LFDP MSST+ L
Sbjct: 98 GRTIMANISIGQPPIPQLVVMDTGSDILWVMCTPC--TNCDNHLGLLFDPSMSSTFSPLC 155
Query: 147 --PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
PC C+ C + ++V+Y D S ++G +TV +T +P + F
Sbjct: 156 KTPCDFKGCS-----RCDPI--PFTVTYADNSTASGMFGRDTVVFETTDEGTSRIPDVLF 208
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQ------------------------------ 234
GCG N G + GI+GL G SL ++
Sbjct: 209 GCGHNIGQDTDPGHNGILGLNNGPDSLATKIGQKFSYCIGDLADPYYNYHQLILGEGADL 268
Query: 235 -----------------MRTTIAGNQRLGVS----------TPDIVIDSGTTLTFLPQGY 267
M G +RL ++ T ++ID+G+T+TFL
Sbjct: 269 EGYSTPFEVHNGFYYVTMEGISVGEKRLDIAPETFEMKKNRTGGVIIDTGSTITFLVDSV 328
Query: 268 N-------SNLL--SVMSSMIEAQPVADPTGSLELCYSFNSLSQ----VPEVTIHFR-GA 313
+ NLL S + IE P C+ + S+S+ P VT HF GA
Sbjct: 329 HRLLSKEVRNLLGWSFRQTTIEKSPWMQ-------CF-YGSISRDLVGFPVVTFHFADGA 380
Query: 314 DVKLSRSNFFVKVSEDIVCSVFKGIT----NSVP-IYGNIMQTNFLVGYDIEQQTVSFKP 368
D+ L +FF ++++++ C ++ S P + G + Q ++ VGYD+ Q V F+
Sbjct: 381 DLALDSGSFFNQLNDNVFCMTVGPVSSLNLKSKPSLIGLLAQQSYSVGYDLVNQFVYFQR 440
Query: 369 TDC 371
DC
Sbjct: 441 IDC 443
>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 70/207 (33%), Positives = 101/207 (48%), Gaps = 31/207 (14%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSKASQADIIP- 86
FS++L RDS +N+ Y+ L L+R +R+ + S+ ++D+ P
Sbjct: 76 FSLQLHPRDS----LHNAGHKDYKSLVLSRLSRDSSRVKSIYDRLEFALSELKRSDLEPL 131
Query: 87 -------------------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
+ Y R+ +G P V DTGSD+ W QC+PC + C
Sbjct: 132 KTEILPEDLSTPIISGTSQGSGEYFSRVGVGQPAKPFYMVLDTGSDINWLQCQPC--TDC 189
Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETV 187
Y Q P+FDP+ SS++ SLPC S QC +L C C Y VSYGDGSF+ G ET+
Sbjct: 190 YQQTDPIFDPRSSSSFASLPCESQQCQALETSGCRASKCLYQVSYGDGSFTVGEFVIETL 249
Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLF 214
T G++ + + GCG +N GLF
Sbjct: 250 TFGNSG----MINNVAVGCGHDNEGLF 272
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 41/128 (32%), Positives = 59/128 (46%), Gaps = 12/128 (9%)
Query: 252 IVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSLEL---CYSFNSLSQV--PE 305
I++DSGT +T L Q YN+ + + + P T L CY +S S+V P
Sbjct: 368 IIVDSGTAITRLQTQAYNT----LRDAFVSRTPYLKKTNGFALFDTCYDLSSQSRVTIPT 423
Query: 306 VTIHFRGAD-VKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 363
V+ F G ++L N+ + V S C F T+S+ I GN+ Q V YD+
Sbjct: 424 VSFEFAGGKSLQLPPKNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVHYDLANSV 483
Query: 364 VSFKPTDC 371
V F P C
Sbjct: 484 VGFSPHKC 491
>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 491
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 67/196 (34%), Positives = 99/196 (50%), Gaps = 27/196 (13%)
Query: 40 KSPFYNSSETPYQRLRDA-LTRSLNRLNHFNQNSSISSSKASQADIIP------------ 86
++ + SS Y+ L A L R +R+ ++ + +++D+ P
Sbjct: 83 RTSIHKSSHKDYKSLVLARLERDSDRVRSLATRMDLAIAGITKSDLKPVEKELEAEALET 142
Query: 87 --------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
+ Y R+ IG+PP V DTGSD+ W QC PC + CY Q P+F+P
Sbjct: 143 PLVSGASQGSGEYFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPC--ADCYQQADPIFEPS 200
Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
SS+Y L C + QC SL+ C +C Y VSYGDGS++ G+ ATET+TL + +
Sbjct: 201 FSSSYAPLTCETHQCKSLDVSECRNDSCLYEVSYGDGSYTVGDFATETITLDGS----AS 256
Query: 199 LPGITFGCGTNNGGLF 214
L + GCG +N GLF
Sbjct: 257 LNNVAIGCGHDNEGLF 272
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 55/124 (44%), Gaps = 4/124 (3%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
I++DSGT +T L ++L + P + CY +S S +VP V+ H
Sbjct: 368 IIVDSGTAVTRLQSDVYNSLRDSFVRGTQHLPSTSGVALFDTCYDLSSRSSVEVPTVSFH 427
Query: 310 F-RGADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
F G + L N+ + V S C F T+++ I GN+ Q V YD+ V F
Sbjct: 428 FPDGKYLALPAKNYLIPVDSAGTFCFAFAPTTSALSIIGNVQQQGTRVSYDLSNSLVGFS 487
Query: 368 PTDC 371
P C
Sbjct: 488 PNGC 491
>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
Length = 461
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 86/230 (37%), Positives = 118/230 (51%), Gaps = 29/230 (12%)
Query: 27 GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI-- 84
G +V L HR P SP + + P L + L R R + + S + D+
Sbjct: 56 GAATVPLHHRHGPCSPL-PTKKMP--TLEETLHRDQLRAAYIQRK--FSGGGGAGGDVQR 110
Query: 85 ----IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP 133
+P N YLI + +G+P T + + DTGSD+ W QC+PC SQC+ Q P
Sbjct: 111 SDATVPTALGTSLNTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPC--SQCHSQADP 168
Query: 134 LFDPKMSSTYKSLPCSSSQCASLNQKS---CSGVNCQYSVSYGDGSFSNGNLATETVTLG 190
LFDP SSTY C S+ CA L Q+ S CQY V+YGDGS + G +++T+ LG
Sbjct: 169 LFDPSSSSTYSPFSCGSADCAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALG 228
Query: 191 STTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
S+ A+ FGC G FN +T G++GLGGG SL+SQ T+
Sbjct: 229 SS-----AVRSFQFGCSNVESG-FNDQTDGLMGLGGGAQSLVSQTAGTLG 272
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 43/124 (34%), Positives = 61/124 (49%), Gaps = 10/124 (8%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHF 310
V+DSGT +T LP S L S + ++ P A P+G L+ C+ F+ S V P V + F
Sbjct: 343 VMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVF 402
Query: 311 R-GADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFK 367
GA V L S + C F G ++ S+ I GN+ Q F V YD+ + V F+
Sbjct: 403 SGGAVVSLDASGIILS-----NCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFR 457
Query: 368 PTDC 371
C
Sbjct: 458 AGAC 461
>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 514
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 99/374 (26%), Positives = 160/374 (42%), Gaps = 83/374 (22%)
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
++ + + YL+ + +GTPP + DTGSDL W QC PC C+ Q P+FDP S
Sbjct: 142 ESGVAVGSGEYLVDLYVGTPPRRFQMIMDTGSDLNWLQCAPC--LDCFEQRGPVFDPAAS 199
Query: 141 STYKSLPCSSSQCASL----NQKSCSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTT 193
+Y+++ C +C + ++C + C Y YGD S + G+LA E T+ T
Sbjct: 200 LSYRNVTCGDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTA 259
Query: 194 -GQAVALPGITFGCGTNNGGLFN----------------SKTTGIVGLG----------- 225
G + + + FGCG +N GLF+ S+ + G
Sbjct: 260 PGASRRVDDVVFGCGHSNRGLFHGAAGLLGLGRGALSFASQLRAVYGHAFSYCLVDHGSS 319
Query: 226 -------GGDISLISQ-----------------------MRTTIAGNQRLGV--STPDI- 252
G D +L+ ++ + G ++L + ST D+
Sbjct: 320 VGSKIVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVG 379
Query: 253 -------VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--Q 302
+IDSGTTL++ + Y + + M +A P+ L CY+ + + +
Sbjct: 380 KDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSPCYNVSGVERVE 439
Query: 303 VPEVTIHFR-GADVKLSRSNFFVKVSED-IVCSVFKGITNS-VPIYGNIMQTNFLVGYDI 359
VPE ++ F GA N+FV++ D I+C G S + I GN Q NF V YD+
Sbjct: 440 VPEFSLLFADGAVWDFPAENYFVRLDPDGIMCLAVLGTPRSAMSIIGNFQQQNFHVLYDL 499
Query: 360 EQQTVSFKPTDCTK 373
+ + F P C +
Sbjct: 500 QNNRLGFAPRRCAE 513
>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
Length = 452
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 108/359 (30%), Positives = 162/359 (45%), Gaps = 82/359 (22%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ NY +++ +G+P + DTGS W QC+PC C++Q+ P+F+P S TYK++P
Sbjct: 100 SGNYYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCT-IYCHIQEDPVFNPSASKTYKTVP 158
Query: 148 CSSSQC-----ASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
CSSSQC A+LN+ +CS + C Y SYGD SFS G L+ + +TL T Q L
Sbjct: 159 CSSSQCSSLKSATLNEPTCSKQSNACVYKASYGDSSFSLGYLSQDVLTL--TPSQT--LS 214
Query: 201 GITFGCGTNNGGLF---------------------------------------NSKTTGI 221
+GCG +N GLF NS G
Sbjct: 215 SFVYGCGQDNQGLFGRTDGIIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSPKEGF 274
Query: 222 VGLGGGDIS--------------------LISQMRTTIAGNQRLGVSTPDI----VIDSG 257
+ +G ++ I T+AG + LGV+ +IDSG
Sbjct: 275 LSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAG-RPLGVAASSYKVPTIIDSG 333
Query: 258 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCY--SFNSLSQV-PEVTIHFRG- 312
T +T LP + L + +++ + P SL + C+ S +S+V P++ I F+G
Sbjct: 334 TVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKGSLAGISEVAPDIRIIFKGG 393
Query: 313 ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
AD++L N V++ I C G ++S+ I GN Q V YD+ V F P C
Sbjct: 394 ADLQLKGHNSLVELETGITCLAMAG-SSSIAIIGNYQQQTVKVAYDVGNSRVGFAPGGC 451
>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
Length = 465
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 78/234 (33%), Positives = 114/234 (48%), Gaps = 29/234 (12%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF--------NQNSSISSSKASQ 81
SV L+HR P +P S P L + L R R N+ +++S +
Sbjct: 44 SVPLVHRHGPCAPSAASGGKP--SLAERLRRDRARANYIVTKAAGGRTAATAVSDAVGGG 101
Query: 82 ADIIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
IP ++ Y++ + IGTP +++ + DTGSDL W QC+PC +CY Q PL
Sbjct: 102 GTSIPTFLGDSVDSLEYVVTLGIGTPAVQQIVLIDTGSDLSWVQCKPCGAGECYAQKDPL 161
Query: 135 FDPKMSSTYKSLPCSSSQCASLNQ-------KSCSGVNCQYSVSYGDGSFSNGNLATETV 187
FDP SS+Y S+PC S C L S + C+Y + YG+ + + G +TET+
Sbjct: 162 FDPSSSSSYASVPCDSDACRKLAAGAYGHGCTSGAAALCEYGIEYGNRATTTGVYSTETL 221
Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
TL V + FGCG + G + K G++GLGG SL+SQ + G
Sbjct: 222 TL----KPGVVVADFGFGCGDHQHGPYE-KFDGLLGLGGAPESLVSQTSSQFGG 270
Score = 45.1 bits (105), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 36/125 (28%), Positives = 57/125 (45%), Gaps = 7/125 (5%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQV--PEVT 307
+VIDSGT +T LP + L S S + + P+ L+ CY F + V P +
Sbjct: 343 MVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGAVLDTCYDFTGHTNVTVPTIA 402
Query: 308 IHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
+ F G A + L+ + + + G +++ I GN+ Q F V YD + TV F
Sbjct: 403 LTFSGGATIDLATPAGVLV--DGCLAFAGAGTDDTIGIIGNVNQRTFEVLYDSGKGTVGF 460
Query: 367 KPTDC 371
+ C
Sbjct: 461 RAGAC 465
>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
Length = 514
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 99/374 (26%), Positives = 160/374 (42%), Gaps = 83/374 (22%)
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
++ + + YL+ + +GTPP + DTGSDL W QC PC C+ Q P+FDP S
Sbjct: 142 ESGVAVGSGEYLVDLYVGTPPRRFQMIMDTGSDLNWLQCAPC--LDCFEQRGPVFDPATS 199
Query: 141 STYKSLPCSSSQCASL----NQKSCSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTT 193
+Y+++ C +C + ++C + C Y YGD S + G+LA E T+ T
Sbjct: 200 LSYRNVTCGDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTA 259
Query: 194 -GQAVALPGITFGCGTNNGGLFN----------------SKTTGIVGLG----------- 225
G + + + FGCG +N GLF+ S+ + G
Sbjct: 260 PGASRRVDDVVFGCGHSNRGLFHGAAGLLGLGRGALSFASQLRAVYGHAFSYCLVDHGSS 319
Query: 226 -------GGDISLISQ-----------------------MRTTIAGNQRLGV--STPDI- 252
G D +L+ ++ + G ++L + ST D+
Sbjct: 320 VGSKIVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVG 379
Query: 253 -------VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--Q 302
+IDSGTTL++ + Y + + M +A P+ L CY+ + + +
Sbjct: 380 KDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSPCYNVSGVERVE 439
Query: 303 VPEVTIHFR-GADVKLSRSNFFVKVSED-IVCSVFKGITNS-VPIYGNIMQTNFLVGYDI 359
VPE ++ F GA N+FV++ D I+C G S + I GN Q NF V YD+
Sbjct: 440 VPEFSLLFADGAVWDFPAENYFVRLDPDGIMCLAVLGTPRSAMSIIGNFQQQNFHVLYDL 499
Query: 360 EQQTVSFKPTDCTK 373
+ + F P C +
Sbjct: 500 QNNRLGFAPRRCAE 513
>gi|449457263|ref|XP_004146368.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 469
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 126/459 (27%), Positives = 187/459 (40%), Gaps = 112/459 (24%)
Query: 14 LCFYVVSPIEAQT------GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNH 67
L FY+ + I + T + +LIHR+S P Y+ +ET R + T S+ R +
Sbjct: 17 LAFYLSTAIISSTLITTKPSRLATKLIHRNSYLHPLYDQNETVEDRSKREQTSSIERFDF 76
Query: 68 FNQNSSISSSKA----SQADIIPNN--ANYLIRISIGTPPTERLAVADTGSDLIWTQCEP 121
S I K+ +++ +IP N + +L+ +SIG+PP +L V DTGS L+W QC P
Sbjct: 77 LE--SKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVVDTGSSLLWVQCLP 134
Query: 122 CPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNG 180
C C+ Q + FDP S ++K+L C +N C+ N +Y + Y G S G
Sbjct: 135 CI--NCFQQSTSWFDPLKSVSFKTLGCGFPGYNYINGYKCNRFNQAEYKLRYLGGDSSQG 192
Query: 181 NLATETVTLG-------------STTGQAVALPGITFGCG-----TNNGGLFNSKTTGIV 222
LA E++ ST + ITFGCG TNN +N G+
Sbjct: 193 ILAKESLLFETLDEGRVFQYNAISTQISKIKKSNITFGCGHMNIKTNNDDAYN----GVF 248
Query: 223 GLGG------------------GDIS--------LISQMRTTIAGN-------------- 242
GLG GDI+ L+ + I G+
Sbjct: 249 GLGAYPHITMATQLGNKFSYCIGDINNPLYTHNHLVLGQGSYIEGDSTPLQIHFGHYYVT 308
Query: 243 -QRLGVSTPDIVI-----------------DSGTTLTFLPQG----YNSNLLSVMSSMIE 280
Q + V + + I DSG T T L G ++ +M ++E
Sbjct: 309 LQSISVGSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLE 368
Query: 281 AQPVADPTGSLELCYS---FNSLSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFK 336
P LC+ L P VT HF GAD+ L + F + D C
Sbjct: 369 RIPTQRKFEG--LCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAIL 426
Query: 337 GITNS----VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+NS + + G + Q N+ VG+D+EQ V F+ DC
Sbjct: 427 P-SNSELLNLSVIGILAQQNYNVGFDLEQMKVFFRRIDC 464
>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
Length = 469
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 113/419 (26%), Positives = 167/419 (39%), Gaps = 88/419 (21%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFN---------QNSSISSSKAS 80
SV L HR+ P SP E P + L R R + Q+++ + S +
Sbjct: 62 SVPLAHRNGPCSPVRGKGELPRAEM---LRRDRERTEYIIRRASRSRRLQDNNDAVSVPT 118
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
Q ++ Y+ + +GTP + + DTGS L W QC+PC SQCY Q PLFDP S
Sbjct: 119 QLGSSYDSQEYVATVGLGTPAVPQTLILDTGSSLTWVQCKPCNSSQCYPQRLPLFDPNTS 178
Query: 141 STYKSLPCSSSQC----ASLNQKSCSG---VNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
S+Y +PC S +C A ++ C+ C Y + YG G+ G +T+ +TLG
Sbjct: 179 SSYSPVPCDSQECRALAAGIDGDGCTSDGDWGCAYEIHYGSGATPAGEYSTDALTLGP-- 236
Query: 194 GQAVALPGITFGCGTN-NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGN------QRLG 246
+ FGCG + G F+ G++GLG SL Q G G
Sbjct: 237 --GAIVKRFHFGCGHHQQRGKFD-MADGVLGLGRLPQSLAWQASARRGGGVFSHCLPPTG 293
Query: 247 VS-----------------TPDIVID---------------------------------- 255
VS TP + +D
Sbjct: 294 VSTGFLALGAPHDTSAFVFTPLLTMDDQPWFYQLMPTAISVAGQLLDIPPAVFREGVITD 353
Query: 256 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR-G 312
SGT L+ L + + L + S + P+A P G L+ C++F VP V++ FR G
Sbjct: 354 SGTVLSALQETAYTALRTAFRSAMAEYPLAPPVGHLDTCFNFTGYDNVTVPTVSLTFRGG 413
Query: 313 ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
A V L S+ V D + + + G++ Q V YD+ + V F+ C
Sbjct: 414 ATVHLDASS---GVLMDGCLAFWSSGDEYTGLIGSVSQRTIEVLYDMPGRKVGFRTGAC 469
>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 57/127 (44%), Positives = 78/127 (61%), Gaps = 7/127 (5%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y R+ IG PP++ + DTGSD+ W QC PC + CY Q P+F+P S+++ +L
Sbjct: 146 SGEYFSRVGIGKPPSQAYLILDTGSDVNWVQCAPC--ADCYQQADPIFEPASSASFSTLS 203
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
C++ QC SL+ C C Y VSYGDGS++ G+ TET+TLGS VA+ GCG
Sbjct: 204 CNTRQCRSLDVSECRNDTCLYEVSYGDGSYTVGDFVTETITLGSAPVDNVAI-----GCG 258
Query: 208 TNNGGLF 214
NN GLF
Sbjct: 259 HNNEGLF 265
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 45/157 (28%), Positives = 68/157 (43%), Gaps = 10/157 (6%)
Query: 220 GIVGLG-GGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSM 278
G+ GL GG++ I + I + GV ++DSGT +T L ++L
Sbjct: 333 GLTGLSVGGELVSIPESAFQIDESGNGGV-----IVDSGTAITRLQTDVYNSLRDAFVKR 387
Query: 279 IEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF-RGADVKLSRSNFFVKV-SEDIVCSV 334
P + + CY +S +VP V+ HF G ++ L N+ V + SE C
Sbjct: 388 TRDLPSTNGIALFDTCYDLSSKGNVEVPTVSFHFPDGKELPLPAKNYLVPLDSEGTFCFA 447
Query: 335 FKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
F +S+ I GN+ Q V YD+ V F P C
Sbjct: 448 FAPTASSLSIIGNVQQQGTRVVYDLVNHLVGFVPNKC 484
>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 448
Score = 121 bits (304), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 64/150 (42%), Positives = 90/150 (60%), Gaps = 11/150 (7%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y + +GTPPT L V DTGSD++W QC+PC CY Q SPL+DP+ SSTY P
Sbjct: 96 SGEYFASVGVGTPPTPALLVIDTGSDVVWLQCKPC--VHCYRQLSPLYDPRGSSTYAQTP 153
Query: 148 CSSSQCASLNQKSCSGV--NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
CS QC N ++C G C Y + YGD S ++GNLAT+ + + T ++ +T G
Sbjct: 154 CSPPQCR--NPQTCDGTTGGCGYRIVYGDASSTSGNLATDRLVFSNDT----SVGNVTLG 207
Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
CG +N GLF S G++G+ G+ S +Q+
Sbjct: 208 CGHDNEGLFGS-AAGLLGVARGNNSFATQV 236
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 35/128 (27%), Positives = 62/128 (48%), Gaps = 8/128 (6%)
Query: 252 IVIDSGTTLT-FLPQGYNS--NLLSVMSSMIEAQPVADPTGSLELCYSFN--SLSQVPEV 306
+V+DSGT++T F Y + + ++ + + V + CY +++ P V
Sbjct: 321 VVVDSGTSITRFARDAYGALRDAFDARAAKVGMRKVGRGISVFDACYDLRGVAVADAPGV 380
Query: 307 TIHFRG-ADVKLSRSNFFV-KVSEDIVCSVFKGI-TNSVPIYGNIMQTNFLVGYDIEQQT 363
+HF G ADV L N+ V + S C + + + + GN++Q F V +D+E +
Sbjct: 381 VLHFAGGADVALPPENYLVPEESGRYHCFALEAAGHDGLSVIGNVLQQRFRVVFDVENER 440
Query: 364 VSFKPTDC 371
V F+P C
Sbjct: 441 VGFEPNGC 448
>gi|242073262|ref|XP_002446567.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
gi|241937750|gb|EES10895.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
Length = 453
Score = 121 bits (304), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 66/190 (34%), Positives = 105/190 (55%), Gaps = 13/190 (6%)
Query: 49 TPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVA 108
T ++ +R A+ RSL+R +N + +A ++P YL+++ IGTP A
Sbjct: 49 TDHELIRRAVQRSLDRPGVAARNRK---AVVGEAPLVPRGGEYLVKLGIGTPQHYFSAAI 105
Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN--- 165
DT SDL+W QC+PC CY Q P+F+P++SS+Y +PCSS C+ L+ C +
Sbjct: 106 DTASDLVWLQCQPC--VSCYRQLDPIFNPRLSSSYAVVPCSSDTCSQLDGHRCDEDDDQA 163
Query: 166 CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLG 225
C+Y+ Y + +NG LA + + +G AV L GC ++ G + +G+VGL
Sbjct: 164 CRYNYKYSGNAVTNGTLAIDKLAVGGNVFHAVVL-----GCSDSSVGGPPPQASGLVGLA 218
Query: 226 GGDISLISQM 235
G +SL+SQ+
Sbjct: 219 RGPLSLLSQL 228
>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 121 bits (304), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 104/347 (29%), Positives = 142/347 (40%), Gaps = 77/347 (22%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
Y++ +S+GTP + DTGSD+ W QC+PC C Q LFDP SSTY ++PC
Sbjct: 142 QYVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQRDQLFDPAKSSTYSAVPCG 201
Query: 150 SSQCASLN--QKSCSGVNCQYSVSYGDGSFSNGNLATETVTL--GSTTGQAVALPGITFG 205
+ C+ L + CSG C Y VSYGDGS + G ++T+ L G+T G FG
Sbjct: 202 ADACSELRIYEAGCSGSQCGYVVSYGDGSNTTGVYGSDTLALAPGNTVGT------FLFG 255
Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQM--------------RTTIAGNQRLG----- 246
CG G+F + G++ LG +SL SQ + + AG LG
Sbjct: 256 CGHAQAGMF-AGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPSKQSAAGYLTLGGPTSA 314
Query: 247 -------------------------------VSTP------DIVIDSGTTLTFLPQGYNS 269
V+ P V+D+GT +T LP +
Sbjct: 315 SGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAFAGGTVVDTGTVITRLPPTAYA 374
Query: 270 NLLSVMSSMIE--AQPVADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFV 324
L S I P A G L+ CY F+ V P V + F GA + L
Sbjct: 375 ALRSAFRGAIAPYGYPSAPANGILDTCYDFSRYGVVTLPTVALTFSGGATLALEAPGI-- 432
Query: 325 KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+S + G I GN+ Q +F V +D TV F P C
Sbjct: 433 -LSSGCLAFAPNGGDGDAAILGNVQQRSFAVRFD--GSTVGFMPGAC 476
>gi|125571841|gb|EAZ13356.1| hypothetical protein OsJ_03278 [Oryza sativa Japonica Group]
Length = 447
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 75/202 (37%), Positives = 108/202 (53%), Gaps = 23/202 (11%)
Query: 31 VELIHRDS--PKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ-NSSISSSKASQADIIP- 86
V + HRD+ P P QRL R + ++ + +S + S IP
Sbjct: 29 VPVFHRDALFPPPPGAKRGSLLRQRLAADAARYASLVDATGRLHSPVFSG-------IPF 81
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ Y + +GTP T+ + V DTGSDL+W QC PC +CY Q +FDP+ SSTY+ +
Sbjct: 82 ESGEYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPC--RRCYAQRGQVFDPRRSSTYRRV 139
Query: 147 PCSSSQCASLNQKSC-----SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
PCSS QC +L C +G C+Y V+YGDGS S G+LAT+ + + T +
Sbjct: 140 PCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGDLATDKLAFANDT----YVNN 195
Query: 202 ITFGCGTNNGGLFNSKTTGIVG 223
+T GCG +N GLF+S G++G
Sbjct: 196 VTLGCGRDNEGLFDS-AAGLLG 216
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/92 (26%), Positives = 43/92 (46%), Gaps = 10/92 (10%)
Query: 291 LELCYSFNS--LSQVPEVTIHFRG-ADVKLSRSNFFV-------KVSEDIVCSVFKGITN 340
+ CY + P + +HF G AD+ L N+F+ + + C F+ +
Sbjct: 355 FDACYDLRGRPAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADD 414
Query: 341 SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
+ + GN+ Q F V +D+E++ + F P CT
Sbjct: 415 GLSVIGNVQQQGFRVVFDVEKERIGFAPKGCT 446
>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 104/347 (29%), Positives = 142/347 (40%), Gaps = 77/347 (22%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
Y++ +S+GTP + DTGSD+ W QC+PC C Q LFDP SSTY ++PC
Sbjct: 142 QYVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQRDQLFDPAKSSTYSAVPCG 201
Query: 150 SSQCASLN--QKSCSGVNCQYSVSYGDGSFSNGNLATETVTL--GSTTGQAVALPGITFG 205
+ C+ L + CSG C Y VSYGDGS + G ++T+ L G+T G FG
Sbjct: 202 ADACSELRIYEAGCSGSQCGYVVSYGDGSNTTGVYGSDTLALAPGNTVGT------FLFG 255
Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQM--------------RTTIAGNQRLG----- 246
CG G+F + G++ LG +SL SQ + + AG LG
Sbjct: 256 CGHAQAGMF-AGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPSKQSAAGYLTLGGPSSA 314
Query: 247 -------------------------------VSTP------DIVIDSGTTLTFLPQGYNS 269
V+ P V+D+GT +T LP +
Sbjct: 315 SGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAFAGGTVVDTGTVITRLPPTAYA 374
Query: 270 NLLSVMSSMIE--AQPVADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFV 324
L S I P A G L+ CY F+ V P V + F GA + L
Sbjct: 375 ALRSAFRGAIAPCGYPSAPANGILDTCYDFSRYGVVTLPTVALTFSGGATLALEAPGI-- 432
Query: 325 KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+S + G I GN+ Q +F V +D TV F P C
Sbjct: 433 -LSSGCLAFAPNGGDGDAAILGNVQQRSFAVRFD--GSTVGFMPGAC 476
>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 467
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 68/198 (34%), Positives = 109/198 (55%), Gaps = 18/198 (9%)
Query: 49 TPYQRLRDALTRSLNRLNHFNQNSSISSSK----ASQADIIPNNANYLIRISIGTPPTER 104
T ++ LR A+ RS +RL +SS+ ++A ++ YL+++ +GTP
Sbjct: 42 TDHELLRRAIQRSRDRLASIAPRLLPTSSRNKVVVAEAPVLSAGGEYLVKLGLGTPQHCF 101
Query: 105 LAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV 164
A DT SDLIWTQC+PC +CY Q P+F+P S++Y +PC+S C L+ C+
Sbjct: 102 TAAIDTASDLIWTQCQPC--VKCYKQLDPVFNPVASTSYAVVPCNSDTCDELDTHRCARD 159
Query: 165 N-------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSK 217
CQY+ SYG + + G LA + + +G + G+ FGC +++ G +
Sbjct: 160 GDSDDEDACQYTYSYGGNATTRGILAVDRLAIGDDVFR-----GVVFGCSSSSVGGPPPQ 214
Query: 218 TTGIVGLGGGDISLISQM 235
+G+VGLG G +SL+SQ+
Sbjct: 215 VSGVVGLGRGALSLVSQL 232
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/125 (25%), Positives = 59/125 (47%), Gaps = 5/125 (4%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS---LSQV--PEV 306
++ID +T+TFL + ++ + I + L+LC+ +S+V P V
Sbjct: 339 MIIDIASTITFLEESLYEEMVDDLEEEIRLPRGSGSDLGLDLCFILPEGVPMSRVYAPPV 398
Query: 307 TIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
++ F G ++L + FV+ + + G T+ V I GN Q N V Y++ + ++F
Sbjct: 399 SLAFEGVWLRLDKEQMFVEDRASGMMCLMVGKTDGVSILGNYQQQNMQVMYNLRRGRITF 458
Query: 367 KPTDC 371
T C
Sbjct: 459 IKTAC 463
>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
Length = 448
Score = 121 bits (303), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 89/152 (58%), Gaps = 10/152 (6%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
++ Y I++G PPT L V DTGSDLIW QC PC CY Q +PL+DP+ SST++ +
Sbjct: 84 DSGEYFAVINVGDPPTRALVVIDTGSDLIWLQCVPC--RHCYRQVTPLYDPRSSSTHRRI 141
Query: 147 PCSSSQCAS-LNQKSCSGV--NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
PC+S +C L C C Y V YGDGS S+G+LAT+ + T + +T
Sbjct: 142 PCASPRCRDVLRYPGCDARTGGCVYMVVYGDGSASSGDLATDRLVFPDDT----HVHNVT 197
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
GCG +N GL S G++G+G G +S +Q+
Sbjct: 198 LGCGHDNVGLLES-AAGLLGVGRGQLSFPTQL 228
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 31/136 (22%), Positives = 60/136 (44%), Gaps = 15/136 (11%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEA----QPVADPTGSLELCYSFN------SLS 301
IV+DSGT ++ + + + S A + +A + CY +
Sbjct: 313 IVVDSGTAISRFARDAYAAVRDAFDSHAAAAGTMRKLATKFSVFDACYDLRGNGAPAAAV 372
Query: 302 QVPEVTIHFRG-ADVKLSRSNFFVKV----SEDIVCSVFKGITNSVPIYGNIMQTNFLVG 356
+VP + +HF G AD+ L ++N+ + V C + + + + GN+ Q F +
Sbjct: 373 RVPSIVLHFAGGADMALPQANYLIPVQGGDRRTYFCLGLQAADDGLNVLGNVQQQGFGLV 432
Query: 357 YDIEQQTVSFKPTDCT 372
+D+E+ + F P C+
Sbjct: 433 FDVERGRIGFTPNGCS 448
>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
Length = 471
Score = 120 bits (302), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 65/157 (41%), Positives = 93/157 (59%), Gaps = 13/157 (8%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ NY +++ +GTPP + DTGS L W QC+PC C+ Q PL+DP +S TYK L
Sbjct: 122 SGNYYVKLGLGTPPKYYAMILDTGSSLSWLQCQPC-AVYCHAQADPLYDPSVSKTYKKLS 180
Query: 148 CSSSQC-----ASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
C+S +C A+LN C + C Y+ SYGD SFS G L+ + +TL S+ LP
Sbjct: 181 CASVECSRLKAATLNDPLCETDSNACLYTASYGDTSFSIGYLSQDLLTLTSSQ----TLP 236
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRT 237
T+GCG +N GLF + GI+GL +S+++Q+ T
Sbjct: 237 QFTYGCGQDNQGLFG-RAAGIIGLARDKLSMLAQLST 272
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 39/125 (31%), Positives = 61/125 (48%), Gaps = 6/125 (4%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS-LELCY--SFNSLSQVPEVTIH 309
+IDSGT +T LP + L ++ + P S L+ C+ S S+S VPE+ +
Sbjct: 346 LIDSGTVITRLPMSMYAALRQAFVKIMSTKYAKAPAYSILDTCFKGSLKSISAVPEIKMI 405
Query: 310 FRG-ADVKLSRSNFFVKVSEDIVCSVFKGI--TNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
F+G AD+ L + ++ + I C F G TN + I GN Q + + YD+ + F
Sbjct: 406 FQGGADLTLRAPSILIEADKGITCLAFAGSSGTNQIAIIGNRQQQTYNIAYDVSTSRIGF 465
Query: 367 KPTDC 371
P C
Sbjct: 466 APGSC 470
>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 427
Score = 120 bits (302), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 122/442 (27%), Positives = 192/442 (43%), Gaps = 91/442 (20%)
Query: 1 MATFL-SCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALT 59
MA F S +F L LCF + + + + L+H Y+ +++A
Sbjct: 1 MAIFFTSPLFFLIILCFSISVVHLSASPTLVLNLVH----SYHIYSRKPPHVYHIKEA-- 54
Query: 60 RSLNRLNHFNQNSS--ISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWT 117
S+ RL + ++ I + + IIP +L+ ISIG+PP +L DT SDL+W
Sbjct: 55 -SVERLEYLKAKTTGDIIAHLSPNVPIIPQA--FLVNISIGSPPITQLLHMDTASDLLWI 111
Query: 118 QCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK-SCSGVNCQYSVSYGDGS 176
QC PC CY Q P+FDP S T+++ C +SQ + + K + + +C+YS+ Y D +
Sbjct: 112 QCLPC--INCYAQSLPIFDPSRSYTHRNETCRTSQYSMPSLKFNANTRSCEYSMRYVDDT 169
Query: 177 FSNGNLATETVTLGSTTGQ--AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
S G LA E + + + + AL + FGCG +N G TGI+GLG G+ SL+ +
Sbjct: 170 GSKGILAREMLLFNTIYDESSSAALHDVVFGCGHDNYGE-PLVGTGILGLGYGEFSLVHR 228
Query: 235 MRTTIA---------------------GNQRLGVSTP-----------------DIVI-- 254
+ G LG +TP D +I
Sbjct: 229 FGKKFSYCFGSLDDPSYPHNVLVLGDDGANILGDTTPLEIHNGFYYVTIEAISVDGIILP 288
Query: 255 -------------------DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL-- 293
D+G +LT L + L + + + E + A ++
Sbjct: 289 IDPRVFNRNHQTGLGGTIIDTGNSLTSLVEEAYKPLKNRIEDIFEGRFTAADVSQDDMIK 348
Query: 294 --CYSFN-----SLSQVPEVTIHF-RGADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPI 344
CY+ N S P VT HF GA++ L + F+K+S ++ C +V G NS+
Sbjct: 349 MECYNGNFERDLVESGFPIVTFHFSEGAELSLDVKSLFMKLSPNVFCLAVTPGNLNSI-- 406
Query: 345 YGNIMQTNFLVGYDIEQQTVSF 366
G Q ++ +GYD+E VSF
Sbjct: 407 -GATAQQSYNIGYDLEAMEVSF 427
>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
Length = 460
Score = 120 bits (302), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 88/240 (36%), Positives = 122/240 (50%), Gaps = 43/240 (17%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD---- 83
G +EL H D+ + F S R+R A RS R+N + ++ ++D
Sbjct: 29 GIRLELTHVDA-RGDFTGS-----DRVRRAADRSHRRVNGLLAAAPPPAASTLRSDGGGG 82
Query: 84 ----------IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDS 132
+ + A YL+ +IGTPP AV DTGSDLIWTQC+ PC +C+ Q +
Sbjct: 83 GACAATAAASVHASTATYLVDFAIGTPPLALSAVLDTGSDLIWTQCDAPC--RRCFPQPA 140
Query: 133 PLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV-------------NCQYSVSYGDGSFSN 179
PL+ P S TY ++ C S C +L S C Y SYGDGS ++
Sbjct: 141 PLYAPARSVTYANVSCGSRLCDALPSLRPSSRCSASASAPAPERGGCTYYYSYGDGSSTD 200
Query: 180 GNLATETVTLGSTTGQAVALPGITFGCGTNN-GGLFNSKTTGIVGLGGGDISLISQMRTT 238
G LATET T G+ T + + FGCGT+N GG NS +G+VG+G G +SL+SQ+ T
Sbjct: 201 GVLATETFTFGAGT----TVHDLAFGCGTDNLGGTDNS--SGLVGMGRGPLSLVSQLGVT 254
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 38/128 (29%), Positives = 65/128 (50%), Gaps = 11/128 (8%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN-----SLSQVPEV 306
++IDSGTT T L + L +++ + + L +C++ VP +
Sbjct: 333 LIIDSGTTFTALEERAFVVLARAVAARVALPLASGAHLGLSVCFAAPQGRGPEAVDVPRL 392
Query: 307 TIHFRGADVKLSRSNFFVKVSEDIVCSVF-KGITNS--VPIYGNIMQTNFLVGYDIEQQT 363
+HF GAD++L RS+ V ED V V GI ++ + + G++ Q N V YD+ +
Sbjct: 393 VLHFDGADMELPRSS---AVVEDRVAGVACLGIVSARGMSVLGSMQQQNMHVRYDVGRDV 449
Query: 364 VSFKPTDC 371
+SF+P +C
Sbjct: 450 LSFEPANC 457
>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 492
Score = 120 bits (302), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 72/208 (34%), Positives = 97/208 (46%), Gaps = 33/208 (15%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
FS++L P+ N Y+ L L R R+N N ++ S +++D+ P
Sbjct: 77 FSLQL----HPRETLLNEQHPNYKTLVLSRLARDTARVNSLNTKLQLALSSLNRSDLYPT 132
Query: 88 N---------------------ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
Y R+ +G P V DTGSD+ W QC+PC S
Sbjct: 133 ETELLRPEDLSTPVSSGTAQGSGEYFSRVGVGQPSKPFYMVLDTGSDVNWLQCKPC--SD 190
Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATET 186
CY Q P+FDP SS+Y L C + QC L +C C Y VSYGDGSF+ G TET
Sbjct: 191 CYQQSDPIFDPTASSSYNPLTCDAQQCQDLEMSACRNGKCLYQVSYGDGSFTVGEYVTET 250
Query: 187 VTLGSTTGQAVALPGITFGCGTNNGGLF 214
V+ G+ + VA+ GCG +N GLF
Sbjct: 251 VSFGAGSVNRVAI-----GCGHDNEGLF 273
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 39/125 (31%), Positives = 57/125 (45%), Gaps = 6/125 (4%)
Query: 252 IVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTI 308
+++DSGT +T L Q YNS + +P A+ + CY +SL +VP V+
Sbjct: 369 VIVDSGTAITRLRTQAYNSVRDAFKRKTSNLRP-AEGVALFDTCYDLSSLQSVRVPTVSF 427
Query: 309 HFRGADV-KLSRSNFFVKVS-EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
HF G L N+ + V C F T+S+ I GN+ Q V +D+ V F
Sbjct: 428 HFSGDRAWALPAKNYLIPVDGAGTYCFAFAPTTSSMSIIGNVQQQGTRVSFDLANSLVGF 487
Query: 367 KPTDC 371
P C
Sbjct: 488 SPNKC 492
>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
Length = 493
Score = 120 bits (301), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 76/227 (33%), Positives = 113/227 (49%), Gaps = 31/227 (13%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNS------------SISS 76
V L+HRDS + + TP Q L L R R + + +SS
Sbjct: 61 LHVRLLHRDS-----FAVNATPAQLLARRLQRDELRAAWIIKAAAPAAAANDTPVVGLSS 115
Query: 77 SKASQADIIPN----NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
A A ++ + Y+ +I++GTP E L DTGSD+ W QC+PC +CY Q
Sbjct: 116 GGAFVAPVVSRAPTTSGEYMAKIAVGTPAVEALLAMDTGSDITWLQCQPC--RRCYPQSG 173
Query: 133 PLFDPKMSSTYKSLPCSSSQCASLNQKS---CSGVNCQYSVSYG-DGSFSNGNLATETVT 188
P+FDP+ S++Y+ + + C +L + + C Y+V YG DGS + G+ ET+T
Sbjct: 174 PVFDPRHSTSYREMGYDAPDCQALGRSGGGDAKRMTCVYAVGYGDDGSTTVGDFIEETLT 233
Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
V +P ++ GCG +N GLF + GI+GLG G IS SQ+
Sbjct: 234 FAG----GVQVPHMSIGCGHDNKGLFAAPAAGILGLGRGQISCPSQI 276
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/127 (32%), Positives = 66/127 (51%), Gaps = 7/127 (5%)
Query: 252 IVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVA--DPTGSLELCYSFNSLS-QVPEVT 307
+++DSGT +T L + Y + + ++ ++ V+ P+G + CY+ + +VP V+
Sbjct: 367 VILDSGTAVTRLARRAYIAFRDAFRAAAVDLGQVSIGGPSGFFDTCYTMGGRAMKVPTVS 426
Query: 308 IHFRGA-DVKLSRSNFFVKV-SEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTV 364
+HF G ++ L N+ + V S VC F G + SV I GNI Q F V Y+I V
Sbjct: 427 MHFAGGVELTLPPKNYLIPVDSMGTVCFAFAGTGDRSVSIIGNIQQQGFRVVYNIGGGRV 486
Query: 365 SFKPTDC 371
F P C
Sbjct: 487 GFAPNSC 493
>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
Length = 401
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 67/153 (43%), Positives = 83/153 (54%), Gaps = 11/153 (7%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
YL+ ++IGTPP DTGSDLIWTQC+PCP C+ Q P FDP SST C
Sbjct: 81 EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCP--ACFDQALPYFDPSTSSTLSLTSCD 138
Query: 150 SSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
S+ C L SC C Y+ SYGD S + G L + T G ++PG+
Sbjct: 139 STLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTF---VGAGASVPGVA 195
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
FGCG N G+F S TGI G G G +SL SQ++
Sbjct: 196 FGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLK 228
>gi|297737850|emb|CBI27051.3| unnamed protein product [Vitis vinifera]
Length = 256
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 57/127 (44%), Positives = 76/127 (59%), Gaps = 6/127 (4%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y R+ IG+PP V DTGSD+ W QC PC + CY Q P+F+P SS+Y L
Sbjct: 50 SGEYFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPC--ADCYQQADPIFEPSFSSSYAPLT 107
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
C + QC SL+ C +C Y VSYGDGS++ G+ ATET+TL + +L + GCG
Sbjct: 108 CETHQCKSLDVSECRNDSCLYEVSYGDGSYTVGDFATETITLDG----SASLNNVAIGCG 163
Query: 208 TNNGGLF 214
+N GLF
Sbjct: 164 HDNEGLF 170
>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
Length = 524
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 69/151 (45%), Positives = 89/151 (58%), Gaps = 24/151 (15%)
Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-ASLNQK-----S 160
+ DTGSDL W QC+PC S CY Q PLFDP S++Y ++PC++S C ASL S
Sbjct: 179 IVDTGSDLTWVQCKPC--SVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGS 236
Query: 161 CSGV----------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN 210
C+ V C YS++YGDGSFS G LAT+TV LG + + G FGCG +N
Sbjct: 237 CATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGAS-----VDGFVFGCGLSN 291
Query: 211 GGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
GLF T G++GLG ++SL+SQ G
Sbjct: 292 RGLFGG-TAGLMGLGRTELSLVSQTAPRFGG 321
Score = 44.7 bits (104), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 36/131 (27%), Positives = 56/131 (42%), Gaps = 9/131 (6%)
Query: 251 DIVIDSGTTLTFL-PQGYNSNLLSVMSSM-IEAQPVADPTGSLELCYSFNSLSQV--PEV 306
++++DSGT +T L P Y + E P A P L+ CY+ +V P +
Sbjct: 392 NVLLDSGTVITRLAPSVYRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLL 451
Query: 307 TIHFRG-ADVKLSRSNFFVKVSED--IVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQ 361
T+ G AD+ + + +D VC ++ + PI GN Q N V YD
Sbjct: 452 TLRLEGGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVG 511
Query: 362 QTVSFKPTDCT 372
+ F DC+
Sbjct: 512 SRLGFADEDCS 522
>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
Length = 525
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 69/151 (45%), Positives = 89/151 (58%), Gaps = 24/151 (15%)
Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-ASLNQK-----S 160
+ DTGSDL W QC+PC S CY Q PLFDP S++Y ++PC++S C ASL S
Sbjct: 180 IVDTGSDLTWVQCKPC--SVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGS 237
Query: 161 CSGV----------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN 210
C+ V C YS++YGDGSFS G LAT+TV LG + + G FGCG +N
Sbjct: 238 CATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGAS-----VDGFVFGCGLSN 292
Query: 211 GGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
GLF T G++GLG ++SL+SQ G
Sbjct: 293 RGLFGG-TAGLMGLGRTELSLVSQTAPRFGG 322
Score = 44.7 bits (104), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 36/131 (27%), Positives = 56/131 (42%), Gaps = 9/131 (6%)
Query: 251 DIVIDSGTTLTFL-PQGYNSNLLSVMSSM-IEAQPVADPTGSLELCYSFNSLSQV--PEV 306
++++DSGT +T L P Y + E P A P L+ CY+ +V P +
Sbjct: 393 NVLLDSGTVITRLAPSVYRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLL 452
Query: 307 TIHFRG-ADVKLSRSNFFVKVSED--IVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQ 361
T+ G AD+ + + +D VC ++ + PI GN Q N V YD
Sbjct: 453 TLRLEGGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVG 512
Query: 362 QTVSFKPTDCT 372
+ F DC+
Sbjct: 513 SRLGFADEDCS 523
>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
Length = 434
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 67/153 (43%), Positives = 83/153 (54%), Gaps = 11/153 (7%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
YL+ ++IGTPP DTGSDLIWTQC+PCP C+ Q P FDP SST C
Sbjct: 81 EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCP--ACFDQALPYFDPSTSSTLSLTSCD 138
Query: 150 SSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
S+ C L SC C Y+ SYGD S + G L + T G ++PG+
Sbjct: 139 STLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTF---VGAGASVPGVA 195
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
FGCG N G+F S TGI G G G +SL SQ++
Sbjct: 196 FGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLK 228
>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
Length = 434
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 67/153 (43%), Positives = 83/153 (54%), Gaps = 11/153 (7%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
YL+ ++IGTPP DTGSDLIWTQC+PCP C+ Q P FDP SST C
Sbjct: 81 EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCP--ACFDQALPYFDPSTSSTLSLTSCD 138
Query: 150 SSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
S+ C L SC C Y+ SYGD S + G L + T G ++PG+
Sbjct: 139 STLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTF---VGAGASVPGVA 195
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
FGCG N G+F S TGI G G G +SL SQ++
Sbjct: 196 FGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLK 228
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 38/127 (29%), Positives = 61/127 (48%), Gaps = 8/127 (6%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS--FNSLSQVPEVTIHF 310
+IDSGT +T LP + ++ ++ V+ T C S + VP++ +HF
Sbjct: 309 IIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHF 368
Query: 311 RGADVKLSRSNFFVKVSE---DIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
GA + L R N+ +V + I+C ++ +G V GN Q N V YD++ +SF
Sbjct: 369 EGATMDLPRENYVFEVEDAGSSILCLAIIEG--GEVTTIGNFQQQNMHVLYDLQNSKLSF 426
Query: 367 KPTDCTK 373
P C K
Sbjct: 427 VPAQCDK 433
>gi|255537017|ref|XP_002509575.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549474|gb|EEF50962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 459
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 113/425 (26%), Positives = 177/425 (41%), Gaps = 89/425 (20%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF----NQNSSI---------SS 76
+ +LIHRDS SP YN +++ R + L S R ++ +NS++ ++
Sbjct: 36 TTKLIHRDSIFSPAYNPNDSIKDRAKRMLKNSNARFDYVQAISKRNSAVVDYDGGDTSAA 95
Query: 77 SKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
A +A ++ +L+ SIG PP + AV DTGS L W QCEPC C+ Q PL++
Sbjct: 96 DDAYEASLLSELCTFLVNFSIGQPPVPQYAVMDTGSSLTWIQCEPC--INCHQQKGPLYN 153
Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
P SSTY S + + G +C YS +Y D + + G A E + +
Sbjct: 154 PSSSSTYVSCSDFDRTDTTFT--ATHGSDCNYSQTYADKTTTRGTYAREQLLFETPDDGI 211
Query: 197 VALPGITFGCGTNNGGL--FNSKTTGIVGLGGGDISLISQM------------------- 235
+ + FGCG NN L +G+ GLG S+IS++
Sbjct: 212 TIMHDVIFGCGHNNTQLPGPTGYASGVFGLGDSGSSIISKLGFGFSYCIGNIGDPLYGFH 271
Query: 236 RTTIAGNQRL-GVSTP--------------------------------------DIVIDS 256
R T+ ++ G STP IVIDS
Sbjct: 272 RLTLGNKLKIEGYSTPLVPRGLYYITLVGISIGQERLDIDPIVFQRVDLNGISSRIVIDS 331
Query: 257 GTTLTFLP-QGYN---SNLLSVMSSMIEAQPVADPTGSLELCY--SFN-SLSQVPEVTIH 309
G TL+++P Q YN + S++S + L LCY N L P+ T H
Sbjct: 332 GATLSYIPRQAYNVVRDKVSSILSGFLSRYRYI--ARHLSLCYIGKLNQDLQGFPDATFH 389
Query: 310 F-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTVSF 366
GAD+ F + +++++C + + G + Q + V YD++QQ + F
Sbjct: 390 LADGADLVFQVEGLFFQYTDNVLCLALVPTESDEETCLIGLLAQQYYNVAYDLKQQKLYF 449
Query: 367 KPTDC 371
+ +C
Sbjct: 450 QRIEC 454
>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
[Brachypodium distachyon]
Length = 540
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 59/133 (44%), Positives = 79/133 (59%), Gaps = 10/133 (7%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y RI IG+P + V DTGSD+ W QC PC + CY Q PLFDP +SS+Y ++P
Sbjct: 193 SGEYFSRIGIGSPARQLYMVLDTGSDVTWLQCAPC--ADCYAQSDPLFDPALSSSYATVP 250
Query: 148 CSSSQCASLNQKSC------SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
C S C +L+ +C +C Y V+YGDGS++ G+ ATET+TLG AV
Sbjct: 251 CDSPHCRALDASACHNNAANGNSSCVYEVAYGDGSYTVGDFATETLTLGGDGSAAVH--D 308
Query: 202 ITFGCGTNNGGLF 214
+ GCG +N GLF
Sbjct: 309 VAIGCGHDNEGLF 321
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 54/124 (43%), Gaps = 4/124 (3%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
+++DSGT +T L S L +A P A + CY S QVP V++
Sbjct: 417 VIVDSGTAVTRLQSSAYSALRDAFVRGTQALPRASGVSLFDTCYDLAGRSSVQVPAVSLR 476
Query: 310 FRGA-DVKLSRSNFFVKVS-EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
F G ++KL N+ + V C F +V I GN+ Q V +D + TV F
Sbjct: 477 FEGGGELKLPAKNYLIPVDGAGTYCLAFAATGGAVSIVGNVQQQGIRVSFDTAKNTVGFS 536
Query: 368 PTDC 371
P C
Sbjct: 537 PNKC 540
>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
Length = 482
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 80/231 (34%), Positives = 120/231 (51%), Gaps = 23/231 (9%)
Query: 25 QTGGFSVELIHRDSPKSPF--YNSSETPYQRLRDALTRSL-NRLNHF----NQNSSISSS 77
+ G +E+ H+DS +N + + D RSL +R+ N + S+ +
Sbjct: 62 ENGATILEMKHKDSCSGKILDWNKKLKKHLIMDDFQLRSLQSRMKSIISGRNIDDSVDAP 121
Query: 78 KASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
+ I NY++ + +G + + DTGSDL W QC+PC +CY Q P+F+P
Sbjct: 122 IPLTSGIRLQTLNYIVTVELGG--RKMTVIVDTGSDLSWVQCQPC--KRCYNQQDPVFNP 177
Query: 138 KMSSTYKSLPCSSSQCASLNQKSCS-GV------NCQYSVSYGDGSFSNGNLATETVTLG 190
S +Y+++ CSS C SL + + GV +C Y V+YGDGS++ G L TE + LG
Sbjct: 178 STSPSYRTVLCSSPTCQSLQSATGNLGVCGSNPPSCNYVVNYGDGSYTRGELGTEHLDLG 237
Query: 191 STTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
++T A+ FGCG NN GLF +G+VGLG +SLISQ G
Sbjct: 238 NST----AVNNFIFGCGRNNQGLFGG-ASGLVGLGRSSLSLISQTSAMFGG 283
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/128 (28%), Positives = 55/128 (42%), Gaps = 7/128 (5%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
++IDSGT +T LP L P A L+ C++ + +V P + +H
Sbjct: 353 MMIDSGTVITRLPPSIYQALKDEFVKQFSGFPSAPAFMILDTCFNLSGYQEVEIPNIKMH 412
Query: 310 FRG---ADVKLSRSNFFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTV 364
F G +V ++ +FVK VC ++ N V I GN Q N V YD + +
Sbjct: 413 FEGNAELNVDVTGVFYFVKTDASQVCLAIASLSYENEVGIIGNYQQKNQRVIYDTKGSML 472
Query: 365 SFKPTDCT 372
F CT
Sbjct: 473 GFAAEACT 480
>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 461
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 69/178 (38%), Positives = 93/178 (52%), Gaps = 15/178 (8%)
Query: 45 NSSETPYQRLRDALTRSLNRLN------HFNQNSSISSSKASQADIIPNNANYLIRISIG 98
+S++TP Q L R R+ H +++ S S + + + + Y RI +G
Sbjct: 66 SSNKTPEQLFHLRLQRDAKRVEALLNQIHARRSAGSSFSSSIISGLAQGSGEYFTRIGVG 125
Query: 99 TPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ 158
TP V DTGSD++W QC PC +CY Q +FDP S TY +PC + C L+
Sbjct: 126 TPARYVYMVLDTGSDVVWLQCAPC--RKCYTQTDHVFDPTKSRTYAGIPCGAPLCRRLDS 183
Query: 159 KSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLF 214
CS N CQY VSYGDGSF+ G+ +TET+T VAL GCG +N GLF
Sbjct: 184 PGCSNKNKVCQYQVSYGDGSFTFGDFSTETLTFRRNRVTRVAL-----GCGHDNEGLF 236
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 46/155 (29%), Positives = 68/155 (43%), Gaps = 12/155 (7%)
Query: 222 VGLGGGDISLISQ--MRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMI 279
+ +GG + +S R AGN ++IDSGT++T L + L
Sbjct: 313 ISVGGAPVRGLSASLFRLDAAGNG-------GVIIDSGTSVTRLTRPAYIALRDAFRIGA 365
Query: 280 EAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRGADVKLSRSNFFVKVSED-IVCSVFK 336
A + C+ + L++V P V +HFRGADV L +N+ + V C F
Sbjct: 366 SHLKRAPEFSLFDTCFDLSGLTEVKVPTVVLHFRGADVSLPATNYLIPVDNSGSFCFAFA 425
Query: 337 GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
G + + I GNI Q F + YD+ V F P C
Sbjct: 426 GTMSGLSIIGNIQQQGFRISYDLTGSRVGFAPRGC 460
>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 453
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 70/188 (37%), Positives = 97/188 (51%), Gaps = 15/188 (7%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNN 88
+++L H DS + ++TP L R R++ N ++ SS + +
Sbjct: 54 LTLDLHHLDS-----LSLNKTPTDLFNLRLHRDTLRVHALNSRAAGFSSSVVSG-LSQGS 107
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y R+ +GTPP V DTGSD++W QC PC +CY Q P+F+P S ++ +PC
Sbjct: 108 GEYFTRLGVGTPPRYLYMVLDTGSDVVWLQCSPC--RKCYSQSDPIFNPYKSKSFAGIPC 165
Query: 149 SSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
SS C L+ CS C Y VSYGDGSF+ G+ ATET+T VAL GC
Sbjct: 166 SSPLCRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVAL-----GC 220
Query: 207 GTNNGGLF 214
G +N GLF
Sbjct: 221 GHHNEGLF 228
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 59/124 (47%), Gaps = 3/124 (2%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
++IDSGT++T L + + L + CY + S +VP V +H
Sbjct: 330 VIIDSGTSVTRLTRPAYTALRDAFRVGARHLKRGPEFSLFDTCYDLSGQSSVKVPTVVLH 389
Query: 310 FRGADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
FRGAD+ L +N+ + V E+ C F G + + I GNI Q F V YD+ + F P
Sbjct: 390 FRGADMALPATNYLIPVDENGSFCFAFAGTISGLSIIGNIQQQGFRVVYDLAGSRIGFAP 449
Query: 369 TDCT 372
CT
Sbjct: 450 RGCT 453
>gi|125558627|gb|EAZ04163.1| hypothetical protein OsI_26305 [Oryza sativa Indica Group]
Length = 404
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 69/155 (44%), Positives = 95/155 (61%), Gaps = 13/155 (8%)
Query: 84 IIPNNAN-YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
++ N+A Y + +SIGTPP +ADTGS LIWTQC PC ++C + +P F P SST
Sbjct: 82 LLDNSAGAYNMNLSIGTPPVTFSVLADTGSSLIWTQCAPC--TECAARPAPPFQPASSST 139
Query: 143 YKSLPCSSSQCASLNQ--KSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
+ LPC+SS C L ++C+ C Y YG G F+ G LATET+ +G + P
Sbjct: 140 FSKLPCASSLCQFLTSPYRTCNATGCVYYYPYGMG-FTAGYLATETLHVGGAS-----FP 193
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
G+TFGC T NG + ++GIVGLG +SL+SQ+
Sbjct: 194 GVTFGCSTENG--VGNSSSGIVGLGRSPLSLVSQV 226
>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
Length = 436
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 71/162 (43%), Positives = 96/162 (59%), Gaps = 12/162 (7%)
Query: 76 SSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
SS + QA + Y + IS+GTP VADTGSDLIWTQC PC ++C+ Q +P F
Sbjct: 71 SSVSFQALLENGVGGYNMNISVGTPLLTFSVVADTGSDLIWTQCAPC--TKCFQQPAPPF 128
Query: 136 DPKMSSTYKSLPCSSSQCASL--NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
P SST+ LPC+SS C L + ++C+ C Y+ YG G ++ G LATET+ +G
Sbjct: 129 QPASSSTFSKLPCTSSFCQFLPNSIRTCNATGCVYNYKYGSG-YTAGYLATETLKVGD-- 185
Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
+ P + FGC T NG + T+GI GLG G +SLI Q+
Sbjct: 186 ---ASFPSVAFGCSTENG--VGNSTSGIAGLGRGALSLIPQL 222
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 38/134 (28%), Positives = 60/134 (44%), Gaps = 15/134 (11%)
Query: 253 VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS----QVPEVT 307
++DSGTTLT+L + GY + +S + V + T L+LC+ VP +
Sbjct: 304 IVDSGTTLTYLAKDGYEMVKQAFLSQTADVTTV-NGTRGLDLCFKSTGGGGGGIAVPSLV 362
Query: 308 IHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP--------IYGNIMQTNFLVGYDI 359
+ F G + + +F V D SV +P + GN+MQ + + YD+
Sbjct: 363 LRFDGG-AEYAVPTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDL 421
Query: 360 EQQTVSFKPTDCTK 373
+ SF P DC K
Sbjct: 422 DGGIFSFAPADCAK 435
>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
Length = 445
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 107/365 (29%), Positives = 153/365 (41%), Gaps = 89/365 (24%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
++ + +SIGTPP R + DTGSDLIWTQC+ Q ++ PL+DP SS++ + PC
Sbjct: 88 HHTLTVSIGTPPQPRTLILDTGSDLIWTQCKLFDTRQ--HREKPLYDPAKSSSFAAAPCD 145
Query: 150 SSQC--ASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
C S N K+CS C Y+ +YG + + G LA+ET T G +V+L FGCG
Sbjct: 146 GRLCETGSFNTKNCSRNKCIYTYNYGSAT-TKGELASETFTFGEHRRVSVSL---DFGCG 201
Query: 208 TNNGGLFNSKTTGIVGLGGGDISLI--------------------------------SQM 235
G +GI+G+ +SL+ S+
Sbjct: 202 KLTSGSL-PGASGILGISPDRLSLVSQLQIPRFSYCLTPFLDRNTTSHIFFGAMADLSKY 260
Query: 236 RTT--------------------------IAGNQRLGVSTPDIVI-DSGTTLTFLPQGYN 268
RTT G +RL V I G+ TF+ G
Sbjct: 261 RTTGPIQTTSLVTNPDGSNYYYYVPLIGISVGTKRLNVPVSSFAIGRDGSGGTFVDSGDT 320
Query: 269 SNLLS--VMSSMIEAQ------PVADPTG---SLELCY--------SFNSLSQVPEVTIH 309
+ +L VM ++ EA PV + T ELC+ + + QVP + H
Sbjct: 321 TGMLPSVVMEALKEAMVEAVKLPVVNATDHGYEYELCFQLPRNGGGAVETAVQVPPLVYH 380
Query: 310 FRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
F G A + L R ++ V+VS +C V I GN Q N V +D+E SF P
Sbjct: 381 FDGGAAMLLRRDSYMVEVSAGRMCLVISSGARGA-IIGNYQQQNMHVLFDVENHEFSFAP 439
Query: 369 TDCTK 373
T C +
Sbjct: 440 TQCNQ 444
>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
Length = 472
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 119/428 (27%), Positives = 175/428 (40%), Gaps = 102/428 (23%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI-IP-- 86
S+ L HR P +P SS + L + L R R +H + + S + +D+ IP
Sbjct: 61 SMPLAHRHGPCAPATTSS---WPSLAERLRRDRARRDHITRKAKASGRTTTLSDVSIPTS 117
Query: 87 -----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
++ Y++ + IGTP ++ + DTGSDL W QC+PC S CY Q PL+DP SS
Sbjct: 118 LGAAVDSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNSSSCYPQKDPLYDPTASS 177
Query: 142 TYKSLPCSSSQCASL----NQKSC---SGVN-CQYSVSYGDGSFSNGNLATETVTLGSTT 193
TY +PC S C L C SG + CQY + YG+ + G +TET+TL
Sbjct: 178 TYAPVPCDSKACKDLVPDAYDHGCTNSSGTSLCQYGIEYGNRDTTVGVYSTETLTL---- 233
Query: 194 GQAVALPGITFGCG----------------------------TNNGGLFN------SKTT 219
V++ FGCG GG F+ + TT
Sbjct: 234 SPQVSVKDFGFGCGLVQQGTFDLFDGLLGLGGAPESLVSQTAETYGGAFSYCLPPGNSTT 293
Query: 220 GIVGLGG---------------------GDISLISQMRTTIAGNQRLGVSTPDIVI---- 254
G + LG L++ ++ G + P V+
Sbjct: 294 GFLALGAPTNNNDTAGFLFTPLHSLPEQATFYLVNLTGVSVGGKP---LDIPPTVLSGGM 350
Query: 255 --DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ--VPEVTI 308
DSGT +T LP S L + + + A P+ P L+ CY+F ++ VP V +
Sbjct: 351 IIDSGTIITGLPDTAYSALRTAFRTAMSAYPLLPPNNDDVLDTCYNFTGIANVTVPTVAL 410
Query: 309 HFRGADVKLSRSNFFVKVSEDIV---CSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQT 363
F G + + V ++ C F G + V I GN+ Q F V YD +
Sbjct: 411 TFDGG------ATIDLDVPSGVLIQDCLAFAGGASDGDVGIIGNVNQRTFEVLYDSGRGH 464
Query: 364 VSFKPTDC 371
V F+P C
Sbjct: 465 VGFRPGAC 472
>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 60/127 (47%), Positives = 77/127 (60%), Gaps = 7/127 (5%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y R+ IG P E V DTGSD+ W QC PC + CY Q P+F+P SS+Y+ L
Sbjct: 145 SGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTPC--ADCYHQTEPIFEPSSSSSYEPLS 202
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
C + QC +L C C Y VSYGDGS++ G+ ATET+T+GST Q VA+ GCG
Sbjct: 203 CDTPQCNALEVSECRNATCLYEVSYGDGSYTVGDFATETLTIGSTLVQNVAV-----GCG 257
Query: 208 TNNGGLF 214
+N GLF
Sbjct: 258 HSNEGLF 264
Score = 46.6 bits (109), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 36/125 (28%), Positives = 58/125 (46%), Gaps = 6/125 (4%)
Query: 252 IVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTI 308
I+IDSGT +T L + YNS S + ++ + A + CY+ ++ + +VP V
Sbjct: 360 IIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAG-VAMFDTCYNLSAKTTVEVPTVAF 418
Query: 309 HFRGAD-VKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
HF G + L N+ + V S C F +S+ I GN+ Q V +D+ + F
Sbjct: 419 HFPGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGF 478
Query: 367 KPTDC 371
C
Sbjct: 479 SSNKC 483
>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 57/129 (44%), Positives = 78/129 (60%), Gaps = 9/129 (6%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y R+ +GTPP V DTGSD++W QC+PC ++CY Q +FDP S ++ +P
Sbjct: 127 SGEYFTRLGVGTPPKYLYMVLDTGSDVVWLQCKPC--TKCYSQTDQIFDPSKSKSFAGIP 184
Query: 148 CSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
C S C L+ CS N CQY VSYGDGSF+ G+ +TET+T + A+P + G
Sbjct: 185 CYSPLCRRLDSPGCSLKNNLCQYQVSYGDGSFTFGDFSTETLTF-----RRAAVPRVAIG 239
Query: 206 CGTNNGGLF 214
CG +N GLF
Sbjct: 240 CGHDNEGLF 248
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 42/123 (34%), Positives = 58/123 (47%), Gaps = 3/123 (2%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
++IDSGT++T L + +L A + CY + LS+V P V +H
Sbjct: 350 VIIDSGTSVTRLTRPAYVSLRDAFRVGASHLKRAPEFSLFDTCYDLSGLSEVKVPTVVLH 409
Query: 310 FRGADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
FRGADV L +N+ V V C F G + + I GNI Q F V +D+ V F P
Sbjct: 410 FRGADVSLPAANYLVPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRVVFDLAGSRVGFAP 469
Query: 369 TDC 371
C
Sbjct: 470 RGC 472
>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 391
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 67/153 (43%), Positives = 83/153 (54%), Gaps = 11/153 (7%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
YL+ ++IGTPP DTGSDLIWTQC+PCP C+ Q P FDP SST C
Sbjct: 34 EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCP--ACFDQALPYFDPSTSSTLSLTSCD 91
Query: 150 SSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
S+ C L SC C Y+ SYGD S + G L + T G ++PG+
Sbjct: 92 STLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTF---VGAGASVPGVA 148
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
FGCG N G+F S TGI G G G +SL SQ++
Sbjct: 149 FGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLK 181
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 64/128 (50%), Gaps = 9/128 (7%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHF 310
+IDSGT++T LP + ++ I+ V C+S S ++ VP++ +HF
Sbjct: 265 IIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHYTCFSAPSQAKPDVPKLVLHF 324
Query: 311 RGADVKLSRSNFFVKVSED----IVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
GA + L R N+ +V +D I+C ++ KG + I GN Q N V YD++ +S
Sbjct: 325 EGATMDLPRENYVFEVPDDAGNSIICLAINKG--DETTIIGNFQQQNMHVLYDLQNNMLS 382
Query: 366 FKPTDCTK 373
F C K
Sbjct: 383 FVAAQCDK 390
>gi|115472519|ref|NP_001059858.1| Os07g0533800 [Oryza sativa Japonica Group]
gi|113611394|dbj|BAF21772.1| Os07g0533800 [Oryza sativa Japonica Group]
Length = 458
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 69/169 (40%), Positives = 102/169 (60%), Gaps = 16/169 (9%)
Query: 52 QRLRDALTRSLNRLNHFNQNSSISSSKASQADII--------PNNANYLIRISIGTPPTE 103
Q +RDAL R ++R F + + SSS +S A + PN Y++ ++IGTPP
Sbjct: 50 QFVRDALRRDMHRRARFGRELASSSSSSSPAGTVSAPTRKDLPNGGEYIMTLAIGTPPQS 109
Query: 104 RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS--QCASLNQKSC 161
A+ADTGSDL+WTQC PC +C+ Q SPL++P S T++ LPCSS+ CA+ + +
Sbjct: 110 YPAIADTGSDLVWTQCAPC-GERCFKQPSPLYNPSSSPTFRVLPCSSALNLCAAEARLAG 168
Query: 162 S----GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
+ G C+Y+ +YG G +++G +ET T GS+ V +PGI FGC
Sbjct: 169 ATPPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQVRVPGIAFGC 216
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 71/129 (55%), Gaps = 10/129 (7%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ----VPE 305
++IDSGTT+T L + + + S+++ PV D + + L+LC++ S S +P
Sbjct: 330 LIIDSGTTITSLVDAAYKRVRAAVRSLVKL-PVTDGSNATGLDLCFALPSSSAPPATLPS 388
Query: 306 VTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQT 363
+T+HF G AD+ L N+ + + + C + T+ + GN Q N + YD++++T
Sbjct: 389 MTLHFGGGADMVLPVENYMI-LDGGMWCLAMRSQTDGELSTLGNYQQQNLHILYDVQKET 447
Query: 364 VSFKPTDCT 372
+SF P C+
Sbjct: 448 LSFAPAKCS 456
>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 82/206 (39%), Positives = 110/206 (53%), Gaps = 26/206 (12%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQ-RL-RDA---------LTRSLNRLNHFNQNSSISSS 77
FS+ L R + +P Y T + RL RDA L RSLN HF + SI+ S
Sbjct: 69 FSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFGE--SINES 126
Query: 78 KASQADIIP--------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ-CY 128
+ P + A YL +I +G P V DTGSD+ W QC+PC CY
Sbjct: 127 LIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCY 186
Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVT 188
Q P+FDPK SS+Y L C+S QC L++ +C+ C Y V YGDGSF+ G LATET++
Sbjct: 187 KQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGELATETLS 246
Query: 189 LGSTTGQAVALPGITFGCGTNNGGLF 214
G++ ++P + GCG +N GLF
Sbjct: 247 FGNSN----SIPNLPIGCGHDNEGLF 268
Score = 46.6 bits (109), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 32/124 (25%), Positives = 52/124 (41%), Gaps = 4/124 (3%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHF- 310
I++DSGT ++ LP +L + + A + CY+F+ S V TI F
Sbjct: 364 IIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTIAFV 423
Query: 311 --RGADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
G ++L N+ + + + C F +S+ I G+ Q V YD+ V F
Sbjct: 424 LSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSIVGFS 483
Query: 368 PTDC 371
C
Sbjct: 484 TNKC 487
>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 60/127 (47%), Positives = 77/127 (60%), Gaps = 7/127 (5%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y R+ IG P E V DTGSD+ W QC PC + CY Q P+F+P SS+Y+ L
Sbjct: 148 SGEYFTRVGIGNPAREVYMVLDTGSDVNWLQCTPC--ADCYHQTEPIFEPSSSSSYEPLS 205
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
C + QC +L C C Y VSYGDGS++ G+ ATET+T+GST Q VA+ GCG
Sbjct: 206 CDTPQCNALEVSECRNATCLYEVSYGDGSYTVGDFATETLTIGSTLVQNVAV-----GCG 260
Query: 208 TNNGGLF 214
+N GLF
Sbjct: 261 HSNEGLF 267
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 60/128 (46%), Gaps = 12/128 (9%)
Query: 252 IVIDSGTTLTFLPQG-YNS---NLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPE 305
I+IDSGT +T L G YNS + L S + +A VA + CY+ ++ + +VP
Sbjct: 363 IIIDSGTAVTRLQTGIYNSLRDSFLKGTSDLEKAAGVA----MFDTCYNLSAKTTIEVPT 418
Query: 306 VTIHFRGAD-VKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 363
V HF G + L N+ + V S C F +S+ I GN+ Q V +D+
Sbjct: 419 VAFHFPGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSL 478
Query: 364 VSFKPTDC 371
+ F C
Sbjct: 479 IGFSSNKC 486
>gi|125558632|gb|EAZ04168.1| hypothetical protein OsI_26310 [Oryza sativa Indica Group]
gi|125600539|gb|EAZ40115.1| hypothetical protein OsJ_24558 [Oryza sativa Japonica Group]
Length = 453
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 69/169 (40%), Positives = 102/169 (60%), Gaps = 16/169 (9%)
Query: 52 QRLRDALTRSLNRLNHFNQNSSISSSKASQADII--------PNNANYLIRISIGTPPTE 103
Q +RDAL R ++R F + + SSS +S A + PN Y++ ++IGTPP
Sbjct: 45 QFVRDALRRDMHRRARFGRELASSSSSSSPAGTVSAPTRKDLPNGGEYIMTLAIGTPPQS 104
Query: 104 RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS--QCASLNQKSC 161
A+ADTGSDL+WTQC PC +C+ Q SPL++P S T++ LPCSS+ CA+ + +
Sbjct: 105 YPAIADTGSDLVWTQCAPC-GERCFKQPSPLYNPSSSPTFRVLPCSSALNLCAAEARLAG 163
Query: 162 S----GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
+ G C+Y+ +YG G +++G +ET T GS+ V +PGI FGC
Sbjct: 164 ATPPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQVRVPGIAFGC 211
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 71/129 (55%), Gaps = 10/129 (7%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ----VPE 305
++IDSGTT+T L + + + S+++ PV D + + L+LC++ S S +P
Sbjct: 325 LIIDSGTTITSLVDAAYKRVRAAVRSLVKL-PVTDGSNATGLDLCFALPSSSAPPATLPS 383
Query: 306 VTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQT 363
+T+HF G AD+ L N+ + + + C + T+ + GN Q N + YD++++T
Sbjct: 384 MTLHFGGGADMVLPVENYMI-LDGGMWCLAMRSQTDGELSTLGNYQQQNLHILYDVQKET 442
Query: 364 VSFKPTDCT 372
+SF P C+
Sbjct: 443 LSFAPAKCS 451
>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
Length = 435
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 71/162 (43%), Positives = 96/162 (59%), Gaps = 12/162 (7%)
Query: 76 SSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
SS + QA + Y + IS+GTP VADTGSDLIWTQC PC ++C+ Q +P F
Sbjct: 71 SSVSFQALLENGVGGYNMNISVGTPLLTFPVVADTGSDLIWTQCAPC--TKCFQQPAPPF 128
Query: 136 DPKMSSTYKSLPCSSSQCASL--NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
P SST+ LPC+SS C L + ++C+ C Y+ YG G ++ G LATET+ +G
Sbjct: 129 QPASSSTFSKLPCTSSFCQFLPNSIRTCNATGCVYNYKYGSG-YTAGYLATETLKVGD-- 185
Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
+ P + FGC T NG + T+GI GLG G +SLI Q+
Sbjct: 186 ---ASFPSVAFGCSTENG--VGNSTSGIAGLGRGALSLIPQL 222
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 38/133 (28%), Positives = 59/133 (44%), Gaps = 14/133 (10%)
Query: 253 VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS---QVPEVTI 308
++DSGTTLT+L + GY + +S V + T L+LC+ VP + +
Sbjct: 304 IVDSGTTLTYLAKDGYEMVKQAFLSQTANVTTV-NGTRGLDLCFKSTGGGGGIAVPSLVL 362
Query: 309 HFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP--------IYGNIMQTNFLVGYDIE 360
F G + + +F V D SV +P + GN+MQ + + YD++
Sbjct: 363 RFDGG-AEYAVPTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLD 421
Query: 361 QQTVSFKPTDCTK 373
SF P DC K
Sbjct: 422 GGIFSFSPADCAK 434
>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 82/206 (39%), Positives = 110/206 (53%), Gaps = 26/206 (12%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQ-RL-RDA---------LTRSLNRLNHFNQNSSISSS 77
FS+ L R + +P Y T + RL RDA L RSLN HF + SI+ S
Sbjct: 69 FSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFGE--SINES 126
Query: 78 KASQADIIP--------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ-CY 128
+ P + A YL +I +G P V DTGSD+ W QC+PC CY
Sbjct: 127 LIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCY 186
Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVT 188
Q P+FDPK SS+Y L C+S QC L++ +C+ C Y V YGDGSF+ G LATET++
Sbjct: 187 KQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGELATETLS 246
Query: 189 LGSTTGQAVALPGITFGCGTNNGGLF 214
G++ ++P + GCG +N GLF
Sbjct: 247 FGNSN----SIPNLPIGCGHDNEGLF 268
Score = 46.2 bits (108), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 32/124 (25%), Positives = 52/124 (41%), Gaps = 4/124 (3%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHF- 310
I++DSGT ++ LP +L + + A + CY+F+ S V TI F
Sbjct: 364 IIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTIAFV 423
Query: 311 --RGADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
G ++L N+ + + + C F +S+ I G+ Q V YD+ V F
Sbjct: 424 LSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLVGFS 483
Query: 368 PTDC 371
C
Sbjct: 484 TNKC 487
>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 496
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 103/208 (49%), Gaps = 32/208 (15%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRLR-DALTRSLNRLNHFNQNSSISSSKASQADIIP- 86
FS+EL P+ + S Y+ L L R R+ N ++ S ++D++P
Sbjct: 80 FSLEL----HPRELLHGGSHKDYRALMLSRLARDSARVKAINTKLQLAVSGTDKSDLVPM 135
Query: 87 --------------------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
+ Y +R+ IG P V DTGSD+ W QC+PC
Sbjct: 136 DTEILHPQDFSTPVTSGTSQGSGEYFLRVGIGRPSKTFYMVIDTGSDVNWLQCKPC--DD 193
Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATET 186
CY Q P+FDP SS++ L C + QC +L+ +C +C Y VSYGDGS++ G+ ATET
Sbjct: 194 CYQQVDPIFDPASSSSFSRLGCQTPQCRNLDVFACRNDSCLYQVSYGDGSYTVGDFATET 253
Query: 187 VTLGSTTGQAVALPGITFGCGTNNGGLF 214
V+ G++ ++ + GCG +N GLF
Sbjct: 254 VSFGNSG----SVDKVAIGCGHDNEGLF 277
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 47/158 (29%), Positives = 69/158 (43%), Gaps = 12/158 (7%)
Query: 220 GIVGLG-GGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLP-QGYNSNLLSVMSS 277
GI G+ GG+ I + G+ + G I++D GT +T L Q YN+ L
Sbjct: 345 GITGMSVGGEKLAIPPSIFEVDGSGKGG-----IIVDCGTAVTRLQTQAYNA-LRDTFVK 398
Query: 278 MIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGAD-VKLSRSNFFVKV-SEDIVCS 333
+ + P + CY+ +S + +VP V F G + L SN+ + V S C
Sbjct: 399 LTKDLPSTSGFALFDTCYNLSSRTSVRVPTVAFLFDGGKSLPLPPSNYLIPVDSAGTFCL 458
Query: 334 VFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
F T S+ I GN+ Q V YD+ VSF C
Sbjct: 459 AFAPTTASLSIIGNVQQQGTRVTYDLANSQVSFSSRKC 496
>gi|22831049|dbj|BAC15912.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508281|dbj|BAD32130.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 453
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 69/169 (40%), Positives = 102/169 (60%), Gaps = 16/169 (9%)
Query: 52 QRLRDALTRSLNRLNHFNQNSSISSSKASQADII--------PNNANYLIRISIGTPPTE 103
Q +RDAL R ++R F + + SSS +S A + PN Y++ ++IGTPP
Sbjct: 45 QFVRDALRRDMHRRARFGRELASSSSSSSPAGTVSAPTRKDLPNGGEYIMTLAIGTPPQS 104
Query: 104 RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS--QCASLNQKSC 161
A+ADTGSDL+WTQC PC +C+ Q SPL++P S T++ LPCSS+ CA+ + +
Sbjct: 105 YPAIADTGSDLVWTQCAPC-GERCFKQPSPLYNPSSSPTFRVLPCSSALNLCAAEARLAG 163
Query: 162 S----GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
+ G C+Y+ +YG G +++G +ET T GS+ V +PGI FGC
Sbjct: 164 ATPPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQVRVPGIAFGC 211
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 71/129 (55%), Gaps = 10/129 (7%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ----VPE 305
++IDSGTT+T L + + + S+++ PV D + + L+LC++ S S +P
Sbjct: 325 LIIDSGTTITSLVDAAYKRVRAAVRSLVKL-PVTDGSNATGLDLCFALPSSSAPPATLPS 383
Query: 306 VTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQT 363
+T+HF G AD+ L N+ + + + C + T+ + GN Q N + YD++++T
Sbjct: 384 MTLHFGGGADMVLPVENYMI-LDGGMWCLAMRSQTDGELSTLGNYQQQNLHILYDVQKET 442
Query: 364 VSFKPTDCT 372
+SF P C+
Sbjct: 443 LSFAPAKCS 451
>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
Length = 485
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 58/129 (44%), Positives = 76/129 (58%), Gaps = 9/129 (6%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y R+ +GTP V DTGSD++W QC PC +CY Q P+FDP+ S TY ++P
Sbjct: 139 SGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPC--RRCYSQSDPIFDPRKSKTYATIP 196
Query: 148 CSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
CSS C L+ C+ C Y VSYGDGSF+ G+ +TET+T + VAL G
Sbjct: 197 CSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVAL-----G 251
Query: 206 CGTNNGGLF 214
CG +N GLF
Sbjct: 252 CGHDNEGLF 260
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 37/123 (30%), Positives = 58/123 (47%), Gaps = 3/123 (2%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
++IDSGT++T L + + + A + C+ +++++V P V +H
Sbjct: 362 VIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPNFSLFDTCFDLSNMNEVKVPTVVLH 421
Query: 310 FRGADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
FR ADV L +N+ + V + C F G + I GNI Q F V YD+ V F P
Sbjct: 422 FRRADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAP 481
Query: 369 TDC 371
C
Sbjct: 482 GGC 484
>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 470
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 75/232 (32%), Positives = 111/232 (47%), Gaps = 25/232 (10%)
Query: 23 EAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDAL-TRSLNRLNHFNQNSSISSSKASQ 81
+ G +E+ R Q + D L RS+ NH + +S S S
Sbjct: 48 RKEKGAIILEMKDRGECSESERKGDWVEKQLVLDGLHVRSIQ--NHIRKRTSSSQIADSS 105
Query: 82 ADIIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
+P NY++ + +G+ + DTGSDL W QCEPC CY Q+ PL
Sbjct: 106 ETQVPLTSGIKFQTLNYIVTMGLGSQNMS--VIVDTGSDLTWVQCEPC--RSCYNQNGPL 161
Query: 135 FDPKMSSTYKSLPCSSSQCASLNQKSC-----SGVNCQYSVSYGDGSFSNGNLATETVTL 189
F P S +Y+ + C+S+ C SL +C + C Y V+YGDGS+++G L E +
Sbjct: 162 FKPSTSPSYQPILCNSTTCQSLELGACGSDPSTSATCDYVVNYGDGSYTSGELGIEKLGF 221
Query: 190 GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
G +++ FGCG NN GLF +G++GLG ++S+ISQ T G
Sbjct: 222 G-----GISVSNFVFGCGRNNKGLFGG-ASGLMGLGRSELSMISQTNATFGG 267
Score = 43.9 bits (102), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 34/128 (26%), Positives = 56/128 (43%), Gaps = 7/128 (5%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
+++DSGT ++ L L + P A L+ C++ QV P ++++
Sbjct: 341 VILDSGTVISRLAPSVYKALKAKFLEQFSGFPSAPGFSILDTCFNLTGYDQVNIPTISMY 400
Query: 310 FRG-ADVKLSRSNFFVKVSEDI--VCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTV 364
F G A++ + + F V ED VC +++ + I GN Q N V YD + V
Sbjct: 401 FEGNAELNVDATGIFYLVKEDASRVCLALASLSDEYEMGIIGNYQQRNQRVLYDAKLSQV 460
Query: 365 SFKPTDCT 372
F CT
Sbjct: 461 GFAKEPCT 468
>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 58/129 (44%), Positives = 76/129 (58%), Gaps = 9/129 (6%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y R+ +GTP V DTGSD++W QC PC +CY Q P+FDP+ S TY ++P
Sbjct: 139 SGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPC--RRCYSQSDPIFDPRKSKTYATIP 196
Query: 148 CSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
CSS C L+ C+ C Y VSYGDGSF+ G+ +TET+T + VAL G
Sbjct: 197 CSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVAL-----G 251
Query: 206 CGTNNGGLF 214
CG +N GLF
Sbjct: 252 CGHDNEGLF 260
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 38/123 (30%), Positives = 59/123 (47%), Gaps = 3/123 (2%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
++IDSGT++T L + + + A + C+ +++++V P V +H
Sbjct: 362 VIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLH 421
Query: 310 FRGADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
FRGADV L +N+ + V + C F G + I GNI Q F V YD+ V F P
Sbjct: 422 FRGADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAP 481
Query: 369 TDC 371
C
Sbjct: 482 GGC 484
>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
Length = 385
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 70/157 (44%), Positives = 93/157 (59%), Gaps = 11/157 (7%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
N YLI + +G+P T + + DTGSD+ W QC+PC SQC+ Q PLFDP SSTY
Sbjct: 48 NTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPC--SQCHSQADPLFDPSSSSTYSPF 105
Query: 147 PCSSSQCASLNQKS---CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
C S+ CA L Q+ S CQY V+YGDGS + G +++T+ LGS+ A+
Sbjct: 106 SCGSADCAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALGSS-----AVRSFQ 160
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
FGC G FN +T G++GLGGG SL+SQ T+
Sbjct: 161 FGCSNVESG-FNDQTDGLMGLGGGAQSLVSQTAGTLG 196
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 43/124 (34%), Positives = 61/124 (49%), Gaps = 10/124 (8%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHF 310
V+DSGT +T LP S L S + ++ P A P+G L+ C+ F+ S V P V + F
Sbjct: 267 VMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVF 326
Query: 311 R-GADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFK 367
GA V L S + C F G ++ S+ I GN+ Q F V YD+ + V F+
Sbjct: 327 SGGAVVSLDASGIILS-----NCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFR 381
Query: 368 PTDC 371
C
Sbjct: 382 AGAC 385
>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
Length = 509
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 55/130 (42%), Positives = 79/130 (60%), Gaps = 8/130 (6%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ Y R+ IG+P E V DTGSD+ W QC+PC + CY Q P+FDP +S++Y ++
Sbjct: 165 GSGEYFSRVGIGSPARELYMVLDTGSDVTWVQCQPC--ADCYQQSDPVFDPSLSASYAAV 222
Query: 147 PCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
C S +C L+ +C C Y V+YGDGS++ G+ ATET+TLG +T + +
Sbjct: 223 SCDSPRCRDLDTAACRNATGACLYEVAYGDGSYTVGDFATETLTLGDST----PVTNVAI 278
Query: 205 GCGTNNGGLF 214
GCG +N GLF
Sbjct: 279 GCGHDNEGLF 288
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 31/124 (25%), Positives = 51/124 (41%), Gaps = 4/124 (3%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
+++DSGT +T L + L + P + CY + + +VP V++
Sbjct: 386 VIVDSGTAVTRLQSSAYAALRDAFVRGTPSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLR 445
Query: 310 FRGAD-VKLSRSNFFVKVS-EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
F G ++L N+ + V C F +V I GN+ Q V +D + V F
Sbjct: 446 FEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKGVVGFT 505
Query: 368 PTDC 371
P C
Sbjct: 506 PNKC 509
>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
Length = 440
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 74/191 (38%), Positives = 103/191 (53%), Gaps = 14/191 (7%)
Query: 52 QRLRDALTRSLNRLNHFNQNSSISS-SKASQADIIPNNANYLIRISIGTPPTERLAVADT 110
+ +R RS R +S+ + S + D +P YL+ ++IGTPP DT
Sbjct: 52 ELMRRMALRSKARAPRLLSSSATAPVSPGAYDDGVPMT-EYLLHLAIGTPPQPVQLTLDT 110
Query: 111 GSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN----- 165
GSDL+WTQC+PC + C+ Q P +D SST+ C S+QC L+ VN
Sbjct: 111 GSDLVWTQCQPC--AVCFNQSLPYYDASRSSTFALPSCDSTQC-KLDPSVTMCVNQTVQT 167
Query: 166 CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLG 225
C +S SYGD S + G L ETV+ + ++PG+ FGCG NN G+F S TGI G G
Sbjct: 168 CAFSYSYGDKSATIGFLDVETVSFVA----GASVPGVVFGCGLNNTGIFRSNETGIAGFG 223
Query: 226 GGDISLISQMR 236
G +SL SQ++
Sbjct: 224 RGPLSLPSQLK 234
Score = 57.8 bits (138), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 40/126 (31%), Positives = 62/126 (49%), Gaps = 6/126 (4%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV-ADPTGSLELCYSFNSLSQ---VPEVTI 308
+IDSGT T LP + ++ ++ V ++ TG L LC+S L + VP++ +
Sbjct: 315 IIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPL-LCFSAPPLGKAPHVPKLVL 373
Query: 309 HFRGADVKLSRSNFFVKVSEDIVCSV-FKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
HF GA + L R N+ + + CS+ I + I GN Q N V YD++ +SF
Sbjct: 374 HFEGATMHLPRENYVFEAKDGGNCSICLAIIEGEMTIIGNFQQQNMHVLYDLKNSKLSFV 433
Query: 368 PTDCTK 373
C K
Sbjct: 434 RAKCDK 439
>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 365
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 61/161 (37%), Positives = 85/161 (52%), Gaps = 3/161 (1%)
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
A + YL + +GTP + DTGSDL W QC PC +CY Q+ LF P S+
Sbjct: 4 APVAAARGEYLATVRLGTPERVFSVIVDTGSDLTWVQCSPC--GKCYSQNDALFLPNTST 61
Query: 142 TYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
++ L C S+ C L C+ C Y SYGDGS + G+ +T+T+ GQ +P
Sbjct: 62 SFTKLACGSALCNGLPFPMCNQTTCVYWYSYGDGSLTTGDFVYDTITMDGINGQKQQVPN 121
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGN 242
FGCG +N G F + GI+GLG G +S SQ+++ G
Sbjct: 122 FAFGCGHDNEGSF-AGADGILGLGQGPLSFHSQLKSVYNGK 161
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 41/127 (32%), Positives = 63/127 (49%), Gaps = 6/127 (4%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVM--SSMIEAQPVADPTGSLELCYS---FNSLSQVPEVT 307
+ DSGTT+T L + +L+ M S+M ++ + D L+LC S + L VP +T
Sbjct: 239 IFDSGTTVTQLAEAAYKEVLAAMNASTMAYSRKIDD-ISRLDLCLSGFPKDQLPTVPAMT 297
Query: 308 IHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
HF G D+ L SN+F+ + + V I G++ Q NF V YD + + F
Sbjct: 298 FHFEGGDMVLPPSNYFIYLESSQSYCFAMTSSPDVNIIGSVQQQNFQVYYDTAGRKLGFV 357
Query: 368 PTDCTKQ 374
P DC +
Sbjct: 358 PKDCVGR 364
>gi|297741705|emb|CBI32837.3| unnamed protein product [Vitis vinifera]
Length = 455
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 113/444 (25%), Positives = 184/444 (41%), Gaps = 82/444 (18%)
Query: 5 LSCVFILFFLCF-YVVSP---IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
++ V +L +CF ++ SP + + GFS LIH SP SP+ N + AL
Sbjct: 15 MASVNLLLIICFTFIFSPCISAASDSKGFSTNLIHIHSPSSPYKNVKAESLAK-DTALES 73
Query: 61 SLNRLNHFN--QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
+L+R + Q ++ + +I + + +L +SIG PPT V DTGSDL W Q
Sbjct: 74 TLSRHAYLRARQQKALQPADFVPPPLIRDKSAFLANLSIGNPPTNVYVVLDTGSDLFWIQ 133
Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK-SCSGV-NCQYSVSYGDGS 176
CEPC CY Q P+++ S +Y + C+ C SL ++ CS +C Y SY DGS
Sbjct: 134 CEPC--DVCYKQKDPIYNRTKSDSYTEMLCNEPPCLSLGREGQCSDSGSCLYQTSYADGS 191
Query: 177 FSNGNLATETVTLGSTTGQAVALPGITFGCGTNN-------------------------- 210
++G L+ E V S + FGCG N
Sbjct: 192 RTSGLLSYEKVAFTSHYSDEDKTAQVGFGCGLQNLNFVTSSRDGGVLGLGPGLVSLVSQL 251
Query: 211 --------------GGLFNSKTTGIVGLG-----GGDIS--LISQMRTTIAGNQRLGVST 249
G L N G + G GD++ +I++ LGV
Sbjct: 252 SAIGKVSKSFAYCFGNLSNPNAGGFLVFGDATYLNGDMTPMVIAEFYYVNLLGIGLGVEE 311
Query: 250 PD-----------------IVIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPTGSL 291
P ++IDSG+TL+ F P+ Y +V+ + + ++ T S
Sbjct: 312 PRLDINSSSFERKPDGSGGVIIDSGSTLSIFPPEVYEVVRNAVVDKLKKGYNISPLTSSP 371
Query: 292 ELCYSFN---SLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNI 348
+ C+ L P + ++ + R + F++ +++ C F + I G +
Sbjct: 372 D-CFEGKIGRDLPLFPTLVLYLESTGILNDRWSIFLQRYDELFCLGFTS-GEGLSIIGTL 429
Query: 349 MQTNFLVGYDIEQQTVSFKPT-DC 371
Q ++ GY++E T+S + DC
Sbjct: 430 AQQSYKFGYNLELSTLSIESNPDC 453
>gi|115483166|ref|NP_001065176.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|21717159|gb|AAM76352.1|AC074196_10 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433285|gb|AAP54823.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639785|dbj|BAF27090.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|215692411|dbj|BAG87831.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 94/352 (26%), Positives = 153/352 (43%), Gaps = 77/352 (21%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY+ +IGTPP AV D +L+WTQC+ C +C+ Q +PLFDP S+TY++ PC
Sbjct: 50 NYVANFTIGTPPQPASAVIDLAGELVWTQCKQC--GRCFEQGTPLFDPTASNTYRAEPCG 107
Query: 150 SSQCASL--NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
+ C S+ + ++CSG C Y S G + G + T+T +G+ A + FGC
Sbjct: 108 TPLCESIPSDVRNCSGNVCAYEASTNAGD-TGGKVGTDTFAVGT------AKASLAFGCV 160
Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTT----------------------------- 238
+ +GIVGLG SL++Q
Sbjct: 161 VASDIDTMGGPSGIVGLGRTPWSLVTQTGVAAFSYCLAPHDAGKNSALFLGSSAKLAGGG 220
Query: 239 ---------IAGN----------QRLGVSTPDIVI-----------DSGTTLTFLPQGYN 268
I+GN Q G+ D +I D+ + ++FL G
Sbjct: 221 KAASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGSTVLLDTFSPISFLVDGAY 280
Query: 269 SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHFR-GADVKLSRSNFFVKV 326
+ ++ + A P+A P +LC+ + S P++ FR GA + + +N+ +
Sbjct: 281 QAVKKAVTVAVGAPPMATPVEPFDLCFPKSGASGAAPDLVFTFRGGAAMTVPATNYLLDY 340
Query: 327 SEDIVC-----SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
VC S T + + G++ Q N +D++++T+SF+P DCTK
Sbjct: 341 KNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPADCTK 392
>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 458
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 120/423 (28%), Positives = 167/423 (39%), Gaps = 95/423 (22%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRL--RDAL-TRSLNRLNHFNQNSSISSSKASQADI 84
G +V L HR P SP ++ E L RD L + + N S + S A
Sbjct: 52 GTTVPLSHRHGPCSPAPSTVEPTMAELLRRDQLRAKYIQAKLSVNSGSGTDGVQQSAAIT 111
Query: 85 IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
+P + Y+I +SIGTP + + DTGSD+ W C ++ S FDP
Sbjct: 112 LPTTLGSALDTLAYVITVSIGTPAMTQAVMIDTGSDVSWVHCH----ARAGAGSSLFFDP 167
Query: 138 KMSSTYKSLPCSSSQCASLNQK--SCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
SSTY CSS+ C L + CS CQY+V YGDGS + G ++T+ L ST
Sbjct: 168 GKSSTYTPFSCSSAACTRLEGRDNGCSLNSTCQYTVRYGDGSNTTGTYGSDTLALNSTE- 226
Query: 195 QAVALPGITFGCGTNNG---GLFNSKTTGIVGLGGGDISLISQMRTTI------------ 239
+ FGC + GL +T G++GLGGG SL+SQ T
Sbjct: 227 ---KVENFQFGCSETSDPGEGLDEDQTDGLMGLGGGAPSLVSQTAATYGSAFSYCLPATT 283
Query: 240 --AGNQRLGVST-------------------------------------PDI-----VID 255
+G LG ST P + ++D
Sbjct: 284 RSSGFLTLGASTGTSGFVTTPMFRSRRAPTFYFVILQGINVGGDPVAISPTVFAAGSIMD 343
Query: 256 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRGA 313
SGT +T LP S L + + + P A L+ C+ F V P V + F G
Sbjct: 344 SGTIITRLPPRAYSALSAAFRAGMRRYPRARAFSILDTCFDFTGQDNVSIPAVELVFSGG 403
Query: 314 DVKLSRSNFFVKVSEDIV----CSVFKGITNSV-PIYGNIMQTNFLVGYDIEQQTVSFKP 368
V V + D + C F T + I GN+ Q F V +D+ Q + F+P
Sbjct: 404 AV--------VDLDADGIMYGSCLAFAPATGGIGSIIGNVQQRTFEVLHDVGQSVLGFRP 455
Query: 369 TDC 371
C
Sbjct: 456 GAC 458
>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 479
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 58/127 (45%), Positives = 75/127 (59%), Gaps = 7/127 (5%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y R+ IG P + V DTGSD+ W QC PC + CY Q P+F+P S++Y L
Sbjct: 141 SGEYFSRVGIGKPSSPVYMVLDTGSDVNWIQCAPC--ADCYHQADPIFEPASSTSYSPLS 198
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
C + QC SL+ C C Y VSYGDGS++ G+ TET+TLGS + VA+ GCG
Sbjct: 199 CDTKQCQSLDVSECRNNTCLYEVSYGDGSYTVGDFVTETITLGSASVDNVAI-----GCG 253
Query: 208 TNNGGLF 214
NN GLF
Sbjct: 254 HNNEGLF 260
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 56/124 (45%), Gaps = 4/124 (3%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
I+IDSGT +T L + L + PV + CY + + +VP VT H
Sbjct: 356 IIIDSGTAVTRLQTAAYNALRDAFVKGTKDLPVTSEVALFDTCYDLSRKTSVEVPTVTFH 415
Query: 310 FRGADV-KLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
G V L +N+ + V D C F ++++ I GN+ Q VG+D+ V F+
Sbjct: 416 LAGGKVLPLPATNYLIPVDSDGTFCFAFAPTSSALSIIGNVQQQGTRVGFDLANSLVGFE 475
Query: 368 PTDC 371
P C
Sbjct: 476 PRQC 479
>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 372
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 69/164 (42%), Positives = 93/164 (56%), Gaps = 14/164 (8%)
Query: 86 PNNANY---LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
P +A Y L+ I +GTPP + + + DTGSDL W Q EPC C+ Q P+FDP SST
Sbjct: 17 PESAGYGEFLVPIYLGTPPQKAVVIIDTGSDLTWIQSEPC--RACFEQADPIFDPSKSST 74
Query: 143 YKSLPCSSSQCASL--NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
Y + CSSS CA L Q + NC Y+ YGDGS + G + ET+T T G+ V
Sbjct: 75 YNKIACSSSACADLLGTQTCSAAANCIYAYGYGDGSVTRGYFSKETITATDTAGEEVK-- 132
Query: 201 GITFGCGTNNGGLF-NSKTTGIVGLGGGDISLISQMRTTIAGNQ 243
FG N G F ++ GI+GLG G +S+ SQ+ + + GN+
Sbjct: 133 ---FGASVYNTGTFGDTGGEGILGLGQGPVSMPSQLGSVL-GNK 172
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 40/122 (32%), Positives = 66/122 (54%), Gaps = 4/122 (3%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHF 310
+IDSGTT+T+L Q + L++ +S + TG L+LC++ P +TIH
Sbjct: 249 IIDSGTTITYLQQEVFNALVAAYTSQVRYPTTTSATG-LDLCFNTRGTGSPVFPAMTIHL 307
Query: 311 RGADVKLSRSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTVSFKPT 369
G ++L +N F+ + +I+C F + + I+GNI Q NF + YD++ + F P
Sbjct: 308 DGVHLELPTANTFISLETNIICLAFASALDFPIAIFGNIQQQNFDIVYDLDNMRIGFAPA 367
Query: 370 DC 371
DC
Sbjct: 368 DC 369
>gi|242063796|ref|XP_002453187.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
gi|241933018|gb|EES06163.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
Length = 493
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 125/433 (28%), Positives = 180/433 (41%), Gaps = 93/433 (21%)
Query: 30 SVELIHRDSPKSPFYNSS-ETPYQRL-RDALT-----RSLNRLNHFNQNSSISSS--KAS 80
+V L HR P SP N T +RL RD L R L+R + + S
Sbjct: 63 TVPLHHRHGPCSPLPNKKMPTLEERLHRDKLRAAYIHRKLSRGKKQGGGGAGGDVVVQQS 122
Query: 81 QADIIP-------NNANYLIRISIGTPPTE-RLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
A +P + Y+I + +G+PP + + + DTGSD+ W +C+PC QC Q
Sbjct: 123 HAMTVPTTLGTSLDTLEYVITVRLGSPPGKSQTMLIDTGSDISWVRCKPCW-QQCRPQVD 181
Query: 133 PLFDPKMSSTYKSLPCSSSQCASL----NQKSCSGV-NCQYSVSYGDGSF-SNGNLATET 186
PLFDP +SSTY CSS+ CA L N CS CQY YGDGS + G +++T
Sbjct: 182 PLFDPSLSSTYSPFSCSSAACAQLFQEGNANGCSSSGQCQYIAMYGDGSVGTTGTYSSDT 241
Query: 187 VTLGSTTGQAVALPGITFGCGTNNGGL----------------FNSKTTGIVG------- 223
+ LGS + V + FGC G+ S+T G G
Sbjct: 242 LALGSNS-NTVVVSKFRFGCSHAETGITGLTAGLMGLGGGAQSLVSQTAGTFGTTAFSYC 300
Query: 224 ------------LGGGDISLISQMRTTI------------------AGNQRLGVST---- 249
LG S ++T + G ++L + T
Sbjct: 301 LPPTPSSSGFLTLGAAGTSSAGFVKTPMLRSSQVPAFYGVRLEAIRVGGRQLSIPTTVFS 360
Query: 250 PDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPT---GSLELCYSFNSLSQV--P 304
+++DSGT +T LP S+L S + ++ P A + G L+ C+ + S V P
Sbjct: 361 AGMIMDSGTVVTRLPPTAYSSLSSAFKAGMKQYPPAPSSAGGGFLDTCFDMSGQSSVSMP 420
Query: 305 EVTIHFRGAD---VKLSRSNFFVKV-SEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYD 358
V + F GA V L S +++ + I C F ++ S I GN+ Q F V YD
Sbjct: 421 TVALVFSGAGGAVVNLDASGILLQMETSSIFCLAFVATSDDGSTGIIGNVQQRTFQVLYD 480
Query: 359 IEQQTVSFKPTDC 371
+ V FK C
Sbjct: 481 VAGGAVGFKAGAC 493
>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 355
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 61/154 (39%), Positives = 81/154 (52%), Gaps = 3/154 (1%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
YL + +GTP + DTGSDL W QC PC CY Q+ LF P S+++ L C
Sbjct: 1 GEYLATVRLGTPERVFSVIVDTGSDLTWVQCSPC--GTCYSQNDSLFIPNTSTSFTKLAC 58
Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
+ C L C+ C Y SYGDGS S G+ +T+T+ GQ +P FGCG
Sbjct: 59 GTELCNGLPYPMCNQTTCVYWYSYGDGSLSTGDFVYDTITMDGINGQKQQVPNFAFGCGH 118
Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGN 242
+N G F + GI+GLG G +S SQ++T G
Sbjct: 119 DNEGSF-AGADGILGLGQGPLSFPSQLKTVFNGK 151
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 48/154 (31%), Positives = 75/154 (48%), Gaps = 11/154 (7%)
Query: 226 GGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVM-SSMIEAQPV 284
GG + IS I R G + DSGTT+T L + +L+ M +S ++
Sbjct: 207 GGKLLNISSTAFDIDSVGRAGT-----IFDSGTTVTQLAGEVHQEVLAAMNASTMDYPRK 261
Query: 285 ADPTGSLELC---YSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITN- 340
+D + L+LC ++ L VP +T HF G D++L SN+F+ + E F +++
Sbjct: 262 SDDSSGLDLCLGGFAEGQLPTVPSMTFHFEGGDMELPPSNYFIFL-ESSQSYCFSMVSSP 320
Query: 341 SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTKQ 374
V I G+I Q NF V YD + + F P C +
Sbjct: 321 DVTIIGSIQQQNFQVYYDTVGRKIGFVPKSCVGR 354
>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 459
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 97/363 (26%), Positives = 148/363 (40%), Gaps = 81/363 (22%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ YL+ +++GTPP A+ DTGSDLIWTQC PC + C Q P+F P SS+Y+ +
Sbjct: 100 GDLEYLVDLAVGTPPQPVSALLDTGSDLIWTQCAPC--ASCLPQPDPIFSPGASSSYEPM 157
Query: 147 PCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQA----VALPG 201
C+ C + SC + C Y SYGDG+ + G ATE T S++ ++ P
Sbjct: 158 RCAGELCNDILHHSCQRPDTCTYRYSYGDGTTTRGVYATERFTFSSSSSGGETTKLSAP- 216
Query: 202 ITFGCGT------NNGG------------------------------------LFNSKTT 219
+ FGCGT NNG LF S
Sbjct: 217 LGFGCGTMNKGSLNNGSGIVGFGRAPLSLVSQLAIRRFSYCLTPYASGRKSTLLFGSLRG 276
Query: 220 GIVGLGGGDISLISQMRT-------------TIAGNQRLGVS------TPD----IVIDS 256
G+ + +R+ G +RL + PD ++DS
Sbjct: 277 GVYDAATATVQTTRLLRSRQNPTFYYVPFTGVTVGARRLRIPISAFALRPDGSGGAIVDS 336
Query: 257 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ-------VPEVTIH 309
GT LT P + ++ S + A+ + + F + + VP + H
Sbjct: 337 GTALTLFPAPVLAEVVRAFRSQLRLPFAANGSSGPDDGVCFAAAASRVPRPAVVPRMVFH 396
Query: 310 FRGADVKLSRSNFFVK-VSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
+GAD+ L R N+ + + +C + +S GN +Q + V YD+E T+SF P
Sbjct: 397 LQGADLDLPRRNYVLDDQRKGNLCLLLADSGDSGTTIGNFVQQDMRVLYDLEADTLSFAP 456
Query: 369 TDC 371
C
Sbjct: 457 AQC 459
>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
Length = 407
Score = 117 bits (294), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 64/178 (35%), Positives = 94/178 (52%), Gaps = 22/178 (12%)
Query: 49 TPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD-----------IIPNNANYLIRISI 97
T Q L + L R R+ + ++ K +A ++ + Y +R+ +
Sbjct: 1 THEQLLLETLQRDERRVRWIESKAKLAGKKKDEASSTDLNGPVTSGLLYGSGEYFVRLGL 60
Query: 98 GTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN 157
GTP V DTGSDL W QC+PC CY Q P+FDP+ SS+++ +PC S C +L
Sbjct: 61 GTPARSLFMVVDTGSDLPWLQCQPC--KSCYKQADPIFDPRNSSSFQRIPCLSPLCKALE 118
Query: 158 QKSCSG-----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN 210
SCSG C Y V+YGDGSFS G+ +++ TLG T +A++ + FGCG +N
Sbjct: 119 VHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLG-TGSKAMS---VAFGCGFDN 172
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 64/124 (51%), Gaps = 4/124 (3%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
++IDSGT++T P + + + P A + CY+F+ + VP + +H
Sbjct: 284 VIIDSGTSVTRFPTSVYATIRDAFRNATINLPSAPRYSLFDTCYNFSGKASVDVPALVLH 343
Query: 310 FR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
F GAD++L +N+ + + + C F + + I GNI Q +F +G+D+++ ++F
Sbjct: 344 FENGADLQLPPTNYLIPINTAGSFCLAFAPTSMELGIIGNIQQQSFRIGFDLQKSHLAFA 403
Query: 368 PTDC 371
P C
Sbjct: 404 PQQC 407
>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
gi|223949441|gb|ACN28804.1| unknown [Zea mays]
Length = 326
Score = 117 bits (294), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 89/331 (26%), Positives = 133/331 (40%), Gaps = 72/331 (21%)
Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN- 165
V DTGSD+ W QC+PC + CY Q P+FDP +S++Y ++ C S +C L+ +C
Sbjct: 2 VLDTGSDVTWVQCQPC--ADCYQQSDPVFDPSLSASYAAVSCDSQRCRDLDTAACRNATG 59
Query: 166 -CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLF---------- 214
C Y V+YGDGS++ G+ ATET+TLG +T + + GCG +N GLF
Sbjct: 60 ACLYEVAYGDGSYTVGDFATETLTLGDST----PVGNVAIGCGHDNEGLFVGAAGLLALG 115
Query: 215 ----------------------NSKTTGIVGLGGGDIS-------LISQMRTTI------ 239
+S + G G L+ RT+
Sbjct: 116 GGPLSFPSQISASTFSYCLVDRDSPAASTLQFGDGAAEAGTVTAPLVRSPRTSTFYYVAL 175
Query: 240 ----AGNQRLGVSTPDIVID-----------SGTTLTFLPQGYNSNLLSVMSSMIEAQPV 284
G Q L + +D SGT +T L + L + P
Sbjct: 176 SGISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQSAAYAALRDAFVQGAPSLPR 235
Query: 285 ADPTGSLELCYSFNSLS--QVPEVTIHFRGAD-VKLSRSNFFVKVS-EDIVCSVFKGITN 340
+ CY + + +VP V++ F G ++L N+ + V C F
Sbjct: 236 TSGVSLFDTCYDLSDRTSVEVPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNA 295
Query: 341 SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+V I GN+ Q V +D + V F P C
Sbjct: 296 AVSIIGNVQQQGTRVSFDTARGAVGFTPNKC 326
>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
Length = 479
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 81/225 (36%), Positives = 114/225 (50%), Gaps = 24/225 (10%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD------ 83
SV L HR P SP +S + L R R ++ + S S+ A+ D
Sbjct: 61 SVTLSHRYGPCSPADPNSGEKRPTDEELLRRDQLRADYIRRKFSGSNGTAAGEDGQSSKV 120
Query: 84 IIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP-PSQCYMQDSPLF 135
+P + Y+I + +G+P + V DTGSD+ W QCEPCP PS C+ LF
Sbjct: 121 SVPTTLGSSLDTLEYVISVGLGSPAMTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALF 180
Query: 136 DPKMSSTYKSLPCSSSQCASL-NQKSCSGVN----CQYSVSYGDGSFSNGNLATETVTLG 190
DP SSTY + CS++ CA L + +G + CQY V YGDGS + G +++ +TL
Sbjct: 181 DPAASSTYAAFNCSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYSSDVLTL- 239
Query: 191 STTGQAVALPGITFGCGTNN-GGLFNSKTTGIVGLGGGDISLISQ 234
+G V + G FGC G + KT G++GLGG SL+SQ
Sbjct: 240 --SGSDV-VRGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSLVSQ 281
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 39/140 (27%), Positives = 62/140 (44%), Gaps = 14/140 (10%)
Query: 241 GNQRLGVSTPDI-----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY 295
G ++LG+S P + ++DSGT +T LP + L S + + A+P G L+ C+
Sbjct: 345 GGKKLGLS-PSVFAAGSLVDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCF 403
Query: 296 SFNSLSQV--PEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQT 351
+F L +V P V + F G V ++ V C F + + GN+ Q
Sbjct: 404 NFTGLDKVSIPTVALVFAGGAVVDLDAHGIVSGG----CLAFAPTRDDKAFGTIGNVQQR 459
Query: 352 NFLVGYDIEQQTVSFKPTDC 371
F V YD+ F+ C
Sbjct: 460 TFEVLYDVGGGVFGFRAGAC 479
>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 358
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 85/247 (34%), Positives = 123/247 (49%), Gaps = 40/247 (16%)
Query: 18 VVSPIEAQTGGFSVELIHRD-------SPKSPFYNSSETPYQRLRDALTRSLN-RLNHFN 69
+ P Q+GG IH +P+ P S + DA ++LN RL
Sbjct: 28 ALGPRVNQSGGVVQMTIHHVHGPGSSLAPQPPVSFSDVLAWD---DARVKTLNSRLTR-- 82
Query: 70 QNSSISSSKASQADI-------IPNN-------ANYLIRISIGTPPTERLAVADTGSDLI 115
+++ S ++ DI +P N NY +++ G+P + DTGS L
Sbjct: 83 KDTRFPKSVLTKKDIRFPKSVSVPLNPGASIGSGNYYVKVGFGSPARYYSMIVDTGSSLS 142
Query: 116 WTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-----ASLNQKSC--SGVNCQY 168
W QC+PC C++Q PLFDP S TYKSL C+SSQC A+LN C S C Y
Sbjct: 143 WLQCKPCV-VYCHVQADPLFDPSASKTYKSLSCTSSQCSSLVDATLNNPLCETSSNVCVY 201
Query: 169 SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGD 228
+ SYGD S+S G L+ + +TL + LPG +GCG ++ GLF + GI+GLG
Sbjct: 202 TASYGDSSYSMGYLSQDLLTLAPSQ----TLPGFVYGCGQDSDGLFG-RAAGILGLGRNK 256
Query: 229 ISLISQM 235
+S++ Q+
Sbjct: 257 LSMLGQV 263
>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 469
Score = 117 bits (293), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 72/197 (36%), Positives = 99/197 (50%), Gaps = 23/197 (11%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISS---------SKA 79
FSV+L H D+ +NS TP L R R+ + + + S +
Sbjct: 60 FSVQLHHVDALS---FNS--TPETLFTTRLQRDAARVEAISYLAETAGTGKRVGTGFSSS 114
Query: 80 SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
+ + + Y RI +GTPP V DTGSD++W QC PC +CY Q P+FDP+
Sbjct: 115 VISGLAQGSGEYFTRIGVGTPPRYVYMVLDTGSDIVWIQCAPC--KRCYAQSDPVFDPRK 172
Query: 140 SSTYKSLPCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
S ++ S+ C S C L+ C+ C Y VSYGDGSF+ G+ +TET+T T V
Sbjct: 173 SRSFASIACRSPLCHRLDSPGCNTQKQTCMYQVSYGDGSFTFGDFSTETLTFRRTRVARV 232
Query: 198 ALPGITFGCGTNNGGLF 214
AL GCG +N GLF
Sbjct: 233 AL-----GCGHDNEGLF 244
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 43/128 (33%), Positives = 63/128 (49%), Gaps = 13/128 (10%)
Query: 252 IVIDSGTTLTFLPQ----GYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSFNSLSQV--P 304
++IDSGT++T L + + + S++ A P SL + C+ + ++V P
Sbjct: 346 VIIDSGTSVTRLTRPAYIAFRDAFRAGASNLKRA-----PQFSLFDTCFDLSGKTEVKVP 400
Query: 305 EVTIHFRGADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 363
V +HFRGADV L SN+ + V + C F G + I GNI Q F V YD+
Sbjct: 401 TVVLHFRGADVSLPASNYLIPVDTSGNFCLAFAGTMGGLSIIGNIQQQGFRVVYDLAGSR 460
Query: 364 VSFKPTDC 371
V F P C
Sbjct: 461 VGFAPHGC 468
>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
Length = 494
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 72/223 (32%), Positives = 114/223 (51%), Gaps = 23/223 (10%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS------ISSSKASQA 82
+ L+HRDS + ++E +RL+ R+ ++ N + +S+ + A
Sbjct: 64 LHIHLLHRDS-FAVNATAAELLARRLQRDELRAAWIISKAAANGTPPPVVGLSTGRGLVA 122
Query: 83 DII---PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
++ P + Y+ +I++GTP + L DT SDL W QC+PC +CY Q P+FDP+
Sbjct: 123 PVVSRAPTSGEYMAKIAVGTPAVQALLALDTASDLTWLQCQPC--RRCYPQSGPVFDPRH 180
Query: 140 SSTYKSLPCSSSQCASLNQKSCSGVN---CQYSVSYGDG----SFSNGNLATETVTLGST 192
S++Y + + C +L + C Y+V YGDG S S G+L ET+T
Sbjct: 181 STSYGEMNYDAPDCQALGRSGGGDAKRGTCIYTVQYGDGHGSTSTSVGDLVEETLTFAGG 240
Query: 193 TGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
QA ++ GCG +N GLF + GI+GLG G IS+ Q+
Sbjct: 241 VRQAY----LSIGCGHDNKGLFGAPAAGILGLGRGQISIPHQI 279
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 43/128 (33%), Positives = 68/128 (53%), Gaps = 8/128 (6%)
Query: 252 IVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVA--DPTGSLELCYSFNSLS--QVPEV 306
+++DSGTT+T L + Y + + ++ V+ P+G + CY+ + +VP V
Sbjct: 367 VILDSGTTVTRLARPAYVAFRDAFRAAATSLGQVSTGGPSGLFDTCYTVGGRAGVKVPAV 426
Query: 307 TIHFRGA-DVKLSRSNFFVKV-SEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQT 363
++HF G +V L N+ + V S VC F G + SV + GNI+Q F V YD+ Q
Sbjct: 427 SMHFAGGVEVSLQPKNYLIPVDSRGTVCFAFAGTGDRSVSVIGNILQQGFRVVYDLAGQR 486
Query: 364 VSFKPTDC 371
V F P +C
Sbjct: 487 VGFAPNNC 494
>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 557
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 95/367 (25%), Positives = 155/367 (42%), Gaps = 86/367 (23%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y + + IGTPP + DTGSDL W QC PC C++Q+ P +DPK SS++K++ C
Sbjct: 190 GEYFMDVFIGTPPRHFSLILDTGSDLNWIQCVPC--YDCFVQNGPYYDPKESSSFKNIGC 247
Query: 149 SSSQCASLNQ----KSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTT----GQAVA 198
+C ++ + C N C Y YGD S + G+ A ET T+ T+ +
Sbjct: 248 HDPRCHLVSSPDPPQPCKAENQTCPYFYWYGDSSNTTGDFALETFTVNLTSPAGKSEFKR 307
Query: 199 LPGITFGCGTNNGGL----------------FNSKTTGIVG------------------- 223
+ + FGCG N GL F+S+ + G
Sbjct: 308 VENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSK 367
Query: 224 -LGGGDISLIS-----------------------QMRTTIAGNQRLGVS------TPD-- 251
+ G D L++ Q+++ + G + L + +P+
Sbjct: 368 LIFGEDKDLLNHPEVNFTSLVAGKENPVDTFYYVQIKSIMVGGEVLKIPEETWHLSPEGA 427
Query: 252 --IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVT 307
++DSGTTL++ + + ++ PV L+ CY+ + + ++PE
Sbjct: 428 GGTIVDSGTTLSYFAEPSYEIIKDAFVKKVKGYPVIKDFPILDPCYNVSGVEKMELPEFR 487
Query: 308 IHFR-GADVKLSRSNFFVKVS-EDIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQTV 364
I F GA N+F+K+ E+IVC G S + I GN Q NF + YD ++ +
Sbjct: 488 ILFEDGAVWNFPVENYFIKLEPEEIVCLAILGTPRSALSIIGNYQQQNFHILYDTKKSRL 547
Query: 365 SFKPTDC 371
+ P C
Sbjct: 548 GYAPMKC 554
>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
Length = 440
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 74/191 (38%), Positives = 102/191 (53%), Gaps = 14/191 (7%)
Query: 52 QRLRDALTRSLNRLNHFNQNSSISS-SKASQADIIPNNANYLIRISIGTPPTERLAVADT 110
+ +R RS R +S+ + S + D +P YL+ ++IGTPP DT
Sbjct: 52 ELMRRMALRSKARAPRLLSSSATAPVSPGAYDDGVPMT-EYLLHLAIGTPPQPVQLTLDT 110
Query: 111 GSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN----- 165
GS L+WTQC+PC + C+ Q P +D SST+ C S+QC L+ VN
Sbjct: 111 GSVLVWTQCQPC--AVCFNQSLPYYDASRSSTFALPSCDSTQC-KLDPSVTMCVNQTVQT 167
Query: 166 CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLG 225
C YS SYGD S + G L ETV+ + ++PG+ FGCG NN G+F S TGI G G
Sbjct: 168 CAYSYSYGDKSATIGFLDVETVSFVA----GASVPGVVFGCGLNNTGIFRSNETGIAGFG 223
Query: 226 GGDISLISQMR 236
G +SL SQ++
Sbjct: 224 RGPLSLPSQLK 234
Score = 57.8 bits (138), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 40/126 (31%), Positives = 62/126 (49%), Gaps = 6/126 (4%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV-ADPTGSLELCYSFNSLSQ---VPEVTI 308
+IDSGT T LP + ++ ++ V ++ TG L LC+S L + VP++ +
Sbjct: 315 IIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPL-LCFSAPPLGKAPHVPKLVL 373
Query: 309 HFRGADVKLSRSNFFVKVSEDIVCSV-FKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
HF GA + L R N+ + + CS+ I + I GN Q N V YD++ +SF
Sbjct: 374 HFEGATMHLPRENYVFEAKDGGNCSICLAIIEGEMTIIGNFQQQNMHVLYDLKNSKLSFV 433
Query: 368 PTDCTK 373
C K
Sbjct: 434 RAKCDK 439
>gi|357127507|ref|XP_003565421.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 438
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 77/224 (34%), Positives = 113/224 (50%), Gaps = 37/224 (16%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQ------ 81
GFSVE IHRDS KS F++ + TP RLR A RS+ R H + ++ +++ +
Sbjct: 3 GFSVEFIHRDSVKSLFHDPTLTPEARLRQAARRSMARHAHAARINNSAAAAGASGSDDSD 62
Query: 82 ----ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
+ ++P N YL+ + + TPP LA+ADTGS L+W +C+ P
Sbjct: 63 ADVVSPMVPQNFEYLMALDVSTPPVRMLALADTGSSLVWLKCK-----------LPAAHT 111
Query: 138 KMSSTYKSLPCSSSQCASL-NQKSC----SGVN-CQYSVSYGDGSFSNGNLATETVTLGS 191
SS+Y LPC + C +L + SC SG N C Y ++ DGS + G + + T +
Sbjct: 112 PASSSYARLPCDAFACKALGDAASCRATGSGNNICVYRYAFADGSCTAGPVTVDAFTFST 171
Query: 192 TTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
+ FGC T GL + G+VGL G ISL+SQ+
Sbjct: 172 R---------LDFGCATRTEGL-SVPDDGLVGLANGPISLVSQL 205
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 41/132 (31%), Positives = 69/132 (52%), Gaps = 12/132 (9%)
Query: 245 LGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--- 301
L +T +++DSGT LT+LP+ L++ +++ I+ V P +CY +
Sbjct: 276 LQTTTTKLIVDSGTMLTYLPKAVLDPLVAALTAAIKLPRVKSPETLYAVCYDVRRRAPED 335
Query: 302 ---QVPEVTIHF-RGADVKLSRSN-FFVKVSEDIVCSVFKGITNSVP--IYGNIMQTNFL 354
+P+VT+ G +V+L N F V+ VC + + +P I GN+ Q N
Sbjct: 336 VGKSIPDVTLVLGGGGEVRLPWGNTFVVENKGTTVCLAL--VESHLPEFILGNVAQQNLH 393
Query: 355 VGYDIEQQTVSF 366
VG+D+E++TVSF
Sbjct: 394 VGFDLERRTVSF 405
>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 536
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 94/368 (25%), Positives = 152/368 (41%), Gaps = 87/368 (23%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y I + +GTPP + DTGSDL W QC+PC C+ Q+ P ++P SS+Y+++ C
Sbjct: 170 YFIDMFVGTPPKHVWLILDTGSDLSWIQCDPC--YDCFEQNGPHYNPNESSSYRNISCYD 227
Query: 151 SQCASLNQ----KSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGST----TGQAVALP 200
+C ++ + C N C Y Y DGS + G+ A ET T+ T + +
Sbjct: 228 PRCQLVSSPDPLQHCKTENQTCPYFYDYADGSNTTGDFALETFTVNLTWPNGKEKFKHVV 287
Query: 201 GITFGCGTNNGGL----------------FNSKTTGIVG--------------------L 224
+ FGCG N G F S+ I G +
Sbjct: 288 DVMFGCGHWNKGFFHGAGGLLGLGRGPLSFPSQLQSIYGHSFSYCLTDLFSNTSVSSKLI 347
Query: 225 GGGDISLIS-----------------------QMRTTIAGNQRLGVSTPDI--------- 252
G D L++ Q+++ + G + L +
Sbjct: 348 FGEDKELLNHHNLNFTKLLAGEETPDDTFYYLQIKSIVVGGEVLDIPEKTWHWSSEGVGG 407
Query: 253 -VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
+IDSG+TLTF P + I+ Q +A + CY+ + QV P+ IH
Sbjct: 408 TIIDSGSTLTFFPDSAYDVIKEAFEKKIKLQQIAADDFIMSPCYNVSGAMQVELPDYGIH 467
Query: 310 F-RGADVKLSRSNFFVKVSED-IVC-SVFKGITNS-VPIYGNIMQTNFLVGYDIEQQTVS 365
F GA N+F + D ++C ++ K +S + I GN++Q NF + YD+++ +
Sbjct: 468 FADGAVWNFPAENYFYQYEPDEVICLAILKTPNHSHLTIIGNLLQQNFHILYDVKRSRLG 527
Query: 366 FKPTDCTK 373
+ P C +
Sbjct: 528 YSPRRCAE 535
>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 495
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 74/186 (39%), Positives = 99/186 (53%), Gaps = 20/186 (10%)
Query: 34 IHRDSPKSPFYNSSETPYQRLRDALTRS-LNRLNHFNQNSSIS---SSKASQADIIPNNA 89
+HRDS + + T Q + + +++S L L Q +S SS SQ +
Sbjct: 106 LHRDSSR---VQAITTRLQLILNGVSKSDLKPLQTEIQPQDLSTPVSSGTSQG-----SG 157
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
Y R+ +G P V DTGSD+ W QC+PC S CY Q P+F P SS+Y L C
Sbjct: 158 EYFTRVGVGNPAKSYYMVLDTGSDINWIQCQPC--SDCYQQSDPIFTPAASSSYSPLTCD 215
Query: 150 SSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTL-GSTTGQAVALPGITFGCGT 208
S QC SL SC C+Y V+YGDGSF+ G+ TET++ GS T ++AL GCG
Sbjct: 216 SQQCNSLQMSSCRNGQCRYQVNYGDGSFTFGDFVTETMSFGGSGTVNSIAL-----GCGH 270
Query: 209 NNGGLF 214
+N GLF
Sbjct: 271 DNEGLF 276
Score = 47.0 bits (110), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 37/125 (29%), Positives = 54/125 (43%), Gaps = 6/125 (4%)
Query: 252 IVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTI 308
+++D GT +T L + YNS L SM + CY + S +VP V+
Sbjct: 372 VIVDCGTAITRLQSEAYNS-LRDSFVSMSRHLRSTSGVALFDTCYDLSGQSSVKVPTVSF 430
Query: 309 HFRGAD-VKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
HF G L +N+ + V S C F T+S+ I GN+ Q V +D+ V F
Sbjct: 431 HFDGGKSWDLPAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVSFDLANNRVGF 490
Query: 367 KPTDC 371
C
Sbjct: 491 STNKC 495
>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 387
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 118/395 (29%), Positives = 165/395 (41%), Gaps = 88/395 (22%)
Query: 54 LRDALTRSLNRLNHFNQNSSISSSKASQADI-----IPNNA-NYLIRISIGTPPTERLAV 107
L+D L R + F+ ++ S K QADI IP A NYL+++++GTP
Sbjct: 3 LQDQL-RVKSMHARFSNKNAGSHFKEMQADIPVQSGIPLGAGNYLVKMALGTPKLSLSLA 61
Query: 108 ADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCA----SLNQKSCSG 163
DTGSD+ WTQCEPC S CY Q FDP+ SS+YK++ CSSS C S + C
Sbjct: 62 LDTGSDITWTQCEPCVGS-CYRQAQTKFDPRKSSSYKNVSCSSSSCRIITDSGGARGCVS 120
Query: 164 VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGL---------- 213
C Y V YGDGS+S G ATE +T+ + + FGCG N G
Sbjct: 121 STCIYKVQYGDGSYSVGFFATEKLTISPSD----VISNFLFGCGQQNAGRFGRIAGLLGL 176
Query: 214 -------------------------FNSKTTGIVGLGGGDISLIS--------------- 233
F+S +TG + LGG +
Sbjct: 177 GRGKLSLALQTSEKYNNLFTYCLPSFSSSSTGHLTLGGQVPKSVKFTPLSPAFKNTPFYG 236
Query: 234 -QMRTTIAGNQRLGV-----STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADP 287
++ G L + S +IDSGT +T L S L S +++ P D
Sbjct: 237 IDIKGLSVGGHVLPIDASVFSNAGAIIDSGTVITRLQPTVYSALSSKFQQLMKDYPKTDG 296
Query: 288 TGSLELCYSF--NSLSQVPEVTIHFRGA---DVKLSRSNFF----VKVSEDIVCSVF--K 336
L+ CY F N VP ++ F+G D+K FF V + D VC F
Sbjct: 297 FSILDTCYDFSGNESISVPRISFFFKGGVEVDIK-----FFGILTVINAWDKVCLAFAPN 351
Query: 337 GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
++GN Q + V +D+ + + F P+ C
Sbjct: 352 DDDGDFVVFGNSQQQTYDVVHDLAKGRIGFAPSGC 386
>gi|296085638|emb|CBI29432.3| unnamed protein product [Vitis vinifera]
Length = 337
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 65/160 (40%), Positives = 92/160 (57%), Gaps = 13/160 (8%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ NY +++ G+P + DTGS L W QC+PC C++Q PLFDP S TYKSL
Sbjct: 115 SGNYYVKVGFGSPARYYSMIVDTGSSLSWLQCKPCV-VYCHVQADPLFDPSASKTYKSLS 173
Query: 148 CSSSQC-----ASLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
C+SSQC A+LN C S C Y+ SYGD S+S G L+ + +TL + LP
Sbjct: 174 CTSSQCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQ----TLP 229
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
G +GCG ++ GLF + GI+GLG +S++ Q+ +
Sbjct: 230 GFVYGCGQDSDGLFG-RAAGILGLGRNKLSMLGQVSSKFG 268
>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 451
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 68/155 (43%), Positives = 93/155 (60%), Gaps = 13/155 (8%)
Query: 84 IIPNNAN-YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
++ N+A Y + +SIGTPP +ADTGS LIWTQC PC ++C + +P F P SST
Sbjct: 82 LLDNSAGAYNMNLSIGTPPVTFSVLADTGSSLIWTQCAPC--TECAARPAPPFQPASSST 139
Query: 143 YKSLPCSSSQCASLNQK--SCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
+ LPC+SS C L +C+ C Y YG G F+ G LATET+ +G + P
Sbjct: 140 FSKLPCASSLCQFLTSPYLTCNATGCVYYYPYGMG-FTAGYLATETLHVG-----GASFP 193
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
G+ FGC T NG + ++GIVGLG +SL+SQ+
Sbjct: 194 GVAFGCSTENG--VGNSSSGIVGLGRSPLSLVSQV 226
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/137 (29%), Positives = 65/137 (47%), Gaps = 18/137 (13%)
Query: 253 VIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGS---LELCYSFNSL---SQVPE 305
++DSGTTLT+L +GY + +S M A G+ +LC+ + S VP
Sbjct: 312 IVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFDLCFDATAAGGGSGVPV 371
Query: 306 VTIHFR---GADVKLSRSNFFVKVSED------IVCSVFKGITN--SVPIYGNIMQTNFL 354
T+ R GA+ + R ++ V+ D + C + + S+ I GN+MQ +
Sbjct: 372 PTLVLRFAGGAEYAVRRRSYVGVVAVDSQGRAAVECLLVLPASEKLSISIIGNVMQMDLH 431
Query: 355 VGYDIEQQTVSFKPTDC 371
V YD++ SF P DC
Sbjct: 432 VLYDLDGGMFSFAPADC 448
>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
Length = 336
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 62/127 (48%), Positives = 79/127 (62%), Gaps = 3/127 (2%)
Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQY 168
DTGSDLIWTQC PC C Q +P FD K S+TY++LPC SS+CASL+ SC C Y
Sbjct: 2 DTGSDLIWTQCAPC--LLCADQPTPYFDVKKSATYRALPCRSSRCASLSSPSCFKKMCVY 59
Query: 169 SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGD 228
YGD + + G LA ET T G+ V I FGCG+ N G + ++G+VG G G
Sbjct: 60 QYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLNAGDL-ANSSGMVGFGRGP 118
Query: 229 ISLISQM 235
+SL+SQ+
Sbjct: 119 LSLVSQL 125
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 55/124 (44%), Gaps = 4/124 (3%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF----NSLSQVPEVT 307
++IDSGT++T+L Q + + S I + D L+ C+ + N VP++
Sbjct: 210 VIIDSGTSITWLQQDAYEAVRRGLVSAIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDLV 269
Query: 308 IHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
HF A++ L N+ + S + T I GN Q N + YDI +SF
Sbjct: 270 FHFDSANMTLLPENYMLIASTTGYLCLVMAPTGVGTIIGNYQQQNLHLLYDIGNSFLSFV 329
Query: 368 PTDC 371
P C
Sbjct: 330 PAPC 333
>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 447
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 112/423 (26%), Positives = 172/423 (40%), Gaps = 101/423 (23%)
Query: 32 ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ--NSSISSSKASQADIIPN-- 87
+LIH S P Y +ET R+ + S RL + S+ S+ +A + P+
Sbjct: 38 KLIHPGSVHHPHYKPNETAKDRMELDIQHSAARLANIQARIEGSLVSNNDYKARVSPSLT 97
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL- 146
+ ISIG PP +L V DTGSD++W C PC + C LFDP SST+ L
Sbjct: 98 GRTIMANISIGQPPIPQLVVMDTGSDILWVMCTPC--TNCDNDLGLLFDPSKSSTFSPLC 155
Query: 147 --PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
PC C C + ++V+Y D S ++G +TV +T + + F
Sbjct: 156 KTPCDFEGC------RCDPI--PFTVTYADNSTASGTFGRDTVVFETTDEGTSRISDVLF 207
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQ------------------------------ 234
GCG N G + GI+GL G SL+++
Sbjct: 208 GCGHNIGHDTDPGHNGILGLNNGPDSLVTKLGQKFSYCIGNLADPYYNYHQLILGEGADL 267
Query: 235 -----------------MRTTIAGNQRLGVS----------TPDIVIDSGTTLTFLPQGY 267
M G +RL ++ ++ID+G+T+TFL
Sbjct: 268 EGYSTPFEVYNGFYYVTMEGISVGEKRLDIAPETFEMKENRAGGVIIDTGSTITFLVDSV 327
Query: 268 NS-------NLL--SVMSSMIEAQPVADPTGSLELCYSFNSLSQ----VPEVTIHF-RGA 313
+ NLL S + IE P C+ + S+S+ P VT HF GA
Sbjct: 328 HKLLSKEVRNLLGWSFRQATIEKSPWMQ-------CF-YGSISRDLVGFPVVTFHFSDGA 379
Query: 314 DVKLSRSNFFVKVSEDIVCSVFK-----GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
D+ L +FF ++++++ C I + + G + Q ++ VGYD+ Q V F+
Sbjct: 380 DLALDSGSFFNQLNDNVFCMTVGPVSSLNIKSKPSLIGLLAQQSYNVGYDLVNQFVYFQR 439
Query: 369 TDC 371
DC
Sbjct: 440 IDC 442
>gi|226509408|ref|NP_001141440.1| uncharacterized protein LOC100273550 precursor [Zea mays]
gi|194704586|gb|ACF86377.1| unknown [Zea mays]
gi|413938617|gb|AFW73168.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 478
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 107/352 (30%), Positives = 142/352 (40%), Gaps = 82/352 (23%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS-QCYMQDSPLFDPKMSSTYKSLPC 148
NY++ S+GTP + DTGSDL W QC+PC + CY Q PLFDP SS+Y ++PC
Sbjct: 139 NYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPC 198
Query: 149 SSSQCASLN---QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
CA L +CS C Y VSYGDGS + G +++T+TL +++ A+ G FG
Sbjct: 199 GGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASS----AVQGFFFG 254
Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQ--------------MRTTIAGNQRLGVSTPD 251
CG GLFN G++GLG SL+ Q + + AG LGV P
Sbjct: 255 CGHAQSGLFNG-VDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPS 313
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIE------------------------------- 280
+T LP VM + I
Sbjct: 314 GAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVVTRLPPT 373
Query: 281 ----------------AQPVADPTGSLELCYSFNSLSQV--PEVTIHF-RGADVKLSRSN 321
P A G L+ CY+F V P V + F GA V L
Sbjct: 374 AYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADG 433
Query: 322 FFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
C F G + I GN+ Q +F V I+ +V FKP+ C
Sbjct: 434 IL-----SFGCLAFAPSGSDGGMAILGNVQQRSFEV--RIDGTSVGFKPSSC 478
>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
Length = 474
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 68/197 (34%), Positives = 102/197 (51%), Gaps = 20/197 (10%)
Query: 55 RDALTRSLNRLNHFNQNSSISSSKASQADIIPN-------NANYLIRISIGTPPTERLAV 107
R+ L R R +++ + S +A+ A + P + YL+ ++IGTPP +
Sbjct: 70 RELLHRMAARSK--ARSARLLSGRAASARVDPGSYTDGVPDTEYLVHMAIGTPPQPVQLI 127
Query: 108 ADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC------ 161
DTGSDL WTQC PC C+ Q P F+P S T+ LPC C L SC
Sbjct: 128 LDTGSDLTWTQCAPC--VSCFRQSLPRFNPSRSMTFSVLPCDLRICRDLTWSSCGEQSWG 185
Query: 162 SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ--AVALPGITFGCGTNNGGLFNSKTT 219
+G+ C Y+ +Y D S + G+L ++T + S ++P +TFGCG N G+F S T
Sbjct: 186 NGI-CVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNNGIFVSNET 244
Query: 220 GIVGLGGGDISLISQMR 236
GI G G +S+ +Q++
Sbjct: 245 GIAGFSRGALSMPAQLK 261
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 37/128 (28%), Positives = 60/128 (46%), Gaps = 9/128 (7%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSF--NSLSQVPEVTIH 309
++DSGT +T LP+ + + + + V + T SL +LC+S + VP + +H
Sbjct: 348 IVDSGTGMTMLPEAVYNLVCDAFVAQTKLT-VHNSTSSLSQLCFSVPPGAKPDVPALVLH 406
Query: 310 FRGADVKLSRSNFFVKVSE----DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
F GA + L R N+ ++ E + C + + GN Q N V YD+ +S
Sbjct: 407 FEGATLDLPRENYMFEIEEAGGIRLTCLAINA-GEDLSVIGNFQQQNMHVLYDLANDMLS 465
Query: 366 FKPTDCTK 373
F P C K
Sbjct: 466 FVPARCNK 473
>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 384
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 66/152 (43%), Positives = 87/152 (57%), Gaps = 12/152 (7%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
YL+ ++IGTPP DTGS L+WTQC+PC + C+ Q P +D SST+ C
Sbjct: 34 EYLLHLAIGTPPQPVQLTLDTGSVLVWTQCQPC--AVCFNQSLPYYDASRSSTFALPSCD 91
Query: 150 SSQCASLNQKSCSGVN-----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
S+QC L+ VN C YS SYGD S + G L ETV+ + ++PG+ F
Sbjct: 92 STQC-KLDPSVTMCVNQTVQTCAYSYSYGDKSATIGFLDVETVSFVA----GASVPGVVF 146
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
GCG NN G+F S TGI G G G +SL SQ++
Sbjct: 147 GCGLNNTGIFRSNETGIAGFGRGPLSLPSQLK 178
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/126 (31%), Positives = 62/126 (49%), Gaps = 6/126 (4%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV-ADPTGSLELCYSFNSLSQ---VPEVTI 308
+IDSGT T LP + ++ ++ V ++ TG L LC+S L + VP++ +
Sbjct: 259 IIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPL-LCFSAPPLGKAPHVPKLVL 317
Query: 309 HFRGADVKLSRSNFFVKVSEDIVCSV-FKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
HF GA + L R N+ + + CS+ I + I GN Q N V YD++ +SF
Sbjct: 318 HFEGATMHLPRENYVFEAKDGGNCSICLAIIEGEMTIIGNFQQQNMHVLYDLKNSKLSFV 377
Query: 368 PTDCTK 373
C K
Sbjct: 378 RAKCDK 383
>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
Length = 474
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 69/195 (35%), Positives = 99/195 (50%), Gaps = 14/195 (7%)
Query: 52 QRLRDALTRSLNRLNHF--NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVAD 109
+ LR RS R + +S S D +P+ YL+ ++IGTPP + D
Sbjct: 71 ELLRRMAARSKARSARLLSGRAASARMDPGSYTDGVPDT-EYLVHMAIGTPPQPVQLILD 129
Query: 110 TGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC------SG 163
TGSDL WTQC PC C+ Q P F+P S T+ LPC C L SC +G
Sbjct: 130 TGSDLTWTQCAPC--VSCFRQSLPRFNPSRSMTFSVLPCDLRICRDLTWSSCGEQSWGNG 187
Query: 164 VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ--AVALPGITFGCGTNNGGLFNSKTTGI 221
+ C Y+ +Y D S + G+L ++T + S ++P +TFGCG N G+F S TGI
Sbjct: 188 I-CVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNNGIFVSNETGI 246
Query: 222 VGLGGGDISLISQMR 236
G G +S+ +Q++
Sbjct: 247 AGFSRGALSMPAQLK 261
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 37/128 (28%), Positives = 60/128 (46%), Gaps = 9/128 (7%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSF--NSLSQVPEVTIH 309
++DSGT +T LP+ + + + + V + T SL +LC+S + VP + +H
Sbjct: 348 IVDSGTGMTMLPEAVYNLVCDAFVAQTKLT-VHNSTSSLSQLCFSVPPGAKPDVPALVLH 406
Query: 310 FRGADVKLSRSNFFVKVSE----DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
F GA + L R N+ ++ E + C + + GN Q N V YD+ +S
Sbjct: 407 FEGATLDLPRENYMFEIEEAGGIRLTCLAINA-GEDLSVIGNFQQQNMHVLYDLANDMLS 465
Query: 366 FKPTDCTK 373
F P C K
Sbjct: 466 FVPARCNK 473
>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
Length = 489
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 60/146 (41%), Positives = 81/146 (55%), Gaps = 10/146 (6%)
Query: 70 QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM 129
Q SSS S + + Y R+ +GTPP V DTGSD++W QC PC +CY
Sbjct: 128 QGGGFSSSVTS--GLAQGSGEYFTRLGVGTPPKYVYMVLDTGSDVVWIQCAPC--RKCYS 183
Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVT 188
Q P+FDPK S ++ S+ C S C L+ C S +C Y V+YGDGSF+ G +TET+T
Sbjct: 184 QTDPVFDPKKSGSFSSISCRSPLCLRLDSPGCNSRQSCLYQVAYGDGSFTFGEFSTETLT 243
Query: 189 LGSTTGQAVALPGITFGCGTNNGGLF 214
+ +P + GCG +N GLF
Sbjct: 244 F-----RGTRVPKVALGCGHDNEGLF 264
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 43/155 (27%), Positives = 71/155 (45%), Gaps = 12/155 (7%)
Query: 222 VGLGGGDISLI--SQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMI 279
+ +GG ++ I S + AGN ++IDSGT++T L + +L +
Sbjct: 341 ISVGGARVAGITASLFKLDTAGNG-------GVIIDSGTSVTRLTRRAYVSLRDAFRAGA 393
Query: 280 EAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRGADVKLSRSNFFVKVSED-IVCSVFK 336
A + C+ + ++V P V +HFRGADV L +N+ + V + + C F
Sbjct: 394 ADLKRAPDYSLFDTCFDLSGKTEVKVPTVVMHFRGADVSLPATNYLIPVDTNGVFCFAFA 453
Query: 337 GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
G + + I GNI Q F V +D+ + F C
Sbjct: 454 GTMSGLSIIGNIQQQGFRVVFDVAASRIGFAARGC 488
>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
Length = 448
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 69/195 (35%), Positives = 99/195 (50%), Gaps = 14/195 (7%)
Query: 52 QRLRDALTRSLNRLNHF--NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVAD 109
+ LR RS R + +S S D +P+ YL+ ++IGTPP + D
Sbjct: 45 ELLRRMAARSKARSARLLSGRAASARMDPGSYTDGVPDT-EYLVHMAIGTPPQPVQLILD 103
Query: 110 TGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC------SG 163
TGSDL WTQC PC C+ Q P F+P S T+ LPC C L SC +G
Sbjct: 104 TGSDLTWTQCAPC--VSCFRQSLPRFNPSRSMTFSVLPCDLRICRDLTWSSCGEQSWGNG 161
Query: 164 VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ--AVALPGITFGCGTNNGGLFNSKTTGI 221
+ C Y+ +Y D S + G+L ++T + S ++P +TFGCG N G+F S TGI
Sbjct: 162 I-CVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNNGIFVSNETGI 220
Query: 222 VGLGGGDISLISQMR 236
G G +S+ +Q++
Sbjct: 221 AGFSRGALSMPAQLK 235
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 37/128 (28%), Positives = 60/128 (46%), Gaps = 9/128 (7%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSF--NSLSQVPEVTIH 309
++DSGT +T LP+ + + + + V + T SL +LC+S + VP + +H
Sbjct: 322 IVDSGTGMTMLPEAVYNLVCDAFVAQTKLT-VHNSTSSLSQLCFSVPPGAKPDVPALVLH 380
Query: 310 FRGADVKLSRSNFFVKVSE----DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
F GA + L R N+ ++ E + C + + GN Q N V YD+ +S
Sbjct: 381 FEGATLDLPRENYMFEIEEAGGIRLTCLAINA-GEDLSVIGNFQQQNMHVLYDLANDMLS 439
Query: 366 FKPTDCTK 373
F P C K
Sbjct: 440 FVPARCNK 447
>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 482
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 73/221 (33%), Positives = 112/221 (50%), Gaps = 22/221 (9%)
Query: 30 SVELIHRDSPKSPFYNSSETPY------------QRLRDALTRSLNRLNHFNQNSSISSS 77
S+E++H+ P S +S + +R++ +R L N+ + S+
Sbjct: 66 SLEVVHKHGPCSQLNHSGKAEATISHNDIMNLDNERVKYIQSRLSKNLGGENRVKELDST 125
Query: 78 KA-SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
+++ + +A+Y + + +GTP + + DTGS L WTQCEPC S CY Q P+FD
Sbjct: 126 TLPAKSGRLIGSADYYVVVGLGTPKRDLSLIFDTGSYLTWTQCEPCAGS-CYKQQDPIFD 184
Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSG---VNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
P SS+Y ++ C+SS C CS +C Y V YGD S S G L+ E +T+ +T
Sbjct: 185 PSKSSSYTNIKCTSSLCTQFRSAGCSSSTDASCIYDVKYGDNSISRGFLSQERLTITATD 244
Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
+ FGCG +N GLF T G++GL IS + Q
Sbjct: 245 ----IVHDFLFGCGQDNEGLFRG-TAGLMGLSRHPISFVQQ 280
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 54/124 (43%), Gaps = 5/124 (4%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHF 310
+IDSGT +T LP + L S + PVA T L+ CY F+ + VP + F
Sbjct: 358 IIDSGTVITRLPPTAYAALRSAFRQFMMKYPVAYGTRLLDTCYDFSGYKEISVPRIDFEF 417
Query: 311 RGA-DVKLSRSNFFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
G V+L S +C F G N + I+GN+ Q V YD+E + F
Sbjct: 418 AGGVKVELPLVGILYGESAQQLCLAFAANGNGNDITIFGNVQQKTLEVVYDVEGGRIGFG 477
Query: 368 PTDC 371
C
Sbjct: 478 AAGC 481
>gi|413938607|gb|AFW73158.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 478
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 122/424 (28%), Positives = 164/424 (38%), Gaps = 96/424 (22%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS------ISSSKASQADI 84
+ L HR P +P SS + D L R + + S S A+ A
Sbjct: 68 LRLTHRHGPCAPSRASSLA-APSVADTLRADQRRAEYILRRVSGRAPQLWDSKAAAAAAT 126
Query: 85 IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS-QCYMQDSPLFD 136
+P NY++ S+GTP + DTGSDL W QC+PC + CY Q PLFD
Sbjct: 127 VPASWGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFD 186
Query: 137 PKMSSTYKSLPCSSSQCASLN---QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
P SS+Y ++PC CA L +CS C Y VSYGDGS + G +++T+TL +++
Sbjct: 187 PAQSSSYAAVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASS 246
Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ--------------MRTTI 239
A+ G FGCG GLFN G++GLG SL+ Q + +
Sbjct: 247 ----AVQGFFFGCGHAQSGLFNG-VDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPST 301
Query: 240 AGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIE------------------- 280
AG LGV P +T LP VM + I
Sbjct: 302 AGYLTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVV 361
Query: 281 ----------------------------AQPVADPTGSLELCYSFNSLSQV--PEVTIHF 310
P A G L+ CY+F V P V + F
Sbjct: 362 DTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTF 421
Query: 311 -RGADVKLSRSNFFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
GA V L C F G + I GN+ Q +F V I+ +V FK
Sbjct: 422 GSGATVTLGADGIL-----SFGCLAFAPSGSDGGMAILGNVQQRSFEV--RIDGTSVGFK 474
Query: 368 PTDC 371
P+ C
Sbjct: 475 PSSC 478
>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
Length = 488
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 110/421 (26%), Positives = 165/421 (39%), Gaps = 84/421 (19%)
Query: 30 SVELIHRDSPKSPFYN-SSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNN 88
S+ ++HR P SP + S P + L R +R++ + + SS+K + N
Sbjct: 72 SLTVVHRHGPCSPLRSRGSGAPSHT--EILRRDQDRVDAIRRKVTASSNKPKGGVSLLAN 129
Query: 89 -------ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
NY+ + +GTP TE + DTGSD W QC+PC + CY Q P+FDP SS
Sbjct: 130 WGKSLSTTNYVASLRLGTPATELVVELDTGSDQSWVQCKPC--ADCYEQRDPVFDPTASS 187
Query: 142 TYKSLPCSSSQCASLNQKSCSGV-------NCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
TY ++PC + +C L S S NC Y VSY D S + G+LA +T+TL +
Sbjct: 188 TYSAVPCGARECQELASSSSSRNCSSDNNKNCPYEVSYDDDSHTVGDLARDTLTLSPSPS 247
Query: 195 QAVA--LPGITFGCGTNNGGLFNS----------------------------------KT 218
+ A +PG FGCG +N G F
Sbjct: 248 PSPADTVPGFVFGCGHSNAGTFGEVDGLLGLGLGKASLPSQVAARYGAAFSYCLPSSPSA 307
Query: 219 TGIVGLGGGDISLISQMRTTIAGN----------------QRLGVSTPDIVIDSGTTL-- 260
G + GG +Q + G + + V +GT +
Sbjct: 308 AGYLSFGGAAARANAQFTEMVTGQDPTSYYLNLTGIVVAGRAIKVPASAFATAAGTIIDS 367
Query: 261 -----TFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSF--NSLSQVPEVTIHFR- 311
P Y + S S+M + P+ + + CY F + ++P V + F
Sbjct: 368 GTAFSRLPPSAYAALRSSFRSAMGRYRYKRAPSSPIFDTCYDFTGHETVRIPAVELVFAD 427
Query: 312 GADVKLSRSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
GA V L S + D+ + + N + I GN Q V YD+ Q + F
Sbjct: 428 GATVHLHPSGVLYTWN-DVAQTCLAFVPNHDLGILGNTQQRTLAVIYDVGSQRIGFGRKG 486
Query: 371 C 371
C
Sbjct: 487 C 487
>gi|413938615|gb|AFW73166.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 386
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 107/352 (30%), Positives = 142/352 (40%), Gaps = 82/352 (23%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS-QCYMQDSPLFDPKMSSTYKSLPC 148
NY++ S+GTP + DTGSDL W QC+PC + CY Q PLFDP SS+Y ++PC
Sbjct: 47 NYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPC 106
Query: 149 SSSQCASLN---QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
CA L +CS C Y VSYGDGS + G +++T+TL +++ A+ G FG
Sbjct: 107 GGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASS----AVQGFFFG 162
Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQ--------------MRTTIAGNQRLGVSTPD 251
CG GLFN G++GLG SL+ Q + + AG LGV P
Sbjct: 163 CGHAQSGLFNG-VDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPS 221
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIE------------------------------- 280
+T LP VM + I
Sbjct: 222 GAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVVTRLPPT 281
Query: 281 ----------------AQPVADPTGSLELCYSFNSLSQV--PEVTIHF-RGADVKLSRSN 321
P A G L+ CY+F V P V + F GA V L
Sbjct: 282 AYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADG 341
Query: 322 FFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
C F G + I GN+ Q +F V I+ +V FKP+ C
Sbjct: 342 IL-----SFGCLAFAPSGSDGGMAILGNVQQRSFEV--RIDGTSVGFKPSSC 386
>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 414
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 80/224 (35%), Positives = 113/224 (50%), Gaps = 25/224 (11%)
Query: 33 LIHRD----SPKSPFYNSSETPYQRLRDALTRSL-NRLNHF---NQNSSISSSKASQADI 84
+ HRD S KS +N L D RSL +R+ N ++ S + +
Sbjct: 1 MKHRDFCNSSGKSTDWNKKLQKSLILDDFRVRSLQSRIKSIFSGNNIDALDSQIPLSSGV 60
Query: 85 IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
NY++ + IG + DTGSDL W QC+PC CY Q PLF+P S +Y+
Sbjct: 61 RLQTLNYIVTVEIGG--RNMTVIVDTGSDLTWVQCQPC--RLCYNQQDPLFNPSGSPSYQ 116
Query: 145 SLPCSSSQCASLNQKSCS-GV------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
++ C+SS C SL + + GV C Y V+YGDGS++ G+L E + LG+T
Sbjct: 117 TILCNSSTCQSLQYATGNLGVCGSNTPTCNYVVNYGDGSYTRGDLGMEQLNLGTT----- 171
Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
+ FGCG NN GLF +G++GLG D+SL+SQ G
Sbjct: 172 HVSNFIFGCGRNNKGLFGG-ASGLMGLGKSDLSLVSQTSAIFEG 214
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 36/128 (28%), Positives = 58/128 (45%), Gaps = 7/128 (5%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
I+IDSGT +T LP +L + P A P L+ C++ N +V P + +
Sbjct: 285 ILIDSGTVITRLPPPVYRDLKAEFLKQFSGFPSAPPFSILDTCFNLNGYDEVDIPTIRMQ 344
Query: 310 FRG---ADVKLSRSNFFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTV 364
F G V ++ +FVK VC ++ + +PI GN Q N V Y+ ++ +
Sbjct: 345 FEGNAELTVDVTGIFYFVKTDASQVCLALASLSFDDEIPIIGNYQQRNQRVIYNTKESKL 404
Query: 365 SFKPTDCT 372
F C+
Sbjct: 405 GFAAEACS 412
>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 478
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 123/424 (29%), Positives = 172/424 (40%), Gaps = 96/424 (22%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS------ISSSKASQADI 84
+ L HR P +P SS + D L R + + S S A+ A
Sbjct: 68 LRLTHRHGPCAPSRASSLA-APSVADTLRADQRRAEYILRRVSGRAPQLWDSKAAAAAAT 126
Query: 85 IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS-QCYMQDSPLFD 136
+P NY++ S+GTP + DTGSDL W QC+PC + CY Q PLFD
Sbjct: 127 VPASWGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFD 186
Query: 137 PKMSSTYKSLPCSSSQCASLN---QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
P SS+Y ++PC CA L +CS C Y VSYGDGS + G +++T+TL +++
Sbjct: 187 PAQSSSYAAVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASS 246
Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ--------------MRTTI 239
A+ G FGCG GLFN G++GLG SL+ Q + +
Sbjct: 247 ----AVQGFFFGCGHAQSGLFNG-VDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPST 301
Query: 240 AGNQRLGVSTPD---------------------IVIDSGTTL------------------ 260
AG LG+ P +V+ +G ++
Sbjct: 302 AGYLTLGLGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVV 361
Query: 261 ------TFLPQGYNSNLLSVMSSMIEA--QPVADPTGSLELCYSFNSLSQV--PEVTIHF 310
T LP + L S S + + P A G L+ CY+F V P V + F
Sbjct: 362 DTGTVITRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTF 421
Query: 311 -RGADVKLSRSNFFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
GA V L C F G + I GN+ Q +F V I+ +V FK
Sbjct: 422 GSGATVMLGADGIL-----SFGCLAFAPSGSDGGMAILGNVQQRSFEV--RIDGTSVGFK 474
Query: 368 PTDC 371
P+ C
Sbjct: 475 PSSC 478
>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 367
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 62/144 (43%), Positives = 78/144 (54%), Gaps = 4/144 (2%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y + +GTP + V DTGSD+ W QC PC + CY Q LF+P SS++K L C
Sbjct: 14 GEYFAVVGVGTPRRDMYLVVDTGSDITWLQCAPC--TNCYKQKDALFNPSSSSSFKVLDC 71
Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA-VALPGITFGCG 207
SSS C +L+ C C Y YGDGSF+ G L T+ V L G V L I GCG
Sbjct: 72 SSSLCLNLDVMGCLSNKCLYQADYGDGSFTMGELVTDNVVLDDAFGPGQVVLTNIPLGCG 131
Query: 208 TNNGGLFNSKTTGIVGLGGGDISL 231
+N G F + GI+GLG G +S
Sbjct: 132 HDNEGTFGT-AAGILGLGRGPLSF 154
>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
Length = 415
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 99/354 (27%), Positives = 140/354 (39%), Gaps = 97/354 (27%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
YL+ ++IGTPP DTGSDLIWTQC+PCP C+ Q P FDP SST C
Sbjct: 88 EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCP--ACFDQALPYFDPSTSSTLSLTSCD 145
Query: 150 SSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
S+ C L S ++ T G ++PG+ FGCG
Sbjct: 146 STLCQGLPVASLP--------------------RSDKFTF---VGAGASVPGVAFGCGLF 182
Query: 210 NGGLFNSKTTGIVGLGGGDISLISQMR--------TTI---------------------- 239
N G+F S TGI G G G +SL SQ++ TTI
Sbjct: 183 NNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTTITGAIPSTVLLDLPADLFSNGQG 242
Query: 240 -------------------------AGNQRLGVSTPDIVIDSG---------TTLTFLPQ 265
G+ RL V + + +G T +T LP
Sbjct: 243 AVQTTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGTGGTIIDSGTAMTSLPT 302
Query: 266 GYNSNLLSVMSSMIEAQPVADPTGSLELCYS--FNSLSQVPEVTIHFRGADVKLSRSNFF 323
+ ++ ++ V+ T C S + VP++ +HF GA + L R N+
Sbjct: 303 RVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEGATMDLPRENYV 362
Query: 324 VKVSE---DIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
+V + I+C ++ +G V GN Q N V YD++ +SF P C K
Sbjct: 363 FEVEDAGSSILCLAIIEG--GEVTTIGNFQQQNMHVLYDLQNSKLSFVPAQCDK 414
>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 434
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 122/429 (28%), Positives = 178/429 (41%), Gaps = 80/429 (18%)
Query: 11 LFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQR---LRDALTRSLNRLNH 67
L + F + P + + F++ L H S K+ E+P + L T + +RL+
Sbjct: 13 LLIILFALTCPKQCTSYRFTLRL-HTKSIKT-----KESPKIKPGYLHSKSTPAPSRLD- 65
Query: 68 FNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
N ++ + S IPN A +L ISIG PP +L + DTGSDL W QC PC +C
Sbjct: 66 -NLWTTEIADIVSHVTPIPNPAAFLANISIGDPPVPQLLLIDTGSDLTWIQCLPC---KC 121
Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCA--SLNQKSCSGVNCQYSVSYGDGSFSNGNLATE 185
Y Q P F P SSTY++ C S+ A + + +G NC+Y + Y D S + G LA E
Sbjct: 122 YPQTIPFFHPSRSSTYRNASCESAPHAMPQIFRDEKTG-NCRYHLRYRDFSNTRGILAKE 180
Query: 186 TVTLGSTTGQAVALPGITFGCGTNNGGL--------------------FNSKTTGIVG-- 223
+T ++ ++ P I FGCG +N G F SK + G
Sbjct: 181 KLTFQTSDEGLISKPNIVFGCGQDNSGFTQYSGVLGLGPGTFSIVTRNFGSKFSYCFGSL 240
Query: 224 -----------LGGGD----------------------ISLISQMRTTIAGNQRLGVSTP 250
LG G ISL ++ G + S
Sbjct: 241 IDPTYPHNFLILGNGARIEGDPTPLQIFQDRYYLDLQAISLGEKLLDIEPGIFQRYRSKG 300
Query: 251 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEA--QPVADPTGSLELCYSFN---SLSQVPE 305
VID+G + T L + L + ++ + V D CY N L P
Sbjct: 301 GTVIDTGCSPTILAREAYETLSEEIDFLLGEVLRRVKDWEQYTNHCYEGNLKLDLYGFPV 360
Query: 306 VTIHFRG-ADVKLSRSNFFVK-VSEDIVCSVFKGIT-NSVPIYGNIMQTNFLVGYDIEQQ 362
VT HF G A++ L + FV S D C T + + + G + Q N+ VGY++
Sbjct: 361 VTFHFAGGAELALDVESLFVSSESGDSFCLAMTMNTFDDMSVIGAMAQQNYNVGYNLRTM 420
Query: 363 TVSFKPTDC 371
V F+ TDC
Sbjct: 421 KVYFQRTDC 429
>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 116/221 (52%), Gaps = 24/221 (10%)
Query: 35 HRDSPKSPFYNSSETPYQRL-------RDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
H+DS + ++ +RL R +R N + N + S+ + + I
Sbjct: 3 HKDSCSGKILDWNKKLQKRLIMDNFQLRSLQSRIKNIILSGNIDDSVDTQIPLTSGIRLQ 62
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ NY++ + +G + + DTGSDL W QC+PC ++CY Q P+F+P S +Y+++
Sbjct: 63 SLNYIVTVELGG--RKMTVIVDTGSDLSWVQCQPC--NRCYNQQDPVFNPSKSPSYRTVL 118
Query: 148 CSSSQCASLNQKSC-SGV------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
C+S C SL + SGV C Y V+YGDGS+++G + E + LG+TT +
Sbjct: 119 CNSLTCRSLQLATGNSGVCGSNPPTCNYVVNYGDGSYTSGEVGMEHLNLGNTT-----VN 173
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
FGCG N GLF +G+VGLG D+SLISQ+ G
Sbjct: 174 NFIFGCGRKNQGLFGG-ASGLVGLGRTDLSLISQISPMFGG 213
>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
Length = 393
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 73/195 (37%), Positives = 102/195 (52%), Gaps = 14/195 (7%)
Query: 52 QRLRDALTRSLNRLNHF--NQNSSISSSKASQADI----IPNNANYLIRISIGTPPTERL 105
+ +R + +S R+ NSS SS A D+ P+ Y++ IS+GTP
Sbjct: 10 EAIRGLVAKSHARVRWMAARANSSSWSSMAGTTDVESPLHPDGGGYVMDISVGTPGKRFR 69
Query: 106 AVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN 165
A+ADTGSDL+W Q EPC + C +FDP+ SST++ + CSS C L G +
Sbjct: 70 AIADTGSDLVWVQSEPC--TGC--SGGTIFDPRQSSTFREMDCSSQLCTELPGSCEPGSS 125
Query: 166 -CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
C YS YG G + G A +T++LG+T+G + P GCG N G G+VGL
Sbjct: 126 ACSYSYEYGSGE-TEGEFARDTISLGTTSGGSQKFPSFAVGCGMVNSGF--DGVDGLVGL 182
Query: 225 GGGDISLISQMRTTI 239
G G +SL SQ+ I
Sbjct: 183 GQGPVSLTSQLSAAI 197
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 47/125 (37%), Positives = 65/125 (52%), Gaps = 7/125 (5%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY--SFNSLSQVPEVTIHF 310
+IDSGTTLT++P G +LS M SM+ V + L+LCY S N + P +TI
Sbjct: 267 IIDSGTTLTYVPSGVYGRVLSRMESMVTLPRVDGSSMGLDLCYDRSSNRNYKFPALTIRL 326
Query: 311 RGADVKLSRSNFFVKV--SEDIVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTVSF 366
GA + SN+F+ V S D VC + G +P I GN+MQ + + YD +SF
Sbjct: 327 AGATMTPPSSNYFLVVDDSGDTVC-LAMGSAGGLPVSIIGNVMQQGYHILYDRGSSELSF 385
Query: 367 KPTDC 371
C
Sbjct: 386 VQAKC 390
>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 69/198 (34%), Positives = 98/198 (49%), Gaps = 26/198 (13%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ-NSSISSSKASQAD------ 83
V+L H D+ +S ETP L R +R+ +++ S+ ++A
Sbjct: 80 VQLHHLDA-----LSSDETPQDLFNSRLARDASRVKSLTSLAAAVGSTNRTRARGPGFSS 134
Query: 84 -----IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
+ + Y R+ +GTP V DTGSD++W QC PC +CY Q P+F+P
Sbjct: 135 SVTSGLAQGSGEYFTRLGVGTPARYVFMVLDTGSDVVWIQCAPC--KKCYSQTDPVFNPT 192
Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
S ++ ++PC S C L+ CS C Y VSYGDGSF+ G +TET+T T
Sbjct: 193 KSRSFANIPCGSPLCRRLDSPGCSTKKHICLYQVSYGDGSFTYGEFSTETLTFRGTRVGR 252
Query: 197 VALPGITFGCGTNNGGLF 214
VAL GCG +N GLF
Sbjct: 253 VAL-----GCGHDNEGLF 265
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 56/123 (45%), Gaps = 3/123 (2%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
++IDSGT++T L + L A + C+ + ++V P V +H
Sbjct: 367 VIIDSGTSVTRLTRPAYVALRDAFRVGASNLKRAPEFSLFDTCFDLSGKTEVKVPTVVLH 426
Query: 310 FRGADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
FRGADV L SN+ + V C F G + + I GNI Q F V YD+ V F P
Sbjct: 427 FRGADVSLPASNYLIPVDNSGSFCFAFAGTMSGLSIVGNIQQQGFRVVYDLAASRVGFAP 486
Query: 369 TDC 371
C
Sbjct: 487 RGC 489
>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
Length = 500
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 53/130 (40%), Positives = 79/130 (60%), Gaps = 8/130 (6%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ Y R+ +G+P + V DTGSD+ W QC+PC + CY Q P+FDP +S++Y S+
Sbjct: 159 GSGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPC--ADCYQQSDPVFDPSLSTSYASV 216
Query: 147 PCSSSQCASLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
C + +C L+ +C S C Y V+YGDGS++ G+ ATET+TLG + + +
Sbjct: 217 ACDNPRCHDLDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTLGDS----APVSSVAI 272
Query: 205 GCGTNNGGLF 214
GCG +N GLF
Sbjct: 273 GCGHDNEGLF 282
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 32/130 (24%), Positives = 54/130 (41%), Gaps = 4/130 (3%)
Query: 246 GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QV 303
G +++DSGT +T L + L ++ P + CY + + +V
Sbjct: 371 GTGAGGVIVDSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVEV 430
Query: 304 PEVTIHFR-GADVKLSRSNFFVKVS-EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQ 361
P V++ F G +++L N+ + V C F +V I GN+ Q V +D +
Sbjct: 431 PAVSLRFAGGGELRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAK 490
Query: 362 QTVSFKPTDC 371
TV F C
Sbjct: 491 STVGFTSNKC 500
>gi|115434870|ref|NP_001042193.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|55296112|dbj|BAD67831.1| putative CDR1 [Oryza sativa Japonica Group]
gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa Japonica Group]
gi|113531724|dbj|BAF04107.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|125569253|gb|EAZ10768.1| hypothetical protein OsJ_00604 [Oryza sativa Japonica Group]
Length = 454
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 79/210 (37%), Positives = 113/210 (53%), Gaps = 27/210 (12%)
Query: 24 AQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD 83
A GGFSVE IHRDSP+SPF++ + T + R A RS+ R ++S S+S AD
Sbjct: 29 ASGGGFSVEFIHRDSPRSPFHDPAFTAHGRALAAARRSVARAAAIAGSASSSASGGGAAD 88
Query: 84 -----IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE---------PCPPSQCYM 129
++ + YL+ +++G+PP LA+ADTGSDL+W +C+ P +Q
Sbjct: 89 DVVSKVVSRSFEYLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQ--- 145
Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVT 188
FDP SSTY + C + C +L + +C G NC Y +YGDGS + G L+TET T
Sbjct: 146 -----FDPSRSSTYGRVSCQTDACEALGRATCDDGSNCAYLYAYGDGSNTTGVLSTETFT 200
Query: 189 L----GSTTGQAVALPGITFGCGTNNGGLF 214
+ + V + G+ FGC T G F
Sbjct: 201 FDDGGSGRSPRQVRVGGVKFGCSTATAGSF 230
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 49/128 (38%), Positives = 66/128 (51%), Gaps = 8/128 (6%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS-----FNSLSQVPEV 306
I++DSGTTLTFL ++ +S I PV P G L+LCY+ + +P++
Sbjct: 323 IIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQSPDGLLQLCYNVAGREVEAGESIPDL 382
Query: 307 TIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQT 363
T+ F GA V L N FV V E +C T P I GN+ Q N VGYD++ T
Sbjct: 383 TLEFGGGAAVALKPENAFVAVQEGTLCLAIVATTEQQPVSILGNLAQQNIHVGYDLDAGT 442
Query: 364 VSFKPTDC 371
V+F DC
Sbjct: 443 VTFAGADC 450
>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
Length = 452
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 80/225 (35%), Positives = 113/225 (50%), Gaps = 24/225 (10%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD------ 83
SV L HR P SP +S + L R R ++ + S S+ A+ D
Sbjct: 34 SVTLSHRYGPCSPADPNSGEKRPTDEELLRRDQLRADYIRRKFSGSNGTAAGEDGQSSKV 93
Query: 84 IIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP-PSQCYMQDSPLF 135
+P + Y+I + +G+P + V DTGSD+ W QCEPCP PS C+ LF
Sbjct: 94 SVPTTLGSSLDTLEYVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALF 153
Query: 136 DPKMSSTYKSLPCSSSQCASL-NQKSCSGVN----CQYSVSYGDGSFSNGNLATETVTLG 190
DP SSTY + CS++ CA L + +G + CQY V YGDGS + G +++ +TL
Sbjct: 154 DPAASSTYAAFNCSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYSSDVLTL- 212
Query: 191 STTGQAVALPGITFGCGTNN-GGLFNSKTTGIVGLGGGDISLISQ 234
+G V + G FGC G + KT G++GLGG S +SQ
Sbjct: 213 --SGSDV-VRGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSPVSQ 254
Score = 43.1 bits (100), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 37/136 (27%), Positives = 60/136 (44%), Gaps = 14/136 (10%)
Query: 232 ISQMRTTIAGNQRLGVSTPDI-----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVAD 286
+ + G ++LG+S P + ++DSGT +T LP + L S + + A+
Sbjct: 309 FAALEDIAVGGKKLGLS-PSVFAAGSLVDSGTVITRLPPAAYAALSSAFRAGMTRYARAE 367
Query: 287 PTGSLELCYSFNSLSQV--PEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITN--SV 342
P G L+ C++F L +V P V + F G V ++ V C F + +
Sbjct: 368 PLGILDTCFNFTGLDKVSIPTVALVFAGGAVVDLDAHGIVSGG----CLAFAPTRDDKAF 423
Query: 343 PIYGNIMQTNFLVGYD 358
GN+ Q F V YD
Sbjct: 424 GTIGNVQQRTFEVLYD 439
>gi|115466068|ref|NP_001056633.1| Os06g0119600 [Oryza sativa Japonica Group]
gi|55296446|dbj|BAD68569.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|55296924|dbj|BAD68375.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594673|dbj|BAF18547.1| Os06g0119600 [Oryza sativa Japonica Group]
gi|215694767|dbj|BAG89958.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737752|dbj|BAG96882.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 495
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 94/343 (27%), Positives = 145/343 (42%), Gaps = 78/343 (22%)
Query: 98 GTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN 157
GT + + D+GSD+ W QC+PCP C+ Q PLFDP MS+TY ++PC+S+ CA L
Sbjct: 162 GTSAVTQTVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLG 221
Query: 158 --QKSCSG-VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG-TNNGGL 213
++ CS CQ+ ++YGDGS + G + + +TLG + G FGC + G
Sbjct: 222 PYRRGCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYD----VIRGFRFGCAHADRGSA 277
Query: 214 FNSKTTGIVGLGGGDISLISQMRTTIA--------------GNQRLG------------V 247
F+ G + LGGG SL+ Q T G LG V
Sbjct: 278 FDYDVAGSLALGGGSQSLVQQTATRYGRVFSYCLPPTASSLGFLVLGVPPERAQLIPSFV 337
Query: 248 STPDI---------------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP--------- 283
STP + +I +G L P ++++ + S++I P
Sbjct: 338 STPLLSSSMAPTFYRVLLRAIIVAGRPLAVPPAVFSASSVIDSSTIISRLPPTAYQALRA 397
Query: 284 ----------VADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFVKVSEDI 330
A P L+ CY F + + P + + F GA V L + +
Sbjct: 398 AFRSAMTMYRAAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILLG----- 452
Query: 331 VCSVFK-GITNSVPIY-GNIMQTNFLVGYDIEQQTVSFKPTDC 371
C F ++ +P + GN+ Q V YD+ + + F+ C
Sbjct: 453 SCLAFAPTASDRMPGFIGNVQQKTLEVVYDVPAKAMRFRTAAC 495
>gi|147858841|emb|CAN78694.1| hypothetical protein VITISV_037475 [Vitis vinifera]
Length = 442
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 109/439 (24%), Positives = 181/439 (41%), Gaps = 82/439 (18%)
Query: 10 ILFFLCF-YVVSP---IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRL 65
+L +CF ++ SP + + GFS LIH SP SP+ N + AL +L+R
Sbjct: 7 LLLIICFTFIFSPCISAASDSKGFSTNLIHIHSPSSPYKNVKAESLAK-DTALESTLSRH 65
Query: 66 NHFN--QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP 123
+ Q ++ + +I + + +L +SIG PPT V DTGSDL W QCEPC
Sbjct: 66 AYLRARQQKALQPADFVPPPLIRDKSAFLANLSIGNPPTNVYVVLDTGSDLFWIQCEPC- 124
Query: 124 PSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK-SCSGV-NCQYSVSYGDGSFSNGN 181
CY Q P+++ S +Y + C+ C SL ++ CS +C Y +Y DG+ ++G
Sbjct: 125 -DVCYKQKDPIYNRTKSDSYTEMLCNEPPCVSLGREGQCSDSGSCLYQTAYADGARTSGL 183
Query: 182 LATETVTLGSTTGQAVALPGITFGCGTNN------------------------------- 210
L+ E V S + FGCG N
Sbjct: 184 LSYEKVAFTSHYSDEDKTAQVGFGCGLQNLNFITSNRDGGVLGLGPGLVSLVSQLSAIGK 243
Query: 211 ---------GGLFNSKTTGIVGLG-----GGDIS--LISQMRTTIAGNQRLGVSTPDI-- 252
G + N G + G GD++ +I++ LGV P +
Sbjct: 244 VSKSFAYCFGNISNPNAGGFLVFGDATYLNGDMTPMVIAEFYYVNLLGIGLGVGEPRLDI 303
Query: 253 ---------------VIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS 296
+IDSG+TL+ F P+ Y +V+ + + ++ T S + C+
Sbjct: 304 NSSSFERKPDGSGGVIIDSGSTLSVFPPEVYEVVRNAVVDKLKKGYNISPLTSSPD-CFE 362
Query: 297 ---FNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNF 353
L P + ++ + R + F++ +++ C F + I G + Q ++
Sbjct: 363 GKIERDLPLFPTLVLYLESTGILNDRWSIFLQRYDELFCLGFTS-GEGLSIIGTLAQQSY 421
Query: 354 LVGYDIEQQTVSFKPT-DC 371
GY++E T+S + DC
Sbjct: 422 KFGYNLELSTLSIESNPDC 440
>gi|21717166|gb|AAM76359.1|AC074196_17 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433306|gb|AAP54835.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|125575546|gb|EAZ16830.1| hypothetical protein OsJ_32301 [Oryza sativa Japonica Group]
Length = 373
Score = 114 bits (285), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 92/354 (25%), Positives = 149/354 (42%), Gaps = 83/354 (23%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y+ ++IGTPP A+ + +WTQC PC +C+ QD PLF+ SSTY+ PC +
Sbjct: 28 YMANLTIGTPPQPASAIIHLAGEFVWTQCSPC--RRCFKQDLPLFNRSASSTYRPEPCGT 85
Query: 151 SQCASLNQKSCSGVN-CQYSVS--YGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
+ C S+ +CSG C Y V +GD S G T+T +G+ T + FGC
Sbjct: 86 ALCESVPASTCSGDGVCSYEVETMFGDTSGIGG---TDTFAIGTATAS------LAFGCA 136
Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTT-------------------------IAGN 242
++ +G+VGLG SL+ QM T +AG
Sbjct: 137 MDSNIKQLLGASGVVGLGRTPWSLVGQMNATAFSYCLAPHGAAGKKSALLLGASAKLAGG 196
Query: 243 QRL--------------------GVSTPDIVI-----------DSGTTLTFLPQGYNSNL 271
+ G+ D++I D+ ++FL +
Sbjct: 197 KSAATTPLVNTSDDSSDYMIHLEGIKFGDVIIAPPPNGSVVLVDTIFGVSFLVDAAFQAI 256
Query: 272 LSVMSSMIEAQPVADPTGSLELCY-------SFNSLSQVPEVTIHFRG-ADVKLSRSNFF 323
++ + A P+A PT +LC+ NS +P+V + F+G A + + S +
Sbjct: 257 KKAVTVAVGAAPMATPTKPFDLCFPKAAAAAGANSSLPLPDVVLTFQGAAALTVPPSKYM 316
Query: 324 VKVSEDIVC-----SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
VC S +T + I G + Q N +D++++T+SF+P DC+
Sbjct: 317 YDAGNGTVCLAMMSSAMLNLTTELSILGRLHQENIHFLFDLDKETLSFEPADCS 370
>gi|238479902|ref|NP_001154646.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332643534|gb|AEE77055.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 350
Score = 114 bits (285), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 91/350 (26%), Positives = 149/350 (42%), Gaps = 57/350 (16%)
Query: 40 KSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGT 99
KSPF + ++ R SL R S + S AS + Y + + IG
Sbjct: 39 KSPFPSPTQALALDTRRLHFLSLRRKPIPFVKSPVVSGAAS------GSGQYFVDLRIGQ 92
Query: 100 PPTERLAVADTGSDLIWTQCEPCPPSQC-YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ 158
PP L +ADTGSDL+W +C C C + + +F P+ SST+ C C + +
Sbjct: 93 PPQSLLLIADTGSDLVWVKCSAC--RNCSHHSPATVFFPRHSSTFSPAHCYDPVCRLVPK 150
Query: 159 KSCSGV--------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN 210
+ + C Y Y DGS ++G A ET +L +++G+ L + FGCG
Sbjct: 151 PDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTSSGKEARLKSVAFGCG--- 207
Query: 211 GGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSN 270
F + G GG V+DSGTTL FL + +
Sbjct: 208 ---FRISGQSVSGNGG-------------------------TVVDSGTTLAFLAEPAYRS 239
Query: 271 LLSVMSSMIEAQPVADP-TGSLELCYSFNSLSQ----VPEVTIHFRGADVKL-SRSNFFV 324
+++ + ++ P+AD T +LC + + +++ +P + F G V + N+F+
Sbjct: 240 VIAAVRRRVKL-PIADALTPGFDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFI 298
Query: 325 KVSEDIVCSVFKGITNSV--PIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
+ E I C + + V + GN+MQ FL +D ++ + F C
Sbjct: 299 ETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCA 348
>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 494
Score = 114 bits (285), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 93/332 (28%), Positives = 138/332 (41%), Gaps = 76/332 (22%)
Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN- 165
V DT SD+ W QC PCP QC++Q PL+DP SST+ +PC S C L +G +
Sbjct: 172 VVDTSSDIPWVQCLPCPIPQCHLQKDPLYDPAKSSTFAPIPCGSPACKELGSSYGNGCSP 231
Query: 166 ----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGI 221
C+Y V+YGDG + G T+T+T+ T + + FGC G F+++ GI
Sbjct: 232 TTDECKYIVNYGDGKATTGTYVTDTLTMSPT----IVVKDFRFGCSHAVRGSFSNQNAGI 287
Query: 222 VGLGGGDISLISQM-------------RTTIAGNQRLG---------------------- 246
+ LGGG SL+ Q + + AG LG
Sbjct: 288 LALGGGRGSLLEQTADAYGNAFSYCIPKPSSAGFLSLGGPVEASLKFSYTPLIKNKHAPT 347
Query: 247 ---------------VSTPDIVIDSGTTL------TFLP-QGYNSNLLSVMSSMIEAQPV 284
++ P +G + T LP Q Y + + S+M P+
Sbjct: 348 FYIVHLEAIIVAGKQLAVPPTAFATGAVMDSGAVVTQLPPQVYAALRAAFRSAMAAYGPL 407
Query: 285 ADPTGSLELCYSFNSLS--QVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGI--T 339
A P +L+ CY F +VP+V++ F GA + L ++ + C F
Sbjct: 408 AAPVRNLDTCYDFTRFPDVKVPKVSLVFAGGATLDLEPASIILD-----GCLAFAATPGE 462
Query: 340 NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
SV GN+ Q + V YD+ V F+ C
Sbjct: 463 ESVGFIGNVQQQTYEVLYDVGGGKVGFRRGAC 494
>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
Length = 393
Score = 114 bits (284), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 73/195 (37%), Positives = 102/195 (52%), Gaps = 14/195 (7%)
Query: 52 QRLRDALTRSLNRLNHF--NQNSSISSSKASQADI----IPNNANYLIRISIGTPPTERL 105
+ +R + +S R+ NSS SS A D+ P+ Y++ IS+GTP
Sbjct: 10 EAIRALVAKSHARVRWMAARANSSSWSSMAGTTDVESPLHPDGGGYVMDISVGTPGKRFR 69
Query: 106 AVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN 165
A+ADTGSDL+W Q EPC + C +FDP+ SST++ + CSS CA L G +
Sbjct: 70 AIADTGSDLVWVQSEPC--TGC--SGGTIFDPRQSSTFREMDCSSQLCAELPGSCEPGSS 125
Query: 166 -CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
C YS YG G + G A +T++LG+T+ + P GCG N G G+VGL
Sbjct: 126 TCSYSYEYGSGE-TEGEFARDTISLGTTSDGSQKFPSFAVGCGMVNSGF--DGVDGLVGL 182
Query: 225 GGGDISLISQMRTTI 239
G G +SL SQ+ I
Sbjct: 183 GQGPVSLTSQLSAAI 197
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 47/125 (37%), Positives = 66/125 (52%), Gaps = 7/125 (5%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY--SFNSLSQVPEVTIHF 310
+IDSGTTLT++P G +LS M SM+ V + L+LCY S N + P +TI
Sbjct: 267 IIDSGTTLTYVPSGVYGRVLSRMESMVTLPRVDGSSMGLDLCYDRSSNRNYKFPALTIRL 326
Query: 311 RGADVKLSRSNFFVKV--SEDIVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTVSF 366
GA + SN+F+ V S D VC + G + +P I GN+MQ + + YD +SF
Sbjct: 327 AGATMTPPSSNYFLVVDDSGDTVC-LAMGSASGLPVSIIGNVMQQGYHILYDRGSSELSF 385
Query: 367 KPTDC 371
C
Sbjct: 386 VQAKC 390
>gi|22165126|gb|AAM93742.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|31433307|gb|AAP54836.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125575547|gb|EAZ16831.1| hypothetical protein OsJ_32302 [Oryza sativa Japonica Group]
Length = 405
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 92/355 (25%), Positives = 144/355 (40%), Gaps = 82/355 (23%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y+ +IGTPP AV D +L+WTQC PC P C+ QD PLFDP SST++ LPC S
Sbjct: 57 YVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQP--CFEQDLPLFDPTKSSTFRGLPCGS 114
Query: 151 SQCASLNQKS--CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC-- 206
C S+ + S C+ C Y G + G T+T +G+ A + FGC
Sbjct: 115 HLCESIPESSRNCTSDVCIYEAPTKAGD-TGGKAGTDTFAIGA------AKETLGFGCVV 167
Query: 207 ---------------------------------------GTNNGGLF------------N 215
G ++G LF N
Sbjct: 168 MTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVTAFSYCLAGKSSGALFLGATAKQLAGGKN 227
Query: 216 SKTTGIVGLGGGD---------ISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQG 266
S T ++ G + ++ ++T A Q S +++D+ + ++L G
Sbjct: 228 SSTPFVIKTSAGSSDNGSNPYYMVKLAGIKTGGAPLQAASSSGSTVLLDTVSRASYLADG 287
Query: 267 YNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRG-ADVKLSRSNFFVK 325
L +++ + QPVA P +LC+ PE+ F G A + + +N+ +
Sbjct: 288 AYKALKKALTAAVGVQPVASPPKPYDLCFPKAVAGDAPELVFTFDGGAALTVPPANYLLA 347
Query: 326 VSEDIVCSV--------FKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
VC G I G++ Q N V +D++++T+SFKP DC+
Sbjct: 348 SGNGTVCLTIGSSASLNLTGELEGASILGSLQQENVHVLFDLKEETLSFKPADCS 402
>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
Length = 529
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 98/369 (26%), Positives = 152/369 (41%), Gaps = 88/369 (23%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y + + +GTPP + DTGSDL W QC PC C+ Q+ +DPK S+++K++ C
Sbjct: 160 GEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPC--YDCFHQNEAFYDPKTSASFKNITC 217
Query: 149 SSSQCASLNQKS----CSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQA----VA 198
+ +C+ ++ C N C Y YGD S + G+ A ET T+ TT +
Sbjct: 218 NDPRCSLISSPEPPVQCKSDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGRSSEYK 277
Query: 199 LPGITFGCGTNNGGL----------------FNSKTTGIVG------------------- 223
+ + FGCG N GL F+S+ + G
Sbjct: 278 VENMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSK 337
Query: 224 -LGGGDISLIS-----------------------QMRTTIAGNQRLGVS------TPD-- 251
+ G D L++ Q+++ + G + L + +PD
Sbjct: 338 LIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGEALDIPEETWNISPDGA 397
Query: 252 --IVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ----VP 304
+IDSGTTL++ + Y M E V L+ C++ + + + +P
Sbjct: 398 GGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYLVFRDFPVLDPCFNVSGIEENNIHLP 457
Query: 305 EVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSV-PIYGNIMQTNFLVGYDIEQQ 362
E+ I F GA N F+ +SED+VC G S I GN Q NF + YD +
Sbjct: 458 ELGIAFADGAVWNFPAENSFIWLSEDLVCLAILGTPKSTFSIIGNYQQQNFHILYDTKMS 517
Query: 363 TVSFKPTDC 371
+ F PT C
Sbjct: 518 RLGFTPTKC 526
>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
Length = 478
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 101/363 (27%), Positives = 156/363 (42%), Gaps = 87/363 (23%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS-----PLFDPKMSSTYKS 145
Y +I +G+PP E DTGSD++W C PCP +C ++ L+D K SST K+
Sbjct: 74 YFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCP--KCPVKTDLGIPLSLYDSKTSSTSKN 131
Query: 146 LPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP--- 200
+ C C+ + Q G C Y V YGDGS S+G+ + +TL TG P
Sbjct: 132 VGCEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQ 191
Query: 201 GITFGCGTNNGGLF---NSKTTGIVG----------------------------LGGGDI 229
+ FGCG N G +S GI+G + GG I
Sbjct: 192 EVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNGGGI 251
Query: 230 SLISQMR------TTIAGNQ---------------------RLGVSTPD--IVIDSGTTL 260
+ ++ T I NQ L + D +IDSGTTL
Sbjct: 252 FAVGEVESPVVKTTPIVPNQVHYNVILKGMDVDGDPIDLPPSLASTNGDGGTIIDSGTTL 311
Query: 261 TFLPQG-YNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLSQVPEVTIHFRGADVKL 317
+LPQ YNS + + + + T + C+SF N+ P V +HF + +KL
Sbjct: 312 AYLPQNLYNSLIEKITAKQQVKLHMVQETFA---CFSFTSNTDKAFPVVNLHFEDS-LKL 367
Query: 318 SR--SNFFVKVSEDIVCSVFK--GITN----SVPIYGNIMQTNFLVGYDIEQQTVSFKPT 369
S ++ + ED+ C ++ G+T V + G+++ +N LV YD+E + + +
Sbjct: 368 SVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEVIGWADH 427
Query: 370 DCT 372
+C+
Sbjct: 428 NCS 430
>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 482
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 101/363 (27%), Positives = 156/363 (42%), Gaps = 87/363 (23%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS-----PLFDPKMSSTYKS 145
Y +I +G+PP E DTGSD++W C PCP +C ++ L+D K SST K+
Sbjct: 78 YFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCP--KCPVKTDLGIPLSLYDSKTSSTSKN 135
Query: 146 LPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP--- 200
+ C C+ + Q G C Y V YGDGS S+G+ + +TL TG P
Sbjct: 136 VGCEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQ 195
Query: 201 GITFGCGTNNGGLF---NSKTTGIVG----------------------------LGGGDI 229
+ FGCG N G +S GI+G + GG I
Sbjct: 196 EVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNGGGI 255
Query: 230 SLISQMR------TTIAGNQ---------------------RLGVSTPD--IVIDSGTTL 260
+ ++ T I NQ L + D +IDSGTTL
Sbjct: 256 FAVGEVESPVVKTTPIVPNQVHYNVILKGMDVDGDPIDLPPSLASTNGDGGTIIDSGTTL 315
Query: 261 TFLPQG-YNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLSQVPEVTIHFRGADVKL 317
+LPQ YNS + + + + T + C+SF N+ P V +HF + +KL
Sbjct: 316 AYLPQNLYNSLIEKITAKQQVKLHMVQETFA---CFSFTSNTDKAFPVVNLHFEDS-LKL 371
Query: 318 SR--SNFFVKVSEDIVCSVFK--GITN----SVPIYGNIMQTNFLVGYDIEQQTVSFKPT 369
S ++ + ED+ C ++ G+T V + G+++ +N LV YD+E + + +
Sbjct: 372 SVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEVIGWADH 431
Query: 370 DCT 372
+C+
Sbjct: 432 NCS 434
>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
Length = 493
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 70/199 (35%), Positives = 101/199 (50%), Gaps = 27/199 (13%)
Query: 32 ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ----------NSSISSSKASQ 81
++HRD+ + ++ T + LR L R R ++ N + S A
Sbjct: 72 RVVHRDA-----FAANATAAELLRHRLQRDKRRAARISKAAAGGGAGAANGTRSRGGAVA 126
Query: 82 ADIIPNNA----NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
A ++ A Y +I +GTP T L V DTGSD++W QC PC +CY Q P+FDP
Sbjct: 127 APVVSGLAQGSGEYFTKIGVGTPSTPALMVLDTGSDVVWLQCAPC--RRCYDQSGPVFDP 184
Query: 138 KMSSTYKSLPCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
+ SS+Y ++ C++ C L+ C C Y V+YGDGS + G+ ATET+T G
Sbjct: 185 RRSSSYGAVDCAAPLCRRLDSGGCDLRRRACLYQVAYGDGSVTAGDFATETLTFAG--GA 242
Query: 196 AVALPGITFGCGTNNGGLF 214
VA + GCG +N GLF
Sbjct: 243 RVAR--VALGCGHDNEGLF 259
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 40/125 (32%), Positives = 59/125 (47%), Gaps = 5/125 (4%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSF--NSLSQVPEVTI 308
+++DSGT++T L + S L + ++ SL + CY + +VP V++
Sbjct: 369 VIVDSGTSVTRLARPSYSALRDAFRAAAAGLRLSPGGFSLFDTCYDLGGRKVVKVPTVSM 428
Query: 309 HFRG-ADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
HF G A+ L N+ + V S C F G V I GNI Q F V +D + Q V F
Sbjct: 429 HFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVGF 488
Query: 367 KPTDC 371
P C
Sbjct: 489 APKGC 493
>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 81/237 (34%), Positives = 117/237 (49%), Gaps = 28/237 (11%)
Query: 23 EAQTGGFSVELIHRDSPKSPFYNSSETPYQRLR-DA-----LTRSLNRLNHFNQNSSISS 76
A++G +EL H S S + +E + L DA L R + + + S+
Sbjct: 35 RAESGATVLELRHHASFSSGGKSRAEEAHAVLASDAARVSSLQRRIGSYGLIRSSDAASA 94
Query: 77 SKASQADIIP----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
SK +Q + NY+ + IG E + DT S+L W QCEPC C+ Q
Sbjct: 95 SKLAQVPVTSGARLRTLNYVATVGIGG--GEATVIVDTASELTWVQCEPC--DACHDQQE 150
Query: 133 PLFDPKMSSTYKSLPCSSSQCASL------NQKSCSG--VNCQYSVSYGDGSFSNGNLAT 184
PLFDP S +Y ++PC+SS C +L + ++C C Y++SY DGS+S G LA
Sbjct: 151 PLFDPSSSPSYAAVPCNSSSCDALRVATGMSGQACDDQPAACSYTLSYRDGSYSRGVLAH 210
Query: 185 ETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
+ ++L Q G FGCGT+N G F T+G++GLG +SLISQ G
Sbjct: 211 DRLSLAGEDIQ-----GFVFGCGTSNQGPFGG-TSGLMGLGRSQLSLISQTMDQFGG 261
Score = 43.1 bits (100), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 36/127 (28%), Positives = 55/127 (43%), Gaps = 9/127 (7%)
Query: 253 VIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
++DSGT +T +P Y + +S + E P A P L+ C+ L QVP + +
Sbjct: 336 IVDSGTIITSLVPSVYAAVRAEFVSQLAE-YPQAAPFSILDTCFDLTGLREVQVPSLKLV 394
Query: 310 FR-GADVKLSRSNFFVKVSEDI--VCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTV 364
F GA+V++ V+ D VC + + PI GN Q N V +D +
Sbjct: 395 FDGGAEVEVDSKGVLYVVTGDASQVCLALASLKSEYDTPIIGNYQQKNLRVIFDTVGSQI 454
Query: 365 SFKPTDC 371
F C
Sbjct: 455 GFAQETC 461
>gi|367066697|gb|AEX12632.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066699|gb|AEX12633.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066701|gb|AEX12634.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066703|gb|AEX12635.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066705|gb|AEX12636.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066707|gb|AEX12637.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066709|gb|AEX12638.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066711|gb|AEX12639.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066713|gb|AEX12640.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066715|gb|AEX12641.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066717|gb|AEX12642.1| hypothetical protein 2_5918_01 [Pinus taeda]
Length = 137
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 59/135 (43%), Positives = 81/135 (60%), Gaps = 7/135 (5%)
Query: 78 KASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
K QA + N +L++++IG P A+ DTGSDL WTQC PC S CY Q +P++DP
Sbjct: 8 KDVQAPVSAGNGEFLMQLAIGKPSLAYSAILDTGSDLTWTQCMPC--SDCYKQPTPIYDP 65
Query: 138 KMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
+SSTY ++ C SS C +L +C C+Y +YGD S + G L+ ET TL S +
Sbjct: 66 SLSSTYGTVSCKSSLCLALPASACISATCEYLYTYGDYSSTQGILSYETFTLSSQS---- 121
Query: 198 ALPGITFGCGTNNGG 212
+P I FGCG +N G
Sbjct: 122 -IPHIAFGCGQDNEG 135
>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 496
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 65/153 (42%), Positives = 86/153 (56%), Gaps = 18/153 (11%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY++ + IG + + DTGSDL W QC PC CY Q PLF+P SS++ SLPC+
Sbjct: 144 NYIVTVGIGGQNST--LIVDTGSDLTWVQCLPC--RLCYNQQEPLFNPSNSSSFLSLPCN 199
Query: 150 SSQCASLNQKS-----CSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
S C +L + CS N C Y + YGDGS+S G L E +TLG T +
Sbjct: 200 SPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKT-----EIDN 254
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
FGCG NN GLF +G++GL ++SL+SQ
Sbjct: 255 FIFGCGRNNKGLFGG-ASGLMGLARSELSLVSQ 286
>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
Length = 520
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 97/365 (26%), Positives = 156/365 (42%), Gaps = 86/365 (23%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y + + +G+PP + DTGSDL W QC PC C+ Q+ +DPK S++YK++ C+
Sbjct: 155 YFMDVLVGSPPKHFSLILDTGSDLNWIQCLPC--HDCFQQNGAFYDPKASASYKNITCND 212
Query: 151 SQCASLNQ----KSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTG----QAVALP 200
+C ++ K C N C Y YGD S + G+ A ET T+ TT + +
Sbjct: 213 PRCNLVSPPDPPKPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTSGGSSELYNVE 272
Query: 201 GITFGCGTNNGGL----------------FNSKTTGIVG--------------------L 224
+ FGCG N GL F+S+ + G +
Sbjct: 273 NMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLI 332
Query: 225 GGGDISLIS-----------------------QMRTTIAGNQRLGV--STPDI------- 252
G D L+S Q+++ I + L + T +I
Sbjct: 333 FGEDKDLLSHPNLNFTSFVARKENLVDTFYYVQIKSIIVAGEVLNIPEETWNISSDGAGG 392
Query: 253 -VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ-PVADPTGSLELCYSFNSLS--QVPEVTI 308
+IDSGTTL++ + + + ++ + + PV L+ C++ + + Q+PE+ I
Sbjct: 393 TIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGIDSIQLPELGI 452
Query: 309 HF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSV-PIYGNIMQTNFLVGYDIEQQTVSF 366
F GA N F+ ++ED+VC G S I GN Q NF + YD ++ + +
Sbjct: 453 AFADGAVWNFPTENSFIWLNEDLVCLAILGTPKSAFSIIGNYQQQNFHILYDTKRSRLGY 512
Query: 367 KPTDC 371
PT C
Sbjct: 513 APTKC 517
>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
Length = 507
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 73/226 (32%), Positives = 113/226 (50%), Gaps = 26/226 (11%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS-------ISSSKASQ 81
V L+HRDS + +E +RL+ R+ ++ N + +S+ +
Sbjct: 70 MHVRLLHRDS-FAVNATGAELLARRLQRDELRAAWIISTAAANGTPPPDVVGLSTGRGLV 128
Query: 82 ADII---PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
A ++ P + +Y+ +I++GTP E L DT SDL W QC+PC +CY Q P+FDP+
Sbjct: 129 APVVSRAPTSGDYIAKIAVGTPAVEALLALDTASDLTWLQCQPC--RRCYPQSGPVFDPR 186
Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSVSYGDG------SFSNGNLATETVTL 189
S++Y + + C +L + C Y+V YGDG S S G+L ET+T
Sbjct: 187 HSTSYGEMNYDAPDCQALGRSGGGDAKRGTCIYTVLYGDGDGHGSTSTSVGDLVEETLTF 246
Query: 190 GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
QA ++ GCG +N GLF + GI+GL G IS+ Q+
Sbjct: 247 AGGVRQAY----LSIGCGHDNKGLFGAPAAGILGLSRGQISIPHQI 288
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 43/132 (32%), Positives = 67/132 (50%), Gaps = 12/132 (9%)
Query: 252 IVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVA--DPTGSLELCYSFNSLS------Q 302
+++DSGTT+T L + Y + + ++ V+ P+G + CY+ + +
Sbjct: 376 VILDSGTTVTRLARPAYTAFRDAFRAAATGLGQVSTGGPSGLFDTCYTVGGRAGLRHCVK 435
Query: 303 VPEVTIHFRGA-DVKLSRSNFFVKV-SEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDI 359
VP V++HF G ++ L N+ + V S VC F G + SV + GNI+Q F V YDI
Sbjct: 436 VPAVSMHFAGGVELSLQPKNYLITVDSRGTVCFAFAGTGDRSVSVIGNILQQGFRVVYDI 495
Query: 360 EQQTVSFKPTDC 371
Q V F P C
Sbjct: 496 GGQRVGFAPNSC 507
>gi|21717162|gb|AAM76355.1|AC074196_13 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433286|gb|AAP54824.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 397
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 96/353 (27%), Positives = 149/353 (42%), Gaps = 79/353 (22%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
N AN+ +IGTPP A D +L+WTQC C C+ QD P+F P SST+K
Sbjct: 54 NVANF----TIGTPPQAASAFIDLTGELVWTQCSQC--IHCFKQDLPVFVPNASSTFKPE 107
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
PC + C S+ C+ C Y G G + G +AT+T +G+ A + FGC
Sbjct: 108 PCGTDVCKSIPTPKCASDVCAYDGVTGLGGHTVGIVATDTFAIGTA-----APASLGFGC 162
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTT----------IAGNQRL----------- 245
+ +G +GLG SL++QM+ T N RL
Sbjct: 163 VVASDIDTMGGPSGFIGLGRTPWSLVAQMKLTRFSYCLAPHDTGKNSRLFLGASAKLAGG 222
Query: 246 GVSTPDIV-----------------IDSGTTLTFLPQGYNSNLL---------------- 272
G TP + I +G +P+G N+ L+
Sbjct: 223 GAWTPFVKTSPNDGMSQYYPIELEEIKAGDATITMPRGRNTVLVQTAVVRVSLLVDSVYQ 282
Query: 273 ----SVMSSMIEAQPVADPTGS-LELCYSFNSLSQVPEVTIHFR-GADVKLSRSNFFVKV 326
+VM+S + A P A P G+ E+C+ +S P++ F+ GA + + +N+ V
Sbjct: 283 EFKKAVMAS-VGAAPTATPVGAPFEVCFPKAGVSGAPDLVFTFQAGAALTVPPANYLFDV 341
Query: 327 SEDIVCSVFKGIT-------NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
D VC I + + I G+ Q N + +D+++ +SF+P DC+
Sbjct: 342 GNDTVCLSVMSIALLNITALDGLNILGSFQQENVHLLFDLDKDMLSFEPADCS 394
>gi|367066719|gb|AEX12643.1| hypothetical protein 2_5918_01 [Pinus radiata]
Length = 137
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 59/135 (43%), Positives = 81/135 (60%), Gaps = 7/135 (5%)
Query: 78 KASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
K QA + N +L++++IG P A+ DTGSDL WTQC PC S CY Q +P++DP
Sbjct: 8 KDVQAPVSAGNGEFLMQLAIGKPSLAYSAILDTGSDLTWTQCIPC--SDCYKQPTPIYDP 65
Query: 138 KMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
+SSTY ++ C SS C +L +C C+Y +YGD S + G L+ ET TL S +
Sbjct: 66 SLSSTYGTVSCKSSLCLALPASACISATCEYLYTYGDYSSTQGILSYETFTLSSQS---- 121
Query: 198 ALPGITFGCGTNNGG 212
+P I FGCG +N G
Sbjct: 122 -IPHIAFGCGQDNEG 135
>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 484
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 69/163 (42%), Positives = 93/163 (57%), Gaps = 21/163 (12%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY++ + +G + DTGSDL W QC+PC CY Q PL+DP +SS+YK++ C+
Sbjct: 134 NYIVTVELGGKNMS--LIVDTGSDLTWVQCQPC--RSCYNQQGPLYDPSVSSSYKTVFCN 189
Query: 150 SSQCASL-----NQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
SS C L N C G N C+Y VSYGDGS++ G+LA+E++ LG T
Sbjct: 190 SSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDT-----K 244
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
L FGCG NN GLF + ++GLG +SL+SQ T G
Sbjct: 245 LENFVFGCGRNNKGLFGGSSG-LMGLGRSSVSLVSQTLKTFNG 286
Score = 43.9 bits (102), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 35/127 (27%), Positives = 55/127 (43%), Gaps = 7/127 (5%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
I+IDSGT +T LP + P A L+ C++ S + P + +
Sbjct: 356 ILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMI 415
Query: 310 FRG---ADVKLSRSNFFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTV 364
F+G +V ++ +FVK +VC ++ N V I GN Q N V YD Q+ +
Sbjct: 416 FQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERL 475
Query: 365 SFKPTDC 371
+C
Sbjct: 476 GIVGENC 482
>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 80/241 (33%), Positives = 116/241 (48%), Gaps = 31/241 (12%)
Query: 12 FFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQN 71
F C +++ A++ +L H DS + T ++ LR + RS RL
Sbjct: 19 LFPCVLLLTFSLAESAALRADLTHVDSGRG------FTKHELLRRMVARSKARL------ 66
Query: 72 SSISSSKASQADIIP--------NNANYLIRISIGTPPTERLAVA-DTGSDLIWTQCEPC 122
+S+ SS A P ++ YLI + IGTP +R+ + DTGSDL+WTQC C
Sbjct: 67 ASLRSSACDTALTAPVDHGGSDVGSSEYLIHLGIGTPRPQRVVLHLDTGSDLVWTQCA-C 125
Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV-----NCQYSVSYGDGSF 177
+ C+ Q P+F +S T+ +PCS C SG +C Y+ Y D S
Sbjct: 126 --TVCFDQPVPVFRASVSHTFSRVPCSDPLCGHAVYLPLSGCAARDRSCFYAYGYMDHSI 183
Query: 178 SNGNLATETVTLGS--TTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
+ G +A +T T + A A+P I FGCG N GLF +GI G G G +SL SQ+
Sbjct: 184 TTGKMAEDTFTFKAPDRADTAAAVPNIRFGCGMMNYGLFTPNQSGIAGFGTGPLSLPSQL 243
Query: 236 R 236
+
Sbjct: 244 K 244
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 42/133 (31%), Positives = 61/133 (45%), Gaps = 15/133 (11%)
Query: 254 IDSGTTLTFLPQGYNSNLLSVMSSMIE---AQPVADPTGSLELCYSFNSLSQ---VPEVT 307
IDSGT +TF PQ +L + + A+ DP LC+S + + VP++
Sbjct: 333 IDSGTAITFFPQAVFRSLREAFVAQVPLPVAKGYTDPDN--LLCFSVPAKKKAPAVPKLI 390
Query: 308 IHFRGADVKLSRSNFFVKVSED------IVCSVFKGITNS-VPIYGNIMQTNFLVGYDIE 360
+H GAD +L R N+ + +D +C V NS I GN Q N + YD+E
Sbjct: 391 LHLEGADWELPRENYVLDNDDDGSGAGRKLCVVILSAGNSNGTIIGNFQQQNMHIVYDLE 450
Query: 361 QQTVSFKPTDCTK 373
+ F P C K
Sbjct: 451 SNKMVFAPARCDK 463
>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 484
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 69/163 (42%), Positives = 93/163 (57%), Gaps = 21/163 (12%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY++ + +G + DTGSDL W QC+PC CY Q PL+DP +SS+YK++ C+
Sbjct: 134 NYIVTVELGGKNMS--LIVDTGSDLTWVQCQPC--RSCYNQQGPLYDPSVSSSYKTVFCN 189
Query: 150 SSQCASL-----NQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
SS C L N C G N C+Y VSYGDGS++ G+LA+E++ LG T
Sbjct: 190 SSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDT-----K 244
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
L FGCG NN GLF + ++GLG +SL+SQ T G
Sbjct: 245 LENFVFGCGRNNKGLFGGSSG-LMGLGRSSVSLVSQTLKTFNG 286
Score = 43.1 bits (100), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 35/127 (27%), Positives = 55/127 (43%), Gaps = 7/127 (5%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
I+IDSGT +T LP + P A L+ C++ S + P + +
Sbjct: 356 ILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMI 415
Query: 310 FRG---ADVKLSRSNFFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTV 364
F+G +V ++ +FVK +VC ++ N V I GN Q N V YD Q+ +
Sbjct: 416 FQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDSTQERL 475
Query: 365 SFKPTDC 371
+C
Sbjct: 476 GIVGENC 482
>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 417
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 65/153 (42%), Positives = 86/153 (56%), Gaps = 18/153 (11%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY++ + IG + + DTGSDL W QC PC CY Q PLF+P SS++ SLPC+
Sbjct: 65 NYIVTVGIGGQNST--LIVDTGSDLTWVQCLPC--RLCYNQQEPLFNPSNSSSFLSLPCN 120
Query: 150 SSQCASLNQKS-----CSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
S C +L + CS N C Y + YGDGS+S G L E +TLG T +
Sbjct: 121 SPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKT-----EIDN 175
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
FGCG NN GLF +G++GL ++SL+SQ
Sbjct: 176 FIFGCGRNNKGLFGG-ASGLMGLARSELSLVSQ 207
>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
Length = 436
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 69/163 (42%), Positives = 93/163 (57%), Gaps = 21/163 (12%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY++ + +G + DTGSDL W QC+PC CY Q PL+DP +SS+YK++ C+
Sbjct: 86 NYIVTVELGGKNMS--LIVDTGSDLTWVQCQPC--RSCYNQQGPLYDPSVSSSYKTVFCN 141
Query: 150 SSQCASL-----NQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
SS C L N C G N C+Y VSYGDGS++ G+LA+E++ LG T
Sbjct: 142 SSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDT-----K 196
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
L FGCG NN GLF + ++GLG +SL+SQ T G
Sbjct: 197 LENFVFGCGRNNKGLFGGSSG-LMGLGRSSVSLVSQTLKTFNG 238
Score = 43.9 bits (102), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 35/127 (27%), Positives = 55/127 (43%), Gaps = 7/127 (5%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
I+IDSGT +T LP + P A L+ C++ S + P + +
Sbjct: 308 ILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMI 367
Query: 310 FRG---ADVKLSRSNFFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTV 364
F+G +V ++ +FVK +VC ++ N V I GN Q N V YD Q+ +
Sbjct: 368 FQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERL 427
Query: 365 SFKPTDC 371
+C
Sbjct: 428 GIVGENC 434
>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
Length = 390
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 60/168 (35%), Positives = 88/168 (52%), Gaps = 4/168 (2%)
Query: 48 ETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAV 107
E RLR R + + + S+ + + + + Y R+ IG+P
Sbjct: 2 ERDEARLRWIHHRIQSSDHRHRRGRSLLQTAQVSSGLSLGSGEYFARMGIGSPQRSYYLE 61
Query: 108 ADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQ 167
DTGSD+ W QC PC S CY Q P++DP SS+Y+ + C S+ C +L+ +C G+ C
Sbjct: 62 LDTGSDVTWIQCAPC--SSCYSQVDPIYDPSNSSSYRRVYCGSALCQALDYSACQGMGCS 119
Query: 168 YSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFN 215
Y V YGD S S+G+L E+ LG + + A+ I FGCG +N GLF
Sbjct: 120 YRVVYGDSSASSGDLGIESFYLGPNS--STAMRNIAFGCGHSNSGLFR 165
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 32/130 (24%), Positives = 59/130 (45%), Gaps = 4/130 (3%)
Query: 246 GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QV 303
G T ++DSGT++T + + L + P A L+ C++F L Q+
Sbjct: 261 GNGTGGAILDSGTSVTRVVPAAYAVLRDAYRAASRNLPPAPGVYLLDTCFNFQGLPTVQI 320
Query: 304 PEVTIHF-RGADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQ 361
P + +HF D+ L N + V C F + + + GN+ Q F +G+D+++
Sbjct: 321 PSLVLHFDNDVDMVLPGGNILIPVDRSGTFCLAFAPSSMPISVIGNVQQQTFRIGFDLQR 380
Query: 362 QTVSFKPTDC 371
++ P +C
Sbjct: 381 SLIAIAPREC 390
>gi|357127503|ref|XP_003565419.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 486
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 81/236 (34%), Positives = 113/236 (47%), Gaps = 42/236 (17%)
Query: 16 FYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSIS 75
+V + A+ GFSVE IHRDS KSPF++ + TP+ R A RS R + +
Sbjct: 27 LFVSPAVGAEEDGFSVEFIHRDSVKSPFHDPALTPHGRALAAARRSAARAAELHHLLARR 86
Query: 76 SSKASQ--------ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE------- 120
SS A A+++ YL+ I +GTPP LA+ADTGSDL+W +C+
Sbjct: 87 SSGAPSPGTGAGVVAEVVSRQFEYLMAIEVGTPPVRVLAIADTGSDLVWVKCKGKDNDNN 146
Query: 121 -PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCS-GVNCQYSVSYGDGSF 177
PPS F P SSTY + C + C +L+ SCS +C+Y SYGDGS
Sbjct: 147 STAPPSV-------YFVPSASSTYGRVGCDTKACRALSSAASCSPDGSCEYLYSYGDGSR 199
Query: 178 SNGNLATETVTLGSTTGQA-----------------VALPGITFGCGTNNGGLFNS 216
++G L+TET T + + V + + FGC T G F +
Sbjct: 200 ASGQLSTETFTFSTIADSSKTNSHGNNNNNSSSHGQVEIAKLDFGCSTTTTGTFRA 255
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 50/144 (34%), Positives = 74/144 (51%), Gaps = 9/144 (6%)
Query: 239 IAGNQR-LGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF 297
+AG +R + I++DSGTTLT+L + L+ ++ I+ P L+LCY
Sbjct: 334 VAGTKRPTTAAQAHIIVDSGTTLTYLDSALLTPLVKDLTRRIKLPRAESPEKILDLCYDI 393
Query: 298 NSLSQ-----VPEVTIHFRGA-DVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIM 349
+ + +P+VT+ G +V L N FV V E ++C + SV I GNI
Sbjct: 394 SGVRGEDALGIPDVTLVLGGGGEVTLKPDNTFVVVQEGVLCLALVATSERQSVSILGNIA 453
Query: 350 QTNFLVGYDIEQQTVSFKPTDCTK 373
Q N VGYD+E+ TV+F DC K
Sbjct: 454 QQNLHVGYDLEKGTVTFAAADCAK 477
>gi|125595861|gb|EAZ35641.1| hypothetical protein OsJ_19928 [Oryza sativa Japonica Group]
Length = 629
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 57/144 (39%), Positives = 82/144 (56%), Gaps = 8/144 (5%)
Query: 98 GTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN 157
GT + + D+GSD+ W QC+PCP C+ Q PLFDP MS+TY ++PC+S+ CA L
Sbjct: 71 GTSAVTQTVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLG 130
Query: 158 --QKSCSG-VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG-TNNGGL 213
++ CS CQ+ ++YGDGS + G + + +TLG + G FGC + G
Sbjct: 131 PYRRGCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYD----VIRGFRFGCAHADRGSA 186
Query: 214 FNSKTTGIVGLGGGDISLISQMRT 237
F+ G + LGGG SL+ Q T
Sbjct: 187 FDYDVAGSLALGGGSQSLVQQTAT 210
>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 414
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 77/202 (38%), Positives = 108/202 (53%), Gaps = 26/202 (12%)
Query: 54 LRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNA------NYLIRISIGTPPTERLAV 107
L D RS+ N + +S + +ASQ I ++ NY++ + +G+ +
Sbjct: 24 LDDLRVRSMQ--NRIRRVASTHNVEASQTQIPLSSGINLQTLNYIVTMGLGSK--NMTVI 79
Query: 108 ADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL-----NQKSCS 162
DTGSDL W QCEPC CY Q P+F P SS+Y+S+ C+SS C SL N +C
Sbjct: 80 IDTGSDLTWVQCEPC--MSCYNQQGPIFKPSTSSSYQSVSCNSSTCQSLQFATGNTGACG 137
Query: 163 GVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTT 219
N C Y V+YGDGS++NG L E ++ G V++ FGCG NN GLF +
Sbjct: 138 SSNPSTCNYVVNYGDGSYTNGELGVEALSFG-----GVSVSDFVFGCGRNNKGLFGG-VS 191
Query: 220 GIVGLGGGDISLISQMRTTIAG 241
G++GLG +SL+SQ T G
Sbjct: 192 GLMGLGRSYLSLVSQTNATFGG 213
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 37/128 (28%), Positives = 56/128 (43%), Gaps = 7/128 (5%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
I+IDSGT +T LP L + P A L+ C++ +V P +++
Sbjct: 285 ILIDSGTVITRLPSSVYKALKAEFLKKFTGFPSAPGFSILDTCFNLTGYDEVSIPTISLR 344
Query: 310 FRG-ADVKLSRSNFFVKVSEDI--VCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTV 364
F G A + + + F V ED VC ++++ I GN Q N V YD +Q V
Sbjct: 345 FEGNAQLNVDATGTFYVVKEDASQVCLALASLSDAYDTAIIGNYQQRNQRVIYDTKQSKV 404
Query: 365 SFKPTDCT 372
F C+
Sbjct: 405 GFAEEPCS 412
>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 527
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 96/369 (26%), Positives = 153/369 (41%), Gaps = 88/369 (23%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y + + +GTPP + DTGSDL W QC PC C+ Q+ +DPK S+++K++ C
Sbjct: 158 GEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPC--YDCFHQNGMFYDPKTSASFKNITC 215
Query: 149 SSSQCASLN------QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA----VA 198
+ +C+ ++ Q +C Y YGD S + G+ A ET T+ TT +
Sbjct: 216 NDPRCSLISSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGGSSEYK 275
Query: 199 LPGITFGCGTNNGGL----------------FNSKTTGIVG------------------- 223
+ + FGCG N GL F+S+ + G
Sbjct: 276 VGNMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSNTNVSSK 335
Query: 224 -LGGGDISLIS-----------------------QMRTTIAGNQRLGV--STPDI----- 252
+ G D L++ Q+++ + G + L + T +I
Sbjct: 336 LIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGKALDIPEETWNISSDGD 395
Query: 253 ---VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ----VP 304
+IDSGTTL++ + Y M E P+ L+ C++ + + + +P
Sbjct: 396 GGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDFPVLDPCFNVSGIEENNIHLP 455
Query: 305 EVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSV-PIYGNIMQTNFLVGYDIEQQ 362
E+ I F G N F+ +SED+VC G S I GN Q NF + YD ++
Sbjct: 456 ELGIAFVDGTVWNFPAENSFIWLSEDLVCLAILGTPKSTFSIIGNYQQQNFHILYDTKRS 515
Query: 363 TVSFKPTDC 371
+ F PT C
Sbjct: 516 RLGFTPTKC 524
>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 477
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 102/382 (26%), Positives = 157/382 (41%), Gaps = 106/382 (27%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS-PLFDPKMSSTYKSLPC 148
YL+ +S+GTPP DTGSDL+WTQC PC C+ Q + P+ DP SST+ ++ C
Sbjct: 93 EYLVHLSVGTPPRPVALTLDTGSDLVWTQCAPC--LNCFDQGAIPVLDPAASSTHAAVRC 150
Query: 149 SSSQCASLNQKSCSG-------VNCQYSVSYGDGSFSNGNLATETVTLG---STTGQAVA 198
+ C +L SC +C Y YGD S + G LA++ T G + G V+
Sbjct: 151 DAPVCRALPFTSCGRGGSSWGERSCVYVYHYGDKSITVGKLASDRFTFGPGDNADGGGVS 210
Query: 199 LPGITFGCGTNNGG----------------------------------LFNSKTTGI-VG 223
+TFGCG N G +F S ++ + +G
Sbjct: 211 ERRLTFGCGHFNKGIFQANETGIAGFGRGRWSLPSQLGVTSFSYCFTSMFESTSSLVTLG 270
Query: 224 LGGGDISLISQMRTT-------------------IAGNQRLGV-------STPDIVIDSG 257
+ ++ L Q+++T G R+ + +IDSG
Sbjct: 271 VAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKAITVGATRIPIPERRQRLREASAIIDSG 330
Query: 258 TTLTFLPQGYNSNLLSVMSSMIEAQ---PVADPTGS-LELCYSFNSLS------------ 301
++T LP+ ++ + + AQ PV+ GS L+LC++ S +
Sbjct: 331 ASITTLPE----DVYEAVKAEFVAQVGLPVSAVEGSALDLCFALPSAAAPKSAFGWRWRG 386
Query: 302 -------QVPEVTIHFRG-ADVKLSRSNF-FVKVSEDIVCSVFKGIT---NSVPIYGNIM 349
+VP + H G AD +L R N+ F ++C V T + + GN
Sbjct: 387 RGRAMPVRVPRLVFHLGGGADWELPRENYVFEDYGARVMCLVLDAATGGGDQTVVIGNYQ 446
Query: 350 QTNFLVGYDIEQQTVSFKPTDC 371
Q N V YD+E +SF P C
Sbjct: 447 QQNTHVVYDLENDVLSFAPARC 468
>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 446
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 112/417 (26%), Positives = 169/417 (40%), Gaps = 90/417 (21%)
Query: 32 ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ--NSSISSSKASQADIIPN-- 87
+LIH S P Y +ET R+ + S RL + S+ + A + P+
Sbjct: 38 KLIHPGSVHHPHYKPNETAKDRMELDIEHSAARLAYIQARIEGSLVYNNDYTASVSPSLT 97
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL- 146
L+ +SIG P +L V DTGSD++W C PC + C LFDP MSST+ L
Sbjct: 98 GRTILVNLSIGQPSIPQLVVMDTGSDILWIMCNPC--TNCDNHLGLLFDPSMSSTFSPLC 155
Query: 147 --PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
PC C C + +++SY D S ++G + + +T + +
Sbjct: 156 KTPCGFKGC------KCDPI--PFTISYVDNSSASGTFGRDILVFETTDEGTSQISDVII 207
Query: 205 GCGTNNGGLFNSK--TTGIVGLGGGDISLISQ---------------------------- 234
GCG N G FNS GI+GL G SL +Q
Sbjct: 208 GCGHNIG--FNSDPGYNGILGLNNGPNSLATQIGRKFSYCIGNLADPYYNYNQLRLGEGA 265
Query: 235 -------------------MRTTIAGNQRLGVS----------TPDIVIDSGTTLTFLPQ 265
M G +RL ++ T +++DSGTT+T+L
Sbjct: 266 DLEGYSTPFEVYHGFYYVTMEGISVGEKRLDIALETFEMKRNGTGGVILDSGTTITYLVD 325
Query: 266 GYNSNLLSVMSSMIEA--QPVADPTGSLELCYS---FNSLSQVPEVTIHF-RGADVKLSR 319
+ L + + ++++ + V +LCY L P VT HF GAD+ L
Sbjct: 326 SAHKLLYNEVRNLLKWSFRQVIFENAPWKLCYYGIISRDLVGFPVVTFHFVDGADLALDT 385
Query: 320 SNFFVKVSEDIVC-----SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+FF + +DI C + T S + G + Q ++ VGYD+ Q V F+ DC
Sbjct: 386 GSFFSQ-RDDIFCMTVSPASILNTTISPSVIGLLAQQSYNVGYDLVNQFVYFQRIDC 441
>gi|218197465|gb|EEC79892.1| hypothetical protein OsI_21411 [Oryza sativa Indica Group]
Length = 720
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 57/144 (39%), Positives = 82/144 (56%), Gaps = 8/144 (5%)
Query: 98 GTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN 157
GT + + D+GSD+ W QC+PCP C+ Q PLFDP MS+TY ++PC+S+ CA L
Sbjct: 162 GTSAVTQTVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLG 221
Query: 158 --QKSCSG-VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG-TNNGGL 213
++ CS CQ+ ++YGDGS + G + + +TLG + G FGC + G
Sbjct: 222 PYRRGCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYD----VIRGFRFGCAHADRGSA 277
Query: 214 FNSKTTGIVGLGGGDISLISQMRT 237
F+ G + LGGG SL+ Q T
Sbjct: 278 FDYDVAGSLALGGGSQSLVQQTAT 301
>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
Length = 357
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 56/131 (42%), Positives = 82/131 (62%), Gaps = 13/131 (9%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y +R+ IG+P + V DTGSD+ W QC PC CY Q+ +FDP+ SS+++ L
Sbjct: 11 SGEYFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPC--KSCYKQNDAVFDPRASSSFRRLS 68
Query: 148 CSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATET--VTLGSTTGQAVALPGIT 203
CS+ QC L+ K+C+ + C Y VSYGDGSF+ G+LA+++ V+ G T+ +
Sbjct: 69 CSTPQCKLLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSFLVSRGRTS-------PVV 121
Query: 204 FGCGTNNGGLF 214
FGCG +N GLF
Sbjct: 122 FGCGHDNEGLF 132
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 41/124 (33%), Positives = 60/124 (48%), Gaps = 4/124 (3%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
++IDSGT++T LP + + S + P A + CY F++L+ V P V+ H
Sbjct: 234 VIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSLFDTCYDFSALTSVTIPTVSFH 293
Query: 310 FR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
F GA V+L SN+ V V + C F + + I GNI Q V D++ V F
Sbjct: 294 FEGGASVQLPPSNYLVPVDTSGTFCFAFSKTSLDLSIIGNIQQQTMRVAIDLDSSRVGFA 353
Query: 368 PTDC 371
P C
Sbjct: 354 PRQC 357
>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 500
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 52/130 (40%), Positives = 79/130 (60%), Gaps = 8/130 (6%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ Y R+ +G P + V DTGSD+ W QC+PC + CY Q P++DP +S++Y ++
Sbjct: 159 GSGEYFSRVGVGRPARQLYMVLDTGSDVTWLQCQPC--ADCYAQSDPVYDPSVSTSYATV 216
Query: 147 PCSSSQCASLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
C S +C L+ +C S +C Y V+YGDGS++ G+ ATET+TLG + + +
Sbjct: 217 GCDSPRCRDLDAAACRNSTGSCLYEVAYGDGSYTVGDFATETLTLGDS----APVSNVAI 272
Query: 205 GCGTNNGGLF 214
GCG +N GLF
Sbjct: 273 GCGHDNEGLF 282
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 54/124 (43%), Gaps = 4/124 (3%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
+++DSGT +T L G L ++ P A + CY S QVP V +
Sbjct: 377 VIVDSGTAVTRLQSGAYGALREAFVQGTQSLPRASGVSLFDTCYDLAGRSSVQVPAVALW 436
Query: 310 FRGA-DVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
F G ++KL N+ + V + C F G + V I GN+ Q V +D + TV F
Sbjct: 437 FEGGGELKLPAKNYLIPVDAAGTYCLAFAGTSGPVSIIGNVQQQGVRVSFDTAKNTVGFT 496
Query: 368 PTDC 371
C
Sbjct: 497 ADKC 500
>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
Length = 451
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 104/370 (28%), Positives = 160/370 (43%), Gaps = 89/370 (24%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS--PLFDPKMSSTYK 144
++ + + + IGTPP R + DTGSDLIWTQC+ + + P++DP SST+
Sbjct: 87 SDQGHSLTVGIGTPPQPRKLIVDTGSDLIWTQCKLSSSTAVAARHGSPPVYDPGESSTFA 146
Query: 145 SLPCSSSQC--ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
LPCS C + K+C+ N C Y YG + + G LA+ET T G+ +AV+L
Sbjct: 147 FLPCSDRLCQEGQFSFKNCTSKNRCVYEDVYGSAA-AVGVLASETFTFGAR--RAVSLR- 202
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISL------------------------------ 231
+ FGCG + G TGI+GL +SL
Sbjct: 203 LGFGCGALSAGSLIG-ATGILGLSPESLSLITQLKIQRFSYCLTPFADKKTSPLLFGAMA 261
Query: 232 -ISQMRTTIA-------------------------GNQRLGVST------PD----IVID 255
+S+ +TT G++RL V PD ++D
Sbjct: 262 DLSRHKTTRPIQTTAIVSNPVKTVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVD 321
Query: 256 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPT-GSLELCYSFNSLS--------QVPEV 306
SG+T+ +L + + + ++ PVA+ T ELC+ + QVP +
Sbjct: 322 SGSTVAYLVEAAFEAVKEAVMDVVRL-PVANRTVEDYELCFVLPRRTAAAAMEAVQVPPL 380
Query: 307 TIHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQT 363
+HF G A + L R N+F + ++C T+ V I GN+ Q N V +D++
Sbjct: 381 VLHFDGGAAMVLPRDNYFQEPRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQHHK 440
Query: 364 VSFKPTDCTK 373
SF PT C +
Sbjct: 441 FSFAPTQCDQ 450
>gi|297794561|ref|XP_002865165.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297795163|ref|XP_002865466.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297311000|gb|EFH41424.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297311301|gb|EFH41725.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 134
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 60/138 (43%), Positives = 77/138 (55%), Gaps = 18/138 (13%)
Query: 3 TFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSL 62
+ + C F+ FF +VELIH DSP SP YN T L A RS+
Sbjct: 7 SLVDCDFLFFF----------NDWENLTVELIHSDSPHSPLYNPHHTVSDGLNAAFLRSI 56
Query: 63 NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
+R FN + + Q+ +I N Y + ISIGTPP++ LA+ADTGSDL W QC+PC
Sbjct: 57 SRSRRFNTKTDL------QSGLISNGGEYFMSISIGTPPSKVLAIADTGSDLTWVQCKPC 110
Query: 123 PPSQCYMQDSPLFDPKMS 140
QCY Q+SPLFD K+S
Sbjct: 111 --QQCYKQNSPLFDKKIS 126
>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
Length = 357
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 56/131 (42%), Positives = 82/131 (62%), Gaps = 13/131 (9%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y +R+ IG+P + V DTGSD+ W QC PC CY Q+ +FDP+ SS+++ L
Sbjct: 11 SGEYFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPC--KSCYKQNDAVFDPRASSSFRRLS 68
Query: 148 CSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATE--TVTLGSTTGQAVALPGIT 203
CS+ QC L+ K+C+ + C Y VSYGDGSF+ G+LA++ +V+ G T+ +
Sbjct: 69 CSTPQCKLLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSFSVSRGRTS-------PVV 121
Query: 204 FGCGTNNGGLF 214
FGCG +N GLF
Sbjct: 122 FGCGHDNEGLF 132
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 41/124 (33%), Positives = 60/124 (48%), Gaps = 4/124 (3%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
++IDSGT++T LP + + S + P A + CY F++L+ V P V+ H
Sbjct: 234 VIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSLFDTCYDFSALTSVTIPTVSFH 293
Query: 310 FR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
F GA V+L SN+ V V + C F + + I GNI Q V D++ V F
Sbjct: 294 FEGGASVQLPPSNYLVPVDTSGTFCFAFSKTSLDLSIIGNIQQQTMRVAIDLDSSRVGFA 353
Query: 368 PTDC 371
P C
Sbjct: 354 PRQC 357
>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 71/212 (33%), Positives = 102/212 (48%), Gaps = 29/212 (13%)
Query: 22 IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSK--- 78
+ T SV L H D+ S F ++S +LR L R R+ +++S+ +
Sbjct: 57 VSESTTSLSVHLSHVDALSS-FSDASPVDLFKLR--LQRDSLRVKSITSLAAVSTGRNAT 113
Query: 79 ------------ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
A + + + Y +R+ +GTP T V DTGSD++W QC PC
Sbjct: 114 KRTPRSAGGFSGAVISGLSQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPC--KA 171
Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKS-C---SGVNCQYSVSYGDGSFSNGNL 182
CY Q +FDPK S T+ ++PC S C L+ S C C Y VSYGDGSF+ G+
Sbjct: 172 CYNQSDVIFDPKKSKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDF 231
Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGLF 214
+TET+T + + GCG +N GLF
Sbjct: 232 STETLTF-----HGARVDHVPLGCGHDNEGLF 258
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 55/123 (44%), Gaps = 3/123 (2%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
++IDSGT++T L Q L A + C+ + ++ +VP V H
Sbjct: 364 VIIDSGTSVTRLTQSAYVALRDAFRLGATKLKRAPSYSLFDTCFDLSGMTTVKVPTVVFH 423
Query: 310 FRGADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
F G +V L SN+ + V +E C F G S+ I GNI Q F V YD+ V F
Sbjct: 424 FGGGEVSLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGSRVGFLS 483
Query: 369 TDC 371
C
Sbjct: 484 RAC 486
>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 424
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 63/158 (39%), Positives = 89/158 (56%), Gaps = 8/158 (5%)
Query: 79 ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
S IPN A +L ISIG PP +L + DTGSDL W C PC +CY Q P F P
Sbjct: 66 VSHVTPIPNPAAFLANISIGNPPVPQLLLIDTGSDLTWIHCLPC---KCYPQTIPFFHPS 122
Query: 139 MSSTYKSLPCSSSQCA--SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
SSTY++ C S+ A + + +G NCQY + Y D S + G LA E +T ++
Sbjct: 123 RSSTYRNASCVSAPHAMPQIFRDEKTG-NCQYHLRYRDFSNTRGILAEEKLTFETSDDGL 181
Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
++ I FGCG +N G +K +G++GLG G S++++
Sbjct: 182 ISKQNIVFGCGQDNSGF--TKYSGVLGLGPGTFSIVTR 217
>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 458
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 114/445 (25%), Positives = 173/445 (38%), Gaps = 83/445 (18%)
Query: 9 FILFFLCFYVVSPIEAQTGGFSVELIHRDSPK--SPFYNSSETPYQRLRDALTRSLNRLN 66
+LF Y V + +++LIHR+S +P TP ++ S R
Sbjct: 9 LLLFITVSYFVVTESIKPNRMAMKLIHRESVARLNPNARVPITPEDHIKHLTDISSARFK 68
Query: 67 HFNQNSSISSSKAS--QADI--IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
+ QNS +S Q D+ + +L+ S+G PP +L + DTGS L+W QC+PC
Sbjct: 69 YL-QNSIDKELGSSNFQVDVEQAIKTSLFLVNFSVGQPPVPQLTIMDTGSSLLWIQCQPC 127
Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGN 181
P+F+P +SST+ C C C N C Y Y G+ S G
Sbjct: 128 KHCSSDHMIHPVFNPALSSTFVECSCDDRFCRYAPNGHCGSSNKCVYEQVYISGTGSKGV 187
Query: 182 LATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA- 240
LA E +T + G V I FGCG NG S TGI+GLG SL Q+ + +
Sbjct: 188 LAKERLTFTTPNGNTVVTQPIAFGCGYENGEQLESHFTGILGLGAKPTSLAVQLGSKFSY 247
Query: 241 ----------GNQRLGVSTPDIVIDSGTTLTFLPQG--YNSNL--LSVMSSMIEAQPVA- 285
G +L + ++ T + F + Y NL +SV + + +PV
Sbjct: 248 CIGDLANKNYGYNQLVLGEDADILGDPTPIEFETENSIYYMNLEGISVGDTQLNIEPVVF 307
Query: 286 ------------------------------------DPTGSLE-------LCYS---FNS 299
DP LE LCY
Sbjct: 308 KRRGPRTGVILDSGTLYTWLADIAYRELYNEIKSILDP--KLERFWFRDFLCYHGRVSEE 365
Query: 300 LSQVPEVTIHFR-GADVKLSRSNFFVKVSE----DIVCSVFK------GITNSVPIYGNI 348
L P VT HF GA++ + ++ F +SE ++ C K G G +
Sbjct: 366 LIGFPVVTFHFAGGAELAMEATSMFYPLSEPNTFNVFCMSVKPTKEHGGEYKEFTAIGLM 425
Query: 349 MQTNFLVGYDIEQQTVSFKPTDCTK 373
Q + +GYD++++ + + DC +
Sbjct: 426 AQQYYNIGYDLKEKNIYLQRIDCVQ 450
>gi|125532796|gb|EAY79361.1| hypothetical protein OsI_34489 [Oryza sativa Indica Group]
Length = 405
Score = 110 bits (276), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 94/349 (26%), Positives = 146/349 (41%), Gaps = 70/349 (20%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y+ +IGTPP AV D +L+WTQC PC P C+ QD PLFDP SST++ LPC S
Sbjct: 57 YVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQP--CFEQDLPLFDPTKSSTFRGLPCGS 114
Query: 151 SQCASLNQKS--CSGVNCQYSVS---------YGDGSFSNGNLATETVTLG--------- 190
C S+ + S C+ C Y G +F+ G A ET+ G
Sbjct: 115 HLCESIPESSRNCTSDVCIYEAPTKAGDTGGMAGTDTFAIGA-AKETLGFGCVVMTDKRL 173
Query: 191 STTGQAVALPGI--------------TFG---CGTNNGGLF------------NSKTTGI 221
T G + G+ F G ++G LF NS T +
Sbjct: 174 KTIGGPSGIVGLGRTPWSLVTQMNVTAFSYCLAGKSSGALFLGATAKQLAGGKNSSTPFV 233
Query: 222 VGLGGGD---------ISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLL 272
+ G + ++ ++ A Q S +++D+ + ++L G L
Sbjct: 234 IKTSAGSSDNGSNPYYMVKLAGIKAGGAPLQAASSSGSTVLLDTVSRASYLADGAYKALK 293
Query: 273 SVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIV 331
+++ + QPVA P +LC+S PE+ F GA + + +N+ + V
Sbjct: 294 KALTAAVGVQPVASPPKPYDLCFSKAVAGDAPELVFTFDGGAALTVPPANYLLASGNGTV 353
Query: 332 CSV--------FKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
C G I G++ Q N V +D++++T+SFKP DC+
Sbjct: 354 CLTIGSSASLNLTGELEGASILGSLQQENVHVLFDLKEETLSFKPADCS 402
>gi|219886223|gb|ACL53486.1| unknown [Zea mays]
gi|238015146|gb|ACR38608.1| unknown [Zea mays]
gi|413938611|gb|AFW73162.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938612|gb|AFW73163.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938613|gb|AFW73164.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938614|gb|AFW73165.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
Length = 467
Score = 110 bits (276), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 69/152 (45%), Positives = 94/152 (61%), Gaps = 13/152 (8%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY+ R+ +GTP + V DTGS L W QC PC S C+ Q P+F+PK SS+Y S+ CS
Sbjct: 128 NYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVS-CHRQSGPVFNPKASSSYTSVSCS 186
Query: 150 SSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
+ QC A+LN SCS N C Y SYGD SFS G L+ +TV+ GST+ +P
Sbjct: 187 AQQCSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS-----VPNFY 241
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
+GCG +N GLF ++ G++GL +SL+ Q+
Sbjct: 242 YGCGQDNEGLFG-QSAGLIGLARNKLSLLYQL 272
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 38/122 (31%), Positives = 54/122 (44%), Gaps = 3/122 (2%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHFR 311
+IDSGT +T LP G S L ++ ++ P A L+ C+ + +VPEVT+ F
Sbjct: 347 IIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQAARLRVPEVTMAFA 406
Query: 312 GADVKLSRS-NFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
G + N V V C F S I GN Q F V YD++ + F
Sbjct: 407 GGAALKLAARNLLVDVDSATTCLAF-APARSAAIIGNTQQQTFSVVYDVKNSKIGFAAGG 465
Query: 371 CT 372
C+
Sbjct: 466 CS 467
>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
Length = 482
Score = 110 bits (276), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 89/253 (35%), Positives = 119/253 (47%), Gaps = 44/253 (17%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI------ 84
V L+HRDS N+S D L R L R + + I + A+ AD
Sbjct: 68 VRLVHRDSFA---VNASAA------DLLARRLQR--DMRRAAWIITKAATPADPENGTVV 116
Query: 85 --IPNNANYLIRISIGTPPT-----ERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
P + Y+ +I++GTP E L D GSD+ W QC PC +CY Q P+++
Sbjct: 117 TGAPTSGEYIAKITVGTPYENDSSFEALLSPDMGSDVTWLQCMPC--FRCYHQPGPVYNR 174
Query: 138 KMSSTYKSLPCSSSQCASLNQKS-CSGV--NCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
SS+ + C + C +L C CQY V YGDGS S G+ ET+T
Sbjct: 175 LKSSSASDVGCYAPACRALGSSGGCVQFLNECQYKVEYGDGSSSAGDFGVETLTFP---- 230
Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVI 254
V +PG+ GCG++N GLF + GI+GLG G +S SQ IAG R G S +
Sbjct: 231 PGVRVPGVAIGCGSDNQGLFPAPAAGILGLGRGSLSFPSQ----IAG--RYGRSFSYCLA 284
Query: 255 DSGT-----TLTF 262
GT TLTF
Sbjct: 285 GQGTGGRSSTLTF 297
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 47/168 (27%), Positives = 78/168 (46%), Gaps = 17/168 (10%)
Query: 210 NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPD--IVIDSGTTLTFLPQ-G 266
N ++ G+VG+ G + ++R + RL ST +++DSGT +T L
Sbjct: 319 NSRMYTFYYVGLVGISVGGV----RVRGVTESDLRLDPSTGHGGVIVDSGTAVTRLSGPA 374
Query: 267 YNSNLLSVMSSMIEAQPVADPTGSL---ELCYS---FNSLSQVPEVTIHFRGA-DVKLSR 319
Y + + + ++ P G + CYS + +VP V++HF G +VKL
Sbjct: 375 YAAFRDAFRVAAVKELGWPSPGGPFAFFDTCYSSVRGRVMKKVPAVSMHFAGGVEVKLPP 434
Query: 320 SNFFVKV--SEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTV 364
N+ + V ++ +C F G + V I GNI F V YD++ Q V
Sbjct: 435 QNYLIPVDSNKGTMCFAFAGSGDRGVSIIGNIQLQGFRVVYDVDGQRV 482
>gi|21717157|gb|AAM76350.1|AC074196_8 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433294|gb|AAP54832.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 396
Score = 110 bits (276), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 91/354 (25%), Positives = 154/354 (43%), Gaps = 75/354 (21%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQC-EPCPPSQCYMQDSPLFDPKMSSTYKS 145
+ A Y++ ++IGTPP A+ D G +L+WTQC + C +C+ QD PLFD SST++
Sbjct: 47 SQAFYVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHC--RRCFKQDLPLFDTNASSTFRP 104
Query: 146 LPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSN--GNLATETVTLGSTTGQAVALPGIT 203
PC ++ C S+ +SC+G SF G + T+ V +G+ A +
Sbjct: 105 EPCGAAVCESIPTRSCAGDGGGACGYEASTSFGRTVGRIGTDAVAIGT-----AATARLA 159
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT------------------------I 239
FGC + ++G VGLG ++SL +QM T +
Sbjct: 160 FGCAVASEMDTMWGSSGSVGLGRTNLSLAAQMNATAFSYCLAPPDTGKSSALFLGASAKL 219
Query: 240 AGNQRLGVSTPDI-------------------VIDSGTTLTFLPQGYNS----------- 269
AG + +TP + I +G +PQ N+
Sbjct: 220 AGAGKGAGTTPFVKTSTPPNSGLSRSYLLRLEAIRAGNATIAMPQSGNTITVSTATPVTA 279
Query: 270 -------NLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHFR-GADVKLSRS 320
+L ++ + A PV P + +LC+ S S P++ + F+ GA++ + S
Sbjct: 280 LVDSVYRDLRKAVADAVGAAPVPPPVQNYDLCFPKASASGGAPDLVLAFQGGAEMTVPVS 339
Query: 321 NFFVKVSEDIVCSVFKG--ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
++ D C G V I G++ Q N + +D++++T+SF+P DC+
Sbjct: 340 SYLFDAGNDTACVAILGSPALGGVSILGSLQQVNIHLLFDLDKETLSFEPADCS 393
>gi|218184943|gb|EEC67370.1| hypothetical protein OsI_34481 [Oryza sativa Indica Group]
Length = 367
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 95/353 (26%), Positives = 148/353 (41%), Gaps = 79/353 (22%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
N AN+ +IGTPP A D +L+WTQC C C+ QD P+F P SST+K
Sbjct: 24 NVANF----TIGTPPQAASAFIDLTGELVWTQCSQC--IHCFKQDLPVFVPNASSTFKPE 77
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
PC + C S+ C+ C + G G + G +AT+T +G+ A + FGC
Sbjct: 78 PCGTDVCKSIPTPKCASDVCAFDGVTGLGGHTVGIVATDTFAIGTA-----APASLGFGC 132
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTT----------IAGNQRL----------- 245
+ +G +GLG SL++QM+ T N RL
Sbjct: 133 VVASDIDTMGGPSGFIGLGRTPWSLVAQMKLTRFSYCLAPHDTGKNSRLFLGASAKLAGG 192
Query: 246 GVSTPDIV-----------------IDSGTTLTFLPQGYNSNLL---------------- 272
G TP + I +G +P+G N+ L+
Sbjct: 193 GAWTPFVKTSPNDGMSQYYPIELEEIKAGDATITMPRGRNTVLVQTAVVRVSLLVDSVYQ 252
Query: 273 ----SVMSSMIEAQPVADPTGS-LELCYSFNSLSQVPEVTIHFR-GADVKLSRSNFFVKV 326
+VM+S + A P A P G E+C+ +S P++ F+ GA + + +N+ V
Sbjct: 253 EFKKAVMAS-VGAAPTATPVGEPFEVCFPKAGVSGAPDLVFTFQAGAALTVPPANYLFDV 311
Query: 327 SEDIVCSVFKGIT-------NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
D VC I + + I G+ Q N + +D+++ +SF+P DC+
Sbjct: 312 GNDTVCLSVMSIALLNITALDGLNILGSFQQENVHLLFDLDKDMLSFEPADCS 364
>gi|125532792|gb|EAY79357.1| hypothetical protein OsI_34486 [Oryza sativa Indica Group]
Length = 396
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 91/354 (25%), Positives = 154/354 (43%), Gaps = 75/354 (21%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQC-EPCPPSQCYMQDSPLFDPKMSSTYKS 145
+ A Y++ ++IGTPP A+ D G +L+WTQC + C +C+ QD PLFD SST++
Sbjct: 47 SQAFYVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHC--RRCFKQDLPLFDTNASSTFRP 104
Query: 146 LPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSN--GNLATETVTLGSTTGQAVALPGIT 203
PC ++ C S+ +SC+G SF G + T+ V +G+ A +
Sbjct: 105 EPCGAAVCESIPTRSCAGDGGGACGYEASTSFGRTVGRIGTDAVAIGT-----AATARLA 159
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT------------------------I 239
FGC + ++G VGLG ++SL +QM T +
Sbjct: 160 FGCAVASEMDTMWGSSGSVGLGRTNLSLAAQMNATAFSYCLAPPDTGKSSALFLGASAKL 219
Query: 240 AGNQRLGVSTPDI-------------------VIDSGTTLTFLPQGYNS----------- 269
AG + +TP + I +G +PQ N+
Sbjct: 220 AGAGKGAGTTPFVKTSTPPHSGLSRSYLLRLEAIRAGNATIAMPQSGNTIMVSTATPVTA 279
Query: 270 -------NLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHFR-GADVKLSRS 320
+L ++ + A PV P + +LC+ S S P++ + F+ GA++ + S
Sbjct: 280 LVDSVYRDLRKAVADAVGAAPVPPPVQNYDLCFPKASASGGAPDLVLAFQGGAEMTVPVS 339
Query: 321 NFFVKVSEDIVCSVFKG--ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
++ D C G V I G++ Q N + +D++++T+SF+P DC+
Sbjct: 340 SYLFDAGNDTACVAILGSPALGGVSILGSLQQVNIHLLFDLDKETLSFEPADCS 393
>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 412
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 76/198 (38%), Positives = 108/198 (54%), Gaps = 23/198 (11%)
Query: 56 DALTRSL-NRLNHFNQNSSISSSKAS---QADIIPNNANYLIRISIGTPPTERLAVADTG 111
D RS+ NR+ + ++ +S+ + I NY++ + +G+ T + DTG
Sbjct: 26 DLRVRSMQNRIRRVVSSHNVEASQTQIPLSSGINLQTLNYIVTMGLGS--TNMTVIIDTG 83
Query: 112 SDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL-----NQKSCSGVN- 165
SDL W QCEPC CY Q P+F P SS+Y+S+ C+SS C SL N +C G N
Sbjct: 84 SDLTWVQCEPC--MSCYNQQGPIFKPSTSSSYQSVSCNSSTCQSLQFATGNTGAC-GSNP 140
Query: 166 --CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVG 223
C Y V+YGDGS++NG L E ++ G V++ FGCG NN GLF +G++G
Sbjct: 141 STCNYVVNYGDGSYTNGELGVEQLSFG-----GVSVSDFVFGCGRNNKGLFGG-VSGLMG 194
Query: 224 LGGGDISLISQMRTTIAG 241
LG +SL+SQ T G
Sbjct: 195 LGRSYLSLVSQTNATFGG 212
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 38/128 (29%), Positives = 60/128 (46%), Gaps = 7/128 (5%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
++IDSGT +T LP L ++ P A L+ C++ +V P +++H
Sbjct: 283 VLIDSGTVITRLPSSVYKALKALFLKQFTGFPSAPGFSILDTCFNLTGYDEVSIPTISMH 342
Query: 310 FRG-ADVKLSRSNFFVKVSEDI--VCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTV 364
F G A++K+ + F V ED VC ++++ I GN Q N V YD +Q V
Sbjct: 343 FEGNAELKVDATGTFYVVKEDASQVCLALASLSDAYDTAIIGNYQQRNQRVIYDTKQSKV 402
Query: 365 SFKPTDCT 372
F C+
Sbjct: 403 GFAEESCS 410
>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
Length = 500
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 53/130 (40%), Positives = 74/130 (56%), Gaps = 8/130 (6%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ Y +I +GTP T L V DTGSD++W QC PC +CY Q +FDP+ S +Y ++
Sbjct: 143 GSGEYFTKIGVGTPVTPALMVLDTGSDVVWLQCAPC--RRCYDQSGQMFDPRASHSYGAV 200
Query: 147 PCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
C++ C L+ C C Y V+YGDGS + G+ ATET+T S +P +
Sbjct: 201 DCAAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFAS----GARVPRVAL 256
Query: 205 GCGTNNGGLF 214
GCG +N GLF
Sbjct: 257 GCGHDNEGLF 266
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 44/156 (28%), Positives = 70/156 (44%), Gaps = 16/156 (10%)
Query: 232 ISQMRTTIAGNQRLGVSTPD-----------IVIDSGTTLTFLPQGYNSNLLSVMSSMIE 280
+ M ++ G + GV+ D +++DSGT++T L + + L +
Sbjct: 345 VQLMGISVGGARVPGVAVSDLRLDPSTGRGGVIVDSGTSVTRLARPAYAALRDAFRAAAA 404
Query: 281 AQPVADPTGSL-ELCYSFNSLS--QVPEVTIHFRG-ADVKLSRSNFFVKV-SEDIVCSVF 335
++ SL + CY + L +VP V++HF G A+ L N+ + V S C F
Sbjct: 405 GLRLSPGGFSLFDTCYDLSGLKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAF 464
Query: 336 KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
G V I GNI Q F V +D + Q + F P C
Sbjct: 465 AGTDGGVSIIGNIQQQGFRVVFDGDGQRLGFVPKGC 500
>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
Length = 357
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 54/129 (41%), Positives = 74/129 (57%), Gaps = 4/129 (3%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ Y R+ IG P DTGSD+ W QC PC S CY Q P++DP SS+Y+ +
Sbjct: 8 GSGEYFARMGIGNPQRSYYLELDTGSDVTWIQCAPC--SSCYSQVDPIYDPSNSSSYRRV 65
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C S+ C +L+ +C G+ C Y V YGD S S+G+L E+ LG + + A+ I FGC
Sbjct: 66 YCGSALCQALDYSACQGMGCSYRVVYGDSSASSGDLGIESFYLGPNS--STAMRNIAFGC 123
Query: 207 GTNNGGLFN 215
G +N GLF
Sbjct: 124 GHSNSGLFR 132
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 35/131 (26%), Positives = 62/131 (47%), Gaps = 6/131 (4%)
Query: 246 GVSTPDIVIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--Q 302
G T ++DSGT++T +P Y + L + P A L+ C++F L Q
Sbjct: 228 GNGTGGAILDSGTSVTRVVPPAY-AVLRDAYRAASRNLPPAPGVYLLDTCFNFQGLPTVQ 286
Query: 303 VPEVTIHF-RGADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 360
+P + +HF G D+ L N + V C F + + + GN+ Q F +G+D++
Sbjct: 287 IPSLVLHFDNGVDMVLPGGNILIPVDRSGTFCLAFAPSSMPISVIGNVQQQTFRIGFDLQ 346
Query: 361 QQTVSFKPTDC 371
+ ++ P +C
Sbjct: 347 RSLIAIAPREC 357
>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 55/131 (41%), Positives = 73/131 (55%), Gaps = 11/131 (8%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y +R+ +GTP T V DTGSD++W QC PC CY Q +FDPK S T+ ++P
Sbjct: 132 SGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPC--KACYNQTDAIFDPKKSKTFATVP 189
Query: 148 CSSSQCASLNQKS-C---SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
C S C L+ S C C Y VSYGDGSF+ G+ +TET+T + +
Sbjct: 190 CGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTF-----HGARVDHVP 244
Query: 204 FGCGTNNGGLF 214
GCG +N GLF
Sbjct: 245 LGCGHDNEGLF 255
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 55/123 (44%), Gaps = 3/123 (2%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
++IDSGT++T L Q L A + C+ + ++ +VP V H
Sbjct: 361 VIIDSGTSVTRLTQPAYVALRDAFRLGATKLKRAPSYSLFDTCFDLSGMTTVKVPTVVFH 420
Query: 310 FRGADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
F G +V L SN+ + V +E C F G S+ I GNI Q F V YD+ V F
Sbjct: 421 FGGGEVSLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGSRVGFLS 480
Query: 369 TDC 371
C
Sbjct: 481 RAC 483
>gi|14532550|gb|AAK64003.1| AT3g61820/F15G16_210 [Arabidopsis thaliana]
Length = 362
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 55/131 (41%), Positives = 73/131 (55%), Gaps = 11/131 (8%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y +R+ +GTP T V DTGSD++W QC PC CY Q +FDPK S T+ ++P
Sbjct: 132 SGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPC--KACYNQTDAIFDPKKSKTFATVP 189
Query: 148 CSSSQCASLNQKS-C---SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
C S C L+ S C C Y VSYGDGSF+ G+ +TET+T + +
Sbjct: 190 CGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTF-----HGARVDHVP 244
Query: 204 FGCGTNNGGLF 214
GCG +N GLF
Sbjct: 245 LGCGHDNEGLF 255
>gi|195638734|gb|ACG38835.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 465
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 69/156 (44%), Positives = 96/156 (61%), Gaps = 13/156 (8%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY+ R+ +GTP + V DTGS L W QC PC S C+ Q P+F+PK SS+Y S+ CS
Sbjct: 126 NYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVS-CHRQSGPVFNPKASSSYASVSCS 184
Query: 150 SSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
+ QC A+LN SCS N C Y SYGD SFS G L+ +TV+ GST+ +P
Sbjct: 185 AQQCSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS-----VPNFY 239
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
+GCG +N GLF ++ G++GL +SL+ Q+ ++
Sbjct: 240 YGCGQDNEGLFG-QSAGLIGLARNKLSLLYQLAPSM 274
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 38/122 (31%), Positives = 54/122 (44%), Gaps = 3/122 (2%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHFR 311
+IDSGT +T LP G S L ++ ++ P A L+ C+ + +VPEVT+ F
Sbjct: 345 IIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQAARLRVPEVTMAFA 404
Query: 312 GADVKLSRS-NFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
G + N V V C F S I GN Q F V YD++ + F
Sbjct: 405 GGAALKLAARNLLVDVDSATTCLAF-APARSAAIIGNTQQQTFSVVYDVKNSKIGFAAAG 463
Query: 371 CT 372
C+
Sbjct: 464 CS 465
>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
Length = 445
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 102/405 (25%), Positives = 158/405 (39%), Gaps = 79/405 (19%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN 90
V L+HR P +P ++ P + + RS RL++ +S + +
Sbjct: 56 VPLLHRHGPCAPSLSTDTPP--SMSEMFRRSHARLSYIVSGKKVSVPAHLGTSV--KSLE 111
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y+ +S GTP ++ V DTGSDL W QC+PC QC Q PLFDP SSTY ++PC+S
Sbjct: 112 YVATVSFGTPAVPQVVVIDTGSDLTWLQCKPCSSGQCSPQKDPLFDPSHSSTYSAVPCAS 171
Query: 151 SQCASLNQKS----CS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
+C L + CS G C +++SY DG+ + G + +TL + FG
Sbjct: 172 GECKKLAADAYGSGCSNGQPCGFAISYVDGTSTVGVYGKDKLTL----APGAIVKDFYFG 227
Query: 206 CGTNN--------------------GGLF-------------NSKTTGIVGLGGGD---- 228
CG + G + NSK G + G G
Sbjct: 228 CGHSKSSLPGLFDGLLGLGRLSESLGAQYGGGGGFSYCLPAVNSK-PGFLAFGAGRNPSG 286
Query: 229 --------------ISLISQMRTTIAGNQ---RLGVSTPDIVIDSGTTLTFLPQGYNSNL 271
S ++ T+ G + R + +++DSGT +T L L
Sbjct: 287 FVFTPMGRVPGQPTFSTVTLAGITVGGKKLDLRPSAFSGGMIVDSGTVVTVLQSTVYRAL 346
Query: 272 LSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR-GADVKLSRSNFFVKVSE 328
+ ++A + G L+ CY VP++ + F GA + L N +
Sbjct: 347 RAAFREAMKAYRLVH--GDLDTCYDLTGYKNVVVPKIALTFSGGATINLDVPNGILVNG- 403
Query: 329 DIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
C F G + + GN+ Q F V +D F+ C
Sbjct: 404 ---CLAFAETGKDGTAGVLGNVNQRTFEVLFDTSASKFGFRAKAC 445
>gi|223975883|gb|ACN32129.1| unknown [Zea mays]
gi|223975971|gb|ACN32173.1| unknown [Zea mays]
gi|224034191|gb|ACN36171.1| unknown [Zea mays]
gi|413938623|gb|AFW73174.1| aspartic proteinase nepenthesin-1 isoform 1 [Zea mays]
gi|413938624|gb|AFW73175.1| aspartic proteinase nepenthesin-1 isoform 2 [Zea mays]
Length = 465
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 69/156 (44%), Positives = 96/156 (61%), Gaps = 13/156 (8%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY+ R+ +GTP + V DTGS L W QC PC S C+ Q P+F+PK SS+Y S+ CS
Sbjct: 126 NYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVS-CHRQSGPVFNPKASSSYASVSCS 184
Query: 150 SSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
+ QC A+LN SCS N C Y SYGD SFS G L+ +TV+ GST+ +P
Sbjct: 185 AQQCSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS-----VPNFY 239
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
+GCG +N GLF ++ G++GL +SL+ Q+ ++
Sbjct: 240 YGCGQDNEGLFG-QSAGLIGLARNKLSLLYQLAPSM 274
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/122 (31%), Positives = 54/122 (44%), Gaps = 3/122 (2%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHFR 311
+IDSGT +T LP G S L ++ ++ P A L+ C+ + +VPEVT+ F
Sbjct: 345 IIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQAARLRVPEVTMAFA 404
Query: 312 GADVKLSRS-NFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
G + N V V C F S I GN Q F V YD++ + F
Sbjct: 405 GGAALKLAARNLLVDVDSATTCLAF-APARSAAIIGNTQQQTFSVVYDVKNSKIGFAAGG 463
Query: 371 CT 372
C+
Sbjct: 464 CS 465
>gi|115466078|ref|NP_001056638.1| Os06g0121500 [Oryza sativa Japonica Group]
gi|113594678|dbj|BAF18552.1| Os06g0121500 [Oryza sativa Japonica Group]
Length = 442
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 86/288 (29%), Positives = 124/288 (43%), Gaps = 39/288 (13%)
Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--KSCSGVNC 166
DT DL W QC PCP +CY Q + LFDP+ S T ++PC S+ C L + CS C
Sbjct: 169 DTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCSNNQC 228
Query: 167 QYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGG 226
QY V YGDG ++G + +TL +T + FGC G F++ T+G +
Sbjct: 229 QYFVDYGDGRATSGTYMVDALTLNPST----VVMNFRFGCSHAVRGNFSASTSGTMFART 284
Query: 227 GDIS--------LISQMRTTIAGNQRLGVS----TPDIVIDSGTTLTFL-PQGYNSNLLS 273
+ + ++R G +RL V V+DS +T L P Y + L+
Sbjct: 285 PLVRNPSIIPTLYLVRLRGIEVGGRRLNVPPVVFAGGAVMDSSVIITQLPPTAYRALRLA 344
Query: 274 VMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRGADVKLSRSNFFVKVSEDIV 331
S+M VA L+ CY F + VP V++ F G V V D +
Sbjct: 345 FRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAV----------VRLDAM 394
Query: 332 CSVFKGITNSVP--------IYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+ +G VP GN+ Q V YD+ +V F+ C
Sbjct: 395 GVMVEGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVVGGSVGFRRGAC 442
>gi|55296886|dbj|BAD68338.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|55296941|dbj|BAD68392.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
Length = 424
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 86/288 (29%), Positives = 124/288 (43%), Gaps = 39/288 (13%)
Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--KSCSGVNC 166
DT DL W QC PCP +CY Q + LFDP+ S T ++PC S+ C L + CS C
Sbjct: 151 DTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCSNNQC 210
Query: 167 QYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGG 226
QY V YGDG ++G + +TL +T + FGC G F++ T+G +
Sbjct: 211 QYFVDYGDGRATSGTYMVDALTLNPST----VVMNFRFGCSHAVRGNFSASTSGTMFART 266
Query: 227 GDIS--------LISQMRTTIAGNQRLGVS----TPDIVIDSGTTLTFL-PQGYNSNLLS 273
+ + ++R G +RL V V+DS +T L P Y + L+
Sbjct: 267 PLVRNPSIIPTLYLVRLRGIEVGGRRLNVPPVVFAGGAVMDSSVIITQLPPTAYRALRLA 326
Query: 274 VMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRGADVKLSRSNFFVKVSEDIV 331
S+M VA L+ CY F + VP V++ F G V V D +
Sbjct: 327 FRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAV----------VRLDAM 376
Query: 332 CSVFKGITNSVP--------IYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+ +G VP GN+ Q V YD+ +V F+ C
Sbjct: 377 GVMVEGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 424
>gi|55296937|dbj|BAD68388.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|218197467|gb|EEC79894.1| hypothetical protein OsI_21421 [Oryza sativa Indica Group]
Length = 424
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 86/288 (29%), Positives = 124/288 (43%), Gaps = 39/288 (13%)
Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--KSCSGVNC 166
DT DL W QC PCP +CY Q + LFDP+ S T ++PC S+ C L + CS C
Sbjct: 151 DTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCSNNQC 210
Query: 167 QYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGG 226
QY V YGDG ++G + +TL +T + FGC G F++ T+G +
Sbjct: 211 QYFVDYGDGRATSGTYMVDALTLNPST----VVMNFRFGCSHAVRGNFSASTSGTMFART 266
Query: 227 GDIS--------LISQMRTTIAGNQRLGVS----TPDIVIDSGTTLTFL-PQGYNSNLLS 273
+ + ++R G +RL V V+DS +T L P Y + L+
Sbjct: 267 PLVRNPSIIPTLYLVRLRGIEVGGRRLNVPPVVFAGGAVMDSSVIITQLPPTAYRALRLA 326
Query: 274 VMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRGADVKLSRSNFFVKVSEDIV 331
S+M VA L+ CY F + VP V++ F G V V D +
Sbjct: 327 FRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAV----------VRLDAM 376
Query: 332 CSVFKGITNSVP--------IYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+ +G VP GN+ Q V YD+ +V F+ C
Sbjct: 377 GVMVEGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVVGGSVGFRRGAC 424
>gi|125571060|gb|EAZ12575.1| hypothetical protein OsJ_02481 [Oryza sativa Japonica Group]
Length = 501
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 53/130 (40%), Positives = 74/130 (56%), Gaps = 8/130 (6%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ Y +I +GTP T L V DTGSD++W QC PC +CY Q +FDP+ S +Y ++
Sbjct: 143 GSGEYFTKIGVGTPVTPALMVLDTGSDVVWLQCAPC--RRCYDQSGQMFDPRASHSYGAV 200
Query: 147 PCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
C++ C L+ C C Y V+YGDGS + G+ ATET+T S +P +
Sbjct: 201 DCAAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFAS----GARVPRVAL 256
Query: 205 GCGTNNGGLF 214
GCG +N GLF
Sbjct: 257 GCGHDNEGLF 266
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 41/85 (48%), Gaps = 4/85 (4%)
Query: 291 LELCYSFNSLS--QVPEVTIHFRG-ADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYG 346
+ CY + L +VP V++HF G A+ L N+ + V S C F G V I G
Sbjct: 417 FDTCYDLSGLKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIG 476
Query: 347 NIMQTNFLVGYDIEQQTVSFKPTDC 371
NI Q F V +D + Q + F P C
Sbjct: 477 NIQQQGFRVVFDGDGQRLGFVPKGC 501
>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
Length = 458
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 86/233 (36%), Positives = 122/233 (52%), Gaps = 33/233 (14%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRL-------------RDALTRSLNRLNHFNQNSSI 74
G + L H SP SP ++ P+ + R A T S +R + SS
Sbjct: 40 GLHLTLHHPRSPCSPAPLPADVPFSAVLTHDHARIASLAARLAKTPS-SRPTKLRRGSSS 98
Query: 75 SSSKASQADII--PNNA----NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
S S A + P + NY+ R+ +GTP + V DTGS L W QC PC S C+
Sbjct: 99 SPDAESLASVPLGPGTSVGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCLVS-CH 157
Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNL 182
Q P+F+P+ SS+Y S+ CS+ QC A+LN +CS N C Y SYGD SFS G L
Sbjct: 158 RQSGPVFNPRSSSSYASVSCSAPQCDALTTATLNPSTCSTSNVCIYQASYGDSSFSVGYL 217
Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
+ +TV+ GST+ +P +GCG +N GLF ++ G++GL +SL+ Q+
Sbjct: 218 SKDTVSFGSTS-----VPNFYYGCGQDNEGLFG-QSAGLIGLARNKLSLLYQL 264
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 39/122 (31%), Positives = 57/122 (46%), Gaps = 3/122 (2%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN-SLSQVPEVTIHFR 311
+IDSGT +T LP S L ++ ++ P A L+ C+ S +VP+V++ F
Sbjct: 338 IIDSGTVITRLPTDVYSALSKAVAGAMKGTPRASAFSILDTCFQGQASRLRVPQVSMAFA 397
Query: 312 -GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
GA +KL +N V V C F S I GN Q F V YD++ + F
Sbjct: 398 GGAALKLKATNLLVDVDSATTCLAF-APARSAAIIGNTQQQTFSVVYDVKNSKIGFAAGG 456
Query: 371 CT 372
C+
Sbjct: 457 CS 458
>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
Length = 455
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 68/168 (40%), Positives = 97/168 (57%), Gaps = 18/168 (10%)
Query: 75 SSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ--DS 132
SSS QA + Y + IS+GTPP + + DTGS+LIW QC PC ++C+ + +
Sbjct: 75 SSSVNVQAQLENGAGAYNMNISLGTPPLDFPVIVDTGSNLIWAQCAPC--TRCFPRPTPA 132
Query: 133 PLFDPKMSSTYKSLPCSSSQCASL----NQKSCSG-VNCQYSVSYGDGSFSNGNLATETV 187
P+ P SST+ LPC+ S C L ++C+ C Y+ +YG G ++ G LATET+
Sbjct: 133 PVLQPARSSTFSRLPCNGSFCQYLPTSSRPRTCNATAACAYNYTYGSG-YTAGYLATETL 191
Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
T+G T P + FGC T NG ++GIVGLG G +SL+SQ+
Sbjct: 192 TVGDGT-----FPKVAFGCSTENG---VDNSSGIVGLGRGPLSLVSQL 231
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 43/139 (30%), Positives = 68/139 (48%), Gaps = 18/139 (12%)
Query: 253 VIDSGTTLTFLPQ-GY---NSNLLSVMSSMIEAQPVADPTGSLELCYSFNS-----LSQV 303
++DSGTTLT+L + GY S M+++ + P + L+LCY ++ +V
Sbjct: 316 IVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGGGGKAVRV 375
Query: 304 PEVTIHFRG-ADVKLSRSNFFVKVSED------IVCSVFKGITNSVPI--YGNIMQTNFL 354
P + + F G A + N+F V D + C + T+ +PI GN+MQ +
Sbjct: 376 PRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDLPISIIGNLMQMDMH 435
Query: 355 VGYDIEQQTVSFKPTDCTK 373
+ YDI+ SF P DC K
Sbjct: 436 LLYDIDGGMFSFAPADCAK 454
>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 336
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 54/121 (44%), Positives = 75/121 (61%), Gaps = 5/121 (4%)
Query: 95 ISIGTPPTERLAVADTGSDLIWTQCEPCP-PSQCYMQDSPLFDPKMSSTYKSLPCSSSQC 153
+ +G P V DTGSD+ W QC PC + CY Q +P+FDP++SS+Y + C S QC
Sbjct: 1 MRVGQPQQPSFFVLDTGSDVTWLQCLPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQC 60
Query: 154 ASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGL 213
L++ C+ +C Y V YGDGSF+ G LATET+T + ++P I+ GCG +N GL
Sbjct: 61 QLLDEAGCNVNSCIYKVEYGDGSFTIGELATETLTFVHSN----SIPNISIGCGHDNEGL 116
Query: 214 F 214
F
Sbjct: 117 F 117
Score = 47.0 bits (110), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 4/124 (3%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
I++DSGTT+T LP L + P A + CY +S S +VP +
Sbjct: 213 IIVDSGTTITQLPSDVYEVLREAFLGLTTNLPPAPEISPFDTCYDLSSQSNVEVPTIAFI 272
Query: 310 FRGAD-VKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
G + ++L N ++V S C F T + I GN Q V YD+ V F
Sbjct: 273 LPGENSLQLPAKNCLIQVDSAGTFCLAFVSATFPLSIIGNFQQQGIRVSYDLTNSLVGFS 332
Query: 368 PTDC 371
C
Sbjct: 333 TNKC 336
>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
Length = 484
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 57/131 (43%), Positives = 74/131 (56%), Gaps = 11/131 (8%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y +R+ +GTP T V DTGSD++W QC PC CY Q P+F+P S T+ ++P
Sbjct: 133 SGEYFMRLGVGTPATNMYMVLDTGSDVVWLQCSPC--KVCYNQSDPVFNPAKSKTFATVP 190
Query: 148 CSSSQCASLNQKS-CSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
C S C L+ S C C Y VSYGDGSF+ G+ +TET+T VAL
Sbjct: 191 CGSRLCRRLDDSSECVSRRSKACLYQVSYGDGSFTVGDFSTETLTFHGARVDHVAL---- 246
Query: 204 FGCGTNNGGLF 214
GCG +N GLF
Sbjct: 247 -GCGHDNEGLF 256
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 38/123 (30%), Positives = 55/123 (44%), Gaps = 3/123 (2%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
++IDSGT++T L Q L A + C+ + ++ +VP V H
Sbjct: 362 VIIDSGTSVTRLTQSAYVALRDAFRLGATRLKRAPSYSLFDTCFDLSGMTTVKVPTVVFH 421
Query: 310 FRGADVKLSRSNFFVKVS-EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
F G +V L SN+ + V+ + C F G S+ I GNI Q F V YD+ V F
Sbjct: 422 FTGGEVSLPASNYLIPVNNQGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGSRVGFLS 481
Query: 369 TDC 371
C
Sbjct: 482 RAC 484
>gi|356513697|ref|XP_003525547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 252
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 76/202 (37%), Positives = 108/202 (53%), Gaps = 26/202 (12%)
Query: 54 LRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNA------NYLIRISIGTPPTERLAV 107
L D RS+ N + +S + +ASQ I ++ NY++ + +G+ +
Sbjct: 24 LDDLRVRSMQ--NRIRRVASTHNVEASQTQIPLSSGINLQTLNYIVTMGLGSK--NMTVI 79
Query: 108 ADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL-----NQKSCS 162
DT SDL W QCEPC CY Q P+F P SS+Y+S+ C+SS C SL N +C
Sbjct: 80 IDTRSDLTWVQCEPCMS--CYNQQGPIFKPSTSSSYQSVSCNSSTCQSLQFATGNTGACG 137
Query: 163 GVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTT 219
N C Y V+YGDGS++NG+L E ++ G V++ FGCG NN GLF +
Sbjct: 138 SSNPSTCNYVVNYGDGSYTNGDLGVEALSFG-----GVSVSDFVFGCGRNNKGLFGG-VS 191
Query: 220 GIVGLGGGDISLISQMRTTIAG 241
G++GLG +SL+SQ T G
Sbjct: 192 GLMGLGRSYLSLVSQTNATFGG 213
>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
Length = 455
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 68/168 (40%), Positives = 97/168 (57%), Gaps = 18/168 (10%)
Query: 75 SSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ--DS 132
SSS QA + Y + IS+GTPP + + DTGS+LIW QC PC ++C+ + +
Sbjct: 75 SSSVNVQAQLENGAGAYNMNISLGTPPLDFPVIVDTGSNLIWAQCAPC--TRCFPRPTPA 132
Query: 133 PLFDPKMSSTYKSLPCSSSQCASL----NQKSCSG-VNCQYSVSYGDGSFSNGNLATETV 187
P+ P SST+ LPC+ S C L ++C+ C Y+ +YG G ++ G LATET+
Sbjct: 133 PVLQPARSSTFSRLPCNGSFCQYLPTSSRPRTCNATAACAYNYTYGSG-YTAGYLATETL 191
Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
T+G T P + FGC T NG ++GIVGLG G +SL+SQ+
Sbjct: 192 TVGDGT-----FPKVAFGCSTENG---VDNSSGIVGLGRGPLSLVSQL 231
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 43/139 (30%), Positives = 68/139 (48%), Gaps = 18/139 (12%)
Query: 253 VIDSGTTLTFLPQ-GY---NSNLLSVMSSMIEAQPVADPTGSLELCYSFNS-----LSQV 303
++DSGTTLT+L + GY S M+++ + P + L+LCY ++ +V
Sbjct: 316 IVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGGGGKAVRV 375
Query: 304 PEVTIHFRG-ADVKLSRSNFFVKVSED------IVCSVFKGITNSVPI--YGNIMQTNFL 354
P + + F G A + N+F V D + C + T+ +PI GN+MQ +
Sbjct: 376 PRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDLPISIIGNLMQMDMH 435
Query: 355 VGYDIEQQTVSFKPTDCTK 373
+ YDI+ SF P DC K
Sbjct: 436 LLYDIDGGMFSFAPADCAK 454
>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 69/197 (35%), Positives = 98/197 (49%), Gaps = 23/197 (11%)
Query: 30 SVELIHRDSPKSPFYN---SSETPYQRLRDALTRSLNRLNHFNQN-SSISSSKASQADII 85
S+E++HR P N ++ P + R NR++ + SS QA +
Sbjct: 1 SLEVVHRHGPCIGIVNQEKGADAPSNM--EIFLRDQNRVDSIHARLSSRGMFPEKQATTL 58
Query: 86 P-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
P +Y++ + +GTP E + DTGSD+ WTQCEPC + CY Q P +P
Sbjct: 59 PVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKT-CYKQKEPRLNPS 117
Query: 139 MSSTYKSLPCSSSQCASLN-----QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
S++YK++ CSS+ C + +SCS C Y V YGDGS+S G ATET+TL S+
Sbjct: 118 TSTSYKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTLSSSN 177
Query: 194 GQAVALPGITFGCGTNN 210
FGCG N
Sbjct: 178 ----VFKNFLFGCGQQN 190
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 36/126 (28%), Positives = 57/126 (45%), Gaps = 6/126 (4%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF 310
VIDSGT +T L S L S +++ P + CY F+ ++P+V + F
Sbjct: 287 VIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTVRIPKVGVTF 346
Query: 311 RGA-DVKLSRSNFFVKVSE-DIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSF 366
+G ++ + S V+ VC F G + I+GN+ Q + V YD + V F
Sbjct: 347 KGGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGF 406
Query: 367 KPTDCT 372
P C+
Sbjct: 407 APGGCS 412
>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 64/156 (41%), Positives = 85/156 (54%), Gaps = 10/156 (6%)
Query: 60 RSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQC 119
R +N + N ++ +S ASQ Y RI +G P V DTGSD+ W QC
Sbjct: 158 RRINGSDSTNSLTAPVTSGASQG-----AGEYFARIGVGQPVQSYFFVPDTGSDVSWLQC 212
Query: 120 EPCPPSQ-CYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFS 178
+PC CY Q P+FDPK SS+Y L C S QC L++ +C +C Y V YGDGSF+
Sbjct: 213 QPCDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFT 272
Query: 179 NGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLF 214
G LATET + + ++P + GCG +N GLF
Sbjct: 273 VGELATETFSFRHSN----SIPNLPIGCGHDNEGLF 304
Score = 46.6 bits (109), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 53/124 (42%), Gaps = 4/124 (3%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
I++DSGTT+T +P L + + P A + CY +S S +VP +
Sbjct: 400 IIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVPTIAFI 459
Query: 310 FRGAD-VKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
G + ++L N ++V S C F T + I GN+ Q V YD+ V F
Sbjct: 460 LPGENSLQLPAKNCLIQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANSLVGFS 519
Query: 368 PTDC 371
C
Sbjct: 520 TDKC 523
>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 64/156 (41%), Positives = 85/156 (54%), Gaps = 10/156 (6%)
Query: 60 RSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQC 119
R +N + N ++ +S ASQ Y RI +G P V DTGSD+ W QC
Sbjct: 158 RRINGSDSTNSLTAPVTSGASQG-----AGEYFARIGVGQPVQSYFFVPDTGSDVSWLQC 212
Query: 120 EPCPPSQ-CYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFS 178
+PC CY Q P+FDPK SS+Y L C S QC L++ +C +C Y V YGDGSF+
Sbjct: 213 QPCDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFT 272
Query: 179 NGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLF 214
G LATET + + ++P + GCG +N GLF
Sbjct: 273 VGELATETFSFRHSN----SIPNLPIGCGHDNEGLF 304
Score = 45.4 bits (106), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 52/124 (41%), Gaps = 4/124 (3%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
I++DSGTT+T +P L + + P A + CY +S S +VP +
Sbjct: 400 IIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVPTIAFI 459
Query: 310 FRGAD-VKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
G + ++L N +V S C F T + I GN+ Q V YD+ V F
Sbjct: 460 LPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANSLVGFS 519
Query: 368 PTDC 371
C
Sbjct: 520 TDKC 523
>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
Length = 430
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 65/158 (41%), Positives = 88/158 (55%), Gaps = 13/158 (8%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQCYMQ---DSPLFDPKMSSTYK 144
YL+ ++ GTPP E L +ADTGSDLIW QC PP+ C + P F S+T
Sbjct: 53 QYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFVASKSATLS 112
Query: 145 SLPCSSSQCASL-----NQKSCSG---VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
+PCS++QC + + SCS V C Y+ Y DGS + G LA +T T+ + T
Sbjct: 113 VVPCSAAQCLLVPAPRGHGPSCSPAAPVPCGYAYDYADGSSTTGFLARDTATISNGTSGG 172
Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
A+ G+ FGCGT N G S T G++GLG G +S +Q
Sbjct: 173 AAVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQ 210
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 39/135 (28%), Positives = 67/135 (49%), Gaps = 13/135 (9%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPT---GSLELCYSFNSLSQV------ 303
VIDSG+TLT+L G +L+S ++ + + LELCY+ +S S +
Sbjct: 296 VIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQGLELCYNVSSSSSLAPANGG 355
Query: 304 -PEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDI 359
P +TI F +G ++L N+ V V++D+ C + + + + GN+MQ + V +D
Sbjct: 356 FPRLTIDFAQGLSLELPTGNYLVDVADDVKCLAIRPTLSPFAFNVLGNLMQQGYHVEFDR 415
Query: 360 EQQTVSFKPTDCTKQ 374
+ F T+C
Sbjct: 416 ASARIGFARTECVAH 430
>gi|212275300|ref|NP_001130675.1| uncharacterized protein LOC100191778 precursor [Zea mays]
gi|194706308|gb|ACF87238.1| unknown [Zea mays]
Length = 467
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 68/152 (44%), Positives = 94/152 (61%), Gaps = 13/152 (8%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY+ R+ +GTP + V DTGS L W QC PC S C+ Q P+F+PK SS+Y S+ CS
Sbjct: 128 NYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVS-CHRQSGPVFNPKASSSYTSVSCS 186
Query: 150 SSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
+ QC A+L+ SCS N C Y SYGD SFS G L+ +TV+ GST+ +P
Sbjct: 187 AQQCSDLTTATLSPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS-----VPNFY 241
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
+GCG +N GLF ++ G++GL +SL+ Q+
Sbjct: 242 YGCGQDNEGLFG-QSAGLIGLARNKLSLLYQL 272
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 38/122 (31%), Positives = 54/122 (44%), Gaps = 3/122 (2%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHFR 311
+IDSGT +T LP G S L ++ ++ P A L+ C+ + +VPEVT+ F
Sbjct: 347 IIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQAARLRVPEVTMAFA 406
Query: 312 GADVKLSRS-NFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
G + N V V C F S I GN Q F V YD++ + F
Sbjct: 407 GGAALKLAARNLLVDVDSATTCLAF-APARSAAIIGNTQQQTFSVVYDVKNSKIGFAAGG 465
Query: 371 CT 372
C+
Sbjct: 466 CS 467
>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 752
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 92/370 (24%), Positives = 152/370 (41%), Gaps = 87/370 (23%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y I + IG+PP + DTGSDL W QC PC C+ Q+ P +DPK S +++++ C
Sbjct: 194 GEYFIDVFIGSPPKHFSLILDTGSDLNWIQCVPC--FDCFEQNGPYYDPKDSISFRNITC 251
Query: 149 SSSQCASLNQ----KSCS--GVNCQYSVSYGDGSFSNGNLATETVTLG---STTGQA--V 197
+ +C ++ + C +C Y YGD S + G+ A ET T+ STTG++
Sbjct: 252 NDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFR 311
Query: 198 ALPGITFGCGTNNGGL----------------FNSKTTGIVG------------------ 223
+ + FGCG N GL F+S+ + G
Sbjct: 312 RVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRDSDTSVSS 371
Query: 224 --LGGGDISLIS-----------------------QMRTTIAGNQRLGVSTPDI------ 252
+ G D L++ Q+++ G ++L + +
Sbjct: 372 KLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQIPEENWNLSADG 431
Query: 253 ----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEV 306
+IDSGTTL++ + ++ + + L CY+ + ++ PE
Sbjct: 432 AGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDELNFPEF 491
Query: 307 TIHFR-GADVKLSRSNFFVKVSE-DIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQT 363
I F GA N+F+++ + DIVC G S + I GN Q NF + YD +
Sbjct: 492 LIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSALSIIGNYQQQNFHILYDTKNSR 551
Query: 364 VSFKPTDCTK 373
+ + P C +
Sbjct: 552 LGYAPMRCAE 561
>gi|222634868|gb|EEE65000.1| hypothetical protein OsJ_19937 [Oryza sativa Japonica Group]
Length = 402
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 86/280 (30%), Positives = 119/280 (42%), Gaps = 45/280 (16%)
Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--KSCSGVNC 166
DT DL W QC PCP +CY Q + LFDP+ S T ++PC S+ C L + CS C
Sbjct: 151 DTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCSNNQC 210
Query: 167 QYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGG 226
QY V YGDG ++G TL +T + FGC G F++ T+G +G+
Sbjct: 211 QYFVDYGDGRATSGRTWWTPSTLNPST----VVMNFRFGCSHAVRGNFSASTSGTMGIE- 265
Query: 227 GDISLISQMRTTIAGNQRLGVS----TPDIVIDSGTTLTFL-PQGYNSNLLSVMSSMIEA 281
G +RL V V+DS +T L P Y + L+ S+M
Sbjct: 266 -------------VGGRRLNVPPVVFAGGAVMDSSVIITQLPPTAYRALRLAFRSAMAAY 312
Query: 282 QPVADPTGSLELCYSFNSLSQ--VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGIT 339
VA L+ CY F + VP V++ F G V V D + + +G
Sbjct: 313 PRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAV----------VRLDAMGVMVEGCL 362
Query: 340 NSVP--------IYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
VP GN+ Q V YD+ +V F+ C
Sbjct: 363 AFVPTPGDFALGFIGNVQQQTHEVLYDVVGGSVGFRRGAC 402
>gi|357118871|ref|XP_003561171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 506
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 125/458 (27%), Positives = 180/458 (39%), Gaps = 112/458 (24%)
Query: 17 YVVSPIEAQTGGFSVELIHRDSPKSPFYNSSET--PYQRLRDALTRSLNRLNHFN----- 69
+ VSP + +GG L H SP SP ++ P + L L +R H
Sbjct: 58 HRVSP--SSSGGSWAPLSHLHSPCSPAAGGRDSAPPPKTLSATLQWDEHRAGHIQRKLSG 115
Query: 70 -------------QNSSISSSKASQADIIPNNANYLIRISI-----------GTPPTERL 105
Q++ ++SS A+ ++ ++ + I P +
Sbjct: 116 NAAPMDDAGEETPQSTQVTSSPAANVNVGKSSTDSAFEQGIVPAATGPGGQKKLPGVAQS 175
Query: 106 AVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKS--CSG 163
V DT SD+ W QC PCP QCY Q L+DP S PCSS QC SL + + C+G
Sbjct: 176 MVVDTASDVPWVQCAPCPQPQCYAQSDVLYDPTKSILSAPFPCSSPQCRSLGRYANGCTG 235
Query: 164 VN----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN--NGGLFNSK 217
CQY V Y DGS ++G ++ +TL + AV+ FGC G FN+K
Sbjct: 236 AGNTGTCQYRVLYPDGSGTSGTYVSDLLTLNADPKGAVS--KFQFGCSHALLRPGSFNNK 293
Query: 218 TTGIVGLGGGDISLISQMRTTIA------------GNQR----LGVS---------TPD- 251
T G + LG G SL SQ + T + G+ + LGV TP
Sbjct: 294 TAGFMALGRGAQSLSSQTKGTFSKGNVFSYCLPPTGSHKGFLSLGVPQHAASRYAVTPML 353
Query: 252 -------------IVID-SGTTLTFLPQGYNSN-------------------LLSVMSSM 278
I ID +G L P + +N L + +
Sbjct: 354 KSKMAPMIYMVRLIGIDVAGQRLPVPPAVFAANAAMDSRTIITRLPPTAYMALRAAFRAQ 413
Query: 279 IEAQPVADPTGSLELCYSFNSLSQV--PEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVF 335
+ A P G L+ CY F + V P+VT+ F R A V+L S + C F
Sbjct: 414 MRAYRAVAPKGQLDTCYDFTGVPMVRLPKVTLVFDRNAAVELDPSGVMLD-----SCLAF 468
Query: 336 KGITNS-VP-IYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
N +P I GN+ Q V Y+++ +V F+ C
Sbjct: 469 APNANDFMPGIIGNVQQQTLEVLYNVDGASVGFRRAAC 506
>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 757
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 92/370 (24%), Positives = 152/370 (41%), Gaps = 87/370 (23%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y I + IG+PP + DTGSDL W QC PC C+ Q+ P +DPK S +++++ C
Sbjct: 194 GEYFIDVFIGSPPKHFSLILDTGSDLNWIQCVPC--FDCFEQNGPYYDPKDSISFRNITC 251
Query: 149 SSSQCASLNQ----KSCS--GVNCQYSVSYGDGSFSNGNLATETVTLG---STTGQA--V 197
+ +C ++ + C +C Y YGD S + G+ A ET T+ STTG++
Sbjct: 252 NDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFR 311
Query: 198 ALPGITFGCGTNNGGL----------------FNSKTTGIVG------------------ 223
+ + FGCG N GL F+S+ + G
Sbjct: 312 RVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRDSDTSVSS 371
Query: 224 --LGGGDISLIS-----------------------QMRTTIAGNQRLGVSTPDI------ 252
+ G D L++ Q+++ G ++L + +
Sbjct: 372 KLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQIPEENWNLSADG 431
Query: 253 ----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEV 306
+IDSGTTL++ + ++ + + L CY+ + ++ PE
Sbjct: 432 AGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDELNFPEF 491
Query: 307 TIHFR-GADVKLSRSNFFVKVSE-DIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQT 363
I F GA N+F+++ + DIVC G S + I GN Q NF + YD +
Sbjct: 492 LIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSALSIIGNYQQQNFHILYDTKNSR 551
Query: 364 VSFKPTDCTK 373
+ + P C +
Sbjct: 552 LGYAPMRCAE 561
>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
Length = 383
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 107/393 (27%), Positives = 172/393 (43%), Gaps = 88/393 (22%)
Query: 54 LRDALTRSLNRLNHFNQNSSISSSKASQAD--IIPN--NANYLIRISIGTPPTERLAVAD 109
++ A+ RS RL S++++ + + + P+ + YLI+++IGTP A+ D
Sbjct: 1 MKRAIQRSQERLEKLQITSAVNTHQMKDIETPVTPDIGSGEYLIQMAIGTPALSLSAIMD 60
Query: 110 TGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV-NCQY 168
TGSDL+WT+C PC + C SSTY + C SS C + SC+ +C+Y
Sbjct: 61 TGSDLVWTKCNPC--TDCSTSSIYDP--SSSSTYSKVLCQSSLCQPPSIFSCNNDGDCEY 116
Query: 169 SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGD 228
YGD S ++G L+ ET ++ S +LP ITFGCG +N G K G+VG G G
Sbjct: 117 VYPYGDRSSTSGILSDETFSISSQ-----SLPNITFGCGHDNQGF--DKVGGLVGFGRGS 169
Query: 229 ISLISQMRTTIAGNQ-----------------RLGVSTPDIVIDSGTTLTFLPQGYNSNL 271
+SL+SQ+ ++ GN+ +G + G+T N
Sbjct: 170 LSLVSQLGPSM-GNKFSYCLVSRTDSSKTSPLFIGNTASLEATTVGSTPLVQSSSTNHYY 228
Query: 272 LSVMSSMIEAQPVADPTGSLEL--------------------CYSFNSLSQVPEVTIHFR 311
LS+ + Q +A PTG+ ++ +++++ + +I+
Sbjct: 229 LSLEGISVGGQSLAIPTGTFDIQSDGSGGLIIDSGTTLTFLQQTAYDAVKEAMVSSINLP 288
Query: 312 GADVKLS----------------------------RSNF-FVKVSEDIVCSVFKGITNS- 341
AD +L + N+ F + DIVC TNS
Sbjct: 289 QADGQLDLCFNQQGSSNPGFPSMTFHFKGADYDVPKENYLFPDSTSDIVCLAMMP-TNSN 347
Query: 342 ---VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+ I+GN+ Q N+ + YD E +SF PT C
Sbjct: 348 LGNMAIFGNVQQQNYQILYDNENNVLSFAPTAC 380
>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 494
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 56/130 (43%), Positives = 74/130 (56%), Gaps = 8/130 (6%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ Y +I +GTP T L V DTGSD++W QC PC +CY Q +FDP+ S +Y ++
Sbjct: 138 GSGEYFTKIGVGTPATPALMVLDTGSDVVWLQCAPC--RRCYDQSGQVFDPRRSRSYGAV 195
Query: 147 PCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
CS+ C L+ C C Y V+YGDGS + G+ ATET+T G VA I
Sbjct: 196 GCSAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFAG--GARVAR--IAL 251
Query: 205 GCGTNNGGLF 214
GCG +N GLF
Sbjct: 252 GCGHDNEGLF 261
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 61/125 (48%), Gaps = 5/125 (4%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCY--SFNSLSQVPEVTI 308
+++DSGT++T L + S L + ++ SL + CY S + +VP V++
Sbjct: 370 VIVDSGTSVTRLARPAYSALRDAFRAAAAGLRLSPGGFSLFDTCYDLSGRKVVKVPTVSM 429
Query: 309 HFRG-ADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
HF G A+ L N+ + V S+ C F G V I GNI Q F V +D + Q V F
Sbjct: 430 HFAGGAEAALPPENYLIPVDSKGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVGF 489
Query: 367 KPTDC 371
P C
Sbjct: 490 VPKGC 494
>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
Length = 429
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 63/158 (39%), Positives = 85/158 (53%), Gaps = 13/158 (8%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQCYMQ---DSPLFDPKMSSTYK 144
YL+ ++ GTPP E L +ADTGSDLIW QC PP+ C + P F S+T
Sbjct: 52 QYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFVASKSATLS 111
Query: 145 SLPCSSSQCASLNQKSCSG--------VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
+PCS++QC + G V C Y+ Y DGS + G LA +T T+ + T
Sbjct: 112 VVPCSAAQCLLVPAPRGHGPACSPAAPVPCGYAYDYADGSSTTGFLARDTATISNGTSGG 171
Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
A+ G+ FGCGT N G S T G++GLG G +S +Q
Sbjct: 172 AAVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQ 209
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/135 (29%), Positives = 67/135 (49%), Gaps = 13/135 (9%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV---ADPTGSLELCYSFNSLSQ------- 302
VIDSG+TLT+L G +L+S ++ + + A LELCY+ +S S
Sbjct: 295 VIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQGLELCYNVSSSSSSAPANGG 354
Query: 303 VPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDI 359
P +TI F +G ++L N+ V V++D+ C + + + + GN+MQ + V +D
Sbjct: 355 FPRLTIDFAQGLSLELPTGNYLVDVADDVKCLAIRPTLSPFAFNVLGNLMQQGYHVEFDR 414
Query: 360 EQQTVSFKPTDCTKQ 374
+ F T+C
Sbjct: 415 ASARIGFARTECVAH 429
>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 488
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 105/360 (29%), Positives = 157/360 (43%), Gaps = 82/360 (22%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCE---PCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I +GTP + DTGSD++W C CP ++ +P +D SST KS+
Sbjct: 85 YFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTP-YDADASSTAKSVS 143
Query: 148 CSSSQCASLNQKS-C-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG--QAVALPG-I 202
CS + C+ +NQ+S C SG CQY + YGDGS +NG L + V L TG Q + G I
Sbjct: 144 CSDNFCSYVNQRSECHSGSTCQYVILYGDGSSTNGYLVRDVVHLDLVTGNRQTGSTNGTI 203
Query: 203 TFGCGTNNGGLF---NSKTTGIVGLG----------------------------GGDISL 231
FGCG+ G + GI+G G GG I
Sbjct: 204 IFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGGGIFA 263
Query: 232 ISQM-----RTT----------------IAGNQRLGVST--------PDIVIDSGTTLTF 262
I ++ +TT GN L +S+ ++IDSGTTL +
Sbjct: 264 IGEVVSPKVKTTPMLSKSAHYSVNLNAIEVGNSVLQLSSDAFDSGDDKGVIIDSGTTLVY 323
Query: 263 LPQG-YNSNLLSVMSSMIE--AQPVADPTGSLELCYSFNSLSQVPEVTIHF-RGADVKLS 318
LP YN + +++S E V D S + + L + P VT F + + +
Sbjct: 324 LPDAVYNPLMNQILASHQELNLHTVQD---SFTCFHYIDRLDRFPTVTFQFDKSVSLAVY 380
Query: 319 RSNFFVKVSEDIVCSVFK--GITN----SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
+ +V ED C ++ G+ S+ I G++ +N LV YDIE Q + + +C+
Sbjct: 381 PQEYLFQVREDTWCFGWQNGGLQTKGGASLTILGDMALSNKLVVYDIENQVIGWTNHNCS 440
>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 474
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 62/163 (38%), Positives = 87/163 (53%), Gaps = 20/163 (12%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY+ + +G E V DT S+L W QC+PC C+ Q PLFDP S +Y ++PC+
Sbjct: 119 NYVATVGLGA--AEATVVVDTASELTWVQCQPC--ESCHDQQDPLFDPSSSPSYAAVPCN 174
Query: 150 SSQCASLNQKSCSGVN-----------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
SS C +L +G + C Y++SY DGS+S G LA + + L GQ +
Sbjct: 175 SSSCDALRVAMAAGTSPCADDNEQQPACSYALSYRDGSYSRGVLARDKLRL---AGQDIE 231
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
G FGCGT+N G T+G++GLG +SL+SQ G
Sbjct: 232 --GFVFGCGTSNQGAPFGGTSGLMGLGRSHVSLVSQTMDQFGG 272
>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
gi|194700872|gb|ACF84520.1| unknown [Zea mays]
Length = 351
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 94/330 (28%), Positives = 135/330 (40%), Gaps = 75/330 (22%)
Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN--QKSCSGV 164
V D+ SD+ W QC PCP C+ Q +DP S T + CSS C +L C+
Sbjct: 32 VLDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPTSAAFSCSSPTCTALGPYANGCANN 91
Query: 165 NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
CQY V Y DGS ++G + +TL + G AV+ G FGC G F+++ GI+ L
Sbjct: 92 QCQYLVRYPDGSSTSGAYIADLLTLDA--GNAVS--GFKFGCSHAEQGSFDARAAGIMAL 147
Query: 225 GGGDISLISQ-------------------------------------------------- 234
GGG SL+SQ
Sbjct: 148 GGGPESLLSQTASRYGNAFSYCIPATASDSGFFTLGVPRRASSRYVVTPMVRFRQAATFY 207
Query: 235 ---MRTTIAGNQRLGVSTPDI-----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVAD 286
+RT G QRLGV+ P + V+DS T +T LP L + S + A
Sbjct: 208 GVLLRTITVGGQRLGVA-PAVFAAGSVLDSRTAITRLPPTAYQALRAAFRSSMTMYRSAP 266
Query: 287 PTGSLELCYSFNSLS--QVPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVF-KGITNSV 342
P G L+ CY F + ++P++++ F R A + L S C F + +
Sbjct: 267 PKGYLDTCYDFTGVVNIRLPKISLVFDRNAVLPLDPSGILFN-----DCLAFTSNADDRM 321
Query: 343 P-IYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
P + G++ Q V YD+ V F+ C
Sbjct: 322 PGVLGSVQQQTIEVLYDVGGGAVGFRQGAC 351
>gi|297818124|ref|XP_002876945.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297322783|gb|EFH53204.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 206
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 60/134 (44%), Positives = 76/134 (56%), Gaps = 9/134 (6%)
Query: 3 TFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSL 62
+F + L+ F S I A +VELIHRDSP SP YN T L RS+
Sbjct: 70 SFFEVILHLYTAIFCFSSTI-ANRENLTVELIHRDSPHSPLYNPHHTVSDGLNATFLRSI 128
Query: 63 NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
+R FN + + Q+ +I N YL+ ISIGTPP++ LA+ADTGSDL W QC+P
Sbjct: 129 SRSRRFNTKTDL------QSGLISNGGEYLMSISIGTPPSKVLAIADTGSDLTWVQCKPY 182
Query: 123 PPSQCYMQDSPLFD 136
QCY Q+SPLFD
Sbjct: 183 --QQCYKQNSPLFD 194
>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 492
Score = 107 bits (267), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 63/194 (32%), Positives = 93/194 (47%), Gaps = 22/194 (11%)
Query: 32 ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD-------- 83
L+HRD ++ + T + L L R R + + ++
Sbjct: 77 RLVHRDD-----FSVNATAAELLAYRLERDAKRAARLSAAAGPANGTRRGGGGVVAPVVS 131
Query: 84 -IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
+ + Y +I +GTP T L V DTGSD++W QC PC +CY Q +FDP+ S +
Sbjct: 132 GLAQGSGEYFTKIGVGTPATPALMVLDTGSDVVWLQCAPC--RRCYEQSGQVFDPRRSRS 189
Query: 143 YKSLPCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
Y ++ C++ C L+ C C Y V+YGDGS + G+ ATET+T G VA
Sbjct: 190 YNAVGCAAPLCRRLDSGGCDLRRSACLYQVAYGDGSVTAGDFATETLTFAG--GARVAR- 246
Query: 201 GITFGCGTNNGGLF 214
+ GCG +N GLF
Sbjct: 247 -VALGCGHDNEGLF 259
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 61/125 (48%), Gaps = 5/125 (4%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCY--SFNSLSQVPEVTI 308
+++DSGT++T L + S L ++ SL + CY S + +VP V++
Sbjct: 368 VIVDSGTSVTRLARPAYSALRDAFRGAAAGLRLSPGGFSLFDTCYDLSGRKVVKVPTVSM 427
Query: 309 HFRG-ADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
HF G A+ L N+ + V S+ C F G V I GNI Q F V +D + Q V+F
Sbjct: 428 HFAGGAEAALPPENYLIPVDSKGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVAF 487
Query: 367 KPTDC 371
P C
Sbjct: 488 TPKGC 492
>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
Length = 459
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 101/358 (28%), Positives = 151/358 (42%), Gaps = 89/358 (24%)
Query: 93 IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
+ + +GTPP + D GSDL+WTQC P+ Q P+FD SS++ LPC S
Sbjct: 109 LTVGVGTPPQPSKVILDLGSDLLWTQCSLVGPTA--KQLEPVFDAARSSSFSVLPCDSKL 166
Query: 153 C--ASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT-N 209
C + K+C+ C Y YG + + G LATET T G+ G + L TFGCG
Sbjct: 167 CEAGTFTNKTCTDRKCAYENDYGIMT-ATGVLATETFTFGAHHGVSANL---TFGCGKLA 222
Query: 210 NGGL-------------------------------FNSKTTGIVGLGG----------GD 228
NG + F + T V G G
Sbjct: 223 NGTIAEASGILGLSPGPLSMLKQLAITKFSYCLTPFADRKTSPVMFGAMADLGKYKTTGK 282
Query: 229 ISLISQMRTTI-------------AGNQRLGVS------TPD----IVIDSGTTLTFLPQ 265
+ I ++ + G++RL V PD V+DS TTL +L +
Sbjct: 283 VQTIPLLKNPVEDIYYYVPMVGMSVGSKRLDVPQETLAIKPDGTGGTVLDSATTLAYLVE 342
Query: 266 GYNSNLLSVMSSMIEAQPVADPT-GSLELCYSF-NSLS----QVPEVTIHFRG-ADVKLS 318
+ L + I+ PVA+ + +C+ +S QVP + +HF G A++ L
Sbjct: 343 PAFTELKKAVMEGIKL-PVANRSVDDYPVCFELPRGMSMEGVQVPPLVLHFDGDAEMSLP 401
Query: 319 RSNFFVKVSEDIVC-----SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
R N+F + S ++C + F+G N + GN+ Q N V YD+ + S+ PT C
Sbjct: 402 RDNYFQEPSPGMMCLAVMQAPFEGAPN---VIGNVQQQNMHVLYDVGNRKFSYAPTKC 456
>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
gi|194690124|gb|ACF79146.1| unknown [Zea mays]
gi|194708040|gb|ACF88104.1| unknown [Zea mays]
gi|223950469|gb|ACN29318.1| unknown [Zea mays]
gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
Length = 500
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 78/235 (33%), Positives = 116/235 (49%), Gaps = 36/235 (15%)
Query: 27 GGFSV-ELIHRDSPKSPFYNSSETPYQRLRDALTR--SLN-RLNHFNQNSSISSSK---- 78
GG +V EL H +P + E L R SL R+ H+ ++ SS++
Sbjct: 65 GGATVLELRHHSFSPAPANSREEEADALLSTDAARVSSLQGRIEHYRLTTTSSSAEVAVT 124
Query: 79 ASQADI-IPNNA-----NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
AS+A + + + A NY+ + +G E + DT S+L W QC PC C+ Q
Sbjct: 125 ASKAQVPVSSGARLRTLNYVATVGLGG--GEATVIVDTASELTWVQCAPC--ESCHDQQG 180
Query: 133 PLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-------------CQYSVSYGDGSFSN 179
PLFDP S +Y ++PC S C +L Q+ +G C Y++SY DGS+S
Sbjct: 181 PLFDPSSSPSYAAVPCDSPSCDALQQQLATGAGAGAPPCDAGRPAACSYALSYRDGSYSR 240
Query: 180 GNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
G LA + ++L + G FGCGT+N G T+G++GLG +SL+SQ
Sbjct: 241 GVLAHDRLSLAGEV-----IDGFVFGCGTSNQGPPFGGTSGLMGLGRSQLSLVSQ 290
>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
Length = 481
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 94/330 (28%), Positives = 134/330 (40%), Gaps = 75/330 (22%)
Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN--QKSCSGV 164
V D+ SD+ W QC PCP C+ Q +DP S + CSS C +L C+
Sbjct: 162 VLDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPSSAPFSCSSPTCTALGPYANGCANN 221
Query: 165 NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
CQY V Y DGS ++G + +TL + G AV+ G FGC G F+++ GI+ L
Sbjct: 222 QCQYLVRYPDGSSTSGAYIADLLTLDA--GNAVS--GFKFGCSHAEQGSFDARAAGIMAL 277
Query: 225 GGGDISLISQ-------------------------------------------------- 234
GGG SL+SQ
Sbjct: 278 GGGPESLLSQTASRYGNAFSYCIPATASDSGFFTLGVPRRASSRYVVTPMVRFRQAATFY 337
Query: 235 ---MRTTIAGNQRLGVSTPDI-----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVAD 286
+RT G QRLGV+ P + V+DS T +T LP L S S + A
Sbjct: 338 GVLLRTITVGGQRLGVA-PAVFAAGSVLDSRTAITRLPPTAYQALRSAFRSSMTMYRSAP 396
Query: 287 PTGSLELCYSFNSLS--QVPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVF-KGITNSV 342
P G L+ CY F + ++P++++ F R A + L S C F + +
Sbjct: 397 PKGYLDTCYDFTGVVNIRLPKISLVFDRNAVLPLDPSGILFN-----DCLAFTSNADDRM 451
Query: 343 P-IYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
P + G++ Q V YD+ V F+ C
Sbjct: 452 PGVLGSVQQQTIEVLYDVGGGAVGFRQGAC 481
>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 471
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 54/128 (42%), Positives = 70/128 (54%), Gaps = 8/128 (6%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y RI +GTPP V DTGSD++W QC PC CY Q P+F+P S ++ +
Sbjct: 126 SGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPC--KNCYSQTDPVFNPVKSGSFAKVL 183
Query: 148 CSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C + C L C+ C Y VSYGDGS++ G TET+T T + VAL GC
Sbjct: 184 CRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVEQVAL-----GC 238
Query: 207 GTNNGGLF 214
G +N GLF
Sbjct: 239 GHDNEGLF 246
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 41/123 (33%), Positives = 58/123 (47%), Gaps = 3/123 (2%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
++ID GT++T L + L + + A + CY + + +VP V +H
Sbjct: 348 VIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLH 407
Query: 310 FRGADVKLSRSNFFVKVS-EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
FRGADV L SN+ + V C F G T+ + I GNI Q F V YD+ V F P
Sbjct: 408 FRGADVSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSP 467
Query: 369 TDC 371
C
Sbjct: 468 RGC 470
>gi|224114179|ref|XP_002332420.1| predicted protein [Populus trichocarpa]
gi|222832373|gb|EEE70850.1| predicted protein [Populus trichocarpa]
Length = 449
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 72/239 (30%), Positives = 119/239 (49%), Gaps = 26/239 (10%)
Query: 25 QTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI 84
Q G ++ELIH+DSP+SP Y + P +++ L+H Q S +S++KA +
Sbjct: 10 QLDGLTMELIHKDSPQSPLYPGNLPPGEQILQPAACPFAGLHH--QTSMMSTNKAVMNRM 67
Query: 85 IPNNANY------LIRISIGT--PPTERLAVA------DTGSDLIWTQCEPC--PPSQCY 128
+ +Y L ++ +G+ + R DTG++L W QCE C + C+
Sbjct: 68 MSPLTSYGDPFLFLAQVGVGSFQEKSHRTHFKTYYFQIDTGNELSWIQCEGCQNKGNMCF 127
Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVT 188
P + S +YK + C+ NQ C C Y+V+YG GS+++GNLA ET T
Sbjct: 128 PHKDPPYTSSQSKSYKPVSCNQHSFCEPNQ--CKEGLCAYNVTYGPGSYTSGNLANETFT 185
Query: 189 LGSTTGQAVALPGITFGCGTNNGGLF------NSKTTGIVGLGGGDISLISQMRTTIAG 241
S G+ AL I+FGC T++ + + +G++G+G G S ++Q+ + G
Sbjct: 186 FYSNHGKHTALKSISFGCSTDSRNMIYAFLLDKNPVSGVLGMGWGPRSFLAQLGSISHG 244
>gi|326529233|dbj|BAK01010.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 441
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 81/245 (33%), Positives = 115/245 (46%), Gaps = 34/245 (13%)
Query: 8 VFILFFLCFYVVSPIEAQTG-GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLN 66
VF+L LCF + TG G ++L H D + T +R+R A+ S RL
Sbjct: 6 VFLLVLLCFRASLVTSSSTGAGLRMKLTHVDD------KAGYTTEERVRRAVAVSRERLA 59
Query: 67 HFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQC-EPCPPS 125
+ Q + +S A + Y+ IG PP A+ DTGS+LIWTQC C
Sbjct: 60 YTQQQQQLRASGDVSAPVHLATRQYIAEYLIGDPPQRAAALIDTGSNLIWTQCGTTCGLK 119
Query: 126 QCYMQDSPLFDPKMSSTYKSLPCSSSQ--CASLNQKSCSGVN--CQYSVSYGDGSFSNGN 181
C QD P ++ SST+ ++PC+ S CA+ C G++ C ++ SYG GS G+
Sbjct: 120 ACAKQDLPYYNLSRSSTFAAVPCADSAKLCAANGVHLC-GLDGSCTFAASYGAGSV-FGS 177
Query: 182 LATETVTLGSTTGQAVALPGITFGC--------GTNNGGLFNSKTTGIVGLGGGDISLIS 233
L TE T S + + FGC G NG +G++GLG G +SL+S
Sbjct: 178 LGTEAFTFQSGAAK------LGFGCVSLTRITKGALNG------ASGLIGLGRGRLSLVS 225
Query: 234 QMRTT 238
Q T
Sbjct: 226 QTGAT 230
Score = 47.8 bits (112), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 71/144 (49%), Gaps = 13/144 (9%)
Query: 234 QMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIE---AQPVADPTGS 290
++R AG GV +ID+G+ +T L + S L ++ + QP AD TG
Sbjct: 304 ELRRVAAGYWSGGV-----IIDTGSPVTSLAEAAYSALSDEVARQLNRSLVQPPAD-TG- 356
Query: 291 LELCYSFNSLSQV-PEVTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNI 348
L+LC + + +V P + HF G AD+ +S +++ V + C + + I GN
Sbjct: 357 LDLCVARQDVDKVVPVLVFHFGGGADMAVSAGSYWGPVDKSTACMLIEEGGYETVI-GNF 415
Query: 349 MQTNFLVGYDIEQQTVSFKPTDCT 372
Q + + YDI + +SF+ DC+
Sbjct: 416 QQQDVHLLYDIGKGELSFQTADCS 439
>gi|21717175|gb|AAM76368.1|AC074196_26 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433292|gb|AAP54830.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 418
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 95/360 (26%), Positives = 150/360 (41%), Gaps = 86/360 (23%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
N AN+ +IGTPP A+ D +L+WTQC C S+C+ QD PLF P SST++
Sbjct: 67 NVANF----TIGTPPQPASAIIDVAGELVWTQCSMC--SRCFKQDLPLFVPNASSTFRPE 120
Query: 147 PCSSSQCASLNQKSCSGVNCQY--SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
PC + C S+ +CS C Y +++ G + G +AT+T +G+ T + F
Sbjct: 121 PCGTDACKSIPTSNCSSNMCTYEGTINSKLGGHTLGIVATDTFAIGTATAS------LGF 174
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI----------AGNQRL--------- 245
GC +G +G++GLG SL+SQM T N RL
Sbjct: 175 GCVVASGIDTMGGPSGLIGLGRAPSSLVSQMNITKFSYCLTPHDSGKNSRLLLGSSAKLA 234
Query: 246 ----GVSTPDIV-----------------IDSGTTLTFLPQGYNSNLLSVMSSM------ 278
+TP + I +G LP N+ L+ ++ M
Sbjct: 235 GGGNSTTTPFVKTSPGDDMSQYYPIQLDGIKAGDAAIALPPSGNTVLVQTLAPMSFLVDS 294
Query: 279 ------------IEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR--GADVKLSRSNF 322
+ A P A P +LC+ LS P++ F+ A + + +
Sbjct: 295 AYQALKKEVTKAVGAAPTATPLQPFDLCFPKAGLSNASAPDLVFTFQQGAAALTVPPPKY 354
Query: 323 FVKVSED--IVCSVF--------KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
+ V E+ VC + ++ I G++ Q N D+E++T+SF+P DC+
Sbjct: 355 LIDVGEEKGTVCMAILSTSWLNTTALDENLNILGSLQQENTHFLLDLEKKTLSFEPADCS 414
>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
Length = 332
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 59/138 (42%), Positives = 82/138 (59%), Gaps = 13/138 (9%)
Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-----ASLNQKSC 161
+ DTGS L W QC+PC C+ Q PL+DP +S TYK L C+S +C A+LN C
Sbjct: 2 ILDTGSSLSWLQCQPCA-VYCHAQADPLYDPSVSKTYKKLSCASVECSRLKAATLNDPLC 60
Query: 162 SGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTT 219
+ C Y+ SYGD SFS G L+ + +TL S+ LP T+GCG +N GLF +
Sbjct: 61 ETDSNACLYTASYGDTSFSIGYLSQDLLTLTSSQ----TLPQFTYGCGQDNQGLFG-RAA 115
Query: 220 GIVGLGGGDISLISQMRT 237
GI+GL +S+++Q+ T
Sbjct: 116 GIIGLARDKLSMLAQLST 133
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 39/125 (31%), Positives = 61/125 (48%), Gaps = 6/125 (4%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS-LELCY--SFNSLSQVPEVTIH 309
+IDSGT +T LP + L ++ + P S L+ C+ S S+S VPE+ +
Sbjct: 207 LIDSGTVITRLPMSMYAALRQAFVKIMSTKYAKAPAYSILDTCFKGSLKSISAVPEIKMI 266
Query: 310 FRG-ADVKLSRSNFFVKVSEDIVCSVFKGI--TNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
F+G AD+ L + ++ + I C F G TN + I GN Q + + YD+ + F
Sbjct: 267 FQGGADLTLRAPSILIEADKGITCLAFAGSSGTNQIAIIGNRQQQTYNIAYDVSTSRIGF 326
Query: 367 KPTDC 371
P C
Sbjct: 327 APGSC 331
>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
[Cucumis sativus]
Length = 384
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 54/129 (41%), Positives = 70/129 (54%), Gaps = 8/129 (6%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ Y RI +GTPP V DTGSD++W QC PC CY Q P+F+P S ++ +
Sbjct: 38 GSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPC--KNCYSQTDPVFNPVKSGSFAKV 95
Query: 147 PCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
C + C L C+ C Y VSYGDGS++ G TET+T T + VAL G
Sbjct: 96 LCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVEQVAL-----G 150
Query: 206 CGTNNGGLF 214
CG +N GLF
Sbjct: 151 CGHDNEGLF 159
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 41/123 (33%), Positives = 58/123 (47%), Gaps = 3/123 (2%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
++ID GT++T L + L + + A + CY + + +VP V +H
Sbjct: 261 VIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLH 320
Query: 310 FRGADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
FRGADV L SN+ + V C F G T+ + I GNI Q F V YD+ V F P
Sbjct: 321 FRGADVSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSP 380
Query: 369 TDC 371
C
Sbjct: 381 RGC 383
>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
Length = 430
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 67/212 (31%), Positives = 95/212 (44%), Gaps = 7/212 (3%)
Query: 30 SVELIHRDS--PKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
+++LI R+S +P TP ++ S R + QNS + +S + +
Sbjct: 2 AMKLIRRESVVRHNPDARVPVTPEDHIQHMTDISSARFKYL-QNSIVKELGSSDFQVDVH 60
Query: 88 NAN----YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
A + + S+G PP + + DTGS L+W QC PC P+F+P +SST+
Sbjct: 61 QAIKTSLFFVNFSVGQPPVPQFTIMDTGSSLLWIQCHPCKHCSSNHMIHPVFNPALSSTF 120
Query: 144 KSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
C C CS C Y Y G+ S G LA E +T + G V I
Sbjct: 121 VECSCDDRFCRYAPNGHCSSNKCVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQPIA 180
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
FGCG NG S+ TGI+GLG SL Q+
Sbjct: 181 FGCGHENGEQLESEFTGILGLGAKPTSLAVQL 212
>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
Length = 420
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 60/138 (43%), Positives = 81/138 (58%), Gaps = 10/138 (7%)
Query: 76 SSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
SS + QA + Y + IS+GTP VADTGSDLIWTQC PC ++C+ Q +P F
Sbjct: 71 SSVSFQALLENGVGGYNMNISVGTPLLTFSVVADTGSDLIWTQCAPC--TKCFQQPAPPF 128
Query: 136 DPKMSSTYKSLPCSSSQCASL--NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
P SST+ LPC+SS C L + ++C+ C Y+ YG G ++ G LATET+ +G
Sbjct: 129 QPASSSTFSKLPCTSSFCQFLPNSIRTCNATGCVYNYKYGSG-YTAGYLATETLKVGD-- 185
Query: 194 GQAVALPGITFGCGTNNG 211
+ P + FGC T NG
Sbjct: 186 ---ASFPSVAFGCSTENG 200
>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 491
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 97/339 (28%), Positives = 139/339 (41%), Gaps = 80/339 (23%)
Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL----NQKSCS 162
V DT SD+ W QC PCP C+ Q L+DP SS+ + PCSS C +L N + +
Sbjct: 159 VIDTASDVPWVQCAPCPAPHCHAQTDVLYDPSKSSSSAAFPCSSPACRNLGPYANGCTPA 218
Query: 163 GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN--NGGLFNSKTTG 220
G CQY V Y DGS S G ++ +TL A A+ FGC G F++KT+G
Sbjct: 219 GDQCQYRVQYPDGSASAGTYISDVLTLNPAK-PASAISEFRFGCSHALLQPGSFSNKTSG 277
Query: 221 IVGLGGGDISLISQMRTT-------------------IAGNQRLGVS----TPDI----- 252
I+ LG G SL +Q + T I G R+ S TP +
Sbjct: 278 IMALGRGAQSLPTQTKATYGDVFSYCLPPTPVHSGFFILGVPRVAASRYAVTPMLRSKAA 337
Query: 253 -----------------------------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP 283
V+DS T +T LP L + + + A
Sbjct: 338 PMLYLVRLIAIEVAGKRLPVPPAVFAAGAVMDSRTIVTRLPPTAYMALRAAFVAEMRAYR 397
Query: 284 VADPTGSLELCYSFNSLS-------QVPEVTIHFRGAD--VKLSRSNFFVKVSEDIVCSV 334
A P L+ CY F+ + ++P++T+ F G + V+L S + C
Sbjct: 398 AAAPKEHLDTCYDFSGAAPGGGGGVKLPKITLVFDGPNGAVELDPSGVLLD-----GCLA 452
Query: 335 FKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
F T+ I GN+ Q V Y+++ TV F+ C
Sbjct: 453 FAPNTDDQMTGIIGNVQQQALEVLYNVDGATVGFRRGAC 491
>gi|326500408|dbj|BAK06293.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 475
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 93/332 (28%), Positives = 131/332 (39%), Gaps = 78/332 (23%)
Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--KSCS---- 162
DT D+ W QC PCP QCY Q PLFDP SST ++ C S C SL CS
Sbjct: 153 DTTVDVPWIQCAPCPIPQCYPQRDPLFDPTTSSTAAAVRCRSPACRSLGPYGNGCSNRSA 212
Query: 163 GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIV 222
C+Y + Y D + G T+T+T+ TT A+ FGC G F+ T G +
Sbjct: 213 NAECRYLIEYSDDRATAGTYMTDTLTISGTT----AVRNFRFGCSHAVRGRFSDLTAGTM 268
Query: 223 GLGGGDISLISQ-----------------------------------------MRTTI-- 239
LGGG SL++Q +R+ I
Sbjct: 269 SLGGGAQSLLAQTARSLGNAFSYCVPQASASGFLSIGGPATTNSTTVFATTPLVRSAINP 328
Query: 240 -----------AGNQRLGVS----TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV 284
+RLG+ + V+DS +T LP L + + A P
Sbjct: 329 SLYLVRLQGIVVAGRRLGIPPVAFSAGAVMDSSAVITQLPPTAYRALRRAFRNAMRAYPR 388
Query: 285 ADPTGSLELCYSFNSLS--QVPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNS 341
+ TG+L+ CY F L+ +VP V++ F GA V L + C F ++
Sbjct: 389 SGATGTLDTCYDFLGLTNVRVPAVSLVFGGGAVVVLDPPAVMIG-----GCLAFTATSSD 443
Query: 342 VPI--YGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+ + GN+ Q V YD+ V F+ C
Sbjct: 444 LALGFIGNVQQQTHEVLYDVAAGGVGFRRGAC 475
>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
Length = 505
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 59/154 (38%), Positives = 79/154 (51%), Gaps = 6/154 (3%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ +++ + G+P DTGSD+ W QC PC CY Q P+FDP S+TY ++
Sbjct: 157 DTLEFVVTVGFGSPAQNYTLSIDTGSDVSWIQCLPCS-GHCYKQHDPVFDPTKSATYSAV 215
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
PC QCA+ K + C Y V+YGDGS + G L+ ET++L ST LPG FGC
Sbjct: 216 PCGHPQCAAAGGKCSNSGTCLYKVTYGDGSSTAGVLSHETLSLSSTRD----LPGFAFGC 271
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
G N G F + G +SL SQ T
Sbjct: 272 GQTNLGEFGGVDGLVGLGRGA-LSLPSQAAATFG 304
>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 100/384 (26%), Positives = 161/384 (41%), Gaps = 84/384 (21%)
Query: 63 NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
RL F + ++S+++ D + N Y R+ IGTPP + + DTGS + + C C
Sbjct: 55 RRLRQFPTSDNLSNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTC 114
Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCS-SSQCASLNQKSCSGVNCQYSVSYGDGSFSNGN 181
QC P FDP+ SSTYK + C+ C S GV C Y Y + S S+G
Sbjct: 115 --EQCGRHQDPKFDPESSSTYKPIKCNIDCICDS------DGVQCVYERQYAEMSTSSGV 166
Query: 182 LATETVTLGSTTGQAVALP-GITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
L + ++ G+ Q+ +P FGC G LF+ + GI+GLG GD+SL+ Q+
Sbjct: 167 LGEDVISFGN---QSELIPQRAVFGCENMETGDLFSQRADGIMGLGTGDLSLVDQLVEKG 223
Query: 240 AGNQRL-----------------GVSTP-DIV---------------------------- 253
A N G+S P D++
Sbjct: 224 AINDSFSLCYGGMDIGGGAMVLGGISPPSDMIFTYSDPVRSPYYNVDLKEIHVAGKKLPL 283
Query: 254 ------------IDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS--- 296
+DSGTT +LP + +++ ++M + + + P + ++C+S
Sbjct: 284 SSGIFDGRYGAVLDSGTTYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAG 343
Query: 297 --FNSLS-QVPEVTIHFR-GADVKLSRSNFFV---KVSEDIVCSVFKGITNSVPIYGNIM 349
LS + P V + F G + L+ N+F KV +F+ + + G I+
Sbjct: 344 SDAAELSNKFPTVDMVFENGQKLSLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIV 403
Query: 350 QTNFLVGYDIEQQTVSFKPTDCTK 373
N LV YD + F T+C++
Sbjct: 404 VRNTLVMYDRANSKIGFWKTNCSE 427
>gi|449433371|ref|XP_004134471.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449495479|ref|XP_004159853.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 424
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 71/219 (32%), Positives = 110/219 (50%), Gaps = 12/219 (5%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQA----- 82
GF+ LIH DSP SPFYN + T R+ + RS +RLN+ + +S +
Sbjct: 7 GFTARLIHHDSPLSPFYNHTMTDTARIEATVHRSRSRLNYLYYINKLSENALDNDVSLSP 66
Query: 83 DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS- 141
++ YL+ +IG P ++ + DT + LIW QC C SQC + L +SS
Sbjct: 67 TLVNEGGEYLMSFNIGNPSSQVMGFLDTSNGLIWVQCSNC-NSQCEPEKRGLTTKFLSSK 125
Query: 142 --TYKSLPCSSSQCASLNQ-KSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
TY+ PC S+ C SL ++C+ + C+Y + YGD ++G L++++ ++ G
Sbjct: 126 SFTYEMEPCGSNFCNSLTGFQTCNSSDKWCKYRLVYGDNKATSGILSSDSFGFDTSDGML 185
Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
V + + FGC TG VGL +SLISQ+
Sbjct: 186 VDVGFLNFGCSEAPLTGDEQSYTGNVGLNQTPLSLISQL 224
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 46/124 (37%), Positives = 66/124 (53%), Gaps = 5/124 (4%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIE-AQPVADPTGSLELCYSF---NSLSQVPEVTI 308
+ID+G T + L +LL+ ++ + Q DP ELC+ N L P+VT+
Sbjct: 298 IIDTGITYSSLETDAFDSLLAKFLTLKDFPQRKDDPKERFELCFELQNANDLESFPDVTV 357
Query: 309 HFRGADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
HF GAD+ L+ + FVK+ +D I C + V I GN N+ VGYD+E Q +SF
Sbjct: 358 HFDGADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYHVGYDLEAQVISFA 417
Query: 368 PTDC 371
P DC
Sbjct: 418 PVDC 421
>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 100/384 (26%), Positives = 161/384 (41%), Gaps = 84/384 (21%)
Query: 63 NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
RL F + ++S+++ D + N Y R+ IGTPP + + DTGS + + C C
Sbjct: 55 RRLRQFPTSDNLSNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTC 114
Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCS-SSQCASLNQKSCSGVNCQYSVSYGDGSFSNGN 181
QC P FDP+ SSTYK + C+ C S GV C Y Y + S S+G
Sbjct: 115 --EQCGRHQDPKFDPESSSTYKPIKCNIDCICDS------DGVQCVYERQYAEMSTSSGV 166
Query: 182 LATETVTLGSTTGQAVALP-GITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
L + ++ G+ Q+ +P FGC G LF+ + GI+GLG GD+SL+ Q+
Sbjct: 167 LGEDVISFGN---QSELIPQRAVFGCENMETGDLFSQRADGIMGLGTGDLSLVDQLVEKG 223
Query: 240 AGNQRL-----------------GVSTP-DIV---------------------------- 253
A N G+S P D++
Sbjct: 224 AINDSFSLCYGGMDIGGGAMVLGGISPPSDMIFTYSDPVRSPYYNVDLKEIHVAGKKLPL 283
Query: 254 ------------IDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS--- 296
+DSGTT +LP + +++ ++M + + + P + ++C+S
Sbjct: 284 SSGIFDGRYGAVLDSGTTYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAG 343
Query: 297 --FNSLS-QVPEVTIHFR-GADVKLSRSNFFV---KVSEDIVCSVFKGITNSVPIYGNIM 349
LS + P V + F G + L+ N+F KV +F+ + + G I+
Sbjct: 344 SDAAELSNKFPTVDMVFENGQKLSLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIV 403
Query: 350 QTNFLVGYDIEQQTVSFKPTDCTK 373
N LV YD + F T+C++
Sbjct: 404 VRNTLVMYDRANSKIGFWKTNCSE 427
>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
Length = 475
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 49/130 (37%), Positives = 71/130 (54%), Gaps = 8/130 (6%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ Y ++ +GTP T L V DTGSD++W QC PC CY Q +FDP+ S +Y ++
Sbjct: 118 GSGEYFAQVGVGTPATTALMVLDTGSDVVWLQCAPC--RHCYAQSGRVFDPRRSRSYAAV 175
Query: 147 PCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
C + C L+ C C Y V+YGDGS + G+ A+ET+T + + +
Sbjct: 176 DCVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTF----ARGARVQRVAI 231
Query: 205 GCGTNNGGLF 214
GCG +N GLF
Sbjct: 232 GCGHDNEGLF 241
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 37/125 (29%), Positives = 57/125 (45%), Gaps = 5/125 (4%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSF--NSLSQVPEVTI 308
+++DSGT++T L + + + V+ SL + CY+ + +VP V++
Sbjct: 351 VILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSM 410
Query: 309 HFRG-ADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
H G A V L N+ + V + C G V I GNI Q F V +D + Q V F
Sbjct: 411 HLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGF 470
Query: 367 KPTDC 371
P C
Sbjct: 471 VPKSC 475
>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
Length = 475
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 49/130 (37%), Positives = 71/130 (54%), Gaps = 8/130 (6%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ Y ++ +GTP T L V DTGSD++W QC PC CY Q +FDP+ S +Y ++
Sbjct: 118 GSGEYFAQVGVGTPATTALMVLDTGSDVVWLQCAPC--RHCYAQSGRVFDPRRSRSYAAV 175
Query: 147 PCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
C + C L+ C C Y V+YGDGS + G+ A+ET+T + + +
Sbjct: 176 DCVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTF----ARGARVQRVAI 231
Query: 205 GCGTNNGGLF 214
GCG +N GLF
Sbjct: 232 GCGHDNEGLF 241
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 37/125 (29%), Positives = 57/125 (45%), Gaps = 5/125 (4%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSF--NSLSQVPEVTI 308
+++DSGT++T L + + + V+ SL + CY+ + +VP V++
Sbjct: 351 VILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSM 410
Query: 309 HFRG-ADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
H G A V L N+ + V + C G V I GNI Q F V +D + Q V F
Sbjct: 411 HLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGF 470
Query: 367 KPTDC 371
P C
Sbjct: 471 VPKSC 475
>gi|147794033|emb|CAN68918.1| hypothetical protein VITISV_035156 [Vitis vinifera]
Length = 398
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 97/353 (27%), Positives = 145/353 (41%), Gaps = 55/353 (15%)
Query: 32 ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANY 91
E+ RD + F NS Y S N NH + N ++ + N+
Sbjct: 88 EIXGRDESRVSFINSKCNQYT--------SGNLKNHAHNN-----------NLFDEDGNF 128
Query: 92 LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
L+ ++ GTPP + DTGS + WTQC+ C C FB SSTY C
Sbjct: 129 LVDVAFGTPPQXFXLILDTGSSITWTQCKAC--VNCLQDSXRYFBXSASSTYSXGSCIPX 186
Query: 152 QCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNG 211
V Y+++YGD S S GN T+TL + FG G NN
Sbjct: 187 T-----------VENNYNMTYGDDSTSVGNYGCXTMTLEPSD----VFQKFQFGXGRNNK 231
Query: 212 GLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNL 271
G F S G++GLG G +S +SQ + + P+ DS +L F + + +
Sbjct: 232 GDFGSGADGMLGLGQGQLSTVSQTASKFX--KVFSYCLPE--EDSIGSLLFGEKATSQSS 287
Query: 272 LSVMSSMIEAQPVADPTGSLELCYSFNSLSQV------PEVTIHF-RGADVKLSRSNFFV 324
+S++ +G E Y F L + PE+ +HF GADV+L+ +N
Sbjct: 288 SLKFTSLVNGPGT---SGLXESGYYFVKLLDISVDVLLPEIVLHFGGGADVRLNGTNIVW 344
Query: 325 KVSEDIVCSVFKGITNS-----VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
+C F G + S + I GN Q + V YDI+ + F+ C+
Sbjct: 345 GSDASRLCLAFAGNSKSTMNPELTIIGNRQQLSLTVLYDIQGGRIGFRSNGCS 397
>gi|296082634|emb|CBI21639.3| unnamed protein product [Vitis vinifera]
Length = 278
Score = 103 bits (258), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 69/191 (36%), Positives = 95/191 (49%), Gaps = 37/191 (19%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNN 88
F V L H DS + T ++RL+ A+ R RL + ++ S + +A + N
Sbjct: 35 FRVSLRHVDS------GGNYTKFERLQRAMKRGKLRLQRLSAKTA-SFESSVEAPVHAGN 87
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
+L++++IGTP A+ DTGSDLIWTQC+PC C+ Q +P+FDPK SS++ LPC
Sbjct: 88 GEFLMKLAIGTPAETYSAIMDTGSDLIWTQCKPC--KDCFDQPTPIFDPKKSSSFSKLPC 145
Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
SS S Q G LATET G + + I FGCG
Sbjct: 146 SSDLYYSSTQ---------------------GVLATETFAFGDAS-----VSKIGFGCGE 179
Query: 209 NNGGLFNSKTT 219
+N G NS TT
Sbjct: 180 DNDG--NSGTT 188
>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
Length = 481
Score = 103 bits (258), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 49/130 (37%), Positives = 71/130 (54%), Gaps = 8/130 (6%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ Y ++ +GTP T L V DTGSD++W QC PC CY Q +FDP+ S +Y ++
Sbjct: 124 GSGEYFAQVGVGTPATTALMVLDTGSDVVWLQCAPC--RHCYAQSGRVFDPRRSRSYAAV 181
Query: 147 PCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
C + C L+ C C Y V+YGDGS + G+ A+ET+T + + +
Sbjct: 182 DCVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTF----ARGARVQRVAI 237
Query: 205 GCGTNNGGLF 214
GCG +N GLF
Sbjct: 238 GCGHDNEGLF 247
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 37/125 (29%), Positives = 57/125 (45%), Gaps = 5/125 (4%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSF--NSLSQVPEVTI 308
+++DSGT++T L + + + V+ SL + CY+ + +VP V++
Sbjct: 357 VILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSM 416
Query: 309 HFRG-ADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
H G A V L N+ + V + C G V I GNI Q F V +D + Q V F
Sbjct: 417 HLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGF 476
Query: 367 KPTDC 371
P C
Sbjct: 477 VPKSC 481
>gi|357118074|ref|XP_003560784.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial
[Brachypodium distachyon]
Length = 452
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 52/126 (41%), Positives = 72/126 (57%), Gaps = 6/126 (4%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
+++ + G+P + DTGSDL W QC+PC CY Q P+FDP SS+Y +PC
Sbjct: 111 EFVVVVGFGSPAQTSATMFDTGSDLSWIQCQPCS-GHCYKQHDPVFDPAKSSSYAVVPCG 169
Query: 150 SSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
+++CA+ + C+G C Y V YGDGS + G LA ET+T S++ G FGCG
Sbjct: 170 TTECAAAGGE-CNGTTCVYGVEYGDGSSTTGVLARETLTFSSSS----EFTGFIFGCGET 224
Query: 210 NGGLFN 215
N G F
Sbjct: 225 NLGDFG 230
Score = 44.7 bits (104), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 34/127 (26%), Positives = 54/127 (42%), Gaps = 8/127 (6%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRG 312
++DSGT LT+LP + L ++ A P L+ CY F S + + F
Sbjct: 326 LLDSGTILTYLPPPAYTALRDRFKFTMQGSKPAPPYDELDTCYDFTGQSGILIPGVSFNF 385
Query: 313 ADVKLSRSNFF--VKVSED----IVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTV 364
+D + NFF + +D + C F +P + G+ Q + V YD+ Q +
Sbjct: 386 SDGAVFNLNFFGIMTFPDDTKPAVGCLAFVSRPADMPFSVVGSTTQRSAEVIYDVPAQKI 445
Query: 365 SFKPTDC 371
F P C
Sbjct: 446 GFIPASC 452
>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 478
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 77/231 (33%), Positives = 114/231 (49%), Gaps = 25/231 (10%)
Query: 74 ISSSKASQADIIPNNANYLIRISIGTPPTERLAVA-DTGSDLIWTQCEPCPPSQCYMQDS 132
++ ADI ++ YLI +SIGTP +R+A+ DTGSDL+WTQC C C+ Q
Sbjct: 86 LARGTVGDADI---DSEYLIHLSIGTPRPQRVALTLDTGSDLVWTQCA-C--HVCFAQPF 139
Query: 133 PLFDPKMSSTYKSLPCSSSQCAS----LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVT 188
P FD S T ++PCS C S L+ + + C Y Y D S ++G + +T T
Sbjct: 140 PTFDALASQTTLAVPCSDPICTSGKYPLSGCTFNDNTCFYLYDYADKSITSGRIVEDTFT 199
Query: 189 L-------GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
GS VA+P + FGCG N G+F S +GI G G +SL SQ++
Sbjct: 200 FRSPQGNNGSKAHAGVAVPNVRFGCGQYNKGIFKSNESGIAGFSRGPMSLPSQLKVARFS 259
Query: 242 NQRLGVSTPDIVIDSGTTLTFLPQGYN-SNLLSVMSSMIEAQPVADPTGSL 291
+ ++ D+ T+ FL NL + + +++ P A+ GSL
Sbjct: 260 HCFTAIA------DARTSPVFLGGAPGPDNLGAHATGPVQSTPFANSNGSL 304
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 39/140 (27%), Positives = 63/140 (45%), Gaps = 20/140 (14%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE---LCYS---------FNSL 300
+IDSGT + LP +L + + ++ PVA+ + + LC+
Sbjct: 339 IIDSGTGIRTLPGPMYRSLRAAFVARVKL-PVANESAADAESTLCFEAARSASLPPEAPA 397
Query: 301 SQVPEVTIHFRGADVKLSRSNFFVKVSEDI------VCSVFKGITNS-VPIYGNIMQTNF 353
+P+V +H GAD L R ++ + + ED +C V +S + I GN Q N
Sbjct: 398 PALPKVVLHVAGADWDLPRESYVLDLLEDEDGSGSGLCLVMNSAGDSDLTIIGNFQQQNM 457
Query: 354 LVGYDIEQQTVSFKPTDCTK 373
V YD+E+ + F P C K
Sbjct: 458 HVAYDLEKNKLVFVPARCDK 477
>gi|413938618|gb|AFW73169.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 324
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 101/333 (30%), Positives = 132/333 (39%), Gaps = 82/333 (24%)
Query: 109 DTGSDLIWTQCEPCPPS-QCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN---QKSCSGV 164
DTGSDL W QC+PC + CY Q PLFDP SS+Y ++PC CA L +CS
Sbjct: 4 DTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPCGGPVCAGLGIYAASACSAA 63
Query: 165 NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
C Y VSYGDGS + G +++T+TL +++ A+ G FGCG GLFN G++GL
Sbjct: 64 QCGYVVSYGDGSNTTGVYSSDTLTLSASS----AVQGFFFGCGHAQSGLFNG-VDGLLGL 118
Query: 225 GGGDISLISQ--------------MRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSN 270
G SL+ Q + + AG LGV P +T LP
Sbjct: 119 GREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPSGAAPGFSTTQLLPSPNAPT 178
Query: 271 LLSVMSSMIE-----------------------------------------------AQP 283
VM + I P
Sbjct: 179 YYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVVTRLPPTAYAALRSAFRSGMASYGYP 238
Query: 284 VADPTGSLELCYSFNSLSQV--PEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFK--GI 338
A G L+ CY+F V P V + F GA V L C F G
Sbjct: 239 TAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADGIL-----SFGCLAFAPSGS 293
Query: 339 TNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+ I GN+ Q +F V I+ +V FKP+ C
Sbjct: 294 DGGMAILGNVQQRSFEV--RIDGTSVGFKPSSC 324
>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 496
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 100/368 (27%), Positives = 144/368 (39%), Gaps = 96/368 (26%)
Query: 83 DIIPNN------ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
D PNN N+L+ ++ GTPP + + DTGS + WTQC+PC +C FD
Sbjct: 148 DHTPNNKLFDEDGNFLVDVAFGTPPQKFTLILDTGSSITWTQCKPC--VRCLKASRRHFD 205
Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
P S TY C S V Y+++YGD S S GN +T+TL +
Sbjct: 206 PSASLTYSLGSCIPST-----------VGNTYNMTYGDKSTSVGNYGCDTMTLE----HS 250
Query: 197 VALPGITFGCGTNNGG-------------------------------------------- 212
P FGCG NN G
Sbjct: 251 DVFPKFQFGCGRNNEGDFGSGADGMLGLGQGQLSTVSQTASKFKKVFSYCLPEEDSIGSL 310
Query: 213 LFNSKTT---------------GIVGLGGGDISLISQMRTTIAGNQRLGV-----STPDI 252
LF K T G GL + + ++ GN+RL + ++P
Sbjct: 311 LFGEKATSQSSSLKFTSLVNGPGTSGLEESGYYFVKLLDISV-GNKRLNIPSSVFASPGT 369
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS----LELCYSFNSLSQV--PEV 306
+IDSGT +T LPQ S L + + P+++ L+ CY+ + V PE+
Sbjct: 370 IIDSGTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEI 429
Query: 307 TIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
+HF GADV+L+ +C F G + + I GN Q + V YDI+ +
Sbjct: 430 VLHFGEGADVRLNGKRVIWGNDASRLCLAFAG-NSELTIIGNRQQVSLTVLYDIQGGRIG 488
Query: 366 FKPTDCTK 373
F C+K
Sbjct: 489 FGGNGCSK 496
>gi|125595855|gb|EAZ35635.1| hypothetical protein OsJ_19925 [Oryza sativa Japonica Group]
Length = 335
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 76/281 (27%), Positives = 133/281 (47%), Gaps = 27/281 (9%)
Query: 98 GTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN 157
GT + + D+GSD+ W QC+PCP C+ Q PLFDP S+TY ++PCSS+ CA L
Sbjct: 75 GTSAVSQTVIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYAAVPCSSAACARLG 134
Query: 158 --QKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG-TNNGGL 213
++ C + CQ+ ++Y +G+ + G +++ +TLG + G FGC + G
Sbjct: 135 PYRRGCLANSQCQFGITYANGATATGTYSSDDLTLGPYD----VVRGFLFGCAHADQGST 190
Query: 214 FNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLS 273
F+ G + LGGG S + Q + ++ P G + +P + + +
Sbjct: 191 FSYDVAGTLALGGGSQSFVQQTASQY--SRVFSYCVPPSTSSFGFIMFGVPPQRAALVPT 248
Query: 274 VMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVC 332
+S+ + + PT F S++ +P + + F GA V L + ++ C
Sbjct: 249 FVSTPLLSSSTMSPT--------FYSIT-LPSIALVFDGGATVNLDAAGILLQ-----GC 294
Query: 333 SVFK-GITNSVPIY-GNIMQTNFLVGYDIEQQTVSFKPTDC 371
F ++ +P + GN+ Q V YD+ + + F+ C
Sbjct: 295 LAFAPTASDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 335
>gi|357143657|ref|XP_003573000.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Brachypodium distachyon]
Length = 464
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 74/231 (32%), Positives = 106/231 (45%), Gaps = 29/231 (12%)
Query: 23 EAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLN---------HFNQNSS 73
++ + G +V L HR P SP S + + L R R N H+ +
Sbjct: 52 DSSSSGATVPLNHRHGPCSPV-PSGKKKQPTFTELLRRDQLRANYIQRQFSDEHYPRTGG 110
Query: 74 ISSSKAS---QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
+ S+A+ + N Y+I +SIG+P DTGSD+ W +C+
Sbjct: 111 LQQSEATVPIALGSLLNTLEYVITVSIGSPAVAXTMFIDTGSDVSWLRCK---------- 160
Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASLNQKSC---SGVNCQYSVSYGDGSFSNGNLATETV 187
S L+DP SSTY CS+ CA L ++ SG C YSV YGDGS + G ++T+
Sbjct: 161 -SRLYDPGTSSTYAPFSCSAPACAQLGRRGTGCSSGSTCVYSVKYGDGSNTTGTYGSDTL 219
Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
TL T+ ++ G FGC G T G++GLGG S +SQ T
Sbjct: 220 TLAGTSEPLIS--GFQFGCSAVEHGFEEDNTDGLMGLGGDAQSFVSQTAAT 268
Score = 45.4 bits (106), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 53/129 (41%), Gaps = 15/129 (11%)
Query: 253 VIDSGTTLTFLP-QGYNSNLLSVMSSMI--EAQPVADPTGSLELCYSFNSLSQ-----VP 304
++DSGT +T LP Y + + M + QP A P G L+ C+ F + VP
Sbjct: 341 IVDSGTVITRLPPTAYGALSAAFRDGMARYQYQPAA-PRGLLDTCFDFTGHGEGNNFTVP 399
Query: 305 EVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQ 362
V + G V N V+ C F + I GN+ Q F V YD+ Q
Sbjct: 400 SVALVLDGGAVVDLHPNGIVQDG----CLAFAATDDDGRTGIIGNVQQRTFEVLYDVGQS 455
Query: 363 TVSFKPTDC 371
F+P C
Sbjct: 456 VFGFRPGAC 464
>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
Length = 473
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 62/161 (38%), Positives = 86/161 (53%), Gaps = 19/161 (11%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY+ + +G E + DT S+L W QC PC + C+ Q PLFDP S +Y LPC+
Sbjct: 126 NYVATVGLGG--GEATVIVDTASELTWVQCAPC--ASCHDQQGPLFDPASSPSYAVLPCN 181
Query: 150 SSQCASLNQKSCSGV---------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
SS C +L + S +C Y++SY DGS+S G LA + ++L +
Sbjct: 182 SSSCDALQVATGSAAGACGGGEQPSCSYTLSYRDGSYSQGVLAHDKLSLAGEV-----ID 236
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
G FGCGT+N G F T+G++GLG +SLISQ G
Sbjct: 237 GFVFGCGTSNQGPFGG-TSGLMGLGRSQLSLISQTMDQFGG 276
>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
Length = 509
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 100/348 (28%), Positives = 134/348 (38%), Gaps = 80/348 (22%)
Query: 94 RISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC 153
R S P +L + DT SD+ W QC PCP SQCY Q L+DP S + +S CSS C
Sbjct: 172 RRSRLRPGVRQLMLLDTASDVAWVQCFPCPASQCYAQTDVLYDPSKSRSSESFACSSPTC 231
Query: 154 ASL-------NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
L + S S CQY V Y DGS ++G L + ++L T+ +P FGC
Sbjct: 232 RQLGPYANGCSSSSNSAGQCQYRVRYPDGSTTSGTLVADQLSLSPTS----QVPKFEFGC 287
Query: 207 GTNNGGLFN-SKTTGIVGLGGGDISLISQMRT---------------------------- 237
G F+ SKT GI+ LG G SL+SQ T
Sbjct: 288 SHAARGSFSRSKTAGIMALGRGVQSLVSQTSTKYGQVFSYCFPPTASHKGFFVLGVPRRS 347
Query: 238 -----------------------TIAGNQRLGVS----TPDIVIDSGTTLTFLPQGYNSN 270
+AG QRL V +DS T +T LP
Sbjct: 348 SSRYAVTPMLKTPMLYQVRLEAIAVAG-QRLDVPPTVFAAGAALDSRTVITRLPPTAYQA 406
Query: 271 LLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR--GADVKLSRSNFFVKV 326
L S + A G L+ CY F +S + P +++ F GA V+L S
Sbjct: 407 LRSAFRDKMSMYRPAAANGQLDTCYDFTGVSSIMLPTISLVFDRTGAGVQLDPSGVLFG- 465
Query: 327 SEDIVCSVF---KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
C F G + I G + V Y++ +V F+ C
Sbjct: 466 ----SCLAFASTAGDDRATGIIGFLQLQTIEVLYNVAGGSVGFRRGAC 509
>gi|125553836|gb|EAY99441.1| hypothetical protein OsI_21409 [Oryza sativa Indica Group]
Length = 376
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 55/144 (38%), Positives = 81/144 (56%), Gaps = 8/144 (5%)
Query: 98 GTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN 157
GT + + D+GSD+ W QC+PCP C+ Q PLFDP S+TY ++PCSS+ CA L
Sbjct: 155 GTSAVRQTVIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYSAVPCSSAACARLG 214
Query: 158 --QKSCSG-VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG-TNNGGL 213
++ CS V CQ+ +Y DG+ + G +++ +TLG + G FGC + G
Sbjct: 215 PYRRGCSANVQCQFGFTYTDGATATGTYSSDDLTLGPYD----VVRGFLFGCAHADRGST 270
Query: 214 FNSKTTGIVGLGGGDISLISQMRT 237
F+ +G + LGGG S + Q T
Sbjct: 271 FSFDVSGTLALGGGAQSFVQQTAT 294
>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
Length = 472
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 62/161 (38%), Positives = 86/161 (53%), Gaps = 19/161 (11%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY+ + +G E + DT S+L W QC PC + C+ Q PLFDP S +Y LPC+
Sbjct: 125 NYVATVGLGG--GEATVIVDTASELTWVQCAPC--ASCHDQQGPLFDPASSPSYAVLPCN 180
Query: 150 SSQCASLNQKSCSGV---------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
SS C +L + S +C Y++SY DGS+S G LA + ++L +
Sbjct: 181 SSSCDALQVATGSAAGACGGGEQPSCSYTLSYRDGSYSQGVLAHDKLSLAGEV-----ID 235
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
G FGCGT+N G F T+G++GLG +SLISQ G
Sbjct: 236 GFVFGCGTSNQGPFGG-TSGLMGLGRSQLSLISQTMDQFGG 275
>gi|297605079|ref|NP_001056639.2| Os06g0121800 [Oryza sativa Japonica Group]
gi|255676668|dbj|BAF18553.2| Os06g0121800 [Oryza sativa Japonica Group]
Length = 487
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 54/132 (40%), Positives = 72/132 (54%), Gaps = 6/132 (4%)
Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--KSCSGVNC 166
DT DL W QC PCP +CY Q + LFDP+ S T ++PC S+ C L + CS C
Sbjct: 167 DTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCSNNQC 226
Query: 167 QYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGG 226
QY V YGDG ++G + +TL +T + FGC G F++ T+G + LGG
Sbjct: 227 QYFVDYGDGRATSGTYMVDALTLNPST----VVMNFRFGCSHAVRGNFSASTSGTMSLGG 282
Query: 227 GDISLISQMRTT 238
G SL+SQ T
Sbjct: 283 GRQSLLSQTAAT 294
>gi|357133002|ref|XP_003568117.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 497
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 91/362 (25%), Positives = 151/362 (41%), Gaps = 81/362 (22%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I IGTPP DTGSD++W C+ CP D L+DPK SS+ ++
Sbjct: 87 YYTKIEIGTPPKPFHVQVDTGSDILWVNCVSCDKCPTKSGLGIDLALYDPKGSSSGSAVS 146
Query: 148 CSSSQCASLNQKS------CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV---A 198
C + CA+ +G C+Y YGDGS + G+ ++++ +G A A
Sbjct: 147 CDNKFCAATYGSGEKLPGCTAGKPCEYRAEYGDGSSTAGSFVSDSLQYNQLSGNAQTRHA 206
Query: 199 LPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRT-------------TIAGN 242
+ FGCG GG N GI+G G + S +SQ+ + TI G
Sbjct: 207 KANVIFGCGAQQGGDLESTNQALDGIIGFGQSNTSTLSQLASAGEVKKIFSHCLDTIKGG 266
Query: 243 QRLGV---------STP---------------DI--------------------VIDSGT 258
+ STP D+ +IDSGT
Sbjct: 267 GIFAIGEVVQPKVKSTPLLPNMSHYNVNLQSIDVAGNALQLPPHIFETSEKRGTIIDSGT 326
Query: 259 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLS 318
TLT+LP+ ++L+ + + G L YS + P++T HF D+ L+
Sbjct: 327 TLTYLPELVYKDILAAVFQKHQDITFRTIQGFLCFEYSESVDDGFPKITFHFE-DDLGLN 385
Query: 319 R--SNFFVKVSEDIVCSVFKG------ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
++F + +++ C F+ + + G+++ +N +V YD+E+Q + + +
Sbjct: 386 VYPHDYFFQNGDNLYCLGFQNGGFQPKDAKDMVLLGDLVLSNKVVVYDLEKQVIGWTDYN 445
Query: 371 CT 372
C+
Sbjct: 446 CS 447
>gi|357143328|ref|XP_003572882.1| PREDICTED: uncharacterized protein LOC100846829 [Brachypodium
distachyon]
Length = 836
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 119/425 (28%), Positives = 173/425 (40%), Gaps = 97/425 (22%)
Query: 31 VELIHRDSPKSPFYNSSETPY--------QRLRDALTRSLNRLNHFNQNSSISSSKASQA 82
+ L HR P + S+ P +R + + R ++ +++ +S++
Sbjct: 425 LRLTHRHGPCAGPSRSASAPSFAEVLRADERRAEYIQRRMSGAKGPGGLQQFTAASSSKS 484
Query: 83 DIIPNNAN-------YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
IP N Y++ +S+GTP + DTGSD+ W QC PC CY Q LF
Sbjct: 485 VTIPANIGHSIGTLQYVVTVSLGTPGVAQTVEVDTGSDVSWVQCAPCAAPACYAQKDQLF 544
Query: 136 DPKMSSTYKSLPCSSSQCASLN---QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST 192
DP SS+Y ++PC++ C+ L+ +G C Y VSYGDGS + G ++T+TL
Sbjct: 545 DPAKSSSYSAVPCAADACSELSTYGHGCAAGSQCGYVVSYGDGSNTTGVYGSDTLTLT-- 602
Query: 193 TGQAVALPGITFGCGTNNGGLF----------------NSKTTGIVGLG----------- 225
A A+ G FGCG GLF S+T+G G G
Sbjct: 603 --DADAVTGFLFGCGHAQAGLFAGIDGLLALGRKGMSLTSQTSGAYGGGVFSYCLPPSPS 660
Query: 226 -------GG----------------DISLISQMRTT---IAGNQRLGVSTPDI----VID 255
GG D+ + T + G Q GV V+D
Sbjct: 661 STGFLTLGGPSSASGFATTGLLTAWDVPTFYMVMLTGIGVGGQQLSGVPASAFAGGTVVD 720
Query: 256 SGTTLTFLPQGYNSNLLSVMSSMIE--AQPVADPTGSLELCYSFNSLSQV--PEVTIHFR 311
+GT +T LP + L + + + P A TG L+ CY+F V P V++ F
Sbjct: 721 TGTVITRLPPTAYAALRAAFRAAMAPYGYPAAPATGILDTCYNFTDYGTVTLPTVSLTFS 780
Query: 312 -GADVKLSRSNFFVKVSEDIVCSVFKGITNS----VPIYGNIMQTNFLVGYDIEQQTVSF 366
GA +KL F C F TNS I GN+ Q +F V +D +V F
Sbjct: 781 GGATLKLDAPGFLSS-----GCLAFA--TNSGDGDPAILGNVQQRSFAVRFD--GSSVGF 831
Query: 367 KPTDC 371
P C
Sbjct: 832 MPHSC 836
>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
Length = 512
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 60/166 (36%), Positives = 86/166 (51%), Gaps = 23/166 (13%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY+ + +G E + DT S+L W QC PC C+ Q PLFDP S +Y ++PC+
Sbjct: 152 NYVATVGLGG--GEATVIVDTASELTWVQCAPC--ESCHDQQDPLFDPSSSPSYAAVPCN 207
Query: 150 SSQCASLN---------QKSCSGVN-----CQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
SS C +L +C G + C Y++SY DGS+S G LA + ++L
Sbjct: 208 SSSCDALQLATGGTSGGAAACQGQDQSAAACSYTLSYRDGSYSRGVLAHDRLSLAGEV-- 265
Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
+ G FGCGT+N G T+G++GLG +SL+SQ G
Sbjct: 266 ---IDGFVFGCGTSNQGPPFGGTSGLMGLGRSQLSLVSQTMDQFGG 308
Score = 37.7 bits (86), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 36/127 (28%), Positives = 53/127 (41%), Gaps = 9/127 (7%)
Query: 253 VIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
+IDSGT +T +P YN+ +S E P A L+ C++ L QVP + +
Sbjct: 384 IIDSGTVITSLVPSIYNAVKAEFLSQFAE-YPQAPGFSILDTCFNMTGLREVQVPSLKLV 442
Query: 310 FRGA-DVKLSRSN--FFVKVSEDIVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTV 364
F G +V++ +FV VC + + I GN Q N V +D V
Sbjct: 443 FDGGVEVEVDSGGVLYFVSSDSSQVCLAMAPLKSEYETNIIGNYQQKNLRVIFDTSGSQV 502
Query: 365 SFKPTDC 371
F C
Sbjct: 503 GFAQETC 509
>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 480
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 58/158 (36%), Positives = 81/158 (51%), Gaps = 15/158 (9%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS-----PLFDPKMSSTYKS 145
Y +I +G+PP E DTGSD++W C PCP +C ++ L+D K SST K+
Sbjct: 77 YFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCP--KCPVKTDLGIPLSLYDSKASSTSKN 134
Query: 146 LPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP--- 200
+ C + C+ + Q G C Y V YGDGS S+G+ + +TL TG P
Sbjct: 135 VGCEDAFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFVKDNITLDQVTGNLRTAPLAQ 194
Query: 201 GITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM 235
+ FGCG N G S GI+G G + S+ISQ+
Sbjct: 195 EVVFGCGKNQSGQLGQTESAVDGIMGFGQSNTSVISQL 232
>gi|218197468|gb|EEC79895.1| hypothetical protein OsI_21423 [Oryza sativa Indica Group]
Length = 471
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 54/132 (40%), Positives = 72/132 (54%), Gaps = 6/132 (4%)
Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--KSCSGVNC 166
DT DL W QC PCP +CY Q + LFDP+ S T ++PC S+ C L + CS C
Sbjct: 151 DTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCSNNQC 210
Query: 167 QYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGG 226
QY V YGDG ++G + +TL +T + FGC G F++ T+G + LGG
Sbjct: 211 QYFVDYGDGRATSGTYMVDALTLNPST----VVMNFRFGCSHAVRGNFSASTSGTMSLGG 266
Query: 227 GDISLISQMRTT 238
G SL+SQ T
Sbjct: 267 GRQSLLSQTAAT 278
>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 473
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 96/394 (24%), Positives = 162/394 (41%), Gaps = 88/394 (22%)
Query: 63 NRLNHFNQNSSISSSKASQADIIPNNAN--------YLIRISIGTPPTERLAVADTGSDL 114
+L HF + + S+ + +P + Y +I +G+PP E DTGSD+
Sbjct: 38 KKLEHFKSHDTRRHSRMLASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDI 97
Query: 115 IWTQCEPCP--PSQCYMQ-DSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCS-GVNCQYS 169
+W C+PCP PS+ + LFD SST K + C C+ ++Q SC V C Y
Sbjct: 98 LWVNCKPCPECPSKTNLNFHLSLFDVNASSTSKKVGCDDDFCSFISQSDSCQPAVGCSYH 157
Query: 170 VSYGDGSFSNGNLATETVTLGSTTGQAVALP---GITFGCGTNNGGLF---NSKTTGIVG 223
+ Y D S S GN + +TL TG P + FGCG++ G +S G++G
Sbjct: 158 IVYADESTSEGNFIRDKLTLEQVTGDLQTGPLGQEVVFGCGSDQSGQLGKSDSAVDGVMG 217
Query: 224 LGGGDISLISQMRTT-------------IAGNQRLGVST--------------------- 249
G + S++SQ+ T + G V
Sbjct: 218 FGQSNTSVLSQLAATGDAKRVFSHCLDNVKGGGIFAVGVVDSPKVKTTPMVPNQMHYNVM 277
Query: 250 --------------PDIVIDSG------TTLTFLPQGYNSNLLSVMSSMIEAQPVA-DPT 288
P I+ + G TTL + P+ +L+ +++ QPV
Sbjct: 278 LMGMDVDGTALDLPPSIMRNGGTIVDSGTTLAYFPKVLYDSLI---ETILARQPVKLHIV 334
Query: 289 GSLELCYSFNSLSQV--PEVTIHFRGADVKLSR--SNFFVKVSEDIVCSVFK------GI 338
C+SF+ V P V+ F + VKL+ ++ + +++ C ++ G
Sbjct: 335 EDTFQCFSFSENVDVAFPPVSFEFEDS-VKLTVYPHDYLFTLEKELYCFGWQAGGLTTGE 393
Query: 339 TNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
V + G+++ +N LV YD+E + + + +C+
Sbjct: 394 RTEVILLGDLVLSNKLVVYDLENEVIGWADHNCS 427
>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 418
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 58/154 (37%), Positives = 81/154 (52%), Gaps = 14/154 (9%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y + +GTPP + + D+GSDL+W QC PC QCY QDSPL+ P SST+ +P
Sbjct: 61 SGQYFVDFFLGTPPQKFSLIVDSGSDLLWVQCSPC--RQCYAQDSPLYVPSNSSTFSPVP 118
Query: 148 CSSSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
C SS C + + C Y Y D S S G A E+ T+ V +
Sbjct: 119 CLSSDCLLIPATEGFPCDFRYPGACAYEYLYADTSSSKGVFAYESATV-----DGVRIDK 173
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
+ FGCG++N G F + G++GLG G +S SQ+
Sbjct: 174 VAFGCGSDNQGSF-AAAGGVLGLGQGPLSFGSQV 206
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 42/129 (32%), Positives = 65/129 (50%), Gaps = 9/129 (6%)
Query: 253 VIDSGTTLTF-LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIH 309
+ DSGTTLT+ P Y S++L+ S + P A+ L+LC + Q P TI
Sbjct: 291 IFDSGTTLTYWFPSAY-SHILAAFDSGVH-YPRAESVQGLDLCVELTGVDQPSFPSFTIE 348
Query: 310 FR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIY---GNIMQTNFLVGYDIEQQTVS 365
F GA + N+FV V+ ++ C G+ + + + GN++Q NF V YD E+ +
Sbjct: 349 FDDGAVFQPEAENYFVDVAPNVRCLAMAGLASPLGGFNTIGNLLQQNFFVQYDREENLIG 408
Query: 366 FKPTDCTKQ 374
F P C+
Sbjct: 409 FAPAKCSSH 417
>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
Length = 511
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 60/160 (37%), Positives = 87/160 (54%), Gaps = 12/160 (7%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
Y + + +GTP E + + DTGSD+ W QC PC C P F+P+ SS++ LPC+
Sbjct: 138 EYYVPLQVGTPAVEVVLIMDTGSDVSWIQCVPC--KDCVPALRPPFNPRHSSSFFKLPCA 195
Query: 150 SSQCASLNQK-----SCSGVNCQYSVSYGDGSFSNGNLATETVTLGST----TGQAVALP 200
SS C ++ Q S SG C +S+ YGDGS S+G LA ET+ G+T G+ V L
Sbjct: 196 SSTCTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIA-GNTPNFGDGEPVKLS 254
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
IT GC + + +G++G+ IS SQ+ + A
Sbjct: 255 NITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYA 294
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 61/136 (44%), Gaps = 22/136 (16%)
Query: 253 VIDSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ------ 302
+IDSGT T+L Q L+ S + + D CY+ S +
Sbjct: 378 IIDSGTAFTYLKKPAFQAMRREFLARTSHLAKV----DDNSGFTPCYNITSGTAALESTI 433
Query: 303 VPEVTIHFRGA-DVKLSRSNFFVKVS----EDIVCSVFKGITNSVP--IYGNIMQTNFLV 355
+P +T+HFRG DV L +++ + VS + +C F ++ +P I GN Q N V
Sbjct: 434 LPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFL-MSGDIPFNIIGNYQQQNLWV 492
Query: 356 GYDIEQQTVSFKPTDC 371
YD+E+ + P C
Sbjct: 493 EYDLEKLRLGIAPAQC 508
>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
Length = 491
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 57/132 (43%), Positives = 74/132 (56%), Gaps = 8/132 (6%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS-QCYMQDSPLFDPKMSSTYKS 145
+ +++ + +GTP + DTGSDL W QC+PC S C+ Q PLFDP SSTY +
Sbjct: 145 DTLEFVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAA 204
Query: 146 LPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
+ C QCA+ CS N C Y V YGDGS + G L+ +T+ L S+ AL G
Sbjct: 205 VHCGEPQCAAAGGL-CSEDNTTCLYLVHYGDGSSTTGVLSRDTLALTSSR----ALAGFP 259
Query: 204 FGCGTNNGGLFN 215
FGCGT N G F
Sbjct: 260 FGCGTRNLGDFG 271
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 37/125 (29%), Positives = 56/125 (44%), Gaps = 6/125 (4%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRG 312
++DSGT LT+LP L +E A P L+ CY F S+V + FR
Sbjct: 367 LLDSGTVLTYLPAQAYELLRDRFRLTMERYTPAPPNDVLDACYDFAGESEVIVPAVSFRF 426
Query: 313 ADVKLSRSNFF---VKVSEDIVCSVFKGI-TNSVP--IYGNIMQTNFLVGYDIEQQTVSF 366
D + +FF + + E++ C F + +P I GN Q + V YD+ + + F
Sbjct: 427 GDGAVFELDFFGVMIFLDENVGCLAFAAMDAGGLPLSIIGNTQQRSAEVIYDVAAEKIGF 486
Query: 367 KPTDC 371
P C
Sbjct: 487 VPASC 491
>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
Length = 510
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 60/160 (37%), Positives = 87/160 (54%), Gaps = 12/160 (7%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
Y + + +GTP E + + DTGSD+ W QC PC C P F+P+ SS++ LPC+
Sbjct: 137 EYYVPLQLGTPAVEVVLIMDTGSDVSWIQCVPC--KDCVPALRPPFNPRHSSSFFKLPCA 194
Query: 150 SSQCASLNQK-----SCSGVNCQYSVSYGDGSFSNGNLATETVTLGST----TGQAVALP 200
SS C ++ Q S SG C +S+ YGDGS S+G LA ET+ G+T G+ V L
Sbjct: 195 SSTCTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIA-GNTPNFGDGEPVKLS 253
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
IT GC + + +G++G+ IS SQ+ + A
Sbjct: 254 NITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYA 293
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 39/137 (28%), Positives = 62/137 (45%), Gaps = 22/137 (16%)
Query: 253 VIDSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ------ 302
+IDSGT T+L Q L+ S + + D CY+ S +
Sbjct: 377 IIDSGTAFTYLKKPAFQAMRREFLARTSHLAKV----DDNSGFTPCYNITSGTAALESTI 432
Query: 303 VPEVTIHFRGA-DVKLSRSNFFVKVS----EDIVCSVFKGITNSVP--IYGNIMQTNFLV 355
+P +T+HFRG DV L +++ + VS + +C F+ ++ +P I GN Q N V
Sbjct: 433 LPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFQ-MSGDIPFNIIGNYQQQNLWV 491
Query: 356 GYDIEQQTVSFKPTDCT 372
YD+E+ + P C
Sbjct: 492 EYDLEKLRLGIAPAQCA 508
>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
Length = 449
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 109/450 (24%), Positives = 172/450 (38%), Gaps = 111/450 (24%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRDAL----TRSLNRLNHFNQNSSISSSKASQ----- 81
+ELIHR SP+ +T QRL++ + R L L H + I KA +
Sbjct: 3 LELIHRHSPQ--VMGRPKTQLQRLKELVHSDSVRQLMIL-HKLRGGQIPRRKAKEVLSSS 59
Query: 82 -------ADIIPNN-------ANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQ 126
A +P + Y + +GTP + + VADTGSDL W C+ C
Sbjct: 60 SGRGSDDAIEVPMHPAADYGIGQYFVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRN 119
Query: 127 C------YMQDSPLFDPKMSSTYKSLPCSSSQCA-------SLNQKSCSGVNCQYSVSYG 173
C ++ +F +SS++K++PC + C SL C Y Y
Sbjct: 120 CSNRKARRIRHKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYS 179
Query: 174 DGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLG-------- 225
DGS + G A ETVT+ G+ + L + GC + G G++GLG
Sbjct: 180 DGSTALGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAI 239
Query: 226 ------GGDIS-----------------------------------LISQMRTTIAGNQR 244
GG S L+ M +
Sbjct: 240 KAAEKFGGKFSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNM 299
Query: 245 LGVS-------TPDIVID----SGTTL------TFLPQ-GYNSNLLSVMSSMIEAQPVAD 286
+G+S P V D GT L TFL + Y + ++ S+++ + V
Sbjct: 300 MGISIGGAMLKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEM 359
Query: 287 PTGSLELCYSFNSLSQ--VPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITN-SV 342
G LE C++ + VP + HF GA+ + ++ + ++ + C F +
Sbjct: 360 DIGPLEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGT 419
Query: 343 PIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
+ GNIMQ N L +D+ + + F P+ CT
Sbjct: 420 SVVGNIMQQNHLWEFDLGLKKLGFAPSSCT 449
>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 477
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 52/129 (40%), Positives = 71/129 (55%), Gaps = 6/129 (4%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+ +++ + GTP + DTGSDL W QC+PC CY Q P FDP SS+Y ++
Sbjct: 133 DTLEFVVVVGFGTPAQTAAIILDTGSDLSWIQCKPC-SGHCYRQHDPDFDPAKSSSYAAV 191
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
PC + CA+ C+G C Y V YGDGS + G L+ +T+T S++ G TFGC
Sbjct: 192 PCGTPVCAAAGGM-CNGTTCLYGVQYGDGSSTTGVLSRDTLTFNSSS----KFTGFTFGC 246
Query: 207 GTNNGGLFN 215
G N G F
Sbjct: 247 GEKNIGDFG 255
Score = 45.4 bits (106), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 34/127 (26%), Positives = 54/127 (42%), Gaps = 8/127 (6%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRG 312
++DSGT LT+LP ++L ++ A P L+ CY F + + F
Sbjct: 351 LLDSGTILTYLPPPAYTSLRDRFKFTMQGNKPAPPYEPLDTCYDFTGQGAIVIPAVSFNF 410
Query: 313 ADVKLSRSNFF-VKVSED-----IVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTV 364
+D + +F+ + + D I C F ++P I GN Q V YD+ Q +
Sbjct: 411 SDGAVFDLDFYGIMIFPDDAKPLIGCLAFVSRPAAMPFSIVGNTQQRAAEVIYDVPSQKI 470
Query: 365 SFKPTDC 371
F P C
Sbjct: 471 GFIPISC 477
>gi|125553832|gb|EAY99437.1| hypothetical protein OsI_21406 [Oryza sativa Indica Group]
Length = 409
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 51/141 (36%), Positives = 79/141 (56%), Gaps = 8/141 (5%)
Query: 98 GTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN 157
GT + + D+GSD+ W QC+PCP C+ Q PLFDP S+TY ++PCSS+ CA L
Sbjct: 75 GTSAVSQTVIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYAAVPCSSAACARLG 134
Query: 158 --QKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG-TNNGGL 213
++ C + CQ+ ++Y +G+ + G +++ +TLG + G FGC + G
Sbjct: 135 PYRRGCLANSQCQFGITYANGATATGTYSSDDLTLGPYD----VVRGFLFGCAHADQGST 190
Query: 214 FNSKTTGIVGLGGGDISLISQ 234
F+ G + LGGG S + Q
Sbjct: 191 FSYDVAGTLALGGGSQSFVQQ 211
Score = 40.0 bits (92), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 33/125 (26%), Positives = 58/125 (46%), Gaps = 12/125 (9%)
Query: 253 VIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
VIDS T ++ +P Y + + S+M +P A P L+ CY F+ + + P + +
Sbjct: 291 VIDSATVISRIPPTAYQALRAAFRSAMTMYRP-APPVSILDTCYDFSGVRSITLPSIALV 349
Query: 310 FRG-ADVKLSRSNFFVKVSEDIVCSVFK-GITNSVPIY-GNIMQTNFLVGYDIEQQTVSF 366
F G A V L + ++ C F ++ +P + GN+ Q V YD+ + + F
Sbjct: 350 FDGGATVNLDAAGILLQG-----CLAFAPTASDRMPGFIGNVQQRTLEVVYDVPGKAIRF 404
Query: 367 KPTDC 371
+ C
Sbjct: 405 RSAAC 409
>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 486
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 57/132 (43%), Positives = 74/132 (56%), Gaps = 8/132 (6%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS-QCYMQDSPLFDPKMSSTYKS 145
+ +++ + +GTP + DTGSDL W QC+PC S C+ Q PLFDP SSTY +
Sbjct: 140 DTLEFVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAA 199
Query: 146 LPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
+ C QCA+ CS N C Y V YGDGS + G L+ +T+ L S+ AL G
Sbjct: 200 VHCGEPQCAAAGDL-CSEDNTTCLYLVRYGDGSSTTGVLSRDTLALTSSR----ALTGFP 254
Query: 204 FGCGTNNGGLFN 215
FGCGT N G F
Sbjct: 255 FGCGTRNLGDFG 266
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 38/125 (30%), Positives = 58/125 (46%), Gaps = 6/125 (4%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRG 312
++DSGT LT+LP + L +E A P L+ CY F S+V + FR
Sbjct: 362 LLDSGTVLTYLPAQAYALLRDRFRLTMERYTPAPPNDVLDACYDFAGESEVVVPAVSFRF 421
Query: 313 ADVKLSRSNFF---VKVSEDIVCSVFKGI-TNSVP--IYGNIMQTNFLVGYDIEQQTVSF 366
D + +FF + + E++ C F + T +P I GN Q + V YD+ + + F
Sbjct: 422 GDGAVFELDFFGVMIFLDENVGCLAFAAMDTGGLPLSIIGNTQQRSAEVIYDVAAEKIGF 481
Query: 367 KPTDC 371
P C
Sbjct: 482 VPASC 486
>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 459
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 111/406 (27%), Positives = 170/406 (41%), Gaps = 100/406 (24%)
Query: 51 YQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADT 110
Y+ LR+ R L R+ + + S D Y RI +GTPP + DT
Sbjct: 13 YRTLREHDQRRLRRIL-----PEVVAFPISGDDDTFTTGLYYTRIYLGTPPQQFYVHVDT 67
Query: 111 GSDLIWTQCEPCPPSQCYMQDS-----PLFDPKMSSTYKSLPCSSSQCASLNQKSCS--G 163
GSD+ W C PC + C + +FDP+ S++ S+ C+ +C + CS
Sbjct: 68 GSDVAWVNCVPC--TNCKRASNVALPISIFDPEKSTSKTSISCTDEECYLASNSKCSFNS 125
Query: 164 VNCQYSVSYGDGSFSNGNLATETVTLGST-TGQAVALPG---ITFGCGTNNGGLFNSKTT 219
++C YS YGDGS + G L + ++ +G + A G +TFGCG+N G + T
Sbjct: 126 MSCPYSTLYGDGSSTAGYLINDVLSFNQVPSGNSTATSGTARLTFGCGSNQTGTW--LTD 183
Query: 220 GIVGLGGGDISLISQMR---------------------TTIAGNQR-------------- 244
G+VG G ++SL SQ+ T + G+ R
Sbjct: 184 GLVGFGQAEVSLPSQLSKQNVSVNIFAHCLQGDNKGSGTLVIGHIREPGLVYTPIVPKQS 243
Query: 245 --------LGVS-----TP---------DIVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEA 281
+GVS TP +++DSGTTLT+L Q Y+ V M
Sbjct: 244 HYNVELLNIGVSGTNVTTPTAFDLSNSGGVIMDSGTTLTYLVQPAYDQFQAKVRDCM--- 300
Query: 282 QPVADPTGSLELCYSFNSLSQ--VPEVTIHFR-GADVKLSRSNFFVK--VSEDIVCSVFK 336
+G L + + F + P VT++F GA + LS S++ K ++ + F
Sbjct: 301 -----RSGVLPVAFQFFCTIEGYFPNVTLYFAGGAAMLLSPSSYLYKEMLTTGLSAYCFS 355
Query: 337 GITNSVPIYGNIMQTNF--------LVGYDIEQQTVSFKPTDCTKQ 374
+ S +YG + T F LV YD + +K DCTK+
Sbjct: 356 WL-ESTSVYGYLSYTIFGDNVLKDQLVVYDNVNNRIGWKNFDCTKE 400
>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 488
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 95/391 (24%), Positives = 158/391 (40%), Gaps = 86/391 (21%)
Query: 67 HFNQNSSISSSKASQADI------IPNNAN-YLIRISIGTPPTERLAVADTGSDLIWTQ- 118
H +S+ + AD+ +P + Y I IGTPP + DTGSD++W
Sbjct: 52 HLTHDSNRRGRLLAAADVPLGGLGLPTDTGLYYTEIEIGTPPKQYHVQVDTGSDILWVNC 111
Query: 119 --CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSG----VNCQYSVSY 172
C CP D L+DPK SS+ ++ C CA+ G + C+YSV Y
Sbjct: 112 ISCNKCPRKSDLGIDLRLYDPKGSSSGSTVSCDQKFCAATYGGKLPGCAKNIPCEYSVMY 171
Query: 173 GDGSFSNGNLATETVTLGSTTGQAV---ALPGITFGCGTNNGGLF---NSKTTGIVGLGG 226
GDGS + G ++++ +G A + FGCG GG N GI+G G
Sbjct: 172 GDGSSTTGYFVSDSLQYNQVSGDGQTRHANASVIFGCGAQQGGDLGSTNQALDGIIGFGQ 231
Query: 227 GDISLISQMRT-------------TIAGNQRLGVS------------TPDI--------- 252
+ S++SQ+ TI G + PD+
Sbjct: 232 SNTSMLSQLAAAGEVKKIFSHCLDTIKGGGIFAIGDVVQPKVKSTPLVPDMPHYNVNLES 291
Query: 253 -----------------------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTG 289
+IDSGTTLT+LP+ ++L+ + +
Sbjct: 292 INVGGTTLQLPSHMFETGEKKGTIIDSGTTLTYLPELVYKDVLAAVFAKHPDTTFHSVQD 351
Query: 290 SLELCYSFNSLSQVPEVTIHFRGADVKLSR--SNFFVKVSEDIVCSVFK--GIT----NS 341
L + Y + P++T HF D+ L+ ++F + +++ C F+ G+
Sbjct: 352 FLCIQYFQSVDDGFPKITFHFE-DDLGLNVYPHDYFFQNGDNLYCFGFQNGGLQSKDGKD 410
Query: 342 VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
+ + G+++ +N +V YD+E Q V + +C+
Sbjct: 411 MVLLGDLVLSNKVVVYDLENQVVGWTDYNCS 441
>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
Length = 452
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 60/179 (33%), Positives = 86/179 (48%), Gaps = 9/179 (5%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKA-SQADIIPNN 88
S LIH S SPF + T + + + NRL + S S A + + +
Sbjct: 53 SFPLIHIYSECSPFRPPNRTWESLMSEKIRGDANRLRFLKRTSRSSKQDANANVPVRSGS 112
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y+I++ GTP + DTGSD+ W C+ C Q +P+FDP SS+YK C
Sbjct: 113 GEYIIQVDFGTPKQSMYTLIDTGSDVAWIPCKQC---QGCHSTAPIFDPAKSSSYKPFAC 169
Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
S C ++ CQ+ VSYGDG+ +G LA++ +TLGS LP +FGC
Sbjct: 170 DSQPCQEISGNCGGNSKCQFEVSYGDGTQVDGTLASDAITLGSQ-----YLPNFSFGCA 223
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/146 (29%), Positives = 67/146 (45%), Gaps = 12/146 (8%)
Query: 235 MRTTIAGNQRLGVSTPDI------VIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADP 287
++ GN R+ V +I +IDSGTT+T L P Y + + + QP P
Sbjct: 306 LKAISVGNTRISVPGTNIASGGGTIIDSGTTITHLVPSAYTALRDAFRQQLSSLQPT--P 363
Query: 288 TGSLELCYSFNSLS-QVPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIY 345
++ CY +S S VP +T+H R D+ L + N + + C F T+S I
Sbjct: 364 VEDMDTCYDLSSSSVDVPTITLHLDRNVDLVLPKENILITQESGLACLAFSS-TDSRSII 422
Query: 346 GNIMQTNFLVGYDIEQQTVSFKPTDC 371
GN+ Q N+ + +D+ V F C
Sbjct: 423 GNVQQQNWRIVFDVPNSQVGFAQEQC 448
>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 56/154 (36%), Positives = 80/154 (51%), Gaps = 14/154 (9%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y + +GTPP + + D+GSDL+W QC PC QCY QD+PL+ P SST+ +P
Sbjct: 62 SGQYFVDFFLGTPPQKFSLIVDSGSDLLWVQCAPC--LQCYAQDTPLYAPSNSSTFNPVP 119
Query: 148 CSSSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
C S +C + + C Y Y D S S G A E+ T+ V +
Sbjct: 120 CLSPECLLIPATEGFPCDFHYPGACAYEYRYADTSLSKGVFAYESATVDD-----VRIDK 174
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
+ FGCG +N G F + G++GLG G +S SQ+
Sbjct: 175 VAFGCGRDNQGSF-AAAGGVLGLGQGPLSFGSQV 207
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 43/129 (33%), Positives = 65/129 (50%), Gaps = 9/129 (6%)
Query: 253 VIDSGTTLTF-LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIH 309
+ DSGTT+T+ LP Y N+L+ + P A L+LC + Q P TI
Sbjct: 292 IFDSGTTVTYWLPPAYR-NILAAFDKNVR-YPRAASVQGLDLCVDVTGVDQPSFPSFTIV 349
Query: 310 FRGADV-KLSRSNFFVKVSEDIVCSVFKGITNSVPIY---GNIMQTNFLVGYDIEQQTVS 365
G V + + N+FV V+ ++ C G+ +SV + GN++Q NFLV YD E+ +
Sbjct: 350 LGGGAVFQPQQGNYFVDVAPNVQCLAMAGLPSSVGGFNTIGNLLQQNFLVQYDREENRIG 409
Query: 366 FKPTDCTKQ 374
F P C+
Sbjct: 410 FAPAKCSSH 418
>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 641
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 97/375 (25%), Positives = 158/375 (42%), Gaps = 80/375 (21%)
Query: 70 QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM 129
NS + ++ D + +N Y R+ IGTPP E + DTGS + + C C QC
Sbjct: 67 HNSDLPNAHMRLYDDLLSNGYYTTRLFIGTPPQEFALIVDTGSTVTYVPCSTC--EQCGK 124
Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTL 189
P F P+ SSTYK + C+ S C ++ G C Y Y + S S+G LA + ++
Sbjct: 125 HQDPRFQPESSSTYKPMQCNPS-CNCDDE----GKQCTYERRYAEMSSSSGLLAEDVLSF 179
Query: 190 GSTTGQAVALPGITFGCGT-NNGGLFNSKTTGIVGLGGGDISLISQM------------- 235
G+ + + FGC T G LF+ + GI+GLG G +S++ Q+
Sbjct: 180 GNES--ELTPQRAIFGCETVETGELFSQRADGIMGLGRGPLSVVDQLVIKEVVGNSFSLC 237
Query: 236 ---RTTIAGNQRLG--VSTPDIV------------------------------------- 253
+ G LG PD+V
Sbjct: 238 YGGMDVVGGAMVLGNIPPPPDMVFAHSDPYRSAYYNIELKELHVAGKRLKLNPRVFDGKH 297
Query: 254 ---IDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS-----FNSLSQV 303
+DSGTT +LP + + + +++ + + + P S ++C+S + LS++
Sbjct: 298 GTVLDSGTTYAYLPEEAFVAFKDAIIKEIKFLKQIHGPDPSYNDICFSGAGRDVSQLSKI 357
Query: 304 -PEVTIHF-RGADVKLSRSNFF---VKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYD 358
PEV + F G + LS N+ KVS +F+ + + G I+ N LV YD
Sbjct: 358 FPEVNMVFGNGQKLSLSPENYLFRHTKVSGAYCLGIFQNGKDPTTLLGGIVVRNTLVTYD 417
Query: 359 IEQQTVSFKPTDCTK 373
+ + F T+C++
Sbjct: 418 RDNDKIGFWKTNCSE 432
>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
gi|224030089|gb|ACN34120.1| unknown [Zea mays]
gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
Length = 491
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 95/360 (26%), Positives = 144/360 (40%), Gaps = 79/360 (21%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y I +GTPP DTGSD++W CE CP D L+DPK SST +
Sbjct: 86 YYTEIKLGTPPKHYYVQVDTGSDILWVNCITCEQCPHKSGLGLDLTLYDPKASSTGSMVM 145
Query: 148 CSSSQCASLNQ----KSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
C + CA+ K + V C+YSV+YGDGS + G+ T+ + T P
Sbjct: 146 CDQAFCAATFGGKLPKCGANVPCEYSVTYGDGSSTIGSFVTDALQFDQVTRDGQTQPANA 205
Query: 202 -ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRT-------------TIAGNQR 244
+ FGCG GG N GI+G G + S++SQ+ T TI G
Sbjct: 206 SVIFGCGAQQGGDLGSSNQALDGILGFGEANTSMLSQLTTAGKVKKIFAHCLDTIKGGGI 265
Query: 245 LGV---------STPDIV-----------IDSG------------------------TTL 260
+ +TP + ID G TTL
Sbjct: 266 FSIGDVVQPKVKTTPLVADKPHYNVNLKTIDVGGTTLQLPAHIFEPGEKKGTIIDSGTTL 325
Query: 261 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSR- 319
T+LP+ ++ + + + D G L Y + P +T HF D+ L
Sbjct: 326 TYLPELVFKEVMLAVFNKHQDITFHDVQGFLCFQYPGSVDDGFPTITFHFE-DDLALHVY 384
Query: 320 -SNFFVKVSEDIVCSVFK-GITNS-----VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
+F D+ C F+ G + S + + G+++ +N LV YD+E + + + +C+
Sbjct: 385 PHEYFFANGNDVYCVGFQNGASQSKDGKDIVLMGDLVLSNKLVIYDLENRVIGWTDYNCS 444
>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
gi|224030447|gb|ACN34299.1| unknown [Zea mays]
gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
Length = 512
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 62/172 (36%), Positives = 89/172 (51%), Gaps = 13/172 (7%)
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
++ + +A YL+ + +GTPP + DTGSDL W QC PC C+ Q P+FDP S
Sbjct: 136 ESGVAVGSAEYLMDVYVGTPPRRFQMIMDTGSDLNWLQCAPC--LDCFEQRGPVFDPAAS 193
Query: 141 STYKSLPCSSSQCASLNQKSCSGVN---------CQYSVSYGDGSFSNGNLATETVTLGS 191
S+Y++L C +C + C Y YGD S S G+LA E+ T+
Sbjct: 194 SSYRNLTCGDPRCGHVAPPEAPAPRACRRPGEDPCPYYYWYGDQSNSTGDLALESFTVNL 253
Query: 192 TT-GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGN 242
T G + + G+ FGCG N GLF+ ++GLG G +S SQ+R G+
Sbjct: 254 TAPGASSRVDGVVFGCGHRNRGLFHGAAG-LLGLGRGPLSFASQLRAVYGGH 304
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 38/127 (29%), Positives = 61/127 (48%), Gaps = 6/127 (4%)
Query: 253 VIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
+IDSGTTL+ F+ Y + + M + P L CY+ + + +VPE+++
Sbjct: 385 IIDSGTTLSYFVEPAYQVIRRAFIDRMSGSYPPVPDFPVLSPCYNVSGVERPEVPELSLL 444
Query: 310 FR-GADVKLSRSNFFVKVSED-IVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQTVSF 366
F GA N+F+++ D I+C G + + I GN Q NF V YD+ + F
Sbjct: 445 FADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTGMSIIGNFQQQNFHVAYDLHNNRLGF 504
Query: 367 KPTDCTK 373
P C +
Sbjct: 505 APRRCAE 511
>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
Length = 510
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 60/203 (29%), Positives = 100/203 (49%), Gaps = 30/203 (14%)
Query: 23 EAQTGGFSVELIHRDSPKSPFYN--SSETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS 80
+A+ +E +HR + +S +S +P + L + + ++
Sbjct: 97 KAEKDAVRIETMHRRAARSGVARMPASSSPRRALSERMVATV------------------ 138
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
++ + + YLI + +GTPP + DTGSDL W QC PC C+ Q P+FDP S
Sbjct: 139 ESGVAVGSGEYLIDVYVGTPPRRFRMIMDTGSDLNWLQCAPC--LDCFEQRGPVFDPAAS 196
Query: 141 STYKSLPCSSSQCASL----NQKSC---SGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
S+Y+++ C +C + ++C + +C Y YGD S + G+LA E+ T+ T
Sbjct: 197 SSYRNVTCGDQRCGLVAPPEAPRACRRPAEDSCPYYYWYGDQSNTTGDLALESFTVNLTA 256
Query: 194 -GQAVALPGITFGCGTNNGGLFN 215
G + + G+ FGCG N GLF+
Sbjct: 257 PGASRRVDGVVFGCGHRNRGLFH 279
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 51/173 (29%), Positives = 82/173 (47%), Gaps = 25/173 (14%)
Query: 213 LFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLT-FLPQGYNS-- 269
+ K G+ L GGD+ IS + + G +IDSGTTL+ F+ Y
Sbjct: 350 FYYVKLKGV--LVGGDLLNISSDTWDVGKDGSGGT-----IIDSGTTLSYFVEPAYQVIR 402
Query: 270 ----NLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-GADVKLSRSNF 322
+L+S + +I PV +P CY+ + + +VPE+++ F GA N+
Sbjct: 403 QAFVDLMSRLYPLIPDFPVLNP------CYNVSGVERPEVPELSLLFADGAVWDFPAENY 456
Query: 323 FVKVSED-IVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
FV++ D I+C +G + + I GN Q NF V YD++ + F P C +
Sbjct: 457 FVRLDPDGIMCLAVRGTPRTGMSIIGNFQQQNFHVVYDLQNNRLGFAPRRCAE 509
>gi|255563741|ref|XP_002522872.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223537956|gb|EEF39570.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 448
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 63/185 (34%), Positives = 97/185 (52%), Gaps = 9/185 (4%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFN-QNSSISSSKASQADIIP 86
GFS+E++HR S +SPFY + T Y+R+ + S R ++ SS S +A + I
Sbjct: 27 GFSLEIVHRYSRESPFYPGNITDYERITRLVELSKIRAHNLAITTSSGFSPEAFRLRISQ 86
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
++ YL+++ IG+P V DTGS L WTQCEPC ++ + Q P+F+ S TY+ L
Sbjct: 87 DDTCYLVKVIIGSPGVPLYLVPDTGSGLFWTQCEPC--TRRFRQLPPIFNSTASRTYRDL 144
Query: 147 PCSSSQCA-SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
PC C + N C C Y ++Y GS + G A + + + + +P FG
Sbjct: 145 PCQHQFCTNNQNVFQCRDDKCVYRIAYAGGSATAGVAAQDIL----QSAENDRIP-FYFG 199
Query: 206 CGTNN 210
C +N
Sbjct: 200 CSRDN 204
>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
Length = 489
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 94/362 (25%), Positives = 144/362 (39%), Gaps = 83/362 (22%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y I +GTPP DTGSD++W CE CP D +DPK SS+ ++
Sbjct: 84 YFTEIKLGTPPKRYYVQVDTGSDILWVNCISCEKCPRKSGLGLDLTFYDPKASSSGSTVS 143
Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
C CA+ G V C+YSV YGDGS + G T+ + TG PG
Sbjct: 144 CDQGFCAATYGGKLPGCTANVPCEYSVMYGDGSSTTGFFVTDALQFDQVTGDGQTQPGNA 203
Query: 202 -ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRT-------------TIAGNQR 244
+TFGCG GG N GI+G G + S++SQ+ TI G
Sbjct: 204 TVTFGCGAQQGGDLGSSNQALDGILGFGQANTSMLSQLAAAGKVKKIFAHCLDTIKGGGI 263
Query: 245 LGV---------STPDIV-----------IDSG------------------------TTL 260
+ +TP + ID G TTL
Sbjct: 264 FAIGNVVQPKVKTTPLVADMPHYNVNLKSIDVGGTTLQLPAHVFETGERKGTIIDSGTTL 323
Query: 261 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLSQVPEVTIHFRGADVKLS 318
T+LP+ +++ + + + Q + +C+ + + P +T HF D+ L
Sbjct: 324 TYLPELVFKEVMAAIFN--KHQDIVFHNVQDFMCFQYPGSVDDGFPTITFHFE-DDLALH 380
Query: 319 R--SNFFVKVSEDIVCSVFK-GITNS-----VPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
+F D+ C F+ G S + + G+++ +N LV YD+E Q + + +
Sbjct: 381 VYPHEYFFPNGNDMYCVGFQNGALQSKDGKDIVLMGDLVLSNKLVIYDLENQVIGWTDYN 440
Query: 371 CT 372
C+
Sbjct: 441 CS 442
>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 459
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 60/167 (35%), Positives = 82/167 (49%), Gaps = 27/167 (16%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC------PPSQCYMQDSPLFDPKMSS 141
+ Y + I +GTPP L VADTGSDL+W +C C PPS ++ P+ SS
Sbjct: 85 SGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFL-------PRHSS 137
Query: 142 TYKSLPCSSSQCASLNQKSCSGVN-------CQYSVSYGDGSFSNGNLATETVTLGSTTG 194
++ C C L N C++ SY DGS S+G + ET TL S +G
Sbjct: 138 SFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTTLKSLSG 197
Query: 195 QAVALPGITFGCGTN------NGGLFNSKTTGIVGLGGGDISLISQM 235
+ L G++FGCG +G FN G++GLG G IS SQ+
Sbjct: 198 SEIHLKGLSFGCGFRISGPSVSGAQFNG-ARGVMGLGRGSISFSSQL 243
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 35/125 (28%), Positives = 62/125 (49%), Gaps = 6/125 (4%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEV-TIHFR 311
V+DSGTTLT+L + +L + ++ A+ T +LC + + S+ P + + FR
Sbjct: 332 VVDSGTTLTYLTKTAYEEVLKSVRRRVKLPNAAELTPGFDLCVNASGESRRPSLPRLRFR 391
Query: 312 ---GADVKLSRSNFFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTVSF 366
GA N+F++ E ++C + + N + GN+MQ FL+ +D E+ + F
Sbjct: 392 LGGGAVFAPPPRNYFLETEEGVMCLAIRAVESGNGFSVIGNLMQQGFLLEFDKEESRLGF 451
Query: 367 KPTDC 371
C
Sbjct: 452 TRRGC 456
>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 449
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 109/450 (24%), Positives = 172/450 (38%), Gaps = 111/450 (24%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRDAL----TRSLNRLNHFNQNSSISSSKASQ----- 81
+ELIHR SP+ +T QRL++ + R L L H + I KA +
Sbjct: 3 LELIHRHSPQ--VMGRPKTQLQRLKELVHSDSVRQLMIL-HKLRGGQIPRRKAKEVLSSS 59
Query: 82 -------ADIIPNN-------ANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQ 126
A +P + Y + +GTP + + VADTGSDL W C+ C
Sbjct: 60 SGRGSDDAIEVPMHPAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRN 119
Query: 127 C------YMQDSPLFDPKMSSTYKSLPCSSSQCA-------SLNQKSCSGVNCQYSVSYG 173
C ++ +F +SS++K++PC + C SL C Y Y
Sbjct: 120 CSNRKARRIRHKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYS 179
Query: 174 DGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLG-------- 225
DGS + G A ETVT+ G+ + L + GC + G G++GLG
Sbjct: 180 DGSTALGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAI 239
Query: 226 ------GGDIS-----------------------------------LISQMRTTIAGNQR 244
GG S L+ M +
Sbjct: 240 KAAEKFGGKFSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNM 299
Query: 245 LGVS-------TPDIVID----SGTTL------TFLPQ-GYNSNLLSVMSSMIEAQPVAD 286
+G+S P V D GT L TFL + Y + ++ S+++ + V
Sbjct: 300 MGISIGGAMLKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEM 359
Query: 287 PTGSLELCYSFNSLSQ--VPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITN-SV 342
G LE C++ + VP + HF GA+ + ++ + ++ + C F +
Sbjct: 360 DIGPLEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGT 419
Query: 343 PIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
+ GNIMQ N L +D+ + + F P+ CT
Sbjct: 420 SVVGNIMQQNHLWEFDLGLKKLGFAPSSCT 449
>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 461
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 110/451 (24%), Positives = 173/451 (38%), Gaps = 96/451 (21%)
Query: 3 TFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDS--PKSPFYNSSETPYQRLRDALTR 60
T LSC+ L + + ++L HRD+ PK P R+ D +
Sbjct: 26 TLLSCLITTLLL---ITVADSMKDTSVRLKLAHRDTLLPK---------PLSRIEDVIGA 73
Query: 61 SLNR--LNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
R L +NS++ + I A Y I +GTP + V DTGS+L W
Sbjct: 74 DQKRHSLISRKRNSTVGVKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVN 133
Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCA-------SLNQKSCSGVNCQYSVS 171
C + + +F S ++K++ C + C SL C Y
Sbjct: 134 CRYRARGK---DNRRVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYR 190
Query: 172 YGDGSFSNGNLATETVTL-----------GSTTGQAVALPGITFGCGTNNGGL------F 214
Y DGS + G A ET+T+ G G + + G +F GL F
Sbjct: 191 YADGSAAQGVFAKETITVGLTNGRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSF 250
Query: 215 NSKTTGIVG----------LGGGDIS---LISQMRTTIAGNQR----------------- 244
S T + G L ++S + R+T +R
Sbjct: 251 TSTATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRSTKTAFRRTTPLDLTRIPPFYAINV 310
Query: 245 ----LGVSTPDI-------------VIDSGTTLTFLPQG-YNSNLLSVMSSMIEAQPVAD 286
LG DI ++DSGT+LT L Y + + ++E + V
Sbjct: 311 IGISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKP 370
Query: 287 PTGSLELCYSFNS---LSQVPEVTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGI-TNS 341
+E C+SF S +S++P++T H +G A + R ++ V + + C F T +
Sbjct: 371 EGVPIEYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFVSAGTPA 430
Query: 342 VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
+ GNIMQ N+L +D+ T+SF P+ CT
Sbjct: 431 TNVIGNIMQQNYLWEFDLMASTLSFAPSACT 461
>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 543
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 56/159 (35%), Positives = 84/159 (52%), Gaps = 13/159 (8%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y + + +GTPP + DTGSDL W QC+PC C+ Q+ + PK SSTY+++ C
Sbjct: 169 GEYFLDMFVGTPPKHVWLILDTGSDLSWIQCDPC--YDCFEQNGSHYYPKDSSTYRNISC 226
Query: 149 SSSQCASLNQ----KSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGST----TGQAVA 198
+C ++ + C N C Y Y DGS + G+ A+ET T+ T +
Sbjct: 227 YDPRCQLVSSSDPLQHCKAENQTCPYFYDYADGSNTTGDFASETFTVNLTWPNGKEKFKQ 286
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRT 237
+ + FGCG N G F +G++GLG G IS SQ+++
Sbjct: 287 VVDVMFGCGHWNKGFFYG-ASGLLGLGRGPISFPSQIQS 324
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 41/155 (26%), Positives = 74/155 (47%), Gaps = 7/155 (4%)
Query: 226 GGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA 285
GG++ IS+ + + +IDSG+TLTF P + I+ Q +A
Sbjct: 388 GGEVLDISEQTWHWSSEGAAADAGGGTIIDSGSTLTFFPDSAYDIIKEAFEKKIKLQQIA 447
Query: 286 DPTGSLELCYSFN-SLSQV--PEVTIHFRGADV-KLSRSNFFVKVSED-IVC-SVFKGIT 339
+ CY+ + ++ QV P+ IHF V N+F + D ++C ++ K
Sbjct: 448 ADDFVMSPCYNVSGAMMQVELPDFGIHFADGGVWNFPAENYFYQYEPDEVICLAIMKTPN 507
Query: 340 NS-VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
+S + I GN++Q NF + YD+++ + + P C +
Sbjct: 508 HSHLTIIGNLLQQNFHILYDVKRSRLGYSPRRCAE 542
>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 494
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 98/369 (26%), Positives = 151/369 (40%), Gaps = 82/369 (22%)
Query: 85 IPNNAN-YLIRISIGTPPTERLAVADTGSDLIWTQC---EPCPPSQCYMQDSPLFDPKMS 140
IP + Y +I IGTP DTGSD++W C + CP D L+DP S
Sbjct: 82 IPTDTGLYFTQIGIGTPSKGYYVQVDTGSDILWVNCISCDSCPRKSGLGIDLTLYDPTAS 141
Query: 141 STYKSLPCSSSQCASLNQK----SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
++ K++ C CA+ SC+ + CQYS++YGDGS + G + + +G
Sbjct: 142 ASSKTVTCGQEFCATATNGGVPPSCAANSPCQYSITYGDGSSTTGFFVADFLQYDQVSGD 201
Query: 196 A---VALPGITFGCGTNNGGLFNSKTT---GIVGLG------------------------ 225
+A +TFGCG GG S GI+G G
Sbjct: 202 GQTNLANASVTFGCGAKIGGALGSSNVALDGILGFGQANSSMLSQLTSAGKVTKIFSHCL 261
Query: 226 ----GGDISLISQ---------------------MRTTIAGNQRL---------GVSTPD 251
GG I I ++T G L G +
Sbjct: 262 DTVNGGGIFAIGNVVQPKVKTTPLVPGMPHYNVVLKTIDVGGSTLQLPTNIFDIGGGSRG 321
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR 311
+IDSGTTL +LP+ +LS + S + + L YS + + PEVT HF
Sbjct: 322 TIIDSGTTLAYLPEVVYKAVLSAVFSNHPDVTLKNVQDFLCFQYSGSVDNGFPEVTFHFD 381
Query: 312 GADVKLSR--SNFFVKVSEDIVCSVFK--GITNS----VPIYGNIMQTNFLVGYDIEQQT 363
G D+ L ++ + +ED+ C F+ G+ + + + G++ +N LV YD+E Q
Sbjct: 382 G-DLPLVVYPHDYLFQNTEDVYCVGFQSGGVQSKDGKDMVLLGDLALSNKLVVYDLENQV 440
Query: 364 VSFKPTDCT 372
+ + +C+
Sbjct: 441 IGWTNYNCS 449
>gi|357160409|ref|XP_003578755.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 373
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 100/380 (26%), Positives = 155/380 (40%), Gaps = 87/380 (22%)
Query: 70 QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM 129
Q ++I S D I N + + IS+GTP L DTGS + W QC+ C CY
Sbjct: 3 QAANIPDSAVIGDDSIRKN-QFFMGISLGTPAVFNLVTIDTGSTISWVQCQYC-IVHCYT 60
Query: 130 QDS---PLFDPKMSSTYKSLPCSSSQCASLN--QKSCSGV-----NCQYSVSYGDGSFSN 179
QD P F+ SSTY+ + CS+ C ++ Q SG +C YS+ Y G +S
Sbjct: 61 QDQRAGPTFNTSSSSTYRRVGCSAQVCHDMHVSQNIPSGCVEEEDSCIYSLRYASGEYSA 120
Query: 180 GNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT- 238
G L+ + +TL ++ ++ FGCG++N +N + GI+G G S +Q+
Sbjct: 121 GYLSQDRLTLANS----YSIQKFIFGCGSDN--RYNGHSAGIIGFGNKSYSFFNQIAQLT 174
Query: 239 --------IAGNQR------------------------LGVSTP-------DIVIDSGTT 259
NQ G P D++++ G
Sbjct: 175 NYSAFSYCFPSNQENEGFLSIGPYVRDSNKLILTQLFDYGAHLPVYALQQFDMMVN-GMR 233
Query: 260 LTFLPQGYNSNLLSVMSSMIE--------------------AQPVADPTGSLELCYSFN- 298
L P Y + + V S +E A+ + S E+C+ N
Sbjct: 234 LQVDPPVYTTRMTVVDSGTVETFVLSPVFRALDRALTKAMVAEGYVRGSDSKEICFHSNG 293
Query: 299 ---SLSQVPEVTIHFRGADVKLSRSN-FFVKVSEDIVCSVFKGITNSVP---IYGNIMQT 351
S++P V I F + +KL N F+ + S+ +CS F+ VP I GN
Sbjct: 294 DSVDWSKLPVVEIKFSRSILKLPAENVFYYETSDGSICSTFQPDDAGVPGVQILGNRATR 353
Query: 352 NFLVGYDIEQQTVSFKPTDC 371
+F V +DI+Q+ F+ C
Sbjct: 354 SFRVVFDIQQRNFGFEAGAC 373
>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
Length = 434
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 90/358 (25%), Positives = 142/358 (39%), Gaps = 88/358 (24%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ-DSPL--FDPKMSSTYKSLP 147
Y ++ +GTPP DTGSDL+W C PC + P+ +D K S++ +P
Sbjct: 36 YFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPCIGCPAFSDLKIPIVPYDVKASASSSKVP 95
Query: 148 CSSSQCASLNQKSCSGVN----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
CS C + Q S SG N C YS YGDGS + G L + + A +
Sbjct: 96 CSDPSCTLITQISESGCNDQNQCGYSFQYGDGSGTLGYLVEDVLHY-----MVNATATVI 150
Query: 204 FGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTT----------IAGNQRLG---- 246
FGCG G ++ GI+G G D+S SQ+ + G +R G
Sbjct: 151 FGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCLDGGERGGGILV 210
Query: 247 ---VSTPDI-----------------------------------------VIDSGTTLTF 262
V PDI + DSGTTL +
Sbjct: 211 LGNVIEPDIQYTPLVPYMSHYNVVLQSISVNNANLTIDPKLFSNDVMQGTIFDSGTTLAY 270
Query: 263 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNF 322
LP +S ++ + D S + F P V ++F GA + L+ + +
Sbjct: 271 LPDEAYQAFTQAVSLVVAPFLLCDTRLSRFIYKLF------PNVVLYFEGASMTLTPAEY 324
Query: 323 FVK----VSEDIVCSVFKGITNS-----VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
++ + I C ++ + ++ I+G+++ N LV YD+E+ + ++P DC
Sbjct: 325 LIRQASAANAPIWCMGWQSMGSAESELQYTIFGDLVLKNKLVVYDLERGRIGWRPFDC 382
>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
Length = 388
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 90/358 (25%), Positives = 142/358 (39%), Gaps = 88/358 (24%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ-DSPL--FDPKMSSTYKSLP 147
Y ++ +GTPP DTGSDL+W C PC + P+ +D K S++ +P
Sbjct: 36 YFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPCIGCPAFSDLKIPIVPYDVKASASSSKVP 95
Query: 148 CSSSQCASLNQKSCSGVN----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
CS C + Q S SG N C YS YGDGS + G L + + A +
Sbjct: 96 CSDPSCTLITQISESGCNDQNQCGYSFQYGDGSGTLGYLVEDVLHY-----MVNATATVI 150
Query: 204 FGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTT----------IAGNQRLG---- 246
FGCG G ++ GI+G G D+S SQ+ + G +R G
Sbjct: 151 FGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCLDGGERGGGILV 210
Query: 247 ---VSTPDI-----------------------------------------VIDSGTTLTF 262
V PDI + DSGTTL +
Sbjct: 211 LGNVIEPDIQYTPLVPYMYHYNVVLQSISVNNANLTIDPKLFSNDVMQGTIFDSGTTLAY 270
Query: 263 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNF 322
LP +S ++ + D S + F P V ++F GA + L+ + +
Sbjct: 271 LPDEAYQAFTQAVSLVVAPFLLCDTRLSRFIYKLF------PNVVLYFEGASMTLTPAEY 324
Query: 323 FVK----VSEDIVCSVFKGITNS-----VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
++ + I C ++ + ++ I+G+++ N LV YD+E+ + ++P DC
Sbjct: 325 LIRQASAANAPIWCMGWQSMGSAESELQYTIFGDLVLKNKLVVYDLERGRIGWRPFDC 382
>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
Length = 452
Score = 97.8 bits (242), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 59/179 (32%), Positives = 85/179 (47%), Gaps = 9/179 (5%)
Query: 30 SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKA-SQADIIPNN 88
S LIH S SPF + T + + + NRL + S S A + + +
Sbjct: 53 SFPLIHIYSECSPFRPPNRTWESLMSEKIRGDANRLRFLKRTSRSSKEDANANVPVRSGS 112
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y+I++ GTP + DTGSD+ W C+ C Q +P+FDP SS+YK C
Sbjct: 113 GEYIIQVDFGTPKQSMYTLIDTGSDVAWIPCKQC---QGCHSTAPIFDPAKSSSYKPFAC 169
Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
S C ++ CQ+ V YGDG+ +G LA++ +TLGS LP +FGC
Sbjct: 170 DSQPCQEISGNCGGNSKCQFEVLYGDGTQVDGTLASDAITLGSQ-----YLPNFSFGCA 223
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 43/146 (29%), Positives = 67/146 (45%), Gaps = 12/146 (8%)
Query: 235 MRTTIAGNQRLGVSTPDI------VIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADP 287
++ GN R+ V +I +IDSGTT+T+L P Y + + QP P
Sbjct: 306 LKAISVGNTRISVPATNIASGGGTIIDSGTTITYLVPSAYKDLRDAFRQQLSSLQPT--P 363
Query: 288 TGSLELCYSFNSLS-QVPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIY 345
++ CY +S S VP +T+H R D+ L + N + + C F T+S I
Sbjct: 364 VEDMDTCYDLSSSSVDVPTITLHLDRNVDLVLPKENILITQESGLSCLAFSS-TDSRSII 422
Query: 346 GNIMQTNFLVGYDIEQQTVSFKPTDC 371
GN+ Q N+ + +D+ V F C
Sbjct: 423 GNVQQQNWRIVFDVPNSQVGFAQEQC 448
>gi|15235526|ref|NP_193028.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|5123933|emb|CAB45491.1| putative protein [Arabidopsis thaliana]
gi|7267994|emb|CAB78334.1| putative protein [Arabidopsis thaliana]
gi|332657803|gb|AEE83203.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 389
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 85/338 (25%), Positives = 139/338 (41%), Gaps = 65/338 (19%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQD-SPLFDPKMSSTYKSLPCS 149
++ I G+P ++ DTGS L WTQC PC S CY Q P + P S TY+ C
Sbjct: 58 FMAEIHFGSPQKKQFLHMDTGSSLTWTQCFPC--SDCYAQKIYPKYRPAASITYRDAMCE 115
Query: 150 SSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
S S + + C Y Y D + G LA E +T+ + G + G+ FGC
Sbjct: 116 DSHPKSNPHFAFDPLTRICTYQQHYLDETNIKGTLAQEMITVDTHDGGFKRVHGVYFGCN 175
Query: 208 T-NNGGLFNSKTTGIVGLGGGDISLIS--------------------------------- 233
T ++G F TGI+GLG G S+I
Sbjct: 176 TLSDGSYFTG--TGILGLGVGKYSIIGEFGSKFSFCLGEISEPKASHNLILGDGANVQGH 233
Query: 234 -------------QMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIE 280
Q+ + I G + + +D+G+TL+ L + +I
Sbjct: 234 PTVINITEGHTIFQLESIIVGEEITLDDPVQVFVDTGSTLSHLSTNLYYKFVDAFDDLIG 293
Query: 281 AQPVA-DPTGSLELCYSFNSLSQVPEVTIHFR---GADVKLSRSNFFVKVS-EDIVCSVF 335
++P++ +PT LCY +++ ++ ++ + F+ GA++ ++ N F++ +I C
Sbjct: 294 SRPLSYEPT----LCYKADTIERLEKMDVGFKFDVGAELSVNIHNIFIQQGPPEIRCLAI 349
Query: 336 KGITNSVP--IYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+ S I G I + VGYD+ +T DC
Sbjct: 350 QNNKESFSHVIIGVIAMQGYNVGYDLSAKTAYINKQDC 387
>gi|255571584|ref|XP_002526738.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223533927|gb|EEF35652.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 457
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 75/245 (30%), Positives = 116/245 (47%), Gaps = 15/245 (6%)
Query: 4 FLSCVFILFFLCFYVVSP----IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALT 59
F+ C+ L LCF P ++ GF V L+H S +SPFY + T + + ++
Sbjct: 9 FMICIQTL--LCFSSSLPDHVLLKDNRLGFKVPLLHWLSTESPFYEPNLTLAELTQASIR 66
Query: 60 RSLNR---LNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIW 116
S R + + SS K + + + Y+++ SIG+P + A+ D+GS L+W
Sbjct: 67 TSGARGDSIRSIMSGNITSSMKYPISRMSYTDKAYVMKFSIGSPAVDTYAIPDSGSSLVW 126
Query: 117 TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-ASLNQK--SCSGVN--CQYSVS 171
QC CY Q PLF+P S TY C++++C +L + C N C+Y
Sbjct: 127 LQCGTPYCRNCYRQKIPLFNPSKSVTYMKRLCNTAECRVALGDEYWRCKKPNQICKYHED 186
Query: 172 YGDGSFSNGNLATETVTL-GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDIS 230
Y D S++ G ++T+ T +G I FGCG NN + G+VGL S
Sbjct: 187 YLDDSYTEGVISTDIFTFPEHISGFGNYTLRIIFGCGYNNSDPQHFYPPGLVGLTNNKAS 246
Query: 231 LISQM 235
L+ QM
Sbjct: 247 LVGQM 251
>gi|414878073|tpg|DAA55204.1| TPA: hypothetical protein ZEAMMB73_344109 [Zea mays]
Length = 440
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 71/221 (32%), Positives = 109/221 (49%), Gaps = 36/221 (16%)
Query: 25 QTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI 84
+ G +EL H D+ ++ S+E +R+R A R+ RL + S+ SQ
Sbjct: 20 RAAGLRLELTHVDAKQN---CSTE---ERMRRATERTHRRLASMGEASAPVHWAESQ--- 70
Query: 85 IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
Y+ IG PP + A+ DTGS+LIWTQC C P+ C+ Q+ +DP S T +
Sbjct: 71 ------YIAEYLIGDPPQQAEAIIDTGSNLIWTQCSTCQPAGCFSQNLSFYDPSRSRTAR 124
Query: 145 SLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
+ C+ + CA ++ C+ N C +YG G G L TE T + + V+L
Sbjct: 125 PVACNDTACALGSETRCARDNKACAVLTAYGAGVI-GGVLGTEAFTFQPQS-ENVSL--- 179
Query: 203 TFGC--------GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
FGC G+ +G +GI+GLG G++SL+SQ+
Sbjct: 180 AFGCIAATRLTPGSLDG------ASGIIGLGRGNLSLVSQL 214
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 40/135 (29%), Positives = 64/135 (47%), Gaps = 15/135 (11%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSF---NSLSQVPEVT 307
+IDSG+ T L L + + A V P G+ L+LC + + VP +
Sbjct: 304 LIDSGSPFTSLVDVAYQALRDELVQQLGASIVPPPAGAEGLDLCAAVAHGDVGKLVPPLV 363
Query: 308 IHFR--GADVKLSRSNFFVKVSEDIVCSVF---KGITNSVP-----IYGNIMQTNFLVGY 357
+HF G DV + N++ V + C V G +++P I GN MQ + + Y
Sbjct: 364 LHFGSGGGDVAVPPENYWGPVDDSTACMVVFSSGGPNSTLPMNETTIIGNYMQQDMHLLY 423
Query: 358 DIEQQTVSFKPTDCT 372
D+E+ +SF+P DC+
Sbjct: 424 DLEKGMLSFQPADCS 438
>gi|242085924|ref|XP_002443387.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
gi|241944080|gb|EES17225.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
Length = 460
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 71/214 (33%), Positives = 101/214 (47%), Gaps = 21/214 (9%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
G +EL H D+ ++ T +R+R A R+ RL S + A I N
Sbjct: 32 GLRLELTHVDAKQN------CTTKERMRRATERTHRRLA-----SMAGGGGEASAPIHWN 80
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y+ IG PP + A+ DTGS+LIWTQC C + C+ QD +DP S T K +
Sbjct: 81 ETQYIAEYLIGDPPQQAAAIIDTGSNLIWTQCSTCRANGCFGQDLTFYDPSRSRTAKPVA 140
Query: 148 CSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
C+ + C ++ C+ G C +YG G+ G L TE T G + + FG
Sbjct: 141 CNDTACLLGSETRCARDGKACAVLTAYGAGAI-GGFLGTEVFTFGHGQSSENNV-SLAFG 198
Query: 206 CGTNN----GGLFNSKTTGIVGLGGGDISLISQM 235
C T + G L +GI+GLG G +SL SQ+
Sbjct: 199 CITASRLTPGSL--DGASGIIGLGRGKLSLPSQL 230
Score = 46.6 bits (109), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 39/139 (28%), Positives = 62/139 (44%), Gaps = 19/139 (13%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSF----NSLSQVPEV 306
+IDSG+ T L L + + A V P G+ L+LC ++ VP +
Sbjct: 320 LIDSGSPFTSLIDVAYQALRDELVRQLGASVVPPPAGAEGLDLCVGGVAPGDAGKLVPPL 379
Query: 307 TIHFRGA-----DVKLSRSNFFVKVSEDIVCSVF---KGITNSVP-----IYGNIMQTNF 353
+HF DV + N++ V + C V G +++P I GN MQ +
Sbjct: 380 VLHFGSGGGGGGDVVVPPENYWGPVDDSTACMVVFSSGGPNSTLPLNETTIIGNYMQQDM 439
Query: 354 LVGYDIEQQTVSFKPTDCT 372
+ YD+ Q +SF+P DC+
Sbjct: 440 HLLYDLGQGVLSFQPADCS 458
>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
Length = 475
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 95/392 (24%), Positives = 163/392 (41%), Gaps = 88/392 (22%)
Query: 65 LNHFNQNSSISSSKASQADIIPNNAN--------YLIRISIGTPPTERLAVADTGSDLIW 116
L HF + + S+ + +P + Y +I +G+PP E DTGSD++W
Sbjct: 40 LEHFKSHDTRRHSRMLASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILW 99
Query: 117 TQCEPCP--PSQCYMQ-DSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCS-GVNCQYSVS 171
C+PCP P++ + LFD SST K + C C+ ++Q SC + C Y +
Sbjct: 100 INCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVGCDDDFCSFISQSDSCQPALGCSYHIV 159
Query: 172 YGDGSFSNGNLATETVTLGSTTGQAVALP---GITFGCGTNNGGLF---NSKTTGIVGLG 225
Y D S S+G + +TL TG P + FGCG++ G +S G++G G
Sbjct: 160 YADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFG 219
Query: 226 GGDISLISQMRTT-------------IAGNQRLGVSTPD--------------------- 251
+ S++SQ+ T + G V D
Sbjct: 220 QSNTSVLSQLAATGDAKRVFSHCLDNVKGGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLM 279
Query: 252 --------------IVIDSG------TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL 291
IV + G TTL + P+ +L+ +++ QPV
Sbjct: 280 GMDVDGTSLDLPRSIVRNGGTIVDSGTTLAYFPKVLYDSLI---ETILARQPVKLHIVEE 336
Query: 292 EL-CYSF--NSLSQVPEVTIHFRGADVKLSR--SNFFVKVSEDIVCSVFK--GIT----N 340
C+SF N P V+ F + VKL+ ++ + E++ C ++ G+T +
Sbjct: 337 TFQCFSFSTNVDEAFPPVSFEFEDS-VKLTVYPHDYLFTLEEELYCFGWQAGGLTTDERS 395
Query: 341 SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
V + G+++ +N LV YD++ + + + +C+
Sbjct: 396 EVILLGDLVLSNKLVVYDLDNEVIGWADHNCS 427
>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
Length = 493
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 94/360 (26%), Positives = 143/360 (39%), Gaps = 79/360 (21%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y + +GTPP DTGSD++W C+ CP D L+DPK SST ++
Sbjct: 88 YYTEVRLGTPPKRFYVQVDTGSDILWVNCITCDQCPHKSGLGLDLTLYDPKASSTGSTVM 147
Query: 148 CSSSQCASL---NQKSCSG-VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
C CA CS V C+YSV+YGDGS + G+ + + TG P
Sbjct: 148 CDQGFCADTFGGRLPKCSANVPCEYSVTYGDGSSTVGSFVNDALQFDQVTGDGQTQPANA 207
Query: 202 -ITFGCGTNNGGLFNSKT---TGIVGLGGGDISLISQMRT-------------TIAGNQR 244
+ FGCG GG S + GI+G G + S++SQ+ T TI G
Sbjct: 208 SVIFGCGAQQGGDLGSSSQALDGILGFGEANTSMLSQLATAGKVKKIFAHCLDTIKGGGI 267
Query: 245 LGV---------STPDIV-----------IDSG------------------------TTL 260
+ +TP + ID G TTL
Sbjct: 268 FAIGDVVQPKVKTTPLVADKPHYNVNLKTIDVGGTTLELPADIFKPGEKRGTIIDSGTTL 327
Query: 261 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSR- 319
T+LP+ ++ + + + D L YS + P +T HF D+ L
Sbjct: 328 TYLPELVFKKVMLAVFNKHQDITFHDVQDFLCFEYSGSVDDGFPTLTFHFE-DDLALHVY 386
Query: 320 -SNFFVKVSEDIVCSVFK-GITNS-----VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
+F D+ C F+ G S + + G+++ +N LV YD+E + + + +C+
Sbjct: 387 PHEYFFPNGNDVYCVGFQNGALQSKDGKDIVLMGDLVLSNKLVVYDLENRVIGWTDYNCS 446
>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
Length = 525
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 51/148 (34%), Positives = 78/148 (52%), Gaps = 15/148 (10%)
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
++ + + YL+ + +GTPP + DTGSDL W QC PC C+ Q P+FDP S
Sbjct: 141 ESGVAVGSGEYLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPC--LDCFEQRGPVFDPAAS 198
Query: 141 STYKSLPCSSSQCASL---------NQKSCSGVN---CQYSVSYGDGSFSNGNLATETVT 188
S+Y+++ C +C + + ++C C Y YGD S + G+LA E+ T
Sbjct: 199 SSYRNVTCGDHRCGHVAPPPEPEASSPRTCRRPGEDPCPYYYWYGDQSNTTGDLALESFT 258
Query: 189 LGSTT-GQAVALPGITFGCGTNNGGLFN 215
+ T G + + G+ FGCG N GLF+
Sbjct: 259 VNLTAPGASRRVDGVVFGCGHRNRGLFH 286
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/158 (27%), Positives = 75/158 (47%), Gaps = 18/158 (11%)
Query: 234 QMRTTIAGNQRLGVS--TPDI--------VIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQ 282
+++ + G + L +S T D+ +IDSGTTL+ F+ Y + M M +
Sbjct: 367 KLKGVLVGGELLNISSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRHAFMDRMSRSY 426
Query: 283 PVADPTGSLELCYSFNSLS--QVPEVTIHFR-GADVKLSRSNFFVKVSED---IVCSVFK 336
P+ L CY+ + + +VPE+++ F GA N+F+++ D I+C
Sbjct: 427 PLVPEFPVLSPCYNVSGVERPEVPELSLLFADGAVWDFPAENYFIRLDPDGGSIMCLAVL 486
Query: 337 GITNS-VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
G + + I GN Q NF V YD++ + F P C +
Sbjct: 487 GTPRTGMSIIGNFQQQNFHVVYDLQNNRLGFAPRRCAE 524
>gi|125575538|gb|EAZ16822.1| hypothetical protein OsJ_32294 [Oryza sativa Japonica Group]
Length = 392
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 56/147 (38%), Positives = 82/147 (55%), Gaps = 11/147 (7%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY+ +IGTPP AV D +L+WTQC+ C S+C+ QD+PLFDP S+TY++ PC
Sbjct: 50 NYVANFTIGTPPQPASAVIDLAGELVWTQCKQC--SRCFEQDTPLFDPTASNTYRAEPCG 107
Query: 150 SSQCASL--NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
+ C S+ + ++CSG C Y S G + G + T+T +G+ A + FGC
Sbjct: 108 TPLCESIPSDSRNCSGNVCAYQASTNAGD-TGGKVGTDTFAVGT------AKASLAFGCV 160
Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQ 234
+ +GIVGLG SL++Q
Sbjct: 161 VASDIDTMGGPSGIVGLGRTPWSLVTQ 187
>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
Length = 988
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 106/417 (25%), Positives = 154/417 (36%), Gaps = 113/417 (27%)
Query: 32 ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANY 91
E+ RD + F NS Y S N NH + N ++ + N+
Sbjct: 87 EIFGRDESRVSFINSKCNQYT--------SGNLKNHAHNN-----------NLFDEDGNF 127
Query: 92 LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
L+ ++ GTPP + + DTGS + WTQC+ C C FD SSTY C S
Sbjct: 128 LVDVAFGTPPQKFKLILDTGSSITWTQCKAC--VHCLKDSHRHFDSLASSTYSFGSCIPS 185
Query: 152 QCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNG 211
V Y+++YGD S S GN +T+TL + FGCG NN
Sbjct: 186 T-----------VGNTYNMTYGDKSTSVGNYGCDTMTLEPSD----VFQKFQFGCGRNNE 230
Query: 212 G--------------------------------------------LFNSKTT-------- 219
G LF K T
Sbjct: 231 GDFGSGADGMLGLGQGQLSTVSQTASKFKKVFSYCLPEENSIGSLLFGEKATSQSSSLKF 290
Query: 220 -------GIVGLGGGDISLISQMRTTIAGNQRLGV-----STPDIVIDSGTTLTFLPQGY 267
G GL + + ++ GN+RL + ++P +IDSGT +T LPQ
Sbjct: 291 TSLVNGPGTSGLEESGYYFVKLLDISV-GNKRLNIPSSVFASPGTIIDSGTVITRLPQRA 349
Query: 268 NSNLLSVMSSMIEAQPVAD----PTGSLELCYSFNSLSQV--PEVTIHF-RGADVKLSRS 320
S L + + P+++ L+ CY+ + V PE +HF GADV+L+
Sbjct: 350 YSALKAAFKKAMAKYPLSNGRRKENDMLDTCYNLSGRKDVLLPEXVLHFGDGADVRLNGK 409
Query: 321 NFFVKVSEDIVCSVFKGITNS-----VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
+C F G + S + I GN Q + V YDI + + F C+
Sbjct: 410 RVVWGNDASRLCLAFAGNSKSTMNPELTIIGNRQQVSLTVLYDIRGRRIGFGGNGCS 466
>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
Length = 359
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 54/157 (34%), Positives = 83/157 (52%), Gaps = 13/157 (8%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y++ +SIGTPP A+ DTGSDL+W +C+ C +F SS+YK LP
Sbjct: 2 EGEYMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKLP 61
Query: 148 CSSSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTL---GSTTGQAVA 198
C+S+ C+ + S +G+ C+Y YGDGS ++G++ ++ ++ G+
Sbjct: 62 CNSTHCSGM---SSAGIGPRCEETCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSF 118
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
G FGCG G +N T G++GLG SLI Q+
Sbjct: 119 FDGFLFGCGRKLKGDWNF-TQGLIGLGQKSHSLIQQL 154
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 55/118 (46%), Gaps = 6/118 (5%)
Query: 253 VIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSLELCY--SFNSLSQVPEVTIH 309
VIDSGTT T L P Y + S+ +I P + L+LC+ S ++ P VT +
Sbjct: 244 VIDSGTTYTLLTPPVYEAMRKSIEEQVI--LPTLGNSAGLDLCFNSSGDTSYGFPSVTFY 301
Query: 310 FRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
F + L N F S D+VC + I GN+ Q NF + YD+ +SF
Sbjct: 302 FANQVQLVLPFENIFQVTSRDVVCLSMDSSGGDLSIIGNMQQQNFHILYDLVASQISF 359
>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 639
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 103/392 (26%), Positives = 158/392 (40%), Gaps = 83/392 (21%)
Query: 55 RDALTRSLNRLNHFNQNSSISSSKASQ--ADIIPNNANYLIRISIGTPPTERLAVADTGS 112
R L R +RL H SS A D + N Y R+ IG+PP E + DTGS
Sbjct: 52 RRVLDRD-HRLRHLQNLVKPHSSNARMRLHDDLLTNGYYTTRLWIGSPPQEFALIVDTGS 110
Query: 113 DLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSY 172
+ + C C QC P F P++SSTY+ + C++ N GV C Y Y
Sbjct: 111 TVTYVPCSNC--VQCGNHQDPRFQPELSSTYQPVKCNADCNCDEN-----GVQCTYERRY 163
Query: 173 GDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT-NNGGLFNSKTTGIVGL------- 224
+ S S+G LA + ++ G + + FGC T +G L+ + GI+GL
Sbjct: 164 AEMSTSSGVLAEDVMSFGKES--ELVPQRAVFGCETMESGDLYTQRADGIMGLGRGTLSV 221
Query: 225 -----------------------GGGDISL-----------------------ISQMRTT 238
GGG + L I
Sbjct: 222 MDQLVGKGVVSNSFSLCYGGMDVGGGAMVLGGISSPPGMVFSHSDPSRSPYYNIELKEIH 281
Query: 239 IAGNQ-RLGVSTPD----IVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSL- 291
+AG +L T D ++DSGTT + P+ Y + ++M + + ++ P +
Sbjct: 282 VAGKPLKLNPRTFDGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQISGPDPNFK 341
Query: 292 ELCYS-----FNSLSQV-PEVTIHF-RGADVKLSRSNFF---VKVSEDIVCSVFKGITNS 341
++C+S L +V PEV + F G + LS N+ KVS +FK +
Sbjct: 342 DICFSGAGRDVTELPKVFPEVDMVFANGQKISLSPENYLFRHTKVSGAYCLGIFKNGNDQ 401
Query: 342 VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
+ G I+ N LV Y+ E T+ F T+C++
Sbjct: 402 TTLLGGIIVRNTLVTYNRENSTIGFWKTNCSE 433
>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 392
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 61/158 (38%), Positives = 79/158 (50%), Gaps = 19/158 (12%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y + S+GTP + + DTGSDL + QC PC CY QD PL+ P SST+ +P
Sbjct: 31 SGQYFVDFSLGTPEQKFHLIVDTGSDLAFVQCAPC--DLCYEQDGPLYQPSNSSTFTPVP 88
Query: 148 CSSSQ-----------CASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
C S++ C+S +S C Y YGD S + G A ET T+G
Sbjct: 89 CDSAECLLIPAPVGAPCSSSYPESPPQGACSYEYRYGDNSSTVGVFAYETATVGGIRVNH 148
Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
VA FGCG N G F S G++GLG G +S SQ
Sbjct: 149 VA-----FGCGNRNQGSFVS-AGGVLGLGQGALSFTSQ 180
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 68/124 (54%), Gaps = 6/124 (4%)
Query: 253 VIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIH 309
+ DSGTT+T+ PQ Y + + S+ + P G L LC + + + P TI
Sbjct: 266 IFDSGTTVTYWSPQAYARIIAAFEKSVPYPRAPPSPQG-LPLCVNVSGIDHPIYPSFTIE 324
Query: 310 F-RGADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
F +GA + ++ N+F++VS +I C ++ + ++ + GNI+Q N+LV YD E+ + F
Sbjct: 325 FDQGATYRPNQGNYFIEVSPNIDCLAMLESSSDGFNVIGNIIQQNYLVQYDREEHRIGFA 384
Query: 368 PTDC 371
+C
Sbjct: 385 HANC 388
>gi|298204765|emb|CBI25263.3| unnamed protein product [Vitis vinifera]
Length = 359
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 71/231 (30%), Positives = 104/231 (45%), Gaps = 30/231 (12%)
Query: 166 CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLG 225
C Y+++YGDGSF+ G L E + G+ + + FGCG NN GLF +G++GLG
Sbjct: 133 CNYAINYGDGSFTRGELGHEKLKFGT-----ILVKDFIFGCGRNNKGLFGG-VSGLMGLG 186
Query: 226 GGDISLISQMRT---------------TIAGN--QRLGVSTPDIVIDSGTTLTFLPQGYN 268
D+SLISQ +I G Q V I++DSGT +T LP
Sbjct: 187 RSDLSLISQTSENPQLYNFYFINLTGISIGGVALQAPSVGPSRILVDSGTVITRLPPTIY 246
Query: 269 SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRG---ADVKLSRSNFF 323
L + P A L+ C++ ++ +V P + +HF G V ++ +F
Sbjct: 247 KALKAEFLKQFTGFPPAPAFSILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYF 306
Query: 324 VKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
VK VC + + V I GN Q N V YD ++ V F C+
Sbjct: 307 VKSDASQVCLALASLEYQDEVAILGNYQQKNLRVIYDTKETKVGFALETCS 357
>gi|212721496|ref|NP_001131929.1| uncharacterized protein LOC100193320 precursor [Zea mays]
gi|194692946|gb|ACF80557.1| unknown [Zea mays]
Length = 424
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 96/384 (25%), Positives = 155/384 (40%), Gaps = 91/384 (23%)
Query: 71 NSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYM 129
+SSI++ D+ P+ Y + ++IG PP D+GSDL W QC+ PC C
Sbjct: 38 SSSIAAVFPLYGDVYPHGL-YYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPC--RSCNE 94
Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLN-----QKSCSGVN--CQYSVSYGDGSFSNGNL 182
PL+ P S K +PC CASL+ + C + C Y + Y D S G L
Sbjct: 95 VPHPLYRPTKS---KLVPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVL 151
Query: 183 ATETVTLGSTTGQAVALPGITFGCGTN---NGGLFNSKTTGIVGLGGGDISLISQMR--- 236
++ L T G +VA P + FGCG + G +S T G++GLG G +SL+SQ++
Sbjct: 152 INDSFALRLTNG-SVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRG 210
Query: 237 --------------------------------TTIA-----------------GNQRLGV 247
T +A G++ LGV
Sbjct: 211 VTKNVVGHCLSLRGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGV 270
Query: 248 STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS----FNSLSQV 303
+V DSG++ T+ L++ + + +P SL LC+ F S+ V
Sbjct: 271 RLAKVVFDSGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLCWKGQEPFKSVLDV 330
Query: 304 PE----VTIHFRGAD---VKLSRSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIM 349
+ + ++F +++ N+ + C GI N + I G+I
Sbjct: 331 RKEFKSLVLNFASGKKTLMEIPPENYLIVTENGNAC---LGILNGSEIGLKDLSIIGDIT 387
Query: 350 QTNFLVGYDIEQQTVSFKPTDCTK 373
+ +V YD E+ + + C +
Sbjct: 388 MQDHMVIYDNEKGKIGWIRAPCDR 411
>gi|226509616|ref|NP_001152116.1| aspartic proteinase Asp1 precursor [Zea mays]
gi|195652765|gb|ACG45850.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 432
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 93/374 (24%), Positives = 148/374 (39%), Gaps = 92/374 (24%)
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMS 140
D+ P+ Y + ++IG PP D+GSDL W QC+ PC C PL+ P S
Sbjct: 56 GDVYPHGL-YYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPC--RSCNEVPHPLYRPTKS 112
Query: 141 STYKSLPCSSSQCASLNQKSCSGVN--------CQYSVSYGDGSFSNGNLATETVTLGST 192
K +PC CASL+ G + C Y + Y D S G L ++ L T
Sbjct: 113 ---KLVPCVHRLCASLHNALTGGKHRCESPHEQCDYVIKYADQGSSTGVLVNDSFALRLT 169
Query: 193 TGQAVALPGITFGCGTN---NGGLFNSKTTGIVGLGGGDISLISQMR------------- 236
G +VA P + FGCG + G +S T G++GLG G +SL+SQ++
Sbjct: 170 NG-SVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGHCL 228
Query: 237 ----------------------TTIA-----------------GNQRLGVSTPDIVIDSG 257
T +A G++ LGV +V DSG
Sbjct: 229 SLRGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKVVFDSG 288
Query: 258 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS----FNSLSQVPE----VTIH 309
++ T+ L++ + + +P SL LC+ F S+ V + + ++
Sbjct: 289 SSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLCWKGQEPFKSVLDVRKEFKSLVLN 348
Query: 310 FRGAD---VKLSRSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYDI 359
F +++ N+ + C GI N + I G+I + +V YD
Sbjct: 349 FASGKKTLMEIPPENYLIVTENGNAC---LGILNGSEIGLKDLSIIGDITMQDHMVIYDN 405
Query: 360 EQQTVSFKPTDCTK 373
E+ + + C +
Sbjct: 406 EKGKIGWIRAPCDR 419
>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 609
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 103/392 (26%), Positives = 158/392 (40%), Gaps = 83/392 (21%)
Query: 55 RDALTRSLNRLNHFNQNSSISSSKASQ--ADIIPNNANYLIRISIGTPPTERLAVADTGS 112
R L R +RL H SS A D + N Y R+ IG+PP E + DTGS
Sbjct: 52 RRVLDRD-HRLRHLQNLVKPHSSNARMRLHDDLLTNGYYTTRLWIGSPPQEFALIVDTGS 110
Query: 113 DLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSY 172
+ + C C QC P F P++SSTY+ + C++ N GV C Y Y
Sbjct: 111 TVTYVPCSNC--VQCGNHQDPRFQPELSSTYQPVKCNADCNCDEN-----GVQCTYERRY 163
Query: 173 GDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT-NNGGLFNSKTTGIVGL------- 224
+ S S+G LA + ++ G + + FGC T +G L+ + GI+GL
Sbjct: 164 AEMSTSSGVLAEDVMSFGKES--ELVPQRAVFGCETMESGDLYTQRADGIMGLGRGTLSV 221
Query: 225 -----------------------GGGDISL-----------------------ISQMRTT 238
GGG + L I
Sbjct: 222 MDQLVGKGVVSNSFSLCYGGMDVGGGAMVLGGISSPPGMVFSHSDPSRSPYYNIELKEIH 281
Query: 239 IAGNQ-RLGVSTPD----IVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSL- 291
+AG +L T D ++DSGTT + P+ Y + ++M + + ++ P +
Sbjct: 282 VAGKPLKLNPRTFDGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQISGPDPNFK 341
Query: 292 ELCYS-----FNSLSQV-PEVTIHF-RGADVKLSRSNFF---VKVSEDIVCSVFKGITNS 341
++C+S L +V PEV + F G + LS N+ KVS +FK +
Sbjct: 342 DICFSGAGRDVTELPKVFPEVDMVFANGQKISLSPENYLFRHTKVSGAYCLGIFKNGNDQ 401
Query: 342 VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
+ G I+ N LV Y+ E T+ F T+C++
Sbjct: 402 TTLLGGIIVRNTLVTYNRENSTIGFWKTNCSE 433
>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 526
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 63/206 (30%), Positives = 100/206 (48%), Gaps = 16/206 (7%)
Query: 42 PFYNSSETPYQRLRDAL-------TRSLNRLNHFNQNS-SISSSKASQADIIPNNANYLI 93
PF+N E P + SL +H ++N S+ + + I +N+L+
Sbjct: 130 PFHNQEEFPQTFSSSSSFKLKLYPAASLYNTHHQHKNYYSLDLNASLNPGITTGTSNFLV 189
Query: 94 RISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC 153
+I +G PP + + D +D W QC+PC +CY Q +FDP SS+Y L C + C
Sbjct: 190 QIGVGGPPQKFYMIFDLQTDFTWLQCQPCI--KCYDQPDSIFDPSQSSSYTLLSCETKHC 247
Query: 154 ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGG 212
L SCS C+Y+++Y DG+ + G L ETV+ S+ + ++ GC N G
Sbjct: 248 NLLPNSSCSDDGYCRYNITYKDGTNTEGVLINETVSFESSG----WVDRVSLGCSNKNQG 303
Query: 213 LFNSKTTGIVGLGGGDISLISQMRTT 238
F + G GLG G +S S++ +
Sbjct: 304 PF-VGSDGTFGLGRGSLSFPSRINAS 328
>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
Length = 466
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 86/361 (23%), Positives = 146/361 (40%), Gaps = 86/361 (23%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y +++ +GTP E VADTGSDL W +C PP + +F PK S ++ +PC
Sbjct: 115 QYFVKLRVGTPVQEFTLVADTGSDLTWVKCAGASPPGR-------VFRPKTSRSWAPIPC 167
Query: 149 SSSQCA-----SLNQKSCSGVNCQYSVSYGDGSF-SNGNLATETVTLGSTTGQAVALPGI 202
SS C +L S C Y Y +GS + G + TE+ T+ G+ L +
Sbjct: 168 SSDTCKLDVPFTLANCSSPASPCTYDYRYKEGSAGARGIVGTESATIALPGGKVAQLKDV 227
Query: 203 TFGCGTNNGG-----------LFNSK---------------------------TTGIVGL 224
GC +++ G L N+K TG +
Sbjct: 228 VLGCSSSHDGQSFRSADGVLSLGNAKISFATQAAARFGGSFSYCLVDHLAPRNATGYLAF 287
Query: 225 GGGDISLISQMRTT------------------IAGN------QRLGVSTPDIVIDSGTTL 260
G G + +T +AG + + +++DSG TL
Sbjct: 288 GPGQVPRTPATQTKLFLDPEMPFYGVKVDAIHVAGKALDIPAEVWDAKSGGVILDSGNTL 347
Query: 261 TFLPQGYNSNLLSVMSSMIEAQP-VADPTGSLELCYSFNSLSQ-----VPEVTIHFRG-A 313
T L +++ +S ++ P V+ P E CY++ + +P++ + F G A
Sbjct: 348 TVLAAPAYKAVVAALSKHLDGVPKVSFPP--FEHCYNWTARRPGAPEIIPKLAVQFAGSA 405
Query: 314 DVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
++ ++ + V + C V +G + + GNIMQ L +D++ V FK ++CT
Sbjct: 406 RLEPPAKSYVIDVKPGVKCIGVQEGEWPGLSVIGNIMQQEHLWEFDLKNMQVRFKQSNCT 465
Query: 373 K 373
+
Sbjct: 466 R 466
>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 488
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 102/358 (28%), Positives = 156/358 (43%), Gaps = 78/358 (21%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCE---PCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I +GTP + DTGSD++W C CP ++ +P +D SST KS+
Sbjct: 85 YFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTP-YDVDASSTAKSVS 143
Query: 148 CSSSQCASLNQKS-C-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG--QAVALPG-I 202
CS + C+ +NQ+S C SG CQY + YGDGS +NG L + V L TG Q + G I
Sbjct: 144 CSDNFCSYVNQRSECHSGSTCQYVIMYGDGSSTNGYLVKDVVHLDLVTGNRQTGSTNGTI 203
Query: 203 TFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT----------IAGNQRLGV-S 248
FGCG+ G + GI+G G + S ISQ+ + + N G+ +
Sbjct: 204 IFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGGGIFA 263
Query: 249 TPDIVIDSGTTLTFLPQ----GYNSNLLSVMSSMIEAQPVADPTGSLE---------LCY 295
++V T L + N N + V +S++E A +G + L Y
Sbjct: 264 IGEVVSPKVKTTPMLSKSAHYSVNLNAIEVGNSVLELSSNAFDSGDDKGVIIDSGTTLVY 323
Query: 296 SFNS---------LSQVPEVTIH--------FRGADVKLSR------------------S 320
++ L+ PE+T+H F D KL R
Sbjct: 324 LPDAVYNPLLNEILASHPELTLHTVQESFTCFHYTD-KLDRFPTVTFQFDKSVSLAVYPR 382
Query: 321 NFFVKVSEDIVCSVFK--GITN----SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
+ +V ED C ++ G+ S+ I G++ +N LV YDIE Q + + +C+
Sbjct: 383 EYLFQVREDTWCFGWQNGGLQTKGGASLTILGDMALSNKLVVYDIENQVIGWTNHNCS 440
>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 488
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 94/363 (25%), Positives = 143/363 (39%), Gaps = 84/363 (23%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I IGTPP DTGSD++W QC+ CP D L+D K SS+ K +P
Sbjct: 83 YYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRSSLGMDLTLYDIKESSSGKLVP 142
Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV---ALP 200
C C +N +G ++C Y YGDGS + G + V +G A
Sbjct: 143 CDQEFCKEINGGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSGDLKTDSANG 202
Query: 201 GITFGCGTNNGGLFNSKTT----GIVGLGGGDISLISQ---------------------- 234
I FGCG G +S GI+G G + S+ISQ
Sbjct: 203 SIVFGCGARQSGDLSSSNEEALDGILGFGKANSSMISQLASSGKVKKMFAHCLNGVNGGG 262
Query: 235 ---------------------------MRTTIAGNQRLGVST--------PDIVIDSGTT 259
M G+ L +ST +IDSGTT
Sbjct: 263 IFAIGHVVQPKVNMTPLLPDQPHYSVNMTAVQVGHTFLSLSTDTSAQGDRKGTIIDSGTT 322
Query: 260 LTFLPQGYNSNLLSVMSSM---IEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADV 315
L +LP+G L+ M S ++ Q + D + YS + P VT F G +
Sbjct: 323 LAYLPEGIYEPLVYKMISQHPDLKVQTLHDEYTCFQ--YSESVDDGFPAVTFFFENGLSL 380
Query: 316 KLSRSNFFVKVSEDIVCSVFKG------ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 369
K+ ++ S + C ++ + ++ + G+++ +N LV YD+E Q + +
Sbjct: 381 KVYPHDYLFP-SVNFWCIGWQNSGTQSRDSKNMTLLGDLVLSNKLVFYDLENQAIGWAEY 439
Query: 370 DCT 372
+C+
Sbjct: 440 NCS 442
>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
Length = 506
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 57/158 (36%), Positives = 74/158 (46%), Gaps = 13/158 (8%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y I +GTPP DTGSD++W C CP D +DPK SS+ ++
Sbjct: 87 YFTEIKLGTPPKRYYVQVDTGSDILWVNCISCSKCPRKSGLGLDLTFYDPKASSSGSTVS 146
Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
C CA+ G V C+YSV YGDGS + G T+ + TG PG
Sbjct: 147 CDQGFCAATYGGKLPGCTANVPCEYSVMYGDGSSTTGFFITDALQFDQVTGDGQTQPGNA 206
Query: 202 -ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM 235
ITFGCG GG N GI+G G + S++SQ+
Sbjct: 207 TITFGCGAQQGGDLGNSNQALDGILGFGQANTSMLSQL 244
Score = 46.6 bits (109), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 37/128 (28%), Positives = 60/128 (46%), Gaps = 9/128 (7%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRG 312
+IDSGTTLT+LP+ ++ V+ S + L YS + P +T HF
Sbjct: 335 IIDSGTTLTYLPELVFKQVMDVVFSKHRDIAFHNLQDFLCFQYSGSVDDGFPTITFHFE- 393
Query: 313 ADVKLSR--SNFFVKVSEDIVCSVFK-GITNS-----VPIYGNIMQTNFLVGYDIEQQTV 364
D+ L +F DI C F+ G S + + G+++ +N LV YD+E Q +
Sbjct: 394 DDLALHVYPHEYFFPNGNDIYCVGFQNGALQSKDGKDIVLMGDLVLSNKLVVYDLENQVI 453
Query: 365 SFKPTDCT 372
+ +C+
Sbjct: 454 GWTDYNCS 461
>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 68/203 (33%), Positives = 99/203 (48%), Gaps = 21/203 (10%)
Query: 50 PYQRLRDALTRSLNRLNHF----NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERL 105
P+ AL+ +RL+ F + S+ S S A + Y + + +GTPP + L
Sbjct: 46 PFTTPSQALSFDSHRLSFFFSALHTPQSLKSPVVSGAST--GSGQYFVDLRLGTPPQKLL 103
Query: 106 AVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL---NQKSCS 162
VADTGSDL+W +C C + S F + S+T+ C S C + C+
Sbjct: 104 LVADTGSDLVWVKCSACRNCTRHTPGS-AFLARHSTTFSPNHCYDSACQLVPLPKHHRCN 162
Query: 163 GVN----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN------NGG 212
C+Y SYGDGS ++G + ET TL +++G+ L GI FGC +G
Sbjct: 163 HARLHSPCRYEYSYGDGSKTSGFFSKETTTLNTSSGREAKLKGIAFGCAFRISGPSVSGA 222
Query: 213 LFNSKTTGIVGLGGGDISLISQM 235
FN G++GLG G ISL SQ+
Sbjct: 223 SFNG-AHGVMGLGRGPISLSSQL 244
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 64/124 (51%), Gaps = 5/124 (4%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF 310
++DSGTTLTFLP+ +L+V+ + A+PT +LC + + + ++P+++
Sbjct: 332 IVDSGTTLTFLPEPAYLQILTVIKRRVRLPSPAEPTPGFDLCVNVSEIEHPRLPKLSFKL 391
Query: 311 RGADV-KLSRSNFFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
G V N+FV ED+ C + + + + GN+MQ FL+ +D ++ + F
Sbjct: 392 GGDSVFSPPPRNYFVDTDEDVKCLALQAVMTPSGFSVIGNLMQQGFLLEFDKDRTRLGFS 451
Query: 368 PTDC 371
C
Sbjct: 452 RHGC 455
>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
Length = 462
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 64/229 (27%), Positives = 111/229 (48%), Gaps = 16/229 (6%)
Query: 24 AQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD 83
A+ GG + IH +P+S + + S + +N + + ++S
Sbjct: 35 ARGGGIGFKAIHVAAPQSRVKANPSPSSAAQKSLFPYSAHIFQQHTKNPA--ALRSSTTT 92
Query: 84 IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
+ Y I +G+P E + + DTGS+L W QC PC C ++D S++Y
Sbjct: 93 LGRKFGEYYTSIKLGSPGQEAILIVDTGSELTWLQCLPC--KVCAPSVDTIYDAARSASY 150
Query: 144 KSLPCSSSQ-CASLNQKS----CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT-GQAV 197
+ + C++SQ C++ +Q + G CQ++ YGDGSFS G+L+T+T+ + + G+ V
Sbjct: 151 RPVTCNNSQLCSNSSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPV 210
Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLG 246
+ FGC + L + +GI+GL G ++L Q+ QR G
Sbjct: 211 TVQDFAFGCAQGDLELVPTGASGILGLNAGKMALPMQL------GQRFG 253
>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family
[Arabidopsis thaliana]
Length = 449
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 94/386 (24%), Positives = 160/386 (41%), Gaps = 88/386 (22%)
Query: 65 LNHFNQNSSISSSKASQADIIPNNAN--------YLIRISIGTPPTERLAVADTGSDLIW 116
L HF + + S+ + +P + Y +I +G+PP E DTGSD++W
Sbjct: 40 LEHFKSHDTRRHSRMLASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILW 99
Query: 117 TQCEPCP--PSQCYMQ-DSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCS-GVNCQYSVS 171
C+PCP P++ + LFD SST K + C C+ ++Q SC + C Y +
Sbjct: 100 INCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVGCDDDFCSFISQSDSCQPALGCSYHIV 159
Query: 172 YGDGSFSNGNLATETVTLGSTTGQAVALP---GITFGCGTNNGGLF---NSKTTGIVGLG 225
Y D S S+G + +TL TG P + FGCG++ G +S G++G G
Sbjct: 160 YADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFG 219
Query: 226 GGDISLISQMRTT-------------IAGNQRLGVSTPD--------------------- 251
+ S++SQ+ T + G V D
Sbjct: 220 QSNTSVLSQLAATGDAKRVFSHCLDNVKGGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLM 279
Query: 252 --------------IVIDSG------TTLTFLPQGYNSNLLSVMSSMIEAQPVA-DPTGS 290
IV + G TTL + P+ +L+ +++ QPV
Sbjct: 280 GMDVDGTSLDLPRSIVRNGGTIVDSGTTLAYFPKVLYDSLI---ETILARQPVKLHIVEE 336
Query: 291 LELCYSF--NSLSQVPEVTIHFRGADVKLSR--SNFFVKVSEDIVCSVFK--GIT----N 340
C+SF N P V+ F + VKL+ ++ + E++ C ++ G+T +
Sbjct: 337 TFQCFSFSTNVDEAFPPVSFEFEDS-VKLTVYPHDYLFTLEEELYCFGWQAGGLTTDERS 395
Query: 341 SVPIYGNIMQTNFLVGYDIEQQTVSF 366
V + G+++ +N LV YD++ + + +
Sbjct: 396 EVILLGDLVLSNKLVVYDLDNEVIGW 421
>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 93/363 (25%), Positives = 145/363 (39%), Gaps = 84/363 (23%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I IGTPP DTGSD++W QC+ CP D L+D K SS+ K +P
Sbjct: 85 YYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRSNLGMDLTLYDIKESSSGKFVP 144
Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV---ALP 200
C C +N +G ++C Y YGDGS + G + V +G A
Sbjct: 145 CDQEFCKEINGGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSGDLKTDSANG 204
Query: 201 GITFGCGTNNGGLFNSKTT----GIVGLGGGDISLISQ---------------------- 234
I FGCG G +S GI+G G + S+ISQ
Sbjct: 205 SIVFGCGARQSGDLSSSNEEALGGILGFGKANSSMISQLASSGKVKKMFAHCLNGVNGGG 264
Query: 235 ---------------------------MRTTIAGNQRLGVST--------PDIVIDSGTT 259
M G+ L +ST +IDSGTT
Sbjct: 265 IFAIGHVVQPKVNMTPLLPDQPHYSVNMTAVQVGHAFLSLSTDTSTQGDRKGTIIDSGTT 324
Query: 260 LTFLPQG-YNSNLLSVMSSM--IEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADV 315
L +LP+G Y + ++S ++ + + D + YS + P VT +F G +
Sbjct: 325 LAYLPEGIYEPLVYKIISQHPDLKVRTLHDEYTCFQ--YSESVDDGFPAVTFYFENGLSL 382
Query: 316 KLSRSNFFVKVSEDIVCSVFKG------ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 369
K+ ++ S D C ++ + ++ + G+++ +N LV YD+E Q + +
Sbjct: 383 KVYPHDYLFP-SGDFWCIGWQNSGTQSRDSKNMTLLGDLVLSNKLVFYDLENQVIGWTEY 441
Query: 370 DCT 372
+C+
Sbjct: 442 NCS 444
>gi|238006986|gb|ACR34528.1| unknown [Zea mays]
gi|413916290|gb|AFW56222.1| aspartic proteinase Asp1 [Zea mays]
Length = 433
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 93/373 (24%), Positives = 149/373 (39%), Gaps = 91/373 (24%)
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMS 140
D+ P+ Y + ++IG PP D+GSDL W QC+ PC C PL+ P S
Sbjct: 58 GDVYPHGL-YYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPC--RSCNEVPHPLYRPTKS 114
Query: 141 STYKSLPCSSSQCASLN-----QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTT 193
K +PC CASL+ + C + C Y + Y D S G L ++ L T
Sbjct: 115 ---KLVPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVLINDSFALRLTN 171
Query: 194 GQAVALPGITFGCGTN---NGGLFNSKTTGIVGLGGGDISLISQMR-------------- 236
G +VA P + FGCG + G +S T G++GLG G +SL+SQ++
Sbjct: 172 G-SVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGHCLS 230
Query: 237 ---------------------TTIA-----------------GNQRLGVSTPDIVIDSGT 258
T +A G++ LGV +V DSG+
Sbjct: 231 LRGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKVVFDSGS 290
Query: 259 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS----FNSLSQVPE----VTIHF 310
+ T+ L++ + + +P SL LC+ F S+ V + + ++F
Sbjct: 291 SFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLCWKGQEPFKSVLDVRKEFKSLVLNF 350
Query: 311 RGAD---VKLSRSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYDIE 360
+++ N+ + C GI N + I G+I + +V YD E
Sbjct: 351 ASGKKTLMEIPPENYLIVTENGNAC---LGILNGSEIGLKDLSIIGDITMQDHMVIYDNE 407
Query: 361 QQTVSFKPTDCTK 373
+ + + C +
Sbjct: 408 KGKIGWIRAPCDR 420
>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
gi|194692214|gb|ACF80191.1| unknown [Zea mays]
gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
Length = 441
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 84/363 (23%), Positives = 143/363 (39%), Gaps = 84/363 (23%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQCYMQDSPLFDPKMSSTYK 144
Y +++ +GTP E VADTGS+L W +C PP +F P+ S ++
Sbjct: 87 GTGQYFVKVLVGTPAQEFTLVADTGSELTWVKCAGGASPPGL-------VFRPEASKSWA 139
Query: 145 SLPCSSSQCA-----SLNQKSCSGVNCQYSVSYGDGSFSN-GNLATETVTLGSTTGQAVA 198
+PCSS C SL S S C Y Y +GS G + T++ T+ G+
Sbjct: 140 PVPCSSDTCKLDVPFSLANCSSSASPCSYDYRYKEGSAGALGVVGTDSATIALPGGKVAQ 199
Query: 199 LPGITFGCGTNNGG-----------LFNSK---------------------------TTG 220
L + GC + + G L N+K TG
Sbjct: 200 LQDVVLGCSSTHDGQSFKSVDGVLSLGNAKISFASRAAARFGGSFSYCLVDHLAPRNATG 259
Query: 221 IVGLGGGDISLISQMRTTI-----------------AGNQRLGV-------STPDIVIDS 256
+ G G + +T + Q L + + +++DS
Sbjct: 260 YLAFGPGQVPRTPATQTKLFLDPAMPFYGVKVDAVHVAGQALDIPAEVWDPKSGGVILDS 319
Query: 257 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS----LSQVPEVTIHFRG 312
GTTLT L +++ ++ ++ P D E CY++ + ++P++ + F G
Sbjct: 320 GTTLTVLATPAYKAVVAALTKLLAGVPKVD-FPPFEHCYNWTAPRPGAPEIPKLAVQFTG 378
Query: 313 -ADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
A ++ ++ + V + C + +G V + GNIMQ L +D++ V F P+
Sbjct: 379 CARLEPPAKSYVIDVKPGVKCIGLQEGEWPGVSVIGNIMQQEHLWEFDLKNMEVRFMPST 438
Query: 371 CTK 373
CT+
Sbjct: 439 CTR 441
>gi|297838267|ref|XP_002887015.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297332856|gb|EFH63274.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 324
Score = 94.0 bits (232), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 85/303 (28%), Positives = 123/303 (40%), Gaps = 76/303 (25%)
Query: 92 LIRISIGTPPTERLAVADTGSDLIWTQC--EPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
+I + IGTPP + V DTGS L W QC + PP + FDP +SS++ +LPCS
Sbjct: 75 IISLPIGTPPQAQQMVLDTGSQLSWIQCHRKKLPP-----KPKTSFDPSLSSSFSTLPCS 129
Query: 150 SSQCA------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
C +L S C YS Y DG+F+ GNL E +T +T P +
Sbjct: 130 HPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTE----ITPPLI 185
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFL 263
GC T +S GI+G+ G +S +SQ + T + P
Sbjct: 186 LGCATE-----SSDDRGILGMNRGRLSFVSQAKIT-----KFSYCIPP------------ 223
Query: 264 PQGYNSNLLSVMSSMIEAQPVADPTGSLEL-------CYSFNSLSQVPEVTIHFRGADVK 316
SN +P PTGS L + + SL PE ++
Sbjct: 224 ----KSN-----------RPGFTPTGSFYLGDNPNSKGFKYVSLLTFPE------RVEIL 262
Query: 317 LSRSNFFVKVSEDIVC------SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
+ + V V + I C S+ +N I GN+ Q N V +D+ + V F D
Sbjct: 263 VPKERVLVNVGDGIHCVGIGRSSMLGAASN---IIGNVHQQNLWVEFDVTNRRVGFARAD 319
Query: 371 CTK 373
C++
Sbjct: 320 CSR 322
>gi|226492150|ref|NP_001146362.1| hypothetical protein precursor [Zea mays]
gi|219886805|gb|ACL53777.1| unknown [Zea mays]
gi|414878074|tpg|DAA55205.1| TPA: hypothetical protein ZEAMMB73_415404 [Zea mays]
Length = 440
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 75/222 (33%), Positives = 103/222 (46%), Gaps = 40/222 (18%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ-NSSISSSKASQADIIP 86
G +EL H D+ + Y E R+R A R+ RL + I SQ
Sbjct: 22 GIRLELTHVDAKE--HYTVEE----RVRRATERTHRRLASMGGVTAPIHWGGQSQ----- 70
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
Y+ IG PP A+ DTGS+LIWTQC C P+ C+ Q+ P +DP S +++
Sbjct: 71 ----YIAEYLIGDPPQRAEAIIDTGSNLIWTQCSRCRPT-CFRQNLPYYDPSRSRAARAV 125
Query: 147 PCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
C+ + CA ++ C N C YG G+ + G LATE +T S T V F
Sbjct: 126 GCNDAACALGSETQCLSDNKTCAVVTGYGAGNIA-GTLATENLTFQSETVSLV------F 178
Query: 205 GC--------GTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
GC G+ NG +GI+GLG G +SL SQ+ T
Sbjct: 179 GCIVVTKLSPGSLNGA------SGIIGLGRGKLSLPSQLGDT 214
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 41/135 (30%), Positives = 62/135 (45%), Gaps = 17/135 (12%)
Query: 254 IDSGTTLTFLPQGYNSNLLSVMSSMIEA---QPVADPTGSLELCYSFNSLSQ-VPEVTIH 309
IDSG LT L L + ++ + A QP+A TG +LC + + VP + +H
Sbjct: 305 IDSGAPLTSLVDVAYQALRAELARQLGAALVQPLAGTTG-FDLCVALKDAERLVPPLVLH 363
Query: 310 F-----RGADVKLSRSNFFVKVSEDIVCSV-FKGI------TNSVPIYGNIMQTNFLVGY 357
F G D+ + +N++ V C V F + N + GN MQ N V Y
Sbjct: 364 FGGGSGTGTDLVVPPANYWAPVDSATACMVVFSSVDRKSLPMNETTVIGNYMQQNMHVLY 423
Query: 358 DIEQQTVSFKPTDCT 372
D+ +SF+P DC+
Sbjct: 424 DLAGGVLSFQPADCS 438
>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 53/137 (38%), Positives = 74/137 (54%), Gaps = 12/137 (8%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y + + IGTPP + DTGSDL W QC PC C+ Q+ P +DPK SS+++++ C
Sbjct: 88 GEYFMDVFIGTPPKHYSLILDTGSDLNWIQCVPC--HDCFEQNGPYYDPKESSSFRNIGC 145
Query: 149 SSSQCASLNQKS----CSGVN--CQYSVSYGDGSFSNGNLATE--TVTLGSTTGQA--VA 198
+C ++ C N C Y YGD S + G+ ATE TV L S TG++
Sbjct: 146 HDPRCHLVSSPDPPLPCKAENQTCPYFYWYGDSSNTTGDFATETFTVNLTSPTGKSEFKR 205
Query: 199 LPGITFGCGTNNGGLFN 215
+ + FGCG N GLF+
Sbjct: 206 VENVMFGCGHWNRGLFH 222
>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
Length = 626
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 95/375 (25%), Positives = 156/375 (41%), Gaps = 80/375 (21%)
Query: 70 QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM 129
QNS + +++ D + +N Y R+ IGTPP E + DTGS + + C C QC
Sbjct: 56 QNSELPNARMRLFDDLLSNGYYTTRLFIGTPPQEFALIVDTGSTVTYVPCSSC--EQCGK 113
Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTL 189
P F P +SSTY+ + C+ S C ++ G C Y Y + S S+G +A + V+
Sbjct: 114 HQDPRFQPDLSSTYRPVKCNPS-CNCDDE----GKQCTYERRYAEMSSSSGVIAEDVVSF 168
Query: 190 GSTTGQAVALPGITFGC-GTNNGGLFNSKTTGIVG------------------------- 223
G+ + + FGC G L++ + GI+G
Sbjct: 169 GNES--ELKPQRAVFGCENVETGDLYSQRADGIMGLGRGRLSVVDQLVDKGVIGDSFSLC 226
Query: 224 -----LGGGDISL-----------------------ISQMRTTIAGNQ-RLGVSTPD--- 251
+GGG + L I +AG +L D
Sbjct: 227 YGGMDVGGGAMVLGQISPPPNMVFSHSNPYRSPYYNIELKELHVAGKPLKLKPKVFDEKH 286
Query: 252 -IVIDSGTTLTFLPQG-YNSNLLSVMSSMIEAQPVADPTGSL-ELCYS-----FNSLSQV 303
V+DSGTT + P+ +++ ++M + + + P + ++C+S + LS+V
Sbjct: 287 GTVLDSGTTYAYFPEAAFHALKDAIMKEIRHLKQIPGPDPNYHDICFSGAGREVSHLSKV 346
Query: 304 -PEVTIHF-RGADVKLSRSNFF---VKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYD 358
PEV + F G + LS N+ KVS +F+ + + G I+ N LV YD
Sbjct: 347 FPEVNMVFGSGQKLSLSPENYLFRHTKVSGAYCLGIFQNGNDLTTLLGGIVVRNTLVTYD 406
Query: 359 IEQQTVSFKPTDCTK 373
E + F T+C++
Sbjct: 407 RENDKIGFWKTNCSE 421
>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
Length = 359
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 53/157 (33%), Positives = 82/157 (52%), Gaps = 13/157 (8%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y++ +SIGTPP A+ DTGSDL+W +C+ C +F SS+YK LP
Sbjct: 2 EGEYMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKLP 61
Query: 148 CSSSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTL---GSTTGQAVA 198
C+S+ C+ + S +G+ C+Y YGDGS ++G++ ++ ++ G+
Sbjct: 62 CNSTHCSGM---SSAGIGPRCEETCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSF 118
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
G FGC G +N T G++GLG SLI Q+
Sbjct: 119 FDGFLFGCARKLKGDWNF-TQGLIGLGQKSHSLIQQL 154
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 55/118 (46%), Gaps = 6/118 (5%)
Query: 253 VIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSLELCY--SFNSLSQVPEVTIH 309
VIDSGTT T L P Y + S+ +I P + L+LC+ S ++ P VT +
Sbjct: 244 VIDSGTTYTLLTPPVYEAMRKSIEEQVI--LPTLGNSAGLDLCFNSSGDTSYGFPSVTFY 301
Query: 310 FRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
F + L N F S D+VC + I GN+ Q NF + YD+ +SF
Sbjct: 302 FANQVQLVLPFENIFQVTSRDVVCLSMDSSGGDLSIIGNMQQQNFHILYDLVASQISF 359
>gi|218194599|gb|EEC77026.1| hypothetical protein OsI_15382 [Oryza sativa Indica Group]
Length = 409
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 89/363 (24%), Positives = 149/363 (41%), Gaps = 85/363 (23%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y I IGTP DTGSD++W C+ CP + L+DPK SST +
Sbjct: 4 YYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKVS 63
Query: 148 CSSSQCAS----LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
C CA+ L + + C+YSV+YGDGS + G ++ + +G P
Sbjct: 64 CDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRPANS 123
Query: 202 -ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRT-------------TIAGNQR 244
+TFGCG+ GG N GI+G G + S++SQ+ TI G
Sbjct: 124 TVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDTINGGGI 183
Query: 245 LGVST----------------------------------PDIVIDSG----------TTL 260
+ P + D+G TTL
Sbjct: 184 FAIGNVVQPKVKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEKKGTIIDSGTTL 243
Query: 261 TFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFRGADVKL 317
T+LP+ Y +L+V + + + + LC+ + P++T HF D+ L
Sbjct: 244 TYLPEIVYKEIMLAVFA---KHKDITFHNVQEFLCFQYVGRVDDDFPKITFHFEN-DLPL 299
Query: 318 SR--SNFFVKVSEDIVCSVFK--GITNS----VPIYGNIMQTNFLVGYDIEQQTVSFKPT 369
+ ++F + +++ C F+ G+ + + + G+++ +N LV YD+E Q + +
Sbjct: 300 NVYPHDYFFENGDNLYCVGFQNGGLQSKDGKGMVLLGDLVLSNKLVVYDLENQVIGWTEY 359
Query: 370 DCT 372
+C+
Sbjct: 360 NCS 362
>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
2-like [Cucumis sativus]
Length = 478
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 97/369 (26%), Positives = 143/369 (38%), Gaps = 96/369 (26%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y RI IG+PP + DTGSD++W C CP D L++PK SST +
Sbjct: 73 YYARIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSSTSTLIT 132
Query: 148 CSSSQCASLNQKSCSGVN----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP--- 200
C C++ G CQY V YGDGS + G + + L G
Sbjct: 133 CDQPFCSATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHKTSETNG 192
Query: 201 GITFGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTT-------------IAG--- 241
I FGCG G S + GI+G G + S+ISQ+ T I+G
Sbjct: 193 SIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCLDSISGGGI 252
Query: 242 ------------------NQRL------GVSTPDIVID-----------------SGTTL 260
NQ GV D +D SGTTL
Sbjct: 253 FAIGEVVEPKLXNTPVVPNQAHYNVVLNGVKVGDTALDLPLGLFETSYKRGAIIDSGTTL 312
Query: 261 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL--------CYSF--NSLSQVPEVTIHF 310
+LP+ S L +M ++ AQP L+L C+ F N P VT F
Sbjct: 313 AYLPE---SIYLPLMEKILGAQP------DLKLRTVDDQFTCFVFDKNVDDGFPTVTFKF 363
Query: 311 RGADV-KLSRSNFFVKVSEDIVCSVFKG------ITNSVPIYGNIMQTNFLVGYDIEQQT 363
+ + + + ++ +D+ C ++ N V + G+++ N LV Y++E QT
Sbjct: 364 EESLILTIYPHEYLFQIRDDVWCVGWQNSGAQSKDGNEVTLLGDLVLQNKLVYYNLENQT 423
Query: 364 VSFKPTDCT 372
+ + +C+
Sbjct: 424 IGWTEYNCS 432
>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
Length = 517
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 59/200 (29%), Positives = 94/200 (47%), Gaps = 24/200 (12%)
Query: 24 AQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD 83
A +E +HR + +S + +P R AL+ + ++
Sbjct: 98 ADKDAVRIETMHRRAARSGGDRTPASPSSSPRRALSERM--------------VATVESG 143
Query: 84 IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
+ + YL+ + +GTPP + DTGSDL W QC PC C+ Q P+FDP SS+Y
Sbjct: 144 VAVGSGEYLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPC--LDCFDQVGPVFDPAASSSY 201
Query: 144 KSLPCSSSQCASLN----QKSCSGV---NCQYSVSYGDGSFSNGNLATETVTLGSTT-GQ 195
+++ C +C + ++C +C Y YGD S + G+LA E+ T+ T G
Sbjct: 202 RNVTCGDQRCGLVAPPEPPRACRRPGEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGA 261
Query: 196 AVALPGITFGCGTNNGGLFN 215
+ + + FGCG N GLF+
Sbjct: 262 SRRVDDVVFGCGHWNRGLFH 281
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 38/127 (29%), Positives = 63/127 (49%), Gaps = 6/127 (4%)
Query: 253 VIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
+IDSGTTL+ F+ Y + + M + P+ L CY+ + + +VPE+++
Sbjct: 390 IIDSGTTLSYFVEPAYQVIRQAFIDRMGRSYPLIPDFPVLSPCYNVSGVDRPEVPELSLL 449
Query: 310 FR-GADVKLSRSNFFVKVSED-IVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQTVSF 366
F GA N+F+++ D I+C G + + I GN Q NF V YD++ + F
Sbjct: 450 FADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTGMSIIGNFQQQNFHVVYDLKNNRLGF 509
Query: 367 KPTDCTK 373
P C +
Sbjct: 510 APRRCAE 516
>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
Length = 494
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 89/363 (24%), Positives = 149/363 (41%), Gaps = 85/363 (23%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y I IGTP DTGSD++W C+ CP + L+DPK SST +
Sbjct: 89 YYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKVS 148
Query: 148 CSSSQCAS----LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
C CA+ L + + C+YSV+YGDGS + G ++ + +G P
Sbjct: 149 CDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRPANS 208
Query: 202 -ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRT-------------TIAGNQR 244
+TFGCG+ GG N GI+G G + S++SQ+ TI G
Sbjct: 209 TVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDTINGGGI 268
Query: 245 LGVST----------------------------------PDIVIDSG----------TTL 260
+ P + D+G TTL
Sbjct: 269 FAIGNVVQPKVKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEKKGTIIDSGTTL 328
Query: 261 TFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFRGADVKL 317
T+LP+ Y +L+V + + + + LC+ + P++T HF D+ L
Sbjct: 329 TYLPEIVYKEIMLAVFA---KHKDITFHNVQEFLCFQYVGRVDDDFPKITFHFEN-DLPL 384
Query: 318 SR--SNFFVKVSEDIVCSVFK--GITNS----VPIYGNIMQTNFLVGYDIEQQTVSFKPT 369
+ ++F + +++ C F+ G+ + + + G+++ +N LV YD+E Q + +
Sbjct: 385 NVYPHDYFFENGDNLYCVGFQNGGLQSKDGKGMVLLGDLVLSNKLVVYDLENQVIGWTEY 444
Query: 370 DCT 372
+C+
Sbjct: 445 NCS 447
>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 478
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 97/369 (26%), Positives = 143/369 (38%), Gaps = 96/369 (26%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y RI IG+PP + DTGSD++W C CP D L++PK SST +
Sbjct: 73 YYARIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSSTSTLIT 132
Query: 148 CSSSQCASLNQKSCSGVN----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP--- 200
C C++ G CQY V YGDGS + G + + L G
Sbjct: 133 CDQPFCSATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHKTSETNG 192
Query: 201 GITFGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTT-------------IAG--- 241
I FGCG G S + GI+G G + S+ISQ+ T I+G
Sbjct: 193 SIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCLDSISGGGI 252
Query: 242 ------------------NQRL------GVSTPDIVID-----------------SGTTL 260
NQ GV D +D SGTTL
Sbjct: 253 FAIGEVVEPKLKTTPVVPNQAHYNVVLNGVKVGDTALDLPLGLFETSYKRGAIIDSGTTL 312
Query: 261 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL--------CYSF--NSLSQVPEVTIHF 310
+LP +S L +M ++ AQP L+L C+ F N P VT F
Sbjct: 313 AYLP---DSIYLPLMEKILGAQP------DLKLRTVDDQFTCFVFDKNVDDGFPTVTFKF 363
Query: 311 RGADV-KLSRSNFFVKVSEDIVCSVFKG------ITNSVPIYGNIMQTNFLVGYDIEQQT 363
+ + + + ++ +D+ C ++ N V + G+++ N LV Y++E QT
Sbjct: 364 EESLILTIYPHEYLFQIRDDVWCVGWQNSGAQSKDGNEVTLLGDLVLQNKLVYYNLENQT 423
Query: 364 VSFKPTDCT 372
+ + +C+
Sbjct: 424 IGWTEYNCS 432
>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 561
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 51/138 (36%), Positives = 73/138 (52%), Gaps = 12/138 (8%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y + + +GTPP + DTGSDL W QC PC C+ Q P +DPK SS+++++
Sbjct: 194 SGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPC--IACFEQSGPYYDPKDSSSFRNIS 251
Query: 148 CSSSQCASLNQ----KSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA--- 198
C +C ++ K C N C Y YGDGS + G+ A ET T+ TT +
Sbjct: 252 CHDPRCQLVSAPDPPKPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGTSELK 311
Query: 199 -LPGITFGCGTNNGGLFN 215
+ + FGCG N GLF+
Sbjct: 312 HVENVMFGCGHWNRGLFH 329
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 30/124 (24%), Positives = 60/124 (48%), Gaps = 4/124 (3%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF 310
+IDSGTTLT+ + + I+ + + L+ CY+ + + ++P+ I F
Sbjct: 436 IIDSGTTLTYFAEPAYEIIKEAFVRKIKGYQLVEGLPPLKPCYNVSGIEKMELPDFGILF 495
Query: 311 RGADV-KLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
V N+F+ + ++VC ++ +++ I GN Q NF + YD+++ + + P
Sbjct: 496 ADEAVWNFPVENYFIWIDPEVVCLAILGNPRSALSIIGNYQQQNFHILYDMKKSRLGYAP 555
Query: 369 TDCT 372
C
Sbjct: 556 MKCA 559
>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
Length = 749
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 51/137 (37%), Positives = 71/137 (51%), Gaps = 12/137 (8%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y + + IGTPP + DTGSDL W QC PC C+ Q P +DPK SS+++++ C
Sbjct: 190 GEYFMDVFIGTPPKHYSLILDTGSDLNWIQCVPC--IACFEQSGPYYDPKESSSFENITC 247
Query: 149 SSSQCASLNQ----KSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTT----GQAVA 198
+C ++ K C N C Y YGD S + G+ A ET T+ TT +
Sbjct: 248 HDPRCKLVSSPDPPKPCKDENQTCPYFYWYGDSSNTTGDFALETFTVNLTTPNGKSEQKH 307
Query: 199 LPGITFGCGTNNGGLFN 215
+ + FGCG N GLF+
Sbjct: 308 VENVMFGCGHWNRGLFH 324
Score = 57.8 bits (138), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 61/124 (49%), Gaps = 4/124 (3%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF 310
+IDSGTTLT+ + + I+ + + L+ CY+ + + ++P+ I F
Sbjct: 431 IIDSGTTLTYFAEPAYEIIKEAFMKKIKGYELVEGFPPLKPCYNVSGIEKMELPDFGILF 490
Query: 311 R-GADVKLSRSNFFVKVSEDIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQTVSFKP 368
GA N+F+++ D+VC G S + I GN Q NF + YD+++ + + P
Sbjct: 491 SDGAMWDFPVENYFIQIEPDLVCLAILGTPKSALSIIGNYQQQNFHILYDMKKSRLGYAP 550
Query: 369 TDCT 372
CT
Sbjct: 551 MKCT 554
>gi|125527370|gb|EAY75484.1| hypothetical protein OsI_03384 [Oryza sativa Indica Group]
Length = 453
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 68/195 (34%), Positives = 100/195 (51%), Gaps = 22/195 (11%)
Query: 57 ALTRSLNRLNHFN----QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGS 112
A+ RS +RL+ N+ + +++Q + + +Y + IGTP T ADTGS
Sbjct: 54 AVQRSRSRLSMLAARAVSNAGAAPGESAQTPLKKGSGDYAMSFGIGTPATGLSGEADTGS 113
Query: 113 DLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV-------- 164
DLIWT+C C ++C + SP + P SS+ + C C L + CS V
Sbjct: 114 DLIWTKCGAC--ARCSPRGSPSYYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGGSGSG 171
Query: 165 NCQYSVSYGDG----SFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTG 220
NC Y +YG+ ++ G L TET T G A A PGI FGC + G F + +G
Sbjct: 172 NCSYHYAYGNARDTHHYTEGILMTETFTFGD---DAAAFPGIAFGCTLRSEGGFGTG-SG 227
Query: 221 IVGLGGGDISLISQM 235
+VGLG G +SL++Q+
Sbjct: 228 LVGLGRGKLSLVTQL 242
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 57/115 (49%), Gaps = 7/115 (6%)
Query: 252 IVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYS-FNSLSQVPEVTIH 309
++ DSGTTLT LP Y ++S M +P +C++ +S + P + +H
Sbjct: 326 VIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDDDLICFTGGSSTTTFPSMVLH 385
Query: 310 FR-GADVKLSRSNFFVKV----SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 359
F GAD+ LS N+ ++ E C + ++ I GNIMQ +F V +D+
Sbjct: 386 FDGGADMDLSTENYLPQMQGQNGETARCWSVVKSSQALTIIGNIMQMDFHVVFDL 440
>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
Length = 449
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 97/384 (25%), Positives = 161/384 (41%), Gaps = 99/384 (25%)
Query: 79 ASQADIIP-NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP-----PSQCYMQDS 132
A+ + P ++ + + + IGTPP R + DTGSDLIWTQC + Q
Sbjct: 71 AADVPVAPLSDQGHSLTVGIGTPPQPRTLIVDTGSDLIWTQCSMLSRRTRTAASASRQRE 130
Query: 133 PLFDPKMSSTYKSLPCSSSQC--ASLNQKSCS---------------------------G 163
PL++P+ SS++ LPCS C + K+C+ G
Sbjct: 131 PLYEPRRSSSFAYLPCSDRLCQEGQFSYKNCARNNRCMYDELYGSAEAGGVLASETFTFG 190
Query: 164 VNCQYSV--SYGDGSFSNGNLATETVTLGSTTG-----QAVALPGITFGCGTNNGGLFNS 216
VN + S+ +G G+ S G+L + +G + G +++P ++ C T F
Sbjct: 191 VNAKVSLPLGFGCGALSAGDLVGASGLMGLSPGIMSLVSQLSVPRFSY-CLTP----FAE 245
Query: 217 KTTGIVGLGG----------GDISLISQMRTTIA--------------GNQRL------- 245
+ T + G G + S +R G +RL
Sbjct: 246 RKTSPLLFGAMADLRRYRTTGTVQTTSILRNPAMETAYYYVPLVGLSLGTKRLDVPATSL 305
Query: 246 GVSTPD----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEA--QPVADPTGS----LELCY 295
G+ PD ++DSG+T+++L + + +V +++EA PVA+ T ELC+
Sbjct: 306 GMIKPDGSGGTIVDSGSTMSYLEE---TAFRAVKKAVVEAVRLPVANGTDEDYDDYELCF 362
Query: 296 SFNS-----LSQVPEVTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNS--VPIYGN 347
+ + + P + +HF G A + L R N+F + ++C + V I GN
Sbjct: 363 ALPTGVAMEAVKTPPLVLHFDGGAAMTLPRDNYFQEPRAGLMCLAVGTSPDGFGVSIIGN 422
Query: 348 IMQTNFLVGYDIEQQTVSFKPTDC 371
+ Q N V +D+ Q SF PT C
Sbjct: 423 VQQQNMHVLFDVRNQKFSFAPTKC 446
>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 455
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 57/162 (35%), Positives = 81/162 (50%), Gaps = 17/162 (10%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC-YMQDSPLFDPKMSSTYKSL 146
+ Y + + IGTPP L VADTGSDLIW +C PC C + F + S+TY ++
Sbjct: 83 SGQYFVSLRIGTPPQTLLLVADTGSDLIWVKCSPC--RNCSHRSPGSAFFARHSTTYSAI 140
Query: 147 PCSSSQCASLNQKSCSGVN-------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
C S QC + + N C+Y +Y D S + G + E +TL ++TG+ L
Sbjct: 141 HCYSPQCQLVPHPHPNPCNRTRLHSPCRYQYTYADSSTTTGFFSKEALTLNTSTGKVKKL 200
Query: 200 PGITFGCGTN------NGGLFNSKTTGIVGLGGGDISLISQM 235
G++FGCG G F G++GLG IS SQ+
Sbjct: 201 NGLSFGCGFRISGPSLTGASFEG-AQGVMGLGRAPISFSSQL 241
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 31/124 (25%), Positives = 63/124 (50%), Gaps = 5/124 (4%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHF 310
+IDSGTTLTF+ + + +L ++ A+PT +LC + + +++ +P ++ +
Sbjct: 329 IIDSGTTLTFITEPAYTEILKAFKKRVKLPSPAEPTPGFDLCMNVSGVTRPALPRMSFNL 388
Query: 311 RGADV-KLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFK 367
G V N+F++ + I C + ++ + GN+MQ FL+ +D ++ + F
Sbjct: 389 AGGSVFSPPPRNYFIETGDQIKCLAVQPVSQDGGFSVLGNLMQQGFLLEFDRDKSRLGFT 448
Query: 368 PTDC 371
C
Sbjct: 449 RRGC 452
>gi|125571687|gb|EAZ13202.1| hypothetical protein OsJ_03122 [Oryza sativa Japonica Group]
Length = 453
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 68/195 (34%), Positives = 100/195 (51%), Gaps = 22/195 (11%)
Query: 57 ALTRSLNRLNHFNQ----NSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGS 112
A+ RS +RL+ N+ + +++Q + + +Y + IGTP T ADTGS
Sbjct: 54 AVQRSRSRLSMLAARAVSNAGAAPGESAQTPLKKGSGDYAMSFGIGTPATGLSGEADTGS 113
Query: 113 DLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV-------- 164
DLIWT+C C ++C + SP + P SS+ + C C L + CS V
Sbjct: 114 DLIWTKCGAC--ARCSPRGSPSYYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGGSGSG 171
Query: 165 NCQYSVSYGDG----SFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTG 220
NC Y +YG+ ++ G L TET T G A A PGI FGC + G F + +G
Sbjct: 172 NCSYHYAYGNARDTHHYTEGILMTETFTFGD---DAAAFPGIAFGCTLRSEGGFGTG-SG 227
Query: 221 IVGLGGGDISLISQM 235
+VGLG G +SL++Q+
Sbjct: 228 LVGLGRGKLSLVTQL 242
Score = 47.0 bits (110), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 57/115 (49%), Gaps = 7/115 (6%)
Query: 252 IVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYS-FNSLSQVPEVTIH 309
++ DSGTTLT LP Y ++S M +P +C++ +S + P + +H
Sbjct: 326 VIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDDDLICFTGGSSTTTFPSMVLH 385
Query: 310 FR-GADVKLSRSNFFVKV----SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 359
F GAD+ LS N+ ++ E C + ++ I GNIMQ +F V +D+
Sbjct: 386 FDGGADMDLSTENYLPQMQGQNGETARCWSVVKSSQALTIIGNIMQMDFHVVFDL 440
>gi|255685712|gb|ACU28345.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
Length = 91
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 48/102 (47%), Positives = 63/102 (61%), Gaps = 13/102 (12%)
Query: 93 IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
+++ IGTPP E AV DTGS+LIWTQC PC CY Q +P+FDP SST+K C++
Sbjct: 1 MKLQIGTPPFEIEAVLDTGSELIWTQCLPC--LHCYDQKAPIFDPSKSSTFKETRCNTPD 58
Query: 153 CASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
+C Y + Y D S++ G LATETVT+ ST+G
Sbjct: 59 -----------HSCXYKIVYDDKSYTQGTLATETVTIHSTSG 89
>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 73/232 (31%), Positives = 109/232 (46%), Gaps = 27/232 (11%)
Query: 18 VVSPIEAQTGGFSVELIHRDSPKSPFYNSSETP-------YQRLRDA-LTRSLNRLNHFN 69
V S A + G +V L HR P SP S++ P + +LR + R L+ +
Sbjct: 52 VCSVTPASSSGTTVPLNHRYGPCSP-APSAKVPTILELLEHDQLRAKYIQRKLSGTDGLQ 110
Query: 70 Q-NSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
+ ++ ++ S D + Y+I + IG+P + + DTGSD+ W +C
Sbjct: 111 PLDLTVPTTLGSALDTM----EYVITVGIGSPAVTQTMMIDTGSDVSWVRCNS------- 159
Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKSCSGVNCQYSVSYGDGSFSNGNLATET 186
LFDP S+TY CSS+ CA L N CS CQY V YGDGS + G +++T
Sbjct: 160 TDGLTLFDPSKSTTYAPFSCSSAACAQLGNNGDGCSNSGCQYRVQYGDGSNTTGTYSSDT 219
Query: 187 VTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
+ L ++ + FGC + K G++GLGG SL+SQ T
Sbjct: 220 LALSASD----TVTDFHFGCSHHEEDFDGEKIDGLMGLGGDAQSLVSQTAAT 267
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 42/124 (33%), Positives = 57/124 (45%), Gaps = 11/124 (8%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVM-SSMIE-AQPVADPTGSLELCYSFNSLSQV--PEVTI 308
V+DSGT +T+LP+ S L S SSM A P G L+ CY F L V P V++
Sbjct: 339 VMDSGTVITWLPRRAYSALSSAFRSSMTRLRHQRAAPLGILDTCYDFTGLVNVSIPAVSL 398
Query: 309 HFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
GA V L + ++ C F T+ I GN+ Q F V +D+ Q F+
Sbjct: 399 VLDGGAVVDLDGNGIMIQ-----DCLAFAA-TSGDSIIGNVQQRTFEVLHDVGQGVFGFR 452
Query: 368 PTDC 371
C
Sbjct: 453 SGAC 456
>gi|255685714|gb|ACU28346.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
Length = 91
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 48/102 (47%), Positives = 63/102 (61%), Gaps = 13/102 (12%)
Query: 93 IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
+++ IGTPP E AV DTGS+LIWTQC PC CY Q +P+FDP SST+K C++
Sbjct: 1 MKLQIGTPPFEIEAVLDTGSELIWTQCLPC--LHCYDQKAPIFDPSKSSTFKETRCNTPD 58
Query: 153 CASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
+C Y + Y D S++ G LATETVT+ ST+G
Sbjct: 59 -----------HSCSYKIVYDDKSYTQGTLATETVTIHSTSG 89
>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
Length = 378
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 89/369 (24%), Positives = 142/369 (38%), Gaps = 85/369 (23%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQC------YMQDSPLFDPKMSS 141
Y + +GTP + + VADTGSDL W C+ C C ++ +F +SS
Sbjct: 10 GQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANLSS 69
Query: 142 TYKSLPCSSSQCA-------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
++K++PC + C SL C Y Y DGS + G A ETVT+ G
Sbjct: 70 SFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVELKEG 129
Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGLG--------------GGDIS---------- 230
+ + L + GC + G G++GLG GG S
Sbjct: 130 RKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCLVDHLSHK 189
Query: 231 -------------------------LISQMRTTIAGNQRLGVS-------TPDIVID--- 255
L+ M + +G+S P V D
Sbjct: 190 NVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEVWDVKG 249
Query: 256 -SGTTL------TFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPE 305
GT L TFL + Y + ++ S+++ + V G LE C++ + VP
Sbjct: 250 AGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTGFEESLVPR 309
Query: 306 VTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQT 363
+ HF GA+ + ++ + ++ + C F + + GNIMQ N L +D+ +
Sbjct: 310 LVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNHLWEFDLGLKK 369
Query: 364 VSFKPTDCT 372
+ F P+ CT
Sbjct: 370 LGFAPSSCT 378
>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
Length = 439
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 111/451 (24%), Positives = 172/451 (38%), Gaps = 96/451 (21%)
Query: 3 TFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDS--PKSPFYNSSETPYQRLRDALTR 60
T LSC+ L + + ++L HRD+ PK P R+ D +
Sbjct: 4 TLLSCLITTLLL---ITVADSMKDTSVRLKLAHRDTLLPK---------PLSRIEDVIGA 51
Query: 61 SLNR--LNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
R L +NS++ + I A Y I +GTP + V DTGS+L W
Sbjct: 52 DQKRHSLISRKRNSTVGVKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVN 111
Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCA-------SLNQKSCSGVNCQYSVS 171
C + + +F S ++K++ C + C SL C Y
Sbjct: 112 CRYRARGK---DNRRVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYR 168
Query: 172 YGDGSFSNGNLA--TETVTL---------GSTTGQAVALPGITFGCGTNNGGL------F 214
Y DGS + G A T TV L G G + + G +F GL F
Sbjct: 169 YADGSAAQGVFAKETITVGLTNGRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSF 228
Query: 215 NSKTTGIVG----------LGGGDIS---LISQMRTTIAGNQR----------------- 244
S T + G L ++S + R+T +R
Sbjct: 229 TSTATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRSTKTAFRRTTPLDLTRIPPFYAINV 288
Query: 245 ----LGVSTPDI-------------VIDSGTTLTFLPQG-YNSNLLSVMSSMIEAQPVAD 286
LG DI ++DSGT+LT L Y + + ++E + V
Sbjct: 289 IGISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKP 348
Query: 287 PTGSLELCYSFNS---LSQVPEVTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGI-TNS 341
+E C+SF S +S++P++T H +G A + R ++ V + + C F T +
Sbjct: 349 EGVPIEYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFVSAGTPA 408
Query: 342 VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
+ GNIMQ N+L +D+ T+SF P+ CT
Sbjct: 409 TNVIGNIMQQNYLWEFDLMASTLSFAPSACT 439
>gi|357157325|ref|XP_003577760.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 413
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 94/368 (25%), Positives = 156/368 (42%), Gaps = 84/368 (22%)
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
D+ P +Y + ++IG P DTGSDL W QC+ P C PL+ P +
Sbjct: 44 GDVYPT-GHYYVTMNIGDPAKPYFLDIDTGSDLTWLQCD-APCQSCNKVPHPLYKPTKN- 100
Query: 142 TYKSLPCSSSQCASL------NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
K +PC++S C +L N+K C Y + Y D + S G L T+ TL
Sbjct: 101 --KLVPCAASICTTLHSAQSPNKKCAVPQQCDYQIKYTDSASSLGVLVTDNFTLPLRNSS 158
Query: 196 AVALPGITFGCGTN----NGGLFNSKTTGIVGLGGGDISLISQMRT-------------- 237
+V P TFGCG + G+ + T G++GLG G +SL+SQ++
Sbjct: 159 SVR-PSFTFGCGYDQQVGKNGVVQATTDGLLGLGKGSVSLVSQLKVLGITKNVLGHCLST 217
Query: 238 -------------------------TIAGN-------------QRLGVSTPDIVIDSGTT 259
+ +GN + LGV ++V DSG+T
Sbjct: 218 NGGGFLFFGDNVVPTSRATWVPMVRSTSGNYYSPGSGTLYFDRRSLGVKPMEVVFDSGST 277
Query: 260 LT-FLPQGYNSNLLSVMSSMIEA-QPVADPTGSLELCYS----FNSLSQVPE----VTIH 309
T F Q Y + + ++ + + ++ Q V+DP SL LC+ F S+S V + +
Sbjct: 278 YTYFAAQPYQATVSALKAGLSKSLQQVSDP--SLPLCWKGQKVFKSVSDVKNDFKSLFLS 335
Query: 310 F-RGADVKLSRSNFFVKVSEDIVC-SVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTVS 365
F + + +++ N+ + C + G + I G+I + L+ YD E+ +
Sbjct: 336 FVKNSVLEIPPENYLIVTKNGNACLGILDGSAAKLTFNIIGDITMQDQLIIYDNERGQLG 395
Query: 366 FKPTDCTK 373
+ C++
Sbjct: 396 WIRGSCSR 403
>gi|125552105|gb|EAY97814.1| hypothetical protein OsI_19735 [Oryza sativa Indica Group]
Length = 424
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 102/415 (24%), Positives = 168/415 (40%), Gaps = 86/415 (20%)
Query: 29 FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNN 88
+EL H D+ N T +R+R A R+ +R + +++ ++ +
Sbjct: 23 LRLELAHVDA------NEHCTMEERVRRATERTHHR-RLLHASTAAAAGGVAAPLRWSGK 75
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC--------PPSQCYMQDSPLFDPKMS 140
Y+ IG PP AV DTGSDL+WTQC C C+ Q+ P ++ +S
Sbjct: 76 TQYIASYGIGDPPQPAEAVVDTGSDLVWTQCSTCRLPAAAAAGGGGCFPQNLPYYNFSLS 135
Query: 141 STYKSLPCS---------SSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLG 190
T +++PC + + A + SG + C + SYG G + G L T+ T
Sbjct: 136 RTARAVPCDDDDGALCGVAPETAGCARGGGSGDDACVVAASYGAG-VALGVLGTDAFTFP 194
Query: 191 STTGQAVALPGITFGCGTNN----GGLFNSKTTGIVGLGGGDISL--------------- 231
S++ +A FGC + G L + +GI+GLG G +SL
Sbjct: 195 SSSSVTLA-----FGCVSQTRISPGAL--TGASGIIGLGRGALSLNPKDSPFSTFYYLPL 247
Query: 232 --ISQMRTTI---AGNQRLGVSTPDI-----VIDSGTTLTFLPQGYNSNLLSVMSSMIEA 281
++ T+ AG L + P + +IDSG+ T L + L ++ +
Sbjct: 248 VGLAAGNATVALPAGAFDLREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRG 307
Query: 282 Q-----PVADPTGSLELCYSFN------SLSQVPEVTIHFR-----GADVKLSRSNFFVK 325
P A G+LELC + + VP + + F G ++ + ++ +
Sbjct: 308 SGSLVPPPAKLGGALELCVEAGDDGDSLAAAAVPSLVLRFDDGVGGGRELVIPAEKYWAR 367
Query: 326 VSEDIVCSVFKGI--------TNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
V C TN I GN MQ + V YD+ +SF+P +C+
Sbjct: 368 VEASTWCMAVVSSASGNATLPTNETTIIGNFMQQDMRVLYDLANGLLSFQPANCS 422
>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
protein [Arabidopsis thaliana]
gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 452
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 65/210 (30%), Positives = 96/210 (45%), Gaps = 22/210 (10%)
Query: 40 KSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGT 99
KSPF + ++ R SL R S + S AS + Y + + IG
Sbjct: 39 KSPFPSPTQALALDTRRLHFLSLRRKPIPFVKSPVVSGAAS------GSGQYFVDLRIGQ 92
Query: 100 PPTERLAVADTGSDLIWTQCEPCPPSQC-YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ 158
PP L +ADTGSDL+W +C C C + + +F P+ SST+ C C + +
Sbjct: 93 PPQSLLLIADTGSDLVWVKCSAC--RNCSHHSPATVFFPRHSSTFSPAHCYDPVCRLVPK 150
Query: 159 KSCSGV--------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN 210
+ + C Y Y DGS ++G A ET +L +++G+ L + FGCG
Sbjct: 151 PDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTSSGKEARLKSVAFGCGFRI 210
Query: 211 GGLFNSKTT-----GIVGLGGGDISLISQM 235
G S T+ G++GLG G IS SQ+
Sbjct: 211 SGQSVSGTSFNGANGVMGLGRGPISFASQL 240
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/128 (26%), Positives = 64/128 (50%), Gaps = 9/128 (7%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADP-TGSLELCYSFNSLSQ----VPEVT 307
V+DSGTTL FL + ++++ + ++ P+AD T +LC + + +++ +P +
Sbjct: 324 VVDSGTTLAFLAEPAYRSVIAAVRRRVKL-PIADALTPGFDLCVNVSGVTKPEKILPRLK 382
Query: 308 IHFRGADVKL-SRSNFFVKVSEDIVCSVFKGITNSV--PIYGNIMQTNFLVGYDIEQQTV 364
F G V + N+F++ E I C + + V + GN+MQ FL +D ++ +
Sbjct: 383 FEFSGGAVFVPPPRNYFIETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSRL 442
Query: 365 SFKPTDCT 372
F C
Sbjct: 443 GFSRRGCA 450
>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 546
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 50/137 (36%), Positives = 73/137 (53%), Gaps = 12/137 (8%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y I + +GTPP + DTGSDL W QC PC +C+ Q+ P +DP SS+Y+++ C
Sbjct: 179 GEYFIDVFVGTPPKHFSLILDTGSDLNWIQCVPC--YECFEQNGPHYDPGQSSSYRNIGC 236
Query: 149 SSSQCASLNQ----KSCSGVN--CQYSVSYGDGSFSNGNLATET----VTLGSTTGQAVA 198
S+C ++ + C N C Y YGD S + G+ A ET +T+ S +
Sbjct: 237 HDSRCHLVSSPDPPQPCKAENQTCPYYYWYGDSSNTTGDFALETFTVNLTMSSGKPELRR 296
Query: 199 LPGITFGCGTNNGGLFN 215
+ + FGCG N GLF+
Sbjct: 297 VENVMFGCGHWNRGLFH 313
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/151 (27%), Positives = 72/151 (47%), Gaps = 10/151 (6%)
Query: 226 GGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA 285
GG++ I + + IA + G +IDSGTTL++ + + + ++ PV
Sbjct: 398 GGEVVNIPEEKWQIATDGSGGT-----IIDSGTTLSYFAEPAYQVIKEAFMAKVKGYPVV 452
Query: 286 DPTGSLELCYSFNSLSQ--VPEVTIHFR-GADVKLSRSNFFVKVS-EDIVCSVFKGIT-N 340
LE CY+ + Q +P+ I F GA N+F+++ ++VC G +
Sbjct: 453 KDFPVLEPCYNVTGVEQPDLPDFGIVFSDGAVWNFPVENYFIEIEPREVVCLAILGTPPS 512
Query: 341 SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
++ I GN Q NF + YD ++ + F PT C
Sbjct: 513 ALSIIGNYQQQNFHILYDTKKSRLGFAPTKC 543
>gi|255685716|gb|ACU28347.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
gi|255685726|gb|ACU28352.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
gi|255685728|gb|ACU28353.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
Length = 91
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 48/102 (47%), Positives = 63/102 (61%), Gaps = 13/102 (12%)
Query: 93 IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
+++ IGTPP E AV DTGS+LIWTQC PC CY Q +P+FDP SST+K C++
Sbjct: 1 MKLQIGTPPFEIEAVLDTGSELIWTQCLPC--LHCYDQKAPIFDPSKSSTFKETRCNTPD 58
Query: 153 CASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
+C Y + Y D S++ G LATETVT+ ST+G
Sbjct: 59 -----------HSCPYKIVYDDKSYTQGTLATETVTIHSTSG 89
>gi|255685722|gb|ACU28350.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
Length = 91
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 48/102 (47%), Positives = 63/102 (61%), Gaps = 13/102 (12%)
Query: 93 IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
+++ IGTPP E AV DTGS+LIWTQC PC CY Q +P+FDP SST+K C++
Sbjct: 1 MKLQIGTPPFEXEAVLDTGSELIWTQCLPC--LHCYDQKAPIFDPSKSSTFKETRCNTPB 58
Query: 153 CASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
+C Y + Y D S++ G LATETVT+ ST+G
Sbjct: 59 -----------HSCPYKJVYDDKSYTXGTLATETVTIHSTSG 89
>gi|255685718|gb|ACU28348.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
gi|255685720|gb|ACU28349.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
gi|255685724|gb|ACU28351.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
Length = 91
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 49/102 (48%), Positives = 65/102 (63%), Gaps = 13/102 (12%)
Query: 93 IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
+++ IGTPP E AV DTGS+LIWTQC PC CY Q +P+FDP SST+K ++
Sbjct: 1 MKLQIGTPPFEIEAVLDTGSELIWTQCLPC--LHCYDQKAPIFDPSKSSTFK-----ETR 53
Query: 153 CASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
C + N +C Y + Y D S++ G LATETVT+ ST+G
Sbjct: 54 CNTPNH------SCPYKIVYDDKSYTLGTLATETVTIHSTSG 89
>gi|168065778|ref|XP_001784824.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162663621|gb|EDQ50376.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 392
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 102/391 (26%), Positives = 151/391 (38%), Gaps = 77/391 (19%)
Query: 21 PIEAQTGGFSVELIHRDSPKSPF-----------YNSSETPYQRLRDALTRSLNRLNHFN 69
P E + V+LIHRD P SP ++ YQ+ R LT RL F
Sbjct: 39 PSENGSQSIRVDLIHRDHPLSPIAPAEGTTLSERLQAAAKRYQKRRTLLTSRGRRLGAFT 98
Query: 70 QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM 129
I + Y+I + GTP + DTGSDL+W QC PC CY
Sbjct: 99 ------------TPITAESFEYVIPLFFGTPLQPFTGMVDTGSDLVWIQCLPC--INCYT 144
Query: 130 -QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVT 188
P FDP SS+ +PC+ + C+ N + G+L
Sbjct: 145 THPHPEFDPTTSSSEAYVPCT-------DPALCTASNFS----------ARGDLDVFAFC 187
Query: 189 LGSTTGQAVALPGITFGC--GTNNGGLF-------NSKTTGIVGLGGGDISLISQMRTTI 239
L T + FG TN GL S + VG+ G ++ + T
Sbjct: 188 LVPYTAATTLTSALVFGSRDATNALGLVYTPLLQGTSPSFYWVGMVGVSVAGVDAGIPT- 246
Query: 240 AGNQRLGVSTPDIVIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVA-DPTGSL------ 291
L ST ++ DSGT LT F P+ Y+ S+ ++ PVA DP ++
Sbjct: 247 ----ALFASTDGVLFDSGTPLTYFAPEIYDPLHQSIAGAI--PYPVAPDPVDAVVAKPLN 300
Query: 292 ELCYSFNSLSQ--VPEVTIHFRGADV-------KLSRSNFFVKVSEDIVC-SVFKGITNS 341
LC+ + +P + HF AD L N ++ + C ++ +G + +
Sbjct: 301 RLCFDLAGVQSPVLPTMAYHFTDADAAGATVDFDLGLENIYMNDMNTVWCLAIVRGESGN 360
Query: 342 VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
I GNI Q N + +D+ + + DCT
Sbjct: 361 PSIVGNIQQANHYIEHDVALNRIGWTSKDCT 391
>gi|195645150|gb|ACG42043.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 415
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 91/371 (24%), Positives = 154/371 (41%), Gaps = 87/371 (23%)
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
Q D+ P +Y + ++IG P DTGSDL W QC+ P C PL+ P
Sbjct: 44 QGDVYPT-GHYYVTMNIGNPAKPYFLDVDTGSDLTWLQCD-APCRSCNKVPHPLYRP--- 98
Query: 141 STYKSLPCSSSQCASL------NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
+ + +PC+++ C +L N K S C Y + Y D + S G L ++ +L +
Sbjct: 99 TANRLVPCANALCTALHSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMRSS 158
Query: 195 QAVALPGITFGCGTN----NGGLFNSKTTGIVGLGGGDISLISQMR-------------- 236
PG+TFGCG + G + G++GLG G +SL+SQ++
Sbjct: 159 N--IRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLS 216
Query: 237 -------------------TTIAGNQR-------------------LGVSTPDIVIDSGT 258
T + QR LGV ++V DSG+
Sbjct: 217 TNGGGFLFFGDDVVPSSRVTWVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEVVFDSGS 276
Query: 259 TLT-FLPQGYNSNLLSVMSSMIEA-QPVADPTGSLELCYS--------FNSLSQVPEVTI 308
T T F Q Y + + ++ + ++ + V+DPT L LC+ F+ ++ + +
Sbjct: 277 TYTYFTAQPYQAVVSALKGGLSKSLKQVSDPT--LPLCWKGQKAFKSVFDVKNEFKSMFL 334
Query: 309 HF---RGADVKLSRSNFFVKVSEDIVC-SVFKGITN--SVPIYGNIMQTNFLVGYDIEQQ 362
F + A +++ N+ + VC + G S + G+I + +V YD E+
Sbjct: 335 SFSSAKNAAMEIPPENYLIVTKNGNVCLGILDGTAAKLSFNVIGDITMQDQMVIYDNEKS 394
Query: 363 TVSFKPTDCTK 373
+ + CT+
Sbjct: 395 QLGWARGACTR 405
>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 560
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 50/137 (36%), Positives = 73/137 (53%), Gaps = 12/137 (8%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y + + +GTPP + DTGSDL W QC PC C+ Q+ P +DPK SS++K++ C
Sbjct: 193 GEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPC--YACFEQNGPYYDPKDSSSFKNITC 250
Query: 149 SSSQCASLNQ----KSCSGV--NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA---- 198
+C ++ + C G +C Y YGD S + G+ A ET T+ TT +
Sbjct: 251 HDPRCQLVSSPDPPQPCKGETQSCPYFYWYGDSSNTTGDFALETFTVNLTTPEGKPELKI 310
Query: 199 LPGITFGCGTNNGGLFN 215
+ + FGCG N GLF+
Sbjct: 311 VENVMFGCGHWNRGLFH 327
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 62/124 (50%), Gaps = 5/124 (4%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF 310
+IDSGTTLT+ + + I+ P+ + L+ CY+ + + ++PE I F
Sbjct: 434 IIDSGTTLTYFAEPAYEIIKEAFMRKIKGFPLVETFPPLKPCYNVSGVEKMELPEFAILF 493
Query: 311 R-GADVKLSRSNFFVKVS-EDIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQTVSFK 367
GA N+F+++ ED+VC G S + I GN Q NF + YD+++ + +
Sbjct: 494 ADGAMWDFPVENYFIQIEPEDVVCLAILGTPRSALSIIGNYQQQNFHILYDLKKSRLGYA 553
Query: 368 PTDC 371
P C
Sbjct: 554 PMKC 557
>gi|219886219|gb|ACL53484.1| unknown [Zea mays]
gi|219888509|gb|ACL54629.1| unknown [Zea mays]
gi|414588374|tpg|DAA38945.1| TPA: nucellin-like aspartic protease [Zea mays]
Length = 415
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 91/371 (24%), Positives = 154/371 (41%), Gaps = 87/371 (23%)
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
Q D+ P +Y + ++IG P DTGSDL W QC+ P C PL+ P +
Sbjct: 44 QGDVYPT-GHYYVTMNIGNPAKPYFLDVDTGSDLTWLQCD-APCRSCNKVPHPLYRPTAN 101
Query: 141 STYKSLPCSSSQCASL------NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
+ +PC+++ C +L N K S C Y + Y D + S G L ++ +L +
Sbjct: 102 ---RLVPCANALCTALHSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMRSS 158
Query: 195 QAVALPGITFGCGTN----NGGLFNSKTTGIVGLGGGDISLISQMR-------------- 236
PG+TFGCG + G + G++GLG G +SL+SQ++
Sbjct: 159 N--IRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLS 216
Query: 237 -------------------TTIAGNQR-------------------LGVSTPDIVIDSGT 258
T + QR LGV ++V DSG+
Sbjct: 217 TNGGGFLFFGDDVVPSSRVTWVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEVVFDSGS 276
Query: 259 TLT-FLPQGYNSNLLSVMSSMIEA-QPVADPTGSLELCYS--------FNSLSQVPEVTI 308
T T F Q Y + + ++ + ++ + V+DPT L LC+ F+ ++ + +
Sbjct: 277 TYTYFTAQPYQAVVSALKGGLSKSLKQVSDPT--LPLCWKGQKAFKSVFDVKNEFKSMFL 334
Query: 309 HF---RGADVKLSRSNFFVKVSEDIVC-SVFKGITN--SVPIYGNIMQTNFLVGYDIEQQ 362
F + A +++ N+ + VC + G S + G+I + +V YD E+
Sbjct: 335 SFASAKNAAMEIPPENYLIVTKNGNVCLGILDGTAAKLSFNVIGDITMQDQMVIYDNEKS 394
Query: 363 TVSFKPTDCTK 373
+ + CT+
Sbjct: 395 QLGWARGACTR 405
>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 559
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 50/137 (36%), Positives = 71/137 (51%), Gaps = 12/137 (8%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y + + +GTPP + DTGSDL W QC PC C+ Q P +DPK SS+++++ C
Sbjct: 193 GEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPC--IACFEQSGPYYDPKDSSSFRNISC 250
Query: 149 SSSQCASLNQKS----CSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTT----GQAVA 198
+C ++ C N C Y YGDGS + G+ A ET T+ TT +
Sbjct: 251 HDPRCQLVSSPDPPNPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGKSELKH 310
Query: 199 LPGITFGCGTNNGGLFN 215
+ + FGCG N GLF+
Sbjct: 311 VENVMFGCGHWNRGLFH 327
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/124 (25%), Positives = 62/124 (50%), Gaps = 4/124 (3%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF 310
+IDSGTTLT+ + + I+ + + L+ CY+ + + ++P+ I F
Sbjct: 434 IIDSGTTLTYFAEPAYEIIKEAFVRKIKGYELVEGLPPLKPCYNVSGIEKMELPDFGILF 493
Query: 311 R-GADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
GA N+F+++ D+VC ++ +++ I GN Q NF + YD+++ + + P
Sbjct: 494 ADGAVWNFPVENYFIQIDPDVVCLAILGNPRSALSIIGNYQQQNFHILYDMKKSRLGYAP 553
Query: 369 TDCT 372
C
Sbjct: 554 MKCA 557
>gi|357155293|ref|XP_003577072.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Brachypodium distachyon]
Length = 429
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 95/367 (25%), Positives = 143/367 (38%), Gaps = 90/367 (24%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM---QDSPLFDPKMSSTY 143
+ + + IS+GTPP L DTGS L W C+ C S C+ + +FDP S+TY
Sbjct: 71 HEGKFFMDISLGTPPVANLVTVDTGSTLSWVVCQRCQIS-CHTTAPEAGSVFDPDKSTTY 129
Query: 144 KSLPCSSSQCASLNQKSCSGVN-------CQYSVSYG---DGSFSNGNLATETVTLGSTT 193
+ + CSS CA + + + C YS+ YG G +S G L T+ +TL S++
Sbjct: 130 ELVGCSSRDCADVQRSLVAPFGCIEETDTCLYSLRYGSGPSGQYSAGRLGTDKLTLASSS 189
Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRT---------------T 238
+ G FGC ++ F +G++G GG + S +Q+ T
Sbjct: 190 S---IIDGFIFGCSGDDS--FKGYESGVIGFGGANFSFFNQVARQTNYRAFSYCFPGDHT 244
Query: 239 IAGNQRLG-----------------------VSTPDIVIDSGTTLTFLPQGYNSNLLSVM 275
G +G + D+++D G L Y ++ V
Sbjct: 245 AEGFLSIGAYPKDELVYTNLIPHFGDRSVYSLQQIDMMVD-GNRLQVDQSEYTKRMMVVD 303
Query: 276 SSMIEA---QPVADP-----------------TGSLELCYSFNSLSQV-----PEVTIHF 310
S ++ PV D T E C+ N V P V + F
Sbjct: 304 SGTVDTFLLGPVFDAFSKAMASAMQAKGFLSDTVGTETCFRPNGGDSVDSGDLPTVEMRF 363
Query: 311 RGADVKLSRSNFFVKV--SEDIVCSVFK----GITNSVPIYGNIMQTNFLVGYDIEQQTV 364
G +KL N F + S D +C FK G+ N V I GN +F V YD++
Sbjct: 364 IGTTLKLPPENVFHDLLPSHDKICLAFKPDVAGVRN-VQILGNKATXSFRVVYDLQAMYF 422
Query: 365 SFKPTDC 371
F+ C
Sbjct: 423 GFQAGAC 429
>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
Length = 469
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 92/333 (27%), Positives = 124/333 (37%), Gaps = 78/333 (23%)
Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKS--CSGV 164
V DT SD+ W QC PCP CY Q L+DP SS+ C+S C L + C+
Sbjct: 147 VLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYANGCTNN 206
Query: 165 N-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC--GTNNGGLFNSKTTGI 221
N CQY V Y DG+ + G ++ +T+ T A+ FGC G F S GI
Sbjct: 207 NQCQYRVRYPDGTSTAGTYISDLLTITPAT----AVRSFQFGCSHGVQGSFSFGSSAAGI 262
Query: 222 VGLGGGDISLISQMRTTIA------------------GNQRLG----------------- 246
+ LGGG SL+SQ T G R+
Sbjct: 263 MALGGGPESLVSQTAATYGRVFSHCFPPPTRRGFFTLGVPRVAAWRYVLTPMLKNPAIPP 322
Query: 247 ----------------VSTPDIVIDSGTTL-------TFLPQGYNSNLLSVMSSMIEAQP 283
++ P V +G L P Y + + M QP
Sbjct: 323 TFYMVRLEAIAVAGQRIAVPPTVFAAGAALDSRTAITRLPPTAYQALRQAFRDRMAMYQP 382
Query: 284 VADPTGSLELCYSFNSLSQ--VPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVF-KGIT 339
A P G L+ CY + +P +T+ F + A V+L S + C F G
Sbjct: 383 -APPKGPLDTCYDMAGVRSFALPRITLVFDKNAAVELDPSGVLFQ-----GCLAFTAGPN 436
Query: 340 NSVP-IYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+ VP I GNI V Y+I V F+ C
Sbjct: 437 DQVPGIIGNIQLQTLEVLYNIPAALVGFRHAAC 469
>gi|413916291|gb|AFW56223.1| hypothetical protein ZEAMMB73_420944 [Zea mays]
Length = 383
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 76/277 (27%), Positives = 114/277 (41%), Gaps = 70/277 (25%)
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMS 140
D+ P+ Y + ++IG PP D+GSDL W QC+ PC C PL+ P S
Sbjct: 58 GDVYPHGL-YYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPC--RSCNEVPHPLYRPTKS 114
Query: 141 STYKSLPCSSSQCASLN-----QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTT 193
K +PC CASL+ + C + C Y + Y D S G L ++ L T
Sbjct: 115 ---KLVPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVLINDSFALRLTN 171
Query: 194 GQAVALPGITFGCGTN---NGGLFNSKTTGIVGLGGGDISLISQMR-------------- 236
G +VA P + FGCG + G +S T G++GLG G +SL+SQ++
Sbjct: 172 G-SVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGHCLS 230
Query: 237 ---------------------TTIA-----------------GNQRLGVSTPDIVIDSGT 258
T +A G++ LGV +V DSG+
Sbjct: 231 LRGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKVVFDSGS 290
Query: 259 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY 295
+ T+ L++ + + +P SL LC+
Sbjct: 291 SFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLCW 327
>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 494
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 92/333 (27%), Positives = 124/333 (37%), Gaps = 78/333 (23%)
Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKS--CSGV 164
V DT SD+ W QC PCP CY Q L+DP SS+ C+S C L + C+
Sbjct: 172 VLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYANGCTNN 231
Query: 165 N-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC--GTNNGGLFNSKTTGI 221
N CQY V Y DG+ + G ++ +T+ T A+ FGC G F S GI
Sbjct: 232 NQCQYRVRYPDGTSTAGTYISDLLTITPAT----AVRSFQFGCSHGVQGSFSFGSSAAGI 287
Query: 222 VGLGGGDISLISQMRTTIA------------------GNQRLG----------------- 246
+ LGGG SL+SQ T G R+
Sbjct: 288 MALGGGPESLVSQTAATYGRVFSHCFPPPTRRGFFTLGVPRVAAWRYVLTPMLKNPAIPP 347
Query: 247 ----------------VSTPDIVIDSGTTL-------TFLPQGYNSNLLSVMSSMIEAQP 283
++ P V +G L P Y + + M QP
Sbjct: 348 TFYMVRLEAIAVAGQRIAVPPTVFAAGAALDSRTAITRLPPTAYQALRQAFRDRMAMYQP 407
Query: 284 VADPTGSLELCYSFNSLS--QVPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVF-KGIT 339
A P G L+ CY + +P +T+ F + A V+L S + C F G
Sbjct: 408 -APPKGPLDTCYDMAGVRSFALPRITLVFDKNAAVELDPSGVLFQ-----GCLAFTAGPN 461
Query: 340 NSVP-IYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+ VP I GNI V Y+I V F+ C
Sbjct: 462 DQVPGIIGNIQLQTLEVLYNIPAALVGFRHAAC 494
>gi|72384474|gb|AAZ67590.1| 80A08_5 [Brassica rapa subsp. pekinensis]
Length = 632
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 111/444 (25%), Positives = 166/444 (37%), Gaps = 81/444 (18%)
Query: 8 VFILFFLCFYVVSPIEAQTGGFSVELIHR------DSPKSP--FYNSSETPYQRLRDALT 59
FIL F+ V A FS LIHR S KSP F Y RL ++
Sbjct: 6 AFILLFILSLVSEKSLASL--FSSRLIHRFSDEGRASIKSPGSFPEKRSFEYYRLLTSID 63
Query: 60 RSLNRLNHFNQNSSISSSKASQADIIPNN---ANYLIRISIGTPPTERLAVADTGSDLIW 116
++N + S+ S+ S+ I P N + I IGTP L D+GSDL+W
Sbjct: 64 SRRQKMNLGAKFQSLVPSEGSKT-ISPGNYFGWLHYTWIDIGTPSVSFLVALDSGSDLLW 122
Query: 117 TQCE--PCPP------SQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQY 168
C C P S +D FDP S+T K PCS C S C Y
Sbjct: 123 IPCNCVQCAPLSSAYYSSLATKDLNEFDPSASTTSKVFPCSHKLCESAPACESPKEQCPY 182
Query: 169 SVSYG-DGSFSNGNLATETVTLGSTTGQAVALPG-ITFGCGTNNGGLFNSKTT--GIVGL 224
+V+Y + + S+G L + + L + + ++ + GCG G F G++GL
Sbjct: 183 TVTYASENTSSSGLLVEDVLHLAYSANASSSVKARVVVGCGEKQSGEFLKGIAPDGVMGL 242
Query: 225 GGGDISLIS------QMRTTIA-------------------------------------- 240
G G+IS+ S MR + +
Sbjct: 243 GPGEISVPSFLAKAGLMRNSFSMCFDEEDSGRIYFGDVGPSTQQSTRFLPYKNEFVAYFV 302
Query: 241 -------GNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 293
GN L S+ +IDSG + TFLP+ + + S I A G E
Sbjct: 303 GVEVCCVGNSCLKQSSFTTLIDSGQSFTFLPEEIYREVALEIDSHINATVKKIEGGPWEY 362
Query: 294 CYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNF 353
CY + +VP + + F + + FV + + I+ S G ++ N+
Sbjct: 363 CYETSFEPKVPAIKLKFSSNNTFVIHKPLFVLQRSEGLVQFCLPISASEEGTGGVIGQNY 422
Query: 354 LVGY----DIEQQTVSFKPTDCTK 373
+ GY D E + + + C +
Sbjct: 423 MAGYRIVFDRENMKLGWSASKCQE 446
>gi|224144963|ref|XP_002325476.1| predicted protein [Populus trichocarpa]
gi|222862351|gb|EEE99857.1| predicted protein [Populus trichocarpa]
Length = 372
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 92/346 (26%), Positives = 143/346 (41%), Gaps = 72/346 (20%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I +G P + DTGSD++W C+ CP L+DP S + +
Sbjct: 27 YFAKIGLGNPSKDYYVQVDTGSDILWVNCIGCDKCPTKSDLGIKLTLYDPASSVSATRVS 86
Query: 148 CSSSQCAS----LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVALP 200
C C S L + CQY+V YGDGS + G ++ V TG ++
Sbjct: 87 CDDDFCTSTYNGLLPDCKKELPCQYNVVYGDGSSTAGYFVSDAVQFERVTGNLQTGLSNG 146
Query: 201 GITFGCGT-NNGGLFNSKTT--GIVG--------LGGGDISLISQ--------------- 234
+TFGCG +GGL S GI+G + GG I I +
Sbjct: 147 TVTFGCGAQQSGGLGTSGEALDGILGAFAHCLDNVNGGGIFAIGELVSPKVNTTPMVPNQ 206
Query: 235 ------MRTTIAGNQRLGVSTP--------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIE 280
M+ G L + T +IDSGTTL +LP+ V SM+
Sbjct: 207 AHYNVYMKEIEVGGTVLELPTDVFDSGDRRGTIIDSGTTLAYLPE-------VVYDSMMN 259
Query: 281 AQPVADPTGSLE------LC--YSFNSLSQVPEVTIHFRGA-DVKLSRSNFFVKVSEDIV 331
P SL +C YS N P++ HF+ + + + ++ ++SEDI
Sbjct: 260 EIRSQQPGLSLHTVEEQFICFKYSGNVDDGFPDIKFHFKDSLTLTVYPHDYLFQISEDIW 319
Query: 332 CSVFK--GITNS----VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
C ++ G+ + + + G+++ +N LV YDIE Q + + +C
Sbjct: 320 CFGWQNGGMQSKDGRDMTLLGDLVLSNKLVLYDIENQAIGWTEYNC 365
>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
Length = 472
Score = 90.9 bits (224), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 58/153 (37%), Positives = 82/153 (53%), Gaps = 11/153 (7%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+++NY+I++ GTPP V DTGS++ W C PC S C + P F+P SSTY L
Sbjct: 120 SSSNYIIKLGFGTPPQSFYTVLDTGSNIAWIPCNPC--SGCSSKQQP-FEPSKSSTYNYL 176
Query: 147 PCSSSQCASLN--QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
C+S QC L KS + VNC + YGD S + L++ET+++GS + F
Sbjct: 177 TCASQQCQLLRVCTKSDNSVNCSLTQRYGDQSEVDEILSSETLSVGSQ-----QVENFVF 231
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRT 237
GC GL +T +VG G +S +SQ T
Sbjct: 232 GCSNAARGLIQ-RTPSLVGFGRNPLSFVSQTAT 263
Score = 42.7 bits (99), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 41/159 (25%), Positives = 71/159 (44%), Gaps = 11/159 (6%)
Query: 222 VGLGGGDISLISQMRTTIAGNQRLGVST-PDIVIDSGTTLTFLPQGYNSNLLSVMSSMIE 280
VGL G IS+ ++ + AG L ST +IDSGT +T L + + + S +
Sbjct: 314 VGLNG--ISVGEELVSIPAGTLSLDESTGRGTIIDSGTVITRLVEPAYNAMRDSFRSQLS 371
Query: 281 AQPVADPTGSLELCYSFNSLS-QVPEVTIHF-RGADVKLSRSNFFVKVSED--IVCSVF- 335
+A PT + CY+ S + P +T+HF D+ L N ++D ++C F
Sbjct: 372 NLTMASPTDLFDTCYNRPSGDVEFPLITLHFDDNLDLTLPLDNILYPGNDDGSVLCLAFG 431
Query: 336 ---KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
G + + +GN Q + +D+ + + +C
Sbjct: 432 LPPGGGDDVLSTFGNYQQQKLRIVHDVAESRLGIASENC 470
>gi|21717173|gb|AAM76366.1|AC074196_24 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433291|gb|AAP54829.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 413
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 57/170 (33%), Positives = 87/170 (51%), Gaps = 18/170 (10%)
Query: 76 SSKASQADIIPNNAN---YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
++ A A ++P + Y+ +IGTPP A+ D +L+WTQC C +C+ QD
Sbjct: 44 ATPAGGAAVVPIRWSPPYYVANFTIGTPPQPASAIVDVAGELVWTQCSAC--RRCFKQDL 101
Query: 133 PLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGN----LATETVT 188
P+F P SST+K PC ++ C S+ +SCSG C Y G + GN AT+T
Sbjct: 102 PVFVPNASSTFKPEPCGTAVCESIPTRSCSGDVCSYK---GPPTQLRGNTSGFAATDTFA 158
Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
+G+ T + + FGC + +G +GLG SL++QM+ T
Sbjct: 159 IGTATVR------LAFGCVVASDIDTMDGPSGFIGLGRTPWSLVAQMKLT 202
Score = 37.7 bits (86), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 22/95 (23%), Positives = 46/95 (48%), Gaps = 13/95 (13%)
Query: 291 LELCYSFN---SLSQVPEVTIHFRGAD-VKLSRSNFFVKVSE--DIVCSVFKGIT----- 339
+LC+ S + P++ F+GA + + + + + V E D C+ +
Sbjct: 316 FDLCFKKAAGFSRATAPDLVFTFQGAAALTVPPAKYLIDVGEEKDTACAAILSMAWLNRT 375
Query: 340 --NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
V + G++ Q + YD++++T+SF+P DC+
Sbjct: 376 GLEGVSVLGSLQQEDVHFLYDLKKETLSFEPADCS 410
>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
Length = 462
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 63/229 (27%), Positives = 109/229 (47%), Gaps = 16/229 (6%)
Query: 24 AQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD 83
A+ GG + IH +P+ + + S + +N + + ++S
Sbjct: 35 ARGGGIGFKAIHVAAPQFRVKANPSPSSAAQKSLFPYSAHIFQQHTKNPA--ALRSSTTT 92
Query: 84 IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
+ Y I +G+P E + + DTGS+L W +C PC C ++D S +Y
Sbjct: 93 LGRKFGEYYTSIKLGSPGQEAILIVDTGSELTWLKCLPC--KVCAPSVDTIYDAARSVSY 150
Query: 144 KSLPCSSSQ-CASLNQKS----CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT-GQAV 197
K + C++SQ C++ +Q + G CQ++ YGDGSFS G+L+T+T+ + + G+ V
Sbjct: 151 KPVTCNNSQLCSNSSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPV 210
Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLG 246
+ FGC + L + +GI+GL G ++L Q+ QR G
Sbjct: 211 TVQDFAFGCAQGDLELVPTGASGILGLNAGKMALPMQL------GQRFG 253
>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
Length = 449
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 44/129 (34%), Positives = 72/129 (55%), Gaps = 10/129 (7%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
Y + + +G PP L + DTGSDL W QC+PC C+ Q P+FDP S+++K +PC+
Sbjct: 86 EYFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPC--KACFDQSGPVFDPSQSTSFKIIPCN 143
Query: 150 SSQCASLNQKSC-------SGVNCQYSVSYGDGSFSNGNLATETVTLG-STTGQAVALPG 201
++ C + C S C+Y YGD S ++G+LA E++++ S ++ +
Sbjct: 144 AAACDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLEIRD 203
Query: 202 ITFGCGTNN 210
+ GCG +N
Sbjct: 204 MVIGCGHSN 212
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 64/124 (51%), Gaps = 5/124 (4%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVP--EVTIHF 310
+IDSGTTLT+L + + S + I + P ADP L +CY+ + VP ++I F
Sbjct: 325 IIDSGTTLTYLNRDAYRAVESAFLARI-SYPRADPFDILGICYNATGRAAVPFPALSIVF 383
Query: 311 R-GADVKLSRSNFFVKVSEDIVCSVFKGI-TNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
+ GA++ L + N+F++ + T+ + I GN Q N YD++ + F
Sbjct: 384 QNGAELDLPQENYFIQPDPQEAKHCLAILPTDGMSIIGNFQQQNIHFLYDVQHARLGFAN 443
Query: 369 TDCT 372
TDC+
Sbjct: 444 TDCS 447
>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
Length = 494
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 90/365 (24%), Positives = 150/365 (41%), Gaps = 89/365 (24%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y RI IGTP DTGSD++W C+ CP + ++DP+ S + + +
Sbjct: 90 YFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELVT 149
Query: 148 CSSSQCASLNQK---SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
C C + SC+ + C+YS+SYGDGS + G T+ + +G P
Sbjct: 150 CDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTPANA 209
Query: 202 -ITFGCGTNNGGLFNSKT---TGIVGLGGGDISLISQMRT-------------TIAGNQR 244
++FGCG GG S GI+G G + S++SQ+ T+ G
Sbjct: 210 SVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTVNGGGI 269
Query: 245 LGVS------------TPDI----------------------VIDSG----------TTL 260
+ PD+ + DSG TTL
Sbjct: 270 FAIGNVVQPKVKTTPLVPDMPHYNVILKGIDVGGTALGLPTNIFDSGNSKGTIIDSGTTL 329
Query: 261 TFLPQGYNSNLLSVM---SSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFRGADV 315
++P+G L +++ I Q + D + C+ ++ PEVT HF G DV
Sbjct: 330 AYVPEGVYKALFAMVFDKHQDISVQTLQDFS-----CFQYSGSVDDGFPEVTFHFEG-DV 383
Query: 316 KL--SRSNFFVKVSEDIVCSVFK--GIT----NSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
L S ++ + +++ C F+ G+ + + G+++ +N LV YD+E Q + +
Sbjct: 384 SLIVSPHDYLFQNGKNLYCMGFQNGGVQTKDGKDMVLLGDLVLSNKLVLYDLENQAIGWA 443
Query: 368 PTDCT 372
+C+
Sbjct: 444 DYNCS 448
>gi|358345193|ref|XP_003636666.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355502601|gb|AES83804.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 161
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 46/123 (37%), Positives = 73/123 (59%), Gaps = 2/123 (1%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA-DPTGSLELCYSFNSLSQVPEVTIHF 310
++IDSGT ++ LP+ + LL + + +P+ DP+ +LCY + + P + HF
Sbjct: 1 MLIDSGTPISILPEDFFHRLLEQVRKKVALEPMPFDPSLGYQLCYRTPTNLKGPTLVAHF 60
Query: 311 RGADVKLSRSNFFVKVSEDIVCSVF-KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 369
GADV L+ + F+ V I C F +N YG+ +Q+N+L+G+D+E+Q VSFK T
Sbjct: 61 EGADVLLTPTQIFIPVQYGIFCFAFTSSFSNEYGTYGSYVQSNYLIGFDLEKQVVSFKAT 120
Query: 370 DCT 372
DCT
Sbjct: 121 DCT 123
>gi|218184944|gb|EEC67371.1| hypothetical protein OsI_34484 [Oryza sativa Indica Group]
Length = 396
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 57/170 (33%), Positives = 87/170 (51%), Gaps = 18/170 (10%)
Query: 76 SSKASQADIIPNNAN---YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
++ A A ++P + Y+ +IGTPP A+ D +L+WTQC C +C+ QD
Sbjct: 27 ATPAGGAAVVPIRWSPPYYVANFTIGTPPQPASAIVDVAGELVWTQCSAC--RRCFKQDL 84
Query: 133 PLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGN----LATETVT 188
P+F P SST+K PC ++ C S+ +SCSG C Y G + GN AT+T
Sbjct: 85 PVFVPNASSTFKPEPCGTAVCESIPTRSCSGDVCSYK---GPPTQLRGNTSGFAATDTFA 141
Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
+G+ T + + FGC + +G +GLG SL++QM+ T
Sbjct: 142 IGTATVR------LAFGCVVASDIDTMDGPSGFIGLGRTPWSLVAQMKLT 185
Score = 37.7 bits (86), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 22/95 (23%), Positives = 46/95 (48%), Gaps = 13/95 (13%)
Query: 291 LELCYSFN---SLSQVPEVTIHFRGAD-VKLSRSNFFVKVSE--DIVCSVFKGIT----- 339
+LC+ S + P++ F+GA + + + + + V E D C+ +
Sbjct: 299 FDLCFKKAAGFSRATAPDLVFTFQGAAALTVPPAKYLIDVGEEKDTACAAILSMAWLNRT 358
Query: 340 --NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
V + G++ Q + YD++++T+SF+P DC+
Sbjct: 359 GLEGVSVLGSLQQEDVHFLYDLKKETLSFEPADCS 393
>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
Length = 533
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 44/130 (33%), Positives = 72/130 (55%), Gaps = 10/130 (7%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y + + +G PP L + DTGSDL W QC+PC C+ Q P+FDP S+++K +PC
Sbjct: 169 GEYFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPC--KACFDQSGPVFDPSQSTSFKIIPC 226
Query: 149 SSSQCASLNQKSC-------SGVNCQYSVSYGDGSFSNGNLATETVTLG-STTGQAVALP 200
+++ C + C S C+Y YGD S ++G+LA E++++ S ++ +
Sbjct: 227 NAAACDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLEIR 286
Query: 201 GITFGCGTNN 210
+ GCG +N
Sbjct: 287 DMVIGCGHSN 296
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 64/124 (51%), Gaps = 5/124 (4%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVP--EVTIHF 310
+IDSGTTLT+L + + S + I + P ADP L +CY+ + VP ++I F
Sbjct: 409 IIDSGTTLTYLNRDAYRAVESAFLARI-SYPRADPFDILGICYNATGRTAVPFPTLSIVF 467
Query: 311 R-GADVKLSRSNFFVKVSEDIVCSVFKGI-TNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
+ GA++ L + N+F++ + T+ + I GN Q N YD++ + F
Sbjct: 468 QNGAELDLPQENYFIQPDPQEAKHCLAILPTDGMSIIGNFQQQNIHFLYDVQHARLGFAN 527
Query: 369 TDCT 372
TDC+
Sbjct: 528 TDCS 531
>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
Length = 494
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 92/367 (25%), Positives = 150/367 (40%), Gaps = 93/367 (25%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y RI IGTP DTGSD++W C+ CP + ++DP+ S + + +
Sbjct: 90 YFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELVT 149
Query: 148 CSSSQCASLNQK---SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
C C + SC+ + C+YS+SYGDGS + G T+ + +G P
Sbjct: 150 CDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTPANA 209
Query: 202 -ITFGCGTNNGGLFNSKT---TGIVGLGGGDISLISQM------RTTIA----------- 240
++FGCG GG S GI+G G + S++SQ+ R A
Sbjct: 210 SVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTVNGGGI 269
Query: 241 --------------------------------GNQRLGVSTPDIVIDSG----------T 258
G LG+ P + DSG T
Sbjct: 270 FAIGNVVQPKVKTTPLVSDMPHYNVILKGIDVGGTALGL--PTNIFDSGNSKGTIIDSGT 327
Query: 259 TLTFLPQGYNSNLLSVM---SSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFRGA 313
TL ++P+G L +++ I Q + D + C+ ++ PEVT HF G
Sbjct: 328 TLAYVPEGVYKALFAMVFDKHQDISVQTLQDFS-----CFQYSGSVDDGFPEVTFHFEG- 381
Query: 314 DVKL--SRSNFFVKVSEDIVCSVFK--GIT----NSVPIYGNIMQTNFLVGYDIEQQTVS 365
DV L S ++ + +++ C F+ G+ + + G+++ +N LV YD+E Q +
Sbjct: 382 DVSLIVSPHDYLFQNGKNLYCMGFQNGGVQTKDGKDMVLLGDLVLSNKLVLYDLENQAIG 441
Query: 366 FKPTDCT 372
+ +C+
Sbjct: 442 WADYNCS 448
>gi|359476204|ref|XP_002262813.2| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 460
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 66/203 (32%), Positives = 90/203 (44%), Gaps = 36/203 (17%)
Query: 32 ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANY 91
E+ RD + F NS Y S N NH + N ++ + N+
Sbjct: 88 EIFGRDESRVSFINSKCNQYT--------SGNLKNHAHNN-----------NLFDEDGNF 128
Query: 92 LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
L+ ++ GTP TE + DTGS + WTQC+ C C + FD SSTY C S
Sbjct: 129 LVDVAFGTPXTEIXLILDTGSSITWTQCKAC--VNCLQDSNRYFDSSASSTYSFGSCIPS 186
Query: 152 QCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNG 211
V Y+++YGD S S GN +T+TL + FGCG NN
Sbjct: 187 T-----------VENNYNMTYGDDSTSVGNYGCDTMTLEPSD----VFQKFQFGCGRNNK 231
Query: 212 GLFNSKTTGIVGLGGGDISLISQ 234
G F S G++GLG G +S +SQ
Sbjct: 232 GDFGSGVDGMLGLGQGQLSTVSQ 254
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 45/145 (31%), Positives = 72/145 (49%), Gaps = 13/145 (8%)
Query: 241 GNQRLGV-----STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS----L 291
GN+RL + ++P +IDS T +T LPQ S L + + P+++ L
Sbjct: 317 GNERLNIPSSVFASPGTIIDSRTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDIL 376
Query: 292 ELCYSFNSLSQV--PEVTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNI 348
+ CY+ + V PE+ +HF G ADV+L+ +N +C F G T+ + I GN
Sbjct: 377 DTCYNLSGRKDVLLPEIVLHFGGGADVRLNGTNIVWGSDASRLCLAFAG-TSELTIIGNR 435
Query: 349 MQTNFLVGYDIEQQTVSFKPTDCTK 373
Q + V YDI+ + + F C+K
Sbjct: 436 QQLSLTVLYDIQGRRIGFGGNGCSK 460
>gi|125575542|gb|EAZ16826.1| hypothetical protein OsJ_32298 [Oryza sativa Japonica Group]
Length = 396
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 56/155 (36%), Positives = 79/155 (50%), Gaps = 15/155 (9%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
N AN+ +IGTPP A+ D +L+WTQC C S+C+ QD PLF P SST++
Sbjct: 43 NVANF----TIGTPPQPASAIIDVAGELVWTQCSRC--SRCFKQDLPLFIPNASSTFRPE 96
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYG---DGSFSNGNLATETVTLGSTTGQAVALPGIT 203
PC + C S +CSG C Y + D + G + TET +G+ T +
Sbjct: 97 PCGTDACKSTPTSNCSGDVCTYESTTNIRLDRHTTLGIVGTETFAIGTATAS------LA 150
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
FGC + T+G +GLG SL++QM+ T
Sbjct: 151 FGCVVASDIDTMDGTSGFIGLGRTPRSLVAQMKLT 185
Score = 44.7 bits (104), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 26/104 (25%), Positives = 51/104 (49%), Gaps = 13/104 (12%)
Query: 282 QPVADPTGSLELCYSFN---SLSQVPEVTIHFRGAD-VKLSRSNFFVKVSE--DIVCSVF 335
QP+A P +LC+ S + P++ F+GA + + + + + V E D C+
Sbjct: 290 QPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGAAALTVPPAKYLIDVGEEKDTACAAI 349
Query: 336 KGIT-------NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
+ V + G++ Q + YD++++T+SF+P DC+
Sbjct: 350 LSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKETLSFEPADCS 393
>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 488
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 90/367 (24%), Positives = 144/367 (39%), Gaps = 92/367 (25%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQC---EPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I +G PP + DTGSD++W C + CP L+DP+ S++ +
Sbjct: 82 YFAKIGLGNPPKDYYVQVDTGSDILWVNCANCDKCPTKSDLGVKLTLYDPQSSTSATRIY 141
Query: 148 CSSSQCAS----LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVALP 200
C CA+ + Q + CQYSV YGDGS + G + + TG + A
Sbjct: 142 CDDDFCAATYNGVLQGCTKDLPCQYSVVYGDGSSTAGFFVKDNLQFDRVTGNLQTSSANG 201
Query: 201 GITFGCGTNNGGLFNSKTT---GIVGLG----------------------------GGDI 229
+ FGCG G + + GI+G G GG I
Sbjct: 202 SVIFGCGAKQSGELGTSSEALDGILGFGQANSSMISQLAAAGKVKRVFAHCLDNVKGGGI 261
Query: 230 SLISQ---------------------MRTTIAGNQRLGVSTPDI---------VIDSGTT 259
I + M+ G L + T DI +IDSGTT
Sbjct: 262 FAIGEVVSPKVNTTPMVPNQPHYNVVMKEIEVGGNVLELPT-DIFDTGDRRGTIIDSGTT 320
Query: 260 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE----LCYSFNSLSQVPEVTIHFRGA-D 314
L +LP+ S+M+ ++ QP E Y+ N P V HF G+
Sbjct: 321 LAYLPEVVYE---SMMTKIVSEQPGLKLHTVEEQFTCFQYTGNVNEGFPVVKFHFNGSLS 377
Query: 315 VKLSRSNFFVKVSEDIVCSVFKGITNS---------VPIYGNIMQTNFLVGYDIEQQTVS 365
+ ++ ++ ++ E++ C G NS + + G+++ +N LV YD+E Q +
Sbjct: 378 LTVNPHDYLFQIHEEVWCF---GWQNSGMQSKDGRDMTLLGDLVLSNKLVLYDLENQAIG 434
Query: 366 FKPTDCT 372
+ +C+
Sbjct: 435 WTDYNCS 441
>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 481
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 88/369 (23%), Positives = 143/369 (38%), Gaps = 89/369 (24%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQC---EPCPPSQCYMQDSPLFDPKMSSTYKS 145
+N L IG P + DTGSD +W C CP D L+DP +S T K+
Sbjct: 72 SNGLYYTKIGLGPKDYYVQVDTGSDTLWVNCVGCTACPKKSGLGMDLTLYDPNLSKTSKA 131
Query: 146 LPCSSSQCASLNQKSCS----GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP- 200
+PC C S S G++C YS++YGDGS ++G+ + +T G +P
Sbjct: 132 VPCDDEFCTSTYDGQISGCTKGMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLRTVPD 191
Query: 201 --GITFGCGTNNGGLFNSKT----TGIVGLGGGDISLISQM------------------- 235
+ FGCG+ G +S T GI+G G + S++SQ+
Sbjct: 192 NTSVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRIFSHCLDSISG 251
Query: 236 --------------RTT--IAGNQRLGVSTPDI----------------------VIDSG 257
+TT + G V DI +IDSG
Sbjct: 252 GGIFAIGEVVQPKVKTTPLLQGMAHYNVVLKDIEVAGDPIQLPSDILDSSSGRGTIIDSG 311
Query: 258 TTLTFLPQGYNSNLLSVM---SSMIEAQPVADPTGSLELCYSFNSLSQVPEV--TIHF-- 310
TTL +LP LL + S ++ V D C+ ++ V ++ T+ F
Sbjct: 312 TTLAYLPVSIYDQLLEKILAQRSGMKLYLVEDQF----TCFHYSDEESVDDLFPTVKFTF 367
Query: 311 -RGADVKLSRSNFFVKVSEDIVCSVFKGIT------NSVPIYGNIMQTNFLVGYDIEQQT 363
G + ++ ED+ C ++ + + G+++ N LV YD++
Sbjct: 368 EEGLTLTTYPRDYLFLFKEDMWCVGWQKSMAQTKDGKELILLGDLVLANKLVVYDLDNMA 427
Query: 364 VSFKPTDCT 372
+ + +C+
Sbjct: 428 IGWADYNCS 436
>gi|222632750|gb|EEE64882.1| hypothetical protein OsJ_19741 [Oryza sativa Japonica Group]
Length = 456
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 87/347 (25%), Positives = 135/347 (38%), Gaps = 76/347 (21%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQD------SPLFDPKMSSTY 143
Y ++ +GTP T L V DTGSD++W PP ++ +P P+ +
Sbjct: 121 EYFAQVGVGTPATTALMVLDTGSDVVWAPVRALPPLLRAVRQGSSTGAAPAPTPRWN--- 177
Query: 144 KSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
C + C L+ C C Y V+YGDGS + G+ A+ET+T + +
Sbjct: 178 ----CVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTFA----RGARVQR 229
Query: 202 ITFGCGTNNGGL----------------FNSKTTGIVG---------------------L 224
+ GCG +N GL F S+ G
Sbjct: 230 VAIGCGHDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSRRARPSRRW 289
Query: 225 GG----GDISLISQMRTTIAGNQRLGVSTPD-----------IVIDSGTTLTFLPQGYNS 269
GG + + ++ G + GVS D +++DSGT++T L +
Sbjct: 290 GGTPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYE 349
Query: 270 NLLSVMSSMIEAQPVADPTGSL-ELCYSFNS--LSQVPEVTIHFRG-ADVKLSRSNFFVK 325
+ + V+ SL + CY+ + + +VP V++H G A V L N+ +
Sbjct: 350 AVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIP 409
Query: 326 V-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
V + C G V I GNI Q F V +D + Q V F P C
Sbjct: 410 VDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGFVPKSC 456
>gi|21717169|gb|AAM76362.1|AC074196_20 putative nucleoid DNA binding protein, 3'-partial [Oryza sativa
Japonica Group]
Length = 377
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/152 (38%), Positives = 77/152 (50%), Gaps = 13/152 (8%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y+ +IGTPP AV D +L+WTQC PC P C+ QD PLFDP SST++ LPC S
Sbjct: 57 YVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQP--CFEQDLPLFDPTKSSTFRGLPCGS 114
Query: 151 SQCASLNQKS--CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
C S+ + S C+ C Y G + G T+T +G+ A + FGC
Sbjct: 115 HLCESIPESSRNCTSDVCIYEAPTKAGD-TGGKAGTDTFAIGA------AKETLGFGCVV 167
Query: 209 NNGGLFNS--KTTGIVGLGGGDISLISQMRTT 238
+ +GIVGLG SL++QM T
Sbjct: 168 MTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVT 199
>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
Length = 458
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 56/160 (35%), Positives = 82/160 (51%), Gaps = 12/160 (7%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y + I +G+PP L VADTGSDL W +C C + F + S+T+
Sbjct: 80 SGQYFVSIRLGSPPQTLLLVADTGSDLTWVRCSACKTNCSIHPPGSTFLARHSTTFSPTH 139
Query: 148 CSSSQCASLNQKSCSGVN-------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
C SS C + Q + + N C+Y Y DGS ++G + ET TL +++G+ + L
Sbjct: 140 CFSSLCQLVPQPNPNPCNHTRLHSTCRYEYVYSDGSKTSGFFSKETTTLNTSSGREMKLK 199
Query: 201 GITFGCGTNNGG--LFNSK---TTGIVGLGGGDISLISQM 235
I FGCG + G L S +G++GLG G IS SQ+
Sbjct: 200 SIAFGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQL 239
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/130 (27%), Positives = 61/130 (46%), Gaps = 12/130 (9%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS-----LELCYSFNSLS--QVPE 305
VIDSGTTLTFL + +LS ++ P P G+ +LC + +S + P
Sbjct: 327 VIDSGTTLTFLTEPAYREILSAFKREVKL-PSPTPGGASTRSGFDLCVNVTGVSRPRFPR 385
Query: 306 VTIHFRGADV-KLSRSNFFVKVSEDIVCSVFKGI---TNSVPIYGNIMQTNFLVGYDIEQ 361
+++ G + N+F+ +SE I C + + + + GN+MQ FL+ +D +
Sbjct: 386 LSLELGGESLYSPPPRNYFIDISEGIKCLAIQPVEAESGRFSVIGNLMQQGFLLEFDRGK 445
Query: 362 QTVSFKPTDC 371
+ F C
Sbjct: 446 SRLGFSRRGC 455
>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
Length = 451
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 54/162 (33%), Positives = 80/162 (49%), Gaps = 16/162 (9%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC-YMQDSPLFDPKMSSTYKSL 146
+ Y + + IG PP L +ADTGSDL+W +C C C + + +F P+ SST+
Sbjct: 80 SGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSAC--RNCSHHSPATVFFPRHSSTFSPA 137
Query: 147 PCSSSQCASLNQ----KSCSGVN----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
C C + + C+ C Y Y DGS ++G A ET +L +++G+
Sbjct: 138 HCYDPVCRLVPKPGRAPRCNHTRIHSTCPYEYGYADGSLTSGLFARETTSLKTSSGKEAK 197
Query: 199 LPGITFGCGTNNGGLFNSKTT-----GIVGLGGGDISLISQM 235
L + FGCG G S T+ G++GLG G IS SQ+
Sbjct: 198 LKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQL 239
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/127 (25%), Positives = 59/127 (46%), Gaps = 7/127 (5%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ----VPEVTI 308
V+DSGTTL FL +++ + I+ + T +LC + + +++ +P +
Sbjct: 323 VMDSGTTLAFLADPAYRLVIAAVKQRIKLPNADELTPGFDLCVNVSGVTKPEKILPRLKF 382
Query: 309 HFRGADVKL-SRSNFFVKVSEDIVCSVFKGITNSV--PIYGNIMQTNFLVGYDIEQQTVS 365
F G V + N+F++ E I C + + V + GN+MQ FL +D ++ +
Sbjct: 383 EFSGGAVFVPPPRNYFIETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSRLG 442
Query: 366 FKPTDCT 372
F C
Sbjct: 443 FSRRGCA 449
>gi|21717154|gb|AAM76347.1|AC074196_5 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433293|gb|AAP54831.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125532791|gb|EAY79356.1| hypothetical protein OsI_34485 [Oryza sativa Indica Group]
Length = 397
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 56/155 (36%), Positives = 79/155 (50%), Gaps = 15/155 (9%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
N AN+ +IGTPP A+ D +L+WTQC C S+C+ QD PLF P SST++
Sbjct: 43 NVANF----TIGTPPQPASAIIDVAGELVWTQCSRC--SRCFKQDLPLFIPNASSTFRPE 96
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYG---DGSFSNGNLATETVTLGSTTGQAVALPGIT 203
PC + C S +CSG C Y + D + G + TET +G+ T +
Sbjct: 97 PCGTDACKSTPTSNCSGDVCTYESTTNIRLDRHTTLGIVGTETFAIGTATAS------LA 150
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
FGC + T+G +GLG SL++QM+ T
Sbjct: 151 FGCVVASDIDTMDGTSGFIGLGRTPRSLVAQMKLT 185
Score = 39.3 bits (90), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 23/96 (23%), Positives = 45/96 (46%), Gaps = 14/96 (14%)
Query: 291 LELCYSFN---SLSQVPEVTIHFRGADVKLSR--SNFFVKVSE--DIVCSVFKGITN--- 340
+LC+ S + P++ F+G L+ + + + V E D C+ +
Sbjct: 299 FDLCFKKAAGFSRATAPDLVFTFQGGGAALTVPPAKYLIDVGEEKDTACAAILSMARLNR 358
Query: 341 ----SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
V + G++ Q N YD++++T+SF+P DC+
Sbjct: 359 TGLEGVSVLGSLQQENVHFLYDLKKETLSFEPADCS 394
>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
Length = 444
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 57/170 (33%), Positives = 85/170 (50%), Gaps = 20/170 (11%)
Query: 75 SSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
SSSK + + +N ++IGTPP V DTGS+L W +C+ P + +
Sbjct: 51 SSSKTTGKLLFHHNVTLTASLTIGTPPQNITMVLDTGSELSWLRCKKEP------NFTSI 104
Query: 135 FDPKMSSTYKSLPCSSSQCASLNQK-----SCSGVN-CQYSVSYGDGSFSNGNLATETVT 188
F+P S TY +PCSS C + +C C + +SY D S G+LA ET
Sbjct: 105 FNPLASKTYTKIPCSSQTCKTRTSDLTLPVTCDPAKLCHFIISYADASSVEGHLAFETFR 164
Query: 189 LGSTTGQAVALPGITFGC---GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
GS T P FGC G+++ ++KTTG++G+ G +S ++QM
Sbjct: 165 FGSLTR-----PATVFGCMDSGSSSNTEEDAKTTGLMGMNRGSLSFVNQM 209
>gi|326520109|dbj|BAK03979.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 70/229 (30%), Positives = 112/229 (48%), Gaps = 28/229 (12%)
Query: 21 PIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF-NQNSSISSSKA 79
P T + ++HR+ P +P +S+ P +R AL R+ N+ SS + +A
Sbjct: 52 PNSPSTSTIRLTILHREHPCAP---ASKRPVRRSPSALQEYHTRVRRLANRLSSCPADEA 108
Query: 80 SQADIIPNNA------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP 133
+ + +I N +Y+ ++ +GTP + DT S L W CEPC + C + P
Sbjct: 109 TASGLIFANGVPWDYYSYVTQVQLGTPAKTHNVLVDTASSLSWVGCEPC-INACLI---P 164
Query: 134 LFDPKMSSTYKSLPCSSSQC-----ASLNQKSCSG--VNCQYSVSYGDGSFSNGNLATET 186
F+P SSTYK + C S+ C A++ +KSC C Y SY D S S G ++++T
Sbjct: 165 TFNPNASSTYKVVGCGSALCNAVPSATMARKSCMAPTEGCSYRQSYHDYSLSVGVVSSDT 224
Query: 187 VTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
+T G + + + FGC G+ + +GI+G+ SL SQM
Sbjct: 225 LTYGLGSQKFI------FGCCNLFRGV-GGRYSGILGMSVNKFSLFSQM 266
>gi|413938616|gb|AFW73167.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 452
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 93/349 (26%), Positives = 122/349 (34%), Gaps = 102/349 (29%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS-QCYMQDSPLFDPKMSSTYKSLPC 148
NY++ S+GTP + DTGSDL W QC+PC + CY Q PLFDP SS+Y ++PC
Sbjct: 139 NYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPC 198
Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
CA L + + + G A+ G FGCG
Sbjct: 199 GGPVCAGL------------------------GIYAASACSAAQCG---AVQGFFFGCGH 231
Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQ--------------MRTTIAGNQRLGVSTPDIVI 254
GLFN G++GLG SL+ Q + + AG LGV P
Sbjct: 232 AQSGLFNG-VDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPSGAA 290
Query: 255 DSGTTLTFLPQGYNSNLLSVMSSMIE---------------------------------- 280
+T LP VM + I
Sbjct: 291 PGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVVTRLPPTAYA 350
Query: 281 -------------AQPVADPTGSLELCYSFNSLSQV--PEVTIHF-RGADVKLSRSNFFV 324
P A G L+ CY+F V P V + F GA V L
Sbjct: 351 ALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADGIL- 409
Query: 325 KVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
C F G + I GN+ Q +F V I+ +V FKP+ C
Sbjct: 410 ----SFGCLAFAPSGSDGGMAILGNVQQRSFEV--RIDGTSVGFKPSSC 452
>gi|194699094|gb|ACF83631.1| unknown [Zea mays]
gi|413938606|gb|AFW73157.1| hypothetical protein ZEAMMB73_333672 [Zea mays]
Length = 452
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 108/421 (25%), Positives = 144/421 (34%), Gaps = 116/421 (27%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS------ISSSKASQADI 84
+ L HR P +P SS + D L R + + S S A+ A
Sbjct: 68 LRLTHRHGPCAPSRASSLA-APSVADTLRADQRRAEYILRRVSGRAPQLWDSKAAAAAAT 126
Query: 85 IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS-QCYMQDSPLFD 136
+P NY++ S+GTP + DTGSDL W QC+PC + CY Q PLFD
Sbjct: 127 VPASWGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFD 186
Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
P SS+Y ++PC CA L + + + G
Sbjct: 187 PAQSSSYAAVPCGGPVCAGL------------------------GIYAASACSAAQCG-- 220
Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ--------------MRTTIAGN 242
A+ G FGCG GLFN G++GLG SL+ Q + + AG
Sbjct: 221 -AVQGFFFGCGHAQSGLFNG-VDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGY 278
Query: 243 QRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIE---------------------- 280
LGV P +T LP VM + I
Sbjct: 279 LTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTG 338
Query: 281 -------------------------AQPVADPTGSLELCYSFNSLSQV--PEVTIHF-RG 312
P A G L+ CY+F V P V + F G
Sbjct: 339 TVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSG 398
Query: 313 ADVKLSRSNFFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
A V L C F G + I GN+ Q +F V I+ +V FKP+
Sbjct: 399 ATVTLGADGIL-----SFGCLAFAPSGSDGGMAILGNVQQRSFEV--RIDGTSVGFKPSS 451
Query: 371 C 371
C
Sbjct: 452 C 452
>gi|218185380|gb|EEC67807.1| hypothetical protein OsI_35373 [Oryza sativa Indica Group]
Length = 418
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 93/368 (25%), Positives = 161/368 (43%), Gaps = 84/368 (22%)
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
D+ P +Y + ++IG P DTGSDL W QC+ P C PL+ P +
Sbjct: 49 GDVYPT-GHYYVTMNIGDPAKPYFLDVDTGSDLTWLQCD-APCQSCNKVPHPLYRPTKN- 105
Query: 142 TYKSLPCSSSQCASL------NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
K +PC++S C +L N+K + C Y + Y D + S G L T++ +L +
Sbjct: 106 --KLVPCANSICTALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVTDSFSL-PLRNK 162
Query: 196 AVALPGITFGCGTN----NGGLFNSKTTGIVGLGGGDISLISQMR--------------- 236
+ P ++FGCG + G + T G++GLG G +SL+SQ++
Sbjct: 163 SNVRPSLSFGCGYDQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCLST 222
Query: 237 ------------------------TTIAGN-----------QRLGVSTP--DIVIDSGTT 259
+ +GN R +ST ++V DSG+T
Sbjct: 223 SGGGFLFFGDDMVPTSRVTWVPMVRSTSGNYYSPGSATLYFDRRSLSTKPMEVVFDSGST 282
Query: 260 LTFL-PQGYNSNLLSVMSSMIEA-QPVADPTGSLELCY----SFNSLSQVPE--VTIHF- 310
T+ Q Y + + ++ S+ ++ + V+DP SL LC+ +F S+S V + ++ F
Sbjct: 283 YTYFSAQPYQATISAIKGSLSKSLKQVSDP--SLPLCWKGQKAFKSVSDVKKDFKSLQFI 340
Query: 311 --RGADVKLSRSNFFVKVSEDIVC-SVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVS 365
+ A +++ N+ + VC + G S I G+I + +V YD E+ +
Sbjct: 341 FGKNAVMEIPPENYLIVTKNGNVCLGILDGSAAKLSFSIIGDITMQDQMVIYDNEKAQLG 400
Query: 366 FKPTDCTK 373
+ C++
Sbjct: 401 WIRGSCSR 408
>gi|115465777|ref|NP_001056488.1| Os05g0591300 [Oryza sativa Japonica Group]
gi|113580039|dbj|BAF18402.1| Os05g0591300 [Oryza sativa Japonica Group]
Length = 453
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 62/174 (35%), Positives = 92/174 (52%), Gaps = 20/174 (11%)
Query: 72 SSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQD 131
+SIS +K S+ N+ +LI + +GTP + L DTGS L W QC PC +C++Q
Sbjct: 38 TSISVTKDSKL----NDFAFLIPVKLGTPAVQYLVTMDTGSSLSWVQCRPC-TIKCHVQP 92
Query: 132 S---PLFDPKMSSTYKSLPCSSSQCASLNQ------KSCSGVN--CQYSVSYGDG-SFSN 179
+ P+FDP SST++ + CS+S C+ L + K+C C Y++SYG G ++S
Sbjct: 93 AKVGPIFDPSNSSTFRHVGCSTSICSYLGRTLRIQSKACMEWEDICLYTMSYGGGWAYSV 152
Query: 180 GNLATETVTL--GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISL 231
G T+ + L G TT ++L FGC + K GI GLG + S
Sbjct: 153 GKAVTDRLVLGGGETTRTTLSLANFVFGCSMDT-QYSTHKEAGIFGLGTSNYSF 205
>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 56/170 (32%), Positives = 85/170 (50%), Gaps = 20/170 (11%)
Query: 75 SSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
++SK + + +N + ++ GTP V DTGS+L W C+ P + +
Sbjct: 51 TTSKTTDKLLFHHNVTLTVSLTAGTPLQNITMVLDTGSELSWLHCKKEP------NFNSI 104
Query: 135 FDPKMSSTYKSLPCSSSQCASLNQK-----SCSGVN-CQYSVSYGDGSFSNGNLATETVT 188
F+P S TY +PCSS C + + SC C + +SY D S GNLA ET
Sbjct: 105 FNPLASKTYTKIPCSSPTCETRTRDLPLPVSCDPAKLCHFIISYADASSVEGNLAFETFR 164
Query: 189 LGSTTGQAVALPGITFGC---GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
+GS TG P FGC G ++ ++KTTG++G+ G +S ++QM
Sbjct: 165 VGSVTG-----PATVFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVNQM 209
>gi|20160862|dbj|BAB89801.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 488
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 53/153 (34%), Positives = 75/153 (49%), Gaps = 19/153 (12%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
N Y+ IGTPP + D SDL+WT C P F+P S+T +
Sbjct: 96 NAGMYVFSYGIGTPPQQVSGALDISSDLVWTACGATAP----------FNPVRSTTVADV 145
Query: 147 PCSSSQCASLNQKSC--SGVNCQYSVSYGDGSF-SNGNLATETVTLGSTTGQAVALPGIT 203
PC+ C ++C C Y+ YG G+ + G L TE T G T + G+
Sbjct: 146 PCTDDACQQFAPQTCGAGASECAYTYMYGGGAANTTGLLGTEAFTFGDTR-----IDGVV 200
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
FGCG N G F S +G++GLG G++SL+SQ++
Sbjct: 201 FGCGLKNVGDF-SGVSGVIGLGRGNLSLVSQLQ 232
>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 535
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 71/135 (52%), Gaps = 12/135 (8%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y + + +G+PP + DTGSDL W QC PC C+ Q+ +DPK S++YK++ C+
Sbjct: 170 YFMDVLVGSPPKHFSLILDTGSDLNWIQCLPC--YDCFQQNGAFYDPKASASYKNITCND 227
Query: 151 SQCASLNQKS----CSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTG----QAVALP 200
+C ++ C N C Y YGD S + G+ A ET T+ TT + +
Sbjct: 228 QRCNLVSSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSELYNVE 287
Query: 201 GITFGCGTNNGGLFN 215
+ FGCG N GLF+
Sbjct: 288 NMMFGCGHWNRGLFH 302
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 63/124 (50%), Gaps = 5/124 (4%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ-PVADPTGSLELCYSFNSLS--QVPEVTIH 309
+IDSGTTL++ + + + ++ + + PV L+ C++ + + Q+PE+ I
Sbjct: 409 IIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGIHNVQLPELGIA 468
Query: 310 F-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSV-PIYGNIMQTNFLVGYDIEQQTVSFK 367
F GA N F+ ++ED+VC G S I GN Q NF + YD ++ + +
Sbjct: 469 FADGAVWNFPTENSFIWLNEDLVCLAMLGTPKSAFSIIGNYQQQNFHILYDTKRSRLGYA 528
Query: 368 PTDC 371
PT C
Sbjct: 529 PTKC 532
>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
Length = 496
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 61/210 (29%), Positives = 109/210 (51%), Gaps = 26/210 (12%)
Query: 52 QRLRDALTR-----SLNRLNHFN-QNSSISSSKASQADIIPNNANYLIRISIGTPPTERL 105
+++R++L+R N+ NH + + + +S S + + A + +++ IG+
Sbjct: 55 EQVRESLSRIQSQVQDNQNNHLDLRGNRPTSGVRSVVTPLEDYALFSMQLGIGSLQKNLS 114
Query: 106 AVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSG-- 163
A+ DTGS+ + QC + P+FDP S +Y+ +PC S C ++ Q++ +G
Sbjct: 115 AIIDTGSEAVLVQCGS--------RSRPVFDPAASQSYRQVPCISQLCLAVQQQTSNGSS 166
Query: 164 -------VNCQYSVSYGDGSFSNGNLATETVTLGST--TGQAVALPGITFGCGTN-NGGL 213
C YS+SYGD S G+ + + + L ST +GQAV + FGC + G L
Sbjct: 167 QPCVNSSATCTYSLSYGDSRNSTGDFSQDVIFLNSTNSSGQAVQFRDVAFGCAHSPQGFL 226
Query: 214 FNSKTTGIVGLGGGDISLISQMRTTIAGNQ 243
+ + GIVG G++SL SQ++ + G++
Sbjct: 227 VDLGSLGIVGFNRGNLSLPSQLKDRLGGSK 256
Score = 43.5 bits (101), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 46/146 (31%), Positives = 68/146 (46%), Gaps = 18/146 (12%)
Query: 244 RLGVSTPD--IVIDSGTTLT-FLPQGYNS--NLLSVMSSMIEAQPVADPTGSLELCYSFN 298
+L ST D V+DSGTT T + Y + N + + + V G + CY+ +
Sbjct: 326 KLDPSTGDGGTVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKVGAAAG-FDDCYNIS 384
Query: 299 ---SLSQVPEVTIHFRG-ADVKLSRSNFFVKVS----EDIVC----SVFKGITNSVPIYG 346
SL VPEV + + ++L + FV VS E VC S K + + G
Sbjct: 385 AGSSLPGVPEVRLSLQNNVRLELRFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKINVLG 444
Query: 347 NIMQTNFLVGYDIEQQTVSFKPTDCT 372
N Q+N+LV YD E+ V F+ DC+
Sbjct: 445 NYQQSNYLVEYDNERSRVGFERADCS 470
>gi|356558489|ref|XP_003547539.1| PREDICTED: uncharacterized protein LOC100817234 [Glycine max]
Length = 739
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 51/122 (41%), Positives = 73/122 (59%), Gaps = 3/122 (2%)
Query: 251 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLSQVPEVTI 308
+I+IDSGTTLT L + + + L + + + I + V L LCY N+ +VP +T
Sbjct: 173 NIIIDSGTTLTILLENFYTKLEAEVEAHINLERVNSTDQILSLCYKSPPNNAIEVPIITT 232
Query: 309 HFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
HF G D+ L+ N FV V +D + F + S I+GN+ Q N LVGYD+ ++TVSFKP
Sbjct: 233 HFAGVDIVLNSLNTFVSVFDDAMWFAFAPVA-SGSIFGNLAQMNHLVGYDLLRKTVSFKP 291
Query: 369 TD 370
TD
Sbjct: 292 TD 293
>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 449
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 87/164 (53%), Gaps = 19/164 (11%)
Query: 83 DIIP--NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
D +P +N + + +++GTPP V DTGS+L W C SQ S F+P S
Sbjct: 63 DKLPFRHNISLTVSLTVGTPPQNVTMVIDTGSELSWLHCN---TSQNSSSSSSTFNPVWS 119
Query: 141 STYKSLPCSSSQCASLNQK-----SC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
S+Y +PCSSS C + SC S C ++SY D S S GNLAT+T +GS+
Sbjct: 120 SSYSPIPCSSSTCTDQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSS-- 177
Query: 195 QAVALPGITFGCGT---NNGGLFNSKTTGIVGLGGGDISLISQM 235
+P + FGC ++ +SK TG++G+ G +S +SQM
Sbjct: 178 ---GIPNVVFGCMDSIFSSNSEEDSKNTGLMGMNRGSLSFVSQM 218
>gi|225455900|ref|XP_002275943.1| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
Length = 686
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 62/175 (35%), Positives = 84/175 (48%), Gaps = 16/175 (9%)
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKM 139
+ D+ PN Y I +G+PP DTGSDL W QC+ PC + C +PL+ PK
Sbjct: 305 RGDVYPNGL-YFTHIFVGSPPRRYFLDMDTGSDLTWIQCDAPC--TSCAKGPNPLYKPKK 361
Query: 140 SSTYKSLPCSSSQCASLNQKSCSGV-----NCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
+ +P S C + + +G C Y + Y D S S G LA++ + L G
Sbjct: 362 GNL---VPLKDSLCVEVQRNLKTGYCETCEQCDYEIEYADHSSSMGVLASDDLHLMLANG 418
Query: 195 QAVALPGITFGCGTNNGG-LFNS--KTTGIVGLGGGDISLISQMRTTIAGNQRLG 246
L GI FGC + G L NS KT GI+GL +SL SQ+ + N LG
Sbjct: 419 SLTKL-GIMFGCAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQLASQRIINNVLG 472
>gi|413936471|gb|AFW71022.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
Length = 315
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 57/177 (32%), Positives = 83/177 (46%), Gaps = 27/177 (15%)
Query: 4 FLSCVFILFFLC---------FYVV-----SPIEAQTGGFS----------VELIHRDSP 39
LSC+F+ F+L F V P +G F V L+HR P
Sbjct: 5 LLSCIFLCFYLSTVHGAGEDSFVTVPSSSFEPESVCSGEFVKPEQNGSTVYVPLVHRHGP 64
Query: 40 KSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGT 99
+P S T + D RS R ++ + +S ++ + Y++R+S GT
Sbjct: 65 CAP-APSLSTDTRSFADIFRRSRARPSYIVRGKKVSVPAHLGTSVM--SLEYVVRVSFGT 121
Query: 100 PPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL 156
P ++ V DTGSD+ W QC+PC QC+ Q PL+DP SSTY ++PC+S C L
Sbjct: 122 PAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPSHSSTYSAVPCASDVCKKL 178
>gi|218186446|gb|EEC68873.1| hypothetical protein OsI_37494 [Oryza sativa Indica Group]
Length = 353
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 55/155 (35%), Positives = 79/155 (50%), Gaps = 17/155 (10%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP---LFDPKMSSTYKSLP 147
Y + IS+GTPP L DTGS L W QC+ C +CY Q + +F+P SSTY +
Sbjct: 6 YFMGISLGTPPVFNLVTIDTGSTLSWVQCKNCQI-KCYDQAAKAGQIFNPYNSSTYSKVG 64
Query: 148 CSSSQCASLN-----QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
CS+ C ++ + C + C YS+ YG G +S G L + +TL S ++
Sbjct: 65 CSTEACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLASNR----SID 120
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
FGCG +N L+N GI+G G S +Q+
Sbjct: 121 NFIFGCGEDN--LYNGVNAGIIGFGTKSYSFFNQV 153
>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 442
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 59/161 (36%), Positives = 85/161 (52%), Gaps = 21/161 (13%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+N + I I++GTPP V DTGS+L W C + P F+P +SS+Y +
Sbjct: 62 HNVSLTISITVGTPPQNMSMVIDTGSELSWLHCN---TNTTATIPYPFFNPNISSSYTPI 118
Query: 147 PCSSSQCASLNQK-----SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
CSS C + + SC N C ++SY D S S GNLA++T GS+ P
Sbjct: 119 SCSSPTCTTRTRDFPIPASCDSNNLCHATLSYADASSSEGNLASDTFGFGSSFN-----P 173
Query: 201 GITFGC-----GTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
GI FGC TN+ +S TTG++G+ G +SL+SQ++
Sbjct: 174 GIVFGCMNSSYSTNSES--DSNTTGLMGMNLGSLSLVSQLK 212
>gi|449529533|ref|XP_004171754.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 437
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 93/363 (25%), Positives = 149/363 (41%), Gaps = 90/363 (24%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y + I+IG D+GSDL W QC+ P + C L+ P ++ L C
Sbjct: 55 YSVSINIGKGDEAFEFDIDSGSDLTWVQCD-APCTHCTKPREQLYKPNNNA----LNCFE 109
Query: 151 SQCASLN---QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
C SL+ C + CQY + Y D S G L + V L T G ++A P I FG
Sbjct: 110 PLCTSLHPITNHHCKSADDQCQYEIEYADHGSSLGVLVNDHVPLKLTNG-SLAAPRIAFG 168
Query: 206 CGTNNGGLF---NSKTTGIVGLGGGDISLISQM------RTTIA---------------- 240
CG ++ + T G++GLG G++S ISQ+ R +
Sbjct: 169 CGYDHKYSVPDSSPPTAGVLGLGNGEVSFISQLSSMGVVRNVVGHCLSDEGGFLFFGDEF 228
Query: 241 -----------------------------GNQRLGVSTPDIVIDSGTTLTFL-PQGYNSN 270
G + G+ +V DSG++ T+ Q YNS
Sbjct: 229 VPSSGVTWTSMSHESIGSYYSSGPAEVYFGGKATGIKDLTLVFDSGSSYTYFNSQAYNS- 287
Query: 271 LLSVMSSMIEAQPVADP--TGSLELCYS----FNSLSQVPE----VTIHF---RGADVKL 317
+L+++ + + +P+ D SL +C+ F SL V + + + F + A ++L
Sbjct: 288 ILALVKNNLRGKPLEDAPEDKSLPVCWKGTRPFKSLRDVKKYFNLLALRFTKTKNAQIQL 347
Query: 318 SRSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
N+ + VC GI N + I G+I + +V YD E++ + + PT+
Sbjct: 348 PPENYLIITKYGNVCF---GILNGTEVGLGDLNIIGDISLKDKMVIYDNERRRIGWFPTN 404
Query: 371 CTK 373
C K
Sbjct: 405 CNK 407
>gi|302802500|ref|XP_002983004.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
gi|300149157|gb|EFJ15813.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
Length = 332
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 55/146 (37%), Positives = 77/146 (52%), Gaps = 23/146 (15%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y I++G+PP + V DTGSDL W +C+PC P C S FD S+TYK+L C+
Sbjct: 3 YYSTITLGSPPKDFSLVMDTGSDLTWVRCDPCSP-DC----SSTFDRLASNTYKALTCAD 57
Query: 151 SQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTL-GSTTGQAVALPGITFGCGTN 209
YS YGDGSF+ G+L+ +T+ + G+ + + PG FGCG+
Sbjct: 58 ----------------DYSYGYGDGSFTQGDLSVDTLKMAGAASDELEEFPGFVFGCGSL 101
Query: 210 NGGLFNSKTTGIVGLGGGDISLISQM 235
GL S GI+ L G +S SQ+
Sbjct: 102 LKGLI-SGEVGILALSPGSLSFPSQI 126
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 48/142 (33%), Positives = 74/142 (52%), Gaps = 14/142 (9%)
Query: 241 GNQRLGVS--------TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE 292
GNQRL +S + DSGTTLT LP G ++ ++SM+ G L+
Sbjct: 194 GNQRLDLSPSAFLNGQDKPTIFDSGTTLTMLPPGVCDSIKQSLASMVSGAEFVAIKG-LD 252
Query: 293 LCYSF--NSLSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIM 349
C+ +S +P++T HF GAD SN+ + + + C +F TN V I+GN+
Sbjct: 253 ACFRVPPSSGQGLPDITFHFNGGADFVTRPSNYVIDLGS-LQCLIFV-PTNEVSIFGNLQ 310
Query: 350 QTNFLVGYDIEQQTVSFKPTDC 371
Q +F V +D++ + + FK TDC
Sbjct: 311 QQDFFVLHDMDNRRIGFKETDC 332
>gi|77553049|gb|ABA95845.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 372
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 55/155 (35%), Positives = 79/155 (50%), Gaps = 17/155 (10%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP---LFDPKMSSTYKSLP 147
Y + IS+GTPP L DTGS L W QC+ C +CY Q + +F+P SSTY +
Sbjct: 25 YFMGISLGTPPVFNLVTIDTGSTLSWVQCKNCQI-KCYDQAAKAGQIFNPYNSSTYSKVG 83
Query: 148 CSSSQCASLN-----QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
CS+ C ++ + C + C YS+ YG G +S G L + +TL S ++
Sbjct: 84 CSTEACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLASNR----SID 139
Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
FGCG +N L+N GI+G G S +Q+
Sbjct: 140 NFIFGCGEDN--LYNGVNAGIIGFGTKSYSFFNQV 172
>gi|168051774|ref|XP_001778328.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162670305|gb|EDQ56876.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 165
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 53/140 (37%), Positives = 67/140 (47%), Gaps = 16/140 (11%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y I I I TPP L + DTGSDL W QC PC CY+Q +F+P S +Y + C
Sbjct: 10 GEYFIDIFIDTPPRHILVIIDTGSDLTWVQCTPCL--HCYLQKGLVFNPHSSESYDPVAC 67
Query: 149 SSSQCA----SLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTL--------GSTTG 194
+ A S N+ +C C Y YGD S + + ATET T+ G
Sbjct: 68 GEPKRAFVESSNNRSTCVTDSQGCSYFYWYGDSSNTTSDFATETFTVNKTIKNDEGGGED 127
Query: 195 QAVALPGITFGCGTNNGGLF 214
+ + I FGCG NN GLF
Sbjct: 128 DTLQISKIMFGCGHNNQGLF 147
>gi|224066811|ref|XP_002302227.1| predicted protein [Populus trichocarpa]
gi|222843953|gb|EEE81500.1| predicted protein [Populus trichocarpa]
Length = 422
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 85/362 (23%), Positives = 145/362 (40%), Gaps = 86/362 (23%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
+Y + ++IG PP DTGSDL W QC+ P C L+ PK + +PC
Sbjct: 66 GHYSVILNIGNPPKAFDLDIDTGSDLTWVQCD-APCKGCTKPLDKLYKPKNN----RVPC 120
Query: 149 SSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
+SS C ++ +C C Y V Y D S G L ++ L G + P I FGC
Sbjct: 121 ASSLCQAIQNNNCDIPTEQCDYEVEYADLGSSLGVLLSDYFPLRLNNGSLLQ-PRIAFGC 179
Query: 207 GTNN---GGLFNSKTTGIVGLGGGDISLISQMRT-------------------------- 237
G + G T GI+GLG G S++SQ+RT
Sbjct: 180 GYDQKYLGPHSPPDTAGILGLGRGKASILSQLRTLGITQNVVGHCFSRVTGGFLFFGDHL 239
Query: 238 --------------------------TIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNL 271
+ G + G+ ++ DSG++ T+ ++
Sbjct: 240 LPPSGITWTPMLRSSSDTLYSSGPAELLFGGKPTGIKGLQLIFDSGSSYTYFNAQVYQSI 299
Query: 272 LSVMSSMIEAQPVADP--TGSLELCYS--------FNSLSQVPEVTIHF---RGADVKLS 318
L+++ + P+ D +L +C+ + S +TI+F + ++L+
Sbjct: 300 LNLVRKDLSGMPLKDAPEEKALAVCWKTAKPIKSILDIKSFFKPLTINFIKAKNVQLQLA 359
Query: 319 RSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
++ + + VC GI N ++ + G+I + +V YD E+Q + + PT+C
Sbjct: 360 PEDYLIITKDGNVCL---GILNGGEQGLGNLNVIGDIFMQDRVVVYDNERQQIGWFPTNC 416
Query: 372 TK 373
+
Sbjct: 417 NR 418
>gi|297734190|emb|CBI15437.3| unnamed protein product [Vitis vinifera]
Length = 473
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 61/174 (35%), Positives = 83/174 (47%), Gaps = 14/174 (8%)
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
+ D+ PN Y I +G+PP DTGSDL W QC+ P + C +PL+ PK
Sbjct: 92 RGDVYPNGL-YFTHIFVGSPPRRYFLDMDTGSDLTWIQCD-APCTSCAKGPNPLYKPKKG 149
Query: 141 STYKSLPCSSSQCASLNQKSCSGV-----NCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
+ +P S C + + +G C Y + Y D S S G LA++ + L G
Sbjct: 150 NL---VPLKDSLCVEVQRNLKTGYCETCEQCDYEIEYADHSSSMGVLASDDLHLMLANGS 206
Query: 196 AVALPGITFGCGTNNGG-LFNS--KTTGIVGLGGGDISLISQMRTTIAGNQRLG 246
L GI FGC + G L NS KT GI+GL +SL SQ+ + N LG
Sbjct: 207 LTKL-GIMFGCAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQLASQRIINNVLG 259
>gi|125532793|gb|EAY79358.1| hypothetical protein OsI_34487 [Oryza sativa Indica Group]
Length = 419
Score = 87.4 bits (215), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 57/190 (30%), Positives = 93/190 (48%), Gaps = 18/190 (9%)
Query: 58 LTRSLNRLNHFNQNSSISSSKASQADIIP---NNANYLIRISIGTPPTERLAVADTGSDL 114
L R L+R + + +++ ++P + A Y+ +IGTPP + D +L
Sbjct: 26 LRRGLDRQGMRGRILADATAAPPGGAVVPLHWSGACYVANFTIGTPPQAVSGIVDLSGEL 85
Query: 115 IWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSG-VNCQYSVS-- 171
+WTQC C S C+ Q+ P+FDP S+TY++ C S C S+ ++CSG C Y
Sbjct: 86 VWTQCAACRSSGCFKQELPVFDPSASNTYRAEQCGSPLCKSIPTRNCSGDGECGYEAPSM 145
Query: 172 YGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKT---TGIVGLGGGD 228
+GD + G +T+ + +G+ G+ + FGC + G + +G VGLG
Sbjct: 146 FGD---TFGIASTDAIAIGNAEGR------LAFGCVVASDGSIDGAMDGPSGFVGLGRTP 196
Query: 229 ISLISQMRTT 238
SL+ Q T
Sbjct: 197 WSLVGQSNVT 206
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 32/122 (26%), Positives = 63/122 (51%), Gaps = 9/122 (7%)
Query: 260 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRG-ADVKLS 318
L++LP L V+++ + + +A+P +LC+ ++S VP++ F+G A +
Sbjct: 295 LSYLPDAAYQALEKVVTAALGSPSMANPPEPFDLCFQNAAVSGVPDLVFTFQGGATLTAP 354
Query: 319 RSNFFV-------KVSEDIVCSV-FKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
S + + V I+ S + V I G+++Q N +D+E++T+SF+P D
Sbjct: 355 PSKYLLGDGNGNGTVCLSILSSTRLDSADDGVSILGSLLQENVHFLFDLEKETLSFEPAD 414
Query: 371 CT 372
C+
Sbjct: 415 CS 416
>gi|297805186|ref|XP_002870477.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316313|gb|EFH46736.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 287
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 57/158 (36%), Positives = 84/158 (53%), Gaps = 12/158 (7%)
Query: 86 PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKS 145
P + Y + IGTPP E V DTGSD++W C C C +Q+ FDP SS+
Sbjct: 77 PISRIYYTTLQIGTPPREFNVVIDTGSDVLWVSCISC--VGCPLQNVTFFDPGASSSAVK 134
Query: 146 LPCSSSQCAS-LNQKS-CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
L CS +C S L++KS CS + +Y V Y DGSF++G ++ ++ + + +
Sbjct: 135 LACSDKRCFSDLHKKSGCSPL--EYKVEYSDGSFTSGYYISDLISFETVMSSNLTVKSSA 192
Query: 202 -ITFGCGTNNGGLFNSKTT---GIVGLGGGDISLISQM 235
FGC + GL + T GIVGLG G + ++SQ+
Sbjct: 193 PFVFGCSNLHAGLISLPETSIHGIVGLGKGRLLVVSQL 230
>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 75/249 (30%), Positives = 109/249 (43%), Gaps = 25/249 (10%)
Query: 2 ATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRS 61
ATF +F L F V P Q+ + +I S SPF + + + +T +
Sbjct: 8 ATFF--LFALLFSTTKAVDPCATQSDTSDLSVIPIYSKCSPFVPPKQESW--VNTVITMA 63
Query: 62 LNRLNHFNQNSSISSSKASQADIIPNN-----ANYLIRISIGTPPTERLAVADTGSDLIW 116
S+++ K + I P ANY++R+ +GTP + V DT +D W
Sbjct: 64 SKDPERLKYLSTLADQKTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSNDAAW 123
Query: 117 TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSVSYG 173
C S C S F P S+T SL CS +QC+ + SC C ++ SYG
Sbjct: 124 VPC-----SGCTGCSSTTFLPNASTTLGSLDCSGAQCSQVRGFSCPATGSSACLFNQSYG 178
Query: 174 DGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLI 232
S L + +TL + +PG TFGC +GG + G++GLG G ISLI
Sbjct: 179 GDSSLTATLVQDAITLANDV-----IPGFTFGCINAVSGGSIPPQ--GLLGLGRGPISLI 231
Query: 233 SQMRTTIAG 241
SQ +G
Sbjct: 232 SQAGAMYSG 240
>gi|356499109|ref|XP_003518386.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 428
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 87/177 (49%), Gaps = 27/177 (15%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+N + +++G+PP V DTGS+L W C+ P + F+P +SS+Y
Sbjct: 56 HNVTLTVSLTVGSPPQNVTMVLDTGSELSWLHCKKLP------NLNSTFNPLLSSSYTPT 109
Query: 147 PCSSSQCASLNQ-----KSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
PC+SS C + + SC N C VSY D S + G LA ET +L A
Sbjct: 110 PCNSSICTTRTRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSLA-----GAAQ 164
Query: 200 PGITFGCGTNNGGLF----NSKTTGIVGLGGGDISLISQMRT-----TIAGNQRLGV 247
PG FGC + G +SKTTG++G+ G +SL++QM I+G LGV
Sbjct: 165 PGTLFGCMDSAGYTSDINEDSKTTGLMGMNRGSLSLVTQMSLPKFSYCISGEDALGV 221
Score = 44.7 bits (104), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 36/132 (27%), Positives = 60/132 (45%), Gaps = 13/132 (9%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEA--QPVADPT----GSLELCYSF-NSLSQVPE 305
++DSGT TFL S+L + + DP G+++LCY S + VP
Sbjct: 286 MVDSGTQFTFLLGSVYSSLKDEFLEQTKGVLTRIEDPNFVFEGAMDLCYHAPASFAAVPA 345
Query: 306 VTIHFRGADVKLSRSNFFVKVSED---IVCSVFKG---ITNSVPIYGNIMQTNFLVGYDI 359
VT+ F GA++++S +VS+ + C F + + G+ Q N + +D+
Sbjct: 346 VTLVFSGAEMRVSGERLLYRVSKGSDWVYCFTFGNSDLLGIEAYVIGHHHQQNVWMEFDL 405
Query: 360 EQQTVSFKPTDC 371
+ V F T C
Sbjct: 406 LKSRVGFTQTTC 417
>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 436
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 52/158 (32%), Positives = 83/158 (52%), Gaps = 20/158 (12%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+N + + +++G+PP V DTGS+L W C+ P +FDP SS+Y +
Sbjct: 59 HNVSLTVSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHS------VFDPLRSSSYSPI 112
Query: 147 PCSSSQCASLNQK-----SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
PC+S C + + SC C +SY D S GNLA++T +G++ A+P
Sbjct: 113 PCTSPTCRTRTRDFSIPVSCDKKKLCHAIISYADASSIEGNLASDTFHIGNS-----AIP 167
Query: 201 GITFGC---GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
FGC G ++ +SKTTG++G+ G +S ++QM
Sbjct: 168 ATIFGCMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQM 205
>gi|108862257|gb|ABA96612.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 451
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 93/373 (24%), Positives = 143/373 (38%), Gaps = 91/373 (24%)
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMS 140
D+ P+ Y + +SIG PP DTGSDL W QC+ PC C PL+ P +
Sbjct: 50 GDVYPHGL-YYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPC--VSCSKVPHPLYRPTKN 106
Query: 141 STYKSLPCSSSQCASLN-----QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTT 193
K +PC CA+L+ + C C Y + Y D S G L T++ L
Sbjct: 107 ---KLVPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFAL-RLA 162
Query: 194 GQAVALPGITFGCGTNN---GGLFNSKTTGIVGLGGGDISLISQM--------------- 235
++ PG+ FGCG + S T G++GLG G +SL+SQ+
Sbjct: 163 NSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLS 222
Query: 236 -----------------RTTIA--------------------GNQRLGVSTPDIVIDSGT 258
R T A G + LGV ++V DSG+
Sbjct: 223 TRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSGS 282
Query: 259 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS----FNSLSQVPE----VTIHF 310
+ T+ L+ + + P SL LC+ F S+ V + V + F
Sbjct: 283 SFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPLCWKGKKPFKSVLDVKKEFRTVVLSF 342
Query: 311 ---RGADVKLSRSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYDIE 360
+ A +++ N+ + C GI N + I G+I + +V YD E
Sbjct: 343 SNGKKALMEIPPENYLIVTKYGNAC---LGILNGSEVGLKDLNIVGDITMQDQMVIYDNE 399
Query: 361 QQTVSFKPTDCTK 373
+ + + C +
Sbjct: 400 RGQIGWIRAPCDR 412
>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
Length = 429
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 52/158 (32%), Positives = 83/158 (52%), Gaps = 20/158 (12%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+N + + +++G+PP V DTGS+L W C+ P +FDP SS+Y +
Sbjct: 52 HNVSLTVSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHS------VFDPLRSSSYSPI 105
Query: 147 PCSSSQCASLNQK-----SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
PC+S C + + SC C +SY D S GNLA++T +G++ A+P
Sbjct: 106 PCTSPTCRTRTRDFSIPVSCDKKKLCHAIISYADASSIEGNLASDTFHIGNS-----AIP 160
Query: 201 GITFGC---GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
FGC G ++ +SKTTG++G+ G +S ++QM
Sbjct: 161 ATIFGCMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQM 198
>gi|115487628|ref|NP_001066301.1| Os12g0177500 [Oryza sativa Japonica Group]
gi|108862256|gb|ABA96613.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113648808|dbj|BAF29320.1| Os12g0177500 [Oryza sativa Japonica Group]
gi|215693997|dbj|BAG89196.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 421
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 93/373 (24%), Positives = 143/373 (38%), Gaps = 91/373 (24%)
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMS 140
D+ P+ Y + +SIG PP DTGSDL W QC+ PC C PL+ P +
Sbjct: 50 GDVYPHGL-YYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPC--VSCSKVPHPLYRPTKN 106
Query: 141 STYKSLPCSSSQCASLN-----QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTT 193
K +PC CA+L+ + C C Y + Y D S G L T++ L
Sbjct: 107 ---KLVPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFAL-RLA 162
Query: 194 GQAVALPGITFGCGTNN---GGLFNSKTTGIVGLGGGDISLISQM--------------- 235
++ PG+ FGCG + S T G++GLG G +SL+SQ+
Sbjct: 163 NSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLS 222
Query: 236 -----------------RTTIA--------------------GNQRLGVSTPDIVIDSGT 258
R T A G + LGV ++V DSG+
Sbjct: 223 TRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSGS 282
Query: 259 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS----FNSLSQVPE----VTIHF 310
+ T+ L+ + + P SL LC+ F S+ V + V + F
Sbjct: 283 SFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPLCWKGKKPFKSVLDVKKEFRTVVLSF 342
Query: 311 ---RGADVKLSRSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYDIE 360
+ A +++ N+ + C GI N + I G+I + +V YD E
Sbjct: 343 SNGKKALMEIPPENYLIVTKYGNAC---LGILNGSEVGLKDLNIVGDITMQDQMVIYDNE 399
Query: 361 QQTVSFKPTDCTK 373
+ + + C +
Sbjct: 400 RGQIGWIRAPCDR 412
>gi|21717160|gb|AAM76353.1|AC074196_11 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433304|gb|AAP54833.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125575544|gb|EAZ16828.1| hypothetical protein OsJ_32300 [Oryza sativa Japonica Group]
Length = 419
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 56/190 (29%), Positives = 94/190 (49%), Gaps = 18/190 (9%)
Query: 58 LTRSLNRLNHFNQNSSISSSKASQADIIP---NNANYLIRISIGTPPTERLAVADTGSDL 114
L R L++ + + +++ ++P + A+Y+ +IGTPP + D +L
Sbjct: 26 LRRGLDQQGMRGRILADATAAPPGGAVVPLHWSGAHYVANFTIGTPPQAVSGIVDLSGEL 85
Query: 115 IWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSG-VNCQYSVS-- 171
+WTQC C S C+ Q+ P+FDP S+TY++ C S C S+ ++CSG C Y
Sbjct: 86 VWTQCAACRSSGCFKQELPVFDPSASNTYRAEQCGSPLCKSIPTRNCSGDGECGYEAPSM 145
Query: 172 YGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKT---TGIVGLGGGD 228
+GD + G +T+ + +G+ G+ + FGC + G + +G VGLG
Sbjct: 146 FGD---TFGIASTDAIAIGNAEGR------LAFGCVVASDGSIDGAMDGPSGFVGLGRTP 196
Query: 229 ISLISQMRTT 238
SL+ Q T
Sbjct: 197 WSLVGQSNVT 206
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 30/122 (24%), Positives = 63/122 (51%), Gaps = 9/122 (7%)
Query: 260 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSR 319
L++LP L V+++ + + +A+P +LC+ ++S VP++ F+G ++
Sbjct: 295 LSYLPDAAYQALEKVVTAALGSPSMANPPEPFDLCFQNAAVSGVPDLVFTFQGGATLTAQ 354
Query: 320 SNFFV--------KVSEDIVCSV-FKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
+ ++ V I+ S + V I G+++Q N +D+E++T+SF+P D
Sbjct: 355 PSKYLLGDGNGNGTVCLSILSSTRLDSADDGVSILGSLLQENVHFLFDLEKETLSFEPAD 414
Query: 371 CT 372
C+
Sbjct: 415 CS 416
>gi|242067689|ref|XP_002449121.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
gi|241934964|gb|EES08109.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
Length = 358
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 78/279 (27%), Positives = 121/279 (43%), Gaps = 74/279 (26%)
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKM 139
Q ++ P +Y + ++IG P DTGSDL W QC+ PC C PL+ P
Sbjct: 45 QGNVYPT-GHYYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPC--RSCNKVPHPLYRPTA 101
Query: 140 SSTYKSLPCSSSQCASL------NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
+S +PC+++ C +L N K S C Y + Y D + S G L + +L +
Sbjct: 102 NSL---VPCANALCTALHSGHGSNNKCPSPKQCDYQIKYTDSASSQGVLINDNFSLPMRS 158
Query: 194 GQAVALPGITFGCGTN----NGGLFNSKTTGIVGLGGGDISLISQMR------------- 236
PG+TFGCG + G + T G++GLG G +SL+SQ++
Sbjct: 159 SN--IRPGLTFGCGYDQQVGKNGAVQAATDGMLGLGRGSVSLVSQLKQQGITKNVLGHCL 216
Query: 237 -------------------------TTIAGN-------------QRLGVSTPDIVIDSGT 258
I+GN + LGV ++V DSG+
Sbjct: 217 STNGGGFLFFGDDIVPTSRVTWVPMAKISGNYYSPGSGTLYFDRRSLGVKPMEVVFDSGS 276
Query: 259 TLT-FLPQGYNSNLLSVMSSMIEA-QPVADPTGSLELCY 295
T T F Q Y + + ++ S + ++ + V+DP SL LC+
Sbjct: 277 TYTYFTAQPYQAVVSALKSGLSKSLKQVSDP--SLPLCW 313
>gi|218186522|gb|EEC68949.1| hypothetical protein OsI_37668 [Oryza sativa Indica Group]
Length = 421
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 93/373 (24%), Positives = 143/373 (38%), Gaps = 91/373 (24%)
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMS 140
D+ P+ Y + +SIG PP DTGSDL W QC+ PC C PL+ P +
Sbjct: 50 GDVYPHGL-YYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPC--VSCSKVPHPLYRPTKN 106
Query: 141 STYKSLPCSSSQCASLN-----QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTT 193
K +PC CA+L+ + C C Y + Y D S G L T++ L
Sbjct: 107 ---KLVPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFAL-RLA 162
Query: 194 GQAVALPGITFGCGTNN---GGLFNSKTTGIVGLGGGDISLISQM--------------- 235
++ PG+ FGCG + S T G++GLG G +SL+SQ+
Sbjct: 163 NSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLS 222
Query: 236 -----------------RTTIA--------------------GNQRLGVSTPDIVIDSGT 258
R T A G + LGV ++V DSG+
Sbjct: 223 TRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSGS 282
Query: 259 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS----FNSLSQVPE----VTIHF 310
+ T+ L+ + + P SL LC+ F S+ V + V + F
Sbjct: 283 SFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPLCWKGKKPFKSVLDVKKEFKTVVLSF 342
Query: 311 ---RGADVKLSRSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYDIE 360
+ A +++ N+ + C GI N + I G+I + +V YD E
Sbjct: 343 SNGKKALMEIPPENYLIVTKYGNAC---LGILNGSEVGLKDLNIVGDITMQDQMVIYDNE 399
Query: 361 QQTVSFKPTDCTK 373
+ + + C +
Sbjct: 400 RGQIGWIRAPCDR 412
>gi|125528511|gb|EAY76625.1| hypothetical protein OsI_04577 [Oryza sativa Indica Group]
Length = 492
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 53/157 (33%), Positives = 75/157 (47%), Gaps = 23/157 (14%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
N Y+ IGTPP + D SDL+WT C P F+P S+T +
Sbjct: 96 NAGMYVFSYGIGTPPQQVSGALDISSDLVWTACGATAP----------FNPVRSTTVADV 145
Query: 147 PCSSSQCASLNQKSCSG------VNCQYSVSYGDGSF-SNGNLATETVTLGSTTGQAVAL 199
PC+ C ++C C Y+ YG G+ + G L TE T G T +
Sbjct: 146 PCTDDACQQFAPQTCGAGAGAGSSECAYTYMYGGGAANTTGLLGTEAFTFGDTR-----I 200
Query: 200 PGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
G+ FGCG N G F S +G++GLG G++SL+SQ++
Sbjct: 201 DGVVFGCGLQNVGDF-SGVSGVIGLGRGNLSLVSQLQ 236
>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
Length = 339
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 53/140 (37%), Positives = 72/140 (51%), Gaps = 11/140 (7%)
Query: 97 IGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL 156
+GTPP + G++LIW P P +C+ Q P F+P S + LP +S C S
Sbjct: 1 MGTPPNPVKLKLENGNELIWNHSNPSP--ECFEQAFPYFEPLTFS--RGLPFAS--CGS- 53
Query: 157 NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNS 216
K C Y+ SYGD S + G L + T G ++PG+ FGCG N G+F S
Sbjct: 54 -PKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTF---VGAGASVPGVAFGCGLFNNGVFKS 109
Query: 217 KTTGIVGLGGGDISLISQMR 236
TGI G G G +SL SQ++
Sbjct: 110 NETGIAGFGRGPLSLPSQLK 129
Score = 57.8 bits (138), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 64/128 (50%), Gaps = 9/128 (7%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHF 310
+IDSGT++T LP + ++ I+ V C+S S ++ VP++ +HF
Sbjct: 213 IIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHYTCFSAPSQAKPDVPKLVLHF 272
Query: 311 RGADVKLSRSNFFVKVSED----IVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
GA + L R N+ +V +D I+C ++ KG + I GN Q N V YD++ +S
Sbjct: 273 EGATMDLPRENYVFEVPDDAGNSIICLAINKG--DETTIIGNFQQQNMHVLYDLQNNMLS 330
Query: 366 FKPTDCTK 373
F C K
Sbjct: 331 FVAAQCDK 338
>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 629
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 95/369 (25%), Positives = 150/369 (40%), Gaps = 80/369 (21%)
Query: 76 SSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
S++ D + N Y R+ IGTPP E + D+GS + + C C QC P F
Sbjct: 70 SARMRLHDDLLTNGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCASC--EQCGNHQDPRF 127
Query: 136 DPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
P +SSTY + CS+ C + KS C Y Y + S S+G L + V+ G T
Sbjct: 128 QPDLSSTYSPVKCSAD-CTCDSDKS----QCTYERQYAEMSSSSGVLGEDIVSFG--TES 180
Query: 196 AVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTI------------- 239
+ FGC + G LF+ GI+GLG G +S++ Q+ + I
Sbjct: 181 ELKPQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDI 240
Query: 240 -AGNQRLGV--STPDIVID-------------------SGTTLTFLPQGYNSNLLSVMSS 277
G LG + PD+V +G L P+ ++S +V+ S
Sbjct: 241 GGGAMVLGAMPAPPDMVFSRSDPVRSPYYNIELKEIHVAGKALRLDPRIFDSKHGTVLDS 300
Query: 278 MI-------------------EAQPVADPTGS----LELCYS-----FNSLSQV-PEVTI 308
+ +P+ G ++C++ + LSQ P+V +
Sbjct: 301 GTTYAYLPEQAFVAFKDAVTSKVRPLKKIRGPDPNYKDICFAGAGRNVSQLSQAFPDVDM 360
Query: 309 HF-RGADVKLSRSNFFVKVS--EDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 364
F G + LS N+ + S E C VF+ + + G I+ N LV YD + +
Sbjct: 361 VFGDGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKI 420
Query: 365 SFKPTDCTK 373
F T+C++
Sbjct: 421 GFWKTNCSE 429
>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 632
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 54/174 (31%), Positives = 85/174 (48%), Gaps = 10/174 (5%)
Query: 63 NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
+R H + + S+ S+ D + N Y R+ IGTPP + D+GS + + C C
Sbjct: 65 HRKLHKSDSKSLPHSRMRLYDDLLINGYYTTRLWIGTPPQMFALIVDSGSTVTYVPCSDC 124
Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNL 182
QC P F P+MSSTY+ + C + C + + C Y Y + S S G L
Sbjct: 125 --EQCGKHQDPKFQPEMSSTYQPVKC-NMDCNCDDDRE----QCVYEREYAEHSSSKGVL 177
Query: 183 ATETVTLGSTTGQAVALPGITFGCGT-NNGGLFNSKTTGIVGLGGGDISLISQM 235
+ ++ G+ + + FGC T G L++ + GI+GLG GD+SL+ Q+
Sbjct: 178 GEDLISFGNES--QLTPQRAVFGCETVETGDLYSQRADGIIGLGQGDLSLVDQL 229
>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 631
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 58/195 (29%), Positives = 96/195 (49%), Gaps = 17/195 (8%)
Query: 42 PFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPP 101
P SS P R+ D R L++ S + ++ D + +N Y R+ IGTPP
Sbjct: 34 PLSYSSLPPRPRVEDFRRRRLHQ-------SQLPNAHMKLYDDLLSNGYYTTRLWIGTPP 86
Query: 102 TERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC 161
E + DTGS + + C C QC P F P++S++Y++L C+ C ++
Sbjct: 87 QEFALIVDTGSTVTYVPCSTC--KQCGKHQDPKFQPELSTSYQALKCNPD-CNCDDE--- 140
Query: 162 SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNGGLFNSKTTG 220
G C Y Y + S S+G L+ + ++ G+ + ++ FGC G LF+ + G
Sbjct: 141 -GKLCVYERRYAEMSSSSGVLSEDLISFGNES--QLSPQRAVFGCENEETGDLFSQRADG 197
Query: 221 IVGLGGGDISLISQM 235
I+GLG G +S++ Q+
Sbjct: 198 IMGLGRGKLSVVDQL 212
>gi|359476197|ref|XP_003631803.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 414
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 92/350 (26%), Positives = 133/350 (38%), Gaps = 94/350 (26%)
Query: 100 PPTERLAVADTGSDLI-WTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ 158
PP+ + +A+ D I WTQC+PC +C FDP S TY C S
Sbjct: 83 PPSPQEILAEMNPDSITWTQCKPC--VRCLKDSHRHFDPSASLTYSLGSCIPST------ 134
Query: 159 KSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGG------ 212
V Y+++YGD S S GN +T+TL + P FGCG NN G
Sbjct: 135 -----VGNTYNMTYGDKSTSVGNYGCDTMTLEPSD----VFPKFQFGCGRNNEGDFGSGA 185
Query: 213 --------------------------------------LFNSKTT--------------G 220
LF K T G
Sbjct: 186 DGMLGLGQGQLSTVSQTASKFKKVFSYCLPEEDSIGSLLFGEKATSQSSLKFTSLVNGPG 245
Query: 221 IVGLGGGDISLISQMRTTIAGNQRLGV-----STPDIVIDSGTTLTFLPQGYNSNLLSVM 275
GL + + ++ GN+RL V ++P +IDSGT +T LPQ S L +
Sbjct: 246 TSGLEESGYYFVKLLDISV-GNKRLNVPSSVFASPGTIIDSGTVITCLPQRAYSALTAAF 304
Query: 276 SSMIEAQPVADPTGS----LELCYSFNSLSQV--PEVTIHF-RGADVKLSRSNFFVKVSE 328
+ P+++ L+ CY+ + V PE+ +HF GADV+L+
Sbjct: 305 KKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGEGADVRLNGKRVIWGNDA 364
Query: 329 DIVCSVFKG-----ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
+C F G + + + I GN Q + V YDI+ + F C+K
Sbjct: 365 SRLCLAFAGNSKSTMNSELTIIGNRQQVSLTVLYDIQGGRIGFGGNGCSK 414
>gi|222613193|gb|EEE51325.1| hypothetical protein OsJ_32293 [Oryza sativa Japonica Group]
Length = 371
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 77/314 (24%), Positives = 128/314 (40%), Gaps = 72/314 (22%)
Query: 126 QCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATE 185
C+ QD P+F P SST+K PC + C S+ C+ C Y G G + G +AT+
Sbjct: 60 HCFKQDLPVFVPNASSTFKPEPCGTDVCKSIPTPKCASDVCAYDGVTGLGGHTVGIVATD 119
Query: 186 TVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT------- 238
T +G+ G ++ + + +G +GLG SL++QM+ T
Sbjct: 120 TFAIGTAAPARPPASGASWRATSTPW----AGPSGFIGLGRTPWSLVAQMKLTRFSYCLA 175
Query: 239 ---IAGNQRL-----------GVSTPDIV-----------------IDSGTTLTFLPQGY 267
N RL G TP + I +G +P+G
Sbjct: 176 PHDTGKNSRLFLGASAKLAGGGAWTPFVKTSPNDGMSQYYPIELEEIKAGDATITMPRGR 235
Query: 268 NSNLL--------------------SVMSSMIEAQPVADPTGS-LELCYSFNSLSQVPEV 306
N+ L+ +VM+S + A P A P G+ E+C+ +S P++
Sbjct: 236 NTVLVQTAVVRVSLLVDSVYQEFKKAVMAS-VGAAPTATPVGAPFEVCFPKAGVSGAPDL 294
Query: 307 TIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGIT-------NSVPIYGNIMQTNFLVGYD 358
F+ GA + + +N+ V D VC I + + I G+ Q N + +D
Sbjct: 295 VFTFQAGAALTVPPANYLFDVGNDTVCLSVMSIALLNITALDGLNILGSFQQENVHLLFD 354
Query: 359 IEQQTVSFKPTDCT 372
+++ +SF+P DC+
Sbjct: 355 LDKDMLSFEPADCS 368
>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 645
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 97/388 (25%), Positives = 154/388 (39%), Gaps = 88/388 (22%)
Query: 65 LNHFNQNSSISSSKASQA--------DIIPNNANYLIRISIGTPPTERLAVADTGSDLIW 116
+HFN + S + D + N Y R+ IGTPP + DTGS + +
Sbjct: 59 FSHFNPRRQLKESDSEHHPNARMRLYDDLLRNGYYTARLWIGTPPQRFALIVDTGSTVTY 118
Query: 117 TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGS 176
C C C P F P+ S TY+ + C + QC N + C Y Y + S
Sbjct: 119 VPCSTC--RHCGSHQDPKFRPEDSETYQPVKC-TWQCNCDNDRK----QCTYERRYAEMS 171
Query: 177 FSNGNLATETVTLGSTTGQAVALPGITFGCGTNN-GGLFNSKTTGIVGLGGGDISLISQM 235
S+G L + V+ G+ T ++ FGC + G ++N + GI+GLG GD+S++ Q+
Sbjct: 172 TSSGALGEDVVSFGNQT--ELSPQRAIFGCENDETGDIYNQRADGIMGLGRGDLSIMDQL 229
Query: 236 --RTTIAGNQRL---------------GVSTP-DIV-------------ID------SGT 258
+ I+ + L G+S P D+V ID +G
Sbjct: 230 VEKKVISDSFSLCYGGMGVGGGAMVLGGISPPADMVFTRSDPVRSPYYNIDLKEIHVAGK 289
Query: 259 TLTFLPQGYNSNLLSVMSS------------MIEAQPVADPTGSL-----------ELCY 295
L P+ ++ +V+ S + + T SL ++C+
Sbjct: 290 RLHLNPKVFDGKHGTVLDSGTTYAYLPESAFLAFKHAIMKETHSLKRISGPDPRYNDICF 349
Query: 296 SFNSL--SQV----PEVTIHF-RGADVKLSRSNFFV---KVSEDIVCSVFKGITNSVPIY 345
S + SQ+ P V + F G + LS N+ KV VF + +
Sbjct: 350 SGAEIDVSQISKSFPVVEMVFGNGHKLSLSPENYLFRHSKVRGAYCLGVFSNGNDPTTLL 409
Query: 346 GNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
G I+ N LV YD E + F T+C++
Sbjct: 410 GGIVVRNTLVMYDREHTKIGFWKTNCSE 437
>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
Length = 586
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 58/195 (29%), Positives = 96/195 (49%), Gaps = 17/195 (8%)
Query: 42 PFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPP 101
P SS P R+ D R L++ S + ++ D + +N Y R+ IGTPP
Sbjct: 34 PLSYSSLPPRPRVEDFRRRRLHQ-------SQLPNAHMKLYDDLLSNGYYTTRLWIGTPP 86
Query: 102 TERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC 161
E + DTGS + + C C QC P F P++S++Y++L C+ C ++
Sbjct: 87 QEFALIVDTGSTVTYVPCSTC--KQCGKHQDPKFQPELSTSYQALKCNPD-CNCDDE--- 140
Query: 162 SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNGGLFNSKTTG 220
G C Y Y + S S+G L+ + ++ G+ + ++ FGC G LF+ + G
Sbjct: 141 -GKLCVYERRYAEMSSSSGVLSEDLISFGNES--QLSPQRAVFGCENEETGDLFSQRADG 197
Query: 221 IVGLGGGDISLISQM 235
I+GLG G +S++ Q+
Sbjct: 198 IMGLGRGKLSVVDQL 212
>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
Length = 442
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 61/191 (31%), Positives = 93/191 (48%), Gaps = 18/191 (9%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+N + + +++GTPP V DTGS+L W C P + + F P+ S T+ S+
Sbjct: 62 HNVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASV 121
Query: 147 PCSSSQCASLNQKS---CSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
PC S+QC S + S C G + C+ S+SY DGS S+G LATE T+G A
Sbjct: 122 PCDSAQCRSRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVFTVGQGPPLRAAFGC 181
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPD-----IVIDS 256
+ T+ G+ T G++G+ G +S +SQ A +R D +++
Sbjct: 182 MATAFDTSPDGV---ATAGLLGMNRGALSFVSQ-----ASTRRFSYCISDRDDAGVLLLG 233
Query: 257 GTTLTFLPQGY 267
+ L FLP Y
Sbjct: 234 HSDLPFLPLNY 244
>gi|110738505|dbj|BAF01178.1| hypothetical protein [Arabidopsis thaliana]
Length = 284
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 54/174 (31%), Positives = 85/174 (48%), Gaps = 10/174 (5%)
Query: 63 NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
+R H + + S+ S+ D + N Y R+ IGTPP + D+GS + + C C
Sbjct: 65 HRKLHKSDSKSLPHSRMRLYDDLLINGYYTTRLWIGTPPQMFALIVDSGSTVTYVPCSDC 124
Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNL 182
QC P F P+MSSTY+ + C + C + + C Y Y + S S G L
Sbjct: 125 --EQCGKHQDPKFQPEMSSTYQPVKC-NMDCNCDDDRE----QCVYEREYAEHSSSKGVL 177
Query: 183 ATETVTLGSTTGQAVALPGITFGCGT-NNGGLFNSKTTGIVGLGGGDISLISQM 235
+ ++ G+ + + FGC T G L++ + GI+GLG GD+SL+ Q+
Sbjct: 178 GEDLISFGNES--QLTPQRAVFGCETVETGDLYSQRADGIIGLGQGDLSLVDQL 229
>gi|115457778|ref|NP_001052489.1| Os04g0337000 [Oryza sativa Japonica Group]
gi|113564060|dbj|BAF14403.1| Os04g0337000, partial [Oryza sativa Japonica Group]
Length = 321
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 53/158 (33%), Positives = 76/158 (48%), Gaps = 13/158 (8%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y I IGTP DTGSD++W C+ CP + L+DPK SST +
Sbjct: 33 YYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKVS 92
Query: 148 CSSSQCAS----LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
C CA+ L + + C+YSV+YGDGS + G ++ + +G P
Sbjct: 93 CDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRPANS 152
Query: 202 -ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM 235
+TFGCG+ GG N GI+G G + S++SQ+
Sbjct: 153 TVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQL 190
>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 431
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 71/239 (29%), Positives = 113/239 (47%), Gaps = 28/239 (11%)
Query: 8 VFILFFLCFYVVSPIE------AQTGGFSVELIHRDSPKSPFYNSSETPYQR-LRDALTR 60
+F L FL F + + Q G ++++ H SP SPF+ S ++ + +
Sbjct: 5 LFSLAFLFFTLAQGMHLNPKCGIQDQGSNLQVFHVYSPCSPFWPSKPLKWEESVLQMQAK 64
Query: 61 SLNRLNHFNQNSSISSSK-----ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLI 115
RL SS+ + K AS I+ + Y++R IGTP L DT +D
Sbjct: 65 DQARLQFL---SSLVARKSVVPIASGRQIV-QSPTYIVRAKIGTPAQTMLLAMDTSNDAA 120
Query: 116 WTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDG 175
W C S C S +F+ S+T+K++ C + QC + C G C ++++YG
Sbjct: 121 WIPC-----SGCVGCSSTVFNNVKSTTFKTVGCEAPQCKQVPNSKCGGSACAFNMTYGSS 175
Query: 176 SFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
S + NL+ + VTL + + +P TFGC T G + G++GLG G +SL+SQ
Sbjct: 176 SIA-ANLSQDVVTLATDS-----IPSYTFGCLTEATG-SSIPPQGLLGLGRGPMSLLSQ 227
>gi|357502759|ref|XP_003621668.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355496683|gb|AES77886.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 481
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 90/365 (24%), Positives = 144/365 (39%), Gaps = 86/365 (23%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I IGTP + DTG+D++W QC+ CP D L++ K SS+ K +P
Sbjct: 73 YYAKIGIGTPSKDYYLQVDTGTDMMWVNCIQCKECPTRSNLGMDLTLYNIKESSSGKLVP 132
Query: 148 CSSSQCASLNQKSCSGV------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVA 198
C C +N +G +C Y YGDGS + G + V +G A A
Sbjct: 133 CDQELCKEINGGLLTGCTSKTNDSCPYLEIYGDGSSTAGYFVKDVVLFDQVSGDLKTASA 192
Query: 199 LPGITFGCGTNNGGLFN----SKTTGIVGLG----------------------------G 226
+ FGCG G + GI+G G G
Sbjct: 193 NGSVIFGCGARQSGDLSYSNEEALDGILGFGKANYSMISQLSSSGKVKKMFAHCLNGVNG 252
Query: 227 GDISLISQ---------------------MRTTIAGNQRLGVST--------PDIVIDSG 257
G I I M G+ L +ST +IDSG
Sbjct: 253 GGIFAIGHVVQPTVNTTPLLPDQPHYSVNMTAIQVGHTFLNLSTDASEQRDSKGTIIDSG 312
Query: 258 TTLTFLPQG-YNSNLLSVMSSM--IEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GA 313
TTL +LP G Y + ++S ++ Q + D + YS + P VT +F G
Sbjct: 313 TTLAYLPDGIYQPLVYKILSQQPNLKVQTLHDEYTCFQ--YSGSVDDGFPNVTFYFENGL 370
Query: 314 DVKLSRSNFFVKVSEDIVCSVFKG------ITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
+K+ ++ +SE++ C ++ + ++ + G+++ +N LV YD+E Q + +
Sbjct: 371 SLKVYPHDYLF-LSENLWCIGWQNSGAQSRDSKNMTLLGDLVLSNKLVFYDLENQVIGWT 429
Query: 368 PTDCT 372
+C+
Sbjct: 430 EYNCS 434
>gi|449464178|ref|XP_004149806.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 437
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 92/363 (25%), Positives = 150/363 (41%), Gaps = 90/363 (24%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y + I+IG D+GSDL W QC+ P + C L+ P ++ L C
Sbjct: 55 YSVSINIGKGDEAFEFDIDSGSDLTWVQCD-APCTHCTKPREQLYKPNNNA----LNCFE 109
Query: 151 SQCASLN---QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
C SL+ C + CQY + Y D S G L + V L T G ++A P I FG
Sbjct: 110 PLCTSLHPITNHHCKSADDQCQYEIEYADHGSSLGVLVNDHVPLKLTNG-SLAAPRIAFG 168
Query: 206 CGTNNGGLF---NSKTTGIVGLGGGDISLISQMRT-----TIAGN--------------- 242
CG ++ + T G++GLG G++S ISQ+ + + G+
Sbjct: 169 CGYDHKYSVPDSSPPTAGVLGLGNGEVSFISQLSSMGVVRNVVGHCLSDEGGFLFFGDEF 228
Query: 243 -------------------------------QRLGVSTPDIVIDSGTTLTFL-PQGYNSN 270
+ G+ +V DSG++ T+ Q YNS
Sbjct: 229 VPSSGVTWTSMSHESIGSYYSSGPAEVYFSGKATGIKDLTLVFDSGSSYTYFNSQAYNS- 287
Query: 271 LLSVMSSMIEAQPVADP--TGSLELCYS----FNSLSQVPE----VTIHF---RGADVKL 317
+L+++ + + +P+ D SL +C+ F SL V + + + F + A ++L
Sbjct: 288 ILALVKNNLRGKPLEDAPEDKSLPVCWKGTRPFKSLRDVKKYFNPLALRFTKTKNAQIQL 347
Query: 318 SRSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
N+ + VC GI N + I G+I + +V YD E++ + + PT+
Sbjct: 348 PPENYLIITKYGNVCF---GILNGTEVGLGDLNIIGDISLKDKMVIYDNERRRIGWFPTN 404
Query: 371 CTK 373
C K
Sbjct: 405 CNK 407
>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
Length = 454
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 59/160 (36%), Positives = 75/160 (46%), Gaps = 13/160 (8%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y RI +GTPP DTGSD++W C+P CP + FDP+ SST L
Sbjct: 41 YYTRIELGTPPRPFYVQIDTGSDILWVNCKPCNACPLTSGLGVALNFFDPRGSSTASPLS 100
Query: 148 CSSSQCASLNQKSCSGVN----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAV---ALP 200
C S+C S NQ S S C YS YGDGS + G ++ Q V A
Sbjct: 101 CIDSKCVSSNQISESVCTTDRYCGYSFEYGDGSGTLGYYVSDEFDYNQYVNQYVTNNASA 160
Query: 201 GITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRT 237
ITFGC N G + GI G G D+S++SQ+ +
Sbjct: 161 KITFGCSYNQSGDLTKPDRAVDGIFGFGQNDLSVVSQLNS 200
>gi|115484503|ref|NP_001065913.1| Os11g0183900 [Oryza sativa Japonica Group]
gi|62954888|gb|AAY23257.1| nucellin-like aspartic protease [Oryza sativa Japonica Group]
gi|77549017|gb|ABA91814.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644617|dbj|BAF27758.1| Os11g0183900 [Oryza sativa Japonica Group]
gi|222615638|gb|EEE51770.1| hypothetical protein OsJ_33210 [Oryza sativa Japonica Group]
Length = 418
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 92/368 (25%), Positives = 159/368 (43%), Gaps = 84/368 (22%)
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
D+ P +Y + ++IG P DTGSDL W QC+ P C PL+ P +
Sbjct: 49 GDVYPT-GHYYVTMNIGDPAKPYFLDVDTGSDLTWLQCD-APCQSCNKVPHPLYRPTKN- 105
Query: 142 TYKSLPCSSSQCASL------NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
K +PC++S C +L N+K + C Y + Y D + S G L ++ +L +
Sbjct: 106 --KLVPCANSICTALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVMDSFSL-PLRNK 162
Query: 196 AVALPGITFGCGTN----NGGLFNSKTTGIVGLGGGDISLISQMR--------------- 236
+ P ++FGCG + G + T G++GLG G +SL+SQ++
Sbjct: 163 SNVRPSLSFGCGYDQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCLST 222
Query: 237 ------------------------TTIAGN-----------QRLGVSTP--DIVIDSGTT 259
+ +GN R +ST ++V DSG+T
Sbjct: 223 SGGGFLFFGDDMVPTSRVTWVSMVRSTSGNYYSPGSATLYFDRRSLSTKPMEVVFDSGST 282
Query: 260 LTFL-PQGYNSNLLSVMSSMIEA-QPVADPTGSLELCY----SFNSLSQVPE--VTIHF- 310
T+ Q Y + + ++ S+ ++ + V+DP SL LC+ +F S+S V + ++ F
Sbjct: 283 YTYFSAQPYQATISAIKGSLSKSLKQVSDP--SLPLCWKGQKAFKSVSDVKKDFKSLQFI 340
Query: 311 --RGADVKLSRSNFFVKVSEDIVC-SVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVS 365
+ A + + N+ + VC + G S I G+I + +V YD E+ +
Sbjct: 341 FGKNAVMDIPPENYLIITKNGNVCLGILDGSAAKLSFSIIGDITMQDQMVIYDNEKAQLG 400
Query: 366 FKPTDCTK 373
+ C++
Sbjct: 401 WIRGSCSR 408
>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
Length = 441
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 61/191 (31%), Positives = 93/191 (48%), Gaps = 18/191 (9%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+N + + +++GTPP V DTGS+L W C P + + F P+ S T+ S+
Sbjct: 61 HNVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASV 120
Query: 147 PCSSSQCASLNQKS---CSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
PC S+QC S + S C G + C+ S+SY DGS S+G LATE T+G A
Sbjct: 121 PCGSAQCRSRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVFTVGQGPPLRAAFGC 180
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPD-----IVIDS 256
+ T+ G+ T G++G+ G +S +SQ A +R D +++
Sbjct: 181 MATAFDTSPDGV---ATAGLLGMNRGALSFVSQ-----ASTRRFSYCISDRDDAGVLLLG 232
Query: 257 GTTLTFLPQGY 267
+ L FLP Y
Sbjct: 233 HSDLPFLPLNY 243
>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
Length = 448
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 69/191 (36%), Positives = 93/191 (48%), Gaps = 25/191 (13%)
Query: 57 ALTRSLNRLNHFNQNSSISSSKASQADIIP--NNANYLIRISIGTPPTERLAVADTGSDL 114
A RS RL+ + +S ++A + Y+++ SIG PP A DTGSDL
Sbjct: 57 AAERSRRRLSVY------TSGTGTKAPVTKSQKGGKYIMQFSIGEPPLLIWAEVDTGSDL 110
Query: 115 IWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ-----KSCSGVN--CQ 167
+W +C PC + C SPL+DP S + LPCSS C +L + CS C
Sbjct: 111 MWVKCSPC--NGCNPPPSPLYDPARSRSSGKLPCSSQLCQALGRGRIISDQCSDDPPLCG 168
Query: 168 YSVSYGD-GSFS-NGNLATETVTLGSTTGQAVALPGITFG-CGTNNGGLFNSKTTGIVGL 224
Y +YG G S G L TET T G ++FG T +G F T G+VGL
Sbjct: 169 YHYAYGHSGDHSTQGVLGTETFTF----GDGYVANNVSFGRSDTIDGSQFGG-TAGLVGL 223
Query: 225 GGGDISLISQM 235
G G +SL+SQ+
Sbjct: 224 GRGHLSLVSQL 234
>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 476
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 58/208 (27%), Positives = 92/208 (44%), Gaps = 25/208 (12%)
Query: 50 PYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN--------YLIRISIGTPP 101
P QR + RSL+ + + + A +P N Y ++ +G+P
Sbjct: 26 PVQRKFNGPHRSLDAIKAHDDRRR---GRFLAAIDVPLGGNGLPSSTGLYYTKVGLGSPA 82
Query: 102 TERLAVADTGSDLIWTQCE---PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ 158
E DTGSD++W C CP D L+DP S T ++PC C
Sbjct: 83 KEFYVQVDTGSDILWVNCAGCTACPKKSGLGMDLTLYDPNGSKTSNAVPCGDGFCTDTYS 142
Query: 159 KSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP---GITFGCGTNNG 211
SG ++C YS++YGDGS ++G+ +++T +G P + FGCG
Sbjct: 143 GPISGCKQDMSCPYSITYGDGSTTSGSFVNDSLTFDEVSGNLHTKPDNSSVIFGCGAKQS 202
Query: 212 GLFNSKT----TGIVGLGGGDISLISQM 235
G +S + GI+G G + S++SQ+
Sbjct: 203 GSLSSNSDEALDGIIGFGQANSSVLSQL 230
Score = 43.1 bits (100), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 34/129 (26%), Positives = 57/129 (44%), Gaps = 11/129 (8%)
Query: 253 VIDSGTTLTFLPQG-YNSNLLSVMSSM--IEAQPVADPTGSLELCYSFNSLSQVPEVTIH 309
+IDSGTTL +LP YN L V+ ++ V D YS P V H
Sbjct: 305 IIDSGTTLAYLPLSIYNQLLPKVLGRQPGLKLMIVEDQFTCFH--YSDKLDEGFPVVKFH 362
Query: 310 FRGADVKLSRSNFFVKVSEDIVCSVFKGIT------NSVPIYGNIMQTNFLVGYDIEQQT 363
F G + + ++ EDI C ++ + + + G+++ +N LV YD+E
Sbjct: 363 FEGLSLTVHPHDYLFLYKEDIYCIGWQKSSTQTKEGRDLILIGDLVLSNKLVVYDLENMV 422
Query: 364 VSFKPTDCT 372
+ + +C+
Sbjct: 423 IGWTNFNCS 431
>gi|383165471|gb|AFG65613.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
Length = 136
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 48/111 (43%), Positives = 65/111 (58%), Gaps = 1/111 (0%)
Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVT 188
Q +P++DP SSTY + C S C +L C S C+Y +YGD S + G L+ ET+T
Sbjct: 2 QPTPIYDPARSSTYSKVSCKSLLCNALPDFECKSAAGCEYQYTYGDFSITVGILSYETLT 61
Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
L S +G +P FGCG NN G + GIVGLG G +SLISQ+ ++
Sbjct: 62 LTSKSGAEQLIPNFAFGCGQNNEGNGFDQGAGIVGLGRGPLSLISQLSASM 112
>gi|242051593|ref|XP_002454942.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
gi|241926917|gb|EES00062.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
Length = 431
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 103/418 (24%), Positives = 160/418 (38%), Gaps = 86/418 (20%)
Query: 21 PIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQN--SSISSSK 78
P+ GF EL H P+ SS + R + S R+ +S
Sbjct: 30 PVAGSDAGFRAELHH------PYAGSSLPVHDMWRRSARASKARVARLEARLTGDMSVPL 83
Query: 79 ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
A +D Y + I IGTPP +ADT SDL WTQC + Q PLFDP
Sbjct: 84 ARISD-----EGYTVTIGIGTPPQLHTLIADTASDLTWTQCNLF--NDTAKQVEPLFDPA 136
Query: 139 MSSTYKSLPCSSSQCASLN--QKSCSGVNCQY---------------------------- 168
SS++ + CSS C N K CS C+Y
Sbjct: 137 KSSSFAFVTCSSKLCTEDNPGTKRCSNKTCRYVYPYVSVEAAGVLAYESFTLSDNNQHIC 196
Query: 169 -SVSYGDGSFSNGNLATETVTLGSTTG-----QAVALPGIT---------------FGCG 207
S +G G+ ++GNL + LG + +A+P + FG
Sbjct: 197 MSFGFGCGALTDGNLLGASGILGMSPAILSMVSQLAIPKFSYCLTPYTDRKSSPLFFGAW 256
Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGY 267
+ G KTTG + + + ++ G +RL V + G T+ L
Sbjct: 257 ADLG---RYKTTGPIQKSLTFYYYVPLVGLSL-GTRRLDVPAATFALKQGGTVVDLGCTV 312
Query: 268 NSNLLSVMSSMIEA------QPVADPT-GSLELCYSFNS-----LSQVPEVTIHFRG-AD 314
+++ EA P+ + T ++C++ S Q P + ++F G AD
Sbjct: 313 GQLAEPAFTALKEAVLHTLNLPLTNRTVKDYKVCFALPSGVAMGAVQTPPLVLYFDGGAD 372
Query: 315 VKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+ L R N+F + + ++C ++ G + I GN+ Q NF + +D+ F PT C
Sbjct: 373 MVLPRDNYFQEPTAGLMCLALVPG--GGMSIIGNVQQQNFHLLFDVHDSKFLFAPTIC 428
>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 633
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 54/174 (31%), Positives = 85/174 (48%), Gaps = 10/174 (5%)
Query: 63 NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
+R H + + S+ S+ D + N Y R+ IGTPP + D+GS + + C C
Sbjct: 66 HRKLHKSDSKSLPHSRMRLYDDLLINGYYTTRLWIGTPPQMFALIVDSGSTVTYVPCSDC 125
Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNL 182
QC P F P++SSTY+ + C + C + K C Y Y + S S G L
Sbjct: 126 --EQCGKHQDPKFQPELSSTYQPVKC-NMDCNCDDDKE----QCVYEREYAEHSSSKGVL 178
Query: 183 ATETVTLGSTTGQAVALPGITFGCGT-NNGGLFNSKTTGIVGLGGGDISLISQM 235
+ ++ G+ + + FGC T G L++ + GI+GLG GD+SL+ Q+
Sbjct: 179 GEDLISFGNES--QLTPQRAVFGCETVETGDLYSQRADGIIGLGQGDLSLVDQL 230
>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 427
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 57/160 (35%), Positives = 81/160 (50%), Gaps = 22/160 (13%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+N I ++IG+PP V DTGS+L W C+ P + F+P +SS+Y
Sbjct: 55 HNVTLTISLTIGSPPQNVTMVLDTGSELSWLHCKKLP------NLNSTFNPLLSSSYTPT 108
Query: 147 PCSSSQCASLNQK-----SCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
PC+SS C + + SC N C VSY D S + G LA ET +L A
Sbjct: 109 PCNSSVCMTRTRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSLA-----GAAQ 163
Query: 200 PGITFGCGTNNGGL----FNSKTTGIVGLGGGDISLISQM 235
PG FGC + G ++KTTG++G+ G +SL++QM
Sbjct: 164 PGTLFGCMDSAGYTSDINEDAKTTGLMGMNRGSLSLVTQM 203
Score = 45.1 bits (105), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 42/136 (30%), Positives = 64/136 (47%), Gaps = 21/136 (15%)
Query: 253 VIDSGTTLTFL-PQGYNS---NLLSVMSSMIEAQPVADPT----GSLELCYSF-NSLSQV 303
++DSGT TFL YNS L ++ + DP G+++LCY SL+ V
Sbjct: 285 MVDSGTQFTFLLGPVYNSLKDEFLEQTKGVLTR--IEDPNFVFEGAMDLCYHAPASLAAV 342
Query: 304 PEVTIHFRGADVKLSRSNFFVKVSED---IVCSVFK-----GITNSVPIYGNIMQTNFLV 355
P VT+ F GA++++S +VS+ + C F GI V G+ Q N +
Sbjct: 343 PAVTLVFSGAEMRVSGERLLYRVSKGRDWVYCFTFGNSDLLGIEAYV--IGHHHQQNVWM 400
Query: 356 GYDIEQQTVSFKPTDC 371
+D+ + V F T C
Sbjct: 401 EFDLVKSRVGFTETTC 416
>gi|222637182|gb|EEE67314.1| hypothetical protein OsJ_24556 [Oryza sativa Japonica Group]
Length = 304
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 92/327 (28%), Positives = 145/327 (44%), Gaps = 70/327 (21%)
Query: 93 IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP-----LFDPKMSSTYKSLP 147
+ +++GTPP A+ SDL W +C PC S C +P L+D SS++ P
Sbjct: 1 MELAVGTPPVTVQALFGI-SDLCWVECTPC--SGCNNNAAPPAGARLYDRANSSSFS--P 55
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
+ ++C G Y + D ++ G L TET+ GS A + TFGC
Sbjct: 56 LADTEC---------GYRYVYGATDTDRNYVKGILGTETIKFGSN--DAATVQSFTFGC- 103
Query: 208 TN---NGGLFNSKTTGIVGLGGGDISLISQMRTT-----IAGNQRL-------------- 245
TN LF+ T G+VGLG +SL+ Q+ +A N +
Sbjct: 104 TNTVYRNDLFDGNT-GVVGLGRSKLSLVGQLGLDRFSYCLASNPNVASPVLFGSTASMDG 162
Query: 246 -GVS-TPDIVIDSGTTLTFLPQGYNSNLLSV------MSSMIEAQPVADPTGSLELCYSF 297
GVS TP + D+ + L + L++ MS EA GS LC+
Sbjct: 163 NGVSSTPLLPDDANYYVNLLGISVDGTRLAIPNDTARMSRTYEAV-----NGSGLLCFLV 217
Query: 298 NSLSQ----VPEVTIHFRGADVKLSRSNFFVKVSE-------DIVCSVFKGITNSVPIYG 346
+ S+ VP +T+HF G D++L N+F + D++C + G +++ G
Sbjct: 218 DDASKNVVTVPTMTMHFDGMDMELLFGNYFAYTGKQSGGGGGDVLC-LMIGKSSTGSRIG 276
Query: 347 NIMQTNFLVGYDIEQQTVSFKPTDCTK 373
N +Q +F V Y+++ +S +P DC K
Sbjct: 277 NYLQMDFHVLYELKNSVLSVQPADCGK 303
>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
Length = 633
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 60/180 (33%), Positives = 88/180 (48%), Gaps = 13/180 (7%)
Query: 58 LTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWT 117
L+ S L +S+ ++ D+IP Y RI IGTPP + DTGS L +
Sbjct: 60 LSHSRRHLQRSESHSTATARMPLYDDLIPY-GYYTTRIWIGTPPQTFALIVDTGSTLTYV 118
Query: 118 QCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSF 177
C C QC P F P SSTY+ L C S +C ++ ++C Y Y + S
Sbjct: 119 PCSTC--EQCGKHQDPNFQPDWSSTYQPLKC-SMECTCDSEM----MHCVYDRQYAEMSS 171
Query: 178 SNGNLATETVTLGSTTGQAVALPGIT-FGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
S+G L + V+ G Q+ P T FGC G +++ + GI+GLG GD+S++ Q+
Sbjct: 172 SSGVLGEDIVSFGK---QSELKPQRTVFGCENVETGDIYSQRADGIMGLGRGDLSIVDQL 228
>gi|224090744|ref|XP_002309070.1| predicted protein [Populus trichocarpa]
gi|222855046|gb|EEE92593.1| predicted protein [Populus trichocarpa]
Length = 404
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 52/158 (32%), Positives = 86/158 (54%), Gaps = 20/158 (12%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+N + ++ +++GTPP V DTGS+L W C + FDP S++Y+++
Sbjct: 27 HNVSLIVSLTVGTPPQNVSMVIDTGSELSWLHCNKT------LSYPTTFDPTRSTSYQTI 80
Query: 147 PCSSSQCASLNQK-----SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
PCSS C + Q SC N C ++SY D S S+GNLA++ +GS+ +
Sbjct: 81 PCSSPTCTNRTQDFPIPASCDSNNLCHATLSYADASSSDGNLASDVFHIGSSD-----IS 135
Query: 201 GITFGCGT---NNGGLFNSKTTGIVGLGGGDISLISQM 235
G+ FGC ++ +SK+TG++G+ G +S +SQ+
Sbjct: 136 GLVFGCMDSVFSSNSDEDSKSTGLMGMNRGSLSFVSQL 173
Score = 39.7 bits (91), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 36/140 (25%), Positives = 62/140 (44%), Gaps = 23/140 (16%)
Query: 253 VIDSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADP----TGSLELCY----SFNSL 300
++DSGT TFL S L+ SS++ + DP G+++LCY S L
Sbjct: 256 MVDSGTQFTFLLGPVYNALRSAFLNQTSSVLRV--LEDPDFVFQGAMDLCYLVPLSQRVL 313
Query: 301 SQVPEVTIHFRGADVKLSRSNFFVKV------SEDIVCSVFKG---ITNSVPIYGNIMQT 351
+P VT+ FRGA++ +S +V ++ + C F + + G+ Q
Sbjct: 314 PLLPTVTLVFRGAEMTVSGDRVLYRVPGELRGNDSVHCLSFGNSDLLGVEAYVIGHHHQQ 373
Query: 352 NFLVGYDIEQQTVSFKPTDC 371
N + +D+E+ + C
Sbjct: 374 NVWMEFDLEKSRIGLAQVRC 393
>gi|296084856|emb|CBI28265.3| unnamed protein product [Vitis vinifera]
Length = 446
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 94/363 (25%), Positives = 145/363 (39%), Gaps = 63/363 (17%)
Query: 50 PYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVAD 109
PY+ LR ++ SL R H + +S + Y I +S GTPP + D
Sbjct: 51 PYRNLRHLVSASLIRARHLKNPKTTPTSTTPLFTH--SYGAYSIPLSFGTPPQTLPLIMD 108
Query: 110 TGSDLIWTQCEPCPPSQCYM----------QDSPLFDPKMSSTYKSLPCSSSQCASLN-- 157
TGSDL+W C + Y+ S +F PK SS+ K L C + +C ++
Sbjct: 109 TGSDLVWFPC-----THRYVCRNCSFSTSNPSSNIFIPKSSSSSKVLGCVNPKCGWIHGS 163
Query: 158 ------------QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
+C+ + Y V YG G + G + +ET+ L G+ V P G
Sbjct: 164 KVQSRCRDCEPTSPNCTQICPPYLVFYGSG-ITGGIMLSETLDL---PGKGV--PNFIVG 217
Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQ 265
C L S+ GI G G G SL SQ+ L D S +
Sbjct: 218 CSV----LSTSQPAGISGFGRGPPSLPSQLGLKKFSYCLLSRRYDDTTESSSLIFELVAA 273
Query: 266 GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-GADVKLSRSNF 322
+ + S ++ +E L C++ + L+ PE+T+ FR GA+++L +N+
Sbjct: 274 EFEKQVQSKRATEVEG------ITGLRPCFNISGLNTPSFPELTLKFRGGAEMELPLANY 327
Query: 323 FVKV-SEDIVC----------SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+ +D+VC F G I GN Q NF V YD+ + + F+ C
Sbjct: 328 VAFLGGDDVVCLTIVTDGAAGKEFSG--GPAIILGNFQQQNFYVEYDLRNERLGFRQQSC 385
Query: 372 TKQ 374
Sbjct: 386 NPH 388
>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 634
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 60/180 (33%), Positives = 88/180 (48%), Gaps = 13/180 (7%)
Query: 58 LTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWT 117
L+ S L +S+ ++ D+IP Y RI IGTPP + DTGS L +
Sbjct: 60 LSHSRRHLQRSESHSTATARMPLYDDLIPY-GYYTTRIWIGTPPQTFALIVDTGSTLTYV 118
Query: 118 QCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSF 177
C C QC P F P SSTY+ L C S +C ++ ++C Y Y + S
Sbjct: 119 PCSTC--EQCGKHQDPNFQPDWSSTYQPLKC-SMECTCDSEM----MHCVYDRQYAEMSS 171
Query: 178 SNGNLATETVTLGSTTGQAVALPGIT-FGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
S+G L + V+ G Q+ P T FGC G +++ + GI+GLG GD+S++ Q+
Sbjct: 172 SSGVLGEDIVSFGK---QSELKPQRTVFGCENVETGDIYSQRADGIMGLGRGDLSIVDQL 228
>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
Length = 2819
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 52/158 (32%), Positives = 84/158 (53%), Gaps = 20/158 (12%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+N + +++G+PP + V DTGS+L W C+ P + +F+P SS+Y +
Sbjct: 996 HNVTLTVSLTVGSPPQQVTMVLDTGSELSWLHCKKSP------NLTSVFNPLSSSSYSPI 1049
Query: 147 PCSSSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
PCSS C + + + V C VSY D S GNLA++ +GS+ ALP
Sbjct: 1050 PCSSPICRTRTRDLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRIGSS-----ALP 1104
Query: 201 GITFGC---GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
G FGC G ++ ++KTTG++G+ G +S ++Q+
Sbjct: 1105 GTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQL 1142
>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
Length = 659
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 110/446 (24%), Positives = 176/446 (39%), Gaps = 98/446 (21%)
Query: 4 FLSCVFILFFLCFYVVSPIEAQTGGFSVELI---HRDSPKSPFYNSSETPYQRLRDALTR 60
+S + IL F+ Y S + G +I + SPKS + R A+
Sbjct: 8 LISAIVILSFVTIYSSSASQIPNRGVRRPMIFPLYFASPKSSGH----------RQAIEG 57
Query: 61 SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
S R + + +++ D + +N Y R+ IGTPP E + DTGS + + C
Sbjct: 58 SYWRRHLKSDPYHHPNARMRLYDDLLSNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCS 117
Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS-QCASLNQKSCSGVNCQYSVSYGDGSFSN 179
C C P F P SSTY + C+ C GVNC Y Y + S S+
Sbjct: 118 DC--EHCGKHQDPRFQPDESSTYHPVKCNMDCNCDH------DGVNCVYERRYAEMSSSS 169
Query: 180 GNLATETVTLGSTTGQAVALP-GITFGC-GTNNGGLFNSKTTGIVGLG------------ 225
G L + ++ G+ Q+ +P FGC G L++ + GI+GLG
Sbjct: 170 GVLGEDIISFGN---QSEVVPQRAVFGCENVETGDLYSQRADGIMGLGRGQLSIVDQLVD 226
Query: 226 ------------------GGDISL-----------------------ISQMRTTIAGNQ- 243
GG + L I +AG
Sbjct: 227 KNVINDSFSLCYGGMHVGGGAMVLGGIPPPPDMVFSRSDPYRSPYYNIELKEIHVAGKPL 286
Query: 244 RLGVSTPD----IVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS- 296
+L ST D V+DSGTT +LP+ + + +++ + + P + ++C+S
Sbjct: 287 KLSPSTFDRKHGTVLDSGTTYAYLPEEAFVAFRDAIIKKSHNLKQIHGPDPNYNDICFSG 346
Query: 297 ----FNSLSQV-PEVTIHF-RGADVKLSRSNFF---VKVSEDIVCSVFKGITNSVPIYGN 347
+ LS+ PEV + F G + L+ N+ KV +F+ +S + G
Sbjct: 347 AGRDVSQLSKAFPEVDMVFSNGQKLSLTPENYLFQHTKVHGAYCLGIFRN-GDSTTLLGG 405
Query: 348 IMQTNFLVGYDIEQQTVSFKPTDCTK 373
I+ N LV YD E + + F T+C++
Sbjct: 406 IIVRNTLVTYDRENEKIGFWKTNCSE 431
>gi|222616654|gb|EEE52786.1| hypothetical protein OsJ_35257 [Oryza sativa Japonica Group]
Length = 346
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 54/151 (35%), Positives = 77/151 (50%), Gaps = 17/151 (11%)
Query: 95 ISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP---LFDPKMSSTYKSLPCSSS 151
IS+GTPP L DTGS L W QC+ C +CY Q + +F+P SSTY + CS+
Sbjct: 3 ISLGTPPVFNLVTIDTGSTLSWVQCKNCQI-KCYDQAAKAGQIFNPYNSSTYSKVGCSTE 61
Query: 152 QCASLN-----QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
C ++ + C + C YS+ YG G +S G L + +TL S ++ F
Sbjct: 62 ACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLASNR----SIDNFIF 117
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
GCG +N L+N GI+G G S +Q+
Sbjct: 118 GCGEDN--LYNGVNAGIIGFGTKSYSFFNQV 146
>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 665
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 59/188 (31%), Positives = 93/188 (49%), Gaps = 25/188 (13%)
Query: 53 RLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGS 112
R+ D R L++ S + ++ D + +N Y R+ IGTPP E + DTGS
Sbjct: 49 RVEDFRRRRLHQ-------SQLPNAHMKLYDDLLSNGYYTTRLWIGTPPQEFALIVDTGS 101
Query: 113 DLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSY 172
+ + C C QC P F P++SS+YK+L C+ C ++ G C Y Y
Sbjct: 102 TVTYVPCSTC--KQCGKHQDPKFQPELSSSYKALKCNPD-CNCDDE----GKLCVYERRY 154
Query: 173 GDGSFSNGNLATETVTLGS----TTGQAVALPGITFGC-GTNNGGLFNSKTTGIVGLGGG 227
+ S S+G L+ + ++ G+ T +AV FGC G LF+ + GI+GLG G
Sbjct: 155 AEMSSSSGVLSEDLISFGNESQLTPQRAV------FGCENVETGDLFSQRADGIMGLGRG 208
Query: 228 DISLISQM 235
+S++ Q+
Sbjct: 209 KLSVVDQL 216
>gi|361068027|gb|AEW08325.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165459|gb|AFG65601.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165460|gb|AFG65602.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165461|gb|AFG65603.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165462|gb|AFG65604.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165463|gb|AFG65605.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165465|gb|AFG65607.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165466|gb|AFG65608.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165467|gb|AFG65609.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165468|gb|AFG65610.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165469|gb|AFG65611.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165472|gb|AFG65614.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165473|gb|AFG65615.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165474|gb|AFG65616.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165475|gb|AFG65617.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165476|gb|AFG65618.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
Length = 136
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 48/111 (43%), Positives = 65/111 (58%), Gaps = 1/111 (0%)
Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVT 188
Q +P++DP SSTY + C S C +L C S C+Y +YGD S + G L+ ET+T
Sbjct: 2 QPTPIYDPARSSTYSKVSCKSLLCNALPDFECKSTAGCEYQYTYGDFSITVGILSYETLT 61
Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
L S +G +P FGCG NN G + GIVGLG G +SLISQ+ ++
Sbjct: 62 LTSKSGAEQLIPNFAFGCGQNNEGNGFDQGAGIVGLGRGPLSLISQLSASM 112
>gi|51091919|dbj|BAD35188.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|125596474|gb|EAZ36254.1| hypothetical protein OsJ_20576 [Oryza sativa Japonica Group]
gi|196212950|gb|ACG76111.1| S5 [Oryza sativa Japonica Group]
gi|340810891|gb|AEK75372.1| S5 [Oryza sativa]
gi|340810893|gb|AEK75373.1| S5 [Oryza sativa]
gi|340810899|gb|AEK75376.1| S5 [Oryza sativa]
gi|340810901|gb|AEK75377.1| S5 [Oryza sativa]
gi|340810933|gb|AEK75393.1| S5 [Oryza sativa]
gi|340810947|gb|AEK75400.1| S5 [Oryza sativa]
gi|340810949|gb|AEK75401.1| S5 [Oryza sativa]
gi|340810967|gb|AEK75410.1| S5 [Oryza sativa]
gi|340810969|gb|AEK75411.1| S5 [Oryza sativa]
gi|340810999|gb|AEK75426.1| S5 [Oryza rufipogon]
gi|340811017|gb|AEK75435.1| S5 [Oryza rufipogon]
gi|340811029|gb|AEK75441.1| S5 [Oryza nivara]
gi|340811051|gb|AEK75452.1| S5 [Oryza nivara]
gi|340811075|gb|AEK75464.1| S5 [Oryza nivara]
gi|340811077|gb|AEK75465.1| S5 [Oryza rufipogon]
gi|340811085|gb|AEK75469.1| S5 [Oryza nivara]
gi|340811096|gb|AEK75474.1| S5 [Oryza rufipogon]
gi|340811100|gb|AEK75476.1| S5 [Oryza rufipogon]
gi|340811114|gb|AEK75483.1| S5 [Oryza nivara]
Length = 472
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 53/156 (33%), Positives = 83/156 (53%), Gaps = 23/156 (14%)
Query: 70 QNSSISSSKASQADIIP----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
Q I+SS +++ D+I N+ +L+ +S+G PP L DTGS L W QC+PC
Sbjct: 89 QEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AV 147
Query: 126 QCYMQDS---PLFDPKMSSTYKSLPCSSSQCASLN------QKSC--SGVNCQYSVSYGD 174
C+ Q + P+FDP S T + + CSS +C L Q +C +C YSV+YG+
Sbjct: 148 HCHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGN 207
Query: 175 G-SFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
G ++S G + T+T+ +G + + FGC +
Sbjct: 208 GWAYSVGKMVTDTLRIGDS------FMDLMFGCSMD 237
>gi|340810993|gb|AEK75423.1| S5 [Oryza rufipogon]
gi|340811015|gb|AEK75434.1| S5 [Oryza nivara]
gi|340811021|gb|AEK75437.1| S5 [Oryza nivara]
Length = 474
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 53/156 (33%), Positives = 83/156 (53%), Gaps = 23/156 (14%)
Query: 70 QNSSISSSKASQADIIP----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
Q I+SS +++ D+I N+ +L+ +S+G PP L DTGS L W QC+PC
Sbjct: 91 QEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AV 149
Query: 126 QCYMQDS---PLFDPKMSSTYKSLPCSSSQCASLN------QKSC--SGVNCQYSVSYGD 174
C+ Q + P+FDP S T + + CSS +C L Q +C +C YSV+YG+
Sbjct: 150 HCHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGN 209
Query: 175 G-SFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
G ++S G + T+T+ +G + + FGC +
Sbjct: 210 GWAYSVGKMVTDTLRIGDS------FMDLMFGCSMD 239
>gi|340810907|gb|AEK75380.1| S5 [Oryza sativa]
Length = 472
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 53/156 (33%), Positives = 83/156 (53%), Gaps = 23/156 (14%)
Query: 70 QNSSISSSKASQADIIP----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
Q I+SS +++ D+I N+ +L+ +S+G PP L DTGS L W QC+PC
Sbjct: 89 QEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AV 147
Query: 126 QCYMQDS---PLFDPKMSSTYKSLPCSSSQCASLN------QKSC--SGVNCQYSVSYGD 174
C+ Q + P+FDP S T + + CSS +C L Q +C +C YSV+YG+
Sbjct: 148 HCHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGN 207
Query: 175 G-SFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
G ++S G + T+T+ +G + + FGC +
Sbjct: 208 GWAYSVGKMVTDTLRIGDS------FMDLMFGCSMD 237
>gi|125554529|gb|EAZ00135.1| hypothetical protein OsI_22138 [Oryza sativa Indica Group]
Length = 472
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 53/156 (33%), Positives = 83/156 (53%), Gaps = 23/156 (14%)
Query: 70 QNSSISSSKASQADIIP----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
Q I+SS +++ D+I N+ +L+ +S+G PP L DTGS L W QC+PC
Sbjct: 89 QEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AV 147
Query: 126 QCYMQDS---PLFDPKMSSTYKSLPCSSSQCASLN------QKSC--SGVNCQYSVSYGD 174
C+ Q + P+FDP S T + + CSS +C L Q +C +C YSV+YG+
Sbjct: 148 HCHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGN 207
Query: 175 G-SFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
G ++S G + T+T+ +G + + FGC +
Sbjct: 208 GWAYSVGKMVTDTLRIGDS------FMDLMFGCSMD 237
>gi|383165464|gb|AFG65606.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165470|gb|AFG65612.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
Length = 136
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 48/111 (43%), Positives = 65/111 (58%), Gaps = 1/111 (0%)
Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVT 188
Q +P++DP SSTY + C S C +L C S C+Y +YGD S + G L+ ET+T
Sbjct: 2 QPTPIYDPARSSTYSKVSCKSLLCNALPDFECKSTAGCEYQYTYGDFSITVGILSYETLT 61
Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
L S +G +P FGCG NN G + GIVGLG G +SLISQ+ ++
Sbjct: 62 LTSKSGAEQLIPKFAFGCGQNNEGNGFDQGAGIVGLGRGPLSLISQLSASM 112
>gi|196212952|gb|ACG76112.1| S5 [Oryza sativa Indica Group]
gi|338809989|gb|AEJ08560.1| S5 [Oryza barthii]
gi|340810883|gb|AEK75368.1| S5 [Oryza sativa]
gi|340810885|gb|AEK75369.1| S5 [Oryza sativa]
gi|340810889|gb|AEK75371.1| S5 [Oryza sativa]
gi|340810895|gb|AEK75374.1| S5 [Oryza sativa]
gi|340810897|gb|AEK75375.1| S5 [Oryza sativa]
gi|340810905|gb|AEK75379.1| S5 [Oryza sativa]
gi|340810909|gb|AEK75381.1| S5 [Oryza sativa]
gi|340810911|gb|AEK75382.1| S5 [Oryza sativa]
gi|340810913|gb|AEK75383.1| S5 [Oryza sativa]
gi|340810923|gb|AEK75388.1| S5 [Oryza sativa]
gi|340810925|gb|AEK75389.1| S5 [Oryza sativa]
gi|340810929|gb|AEK75391.1| S5 [Oryza sativa]
gi|340810935|gb|AEK75394.1| S5 [Oryza sativa]
gi|340810937|gb|AEK75395.1| S5 [Oryza sativa]
gi|340810939|gb|AEK75396.1| S5 [Oryza sativa]
gi|340810941|gb|AEK75397.1| S5 [Oryza sativa]
gi|340810943|gb|AEK75398.1| S5 [Oryza sativa]
gi|340810951|gb|AEK75402.1| S5 [Oryza sativa]
gi|340810953|gb|AEK75403.1| S5 [Oryza sativa]
gi|340810963|gb|AEK75408.1| S5 [Oryza sativa]
gi|340810965|gb|AEK75409.1| S5 [Oryza sativa]
gi|340810973|gb|AEK75413.1| S5 [Oryza nivara]
gi|340811003|gb|AEK75428.1| S5 [Oryza rufipogon]
gi|340811005|gb|AEK75429.1| S5 [Oryza rufipogon]
gi|340811009|gb|AEK75431.1| S5 [Oryza rufipogon]
gi|340811023|gb|AEK75438.1| S5 [Oryza rufipogon]
gi|340811025|gb|AEK75439.1| S5 [Oryza nivara]
gi|340811031|gb|AEK75442.1| S5 [Oryza rufipogon]
gi|340811033|gb|AEK75443.1| S5 [Oryza rufipogon]
gi|340811035|gb|AEK75444.1| S5 [Oryza nivara]
gi|340811039|gb|AEK75446.1| S5 [Oryza rufipogon]
gi|340811049|gb|AEK75451.1| S5 [Oryza nivara]
gi|340811053|gb|AEK75453.1| S5 [Oryza rufipogon]
gi|340811055|gb|AEK75454.1| S5 [Oryza nivara]
gi|340811057|gb|AEK75455.1| S5 [Oryza rufipogon]
gi|340811059|gb|AEK75456.1| S5 [Oryza rufipogon]
gi|340811061|gb|AEK75457.1| S5 [Oryza rufipogon]
gi|340811065|gb|AEK75459.1| S5 [Oryza nivara]
gi|340811067|gb|AEK75460.1| S5 [Oryza nivara]
gi|340811069|gb|AEK75461.1| S5 [Oryza nivara]
gi|340811071|gb|AEK75462.1| S5 [Oryza rufipogon]
gi|340811081|gb|AEK75467.1| S5 [Oryza nivara]
gi|340811083|gb|AEK75468.1| S5 [Oryza nivara]
gi|340811087|gb|AEK75470.1| S5 [Oryza nivara]
gi|340811092|gb|AEK75472.1| S5 [Oryza nivara]
gi|340811102|gb|AEK75477.1| S5 [Oryza rufipogon]
gi|340811106|gb|AEK75479.1| S5 [Oryza rufipogon]
gi|340811108|gb|AEK75480.1| S5 [Oryza rufipogon]
gi|340811110|gb|AEK75481.1| S5 [Oryza rufipogon]
gi|340811112|gb|AEK75482.1| S5 [Oryza rufipogon]
gi|340811118|gb|AEK75485.1| S5 [Oryza nivara]
gi|340811120|gb|AEK75486.1| S5 [Oryza rufipogon]
Length = 472
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 53/156 (33%), Positives = 83/156 (53%), Gaps = 23/156 (14%)
Query: 70 QNSSISSSKASQADIIP----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
Q I+SS +++ D+I N+ +L+ +S+G PP L DTGS L W QC+PC
Sbjct: 89 QEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AV 147
Query: 126 QCYMQDS---PLFDPKMSSTYKSLPCSSSQCASLN------QKSC--SGVNCQYSVSYGD 174
C+ Q + P+FDP S T + + CSS +C L Q +C +C YSV+YG+
Sbjct: 148 HCHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGN 207
Query: 175 G-SFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
G ++S G + T+T+ +G + + FGC +
Sbjct: 208 GWAYSVGKMVTDTLRIGDS------FMDLMFGCSMD 237
>gi|340810931|gb|AEK75392.1| S5 [Oryza sativa]
gi|340810983|gb|AEK75418.1| S5 [Oryza nivara]
gi|340810985|gb|AEK75419.1| S5 [Oryza nivara]
gi|340810997|gb|AEK75425.1| S5 [Oryza nivara]
gi|340811011|gb|AEK75432.1| S5 [Oryza nivara]
gi|340811013|gb|AEK75433.1| S5 [Oryza nivara]
gi|340811041|gb|AEK75447.1| S5 [Oryza nivara]
gi|340811043|gb|AEK75448.1| S5 [Oryza nivara]
Length = 474
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 53/156 (33%), Positives = 83/156 (53%), Gaps = 23/156 (14%)
Query: 70 QNSSISSSKASQADIIP----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
Q I+SS +++ D+I N+ +L+ +S+G PP L DTGS L W QC+PC
Sbjct: 91 QEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AV 149
Query: 126 QCYMQDS---PLFDPKMSSTYKSLPCSSSQCASLN------QKSC--SGVNCQYSVSYGD 174
C+ Q + P+FDP S T + + CSS +C L Q +C +C YSV+YG+
Sbjct: 150 HCHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGN 209
Query: 175 G-SFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
G ++S G + T+T+ +G + + FGC +
Sbjct: 210 GWAYSVGKMVTDTLRIGDS------FMDLMFGCSMD 239
>gi|340810987|gb|AEK75420.1| S5 [Oryza rufipogon]
gi|340810989|gb|AEK75421.1| S5 [Oryza rufipogon]
gi|340810991|gb|AEK75422.1| S5 [Oryza rufipogon]
gi|340811001|gb|AEK75427.1| S5 [Oryza rufipogon]
gi|340811019|gb|AEK75436.1| S5 [Oryza rufipogon]
gi|340811104|gb|AEK75478.1| S5 [Oryza rufipogon]
gi|340811124|gb|AEK75488.1| S5 [Oryza rufipogon]
Length = 472
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 53/156 (33%), Positives = 83/156 (53%), Gaps = 23/156 (14%)
Query: 70 QNSSISSSKASQADIIP----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
Q I+SS +++ D+I N+ +L+ +S+G PP L DTGS L W QC+PC
Sbjct: 89 QEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AV 147
Query: 126 QCYMQDS---PLFDPKMSSTYKSLPCSSSQCASLN------QKSC--SGVNCQYSVSYGD 174
C+ Q + P+FDP S T + + CSS +C L Q +C +C YSV+YG+
Sbjct: 148 HCHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKENSCTYSVTYGN 207
Query: 175 G-SFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
G ++S G + T+T+ +G + + FGC +
Sbjct: 208 GWAYSVGKMVTDTLRIGDS------FMDLMFGCSMD 237
>gi|224082314|ref|XP_002306645.1| predicted protein [Populus trichocarpa]
gi|222856094|gb|EEE93641.1| predicted protein [Populus trichocarpa]
Length = 410
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 84/362 (23%), Positives = 145/362 (40%), Gaps = 88/362 (24%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y + ++IG PP DTGSDL W QC+ P C L+ PK + +PCS+
Sbjct: 54 YSVILNIGNPPKAFDFDIDTGSDLTWVQCD-APCKGCTKPRDKLYKPKNN----LVPCSN 108
Query: 151 SQCASL---NQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
S C ++ C + C Y + Y D S G L +++ L + G + P + FG
Sbjct: 109 SLCQAVSTGENYHCDAPDDQCDYEIEYADLGSSIGVLLSDSFPLRLSNGTLLQ-PKMAFG 167
Query: 206 CGTNNGGLFNS---KTTGIVGLGGGDISLISQMRT------------------------- 237
CG + L T GI+GLG G +S++SQ+RT
Sbjct: 168 CGYDQKHLGPHPPPDTAGILGLGRGKVSILSQLRTLGITQNVVGHCFSRARGGFLFFGDH 227
Query: 238 ---------------------------TIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSN 270
+ G + G+ ++ DSG++ T+ +
Sbjct: 228 LFPSSRITWTPMLRSSSDTLYSSGPAELLFGGKPTGIKGLQLIFDSGSSYTYFNAQVYQS 287
Query: 271 LLSVMSSMIEAQPVAD-PTGSLELCYS--------FNSLSQVPEVTIHFRGAD---VKLS 318
+L+++ + +P+ D P L +C+ + S +TI F A ++L+
Sbjct: 288 ILNLVRKDLAGKPLKDAPEKELAVCWKTAKPIKSILDIKSYFKPLTISFMNAKNVQLQLA 347
Query: 319 RSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
++ + + VC GI N + + G+I + +V YD E+Q + + P +C
Sbjct: 348 PEDYLIITKDGNVC---LGILNGSEQQLGNFNVIGDIFMQDRVVIYDNEKQQIGWFPANC 404
Query: 372 TK 373
+
Sbjct: 405 DR 406
>gi|225465837|ref|XP_002264626.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 1 [Vitis
vinifera]
Length = 437
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 60/190 (31%), Positives = 93/190 (48%), Gaps = 21/190 (11%)
Query: 23 EAQTGGFSVELIHRDSPKSPFYNSSETPYQR-LRDALTRSLNRLNHFNQNSSISSSK--- 78
E G +++++H SP SPF ++ + + RL SS+ + K
Sbjct: 31 ETPDQGSTLQVLHVYSPCSPFRPKEPLSWEESVLQMQAKDKARLQFL---SSLVARKSVV 87
Query: 79 --ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
AS I+ N Y++R IGTP L DT SD+ W C C S LF+
Sbjct: 88 PIASGRQIV-QNPTYIVRAKIGTPAQTMLMAMDTSSDVAWIPCN-----GCLGCSSTLFN 141
Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
S+TYKSL C ++QC + + +C G C ++++YG S + NL+ +T+TL +
Sbjct: 142 SPASTTYKSLGCQAAQCKQVPKPTCGGGVCSFNLTYGGSSLA-ANLSQDTITLATD---- 196
Query: 197 VALPGITFGC 206
A+PG +FGC
Sbjct: 197 -AVPGYSFGC 205
>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
Length = 444
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 95/385 (24%), Positives = 151/385 (39%), Gaps = 110/385 (28%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCE------PCPPSQCYMQDSPLFDPKMS 140
+N + + +++GTPP V DTGS+L W C + M +S F P+ S
Sbjct: 59 HNVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCATGRQGSAAAGAAAAMGES--FRPRAS 116
Query: 141 STYKSLPCSSSQCASLN---QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
+T+ ++PC S+QC+S + SC G + C S+SY DGS S+G LAT+ +G
Sbjct: 117 ATFAAVPCGSTQCSSRDLPAPPSCDGASRQCHVSLSYADGSASDGALATDVFAVGEAPPL 176
Query: 196 AVALPGITFGC------------------GTNNGGLF---------------NSKTTGIV 222
A FGC G N G L + G++
Sbjct: 177 RSA-----FGCMSTAYDSSPDGVATAGLLGMNRGTLSFVTQASTRRFSYCISDRDDAGVL 231
Query: 223 GLGGGDISLIS-----------------------QMRTTIAGNQRL----GVSTPD---- 251
LG D+ + Q+ G + L V PD
Sbjct: 232 LLGHSDLPFLPLNYTPLYQPTLPLPYFDRVAYSVQLLGIRVGGKALPIPASVLAPDHTGA 291
Query: 252 --IVIDSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADPT----GSLELCYSFNS-- 299
++DSGT TFL + L ++ A + DP+ +L+ C+ +
Sbjct: 292 GQTMVDSGTQFTFLLGDAYSALKAEFLKQTKPLLRA--LDDPSFAFQEALDTCFRVPAGR 349
Query: 300 ---LSQVPEVTIHFRGADVKLSRSNFFVKV------SEDIVCSVFKGITNSVP----IYG 346
+++P VT+ F GA++ ++ KV ++ + C F G + VP + G
Sbjct: 350 PPPSARLPPVTLLFNGAEMSVAGDRLLYKVPGEHRGADGVWCLTF-GNADMVPLTAYVIG 408
Query: 347 NIMQTNFLVGYDIEQQTVSFKPTDC 371
+ Q N V YD+E+ V P C
Sbjct: 409 HHHQMNLWVEYDLERGRVGLAPVKC 433
>gi|222616728|gb|EEE52860.1| hypothetical protein OsJ_35411 [Oryza sativa Japonica Group]
Length = 395
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 83/304 (27%), Positives = 117/304 (38%), Gaps = 73/304 (24%)
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMS 140
D+ P+ Y + +SIG PP DTGSDL W QC+ PC C PL+ P +
Sbjct: 50 GDVYPHGL-YYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPC--VSCSKVPHPLYRPTKN 106
Query: 141 STYKSLPCSSSQCASLN-----QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTT 193
K +PC CA+L+ + C C Y + Y D S G L T++ L
Sbjct: 107 ---KLVPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFAL-RLA 162
Query: 194 GQAVALPGITFGCGTNN---GGLFNSKTTGIVGLGGGDISLISQM--------------- 235
++ PG+ FGCG + S T G++GLG G +SL+SQ+
Sbjct: 163 NSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLS 222
Query: 236 -----------------RTTIA--------------------GNQRLGVSTPDIVIDSGT 258
R T A G + LGV ++V DSG+
Sbjct: 223 TRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSGS 282
Query: 259 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS-LSQVPEVTIHFRGADVKL 317
+ T+ L+ + + P SL LC+ V +V FR V L
Sbjct: 283 SFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPLCWKGKKPFKSVLDVKKEFR--TVVL 340
Query: 318 SRSN 321
S SN
Sbjct: 341 SFSN 344
>gi|168022164|ref|XP_001763610.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685103|gb|EDQ71500.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 308
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 75/252 (29%), Positives = 116/252 (46%), Gaps = 39/252 (15%)
Query: 51 YQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADT 110
Y LR R L R+ +S + DI Y RIS+GTPP + DT
Sbjct: 6 YHTLRKHDQRRLRRM----LPEVVSFPISGDNDIFAMGL-YYTRISLGTPPQQFYVDVDT 60
Query: 111 GSDLIWTQCEPCPPSQCYMQDSPL----FDPKMSSTYKSLPCSSSQCASLNQK-SCS--G 163
GS++ W +C PC + + D P+ FDP+ S+T S+ C+ ++C LN+K CS
Sbjct: 61 GSNVAWVKCAPCTGCE-HSGDVPVPMSTFDPRKSTTKISISCTDAECGVLNKKLQCSPER 119
Query: 164 VNCQYSVSYGDGSFSNGNLATETVTLGST-TGQAVALPG---ITFGCGTNNGGLFNSKTT 219
++C YS+ YGDGS + G + T + + A G + FGCG G ++
Sbjct: 120 LSCPYSLLYGDGSSTAGYYLNDVFTFNQVPSDNSTAKSGTARLVFGCGGTQTGSWS--VD 177
Query: 220 GIVGLGGGDISLISQ--------------MRTTIAGNQRLGVST---PDIVIDSGTTLTF 262
G++G G +SL +Q ++ ++G L + T PD+V T + F
Sbjct: 178 GLLGFGPTTVSLPNQLAQQNISVNIFAHCLQGDVSGRGSLVIGTIREPDLVY---TPMVF 234
Query: 263 LPQGYNSNLLSV 274
YN LL++
Sbjct: 235 GEDHYNVQLLNI 246
>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 640
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 53/180 (29%), Positives = 85/180 (47%), Gaps = 18/180 (10%)
Query: 65 LNHFNQNSSISSSKASQA--------DIIPNNANYLIRISIGTPPTERLAVADTGSDLIW 116
L+HFN + S++ D + N Y R+ IGTPP + DTGS + +
Sbjct: 59 LSHFNPRRHLQGSQSEHHPNARMRLFDDLLRNGYYTTRLWIGTPPQRFALIVDTGSTVTY 118
Query: 117 TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGS 176
C C C P F P+ S TY+ + C + QC + + C Y Y + S
Sbjct: 119 VPCSTC--KHCGSHQDPKFRPEASETYQPVKC-TWQCNCDDDRK----QCTYERRYAEMS 171
Query: 177 FSNGNLATETVTLGSTTGQAVALPGITFGCGTNN-GGLFNSKTTGIVGLGGGDISLISQM 235
S+G L + V+ G+ + ++ FGC + G ++N + GI+GLG GD+S++ Q+
Sbjct: 172 TSSGVLGEDVVSFGNQS--ELSPQRAIFGCENDETGDIYNQRADGIMGLGRGDLSIMDQL 229
Score = 38.9 bits (89), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 36/133 (27%), Positives = 61/133 (45%), Gaps = 12/133 (9%)
Query: 253 VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS-----FNSLSQ-VP 304
V+DSGTT +LP+ + + ++M + ++ P ++C+S + LS+ P
Sbjct: 305 VLDSGTTYAYLPESAFLAFKHAIMKETHSLKRISGPDPHYNDICFSGAEINVSQLSKSFP 364
Query: 305 EVTIHF-RGADVKLSRSNFFV---KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 360
V + F G + LS N+ KV VF + + G I+ N LV YD E
Sbjct: 365 VVEMVFGNGHKLSLSPENYLFRHSKVRGAYCLGVFSNGNDPTTLLGGIVVRNTLVMYDRE 424
Query: 361 QQTVSFKPTDCTK 373
+ F T+C++
Sbjct: 425 HSKIGFWKTNCSE 437
>gi|340811122|gb|AEK75487.1| S5 [Oryza rufipogon]
Length = 277
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 53/156 (33%), Positives = 83/156 (53%), Gaps = 23/156 (14%)
Query: 70 QNSSISSSKASQADIIP----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
Q I+SS +++ D+I N+ +L+ +S+G PP L DTGS L W QC+PC
Sbjct: 89 QEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AV 147
Query: 126 QCYMQDS---PLFDPKMSSTYKSLPCSSSQCASLN------QKSC--SGVNCQYSVSYGD 174
C+ Q + P+FDP S T + + CSS +C L Q +C +C YSV+YG+
Sbjct: 148 HCHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGN 207
Query: 175 G-SFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
G ++S G + T+T+ +G + + FGC +
Sbjct: 208 GWAYSVGKMVTDTLRIGDS------FMDLMFGCSMD 237
>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 64/196 (32%), Positives = 96/196 (48%), Gaps = 29/196 (14%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+N + + +++GTPP V DTGS+L W C P + S F P+ SST+ ++
Sbjct: 81 HNVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCAPAGARNKFSAMS--FRPRASSTFAAV 138
Query: 147 PCSSSQCASLNQKS---CSGV--NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
PC+S+QC S + S C G C S+SY DGS S+G LAT+ +GS A
Sbjct: 139 PCASAQCRSRDLPSPPACDGASSRCSVSLSYADGSSSDGALATDVFAVGSGPPLRAA--- 195
Query: 202 ITFGCGTNNGGLFNS-----KTTGIVGLGGGDISLISQMRTT-----IAGNQRLGVSTPD 251
FGC ++ F+S + G++G+ G +S +SQ T I+ GV
Sbjct: 196 --FGCMSSA---FDSSPDGVASAGLLGMNRGALSFVSQASTRRFSYCISDRDDAGV---- 246
Query: 252 IVIDSGTTLTFLPQGY 267
+++ TFLP Y
Sbjct: 247 LLLGHSDLPTFLPLNY 262
>gi|118486912|gb|ABK95290.1| unknown [Populus trichocarpa]
Length = 438
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 74/249 (29%), Positives = 108/249 (43%), Gaps = 25/249 (10%)
Query: 2 ATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRS 61
ATF + L F V P Q+ + +I S SPF + + + +T +
Sbjct: 8 ATFF--LVALLFSTTKAVDPCATQSDTSDLSVIPIYSKCSPFVPPKQESW--VNTVITMA 63
Query: 62 LNRLNHFNQNSSISSSKASQADIIPNN-----ANYLIRISIGTPPTERLAVADTGSDLIW 116
S+++ K + I P ANY++R+ +GTP + V DT +D W
Sbjct: 64 SKDPERLKYLSTLADQKTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSNDAAW 123
Query: 117 TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSVSYG 173
C S C S F P S+T SL CS +QC+ + SC C ++ SYG
Sbjct: 124 VPC-----SGCTGFSSTTFLPNASTTLGSLDCSGAQCSQVRGFSCPATGSSACLFNQSYG 178
Query: 174 DGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLI 232
S L + +TL + +PG TFGC +GG + G++GLG G ISLI
Sbjct: 179 GDSSLTATLVQDAITLANDV-----IPGFTFGCINAVSGGSIPPQ--GLLGLGRGPISLI 231
Query: 233 SQMRTTIAG 241
SQ +G
Sbjct: 232 SQAGAMYSG 240
>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
Length = 504
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 53/162 (32%), Positives = 78/162 (48%), Gaps = 15/162 (9%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y R+ +G+PP E DTGSD++W C P CP S F+P SST +P
Sbjct: 91 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 150
Query: 148 CSSSQCASLNQKS---CSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVA 198
CS +C + Q S C + C Y+ +YGDGS ++G ++T+ S G A +
Sbjct: 151 CSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDSVMGNEQTANS 210
Query: 199 LPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRT 237
I FGC + G + GI G G +S++SQ+ +
Sbjct: 211 SASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNS 252
>gi|125542690|gb|EAY88829.1| hypothetical protein OsI_10302 [Oryza sativa Indica Group]
Length = 440
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 68/219 (31%), Positives = 98/219 (44%), Gaps = 31/219 (14%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN 90
+ + H P SP S R DA L L+ + +SS+ + P+
Sbjct: 27 LSVYHNVHPSSPSPLESIIALARDDDA---RLLFLSSKAATAGVSSAPVASGQAPPS--- 80
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y++R +G+P + L DT +D W C PC C S LF P SS+Y SLPCSS
Sbjct: 81 YVVRAGLGSPSQQLLLALDTSADATWAHCSPC--GTC--PSSSLFAPANSSSYASLPCSS 136
Query: 151 SQCASLNQKSCSGVN--------------CQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
S C ++C C +S + D SF LA++T+ LG
Sbjct: 137 SWCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASF-QAALASDTLRLGKD---- 191
Query: 197 VALPGITFGCGTN-NGGLFNSKTTGIVGLGGGDISLISQ 234
A+P TFGC ++ G N G++GLG G ++L+SQ
Sbjct: 192 -AIPNYTFGCVSSVTGPTTNMPRQGLLGLGRGPMALLSQ 229
>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 394
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 54/168 (32%), Positives = 86/168 (51%), Gaps = 12/168 (7%)
Query: 70 QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM 129
Q S+ +++ D + N Y RI IGTPP + DTGS + + C C QC
Sbjct: 69 QGSARPNARMRLYDDLLLNGYYTTRIWIGTPPQTFALIVDTGSTVTYVPCSTC--EQCGR 126
Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTL 189
P F+P++SSTY+ + C+ C N++ C Y Y + S S+G L + ++
Sbjct: 127 HQDPKFEPELSSTYQPVSCNID-CTCDNERK----QCVYERQYAEMSSSSGVLGEDIISF 181
Query: 190 GSTTGQAVALP-GITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
G+ Q+ +P FGC G L++ + GI+GLG GD+S++ Q+
Sbjct: 182 GN---QSELVPQRAIFGCENQETGDLYSQRADGIMGLGRGDLSIVDQL 226
>gi|293333354|ref|NP_001169607.1| uncharacterized protein LOC100383488 [Zea mays]
gi|224030351|gb|ACN34251.1| unknown [Zea mays]
Length = 342
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 45/121 (37%), Positives = 68/121 (56%), Gaps = 10/121 (8%)
Query: 118 QCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSVSYGD 174
QC+PC CY Q P+F+PK+SS+Y +PC+S CA L+ C + CQY+ Y
Sbjct: 2 QCQPC--VSCYRQLDPVFNPKLSSSYAVVPCTSDTCAQLDGHRCHEDDDGACQYTYKYSG 59
Query: 175 GSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
+ G LA + + +G AV FGC ++ G ++ +G+VGLG G +SL+SQ
Sbjct: 60 HGVTKGTLAIDKLAIGGDVFHAV-----VFGCSDSSVGGPAAQASGLVGLGRGPLSLVSQ 114
Query: 235 M 235
+
Sbjct: 115 L 115
>gi|21618176|gb|AAM67226.1| unknown [Arabidopsis thaliana]
Length = 430
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/153 (35%), Positives = 75/153 (49%), Gaps = 22/153 (14%)
Query: 92 LIRISIGTPPTERLAVADTGSDLIWTQC--EPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
+I + IGTPP + V DTGS L W QC + PP + FDP +SS++ +LPCS
Sbjct: 73 IISLPIGTPPQAQQMVLDTGSQLSWIQCHRKKLPP-----KPKTSFDPSLSSSFSTLPCS 127
Query: 150 SSQCA------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
C +L S C YS Y DG+F+ GNL E +T +T P +
Sbjct: 128 HPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTE----ITPPLI 183
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
GC T +S GI+G+ G +S +SQ +
Sbjct: 184 LGCATE-----SSDDRGILGMNRGRLSFVSQAK 211
>gi|18408451|ref|NP_564867.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322615|gb|AAG51309.1|AC026480_16 unknown protein [Arabidopsis thaliana]
gi|14334808|gb|AAK59582.1| unknown protein [Arabidopsis thaliana]
gi|15293195|gb|AAK93708.1| unknown protein [Arabidopsis thaliana]
gi|332196351|gb|AEE34472.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 430
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/153 (35%), Positives = 75/153 (49%), Gaps = 22/153 (14%)
Query: 92 LIRISIGTPPTERLAVADTGSDLIWTQC--EPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
+I + IGTPP + V DTGS L W QC + PP + FDP +SS++ +LPCS
Sbjct: 73 IISLPIGTPPQAQQMVLDTGSQLSWIQCHRKKLPP-----KPKTSFDPSLSSSFSTLPCS 127
Query: 150 SSQCA------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
C +L S C YS Y DG+F+ GNL E +T +T P +
Sbjct: 128 HPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTE----ITPPLI 183
Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
GC T +S GI+G+ G +S +SQ +
Sbjct: 184 LGCATE-----SSDDRGILGMNRGRLSFVSQAK 211
>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
melo]
Length = 412
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 52/158 (32%), Positives = 84/158 (53%), Gaps = 20/158 (12%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+N + +++G+PP + V DTGS+L W C+ P + +F+P SS+Y +
Sbjct: 36 HNVTLTVSLTVGSPPQQVTMVLDTGSELSWLHCKKSP------NLTSVFNPLSSSSYSPI 89
Query: 147 PCSSSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
PCSS C + + + V C VSY D S GNLA++ +GS+ ALP
Sbjct: 90 PCSSPVCRTRTRDLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRIGSS-----ALP 144
Query: 201 GITFGC---GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
G FGC G ++ ++KTTG++G+ G +S ++Q+
Sbjct: 145 GTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQL 182
>gi|413923781|gb|AFW63713.1| hypothetical protein ZEAMMB73_300584, partial [Zea mays]
Length = 190
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 67/186 (36%), Positives = 95/186 (51%), Gaps = 11/186 (5%)
Query: 16 FYVVSPIEAQTGGFSVELIHRDSPKSPFYNSS-ETPYQRLRDALTRSLNRLNHFNQNSSI 74
F +V+ + + G +V L HR P SP + T +RLR R+ F+ I
Sbjct: 2 FGLVASVTPPSTGVTVPLHHRYDPCSPVPSKKVPTLEERLRRDQLRAAYIKRKFSGAGDI 61
Query: 75 SSSKASQADIIPNNA----NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
S A+ + Y+I + IG+P + DTGSD+ W QC+PC SQC+ +
Sbjct: 62 EQSDAATVPTTLGTSLSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPC--SQCHSE 119
Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASLNQKS----CSGVNCQYSVSYGDGSFSNGNLATET 186
LFDP SSTY CSS+ CA L+Q C CQY V+YGD S + G +++T
Sbjct: 120 VDSLFDPSSSSTYSPFSCSSAPCAQLSQSQEGNGCMSSQCQYIVNYGDSSSTTGTYSSDT 179
Query: 187 VTLGST 192
+TLGS+
Sbjct: 180 LTLGSS 185
>gi|359482097|ref|XP_002271077.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 64/187 (34%), Positives = 90/187 (48%), Gaps = 42/187 (22%)
Query: 60 RSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQC 119
RS N+L HF+ N S++ + +++GTPP V DTGS+L W +C
Sbjct: 72 RSPNKL-HFHHNVSLT-----------------VSLTVGTPPQNVSMVLDTGSELSWLRC 113
Query: 120 EPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK-----SC-SGVNCQYSVSYG 173
Q FDP SS+Y +PCSS C + SC S C +SY
Sbjct: 114 NKTQTFQT------TFDPNRSSSYSPVPCSSLTCTDRTRDFPIPASCDSNQLCHAILSYA 167
Query: 174 DGSFSNGNLATETVTLGSTTGQAVALPGITFGC-----GTNNGGLFNSKTTGIVGLGGGD 228
D S S GNLA++T +G++ +PG FGC TN +SK TG++G+ G
Sbjct: 168 DASSSEGNLASDTFYIGNSD-----MPGTIFGCMDSSFSTNTEE--DSKNTGLMGMNRGS 220
Query: 229 ISLISQM 235
+S +SQM
Sbjct: 221 LSFVSQM 227
>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
Length = 631
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 93/362 (25%), Positives = 148/362 (40%), Gaps = 80/362 (22%)
Query: 83 DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
D + N Y R+ IGTP E + D+GS + + C C QC P F P +SST
Sbjct: 83 DDLLTNGYYTTRLYIGTPSQEFALIVDSGSTVTYVPCATC--EQCGNHQDPRFQPDLSST 140
Query: 143 YKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
Y + C+ C N++S C Y Y + S S+G L + ++ G + +
Sbjct: 141 YSPVKCNVD-CTCDNERS----QCTYERQYAEMSSSSGVLGEDIMSFGKES--ELKPQRA 193
Query: 203 TFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTIA-------GNQRLGVST--- 249
FGC T G LF+ GI+GLG G +S++ Q+ + I+ G +G T
Sbjct: 194 VFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGTMVL 253
Query: 250 ------PDIVID-------------------SGTTLTFLPQGYNSNLLSVMSS------- 277
PD+V +G L P+ +NS +V+ S
Sbjct: 254 GGMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTVLDSGTTYAYL 313
Query: 278 -----MIEAQPVADPTGSL-----------ELCYS-----FNSLSQV-PEVTIHF-RGAD 314
+ V + SL ++C++ + LS+V P+V + F G
Sbjct: 314 PEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVDMVFGNGQK 373
Query: 315 VKLSRSNFFVKVS--EDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+ LS N+ + S E C VF+ + + G I+ N LV YD + + F T+C
Sbjct: 374 LSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNC 433
Query: 372 TK 373
++
Sbjct: 434 SE 435
>gi|61213864|sp|P69477.1|NEP2_NEPDI RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
Full=Nepenthesin-II
Length = 178
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 66/216 (30%), Positives = 93/216 (43%), Gaps = 69/216 (31%)
Query: 113 DLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSY 172
DLIWTQCEPC +QC+ QD SS++ +LPC S C L ++C +CQY+ Y
Sbjct: 20 DLIWTQCEPC--TQCFSQD--------SSSFSTLPCESQYCQDLPSETC---DCQYTYGY 66
Query: 173 GDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLI 232
GDGS + G +A E ++P I FGCG N
Sbjct: 67 GDGSSTQGYMAXE---------DGSSVPNIAFGCGDN----------------------- 94
Query: 233 SQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE 292
+ IDSGTTLT+LPQ + + + I V + + L
Sbjct: 95 -------------------LQIDSGTTLTYLPQDAYNAVAQAFTDQINLPTVDESSSGLS 135
Query: 293 LCY---SFNSLSQVPEVTIHFRGA--DVKLSRSNFF 323
C+ S S QVPE+++ G D++ +FF
Sbjct: 136 TCFQEPSDGSTVQVPEISMQDGGVLNDLQNLAVSFF 171
>gi|115451209|ref|NP_001049205.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|49532749|dbj|BAD26705.1| Radc1 [Oryza sativa Japonica Group]
gi|108706569|gb|ABF94364.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113547676|dbj|BAF11119.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|215692805|dbj|BAG88249.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767626|dbj|BAG99854.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 438
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 68/217 (31%), Positives = 97/217 (44%), Gaps = 31/217 (14%)
Query: 33 LIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYL 92
+ H P SP S R DA L L+ + +SS+ + P+ Y+
Sbjct: 27 VYHNVHPSSPSPLESIIALARDDDA---RLLFLSSKAATAGVSSAPVASGQAPPS---YV 80
Query: 93 IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
+R +G+P + L DT +D W C PC C S LF P SS+Y SLPCSSS
Sbjct: 81 VRAGLGSPSQQLLLALDTSADATWAHCSPC--GTC--PSSSLFAPANSSSYASLPCSSSW 136
Query: 153 CASLNQKSCSGVN--------------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
C ++C C +S + D SF LA++T+ LG A
Sbjct: 137 CPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASF-QAALASDTLRLGKD-----A 190
Query: 199 LPGITFGCGTN-NGGLFNSKTTGIVGLGGGDISLISQ 234
+P TFGC ++ G N G++GLG G ++L+SQ
Sbjct: 191 IPNYTFGCVSSVTGPTTNMPRQGLLGLGRGPMALLSQ 227
>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 93/369 (25%), Positives = 149/369 (40%), Gaps = 80/369 (21%)
Query: 76 SSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
S++ D + N Y R+ IGTPP E + D+GS + + C C QC P F
Sbjct: 73 SARMRLHDDLLTNGYYTTRLHIGTPPQEFALIVDSGSTVTYVPCASC--EQCGNHQDPRF 130
Query: 136 DPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
P +SSTY + C+ C + K+ C Y Y + S S+G L + V+ G+ +
Sbjct: 131 QPDLSSTYSPVKCNVD-CTCDSDKN----QCTYERQYAEMSSSSGVLGEDIVSFGTES-- 183
Query: 196 AVALPGITFGC-GTNNGGLFNSKTTGIVGL------------------------------ 224
+ FGC + G LF+ GI+GL
Sbjct: 184 ELKPQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDI 243
Query: 225 GGGDISL-----------------------ISQMRTTIAG-----NQRLGVSTPDIVIDS 256
GGG + L I +AG + R+ V+DS
Sbjct: 244 GGGAMVLGAMPAPPGMIYTHSNAVRSPYYNIELKEMHVAGKALRVDPRIFDGKHGTVLDS 303
Query: 257 GTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS-----FNSLSQV-PEVTI 308
GTT +LP Q + + +V S + + + P + ++C++ + LS+V P+V +
Sbjct: 304 GTTYAYLPEQAFVAFKDAVSSQVHPLKKIRGPDSNYKDICFAGAGRNVSQLSEVFPKVDM 363
Query: 309 HF-RGADVKLSRSNFFVKVS--EDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 364
F G + LS N+ + S E C VF+ + + G I+ N LV YD + +
Sbjct: 364 VFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKI 423
Query: 365 SFKPTDCTK 373
F T+C++
Sbjct: 424 GFWKTNCSE 432
>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 92/363 (25%), Positives = 146/363 (40%), Gaps = 85/363 (23%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL----FDPKMSSTYKSL 146
Y +I +GTPP DTGSD+ W C PC Q + +DP SST +L
Sbjct: 37 YYTKIYLGTPPVGYYVQVDTGSDVTWLNCAPCTSCVTETQLPSIKLTTYDPSRSSTDGAL 96
Query: 147 PCSSSQCASL---NQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTG--QAVALP 200
C S C + N+ SC+ C YS +YGDGS + G + +T Q
Sbjct: 97 SCRDSNCGAALGSNEVSCTSAGYCAYSTTYGDGSSTQGYFIQDVMTFQEIHNNTQVNGTA 156
Query: 201 GITFGCGTNNGG--LFNSKT-TGIVGLGGGDISLISQMRT-------------------- 237
+ FGCGT G L +S+ G++G G +S+ SQ+ +
Sbjct: 157 SVYFGCGTTQSGNLLMSSRALDGLIGFGQAAVSIPSQLASMGKVGNRFAHCLQGDNQGGG 216
Query: 238 ----------------------------TIAGNQRLGVSTP-----------DIVIDSGT 258
IA N R V+TP +++DSGT
Sbjct: 217 TIVIGSVSEPNISYTPIVSRNHYAVGMQNIAVNGR-NVTTPASFDTTSTSAGGVIMDSGT 275
Query: 259 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHF-RGADVKL 317
TL +L + ++ +S+ E+ + + L+L + + + P V + F GA + L
Sbjct: 276 TLAYLVDPAYTQFVNAVST-FESSMFSSHSQCLQLAWC-SLQADFPTVKLFFDAGAVMNL 333
Query: 318 SRSNFF----VKVSEDIVCSVFKGITN-----SVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
+ N+ ++ + C ++ T S I G+I+ + LV YD + + V +K
Sbjct: 334 TPRNYLYSQPLQNGQAAYCMGWQKSTTKAGYLSYSILGDIVLKDHLVVYDNDNRVVGWKS 393
Query: 369 TDC 371
DC
Sbjct: 394 FDC 396
>gi|297798582|ref|XP_002867175.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
lyrata]
gi|297313011|gb|EFH43434.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 85/380 (22%), Positives = 147/380 (38%), Gaps = 89/380 (23%)
Query: 77 SKASQADIIPNNAN------YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
++A + + P + N Y + I+IG PP DTGSDL W QC+ P +C
Sbjct: 40 TRAVSSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCD-APCVRCLEA 98
Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASL----NQKSCSGVNCQYSVSYGDGSFSNGNLATET 186
PL+ P + +PC+ C +L NQ+ + C Y V Y DG S G L +
Sbjct: 99 PHPLYQP----SSDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDV 154
Query: 187 VTLGSTTGQAVALPGITFGCGTNN--GGLFNSKTTGIVGLGGGDISLISQMRTT------ 238
++ T G + P + GCG + G + G++GLG G +S++SQ+ +
Sbjct: 155 FSMNYTKGLRLT-PRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNV 213
Query: 239 ----------------------------------------------IAGNQRLGVSTPDI 252
+ G + G+
Sbjct: 214 IGHCLSSLGGGILFFGDDLYDSSRVSWTPMSREYSKHYSPAMGGELLFGGRTTGLKNLLT 273
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV--ADPTGSLELCYSFNS-LSQVPEVTIH 309
V DSG++ T+ + ++ + +P+ A +L LC+ + EV +
Sbjct: 274 VFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKY 333
Query: 310 FRGADVKL-----SRSNFFVKVSEDIVCS----VFKGITNSVPI-------YGNIMQTNF 353
F+ + S++ F + ++ S V GI N I G+I +
Sbjct: 334 FKPLALSFKTGWRSKTLFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQDQ 393
Query: 354 LVGYDIEQQTVSFKPTDCTK 373
++ YD E+Q++ + P DC +
Sbjct: 394 MIIYDNEKQSIGWMPADCDE 413
>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 504
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 52/162 (32%), Positives = 78/162 (48%), Gaps = 15/162 (9%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y R+ +G+PP E DTGSD++W C P CP S F+P SST +P
Sbjct: 91 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 150
Query: 148 CSSSQCASLNQKS---CSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVA 198
CS +C + Q S C + C Y+ +YGDGS ++G ++T+ + G A +
Sbjct: 151 CSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTANS 210
Query: 199 LPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRT 237
I FGC + G + GI G G +S++SQ+ +
Sbjct: 211 SASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNS 252
>gi|21805926|gb|AAM76716.1| nucellin-like aspartic protease [Zea mays]
Length = 357
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 87/355 (24%), Positives = 145/355 (40%), Gaps = 86/355 (24%)
Query: 97 IGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL 156
IG P DTGSDL W QC+ P C PL+ P + + +PC+++ C +L
Sbjct: 1 IGNPAKPYFLDVDTGSDLTWLQCD-APCRSCNKVPHPLYRP---TANRLVPCANALCTAL 56
Query: 157 ------NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN- 209
N K S C Y + Y D + S G L ++ +L + PG+TFGCG +
Sbjct: 57 HSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMRSSN--IRPGLTFGCGYDQ 114
Query: 210 ---NGGLFNSKTTGIVGLGGGDISLISQMR------------------------------ 236
G + G++GLG G +SL+SQ++
Sbjct: 115 QVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLSTNGGGFLFFGDDVVPS 174
Query: 237 ---TTIAGNQR-------------------LGVSTPDIVIDSGTTLT-FLPQGYNSNLLS 273
T + QR LGV ++V DSG+T T F Q Y + + +
Sbjct: 175 SRVTWVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEVVFDSGSTYTYFTAQPYQAVVSA 234
Query: 274 VMSSMIEA-QPVADPTGSLELCYS--------FNSLSQVPEVTIHF---RGADVKLSRSN 321
+ + ++ + V+DPT L LC+ F+ ++ + + F + A +++ N
Sbjct: 235 LKGGLSKSLKQVSDPT--LPLCWKGQKAFKSVFDVKNEFKSMFLSFASAKNAAMEIPPEN 292
Query: 322 FFVKVSEDIVC-SVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
+ + VC + G S + G+I + +V YD E+ + + CT+
Sbjct: 293 YLIVTKNGNVCLGILDGTAAKLSFNVIGDITMQDQMVIYDNEKSQLGWARGACTR 347
>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 105/440 (23%), Positives = 163/440 (37%), Gaps = 119/440 (27%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQ--------- 81
+EL+HR + + ++ + R R NQ + S+ S+
Sbjct: 35 LELVHRHHERFAGGGGDVDRVEAVKGFVKRDKLRRQRMNQRWGVVSNYDSRRKGFEMTTT 94
Query: 82 -ADI-IPNNA-------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
A++ +P ++ Y + +G+P V DTGS+ W C
Sbjct: 95 PAEVEMPMHSGRDDALGEYFAEVKVGSPGQRFWLVVDTGSEFTWLNC------------- 141
Query: 133 PLFDPKMSSTYKSLPCSSSQCA-------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATE 185
S +++++ C+S +C SL+ C Y +SY DGS + G T+
Sbjct: 142 -------SKSFEAVTCASRKCKVDLSELFSLSVCPKPSDPCLYDISYADGSSAKGFFGTD 194
Query: 186 TVTLGSTTGQAVALPGITFGCGTN--NGGLFNSKTTGIVGLGGGDISLISQMRT------ 237
++T+G T G+ L +T GC + NG FN +T GI+GLG S I +
Sbjct: 195 SITVGLTNGKQGKLNNLTIGCTKSMLNGVNFNEETGGILGLGFAKDSFIDKAANKYGAKF 254
Query: 238 -----------------TIAG--NQRL----------------GVSTPDIVIDSGTTLTF 262
TI G N +L GV+ I I G L
Sbjct: 255 SYCLVDHLSHRSVSSNLTIGGHHNAKLLGEIRRTELILFPPFYGVNVVGISI-GGQMLKI 313
Query: 263 LPQGYNSN----------------LLSVMSSMIEA-----QPVADPTG----SLELCYSF 297
PQ ++ N LL ++ EA V TG +LE C+
Sbjct: 314 PPQVWDFNAEGGTLIDSGTTLTSLLLPAYEAVFEALTKSLTKVKRVTGEDFDALEFCFDA 373
Query: 298 NSL--SQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGI--TNSVPIYGNIMQTN 352
S VP + HF GA + ++ + V+ + C I + GNIMQ N
Sbjct: 374 EGFDDSVVPRLVFHFAGGARFEPPVKSYIIDVAPLVKCIGIVPIDGIGGASVIGNIMQQN 433
Query: 353 FLVGYDIEQQTVSFKPTDCT 372
L +D+ TV F P+ CT
Sbjct: 434 HLWEFDLSTNTVGFAPSTCT 453
>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
Length = 494
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 91/365 (24%), Positives = 150/365 (41%), Gaps = 89/365 (24%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y RI IGTP DTGSD++W C+ CP + ++DP+ S + + +
Sbjct: 90 YFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELVT 149
Query: 148 CSSSQCASLNQK---SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
C C + SC+ + C+YS+SYGDGS + G T+ + +G P
Sbjct: 150 CDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTPANA 209
Query: 202 -ITFGCGTNNGGLFNSKT---TGIVGLGGGDISLISQMRT-------------TIAGNQR 244
++FGCG GG S GI+G G + S++SQ+ T+ G
Sbjct: 210 SVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTVNGGGI 269
Query: 245 LGVS------------TPDI----------------------VIDSG----------TTL 260
+ PD+ + DSG TTL
Sbjct: 270 FAIGNVVQPKVKTTPLVPDMPHYNVILKGIDVGGTALGLPTNIFDSGNSKGTIIDSGTTL 329
Query: 261 TFLPQGYNSNLLSVM---SSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFRGADV 315
++P+G L +++ I Q + D + C+ ++ PEVT HF G DV
Sbjct: 330 AYVPEGVYKALFAMVFDKHQDISVQTLQDFS-----CFQYSGSVDDGFPEVTFHFEG-DV 383
Query: 316 KL--SRSNFFVKVSEDIVCSVFK---GITNS---VPIYGNIMQTNFLVGYDIEQQTVSFK 367
L S ++ + +++ C F+ G T + + G+++ +N LV YD+E Q + +
Sbjct: 384 SLIVSPHDYLFQNGKNLYCMGFQNGGGKTKDGKDLGLLGDLVLSNKLVLYDLENQAIGWA 443
Query: 368 PTDCT 372
+C+
Sbjct: 444 DYNCS 448
>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 640
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 88/359 (24%), Positives = 144/359 (40%), Gaps = 84/359 (23%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
N Y R+ IGTPP + DTGS + + C C C P F P +S TY+ +
Sbjct: 86 NGYYTTRLWIGTPPQRFALIVDTGSTVTYVPCSTC--EHCGRHQDPKFQPDLSETYQPVK 143
Query: 148 CSSSQCASLNQKSCSGV--NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
C + C +C G C Y Y + S S+G L + V+ G+ + +A FG
Sbjct: 144 C-TPDC------NCDGDTNQCMYDRQYAEMSSSSGVLGEDVVSFGNLS--ELAPQRAVFG 194
Query: 206 CGTNN-GGLFNSKTTGIVGLGGGDISLISQM--RTTIAGNQRL---------------GV 247
C + G L++ + GI+GLG GD+S++ Q+ + I+ + L G+
Sbjct: 195 CENDETGDLYSQRADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGGGAMILGGI 254
Query: 248 STPDIVI-----------------------------------------DSGTTLTFLPQ- 265
S P+ ++ DSGTT +LP+
Sbjct: 255 SPPEDMVFTHSDPDRSPYYNINLKEMHVAGKKLQLNPKVFDGKHGTVLDSGTTYAYLPET 314
Query: 266 GYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSFNSL--SQV----PEVTIHFR-GADVKL 317
+ + ++M + + P + ++C++ + SQ+ P V + F G + L
Sbjct: 315 AFLAFKRAIMKERNSLKQINGPDPNYKDICFTGAGIDVSQLAKSFPVVDMVFENGHKLSL 374
Query: 318 SRSNFFV---KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
S N+ KV VF + + G I N LV YD E + F T+C++
Sbjct: 375 SPENYLFRHSKVRGAYCLGVFSNGRDPTTLLGGIFVRNTLVMYDRENSKIGFWKTNCSE 433
>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
Length = 427
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 54/178 (30%), Positives = 83/178 (46%), Gaps = 31/178 (17%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEP--------CPPSQCYMQDSPLFDPKM 139
+ Y + + +GTP + + DTGSDL W QC P PP +P +D
Sbjct: 56 SGQYFVELRVGTPAKKFPLIVDTGSDLTWIQCNPPNTTANSSSPP-------APWYDKSS 108
Query: 140 SSTYKSLPCSSSQCASLNQ---KSCSGVN---CQYSVSYGDGSFSNGNLATETVTL---- 189
SS+Y+ +PC+ +C L SCS + C Y+ Y D S + G LA ET+++
Sbjct: 109 SSSYREIPCTDDECQFLPAPIGSSCSITSPSPCDYTYGYSDQSRTTGILAYETISMKSRK 168
Query: 190 ------GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
G+ + + + + GC + G +G++GLG G ISL +Q R T G
Sbjct: 169 RSGKRAGNHKTRRIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTALG 226
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 35/123 (28%), Positives = 68/123 (55%), Gaps = 4/123 (3%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ-VPEVTIHFR 311
+ DSGTTL++L + S +L +++ I + ELCY+ + + +P++ + F+
Sbjct: 304 IFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIPEGFELCYNVTRMEKGMPKLGVEFQ 363
Query: 312 GADV-KLSRSNFFVKVSEDIVCSVFKGI--TNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
G V +L +N+ V V+E++ C + + TN I GN++Q + + YD+ + + FK
Sbjct: 364 GGAVMELPWNNYMVLVAENVQCVALQKVTTTNGSNILGNLLQQDHHIEYDLAKARIGFKW 423
Query: 369 TDC 371
+ C
Sbjct: 424 SPC 426
>gi|334187133|ref|NP_001190905.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|21592493|gb|AAM64443.1| nucellin-like protein [Arabidopsis thaliana]
gi|332660834|gb|AEE86234.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 85/380 (22%), Positives = 147/380 (38%), Gaps = 89/380 (23%)
Query: 77 SKASQADIIPNNAN------YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
++A + + P + N Y + I+IG PP DTGSDL W QC+ P +C
Sbjct: 40 TRAVSSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCD-APCVRCLEA 98
Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASL----NQKSCSGVNCQYSVSYGDGSFSNGNLATET 186
PL+ P + +PC+ C +L NQ+ + C Y V Y DG S G L +
Sbjct: 99 PHPLYQP----SSDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDV 154
Query: 187 VTLGSTTGQAVALPGITFGCGTNN--GGLFNSKTTGIVGLGGGDISLISQMRTT------ 238
++ T G + P + GCG + G + G++GLG G +S++SQ+ +
Sbjct: 155 FSMNYTQGLRLT-PRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNV 213
Query: 239 ----------------------------------------------IAGNQRLGVSTPDI 252
+ G + G+
Sbjct: 214 IGHCLSSLGGGILFFGDDLYDSSRVSWTPMSREYSKHYSPAMGGELLFGGRTTGLKNLLT 273
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV--ADPTGSLELCYSFNS-LSQVPEVTIH 309
V DSG++ T+ + ++ + +P+ A +L LC+ + EV +
Sbjct: 274 VFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKY 333
Query: 310 FRGADVKL-----SRSNFFVKVSEDIVCS----VFKGITNSVPI-------YGNIMQTNF 353
F+ + S++ F + ++ S V GI N I G+I +
Sbjct: 334 FKPLALSFKTGWRSKTLFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQDQ 393
Query: 354 LVGYDIEQQTVSFKPTDCTK 373
++ YD E+Q++ + P DC +
Sbjct: 394 MIIYDNEKQSIGWMPVDCDE 413
>gi|297720449|ref|NP_001172586.1| Os01g0776900 [Oryza sativa Japonica Group]
gi|255673740|dbj|BAH91316.1| Os01g0776900 [Oryza sativa Japonica Group]
Length = 381
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 52/162 (32%), Positives = 78/162 (48%), Gaps = 15/162 (9%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y R+ +G+PP E DTGSD++W C P CP S F+P SST +P
Sbjct: 91 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 150
Query: 148 CSSSQCASLNQKS---CSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVA 198
CS +C + Q S C + C Y+ +YGDGS ++G ++T+ + G A +
Sbjct: 151 CSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTANS 210
Query: 199 LPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRT 237
I FGC + G + GI G G +S++SQ+ +
Sbjct: 211 SASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNS 252
>gi|26452545|dbj|BAC43357.1| putative nucellin [Arabidopsis thaliana]
Length = 413
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 85/380 (22%), Positives = 147/380 (38%), Gaps = 89/380 (23%)
Query: 77 SKASQADIIPNNAN------YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
++A + + P + N Y + I+IG PP DTGSDL W QC+ P +C
Sbjct: 28 TRAVSSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCD-APCVRCLEA 86
Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASL----NQKSCSGVNCQYSVSYGDGSFSNGNLATET 186
PL+ P + +PC+ C +L NQ+ + C Y V Y DG S G L +
Sbjct: 87 PHPLYQP----SSDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDV 142
Query: 187 VTLGSTTGQAVALPGITFGCGTNN--GGLFNSKTTGIVGLGGGDISLISQMRTT------ 238
++ T G + P + GCG + G + G++GLG G +S++SQ+ +
Sbjct: 143 FSMNYTQGLRLT-PRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNV 201
Query: 239 ----------------------------------------------IAGNQRLGVSTPDI 252
+ G + G+
Sbjct: 202 IGHCLSSLGGGILFFGDDLYDSSRVSWTPMSREYSKHYSPAMGGELLFGGRTTGLKNLLT 261
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV--ADPTGSLELCYSFNS-LSQVPEVTIH 309
V DSG++ T+ + ++ + +P+ A +L LC+ + EV +
Sbjct: 262 VFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKY 321
Query: 310 FRGADVKL-----SRSNFFVKVSEDIVCS----VFKGITNSVPI-------YGNIMQTNF 353
F+ + S++ F + ++ S V GI N I G+I +
Sbjct: 322 FKPLALSFKTGWRSKTLFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQDQ 381
Query: 354 LVGYDIEQQTVSFKPTDCTK 373
++ YD E+Q++ + P DC +
Sbjct: 382 MIIYDNEKQSIGWMPVDCDE 401
>gi|255581545|ref|XP_002531578.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223528808|gb|EEF30814.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 54/161 (33%), Positives = 81/161 (50%), Gaps = 20/161 (12%)
Query: 84 IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
+ +N + + +++G+PP V DTGS+L W C+ + +S +F+P S TY
Sbjct: 62 LFHHNVSLTVSLTVGSPPQNVTMVLDTGSELSWLHCK-----KTQFLNS-VFNPLSSKTY 115
Query: 144 KSLPCSSSQCASLNQK-----SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
+PC S C + + SC C VSY D + GNLA ET LGS T
Sbjct: 116 SKVPCLSPTCKTRTRDLTIPVSCDATKLCHVIVSYADATSIEGNLAFETFRLGSLTK--- 172
Query: 198 ALPGITFGC---GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
P FGC G ++ +SKTTG++G+ G +S ++QM
Sbjct: 173 --PATIFGCMDSGFSSNSEEDSKTTGLIGMNRGSLSFVNQM 211
>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
Length = 373
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 97/369 (26%), Positives = 150/369 (40%), Gaps = 82/369 (22%)
Query: 83 DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS--PLFDPKMS 140
+I+ ++ + + + I P R + DTGSDLIWTQC+ + + P++DP S
Sbjct: 8 NILLSDQGHSLTVGIVQP---RKLIVDTGSDLIWTQCKLSSSTAAAARHGSPPVYDPGES 64
Query: 141 STYKSLPCSSSQC--ASLNQKSCSGVN-CQYSVSYGDGS---------FSNGNLATETVT 188
ST+ LPCS C + K+C+ N C Y YG + F+ G ++
Sbjct: 65 STFAFLPCSDRLCQEGQFSFKNCTSKNRCVYEDVYGSAAAVGVLASETFTFGARRAVSLR 124
Query: 189 LGSTTGQ--AVALPGITFGCGTNNGGL-----------------FNSKTTGIVGLGG-GD 228
LG G A +L G T G + L F K T + G D
Sbjct: 125 LGFGCGALSAGSLIGATGILGLSPESLSLITQLKIQRFSYCLTPFADKKTSPLLFGAMAD 184
Query: 229 ISLISQMR----TTIA------------------GNQRLGVST------PD----IVIDS 256
+S R T I G++RL V PD ++DS
Sbjct: 185 LSRHKTTRPIQTTAIVSNPVETVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDS 244
Query: 257 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPT-GSLELCYSFNSLS--------QVPEVT 307
G+T+ +L + + + ++ PVA+ T ELC+ + QVP +
Sbjct: 245 GSTVAYLVEAAFEAVKEAVMDVVRL-PVANRTVEDYELCFVLPRRTAAAAMEAVQVPPLV 303
Query: 308 IHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTV 364
+HF G A + L R N+F + ++C T+ V I GN+ Q N V +D++
Sbjct: 304 LHFDGGAAMVLPRDNYFQEPRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQHHKF 363
Query: 365 SFKPTDCTK 373
SF PT C +
Sbjct: 364 SFAPTQCDQ 372
>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 92/369 (24%), Positives = 147/369 (39%), Gaps = 80/369 (21%)
Query: 76 SSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
S++ D + N Y R+ IGTPP E + D+GS + + C C QC P F
Sbjct: 73 SARMRLHDDLLTNGYYTTRLHIGTPPQEFALIVDSGSTVTYVPCASC--EQCGNHQDPRF 130
Query: 136 DPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
P +SSTY + C+ C + K+ C Y Y + S S+G L + V+ G+ +
Sbjct: 131 QPDLSSTYSPVKCNVD-CTCDSDKN----QCTYERQYAEMSSSSGVLGEDIVSFGTES-- 183
Query: 196 AVALPGITFGC-GTNNGGLFNSKTTGIVGL------------------------------ 224
+ FGC + G LF+ GI+GL
Sbjct: 184 ELKPQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDI 243
Query: 225 GGGDISL-----------------------ISQMRTTIAG-----NQRLGVSTPDIVIDS 256
GGG + L I +AG + R+ V+DS
Sbjct: 244 GGGAMVLGAMPAPPGMIYTHSNAVRSPYYNIELKEMHVAGKALRVDPRIFDGKHGTVLDS 303
Query: 257 GTTLTFLPQGYNSNLLSVMSSMIEA-QPVADPTGSL-ELCYS-----FNSLSQV-PEVTI 308
GTT +LP+ +SS + + + P + ++C++ + LS+V P+V +
Sbjct: 304 GTTYAYLPEQAFVAFKDAVSSQVHPLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPKVDM 363
Query: 309 HF-RGADVKLSRSNFFVKVS--EDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 364
F G + LS N+ + S E C VF+ + + G I+ N LV YD + +
Sbjct: 364 VFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKI 423
Query: 365 SFKPTDCTK 373
F T+C++
Sbjct: 424 GFWKTNCSE 432
>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
Length = 368
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 53/163 (32%), Positives = 87/163 (53%), Gaps = 20/163 (12%)
Query: 93 IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
+++ IG+ A+ DTGS+ + QC + P+FDP S +Y+ +PC S
Sbjct: 1 MQLGIGSLQKNLSAIIDTGSEAVLVQCGS--------RSRPVFDPAASQSYRQVPCISQL 52
Query: 153 CASLNQKSCSG-----VN----CQYSVSYGDGSFSNGNLATETVTLGST--TGQAVALPG 201
C ++ Q++ +G VN C YS+SYGD S G+ + + + L ST + QAV
Sbjct: 53 CLAVQQQTSNGSSQPCVNSSAACTYSLSYGDSRNSTGDFSQDVIFLNSTNSSSQAVQFRD 112
Query: 202 ITFGCGTN-NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQ 243
+ FGC + G L + + GIVG G++SL SQ++ + G++
Sbjct: 113 VAFGCAHSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSK 155
>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
Length = 388
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 82/367 (22%), Positives = 138/367 (37%), Gaps = 86/367 (23%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y ++ +G P + DTGSD++W C P CP ++DP+ SST +
Sbjct: 2 YFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSLVS 61
Query: 148 CSSSQCA---SLNQKSCSGV--NCQYSVSYGDGSFSNGNLATETVTLG--STTGQAVALP 200
CS C + CS NC+Y SYGDGS S G + + S+ G A
Sbjct: 62 CSDPLCVRGRRFAEAQCSQATNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANTTS 121
Query: 201 GITFGCGTNNGGLFNS---KTTGIVGLGGGDISLISQ----------------------- 234
+ FGC G ++ GI+G G ++S+ +Q
Sbjct: 122 QVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLEGEKRGGG 181
Query: 235 ---------------------------MRTTIAGNQRLGVSTPD--------IVIDSGTT 259
+R + RL + D +++DSGTT
Sbjct: 182 ILVIGGIAEPGMTYTPLVPDSVHYNVVLRGISVNSNRLPIDAEDFSSTNDTGVIMDSGTT 241
Query: 260 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV-PEVTIHFRGADVKLS 318
L + P G + + + A PV + LS + P VT++F G ++L
Sbjct: 242 LAYFPSGAYNVFVQAIREATSATPVRVQGMDTQCFLVSGRLSDLFPNVTLNFEGGAMELQ 301
Query: 319 RSNFFV------KVSEDIVCSVFKGITNS--------VPIYGNIMQTNFLVGYDIEQQTV 364
N+ + + D+ C ++ ++S + I G+I+ + LV YD++ +
Sbjct: 302 PDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDIVLKDKLVVYDLDNSRI 361
Query: 365 SFKPTDC 371
+ +C
Sbjct: 362 GWMSYNC 368
>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
Length = 530
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 52/162 (32%), Positives = 78/162 (48%), Gaps = 15/162 (9%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y R+ +G+PP E DTGSD++W C P CP S F+P SST +P
Sbjct: 117 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 176
Query: 148 CSSSQCASLNQKS---CSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVA 198
CS +C + Q S C + C Y+ +YGDGS ++G ++T+ + G A +
Sbjct: 177 CSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTANS 236
Query: 199 LPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRT 237
I FGC + G + GI G G +S++SQ+ +
Sbjct: 237 SASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNS 278
>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 468
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 93/371 (25%), Positives = 144/371 (38%), Gaps = 92/371 (24%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP---LFDPKMSSTYKSL 146
Y +R+ +GTP + VADTGSDL W +C S SP +F P S ++ L
Sbjct: 103 QYFVRLRVGTPAQPFVLVADTGSDLTWVKCSSPSSSSSSPAASPPQRVFRPAGSKSWSPL 162
Query: 147 PCSSSQCAS---LNQKSCSGVN--CQYSVSYGDGSFSNG--NLATETVTLGSTTG-QAVA 198
PC S C S + +CS C Y Y D S + G L + TV+L G +
Sbjct: 163 PCDSDTCKSYVPFSLANCSSPPDPCSYDYRYKDNSSARGVVGLDSATVSLSGNDGTRKAK 222
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLG--------------GGDIS-------------- 230
L + GC T+ G + G++ LG GG S
Sbjct: 223 LQEVVLGCTTSYDGQSFKSSDGVLSLGNSNISFASRAASRFGGRFSYCLVDHLAPRNATS 282
Query: 231 -----------------------LISQMRT-----------TIAGNQRLGVSTPDI---- 252
L+ RT T+AG +RL + PD+
Sbjct: 283 FLTFGNGDSSPGDDSSSRRTPLVLLEDARTRPFYFVSVDAVTVAG-ERLEI-LPDVWDFR 340
Query: 253 -----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA--DPTGSLELCYSFNSLS-QVP 304
++DSGT+LT L ++ +S P DP E CY++ +S ++P
Sbjct: 341 KNGGAILDSGTSLTILATPAYDAVVKAISKQFAGVPRVNMDP---FEYCYNWTGVSAEIP 397
Query: 305 EVTIHFRGA-DVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 362
+ + F GA + ++ + + + C V +G V + GNI+Q L +D+ +
Sbjct: 398 RMELRFAGAATLAPPGKSYVIDTAPGVKCIGVVEGAWPGVSVIGNILQQEHLWEFDLANR 457
Query: 363 TVSFKPTDCTK 373
+ FK + C
Sbjct: 458 WLRFKQSRCAH 468
>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
Length = 428
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 61/205 (29%), Positives = 98/205 (47%), Gaps = 15/205 (7%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN 90
+ + H +SP SPF + ++ L + RL + + + S + I +
Sbjct: 34 LRVFHVNSPCSPFKQPNTVSWE---STLLKDKARLQYLSSLAKKPSVPIASGRAIVQSPT 90
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y++R +IGTP L DT +D W C C S LFDP SS+ ++L C +
Sbjct: 91 YIVRANIGTPAQPMLVALDTSNDAAWVPCSGC----VGCASSVLFDPSKSSSSRNLQCDA 146
Query: 151 SQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
QC +C +G +C ++++YG GS +L +T+TL + + TFGC +
Sbjct: 147 PQCKQAPNPTCTAGKSCGFNMTYG-GSTIEASLTQDTLTLAND-----VIKSYTFGCISK 200
Query: 210 NGGLFNSKTTGIVGLGGGDISLISQ 234
G + G++GLG G +SLISQ
Sbjct: 201 ATGT-SLPAQGLMGLGRGPLSLISQ 224
>gi|312282457|dbj|BAJ34094.1| unnamed protein product [Thellungiella halophila]
Length = 424
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 85/380 (22%), Positives = 146/380 (38%), Gaps = 89/380 (23%)
Query: 77 SKASQADIIPNNAN------YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
++A+ + + P + N Y + I+IG PP DTGSDL W QC+ P C
Sbjct: 37 TRAASSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCD-APCVHCLEA 95
Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASL----NQKSCSGVNCQYSVSYGDGSFSNGNLATET 186
PL+ P + +PC+ C +L N + + C Y V Y DG S G L +
Sbjct: 96 PHPLYQP----SNDLIPCNDPLCKALHFNGNHRCETPEQCDYEVEYADGGSSLGVLVRDV 151
Query: 187 VTLGSTTGQAVALPGITFGCGTNN--GGLFNSKTTGIVGLGGGDISLISQMRTT------ 238
+L T G + P + GCG + G + G++GLG G +S++SQ+ +
Sbjct: 152 FSLNYTKGLRLT-PRLALGCGYDQIPGASGHHPLDGVLGLGRGKVSILSQLHSQGYVKNV 210
Query: 239 ----------------------------------------------IAGNQRLGVSTPDI 252
+ G + G+
Sbjct: 211 VGHCLSSLGGGILFFGNDLYDSSRVSWTPMARENSKHYSPAMGGELLFGGRTTGLKNLLT 270
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV--ADPTGSLELCYSFNS-LSQVPEVTIH 309
V DSG++ T+ + ++ + +P+ A +L LC+ + EV +
Sbjct: 271 VFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKY 330
Query: 310 FRGADVKL-----SRSNFFVKVSEDIVCS----VFKGITNSVPI-------YGNIMQTNF 353
F+ + S++ F + ++ S V GI N I G+I +
Sbjct: 331 FKPLALSFKTGWRSKTLFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQDQ 390
Query: 354 LVGYDIEQQTVSFKPTDCTK 373
++ YD E+Q++ + P DC +
Sbjct: 391 MIIYDNEKQSIGWIPADCDE 410
>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 507
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 56/158 (35%), Positives = 79/158 (50%), Gaps = 13/158 (8%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y ++ +G+PPTE DTGSD++W + C CP S D FD S T S+
Sbjct: 100 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSFTAGSVT 159
Query: 148 CSSSQCASLNQKS---CSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL---P 200
CS C+S+ Q + CS N C YS YGDGS ++G T+T + G+++
Sbjct: 160 CSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSA 219
Query: 201 GITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM 235
I FGC T G + GI G G G +S++SQ+
Sbjct: 220 PIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQL 257
>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
Length = 564
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 49/149 (32%), Positives = 75/149 (50%), Gaps = 10/149 (6%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
N Y R+ IGTPP + DTGS + + C C QC P F P +SSTY+S+
Sbjct: 10 NGYYTTRLWIGTPPQRFALIVDTGSSVTYVPCSSC--EQCGRHQDPKFQPDLSSTYQSVK 67
Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC- 206
C+ C ++K C Y Y + S S+G L + ++ G+ + A+A FGC
Sbjct: 68 CNID-CNCDDEKQ----QCVYERQYAEMSTSSGVLGEDIISFGNLS--ALAPQRAVFGCE 120
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
G L++ GI+G+G GD+S++ +
Sbjct: 121 NMETGDLYSQHADGIMGMGRGDLSIVDHL 149
Score = 43.1 bits (100), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 36/133 (27%), Positives = 65/133 (48%), Gaps = 12/133 (9%)
Query: 253 VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS--FNSLSQV----P 304
++DSGTT +LP+ + S ++M + +P+ P + ++C+S + +SQ+ P
Sbjct: 225 ILDSGTTYAYLPEAAFVSFKDAIMKELHSLKPIRGPDPNYNDICFSGAGSDISQLSSSFP 284
Query: 305 EVTIHF-RGADVKLSRSNFFV---KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 360
V + F G + LS N+ KV +F+ + + G I+ N LV YD E
Sbjct: 285 AVEMVFGNGQKLLLSPENYLFRHSKVHGAYCLGIFQNGKDPTTLLGGIVVRNTLVLYDRE 344
Query: 361 QQTVSFKPTDCTK 373
+ F T+C++
Sbjct: 345 NSKIGFWKTNCSE 357
>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 507
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 56/158 (35%), Positives = 79/158 (50%), Gaps = 13/158 (8%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y ++ +G+PPTE DTGSD++W + C CP S D FD S T S+
Sbjct: 100 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVT 159
Query: 148 CSSSQCASLNQKS---CSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL---P 200
CS C+S+ Q + CS N C YS YGDGS ++G T+T + G+++
Sbjct: 160 CSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSA 219
Query: 201 GITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM 235
I FGC T G + GI G G G +S++SQ+
Sbjct: 220 PIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQL 257
>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 512
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 56/158 (35%), Positives = 79/158 (50%), Gaps = 13/158 (8%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y ++ +G+PPTE DTGSD++W + C CP S D FD S T S+
Sbjct: 105 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVT 164
Query: 148 CSSSQCASLNQKS---CSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
CS C+S+ Q + CS N C YS YGDGS ++G T+T + G+++
Sbjct: 165 CSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSA 224
Query: 202 -ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM 235
I FGC T G + GI G G G +S++SQ+
Sbjct: 225 PIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQL 262
>gi|30699261|ref|NP_850981.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|17065172|gb|AAL32740.1| nucellin-like protein [Arabidopsis thaliana]
gi|24899795|gb|AAN65112.1| nucellin-like protein [Arabidopsis thaliana]
gi|332197863|gb|AEE35984.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 466
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 90/364 (24%), Positives = 145/364 (39%), Gaps = 90/364 (24%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y + ++IG PP DTGSDL W QC+ P + C + + P + +LPCS
Sbjct: 67 YYVLLNIGNPPKLFDLDIDTGSDLTWVQCD-APCNGCTKPRAKQYKP----NHNTLPCSH 121
Query: 151 SQCASLN---QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
C+ L+ + C+ C Y + Y D + S G L T+ V L G + L +TFG
Sbjct: 122 ILCSGLDLPQDRPCADPEDQCDYEIGYSDHASSIGALVTDEVPLKLANGSIMNLR-LTFG 180
Query: 206 CG---TNNGGLFNSKTTGIVGLGGGDISLISQMR-------------------------- 236
CG N G T GI+GLG G + L +Q++
Sbjct: 181 CGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHCLSHTGKGFLSIGDE 240
Query: 237 ---------TTIAGN-----------------QRLGVSTPDIVIDSGTTLTFLPQGYNSN 270
T++A N + GV ++V DSG++ T+
Sbjct: 241 LVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVKGINVVFDSGSSYTYFNAEAYQA 300
Query: 271 LLSVMSSMIEAQPVADPTG--SLELCYS----FNSLSQVPE----VTIHF----RGADVK 316
+L ++ + +P+ D SL +C+ SL +V + +T+ F G +
Sbjct: 301 ILDLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDEVKKYFKTITLRFGNQKNGQLFQ 360
Query: 317 LSRSNFFVKVSEDIVCSVFKGITNSVP-------IYGNIMQTNFLVGYDIEQQTVSFKPT 369
+ ++ + + VC GI N I G+I +V YD E+Q + + +
Sbjct: 361 VPPESYLIITEKGRVCL---GILNGTEIGLEGYNIIGDISFQGIMVIYDNEKQRIGWISS 417
Query: 370 DCTK 373
DC K
Sbjct: 418 DCDK 421
>gi|30699263|ref|NP_177872.3| aspartyl protease-like protein [Arabidopsis thaliana]
gi|332197862|gb|AEE35983.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 432
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 90/364 (24%), Positives = 145/364 (39%), Gaps = 90/364 (24%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y + ++IG PP DTGSDL W QC+ P + C + + P + +LPCS
Sbjct: 67 YYVLLNIGNPPKLFDLDIDTGSDLTWVQCD-APCNGCTKPRAKQYKP----NHNTLPCSH 121
Query: 151 SQCASLN---QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
C+ L+ + C+ C Y + Y D + S G L T+ V L G + L +TFG
Sbjct: 122 ILCSGLDLPQDRPCADPEDQCDYEIGYSDHASSIGALVTDEVPLKLANGSIMNLR-LTFG 180
Query: 206 CG---TNNGGLFNSKTTGIVGLGGGDISLISQMR-------------------------- 236
CG N G T GI+GLG G + L +Q++
Sbjct: 181 CGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHCLSHTGKGFLSIGDE 240
Query: 237 ---------TTIAGN-----------------QRLGVSTPDIVIDSGTTLTFLPQGYNSN 270
T++A N + GV ++V DSG++ T+
Sbjct: 241 LVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVKGINVVFDSGSSYTYFNAEAYQA 300
Query: 271 LLSVMSSMIEAQPVADPTG--SLELCYS----FNSLSQVPE----VTIHF----RGADVK 316
+L ++ + +P+ D SL +C+ SL +V + +T+ F G +
Sbjct: 301 ILDLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDEVKKYFKTITLRFGNQKNGQLFQ 360
Query: 317 LSRSNFFVKVSEDIVCSVFKGITNSVP-------IYGNIMQTNFLVGYDIEQQTVSFKPT 369
+ ++ + + VC GI N I G+I +V YD E+Q + + +
Sbjct: 361 VPPESYLIITEKGRVCL---GILNGTEIGLEGYNIIGDISFQGIMVIYDNEKQRIGWISS 417
Query: 370 DCTK 373
DC K
Sbjct: 418 DCDK 421
>gi|240255485|ref|NP_189841.4| aspartyl protease family protein [Arabidopsis thaliana]
gi|332644216|gb|AEE77737.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 430
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 82/327 (25%), Positives = 134/327 (40%), Gaps = 53/327 (16%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+A Y + IGTPP E V DTGSDL+W C C C + + FDP SS+ L
Sbjct: 75 SALYYTTVQIGTPPRELDVVIDTGSDLVWVSCNSC--VGCPLHNVTFFDPGASSSAVKLA 132
Query: 148 CSSSQCASLNQKS--CSGV-NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
CS +C+S QK CS + +C Y V YGDGS ++G ++ ++ + + T+
Sbjct: 133 CSDKRCSSDLQKKSRCSLLESCTYKVEYGDGSVTSGYYISDLISFDTMSDW-------TY 185
Query: 205 GCGTNNGGLFNSKTTG-IVGLGGGDISL---------------ISQMRTTIAGNQRLGVS 248
+N G I+G S S M T + RL +
Sbjct: 186 IAFRDNSTWHPWVRQGAIIGTFPALCSTPCSTVSSQPLYYNPQFSHMMTVAVNDLRLPID 245
Query: 249 TPDI---------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS 299
P + +IDSGTTL P L+ + +++ P S + C++ S
Sbjct: 246 -PSVFSVAKGYGTIIDSGTTLVHFPGEAYDPLIQAILNVVSQYGRPIPYESFQ-CFNITS 303
Query: 300 --------LSQVPEVTIHFR-GADVKLSRSNF----FVKVSEDIVC-SVFKGITNSVPIY 345
PEV + F GA + + + F+ ++ I C + + + I
Sbjct: 304 GISSHLVIADMFPEVHLGFAGGASMVIKPEAYLFQKFLDLTNAIWCLGFYSSTSRRITII 363
Query: 346 GNIMQTNFLVGYDIEQQTVSFKPTDCT 372
G + + + YD++ Q + + +C+
Sbjct: 364 GEVAIRDKMFVYDLDHQRIGWAEYNCS 390
>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
Length = 395
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 54/178 (30%), Positives = 83/178 (46%), Gaps = 31/178 (17%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEP--------CPPSQCYMQDSPLFDPKM 139
+ Y + + +GTP + + DTGSDL W QC P PP +P +D
Sbjct: 24 SGQYFVELRVGTPAKKFPLIIDTGSDLTWIQCNPPNTTANSSSPP-------APWYDKSS 76
Query: 140 SSTYKSLPCSSSQCASLNQ---KSCSGVN---CQYSVSYGDGSFSNGNLATETVTL---- 189
SS+Y+ +PC+ +C L SCS + C Y+ Y D S + G LA ET+++
Sbjct: 77 SSSYREIPCTDDECLFLPAPIGSSCSIKSPSPCDYTYGYSDQSRTTGILAYETISMKSRK 136
Query: 190 ------GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
G+ + + + + GC + G +G++GLG G ISL +Q R T G
Sbjct: 137 RSGKRAGNHKTRTIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTALG 194
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 35/123 (28%), Positives = 68/123 (55%), Gaps = 4/123 (3%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ-VPEVTIHFR 311
+ DSGTTL++L + S +L +++ I + ELCY+ + + +P++ + F+
Sbjct: 272 IFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIPEGFELCYNVTRMEKGMPKLGVEFQ 331
Query: 312 GADV-KLSRSNFFVKVSEDIVCSVFKGI--TNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
G V +L +N+ V V+E++ C + + TN I GN++Q + + YD+ + + FK
Sbjct: 332 GGAVMELPWNNYMVLVAENVQCVALQKVTTTNGSNILGNLLQQDHHIEYDLAKARIGFKW 391
Query: 369 TDC 371
+ C
Sbjct: 392 SPC 394
>gi|356531884|ref|XP_003534506.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 482
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 51/158 (32%), Positives = 74/158 (46%), Gaps = 14/158 (8%)
Query: 92 LIRISIGTPPTERLAVADTGSDLIWTQC---EPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
L IG P + DTGSD +W C CP + L+DP S T K +PC
Sbjct: 76 LYYTKIGLGPNDYYVQVDTGSDTLWVNCVGCTTCPKKSGLGMELTLYDPNSSKTSKVVPC 135
Query: 149 SSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP---G 201
C S SG ++C YS++YGDGS ++G+ + +T G +P
Sbjct: 136 DDEFCTSTYDGPISGCKKDMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLRTVPDNTS 195
Query: 202 ITFGCGTNNGGLFNSKT----TGIVGLGGGDISLISQM 235
+ FGCG+ G +S T GI+G G + S++SQ+
Sbjct: 196 VIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQL 233
>gi|255588450|ref|XP_002534607.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223524923|gb|EEF27776.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 260
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 57/178 (32%), Positives = 90/178 (50%), Gaps = 18/178 (10%)
Query: 61 SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
SL NH + N+ + DI+ + Y ++ IGTPP E V DTGS++ + C
Sbjct: 25 SLANYNHLHPNARM----PLYGDIL-SYGYYATKLYIGTPPQEFTLVVDTGSNMTFVPC- 78
Query: 121 PCPPSQ-CYMQDSPLFDPKMSSTYKSLPCSSS-QCASLNQKSCSGVNCQYSVSYGDGSFS 178
C + C + P F + SSTY+ + C S C L + C Y + YGDGS+S
Sbjct: 79 -CGSEEYCGKHEDPAFQTESSSTYQPVNCHPSCDCDYLRSQ------CSYKMHYGDGSYS 131
Query: 179 NGNLATETVTLGSTTGQAVALPGITFGCGTNN-GGLFNSKTTGIVGLGGGDISLISQM 235
G LA + ++ G+ + A + FGC + G L++ + GI+GLG G +++ Q+
Sbjct: 132 RGVLAEDIISFGNES--EFAPQRLVFGCELDAIGSLYSLRADGIIGLGRGRSTIVDQL 187
>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 663
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 91/359 (25%), Positives = 143/359 (39%), Gaps = 84/359 (23%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
N Y R+ IGTPP + DTGS + + C C QC P F P+ SSTY+ +
Sbjct: 109 NGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTC--EQCGRHQDPKFQPESSSTYQPVK 166
Query: 148 CSSSQCASLNQKSCSG--VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
C+ C +C G + C Y Y + S S+G L + ++ G+ + +A FG
Sbjct: 167 CTID-C------NCDGDRMQCVYERQYAEMSTSSGVLGEDVISFGNQS--ELAPQRAVFG 217
Query: 206 C-GTNNGGLFNSKTTGIVGLGGG---------DISLISQMRTTIAGNQRLG--------- 246
C G L++ GI+GLG G D +IS + G +G
Sbjct: 218 CENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGGGAMVLGGI 277
Query: 247 ---------VSTPD-------------------------------IVIDSGTTLTFLPQ- 265
S PD V+DSGTT +LP+
Sbjct: 278 SPPSDMTFAYSDPDRSPYYNIDLKEMHVAGKRLPLNANVFDGKHGTVLDSGTTYAYLPEA 337
Query: 266 GYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS--FNSLSQV----PEVTIHF-RGADVKL 317
+ + +++ + + ++ P + ++C+S N +SQ+ P V + F G L
Sbjct: 338 AFLAFKDAIVKELQSLKQISGPDPNYNDICFSGAGNDVSQLSKSFPVVDMVFGNGHKYSL 397
Query: 318 SRSNFFV---KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
S N+ KV +F+ + + G I+ N LV YD EQ + F T+C +
Sbjct: 398 SPENYMFRHSKVRGAYCLGIFQNGNDQTTLLGGIIVRNTLVMYDREQTKIGFWKTNCAE 456
>gi|297842525|ref|XP_002889144.1| hypothetical protein ARALYDRAFT_476912 [Arabidopsis lyrata subsp.
lyrata]
gi|297334985|gb|EFH65403.1| hypothetical protein ARALYDRAFT_476912 [Arabidopsis lyrata subsp.
lyrata]
Length = 467
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 87/361 (24%), Positives = 145/361 (40%), Gaps = 84/361 (23%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y + ++IG PP DTGSDL W QC+ P + C + + P + +LPCS
Sbjct: 68 YYVLLNIGNPPKLFDLDIDTGSDLTWVQCD-APCNGCTKPRAKQYKP----NHNTLPCSH 122
Query: 151 SQCASLN---QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
C+ L+ + C C Y + Y D + S G L T+ L G ++ P +TFG
Sbjct: 123 LLCSGLDLTQNRPCDDPEDQCDYEIGYSDHASSIGALVTDEFPLKLANG-SIMNPHLTFG 181
Query: 206 CG---TNNGGLFNSKTTGIVGLGGGDISLISQMR-------------------------- 236
CG N G T GI+GLG G + + +Q++
Sbjct: 182 CGYDQQNPGPHPPPPTAGILGLGRGKVGISTQLKSLGITKNVIVHCLSHTGKGFLSIGDE 241
Query: 237 ---------TTIAGN-----------------QRLGVSTPDIVIDSGTTLTFLPQGYNSN 270
T++A N + GV ++V DSG++ T+
Sbjct: 242 LVPSSGVTWTSLATNSASKNYMTGPAELLFNDKTTGVKGINVVFDSGSSYTYFNAEAYQA 301
Query: 271 LLSVMSSMIEAQPVADPTG--SLELCYS----FNSLSQVPE----VTIHF----RGADVK 316
+L ++ + +P+ D SL +C+ SL +V + +T+ F G +
Sbjct: 302 ILDLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDEVKKYFKTITLRFGYQKNGQLFQ 361
Query: 317 LSRSNFFVKVSEDIVC-SVFKGIT---NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
+ ++ + + VC + G +S I G+I +V YD E+Q + + +DC
Sbjct: 362 VPPESYLIITEKGNVCLGILNGTEVGLDSYNIVGDISFQGIMVIYDNEKQRIGWISSDCD 421
Query: 373 K 373
K
Sbjct: 422 K 422
>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
Length = 469
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 56/158 (35%), Positives = 79/158 (50%), Gaps = 13/158 (8%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y ++ +G+PPTE DTGSD++W + C CP S D FD S T S+
Sbjct: 100 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVT 159
Query: 148 CSSSQCASLNQKS---CSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL---P 200
CS C+S+ Q + CS N C YS YGDGS ++G T+T + G+++
Sbjct: 160 CSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSA 219
Query: 201 GITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM 235
I FGC T G + GI G G G +S++SQ+
Sbjct: 220 PIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQL 257
>gi|326514838|dbj|BAJ99780.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 430
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 88/355 (24%), Positives = 150/355 (42%), Gaps = 81/355 (22%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
+Y + ++IG P DTGSDL W QC+ PC C P + P + K +PC
Sbjct: 72 HYYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPC--QSCNKVPHPWYKPTKN---KIVPC 126
Query: 149 SSSQCASL--NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
++S C SL N+K C Y + Y D + S G L + TL S + +TFGC
Sbjct: 127 AASLCTSLTPNKKCAVPQQCDYQIKYTDKASSLGVLIADNFTL-SLRNSSTVRANLTFGC 185
Query: 207 GTN----NGGLFNSKTTGIVGLGGGDISLISQMR-------------------------- 236
G + G + T G++GLG G +SL+SQ++
Sbjct: 186 GYDQQVGKNGAVQAATDGLLGLGKGAVSLLSQLKQQGVTKNVLGHCFSTNGGGFLFFGDD 245
Query: 237 -------------TTIAGN-------------QRLGVSTPDIVIDSGTTLT-FLPQGYNS 269
T +GN + LG+ ++V DSG+T F + Y +
Sbjct: 246 IVPTSRVTWVPMARTTSGNYYSPGSGTLYFDRRSLGMKPMEVVFDSGSTYAYFAAEPYQA 305
Query: 270 NLLSVMSSMIEA-QPVADPTGSLELCYS----FNSLSQVPE----VTIHF-RGADVKLSR 319
+ ++ + + ++ + V+D SL LC+ F S+S+V + + F + + +++
Sbjct: 306 TVSALKAGLSKSLKEVSDV--SLPLCWKGQKVFKSVSEVKNDFKSLFLSFGKNSVMEIPP 363
Query: 320 SNFFVKVSEDIVC-SVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
N+ + VC + G T + I G+I + ++ YD E+ + + C
Sbjct: 364 ENYLIVTKYGNVCLGILDGTTAKLKFNIIGDITMQDQMIIYDNEKGQLGWIRGSC 418
>gi|26451756|dbj|BAC42973.1| unknown protein [Arabidopsis thaliana]
Length = 442
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 52/159 (32%), Positives = 80/159 (50%), Gaps = 21/159 (13%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+N + +++G PP V DTGS+L W C+ P +F+P SSTY +
Sbjct: 61 HNVTLTVTLAVGDPPQNISMVLDTGSELSWLHCKKSP------NLGSVFNPVSSSTYSPV 114
Query: 147 PCSSSQCASLNQK-----SCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
PCSS C + + SC C ++SY D + GNLA ET +GS T
Sbjct: 115 PCSSPICRTRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVTR----- 169
Query: 200 PGITFGC---GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
PG FGC G ++ ++K+TG++G+ G +S ++Q+
Sbjct: 170 PGTLFGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQL 208
>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 683
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 84/173 (48%), Gaps = 10/173 (5%)
Query: 64 RLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP 123
R H +++ +++ D + N Y R+ IGTPP + DTGS + + C C
Sbjct: 54 RQLHGSESKRHPNARMRLHDDLLLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTC- 112
Query: 124 PSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLA 183
QC P F P +SSTY+ + C+ C N + + C Y Y + S S+G L
Sbjct: 113 -EQCGRHQDPKFQPDLSSTYQPVKCTLD-CNCDNDR----MQCVYERQYAEMSTSSGVLG 166
Query: 184 TETVTLGSTTGQAVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
+ V+ G+ + +A FGC G L++ GI+GLG GD+S++ Q+
Sbjct: 167 EDVVSFGNQS--ELAPQRAVFGCENVETGDLYSQHADGIMGLGRGDLSIMDQL 217
>gi|18405138|ref|NP_565911.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
gi|13877759|gb|AAK43957.1|AF370142_1 unknown protein [Arabidopsis thaliana]
gi|15293231|gb|AAK93726.1| unknown protein [Arabidopsis thaliana]
gi|20196976|gb|AAB87120.2| expressed protein [Arabidopsis thaliana]
gi|20197046|gb|AAM14894.1| expressed protein [Arabidopsis thaliana]
gi|330254616|gb|AEC09710.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 52/159 (32%), Positives = 80/159 (50%), Gaps = 21/159 (13%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+N + +++G PP V DTGS+L W C+ P +F+P SSTY +
Sbjct: 61 HNVTLTVTLAVGDPPQNISMVLDTGSELSWLHCKKSP------NLGSVFNPVSSSTYSPV 114
Query: 147 PCSSSQCASLNQK-----SCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
PCSS C + + SC C ++SY D + GNLA ET +GS T
Sbjct: 115 PCSSPICRTRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVT-----R 169
Query: 200 PGITFGC---GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
PG FGC G ++ ++K+TG++G+ G +S ++Q+
Sbjct: 170 PGTLFGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQL 208
>gi|190896608|gb|ACE96817.1| aspartyl protease [Populus tremula]
Length = 339
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 62/192 (32%), Positives = 90/192 (46%), Gaps = 24/192 (12%)
Query: 59 TRSLNRLNHFNQNSSISSSKASQADIIPNN-----ANYLIRISIGTPPTERLAVADTGSD 113
++ RL + S+++ K + I P ANY++R+ +GTP + V DT +D
Sbjct: 11 SKDPERLKYL---STLADQKTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSND 67
Query: 114 LIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSV 170
W C S C S F P S+T SL CS +QC+ + SC C ++
Sbjct: 68 AAWVPC-----SGCTGCSSTTFLPNASTTLGSLDCSEAQCSQVRGFSCPATGSSACLFNQ 122
Query: 171 SYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDI 229
SYG S L + +TL + +PG TFGC +GG + G++GLG G I
Sbjct: 123 SYGGDSSLAATLVQDAITLANDV-----IPGFTFGCINAVSGGSIPPQ--GLLGLGRGPI 175
Query: 230 SLISQMRTTIAG 241
SLISQ +G
Sbjct: 176 SLISQAGAMYSG 187
>gi|414887402|tpg|DAA63416.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
Length = 407
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 59/204 (28%), Positives = 90/204 (44%), Gaps = 20/204 (9%)
Query: 36 RDSPKSPFYNSSETPY---QRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYL 92
R P+ P + Y RL +L R L H N ++ D + N Y
Sbjct: 38 RPVPRPPLFLPLTRSYPNASRLAASLRRGLGDGAHPN-------ARMRLHDDLLTNGYYT 90
Query: 93 IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
R+ IGTPP E + D+GS + + C C QC P F P +SS+Y + C+
Sbjct: 91 TRLYIGTPPQEFALIVDSGSTVTYVPCASC--EQCGNHQDPRFQPDLSSSYSPVKCNVDC 148
Query: 153 CASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNG 211
++K C+ Y Y + S S+G L + V+ G + + FGC + G
Sbjct: 149 TCDSDKKQCT-----YERQYAEMSSSSGVLGEDIVSFGRES--ELKAQRAVFGCENSETG 201
Query: 212 GLFNSKTTGIVGLGGGDISLISQM 235
LF+ GI+GLG G +S++ Q+
Sbjct: 202 DLFSQHADGIMGLGRGQLSIMDQL 225
>gi|190896584|gb|ACE96805.1| aspartyl protease [Populus tremula]
gi|190896586|gb|ACE96806.1| aspartyl protease [Populus tremula]
gi|190896588|gb|ACE96807.1| aspartyl protease [Populus tremula]
gi|190896590|gb|ACE96808.1| aspartyl protease [Populus tremula]
gi|190896592|gb|ACE96809.1| aspartyl protease [Populus tremula]
gi|190896594|gb|ACE96810.1| aspartyl protease [Populus tremula]
gi|190896596|gb|ACE96811.1| aspartyl protease [Populus tremula]
gi|190896598|gb|ACE96812.1| aspartyl protease [Populus tremula]
gi|190896600|gb|ACE96813.1| aspartyl protease [Populus tremula]
gi|190896602|gb|ACE96814.1| aspartyl protease [Populus tremula]
gi|190896604|gb|ACE96815.1| aspartyl protease [Populus tremula]
gi|190896606|gb|ACE96816.1| aspartyl protease [Populus tremula]
gi|190896610|gb|ACE96818.1| aspartyl protease [Populus tremula]
gi|190896612|gb|ACE96819.1| aspartyl protease [Populus tremula]
gi|190896614|gb|ACE96820.1| aspartyl protease [Populus tremula]
gi|190896616|gb|ACE96821.1| aspartyl protease [Populus tremula]
gi|190896618|gb|ACE96822.1| aspartyl protease [Populus tremula]
gi|190896620|gb|ACE96823.1| aspartyl protease [Populus tremula]
gi|190896622|gb|ACE96824.1| aspartyl protease [Populus tremula]
gi|190896624|gb|ACE96825.1| aspartyl protease [Populus tremula]
gi|190896626|gb|ACE96826.1| aspartyl protease [Populus tremula]
gi|190896628|gb|ACE96827.1| aspartyl protease [Populus tremula]
gi|190896630|gb|ACE96828.1| aspartyl protease [Populus tremula]
gi|190896632|gb|ACE96829.1| aspartyl protease [Populus tremula]
gi|190896634|gb|ACE96830.1| aspartyl protease [Populus tremula]
gi|190896636|gb|ACE96831.1| aspartyl protease [Populus tremula]
gi|190896638|gb|ACE96832.1| aspartyl protease [Populus tremula]
gi|190896640|gb|ACE96833.1| aspartyl protease [Populus tremula]
gi|190896642|gb|ACE96834.1| aspartyl protease [Populus tremula]
gi|190896644|gb|ACE96835.1| aspartyl protease [Populus tremula]
gi|190896646|gb|ACE96836.1| aspartyl protease [Populus tremula]
gi|190896648|gb|ACE96837.1| aspartyl protease [Populus tremula]
gi|190896650|gb|ACE96838.1| aspartyl protease [Populus tremula]
gi|190896652|gb|ACE96839.1| aspartyl protease [Populus tremula]
gi|190896654|gb|ACE96840.1| aspartyl protease [Populus tremula]
gi|190896656|gb|ACE96841.1| aspartyl protease [Populus tremula]
gi|190896658|gb|ACE96842.1| aspartyl protease [Populus tremula]
Length = 339
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 62/192 (32%), Positives = 90/192 (46%), Gaps = 24/192 (12%)
Query: 59 TRSLNRLNHFNQNSSISSSKASQADIIPNN-----ANYLIRISIGTPPTERLAVADTGSD 113
++ RL + S+++ K + I P ANY++R+ +GTP + V DT +D
Sbjct: 11 SKDPERLKYL---STLADQKTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSND 67
Query: 114 LIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSV 170
W C S C S F P S+T SL CS +QC+ + SC C ++
Sbjct: 68 AAWVPC-----SGCTGCSSTTFLPNASTTLGSLDCSEAQCSQVRGFSCPATGSSACLFNQ 122
Query: 171 SYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDI 229
SYG S L + +TL + +PG TFGC +GG + G++GLG G I
Sbjct: 123 SYGGDSSLAATLVQDAITLANDV-----IPGFTFGCINAVSGGSIPPQ--GLLGLGRGPI 175
Query: 230 SLISQMRTTIAG 241
SLISQ +G
Sbjct: 176 SLISQAGAMYSG 187
>gi|56692305|dbj|BAD80835.1| nucellin-like protein [Daucus carota]
Length = 426
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 86/358 (24%), Positives = 139/358 (38%), Gaps = 82/358 (22%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y ++ +IG PP DTGSDL W QC+ P QC PL+ P T + C
Sbjct: 67 YHVQFNIGQPPKPYFLDPDTGSDLTWLQCD-APCIQCTPAPHPLYQP----TNDLVVCKD 121
Query: 151 SQCASL---NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
CASL N + C Y V Y DG S G L + + T+G A P +T GCG
Sbjct: 122 PICASLHPDNYRCDDPDQCDYEVEYADGGSSIGVLVNDLFPVNLTSGMR-ARPRLTIGCG 180
Query: 208 TNN-GGLFNSKTTGIVGLGGGDISLISQMRTT---------------------------- 238
+ G+ G++GLG G S+++Q+ +
Sbjct: 181 YDQLPGIAYHPLDGVLGLGRGSSSIVAQLSSQGLVRNVVGHCFSRRGGGYLFFGDDIYDS 240
Query: 239 -----------------------IAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVM 275
I + G+ +V DSG++ T+ LLS +
Sbjct: 241 SKVIWTPMSRDYLKHYTPGFAELILNGRSSGLKNLLVVFDSGSSYTYFNTQTYQTLLSFI 300
Query: 276 SSMIEAQPVADPT--GSLELCYSFNS-LSQVPEVTIHFR------GADVKLSRSNFFVKV 326
+ +P+ + +L +C+ + + +F+ G+ K ++S F ++
Sbjct: 301 KKDLHGKPLKEAVEDDTLPVCWRGKKPFKSIRDAKKYFKPLALSFGSGWK-TKSQFEIQQ 359
Query: 327 SEDIVC----SVFKGITNSVP-------IYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
++ SV GI N I G+I LV YD E+Q + ++P++C +
Sbjct: 360 ESYLIISSKGSVCLGILNGTEVGLQNYNIIGDISMQEKLVIYDNEKQVIGWQPSNCDR 417
>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
Length = 395
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 82/367 (22%), Positives = 138/367 (37%), Gaps = 86/367 (23%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPC---PPSQCYMQDSPLFDPKMSSTYKSLP 147
Y ++ +G P + DTGSD++W C PC P ++DP+ SST +
Sbjct: 29 YFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSLVS 88
Query: 148 CSSSQCAS---LNQKSCSGV--NCQYSVSYGDGSFSNGNLATETVTLG--STTGQAVALP 200
CS C + CS NC+Y SYGDGS S G + + S+ G A
Sbjct: 89 CSDPLCVRGRRFAEAQCSQTTNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANTTS 148
Query: 201 GITFGCGTNNGGLFNS---KTTGIVGLGGGDISLISQ----------------------- 234
+ FGC G ++ GI+G G ++S+ +Q
Sbjct: 149 QVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLEGEKRGGG 208
Query: 235 ---------------------------MRTTIAGNQRLGVSTPD--------IVIDSGTT 259
+R + RL + D +++DSGTT
Sbjct: 209 ILVIGGIAEPGMTYTPLVPDSVHYNVVLRGISVNSNRLPIDAEDFSSTNDTGVIMDSGTT 268
Query: 260 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV-PEVTIHFRGADVKLS 318
L + P G + + + A PV + LS + P VT++F G ++L
Sbjct: 269 LAYFPSGAYNVFVQAIREATSATPVRVQGMDTQCFLVSGRLSDLFPNVTLNFEGGAMELQ 328
Query: 319 RSNFFV------KVSEDIVCSVFKGITNS--------VPIYGNIMQTNFLVGYDIEQQTV 364
N+ + + D+ C ++ ++S + I G+I+ + LV YD++ +
Sbjct: 329 PDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDIVLKDKLVVYDLDNSRI 388
Query: 365 SFKPTDC 371
+ +C
Sbjct: 389 GWMSYNC 395
>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
gi|223973065|gb|ACN30720.1| unknown [Zea mays]
Length = 631
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 98/412 (23%), Positives = 162/412 (39%), Gaps = 90/412 (21%)
Query: 36 RDSPKSPFYNSSETPY---QRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYL 92
R +P P + Y RL +L R L H N ++ D + N Y
Sbjct: 37 RPAPGPPLFLPLTRSYPNASRLAASLRRGLGDGVHPN-------ARMRLHDDLLTNGYYT 89
Query: 93 IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
R+ IGTPP E + D+GS + + C C QC P F P +SS+Y + C+
Sbjct: 90 TRLYIGTPPQEFALIVDSGSTVTYVPCSSC--EQCGNHQDPRFQPDLSSSYSPVKCNVDC 147
Query: 153 CASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNG 211
++K C+ Y Y + S S+G L + V+ G + + FGC + G
Sbjct: 148 TCDSDKKQCT-----YERQYAEMSSSSGVLGEDIVSFGRES--ELKPQHAIFGCENSETG 200
Query: 212 GLFNSKTTGIVGLGGGDISLISQM--RTTIA--------------GNQRLG--VSTPDIV 253
LF+ GI+GLG G +S++ Q+ + I+ G LG ++ PD++
Sbjct: 201 DLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLGGMLAPPDMI 260
Query: 254 I----------------------------------DSGTTL------TFLP-QGYNSNLL 272
GT L +LP Q + +
Sbjct: 261 FSNSDPLRSPYYNIELKEIHVAGKALRVESRIFNSKHGTVLDSGTTYAYLPEQAFVAFKE 320
Query: 273 SVMSSMIEAQPVADPTGSL-ELCYS-----FNSLSQV-PEVTIHF-RGADVKLSRSNFFV 324
+V S + + + P S ++C++ + L +V P+V + F G + L+ N+
Sbjct: 321 AVTSKVHSLKKIRGPDPSYKDICFAGAGRNVSKLHEVFPDVDMVFGNGQKLSLTPENYLF 380
Query: 325 ---KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
KV VF+ + + G I+ N LV YD + + F T+C++
Sbjct: 381 RHSKVDGAYCLGVFQNGKDPTTLLGGIIVRNTLVTYDRHNEKIGFWKTNCSE 432
>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
Length = 428
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 97/382 (25%), Positives = 155/382 (40%), Gaps = 87/382 (22%)
Query: 65 LNHFNQNSSISSSKASQA--DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
+ +S S+KA Q D + Y+I + +GTP ++ DTGS W CE C
Sbjct: 54 FRYITNKTSRLSTKAVQVGWDRGLQTSLYVISVGLGTPAKTQIVEIDTGSSTSWVFCE-C 112
Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN-----QKSCSGVNCQYSVSYGDGSF 177
C+ + S+T + C +S C Q S + +C + VSY DGS
Sbjct: 113 --DGCHTNPRTFLQSR-STTCAKVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSYQDGSA 169
Query: 178 SNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGL------------------------ 213
S G L +T+T +PG +FGC ++ G
Sbjct: 170 SYGILYQDTLTFSDVQ----KIPGFSFGCNMDSFGANEFGNVDGLLGMGAGPMSVLKQSS 225
Query: 214 ------------------FNSKTTGIVGLGG----GDI---SLISQMRTTI--------- 239
F SKTTG LG D+ ++++ + T
Sbjct: 226 PTFDCFSYCLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAI 285
Query: 240 -AGNQRLGVS-----TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 293
+RLG+S +V DSG+ L+++P S L + ++ + A+ S
Sbjct: 286 SVDGERLGLSPSVFSRKGVVFDSGSELSYIPDRALSVLSQRIRELLLKRGAAEEE-SERN 344
Query: 294 CYSFNSLSQ--VPEVTIHFR-GADVKLSRSNFFVKVS---EDIVCSVFKGITNSVPIYGN 347
CY S+ + +P +++HF GA L FV+ S +D+ C F T SV I G+
Sbjct: 345 CYDMRSVDEGDMPAISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAF-APTESVSIIGS 403
Query: 348 IMQTNFLVGYDIEQQTVSFKPT 369
+MQT+ V YD+++Q + P+
Sbjct: 404 LMQTSKEVVYDLKRQLIGIGPS 425
>gi|115489316|ref|NP_001067145.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|77556903|gb|ABA99699.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113649652|dbj|BAF30164.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|125537189|gb|EAY83677.1| hypothetical protein OsI_38901 [Oryza sativa Indica Group]
Length = 446
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 54/156 (34%), Positives = 74/156 (47%), Gaps = 16/156 (10%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
Y+ IG PP A+ DTGSDL+WTQC C C Q P ++ SST+ +PC+
Sbjct: 89 QYVAEYLIGDPPQRAEALIDTGSDLVWTQCSTCLRKVCARQALPYYNSSASSTFAPVPCA 148
Query: 150 SSQCASLNQ--KSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
+ CA+ + C C YG G + G L TE S T + + FGC
Sbjct: 149 ARICAANDDIIHFCDLAAGCSVIAGYGAGVVA-GTLGTEAFAFQSGTAE------LAFGC 201
Query: 207 GT----NNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
T G L + +G++GLG G +SL+SQ T
Sbjct: 202 VTFTRIVQGALHGA--SGLIGLGRGRLSLVSQTGAT 235
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/133 (28%), Positives = 62/133 (46%), Gaps = 6/133 (4%)
Query: 246 GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE--LCYSFNSLSQV 303
G+ + ++IDSG+ T L L S +++ + VA P + + LC + + +V
Sbjct: 312 GLFSGGVIIDSGSPFTSLVHDAYDALASELAARLNGSLVAPPPDADDGALCVARRDVGRV 371
Query: 304 -PEVTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDI 359
P V HFRG AD+ + +++ V + C G + GN Q N V YD+
Sbjct: 372 VPAVVFHFRGGADMAVPAESYWAPVDKAAACMAIASAGPYRRQSVIGNYQQQNMRVLYDL 431
Query: 360 EQQTVSFKPTDCT 372
SF+P DC+
Sbjct: 432 ANGDFSFQPADCS 444
>gi|242082978|ref|XP_002441914.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
gi|241942607|gb|EES15752.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
Length = 429
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 57/167 (34%), Positives = 81/167 (48%), Gaps = 21/167 (12%)
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMS 140
D+ P+ Y + ++IG PP DTGSDL W QC+ PC C PL+ P +
Sbjct: 58 GDVYPHGL-YYVAMNIGNPPKPYFLDVDTGSDLTWLQCDAPC--RSCNKVPHPLYRPTKN 114
Query: 141 STYKSLPCSSSQCASLN-----QKSCSG--VNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
K +PC CASL+ + C C Y + Y D S G L ++ L
Sbjct: 115 ---KLVPCVDQLCASLHNGLNRKHKCDSPYEQCDYVIKYADQGSSTGVLVNDSFALRLAN 171
Query: 194 GQAVALPGITFGCGTN----NGGLFNSKTTGIVGLGGGDISLISQMR 236
G +V P + FGCG + +G + S T G++GLG G +SL+SQ +
Sbjct: 172 G-SVVRPSLAFGCGYDQQVSSGEM--SPTDGVLGLGTGSVSLLSQFK 215
>gi|125595845|gb|EAZ35625.1| hypothetical protein OsJ_19916 [Oryza sativa Japonica Group]
Length = 152
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 53/137 (38%), Positives = 67/137 (48%), Gaps = 9/137 (6%)
Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN--QKSCSGV 164
V DT SD+ W QC PCP CY Q L+DP SS+ C+S C L C+
Sbjct: 20 VLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYANGCTNN 79
Query: 165 N-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC--GTNNGGLFNSKTTGI 221
N CQY V Y DG+ + G ++ +T+ A A+ FGC G F S GI
Sbjct: 80 NQCQYRVRYPDGTSTAGTYISDLLTI----TPATAVRSFQFGCSKGVKGSFSFGSSAAGI 135
Query: 222 VGLGGGDISLISQMRTT 238
+ LGGG SL+SQ T
Sbjct: 136 MALGGGPESLVSQTGAT 152
>gi|326503488|dbj|BAJ86250.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 551
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 100/371 (26%), Positives = 149/371 (40%), Gaps = 90/371 (24%)
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKM 139
+ ++ P+ Y I +G PP DTGSDL W QC+ PC + C PL+ P
Sbjct: 182 KGNVFPD-GQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPC--TNCAKGPHPLYKP-- 236
Query: 140 SSTYKSLPCSSSQCASL--NQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
+ K +P S C L +Q C C Y + Y D S S G LA + + L +T G
Sbjct: 237 -AKEKIVPPRDSLCQELQGDQNYCETCKQCDYEIEYADRSSSMGVLAKDDMHLIATNGGR 295
Query: 197 VALPGITFGCGTNNGGLFNS---KTTGIVGLGGGDISLISQM---------------RTT 238
L FGC + G S KT GI+GL ISL SQ+ R T
Sbjct: 296 EKL-DFVFGCAYDQQGQLLSSPAKTDGILGLSSAAISLPSQLASKGIISNVFGHCITRET 354
Query: 239 IAGNQ---------RLGVST------PD------------------------IVIDSGTT 259
G R G++ PD ++ DSG++
Sbjct: 355 NGGGYMFLGDDYVPRWGMTWAPIRGGPDNLYHTEAQKVNYGDQELHAGNSVQVIFDSGSS 414
Query: 260 LTFLPQGYNSNLLSVMSSMIEAQP--VADPTG-SLELCYS--FNSLSQVPEVTIHFRGAD 314
T+LP+ NL+ + E P V D + +L LC+ F+ S + +HF
Sbjct: 415 YTYLPEEMYKNLIDAIK---EDSPSFVQDSSDTTLPLCWKADFSVRSFFKPLNLHFGRRW 471
Query: 315 VKLSRSNFFVKVSEDIVC-----SVFKGITN-------SVPIYGNIMQTNFLVGYDIEQQ 362
+ ++ F V +D + +V G+ N S I G++ LV YD E++
Sbjct: 472 FVVPKT--FTIVPDDYLIISDKGNVCLGLLNGTEINHGSTIIVGDVSLRGKLVVYDNERR 529
Query: 363 TVSFKPTDCTK 373
+ + ++CTK
Sbjct: 530 QIGWANSECTK 540
>gi|449434470|ref|XP_004135019.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
gi|449517144|ref|XP_004165606.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 508
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 93/345 (26%), Positives = 136/345 (39%), Gaps = 67/345 (19%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQCYMQDSPL----FDPKMSSTYK 144
Y +SIGTP L DTGSDL W CE CP + + SST
Sbjct: 104 YYANVSIGTPGLYFLVALDTGSDLFWLPCECTKCPTYLTKRDNGKFWLNHYSSNASSTSI 163
Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSY-GDGSFSNGNLATETVTLGSTTGQAVALP-GI 202
+PCSSS C NQ S + +C Y Y + S S G L + + + + Q + +
Sbjct: 164 RVPCSSSLCELANQCSSNKSSCPYQTHYLSENSSSAGYLVQDILHMATDDSQLKPVDVKV 223
Query: 203 TFGCGTNNGGLFNSKTT--GIVGLGGGDIS----LISQMRTTIAGNQRLG------VSTP 250
T GCG G F++ T G++GLG G +S L SQ TT + + G +
Sbjct: 224 TLGCGKVQTGKFSNVTAPNGLIGLGMGKVSVPSFLASQGLTTDSFSMCFGYYGYGRIDFG 283
Query: 251 DI-------------------------------------VIDSGTTLTFLPQGYNSNLLS 273
DI +IDSG + T+L + S +
Sbjct: 284 DIGPVGQRETPFNPASLSYNVTILQIIVTNRPTNVHLTAIIDSGASFTYLTDPFYSIITE 343
Query: 274 VMSSMIEAQPV-ADPTGSLELCY--SFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDI 330
M + +E + + +D E CY S ++ Q P + G K +V V D
Sbjct: 344 NMDAAMELERIKSDSDFPFEYCYRLSLATIFQQPNLNFTMEGGR-KFDVITSYVSVDTDD 402
Query: 331 VCSVFKGITNSVPIYGNIMQTNFLVGYDI----EQQTVSFKPTDC 371
++ I S I N++ NF GY + E+ T+ +K DC
Sbjct: 403 GPALCLAIVKSTDI--NVIGHNFFGGYRVVFNREKMTLGWKEVDC 445
>gi|79495937|ref|NP_567922.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660833|gb|AEE86233.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 401
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 83/173 (47%), Gaps = 18/173 (10%)
Query: 77 SKASQADIIPNNAN------YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
++A + + P + N Y + I+IG PP DTGSDL W QC+ P +C
Sbjct: 37 TRAVSSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCD-APCVRCLEA 95
Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASL----NQKSCSGVNCQYSVSYGDGSFSNGNLATET 186
PL+ P + +PC+ C +L NQ+ + C Y V Y DG S G L +
Sbjct: 96 PHPLYQP----SSDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDV 151
Query: 187 VTLGSTTGQAVALPGITFGCGTNN--GGLFNSKTTGIVGLGGGDISLISQMRT 237
++ T G + P + GCG + G + G++GLG G +S++SQ+ +
Sbjct: 152 FSMNYTQGLRLT-PRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHS 203
>gi|4490316|emb|CAB38807.1| nucellin-like protein [Arabidopsis thaliana]
gi|7270297|emb|CAB80066.1| nucellin-like protein [Arabidopsis thaliana]
Length = 420
Score = 80.9 bits (198), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 75/153 (49%), Gaps = 12/153 (7%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y + I+IG PP DTGSDL W QC+ P +C PL+ P + +PC+
Sbjct: 38 YNVTINIGQPPRPYYLDLDTGSDLTWLQCD-APCVRCLEAPHPLYQP----SSDLIPCND 92
Query: 151 SQCASL----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C +L NQ+ + C Y V Y DG S G L + ++ T G + P + GC
Sbjct: 93 PLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTQGLRLT-PRLALGC 151
Query: 207 GTNN--GGLFNSKTTGIVGLGGGDISLISQMRT 237
G + G + G++GLG G +S++SQ+ +
Sbjct: 152 GYDQIPGASSHHPLDGVLGLGRGKVSILSQLHS 184
>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 494
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 52/159 (32%), Positives = 77/159 (48%), Gaps = 14/159 (8%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y R+ +GTPP E DTGSD++W + C CP + FD SST + +P
Sbjct: 81 YFTRVKLGTPPREFNVQIDTGSDVLWVTCSSCSNCPQTSGLGIQLNYFDTTSSSTARLVP 140
Query: 148 CSSSQCASLNQKSCSGV-----NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV---AL 199
CS C S Q + + C Y+ YGDGS ++G ++T + G+++ +
Sbjct: 141 CSHPICTSQIQTTATQCPPQSNQCSYAFQYGDGSGTSGYYVSDTFYFDAVLGESLIANSS 200
Query: 200 PGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM 235
I FGC T G + GI G G G++S+ISQ+
Sbjct: 201 AAIVFGCSTYQSGDLTKTDKAVDGIFGFGQGELSVISQL 239
Score = 40.0 bits (92), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 32/127 (25%), Positives = 60/127 (47%), Gaps = 10/127 (7%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPT-GSLELCYSF-NSLSQV-PEVTIH 309
+ID+GTTL +L + +S +++ + +A PT CY NS+S+V P V+ +
Sbjct: 315 IIDTGTTLAYLVEEAYDPFVSAITAAVSQ--LATPTINKGNQCYLVSNSVSEVFPPVSFN 372
Query: 310 FRGADVKLSRSNFFVK-----VSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 364
F G L + ++ + C F+ I + I G+++ + + YD+ Q +
Sbjct: 373 FAGGATMLLKPEEYLMYLTNYAGAALWCIGFQKIQGGITILGDLVLKDKIFVYDLAHQRI 432
Query: 365 SFKPTDC 371
+ DC
Sbjct: 433 GWANYDC 439
>gi|217073884|gb|ACJ85302.1| unknown [Medicago truncatula]
Length = 259
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 59/194 (30%), Positives = 94/194 (48%), Gaps = 17/194 (8%)
Query: 23 EAQTGGFSVELIHRDSPKSPFYNSSETPYQ----RLRDALTRSLNRLNHFNQNSSISSSK 78
+ Q G ++++IH SP SPF S ++ +++ T L L+ SI
Sbjct: 23 DVQDNGSTLQVIHVFSPCSPFRPSKPLSWEESVLQMQAKDTTRLQFLDSLVARKSIVP-I 81
Query: 79 ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
AS II + Y++R IGTPP L DT +D W C C C S LF P+
Sbjct: 82 ASGRQII-QSPTYIVRAKIGTPPQTLLLAMDTSNDAAWIPCTAC--DGCA---STLFAPE 135
Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
S+T+K++ C++ +C + C +C ++++YG S + NL +T+TL +
Sbjct: 136 KSTTFKNVSCAAPECKQVPNPGCGVSSCNFNLTYGSSSIA-ANLVQDTITLATD-----P 189
Query: 199 LPGITFGCGTNNGG 212
+P TFGC + G
Sbjct: 190 VPSYTFGCVSKTTG 203
>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
Length = 632
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 88/362 (24%), Positives = 143/362 (39%), Gaps = 80/362 (22%)
Query: 83 DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
D + N Y R+ IGTPP E + D+GS + + C C QC P F P +SS+
Sbjct: 81 DDLLTNGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCASC--EQCGNHQDPRFQPDLSSS 138
Query: 143 YKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
Y + C+ ++K C+ Y Y + S S+G L + V+ G + +
Sbjct: 139 YSPVKCNVDCTCDSDKKQCT-----YERQYAEMSSSSGVLGEDIVSFGRES--ELKPQRA 191
Query: 203 TFGC-GTNNGGLFNSKTTGIVGLG------------------------------GGDISL 231
FGC + G LF+ GI+GLG GG + L
Sbjct: 192 VFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVL 251
Query: 232 -----------------------ISQMRTTIAG-----NQRLGVSTPDIVIDSGTTLTFL 263
I +AG + R+ S V+DSGTT +L
Sbjct: 252 GGVPAPSDMVFSHSDPLRSPYYNIELKEIHVAGKALRVDSRVFNSKHGTVLDSGTTYAYL 311
Query: 264 P-QGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS-----FNSLSQV-PEVTIHF-RGAD 314
P Q + + +V S + + + P + ++C++ + L +V P+V + F G
Sbjct: 312 PEQAFVAFKDAVTSKVHSLKKIRGPDPNYKDICFAGAGRNVSKLHEVFPDVDMVFGNGQK 371
Query: 315 VKLSRSNFFV---KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+ L+ N+ KV VF+ + + G I+ N LV YD + + F T+C
Sbjct: 372 LSLTPENYLFRHSKVDGAYCLGVFQNGKDPTTLLGGIIVRNTLVTYDRHNEKIGFWKTNC 431
Query: 372 TK 373
++
Sbjct: 432 SE 433
>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 457
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 63/209 (30%), Positives = 92/209 (44%), Gaps = 29/209 (13%)
Query: 92 LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
++ + IGTPP + V DTGS L W QC P++ S FDP +SST+ +LPC+
Sbjct: 98 IVDLPIGTPPQVQPMVLDTGSQLSWIQCHKKAPAKPPPTAS--FDPSLSSTFSTLPCTHP 155
Query: 152 QCA------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
C +L C YS Y DG+++ GNL E T +++ P + G
Sbjct: 156 VCKPRIPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTF----SRSLFTPPLILG 211
Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLT---F 262
C T ++ GI+G+ G +S SQ + T + P V G T T +
Sbjct: 212 CATE-----STDPRGILGMNRGRLSFASQSKIT-----KFSYCVPTRVTRPGYTPTGSFY 261
Query: 263 LPQGYNSNLLSVMSSMIEAQ----PVADP 287
L NSN + + A+ P DP
Sbjct: 262 LGHNPNSNTFRYIEMLTFARSQRMPNLDP 290
>gi|147771308|emb|CAN69536.1| hypothetical protein VITISV_043237 [Vitis vinifera]
Length = 372
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 45/122 (36%), Positives = 66/122 (54%), Gaps = 11/122 (9%)
Query: 85 IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
I N Y++R IGTP L DT SD+ W C C S LF+ S+TYK
Sbjct: 30 IVQNPTYIVRAKIGTPAQTMLMAMDTSSDVAWIPCN-----GCLGCSSTLFNSPASTTYK 84
Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
SL C ++QC + + +C G C ++++YG S + NL+ +T+TL + A+PG +F
Sbjct: 85 SLGCQAAQCKQVPKPTCGGGVCSFNLTYGGSSLA-ANLSQDTITLATD-----AVPGYSF 138
Query: 205 GC 206
GC
Sbjct: 139 GC 140
>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 81/159 (50%), Gaps = 21/159 (13%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+N + +++G+PP V DTGS+L W C+ P +F+P SSTY +
Sbjct: 57 HNVTLTVTLAVGSPPQNISMVLDTGSELSWLHCKKSP------NLGSVFNPVSSSTYSPV 110
Query: 147 PCSSSQCASLNQK-----SCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
PCSS C + + SC C ++SY D + GNLA +T +GS T
Sbjct: 111 PCSSPICRTRTRDLPIPASCDPKTHFCHVAISYADATSIEGNLAHDTFVIGSVT-----R 165
Query: 200 PGITFGC---GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
PG FGC G ++ ++K+TG++G+ G +S ++Q+
Sbjct: 166 PGTLFGCMDSGLSSDSEEDAKSTGLMGMNRGSLSFVNQL 204
>gi|356537928|ref|XP_003537458.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 445
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 54/158 (34%), Positives = 84/158 (53%), Gaps = 20/158 (12%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+N + +++GTPP V DTGS+L W C+ + +F+P +SS+Y +
Sbjct: 66 HNVTLTVSLTVGTPPQSVTMVLDTGSELSWLHCKK------QQNINSVFNPHLSSSYTPI 119
Query: 147 PCSSSQCASLNQK-----SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
PC S C + + SC N C +VSY D + GNLA++T + S +GQ P
Sbjct: 120 PCMSPICKTRTRDFLIPVSCDSNNLCHVTVSYADFTSLEGNLASDTFAI-SGSGQ----P 174
Query: 201 GITFG---CGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
GI FG G ++ +SKTTG++G+ G +S ++QM
Sbjct: 175 GIIFGSMDSGFSSNANEDSKTTGLMGMNRGSLSFVTQM 212
>gi|357520119|ref|XP_003630348.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355524370|gb|AET04824.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 435
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 88/363 (24%), Positives = 150/363 (41%), Gaps = 94/363 (25%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
Y + ++IG PP DTGS+L W QC+ PC SQC PL+ P + +PC
Sbjct: 74 YNVTLNIGQPPRPYFLDVDTGSELTWLQCDAPC--SQCSETPHPLYKP----SNDFIPCK 127
Query: 150 SSQCASL---NQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
CASL + +C N C Y + Y D + G L + L T G + + + G
Sbjct: 128 DPLCASLQPTDDYTCEDPNQCDYEIKYADQYSTLGVLLNDVYLLNFTNGVQLKV-RMALG 186
Query: 206 CGTNNGGLFNSKT----TGIVGLGGGDISLISQMRTT----------------------- 238
CG + +F+ T GI+GLG G SLISQ+ +
Sbjct: 187 CGYDQ--IFSPSTYHPLDGILGLGRGKASLISQLNSQGLVRNVMGHCLSSRGGGYIFFGN 244
Query: 239 ----------------------------IAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSN 270
+ G ++ GV + +I+ D+G++ T+
Sbjct: 245 VYDSSRMSWTPISSIDSGKHYSAGPAELVFGGRKTGVGSLNIIFDTGSSYTYFNSQAYQA 304
Query: 271 LLSVMSSMIEAQPV--ADPTGSLELCYS----FNSLSQVPE----VTIHF-RGADVKLSR 319
++S+++ + +P+ A +L +C+ F S+++V + +T+ F G VK
Sbjct: 305 MISLLNKELHRKPIKAAPDDQTLPMCWHGKRPFRSINEVKKYFKPLTLSFTNGGRVK--- 361
Query: 320 SNFFVKVSEDIVCS----VFKGITN-------SVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
F + ++ S V GI N + + G+I + ++ +D E+Q + + P
Sbjct: 362 PQFEIPPEAYLIISNMGNVCLGILNGPEVGLGELNLIGDISMLDKVMVFDNEKQLIGWGP 421
Query: 369 TDC 371
DC
Sbjct: 422 ADC 424
>gi|125532795|gb|EAY79360.1| hypothetical protein OsI_34488 [Oryza sativa Indica Group]
Length = 342
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 80/353 (22%), Positives = 133/353 (37%), Gaps = 112/353 (31%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y+ ++IGTPP A+ + +WTQC PC +C+ QD PLF+
Sbjct: 28 YMANLTIGTPPQPASAIIHLAGEFVWTQCSPC--RRCFKQDLPLFN-------------- 71
Query: 151 SQCASLNQKSCSGVNCQYSVS--YGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
+Y V +GD S G T+T +G+ T + FGC
Sbjct: 72 ----------------RYEVETMFGDTSGIGG---TDTFAIGTATAS------LAFGCAM 106
Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMRTT-------------------------IAGNQ 243
++ +G+VGLG SL+ QM T +AG +
Sbjct: 107 DSNIKQLLGASGVVGLGRTPWSLVGQMNATAFSYCLAPHGAAGKKSALLLGASAKLAGGK 166
Query: 244 RL--------------------GVSTPDIVI-----------DSGTTLTFLPQGYNSNLL 272
G+ D++I D+ ++FL +
Sbjct: 167 SAATTPLVNTSDDSSDYMIHLEGIKFGDVIIEPPPNGSVVLVDTIFGVSFLVDAAFHAIK 226
Query: 273 SVMSSMIEAQPVADPTGSLELCY-------SFNSLSQVPEVTIHFRGAD-VKLSRSNFFV 324
++ + A P+A PT +LC+ NS +P+V + F+GA + + S +
Sbjct: 227 KAVTVAVGAAPMATPTKPFDLCFPKAAAAAGANSSLPLPDVVLTFQGAAALTVPPSKYMY 286
Query: 325 KVSEDIVC-----SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
VC S +T + I G + Q N +D++++T+SF+P DC+
Sbjct: 287 DAGNGTVCLAMMSSAMLNLTTELSILGRLHQENIHFLFDLDKETLSFEPADCS 339
>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
Length = 478
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 52/159 (32%), Positives = 75/159 (47%), Gaps = 14/159 (8%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I +G+P + DTGSD++W +C CP L+DPK S T + +
Sbjct: 69 YFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFVS 128
Query: 148 CSSSQCASLNQK---SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVALP 200
C + C+S + C N C YS+SYGDGS + G + +T G A
Sbjct: 129 CEHNFCSSTYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNPHTATQNS 188
Query: 201 GITFGCGTNNGGLFNSKTT----GIVGLGGGDISLISQM 235
I FGCG G F S + GI+G G + S++SQ+
Sbjct: 189 SIIFGCGAAQSGTFASSSEEALDGIIGFGQANSSVLSQL 227
>gi|224063191|ref|XP_002301033.1| predicted protein [Populus trichocarpa]
gi|222842759|gb|EEE80306.1| predicted protein [Populus trichocarpa]
Length = 536
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 111/444 (25%), Positives = 168/444 (37%), Gaps = 99/444 (22%)
Query: 22 IEAQTG-GFSVELIHR--DSPKS--------------PFYNSSETPYQRLRDALTRSLNR 64
EA G FS +LIHR D KS P S E L + L R +
Sbjct: 20 FEASIGLTFSSKLIHRFSDEAKSISISRKGNASGDLWPKRYSFEYFQLLLGNDLKRQRMK 79
Query: 65 LNHFNQNSSISSSKASQADIIPNNANYL--IRISIGTPPTERLAVADTGSDLIWTQCE-- 120
L +N + S+ SQA N ++L I IGTP L D GSDL+W C+
Sbjct: 80 LGS-QKNQLLFPSQGSQALFFGNELDWLHYTWIDIGTPNVSFLVALDAGSDLLWVPCDCI 138
Query: 121 PCPP-SQCYM-----QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGD 174
C P S Y +D + P +SST + L C C + C Y +Y D
Sbjct: 139 QCAPLSASYYNISLDRDLSEYSPSLSSTSRHLSCDHQLCEWGSNCKNPKDPCPYIFNYDD 198
Query: 175 --GSFSNGNLATETVTL---GSTTGQAVALPGITFGCGTNNGGLF--NSKTTGIVGLGGG 227
+ S G L + + L G T + + + GCG GG F + G++GLG G
Sbjct: 199 FENTTSAGFLVEDKLHLASVGDHTARKMLQASVVLGCGRKQGGSFFDGAAPDGVMGLGPG 258
Query: 228 DISLISQM---------------------------------------------------R 236
DIS+ S +
Sbjct: 259 DISVPSLLAKAGLIQNCFSLCFDENDSGRILFGDRGHASQQSTPFLPIQGTYVAYFVGVE 318
Query: 237 TTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS 296
+ GN L S ++DSG++ T+LP + L+S + A+ ++ G + CY+
Sbjct: 319 SYCVGNSCLKRSGFKALVDSGSSFTYLPSEVYNELVSEFDKQVNAKRISFQDGLWDYCYN 378
Query: 297 FNS--LSQVPEVTIHF-RGADVKLSRSNFFVKVSED--IVCSVFKGITNSVPIYGNIMQT 351
+S L +P + + F R + + + + + + C + S YG I Q
Sbjct: 379 ASSQELHDIPAIQLKFPRNQNFVVHNPTYSIPHHQGFTMFCLSLQPTDGS---YGIIGQ- 434
Query: 352 NFLVGY----DIEQQTVSFKPTDC 371
NF++GY DIE + + + C
Sbjct: 435 NFMIGYRMVFDIENLKLGWSNSSC 458
>gi|224031303|gb|ACN34727.1| unknown [Zea mays]
gi|413923868|gb|AFW63800.1| hypothetical protein ZEAMMB73_012138 [Zea mays]
Length = 557
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 75/259 (28%), Positives = 109/259 (42%), Gaps = 34/259 (13%)
Query: 2 ATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKS---PFYNSS----------- 47
A L + + + CFY S + Q G E R+ +S P Y +
Sbjct: 83 ALVLGALAVAAYYCFY--SDVAVQFLGMEQEEEQRNETRSFLLPLYPKARQGRALREFGD 140
Query: 48 -ETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS---QADIIPNNANYLIRISIGTPPTE 103
+ +R+ D ++ NR+ ++ ++S A + ++ P+ Y I IG PP
Sbjct: 141 VKLAARRVDDGGRKARNRMEVAKAATARTNSTALLPIKGNVFPD-GQYYTSIFIGNPPRP 199
Query: 104 RLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKS 160
DTGSDL W QC+ PC + C PL+ P + K +P C L NQ
Sbjct: 200 YFLDVDTGSDLTWIQCDAPC--TNCAKGPHPLYKP---AKEKIVPPRDLLCQELQGNQNY 254
Query: 161 CSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNS--- 216
C C Y + Y D S S G LA + + + +T G L FGC + G S
Sbjct: 255 CETCKQCDYEIEYADQSSSMGVLARDDMHMIATNGGREKL-DFVFGCAYDQQGQLLSSPA 313
Query: 217 KTTGIVGLGGGDISLISQM 235
KT GI+GL IS SQ+
Sbjct: 314 KTDGILGLSSAAISFPSQL 332
>gi|12323376|gb|AAG51657.1|AC010704_1 nucellin-like protein; 27671-25467 [Arabidopsis thaliana]
Length = 427
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 90/364 (24%), Positives = 143/364 (39%), Gaps = 95/364 (26%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y + ++IG PP DTGSDL W QC+ P + C K + +LPCS
Sbjct: 67 YYVLLNIGNPPKLFDLDIDTGSDLTWVQCD-APCNGC---------TKYKPNHNTLPCSH 116
Query: 151 SQCASLN---QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
C+ L+ + C+ C Y + Y D + S G L T+ V L G + L +TFG
Sbjct: 117 ILCSGLDLPQDRPCADPEDQCDYEIGYSDHASSIGALVTDEVPLKLANGSIMNLR-LTFG 175
Query: 206 CG---TNNGGLFNSKTTGIVGLGGGDISLISQMR-------------------------- 236
CG N G T GI+GLG G + L +Q++
Sbjct: 176 CGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHCLSHTGKGFLSIGDE 235
Query: 237 ---------TTIAGN-----------------QRLGVSTPDIVIDSGTTLTFLPQGYNSN 270
T++A N + GV ++V DSG++ T+
Sbjct: 236 LVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVKGINVVFDSGSSYTYFNAEAYQA 295
Query: 271 LLSVMSSMIEAQPVADPTG--SLELCYS----FNSLSQVPE----VTIHF----RGADVK 316
+L ++ + +P+ D SL +C+ SL +V + +T+ F G +
Sbjct: 296 ILDLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDEVKKYFKTITLRFGNQKNGQLFQ 355
Query: 317 LSRSNFFVKVSEDIVCSVFKGITNSVP-------IYGNIMQTNFLVGYDIEQQTVSFKPT 369
+ ++ + + VC GI N I G+I +V YD E+Q + + +
Sbjct: 356 VPPESYLIITEKGRVCL---GILNGTEIGLEGYNIIGDISFQGIMVIYDNEKQRIGWISS 412
Query: 370 DCTK 373
DC K
Sbjct: 413 DCDK 416
>gi|449439393|ref|XP_004137470.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
gi|449486840|ref|XP_004157418.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 570
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 58/161 (36%), Positives = 75/161 (46%), Gaps = 10/161 (6%)
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKM 139
+ DI P+ Y I +G PP DTGSDL W QC+ PC S C SPL+ P+
Sbjct: 190 RGDIYPDGL-YYTYIMVGEPPRPYFLDIDTGSDLTWVQCDAPC--SSCGKGRSPLYKPRR 246
Query: 140 SS--TYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
+ ++K C Q + + C Y V Y D S S G L + TL + G
Sbjct: 247 ENVVSFKDSLCMEVQRNYDGDQCAACQQCNYEVQYADQSSSLGVLVKDEFTLRFSNGSLT 306
Query: 198 ALPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM 235
L I FGC + GL SKT GI+GL +SL SQ+
Sbjct: 307 KLNAI-FGCAYDQQGLLLNTLSKTDGILGLSRAKVSLPSQL 346
>gi|388508518|gb|AFK42325.1| unknown [Lotus japonicus]
Length = 204
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 72/125 (57%), Gaps = 3/125 (2%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS---QVPEVTI 308
++IDSGTT+T+L + L S Q + L++C+S S + +VP++
Sbjct: 79 VIIDSGTTITYLEKSVFDTLKKEFISQSNLQLDKSSSTGLDVCFSLPSETTQVEVPKLVF 138
Query: 309 HFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
HF+G D++L ++ + S+ V + G +N + I+GN+ Q N LV +D+E++T+SF P
Sbjct: 139 HFKGGDLELPAESYMIADSKLGVACLAMGASNGMSIFGNVQQQNILVNHDLEKETISFVP 198
Query: 369 TDCTK 373
T C +
Sbjct: 199 TQCDQ 203
>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 485
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 57/167 (34%), Positives = 80/167 (47%), Gaps = 15/167 (8%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I IGTP DTGSD++W C+ CP + L+DP SS+ +
Sbjct: 81 YFTQIGIGTPAKSYYVQVDTGSDILWVNCVFCDTCPRKSGLGIELTLYDPSGSSSGTGVT 140
Query: 148 CSSSQCASLNQK---SC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA---VALP 200
C C + + SC CQYS+SYGDGS + G T+ + +G + +A
Sbjct: 141 CGQDFCVATHGGVIPSCVPAAPCQYSISYGDGSSTTGFFVTDFLQYNQVSGNSQTTLANT 200
Query: 201 GITFGCGTNNGGLFNSKT---TGIVGLGGGDISLISQMRTTIAGNQR 244
ITFGCG GG S + GI+G G + S++SQ+ AG R
Sbjct: 201 SITFGCGAKIGGDLGSSSQALDGILGFGQSNSSMLSQLAA--AGKVR 245
Score = 40.0 bits (92), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 31/127 (24%), Positives = 60/127 (47%), Gaps = 8/127 (6%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRG 312
+IDSGTTL +LP + ++S + + P+ + YS + P +T HF G
Sbjct: 313 IIDSGTTLAYLPGVVYNAIMSKVFAQYGDMPLKNDQDFQCFRYSGSVDDGFPIITFHFEG 372
Query: 313 A-DVKLSRSNFFVKVSEDIVCSVFK--GIT----NSVPIYGNIMQTNFLVGYDIEQQTVS 365
+ + ++ + E + C F+ G+ + + G++ +N LV YD+E Q +
Sbjct: 373 GLPLNIHPHDYLFQNGE-LYCMGFQTGGLQTKDGKDMVLLGDLAFSNRLVLYDLENQVIG 431
Query: 366 FKPTDCT 372
+ +C+
Sbjct: 432 WTDYNCS 438
>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 55/158 (34%), Positives = 76/158 (48%), Gaps = 13/158 (8%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y R+ +G+PP E DTGSD++W C CP S FDP SST +
Sbjct: 68 YFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASLIS 127
Query: 148 CSSSQCASLNQKS---CS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV--ALP 200
CS +C+ Q S CS G C Y+ YGDGS ++G ++ + + G +V +
Sbjct: 128 CSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTNSSA 187
Query: 201 GITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM 235
I FGC + G + GI G G D+S+ISQM
Sbjct: 188 SIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQM 225
>gi|356539555|ref|XP_003538263.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 438
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 62/194 (31%), Positives = 89/194 (45%), Gaps = 17/194 (8%)
Query: 23 EAQTGGFSVELIHRDSPKSPFYNSSETPYQ----RLRDALTRSLNRLNHFNQNSSISSSK 78
+ Q G ++E+ H SP SPF S + +L+ L L SI
Sbjct: 27 DTQDHGSTLEVFHVFSPCSPFRPSKPLSWAESVLQLQAKDQARLQFLASMVAGRSIVP-I 85
Query: 79 ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
AS II + Y++R IGTPP L DT +D W C C C S LF P+
Sbjct: 86 ASGRQII-QSPTYIVRAKIGTPPQTLLLAIDTSNDAAWIPCTAC--DGC---TSTLFAPE 139
Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
S+T+K++ C S +C + SC C ++++YG S + N+ +TVTL +
Sbjct: 140 KSTTFKNVSCGSPECNKVPSPSCGTSACTFNLTYGSSSIA-ANVVQDTVTLATD-----P 193
Query: 199 LPGITFGCGTNNGG 212
+PG TFGC G
Sbjct: 194 IPGYTFGCVAKTTG 207
>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
Length = 499
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 55/158 (34%), Positives = 76/158 (48%), Gaps = 13/158 (8%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y R+ +G+PP E DTGSD++W C CP S FDP SST +
Sbjct: 83 YFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASLIS 142
Query: 148 CSSSQCASLNQKS---CS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV--ALP 200
CS +C+ Q S CS G C Y+ YGDGS ++G ++ + + G +V +
Sbjct: 143 CSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTNSSA 202
Query: 201 GITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM 235
I FGC + G + GI G G D+S+ISQM
Sbjct: 203 SIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQM 240
>gi|195627138|gb|ACG35399.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 431
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 66/220 (30%), Positives = 96/220 (43%), Gaps = 24/220 (10%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN 90
+ + H P SP S R DA R L + + ++S+ + P+
Sbjct: 24 LSVYHNVHPPSPSPLESIIALARADDA--RLLFLSSKAASSGGVTSAPVASGQTPPS--- 78
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y++R +GTP + L DT +D W+ C PC C F P SS+Y SLPC+S
Sbjct: 79 YVVRAGLGTPVQQLLLALDTSADATWSHCAPC--DTCPAGSR--FIPASSSSYASLPCAS 134
Query: 151 SQCASLNQKSCSG--------VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
C + C C +S + D SF +L ++T+ LG A+ G
Sbjct: 135 DWCPLFEGQPCPANQDASAPLPACAFSKPFADTSF-QASLGSDTLRLGKD-----AIAGY 188
Query: 203 TFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
FGC G G N G++GLG G +SL+SQ +T G
Sbjct: 189 AFGCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSTYNG 228
>gi|359488213|ref|XP_002263620.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 434
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 64/189 (33%), Positives = 87/189 (46%), Gaps = 34/189 (17%)
Query: 65 LNHFNQNSSISSSKASQADIIPNNA----NY----------LIRISIGTPPTERLAVADT 110
L+ ++NS SSS ASQ PN NY ++ + IGTPP + V DT
Sbjct: 38 LSSHSKNSLFSSSLASQFKQNPNTKTTSYNYRSSFKYSMALIVSLPIGTPPQTQQMVLDT 97
Query: 111 GSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCA------SLNQKSCSGV 164
GS L W QC+ PP FDP +SS++ LPC+ S C +L
Sbjct: 98 GSQLSWIQCK-VPPK----TPPTAFDPLLSSSFSVLPCNHSLCKPRVPDYTLPTSCDQNR 152
Query: 165 NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
C YS Y DG+++ GNL E T S+ P + GC T+ +S T GI+G+
Sbjct: 153 LCHYSYFYADGTYAEGNLVREKFTFSSSQ----TTPPLILGCATD-----SSDTQGILGM 203
Query: 225 GGGDISLIS 233
G +S S
Sbjct: 204 NLGRLSFSS 212
>gi|242086034|ref|XP_002443442.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
gi|241944135|gb|EES17280.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
Length = 443
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 61/208 (29%), Positives = 91/208 (43%), Gaps = 15/208 (7%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
G ++L H D+ + T +R+R A+ S N S+ + A +
Sbjct: 33 GIRMKLTHVDA------KGNYTAPERVRRAIALS----RQINLASTRAEGGGVSAPVHWA 82
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y+ +G PP A+ DTGS LIWTQC C C QD P F+ S ++ +P
Sbjct: 83 TRQYIAEYMVGDPPQRAEALIDTGSSLIWTQCTACLRKVCVRQDLPYFNASSSGSFAPVP 142
Query: 148 CSSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C CA C+ C + V+YG G G L T+ T + G +A ++F
Sbjct: 143 CQDKACAGNYLHFCALDGTCTFRVTYGAGGII-GFLGTDAFTF-QSGGATLAFGCVSFTR 200
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQ 234
L + +G++GLG G +SL SQ
Sbjct: 201 FAAPDVLHGA--SGLIGLGRGRLSLASQ 226
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/127 (27%), Positives = 62/127 (48%), Gaps = 8/127 (6%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADP---TGSLELCYSFNSLSQV-PEVT 307
++IDSG+ T L + L+ ++ + V P G + LC + L +V P +
Sbjct: 317 VIIDSGSPFTSLVEDAYEPLMGELARQLNGSLVPPPGEDDGGMALCVARGDLDRVVPTLV 376
Query: 308 IHFRG-ADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
+HF G AD+ L N++ + + C ++ +G S I GN Q N + +D+ +S
Sbjct: 377 LHFSGGADMALPPENYWAPLEKSTACMAIVRGYLQS--IIGNFQQQNMHILFDVGGGRLS 434
Query: 366 FKPTDCT 372
F+ DC+
Sbjct: 435 FQNADCS 441
>gi|125533812|gb|EAY80360.1| hypothetical protein OsI_35532 [Oryza sativa Indica Group]
Length = 428
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 94/357 (26%), Positives = 144/357 (40%), Gaps = 91/357 (25%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y+I + +GTP ++ DTGS W CE C C+ + S+T + C +
Sbjct: 82 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-C--DGCHTNPRTFLQSR-STTCAKVSCGT 137
Query: 151 SQCASLN-----QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
S C Q S + +C + VSY DGS S G L +T+T +P TFG
Sbjct: 138 SMCLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQ----KIPSFTFG 193
Query: 206 CGTNNGGL------------------------------------------FNSKTTGIVG 223
C ++ G F SKTTG
Sbjct: 194 CNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDGFSYCLPLQKSERGFFSKTTGYFS 253
Query: 224 LGG----GDI---SLISQMRTTI----------AGNQRLGVS-----TPDIVIDSGTTLT 261
LG D+ ++++ + T +RLG+S +V DSG+ L+
Sbjct: 254 LGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGSELS 313
Query: 262 FLPQGYNSNLLSVMSSMIEA---QPVADPTGSLELCYSFNSLSQ--VPEVTIHFR-GADV 315
++P LSV+S I + A S CY S+ + +P +++HF GA
Sbjct: 314 YIPD----RALSVLSQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARF 369
Query: 316 KLSRSNFFVKVS---EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 369
L FV+ S +D+ C F T SV I G++MQT+ V YD+++Q + P+
Sbjct: 370 DLGSHGVFVERSVQEQDVWCLAF-APTESVSIIGSLMQTSKEVVYDLKRQLIGIGPS 425
>gi|125575536|gb|EAZ16820.1| hypothetical protein OsJ_32292 [Oryza sativa Japonica Group]
Length = 253
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 40/103 (38%), Positives = 59/103 (57%), Gaps = 5/103 (4%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY+ +IGTPP AV D +L+WTQC+ C +C+ Q +PLFDP S+TY++ PC
Sbjct: 50 NYVANFTIGTPPQPASAVIDLAGELVWTQCKQC--GRCFEQGTPLFDPTASNTYRAEPCG 107
Query: 150 SSQCASL--NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLG 190
+ C S+ + ++CSG C Y S G + G + VT+
Sbjct: 108 TPLCESIPSDVRNCSGNVCAYEASTNAGD-TGGKAVKKAVTVA 149
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/102 (28%), Positives = 52/102 (50%), Gaps = 7/102 (6%)
Query: 279 IEAQPVADPTGSLELCYSFNSLS-QVPEVTIHFRG-ADVKLSRSNFFVKVSEDIVC---- 332
+ A P+A P +LC+ + S P++ FRG A + + +N+ + VC
Sbjct: 150 VGAPPMATPVEPFDLCFPKSGASGAAPDLVFTFRGGAAMTVPATNYLLDYKNGTVCLAML 209
Query: 333 -SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
S T + + G++ Q N +D++++T+SF+P DCTK
Sbjct: 210 SSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPADCTK 251
>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
Length = 373
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 52/149 (34%), Positives = 78/149 (52%), Gaps = 8/149 (5%)
Query: 93 IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
++ IGTPP E L + DT S+L W Q C + C P F+P +SS++ S PC+SS
Sbjct: 1 MQTKIGTPPREVLLLVDTASELTWVQGTSC--TNCSPTKVPPFNPGLSSSFISEPCTSSV 58
Query: 153 CASLN----QKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C + Q +C S +C + V+Y DGS + G +A E +L S G A L + FGC
Sbjct: 59 CLGRSKLGFQSACNRSTGSCSFQVAYLDGSEAYGVIAREIFSLQSWDGAASTLGDVIFGC 118
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
+ + ++G +GL G S +Q+
Sbjct: 119 ASKDLQRPVDFSSGTLGLNRGSFSFPAQI 147
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 33/132 (25%), Positives = 63/132 (47%), Gaps = 14/132 (10%)
Query: 254 IDSGTTLTFLPQGYNSNLLSVMSS-MIEAQPVADPTGSLELCYSFNS----LSQVPEVTI 308
DSGTT++FL + ++ L+ ++ + + ELCY + L P VT+
Sbjct: 239 FDSGTTVSFLVEPAHTALVEAFGRRVLHLNRTSGSDFTKELCYDVAAGDARLPTAPLVTL 298
Query: 309 HFRG-ADVKLSRSNFFVKVSED----IVCSVFKGI----TNSVPIYGNIMQTNFLVGYDI 359
HF+ D++L ++ +V ++ +C F V + GN Q ++L+ +D+
Sbjct: 299 HFKNNVDMELREASVWVPLARTPQVVTICLAFVNAGAVAQGGVNVIGNYQQQDYLIEHDL 358
Query: 360 EQQTVSFKPTDC 371
E+ + F P +C
Sbjct: 359 ERSRIGFAPANC 370
>gi|359476199|ref|XP_003631804.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 421
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 64/203 (31%), Positives = 89/203 (43%), Gaps = 36/203 (17%)
Query: 32 ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANY 91
E+ RD + F NS Y S N NH + N ++ + N+
Sbjct: 88 EIFGRDESRVSFINSKCNQYT--------SGNLKNHAHNN-----------NLFDEDGNF 128
Query: 92 LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
L+ ++ GTPP + + DTGS + WTQC+ C C F+ SSTY S C
Sbjct: 129 LVDVAFGTPPQNFMLILDTGSSITWTQCKAC--VNCLQDSHRYFNWSASSTYSSGSCIPG 186
Query: 152 QCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNG 211
V Y+++YGD S S GN +T+TL + FGCG NN
Sbjct: 187 T-----------VENNYNMTYGDDSTSVGNYGCDTMTLEPSD----VFQKFQFGCGRNNK 231
Query: 212 GLFNSKTTGIVGLGGGDISLISQ 234
G F S G++GLG G +S +SQ
Sbjct: 232 GDFGSGVDGMLGLGQGQLSTVSQ 254
>gi|363808270|ref|NP_001242239.1| uncharacterized protein LOC100801883 [Glycine max]
gi|255641727|gb|ACU21134.1| unknown [Glycine max]
Length = 475
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 49/159 (30%), Positives = 73/159 (45%), Gaps = 14/159 (8%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y ++ +G+PP + DTGSD++W +C CP D L+DPK S T +
Sbjct: 70 YFTKLGLGSPPRDYYVQVDTGSDILWVNCVECSRCPRKSDLGIDLTLYDPKGSETSDVVS 129
Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP--- 200
C C++ G + C YS++YGDGS + G + +T G P
Sbjct: 130 CDQDFCSATFDGPIPGCKSEIPCPYSITYGDGSATTGYYVQDYLTYNRINGNLRTSPQNS 189
Query: 201 GITFGCGTNNGGLFNSKTT----GIVGLGGGDISLISQM 235
I FGCG G S + GI+G G + S++SQ+
Sbjct: 190 SIIFGCGAVQSGTLGSSSEEALDGIIGFGQANSSVLSQL 228
>gi|388505490|gb|AFK40811.1| unknown [Medicago truncatula]
Length = 193
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 44/127 (34%), Positives = 77/127 (60%), Gaps = 7/127 (5%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLS---QVPEV 306
++IDSGTT+T++ + +L +S + PV D +GS L++C+S S ++P++
Sbjct: 68 VIIDSGTTITYIEENAFDSLKKEFTSQTKL-PV-DKSGSTGLDVCFSLPSGKTEVEIPKL 125
Query: 307 TIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
HF+G D++L N+ + S V + G +N + I+GNI Q N LV +D++++T++F
Sbjct: 126 VFHFKGGDLELPGENYMIADSSLGVACLAMGASNGMSIFGNIQQQNILVNHDLQKETITF 185
Query: 367 KPTDCTK 373
PT C K
Sbjct: 186 IPTQCNK 192
>gi|255549236|ref|XP_002515672.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223545215|gb|EEF46724.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 492
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 89/370 (24%), Positives = 137/370 (37%), Gaps = 98/370 (26%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y ++ IGTP + DTGSD++W QC CP + + L++ K S + K +P
Sbjct: 86 YYAKVGIGTPSKDYYVQVDTGSDIMWVNCIQCRECPRTSSLGMELTLYNIKDSVSGKLVP 145
Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTG--QAVALPG 201
C C +N SG ++C Y YGDGS + G + V +G Q + G
Sbjct: 146 CDEEFCYEVNGGPLSGCTANMSCPYLEIYGDGSSTAGYFVKDVVQYDRVSGDLQTTSSNG 205
Query: 202 -ITFGCGTNNGGLF----NSKTTGIVGLG----------------------------GGD 228
+ FGCG G GI+G G GG
Sbjct: 206 SVIFGCGARQSGDLGPTSEEALDGILGFGKSNSSMISQLAATRKVKKIFAHCLDGINGGG 265
Query: 229 ISLISQ---------------------MRTTIAGNQRLGVSTPDI--------VIDSGTT 259
I I M G L + T + +IDSGTT
Sbjct: 266 IFAIGHVVQPKVNMTPLIPNQPHYNVNMTAVQVGEDFLHLPTEEFEAGDRKGAIIDSGTT 325
Query: 260 LTFLPQGYNSNLLSVMSSM---IEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVK 316
L +LP+ L+S + S ++ V D + YS + P VT HF
Sbjct: 326 LAYLPEIVYEPLVSKIISQQPDLKVHIVRDEYTCFQ--YSGSVDDGFPNVTFHF------ 377
Query: 317 LSRSNFFVKVSEDIVCSVFKGI--------------TNSVPIYGNIMQTNFLVGYDIEQQ 362
++ F+KV F+G+ ++ + G+++ +N LV YD+E Q
Sbjct: 378 --ENSVFLKVHPHEYLFPFEGLWCIGWQNSGMQSRDRRNMTLLGDLVLSNKLVLYDLENQ 435
Query: 363 TVSFKPTDCT 372
+ + +C+
Sbjct: 436 AIGWTEYNCS 445
>gi|255581508|ref|XP_002531560.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223528821|gb|EEF30826.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 407
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 53/158 (33%), Positives = 82/158 (51%), Gaps = 17/158 (10%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+N + + +++GTPP V DTGS+L W C + F+ S +Y+ +
Sbjct: 27 HNISLTVSLTVGTPPQNVSMVIDTGSELSWLYCN---KTTTTTSYPTTFNQTRSISYRPI 83
Query: 147 PCSSSQCASLNQK-----SC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
PCSSS C + + SC S C ++SY D S S GNLA++T +G A +P
Sbjct: 84 PCSSSTCTNQTRDFSIPASCDSNSLCHATLSYADASSSEGNLASDTFHMG-----ASDIP 138
Query: 201 GITFGCGT---NNGGLFNSKTTGIVGLGGGDISLISQM 235
G+ FGC ++ +SK TG++G+ G +S +SQM
Sbjct: 139 GMVFGCMDSVFSSNSDEDSKNTGLMGMNRGSLSFVSQM 176
>gi|449508697|ref|XP_004163385.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
[Cucumis sativus]
Length = 418
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 55/163 (33%), Positives = 75/163 (46%), Gaps = 15/163 (9%)
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKM 139
Q ++ PN Y + + +G PP DTGSDL W QC+ PC QC PL+ P
Sbjct: 48 QGNVYPNGF-YNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPC--QQCTETLHPLYQP-- 102
Query: 140 SSTYKSLPCSSSQCASLN---QKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
+ +PC C SL+ C + C Y V Y DG S G L + L T G
Sbjct: 103 --SNDLVPCKDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGD 160
Query: 196 AVALPGITFGCGTNN--GGLFNSKTTGIVGLGGGDISLISQMR 236
+ P + GCG + G GI+GLG G +S++SQ+
Sbjct: 161 PIR-PRLALGCGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLH 202
>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
vinifera]
Length = 560
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 76/160 (47%), Gaps = 12/160 (7%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I IGTP + DTGSD++W C+ CP D L+D K S+T ++
Sbjct: 155 YFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVG 214
Query: 148 CSSSQCASLNQ--KSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP---G 201
C + C+ + C G+ C YSV YGDGS + G + V +G P
Sbjct: 215 CDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGT 274
Query: 202 ITFGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTT 238
+ FGCG G S + GI+G G + S++SQ+ ++
Sbjct: 275 VVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASS 314
>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
vinifera]
Length = 561
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 76/160 (47%), Gaps = 12/160 (7%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I IGTP + DTGSD++W C+ CP D L+D K S+T ++
Sbjct: 155 YFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVG 214
Query: 148 CSSSQCASLNQ--KSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP---G 201
C + C+ + C G+ C YSV YGDGS + G + V +G P
Sbjct: 215 CDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGT 274
Query: 202 ITFGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTT 238
+ FGCG G S + GI+G G + S++SQ+ ++
Sbjct: 275 VVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASS 314
>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 105/414 (25%), Positives = 169/414 (40%), Gaps = 91/414 (21%)
Query: 42 PFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN-YLIRISIGTP 100
P ++ E R RDAL R Q+S+ + Q P Y ++ +GTP
Sbjct: 30 PTNHTVELSQLRARDAL-----RHRRMLQSSNGVVDFSVQGTFDPFQVGLYYTKVQLGTP 84
Query: 101 PTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN 157
P E DTGSD++W C CP + FDP SST + CS +C +
Sbjct: 85 PVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSSMIACSDQRCNNGI 144
Query: 158 QKS---CSGVN--CQYSVSYGDGSFSNGNLATETVTL-----GSTTGQAVALPGITFGCG 207
Q S CS N C Y+ YGDGS ++G ++ + L GS T + A P + FGC
Sbjct: 145 QSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTTNSTA-P-VVFGCS 202
Query: 208 TNNGGLF---NSKTTGIVGLGGGDISLISQMRTT----------IAGNQRLG-------V 247
G + GI G G ++S+ISQ+ + + G+ G +
Sbjct: 203 NQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCLKGDSSGGGILVLGEI 262
Query: 248 STPDIV--------------------------IDS---------------GTTLTFLPQG 266
P+IV IDS GTTL +L +
Sbjct: 263 VEPNIVYTSLVPAQPHYNLNLQSIAVNGQTLQIDSSVFATSNSRGTIVDSGTTLAYLAEE 322
Query: 267 YNSNLLSVMSSMIEAQPVADPTGSLELCYSF-NSLSQV-PEVTIHFR-GADVKLSRSNFF 323
+S +++ I Q V CY +S+++V P+V+++F GA + L ++
Sbjct: 323 AYDPFVSAITASIP-QSVHTVVSRGNQCYLITSSVTEVFPQVSLNFAGGASMILRPQDYL 381
Query: 324 VKVSE----DIVCSVFKGIT-NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
++ + + C F+ I + I G+++ + +V YD+ Q + + DC+
Sbjct: 382 IQQNSIGGAAVWCIGFQKIQGQGITILGDLVLKDKIVVYDLAGQRIGWANYDCS 435
>gi|356509399|ref|XP_003523437.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 421
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 89/367 (24%), Positives = 148/367 (40%), Gaps = 97/367 (26%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
Y + ++IG PP DTGSDL W QC+ PC C + + L+ P + C
Sbjct: 64 YTVSLAIGNPPKVYDLDIDTGSDLTWVQCDAPC--KGCTLPRNRLYKPHGD----LVKCV 117
Query: 150 SSQCASLN---QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
CA++ C+G N C Y V Y D S G L + + L T G ++A P + F
Sbjct: 118 DPLCAAIQSAPNHHCAGPNEQCDYEVEYADQGSSLGVLLRDNIPLKFTNG-SLARPMLAF 176
Query: 205 GCG---TNNGGLFNSKTTGIVGLGGGDISLISQMRT-----TIAGN-------------- 242
GCG T++G T G++GLG G S++SQ+ + + G+
Sbjct: 177 GCGYDQTHHGQNPPPSTAGVLGLGNGRTSILSQLHSLGLIRNVVGHCLSGRGGGFLFFGD 236
Query: 243 -------------------QRLGVSTPDIVID--------------SGTTLTFLPQGYNS 269
Q D+ D SG++ T+ +
Sbjct: 237 QLIPPSGVVWTPLLQSSSAQHYKTGPADLFFDRKTTSVKGLELIFDSGSSYTYFNSQAHK 296
Query: 270 NLLSVMSSMIEAQPVADPTG--SLELCYS----FNSLSQVPEVTIHFRGADVKLSRSN-- 321
L++++++ + +P++ TG SL +C+ F SL +VT +F+ + ++S
Sbjct: 297 ALVNLIANDLRGKPLSRATGDPSLPICWKGPKPFKSLH---DVTSNFKPLLLSFTKSKNS 353
Query: 322 --------FFVKVSEDIVC-------SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
+ + VC + G TN I G+I + LV YD E+Q + +
Sbjct: 354 PLQLPPEAYLIVTKHGNVCLGILDGTEIGLGNTN---IIGDISLQDKLVIYDNEKQQIGW 410
Query: 367 KPTDCTK 373
+C +
Sbjct: 411 ASANCDR 417
>gi|326495450|dbj|BAJ85821.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 491
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 48/132 (36%), Positives = 65/132 (49%), Gaps = 10/132 (7%)
Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN--QKSCSGVN- 165
DT D+ W QC PC QCY Q + FDP+ SST + C S C +L CS N
Sbjct: 164 DTTEDVPWIQCLPCLIPQCYPQRNAFFDPRRSSTGAPVRCGSRACRTLGGYANGCSKPNS 223
Query: 166 ---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIV 222
C Y + Y D + G T+T+T+ +T FGC G F+++ +G +
Sbjct: 224 TGDCLYRIEYSDHRLTLGTYMTDTLTISPST----TFLNFRFGCSHAVRGKFSAQASGTM 279
Query: 223 GLGGGDISLISQ 234
LGGG SL+SQ
Sbjct: 280 SLGGGPQSLLSQ 291
Score = 39.3 bits (90), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 54/124 (43%), Gaps = 10/124 (8%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHF 310
V+DS +T LP L + + A PTG+L+ C+ F +S+ VP V++ F
Sbjct: 373 VMDSSAVITQLPPTAYRALRLAFRNAMRAYKTRAPTGNLDTCFDFVGVSKVTVPTVSLVF 432
Query: 311 R-GADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFK 367
GA ++L + + C F + ++ GN+ Q V YD+ V F+
Sbjct: 433 DGGAVIELGLLSVLLD-----SCLAFAPMAADFALGFIGNVQQQTHEVLYDVAGGAVGFR 487
Query: 368 PTDC 371
C
Sbjct: 488 HGAC 491
>gi|224128838|ref|XP_002320434.1| predicted protein [Populus trichocarpa]
gi|222861207|gb|EEE98749.1| predicted protein [Populus trichocarpa]
Length = 485
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 52/162 (32%), Positives = 71/162 (43%), Gaps = 14/162 (8%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I IGTP + DTGSD++W QC CP + D L++ S T K +P
Sbjct: 78 YYAKIGIGTPTKDYYVQVDTGSDIMWVNCIQCRECPKTSSLGIDLTLYNINESDTGKLVP 137
Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVALP 200
C C +N G ++C Y YGDGS + G + V +G A
Sbjct: 138 CDQEFCYEINGGQLPGCTANMSCPYLEIYGDGSSTAGYFVKDVVQYARVSGDLKTTAANG 197
Query: 201 GITFGCGTNNGGLFNSKTT----GIVGLGGGDISLISQMRTT 238
+ FGCG G S GI+G G + S+ISQ+ T
Sbjct: 198 SVIFGCGARQSGDLGSSNEEALDGILGFGKSNSSMISQLAVT 239
>gi|312282765|dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila]
Length = 515
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 108/443 (24%), Positives = 176/443 (39%), Gaps = 85/443 (19%)
Query: 9 FILFFLCFYVVSPIEAQTGGFSVELIHRDS-------PKSPFYNSSETPYQRL---RDAL 58
IL + +V+ E G F E HR S P N + Y R+ RD L
Sbjct: 14 LILMLVSSWVLDRCEG-LGEFGFEFHHRFSDQVVGVLPGDGLPNRDSSKYYRVMAHRDRL 72
Query: 59 TRSLNRLNHFNQNSSISSSKASQADIIPNNANYL--IRISIGTPPTERLAVADTGSDLIW 116
R RL +++ S+ + I N +L +++GTP L DTGSDL W
Sbjct: 73 IRG-RRLA--SEDQSLVTFADGNETIRVNALGFLHYANVTVGTPSDWFLVALDTGSDLFW 129
Query: 117 TQCEPCPPSQCYMQ---------DSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQ 167
C+ C + C + D ++ P SST +PC+S+ C +++ + +C
Sbjct: 130 LPCD-CS-TNCVRELKAPGGSSLDLNIYSPNASSTSSKVPCNSTLCTRVDRCASPLSDCP 187
Query: 168 YSVSY-GDGSFSNGNLATETVTLGSTTGQAVALPG-ITFGCGTNNGGLFN--SKTTGIVG 223
Y + Y +G+ S G L + + L S + + IT GCG G+F+ + G+ G
Sbjct: 188 YQIRYLSNGTSSTGVLVEDVLHLVSMEKNSKPIRARITLGCGLVQTGVFHDGAAPNGLFG 247
Query: 224 LGGGDISLISQM-RTTIAGN-----------------------QR---LGVSTP------ 250
LG DIS+ S + + IA N QR L + P
Sbjct: 248 LGLEDISVPSVLAKEGIAANSFSMCFGDDGAGRISFGDKGSVDQRETPLNIRQPHPTYNV 307
Query: 251 ----------------DIVIDSGTTLTFLPQG-YNSNLLSVMSSMIEAQPVADPTGSLEL 293
D V D+GT+ T+L Y S S ++ + D E
Sbjct: 308 TVTQISVGGNTGDLEFDAVFDTGTSFTYLTDAPYTLISESFNSLALDKRYQTDSELPFEY 367
Query: 294 CYSF--NSLS-QVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIM 349
CY+ N S + P+V + + G+ + V + + +V + + + I G
Sbjct: 368 CYAVSPNKKSFEYPDVNLTMKGGSSYPVYHPLIVVPIEDTVVYCLAIMKSEDISIIGQNF 427
Query: 350 QTNFLVGYDIEQQTVSFKPTDCT 372
T + V +D E+ + +K +DC+
Sbjct: 428 MTGYRVVFDREKLILGWKESDCS 450
>gi|212722554|ref|NP_001131154.1| uncharacterized protein LOC100192462 precursor [Zea mays]
gi|194690728|gb|ACF79448.1| unknown [Zea mays]
Length = 431
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 65/213 (30%), Positives = 93/213 (43%), Gaps = 24/213 (11%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN 90
+ + H P SP S R DA R L + + I+S+ + P+
Sbjct: 24 LSVYHNVHPPSPSPLESIIALARADDA--RLLFLSSKAASSGGITSAPVASGQTPPS--- 78
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y++R +GTP + L DT +D W+ C PC C F P SS+Y SLPC+S
Sbjct: 79 YVVRAGLGTPVQQLLLALDTSADATWSHCAPC--DTCPAGSR--FIPASSSSYASLPCAS 134
Query: 151 SQCASLNQKSCSG--------VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
C + C C +S + D SF +L ++T+ LG A+ G
Sbjct: 135 DWCPLFEGQPCPANQDASAPLPACAFSKPFADTSF-QASLGSDTLRLGKD-----AIAGY 188
Query: 203 TFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQ 234
FGC G G N G++GLG G +SL+SQ
Sbjct: 189 AFGCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQ 221
>gi|357160697|ref|XP_003578847.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 421
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 91/375 (24%), Positives = 151/375 (40%), Gaps = 95/375 (25%)
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMS 140
D+ P+ Y + +SIG PP DTGSDL W QC+ PC C PL+ P +
Sbjct: 50 GDVYPHGL-YYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPC--VSCNKVPHPLYRPTKN 106
Query: 141 STYKSLPCSSSQCASLN-----QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTT 193
K +PC C+SL+ + C C Y + Y D S G L T++ +
Sbjct: 107 ---KIVPCVDQLCSSLHGGLSGKHKCDSPKQQCDYEIKYADQGSSLGVLLTDSFAV-RLA 162
Query: 194 GQAVALPGITFGCGTNN---GGLFNSKTTGIVGLGGGDISLISQMR-----TTIAGN--- 242
++ P + FGCG + + T G++GLG G ISL+SQ++ + G+
Sbjct: 163 NSSIVRPSLAFGCGYDQQVGSSTEVAPTDGVLGLGSGSISLLSQLKQHGITKNVVGHCLS 222
Query: 243 --------------------------------------------QRLGVSTPDIVIDSGT 258
+ LGV ++V+DSG+
Sbjct: 223 IRGGGFLFFGDNLVPYSRATWVPMVRSAFKNYYSPGTASLYFGGRSLGVRPMEVVLDSGS 282
Query: 259 TLTFL-PQGYNSNLLSVMSSMIEA-QPVADPTGSLELCYS----FNSLSQVPE----VTI 308
+ T+ Q Y + + ++ S + + + V DP SL LC+ F S+ V + + +
Sbjct: 283 SFTYFGAQPYQALVTALKSDLSKTLKEVFDP--SLPLCWKGKKPFKSVLDVKKEFKSLVL 340
Query: 309 HF---RGADVKLSRSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYD 358
F + A +++ N+ + C GI N + I G+I + +V YD
Sbjct: 341 SFSNGKKALMEIPPENYLIVTKFGNAC---LGILNGSEIGLKDLNIVGDITMQDQMVIYD 397
Query: 359 IEQQTVSFKPTDCTK 373
E+ + + C +
Sbjct: 398 NERGQIGWIRAPCDR 412
>gi|383156234|gb|AFG60356.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156236|gb|AFG60358.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156239|gb|AFG60361.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
Length = 154
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 52/153 (33%), Positives = 73/153 (47%), Gaps = 12/153 (7%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSKASQADIIP 86
G ++ L H SP ++ + + L +L R +RL +S S + S +
Sbjct: 3 GVNIRLDHIHGACSPLRPANSSKWIDLVSQSLERDNDRLKTIRSRNSGSYTTMSNLPLQS 62
Query: 87 NN----ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
N NY++ GTP + L + DTGSDL W QC+PC CY Q P+F+P SS+
Sbjct: 63 GNKVGTGNYIVTAGFGTPTKKFLLIIDTGSDLTWIQCKPC--LGCYSQVDPIFEPSQSSS 120
Query: 143 YKSLPCSSSQCASL-----NQKSCSGVNCQYSV 170
YKSLPC S+ C L N C C Y +
Sbjct: 121 YKSLPCLSATCTELLTSESNLTPCFLGGCSYEI 153
>gi|225217039|gb|ACN85323.1| aspartic proteinase nepenthesin-1 precursor [Oryza brachyantha]
Length = 287
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 83/281 (29%), Positives = 113/281 (40%), Gaps = 75/281 (26%)
Query: 159 KSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKT 218
+ CSG +C Y V YGDGS++ G A +T+TL S A+ G FGCG N GLF +
Sbjct: 14 RGCSGGHCLYGVQYGDGSYTIGFFAMDTLTLSSHD----AIKGFRFGCGERNEGLFG-EA 68
Query: 219 TGIVGLGGGDISLISQ--------------MRTTIAGNQRLG----------VSTPDIVI 254
G++GLG G SL Q R++ G G +ST ++I
Sbjct: 69 AGLLGLGRGKTSLPVQTYDKYGGVFAHCFPARSSGTGYLEFGPGSSPAVSAKLSTTPMLI 128
Query: 255 D-----------------------------------SGTTLTFLPQGYNSNLLSVMSSMI 279
D SGT +T LP S+L S ++ +
Sbjct: 129 DTGPTFYYVGMTGIRVGGKLLPIPQSVFAAAGTIVDSGTVITRLPPAAYSSLRSAFAASM 188
Query: 280 EAQPV--ADPTGSLELCYSFNSLSQV--PEVTIHFRGA---DVKLSRSNFFVKVSEDIVC 332
A+ A L+ CY S+V P V++ F+G DV S + VS+ C
Sbjct: 189 AARGYKRAPALSLLDTCYDLTGASEVAIPTVSLLFQGGVSLDVDASGIIYAASVSQ--AC 246
Query: 333 SVFKG--ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
F G + V I GN F V YDI + V F P C
Sbjct: 247 LGFAGNEAADDVAIVGNTQLKTFGVVYDIASKVVGFCPGAC 287
>gi|449459186|ref|XP_004147327.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 418
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 55/163 (33%), Positives = 75/163 (46%), Gaps = 15/163 (9%)
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKM 139
Q ++ PN Y + + +G PP DTGSDL W QC+ PC QC PL+ P
Sbjct: 48 QGNVYPNGF-YNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPC--QQCTETLHPLYQP-- 102
Query: 140 SSTYKSLPCSSSQCASLN---QKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
+ +PC C SL+ C + C Y V Y DG S G L + L T G
Sbjct: 103 --SNDLVPCKDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGD 160
Query: 196 AVALPGITFGCGTNN--GGLFNSKTTGIVGLGGGDISLISQMR 236
+ P + GCG + G GI+GLG G +S++SQ+
Sbjct: 161 PIR-PRLALGCGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLH 202
>gi|357492303|ref|XP_003616440.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517775|gb|AES99398.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 521
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 88/358 (24%), Positives = 143/358 (39%), Gaps = 95/358 (26%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+N + +++G+PP V DTGS+L W C+ P + +F+P +SS+Y
Sbjct: 32 HNVTLTVSLTVGSPPQRVTMVLDTGSELSWLHCKKLP------NLNFIFNPLVSSSYTPT 85
Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
PC+S C + + + V+C + +F G A G+ FGC
Sbjct: 86 PCTSPICTTQTRDLINPVSCDANKLCHIITFFVGGPAQR---------------GMVFGC 130
Query: 207 ---GTNNGGLFNSKTTGIVGLGGGDISLISQMR-------------------TTIAGNQR 244
GT++G +SKTTG++G+ G +S +QMR IA R
Sbjct: 131 MDTGTSSGDE-DSKTTGLMGMDLGSLSFSNQMRLPKFSYCISNKDSTGVLVLENIANPPR 189
Query: 245 LG---------VSTP------------------------DIVIDSGTTLTFLPQGYNSNL 271
LG +TP ++DS T TFL Q + L
Sbjct: 190 LGPLHYTPLVKKTTPLPYFNRNCCLFQKSAFLPDHTGAGQTMVDSATQFTFLRQPVYTAL 249
Query: 272 LSVMSSMIE--AQPVADPT----GSLELCYSF---NSLSQVPEVTIHFRGADVKLSRSNF 322
+ + + P+ DP G ++LC+ ++L +P VT+ F GA+++++
Sbjct: 250 KNEFAIQTKNILTPLGDPKFVFQGVMDLCFRVPIGSTLPVLPVVTLMFDGAELRVTGERL 309
Query: 323 FVKVSE------DIVCSVFKG---ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
KVS I C F + I G+ Q N + YD+ + F T+C
Sbjct: 310 LYKVSNVAKSNSWIYCFTFGNSDLLGIEAFIIGHHHQRNVWMEYDLANSRIGFSDTNC 367
>gi|357159298|ref|XP_003578403.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 442
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 58/166 (34%), Positives = 83/166 (50%), Gaps = 19/166 (11%)
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMS 140
A + Y+ IG+PP A+ DTGSDLIWTQC C P C Q P ++ S
Sbjct: 77 AQVHRATRQYIASYLIGSPPQRTEALIDTGSDLIWTQCATTCLPKSCAKQGLPYYNLSQS 136
Query: 141 STYKSLPCSSSQ--CASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
ST+ +PC+ CA+ C G++ C + SYG G G+L TE+ S T
Sbjct: 137 STFVPVPCADKAGFCAANGVHLC-GLDGSCTFIASYGAGRVI-GSLGTESFAFESGT--- 191
Query: 197 VALPGITFGCGT----NNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
+ FGC + +G L + +G++GLG G +SL+SQ+ T
Sbjct: 192 ---TSLAFGCVSLTRITSGAL--NDASGLIGLGRGRLSLVSQIGAT 232
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 67/144 (46%), Gaps = 12/144 (8%)
Query: 234 QMRTTIAGNQRLGVSTPDIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGS-L 291
Q+R G GV +ID+G+ LT L Y + V + + V P S L
Sbjct: 304 QLRQLFKGYWAGGV-----IIDTGSPLTQLASHAYEALKEEVAAQLGNGSLVPAPEDSGL 358
Query: 292 ELCYSFNSLSQV-PEVTIHFRG-ADVKLSRSNFFVKVSEDIVCS-VFKGITNSVPIYGNI 348
ELC + +V P + HF G AD+ + ++++ V + C + +G +S I GN
Sbjct: 359 ELCVAREGFQKVVPALVFHFGGGADMAVPAASYWAPVDKAAACMMILEGGYDS--IIGNF 416
Query: 349 MQTNFLVGYDIEQQTVSFKPTDCT 372
Q + + YD+ + SF+ DCT
Sbjct: 417 QQQDMHLLYDLRRGRFSFQTADCT 440
>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
Length = 492
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 52/163 (31%), Positives = 80/163 (49%), Gaps = 10/163 (6%)
Query: 74 ISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP 133
+ S++ D + Y R+ IGTPP E + DTGS + + C C + C P
Sbjct: 18 LGSARMDLHDDLLTKGYYTSRVKIGTPPHEFSLIVDTGSTVTYVPCSSC--THCGNHQDP 75
Query: 134 LFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
F P +SS+YK L C S+C++ C G +Y Y + S S+G L + + G +
Sbjct: 76 RFSPALSSSYKPLEC-GSECST---GFCDGSR-KYQRQYAEKSTSSGVLGKDVI--GFSN 128
Query: 194 GQAVALPGITFGCGT-NNGGLFNSKTTGIVGLGGGDISLISQM 235
+ + FGC T G L++ GI+GLG G +S+I Q+
Sbjct: 129 SSDLGGQRLVFGCETAETGDLYDQTADGIIGLGRGPLSIIDQL 171
>gi|30680102|ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|17978947|gb|AAL47439.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|22655368|gb|AAM98276.1| At2g17760/At2g17760 [Arabidopsis thaliana]
gi|330251585|gb|AEC06679.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 513
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 96/399 (24%), Positives = 154/399 (38%), Gaps = 77/399 (19%)
Query: 45 NSSETPYQRL---RDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPP 101
N + Y R+ RD L R RL + +Q+ S + + +++GTP
Sbjct: 56 NRDSSKYYRVMAHRDRLIRG-RRLANEDQSLVTFSDGNETVRVDALGFLHYANVTVGTPS 114
Query: 102 TERLAVADTGSDLIWTQCEPCPPSQCYMQ---------DSPLFDPKMSSTYKSLPCSSSQ 152
+ DTGSDL W PC + C + D ++ P SST +PC+S+
Sbjct: 115 DWFMVALDTGSDLFWL---PCDCTNCVRELKAPGGSSLDLNIYSPNASSTSTKVPCNSTL 171
Query: 153 CASLNQKSCSGVNCQYSVSY-GDGSFSNGNLATETVTLGSTTGQAVALPG-ITFGCGTNN 210
C ++ + +C Y + Y +G+ S G L + + L S + A+P +TFGCG
Sbjct: 172 CTRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAIPARVTFGCGQVQ 231
Query: 211 GGLFN--SKTTGIVGLGGGDISLISQM-RTTIAGN-----------------------QR 244
G+F+ + G+ GLG DIS+ S + + IA N QR
Sbjct: 232 TGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDGAGRISFGDKGSVDQR 291
Query: 245 ---LGVSTP----------------------DIVIDSGTTLTFLPQGYNSNLLSVMSSMI 279
L + P D V DSGT+ T+L + + +S+
Sbjct: 292 ETPLNIRQPHPTYNITVTKISVGGNTGDLEFDAVFDSGTSFTYLTDAAYTLISESFNSLA 351
Query: 280 --EAQPVADPTGSLELCYSFNSLS---QVPEVTIHFRGADVKLSRSNFFV--KVSEDIVC 332
+ D E CY+ + Q P V + +G V D+ C
Sbjct: 352 LDKRYQTTDSELPFEYCYALSPNKDSFQYPAVNLTMKGGSSYPVYHPLVVIPMKDTDVYC 411
Query: 333 SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
I + + I G T + V +D E+ + +K +DC
Sbjct: 412 LAIMKIED-ISIIGQNFMTGYRVVFDREKLILGWKESDC 449
>gi|242062640|ref|XP_002452609.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
gi|241932440|gb|EES05585.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
Length = 557
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 61/193 (31%), Positives = 88/193 (45%), Gaps = 15/193 (7%)
Query: 52 QRLRDALTRSLNRLNHFNQNSSISSSKAS---QADIIPNNANYLIRISIGTPPTERLAVA 108
+R+ D ++ N++ ++ ++S A + ++ P+ Y I +G PP
Sbjct: 146 RRIDDGWRKARNKMEVAKAAAAGTNSTALLPIKGNVFPD-GQYYTSIFVGNPPRPYFLDV 204
Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKSCSGVN- 165
DTGSDL W QC+ P + C PL+ P + K +P C L NQ C
Sbjct: 205 DTGSDLTWIQCD-APCTNCAKGPHPLYKP---TKEKIVPPRDLLCQELQGNQNYCETCKQ 260
Query: 166 CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNS---KTTGIV 222
C Y + Y D S S G LA + + L +T G L FGC + G S KT GI+
Sbjct: 261 CDYEIEYADQSSSMGVLARDDMHLIATNGGREKL-DFVFGCAYDQQGQLLSSPAKTDGIL 319
Query: 223 GLGGGDISLISQM 235
GL ISL SQ+
Sbjct: 320 GLSNAAISLPSQL 332
>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 493
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 105/414 (25%), Positives = 166/414 (40%), Gaps = 91/414 (21%)
Query: 42 PFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN-YLIRISIGTP 100
P + E R RD L R Q+SS + Q P Y ++ +GTP
Sbjct: 33 PTNHGVELSQLRARDEL-----RHRRMLQSSSGVVDFSVQGTFDPFQVGLYYTKVQLGTP 87
Query: 101 PTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN 157
P E DTGSD++W C CP + FDP SST + CS +C +
Sbjct: 88 PVEFNVQIDTGSDVLWVSCNSCNGCPQTSGLQIQLNFFDPGSSSTSSMIACSDQRCNNGK 147
Query: 158 QKS---CSGVN--CQYSVSYGDGSFSNGNLATETVTL-----GSTTGQAVALPGITFGCG 207
Q S CS N C Y+ YGDGS ++G ++ + L GS T + A P + FGC
Sbjct: 148 QSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSMTTNSTA-P-VVFGCS 205
Query: 208 TNNGGLF---NSKTTGIVGLGGGDISLISQMRTT----------IAGNQRLG-------V 247
G + GI G G ++S+ISQ+ + + G+ G +
Sbjct: 206 NQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRIFSHCLKGDSSGGGILVLGEI 265
Query: 248 STPDIV--------------------------IDS---------------GTTLTFLPQG 266
P+IV IDS GTTL +L +
Sbjct: 266 VEPNIVYTSLVPAQPHYNLNLQSISVNGQTLQIDSSVFATSNSRGTIVDSGTTLAYLAEE 325
Query: 267 YNSNLLSVMSSMIEAQPVADPTGSLELCYSF-NSLSQV-PEVTIHFR-GADVKLSRSNFF 323
+S +++ I Q V CY +S++ V P+V+++F GA + L ++
Sbjct: 326 AYDPFVSAITAAIP-QSVRTVVSRGNQCYLITSSVTDVFPQVSLNFAGGASMILRPQDYL 384
Query: 324 VKVSE----DIVCSVFKGIT-NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
++ + + C F+ I + I G+++ + +V YD+ Q + + DC+
Sbjct: 385 IQQNSIGGAAVWCIGFQKIQGQGITILGDLVLKDKIVVYDLAGQRIGWANYDCS 438
>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 475
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 48/159 (30%), Positives = 74/159 (46%), Gaps = 14/159 (8%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y ++ +G+PP + DTGSD++W +C CP D L+DPK S T + +
Sbjct: 70 YFTKLGLGSPPKDYYVQVDTGSDILWVNCVKCSRCPRKSDLGIDLTLYDPKGSETSELIS 129
Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP--- 200
C C++ G + C YS++YGDGS + G + +T P
Sbjct: 130 CDQEFCSATYDGPIPGCKSEIPCPYSITYGDGSATTGYYVQDYLTYNHVNDNLRTAPQNS 189
Query: 201 GITFGCGTNNGGLFNSKTT----GIVGLGGGDISLISQM 235
I FGCG G +S + GI+G G + S++SQ+
Sbjct: 190 SIIFGCGAVQSGTLSSSSEEALDGIIGFGQSNSSVLSQL 228
>gi|296089645|emb|CBI39464.3| unnamed protein product [Vitis vinifera]
Length = 477
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 88/362 (24%), Positives = 134/362 (37%), Gaps = 86/362 (23%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I IGTP + DTGSD++W QC CP + L+D K S T K +
Sbjct: 98 YYAKIGIGTPARDYYVQVDTGSDIMWVNCIQCNECPKKSSLGMELTLYDIKESLTGKLVS 157
Query: 148 CSSSQCASLN----QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVALP 200
C C ++N + ++C Y+ Y DGS S G + V +G A
Sbjct: 158 CDQDFCYAINGGPPSYCIANMSCSYTEIYADGSSSFGYFVRDIVQYDQVSGDLETTSANG 217
Query: 201 GITFGCGTNNGGLFNSKTT--GIVGLG----------------------------GGDIS 230
+ FGC G +S+ GI+G G GG I
Sbjct: 218 SVIFGCSATQSGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLNGGGIF 277
Query: 231 LISQ---------------------MRTTIAGNQRLGVSTP--DI------VIDSGTTLT 261
I M+ G L + T D+ +IDSGTTL
Sbjct: 278 AIGHIVQPKVNTTPLVPNQTHYNVNMKAVEVGGYFLNLPTDVFDVGDKKGTIIDSGTTLA 337
Query: 262 FLPQGYNSNLLSVMSSM---IEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLS 318
+LP+ LLS + S ++ + D + YS + P VT HF +
Sbjct: 338 YLPEVVYDQLLSKIFSWQSDLKVHTIHDQFTCFQ--YSESLDDGFPAVTFHFENSLYLKV 395
Query: 319 RSNFFVKVSEDIVCSVFKGITNS---------VPIYGNIMQTNFLVGYDIEQQTVSFKPT 369
+ ++ + + C G NS + + G++ +N LV YD+E Q + +
Sbjct: 396 HPHEYLFSYDGLWCI---GWQNSGMQSRDRRNITLLGDLALSNKLVLYDLENQVIGWTEY 452
Query: 370 DC 371
+C
Sbjct: 453 NC 454
>gi|297832400|ref|XP_002884082.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297329922|gb|EFH60341.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 513
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 99/401 (24%), Positives = 156/401 (38%), Gaps = 81/401 (20%)
Query: 45 NSSETPYQRL---RDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYL--IRISIGT 99
N + Y R+ RD L R RL N++ S+ + I + +L +++GT
Sbjct: 56 NRDSSKYYRVMAHRDRLIRG-RRLA--NEDQSLVTFSDGNETIRVDALGFLHYANVTVGT 112
Query: 100 PPTERLAVADTGSDLIWTQCEPCPPSQCYMQ---------DSPLFDPKMSSTYKSLPCSS 150
P L DTGSDL W PC + C + D ++ P SST +PC+S
Sbjct: 113 PSDWFLVALDTGSDLFWL---PCDCTNCVRELKAPGGSSLDLNIYSPNASSTSTKVPCNS 169
Query: 151 SQCASLNQKSCSGVNCQYSVSY-GDGSFSNGNLATETVTLGSTTGQAVALPG-ITFGCGT 208
+ C ++ + NC Y + Y +G+ S G L + + L S + A+P +T GCG
Sbjct: 170 TLCTRGDRCASPESNCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAIPARVTLGCGQ 229
Query: 209 NNGGLFN--SKTTGIVGLGGGDISLISQM-RTTIAGN----------------------- 242
G+F+ + G+ GLG DIS+ S + + IA N
Sbjct: 230 VQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDGAGRISFGDKGSVD 289
Query: 243 QR---LGVSTP----------------------DIVIDSGTTLTFLPQGYNSNLLSVMSS 277
QR L + P D V DSGT+ T+L + + +S
Sbjct: 290 QRETPLNIRQPHPTYNITVTKISVEGNTGDLEFDAVFDSGTSFTYLTDAAYTLISESFNS 349
Query: 278 MI--EAQPVADPTGSLELCYSFNSLS---QVPEVTIHFRGADVKLSRSNFFV--KVSEDI 330
+ + D E CY+ + Q P V + +G V D+
Sbjct: 350 LALDKRYQTTDSELPFEYCYALSPNKDSFQYPAVNLTMKGGSSYPVYHPLVVIPMKDTDV 409
Query: 331 VCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
C I + + I G T + V +D E+ + +K +DC
Sbjct: 410 YCLAILKIED-ISIIGQNFMTGYRVVFDREKLILGWKESDC 449
>gi|91806508|gb|ABE65981.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 203
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 44/112 (39%), Positives = 62/112 (55%), Gaps = 5/112 (4%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+A Y + IGTPP E V DTGSDL+W C C C + + FDP SS+ L
Sbjct: 75 SALYYTTVQIGTPPRELDVVIDTGSDLVWVSCNSC--VGCPLHNVTFFDPGASSSAVKLA 132
Query: 148 CSSSQCASLNQKS--CSGV-NCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
CS +C+S QK CS + +C Y V YGDGS ++G ++ ++ + +G A
Sbjct: 133 CSDKRCSSDLQKKSRCSLLESCTYKVEYGDGSVTSGYYISDLISFDTMSGVA 184
>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
Length = 480
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 76/160 (47%), Gaps = 12/160 (7%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I IGTP + DTGSD++W C+ CP D L+D K S+T ++
Sbjct: 74 YFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVG 133
Query: 148 CSSSQCASLNQ--KSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP---G 201
C + C+ + C G+ C YSV YGDGS + G + V +G P
Sbjct: 134 CDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGT 193
Query: 202 ITFGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTT 238
+ FGCG G S + GI+G G + S++SQ+ ++
Sbjct: 194 VVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASS 233
>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 86/363 (23%), Positives = 149/363 (41%), Gaps = 82/363 (22%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y ++ +GTPP E DTGSD++W T C CP + FDP +SS+ +
Sbjct: 84 YYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASLVS 143
Query: 148 CSSSQCAS--LNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL---PG 201
CS +C S + CS N C YS YGDGS ++G ++ ++ + +A+
Sbjct: 144 CSDRRCYSNFQTESGCSPNNLCSYSFKYGDGSGTSGFYISDFMSFDTVITSTLAINSSAP 203
Query: 202 ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT----------IAGNQRLG-- 246
FGC G GI GLG G +S+ISQ+ + G++ G
Sbjct: 204 FVFGCSNLQTGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDKSGGGI 263
Query: 247 -----VSTPDIV-----------------------------------------IDSGTTL 260
+ PD V ID+GTTL
Sbjct: 264 MVLGQIKRPDTVYTPLVPSQPHYNVNLQSIAVNGQILPIDPSVFTIATGDGTIIDTGTTL 323
Query: 261 TFLPQGYNSNLLSVMSSMIE--AQPVADPTGSLELCYSFNS--LSQVPEVTIHFRGADVK 316
+LP S + +++ + +P+ T C+ + + PEV++ F G
Sbjct: 324 AYLPDEAYSPFIQAIANAVSQYGRPI---TYESYQCFEITAGDVDVFPEVSLSFAGGASM 380
Query: 317 LSRSNFFVKV----SEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+ R + ++++ I C F+ +++ + I G+++ + +V YD+ +Q + + DC
Sbjct: 381 VLRPHAYLQIFSSSGSSIWCIGFQRMSHRRITILGDLVLKDKVVVYDLVRQRIGWAEYDC 440
Query: 372 TKQ 374
+ +
Sbjct: 441 SLE 443
>gi|357460823|ref|XP_003600693.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355489741|gb|AES70944.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 51/154 (33%), Positives = 73/154 (47%), Gaps = 23/154 (14%)
Query: 92 LIRISIGTPPTERLAVADTGSDLIWTQC-EPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
+I + IGTPP + V DTGS L W QC + PP+ FDP +SST+ LPC+
Sbjct: 76 IINLPIGTPPQTQPMVLDTGSQLSWIQCHKKQPPTAS-------FDPSLSSTFSILPCTH 128
Query: 151 SQCA------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
C +L C YS Y DG+++ GNL E T ++V+ P +
Sbjct: 129 PLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTF----SRSVSTPPLIL 184
Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
GC T ++ GI+G+ G +S Q + T
Sbjct: 185 GCATE-----STDPRGILGMNLGRLSFAKQSKIT 213
>gi|194698750|gb|ACF83459.1| unknown [Zea mays]
gi|194703964|gb|ACF86066.1| unknown [Zea mays]
gi|219886221|gb|ACL53485.1| unknown [Zea mays]
gi|219886359|gb|ACL53554.1| unknown [Zea mays]
gi|223950085|gb|ACN29126.1| unknown [Zea mays]
gi|414865218|tpg|DAA43775.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 431
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 64/213 (30%), Positives = 93/213 (43%), Gaps = 24/213 (11%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN 90
+ + H P SP S R DA R L + + ++S+ + P+
Sbjct: 24 LSVYHNVHPPSPSPLESIIALARADDA--RLLFLSSKAASSGGVTSAPVASGQTPPS--- 78
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y++R +GTP + L DT +D W+ C PC C F P SS+Y SLPC+S
Sbjct: 79 YVVRAGLGTPVQQLLLALDTSADATWSHCAPC--DTCPAGSR--FIPASSSSYASLPCAS 134
Query: 151 SQCASLNQKSCSG--------VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
C + C C +S + D SF +L ++T+ LG A+ G
Sbjct: 135 DWCPLFEGQPCPANQDASAPLPACAFSKPFADTSF-QASLGSDTLRLGKD-----AIAGY 188
Query: 203 TFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQ 234
FGC G G N G++GLG G +SL+SQ
Sbjct: 189 AFGCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQ 221
>gi|118482048|gb|ABK92955.1| unknown [Populus trichocarpa]
Length = 425
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 60/205 (29%), Positives = 95/205 (46%), Gaps = 21/205 (10%)
Query: 8 VFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQ-RLRDALTRSLNRLN 66
+F L FL +V + + G +V++ H SP+SPF S ++ + L RL
Sbjct: 5 LFSLAFLFLSLVQGLNTRGQGTTVKVFHVYSPQSPFRPSKPVSWEDSVLQMLAEDQARLQ 64
Query: 67 HFNQNSSISSSK-----ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEP 121
SS+ K AS I+ + Y+++ ++GTP L DT +D W C
Sbjct: 65 FL---SSLVGRKSWVPIASGRQIV-QSPTYIVKANVGTPAQTFLMALDTSNDAAWIPCN- 119
Query: 122 CPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGN 181
C S +F+ S+T+K+L C + QC + +C G C ++ +YG GS N
Sbjct: 120 ----GCVGCSSTVFNSVTSTTFKTLGCDAPQCKQVPNPTCGGSTCTWNTTYG-GSTILSN 174
Query: 182 LATETVTLGSTTGQAVALPGITFGC 206
L +T+ L + +PG TFGC
Sbjct: 175 LTRDTIALSTDI-----VPGYTFGC 194
>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 634
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 49/150 (32%), Positives = 73/150 (48%), Gaps = 12/150 (8%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
N Y R+ IGTPP + DTGS + + C C QC P F P+ SSTY+ +
Sbjct: 81 NGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTC--EQCGRHQDPKFQPESSSTYQPVK 138
Query: 148 CS-SSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C+ C S + C Y Y + S S+G L + ++ G+ + +A FGC
Sbjct: 139 CTIDCNCDS------DRMQCVYERQYAEMSTSSGVLGEDLISFGNQS--ELAPQRAVFGC 190
Query: 207 -GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
G L++ GI+GLG GD+S++ Q+
Sbjct: 191 ENVETGDLYSQHADGIMGLGRGDLSIMDQL 220
>gi|212722898|ref|NP_001132197.1| pepsin A precursor [Zea mays]
gi|194693730|gb|ACF80949.1| unknown [Zea mays]
gi|195605492|gb|ACG24576.1| pepsin A [Zea mays]
gi|413938914|gb|AFW73465.1| pepsin A [Zea mays]
Length = 519
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 94/387 (24%), Positives = 149/387 (38%), Gaps = 72/387 (18%)
Query: 54 LRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSD 113
LR L R RL NQ S+S ++ + Y + +GTP T L DTGSD
Sbjct: 63 LRSDLQRQKRRLAGKNQLLSLSKGGSTFSPGNDLGWLYYAWVDVGTPTTSFLVALDTGSD 122
Query: 114 LIWTQCE--PCPPSQCYM----QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQ 167
L W C+ C P Y +D ++ P S+T + LPCS C + + C
Sbjct: 123 LFWVPCDCIQCAPLSSYRGNLDRDLGIYKPAESTTSRHLPCSHELCQPGSGCTNPKQPCT 182
Query: 168 YSVSY-GDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGG--LFNSKTTGIVGL 224
Y++ Y + + S+G L +++ L S G A + GCG G L G++GL
Sbjct: 183 YNIDYFSENTTSSGLLIEDSLHLNSREGHAPVNASVIIGCGRKQSGDYLDGIAPDGLLGL 242
Query: 225 GGGDIS--------------------------------------------LISQMRT--- 237
G DIS L +++T
Sbjct: 243 GMADISVPSFLARAGLVRNSFSMCFKEDSSGRIFFGDQGVSSQQSTPFVPLYGKLQTYAV 302
Query: 238 ----TIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 293
+ G++ L S+ ++DSGT+ T LP + I A V + +
Sbjct: 303 NVDKSCIGHKCLEGSSFQALVDSGTSFTSLPPDVYKAFTTEFDKQINASRVPYEDSTWKY 362
Query: 294 CYSFNSLS--QVPEVTIHFRGADVKLSRSNFFVKVSED---IVCSVFKGITNSVPIYGNI 348
CYS + L VP + + F A+ N + +++ + + ++ PI I
Sbjct: 363 CYSASPLEMPDVPTIILAF-AANKSFQAVNPILPFNDEQGALARFCLAVLPSTEPI--GI 419
Query: 349 MQTNFLVGY----DIEQQTVSFKPTDC 371
+ NFLVGY D E + + ++C
Sbjct: 420 IGQNFLVGYHVVFDRESMKLGWYRSEC 446
>gi|224057272|ref|XP_002299201.1| predicted protein [Populus trichocarpa]
gi|118483775|gb|ABK93780.1| unknown [Populus trichocarpa]
gi|222846459|gb|EEE84006.1| predicted protein [Populus trichocarpa]
Length = 425
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 60/205 (29%), Positives = 95/205 (46%), Gaps = 21/205 (10%)
Query: 8 VFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQ-RLRDALTRSLNRLN 66
+F L FL +V + + G +V++ H SP+SPF S ++ + L RL
Sbjct: 5 LFSLAFLFLSLVQGLNTRGQGTTVKVFHVYSPQSPFRPSKPVSWEDSVLQMLAEDQARLQ 64
Query: 67 HFNQNSSISSSK-----ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEP 121
SS+ K AS I+ + Y+++ ++GTP L DT +D W C
Sbjct: 65 FL---SSLVGRKSWVPIASGRQIV-QSPTYIVKANVGTPAQTFLMALDTSNDAAWIPCN- 119
Query: 122 CPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGN 181
C S +F+ S+T+K+L C + QC + +C G C ++ +YG GS N
Sbjct: 120 ----GCVGCSSTVFNSVTSTTFKTLGCDAPQCKQVPNPTCGGSTCTWNTTYG-GSTILSN 174
Query: 182 LATETVTLGSTTGQAVALPGITFGC 206
L +T+ L + +PG TFGC
Sbjct: 175 LTRDTIALSTDI-----VPGYTFGC 194
>gi|147819672|emb|CAN76394.1| hypothetical protein VITISV_020864 [Vitis vinifera]
Length = 507
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 76/160 (47%), Gaps = 12/160 (7%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I IGTP + DTGSD++W C+ CP D L+D K S+T ++
Sbjct: 78 YFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVG 137
Query: 148 CSSSQCASLNQ--KSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP---G 201
C + C+ + C G+ C YSV YGDGS + G + V +G P
Sbjct: 138 CDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGT 197
Query: 202 ITFGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTT 238
+ FGCG G S + GI+G G + S++SQ+ ++
Sbjct: 198 VVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASS 237
>gi|222637181|gb|EEE67313.1| hypothetical protein OsJ_24553 [Oryza sativa Japonica Group]
Length = 414
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 46/112 (41%), Positives = 64/112 (57%), Gaps = 10/112 (8%)
Query: 126 QCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK--SCSGVNCQYSVSYGDGSFSNGNLA 183
+C + +P F P SST+ LPC+SS C L +C+ C Y YG G F+ G LA
Sbjct: 87 ECAARPAPPFQPASSSTFSKLPCASSLCQFLTSPYLTCNATGCVYYYPYGMG-FTAGYLA 145
Query: 184 TETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
TET+ +G + PG+ FGC T NG + ++GIVGLG +SL+SQ+
Sbjct: 146 TETLHVGGAS-----FPGVAFGCSTENG--VGNSSSGIVGLGRSPLSLVSQV 190
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 61/137 (44%), Gaps = 18/137 (13%)
Query: 253 VIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGS---LELCYSFNSLSQ-----V 303
++DSGTTLT+L +GY + +S M A G+ +LC+ N+ V
Sbjct: 275 IVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFDLCFDANAAGGGSGVPV 334
Query: 304 PEVTIHFRGADVKLSRSNFFVKVSE-------DIVCSVFKGITN--SVPIYGNIMQTNFL 354
P + + F G R +V V E + C + + S+ I GN+MQ +
Sbjct: 335 PTLVLRFAGGAEYAVRRRSYVGVVEVDSQGRAAVECLLVLPASEKLSISIIGNVMQMDLH 394
Query: 355 VGYDIEQQTVSFKPTDC 371
V YD++ SF P DC
Sbjct: 395 VLYDLDGGMFSFAPADC 411
>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
Length = 425
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 68/207 (32%), Positives = 99/207 (47%), Gaps = 19/207 (9%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRD-ALTRSLNRLNHFNQNS-SISSSKASQADIIPNN 88
+ + H +S SPF S L+D A L+ L ++S I+S +A I +
Sbjct: 31 LRVFHINSQCSPFKTSVSWADTLLQDKARFLYLSSLAGVRKSSVPIASGRA-----IVQS 85
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y++R +IGTP L DT +D W C C S LFDP SS+ ++L C
Sbjct: 86 PTYIVRANIGTPAQPMLVALDTSNDAAWIPCSGC----VGCSSSVLFDPSKSSSSRTLQC 141
Query: 149 SSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
+ QC SC+ +C ++++YG GS L +T+TL S +P TFGC
Sbjct: 142 EAPQCKQAPNPSCTVSKSCGFNMTYG-GSTIEAYLTQDTLTLASDV-----IPNYTFGC- 194
Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQ 234
N + G++GLG G +SLISQ
Sbjct: 195 INKASGTSLPAQGLMGLGRGPLSLISQ 221
>gi|357113696|ref|XP_003558637.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 432
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 54/169 (31%), Positives = 80/169 (47%), Gaps = 25/169 (14%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
+Y++R +G+P L DT +D W C PC C S LF P S++Y LPCS
Sbjct: 76 SYVVRAGLGSPAQPILLALDTSADATWAHCSPC--GTCPSSGS-LFAPANSTSYAPLPCS 132
Query: 150 SSQCASLNQKSCSGVN----------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
S+ C L + C + C ++ + D SF +LA++ + LG A+
Sbjct: 133 STMCTVLQGQPCPAQDPYDSSAPLPMCAFTKPFADASF-QASLASDWLHLGKD-----AI 186
Query: 200 PGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGV 247
P FGC +G N G++GLG G ++L+SQ+ GN GV
Sbjct: 187 PNYAFGCVSAVSGPTANLPKQGLLGLGRGPMALLSQV-----GNMYNGV 230
>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 51/153 (33%), Positives = 74/153 (48%), Gaps = 17/153 (11%)
Query: 92 LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
L+ + IGTPP + + DTGS L W QC P + S +FDP +SS++ LPC+
Sbjct: 78 LVSLPIGTPPQSQQMILDTGSQLSWIQCHKKVPRKP--PPSTVFDPSLSSSFSVLPCNHP 135
Query: 152 QCA----SLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
C + +N C YS Y DG+ + GNL E +T +T Q+ P + G
Sbjct: 136 LCKPRIPDFTLPTSCDLNRLCHYSYFYADGTLAEGNLVREKITF--STSQST--PPLILG 191
Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
C + S GI+G+ G +S SQ + T
Sbjct: 192 CAED-----ASDDKGILGMNLGRLSFASQAKIT 219
>gi|359478045|ref|XP_002267046.2| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 502
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 86/360 (23%), Positives = 138/360 (38%), Gaps = 80/360 (22%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I IGTP + DTGSD++W QC CP + L+D K S T K +
Sbjct: 98 YYAKIGIGTPARDYYVQVDTGSDIMWVNCIQCNECPKKSSLGMELTLYDIKESLTGKLVS 157
Query: 148 CSSSQCASLN----QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVALP 200
C C ++N + ++C Y+ Y DGS S G + V +G A
Sbjct: 158 CDQDFCYAINGGPPSYCIANMSCSYTEIYADGSSSFGYFVRDIVQYDQVSGDLETTSANG 217
Query: 201 GITFGCGTNNGGLFNSKTT--GIVGLG----------------------------GGDIS 230
+ FGC G +S+ GI+G G GG I
Sbjct: 218 SVIFGCSATQSGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLNGGGIF 277
Query: 231 LISQ---------------------MRTTIAGNQRLGVSTP--DI------VIDSGTTLT 261
I M+ G L + T D+ +IDSGTTL
Sbjct: 278 AIGHIVQPKVNTTPLVPNQTHYNVNMKAVEVGGYFLNLPTDVFDVGDKKGTIIDSGTTLA 337
Query: 262 FLPQGYNSNLLSVMSSM---IEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLS 318
+LP+ LLS + S ++ + D + YS + P VT HF +
Sbjct: 338 YLPEVVYDQLLSKIFSWQSDLKVHTIHDQFTCFQ--YSESLDDGFPAVTFHFENSLYLKV 395
Query: 319 RSNFFVKVSEDIVCSVFK--GITN----SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
+ ++ + + C ++ G+ + ++ + G++ +N LV YD+E Q + + +C+
Sbjct: 396 HPHEYLFSYDGLWCIGWQNSGMQSRDRRNITLLGDLALSNKLVLYDLENQVIGWTEYNCS 455
>gi|326532354|dbj|BAK05106.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 564
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 87/331 (26%), Positives = 124/331 (37%), Gaps = 63/331 (19%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQCYMQ----DSPLFDPKMSSTYK 144
Y + +GTP T + DTGSDL W C+ C P Y + D ++ P S+T +
Sbjct: 143 YYTWVDVGTPNTSFMVALDTGSDLFWVPCDCIECAPLAGYRETLDRDLGIYKPAESTTSR 202
Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSY-GDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
LPCS C + S C YS Y + + S+G L + + L S A +
Sbjct: 203 HLPCSHELCPPGSGCSSPKQPCPYSTDYLQENTTSSGLLIEDILHLDSRESHAPVKASVV 262
Query: 204 FGCGTNNGG--LFNSKTTGIVGLGGGDISLISQM------RTTIA------------GNQ 243
GCG G L G++GLG DIS+ S + R + + G+Q
Sbjct: 263 IGCGRKQSGSYLDGIAPDGLLGLGMADISVPSFLARAGLVRNSFSMCFKEDSGRIFFGDQ 322
Query: 244 RLGV--STP------------------------------DIVIDSGTTLTFLPQGYNSNL 271
+ + STP + ++DSGT+ T LP +
Sbjct: 323 GVSIQQSTPFVPLYGKYQTYAVNVDKSCVGHKCFEATSFEALVDSGTSFTALPLNVYKAV 382
Query: 272 LSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRG-ADVKLSRSNFFVKVSE 328
+ A + S E CYS + L VP VT+ F + +K E
Sbjct: 383 AVEFDKQVHAPRITQEDASFEYCYSASPLKMPDVPTVTLTFAANKSFQAVNPTIVLKDGE 442
Query: 329 DIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 359
V + S G I Q NFL GY I
Sbjct: 443 GSVAGFCLALQKSPEPIGIIGQ-NFLTGYHI 472
>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
Length = 452
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 53/188 (28%), Positives = 88/188 (46%), Gaps = 16/188 (8%)
Query: 27 GGFSVELIHRDSPKSPFYNSSETPYQR--LRDALTRSLNRLNHFNQNSSISSSKA----S 80
G ++++ H P SP + P L D +R +RL + + ++ ++A +
Sbjct: 40 AGNTLQVSHAFGPCSPLGPGTTAPSWAGFLADQASRDASRLLYLDSLAARGKARAYAPIA 99
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
+ Y++R +GTPP + L DT +D W C C + C +P FDP S
Sbjct: 100 SGRQLLQTPTYVVRARLGTPPQQLLLAVDTSNDAAWIPCAGC--AGCPTSSAPPFDPAAS 157
Query: 141 STYKSLPCSSSQCASLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
++Y+S+PC S CA +C G C +S++Y D S L+ +++ + G AV
Sbjct: 158 TSYRSVPCGSPLCAQAPNAACPPGGKACGFSLTYADSSL-QAALSQDSLAVA---GDAVK 213
Query: 199 LPGITFGC 206
TFGC
Sbjct: 214 T--YTFGC 219
>gi|225438361|ref|XP_002273988.1| PREDICTED: aspartic proteinase Asp1 [Vitis vinifera]
gi|296082608|emb|CBI21613.3| unnamed protein product [Vitis vinifera]
Length = 426
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 86/359 (23%), Positives = 136/359 (37%), Gaps = 84/359 (23%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
Y + +SIG PP DTGSDL W QC+ PC +C PL+ P + C
Sbjct: 67 YYVSLSIGQPPKPYFLDPDTGSDLSWLQCDAPC--VRCTKAPHPLYRPNNNLVI----CK 120
Query: 150 SSQCASLN---QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
CASL+ K C Y V Y DG S G L + L T G +A P + GC
Sbjct: 121 DPMCASLHPPGYKCEHPEQCDYEVEYADGGSSLGVLVKDVFPLNFTNGLRLA-PRLALGC 179
Query: 207 GTNN-GGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLG------------------- 246
G + G G++GLG G S++SQ+ + +G
Sbjct: 180 GYDQIPGQSYHPLDGVLGLGKGKSSIVSQLHSQGVIRNVVGHCVSSRGGGFLFFGDDLYD 239
Query: 247 ----VSTPDI----------------------------VIDSGTTLTFLPQGYNSNLLSV 274
V TP + DSG++ T+L L+ +
Sbjct: 240 SSRVVWTPMLRDQHTHYSSGYAELILGGKTTVFKNLLVTFDSGSSYTYLNSLAYQALVHL 299
Query: 275 MSSMIEAQPVADPTG--SLELCYS----FNSLSQVPE----VTIHFRGA-------DVKL 317
+ + +PV + +L LC+ F S+ V + + + F G D+ L
Sbjct: 300 VRKELSEKPVREALDDQTLPLCWRGKRPFKSVRDVKKFFKPLALSFPGGGRTKTQYDIPL 359
Query: 318 SRSNFFVKVSEDIVCSVFKGITNSVP---IYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
S + + ++ + G + + G+I + +V YD E+ + + PT+C +
Sbjct: 360 -ESYLIISLKGNVCLGILNGTEAGLQDFNLIGDISMQDKMVVYDNEKNQIGWAPTNCDR 417
>gi|388502484|gb|AFK39308.1| unknown [Medicago truncatula]
Length = 425
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 58/194 (29%), Positives = 93/194 (47%), Gaps = 17/194 (8%)
Query: 23 EAQTGGFSVELIHRDSPKSPFYNSSETPYQ----RLRDALTRSLNRLNHFNQNSSISSSK 78
+ Q G ++++IH SP SPF S ++ +++ T L L+ SI
Sbjct: 23 DVQDNGSTLQVIHVFSPCSPFRPSKPLSWEESVLQMQAKDTTRLQFLDSLVARKSIVP-I 81
Query: 79 ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
AS II + Y++R IGTPP L DT +D W C C C S LF P+
Sbjct: 82 ASGRQII-QSPTYIVRAKIGTPPQTLLLAMDTSNDAAWIPCTAC--DGC---ASTLFAPE 135
Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
S+T+K++ C++ +C + C + ++++YG S + NL +T+TL +
Sbjct: 136 KSTTFKNVSCAAPECKQVPNPGCGVSSRNFNLTYGSSSIA-ANLVQDTITLATD-----P 189
Query: 199 LPGITFGCGTNNGG 212
+P TFGC + G
Sbjct: 190 VPSYTFGCVSKTTG 203
>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 49/153 (32%), Positives = 72/153 (47%), Gaps = 17/153 (11%)
Query: 92 LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
L+ + IGTPP + + DTGS L W QC P + S +FDP +SS++ LPC+
Sbjct: 83 LVSLPIGTPPQTQQMILDTGSQLSWIQCHKKVPRK--PPPSSVFDPSLSSSFSVLPCNHP 140
Query: 152 QCA------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
C +L C YS Y DG+ + GNL E +T ++ + P + G
Sbjct: 141 LCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKITF----SRSQSTPPLILG 196
Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
C +S GI+G+ G +S SQ + T
Sbjct: 197 CAEE-----SSDAKGILGMNLGRLSFASQAKLT 224
>gi|255586860|ref|XP_002534040.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223525947|gb|EEF28344.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 518
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 95/359 (26%), Positives = 145/359 (40%), Gaps = 94/359 (26%)
Query: 95 ISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQ--CYMQDSPL--FDPKMSSTYKSLPC 148
+ +GTP + + DTGSDL W C+ C P+Q Y D L +DPK SST K + C
Sbjct: 105 VELGTPGMKFMVALDTGSDLFWVPCDCSKCAPTQGVAYASDFELSIYDPKQSSTSKKVTC 164
Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFS-NGNLATETVTLGSTTGQAVALPG-ITFGC 206
+++ CA N+ + +C Y VSY S +G L + + L S ++ +TFGC
Sbjct: 165 NNNLCAHRNRCLGTFSSCPYMVSYVSAQTSTSGILVEDVLHLTSEDSNQESIKAYVTFGC 224
Query: 207 GTNNGGLF--NSKTTGIVGLGGGDISLIS------------------------------- 233
G G F + G+ GLG IS+ S
Sbjct: 225 GQVQSGSFLNTAAPNGLFGLGMDQISVPSILSREGLTADSFSMCFGHDGVGRISFGDKGS 284
Query: 234 --QMRTTIAGNQ------------RLGVSTPDI----VIDSGTTLTFLPQGYNSNLLSVM 275
Q T N R+G + D+ + DSGT+ T+L + + +++
Sbjct: 285 PDQEETPFNSNPSHPSYNISVTQVRVGTTLVDVDFTALFDSGTSFTYLI----NPIYAMV 340
Query: 276 SSMIEAQPV-----ADPTGSLELCYSFN---SLSQVPEVTIHFRGADVKLSRSNFFV--- 324
S AQ DP E CY + + S +P +++ +G R +F V
Sbjct: 341 SENFHAQAQDKRRPPDPRIPFEYCYDMSPGANSSLIPSMSLTMKG------RGHFTVFDP 394
Query: 325 -----KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGY----DIEQQTVSFKPTDCTKQ 374
+E + C I S + NI+ NF+ GY D E+ + +K TDC Q
Sbjct: 395 IIVITTQNELVYC---LAIVKSTEL--NIIGQNFMTGYRVVFDREKLVLGWKETDCYDQ 448
>gi|356515904|ref|XP_003526637.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 421
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 91/365 (24%), Positives = 152/365 (41%), Gaps = 93/365 (25%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMSSTYKSLP-C 148
Y + ++IG PP DTGSDL W QC+ PC C + + L+ P + P C
Sbjct: 64 YTVSLAIGNPPKVYDLDIDTGSDLTWVQCDAPC--QGCTIPRNRLYKPNGNLVKCGDPLC 121
Query: 149 SSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
+ Q A C+G N C Y V Y D S G L + + L T G ++A P + FGC
Sbjct: 122 KAIQSAP--NHHCAGPNEQCDYEVEYADQGSSLGVLLRDNIPLKFTNG-SLARPILAFGC 178
Query: 207 GTNN---GGLFNSKTTGIVGLGGGDISLISQMRT-----TIAGN---QRLG--------- 246
G + G ++ T G++GLG G S++SQ+ + + G+ +R G
Sbjct: 179 GYDQKHVGHNPSASTAGVLGLGNGKTSILSQLHSLGLIRNVVGHCLSERGGGFLFFGDQL 238
Query: 247 ------VSTP-----------------------------DIVIDSGTTLTFLPQGYNSNL 271
V TP ++ DSG++ T+ + L
Sbjct: 239 VPQSGVVWTPLLQSSSTQHYKTGPADLFFDRKPTSVKGLQLIFDSGSSYTYFNSKAHKAL 298
Query: 272 LSVMSSMIEAQPVADPT--GSLELCYS----FNSLSQVPEVTIHFRGADVKLSRSN---- 321
++++++ + +P++ T SL +C+ F SL +VT +F+ + ++S
Sbjct: 299 VNLVTNDLRGKPLSRATEDSSLPICWRGPKPFKSLH---DVTSNFKPLLLSFTKSKNSLL 355
Query: 322 ------FFVKVSEDIVC-------SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
+ + VC + G TN I G+I + LV YD E+Q + +
Sbjct: 356 QLPPEAYLIVTKHGNVCLGILDGTEIGLGNTN---IIGDISLQDKLVIYDNEKQQIGWAS 412
Query: 369 TDCTK 373
+C +
Sbjct: 413 ANCDR 417
>gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 54/212 (25%), Positives = 88/212 (41%), Gaps = 16/212 (7%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQ--ADIIPNN 88
++L HRD+ P R+ D + R + ++ + I
Sbjct: 33 LKLAHRDT-------LWPNPLSRIEDIIGADQKRHSLISRKRKFKGGVKMDLGSGIDYGT 85
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
A Y + +GTP + V DTGS+L W C + +++ +F + S ++K++ C
Sbjct: 86 AQYFTEVRVGTPAKKFRVVVDTGSELTWVNCRYRGRGKGKVKNRRVFRAEESKSFKTVGC 145
Query: 149 SSSQCA-------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
+ C SL+ C Y Y DGS + G A ET+T+G T G+ L G
Sbjct: 146 FTQTCKVDLMNLFSLSTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRKARLRG 205
Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLIS 233
+ GC ++ G G++GL D S S
Sbjct: 206 LLVGCSSSFSGQSFQGADGVLGLAFSDFSFTS 237
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 40/127 (31%), Positives = 68/127 (53%), Gaps = 8/127 (6%)
Query: 253 VIDSGTTLTFLPQG-YNSNLLSVMSSMIEAQPVADPTGSLELCYS----FNSLSQVPEVT 307
++DSGT+LT L + Y + + ++E + V +E C+S FN S++P++T
Sbjct: 324 ILDSGTSLTLLAEAAYKPVVTGLARYLVELKRVKPEGIPIEYCFSSTSGFNE-SKLPQLT 382
Query: 308 IHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGI-TNSVPIYGNIMQTNFLVGYDIEQQTVS 365
H +G A + R ++ V + + C F T + + GNIMQ N+L +D+ T+S
Sbjct: 383 FHLKGGARFEPHRKSYLVDAAPGVKCLGFMSAGTPATNVVGNIMQQNYLWEFDLMASTLS 442
Query: 366 FKPTDCT 372
F P+ CT
Sbjct: 443 FAPSTCT 449
>gi|449446233|ref|XP_004140876.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 498
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 89/360 (24%), Positives = 142/360 (39%), Gaps = 78/360 (21%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I IGTP + DTGSD++W QC CP + + +D + S+T K +
Sbjct: 87 YYAKIGIGTPSKDYYVQVDTGSDIVWVNCIQCRECPRTSSLGMELTPYDLEESTTGKLVS 146
Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTG--QAVALPG 201
C C +N SG ++C Y YGDGS + G + V +G + A G
Sbjct: 147 CDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDGSSTAGYFVKDYVQYNRVSGDLETTAANG 206
Query: 202 -ITFGCGTNNGGLFNS----KTTGIVGLGGGDISLISQMRTT----------IAGNQRLG 246
I FGCG G S GI+G G + S+ISQ+ +T + G G
Sbjct: 207 SIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCLDGTNGGG 266
Query: 247 VSTPDIVIDSGTTLT-FLPQ--GYNSNLLSVMSSMI------EAQPVADPTGSL------ 291
+ V+ +T +P YN N+ V I + D G++
Sbjct: 267 IFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVGHIILNISADVFEAGDRKGTIIDSGTT 326
Query: 292 -----ELCYS---FNSLSQ-------------------------VPEVTIHFRGADVKLS 318
EL Y LSQ P V HF + +
Sbjct: 327 LAYLPELIYEPLVAKILSQQHNLEVQTIHGEYKCFQYSERVDDGFPPVIFHFENSLLLKV 386
Query: 319 RSNFFVKVSEDIVCSVFK--GI----TNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
+ ++ E++ C ++ G+ +V ++G+++ +N LV YD+E QT+ + +C+
Sbjct: 387 YPHEYLFQYENLWCIGWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCS 446
>gi|361067983|gb|AEW08303.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
Length = 154
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 51/153 (33%), Positives = 72/153 (47%), Gaps = 12/153 (7%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSKASQADIIP 86
G ++ L H SP + + + L +L R +RL +S + S +
Sbjct: 3 GVNIRLDHIHGACSPLRPINSSSWIDLVSQSLERDDDRLKTIRSRNSSPYTTMSNLPLQS 62
Query: 87 NN----ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
N NY++ +GTP L + DTGSDL W QC+PC CY Q P+F+P+ SS+
Sbjct: 63 GNKVGTGNYIVTAGLGTPTKNFLLIIDTGSDLTWIQCKPC--LDCYSQVDPIFEPRQSSS 120
Query: 143 YKSLPCSSSQCASL-----NQKSCSGVNCQYSV 170
YKSLPC S+ C L N C C Y +
Sbjct: 121 YKSLPCLSATCTELLISESNLTPCLLGGCSYEI 153
>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
Length = 642
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 94/372 (25%), Positives = 149/372 (40%), Gaps = 90/372 (24%)
Query: 83 DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS---------- 132
D + N Y R+ IGTP E + D+GS + + C C QC S
Sbjct: 84 DDLLTNGYYTTRLYIGTPSQEFALIVDSGSTVTYVPCATC--EQCGNHQSESPNIIEAHD 141
Query: 133 PLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST 192
P F P +SSTY + C+ C N++S C Y Y + S S+G L + ++ G
Sbjct: 142 PRFQPDLSSTYSPVKCNVD-CTCDNERS----QCTYERQYAEMSSSSGVLGEDIMSFGKE 196
Query: 193 TGQAVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTIA-------GN 242
+ + FGC T G LF+ GI+GLG G +S++ Q+ + I+ G
Sbjct: 197 S--ELKPQRAVFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGG 254
Query: 243 QRLGVST---------PDIVID-------------------SGTTLTFLPQGYNSNLLSV 274
+G T PD+V +G L P+ +NS +V
Sbjct: 255 MDVGGGTMVLGGMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTV 314
Query: 275 MSS------------MIEAQPVADPTGSL-----------ELCYS-----FNSLSQV-PE 305
+ S + V + SL ++C++ + LS+V P+
Sbjct: 315 LDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPD 374
Query: 306 VTIHF-RGADVKLSRSNFFVKVS--EDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQ 361
V + F G + LS N+ + S E C VF+ + + G I+ N LV YD
Sbjct: 375 VDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHN 434
Query: 362 QTVSFKPTDCTK 373
+ + F T+C++
Sbjct: 435 EKIGFWKTNCSE 446
>gi|9759559|dbj|BAB11161.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|21553652|gb|AAM62745.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|109134179|gb|ABG25087.1| At5g07030 [Arabidopsis thaliana]
Length = 439
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 56/201 (27%), Positives = 91/201 (45%), Gaps = 30/201 (14%)
Query: 23 EAQTGGFSVELIHRDSPKSPFYNSSETPYQ-RLRDALTRSLNRLNHFNQNSSISSSKASQ 81
+ Q G ++ + H DSP SPF +SS ++ R+ L + RL + SS +
Sbjct: 29 KTQDQGSTLRIFHIDSPCSPFKSSSPLSWEARVLQTLAQDQARLQYL-------SSLVAG 81
Query: 82 ADIIP--------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP--PSQCYMQD 131
++P + Y+++ IGTP L DT SD+ W C C PS
Sbjct: 82 RSVVPIASGRQMLQSTTYIVKALIGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSNTA--- 138
Query: 132 SPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGS 191
F P S+++K++ CS+ QC + +C C ++++YG S + NL+ +T+ L
Sbjct: 139 ---FSPAKSTSFKNVSCSAPQCKQVPNPTCGARACSFNLTYGSSSIA-ANLSQDTIRLA- 193
Query: 192 TTGQAVALPGITFGCGTNNGG 212
A + TFGC G
Sbjct: 194 ----ADPIKAFTFGCVNKVAG 210
>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 64/207 (30%), Positives = 95/207 (45%), Gaps = 19/207 (9%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSI--SSSKASQADIIPNN 88
+ + H +S SPF S D L + R + + + + SS + I +
Sbjct: 31 LRVFHINSLCSPFKTSVS-----WADTLLQDKARFLYLSSLAGVRKSSVPIASGRAIVQS 85
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
Y++R +IGTP L DT +D W C C S LFDP SS+ ++L C
Sbjct: 86 PTYIVRANIGTPAQPMLVALDTSNDAAWIPCSGC----VGCSSSVLFDPSKSSSSRTLQC 141
Query: 149 SSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
+ QC SC+ +C ++++YG GS L +T+TL S +P TFGC
Sbjct: 142 EAPQCKQAPNPSCTVSKSCGFNMTYG-GSTIEAYLTQDTLTLASDV-----IPNYTFGC- 194
Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQ 234
N + G++GLG G +SLISQ
Sbjct: 195 INKASGTSLPAQGLMGLGRGPLSLISQ 221
>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 439
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 72/252 (28%), Positives = 111/252 (44%), Gaps = 31/252 (12%)
Query: 8 VFILFFLCFYVVSPIEA---------QTGGFSVELIHRDSPKSPFYNSSETPY-QRLRDA 57
+F F VVS +A ++ G + +IH SPF + + +
Sbjct: 3 IFTAFVFLTLVVSTTKAFDPCASPSSESKGSDLSVIHVYGQCSPFNQHKAGSWVNTVINM 62
Query: 58 LTRSLNRLNHFNQNSSISSSKASQADI-----IPNNANYLIRISIGTPPTERLAVADTGS 112
++ R+ + + S ++S KA+ I + N NY++R+ +GTP V DT
Sbjct: 63 ASKDPARVTYLS--SLVASPKATSVPIASGQQVLNIGNYVVRVKLGTPGQLMFMVLDTSR 120
Query: 113 DLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC---SGVNCQYS 169
D W C C + C SP F P SSTY SL CS QC + SC C ++
Sbjct: 121 DAAWVPCADC--AGC---SSPTFSPNTSSTYASLQCSVPQCTQVRGLSCPTTGTAACFFN 175
Query: 170 VSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDI 229
+YG S + L+ +++ L T LP +FGC G G++GLG G +
Sbjct: 176 QTYGGDSSFSAMLSQDSLGLAVDT-----LPSYSFGCVNAVSG-STLPPQGLLGLGRGPM 229
Query: 230 SLISQMRTTIAG 241
SL+SQ + +G
Sbjct: 230 SLLSQSGSLYSG 241
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 28/112 (25%), Positives = 49/112 (43%), Gaps = 7/112 (6%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRG 312
+IDSGT +T + + + ++ P A G+ + C++ + P VT HF G
Sbjct: 317 IIDSGTVITRFVEPVYAAIRDEFRKQVKG-PFAT-IGAFDTCFAATNEDIAPPVTFHFTG 374
Query: 313 ADVKLSRSNFFVKVSE-DIVCSVFKGITNSV----PIYGNIMQTNFLVGYDI 359
D+KL N + S + C N+V + N+ Q N + +D+
Sbjct: 375 MDLKLPLENTLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNLRIMFDV 426
>gi|79507883|ref|NP_196320.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332003717|gb|AED91100.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 455
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 54/199 (27%), Positives = 91/199 (45%), Gaps = 26/199 (13%)
Query: 23 EAQTGGFSVELIHRDSPKSPFYNSSETPYQ-RLRDALTRSLNRLNHFNQNSSISSSKASQ 81
+ Q G ++ + H DSP SPF +SS ++ R+ L + RL + SS +
Sbjct: 45 KTQDQGSTLRIFHIDSPCSPFKSSSPLSWEARVLQTLAQDQARLQYL-------SSLVAG 97
Query: 82 ADIIP--------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP 133
++P + Y+++ IGTP L DT SD+ W C C +
Sbjct: 98 RSVVPIASGRQMLQSTTYIVKALIGTPAQPLLLAMDTSSDVAWIPCSGC----VGCPSNT 153
Query: 134 LFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
F P S+++K++ CS+ QC + +C C ++++YG S + NL+ +T+ L +
Sbjct: 154 AFSPAKSTSFKNVSCSAPQCKQVPNPTCGARACSFNLTYGSSSIA-ANLSQDTIRLAADP 212
Query: 194 GQAVALPGITFGCGTNNGG 212
+A TFGC G
Sbjct: 213 IKA-----FTFGCVNKVAG 226
>gi|376337722|gb|AFB33417.1| hypothetical protein 2_5996_01, partial [Pinus mugo]
gi|376337724|gb|AFB33418.1| hypothetical protein 2_5996_01, partial [Pinus mugo]
gi|376337726|gb|AFB33419.1| hypothetical protein 2_5996_01, partial [Pinus mugo]
gi|376337728|gb|AFB33420.1| hypothetical protein 2_5996_01, partial [Pinus mugo]
gi|376337730|gb|AFB33421.1| hypothetical protein 2_5996_01, partial [Pinus mugo]
gi|376337732|gb|AFB33422.1| hypothetical protein 2_5996_01, partial [Pinus mugo]
Length = 154
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 53/153 (34%), Positives = 72/153 (47%), Gaps = 12/153 (7%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSKAS----QA 82
G ++ L H SP ++ + + L +L R +RL +S + S Q+
Sbjct: 3 GVNIRLDHIHGACSPLRPTNSSKWIDLVSQSLERDNDRLKTIRSRNSGPYTTMSNLPLQS 62
Query: 83 DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
NY++ GTP + L V DTGSDL W QC+PC CY Q P+FDP SS+
Sbjct: 63 GSEVGTGNYILTAGFGTPTKKFLLVIDTGSDLTWIQCKPC--LGCYSQVDPIFDPSQSSS 120
Query: 143 YKSLPCSSSQCASL-----NQKSCSGVNCQYSV 170
YKSLPC S+ C L N C C Y +
Sbjct: 121 YKSLPCLSATCTELLTSESNLTPCLLGGCSYEI 153
>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 486
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 62/202 (30%), Positives = 90/202 (44%), Gaps = 18/202 (8%)
Query: 52 QRLRDALTRSLNRLNHFNQNSSISSSKAS---QADIIPNNAN-YLIRISIGTPPTERLAV 107
R+ A ++ +R H ++ Q PN+ Y ++ +GTPP E
Sbjct: 35 HRVEVAALKARDRARHARMLRGVAGGVVDFSVQGTSDPNSVGLYYTKVKMGTPPKEFNVQ 94
Query: 108 ADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKS---C 161
DTGSD++W C CP S + FD SST +PCS C S Q + C
Sbjct: 95 IDTGSDILWVNCNTCSNCPQSSQLGIELNFFDTVGSSTAALIPCSDPICTSRVQGAAAEC 154
Query: 162 S-GVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL---PGITFGCGTNNGGLF-- 214
S VN C Y+ YGDGS ++G ++ + GQ A+ I FGC + G
Sbjct: 155 SPRVNQCSYTFQYGDGSGTSGYYVSDAMYFSLIMGQPPAVNSSATIVFGCSISQSGDLTK 214
Query: 215 -NSKTTGIVGLGGGDISLISQM 235
+ GI G G G +S++SQ+
Sbjct: 215 TDKAVDGIFGFGPGPLSVVSQL 236
Score = 38.1 bits (87), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 30/127 (23%), Positives = 60/127 (47%), Gaps = 8/127 (6%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN-SLSQV-PEVTIHF 310
++D GTTL +L Q L++ +++ + +Q CY + S+ + P V+++F
Sbjct: 313 IVDCGTTLAYLIQEAYDPLVTAINTAV-SQSARQTNSKGNQCYLVSTSIGDIFPSVSLNF 371
Query: 311 RG-ADVKLSRSNFFVK----VSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
G A + L + + ++ C F+ I G+++ + +V YDI QQ +
Sbjct: 372 EGGASMVLKPEQYLMHNGYLDGAEMWCIGFQKFQEGASILGDLVLKDKIVVYDIAQQRIG 431
Query: 366 FKPTDCT 372
+ DC+
Sbjct: 432 WANYDCS 438
>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
Length = 641
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 94/372 (25%), Positives = 149/372 (40%), Gaps = 90/372 (24%)
Query: 83 DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS---------- 132
D + N Y R+ IGTP E + D+GS + + C C QC S
Sbjct: 83 DDLLTNGYYTTRLYIGTPSQEFALIVDSGSTVTYVPCATC--EQCGNHQSESPNIIEAHD 140
Query: 133 PLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST 192
P F P +SSTY + C+ C N++S C Y Y + S S+G L + ++ G
Sbjct: 141 PRFQPDLSSTYSPVKCNVD-CTCDNERS----QCTYERQYAEMSSSSGVLGEDIMSFGKE 195
Query: 193 TGQAVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTIA-------GN 242
+ + FGC T G LF+ GI+GLG G +S++ Q+ + I+ G
Sbjct: 196 S--ELKPQRAVFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGG 253
Query: 243 QRLGVST---------PDIVID-------------------SGTTLTFLPQGYNSNLLSV 274
+G T PD+V +G L P+ +NS +V
Sbjct: 254 MDVGGGTMVLGGMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTV 313
Query: 275 MSS------------MIEAQPVADPTGSL-----------ELCYS-----FNSLSQV-PE 305
+ S + V + SL ++C++ + LS+V P+
Sbjct: 314 LDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPD 373
Query: 306 VTIHF-RGADVKLSRSNFFVKVS--EDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQ 361
V + F G + LS N+ + S E C VF+ + + G I+ N LV YD
Sbjct: 374 VDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHN 433
Query: 362 QTVSFKPTDCTK 373
+ + F T+C++
Sbjct: 434 EKIGFWKTNCSE 445
>gi|222623568|gb|EEE57700.1| hypothetical protein OsJ_08178 [Oryza sativa Japonica Group]
Length = 441
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 51/149 (34%), Positives = 75/149 (50%), Gaps = 24/149 (16%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQN-SSISSSKASQADI-- 84
G +EL H SP SP ++ P+ + LT R++ + S++A+ D
Sbjct: 42 GLHLELHHPRSPCSPAPVPADLPFTAV---LTHDDARISSLAARLAKTPSARATSLDADA 98
Query: 85 ----------IPNN-------ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
+P + NY+ R+ +GTP T+ + V DTGS L W QC PC S C
Sbjct: 99 DAGLAGSLASVPLSPGASVGVGNYVTRMGLGTPATQYVMVVDTGSSLTWLQCSPCLVS-C 157
Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASL 156
+ Q P+F+PK SSTY S+ CS+ QC+ L
Sbjct: 158 HRQSGPVFNPKSSSTYASVGCSAQQCSDL 186
Score = 47.4 bits (111), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 49/115 (42%), Gaps = 3/115 (2%)
Query: 260 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN-SLSQVPEVTIHFRG-ADVKL 317
+T LP S L +++ ++ A L+ C+ S P VT+ F G A +KL
Sbjct: 328 ITRLPTSVYSALSKAVAAAMKGTSRASAYSILDTCFKGQASRVSAPAVTMSFAGGAALKL 387
Query: 318 SRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
S N V V + C F S I GN Q F V YD++ + F C+
Sbjct: 388 SAQNLLVDVDDSTTCLAF-APARSAAIIGNTQQQTFSVVYDVKSSRIGFAAGGCS 441
>gi|361067981|gb|AEW08302.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156226|gb|AFG60348.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156228|gb|AFG60350.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156229|gb|AFG60351.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156230|gb|AFG60352.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156231|gb|AFG60353.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156232|gb|AFG60354.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156233|gb|AFG60355.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156235|gb|AFG60357.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156237|gb|AFG60359.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156238|gb|AFG60360.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156240|gb|AFG60362.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156241|gb|AFG60363.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
Length = 154
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 51/154 (33%), Positives = 73/154 (47%), Gaps = 12/154 (7%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSKASQADIIP 86
G ++ L H SP ++ + + L +L R +RL +S + S +
Sbjct: 3 GVNIRLDHIHGACSPLRPANSSKWIDLVSQSLERDNDRLKTIRSRNSGPYTTMSNLPLQS 62
Query: 87 NN----ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
N NY++ GTP + L + DTGSDL W QC+PC CY Q P+F+P SS+
Sbjct: 63 GNKVGTGNYIVTAGFGTPTKKFLLIIDTGSDLTWIQCKPC--LGCYSQVDPIFEPSQSSS 120
Query: 143 YKSLPCSSSQCASL-----NQKSCSGVNCQYSVS 171
YKSLPC S+ C L N C C Y ++
Sbjct: 121 YKSLPCLSATCTELLTSESNLTPCFLGGCSYEIN 154
>gi|357143901|ref|XP_003573095.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 627
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 99/388 (25%), Positives = 145/388 (37%), Gaps = 74/388 (19%)
Query: 54 LRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN----YLIRISIGTPPTERLAVAD 109
+R L R RL ++ +S SK IIP + Y + +GTP T + D
Sbjct: 170 VRSDLQRQKRRLGG-GKHQLLSFSK--DGGIIPTGNDFGWLYYTWVDVGTPNTSFMVALD 226
Query: 110 TGSDLIWTQCE--PCPPSQCYM----QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSG 163
TGSDL W C+ C P Y +D ++ P S+T + LPCS C + +
Sbjct: 227 TGSDLFWIPCDCIECAPLSGYHGSLDRDLGIYKPAESTTSRHLPCSHELCLLGSDCTNQK 286
Query: 164 VNCQYSVSY-GDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGG--LFNSKTTG 220
C Y+ Y + + S+G L + + L S A + GCG G L G
Sbjct: 287 QPCPYNTKYLQENTTSSGLLVEDILHLDSRESHAPVKASVIIGCGRKQSGSYLDGIAPDG 346
Query: 221 IVGLGGGDISLISQM-RTTIAGNQ----------RL-----GVSTP-------------- 250
++GLG DIS+ S + R + N R+ GVST
Sbjct: 347 LLGLGMADISVPSFLARAGLVRNSFSMCFTKDSGRIFFGDQGVSTQQSTPFVPLYGKLQT 406
Query: 251 --------------------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS 290
++DSGT+ T LP + + A + S
Sbjct: 407 YTVNVDKSCVGHKCFESTSFQAIVDSGTSFTALPLDIYKAVAIEFDKQVNASRLPQEATS 466
Query: 291 LELCYSFNSL--SQVPEVTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGN 347
+ CYS + L VP VT+ F G + F + E V + S G
Sbjct: 467 FDYCYSASPLVMPDVPTVTLTFAGNKSFQPVNPTFLLHDEEGAVAGFCLAVVQSPEPIGI 526
Query: 348 IMQTNFLVGY----DIEQQTVSFKPTDC 371
I Q NFL+GY D E + + ++C
Sbjct: 527 IAQ-NFLLGYHVVFDRENMKLGWYRSEC 553
>gi|6580159|emb|CAB62657.2| putative protein [Arabidopsis thaliana]
Length = 475
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 78/304 (25%), Positives = 126/304 (41%), Gaps = 29/304 (9%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM--------QDSP--LFDPKMS 140
Y +S+GTPP+ L DTGSDL W C C + C Q P L+ P S
Sbjct: 102 YYANVSVGTPPSSFLVALDTGSDLFWLPCN-C-GTTCIRDLEDIGVPQSVPLNLYTPNAS 159
Query: 141 STYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
+T S+ CS +C K CS + C Y +SY + + + G L + + L +
Sbjct: 160 TTSSSIRCSDKRC--FGSKKCSSPSSICPYQISYSNSTGTKGTLLQDVLHLATEDENLTP 217
Query: 199 LPG-ITFGCGTNNGGLF--NSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVID 255
+ +T GCG GLF N+ G++GLG S+ S + + ++ +
Sbjct: 218 VKANVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVPSLLAKANITANSFSMCFGRVIGN 277
Query: 256 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN---SLSQVPEVTIHFRG 312
G ++F +GY + S+ + DP E CY + + Q P V + F G
Sbjct: 278 VG-RISFGDRGYTDQEETPFISVAPRRRPVDPELPFEFCYDLSPNATTIQFPLVEMTFIG 336
Query: 313 ADVKLSRSNFF-----VKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
+ + FF + E V G+ SV + N + + +D E+ + +K
Sbjct: 337 GSKIILNNPFFTARTQARHGEGNVMYCL-GVLKSVGLKINNFVAGYRIVFDRERMILGWK 395
Query: 368 PTDC 371
+ C
Sbjct: 396 QSLC 399
>gi|383156225|gb|AFG60347.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156227|gb|AFG60349.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
Length = 154
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 51/154 (33%), Positives = 73/154 (47%), Gaps = 12/154 (7%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSKASQADIIP 86
G ++ L H SP ++ + + L +L R +RL +S + S +
Sbjct: 3 GVNIRLDHIHGACSPLRPANSSKWIDLISQSLERDNDRLKTIRSRNSGPYTTMSNLPLQS 62
Query: 87 NN----ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
N NY++ GTP + L + DTGSDL W QC+PC CY Q P+F+P SS+
Sbjct: 63 GNKVGTGNYIVTAGFGTPTKKFLLIIDTGSDLTWIQCKPC--LGCYSQVDPIFEPSQSSS 120
Query: 143 YKSLPCSSSQCASL-----NQKSCSGVNCQYSVS 171
YKSLPC S+ C L N C C Y ++
Sbjct: 121 YKSLPCLSATCTELLTSESNLTPCFLGGCSYEIN 154
>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 498
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 58/161 (36%), Positives = 80/161 (49%), Gaps = 16/161 (9%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y ++ +GTPP E DTGSD++W C CP S + FD SST +P
Sbjct: 84 YTTKVKMGTPPREFTVQIDTGSDILWINCNTCSNCPKSSGLGIELNFFDTVGSSTAALVP 143
Query: 148 CSSSQCASLNQKS---CS-GVN-CQYSVSYGDGSFSNGNLATET----VTLGSTTGQAVA 198
CS CAS Q + CS VN C Y+ Y DGS ++G ++ + LG +T VA
Sbjct: 144 CSDPMCASAIQGAAAQCSPQVNQCSYTFQYEDGSGTSGVYVSDAMYFDMILGQSTPANVA 203
Query: 199 LPG-ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM 235
I FGC T G + GI+G G G++S++SQ+
Sbjct: 204 SSATIVFGCSTYQSGDLTKTDKAVDGILGFGPGELSVVSQL 244
>gi|222613194|gb|EEE51326.1| hypothetical protein OsJ_32296 [Oryza sativa Japonica Group]
Length = 309
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 41/117 (35%), Positives = 61/117 (52%), Gaps = 7/117 (5%)
Query: 55 RDALTRSLNRLNHFNQNSSISSSKASQADIIP---NNANYLIRISIGTPPTERLAVADTG 111
D L R L + + ++ A A ++P + Y+ +IGTPP A+ D
Sbjct: 25 HDDLRRGLEQATRGRLLAD--ATPAGGAAVVPIRWSPPYYVANFTIGTPPQPASAIVDVA 82
Query: 112 SDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQY 168
+L+WTQC C +C+ QD P+F P SST+K PC ++ C S+ +SCSG C Y
Sbjct: 83 GELVWTQCSAC--RRCFKQDLPVFVPNASSTFKPEPCGTAVCESIPTRSCSGDVCSY 137
>gi|297810815|ref|XP_002873291.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
gi|297319128|gb|EFH49550.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
Length = 439
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 53/197 (26%), Positives = 90/197 (45%), Gaps = 26/197 (13%)
Query: 25 QTGGFSVELIHRDSPKSPFYNSSETPYQ-RLRDALTRSLNRLNHFNQNSSISSSKASQAD 83
Q G ++ + H DSP SPF + S ++ R+ L + RL + SS +
Sbjct: 31 QDQGSTLRIFHIDSPCSPFKSPSPLSWEARVLQTLAQDQARLQYL-------SSLVAGRS 83
Query: 84 IIP--------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
++P + Y++++ IGTP L DT SD+ W C C + F
Sbjct: 84 VVPIASGRQMLQSTTYIVKVLIGTPAQPLLLAMDTSSDVAWIPCSGC----VGCPSNTAF 139
Query: 136 DPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
P S+++K++ CS+ QC + +C C ++++YG S + NL+ +T+ L + +
Sbjct: 140 SPAKSTSFKNVSCSAPQCKQVPNPACGARACSFNLTYGSSSIA-ANLSQDTIRLAADPIK 198
Query: 196 AVALPGITFGCGTNNGG 212
A TFGC G
Sbjct: 199 A-----FTFGCVNKVAG 210
>gi|42565826|ref|NP_190703.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645261|gb|AEE78782.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 528
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 85/354 (24%), Positives = 137/354 (38%), Gaps = 76/354 (21%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP--------LFDPKMSST 142
Y +S+GTPP+ L DTGSDL W C ++D L+ P S+T
Sbjct: 102 YYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTTCIRDLEDIGVPQSVPLNLYTPNASTT 161
Query: 143 YKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
S+ CS +C K CS + C Y +SY + + + G L + + L + +
Sbjct: 162 SSSIRCSDKRC--FGSKKCSSPSSICPYQISYSNSTGTKGTLLQDVLHLATEDENLTPVK 219
Query: 201 G-ITFGCGTNNGGLF--NSKTTGIVGLG-------------------------------- 225
+T GCG GLF N+ G++GLG
Sbjct: 220 ANVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVPSLLAKANITANSFSMCFGRVIGNVG 279
Query: 226 ---GGDISLISQMRT---TIAGNQRLGVSTPDIVI-------------DSGTTLTFLPQG 266
GD Q T ++A + GV+ + + D+G++ T L +
Sbjct: 280 RISFGDRGYTDQEETPFISVAPSTAYGVNISGVSVAGDPVDIRLFAKFDTGSSFTHLREP 339
Query: 267 YNSNLLSVMSSMIE--AQPVADPTGSLELCYSFN---SLSQVPEVTIHFRGADVKLSRSN 321
L ++E +PV DP E CY + + Q P V + F G + +
Sbjct: 340 AYGVLTKSFDELVEDRRRPV-DPELPFEFCYDLSPNATTIQFPLVEMTFIGGSKIILNNP 398
Query: 322 FFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGY----DIEQQTVSFKPTDC 371
FF +++ G+ SV + N++ NF+ GY D E+ + +K + C
Sbjct: 399 FFTARTQEGNVMYCLGVLKSVGLKINVIGQNFVAGYRIVFDRERMILGWKQSLC 452
>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 64/208 (30%), Positives = 97/208 (46%), Gaps = 21/208 (10%)
Query: 31 VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSK---ASQADIIPN 87
+ + H +S SPF S D L + R + + + ++ S AS I+
Sbjct: 31 LRVFHINSQCSPFKTSVS-----WADTLLQDKARFLYLSSLAGVTKSSVPIASGRGIV-Q 84
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+ Y++R +IGTP L DT +D W C C S LFDP SS+ ++L
Sbjct: 85 SPTYIVRANIGTPAQAMLVALDTSNDAAWIPCSGC----VGCSSSVLFDPSKSSSSRTLQ 140
Query: 148 CSSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
C + QC SC+ +C ++++YG GS L +T+TL + +P TFGC
Sbjct: 141 CEAPQCKQAPNPSCTVSKSCGFNMTYG-GSAIEAYLTQDTLTLATDV-----IPNYTFGC 194
Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQ 234
N + G++GLG G +SLISQ
Sbjct: 195 -INKASGTSLPAQGLMGLGRGPLSLISQ 221
>gi|226532674|ref|NP_001151415.1| pepsin A precursor [Zea mays]
gi|195646632|gb|ACG42784.1| pepsin A [Zea mays]
Length = 492
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 54/163 (33%), Positives = 75/163 (46%), Gaps = 22/163 (13%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y RI IG+PP DTGSD++W C+ CP + +DP S T ++
Sbjct: 85 YYTRIEIGSPPKGYYVQVDTGSDILWVNGISCDGCPTRSGLGIELTQYDPAGSGT--TVG 142
Query: 148 CSSSQCASLNQKSCSGV---------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
C C + + SGV CQ+ ++YGDGS + G T+ V +G
Sbjct: 143 CEQEFCVA--NSAASGVPPACPSAASPCQFRITYGDGSSTTGFYVTDFVQYNQVSGNGQT 200
Query: 199 LP---GITFGCGTNNGGLFNSKT---TGIVGLGGGDISLISQM 235
P ITFGCG GG S + GI+G G D S++SQ+
Sbjct: 201 TPSNVSITFGCGAQLGGDLGSSSQALDGILGFGQSDASMLSQL 243
Score = 42.0 bits (97), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 36/131 (27%), Positives = 68/131 (51%), Gaps = 15/131 (11%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP-VADPTGSLELCYSFN-SL-SQVPEVTIH 309
+IDSGTTL +LP+ LL +++ + P +A +C+ F+ SL + P +T
Sbjct: 319 IIDSGTTLAYLPREVYRTLL---TAVFDKHPDLAVRNYEDFICFQFSGSLDEEFPVITFS 375
Query: 310 FRGADVKLSR--SNFFVKVSEDIVCSVF--KGIT----NSVPIYGNIMQTNFLVGYDIEQ 361
F G D+ L+ ++ + D+ C F G+ + + G+++ +N LV YD+E+
Sbjct: 376 FEG-DLTLNVYPHDYLFQNGNDLYCMGFLDGGVQTKDGKDMVLLGDLVLSNKLVVYDLEK 434
Query: 362 QTVSFKPTDCT 372
Q + + +C+
Sbjct: 435 QVIGWTDYNCS 445
>gi|218189440|gb|EEC71867.1| hypothetical protein OsI_04576 [Oryza sativa Indica Group]
Length = 508
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 66/249 (26%), Positives = 104/249 (41%), Gaps = 37/249 (14%)
Query: 83 DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC-----YMQDSPLFDP 137
D N Y++ S+GTPP V D SD +W QC C + C +P F
Sbjct: 89 DPATNTGMYVLSFSVGTPPQVVTGVLDITSDFVWMQCSAC--ATCGADAPAATSAPPFYA 146
Query: 138 KMSSTYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSN--GNLATETVTLGSTT 193
+SST + + C++ C L ++CS + C YS YG G+ + G LA + +
Sbjct: 147 FLSSTIREVRCANRGCQRLVPQTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAFAT-- 204
Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIV 253
V G+ FGC G G++GLG G++SL+SQ++ + PD
Sbjct: 205 ---VRADGVIFGCAVATEG----DIGGVIGLGRGELSLVSQLQI----GRFSYYLAPDDA 253
Query: 254 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGA 313
+D G+ + FL +V + ++ + + SL V I G
Sbjct: 254 VDVGSFILFLDDAKPRTSRAVSTPLVANR-------------ASRSLYYVELAGIRVDGE 300
Query: 314 DVKLSRSNF 322
D+ + R F
Sbjct: 301 DLAIPRGTF 309
>gi|21717171|gb|AAM76364.1|AC074196_22 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433290|gb|AAP54828.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|125532789|gb|EAY79354.1| hypothetical protein OsI_34483 [Oryza sativa Indica Group]
Length = 382
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 54/146 (36%), Positives = 78/146 (53%), Gaps = 8/146 (5%)
Query: 96 SIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCAS 155
+IGTPP A D G L+WTQC C S C+ Q+ P FDP SSTY+ PC ++ C
Sbjct: 29 TIGTPPQPASAFIDVGGLLVWTQCSQCSSSSCFNQELPPFDPTKSSTYRPEPCGTALCEF 88
Query: 156 L--NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNGG 212
+ ++CSG C Y S ++G + T+ V +G+ T +VA FGC ++
Sbjct: 89 FPASIRNCSGDVCAYEASTQLFEHTSGKIGTDAVAIGTATAASVA-----FGCVMASDIK 143
Query: 213 LFNSKTTGIVGLGGGDISLISQMRTT 238
L + +G VGL +SL++QM T
Sbjct: 144 LMDGGPSGFVGLARTPLSLVAQMNVT 169
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/133 (24%), Positives = 64/133 (48%), Gaps = 13/133 (9%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE----LCYSFNSLSQVPEVT 307
+++ + + ++FL G +L +++ + P A P + LC+ +S P+V
Sbjct: 246 VLLQTFSPVSFLVDGVYQDLKKAVTAAV-GGPTATPPEQFQSIFDLCFKRGGVSGAPDVV 304
Query: 308 IHFRGAD-VKLSRSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYDI 359
+ F+GA + + +N+ + V +D VC + I G + Q N YD+
Sbjct: 305 LTFQGAAALTVPPTNYLLDVGDDTVCVAIASSARLNSTEVAGMSILGGLQQQNVHFLYDL 364
Query: 360 EQQTVSFKPTDCT 372
E++T+SF+ DC+
Sbjct: 365 EKETLSFEAADCS 377
>gi|226495667|ref|NP_001146721.1| uncharacterized protein LOC100280323 [Zea mays]
gi|219888491|gb|ACL54620.1| unknown [Zea mays]
Length = 557
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 73/258 (28%), Positives = 107/258 (41%), Gaps = 32/258 (12%)
Query: 2 ATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKS---PFYNSS----------- 47
A L + + + CFY S + Q G E R+ +S P Y +
Sbjct: 83 ALVLGALAVAAYYCFY--SDVAVQFLGMEQEEEQRNETRSFLLPLYPKARQGRALREFGD 140
Query: 48 -ETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS---QADIIPNNANYLIRISIGTPPTE 103
+ +R+ D ++ NR+ ++ ++S A + ++ P+ Y I IG PP
Sbjct: 141 VKLAARRVDDGGRKARNRMEVAKAATARTNSTALLPIKGNVFPD-GQYYTSIFIGNPPRP 199
Query: 104 RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKSC 161
DTGSDL W QC+ P + PL+ P + K +P C L NQ C
Sbjct: 200 YFLDVDTGSDLTWIQCD-APCTNFAKGPHPLYKP---AKEKIVPPRDLLCQELQGNQNYC 255
Query: 162 SGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNS---K 217
C Y + Y D S S G LA + + + +T G L FGC + G S K
Sbjct: 256 ETCKQCDYEIEYADQSSSMGVLARDDMHMIATNGGREKL-DFVFGCAYDQQGQLLSSPAK 314
Query: 218 TTGIVGLGGGDISLISQM 235
T GI+GL IS SQ+
Sbjct: 315 TDGILGLSSAAISFPSQL 332
>gi|359492489|ref|XP_002285867.2| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
Length = 453
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 88/372 (23%), Positives = 140/372 (37%), Gaps = 90/372 (24%)
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKM 139
Q ++ P Y + + IG PP D+GSDL W QC+ PC C P + P
Sbjct: 59 QGNVYPQGF-YSVSLRIGNPPKPYTLDIDSGSDLTWLQCDAPC--VSCTKAPHPPYKPNK 115
Query: 140 SSTYKSLPCSSSQCASLNQKS-----CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
+ C+ C++L+ S S C Y VSY D S G L + +L T G
Sbjct: 116 G----PITCNDPMCSALHWPSKPPCKASHEQCDYEVSYADHGSSLGVLVHDIFSLQLTNG 171
Query: 195 QAVALPGITFGCGTNN---GGLFNSKTTGIVGLGGGDISLISQMRT-----TIAGN---- 242
+A P + FGCG + G G++GLG G S+++Q+R+ +I G+
Sbjct: 172 -TLAAPRLAFGCGYDQSYPGPNAPPFVDGVLGLGYGKSSIVTQLRSLGLIRSIVGHCLSG 230
Query: 243 -------------------------------------------QRLGVSTPDIVIDSGTT 259
Q GV +V DSG++
Sbjct: 231 RGGGFLFLGDGLSTTPGIIWTPMSRKSGESAYALGPADLLFNGQNSGVKGLRLVFDSGSS 290
Query: 260 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS----FNSLSQVPEVTIHF----- 310
T+ LS++ + + SL +C+ F S+ +V F
Sbjct: 291 YTYFNAQAYKTTLSLVRKYLNGKLKETADESLPVCWRGAKPFKSIFEVKNYFKPFALSFT 350
Query: 311 --RGADVKLSRSNFFVKVSEDIVC-------SVFKGITNSVPIYGNIMQTNFLVGYDIEQ 361
+ A ++L ++ + C V G +N + G+I + +V YD E+
Sbjct: 351 KAKSAQLQLPPESYLIISKHGNACLGILNGSEVGLGDSN---VIGDIAFQDKMVIYDNER 407
Query: 362 QTVSFKPTDCTK 373
Q + + P DC K
Sbjct: 408 QQIGWVPKDCNK 419
>gi|357120129|ref|XP_003561782.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 452
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 65/200 (32%), Positives = 95/200 (47%), Gaps = 43/200 (21%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
+N + + +++GTPP V DTGS+L W C D+P FD SS+Y +
Sbjct: 59 HNVSLTVPVAVGTPPQNVTMVLDTGSELSWLLC------NGSRHDAP-FDASASSSYAPV 111
Query: 147 PCSSSQCASLNQKS-----CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
PCSS C L + C C+ S+SY D S ++G LA +T LGS+ A+
Sbjct: 112 PCSSPACTWLGRDLPVRPFCDSSACRVSLSYADASSADGLLAADTFLLGSSPMPAL---- 167
Query: 202 ITFGCGTNNGGLFNSKT-------TGIVGLGGGDISLISQMRTT-----IAGNQRLGVST 249
FGC T+ ++S T TG++G+ G +S ++Q T IA Q
Sbjct: 168 --FGCITS----YSSSTDPSETPPTGLLGMNRGGLSFVTQTATRRFAYCIAAGQ-----G 216
Query: 250 PDIVI----DSGTTLTFLPQ 265
P I++ D+ T LT PQ
Sbjct: 217 PGILLLGGNDTETPLTSPPQ 236
>gi|297801286|ref|XP_002868527.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314363|gb|EFH44786.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 444
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 49/153 (32%), Positives = 71/153 (46%), Gaps = 15/153 (9%)
Query: 92 LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
++ + IGTP + V DTGS L W QC P + + FDP +SS++ LPCS
Sbjct: 82 ILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHP 141
Query: 152 QCA------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
C +L S C YS Y DG+F+ GNL E T ++ P + G
Sbjct: 142 LCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQ----TTPPLILG 197
Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
C ++ GI+G+ G +S ISQ + +
Sbjct: 198 CAKE-----STDVKGILGMNLGRLSFISQAKIS 225
>gi|125606590|gb|EAZ45626.1| hypothetical protein OsJ_30294 [Oryza sativa Japonica Group]
Length = 431
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 46/123 (37%), Positives = 62/123 (50%), Gaps = 24/123 (19%)
Query: 93 IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
+ + IGTP V DT SDL+WTQC+PC C Q ++DP + TY +L SS
Sbjct: 90 VFLGIGTPAMNVTLVFDTTSDLLWTQCQPC--LSCVAQAGDMYDPNKTETYANLTSSS-- 145
Query: 153 CASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGG 212
Y+ +Y SF++G ATET LG+ T + ITFGCGT N G
Sbjct: 146 ---------------YNYTYSKQSFTSGYFATETFALGNVT-----VANITFGCGTRNQG 185
Query: 213 LFN 215
++
Sbjct: 186 YYD 188
Score = 51.2 bits (121), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 38/133 (28%), Positives = 67/133 (50%), Gaps = 13/133 (9%)
Query: 252 IVIDSGTTLTFLPQG----YNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPE-- 305
+VIDS + +T L + L++ ++ + EA A L+LC+ + P
Sbjct: 291 LVIDSTSPVTVLDEATYGPVRRALVAQLAPLKEANANASAGVGLDLCFELAAGGATPTPP 350
Query: 306 ---VTIHFRG--ADVKLSRSNFFVKVSED-IVC-SVFKGITNSVPIYGNIMQTNFLVGYD 358
+T+HF G AD+ L +++ K S ++C ++ +N VP+ G+ + LV YD
Sbjct: 351 NVTMTLHFDGGAADLVLPPASYLAKDSAGGLICLTMTPSSSNGVPVLGSWALLDTLVLYD 410
Query: 359 IEQQTVSFKPTDC 371
+ + VSF+P DC
Sbjct: 411 LAKNVVSFQPLDC 423
>gi|302757589|ref|XP_002962218.1| hypothetical protein SELMODRAFT_403844 [Selaginella moellendorffii]
gi|300170877|gb|EFJ37478.1| hypothetical protein SELMODRAFT_403844 [Selaginella moellendorffii]
Length = 353
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 84/292 (28%), Positives = 128/292 (43%), Gaps = 42/292 (14%)
Query: 97 IGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL 156
+GTP E LA+ DT DL+W Q E + SS++K++ CS S+C L
Sbjct: 88 LGTPEQEILAIIDTALDLVWAQVE-----------------ERSSSFKNVSCSDSRC-RL 129
Query: 157 NQKSCS-GVN-CQYSVSYGDGSFSN-GNLATETVTLGSTTG---QAVALPGITFGCGTNN 210
CS G N C Y S G G LATETVTL G + + +P FGC
Sbjct: 130 TPSHCSDGSNTCIYYPSSAIGHAGRGGRLATETVTLVYARGRWTERIPVPDTLFGCERKT 189
Query: 211 GGLFNSKTTGIVGLGGGDISL-ISQM----RTTIAGNQRLGVSTPDIVIDSGTTLTFLPQ 265
NS+ + + S +S M R I G GV T ++ G + +
Sbjct: 190 EA-HNSRHSYYSEITENKFSYCLSSMLFLGRARIPGE---GVQTIPMLSSPGHGHYYFAE 245
Query: 266 GYNSNLLSVMSSMIEAQPVADPTGSLELCYS--FNSLSQVPEVTIHFRGADVKLSRSNFF 323
++V S+I A D +LELCYS + + P + +H A + LS+ N+
Sbjct: 246 ---LRAITVGFSVI-AIARNDSDANLELCYSTALDPSYKFPSMELHPESARMVLSQKNYI 301
Query: 324 VKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
+ C V + + V + G++MQ ++ + +D T+SF P C++
Sbjct: 302 LSNGSGWAC-VATAMRDPGDVSVIGSLMQRDYHILFDNPGSTISFAPATCSE 352
>gi|302141796|emb|CBI18999.3| unnamed protein product [Vitis vinifera]
Length = 390
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 88/372 (23%), Positives = 140/372 (37%), Gaps = 90/372 (24%)
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKM 139
Q ++ P Y + + IG PP D+GSDL W QC+ PC C P + P
Sbjct: 26 QGNVYPQGF-YSVSLRIGNPPKPYTLDIDSGSDLTWLQCDAPC--VSCTKAPHPPYKPNK 82
Query: 140 SSTYKSLPCSSSQCASLNQKS-----CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
+ C+ C++L+ S S C Y VSY D S G L + +L T G
Sbjct: 83 G----PITCNDPMCSALHWPSKPPCKASHEQCDYEVSYADHGSSLGVLVHDIFSLQLTNG 138
Query: 195 QAVALPGITFGCGTNN---GGLFNSKTTGIVGLGGGDISLISQMRT-----TIAGN---- 242
+A P + FGCG + G G++GLG G S+++Q+R+ +I G+
Sbjct: 139 -TLAAPRLAFGCGYDQSYPGPNAPPFVDGVLGLGYGKSSIVTQLRSLGLIRSIVGHCLSG 197
Query: 243 -------------------------------------------QRLGVSTPDIVIDSGTT 259
Q GV +V DSG++
Sbjct: 198 RGGGFLFLGDGLSTTPGIIWTPMSRKSGESAYALGPADLLFNGQNSGVKGLRLVFDSGSS 257
Query: 260 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS----FNSLSQVPEVTIHF----- 310
T+ LS++ + + SL +C+ F S+ +V F
Sbjct: 258 YTYFNAQAYKTTLSLVRKYLNGKLKETADESLPVCWRGAKPFKSIFEVKNYFKPFALSFT 317
Query: 311 --RGADVKLSRSNFFVKVSEDIVC-------SVFKGITNSVPIYGNIMQTNFLVGYDIEQ 361
+ A ++L ++ + C V G +N + G+I + +V YD E+
Sbjct: 318 KAKSAQLQLPPESYLIISKHGNACLGILNGSEVGLGDSN---VIGDIAFQDKMVIYDNER 374
Query: 362 QTVSFKPTDCTK 373
Q + + P DC K
Sbjct: 375 QQIGWVPKDCNK 386
>gi|18421660|ref|NP_568551.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|10177438|dbj|BAB10671.1| unnamed protein product [Arabidopsis thaliana]
gi|15809850|gb|AAL06853.1| AT5g37540/mpa22_p_70 [Arabidopsis thaliana]
gi|20260182|gb|AAM12989.1| unknown protein [Arabidopsis thaliana]
gi|23197748|gb|AAN15401.1| unknown protein [Arabidopsis thaliana]
gi|332006821|gb|AED94204.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 58/193 (30%), Positives = 84/193 (43%), Gaps = 27/193 (13%)
Query: 64 RLNHFNQNSSISSSKASQADIIPNNANYLIR------------ISIGTPPTERLAVADTG 111
RL +SS +S S+ + P ++ Y R + IGTP + V DTG
Sbjct: 41 RLTPTTNSSSFKTSLLSRRNPSPPSSPYTFRSNIKYSMALILSLPIGTPSQSQELVLDTG 100
Query: 112 SDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCA------SLNQKSCSGVN 165
S L W QC P + + FDP +SS++ LPCS C +L S
Sbjct: 101 SQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHPLCKPRIPDFTLPTSCDSNRL 160
Query: 166 CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLG 225
C YS Y DG+F+ GNL E T ++ P + GC ++ GI+G+
Sbjct: 161 CHYSYFYADGTFAEGNLVKEKFTFSNSQ----TTPPLILGCAKE-----STDEKGILGMN 211
Query: 226 GGDISLISQMRTT 238
G +S ISQ + +
Sbjct: 212 LGRLSFISQAKIS 224
>gi|326517745|dbj|BAK03791.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 556
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 48/160 (30%), Positives = 78/160 (48%), Gaps = 15/160 (9%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP--LFDPKMSSTYK 144
NN +L+ I +GTPP L DTG+ L + QCEPC +C+ Q +FDP S ++
Sbjct: 202 NNFLFLMPIKLGTPPVWNLVAVDTGATLSFVQCEPC-TLRCHKQTDAGEIFDPSKSESFS 260
Query: 145 SLPCSSSQCAS------LNQKSC--SGVNCQYSVSYGD-GSFSNGNLATETVTLGSTTGQ 195
+ CS ++C + L K+C +C YS+++G S+S G L + + +G +
Sbjct: 261 RVGCSENKCRTVQRALHLQSKACMEKEDSCLYSMTFGGTSSYSVGKLVRDRLAIGK-YAK 319
Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
+ P FGC + ++ G+VG S Q+
Sbjct: 320 GYSFPDFLFGCSLDTE--YHQYEAGLVGFADEPFSFFEQV 357
>gi|56784779|dbj|BAD82000.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 486
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 64/245 (26%), Positives = 103/245 (42%), Gaps = 37/245 (15%)
Query: 87 NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC-----YMQDSPLFDPKMSS 141
N Y++ S+GTPP V D SD +W QC C + C +P F +SS
Sbjct: 93 NTGMYVLSFSVGTPPQVVTGVLDITSDFVWMQCSAC--ATCGADAPAATSAPPFYAFLSS 150
Query: 142 TYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSF--SNGNLATETVTLGSTTGQAV 197
T + + C++ C L ++CS + C YS YG G+ + G LA + + V
Sbjct: 151 TIREVRCANRGCQRLVPQTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAFAT-----V 205
Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVIDSG 257
G+ FGC G G++GLG G++S +SQ++ + PD +D G
Sbjct: 206 RADGVIFGCAVATEG----DIGGVIGLGRGELSPVSQLQI----GRFSYYLAPDDAVDVG 257
Query: 258 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKL 317
+ + FL +V + ++ ++ + SL V I G D+ +
Sbjct: 258 SFILFLDDAKPRTSRAVSTPLVASR-------------ASRSLYYVELAGIRVDGEDLAI 304
Query: 318 SRSNF 322
R F
Sbjct: 305 PRGTF 309
>gi|125527257|gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group]
Length = 484
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 50/173 (28%), Positives = 71/173 (41%), Gaps = 20/173 (11%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP-------------PSQCYMQDSPLFD 136
Y +R +GTP L VADTGSDL W +C P+ F
Sbjct: 86 QYFVRFRVGTPAQPFLLVADTGSDLTWVKCHRAAAAASASPRNASSLPAPAPASPRRTFR 145
Query: 137 PKMSSTYKSLPCSSSQCA-----SLNQKSCSGVNCQYSVSYGDGSFSNGNLATE--TVTL 189
P S T+ +PCSS+ C SL + C Y Y DGS + G + + T+ L
Sbjct: 146 PDKSRTWAPIPCSSATCRESLPFSLAACATPANPCAYDYRYKDGSAARGTVGVDSATIAL 205
Query: 190 GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGN 242
+ L G+ GC T+ G + G++ LG +IS S+ + G
Sbjct: 206 SGRAARKAKLRGVVLGCTTSYNGQSFLASDGVLSLGYSNISFASRAASRFGGR 258
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 32/129 (24%), Positives = 64/129 (49%), Gaps = 13/129 (10%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP--VADPTGSLELCYSFNSLS------QVP 304
++DSGT+LT L + +++ +S + P DP + CY++ S S +P
Sbjct: 357 ILDSGTSLTMLAKPAYRAVVAALSKRLAGLPRVTMDP---FDYCYNWTSPSGSDVAAPLP 413
Query: 305 EVTIHFRG-ADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 362
+ +HF G A ++ ++ + + + C + +G + + GNI+Q L YD++ +
Sbjct: 414 MLAVHFAGSARLEPPAKSYVIDAAPGVKCIGLQEGPWPGLSVIGNILQQEHLWEYDLKNR 473
Query: 363 TVSFKPTDC 371
+ FK + C
Sbjct: 474 RLRFKRSRC 482
>gi|359476195|ref|XP_002268758.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
gi|296082174|emb|CBI21179.3| unnamed protein product [Vitis vinifera]
Length = 460
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 95/384 (24%), Positives = 144/384 (37%), Gaps = 93/384 (24%)
Query: 60 RSLNR--LNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWT 117
RS+N ++ S D + + +L+ + GTP + + DTGSD W
Sbjct: 96 RSINAKIFGQYSTQESKDGWSPESMDTLNEDGLFLVNVGFGTPQQKFNLIIDTGSDTTWI 155
Query: 118 QCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSF 177
QC C C+ + + F+P +SS+Y + C S + Y++ Y D S+
Sbjct: 156 QCNSCSLGNCHNKKT--FNPSLSSSYSNRSCIPS------------TDTNYTMKYEDNSY 201
Query: 178 SNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNS--------------------- 216
S G + VTL + P FGCG + GG F +
Sbjct: 202 SKGVFVCDEVTL-----KPDVFPKFQFGCGDSGGGEFGTASGVLGLAKGEQYSLISQTAS 256
Query: 217 --------------KTTGIVGLGGGDISLISQMRTTIAGN------------------QR 244
T G + G IS ++ T N +R
Sbjct: 257 KFKKKFSYCFPPKEHTLGSLLFGEKAISASPSLKFTQLLNPPSGLGYFVELIGISVAKKR 316
Query: 245 LGVST-----PDIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGS--LELCYS 296
L VS+ P +IDSGT +T LP Y + + M+ ++ P L+ CY+
Sbjct: 317 LNVSSSLFASPGTIIDSGTVITRLPTAAYEALRTAFQQEMLHCPSISPPPQEKLLDTCYN 376
Query: 297 FNSLS----QVPEVTIHFRG-ADVKLSRSNFFVKVSEDI--VCSVFKGITN--SVPIYGN 347
++PE+ +HF G DV L S + + D+ C F +N V I GN
Sbjct: 377 LKGCGGRNIKLPEIVLHFVGEVDVSLHPSG-ILWANGDLTQACLAFARKSNPSHVTIIGN 435
Query: 348 IMQTNFLVGYDIEQQTVSFKPTDC 371
Q + V YDIE + F DC
Sbjct: 436 RQQVSLKVVYDIEGGRLGFG-NDC 458
>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 506
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 49/162 (30%), Positives = 74/162 (45%), Gaps = 17/162 (10%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y R+ +G P E DTGSD++W C P CP S F+P SST +P
Sbjct: 89 YFTRVKLGNPAKEYFVQIDTGSDILWVACSPCTGCPTSSGLNIQLEFFNPDSSSTSSRIP 148
Query: 148 CSSSQCASLNQKS---CSGVN-----CQYSVSYGDGSFSNGNLATETVTLGSTTGQ---A 196
CS +C + Q C + C Y+ +YGDGS ++G ++T+ + G A
Sbjct: 149 CSDDRCTAALQTGEAVCQSSDSPSSPCGYTFTYGDGSGTSGFYVSDTMYFDTVMGNEQTA 208
Query: 197 VALPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM 235
+ + FGC + G + GI G G +S++SQ+
Sbjct: 209 NSSASVVFGCSNSQSGDLMKTDRAVDGIFGFGQHQLSVVSQL 250
>gi|51038078|gb|AAT93881.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 481
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 67/236 (28%), Positives = 107/236 (45%), Gaps = 35/236 (14%)
Query: 24 AQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD 83
A +EL H D+ N T +R+R A R+ +R + +++ ++ +
Sbjct: 18 AGGAALRLELAHVDA------NEHCTMEERVRRATERTHHR-RLLHASTAAAAGGVAAPL 70
Query: 84 IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC--------PPSQCYMQDSPLF 135
Y+ IG PP AV DTGSDL+WTQC C C+ Q+ P +
Sbjct: 71 RWSGKTQYIASYGIGDPPQPAEAVVDTGSDLVWTQCSTCRLPAAAAAGGGGCFPQNLPYY 130
Query: 136 DPKMSSTYKSLPCS---------SSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATE 185
+ +S T +++PC + + A + SG + C + SYG G + G L T+
Sbjct: 131 NFSLSRTARAVPCDDDDGALCGVAPETAGCARGGGSGDDACVVAASYGAG-VALGVLGTD 189
Query: 186 TVTLGSTTGQAVALPGITFGCGTN---NGGLFNSKTTGIVGLGGGDISLISQMRTT 238
T S++ +A FGC + + G N +GI+GLG G +SL+SQ+ T
Sbjct: 190 AFTFPSSSSVTLA-----FGCVSQTRISPGALNG-ASGIIGLGRGALSLVSQLNAT 239
Score = 43.1 bits (100), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 42/181 (23%), Positives = 71/181 (39%), Gaps = 34/181 (18%)
Query: 221 IVGLGGGDISLISQMRTTIAGNQRLGVSTPDI-----VIDSGTTLTFLPQGYNSNLLSVM 275
+VGL G+ ++ AG L + P + +IDSG+ T L + L +
Sbjct: 304 LVGLAAGNATV-----ALPAGAFDLREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKEL 358
Query: 276 SSMIEAQ-----PVADPTGSLELCYSFN------SLSQVPEVTIHFR-----GADVKLSR 319
+ + P A G+LELC + + VP + + F G ++ +
Sbjct: 359 ARQLRGSGSLVPPPAKLGGALELCVEAGDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPA 418
Query: 320 SNFFVKVSEDIVCSVFKG--------ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
++ +V C TN I GN MQ + V YD+ +SF+P +C
Sbjct: 419 EKYWARVEASTWCMAVVSSASGNATLPTNETTIIGNFMQQDMRVLYDLANGLLSFQPANC 478
Query: 372 T 372
+
Sbjct: 479 S 479
>gi|376337718|gb|AFB33415.1| hypothetical protein 2_5996_01, partial [Pinus mugo]
gi|376337720|gb|AFB33416.1| hypothetical protein 2_5996_01, partial [Pinus mugo]
Length = 154
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 52/153 (33%), Positives = 72/153 (47%), Gaps = 12/153 (7%)
Query: 28 GFSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSKAS----QA 82
G ++ L H SP ++ + + L +L R +RL +S + S Q+
Sbjct: 3 GVNIRLDHIHGACSPLRPTNSSKWIDLVSQSLERDNDRLKTIRSRNSGPYTTMSNLPLQS 62
Query: 83 DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
NY++ GTP + L V DTGSDL W QC+PC CY Q P+F+P SS+
Sbjct: 63 GSEVGTGNYILTAGFGTPTKKFLLVIDTGSDLTWIQCKPC--LGCYSQVDPIFEPSQSSS 120
Query: 143 YKSLPCSSSQCASL-----NQKSCSGVNCQYSV 170
YKSLPC S+ C L N C C Y +
Sbjct: 121 YKSLPCLSATCTELLTSESNLTPCLLGGCSYEI 153
>gi|242067691|ref|XP_002449122.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
gi|241934965|gb|EES08110.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
Length = 407
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 85/375 (22%), Positives = 146/375 (38%), Gaps = 94/375 (25%)
Query: 82 ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQCYMQDSPLFDPKM 139
D+ P ++ + ++IG P DTGS+L W +C P P C PL+ PK
Sbjct: 32 GDVHPT-GHFYVTMNIGEPAKPYFLDIDTGSNLTWIKCHATPGPCKTCNKVPHPLYRPK- 89
Query: 140 SSTYKSLPCSSSQCASLNQ-----KSCSGV--NCQYSVSYGDGSFSNGNLATETVTLGST 192
K +PC+ C +L++ K C C Y ++Y DG+ S G L + +L +
Sbjct: 90 ----KLVPCADPLCDALHKDLGTTKDCREEPDQCHYQINYADGTTSLGVLLLDKFSLPTG 145
Query: 193 TGQAVALPGITFGCGTNNGGLFNSKT------TGIVGLG--------------------- 225
+ + +A FGCG + K GI+GLG
Sbjct: 146 SARNIA-----FGCGYDQMQGPKKKAPEKVPVDGILGLGRGSVDLVSQLKHSGAVSKNVI 200
Query: 226 -------GGDISLISQ--------------------------MRTTIAGNQRLGVSTPDI 252
GG I + T G +G
Sbjct: 201 GHCLSSKGGGYLFIGEENVPSSHLHIIYIYCISREPNHYSPGQATLHLGRNPIGTKPFKA 260
Query: 253 VIDSGTTLTFLPQGYNSNLLSVM-SSMIEA--QPVADPTGSLELCYS----FNSLSQVPE 305
+ DSG+T T+LP+ ++ L+S + +S+I++ + V+D L LC+ F ++ +P+
Sbjct: 261 IFDSGSTYTYLPENLHAQLVSALKASLIKSSLKLVSDTDTRLHLCWKGPKPFKTVHDLPK 320
Query: 306 -----VTIHF-RGADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYD 358
VT+ F G + + N+ + C + + + + G I LV +D
Sbjct: 321 EFKSLVTLKFDHGVTMTIPPENYLIITGHGNACFGILELPGYDLFVIGGISMQEQLVIHD 380
Query: 359 IEQQTVSFKPTDCTK 373
E+ +++ P+ C K
Sbjct: 381 NEKGRLAWMPSPCDK 395
>gi|242094478|ref|XP_002437729.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
gi|241915952|gb|EER89096.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
Length = 486
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 50/136 (36%), Positives = 73/136 (53%), Gaps = 13/136 (9%)
Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKS--C-SG 163
V DT D+ W +C PC +QC +DP SSTY + PC+SS C L + + C +
Sbjct: 166 VLDTAGDVPWMRCVPCTFAQCAD-----YDPTRSSTYSAFPCNSSACKQLGRYANGCDAN 220
Query: 164 VNCQYSV-SYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIV 222
CQY V + GD ++G +++ +T+ S G V G FGC N G F ++ GI+
Sbjct: 221 GQCQYMVVTAGDSFTTSGTYSSDVLTINS--GDRVE--GFRFGCSQNEQGSFENQADGIM 276
Query: 223 GLGGGDISLISQMRTT 238
LG G SL++Q +T
Sbjct: 277 ALGRGVQSLMAQTSST 292
>gi|357483911|ref|XP_003612242.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355513577|gb|AES95200.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 527
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 92/360 (25%), Positives = 144/360 (40%), Gaps = 90/360 (25%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY--MQDSP-------LFDPKMSS 141
+ +S+GTP + L DTGSDL W PC ++C +Q S ++D K SS
Sbjct: 113 HFANVSVGTPASSYLVALDTGSDLFWL---PCNCTKCVHGIQLSTGQKIAFNIYDNKESS 169
Query: 142 TYKSLPCSSSQCASLNQ-KSCSGVNCQYSVSY-GDGSFSNGNLATETVTLGSTTGQAV-- 197
T K++ C+SS C Q S SG C Y V Y + + + G L + + L +
Sbjct: 170 TSKNVACNSSLCEQKTQCSSSSGGTCPYQVEYLSENTSTTGFLVEDVLHLITDNDDQTQH 229
Query: 198 ALPGITFGCGTNNGGLF--NSKTTGIVGLGGGDISLISQM-------------------- 235
A P ITFGCG G F + G+ GLG D+S+ S +
Sbjct: 230 ANPLITFGCGQVQTGAFLDGAAPNGLFGLGMSDVSVPSILAKQGLTSNSFSMCFAADGLG 289
Query: 236 RTTIAGNQR--------------------------LGVSTPDI----VIDSGTTLTFLPQ 265
R T N +G ++ D+ + D+GT+ T+L
Sbjct: 290 RITFGDNNSSLDQGKTPFNIRPSHSTYNITVTQIIVGGNSADLEFNAIFDTGTSFTYLNN 349
Query: 266 GYNSNLLSVMSSMIEAQPVADPTGS---LELCYSF--NSLSQVPEVTIHFRGADVKLSRS 320
+ S I+ Q + E CY N +VP + + +G D
Sbjct: 350 PAYKQITQSFDSKIKLQRHSFSNSDDLPFEYCYDLRTNQTIEVPNINLTMKGGD------ 403
Query: 321 NFFVKV--------SEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
N+FV + ++C +V K +N+V I G T + + +D E T+ +K ++C
Sbjct: 404 NYFVMDPIITSGGGNNGVLCLAVLK--SNNVNIIGQNFMTGYRIVFDRENMTLGWKESNC 461
>gi|47497551|dbj|BAD19623.1| nucellin-like aspartic protease-like [Oryza sativa Japonica Group]
gi|47847593|dbj|BAD21980.1| nucellin-like aspartic protease-like [Oryza sativa Japonica Group]
Length = 297
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 50/158 (31%), Positives = 76/158 (48%), Gaps = 13/158 (8%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y RI IGTP DTGSD++W C+ CP + ++DP+ S + + +
Sbjct: 90 YFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELVT 149
Query: 148 CSSSQCASLNQK---SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
C C + SC+ + C+YS+SYGDGS + G T+ + +G P
Sbjct: 150 CDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTPANA 209
Query: 202 -ITFGCGTNNGGLFNSKT---TGIVGLGGGDISLISQM 235
++FGCG GG S GI+G G + S++SQ+
Sbjct: 210 SVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQL 247
>gi|356508308|ref|XP_003522900.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 439
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 58/195 (29%), Positives = 86/195 (44%), Gaps = 19/195 (9%)
Query: 23 EAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSK---- 78
+ Q G ++E+ H SP SPF P L +S+ + +
Sbjct: 28 DTQDHGSTLEVFHVFSPCSPF--RPPKPLSWAESVLQLQAKDQARLQFLASMVAGRSVVP 85
Query: 79 -ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
AS II + Y++R IG+PP L DT +D W C C C S LF P
Sbjct: 86 IASGRQII-QSPTYIVRAKIGSPPQTLLLAMDTSNDAAWIPCTAC--DGC---TSTLFAP 139
Query: 138 KMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
+ S+T+K++ C S QC + SC C ++++YG S + N+ +TVTL +
Sbjct: 140 EKSTTFKNVSCGSPQCNQVPNPSCGTSACTFNLTYGSSSIA-ANVVQDTVTLATD----- 193
Query: 198 ALPGITFGCGTNNGG 212
+P TFGC G
Sbjct: 194 PIPDYTFGCVAKTTG 208
>gi|356518800|ref|XP_003528065.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 438
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 95/394 (24%), Positives = 153/394 (38%), Gaps = 103/394 (26%)
Query: 65 LNHFNQNSSISSSKASQADIIPNNAN------YLIRISIGTPPTERLAVADTGSDLIWTQ 118
LN F SS+ + P + N Y + ++IG PP DTGSDL W Q
Sbjct: 54 LNRFRAGSSV---------VFPVHGNVYPVGFYNVTLNIGQPPRPYFLDIDTGSDLTWLQ 104
Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK---SCSGVN-CQYSVSYGD 174
C+ P S+C PL+ P + +PC S CASL+ C + C Y V Y D
Sbjct: 105 CD-APCSRCSQTPHPLYRP----SNDFVPCRHSLCASLHHSDNYDCEVPHQCDYEVQYAD 159
Query: 175 GSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLF----NSKTTGIVGLGGGDIS 230
S G L + TL T G + + + GCG + +F + G++GLG G S
Sbjct: 160 HYSSLGVLLHDVYTLNFTNGVQLKVR-MALGCGYDQ--IFPDPSHHPLDGMLGLGRGKTS 216
Query: 231 LISQM------RTTIA-------------------------------------------- 240
L SQ+ R I
Sbjct: 217 LTSQLNSQGLVRNVIGHCLSAQGGGYIFFGDVYDSSRLTWTPMSSRDYKHYSAAGAAELL 276
Query: 241 -GNQRLGVSTPDIVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPV--ADPTGSLELCY- 295
G ++ G+ + V D+G++ T+ P Y + L+S + +P+ A +L LC+
Sbjct: 277 FGGKKSGIGSLHAVFDTGSSYTYFNPYAYQA-LISWLGKESGGKPLKEAHDDQTLPLCWR 335
Query: 296 SFNSLSQVPEVTIHFRGADVKL-----SRSNFFVKVSEDIVCS----VFKGITN------ 340
+ EV +F+ + S++ F + ++ S V GI N
Sbjct: 336 GRRPFRSIYEVRKYFKPIVLSFTSNGRSKAQFEMPPEAYLIISNMGNVCLGILNGSEVGM 395
Query: 341 -SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
+ + G+I N ++ +D ++Q + + P DC +
Sbjct: 396 GDLNLIGDISMLNKVMVFDNDKQLIGWTPADCDQ 429
>gi|225465839|ref|XP_002264668.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 2 [Vitis
vinifera]
Length = 451
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 60/204 (29%), Positives = 93/204 (45%), Gaps = 35/204 (17%)
Query: 23 EAQTGGFSVELIHRDSPKSPFYNSSETPYQR-LRDALTRSLNRLNHFNQNSSISSSK--- 78
E G +++++H SP SPF ++ + + RL SS+ + K
Sbjct: 31 ETPDQGSTLQVLHVYSPCSPFRPKEPLSWEESVLQMQAKDKARLQFL---SSLVARKSVV 87
Query: 79 --ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
AS I+ N Y++R IGTP L DT SD+ W C C S LF+
Sbjct: 88 PIASGRQIV-QNPTYIVRAKIGTPAQTMLMAMDTSSDVAWIPCN-----GCLGCSSTLFN 141
Query: 137 PKMSSTYKSLPCSSSQCAS--------------LNQKSCSGVNCQYSVSYGDGSFSNGNL 182
S+TYKSL C ++QC + + +C G C ++++YG S + NL
Sbjct: 142 SPASTTYKSLGCQAAQCKQVLHLLSPLLTSPSVVPKPTCGGGVCSFNLTYGGSSLA-ANL 200
Query: 183 ATETVTLGSTTGQAVALPGITFGC 206
+ +T+TL + A+PG +FGC
Sbjct: 201 SQDTITLATD-----AVPGYSFGC 219
>gi|449449906|ref|XP_004142705.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
gi|449500739|ref|XP_004161182.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 410
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 81/361 (22%), Positives = 141/361 (39%), Gaps = 83/361 (22%)
Query: 89 ANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMSSTYKSLP 147
++ + ++IG PP DTGSDL W QC+ PC + C + L+ P +
Sbjct: 53 GHFTVSVTIGNPPKVFELDIDTGSDLTWVQCDAPC--TGCTLPHDRLYKPHNNVVR---- 106
Query: 148 CSSSQCASL---NQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
C C++L ++ C N C Y V Y D S G L + V L T G +A P +
Sbjct: 107 CGEPLCSALFSASKSPCKNPNDQCDYEVEYADHGSSIGVLVKDPVPLRLTNGTILA-PNL 165
Query: 203 TFGCG---TNNGGLFNSKTTGIVGLGGGD------ISLISQMRTTIA------------- 240
FGCG N G T G++GLG +S +S +R +
Sbjct: 166 GFGCGYDQHNGGSQLPPLTAGVLGLGNSKATMATQLSALSHVRNVLGHCFSGQGGGFLFF 225
Query: 241 --------------------------------GNQRLGVSTPDIVIDSGTTLTFLPQGYN 268
G +G+ + DSG++ T+
Sbjct: 226 GGDLVPSSGMSWMPILRTPGGKYSAGPAEVYFGGNPVGIRGLILTFDSGSSYTYFNSQVY 285
Query: 269 SNLLSVMSSMIEAQPVADP--TGSLELCYSF-NSLSQVPEVTIHFRGADVKLSRSNFFVK 325
+L+++ + ++ QP+ D +L +C+ + V +V F+ + S +
Sbjct: 286 GAVLNLLRNGLKGQPLRDAPEDKTLPICWKGSKAFKSVADVRNFFKPLALSFGNSKVQFQ 345
Query: 326 VSEDI------VCSVFKGITN-------SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
+ + + +V GI N +V + G+I + ++ YD E+Q + + P +C+
Sbjct: 346 IPPEAYLIISNLGNVCLGILNGSQVGLGNVNLIGDISMLDKMMVYDNERQQIGWAPANCS 405
Query: 373 K 373
K
Sbjct: 406 K 406
>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 458
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 52/158 (32%), Positives = 77/158 (48%), Gaps = 14/158 (8%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM-QDSPLFDPKMSSTYKSLPC 148
+Y+ R +GTPP L D +D W C C C SP FDP SSTY+ + C
Sbjct: 99 SYVARARLGTPPQTLLVAIDPSNDAAWVPCSAC--LGCAPGASSPSFDPTQSSTYRPVRC 156
Query: 149 SSSQCASLNQKSCS-----GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
+ QCA + + S G +C +++SY + + L + ++L + G AV T
Sbjct: 157 GAPQCAQVPPATPSCPAGPGASCAFNLSYASSTL-HAVLGQDALSLSDSNGAAVPDDHYT 215
Query: 204 FGC---GTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
FGC T +GG + G+VG G G +S +SQ + T
Sbjct: 216 FGCLRVVTGSGG--SVPPQGLVGFGRGPLSFLSQTKAT 251
Score = 41.6 bits (96), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 33/127 (25%), Positives = 49/127 (38%), Gaps = 8/127 (6%)
Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR- 311
++D+GT T L + L + + A P A G + CY N VP V F
Sbjct: 331 IVDAGTMFTRLSPPAYAALRNAFRRGVSA-PAAPALGGFDTCYYVNGTKSVPAVAFVFAG 389
Query: 312 GADVKLSRSNFFV-KVSEDIVCSVFK-----GITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
GA V L N + S + C G+ + + ++ Q N V +D+ V
Sbjct: 390 GARVTLPEENVVISSTSGGVACLAMAAGPSDGVNAGLNVLASMQQQNHRVVFDVGNGRVG 449
Query: 366 FKPTDCT 372
F CT
Sbjct: 450 FSRELCT 456
>gi|340810981|gb|AEK75417.1| S5 [Oryza rufipogon]
Length = 357
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 45/129 (34%), Positives = 68/129 (52%), Gaps = 19/129 (14%)
Query: 93 IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS---PLFDPKMSSTYKSLPCS 149
+ +S+G PP L DTGS L W QC+PC C+ Q + P+FDP S T + + CS
Sbjct: 1 MAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59
Query: 150 SSQCASLN------QKSC--SGVNCQYSVSYGDG-SFSNGNLATETVTLGSTTGQAVALP 200
S +C L Q +C +C YSV+YG+G ++S G + T+T+ +G +
Sbjct: 60 SVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS------FM 113
Query: 201 GITFGCGTN 209
+ FGC +
Sbjct: 114 DLMFGCSMD 122
>gi|7413629|emb|CAB85978.1| putative protein [Arabidopsis thaliana]
Length = 356
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 42/109 (38%), Positives = 60/109 (55%), Gaps = 5/109 (4%)
Query: 88 NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
+A Y + IGTPP E V DTGSDL+W C C C + + FDP SS+ L
Sbjct: 75 SALYYTTVQIGTPPRELDVVIDTGSDLVWVSCNSC--VGCPLHNVTFFDPGASSSAVKLA 132
Query: 148 CSSSQCASLNQKS--CSGV-NCQYSVSYGDGSFSNGNLATETVTLGSTT 193
CS +C+S QK CS + +C Y V YGDGS ++G ++ ++ + +
Sbjct: 133 CSDKRCSSDLQKKSRCSLLESCTYKVEYGDGSVTSGYYISDLISFDTMS 181
>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 54/163 (33%), Positives = 77/163 (47%), Gaps = 18/163 (11%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y ++ +G+PP E DTGSD++W C CP + FD SST +
Sbjct: 66 YFTKVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGQVR 125
Query: 148 CSSSQCASLNQKS---CSGV--NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV----- 197
CS C S Q + CS C Y+ YGDGS ++G ++T+ + GQ++
Sbjct: 126 CSDPICTSAVQTTATQCSSQTDQCSYTFQYGDGSGTSGYYVSDTLYFDAILGQSLIDNSS 185
Query: 198 ALPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRT 237
AL I FGC G + GI G G G++S+ISQ+ T
Sbjct: 186 AL--IVFGCSAYQSGDLTKTDKAVDGIFGFGQGELSVISQLST 226
>gi|340810961|gb|AEK75407.1| S5 [Oryza sativa]
gi|340811037|gb|AEK75445.1| S5 [Oryza rufipogon]
Length = 357
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 45/129 (34%), Positives = 68/129 (52%), Gaps = 19/129 (14%)
Query: 93 IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS---PLFDPKMSSTYKSLPCS 149
+ +S+G PP L DTGS L W QC+PC C+ Q + P+FDP S T + + CS
Sbjct: 1 MAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59
Query: 150 SSQCASLN------QKSC--SGVNCQYSVSYGDG-SFSNGNLATETVTLGSTTGQAVALP 200
S +C L Q +C +C YSV+YG+G ++S G + T+T+ +G +
Sbjct: 60 SVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS------FM 113
Query: 201 GITFGCGTN 209
+ FGC +
Sbjct: 114 DLMFGCSMD 122
>gi|168060150|ref|XP_001782061.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666472|gb|EDQ53125.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 423
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 59/195 (30%), Positives = 88/195 (45%), Gaps = 21/195 (10%)
Query: 52 QRLRDALTRSLNRLNHFNQNSSISSSKAS---QADIIPNNANYLIRISIGTPPTERLAVA 108
Q R L R L+RL SS+ + +I P+ Y+ + +G+PP
Sbjct: 3 QIRRTLLERDLSRLG----KSSVGNHSVRFHVGGNIYPDGLYYMALL-LGSPPKLYFLDM 57
Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN--- 165
DTGSDL W QC+ P C + L++PK + K + C CA + Q N
Sbjct: 58 DTGSDLTWAQCD-APCRNCAIGPHGLYNPKKA---KVVDCHLPVCAQIQQGGSYECNSDV 113
Query: 166 --CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFN---SKTTG 220
C Y V Y DGS + G L +T+T+ T G + I GCG + G + T G
Sbjct: 114 KQCDYEVEYADGSSTMGVLVEDTLTVRLTNGTLIQTKAI-IGCGYDQQGTLAKSPASTDG 172
Query: 221 IVGLGGGDISLISQM 235
++GL ++L +Q+
Sbjct: 173 VIGLSSSKVALPAQL 187
>gi|340811098|gb|AEK75475.1| S5 [Oryza nivara]
Length = 357
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 45/126 (35%), Positives = 67/126 (53%), Gaps = 19/126 (15%)
Query: 93 IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS---PLFDPKMSSTYKSLPCS 149
+ +S+G PP L DTGS L W QC+PC C+ Q + P+FDP S T + + CS
Sbjct: 1 MAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59
Query: 150 SSQCASLN------QKSC--SGVNCQYSVSYGDG-SFSNGNLATETVTLGSTTGQAVALP 200
S +C L Q +C +C YSV+YG+G ++S G + T+T+ +G +
Sbjct: 60 SVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS------FM 113
Query: 201 GITFGC 206
+ FGC
Sbjct: 114 DLMFGC 119
>gi|196212948|gb|ACG76110.1| S5 [Oryza sativa Japonica Group]
gi|340810887|gb|AEK75370.1| S5 [Oryza sativa]
gi|340810903|gb|AEK75378.1| S5 [Oryza sativa]
gi|340810921|gb|AEK75387.1| S5 [Oryza sativa]
gi|340810955|gb|AEK75404.1| S5 [Oryza sativa]
gi|340811079|gb|AEK75466.1| S5 [Oryza nivara]
gi|340811090|gb|AEK75471.1| S5 [Oryza rufipogon]
gi|340811116|gb|AEK75484.1| S5 [Oryza nivara]
Length = 357
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 45/129 (34%), Positives = 68/129 (52%), Gaps = 19/129 (14%)
Query: 93 IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS---PLFDPKMSSTYKSLPCS 149
+ +S+G PP L DTGS L W QC+PC C+ Q + P+FDP S T + + CS
Sbjct: 1 MAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59
Query: 150 SSQCASLN------QKSC--SGVNCQYSVSYGDG-SFSNGNLATETVTLGSTTGQAVALP 200
S +C L Q +C +C YSV+YG+G ++S G + T+T+ +G +
Sbjct: 60 SVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS------FM 113
Query: 201 GITFGCGTN 209
+ FGC +
Sbjct: 114 DLMFGCSMD 122
>gi|340810915|gb|AEK75384.1| S5 [Oryza sativa]
gi|340810917|gb|AEK75385.1| S5 [Oryza sativa]
gi|340810919|gb|AEK75386.1| S5 [Oryza sativa]
gi|340810927|gb|AEK75390.1| S5 [Oryza sativa]
gi|340810975|gb|AEK75414.1| S5 [Oryza nivara]
gi|340810979|gb|AEK75416.1| S5 [Oryza nivara]
gi|340810995|gb|AEK75424.1| S5 [Oryza nivara]
gi|340811027|gb|AEK75440.1| S5 [Oryza nivara]
gi|340811063|gb|AEK75458.1| S5 [Oryza nivara]
Length = 357
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 45/129 (34%), Positives = 68/129 (52%), Gaps = 19/129 (14%)
Query: 93 IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS---PLFDPKMSSTYKSLPCS 149
+ +S+G PP L DTGS L W QC+PC C+ Q + P+FDP S T + + CS
Sbjct: 1 MAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59
Query: 150 SSQCAS------LNQKSC--SGVNCQYSVSYGDG-SFSNGNLATETVTLGSTTGQAVALP 200
S +C L Q +C +C YSV+YG+G ++S G + T+T+ +G +
Sbjct: 60 SVKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS------FM 113
Query: 201 GITFGCGTN 209
+ FGC +
Sbjct: 114 DLMFGCSMD 122
>gi|340810945|gb|AEK75399.1| S5 [Oryza sativa]
gi|340810957|gb|AEK75405.1| S5 [Oryza sativa]
gi|340811007|gb|AEK75430.1| S5 [Oryza nivara]
gi|340811073|gb|AEK75463.1| S5 [Oryza rufipogon]
gi|340811094|gb|AEK75473.1| S5 [Oryza rufipogon]
Length = 357
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 45/126 (35%), Positives = 67/126 (53%), Gaps = 19/126 (15%)
Query: 93 IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS---PLFDPKMSSTYKSLPCS 149
+ +S+G PP L DTGS L W QC+PC C+ Q + P+FDP S T + + CS
Sbjct: 1 MAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59
Query: 150 SSQCAS------LNQKSC--SGVNCQYSVSYGDG-SFSNGNLATETVTLGSTTGQAVALP 200
S +C L Q +C +C YSV+YG+G ++S G + T+T+ +G +
Sbjct: 60 SVKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS------FM 113
Query: 201 GITFGC 206
+ FGC
Sbjct: 114 DLMFGC 119
>gi|340810977|gb|AEK75415.1| S5 [Oryza rufipogon]
Length = 357
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 45/129 (34%), Positives = 68/129 (52%), Gaps = 19/129 (14%)
Query: 93 IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS---PLFDPKMSSTYKSLPCS 149
+ +S+G PP L DTGS L W QC+PC C+ Q + P+FDP S T + + CS
Sbjct: 1 MAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59
Query: 150 SSQCAS------LNQKSC--SGVNCQYSVSYGDG-SFSNGNLATETVTLGSTTGQAVALP 200
S +C L Q +C +C YSV+YG+G ++S G + T+T+ +G +
Sbjct: 60 SVKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS------FM 113
Query: 201 GITFGCGTN 209
+ FGC +
Sbjct: 114 DLMFGCSMD 122
>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
Length = 477
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 52/169 (30%), Positives = 71/169 (42%), Gaps = 18/169 (10%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP---------LFDPKMS 140
Y +R +GTP L VADTGSDL W +C P+ SP F P+ S
Sbjct: 96 QYFVRFRVGTPAQPFLLVADTGSDLTWVKCR--RPASANSSLSPADSGPGPGRAFRPEDS 153
Query: 141 STYKSLPCSSSQCA-----SLNQKSCSGVNCQYSVSYGDGSFSNGNLATE--TVTLGSTT 193
T+ + C+S C SL G C Y Y DGS + G + TE T+ L
Sbjct: 154 RTWAPISCASDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSGRE 213
Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGN 242
+ L G+ GC ++ G + G++ LG IS S + G
Sbjct: 214 ERKAKLKGLVLGCSSSYTGPSFEASDGVLSLGYSGISFASHAASRFGGR 262
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 34/131 (25%), Positives = 67/131 (51%), Gaps = 13/131 (9%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP--VADPTGSLELCYSFNSLS------QV 303
+++DSGT+LT L + +++ +S + P DP E CY++ S S V
Sbjct: 349 VILDSGTSLTVLAKPAYRAVVAALSKGLAGLPRVTMDP---FEYCYNWTSPSGKDADVAV 405
Query: 304 PEVTIHFRGA-DVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQ 361
P++ +HF GA ++ ++ + + + C + +G + + GNI+Q L +DI+
Sbjct: 406 PKMAVHFAGAARLEPPGKSYVIDAAPGVKCIGLQEGPWPGISVIGNILQQEHLWEFDIKN 465
Query: 362 QTVSFKPTDCT 372
+ + F+ + CT
Sbjct: 466 RRLKFQRSRCT 476
>gi|449518783|ref|XP_004166415.1| PREDICTED: aspartic proteinase-like protein 2-like, partial
[Cucumis sativus]
Length = 420
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 53/162 (32%), Positives = 75/162 (46%), Gaps = 14/162 (8%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I IGTP + DTGSD++W QC CP + + +D + S+T K +
Sbjct: 87 YYAKIGIGTPSKDYYVQVDTGSDIVWVNCIQCRECPRTSSLGMELTPYDLEESTTGKLVS 146
Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTG--QAVALPG 201
C C +N SG ++C Y YGDGS + G + V +G + A G
Sbjct: 147 CDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDGSSTAGYFVKDYVQYNRVSGDLETTAANG 206
Query: 202 -ITFGCGTNNGGLFNS----KTTGIVGLGGGDISLISQMRTT 238
I FGCG G S GI+G G + S+ISQ+ +T
Sbjct: 207 SIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLAST 248
>gi|15241713|ref|NP_195839.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75181297|sp|Q9LZL3.1|PCS1L_ARATH RecName: Full=Aspartic proteinase PCS1; AltName: Full=Aspartic
protease 38; Short=AtASP38; AltName: Full=Protein EMBRYO
DEFECTIVE 24; AltName: Full=Protein PROMOTION OF CELL
SURVIVAL 1; Flags: Precursor
gi|7340693|emb|CAB82992.1| putative protein [Arabidopsis thaliana]
gi|50897174|gb|AAT85726.1| At5g02190 [Arabidopsis thaliana]
gi|53828617|gb|AAU94418.1| At5g02190 [Arabidopsis thaliana]
gi|110742159|dbj|BAE99007.1| hypothetical protein [Arabidopsis thaliana]
gi|332003059|gb|AED90442.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 453
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 55/149 (36%), Positives = 74/149 (49%), Gaps = 25/149 (16%)
Query: 100 PPTERLAVADTGSDLIWTQC----EPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCAS 155
PP V DTGS+L W +C P P + FDP SS+Y +PCSS C +
Sbjct: 82 PPQNISMVIDTGSELSWLRCNRSSNPNPVNN--------FDPTRSSSYSPIPCSSPTCRT 133
Query: 156 -----LNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
L SC S C ++SY D S S GNLA E G++T + + FGC +
Sbjct: 134 RTRDFLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDS----NLIFGCMGS 189
Query: 210 NGG---LFNSKTTGIVGLGGGDISLISQM 235
G ++KTTG++G+ G +S ISQM
Sbjct: 190 VSGSDPEEDTKTTGLLGMNRGSLSFISQM 218
>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 52/176 (29%), Positives = 76/176 (43%), Gaps = 24/176 (13%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP---------LFDPKMS 140
Y +R +GTP L VADTGSDL W +C P + S F P+ S
Sbjct: 94 QYFVRFRVGTPAQPFLLVADTGSDLTWVKCRPAKAAAASTNSSSSASASSPRRAFRPEKS 153
Query: 141 STYKSLPCSSSQCA-----SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLG----- 190
T+ +PC+S C+ SL+ G C Y Y DGS + G + TE+ T+
Sbjct: 154 KTWAPIPCASDTCSKSLPFSLSTCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSSSS 213
Query: 191 ---STTGQAVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGN 242
+ L G+ GC G+ G F + + G++ LG ++S S + G
Sbjct: 214 SSSKNKVKKAKLQGLVLGCTGSYTGPSFEA-SDGVLSLGYSNVSFASHAASRFGGR 268
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 33/131 (25%), Positives = 67/131 (51%), Gaps = 13/131 (9%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP--VADPTGSLELCYSFNSLSQ------V 303
+++DSGT+LT L + +++ + + P DP E CY++ S S+ +
Sbjct: 350 VIVDSGTSLTVLAKPAYRAVVAALGKKLARFPRVAMDP---FEYCYNWTSPSRKDEGDDL 406
Query: 304 PEVTIHFRG-ADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQ 361
P++ +HF G A ++ ++ + + + C V +G + + GNI+Q L +D++
Sbjct: 407 PKLAVHFAGSARLEPPSKSYVIDAAPGVKCIGVQEGPWPGISVIGNILQQEHLWEFDLKN 466
Query: 362 QTVSFKPTDCT 372
+ + FK + CT
Sbjct: 467 RRLRFKRSRCT 477
>gi|340810959|gb|AEK75406.1| S5 [Oryza sativa]
gi|340810971|gb|AEK75412.1| S5 [Oryza rufipogon]
Length = 357
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 45/129 (34%), Positives = 68/129 (52%), Gaps = 19/129 (14%)
Query: 93 IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS---PLFDPKMSSTYKSLPCS 149
+ +S+G PP L DTGS L W QC+PC C+ Q + P+FDP S T + + CS
Sbjct: 1 MAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59
Query: 150 SSQCAS------LNQKSC--SGVNCQYSVSYGDG-SFSNGNLATETVTLGSTTGQAVALP 200
S +C L Q +C +C YSV+YG+G ++S G + T+T+ +G +
Sbjct: 60 SVKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS------FM 113
Query: 201 GITFGCGTN 209
+ FGC +
Sbjct: 114 DLMFGCSMD 122
>gi|413953655|gb|AFW86304.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
Length = 535
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 84/360 (23%), Positives = 133/360 (36%), Gaps = 83/360 (23%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
Y IS+G+PP DTGS W QC+ P + C PL+ P + T +LP S
Sbjct: 160 YYTAISLGSPPRPYFLDVDTGSHTTWVQCDAPPCASCAKGAHPLYRP--ARTADALPASD 217
Query: 151 SQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN 210
C ++ + C Y +SY DGS S G +++ G+ I FGCG +
Sbjct: 218 PLCEGAQHENPN--QCDYEISYADGSSSMGVYVRDSMQFVGEDGERENA-DIVFGCGYDQ 274
Query: 211 GG-LFNSKTT---------------------GIV-------------GLGG--------- 226
G L N+ T GI+ G GG
Sbjct: 275 QGVLLNALETTDGVLGLTNKALSLPTQLASRGIISNAFGHCMSTDPSGAGGYLFLGDDYI 334
Query: 227 --------------GDISLISQMRTTIAGNQRLGVSTP--DIVIDSGTTLTFLPQGYNSN 270
D +Q++ G+Q+L +V D+G+T T+ P +
Sbjct: 335 PRWGMTWVPIRDGPADDVRRAQVKQINHGDQQLNAQGKLTQVVFDTGSTYTYFPDEALTR 394
Query: 271 LLSVMSSMIEAQPVADPTG-SLELCYSFN-SLSQVPEVTIHFRGADVKLSRSNFFVKV-- 326
L+S + + V D + +L C + + V +V F+ ++ + FF +
Sbjct: 395 LISSLKEAASPRFVQDDSDKTLPFCMKSDFPVRSVEDVKHFFKPLSLQFEKRFFFSRTFN 454
Query: 327 -----------SEDIVCSVFKGIT---NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
++ V G T +SV I G++ LV YD ++ V + DCT
Sbjct: 455 IRPEHYLVISDKGNVCLGVLNGTTIGYDSVVIVGDVSLRGKLVAYDNDKNEVGWVDFDCT 514
>gi|255541790|ref|XP_002511959.1| protein with unknown function [Ricinus communis]
gi|223549139|gb|EEF50628.1| protein with unknown function [Ricinus communis]
Length = 583
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 60/206 (29%), Positives = 88/206 (42%), Gaps = 22/206 (10%)
Query: 69 NQNSSISSSKASQAD---IIPNNAN------YLIRISIGTPPTERLAVADTGSDLIWTQC 119
N+N ++SS A+ D + P N Y I +G PP DT SDL W QC
Sbjct: 177 NRNYKLASSNAAAVDSSSVFPVRGNVYPDGLYFTYILVGNPPRPYYLDIDTASDLTWIQC 236
Query: 120 EPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV-----NCQYSVSYGD 174
+ P + C + L+ P+ + + S C L++ +G C Y + Y D
Sbjct: 237 D-APCTSCAKGANALYKPRRDNI---VTPKDSLCVELHRNQKAGYCETCQQCDYEIEYAD 292
Query: 175 GSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNS---KTTGIVGLGGGDISL 231
S S G LA + + L G + L FGC + GL + KT GI+GL +SL
Sbjct: 293 HSSSMGVLARDELHLTMANGSSTNL-KFNFGCAYDQQGLLLNTLVKTDGILGLSKAKVSL 351
Query: 232 ISQMRTTIAGNQRLGVSTPDIVIDSG 257
SQ+ N +G + V+ G
Sbjct: 352 PSQLANRGIINNVVGHCLANDVVGGG 377
>gi|297820902|ref|XP_002878334.1| hypothetical protein ARALYDRAFT_907565 [Arabidopsis lyrata subsp.
lyrata]
gi|297324172|gb|EFH54593.1| hypothetical protein ARALYDRAFT_907565 [Arabidopsis lyrata subsp.
lyrata]
Length = 362
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 56/202 (27%), Positives = 89/202 (44%), Gaps = 36/202 (17%)
Query: 63 NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
+R H + + S+ S+ D + N Y R+ IGTPP + D+GS + + C C
Sbjct: 64 HRKLHKSDSKSLPHSRMRLYDDLLINGYYTTRLWIGTPPQMFALIVDSGSTVTYVPCSDC 123
Query: 123 ------------PPSQ------CYMQ----------DSPLFDPKMSSTYKSLPCSSSQCA 154
P Q C +Q + P F P++SSTY+ + C + C
Sbjct: 124 EQCGKHQVMLSSPKDQILCLVSCKVQIFKISYGLFDEDPKFQPELSSTYQPVKC-NMDCN 182
Query: 155 SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT-NNGGL 213
+ K C Y Y + S S G L + ++ G+ + + FGC T G L
Sbjct: 183 CDDDKE----QCVYEREYAEHSSSKGVLGEDLISFGNES--HLTPQRAVFGCKTVETGDL 236
Query: 214 FNSKTTGIVGLGGGDISLISQM 235
++ + GI+GLG GD+SL+ Q+
Sbjct: 237 YSQRADGIIGLGQGDLSLVGQL 258
>gi|242065058|ref|XP_002453818.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
gi|241933649|gb|EES06794.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
Length = 490
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 84/363 (23%), Positives = 142/363 (39%), Gaps = 85/363 (23%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +I IG+P DTGSD++W +C+ CP + + +DP S T ++
Sbjct: 85 YYTQIEIGSPSKGYYVQVDTGSDILWVNCIRCDGCPTTSGLGIELTQYDPAGSGT--TVG 142
Query: 148 CSSSQCASLNQK----SCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP- 200
C C + + +C + CQ+ ++YGDGS + G +++V +G P
Sbjct: 143 CDQEFCVANSPNGLPPACPSTSSPCQFRIAYGDGSSTTGFYVSDSVQYNQVSGNGQTTPS 202
Query: 201 --GITFGCGTNNGGLFNSKT---TGIVGLGGGDISLISQMRT-------------TIAGN 242
ITFGCG GG S + GI+G G D S++SQ+ T+ G
Sbjct: 203 NASITFGCGAQLGGDLGSSSQALDGILGFGQADSSMLSQLAAARKVRKIFAHCLDTVHGG 262
Query: 243 QRLGVST----------------------------------PDIVIDSG----------T 258
+ P DSG T
Sbjct: 263 GIFAIGNVVQPKVKTTPLVQNVTHYNVNLQGISVGGATLQLPSSTFDSGDSKGTIIDSGT 322
Query: 259 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFRGA-DV 315
TL +LP+ LL+ + + Q +A +C+ F+ P VT F G +
Sbjct: 323 TLAYLPREVYRTLLTAV--FDKYQDLALHNYQDFVCFQFSGSIDDGFPVVTFSFEGEITL 380
Query: 316 KLSRSNFFVKVSEDIVCSVF--KGIT----NSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 369
+ ++ + D+ C F G+ + + G+++ +N LV YD+E+Q + +
Sbjct: 381 NVYPHDYLFQNENDLYCMGFLDGGVQTKDGKDMVLLGDLVLSNKLVVYDLEKQVIGWADY 440
Query: 370 DCT 372
+C+
Sbjct: 441 NCS 443
>gi|222619890|gb|EEE56022.1| hypothetical protein OsJ_04800 [Oryza sativa Japonica Group]
Length = 423
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 85/332 (25%), Positives = 126/332 (37%), Gaps = 59/332 (17%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
NY+ R +GTP L D +D W C C + C SP F P SSTY+++PC
Sbjct: 101 NYIARAGLGTPAQTLLVAIDPSNDAAWVPCSAC--AGC-AASSPSFSPTQSSTYRTVPCG 157
Query: 150 SSQCASLNQKSCS---GVNCQYSVSY---------GDGSFSNGNLATETVTLG------- 190
S QCA + SC G +C ++++Y G S + N + T G
Sbjct: 158 SPQCAQVPSPSCPAGVGSSCGFNLTYAASTFQAVLGQDSLALENNVVVSYTFGCLRVVNG 217
Query: 191 ---STTGQAVALPGITFGCGTNNGGL------FNSKTT--------------GIVGLGGG 227
+ G P + G L KTT ++G+ G
Sbjct: 218 NSRAAAGAHRLRPRAALLLVADQGHLGPIGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVG 277
Query: 228 DISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADP 287
++ ++ +A N G T +ID+GT T L + + + PVA P
Sbjct: 278 S-KVVQVPQSALAFNPVTGSGT---IIDAGTMFTRLAAPVYAAVRDAFRGRVR-TPVAPP 332
Query: 288 TGSLELCYSFNSLSQVPEVTIHFRGA-DVKLSRSNFFVKVSE-DIVCSVFK-----GITN 340
G + CY N VP VT F GA V L N + S + C G+
Sbjct: 333 LGGFDTCY--NVTVSVPTVTFMFAGAVAVTLPEENVMIHSSSGGVACLAMAAGPSDGVNA 390
Query: 341 SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
++ + ++ Q N V +D+ V F CT
Sbjct: 391 ALNVLASMQQQNQRVLFDVANGRVGFSRELCT 422
>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 86/350 (24%), Positives = 138/350 (39%), Gaps = 86/350 (24%)
Query: 97 IGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS-QCAS 155
IGTPP E + DTGS + + C C QC P F P +S TY + C+ C +
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCNSC--DQCGNHQDPKFQPDLSDTYHPVKCNPDCTCDT 59
Query: 156 LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNGGLF 214
N + C Y Y + S S+G L + V+ G+ + + FGC G LF
Sbjct: 60 ENDQ------CTYERQYAEMSSSSGILGEDLVSFGNMS--ELKPQRAVFGCENAETGDLF 111
Query: 215 NSKTTGIVGL------------------------------GGGDISL--IS--------- 233
+ GI+GL GGG + L IS
Sbjct: 112 SQHADGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLGQISPPSDMVFSH 171
Query: 234 -----------QMRTTIAGNQRLGVSTPDI-------VIDSGTTLTFLPQGYNSNLLSVM 275
++R ++L ++ P + ++DSGTT +LP+ + +
Sbjct: 172 SDPDRSPYYNIELRGLHVAGKKLDIN-PQVFDGKHGTILDSGTTYAYLPEAAFLPFIQAI 230
Query: 276 SSMIEA-QPVADPTGSL-ELCYSFNSLSQVPEVTIHFRGADV--------KLSRSNFFVK 325
+S + + + P + ++C+S + S++PE+ F D+ LS N+ K
Sbjct: 231 TSELHGLKQIRGPDPNYNDVCFS-GAGSEIPELYKTFPSVDMVFDNGEKYSLSPENYLFK 289
Query: 326 VSE---DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
S+ VF+ + + G I+ N LV YD E V F T+C+
Sbjct: 290 HSKVHGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREHSKVGFWKTNCS 339
>gi|388495448|gb|AFK35790.1| unknown [Medicago truncatula]
Length = 434
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 58/213 (27%), Positives = 98/213 (46%), Gaps = 26/213 (12%)
Query: 12 FFLCFYVVSPIEAQT-------GGFSVELIHRDSPKSPFYNSSETPYQR-LRDALTRSLN 63
F LC ++ S ++AQT G ++++ H S SPF S ++ + + +
Sbjct: 10 FLLCLFI-SLVQAQTPKCDIQDDGSTLKVFHIFSQCSPFKPSKPMSWEESVLNLQAKDQA 68
Query: 64 RLNHFNQNSSISSSKA----SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQC 119
R+ +F SS+ + K+ + A I + Y+++ GTPP L DT SD W C
Sbjct: 69 RMQYF---SSLVARKSVVPIASARQIIQSPTYIVKAKFGTPPQTLLLALDTSSDAAWIPC 125
Query: 120 EPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSN 179
C S F P S++++++ C S C + +C G C ++ +YG S +
Sbjct: 126 SGC----VGCSTSKPFAPIKSTSFRNVSCGSPHCKQVPNPTCGGSACAFNFTYGSSSIA- 180
Query: 180 GNLATETVTLGSTTGQAVALPGITFGCGTNNGG 212
++ +T+TL A +PG TFGC G
Sbjct: 181 ASVVQDTLTLA-----ADPIPGYTFGCVNKTTG 208
>gi|297806153|ref|XP_002870960.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
gi|297316797|gb|EFH47219.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
Length = 453
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 55/149 (36%), Positives = 74/149 (49%), Gaps = 25/149 (16%)
Query: 100 PPTERLAVADTGSDLIWTQC----EPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCAS 155
PP V DTGS+L W +C P P + FDP SS+Y +PCSS C +
Sbjct: 82 PPQNISMVIDTGSELSWLRCNRSSNPNPVNN--------FDPTRSSSYSPIPCSSPTCRT 133
Query: 156 -----LNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
L SC S C ++SY D S S GNLA E G++T + + FGC +
Sbjct: 134 RTRDFLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDS----NLIFGCMGS 189
Query: 210 NGG---LFNSKTTGIVGLGGGDISLISQM 235
G ++KTTG++G+ G +S ISQM
Sbjct: 190 VSGSDPEEDTKTTGLLGMNRGSLSFISQM 218
>gi|357469587|ref|XP_003605078.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355506133|gb|AES87275.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 418
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 55/162 (33%), Positives = 76/162 (46%), Gaps = 22/162 (13%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCE----PCPPSQCYMQDSPLFDPKMSSTYKSL 146
Y + I+IG PP DTGSDL W QC+ PC C + L+ P + K
Sbjct: 62 YTVSINIGNPPNPYELDIDTGSDLTWVQCDGPDAPC--KGCTLPKDKLYKPNGNQLVK-- 117
Query: 147 PCSSSQCASLN-------QKSCSGV-NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
CS CA++ QK + C Y V Y D + S G LA + + +GS +G V
Sbjct: 118 -CSDPICAAVQPPFSTFGQKCAKPIPPCVYKVEYADNAESTGALARDYMHIGSPSGSNVP 176
Query: 199 LPGITFGCGTNNG---GLFNSKTTGIVGLGGGDISLISQMRT 237
L + FGCG T G++GLG G IS++SQ+ +
Sbjct: 177 L--VVFGCGYEQKFSGPTPPPSTPGVLGLGNGKISILSQLHS 216
>gi|343172996|gb|AEL99201.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 86/350 (24%), Positives = 138/350 (39%), Gaps = 86/350 (24%)
Query: 97 IGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS-QCAS 155
IGTPP E + DTGS + + C C QC P F P +S TY + C+ C +
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCNSC--DQCGNHQDPKFQPDLSDTYHPVKCNPDCTCDT 59
Query: 156 LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNGGLF 214
N + C Y Y + S S+G L + V+ G+ + + FGC G LF
Sbjct: 60 ENDQ------CTYERQYAEMSSSSGILGEDLVSFGNMS--ELKPQRAVFGCENAETGDLF 111
Query: 215 NSKTTGIVGL------------------------------GGGDISL--IS--------- 233
+ GI+GL GGG + L IS
Sbjct: 112 SQHADGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLGQISPPSDMVFSH 171
Query: 234 -----------QMRTTIAGNQRLGVSTPDI-------VIDSGTTLTFLPQGYNSNLLSVM 275
++R ++L ++ P + ++DSGTT +LP+ + +
Sbjct: 172 SDPDRSPYYNIELRGLHVAGKKLDIN-PQVFDGKHGTILDSGTTYAYLPEAAFLPFIQAI 230
Query: 276 SSMIEA-QPVADPTGSL-ELCYSFNSLSQVPEVTIHFRGADV--------KLSRSNFFVK 325
+S + + + P + ++C+S + S++PE+ F D+ LS N+ K
Sbjct: 231 TSELHGLKQIRGPDPNYNDVCFS-GAGSEIPELYKTFPSVDMVFDNGEKYSLSPENYLFK 289
Query: 326 VSE---DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
S+ VF+ + + G I+ N LV YD E V F T+C+
Sbjct: 290 HSKVHGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREHSKVGFWKTNCS 339
>gi|222629462|gb|EEE61594.1| hypothetical protein OsJ_16002 [Oryza sativa Japonica Group]
Length = 468
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 49/158 (31%), Positives = 71/158 (44%), Gaps = 10/158 (6%)
Query: 87 NNANYLIRISIGTPPTE---RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
+ YL+++ IGTP R + DTGSDL WTQCEPC + P DP S T+
Sbjct: 97 GGSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPP-HDPSKSRTF 155
Query: 144 KSLPCSSSQ---CASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST-TGQAVAL 199
+ L C C ++ C + YGDG +G L ++ G+ G L
Sbjct: 156 RRLSCFDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQL 215
Query: 200 P-GITFGCG-TNNGGLFNSKTTGIVGLGGGDISLISQM 235
+ FGC + +TGI+ LG G S ++Q+
Sbjct: 216 ERDVAFGCAHVEDSKAVRGYSTGILALGIGKPSFVTQL 253
Score = 41.6 bits (96), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 43/170 (25%), Positives = 71/170 (41%), Gaps = 9/170 (5%)
Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYN 268
+G + + +V GG ++ + +AG + + +++DSGTTL +LP
Sbjct: 302 QDGSGYAVRLKSVVYQHGGRLNQQQPVPVYVAGEE--AAAAMPMLVDSGTTLLWLPGSVF 359
Query: 269 SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHF-RGADVKLSRSNFFV-- 324
L + I D T CY N + VT+ F GAD++L ++ F
Sbjct: 360 YPLQRRIEEDISLTRRYDLTHPSLYCYLGNMTDVEAVSVTLGFGGGADLELFGTSLFFTD 419
Query: 325 -KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
++ED VC + I G Q N VGYD+ ++F C +
Sbjct: 420 ENLTEDWVCLAVAAGNRA--ILGVYPQRNINVGYDLSTMEIAFDRDQCDR 467
>gi|125524351|gb|EAY72465.1| hypothetical protein OsI_00321 [Oryza sativa Indica Group]
Length = 343
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 29/72 (40%), Positives = 45/72 (62%), Gaps = 2/72 (2%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
Y R+ +G+P + V DTGSD+ W QC+PC + CY Q P+FDP +S++Y S+ C
Sbjct: 166 EYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPC--ADCYQQSDPVFDPSLSTSYASVACD 223
Query: 150 SSQCASLNQKSC 161
+ +C L+ +C
Sbjct: 224 NPRCHDLDAAAC 235
>gi|116311058|emb|CAH67989.1| OSIGBa0142I02-OSIGBa0101B20.32 [Oryza sativa Indica Group]
Length = 488
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 49/158 (31%), Positives = 71/158 (44%), Gaps = 10/158 (6%)
Query: 87 NNANYLIRISIGTPPTE---RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
+ YL+++ IGTP R + DTGSDL WTQCEPC + P DP S T+
Sbjct: 119 GGSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPP-HDPSKSRTF 177
Query: 144 KSLPCSSSQ---CASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST-TGQAVAL 199
+ L C C ++ C + YGDG +G L ++ G+ G L
Sbjct: 178 RRLSCFDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQL 237
Query: 200 P-GITFGCG-TNNGGLFNSKTTGIVGLGGGDISLISQM 235
+ FGC + +TGI+ LG G S ++Q+
Sbjct: 238 ERDVAFGCAHVEDSKAVRGYSTGILALGIGKPSFVTQL 275
Score = 42.0 bits (97), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 43/170 (25%), Positives = 71/170 (41%), Gaps = 9/170 (5%)
Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYN 268
+G + + +V GG ++ + +AG + + +++DSGTTL +LP
Sbjct: 322 QDGSGYAVRLKSVVYQHGGRLNQQQPVPVYVAGEE--AAAAMPMLVDSGTTLLWLPGSVF 379
Query: 269 SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHF-RGADVKLSRSNFFV-- 324
L + I D T CY N + VT+ F GAD++L ++ F
Sbjct: 380 YPLQRRIEEDISLTRRYDLTHPSLYCYLGNMTDVEAVSVTLGFGGGADLELFGTSLFFTD 439
Query: 325 -KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
++ED VC + I G Q N VGYD+ ++F C +
Sbjct: 440 ENLTEDWVCLAVAAGNRA--ILGVYPQRNINVGYDLSTMEIAFDRDQCDR 487
>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
Length = 354
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 54/163 (33%), Positives = 77/163 (47%), Gaps = 18/163 (11%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y ++ +GTPP E DTGSD++W C CP + FDP SST +
Sbjct: 25 YYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSSMIA 84
Query: 148 CSSSQCASLNQKS---CSGVN--CQYSVSYGDGSFSNGNLATETVTL-----GSTTGQAV 197
CS +C + Q S CS N C Y+ YGDGS ++G ++ + L GS T +
Sbjct: 85 CSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTTNST 144
Query: 198 ALPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRT 237
A + FGC G + GI G G ++S+ISQ+ +
Sbjct: 145 AP--VVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSS 185
>gi|32489096|emb|CAE03928.1| OSJNba0093F12.2 [Oryza sativa Japonica Group]
gi|58532027|emb|CAD41565.3| OSJNBa0006A01.20 [Oryza sativa Japonica Group]
Length = 489
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 49/158 (31%), Positives = 71/158 (44%), Gaps = 10/158 (6%)
Query: 87 NNANYLIRISIGTPPTE---RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
+ YL+++ IGTP R + DTGSDL WTQCEPC + P DP S T+
Sbjct: 118 GGSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPP-HDPSKSRTF 176
Query: 144 KSLPCSSSQ---CASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST-TGQAVAL 199
+ L C C ++ C + YGDG +G L ++ G+ G L
Sbjct: 177 RRLSCFDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQL 236
Query: 200 P-GITFGCG-TNNGGLFNSKTTGIVGLGGGDISLISQM 235
+ FGC + +TGI+ LG G S ++Q+
Sbjct: 237 ERDVAFGCAHVEDSKAVRGYSTGILALGIGKPSFVTQL 274
Score = 42.0 bits (97), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 43/170 (25%), Positives = 71/170 (41%), Gaps = 9/170 (5%)
Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYN 268
+G + + +V GG ++ + +AG + + +++DSGTTL +LP
Sbjct: 323 QDGSGYAVRLKSVVYQHGGRLNQQQPVPVYVAGEE--AAAAMPMLVDSGTTLLWLPGSVF 380
Query: 269 SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHF-RGADVKLSRSNFFV-- 324
L + I D T CY N + VT+ F GAD++L ++ F
Sbjct: 381 YPLQRRIEEDISLTRRYDLTHPSLYCYLGNMTDVEAVSVTLGFGGGADLELFGTSLFFTD 440
Query: 325 -KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
++ED VC + I G Q N VGYD+ ++F C +
Sbjct: 441 ENLTEDWVCLAVAAGNRA--ILGVYPQRNINVGYDLSTMEIAFDRDQCDR 488
>gi|218195474|gb|EEC77901.1| hypothetical protein OsI_17222 [Oryza sativa Indica Group]
Length = 467
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 49/158 (31%), Positives = 71/158 (44%), Gaps = 10/158 (6%)
Query: 87 NNANYLIRISIGTPPTE---RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
+ YL+++ IGTP R + DTGSDL WTQCEPC + P DP S T+
Sbjct: 98 GGSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPP-HDPSKSRTF 156
Query: 144 KSLPCSSSQ---CASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST-TGQAVAL 199
+ L C C ++ C + YGDG +G L ++ G+ G L
Sbjct: 157 RRLSCFDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQL 216
Query: 200 P-GITFGCG-TNNGGLFNSKTTGIVGLGGGDISLISQM 235
+ FGC + +TGI+ LG G S ++Q+
Sbjct: 217 ERDVAFGCAHVEDSKAVRGYSTGILALGIGKPSFVTQL 254
Score = 41.6 bits (96), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 43/170 (25%), Positives = 71/170 (41%), Gaps = 9/170 (5%)
Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYN 268
+G + + +V GG ++ + +AG + + +++DSGTTL +LP
Sbjct: 301 QDGSGYAVRLKSVVYQHGGRLNQQQPVPVYVAGEE--AAAAMPMLVDSGTTLLWLPGSVF 358
Query: 269 SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHF-RGADVKLSRSNFFV-- 324
L + I D T CY N + VT+ F GAD++L ++ F
Sbjct: 359 YPLQRRIEEDISLTRRYDLTHPSLYCYLGNMTDVEAVSVTLGFGGGADLELFGTSLFFTD 418
Query: 325 -KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
++ED VC + I G Q N VGYD+ ++F C +
Sbjct: 419 ENLTEDWVCLAVAAGNRA--ILGVYPQRNINVGYDLSTMEIAFDRDQCDR 466
>gi|115460260|ref|NP_001053730.1| Os04g0595000 [Oryza sativa Japonica Group]
gi|113565301|dbj|BAF15644.1| Os04g0595000, partial [Oryza sativa Japonica Group]
Length = 471
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 49/158 (31%), Positives = 71/158 (44%), Gaps = 10/158 (6%)
Query: 87 NNANYLIRISIGTPPTE---RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
+ YL+++ IGTP R + DTGSDL WTQCEPC + P DP S T+
Sbjct: 100 GGSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPP-HDPSKSRTF 158
Query: 144 KSLPCSSSQ---CASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST-TGQAVAL 199
+ L C C ++ C + YGDG +G L ++ G+ G L
Sbjct: 159 RRLSCFDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQL 218
Query: 200 P-GITFGCG-TNNGGLFNSKTTGIVGLGGGDISLISQM 235
+ FGC + +TGI+ LG G S ++Q+
Sbjct: 219 ERDVAFGCAHVEDSKAVRGYSTGILALGIGKPSFVTQL 256
Score = 42.0 bits (97), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 43/170 (25%), Positives = 71/170 (41%), Gaps = 9/170 (5%)
Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYN 268
+G + + +V GG ++ + +AG + + +++DSGTTL +LP
Sbjct: 305 QDGSGYAVRLKSVVYQHGGRLNQQQPVPVYVAGEE--AAAAMPMLVDSGTTLLWLPGSVF 362
Query: 269 SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHF-RGADVKLSRSNFFV-- 324
L + I D T CY N + VT+ F GAD++L ++ F
Sbjct: 363 YPLQRRIEEDISLTRRYDLTHPSLYCYLGNMTDVEAVSVTLGFGGGADLELFGTSLFFTD 422
Query: 325 -KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
++ED VC + I G Q N VGYD+ ++F C +
Sbjct: 423 ENLTEDWVCLAVAAGNRA--ILGVYPQRNINVGYDLSTMEIAFDRDQCDR 470
>gi|224140237|ref|XP_002323490.1| predicted protein [Populus trichocarpa]
gi|222868120|gb|EEF05251.1| predicted protein [Populus trichocarpa]
Length = 478
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 49/161 (30%), Positives = 77/161 (47%), Gaps = 14/161 (8%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y ++ +G+PP E DTGSD++W C CP + FD SST +
Sbjct: 66 YFTKVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGLVH 125
Query: 148 CSSSQCASLNQKSCSGVN-----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG- 201
CS C S Q + + + C Y+ Y DGS ++G ++T+ + G+++ +
Sbjct: 126 CSDPICTSAVQTTVTQCSPQTNQCSYTFQYEDGSGTSGYYVSDTLYFDAILGESLVVNSS 185
Query: 202 --ITFGCGTNNGG---LFNSKTTGIVGLGGGDISLISQMRT 237
I FGC T G + + GI G G G++S+ISQ+ T
Sbjct: 186 ALIVFGCSTFQSGDLTMTDKAVDGIFGFGQGELSVISQLST 226
>gi|326501496|dbj|BAK02537.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 52/164 (31%), Positives = 73/164 (44%), Gaps = 11/164 (6%)
Query: 90 NYLIRISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y +R +GTP L VADTGSDL W +C S+ F P+ S T+ +
Sbjct: 93 QYFVRFRVGTPAQPFLLVADTGSDLTWVKCRRPAANSSESGSGSGRAFRPEDSRTWAPIS 152
Query: 148 CSSSQCA-----SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLG-STTG---QAVA 198
C+S C SL G C Y Y DGS + G + TE+ T+ S G +
Sbjct: 153 CASDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSGRGREERKAK 212
Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGN 242
L G+ GC ++ G + G++ LG D+S S + AG
Sbjct: 213 LKGLVLGCTSSYTGPSFEVSDGVLSLGYSDVSFASHAASRFAGR 256
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 33/128 (25%), Positives = 67/128 (52%), Gaps = 10/128 (7%)
Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP--VADPTGSLELCYSFNSLS---QVPEV 306
+++DSGT+LT L + +++ +S + P DP E CY++ S S +P++
Sbjct: 351 VILDSGTSLTVLAKPAYRAVVAALSEGLAGLPRVTMDP---FEYCYNWTSPSGDVTLPKM 407
Query: 307 TIHFRGA-DVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 364
+HF GA ++ ++ + + + C + +G + + GNI+Q L +DI+ + +
Sbjct: 408 AVHFAGAARLEPPGKSYVIDAAPGVKCIGLQEGPWPGISVIGNILQQEHLWEFDIKNRRL 467
Query: 365 SFKPTDCT 372
F+ + CT
Sbjct: 468 KFQRSRCT 475
>gi|357515189|ref|XP_003627883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521905|gb|AET02359.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 57/213 (26%), Positives = 98/213 (46%), Gaps = 26/213 (12%)
Query: 12 FFLCFYVVSPIEAQT-------GGFSVELIHRDSPKSPFYNSSETPYQR-LRDALTRSLN 63
F LC ++ S ++AQT G ++++ H S SPF S ++ + + +
Sbjct: 10 FLLCLFI-SLVQAQTPKCDIQDDGSTLKVFHIFSQCSPFKPSKPMSWEESVLNLQAKDQA 68
Query: 64 RLNHFNQNSSISSSKA----SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQC 119
R+ +F SS+ + K+ + A I + Y+++ GTPP L DT SD W C
Sbjct: 69 RMQYF---SSLVARKSVVPIASARQIIQSPTYIVKAKFGTPPQTLLLALDTSSDAAWIPC 125
Query: 120 EPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSN 179
C S F P S++++++ C S C + +C G C ++ +YG S +
Sbjct: 126 SGC----VGCSTSKPFAPIKSTSFRNVSCGSPHCKQVPNPTCGGSACAFNFTYGSSSIA- 180
Query: 180 GNLATETVTLGSTTGQAVALPGITFGCGTNNGG 212
++ +T+TL + +PG TFGC G
Sbjct: 181 ASVVQDTLTLATD-----PIPGYTFGCVNKTTG 208
>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 492
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 84/363 (23%), Positives = 147/363 (40%), Gaps = 82/363 (22%)
Query: 91 YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
Y ++ +GTPP E DTGSD++W T C CP + FDP +SS+ +
Sbjct: 84 YYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASLVS 143
Query: 148 CSSSQCAS--LNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL---PG 201
CS +C S + CS N C YS YGDGS ++G ++ ++ + +A+
Sbjct: 144 CSDRRCYSNFQTESGCSPNNLCSYSFKYGDGSGTSGYYISDFMSFDTVITSTLAINSSAP 203
Query: 202 ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT----------IAGNQRLG-- 246
FGC G GI GLG G +S+ISQ+ + G++ G
Sbjct: 204 FVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDKSGGGI 263
Query: 247 -----VSTPDIV-----------------------------------------IDSGTTL 260
+ PD V ID+GTTL
Sbjct: 264 MVLGQIKRPDTVYTPLVPSQPHYNVNLQSIAVNGQILPIDPSVFTIATGDGTIIDTGTTL 323
Query: 261 TFLPQGYNSNLLSVMSSMIE--AQPVADPTGSLELCYSFNS--LSQVPEVTIHFRGADVK 316
+LP S + +++ + +P+ T C+ + + P+V++ F G
Sbjct: 324 AYLPDEAYSPFIQAVANAVSQYGRPI---TYESYQCFEITAGDVDVFPQVSLSFAGGASM 380
Query: 317 LSRSNFFVKV----SEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
+ ++++ I C F+ +++ + I G+++ + +V YD+ +Q + + DC
Sbjct: 381 VLGPRAYLQIFSSSGSSIWCIGFQRMSHRRITILGDLVLKDKVVVYDLVRQRIGWAEYDC 440
Query: 372 TKQ 374
+ +
Sbjct: 441 SLE 443
>gi|115448471|ref|NP_001048015.1| Os02g0730700 [Oryza sativa Japonica Group]
gi|46390468|dbj|BAD15929.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|46390864|dbj|BAD16368.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|113537546|dbj|BAF09929.1| Os02g0730700 [Oryza sativa Japonica Group]
gi|215697021|dbj|BAG91015.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623612|gb|EEE57744.1| hypothetical protein OsJ_08261 [Oryza sativa Japonica Group]
Length = 573
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 55/162 (33%), Positives = 74/162 (45%), Gaps = 14/162 (8%)
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKM 139
+ ++ P+ Y I +G PP DTGSDL W QC+ PC + C PL+ P
Sbjct: 194 KGNVFPD-GQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPC--TNCAKGPHPLYKP-- 248
Query: 140 SSTYKSLPCSSSQCASL--NQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
+ K +P C L NQ C C Y + Y D S S G LA + + + +T G
Sbjct: 249 -AKEKIVPPKDLLCQELQGNQNYCETCKQCDYEIEYADRSSSMGVLARDDMHIITTNGGR 307
Query: 197 VALPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM 235
L FGC + G +KT GI+GL ISL SQ+
Sbjct: 308 EKL-DFVFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQL 348
>gi|218191512|gb|EEC73939.1| hypothetical protein OsI_08807 [Oryza sativa Indica Group]
Length = 574
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 55/162 (33%), Positives = 74/162 (45%), Gaps = 14/162 (8%)
Query: 81 QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKM 139
+ ++ P+ Y I +G PP DTGSDL W QC+ PC + C PL+ P
Sbjct: 195 KGNVFPD-GQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPC--TNCAKGPHPLYKP-- 249
Query: 140 SSTYKSLPCSSSQCASL--NQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
+ K +P C L NQ C C Y + Y D S S G LA + + + +T G
Sbjct: 250 -AKEKIVPPKDLLCQELQGNQNYCETCKQCDYEIEYADRSSSMGVLARDDMHIITTNGGR 308
Query: 197 VALPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM 235
L FGC + G +KT GI+GL ISL SQ+
Sbjct: 309 EKL-DFVFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQL 349
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.317 0.132 0.390
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,846,866,689
Number of Sequences: 23463169
Number of extensions: 248656532
Number of successful extensions: 685059
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1231
Number of HSP's successfully gapped in prelim test: 3131
Number of HSP's that attempted gapping in prelim test: 675673
Number of HSP's gapped (non-prelim): 7136
length of query: 374
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 230
effective length of database: 8,980,499,031
effective search space: 2065514777130
effective search space used: 2065514777130
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 77 (34.3 bits)