BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 012359
(465 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255565759|ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 447
Score = 561 bits (1445), Expect = e-157, Method: Compositional matrix adjust.
Identities = 277/444 (62%), Positives = 339/444 (76%), Gaps = 21/444 (4%)
Query: 24 SSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISS 83
SS S+ S S + NPSQD Q LN LVS+SL RA H+KNPQT T + S
Sbjct: 23 SSFISIPLSHSYTNQNPSQDHLQKLNYLVSTSLARAHHLKNPQT-----------TPVFS 71
Query: 84 HSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCS-SSKIPSFIPKLSSSS 142
HSYGGYSISLSFGTPPQ + F++DTGS VWFPCT Y C CS +S+I F+PK SSSS
Sbjct: 72 HSYGGYSISLSFGTPPQTLSFVMDTGSSFVWFPCTLRYLCNNCSFTSRISPFLPKHSSSS 131
Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLN 202
+++GC+NPKCSWIH ++C DC++ S+NC+QICP YL+LYGSG T G+ALSETL+
Sbjct: 132 KIIGCKNPKCSWIHQTDLRCTDCDNN----SRNCSQICPPYLILYGSGTTGGVALSETLH 187
Query: 203 LPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSS 262
L I+PNFLVGCSV SSRQPAGIAGFGRG +SLPSQL L KFSYCLLSHKFDDT +SS
Sbjct: 188 LHGLIVPNFLVGCSVFSSRQPAGIAGFGRGPSSLPSQLGLTKFSYCLLSHKFDDTQESSS 247
Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
L+LD+ S SDKKT L YTP V NP V ++ AFSVYYYV LRRI++GG+ V++ +KYL+
Sbjct: 248 LVLDS-QSDSDKKTAALMYTPLVKNPKVQDKPAFSVYYYVSLRRISIGGRSVKIPYKYLS 306
Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
D+DGNGGTI+DSGTTFT+M+ E FE L++EF+SQ+ +NY RAL EAL+GL+PCF+V
Sbjct: 307 PDKDGNGGTIIDSGTTFTYMSTEAFEILSNEFISQV---KNYERALMVEALSGLKPCFNV 363
Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTD-REASGGPSIILGNF 441
G K P+L+LHFKGGA+V LP+ENYFA +G C TVVTD E + GP +ILGNF
Sbjct: 364 SGAKELELPQLRLHFKGGADVELPLENYFAFLGSREVACFTVVTDGAEKASGPGMILGNF 423
Query: 442 QMQNYYVEYDLRNQRLGFKQQLCK 465
QMQN+YVEYDL+N+RLGFK++ CK
Sbjct: 424 QMQNFYVEYDLQNERLGFKKESCK 447
>gi|224102847|ref|XP_002312826.1| predicted protein [Populus trichocarpa]
gi|222849234|gb|EEE86781.1| predicted protein [Populus trichocarpa]
Length = 445
Score = 539 bits (1389), Expect = e-150, Method: Compositional matrix adjust.
Identities = 287/459 (62%), Positives = 338/459 (73%), Gaps = 29/459 (6%)
Query: 21 IFPSSITSLTFSLSRFHTN--PSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTT 78
+FP +S+T L TN P QD YQ LN LV++SL RA H+KNPQT TTTT
Sbjct: 1 LFPFISSSITIPLQHPQTNQIPFQDQYQKLNHLVTTSLARARHLKNPQTTPATTTTAP-- 58
Query: 79 TNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS------KIP 132
+ SHSYGGYS+SLSFGTPPQ + FI+DTGS +VWFPCT+HY CK+CS S +I
Sbjct: 59 --LFSHSYGGYSVSLSFGTPPQTLSFIMDTGSDIVWFPCTSHYLCKHCSFSSSSPSSRIQ 116
Query: 133 SFIPKLSSSSRLLGCQNPKCSWIHHESIQC-RDCNDEPLATSKNC-TQICPSYLVLYGSG 190
FIPK SSSS+LLGC+NPKCSWIHH +I C +DC + K+C Q CP Y++ YGSG
Sbjct: 117 PFIPKESSSSKLLGCKNPKCSWIHHSNINCDQDC------SIKSCLNQTCPPYMIFYGSG 170
Query: 191 LTEGIALSETLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLL 250
T G+ALSETL+L + PNFLVGCSV SS QPAGIAGFGRG +SLPSQL L KFSYCLL
Sbjct: 171 TTGGVALSETLHLHSLSKPNFLVGCSVFSSHQPAGIAGFGRGLSSLPSQLGLGKFSYCLL 230
Query: 251 SHKFDD-TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
SH+FDD T ++SSL+LD SDKKT L YTPFV NP V +++FSVYYY+GLRRITV
Sbjct: 231 SHRFDDDTKKSSSLVLDMEQLDSDKKTNALVYTPFVKNPKVDNKSSFSVYYYLGLRRITV 290
Query: 310 GGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
GG V+V +KYL+ DGNGG I+DSGTTFTFMA E FEPL+DEF+ Q+ ++Y R
Sbjct: 291 GGHHVKVPYKYLSPGEDGNGGVIIDSGTTFTFMAREAFEPLSDEFIRQI---KDYRRVKE 347
Query: 370 AEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR- 428
E GLRPCF+V KT SFPEL+L+FKGGA+V LPVENYFA VG G CLTVVTD
Sbjct: 348 IEDAIGLRPCFNVSDAKTVSFPELRLYFKGGADVALPVENYFAFVG-GEVACLTVVTDGV 406
Query: 429 ---EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
E GGP +ILGNFQMQN+YVEYDLRN+RLGFKQ+ C
Sbjct: 407 AGPERVGGPGMILGNFQMQNFYVEYDLRNERLGFKQEKC 445
>gi|225464832|ref|XP_002272243.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 467
Score = 521 bits (1343), Expect = e-145, Method: Compositional matrix adjust.
Identities = 275/449 (61%), Positives = 331/449 (73%), Gaps = 25/449 (5%)
Query: 27 TSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSY 86
+ +T LS +P D Y+NL LVS+SL RA H+KN TT T+TT + +HSY
Sbjct: 34 SPITLPLSASKPSPPPDPYRNLRHLVSASLIRARHLKN------PKTTPTSTTPLFTHSY 87
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCS-SSKIPS---FIPKLSSSS 142
G YSI LSFGTPPQ +P I+DTGS LVWFPCT+ Y C+ CS S+ PS FIPK SSSS
Sbjct: 88 GAYSIPLSFGTPPQTLPLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSSNIFIPKSSSSS 147
Query: 143 RLLGCQNPKCSWIHHESIQ--CRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSET 200
++LGC NPKC WIH +Q CRDC EP TS NCTQICP YLV YGSG+T GI LSET
Sbjct: 148 KVLGCVNPKCGWIHGSKVQSRCRDC--EP--TSPNCTQICPPYLVFYGSGITGGIMLSET 203
Query: 201 LNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRT 260
L+LP + +PNF+VGCSVLS+ QPAGI+GFGRG SLPSQL L KFSYCLLS ++DDTT +
Sbjct: 204 LDLPGKGVPNFIVGCSVLSTSQPAGISGFGRGPPSLPSQLGLKKFSYCLLSRRYDDTTES 263
Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
SSL+LD G S S +KT GL+YTPFV NP VA ++AFSVYYY+GLR ITVGG+ V++ +KY
Sbjct: 264 SSLVLD-GESDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITVGGKHVKIPYKY 322
Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
L DG+GGTI+DSGTTFT+M E+FE +A EF Q+ RA E +TGLRPCF
Sbjct: 323 LIPGADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQSK----RATEVEGITGLRPCF 378
Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTD----REASGGPSI 436
++ G T SFPEL L F+GGAE+ LP+ NY A +G VCLT+VTD +E SGGP+I
Sbjct: 379 NISGLNTPSFPELTLKFRGGAEMELPLANYVAFLGGDDVVCLTIVTDGAAGKEFSGGPAI 438
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
ILGNFQ QN+YVEYDLRN+RLGF+QQ CK
Sbjct: 439 ILGNFQQQNFYVEYDLRNERLGFRQQSCK 467
>gi|255587337|ref|XP_002534234.1| pepsin A, putative [Ricinus communis]
gi|223525662|gb|EEF28148.1| pepsin A, putative [Ricinus communis]
Length = 468
Score = 468 bits (1204), Expect = e-129, Method: Compositional matrix adjust.
Identities = 243/461 (52%), Positives = 313/461 (67%), Gaps = 33/461 (7%)
Query: 21 IFPSSIT-SLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTT 79
I PS+IT L+ ++++ PS D ++ LN L ++S++RA H+K+P+T + T
Sbjct: 23 ISPSTITIPLSPTITK---RPSSDPWEYLNHLATTSISRAHHLKSPKTNFSLIKTP---- 75
Query: 80 NISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCS-----SSKIPSF 134
+ S SYGGYS+SLS GTP Q + I+DTGS LVWFPCT+ Y C C+ +KIP F
Sbjct: 76 -LFSRSYGGYSMSLSLGTPSQTVKLIMDTGSSLVWFPCTSRYVCASCNFPNTDITKIPKF 134
Query: 135 IPKLSSSSRLLGCQNPKCSWIHHESIQ--CRDCNDEPLATSKNCTQICPSYLVLYGSGLT 192
+P+LSSSS+L+GC+NPKC+W+ S+Q C +CN + ++NCTQ CP Y++ YG G T
Sbjct: 135 MPRLSSSSKLIGCKNPKCAWVFGSSVQSKCHNCNPQ----AQNCTQACPPYIIQYGLGST 190
Query: 193 EGIALSETLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSH 252
G+ LSET+N PN+ I +FL GCS+LS+RQP GIAGFGR + SLP QL L KFSYCL+S
Sbjct: 191 AGLLLSETINFPNKTISDFLAGCSLLSTRQPEGIAGFGRSQESLPLQLGLKKFSYCLVSR 250
Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
+FDD+ +S LILD G S SD KTTGL+YTPF N + AF YYYV LR+I VG
Sbjct: 251 RFDDSPVSSDLILDMGPSTSDSKTTGLSYTPFQKNLASQSNPAFQEYYYVMLRKIIVGKT 310
Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
V+V + +L DGNGGTIVDSG+TFTF+ +FE LA EF QM NYT A +
Sbjct: 311 HVKVPYSFLVPGSDGNGGTIVDSGSTFTFVEGHVFELLAKEFEKQMA---NYTVATNVQK 367
Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA-- 430
LTGLRPCFD+ GEK+ P+L FKGGA++ LP+ NYFA V G VCLT+V+D A
Sbjct: 368 LTGLRPCFDISGEKSVVIPDLTFQFKGGAKMQLPLSNYFAFVDMG-VVCLTIVSDNAAAL 426
Query: 431 -------SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
S GP+IILGNFQ QN+Y+EYDL N R GFK+Q C
Sbjct: 427 GGDGGVRSSGPAIILGNFQQQNFYIEYDLENDRFGFKEQSC 467
>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 460 bits (1184), Expect = e-127, Method: Compositional matrix adjust.
Identities = 249/471 (52%), Positives = 308/471 (65%), Gaps = 31/471 (6%)
Query: 6 SALCLSFIFFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNP 65
S L ++ F+ LS S +T L+ F S D Q L L SSS TRA IK P
Sbjct: 5 SPLSFFYLLLFSSLSAIAHS-NPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQIKTP 63
Query: 66 QTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKY 125
++ + + +S HSYG YS LSFGTP Q + I DTGS LVWFPCT+ Y C
Sbjct: 64 KSNSVFKSP------LSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSE 117
Query: 126 CSSSKI-----PSFIPKLSSSSRLLGCQNPKCSWIHHESI--QCRDCNDEPLATSKNCTQ 178
CS KI P F+PKLSSSS+L+GCQNPKCSWI + QCR CN + ++NCTQ
Sbjct: 118 CSFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPK----TENCTQ 173
Query: 179 ICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPS 238
CP+Y+V YGSG T G+ LSETL+ P++ IPNF+VGCS LS QP+GIAGFGRG SLPS
Sbjct: 174 TCPAYVVQYGSGSTAGLLLSETLDFPDKXIPNFVVGCSFLSIHQPSGIAGFGRGSESLPS 233
Query: 239 QLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSV 298
Q+ L KF+YCL S KFDD+ + LILD+ + K++GLTYTPF NPSV+ NA+
Sbjct: 234 QMGLKKFAYCLASRKFDDSPHSGQLILDS----TGVKSSGLTYTPFRQNPSVSN-NAYKE 288
Query: 299 YYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM 358
YYY+ +R+I VG Q V+V +K+L DGNGG+I+DSG+TFTFM + E +A EF Q+
Sbjct: 289 YYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQL 348
Query: 359 VKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGS 418
N+TRA E LTGLRPCFD+ EK+ FPEL FKGGA+ LP+ NYFA+V
Sbjct: 349 A---NWTRATDVETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSG 405
Query: 419 AVCLTVVTDR-----EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
CLTVVT + GGPS+ILG FQ QN+YVEYDL NQRLGF+QQ C
Sbjct: 406 VACLTVVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTC 456
>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 460 bits (1183), Expect = e-127, Method: Compositional matrix adjust.
Identities = 249/471 (52%), Positives = 308/471 (65%), Gaps = 31/471 (6%)
Query: 6 SALCLSFIFFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNP 65
S L ++ F+ LS S +T L+ F S D Q L L SSS TRA IK P
Sbjct: 5 SPLSFFYLLLFSSLSAIAHS-NPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQIKTP 63
Query: 66 QTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKY 125
++ + + +S HSYG YS LSFGTP Q + I DTGS LVWFPCT+ Y C
Sbjct: 64 KSNSVFKSP------LSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSE 117
Query: 126 CSSSKI-----PSFIPKLSSSSRLLGCQNPKCSWIHHESI--QCRDCNDEPLATSKNCTQ 178
CS KI P F+PKLSSSS+L+GCQNPKCSWI + QCR CN + ++NCTQ
Sbjct: 118 CSFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPK----TENCTQ 173
Query: 179 ICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPS 238
CP+Y+V YGSG T G+ LSETL+ P++ IPNF+VGCS LS QP+GIAGFGRG SLPS
Sbjct: 174 TCPAYVVQYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPS 233
Query: 239 QLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSV 298
Q+ L KF+YCL S KFDD+ + LILD+ + K++GLTYTPF NPSV+ NA+
Sbjct: 234 QMGLKKFAYCLASRKFDDSPHSGQLILDS----TGVKSSGLTYTPFRQNPSVSN-NAYKE 288
Query: 299 YYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM 358
YYY+ +R+I VG Q V+V +K+L DGNGG+I+DSG+TFTFM + E +A EF Q+
Sbjct: 289 YYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQL 348
Query: 359 VKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGS 418
N+TRA E LTGLRPCFD+ EK+ FPEL FKGGA+ LP+ NYFA+V
Sbjct: 349 A---NWTRATDVETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSG 405
Query: 419 AVCLTVVTDR-----EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
CLTVVT + GGPS+ILG FQ QN+YVEYDL NQRLGF+QQ C
Sbjct: 406 VACLTVVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTC 456
>gi|356503843|ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 474
Score = 451 bits (1161), Expect = e-124, Method: Compositional matrix adjust.
Identities = 237/470 (50%), Positives = 319/470 (67%), Gaps = 25/470 (5%)
Query: 8 LCLSFIFFFTLLSIFPSSIT-SLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQ 66
L LS + S P++IT L+ L + H++ S D + +L S+SLTRA H+K+
Sbjct: 15 LLLSLLSHIAFTSSNPNTITLPLSPLLIKPHSSDS-DPFHSLKFAASASLTRAHHLKHRN 73
Query: 67 TKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYC 126
+ + TT SYGGYSI L+ GTPPQ PF+LDTGS LVWFPCT+ Y C +C
Sbjct: 74 NNSPSVATTPAY----PKSYGGYSIDLNLGTPPQTSPFVLDTGSSLVWFPCTSRYLCSHC 129
Query: 127 S-----SSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQ--CRDCNDEPLATSKNCTQI 179
+ ++KIP+FIPK SS+++LLGC+NPKC +I +Q C C E S+NC+
Sbjct: 130 NFPNIDTTKIPTFIPKNSSTAKLLGCRNPKCGYIFGSDVQFRCPQCKPE----SQNCSLT 185
Query: 180 CPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQ 239
CP+Y++ YG G T G L + LN P + +P FLVGCS+LS RQP+GIAGFGRG+ SLPSQ
Sbjct: 186 CPAYIIQYGLGSTAGFLLLDNLNFPGKTVPQFLVGCSILSIRQPSGIAGFGRGQESLPSQ 245
Query: 240 LNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVY 299
+NL +FSYCL+SH+FDDT ++S L+L SS D KT GL+YTPF +NPS AF Y
Sbjct: 246 MNLKRFSYCLVSHRFDDTPQSSDLVLQI-SSTGDTKTNGLSYTPFRSNPST-NNPAFKEY 303
Query: 300 YYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMV 359
YY+ LR++ VGG+ V++ + +L DGNGGTIVDSG+TFTFM ++ +A EFV Q+
Sbjct: 304 YYLTLRKVIVGGKDVKIPYTFLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFVKQLE 363
Query: 360 KNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSA 419
K NY+RA AE +GL PCF++ G KT +FPEL FKGGA++T P++NYF++VG+
Sbjct: 364 K--NYSRAEDAETQSGLSPCFNISGVKTVTFPELTFKFKGGAKMTQPLQNYFSLVGDAEV 421
Query: 420 VCLTVVTDREA----SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
VCLTVV+D A + GP+IILGN+Q QN+Y+EYDL N+R GF + C+
Sbjct: 422 VCLTVVSDGGAGPPKTTGPAIILGNYQQQNFYIEYDLENERFGFGPRSCR 471
>gi|225440731|ref|XP_002280866.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 469
Score = 449 bits (1156), Expect = e-123, Method: Compositional matrix adjust.
Identities = 231/441 (52%), Positives = 304/441 (68%), Gaps = 25/441 (5%)
Query: 36 FHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSF 95
F NPS D +Q L+ L S+SLTRA H+K+ + T++ T + +HSYGGYS+SLSF
Sbjct: 43 FTKNPSSDPWQLLSHLTSASLTRAHHLKHRKN------TSSVNTPLFAHSYGGYSVSLSF 96
Query: 96 GTPPQIIPFILDTGSHLVWFPCTNHYQCKYCS-----SSKIPSFIPKLSSSSRLLGCQNP 150
GTP Q + F++DTGS LVWFPCT+ Y C CS +KIP+FIPKLSSS++++GC NP
Sbjct: 97 GTPSQTLSFVMDTGSSLVWFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSSAKIVGCLNP 156
Query: 151 KCSWIHHESIQ--CRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
KC ++ ++ C C+ S NCT+ CP+Y + YG G T G+ L E+L R
Sbjct: 157 KCGFVMDSEVRTRCPGCDQN----SANCTKACPTYAIQYGLGTTVGLLLLESLVFAERTE 212
Query: 209 PNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNG 268
P+F+VGCS+LSSRQP+GIAGFGRG +SLP Q+ L KFSYCLLSH+FDD+ ++S + L G
Sbjct: 213 PDFVVGCSILSSRQPSGIAGFGRGPSSLPKQMGLKKFSYCLLSHRFDDSPKSSKMTLYVG 272
Query: 269 SSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGN 328
D KT GL+YTPF NP V+ +AF YYYV LR I VG +RV+V + ++ DGN
Sbjct: 273 PDSKDDKTGGLSYTPFRKNP-VSSNSAFKEYYYVTLRHIIVGDKRVKVPYSFMVAGSDGN 331
Query: 329 GGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTG 388
GGTIVDSG+TFTFM +FE +A EF QM NYTRA EAL+GL+PCF++ G +
Sbjct: 332 GGTIVDSGSTFTFMEKPVFEAVATEFDRQMA---NYTRAADVEALSGLKPCFNLSGVGSV 388
Query: 389 SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA----SGGPSIILGNFQMQ 444
+ P L FKGGA++ LPV NYF++VG+ S +CLT+V++ S GPSIILGN+Q Q
Sbjct: 389 ALPSLVFQFKGGAKMELPVANYFSLVGDLSVLCLTIVSNEAVGSTLSSGPSIILGNYQSQ 448
Query: 445 NYYVEYDLRNQRLGFKQQLCK 465
N+Y EYDL N+R GF++Q CK
Sbjct: 449 NFYTEYDLENERFGFRRQRCK 469
>gi|224091907|ref|XP_002309394.1| predicted protein [Populus trichocarpa]
gi|222855370|gb|EEE92917.1| predicted protein [Populus trichocarpa]
Length = 469
Score = 447 bits (1151), Expect = e-123, Method: Compositional matrix adjust.
Identities = 233/436 (53%), Positives = 298/436 (68%), Gaps = 29/436 (6%)
Query: 41 SQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQ 100
S++ + LN L S SL+RA HIK+P+TK + T + SYGGYSISL+FGTPPQ
Sbjct: 49 SKNPWGALNHLASLSLSRAHHIKSPKTKFSLLKTP-----LFPRSYGGYSISLNFGTPPQ 103
Query: 101 IIPFILDTGSHLVWFPCTNHYQCKYC-----SSSKIPSFIPKLSSSSRLLGCQNPKCSWI 155
F++DTGS LVWFPCT+ Y C C + IP+FIPK SSSS L+GC+N KCSW+
Sbjct: 104 TTKFVMDTGSSLVWFPCTSRYLCSRCDFPNIEVTGIPTFIPKQSSSSNLIGCKNHKCSWL 163
Query: 156 HHESIQ--CRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR-IIPNFL 212
+Q C++C+ T++NCTQ CP Y++ YG G T G+ LSETL+ P++ IP FL
Sbjct: 164 FGPKVQSKCQECD----PTTQNCTQSCPPYVIQYGLGSTAGLLLSETLDFPHKKTIPGFL 219
Query: 213 VGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHS 272
VGCS+ S RQP GIAGFGR SLPSQL L KFSYCL+SH FDDT +S L+LD GS
Sbjct: 220 VGCSLFSIRQPEGIAGFGRSPESLPSQLGLKKFSYCLVSHAFDDTPASSDLVLDTGSGSD 279
Query: 273 DKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTI 332
D KT GL+YTPF NP+ A R+ YYYV LR I +G V+V +K+L DGNGGTI
Sbjct: 280 DTKTPGLSYTPFQKNPTAAFRD----YYYVLLRNIVIGDTHVKVPYKFLVPGSDGNGGTI 335
Query: 333 VDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPE 392
VDSGTTFTFM ++E +A EF Q+ +YT A + TGLRPCF++ GEK+ S PE
Sbjct: 336 VDSGTTFTFMEKPVYELVAKEFEKQVA---HYTVATEVQNQTGLRPCFNISGEKSVSVPE 392
Query: 393 LKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA----SGGPSIILGNFQMQNYYV 448
HFKGGA++ LP+ NYF+ V G +CLT+V+D + GGP+IILGN+Q +N++V
Sbjct: 393 FIFHFKGGAKMALPLANYFSFVDSG-VICLTIVSDNMSGSGIGGGPAIILGNYQQRNFHV 451
Query: 449 EYDLRNQRLGFKQQLC 464
E+DL+N+R GFKQQ C
Sbjct: 452 EFDLKNERFGFKQQNC 467
>gi|18409620|ref|NP_566966.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|13430562|gb|AAK25903.1|AF360193_1 unknown protein [Arabidopsis thaliana]
gi|4886277|emb|CAB43423.1| putative protein [Arabidopsis thaliana]
gi|14532764|gb|AAK64083.1| unknown protein [Arabidopsis thaliana]
gi|15450892|gb|AAK96717.1| Unknown protein [Arabidopsis thaliana]
gi|30387567|gb|AAP31949.1| At3g52500 [Arabidopsis thaliana]
gi|332645431|gb|AEE78952.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 469
Score = 444 bits (1141), Expect = e-122, Method: Compositional matrix adjust.
Identities = 230/460 (50%), Positives = 308/460 (66%), Gaps = 29/460 (6%)
Query: 26 ITSLTFSLSRF-HTNPS-QDSYQNLNSLVSSSLTRALHIK---------NPQTKTTTTTT 74
++++ LS F H++ S +D Y +L L SS+ RA +K + + TTT +
Sbjct: 16 VSAVKLPLSPFSHSDQSPKDPYLSLRRLAESSIARAHKLKHGTSIKPDEDALSSTTTASA 75
Query: 75 TTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSK---- 130
T + +S+ SYGGYS+SLSFGTP Q IPF+ DTGS LVW PCT+ Y C C S
Sbjct: 76 TVVKSPLSAKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVWLPCTSRYLCSGCDFSGLDPT 135
Query: 131 -IPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS 189
IP FIPK SSSS+++GCQ+PKC +++ ++QCR C+ ++NCT CP Y++ YG
Sbjct: 136 LIPRFIPKNSSSSKIIGCQSPKCQFLYGPNVQCRGCD----PNTRNCTVGCPPYILQYGL 191
Query: 190 GLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCL 249
G T G+ ++E L+ P+ +P+F+VGCS++S+RQPAGIAGFGRG SLPSQ+NL +FS+CL
Sbjct: 192 GSTAGVLITEKLDFPDLTVPDFVVGCSIISTRQPAGIAGFGRGPVSLPSQMNLKRFSHCL 251
Query: 250 LSHKFDDTTRTSSLILDNGSSH-SDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
+S +FDDT T+ L LD GS H S KT GLTYTPF NP+V+ + AF YYY+ LRRI
Sbjct: 252 VSRRFDDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNK-AFLEYYYLNLRRIY 310
Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
VG + V++ +KYL +G+GG+IVDSG+TFTFM +FE +A+EF SQM NYTR
Sbjct: 311 VGRKHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQM---SNYTREK 367
Query: 369 GAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR 428
E TGL PCF++ G+ + PEL FKGGA++ LP+ NYF VG VCLTVV+D+
Sbjct: 368 DLEKETGLGPCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDK 427
Query: 429 ----EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
GP+IILG+FQ QNY VEYDL N R GF ++ C
Sbjct: 428 TVNPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKC 467
>gi|297819968|ref|XP_002877867.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323705|gb|EFH54126.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 469
Score = 443 bits (1140), Expect = e-122, Method: Compositional matrix adjust.
Identities = 231/460 (50%), Positives = 309/460 (67%), Gaps = 29/460 (6%)
Query: 26 ITSLTFSLSRF-HTNPS-QDSYQNLNSLVSSSLTRALHIKN-----PQTKTTTTTTTTTT 78
++++ LS F H++ S +D Y +L L SS+ RA +K+ P + ++T T +
Sbjct: 16 VSAVKLPLSPFSHSDQSPKDPYLSLRRLAESSIARAHKLKHGTSIKPDEEALSSTATASA 75
Query: 79 TNISSH----SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS----- 129
T + SH SYGGYS+SLSFGTP Q IPF+ DTGS LVWFPCT+ Y C C+ S
Sbjct: 76 TVVKSHLSPKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVWFPCTSRYLCSDCNFSGLDPT 135
Query: 130 KIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS 189
+IP FIPK SSSSR++GCQNPKC ++ ++QCR C+ ++NCT CP Y++ YG
Sbjct: 136 QIPRFIPKNSSSSRVIGCQNPKCQFLFGANVQCRGCD----PNTRNCTVPCPPYILQYGL 191
Query: 190 GLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCL 249
G T GI +SE L+ P+ +P+F+VGCSV+S+R PAGIAGFGRG SLPSQ+ L FS+CL
Sbjct: 192 GSTAGILISEKLDFPDLTVPDFVVGCSVISTRTPAGIAGFGRGPESLPSQMKLKSFSHCL 251
Query: 250 LSHKFDDTTRTSSLILDNGSSH-SDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
+S +FDDT T+ L LD GS H S KT GL+YTPF NP+V+ AF YYY+ LRRI
Sbjct: 252 VSRRFDDTNVTTDLGLDTGSGHKSGSKTPGLSYTPFRKNPNVSN-TAFLEYYYLNLRRIY 310
Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
VG + V++ +K+L +GNGG+IVDSG+TFTFM +FE +A+EF +QM NYTR
Sbjct: 311 VGSKHVKIPYKFLAPGTNGNGGSIVDSGSTFTFMERPVFELVAEEFATQM---SNYTREK 367
Query: 369 GAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR 428
E ++G+ PCF++ G+ + PEL FKGGA++ LP+ NYF+ VG VCLTVV+D
Sbjct: 368 DLEKVSGIAPCFNISGKGDVTVPELIFEFKGGAKMELPLSNYFSFVGNADTVCLTVVSDN 427
Query: 429 EAS----GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ GP+IILG+FQ QNY VEYDL N R GF ++ C
Sbjct: 428 TVNPGGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKC 467
>gi|147789749|emb|CAN67405.1| hypothetical protein VITISV_025616 [Vitis vinifera]
Length = 609
Score = 443 bits (1140), Expect = e-122, Method: Compositional matrix adjust.
Identities = 229/440 (52%), Positives = 302/440 (68%), Gaps = 25/440 (5%)
Query: 36 FHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSF 95
F NPS D +Q L+ L S+SLTRA H+K+ + T++ T + +HSYGGYS+SLSF
Sbjct: 43 FTKNPSSDPWQLLSHLTSASLTRAHHLKHRKN------TSSVNTPLFAHSYGGYSVSLSF 96
Query: 96 GTPPQIIPFILDTGSHLVWFPCTNHYQCKYCS-----SSKIPSFIPKLSSSSRLLGCQNP 150
GTP Q + F++DTGS LVWFPCT+ Y C CS +KIP+FIPKLSSS++++GC NP
Sbjct: 97 GTPSQTLSFVMDTGSSLVWFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSSAKIVGCLNP 156
Query: 151 KCSWIHHESIQ--CRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
KC ++ ++ C C+ S NCT+ CP+Y + YG G T G+ L E+L R
Sbjct: 157 KCGFVMDSEVRTRCPGCDQN----SANCTKACPTYAIQYGLGTTVGLLLLESLVFAERTE 212
Query: 209 PNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNG 268
P+F+VGCS+LSSRQP+GIAGFGRG +SLP Q+ L KFSYCLLSH+FDD+ ++S + L G
Sbjct: 213 PDFVVGCSILSSRQPSGIAGFGRGPSSLPKQMGLKKFSYCLLSHRFDDSPKSSKMTLYVG 272
Query: 269 SSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGN 328
D KT GL+YTPF NP V+ +AF YYYV LR I VG +RV+ + ++ DGN
Sbjct: 273 PDSKDDKTGGLSYTPFRKNP-VSSNSAFKEYYYVTLRHIIVGDKRVKXPYSFMVAGSDGN 331
Query: 329 GGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTG 388
GGTIVDSG+TFTFM +FE +A EF QM NYTRA EAL+GL+PCF++ G +
Sbjct: 332 GGTIVDSGSTFTFMEKPVFEAVATEFDRQMA---NYTRAADVEALSGLKPCFNLSGVGSV 388
Query: 389 SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA----SGGPSIILGNFQMQ 444
+ P L FKGGA++ LPV NYF++VG+ S +CLT+V++ S GPSIILGN+Q Q
Sbjct: 389 ALPSLVFQFKGGAKMELPVANYFSLVGDLSVLCLTIVSNEAVGSTLSSGPSIILGNYQSQ 448
Query: 445 NYYVEYDLRNQRLGFKQQLC 464
N+Y EYDL N+R GF++Q C
Sbjct: 449 NFYTEYDLENERFGFRRQRC 468
>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 511
Score = 443 bits (1139), Expect = e-122, Method: Compositional matrix adjust.
Identities = 232/470 (49%), Positives = 312/470 (66%), Gaps = 22/470 (4%)
Query: 6 SALCLSFIFFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNP 65
S L + F+F LL SS ++ L+ F + D ++ +N L+S+SL RA H+K P
Sbjct: 52 SFLPIPFLFSIFLLLPTSSSSSTTVLPLTTFPSVSFTDPFKTINLLLSASLNRAQHLKTP 111
Query: 66 QTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKY 125
Q+K+ T+ + + SYG YS+SL+FGTPPQ + FI DTGS LVWFPCT Y+C
Sbjct: 112 QSKSNTSIQNVS---LFPRSYGAYSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSR 168
Query: 126 CS-----SSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQ--CRDCNDEPLATSKNCTQ 178
CS + I F+PKLSSS +++GC+NPKC+WI +++ CR+CN + S+ C+
Sbjct: 169 CSFPYVDPATISKFVPKLSSSVKVVGCRNPKCAWIFGPNLKSRCRNCNSK----SRKCSD 224
Query: 179 ICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPS 238
CP Y + YGSG T GI LSETL+L N+ +P+FLVGCSV+S QPAGIAGFGRG SLPS
Sbjct: 225 SCPGYGLQYGSGATAGILLSETLDLENKRVPDFLVGCSVMSVHQPAGIAGFGRGPESLPS 284
Query: 239 QLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSV 298
Q+ L +FS+CL+S FDD+ +S L+LD+GS + KT Y PF NPSV+ AF
Sbjct: 285 QMRLKRFSHCLVSRGFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNA-AFRE 343
Query: 299 YYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM 358
YYY+ LRRI +GG+ V+ +KYL D GNGG I+DSG+TFTF+ +FE +ADE Q+
Sbjct: 344 YYYLSLRRILIGGKPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQL 403
Query: 359 VKNRNYTRALGAEALTGLRPCFDVPG-EKTGSFPELKLHFKGGAEVTLPVENYFAVVGEG 417
VK Y RA EA +GLRPCF++P E++ FP++ L FKGG +++L ENY A+V +
Sbjct: 404 VK---YPRAKDVEAQSGLRPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDE 460
Query: 418 SAVCLTVVTDRE---ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
VCLT++TD GGP+IILG FQ QN VEYDL QR+GF++Q C
Sbjct: 461 GVVCLTMMTDEAVVGGGGGPAIILGAFQQQNVLVEYDLAKQRIGFRKQKC 510
>gi|356498711|ref|XP_003518193.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 466
Score = 443 bits (1139), Expect = e-121, Method: Compositional matrix adjust.
Identities = 234/463 (50%), Positives = 307/463 (66%), Gaps = 23/463 (4%)
Query: 14 FFFTLLSIFPSSI-TSLTFSLSRFHTN-PSQDS--YQNLNSLVSSSLTRALHIKNPQTKT 69
F +++ F SS ++T LS TN PS S + L VS+S+TRA H+KN +
Sbjct: 13 FLSIIITTFSSSTPNTITLHLSPLFTNHPSSSSHPFHTLKLAVSTSITRAHHLKNHKPNK 72
Query: 70 TTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSS- 128
+ T + +YGGYSI L FGTP Q PF+LDTGS LVW PC++HY C C+S
Sbjct: 73 SLETP------VHPKTYGGYSIDLEFGTPSQTFPFVLDTGSTLVWLPCSSHYLCSKCNSF 126
Query: 129 SKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG 188
S P FIPK SSSS+ +GC NPKC+W+ ++ C + A NC+Q CP+Y V YG
Sbjct: 127 SNTPKFIPKNSSSSKFVGCTNPKCAWVFGPDVKSHCCRQDK-AAFNNCSQTCPAYTVQYG 185
Query: 189 SGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYC 248
G T G LSE LN P + +FL+GCSV+S QPAGIAGFGRG+ SLPSQ+NL +FSYC
Sbjct: 186 LGSTAGFLLSENLNFPTKKYSDFLLGCSVVSVYQPAGIAGFGRGEESLPSQMNLTRFSYC 245
Query: 249 LLSHKFDDT-TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRI 307
LLSH+FDD+ T TS+L+L+ SS D KT G++YTPF+ NP+ + AF YYY+ L+RI
Sbjct: 246 LLSHQFDDSATITSNLVLETASSR-DGKTNGVSYTPFLKNPTTKKNPAFGAYYYITLKRI 304
Query: 308 TVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRA 367
VG +RVRV + L + DG+GG IVDSG+TFTFM +F+ +A EF Q+ +YTRA
Sbjct: 305 VVGEKRVRVPRRLLEPNVDGDGGFIVDSGSTFTFMERPIFDLVAQEFAKQV----SYTRA 360
Query: 368 LGAEALTGLRPCFDVP-GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT 426
AE GL PCF + G +T SFPEL+ F+GGA++ LPV NYF++VG+G CLT+V+
Sbjct: 361 REAEKQFGLSPCFVLAGGAETASFPELRFEFRGGAKMRLPVANYFSLVGKGDVACLTIVS 420
Query: 427 DREASG----GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
D A GP++ILGN+Q QN+YVEYDL N+R GF+ Q C+
Sbjct: 421 DDVAGSGGTVGPAVILGNYQQQNFYVEYDLENERFGFRSQSCQ 463
>gi|147866226|emb|CAN79938.1| hypothetical protein VITISV_027777 [Vitis vinifera]
Length = 454
Score = 442 bits (1138), Expect = e-121, Method: Compositional matrix adjust.
Identities = 248/449 (55%), Positives = 299/449 (66%), Gaps = 38/449 (8%)
Query: 27 TSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSY 86
+ +T LS +P D Y+NL LVS+SL RA H+KN TT T+TT + +HSY
Sbjct: 34 SPITLPLSASKPSPPPDPYRNLRHLVSASLIRARHLKN------PKTTPTSTTPLFTHSY 87
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCS-SSKIPS---FIPKLSSSS 142
G YSI LSFGTPPQ +P I+DTGS LVWFPCT+ Y C+ CS S+ PS FIPK SSSS
Sbjct: 88 GAYSIPLSFGTPPQTLPLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSSNIFIPKSSSSS 147
Query: 143 RLLGCQNPKCSWIHHESIQ--CRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSET 200
++LGC NPKC WIH +Q CRDC EP TS NCTQICP YL
Sbjct: 148 KVLGCVNPKCGWIHGSKVQSRCRDC--EP--TSPNCTQICPPYLNFLRFWDHRRSQFHRR 203
Query: 201 LNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRT 260
+ P L I+GFGRG SLPSQL L KFSYCLLS ++DDTT +
Sbjct: 204 MLCP-------------LHQSTRREISGFGRGPPSLPSQLGLKKFSYCLLSRRYDDTTES 250
Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
SSL+LD G S S +KT GL+YTPFV NP VA ++AFSVYYY+GLR ITVGG+ V++ +KY
Sbjct: 251 SSLVLD-GESDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITVGGKHVKIPYKY 309
Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
L DG+GGTI+DSGTTFT+M E+FE +A EF Q+ RA E +TGLRPCF
Sbjct: 310 LIPGADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQSK----RATEVEGITGLRPCF 365
Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTD----REASGGPSI 436
++ G T SFPEL L F+GGAE+ LP+ NY A +G VCLT+VTD +E SGGP+I
Sbjct: 366 NISGLNTPSFPELTLKFRGGAEMELPLANYVAFLGGDDVVCLTIVTDGAAGKEFSGGPAI 425
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
ILGNFQ QN+YVEYDLRN+RLGF+QQ CK
Sbjct: 426 ILGNFQQQNFYVEYDLRNERLGFRQQSCK 454
>gi|224140036|ref|XP_002323393.1| predicted protein [Populus trichocarpa]
gi|222868023|gb|EEF05154.1| predicted protein [Populus trichocarpa]
Length = 459
Score = 442 bits (1137), Expect = e-121, Method: Compositional matrix adjust.
Identities = 232/436 (53%), Positives = 303/436 (69%), Gaps = 29/436 (6%)
Query: 41 SQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQ 100
S+ + +LN L S SL+RA HIK+P+T + T + SYGGYSISL+FGTPPQ
Sbjct: 40 SKKPWGSLNHLASLSLSRAHHIKSPKTNFSLIKTP-----LFPRSYGGYSISLNFGTPPQ 94
Query: 101 IIPFILDTGSHLVWFPCTNHYQCKYCS-----SSKIPSFIPKLSSSSRLLGCQNPKCSWI 155
F++DTGS LVWFPCT+ Y C C+ + IP+F+PKLSSSS+L+GC+NP+CS I
Sbjct: 95 TTKFVMDTGSSLVWFPCTSRYLCSECNFPNIKKTGIPTFLPKLSSSSKLIGCKNPRCSMI 154
Query: 156 HHESIQ--CRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR-IIPNFL 212
IQ C++C+ +T++NCTQ CP Y++ YGSG T G+ LSETL+ PN+ IP+FL
Sbjct: 155 FGPEIQSKCQECD----STAQNCTQTCPPYVIQYGSGSTAGLLLSETLDFPNKKTIPDFL 210
Query: 213 VGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHS 272
VGCS+ S +QP GIAGFGR SLPSQL L KFSYCL+SH FDDT +S L+LD GS
Sbjct: 211 VGCSIFSIKQPEGIAGFGRSPESLPSQLGLKKFSYCLVSHAFDDTPTSSDLVLDTGSGSG 270
Query: 273 DKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTI 332
KT GL++TPF+ NP+ A R+ YYYV LR I +G V+V +K+L DGNGGTI
Sbjct: 271 VTKTAGLSHTPFLKNPTTAFRD----YYYVLLRNIVIGDTHVKVPYKFLVPGTDGNGGTI 326
Query: 333 VDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPE 392
VDSGTTFTFM ++E +A EF QM +YT A + LTGLRPC+++ GEK+ S P+
Sbjct: 327 VDSGTTFTFMENPVYELVAKEFEKQMA---HYTVATEIQNLTGLRPCYNISGEKSLSVPD 383
Query: 393 LKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR----EASGGPSIILGNFQMQNYYV 448
L FKGGA++ LP+ NYF++V G +CLT+V+D GGP+IILGN+Q +N+YV
Sbjct: 384 LIFQFKGGAKMALPLSNYFSIVDSG-VICLTIVSDNVAGPGLGGGPAIILGNYQQRNFYV 442
Query: 449 EYDLRNQRLGFKQQLC 464
E+DL N++ GFKQQ C
Sbjct: 443 EFDLENEKFGFKQQSC 458
>gi|356570895|ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 470
Score = 442 bits (1136), Expect = e-121, Method: Compositional matrix adjust.
Identities = 228/450 (50%), Positives = 301/450 (66%), Gaps = 22/450 (4%)
Query: 28 SLTFSLSRFHTNP---SQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSH 84
S+T LS T P D + ++ SSSLTRA H+K+ + + TT
Sbjct: 28 SITLPLSPLLTKPHSSDSDPFHSVKLAASSSLTRAHHLKHRNNNSPSVATTPAYPK---- 83
Query: 85 SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCS-----SSKIPSFIPKLS 139
SYGGYSI L+ GTPPQ PF+LDTGS LVWFPCT+HY C +C+ +KIP+FIPK S
Sbjct: 84 SYGGYSIDLNLGTPPQTSPFVLDTGSSLVWFPCTSHYLCSHCNFPNIDPTKIPTFIPKNS 143
Query: 140 SSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSE 199
S+++LLGC+NPKC ++ ++ R C S+NC+ CPSY++ YG G T G L +
Sbjct: 144 STAKLLGCRNPKCGYLFGPDVESR-CPQCKKPGSQNCSLTCPSYIIQYGLGATAGFLLLD 202
Query: 200 TLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTR 259
LN P + +P FLVGCS+LS RQP+GIAGFGRG+ SLPSQ+NL +FSYCL+SH+FDDT +
Sbjct: 203 NLNFPGKTVPQFLVGCSILSIRQPSGIAGFGRGQESLPSQMNLKRFSYCLVSHRFDDTPQ 262
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
+S L+L SS D KT GL+YTPF +NPS + F YYYV LR++ VGG V++ +K
Sbjct: 263 SSDLVLQI-SSTGDTKTNGLSYTPFRSNPS--NNSVFREYYYVTLRKLIVGGVDVKIPYK 319
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
+L DGNGGTIVDSG+TFTFM ++ +A EF+ Q+ K Y+R EA +GL PC
Sbjct: 320 FLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFLRQLGK--KYSREENVEAQSGLSPC 377
Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA----SGGPS 435
F++ G KT SFPE FKGGA+++ P+ NYF+ VG+ +C TVV+D A + GP+
Sbjct: 378 FNISGVKTISFPEFTFQFKGGAKMSQPLLNYFSFVGDAEVLCFTVVSDGGAGQPKTAGPA 437
Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
IILGN+Q QN+YVEYDL N+R GF + CK
Sbjct: 438 IILGNYQQQNFYVEYDLENERFGFGPRNCK 467
>gi|16209647|gb|AAL14384.1| AT3g52500/F22O6_120 [Arabidopsis thaliana]
Length = 469
Score = 438 bits (1127), Expect = e-120, Method: Compositional matrix adjust.
Identities = 229/460 (49%), Positives = 307/460 (66%), Gaps = 29/460 (6%)
Query: 26 ITSLTFSLSRF-HTNPS-QDSYQNLNSLVSSSLTRALHIK---------NPQTKTTTTTT 74
++++ LS F H++ S +D Y +L L SS+ RA +K + + TTT +
Sbjct: 16 VSAVKLPLSPFSHSDQSPKDPYLSLRRLAESSIARAHKLKHGTSIKPDEDALSSTTTASA 75
Query: 75 TTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSK---- 130
T + +S+ SYGGYS+SLSFGTP Q IPF+ DTGS LV PCT+ Y C C S
Sbjct: 76 TVVKSPLSAKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVCLPCTSRYLCSGCDFSGLDPT 135
Query: 131 -IPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS 189
IP FIPK SSSS+++GCQ+PKC +++ ++QCR C+ ++NCT CP Y++ YG
Sbjct: 136 LIPRFIPKNSSSSKIIGCQSPKCQFLYGPNVQCRGCD----PNTRNCTVGCPPYILQYGL 191
Query: 190 GLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCL 249
G T G+ ++E L+ P+ +P+F+VGCS++S+RQPAGIAGFGRG SLPSQ+NL +FS+CL
Sbjct: 192 GSTAGVLITEKLDFPDLTVPDFVVGCSIISTRQPAGIAGFGRGPVSLPSQMNLKRFSHCL 251
Query: 250 LSHKFDDTTRTSSLILDNGSSH-SDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
+S +FDDT T+ L LD GS H S KT GLTYTPF NP+V+ + AF YYY+ LRRI
Sbjct: 252 VSRRFDDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNK-AFLEYYYLNLRRIY 310
Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
VG + V++ +KYL +G+GG+IVDSG+TFTFM +FE +A+EF SQM NYTR
Sbjct: 311 VGRKHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQM---SNYTREK 367
Query: 369 GAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR 428
E TGL PCF++ G+ + PEL FKGGA++ LP+ NYF VG VCLTVV+D+
Sbjct: 368 DLEKETGLGPCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDK 427
Query: 429 ----EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
GP+IILG+FQ QNY VEYDL N R GF ++ C
Sbjct: 428 TVNPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKC 467
>gi|356537161|ref|XP_003537098.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 601
Score = 424 bits (1090), Expect = e-116, Method: Compositional matrix adjust.
Identities = 212/431 (49%), Positives = 285/431 (66%), Gaps = 21/431 (4%)
Query: 45 YQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPF 104
+ L VS+S+TRA H+KN ++ T + +YGGYSI L FGTPPQ PF
Sbjct: 178 FHTLQLAVSTSITRAHHLKNHNNPSSLKTL------VHPKTYGGYSIDLKFGTPPQTFPF 231
Query: 105 ILDTGSHLVWFPCTNHYQCKYCSS---SKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQ 161
+LDTGS LVW PC +HY C C+S + P FIPK S SS+ +GC+NPKC+W+ +
Sbjct: 232 VLDTGSSLVWLPCYSHYLCSKCNSFSNNNTPKFIPKDSFSSKFVGCRNPKCAWVFGSDVT 291
Query: 162 CRDCN--DEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLS 219
C + + NC+Q CP+Y V YG G T G LSE LN P + + +FLVGCSV+S
Sbjct: 292 SHCCKLAKAAFSNNNNCSQTCPAYTVQYGLGSTAGFLLSENLNFPAKNVSDFLVGCSVVS 351
Query: 220 SRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGL 279
QP GIAGFGRG+ SLP+Q+NL +FSYCLLSH+FD++ S L+++ +S KKT G+
Sbjct: 352 VYQPGGIAGFGRGEESLPAQMNLTRFSYCLLSHQFDESPENSDLVMEATNSGEGKKTNGV 411
Query: 280 TYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTF 339
+YT F+ NPS ++ AF YYY+ LR+I VG +RVRV + L D +G+GG IVDSG+T
Sbjct: 412 SYTAFLKNPST-KKPAFGAYYYITLRKIVVGEKRVRVPRRMLEPDVNGDGGFIVDSGSTL 470
Query: 340 TFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP-GEKTGSFPELKLHFK 398
TFM +F+ +A+EFV Q+ NYTRA E GL PCF + G +T SFPE++ F+
Sbjct: 471 TFMERPIFDLVAEEFVKQV----NYTRARELEKQFGLSPCFVLAGGAETASFPEMRFEFR 526
Query: 399 GGAEVTLPVENYFAVVGEGSAVCLTVVTDREA----SGGPSIILGNFQMQNYYVEYDLRN 454
GGA++ LPV NYF+ VG+G CLT+V+D A + GP++ILGN+Q QN+YVE DL N
Sbjct: 527 GGAKMRLPVANYFSRVGKGDVACLTIVSDDVAGQGGAVGPAVILGNYQQQNFYVECDLEN 586
Query: 455 QRLGFKQQLCK 465
+R GF+ Q C+
Sbjct: 587 ERFGFRSQSCQ 597
>gi|297740190|emb|CBI30372.3| unnamed protein product [Vitis vinifera]
Length = 445
Score = 395 bits (1016), Expect = e-107, Method: Compositional matrix adjust.
Identities = 205/398 (51%), Positives = 270/398 (67%), Gaps = 27/398 (6%)
Query: 36 FHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSF 95
F NPS D +Q L+ L S+SLTRA H+K+ + T++ T + +HSYGGYS+SLSF
Sbjct: 59 FTKNPSSDPWQLLSHLTSASLTRAHHLKHRKN------TSSVNTPLFAHSYGGYSVSLSF 112
Query: 96 GTPPQIIPFILDTGSHLVWFPCTNHYQCKYCS-----SSKIPSFIPKLSSSSRLLGCQNP 150
GTP Q + F++DTGS LVWFPCT+ Y C CS +KIP+FIPKLSSS++++GC NP
Sbjct: 113 GTPSQTLSFVMDTGSSLVWFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSSAKIVGCLNP 172
Query: 151 KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPN 210
KC ++ S NCT+ CP+Y + YG G T G+ L E+L R P+
Sbjct: 173 KCGFVMDSE------------NSANCTKACPTYAIQYGLGTTVGLLLLESLVFAERTEPD 220
Query: 211 FLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSS 270
F+VGCS+LSSRQP+GIAGFGRG +SLP Q+ L KFSYCLLSH+FDD+ ++S + L G
Sbjct: 221 FVVGCSILSSRQPSGIAGFGRGPSSLPKQMGLKKFSYCLLSHRFDDSPKSSKMTLYVGPD 280
Query: 271 HSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGG 330
D KT GL+YTPF NP V+ +AF YYYV LR I VG +RV+V + ++ DGNGG
Sbjct: 281 SKDDKTGGLSYTPFRKNP-VSSNSAFKEYYYVTLRHIIVGDKRVKVPYSFMVAGSDGNGG 339
Query: 331 TIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSF 390
TIVDSG+TFTFM +FE +A EF QM NYTRA EAL+GL+PCF++ G + +
Sbjct: 340 TIVDSGSTFTFMEKPVFEAVATEFDRQMA---NYTRAADVEALSGLKPCFNLSGVGSVAL 396
Query: 391 PELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR 428
P L FKGGA++ LPV NYF++VG+ S +CLT+V++
Sbjct: 397 PSLVFQFKGGAKMELPVANYFSLVGDLSVLCLTIVSNE 434
>gi|223973231|gb|ACN30803.1| unknown [Zea mays]
Length = 459
Score = 374 bits (960), Expect = e-101, Method: Compositional matrix adjust.
Identities = 203/439 (46%), Positives = 274/439 (62%), Gaps = 36/439 (8%)
Query: 53 SSSLTRALHIK------NPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFIL 106
++SL RALH+K + Q + + T + HSYGGY+ + S GTPPQ +P +L
Sbjct: 25 AASLARALHLKRRDPNHHSQKGSGGHPSVPATAALYPHSYGGYAFTASLGTPPQPLPVLL 84
Query: 107 DTGSHLVWFPCTNHYQCKYCSS---SKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCR 163
DTGSHL W PCT+ Y+C+ CSS S +P F PK SSSSRL+GC+NP C W+H +
Sbjct: 85 DTGSHLTWVPCTSSYECRNCSSPSASAVPVFHPKNSSSSRLVGCRNPSCQWVHSAANLAT 144
Query: 164 DCNDEPLAT-SKNC----TQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVL 218
C P + + NC + +CP Y V+YGSG T G+ +++TL P R +P F++GCS++
Sbjct: 145 KCRRAPCSPGAANCPAAASNVCPPYAVVYGSGSTAGLLIADTLRAPGRAVPGFVLGCSLV 204
Query: 219 SSRQ-PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS-SLILDNGSSHSDKKT 276
S Q P+G+AGFGRG S+P+QL L KFSYCLLS +FDD S SL+L
Sbjct: 205 SVHQPPSGLAGFGRGAPSVPAQLGLPKFSYCLLSRRFDDNAAVSGSLVLG-----GTGGG 259
Query: 277 TGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSG 336
G+ Y P V + + ++ + VYYY+ LR +TVGG+ VR+ + + G+GGTIVDSG
Sbjct: 260 EGMQYVPLVKS-AAGDKLPYGVYYYLALRGVTVGGKAVRLPARAFAANAAGSGGTIVDSG 318
Query: 337 TTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP-GEKTGSFPELKL 395
TTFT++ P +F+P+AD Y R+ AE GL PCF +P G ++ + PEL
Sbjct: 319 TTFTYLDPTVFQPVADA--VVAAVGGRYKRSKDAEDELGLHPCFALPQGARSMALPELSF 376
Query: 396 HFKGGAEVTLPVENYFAVVGEGS--AVCLTVVTD--------REASGGPSIILGNFQMQN 445
HF+GGA + LPVENYF V G G+ A+CL VVTD E S GP+IILG+FQ QN
Sbjct: 377 HFEGGAVMQLPVENYFVVAGRGAVEAICLAVVTDFSGGSGAGNEGS-GPAIILGSFQQQN 435
Query: 446 YYVEYDLRNQRLGFKQQLC 464
Y VEYDL +RLGF++Q C
Sbjct: 436 YLVEYDLEKERLGFRRQSC 454
>gi|225440720|ref|XP_002275202.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 368 bits (944), Expect = 5e-99, Method: Compositional matrix adjust.
Identities = 198/446 (44%), Positives = 273/446 (61%), Gaps = 36/446 (8%)
Query: 34 SRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISL 93
S F +PS + L L ++SL+RA H+K+ +T + T ++S HSYGG+SI L
Sbjct: 38 STFTNSPSTKPLRFLQHLATASLSRAHHLKHGKT------SPLTQISLSPHSYGGHSIPL 91
Query: 94 SFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSSSRLLGCQ 148
SFGTPPQ + F++DTGSH+VW PCT HY C CS S K+P F PKLSSSS++LGC+
Sbjct: 92 SFGTPPQKLSFLVDTGSHVVWAPCTTHYTCTNCSFSDAEPKKVPIFNPKLSSSSKILGCR 151
Query: 149 NPKCSWIHHESIQ--CRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
NPKC + C CN SKNC+ CP Y + YG+G + G L E LN P +
Sbjct: 152 NPKCVNTSSPDVHLGCPPCN----GNSKNCSHACPPYSLQYGTGASSGDFLLENLNFPGK 207
Query: 207 IIPNFLVGC--SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLI 264
I FLVGC S + A +AGFGR SLP Q+ + KF+YCL SH +DDT +S LI
Sbjct: 208 TIHEFLVGCTTSAVGEVTSAALAGFGRSMFSLPMQMGVKKFAYCLNSHDYDDTRNSSKLI 267
Query: 265 LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLD 324
LD +SD +T GL+Y PF+ NP F +YYY+G++ I +G + +R+ KYL
Sbjct: 268 LD----YSDGETKGLSYAPFLKNPP-----DFPIYYYLGVKDIKIGNKLLRIPSKYLAPG 318
Query: 325 RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPG 384
DG GG ++DSG + +M +F+ + +E +M K Y R+L AEA G+ PC++ G
Sbjct: 319 SDGRGGLMIDSGFAYGYMTGPVFKKVTNELKKRMSK---YRRSLEAEAEIGVTPCYNFTG 375
Query: 385 EKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR-----EASGGPSIILG 439
+K+ P+L F+GGA + +P +NYF ++ E S C + TD E + GPSIILG
Sbjct: 376 QKSIKIPDLIYQFRGGATMVVPGKNYFVLIPEISLACFPLTTDAGTNTLEFTPGPSIILG 435
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLCK 465
N Q +YYVE+DL+N+RLGF+QQ C+
Sbjct: 436 NSQHVDYYVEFDLKNERLGFRQQTCQ 461
>gi|226531872|ref|NP_001147022.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195606574|gb|ACG25117.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 491
Score = 365 bits (936), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 194/419 (46%), Positives = 261/419 (62%), Gaps = 28/419 (6%)
Query: 66 QTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKY 125
Q + + T + HSYGGY+ + S GTPPQ +P +LDTGSHL W PCT+ Y+C+
Sbjct: 76 QKGSGGHPSVPATAALYPHSYGGYAFTASLGTPPQPLPVLLDTGSHLTWVPCTSSYECRN 135
Query: 126 CSS---SKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLAT-SKNC----T 177
CSS S +P F PK SSSSRL+GC+NP C W+H + C P + + NC +
Sbjct: 136 CSSPSASAVPVFHPKNSSSSRLVGCRNPSCQWVHSAANLATKCRRAPCSPGAANCPAAAS 195
Query: 178 QICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQ-PAGIAGFGRGKTSL 236
+CP Y V+YGSG T G+ +++TL P R +P F++GCS++S Q P+G+AGFGRG S+
Sbjct: 196 NVCPPYAVVYGSGSTAGLLIADTLRAPGRAVPGFVLGCSLVSVHQPPSGLAGFGRGAPSV 255
Query: 237 PSQLNLDKFSYCLLSHKFDDTTRTS-SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNA 295
P+QL L KFSYCLLS +FDD S SL+L G+ Y P V + + ++
Sbjct: 256 PAQLGLPKFSYCLLSRRFDDNAAVSGSLVLG-----GTGGGEGMQYVPLVKS-AAGDKLP 309
Query: 296 FSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFV 355
+ VYYY+ LR +TVGG+ VR+ + + G+GGTIVDSGTTFT++ P +F+P+AD
Sbjct: 310 YGVYYYLALRGVTVGGKAVRLPARAFAGNAAGSGGTIVDSGTTFTYLDPTVFQPVADA-- 367
Query: 356 SQMVKNRNYTRALGAEALTGLRPCFDVP-GEKTGSFPELKLHFKGGAEVTLPVENYFAVV 414
Y R+ AE GL PCF +P G ++ + PEL HF+GGA + LPVENYF V
Sbjct: 368 VVAAVGGRYKRSKDAEDGLGLHPCFALPQGARSMALPELSFHFEGGAVMQLPVENYFVVA 427
Query: 415 GEGS--AVCLTVVTD-------REASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
G G+ A+CL VVTD GP+IILG+FQ QNY VEYDL +RLGF++Q C
Sbjct: 428 GRGAVEAICLAVVTDFGGGSGAGNEGSGPAIILGSFQQQNYLVEYDLEKERLGFRRQSC 486
>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
Length = 447
Score = 362 bits (929), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 200/449 (44%), Positives = 275/449 (61%), Gaps = 42/449 (9%)
Query: 30 TFSLSRFHTNPSQ-DSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGG 88
TF LS +PS D ++++N SSL+RA H+K P T T T SYGG
Sbjct: 22 TFPLS---ISPSALDKWESINLAALSSLSRARHLKRPPTLTGKVTLPAY-----PRSYGG 73
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCT---NHYQCKYCSSS-----KIPSFIPKLSS 140
YS+ S GTPPQ + +LDTGS LVW PCT Y C+ C+ S KIP + SS
Sbjct: 74 YSVIFSLGTPPQKVSLVLDTGSSLVWTPCTIPTATYTCQNCTFSGVDPTKIPIYARNKSS 133
Query: 141 SSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSET 200
+ + L C++PKC+W+ + C + T+ CP Y + YG G T G +S+
Sbjct: 134 TVQSLPCRSPKCNWVFGSDLNC------------STTKRCPYYGLEYGLGSTTGQLVSDV 181
Query: 201 LNLP--NRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTT 258
L L NRI P+FL GCS++S+RQP GIAGFGRG S+P+QL L KFSYCL+SH+FDDT
Sbjct: 182 LGLSKLNRI-PDFLFGCSLVSNRQPEGIAGFGRGLASIPAQLGLTKFSYCLVSHRFDDTP 240
Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
++ L+L G H+D G+ Y PF +P+++ +S YYY+ L +I VGG+ V +
Sbjct: 241 QSGDLVLHRGRRHADAAANGVAYAPFTKSPALS---PYSEYYYISLSKILVGGKDVPIPP 297
Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
+YL ++G+GG IVDSG+TFTFM +F+P+A E M K Y RA E +GL P
Sbjct: 298 RYLVPSKEGDGGMIVDSGSTFTFMERIIFDPVARELEKHMTK---YKRAKEIEDSSGLGP 354
Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG---GPS 435
C+++ G+ P+L FKGGA + LP+ +YF++V +G VC+TV+TD + G GP+
Sbjct: 355 CYNITGQSEVDVPKLTFSFKGGANMDLPLTDYFSLVTDG-VVCMTVLTDPDEPGSTTGPA 413
Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
IILGN+Q QN+Y+EYDL+ QR GFK Q C
Sbjct: 414 IILGNYQQQNFYIEYDLKKQRFGFKPQQC 442
>gi|225440722|ref|XP_002275223.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
gi|147841923|emb|CAN65212.1| hypothetical protein VITISV_039022 [Vitis vinifera]
Length = 458
Score = 352 bits (904), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 194/481 (40%), Positives = 283/481 (58%), Gaps = 40/481 (8%)
Query: 1 MASYISALCLSFIFFFTLLSIFPSSITSLTFSLS-----RFHTNPSQDSYQNLNSLVSSS 55
MAS+ + L S F+ L + SS ++ +++ F NPS + L L ++S
Sbjct: 1 MASF-TTLLFSVFTLFSRLVLASSSKNNIPATITIPLTPTFTKNPSTEPLLFLQHLATAS 59
Query: 56 LTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWF 115
++R+ H+K+ + + T++ HS+GG++I LSFGTPPQ + F++DTGSH+VW
Sbjct: 60 MSRSHHLKHGKA------SPLIQTSLFPHSHGGHTIPLSFGTPPQKLSFLVDTGSHVVWA 113
Query: 116 PCTNHYQCKYCSSS---KIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQ--CRDCNDEPL 170
PCT HY C CS S K+P F P+LSSS ++LGC++PKC+ + C CN
Sbjct: 114 PCTTHYTCTNCSFSNPKKVPIFNPELSSSDKILGCRDPKCANTSSPDVHLGCPRCN---- 169
Query: 171 ATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQPA--GIAG 228
SK C+ CP Y + YG+G G L E L+ P + I FLVGC+ + R+P+ +AG
Sbjct: 170 GNSKKCSHACPQYTLQYGTGAASGFFLLENLDFPGKTIHKFLVGCTTSADREPSSDALAG 229
Query: 229 FGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNP 288
FGR SLP Q+ + KF+YCL SH +DDT + LILD +SD +T GL+Y PF+ NP
Sbjct: 230 FGRTMFSLPMQMGVKKFAYCLNSHDYDDTRNSGKLILD----YSDGETQGLSYAPFLKNP 285
Query: 289 SVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFE 348
+ YYY+G++ + +G + +R+ KYLT D GG ++DSG + +M +F+
Sbjct: 286 P-----DYPFYYYLGVKDMKIGNKLLRIPGKYLTPGSDSRGGVMIDSGFAYGYMTLPVFK 340
Query: 349 PLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVE 408
+ +E QM K Y R+L AE +GL PC++ G K+ P+L F GGA + +P
Sbjct: 341 IVTNELKKQMSK---YRRSLEAETQSGLTPCYNFTGHKSIKIPDLIYQFTGGANMVVPGM 397
Query: 409 NYFAVVGEGSAVCLTVVTDR-----EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
NYF + E S C V TD E + GPSIILGN+Q ++YVE+DL+N+RLGF+QQ
Sbjct: 398 NYFLLFSEASLGCFPVTTDSPTNNLEFTPGPSIILGNYQQVDHYVEFDLKNERLGFRQQT 457
Query: 464 C 464
C
Sbjct: 458 C 458
>gi|380719867|gb|AFD63134.1| aspartyl protease [Vitis quinquangularis]
Length = 458
Score = 350 bits (898), Expect = 9e-94, Method: Compositional matrix adjust.
Identities = 193/481 (40%), Positives = 282/481 (58%), Gaps = 40/481 (8%)
Query: 1 MASYISALCLSFIFFFTLLSIFPSSITSLTFSLSR-----FHTNPSQDSYQNLNSLVSSS 55
MAS+ + L S F+ L + SS ++ +++ F NPS + L L ++S
Sbjct: 1 MASF-TTLLFSVFTLFSHLVLASSSKNNIPATITIPLTPIFTKNPSTEPLLFLQHLATAS 59
Query: 56 LTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWF 115
++R+ H+K+ + + T++ HSYG ++I LSFGTPPQ + F++DTGSH+VW
Sbjct: 60 MSRSHHLKHGKA------SPLIQTSLFPHSYGAHTIPLSFGTPPQKLSFLMDTGSHVVWA 113
Query: 116 PCTNHYQCKYCSSS---KIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQC--RDCNDEPL 170
PCT HY C CS S K+P F P+LSSS ++LGC++PKC+ + CN
Sbjct: 114 PCTTHYTCTNCSFSNPKKVPIFNPELSSSDKILGCRDPKCADTSSPBVHLGXPRCN---- 169
Query: 171 ATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQPA--GIAG 228
SK C+ CP Y + YG+G G L E L+ P + I FLVGC+ + R+P+ +AG
Sbjct: 170 GNSKKCSHACPQYTLQYGTGAASGFFLLENLDFPGKTIHKFLVGCTTSADREPSSDALAG 229
Query: 229 FGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNP 288
FGR SLP Q+ + KF+YCL SH +DDT + LILD +SD +T GL+Y PF NP
Sbjct: 230 FGRTMFSLPMQMGVKKFAYCLNSHDYDDTRNSGKLILD----YSDGETQGLSYAPFXKNP 285
Query: 289 SVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFE 348
+ +YYY+G++ + +G + +R+ KYLT D GG ++DSG +++M +F+
Sbjct: 286 P-----DYPIYYYLGVKDMKIGNKVLRIPGKYLTPGSDSRGGVVIDSGFAYSYMTLPVFK 340
Query: 349 PLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVE 408
+ +E QM K Y R+L EA TG+ PC++ G K+ P+L F GGA + +P
Sbjct: 341 IVTNELKKQMSK---YRRSLELEAQTGVTPCYNFTGHKSIKIPDLIYQFTGGANMVVPGM 397
Query: 409 NYFAVVGEGSAVCLTVVTDR-----EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
NYF + E S C V TD E + GPSIILGN+Q ++YVE+DL+N+RLGF+QQ
Sbjct: 398 NYFLLFSEASLGCFPVTTDSPTSNLEFTPGPSIILGNYQQVDHYVEFDLKNERLGFRQQT 457
Query: 464 C 464
C
Sbjct: 458 C 458
>gi|242074844|ref|XP_002447358.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
gi|241938541|gb|EES11686.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
Length = 497
Score = 341 bits (875), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 192/435 (44%), Positives = 261/435 (60%), Gaps = 41/435 (9%)
Query: 58 RALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPC 117
RA H + + + T + HSYGGY+ + S GTPPQ +P +LDTGS L W PC
Sbjct: 72 RASHHSQKGSSSGGHKSIPATAALYPHSYGGYAFTASLGTPPQPLPVLLDTGSQLTWVPC 131
Query: 118 TNHYQCKYCSS---SKIPSFIPKLSSSSRLLGCQNPKCSWIH--HESIQCRDCNDEPLAT 172
T++Y C+ CSS + +P F PK SSSSRL+GC+NP C W+H +CR P +
Sbjct: 132 TSNYDCRNCSSPFAAAVPVFHPKNSSSSRLVGCRNPSCLWVHSAEHVAKCR----APCSR 187
Query: 173 SKNCT---QICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQ-PAGIAG 228
NCT +CP Y V+YGSG T G+ +++TL P R + F++GCS++S Q P+G+AG
Sbjct: 188 GANCTPASNVCPPYAVVYGSGSTAGLLIADTLRAPGRAVSGFVLGCSLVSVHQPPSGLAG 247
Query: 229 FGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNP 288
FGRG S+P+QL L KFSYCLLS +FDD S ++ G + G+ Y P V +
Sbjct: 248 FGRGAPSVPAQLGLSKFSYCLLSRRFDDNAAVSGSLVLGGDN------DGMQYVPLVKS- 300
Query: 289 SVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFE 348
+ ++ ++VYYY+ L +TVGG+ VR+ + + G+GG IVDSGTTFT++ P +F+
Sbjct: 301 AAGDKQPYAVYYYLALSGVTVGGKAVRLPARAFAANAAGSGGAIVDSGTTFTYLDPTVFQ 360
Query: 349 PLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP-GEKTGSFPELKLHFKGGAEVTLPV 407
P+AD Y R+ E GL PCF +P G K+ + PEL LHFKGGA + LP+
Sbjct: 361 PVADA--VVAAVGGRYKRSKDVEEGLGLHPCFALPQGAKSMALPELSLHFKGGAVMQLPL 418
Query: 408 ENYFAVVGEG------------SAVCLTVVTD------REASGGPSIILGNFQMQNYYVE 449
ENYF V G A+CL VVTD + GGP+IILG+FQ QNY VE
Sbjct: 419 ENYFVVAGRAPVPGAGAGAGAAEAICLAVVTDFGGSGAGDEGGGPAIILGSFQQQNYLVE 478
Query: 450 YDLRNQRLGFKQQLC 464
YDL +RLGF++Q C
Sbjct: 479 YDLEKERLGFRRQPC 493
>gi|90399145|emb|CAJ86169.1| H0913C04.10 [Oryza sativa Indica Group]
gi|125550292|gb|EAY96114.1| hypothetical protein OsI_17992 [Oryza sativa Indica Group]
Length = 491
Score = 337 bits (865), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 197/431 (45%), Positives = 255/431 (59%), Gaps = 39/431 (9%)
Query: 65 PQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCK 124
P+++ T + ++ HSYGGY+ ++S GTPPQ +P +LDTGSHL W PCT+ YQC+
Sbjct: 65 PRSRQGTAPPPSVRASLYPHSYGGYAFTVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCR 124
Query: 125 YCSS----SKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCT--- 177
CSS S + F PK SSSSRL+GC+NP C WIH DC NCT
Sbjct: 125 NCSSLSAASPLHVFHPKNSSSSRLIGCRNPSCLWIHSPD-HLSDCRAASSCPGANCTPRN 183
Query: 178 ----QICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQ-PAGIAGFGRG 232
+CP YLV+YGSG T G+ +S+TL P R + NF++GCS+ S Q P+G+AGFGRG
Sbjct: 184 ANANNVCPPYLVVYGSGSTAGLLISDTLRTPGRAVRNFVIGCSLASVHQPPSGLAGFGRG 243
Query: 233 KTSLPSQLNLDKFSYCLLSHKFDDTTRTS-SLILDNGSSHSDKKTTGLTYTPFVNNPSVA 291
S+PSQL L KFSYCLLS +FDD S LIL + G+ Y P S +
Sbjct: 244 APSVPSQLGLTKFSYCLLSRRFDDNAAVSGELILG--GAGGKDGGVGMQYAPLAR--SAS 299
Query: 292 ERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLA 351
R +SVYYY+ L ITVGG+ V++ + + GG IVDSGTTF++ +FEP+A
Sbjct: 300 ARPPYSVYYYLALTAITVGGKSVQLPERAF-VAGGAGGGAIVDSGTTFSYFDRTVFEPVA 358
Query: 352 DEFVSQMVKNRNYTRALGAEALTGLRPCFDV-PGEKTGSFPELKLHFKGGAEVTLPVENY 410
Y+R+ E GL PCF + PG KT PE+ LHFKGG+ + LPVENY
Sbjct: 359 AA--VVAAVGGRYSRSKVVEEGLGLSPCFAMPPGTKTMELPEMSLHFKGGSVMNLPVENY 416
Query: 411 FAVVGE---------GSAVCLTVVTD--------REASGGPSIILGNFQMQNYYVEYDLR 453
F V G A+CL VV+D +SGGP+IILG+FQ QNYY+EYDL
Sbjct: 417 FVVAGPAPSGGAPAMAEAICLAVVSDVPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLE 476
Query: 454 NQRLGFKQQLC 464
+RLGF++Q C
Sbjct: 477 KERLGFRRQQC 487
>gi|222629809|gb|EEE61941.1| hypothetical protein OsJ_16693 [Oryza sativa Japonica Group]
Length = 648
Score = 337 bits (865), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 197/431 (45%), Positives = 255/431 (59%), Gaps = 39/431 (9%)
Query: 65 PQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCK 124
P+++ T + ++ HSYGGY+ ++S GTPPQ +P +LDTGSHL W PCT+ YQC+
Sbjct: 65 PRSRQGTAPPPSVRASLYPHSYGGYAFTVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCR 124
Query: 125 YCSS----SKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCT--- 177
CSS S + F PK SSSSRL+GC+NP C WIH DC NCT
Sbjct: 125 NCSSLSAASPLHVFHPKNSSSSRLIGCRNPSCLWIHSPD-HLSDCRAASSCPGANCTPRN 183
Query: 178 ----QICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQ-PAGIAGFGRG 232
+CP YLV+YGSG T G+ +S+TL P R + NF++GCS+ S Q P+G+AGFGRG
Sbjct: 184 ANANNVCPPYLVVYGSGSTAGLLISDTLRTPGRAVRNFVIGCSLASVHQPPSGLAGFGRG 243
Query: 233 KTSLPSQLNLDKFSYCLLSHKFDDTTRTS-SLILDNGSSHSDKKTTGLTYTPFVNNPSVA 291
S+PSQL L KFSYCLLS +FDD S LIL + G+ Y P S +
Sbjct: 244 APSVPSQLGLTKFSYCLLSRRFDDNAAVSGELILG--GAGGKDGGVGMQYAPLAR--SAS 299
Query: 292 ERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLA 351
R +SVYYY+ L ITVGG+ V++ + + GG IVDSGTTF++ +FEP+A
Sbjct: 300 ARPPYSVYYYLALTAITVGGKSVQLPERAF-VAGGAGGGAIVDSGTTFSYFDRTVFEPVA 358
Query: 352 DEFVSQMVKNRNYTRALGAEALTGLRPCFDV-PGEKTGSFPELKLHFKGGAEVTLPVENY 410
Y+R+ E GL PCF + PG KT PE+ LHFKGG+ + LPVENY
Sbjct: 359 --AAVVAAVGGRYSRSKVVEEGLGLSPCFAMPPGTKTMELPEMSLHFKGGSVMNLPVENY 416
Query: 411 FAVVGE---------GSAVCLTVVTD--------REASGGPSIILGNFQMQNYYVEYDLR 453
F V G A+CL VV+D +SGGP+IILG+FQ QNYY+EYDL
Sbjct: 417 FVVAGPAPSGGAPAMAEAICLAVVSDVPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLE 476
Query: 454 NQRLGFKQQLC 464
+RLGF++Q C
Sbjct: 477 KERLGFRRQQC 487
>gi|194708432|gb|ACF88300.1| unknown [Zea mays]
Length = 452
Score = 328 bits (841), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 178/378 (47%), Positives = 239/378 (63%), Gaps = 30/378 (7%)
Query: 108 TGSHLVWFPCTNHYQCKYCSS---SKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRD 164
+GSHL W PCT+ Y+C+ CSS S +P F PK SSSSRL+GC+NP C W+H +
Sbjct: 79 SGSHLTWVPCTSSYECRNCSSPSASAVPVFHPKNSSSSRLVGCRNPSCQWVHSAANLATK 138
Query: 165 CNDEPLAT-SKNC----TQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLS 219
C P + + NC + +CP Y V+YGSG T G+ +++TL P R +P F++GCS++S
Sbjct: 139 CRRAPCSPGAANCPAAASNVCPPYAVVYGSGSTAGLLIADTLRAPGRAVPGFVLGCSLVS 198
Query: 220 SRQP-AGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS-SLILDNGSSHSDKKTT 277
QP +G+AGFGRG S+P+QL L KFSYCLLS +FDD S SL+L
Sbjct: 199 VHQPPSGLAGFGRGAPSVPAQLGLPKFSYCLLSRRFDDNAAVSGSLVLG-----GTGGGE 253
Query: 278 GLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGT 337
G+ Y P V + + ++ + VYYY+ LR +TVGG+ VR+ + + G+GGTIVDSGT
Sbjct: 254 GMQYVPLVKS-AAGDKLPYGVYYYLALRGVTVGGKAVRLPARAFAANAAGSGGTIVDSGT 312
Query: 338 TFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP-GEKTGSFPELKLH 396
TFT++ P +F+P+AD Y R+ AE GL PCF +P G ++ + PEL H
Sbjct: 313 TFTYLDPTVFQPVADA--VVAAVGGRYKRSKDAEDELGLHPCFALPQGARSMALPELSFH 370
Query: 397 FKGGAEVTLPVENYFAVVGEGS--AVCLTVVTD--------REASGGPSIILGNFQMQNY 446
F+GGA + LPVENYF V G G+ A+CL VVTD E S GP+IILG+FQ QNY
Sbjct: 371 FEGGAVMQLPVENYFVVAGRGAVEAICLAVVTDFSGGSGAGNEGS-GPAIILGSFQQQNY 429
Query: 447 YVEYDLRNQRLGFKQQLC 464
VEYDL +RLGF++Q C
Sbjct: 430 LVEYDLEKERLGFRRQSC 447
>gi|326529727|dbj|BAK04810.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 488
Score = 328 bits (840), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 203/453 (44%), Positives = 261/453 (57%), Gaps = 36/453 (7%)
Query: 40 PSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPP 99
P+ + L+ L +SL RA ++ ++ + HSYGGY+ SLS GTPP
Sbjct: 39 PAAAQHHPLSRLARASLARASRLRGHHQGQAASSPVRAA--LYPHSYGGYAFSLSLGTPP 96
Query: 100 QIIPFILDTGSHLVWFPCTNHYQCKYCSSSK--IPSFIPKLSSSSRLLGCQNPKCSWIH- 156
Q +P +LDTGSHL W PCT++YQC+ CS++ P F PK SSSS L+ C +P C WIH
Sbjct: 97 QPLPVLLDTGSHLTWVPCTSNYQCQNCSAAAGSFPVFHPKSSSSSLLVSCSSPSCLWIHS 156
Query: 157 --HESIQCRD---CNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIP-- 209
H S RD C S T +CP YLV+YGSG T G+ +S+TL L R
Sbjct: 157 KSHLSDCARDSAPCRPSTANCSATATNVCPPYLVVYGSGSTAGLLVSDTLRLSPRGAASR 216
Query: 210 NFLVGCSVLSSRQ-PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS-SLILDN 267
NF VGCS+ S Q P+G+AGFGRG S+P+QL ++KFSYCLLS +FDD S L+L
Sbjct: 217 NFAVGCSLASVHQPPSGLAGFGRGAPSVPAQLGVNKFSYCLLSRRFDDDAAISGELVL-- 274
Query: 268 GSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT-LDRD 326
G+S + K + Y P + N R +SVYYY+ L I VGG+ V + + L +
Sbjct: 275 GASSAGKAKAMMQYAPLLKN--AGARPPYSVYYYLSLTGIAVGGKSVALPARALAPVSGG 332
Query: 327 GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP-GE 385
G GG I+DSGTTFT++ P +F+P Y R+ E GLRPCF +P G
Sbjct: 333 GGGGAIIDSGTTFTYLDPTVFKP--VAAAMVAAVGGRYNRSKDVEGALGLRPCFALPAGA 390
Query: 386 KTGSFPELKLHFKGGAEVTLPVENYFAVVGEGS-----AVCLTVVTD---------REAS 431
+T PEL LHF GGAE+ LP+ENYF G S A+CL VV+D
Sbjct: 391 RTMDLPELSLHFSGGAEMRLPIENYFLAAGPASGVAPEAICLAVVSDVSSASGGAGVSGG 450
Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
GGP+IILG+FQ QNY VEYDL RLGF+QQ C
Sbjct: 451 GGPAIILGSFQQQNYQVEYDLEKNRLGFRQQPC 483
>gi|225440729|ref|XP_002275391.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147789748|emb|CAN67404.1| hypothetical protein VITISV_025615 [Vitis vinifera]
Length = 450
Score = 326 bits (835), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 184/428 (42%), Positives = 257/428 (60%), Gaps = 37/428 (8%)
Query: 51 LVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGS 110
L S+SL+RA H+K+ +T T+ + HSYGG+SISLSFGTPPQ + F++DTGS
Sbjct: 46 LASASLSRAHHLKHGKTNPPVKTS------LFPHSYGGHSISLSFGTPPQKLSFLVDTGS 99
Query: 111 HLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSSSRLLGCQNPKC--SWIHHESIQCR 163
+VW PCT Y C CS S K+P F PKLSSSS++L C+NPKC ++ + + C
Sbjct: 100 DVVWAPCTTDYTCTNCSFSAADPKKVPIFDPKLSSSSKILDCRNPKCVSTYFPYVHLGCP 159
Query: 164 DCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQP 223
CN SK+C+ CP Y YG+G + G L E L P + I NFL+GC+ ++R+
Sbjct: 160 RCN----GNSKHCSYACP-YSTQYGTGASSGYFLLENLKFPRKTIRNFLLGCTTSAAREL 214
Query: 224 A--GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTY 281
+ +AGFGR SLP Q+ + KF+YCL SH +DDT + LILD + D KT GL+Y
Sbjct: 215 SSDALAGFGRSMFSLPIQMGVKKFAYCLNSHDYDDTRNSGKLILD----YRDGKTKGLSY 270
Query: 282 TPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFT- 340
TPF+ +P A + YY++G++ I +G + +R+ KYL DG G I+DSG
Sbjct: 271 TPFLKSPP-----ASAFYYHLGVKDIKIGNKLLRIPSKYLAPGSDGRSGVIIDSGYGGAG 325
Query: 341 FMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGG 400
+M +F+ + +E QM K Y R+L AE TGL PC++ G K+ P L F+GG
Sbjct: 326 YMTGPVFKIVTNELKKQMSK---YRRSLEAETQTGLTPCYNFTGHKSIKIPPLIYQFRGG 382
Query: 401 AEVTLPVENYFAVVGEGSAVCLTVVTD----REASGGPSIILGNFQMQNYYVEYDLRNQR 456
A + +P +NYF + + S C + T+ E + PSIILGN Q +YYVEYDL+N R
Sbjct: 383 ANMVVPGKNYFGISPQESLACFLMDTNGTNALEITPDPSIILGNSQHVDYYVEYDLKNDR 442
Query: 457 LGFKQQLC 464
GF++Q C
Sbjct: 443 FGFRRQTC 450
>gi|357162717|ref|XP_003579500.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 488
Score = 323 bits (829), Expect = 8e-86, Method: Compositional matrix adjust.
Identities = 192/429 (44%), Positives = 254/429 (59%), Gaps = 42/429 (9%)
Query: 66 QTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKY 125
+ ++ T + HSYGGY+ S+S GTPPQ +P +LDTGSHL W PCT+ YQC+
Sbjct: 68 HAEPSSQAPAAVRTALYPHSYGGYAFSVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRN 127
Query: 126 CSSSKIPS-----FIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCT-QI 179
CSSS F PK SSSSRL+GC+NP C WIH +S C +T N +
Sbjct: 128 CSSSPSAMSAMAVFHPKNSSSSRLVGCRNPACRWIHSKSPS--TCG----STGNNGNGDV 181
Query: 180 CPSYLVLYGSGLTEGIALSETLNLPNRIIP-------NFLVGCSVLSSRQ-PAGIAGFGR 231
CP YLV+YGSG T G+ +S+TL L NF +GCS++S Q P+G+AGFGR
Sbjct: 182 CPPYLVVYGSGSTSGLLISDTLRLSPSSSSSAPAPFRNFAIGCSIVSVHQPPSGLAGFGR 241
Query: 232 GKTSLPSQLNLDKFSYCLLSHKFDDTTRTS-SLILDNGSSHSDKKTTGLTYTPFVNNPSV 290
G S+PSQL + KFSYCLLS +FDD + S L+L + + KK T + Y P +NN
Sbjct: 242 GAPSVPSQLKVPKFSYCLLSRRFDDNSAVSGELVLGDAMVPAGKKKTTMQYVPLLNN--A 299
Query: 291 AERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPL 350
A + +SVYYY+ L I+VGG+ V + + GG I+DSGTTFT++ P +F+P+
Sbjct: 300 ASKPPYSVYYYLALTGISVGGKPVNLPSRAFV--PSSGGGAIIDSGTTFTYLDPTVFKPV 357
Query: 351 ADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGS--FPELKLHFKGGAEVTLPVE 408
A S + Y R+ E GLRPCF +P G+ P+L+L FKGGA + LPVE
Sbjct: 358 AAAMESAV--GGRYNRSRPVEDALGLRPCFALPPGPGGAMELPDLELKFKGGAVMRLPVE 415
Query: 409 NYF-------AVVGEGSAVCLTVVTD------REASGGPSIILGNFQMQNYYVEYDLRNQ 455
NYF A+CL VV+D A+ GP+IILG+FQ QNY++EYDL +
Sbjct: 416 NYFVAAGPAGGPAAGPVAICLAVVSDLPASGGDGAAAGPAIILGSFQQQNYHIEYDLGKE 475
Query: 456 RLGFKQQLC 464
RLGF+QQ C
Sbjct: 476 RLGFRQQPC 484
>gi|32488713|emb|CAE03456.1| OSJNBa0088H09.14 [Oryza sativa Japonica Group]
Length = 490
Score = 322 bits (826), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 193/431 (44%), Positives = 251/431 (58%), Gaps = 40/431 (9%)
Query: 65 PQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCK 124
P+++ T + ++ HSYGGY+ ++S GTPPQ +P +L+TGSHL W P T+ Y
Sbjct: 65 PRSRQGTAPPPSVRASLYPHSYGGYAFTVSLGTPPQPLPVLLETGSHLSWVPSTSSYSAN 124
Query: 125 YCSS----SKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCT--- 177
CSS S + F PK SSSSRL+GC+NP C WIH DC NCT
Sbjct: 125 -CSSLSAASPLHVFHPKNSSSSRLIGCRNPSCLWIHSPD-HLSDCRAASSCPGANCTPRN 182
Query: 178 ----QICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQ-PAGIAGFGRG 232
+CP YLV+YGSG T G+ +S+TL P R + NF++GCS+ S Q P+G+AGFGRG
Sbjct: 183 ANANNVCPPYLVVYGSGSTAGLLISDTLRTPGRAVRNFVIGCSLASVHQPPSGLAGFGRG 242
Query: 233 KTSLPSQLNLDKFSYCLLSHKFDDTTRTS-SLILDNGSSHSDKKTTGLTYTPFVNNPSVA 291
S+PSQL L KFSYCLLS +FDD S LIL + G+ Y P S +
Sbjct: 243 APSVPSQLGLTKFSYCLLSRRFDDNAAVSGELILG--GAGGKDGGVGMQYAPLAR--SAS 298
Query: 292 ERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLA 351
R +SVYYY+ L ITVGG+ V++ + + GG IVDSGTTF++ +FEP+A
Sbjct: 299 ARPPYSVYYYLALTAITVGGKSVQLPERAF-VAGGAGGGAIVDSGTTFSYFDRTVFEPVA 357
Query: 352 DEFVSQMVKNRNYTRALGAEALTGLRPCFDV-PGEKTGSFPELKLHFKGGAEVTLPVENY 410
Y+R+ E GL PCF + PG KT PE+ LHFKGG+ + LPVENY
Sbjct: 358 AA--VVAAVGGRYSRSKVVEEGLGLSPCFAMPPGTKTMELPEMSLHFKGGSVMNLPVENY 415
Query: 411 FAVVGE---------GSAVCLTVVTD--------REASGGPSIILGNFQMQNYYVEYDLR 453
F V G A+CL VV+D +SGGP+IILG+FQ QNYY+EYDL
Sbjct: 416 FVVAGPAPSGGAPAMAEAICLAVVSDVPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLE 475
Query: 454 NQRLGFKQQLC 464
+RLGF++Q C
Sbjct: 476 KERLGFRRQQC 486
>gi|294463081|gb|ADE77078.1| unknown [Picea sitchensis]
Length = 370
Score = 305 bits (781), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 162/378 (42%), Positives = 232/378 (61%), Gaps = 29/378 (7%)
Query: 106 LDTGSHLVWFPCTNHYQCKYC--SSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQ-- 161
+DTGS LVW PCT +Y C C S+ F+P++SSS L+ C + C ++ + +
Sbjct: 1 MDTGSDLVWVPCTRNYSCINCPEDSASNGVFLPRMSSSLHLVTCADSNCKTLYGNNTELL 60
Query: 162 CRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLP------NRIIPNFLVGC 215
C+ C + KNC++ CP Y + YG G T G+ L+ETLNLP R I +F VGC
Sbjct: 61 CQSC----AGSLKNCSETCPPYGIQYGRGSTAGLLLTETLNLPLENGEGARAITHFAVGC 116
Query: 216 SVLSSRQPAGIAGFGRGKTSLPSQLN----LDKFSYCLLSHKFDDTTRTSSLILDNGSSH 271
S++SS+QP+GIAGFGRG S+PSQL D+F+YCL SH+FD+ + S ++L + +
Sbjct: 117 SIVSSQQPSGIAGFGRGALSMPSQLGEHIGKDRFAYCLQSHRFDEENKKSLMVLGDKALP 176
Query: 272 SDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR-VWHKYLTLDRDGNGG 330
++ L YTPF+ N + + VYYY+GLR +++GG+R++ + K L D GNGG
Sbjct: 177 NNIP---LNYTPFLTNSRAPPSSQYGVYYYIGLRGVSIGGKRLKQLPSKLLRFDTKGNGG 233
Query: 331 TIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSF 390
TI+DSGTTFT + E+F+ +A F SQ+ Y RA E TG+ C+DV G +
Sbjct: 234 TIIDSGTTFTVFSDEIFKHIAAGFASQI----GYRRAGEVEDKTGMGLCYDVTGLENIVL 289
Query: 391 PELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR---EASGGPSIILGNFQMQNYY 447
PE HFKGG+++ LPV NYF+ ++CLT+++ R E GP++ILGN Q Q++Y
Sbjct: 290 PEFAFHFKGGSDMVLPVANYFSYFSSFDSICLTMISSRGLLEVDSGPAVILGNDQQQDFY 349
Query: 448 VEYDLRNQRLGFKQQLCK 465
+ YD RLGF QQ CK
Sbjct: 350 LLYDREKNRLGFTQQTCK 367
>gi|296084856|emb|CBI28265.3| unnamed protein product [Vitis vinifera]
Length = 446
Score = 304 bits (778), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 200/445 (44%), Positives = 243/445 (54%), Gaps = 100/445 (22%)
Query: 27 TSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSY 86
+ +T LS +P D Y+NL LVS+SL RA H+KN TT T+TT + +HSY
Sbjct: 34 SPITLPLSASKPSPPPDPYRNLRHLVSASLIRARHLKN------PKTTPTSTTPLFTHSY 87
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCS-SSKIPS---FIPKLSSSS 142
G YSI LSFGTPPQ +P I+DTGS LVWFPCT+ Y C+ CS S+ PS FIPK SSSS
Sbjct: 88 GAYSIPLSFGTPPQTLPLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSSNIFIPKSSSSS 147
Query: 143 RLLGCQNPKCSWIHHESIQ--CRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSET 200
++LGC NPKC WIH +Q CRDC EP TS NCTQICP YLV YGSG+T GI LSET
Sbjct: 148 KVLGCVNPKCGWIHGSKVQSRCRDC--EP--TSPNCTQICPPYLVFYGSGITGGIMLSET 203
Query: 201 LNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRT 260
L+LP + +PNF+VGCSVLS+ QPAGI+GFGRG SLPSQL L KFSYCLLS ++DDTT +
Sbjct: 204 LDLPGKGVPNFIVGCSVLSTSQPAGISGFGRGPPSLPSQLGLKKFSYCLLSRRYDDTTES 263
Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR-ITVGGQRVRVWHK 319
SSLI + ++ +K+ V + A V GLR + G + +
Sbjct: 264 SSLIFELVAAEFEKQ--------------VQSKRATEVEGITGLRPCFNISGLNTPSFPE 309
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
LTL G E+ PLA NY LG + + L
Sbjct: 310 -LTLKFRGGA---------------EMELPLA-----------NYVAFLGGDDVVCLTIV 342
Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
D K F GG + L G
Sbjct: 343 TDGAAGK---------EFSGGPAIIL---------------------------------G 360
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
NFQ QN+YVEYDLRN+RLGF+QQ C
Sbjct: 361 NFQQQNFYVEYDLRNERLGFRQQSC 385
>gi|115461432|ref|NP_001054316.1| Os04g0685200 [Oryza sativa Japonica Group]
gi|113565887|dbj|BAF16230.1| Os04g0685200, partial [Oryza sativa Japonica Group]
Length = 330
Score = 238 bits (606), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 147/328 (44%), Positives = 191/328 (58%), Gaps = 31/328 (9%)
Query: 161 QCRDCNDEPLAT----SKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCS 216
CR + P A + N +CP YLV+YGSG T G+ +S+TL P R + NF++GCS
Sbjct: 6 DCRAASSCPGANCTPRNANANNVCPPYLVVYGSGSTAGLLISDTLRTPGRAVRNFVIGCS 65
Query: 217 VLSSRQ-PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS-SLILDNGSSHSDK 274
+ S Q P+G+AGFGRG S+PSQL L KFSYCLLS +FDD S LIL +
Sbjct: 66 LASVHQPPSGLAGFGRGAPSVPSQLGLTKFSYCLLSRRFDDNAAVSGELILG--GAGGKD 123
Query: 275 KTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVD 334
G+ Y P S + R +SVYYY+ L ITVGG+ V++ + + GG IVD
Sbjct: 124 GGVGMQYAPLAR--SASARPPYSVYYYLALTAITVGGKSVQLPERAF-VAGGAGGGAIVD 180
Query: 335 SGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV-PGEKTGSFPEL 393
SGTTF++ +FEP+A Y+R+ E GL PCF + PG KT PE+
Sbjct: 181 SGTTFSYFDRTVFEPVAAA--VVAAVGGRYSRSKVVEEGLGLSPCFAMPPGTKTMELPEM 238
Query: 394 KLHFKGGAEVTLPVENYFAVVGE---------GSAVCLTVVTD--------REASGGPSI 436
LHFKGG+ + LPVENYF V G A+CL VV+D +SGGP+I
Sbjct: 239 SLHFKGGSVMNLPVENYFVVAGPAPSGGAPAMAEAICLAVVSDVPTSSGGAGVSSGGPAI 298
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
ILG+FQ QNYY+EYDL +RLGF++Q C
Sbjct: 299 ILGSFQQQNYYIEYDLEKERLGFRRQQC 326
>gi|359474399|ref|XP_003631454.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 485
Score = 216 bits (551), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 148/414 (35%), Positives = 205/414 (49%), Gaps = 49/414 (11%)
Query: 89 YSISLSFGT-PPQIIPFILDTGSHLVWFPCTNHYQCKYCS----SSKIPSFIPKLSSSSR 143
Y++S + G+ PPQ I +DTGS LVWFPC ++C C ++ P +SS
Sbjct: 73 YTLSFNLGSHPPQPISLYMDTGSDLVWFPCAP-FECILCEGKYDTAATGGLSPPNITSSA 131
Query: 144 LLGCQNPKCSWIHHESIQCRD------CNDEPLATSKNCTQICPSYLVLYGSGLTEGIAL 197
+ C++P CS H S+ D C E + TS + CP + YG G
Sbjct: 132 SVSCKSPACSAAH-TSLSSSDLCAMARCPLELIETSDCSSFSCPPFYYAYGDGSLVARLY 190
Query: 198 SETLNLPNR---IIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNL------DKFSYC 248
++L++P ++ NF GC+ + +P G+AGFGRG SLP+QL ++FSYC
Sbjct: 191 RDSLSMPASSPLVLHNFTFGCAHTALGEPVGVAGFGRGVLSLPAQLASFSPHLGNQFSYC 250
Query: 249 LLSHKFD--DTTRTSSLILDNGSSHSDKKTT------GLTYTPFVNNPSVAERNAFSVYY 300
L+SH FD R S LIL S +KK YT ++NP +Y
Sbjct: 251 LVSHSFDADRVRRPSPLILGRYSLDDEKKKRVGHDRGEFVYTAMLDNPK------HPYFY 304
Query: 301 YVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVK 360
VGL ITVG +++ V +DR GNGG +VDSGTTFT + L+E L EF +M
Sbjct: 305 CVGLEGITVGNRKIPVPEILKRVDRRGNGGMVVDSGTTFTMLPAGLYESLVTEFNHRM-- 362
Query: 361 NRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEG--- 417
R Y RA E TGL PC+ + P + LHF G + V LP NY+ +G
Sbjct: 363 GRVYKRATQIEERTGLGPCY-YSDDSAAKVPAVALHFVGNSTVILPRNNYYYEFFDGRDG 421
Query: 418 -----SAVCLTVVT--DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
CL ++ D SGGP+ LGN+Q Q + V YDL R+GF ++ C
Sbjct: 422 QKKKRKVGCLMLMNGGDEAESGGPAATLGNYQQQGFEVVYDLEKHRVGFARRKC 475
>gi|356540510|ref|XP_003538731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 417
Score = 211 bits (537), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 146/400 (36%), Positives = 205/400 (51%), Gaps = 37/400 (9%)
Query: 89 YSISLSFGT-PPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGC 147
Y++S + G+ P Q I +DTGS LVWFPC ++C C K + P + S + C
Sbjct: 19 YTLSFNLGSHPSQSITLYMDTGSDLVWFPCAP-FECILCEG-KFNATKPLNITRSHRVSC 76
Query: 148 QNPKCSWIH-----HESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLN 202
Q+P CS H H+ C + + TS + CP + YG G +TL+
Sbjct: 77 QSPACSTAHSSVSSHDLCAIARCPLDNIETSDCSSATCPPFYYAYGDGSFIAHLHRDTLS 136
Query: 203 LPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQL-----NL-DKFSYCLLSHKFDD 256
+ + NF GC+ + +P G+AGFGRG SLP+QL NL ++FSYCL+SH FD
Sbjct: 137 MSQLFLKNFTFGCAHTALAEPTGVAGFGRGLLSLPAQLATLSPNLGNRFSYCLVSHSFDK 196
Query: 257 --TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
+ S LIL + +S ++ YT + NP S +Y VGL I+VG + +
Sbjct: 197 ERVRKPSPLILGHYDDYSSERVE-FVYTSMLRNPK------HSYFYCVGLTGISVGKRTI 249
Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
+DR G+GG +VDSGTTFT + L+ + EF ++ R + RA E T
Sbjct: 250 LAPEMLRRVDRRGDGGVVVDSGTTFTMLPASLYNSVVAEFDRRV--GRVHKRASEVEEKT 307
Query: 375 GLRPCFDVPGEKTGSFPELKLHFKG-GAEVTLPVENYFA--VVGEGSAV----CLTVVT- 426
GL PC+ + G P + HF G + V LP NYF + GE A CL ++
Sbjct: 308 GLGPCYFLEG--LVEVPTVTWHFLGNNSNVMLPRMNYFYEFLDGEDEARRKVGCLMLMNG 365
Query: 427 --DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
D E SGGP ILGN+Q Q + V YDL NQR+GF ++ C
Sbjct: 366 GDDTELSGGPGAILGNYQQQGFEVVYDLENQRVGFAKRQC 405
>gi|115459640|ref|NP_001053420.1| Os04g0535200 [Oryza sativa Japonica Group]
gi|113564991|dbj|BAF15334.1| Os04g0535200 [Oryza sativa Japonica Group]
gi|116310090|emb|CAH67110.1| H0502G05.1 [Oryza sativa Indica Group]
gi|116310464|emb|CAH67468.1| OSIGBa0159I10.13 [Oryza sativa Indica Group]
gi|215715343|dbj|BAG95094.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765807|dbj|BAG87504.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767550|dbj|BAG99778.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218195278|gb|EEC77705.1| hypothetical protein OsI_16781 [Oryza sativa Indica Group]
Length = 492
Score = 211 bits (536), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 146/406 (35%), Positives = 200/406 (49%), Gaps = 41/406 (10%)
Query: 89 YSISLSFGTPPQI--IPFILDTGSHLVWFPCTNHYQCKYCSSSKIP-----SFIPKLSSS 141
Y++SLS G P + LDTGS LVWFPC + C C P S +P
Sbjct: 88 YTLSLSVGPPSTASSVSLFLDTGSDLVWFPCAP-FTCMLCEGKATPGGNHSSPLPP-PID 145
Query: 142 SRLLGCQNPKCSWIHHES-----IQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGI 195
SR + C +P CS H + C + + T + CP YG G L +
Sbjct: 146 SRRISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLVANL 205
Query: 196 ALSETLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSH 252
+ + NF C+ + +P G+AGFGRG SLP+QL +FSYCL++H
Sbjct: 206 RRGRVGLAASMAVENFTFACAHTALAEPVGVAGFGRGPLSLPAQLAPSLSGRFSYCLVAH 265
Query: 253 KF--DDTTRTSSLILDNGSSHS--DKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
F D R+S LIL + + T YTP ++NP +Y V L ++
Sbjct: 266 SFRADRLIRSSPLILGRSTDAAAIGASETDFVYTPLLHNPK------HPYFYSVALEAVS 319
Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
VGG+R++ + +DRDGNGG +VDSGTTFT + + F +ADEF M R
Sbjct: 320 VGGKRIQAQPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAE- 378
Query: 369 GAEALTGLRPCFDV-PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVG--EGSAV-CLTV 424
GAEA TGL PC+ P ++ + P + LHF+G A V LP NYF EG +V CL +
Sbjct: 379 GAEAQTGLAPCYHYSPSDR--AVPPVALHFRGNATVALPRRNYFMGFKSEEGRSVGCLML 436
Query: 425 VT------DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ D E GGP+ LGNFQ Q + V YD+ R+GF ++ C
Sbjct: 437 MNVGGNNDDGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 482
>gi|38605896|emb|CAD41523.2| OSJNBb0020O11.8 [Oryza sativa Japonica Group]
Length = 519
Score = 211 bits (536), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 146/406 (35%), Positives = 200/406 (49%), Gaps = 41/406 (10%)
Query: 89 YSISLSFGTPPQI--IPFILDTGSHLVWFPCTNHYQCKYCSSSKIP-----SFIPKLSSS 141
Y++SLS G P + LDTGS LVWFPC + C C P S +P
Sbjct: 88 YTLSLSVGPPSTASSVSLFLDTGSDLVWFPCAP-FTCMLCEGKATPGGNHSSPLPP-PID 145
Query: 142 SRLLGCQNPKCSWIHHES-----IQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGI 195
SR + C +P CS H + C + + T + CP YG G L +
Sbjct: 146 SRRISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLVANL 205
Query: 196 ALSETLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSH 252
+ + NF C+ + +P G+AGFGRG SLP+QL +FSYCL++H
Sbjct: 206 RRGRVGLAASMAVENFTFACAHTALAEPVGVAGFGRGPLSLPAQLAPSLSGRFSYCLVAH 265
Query: 253 KF--DDTTRTSSLILDNGSSHS--DKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
F D R+S LIL + + T YTP ++NP +Y V L ++
Sbjct: 266 SFRADRLIRSSPLILGRSTDAAAIGASETDFVYTPLLHNPK------HPYFYSVALEAVS 319
Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
VGG+R++ + +DRDGNGG +VDSGTTFT + + F +ADEF M R
Sbjct: 320 VGGKRIQAQPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAE- 378
Query: 369 GAEALTGLRPCFDV-PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVG--EGSAV-CLTV 424
GAEA TGL PC+ P ++ + P + LHF+G A V LP NYF EG +V CL +
Sbjct: 379 GAEAQTGLAPCYHYSPSDR--AVPPVALHFRGNATVALPRRNYFMGFKSEEGRSVGCLML 436
Query: 425 VT------DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ D E GGP+ LGNFQ Q + V YD+ R+GF ++ C
Sbjct: 437 MNVGGNNDDGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 482
>gi|220702733|gb|ACL81165.1| aspartyl protease [Mirabilis jalapa]
Length = 499
Score = 206 bits (523), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 142/413 (34%), Positives = 202/413 (48%), Gaps = 50/413 (12%)
Query: 88 GYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLS-SSSRLLG 146
G +L+F Q + +DTGS +VWFPC+ ++C C P + L+ S S L+
Sbjct: 91 GTDYTLTFSINSQTLSVYMDTGSDIVWFPCSP-FECILCEGKFEPGTLTPLNVSKSSLIS 149
Query: 147 CQNPKCSWIHH-----ESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETL 201
C++ CS H+ + C + + TS CPS+ YG G +L L
Sbjct: 150 CKSRACSTAHNSPSTSDLCAIAKCPLDEIETSDCSNYHCPSFYYAYGDG-----SLIAKL 204
Query: 202 NLPNRIIP----------NFLVGCSVLSSRQPAGIAGFGRGKTSLPSQL-NL-----DKF 245
+ N I+P +F GC+ + +P G+AGFG G SLP+QL NL ++F
Sbjct: 205 HKHNLIMPSTSNKPFSLKDFTFGCAHSALGEPIGVAGFGFGSLSLPAQLANLSPDLGNQF 264
Query: 246 SYCLLSHKFDDTT--RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVG 303
SYCL+SH FD T S LIL + T YTP ++NP +Y V
Sbjct: 265 SYCLVSHSFDSTKLHHPSPLILGKVKERDFDEITQFVYTPMLDNPK------HPYFYSVS 318
Query: 304 LRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRN 363
+ I+VG RVR + + +DRDGNGG +VDSGTT+T + + +A E ++ R
Sbjct: 319 MEAISVGSSRVRAPNALIRIDRDGNGGVVVDSGTTYTMLPTGFYNSVATELDRRV--GRV 376
Query: 364 YTRALGAEALTGLRPCFDVPG---EKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGS- 418
+ RA E+ TGL PC+ + G E+ G P L HF G V LP NYF +G
Sbjct: 377 FKRASETESKTGLSPCYYLEGNGVERLGLVVPRLAFHFGGNYSVVLPRRNYFYEFLDGED 436
Query: 419 ------AVCLTVVT-DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
CL ++ E+ GGP LGN+Q Q + V YDL +R+GF + C
Sbjct: 437 EKKGRKVGCLMLMDGGDESEGGPGATLGNYQQQGFQVVYDLEERRVGFAPRKC 489
>gi|357482031|ref|XP_003611301.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355512636|gb|AES94259.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 481
Score = 204 bits (518), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 140/406 (34%), Positives = 205/406 (50%), Gaps = 41/406 (10%)
Query: 89 YSISLSFGT-PPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGC 147
Y++S + G+ PPQ+I +DTGS LVWFPC+ ++C C + ++ + + C
Sbjct: 75 YTLSFNLGSNPPQLITLYMDTGSDLVWFPCSP-FECILCEGKPQTTKPANITKQTHSVSC 133
Query: 148 QNPKCSWIHHESI-----QCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLN 202
Q+P CS H C + + TS + CP + YG G +TL+
Sbjct: 134 QSPACSAAHASMSSSNLCAISRCPLDYIETSDCSSFSCPPFYYAYGDGSFVANLYQQTLS 193
Query: 203 LPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNL------DKFSYCLLSHKFDD 256
L + + NF GC+ + +P G+AGFGRG SLP+QL+ ++FSYCL+SH FD
Sbjct: 194 LSSLHLQNFTFGCAHTALAEPTGVAGFGRGILSLPAQLSTLSPHLGNRFSYCLVSHSFDG 253
Query: 257 T--TRTSSLIL----DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
R S LIL D + D ++ YT ++NP YY VGL I+VG
Sbjct: 254 DRLRRPSPLILGRHNDTITGAGDGESVEFVYTSMLSNPK------HPYYYCVGLAGISVG 307
Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
+ V +D GNGG +VDSGTTFT + + + +EF ++ NR + RA
Sbjct: 308 KRTVPAPEILKRVDEKGNGGMVVDSGTTFTMLPESFYNAVVNEFDKRV--NRFHKRASEI 365
Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKG-GAEVTLPVENYF--------AVVGEGSAVC 421
E TGL PC+ + G P LKLHF G ++V LP +NYF + +G C
Sbjct: 366 ETKTGLGPCYYLNG--LSQIPVLKLHFVGNNSDVVLPRKNYFYEFMDGGDGIRRKGKVGC 423
Query: 422 LTVVT---DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ ++ + E GGP LGN+Q Q + V YDL +R+GF ++ C
Sbjct: 424 MMLMNGEDETELDGGPGATLGNYQQQGFEVVYDLEKERVGFAKKEC 469
>gi|255576064|ref|XP_002528927.1| pepsin A, putative [Ricinus communis]
gi|223531629|gb|EEF33456.1| pepsin A, putative [Ricinus communis]
Length = 493
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 157/495 (31%), Positives = 241/495 (48%), Gaps = 60/495 (12%)
Query: 10 LSFIFFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTR-ALHIKNPQTK 68
L FI F+ +S+ S I L + S +T + + + L+ S+ +R A ++ K
Sbjct: 9 LCFILCFSCISVSISEILYLPLTHSLSNTQ-----FTSTHHLLKSTSSRSASRFQHQHQK 63
Query: 69 TTTTTTTTTTTNISSHSYGGYSISLSFGT-PPQIIPFILDTGSHLVWFPCTNHYQCKYC- 126
+ +S S Y++S + + PPQ + LDTGS LVWFPC ++C C
Sbjct: 64 RHLRNRHQVSLPLSPGS--DYTLSFTLNSNPPQHVSLYLDTGSDLVWFPCKP-FECILCE 120
Query: 127 -----SSSKIPSFIPKLSSSSRLLGCQNPKCSWIHH-----ESIQCRDCNDEPLATSKNC 176
+++ P P+LSS++R + C++ CS H + DC E + TS
Sbjct: 121 GKAENTTASTPP--PRLSSTARSVHCKSSACSAAHSNLPTSDLCAIADCPLESIETSDCH 178
Query: 177 TQICPSYLVLYGSGLTEGIALSETLNLP----NRIIPNFLVGCSVLSSRQPAGIAGFGRG 232
+ CPS+ YG G +++ LP + + NF GC+ + +P G+AGFGRG
Sbjct: 179 SFSCPSFYYAYGDGSLVARLYHDSIKLPLATPSLSLHNFTFGCAHTALAEPVGVAGFGRG 238
Query: 233 KTSLPSQLNL------DKFSYCLLSHKF--DDTTRTSSLIL---DNGSSHSDKKTTGLTY 281
SLP+QL ++FSYCL+SH F D S LIL D+ +K Y
Sbjct: 239 VLSLPAQLASFAPQLGNRFSYCLVSHSFNSDRLRLPSPLILGHSDDKEKRVNKDDVQFVY 298
Query: 282 TPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTF 341
T ++NP +Y VGL I++G +++ +DR+G+GG +VDSGTTFT
Sbjct: 299 TSMLDNPK------HPYFYCVGLEGISIGKKKIPAPEFLKRVDREGSGGVVVDSGTTFTM 352
Query: 342 MAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGG- 400
+ L+ + EF +++ R Y RA E TGL PC+ + + P L LHF G
Sbjct: 353 LPASLYNSVVAEFDNRV--GRVYERAKEVEDKTGLGPCYYY--DTVVNIPSLVLHFVGNE 408
Query: 401 AEVTLPVENYF--------AVVGEGSAVCLTVVT---DREASGGPSIILGNFQMQNYYVE 449
+ V LP +NYF V + CL ++ + E +GGP LGN+Q + V
Sbjct: 409 SSVVLPKKNYFYDFLDGGDGVRRKRRVGCLMLMNGGEEAELTGGPGATLGNYQQHGFEVV 468
Query: 450 YDLRNQRLGFKQQLC 464
YDL +R+GF ++ C
Sbjct: 469 YDLEQRRVGFARRKC 483
>gi|15242307|ref|NP_199325.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|9758987|dbj|BAB09497.1| chloroplast nucleoid DNA-binding protein-like [Arabidopsis
thaliana]
gi|332007824|gb|AED95207.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 491
Score = 201 bits (510), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 146/407 (35%), Positives = 192/407 (47%), Gaps = 40/407 (9%)
Query: 88 GYSISLSFGTPPQIIPFILDTGSHLVWFPCTN-HYQCKYC-----SSSKIPS-FIPKLSS 140
GY I+L+ GTPPQ + LDTGS L W PC N + C C + K PS F P SS
Sbjct: 82 GYLITLNIGTPPQAVQVYLDTGSDLTWVPCGNLSFDCIECYDLKNNDLKSPSVFSPLHSS 141
Query: 141 SSRLLGCQNPKCSWIHHESIQCRDCNDEPLATS----KNCTQICPSYLVLYGSG-LTEGI 195
+S C + C IH C + S C + CPS+ YG G L GI
Sbjct: 142 TSFRDSCASSFCVEIHSSDNPFDPCAVAGCSVSMLLKSTCVRPCPSFAYTYGEGGLISGI 201
Query: 196 ALSETLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLN-LDK-FSYCLLSHK 253
+ L R +P F GC + R+P GIAGFGRG SLPSQL L+K FS+C L K
Sbjct: 202 LTRDILKARTRDVPRFSFGCVTSTYREPIGIAGFGRGLLSLPSQLGFLEKGFSHCFLPFK 261
Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ- 312
F + SS ++ S+ S T L +TP +N P + YY+GL IT+G
Sbjct: 262 FVNNPNISSPLILGASALSINLTDSLQFTPMLNTP------MYPNSYYIGLESITIGTNI 315
Query: 313 -RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
+V D GNGG +VDSGTT+T L EP + ++ + Y RA E
Sbjct: 316 TPTQVPLTLRQFDSQGNGGMLVDSGTTYT----HLPEPFYSQLLTTLQSTITYPRATETE 371
Query: 372 ALTGLRPCFDVP----------GEKTGSFPELKLHFKGGAEVTLPVENYFAVV---GEGS 418
+ TG C+ VP + FP + HF A + LP N F + +GS
Sbjct: 372 SRTGFDLCYKVPCPNNNLTSLENDVMMIFPSITFHFLNNATLLLPQGNSFYAMSAPSDGS 431
Query: 419 AV-CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
V CL + GP+ + G+FQ QN V YDL +R+GF+ C
Sbjct: 432 VVQCLLFQNMEDGDYGPAGVFGSFQQQNVKVVYDLEKERIGFQAMDC 478
>gi|356513737|ref|XP_003525567.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Glycine
max]
Length = 455
Score = 197 bits (501), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 140/409 (34%), Positives = 199/409 (48%), Gaps = 46/409 (11%)
Query: 89 YSISLSFGTPPQIIPFIL--DTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSS-SRLL 145
Y++S + G Q P L DTGS LVWFPC ++C C P+ P ++++ S +
Sbjct: 48 YTLSFNLGPRAQAQPITLYMDTGSDLVWFPCA-PFKCILCEGK--PNASPPVNTTRSVAV 104
Query: 146 GCQNPKCSWIHH-----ESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSET 200
C++P CS H+ + C E + TS CP + YG G +T
Sbjct: 105 SCKSPACSAAHNLASPSDLCAAARCPLESIETSDCANFKCPPFYYAYGDGSLIARLYRDT 164
Query: 201 LNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNL------DKFSYCLLSHKF 254
L+L + + NF GC+ + +P G+AGFGRG SLP+QL ++FSYCL+SH F
Sbjct: 165 LSLSSLFLRNFTFGCAYTTLAEPTGVAGFGRGLLSLPAQLATLSPQLGNRFSYCLVSHSF 224
Query: 255 DD--TTRTSSLILDNGSSHSDKKTTG-----LTYTPFVNNPSVAERNAFSVYYYVGLRRI 307
D + S LIL +++ G YTP + NP +Y VGL I
Sbjct: 225 DSERVRKPSPLILGRYEEEEEEEKVGGGVAEFVYTPMLENPK------HPYFYTVGLIGI 278
Query: 308 TVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRA 367
+VG + V ++ G+GG +VDSGTTFT + + + DEF + R RA
Sbjct: 279 SVGKRIVPAPEMLRRVNNRGDGGVVVDSGTTFTMLPAGFYNSVVDEFDRGV--GRVNERA 336
Query: 368 LGAEALTGLRPCFDVPGEKTGSFPELKLHFKGG-AEVTLPVENYF--------AVVGEGS 418
E TGL PC+ + P L L F GG + V LP +NYF A G+
Sbjct: 337 RKIEEKTGLAPCYYL--NSVAEVPVLTLRFAGGNSSVVLPRKNYFYEFLDGRDAAKGKRR 394
Query: 419 AVCLTVVT---DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
CL ++ + E SGGP LGN+Q Q + VEYDL +R+GF ++ C
Sbjct: 395 VGCLMLMNGGDEAELSGGPGATLGNYQQQGFEVEYDLEEKRVGFARRQC 443
>gi|356563324|ref|XP_003549914.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 480
Score = 194 bits (494), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 137/410 (33%), Positives = 195/410 (47%), Gaps = 45/410 (10%)
Query: 89 YSISLSFGTPPQIIPFIL--DTGSHLVWFPCTNHYQCKYCSSS-KIPSFIPKLS-SSSRL 144
Y++S + G Q P L DTGS LVWFPC ++C C P+ P + + S
Sbjct: 70 YTLSFNLGPQAQAQPITLYMDTGSDLVWFPCA-PFKCILCEGKPNEPNASPPTNITQSVA 128
Query: 145 LGCQNPKCSWIHH-----ESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSE 199
+ C++P CS H+ + C E + TS CP + YG G +
Sbjct: 129 VSCKSPACSAAHNLAPPSDLCAAARCPLESIETSDCANFKCPPFYYAYGDGSLIARLYRD 188
Query: 200 TLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNL------DKFSYCLLSHK 253
TL+L + + NF GC+ + +P G+AGFGRG SLP+QL ++FSYCL+SH
Sbjct: 189 TLSLSSLFLRNFTFGCAHTTLAEPTGVAGFGRGLLSLPAQLATLSPQLGNRFSYCLVSHS 248
Query: 254 FDD--TTRTSSLILDNGSSHSDKKTTG----LTYTPFVNNPSVAERNAFSVYYYVGLRRI 307
FD + S LIL +K G YT + NP +Y V L I
Sbjct: 249 FDSERVRKPSPLILGRYEEKEKEKIGGGVAEFVYTSMLENPK------HPYFYTVSLIGI 302
Query: 308 TVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRA 367
VG + + ++ G+GG +VDSGTTFT + + + DEF ++ R+ RA
Sbjct: 303 AVGKRTIPAPEMLRRVNNRGDGGVVVDSGTTFTMLPAGFYNSVVDEFDRRV--GRDNKRA 360
Query: 368 LGAEALTGLRPCFDVPGEKTGSFPELKLHFKGG--AEVTLPVENYFAVVGEGS------- 418
E TGL PC+ + P L L F GG + V LP +NYF +GS
Sbjct: 361 RKIEEKTGLAPCYYL--NSVADVPALTLRFAGGKNSSVVLPRKNYFYEFSDGSDGAKGKR 418
Query: 419 -AVCLTVVT---DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
CL ++ + + SGGP LGN+Q Q + VEYDL +R+GF ++ C
Sbjct: 419 KVGCLMLMNGGDEADLSGGPGATLGNYQQQGFEVEYDLEEKRVGFARRQC 468
>gi|297794789|ref|XP_002865279.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
lyrata]
gi|297311114|gb|EFH41538.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
lyrata]
Length = 419
Score = 194 bits (494), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 143/408 (35%), Positives = 193/408 (47%), Gaps = 42/408 (10%)
Query: 88 GYSISLSFGTPPQIIPFILDTGSHLVWFPCTN-HYQCKYCSSSKIPS------FIPKLSS 140
GY I+L+ GTPPQ + +DTGS L W PC N + C C+ K + F P SS
Sbjct: 10 GYLITLNIGTPPQAVQVYMDTGSDLTWVPCGNLSFDCIDCNDLKSNNLKSSSIFSPLHSS 69
Query: 141 SSRLLGCQNPKCSWIHHESIQCRDCNDEPLATS----KNCTQICPSYLVLYGSG-LTEGI 195
SS C + C+ IH C + S C + CPS+ YG G L GI
Sbjct: 70 SSFRASCASSFCAEIHSSDNPFDPCAIAGCSVSMLLKSTCIRPCPSFAYTYGEGGLVSGI 129
Query: 196 ALSETLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLN-LDK-FSYCLLSHK 253
+ L R +P F GC + +P GIAGFGRG SLPSQL L+K FS+C L K
Sbjct: 130 LTRDILKARTRDVPRFSFGCVTSTYHEPIGIAGFGRGLLSLPSQLGFLEKGFSHCFLPFK 189
Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
F + SS ++ S+ S T L +TP +N P + YY+GL IT+ G
Sbjct: 190 FVNNPNISSPLILGASALSINLTDSLQFTPMLNTP------VYPNSYYIGLESITI-GTN 242
Query: 314 VRVWHKYLTL---DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
+ LTL D GNGG +VDSGTT+T L P + ++ + Y RA
Sbjct: 243 ITPTQVPLTLRQFDSQGNGGMLVDSGTTYT----HLPNPFYSQLLTILQSTITYPRATET 298
Query: 371 EALTGLRPCFDVP----------GEKTGSFPELKLHFKGGAEVTLPVENYFAVV---GEG 417
E+ TG C+ VP + FP + +F A + LP N F + +G
Sbjct: 299 ESRTGFDLCYKVPCPNNNLTSLENDVMMVFPSITFNFLNNATLLLPQGNSFYAMSAPSDG 358
Query: 418 SAV-CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
S V CL + + GP+ + G+FQ QN V YDL +R+GF+ C
Sbjct: 359 SVVQCLLFQNMEDGNYGPAGVFGSFQQQNVKVVYDLEKERIGFQAMDC 406
>gi|224074147|ref|XP_002304273.1| predicted protein [Populus trichocarpa]
gi|222841705|gb|EEE79252.1| predicted protein [Populus trichocarpa]
Length = 496
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 141/418 (33%), Positives = 205/418 (49%), Gaps = 51/418 (12%)
Query: 88 GYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYC-----SSSKIPSFIPKLSSSS 142
G +LSF Q I LDTGS LVWFPC ++C C ++S + PKLS ++
Sbjct: 79 GSDYTLSFTINSQPISLYLDTGSDLVWFPC-QPFECILCEGKAENASLASTPPPKLSKTA 137
Query: 143 RLLGCQNPKCSWIHH-----ESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIAL 197
+ C++ CS +H + +C E + S CP + YG G
Sbjct: 138 TPVSCKSSACSAVHSNLPSSDLCAISNCPLESIEISDCRKHSCPQFYYAYGDGSLIARLY 197
Query: 198 SETLNLP-----NRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNL------DKFS 246
+++ LP N I NF GC+ + +P G+AGFGRG SLP+QL ++FS
Sbjct: 198 RDSIRLPLSNQTNLIFNNFTFGCAHTTLAEPIGVAGFGRGVLSLPAQLATLSPQLGNQFS 257
Query: 247 YCLLSHKFD-DTTRTSSLILDNGSSHSDK-------KTTGLTYTPFVNNPSVAERNAFSV 298
YCL+SH FD D R S ++ H +K K YT ++NP R+ +
Sbjct: 258 YCLVSHSFDSDRVRRPSPLILGRYDHDEKERRVNGVKKPSFVYTSMLDNP----RHPY-- 311
Query: 299 YYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM 358
+Y VGL I++G +++ +DR G+GG +VDSGTTFT + L++ + EF +++
Sbjct: 312 FYCVGLEGISIGRKKIPAPDFLRKVDRKGSGGVVVDSGTTFTMLPASLYDFVVAEFENRV 371
Query: 359 VKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKG-GAEVTLPVENYF------ 411
R RA E TGL PC+ + P + LHF G G+ V LP NYF
Sbjct: 372 --GRVNERASVIEENTGLSPCYYF-DNNVVNVPRVVLHFVGNGSSVVLPRRNYFYEFLDG 428
Query: 412 --AVVGEGSAVCLTVVT---DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ CL ++ + E SGGP LGN+Q Q + V YDL N+R+GF ++ C
Sbjct: 429 GHGKGKKRKVGCLMLMNGGDEAELSGGPGATLGNYQQQGFEVVYDLENRRVGFARRQC 486
>gi|224136436|ref|XP_002322329.1| predicted protein [Populus trichocarpa]
gi|222869325|gb|EEF06456.1| predicted protein [Populus trichocarpa]
Length = 486
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 161/500 (32%), Positives = 229/500 (45%), Gaps = 62/500 (12%)
Query: 1 MASYISALCLSFIFFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRAL 60
MA ++ +F+FF + S+ SI SL S +N NSL+ LT A
Sbjct: 1 MAIALNKNITTFLFFLLVNSLVSYSIQSLA-------------SPRNPNSLILG-LTLAS 46
Query: 61 HIKNPQ-TKTTTTTTTTTTTNI------SSHSYGGYSISLSFGTPPQIIPFILDTGSHLV 113
P K +T++ + ++ S GY ISL+ GTPPQ+I ++DTGS L
Sbjct: 47 RASFPTYPKASTSSRKIVSIDVLGAKKPSREVRDGYLISLNIGTPPQVIQVLMDTGSDLT 106
Query: 114 WFPCTN-HYQCKYCSSSK----IPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDE 168
W PC N + C C + + +F P SSSS C +P C IH C
Sbjct: 107 WVPCGNLSFDCMECDDYRNNKLMATFSPSYSSSSYRASCASPFCIDIHSSDNPLDTCTVA 166
Query: 169 PLATS----KNCTQICPSYLVLYGS-GLTEGIALSETLNLPN------RIIPNFLVGCSV 217
+ S C++ CPS+ YG+ G+ GI +TL + + IP F GC
Sbjct: 167 GCSLSTLVKATCSRPCPSFAYTYGAGGVVTGILTRDTLRVNGSSPGVAKEIPKFCFGCVG 226
Query: 218 LSSRQPAGIAGFGRGKTSLPSQLNL--DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKK 275
+ R+P GIAGFGRG S+ SQL FS+C L+ K+ + SS ++ + + K
Sbjct: 227 SAYREPIGIAGFGRGTLSMVSQLGFLQKGFSHCFLAFKYANNPNISSPLVVGDIALTSKD 286
Query: 276 TTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG-QRVRVWHKYLTLDRDGNGGTIVD 334
+ +TP +N+P + +YYVGL ITVG V D GNGG +D
Sbjct: 287 D--MQFTPMLNSP------MYPNFYYVGLEAITVGNVSATEVPSSLREFDSLGNGGMKID 338
Query: 335 SGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGS----- 389
SGTT+T L EP + +S + NY R G E TG C+ VP +
Sbjct: 339 SGTTYT----HLPEPFYSQVLSILQSTINYPRDTGMEMQTGFDLCYKVPRPNNNTLTSDD 394
Query: 390 -FPELKLHFKGGAEVTLPVENYFAVV---GEGSAV-CLTVVTDREASGGPSIILGNFQMQ 444
P + HF + LP N+F V G + V CL + + GP+ + G+FQ Q
Sbjct: 395 LLPSITFHFLNNVSLVLPQGNHFYPVSAPGNPAVVKCLMFQSTDDGDDGPAGVFGSFQQQ 454
Query: 445 NYYVEYDLRNQRLGFKQQLC 464
N V YDL +R+GF+ C
Sbjct: 455 NVEVVYDLEKERIGFQPMDC 474
>gi|226495677|ref|NP_001146995.1| pepsin A precursor [Zea mays]
gi|195606284|gb|ACG24972.1| pepsin A [Zea mays]
Length = 504
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 144/414 (34%), Positives = 192/414 (46%), Gaps = 47/414 (11%)
Query: 89 YSISLSFGTPPQIIP--FILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSS--SSRL 144
Y++SLS G P LDTGS LVWFPC + C C P L SR
Sbjct: 90 YTLSLSVGPASAAAPVSLFLDTGSDLVWFPCAP-FTCMLCEGKPTPGRSGPLPPPPDSRR 148
Query: 145 LGCQNPKCSWIHHES-----IQCRDCNDEPLAT-SKNCTQICPSYLVLYGSG-----LTE 193
+ C +P CS H + C E + T S + CP YG G L
Sbjct: 149 IPCASPLCSAAHASAPPSDLCAAARCPLEDIETGSCGASHACPPLYYAYGDGSLVAHLRR 208
Query: 194 G-IALSETLNLPNRI-IPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLD---KFSYC 248
G +AL + + NF C+ + +P G+AGFGRG SLP QL+ +FSYC
Sbjct: 209 GRVALGAGARASVAVAVDNFTFACAHTALGEPVGVAGFGRGPLSLPGQLSPQLSGRFSYC 268
Query: 249 LLSHKF--DDTTRTSSLILDNG---SSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVG 303
L+SH F D R S LIL + + +T G YTP ++NP +Y V
Sbjct: 269 LVSHSFRADRLIRPSPLILGRSPDDADAAAAETDGFVYTPLLHNPK------HPYFYSVA 322
Query: 304 LRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRN 363
L ++VG R++ + +DR GNGG +VDSGTTFT + E++ +A+ F M
Sbjct: 323 LEAVSVGAARIQARPELARVDRAGNGGMVVDSGTTFTMLPNEMYARVAEAFARAMAAAGF 382
Query: 364 YTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYF---------AVV 414
AE TGL PC+ G P L LHF+G A V LP NYF A
Sbjct: 383 ARAER-AEEQTGLTPCYRYAASDRG-VPPLALHFRGNATVALPRRNYFMGFKSEDAGAGT 440
Query: 415 GEGSAVCLTVVTDREASG----GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ CL ++ +ASG GP+ LGNFQ Q + V YD+ R+GF ++ C
Sbjct: 441 RKDDVGCLMLMNGGDASGEEGDGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 494
>gi|357164972|ref|XP_003580227.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 492
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 138/409 (33%), Positives = 190/409 (46%), Gaps = 42/409 (10%)
Query: 89 YSISLSFG--TPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIP-------SFIPKLS 139
Y++SLS G + + LDTGS LVWFPC + C C P + +P
Sbjct: 83 YTLSLSVGPLSTANPVSLFLDTGSDLVWFPCAP-FTCMLCEGKPTPPGNNNSSNPLPP-P 140
Query: 140 SSSRLLGCQNPKCSWIHHESIQCRDCN------DEPLATSKNCTQICPSYLVLYGSG-LT 192
+ SR + C +P CS H + C D+ S + CP YG G L
Sbjct: 141 TDSRRIPCASPFCSAAHSSAPPADLCAAARCPLDDIETGSCAASHACPPLYYAYGDGSLV 200
Query: 193 EGIALSETLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNL----DKFSYC 248
+ + + NF C+ + +P G+AGFGRG SLP+QL +FSYC
Sbjct: 201 ARLRRGRVGIAASVAVENFTFACAHTALGEPVGVAGFGRGPLSLPAQLAPAALSGRFSYC 260
Query: 249 LLSHKF--DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
L++H F D R S LIL TG+ YTP ++NP +Y V L
Sbjct: 261 LVAHSFRADRPIRPSPLILGRSPGEDPASETGIVYTPLLHNPK------HPYFYSVALEA 314
Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
++VGG R+ + + R G+GG +VDSGTTFT + E + +A+EF M R
Sbjct: 315 VSVGGTRIPARPELGRVGRAGDGGMVVDSGTTFTMLPNETYARVAEEFGRAMAAARFERA 374
Query: 367 ALGAEALTGLRPCF----DVPGEKTGS---FPELKLHFKGGAEVTLPVENYFAVV--GEG 417
+ TGL PC+ D + GS P L +HF+G A V LP NYF E
Sbjct: 375 EAAEDQ-TGLAPCYYYDHDASAAEEGSARAVPPLAMHFRGEATVVLPRRNYFMGFRSEER 433
Query: 418 SAV-CLTVVTDREAS-GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
V CL ++ E GGP+ LGNFQ Q + V YD+ R+GF ++ C
Sbjct: 434 RRVGCLMLMNGGEDDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 482
>gi|224101053|ref|XP_002334311.1| predicted protein [Populus trichocarpa]
gi|222871031|gb|EEF08162.1| predicted protein [Populus trichocarpa]
Length = 496
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 157/498 (31%), Positives = 232/498 (46%), Gaps = 51/498 (10%)
Query: 3 SYISALCLSFIFFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHI 62
SY LC S F +S + LT SLS+ + ++ ++ + R H
Sbjct: 4 SYSLLLCFSLCFSHFFISTSQTLFLPLTHSLSKTQFTSTHHLIKSTSTSSITRFRRHHHQ 63
Query: 63 KNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQ 122
KN T +S G +LSF Q I LDTGS LVWFPC ++
Sbjct: 64 KN----------THNHRQVSLPLSPGSDYTLSFTLDSQPIFLYLDTGSDLVWFPC-QPFE 112
Query: 123 CKYC-----SSSKIPSFIPKLSSSSRLLGCQNPKCSWIHH-----ESIQCRDCNDEPLAT 172
C C ++S + PKLS ++ + C++ CS H + +C E + T
Sbjct: 113 CILCEGKAENTSLASTPPPKLSKTATPVSCKSSACSAAHSNLPSSDLCAISNCPLESIET 172
Query: 173 SKNCTQICPSYLVLYGSGLTEGIALSETLNLP-----NRIIPNFLVGCSVLSSRQPAGIA 227
S CP + YG G ++++LP N I+ NF GC+ + +P G+A
Sbjct: 173 SDCQKHSCPQFYYAYGDGSLIARLYRDSISLPLSNPTNLIVNNFTFGCAHTALAEPIGVA 232
Query: 228 GFGRGKTSLPSQLNL------DKFSYCLLSHKFD-DTTRTSSLILDNGSSHSDK--KTTG 278
GFGRG SLP+QL ++FSYCL+SH FD D R S ++ H +K + G
Sbjct: 233 GFGRGVLSLPAQLATLSPQLGNQFSYCLVSHSFDSDRLRRPSPLILGRYDHDEKERRVNG 292
Query: 279 LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTT 338
+ FV S+ + +Y VGL I++G +++ +D +G+GG +VDSGTT
Sbjct: 293 VNKPRFVYT-SMLDNLEHPYFYCVGLEGISIGRKKIPAPGFLRKVDGEGSGGLVVDSGTT 351
Query: 339 FTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFK 398
FT + L+ + EF +++ R RA E TGL PC+ + P + LHF
Sbjct: 352 FTMLPASLYGSVVAEFENRV--GRVNERARVIEEDTGLSPCYYF-DNNVVNVPSVVLHFV 408
Query: 399 G-GAEVTLPVENYF--------AVVGEGSAVCLTVVT---DREASGGPSIILGNFQMQNY 446
G G+ V LP NYF + CL ++ + E SGGP LGN+Q Q +
Sbjct: 409 GNGSSVVLPRRNYFYEFLDGGDGKGKKRKVGCLMLMNGGDEAELSGGPGATLGNYQQQGF 468
Query: 447 YVEYDLRNQRLGFKQQLC 464
V YDL N+R+GF ++ C
Sbjct: 469 EVVYDLENKRVGFARRQC 486
>gi|224138580|ref|XP_002326638.1| predicted protein [Populus trichocarpa]
gi|222833960|gb|EEE72437.1| predicted protein [Populus trichocarpa]
Length = 496
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 157/498 (31%), Positives = 232/498 (46%), Gaps = 51/498 (10%)
Query: 3 SYISALCLSFIFFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHI 62
SY LC S F +S + LT SLS+ + ++ ++ + R H
Sbjct: 4 SYSLLLCFSLCFSHFFISTSQTLFLPLTHSLSKTQFTSTHHLIKSTSTSSITRFRRHHHQ 63
Query: 63 KNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQ 122
KN T +S G +LSF Q I LDTGS LVWFPC ++
Sbjct: 64 KN----------THNHRQVSLPLSPGSDYTLSFTLDSQPIFLYLDTGSDLVWFPC-QPFE 112
Query: 123 CKYC-----SSSKIPSFIPKLSSSSRLLGCQNPKCSWIHH-----ESIQCRDCNDEPLAT 172
C C ++S + PKLS ++ + C++ CS H + +C E + T
Sbjct: 113 CILCEGKAENTSLASTPPPKLSKTATPVSCKSSACSAAHSNLPSSDLCAISNCPLESIET 172
Query: 173 SKNCTQICPSYLVLYGSGLTEGIALSETLNLP-----NRIIPNFLVGCSVLSSRQPAGIA 227
S CP + YG G ++++LP N I+ NF GC+ + +P G+A
Sbjct: 173 SDCQKHSCPQFYYAYGDGSLIARLYRDSISLPLSNPTNLIVNNFTFGCAHTALAEPIGVA 232
Query: 228 GFGRGKTSLPSQLNL------DKFSYCLLSHKFD-DTTRTSSLILDNGSSHSDK--KTTG 278
GFGRG SLP+QL ++FSYCL+SH FD D R S ++ H +K + G
Sbjct: 233 GFGRGVLSLPAQLATLSPQLGNQFSYCLVSHSFDSDRLRRPSPLILGRYDHDEKERRVNG 292
Query: 279 LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTT 338
+ FV S+ + +Y VGL I++G +++ +D +G+GG +VDSGTT
Sbjct: 293 VNKPRFVYT-SMLDNLEHPYFYCVGLEGISIGRKKIPAPGFLRKVDGEGSGGLVVDSGTT 351
Query: 339 FTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFK 398
FT + L+ + EF +++ R RA E TGL PC+ + P + LHF
Sbjct: 352 FTMLPASLYGSVVAEFENRV--GRVNERARVIEEDTGLSPCYYF-DNNVVNVPSVVLHFV 408
Query: 399 G-GAEVTLPVENYF--------AVVGEGSAVCLTVVT---DREASGGPSIILGNFQMQNY 446
G G+ V LP NYF + CL ++ + E SGGP LGN+Q Q +
Sbjct: 409 GNGSSVVLPRRNYFYEFLDGGDGKGKKRKVGCLMLMNGGEEAELSGGPGATLGNYQQQGF 468
Query: 447 YVEYDLRNQRLGFKQQLC 464
V YDL N+R+GF ++ C
Sbjct: 469 EVVYDLENKRVGFARRQC 486
>gi|222629275|gb|EEE61407.1| hypothetical protein OsJ_15596 [Oryza sativa Japonica Group]
Length = 466
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 136/399 (34%), Positives = 191/399 (47%), Gaps = 53/399 (13%)
Query: 89 YSISLSFGTPPQI--IPFILDTGSHLVWFPCTNHYQCKYCSSSKIP-----SFIPKLSSS 141
Y++SLS G P + LDTGS LVWFPC + C C P S +P
Sbjct: 88 YTLSLSVGPPSTASSVSLFLDTGSDLVWFPCAP-FTCMLCEGKATPGGNHSSPLPP-PID 145
Query: 142 SRLLGCQNPKCSWIHHES-----IQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGI 195
SR + C +P CS H + C + + T + CP YG G L +
Sbjct: 146 SRRISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLVANL 205
Query: 196 ALSETLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFD 255
+ + NF C+ + +P G+AGFGRG SLP+QL +
Sbjct: 206 RRGRVGLAASMAVENFTFACAHTALAEPVGVAGFGRGPLSLPAQL----------APSLS 255
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
+T +++ G+S +D YTP ++NP +Y V L ++VGG+R++
Sbjct: 256 GSTDAAAI----GASETD-----FVYTPLLHNPK------HPYFYSVALEAVSVGGKRIQ 300
Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
+ +DRDGNGG +VDSGTTFT + + F +ADEF M R GAEA TG
Sbjct: 301 AQPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAE-GAEAQTG 359
Query: 376 LRPCFDV-PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVG--EGSAV-CLTVVT----- 426
L PC+ P ++ + P + LHF+G A V LP NYF EG +V CL ++
Sbjct: 360 LAPCYHYSPSDR--AVPPVALHFRGNATVALPRRNYFMGFKSEEGRSVGCLMLMNVGGNN 417
Query: 427 -DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
D E GGP+ LGNFQ Q + V YD+ R+GF ++ C
Sbjct: 418 DDGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 456
>gi|242076594|ref|XP_002448233.1| hypothetical protein SORBIDRAFT_06g023740 [Sorghum bicolor]
gi|241939416|gb|EES12561.1| hypothetical protein SORBIDRAFT_06g023740 [Sorghum bicolor]
Length = 508
Score = 188 bits (478), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 141/415 (33%), Positives = 187/415 (45%), Gaps = 49/415 (11%)
Query: 89 YSISLSFGTPPQIIP--FILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSS------ 140
Y++SLS G P LDTGS LVWFPC + C C PS S+
Sbjct: 94 YTLSLSVGPASAAAPVSLFLDTGSDLVWFPCAP-FTCMLCEGKPTPSGGHSSSAPLPLPP 152
Query: 141 --SSRLLGCQNPKCSWIHHESIQCRDC-------NDEPLATSKNCTQICPSYLVLYGSGL 191
SR + C +P CS H + C D + + + CP YG G
Sbjct: 153 PPDSRRVPCASPLCSAAHASAPPSDLCAAAGCPLEDIETGSCRGASHACPPLYYAYGDGS 212
Query: 192 TEGIALSETLNLPNRI-IPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLD---KFSY 247
+ L + + NF C+ + +P G+AGFGRG SLP QL +FSY
Sbjct: 213 LVAHLRRGRVGLGASVAVDNFTFACAHTALGEPVGVAGFGRGPLSLPGQLAPQLSGRFSY 272
Query: 248 CLLSHKF--DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLR 305
CL+SH F D R S LIL S + +T G YTP ++NP +Y V L
Sbjct: 273 CLVSHSFRADRLIRPSPLILGR-SPDAAAETGGFVYTPLLHNPK------HPYFYSVALE 325
Query: 306 RITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYT 365
++VG R++ + +DR GNGG +VDSGTTFT + E + +A+ F M
Sbjct: 326 AVSVGATRIQARPELARVDRAGNGGMVVDSGTTFTMLPNETYARVAEAFARAMAAAGFAR 385
Query: 366 RALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYF-------AVVGEG- 417
AE TGL PC+ G P L LHF+G A V LP NYF G G
Sbjct: 386 AER-AEEQTGLTPCYHYAASDRG-VPPLALHFRGNATVALPRRNYFMGFKSEEEAGGAGR 443
Query: 418 --SAVCLTVVTDREASG------GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
CL ++ + SG GP+ LGNFQ Q + V YD+ R+GF ++ C
Sbjct: 444 KDDVGCLMLMNGGDVSGEDGGDDGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 498
>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 468
Score = 188 bits (477), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 132/386 (34%), Positives = 180/386 (46%), Gaps = 44/386 (11%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G + + +S GTP I+DTGS LVW C C C + P F P SS+ L
Sbjct: 116 GEFLMDMSIGTPALAYAAIVDTGSDLVWTQCK---PCVECFNQSTPVFDPSSSSTYSTLP 172
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPN 205
C + CS D P +T + + C Y YG + T+G+ +ET L
Sbjct: 173 CSSSLCS-------------DLPTSTCTSAAKDC-GYTYTYGDASSTQGVLAAETFTLAK 218
Query: 206 RIIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
+P GC + Q AG+ G GRG SL SQL L KFSYCL S DDT+++
Sbjct: 219 TKLPGVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGLGKFSYCLTS--LDDTSKSP 276
Query: 262 SLILDNGSSHSDKKTTG-LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
L+ + +D + + TP + NPS +YYV L+ +TVG R+ +
Sbjct: 277 LLLGSLAAISTDTASAAAIQTTPLIKNPSQPS------FYYVTLKALTVGSTRIPLPGSA 330
Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
+ DG GG IVDSGT+ T++ + + PL F +QM + + GL CF
Sbjct: 331 FAVQDDGTGGVIVDSGTSITYLELQGYRPLKKAFAAQM------KLPVADGSAVGLDLCF 384
Query: 381 DVP--GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
P G P+L LHF GGA++ LP ENY + A+CLTV+ R S I+
Sbjct: 385 KAPASGVDDVEVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVMGSRGLS-----II 439
Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
GNFQ QN YD+ L F C
Sbjct: 440 GNFQQQNIQFVYDVDKDTLSFAPVQC 465
>gi|296085344|emb|CBI29076.3| unnamed protein product [Vitis vinifera]
Length = 434
Score = 187 bits (476), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 138/408 (33%), Positives = 188/408 (46%), Gaps = 45/408 (11%)
Query: 88 GYSISLSFGTPPQIIPFILDTGSHLVWFPCTN-HYQCKYCSSSKIPSFIPKLSSSSRLLG 146
GY ISL+ GTPP++I +DTGS L W PC N + C C+ + + S S
Sbjct: 28 GYLISLNLGTPPKVIQVYMDTGSDLTWVPCGNLSFDCMDCNDYRNNKLMSTYSPSYSSSS 87
Query: 147 ----CQNPKCSWIHHESIQCRDCNDEPLATSK----NCTQICPSYLVLYGS-GLTEGIAL 197
C +P CS +H C + S C + CPS+ YG+ G+ G
Sbjct: 88 LRDLCVSPLCSDVHSSDNSYDPCAVAGCSLSTLVKGTCPRPCPSFAYTYGAGGVVIGTLT 147
Query: 198 SETLNLPN------RIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNL--DKFSYCL 249
+TL R +PNF GC + R+P GIAGFGRG SLPSQL FS+C
Sbjct: 148 RDTLTTHGSSPSFTREVPNFCFGCVGSTYREPIGIAGFGRGVLSLPSQLGFLQKGFSHCF 207
Query: 250 LSHKFDDTTRTSS--LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRI 307
L KF + SS +I D S +D L +T + NP + YYY+GL I
Sbjct: 208 LGFKFANNPNISSPLVIGDLAISSNDH----LQFTSLLKNP------MYPNYYYIGLEAI 257
Query: 308 TVG-GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
TVG ++V D GNGG I+DSGTT+T L P + +S + Y R
Sbjct: 258 TVGNATAIQVPSSLREFDSHGNGGMIIDSGTTYT----HLPGPFYTQLLSMLQSIITYPR 313
Query: 367 ALGAEALTGLRPCFDVP------GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGS-- 418
A EA TG C+ +P + P + HF + LP N+F +G S
Sbjct: 314 AQEQEARTGFDLCYRIPCPNNVVTDHDHLLPSISFHFSNNVSLVLPQGNHFYAMGAPSNS 373
Query: 419 --AVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
CL + ++ GP+ + G+FQ QN V YDL +R+GF+ C
Sbjct: 374 TVVKCLLLQNMDDSDSGPAGVFGSFQQQNVKVVYDLEKERIGFQPMDC 421
>gi|359484086|ref|XP_002263357.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 417
Score = 187 bits (476), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 138/408 (33%), Positives = 188/408 (46%), Gaps = 45/408 (11%)
Query: 88 GYSISLSFGTPPQIIPFILDTGSHLVWFPCTN-HYQCKYCSSSKIPSFIPKLSSSSRLLG 146
GY ISL+ GTPP++I +DTGS L W PC N + C C+ + + S S
Sbjct: 11 GYLISLNLGTPPKVIQVYMDTGSDLTWVPCGNLSFDCMDCNDYRNNKLMSTYSPSYSSSS 70
Query: 147 ----CQNPKCSWIHHESIQCRDCNDEPLATSK----NCTQICPSYLVLYGS-GLTEGIAL 197
C +P CS +H C + S C + CPS+ YG+ G+ G
Sbjct: 71 LRDLCVSPLCSDVHSSDNSYDPCAVAGCSLSTLVKGTCPRPCPSFAYTYGAGGVVIGTLT 130
Query: 198 SETLNLPN------RIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNL--DKFSYCL 249
+TL R +PNF GC + R+P GIAGFGRG SLPSQL FS+C
Sbjct: 131 RDTLTTHGSSPSFTREVPNFCFGCVGSTYREPIGIAGFGRGVLSLPSQLGFLQKGFSHCF 190
Query: 250 LSHKFDDTTRTSS--LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRI 307
L KF + SS +I D S +D L +T + NP + YYY+GL I
Sbjct: 191 LGFKFANNPNISSPLVIGDLAISSNDH----LQFTSLLKNP------MYPNYYYIGLEAI 240
Query: 308 TVG-GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
TVG ++V D GNGG I+DSGTT+T L P + +S + Y R
Sbjct: 241 TVGNATAIQVPSSLREFDSHGNGGMIIDSGTTYT----HLPGPFYTQLLSMLQSIITYPR 296
Query: 367 ALGAEALTGLRPCFDVP------GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGS-- 418
A EA TG C+ +P + P + HF + LP N+F +G S
Sbjct: 297 AQEQEARTGFDLCYRIPCPNNVVTDHDHLLPSISFHFSNNVSLVLPQGNHFYAMGAPSNS 356
Query: 419 --AVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
CL + ++ GP+ + G+FQ QN V YDL +R+GF+ C
Sbjct: 357 TVVKCLLLQNMDDSDSGPAGVFGSFQQQNVKVVYDLEKERIGFQPMDC 404
>gi|357476865|ref|XP_003608718.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355509773|gb|AES90915.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 482
Score = 187 bits (475), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 138/410 (33%), Positives = 200/410 (48%), Gaps = 48/410 (11%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSS------- 141
Y++S + G Q I +DTGS LVWFPCT + C C PKL+S
Sbjct: 75 YTLSFNLGPHSQPITLYMDTGSDLVWFPCTP-FNCILCE------LKPKLTSDPSPPTNI 127
Query: 142 --SRLLGCQNPKCSWIHHESIQCRDCNDE--PLAT--SKNCTQI-CPSYLVLYGSGLTEG 194
S + C + CS H + C PL + +K+C CP + YG G
Sbjct: 128 SHSTPISCNSHACSVAHSSTPSSDLCTMAHCPLDSIETKDCGSFHCPPFYYAYGDGSLIA 187
Query: 195 IALSETLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNL------DKFSYC 248
+TL+L + NF GC+ + +P G+AGFGRG SLP+QL ++FSYC
Sbjct: 188 SLYRDTLSLSTLQLTNFTFGCAHTTFSEPTGVAGFGRGLLSLPAQLATHSPQLGNRFSYC 247
Query: 249 LLSHKF--DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
L+SH F + + S LIL G + +K++ G FV S+ E S +Y VGL+
Sbjct: 248 LVSHSFRSERIRKPSPLIL--GRYNDEKQSNGDEVVEFVYT-SMLENPKHSYFYTVGLKG 304
Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
I+VG + V +++ G+GG +VDSGTTFT + + + + + F + K+ R
Sbjct: 305 ISVGKKTVPAPKILRRVNKKGDGGVVVDSGTTFTMLPEKFYNSVVEGFDRRARKSNR--R 362
Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKG-GAEVTLPVENYFAVVGEGS------- 418
A E TGL PC+ + P + L F G + V LP +NYF +G
Sbjct: 363 APEIEQKTGLSPCYYL--NTAAIVPAVTLRFVGMNSSVVLPRKNYFYEFMDGGDGVRRKE 420
Query: 419 -AVCLTVVT---DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
CL + + E SGGP +LGN+Q Q + VEYDL +R+GF ++ C
Sbjct: 421 RVGCLMFMNGGDEAEMSGGPGGVLGNYQQQGFEVEYDLEKKRVGFARRKC 470
>gi|449458942|ref|XP_004147205.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449505000|ref|XP_004162350.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 480
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 142/412 (34%), Positives = 203/412 (49%), Gaps = 50/412 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-KIPSFIPKLSSSSRLL 145
G Y++S + G+ I +DTGS LVWFPC+ ++C C KI S +PK++++ +
Sbjct: 74 GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSP-FECILCEGKPKIQSPLPKIANNKSVS 132
Query: 146 GCQNPKCSWIHHESIQCRD------CNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSE 199
CS H S+ C E + S+ + CP + YG G +
Sbjct: 133 CSAA-ACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVARLYRD 191
Query: 200 TLNLPNRI------IPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNL------DKFSY 247
+L+LP + NF GC+ + +P G+AGFGRG S+PSQL ++FSY
Sbjct: 192 SLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSY 251
Query: 248 CLLSHKF--DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLR 305
CL+SH F D R S LIL G ++ + T YT + NP +Y VGL
Sbjct: 252 CLVSHSFAADRVRRPSPLIL--GRYYTGE--TEFIYTSLLENPK------HPYFYSVGLA 301
Query: 306 RITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYT 365
I+VG R+ +D G+GG +VDSGTTFT + L+E + EF ++ K N
Sbjct: 302 GISVGNIRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANRA 361
Query: 366 RALGAEALTGLRPCFDVPGEKTGSFPELKLHFKG-GAEVTLPVENYF--------AVVGE 416
R + E TGL PC+ E + P + LHF G + V LP +NYF VVG
Sbjct: 362 RRI--EENTGLSPCYYY--ENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGR 417
Query: 417 GSAV-CLTVVT---DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
V CL ++ + E +GGP LGN+Q Q + V YDL R+GF ++ C
Sbjct: 418 KRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQC 469
>gi|224115494|ref|XP_002332148.1| predicted protein [Populus trichocarpa]
gi|222875198|gb|EEF12329.1| predicted protein [Populus trichocarpa]
Length = 483
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 158/487 (32%), Positives = 224/487 (45%), Gaps = 58/487 (11%)
Query: 11 SFIFFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVS--SSLTRALHIKNPQTK 68
+F+FF + S+ SI SL +N NSL+ + +RA +P+
Sbjct: 10 TFLFFLLVNSLLFYSIQSLARP-------------RNPNSLILGLTPASRASLPTHPKAS 56
Query: 69 TTTTTTTTTTTNISS---HSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTN-HYQCK 124
T++ T ++ GY ISLS GTPPQ+I +DTGS L W PC N + C
Sbjct: 57 TSSRKKLTDVLDMMEPLREVRDGYLISLSIGTPPQVIQVYMDTGSDLTWAPCGNISFDCI 116
Query: 125 YCSSSK----IPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATS----KNC 176
C + + + SF P SSSS C +P C +H C + S C
Sbjct: 117 ECDNYRNNRMMASFSPSHSSSSHRDSCTSPFCIDVHSSDNPLDPCTMAGCSLSTLVKATC 176
Query: 177 TQICPSYLVLYGS-GLTEGIALSETLNLPNR------IIPNFLVGCSVLSSRQPAGIAGF 229
+ CP + YG+ G+ G +TL + R IP F GC S R+P GIAGF
Sbjct: 177 SWPCPPFAYTYGAGGVVTGTLTRDTLRVHGRNLGVTQEIPRFCFGCVASSYREPIGIAGF 236
Query: 230 GRGKTSLPSQLNLDK--FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNN 287
GRG SLPSQL + FS+C L+ K+ + SS ++ + + K + +TP + +
Sbjct: 237 GRGALSLPSQLGFLRKGFSHCFLAFKYANNPNISSPLIIGDIALTSKDD--MQFTPMLKS 294
Query: 288 PSVAERNAFSVYYYVGLRRITVGG-QRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPEL 346
P + YYYVGL ITVG V D GNGG +VDSGTT+T L
Sbjct: 295 P------MYPNYYYVGLEAITVGNVSATEVPSSLREFDSLGNGGMLVDSGTTYT----HL 344
Query: 347 FEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEK----TGS-FPELKLHFKGGA 401
EP + +S + NY RA E TG C+ VP + TG P + HF A
Sbjct: 345 PEPFYSQVLSVLQSIINYPRATDMEMRTGFDLCYKVPCQNNSILTGDLLPSITFHFLNNA 404
Query: 402 EVTLPVENYFAVVGEGS----AVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRL 457
+ L ++F + S CL + + GP+ +LG+FQ Q+ V YD+ +R+
Sbjct: 405 SLVLSRGSHFYAMSAPSNSTVVKCLLFQSMDDGDYGPAGVLGSFQQQDVEVVYDMEKERI 464
Query: 458 GFKQQLC 464
GF+ C
Sbjct: 465 GFRPMDC 471
>gi|414586111|tpg|DAA36682.1| TPA: pepsin A [Zea mays]
Length = 503
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 142/413 (34%), Positives = 191/413 (46%), Gaps = 46/413 (11%)
Query: 89 YSISLSFGTPPQIIP--FILDTGSHLVWFPCTNHYQCKYCSS--SKIPSFIPKLSSSSRL 144
Y++SLS G P LDTGS LVWFPC + C C + SR
Sbjct: 90 YTLSLSVGPASAAAPVSLFLDTGSDLVWFPCAP-FTCMLCEGKPTPGRLGPLPPPPDSRR 148
Query: 145 LGCQNPKCSWIHHES-----IQCRDCNDEPLAT-SKNCTQICPSYLVLYGSG-----LTE 193
+ C +P CS H + C E + T S + CP YG G L
Sbjct: 149 IPCASPLCSAAHASAPPSDLCAVARCPLEDIETGSCGASHACPPLYYAYGDGSLVAHLRR 208
Query: 194 G-IALSETLNLPNRI-IPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLD---KFSYC 248
G +AL + + NF C+ + +P G+AGFGRG SLP QL+ +FSYC
Sbjct: 209 GRVALGAGARASVAVAVDNFTFACAHTALGEPVGVAGFGRGPLSLPGQLSPQLSGRFSYC 268
Query: 249 LLSHKF--DDTTRTSSLILDNG--SSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGL 304
L+SH F D R S LIL + + +T G YTP ++NP +Y V L
Sbjct: 269 LVSHSFRADRLIRPSPLILGRSPDDAAAAAETDGFVYTPLLHNPK------HPYFYSVAL 322
Query: 305 RRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNY 364
++VG R++ + +DR GNGG +VDSGTTFT + E++ +A+ F M
Sbjct: 323 EAVSVGAARIQARPELARVDRAGNGGMVVDSGTTFTMLPNEMYARVAEAFARAMAAAGFA 382
Query: 365 TRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYF---------AVVG 415
AE TGL PC+ G P L LHF+G A V LP NYF A
Sbjct: 383 RAER-AEEQTGLTPCYRYAASDRG-VPPLALHFRGNATVALPRRNYFMGFKSEDAGAGTR 440
Query: 416 EGSAVCLTVVTDREASG----GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ CL ++ +ASG GP+ LGNFQ Q + V YD+ R+GF ++ C
Sbjct: 441 KDDVGCLMLMNGGDASGEEGDGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 493
>gi|449455475|ref|XP_004145478.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449518962|ref|XP_004166504.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 449
Score = 184 bits (467), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 147/424 (34%), Positives = 193/424 (45%), Gaps = 59/424 (13%)
Query: 88 GYSISLSFGTPPQIIPFILDTGSHLVWFPCTN-HYQCKYC-------SSSKIPSFIPKLS 139
GY +SLS GTPPQ++ +DTGS L W PC N + C+ C S ++ +F+P S
Sbjct: 20 GYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNISGPRLAAFLPTHS 79
Query: 140 SSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSK----NCTQICPSYLVLYG-SGLTEG 194
S+S C + C IH C + + C + CPS+ YG SG+ G
Sbjct: 80 STSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLVKGTCPRPCPSFAYTYGASGVVTG 139
Query: 195 IALSETL---------NLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDK- 244
+ L N N+ IP F GC + R+P GIAGFGRG SLP QL
Sbjct: 140 SLTRDVLFTHGNYNNNNNNNKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPFQLGFSHK 199
Query: 245 -FSYCLLSHKFDDTTRTSS-LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYV 302
FS+C L KF + SS LIL N + S K L +TP + +P + YYY+
Sbjct: 200 GFSHCFLPFKFSNNPNFSSPLILGNLAISS--KDENLQFTPLLKSP------MYPNYYYI 251
Query: 303 GLRRITVGGQ----RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM 358
GL IT+G R V K +D GNGG ++DSGTT+T L EPL + +S +
Sbjct: 252 GLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYT----HLPEPLYSQLISNL 307
Query: 359 VKNRNYTRALGAEALTGLRPCFDVPGEKTGS-------FPELKLHFKGGAEVTLPVENYF 411
Y RA E TG C+ VP + S P + HF V LP N F
Sbjct: 308 ELVIGYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITFHFLNNVSVVLPQGNNF 367
Query: 412 ----AVVGEGSAVCL-------TVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFK 460
A + CL + GP+ I G+FQ QN V YDL +RLGF+
Sbjct: 368 YAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNIEVVYDLEKERLGFQ 427
Query: 461 QQLC 464
C
Sbjct: 428 PMDC 431
>gi|255550723|ref|XP_002516410.1| pepsin A, putative [Ricinus communis]
gi|223544445|gb|EEF45965.1| pepsin A, putative [Ricinus communis]
Length = 416
Score = 183 bits (465), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 133/406 (32%), Positives = 185/406 (45%), Gaps = 41/406 (10%)
Query: 88 GYSISLSFGTPPQIIPFILDTGSHLVWFPCTN-HYQCKYCSSSKIPSFIPKLSSSSRLL- 145
GY ISL+ GTPPQ+I +DTGS L W PC N + C C + + S S
Sbjct: 11 GYLISLNIGTPPQVIQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNSKLMSAFSPSHSSSS 70
Query: 146 ---GCQNPKCSWIHHESIQCRDCNDEPLATS----KNCTQICPSYLVLYGS-GLTEGIAL 197
C +P C+ IH C + S C + CPS+ YG+ G+ G
Sbjct: 71 YRDSCASPYCTDIHSSDNSFDPCTVAGCSLSTLIKATCARPCPSFAYTYGAGGVVTGTLT 130
Query: 198 SETLNL---PNRI---IPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDK--FSYCL 249
+TL + P R+ IP F GC + +P GIAGF RG S PSQL L K FS+C
Sbjct: 131 RDTLRVHEGPARVTKDIPKFCFGCVGSTYHEPIGIAGFVRGTLSFPSQLGLLKKGFSHCF 190
Query: 250 LSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
L+ K+ + SS ++ ++ S K + +TP + +P + YYY+GL ITV
Sbjct: 191 LAFKYANNPNISSPLVIGDTALSSKDN--MQFTPMLKSP------MYPNYYYIGLEAITV 242
Query: 310 GG-QRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
G V D GNGG ++DSGTT+T L EP + +S Y RA
Sbjct: 243 GNVSATTVPLNLREFDSQGNGGMLIDSGTTYT----HLPEPFYSQLLSIFKAIITYPRAT 298
Query: 369 GAEALTGLRPCFDVP------GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGS---- 418
E G C+ VP + FP + HF LP N+F + S
Sbjct: 299 EVEMRAGFDLCYKVPCPNNRLTDDDNLFPSITFHFLNNVSFVLPQGNHFYAMSAPSNSTV 358
Query: 419 AVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
CL + ++ GP+ + G+FQ QN + YDL +R+GF+ C
Sbjct: 359 VKCLLFQSMADSDYGPAGVFGSFQQQNVQIVYDLEKERIGFQPMDC 404
>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 454
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 137/387 (35%), Positives = 179/387 (46%), Gaps = 47/387 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G + + +S GTP I+DTGS LVW C C C P F P SS+ +
Sbjct: 103 GEFLMDVSIGTPALAYSAIVDTGSDLVWTQCK---PCVDCFKQSTPVFDPSSSSTYATVP 159
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPN 205
C + CS L TSK + Y YG S T+G+ +ET L
Sbjct: 160 CSSASCS---------------DLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAK 204
Query: 206 RIIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
+P + GC + Q AG+ G GRG SL SQL LDKFSYCL S D T S
Sbjct: 205 SKLPGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCLTSL---DDTNNS 261
Query: 262 SLILDN--GSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
L+L + G S + + + TP + NPS +YYV L+ ITVG R+ +
Sbjct: 262 PLLLGSLAGISEASAAASSVQTTPLIKNPSQPS------FYYVSLKAITVGSTRISLPSS 315
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
+ DG GG IVDSGT+ T++ + + L F +QM A G+ GL C
Sbjct: 316 AFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMA----LPAADGSG--VGLDLC 369
Query: 380 FDVP--GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
F P G P L HF GGA++ LP ENY + G A+CLTV+ R S I
Sbjct: 370 FRAPAKGVDQVEVPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMGSRGLS-----I 424
Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+GNFQ QN+ YD+ + L F C
Sbjct: 425 IGNFQQQNFQFVYDVGHDTLSFAPVQC 451
>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
Length = 444
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 137/387 (35%), Positives = 179/387 (46%), Gaps = 47/387 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G + + +S GTP I+DTGS LVW C C C P F P SS+ +
Sbjct: 93 GEFLMDVSIGTPALAYSAIVDTGSDLVWTQCK---PCVDCFKQSTPVFDPSSSSTYATVP 149
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPN 205
C + CS L TSK + Y YG S T+G+ +ET L
Sbjct: 150 CSSASCS---------------DLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAK 194
Query: 206 RIIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
+P + GC + Q AG+ G GRG SL SQL LDKFSYCL S D T S
Sbjct: 195 SKLPGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCLTSL---DDTNNS 251
Query: 262 SLILDN--GSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
L+L + G S + + + TP + NPS +YYV L+ ITVG R+ +
Sbjct: 252 PLLLGSLAGISEASAAASSVQTTPLIKNPSQPS------FYYVSLKAITVGSTRISLPSS 305
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
+ DG GG IVDSGT+ T++ + + L F +QM A G+ GL C
Sbjct: 306 AFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMA----LPAADGSG--VGLDLC 359
Query: 380 FDVP--GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
F P G P L HF GGA++ LP ENY + G A+CLTV+ R S I
Sbjct: 360 FRAPAKGVDQVEVPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMGSRGLS-----I 414
Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+GNFQ QN+ YD+ + L F C
Sbjct: 415 IGNFQQQNFQFVYDVGHDTLSFAPVQC 441
>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
Length = 423
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 137/387 (35%), Positives = 179/387 (46%), Gaps = 47/387 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G + + +S GTP I+DTGS LVW C C C P F P SS+ +
Sbjct: 72 GEFLMDVSIGTPALAYSAIVDTGSDLVWTQCK---PCVDCFKQSTPVFDPSSSSTYATVP 128
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPN 205
C + CS L TSK + Y YG S T+G+ +ET L
Sbjct: 129 CSSASCS---------------DLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAK 173
Query: 206 RIIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
+P + GC + Q AG+ G GRG SL SQL LDKFSYCL S D T S
Sbjct: 174 SKLPGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCLTSL---DDTNNS 230
Query: 262 SLILDN--GSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
L+L + G S + + + TP + NPS +YYV L+ ITVG R+ +
Sbjct: 231 PLLLGSLAGISEASAAASSVQTTPLIKNPSQPS------FYYVSLKAITVGSTRISLPSS 284
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
+ DG GG IVDSGT+ T++ + + L F +QM A G+ GL C
Sbjct: 285 AFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMA----LPAADGSG--VGLDLC 338
Query: 380 FDVP--GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
F P G P L HF GGA++ LP ENY + G A+CLTV+ R S I
Sbjct: 339 FRAPAKGVDQVEVPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMGSRGLS-----I 393
Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+GNFQ QN+ YD+ + L F C
Sbjct: 394 IGNFQQQNFQFVYDVGHDTLSFAPVQC 420
>gi|242091057|ref|XP_002441361.1| hypothetical protein SORBIDRAFT_09g025220 [Sorghum bicolor]
gi|241946646|gb|EES19791.1| hypothetical protein SORBIDRAFT_09g025220 [Sorghum bicolor]
Length = 439
Score = 179 bits (454), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 140/423 (33%), Positives = 197/423 (46%), Gaps = 55/423 (13%)
Query: 88 GYSISLSFGTPPQIIPFILDTGSHLVWFPC--TNHYQCKYCSSS--KIPSFIPKLSSSSR 143
GY +SL+ GTPPQ+ LDTGS L W PC ++ YQC C SS P+F+P S+S+
Sbjct: 24 GYLLSLNLGTPPQVFQVYLDTGSDLTWVPCGSSSSYQCLDCGSSVKPTPTFLPSESTSNT 83
Query: 144 LLGCQNPKCSWIHHESIQCRDCNDE----PLATSKNCTQICPSYLVLYGSG-LTEGIALS 198
C + C +H + C P T C + CP + YG G L G
Sbjct: 84 RDLCGSRFCVDVHSSDNRFDPCAAAGCAIPAFTGGQCPRPCPPFSYTYGGGALVLGSLSR 143
Query: 199 ETLNLPNR-------------IIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNL--D 243
+++ L P F GC S R+P GIAGFGRG SLPSQL
Sbjct: 144 DSVTLHGSTHGSGAGAGPLPVAFPGFGFGCVGSSIREPLGIAGFGRGALSLPSQLGFLGK 203
Query: 244 KFSYCLLSHKFD-DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYV 302
FS+C L +F + TS L++ + + S G +TP + + + + +YYV
Sbjct: 204 GFSHCFLGFRFARNPNFTSPLVMGDLALSSASTDGGFVFTPMLTSAT------YPNFYYV 257
Query: 303 GLRRITV----GGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPL-ADEFVSQ 357
GL + + GG + +D GNGG +VD+GTT+T +L +P A S
Sbjct: 258 GLEGVVLGDDDGGSAMAAPPSLSGIDAQGNGGVLVDTGTTYT----QLPDPFYASVLASL 313
Query: 358 MVKNRNYTRALGAEALTGLRPCFDVPGEKT----GSFPELKLHFKGGAEVTLP-VENYFA 412
+ Y R+ EA TG CF VP + P + LH GGA + LP + +Y+
Sbjct: 314 ISAAPPYERSRDLEARTGFDLCFKVPCARAPCADDELPPITLHLAGGARLALPKLSSYYP 373
Query: 413 VVG-EGSAVCLTVVTDR---------EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQ 462
V S V ++ R + GGP+ +LG+FQMQN V YDL R+GF+ +
Sbjct: 374 VTAIRDSVVVKCLLFQRMEMEDDGDGTSGGGPAAVLGSFQMQNVEVVYDLAAGRVGFRPR 433
Query: 463 LCK 465
C
Sbjct: 434 DCA 436
>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
Length = 516
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 135/379 (35%), Positives = 174/379 (45%), Gaps = 47/379 (12%)
Query: 95 FGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSW 154
GTP I+DTGS LVW C C C P F P SS+ + C + CS
Sbjct: 173 IGTPALAYSAIVDTGSDLVWTQCK---PCVDCFKQSTPVFDPSSSSTYATVPCSSASCS- 228
Query: 155 IHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPNRIIPNFLV 213
L TSK + Y YG S T+G+ +ET L +P +
Sbjct: 229 --------------DLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSKLPGVVF 274
Query: 214 GCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDN-- 267
GC + Q AG+ G GRG SL SQL LDKFSYCL S D T S L+L +
Sbjct: 275 GCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCLTSL---DDTNNSPLLLGSLA 331
Query: 268 GSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDG 327
G S + + + TP + NPS +YYV L+ ITVG R+ + + DG
Sbjct: 332 GISEASAAASSVQTTPLIKNPSQPS------FYYVSLKAITVGSTRISLPSSAFAVQDDG 385
Query: 328 NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP--GE 385
GG IVDSGT+ T++ + + L F +QM A G+ GL CF P G
Sbjct: 386 TGGVIVDSGTSITYLEVQGYRALKKAFAAQMA----LPAADGSG--VGLDLCFRAPAKGV 439
Query: 386 KTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQN 445
P L HF GGA++ LP ENY + G A+CLTV+ R S I+GNFQ QN
Sbjct: 440 DQVEVPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMGSRGLS-----IIGNFQQQN 494
Query: 446 YYVEYDLRNQRLGFKQQLC 464
+ YD+ + L F C
Sbjct: 495 FQFVYDVGHDTLSFAPVQC 513
>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
Length = 451
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 147/475 (30%), Positives = 214/475 (45%), Gaps = 56/475 (11%)
Query: 10 LSFIFFFTLLSIFPSSITS-LTFSLSRFHTNPSQD--SYQNLNSLVSSSLTRALHIKNPQ 66
L++ FTLL ++ T+ LT H + + ++ L+ + S RA +
Sbjct: 10 LAYALIFTLLFTAAATPTAGLTMRADLTHVDKGRGFTRWERLSRMAVRSRARAASLYQRG 69
Query: 67 TKTTTTTTTTTTTNISSHSYGGYSISLSFGTP-PQIIPFILDTGSHLVWFPCTNHYQCKY 125
T T S G Y I + GTP PQ + +DTGS LVW CT C
Sbjct: 70 GHYGQPVTATAVP-----SSGEYLIHFNIGTPRPQRVALTMDTGSDLVWTQCT---PCPV 121
Query: 126 CSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLV 185
C P F P +SS+ R + C +P C R + ++ T C YL
Sbjct: 122 CFDQPFPLFDPSVSSTFRAVACPDPIC----------RPSSGLSVSACALKTFRC-FYLC 170
Query: 186 LYGS-GLTEGIALSETLNL--------PNRIIPNFLVGCS-----VLSSRQPAGIAGFGR 231
YG +T G +T P + GC V +S + +GIAGFGR
Sbjct: 171 SYGDKSITAGYIFKDTFTFMSPNGEGAPPVAVSGLAFGCGDYNTGVFASNE-SGIAGFGR 229
Query: 232 GKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTG-LTYTPFVNNPSV 290
G SLPSQL + +FSYCL SH ++ +TS++ L + ++G TP +++PS
Sbjct: 230 GPLSLPSQLRVGRFSYCLTSHDETESNKTSAVFLGTPPNGLRAHSSGPFRSTPIIHSPS- 288
Query: 291 AERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPL 350
F +YY+ L ITVG R+ V L +DG+GGT++DSGT T +FE L
Sbjct: 289 -----FPTFYYLSLEGITVGKTRLPVDSSVFALKKDGSGGTVIDSGTGVTTFPAAVFEQL 343
Query: 351 ADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP-GEKTGSFPELKLHFKGGAEVTLPVEN 409
+EFV+Q+ R + L CF P G K P+L H A++ LP EN
Sbjct: 344 KNEFVAQLPLPRYDNTSEVGNLL-----CFQRPKGGKQVPVPKLIFHL-ASADMDLPREN 397
Query: 410 YFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
Y + +CL ++ E +++GNFQ QN ++ YD+ N +L F C
Sbjct: 398 YIPEDTDSGVMCL-MINGAEVD---MVLIGNFQQQNMHIVYDVENSKLLFASAQC 448
>gi|125552953|gb|EAY98662.1| hypothetical protein OsI_20585 [Oryza sativa Indica Group]
Length = 429
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 146/417 (35%), Positives = 201/417 (48%), Gaps = 55/417 (13%)
Query: 88 GYSISLSFGTPPQIIPFILDTGSHLVWFPC--TNHYQCKYC----SSSKIPSFIPKLSSS 141
GY +SL+ G PPQ+ LDTGS L W PC + YQC C S+SK SS
Sbjct: 24 GYLLSLNLGMPPQVFQVYLDTGSDLTWVPCGTNSSYQCLECGNEHSTSKPIPSFSPSQSS 83
Query: 142 SRLLG-CQNPKCSWIH-----HESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEG 194
S + C + C IH H+ C P S CT+ CP + YG G L G
Sbjct: 84 SNMKELCGSRFCVDIHSSDNSHDPCAAVGCA-IPSFMSGLCTRPCPPFSYTYGGGALVLG 142
Query: 195 IALSETLNLPNRI--------IPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLN-LDK- 244
+ + L I +P F GC S R+P GIAGFG+G SLPSQL LDK
Sbjct: 143 SLAKDIVTLHGSIFGIAILLDVPGFCFGCVGSSIREPIGIAGFGKGILSLPSQLGFLDKG 202
Query: 245 FSYCLLSHKFD-DTTRTSSLIL-DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYV 302
FS+C L +F + TSSLI+ D S D +TP + S+ N +YY+
Sbjct: 203 FSHCFLGFRFARNPNFTSSLIMGDLALSAKDD----FLFTPMLK--SITNPN----FYYI 252
Query: 303 GLRRITVG-GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN 361
GL +++G G + ++D +GNGG IVD+GTT+T L +P +S +
Sbjct: 253 GLEGVSIGDGAAIAAPPSLSSIDSEGNGGMIVDTGTTYT----HLPDPFYTAILSSLASV 308
Query: 362 RNYTRALGAEALTGLRPCFDVPGEKT----GSFPELKLHFKGGAEVTLPVEN-YFAVVGE 416
Y R+ E TG CF +P T P + HF G ++TLP ++ Y+AV
Sbjct: 309 ILYERSYDLEMRTGFDLCFKIPCTHTPCTQDELPLINFHFLGDVKLTLPKDSCYYAVTAP 368
Query: 417 GSAVCLTVV----TDRE-----ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
++V + + D E A+ GP +LG+FQMQN V YD+ R+GF+ + C
Sbjct: 369 KNSVVVKCLLFQRMDDEDDVGGANNGPGAVLGSFQMQNVEVVYDMEAGRIGFQPKDC 425
>gi|357128791|ref|XP_003566053.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 441
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 140/426 (32%), Positives = 201/426 (47%), Gaps = 61/426 (14%)
Query: 88 GYSISLSFGTPPQIIPFILDTGSHLVWFPCTNH--YQCKYCS-----SSKIPSFIPKLSS 140
GY +SL+ GTPPQ+ LDTGS L W PC + YQC C S P+F S
Sbjct: 24 GYLLSLNLGTPPQVFQVYLDTGSDLTWVPCGTNTSYQCLECGNEHSISKPTPAFSLSQSY 83
Query: 141 SSRLLGCQNPKCSWIH-----HESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEG 194
SS C + C +H H++ C+ P+ S CT++CP + YG L G
Sbjct: 84 SSTRDLCGSRFCVDVHSSDNSHDACAAAGCS-IPVFMSGLCTRLCPPFAYTYGGRALVLG 142
Query: 195 IALSETLNLPNRI--------IPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLN-LDK- 244
+T+ L I P F GC S R+P GIAGFG+GK SLPSQL LDK
Sbjct: 143 SLARDTIALHGSIYGISVPIEFPGFCFGCVGSSIREPIGIAGFGKGKLSLPSQLGFLDKG 202
Query: 245 FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGL 304
FS+C L F +S ++ + S K G +TP + + + + +YY+GL
Sbjct: 203 FSHCFLGFWFARNPNITSPMVIGDLALSVKD--GFLFTPMLKSLT------YPNFYYIGL 254
Query: 305 RRITVGGQRVRVWHKYLT-LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRN 363
+T+G L+ +D +GNGG IVD+GTT+T ++ +P +S +
Sbjct: 255 EGVTIGDNAAIPAPPSLSGIDSEGNGGVIVDTGTTYTHLS----DPFYASVLSSLSSTVP 310
Query: 364 YTRALGAEALTGLRPCFDVPGEKT----GSFPELKLHFKGGAEVTLPVEN-YFAVVGEGS 418
Y R+ E TG C VP P + +H G + LP E+ Y+AV +
Sbjct: 311 YNRSYELEIRTGFDLCLKVPCMHAPCNDDELPPITVHLGGDVTLALPKESCYYAVTAPRN 370
Query: 419 AVCLTVV---------------TDRE----ASGGPSIILGNFQMQNYYVEYDLRNQRLGF 459
+V + + D E ++GGP+ +LG+FQMQN V YDL + R+GF
Sbjct: 371 SVVIKCLLFQRKDDDGVFSADNDDGEDASFSAGGPAAVLGSFQMQNVEVVYDLESGRVGF 430
Query: 460 KQQLCK 465
+ + C
Sbjct: 431 QPRDCA 436
>gi|297724243|ref|NP_001174485.1| Os05g0511050 [Oryza sativa Japonica Group]
gi|222632192|gb|EEE64324.1| hypothetical protein OsJ_19161 [Oryza sativa Japonica Group]
gi|255676482|dbj|BAH93213.1| Os05g0511050 [Oryza sativa Japonica Group]
Length = 432
Score = 174 bits (442), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 144/420 (34%), Positives = 200/420 (47%), Gaps = 58/420 (13%)
Query: 88 GYSISLSFGTPPQIIPFILDTGSHLVWFPC--TNHYQCKYC----SSSKIPSFIPKLSSS 141
GY +SL+ G PPQ+ LDTGS L W PC + YQC C S+SK SS
Sbjct: 24 GYLLSLNLGMPPQVFQVYLDTGSDLTWVPCGTNSSYQCLECGNEHSTSKPIPSFSPSQSS 83
Query: 142 SRLLG-CQNPKCSWIH-----HESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEG 194
S + C + C IH H+ C P S CT+ CP + YG G L G
Sbjct: 84 SNMKELCGSRFCVDIHSSDNSHDPCAAVGCA-IPSFMSDLCTRPCPPFSYTYGGGALVLG 142
Query: 195 IALSETLNLPNRI--------IPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLN-LDK- 244
+ + L I +P F GC S R+P GIAGFG+G SLPSQL LDK
Sbjct: 143 SLAKDIVTLHGSIFGIAILLDVPGFCFGCVGSSIREPIGIAGFGKGILSLPSQLGFLDKG 202
Query: 245 FSYCLLSHKFD-DTTRTSSLIL-DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYV 302
FS+C L +F + TSSLI+ D S D +TP + S+ N +YY+
Sbjct: 203 FSHCFLGFRFARNPNFTSSLIMGDLALSAKDD----FLFTPMLK--SITNPN----FYYI 252
Query: 303 GLRRITVG-GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN 361
GL +++G G + ++D +GNGG IVD+GTT+T L +P +S +
Sbjct: 253 GLEGVSIGDGAAIAAPPSLSSIDSEGNGGMIVDTGTTYT----HLPDPFYTAILSSLASV 308
Query: 362 RNYTRALGAEALTGLRPCFDVPGEKT----GSFPELKLHFKGGAEVTLPVEN-YFAVVGE 416
Y R+ E TG CF +P T P + HF G ++TLP ++ Y+AV
Sbjct: 309 ILYERSYDLEMRTGFDLCFKIPCTHTPCTQDELPLINFHFLGDVKLTLPKDSCYYAVTAP 368
Query: 417 GSAVCLTVVTDRE------------ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
++V + + + A+ GP +LG+FQMQN V YD+ R+GF+ + C
Sbjct: 369 KNSVVVKCLLFQRMDNDDDDDDVGGANNGPGAVLGSFQMQNVEVVYDMEAGRIGFQPKDC 428
>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
Length = 445
Score = 173 bits (439), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 148/452 (32%), Positives = 213/452 (47%), Gaps = 59/452 (13%)
Query: 33 LSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQ---TKTTTTTTTTTTTNISSHSYGGY 89
L+R H +PS + Q V +L R +H N + ++ TT + T IS + G Y
Sbjct: 32 LTRIHADPSVTASQ----FVRDALRRDMHRHNARQLAASSSNGTTVSAPTQISPTA-GEY 86
Query: 90 SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS----KIPSFIPKLSSSSRLL 145
++L+ GTPP I DTGS L+W QC CSS P + P S++ +L
Sbjct: 87 LMTLAIGTPPVSYQAIADTGSDLIW------TQCAPCSSQCFQQPTPLYNPSSSTTFAVL 140
Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPN 205
C S + + P CT + Y + YGSG T SET +
Sbjct: 141 PCN----SSLSMCAAALAGTTPPP-----GCTCM---YNMTYGSGWTSVYQGSETFTFGS 188
Query: 206 RI------IPNFLVGCSVLS----SRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFD 255
+P GCS S + +G+ G GRG SL SQL + KFSYCL +
Sbjct: 189 STPANQTGVPGIAFGCSNASGGFNTSSASGLVGLGRGSLSLVSQLGVPKFSYCL--TPYQ 246
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
DT TS+L+L G S S T G++ TPFV +PS A S YYY+ L I++G +
Sbjct: 247 DTNSTSTLLL--GPSASLNDTGGVSSTPFVASPSDAP---MSTYYYLNLTGISLGTTALS 301
Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
+ L+L DG GG I+DSGTT T + ++ + VS + G A TG
Sbjct: 302 IPTTALSLKADGTGGFIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTD----GGSAATG 357
Query: 376 LRPCFDVPGEKTG--SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
L CF++P + + P + LHF GA++ LP ++Y + + + CL + + GG
Sbjct: 358 LDLCFELPSSTSAPPTMPSMTLHFD-GADMVLPADSYMML--DSNLWCLAM--QNQTDGG 412
Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
S ILGN+Q QN ++ YD+ + L F C
Sbjct: 413 VS-ILGNYQQQNMHILYDVGQETLTFAPAKCS 443
>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
Length = 452
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 134/389 (34%), Positives = 178/389 (45%), Gaps = 48/389 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G + + ++ GTP I+DTGS LVW C C C P F P SS+ +
Sbjct: 98 GEFLMDVAIGTPALSYAAIVDTGSDLVWTQCK---PCVDCFKQSTPVFDPSSSSTYATVP 154
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNL-- 203
C + CS D P +T + ++ Y YG + T+G+ SET L
Sbjct: 155 CSSALCS-------------DLPTSTCTSASKC--GYTYTYGDASSTQGVLASETFTLGK 199
Query: 204 PNRIIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTR 259
+ +P GC + Q AG+ G GRG SL SQL LDKFSYCL S DD
Sbjct: 200 EKKKLPGVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGLDKFSYCLTS--LDDGDG 257
Query: 260 TSSLILDNGSSHSDKKTTG--LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
S L+L ++ + + TP V NPS +YYV L +TVG R+ +
Sbjct: 258 KSPLLLGGSAAAISESAATAPVQTTPLVKNPSQPS------FYYVSLTGLTVGSTRITLP 311
Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
+ DG GG IVDSGT+ T++ + + L FV+QM G+E GL
Sbjct: 312 ASAFAIQDDGTGGVIVDSGTSITYLELQGYRALKKAFVAQMA----LPTVDGSE--IGLD 365
Query: 378 PCFDVP--GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPS 435
CF P G P+L LHF GGA++ LP ENY + A+CLTV R S
Sbjct: 366 LCFQGPAKGVDEVQVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVAPSRGLS---- 421
Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GNFQ QN+ YD+ L F C
Sbjct: 422 -IIGNFQQQNFQFVYDVAGDTLSFAPVQC 449
>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 455
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 148/487 (30%), Positives = 213/487 (43%), Gaps = 74/487 (15%)
Query: 16 FTLLSIFPSSI------TSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKN----- 64
F++L I +I ++ L+R H +P + + V +L R +H
Sbjct: 4 FSVLLILACTILASDAAAAVRVGLTRIHADPEVTASE----FVRGALRRDMHRHARFARE 59
Query: 65 ---PQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHY 121
P + T T + G Y ++LS GTPP I DTGS L+W
Sbjct: 60 QLAPSSAAAAGLTVGAPTQKDLRNGGEYIMTLSIGTPPLSYRAIADTGSDLIW------T 113
Query: 122 QCKYCSSSKIPS-----------FIPKLSSSSRLLGCQNP--KCSWIHHESIQCRDCNDE 168
QC C + + + P S++ +L C +P C+ + S
Sbjct: 114 QCAPCGDTVTDTDNQCFKQSGCLYNPSSSTTFGVLPCNSPLSMCAAMAGPS--------- 164
Query: 169 PLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL------PNRIIPNFLVGCSVLSSRQ 222
C + Y YG+G T G+ ET P +PN GCS SS
Sbjct: 165 ---PPPGCACM---YNQTYGTGWTAGVQSVETFTFGSSSTPPAVRVPNIAFGCSNASSND 218
Query: 223 ---PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGL 279
AG+ G GRG SL SQL FSYCL F D TS+L+L ++ + K T +
Sbjct: 219 WNGSAGLVGLGRGSMSLVSQLGAGAFSYCLT--PFQDANSTSTLLLGPSAAAALKGTGPV 276
Query: 280 TYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTF 339
TPFV PS A S YYY+ L I+VG + + +L DG GG I+DSGTT
Sbjct: 277 RSTPFVAGPSKAP---MSTYYYLNLTGISVGETALAIPPDAFSLRADGTGGLIIDSGTTI 333
Query: 340 TFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGE-KTGSFPELKLHFK 398
T + ++ + S +V A G + TGL CF + + P + LHF+
Sbjct: 334 TTLVDSAYQQVRAAVRSLLVT--RLPLAHGPDHSTGLDLCFALKASTPPPAMPSMTLHFE 391
Query: 399 GGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLG 458
GGA++ LPVENY ++G G CL + R + G ++GN+Q QN +V YD+R + L
Sbjct: 392 GGADMVLPVENYM-ILGSG-VWCLAM---RNQTVGAMSMVGNYQQQNIHVLYDVRKETLS 446
Query: 459 FKQQLCK 465
F +C
Sbjct: 447 FAPAVCS 453
>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
Length = 448
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 143/479 (29%), Positives = 218/479 (45%), Gaps = 66/479 (13%)
Query: 10 LSFIFFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQT-- 67
L+ + F + + S S+ L+R H++P + + V +L R +H + ++
Sbjct: 11 LAVLVFLVVCATLASGAASVRVGLTRIHSDPDITAPE----FVRDALRRDMHRQQSRSLF 66
Query: 68 ----KTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC 123
+ TT + T + G Y ++LS GTPP P I DTGS L+W QC
Sbjct: 67 GRELAESDGTTVSARTRKDLPNGGEYLMTLSIGTPPLSYPAIADTGSDLIW------TQC 120
Query: 124 KYCSSSK-----IPSFIPKLSSSSRLLGCQNP--KCSWIHHESIQCRDCNDEPLATSKNC 176
CS + P + P S++ +L C + C+ + A C
Sbjct: 121 APCSGDQCFAQPAPLYNPASSTTFGVLPCNSSLSMCAGVLAGK-----------APPPGC 169
Query: 177 TQICPSYLVLYGSGLTEGIALSETLNLPNRI-----IPNFLVGCSVLSSRQ---PAGIAG 228
+ Y YG+G T G+ SET + +P GCS SS AG+ G
Sbjct: 170 ACM---YNQTYGTGWTAGVQGSETFTFGSAAADQARVPGIAFGCSNASSSDWNGSAGLVG 226
Query: 229 FGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNP 288
GRG SL SQL +FSYCL F DT TS+L+L ++ + TG+ TPFV +P
Sbjct: 227 LGRGSLSLVSQLGAGRFSYCL--TPFQDTNSTSTLLLGPSAALNG---TGVRSTPFVASP 281
Query: 289 SVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFE 348
+ A S YYY+ L I++G + + + +L DG GG I+DSGTT T + ++
Sbjct: 282 AKAP---MSTYYYLNLTGISLGAKALSISPDAFSLKADGTGGLIIDSGTTITSLVNAAYQ 338
Query: 349 PLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTG--SFPELKLHFKGGAEVTLP 406
V V++ A+ TGL C+ +P + + P + LHF GA++ LP
Sbjct: 339 Q-----VRAAVQSLVTLPAIDGSDSTGLDLCYALPTPTSAPPAMPSMTLHFD-GADMVLP 392
Query: 407 VENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
++Y + G G CL + R + G GN+Q QN ++ YD+RN+ L F C
Sbjct: 393 ADSYM-ISGSG-VWCLAM---RNQTDGAMSTFGNYQQQNMHILYDVRNEMLSFAPAKCS 446
>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 436
Score = 172 bits (435), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 130/418 (31%), Positives = 197/418 (47%), Gaps = 54/418 (12%)
Query: 56 LTRALHIKNPQTKTTTTTTTTTTTNISSHSYGG---YSISLSFGTPPQIIPFILDTGSHL 112
L RA+ + + + T + +++ + + G + + L+ GTP + I+DTGS L
Sbjct: 61 LQRAMKRGKLRLQRLSAKTASFESSVEAPVHAGNGEFLMKLAIGTPAETYSAIMDTGSDL 120
Query: 113 VWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLAT 172
+W C CK C P F PK SSS L C + C+ + P+++
Sbjct: 121 IWTQCK---PCKDCFDQPTPIFDPKKSSSFSKLPCSSDLCAAL-------------PISS 164
Query: 173 SKNCTQICPSYLVLYGS-GLTEGIALSETLNLPNRIIPNFLVGCSVLSS----RQPAGIA 227
C+ C YL YG T+G+ +ET + + GC + Q AG+
Sbjct: 165 ---CSDGC-EYLYSYGDYSSTQGVLATETFAFGDASVSKIGFGCGEDNDGSGFSQGAGLV 220
Query: 228 GFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNN 287
G GRG SL SQL KFSYCL S DD+ SSL++ + ++ + TT P + N
Sbjct: 221 GLGRGPLSLISQLGEPKFSYCLTS--MDDSKGISSLLVGSEATMKNAITT-----PLIQN 273
Query: 288 PSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELF 347
PS +YY+ L I+VG + + ++ DG+GG I+DSGTT T++ F
Sbjct: 274 PSQPS------FYYLSLEGISVGDTLLPIEKSTFSIQNDGSGGLIIDSGTTITYLEDSAF 327
Query: 348 EPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEK-TGSFPELKLHFKGGAEVTLP 406
L EF+SQ+ + + + + TGL CF +P + T P+L HF+ GA++ LP
Sbjct: 328 AALKKEFISQLKLDVDESGS------TGLDLCFTLPPDASTVDVPQLVFHFE-GADLKLP 380
Query: 407 VENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
ENY +CLT+ + S I GNFQ QN V +DL + + F C
Sbjct: 381 AENYIIADSGLGVICLTMGSSSGMS-----IFGNFQQQNIVVLHDLEKETISFAPAQC 433
>gi|297740191|emb|CBI30373.3| unnamed protein product [Vitis vinifera]
Length = 218
Score = 171 bits (434), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 94/230 (40%), Positives = 135/230 (58%), Gaps = 17/230 (7%)
Query: 240 LNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVY 299
+ + KF+YCL SH +DDT + LILD + D KT GL+YTPF+ +P A + Y
Sbjct: 1 MGVKKFAYCLNSHDYDDTRNSGKLILD----YRDGKTKGLSYTPFLKSPP-----ASAFY 51
Query: 300 YYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFT-FMAPELFEPLADEFVSQM 358
Y++G++ I +G + +R+ KYL DG G I+DSG +M +F+ + +E QM
Sbjct: 52 YHLGVKDIKIGNKLLRIPSKYLAPGSDGRSGVIIDSGYGGAGYMTGPVFKIVTNELKKQM 111
Query: 359 VKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGS 418
K Y R+L AE TGL PC++ G K+ P L F+GGA + +P +NYF + + S
Sbjct: 112 SK---YRRSLEAETQTGLTPCYNFTGHKSIKIPPLIYQFRGGANMVVPGKNYFGISPQES 168
Query: 419 AVCLTVVTDR----EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
C + T+ E + PSIILGN Q +YYVEYDL+N R GF++Q C
Sbjct: 169 LACFLMDTNGTNALEITPDPSIILGNSQHVDYYVEYDLKNDRFGFRRQTC 218
>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like, partial [Cucumis sativus]
Length = 716
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 127/390 (32%), Positives = 186/390 (47%), Gaps = 52/390 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G + + L+ G+PP+ I+DTGS L+W C C+ C P F PK SSS +
Sbjct: 364 GEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCK---PCQQCFDQSTPIFDPKQSSSFYKIS 420
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPN 205
C + C + P +T C+ YL YG S T+G+ ET +
Sbjct: 421 CSSELCGAL-------------PTST---CSSDGCEYLYTYGDSSSTQGVLAFETFTFGD 464
Query: 206 RI-----IPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
IP GC ++ Q AG+ G GRG SL SQL KF+YCL + D
Sbjct: 465 STEDQISIPGLGFGCGNDNNGDGFSQGAGLVGLGRGPLSLVSQLKEQKFAYCLTAI---D 521
Query: 257 TTRTSSLILDNGSSHSDKKTTG-LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
++ SSL+L + ++ + K + + TP + NPS +YY+ L+ I+VGG ++
Sbjct: 522 DSKPSSLLLGSLANITPKTSKDEMKTTPLIKNPSQPS------FYYLSLQGISVGGTQLS 575
Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
+ L DG+GG I+DSGTT T++ F L +EF++QM + G
Sbjct: 576 IPKSTFELHDDGSGGVIIDSGTTITYVENSAFTSLKNEFIAQM------NLPVDDSGTGG 629
Query: 376 LRPCFDVP-GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGP 434
L CF++P G P+L HFK GA++ LP ENY + +CL + + R S
Sbjct: 630 LDLCFNLPAGTNQVEVPKLTFHFK-GADLELPGENYMIGDSKAGLLCLAIGSSRGMS--- 685
Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I GN Q QN+ V +DL+ + L F C
Sbjct: 686 --IFGNLQQQNFMVVHDLQEETLSFLPTQC 713
>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 472
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 143/480 (29%), Positives = 217/480 (45%), Gaps = 60/480 (12%)
Query: 10 LSFIFFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKN----- 64
L+ + F + + S S+ L+R H++P + Q V +L R +H +
Sbjct: 27 LAVLVFLVVCATLASGAASVRVGLTRIHSDPDTTAPQ----FVRDALRRDMHRQRSRSFG 82
Query: 65 -------PQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPC 117
++ T+TT + T + G Y ++L+ GTPP + DTGS L+W C
Sbjct: 83 RDRDRELAESDGRTSTTVSARTRKDLPNGGEYLMTLAIGTPPLPYAAVADTGSDLIWTQC 142
Query: 118 TNHYQC-KYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNC 176
C C P + P S++ +L C + S+ A C
Sbjct: 143 A---PCGTQCFEQPAPLYNPASSTTFSVLPC---------NSSLSMCAGALAGAAPPPGC 190
Query: 177 TQICPSYLVLYGSGLTEGIALSETLNLPNRI-----IPNFLVGCSVLSSRQ---PAGIAG 228
+ Y YG+G T G+ SET + +P GCS SS AG+ G
Sbjct: 191 ACM---YYQTYGTGWTAGVQGSETFTFGSSAADQARVPGVAFGCSNASSSDWNGSAGLVG 247
Query: 229 FGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNP 288
GRG SL SQL +FSYCL F DT TS+L+L ++ + TG+ TPFV +P
Sbjct: 248 LGRGSLSLVSQLGAGRFSYCL--TPFQDTNSTSTLLLGPSAALNG---TGVRSTPFVASP 302
Query: 289 SVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFE 348
+ R S YYY+ L I++G + + + +L DG GG I+DSGTT T +A ++
Sbjct: 303 A---RAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTGGLIIDSGTTITSLANAAYQ 359
Query: 349 PLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGS---FPELKLHFKGGAEVTL 405
+ SQ+V G+++ TGL CF +P + P + LHF GA++ L
Sbjct: 360 QVRAAVKSQLVTTLPTVD--GSDS-TGLDLCFALPAPTSAPPAVLPSMTLHFD-GADMVL 415
Query: 406 PVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
P ++Y + G G CL + R + G GN+Q QN ++ YD+R + L F C
Sbjct: 416 PADSYM-ISGSG-VWCLAM---RNQTDGAMSTFGNYQQQNMHILYDVREETLSFAPAKCS 470
>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 461
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 127/390 (32%), Positives = 186/390 (47%), Gaps = 52/390 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G + + L+ G+PP+ I+DTGS L+W C C+ C P F PK SSS +
Sbjct: 109 GEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCK---PCQQCFDQSTPIFDPKQSSSFYKIS 165
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPN 205
C + C + P +T C+ YL YG S T+G+ ET +
Sbjct: 166 CSSELCGAL-------------PTST---CSSDGCEYLYTYGDSSSTQGVLAFETFTFGD 209
Query: 206 RI-----IPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
IP GC ++ Q AG+ G GRG SL SQL KF+YCL + D
Sbjct: 210 STEDQISIPGLGFGCGNDNNGDGFSQGAGLVGLGRGPLSLVSQLKEQKFAYCLTAI---D 266
Query: 257 TTRTSSLILDNGSSHSDKKTTG-LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
++ SSL+L + ++ + K + + TP + NPS +YY+ L+ I+VGG ++
Sbjct: 267 DSKPSSLLLGSLANITPKTSKDEMKTTPLIKNPSQPS------FYYLSLQGISVGGTQLS 320
Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
+ L DG+GG I+DSGTT T++ F L +EF++QM + G
Sbjct: 321 IPKSTFELHDDGSGGVIIDSGTTITYVENSAFTSLKNEFIAQM------NLPVDDSGTGG 374
Query: 376 LRPCFDVP-GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGP 434
L CF++P G P+L HFK GA++ LP ENY + +CL + + R S
Sbjct: 375 LDLCFNLPAGTNQVEVPKLTFHFK-GADLELPGENYMIGDSKAGLLCLAIGSSRGMS--- 430
Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I GN Q QN+ V +DL+ + L F C
Sbjct: 431 --IFGNLQQQNFMVVHDLQEETLSFLPTQC 458
>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 133/387 (34%), Positives = 179/387 (46%), Gaps = 49/387 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G + + +S GTP I+DTGS LVW C C C + P F P SS+ L
Sbjct: 100 GEFLMDMSIGTPAVAYAAIIDTGSDLVWTQCK---PCVECFNQSTPVFDPSSSSTYAALP 156
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPN 205
C + CS D P S CT Y YG S T+G+ +ET L
Sbjct: 157 CSSTLCS-------------DLP---SSKCTSAKCGYTYTYGDSSSTQGVLAAETFTLAK 200
Query: 206 RIIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
+P+ GC + Q AG+ G GRG SL SQL L+KFSYCL S DDT++ S
Sbjct: 201 TKLPDVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGLNKFSYCLTS--LDDTSK-S 257
Query: 262 SLILDNGSS--HSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
L+L + ++ S + + TP + NPS +YYV L+ +TVG + +
Sbjct: 258 PLLLGSLATISESAAAASSVQTTPLIRNPSQPS------FYYVNLKGLTVGSTHITLPSS 311
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
+ DG GG IVDSGT+ T++ + + L F +QM A G+ GL C
Sbjct: 312 AFAVQDDGTGGVIVDSGTSITYLELQGYRALKKAFAAQM----KLPAADGSG--IGLDTC 365
Query: 380 FDVP--GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
F+ P G P+L H GA++ LP ENY + A+CLTV+ R S I
Sbjct: 366 FEAPASGVDQVEVPKLVFHLD-GADLDLPAENYMVLDSGSGALCLTVMGSRGLS-----I 419
Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+GNFQ QN YD+ L F C
Sbjct: 420 IGNFQQQNIQFVYDVGENTLSFAPVQC 446
>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 169 bits (427), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 142/449 (31%), Positives = 207/449 (46%), Gaps = 52/449 (11%)
Query: 33 LSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHS--YGGYS 90
L+R H +PS + Q V +L R +H N + ++ T + + +S G Y
Sbjct: 36 LTRVHADPSVTASQ----FVRGALRRDMHRHNARKLALAASSGATVSAPTQNSPTAGEYL 91
Query: 91 ISLSFGTPPQIIPFILDTGSHLVW---FPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGC 147
++L+ GTPP I DTGS L+W PCT+ C P + P S++ +L C
Sbjct: 92 MALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQ-----CFRQPTPLYNPSSSTTFAVLPC 146
Query: 148 QNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRI 207
+ S+ A C C +Y V YGSG T SET +
Sbjct: 147 NS-------SLSVCAAALAGTGTAPPPGCA--C-TYNVTYGSGWTSVFQGSETFTFGSTP 196
Query: 208 -----IPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTT 258
+P GCS SS +G+ G GRG+ SL SQL + KFSYCL + DT
Sbjct: 197 AGQSRVPGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVPKFSYCL--TPYQDTN 254
Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
TS+L+L G S S T G++ TPFV +PS A N F YY+ L I++G + +
Sbjct: 255 STSTLLL--GPSASLNGTAGVSSTPFVASPSTAPMNTF---YYLNLTGISLGTTALSIPP 309
Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
L+ DG GG I+DSGTT T + ++ + VS + A TGL
Sbjct: 310 DAFLLNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTT-----DGSAATGLDL 364
Query: 379 CFDVPGEKTG--SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
CF +P + + P + LHF GA++ LP ++Y + + CL + + + G
Sbjct: 365 CFMLPSSTSAPPAMPSMTLHFN-GADMVLPADSYM-MSDDSGLWCLAM---QNQTDGEVN 419
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
ILGN+Q QN ++ YD+ + L F C
Sbjct: 420 ILGNYQQQNMHILYDIGQETLSFAPAKCS 448
>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 452
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 142/449 (31%), Positives = 207/449 (46%), Gaps = 52/449 (11%)
Query: 33 LSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQ--TKTTTTTTTTTTTNISSHSYGGYS 90
L+R H +PS + Q V +L R +H N + ++ T + S + G Y
Sbjct: 38 LTRVHADPSVTASQ----FVRGALRRDMHRHNARKLALAASSGATVSAPTQDSPTAGEYL 93
Query: 91 ISLSFGTPPQIIPFILDTGSHLVW---FPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGC 147
++L+ GTPP I DTGS L+W PCT+ C P + P S++ +L C
Sbjct: 94 MALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQ-----CFRQPTPLYNPSSSTTFAVLPC 148
Query: 148 QNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLP--- 204
+ S+ A C C +Y V YGSG T SET
Sbjct: 149 NS-------SLSVCAAALAGTGTAPPPGCA--C-TYNVTYGSGWTSVFQGSETFTFGSTP 198
Query: 205 --NRIIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTT 258
+ +P GCS SS +G+ G GRG+ SL SQL + KFSYCL + DT
Sbjct: 199 AGHARVPGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVPKFSYCL--TPYQDTN 256
Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
TS+L+L G S S T G++ TPFV +PS A N F YY+ L I++G + +
Sbjct: 257 STSTLLL--GPSASLNGTAGVSSTPFVASPSTAPMNTF---YYLNLTGISLGTTALSIPP 311
Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
+L+ DG GG I+DSGTT T + ++ + VS + A TGL
Sbjct: 312 DAFSLNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTT-----DGSADTGLDL 366
Query: 379 CFDVPGEKTG--SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
CF +P + + P + LHF GA++ LP ++Y + + CL + + + G
Sbjct: 367 CFMLPSSTSAPPAMPSMTLHFN-GADMVLPADSYM-MSDDSGLWCLAM---QNQTDGEVN 421
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
ILGN+Q QN ++ YD+ + L F C
Sbjct: 422 ILGNYQQQNMHILYDIGQETLSFAPAKCS 450
>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 168 bits (426), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 150/471 (31%), Positives = 210/471 (44%), Gaps = 70/471 (14%)
Query: 21 IFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSS-----SLTRALH-IKNPQTKT----- 69
I P+S TS S + H P+ + ++ + V S L R H IK +++
Sbjct: 23 IAPTSSTSRKTSFKQQHPCPTTNGFRVMLRHVDSGKNLTKLERVQHGIKRGKSRLQKLNA 82
Query: 70 ---TTTTTTTTTTNISSHSYGG---YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC 123
++T + + + + G Y I L+ GTPP P +LDTGS L+W C C
Sbjct: 83 MVLAASSTPDSEDQLEAPIHAGNGEYLIELAIGTPPVSYPAVLDTGSDLIWTQCK---PC 139
Query: 124 KYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSY 183
C P F PK SSS + C + CS + S C+ C Y
Sbjct: 140 TRCYKQPTPIFDPKKSSSFSKVSCGSSLCSAL----------------PSSTCSDGC-EY 182
Query: 184 LVLYGS-GLTEGIALSETLNL---PNRI-IPNFLVGCSVLSS----RQPAGIAGFGRGKT 234
+ YG +T+G+ +ET N++ + N GC + Q +G+ G GRG
Sbjct: 183 VYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIGFGCGEDNEGDGFEQASGLVGLGRGPL 242
Query: 235 SLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERN 294
SL SQL +FSYCL D T+ S L+L GS K + TP + NP
Sbjct: 243 SLVSQLKEQRFSYCLTPI---DDTKESVLLL--GSLGKVKDAKEVVTTPLLKNPLQPS-- 295
Query: 295 AFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEF 354
+YY+ L I+VG R+ + + DGNGG I+DSGTT T++ + +E L EF
Sbjct: 296 ----FYYLSLEAISVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTITYVQQKAYEALKKEF 351
Query: 355 VSQMVKNRNYTRALGAEALTGLRPCFDVP-GEKTGSFPELKLHFKGGAEVTLPVENYFAV 413
+SQ AL + TGL CF +P G P+L HFKGG ++ LP ENY
Sbjct: 352 ISQT------KLALDKTSSTGLDLCFSLPSGSTQVEIPKLVFHFKGG-DLELPAENYMIG 404
Query: 414 VGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
CL + AS G S I GN Q QN V +DL + + F C
Sbjct: 405 DSNLGVACLAM----GASSGMS-IFGNVQQQNILVNHDLEKETISFVPTSC 450
>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
Length = 475
Score = 167 bits (424), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 130/389 (33%), Positives = 176/389 (45%), Gaps = 41/389 (10%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G + + LS GTP I+DTGS LVW C C C + P F P SS+ L
Sbjct: 114 GEFLMDLSVGTPALPYAAIVDTGSDLVWTQCK---PCVECFNQTTPVFDPAASSTYAALP 170
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPN 205
C + C+ + + ++S + Y YG + T+G+ +ET L
Sbjct: 171 CSSALCADLPTSTCA--------SSSSSSSASSPCGYTYTYGDASSTQGVLATETFTLAR 222
Query: 206 RIIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
+ +P GC + Q AG+ G GRG SL SQL +D+FSYCL S DD S
Sbjct: 223 QKVPGVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGIDRFSYCLTS--LDDAAGRS 280
Query: 262 SLILDNGSSHSDKKTTG-LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
L+L + + S T TP V NPS +YYV L +TVG R+ +
Sbjct: 281 PLLLGSAAGISASAATAPAQTTPLVKNPSQPS------FYYVSLTGLTVGSTRLALPSSA 334
Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
+ DG GG IVDSGT+ T++ + L FV+ M + +E GL CF
Sbjct: 335 FAIQDDGTGGVIVDSGTSITYLELRAYRALRKAFVAHM----SLPTVDASE--IGLDLCF 388
Query: 381 DVPGEKTG-----SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPS 435
P P+L LHF GGA++ LP ENY + A+CLTV+ R S
Sbjct: 389 QGPAGAVDQDVQVQVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVMASRGLS---- 444
Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GNFQ QN+ YD+ L F C
Sbjct: 445 -IIGNFQQQNFQFVYDVAGDTLSFAPAEC 472
>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
Length = 398
Score = 167 bits (422), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 128/395 (32%), Positives = 186/395 (47%), Gaps = 55/395 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y ++S GTP ++ I DTGS L+W C C+ C + K P F P+ SSS +
Sbjct: 38 GDYVTTISLGTPAKVFSVIADTGSDLIWIQCK---PCQACFNQKDPIFDPEGSSSYTTMS 94
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNLPN 205
C + C + K+C+ C Y YG G T G SET+ L +
Sbjct: 95 CGDTLCDSLPR----------------KSCSPDC-DYSYGYGDGSGTRGTLSSETVTLTS 137
Query: 206 R-----IIPNFLVGCSVL---SSRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKF 254
N GC L S +G+ G GRG S SQL KFSYCL+ +
Sbjct: 138 TQGEKLAAKNIAFGCGHLNRGSFNDASGLVGLGRGNLSFVSQLGDLFGHKFSYCLVPWR- 196
Query: 255 DDTTRTSSLIL-DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
D ++TS + D SSHS K +TP ++NP A +YYV L+ I++ G+
Sbjct: 197 DAPSKTSPMFFGDESSSHSSGKKLHYAFTPMIHNP------AMESFYYVKLKDISIAGRA 250
Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
+R+ + DG+GG I DSGTT T + ++ + S++ ++ + G+ A
Sbjct: 251 LRIPAGSFDIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKI----SFPKIDGSSA- 305
Query: 374 TGLRPCFDVPGEKTG---SFPELKLHFKGGAEVTLPVENYFAVVGE-GSAVCLTVVTDRE 429
GL C+DV G K P + HF+ GA+ LPVENYF + G+ VCL +V+
Sbjct: 306 -GLDLCYDVSGSKASYKMKIPAMVFHFE-GADYQLPVENYFIAANDAGTIVCLAMVSSNM 363
Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
G I GN QN+ V YD+ + ++G+ C
Sbjct: 364 DIG----IYGNMMQQNFRVMYDIGSSKIGWAPSQC 394
>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 131/379 (34%), Positives = 182/379 (48%), Gaps = 48/379 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G + ++L+ GTPP+ I+DTGS L+W C C C P F PK SSS L
Sbjct: 98 GEFLMNLAIGTPPETYSAIMDTGSDLIWTQCK---PCTQCFDQPSPIFDPKKSSSFSKLS 154
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPN 205
C + C + S C+ C YL YG T+G +ET
Sbjct: 155 CSSQLCKALPQSS----------------CSDSC-EYLYTYGDYSSTQGTMATETFTFGK 197
Query: 206 RIIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
IPN GC + Q +G+ G GRG SL SQL KFSYCL S D T+TS
Sbjct: 198 VSIPNVGFGCGEDNEGDGFTQGSGLVGLGRGPLSLVSQLKEAKFSYCLTSI---DDTKTS 254
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
+L++ + +S + + + TP + NP +YY+ L I+VGG R+ +
Sbjct: 255 TLLMGSLAS-VNGTSAAIRTTPLIQNPLQPS------FYYLSLEGISVGGTRLPIKESTF 307
Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
L DG GG I+DSGTT T++ F+ + EF SQM + + A TGL C++
Sbjct: 308 QLQDDGTGGLIIDSGTTITYLEESAFDLVKKEFTSQMGLPVDNSGA------TGLELCYN 361
Query: 382 VPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
+P + + P+L LHF GA++ LP ENY +CL + +SGG S I GN
Sbjct: 362 LPSDTSELEVPKLVLHFT-GADLELPGENYMIADSSMGVICLAM----GSSGGMS-IFGN 415
Query: 441 FQMQNYYVEYDLRNQRLGF 459
Q QN +V +DL + L F
Sbjct: 416 VQQQNMFVSHDLEKETLSF 434
>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 461
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 140/458 (30%), Positives = 201/458 (43%), Gaps = 50/458 (10%)
Query: 22 FPSSITSLTFSLSRFHTNPSQDSYQNLNSL--VSSSLTRALHIKNPQTKTTTTTTTTT-- 77
P ++ F LS H DS +NL + + + R H N +
Sbjct: 36 LPKNLPRSGFRLSLRHV----DSGKNLTKIQKIQRGINRGFHRLNRLGAVAVLAVASKPD 91
Query: 78 -TTNISSHSYGG---YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS 133
T NI + ++GG + + LS G P I+DTGS L+W C C C P
Sbjct: 92 DTNNIKAPTHGGSGEFLMELSIGNPAVKYSAIVDTGSDLIWTQCK---PCTECFDQPTPI 148
Query: 134 FIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLT 192
F P+ SSS +GC + C+ + +CN++ A YL YG T
Sbjct: 149 FDPEKSSSYSKVGCSSGLCNALPRS-----NCNEDKDACE---------YLYTYGDYSST 194
Query: 193 EGIALSETLNLPNR-IIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSY 247
G+ +ET + I GC V + Q +G+ G GRG SL SQL KFSY
Sbjct: 195 RGLLATETFTFEDENSISGIGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQLKETKFSY 254
Query: 248 CLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRI 307
CL S +D+ +SSL + + +S KT S+ +YY+ L+ I
Sbjct: 255 CLTS--IEDSEASSSLFIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGI 312
Query: 308 TVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRA 367
TVG +R+ V L DG GG I+DSGTT T++ F+ L +EF S+M +
Sbjct: 313 TVGAKRLSVEKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRM------SLP 366
Query: 368 LGAEALTGLRPCFDVP-GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT 426
+ TGL CF +P K + P++ HFK GA++ LP ENY +CL + +
Sbjct: 367 VDDSGSTGLDLCFKLPDAAKNIAVPKMIFHFK-GADLELPGENYMVADSSTGVLCLAMGS 425
Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
S I GN Q QN+ V +DL + + F C
Sbjct: 426 SNGMS-----IFGNVQQQNFNVLHDLEKETVSFVPTEC 458
>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 462
Score = 166 bits (420), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 141/458 (30%), Positives = 202/458 (44%), Gaps = 50/458 (10%)
Query: 22 FPSSITSLTFSLSRFHTNPSQDSYQNLNSL--VSSSLTRALHIKNPQTKTTTTTTTTT-- 77
P ++ F LS H DS +NL + + + R H N +
Sbjct: 37 LPKNLPRSGFRLSLRHV----DSGKNLTKIQKIQRGINRGFHRLNRLGAVAVLAVASNPD 92
Query: 78 -TTNISSHSYGG---YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS 133
T NI + ++GG + + LS G P I+DTGS L+W C C C P
Sbjct: 93 DTNNIKAPTHGGSGEFLMELSIGNPAVKYAAIVDTGSDLIWTQCK---PCTECFDQPTPI 149
Query: 134 FIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLT 192
F P+ SSS +GC + C+ + +CN++ +C YL YG T
Sbjct: 150 FDPEKSSSYSKVGCSSGLCNALPRS-----NCNED----KDSC-----EYLYTYGDYSST 195
Query: 193 EGIALSETLNLPNR-IIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSY 247
G+ +ET + I GC V + Q +G+ G GRG SL SQL KFSY
Sbjct: 196 RGLLATETFTFEDENSISGIGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQLKETKFSY 255
Query: 248 CLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRI 307
CL S +D+ +SSL + + +S KT S+ +YY+ L+ I
Sbjct: 256 CLTS--IEDSEASSSLFIGSLASGIVNKTGANLDGEVTKTMSLLRNPDQPSFYYLELQGI 313
Query: 308 TVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRA 367
TVG +R+ V L DG GG I+DSGTT T++ F+ L +EF S+M +
Sbjct: 314 TVGAKRLSVEKSTFELSEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRM------SLP 367
Query: 368 LGAEALTGLRPCFDVP-GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT 426
+ TGL CF +P K + P+L HFK GA++ LP ENY +CL + +
Sbjct: 368 VDDSGSTGLDLCFKLPNAAKNIAVPKLIFHFK-GADLELPGENYMVADSSTGVLCLAMGS 426
Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
S I GN Q QN+ V +DL + + F C
Sbjct: 427 SNGMS-----IFGNVQQQNFNVLHDLEKETVTFVPTEC 459
>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
Length = 444
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 130/392 (33%), Positives = 182/392 (46%), Gaps = 48/392 (12%)
Query: 85 SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRL 144
S G Y + + GTP + ILDTGS L+W C C C P F P SS+ R
Sbjct: 88 SDGEYLMEMGIGTPARFYSAILDTGSDLIWTQCA---PCLLCVDQPTPYFDPANSSTYRS 144
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETL-- 201
LGC P C+ +++ PL K C Y YG S T G+ +ET
Sbjct: 145 LGCSAPACNALYY-----------PLCYQKTCV-----YQYFYGDSASTAGVLANETFTF 188
Query: 202 --NLPNRIIPNFLVGCSVLSSRQPA---GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
N +P GC L++ A G+ GFGRG SL SQL +FSYCL S F
Sbjct: 189 GTNDTRVTLPRISFGCGNLNAGSLANGSGMVGFGRGSLSLVSQLGSPRFSYCLTS--FLS 246
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
R S L ++ + + + TPF+ NP A Y++ + I+VGG R+ +
Sbjct: 247 PVR-SRLYFGAYATLNSTNASTVQSTPFIINP------ALPTMYFLNMTGISVGGNRLPI 299
Query: 317 WHKYLTL-DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
L + D DG GGTI+DSGTT T++A + + + FV + + L +
Sbjct: 300 DPAVLAINDTDGTGGTIIDSGTTITYLAEPAYYAVREAFVLYL---NSTLPLLDVTETSV 356
Query: 376 LRPCFDV--PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
L CF P ++ + P+L LHF GA+ LP++NY V +CL + T + S
Sbjct: 357 LDTCFQWPPPPRQSVTLPQLVLHFD-GADWELPLQNYMLVDPSTGGLCLAMATSSDGS-- 413
Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
I+G++Q QN+ V YDL N L F C
Sbjct: 414 ---IIGSYQHQNFNVLYDLENSLLSFVPAPCN 442
>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
Length = 398
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 130/395 (32%), Positives = 187/395 (47%), Gaps = 55/395 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y ++S GTP ++ I DTGS L+W C C+ C + K P F P+ SSS +
Sbjct: 38 GDYVTTISLGTPAKVFSVIADTGSDLIWIQCK---PCQACFNQKDPIFDPEGSSSYTTMS 94
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNLPN 205
C + C +S+ + C S NC Y YG G T G SET+ L +
Sbjct: 95 CGDTLC-----DSLPRKSC-------SPNC-----DYSYGYGDGSGTRGTLSSETVTLTS 137
Query: 206 R-----IIPNFLVGCSVL---SSRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKF 254
N GC L S +G+ G GRG S SQL KFSYCL+ +
Sbjct: 138 TQGEKLAAKNIAFGCGHLNRGSFNDASGLVGLGRGNLSFVSQLGDLFGHKFSYCLVPWR- 196
Query: 255 DDTTRTSSLIL-DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
D ++TS + D SSHS K +TP ++NP A +YYV L+ I++ G+
Sbjct: 197 DAPSKTSPMFFGDESSSHSSGKKLHYAFTPMIHNP------AMESFYYVKLKDISIAGRA 250
Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
+R+ + DG+GG I DSGTT T + ++ + S++ ++ G+ A
Sbjct: 251 LRIPAGSFDIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKV----SFPEIDGSSA- 305
Query: 374 TGLRPCFDVPGEKTG---SFPELKLHFKGGAEVTLPVENYFAVVGE-GSAVCLTVVTDRE 429
GL C+DV G K P + HF+ GA+ LPVENYF + G+ VCL +V+
Sbjct: 306 -GLDLCYDVSGSKASYKKKIPAMVFHFE-GADHQLPVENYFIAANDAGTIVCLAMVSSNM 363
Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
G I GN QN+ V YD+ + ++G+ C
Sbjct: 364 DIG----IYGNMMQQNFRVMYDIGSSKIGWAPSQC 394
>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 131/386 (33%), Positives = 190/386 (49%), Gaps = 53/386 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y I +SFG+PPQ I+DTGS L+W C C+ C+++ F P SS+ +
Sbjct: 78 GEYLIDISFGSPPQKASVIVDTGSDLIWTQC---LPCETCNAAASVIFDPVKSSTYDTVS 134
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALS-ETLNLPN 205
C + CS + +S CT C Y +YG G + ALS ET+ +
Sbjct: 135 CASNFCSSLPFQS----------------CTTSC-KYDYMYGDGSSTSGALSTETVTVGT 177
Query: 206 RIIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTTR 259
IPN GC ++ S AGI G G+G SL SQ + KFSYCL+ +T+
Sbjct: 178 GTIPNVAFGCGHTNLGSFAGAAGIVGLGQGPLSLISQASSITSKKFSYCLVPLG---STK 234
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
TS +++ + ++ G+ YT + N A +YY L I+V G+ V
Sbjct: 235 TSPMLIGDSAAAG-----GVAYTALLTN------TANPTFYYADLTGISVSGKAVTYPVG 283
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
++D G GG I+DSGTT T++ F L V+ + + A G +L GL C
Sbjct: 284 TFSIDASGQGGFILDSGTTLTYLETGAFNAL----VAALKAEVPFPEADG--SLYGLDYC 337
Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
F G ++P + HFK GA+ LP EN F + G ++CL + AS G S I+G
Sbjct: 338 FSTAGVANPTYPTMTFHFK-GADYELPPENVFVALDTGGSICLAMA----ASTGFS-IMG 391
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLCK 465
N Q QN+ + +DL NQR+GFK+ C+
Sbjct: 392 NIQQQNHLIVHDLVNQRVGFKEANCE 417
>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 132/388 (34%), Positives = 178/388 (45%), Gaps = 53/388 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + L+ GTPP P +LDTGS L+W C C C P F PK SSS +
Sbjct: 106 GEYLMELAIGTPPVSYPAVLDTGSDLIWTQCK---PCTQCYKQPTPIFDPKKSSSFSKVS 162
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNL-- 203
C + CS + S C+ C Y+ YG +T+G+ +ET
Sbjct: 163 CGSSLCSAV----------------PSSTCSDGC-EYVYSYGDYSMTQGVLATETFTFGK 205
Query: 204 -PNRI-IPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDT 257
N++ + N GC + Q +G+ G GRG SL SQL +FSYCL D
Sbjct: 206 SKNKVSVHNIGFGCGEDNEGDGFEQASGLVGLGRGPLSLVSQLKEPRFSYCLTPM---DD 262
Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
T+ S L+L GS K + TP + NP +YY+ L I+VG R+ +
Sbjct: 263 TKESILLL--GSLGKVKDAKEVVTTPLLKNPLQPS------FYYLSLEGISVGDTRLSIE 314
Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
+ DGNGG I+DSGTT T++ + FE L EF+SQ + T + TGL
Sbjct: 315 KSTFEVGDDGNGGVIIDSGTTITYIEQKAFEALKKEFISQTKLPLDKTSS------TGLD 368
Query: 378 PCFDVP-GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
CF +P G P++ HFKGG ++ LP ENY CL + AS G S
Sbjct: 369 LCFSLPSGSTQVEIPKIVFHFKGG-DLELPAENYMIGDSNLGVACLAM----GASSGMS- 422
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I GN Q QN V +DL + + F C
Sbjct: 423 IFGNVQQQNILVNHDLEKETISFVPTSC 450
>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
Length = 749
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 130/394 (32%), Positives = 188/394 (47%), Gaps = 46/394 (11%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + GTPP+ ILDTGS L W C C C P + PK SSS +
Sbjct: 190 GEYFMDVFIGTPPKHYSLILDTGSDLNWIQCV---PCIACFEQSGPYYDPKESSSFENIT 246
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG--SGLTEGIALSE-TLNL 203
C +P+C + + +P K+ Q CP Y YG S T AL T+NL
Sbjct: 247 CHDPRCKLV---------SSPDPPKPCKDENQTCP-YFYWYGDSSNTTGDFALETFTVNL 296
Query: 204 --PN-----RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLL 250
PN + + N + GC + AG+ G GRG S SQL FSYCL+
Sbjct: 297 TTPNGKSEQKHVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFASQLQSIYGHSFSYCLV 356
Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
DT+ +S LI G L +T FV E N+ +YYVG++ I V
Sbjct: 357 DRN-SDTSVSSKLIF--GEDKELLSHPNLNFTSFVG----GEENSVDTFYYVGIKSIMVD 409
Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
G+ +++ + L ++G GGTI+DSGTT T+ A +E + + F M K + Y
Sbjct: 410 GEVLKIPEETWHLSKEGGGGTIIDSGTTLTYFAEPAYEIIKEAF---MKKIKGYEL---V 463
Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA 430
E L+PC++V G + P+ + F GA PVENYF + E VCL ++ ++
Sbjct: 464 EGFPPLKPCYNVSGIEKMELPDFGILFSDGAMWDFPVENYFIQI-EPDLVCLAILGTPKS 522
Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ I+GN+Q QN+++ YD++ RLG+ C
Sbjct: 523 ALS---IIGNYQQQNFHILYDMKKSRLGYAPMKC 553
>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
Length = 390
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 126/390 (32%), Positives = 182/390 (46%), Gaps = 49/390 (12%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + L+ GTPPQ + LDTGS L+W C C C +P F SS++ LL C+
Sbjct: 35 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCK---PCVSCFDQPLPYFDTSRSSTNALLPCE 91
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLN-LPNRI 207
+ +C ++ C N Q C Y + +T G+ ++ +
Sbjct: 92 STQCKLDPTVTV-CVKLNQT--------VQTCAYYTSYGDNSVTIGLLAADKFTFVAGTS 142
Query: 208 IPNFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRT-- 260
+P GC V +S + GIAGFGRG SLPSQL + FS+C TT T
Sbjct: 143 LPGVTFGCGLNNTGVFNSNE-TGIAGFGRGPLSLPSQLKVGNFSHCF-------TTITGA 194
Query: 261 --SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
S+++LD + + TP + A+ A YY+ L+ ITVG R+ V
Sbjct: 195 IPSTVLLDLPADLFSNGQGAVQTTPLIQ---YAKNEANPTLYYLSLKGITVGSTRLPVPE 251
Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
L +G GGTI+DSGT+ T + P++++ + DEF +Q+ + TG
Sbjct: 252 SAFAL-TNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQI------KLPVVPGNATGHYT 304
Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGE---GSAVCLTVVTDREASGGPS 435
CF P + P+L LHF+ GA + LP ENY V + S +CL + E +
Sbjct: 305 CFSAPSQAKPDVPKLVLHFE-GATMDLPRENYVFEVPDDAGNSIICLAINKGDETT---- 359
Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
I+GNFQ QN +V YDL+N L F C
Sbjct: 360 -IIGNFQQQNMHVLYDLQNNMLSFVAAQCD 388
>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 469
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 142/479 (29%), Positives = 214/479 (44%), Gaps = 61/479 (12%)
Query: 10 LSFIFFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQT-- 67
L+ + F + + S S+ L+R H++P + Q V +L R +H + ++
Sbjct: 27 LAVLVFLVVCATLASGAASVRVGLTRIHSDPDTTAPQ----FVRDALRRDMHRQRSRSFG 82
Query: 68 --------KTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTN 119
++ TT + T + G Y ++L+ GTPP + DTGS L+W C
Sbjct: 83 RDRDRELAESDGRTTVSARTRKDLPNGGEYLMTLAIGTPPLPYAAVADTGSDLIWTQCA- 141
Query: 120 HYQC-KYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQ 178
C C P + P S++ +L C + S+ A C
Sbjct: 142 --PCGTQCFEQPAPLYNPASSTTFSVLPC---------NSSLSMCAGALAGAAPPPGCAC 190
Query: 179 ICPSYLVLYGSGLTEGIALSETLNLPNRI-----IPNFLVGCSVLSSRQ---PAGIAGFG 230
+ Y YG+G T G+ SET + +P GCS SS AG+ G G
Sbjct: 191 M---YNQTYGTGWTAGVQGSETFTFGSSAADQARVPGVAFGCSNASSSDWNGSAGLVGLG 247
Query: 231 RGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSV 290
RG SL SQL +FSYCL F DT TS+L+L ++ + TG+ TPFV +P+
Sbjct: 248 RGSLSLVSQLGAGRFSYCL--TPFQDTNSTSTLLLGPSAALNG---TGVRSTPFVASPA- 301
Query: 291 AERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPL 350
R S YYY+ L I++G + + + +L DG GG I+DSGTT T +A ++
Sbjct: 302 --RAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTGGLIIDSGTTITSLANAAYQQ- 358
Query: 351 ADEFVSQMVKNRNYT-RALGAEALTGLRPCFDVPGEKTGS---FPELKLHFKGGAEVTLP 406
V VK+ T + TGL CF +P + P + LHF GA++ LP
Sbjct: 359 ----VRAAVKSLVTTLPTVDGSDSTGLDLCFALPAPTSAPPAVLPSMTLHFD-GADMVLP 413
Query: 407 VENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
++Y + G G CL + R + G GN+Q QN ++ YD+R + L F C
Sbjct: 414 ADSYM-ISGSG-VWCLAM---RNQTDGAMSTFGNYQQQNMHILYDVREETLSFAPAKCS 467
>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 132/386 (34%), Positives = 186/386 (48%), Gaps = 52/386 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G + + L+ GTPP+ ILDTGS L+W C C C P F PK SSS L
Sbjct: 95 GEFLMKLAIGTPPETYSAILDTGSDLIWTQCK---PCTQCFHQSTPIFDPKKSSSFSKLS 151
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPN 205
C + C E++ CN+ C YL YG T+GI SETL
Sbjct: 152 CSSQLC-----EALPQSSCNN-------GC-----EYLYSYGDYSSTQGILASETLTFGK 194
Query: 206 RIIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
+PN GC + Q AG+ G GRG SL SQL KFSYCL + D T+TS
Sbjct: 195 ASVPNVAFGCGADNEGSGFSQGAGLVGLGRGPLSLVSQLKEPKFSYCLTTV---DDTKTS 251
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
+L++ + +S + ++ + TP +++P A +YY+ L I+VG R+ +
Sbjct: 252 TLLMGSLAS-VNASSSAIKTTPLIHSP------AHPSFYYLSLEGISVGDTRLPIKKSTF 304
Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
+L DG+GG I+DSGTT T++ F +A EF +++ + + TGL CF
Sbjct: 305 SLQDDGSGGLIIDSGTTITYLEESAFNLVAKEFTAKI------NLPVDSSGSTGLDVCFT 358
Query: 382 VP-GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGS--AVCLTVVTDREASGGPSIIL 438
+P G P+L HF GA++ LP ENY ++G+ S CL + + S I
Sbjct: 359 LPSGSTNIEVPKLVFHFD-GADLELPAENY--MIGDSSMGVACLAMGSSSGMS-----IF 410
Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
GN Q QN V +DL + L F C
Sbjct: 411 GNVQQQNMLVLHDLEKETLSFLPTQC 436
>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 162 bits (411), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 146/493 (29%), Positives = 224/493 (45%), Gaps = 69/493 (13%)
Query: 6 SALCLSFIFFFTLLSIFPSSITSLTFSLSR--------FHTNPSQD--SYQNLNSLVSSS 55
S+ C S + +++L +FP + LTFSL+ H + + ++ L +V+ S
Sbjct: 4 SSACNSTMKGWSVLQLFPC-VLLLTFSLAESAALRADLTHVDSGRGFTKHELLRRMVARS 62
Query: 56 LTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTP-PQIIPFILDTGSHLVW 114
R +++ + T T S Y I L GTP PQ + LDTGS LVW
Sbjct: 63 KARLASLRS--SACDTALTAPVDHGGSDVGSSEYLIHLGIGTPRPQRVVLHLDTGSDLVW 120
Query: 115 FPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSK 174
C C C +P F +S + + C +P C H + C A +
Sbjct: 121 TQCA----CTVCFDQPVPVFRASVSHTFSRVPCSDPLCG--HAVYLPLSGCA----ARDR 170
Query: 175 NCTQICPSYLVLYG---SGLTEGIALSETLNL--PNRI-----IPNFLVGCSVLS----S 220
+C YG +T G +T P+R +PN GC +++ +
Sbjct: 171 SC-------FYAYGYMDHSITTGKMAEDTFTFKAPDRADTAAAVPNIRFGCGMMNYGLFT 223
Query: 221 RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTG-L 279
+GIAGFG G SLPSQL + +FSYC + + +R S +IL + + TG +
Sbjct: 224 PNQSGIAGFGTGPLSLPSQLKVRRFSYCFTAM---EESRVSPVILGGEPENIEAHATGPI 280
Query: 280 TYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTF 339
TPF P+ A + +Y++ LR +TVG R+ L DG+GGT +DSGT
Sbjct: 281 QSTPFAPGPAGAPVGS-QPFYFLSLRGVTVGETRLPFNASTFALKGDGSGGTFIDSGTAI 339
Query: 340 TFMAPELFEPLADEFVSQ--MVKNRNYTRALGAEALTGLRPCFDVPGEKTG-SFPELKLH 396
TF +F L + FV+Q + + YT + L CF VP +K + P+L LH
Sbjct: 340 TFFPQAVFRSLREAFVAQVPLPVAKGYTD---PDNLL----CFSVPAKKKAPAVPKLILH 392
Query: 397 FKGGAEVTLPVENYFAV-----VGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYD 451
+ GA+ LP ENY G G +C+ +++ ++G I+GNFQ QN ++ YD
Sbjct: 393 LE-GADWELPRENYVLDNDDDGSGAGRKLCVVILSAGNSNG---TIIGNFQQQNMHIVYD 448
Query: 452 LRNQRLGFKQQLC 464
L + ++ F C
Sbjct: 449 LESNKMVFAPARC 461
>gi|224035171|gb|ACN36661.1| unknown [Zea mays]
Length = 378
Score = 162 bits (409), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 123/356 (34%), Positives = 168/356 (47%), Gaps = 41/356 (11%)
Query: 142 SRLLGCQNPKCSWIHHES-----IQCRDCNDEPLAT-SKNCTQICPSYLVLYGSG----- 190
SR + C +P CS H + C E + T S + CP YG G
Sbjct: 21 SRRIPCASPLCSAAHASAPPSDLCAVARCPLEDIETGSCGASHACPPLYYAYGDGSLVAH 80
Query: 191 LTEG-IALSETLNLPNRI-IPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLD---KF 245
L G +AL + + NF C+ + +P G+AGFGRG SLP QL+ +F
Sbjct: 81 LRRGRVALGAGARASVAVAVDNFTFACAHTALGEPVGVAGFGRGPLSLPGQLSPQLSGRF 140
Query: 246 SYCLLSHKF--DDTTRTSSLILDNG--SSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYY 301
SYCL+SH F D R S LIL + + +T G YTP ++NP +Y
Sbjct: 141 SYCLVSHSFRADRLIRPSPLILGRSPDDAAAAAETDGFVYTPLLHNPK------HPYFYS 194
Query: 302 VGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN 361
V L ++VG R++ + +DR GNGG +VDSGTTFT + E++ +A+ F M
Sbjct: 195 VALEAVSVGAARIQARPELARVDRAGNGGMVVDSGTTFTMLPNEMYARVAEAFARAMAAA 254
Query: 362 RNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYF---------A 412
AE TGL PC+ G P L LHF+G A V LP NYF A
Sbjct: 255 GFARAER-AEEQTGLTPCYRYAASDRG-VPPLALHFRGNATVALPRRNYFMGFKSEDAGA 312
Query: 413 VVGEGSAVCLTVVTDREASG----GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ CL ++ +ASG GP+ LGNFQ Q + V YD+ R+GF ++ C
Sbjct: 313 GTRKDDVGCLMLMNGGDASGEEGDGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 368
>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
Length = 435
Score = 161 bits (407), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 131/393 (33%), Positives = 180/393 (45%), Gaps = 51/393 (12%)
Query: 84 HSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSR 143
+ GGY++++S GTP P + DTGS L+W C C C P F P SS+
Sbjct: 81 NGVGGYNMNISVGTPLLTFPVVADTGSDLIWTQCA---PCTKCFQQPAPPFQPASSSTFS 137
Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL 203
L C + C ++ + R CN + C Y YGSG T G +ETL +
Sbjct: 138 KLPCTSSFCQFLPNS---IRTCN------ATGCV-----YNYKYGSGYTAGYLATETLKV 183
Query: 204 PNRIIPNFLVGCSVLS--SRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
+ P+ GCS + +GIAG GRG SL QL + +FSYCL S S
Sbjct: 184 GDASFPSVAFGCSTENGVGNSTSGIAGLGRGALSLIPQLGVGRFSYCLRSGS---AAGAS 240
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
++ + ++ +D + TPFVNNP+V YYYV L ITVG + V
Sbjct: 241 PILFGSLANLTDGN---VQSTPFVNNPAVHPS-----YYYVNLTGITVGETDLPVTTSTF 292
Query: 322 TLDRDG-NGGTIVDSGTTFTFMAPELFEPLADEFVSQM--VKNRNYTRALGAEALTGLRP 378
++G GGTIVDSGTT T++A + +E + F+SQ V N TR GL
Sbjct: 293 GFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTANVTTVNGTR--------GLDL 344
Query: 379 CFDVPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVV-----GEGSAVCLTVVTDREASG 432
CF G G + P L L F GGAE +P YFA V G + CL ++ +
Sbjct: 345 CFKSTGGGGGIAVPSLVLRFDGGAEYAVP--TYFAGVETDSQGSVTVACLMMLPAKGDQ- 401
Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
P ++GN + ++ YDL F C
Sbjct: 402 -PMSVIGNVMQMDMHLLYDLDGGIFSFSPADCA 433
>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
Length = 448
Score = 161 bits (407), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 139/488 (28%), Positives = 221/488 (45%), Gaps = 65/488 (13%)
Query: 1 MASYISALCLSFIFFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRAL 60
MAS + L L TLL S++ + F L H + + SY L LV+ ++ R+
Sbjct: 1 MASPVLVLAL---VAATLLPASHCSVSGVGFQLKLRHVD-AHGSYTKLE-LVTRAIRRSR 55
Query: 61 HIKNPQTKTTTTTTT--------TTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHL 112
T T + + S G Y + L+ GTPP ++DTGS L
Sbjct: 56 ARVAALQAVAAAAATVAPVVDPITAARILVAASQGEYLMDLAIGTPPLRYTAMVDTGSDL 115
Query: 113 VWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLAT 172
+W C C C+ P F P S++ RL+ C++P C+ + + + R
Sbjct: 116 IWTQCA---PCVLCADQPTPYFRPARSATYRLVPCRSPLCAALPYPACFQR--------- 163
Query: 173 SKNCTQICPSYLVLYGS-GLTEGIALSETL-----NLPNRIIPNFLVGCSVLSSRQPA-- 224
+C Y YG T G+ SET N ++ + GC ++S Q A
Sbjct: 164 -----SVC-VYQYYYGDEASTAGVLASETFTFGAANSSKVMVSDVAFGCGNINSGQLANS 217
Query: 225 -GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILD--NGSSHSDKKTTGLTY 281
G+ G GRG SL SQL +FSYCL S + +R + + NG++ S + +
Sbjct: 218 SGMVGLGRGPLSLVSQLGPSRFSYCLTSFLSPEPSRLNFGVFATLNGTNASSSGSP-VQS 276
Query: 282 TPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTF 341
TP V N A Y++ L+ I++G +R+ + ++ DG GG +DSGT+ T+
Sbjct: 277 TPLVVN------AALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGVFIDSGTSLTW 330
Query: 342 MAPELFEPLADEFVSQM--VKNRNYTRALGAEALTGLRPCFDVPGEKTGS--FPELKLHF 397
+ + ++ + E VS + + N T GL CF P + + P+++LHF
Sbjct: 331 LQQDAYDAVRHELVSVLRPLPPTNDTE-------IGLETCFPWPPPPSVAVTVPDMELHF 383
Query: 398 KGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRL 457
GGA +T+P ENY + G +CL ++ +A+ I+GN+Q QN ++ YD+ N L
Sbjct: 384 DGGANMTVPPENYMLIDGATGFLCLAMIRSGDAT-----IIGNYQQQNMHILYDIANSLL 438
Query: 458 GFKQQLCK 465
F C
Sbjct: 439 SFVPAPCN 446
>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
Length = 448
Score = 161 bits (407), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 139/488 (28%), Positives = 221/488 (45%), Gaps = 65/488 (13%)
Query: 1 MASYISALCLSFIFFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRAL 60
MAS + L L TLL S++ + F L H + + SY L LV+ ++ R+
Sbjct: 1 MASPVLVLAL---VAATLLPASHCSVSGVGFQLKLRHVD-AHGSYTKLE-LVTRAIRRSR 55
Query: 61 HIKNPQTKTTTTTTT--------TTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHL 112
T T + + S G Y + L+ GTPP ++DTGS L
Sbjct: 56 ARVAALQAVAAAAATVAPVVDPITAARILVAASQGEYLMDLAIGTPPLRYTAMVDTGSDL 115
Query: 113 VWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLAT 172
+W C C C+ P F P S++ RL+ C++P C+ + + + R
Sbjct: 116 IWTQCA---PCVLCADQPTPYFRPARSATYRLVPCRSPLCAALPYPACFQR--------- 163
Query: 173 SKNCTQICPSYLVLYGS-GLTEGIALSETL-----NLPNRIIPNFLVGCSVLSSRQPA-- 224
+C Y YG T G+ SET N ++ + GC ++S Q A
Sbjct: 164 -----SVC-VYQYYYGDEASTAGVLASETFTFGAANSSKVMVSDVAFGCGNINSGQLANS 217
Query: 225 -GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILD--NGSSHSDKKTTGLTY 281
G+ G GRG SL SQL +FSYCL S + +R + + NG++ S + +
Sbjct: 218 SGMVGLGRGPLSLVSQLGPSRFSYCLTSFLSPEPSRLNFGVFATLNGTNASSSGSP-VQS 276
Query: 282 TPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTF 341
TP V N A Y++ L+ I++G +R+ + ++ DG GG +DSGT+ T+
Sbjct: 277 TPLVVN------AALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGVFIDSGTSLTW 330
Query: 342 MAPELFEPLADEFVSQM--VKNRNYTRALGAEALTGLRPCFDVPGEKTGSF--PELKLHF 397
+ + ++ + E VS + + N T GL CF P + + P+++LHF
Sbjct: 331 LQQDAYDAVRRELVSVLRPLPPTNDTE-------IGLETCFPWPPPPSVAVTVPDMELHF 383
Query: 398 KGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRL 457
GGA +T+P ENY + G +CL ++ +A+ I+GN+Q QN ++ YD+ N L
Sbjct: 384 DGGANMTVPPENYMLIDGATGFLCLAMIRSGDAT-----IIGNYQQQNMHILYDIANSLL 438
Query: 458 GFKQQLCK 465
F C
Sbjct: 439 SFVPAPCN 446
>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
Length = 436
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 124/384 (32%), Positives = 179/384 (46%), Gaps = 51/384 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G + ++L+ GTP + I+DTGS L+W C CK C P F P+ SSS L
Sbjct: 95 GEFLMNLAIGTPAETYSAIMDTGSDLIWTQCK---PCKVCFDQPTPIFDPEKSSSFSKLP 151
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPN 205
C + C + P+++ C+ C Y YG T+G+ +ET +
Sbjct: 152 CSSDLCVAL-------------PISS---CSDGC-EYRYSYGDHSSTQGVLATETFTFGD 194
Query: 206 RIIPNFLVGCSV----LSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
+ GC + Q AG+ G GRG SL SQL + KFSYCL S DD+ S
Sbjct: 195 ASVSKIGFGCGEDNRGRAYSQGAGLVGLGRGPLSLISQLGVPKFSYCLTS--IDDSKGIS 252
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
+L++ S+ TP + NPS R +F YY+ L I+VG + +
Sbjct: 253 TLLV-----GSEATVKSAIPTPLIQNPS---RPSF---YYLSLEGISVGDTLLPIEKSTF 301
Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
++ DG+GG I+DSGTT T++ F L EF+SQM + + A T L CF
Sbjct: 302 SIQDDGSGGLIIDSGTTITYLKDNAFAALKKEFISQMKLD------VDASGSTELELCFT 355
Query: 382 VPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
+P + + P+L HF+ G ++ LP ENY +CLT+ + S I GN
Sbjct: 356 LPPDGSPVEVPQLVFHFE-GVDLKLPKENYIIEDSALRVICLTMGSSSGMS-----IFGN 409
Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
FQ QN V +DL + + F C
Sbjct: 410 FQQQNIVVLHDLEKETISFAPAQC 433
>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
gi|194702684|gb|ACF85426.1| unknown [Zea mays]
gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
Length = 439
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 136/459 (29%), Positives = 207/459 (45%), Gaps = 79/459 (17%)
Query: 33 LSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSY-GGYSI 91
L+R H +PS + Q V ++L R +H N + +++ T + +S + G + +
Sbjct: 32 LTRVHADPSVTASQ----FVRAALHRDMHRHNARKLAASSSDGTVSAPVSPTTVPGEFLM 87
Query: 92 SLSFGTPPQIIPF--ILDTGSHLVWFPCTNHYQC-KYCSSSKIPSFIPKLSSSSRLLGCQ 148
+L+ GTPP +PF I DTGS L+W C C + C P + P S++ L C
Sbjct: 88 TLAIGTPP--LPFLAIADTGSDLIWTQCA---PCSRQCFQQPTPLYNPSSSTTFSALPCN 142
Query: 149 N------PKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLN 202
+ P C+ +++ + YGSG T +ET
Sbjct: 143 SSLGLCAPACACMYN---------------------------MTYGSGWTYVFQGTETFT 175
Query: 203 LPNRI------IPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSH 252
+ +P GCS SS +G+ G GRG SL SQL KFSYCL
Sbjct: 176 FGSSTPADQVRVPGIAFGCSNASSGFNASSASGLVGLGRGSLSLVSQLGAPKFSYCL--T 233
Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
+ DT TS+L+L G S S T ++ TPFV +PS S+YYY+ L I++G
Sbjct: 234 PYQDTNSTSTLLL--GPSASLNDTGVVSSTPFVASPS-------SIYYYLNLTGISLGTT 284
Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
+ + +L DG GG I+DSGTT T + ++ + +S + A
Sbjct: 285 ALPIPPNAFSLKADGTGGLIIDSGTTITMLGNTAYQQVRAAVLSLVTLPTT-----DGSA 339
Query: 373 LTGLRPCFDVPGEKTG--SFPELKLHFKGGAEVTLPVENYFAVVGEGSAV----CLTVVT 426
TGL CF++P + S P + LHF GA++ LP +NY + + + CL +
Sbjct: 340 ATGLDLCFELPSSTSAPPSMPSMTLHFD-GADMVLPADNYMMSLSDPDSDSSLWCLAMQN 398
Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
+ G ILGN+Q QN ++ YD+ + L F C
Sbjct: 399 QTDTDGVVVSILGNYQQQNMHILYDVGKETLSFAPAKCS 437
>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
Length = 436
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 124/384 (32%), Positives = 179/384 (46%), Gaps = 51/384 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G + ++L+ GTP + I+DTGS L+W C CK C P F P+ SSS L
Sbjct: 95 GEFLMNLAIGTPAETYSAIMDTGSDLIWTQCK---PCKVCFDQPTPIFDPEKSSSFSKLP 151
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPN 205
C + C + P+++ C+ C Y YG T+G+ +ET +
Sbjct: 152 CSSDLCVAL-------------PISS---CSDGC-EYRYSYGDHSSTQGVLATETFTFGD 194
Query: 206 RIIPNFLVGCSV----LSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
+ GC + Q AG+ G GRG SL SQL + KFSYCL S DD+ S
Sbjct: 195 ASVSKIGFGCGEDNRGRAYSQGAGLVGLGRGPLSLISQLGVPKFSYCLTS--IDDSKGIS 252
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
+L++ S+ TP + NPS R +F YY+ L I+VG + +
Sbjct: 253 TLLV-----GSEATVKSAIPTPLIQNPS---RPSF---YYLSLEGISVGDTLLPIEKSTF 301
Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
++ DG+GG I+DSGTT T++ F L EF+SQM + + A T L CF
Sbjct: 302 SIQDDGSGGLIIDSGTTITYLKDSAFAALKKEFISQMKLDVD------ASGSTELELCFT 355
Query: 382 VPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
+P + + P+L HF+ G ++ LP ENY +CLT+ + S I GN
Sbjct: 356 LPPDGSPVDVPQLVFHFE-GVDLKLPKENYIIEDSALRVICLTMGSSSGMS-----IFGN 409
Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
FQ QN V +DL + + F C
Sbjct: 410 FQQQNIVVLHDLEKETISFAPAQC 433
>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 443
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 124/392 (31%), Positives = 175/392 (44%), Gaps = 46/392 (11%)
Query: 85 SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRL 144
S G Y +S+ GTPP+ ILDTGS L+W C C C P F P S S
Sbjct: 85 SEGEYLMSMGIGTPPRYYSAILDTGSDLIWTQCA---PCMLCVDQPTPFFDPAQSPSYAK 141
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETL-- 201
L C +P C+ +++ PL C Y YG S T G+ +ET
Sbjct: 142 LPCNSPMCNALYY-----------PLCYRNVCV-----YQYFYGDSANTAGVLSNETFTF 185
Query: 202 --NLPNRIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
N +P GC L++ +G+ GFGRG SL SQL +FSYCL S
Sbjct: 186 GTNDTRVTVPRIAFGCGNLNAGSLFNGSGMVGFGRGPLSLVSQLGSPRFSYCLTSFMSPV 245
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
+R +S S + TPF+ NP + YY+ + I+VGG+ + +
Sbjct: 246 PSRLYFGAYATLNSTSASTGEPVQSTPFIVNPGLPTM------YYLNMTGISVGGELLPI 299
Query: 317 WHKYLTL-DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
+ D DG GG I+DSG+T T++A ++ + F Q+ +L
Sbjct: 300 DPSVFAINDADGTGGVIIDSGSTITYLARAAYDMVHQAFADQVGLPLTNATSLADV---- 355
Query: 376 LRPCF--DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
L CF P K + PEL HF+ GA + LP+ENY + G+ +CL + + S
Sbjct: 356 LDTCFVWPPPPRKIVTMPELAFHFE-GANMELPLENYMLIDGDTGNLCLAIAASDDGS-- 412
Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
I+G+FQ QN++V YD N L F C
Sbjct: 413 ---IIGSFQHQNFHVLYDNENSLLSFTPATCN 441
>gi|297800470|ref|XP_002868119.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297313955|gb|EFH44378.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 499
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 132/408 (32%), Positives = 191/408 (46%), Gaps = 65/408 (15%)
Query: 106 LDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDC 165
LDTGS LVWFPC + C C S +P P SSS + H S+ D
Sbjct: 98 LDTGSDLVWFPC-RPFTCILCESKPLPPSPPPTLSSSATTVSCSSPSCSAAHSSLPSSD- 155
Query: 166 NDEPLATSKNC-------------TQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFL 212
L NC + CP + YG G S++L+LP+ + NF
Sbjct: 156 ----LCAISNCPLDYIETGDCNTSSYPCPPFYYAYGDGSLVAKLFSDSLSLPSVSVANFT 211
Query: 213 VGCSVLSSRQPAGIAGFGRGKTSLPSQLNL------DKFSYCLLSHKFDD--TTRTSSLI 264
GC+ + +P G+AGFGRG+ SLP+QL++ + FSYCL+SH FD R S LI
Sbjct: 212 FGCAHTTLAEPIGVAGFGRGRLSLPAQLSVHSPHLGNSFSYCLVSHSFDSDRVRRPSPLI 271
Query: 265 LDNGSSHSDKKTTG----------------LTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
L +K+ +T + NP +Y V L+ I+
Sbjct: 272 LGRFVDKKEKRVATTDDDDDGDETKKKKNEFVFTEMLVNPK------HPYFYSVSLQGIS 325
Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
+G + + +D++G GG +VDSGTTFT + + + + +EF S++ R + RA
Sbjct: 326 IGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRV--GRVHERAD 383
Query: 369 GAEALTGLRPCFDVPGEKTGSFPELKLHFKG-GAEVTLPVENYFAVVGEG--------SA 419
E +G+ PC+ + +T P L LHF G G+ VTLP NYF +G
Sbjct: 384 RVEPSSGMSPCYYL--NQTVKVPALVLHFAGNGSTVTLPRRNYFYEFMDGGDGKEEKRKV 441
Query: 420 VCLTVVT---DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
CL ++ + E GG ILGN+Q Q + V YDL N+R+GF ++ C
Sbjct: 442 GCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGFAKRKC 489
>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
gi|194708650|gb|ACF88409.1| unknown [Zea mays]
Length = 392
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 132/399 (33%), Positives = 187/399 (46%), Gaps = 50/399 (12%)
Query: 83 SHSYGGYSISLSFGTPPQIIPFILDTGSHLVW---FPCTNHYQCKYCSSSKIPSFIPKLS 139
S + G Y ++L+ GTPP I DTGS L+W PCT+ C P + P S
Sbjct: 26 SPTAGEYLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQ-----CFRQPTPLYNPSSS 80
Query: 140 SSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSE 199
++ +L C + S+ A C C +Y V YGSG T SE
Sbjct: 81 TTFAVLPCNS-------SLSVCAAALAGTGTAPPPGCA--C-TYNVTYGSGWTSVFQGSE 130
Query: 200 TLNLPNRI-----IPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLL 250
T + +P GCS SS +G+ G GRG+ SL SQL + KFSYCL
Sbjct: 131 TFTFGSTPAGHARVPGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVPKFSYCL- 189
Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
+ DT TS+L+L G S S T G++ TPFV +PS A N F YY+ L I++G
Sbjct: 190 -TPYQDTNSTSTLLL--GPSASLNGTAGVSSTPFVASPSTAPMNTF---YYLNLTGISLG 243
Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
+ + +L+ DG GG I+DSGTT T + ++ + VS +
Sbjct: 244 TTALSIPPDAFSLNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTT-----DG 298
Query: 371 EALTGLRPCFDVPGEKTG--SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTV--VT 426
A TGL CF +P + + P + LHF GA++ LP ++Y + + CL + T
Sbjct: 299 SADTGLDLCFMLPSSTSAPPAMPSMTLHFN-GADMVLPADSYM-MSDDSGLWCLAMQNQT 356
Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
D E + ILGN+Q QN ++ YD+ + L F C
Sbjct: 357 DGEVN-----ILGNYQQQNMHILYDIGQETLSFAPAKCS 390
>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 560
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 129/395 (32%), Positives = 184/395 (46%), Gaps = 47/395 (11%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + GTPP+ ILDTGS L W C C C P + PK SSS + +
Sbjct: 193 GEYFMDVFVGTPPKHFSLILDTGSDLNWIQCV---PCYACFEQNGPYYDPKDSSSFKNIT 249
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG--SGLTEGIALSE-TLNL 203
C +P+C + + +P K TQ CP Y YG S T AL T+NL
Sbjct: 250 CHDPRCQLV---------SSPDPPQPCKGETQSCP-YFYWYGDSSNTTGDFALETFTVNL 299
Query: 204 PN-------RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLL 250
+I+ N + GC + AG+ G GRG S +QL FSYCL+
Sbjct: 300 TTPEGKPELKIVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFATQLQSLYGHSFSYCLV 359
Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
+++ +S LI G L +T FV + N +YYV ++ I VG
Sbjct: 360 DRN-SNSSVSSKLIF--GEDKELLSHPNLNFTSFVG----GKENPVDTFYYVLIKSIMVG 412
Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
G+ +++ + L G GGTI+DSGTT T+ A +E + + F M K + +
Sbjct: 413 GEVLKIPEETWHLSAQGGGGTIIDSGTTLTYFAEPAYEIIKEAF---MRKIKGFPL---V 466
Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDRE 429
E L+PC++V G + PE + F GA PVENYF + VCL ++ T R
Sbjct: 467 ETFPPLKPCYNVSGVEKMELPEFAILFADGAMWDFPVENYFIQIEPEDVVCLAILGTPRS 526
Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
A I+GN+Q QN+++ YDL+ RLG+ C
Sbjct: 527 ALS----IIGNYQQQNFHILYDLKKSRLGYAPMKC 557
>gi|297740193|emb|CBI30375.3| unnamed protein product [Vitis vinifera]
Length = 256
Score = 158 bits (399), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 91/205 (44%), Positives = 119/205 (58%), Gaps = 24/205 (11%)
Query: 34 SRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISL 93
S F + S + L L S+SL+RA H+K+ TT+ ++ HSYGG++I L
Sbjct: 66 STFTSKLSTEPRVFLQHLASASLSRAHHLKH------GTTSPLVKASLFPHSYGGHTIPL 119
Query: 94 SFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS---KIPSFIPKLSSSSRLLGCQNP 150
SFGTPPQ + F++DTGSH+VW PCT HY C CS S K+P F PKLSSS ++L C+NP
Sbjct: 120 SFGTPPQKLSFLVDTGSHVVWAPCTTHYTCTNCSFSNPKKVPIFNPKLSSSYKILECRNP 179
Query: 151 KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPN 210
KC S+ C CN SKNC+ CP Y + YG+G G L E LN P + I
Sbjct: 180 KC------SLGCPRCN----GNSKNCSHACPQYSLQYGTGSASGFFLLENLNFPGKTIHK 229
Query: 211 FLVGCSVLSSRQP-----AGIAGFG 230
FLVGC+ ++ +P AG FG
Sbjct: 230 FLVGCTTSAAHEPTSDALAGFVDFG 254
>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
Length = 440
Score = 158 bits (399), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 124/385 (32%), Positives = 178/385 (46%), Gaps = 47/385 (12%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + L+ GTPPQ + LDTGS LVW C C C + +P + SS+ L C
Sbjct: 91 YLLHLAIGTPPQPVQLTLDTGSVLVWTQCQ---PCAVCFNQSLPYYDASRSSTFALPSCD 147
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLN-LPNR 206
+ +C D + N T +Y YG T G ET++ +
Sbjct: 148 STQCKL------------DPSVTMCVNQTVQTCAYSYSYGDKSATIGFLDVETVSFVAGA 195
Query: 207 IIPNFLVGCSV----LSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSS 262
+P + GC + + GIAGFGRG SLPSQL + FS+C + + S+
Sbjct: 196 SVPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQLKVGNFSHCFTA---VSGRKPST 252
Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
++ D + + TP + NP A +YY+ L+ ITVG R+ V
Sbjct: 253 VLFDLPADLYKNGRGTVQTTPLIKNP------AHPTFYYLSLKGITVGSTRLPVPESAFA 306
Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
L ++G GGTI+DSGT FT + P ++ + DEF + + + TG CF
Sbjct: 307 L-KNGTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHV------KLPVVPSNETGPLLCFSA 359
Query: 383 PG-EKTGSFPELKLHFKGGAEVTLPVENYF--AVVGEGSAVCLTVVTDREASGGPSIILG 439
P K P+L LHF+ GA + LP ENY A G ++CL ++ G I+G
Sbjct: 360 PPLGKAPHVPKLVLHFE-GATMHLPRENYVFEAKDGGNCSICLAIIE------GEMTIIG 412
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
NFQ QN +V YDL+N +L F + C
Sbjct: 413 NFQQQNMHVLYDLKNSKLSFVRAKC 437
>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
Length = 455
Score = 158 bits (399), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 126/388 (32%), Positives = 178/388 (45%), Gaps = 34/388 (8%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y++++S GTPP P I+DTGS+L+W C +C + + P P SS+ L
Sbjct: 89 GAYNMNISLGTPPLDFPVIVDTGSNLIWAQCAPCTRC-FPRPTPAPVLQPARSSTFSRLP 147
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
C C ++ S + R C N T C +Y YGSG T G +ETL + +
Sbjct: 148 CNGSFCQYLPTSS-RPRTC---------NATAAC-AYNYTYGSGYTAGYLATETLTVGDG 196
Query: 207 IIPNFLVGCSVLSS-RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLIL 265
P GCS + +GI G GRG SL SQL + +FSYCL S D S ++
Sbjct: 197 TFPKVAFGCSTENGVDNSSGIVGLGRGPLSLVSQLAVGRFSYCLRSDMADGG--ASPILF 254
Query: 266 DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
+ + +++ + TP + NP + S +YYV L I V + V +
Sbjct: 255 GSLAKLTERSV--VQSTPLLKNPYLQR----STHYYVNLTGIAVDSTELPVTGSTFGFTQ 308
Query: 326 DG-NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP- 383
G GGTIVDSGTT T++A + + + F SQM T A GA L C+
Sbjct: 309 TGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAP--YDLDLCYKPSA 366
Query: 384 --GEKTGSFPELKLHFKGGAEVTLPVENYFAVV-----GEGSAVCLTVVTDREASGGPSI 436
G K P L L F GGA+ +PV+NYFA V G + CL V+ + P
Sbjct: 367 GGGGKAVRVPRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDL--PIS 424
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN + ++ YD+ F C
Sbjct: 425 IIGNLMQMDMHLLYDIDGGMFSFAPADC 452
>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 384
Score = 158 bits (399), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 126/385 (32%), Positives = 180/385 (46%), Gaps = 47/385 (12%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + L+ GTPPQ + LDTGS LVW C C C + +P + SS+ L C
Sbjct: 35 YLLHLAIGTPPQPVQLTLDTGSVLVWTQCQ---PCAVCFNQSLPYYDASRSSTFALPSC- 90
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLN-LPNR 206
+S QC+ D + N T +Y YG T G ET++ +
Sbjct: 91 ---------DSTQCK--LDPSVTMCVNQTVQTCAYSYSYGDKSATIGFLDVETVSFVAGA 139
Query: 207 IIPNFLVGCSV----LSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSS 262
+P + GC + + GIAGFGRG SLPSQL + FS+C + + S+
Sbjct: 140 SVPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQLKVGNFSHCFTA---VSGRKPST 196
Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
++ D + + TP + NP A +YY+ L+ ITVG R+ V
Sbjct: 197 VLFDLPADLYKNGRGTVQTTPLIKNP------AHPTFYYLSLKGITVGSTRLPVPESAFA 250
Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
L ++G GGTI+DSGT FT + P ++ + DEF + + + TG CF
Sbjct: 251 L-KNGTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHV------KLPVVPSNETGPLLCFSA 303
Query: 383 PG-EKTGSFPELKLHFKGGAEVTLPVENYF--AVVGEGSAVCLTVVTDREASGGPSIILG 439
P K P+L LHF+ GA + LP ENY A G ++CL ++ G I+G
Sbjct: 304 PPLGKAPHVPKLVLHFE-GATMHLPRENYVFEAKDGGNCSICLAIIE------GEMTIIG 356
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
NFQ QN +V YDL+N +L F + C
Sbjct: 357 NFQQQNMHVLYDLKNSKLSFVRAKC 381
>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
Length = 455
Score = 158 bits (399), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 130/388 (33%), Positives = 178/388 (45%), Gaps = 34/388 (8%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y++++S GTPP P I+DTGS+L+W C +C + + P P SS+ L
Sbjct: 89 GAYNMNISLGTPPLDFPVIVDTGSNLIWAQCAPCTRC-FPRPTPAPVLQPARSSTFSRLP 147
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
C C ++ S + R C N T C +Y YGSG T G +ETL + +
Sbjct: 148 CNGSFCQYLPTSS-RPRTC---------NATAAC-AYNYTYGSGYTAGYLATETLTVGDG 196
Query: 207 IIPNFLVGCSVLSS-RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLIL 265
P GCS + +GI G GRG SL SQL + +FSYCL S D +S IL
Sbjct: 197 TFPKVAFGCSTENGVDNSSGIVGLGRGPLSLVSQLAVGRFSYCLRSDMADG---GASPIL 253
Query: 266 DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
GS + + + TP + NP + S +YYV L I V + V +
Sbjct: 254 -FGSLAKLTEGSVVQSTPLLKNPYLQR----STHYYVNLTGIAVDSTELPVTGSTFGFTQ 308
Query: 326 DG-NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP- 383
G GGTIVDSGTT T++A + + + F SQM T A GA L C+
Sbjct: 309 TGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAP--YDLDLCYKPSA 366
Query: 384 --GEKTGSFPELKLHFKGGAEVTLPVENYFAVV-----GEGSAVCLTVVTDREASGGPSI 436
G K P L L F GGA+ +PV+NYFA V G + CL V+ + P
Sbjct: 367 GGGGKAVRVPRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDL--PIS 424
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN + ++ YD+ F C
Sbjct: 425 IIGNLMQMDMHLLYDIDGGMFSFAPADC 452
>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 353
Score = 158 bits (399), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 123/381 (32%), Positives = 173/381 (45%), Gaps = 38/381 (9%)
Query: 91 ISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNP 150
+ LS G P I+DTGS L+W C C C P F P+ SSS +GC +
Sbjct: 1 MELSIGNPAVKYSAIVDTGSDLIWTQCK---PCTECFDQPTPIFDPEKSSSYSKVGCSSG 57
Query: 151 KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPNR-II 208
C+ + +CN++ A YL YG T G+ +ET + I
Sbjct: 58 LCNALPR-----SNCNEDKDACE---------YLYTYGDYSSTRGLLATETFTFEDENSI 103
Query: 209 PNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLI 264
GC V + Q +G+ G GRG SL SQL KFSYCL S +D+ +SSL
Sbjct: 104 SGIGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQLKETKFSYCLTS--IEDSEASSSLF 161
Query: 265 LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLD 324
+ + +S KT S+ +YY+ L+ ITVG +R+ V L
Sbjct: 162 IGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELA 221
Query: 325 RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP- 383
DG GG I+DSGTT T++ F+ L +EF S+M + + TGL CF +P
Sbjct: 222 EDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRM------SLPVDDSGSTGLDLCFKLPD 275
Query: 384 GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQM 443
K + P++ HFK GA++ LP ENY +CL + + S I GN Q
Sbjct: 276 AAKNIAVPKMIFHFK-GADLELPGENYMVADSSTGVLCLAMGSSNGMS-----IFGNVQQ 329
Query: 444 QNYYVEYDLRNQRLGFKQQLC 464
QN+ V +DL + + F C
Sbjct: 330 QNFNVLHDLEKETVSFVPTEC 350
>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
Length = 440
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 123/385 (31%), Positives = 178/385 (46%), Gaps = 47/385 (12%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + L+ GTPPQ + LDTGS LVW C C C + +P + SS+ L C
Sbjct: 91 YLLHLAIGTPPQPVQLTLDTGSDLVWTQCQ---PCAVCFNQSLPYYDASRSSTFALPSCD 147
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLN-LPNR 206
+ +C D + N T ++ YG T G ET++ +
Sbjct: 148 STQCKL------------DPSVTMCVNQTVQTCAFSYSYGDKSATIGFLDVETVSFVAGA 195
Query: 207 IIPNFLVGCSV----LSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSS 262
+P + GC + + GIAGFGRG SLPSQL + FS+C + + S+
Sbjct: 196 SVPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQLKVGNFSHCFTA---VSGRKPST 252
Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
++ D + + TP + NP A +YY+ L+ ITVG R+ V
Sbjct: 253 VLFDLPADLYKNGRGTVQTTPLIKNP------AHPTFYYLSLKGITVGSTRLPVPESAFA 306
Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
L ++G GGTI+DSGT FT + P ++ + DEF + + + TG CF
Sbjct: 307 L-KNGTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHV------KLPVVPSNETGPLLCFSA 359
Query: 383 PG-EKTGSFPELKLHFKGGAEVTLPVENYF--AVVGEGSAVCLTVVTDREASGGPSIILG 439
P K P+L LHF+ GA + LP ENY A G ++CL ++ G I+G
Sbjct: 360 PPLGKAPHVPKLVLHFE-GATMHLPRENYVFEAKDGGNCSICLAIIE------GEMTIIG 412
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
NFQ QN +V YDL+N +L F + C
Sbjct: 413 NFQQQNMHVLYDLKNSKLSFVRAKC 437
>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
Length = 774
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 127/390 (32%), Positives = 178/390 (45%), Gaps = 47/390 (12%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + L+ GTPPQ + ILDTGS LVW C C C S + P SS+ +L C
Sbjct: 415 YLVHLAIGTPPQPVQLILDTGSDLVWTQCR---PCPVCFSRALGPLDPSNSSTFDVLPCS 471
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLP--- 204
+P C +++ C Q C Y+ Y G +T G +ET
Sbjct: 472 SPVC-----DNLTWSSCGKHNWGN-----QTC-VYVYAYADGSITTGHLDAETFTFAAAD 520
Query: 205 ---NRIIPNFLVGCSVLS----SRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDT 257
+P+ GC + + + GIAGFGRG SLPSQL +D FS+C +
Sbjct: 521 GTGQATVPDLAFGCGLFNNGIFTSNETGIAGFGRGALSLPSQLKVDNFSHCFTAIT---G 577
Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
+ SS++L ++ + TP V N S YY+ L+ ITVG R+ +
Sbjct: 578 SEPSSVLLGLPANLYSDADGAVQSTPLVQNFSSLR------AYYLSLKGITVGSTRLPIP 631
Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
L +DG GGTI+DSGT T + + ++ + D F +Q+ R + +L+ L
Sbjct: 632 ESTFALKQDGTGGTIIDSGTGMTTLPQDAYKLVHDAFTAQV---RLPVDNATSSSLSRLC 688
Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENY---FAVVGEGSAVCLTVVTDREASGGP 434
F VP P+L LHF+ GA + LP ENY F G GS CL + +G
Sbjct: 689 FSFSVPRRAKPDVPKLVLHFE-GATLDLPRENYMFEFEDAG-GSVTCLAI-----NAGDD 741
Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN+Q QN +V YDL L F C
Sbjct: 742 LTIIGNYQQQNLHVLYDLVRNMLSFVPAQC 771
>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 372
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 121/392 (30%), Positives = 176/392 (44%), Gaps = 49/392 (12%)
Query: 83 SHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSS 142
S YG + + + GTPPQ I+DTGS L W C+ C P F P SS+
Sbjct: 19 SAGYGEFLVPIYLGTPPQKAVVIIDTGSDLTWI---QSEPCRACFEQADPIFDPSKSSTY 75
Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETL 201
+ C + C+ + + + C + + NC Y YG G +T G ET+
Sbjct: 76 NKIACSSSACA----DLLGTQTC-----SAAANCI-----YAYGYGDGSVTRGYFSKETI 121
Query: 202 NLPNRIIPNFLVGCSV-----LSSRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHK 253
+ G SV GI G G+G S+PSQL +KFSYCL+
Sbjct: 122 TATDTAGEEVKFGASVYNTGTFGDTGGEGILGLGQGPVSMPSQLGSVLGNKFSYCLVDW- 180
Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
+ TS++ + + S + + YTP V N YYY+ ++ I+VGG
Sbjct: 181 LSAGSETSTMYFGDAAVPSGE----VQYTPIVPNAD------HPTYYYIAVQGISVGGSL 230
Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
+ + +D G+GGTI+DSGTT T++ E+F L + SQ V+ T A
Sbjct: 231 LDIDQSVYEIDSGGSGGTIIDSGTTITYLQQEVFNALVAAYTSQ-VRYPTTTSA------ 283
Query: 374 TGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
TGL CF+ G + FP + +H G + LP N F + E + +CL + +
Sbjct: 284 TGLDLCFNTRGTGSPVFPAMTIHLD-GVHLELPTANTFISL-ETNIICLAFASALDF--- 338
Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
P I GN Q QN+ + YDL N R+GF C
Sbjct: 339 PIAIFGNIQQQNFDIVYDLDNMRIGFAPADCA 370
>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 391
Score = 156 bits (394), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 125/391 (31%), Positives = 176/391 (45%), Gaps = 50/391 (12%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + L+ GTPPQ + LDTGS L+W C C C +P F P SS+ L C
Sbjct: 35 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQ---PCPACFDQALPYFDPSTSSTLSLTSCD 91
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNL--PN 205
+ C + + C ++ C Y YG +T G +
Sbjct: 92 STLC-----QGLPVASCGSPKFWPNQTCV-----YTYSYGDKSVTTGFLEVDKFTFVGAG 141
Query: 206 RIIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRT- 260
+P GC + ++ GIAGFGRG SLPSQL + FS+C TT T
Sbjct: 142 ASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCF-------TTITG 194
Query: 261 ---SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
S+++LD + + TP + A+ A YY+ L+ ITVG R+ V
Sbjct: 195 AIPSTVLLDLPADLFSNGQGAVQTTPLIQ---YAKNEANPTLYYLSLKGITVGSTRLPVP 251
Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
L +G GGTI+DSGT+ T + P++++ + DEF +Q+ A TG
Sbjct: 252 ESAFAL-TNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNA------TGHY 304
Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGE---GSAVCLTVVTDREASGGP 434
CF P + P+L LHF+ GA + LP ENY V + S +CL + E +
Sbjct: 305 TCFSAPSQAKPDVPKLVLHFE-GATMDLPRENYVFEVPDDAGNSIICLAINKGDETT--- 360
Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
I+GNFQ QN +V YDL+N L F C
Sbjct: 361 --IIGNFQQQNMHVLYDLQNNMLSFVAAQCD 389
>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 557
Score = 156 bits (394), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 133/397 (33%), Positives = 186/397 (46%), Gaps = 51/397 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + GTPP+ ILDTGS L W C C C P + PK SSS + +G
Sbjct: 190 GEYFMDVFIGTPPRHFSLILDTGSDLNWIQCV---PCYDCFVQNGPYYDPKESSSFKNIG 246
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG--SGLTEGIALSE-TLNL 203
C +P+C + + +P K Q CP Y YG S T AL T+NL
Sbjct: 247 CHDPRCHLV---------SSPDPPQPCKAENQTCP-YFYWYGDSSNTTGDFALETFTVNL 296
Query: 204 PN-------RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLL 250
+ + + N + GC + AG+ G GRG S SQL FSYCL+
Sbjct: 297 TSPAGKSEFKRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLV 356
Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSV--AERNAFSVYYYVGLRRIT 308
DT +S LI DK L P VN S+ + N +YYV ++ I
Sbjct: 357 DRN-SDTNVSSKLIFG-----EDKD---LLNHPEVNFTSLVAGKENPVDTFYYVQIKSIM 407
Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
VGG+ +++ + L +G GGTIVDSGTT ++ A +E + D FV + VK +
Sbjct: 408 VGGEVLKIPEETWHLSPEGAGGTIVDSGTTLSYFAEPSYEIIKDAFVKK-VKGYPVIKDF 466
Query: 369 GAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TD 427
L PC++V G + PE ++ F+ GA PVENYF + VCL ++ T
Sbjct: 467 PI-----LDPCYNVSGVEKMELPEFRILFEDGAVWNFPVENYFIKLEPEEIVCLAILGTP 521
Query: 428 REASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
R A I+GN+Q QN+++ YD + RLG+ C
Sbjct: 522 RSALS----IIGNYQQQNFHILYDTKKSRLGYAPMKC 554
>gi|125558632|gb|EAZ04168.1| hypothetical protein OsI_26310 [Oryza sativa Indica Group]
gi|125600539|gb|EAZ40115.1| hypothetical protein OsJ_24558 [Oryza sativa Japonica Group]
Length = 453
Score = 155 bits (393), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 144/451 (31%), Positives = 204/451 (45%), Gaps = 51/451 (11%)
Query: 33 LSRFHTNPSQDSYQNLNSLVSSSLTRALH--------IKNPQTKTTTTTTTTTTTNISSH 84
L+R H+ P + Q V +L R +H + + + ++ T + T
Sbjct: 32 LTRIHSEPGVTASQ----FVRDALRRDMHRRARFGRELASSSSSSSPAGTVSAPTRKDLP 87
Query: 85 SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC-KYCSSSKIPSFIPKLSSSSR 143
+ G Y ++L+ GTPPQ P I DTGS LVW C C + C P + P S + R
Sbjct: 88 NGGEYIMTLAIGTPPQSYPAIADTGSDLVWTQCA---PCGERCFKQPSPLYNPSSSPTFR 144
Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL 203
+L C S ++ + + R P C C Y YG+G T G+ SET
Sbjct: 145 VLPCS----SALNLCAAEARLAGATP---PPGCA--C-RYNQTYGTGWTSGLQGSETFTF 194
Query: 204 PNRI-----IPNFLVGCSVLSSRQPAGIA---GFGRGKTSLPSQLNLDKFSYCLLSHKFD 255
+ +P GCS SS G A G GRG SL SQL FSYCL F
Sbjct: 195 GSSPADQVRVPGIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCL--TPFQ 252
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
DT S+L+L ++ + TG+ TPFV +PS + S YYY+ L I+VG +
Sbjct: 253 DTKSKSTLLLGPAAAAAALNGTGVRSTPFVPSPS---KPPMSTYYYLNLTGISVGAAALP 309
Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
+ L DG GG I+DSGTT T + + + V V++ TG
Sbjct: 310 IPPGAFALRADGTGGLIIDSGTTITSLVDAAY-----KRVRAAVRSLVKLPVTDGSNATG 364
Query: 376 LRPCFDVPGEKT--GSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
L CF +P + P + LHF GGA++ LPVENY + +G CL + R + G
Sbjct: 365 LDLCFALPSSSAPPATLPSMTLHFGGGADMVLPVENYMIL--DGGMWCLAM---RSQTDG 419
Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
LGN+Q QN ++ YD++ + L F C
Sbjct: 420 ELSTLGNYQQQNLHILYDVQKETLSFAPAKC 450
>gi|22831049|dbj|BAC15912.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508281|dbj|BAD32130.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 453
Score = 155 bits (391), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 144/451 (31%), Positives = 204/451 (45%), Gaps = 51/451 (11%)
Query: 33 LSRFHTNPSQDSYQNLNSLVSSSLTRALH--------IKNPQTKTTTTTTTTTTTNISSH 84
L+R H+ P + Q V +L R +H + + + ++ T + T
Sbjct: 32 LTRIHSEPGVTASQ----FVRDALRRDMHRRARFGRELASSSSSSSPAGTVSAPTRKDLP 87
Query: 85 SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC-KYCSSSKIPSFIPKLSSSSR 143
+ G Y ++L+ GTPPQ P I DTGS LVW C C + C P + P S + R
Sbjct: 88 NGGEYIMTLAIGTPPQSYPAIADTGSDLVWTQCA---PCGERCFKQPSPLYNPSSSPTFR 144
Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL 203
+L C S ++ + + R P C C Y YG+G T G+ SET
Sbjct: 145 VLPCS----SALNLCAAEARLAGATP---PPGCA--C-RYNQTYGTGWTSGLQGSETFTF 194
Query: 204 PNRI-----IPNFLVGCSVLSSRQPAGIA---GFGRGKTSLPSQLNLDKFSYCLLSHKFD 255
+ +P GCS SS G A G GRG SL SQL FSYCL F
Sbjct: 195 GSSPADQVRVPGIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCL--TPFQ 252
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
DT S+L+L ++ + TG+ TPFV +PS + S YYY+ L I+VG +
Sbjct: 253 DTKSKSTLLLGPAAAAAALNGTGVRSTPFVPSPS---KPPMSTYYYLNLTGISVGPAALP 309
Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
+ L DG GG I+DSGTT T + + + V V++ TG
Sbjct: 310 IPPGAFALRADGTGGLIIDSGTTITSLVDAAY-----KRVRAAVRSLVKLPVTDGSNATG 364
Query: 376 LRPCFDVPGEKT--GSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
L CF +P + P + LHF GGA++ LPVENY + +G CL + R + G
Sbjct: 365 LDLCFALPSSSAPPATLPSMTLHFGGGADMVLPVENYMIL--DGGMWCLAM---RSQTDG 419
Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
LGN+Q QN ++ YD++ + L F C
Sbjct: 420 ELSTLGNYQQQNLHILYDVQKETLSFAPAKC 450
>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 154 bits (390), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 132/388 (34%), Positives = 178/388 (45%), Gaps = 56/388 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G + + L+ GTPP+ I+DTGS L+W C C C P F PK SSS L
Sbjct: 95 GEFLMKLAIGTPPETYSAIMDTGSDLIWTQCK---PCTQCFDQPTPIFDPKKSSSFSKLS 151
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPN 205
C + C + P +T C+ C YL YG T+G+ SETL
Sbjct: 152 CSSKLCEAL-------------PQST---CSDGC-EYLYGYGDYSSTQGMLASETLTFGK 194
Query: 206 RIIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
+P GC + Q +G+ G GRG SL SQL KFSYCL S D T+ S
Sbjct: 195 VSVPEVAFGCGEDNEGSGFSQGSGLVGLGRGPLSLVSQLKEPKFSYCLTSV---DDTKAS 251
Query: 262 SLILDNGSSHSDKKT-TGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
+L++ GS S K + + + TP + N +A +YY+ L I+VG + +
Sbjct: 252 TLLM--GSLASVKASDSEIKTTPLIQN------SAQPSFYYLSLEGISVGDTSLPIKKST 303
Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM---VKNRNYTRALGAEALTGLR 377
+L DG+GG I+DSGTT T++ F+ +A EF SQ+ V N TGL
Sbjct: 304 FSLQEDGSGGLIIDSGTTITYLEQSAFDLVAKEFTSQINLPVDNSGS---------TGLE 354
Query: 378 PCFDVPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
CF +P T P+L HF GA++ LP ENY CL + + S
Sbjct: 355 VCFTLPSGSTDIEVPKLVFHFD-GADLELPAENYMIADASMGVACLAMGSSSGMS----- 408
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I GN Q QN V +DL + L F C
Sbjct: 409 IFGNIQQQNMLVLHDLEKETLSFLPTQC 436
>gi|115472519|ref|NP_001059858.1| Os07g0533800 [Oryza sativa Japonica Group]
gi|113611394|dbj|BAF21772.1| Os07g0533800 [Oryza sativa Japonica Group]
Length = 458
Score = 154 bits (390), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 144/451 (31%), Positives = 204/451 (45%), Gaps = 51/451 (11%)
Query: 33 LSRFHTNPSQDSYQNLNSLVSSSLTRALH--------IKNPQTKTTTTTTTTTTTNISSH 84
L+R H+ P + Q V +L R +H + + + ++ T + T
Sbjct: 37 LTRIHSEPGVTASQ----FVRDALRRDMHRRARFGRELASSSSSSSPAGTVSAPTRKDLP 92
Query: 85 SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC-KYCSSSKIPSFIPKLSSSSR 143
+ G Y ++L+ GTPPQ P I DTGS LVW C C + C P + P S + R
Sbjct: 93 NGGEYIMTLAIGTPPQSYPAIADTGSDLVWTQCA---PCGERCFKQPSPLYNPSSSPTFR 149
Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL 203
+L C S ++ + + R P C C Y YG+G T G+ SET
Sbjct: 150 VLPCS----SALNLCAAEARLAGATP---PPGCA--C-RYNQTYGTGWTSGLQGSETFTF 199
Query: 204 PNRI-----IPNFLVGCSVLSSRQPAGIA---GFGRGKTSLPSQLNLDKFSYCLLSHKFD 255
+ +P GCS SS G A G GRG SL SQL FSYCL F
Sbjct: 200 GSSPADQVRVPGIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCL--TPFQ 257
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
DT S+L+L ++ + TG+ TPFV +PS + S YYY+ L I+VG +
Sbjct: 258 DTKSKSTLLLGPAAAAAALNGTGVRSTPFVPSPS---KPPMSTYYYLNLTGISVGPAALP 314
Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
+ L DG GG I+DSGTT T + + + V V++ TG
Sbjct: 315 IPPGAFALRADGTGGLIIDSGTTITSLVDAAY-----KRVRAAVRSLVKLPVTDGSNATG 369
Query: 376 LRPCFDVPGEKT--GSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
L CF +P + P + LHF GGA++ LPVENY + +G CL + R + G
Sbjct: 370 LDLCFALPSSSAPPATLPSMTLHFGGGADMVLPVENYMIL--DGGMWCLAM---RSQTDG 424
Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
LGN+Q QN ++ YD++ + L F C
Sbjct: 425 ELSTLGNYQQQNLHILYDVQKETLSFAPAKC 455
>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
Length = 434
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 123/388 (31%), Positives = 176/388 (45%), Gaps = 50/388 (12%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + L+ GTPPQ + LDTGS L+W C C C +P F P SS+ L C
Sbjct: 82 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQ---PCPACFDQALPYFDPSTSSTLSLTSCD 138
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNL--PN 205
+ C + + C ++ C Y YG +T G +
Sbjct: 139 STLC-----QGLPVASCGSPKFWPNQTCV-----YTYSYGDKSVTTGFLEVDKFTFVGAG 188
Query: 206 RIIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
+P GC + ++ GIAGFGRG SLPSQL + FS+C + + + S
Sbjct: 189 ASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAV---NGLKPS 245
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
+++LD + + TP + NP A +YY+ L+ ITVG R+ V
Sbjct: 246 TVLLDLPADLYKSGRGAVQSTPLIQNP------ANPTFYYLSLKGITVGSTRLPVPESEF 299
Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM---VKNRNYTRALGAEALTGLRP 378
TL ++G GGTI+DSGT T + ++ + D F +Q+ V + N T
Sbjct: 300 TL-KNGTGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYF--------- 349
Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAV-CLTVVTDREASGGPSI 436
C P P+L LHF+ GA + LP ENY F V GS++ CL ++ GG
Sbjct: 350 CLSAPLRAKPYVPKLVLHFE-GATMDLPRENYVFEVEDAGSSILCLAII-----EGGEVT 403
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+GNFQ QN +V YDL+N +L F C
Sbjct: 404 TIGNFQQQNMHVLYDLQNSKLSFVPAQC 431
>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 757
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 124/395 (31%), Positives = 185/395 (46%), Gaps = 46/395 (11%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y I + G+PP+ ILDTGS L W C C C P + PK S S R +
Sbjct: 194 GEYFIDVFIGSPPKHFSLILDTGSDLNWIQCV---PCFDCFEQNGPYYDPKDSISFRNIT 250
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG--SGLTEGIALSE-TLNL 203
C +P+C + + +P K TQ CP Y YG S T AL T+NL
Sbjct: 251 CNDPRCQLV---------SSPDPPRPCKFETQSCP-YFYWYGDSSNTTGDFALETFTVNL 300
Query: 204 PN--------RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCL 249
+ R + N + GC + AG+ G GRG S SQL FSYCL
Sbjct: 301 TSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCL 360
Query: 250 LSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
+ DT+ +S LI G L +T + + N +YY+ ++ I V
Sbjct: 361 VDRD-SDTSVSSKLIF--GEDKDLLTHPELNFTSLI----AGKENPVDTFYYLQIKSIFV 413
Query: 310 GGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
GG+++++ + L DG GGTI+DSGTT ++ + + + + F+ ++ + Y
Sbjct: 414 GGEKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKV---KGYKL--- 467
Query: 370 AEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDRE 429
E L PC++V G +FPE + F GA PVENYF + + VCL ++ +
Sbjct: 468 VEDFPILHPCYNVSGTDELNFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPK 527
Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
++ I+GN+Q QN+++ YD +N RLG+ C
Sbjct: 528 SALS---IIGNYQQQNFHILYDTKNSRLGYAPMRC 559
>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 451
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 131/399 (32%), Positives = 169/399 (42%), Gaps = 51/399 (12%)
Query: 84 HSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSR 143
+S G Y+++LS GTPP + DTGS L+W C C C++ P F P SS+
Sbjct: 85 NSAGAYNMNLSIGTPPVTFSVLADTGSSLIWTQCA---PCTECAARPAPPFQPASSSTFS 141
Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL 203
L C + C ++ + C + C P YG G T G +ETL++
Sbjct: 142 KLPCASSLCQFLTSPYLTCN---------ATGCVYYYP-----YGMGFTAGYLATETLHV 187
Query: 204 PNRIIPNFLVGCSVLS--SRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
P GCS + +GI G GR SL SQ+ + +FSYCL S D
Sbjct: 188 GGASFPGVAFGCSTENGVGNSSSGIVGLGRSPLSLVSQVGVGRFSYCLRS----DADAGD 243
Query: 262 SLILDNGSSHSDKKTTG--LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
S IL S K TG + TP + NP + S YYYV L ITVG + V
Sbjct: 244 SPILFG----SLAKVTGGNVQSTPLLENPEMPS----SSYYYVNLTGITVGATDLPVTST 295
Query: 320 YLTLDRDGN----GGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
R GGTIVDSGTT T++ E + + F+SQM T G G
Sbjct: 296 TFGFTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTR--FG 353
Query: 376 LRPCFDVPGEKTGS---FPELKLHFKGGAEVTLPVENYFAVV-----GEGSAVCLTVVTD 427
CFD GS P L L F GGAE + +Y VV G + CL V+
Sbjct: 354 FDLCFDATAAGGGSGVPVPTLVLRFAGGAEYAVRRRSYVGVVAVDSQGRAAVECLLVLPA 413
Query: 428 REASGGPSI-ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
E SI I+GN + +V YDL F C
Sbjct: 414 SEKL---SISIIGNVMQMDLHVLYDLDGGMFSFAPADCA 449
>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 752
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 124/395 (31%), Positives = 185/395 (46%), Gaps = 46/395 (11%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y I + G+PP+ ILDTGS L W C C C P + PK S S R +
Sbjct: 194 GEYFIDVFIGSPPKHFSLILDTGSDLNWIQCV---PCFDCFEQNGPYYDPKDSISFRNIT 250
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG--SGLTEGIALSE-TLNL 203
C +P+C + + +P K TQ CP Y YG S T AL T+NL
Sbjct: 251 CNDPRCQLV---------SSPDPPRPCKFETQSCP-YFYWYGDSSNTTGDFALETFTVNL 300
Query: 204 PN--------RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCL 249
+ R + N + GC + AG+ G GRG S SQL FSYCL
Sbjct: 301 TSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCL 360
Query: 250 LSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
+ DT+ +S LI G L +T + + N +YY+ ++ I V
Sbjct: 361 VDRD-SDTSVSSKLIF--GEDKDLLTHPELNFTSLI----AGKENPVDTFYYLQIKSIFV 413
Query: 310 GGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
GG+++++ + L DG GGTI+DSGTT ++ + + + + F+ ++ + Y
Sbjct: 414 GGEKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKV---KGYKL--- 467
Query: 370 AEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDRE 429
E L PC++V G +FPE + F GA PVENYF + + VCL ++ +
Sbjct: 468 VEDFPILHPCYNVSGTDELNFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPK 527
Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
++ I+GN+Q QN+++ YD +N RLG+ C
Sbjct: 528 SALS---IIGNYQQQNFHILYDTKNSRLGYAPMRC 559
>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
Length = 436
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 130/391 (33%), Positives = 178/391 (45%), Gaps = 52/391 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
GGY++++S GTP + DTGS L+W C C C P F P SS+ L
Sbjct: 84 GGYNMNISVGTPLLTFSVVADTGSDLIWTQCA---PCTKCFQQPAPPFQPASSSTFSKLP 140
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
C + C ++ + R CN + C Y YGSG T G +ETL + +
Sbjct: 141 CTSSFCQFLPNS---IRTCN------ATGCV-----YNYKYGSGYTAGYLATETLKVGDA 186
Query: 207 IIPNFLVGCSVLS--SRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLI 264
P+ GCS + +GIAG GRG SL QL + +FSYCL S S ++
Sbjct: 187 SFPSVAFGCSTENGVGNSTSGIAGLGRGALSLIPQLGVGRFSYCLRSGS---AAGASPIL 243
Query: 265 LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLD 324
+ ++ +D + TPFVNNP+V YYYV L ITVG + V
Sbjct: 244 FGSLANLTDGN---VQSTPFVNNPAVHPS-----YYYVNLTGITVGETDLPVTTSTFGFT 295
Query: 325 RDG-NGGTIVDSGTTFTFMAPELFEPLADEFVSQM--VKNRNYTRALGAEALTGLRPCFD 381
++G GGTIVDSGTT T++A + +E + F+SQ V N TR GL CF
Sbjct: 296 QNGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTADVTTVNGTR--------GLDLCFK 347
Query: 382 VPGEKTG--SFPELKLHFKGGAEVTLPVENYFAVV-----GEGSAVCLTVVTDREASGGP 434
G G + P L L F GGAE +P YFA V G + CL ++ + P
Sbjct: 348 STGGGGGGIAVPSLVLRFDGGAEYAVP--TYFAGVETDSQGSVTVACLMMLPAKGDQ--P 403
Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
++GN + ++ YDL F C
Sbjct: 404 MSVIGNVMQMDMHLLYDLDGGIFSFAPADCA 434
>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
Length = 437
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 125/385 (32%), Positives = 182/385 (47%), Gaps = 50/385 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y ++LS GTP Q I+DTGS L+W C C C + P F P+ SSS L
Sbjct: 93 GEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQ---PCTQCFNQSTPIFNPQGSSSFSTLP 149
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C + C +++Q C S N Q Y YG G T+G +ETL +
Sbjct: 150 CSSQLC-----QALQSPTC-------SNNSCQ----YTYGYGDGSETQGSMGTETLTFGS 193
Query: 206 RIIPNFLVGCSV----LSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
IPN GC AG+ G GRG SLPSQL++ KFSYC+ ++ +S
Sbjct: 194 VSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTPIG---SSTSS 250
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
+L+L S ++ T G +P N ++ E + +YY+ L ++VG + +
Sbjct: 251 TLLL---GSLANSVTAG---SP---NTTLIESSQIPTFYYITLNGLSVGSTPLPIDPSVF 301
Query: 322 TLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
L+ +G GG I+DSGTT T+ A ++ + F+SQM N + G+ + G CF
Sbjct: 302 KLNSNNGTGGIIIDSGTTLTYFADNAYQAVRQAFISQM----NLSVVNGSSS--GFDLCF 355
Query: 381 DVPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
+P +++ P +HF GG ++ LP ENYF G +CL + + + I G
Sbjct: 356 QMPSDQSNLQIPTFVMHFDGG-DLVLPSENYFISPSNG-LICLAMGSSSQGMS----IFG 409
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
N Q QN V YD N + F C
Sbjct: 410 NIQQQNLLVVYDTGNSVVSFLFAQC 434
>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
Length = 529
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 128/396 (32%), Positives = 186/396 (46%), Gaps = 47/396 (11%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + GTPP+ ILDTGS L W C Y C + + + + PK S+S + +
Sbjct: 160 GEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNEA---FYDPKTSASFKNIT 216
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG--SGLTEGIALSE-TLNL 203
C +P+CS I + EP K+ Q CP Y YG S T A+ T+NL
Sbjct: 217 CNDPRCSLI---------SSPEPPVQCKSDNQSCP-YFYWYGDRSNTTGDFAVETFTVNL 266
Query: 204 -------PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLL 250
+ N + GC + +G+ G GRG S SQL FSYCL+
Sbjct: 267 TTTEGRSSEYKVENMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLV 326
Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
DT +S LI G T L +T FVN + N+ +YY+ ++ I VG
Sbjct: 327 DRN-SDTNVSSKLIF--GEDKDLLNHTNLNFTSFVN----GKENSVETFYYIQIKSILVG 379
Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
G+ + + + + DG GGTI+DSGTT ++ A +E + ++F +M +N R
Sbjct: 380 GEALDIPEETWNISPDGAGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYLVFRDFPV 439
Query: 371 EALTGLRPCFDVPG--EKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR 428
L PCF+V G E PEL + F GA P EN F + E VCL ++
Sbjct: 440 -----LDPCFNVSGIEENNIHLPELGIAFADGAVWNFPAENSFIWLSE-DLVCLAILGTP 493
Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+++ I+GN+Q QN+++ YD + RLGF C
Sbjct: 494 KSTFS---IIGNYQQQNFHILYDTKMSRLGFTPTKC 526
>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
Length = 383
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 126/385 (32%), Positives = 180/385 (46%), Gaps = 49/385 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y I ++ GTP + I+DTGS LVW C C CS+S I + S L
Sbjct: 40 GEYLIQMAIGTPALSLSAIMDTGSDLVWTKCN---PCTDCSTSSIYDPSSSSTYSKVL-- 94
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPN 205
CQ+ C + CN++ +C + P YG T GI ET ++ +
Sbjct: 95 CQSSLC-----QPPSIFSCNND-----GDCEYVYP-----YGDRSSTSGILSDETFSISS 139
Query: 206 RIIPNFLVGCSVLSS--RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRT 260
+ +PN GC + + G+ GFGRG SL SQL +KFSYCL+S D+++T
Sbjct: 140 QSLPNITFGCGHDNQGFDKVGGLVGFGRGSLSLVSQLGPSMGNKFSYCLVSRT--DSSKT 197
Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
S L + N +S + T + TP V + S +YY+ L I+VGGQ + +
Sbjct: 198 SPLFIGNTAS---LEATTVGSTPLVQSSSTN-------HYYLSLEGISVGGQSLAIPTGT 247
Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
+ DG+GG I+DSGTT TF+ ++ + + VS + N +A G L CF
Sbjct: 248 FDIQSDGSGGLIIDSGTTLTFLQQTAYDAVKEAMVSSI----NLPQADGQ-----LDLCF 298
Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
+ G FP + HFK GA+ +P ENY VCL ++ ++ G I GN
Sbjct: 299 NQQGSSNPGFPSMTFHFK-GADYDVPKENYLFPDSTSDIVCLAMMP-TNSNLGNMAIFGN 356
Query: 441 FQMQNYYVEYDLRNQRLGFKQQLCK 465
Q QNY + YD N L F C
Sbjct: 357 VQQQNYQILYDNENNVLSFAPTACD 381
>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
Length = 434
Score = 152 bits (383), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 122/388 (31%), Positives = 175/388 (45%), Gaps = 50/388 (12%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + L+ GTPPQ + LDTGS L+W C C C +P F P SS+ L C
Sbjct: 82 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQ---PCPACFDQALPYFDPSTSSTLSLTSCD 138
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNL--PN 205
+ C + + C ++ C Y YG +T G +
Sbjct: 139 STLC-----QGLPVASCGSPKFWPNQTCV-----YTYSYGDKSVTTGFLEVDKFTFVGAG 188
Query: 206 RIIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
+P GC + ++ GIAGFGRG SLPSQL + FS+C + + + S
Sbjct: 189 ASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAV---NGLKPS 245
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
+++LD + + TP + NP A +YY+ L+ ITVG R+ V
Sbjct: 246 TVLLDLPADLYKSGRGAVQSTPLIQNP------ANPTFYYLSLKGITVGSTRLPVPESEF 299
Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM---VKNRNYTRALGAEALTGLRP 378
L ++G GGTI+DSGT T + ++ + D F +Q+ V + N T
Sbjct: 300 AL-KNGTGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYF--------- 349
Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAV-CLTVVTDREASGGPSI 436
C P P+L LHF+ GA + LP ENY F V GS++ CL ++ GG
Sbjct: 350 CLSAPLRAKPYVPKLVLHFE-GATMDLPRENYVFEVEDAGSSILCLAII-----EGGEVT 403
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+GNFQ QN +V YDL+N +L F C
Sbjct: 404 TIGNFQQQNMHVLYDLQNSKLSFVPAQC 431
>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
Length = 451
Score = 151 bits (382), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 134/429 (31%), Positives = 187/429 (43%), Gaps = 53/429 (12%)
Query: 57 TRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFP 116
TR LH + + K + + SS S G Y + L G PPQ + I DTGS LVW
Sbjct: 52 TRRLHFLSLRRKPVPFVKSPVVSGASSGS-GQYFVDLRIGQPPQSLLLIADTGSDLVWVK 110
Query: 117 CTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNC 176
C+ C + S + + F P+ SS+ C +P C + R CN + ++
Sbjct: 111 CSACRNCSHHSPATV--FFPRHSSTFSPAHCYDPVCRLVPKPGRAPR-CNHTRIHST--- 164
Query: 177 TQICPSYLVLYGSG-LTEGIALSETLNLP-----NRIIPNFLVGCSVLSSRQPA------ 224
CP Y Y G LT G+ ET +L + + GC S Q
Sbjct: 165 ---CP-YEYGYADGSLTSGLFARETTSLKTSSGKEAKLKSVAFGCGFRISGQSVSGTSFN 220
Query: 225 ---GIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTG 278
G+ G GRG S SQL +KFSYCL+ + TS LI+ +G K
Sbjct: 221 GANGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPP-TSYLIIGDGGDAVSK---- 275
Query: 279 LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTT 338
L +TP + NP +YYV L+ + V G ++R+ +D GNGGT++DSGTT
Sbjct: 276 LFFTPLLTNPLSP------TFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVMDSGTT 329
Query: 339 FTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT-GLRPCFDVPG--EKTGSFPELKL 395
F+A + L V Q +K N A+ LT G C +V G + P LK
Sbjct: 330 LAFLADPAYR-LVIAAVKQRIKLPN------ADELTPGFDLCVNVSGVTKPEKILPRLKF 382
Query: 396 HFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQ 455
F GGA P NYF + E CL + + G ++GN Q + E+D
Sbjct: 383 EFSGGAVFVPPPRNYF-IETEEQIQCLAIQSVDPKVG--FSVIGNLMQQGFLFEFDRDRS 439
Query: 456 RLGFKQQLC 464
RLGF ++ C
Sbjct: 440 RLGFSRRGC 448
>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
Length = 437
Score = 151 bits (381), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 123/385 (31%), Positives = 181/385 (47%), Gaps = 50/385 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y ++LS GTP Q I+DTGS L+W C C C + P F P+ SSS L
Sbjct: 93 GEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQ---PCTQCFNQSTPIFNPQGSSSFSTLP 149
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C + C +++Q C S N Q Y YG G T+G +ETL +
Sbjct: 150 CSSQLC-----QALQSPTC-------SNNSCQ----YTYGYGDGSETQGSMGTETLTFGS 193
Query: 206 RIIPNFLVGCSV----LSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
IPN GC AG+ G GRG SLPSQL++ KFSYC+ ++ +S
Sbjct: 194 VSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTPIG---SSNSS 250
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
+L+L S ++ T G +P N ++ + + +YY+ L ++VG + +
Sbjct: 251 TLLL---GSLANSVTAG---SP---NTTLIQSSQIPTFYYITLNGLSVGSTPLPIDPSVF 301
Query: 322 TLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
L+ +G GG I+DSGTT T+ ++ + F+SQM N + G+ + G CF
Sbjct: 302 KLNSNNGTGGIIIDSGTTLTYFVDNAYQAVRQAFISQM----NLSVVNGSSS--GFDLCF 355
Query: 381 DVPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
+P +++ P +HF GG ++ LP ENYF G +CL + + + I G
Sbjct: 356 QMPSDQSNLQIPTFVMHFDGG-DLVLPSENYFISPSNG-LICLAMGSSSQGMS----IFG 409
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
N Q QN V YD N + F C
Sbjct: 410 NIQQQNLLVVYDTGNSVVSFLSAQC 434
>gi|414878073|tpg|DAA55204.1| TPA: hypothetical protein ZEAMMB73_344109 [Zea mays]
Length = 440
Score = 151 bits (381), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 127/390 (32%), Positives = 183/390 (46%), Gaps = 49/390 (12%)
Query: 95 FGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSW 154
G PPQ I+DTGS+L+W C+ Q C S + + P S ++R + C + C+
Sbjct: 77 IGDPPQQAEAIIDTGSNLIWTQCST-CQPAGCFSQNLSFYDPSRSRTARPVACNDTACA- 134
Query: 155 IHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL-PNRIIPNFLV 213
+ E+ RD +K C + L YG+G+ G+ +E P +
Sbjct: 135 LGSETRCARD--------NKAC-----AVLTAYGAGVIGGVLGTEAFTFQPQSENVSLAF 181
Query: 214 GCSVLSSRQP------AGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDN 267
GC + P +GI G GRG SL SQL +KFSYCL + F +T TS L +
Sbjct: 182 GCIAATRLTPGSLDGASGIIGLGRGNLSLVSQLGDNKFSYCLTPY-FSQSTNTSRLFVGA 240
Query: 268 GSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDG 327
+ S T PF+ NP V + FS +YY+ L ITVG ++ V L +
Sbjct: 241 SAGLSSGGAPA-TSVPFLKNPDV---DPFSTFYYLPLTGITVGDAKLAVPEAAFDLRQVA 296
Query: 328 NG---GTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPG 384
G GT++DSG+ FT + ++ L DE V Q+ + A GAE GL C V
Sbjct: 297 TGLWAGTLIDSGSPFTSLVDVAYQALRDELVQQLGASIVPPPA-GAE---GLDLCAAVAH 352
Query: 385 EKTGSF-PELKLHF-KGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGP-------- 434
G P L LHF GG +V +P ENY+ V + +A C+ V + SGGP
Sbjct: 353 GDVGKLVPPLVLHFGSGGGDVAVPPENYWGPVDDSTA-CMVVFS----SGGPNSTLPMNE 407
Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ I+GN+ Q+ ++ YDL L F+ C
Sbjct: 408 TTIIGNYMQQDMHLLYDLEKGMLSFQPADC 437
>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 151 bits (381), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 131/460 (28%), Positives = 199/460 (43%), Gaps = 53/460 (11%)
Query: 10 LSFIFFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKT 69
L ++FF +L +P + +L LS + L +V S RA ++ P +
Sbjct: 14 LPYLFFLAILFAWPVTSATLRAHLSHVDDGRGFTKRELLRRMVVRSRARAANL-CPYSGA 72
Query: 70 TTTTTTTTTTNISSHSYGGYSISLSFGTP-PQIIPFILDTGSHLVWFPCTNHYQCKYCSS 128
T T ++ Y I LS G P Q + LDTGS +VW C C C +
Sbjct: 73 TARPATAPVGRANTDVNSEYLIHLSIGAPRSQPVVLTLDTGSDVVWTQCE---PCAECFT 129
Query: 129 SKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG 188
+P F S++ R + C +P C+ H E C +Y+ YG
Sbjct: 130 QPLPRFDTAASNTVRSVACSDPLCN-AHSE---------------HGCFLHGCTYVSGYG 173
Query: 189 SG-LTEGIALSETLNLPNR------IIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLP 237
G L+ G L ++ + +P+ GC + ++ + GIAGFGRG SLP
Sbjct: 174 DGSLSFGHFLRDSFTFDDGKGGGKVTVPDIGFGCGMYNAGRFLQTETGIAGFGRGPLSLP 233
Query: 238 SQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFS 297
SQL + +FSYC + +F+ ++S + L T + TPFV + N+
Sbjct: 234 SQLKVRQFSYCFTT-RFE--AKSSPVFLGGAGDLKAHATGPILSTPFVRSLPPGTDNS-- 288
Query: 298 VYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQ 357
+Y + + +TVG R+ V + DG+G T +DSGT T +F L F++Q
Sbjct: 289 -HYVLSFKGVTVGKTRLPVPE----IKADGSGATFIDSGTDITTFPDAVFRQLKSAFIAQ 343
Query: 358 MVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEG 417
N T CF G+KT + P+L H + GA+ LP ENY E
Sbjct: 344 AALPVNKTADED-------DICFSWDGKKTAAMPKLVFHLE-GADWDLPRENYVTEDRES 395
Query: 418 SAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRL 457
VC+ V T + ++GNFQ QN ++ YDL +L
Sbjct: 396 GQVCVAVSTSGQMD---RTLIGNFQQQNTHIVYDLAAGKL 432
>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 150 bits (380), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 119/381 (31%), Positives = 172/381 (45%), Gaps = 39/381 (10%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + ++ GTPP + +LDTGS L+W C C+ C P + P S++ + C+
Sbjct: 92 YLVDIAIGTPPLPLTAVLDTGSDLIWTQC--DAPCRRCFPQPAPLYAPARSATYANVSCR 149
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNL-PNR 206
+P C + +C + C +Y YG G T+G+ +ET L +
Sbjct: 150 SPMCQALQSPWSRCSPPD-------TGC-----AYYFSYGDGTSTDGVLATETFTLGSDT 197
Query: 207 IIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSL 263
+ GC ++ S+ +G+ G GRG SL SQL + +FSYC + T S L
Sbjct: 198 AVRGVAFGCGTENLGSTDNSSGLVGMGRGPLSLVSQLGVTRFSYCFTPF---NATAASPL 254
Query: 264 ILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTL 323
L + + S T TPFV +PS R S YYY+ L ITVG + + L
Sbjct: 255 FLGSSARLSSAAKT----TPFVPSPSGGARRR-SSYYYLSLEGITVGDTLLPIDPAVFRL 309
Query: 324 DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP 383
G+GG I+DSGTTFT + F LA S++ L + A GL CF
Sbjct: 310 TPMGDGGVIIDSGTTFTALEERAFVALARALASRV------RLPLASGAHLGLSLCFAAA 363
Query: 384 GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQM 443
+ P L LHF GA++ L E+Y CL +V+ R S +LG+ Q
Sbjct: 364 SPEAVEVPRLVLHFD-GADMELRRESYVVEDRSAGVACLGMVSARGMS-----VLGSMQQ 417
Query: 444 QNYYVEYDLRNQRLGFKQQLC 464
QN ++ YDL L F+ C
Sbjct: 418 QNTHILYDLERGILSFEPAKC 438
>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 139/452 (30%), Positives = 201/452 (44%), Gaps = 63/452 (13%)
Query: 33 LSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTT---TTTTTTNISSHSYGGY 89
L+R H+NP + + V +L R +H T+ ++ T T + G Y
Sbjct: 33 LTRIHSNPDVSATE----FVRDALRRDMHRHARFTRELASSGDRTVAAPTRKDLPNGGEY 88
Query: 90 SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS----FIPKLSSSSRLL 145
++L+ GTPP P I DTGS L+W QC C S + P S++ +L
Sbjct: 89 IMTLAIGTPPLSYPAIADTGSDLIW------TQCAPCGSQCFKQAGQPYNPSSSTTFGVL 142
Query: 146 GCQNP--KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL 203
C + C+ + S C+ + Y YG+G T GI ET
Sbjct: 143 PCNSSVSMCAALAGPS------------PPPGCSCM---YNQTYGTGWTAGIQSVETFTF 187
Query: 204 -----PNRIIPNFLVGCSVLSSRQ---PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFD 255
+P GCS SS AG+ G GRG SL SQL FSYCL F
Sbjct: 188 GSTPADQTRVPGIAFGCSNASSDDWNGSAGLVGLGRGSMSLVSQLGAGMFSYCL--TPFQ 245
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
D TS+L+L ++ + TG+ TPFV +PS A S YYY+ L I++G +
Sbjct: 246 DANSTSTLLLGPSAALNG---TGVLTTPFVASPSKAP---MSTYYYLNLTGISIGTTALS 299
Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
+ L DG GG I+DSGTT T + ++ + S + A G+++ TG
Sbjct: 300 IPPNAFALRTDGTGGLIIDSGTTITSLVDAAYQQVRAAIESLV----TLPVADGSDS-TG 354
Query: 376 LRPCFDVPGEKTG--SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
L CF + E + S P + HF GA++ LPV+NY ++G G CL + R + G
Sbjct: 355 LDLCFALTSETSTPPSMPSMTFHFD-GADMVLPVDNYM-ILGSG-VWCLAM---RNQTVG 408
Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
GN+Q QN ++ YD+ + L F C
Sbjct: 409 AMSTFGNYQQQNVHLLYDIHEETLSFAPAKCS 440
>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 126/395 (31%), Positives = 176/395 (44%), Gaps = 47/395 (11%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + GTPP+ ILDTGS L W C C C P + PK SSS R +G
Sbjct: 88 GEYFMDVFIGTPPKHYSLILDTGSDLNWIQCV---PCHDCFEQNGPYYDPKESSSFRNIG 144
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSET--LNL 203
C +P+C + D PL K Q CP Y YG S T G +ET +NL
Sbjct: 145 CHDPRCHLVSSP--------DPPLPC-KAENQTCP-YFYWYGDSSNTTGDFATETFTVNL 194
Query: 204 PN-------RIIPNFLVGCSVLSSRQPAGIAGFGRGKT---SLPSQLNL---DKFSYCLL 250
+ + + N + GC + G +G S SQL FSYCL+
Sbjct: 195 TSPTGKSEFKRVENVMFGCGHWNRGLFHGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLV 254
Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
DT +S LI G L +T V + N +YYV ++ I VG
Sbjct: 255 DRN-SDTNVSSKLIF--GEDKDLLNHPELNFTTLVG----GKENPVDTFYYVQIKSIMVG 307
Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
G+ + + + DG GGTIVDSGTT ++ ++ + D FV ++ + Y
Sbjct: 308 GEVLNIPESTWNMTSDGVGGTIVDSGTTLSYFTEPAYQIIKDAFVKKV---KGYPI---V 361
Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDRE 429
+ L PC++V G + P+ + F GA PVENYF + VCL ++ T R
Sbjct: 362 QDFPILDPCYNVSGVEKIDLPDFGILFADGAVWNFPVENYFIRLDPEEVVCLAILGTPRS 421
Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
A I+GN+Q QN++V YD + RLG+ C
Sbjct: 422 ALS----IIGNYQQQNFHVLYDTKKSRLGYAPMNC 452
>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
Length = 441
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 119/381 (31%), Positives = 172/381 (45%), Gaps = 39/381 (10%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + ++ GTPP + +LDTGS L+W C C+ C P + P S++ + C+
Sbjct: 92 YLVDIAIGTPPLPLTAVLDTGSDLIWTQC--DAPCRRCFPQPAPLYAPARSATYANVSCR 149
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNL-PNR 206
+P C + +C + C +Y YG G T+G+ +ET L +
Sbjct: 150 SPMCQALQSPWSRCSPPD-------TGC-----AYYFSYGDGTSTDGVLATETFTLGSDT 197
Query: 207 IIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSL 263
+ GC ++ S+ +G+ G GRG SL SQL + +FSYC + T S L
Sbjct: 198 AVRGVAFGCGTENLGSTDNSSGLVGMGRGPLSLVSQLGVTRFSYCFTPF---NATAASPL 254
Query: 264 ILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTL 323
L + + S T TPFV +PS R S YYY+ L ITVG + + L
Sbjct: 255 FLGSSARLSSAAKT----TPFVPSPSGGARRR-SSYYYLSLEGITVGDTLLPIDPAVFRL 309
Query: 324 DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP 383
G+GG I+DSGTTFT + F LA S++ L + A GL CF
Sbjct: 310 TPMGDGGVIIDSGTTFTALEESAFVALARALASRV------RLPLASGAHLGLSLCFAAA 363
Query: 384 GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQM 443
+ P L LHF GA++ L E+Y CL +V+ R S +LG+ Q
Sbjct: 364 SPEAVEVPRLVLHFD-GADMELRRESYVVEDRSAGVACLGMVSARGMS-----VLGSMQQ 417
Query: 444 QNYYVEYDLRNQRLGFKQQLC 464
QN ++ YDL L F+ C
Sbjct: 418 QNTHILYDLERGILSFEPAKC 438
>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
protein [Arabidopsis thaliana]
gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 452
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 132/429 (30%), Positives = 184/429 (42%), Gaps = 53/429 (12%)
Query: 57 TRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFP 116
TR LH + + K + + +S S G Y + L G PPQ + I DTGS LVW
Sbjct: 53 TRRLHFLSLRRKPIPFVKSPVVSGAASGS-GQYFVDLRIGQPPQSLLLIADTGSDLVWVK 111
Query: 117 CTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNC 176
C+ C + S + + F P+ SS+ C +P C + + P+
Sbjct: 112 CSACRNCSHHSPATV--FFPRHSSTFSPAHCYDPVCRLVPKP-------DRAPICNHTRI 162
Query: 177 TQICPSYLVLYGSG-LTEGIALSETLNLP-----NRIIPNFLVGCSVLSSRQPA------ 224
C Y Y G LT G+ ET +L + + GC S Q
Sbjct: 163 HSTC-HYEYGYADGSLTSGLFARETTSLKTSSGKEARLKSVAFGCGFRISGQSVSGTSFN 221
Query: 225 ---GIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTG 278
G+ G GRG S SQL +KFSYCL+ + TS LI+ NG K
Sbjct: 222 GANGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPP-TSYLIIGNGGDGISK---- 276
Query: 279 LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTT 338
L +TP + NP +YYV L+ + V G ++R+ +D GNGGT+VDSGTT
Sbjct: 277 LFFTPLLTNPLSP------TFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGTT 330
Query: 339 FTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT-GLRPCFDVPG--EKTGSFPELKL 395
F+A EP + S + R + A+ALT G C +V G + P LK
Sbjct: 331 LAFLA----EP---AYRSVIAAVRRRVKLPIADALTPGFDLCVNVSGVTKPEKILPRLKF 383
Query: 396 HFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQ 455
F GGA P NYF + E CL + + G ++GN Q + E+D
Sbjct: 384 EFSGGAVFVPPPRNYF-IETEEQIQCLAIQSVDPKVG--FSVIGNLMQQGFLFEFDRDRS 440
Query: 456 RLGFKQQLC 464
RLGF ++ C
Sbjct: 441 RLGFSRRGC 449
>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
Full=Nepenthesin-I; Flags: Precursor
gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
Length = 437
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 121/385 (31%), Positives = 176/385 (45%), Gaps = 50/385 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y ++LS GTP Q I+DTGS L+W C C C + P F P+ SSS L
Sbjct: 93 GEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQ---PCTQCFNQSTPIFNPQGSSSFSTLP 149
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C + C + +S C+ Y YG G T+G +ETL +
Sbjct: 150 CSSQLCQAL----------------SSPTCSNNFCQYTYGYGDGSETQGSMGTETLTFGS 193
Query: 206 RIIPNFLVGCSV----LSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
IPN GC AG+ G GRG SLPSQL++ KFSYC+ ++ S
Sbjct: 194 VSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTPIG---SSTPS 250
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
+L+L S ++ T G +P N ++ + + +YY+ L ++VG R+ +
Sbjct: 251 NLLL---GSLANSVTAG---SP---NTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAF 301
Query: 322 TLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
L+ +G GG I+DSGTT T+ ++ + EF+SQ+ N G+ + G CF
Sbjct: 302 ALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQI----NLPVVNGSSS--GFDLCF 355
Query: 381 DVPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
P + + P +HF GG ++ LP ENYF G +CL + + + I G
Sbjct: 356 QTPSDPSNLQIPTFVMHFDGG-DLELPSENYFISPSNG-LICLAMGSSSQGMS----IFG 409
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
N Q QN V YD N + F C
Sbjct: 410 NIQQQNMLVVYDTGNSVVSFASAQC 434
>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 514
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 135/415 (32%), Positives = 194/415 (46%), Gaps = 47/415 (11%)
Query: 65 PQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCK 124
P+ T + ++ S G Y + L GTPP+ I+DTGS L W C C
Sbjct: 129 PRRALAERIVATVESGVAVGS-GEYLVDLYVGTPPRRFQMIMDTGSDLNWLQCA---PCL 184
Query: 125 YCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYL 184
C + P F P S S R + C +P+C + + P A + + CP Y
Sbjct: 185 DCFEQRGPVFDPAASLSYRNVTCGDPRCGLVAPPT--------APRACRRPHSDPCP-YY 235
Query: 185 VLYG--SGLTEGIALSE-TLNL----PNRIIPNFLVGCSVLSSR----QPAGIAGFGRGK 233
YG S T +AL T+NL +R + + + GC S+R AG+ G GRG
Sbjct: 236 YWYGDQSNTTGDLALEAFTVNLTAPGASRRVDDVVFGCG-HSNRGLFHGAAGLLGLGRGA 294
Query: 234 TSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSV 290
S SQL FSYCL+ H ++ S ++ G + L YT +
Sbjct: 295 LSFASQLRAVYGHAFSYCLVDHG---SSVGSKIVF--GDDDALLGHPRLNYT----AFAP 345
Query: 291 AERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPL 350
+ A +YYV L+ + VGG+++ + + +DG+GGTI+DSGTT ++ A +E +
Sbjct: 346 SAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVGKDGSGGTIIDSGTTLSYFAEPAYEVI 405
Query: 351 ADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENY 410
FV +M K L A+ L PC++V G + PE L F GA P ENY
Sbjct: 406 RRAFVERMDK----AYPLVAD-FPVLSPCYNVSGVERVEVPEFSLLFADGAVWDFPAENY 460
Query: 411 FAVVGEGSAVCLTVV-TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
F + +CL V+ T R A I+GNFQ QN++V YDL+N RLGF + C
Sbjct: 461 FVRLDPDGIMCLAVLGTPRSAMS----IIGNFQQQNFHVLYDLQNNRLGFAPRRC 511
>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
Length = 510
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 125/404 (30%), Positives = 190/404 (47%), Gaps = 62/404 (15%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + L GTP + I+DTGS + W C CK C + P F P+ SSS L C
Sbjct: 138 YYVPLQLGTPAVEVVLIMDTGSDVSWIQCV---PCKDCVPALRPPFNPRHSSSFFKLPCA 194
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETL--NLPN 205
+ C+ ++ +P + T + + + YG G L+ G+ ET+ N PN
Sbjct: 195 SSTCTNVYQ--------GVKPFCSPSGRTCL---FSIQYGDGSLSSGLLAMETIAGNTPN 243
Query: 206 ------RIIPNFLVGCSVLSSRQ----PAGIAGFGRGKTSLPSQLN---LDKFSYCL--- 249
+ N +GC+ + +G+ G R S PSQL+ KFS+C
Sbjct: 244 FGDGEPVKLSNITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCFPDK 303
Query: 250 LSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
++H +S L+ SD + L YTP V NP+V +A YYYVGL I+V
Sbjct: 304 IAH-----LNSSGLVF---FGESDIISPYLRYTPLVQNPAVP--SASLDYYYVGLVGISV 353
Query: 310 GGQRVRVWHKYLTLDR-DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
R+ + HK +D+ G+GGTI+DSGT FT++ F+ + EF+++ +
Sbjct: 354 DESRLPLSHKNFDIDKVTGSGGTIIDSGTAFTYLKKPAFQAMRREFLART------SHLA 407
Query: 369 GAEALTGLRPCFDV----PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVG---EGSAVC 421
+ +G PC+++ ++ P + LHF+GG +V LP + V E + +C
Sbjct: 408 KVDDNSGFTPCYNITSGTAALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLC 467
Query: 422 LTVVTDREASGG-PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
L + SG P I+GN+Q QN +VEYDL RLG C
Sbjct: 468 LAF----QMSGDIPFNIIGNYQQQNLWVEYDLEKLRLGIAPAQC 507
>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
Length = 511
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 123/403 (30%), Positives = 189/403 (46%), Gaps = 60/403 (14%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + L GTP + I+DTGS + W C CK C + P F P+ SSS L C
Sbjct: 139 YYVPLQVGTPAVEVVLIMDTGSDVSWIQCV---PCKDCVPALRPPFNPRHSSSFFKLPCA 195
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETL--NLPN 205
+ C+ ++ +P + T + + + YG G L+ G+ ET+ N PN
Sbjct: 196 SSTCTNVYQ--------GVKPFCSPSGRTCL---FSIQYGDGSLSSGLLAMETIAGNTPN 244
Query: 206 ------RIIPNFLVGCSVLSSRQ----PAGIAGFGRGKTSLPSQLN---LDKFSYCL--- 249
+ N +GC+ + +G+ G R S PSQL+ KFS+C
Sbjct: 245 FGDGEPVKLSNITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCFPDK 304
Query: 250 LSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
++H +S L+ SD + L YTP V NP+V +A YYYVGL I+V
Sbjct: 305 IAH-----LNSSGLVF---FGESDIISPYLRYTPLVQNPAVP--SASLDYYYVGLVGISV 354
Query: 310 GGQRVRVWHKYLTLDR-DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
R+ + HK +D+ G+GGTI+DSGT FT++ F+ + EF+++ +
Sbjct: 355 DESRLPLSHKNFDIDKVTGSGGTIIDSGTAFTYLKKPAFQAMRREFLART------SHLA 408
Query: 369 GAEALTGLRPCFDV----PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVG---EGSAVC 421
+ +G PC+++ ++ P + LHF+GG +V LP + V E + +C
Sbjct: 409 KVDDNSGFTPCYNITSGTAALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLC 468
Query: 422 LTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
L + + P I+GN+Q QN +VEYDL RLG C
Sbjct: 469 LAFLMSGDI---PFNIIGNYQQQNLWVEYDLEKLRLGIAPAQC 508
>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
Length = 514
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 135/415 (32%), Positives = 194/415 (46%), Gaps = 47/415 (11%)
Query: 65 PQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCK 124
P+ T + ++ S G Y + L GTPP+ I+DTGS L W C C
Sbjct: 129 PRRALAERIVATVESGVAVGS-GEYLVDLYVGTPPRRFQMIMDTGSDLNWLQCA---PCL 184
Query: 125 YCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYL 184
C + P F P S S R + C +P+C + + P A + + CP Y
Sbjct: 185 DCFEQRGPVFDPATSLSYRNVTCGDPRCGLVAPPT--------APRACRRPHSDPCP-YY 235
Query: 185 VLYG--SGLTEGIALSE-TLNL----PNRIIPNFLVGCSVLSSR----QPAGIAGFGRGK 233
YG S T +AL T+NL +R + + + GC S+R AG+ G GRG
Sbjct: 236 YWYGDQSNTTGDLALEAFTVNLTAPGASRRVDDVVFGCG-HSNRGLFHGAAGLLGLGRGA 294
Query: 234 TSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSV 290
S SQL FSYCL+ H ++ S ++ G + L YT +
Sbjct: 295 LSFASQLRAVYGHAFSYCLVDHG---SSVGSKIVF--GDDDALLGHPRLNYT----AFAP 345
Query: 291 AERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPL 350
+ A +YYV L+ + VGG+++ + + +DG+GGTI+DSGTT ++ A +E +
Sbjct: 346 SAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVGKDGSGGTIIDSGTTLSYFAEPAYEVI 405
Query: 351 ADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENY 410
FV +M K L A+ L PC++V G + PE L F GA P ENY
Sbjct: 406 RRAFVERMDK----AYPLVAD-FPVLSPCYNVSGVERVEVPEFSLLFADGAVWDFPAENY 460
Query: 411 FAVVGEGSAVCLTVV-TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
F + +CL V+ T R A I+GNFQ QN++V YDL+N RLGF + C
Sbjct: 461 FVRLDPDGIMCLAVLGTPRSAMS----IIGNFQQQNFHVLYDLQNNRLGFAPRRC 511
>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
Length = 451
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 133/406 (32%), Positives = 186/406 (45%), Gaps = 58/406 (14%)
Query: 79 TNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKI------- 131
+S S G+S+++ GTPPQ I+DTGS L+W QCK SS+ +
Sbjct: 81 VRLSPLSDQGHSLTVGIGTPPQPRKLIVDTGSDLIW------TQCKLSSSTAVAARHGSP 134
Query: 132 PSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKN-CTQICPSYLVLYGSG 190
P + P SS+ L C + C ++C TSKN C Y +YGS
Sbjct: 135 PVYDPGESSTFAFLPCSDRLC---QEGQFSFKNC------TSKNRCV-----YEDVYGSA 180
Query: 191 LTEGIALSETLNLPNRIIPNFLVG--CSVLSSRQ---PAGIAGFGRGKTSLPSQLNLDKF 245
G+ SET R + +G C LS+ GI G SL +QL + +F
Sbjct: 181 AAVGVLASETFTFGARRAVSLRLGFGCGALSAGSLIGATGILGLSPESLSLITQLKIQRF 240
Query: 246 SYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTG-LTYTPFVNNPSVAERNAFSVYYYVGL 304
SYCL F D +TS L+ + S KTT + T V+NP +VYYYV L
Sbjct: 241 SYCLT--PFADK-KTSPLLFGAMADLSRHKTTRPIQTTAIVSNP------VKTVYYYVPL 291
Query: 305 RRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNY 364
I++G +R+ V L + DG GGTIVDSG+T ++ FE + E V +V+
Sbjct: 292 VGISLGHKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAV-KEAVMDVVRLPVA 350
Query: 365 TRALGAEALTGLRPCFDVPGEKTGS------FPELKLHFKGGAEVTLPVENYFAVVGEGS 418
R + L CF +P + P L LHF GGA + LP +NYF G
Sbjct: 351 NRTVEDYEL-----CFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDNYFQEPRAG- 404
Query: 419 AVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+CL V + SG I+GN Q QN +V +D+++ + F C
Sbjct: 405 LMCLAVGKTTDGSG--VSIIGNVQQQNMHVLFDVQHHKFSFAPTQC 448
>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
Length = 510
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 135/427 (31%), Positives = 189/427 (44%), Gaps = 50/427 (11%)
Query: 54 SSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLV 113
S + R +P+ + T + ++ S G Y I + GTPP+ I+DTGS L
Sbjct: 115 SGVARMPASSSPRRALSERMVATVESGVAVGS-GEYLIDVYVGTPPRRFRMIMDTGSDLN 173
Query: 114 WFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATS 173
W C C C + P F P SSS R + C + +C + + P A
Sbjct: 174 WLQCA---PCLDCFEQRGPVFDPAASSSYRNVTCGDQRCGLVAPP--------EAPRACR 222
Query: 174 KNCTQICPSYLVLYG--SGLTEGIAL-SETLNL----PNRIIPNFLVGCSVLSS---RQP 223
+ CP Y YG S T +AL S T+NL +R + + GC +
Sbjct: 223 RPAEDSCP-YYYWYGDQSNTTGDLALESFTVNLTAPGASRRVDGVVFGCGHRNRGLFHGA 281
Query: 224 AGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLT 280
AG+ G GRG S SQL FSYCL+ H D ++ G + L
Sbjct: 282 AGLLGLGRGPLSFASQLRAVYGHTFSYCLVEHGSDAGSKVVF-----GEDYLVLAHPQLK 336
Query: 281 YTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFT 340
YT F S A+ +YYV L+ + VGG + + + +DG+GGTI+DSGTT +
Sbjct: 337 YTAFAPTSSPAD-----TFYYVKLKGVLVGGDLLNISSDTWDVGKDGSGGTIIDSGTTLS 391
Query: 341 FMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGG 400
+ ++ + FV M +R Y L PC++V G + PEL L F G
Sbjct: 392 YFVEPAYQVIRQAFVDLM--SRLYPL---IPDFPVLNPCYNVSGVERPEVPELSLLFADG 446
Query: 401 AEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI---ILGNFQMQNYYVEYDLRNQRL 457
A P ENYF + +CL V G P I+GNFQ QN++V YDL+N RL
Sbjct: 447 AVWDFPAENYFVRLDPDGIMCLAV------RGTPRTGMSIIGNFQQQNFHVVYDLQNNRL 500
Query: 458 GFKQQLC 464
GF + C
Sbjct: 501 GFAPRRC 507
>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
Length = 448
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 116/392 (29%), Positives = 168/392 (42%), Gaps = 45/392 (11%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + ++ GTPPQ + ILDTGS L W C C C +P F P S + +L C
Sbjct: 85 YLVHMAIGTPPQPVQLILDTGSDLTWTQCA---PCVSCFRQSLPRFNPSRSMTFSVLPC- 140
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRI- 207
+ CRD IC +T G S+T + +
Sbjct: 141 ---------DLRICRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADH 191
Query: 208 ------IPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDT 257
+P+ GC + ++ GIAGF RG S+P+QL +D FSYC + +
Sbjct: 192 AIGGASVPDLTFGCGLFNNGIFVSNETGIAGFSRGALSMPAQLKVDNFSYCFTAITGSEP 251
Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
+ + N S + G+ + + ++ A YY+ L+ +TVG R+ +
Sbjct: 252 SPVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKA----YYISLKGVTVGTTRLPIP 307
Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM-VKNRNYTRALGAEALTGL 376
L DG GGTIVDSGT T + ++ + D FV+Q + N T +L
Sbjct: 308 ESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLS------- 360
Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAV---CLTVVTDREASGG 433
+ CF VP P L LHF+ GA + LP ENY + E + CL + + S
Sbjct: 361 QLCFSVPPGAKPDVPALVLHFE-GATLDLPRENYMFEIEEAGGIRLTCLAINAGEDLS-- 417
Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
++GNFQ QN +V YDL N L F C
Sbjct: 418 ---VIGNFQQQNMHVLYDLANDMLSFVPARCN 446
>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
Full=Nepenthesin-II; Flags: Precursor
gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
Length = 438
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 119/384 (30%), Positives = 171/384 (44%), Gaps = 48/384 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y ++++ GTP I+DTGS L+W C C C S P F P+ SSS L
Sbjct: 94 GEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCE---PCTQCFSQPTPIFNPQDSSSFSTLP 150
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C++ C + E+ +C Y YG G T+G +ET
Sbjct: 151 CESQYCQDLPSETCNNNECQ----------------YTYGYGDGSTTQGYMATETFTFET 194
Query: 206 RIIPNFLVGCSV----LSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
+PN GC AG+ G G G SLPSQL + +FSYC+ S+ ++ S
Sbjct: 195 SSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCMTSYG---SSSPS 251
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
+L L + +S + + T NP+ YYY+ L+ ITVGG + +
Sbjct: 252 TLALGSAASGVPEGSPSTTLIHSSLNPT---------YYYITLQGITVGGDNLGIPSSTF 302
Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
L DG GG I+DSGTT T++ + + +A F Q+ N E+ +GL CF
Sbjct: 303 QLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQI----NLPTV--DESSSGLSTCFQ 356
Query: 382 VPGE-KTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
P + T PE+ + F GG + L +N EG +CL + + + G S I GN
Sbjct: 357 QPSDGSTVQVPEISMQFDGGV-LNLGEQNILISPAEG-VICLAMGSSSQL--GIS-IFGN 411
Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
Q Q V YDL+N + F C
Sbjct: 412 IQQQETQVLYDLQNLAVSFVPTQC 435
>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
Length = 441
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 120/433 (27%), Positives = 193/433 (44%), Gaps = 51/433 (11%)
Query: 46 QNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNI-SSHSYGGYSISLSFGTPPQIIPF 104
Q L+ ++ S R +++ T + + S G Y + L+ GTPP
Sbjct: 45 QLLSRAIARSKARVAALQSAAVSPAPVADPITAARVLVTASSGEYLVDLAIGTPPLYYTA 104
Query: 105 ILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRD 164
I+DTGS L+W C C C++ P F K S++ R L C++ +C+ +
Sbjct: 105 IMDTGSDLIWTQCA---PCLLCAAQPTPYFDVKRSATYRALPCRSSRCAAL--------- 152
Query: 165 CNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNL-----PNRIIPNFLVGCSVL 218
+S +C + Y YG + T G+ +ET N GC L
Sbjct: 153 -------SSPSCFKKMCVYQYYYGDTASTAGVLANETFTFGAASSTKVRAANISFGCGSL 205
Query: 219 SSRQPA---GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKK 275
++ + A G+ GFGRG SL SQL +FSYCL S+ +R + N +S +
Sbjct: 206 NAGELANSSGMVGFGRGPLSLVSQLGPSRFSYCLTSYLSPTPSRLYFGVFANLNSTNTSS 265
Query: 276 TTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDS 335
+ + TPFV NP++ Y++ ++ I++G +R+ + ++ DG GG I+DS
Sbjct: 266 GSPVQSTPFVINPALPNM------YFLSVKGISLGTKRLPIDPLVFAINDDGTGGVIIDS 319
Query: 336 GTTFTFMAPELFEPLADEFVSQM-VKNRNYTRALGAEALTGLRPCFDVPGEK--TGSFPE 392
GT+ T++ + +E + S + + N T GL CF P T + P+
Sbjct: 320 GTSITWLQQDAYEAVRRGLASTIPLPAMNDTD-------IGLDTCFQWPPPPNVTVTVPD 372
Query: 393 LKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDL 452
HF GA +TLP ENY + +CL + + I+GN+Q QN ++ YD+
Sbjct: 373 FVFHFD-GANMTLPPENYMLIASTTGYLCLAMAPTSVGT-----IIGNYQQQNLHLLYDI 426
Query: 453 RNQRLGFKQQLCK 465
N L F C
Sbjct: 427 ANSFLSFVPAPCD 439
>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 546
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 126/394 (31%), Positives = 177/394 (44%), Gaps = 45/394 (11%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y I + GTPP+ ILDTGS L W C C C P + P SSS R +G
Sbjct: 179 GEYFIDVFVGTPPKHFSLILDTGSDLNWIQCV---PCYECFEQNGPHYDPGQSSSYRNIG 235
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG--SGLTEGIALSE-TLNL 203
C + +C + + +P K Q CP Y YG S T AL T+NL
Sbjct: 236 CHDSRCHLV---------SSPDPPQPCKAENQTCP-YYYWYGDSSNTTGDFALETFTVNL 285
Query: 204 ------PN-RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLL 250
P R + N + GC + AG+ G GRG S SQL FSYCL+
Sbjct: 286 TMSSGKPELRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLV 345
Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
D +S LI G L +T V + N +YYV ++ I VG
Sbjct: 346 DRN-SDANVSSKLIF--GEDKDLLSHPELNFTTLV----AGKENPVDTFYYVQIKSIVVG 398
Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
G+ V + + + DG+GGTI+DSGTT ++ A ++ + + F M K + Y
Sbjct: 399 GEVVNIPEEKWQIATDGSGGTIIDSGTTLSYFAEPAYQVIKEAF---MAKVKGYPV---V 452
Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA 430
+ L PC++V G + P+ + F GA PVENYF + VCL ++ +
Sbjct: 453 KDFPVLEPCYNVTGVEQPDLPDFGIVFSDGAVWNFPVENYFIEIEPREVVCLAILGTPPS 512
Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ I+GN+Q QN+++ YD + RLGF C
Sbjct: 513 ALS---IIGNYQQQNFHILYDTKKSRLGFAPTKC 543
>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
Length = 454
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 129/397 (32%), Positives = 175/397 (44%), Gaps = 58/397 (14%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + +S GTPP+ + LDTGS LVW C C P P SS+ L C
Sbjct: 90 YLMHVSVGTPPRPVALTLDTGSDLVWTQCAPCLDC--FEQGAAPVLDPAASSTHAALPCD 147
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETL------ 201
P C + S R D ++C Y+ YG LT G +++
Sbjct: 148 APLCRALPFTSCGGRSWGD------RSCV-----YVYHYGDRSLTVGQLATDSFTFGGDD 196
Query: 202 NLPNRIIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDT 257
N GC ++ GIAGFGRG+ SLPSQLN+ FSYC S FD
Sbjct: 197 NAGGLAARRVTFGCGHINKGIFQANETGIAGFGRGRWSLPSQLNVTSFSYCFTS-MFD-- 253
Query: 258 TRTSSLILDNGSS------HSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG 311
T++SS++ ++ H T + T + NPS Y+V LR I+VGG
Sbjct: 254 TKSSSVVTLGAAAAELLHTHHAAHTGDVRTTRLIKNPSQPS------LYFVPLRGISVGG 307
Query: 312 QRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
RV V L TI+DSG + T + +++E + EFVSQ+ A
Sbjct: 308 ARVAVPESRL------RSSTIIDSGASITTLPEDVYEAVKAEFVSQV------GLPAAAA 355
Query: 372 ALTGLRPCFDVPGE---KTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR 428
L CF +P + + P L LH GGA+ LP NY V + +A L VV D
Sbjct: 356 GSAALDLCFALPVAALWRRPAVPALTLHLDGGADWELPRGNY--VFEDYAARVLCVVLD- 412
Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
A+ G +++GN+Q QN +V YDL N L F C
Sbjct: 413 -AAAGEQVVIGNYQQQNTHVVYDLENDVLSFAPARCD 448
>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 441
Score = 149 bits (375), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 126/447 (28%), Positives = 195/447 (43%), Gaps = 50/447 (11%)
Query: 31 FSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYS 90
L+ S Q L+ ++ S R +++ T + + S G Y
Sbjct: 31 LKLTHVDAGTSYTKLQLLSRAIARSKARVAALQSAAVLPPVVDPITAARVLVTASSGEYL 90
Query: 91 ISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNP 150
+ L+ GTPP I+DTGS L+W C C C+ P F K S++ R L C++
Sbjct: 91 VDLAIGTPPLYYTAIMDTGSDLIWTQCA---PCLLCADQPTPYFDVKKSATYRALPCRSS 147
Query: 151 KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETL-----NLP 204
+C+ + +S +C + Y YG + T G+ +ET N
Sbjct: 148 RCASL----------------SSPSCFKKMCVYQYYYGDTASTAGVLANETFTFGAANST 191
Query: 205 NRIIPNFLVGCSVLSSRQPA---GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
N GC L++ A G+ GFGRG SL SQL +FSYCL S+ +R
Sbjct: 192 KVRATNIAFGCGSLNAGDLANSSGMVGFGRGPLSLVSQLGPSRFSYCLTSYLSATPSRLY 251
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
+ N SS + + + TPFV NP++ Y++ L+ I++G + + +
Sbjct: 252 FGVYANLSSTNTSSGSPVQSTPFVINPALPNM------YFLSLKAISLGTKLLPIDPLVF 305
Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM-VKNRNYTRALGAEALTGLRPCF 380
++ DG GG I+DSGT+ T++ + +E + VS + + N T GL CF
Sbjct: 306 AINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVSAIPLPAMNDTD-------IGLDTCF 358
Query: 381 DVPGEK--TGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
P T + P+L HF A +TL ENY + +CL + A G I+
Sbjct: 359 QWPPPPNVTVTVPDLVFHFD-SANMTLLPENYMLIASTTGYLCLVM-----APTGVGTII 412
Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLCK 465
GN+Q QN ++ YD+ N L F C
Sbjct: 413 GNYQQQNLHLLYDIGNSFLSFVPAPCD 439
>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
Length = 474
Score = 149 bits (375), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 116/389 (29%), Positives = 168/389 (43%), Gaps = 39/389 (10%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + ++ GTPPQ + ILDTGS L W C C C +P F P S + +L C
Sbjct: 111 YLVHMAIGTPPQPVQLILDTGSDLTWTQCA---PCVSCFRQSLPRFNPSRSMTFSVLPC- 166
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRI- 207
+ CRD IC +T G S+T + +
Sbjct: 167 ---------DLRICRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADH 217
Query: 208 ------IPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDT 257
+P+ GC + ++ GIAGF RG S+P+QL +D FSYC + +
Sbjct: 218 AIGGASVPDLTFGCGLFNNGIFVSNETGIAGFSRGALSMPAQLKVDNFSYCFTAITGSEP 277
Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
+ + N S + G+ + + ++ A YY+ L+ +TVG R+ +
Sbjct: 278 SPVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKA----YYISLKGVTVGTTRLPIP 333
Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM-VKNRNYTRALGAEALTGL 376
L DG GGTIVDSGT T + ++ + D FV+Q + N T +L
Sbjct: 334 ESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLS------- 386
Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
+ CF VP P L LHF+ GA + LP ENY + E + LT + +G
Sbjct: 387 QLCFSVPPGAKPDVPALVLHFE-GATLDLPRENYMFEIEEAGGIRLTCLAIN--AGEDLS 443
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
++GNFQ QN +V YDL N L F C
Sbjct: 444 VIGNFQQQNMHVLYDLANDMLSFVPARCN 472
>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
Length = 474
Score = 149 bits (375), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 116/389 (29%), Positives = 168/389 (43%), Gaps = 39/389 (10%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + ++ GTPPQ + ILDTGS L W C C C +P F P S + +L C
Sbjct: 111 YLVHMAIGTPPQPVQLILDTGSDLTWTQCA---PCVSCFRQSLPRFNPSRSMTFSVLPC- 166
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRI- 207
+ CRD IC +T G S+T + +
Sbjct: 167 ---------DLRICRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADH 217
Query: 208 ------IPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDT 257
+P+ GC + ++ GIAGF RG S+P+QL +D FSYC + +
Sbjct: 218 AIGGASVPDLTFGCGLFNNGIFVSNETGIAGFSRGALSMPAQLKVDNFSYCFTAITGSEP 277
Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
+ + N S + G+ + + ++ A YY+ L+ +TVG R+ +
Sbjct: 278 SPVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKA----YYISLKGVTVGTTRLPIP 333
Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM-VKNRNYTRALGAEALTGL 376
L DG GGTIVDSGT T + ++ + D FV+Q + N T +L
Sbjct: 334 ESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLS------- 386
Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
+ CF VP P L LHF+ GA + LP ENY + E + LT + +G
Sbjct: 387 QLCFSVPPGAKPDVPALVLHFE-GATLDLPRENYMFEIEEAGGIRLTCLAIN--AGEDLS 443
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
++GNFQ QN +V YDL N L F C
Sbjct: 444 VIGNFQQQNMHVLYDLANDMLSFVPARCN 472
>gi|18414692|ref|NP_567506.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|15809800|gb|AAL06828.1| AT4g16560/dl4305c [Arabidopsis thaliana]
gi|18377815|gb|AAL67094.1| AT4g16560/dl4305c [Arabidopsis thaliana]
gi|332658370|gb|AEE83770.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 499
Score = 148 bits (374), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 131/408 (32%), Positives = 189/408 (46%), Gaps = 65/408 (15%)
Query: 106 LDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDC 165
LDTGS LVWFPC + C C S +P P SSS + H S+ D
Sbjct: 100 LDTGSDLVWFPC-RPFTCILCESKPLPPSPPSSLSSSATTVSCSSPSCSAAHSSLPSSD- 157
Query: 166 NDEPLATSKNC-------------TQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFL 212
L NC + CP + YG G S++L+LP+ + NF
Sbjct: 158 ----LCAISNCPLDFIETGDCNTSSYPCPPFYYAYGDGSLVAKLYSDSLSLPSVSVSNFT 213
Query: 213 VGCSVLSSRQPAGIAGFGRGKTSLPSQLNL------DKFSYCLLSHKFDD--TTRTSSLI 264
GC+ + +P G+AGFGRG+ SLP+QL + + FSYCL+SH FD R S LI
Sbjct: 214 FGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPHLGNSFSYCLVSHSFDSDRVRRPSPLI 273
Query: 265 LDNGSSHSDKKT----------------TGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
L +K+ +T + NP +Y V L+ I+
Sbjct: 274 LGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFTEMLENPK------HPYFYSVSLQGIS 327
Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
+G + + +D++G GG +VDSGTTFT + + + + +EF S++ R + RA
Sbjct: 328 IGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRV--GRVHERAD 385
Query: 369 GAEALTGLRPCFDVPGEKTGSFPELKLHFKGG-AEVTLPVENYFAVVGEG--------SA 419
E +G+ PC+ + +T P L LHF G + VTLP NYF +G
Sbjct: 386 RVEPSSGMSPCYYL--NQTVKVPALVLHFAGNRSSVTLPRRNYFYEFMDGGDGKEEKRKI 443
Query: 420 VCLTVVT---DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
CL ++ + E GG ILGN+Q Q + V YDL N+R+GF ++ C
Sbjct: 444 GCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGFAKRKC 491
>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 457
Score = 147 bits (372), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 138/473 (29%), Positives = 210/473 (44%), Gaps = 48/473 (10%)
Query: 10 LSFIFFFTLLSIFPS----------SITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRA 59
+ F+FFF L SI S + ++ FSLS T+ S + L ++ +SL
Sbjct: 10 MHFLFFFLLSSIHLSVQLNHTTTTTNNSTSLFSLSFPLTSLSLSTNTALKMMLRNSLIAN 69
Query: 60 LHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTN 119
+ N Q K+ ++ +S + L GTPPQ+ P +LDTGS L W
Sbjct: 70 TNNNNTQLKSPPSSPYNY--KLSFKYSMALIVDLPIGTPPQVQPMVLDTGSQLSWI---- 123
Query: 120 HYQCKYCSSSKIP---SFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNC 176
QC + +K P SF P LSS+ L C +P C D L TS +
Sbjct: 124 --QCHKKAPAKPPPTASFDPSLSSTFSTLPCTHPVCK---------PRIPDFTLPTSCDQ 172
Query: 177 TQICPSYLVLYGSG-LTEGIALSETLNLPNRII-PNFLVGCSVLSSRQPAGIAGFGRGKT 234
++C Y Y G EG + E + P ++GC+ S P GI G RG+
Sbjct: 173 NRLC-HYSYFYADGTYAEGNLVREKFTFSRSLFTPPLILGCAT-ESTDPRGILGMNRGRL 230
Query: 235 SLPSQLNLDKFSYCLLSH-KFDDTTRTSSLIL-DNGSSHSDKKTTGLTYTPFVNNPSVAE 292
S SQ + KFSYC+ + T T S L N +S++ + LT+ P
Sbjct: 231 SFASQSKITKFSYCVPTRVTRPGYTPTGSFYLGHNPNSNTFRYIEMLTFARSQRMP---- 286
Query: 293 RNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLAD 352
N + Y V L+ I +GG+++ + D G+G T++DSG+ FT++ E ++ +
Sbjct: 287 -NLDPLAYTVALQGIRIGGRKLNISPAVFRADAGGSGQTMLDSGSEFTYLVNEAYDKVRA 345
Query: 353 EFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSF-PELKLHFKGGAEVTLPVENYF 411
E V + G A CFD + G ++ F+ G ++ +P E
Sbjct: 346 EVVRAVGPRMKKGYVYGGVADM----CFDGNAIEIGRLIGDMVFEFEKGVQIVVPKERVL 401
Query: 412 AVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
A V EG C+ + + + G S I+GNF QN +VE+DL N+R+GF C
Sbjct: 402 ATV-EGGVHCIGIA-NSDKLGAASNIIGNFHQQNLWVEFDLVNRRMGFGTADC 452
>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 117/392 (29%), Positives = 176/392 (44%), Gaps = 47/392 (11%)
Query: 85 SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRL 144
S G Y + + G+PP+ ++DTGS L+W C C C P F P S+S
Sbjct: 84 SEGEYLMDVGIGSPPRYFSAMIDTGSDLIWTQCA---PCLLCVEQPTPYFEPAKSTSYAS 140
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETL-- 201
L C + C+ ++ PL C Y YG S + G+ +ET
Sbjct: 141 LPCSSAMCNALY-----------SPLCFQNACV-----YQAFYGDSASSAGVLANETFTF 184
Query: 202 --NLPNRIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
N +P GC +++ +G+ GFGRG SL SQL +FSYCL S
Sbjct: 185 GTNSTRVAVPRVSFGCGNMNAGTLFNGSGMVGFGRGALSLVSQLGSPRFSYCLTSFMSPA 244
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
T+R +S + + + TPF+ NP A Y++ + I+V G + +
Sbjct: 245 TSRLYFGAYATLNSTNTSSSGPVQSTPFIVNP------ALPTMYFLNMTGISVAGDLLPI 298
Query: 317 WHKYLTLDR-DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
++ DG GG I+DSGTT TF+A + + FV+ + RA + T
Sbjct: 299 DPSVFAINETDGTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWV----GLPRANATPSDT- 353
Query: 376 LRPCFDV--PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
CF P + + PE+ LHF GA++ LP+ENY + G +CL ++ + S
Sbjct: 354 FDTCFKWPPPPRRMVTLPEMVLHFD-GADMELPLENYMVMDGGTGNLCLAMLPSDDGS-- 410
Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
I+G+FQ QN+++ YDL N L F C
Sbjct: 411 ---IIGSFQHQNFHMLYDLENSLLSFVPAPCN 439
>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
Length = 447
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 125/393 (31%), Positives = 179/393 (45%), Gaps = 58/393 (14%)
Query: 89 YSISLSFGTPPQIIPFIL--DTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
Y + L+ GTPP +PF+ DTGS L W C CK C P + +SSS +
Sbjct: 93 YLMELAIGTPP--VPFVALADTGSDLTWTQCQ---PCKLCFPQDTPIYDTAVSSSFSPVP 147
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICP--SYLVLYGSGL-TEGIALSETLNL 203
C + C P+ +S+NCT Y YG G + G+ +ETL
Sbjct: 148 CASATC---------------LPIWSSRNCTASSSPCRYRYAYGDGAYSAGVLGTETLTF 192
Query: 204 PNR---IIPNFLVGCSVLS---SRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDT 257
P + GC V + S G G GRG SL +QL + KFSYCL F +T
Sbjct: 193 PGAPGVSVGGIAFGCGVDNGGLSYNSTGTVGLGRGSLSLVAQLGVGKFSYCL--TDFFNT 250
Query: 258 TRTSSLILDNGSSHSDKKT-TGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
+ S ++ + + T + TP V +P V +YYV L I++G R+ +
Sbjct: 251 SLGSPVLFGALAELAAPSTGAAVQSTPLVQSPYVP------TWYYVSLEGISLGDARLPI 304
Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
+ L DG+GG IVDSGTTFTF+ F + D + + +L +
Sbjct: 305 PNGTFDLRDDGSGGMIVDSGTTFTFLVESAFRVVVDHVAGVLRQPVVNASSLDS------ 358
Query: 377 RPCFDVP-GEKT-GSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGP 434
PCF GE+ + P++ LHF GGA++ L +NY + E S+ CL + +G P
Sbjct: 359 -PCFPAATGEQQLPAMPDMVLHFAGGADMRLHRDNYMSFNQEESSFCLNI------AGSP 411
Query: 435 SI---ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
S ILGNFQ QN + +D+ +L F C
Sbjct: 412 SADVSILGNFQQQNIQMLFDITVGQLSFMPTDC 444
>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
Length = 443
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 132/398 (33%), Positives = 185/398 (46%), Gaps = 61/398 (15%)
Query: 85 SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRL 144
S G Y + + GTP + ILDTGS L+W C C C P F P S++ R
Sbjct: 86 SDGEYLMEMGIGTPTRYYSAILDTGSDLIWTQCA---PCLLCVDQPTPYFDPARSATYRS 142
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNL 203
LGC +P C+ +++ PL K C Y YG S T G+ +ET
Sbjct: 143 LGCASPACNALYY-----------PLCYQKVCV-----YQYFYGDSASTAGVLANETFTF 186
Query: 204 ---PNRI-IPNFLVGCSVLSSRQPA---GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
R+ +P GC L++ A G+ GFGRG SL SQL +FSYCL S
Sbjct: 187 GTNETRVSLPGISFGCGNLNAGSLANGSGMVGFGRGSLSLVSQLGSPRFSYCLTSFLSPV 246
Query: 257 TTRT-----SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG 311
+R ++L N SS + T PFV NP A Y++ + I+VGG
Sbjct: 247 PSRLYFGVYATLNSTNASSEPVQST------PFVVNP------ALPTMYFLNMTGISVGG 294
Query: 312 QRVRVWHKYLTL-DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM-VKNRNYTRALG 369
+ + + D DG GGTI+DSGTT T++A ++ + F SQ+ + N T A
Sbjct: 295 YLLPIDPAVFAINDTDGTGGTIIDSGTTITYLAEPAYDAVRAAFASQITLPLLNVTDA-- 352
Query: 370 AEALTGLRPCFDV--PGEKTGSFPELKLHFKGGAEVTLPVENYFAV-VGEGSAVCLTVVT 426
+ L CF P ++ + P+L LHF GA+ LP++NY V G +CL +
Sbjct: 353 ----SVLDTCFQWPPPPRQSVTLPQLVLHFD-GADWELPLQNYMLVDPSTGGGLCLAM-- 405
Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
AS I+G++Q QN+ V YDL N + F C
Sbjct: 406 ---ASSSDGSIIGSYQHQNFNVLYDLENSLMSFVPAPC 440
>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
Length = 438
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 116/392 (29%), Positives = 174/392 (44%), Gaps = 47/392 (11%)
Query: 85 SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRL 144
S G Y + + G+PP+ ++DTGS L+W C C C P F P S+S
Sbjct: 81 SEGEYLMDVGIGSPPRYFSAMIDTGSDLIWTQCA---PCLLCVEQPTPYFEPAKSTSYAS 137
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETL-- 201
L C + C+ ++ PL C Y YG S + G+ +ET
Sbjct: 138 LPCSSAMCNALY-----------SPLCFQNACV-----YQAFYGDSASSAGVLANETFTF 181
Query: 202 --NLPNRIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
N +P GC +++ +G+ GFGRG SL SQL +FSYCL S
Sbjct: 182 GTNSTRVAVPRVSFGCGNMNAGTLFNGSGMVGFGRGALSLVSQLGSPRFSYCLTSFMSPA 241
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
T+R +S + + + TPF+ NP A Y++ + I+V G + +
Sbjct: 242 TSRLYFGAYATLNSTNTSSSGPVQSTPFIVNP------ALPTMYFLNMTGISVAGDLLPI 295
Query: 317 WHKYLTLDR-DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
++ DG GG I+DSGTT TF+A + + FV+ + R A
Sbjct: 296 DPSVFAINETDGTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVGLPRA-----NATPSDT 350
Query: 376 LRPCFDV--PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
CF P + + PE+ LHF GA++ LP+ENY + G +CL ++ + S
Sbjct: 351 FDTCFKWPPPPRRMVTLPEMVLHFD-GADMELPLENYMVMDGGTGNLCLAMLPSDDGS-- 407
Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
I+G+FQ QN+++ YDL N L F C
Sbjct: 408 ---IIGSFQHQNFHMLYDLENSLLSFVPAPCN 436
>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
Length = 437
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 123/384 (32%), Positives = 176/384 (45%), Gaps = 49/384 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y ++++ GTP + I+DTGS L+W C C C S P F P+ SSS L
Sbjct: 94 GEYLMNVAIGTPASSLSAIMDTGSDLIWTQCE---PCTQCFSQPTPIFNPQDSSSFSTLP 150
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNLPN 205
C++ + C D P S++C C Y YG G T+G +ET
Sbjct: 151 CES-------------QYCQDLP---SESCYNDC-QYTYGYGDGSSTQGYMATETFTFET 193
Query: 206 RIIPNFLVGCSV----LSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
+PN GC AG+ G G G SLPSQL + +FSYC+ S ++ S
Sbjct: 194 SSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCMTSSG---SSSPS 250
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
+L L + +S + + T NP+ YYY+ L+ ITVGG + +
Sbjct: 251 TLALGSAASGVPEGSPSTTLIHSSLNPT---------YYYITLQGITVGGDNLGIPSSTF 301
Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
L DG GG I+DSGTT T++ + + +A F Q+ N + E+ +GL CF
Sbjct: 302 QLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQI----NLSPV--DESSSGLSTCFQ 355
Query: 382 VPGE-KTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
+P + T PE+ + F GG + L EN EG +CL + + + G S I GN
Sbjct: 356 LPSDGSTVQVPEISMQFDGGV-LNLGEENVLISPAEG-VICLAMGSSSQQ--GIS-IFGN 410
Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
Q Q V YDL+N + F C
Sbjct: 411 IQQQETQVLYDLQNLAVSFVPTQC 434
>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 527
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 126/398 (31%), Positives = 184/398 (46%), Gaps = 51/398 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + GTPP+ ILDTGS L W C Y C + + + PK S+S + +
Sbjct: 158 GEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNGM---FYDPKTSASFKNIT 214
Query: 147 CQNPKCSWIHHES--IQCRDCNDEPLATSKNCTQICPSYLVLYG--SGLTEGIALSE-TL 201
C +P+CS I +QC N Q CP Y YG S T A+ T+
Sbjct: 215 CNDPRCSLISSPDPPVQCESDN-----------QSCP-YFYWYGDRSNTTGDFAVETFTV 262
Query: 202 NL-------PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYC 248
NL + N + GC + +G+ G GRG S SQL FSYC
Sbjct: 263 NLTTTEGGSSEYKVGNMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSYC 322
Query: 249 LLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
L+ +T +S LI G T L +T FVN + N+ +YY+ ++ I
Sbjct: 323 LVDRN-SNTNVSSKLIF--GEDKDLLNHTNLNFTSFVN----GKENSVETFYYIQIKSIL 375
Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
VGG+ + + + + DG+GGTI+DSGTT ++ A +E + ++F +M +N R
Sbjct: 376 VGGKALDIPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDF 435
Query: 369 GAEALTGLRPCFDVPG--EKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT 426
L PCF+V G E PEL + F G P EN F + E VCL ++
Sbjct: 436 PV-----LDPCFNVSGIEENNIHLPELGIAFVDGTVWNFPAENSFIWLSE-DLVCLAILG 489
Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+++ I+GN+Q QN+++ YD + RLGF C
Sbjct: 490 TPKSTFS---IIGNYQQQNFHILYDTKRSRLGFTPTKC 524
>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
gi|224034427|gb|ACN36289.1| unknown [Zea mays]
Length = 443
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 132/398 (33%), Positives = 185/398 (46%), Gaps = 61/398 (15%)
Query: 85 SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRL 144
S G Y + + GTP + ILDTGS L+W C C C P F P S++ R
Sbjct: 86 SDGEYLMEMGIGTPTRYYSAILDTGSDLIWTQCA---PCLLCVDQPTPYFDPARSATYRS 142
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNL 203
LGC +P C+ +++ PL K C Y YG S T G+ +ET
Sbjct: 143 LGCASPACNALYY-----------PLCYQKVCV-----YQYFYGDSASTAGVLANETFTF 186
Query: 204 ---PNRI-IPNFLVGCSVLSSRQPA---GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
R+ +P GC L++ A G+ GFGRG SL SQL +FSYCL S
Sbjct: 187 GTNETRVSLPGISFGCGNLNAGLLANGSGMVGFGRGSLSLVSQLGSPRFSYCLTSFLSPV 246
Query: 257 TTRT-----SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG 311
+R ++L N SS + T PFV NP A Y++ + I+VGG
Sbjct: 247 PSRLYFGVYATLNSTNASSEPVQST------PFVVNP------ALPTMYFLNMTGISVGG 294
Query: 312 QRVRVWHKYLTL-DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM-VKNRNYTRALG 369
+ + + D DG GGTI+DSGTT T++A ++ + F SQ+ + N T A
Sbjct: 295 YLLPIDPAVFAINDTDGTGGTIIDSGTTITYLAEPAYDAVRAAFASQITLPLLNVTDA-- 352
Query: 370 AEALTGLRPCFDV--PGEKTGSFPELKLHFKGGAEVTLPVENYFAV-VGEGSAVCLTVVT 426
+ L CF P ++ + P+L LHF GA+ LP++NY V G +CL +
Sbjct: 353 ----SVLDTCFQWPPPPRQSVTLPQLVLHFD-GADWELPLQNYMLVDPSTGGGLCLAM-- 405
Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
AS I+G++Q QN+ V YDL N + F C
Sbjct: 406 ---ASSSDGSIIGSYQHQNFNVLYDLENSLMSFVPAPC 440
>gi|226492150|ref|NP_001146362.1| hypothetical protein precursor [Zea mays]
gi|219886805|gb|ACL53777.1| unknown [Zea mays]
gi|414878074|tpg|DAA55205.1| TPA: hypothetical protein ZEAMMB73_415404 [Zea mays]
Length = 440
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 124/405 (30%), Positives = 190/405 (46%), Gaps = 58/405 (14%)
Query: 86 YGGYSISLS---FGTPPQIIPFILDTGSHLVWFPCTNHYQCK-YCSSSKIPSFIPKLSSS 141
+GG S ++ G PPQ I+DTGS+L+W C+ +C+ C +P + P S +
Sbjct: 65 WGGQSQYIAEYLIGDPPQRAEAIIDTGSNLIWTQCS---RCRPTCFRQNLPYYDPSRSRA 121
Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETL 201
+R +GC + C+ QC L+ +K C + + YG+G G +E L
Sbjct: 122 ARAVGCNDAACAL--GSETQC-------LSDNKTC-----AVVTGYGAGNIAGTLATENL 167
Query: 202 NLPNRIIPNFLVGCSVLSSRQP------AGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFD 255
+ + + + GC V++ P +GI G GRGK SLPSQL +FSYCL + F+
Sbjct: 168 TFQSETV-SLVFGCIVVTKLSPGSLNGASGIIGLGRGKLSLPSQLGDTRFSYCLTPY-FE 225
Query: 256 DTTRTSSLI------LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
DT S ++ L NGS+ S T +T PFV +PS + FS +YY+ L IT
Sbjct: 226 DTIEPSHMVVGASAGLINGSASS----TPVTTVPFVRSPS---DDPFSTFYYLPLTGITA 278
Query: 310 GGQRVRVWHKYLTLDRDGNG---GTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
G ++ V L + G GT +DSG T + ++ L E Q+ +
Sbjct: 279 GKVKLAVPSAAFDLRQVAPGMWTGTFIDSGAPLTSLVDVAYQALRAELARQL--GAALVQ 336
Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGA----EVTLPVENYFAVVGEGSAVCL 422
L TG C + + P L LHF GG+ ++ +P NY+A V +A C+
Sbjct: 337 PLAGT--TGFDLCVALK-DAERLVPPLVLHFGGGSGTGTDLVVPPANYWAPVDSATA-CM 392
Query: 423 TVVTDREASGGP---SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
V + + P + ++GN+ QN +V YDL L F+ C
Sbjct: 393 VVFSSVDRKSLPMNETTVIGNYMQQNMHVLYDLAGGVLSFQPADC 437
>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 536
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 118/394 (29%), Positives = 178/394 (45%), Gaps = 44/394 (11%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y I + GTPP+ + ILDTGS L W C C C P + P SSS R +
Sbjct: 168 GEYFIDMFVGTPPKHVWLILDTGSDLSWIQCD---PCYDCFEQNGPHYNPNESSSYRNIS 224
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLY--GSGLTEGIALSE-TLNL 203
C +P+C + + +PL K Q CP Y Y GS T AL T+NL
Sbjct: 225 CYDPRCQLV---------SSPDPLQHCKTENQTCP-YFYDYADGSNTTGDFALETFTVNL 274
Query: 204 --PN-----RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLL 250
PN + + + + GC + G+ G GRG S PSQL FSYCL
Sbjct: 275 TWPNGKEKFKHVVDVMFGCGHWNKGFFHGAGGLLGLGRGPLSFPSQLQSIYGHSFSYCL- 333
Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
+ F +T+ +S LI G L +T + E +YY+ ++ I VG
Sbjct: 334 TDLFSNTSVSSKLIF--GEDKELLNHHNLNFTKLL----AGEETPDDTFYYLQIKSIVVG 387
Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
G+ + + K +G GGTI+DSG+T TF ++ + + F ++ + + A
Sbjct: 388 GEVLDIPEKTWHWSSEGVGGTIIDSGSTLTFFPDSAYDVIKEAFEKKIK-----LQQIAA 442
Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA 430
+ + PC++V G P+ +HF GA P ENYF +CL ++ +
Sbjct: 443 DDFI-MSPCYNVSGAMQVELPDYGIHFADGAVWNFPAENYFYQYEPDEVICLAIL--KTP 499
Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ I+GN QN+++ YD++ RLG+ + C
Sbjct: 500 NHSHLTIIGNLLQQNFHILYDVKRSRLGYSPRRC 533
>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
Length = 415
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 122/389 (31%), Positives = 170/389 (43%), Gaps = 78/389 (20%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + L+ GTPPQ + LDTGS L+W C C C +P F P SS+ L C
Sbjct: 89 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQ---PCPACFDQALPYFDPSTSSTLSLTSCD 145
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
+ C + S+ D G+G + +
Sbjct: 146 STLCQGLPVASLPRSD------------------KFTFVGAGAS---------------V 172
Query: 209 PNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRT---- 260
P GC + ++ GIAGFGRG SLPSQL + FS+C TT T
Sbjct: 173 PGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCF-------TTITGAIP 225
Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
S+++LD + + TP + NP A +YY+ L+ ITVG R+ V
Sbjct: 226 STVLLDLPADLFSNGQGAVQTTPLIQNP------ANPTFYYLSLKGITVGSTRLPVPESE 279
Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM---VKNRNYTRALGAEALTGLR 377
L ++G GGTI+DSGT T + ++ + D F +Q+ V + N T
Sbjct: 280 FAL-KNGTGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYF-------- 330
Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAV-CLTVVTDREASGGPS 435
C P P+L LHF+ GA + LP ENY F V GS++ CL ++ GG
Sbjct: 331 -CLSAPLRAKPYVPKLVLHFE-GATMDLPRENYVFEVEDAGSSILCLAII-----EGGEV 383
Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+GNFQ QN +V YDL+N +L F C
Sbjct: 384 TTIGNFQQQNMHVLYDLQNSKLSFVPAQC 412
>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
Length = 479
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 125/390 (32%), Positives = 176/390 (45%), Gaps = 60/390 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y ++ FGTP + I+DTGS + W C C C S P F P+ SSS + L
Sbjct: 136 GNYIVTAGFGTPAKNSLLIIDTGSDVTWIQCK---PCSDCYSQVDPIFEPQQSSSYKHLS 192
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C + C+ L T +C Y + YG G ++G ETL L +
Sbjct: 193 CLSSACT---------------ELTTMNHCRLGGCVYEINYGDGSRSQGDFSQETLTLGS 237
Query: 206 RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTR 259
P+F GC ++ + AG+ G GR S PSQ +FSYCL F +T
Sbjct: 238 DSFPSFAFGCGHTNTGLFKGSAGLLGLGRTALSFPSQTKSKYGGQFSYCL--PDFVSSTS 295
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
T S + GS + T+ P V+N + + +Y+VGL I+VGG+R+ +
Sbjct: 296 TGSFSVGQGSIPATA-----TFVPLVSNSN------YPSFYFVGLNGISVGGERLSIPPA 344
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL-GAEALTGLRP 378
L G GGTIVDSGT T + P+ ++ L F R+ TR L A+ + L
Sbjct: 345 VL-----GRGGTIVDSGTVITRLVPQAYDALKTSF-------RSKTRNLPSAKPFSILDT 392
Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTL-PVENYFAVVGEGSAVCLTVVTDREASGGPSI- 436
C+D+ P + HF+ A+V + V F + +GS VCL AS SI
Sbjct: 393 CYDLSSYSQVRIPTITFHFQNNADVAVSAVGILFTIQSDGSQVCLAF-----ASASQSIS 447
Query: 437 --ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GNFQ Q V +D R+GF C
Sbjct: 448 TNIIGNFQQQRMRVAFDTGAGRIGFAPGSC 477
>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
Length = 452
Score = 145 bits (367), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 127/400 (31%), Positives = 169/400 (42%), Gaps = 52/400 (13%)
Query: 81 ISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVW---FPCTNHYQCKYCSSSKIPSFIPK 137
++ + G Y + LS GTPP P I+DTGS L W PCT C + P + P
Sbjct: 88 LAENGAGAYHMILSVGTPPLAFPAIIDTGSDLTWTQCAPCTTA-----CFAQPTPLYDPA 142
Query: 138 LSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIAL 197
SS+ L C +P C + R CN + C Y Y G T G
Sbjct: 143 RSSTFSKLPCASPLCQALPSAF---RACN------ATGCV-----YDYRYAVGFTAGYLA 188
Query: 198 SETLNL--------PNRIIPNFLVGCSVLSSRQ---PAGIAGFGRGKTSLPSQLNLDKFS 246
++TL + + GCS + +GI G GR SL SQ+ + +FS
Sbjct: 189 ADTLAIGDGDGDGDASSSFAGVAFGCSTANGGDMDGASGIVGLGRSALSLLSQIGVGRFS 248
Query: 247 YCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
YCL S D S ++ ++ + K + T + NP A R A YYYV L
Sbjct: 249 YCLRS---DADAGASPILFGALANVTGDK---VQSTALLRNPVAARRRA--PYYYVNLTG 300
Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
I VG + V G GG IVDSGTTFT++A + L F+SQ TR
Sbjct: 301 IAVGSTDLPVTSSTFGFTAAGAGGVIVDSGTTFTYLAEAGYTMLRQAFLSQTAG--LLTR 358
Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAV-CLTVV 425
GA+ L CF+ G P L F GGAE +P ++YF V EG V CL V+
Sbjct: 359 VSGAQFDFDL--CFEA-GAADTPVPRLVFRFAGGAEYAVPRQSYFDAVDEGGRVACLLVL 415
Query: 426 TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
R S ++GN + +V YDL F C
Sbjct: 416 PTRGVS-----VIGNVMQMDLHVLYDLDGATFSFAPADCA 450
>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 559
Score = 145 bits (366), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 120/394 (30%), Positives = 183/394 (46%), Gaps = 46/394 (11%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + GTPP+ ILDTGS L W C C C P + PK SSS R +
Sbjct: 193 GEYFMDVFVGTPPKHFSLILDTGSDLNWIQCV---PCIACFEQSGPYYDPKDSSSFRNIS 249
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG--LTEGIALSE-TLNL 203
C +P+C + + +P K Q CP Y YG G T AL T+NL
Sbjct: 250 CHDPRCQLV---------SSPDPPNPCKAENQSCP-YFYWYGDGSNTTGDFALETFTVNL 299
Query: 204 --PN-----RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLL 250
PN + + N + GC + AG+ G G+G S SQ+ FSYCL+
Sbjct: 300 TTPNGKSELKHVENVMFGCGHWNRGLFHGAAGLLGLGKGPLSFASQMQSLYGQSFSYCLV 359
Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
+ + +S LI G L +T F + + +YYV + + V
Sbjct: 360 DRN-SNASVSSKLIF--GEDKELLSHPNLNFTSFGG----GKDGSVDTFYYVQINSVMVD 412
Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
+ +++ + L +G GGTI+DSGTT T+ A +E + + FV ++ + Y
Sbjct: 413 DEVLKIPEETWHLSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKI---KGYEL---V 466
Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA 430
E L L+PC++V G + P+ + F GA PVENYF + + VCL ++ + +
Sbjct: 467 EGLPPLKPCYNVSGIEKMELPDFGILFADGAVWNFPVENYFIQI-DPDVVCLAILGNPRS 525
Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ I+GN+Q QN+++ YD++ RLG+ C
Sbjct: 526 ALS---IIGNYQQQNFHILYDMKKSRLGYAPMKC 556
>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 459
Score = 145 bits (365), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 129/425 (30%), Positives = 185/425 (43%), Gaps = 62/425 (14%)
Query: 64 NPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC 123
NP K+ + +T + G Y + + GTPPQ + + DTGS LVW C+ C
Sbjct: 70 NPTLKSPLISGASTGS-------GQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNC 122
Query: 124 KYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSY 183
+ S +F+P+ SSS C +P C + H CN L + C +
Sbjct: 123 SHHPPSS--AFLPRHSSSFSPFHCFDPHCRLLPHAPHHL--CNHTRLHSP------C-RF 171
Query: 184 LVLYGSG-LTEGIALSETLNLPN-----------------RIIPNFLVGCSVLSSRQPAG 225
L Y G L+ G ET L + RI + G +R G
Sbjct: 172 LYSYADGSLSSGFFSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGAR---G 228
Query: 226 IAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKT--TGLT 280
+ G GRG S SQL +KFSYCL+ + +S ++ G HS T T ++
Sbjct: 229 VMGLGRGSISFSSQLGRRFGNKFSYCLMDYTLSPPP--TSFLMIGGGLHSLPLTNATKIS 286
Query: 281 YTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFT 340
YTP NP +YY+ + IT+ G ++ + +D GNGGT+VDSGTT T
Sbjct: 287 YTPLQINPLSP------TFYYITIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLT 340
Query: 341 FMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGE-KTGSFPELKLHFKG 399
++ +E + V + VK N AE G C + GE + S P L+ G
Sbjct: 341 YLTKTAYEEVLKS-VRRRVKLPN-----AAELTPGFDLCVNASGESRRPSLPRLRFRLGG 394
Query: 400 GAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGF 459
GA P NYF EG +CL + E+ G S+I GN Q + +E+D RLGF
Sbjct: 395 GAVFAPPPRNYFLETEEG-VMCL-AIRAVESGNGFSVI-GNLMQQGFLLEFDKEESRLGF 451
Query: 460 KQQLC 464
++ C
Sbjct: 452 TRRGC 456
>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 458
Score = 145 bits (365), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 122/397 (30%), Positives = 177/397 (44%), Gaps = 62/397 (15%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y GTPPQ + +D + W PC+ C +SS PSF P SS+ R + C
Sbjct: 100 YVARARLGTPPQTLLVAIDPSNDAAWVPCSACLGCAPGASS--PSFDPTQSSTYRPVRCG 157
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPN--- 205
P+C+ + + C P +C ++ + Y S + + L+L +
Sbjct: 158 APQCAQVPPATPSC------PAGPGASC-----AFNLSYASSTLHAVLGQDALSLSDSNG 206
Query: 206 RIIPN--FLVGCSVL-----SSRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFD 255
+P+ + GC + S P G+ GFGRG S SQ FSYCL S+K
Sbjct: 207 AAVPDDHYTFGCLRVVTGSGGSVPPQGLVGFGRGPLSFLSQTKATYGSIFSYCLPSYKSS 266
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
+ + T L G + ++ + TP ++NP R + YYV + + V G+ V
Sbjct: 267 NFSGTLRL----GPAGQPRR---IKTTPLLSNP---HRPSL---YYVAMVGVRVNGKAVP 313
Query: 316 VWHKYLTLDRD-GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
+ L LD G GGTIVD+GT FT ++P + L + F R A A AL
Sbjct: 314 IPASALALDAATGRGGTIVDAGTMFTRLSPPAYAALRNAF-------RRGVSAPAAPALG 366
Query: 375 GLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGP 434
G C+ V G K S P + F GGA VTLP EN G CL + + GP
Sbjct: 367 GFDTCYYVNGTK--SVPAVAFVFAGGARVTLPEENVVISSTSGGVACLAM------AAGP 418
Query: 435 SI-------ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
S +L + Q QN+ V +D+ N R+GF ++LC
Sbjct: 419 SDGVNAGLNVLASMQQQNHRVVFDVGNGRVGFSRELC 455
>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 453
Score = 144 bits (364), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 128/394 (32%), Positives = 174/394 (44%), Gaps = 57/394 (14%)
Query: 83 SHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSS 142
S G Y L GTPP+ + +LDTGS +VW C+ C+ C S P F P S S
Sbjct: 104 SQGSGEYFTRLGVGTPPRYLYMVLDTGSDVVWLQCS---PCRKCYSQSDPIFNPYKSKSF 160
Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETL 201
+ C +P C + R C Y V YG G T G +ETL
Sbjct: 161 AGIPCSSPLCRRLDSSGCSTR---------RHTCL-----YQVSYGDGSFTTGDFATETL 206
Query: 202 NLPNRIIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNL---DKFSYCLLS 251
I +GC G+ G GRG+ S PSQ + KFSYCL+
Sbjct: 207 TFRGNKIAKVALGC----GHHNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVD 262
Query: 252 HKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG 311
+++ SS++ + + + +TP + NP + +YYVGL I+VGG
Sbjct: 263 RS--ASSKPSSMVFGDAAISRLAR-----FTPLIRNPKL------DTFYYVGLIGISVGG 309
Query: 312 QRVR-VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
RVR V LD GNGG I+DSGT+ T + + L D F V R+ R G
Sbjct: 310 VRVRGVSPSLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAF---RVGARHLKR--GP 364
Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA 430
E + C+D+ G+ + P + LHF+ GA++ LP NY V E + C
Sbjct: 365 E-FSLFDTCYDLSGQSSVKVPTVVLHFR-GADMALPATNYLIPVDENGSFCFAFA---GT 419
Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
G SII GN Q Q + V YDL R+GF + C
Sbjct: 420 ISGLSII-GNIQQQGFRVVYDLAGSRIGFAPRGC 452
>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
Length = 449
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 121/396 (30%), Positives = 194/396 (48%), Gaps = 52/396 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + G PP+ I+DTGS L W C CK C P F P S+S +++
Sbjct: 85 GEYFMDVFVGNPPRHFLLIIDTGSDLTWLQCK---PCKACFDQSGPVFDPSQSTSFKIIP 141
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNL-- 203
C C + H+ +CRD +SK + C Y YG S T G E+L++
Sbjct: 142 CNAAACDLVVHD--ECRD------NSSKTSPKTC-KYFYWYGDSSRTSGDLALESLSVSL 192
Query: 204 ---PNRI-IPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQLNL----DKFSYCLLSH 252
P+ + I + ++GC + + G+ G G+G S PSQL FSYCL+
Sbjct: 193 SDHPSSLEIRDMVIGCGHSNKGLFQGAGGLLGLGQGALSFPSQLRSSPIGQSFSYCLVD- 251
Query: 253 KFDDTTRTSSLILDNG---SSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
+ ++ + +S++ G S H D+ + +TPFV N+ +YY+G++ I +
Sbjct: 252 RTNNLSVSSAISFGAGFALSRHFDQ----MKFTPFVRT-----NNSVETFYYLGIQGIKI 302
Query: 310 GGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
+ + + + + +G+GGTI+DSGTT T++ + + + F++++ +Y R
Sbjct: 303 DQELLPIPAERFAIATNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARI----SYPR--- 355
Query: 370 AEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAV-CLTVVTDR 428
A+ L C++ G FP L + F+ GAE+ LP ENYF A CL ++
Sbjct: 356 ADPFDILGICYNATGRAAVPFPALSIVFQNGAELDLPQENYFIQPDPQEAKHCLAIL--- 412
Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ G SII GNFQ QN + YD+++ RLGF C
Sbjct: 413 -PTDGMSII-GNFQQQNIHFLYDVQHARLGFANTDC 446
>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
Length = 449
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 123/392 (31%), Positives = 180/392 (45%), Gaps = 43/392 (10%)
Query: 88 GYSISLSFGTPPQIIPFILDTGSHLVWFPCT----NHYQCKYCSSSKIPSFIPKLSSSSR 143
G+S+++ GTPPQ I+DTGS L+W C+ S + P + P+ SSS
Sbjct: 83 GHSLTVGIGTPPQPRTLIVDTGSDLIWTQCSMLSRRTRTAASASRQREPLYEPRRSSSFA 142
Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSE--TL 201
L C + C C A + C Y LYGS G+ SE T
Sbjct: 143 YLPCSDRLCQEGQFSYKNC--------ARNNRCM-----YDELYGSAEAGGVLASETFTF 189
Query: 202 NLPNRIIPNFLVGCSVLSSRQ---PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTT 258
+ ++ GC LS+ +G+ G G SL SQL++ +FSYCL
Sbjct: 190 GVNAKVSLPLGFGCGALSAGDLVGASGLMGLSPGIMSLVSQLSVPRFSYCLTPFA---ER 246
Query: 259 RTSSLILDNGSSHSDKKTTG-LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
+TS L+ + +TTG + T + NP++ + YYYV L +++G +R+ V
Sbjct: 247 KTSPLLFGAMADLRRYRTTGTVQTTSILRNPAME-----TAYYYVPLVGLSLGTKRLDVP 301
Query: 318 HKYLTLDR-DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM-VKNRNYTRALGAEALTG 375
L + + DG+GGTIVDSG+T +++ F + V + + N T E
Sbjct: 302 ATSLGMIKPDGSGGTIVDSGSTMSYLEETAFRAVKKAVVEAVRLPVANGTD----EDYDD 357
Query: 376 LRPCFDVP---GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG 432
CF +P + P L LHF GGA +TLP +NYF G +CL V T + G
Sbjct: 358 YELCFALPTGVAMEAVKTPPLVLHFDGGAAMTLPRDNYFQEPRAG-LMCLAVGTSPDGFG 416
Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN Q QN +V +D+RNQ+ F C
Sbjct: 417 --VSIIGNVQQQNMHVLFDVRNQKFSFAPTKC 446
>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
Length = 525
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 131/426 (30%), Positives = 196/426 (46%), Gaps = 55/426 (12%)
Query: 64 NPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC 123
+P+ + T + ++ S G Y + + GTPP+ I+DTGS L W C C
Sbjct: 127 SPRRALSERMVATVESGVAVGS-GEYLMDVYVGTPPRRFRMIMDTGSDLNWLQCA---PC 182
Query: 124 KYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQ-------CRDCNDEPLATSKNC 176
C + P F P SSS R + C + +C + CR ++P
Sbjct: 183 LDCFEQRGPVFDPAASSSYRNVTCGDHRCGHVAPPPEPEASSPRTCRRPGEDP------- 235
Query: 177 TQICPSYLVLYG--SGLTEGIAL-SETLNL----PNRIIPNFLVGCSVLSS---RQPAGI 226
CP Y YG S T +AL S T+NL +R + + GC + AG+
Sbjct: 236 ---CP-YYYWYGDQSNTTGDLALESFTVNLTAPGASRRVDGVVFGCGHRNRGLFHGAAGL 291
Query: 227 AGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTP 283
G GRG S SQL FSYCL+ H D ++ + ++ + + L YT
Sbjct: 292 LGLGRGPLSFASQLRAVYGHTFSYCLVDHGSDVGSKV--VFGEDDDALALAAHPQLKYTA 349
Query: 284 FVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMA 343
F + + + +YYV L+ + VGG+ + + + +DG+GGTI+DSGTT ++
Sbjct: 350 FAP--ASSSSSPADTFYYVKLKGVLVGGELLNISSDTWDVGKDGSGGTIIDSGTTLSYFV 407
Query: 344 PELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEV 403
++ + F+ +M +R+Y L PC++V G + PEL L F GA
Sbjct: 408 EPAYQVIRHAFMDRM--SRSYPL---VPEFPVLSPCYNVSGVERPEVPELSLLFADGAVW 462
Query: 404 TLPVENYFAVVGE--GSAVCLTVVTDREASGGPSI---ILGNFQMQNYYVEYDLRNQRLG 458
P ENYF + GS +CL V+ G P I+GNFQ QN++V YDL+N RLG
Sbjct: 463 DFPAENYFIRLDPDGGSIMCLAVL------GTPRTGMSIIGNFQQQNFHVVYDLQNNRLG 516
Query: 459 FKQQLC 464
F + C
Sbjct: 517 FAPRRC 522
>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
Length = 459
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 121/396 (30%), Positives = 180/396 (45%), Gaps = 50/396 (12%)
Query: 81 ISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSS 140
IS +++ G+S+++ GTPPQ ILD GS L+W C+ + P F SS
Sbjct: 99 ISPYAHQGHSLTVGVGTPPQPSKVILDLGSDLLWTQCS---LVGPTAKQLEPVFDAARSS 155
Query: 141 SSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSET 200
S +L C + C + T+K CT +Y YG G+ +ET
Sbjct: 156 SFSVLPCDSKLC--------------EAGTFTNKTCTDRKCAYENDYGIMTATGVLATET 201
Query: 201 LNLPNR--IIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFD 255
+ N GC L++ + +GI G G S+ QL + KFSYCL F
Sbjct: 202 FTFGAHHGVSANLTFGCGKLANGTIAEASGILGLSPGPLSMLKQLAITKFSYCLTP--FA 259
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYT-PFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
D +TS ++ + KTTG T P + NP +YYYV + ++VG +R+
Sbjct: 260 DR-KTSPVMFGAMADLGKYKTTGKVQTIPLLKNP------VEDIYYYVPMVGMSVGSKRL 312
Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLAD---EFVSQMVKNRNYTRALGAE 371
V + L + DG GGT++DS TT ++ F L E + V NR+
Sbjct: 313 DVPQETLAIKPDGTGGTVLDSATTLAYLVEPAFTELKKAVMEGIKLPVANRS-------- 364
Query: 372 ALTGLRPCFDVP---GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR 428
+ CF++P + P L LHF G AE++LP +NYF G +CL V+
Sbjct: 365 -VDDYPVCFELPRGMSMEGVQVPPLVLHFDGDAEMSLPRDNYFQEPSPG-MMCLAVM-QA 421
Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
G P++I GN Q QN +V YD+ N++ + C
Sbjct: 422 PFEGAPNVI-GNVQQQNMHVLYDVGNRKFSYAPTKC 456
>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 561
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 119/394 (30%), Positives = 182/394 (46%), Gaps = 46/394 (11%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + GTPP+ ILDTGS L W C C C P + PK SSS R +
Sbjct: 195 GEYFMDVFVGTPPKHFSLILDTGSDLNWIQCV---PCIACFEQSGPYYDPKDSSSFRNIS 251
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG--LTEGIALSE-TLNL 203
C +P+C + +P K Q CP Y YG G T AL T+NL
Sbjct: 252 CHDPRCQLVSAP---------DPPKPCKAENQSCP-YFYWYGDGSNTTGDFALETFTVNL 301
Query: 204 --PN-----RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLL 250
PN + + N + GC + AG+ G G+G S SQ+ FSYCL+
Sbjct: 302 TTPNGTSELKHVENVMFGCGHWNRGLFHGAAGLLGLGKGPLSFASQMQSLYGQSFSYCLV 361
Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
+ + +S LI G L +T F + + +YYV ++ + V
Sbjct: 362 DRN-SNASVSSKLIF--GEDKELLSHPNLNFTSFGG----GKDGSVDTFYYVQIKSVMVD 414
Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
+ +++ + L +G GGTI+DSGTT T+ A +E + + FV ++ + Y
Sbjct: 415 DEVLKIPEETWHLSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKI---KGYQL---V 468
Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA 430
E L L+PC++V G + P+ + F A PVENYF + + VCL ++ + +
Sbjct: 469 EGLPPLKPCYNVSGIEKMELPDFGILFADEAVWNFPVENYFIWI-DPEVVCLAILGNPRS 527
Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ I+GN+Q QN+++ YD++ RLG+ C
Sbjct: 528 ALS---IIGNYQQQNFHILYDMKKSRLGYAPMKC 558
>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
Length = 482
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 117/370 (31%), Positives = 172/370 (46%), Gaps = 49/370 (13%)
Query: 105 ILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRD 164
I+DTGS L W C CK C + + P F P S S R + C +P C + +
Sbjct: 149 IVDTGSDLSWVQCQ---PCKRCYNQQDPVFNPSTSPSYRTVLCSSPTCQSLQSATGNLGV 205
Query: 165 CNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNR-IIPNFLVGCSVLSSR- 221
C P + + Y+V YG G T G +E L+L N + NF+ GC +
Sbjct: 206 CGSNPPSCN---------YVVNYGDGSYTRGELGTEHLDLGNSTAVNNFIFGCGRNNQGL 256
Query: 222 --QPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKT 276
+G+ G GR SL SQ + FSYCL +T + SL++ G+S K T
Sbjct: 257 FGGASGLVGLGRSSLSLISQTSAMFGGVFSYCL---PITETEASGSLVM-GGNSSVYKNT 312
Query: 277 TGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSG 336
T ++YT + NP + +Y++ L ITVG V+ G G ++DSG
Sbjct: 313 TPISYTRMIPNPQLP-------FYFLNLTGITVGSVAVQA-------PSFGKDGMMIDSG 358
Query: 337 TTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLH 396
T T + P +++ L DEFV Q ++ A A L CF++ G + P +K+H
Sbjct: 359 TVITRLPPSIYQALKDEFVKQ------FSGFPSAPAFMILDTCFNLSGYQEVEIPNIKMH 412
Query: 397 FKGGAEVTLPVENYFAVVG-EGSAVCLTVVT-DREASGGPSIILGNFQMQNYYVEYDLRN 454
F+G AE+ + V F V + S VCL + + E G I+GN+Q +N V YD +
Sbjct: 413 FEGNAELNVDVTGVFYFVKTDASQVCLAIASLSYENEVG---IIGNYQQKNQRVIYDTKG 469
Query: 455 QRLGFKQQLC 464
LGF + C
Sbjct: 470 SMLGFAAEAC 479
>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 469
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 126/386 (32%), Positives = 173/386 (44%), Gaps = 49/386 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + GTPP+ + +LDTGS +VW C CK C + P F P+ S S +
Sbjct: 124 GEYFTRIGVGTPPRYVYMVLDTGSDIVWIQCA---PCKRCYAQSDPVFDPRKSRSFASIA 180
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C++P C + CN + Q C Y V YG G T G +ETL
Sbjct: 181 CRSPLC-----HRLDSPGCNTQ--------KQTC-MYQVSYGDGSFTFGDFSTETLTFRR 226
Query: 206 RIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTR 259
+ +GC + AG+ G GRG+ S PSQ KFSYCL+ +++
Sbjct: 227 TRVARVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGRRFNHKFSYCLVDRS--ASSK 284
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV-RVWH 318
SS++ + + + +TP V+NP + +YYV L I+VGG RV +
Sbjct: 285 PSSMVFGDSAVSRTAR-----FTPLVSNPKL------DTFYYVELLGISVGGTRVPGITA 333
Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
LD+ GNGG I+DSGT+ T + + D F N R A +
Sbjct: 334 SLFKLDQTGNGGVIIDSGTSVTRLTRPAYIAFRDAF---RAGASNLKR---APQFSLFDT 387
Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
CFD+ G+ P + LHF+ GA+V+LP NY V CL GG SII
Sbjct: 388 CFDLSGKTEVKVPTVVLHFR-GADVSLPASNYLIPVDTSGNFCLAFA---GTMGGLSII- 442
Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
GN Q Q + V YDL R+GF C
Sbjct: 443 GNIQQQGFRVVYDLAGSRVGFAPHGC 468
>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
Length = 533
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 121/396 (30%), Positives = 194/396 (48%), Gaps = 52/396 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + G PP+ I+DTGS L W C CK C P F P S+S +++
Sbjct: 169 GEYFMDVFVGNPPRHFLLIIDTGSDLTWLQCK---PCKACFDQSGPVFDPSQSTSFKIIP 225
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNL-- 203
C C + H+ +CRD +SK + C Y YG S T G E+L++
Sbjct: 226 CNAAACDLVVHD--ECRD------NSSKTSPKTC-KYFYWYGDSSRTSGDLALESLSVSL 276
Query: 204 ---PNRI-IPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQLNL----DKFSYCLLSH 252
P+ + I + ++GC + + G+ G G+G S PSQL FSYCL+
Sbjct: 277 SDHPSSLEIRDMVIGCGHSNKGLFQGAGGLLGLGQGALSFPSQLRSSPIGQSFSYCLVD- 335
Query: 253 KFDDTTRTSSLILDNG---SSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
+ ++ + +S++ G S H D+ + +TPFV N+ +YY+G++ I +
Sbjct: 336 RTNNLSVSSAISFGAGFALSRHFDQ----MRFTPFVRT-----NNSVETFYYLGIQGIKI 386
Query: 310 GGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
+ + + + + +G+GGTI+DSGTT T++ + + + F++++ +Y R
Sbjct: 387 DQELLPIPAERFAIAPNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARI----SYPR--- 439
Query: 370 AEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAV-CLTVVTDR 428
A+ L C++ G FP L + F+ GAE+ LP ENYF A CL ++
Sbjct: 440 ADPFDILGICYNATGRTAVPFPTLSIVFQNGAELDLPQENYFIQPDPQEAKHCLAIL--- 496
Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ G SII GNFQ QN + YD+++ RLGF C
Sbjct: 497 -PTDGMSII-GNFQQQNIHFLYDVQHARLGFANTDC 530
>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
Length = 472
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 126/388 (32%), Positives = 178/388 (45%), Gaps = 51/388 (13%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y I L FGTPPQ +LDTGS++ W PC C CSS + P F P SS+ L C
Sbjct: 124 YIIKLGFGTPPQSFYTVLDTGSNIAWIPCN---PCSGCSSKQQP-FEPSKSSTYNYLTCA 179
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPNRI 207
+ +C + R C S NC S YG + I SETL++ ++
Sbjct: 180 SQQCQLL-------RVCTKSD--NSVNC-----SLTQRYGDQSEVDEILSSETLSVGSQQ 225
Query: 208 IPNFLVGCSV----LSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTTRT 260
+ NF+ GCS L R P+ + GFGR S SQ L FSYCL S + T
Sbjct: 226 VENFVFGCSNAARGLIQRTPS-LVGFGRNPLSFVSQTATLYDSTFSYCLPS--LFSSAFT 282
Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
SL+L GL +TP ++N + + +YYVGL I+VG + V +
Sbjct: 283 GSLLL----GKEALSAQGLKFTPLLSN------SRYPSFYYVGLNGISVGEELVSIPAGT 332
Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
L+LD GTI+DSGT T + + + D F SQ+ N T A + C+
Sbjct: 333 LSLDESTGRGTIIDSGTVITRLVEPAYNAMRDSFRSQL---SNLTMASPTDLFD---TCY 386
Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVEN-YFAVVGEGSAVCLTVVTDREASGGPSII-- 437
+ P FP + LHF ++TLP++N + +GS +CL GG ++
Sbjct: 387 NRPSGDV-EFPLITLHFDDNLDLTLPLDNILYPGNDDGSVLCLAF--GLPPGGGDDVLST 443
Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
GN+Q Q + +D+ RLG + C
Sbjct: 444 FGNYQQQKLRIVHDVAESRLGIASENCD 471
>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 494
Score = 142 bits (358), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 129/411 (31%), Positives = 175/411 (42%), Gaps = 49/411 (11%)
Query: 67 TKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYC 126
T+ T + + + G Y + GTP +LDTGS +VW C C+ C
Sbjct: 120 TRRTGSGVVAPVVSGLAQGSGEYFTKIGVGTPATPALMVLDTGSDVVWLQCA---PCRRC 176
Query: 127 SSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVL 186
F P+ S S +GC P C + R K C Y V
Sbjct: 177 YDQSGQVFDPRRSRSYGAVGCSAPLCRRLDSGGCDLR---------RKACL-----YQVA 222
Query: 187 YGSG-LTEGIALSETLNLPNRI-IPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLN 241
YG G +T G +ETL + +GC + AG+ G GRG S P+Q++
Sbjct: 223 YGDGSVTAGDFATETLTFAGGARVARIALGCGHDNEGLFVAAAGLLGLGRGSLSFPAQIS 282
Query: 242 LD---KFSYCLLSH--KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAF 296
FSYCL+ + + +S++ +G+ S T ++TP V NP +
Sbjct: 283 RRYGRSFSYCLVDRTSSANPASHSSTVTFGSGAVGS---TVAASFTPMVKNPRM------ 333
Query: 297 SVYYYVGLRRITVGGQRVR-VWHKYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEF 354
+YYV L I+VGG RV V L LD G GG IVDSGT+ T +A + L D F
Sbjct: 334 ETFYYVQLVGISVGGARVSGVADSDLRLDPSSGRGGVIVDSGTSVTRLARPAYSALRDAF 393
Query: 355 VSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVV 414
+ R L + C+D+ G K P + +HF GGAE LP ENY V
Sbjct: 394 RAAAAGLR-----LSPGGFSLFDTCYDLSGRKVVKVPTVSMHFAGGAEAALPPENYLIPV 448
Query: 415 GEGSAVCLTVV-TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
C TD GG SII GN Q Q + V +D QR+GF + C
Sbjct: 449 DSKGTFCFAFAGTD----GGVSII-GNIQQQGFRVVFDGDGQRVGFVPKGC 494
>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
Length = 497
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 127/390 (32%), Positives = 170/390 (43%), Gaps = 57/390 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y L GTPP+ +LDTGS ++W C C C P F P SS+ R +
Sbjct: 151 GEYFTRLGVGTPPRYTYMVLDTGSDIMWIQC---LPCAKCYGQTDPLFNPAASSTYRKVP 207
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C P C + CR+ + C Y V YG G T G +ETL
Sbjct: 208 CATPLCKKLDISG--CRN--------KRYC-----EYQVSYGDGSFTVGDFSTETLTFRG 252
Query: 206 RIIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNLD---KFSYCLLSHKFD 255
++I +GC G+ G GRG S PSQ +FSYCL+
Sbjct: 253 QVIRRVALGCG----HDNEGLFIGAAGLLGLGRGSLSFPSQTGAQFSKRFSYCLVDRSAS 308
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV- 314
T SSLI G + K +TP ++NP + +YYV L I+VGG+R+
Sbjct: 309 GTA--SSLIF--GKAAIPKSAI---FTPLLSNPKL------DTFYYVELVGISVGGRRLT 355
Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
+ +D GNGG I+DSGT+ T + + + D F V N A G +
Sbjct: 356 SIPASVFRMDATGNGGVIIDSGTSVTRLVDSAYSTMRDAF---RVGTGNLKSAGG---FS 409
Query: 375 GLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGP 434
C+D+ G KT P L HF+GGA ++LP NY V + C + +GG
Sbjct: 410 LFDTCYDLSGLKTVKVPTLVFHFQGGAHISLPATNYLIPVDSSATFCFAFAGN---TGGL 466
Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
SII GN Q Q Y V +D R+GFK C
Sbjct: 467 SII-GNIQQQGYRVVFDSLANRVGFKAGSC 495
>gi|356553263|ref|XP_003544977.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 445
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 127/433 (29%), Positives = 196/433 (45%), Gaps = 37/433 (8%)
Query: 39 NPSQDSYQNLNSLVSSSLTRALHIK-NPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGT 97
NP+ DS L S L+ A + NP+ +T +++++ + +S ++L GT
Sbjct: 38 NPTTDSLSLSFPLTSLPLSTAKPLNTNPKLRTLSSSSSYNIKSSFKYSMA-LVVTLPIGT 96
Query: 98 PPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHH 157
PPQ +LDTGS L W C N + SF P LSSS +L C +P C
Sbjct: 97 PPQPQQMVLDTGSQLSWIQCHNK-------TPPTASFDPSLSSSFYVLPCTHPLCK---- 145
Query: 158 ESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNL-PNRIIPNFLVGC 215
D L T+ + ++C Y Y G EG + E L P++ P ++GC
Sbjct: 146 -----PRVPDFTLPTTCDQNRLC-HYSYFYADGTYAEGNLVREKLAFSPSQTTPPLILGC 199
Query: 216 SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTR--TSSLIL-DNGSSHS 272
S SR GI G G+ S P Q + KFSYC+ + + + T S L +N +S
Sbjct: 200 SS-ESRDARGILGMNLGRLSFPFQAKVTKFSYCVPTRQPANNNNFPTGSFYLGNNPNSAR 258
Query: 273 DKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTI 332
+ + LT+ P N + Y V ++ I +GG+++ + + G+G T+
Sbjct: 259 FRYVSMLTFPQSQRMP-----NLDPLAYTVPMQGIRIGGRKLNIPPSVFRPNAGGSGQTM 313
Query: 333 VDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGS-FP 391
VDSG+ FTF+ ++ + +E + + G A CFD + G
Sbjct: 314 VDSGSEFTFLVDVAYDRVREEIIRVLGPRVKKGYVYGGVADM----CFDGNAMEIGRLLG 369
Query: 392 ELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYD 451
++ F+ G E+ +P E A VG G V + E G S I+GNF QN +VE+D
Sbjct: 370 DVAFEFEKGVEIVVPKERVLADVGGG--VHCVGIGRSERLGAASNIIGNFHQQNLWVEFD 427
Query: 452 LRNQRLGFKQQLC 464
L N+R+GF C
Sbjct: 428 LANRRIGFGVADC 440
>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 510
Score = 141 bits (356), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 130/394 (32%), Positives = 184/394 (46%), Gaps = 50/394 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + GTPP+ I+DTGS L W C C C + P F P S+S R +
Sbjct: 148 GEYLVEVYVGTPPRRFQMIMDTGSDLNWLQCA---PCLDCFDQRGPVFDPMASTSYRNVT 204
Query: 147 CQNPKCSWIHHESI--QCRDCNDEPLATSKNCTQICPSYLVLYG--SGLTEGIALSE-TL 201
C + +C + + CR +P CP Y YG S T +AL T+
Sbjct: 205 CGDTRCGLVSPPAAPRTCRSSRSDP----------CP-YYYWYGDQSNTTGDLALEAFTV 253
Query: 202 NL---PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSH 252
NL +R + ++GC + AG+ G GRG S SQL FSYCL+ H
Sbjct: 254 NLTASSSRRVDGVVLGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHAFSYCLVDH 313
Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
+ S ++ G + L YT F PS AE + +YYV L+ I VGG+
Sbjct: 314 G---SAVGSKIVF--GDDNVLLSHPQLNYTAFA--PSAAE----NTFYYVQLKGILVGGE 362
Query: 313 RVRVWHKYLTLDR-DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
+ + + + DG+GGTI+DSGTT ++ ++ + FV +M K L A+
Sbjct: 363 MLDIPSNTWGVSKEDGSGGTIIDSGTTLSYFPEPAYKAIRQAFVDRMDK----AYPLIAD 418
Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREA 430
L PC++V G + PE L F GA P ENYF + +CL V+ T R A
Sbjct: 419 -FPVLSPCYNVSGVERVEVPEFSLLFADGAVWDFPAENYFIRLDTEGIMCLAVLGTPRSA 477
Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN+Q QN++V YDL + RLGF + C
Sbjct: 478 MS----IIGNYQQQNFHVLYDLHHNRLGFAPRRC 507
>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 116/384 (30%), Positives = 169/384 (44%), Gaps = 37/384 (9%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + GTPPQ I+D+GS L+W C C C + P + P SS+ +
Sbjct: 63 GQYFVDFFLGTPPQKFSLIVDSGSDLLWVQCA---PCLQCYAQDTPLYAPSNSSTFNPVP 119
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
C +P+C I E + C + L++G+ E+ + +
Sbjct: 120 CLSPECLLIPAT---------EGFPCDFHYPGACAYEYRYADTSLSKGVFAYESATVDDV 170
Query: 207 IIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRT 260
I GC + S G+ G G+G S SQ+ +KF+YCL+++ D T+ +
Sbjct: 171 RIDKVAFGCGRDNQGSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNY-LDPTSVS 229
Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
S LI + L +TP V+N RN YYV + ++ VGG+ + + H
Sbjct: 230 SWLIFGD---ELISTIHDLQFTPIVSN----SRNP--TLYYVQIEKVMVGGESLPISHSA 280
Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
+LD GNGG+I DSGTT T+ P P ++ KN Y R A ++ GL C
Sbjct: 281 WSLDFLGNGGSIFDSGTTVTYWLP----PAYRNILAAFDKNVRYPR---AASVQGLDLCV 333
Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
DV G SFP + GGA NYF V + CL + + GG + I GN
Sbjct: 334 DVTGVDQPSFPSFTIVLGGGAVFQPQQGNYFVDVAP-NVQCLAMAGLPSSVGGFNTI-GN 391
Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
QN+ V+YD R+GF C
Sbjct: 392 LLQQNFLVQYDREENRIGFAPAKC 415
>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 118/382 (30%), Positives = 170/382 (44%), Gaps = 48/382 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + G P + + +LDTGS + W CT C C P F P SSS L
Sbjct: 146 GEYFTRVGIGKPAREVYMVLDTGSDVNWLQCT---PCADCYHQTEPIFEPSSSSSYEPLS 202
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C P+C+ + E +CR N T + Y V YG G T G +ETL + +
Sbjct: 203 CDTPQCNAL--EVSECR-----------NATCL---YEVSYGDGSYTVGDFATETLTIGS 246
Query: 206 RIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSS 262
++ N VGC + AG+ G G G +LPSQLN FSYCL+ D S+
Sbjct: 247 TLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLVDRDSD-----SA 301
Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
+D G+S S P + N + +YY+GL I+VGG+ +++
Sbjct: 302 STVDFGTSLSPDAVVA----PLLRN------HQLDTFYYLGLTGISVGGELLQIPQSSFE 351
Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
+D G+GG I+DSGT T + E++ L D FV + + +A G + C+++
Sbjct: 352 MDESGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTL---DLEKAAG---VAMFDTCYNL 405
Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
+ T P + HF GG + LP +NY V CL + I+GN Q
Sbjct: 406 SAKTTVEVPTVAFHFPGGKMLALPAKNYMIPVDSVGTFCLAFA----PTASSLAIIGNVQ 461
Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
Q V +DL N +GF C
Sbjct: 462 QQGTRVTFDLANSLIGFSSNKC 483
>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 418
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 115/388 (29%), Positives = 173/388 (44%), Gaps = 45/388 (11%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + GTPPQ I+D+GS L+W C+ C+ C + P ++P SS+ +
Sbjct: 62 GQYFVDFFLGTPPQKFSLIVDSGSDLLWVQCS---PCRQCYAQDSPLYVPSNSSTFSPVP 118
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICP---SYLVLYG-SGLTEGIALSETLN 202
C + C I P C P +Y LY + ++G+ E+
Sbjct: 119 CLSSDCLLI-------------PATEGFPCDFRYPGACAYEYLYADTSSSKGVFAYESAT 165
Query: 203 LPNRIIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDD 256
+ I GC + S G+ G G+G S SQ+ +KF+YCL+++ D
Sbjct: 166 VDGVRIDKVAFGCGSDNQGSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNY-LDP 224
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
T+ +SSLI + + YTP V+NP YYV + ++TVGG+ + +
Sbjct: 225 TSVSSSLIFGD---ELISTIHDMQYTPIVSNPKSP------TLYYVQIEKVTVGGKSLPI 275
Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
+D GNGG+I DSGTT T+ P + + F S + +Y R AE++ GL
Sbjct: 276 SDSAWEIDLLGNGGSIFDSGTTLTYWFPSAYSHILAAFDSGV----HYPR---AESVQGL 328
Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
C ++ G SFP + F GA ENYF V + CL + GG +
Sbjct: 329 DLCVELTGVDQPSFPSFTIEFDDGAVFQPEAENYFVDVAP-NVRCLAMAGLASPLGGFNT 387
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I GN QN++V+YD +GF C
Sbjct: 388 I-GNLLQQNFFVQYDREENLIGFAPAKC 414
>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 543
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 115/400 (28%), Positives = 178/400 (44%), Gaps = 50/400 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + GTPP+ + ILDTGS L W C Y C + S + PK SS+ R +
Sbjct: 169 GEYFLDMFVGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQNGSH---YYPKDSSTYRNIS 225
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSET----L 201
C +P+C + + +PL K Q CP Y Y G T G SET L
Sbjct: 226 CYDPRCQLV---------SSSDPLQHCKAENQTCP-YFYDYADGSNTTGDFASETFTVNL 275
Query: 202 NLPN-----RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLL 250
PN + + + + GC + +G+ G GRG S PSQ+ FSYC L
Sbjct: 276 TWPNGKEKFKQVVDVMFGCGHWNKGFFYGASGLLGLGRGPISFPSQIQSIYGHSFSYC-L 334
Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
+ F +T+ +S LI G L +T + E +YY+ ++ I VG
Sbjct: 335 TDLFSNTSVSSKLIF--GEDKELLNNHNLNFTTLL----AGEETPDETFYYLQIKSIMVG 388
Query: 311 GQRVRV----WH-KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYT 365
G+ + + WH D GGTI+DSG+T TF ++ + + F ++
Sbjct: 389 GEVLDISEQTWHWSSEGAAADAGGGTIIDSGSTLTFFPDSAYDIIKEAFEKKIK-----L 443
Query: 366 RALGAEALTGLRPCFDVPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTV 424
+ + A+ + PC++V G P+ +HF G P ENYF +CL +
Sbjct: 444 QQIAADDFV-MSPCYNVSGAMMQVELPDFGIHFADGGVWNFPAENYFYQYEPDEVICLAI 502
Query: 425 VTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ + + I+GN QN+++ YD++ RLG+ + C
Sbjct: 503 M--KTPNHSHLTIIGNLLQQNFHILYDVKRSRLGYSPRRC 540
>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
gi|224030447|gb|ACN34299.1| unknown [Zea mays]
gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
Length = 512
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 130/395 (32%), Positives = 179/395 (45%), Gaps = 50/395 (12%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + + GTPP+ I+DTGS L W C C C + P F P SSS R L C
Sbjct: 146 YLMDVYVGTPPRRFQMIMDTGSDLNWLQCA---PCLDCFEQRGPVFDPAASSSYRNLTCG 202
Query: 149 NPKCSWIHHESIQ----CRDCNDEPLATSKNCTQICPSYLVLYG--SGLTEGIAL-SETL 201
+P+C + CR ++P CP Y YG S T +AL S T+
Sbjct: 203 DPRCGHVAPPEAPAPRACRRPGEDP----------CP-YYYWYGDQSNSTGDLALESFTV 251
Query: 202 NL----PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL----DKFSYCLL 250
NL + + + GC + AG+ G GRG S SQL FSYCL+
Sbjct: 252 NLTAPGASSRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGGHTFSYCLV 311
Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
H D S ++ + + L YT F S A+ +YYV L + VG
Sbjct: 312 DHGSD---VASKVVFGEDDALALAAHPRLKYTAFAPASSPAD-----TFYYVRLTGVLVG 363
Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
G+ + + G+GGTI+DSGTT ++ ++ + F+ +M + +Y
Sbjct: 364 GELLNISSDTWDASEGGSGGTIIDSGTTLSYFVEPAYQVIRRAFIDRM--SGSYPPVPDF 421
Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDRE 429
L+ PC++V G + PEL L F GA P ENYF + +CL V+ T R
Sbjct: 422 PVLS---PCYNVSGVERPEVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRT 478
Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
G SII GNFQ QN++V YDL N RLGF + C
Sbjct: 479 ---GMSII-GNFQQQNFHVAYDLHNNRLGFAPRRC 509
>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 492
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 126/395 (31%), Positives = 169/395 (42%), Gaps = 49/395 (12%)
Query: 83 SHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSS 142
+ G Y + GTP +LDTGS +VW C C+ C F P+ S S
Sbjct: 134 AQGSGEYFTKIGVGTPATPALMVLDTGSDVVWLQCA---PCRRCYEQSGQVFDPRRSRSY 190
Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETL 201
+GC P C + R C Y V YG G +T G +ETL
Sbjct: 191 NAVGCAAPLCRRLDSGGCDLR---------RSACL-----YQVAYGDGSVTAGDFATETL 236
Query: 202 NLPNRI-IPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSH-- 252
+ +GC + AG+ G GRG S P+Q++ FSYCL+
Sbjct: 237 TFAGGARVARVALGCGHDNEGLFVAAAGLLGLGRGSLSFPTQISRRYGRSFSYCLVDRTS 296
Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
+ +R+S++ +G+ S T ++TP V NP + +YYV L I+VGG
Sbjct: 297 SANTASRSSTVTFGSGAVGS---TVASSFTPMVKNPRM------ETFYYVQLIGISVGGA 347
Query: 313 RV-RVWHKYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
RV V + L LD G GG IVDSGT+ T +A + L D F R L
Sbjct: 348 RVPGVANSDLRLDPSSGRGGVIVDSGTSVTRLARPAYSALRDAFRGAAAGLR-----LSP 402
Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDRE 429
+ C+D+ G K P + +HF GGAE LP ENY V C TD
Sbjct: 403 GGFSLFDTCYDLSGRKVVKVPTVSMHFAGGAEAALPPENYLIPVDSKGTFCFAFAGTD-- 460
Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
GG SII GN Q Q + V +D QR+ F + C
Sbjct: 461 --GGVSII-GNIQQQGFRVVFDGDGQRVAFTPKGC 492
>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 420
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 122/388 (31%), Positives = 170/388 (43%), Gaps = 53/388 (13%)
Query: 89 YSISLSFGTPPQIIPFIL--DTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
Y + L+ G PP +PF+ DTGS L W C CK C P + P SS+ L
Sbjct: 71 YLMELAIGKPP--VPFVALADTGSDLTWTQCQ---PCKLCFPQDTPVYDPSASSTFSPLP 125
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCT--QICPSYLVLYGSGL-TEGIALSETLNL 203
C + C I S+NCT +C Y YG G + GI +ETL L
Sbjct: 126 CSSATCLPIW----------------SRNCTPSSLC-RYRYAYGDGAYSAGILGTETLTL 168
Query: 204 PNRIIP----NFLVGCSVLS---SRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
P GC + S G G GRG SL +QL + KFSYCL F +
Sbjct: 169 GPSSAPVSVGGVAFGCGTDNGGDSLNSTGTVGLGRGTLSLLAQLGVGKFSYCL--TDFFN 226
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
+ S +L + + +T + TP + +P R Y+V L+ I++G R+ +
Sbjct: 227 SALDSPFLLGTLAELAPGPST-VQSTPLLQSPQNPSR------YFVSLQGISLGDVRLPI 279
Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
+ L DG GG IVDSGTTFT +A F + + + +L A
Sbjct: 280 PNGTFDLRGDGTGGMIVDSGTTFTILAESGFREVVGRVARVLGQPPVNASSLDA------ 333
Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
PCF P + P+L LHF GGA++ L +NY + E S+ CL + S +
Sbjct: 334 -PCFPAPAGEPPYMPDLVLHFAGGADMRLYRDNYMSYNEEDSSFCLNIAGTTPES---TS 389
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+LGNFQ QN + +D +L F C
Sbjct: 390 VLGNFQQQNIQMLFDTTVGQLSFLPTDC 417
>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 384
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 126/391 (32%), Positives = 181/391 (46%), Gaps = 56/391 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y ++L+ G+PPQ I+DTGS L W C C+ C P F P S S R
Sbjct: 37 GEYLMTLTLGSPPQSFDVIVDTGSDLNWVQC---LPCRVCYQQPGPKFDPSKSRSFRKAA 93
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPN 205
C + C + + PL K C Y YG T G ET++L N
Sbjct: 94 CTDNLC-----------NVSALPL---KACAANVCQYQYTYGDQSNTNGDLAFETISLNN 139
Query: 206 ----RIIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQLN---LDKFSYCLLSHKFD 255
+ +PNF GC ++ + AG+ G G+G SL SQL+ +KFSYCL+S
Sbjct: 140 GAGTQSVPNFAFGCGTQNLGTFAGAAGLVGLGQGPLSLNSQLSHTFANKFSYCLVSL--- 196
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
++ S L + ++ ++ + YT V N YYYV L I VGGQ +
Sbjct: 197 NSLSASPLTFGSIAAAAN-----IQYTSIVVNAR------HPTYYYVQLNSIEVGGQPLN 245
Query: 316 VWHKYLTLDRD-GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
+ +D+ G GGTI+DSGTT T + + + + S + NY R G+
Sbjct: 246 LAPSVFAIDQSTGRGGTIIDSGTTITMLTLPAYSAVLRAYESFV----NYPRLDGSA--Y 299
Query: 375 GLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVG-EGSAVCLTVVTDREASGG 433
GL CF++ G S P++ F+ GA+ + EN F +V + +CL + S G
Sbjct: 300 GLDLCFNIAGVSNPSVPDMVFKFQ-GADFQMRGENLFVLVDTSATTLCLAM----GGSQG 354
Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
SII GN Q QN+ V YDL +++GF C
Sbjct: 355 FSII-GNIQQQNHLVVYDLEAKKIGFATADC 384
>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 500
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 117/383 (30%), Positives = 166/383 (43%), Gaps = 48/383 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + G P + + +LDTGS + W C C C + P + P +S+S +G
Sbjct: 161 GEYFSRVGVGRPARQLYMVLDTGSDVTWLQCQ---PCADCYAQSDPVYDPSVSTSYATVG 217
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C +P+C RD + A +N T C Y V YG G T G +ETL L +
Sbjct: 218 CDSPRC----------RDLD---AAACRNSTGSC-LYEVAYGDGSYTVGDFATETLTLGD 263
Query: 206 RI-IPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
+ N +GC + AG+ G G S PSQ++ FSYCL+ D S
Sbjct: 264 SAPVSNVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISATTFSYCLV-----DRDSPS 318
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
S L G S T L +P N +YYV L I+VGG+ + +
Sbjct: 319 SSTLQFGDSEQPAVTAPLIRSPRTNT-----------FYYVALSGISVGGEALSIPSSAF 367
Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
+D G+GG IVDSGT T + + L + FV ++ RA G ++ C+D
Sbjct: 368 AMDDAGSGGVIVDSGTAVTRLQSGAYGALREAFVQ---GTQSLPRASG---VSLFDTCYD 421
Query: 382 VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNF 441
+ G + P + L F+GG E+ LP +NY V CL + GP I+GN
Sbjct: 422 LAGRSSVQVPAVALWFEGGGELKLPAKNYLIPVDAAGTYCLAFA----GTSGPVSIIGNV 477
Query: 442 QMQNYYVEYDLRNQRLGFKQQLC 464
Q Q V +D +GF C
Sbjct: 478 QQQGVRVSFDTAKNTVGFTADKC 500
>gi|18391062|ref|NP_563851.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|2160166|gb|AAB60729.1| F21M12.13 gene product [Arabidopsis thaliana]
gi|21593996|gb|AAM65914.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|26983826|gb|AAN86165.1| unknown protein [Arabidopsis thaliana]
gi|332190367|gb|AEE28488.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 449
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 116/389 (29%), Positives = 173/389 (44%), Gaps = 51/389 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + GTPPQ++ +LDT + VW PC+ C CS++ S+ S +
Sbjct: 102 GNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSG---CSGCSNASTSFNTNSSSTYSTV-S 157
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIAL-SETLNLPN 205
C +C+ + C + +P S N + YG + +L +TL L
Sbjct: 158 CSTAQCT--QARGLTCPSSSPQPSVCSFNQS---------YGGDSSFSASLVQDTLTLAP 206
Query: 206 RIIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHK---FDD 256
+IPNF GC + +S P G+ G GRG SL SQ L FSYCL S + F
Sbjct: 207 DVIPNFSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSG 266
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
+ + L + + YTP + NP R + YYV L ++VG +V V
Sbjct: 267 SLKLGLL----------GQPKSIRYTPLLRNP---RRPSL---YYVNLTGVSVGSVQVPV 310
Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
YLT D + GTI+DSGT T A ++E + DEF Q+ N + LGA
Sbjct: 311 DPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQV--NVSSFSTLGA-----F 363
Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
CF E P++ LH ++ LP+EN G+ CL++ R+ +
Sbjct: 364 DTCFSADNENVA--PKITLHMT-SLDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLN 420
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
++ N Q QN + +D+ N R+G + C
Sbjct: 421 VIANLQQQNLRILFDVPNSRIGIAPEPCN 449
>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 355
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 118/392 (30%), Positives = 179/392 (45%), Gaps = 55/392 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y ++ GTP ++ I+DTGS L W C+ C C S FIP S+S L
Sbjct: 1 GEYLATVRLGTPERVFSVIVDTGSDLTWVQCS---PCGTCYSQNDSLFIPNTSTSFTKLA 57
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETL---- 201
C C+ + + P+ C Y YG G L+ G + +T+
Sbjct: 58 CGTELCNGLPY-----------PMCNQTTCV-----YWYSYGDGSLSTGDFVYDTITMDG 101
Query: 202 -NLPNRIIPNFLVGCSVLSSRQPAG---IAGFGRGKTSLPSQLNL---DKFSYCLLSHKF 254
N + +PNF GC + AG I G G+G S PSQL KFSYCL+
Sbjct: 102 INGQKQQVPNFAFGCGHDNEGSFAGADGILGLGQGPLSFPSQLKTVFNGKFSYCLVDW-L 160
Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
T+TS L+ + + + G+ Y + NP V YYYV L I+VGG+ +
Sbjct: 161 APPTQTSPLLFGDAAVPT---FPGVKYISLLTNPKVP------TYYYVKLNGISVGGKLL 211
Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM-VKNRNYTRALGAEAL 373
+ +D G GTI DSGTT T +A E+ + E ++ M +Y R ++
Sbjct: 212 NISSTAFDIDSVGRAGTIFDSGTTVTQLAGEVHQ----EVLAAMNASTMDYPRK--SDDS 265
Query: 374 TGLRPCFDVPGE-KTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG 432
+GL C E + + P + HF+GG ++ LP NYF + + C ++V+ + +
Sbjct: 266 SGLDLCLGGFAEGQLPTVPSMTFHFEGG-DMELPPSNYFIFLESSQSYCFSMVSSPDVT- 323
Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+G+ Q QN+ V YD +++GF + C
Sbjct: 324 ----IIGSIQQQNFQVYYDTVGRKIGFVPKSC 351
>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
Length = 460
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 119/384 (30%), Positives = 168/384 (43%), Gaps = 34/384 (8%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + + GTPP + +LDTGS L+W C C+ C P + P S + + C
Sbjct: 100 YLVDFAIGTPPLALSAVLDTGSDLIWTQC--DAPCRRCFPQPAPLYAPARSVTYANVSCG 157
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNL-PNR 206
+ C + S++ + C +Y YG G T+G+ +ET
Sbjct: 158 SRLCDAL--PSLRPSSRCSASASAPAPERGGC-TYYYSYGDGSSTDGVLATETFTFGAGT 214
Query: 207 IIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSL 263
+ + GC ++ + +G+ G GRG SL SQL + KFSYC F+DTT +S L
Sbjct: 215 TVHDLAFGCGTDNLGGTDNSSGLVGMGRGPLSLVSQLGVTKFSYCFT--PFNDTTTSSPL 272
Query: 264 ILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTL 323
L + +S S + TPFV +PS R S YYY+ L ITVG + + L
Sbjct: 273 FLGSSASLSPAAKS----TPFVPSPSGPRR---SSYYYLSLEGITVGDTLLPIDPAVFRL 325
Query: 324 DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP 383
G GG I+DSGTTFT + F V L + A GL CF P
Sbjct: 326 TASGRGGLIIDSGTTFTALEERAFV------VLARAVAARVALPLASGAHLGLSVCFAAP 379
Query: 384 ---GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
G + P L LHF GA++ LP + CL +V+ R S +LG+
Sbjct: 380 QGRGPEAVDVPRLVLHFD-GADMELPRSSAVVEDRVAGVACLGIVSARGMS-----VLGS 433
Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
Q QN +V YD+ L F+ C
Sbjct: 434 MQQQNMHVRYDVGRDVLSFEPANC 457
>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
Length = 489
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 125/398 (31%), Positives = 180/398 (45%), Gaps = 50/398 (12%)
Query: 75 TTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSF 134
+++ T+ + G Y L GTPP+ + +LDTGS +VW C C+ C S P F
Sbjct: 133 SSSVTSGLAQGSGEYFTRLGVGTPPKYVYMVLDTGSDVVWIQCA---PCRKCYSQTDPVF 189
Query: 135 IPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTE 193
PK S S + C++P C + +S C N Q C Y V YG G T
Sbjct: 190 DPKKSGSFSSISCRSPLC--LRLDSPGC------------NSRQSC-LYQVAYGDGSFTF 234
Query: 194 GIALSETLNLPNRIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNL---DKFSY 247
G +ETL +P +GC + AG+ G GRG+ S P+Q L KFSY
Sbjct: 235 GEFSTETLTFRGTRVPKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPTQTGLRFGRKFSY 294
Query: 248 CLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRI 307
CL+ +++ SS++ G S + +TP + NP + +YY+ L I
Sbjct: 295 CLVDRS--ASSKPSSVVF--GQSAVSRTA---VFTPLITNPKL------DTFYYLELTGI 341
Query: 308 TVGGQRVR-VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
+VGG RV + LD GNGG I+DSGT+ T + + L D F + +
Sbjct: 342 SVGGARVAGITASLFKLDTAGNGGVIIDSGTSVTRLTRRAYVSLRDAFRAGAADLKR--- 398
Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT 426
A + CFD+ G+ P + +HF+ GA+V+LP NY V C
Sbjct: 399 ---APDYSLFDTCFDLSGKTEVKVPTVVMHFR-GADVSLPATNYLIPVDTNGVFCFAFAG 454
Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
G SII GN Q Q + V +D+ R+GF + C
Sbjct: 455 TMS---GLSII-GNIQQQGFRVVFDVAASRIGFAARGC 488
>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
Length = 488
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 130/390 (33%), Positives = 174/390 (44%), Gaps = 57/390 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y L GTP + + +LDTGS +VW C C C S P F P S S +
Sbjct: 143 GEYFTRLGVGTPARYVYMVLDTGSDIVWIQCA---PCIKCYSQTDPVFDPTKSRSFANIP 199
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C +P C + D P ++K QIC Y V YG G T G +ETL
Sbjct: 200 CGSPLCRRL-----------DYPGCSTKK--QIC-LYQVSYGDGSFTVGEFSTETLTFRG 245
Query: 206 RIIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNL---DKFSYCLLSHKFD 255
+ ++GC G+ G GRG+ S PSQ+ KFSYCL
Sbjct: 246 TRVGRVVLGC----GHDNEGLFVGAAGLLGLGRGRLSFPSQIGRRFNSKFSYCLGDRS-- 299
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
++R SS++ G S + T +TP ++NP + +YYV L I+VGG RV
Sbjct: 300 ASSRPSSIVF--GDSAISRTT---RFTPLLSNPKL------DTFYYVELLGISVGGTRVS 348
Query: 316 -VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
+ LD GNGG I+DSGT+ T + + L D F +V N R A +
Sbjct: 349 GISASLFKLDSTGNGGVIIDSGTSVTRLTRAAYVALRDAF---LVGASNLKR---APEFS 402
Query: 375 GLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGP 434
CFD+ G+ P + LHF+ GA+V LP NY V + C + G
Sbjct: 403 LFDTCFDLSGKTEVKVPTVVLHFR-GADVPLPASNYLIPVDNSGSFCFAFA---GTASGL 458
Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
SII GN Q Q + V YDL R+GF + C
Sbjct: 459 SII-GNIQQQGFRVVYDLATSRVGFAPRGC 487
>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
Length = 445
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 124/397 (31%), Positives = 172/397 (43%), Gaps = 64/397 (16%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSK---IPSFIPKLSSSSRLL 145
+++++S GTPPQ ILDTGS L+W QCK + + P + P SSS
Sbjct: 89 HTLTVSIGTPPQPRTLILDTGSDLIW------TQCKLFDTRQHREKPLYDPAKSSSFAAA 142
Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPN 205
C R C T KNC++ Y YGS T+G SET
Sbjct: 143 PCDG-------------RLCETGSFNT-KNCSRNKCIYTYNYGSATTKGELASETFTFGE 188
Query: 206 --RIIPNFLVGCSVLSSRQ---PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRT 260
R+ + GC L+S +GI G + SL SQL + +FSYCL F D T
Sbjct: 189 HRRVSVSLDFGCGKLTSGSLPGASGILGISPDRLSLVSQLQIPRFSYCLT--PFLDRNTT 246
Query: 261 SSLILDNGSSHSDKKTTG-LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
S + + S +TTG + T V NP + + YYYV L I+VG +R+ V
Sbjct: 247 SHIFFGAMADLSKYRTTGPIQTTSLVTNP-----DGSNYYYYVPLIGISVGTKRLNVPVS 301
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM------VKNRNYTRALGAEAL 373
+ RDG+GGT VDSG T + + E L + V + + Y L
Sbjct: 302 SFAIGRDGSGGTFVDSGDTTGMLPSVVMEALKEAMVEAVKLPVVNATDHGYEYEL----- 356
Query: 374 TGLRPCFDVPGEKTGS------FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTD 427
CF +P G+ P L HF GGA + L ++Y V G +CL +
Sbjct: 357 -----CFQLPRNGGGAVETAVQVPPLVYHFDGGAAMLLRRDSYMVEVSAGR-MCLVI--- 407
Query: 428 REASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+SG I+GN+Q QN +V +D+ N F C
Sbjct: 408 --SSGARGAIIGNYQQQNMHVLFDVENHEFSFAPTQC 442
>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
Length = 499
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 132/444 (29%), Positives = 189/444 (42%), Gaps = 60/444 (13%)
Query: 33 LSRFHTNPSQDSYQNLNSLVSSSLTRALH------IKNPQTKTTTTTTTTTTTNISSHSY 86
LSR H + + NSL ++ L AL +K +T+ +T T+ +S
Sbjct: 105 LSRLHRDTVR-----FNSL-TARLQLALEDISKSDLKPLETEIKPEDLSTPVTSGTSQGS 158
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + G P + +LDTGS + W C C C P F P SS+ +
Sbjct: 159 GEYFTRVGVGNPARQFYMVLDTGSDINWLQCQ---PCTDCYQQTDPIFDPTASSTYAPVT 215
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
CQ+ +CS + S CR S C Y V YG G T G +E+++ N
Sbjct: 216 CQSQQCSSLEMSS--CR---------SGQCL-----YQVNYGDGSYTFGDFATESVSFGN 259
Query: 206 R-IIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
+ N +GC + AG+ G G G SL +QL FSYCL++ D+ +S
Sbjct: 260 SGSVKNVALGCGHDNEGLFVGAAGLLGLGGGPLSLTNQLKATSFSYCLVNR---DSAGSS 316
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
+L ++ D T P + N + +YYVGL ++VGGQ V +
Sbjct: 317 TLDFNSAQLGVDSVTA-----PLMKNRKI------DTFYYVGLSGMSVGGQMVSIPESTF 365
Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
LD GNGG IVD GT T + + + PL D FV +M +N T A+ C+D
Sbjct: 366 RLDESGNGGIIVDCGTAITRLQTQAYNPLRDAFV-RMTQNLKLTSAVAL-----FDTCYD 419
Query: 382 VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNF 441
+ G+ + P + HF G LP NY V C + I+GN
Sbjct: 420 LSGQASVRVPTVSFHFADGKSWNLPAANYLIPVDSAGTYCFAFAPTTSSLS----IIGNV 475
Query: 442 QMQNYYVEYDLRNQRLGFKQQLCK 465
Q Q V +DL N R+GF C+
Sbjct: 476 QQQGTRVTFDLANNRMGFSPNKCQ 499
>gi|21450872|gb|AAK44106.2|AF370291_1 unknown protein [Arabidopsis thaliana]
Length = 375
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 116/389 (29%), Positives = 173/389 (44%), Gaps = 51/389 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + GTPPQ++ +LDT + VW PC+ C CS++ S+ S +
Sbjct: 28 GNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSG---CSGCSNASTSFNTNSSSTYSTV-S 83
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIAL-SETLNLPN 205
C +C+ + C + +P S N + YG + +L +TL L
Sbjct: 84 CSTAQCT--QARGLTCPSSSPQPSVCSFNQS---------YGGDSSFSASLVQDTLTLAP 132
Query: 206 RIIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHK---FDD 256
+IPNF GC + +S P G+ G GRG SL SQ L FSYCL S + F
Sbjct: 133 DVIPNFSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSG 192
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
+ + L + + YTP + NP R + YYV L ++VG +V V
Sbjct: 193 SLKLGLL----------GQPKSIRYTPLLRNP---RRPSL---YYVNLTGVSVGSVQVPV 236
Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
YLT D + GTI+DSGT T A ++E + DEF Q+ N + LGA
Sbjct: 237 DPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQV--NVSSFSTLGA-----F 289
Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
CF E P++ LH ++ LP+EN G+ CL++ R+ +
Sbjct: 290 DTCFSADNENVA--PKITLHMT-SLDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLN 346
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
++ N Q QN + +D+ N R+G + C
Sbjct: 347 VIANLQQQNLRILFDVPNSRIGIAPEPCN 375
>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
Length = 359
Score = 138 bits (348), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 121/394 (30%), Positives = 175/394 (44%), Gaps = 58/394 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
G Y + LS GTPPQ+IP ++DTGS LVW C N C +C F SSS +
Sbjct: 3 GEYMMELSIGTPPQLIPAMIDTGSDLVWLKCDN---CDHCDLDHHGETIFFSDASSSYKK 59
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNL 203
L C + CS + I R C + C Y YG G T G S+ ++
Sbjct: 60 LPCNSTHCSGMSSAGIGPR------------CEETC-KYKYEYGDGSRTSGDVGSDRISF 106
Query: 204 PNR--------IIPNFLVGCSVLSSRQ---PAGIAGFGRGKTSLPSQLNLD---KFSYCL 249
+ FL GC+ G+ G G+ SL QL KFSYCL
Sbjct: 107 RSHGAGEDHRSFFDGFLFGCARKLKGDWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCL 166
Query: 250 LSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
+S+ D+ ++ L GSS L V+ P + + YYV L+ IT+
Sbjct: 167 VSY---DSPPSAKSFLFLGSS------AALRGHDVVSTPILHGDHLDQTLYYVDLQSITI 217
Query: 310 GGQRVRVWHKYLTLDRDGN----GGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYT 365
GG V V+ K + T++DSGTT+T + P ++E + Q++
Sbjct: 218 GGVPVVVYDKESGHNTSVGPFLANKTVIDSGTTYTLLTPPVYEAMRKSIEEQVI-----L 272
Query: 366 RALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV 425
LG A GL CF+ G+ + FP + +F ++ LP EN F V VCL++
Sbjct: 273 PTLGNSA--GLDLCFNSSGDTSYGFPSVTFYFANQVQLVLPFENIFQVTSR-DVVCLSM- 328
Query: 426 TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGF 459
++SGG I+GN Q QN+++ YDL ++ F
Sbjct: 329 ---DSSGGDLSIIGNMQQQNFHILYDLVASQISF 359
>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
Length = 359
Score = 138 bits (348), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 122/394 (30%), Positives = 174/394 (44%), Gaps = 58/394 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
G Y + LS GTPPQ+IP ++DTGS LVW C N C +C F SSS +
Sbjct: 3 GEYMMELSIGTPPQLIPAMIDTGSDLVWLKCDN---CDHCDLDHHGETIFFSDASSSYKK 59
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNL 203
L C + CS + I R C + C Y YG G T G S+ ++
Sbjct: 60 LPCNSTHCSGMSSAGIGPR------------CEETC-KYKYEYGDGSRTSGDVGSDRISF 106
Query: 204 PNR--------IIPNFLVGCSVLSSRQ---PAGIAGFGRGKTSLPSQLNLD---KFSYCL 249
+ FL GC G+ G G+ SL QL KFSYCL
Sbjct: 107 RSHGAGEDHRSFFDGFLFGCGRKLKGDWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCL 166
Query: 250 LSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
+S+ D+ ++ L GSS L V+ P + + YYV L+ ITV
Sbjct: 167 VSY---DSPPSAKSFLFLGSS------AALRGHDVVSTPILHGDHLDQTLYYVDLQSITV 217
Query: 310 GGQRVRVWHKYLTLDRDGN----GGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYT 365
GG V V+ K + T++DSGTT+T + P ++E + Q++
Sbjct: 218 GGVPVVVYDKESGHNTSVGPFLANKTVIDSGTTYTLLTPPVYEAMRKSIEEQVI-----L 272
Query: 366 RALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV 425
LG A GL CF+ G+ + FP + +F ++ LP EN F V VCL++
Sbjct: 273 PTLGNSA--GLDLCFNSSGDTSYGFPSVTFYFANQVQLVLPFENIFQVTSR-DVVCLSM- 328
Query: 426 TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGF 459
++SGG I+GN Q QN+++ YDL ++ F
Sbjct: 329 ---DSSGGDLSIIGNMQQQNFHILYDLVASQISF 359
>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 471
Score = 138 bits (348), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 134/443 (30%), Positives = 194/443 (43%), Gaps = 67/443 (15%)
Query: 36 FHTNPSQDS--YQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISL 93
FH +D+ + L+SL ++S +N TT +++ + + G Y +
Sbjct: 81 FHLRLQRDAIRVKKLSSLGATS-------RNLSKPGGTTGFSSSVISGLAQGSGEYFTRI 133
Query: 94 SFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCS 153
GTPP+ + +LDTGS +VW C CK C S P F P S S + C+ P C
Sbjct: 134 GVGTPPKYVYMVLDTGSDIVWLQCA---PCKNCYSQTDPVFNPVKSGSFAKVLCRTPLCR 190
Query: 154 WIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFL 212
+ ES C N Q C Y V YG G T G ++ETL +
Sbjct: 191 RL--ESPGC------------NQRQTC-LYQVSYGDGSYTTGEFVTETLTFRRTKVEQVA 235
Query: 213 VGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSS 262
+GC G+ G GRG S PSQ KFSYCL+ +++ SS
Sbjct: 236 LGC----GHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCLVDRS--ASSKPSS 289
Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR-VWHKYL 321
++ N + + +TP + NP + +YYV L I+VGG V + +
Sbjct: 290 VVFGNSAVSRTAR-----FTPLLTNPRL------DTFYYVELLGISVGGTPVSGITASHF 338
Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
LDR GNGG I+D GT+ T + + L D F + ++ A + C+D
Sbjct: 339 KLDRTGNGGVIIDCGTSVTRLNKPAYIALRDAFRAGASSLKS------APEFSLFDTCYD 392
Query: 382 VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNF 441
+ G+ T P + LHF+ GA+V+LP NY V C + G SII GN
Sbjct: 393 LSGKTTVKVPTVVLHFR-GADVSLPASNYLIPVDGSGRFCFAFA---GTTSGLSII-GNI 447
Query: 442 QMQNYYVEYDLRNQRLGFKQQLC 464
Q Q + V YDL + R+GF + C
Sbjct: 448 QQQGFRVVYDLASSRVGFSPRGC 470
>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
Length = 429
Score = 138 bits (347), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 118/431 (27%), Positives = 181/431 (41%), Gaps = 67/431 (15%)
Query: 68 KTTTTTTTTTTTNISSHSY---GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQ-- 122
K TTT+ + + S ++ G Y +S++FGTPPQ + I DTGS L+W C+
Sbjct: 29 KLATTTSFWAESPMESGAFLGLGQYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPP 88
Query: 123 --CKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQIC 180
C + S+ P+F+ S++ ++ C +C + P C+
Sbjct: 89 AFCPKKACSRRPAFVASKSATLSVVPCSAAQCLLV-----------PAPRGHGPACSPAA 137
Query: 181 P---SYLVLYGSG-LTEGIALSETLNLPN-----RIIPNFLVGCSVL----SSRQPAGIA 227
P Y Y G T G +T + N + GC S G+
Sbjct: 138 PVPCGYAYDYADGSSTTGFLARDTATISNGTSGGAAVRGVAFGCGTRNQGGSFSGTGGVI 197
Query: 228 GFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPF 284
G G+G+ S P+Q L FSYCLL + R+SS + ++ YTP
Sbjct: 198 GLGQGQLSFPAQSGSLFAQTFSYCLLDLEGGRRGRSSSFLFLG----RPERRAAFAYTPL 253
Query: 285 VNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAP 344
V+NP +YYVG+ I VG + + V +D GNGGT++DSG+T T++
Sbjct: 254 VSNPLA------PTFYYVGVVAIRVGNRVLPVPGSEWAIDVLGNGGTVIDSGSTLTYLRL 307
Query: 345 ELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV-----PGEKTGSFPELKLHFKG 399
+ L F + + R + A GL C++V G FP L + F
Sbjct: 308 GAYLHLVSAFAASVHLPRIPSS---ATFFQGLELCYNVSSSSSSAPANGGFPRLTIDFAQ 364
Query: 400 GAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI------ILGNFQMQNYYVEYDLR 453
G + LP NY V + CL + P++ +LGN Q Y+VE+D
Sbjct: 365 GLSLELPTGNYLVDVAD-DVKCLAIR--------PTLSPFAFNVLGNLMQQGYHVEFDRA 415
Query: 454 NQRLGFKQQLC 464
+ R+GF + C
Sbjct: 416 SARIGFARTEC 426
>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
[Brachypodium distachyon]
Length = 540
Score = 137 bits (346), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 117/387 (30%), Positives = 169/387 (43%), Gaps = 49/387 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + G+P + + +LDTGS + W C C C + P F P LSSS +
Sbjct: 194 GEYFSRIGIGSPARQLYMVLDTGSDVTWLQCA---PCADCYAQSDPLFDPALSSSYATVP 250
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLP- 204
C +P C ++ C++ + +C Y V YG G T G +ETL L
Sbjct: 251 CDSPHC-----RALDASACHNNAANGNSSCV-----YEVAYGDGSYTVGDFATETLTLGG 300
Query: 205 --NRIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTR 259
+ + + +GC + AG+ G G S PSQ++ +FSYCL+ D
Sbjct: 301 DGSAAVHDVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISATEFSYCLV-----DRDS 355
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR-VWH 318
S+ L G+S S T L +P N +YYV L I+VGG+ + +
Sbjct: 356 PSASTLQFGASDSSTVTAPLMRSPRSN-----------TFYYVALNGISVGGETLSDIPP 404
Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL-GAEALTGLR 377
+D G+GG IVDSGT T + + L D FV T+AL A ++
Sbjct: 405 AAFAMDEQGSGGVIVDSGTAVTRLQSSAYSALRDAFVRG-------TQALPRASGVSLFD 457
Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
C+D+ G + P + L F+GG E+ LP +NY V CL A+GG I
Sbjct: 458 TCYDLAGRSSVQVPAVSLRFEGGGELKLPAKNYLIPVDGAGTYCLAFA----ATGGAVSI 513
Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+GN Q Q V +D +GF C
Sbjct: 514 VGNVQQQGIRVSFDTAKNTVGFSPNKC 540
>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
Length = 427
Score = 137 bits (346), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 127/412 (30%), Positives = 180/412 (43%), Gaps = 66/412 (16%)
Query: 82 SSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSS 141
SS G Y + L GTP + P I+DTGS L W C SS P + SSS
Sbjct: 52 SSIGSGQYFVELRVGTPAKKFPLIVDTGSDLTWIQCNPPNTTANSSSPPAPWYDKSSSSS 111
Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPS---YLVLYGS-GLTEGIAL 197
R + C + +C ++ P +C+ PS Y Y T GI
Sbjct: 112 YREIPCTDDECQFL-------------PAPIGSSCSITSPSPCDYTYGYSDQSRTTGILA 158
Query: 198 SETLNLPNRI---------------IPNFLVGCSVLSSRQ----PAGIAGFGRGKTSLPS 238
ET+++ +R I N +GCS S +G+ G G+G SL +
Sbjct: 159 YETISMKSRKRSGKRAGNHKTRRIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLAT 218
Query: 239 QLNLDK----FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERN 294
Q FSYCL+ + + +S L++ G +H K L +TP V NP
Sbjct: 219 QTRHTALGGIFSYCLVDY-LRGSNASSFLVM--GRTHWRK----LAHTPIVRNP------ 265
Query: 295 AFSVYYYVGLRRITVGGQRVR-VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADE 353
A +YYV + + V G+ V + +D DGN GTI DSGTT ++ L EP +
Sbjct: 266 AAQSFYYVNVTGVAVDGKPVDGIASSDWGIDGDGNKGTIFDSGTTLSY----LREPAYSK 321
Query: 354 FVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAV 413
+ + + RA E G C++V + G P+L + F+GGA + LP NY +
Sbjct: 322 VLGALNASIYLPRA--QEIPEGFELCYNVTRMEKG-MPKLGVEFQGGAVMELPWNNYMVL 378
Query: 414 VGEG-SAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
V E V L VT S ILGN Q++++EYDL R+GFK C
Sbjct: 379 VAENVQCVALQKVTTTNGSN----ILGNLLQQDHHIEYDLAKARIGFKWSPC 426
>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 137 bits (346), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 115/373 (30%), Positives = 173/373 (46%), Gaps = 49/373 (13%)
Query: 105 ILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIH-HESIQCR 163
I+DT S L W C C C + P F P S S + C + C + + +
Sbjct: 127 IVDTASELTWVQCE---PCDACHDQQEPLFDPSSSPSYAAVPCNSSSCDALRVATGMSGQ 183
Query: 164 DCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSSRQ 222
C+D+P A S Y + Y G + G+ + L+L I F+ GC S++
Sbjct: 184 ACDDQPAACS---------YTLSYRDGSYSRGVLAHDRLSLAGEDIQGFVFGCGT-SNQG 233
Query: 223 P----AGIAGFGRGKTSLPSQLNLDKF----SYCLLSHKFDDTTRTSSLIL-DNGSSHSD 273
P +G+ G GR + SL SQ +D+F SYCL ++ + SL+L D+ S +
Sbjct: 234 PFGGTSGLMGLGRSQLSLISQ-TMDQFGGVFSYCLPPK---ESGSSGSLVLGDDASVY-- 287
Query: 274 KKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIV 333
+ +T + YT V++P +Y L ITVGG+ V + G G IV
Sbjct: 288 RNSTPIVYTAMVSDPLQGP------FYLANLTGITVGGEDV----QSPGFSAGGGGKAIV 337
Query: 334 DSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPEL 393
DSGT T + P ++ + EFVSQ+ + A + L CFD+ G + P L
Sbjct: 338 DSGTIITSLVPSVYAAVRAEFVSQLAEYPQ------AAPFSILDTCFDLTGLREVQVPSL 391
Query: 394 KLHFKGGAEVTLPVENYFAVV-GEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDL 452
KL F GGAEV + + VV G+ S VCL + + + P I+GN+Q +N V +D
Sbjct: 392 KLVFDGGAEVEVDSKGVLYVVTGDASQVCLALASLKSEYDTP--IIGNYQQKNLRVIFDT 449
Query: 453 RNQRLGFKQQLCK 465
++GF Q+ C
Sbjct: 450 VGSQIGFAQETCD 462
>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 120/395 (30%), Positives = 169/395 (42%), Gaps = 48/395 (12%)
Query: 75 TTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSF 134
+T T+ +S G Y + G P + +LDTGS + W C C C P F
Sbjct: 6 STPVTSGTSQGSGEYFTRVGVGNPARQFYMVLDTGSDINWLQCQ---PCTDCYQQTDPIF 62
Query: 135 IPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTE 193
P SS+ + CQ+ +CS + S CR S C Y V YG G T
Sbjct: 63 DPTASSTYAPVTCQSQQCSSLEMSS--CR---------SGQCL-----YQVNYGDGSYTF 106
Query: 194 GIALSETLNLPNR-IIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDKFSYCL 249
G +E+++ N + N +GC + AG+ G G G SL +QL FSYCL
Sbjct: 107 GDFATESVSFGNSGSVKNVALGCGHDNEGLFVGAAGLLGLGGGPLSLTNQLKATSFSYCL 166
Query: 250 LSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
++ D+ +S+L ++ D T P + N + +YYVGL ++V
Sbjct: 167 VNR---DSAGSSTLDFNSAQLGVDSVTA-----PLMKNRKI------DTFYYVGLSGMSV 212
Query: 310 GGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
GGQ V + LD GNGG IVD GT T + + + PL D FV +M +N T A+
Sbjct: 213 GGQMVSIPESTFRLDESGNGGIIVDCGTAITRLQTQAYNPLRDAFV-RMTQNLKLTSAVA 271
Query: 370 AEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDRE 429
C+D+ G+ + P + HF G LP NY V C
Sbjct: 272 L-----FDTCYDLSGQASVRVPTVSFHFADGKSWNLPAANYLIPVDSAGTYCFAFAPTTS 326
Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ I+GN Q Q V +DL N R+GF C
Sbjct: 327 SLS----IIGNVQQQGTRVTFDLANNRMGFSPNKC 357
>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 447
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 126/438 (28%), Positives = 184/438 (42%), Gaps = 64/438 (14%)
Query: 48 LNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSH--SYGGYSISLSFGTP-PQIIPF 104
L +V S RA P +++ T T SH Y Y I GTP PQ +
Sbjct: 50 LRRMVLRSRARAAKQLCP-SRSGTPVRVTAPVASGSHVVGYTEYLIHFGIGTPRPQQVAL 108
Query: 105 ILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRD 164
+DTGS +VW C C C + +P F S + + C +P C + +
Sbjct: 109 EVDTGSDVVWTQCR---PCFDCFTQPLPRFDTSASDTVHGVLCTDPICRALRPHACFLGG 165
Query: 165 CNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPNR-----IIPNFLVGCSVL 218
C +Y V YG + +T G ++ + +P+ + GC
Sbjct: 166 C----------------TYQVNYGDNSVTIGQLAKDSFTFDGKGGGKVTVPDLVFGCGQY 209
Query: 219 SS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSD- 273
++ GIAGFGRG SLP QL + FSYC F + S + G + +D
Sbjct: 210 NTGNFHSNETGIAGFGRGPLSLPRQLGVSSFSYC-----FTTIFESKSTPVFLGGAPADG 264
Query: 274 ---KKTTGLTYTPFV-NNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNG 329
T + TPF+ N+P YYY+ L+ ITVG R+ V + DG+G
Sbjct: 265 LRAHATGPILSTPFLPNHPE---------YYYLSLKGITVGKTRLAVPESAFVVKADGSG 315
Query: 330 GTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF---DVPGEK 386
GTI+DSGT T +F L + FV+Q+ G L CF VP
Sbjct: 316 GTIIDSGTAITAFPRAVFRSLWEAFVAQVPLPHTSYNDTGEPTLQ----CFSTESVPDAS 371
Query: 387 TGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNY 446
P++ LH + GA+ LP ENY A + +C+ V+ A ++GNFQ QN
Sbjct: 372 KVPVPKMTLHLE-GADWELPRENYMAEYPDSDQLCVVVL----AGDDDRTMIGNFQQQNM 426
Query: 447 YVEYDLRNQRLGFKQQLC 464
++ +DL +L + C
Sbjct: 427 HIVHDLAGNKLVIEPAQC 444
>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 445
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 113/383 (29%), Positives = 169/383 (44%), Gaps = 48/383 (12%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y GTP Q + +D + W PC+ C C++S PSF P SS+ R + C
Sbjct: 102 YIARAGLGTPAQTLLVAIDPSNDAAWVPCS---ACAGCAASS-PSFSPTQSSTYRTVPCG 157
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
+P+C+ + S P +C + + Y + + + ++L L N ++
Sbjct: 158 SPQCAQVPSPSC--------PAGVGSSC-----GFNLTYAASTFQAVLGQDSLALENNVV 204
Query: 209 PNFLVGC-SVLS--SRQPAGIAGFGRGKTSLPSQLN---LDKFSYCLLSHKFDDTTRTSS 262
++ GC V+S S P G+ GFGRG S SQ FSYCL +++ + + T
Sbjct: 205 VSYTFGCLRVVSGNSVPPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPNYRSSNFSGT-- 262
Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
L L KTT L Y P + PS+ YYV + I VG + V+V L
Sbjct: 263 LKLGPIGQPKRIKTTPLLYNP--HRPSL---------YYVNMIGIRVGSKVVQVPQSALA 311
Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
+ GTI+D+GT FT +A ++ + D F R R A L G C++V
Sbjct: 312 FNPVTGSGTIIDAGTMFTRLAAPVYAAVRDAF-------RGRVRTPVAPPLGGFDTCYNV 364
Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI-ILGNF 441
T S P + F G VTLP EN G CL + ++ +L +
Sbjct: 365 ----TVSVPTVTFMFAGAVAVTLPEENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASM 420
Query: 442 QMQNYYVEYDLRNQRLGFKQQLC 464
Q QN V +D+ N R+GF ++LC
Sbjct: 421 QQQNQRVLFDVANGRVGFSRELC 443
>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
Length = 520
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 120/393 (30%), Positives = 174/393 (44%), Gaps = 43/393 (10%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + G+PP+ ILDTGS L W C C C + PK S+S + +
Sbjct: 153 GEYFMDVLVGSPPKHFSLILDTGSDLNWIQC---LPCHDCFQQNGAFYDPKASASYKNIT 209
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSET--LNLP 204
C +P+C+ + +P K+ Q CP Y S T G ET +NL
Sbjct: 210 CNDPRCNLVSPP---------DPPKPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLT 260
Query: 205 NR-------IIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLS 251
+ N + GC + AG+ G GRG S SQL FSYCL+
Sbjct: 261 TSGGSSELYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVD 320
Query: 252 HKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG 311
DT +S LI G L +T FV + N +YYV ++ I V G
Sbjct: 321 RN-SDTNVSSKLIF--GEDKDLLSHPNLNFTSFV----ARKENLVDTFYYVQIKSIIVAG 373
Query: 312 QRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
+ + + + + DG GGTI+DSGTT ++ A EP A EF+ + + +
Sbjct: 374 EVLNIPEETWNISSDGAGGTIIDSGTTLSYFA----EP-AYEFIKNKIAEKAKGKYPVYR 428
Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREAS 431
L PCF+V G + PEL + F GA P EN F + E VCL ++ +++
Sbjct: 429 DFPILDPCFNVSGIDSIQLPELGIAFADGAVWNFPTENSFIWLNE-DLVCLAILGTPKSA 487
Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN+Q QN+++ YD + RLG+ C
Sbjct: 488 FS---IIGNYQQQNFHILYDTKRSRLGYAPTKC 517
>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
Length = 452
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 123/396 (31%), Positives = 169/396 (42%), Gaps = 59/396 (14%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + L+ GTPPQ + +LDTGS L+W C C C S P F P S+S + C
Sbjct: 96 YVVDLAIGTPPQPVSALLDTGSDLIWTQCA---PCASCLSQPDPLFAPGQSASYEPMRCA 152
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN-- 205
CS I H S + D CT Y YG G +T G+ +E +
Sbjct: 153 GTLCSDILHHSCERPD----------TCT-----YRYNYGDGTMTVGVYATERFTFASSG 197
Query: 206 ------RIIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
+P GC +V S +GI GFGR SL SQL++ +FSYCL S+
Sbjct: 198 GGGLTTTTVP-LGFGCGSVNVGSLNNGSGIVGFGRNPLSLVSQLSIRRFSYCLTSYA--- 253
Query: 257 TTRTSSLILDNGSSHSDKKTTG-LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
+ R S+L+ + S TG + TP + +P +YYV +TVG +R+R
Sbjct: 254 SRRQSTLLFGSLSDGVYGDATGRVQTTPLLQSPQ------NPTFYYVHFTGLTVGARRLR 307
Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
+ L DG+GG IVDSGT T + + + F Q+ A G G
Sbjct: 308 IPESAFALRPDGSGGVIVDSGTALTLLPAAVLAEVVRAFRQQL----RLPFANGGNPEDG 363
Query: 376 LRPCFDVPGEKTGS-------FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR 428
+ CF VP S P + LHF+ GA++ LP NY +CL +
Sbjct: 364 V--CFLVPAAWRRSSSTSQMPVPRMVLHFQ-GADLDLPRRNYVLDDHRRGRLCLLLAD-- 418
Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
SG +GN Q+ V YDL + L C
Sbjct: 419 --SGDDGSTIGNLVQQDMRVLYDLEAETLSIAPARC 452
>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
Length = 426
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 113/383 (29%), Positives = 169/383 (44%), Gaps = 48/383 (12%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y GTP Q + +D + W PC+ C C++S PSF P SS+ R + C
Sbjct: 83 YIARAGLGTPAQTLLVAIDPSNDAAWVPCS---ACAGCAASS-PSFSPTQSSTYRTVPCG 138
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
+P+C+ + S P +C + + Y + + + ++L L N ++
Sbjct: 139 SPQCAQVPSPSC--------PAGVGSSC-----GFNLTYAASTFQAVLGQDSLALENNVV 185
Query: 209 PNFLVGC-SVLS--SRQPAGIAGFGRGKTSLPSQLN---LDKFSYCLLSHKFDDTTRTSS 262
++ GC V+S S P G+ GFGRG S SQ FSYCL +++ + + T
Sbjct: 186 VSYTFGCLRVVSGNSVPPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPNYRSSNFSGT-- 243
Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
L L KTT L Y P + PS+ YYV + I VG + V+V L
Sbjct: 244 LKLGPIGQPKRIKTTPLLYNP--HRPSL---------YYVNMIGIRVGSKVVQVPQSALA 292
Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
+ GTI+D+GT FT +A ++ + D F R R A L G C++V
Sbjct: 293 FNPVTGSGTIIDAGTMFTRLAAPVYAAVRDAF-------RGRVRTPVAPPLGGFDTCYNV 345
Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI-ILGNF 441
T S P + F G VTLP EN G CL + ++ +L +
Sbjct: 346 ----TVSVPTVTFMFAGAVAVTLPEENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASM 401
Query: 442 QMQNYYVEYDLRNQRLGFKQQLC 464
Q QN V +D+ N R+GF ++LC
Sbjct: 402 QQQNQRVLFDVANGRVGFSRELC 424
>gi|357460823|ref|XP_003600693.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355489741|gb|AES70944.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 129/446 (28%), Positives = 187/446 (41%), Gaps = 46/446 (10%)
Query: 36 FHTNPSQDSYQ-----NLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYG--- 87
F NPSQ + L SL S+ T + + Q +TT S++Y
Sbjct: 10 FSINPSQQTNSLSLSFPLTSLSLSNDTTSKMLYTSQLFSTTKKPNNPQNKTPSYNYKFSF 69
Query: 88 GYS----ISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSR 143
YS I+L GTPPQ P +LDTGS L W C + SF P LSS+
Sbjct: 70 KYSMALIINLPIGTPPQTQPMVLDTGSQLSWIQC-------HKKQPPTASFDPSLSSTFS 122
Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLN 202
+L C +P C D L TS + ++C Y Y G EG + E
Sbjct: 123 ILPCTHPLCK---------PRIPDFTLPTSCDQNRLC-HYSYFYADGTYAEGNLVREKFT 172
Query: 203 LPNRI-IPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDT-TRT 260
+ P ++GC+ S P GI G G+ S Q + KFSYC+ + T T
Sbjct: 173 FSRSVSTPPLILGCAT-ESTDPRGILGMNLGRLSFAKQSKITKFSYCVPPRQTRPGFTPT 231
Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
S L N S K G+ + P N + Y + + I + G+++ +
Sbjct: 232 GSFYLGNNPSSKGFKYVGMMTSSRQRMP-----NFDPLAYTIPMVGIRIAGKKLNISPAV 286
Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
D G+G T++DSG+ FT++ E ++ + + V + G A CF
Sbjct: 287 FRADAGGSGQTMIDSGSEFTYLVSEAYDKVRAQVVRAVGPRLKKGYVYGGVADM----CF 342
Query: 381 D-VPGEKTGSF-PELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
D V + G E+ F+ G EV +P E A VG G V + + G S I+
Sbjct: 343 DSVKAVEIGRLIGEMVFEFERGVEVVIPKERVLADVGGG--VHCVGIGSSDKLGAASNII 400
Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
GNF QN +VE+DL +R+GF + C
Sbjct: 401 GNFHQQNLWVEFDLVRRRVGFGKADC 426
>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
Length = 430
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 117/431 (27%), Positives = 182/431 (42%), Gaps = 67/431 (15%)
Query: 68 KTTTTTTTTTTTNISSHSY---GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQ-- 122
K T T+ + + S ++ G Y +S++FGTPPQ + I DTGS L+W C+
Sbjct: 30 KLATITSFWAESPMESGAFLGLGQYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPP 89
Query: 123 --CKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQIC 180
C + S+ P+F+ S++ ++ C +C + P +C+
Sbjct: 90 AFCPKKACSRRPAFVASKSATLSVVPCSAAQCLLV-----------PAPRGHGPSCSPAA 138
Query: 181 P---SYLVLYGSG-LTEGIALSETLNLPN-----RIIPNFLVGCSVL----SSRQPAGIA 227
P Y Y G T G +T + N + GC S G+
Sbjct: 139 PVPCGYAYDYADGSSTTGFLARDTATISNGTSGGAAVRGVAFGCGTRNQGGSFSGTGGVI 198
Query: 228 GFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPF 284
G G+G+ S P+Q L FSYCLL + R+SS + ++ YTP
Sbjct: 199 GLGQGQLSFPAQSGSLFAQTFSYCLLDLEGGRRGRSSSFLFLG----RPERRAAFAYTPL 254
Query: 285 VNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAP 344
V+NP +YYVG+ I VG + + V +D GNGGT++DSG+T T++
Sbjct: 255 VSNPLA------PTFYYVGVVAIRVGNRVLPVPGSEWAIDVLGNGGTVIDSGSTLTYLRL 308
Query: 345 ELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKT-----GSFPELKLHFKG 399
+ L F + + R + A GL C++V + G FP L + F
Sbjct: 309 GAYLHLVSAFAASVHLPRIPSS---ATFFQGLELCYNVSSSSSLAPANGGFPRLTIDFAQ 365
Query: 400 GAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI------ILGNFQMQNYYVEYDLR 453
G + LP NY V + CL + P++ +LGN Q Y+VE+D
Sbjct: 366 GLSLELPTGNYLVDVAD-DVKCLAIR--------PTLSPFAFNVLGNLMQQGYHVEFDRA 416
Query: 454 NQRLGFKQQLC 464
+ R+GF + C
Sbjct: 417 SARIGFARTEC 427
>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 367
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 115/393 (29%), Positives = 170/393 (43%), Gaps = 54/393 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + GTP + + ++DTGS + W C C C K F P SSS ++L
Sbjct: 14 GEYFAVVGVGTPRRDMYLVVDTGSDITWLQCA---PCTNCYKQKDALFNPSSSSSFKVLD 70
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-------LTEGIALSE 199
C + C ++ + + C S C Y YG G +T+ + L +
Sbjct: 71 CSSSLC--LNLDVMGC---------LSNKCL-----YQADYGDGSFTMGELVTDNVVLDD 114
Query: 200 TLNLPNRIIPNFLVGCSVLSSRQ---PAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHK 253
++ N +GC + AGI G GRG S P+ L+ + FSYCL +
Sbjct: 115 AFGPGQVVLTNIPLGCGHDNEGTFGTAAGILGLGRGPLSFPNNLDASTRNIFSYCLPDRE 174
Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
D +++ + D H+ T + + P + NP VA YYYV + I+VGG
Sbjct: 175 SDPNHKSTLVFGDAAIPHT--ATGSVKFIPQLRNPRVA------TYYYVQITGISVGGNL 226
Query: 314 V-RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
+ + LD GNGGTI DSGTT T + + + D F R T L + A
Sbjct: 227 LTNIPASVFQLDSHGNGGTIFDSGTTITRLEARAYTAVRDAF-------RAATMHLTSAA 279
Query: 373 -LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREAS 431
C+D G + S P + HF+G ++ LP NY V + C AS
Sbjct: 280 DFKIFDTCYDFTGMNSISVPTVTFHFQGDVDMRLPPSNYIVPVSNNNIFCFAFA----AS 335
Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
GPS+I GN Q Q++ V YD ++++G C
Sbjct: 336 MGPSVI-GNVQQQSFRVIYDNVHKQIGLLPDQC 367
>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
[Cucumis sativus]
Length = 384
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 125/386 (32%), Positives = 174/386 (45%), Gaps = 50/386 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + GTPP+ + +LDTGS +VW C CK C S P F P S S +
Sbjct: 40 GEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCA---PCKNCYSQTDPVFNPVKSGSFAKVL 96
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C+ P C + ES C N Q C Y V YG G T G ++ETL
Sbjct: 97 CRTPLCRRL--ESPGC------------NQRQTC-LYQVSYGDGSYTTGEFVTETLTFRR 141
Query: 206 RIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTR 259
+ +GC + AG+ G GRG S PSQ KFSYCL+ +++
Sbjct: 142 TKVEQVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCLVDR--SASSK 199
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR-VWH 318
SS++ N + + +TP + NP + +YYV L I+VGG V +
Sbjct: 200 PSSVVFGNSAVSRTAR-----FTPLLTNPRL------DTFYYVELLGISVGGTPVSGITA 248
Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
+ LDR GNGG I+D GT+ T + + L D F + ++ A +
Sbjct: 249 SHFKLDRTGNGGVIIDCGTSVTRLNKPAYIALRDAFRAGASSLKS------APEFSLFDT 302
Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
C+D+ G+ T P + LHF+ GA+V+LP NY V C + G SII
Sbjct: 303 CYDLSGKTTVKVPTVVLHFR-GADVSLPASNYLIPVDGSGRFCFAFA---GTTSGLSII- 357
Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
GN Q Q + V YDL + R+GF + C
Sbjct: 358 GNIQQQGFRVVYDLASSRVGFSPRGC 383
>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
Length = 493
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 124/393 (31%), Positives = 167/393 (42%), Gaps = 44/393 (11%)
Query: 83 SHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSS 142
+ G Y + GTP +LDTGS +VW C C+ C P F P+ SSS
Sbjct: 134 AQGSGEYFTKIGVGTPSTPALMVLDTGSDVVWLQCA---PCRRCYDQSGPVFDPRRSSSY 190
Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETL 201
+ C P C + R + C Y V YG G +T G +ETL
Sbjct: 191 GAVDCAAPLCRRLDSGGCDLR---------RRACL-----YQVAYGDGSVTAGDFATETL 236
Query: 202 NLPNRI-IPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKF 254
+ +GC + AG+ G GRG S P+Q++ FSYCL+
Sbjct: 237 TFAGGARVARVALGCGHDNEGLFVAAAGLLGLGRGSLSFPTQISRRYGKSFSYCLVDRTS 296
Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
++ +S + + + ++TP V NP + +YYV L I+VGG RV
Sbjct: 297 SSSSGAASRSRSSTVTFGPPSASAASFTPMVRNPRM------ETFYYVQLVGISVGGARV 350
Query: 315 -RVWHKYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
V L LD G GG IVDSGT+ T +A + L D F + R L
Sbjct: 351 PGVAESDLRLDPSTGRGGVIVDSGTSVTRLARPSYSALRDAFRAAAAGLR-----LSPGG 405
Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREAS 431
+ C+D+ G K P + +HF GGAE LP ENY V C TD
Sbjct: 406 FSLFDTCYDLGGRKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTD---- 461
Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
GG SII GN Q Q + V +D QR+GF + C
Sbjct: 462 GGVSII-GNIQQQGFRVVFDGDGQRVGFAPKGC 493
>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 417
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 126/393 (32%), Positives = 184/393 (46%), Gaps = 61/393 (15%)
Query: 89 YSISLSFGTPPQIIPFIL--DTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
Y + L+ GTPP +PF+ DTGS L W C CK C P + P SS+ +
Sbjct: 66 YLMELAIGTPP--VPFVALADTGSDLTWTQCQ---PCKLCFPQDTPVYDPSASSTFSPVP 120
Query: 147 CQNPKC--SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNL 203
C + C +W + R+C++ S C Y+ Y G + GI +ETL +
Sbjct: 121 CSSATCLPTW------RSRNCSNP----SSPC-----RYIYSYSDGAYSVGILGTETLTI 165
Query: 204 PNRI------IPNFLVGCSVLS---SRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKF 254
+ + + + GC + S G G GRG SL +QL + KFSYCL F
Sbjct: 166 GSSVPGQTVSVGSVAFGCGTDNGGDSLNSTGTVGLGRGTLSLLAQLGVGKFSYCL--TDF 223
Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
++T S L + + T + TP + +P R Y+V L+ I++G R+
Sbjct: 224 FNSTMDSPFFLGTLAELAPGPGT-VQSTPLLQSPLNPSR------YFVNLQGISLGDVRL 276
Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
+ + L DGNGG +VDSGTTFT +A F + D V+Q++ + A +L
Sbjct: 277 PIPNGTFDLRADGNGGMMVDSGTTFTILAKSGFREVVDR-VAQLLGQ----PPVNASSLD 331
Query: 375 GLRPCFDVP-GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
PCF P GE P+L LHF GGA++ L +NY + + S+ CL +V G
Sbjct: 332 --SPCFPSPDGEPF--MPDLVLHFAGGADMRLHRDNYMSYNEDDSSFCLNIV------GS 381
Query: 434 PSII--LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
PS LGNFQ QN + +D+ +L F C
Sbjct: 382 PSTWSRLGNFQQQNIQMLFDMTVGQLSFLPTDC 414
>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 455
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 115/404 (28%), Positives = 170/404 (42%), Gaps = 51/404 (12%)
Query: 82 SSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCS-SSKIPSFIPKLSS 140
+S G Y +SL GTPPQ + + DTGS L+W C+ C+ CS S +F + S+
Sbjct: 79 ASSGSGQYFVSLRIGTPPQTLLLVADTGSDLIWVKCS---PCRNCSHRSPGSAFFARHST 135
Query: 141 SSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSET 200
+ + C +P+C + H CN L + C S T G E
Sbjct: 136 TYSAIHCYSPQCQLVPHP--HPNPCNRTRLHSP------CRYQYTYADSSTTTGFFSKEA 187
Query: 201 LNLPN-----------------RIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNL- 242
L L RI L G S ++ G+ G GR S SQL
Sbjct: 188 LTLNTSTGKVKKLNGLSFGCGFRISGPSLTGASFEGAQ---GVMGLGRAPISFSSQLGRR 244
Query: 243 --DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYY 300
KFSYCL+ + TS L + + + K +++TP + NP +Y
Sbjct: 245 FGSKFSYCLMDYTLSPPP-TSFLTIGGAQNVAVSKKGIMSFTPLLINPLSP------TFY 297
Query: 301 YVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVK 360
Y+ ++ + V G ++ + ++D GNGGTI+DSGTT TF+ EP E + K
Sbjct: 298 YIAIKGVYVNGVKLPINPSVWSIDDLGNGGTIIDSGTTLTFIT----EPAYTEILKAFKK 353
Query: 361 NRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAV 420
AE G C +V G + P + + GG+ + P NYF G+
Sbjct: 354 RVKLPSP--AEPTPGFDLCMNVSGVTRPALPRMSFNLAGGSVFSPPPRNYFIETGD-QIK 410
Query: 421 CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
CL V + G +LGN Q + +E+D RLGF ++ C
Sbjct: 411 CLAVQPVSQDGG--FSVLGNLMQQGFLLEFDRDKSRLGFTRRGC 452
>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 431
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 123/394 (31%), Positives = 179/394 (45%), Gaps = 60/394 (15%)
Query: 89 YSISLSFGTPPQIIPFIL--DTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
Y + L+ GTPP +PF+ DTGS L W C CK C P + P SS+ +
Sbjct: 77 YLMELAIGTPP--VPFVALADTGSDLTWTQCQ---PCKLCFPQDTPVYDPSASSTFSPVP 131
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-----SGLTEGIALSETL 201
C + C P+ S+NC+ PS L YG + GI +ETL
Sbjct: 132 CSSATC---------------LPVLRSRNCST--PSSLCRYGYSYSDGAYSAGILGTETL 174
Query: 202 NLPNRI------IPNFLVGCSVLS---SRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSH 252
L + + + + GC + S G G GRG SL +QL + KFSYCL
Sbjct: 175 TLGSSVPGQAVSVSDVAFGCGTDNGGDSLNSTGTVGLGRGTLSLLAQLGVGKFSYCL--T 232
Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
F ++T S +L + + + TP + +P R Y V L+ IT+G
Sbjct: 233 DFFNSTLDSPFLLGTLAELA-PGPGAVQSTPLLQSPLNPSR------YVVSLQGITLGDV 285
Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
R+ + +K L + GG +VDSGTTF+ + F + D V+Q++ + A +
Sbjct: 286 RLPIPNKTFDLHANSTGGMVVDSGTTFSILPESGFRVVVDH-VAQVLGQ----PPVNASS 340
Query: 373 LTGLRPCFDVP-GEKTGSF-PELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA 430
L PCF P GE+ F P+L LHF GGA++ L +NY + E S+ CL +V
Sbjct: 341 LD--SPCFPAPAGERQLPFMPDLVLHFAGGADMRLHRDNYMSYNQEDSSFCLNIVGTTST 398
Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+LGNFQ QN + +D+ +L F C
Sbjct: 399 WS----MLGNFQQQNIQMLFDMTVGQLSFLPTDC 428
>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
Length = 410
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 129/389 (33%), Positives = 168/389 (43%), Gaps = 70/389 (17%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y ++ S GTPPQ + + DTGS L+W C C C PS+ P SSS L
Sbjct: 80 GAYDMTFSIGTPPQELSALADTGSDLIWAKCG---ACTRCVPQGSPSYYPNKSSSFSKLP 136
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-----LTEGIALSETL 201
C CS + S QC A C Y YG T+G SET
Sbjct: 137 CSGSLCSDL--PSSQCS-------AGGAEC-----DYKYSYGLASDPHHYTQGYLGSETF 182
Query: 202 NLPNRIIPNFLVGCSVL---SSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTT 258
L + +P GC+ + +G+ G GRG SL SQLN+ FSYCL S D
Sbjct: 183 TLGSDAVPGIGFGCTTMSEGGYGSGSGLVGLGRGPLSLVSQLNVGAFSYCLTS----DAA 238
Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYY-VGLRRITVGGQRVRVW 317
+TS L+ +G+ G+ TP + S YYY V L I++G
Sbjct: 239 KTSPLLFGSGA----LTGAGVQSTPLLRT---------STYYYTVNLESISIGAA----- 280
Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
T G+ G I DSGTT F+A + LA E V + + N T A G + G
Sbjct: 281 ----TTAGTGSSGIIFDSGTTVAFLAEPAYT-LAKEAV--LSQTTNLTMASGRD---GYE 330
Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI- 436
CF G FP + LHF GG ++ LP ENYF V + S C V PS+
Sbjct: 331 VCFQTSGAV---FPSMVLHFDGG-DMDLPTENYFGAV-DDSVSCWIVQKS------PSLS 379
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
I+GN NY++ YD+ L F+ C
Sbjct: 380 IVGNIMQMNYHIRYDVEKSMLSFQPANCD 408
>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
Length = 500
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 128/393 (32%), Positives = 166/393 (42%), Gaps = 44/393 (11%)
Query: 83 SHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSS 142
+ G Y + GTP +LDTGS +VW C C+ C F P+ S S
Sbjct: 141 AQGSGEYFTKIGVGTPVTPALMVLDTGSDVVWLQCA---PCRRCYDQSGQMFDPRASHSY 197
Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETL 201
+ C P C + R K C Y V YG G +T G +ETL
Sbjct: 198 GAVDCAAPLCRRLDSGGCDLR---------RKACL-----YQVAYGDGSVTAGDFATETL 243
Query: 202 NLPNRI-IPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKF 254
+ +P +GC + AG+ G GRG S PSQ++ FSYCL+
Sbjct: 244 TFASGARVPRVALGCGHDNEGLFVAAAGLLGLGRGSLSFPSQISRRFGRSFSYCLVDRTS 303
Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
+ TS S + + ++TP V NP + +YYV L I+VGG RV
Sbjct: 304 SSASATSRSSTVTFGSGAVGPSAAASFTPMVKNPRM------ETFYYVQLMGISVGGARV 357
Query: 315 -RVWHKYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
V L LD G GG IVDSGT+ T +A + L D F + R L
Sbjct: 358 PGVAVSDLRLDPSTGRGGVIVDSGTSVTRLARPAYAALRDAFRAAAAGLR-----LSPGG 412
Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREAS 431
+ C+D+ G K P + +HF GGAE LP ENY V C TD
Sbjct: 413 FSLFDTCYDLSGLKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTD---- 468
Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
GG SII GN Q Q + V +D QRLGF + C
Sbjct: 469 GGVSII-GNIQQQGFRVVFDGDGQRLGFVPKGC 500
>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
Length = 461
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 127/404 (31%), Positives = 178/404 (44%), Gaps = 67/404 (16%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + L+ GTPP+ + LDTGS LVW C C+ C +P P SS+ L C
Sbjct: 92 YLVHLAVGTPPRPVALTLDTGSDLVWTQCA---PCRDCFHQGLPLLDPAASSTYAALPCG 148
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETL------ 201
P+C + S C ++ N + C +Y+ YG +T G ++
Sbjct: 149 APRCRALPFTS-----CGGGGRSSWGNGNRSC-AYIYHYGDKSVTVGEIATDRFTFGGDN 202
Query: 202 -----NLPNRIIPNFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLS 251
LP R GC V S + GIAGFGRG+ SLPSQLN+ FSYC S
Sbjct: 203 GDGDSRLPTR---RLTFGCGHFNKGVFQSNE-TGIAGFGRGRWSLPSQLNVTTFSYCFTS 258
Query: 252 HKFDDTTRTSSLILDNGS-------SHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGL 304
F+ SSL+ G+ SH+ + + TP + NPS Y++ L
Sbjct: 259 -MFES---KSSLVTLGGAPAAALLYSHAAHISGEVRTTPLLKNPSQPS------LYFLSL 308
Query: 305 RRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNY 364
+ I+VG R+ V L TI+DSG + T + ++E + EF +Q+
Sbjct: 309 KGISVGKTRLAVPEAKLR-------STIIDSGASITTLPEAVYEAVKAEFAAQV-----G 356
Query: 365 TRALGAEALTGLRPCFDVPGE---KTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVC 421
G + L CF +P + P L LH GA+ LP NY V + +A
Sbjct: 357 LPPTGVVEGSALDLCFALPVTALWRRPPVPSLTLHLD-GADWELPRGNY--VFEDLAARV 413
Query: 422 LTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
+ VV D A+ G ++GNFQ QN +V YDL N L F C
Sbjct: 414 MCVVLD--AAPGDQTVIGNFQQQNTHVVYDLENDWLSFAPARCD 455
>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 113/369 (30%), Positives = 175/369 (47%), Gaps = 48/369 (13%)
Query: 105 ILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRD 164
I+DTGS L W C C C + + P F P S S R + C + C + +
Sbjct: 80 IVDTGSDLSWVQCQ---PCNRCYNQQDPVFNPSKSPSYRTVLCNSLTCRSLQLATGNSGV 136
Query: 165 CNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSSR-- 221
C P C +Y+V YG G T G E LNL N + NF+ GC +
Sbjct: 137 CGSNP----PTC-----NYVVNYGDGSYTSGEVGMEHLNLGNTTVNNFIFGCGRKNQGLF 187
Query: 222 -QPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTT 277
+G+ G GR SL SQ++ FSYCL + + + SL++ G+S K TT
Sbjct: 188 GGASGLVGLGRTDLSLISQISPMFGGVFSYCLPTT---EAEASGSLVM-GGNSSVYKNTT 243
Query: 278 GLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGT 337
++YT ++NP + +Y++ L ITVGG V V DR I+DSGT
Sbjct: 244 PISYTRMIHNPLLP-------FYFLNLTGITVGG--VEVQAPSFGKDR-----MIIDSGT 289
Query: 338 TFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHF 397
+ + P +++ L EFV Q ++ A + L CF++ G + P++K++F
Sbjct: 290 VISRLPPSIYQALKAEFVKQ------FSGYPSAPSFMILDSCFNLSGYQEVKIPDIKMYF 343
Query: 398 KGGAEVTLPVEN-YFAVVGEGSAVCLTVVT-DREASGGPSIILGNFQMQNYYVEYDLRNQ 455
+G AE+ + V +++V + S VCL + + E G I+GN+Q +N + YD +
Sbjct: 344 EGSAELNVDVTGVFYSVKTDASQVCLAIASLPYEDEVG---IIGNYQQKNQRIIYDTKGS 400
Query: 456 RLGFKQQLC 464
LGF ++ C
Sbjct: 401 MLGFAEEAC 409
>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 119/384 (30%), Positives = 167/384 (43%), Gaps = 55/384 (14%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + + GTP Q + LDT + W PC+ C CSSS + F P SSSSR L C+
Sbjct: 88 YIVRANIGTPAQAMLVALDTSNDAAWIPCSG---CVGCSSSVL--FDPSKSSSSRTLQCE 142
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
P QC+ + SK+C + + YG E +TL L +I
Sbjct: 143 AP----------QCKQAPNPSCTVSKSC-----GFNMTYGGSAIEAYLTQDTLTLATDVI 187
Query: 209 PNFLVGC--SVLSSRQPA-GIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTTRTSS 262
PN+ GC + PA G+ G GRG SL SQ L FSYCL + K + + +
Sbjct: 188 PNYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNFSGSLR 247
Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
L N + + TP + NP R++ YYV L I VG + V + L
Sbjct: 248 LGPKN-------QPIRIKTTPLLKNP---RRSSL---YYVNLVGIRVGNKIVDIPTSALA 294
Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
D GTI DSGT +T + + + +EF + VKN N A +L G C+
Sbjct: 295 FDPATGAGTIFDSGTVYTRLVEPAYVAMRNEF-RRRVKNAN------ATSLGGFDTCY-- 345
Query: 383 PGEKTGS--FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
+GS FP + F G VTLP +N G+ CL + ++ +
Sbjct: 346 ----SGSVVFPSVTFMF-AGMNVTLPPDNLLIHSSAGNLSCLAMAAAPTNVNSVLNVIAS 400
Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
Q QN+ V D+ N RLG ++ C
Sbjct: 401 MQQQNHRVLIDVPNSRLGISRETC 424
>gi|297843774|ref|XP_002889768.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
gi|297335610|gb|EFH66027.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 117/389 (30%), Positives = 174/389 (44%), Gaps = 52/389 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + GTPPQ++ +LDT + VW PC+ C CS++ S+ S +
Sbjct: 103 GNYVVRARLGTPPQLMFMVLDTSNDAVWLPCSG---CSGCSNASTSFNTNSSSTYSTV-S 158
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIAL-SETLNLPN 205
C +C+ + C +P IC S+ YG + L +TL L
Sbjct: 159 CSTTQCT--QARGLTCPSSTPQP--------SIC-SFNQSYGGDSSFSANLVQDTLTLSP 207
Query: 206 RIIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHK---FDD 256
+IPNF GC + +S P G+ G GRG SL SQ L FSYCL S + F
Sbjct: 208 DVIPNFSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSG 267
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
+ + L + + YTP + NP R + YYV L ++VG +V V
Sbjct: 268 SLKLGLL----------GQPKSIRYTPLLRNP---RRPSL---YYVNLTGVSVGSVQVPV 311
Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
YLT D + GTI+DSGT T A ++E + DEF Q+ N +++ LGA
Sbjct: 312 DPVYLTFDSNSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQV--NGSFS-TLGA-----F 363
Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
CF E P++ LH ++ LP+EN G+ CL++ R+ +
Sbjct: 364 DTCFSADNENVT--PKITLHMT-SLDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLN 420
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
++ N Q QN + +D+ N R+G + C
Sbjct: 421 VIANLQQQNLRILFDVPNSRIGIAPEPCN 449
>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 124/390 (31%), Positives = 170/390 (43%), Gaps = 49/390 (12%)
Query: 83 SHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSS 142
S G Y L GTP + + +LDTGS +VW C C+ C S P F P+ S +
Sbjct: 136 SQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCA---PCRRCYSQSDPIFDPRKSKTY 192
Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETL 201
+ C +P C + R K C Y V YG G T G +ETL
Sbjct: 193 ATIPCSSPHCRRLDSAGCNTR---------RKTCL-----YQVSYGDGSFTVGDFSTETL 238
Query: 202 NLPNRIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFD 255
+ +GC + AG+ G G+GK S P Q KFSYCL+
Sbjct: 239 TFRRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRS-- 296
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV- 314
+++ SS++ N + + +TP ++NP + +YYVGL I+VGG RV
Sbjct: 297 ASSKPSSVVFGNAAVSRIAR-----FTPLLSNPKL------DTFYYVGLLGISVGGTRVP 345
Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
V LD+ GNGG I+DSGT+ T + + + D F V + R A +
Sbjct: 346 GVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAF---RVGAKTLKR---APDFS 399
Query: 375 GLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGP 434
CFD+ P + LHF+ GA+V+LP NY V C GG
Sbjct: 400 LFDTCFDLSNMNEVKVPTVVLHFR-GADVSLPATNYLIPVDTNGKFCFAFA---GTMGGL 455
Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
SII GN Q Q + V YDL + R+GF C
Sbjct: 456 SII-GNIQQQGFRVVYDLASSRVGFAPGGC 484
>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 429
Score = 135 bits (341), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 121/386 (31%), Positives = 176/386 (45%), Gaps = 53/386 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y I +S+G PPQ I+DTGS L W C CK C + F P S+S + LG
Sbjct: 88 GEYLIDISYGNPPQKSTAIVDTGSDLNWVQC---LPCKSCYETLSAKFDPSKSASYKTLG 144
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALS-ETLNLPN 205
C + C D P ++C C Y +YG G + ALS + + +
Sbjct: 145 CGS-------------NFCQDLPF---QSCAASC-QYDYMYGDGSSTSGALSTDDVTIGT 187
Query: 206 RIIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQLN---LDKFSYCLLSHKFDDTTR 259
IPN GC ++ + G+ G G+G SL SQL KFSYCL+ +T+
Sbjct: 188 GKIPNVAFGCGNSNLGTFAGAGGLVGLGKGPLSLVSQLGGTATKKFSYCLVPLG---STK 244
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
TS L + + + G+ YTP + N N + +YY L+ I+V G+ V
Sbjct: 245 TSPLYIGDST-----LAGGVAYTPMLTN------NNYPTFYYAELQGISVEGKAVNYPAN 293
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
+ G GG I+DSGTT T++ + F P+ V+ + Y A G + GL C
Sbjct: 294 TFDIAATGRGGLILDSGTTLTYLDVDAFNPM----VAALKAALPYPEADG--SFYGLEYC 347
Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
F G ++P + HF GA+V L +N F + CL + + S I G
Sbjct: 348 FSTAGVANPTYPTVVFHFN-GADVALAPDNTFIALDFEGTTCLAMASSTGFS-----IFG 401
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLCK 465
N Q N+ + +DL N+R+GFK C+
Sbjct: 402 NIQQLNHVIVHDLVNKRIGFKSANCE 427
>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
Length = 339
Score = 135 bits (341), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 115/383 (30%), Positives = 166/383 (43%), Gaps = 60/383 (15%)
Query: 95 FGTPPQIIPFILDTGSHLVWFPCTNHYQ-CKYCSSSKIPSFIP-KLSSSSRLLGCQNPKC 152
GTPP + L+ G+ L+W NH C P F P S C +PK
Sbjct: 1 MGTPPNPVKLKLENGNELIW----NHSNPSPECFEQAFPYFEPLTFSRGLPFASCGSPKF 56
Query: 153 SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFL 212
W + + D+ + T + G+G + +P
Sbjct: 57 -WPNQTCVYTYSYGDKSVTTGF----LEVDKFTFVGAGAS---------------VPGVA 96
Query: 213 VGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRT----SSLI 264
GC + ++ GIAGFGRG SLPSQL + FS+C TT T S+++
Sbjct: 97 FGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCF-------TTITGAIPSTVL 149
Query: 265 LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLD 324
LD + + TP + A+ A YY+ L+ ITVG R+ V L
Sbjct: 150 LDLPADLFSNGQGAVQTTPLI---QYAKNEANPTLYYLSLKGITVGSTRLPVPESAFAL- 205
Query: 325 RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPG 384
+G GGTI+DSGT+ T + P++++ + DEF +Q+ + TG CF P
Sbjct: 206 TNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQI------KLPVVPGNATGHYTCFSAPS 259
Query: 385 EKTGSFPELKLHFKGGAEVTLPVENYFAVVGE---GSAVCLTVVTDREASGGPSIILGNF 441
+ P+L LHF+ GA + LP ENY V + S +CL + E + I+GNF
Sbjct: 260 QAKPDVPKLVLHFE-GATMDLPRENYVFEVPDDAGNSIICLAINKGDETT-----IIGNF 313
Query: 442 QMQNYYVEYDLRNQRLGFKQQLC 464
Q QN +V YDL+N L F C
Sbjct: 314 QQQNMHVLYDLQNNMLSFVAAQC 336
>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
Length = 456
Score = 135 bits (341), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 128/412 (31%), Positives = 177/412 (42%), Gaps = 63/412 (15%)
Query: 75 TTTTTNISSHSYGG--YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIP 132
TT T +S G Y + L+ GTPPQ + +LDTGS L+W C C C + P
Sbjct: 86 TTPPTGVSVRPSGDLEYVVDLAIGTPPQPVSALLDTGSDLIWTQCA---PCASCLAQPDP 142
Query: 133 SFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-L 191
F P S+S + C CS I H + D CT Y YG G +
Sbjct: 143 LFAPGESASYEPMRCAGQLCSDILHHGCEMPD----------TCT-----YRYNYGDGTM 187
Query: 192 TEGIALSETLNLP----NRIIPNFL-VGC---SVLSSRQPAGIAGFGRGKTSLPSQLNLD 243
T G+ +E +R++ L GC +V S +GI GFGR SL SQL++
Sbjct: 188 TMGVYATERFTFTSSGGDRLMTVPLGFGCGSMNVGSLNNGSGIVGFGRNPLSLVSQLSIR 247
Query: 244 KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTG-LTYTPF---VNNPSVAERNAFSVY 299
+FSYCL S+ + R S+L+ + S TG + TP + NP+ +
Sbjct: 248 RFSYCLTSYG---SGRKSTLLFGSLSGGVYGDATGPVQTTPLLQSLQNPT---------F 295
Query: 300 YYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMV 359
YYV L +TVG +R+R+ L DG+GG IVDSGT T + + + F Q+
Sbjct: 296 YYVHLAGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALTLLPGAVLAEVVRAFRQQL- 354
Query: 360 KNRNYTRALGAEALTGLRPCFDVPGEKTGS-------FPELKLHFKGGAEVTLPVENYFA 412
A G G+ CF VP S P + HF+ A++ LP NY
Sbjct: 355 ---RLPFANGGNPEDGV--CFLVPAAWRRSSSTSQVPVPRMVFHFQ-DADLDLPRRNYVL 408
Query: 413 VVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+CL + SG +GN Q+ V YDL + L F C
Sbjct: 409 DDHRKGRLCLLLAD----SGDDGSTIGNLVQQDMRVLYDLEAETLSFAPAQC 456
>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 351
Score = 135 bits (341), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 124/393 (31%), Positives = 167/393 (42%), Gaps = 53/393 (13%)
Query: 83 SHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSS 142
S G Y + +S GTPPQ I+DTGS L W C C C P FIP SSS
Sbjct: 2 SAGSGEYVLQISLGTPPQQFSAIVDTGSDLCWVQCA---PCARCFEQPDPLFIPLASSSY 58
Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETL 201
C + C + + R+ CT Y YG G T G ET+
Sbjct: 59 SNASCTDSLCDALPRPTCSMRN----------TCT-----YSYSYGDGSNTRGDFAFETV 103
Query: 202 NLPNRIIPNFLVGCSVLSSRQPAG---IAGFGRGKTSLPSQLN---LDKFSYCLLSHKFD 255
L + GC AG + G G+G SLPSQLN FSYCL+
Sbjct: 104 TLNGSTLARIGFGCGHNQEGTFAGADGLIGLGQGPLSLPSQLNSSFTHIFSYCLVDQS-- 161
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
TT T S I ++ + + + +TP + N E N YYYVG+ I+VG +RV
Sbjct: 162 -TTGTFSPITFGNAAENSRAS----FTPLLQN----EDNP--SYYYVGVESISVGNRRVP 210
Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
+D +G GG I+DSGTT T+ F P+ E Q+ +Y A G
Sbjct: 211 TPPSAFRIDANGVGGVILDSGTTITYWRLAAFIPILAELRRQI----SYPEA--DPTPYG 264
Query: 376 LRPCFDVPGEKTGS--FPELKLHFKGGAEVTLPVENYFAVVGE-GSAVCLTVVTDREASG 432
L C+D+ S P + +H + +PV N + +V G VC + T + S
Sbjct: 265 LNLCYDISSVSASSLTLPSMTVHLT-NVDFEIPVSNLWVLVDNFGETVCTAMSTSDQFS- 322
Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
I+GN Q QN + D+ N R+GF C
Sbjct: 323 ----IIGNVQQQNNLIVTDVANSRVGFLATDCS 351
>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 122/396 (30%), Positives = 169/396 (42%), Gaps = 57/396 (14%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y I L+ GTPPQ + +LDTGS L+W C C C + P F P SSS + C
Sbjct: 103 YLIDLAIGTPPQPVSALLDTGSDLIWTQCA---PCASCLAQPDPLFAPAASSSYVPMRCS 159
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTE-GIALSETLNLPNRI 207
C+ I H S Q D CT Y YG G T G+ +E +
Sbjct: 160 GQLCNDILHHSCQRPD----------TCT-----YRYNYGDGTTTLGVYATERFTFASSS 204
Query: 208 IPNFLV----GCSVL---SSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRT 260
V GC + S +GI GFGR SL SQL++ +FSYCL + +TR
Sbjct: 205 GEKLSVPLGFGCGTMNVGSLNNGSGIVGFGRDPLSLVSQLSIRRFSYCLTPYT---STRK 261
Query: 261 SSLI---LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
S+L+ L +G D TG V + + +YYV +TVG +R+R+
Sbjct: 262 STLMFGSLSDGVFEGDDAATGQ-----VQTTRLLQSRQNPTFYYVPFTGVTVGTRRLRIP 316
Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
L DG+GG IVDSGT T + + F +Q+ +T + + G+
Sbjct: 317 LSAFALRPDGSGGVIVDSGTALTLFPAAVLTEVLRAFRAQL--RLPFTSSSSPD--DGV- 371
Query: 378 PCFDVPGEKTG---------SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR 428
CF P G S P + HF+ GA++ LP NY ++C+ +
Sbjct: 372 -CFATPMAAGGRRASAATVVSVPRMAFHFQ-GADLELPRRNYVLDDPRRGSLCILLAD-- 427
Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
SG +GNF Q+ V YDL + L F C
Sbjct: 428 --SGDSGATIGNFVQQDMRVLYDLEAETLSFAPAQC 461
>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
Length = 418
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 144/487 (29%), Positives = 200/487 (41%), Gaps = 112/487 (22%)
Query: 13 IFFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTT 72
+ ++I+ +L LS ++ L + S RA H+ + Q ++
Sbjct: 8 VLMLLAVTIYSCDSANLRLQLSHVDAGRGLTHWELLRRMAQRSKARATHLLSAQDQSGRG 67
Query: 73 TTTTTTTNISSHSYG----GYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSS 128
+ + N ++ G Y + L+ GTPPQ + LDTGS + W QCK C +
Sbjct: 68 RSASAPVNPGAYDDGFPFTEYLVHLAAGTPPQEVQLTLDTGSDITW------TQCKRCPA 121
Query: 129 SK-----IPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSY 183
S +P F P SSS L C +P C + C ND ATS+ C +Y
Sbjct: 122 SACFNQTLPLFDPSASSSFASLPCSSPAC----ETTPPCGGGND---ATSRPC-----NY 169
Query: 184 LVLYGSG-LTEGIALSETLNLPN-------RIIPNFLVGCS-----VLSSRQPAGIAGFG 230
+ YG G ++ G E + +P + GC V +S + GIAGFG
Sbjct: 170 SISYGDGSVSRGEIGREVFTFASGTGEGSSAAVPGLVFGCGHANRGVFTSNE-TGIAGFG 228
Query: 231 RGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSV 290
RG SLPSQL + FS+C TT T S K + L P V PS
Sbjct: 229 RGSLSLPSQLKVGNFSHCF-------TTITGS-----------KTSAVLLGLPGVAPPSA 270
Query: 291 AERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPL 350
+ +G RR G R R + R N SGT+ T + P + +
Sbjct: 271 SP---------LGRRR---GSYRCR------STPRSSN------SGTSITSLPPRTYRAV 306
Query: 351 ADEFVSQM---VKNRNYTRALGAEALTGLRPCFDVP--GEKTGSFPELKLHFKGGAEVTL 405
+EF +Q+ V N T CF P G K P + LHF+ GA + L
Sbjct: 307 REEFAAQVKLPVVPGNATDPF---------TCFSAPLRGPKP-DVPTMALHFE-GATMRL 355
Query: 406 PVENY-FAVVGEGSA------VCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLG 458
P ENY F VV + A +CL V+ E IILGN Q QN +V YDL+N +L
Sbjct: 356 PQENYVFEVVDDDDAGNSSRIICLAVIEGGE------IILGNIQQQNMHVLYDLQNSKLS 409
Query: 459 FKQQLCK 465
F C
Sbjct: 410 FVPAQCD 416
>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
Length = 464
Score = 135 bits (340), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 115/385 (29%), Positives = 161/385 (41%), Gaps = 52/385 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + G+PP ++D+GS ++W C C C + P F P S++ +
Sbjct: 125 GEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCK---PCLECYAQADPLFDPATSATFSAVP 181
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C + C +++ C D S C Y V YG G T+G ETL L
Sbjct: 182 CGSAVC-----RTLRTSGCGD-----SGGC-----DYEVSYGDGSYTKGALALETLTLGG 226
Query: 206 RIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDK---FSYCLLSHKFDDTTR 259
+ +GC + AG+ G G G SL QL FSYCL S
Sbjct: 227 TAVEGVAIGCGHRNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRG------ 280
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
SL+L S+ G + P V NP +YYVGL I VG +R+ +
Sbjct: 281 AGSLVL----GRSEAVPEGAVWVPLVRNPQAPS------FYYVGLSGIGVGDERLPLQED 330
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
L DG GG ++D+GT T + E + L D FV+ + RA G L C
Sbjct: 331 LFQLTEDGAGGVVMDTGTAVTRLPQEAYAALRDAFVAAV---GALPRAPGVSLLD---TC 384
Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
+D+ G + P + +F G A +TLP N V +G CL +S GPS ILG
Sbjct: 385 YDLSGYTSVRVPTVSFYFDGAATLTLPARNLLLEV-DGGIYCLAFA---PSSSGPS-ILG 439
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
N Q + + D N +GF C
Sbjct: 440 NIQQEGIQITVDSANGYIGFGPTTC 464
>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
Length = 459
Score = 135 bits (340), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 119/397 (29%), Positives = 168/397 (42%), Gaps = 54/397 (13%)
Query: 89 YSISLSFGTPPQIIPFIL--DTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
Y + L+ GTPP +PF+ DTGS L W C CK C P + S+S +
Sbjct: 95 YLMELAIGTPP--VPFVALADTGSDLTWTQCK---PCKLCFPQDTPIYDTAASASFSPVP 149
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNL-- 203
C + C I S C P Y Y G + G+ +ETL
Sbjct: 150 CASATCLPIWRSSRNCTATTTSPC-----------RYRYAYDDGAYSAGVLGTETLTFAG 198
Query: 204 -------PNRIIPNFLVGCSVLS---SRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHK 253
P + GC V + S G G GRG SL +QL + KFSYCL
Sbjct: 199 SSPGAPGPGVSVGGVAFGCGVDNGGLSYNSTGTVGLGRGSLSLVAQLGVGKFSYCL--TD 256
Query: 254 FDDTTRTSSLILDNGSSHSDKKTTG---LTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
F +T+ S ++ + + + T G + TP V P R YYV L I++G
Sbjct: 257 FFNTSLGSPVLFGSLAELAAPSTIGGAAVQSTPLVQGPYNPSR------YYVSLEGISLG 310
Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
R+ + + L DG+GG IVDSGT FT + F + + V N+ A
Sbjct: 311 DARLPIPNGTFDLRDDGSGGMIVDSGTIFTVLVESAFRVVVNHVAG--VLNQPVVNASSL 368
Query: 371 EALTGLRPCFDVPG--EKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR 428
++ PCF ++ P++ LHF GGA++ L +NY + E S+ CL +
Sbjct: 369 DS-----PCFPATAGEQQLPDMPDMLLHFAGGADMRLHRDNYMSFNQESSSFCLNIAGAP 423
Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
A G ILGNFQ QN + +D+ +L F C
Sbjct: 424 SAYGS---ILGNFQQQNIQMLFDITVGQLSFVPTDCS 457
>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
Length = 471
Score = 135 bits (339), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 120/424 (28%), Positives = 185/424 (43%), Gaps = 68/424 (16%)
Query: 60 LHIKNPQTKTTTTTTTTTTTNI------SSHSYGGYSISLSFGTPPQIIPFILDTGSHLV 113
L +K+ + K + ++TT N ++H GGY++++ GTP + + DTGS L
Sbjct: 97 LRVKSIRAKHSMNSSTTGVFNEMKTRVPTTHFGGGYAVTVGLGTPKKDFSLLFDTGSDLT 156
Query: 114 WFPCTNHYQCKYCSSSKIP----SFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEP 169
W QC+ CS P F P S+S + L C + C I ES Q
Sbjct: 157 W------TQCEPCSGGCFPQNDEKFDPTKSTSYKNLSCSSEPCKSIGKESAQ-------G 203
Query: 170 LATSKNCTQICPSYLVLYGSGLTEGIALSETLNL-PNRIIPNFLVGCSVLSSRQ---PAG 225
++S +C Y V YG+G T G +ETL + P+ + NF++GC + + AG
Sbjct: 204 CSSSNSCL-----YGVKYGTGYTVGFLATETLTITPSDVFENFVIGCGERNGGRFSGTAG 258
Query: 226 IAGFGRGKTSLPSQLN---LDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYT 282
+ G GR +LPSQ + + FSYCL + ++ T L G S + K +T
Sbjct: 259 LLGLGRSPVALPSQTSSTYKNLFSYCLPA----SSSSTGHLSFGGGVSQAAK------FT 308
Query: 283 PFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFM 342
P + Y + + I+VGG+++ + GTI+DSGTT T++
Sbjct: 309 PITSK--------IPELYGLDVSGISVGGRKLPIDPSVFR-----TAGTIIDSGTTLTYL 355
Query: 343 APELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTG--SFPELKLHFKGG 400
L+ F M NYT G +GL+PC+D + P++ + F+GG
Sbjct: 356 PSTAHSALSSAFQEMMT---NYTLTKGT---SGLQPCYDFSKHANDNITIPQISIFFEGG 409
Query: 401 AEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFK 460
EV + F VCL + + I GN Q + Y V YD+ +GF
Sbjct: 410 VEVDIDDSGIFIAANGLEEVCLAFKDNGNDT--DVAIFGNVQQKTYEVVYDVAKGMVGFA 467
Query: 461 QQLC 464
C
Sbjct: 468 PGGC 471
>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
Length = 373
Score = 135 bits (339), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 127/397 (31%), Positives = 180/397 (45%), Gaps = 61/397 (15%)
Query: 88 GYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKI-------PSFIPKLSS 140
G+S+++ P ++I +DTGS L+W QCK SS+ P + P SS
Sbjct: 15 GHSLTVGIVQPRKLI---VDTGSDLIW------TQCKLSSSTAAAARHGSPPVYDPGESS 65
Query: 141 SSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKN-CTQICPSYLVLYGSGLTEGIALSE 199
+ L C + C ++C TSKN C Y +YGS G+ SE
Sbjct: 66 TFAFLPCSDRLC---QEGQFSFKNC------TSKNRCV-----YEDVYGSAAAVGVLASE 111
Query: 200 TLNLPNRIIPNFLVG--CSVLSSRQ---PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKF 254
T R + +G C LS+ GI G SL +QL + +FSYCL F
Sbjct: 112 TFTFGARRAVSLRLGFGCGALSAGSLIGATGILGLSPESLSLITQLKIQRFSYCLT--PF 169
Query: 255 DDTTRTSSLILDNGSSHSDKKTTG-LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
D +TS L+ + S KTT + T V+NP +VYYYV L I++G +R
Sbjct: 170 ADK-KTSPLLFGAMADLSRHKTTRPIQTTAIVSNP------VETVYYYVPLVGISLGHKR 222
Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
+ V L + DG GGTIVDSG+T ++ FE + E V +V+ R + L
Sbjct: 223 LAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAV-KEAVMDVVRLPVANRTVEDYEL 281
Query: 374 TGLRPCFDVPGEKTGS------FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTD 427
CF +P + P L LHF GGA + LP +NYF G +CL V
Sbjct: 282 -----CFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDNYFQEPRAG-LMCLAVGKT 335
Query: 428 REASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ SG I+GN Q QN +V +D+++ + F C
Sbjct: 336 TDGSG--VSIIGNVQQQNMHVLFDVQHHKFSFAPTQC 370
>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
Length = 458
Score = 134 bits (338), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 127/461 (27%), Positives = 196/461 (42%), Gaps = 44/461 (9%)
Query: 24 SSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISS 83
S+ T+ L H P Q+L+S + Q T++ + + SS
Sbjct: 19 STSTTEYLKLPLLHKTPFPTPLQSLSSDLQRLSLLHHSHHRHQNHRRTSSKSPLMSGASS 78
Query: 84 HSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIP--SFIPKLSSS 141
S G Y +S+ G+PPQ + + DTGS L W C+ CK S P +F+ + S++
Sbjct: 79 GS-GQYFVSIRLGSPPQTLLLVADTGSDLTWVRCS---ACKTNCSIHPPGSTFLARHSTT 134
Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSET 200
C + C + + CN L ++ Y +Y G T G ET
Sbjct: 135 FSPTHCFSSLCQLVPQPNPN--PCNHTRLHSTCR-------YEYVYSDGSKTSGFFSKET 185
Query: 201 LNL-----PNRIIPNFLVGCSVLSS---------RQPAGIAGFGRGKTSLPSQLNLD--- 243
L + + GC +S +G+ G GRG S SQL
Sbjct: 186 TTLNTSSGREMKLKSIAFGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQLGRRFGR 245
Query: 244 KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVG 303
FSYCLL + + +I D S+ D K+ +++TP + NP +YY+
Sbjct: 246 SFSYCLLDYTLSPPPTSYLMIGDVVSTKKDNKSM-MSFTPLLINPEAP------TFYYIS 298
Query: 304 LRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRN 363
++ + V G ++ + +LD GNGGT++DSGTT TF+ + + F + VK +
Sbjct: 299 IKGVFVDGVKLHIDPSVWSLDELGNGGTVIDSGTTLTFLTEPAYREILSAFKRE-VKLPS 357
Query: 364 YTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLT 423
T GA +G C +V G FP L L G + + P NYF + EG CL
Sbjct: 358 PTPG-GASTRSGFDLCVNVTGVSRPRFPRLSLELGGESLYSPPPRNYFIDISEG-IKCL- 414
Query: 424 VVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ EA G ++GN Q + +E+D RLGF ++ C
Sbjct: 415 AIQPVEAESGRFSVIGNLMQQGFLLEFDRGKSRLGFSRRGC 455
>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 506
Score = 134 bits (338), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 136/440 (30%), Positives = 191/440 (43%), Gaps = 60/440 (13%)
Query: 43 DSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQII 102
D+ +L S+ R P+ + T + + S G Y + + GTPP+
Sbjct: 106 DTMHRRAALSGSAAAR--RDSAPRRALSERVVATVESGVPVGS-GEYLVDVYLGTPPRRF 162
Query: 103 PFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWI----HHE 158
I+DTGS L W C C C P F P S S R + C + +C +
Sbjct: 163 RMIMDTGSDLNWLQCA---PCLDCFEQSGPIFDPAASISYRNVTCGDDRCRLVSPPAESA 219
Query: 159 SIQCRDCNDEPLATSKNCTQICPSYLVLYG--SGLTEGIALSE-TLNLPN---RIIPNFL 212
+CR +P CP Y YG S T +AL T+NL R +
Sbjct: 220 PRECRRPRSDP----------CP-YYYWYGDQSNTTGDLALEAFTVNLTQSGTRRVDGVA 268
Query: 213 VGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL----DKFSYCLLSHKFDDTTRTSSLIL 265
GC + AG+ G GRG S SQL FSYCL+ H + S +I
Sbjct: 269 FGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRGVYGGHAFSYCLVEHG---SAAGSKIIF 325
Query: 266 DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
H D L P +N + A +YY+ L+ I VGG+ V + L+
Sbjct: 326 ----GHDD----ALLAHPQLNYTAFAPTTDADTFYYLQLKSILVGGEAVNISSDTLS--- 374
Query: 326 DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGE 385
GGTI+DSGTT ++ ++ + F+ +M + +Y LG L+ PC++V G
Sbjct: 375 --AGGTIIDSGTTLSYFPEPAYQAIRQAFIDRM--SPSYPLILGFPVLS---PCYNVSGA 427
Query: 386 KTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREASGGPSIILGNFQMQ 444
+ PEL L F GA P ENYF + +CL V+ T R G SII GN+Q Q
Sbjct: 428 EKVEVPELSLVFADGAAWEFPAENYFIRLEPEGIMCLAVLGTPRS---GMSII-GNYQQQ 483
Query: 445 NYYVEYDLRNQRLGFKQQLC 464
N++V YDL + RLGF + C
Sbjct: 484 NFHVLYDLEHNRLGFAPRRC 503
>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
Length = 443
Score = 134 bits (338), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 119/391 (30%), Positives = 171/391 (43%), Gaps = 53/391 (13%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + L+ GTP + + LDTGS LVW C C+ C +P P SS+ L C
Sbjct: 84 YLVRLAVGTPRRPVALTLDTGSDLVWTQCA---PCRDCFDQDLPVLDPAASSTYAALPCG 140
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPNR- 206
+C + S C L ++C Y YG LT G ++ +
Sbjct: 141 AARCRALPFTS-----CGVRTLGNHRSCI-----YAYHYGDKSLTVGEIATDRFTFGDSG 190
Query: 207 ------IIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
GC L+ GIAGFGRG+ SLPSQLN+ FSYC S F+
Sbjct: 191 GSGESLHTRRLTFGCGHLNKGVFQSNETGIAGFGRGRWSLPSQLNVTSFSYCFTS-MFES 249
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
+ +L + +S + + TP + NPS Y++ L+ I+VG R+ V
Sbjct: 250 KSSLVTLGGSPAALYSHAHSGEVRTTPILKNPSQPS------LYFLSLKGISVGKTRLPV 303
Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
TI+DSG + T + E++E + EF +Q+ + G E + L
Sbjct: 304 PETKFR-------STIIDSGASITTLPEEVYEAVKAEFAAQVGLPPS-----GVEG-SAL 350
Query: 377 RPCFDVPGE---KTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
CF +P + + P L LH + GA+ LP NY V + A + +V D A+ G
Sbjct: 351 DLCFALPVTALWRRPAVPSLTLHLE-GADWELPRSNY--VFEDLGARVMCIVLD--AAPG 405
Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
++GNFQ QN +V YDL N RL F C
Sbjct: 406 EQTVIGNFQQQNTHVVYDLENDRLSFAPARC 436
>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
Length = 496
Score = 134 bits (337), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 118/405 (29%), Positives = 180/405 (44%), Gaps = 54/405 (13%)
Query: 84 HSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSR 143
Y +S+ L G+ + + I+DTGS V C S P F P S S R
Sbjct: 95 EDYALFSMQLGIGSLQKNLSAIIDTGSEAVLVQC---------GSRSRPVFDPAASQSYR 145
Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEP-LATSKNCTQICPSYLVLYGSG------LTEGIA 196
+ C + C + Q + + +P + +S CT Y + YG ++ +
Sbjct: 146 QVPCISQLCLAVQQ---QTSNGSSQPCVNSSATCT-----YSLSYGDSRNSTGDFSQDVI 197
Query: 197 LSETLNLPNRIIP--NFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQLNL----DKF 245
+ N + + + GC+ L GI GF RG SLPSQL KF
Sbjct: 198 FLNSTNSSGQAVQFRDVAFGCAHSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKF 257
Query: 246 SYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLR 305
SYC S + + D+G S S + YTP ++NP R S YYVGL
Sbjct: 258 SYCFPSQPWQPRATGVIFLGDSGLSKSK-----VGYTPLLDNPVTPAR---SQLYYVGLT 309
Query: 306 RITVGGQRVRVWHKYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNY 364
I+V G+ + + LD G+GGT++DSGTTFT + + + + F + NR+
Sbjct: 310 SISVDGKTLAIPESAFKLDPSTGDGGTVLDSGTTFTRVVDDAYTAFRNAFAA---SNRSG 366
Query: 365 TRA-LGAEALTGLRPCFDV-PGEKTGSFPELKLHFKGGAEVTLPVENYFAVV---GEGSA 419
R +GA A G C+++ G PE++L + + L E+ F V G
Sbjct: 367 LRKKVGAAA--GFDDCYNISAGSSLPGVPEVRLSLQNNVRLELRFEHLFVPVSAAGNEVT 424
Query: 420 VCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
VCL +++ +++ G +LGN+Q NY VEYD R+GF++ C
Sbjct: 425 VCLAILSSQKSGFGKINVLGNYQQSNYLVEYDNERSRVGFERADC 469
>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 131/419 (31%), Positives = 188/419 (44%), Gaps = 50/419 (11%)
Query: 54 SSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLV 113
+SL A+ N +T+ +++ T+ + G Y L GTP + + +LDTGS +V
Sbjct: 113 TSLAAAVGSTN-RTRARGPGFSSSVTSGLAQGSGEYFTRLGVGTPARYVFMVLDTGSDVV 171
Query: 114 WFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATS 173
W C CK C S P F P S S + C +P C + D P ++
Sbjct: 172 WIQCA---PCKKCYSQTDPVFNPTKSRSFANIPCGSPLCRRL-----------DSPGCST 217
Query: 174 KNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSSR---QPAGIAGF 229
K IC Y V YG G T G +ETL + +GC + AG+ G
Sbjct: 218 KK--HIC-LYQVSYGDGSFTYGEFSTETLTFRGTRVGRVALGCGHDNEGLFIGAAGLLGL 274
Query: 230 GRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVN 286
GRG+ S PSQ+ KFSYCL+ +++ S ++ + + + +TP V+
Sbjct: 275 GRGRLSFPSQIGRRFSRKFSYCLVDRS--ASSKPSYMVFGDSAISRTAR-----FTPLVS 327
Query: 287 NPSVAERNAFSVYYYVGLRRITVGGQRV-RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPE 345
NP + +YYV L ++VGG RV + LD GNGG I+DSGT+ T +
Sbjct: 328 NPKL------DTFYYVELLGVSVGGTRVPGITASLFKLDSTGNGGVIIDSGTSVTRLTRP 381
Query: 346 LFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTL 405
+ L D F V N R A + CFD+ G+ P + LHF+ GA+V+L
Sbjct: 382 AYVALRDAF---RVGASNLKR---APEFSLFDTCFDLSGKTEVKVPTVVLHFR-GADVSL 434
Query: 406 PVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
P NY V + C G SI+ GN Q Q + V YDL R+GF + C
Sbjct: 435 PASNYLIPVDNSGSFCFAFAGTMS---GLSIV-GNIQQQGFRVVYDLAASRVGFAPRGC 489
>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
Length = 425
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 119/384 (30%), Positives = 168/384 (43%), Gaps = 55/384 (14%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + + GTP Q + LDT + W PC+ C CSSS + F P SSSSR L C+
Sbjct: 88 YIVRANIGTPAQPMLVALDTSNDAAWIPCSG---CVGCSSSVL--FDPSKSSSSRTLQCE 142
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
P QC+ + SK+C + + YG E +TL L + +I
Sbjct: 143 AP----------QCKQAPNPSCTVSKSC-----GFNMTYGGSTIEAYLTQDTLTLASDVI 187
Query: 209 PNFLVGC--SVLSSRQPA-GIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTTRTSS 262
PN+ GC + PA G+ G GRG SL SQ L FSYCL + K + + +
Sbjct: 188 PNYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNFSGSLR 247
Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
L N + + TP + NP R++ YYV L I VG + V + L
Sbjct: 248 LGPKN-------QPIRIKTTPLLKNP---RRSSL---YYVNLVGIRVGNKIVDIPTSALA 294
Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
D GTI DSGT +T + + + +EF + VKN N A +L G C+
Sbjct: 295 FDPATGAGTIFDSGTVYTRLVEPAYVAVRNEF-RRRVKNAN------ATSLGGFDTCY-- 345
Query: 383 PGEKTGS--FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
+GS FP + F G VTLP +N G+ CL + ++ +
Sbjct: 346 ----SGSVVFPSVTFMF-AGMNVTLPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIAS 400
Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
Q QN+ V D+ N RLG ++ C
Sbjct: 401 MQQQNHRVLIDVPNSRLGISRETC 424
>gi|242085924|ref|XP_002443387.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
gi|241944080|gb|EES17225.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
Length = 460
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 125/399 (31%), Positives = 180/399 (45%), Gaps = 60/399 (15%)
Query: 95 FGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSW 154
G PPQ I+DTGS+L+W C+ + C + + P S +++ + C + C
Sbjct: 90 IGDPPQQAAAIIDTGSNLIWTQCST-CRANGCFGQDLTFYDPSRSRTAKPVACNDTAC-L 147
Query: 155 IHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL------PNRII 208
+ E+ RD K C + L YG+G G +E N +
Sbjct: 148 LGSETRCARD--------GKAC-----AVLTAYGAGAIGGFLGTEVFTFGHGQSSENNV- 193
Query: 209 PNFLVGCSVLSSRQP------AGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSS 262
+ GC S P +GI G GRGK SLPSQL +KFSYCL + F D TS+
Sbjct: 194 -SLAFGCITASRLTPGSLDGASGIIGLGRGKLSLPSQLGDNKFSYCLTPY-FSDAANTST 251
Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
L + + S T PF+ NP + + F +YY+ L ITVG ++ V
Sbjct: 252 LFVGASAGLSGGGAPA-TSVPFLKNP---DDDPFDSFYYLPLTGITVGTAKLDVPAAAFD 307
Query: 323 LDRDGN---GGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
L GGT++DSG+ FT + ++ L DE V Q+ + A GAE GL C
Sbjct: 308 LREVAPAKWGGTLIDSGSPFTSLIDVAYQALRDELVRQLGASVVPPPA-GAE---GLDLC 363
Query: 380 FD--VPGEKTGSFPELKLHF----KGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
PG+ P L LHF GG +V +P ENY+ V + +A C+ V + SGG
Sbjct: 364 VGGVAPGDAGKLVPPLVLHFGSGGGGGGDVVVPPENYWGPVDDSTA-CMVVFS----SGG 418
Query: 434 P--------SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
P + I+GN+ Q+ ++ YDL L F+ C
Sbjct: 419 PNSTLPLNETTIIGNYMQQDMHLLYDLGQGVLSFQPADC 457
>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
Length = 456
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 127/395 (32%), Positives = 177/395 (44%), Gaps = 64/395 (16%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + ++ GTPP + I DTGS LVW C+++ S + F P S++ LL CQ
Sbjct: 100 YLMYVNVGTPPAQMLAIADTGSDLVWVNCSSNGGGGGASDGAV-VFHPSRSTTYSLLSCQ 158
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRI 207
+ C + S D + E C Y YG G T G+ +ET +
Sbjct: 159 SAACQALSQASC---DADSE-------C-----QYQYAYGDGSRTIGVLSTETFSFAAAG 203
Query: 208 --------IPNFLVGCSVLS--SRQPAGIAGFGRGKTSLPSQLNLD-----KFSYCLLSH 252
+P GCS S S + G+ G G G SL SQL +FSYCL+
Sbjct: 204 GGGEGQVRVPRVSFGCSTGSAGSFRSDGLVGLGAGALSLVSQLGAAARIARRFSYCLVP- 262
Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
+ +S+L + SD G TP V PS + YY V L + V GQ
Sbjct: 263 PYAAANSSSTLSFGARAVVSDP---GAASTPLV--PSEVDS-----YYTVALESVAVAGQ 312
Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
V + + IVDSGTT TF+ P L PL V+++ + RA E
Sbjct: 313 DVASAN---------SSRIIVDSGTTLTFLDPALLRPL----VAELERRIRLPRAQPPEQ 359
Query: 373 LTGLRPCFDVPGEKTGS---FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDRE 429
L L+ C+DV G+ P++ L F GGA VTL EN F+++ EG+ +CL +V E
Sbjct: 360 L--LQLCYDVQGKSQAEDFGIPDVTLRFGGGASVTLRPENTFSLLEEGT-LCLVLVPVSE 416
Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ P ILGN QN++V YDL + + F C
Sbjct: 417 SQ--PVSILGNIAQQNFHVGYDLDARTVTFAAVDC 449
>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 119/384 (30%), Positives = 168/384 (43%), Gaps = 55/384 (14%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + + GTP Q + LDT + W PC+ C CSSS + F P SSSSR L C+
Sbjct: 88 YIVRANIGTPAQPMLVALDTSNDAAWIPCSG---CVGCSSSVL--FDPSKSSSSRTLQCE 142
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
P QC+ + SK+C + + YG E +TL L + +I
Sbjct: 143 AP----------QCKQAPNPSCTVSKSC-----GFNMTYGGSTIEAYLTQDTLTLASDVI 187
Query: 209 PNFLVGC--SVLSSRQPA-GIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTTRTSS 262
PN+ GC + PA G+ G GRG SL SQ L FSYCL + K + + +
Sbjct: 188 PNYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNFSGSLR 247
Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
L N + + TP + NP R++ YYV L I VG + V + L
Sbjct: 248 LGPKN-------QPIRIKTTPLLKNP---RRSSL---YYVNLVGIRVGNKIVDIPTSALA 294
Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
D GTI DSGT +T + + + +EF + VKN N A +L G C+
Sbjct: 295 FDPATGAGTIFDSGTVYTRLVEPAYVAVRNEF-RRRVKNAN------ATSLGGFDTCY-- 345
Query: 383 PGEKTGS--FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
+GS FP + F G VTLP +N G+ CL + ++ +
Sbjct: 346 ----SGSVVFPSVTFMF-AGMNVTLPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIAS 400
Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
Q QN+ V D+ N RLG ++ C
Sbjct: 401 MQQQNHRVLIDVPNSRLGISRETC 424
>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 535
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 119/393 (30%), Positives = 173/393 (44%), Gaps = 43/393 (10%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + G+PP+ ILDTGS L W C C C + PK S+S + +
Sbjct: 168 GEYFMDVLVGSPPKHFSLILDTGSDLNWIQC---LPCYDCFQQNGAFYDPKASASYKNIT 224
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSET--LNLP 204
C + +C+ + + +P K+ Q CP Y S T G ET +NL
Sbjct: 225 CNDQRCNLVS---------SPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLT 275
Query: 205 NR-------IIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLS 251
+ N + GC + AG+ G GRG S SQL FSYCL+
Sbjct: 276 TNGGSSELYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVD 335
Query: 252 HKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG 311
DT +S LI G L +T FV + N +YYV ++ I V G
Sbjct: 336 RN-SDTNVSSKLIF--GEDKDLLSHPNLNFTSFV----AGKENLVDTFYYVQIKSILVAG 388
Query: 312 QRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
+ + + + + DG GGTI+DSGTT ++ A EP A EF+ + + +
Sbjct: 389 EVLNIPEETWNISSDGAGGTIIDSGTTLSYFA----EP-AYEFIKNKIAEKAKGKYPVYR 443
Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREAS 431
L PCF+V G PEL + F GA P EN F + E VCL ++ +++
Sbjct: 444 DFPILDPCFNVSGIHNVQLPELGIAFADGAVWNFPTENSFIWLNE-DLVCLAMLGTPKSA 502
Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN+Q QN+++ YD + RLG+ C
Sbjct: 503 FS---IIGNYQQQNFHILYDTKRSRLGYAPTKC 532
>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
Length = 485
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 119/388 (30%), Positives = 166/388 (42%), Gaps = 60/388 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y +S+ GTP + I DTGS L W C C C + P F P LSS+ +
Sbjct: 147 GNYVVSVGLGTPAKQYAVIFDTGSDLSWVQCK---PCADCYEQQDPLFDPSLSSTYAAVA 203
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNL-P 204
C P+C + + C+ + C Y V YG T+G + +TL L
Sbjct: 204 CGAPEC-----QELDASGCSSD-----SRCR-----YEVQYGDQSQTDGNLVRDTLTLSA 248
Query: 205 NRIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTT 258
+ +P F+ GC ++ Q G+ G GR K SLPSQ F+YCL S
Sbjct: 249 SDTLPGFVFGCGDQNAGLFGQVDGLFGLGREKVSLPSQGAPSYGPGFTYCLPS------- 301
Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVA-ERNAFSVYYYVGLRRITVGGQRVRVW 317
S S + L P N A A +YY+ L I VGG+ +R+
Sbjct: 302 -----------SSSGRGYLSLGGAPPANAQFTALADGATPSFYYIDLVGIKVGGRAIRIP 350
Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
T++DSGT T + P + PL F M + + A AL+ L
Sbjct: 351 ATAFAAAGG----TVIDSGTVITRLPPRAYAPLRAAFARSMAQYKK------APALSILD 400
Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI- 436
C+D G +T P ++L F GGA V+L V + S CL + + S SI
Sbjct: 401 TCYDFTGHRTAQIPTVELAFAGGATVSLDFTGVL-YVSKVSQACLAFAPNADDS---SIA 456
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
ILGN Q + + V YD+ NQR+GF + C
Sbjct: 457 ILGNTQQKTFAVAYDVANQRIGFGAKGC 484
>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 125/390 (32%), Positives = 171/390 (43%), Gaps = 49/390 (12%)
Query: 83 SHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSS 142
S G Y L GTPP+ + +LDTGS +VW C C C S F P S S
Sbjct: 124 SQGSGEYFTRLGVGTPPKYLYMVLDTGSDVVWLQCK---PCTKCYSQTDQIFDPSKSKSF 180
Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETL 201
+ C +P C + D P + KN +C Y V YG G T G +ETL
Sbjct: 181 AGIPCYSPLCRRL-----------DSPGCSLKN--NLC-QYQVSYGDGSFTFGDFSTETL 226
Query: 202 NLPNRIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFD 255
+P +GC + AG+ G GRG S P+Q +KFSYCL
Sbjct: 227 TFRRAAVPRVAIGCGHDNEGLFVGAAGLLGLGRGGLSFPTQTGTRFNNKFSYCLTDRT-- 284
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
+ + SS++ + + + +TP V NP + +YYV L I+VGG VR
Sbjct: 285 ASAKPSSIVFGDSAVSRTAR-----FTPLVKNPKL------DTFYYVELLGISVGGAPVR 333
Query: 316 -VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
+ + LD GNGG I+DSGT+ T + + L D F V + R A +
Sbjct: 334 GISASFFRLDSTGNGGVIIDSGTSVTRLTRPAYVSLRDAF---RVGASHLKR---APEFS 387
Query: 375 GLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGP 434
C+D+ G P + LHF+ GA+V+LP NY V + C G
Sbjct: 388 LFDTCYDLSGLSEVKVPTVVLHFR-GADVSLPAANYLVPVDNSGSFCFAFA---GTMSGL 443
Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
SII GN Q Q + V +DL R+GF + C
Sbjct: 444 SII-GNIQQQGFRVVFDLAGSRVGFAPRGC 472
>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
Length = 485
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 119/388 (30%), Positives = 166/388 (42%), Gaps = 60/388 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y +S+ GTP + I DTGS L W C C C + P F P LSS+ +
Sbjct: 147 GNYVVSVGLGTPAKQYAVIFDTGSDLSWVQCK---PCADCYEQQDPLFDPSLSSTYAAVA 203
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNL-P 204
C P+C + + C+ + C Y V YG T+G + +TL L
Sbjct: 204 CGAPEC-----QELDASGCSSD-----SRCR-----YEVQYGDQSQTDGNLVRDTLTLSA 248
Query: 205 NRIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTT 258
+ +P F+ GC ++ Q G+ G GR K SLPSQ F+YCL S
Sbjct: 249 SDTLPGFVFGCGDQNAGLFGQVDGLFGLGREKVSLPSQGAPSYGPGFTYCLPS------- 301
Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVA-ERNAFSVYYYVGLRRITVGGQRVRVW 317
S S + L P N A A +YY+ L I VGG+ +R+
Sbjct: 302 -----------SSSGRGYLSLGGAPPANAQFTALADGATPSFYYIDLVGIKVGGRAIRIP 350
Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
T++DSGT T + P + PL F M + + A AL+ L
Sbjct: 351 ATAFAAAGG----TVIDSGTVITRLPPRAYAPLRAAFARSMAQYKK------APALSILD 400
Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI- 436
C+D G +T P ++L F GGA V+L V + S CL + + S SI
Sbjct: 401 TCYDFTGHRTAQIPTVELAFAGGATVSLDFTGVL-YVSKVSQACLAFAPNADDS---SIA 456
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
ILGN Q + + V YD+ NQR+GF + C
Sbjct: 457 ILGNTQQKTFAVTYDVANQRIGFGAKGC 484
>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 120/385 (31%), Positives = 170/385 (44%), Gaps = 42/385 (10%)
Query: 91 ISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNP 150
+SL GTPPQ ILDTGS L W C K SS F P LSSS +L C +P
Sbjct: 84 VSLPIGTPPQTQQMILDTGSQLSWIQCHKKVPRKPPPSS---VFDPSLSSSFSVLPCNHP 140
Query: 151 KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLP-NRII 208
C D L TS + ++C Y Y G L EG + E + ++
Sbjct: 141 LCK---------PRIPDFTLPTSCDQNRLC-HYSYFYADGTLAEGNLVREKITFSRSQST 190
Query: 209 PNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDT-TRTSSLIL-D 266
P ++GC+ SS GI G G+ S SQ L KFSYC+ + + T T S L +
Sbjct: 191 PPLILGCAEESS-DAKGILGMNLGRLSFASQAKLTKFSYCVPTRQVRPGFTPTGSFYLGE 249
Query: 267 NGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRD 326
N +S + LT++ P N + Y V ++ I +G Q++ + D
Sbjct: 250 NPNSGGFRYINLLTFSQSQRMP-----NLDPLAYTVAMQGIRIGNQKLNIPISAFRPDPS 304
Query: 327 GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL------RPCF 380
G G T++DSG+ FT++ E + + +E V R +GA G CF
Sbjct: 305 GAGQTMIDSGSEFTYLVDEAYNKVREEVV----------RLVGARLKKGYVYGGVSDMCF 354
Query: 381 DVPGEKTGSF-PELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
+ + G + F G E+ + E A VG G V + E G S I+G
Sbjct: 355 NGNAIEIGRLIGNMVFEFDKGVEIVVEKERVLADVGGG--VHCVGIGRSEMLGAASNIIG 412
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
NF QN +VE+DL N+R+GF + C
Sbjct: 413 NFHQQNIWVEFDLANRRVGFGKADC 437
>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
Length = 395
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 126/412 (30%), Positives = 179/412 (43%), Gaps = 66/412 (16%)
Query: 82 SSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSS 141
SS G Y + L GTP + P I+DTGS L W C SS P + SSS
Sbjct: 20 SSIGSGQYFVELRVGTPAKKFPLIIDTGSDLTWIQCNPPNTTANSSSPPAPWYDKSSSSS 79
Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPS---YLVLYGS-GLTEGIAL 197
R + C + +C ++ P +C+ PS Y Y T GI
Sbjct: 80 YREIPCTDDECLFL-------------PAPIGSSCSIKSPSPCDYTYGYSDQSRTTGILA 126
Query: 198 SETLNLPNRI---------------IPNFLVGCSVLSSRQ----PAGIAGFGRGKTSLPS 238
ET+++ +R I N +GCS S +G+ G G+G SL +
Sbjct: 127 YETISMKSRKRSGKRAGNHKTRTIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLAT 186
Query: 239 QLNLDK----FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERN 294
Q FSYCL+ + + +S L++ G + K L +TP V NP
Sbjct: 187 QTRHTALGGIFSYCLVDY-LRGSNASSFLVM--GRTRWRK----LAHTPIVRNP------ 233
Query: 295 AFSVYYYVGLRRITVGGQRVR-VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADE 353
A +YYV + + V G+ V + +D DGN GTI DSGTT ++ L EP +
Sbjct: 234 AAQSFYYVNVTGVAVDGKPVDGIASSDWGIDGDGNKGTIFDSGTTLSY----LREPAYSK 289
Query: 354 FVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAV 413
+ + + RA E G C++V + G P+L + F+GGA + LP NY +
Sbjct: 290 VLGALNASIYLPRA--QEIPEGFELCYNVTRMEKG-MPKLGVEFQGGAVMELPWNNYMVL 346
Query: 414 VGEG-SAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
V E V L VT S ILGN Q++++EYDL R+GFK C
Sbjct: 347 VAENVQCVALQKVTTTNGSN----ILGNLLQQDHHIEYDLAKARIGFKWSPC 394
>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
Length = 367
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 131/404 (32%), Positives = 181/404 (44%), Gaps = 64/404 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y++ + G+PP+ I+DTGS LVW C C C S P + P SS+
Sbjct: 2 GAYTMEIELGSPPKKFNAIVDTGSDLVWIQCK---PCSQCYSQSDPIYDPSASST----- 53
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNL-- 203
CS +S+ C+ +++K C Y YG S T+G ETL L
Sbjct: 54 FAKTSCSTSSCQSLPASGCS----SSAKTCI-----YGYQYGDSSSTQGDFALETLTLRS 104
Query: 204 ---PNRIIPNFLVGCSVLSSRQ---PAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKF 254
++ PNF GC L+S AGI G G+GK SL +QL +KFSYCL+
Sbjct: 105 SGGSSKAFPNFQFGCGRLNSGSFGGAAGIVGLGQGKISLSTQLGSAINNKFSYCLVDFD- 163
Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR- 313
DD+++TS LI GSS S +G TP + N + S YY+VGL I+VGG++
Sbjct: 164 DDSSKTSPLIF--GSSAS--TGSGAISTPIIPN------SGRSTYYFVGLEGISVGGKQL 213
Query: 314 -----------VRVWHKYLTLDRDGN-GGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN 361
VR K + N GGTI DSGTT T + ++ + F S +
Sbjct: 214 SLATRAIDFLSVRSKKKLRVRALEVNSGGTIFDSGTTLTLLDDAVYSKVKSAFASSV--- 270
Query: 362 RNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAV- 420
+ + +G C+DV K FP L L FK G + + P +NYF +V V
Sbjct: 271 ---SLPTVDASSSGFDLCYDVSKSKNFKFPALTLAFK-GTKFSPPQKNYFVIVDTAETVA 326
Query: 421 CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
CL + I+GN QNY+V YD + C
Sbjct: 327 CLAMGGSGSLG---LGIIGNLMQQNYHVVYDRGTSTISMSPAQC 367
>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 477
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 121/416 (29%), Positives = 172/416 (41%), Gaps = 79/416 (18%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + LS GTPP+ + LDTGS LVW C C IP P SS+ + C
Sbjct: 94 YLVHLSVGTPPRPVALTLDTGSDLVWTQCAPCLNC--FDQGAIPVLDPAASSTHAAVRCD 151
Query: 149 NPKC-------------SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEG 194
P C SW + D+ + K S +G G +G
Sbjct: 152 APVCRALPFTSCGRGGSSWGERSCVYVYHYGDKSITVGK-----LASDRFTFGPGDNADG 206
Query: 195 IALSETLNLPNRIIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLL 250
+SE GC + GIAGFGRG+ SLPSQL + FSYC
Sbjct: 207 GGVSER---------RLTFGCGHFNKGIFQANETGIAGFGRGRWSLPSQLGVTSFSYCFT 257
Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTG-LTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
S F+ T+ +L G + ++ TG + TP + +PS Y++ L+ ITV
Sbjct: 258 S-MFESTSSLVTL----GVAPAELHLTGQVQSTPLLRDPSQPS------LYFLSLKAITV 306
Query: 310 GGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
G R+ + + L I+DSG + T + +++E + EFV+Q+ L
Sbjct: 307 GATRIPIPERRQRLR---EASAIIDSGASITTLPEDVYEAVKAEFVAQV--------GLP 355
Query: 370 AEALTG--LRPCFDVPGEKTGS-----------------FPELKLHFKGGAEVTLPVENY 410
A+ G L CF +P P L H GGA+ LP ENY
Sbjct: 356 VSAVEGSALDLCFALPSAAAPKSAFGWRWRGRGRAMPVRVPRLVFHLGGGADWELPRENY 415
Query: 411 FAVVGEGSAVCLTVVTDREASGGP-SIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
V + A + +V D GG ++++GN+Q QN +V YDL N L F C+
Sbjct: 416 --VFEDYGARVMCLVLDAATGGGDQTVVIGNYQQQNTHVVYDLENDVLSFAPARCE 469
>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
Length = 500
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 115/383 (30%), Positives = 164/383 (42%), Gaps = 48/383 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + G+P + + +LDTGS + W C C C P F P LS+S +
Sbjct: 161 GEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQ---PCADCYQQSDPVFDPSLSTSYASVA 217
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C NP+C +D A +N T C Y V YG G T G +ETL L +
Sbjct: 218 CDNPRC-------------HDLDAAACRNSTGAC-LYEVAYGDGSYTVGDFATETLTLGD 263
Query: 206 RI-IPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
+ + +GC + AG+ G G S PSQ++ FSYCL+ D+ +S
Sbjct: 264 SAPVSSVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISATTFSYCLVDR---DSPSSS 320
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
+L + + D + T P + +P S +YYVGL I+VGGQ + +
Sbjct: 321 TLQFGDAA---DAEVT----APLIRSPRT------STFYYVGLSGISVGGQILSIPPSAF 367
Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
+D G GG IVDSGT T + + L D FV ++ R G ++ C+D
Sbjct: 368 AMDGTGAGGVIVDSGTAVTRLQSSAYAALRDAFVR---GTQSLPRTSG---VSLFDTCYD 421
Query: 382 VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNF 441
+ + P + L F GG E+ LP +NY V CL A I+GN
Sbjct: 422 LSDRTSVEVPAVSLRFAGGGELRLPAKNYLIPVDGAGTYCLAFAPTNAAVS----IIGNV 477
Query: 442 QMQNYYVEYDLRNQRLGFKQQLC 464
Q Q V +D +GF C
Sbjct: 478 QQQGTRVSFDTAKSTVGFTSNKC 500
>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 485
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 124/391 (31%), Positives = 170/391 (43%), Gaps = 51/391 (13%)
Query: 83 SHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSS 142
S G Y L GTP + + +LDTGS +VW C C+ C S P F P+ S +
Sbjct: 136 SQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCA---PCRRCYSQSDPIFDPRKSKTY 192
Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETL 201
+ C +P C + R K C Y V YG G T G +ETL
Sbjct: 193 ATIPCSSPHCRRLDSAGCNTR---------RKTCL-----YQVSYGDGSFTVGDFSTETL 238
Query: 202 NLPNRIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFD 255
+ +GC + AG+ G G+GK S P Q KFSYCL+
Sbjct: 239 TFRRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRS-- 296
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV- 314
+++ SS++ N + + +TP ++NP + +YYV L I+VGG RV
Sbjct: 297 ASSKPSSVVFGNAAVSRIAR-----FTPLLSNPKL------DTFYYVELLGISVGGTRVP 345
Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG-AEAL 373
V LD+ GNGG I+DSGT+ T + + + D F R +AL A
Sbjct: 346 GVAASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAF-------RVGAKALKRAPDF 398
Query: 374 TGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
+ CFD+ P + LHF+ GA+V+LP NY V C GG
Sbjct: 399 SLFDTCFDLSNMNEVKVPTVVLHFR-GADVSLPATNYLIPVDTNGKFCFAFAG---TMGG 454
Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
SII GN Q Q + V YDL + R+GF C
Sbjct: 455 LSII-GNIQQQGFRVVYDLASSRVGFAPGGC 484
>gi|242069057|ref|XP_002449805.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
gi|241935648|gb|EES08793.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
Length = 430
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 120/390 (30%), Positives = 171/390 (43%), Gaps = 57/390 (14%)
Query: 89 YSISLSFGTPPQIIPFIL--DTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
Y + L+ GTPP +PFI DTGS L W C CK C P + SSS L
Sbjct: 83 YLMELAIGTPP--VPFIALADTGSDLTWTQCK---PCKLCFGQDTPIYDTTTSSSFSPLP 137
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
C + C P+ +S+ T PS Y +G E +
Sbjct: 138 CSSATC---------------LPIWSSRCST---PSATCRYRYAYDDGAYSPECAGIS-- 177
Query: 207 IIPNFLVGCSVLS---SRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSL 263
+ GC V + S G G GRG SL +QL + KFSYCL F +T+ +S +
Sbjct: 178 -VGGIAFGCGVDNGGLSYNSTGTVGLGRGSLSLVAQLGVGKFSYCLT--DFFNTSLSSPV 234
Query: 264 ILDNGSSHSDKKTTG----LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
+ + + + + TP V +P R YYV L I++G R+ + +
Sbjct: 235 FFGSLAELAASSASADAAVVQSTPLVQSPYNPSR------YYVSLEGISLGDARLPIPNG 288
Query: 320 YLTL-DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
L D DG+GG IVDSGT FT + F + D + + + A +L RP
Sbjct: 289 TFDLNDDDGSGGMIVDSGTIFTILVETGFRVVVDHVAGVLGQ-----PVVNASSLD--RP 341
Query: 379 CFDVPG---EKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPS 435
CF P ++ P++ LHF GGA++ L +NY + E S+ CL +V ASG
Sbjct: 342 CFPAPAAGVQELPDMPDMVLHFAGGADMRLHRDNYMSFNEEESSFCLNIVGTESASGS-- 399
Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
+LGNFQ QN + +D+ +L F C
Sbjct: 400 -VLGNFQQQNIQMLFDITVGQLSFMPTDCS 428
>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 495
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 122/442 (27%), Positives = 183/442 (41%), Gaps = 59/442 (13%)
Query: 33 LSRFHTNPSQ-DSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSI 91
LSR H + S+ + L+ + ++++ +K QT+ +T ++ +S G Y
Sbjct: 103 LSRLHRDSSRVQAITTRLQLILNGVSKS-DLKPLQTEIQPQDLSTPVSSGTSQGSGEYFT 161
Query: 92 SLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPK 151
+ G P + +LDTGS + W C C C P F P SSS L C + +
Sbjct: 162 RVGVGNPAKSYYMVLDTGSDINWIQCQ---PCSDCYQQSDPIFTPAASSSYSPLTCDSQQ 218
Query: 152 CSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNR-IIP 209
C+ + S + C Y V YG G T G ++ET++ +
Sbjct: 219 CNSLQMSSCRNGQCR----------------YQVNYGDGSFTFGDFVTETMSFGGSGTVN 262
Query: 210 NFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSS 262
+ +GC G+ G G G SL SQL FSYCL++ D+ +S+
Sbjct: 263 SIALGCG----HDNEGLFVGAAGLLGLGGGPLSLTSQLKATSFSYCLVNR---DSAASST 315
Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
L N + D L + + +YYVGL ++VGG+ +R+ +
Sbjct: 316 LDF-NSAPVGDSVIAPLL-----------KSSKIDTFYYVGLSGMSVGGELLRIPQEVFK 363
Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
LD G+GG IVD GT T + E + L D FVS + R+ AL C+D+
Sbjct: 364 LDDSGDGGVIVDCGTAITRLQSEAYNSLRDSFVSM----SRHLRSTSGVAL--FDTCYDL 417
Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
G+ + P + HF GG LP NY V C + I+GN Q
Sbjct: 418 SGQSSVKVPTVSFHFDGGKSWDLPAANYLIPVDSAGTYCFAFAPTTSSLS----IIGNVQ 473
Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
Q V +DL N R+GF C
Sbjct: 474 QQGTRVSFDLANNRVGFSTNKC 495
>gi|125527370|gb|EAY75484.1| hypothetical protein OsI_03384 [Oryza sativa Indica Group]
Length = 453
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 124/390 (31%), Positives = 181/390 (46%), Gaps = 46/390 (11%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y++S GTP + DTGS L+W C C CS PS+ P SSS+ +
Sbjct: 90 GDYAMSFGIGTPATGLSGEADTGSDLIWTKCG---ACARCSPRGSPSYYPTSSSSAAFVA 146
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-----LTEGIALSETL 201
C + C + C + + S NC SY YG+ TEGI ++ET
Sbjct: 147 CGDRTCGELPRP--LCSNVAGG-GSGSGNC-----SYHYAYGNARDTHHYTEGILMTETF 198
Query: 202 NLPN--RIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
+ P GC++ S +G+ G GRGK SL +QLN++ F Y L S D
Sbjct: 199 TFGDDAAAFPGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQLNVEAFGYRLSS----D 254
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
+ S + + + + TP + NP V + +YYVGL I+VGG+ V++
Sbjct: 255 LSAPSPISFGSLADVTGGNGDSFMSTPLLTNPVVQDLP----FYYVGLTGISVGGKLVQI 310
Query: 317 WHKYLTLDRD-GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
+ DR G GG I DSGTT T + + + DE +SQM + A + +
Sbjct: 311 PSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDDDLI-- 368
Query: 376 LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVV----GEGSAVCLTVVTDREAS 431
CF G T +FP + LHF GGA++ L ENY + GE +A C +VV +A
Sbjct: 369 ---CF-TGGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNGE-TARCWSVVKSSQA- 422
Query: 432 GGPSIILGNFQMQNYYVEYDLR-NQRLGFK 460
I+GN +++V +DL N R+ F+
Sbjct: 423 ---LTIIGNIMQMDFHVVFDLSGNARMLFQ 449
>gi|125571687|gb|EAZ13202.1| hypothetical protein OsJ_03122 [Oryza sativa Japonica Group]
Length = 453
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 124/390 (31%), Positives = 181/390 (46%), Gaps = 46/390 (11%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y++S GTP + DTGS L+W C C CS PS+ P SSS+ +
Sbjct: 90 GDYAMSFGIGTPATGLSGEADTGSDLIWTKCG---ACARCSPRGSPSYYPTSSSSAAFVA 146
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-----LTEGIALSETL 201
C + C + C + + S NC SY YG+ TEGI ++ET
Sbjct: 147 CGDRTCGELPRP--LCSNVAGG-GSGSGNC-----SYHYAYGNARDTHHYTEGILMTETF 198
Query: 202 NLPN--RIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
+ P GC++ S +G+ G GRGK SL +QLN++ F Y L S D
Sbjct: 199 TFGDDAAAFPGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQLNVEAFGYRLSS----D 254
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
+ S + + + + TP + NP V + +YYVGL I+VGG+ V++
Sbjct: 255 LSAPSPISFGSLADVTGGNGDSFMSTPLLTNPVVQDLP----FYYVGLTGISVGGKLVQI 310
Query: 317 WHKYLTLDRD-GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
+ DR G GG I DSGTT T + + + DE +SQM + A + +
Sbjct: 311 PSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDDDLI-- 368
Query: 376 LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVV----GEGSAVCLTVVTDREAS 431
CF G T +FP + LHF GGA++ L ENY + GE +A C +VV +A
Sbjct: 369 ---CF-TGGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNGE-TARCWSVVKSSQA- 422
Query: 432 GGPSIILGNFQMQNYYVEYDLR-NQRLGFK 460
I+GN +++V +DL N R+ F+
Sbjct: 423 ---LTIIGNIMQMDFHVVFDLSGNARMLFQ 449
>gi|255583547|ref|XP_002532530.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223527742|gb|EEF29846.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 440
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 113/379 (29%), Positives = 169/379 (44%), Gaps = 29/379 (7%)
Query: 91 ISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNP 150
+SL GTPPQ +LDTGS L W C H + SF P LSSS +L C +P
Sbjct: 82 VSLPIGTPPQTQQMVLDTGSQLSWIQC--HKKSVPKKPPPTTSFDPSLSSSFSVLPCNHP 139
Query: 151 KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN-RII 208
C D L T+ + ++C Y Y G EG + E + + +
Sbjct: 140 LCK---------PRIPDFTLPTTCDQNRLC-HYSYFYADGTYAEGSLVREKITFSSSQST 189
Query: 209 PNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFD-DTTRTSSLIL-D 266
P ++GC+ S+ + GI G G+ S SQ + KFSYC+ + + + T S L +
Sbjct: 190 PPLILGCAEASTDE-KGILGMNLGRRSFASQAKISKFSYCVPTRQARAGLSSTGSFYLGN 248
Query: 267 NGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRD 326
N +S + LT+TP +P N + Y + ++ I +G R+ + D
Sbjct: 249 NPNSGRFQYINLLTFTPSQRSP-----NLDPLAYTIPMQGIRMGNARLNISATLFRPDPS 303
Query: 327 GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV-PGE 385
G G TI+DSG+ FT++ E + + +E V + G + CFD P E
Sbjct: 304 GAGQTIIDSGSEFTYLVDEAYNKVREEVVRLVGPKLKKGYVYGGVSDM----CFDGNPME 359
Query: 386 KTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQN 445
+ F+ G E+ + A VG G C+ + E G S I+GNF QN
Sbjct: 360 IGRLIGNMVFEFEKGVEIVIDKWRVLADVG-GGVHCIGI-GRSEMLGAASNIIGNFHQQN 417
Query: 446 YYVEYDLRNQRLGFKQQLC 464
+VEYDL N+R+G + C
Sbjct: 418 LWVEYDLANRRIGLGKADC 436
>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
Length = 485
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 123/390 (31%), Positives = 169/390 (43%), Gaps = 49/390 (12%)
Query: 83 SHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSS 142
S G Y L GTP + + +LDTGS +VW C C+ C S P F P+ S +
Sbjct: 136 SQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCA---PCRRCYSQSDPIFDPRKSKTY 192
Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETL 201
+ C +P C + R K C Y V YG G T G +ETL
Sbjct: 193 ATIPCSSPHCRRLDSAGCNTR---------RKTCL-----YQVSYGDGSFTVGDFSTETL 238
Query: 202 NLPNRIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFD 255
+ +GC + AG+ G G+GK S P Q KFSYCL+
Sbjct: 239 TFRRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRS-- 296
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV- 314
+++ SS++ N + + +TP ++NP + +YYVGL I+VGG RV
Sbjct: 297 ASSKPSSVVFGNAAVSRIAR-----FTPLLSNPKL------DTFYYVGLLGISVGGTRVP 345
Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
V LD+ GNGG I+DSGT+ T + + + D F V + R A +
Sbjct: 346 GVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAF---RVGAKTLKR---APNFS 399
Query: 375 GLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGP 434
CFD+ P + LHF+ A+V+LP NY V C GG
Sbjct: 400 LFDTCFDLSNMNEVKVPTVVLHFR-RADVSLPATNYLIPVDTNGKFCFAFA---GTMGGL 455
Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
SII GN Q Q + V YDL + R+GF C
Sbjct: 456 SII-GNIQQQGFRVVYDLASSRVGFAPGGC 484
>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 484
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 114/386 (29%), Positives = 168/386 (43%), Gaps = 54/386 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y +S+ GTP + + + DTGS L W CT C C K P F P SS+ +
Sbjct: 144 GNYVVSMGLGTPARDMTVVFDTGSDLSWVQCT---PCSDCYEQKDPLFDPARSSTYSAVP 200
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNL-P 204
C +P+C + + R C+ + K C Y V+YG T+G +TL L
Sbjct: 201 CASPEC-----QGLDSRSCSRD-----KKC-----RYEVVYGDQSQTDGALARDTLTLTQ 245
Query: 205 NRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTT 258
+ ++P F+ GC + + G+ G GR K SL SQ FSYCL S +
Sbjct: 246 SDVLPGFVFGCGEQDTGLFGRADGLVGLGREKVSLSSQAASKYGAGFSYCLPS-----SP 300
Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
+ + G + ++ + T + R+ +YYV L + V G+ VRV
Sbjct: 301 SAAGYLSLGGPAPANARFTAME-----------TRHDSPSFYYVRLVGVKVAGRTVRVSP 349
Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
+ GT++DSGT T + P ++ L F M + Y R A AL+ L
Sbjct: 350 IVFS-----AAGTVIDSGTVITRLPPRVYAALRSAFARSMGRY-GYKR---APALSILDT 400
Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
C+D G T P + L F GGA V L V + S CL + + G + I+
Sbjct: 401 CYDFTGHTTVRIPSVALVFAGGAAVGLDFSGVL-YVAKVSQACLAFAPNGD--GADAGII 457
Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
GN Q + V YD+ Q++GF C
Sbjct: 458 GNTQQKTLAVVYDVARQKIGFGANGC 483
>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
Length = 452
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 118/409 (28%), Positives = 182/409 (44%), Gaps = 65/409 (15%)
Query: 68 KTTTTTTTTTTTNISSHSYGG-YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYC 126
+T+ ++ N+ S G Y I + FGTP Q + ++DTGS + W PC QC+ C
Sbjct: 93 RTSRSSKEDANANVPVRSGSGEYIIQVDFGTPKQSMYTLIDTGSDVAWIPCK---QCQGC 149
Query: 127 SSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNC--TQICPSYL 184
S+ P F P SSS + C + C I S NC C +
Sbjct: 150 HSTA-PIFDPAKSSSYKPFACDSQPCQEI-----------------SGNCGGNSKC-QFE 190
Query: 185 VLYGSGL-TEGIALSETLNLPNRIIPNFLVGCSVLSSRQ-------PAGIAGFGRGKTSL 236
VLYG G +G S+ + L ++ +PNF GC+ S G T
Sbjct: 191 VLYGDGTQVDGTLASDAITLGSQYLPNFSFGCAESLSEDTYSSPGLMGLGGGSLSLLTQA 250
Query: 237 P-SQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNA 295
P ++L FSYCL S +T + SL+L ++ S ++ L +T + +PS
Sbjct: 251 PTAELFGGTFSYCLPSS----STSSGSLVLGKEAAVS---SSSLKFTTLIKDPS------ 297
Query: 296 FSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFV 355
F +Y+V L+ I+VG R+ V + GGTI+DSGTT T++ P ++ L D F
Sbjct: 298 FPTFYFVTLKAISVGNTRISVPATNIA----SGGGTIIDSGTTITYLVPSAYKDLRDAFR 353
Query: 356 SQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVG 415
Q+ +L + + C+D+ P + LH ++ LP EN +
Sbjct: 354 QQL-------SSLQPTPVEDMDTCYDLSSSSV-DVPTITLHLDRNVDLVLPKENIL-ITQ 404
Query: 416 EGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
E CL + S I+GN Q QN+ + +D+ N ++GF Q+ C
Sbjct: 405 ESGLSCLAFSSTDSRS-----IIGNVQQQNWRIVFDVPNSQVGFAQEQC 448
>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 504
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 114/383 (29%), Positives = 164/383 (42%), Gaps = 48/383 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + G+P + + +LDTGS + W C C C P F P LS+S +
Sbjct: 165 GEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQ---PCADCYQQSDPVFDPSLSTSYASVA 221
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C NP+C +D A +N T C Y V YG G T G +ETL L +
Sbjct: 222 CDNPRC-------------HDLDAAACRNSTGAC-LYEVAYGDGSYTVGDFATETLTLGD 267
Query: 206 RI-IPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
+ + +GC + AG+ G G S PSQ++ FSYCL+ D+ +S
Sbjct: 268 SAPVSSVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISATTFSYCLVDR---DSPSSS 324
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
+L + + D + T P + +P S +YYVGL ++VGGQ + +
Sbjct: 325 TLQFGDAA---DAEVTA----PLIRSPRT------STFYYVGLSGLSVGGQILSIPPSAF 371
Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
+D G GG IVDSGT T + + L D FV ++ R G ++ C+D
Sbjct: 372 AMDSTGAGGVIVDSGTAVTRLQSSAYAALRDAFVR---GTQSLPRTSG---VSLFDTCYD 425
Query: 382 VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNF 441
+ + P + L F GG E+ LP +NY V CL A I+GN
Sbjct: 426 LSDRTSVEVPAVSLRFAGGGELRLPAKNYLIPVDGAGTYCLAFAPTNAAVS----IIGNV 481
Query: 442 QMQNYYVEYDLRNQRLGFKQQLC 464
Q Q V +D +GF C
Sbjct: 482 QQQGTRVSFDTAKSTVGFTTNKC 504
>gi|449461377|ref|XP_004148418.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449518059|ref|XP_004166061.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 436
Score = 132 bits (331), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 125/436 (28%), Positives = 189/436 (43%), Gaps = 52/436 (11%)
Query: 38 TNPSQDSYQN-LNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFG 96
T P +S+ N + + S R ++ + + T + + + G Y + + G
Sbjct: 45 TAPKSESWMNTVIDMASKDPARIRYLSSLTAQKTVAAPIASGQQV--LNVGNYVVRVQLG 102
Query: 97 TPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIH 156
TP Q + +LDT + W PC+ C CSS+ +F + SS+ L C P+C+
Sbjct: 103 TPGQTMYMVLDTSNDAAWAPCSG---CIGCSSTT--TFSAQNSSTFATLDCSKPECT--Q 155
Query: 157 HESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLT-EGIALSETLNLPNRIIPNFLVGC 215
+ C P + +C + YG T + ++L+L +IPNF GC
Sbjct: 156 ARGLSC------PTTGNVDCL-----FNQTYGGDSTFSATLVQDSLHLGPNVIPNFSFGC 204
Query: 216 ---SVLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTTRTSSLILDNGS 269
+ SS P G+ G GRG SL SQ L FSYCL S F + SL L
Sbjct: 205 ISSASGSSIPPQGLMGLGRGPLSLISQSGSLYSGLFSYCLPS--FKSYYFSGSLKLGPVG 262
Query: 270 SHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNG 329
+TT L + P + PS+ YYV L I+VG V + + L D +
Sbjct: 263 QPKAIRTTPLLHNP--HRPSL---------YYVNLTGISVGRVLVPISPELLAFDPNTGA 311
Query: 330 GTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGS 389
GTI+DSGT T P ++ + DEF Q+ + + LGA CF E S
Sbjct: 312 GTIIDSGTVITRFVPAIYTAVRDEFRKQVGGSFS---PLGA-----FDTCFATNNEV--S 361
Query: 390 FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVE 449
P + LH G ++ LP+EN GS CL + ++ N Q QN+ +
Sbjct: 362 APAITLHLS-GLDLKLPMENSLIHSSAGSLACLAMAAAPNNVNSVVNVIANLQQQNHRIL 420
Query: 450 YDLRNQRLGFKQQLCK 465
+D+ N +LG ++LC
Sbjct: 421 FDINNSKLGIARELCN 436
>gi|18408451|ref|NP_564867.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322615|gb|AAG51309.1|AC026480_16 unknown protein [Arabidopsis thaliana]
gi|14334808|gb|AAK59582.1| unknown protein [Arabidopsis thaliana]
gi|15293195|gb|AAK93708.1| unknown protein [Arabidopsis thaliana]
gi|332196351|gb|AEE34472.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 430
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 117/382 (30%), Positives = 168/382 (43%), Gaps = 37/382 (9%)
Query: 91 ISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNP 150
ISL GTPPQ +LDTGS L W C ++ K K SF P LSSS L C +P
Sbjct: 74 ISLPIGTPPQAQQMVLDTGSQLSWIQC---HRKKLPPKPKT-SFDPSLSSSFSTLPCSHP 129
Query: 151 KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN-RII 208
C D L TS + ++C Y Y G EG + E + N I
Sbjct: 130 LCK---------PRIPDFTLPTSCDSNRLC-HYSYFYADGTFAEGNLVKEKITFSNTEIT 179
Query: 209 PNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCL--LSHKFDDTTRTSSLILD 266
P ++GC+ SS GI G RG+ S SQ + KFSYC+ S++ T S + D
Sbjct: 180 PPLILGCATESSDD-RGILGMNRGRLSFVSQAKISKFSYCIPPKSNRPGFTPTGSFYLGD 238
Query: 267 NGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRD 326
N +SH K + LT+ P N + Y V + I G +++ + D
Sbjct: 239 NPNSHGFKYVSLLTFPESQRMP-----NLDPLAYTVPMIGIRFGLKKLNISGSVFRPDAG 293
Query: 327 GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEK 386
G+G T+VDSG+ FT + ++ + E ++++ + G A CFD
Sbjct: 294 GSGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADM----CFD---GN 346
Query: 387 TGSFP----ELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
P +L F G E+ +P E VG G + + G S I+GN
Sbjct: 347 VAMIPRLIGDLVFVFTRGVEILVPKERVLVNVGGG--IHCVGIGRSSMLGAASNIIGNVH 404
Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
QN +VE+D+ N+R+GF + C
Sbjct: 405 QQNLWVEFDVTNRRVGFAKADC 426
>gi|357491933|ref|XP_003616254.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517589|gb|AES99212.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 442
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 120/412 (29%), Positives = 195/412 (47%), Gaps = 35/412 (8%)
Query: 64 NPQTKTT---TTTTTTTTTNISSHSYGGYS----ISLSFGTPPQIIPFILDTGSHLVWFP 116
N +TKT TT +++++++I+ S YS ++L GTPPQ+ +LDTGS L W
Sbjct: 50 NSKTKTNQQFTTLSSSSSSSINVKSSFKYSMALVVTLPIGTPPQLQQMVLDTGSQLSWIQ 109
Query: 117 CTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNC 176
C N + SF P LSSS +L C +P C D L T +
Sbjct: 110 CHNKKTPQKKQPPTTSSFDPSLSSSFFVLPCNHPLCK---------PRVPDFSLPTDCDA 160
Query: 177 TQICPSYLVLYGSG-LTEGIALSETLNL-PNRIIPNFLVGCSVLSSRQPAGIAGFGRGKT 234
+C Y Y G EG + E + P++ P ++GC+ S GI G G+
Sbjct: 161 NSLC-HYSYFYADGTYAEGNLVREKIAFSPSQTTPPIILGCAT-QSDDARGILGMNLGRL 218
Query: 235 SLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERN 294
PSQ + KFSYC+ + + + S + +N +S S + LT+ P N
Sbjct: 219 GFPSQAKITKFSYCVPTKQAQPAS-GSFYLGNNPASSSFRYVNLLTFGQSQRMP-----N 272
Query: 295 AFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEF 354
+ Y + L+ I++GG+++ + + G+G T++DSG+ FT++ E + + +E
Sbjct: 273 LDPLAYTLPLQGISIGGKKLNIPPSVFKPNAGGSGQTMIDSGSEFTYLVDEAYNVIREEL 332
Query: 355 VSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSF-PELKLHFKGGAEVTLPVENYFAV 413
V ++ G A CFD + G ++ F+ G ++ +P E A
Sbjct: 333 VKKVGPKIKKGYMYGGVADI----CFDGDAIEIGRLVGDMVFEFEKGVQIVIPKERVLAT 388
Query: 414 VGEGSAVCLTV-VTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
V +G CL + ++R +GG I+GNF QN +VE+DL N+R+GF + C
Sbjct: 389 V-DGGVHCLGMGRSERLGAGGN--IIGNFHQQNLWVEFDLANRRVGFGEADC 437
>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 113/382 (29%), Positives = 166/382 (43%), Gaps = 48/382 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + G P + + +LDTGS + W CT C C P F P SSS L
Sbjct: 149 GEYFTRVGIGNPAREVYMVLDTGSDVNWLQCT---PCADCYHQTEPIFEPSSSSSYEPLS 205
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C P+C+ + E +CR N T + Y V YG G T G +ETL + +
Sbjct: 206 CDTPQCNAL--EVSECR-----------NATCL---YEVSYGDGSYTVGDFATETLTIGS 249
Query: 206 RIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSS 262
++ N VGC + AG+ G G G +LPSQLN FSYCL+ D S+
Sbjct: 250 TLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLVDRDSD-----SA 304
Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
++ G+S P + N + +YY+GL I+VGG+ +++
Sbjct: 305 STVEFGTSLPPDAVVA----PLLRN------HQLDTFYYLGLTGISVGGELLQIPQSSFE 354
Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
+D G+GG I+DSGT T + ++ L D F+ + + A + C+++
Sbjct: 355 MDESGSGGIIIDSGTAVTRLQTGIYNSLRDSFL------KGTSDLEKAAGVAMFDTCYNL 408
Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
+ T P + HF GG + LP +NY V CL + I+GN Q
Sbjct: 409 SAKTTIEVPTVAFHFPGGKMLALPAKNYMIPVDSVGTFCLAFA----PTASSLAIIGNVQ 464
Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
Q V +DL N +GF C
Sbjct: 465 QQGTRVTFDLANSLIGFSSNKC 486
>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
Length = 336
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 110/372 (29%), Positives = 168/372 (45%), Gaps = 50/372 (13%)
Query: 106 LDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDC 165
+DTGS L+W C C C+ P F K S++ R L C++ +C+ +
Sbjct: 1 MDTGSDLIWTQCA---PCLLCADQPTPYFDVKKSATYRALPCRSSRCASL---------- 47
Query: 166 NDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETL-----NLPNRIIPNFLVGCSVLS 219
+S +C + Y YG + T G+ +ET N N GC L+
Sbjct: 48 ------SSPSCFKKMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLN 101
Query: 220 SRQPA---GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKT 276
+ A G+ GFGRG SL SQL +FSYCL S+ +R + N SS +
Sbjct: 102 AGDLANSSGMVGFGRGPLSLVSQLGPSRFSYCLTSYLSATPSRLYFGVYANLSSTNTSSG 161
Query: 277 TGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSG 336
+ + TPFV NP++ Y++ L+ I++G + + + ++ DG GG I+DSG
Sbjct: 162 SPVQSTPFVINPALPNM------YFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSG 215
Query: 337 TTFTFMAPELFEPLADEFVSQM-VKNRNYTRALGAEALTGLRPCFDVPGEK--TGSFPEL 393
T+ T++ + +E + VS + + N T GL CF P T + P+L
Sbjct: 216 TSITWLQQDAYEAVRRGLVSAIPLPAMNDTD-------IGLDTCFQWPPPPNVTVTVPDL 268
Query: 394 KLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLR 453
HF A +TL ENY + +CL + A G I+GN+Q QN ++ YD+
Sbjct: 269 VFHFD-SANMTLLPENYMLIASTTGYLCLVM-----APTGVGTIIGNYQQQNLHLLYDIG 322
Query: 454 NQRLGFKQQLCK 465
N L F C
Sbjct: 323 NSFLSFVPAPCD 334
>gi|357113696|ref|XP_003558637.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 432
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 118/437 (27%), Positives = 180/437 (41%), Gaps = 47/437 (10%)
Query: 40 PSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPP 99
PS +++ +L R L + + +T ++ S S Y + G+P
Sbjct: 32 PSSSPLESIIALAREDDARLLFL----SSKAASTGVSSAPVASGQSPPSYVVRAGLGSPA 87
Query: 100 QIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHES 159
Q I LDT + W C+ C C SS F P S+S L C + C+ + +
Sbjct: 88 QPILLALDTSADATWAHCS---PCGTCPSSGS-LFAPANSTSYAPLPCSSTMCTVLQGQP 143
Query: 160 IQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLS 219
+D D S +C ++ + + S+ L+L IPN+ GC
Sbjct: 144 CPAQDPYD-----SSAPLPMC-AFTKPFADASFQASLASDWLHLGKDAIPNYAFGCVSAV 197
Query: 220 SRQPA-----GIAGFGRGKTSLPSQL-NL--DKFSYCLLSHK---FDDTTRTSSLILDNG 268
S A G+ G GRG +L SQ+ N+ FSYCL S+K F + R +
Sbjct: 198 SGPTANLPKQGLLGLGRGPMALLSQVGNMYNGVFSYCLPSYKSYYFSGSLRLGAA----- 252
Query: 269 SSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGN 328
+ G+ YTP + NP+ S YYV + ++VG V+V D
Sbjct: 253 -----GQPRGVRYTPMLKNPN------RSSLYYVNVTGLSVGRAPVKVPAGSFAFDPATG 301
Query: 329 GGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTG 388
GT+VDSGT T P ++ L +EF + YT +LGA CF+ G
Sbjct: 302 AGTVVDSGTVITRWTPPVYAALREEFRRHVAAPSGYT-SLGA-----FDTCFNTDEVAAG 355
Query: 389 SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYV 448
P + +H GG ++ LP+EN CL + + +L N Q QN V
Sbjct: 356 VAPAVTVHMDGGLDLALPMENTLIHSSATPLACLAMAEAPQNVNAVVNVLANLQQQNLRV 415
Query: 449 EYDLRNQRLGFKQQLCK 465
+D+ N R+GF ++ C
Sbjct: 416 VFDVANSRVGFARESCN 432
>gi|413944032|gb|AFW76681.1| hypothetical protein ZEAMMB73_606599 [Zea mays]
Length = 315
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 111/351 (31%), Positives = 159/351 (45%), Gaps = 63/351 (17%)
Query: 138 LSSSSRLLGCQNPKC--------SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS 189
+SS+ + + C +P C S E+ QC YL YG
Sbjct: 1 MSSTFKAVACPDPICRPSSGVSVSACAMENFQCF-------------------YLCSYGD 41
Query: 190 -GLTEGIALSETLNL--PNRI---IPNFLVGC----SVLSSRQPAGIAGFGRGKTSLPSQ 239
+T G +T PN + + GC + L +GIAGFGRG SLPSQ
Sbjct: 42 RSITAGHIFKDTFTFMSPNGVPVAVSELAFGCGDYNTGLFVSNESGIAGFGRGPQSLPSQ 101
Query: 240 LNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSD---KKTTG-LTYTPFVNNPSVAERNA 295
L + +FSYCL T SS+++ D TTG TP + NP +
Sbjct: 102 LKVGRFSYCLTLV----TESKSSVVILGTPPDPDGLRAHTTGPFQSTPIIYNPLIP---- 153
Query: 296 FSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFV 355
+YY+ L ITVG R+ L +DG+GGT++DSGT+ T + +FE L +E V
Sbjct: 154 --TFYYLSLEGITVGKTRLPFDKSVFALKKDGSGGTVIDSGTSLTTLPEAVFELLQEELV 211
Query: 356 SQMVKNR-NYTRALGAEALTGLRPCFDVP-GEKTGSFPELKLHFKGGAEVTLPVENYFAV 413
+Q R + T +G R CF P G K P+L LH GA++ LP +NYF
Sbjct: 212 AQFPLPRYDNTPEVGD------RLCFRRPKGGKQVPVPKLILHL-AGADMDLPRDNYFVE 264
Query: 414 VGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ +CL + + + +++GNFQ QN +V YD+ N +L F C
Sbjct: 265 EPDSGVMCLQINGAEDTT---MVLIGNFQQQNMHVVYDVENNKLLFAPAQC 312
>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 486
Score = 131 bits (330), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 116/392 (29%), Positives = 162/392 (41%), Gaps = 58/392 (14%)
Query: 82 SSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSS 141
+S G Y + G PP + +LDTGS + W C C C P F P S+S
Sbjct: 144 ASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCA---PCAECYEQTDPXFEPTSSAS 200
Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSET 200
L C+ +C +S+ +C +N T + Y V YG G T G ++ET
Sbjct: 201 FTSLSCETEQC-----KSLDVSEC--------RNGTCL---YEVSYGDGSYTVGDFVTET 244
Query: 201 LNLPNRIIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNLDKFSYCLLSHK 253
+ L + + N +GC G+ G G G S PSQLN FSYCL+
Sbjct: 245 VTLGSTSLGNIAIGCG----HNNEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYCLVDRD 300
Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
D T+ LD S + T P NP++ ++Y+GL ++VGG
Sbjct: 301 SDSTS-----TLDFNSPITPDAVTA----PLHRNPNL------DTFFYLGLTGMSVGGAV 345
Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
+ + + DGNGG IVDSGT T + ++ L D FV + A +
Sbjct: 346 LPIPETSFQMSEDGNGGIIVDSGTAVTRLQTTVYNVLRDAFVKSTHDLQT------ARGV 399
Query: 374 TGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREASG 432
C+D+ + P + HF G E+ LP +NY V C TD S
Sbjct: 400 ALFDTCYDLSSKSRVEVPTVSFHFANGNELPLPAKNYLIPVDSEGTFCFAFAPTDSTLS- 458
Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
ILGN Q Q V +DL N +GF C
Sbjct: 459 ----ILGNAQQQGTRVGFDLANSLVGFSPNKC 486
>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 365
Score = 131 bits (329), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 119/392 (30%), Positives = 172/392 (43%), Gaps = 55/392 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y ++ GTP ++ I+DTGS L W C+ C C S F+P S+S L
Sbjct: 11 GEYLATVRLGTPERVFSVIVDTGSDLTWVQCS---PCGKCYSQNDALFLPNTSTSFTKLA 67
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETL---- 201
C + C N P C Q Y YG G LT G + +T+
Sbjct: 68 CGSALC-------------NGLPFPM---CNQTTCVYWYSYGDGSLTTGDFVYDTITMDG 111
Query: 202 -NLPNRIIPNFLVGCSVLSSRQPAG---IAGFGRGKTSLPSQLNL---DKFSYCLLSHKF 254
N + +PNF GC + AG I G G+G S SQL KFSYCL+
Sbjct: 112 INGQKQQVPNFAFGCGHDNEGSFAGADGILGLGQGPLSFHSQLKSVYNGKFSYCLVDW-L 170
Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
T+TS L+ + + + Y P + NP V YYYV L I+VG +
Sbjct: 171 APPTQTSPLLFGDAAV---PILPDVKYLPILANPKVP------TYYYVKLNGISVGDNLL 221
Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELF-EPLADEFVSQMVKNRNYTRALGAEAL 373
+ +D G GTI DSGTT T +A + E LA S M Y+R + + +
Sbjct: 222 NISSTVFDIDSVGGAGTIFDSGTTVTQLAEAAYKEVLAAMNASTMA----YSRKI--DDI 275
Query: 374 TGLRPCFD-VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG 432
+ L C P ++ + P + HF+GG ++ LP NYF + + C + + + +
Sbjct: 276 SRLDLCLSGFPKDQLPTVPAMTFHFEGG-DMVLPPSNYFIYLESSQSYCFAMTSSPDVN- 333
Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+G+ Q QN+ V YD ++LGF + C
Sbjct: 334 ----IIGSVQQQNFQVYYDTAGRKLGFVPKDC 361
>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 131 bits (329), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 122/394 (30%), Positives = 166/394 (42%), Gaps = 51/394 (12%)
Query: 83 SHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSS 142
S G Y + L GTP + +LDTGS +VW C+ CK C + F PK S +
Sbjct: 129 SQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCS---PCKACYNQTDAIFDPKKSKTF 185
Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETL 201
+ C + C + S +C SK C Y V YG G TEG +ETL
Sbjct: 186 ATVPCGSRLCRRLDDSS----ECVTR---RSKTCL-----YQVSYGDGSFTEGDFSTETL 233
Query: 202 NLPNRIIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLN---LDKFSYCLLS 251
+ + +GC G+ G GRG S PSQ KFSYCL+
Sbjct: 234 TFHGARVDHVPLGC----GHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVD 289
Query: 252 HKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG 311
++ + G++ K + +TP + NP + +YY+ L I+VGG
Sbjct: 290 RTSSGSSSKPPSTIVFGNAAVPKTS---VFTPLLTNPKL------DTFYYLQLLGISVGG 340
Query: 312 QRV-RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
RV V LD GNGG I+DSGT+ T + + L D F K + A
Sbjct: 341 SRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQPAYVALRDAFRLGATKLKR------A 394
Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA 430
+ + CFD+ G T P + HF GG EV+LP NY V C
Sbjct: 395 PSYSLFDTCFDLSGMTTVKVPTVVFHF-GGGEVSLPASNYLIPVNTEGRFCFAFA----G 449
Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ G I+GN Q Q + V YDL R+GF + C
Sbjct: 450 TMGSLSIIGNIQQQGFRVAYDLVGSRVGFLSRAC 483
>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 131 bits (329), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 114/392 (29%), Positives = 160/392 (40%), Gaps = 58/392 (14%)
Query: 82 SSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSS 141
+S G Y + G PP ILDTGS + W C C C P F P S+S
Sbjct: 142 TSQGSGEYFSRVGIGKPPSQAYLILDTGSDVNWVQCA---PCADCYQQADPIFEPASSAS 198
Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSET 200
L C +C + + +CR ND L Y V YG G T G ++ET
Sbjct: 199 FSTLSCNTRQCRSL--DVSECR--NDTCL------------YEVSYGDGSYTVGDFVTET 242
Query: 201 LNLPNRIIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNLDKFSYCLLSHK 253
+ L + + N +GC G+ G G G S PSQ+N FSYCL+
Sbjct: 243 ITLGSAPVDNVAIGCG----HNNEGLFVGAAGLLGLGGGSLSFPSQINATSFSYCLVDR- 297
Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
D+ S+L ++ T P + + + +YYVGL ++VGG+
Sbjct: 298 --DSESASTLEFNS------------TLPPNAVSAPLLRNHHLDTFYYVGLTGLSVGGEL 343
Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL-GAEA 372
V + +D GNGG IVDSGT T + +++ L D FV + TR L
Sbjct: 344 VSIPESAFQIDESGNGGVIVDSGTAITRLQTDVYNSLRDAFVKR-------TRDLPSTNG 396
Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG 432
+ C+D+ + P + HF G E+ LP +NY + C +
Sbjct: 397 IALFDTCYDLSSKGNVEVPTVSFHFPDGKELPLPAKNYLVPLDSEGTFCFAFAPTASSLS 456
Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN Q Q V YDL N +GF C
Sbjct: 457 ----IIGNVQQQGTRVVYDLVNHLVGFVPNKC 484
>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
Length = 517
Score = 131 bits (329), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 128/419 (30%), Positives = 186/419 (44%), Gaps = 49/419 (11%)
Query: 64 NPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC 123
+P+ + T + ++ S G Y + + GTPP+ I+DTGS L W C C
Sbjct: 127 SPRRALSERMVATVESGVAVGS-GEYLMDVYVGTPPRRFRMIMDTGSDLNWLQCA---PC 182
Query: 124 KYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSY 183
C P F P SSS R + C + +C + + P A + CP Y
Sbjct: 183 LDCFDQVGPVFDPAASSSYRNVTCGDQRCGLVAPP--------EPPRACRRPGEDSCP-Y 233
Query: 184 LVLYG--SGLTEGIAL-SETLNL----PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGK 233
YG S T +AL S T+NL +R + + + GC + AG+ G GRG
Sbjct: 234 YYWYGDQSNTTGDLALESFTVNLTAPGASRRVDDVVFGCGHWNRGLFHGAAGLLGLGRGP 293
Query: 234 TSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSV 290
S SQL FSYCL+ H D ++ + ++ + L YT F S
Sbjct: 294 LSFASQLRAVYGHTFSYCLVDHGSDVASKV--VFGEDDALALAAAHPQLNYTAFAPASSP 351
Query: 291 AERNAFSVYYYVGLRRITVGGQRVRV----WHKYLTLDRDGNGGTIVDSGTTFTFMAPEL 346
A+ +YYV L+ + VGG+ + + W G TI+DSGTT ++
Sbjct: 352 AD-----TFYYVKLKGVLVGGELLNISSDTWGVGEGEGGSGG--TIIDSGTTLSYFVEPA 404
Query: 347 FEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLP 406
++ + F+ +M R+Y L+ PC++V G PEL L F GA P
Sbjct: 405 YQVIRQAFIDRM--GRSYPLIPDFPVLS---PCYNVSGVDRPEVPELSLLFADGAVWDFP 459
Query: 407 VENYFAVVGEGSAVCLTVV-TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
ENYF + +CL V+ T R I+GNFQ QN++V YDL+N RLGF + C
Sbjct: 460 AENYFIRLDPDGIMCLAVLGTPRTGMS----IIGNFQQQNFHVVYDLKNNRLGFAPRRC 514
>gi|21618176|gb|AAM67226.1| unknown [Arabidopsis thaliana]
Length = 430
Score = 131 bits (329), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 117/382 (30%), Positives = 168/382 (43%), Gaps = 37/382 (9%)
Query: 91 ISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNP 150
ISL GTPPQ +LDTGS L W C ++ K K SF P LSSS L C +P
Sbjct: 74 ISLPIGTPPQAQQMVLDTGSQLSWIQC---HRKKLPPKPKT-SFDPSLSSSFSTLPCSHP 129
Query: 151 KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN-RII 208
C D L TS + ++C Y Y G EG + E + N I
Sbjct: 130 LCK---------PRIPDFTLPTSCDSNRLC-HYSYFYADGTFAEGNLVKEKITFSNTEIT 179
Query: 209 PNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCL--LSHKFDDTTRTSSLILD 266
P ++GC+ SS GI G RG+ S SQ + KFSYC+ S++ T S + D
Sbjct: 180 PPLILGCATESSDD-RGILGMNRGRLSFVSQAKISKFSYCIPPKSNRPGFTPTGSFYLGD 238
Query: 267 NGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRD 326
N +SH K + LT+ P N + Y V + I G +++ + D
Sbjct: 239 NPNSHGFKYVSLLTFPESQRMP-----NLDPLAYTVPMIGIRFGLKKLNISGSVFRPDAG 293
Query: 327 GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEK 386
G+G T+VDSG+ FT + ++ + E ++++ + G A CFD
Sbjct: 294 GSGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADM----CFD---GN 346
Query: 387 TGSFP----ELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
P +L F G E+ +P E VG G + + G S I+GN
Sbjct: 347 VAMIPRLIGDLVFVFTRGVEIFVPKERVLVNVGGG--IHCVGIGRSSMLGAASNIIGNVH 404
Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
QN +VE+D+ N+R+GF + C
Sbjct: 405 QQNLWVEFDVTNRRVGFAKADC 426
>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
Length = 428
Score = 131 bits (329), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 117/384 (30%), Positives = 165/384 (42%), Gaps = 55/384 (14%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + + GTP Q + LDT + W PC+ C C+SS + F P SSSSR L C
Sbjct: 91 YIVRANIGTPAQPMLVALDTSNDAAWVPCSG---CVGCASSVL--FDPSKSSSSRNLQCD 145
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
P QC+ + K+C + + YG E +TL L N +I
Sbjct: 146 AP----------QCKQAPNPTCTAGKSC-----GFNMTYGGSTIEASLTQDTLTLANDVI 190
Query: 209 PNFLVGC--SVLSSRQPA-GIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTTRTSS 262
++ GC + PA G+ G GRG SL SQ L + FSYCL + K + + +
Sbjct: 191 KSYTFGCISKATGTSLPAQGLMGLGRGPLSLISQTQNLYMSTFSYCLPNSKSSNFSGSLR 250
Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
L G + + + TP + NP S YYV L I VG + V + L
Sbjct: 251 L----GPKYQPVR---IKTTPLLKNPR------RSSLYYVNLVGIRVGNKIVDIPTSALA 297
Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
D GTI DSGT FT + + + +EF + +KN N A +L G C+
Sbjct: 298 FDASTGAGTIFDSGTVFTRLVEPAYVAVRNEF-RRRIKNAN------ATSLGGFDTCY-- 348
Query: 383 PGEKTGS--FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
+GS +P + F G VTLP +N GS CL + ++ +
Sbjct: 349 ----SGSVVYPSVTFMF-AGMNVTLPPDNLLIHSSSGSTSCLAMAAAPNNVNSVLNVIAS 403
Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
Q QN+ V DL N RLG ++ C
Sbjct: 404 MQQQNHRVLIDLPNSRLGISRETC 427
>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
Length = 420
Score = 131 bits (329), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 122/389 (31%), Positives = 170/389 (43%), Gaps = 64/389 (16%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
GGY++++S GTP + DTGS L+W C C C P F P SS+ L
Sbjct: 84 GGYNMNISVGTPLLTFSVVADTGSDLIWTQCA---PCTKCFQQPAPPFQPASSSTFSKLP 140
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
C + C ++ + R CN + C Y YGSG T G +ETL + +
Sbjct: 141 CTSSFCQFLPNS---IRTCN------ATGCV-----YNYKYGSGYTAGYLATETLKVGDA 186
Query: 207 IIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILD 266
P+ GCS + G G+ L + +FSYCL S S ++
Sbjct: 187 SFPSVAFGCSTEN--------GLGQ------LDLGVGRFSYCLRSGS---AAGASPILFG 229
Query: 267 NGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRD 326
+ ++ +D + TPFVNNP+V YYYV L ITVG + V ++
Sbjct: 230 SLANLTDGN---VQSTPFVNNPAVHPS-----YYYVNLTGITVGETDLPVTTSTFGFTQN 281
Query: 327 G-NGGTIVDSGTTFTFMAPELFEPLADEFVSQM--VKNRNYTRALGAEALTGLRPCFDVP 383
G GGTIVDSGTT T++A + +E + F+SQ V N TR GL CF
Sbjct: 282 GLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTADVTTVNGTR--------GLDLCFKST 333
Query: 384 GEKTG--SFPELKLHFKGGAEVTLPVENYFAVV-----GEGSAVCLTVVTDREASGGPSI 436
G G + P L L F GGAE +P YFA V G + CL ++ + P
Sbjct: 334 GGGGGGIAVPSLVLRFDGGAEYAVP--TYFAGVETDSQGSVTVACLMMLPAKGDQ--PMS 389
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
++GN + ++ YDL F C
Sbjct: 390 VIGNVMQMDMHLLYDLDGGIFSFAPADCA 418
>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
Length = 368
Score = 131 bits (329), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 116/398 (29%), Positives = 179/398 (44%), Gaps = 54/398 (13%)
Query: 91 ISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNP 150
+ L G+ + + I+DTGS V C S P F P S S R + C +
Sbjct: 1 MQLGIGSLQKNLSAIIDTGSEAVLV---------QCGSRSRPVFDPAASQSYRQVPCISQ 51
Query: 151 KCSWIHHESIQCRDCNDEP-LATSKNCTQICPSYLVLYGSG------LTEGIALSETLNL 203
C + Q + + +P + +S CT Y + YG ++ + + N
Sbjct: 52 LCLAVQQ---QTSNGSSQPCVNSSAACT-----YSLSYGDSRNSTGDFSQDVIFLNSTNS 103
Query: 204 PNRIIP--NFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQLNL----DKFSYCLLSH 252
++ + + GC+ L GI GF RG SLPSQL KFSYC S
Sbjct: 104 SSQAVQFRDVAFGCAHSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQ 163
Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
+ + D+G S S ++YTP ++NP R S YYVGL I+V G+
Sbjct: 164 PWQPRATGVIFLGDSGLSKSK-----VSYTPLLDNPVTPAR---SQLYYVGLTSISVDGK 215
Query: 313 RVRVWHKYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRA-LGA 370
+ + LD G+GGT++DSGTTFT + + + + F + NR+ R +GA
Sbjct: 216 TLAIPESAFKLDPSTGDGGTVLDSGTTFTRVVDDAYTAFRNAFAA---SNRSGLRKKVGA 272
Query: 371 EALTGLRPCFDV-PGEKTGSFPELKLHFKGGAEVTLPVENYFAVV---GEGSAVCLTVVT 426
A G C+++ G PE++L + + L E+ F V G VCL +++
Sbjct: 273 AA--GFDDCYNISAGSSLPGVPEVRLSLQNNVRLELRFEHLFVPVSAAGNEVTVCLAILS 330
Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+++ G +LGN+Q NY VEYD R+GF++ C
Sbjct: 331 SQKSGFGKINVLGNYQQSNYLVEYDNERSRVGFERADC 368
>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
Length = 393
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 131/437 (29%), Positives = 198/437 (45%), Gaps = 72/437 (16%)
Query: 46 QNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSY---GGYSISLSFGTPPQII 102
+ + +LV+ S R + + +++ ++ TT++ S + GGY + +S GTP +
Sbjct: 10 EAIRALVAKSHARVRWMA-ARANSSSWSSMAGTTDVESPLHPDGGGYVMDISVGTPGKRF 68
Query: 103 PFILDTGSHLVWF---PCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHES 159
I DTGS LVW PCT CS I F P+ SS+ R + C + C+ +
Sbjct: 69 RAIADTGSDLVWVQSEPCTG------CSGGTI--FDPRQSSTFREMDCSSQLCAELPGSC 120
Query: 160 IQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL-----PNRIIPNFLVG 214
EP S C SY YGSG TEG +T++L ++ P+F VG
Sbjct: 121 --------EP--GSSTC-----SYSYEYGSGETEGEFARDTISLGTTSDGSQKFPSFAVG 165
Query: 215 CSVLSSRQPA--GIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGS 269
C +++S G+ G G+G SL SQL+ KFSYCL+ + + +S L+ +
Sbjct: 166 CGMVNSGFDGVDGLVGLGQGPVSLTSQLSAAIDSKFSYCLV--DINSQSESSPLLFGPSA 223
Query: 270 SHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNG 329
+ TP PS + + YY + + I V GQ + G
Sbjct: 224 ALHGTGIQSTKITP----PS----DTYPTYYLLTVNGIAVAGQTM-----------GSPG 264
Query: 330 GTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGS 389
TI+DSGTT T++ ++ +S+M R G+ GL C+D +
Sbjct: 265 TTIIDSGTTLTYVPSGVY----GRVLSRMESMVTLPRVDGSS--MGLDLCYDRSSNRNYK 318
Query: 390 FPELKLHFKGGAEVTLPVENYFAVVGE-GSAVCLTVVTDREASGGPSIILGNFQMQNYYV 448
FP L + GA +T P NYF VV + G VCL + + ASG P I+GN Q Y++
Sbjct: 319 FPALTIRLA-GATMTPPSSNYFLVVDDSGDTVCLAMGS---ASGLPVSIIGNVMQQGYHI 374
Query: 449 EYDLRNQRLGFKQQLCK 465
YD + L F Q C+
Sbjct: 375 LYDRGSSELSFVQAKCE 391
>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
Length = 480
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 118/380 (31%), Positives = 179/380 (47%), Gaps = 45/380 (11%)
Query: 93 LSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKC 152
++ G Q + I+DTGS L W C C C S + P F P SSS L C + C
Sbjct: 135 VTIGLGNQNMTVIIDTGSDLTWVQCD---PCMSCYSQQGPVFNPSNSSSYNSLLCNSSTC 191
Query: 153 SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNF 211
+++Q N E A N C ++ V YG G T+G E L+ + NF
Sbjct: 192 -----QNLQFTTGNTE--ACESNNPSSC-NHTVSYGDGSFTDGELGVEHLSFGGISVSNF 243
Query: 212 LVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLIL 265
+ GC + +GI G GR S+ SQ N FSYCL + D+ + SL++
Sbjct: 244 VFGCGRNNKGLFGGVSGIMGLGRSNLSMISQTNTTFGGVFSYCLPT---TDSGASGSLVI 300
Query: 266 DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
N SS K T + YT V+NP ++ +Y + L I VGG ++ D
Sbjct: 301 GNESSLF-KNLTPIAYTSMVSNPQLSN------FYVLNLTGIDVGGVAIQ--------DT 345
Query: 326 D-GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPG 384
GNGG ++DSGT T +AP L+ L EF+ Q ++ A AL+ L CF++ G
Sbjct: 346 SFGNGGILIDSGTVITRLAPSLYNALKAEFLKQ------FSGYPIAPALSILDTCFNLTG 399
Query: 385 EKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQ 444
+ S P L +HF+ ++ + + +GS VCL + + + + I+GN+Q +
Sbjct: 400 IEEVSIPTLSMHFENNVDLNVDAVGILYMPKDGSQVCLALASLSDEN--DMAIIGNYQQR 457
Query: 445 NYYVEYDLRNQRLGFKQQLC 464
N V YD + ++GF ++ C
Sbjct: 458 NQRVIYDAKQSKIGFAREDC 477
>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 491
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 116/393 (29%), Positives = 160/393 (40%), Gaps = 59/393 (15%)
Query: 82 SSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSS 141
+S G Y + G+PP+ + ++DTGS + W C C C P F P SSS
Sbjct: 148 ASQGSGEYFSRVGIGSPPKHVYMVVDTGSDVNWVQCA---PCADCYQQADPIFEPSFSSS 204
Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSET 200
L C+ +C + + +CR ND L Y V YG G T G +ET
Sbjct: 205 YAPLTCETHQCKSL--DVSECR--NDSCL------------YEVSYGDGSYTVGDFATET 248
Query: 201 LNLPNRI-IPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNLDKFSYCLLSH 252
+ L + N +GC G+ G G G S PSQ+N FSYCL++
Sbjct: 249 ITLDGSASLNNVAIGCG----HDNEGLFVGAAGLLGLGGGSLSFPSQINASSFSYCLVNR 304
Query: 253 KFDDTTRTSSLILDNG-SSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG 311
DT S+L ++ SHS P + N N +YY+G+ I VGG
Sbjct: 305 ---DTDSASTLEFNSPIPSHS-------VTAPLLRN------NQLDTFYYLGMTGIGVGG 348
Query: 312 QRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
Q + + +D GNGG IVDSGT T + +++ L D FV R
Sbjct: 349 QMLSIPRSSFEVDESGNGGIIVDSGTAVTRLQSDVYNSLRDSFV------RGTQHLPSTS 402
Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREAS 431
+ C+D+ + P + HF G + LP +NY V C A
Sbjct: 403 GVALFDTCYDLSSRSSVEVPTVSFHFPDGKYLALPAKNYLIPVDSAGTFCFAFAPTTSAL 462
Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN Q Q V YDL N +GF C
Sbjct: 463 S----IIGNVQQQGTRVSYDLSNSLVGFSPNGC 491
>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 121/394 (30%), Positives = 165/394 (41%), Gaps = 51/394 (12%)
Query: 83 SHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSS 142
S G Y + L GTP + +LDTGS +VW C+ CK C + F PK S +
Sbjct: 132 SQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCS---PCKACYNQSDVIFDPKKSKTF 188
Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETL 201
+ C + C + D ++ SK C Y V YG G TEG +ETL
Sbjct: 189 ATVPCGSRLCRRLD-------DSSECVTRRSKTCL-----YQVSYGDGSFTEGDFSTETL 236
Query: 202 NLPNRIIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNL---DKFSYCLLS 251
+ + +GC G+ G GRG S PSQ KFSYCL+
Sbjct: 237 TFHGARVDHVPLGC----GHDNEGLFVGAAGLLGLGRGGLSFPSQTKSRYNGKFSYCLVD 292
Query: 252 HKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG 311
++ + G+ K + +TP + NP + +YY+ L I+VGG
Sbjct: 293 RTSSGSSSKPPSTIVFGNDAVPKTS---VFTPLLTNPKL------DTFYYLQLLGISVGG 343
Query: 312 QRV-RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
RV V LD GNGG I+DSGT+ T + + L D F K + A
Sbjct: 344 SRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQSAYVALRDAFRLGATKLKR------A 397
Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA 430
+ + CFD+ G T P + HF GG EV+LP NY V C
Sbjct: 398 PSYSLFDTCFDLSGMTTVKVPTVVFHF-GGGEVSLPASNYLIPVNTEGRFCFAFA----G 452
Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ G I+GN Q Q + V YDL R+GF + C
Sbjct: 453 TMGSLSIIGNIQQQGFRVAYDLVGSRVGFLSRAC 486
>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
Length = 483
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 120/394 (30%), Positives = 180/394 (45%), Gaps = 52/394 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + L GTP + + ++DTGS L W C CK C P F P+ SSS + +
Sbjct: 127 GEYFVRLGVGTPARSLFMVVDTGSDLPWLQCQ---PCKSCYKQADPIFDPRNSSSFQRIP 183
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSE--TLNL 203
C +P C ++++ C+ ATS+ C SY V YG G + G S+ TL
Sbjct: 184 CLSPLC-----KALEIHSCSGSRGATSR-----C-SYQVAYGDGSFSVGDFSSDLFTLGT 232
Query: 204 PNRIIPNFLVGCSV---LSSRQPAGIAGFGRGKTSLPSQL--------NLDKFSYCLLSH 252
++ + + GC AG+ G G GK S PSQ+ + FSYCL+
Sbjct: 233 GSKAM-SVAFGCGFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDR 291
Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
T +SSLI + S T L+ P + NP + +YY + ++VGG
Sbjct: 292 SNPMTRSSSSLIFGAAAIPS---TAALS--PLLKNPKL------DTFYYAAMIGVSVGGA 340
Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL-GAE 371
++ + K L L + G+GG I+DSGT+ T ++ + D F RN T L A
Sbjct: 341 QLPISLKSLQLSQSGSGGVIIDSGTSVTRFPTSVYATIRDAF-------RNATTNLPSAP 393
Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREAS 431
+ C++ G+ + P L LHF+ GA++ LP NY + + CL
Sbjct: 394 RYSLFDTCYNFSGKASVDVPALVLHFENGADLQLPPTNYLIPINTAGSFCLAFAPTSMEL 453
Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
G I+GN Q Q++ + +DL+ L F Q CK
Sbjct: 454 G----IIGNIQQQSFRIGFDLQKSHLAFAPQQCK 483
>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 486
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 116/392 (29%), Positives = 162/392 (41%), Gaps = 58/392 (14%)
Query: 82 SSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSS 141
+S G Y + G PP + +LDTGS + W C C C P F P S+S
Sbjct: 144 ASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCA---PCAECYEQTDPIFEPTSSAS 200
Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSET 200
L C+ +C +S+ +C +N T + Y V YG G T G ++ET
Sbjct: 201 FTSLSCETEQC-----KSLDVSEC--------RNGTCL---YEVSYGDGSYTVGDFVTET 244
Query: 201 LNLPNRIIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNLDKFSYCLLSHK 253
+ L + + N +GC G+ G G G S PSQLN FSYCL+
Sbjct: 245 VTLGSTSLGNIAIGCG----HNNEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYCLVDRD 300
Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
D T+ LD S + T P NP++ ++Y+GL ++VGG
Sbjct: 301 SDSTS-----TLDFNSPITPDAVTA----PLHRNPNL------DTFFYLGLTGMSVGGAV 345
Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
+ + + DGNGG IVDSGT T + ++ L D FV + A +
Sbjct: 346 LPIPETSFQMSEDGNGGIIVDSGTAVTRLQTTVYNVLRDAFVKSTHDLQT------ARGV 399
Query: 374 TGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREASG 432
C+D+ + P + HF G E+ LP +NY V C TD S
Sbjct: 400 ALFDTCYDLSSKSRVEVPTVSFHFANGNELPLPAKNYLIPVDSEGTFCFAFAPTDSTLS- 458
Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
ILGN Q Q V +DL N +GF C
Sbjct: 459 ----ILGNAQQQGTRVGFDLANSLVGFSPNKC 486
>gi|359488213|ref|XP_002263620.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 434
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 136/446 (30%), Positives = 191/446 (42%), Gaps = 52/446 (11%)
Query: 28 SLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYG 87
SL+FSL+ S + NSL SSSL +NP TKTT+ SS Y
Sbjct: 28 SLSFSLTSIPL-----SSHSKNSLFSSSLASQFK-QNPNTKTTSYNYR------SSFKYS 75
Query: 88 -GYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
+SL GTPPQ +LDTGS L W QCK + +F P LSSS +L
Sbjct: 76 MALIVSLPIGTPPQTQQMVLDTGSQLSWI------QCKVPPKTPPTAFDPLLSSSFSVLP 129
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C + C D L TS + ++C Y Y G EG + E +
Sbjct: 130 CNHSLCK---------PRVPDYTLPTSCDQNRLC-HYSYFYADGTYAEGNLVREKFTFSS 179
Query: 206 -RIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD-TTRTSSL 263
+ P ++GC+ SS GI G G+ S S + KFSYC+ + ++ T S
Sbjct: 180 SQTTPPLILGCATDSSDT-QGILGMNLGRLSFSSLAKISKFSYCVPPRRSQSGSSPTGSF 238
Query: 264 ILD-NGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
L N SS K +TY P N + Y + + I + G+++ +
Sbjct: 239 YLGPNPSSAGFKYVNLMTYRQSQRMP-----NLDPLAYTLPMLGIRINGKKLNISTSAFR 293
Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD- 381
D G G T++DSGT FTF+ E + + +E V G L CFD
Sbjct: 294 ADPSGAGQTLIDSGTWFTFLVDEAYSKVKEEIVKLAGPKLKKGYVYGGS----LDMCFDG 349
Query: 382 ---VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
V G G+ + F+ G E+ + E A VG G CL + + G S I+
Sbjct: 350 DAMVIGRMIGN---MAFEFENGVEIVVEREKMLADVG-GGVQCLG-IGRSDLLGVASNII 404
Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
GNF Q+ +VE+DL +R+GF + C
Sbjct: 405 GNFHQQDLWVEFDLVGRRVGFGRTDC 430
>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
gi|194690124|gb|ACF79146.1| unknown [Zea mays]
gi|194708040|gb|ACF88104.1| unknown [Zea mays]
gi|223950469|gb|ACN29318.1| unknown [Zea mays]
gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
Length = 500
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 118/373 (31%), Positives = 174/373 (46%), Gaps = 45/373 (12%)
Query: 105 ILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRD 164
I+DT S L W C C+ C + P F P S S + C +P C + +
Sbjct: 157 IVDTASELTWVQCA---PCESCHDQQGPLFDPSSSPSYAAVPCDSPSCDALQQQLATGAG 213
Query: 165 CNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSSRQP 223
P + C SY + Y G + G+ + L+L +I F+ GC + P
Sbjct: 214 AGAPPCDAGRPAA--C-SYALSYRDGSYSRGVLAHDRLSLAGEVIDGFVFGCGTSNQGPP 270
Query: 224 ----AGIAGFGRGKTSLPSQLNLDKF----SYCL-LSHKFDDTTRTSSLIL-DNGSSHSD 273
+G+ G GR + SL SQ +D+F SYCL LS + D + SL+L D+ S++
Sbjct: 271 FGGTSGLMGLGRSQLSLVSQ-TVDQFGGVFSYCLPLSRESD---ASGSLVLGDDPSAY-- 324
Query: 274 KKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIV 333
+ +T + YT V+N + F Y V L ITVGGQ V + IV
Sbjct: 325 RNSTPVVYTSMVSNSDPLLQGPF---YLVNLTGITVGGQEVE--------STGFSARAIV 373
Query: 334 DSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPEL 393
DSGT T + P ++ + EF+SQ+ + Y +A G + L CF++ G K P L
Sbjct: 374 DSGTVITSLVPSVYNAVRAEFMSQLAE---YPQAPG---FSILDTCFNMTGLKEVQVPSL 427
Query: 394 KLHFKGGAEVTLPVEN--YFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYD 451
L F GGAEV + YF V + S VCL V + + S + I+GN+Q +N V +D
Sbjct: 428 TLVFDGGAEVEVDSGGVLYF-VSSDSSQVCLAVASLK--SEDETSIIGNYQQKNLRVVFD 484
Query: 452 LRNQRLGFKQQLC 464
++GF Q+ C
Sbjct: 485 TSASQVGFAQETC 497
>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 470
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 114/386 (29%), Positives = 180/386 (46%), Gaps = 50/386 (12%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y +++ G+ Q + I+DTGS L W C C+ C + P F P S S + + C
Sbjct: 122 YIVTMGLGS--QNMSVIVDTGSDLTWVQCE---PCRSCYNQNGPLFKPSTSPSYQPILCN 176
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRI 207
+ C +S++ C +P +TS C Y+V YG G T G E L
Sbjct: 177 STTC-----QSLELGACGSDP-STSATC-----DYVVNYGDGSYTSGELGIEKLGFGGIS 225
Query: 208 IPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTS 261
+ NF+ GC + +G+ G GR + S+ SQ N FSYCL S D +
Sbjct: 226 VSNFVFGCGRNNKGLFGGASGLMGLGRSELSMISQTNATFGGVFSYCLPST--DQAGASG 283
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
SL++ N S K T + YT + N ++ +Y + L I VGG + V
Sbjct: 284 SLVMGN-QSGVFKNVTPIAYTRMLPNLQLSN------FYILNLTGIDVGGVSLHVQASSF 336
Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
GNGG I+DSGT + +AP +++ L +F+ Q ++ A + L CF+
Sbjct: 337 -----GNGGVILDSGTVISRLAPSVYKALKAKFLEQ------FSGFPSAPGFSILDTCFN 385
Query: 382 VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGE-GSAVCLTV--VTDREASGGPSIIL 438
+ G + P + ++F+G AE+ + F +V E S VCL + ++D G I+
Sbjct: 386 LTGYDQVNIPTISMYFEGNAELNVDATGIFYLVKEDASRVCLALASLSDEYEMG----II 441
Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
GN+Q +N V YD + ++GF ++ C
Sbjct: 442 GNYQQRNQRVLYDAKLSQVGFAKEPC 467
>gi|242086034|ref|XP_002443442.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
gi|241944135|gb|EES17280.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
Length = 443
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 114/382 (29%), Positives = 171/382 (44%), Gaps = 47/382 (12%)
Query: 96 GTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCS-- 153
G PPQ ++DTGS L+W CT + K C +P F S S + CQ+ C+
Sbjct: 93 GDPPQRAEALIDTGSSLIWTQCTACLR-KVCVRQDLPYFNASSSGSFAPVPCQDKACAGN 151
Query: 154 WIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLV 213
++H A CT + V YG+G G ++ +
Sbjct: 152 YLHF------------CALDGTCT-----FRVTYGAGGIIGFLGTDAFTFQSGGA-TLAF 193
Query: 214 GCSVLSS-------RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILD 266
GC + +G+ G GRG+ SL SQ +FSYCL + F + +S L +
Sbjct: 194 GCVSFTRFAAPDVLHGASGLIGLGRGRLSLASQTGAKRFSYCLTPY-FHNNGASSHLFVG 252
Query: 267 NGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR- 325
+S S ++ FV +P + +S +YY+ L ITVG ++ + L
Sbjct: 253 AAASLSGGGGAVMSMA-FVESP---KDYPYSTFYYLPLVGITVGETKLAIPSTAFDLQEV 308
Query: 326 -DG--NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
+G GG I+DSG+ FT + + +EPL E Q+ N + G E G+ C
Sbjct: 309 EEGFWEGGVIIDSGSPFTSLVEDAYEPLMGELARQL--NGSLVPPPG-EDDGGMALCV-A 364
Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
G+ P L LHF GGA++ LP ENY+A + E S C+ +V S I+GNFQ
Sbjct: 365 RGDLDRVVPTLVLHFSGGADMALPPENYWAPL-EKSTACMAIVRGYLQS-----IIGNFQ 418
Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
QN ++ +D+ RL F+ C
Sbjct: 419 QQNMHILFDVGGGRLSFQNADC 440
>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
Length = 484
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 128/431 (29%), Positives = 178/431 (41%), Gaps = 54/431 (12%)
Query: 46 QNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFI 105
++L SL + S R + + P++ + + S G Y + L GTP + +
Sbjct: 96 ESLTSLAAVSAGRNVTKRPPRSAGGFSGVVISGL---SQGSGEYFMRLGVGTPATNMYMV 152
Query: 106 LDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDC 165
LDTGS +VW C+ CK C + P F P S + + C + C + S +C
Sbjct: 153 LDTGSDVVWLQCS---PCKVCYNQSDPVFNPAKSKTFATVPCGSRLCRRLDDSS----EC 205
Query: 166 NDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSSRQPA 224
SK C Y V YG G T G +ETL + + +GC
Sbjct: 206 VSR---RSKACL-----YQVSYGDGSFTVGDFSTETLTFHGARVDHVALGC----GHDNE 253
Query: 225 GI-------AGFGRGKTSLPSQLN---LDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDK 274
G+ G GRG S PSQ KFSYCL+ ++ + G+ K
Sbjct: 254 GLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNGAVPK 313
Query: 275 KTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV-RVWHKYLTLDRDGNGGTIV 333
+TP + NP + +YY+ L I+VGG RV V LD GNGG I+
Sbjct: 314 TA---VFTPLLTNPKL------DTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVII 364
Query: 334 DSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPEL 393
DSGT+ T + + L D F TR A + + CFD+ G T P +
Sbjct: 365 DSGTSVTRLTQSAYVALRDAF------RLGATRLKRAPSYSLFDTCFDLSGMTTVKVPTV 418
Query: 394 KLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLR 453
HF GG EV+LP NY V C + G I+GN Q Q + V YDL
Sbjct: 419 VFHFTGG-EVSLPASNYLIPVNNQGRFCFAFA----GTMGSLSIIGNIQQQGFRVAYDLV 473
Query: 454 NQRLGFKQQLC 464
R+GF + C
Sbjct: 474 GSRVGFLSRAC 484
>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
Length = 500
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 117/412 (28%), Positives = 171/412 (41%), Gaps = 55/412 (13%)
Query: 62 IKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHY 121
+ N T+ T TT + +S G Y + GTP + + +LDTGS + W C
Sbjct: 135 VYNEDTRYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKDMYLVLDTGSDVNWIQCE--- 191
Query: 122 QCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICP 181
C C P F P SS+ + L C P+CS + E+ CR S C
Sbjct: 192 PCADCYQQSDPVFNPTSSSTYKSLTCSAPQCSLL--ETSACR---------SNKCL---- 236
Query: 182 SYLVLYGSG-LTEGIALSETLNLPNR-IIPNFLVGCSVLSSRQPAGI-------AGFGRG 232
Y V YG G T G ++T+ N I N +GC G+ G G G
Sbjct: 237 -YQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCG----HDNEGLFTGAAGLLGLGGG 291
Query: 233 KTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAE 292
S+ +Q+ FSYCL+ D+ ++SSL +S + G P + N +
Sbjct: 292 VLSITNQMKATSFSYCLVDR---DSGKSSSLDF-----NSVQLGGGDATAPLLRNKKI-- 341
Query: 293 RNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLAD 352
+YYVGL +VGG++V + +D G+GG I+D GT T + + + L D
Sbjct: 342 ----DTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRD 397
Query: 353 EFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFA 412
F+ V + G+ +++ C+D T P + HF GG + LP +NY
Sbjct: 398 AFLKLTVNLKK-----GSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPAKNYLI 452
Query: 413 VVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
V + C S SII GN Q Q + YDL +G C
Sbjct: 453 PVDDSGTFCFAFA---PTSSSLSII-GNVQQQGTRITYDLSKNVIGLSGNKC 500
>gi|413925432|gb|AFW65364.1| hypothetical protein ZEAMMB73_378208 [Zea mays]
Length = 418
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 122/389 (31%), Positives = 166/389 (42%), Gaps = 63/389 (16%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y ++ S GTPPQ + + DTGS L+W C CK C+ S+ P SSS L
Sbjct: 79 GAYDMTFSMGTPPQTLSALADTGSDLIWAKCG---ACKRCAPRGSASYYPTKSSSFSKLP 135
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLAT---SKNCTQICPSYLVLYG-----SGLTEGIALS 198
C S CR + LAT ++ +C SY YG T+G S
Sbjct: 136 C----------SSALCRTLESQSLATCGGTRARGAVC-SYRYSYGLSSNPHHYTQGYMGS 184
Query: 199 ETLNLPNRIIPNFLVGCSVL---SSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFD 255
ET L + + GC+ + +G+ G GRGK SL QL + FSYCL S
Sbjct: 185 ETFTLGSDAVQGIGFGCTTMSEGGYGSGSGLVGLGRGKLSLVRQLKVGAFSYCLTS---- 240
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
D + +S L+ G+ G+ TP VN + S +Y V L I++G +
Sbjct: 241 DPSTSSPLLFGAGA----LTGPGVQSTPLVNLKT-------STFYTVNLDSISIGAAKT- 288
Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
G G I DSGTT TF+A + +SQ N TR G + G
Sbjct: 289 --------PGTGRHGIIFDSGTTLTFLAEPAYTLAEAGLLSQTT---NLTRVPGTD---G 334
Query: 376 LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPS 435
CF G FP + LHF GG ++ L ENYF V + + L + E S
Sbjct: 335 YEVCFQTSGGAV--FPSMVLHFDGG-DMALKTENYFGAVNDSVSCWLVQKSPSEMS---- 387
Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN +Y++ YDL L F+ C
Sbjct: 388 -IVGNIMQMDYHIRYDLDKSVLSFQPTNC 415
>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
Short=AtASPG1; Flags: Precursor
gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 500
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 117/412 (28%), Positives = 171/412 (41%), Gaps = 55/412 (13%)
Query: 62 IKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHY 121
+ N T+ T TT + +S G Y + GTP + + +LDTGS + W C
Sbjct: 135 VYNEDTRYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCE--- 191
Query: 122 QCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICP 181
C C P F P SS+ + L C P+CS + E+ CR S C
Sbjct: 192 PCADCYQQSDPVFNPTSSSTYKSLTCSAPQCSLL--ETSACR---------SNKCL---- 236
Query: 182 SYLVLYGSG-LTEGIALSETLNLPNR-IIPNFLVGCSVLSSRQPAGI-------AGFGRG 232
Y V YG G T G ++T+ N I N +GC G+ G G G
Sbjct: 237 -YQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCG----HDNEGLFTGAAGLLGLGGG 291
Query: 233 KTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAE 292
S+ +Q+ FSYCL+ D+ ++SSL +S + G P + N +
Sbjct: 292 VLSITNQMKATSFSYCLVDR---DSGKSSSLDF-----NSVQLGGGDATAPLLRNKKI-- 341
Query: 293 RNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLAD 352
+YYVGL +VGG++V + +D G+GG I+D GT T + + + L D
Sbjct: 342 ----DTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRD 397
Query: 353 EFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFA 412
F+ V + G+ +++ C+D T P + HF GG + LP +NY
Sbjct: 398 AFLKLTVNLKK-----GSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPAKNYLI 452
Query: 413 VVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
V + C S SII GN Q Q + YDL +G C
Sbjct: 453 PVDDSGTFCFAFA---PTSSSLSII-GNVQQQGTRITYDLSKNVIGLSGNKC 500
>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
Japonica Group]
Length = 446
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 115/390 (29%), Positives = 171/390 (43%), Gaps = 41/390 (10%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + GTP ++DTGS LVW C+ C+ C + + F P+ SS+ R +
Sbjct: 84 GEYFALVGVGTPSTKAMLVIDTGSDLVWLQCS---PCRRCYAQRGQVFDPRRSSTYRRVP 140
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTE-GIALSETLNLPN 205
C +P+C + D A C Y+V YG G + G ++ L N
Sbjct: 141 CSSPQCRALRFPGC------DSGGAAGGGCR-----YMVAYGDGSSSTGDLATDKLAFAN 189
Query: 206 RI-IPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTT 258
+ N +GC + AG+ G GRGK S+ +Q+ F YCL + +T
Sbjct: 190 DTYVNNVTLGCGRDNEGLFDSAAGLLGVGRGKISISTQVAPAYGSVFEYCL-GDRTSRST 248
Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW- 317
R+S L+ T L P PS+ YYV + +VGG+RV +
Sbjct: 249 RSSYLVFGRTPEPPSTAFTALLSNP--RRPSL---------YYVDMAGFSVGGERVTGFS 297
Query: 318 HKYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
+ L LD G GG +VDSGT + A + + L D F ++ G ++
Sbjct: 298 NASLALDTATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRR-LAGEHSV--F 354
Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVV--GEGSAVCLTVVTDREASGGP 434
C+D+ G S P + LHF GGA++ LP ENYF V G A EA+
Sbjct: 355 DACYDLRGRPAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDG 414
Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
++GN Q Q + V +D+ +R+GF + C
Sbjct: 415 LSVIGNVQQQGFRVVFDVEKERIGFAPKGC 444
>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
Length = 357
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 114/389 (29%), Positives = 166/389 (42%), Gaps = 54/389 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + G+P ++ ++DTGS + W C+ CK C F P+ SSS R L
Sbjct: 12 GEYFVRVGIGSPTKLQYLVMDTGSDVPWIQCS---PCKSCYKQNDAVFDPRASSSFRRLS 68
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C P+C + ++ +T C Y V YG G T G S++ ++
Sbjct: 69 CSTPQCKLLDVKACA---------STDNRCL-----YQVSYGDGSFTVGDLASDSFSVSR 114
Query: 206 RIIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTT 258
+ GC G+ G G GK S PSQL+ KFSYCL+S D+
Sbjct: 115 GRTSPVVFGCG----HDNEGLFVGAAGLLGLGAGKLSFPSQLSSRKFSYCLVSR--DNGV 168
Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
R SS +L S+ + YT + NP + +YY GL I++GG + +
Sbjct: 169 RASSALLFGDSAL--PTSASFAYTQLLKNPKL------DTFYYAGLSGISIGGTLLSIPS 220
Query: 319 KYLTLDRD-GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL-GAEALTGL 376
L G GG I+DSGT+ T + + + D F R+ T+ L A +
Sbjct: 221 TAFKLSSSTGRGGVIIDSGTSVTRLPTYAYTVMRDAF-------RSATQKLPRAADFSLF 273
Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREASGGPS 435
C+D + + P + HF+GGA V LP NY V C T + S
Sbjct: 274 DTCYDFSALTSVTIPTVSFHFEGGASVQLPPSNYLVPVDTSGTFCFAFSKTSLDLS---- 329
Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN Q Q V DL + R+GF + C
Sbjct: 330 -IIGNIQQQTMRVAIDLDSSRVGFAPRQC 357
>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
Length = 485
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 125/395 (31%), Positives = 167/395 (42%), Gaps = 53/395 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + GTP +LDTGS +VW C C+ C P F P+ SSS +G
Sbjct: 127 GEYFTKIGVGTPATQALMVLDTGSDVVWVQCA---PCRRCYEQSGPVFDPRRSSSYGAVG 183
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C C + R C Y V YG G +T G ++ETL
Sbjct: 184 CGAALCRRLDSGGCDLR---------RGACM-----YQVAYGDGSVTAGDFVTETLTFAG 229
Query: 206 RI-IPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSH------ 252
+ +GC + AG+ G GRG S P+Q++ FSYCL+
Sbjct: 230 GARVARVALGCGHDNEGLFVAAAGLLGLGRGGLSFPTQISRRYGRSFSYCLVDRTSSGAG 289
Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
+ R+S++ GS + + +TP V NP + +YYV L I+VGG
Sbjct: 290 AAPGSHRSSTVSFGAGSVGASSAS----FTPMVRNPRM------ETFYYVQLVGISVGGA 339
Query: 313 RV-RVWHKYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
RV V L LD G GG IVDSGT+ T +A + L D F + L
Sbjct: 340 RVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLARASYSALRDAFRAAAAGGLR----LSP 395
Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDRE 429
+ C+D+ G + P + +HF GGAE LP ENY V C TD
Sbjct: 396 GGFSLFDTCYDLGGRRVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTD-- 453
Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
GG SII GN Q Q + V +D QR+GF + C
Sbjct: 454 --GGVSII-GNIQQQGFRVVFDGDGQRVGFAPKGC 485
>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
Length = 471
Score = 129 bits (324), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 113/388 (29%), Positives = 160/388 (41%), Gaps = 49/388 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + G+PP ++D+GS ++W C C C + P F P S++ +
Sbjct: 123 GEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCK---PCLECYAQADPLFDPASSATFSAVS 179
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C + C +++ C D S C Y V YG G T+G ETL L
Sbjct: 180 CGSAIC-----RTLRTSGCGD-----SGGC-----EYEVSYGDGSYTKGTLALETLTLGG 224
Query: 206 RIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDK---FSYCLLSHKFDDT-- 257
+ +GC + AG+ G G G SL QL FSYCL S +
Sbjct: 225 TAVEGVAIGCGHRNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGGSGSGA 284
Query: 258 -TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
SL+L S+ G + P V NP +YYVG+ I VG +R+ +
Sbjct: 285 ADAAGSLVL----GRSEAVPEGAVWVPLVRNPQAPS------FYYVGVSGIGVGDERLPL 334
Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
L DG GG ++D+GT T + E + L D FV + RA G L
Sbjct: 335 QDGLFQLTEDGGGGVVMDTGTAVTRLPQEAYAALRDAFVGAV---GALPRAPGVSLLD-- 389
Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
C+D+ G + P + +F G A +TLP N V +G CL +S G S
Sbjct: 390 -TCYDLSGYTSVRVPTVSFYFDGAATLTLPARNLLLEV-DGGIYCLAFA---PSSSGLS- 443
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
ILGN Q + + D N +GF C
Sbjct: 444 ILGNIQQEGIQITVDSANGYIGFGPATC 471
>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
Length = 457
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 129/394 (32%), Positives = 180/394 (45%), Gaps = 64/394 (16%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNH-YQCKYCSSSKIPSFIPKLSSSSRLLGC 147
Y + ++ GTPP + I DTGS LVW C++ + F P SS+ L C
Sbjct: 103 YLMYVNVGTPPTQLLAIADTGSDLVWVNCSSSGGGLADADAGGNVVFQPTRSSTYSQLSC 162
Query: 148 QNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNR 206
Q+ C + S D + E C Y YG G T G+ +ET + +
Sbjct: 163 QSNACQALSQASC---DADSE-------C-----QYQYSYGDGSRTIGVLSTETFSFVDG 207
Query: 207 ------IIPNFLVGCSVLSSR--QPAGIAGFGRGKTSLPSQL----NLD-KFSYCLLSHK 253
+P GCS S+ + G+ G G G SL SQL ++D K SYCL+
Sbjct: 208 GGKGQVRVPRVNFGCSTASAGTFRSDGLVGLGAGAFSLVSQLGATTHIDRKLSYCLIPSY 267
Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
D +S+L N S + G TP V PS + YY V L + VGGQ
Sbjct: 268 --DANSSSTL---NFGSRAVVSEPGAASTPLV--PSDVDS-----YYTVALESVAVGGQE 315
Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
V + IVDSGTT TF+ P L PL V+++ + R E L
Sbjct: 316 VATHDSRI----------IVDSGTTLTFLDPALLGPL----VTELERRIKLQRVQPPEQL 361
Query: 374 TGLRPCFDVPGE-KTGSF--PELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA 430
L+ C+DV G+ +T +F P++ L F GGA VTL EN F+++ EG+ +CL +V E+
Sbjct: 362 --LQLCYDVQGKSETDNFGIPDVTLRFGGGAAVTLRPENTFSLLQEGT-LCLVLVPVSES 418
Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
P ILGN QN++V YDL + + F C
Sbjct: 419 Q--PVSILGNIAQQNFHVGYDLDARTVTFAAADC 450
>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 500
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 115/412 (27%), Positives = 170/412 (41%), Gaps = 55/412 (13%)
Query: 62 IKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHY 121
+ N T+ TT + S G Y + GTP + + +LDTGS + W C
Sbjct: 135 VNNEDTRYQPEALTTPVVSGVSQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCE--- 191
Query: 122 QCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICP 181
C C P F P SS+ + L C P+CS + E+ CR S C
Sbjct: 192 PCSDCYQQSDPVFNPTSSSTYKSLTCSAPQCSLL--ETSACR---------SNKCL---- 236
Query: 182 SYLVLYGSG-LTEGIALSETLNLPNR-IIPNFLVGCSVLSSRQPAGI-------AGFGRG 232
Y V YG G T G ++T+ N I + +GC G+ G G G
Sbjct: 237 -YQVSYGDGSFTVGELATDTVTFGNSGKINDVALGCG----HDNEGLFTGAAGLLGLGGG 291
Query: 233 KTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAE 292
S+ +Q+ FSYCL+ D+ ++SSL +S + +G P + N +
Sbjct: 292 ALSITNQMKATSFSYCLVDR---DSGKSSSLDF-----NSVQLGSGDATAPLLRNQKI-- 341
Query: 293 RNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLAD 352
+YYVGL +VGGQ+V + +D G+GG I+D GT T + + + L D
Sbjct: 342 ----DTFYYVGLSGFSVGGQKVMMPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRD 397
Query: 353 EFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFA 412
F+ + G +++ C+D + P + HF GG + LP +NY
Sbjct: 398 AFLKLTTNLKK-----GTSSISLFDTCYDFSSLSSVKVPTVAFHFTGGKSLDLPAKNYLI 452
Query: 413 VVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
V + C S SII GN Q Q + YDL N+ +G C
Sbjct: 453 PVDDNGTFCFAFA---PTSSSLSII-GNVQQQGTRITYDLANKIIGLSGNKC 500
>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 135/479 (28%), Positives = 198/479 (41%), Gaps = 68/479 (14%)
Query: 9 CLSFIFFFTLL-----SIFPSSITSLTFSLSRF--------HTNPSQDSYQN-LNSLVSS 54
C + F F LL ++ P + S T LS P Q+S+ N + ++ S
Sbjct: 6 CAATFFLFALLFSTTKAVDPCATQSDTSDLSVIPIYSKCSPFVPPKQESWVNTVITMASK 65
Query: 55 SLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVW 114
R ++ + TT + Y + + GTP Q + +LDT + W
Sbjct: 66 DPERLKYLSTLADQKTTAVPIAPGQQV--LKIANYVVRVKLGTPGQQMFMVLDTSNDAAW 123
Query: 115 FPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSK 174
PC+ C CSS+ +F+P S++ L C +CS + S P S
Sbjct: 124 VPCSG---CTGCSST---TFLPNASTTLGSLDCSGAQCSQVRGFSC--------PATGSS 169
Query: 175 NCTQICPSYLVLYG--SGLTEGIALSETLNLPNRIIPNFLVGC-SVLS--SRQPAGIAGF 229
C + YG S LT + + + + L N +IP F GC + +S S P G+ G
Sbjct: 170 ACL-----FNQSYGGDSSLTATL-VQDAITLANDVIPGFTFGCINAVSGGSIPPQGLLGL 223
Query: 230 GRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVN 286
GRG SL SQ FSYCL S F + SL L +TT L P +
Sbjct: 224 GRGPISLISQAGAMYSGVFSYCLPS--FKSYYFSGSLKLGPVGQPKSIRTTPLLRNP--H 279
Query: 287 NPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPEL 346
PS+ YYV L ++VG +V + + L D + GTI+DSGT T +
Sbjct: 280 RPSL---------YYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPV 330
Query: 347 FEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLP 406
+ + DEF Q+ +LGA CF E P + LHF+G + LP
Sbjct: 331 YFAIRDEFRKQV---NGPISSLGA-----FDTCFAATNEAEA--PAITLHFEG-LNLVLP 379
Query: 407 VENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
+EN GS CL++ ++ N Q QN + +D N RLG ++LC
Sbjct: 380 MENSLIHSSSGSLACLSMAAAPNNVNSVLNVIANLQQQNLRIMFDTTNSRLGIARELCN 438
>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
Length = 453
Score = 128 bits (322), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 118/399 (29%), Positives = 170/399 (42%), Gaps = 66/399 (16%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + L+ GTPPQ I +LDTGS L+W C C C P F P++SSS + C
Sbjct: 98 YVLDLAVGTPPQPITALLDTGSDLIWTQCDT---CTACLRQPDPLFSPRMSSSYEPMRCA 154
Query: 149 NPKCSWI-HHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTE-GIALSETLNLPN- 205
C I HH ++ C +Y YG G T G +E +
Sbjct: 155 GQLCGDILHHSCVRPDTC----------------TYRYSYGDGTTTLGYYATERFTFASS 198
Query: 206 ----RIIPNFLVGCSVL---SSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTT 258
+ +P GC + S +GI GFGR SL SQL++ +FSYCL + ++
Sbjct: 199 SGETQSVP-LGFGCGTMNVGSLNNASGIVGFGRDPLSLVSQLSIRRFSYCLTPYA---SS 254
Query: 259 RTSSLILDNGSSHS--DKKTTGLTYTPFVN---NPSVAERNAFSVYYYVGLRRITVGGQR 313
R S+L + + D T + TP + NP+ +YYV +TVG +R
Sbjct: 255 RKSTLQFGSLADVGLYDDATGPVQTTPILQSAQNPT---------FYYVAFTGVTVGARR 305
Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
+R+ L DG+GG I+DSGT T + + F SQ+ A G+
Sbjct: 306 LRIPASAFALRPDGSGGVIIDSGTALTLFPAAVLAEVVRAFRSQL----RLPFANGSSPD 361
Query: 374 TGLRPCFDVPG--------EKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV 425
G+ CF P + + P + HF+ GA++ LP ENY +C+ +
Sbjct: 362 DGV--CFAAPAVAAGGGRMARQVAVPRMVFHFQ-GADLDLPRENYVLEDHRRGHLCVLL- 417
Query: 426 TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
SG +GNF Q+ V YDL + L F C
Sbjct: 418 ---GDSGDDGATIGNFVQQDMRVVYDLERETLSFAPVEC 453
>gi|2245012|emb|CAB10432.1| hypothetical protein [Arabidopsis thaliana]
gi|7268406|emb|CAB78698.1| hypothetical protein [Arabidopsis thaliana]
Length = 1046
Score = 128 bits (322), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 125/406 (30%), Positives = 179/406 (44%), Gaps = 80/406 (19%)
Query: 106 LDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDC 165
LDTGS LVWFPC + C C S +P P SSS + H S+ D
Sbjct: 130 LDTGSDLVWFPC-RPFTCILCESKPLPPSPPSSLSSSATTVSCSSPSCSAAHSSLPSSD- 187
Query: 166 NDEPLATSKNC-------------TQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFL 212
L NC + CP + YG G S++L+LP+ + NF
Sbjct: 188 ----LCAISNCPLDFIETGDCNTSSYPCPPFYYAYGDGSLVAKLYSDSLSLPSVSVSNFT 243
Query: 213 VGCSVLSSRQPAGIAGFGRGKTSLPSQLNL------DKFSYCLLSHKFDD--TTRTSSLI 264
GC+ + +P G+AGFGRG+ SLP+QL + + FSYCL+SH FD R S LI
Sbjct: 244 FGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPHLGNSFSYCLVSHSFDSDRVRRPSPLI 303
Query: 265 LDNGSSHSDKKT----------------TGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
L +K+ +T + NP +Y V L+ I+
Sbjct: 304 LGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFTEMLENPK------HPYFYSVSLQGIS 357
Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
+G + + +D++G GG +VDSGTTFT + + + + +EF S++ R + RA
Sbjct: 358 IGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRV--GRVHERAD 415
Query: 369 GAEALTGLRPCFDVPGEKTGSFPELKLHFKGG-AEVTLPVENYFAVVGEG--------SA 419
E + L LHF G + VTLP NYF +G
Sbjct: 416 RVEPSSA-----------------LVLHFAGNRSSVTLPRRNYFYEFMDGGDGKEEKRKI 458
Query: 420 VCLTVVT---DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQ 462
CL ++ + E GG ILGN+Q Q + V YDL N+R+GF ++
Sbjct: 459 GCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGFAKR 504
>gi|108707839|gb|ABF95634.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 330
Score = 128 bits (322), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 87/258 (33%), Positives = 130/258 (50%), Gaps = 29/258 (11%)
Query: 208 IPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSL 263
+P GC + ++ GIAGFGRG SLPSQL + FS+C + + + S++
Sbjct: 91 VPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAV---NGLKQSTV 147
Query: 264 ILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTL 323
+LD + + TP + N +A +YY+ L+ ITVG R+ V L
Sbjct: 148 LLDLPADLYKNGRGAVQSTPLIQN------SANPTFYYLSLKGITVGSTRLPVPESAFAL 201
Query: 324 DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP 383
+G GGTI+DSGT+ T + P++++ + DEF +Q+ + TG CF P
Sbjct: 202 -TNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQI------KLPVVPGNATGPYTCFSAP 254
Query: 384 GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGE---GSAVCLTVVTDREASGGPSIILGN 440
+ P+L LHF+ GA + LP ENY V + S +CL + E + I+GN
Sbjct: 255 SQAKPDVPKLVLHFE-GATMDLPRENYVFEVPDDAGNSIICLAINKGDETT-----IIGN 308
Query: 441 FQMQNYYVEYDLRNQRLG 458
FQ QN +V YDL+N G
Sbjct: 309 FQQQNMHVLYDLQNMHRG 326
>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
Length = 448
Score = 128 bits (321), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 115/396 (29%), Positives = 176/396 (44%), Gaps = 52/396 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y ++ G PP ++DTGS L+W C C++C P + P+ SS+ R +
Sbjct: 86 GEYFAVINVGDPPTRALVVIDTGSDLIWLQCV---PCRHCYRQVTPLYDPRSSSTHRRIP 142
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C +P+C + ++ C+ A + C Y+V+YG G + G ++ L P+
Sbjct: 143 CASPRC----RDVLRYPGCD----ARTGGCV-----YMVVYGDGSASSGDLATDRLVFPD 189
Query: 206 RI-IPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTT 258
+ N +GC +V AG+ G GRG+ S P+QL FSYCL D +
Sbjct: 190 DTHVHNVTLGCGHDNVGLLESAAGLLGVGRGQLSFPTQLAPAYGHVFSYCL----GDRLS 245
Query: 259 RTSSLILDNGSSH----SDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
R NGSS+ + +TP NP YYV + +VGG+RV
Sbjct: 246 RAQ-----NGSSYLVFGRTPEPPSTAFTPLRTNPRRPS------LYYVDMVGFSVGGERV 294
Query: 315 RVW-HKYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
+ + L L+ G GG +VDSGT + A + + + D F S R L A
Sbjct: 295 TGFSNASLALNPATGRGGIVVDSGTAISRFARDAYAAVRDAFDSHAAAA-GTMRKL-ATK 352
Query: 373 LTGLRPCFDVPGEKTGS----FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR 428
+ C+D+ G + P + LHF GGA++ LP NY V G + +
Sbjct: 353 FSVFDACYDLRGNGAPAAAVRVPSIVLHFAGGADMALPQANYLIPVQGGDRRTYFCLGLQ 412
Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
A G + +LGN Q Q + + +D+ R+GF C
Sbjct: 413 AADDGLN-VLGNVQQQGFGLVFDVERGRIGFTPNGC 447
>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
Length = 407
Score = 128 bits (321), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 119/393 (30%), Positives = 179/393 (45%), Gaps = 52/393 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + L GTP + + ++DTGS L W C CK C P F P+ SSS + +
Sbjct: 52 GEYFVRLGLGTPARSLFMVVDTGSDLPWLQCQ---PCKSCYKQADPIFDPRNSSSFQRIP 108
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSE--TLNL 203
C +P C ++++ C+ ATS+ C SY V YG G + G S+ TL
Sbjct: 109 CLSPLC-----KALEVHSCSGSRGATSR-----C-SYQVAYGDGSFSVGDFSSDLFTLGT 157
Query: 204 PNRIIPNFLVGCSV---LSSRQPAGIAGFGRGKTSLPSQL--------NLDKFSYCLLSH 252
++ + + GC AG+ G G GK S PSQ+ + FSYCL+
Sbjct: 158 GSKAM-SVAFGCGFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDR 216
Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
T +SSLI + S T L+ P + NP + +YY + ++VGG
Sbjct: 217 SNPMTRSSSSLIFGVAAIPS---TAALS--PLLKNPKL------DTFYYAAMIGVSVGGA 265
Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL-GAE 371
++ + K L L + G+GG I+DSGT+ T ++ + D F RN T L A
Sbjct: 266 QLPISLKSLQLSQSGSGGVIIDSGTSVTRFPTSVYATIRDAF-------RNATINLPSAP 318
Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREAS 431
+ C++ G+ + P L LHF+ GA++ LP NY + + CL
Sbjct: 319 RYSLFDTCYNFSGKASVDVPALVLHFENGADLQLPPTNYLIPINTAGSFCLAFAPTSMEL 378
Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
G I+GN Q Q++ + +DL+ L F Q C
Sbjct: 379 G----IIGNIQQQSFRIGFDLQKSHLAFAPQQC 407
>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
Length = 393
Score = 128 bits (321), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 131/437 (29%), Positives = 197/437 (45%), Gaps = 72/437 (16%)
Query: 46 QNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSY---GGYSISLSFGTPPQII 102
+ + LV+ S R + + +++ ++ TT++ S + GGY + +S GTP +
Sbjct: 10 EAIRGLVAKSHARVRWMA-ARANSSSWSSMAGTTDVESPLHPDGGGYVMDISVGTPGKRF 68
Query: 103 PFILDTGSHLVWF---PCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHES 159
I DTGS LVW PCT CS I F P+ SS+ R + C + C+ +
Sbjct: 69 RAIADTGSDLVWVQSEPCTG------CSGGTI--FDPRQSSTFREMDCSSQLCTELPGSC 120
Query: 160 IQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL-----PNRIIPNFLVG 214
EP S C SY YGSG TEG +T++L ++ P+F VG
Sbjct: 121 --------EP--GSSAC-----SYSYEYGSGETEGEFARDTISLGTTSGGSQKFPSFAVG 165
Query: 215 CSVLSSRQPA--GIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGS 269
C +++S G+ G G+G SL SQL+ KFSYCL+ + + +S L+ +
Sbjct: 166 CGMVNSGFDGVDGLVGLGQGPVSLTSQLSAAIDSKFSYCLV--DINSQSESSPLLFGPSA 223
Query: 270 SHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNG 329
+ TP PS + + YY + + I V GQ T+ G
Sbjct: 224 ALHGTGIQSTKITP----PS----DTYPTYYLLTVNGIAVAGQ---------TMGSPGT- 265
Query: 330 GTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGS 389
TI+DSGTT T++ ++ +S+M R G+ GL C+D +
Sbjct: 266 -TIIDSGTTLTYVPSGVY----GRVLSRMESMVTLPRVDGSS--MGLDLCYDRSSNRNYK 318
Query: 390 FPELKLHFKGGAEVTLPVENYFAVVGE-GSAVCLTVVTDREASGGPSIILGNFQMQNYYV 448
FP L + GA +T P NYF VV + G VCL + + A G P I+GN Q Y++
Sbjct: 319 FPALTIRLA-GATMTPPSSNYFLVVDDSGDTVCLAMGS---AGGLPVSIIGNVMQQGYHI 374
Query: 449 EYDLRNQRLGFKQQLCK 465
YD + L F Q C+
Sbjct: 375 LYDRGSSELSFVQAKCE 391
>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
Length = 452
Score = 128 bits (321), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 118/398 (29%), Positives = 175/398 (43%), Gaps = 58/398 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + G PP ++DTGS L+W C C+ C P + P+ S + R +
Sbjct: 90 GEYFAVIGVGDPPTHALVVIDTGSDLIWLQC---LPCRRCYRQVTPLYDPRNSKTHRRIP 146
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C +P+C + ++ C+ A + C Y+V+YG G + G ++TL LP+
Sbjct: 147 CASPQCRGV----LRYPGCD----ARTGGCV-----YMVVYGDGSASSGDLATDTLVLPD 193
Query: 206 RI-IPNFLVGC-----SVLSSRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDD 256
+ N +GC +L+S AG+ G GRG+ S P+QL FSYCL
Sbjct: 194 DTRVHNVTLGCGHDNEGLLAS--AAGLLGAGRGQLSFPTQLAPAYGHVFSYCLGDRM--S 249
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
R SS L G + T +TP NP YYV + +VGG+RV
Sbjct: 250 RARNSSSYLVFGRTPELPST---AFTPLRTNPRRPS------LYYVDMVGFSVGGERVAG 300
Query: 317 W-HKYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQ-----MVKNRNYTRALG 369
+ + L L+ G GG +VDSGT + + + + D FVS M + RN
Sbjct: 301 FSNASLALNPATGRGGVVVDSGTAISRFTRDAYAAVRDAFVSHAAAAGMRRLRN------ 354
Query: 370 AEALTGLRPCFDVPGEKTGS---FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT 426
+ C+DV G G+ P + LHF A++ LP NY V G +
Sbjct: 355 --KFSVFDTCYDVHGNGPGTGVRVPSIVLHFAAAADMALPQANYLIPVVGGDRRTYFCLG 412
Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ A G + +LGN Q Q + V +D+ R+GF C
Sbjct: 413 LQAADDGLN-VLGNVQQQGFGVVFDVERGRIGFTPNGC 449
>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 453
Score = 128 bits (321), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 118/399 (29%), Positives = 170/399 (42%), Gaps = 66/399 (16%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + L+ GTPPQ I +LDTGS L+W C C C P F P++SSS + C
Sbjct: 98 YVLDLAVGTPPQPITALLDTGSDLIWTQCDT---CTACLRQPDPLFSPRMSSSYEPMRCA 154
Query: 149 NPKCSWI-HHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTE-GIALSETLNLPN- 205
C I HH ++ C +Y YG G T G +E +
Sbjct: 155 GQLCGDILHHSCVRPDTC----------------TYRYSYGDGTTTLGYYATERFTFASS 198
Query: 206 ----RIIPNFLVGCSVL---SSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTT 258
+ +P GC + S +GI GFGR SL SQL++ +FSYCL + ++
Sbjct: 199 SGETQSVP-LGFGCGTMNVGSLNNASGIVGFGRDPLSLVSQLSIRRFSYCLTPYA---SS 254
Query: 259 RTSSLILDNGSSHS--DKKTTGLTYTPFVN---NPSVAERNAFSVYYYVGLRRITVGGQR 313
R S+L + + D T + TP + NP+ +YYV +TVG +R
Sbjct: 255 RKSTLQFGSLADVGLYDDATGPVQTTPILQSAQNPT---------FYYVAFTGVTVGARR 305
Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
+R+ L DG+GG I+DSGT T + + F SQ+ A G+
Sbjct: 306 LRIPASAFALRPDGSGGVIIDSGTALTLFPVAVLAEVVRAFRSQL----RLPFANGSSPD 361
Query: 374 TGLRPCFDVPG--------EKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV 425
G+ CF P + + P + HF+ GA++ LP ENY +C+ +
Sbjct: 362 DGV--CFAAPAVAAGGGRMARQVAVPRMVFHFQ-GADLDLPRENYVLEDHRRGHLCVLL- 417
Query: 426 TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
SG +GNF Q+ V YDL + L F C
Sbjct: 418 ---GDSGDDGATIGNFVQQDMRVVYDLERETLSFAPVEC 453
>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
Length = 357
Score = 127 bits (320), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 114/389 (29%), Positives = 165/389 (42%), Gaps = 54/389 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + G+P ++ ++DTGS + W C+ CK C F P+ SSS R L
Sbjct: 12 GEYFVRVGIGSPTKLQYLVMDTGSDVPWIQCS---PCKSCYKQNDAVFDPRASSSFRRLS 68
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C P+C + ++ +T C Y V YG G T G S++ +
Sbjct: 69 CSTPQCKLLDVKACA---------STDNRCL-----YQVSYGDGSFTVGDLASDSFLVSR 114
Query: 206 RIIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTT 258
+ GC G+ G G GK S PSQL+ KFSYCL+S D+
Sbjct: 115 GRTSPVVFGCG----HDNEGLFVGAAGLLGLGAGKLSFPSQLSSRKFSYCLVSR--DNGV 168
Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
R SS +L S+ + YT + NP + +YY GL I++GG + +
Sbjct: 169 RASSALLFGDSAL--PTSASFAYTQLLKNPKL------DTFYYAGLSGISIGGTLLSIPS 220
Query: 319 KYLTLDRD-GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL-GAEALTGL 376
L G GG I+DSGT+ T + + + D F R+ T+ L A +
Sbjct: 221 TAFKLSSSTGRGGVIIDSGTSVTRLPTYAYTVMRDAF-------RSATQKLPRAADFSLF 273
Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREASGGPS 435
C+D + + P + HF+GGA V LP NY V C T + S
Sbjct: 274 DTCYDFSALTSVTIPTVSFHFEGGASVQLPPSNYLVPVDTSGTFCFAFSKTSLDLS---- 329
Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN Q Q V DL + R+GF + C
Sbjct: 330 -IIGNIQQQTMRVAIDLDSSRVGFAPRQC 357
>gi|118484651|gb|ABK94196.1| unknown [Populus trichocarpa]
Length = 125
Score = 127 bits (320), Expect = 8e-27, Method: Composition-based stats.
Identities = 62/127 (48%), Positives = 87/127 (68%), Gaps = 8/127 (6%)
Query: 342 MAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGA 401
M ++E +A EF Q+ +YT A + TGLRPCF++ GEK+ S PE HFKGGA
Sbjct: 1 MEKPVYELVAKEFEKQVA---HYTVATEVQNQTGLRPCFNISGEKSVSVPEFIFHFKGGA 57
Query: 402 EVTLPVENYFAVVGEGSAVCLTVVTDREA----SGGPSIILGNFQMQNYYVEYDLRNQRL 457
++ LP+ NYF+ V G +CLT+V+D + GGP+IILGN+Q +N++VE+DL+N+R
Sbjct: 58 KMALPLANYFSFVDSG-VICLTIVSDNMSGSGIGGGPAIILGNYQQRNFHVEFDLKNERF 116
Query: 458 GFKQQLC 464
GFKQQ C
Sbjct: 117 GFKQQNC 123
>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
Length = 509
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 113/384 (29%), Positives = 159/384 (41%), Gaps = 47/384 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + G+P + + +LDTGS + W C C C P F P LS+S +
Sbjct: 167 GEYFSRVGIGSPARELYMVLDTGSDVTWVQCQ---PCADCYQQSDPVFDPSLSASYAAVS 223
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C +P+C RD + A +N T C Y V YG G T G +ETL L +
Sbjct: 224 CDSPRC----------RDLD---TAACRNATGAC-LYEVAYGDGSYTVGDFATETLTLGD 269
Query: 206 RI-IPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
+ N +GC + AG+ G G S PSQ++ FSYCL+ D+ S
Sbjct: 270 STPVTNVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISASTFSYCLVDR---DSPAAS 326
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
+L + +D T P V +P +YYV L I+VGGQ + +
Sbjct: 327 TLQFGADGAEADTVTA-----PLVRSPRTG------TFYYVALSGISVGGQALSIPSSAF 375
Query: 322 TLDR-DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
+D G+GG IVDSGT T + + L D FV R ++ C+
Sbjct: 376 AMDATSGSGGVIVDSGTAVTRLQSSAYAALRDAFV------RGTPSLPRTSGVSLFDTCY 429
Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
D+ + P + L F+GG + LP +NY V CL A I+GN
Sbjct: 430 DLSDRTSVEVPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAAVS----IIGN 485
Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
Q Q V +D +GF C
Sbjct: 486 VQQQGTRVSFDTAKGVVGFTPNKC 509
>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
Length = 446
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 114/390 (29%), Positives = 170/390 (43%), Gaps = 41/390 (10%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + GTP ++DTGS LVW C+ C+ C + + F P+ SS+ R +
Sbjct: 84 GEYFALVGVGTPSTKAMLVIDTGSDLVWLQCS---PCRRCYAQRGQVFDPRRSSTYRRVP 140
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTE-GIALSETLNLPN 205
C +P+C + D A C Y+V YG G + G ++ L N
Sbjct: 141 CSSPQCRALRFPGC------DSGGAAGGGCR-----YMVAYGDGSSSTGELATDKLAFAN 189
Query: 206 RI-IPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTT 258
+ N +GC + AG+ G RGK S+ +Q+ F YCL + +T
Sbjct: 190 DTYVNNVTLGCGRDNEGLFDSAAGLLGVARGKISISTQVAPAYGSVFEYCL-GDRTSRST 248
Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW- 317
R+S L+ T L P PS+ YYV + +VGG+RV +
Sbjct: 249 RSSYLVFGRTPEPPSTAFTALLSNP--RRPSL---------YYVDMAGFSVGGERVTGFS 297
Query: 318 HKYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
+ L LD G GG +VDSGT + A + + L D F ++ G ++
Sbjct: 298 NASLALDTATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRR-LAGEHSV--F 354
Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVV--GEGSAVCLTVVTDREASGGP 434
C+D+ G S P + LHF GGA++ LP ENYF V G A EA+
Sbjct: 355 DACYDLRGRPAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDG 414
Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
++GN Q Q + V +D+ +R+GF + C
Sbjct: 415 LSVIGNVQQQGFRVVFDVEKERIGFAPKGC 444
>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
gi|223948009|gb|ACN28088.1| unknown [Zea mays]
gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
Length = 507
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 111/384 (28%), Positives = 173/384 (45%), Gaps = 37/384 (9%)
Query: 90 SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQN 149
S+ S G+P + I+DTGS L W C C C + + P F P S++ + C
Sbjct: 149 SLGGSSGSPAANLTVIVDTGSDLTWVQCK---PCSACYAQRDPLFDPAGSATYAAVRCNA 205
Query: 150 PKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRII 208
C+ +S++ ++ ++ C Y + YG G + G+ ++T+ L +
Sbjct: 206 SACA----DSLRAATGTPGSCGSTGAGSEKC-YYALAYGDGSFSRGVLATDTVALGGASL 260
Query: 209 PNFLVGCSVLSSRQ----PAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTS 261
F+ GC LS+R AG+ G GR + SL SQ FSYCL + D + +
Sbjct: 261 GGFVFGCG-LSNRGLFGGTAGLMGLGRTELSLVSQTASRYGGVFSYCLPAATSGDASGSL 319
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
SL + ++ S + TT + YT + +P A +Y++ + VGG L
Sbjct: 320 SLGGGDDAASSYRNTTPVAYTRMIADP------AQPPFYFLNVTGAAVGG-------TAL 366
Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
G ++DSGT T +AP ++ + EF+ Q Y A G + L C+D
Sbjct: 367 AAQGLGASNVLIDSGTVITRLAPSVYRAVRAEFMRQF-GAAGYPAAPG---FSILDTCYD 422
Query: 382 VPGEKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVVTDREASGGPSIILGN 440
+ G P L L +GGA+VT+ F V +GS VCL + + P I+GN
Sbjct: 423 LTGHDEVKVPLLTLRLEGGADVTVDAAGMLFVVRKDGSQVCLAMASLSYEDETP--IIGN 480
Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
+Q +N V YD RLGF + C
Sbjct: 481 YQQKNKRVVYDTLGSRLGFADEDC 504
>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
Length = 452
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 117/409 (28%), Positives = 181/409 (44%), Gaps = 65/409 (15%)
Query: 68 KTTTTTTTTTTTNISSHSYGG-YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYC 126
+T+ ++ N+ S G Y I + FGTP Q + ++DTGS + W PC QC+ C
Sbjct: 93 RTSRSSKQDANANVPVRSGSGEYIIQVDFGTPKQSMYTLIDTGSDVAWIPCK---QCQGC 149
Query: 127 SSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNC--TQICPSYL 184
S+ P F P SSS + C + C I S NC C +
Sbjct: 150 HSTA-PIFDPAKSSSYKPFACDSQPCQEI-----------------SGNCGGNSKC-QFE 190
Query: 185 VLYGSGL-TEGIALSETLNLPNRIIPNFLVGC--SVLSSRQPA-----GIAGFGRGKTSL 236
V YG G +G S+ + L ++ +PNF GC S+ P+ G T
Sbjct: 191 VSYGDGTQVDGTLASDAITLGSQYLPNFSFGCAESLSEDTSPSPGLMGLGGGSLSLLTQA 250
Query: 237 P-SQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNA 295
P ++L FSYCL S +T + SL+L ++ S ++ L +T + +PS+
Sbjct: 251 PTAELFGGTFSYCLPSS----STSSGSLVLGKEAAVS---SSSLKFTTLIKDPSIP---- 299
Query: 296 FSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFV 355
+Y+V L+ I+VG R+ V + GGTI+DSGTT T + P + L D F
Sbjct: 300 --TFYFVTLKAISVGNTRISVPGTNIA----SGGGTIIDSGTTITHLVPSAYTALRDAFR 353
Query: 356 SQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVG 415
Q+ +L + + C+D+ P + LH ++ LP EN +
Sbjct: 354 QQL-------SSLQPTPVEDMDTCYDLSSSSV-DVPTITLHLDRNVDLVLPKENIL-ITQ 404
Query: 416 EGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
E CL + S I+GN Q QN+ + +D+ N ++GF Q+ C
Sbjct: 405 ESGLACLAFSSTDSRS-----IIGNVQQQNWRIVFDVPNSQVGFAQEQC 448
>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 406
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 125/424 (29%), Positives = 185/424 (43%), Gaps = 57/424 (13%)
Query: 56 LTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWF 115
LTR+ ++ QTK + + S G Y I +S GTPP+ + ++DTGS ++W
Sbjct: 26 LTRS-RSRDRQTKVPSQDFQAPVVSGLSLGSGEYFIRISVGTPPRRMYLVMDTGSDILWL 84
Query: 116 PCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKN 175
C C Y S I F P SS+ LGC +C + + Q C
Sbjct: 85 QCAPCVNC-YHQSDAI--FDPYKSSTYSTLGCSTRQCLNLDIGTCQANKC---------- 131
Query: 176 CTQICPSYLVLYGSGL-------TEGIALSETLNLPNRIIPNFLVGCSVLSSR---QPAG 225
Y V YG G T+ ++L+ T + ++ +GC + AG
Sbjct: 132 ------LYQVDYGDGSFTTGEFGTDDVSLNSTSGVGQVVLNKIPLGCGHDNEGYFVGAAG 185
Query: 226 IAGFGRGKTSLPSQL---NLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYT 282
+ G G+G S P+Q+ N +FSYCL + D+T SSL+ + G +T
Sbjct: 186 LLGLGKGPLSFPNQVDPQNGGRFSYCLTDRE-TDSTEGSSLVF----GEAAVPPAGARFT 240
Query: 283 PFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFM 342
P +N V +YY+ + I+VGG + + LD GNGG I+DSGT+ T +
Sbjct: 241 PQDSNMRVP------TFYYLKMTGISVGGTILTIPTSAFQLDSLGNGGVIIDSGTSVTRL 294
Query: 343 APELFEPLADEFVSQMVKNRNYTRALGAEA-LTGLRPCFDVPGEKTGSFPELKLHFKGGA 401
+ L D F R T L A + C+D+ G + P + LHF+GG
Sbjct: 295 QNAAYASLRDAF-------RAGTSDLAPTAGFSLFDTCYDLSGLASVDVPTVTLHFQGGT 347
Query: 402 EVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQ 461
++ LP NY V + CL + GPSII GN Q Q + V YD + ++GF
Sbjct: 348 DLKLPASNYLIPVDNSNTFCLAFA----GTTGPSII-GNIQQQGFRVIYDNLHNQVGFVP 402
Query: 462 QLCK 465
C
Sbjct: 403 SQCN 406
>gi|125542690|gb|EAY88829.1| hypothetical protein OsI_10302 [Oryza sativa Indica Group]
Length = 440
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 113/389 (29%), Positives = 161/389 (41%), Gaps = 41/389 (10%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + G+P Q + LDT + W C+ C C SS + F P SSS L C
Sbjct: 81 YVVRAGLGSPSQQLLLALDTSADATWAHCS---PCGTCPSSSL--FAPANSSSYASLPCS 135
Query: 149 NPKCSWIHHESIQCRDCNDE---PLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPN 205
+ C ++ + P AT C P + + L S+TL L
Sbjct: 136 SSWCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASFQAALA-----SDTLRLGK 190
Query: 206 RIIPNFLVGCSVLSSRQPA------GIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDD 256
IPN+ GC V S P G+ G GRG +L SQ L FSYCL S++
Sbjct: 191 DAIPNYTFGC-VSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCLPSYR--S 247
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
+ SL L G + YTP + NP R++ YYV + ++VG V+V
Sbjct: 248 YYFSGSLRLGAGGGQPRS----VRYTPMLRNP---HRSSL---YYVNVTGLSVGRAWVKV 297
Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
D GT+VDSGT T ++ L +EF Q+ YT +LGA
Sbjct: 298 PAGSFAFDAATGAGTVVDSGTVITRWTAPVYAALREEFRRQVAAPSGYT-SLGA-----F 351
Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
CF+ G P + +H GG ++ LP+EN CL + +
Sbjct: 352 DTCFNTDEVAAGGAPAVTVHMDGGVDLALPMENTLIHSSATPLACLAMAEAPQNVNSVVN 411
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
++ N Q QN V +D+ N R+GF ++ C
Sbjct: 412 VIANLQQQNIRVVFDVANSRIGFAKESCN 440
>gi|125543639|gb|EAY89778.1| hypothetical protein OsI_11320 [Oryza sativa Indica Group]
Length = 488
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 89/267 (33%), Positives = 132/267 (49%), Gaps = 32/267 (11%)
Query: 208 IPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSL 263
+P GC + ++ GIAGFGRG SLPSQL + FS+C + + + S++
Sbjct: 243 VPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAV---NGLKQSTV 299
Query: 264 ILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTL 323
+LD + + TP + N +A YY+ L+ ITVG R+ V L
Sbjct: 300 LLDLLADLYKNGRGAVQSTPLIQN------SANPTLYYLSLKGITVGSTRLPVPESAFAL 353
Query: 324 DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP 383
+G GGTI+DSGT+ T + P++++ + DEF +Q+ + TG CF P
Sbjct: 354 -TNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQI------KLPVVPGNATGPYTCFSAP 406
Query: 384 GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGE---GSAVCLTV--VTDREASGGPSIIL 438
+ P+L LHF+ GA + LP ENY V + S +CL + + D A+ +
Sbjct: 407 SQAKPDVPKLVLHFE-GATMDLPRENYVFEVPDDAGNSMICLAINELGDERAT------I 459
Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLCK 465
GNFQ QN +V YDL+N L F C
Sbjct: 460 GNFQQQNMHVLYDLQNNMLSFVAAQCD 486
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 52/148 (35%), Positives = 75/148 (50%), Gaps = 16/148 (10%)
Query: 303 GLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNR 362
G ITVG R+ V L +G GGTI+DSGT+ T + P++++ + DEF +Q+
Sbjct: 38 GRPGITVGSTRLPVPESAFAL-TNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQI---- 92
Query: 363 NYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGE---GSA 419
+ TG CF P + P+L LHF+ GA + LP ENY V + S
Sbjct: 93 --KLPVVPGNATGPYTCFSAPSQAKPDVPKLVLHFE-GATMDLPRENYVFEVPDDAGNSI 149
Query: 420 VCLTVVTDREASGGPSIILGNFQMQNYY 447
+CL + G + I+GNFQ QN +
Sbjct: 150 ICLAI-----NKGDETTIIGNFQQQNMH 172
>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
Length = 506
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 115/384 (29%), Positives = 164/384 (42%), Gaps = 47/384 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + G+P + + +LDTGS + W C C C P F P LS+S +
Sbjct: 164 GEYFSRVGIGSPARQLYMVLDTGSDVTWVQCQ---PCADCYQQSDPVFDPSLSASYAAVS 220
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C +S +CRD + A +N T C Y V YG G T G +ETL L +
Sbjct: 221 C----------DSQRCRDLD---TAACRNATGAC-LYEVAYGDGSYTVGDFATETLTLGD 266
Query: 206 RI-IPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
+ N +GC + AG+ G G S PSQ++ FSYCL+ D+ S
Sbjct: 267 STPVGNVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISASTFSYCLVDR---DSPAAS 323
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
+L +G++ + G P V +P S +YYV L I+VGGQ + +
Sbjct: 324 TLQFGDGAAEA-----GTVTAPLVRSPRT------STFYYVALSGISVGGQPLSIPASAF 372
Query: 322 TLDR-DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
+D G+GG IVDSGT T + + L D FV Q + T ++ C+
Sbjct: 373 AMDATSGSGGVIVDSGTAVTRLQSAAYAALRDAFV-QGAPSLPRT-----SGVSLFDTCY 426
Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
D+ + P + L F+GG + LP +NY V CL A I+GN
Sbjct: 427 DLSDRTSVEVPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAAVS----IIGN 482
Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
Q Q V +D +GF C
Sbjct: 483 VQQQGTRVSFDTARGAVGFTPNKC 506
>gi|115451209|ref|NP_001049205.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|49532749|dbj|BAD26705.1| Radc1 [Oryza sativa Japonica Group]
gi|108706569|gb|ABF94364.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113547676|dbj|BAF11119.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|215692805|dbj|BAG88249.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767626|dbj|BAG99854.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 438
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 113/389 (29%), Positives = 161/389 (41%), Gaps = 41/389 (10%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + G+P Q + LDT + W C+ C C SS + F P SSS L C
Sbjct: 79 YVVRAGLGSPSQQLLLALDTSADATWAHCS---PCGTCPSSSL--FAPANSSSYASLPCS 133
Query: 149 NPKCSWIHHESIQCRDCNDE---PLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPN 205
+ C ++ + P AT C P + + L S+TL L
Sbjct: 134 SSWCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASFQAALA-----SDTLRLGK 188
Query: 206 RIIPNFLVGCSVLSSRQPA------GIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDD 256
IPN+ GC V S P G+ G GRG +L SQ L FSYCL S++
Sbjct: 189 DAIPNYTFGC-VSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCLPSYR--S 245
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
+ SL L G + YTP + NP R++ YYV + ++VG V+V
Sbjct: 246 YYFSGSLRLGAGGGQPRS----VRYTPMLRNP---HRSSL---YYVNVTGLSVGHAWVKV 295
Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
D GT+VDSGT T ++ L +EF Q+ YT +LGA
Sbjct: 296 PAGSFAFDAATGAGTVVDSGTVITRWTAPVYAALREEFRRQVAAPSGYT-SLGA-----F 349
Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
CF+ G P + +H GG ++ LP+EN CL + +
Sbjct: 350 DTCFNTDEVAAGGAPAVTVHMDGGVDLALPMENTLIHSSATPLACLAMAEAPQNVNSVVN 409
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
++ N Q QN V +D+ N R+GF ++ C
Sbjct: 410 VIANLQQQNIRVVFDVANSRVGFAKESCN 438
>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
Length = 469
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 127/444 (28%), Positives = 186/444 (41%), Gaps = 75/444 (16%)
Query: 38 TNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGG---YSISLS 94
T + +S++ L+ L S R+ + PQ+ + + + T + GG Y + S
Sbjct: 50 TQAALESHRRLSFLAS----RSSQVDKPQSSSASQLSNNDTDTVPLRMDGGGGAYDMEFS 105
Query: 95 FGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSW 154
GTPPQ + + DTGS L+W C + S+ P SS+ L C + C+
Sbjct: 106 IGTPPQKLTALADTGSDLIWTKCDAG---GGAAWGGSSSYHPNASSTFTRLPCSDRLCAA 162
Query: 155 IHHESI-QCRDCNDEPLATSKNCTQICPSYLVLYGSG----LTEGIALSETLNLPNRIIP 209
+ S+ +C A C Y YG G T+G SET L +P
Sbjct: 163 LRSYSLARCA-------AGGAEC-----DYKYAYGLGDDPDFTQGFLGSETFTLGGDAVP 210
Query: 210 NFLVGCSVL---SSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILD 266
GC+ + AG+ G GRG SL SQL+ F YCL + D ++ S L+
Sbjct: 211 GVGFGCTTALEGDYGEGAGLVGLGRGPLSLVSQLDAGTFMYCLTA----DASKASPLLFG 266
Query: 267 NGSSHSDK----KTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
++ + ++TGL A + +Y V LR IT+G T
Sbjct: 267 ALATMTGAGAGVQSTGLL--------------ASTTFYAVNLRSITIGSAT--------T 304
Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
G GG + DSGTT T++A + F+SQ T E G C++
Sbjct: 305 AGVGGPGGVVFDSGTTLTYLAEPAYTEAKAAFLSQT------TSLTPVEGRYGFEACYEK 358
Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI-ILGNF 441
P + P + LHF GGA++ LPV NY V +G VC V PS+ I+GN
Sbjct: 359 P-DSARLIPAMVLHFDGGADMALPVANYVVEVDDG-VVCWVVQRS------PSLSIIGNI 410
Query: 442 QMQNYYVEYDLRNQRLGFKQQLCK 465
NY V +D+R L F+ C
Sbjct: 411 MQMNYLVLHDVRKSVLSFQPANCD 434
>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
Length = 473
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 113/385 (29%), Positives = 159/385 (41%), Gaps = 46/385 (11%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + G+PP ++D+GS ++W C C+ C + P F P SSS +
Sbjct: 128 GEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCR---PCEQCYAQTDPLFDPAASSSFSGVS 184
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C + C + + C Y V YG G T+G ETL L
Sbjct: 185 CGSAICRTLSGTGCGGG-------GDAGKC-----DYSVTYGDGSYTKGELALETLTLGG 232
Query: 206 RIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDK---FSYCLLSHKFDDTTR 259
+ +GC +S AG+ G G G SL QL FSYCL S
Sbjct: 233 TAVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASRG---AGG 289
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
SL+L ++ G + P V N N S +YYVGL I VGG+R+ +
Sbjct: 290 AGSLVL----GRTEAVPVGAVWVPLVRN------NQASSFYYVGLTGIGVGGERLPLQDS 339
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
L DG GG ++D+GT T + E + L F M + A++ L C
Sbjct: 340 LFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPR------SPAVSLLDTC 393
Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
+D+ G + P + +F GA +TLP N VG G+ CL +S G S ILG
Sbjct: 394 YDLSGYASVRVPTVSFYFDQGAVLTLPARNLLVEVG-GAVFCLAFA---PSSSGIS-ILG 448
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
N Q + + D N +GF C
Sbjct: 449 NIQQEGIQITVDSANGYVGFGPNTC 473
>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
Length = 473
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 113/385 (29%), Positives = 159/385 (41%), Gaps = 46/385 (11%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + G+PP ++D+GS ++W C C+ C + P F P SSS +
Sbjct: 128 GEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCR---PCEQCYAQTDPLFDPAASSSFSGVS 184
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C + C + + C Y V YG G T+G ETL L
Sbjct: 185 CGSAICRTLSGTGCGGG-------GDAGKC-----DYSVTYGDGSYTKGELALETLTLGG 232
Query: 206 RIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDK---FSYCLLSHKFDDTTR 259
+ +GC +S AG+ G G G SL QL FSYCL S
Sbjct: 233 TAVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLIGQLGGAAGGVFSYCLASRG---AGG 289
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
SL+L ++ G + P V N N S +YYVGL I VGG+R+ +
Sbjct: 290 AGSLVL----GRTEAVPVGAVWVPLVRN------NQASSFYYVGLTGIGVGGERLPLQDG 339
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
L DG GG ++D+GT T + E + L F M + A++ L C
Sbjct: 340 LFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPR------SPAVSLLDTC 393
Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
+D+ G + P + +F GA +TLP N VG G+ CL +S G S ILG
Sbjct: 394 YDLSGYASVRVPTVSFYFDQGAVLTLPARNLLVEVG-GAVFCLAFA---PSSSGIS-ILG 448
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
N Q + + D N +GF C
Sbjct: 449 NIQQEGIQITVDSANGYVGFGPNTC 473
>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
Length = 429
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 121/411 (29%), Positives = 183/411 (44%), Gaps = 61/411 (14%)
Query: 78 TTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS---- 133
++ +S H ++SL+ G+PPQ + +LDTGS L W C K P+
Sbjct: 45 SSKLSFHHNVSLTVSLTVGSPPQTVTMVLDTGSELSWLHC-----------KKAPNLHSV 93
Query: 134 FIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTE 193
F P SSS + C +P C + RD + + S + ++C + + + E
Sbjct: 94 FDPLRSSSYSPIPCTSPTC------RTRTRDFS---IPVSCDKKKLCHAIISYADASSIE 144
Query: 194 GIALSETLNLPNRIIPNFLVGC--SVLSSR-----QPAGIAGFGRGKTSLPSQLNLDKFS 246
G S+T ++ N IP + GC S SS + G+ G RG S +Q+ L KFS
Sbjct: 145 GNLASDTFHIGNSAIPATIFGCMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQMGLQKFS 204
Query: 247 YCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
YC+ +S ++L SS S K L YTP V S V Y V L
Sbjct: 205 YCISGQD------SSGILLFGESSFSWLK--ALKYTPLVQI-STPLPYFDRVAYTVQLEG 255
Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQ------MVK 360
I V +++ D G G T+VDSGT FTF+ ++ L +EFV Q +++
Sbjct: 256 IKVANSMLQLPKSVYAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVRQTKASLKVLE 315
Query: 361 NRNYTRALGAEALTGLRPCFDVPGEKTG--SFPELKLHFKGGAEVTLPVENYF-----AV 413
+ N+ GA L C+ VP + P + L F+ GAE+++ E +
Sbjct: 316 DPNFVFQ-GAMDL-----CYRVPLTRRTLPPLPTVTLMFR-GAEMSVSAERLMYRVPGVI 368
Query: 414 VGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
G S C T + E G S I+G+ QN ++E+DL R+GF + C
Sbjct: 369 RGSDSVYCFT-FGNSELLGVESYIIGHHHQQNVWMEFDLAKSRVGFAEVRC 418
>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
Length = 490
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 119/392 (30%), Positives = 166/392 (42%), Gaps = 51/392 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y ++ GTP LDT S L W C C+ C P F P+ S+S R +
Sbjct: 136 GEYIAKIAVGTPGVEALLALDTASDLTWLQCQ---PCRRCYPQSGPVFDPRHSTSYREM- 191
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
S DC + + + Y V YG G T G + ETL
Sbjct: 192 ------------SFNAADCQALGRSGGGDAKRGTCVYTVGYGDGSTTVGDFIEETLTFAG 239
Query: 206 RI-IPNFLVGCS----VLSSRQPAGIAGFGRGKTSLPSQLNLDK-FSYCLLSHKFDDTTR 259
+ +P +GC L AGI G GRG S P+Q++ + FSYCL+ +
Sbjct: 240 GVRLPRISIGCGHDNKGLFGAPAAGILGLGRGLMSFPNQIDHNGTFSYCLVDFLSGPGSL 299
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV-RVWH 318
+S+L G+ + +++TP V N ++ +YYV L I+VGG RV V
Sbjct: 300 SSTLTFGAGAVDTSPP---VSFTPTVLNLNM------PTFYYVRLTGISVGGVRVPGVTE 350
Query: 319 KYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG-- 375
+ L LD G GG IVDSGT T +A + D F + V LG ++ G
Sbjct: 351 RDLQLDPYTGRGGVIVDSGTAVTRLARPAYTAFRDAFRAVAVD-------LGQVSIGGPS 403
Query: 376 --LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
C+ V G P + +HF G EV L +NY V VC A+G
Sbjct: 404 GFFDTCYTVGGRGMKKVPTVSMHFAGSVEVKLQPKNYLIPVDSMGTVCFAFA----ATGD 459
Query: 434 PSI-ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
S+ I+GN Q Q + + YD+ R+GF C
Sbjct: 460 HSVSIIGNIQQQGFRIVYDI-GGRVGFAPNSC 490
>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 436
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 121/411 (29%), Positives = 183/411 (44%), Gaps = 61/411 (14%)
Query: 78 TTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS---- 133
++ +S H ++SL+ G+PPQ + +LDTGS L W C K P+
Sbjct: 52 SSKLSFHHNVSLTVSLTVGSPPQTVTMVLDTGSELSWLHC-----------KKAPNLHSV 100
Query: 134 FIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTE 193
F P SSS + C +P C + RD + + S + ++C + + + E
Sbjct: 101 FDPLRSSSYSPIPCTSPTC------RTRTRDFS---IPVSCDKKKLCHAIISYADASSIE 151
Query: 194 GIALSETLNLPNRIIPNFLVGC--SVLSSR-----QPAGIAGFGRGKTSLPSQLNLDKFS 246
G S+T ++ N IP + GC S SS + G+ G RG S +Q+ L KFS
Sbjct: 152 GNLASDTFHIGNSAIPATIFGCMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQMGLQKFS 211
Query: 247 YCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
YC+ +S ++L SS S K L YTP V S V Y V L
Sbjct: 212 YCISGQD------SSGILLFGESSFSWLK--ALKYTPLVQI-STPLPYFDRVAYTVQLEG 262
Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQ------MVK 360
I V +++ D G G T+VDSGT FTF+ ++ L +EFV Q +++
Sbjct: 263 IKVANSMLQLPKSVYAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVRQTKASLKVLE 322
Query: 361 NRNYTRALGAEALTGLRPCFDVPGEKTG--SFPELKLHFKGGAEVTLPVENYF-----AV 413
+ N+ GA L C+ VP + P + L F+ GAE+++ E +
Sbjct: 323 DPNFVFQ-GAMDL-----CYRVPLTRRTLPPLPTVTLMFR-GAEMSVSAERLMYRVPGVI 375
Query: 414 VGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
G S C T + E G S I+G+ QN ++E+DL R+GF + C
Sbjct: 376 RGSDSVYCFT-FGNSELLGVESYIIGHHHQQNVWMEFDLAKSRVGFAEVRC 425
>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
Length = 543
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 112/380 (29%), Positives = 165/380 (43%), Gaps = 43/380 (11%)
Query: 94 SFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCS 153
S G+P + I+DTGS L W C C C + + P F P S++ + C C+
Sbjct: 195 SSGSPAANLTVIVDTGSDLTWVQCK---PCSACYAQRDPLFDPAGSATYAAVRCNASACA 251
Query: 154 WIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFL 212
+ P + + C Y + YG G + G+ ++T+ L + F+
Sbjct: 252 ------ASLKAATGTP-GSCGGGNERC-YYALAYGDGSFSRGVLATDTVALGGASLDGFV 303
Query: 213 VGCSVLSSRQ----PAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLIL 265
GC LS+R AG+ G GR + SL SQ L FSYCL + D + + SL
Sbjct: 304 FGCG-LSNRGLFGGTAGLMGLGRTELSLVSQTALRYGGVFSYCLPATTSGDASGSLSL-- 360
Query: 266 DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
G + S + TT + YT + +P A +Y++ + VGG L
Sbjct: 361 -GGDASSYRNTTPVAYTRMIADP------AQPPFYFLNVTGAAVGG-------TALAAQG 406
Query: 326 DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGE 385
G ++DSGT T +AP ++ + EF Q T A + L C+D+ G
Sbjct: 407 LGASNVLIDSGTVITRLAPSVYRGVRAEFTRQFAAAGYPT----APGFSILDTCYDLTGH 462
Query: 386 KTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQ 444
P L L +GGAEVT+ F V +GS VCL + + P I+GN+Q +
Sbjct: 463 DEVKVPLLTLRLEGGAEVTVDAAGMLFVVRKDGSQVCLAMASLSYEDQTP--IIGNYQQK 520
Query: 445 NYYVEYDLRNQRLGFKQQLC 464
N V YD RLGF + C
Sbjct: 521 NKRVVYDTVGSRLGFADEDC 540
>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
Length = 482
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 132/428 (30%), Positives = 180/428 (42%), Gaps = 72/428 (16%)
Query: 56 LTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQ--------IIPFILD 107
+T+A +P+ T T T+ G Y ++ GTP + + P D
Sbjct: 101 ITKAATPADPENGTVVTGAPTS---------GEYIAKITVGTPYENDSSFEALLSP---D 148
Query: 108 TGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCND 167
GS + W C ++C + P + SSS+ +GC P C
Sbjct: 149 MGSDVTWLQCMPCFRCYH---QPGPVYNRLKSSSASDVGCYAPAC--------------- 190
Query: 168 EPLATSKNCTQI---CPSYLVLYGSGLTE-GIALSETLNLPNRI-IPNFLVGCSV----L 218
L +S C Q C Y V YG G + G ETL P + +P +GC L
Sbjct: 191 RALGSSGGCVQFLNEC-QYKVEYGDGSSSAGDFGVETLTFPPGVRVPGVAIGCGSDNQGL 249
Query: 219 SSRQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKK 275
AGI G GRG S PSQ+ FSYCL R+S+L +G+S +
Sbjct: 250 FPAPAAGILGLGRGSLSFPSQIAGRYGRSFSYCLAGQG--TGGRSSTLTFGSGASATTTT 307
Query: 276 TTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR-VWHKYLTLD-RDGNGGTIV 333
TT ++TP + N + +YYVGL I+VGG RVR V L LD G+GG IV
Sbjct: 308 TTPPSFTPMLTN------SRMYTFYYVGLVGISVGGVRVRGVTESDLRLDPSTGHGGVIV 361
Query: 334 DSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD-VPGEKTGSFPE 392
DSGT T ++ + D F VK + G A C+ V G P
Sbjct: 362 DSGTAVTRLSGPAYAAFRDAFRVAAVKELGWPSPGGPFAF--FDTCYSSVRGRVMKKVPA 419
Query: 393 LKLHFKGGAEVTLPVENYFAVVGEGSA-VCLTVV--TDREASGGPSIILGNFQMQNYYVE 449
+ +HF GG EV LP +NY V +C DR S I+GN Q+Q + V
Sbjct: 420 VSMHFAGGVEVKLPPQNYLIPVDSNKGTMCFAFAGSGDRGVS-----IIGNIQLQGFRVV 474
Query: 450 YDLRNQRL 457
YD+ QR+
Sbjct: 475 YDVDGQRV 482
>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
Length = 423
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 117/420 (27%), Positives = 176/420 (41%), Gaps = 53/420 (12%)
Query: 54 SSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLV 113
SSLT L NP + T + S G Y +SL GTPP+ + + DTGS ++
Sbjct: 49 SSLTNPLKNTNPFLQQDFETPLRSGL---SDGSGEYFVSLGVGTPPRTVNMVADTGSDVL 105
Query: 114 WFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATS 173
W C C+ C P F P SS+ + + C + C + + C
Sbjct: 106 WLQC---LPCQSCYGQTDPLFNPSFSSTFQSITCGSSLCQQLLIRGCRRNQC-------- 154
Query: 174 KNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSS---RQPAGIAGF 229
Y V YG G T G +ETL+ + + + +GC + AG+ G
Sbjct: 155 --------LYQVSYGDGSFTVGEFSTETLSFGSNAVNSVAIGCGHNNQGLFTGAAGLLGL 206
Query: 230 GRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVN 286
G+G S PSQ+ FSYCL + ++T + LI N + S+ + +T +
Sbjct: 207 GKGLLSFPSQVGQLYGSVFSYCLPTR---ESTGSVPLIFGNQAVASNAQ-----FTTLLT 258
Query: 287 NPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRD-GNGGTIVDSGTTFTFMAPE 345
NP + +YYV + I VGG V + L+LD GNGG I+DSGT T +
Sbjct: 259 NPKL------DTFYYVEMVGIKVGGTSVNIPAGSLSLDSSTGNGGVILDSGTAVTRLVTS 312
Query: 346 LFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTL 405
+ P+ D F + M + T + C+D+ G + P + F GGA + L
Sbjct: 313 AYNPMRDAFRAGMPSDAKMT-----SGFSLFDTCYDLSGRSSIMLPAVSFVFNGGATMAL 367
Query: 406 PVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
P +N V CL + E I+GN Q Q++ + +D R+G C
Sbjct: 368 PAQNIMVPVDNSGTYCLAFAPNSENFS----IIGNIQQQSFRMSFDSTGNRVGIGANQCN 423
>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 439
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 113/389 (29%), Positives = 162/389 (41%), Gaps = 56/389 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + GTP Q++ +LDT W PC + C CSS P+F P SS+ L
Sbjct: 97 GNYVVRVKLGTPGQLMFMVLDTSRDAAWVPCAD---CAGCSS---PTFSPNTSSTYASLQ 150
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSE-TLNLPN 205
C P+C+ + + C P + C + YG + LS+ +L L
Sbjct: 151 CSVPQCTQV--RGLSC------PTTGTAACF-----FNQTYGGDSSFSAMLSQDSLGLAV 197
Query: 206 RIIPNFLVGCSVL---SSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHK---FDD 256
+P++ GC S+ P G+ G GRG SL SQ L FSYC S K F
Sbjct: 198 DTLPSYSFGCVNAVSGSTLPPQGLLGLGRGPMSLLSQSGSLYSGVFSYCFPSFKSYYFSG 257
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
+ R L + + TP + NP R YYV L ++VG V V
Sbjct: 258 SLRLGPL----------GQPKNIRTTPLLRNP---HRPTL---YYVNLTGVSVGRVLVPV 301
Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
+ L D + GTI+DSGT T ++ + DEF Q+ + +GA
Sbjct: 302 APELLAFDPNTGAGTIIDSGTVITRFVEPVYAAIRDEFRKQV---KGPFATIGA-----F 353
Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
CF E P + HF G ++ LP+EN GS CL +
Sbjct: 354 DTCFAATNEDIA--PPVTFHFTG-MDLKLPLENTLIHSSAGSLACLAMAAAPNNVNSVLN 410
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
++ N Q QN + +D+ N RLG ++LC
Sbjct: 411 VIANLQQQNLRIMFDVTNSRLGIARELCN 439
>gi|357120129|ref|XP_003561782.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 452
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 109/404 (26%), Positives = 168/404 (41%), Gaps = 56/404 (13%)
Query: 90 SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQN 149
++ ++ GTPPQ + +LDTGS L W C +S S+ P + C +
Sbjct: 64 TVPVAVGTPPQNVTMVLDTGSELSWLLCNGSRHDAPFDASASSSYAP--------VPCSS 115
Query: 150 PKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIP 209
P C+W+ RD P S C ++ SY + +G+ ++T L + +P
Sbjct: 116 PACTWLG------RDLPVRPFCDSSAC-RVSLSY---ADASSADGLLAADTFLLGSSPMP 165
Query: 210 NFLVGC-------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSS 262
L GC + S P G+ G RG S +Q +F+YC+ + +
Sbjct: 166 A-LFGCITSYSSSTDPSETPPTGLLGMNRGGLSFVTQTATRRFAYCIAAGQ------GPG 218
Query: 263 LILDNGSSHSDKKTT----GLTYTPFVNNPSVAERNAF--SVYYYVGLRRITVGGQRVRV 316
++L G+ T+ L YTP V +++ + Y V L I VG + +
Sbjct: 219 ILLLGGNDTETPLTSPPQQQLNYTPLVE---ISQPLPYFDRAAYTVQLEGIRVGSALLAI 275
Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
LT D G G T+VDSGT FTF+ P+ + L EF +Q+ ++ + A E
Sbjct: 276 PKHLLTPDHTGAGQTMVDSGTRFTFLLPDAYAALKAEFANQLTRSLDGGLAPLGEPGFVF 335
Query: 377 RPCFDV----------PGEKTGSFPELKLHFKGGAEVTLPVENYFAVV-----GEGSAVC 421
+ FD G PE+ L +G V E V GEG V
Sbjct: 336 QGAFDACFRGTEARVSAAAAGGLLPEVGLVLRGAEVVVAGAEKLLYRVPGERRGEGEGVW 395
Query: 422 LTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
+ +G + ++G+ Q+ +VEYDLRN RLGF C
Sbjct: 396 CLTFGSSDMAGVSAYVIGHHHQQDVWVEYDLRNARLGFAAARCA 439
>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
Length = 423
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 117/420 (27%), Positives = 176/420 (41%), Gaps = 53/420 (12%)
Query: 54 SSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLV 113
SSLT L NP + T + S G Y +SL GTPP+ + + DTGS ++
Sbjct: 49 SSLTNPLKNTNPFLQQDFETPLRSGL---SDGSGEYFVSLGVGTPPRTVNMVADTGSDVL 105
Query: 114 WFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATS 173
W C C+ C P F P SS+ + + C + C + + C
Sbjct: 106 WLQC---LPCQSCYGQTDPLFNPSFSSTFQSITCGSSLCQQLLIRGCRRNQC-------- 154
Query: 174 KNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSS---RQPAGIAGF 229
Y V YG G T G +ETL+ + + + +GC + AG+ G
Sbjct: 155 --------LYQVSYGDGSFTVGEFSTETLSFGSNAVNSVAIGCGHNNQGLFTGAAGLLGL 206
Query: 230 GRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVN 286
G+G S PSQ+ FSYCL + ++T + LI N + S+ + +T +
Sbjct: 207 GKGLLSFPSQVGQLYGSVFSYCLPTR---ESTGSVPLIFGNQAVASNAQ-----FTTLLT 258
Query: 287 NPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRD-GNGGTIVDSGTTFTFMAPE 345
NP + +YYV + I VGG V + L+LD GNGG I+DSGT T +
Sbjct: 259 NPKL------DTFYYVEMVGIKVGGTSVSIPAGSLSLDSSTGNGGVILDSGTAVTRLVTS 312
Query: 346 LFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTL 405
+ P+ D F + M + T + C+D+ G + P + F GGA + L
Sbjct: 313 AYNPMRDAFRAGMPSDAKMT-----SGFSLFDTCYDLSGRSSIMLPAVSFVFNGGATMAL 367
Query: 406 PVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
P +N V CL + E I+GN Q Q++ + +D R+G C
Sbjct: 368 PAQNIMVPVDNSGTYCLAFAPNSENFS----IIGNIQQQSFRMSFDSTGNRVGIGANQCN 423
>gi|302802500|ref|XP_002983004.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
gi|300149157|gb|EFJ15813.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
Length = 332
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 112/395 (28%), Positives = 163/395 (41%), Gaps = 80/395 (20%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y +++ G+PP+ ++DTGS L W +C CS P SS+ L
Sbjct: 1 GVYYSTITLGSPPKDFSLVMDTGSDLTWV------RCDPCS--------PDCSSTFDRLA 46
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
++++ C D Y YG G T+G +TL +
Sbjct: 47 SNT-------YKALTCAD-----------------DYSYGYGDGSFTQGDLSVDTLKMAG 82
Query: 206 RI------IPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHK 253
P F+ GC L GI G S PSQ+ +KFSYCLL
Sbjct: 83 AASDELEEFPGFVFGCGSLLKGLISGEVGILALSPGSLSFPSQIGEKYGNKFSYCLLRQT 142
Query: 254 FDDTTRTSSLILDNGSSHSDKKTTG----LTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
++ + S ++ + + +G L YTP + S+YY V L I+V
Sbjct: 143 AQNSLKKSPMVFGEAAVELKEPGSGKLQELQYTPIGES---------SIYYTVRLDGISV 193
Query: 310 GGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
G QR+ + +D TI DSGTT T + P + + + S MV +
Sbjct: 194 GNQRLDLSPSAFLNGQDKP--TIFDSGTTLTMLPPGVCDSIKQSLAS-MVSGAEFV---- 246
Query: 370 AEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDRE 429
A+ GL CF VP P++ HF GGA+ NY V+ GS CL V E
Sbjct: 247 --AIKGLDACFRVPPSSGQGLPDITFHFNGGADFVTRPSNY--VIDLGSLQCLIFVPTNE 302
Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
S I GN Q Q+++V +D+ N+R+GFK+ C
Sbjct: 303 VS-----IFGNLQQQDFFVLHDMDNRRIGFKETDC 332
>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 479
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 111/392 (28%), Positives = 160/392 (40%), Gaps = 58/392 (14%)
Query: 82 SSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSS 141
+S G Y + G P + +LDTGS + W C C C P F P S+S
Sbjct: 137 TSQGSGEYFSRVGIGKPSSPVYMVLDTGSDVNWIQCA---PCADCYHQADPIFEPASSTS 193
Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSET 200
L C +C +S+ +C +N T + Y V YG G T G ++ET
Sbjct: 194 YSPLSCDTKQC-----QSLDVSEC--------RNNTCL---YEVSYGDGSYTVGDFVTET 237
Query: 201 LNLPNRIIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNLDKFSYCLLSHK 253
+ L + + N +GC G+ G G GK S PSQ+N FSYCL+
Sbjct: 238 ITLGSASVDNVAIGCG----HNNEGLFIGAAGLLGLGGGKLSFPSQINASSFSYCLVDRD 293
Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
D S+ L+ S+ T P + N +YYVG+ ++VGG+
Sbjct: 294 SD-----SASTLEFNSALLPHAITA----PLLRN------RELDTFYYVGMTGLSVGGEL 338
Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
+ + +D GNGG I+DSGT T + + L D FV T+ L +
Sbjct: 339 LSIPESMFEMDESGNGGIIIDSGTAVTRLQTAAYNALRDAFVKG-------TKDLPVTSE 391
Query: 374 TGL-RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG 432
L C+D+ + + P + H GG + LP NY V C A
Sbjct: 392 VALFDTCYDLSRKTSVEVPTVTFHLAGGKVLPLPATNYLIPVDSDGTFCFAFAPTSSALS 451
Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN Q Q V +DL N +GF+ + C
Sbjct: 452 ----IIGNVQQQGTRVGFDLANSLVGFEPRQC 479
>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 472
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 122/386 (31%), Positives = 171/386 (44%), Gaps = 49/386 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + GTP + + +LDTGS +VW C C+ C + P F P S + +
Sbjct: 127 GEYFTRIGVGTPARYVYMVLDTGSDVVWLQCA---PCRKCYTQADPVFDPTKSRTYAGIP 183
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C P C + D P +KN ++C Y V YG G T G +ETL
Sbjct: 184 CGAPLCRRL-----------DSPGCNNKN--KVC-QYQVSYGDGSFTFGDFSTETLTFRR 229
Query: 206 RIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTR 259
+ +GC + AG+ G GRG+ S P Q KFSYCL+ + +
Sbjct: 230 TRVTRVALGCGHDNEGLFIGAAGLLGLGRGRLSFPVQTGRRFNQKFSYCLVDRS--ASAK 287
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
SS++ + + + +TP + NP + +YY+ L I+VGG VR
Sbjct: 288 PSSVVFGDSAVSRTAR-----FTPLIKNPKL------DTFYYLELLGISVGGSPVRGLSA 336
Query: 320 YL-TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
L LD GNGG I+DSGT+ T + + L D F V + RA AE +
Sbjct: 337 SLFRLDAAGNGGVIIDSGTSVTRLTRPAYIALRDAF---RVGASHLKRA--AE-FSLFDT 390
Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
CFD+ G P + LHF+ GA+V+LP NY V + C G SII
Sbjct: 391 CFDLSGLTEVKVPTVVLHFR-GADVSLPATNYLIPVDNSGSFCFAFAGTMS---GLSII- 445
Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
GN Q Q + V +DL R+GF + C
Sbjct: 446 GNIQQQGFRVSFDLAGSRVGFAPRGC 471
>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 135/474 (28%), Positives = 208/474 (43%), Gaps = 76/474 (16%)
Query: 13 IFFFTLLSI--FPSSITSLTFSLSRFHTN--------PSQDSYQNLNSLVSSSLTRALHI 62
+ FF+L I F S+ + +FS H + P+Q+ +Q++ + S+ RA
Sbjct: 9 LLFFSLCFIISFSHSLRN-SFSFELIHRDSSKSPLYKPAQNKFQHVVNAARRSINRA--- 64
Query: 63 KNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQ 122
N K + + T +T ++ G Y ++ S GTPP + ++DTGS +VW C
Sbjct: 65 -NRLFKDSLSNTPESTVYVNG---GEYLMTYSVGTPPFNVYGVVDTGSDIVWLQCK---P 117
Query: 123 CKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPS 182
C+ C P F P SSS + + C + C + + S CN + C
Sbjct: 118 CEQCYKQTTPIFNPSKSSSYKNIPCSSNLCQSVRYTS-----CNKQ---------NSCEY 163
Query: 183 YLVLYGSGLTEGIALSETLNLPNRI-----IPNFLVGCSV----LSSRQPAGIAGFGRGK 233
+ ++G ETL L + P ++GC + + +GI G G G
Sbjct: 164 TINFSDQSYSQGELSVETLTLDSTTGHSVSFPKTVIGCGHNNRGMFQGETSGIVGLGIGP 223
Query: 234 TSLPSQLNLD---KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSV 290
SL +QL KFSYCLL D+ +TS L + + S G+ TPFV
Sbjct: 224 VSLTTQLKSSIGGKFSYCLLP-LLVDSNKTSKLNFGDAAVVSGD---GVVSTPFVKKDPQ 279
Query: 291 AERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPL 350
A +YY+ L +VG +R+ ++ LD G I+DSGTT T + ++ L
Sbjct: 280 A-------FYYLTLEAFSVGNKRI----EFEVLDDSEEGNIILDSGTTLTLLPSHVYTNL 328
Query: 351 ADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENY 410
+ V+Q+VK R L L C+ + ++ FP + HFK GA++ L +
Sbjct: 329 -ESAVAQLVK---LDRVDDPNQLLNL--CYSITSDQY-DFPIITAHFK-GADIKLNPIST 380
Query: 411 FAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
FA V +G VCL + + GP I GN N V YDL+ + FK C
Sbjct: 381 FAHVADG-VVCLAFTSSQT---GP--IFGNLAQLNLLVGYDLQQNIVSFKPSDC 428
>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 459
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 122/397 (30%), Positives = 166/397 (41%), Gaps = 62/397 (15%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + L+ GTPPQ + +LDTGS L+W C C C P F P SSS + C
Sbjct: 104 YLVDLAVGTPPQPVSALLDTGSDLIWTQCA---PCASCLPQPDPIFSPGASSSYEPMRCA 160
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLT-EGIALSETLNLP--- 204
C+ I H S Q D CT Y YG G T G+ +E
Sbjct: 161 GELCNDILHHSCQRPD----------TCT-----YRYSYGDGTTTRGVYATERFTFSSSS 205
Query: 205 -----NRIIPNFLVGCSVL---SSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
++ GC + S +GI GFGR SL SQL + +FSYCL +
Sbjct: 206 SGGETTKLSAPLGFGCGTMNKGSLNNGSGIVGFGRAPLSLVSQLAIRRFSYCLTPYA--- 262
Query: 257 TTRTSSLILDN--GSSHSDKKTTGLTYTPFV---NNPSVAERNAFSVYYYVGLRRITVGG 311
+ R S+L+ + G + D T + T + NP+ +YYV +TVG
Sbjct: 263 SGRKSTLLFGSLRGGVY-DAATATVQTTRLLRSRQNPT---------FYYVPFTGVTVGA 312
Query: 312 QRVRVWHKYLTLDRDGNGGTIVDSGTTFT-FMAPELFEPLADEFVSQMVKNRNYTRALGA 370
+R+R+ L DG+GG IVDSGT T F AP L E + F SQ+ + G
Sbjct: 313 RRLRIPISAFALRPDGSGGAIVDSGTALTLFPAPVLAE-VVRAFRSQLRLPFAANGSSGP 371
Query: 371 EALTGLRPCFDVPGEKT---GSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTD 427
+ G+ CF + P + H + GA++ LP NY +CL +
Sbjct: 372 D--DGV--CFAAAASRVPRPAVVPRMVFHLQ-GADLDLPRRNYVLDDQRKGNLCLLLAD- 425
Query: 428 REASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
SG +GNF Q+ V YDL L F C
Sbjct: 426 ---SGDSGTTIGNFVQQDMRVLYDLEADTLSFAPAQC 459
>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
Length = 481
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 122/394 (30%), Positives = 173/394 (43%), Gaps = 54/394 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + GTP +LDTGS +VW C C++C + F P+ S S +
Sbjct: 126 GEYFAQVGVGTPATTALMVLDTGSDVVWLQCA---PCRHCYAQSGRVFDPRRSRSYAAVD 182
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C P C + +S C + L Y V YG G +T G SETL
Sbjct: 183 CVAPICRRL--DSAGCDRRRNSCL------------YQVAYGDGSVTAGDFASETLTFAR 228
Query: 206 RI-IPNFLVGCSVLSSRQPAGIAG-----FGRGKTSLPSQLNLD---KFSYCLL---SHK 253
+ +GC + IA GRG+ S PSQ+ FSYCL+ S
Sbjct: 229 GARVQRVAIGCG--HDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSV 286
Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
+TR+S++ + + G ++TP NP +A +YYV L +VGG R
Sbjct: 287 RPSSTRSSTVTF---GAGAVAAAAGASFTPMGRNPRMA------TFYYVHLLGFSVGGAR 337
Query: 314 VR-VWHKYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
V+ V L L+ G GG I+DSGT+ T +A ++E + D F + V R +
Sbjct: 338 VKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLR-----VSPG 392
Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREA 430
+ C+++ G + P + +H GGA V LP ENY V C + TD
Sbjct: 393 GFSLFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTD--- 449
Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
GG SII GN Q Q + V +D QR+GF + C
Sbjct: 450 -GGVSII-GNIQQQGFRVVFDGDAQRVGFVPKSC 481
>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
Length = 485
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 113/386 (29%), Positives = 166/386 (43%), Gaps = 51/386 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + G P + +LDTGS + W C C C P + P LSSS +L+G
Sbjct: 143 GEYFSRIGVGAPRRDQLMVLDTGSDVTWIQCE---PCSDCYQQSDPIYNPALSSSYKLVG 199
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
CQ C + + C S+N + + Y V YG G T+G +ETL L
Sbjct: 200 CQANLC-----QQLDVSGC-------SRNGSCL---YQVSYGDGSYTQGNFATETLTLGG 244
Query: 206 RIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQL---NLDKFSYCLLSHKFDDTTR 259
+ N +GC + AG+ G G G S PSQL N FSYCL+ D+
Sbjct: 245 APLQNVAIGCGHDNEGLFVGAAGLLGLGGGSLSFPSQLTDENGKIFSYCLVDR---DSES 301
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
+S+L + + G P + N + +YYV L I+VGG+ + +
Sbjct: 302 SSTLQFGRAAVPN-----GAVLAPMLKN------SRLDTFYYVSLSGISVGGKMLSISDS 350
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL-GAEALTGLRP 378
+D GNGG IVDSGT T + ++ L D F R T+ L + ++
Sbjct: 351 VFGIDASGNGGVIVDSGTAVTRLQTAAYDSLRDAF-------RAGTKNLPSTDGVSLFDT 403
Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
C+D+ +++ P + HF GG ++LP +NY V C + I+
Sbjct: 404 CYDLSSKESVDVPTVVFHFSGGGSMSLPAKNYLVPVDSMGTFCFAFAPTSSSLS----IV 459
Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
GN Q Q V +D N ++GF C
Sbjct: 460 GNIQQQGIRVSFDRANNQVGFAVNKC 485
>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
Length = 378
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 121/405 (29%), Positives = 179/405 (44%), Gaps = 54/405 (13%)
Query: 82 SSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS------FI 135
+ + G YS++ GTP Q + DTGS L W C H + + CS+ K F
Sbjct: 5 ADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFH 64
Query: 136 PKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNC-TQICP-SYLVLYGSGLTE 193
LSSS + + C C I+ D L + NC T + P Y Y G T
Sbjct: 65 ANLSSSFKTIPCLTDMC------KIELMD-----LFSLTNCPTPLTPCGYDYRYSDGSTA 113
Query: 194 -GIALSETLNLPNR-----IIPNFLVGCSV----LSSRQPAGIAGFGRGKTSLPSQLNLD 243
G +ET+ + + + N L+GCS S + G+ G G K S +
Sbjct: 114 LGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEK 173
Query: 244 ---KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKK-TTGLTYTPFVNNPSVAERNAFSVY 299
KFSYCL+ H + + S L GSS S + +TYT V + N+F
Sbjct: 174 FGGKFSYCLVDHL---SHKNVSNYLTFGSSRSKEALLNNMTYTELV----LGMVNSF--- 223
Query: 300 YYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMV 359
Y V + I++GG +++ + D G GGTI+DSG++ TF+ ++P+ ++
Sbjct: 224 YAVNMMGISIGGAMLKIPSE--VWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLL 281
Query: 360 KNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSA 419
K R +G L CF+ G + P L HF GAE PV++Y +G
Sbjct: 282 KFRKVEMDIGP-----LEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISAADG-V 335
Query: 420 VCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
CL V+ A G S++ GN QN+ E+DL ++LGF C
Sbjct: 336 RCLGFVS--VAWPGTSVV-GNIMQQNHLWEFDLGLKKLGFAPSSC 377
>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 414
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 108/371 (29%), Positives = 170/371 (45%), Gaps = 51/371 (13%)
Query: 105 ILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRD 164
I+DTGS L W C C+ C + + P F P S S + + C + C + + +
Sbjct: 81 IVDTGSDLTWVQCQ---PCRLCYNQQDPLFNPSGSPSYQTILCNSSTCQSLQYATGNLGV 137
Query: 165 CNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSSR-- 221
C T C +Y+V YG G T G E LNL + NF+ GC +
Sbjct: 138 CGSN--------TPTC-NYVVNYGDGSYTRGDLGMEQLNLGTTHVSNFIFGCGRNNKGLF 188
Query: 222 -QPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTT 277
+G+ G G+ SL SQ + FSYCL + D + SLIL G+S K TT
Sbjct: 189 GGASGLMGLGKSDLSLVSQTSAIFEGVFSYCLPTTAAD---ASGSLIL-GGNSSVYKNTT 244
Query: 278 GLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGT 337
++YT + NP + +Y++ L I++GG ++ + + G ++DSGT
Sbjct: 245 PISYTRMIANPQLP------TFYFLNLTGISIGGVALQAPNYRQS-------GILIDSGT 291
Query: 338 TFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHF 397
T + P ++ L EF+ Q ++ A + L CF++ G P +++ F
Sbjct: 292 VITRLPPPVYRDLKAEFLKQ------FSGFPSAPPFSILDTCFNLNGYDEVDIPTIRMQF 345
Query: 398 KGGAEVTLPVENYFAVVG-EGSAVCLTVVT---DREASGGPSIILGNFQMQNYYVEYDLR 453
+G AE+T+ V F V + S VCL + + D E I+GN+Q +N V Y+ +
Sbjct: 346 EGNAELTVDVTGIFYFVKTDASQVCLALASLSFDDEIP-----IIGNYQQRNQRVIYNTK 400
Query: 454 NQRLGFKQQLC 464
+LGF + C
Sbjct: 401 ESKLGFAAEAC 411
>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 448
Score = 125 bits (313), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 116/391 (29%), Positives = 178/391 (45%), Gaps = 52/391 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y S+ GTPP ++DTGS +VW C C +C P + P+ SS+
Sbjct: 97 GEYFASVGVGTPPTPALLVIDTGSDVVWLQCK---PCVHCYRQLSPLYDPRGSSTYAQTP 153
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPN 205
C P+C + C+ T+ C Y ++YG + T G ++ L N
Sbjct: 154 CSPPQCR-------NPQTCD----GTTGGC-----GYRIVYGDASSTSGNLATDRLVFSN 197
Query: 206 RI-IPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDK---FSYCLLSHKFDDTT 258
+ N +GC + AG+ G RG S +Q+ F+YCL D T
Sbjct: 198 DTSVGNVTLGCGHDNEGLFGSAAGLLGVARGNNSFATQVADSYGRYFAYCL-----GDRT 252
Query: 259 RT--SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
R+ SS L G + + ++ +TP +NP R + YYV + +VGG+ V
Sbjct: 253 RSGSSSSYLVFGRTAPEPPSS--VFTPLRSNP---RRPSL---YYVDMVGFSVGGEPVTG 304
Query: 317 W-HKYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
+ + L+LD G GG +VDSGT+ T A + + L D F ++ K R +G ++
Sbjct: 305 FSNASLSLDPATGRGGVVVDSGTSITRFARDAYGALRDAFDARAAKVG--MRKVG-RGIS 361
Query: 375 GLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGP 434
C+D+ G P + LHF GGA+V LP ENY G C + EA+G
Sbjct: 362 VFDACYDLRGVAVADAPGVVLHFAGGADVALPPENYLVPEESGRYHCFAL----EAAGHD 417
Query: 435 SI-ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ ++GN Q + V +D+ N+R+GF+ C
Sbjct: 418 GLSVIGNVLQQRFRVVFDVENERVGFEPNGC 448
>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
Length = 502
Score = 125 bits (313), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 111/399 (27%), Positives = 163/399 (40%), Gaps = 55/399 (13%)
Query: 75 TTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSF 134
TT + +S G Y + GTP + + +LDTGS + W C C C P F
Sbjct: 150 TTPVVSGTSQGSGEYFSRIGVGTPAKEMYVVLDTGSDVNWIQC---LPCSECYQQSDPIF 206
Query: 135 IPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTE 193
P SS+ + L C +PKC+ + + CR S C Y V YG G T
Sbjct: 207 DPTSSSTFKSLTCSDPKCASLDVSA--CR---------SNKCL-----YQVSYGDGSFTV 250
Query: 194 GIALSETLNL-PNRIIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNLDKF 245
G ++T+ + + + +GC G+ G G G S+ +Q+ F
Sbjct: 251 GNYATDTVTFGESGKVNDVALGCG----HDNEGLFTGAAGLLGLGGGALSMTNQIKAKSF 306
Query: 246 SYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLR 305
SYCL+ D+ ++SSL +S + G P + N + +YYVGL
Sbjct: 307 SYCLVDR---DSAKSSSLDF-----NSVQIGAGDATAPLLRN------SKMDTFYYVGLS 352
Query: 306 RITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYT 365
+VGGQ+V + +D G GG I+D GT T + + + L D FV +
Sbjct: 353 GFSVGGQQVSIPSSLFEVDASGAGGVILDCGTAVTRLQTQAYNSLRDAFVKLTTDFKK-- 410
Query: 366 RALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV 425
G ++ C+D T P + HF GG + LP +NY + + C
Sbjct: 411 ---GTSPISLFDTCYDFSSLSTVKVPTVTFHFTGGKSLNLPAKNYLIPIDDAGTFCFAFA 467
Query: 426 TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
S SII GN Q Q + YDL N +G C
Sbjct: 468 ---PTSSSLSII-GNVQQQGTRITYDLANNLIGLSANKC 502
>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
Length = 475
Score = 124 bits (312), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 122/394 (30%), Positives = 173/394 (43%), Gaps = 54/394 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + GTP +LDTGS +VW C C++C + F P+ S S +
Sbjct: 120 GEYFAQVGVGTPATTALMVLDTGSDVVWLQCA---PCRHCYAQSGRVFDPRRSRSYAAVD 176
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C P C + +S C + L Y V YG G +T G SETL
Sbjct: 177 CVAPICRRL--DSAGCDRRRNSCL------------YQVAYGDGSVTAGDFASETLTFAR 222
Query: 206 RI-IPNFLVGCSVLSSRQPAGIAG-----FGRGKTSLPSQLNLD---KFSYCLL---SHK 253
+ +GC + IA GRG+ S PSQ+ FSYCL+ S
Sbjct: 223 GARVQRVAIGCG--HDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSV 280
Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
+TR+S++ + + G ++TP NP +A +YYV L +VGG R
Sbjct: 281 RPSSTRSSTVTF---GAGAVAAAAGASFTPMGRNPRMA------TFYYVHLLGFSVGGAR 331
Query: 314 VR-VWHKYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
V+ V L L+ G GG I+DSGT+ T +A ++E + D F + V R +
Sbjct: 332 VKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLR-----VSPG 386
Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREA 430
+ C+++ G + P + +H GGA V LP ENY V C + TD
Sbjct: 387 GFSLFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTD--- 443
Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
GG SII GN Q Q + V +D QR+GF + C
Sbjct: 444 -GGVSII-GNIQQQGFRVVFDGDAQRVGFVPKSC 475
>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 412
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 112/384 (29%), Positives = 176/384 (45%), Gaps = 47/384 (12%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y +++ G+ + I+DTGS L W C C C + + P F P SSS + + C
Sbjct: 65 YIVTMGLGSTNMTV--IIDTGSDLTWVQCE---PCMSCYNQQGPIFKPSTSSSYQSVSCN 119
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRI 207
+ C + + C P C +Y+V YG G T G E L+
Sbjct: 120 SSTCQSLQFATGNTGACGSNP----STC-----NYVVNYGDGSYTNGELGVEQLSFGGVS 170
Query: 208 IPNFLVGCSVLSSRQPAGIAGF---GRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTS 261
+ +F+ GC + G++G GR SL SQ N FSYCL + ++ +
Sbjct: 171 VSDFVFGCGRNNKGLFGGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTT---ESGASG 227
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
SL++ N SS K T +TYT + NP ++ +Y + L I V G ++V
Sbjct: 228 SLVMGNESSVF-KNVTPITYTRMLPNPQLSN------FYILNLTGIDVDGVALQV----- 275
Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
GNGG ++DSGT T + +++ L F+ Q +T A + L CF+
Sbjct: 276 --PSFGNGGVLIDSGTVITRLPSSVYKALKALFLKQ------FTGFPSAPGFSILDTCFN 327
Query: 382 VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGE-GSAVCLTVVTDREASGGPSIILGN 440
+ G S P + +HF+G AE+ + F VV E S VCL + + +A + I+GN
Sbjct: 328 LTGYDEVSIPTISMHFEGNAELKVDATGTFYVVKEDASQVCLALASLSDAY--DTAIIGN 385
Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
+Q +N V YD + ++GF ++ C
Sbjct: 386 YQQRNQRVIYDTKQSKVGFAEESC 409
>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 449
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 121/405 (29%), Positives = 179/405 (44%), Gaps = 54/405 (13%)
Query: 82 SSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS------FI 135
+ + G YS++ GTP Q + DTGS L W C H + + CS+ K F
Sbjct: 76 ADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFH 135
Query: 136 PKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNC-TQICP-SYLVLYGSGLTE 193
LSSS + + C C I+ D L + NC T + P Y Y G T
Sbjct: 136 ANLSSSFKTIPCLTDMC------KIELMD-----LFSLTNCPTPLTPCGYDYRYSDGSTA 184
Query: 194 -GIALSETLNLPNR-----IIPNFLVGCSV----LSSRQPAGIAGFGRGKTSLPSQLNLD 243
G +ET+ + + + N L+GCS S + G+ G G K S +
Sbjct: 185 LGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEK 244
Query: 244 ---KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKK-TTGLTYTPFVNNPSVAERNAFSVY 299
KFSYCL+ H + + S L GSS S + +TYT V + N+F
Sbjct: 245 FGGKFSYCLVDHL---SHKNVSNYLTFGSSRSKEALLNNMTYTELV----LGMVNSF--- 294
Query: 300 YYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMV 359
Y V + I++GG +++ + D G GGTI+DSG++ TF+ ++P+ ++
Sbjct: 295 YAVNMMGISIGGAMLKIPSE--VWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLL 352
Query: 360 KNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSA 419
K R +G L CF+ G + P L HF GAE PV++Y +G
Sbjct: 353 KFRKVEMDIGP-----LEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISAADG-V 406
Query: 420 VCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
CL V+ A G S++ GN QN+ E+DL ++LGF C
Sbjct: 407 RCLGFVS--VAWPGTSVV-GNIMQQNHLWEFDLGLKKLGFAPSSC 448
>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 474
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 110/374 (29%), Positives = 172/374 (45%), Gaps = 48/374 (12%)
Query: 105 ILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIH-HESIQCR 163
++DT S L W C C+ C + P F P S S + C + C + +
Sbjct: 134 VVDTASELTWVQCQ---PCESCHDQQDPLFDPSSSPSYAAVPCNSSSCDALRVAMAAGTS 190
Query: 164 DCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSSRQ 222
C D+ N Q SY + Y G + G+ + L L + I F+ GC +
Sbjct: 191 PCADD------NEQQPACSYALSYRDGSYSRGVLARDKLRLAGQDIEGFVFGCGTSNQGA 244
Query: 223 P----AGIAGFGRGKTSLPSQLNLDKF----SYCLLSHKFDDTTRTSSLILDNGSSHSDK 274
P +G+ G GR SL SQ +D+F SYCL ++ + SL+L + SS + +
Sbjct: 245 PFGGTSGLMGLGRSHVSLVSQ-TMDQFGGVFSYCL---PMRESGSSGSLVLGDDSS-AYR 299
Query: 275 KTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV-WHKYLTLDRDGNGGTIV 333
+T + YT V++ + +Y++ L ITVGGQ V W G I+
Sbjct: 300 NSTPIVYTAMVSDSGPLQ----GPFYFLNLTGITVGGQEVESPWFS--------AGRVII 347
Query: 334 DSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPEL 393
DSGT T + P ++ + EF+SQ+ + Y + A A + L CF++ G K P L
Sbjct: 348 DSGTIITTLVPSVYNAVRAEFLSQLAE---YPQ---APAFSILDTCFNLTGLKEVQVPSL 401
Query: 394 KLHFKGGAEVTLPVEN--YFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYD 451
K F+G EV + + YF V + S VCL + + + S + I+GN+Q +N V +D
Sbjct: 402 KFVFEGSVEVEVDSKGVLYF-VSSDASQVCLALASLK--SEYDTSIIGNYQQKNLRVIFD 458
Query: 452 LRNQRLGFKQQLCK 465
++GF Q+ C
Sbjct: 459 TLGSQIGFAQETCD 472
>gi|357515189|ref|XP_003627883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521905|gb|AET02359.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 113/382 (29%), Positives = 161/382 (42%), Gaps = 51/382 (13%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + FGTPPQ + LDT S W PC+ C CS+SK F P S+S R + C
Sbjct: 97 YIVKAKFGTPPQTLLLALDTSSDAAWIPCSG---CVGCSTSK--PFAPIKSTSFRNVSCG 151
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
+P C + + + C ++ YGS + +TL L I
Sbjct: 152 SPHCKQVPNPTCGGSAC----------------AFNFTYGSSSIAASVVQDTLTLATDPI 195
Query: 209 PNFLVGC----SVLSSRQPAGIAGFGRGKTSLPSQLNLDK--FSYCLLSHKFDDTTRTSS 262
P + GC + S+ Q + + L NL K FSYCL S F + S
Sbjct: 196 PGYTFGCVNKTTGSSAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCLPS--FKSINFSGS 253
Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
L L G + K+ + YTP + NP R++ YYV L I VG + V + L
Sbjct: 254 LRL--GPVYQPKR---IKYTPLLRNP---RRSSL---YYVNLVAIKVGRKIVDIPPAALA 302
Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
+ GTI DSGT FT +A ++ + +EF R L L G C++V
Sbjct: 303 FNPTTGAGTIFDSGTVFTRLAEPVYTAVRNEF------RRRVGPKLPVTTLGGFDTCYNV 356
Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
P P + F G VTLP +N GS CL + + ++ N Q
Sbjct: 357 PIV----VPTITFLF-SGMNVTLPPDNIVIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQ 411
Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
QN+ V +D+ N R+G ++LC
Sbjct: 412 QQNHRVLFDVPNSRIGIARELC 433
>gi|118486912|gb|ABK95290.1| unknown [Populus trichocarpa]
Length = 438
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 122/435 (28%), Positives = 181/435 (41%), Gaps = 55/435 (12%)
Query: 40 PSQDSYQN-LNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTP 98
P Q+S+ N + ++ S R ++ + TT + Y + + GTP
Sbjct: 50 PKQESWVNTVITMASKDPERLKYLSTLADQKTTAVPIAPGQQV--LKIANYVVRVKLGTP 107
Query: 99 PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE 158
Q + +LDT + W PC+ C+ +F+P S++ L C +CS +
Sbjct: 108 GQQMFMVLDTSNDAAWVPCSG------CTGFSSTTFLPNASTTLGSLDCSGAQCSQVRGF 161
Query: 159 SIQCRDCNDEPLATSKNCTQICPSYLVLYG--SGLTEGIALSETLNLPNRIIPNFLVGC- 215
S P S C + YG S LT + + + + L N +IP F GC
Sbjct: 162 SC--------PATGSSACL-----FNQSYGGDSSLTATL-VQDAITLANDVIPGFTFGCI 207
Query: 216 SVLS--SRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSS 270
+ +S S P G+ G GRG SL SQ FSYCL S F + SL L
Sbjct: 208 NAVSGGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPS--FKSYYFSGSLKLGPVGQ 265
Query: 271 HSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGG 330
+TT L P + PS+ YYV L ++VG +V + + L D + G
Sbjct: 266 PKSIRTTPLLRNP--HRPSL---------YYVNLTGVSVGRIKVPIPSEQLVFDPNTGAG 314
Query: 331 TIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSF 390
TI+DSGT T ++ + DEF Q+ +LGA CF E
Sbjct: 315 TIIDSGTVITRFVQPVYFAIRDEFRKQV---NGPISSLGA-----FDTCFAATNEAEA-- 364
Query: 391 PELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEY 450
P + LHF+G + LP+EN GS CL++ ++ N Q QN + +
Sbjct: 365 PAITLHFEG-LNLVLPMENSLIHSSSGSLACLSMAAAPNNVNSVLNVIANLQQQNLRIMF 423
Query: 451 DLRNQRLGFKQQLCK 465
D N RLG ++LC
Sbjct: 424 DTTNSRLGIARELCN 438
>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 414
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 109/384 (28%), Positives = 176/384 (45%), Gaps = 45/384 (11%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y +++ G+ + I+DTGS L W C C C + + P F P SSS + + C
Sbjct: 65 YIVTMGLGSKNMTV--IIDTGSDLTWVQCE---PCMSCYNQQGPIFKPSTSSSYQSVSCN 119
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRI 207
+ C + + C +T +Y+V YG G T G E L+
Sbjct: 120 SSTCQSLQFATGNTGACGSSNPSTC--------NYVVNYGDGSYTNGELGVEALSFGGVS 171
Query: 208 IPNFLVGCSVLSSRQPAGIAGF---GRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTS 261
+ +F+ GC + G++G GR SL SQ N FSYCL + + +
Sbjct: 172 VSDFVFGCGRNNKGLFGGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTT---EAGSSG 228
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
SL++ N SS K +TYT ++NP ++ +Y + L I VGG ++ +
Sbjct: 229 SLVMGNESSVF-KNANPITYTRMLSNPQLSN------FYILNLTGIDVGGVALKAPLSF- 280
Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
GNGG ++DSGT T + +++ L EF+ + +T A + L CF+
Sbjct: 281 -----GNGGILIDSGTVITRLPSSVYKALKAEFL------KKFTGFPSAPGFSILDTCFN 329
Query: 382 VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGE-GSAVCLTVVTDREASGGPSIILGN 440
+ G S P + L F+G A++ + F VV E S VCL + + +A + I+GN
Sbjct: 330 LTGYDEVSIPTISLRFEGNAQLNVDATGTFYVVKEDASQVCLALASLSDAY--DTAIIGN 387
Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
+Q +N V YD + ++GF ++ C
Sbjct: 388 YQQRNQRVIYDTKQSKVGFAEEPC 411
>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 479
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 122/407 (29%), Positives = 171/407 (42%), Gaps = 48/407 (11%)
Query: 67 TKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYC 126
T TT + + IS S G Y + + G+PP ++D+GS ++W C C C
Sbjct: 112 TTMTTEVGSEVVSGISEGS-GEYFVRVGVGSPPTEQYLVVDSGSDVIWIQCR---PCAEC 167
Query: 127 SSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVL 186
P F P S+S + C + C + S C D S C Y V
Sbjct: 168 YQQADPLFDPAASASFTAVPCDSGVCRTLPGGSSGCAD--------SGAC-----RYQVS 214
Query: 187 YGSG-LTEGIALSETLNLPNRI-IPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQL- 240
YG G T+G+ ETL + + +GC + AG+ G G G SL QL
Sbjct: 215 YGDGSYTQGVLAMETLTFGDSTPVQGVAIGCGHRNRGLFVGAAGLLGLGWGPMSLVGQLG 274
Query: 241 --NLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSV 298
FSYCL S D SL+ D G + P + N A++ +F
Sbjct: 275 GAAGGAFSYCLASRGAD--AGAGSLVF----GRDDAMPVGAVWVPLLRN---AQQPSF-- 323
Query: 299 YYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM 358
YYVGL + VGG+R+ + L DG GG ++D+GT T + P+ + L D F S +
Sbjct: 324 -YYVGLTGLGVGGERLPLQDGLFDLTEDGGGGVVMDTGTAVTRLPPDAYAALRDAFASTI 382
Query: 359 VKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHF-KGGAEVTLPVENYFAVVGEG 417
+ RA G L C+D+ G + P + L+F + GA +TLP N +G G
Sbjct: 383 --GGDLPRAPGVSLLD---TCYDLSGYASVRVPTVALYFGRDGAALTLPARNLLVEMG-G 436
Query: 418 SAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
CL AS ILGN Q Q + D N +GF C
Sbjct: 437 GVYCLAFA----ASASGLSILGNIQQQGIQITVDSANGYVGFGPSTC 479
>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
Length = 475
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 121/394 (30%), Positives = 173/394 (43%), Gaps = 54/394 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + GTP +LDTGS +VW C C++C + F P+ S S +
Sbjct: 120 GEYFAQVGVGTPATTALMVLDTGSDVVWLQCA---PCRHCYAQSGRVFDPRRSRSYAAVD 176
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C P C + +S C + L Y V YG G +T G SETL
Sbjct: 177 CVAPICRRL--DSAGCDRRRNSCL------------YQVAYGDGSVTAGDFASETLTFAR 222
Query: 206 RI-IPNFLVGCSVLSSRQPAGIAG-----FGRGKTSLPSQLNLD---KFSYCLL---SHK 253
+ +GC + IA GRG+ S P+Q+ FSYCL+ S
Sbjct: 223 GARVQRVAIGCG--HDNEGLFIAASGLLGLGRGRLSFPTQIARSFGRSFSYCLVDRTSSV 280
Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
+TR+S++ + + G ++TP NP +A +YYV L +VGG R
Sbjct: 281 RPSSTRSSTVTF---GAGAVAAAAGASFTPMGRNPRMA------TFYYVHLLGFSVGGAR 331
Query: 314 VR-VWHKYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
V+ V L L+ G GG I+DSGT+ T +A ++E + D F + V R +
Sbjct: 332 VKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLR-----VSPG 386
Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREA 430
+ C+++ G + P + +H GGA V LP ENY V C + TD
Sbjct: 387 GFSLFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTD--- 443
Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
GG SII GN Q Q + V +D QR+GF + C
Sbjct: 444 -GGVSII-GNIQQQGFRVVFDGDAQRVGFVPKSC 475
>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 385
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 116/395 (29%), Positives = 173/395 (43%), Gaps = 60/395 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y I +S GTPP+ + ++DTGS ++W C C C F P SS+ LG
Sbjct: 35 GEYFIRVSVGTPPRGMYLVMDTGSDILWLQCA---PCVSCYHQCDEVFDPYKSSTYSTLG 91
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-------LTEGIALSE 199
C + +C + D C Y V YG G T+ ++L+
Sbjct: 92 CNSRQCLNL-----------DVGGCVGNKCL-----YQVDYGDGSFSTGEFATDAVSLNS 135
Query: 200 TLNLPNRIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHK 253
T ++ +GC + AG+ G G+G S P+Q+N + +FSYCL
Sbjct: 136 TSGGGQVVLNKIPLGCGHDNEGYFVGAAGLLGLGKGPLSFPNQINSENGGRFSYCLTGRD 195
Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
D T R SSLI + + G+ +TP +N V S +YY+ + I+VGG
Sbjct: 196 TDSTER-SSLIFGDAA----VPPAGVRFTPQASNLRV------STFYYLKMTGISVGGSI 244
Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEF---VSQMVKNRNYTRALGA 370
+ + LD GNGG I+DSGT+ T + + L + F S +V ++
Sbjct: 245 LTIPTSAFQLDSLGNGGVIIDSGTSVTRLQNAAYASLREAFRAGTSDLVLTTEFSL---- 300
Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA 430
C+++ + P + LHF+GGA++ LP NY V S CL
Sbjct: 301 -----FDTCYNLSDLSSVDVPTVTLHFQGGADLKLPASNYLVPVDNSSTFCLAFA----G 351
Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
+ GPSII GN Q Q + V YD + ++GF C
Sbjct: 352 TTGPSII-GNIQQQGFRVIYDNLHNQVGFVPSQCD 385
>gi|357159298|ref|XP_003578403.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 442
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 108/392 (27%), Positives = 172/392 (43%), Gaps = 54/392 (13%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y S G+PPQ ++DTGS L+W C K C+ +P + LS SS +
Sbjct: 86 YIASYLIGSPPQRTEALIDTGSDLIWTQCATTCLPKSCAKQGLPYY--NLSQSSTFVPV- 142
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICP-----SYLVLYGSGLTEGIALSETLNL 203
C D+ + N +C +++ YG+G G +E+
Sbjct: 143 ---------------PCADKAGFCAANGVHLCGLDGSCTFIASYGAGRVIGSLGTESFAF 187
Query: 204 PNRIIPNFLVGCSVLSS------RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDT 257
+ + GC L+ +G+ G GRG+ SL SQ+ +FSYCL + F +
Sbjct: 188 ESGTT-SLAFGCVSLTRITSGALNDASGLIGLGRGRLSLVSQIGATRFSYCLTPY-FHSS 245
Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
+S L + +S + PFV +P + +S +YY+ L ITVG R+
Sbjct: 246 GASSHLFVGASASLGGGGAS----MPFVKSP---KDYPYSTFYYLPLEGITVGKTRLPAV 298
Query: 318 HKYLTLDRD-----GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
+ R GG I+D+G+ T +A +E L +E +Q+ + A
Sbjct: 299 NSTTFQLRQLFKGYWAGGVIIDTGSPLTQLASHAYEALKEEVAAQLGNG----SLVPAPE 354
Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG 432
+GL C G + P L HF GGA++ +P +Y+A V + +A C+ ++ G
Sbjct: 355 DSGLELCVAREGFQK-VVPALVFHFGGGADMAVPAASYWAPV-DKAAACMMIL-----EG 407
Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
G I+GNFQ Q+ ++ YDLR R F+ C
Sbjct: 408 GYDSIIGNFQQQDMHLLYDLRRGRFSFQTADC 439
>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 118/401 (29%), Positives = 166/401 (41%), Gaps = 55/401 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS-FIPKLSSSSRLL 145
G Y + L GTPPQ + + DTGS LVW C+ C+ C+ S F+ + S++
Sbjct: 87 GQYFVDLRLGTPPQKLLLVADTGSDLVWVKCS---ACRNCTRHTPGSAFLARHSTTFSPN 143
Query: 146 GCQNPKCSWI----HHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSET 200
C + C + HH CN L + C Y YG G T G ET
Sbjct: 144 HCYDSACQLVPLPKHHR------CNHARLHSP------C-RYEYSYGDGSKTSGFFSKET 190
Query: 201 LNL-----PNRIIPNFLVGC---------SVLSSRQPAGIAGFGRGKTSLPSQLNL---D 243
L + GC S S G+ G GRG SL SQL +
Sbjct: 191 TTLNTSSGREAKLKGIAFGCAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHRFGN 250
Query: 244 KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVG 303
KFSYCL+ H + TS L++ + + + +TP NP +YY+G
Sbjct: 251 KFSYCLMDHDISPSP-TSYLLIGSTQNDVAPGKRRMRFTPLHINPLSP------TFYYIG 303
Query: 304 LRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRN 363
+ ++V G ++ + LD GNGGTIVDSGTT TF+ EP + ++ V R
Sbjct: 304 IESVSVDGIKLPINPSVWALDELGNGGTIVDSGTTLTFLP----EPAYLQILT--VIKRR 357
Query: 364 YTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLT 423
AE G C +V + P+L G + + P NYF E CL
Sbjct: 358 VRLPSPAEPTPGFDLCVNVSEIEHPRLPKLSFKLGGDSVFSPPPRNYFVDTDE-DVKCLA 416
Query: 424 VVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ SG ++GN Q + +E+D RLGF + C
Sbjct: 417 LQAVMTPSG--FSVIGNLMQQGFLLEFDKDRTRLGFSRHGC 455
>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
Length = 519
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 120/391 (30%), Positives = 167/391 (42%), Gaps = 62/391 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y +++ GTP + DTGS W C Y K+ F P SS+ +
Sbjct: 178 GNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKL--FDPARSSTYANIS 235
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C P CS + R C + NC Y V YG G + G +TL L +
Sbjct: 236 CAAPACS-----DLDTRGC------SGGNCL-----YGVQYGDGSYSIGFFAMDTLTLSS 279
Query: 206 -RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLDK----FSYCLLSHKFDDT 257
+ F GC + + AG+ G GRGKTSLP Q DK F++CL +
Sbjct: 280 YDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQ-TYDKYGGVFAHCLPARS---- 334
Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFV--NNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
+ + LD G LT TP + N P+ +YYVG+ I VGGQ +
Sbjct: 335 --SGTGYLDFGPGSPAAAGARLT-TPMLTDNGPT---------FYYVGMTGIRVGGQLLS 382
Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
+ T GTIVDSGT T + P + L F S M R Y + A A++
Sbjct: 383 IPQSVFT-----TAGTIVDSGTVITRLPPAAYSSLRSAFASAMAA-RGYKK---APAVSL 433
Query: 376 LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVEN--YFAVVGEGSAVCLTVVTDREASGG 433
L C+D G + P + L F+GGA + + Y A V S VCL + + GG
Sbjct: 434 LDTCYDFTGMSQVAIPTVSLLFQGGARLDVDASGIMYAASV---SQVCLGFAANED--GG 488
Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN Q++ + V YD+ + +GF C
Sbjct: 489 DVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 519
>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 460
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 120/396 (30%), Positives = 180/396 (45%), Gaps = 70/396 (17%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCK----YCSSSKIPSFIPKLSSSS 142
G Y + L G+PP+ ILDTGS L W QCK YC S P F P S++
Sbjct: 118 GNYYLKLGLGSPPKYYTMILDTGSSLSWL------QCKPCVVYCHSQVDPLFEPSASNTY 171
Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETL 201
R L C + +CS + ++ ++PL T+ +C Y YG + + G + L
Sbjct: 172 RPLYCSSSECSLLKAATL------NDPLCTASG---VC-VYTASYGDASYSMGYLSRDLL 221
Query: 202 NL-PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKF 254
L P++ +P+F GC + + AGI G R K S+ +QL+ FSYCL
Sbjct: 222 TLTPSQTLPSFTYGCGQDNEGLFGKAAGIVGLARDKLSMLAQLSPKYGYAFSYCL----- 276
Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTY--TPFV---NNPSVAERNAFSVYYYVGLRRITV 309
T TSS G S K + +Y TP + NPS+ Y++ L ITV
Sbjct: 277 --PTSTSS----GGGFLSIGKISPSSYKFTPMIRNSQNPSL---------YFLRLAAITV 321
Query: 310 GGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
G+ V V + TI+DSGT T + ++ L + FV M +R Y +
Sbjct: 322 AGRPVGVAAAGYQVP------TIIDSGTVVTRLPISIYAALREAFVKIM--SRRYEQ--- 370
Query: 370 AEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDRE 429
A A + L CF + PE+++ F+GGA+++L N +G A CL + +
Sbjct: 371 APAYSILDTCFKGSLKSMSGAPEIRMIFQGGADLSLRAPNILIEADKGIA-CLAFASSNQ 429
Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
+ I+GN Q Q Y + YD+ ++GF C+
Sbjct: 430 IA-----IIGNHQQQTYNIAYDVSASKIGFAPGGCR 460
>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
gi|238011188|gb|ACR36629.1| unknown [Zea mays]
Length = 342
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 120/378 (31%), Positives = 161/378 (42%), Gaps = 53/378 (14%)
Query: 104 FILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCR 163
+LDTGS +VW C C+ C P F P+ SSS +GC C + R
Sbjct: 1 MVLDTGSDVVWVQCA---PCRRCYEQSGPVFDPRRSSSYGAVGCGAALCRRLDSGGCDLR 57
Query: 164 DCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRI-IPNFLVGCSVLSSR 221
C Y V YG G +T G ++ETL + +GC +
Sbjct: 58 ---------RGACM-----YQVAYGDGSVTAGDFVTETLTFAGGARVARVALGCGHDNEG 103
Query: 222 ---QPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSH------KFDDTTRTSSLILDNGS 269
AG+ G GRG S P+Q++ FSYCL+ + R+S++ GS
Sbjct: 104 LFVAAAGLLGLGRGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFGAGS 163
Query: 270 SHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV-RVWHKYLTLD-RDG 327
+ + +TP V NP + +YYV L I+VGG RV V L LD G
Sbjct: 164 VGASSAS----FTPMVRNPRM------ETFYYVQLVGISVGGARVPGVAESDLRLDPSTG 213
Query: 328 NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKT 387
GG IVDSGT+ T +A + L D F + L + C+D+ G +
Sbjct: 214 RGGVIVDSGTSVTRLARASYSALRDAFRAAAAGGLR----LSPGGFSLFDTCYDLGGRRV 269
Query: 388 GSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREASGGPSIILGNFQMQNY 446
P + +HF GGAE LP ENY V C TD GG SII GN Q Q +
Sbjct: 270 VKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTD----GGVSII-GNIQQQGF 324
Query: 447 YVEYDLRNQRLGFKQQLC 464
V +D QR+GF + C
Sbjct: 325 RVVFDGDGQRVGFAPKGC 342
>gi|56784779|dbj|BAD82000.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 486
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 106/381 (27%), Positives = 161/381 (42%), Gaps = 43/381 (11%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC--KYCSSSKIPSFIPKLSSSSRL 144
G Y +S S GTPPQ++ +LD S VW C+ C +++ P F LSS+ R
Sbjct: 95 GMYVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATCGADAPAATSAPPFYAFLSSTIRE 154
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL---TEGIALSETL 201
+ C N C + ++ +D P Y +YG G T G+ +
Sbjct: 155 VRCANRGCQRLVPQTCS---ADDSPCG-----------YSYVYGGGAANTTAGLLAVDAF 200
Query: 202 NLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
+ GC+V + G+ G GRG+ S SQL + +FSY L DD
Sbjct: 201 AFATVRADGVIFGCAVATEGDIGGVIGLGRGELSPVSQLQIGRFSYYLAP---DDAVDVG 257
Query: 262 SLI--LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
S I LD+ + + V+ P VA R + S+ YYV L I V G+ + +
Sbjct: 258 SFILFLDDAKPRTSRA---------VSTPLVASRASRSL-YYVELAGIRVDGEDLAIPRG 307
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
L DG+GG ++ TF+ + A + V Q + ++ RA L GL C
Sbjct: 308 TFDLQADGSGGVVLSITIPVTFL-----DAGAYKVVRQAMASKIELRAADGSEL-GLDLC 361
Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
+ T P + L F GGA + L + NYF + CLT++ G +LG
Sbjct: 362 YTSESLATAKVPSMALVFAGGAVMELEMGNYFYMDSTTGLECLTILPSPAGDGS---LLG 418
Query: 440 NFQMQNYYVEYDLRNQRLGFK 460
+ ++ YD+ RL F+
Sbjct: 419 SLIQVGTHMIYDISGSRLVFE 439
>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
Length = 460
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 115/387 (29%), Positives = 162/387 (41%), Gaps = 51/387 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G + + ++ GTP ILDTGS L W C C C P + P SS+ +
Sbjct: 113 GEFLMKMAIGTPSLSFSAILDTGSDLTWTQCK---PCTDCYPQPTPIYDPSQSSTYSKVP 169
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPN 205
C + C + S +C YL YG T+GI E+ L +
Sbjct: 170 CSSSMCQALPMYSCSGANCE----------------YLYSYGDQSSTQGILSYESFTLTS 213
Query: 206 RIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLP----SQLNL---DKFSYCLLSHKFDDTT 258
+ +P+ GC + G G P SQL +KFSYCL+S D +
Sbjct: 214 QSLPHIAFGCGQENEGGGFSQGGGLVGFGRGPLSLISQLGQSLGNKFSYCLVSIT-DSPS 272
Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
+TS L + KT L + P V R+ +YY+ L I+VGGQ + +
Sbjct: 273 KTSPLFI--------GKTASLNAKTVSSTPLVQSRSR-PTFYYLSLEGISVGGQLLDIAD 323
Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
L DG GG I+DSGTT T++ ++ + +S + N + G+ GL
Sbjct: 324 GTFDLQLDGTGGVIIDSGTTVTYLEQSGYDVVKKAVISSI----NLPQVDGSN--IGLDL 377
Query: 379 CFD-VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
CF+ G T FP + HF+ GA+ LP ENY G A CL ++ S I
Sbjct: 378 CFEPQSGSSTSHFPTITFHFE-GADFNLPKENYIYTDSSGIA-CLAMLPSNGMS-----I 430
Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
GN Q QNY + YD L F +C
Sbjct: 431 FGNIQQQNYQILYDNERNVLSFAPTVC 457
>gi|302764208|ref|XP_002965525.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
gi|300166339|gb|EFJ32945.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
Length = 464
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 115/402 (28%), Positives = 168/402 (41%), Gaps = 74/402 (18%)
Query: 79 TNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKL 138
T +S + G Y S++ G+PP+ ++DTGS L W +C CS P
Sbjct: 114 TPVSFTNGGVYYSSITLGSPPKDFSLVMDTGSDLTW------VRCDPCS--------PDC 159
Query: 139 SSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALS 198
SS+ L ++++ C D D L P L L+ G +L
Sbjct: 160 SSTFDRLASNT-------YKALTCAD--DLRL----------PVLLRLWRRLFHSGRSLR 200
Query: 199 ETLNLPNRI------IPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFS 246
+TL + P F+ GC L GI G S PSQ+ +KFS
Sbjct: 201 DTLKMAGAASDELEEFPGFVFGCGSLLKGLISGEVGILALSPGSLSFPSQIGEKYGNKFS 260
Query: 247 YCLLSHKFDDTTRTSSLILDNGSSHSDKKTTG----LTYTPFVNNPSVAERNAFSVYYYV 302
YCLL ++ + S ++ + + +G L YTP + S+YY V
Sbjct: 261 YCLLRQTAQNSLKKSPMVFGEAAVELKEPGSGKPQELQYTPIGES---------SIYYTV 311
Query: 303 GLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNR 362
L I+VG QR+ + +D TI DSGTT T + + + + S MV
Sbjct: 312 RLDGISVGNQRLDLSPSTFLNGQDKP--TIFDSGTTLTMLPSGVCDSIKQSLAS-MVSGA 368
Query: 363 NYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCL 422
+ A+ GL CF VP P++ HF GGA+ NY V+ GS CL
Sbjct: 369 EFV------AIKGLDACFRVPPSSGQGLPDITFHFNGGADFVTRPSNY--VIDLGSLQCL 420
Query: 423 TVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
V E S I GN Q Q+++V +D+ N+R+GFK+ C
Sbjct: 421 IFVPTNEVS-----IFGNLQQQDFFVLHDMDNRRIGFKETDC 457
>gi|359482097|ref|XP_002271077.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 113/405 (27%), Positives = 175/405 (43%), Gaps = 59/405 (14%)
Query: 84 HSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPC--TNHYQCKYCSSSKIPSFIPKLSSS 141
H ++SL+ GTPPQ + +LDTGS L W C T +Q +F P SSS
Sbjct: 80 HHNVSLTVSLTVGTPPQNVSMVLDTGSELSWLRCNKTQTFQT---------TFDPNRSSS 130
Query: 142 SRLLGCQNPKCSWIHHESIQCRD-CNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSET 200
+ C S+ C D D P+ S + Q+C + L + +EG S+T
Sbjct: 131 YSPVPC----------SSLTCTDRTRDFPIPASCDSNQLCHAILSYADASSSEGNLASDT 180
Query: 201 LNLPNRIIPNFLVGC-------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHK 253
+ N +P + GC + + G+ G RG S SQ++ KFSYC+
Sbjct: 181 FYIGNSDMPGTIFGCMDSSFSTNTEEDSKNTGLMGMNRGSLSFVSQMDFPKFSYCISDSD 240
Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
F S ++L ++ S L YTP + S V Y V L I V +
Sbjct: 241 F------SGVLLLGDANFS--WLMPLNYTPLIQI-STPLPYFDRVAYTVQLEGIKVSSKL 291
Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQ------MVKNRNYTRA 367
+ + D G G T+VDSGT FTF+ ++ L +EF++Q ++++ NY
Sbjct: 292 LPLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYSALRNEFLNQTSQILRVLEDPNYVFQ 351
Query: 368 LGAEALTGLRPCFDVPGEKTG--SFPELKLHFKGGAEVTLPVENYF-----AVVGEGSAV 420
G + C+ VP +T P + L F+ GAE+ + + V G S
Sbjct: 352 GGMDL------CYRVPLSQTSLPWLPTVSLMFR-GAEMKVSGDRLLYRVPGEVRGSDSVY 404
Query: 421 CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
C T + + + ++G+ QN ++E+DL R+GF Q C
Sbjct: 405 CFT-FGNSDLLAVEAYVIGHHHQQNVWMEFDLEKSRIGFAQVQCD 448
>gi|218189440|gb|EEC71867.1| hypothetical protein OsI_04576 [Oryza sativa Indica Group]
Length = 508
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 107/381 (28%), Positives = 162/381 (42%), Gaps = 43/381 (11%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC--KYCSSSKIPSFIPKLSSSSRL 144
G Y +S S GTPPQ++ +LD S VW C+ C +++ P F LSS+ R
Sbjct: 95 GMYVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATCGADAPAATSAPPFYAFLSSTIRE 154
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL---TEGIALSETL 201
+ C N C + ++ +D P Y +YG G T G+ +
Sbjct: 155 VRCANRGCQRLVPQTCSA---DDSPCG-----------YSYVYGGGAANTTAGLLAVDAF 200
Query: 202 NLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
+ GC+V + G+ G GRG+ SL SQL + +FSY L DD
Sbjct: 201 AFATVRADGVIFGCAVATEGDIGGVIGLGRGELSLVSQLQIGRFSYYLAP---DDAVDVG 257
Query: 262 SLI--LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
S I LD+ + + V+ P VA R + S+ YYV L I V G+ + +
Sbjct: 258 SFILFLDDAKPRTSRA---------VSTPLVANRASRSL-YYVELAGIRVDGEDLAIPRG 307
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
L DG+GG ++ TF+ + A + V Q + ++ RA L GL C
Sbjct: 308 TFDLQADGSGGVVLSITIPVTFL-----DAGAYKVVRQAMASKIGLRAADGSEL-GLDLC 361
Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
+ T P + L F GGA + L + NYF + CLT++ G +LG
Sbjct: 362 YTSESLATAKVPSMALVFAGGAVMELEMGNYFYMDSTTGLECLTILPSPAGDGS---LLG 418
Query: 440 NFQMQNYYVEYDLRNQRLGFK 460
+ ++ YD+ RL F+
Sbjct: 419 SLIQVGTHMIYDISGSRLVFE 439
>gi|356499109|ref|XP_003518386.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 428
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 117/413 (28%), Positives = 179/413 (43%), Gaps = 58/413 (14%)
Query: 72 TTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKI 131
T T T + +S H ++SL+ G+PPQ + +LDTGS L W C K+
Sbjct: 43 TQTQTPSRKLSFHHNVTLTVSLTVGSPPQNVTMVLDTGSELSWLHC-----------KKL 91
Query: 132 P----SFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLY 187
P +F P LSSS C + C + + RD P + N ++C +
Sbjct: 92 PNLNSTFNPLLSSSYTPTPCNSSIC------TTRTRDLT-IPASCDPN-NKLCHVIVSYA 143
Query: 188 GSGLTEGIALSETLNLPNRIIPNFLVGC--------SVLSSRQPAGIAGFGRGKTSLPSQ 239
+ EG +ET +L P L GC + + G+ G RG SL +Q
Sbjct: 144 DASSAEGTLAAETFSLAGAAQPGTLFGCMDSAGYTSDINEDSKTTGLMGMNRGSLSLVTQ 203
Query: 240 LNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVY 299
++L KFSYC+ L+L +G+ + L YTP V + + V
Sbjct: 204 MSLPKFSYCISGED-----ALGVLLLGDGT----DAPSPLQYTPLV-TATTSSPYFNRVA 253
Query: 300 YYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM- 358
Y V L I V + +++ D G G T+VDSGT FTF+ ++ L DEF+ Q
Sbjct: 254 YTVQLEGIKVSEKLLQLPKSVFVPDHTGAGQTMVDSGTQFTFLLGSVYSSLKDEFLEQTK 313
Query: 359 -----VKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAV 413
+++ N+ GA L C+ P + P + L F GAE+ + E
Sbjct: 314 GVLTRIEDPNFVFE-GAMDL-----CYHAPA-SFAAVPAVTLVFS-GAEMRVSGERLLYR 365
Query: 414 VGEGS--AVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
V +GS C T + + G + ++G+ QN ++E+DL R+GF Q C
Sbjct: 366 VSKGSDWVYCFT-FGNSDLLGIEAYVIGHHHQQNVWMEFDLLKSRVGFTQTTC 417
>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
gi|223949441|gb|ACN28804.1| unknown [Zea mays]
Length = 326
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 111/367 (30%), Positives = 156/367 (42%), Gaps = 47/367 (12%)
Query: 104 FILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCR 163
+LDTGS + W C C C P F P LS+S + C +S +CR
Sbjct: 1 MVLDTGSDVTWVQCQ---PCADCYQQSDPVFDPSLSASYAAVSC----------DSQRCR 47
Query: 164 DCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRI-IPNFLVGCSVLSSR 221
D + A +N T C Y V YG G T G +ETL L + + N +GC +
Sbjct: 48 DLD---TAACRNATGAC-LYEVAYGDGSYTVGDFATETLTLGDSTPVGNVAIGCGHDNEG 103
Query: 222 ---QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTG 278
AG+ G G S PSQ++ FSYCL+ D+ S+L +G++ + G
Sbjct: 104 LFVGAAGLLALGGGPLSFPSQISASTFSYCLVDR---DSPAASTLQFGDGAAEA-----G 155
Query: 279 LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR-DGNGGTIVDSGT 337
P V +P S +YYV L I+VGGQ + + +D G+GG IVDSGT
Sbjct: 156 TVTAPLVRSPRT------STFYYVALSGISVGGQPLSIPASAFAMDATSGSGGVIVDSGT 209
Query: 338 TFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHF 397
T + + L D FV Q + T ++ C+D+ + P + L F
Sbjct: 210 AVTRLQSAAYAALRDAFV-QGAPSLPRT-----SGVSLFDTCYDLSDRTSVEVPAVSLRF 263
Query: 398 KGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRL 457
+GG + LP +NY V CL A I+GN Q Q V +D +
Sbjct: 264 EGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAAVS----IIGNVQQQGTRVSFDTARGAV 319
Query: 458 GFKQQLC 464
GF C
Sbjct: 320 GFTPNKC 326
>gi|413951280|gb|AFW83929.1| hypothetical protein ZEAMMB73_279135 [Zea mays]
Length = 451
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 115/390 (29%), Positives = 168/390 (43%), Gaps = 61/390 (15%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y GTP Q + +D + W PC ++ PSF P SS+ R + C
Sbjct: 107 YVARARLGTPAQALLVAIDPSNDAAWVPCAACA-----GCARAPSFDPTRSSTYRPVRCG 161
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRI- 207
P+CS S P +C ++ + Y + + + + L L + +
Sbjct: 162 APQCSQAPAPSC--------PGGLGSSC-----AFNLSYAASTFQALLGQDALALHDDVD 208
Query: 208 -IPNFLVGCSVL---SSRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRT 260
+ + GC + S P G+ GFGRG S PSQ FSYCL S+K + + T
Sbjct: 209 AVAAYTFGCLHVVTGGSVPPQGLVGFGRGPLSFPSQTKDVYGSVFSYCLPSYKSSNFSGT 268
Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
L G + K+ + TP ++NP R + YYV + I VGG+ V V
Sbjct: 269 LRL----GPAGQPKR---IKTTPLLSNP---HRPSL---YYVNMVGIRVGGRPVPVPASA 315
Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
L D GTIVD+GT FT ++ ++ + D F R+ RA A L G C+
Sbjct: 316 LAFDPTSGRGTIVDAGTMFTRLSAPVYAAVRDVF-------RSRVRAPVAGPLGGFDTCY 368
Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI---- 436
+V T S P + F G VTLP EN G CL + A+G P
Sbjct: 369 NV----TISVPTVTFSFDGRVSVTLPEENVVIRSSSGGIACLAM-----AAGPPDGVDAA 419
Query: 437 --ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+L + Q QN+ V +D+ N R+GF ++LC
Sbjct: 420 LNVLASMQQQNHRVLFDVANGRVGFSRELC 449
>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
Length = 494
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 120/400 (30%), Positives = 162/400 (40%), Gaps = 59/400 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y ++ GTP LDT S L W C C+ C P F P+ S+S +
Sbjct: 132 GEYMAKIAVGTPAVQALLALDTASDLTWLQCQ---PCRRCYPQSGPVFDPRHSTSYGEMN 188
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-----LTEGIALSETL 201
P C + +K T I Y V YG G + G + ETL
Sbjct: 189 YDAPDCQALGRSG----------GGDAKRGTCI---YTVQYGDGHGSTSTSVGDLVEETL 235
Query: 202 NLPNRIIPNFL-VGCS----VLSSRQPAGIAGFGRGKTSLPSQLNL----DKFSYCLLSH 252
+ +L +GC L AGI G GRG+ S+P Q+ FSYCL+
Sbjct: 236 TFAGGVRQAYLSIGCGHDNKGLFGAPAAGILGLGRGQISIPHQIAFLGYNASFSYCLVDF 295
Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
+ +S+L G+ T P P+V +N +YYV L ++VGG
Sbjct: 296 ISGPGSPSSTLTFGAGAVD--------TSPPASFTPTVLNQN-MPTFYYVRLIGVSVGGV 346
Query: 313 RV-RVWHKYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
RV V + L LD G GG I+DSGTT T +A +V+ R +LG
Sbjct: 347 RVPGVTERDLQLDPYTGRGGVILDSGTTVTRLA-------RPAYVAFRDAFRAAATSLGQ 399
Query: 371 EALTG----LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV- 425
+ G C+ V G P + +HF GG EV+L +NY V VC
Sbjct: 400 VSTGGPSGLFDTCYTVGGRAGVKVPAVSMHFAGGVEVSLQPKNYLIPVDSRGTVCFAFAG 459
Query: 426 -TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
DR S ++GN Q + V YDL QR+GF C
Sbjct: 460 TGDRSVS-----VIGNILQQGFRVVYDLAGQRVGFAPNNC 494
>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 496
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 118/413 (28%), Positives = 169/413 (40%), Gaps = 63/413 (15%)
Query: 61 HIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNH 120
I +PQ +T T+ T S G Y + + G P + ++DTGS + W C
Sbjct: 138 EILHPQDFSTPVTSGT------SQGSGEYFLRVGIGRPSKTFYMVIDTGSDVNWLQCK-- 189
Query: 121 YQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQIC 180
C C P F P SSS LGCQ P+C + + CR ND L
Sbjct: 190 -PCDDCYQQVDPIFDPASSSSFSRLGCQTPQCRNL--DVFACR--NDSCL---------- 234
Query: 181 PSYLVLYGSG-LTEGIALSETLNLPNR-IIPNFLVGCSVLSSRQPAGI-------AGFGR 231
Y V YG G T G +ET++ N + +GC G+ G G
Sbjct: 235 --YQVSYGDGSYTVGDFATETVSFGNSGSVDKVAIGCG----HDNEGLFVGAAGLIGLGG 288
Query: 232 GKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVA 291
G SL SQ+ FSYCL++ D+ +S+L N + SD T P N V
Sbjct: 289 GPLSLTSQIKASSFSYCLVNR---DSVDSSTLEF-NSAKPSDSVTA-----PIFKNSKV- 338
Query: 292 ERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLA 351
+YYVG+ ++VGG+++ + +D G GG IVD GT T + + + L
Sbjct: 339 -----DTFYYVGITGMSVGGEKLAIPPSIFEVDGSGKGGIIVDCGTAVTRLQTQAYNALR 393
Query: 352 DEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYF 411
D FV ++ K+ T C+++ + P + F GG + LP NY
Sbjct: 394 DTFV-KLTKDLPSTSGFAL-----FDTCYNLSSRTSVRVPTVAFLFDGGKSLPLPPSNYL 447
Query: 412 AVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
V CL + I+GN Q Q V YDL N ++ F + C
Sbjct: 448 IPVDSAGTFCLAFAPTTASLS----IIGNVQQQGTRVTYDLANSQVSFSSRKC 496
>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
Length = 448
Score = 122 bits (305), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 120/431 (27%), Positives = 188/431 (43%), Gaps = 57/431 (13%)
Query: 10 LSFIFFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTK- 68
LS F LL I P S+ + F SL+ ++ +R L + +++
Sbjct: 15 LSLPVFAVLLLISPVVAVSIGDADVGFRA-----------SLIRTAESRNLSLAAERSRR 63
Query: 69 --TTTTTTTTTTTNISSHSYGG-YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKY 125
+ T+ T T ++ GG Y + S G PP +I +DTGS L+W C+ C
Sbjct: 64 RLSVYTSGTGTKAPVTKSQKGGKYIMQFSIGEPPLLIWAEVDTGSDLMWVKCS---PCNG 120
Query: 126 CSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLV 185
C+ P + P S SS L C + C + I C+D+P +C Y
Sbjct: 121 CNPPPSPLYDPARSRSSGKLPCSSQLCQALGRGRIISDQCSDDP--------PLC-GYHY 171
Query: 186 LYG-SG--LTEGIALSETLNLPNRIIPN---FLVGCSVLSSR--QPAGIAGFGRGKTSLP 237
YG SG T+G+ +ET + + N F ++ S+ AG+ G GRG SL
Sbjct: 172 AYGHSGDHSTQGVLGTETFTFGDGYVANNVSFGRSDTIDGSQFGGTAGLVGLGRGHLSLV 231
Query: 238 SQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFS 297
SQL +F+YCL + S IL + D ++ TP V NP +R+
Sbjct: 232 SQLGAGRFAYCLAADP-----NVYSTILFGSLAALDTSAGDVSSTPLVTNPK-PDRD--- 282
Query: 298 VYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQ 357
+YYV L+ I+VGG R+ + ++ DG+GG DSG T + ++ + S+
Sbjct: 283 THYYVNLQGISVGGSRLPIKDGTFAINSDGSGGVFFDSGAIDTSLKDAAYQVVRQAITSE 342
Query: 358 MVKNRNYTRALGAEALTGLRPCFDVPGEKT-GSFPELKLHFKGGAEVTLPVENYFAVVGE 416
+ + LG +A G CF ++ P L LHF GA+++L NY +
Sbjct: 343 IQR-------LGYDA--GDDTCFVAANQQAVAQMPPLVLHFDDGADMSLNGRNYLKTSTK 393
Query: 417 GSA---VCLTV 424
G + VC+ +
Sbjct: 394 GPSEVLVCMAI 404
>gi|388495448|gb|AFK35790.1| unknown [Medicago truncatula]
Length = 434
Score = 122 bits (305), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 112/382 (29%), Positives = 160/382 (41%), Gaps = 51/382 (13%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + FGTPPQ + LDT S W PC+ C CS+SK F P S+S R + C
Sbjct: 97 YIVKAKFGTPPQTLLLALDTSSDAAWIPCSG---CVGCSTSK--PFAPIKSTSFRNVSCG 151
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
+P C + + + C ++ YGS + +TL L I
Sbjct: 152 SPHCKQVPNPTCGGSAC----------------AFNFTYGSSSIAASVVQDTLTLAADPI 195
Query: 209 PNFLVGC----SVLSSRQPAGIAGFGRGKTSLPSQLNLDK--FSYCLLSHKFDDTTRTSS 262
P + GC + S+ Q + + L NL K FSYCL S F + S
Sbjct: 196 PGYTFGCVNKTTGSSAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCLPS--FKSINFSGS 253
Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
L L G + K+ + YTP + NP R++ YYV L I VG + V + L
Sbjct: 254 LRL--GPVYQPKR---IKYTPLLRNP---RRSSL---YYVNLVAIKVGRKIVDIPPAALA 302
Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
+ GTI DSGT FT +A ++ + +EF R L L G C++V
Sbjct: 303 FNPTTGAGTIFDSGTVFTRLAEPVYTAVRNEF------RRRVGPKLPVTTLGGFDTCYNV 356
Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
P P + F G V LP +N GS CL + + ++ N Q
Sbjct: 357 PIV----VPTITFLF-SGMNVALPPDNIVIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQ 411
Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
QN+ V +D+ N R+G ++LC
Sbjct: 412 QQNHRVLFDVPNSRIGIARELC 433
>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
Length = 449
Score = 122 bits (305), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 120/405 (29%), Positives = 178/405 (43%), Gaps = 54/405 (13%)
Query: 82 SSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS------FI 135
+ + G Y ++ GTP Q + DTGS L W C H + + CS+ K F
Sbjct: 76 ADYGIGQYFVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFH 135
Query: 136 PKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNC-TQICP-SYLVLYGSGLTE 193
LSSS + + C C I+ D L + NC T + P Y Y G T
Sbjct: 136 ANLSSSFKTIPCLTDMC------KIELMD-----LFSLTNCPTPLTPCGYDYRYSDGSTA 184
Query: 194 -GIALSETLNLPNR-----IIPNFLVGCSV----LSSRQPAGIAGFGRGKTSLPSQLNLD 243
G +ET+ + + + N L+GCS S + G+ G G K S +
Sbjct: 185 LGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEK 244
Query: 244 ---KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKK-TTGLTYTPFVNNPSVAERNAFSVY 299
KFSYCL+ H + + S L GSS S + +TYT V + N+F
Sbjct: 245 FGGKFSYCLVDHL---SHKNVSNYLTFGSSRSKEALLNNMTYTELV----LGMVNSF--- 294
Query: 300 YYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMV 359
Y V + I++GG +++ + D G GGTI+DSG++ TF+ ++P+ ++
Sbjct: 295 YAVNMMGISIGGAMLKIPSE--VWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLL 352
Query: 360 KNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSA 419
K R +G L CF+ G + P L HF GAE PV++Y +G
Sbjct: 353 KFRKVEMDIGP-----LEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISAADG-V 406
Query: 420 VCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
CL V+ A G S++ GN QN+ E+DL ++LGF C
Sbjct: 407 RCLGFVS--VAWPGTSVV-GNIMQQNHLWEFDLGLKKLGFAPSSC 448
>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
Length = 482
Score = 122 bits (305), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 122/388 (31%), Positives = 169/388 (43%), Gaps = 52/388 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y ++ FGTP + I+DTGS L W C C C S F PK SSS + L
Sbjct: 135 GNYIVTAGFGTPAKNSLLIIDTGSDLTWIQCK---PCADCYSQVDAIFEPKQSSSYKTLP 191
Query: 147 CQNPKCS-WIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLP 204
C + C+ I ES N P C Y + YG G ++G ETL L
Sbjct: 192 CLSATCTELITSES------NPTP------CLLGGCVYEINYGDGSSSQGDFSQETLTLG 239
Query: 205 NRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTT 258
+ NF GC ++ + +G+ G G+ S PSQ +F+YCL F +T
Sbjct: 240 SDSFQNFAFGCGHTNTGLFKGSSGLLGLGQNSLSFPSQSKSKYGGQFAYCL--PDFGSST 297
Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
T S + GS + +TP V+N + +Y+VGL I+VGG R+ +
Sbjct: 298 STGSFSVGKGSIPASA-----VFTPLVSN------FMYPTFYFVGLNGISVGGDRLSIPP 346
Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL-GAEALTGLR 377
L G G TIVDSGT T + P+ + L F R+ TR L A+ + L
Sbjct: 347 AVL-----GRGSTIVDSGTVITRLLPQAYNALKTSF-------RSKTRDLPSAKPFSILD 394
Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTL-PVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
C+D+ P + HF+ A+V + V V GS VCL + + G
Sbjct: 395 TCYDLSRHSQVRIPTITFHFQNNADVAVSDVGILVPVQNGGSQVCLAFASASQMDG--FN 452
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GNFQ Q V +D R+GF C
Sbjct: 453 IIGNFQQQRMRVAFDTGAGRIGFASGSC 480
>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 122 bits (305), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 112/383 (29%), Positives = 171/383 (44%), Gaps = 38/383 (9%)
Query: 91 ISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNP 150
+SL GTPPQ ILDTGS L W C K S+ F P LSSS +L C +P
Sbjct: 79 VSLPIGTPPQSQQMILDTGSQLSWIQCHKKVPRKPPPST---VFDPSLSSSFSVLPCNHP 135
Query: 151 KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLP-NRII 208
C D L TS + ++C Y Y G L EG + E + ++
Sbjct: 136 LCK---------PRIPDFTLPTSCDLNRLC-HYSYFYADGTLAEGNLVREKITFSTSQST 185
Query: 209 PNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDT-TRTSSLIL-D 266
P ++GC+ +S GI G G+ S SQ + KFSYC+ + + T T S L +
Sbjct: 186 PPLILGCAEDASDD-KGILGMNLGRLSFASQAKITKFSYCVPTRQVRPGFTPTGSFYLGE 244
Query: 267 NGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRD 326
N +S + + LT++ P N + + V L+ I +G +++ + D
Sbjct: 245 NPNSAGFQYISLLTFSQSQRMP-----NLDPLAHTVALQGIRIGNKKLNIPVSAFRADPS 299
Query: 327 GNGGTIVDSGTTFTFMAPELFEPLADEFVS----QMVKNRNYTRALGAEALTGLRPCFDV 382
G G +++DSG+ FT++ + + +E V ++ K Y+ G + CFD
Sbjct: 300 GAGQSMIDSGSEFTYLVDVAYNKVREEVVRLAGPRLKKGYVYS---GVSDM-----CFDG 351
Query: 383 PGEKTGSF-PELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNF 441
+ G + F G E+ + A VG G V + E G S I+GNF
Sbjct: 352 NAMEIGRLIGNMVFEFDKGVEIVIEKGRVLADVGGG--VHCVGIGRSEMLGAASNIIGNF 409
Query: 442 QMQNYYVEYDLRNQRLGFKQQLC 464
QN +VE+D+ N+R+GF + C
Sbjct: 410 HQQNLWVEFDIANRRVGFGKADC 432
>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
Length = 473
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 110/372 (29%), Positives = 172/372 (46%), Gaps = 52/372 (13%)
Query: 105 ILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRD 164
I+DT S L W C C C + P F P S S +L C + C +++Q
Sbjct: 141 IVDTASELTWVQCA---PCASCHDQQGPLFDPASSPSYAVLPCNSSSC-----DALQVAT 192
Query: 165 CNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSSRQP 223
+ Q SY + Y G ++G+ + L+L +I F+ GC S++ P
Sbjct: 193 GSAAGACGGGE--QPSCSYTLSYRDGSYSQGVLAHDKLSLAGEVIDGFVFGCGT-SNQGP 249
Query: 224 ----AGIAGFGRGKTSLPSQLNLDKF----SYCLLSHKFDDTTRTSSLILDNGSSHSDKK 275
+G+ G GR + SL SQ +D+F SYCL ++ + SL+L + +S +
Sbjct: 250 FGGTSGLMGLGRSQLSLISQ-TMDQFGGVFSYCL---PLKESESSGSLVLGDDTSVY-RN 304
Query: 276 TTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDS 335
+T + YT V++P +Y+V L IT+GGQ V G IVDS
Sbjct: 305 STPIVYTTMVSDPVQGP------FYFVNLTGITIGGQEVE----------SSAGKVIVDS 348
Query: 336 GTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKL 395
GT T + P ++ + EF+SQ + Y +A G + L CF++ G + P LK
Sbjct: 349 GTIITSLVPSVYNAVKAEFLSQFAE---YPQAPG---FSILDTCFNLTGFREVQIPSLKF 402
Query: 396 HFKGGAEVTLPVEN--YFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLR 453
F+G EV + YF V + S VCL + + + S + I+GN+Q +N V +D
Sbjct: 403 VFEGNVEVEVDSSGVLYF-VSSDSSQVCLALASLK--SEYETSIIGNYQQKNLRVIFDTL 459
Query: 454 NQRLGFKQQLCK 465
++GF Q+ C
Sbjct: 460 GSQIGFAQETCD 471
>gi|357456413|ref|XP_003598487.1| Peptidase A1, putative [Medicago truncatula]
gi|355487535|gb|AES68738.1| Peptidase A1, putative [Medicago truncatula]
Length = 414
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 113/382 (29%), Positives = 165/382 (43%), Gaps = 52/382 (13%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + GTPPQ + +DT + W PCT C C+S+ F P+ S++ + + C
Sbjct: 78 YIVRAKIGTPPQTLLLAMDTSNDAAWIPCT---ACDGCASTL---FAPEKSTTFKNVSCA 131
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
P+C + + CN + + YGS + +T+ L +
Sbjct: 132 APECKQVPNPGCGVSSCN----------------FNLTYGSSSIAANLVQDTITLATDPV 175
Query: 209 PNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTTRTSS 262
P++ GC + +S P G+ G GRG SL SQ L FSYCL S F + S
Sbjct: 176 PSYTFGCVSKTTGTSAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPS--FKSLNFSGS 233
Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
L L G K+ + YTP + NP R++ YYV L I VG + V + L
Sbjct: 234 LRL--GPVAQPKR---IKYTPLLKNP---RRSSL---YYVNLEAIRVGRKVVDIPPAALA 282
Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
+ GTI DSGT FT + ++ + DEF R L +L G C++V
Sbjct: 283 FNPTTGAGTIFDSGTVFTRLVAPVYVAVRDEF------RRRVGPKLTVTSLGGFDTCYNV 336
Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
P P + F G VTLP +N GS CL + + ++ N Q
Sbjct: 337 PI----VVPTITFIFTG-MNVTLPQDNILIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQ 391
Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
QN+ V YD+ N R+G ++LC
Sbjct: 392 QQNHRVLYDVPNSRVGVARELC 413
>gi|222640709|gb|EEE68841.1| hypothetical protein OsJ_27628 [Oryza sativa Japonica Group]
Length = 375
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 122/414 (29%), Positives = 179/414 (43%), Gaps = 66/414 (15%)
Query: 61 HIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNH 120
HI + + T + + S G+S+++ P ++I +DTGS L+W
Sbjct: 15 HIISHSRNVSAALVVRTPSRRTDGSDQGHSLTVGIVQPRKLI---VDTGSDLIW------ 65
Query: 121 YQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQIC 180
QCK SS+ ++++R H R A ++ CT
Sbjct: 66 TQCKLSSST---------AAAAR------------HGSPPLSRTAPARTGAFTRTCTASA 104
Query: 181 PSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQ---PAGIAGFGRGKTSLP 237
+ VL T G + +L L F GC LS+ GI G SL
Sbjct: 105 AAVGVLASETFTFGARRAVSLRL------GF--GCGALSAGSLIGATGILGLSPESLSLI 156
Query: 238 SQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTG-LTYTPFVNNPSVAERNAF 296
+QL + +FSYCL F D +TS L+ + S KTT + T V+NP
Sbjct: 157 TQLKIQRFSYCLT--PFADK-KTSPLLFGAMADLSRHKTTRPIQTTAIVSNP------VE 207
Query: 297 SVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVS 356
+VYYYV L I++G +R+ V L + DG GGTIVDSG+T ++ FE + E V
Sbjct: 208 TVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAV-KEAVM 266
Query: 357 QMVKNRNYTRALGAEALTGLRPCFDVPGEKTGS------FPELKLHFKGGAEVTLPVENY 410
+V+ R + L CF +P + P L LHF GGA + LP +NY
Sbjct: 267 DVVRLPVANRTVEDYEL-----CFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDNY 321
Query: 411 FAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
F G +CL V + SG I+GN Q QN +V +D+++ + F C
Sbjct: 322 FQEPRAG-LMCLAVGKTTDGSG--VSIIGNVQQQNMHVLFDVQHHKFSFAPTQC 372
>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
Length = 472
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 110/372 (29%), Positives = 172/372 (46%), Gaps = 52/372 (13%)
Query: 105 ILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRD 164
I+DT S L W C C C + P F P S S +L C + C +++Q
Sbjct: 140 IVDTASELTWVQCA---PCASCHDQQGPLFDPASSPSYAVLPCNSSSC-----DALQVAT 191
Query: 165 CNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSSRQP 223
+ Q SY + Y G ++G+ + L+L +I F+ GC S++ P
Sbjct: 192 GSAAGACGGGE--QPSCSYTLSYRDGSYSQGVLAHDKLSLAGEVIDGFVFGCGT-SNQGP 248
Query: 224 ----AGIAGFGRGKTSLPSQLNLDKF----SYCLLSHKFDDTTRTSSLILDNGSSHSDKK 275
+G+ G GR + SL SQ +D+F SYCL ++ + SL+L + +S +
Sbjct: 249 FGGTSGLMGLGRSQLSLISQ-TMDQFGGVFSYCL---PLKESESSGSLVLGDDTSVY-RN 303
Query: 276 TTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDS 335
+T + YT V++P +Y+V L IT+GGQ V G IVDS
Sbjct: 304 STPIVYTTMVSDPVQGP------FYFVNLTGITIGGQEVE----------SSAGKVIVDS 347
Query: 336 GTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKL 395
GT T + P ++ + EF+SQ + Y +A G + L CF++ G + P LK
Sbjct: 348 GTIITSLVPSVYNAVKAEFLSQFAE---YPQAPG---FSILDTCFNLTGFREVQIPSLKF 401
Query: 396 HFKGGAEVTLPVEN--YFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLR 453
F+G EV + YF V + S VCL + + + S + I+GN+Q +N V +D
Sbjct: 402 VFEGNVEVEVDSSGVLYF-VSSDSSQVCLALASLK--SEYETSIIGNYQQKNLRVIFDTL 458
Query: 454 NQRLGFKQQLCK 465
++GF Q+ C
Sbjct: 459 GSQIGFAQETCD 470
>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 455
Score = 121 bits (304), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 111/387 (28%), Positives = 169/387 (43%), Gaps = 56/387 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKY-CSSSKIPSFIPKLSSSSRLL 145
G Y + GTP + ++DTGS L W C+ C+ C P F PK SSS +
Sbjct: 115 GNYVTRMGLGTPAKPYIMVVDTGSSLTWLQCS---PCRVSCHRQSGPVFDPKTSSSYAAV 171
Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLP 204
C +P+C + ++ C+ + +C Y YG S + G +T++
Sbjct: 172 SCSSPQCDGLSTATLNPAVCSP---------SNVC-IYQASYGDSSFSVGYLSKDTVSFG 221
Query: 205 NRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTT 258
+PNF GC + + AG+ G R K SL QL FSYCL S
Sbjct: 222 ANSVPNFYYGCGQDNEGLFGRSAGLMGLARNKLSLLYQLAPTLGYSFSYCLPS------- 274
Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV-W 317
+SS L GS + G +YTP V+N Y++ L +TV G+ + V
Sbjct: 275 TSSSGYLSIGSYNPG----GYSYTPMVSN------TLDDSLYFISLSGMTVAGKPLAVSS 324
Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
+Y +L TI+DSGT T + ++ L+ + M + T+ A ++ L
Sbjct: 325 SEYTSLP------TIIDSGTVITRLPTSVYTALSKAVAAAM---KGSTKRAAAYSI--LD 373
Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
CF+ K + P + + F GGA + L N V +G+ CL R A+ I
Sbjct: 374 TCFEGQASKLRAVPAVSMAFSGGATLKLSAGNLLVDV-DGATTCLAFAPARSAA-----I 427
Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+GN Q Q + V YD+++ R+GF C
Sbjct: 428 IGNTQQQTFSVVYDVKSNRIGFAAAGC 454
>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
Length = 373
Score = 121 bits (304), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 107/392 (27%), Positives = 173/392 (44%), Gaps = 48/392 (12%)
Query: 95 FGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSW 154
GTPP+ + ++DT S L W T+ C CS +K+P F P LSSS C + C
Sbjct: 5 IGTPPREVLLLVDTASELTWVQGTS---CTNCSPTKVPPFNPGLSSSFISEPCTSSVCLG 61
Query: 155 IHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTE-GIALSETLNLPN-----RII 208
Q CN ++ +C S+ V Y G G+ E +L + +
Sbjct: 62 RSKLGFQSA-CNR----STGSC-----SFQVAYLDGSEAYGVIAREIFSLQSWDGAASTL 111
Query: 209 PNFLVGCSVLSSRQP----AGIAGFGRGKTSLPSQLNL-------DKFSYCLLSHKFDDT 257
+ + GC+ ++P +G G RG S P+Q+ D+FSYC ++ +
Sbjct: 112 GDVIFGCASKDLQRPVDFSSGTLGLNRGSFSFPAQIGSRSKSGLSDRFSYCF-PNRAEHL 170
Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
+ +I + S Y P +A + +YYVGL+ I+VGG+ + +
Sbjct: 171 NSSGVIIFGD----SGIPAHHFQYLSLEQEPPIA---SIVDFYYVGLQGISVGGELLHIP 223
Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
+DR GNGGT DSGTT +F+ L + F +++ + R G++ L
Sbjct: 224 RSAFKIDRLGNGGTYFDSGTTVSFLVEPAHTALVEAFGRRVL---HLNRTSGSDFTKEL- 279
Query: 378 PCFDVPG--EKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAV---CLTVVTDREASG 432
C+DV + + P + LHFK ++ L + + + V CL V +
Sbjct: 280 -CYDVAAGDARLPTAPLVTLHFKNNVDMELREASVWVPLARTPQVVTICLAFVNAGAVAQ 338
Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
G ++GN+Q Q+Y +E+DL R+GF C
Sbjct: 339 GGVNVIGNYQQQDYLIEHDLERSRIGFAPANC 370
>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 461
Score = 121 bits (304), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 119/386 (30%), Positives = 168/386 (43%), Gaps = 49/386 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + GTP + + +LDTGS +VW C C+ C + F P S + +
Sbjct: 116 GEYFTRIGVGTPARYVYMVLDTGSDVVWLQCA---PCRKCYTQTDHVFDPTKSRTYAGIP 172
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C P C + D P ++KN ++C Y V YG G T G +ETL
Sbjct: 173 CGAPLCRRL-----------DSPGCSNKN--KVC-QYQVSYGDGSFTFGDFSTETLTFRR 218
Query: 206 RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTR 259
+ +GC + AG+ G GRG+ S P Q KFSYCL+ + +
Sbjct: 219 NRVTRVALGCGHDNEGLFTGAAGLLGLGRGRLSFPVQTGRRFNHKFSYCLVDRS--ASAK 276
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
SS+I + + +TP + NP + +YY+ L I+VGG VR
Sbjct: 277 PSSVIFGDSAVSRTAH-----FTPLIKNPKL------DTFYYLELLGISVGGAPVRGLSA 325
Query: 320 YL-TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
L LD GNGG I+DSGT+ T + + L D F + + R A +
Sbjct: 326 SLFRLDAAGNGGVIIDSGTSVTRLTRPAYIALRDAF---RIGASHLKR---APEFSLFDT 379
Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
CFD+ G P + LHF+ GA+V+LP NY V + C G SII
Sbjct: 380 CFDLSGLTEVKVPTVVLHFR-GADVSLPATNYLIPVDNSGSFCFAFAG---TMSGLSII- 434
Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
GN Q Q + + YDL R+GF + C
Sbjct: 435 GNIQQQGFRISYDLTGSRVGFAPRGC 460
>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 449
Score = 121 bits (303), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 112/390 (28%), Positives = 172/390 (44%), Gaps = 38/390 (9%)
Query: 90 SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQN 149
++SL+ GTPPQ + ++DTGS L W C SSS +F P SSS + C +
Sbjct: 74 TVSLTVGTPPQNVTMVIDTGSELSWLHCNTSQN----SSSSSSTFNPVWSSSYSPIPCSS 129
Query: 150 PKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIP 209
C + Q RD P+ S + Q C + L + +EG ++T + + IP
Sbjct: 130 STC------TDQTRDF---PIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSSGIP 180
Query: 210 NFLVGC--SVLSSR-----QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSS 262
N + GC S+ SS + G+ G RG S SQ+ KFSYC+ + F S
Sbjct: 181 NVVFGCMDSIFSSNSEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYDF------SG 234
Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
L+L ++ S L YTP + S V Y V L I V + + +
Sbjct: 235 LLLLGDANFS--WLAPLNYTPLIEM-STPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFE 291
Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
D G G T+VDSGT FTF+ + L D F+++ + + C+ V
Sbjct: 292 PDHTGAGQTMVDSGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYRV 351
Query: 383 PGEKTG--SFPELKLHFKGGAEVTLPVENYFAVV-----GEGSAVCLTVVTDREASGGPS 435
P +T P + L F+ GAE+T+ + V G S C T + + G +
Sbjct: 352 PTNQTRLPPLPSVTLVFR-GAEMTVTGDRILYRVPGERRGNDSIHCFT-FGNSDLLGVEA 409
Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
++G+ QN ++E+DL+ R+G + C
Sbjct: 410 FVIGHLHQQNVWMEFDLKKSRIGLAEIRCD 439
>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
melo]
Length = 412
Score = 121 bits (303), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 118/403 (29%), Positives = 174/403 (43%), Gaps = 46/403 (11%)
Query: 78 TTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPK 137
+ +S H ++SL+ G+PPQ + +LDTGS L W C S + F P
Sbjct: 29 SNKLSFHHNVTLTVSLTVGSPPQQVTMVLDTGSELSWLHCKK-------SPNLTSVFNPL 81
Query: 138 LSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIAL 197
SSS + C +P C D P + + ++C + + + EG
Sbjct: 82 SSSSYSPIPCSSPVCR---------TRTRDLPNPVTCDPKKLCHAIVSYADASSLEGNLA 132
Query: 198 SETLNLPNRIIPNFLVGC--SVLSSR-----QPAGIAGFGRGKTSLPSQLNLDKFSYCLL 250
S+ + + +P L GC S SS + G+ G RG S +QL L KFSYC+
Sbjct: 133 SDNFRIGSSALPGTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSYCI- 191
Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
+ R SS +L G SH LTYTP V S V Y V L I VG
Sbjct: 192 ------SGRDSSGVLLFGDSHL-SWLGNLTYTPLVQI-STPLPYFDRVAYTVQLDGIRVG 243
Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
+ + + D G G T+VDSGT FTF+ ++ L +EF+ Q + LG
Sbjct: 244 NKILPLPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALRNEFLEQ---TKGVLAPLGD 300
Query: 371 EALT---GLRPCFDVP-GEKTGSFPELKLHFKGGAEVTLPVENYFAVV-----GEGSAVC 421
+ C+ VP G K P + L F+ GAE+ + E V G+ C
Sbjct: 301 PNFVFQGAMDLCYRVPAGGKLPELPAVSLMFR-GAEMVVGGEVLLYKVPGMMKGKEWVYC 359
Query: 422 LTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
LT + + G + ++G+ QN ++E+DL R+GF + C
Sbjct: 360 LT-FGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRC 401
>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 392
Score = 121 bits (303), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 120/399 (30%), Positives = 168/399 (42%), Gaps = 61/399 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + S GTP Q I+DTGS L + C C C P + P SS+ +
Sbjct: 32 GQYFVDFSLGTPEQKFHLIVDTGSDLAFVQCA---PCDLCYEQDGPLYQPSNSSTFTPVP 88
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICP--------SYLVLYG-SGLTEGIAL 197
C + +C I P C+ P SY YG + T G+
Sbjct: 89 CDSAECLLI-------------PAPVGAPCSSSYPESPPQGACSYEYRYGDNSSTVGVFA 135
Query: 198 SETLNLPNRIIPNFLVGC------SVLSSRQPAGIAGFGRGKTSLPSQLNL---DKFSYC 248
ET + + + GC S +S+ G+ G G+G S SQ +KF+YC
Sbjct: 136 YETATVGGIRVNHVAFGCGNRNQGSFVSA---GGVLGLGQGALSFTSQAGYAFENKFAYC 192
Query: 249 LLSHKFDDTTRTSSLIL--DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
L S+ T+ SSLI D S+ D + T L P NPSV YYV + R
Sbjct: 193 LTSY-LSPTSVFSSLIFGDDMMSTIHDLQFTPLVSNPL--NPSV---------YYVQIVR 240
Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
I GG+ + + +D GNGGTI DSGTT T+ +P+ + ++ K+ Y R
Sbjct: 241 ICFGGETLLIPDSAWKIDSVGNGGTIFDSGTTVTYWSPQAYA----RIIAAFEKSVPYPR 296
Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT 426
A + GL C +V G +P + F GA NYF V + CL ++
Sbjct: 297 A--PPSPQGLPLCVNVSGIDHPIYPSFTIEFDQGATYRPNQGNYFIEVSP-NIDCLAML- 352
Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
E+S ++GN QNY V+YD R+GF C
Sbjct: 353 --ESSSDGFNVIGNIIQQNYLVQYDREEHRIGFAHANCD 389
>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
Length = 350
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 120/388 (30%), Positives = 173/388 (44%), Gaps = 66/388 (17%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS----KIPSFIPKLSSSSRL 144
Y I++ FGTP + I DTGS++ W QCK C S + P F P LSS+ R
Sbjct: 16 YVITVGFGTPKKNQTVIFDTGSNVNWI------QCKPCVVSCYPQQEPLFDPTLSSTYRN 69
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNL 203
+ C + C+ + +S+ C+ Y V YG G T G +ET L
Sbjct: 70 ISCTSAACTGL----------------SSRGCSGSTCVYGVTYGDGSSTVGFLATETFTL 113
Query: 204 -PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDD 256
+ NF+ GC + AG+ G GR SL SQL + FSYCL S
Sbjct: 114 AAGNVFNNFIFGCGQNNQGLFTGAAGLIGLGRSPYSLNSQLATSLGNIFSYCLPS----- 168
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
T +++ L+ G + +T G YT + N Y++ L I+VGG R+ +
Sbjct: 169 -TSSATGYLNIG---NPLRTPG--YTAMLTNSRAPT------LYFIDLIGISVGGTRLAL 216
Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
+ GTI+DSGT T + P + L F + M + YTRA A L
Sbjct: 217 SSTVFQ-----SVGTIIDSGTVITRLPPTAYGALRTAFRAAMTQ---YTRAAAASI---L 265
Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
C+D T +FP +KLH+ G +VT+P F V+ S VCL + +++
Sbjct: 266 DTCYDFSRTTTVTFPTIKLHYT-GLDVTIPGAGVFYVI-SSSQVCLAFAGNSDST--QIG 321
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN Q + V YD +R+GF C
Sbjct: 322 IIGNVQQRTMEVTYDNALKRIGFAAGAC 349
>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 118/412 (28%), Positives = 175/412 (42%), Gaps = 61/412 (14%)
Query: 64 NPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC 123
+P +++T + T+ + S G Y +++ GTP + DTGS W QC
Sbjct: 138 HPGHSASSSTPSLPATSGRAVSTGNYVVTVGLGTPASKYTVVFDTGSDTTWV------QC 191
Query: 124 K----YCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQI 179
+ C K P F P SS+ + C + C+ + D N CT
Sbjct: 192 RPCVVKCYKQKEPLFDPAKSSTYANVSCTDSACADL--------DTN--------GCTGG 235
Query: 180 CPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTS 235
Y V YG G T G +TL + + I F GC ++ + AG+ G GRGKTS
Sbjct: 236 HCLYAVQYGDGSYTVGFFAQDTLTIAHDAIKGFRFGCGEKNNGLFGKTAGLMGLGRGKTS 295
Query: 236 LPSQL---NLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAE 292
L Q F+YCL + TT T L GS+ ++ + T P + +
Sbjct: 296 LTVQAYNKYGGAFAYCLPAL----TTGTGYLDFGPGSAGNNARLT----------PMLTD 341
Query: 293 RNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLAD 352
+ +YYVG+ I VGGQ+V V + GT+VDSGT T + + L+
Sbjct: 342 KG--QTFYYVGMTGIRVGGQQVPVAESVFS-----TAGTLVDSGTVITRLPATAYTALSS 394
Query: 353 EFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFA 412
F M+ R Y +A G L C+D G P + L F+GGA + + V
Sbjct: 395 AFDKVMLA-RGYKKAPGYSI---LDTCYDFTGLSDVELPTVSLVFQGGACLDVDVSGIVY 450
Query: 413 VVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ E + VCL ++ + I+GN Q + Y V YDL + +GF C
Sbjct: 451 AISE-AQVCLAFASNGDDES--VAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499
>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 118/412 (28%), Positives = 175/412 (42%), Gaps = 61/412 (14%)
Query: 64 NPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC 123
+P +++T + T+ + S G Y +++ GTP + DTGS W QC
Sbjct: 138 HPGHSASSSTPSLPATSGRAVSTGNYVVTVGLGTPASKYTVVFDTGSDTTWV------QC 191
Query: 124 K----YCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQI 179
+ C K P F P SS+ + C + C+ + D N CT
Sbjct: 192 RPCVVKCYKQKGPLFDPAKSSTYANVSCTDSACADL--------DTN--------GCTGG 235
Query: 180 CPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTS 235
Y V YG G T G +TL + + I F GC ++ + AG+ G GRGKTS
Sbjct: 236 HCLYAVQYGDGSYTVGFFAQDTLTIAHDAIKGFRFGCGEKNNGLFGKTAGLMGLGRGKTS 295
Query: 236 LPSQL---NLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAE 292
L Q F+YCL + TT T L GS+ ++ + T P + +
Sbjct: 296 LTVQAYNKYGGAFAYCLPAL----TTGTGYLDFGPGSAGNNARLT----------PMLTD 341
Query: 293 RNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLAD 352
+ +YYVG+ I VGGQ+V V + GT+VDSGT T + + L+
Sbjct: 342 KG--QTFYYVGMTGIRVGGQQVPVAESVFS-----TAGTLVDSGTVITRLPATAYTALSS 394
Query: 353 EFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFA 412
F M+ R Y +A G L C+D G P + L F+GGA + + V
Sbjct: 395 AFDKVMLA-RGYKKAPGYSI---LDTCYDFTGLSDVELPTVSLVFQGGACLDVDVSGIVY 450
Query: 413 VVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ E + VCL ++ + I+GN Q + Y V YDL + +GF C
Sbjct: 451 AISE-AQVCLAFASNGDDE--SVAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499
>gi|18421660|ref|NP_568551.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|10177438|dbj|BAB10671.1| unnamed protein product [Arabidopsis thaliana]
gi|15809850|gb|AAL06853.1| AT5g37540/mpa22_p_70 [Arabidopsis thaliana]
gi|20260182|gb|AAM12989.1| unknown protein [Arabidopsis thaliana]
gi|23197748|gb|AAN15401.1| unknown protein [Arabidopsis thaliana]
gi|332006821|gb|AED94204.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 112/381 (29%), Positives = 167/381 (43%), Gaps = 29/381 (7%)
Query: 91 ISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNP 150
+SL GTP Q +LDTGS L W C + + K SF P LSSS L C +P
Sbjct: 82 LSLPIGTPSQSQELVLDTGSQLSWIQC-HPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHP 140
Query: 151 KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN-RII 208
C D L TS + ++C Y Y G EG + E N +
Sbjct: 141 LCK---------PRIPDFTLPTSCDSNRLC-HYSYFYADGTFAEGNLVKEKFTFSNSQTT 190
Query: 209 PNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCL--LSHKFDDTTRTSSLILD 266
P ++GC+ S+ + GI G G+ S SQ + KFSYC+ S++ + S + D
Sbjct: 191 PPLILGCAKESTDE-KGILGMNLGRLSFISQAKISKFSYCIPTRSNRPGLASTGSFYLGD 249
Query: 267 NGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRD 326
N +S K + LT+ P N + Y V L+ I +G +R+ + D
Sbjct: 250 NPNSRGFKYVSLLTFPQSQRMP-----NLDPLAYTVPLQGIRIGQKRLNIPGSVFRPDAG 304
Query: 327 GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEK 386
G+G T+VDSG+ FT + ++ + +E V + G+ A CFD G
Sbjct: 305 GSGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADM----CFD--GNH 358
Query: 387 TGSFPEL--KLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQ 444
+ L L F+ G V + VE +V G + + G S I+GN Q
Sbjct: 359 SMEIGRLIGDLVFEFGRGVEILVEKQSLLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQ 418
Query: 445 NYYVEYDLRNQRLGFKQQLCK 465
N +VE+D+ N+R+GF + C+
Sbjct: 419 NLWVEFDVTNRRVGFSKAECR 439
>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
Length = 401
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 100/332 (30%), Positives = 145/332 (43%), Gaps = 43/332 (12%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + L+ GTPPQ + LDTGS L+W C C C +P F P SS+ L C
Sbjct: 82 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQ---PCPACFDQALPYFDPSTSSTLSLTSCD 138
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNL--PN 205
+ C + + C ++ C Y YG +T G +
Sbjct: 139 STLC-----QGLPVASCGSPKFWPNQTCV-----YTYSYGDKSVTTGFLEVDKFTFVGAG 188
Query: 206 RIIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
+P GC + ++ GIAGFGRG SLPSQL + FS+C + + + S
Sbjct: 189 ASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAV---NGLKPS 245
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
+++LD + + TP + NP A +YY+ L+ ITVG R+ V
Sbjct: 246 TVLLDLPADLYKSGRGAVQSTPLIQNP------ANPTFYYLSLKGITVGSTRLPVPESEF 299
Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM---VKNRNYTRALGAEALTGLRP 378
L ++G GGTI+DSGT T + ++ + D F +Q+ V + N T
Sbjct: 300 AL-KNGTGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYF--------- 349
Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENY 410
C P P+L LHF+ GA + LP ENY
Sbjct: 350 CLSAPLRAKPYVPKLVLHFE-GATMDLPRENY 380
>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 481
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 120/418 (28%), Positives = 178/418 (42%), Gaps = 51/418 (12%)
Query: 58 RALHIKNPQT--KTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWF 115
R+ KNP K + T + + S+ G Y +++ GTP + + FI DTGS L W
Sbjct: 105 RSRLAKNPADGGKLKGSKVTLPSKSGSTIGTGNYVVTVGLGTPKRDLTFIFDTGSDLTWT 164
Query: 116 PCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKN 175
C +YC + P F P S+S + C +P C + + + P ++
Sbjct: 165 QC--EPCARYCYHQQEPIFNPSKSTSYTNISCSSPTCDELKSGT------GNSPSCSAST 216
Query: 176 CTQICPSYLVLYGS-GLTEGIALSETLNLPN-RIIPNFLVGCSVLSSR---QPAGIAGFG 230
C Y + YG + G + L L + + NFL GC + AG+ G G
Sbjct: 217 CV-----YGIQYGDQSYSVGFFAQDKLALTSTDVFNNFLFGCGQNNRGLFVGVAGLIGLG 271
Query: 231 RGKTSLPSQLNLDK---FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNN 287
R SL SQ FSYCL S T +S+ L GS K T
Sbjct: 272 RNALSLVSQTAQKYGKLFSYCLPS------TSSSTGYLTFGSGGGTSKAVKFT------- 318
Query: 288 PSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELF 347
PS+ S +Y++ L I+VGG+++ + GTI+DSGT + + P +
Sbjct: 319 PSLVNSQGPS-FYFLNLIAISVGGRKLSTSASVFS-----TAGTIIDSGTVISRLPPTAY 372
Query: 348 EPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPV 407
L F QM K Y +A A L C+D T P++ L+F GAE+ L
Sbjct: 373 SDLRASFQQQMSK---YPKAAPASI---LDTCYDFSQYDTVDVPKINLYFSDGAEMDLDP 426
Query: 408 ENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
F ++ S VCL + +A+ ILGN Q + + V YD+ R+GF C+
Sbjct: 427 SGIFYILNI-SQVCLAFAGNSDAT--DIAILGNVQQKTFDVVYDVAGGRIGFAPGGCE 481
>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 438
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 137/484 (28%), Positives = 212/484 (43%), Gaps = 85/484 (17%)
Query: 10 LSFIFFFTLLSIFPSSITSLTFSLSRFHTN--------PSQDSYQNLNSLVSSSLTRALH 61
L+ FFF SI S S FS+ H + P+Q+ YQ++ V S+ R H
Sbjct: 7 LTLSFFFLCFSISFSQAVSNGFSIELIHRDSSKSPFYKPTQNKYQHVVDAVHRSINRVNH 66
Query: 62 IKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHY 121
+ + +T +T IS G Y +S S GTPP I+DTGS +VW C
Sbjct: 67 -----SNKNSLASTPESTVISYE--GDYIMSYSVGTPPIKSYGIVDTGSDIVWLQCE--- 116
Query: 122 QCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICP 181
C+ C + P F P SSS + + C + C +S++ CND+ KNC
Sbjct: 117 PCEQCYNQTTPKFNPSKSSSYKNISCSSKLC-----QSVRDTSCNDK-----KNC----- 161
Query: 182 SYLVLYGS-GLTEGIALSETLNLPNRI-----IPNFLVGCSVLS----SRQPAGIAGFGR 231
Y + YG+ ++G ETL L + P ++GC + R +G+ G G
Sbjct: 162 EYSINYGNQSHSQGDLSLETLTLESTTGRPVSFPKTVIGCGTNNIGSFKRVSSGVVGLGG 221
Query: 232 GKTSLPSQLNLD---KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTG----LTYTPF 284
G SL +QL KFSYCL+ S+ L N S S K G ++
Sbjct: 222 GPASLITQLGPSIGGKFSYCLVRM---------SITLKNMSMGSSKLNFGDVAIVSGHNV 272
Query: 285 VNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAP 344
++ P V + ++F +YY+ + +VG +RV ++ G I+DS T TF+
Sbjct: 273 LSTPIVKKDHSF--FYYLTIEAFSVGDKRVEFAGSSKGVEE---GNIIIDSSTIVTFVPS 327
Query: 345 ELFEPLADEFVS----QMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGG 400
+++ L V + V + N +L C++V ++ FP + HFK G
Sbjct: 328 DVYTKLNSAIVDLVTLERVDDPNQQFSL----------CYNVSSDEEYDFPYMTAHFK-G 376
Query: 401 AEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFK 460
A++ L N F V +C ++GG I G+F Q++ V YDL+ + + FK
Sbjct: 377 ADILLYATNTFVEVAR-DVLCFAFA---PSNGGA--IFGSFSQQDFMVGYDLQQKTVSFK 430
Query: 461 QQLC 464
C
Sbjct: 431 SVDC 434
>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
gi|223948487|gb|ACN28327.1| unknown [Zea mays]
Length = 434
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 113/389 (29%), Positives = 166/389 (42%), Gaps = 59/389 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC-KYCSSSKIPSFIPKLSSSSRLL 145
G Y + + GTP + + DTGS W C C YC K P F P S++ +
Sbjct: 94 GNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQ---PCVAYCYRQKEPLFDPTKSATYANI 150
Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLP 204
C + CS ++ C Y + YG G T G +TL L
Sbjct: 151 SCSSSYCSDLYVSGCSGGHC----------------LYGIQYGDGSYTIGFYAQDTLTLA 194
Query: 205 NRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLDK----FSYCLLSHKFDDT 257
I NF GC + + AG+ G GRGKTSLP Q DK F+YCL + +
Sbjct: 195 YDTIKNFRFGCGEKNRGLFGRAAGLLGLGRGKTSLPVQA-YDKYGGVFAYCLPA----TS 249
Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
T L L G+ ++ + T P + +R +YYVG+ I VGG + +
Sbjct: 250 AGTGFLDLGPGAPAANARLT----------PMLVDRG--PTFYYVGMTGIKVGGHVLPIP 297
Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
+ GT+VDSGT T + P + PL F S+ ++ Y+ A A + L
Sbjct: 298 GSVFS-----TAGTLVDSGTVITRLPPSAYAPLRSAF-SKAMQGLGYS---AAPAFSILD 348
Query: 378 PCFDVPGEKTGS--FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPS 435
C+D+ G K GS P + L F+GGA + + V + S CL + + +
Sbjct: 349 TCYDLTGHKGGSIALPAVSLVFQGGACLDVDASGIL-YVADVSQACLAFAPNADDT--DV 405
Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN Q + + V YD+ + +GF C
Sbjct: 406 AIVGNTQQKTHGVLYDIGKKIVGFAPGAC 434
>gi|356508308|ref|XP_003522900.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 439
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 109/385 (28%), Positives = 167/385 (43%), Gaps = 54/385 (14%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + G+PPQ + +DT + W PCT C C+S+ F P+ S++ + + C
Sbjct: 98 YIVRAKIGSPPQTLLLAMDTSNDAAWIPCT---ACDGCTSTL---FAPEKSTTFKNVSCG 151
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
+P+C+ + + S C ++ + YGS + +T+ L I
Sbjct: 152 SPQCNQVPNPSCGTSAC----------------TFNLTYGSSSIAANVVQDTVTLATDPI 195
Query: 209 PNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHK---FDDTTR 259
P++ GC + +S P G+ G GRG SL SQ L FSYCL S K F + R
Sbjct: 196 PDYTFGCVAKTTGASAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLR 255
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
+ + + YTP + NP R++ YYV L I VG + V + +
Sbjct: 256 LGPV----------AQPIRIKYTPLLKNP---RRSSL---YYVNLVAIRVGRKVVDIPPE 299
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
L + GT+ DSGT FT + + + DEF ++ L +L G C
Sbjct: 300 ALAFNAATGAGTVFDSGTVFTRLVAPAYTAVRDEFQRRVAIAAKAN--LTVTSLGGFDTC 357
Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
+ VP P + F G VTLP +N GS CL + + + ++
Sbjct: 358 YTVPIVA----PTITFMF-SGMNVTLPEDNILIHSTAGSTTCLAMASAPDNVNSVLNVIA 412
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
N Q QN+ V YD+ N RLG ++LC
Sbjct: 413 NMQQQNHRVLYDVPNSRLGVARELC 437
>gi|297597434|ref|NP_001043968.2| Os01g0696800 [Oryza sativa Japonica Group]
gi|255673588|dbj|BAF05882.2| Os01g0696800 [Oryza sativa Japonica Group]
Length = 334
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 107/346 (30%), Positives = 160/346 (46%), Gaps = 43/346 (12%)
Query: 131 IPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG 190
+P P SSS+ + C + C + C + + S NC SY YG+
Sbjct: 12 LPLLYPTSSSSAAFVACGDRTCGELPRP--LCSNVAGG-GSGSGNC-----SYHYAYGNA 63
Query: 191 -----LTEGIALSETLNLPNRI--IPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQL 240
TEGI ++ET + P GC++ S +G+ G GRGK SL +QL
Sbjct: 64 RDTHHYTEGILMTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQL 123
Query: 241 NLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYY 300
N++ F Y L S D + S + + + + TP + NP V + +Y
Sbjct: 124 NVEAFGYRLSS----DLSAPSPISFGSLADVTGGNGDSFMSTPLLTNPVVQDLP----FY 175
Query: 301 YVGLRRITVGGQRVRVWHKYLTLDRD-GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMV 359
YVGL I+VGG+ V++ + DR G GG I DSGTT T + + + DE +SQM
Sbjct: 176 YVGLTGISVGGKLVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMG 235
Query: 360 KNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVV----G 415
+ A + + CF G T +FP + LHF GGA++ L ENY + G
Sbjct: 236 FQKPPPAANDDDLI-----CF-TGGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNG 289
Query: 416 EGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLR-NQRLGFK 460
E +A C +VV +A I+GN +++V +DL N R+ F+
Sbjct: 290 E-TARCWSVVKSSQA----LTIIGNIMQMDFHVVFDLSGNARMLFQ 330
>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 516
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 114/389 (29%), Positives = 164/389 (42%), Gaps = 60/389 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y +++ GTP + DTGS W C Y K+ F P SS+ +
Sbjct: 177 GNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKL--FDPARSSTYANVS 234
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C P CS + R C + +C Y V YG G + G +TL L +
Sbjct: 235 CAAPACS-----DLDTRGC------SGGHCL-----YGVQYGDGSYSIGFFAMDTLTLSS 278
Query: 206 -RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLDK----FSYCLLSHKFDDT 257
+ F GC + + AG+ G GRGKTSLP Q DK F++CL +
Sbjct: 279 YDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQ-TYDKYGGVFAHCLPARS---- 333
Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFV--NNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
T + LD G+ + LT TP + N P+ +YYVGL I VGG+ +
Sbjct: 334 --TGTGYLDFGAGSPAAR---LTTTPMLVDNGPT---------FYYVGLTGIRVGGRLLY 379
Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
+ GTIVDSGT T + P + L F + M R Y + A A++
Sbjct: 380 IPQSVFA-----TAGTIVDSGTVITRLPPAAYSSLRSAFAAAM-SARGYKK---APAVSL 430
Query: 376 LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPS 435
L C+D G + P + L F+GGA + + S VCL + + GG
Sbjct: 431 LDTCYDFAGMSQVAIPTVSLLFQGGARLDVDASGIM-YAASASQVCLAFAANED--GGDV 487
Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN Q++ + V YD+ + + F C
Sbjct: 488 GIVGNTQLKTFGVAYDIGKKVVSFSPGAC 516
>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
Length = 499
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 112/388 (28%), Positives = 165/388 (42%), Gaps = 57/388 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + GTP + + DTGS W C YC K P F P S++ +
Sbjct: 159 GNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCV--AYCYRQKEPLFDPTKSATYANIS 216
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C + CS ++ C Y + YG G T G +TL L
Sbjct: 217 CSSSYCSDLYVSGCSGGHC----------------LYGIQYGDGSYTIGFYAQDTLTLAY 260
Query: 206 RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLDK----FSYCLLSHKFDDTT 258
I NF GC + + AG+ G GRGKTSLP Q DK F+YCL + +
Sbjct: 261 DTIKNFRFGCGEKNRGLFGRAAGLLGLGRGKTSLPVQA-YDKYGGVFAYCLPA----TSA 315
Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
T L L G+ ++ + T P + +R +YYVG+ I VGG + +
Sbjct: 316 GTGFLDLGPGAPAANARLT----------PMLVDRG--PTFYYVGMTGIKVGGHVLPIPG 363
Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
+ GT+VDSGT T + P + PL F S+ ++ Y+ A A + L
Sbjct: 364 SVFS-----TAGTLVDSGTVITRLPPSAYAPLRSAF-SKAMQGLGYS---AAPAFSILDT 414
Query: 379 CFDVPGEKTGS--FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
C+D+ G K GS P + L F+GGA + + V + S CL + + +
Sbjct: 415 CYDLTGHKGGSIALPAVSLVFQGGACLDVDASGIL-YVADVSQACLAFAPNADDTD--VA 471
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN Q + + V YD+ + +GF C
Sbjct: 472 IVGNTQQKTHGVLYDIGKKIVGFAPGAC 499
>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 436
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 134/479 (27%), Positives = 198/479 (41%), Gaps = 76/479 (15%)
Query: 10 LSFIFFFTLLSIFPSSITSLTFSLSRFHTN--------PSQDSYQNLNSLVSSSLTRALH 61
L+ +FF + S FS+ H + P+Q+ YQ S+ RA H
Sbjct: 7 LTLLFFSICFIVSFSHAQKNGFSVELIHRDSLKSPLYKPTQNKYQYFVDAARRSINRANH 66
Query: 62 IKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHY 121
+ +T +I G Y ++ S GTPP + I+DTGS +VW C
Sbjct: 67 FY--KYSLANIPQSTVIPDI-----GEYLMTYSVGTPPFKLYGIVDTGSDIVWLQCE--- 116
Query: 122 QCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICP 181
C+ C + P F P SSS + + C + C +S++ CND KN +
Sbjct: 117 PCQECYNQTTPMFNPSKSSSYKNIPCPSKLC-----QSMEDTSCND------KNYCE--- 162
Query: 182 SYLVLYGSGLTEGIALS------ETLNLPNRIIPNFLVGC---SVLSSR-QPAGIAGFGR 231
Y YG G LS E+ N PN ++GC ++LS +GI GFG
Sbjct: 163 -YSTYYGDNSHSGGDLSVDTLTLESTNGLTVSFPNIVIGCGTNNILSYEGASSGIVGFGS 221
Query: 232 GKTSLPSQLNLD---KFSYC---LLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFV 285
G S +QL KFSYC L S + TS L + ++ S G+ TP +
Sbjct: 222 GPASFITQLGSSTGGKFSYCLTPLFSVTNIQSNATSKLNFGDAATVSGD---GVVTTPIL 278
Query: 286 NNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPE 345
+ +YY+ L +VG +RV + + D G I+DSGTT T + +
Sbjct: 279 -------KKDPETFYYLTLEAFSVGNRRVEIGG---VPNGDNEGNIIIDSGTTLTSLTKD 328
Query: 346 LFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTL 405
+ L V + R + L C+ V E FP + +HFK GA+V L
Sbjct: 329 DYSFLESAVVDLVKLERV------DDPTQTLNLCYSVKAEGY-DFPIITMHFK-GADVDL 380
Query: 406 PVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ F V +G CL + ++ + I GN QN V YDL+ + + FK C
Sbjct: 381 HPISTFVSVADG-VFCLAFESSQDHA-----IFGNLAQQNLMVGYDLQQKIVSFKPSDC 433
>gi|326513976|dbj|BAJ92138.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 342
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 89/258 (34%), Positives = 129/258 (50%), Gaps = 22/258 (8%)
Query: 214 GCSVLSSRQ---PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSS 270
GC LS+ +G+ G G SL SQL++ +FSYCL +TS ++ +
Sbjct: 97 GCGALSAGSLVGASGLMGLSPGTMSLISQLSVPRFSYCLTPFA---ERKTSPMLFGAMAD 153
Query: 271 HSDKKTTG-LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNG 329
TTG + T + NP++ + YYYV L +++G +R+RV L ++ DG G
Sbjct: 154 LRKYNTTGPIQTTAILRNPAMD-----TFYYYVPLVGLSLGTKRLRVPAASLAINPDGTG 208
Query: 330 GTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP-GEKTG 388
GTIVDSG+T +A + F+ + + V + VK + + L CF VP G
Sbjct: 209 GTIVDSGSTMAHLAGKAFDAV-KKAVLEAVKLPVFNGTVEDYEL-----CFAVPSGVAMA 262
Query: 389 SF--PELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNY 446
+ P L LHF GGA + LP +NYF G +CL V E G P I+GN Q QN
Sbjct: 263 AVKTPPLVLHFDGGAAMALPRDNYFQEPRAG-LMCLAVARSPEDLGAPISIIGNVQQQNM 321
Query: 447 YVEYDLRNQRLGFKQQLC 464
+V +D+ NQ+ F C
Sbjct: 322 HVLFDVHNQKFSFAPTKC 339
>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 119 bits (298), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 110/385 (28%), Positives = 159/385 (41%), Gaps = 50/385 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + G+PP+ ++D+GS ++W C C C P F P SSS +
Sbjct: 132 GEYFVRIGVGSPPRNQYVVIDSGSDIIWVQCE---PCTQCYHQSDPVFNPADSSSYAGVS 188
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C + CS + + C Y V YG G T+G ETL
Sbjct: 189 CASTVCSHVDNAGCHEGRCR----------------YEVSYGDGSYTKGTLALETLTFGR 232
Query: 206 RIIPNFLVGCSVLSSRQ---PAGIAGFGRGKTSLPSQLNLDK---FSYCLLSHKFDDTTR 259
+I N +GC + AG+ G G G S QL FSYCL+S
Sbjct: 233 TLIRNVAIGCGHHNQGMFVGAAGLLGLGSGPMSFVGQLGGQAGGTFSYCLVSRGIQ---- 288
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
SS +L G + G + P ++NP +YYVGL + VGG RV +
Sbjct: 289 -SSGLLQFGR---EAVPVGAAWVPLIHNPRAQS------FYYVGLSGLGVGGLRVPISED 338
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
L G+GG ++D+GT T + +E D F++Q N RA G C
Sbjct: 339 VFKLSELGDGGVVMDTGTAVTRLPTAAYEAFRDAFIAQTT---NLPRASGVSIFD---TC 392
Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
+D+ G + P + +F GG +TLP N+ V + + C +S G SII G
Sbjct: 393 YDLFGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDDVGSFCFAFA---PSSSGLSII-G 448
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
N Q + + D N +GF +C
Sbjct: 449 NIQQEGIEISVDGANGFVGFGPNVC 473
>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
Length = 502
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 114/392 (29%), Positives = 165/392 (42%), Gaps = 56/392 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y +++ GTP + + I DTGS L W C K C + + P F P S + +
Sbjct: 152 GNYIVNVGLGTPKKDLSLIFDTGSDLTWTQC--QPCVKSCYAQQQPIFDPSTSKTYSNIS 209
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNL-P 204
C + CS + + + P +S NC Y + YG S T G + L L
Sbjct: 210 CTSAACSSLKSAT------GNSPGCSSSNCV-----YGIQYGDSSFTIGFFAKDKLTLTQ 258
Query: 205 NRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTT 258
N + F+ GC + + AG+ G GR S+ Q FSYCL T+
Sbjct: 259 NDVFDGFMFGCGQNNKGLFGKTAGLIGLGRDPLSIVQQTAQKFGKYFSYCL------PTS 312
Query: 259 RTSSLIL----DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
R S+ L NG S G+T+TPF ++ A YY++ + I+VGG+ +
Sbjct: 313 RGSNGHLTFGNGNGVKASKAVKNGITFTPFASSQGTA-------YYFIDVLGISVGGKAL 365
Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
+ N GTI+DSGT T + + L F M K A AL+
Sbjct: 366 SISPMLFQ-----NAGTIIDSGTVITRLPSTAYGSLKSAFKQFMSKYPT------APALS 414
Query: 375 GLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV--TDREASG 432
L C+D+ + S P++ +F G A V L N + S VCL D ++ G
Sbjct: 415 LLDTCYDLSNYTSISIPKISFNFNGNANVELD-PNGILITNGASQVCLAFAGNGDDDSIG 473
Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I GN Q Q V YD+ +LGF + C
Sbjct: 474 ----IFGNIQQQTLEVVYDVAGGQLGFGYKGC 501
>gi|222637181|gb|EEE67313.1| hypothetical protein OsJ_24553 [Oryza sativa Japonica Group]
Length = 414
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 113/356 (31%), Positives = 145/356 (40%), Gaps = 47/356 (13%)
Query: 126 CSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLV 185
C++ P F P SS+ L C + C ++ + C + C P
Sbjct: 88 CAARPAPPFQPASSSTFSKLPCASSLCQFLTSPYLTCN---------ATGCVYYYP---- 134
Query: 186 LYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSS--RQPAGIAGFGRGKTSLPSQLNLD 243
YG G T G +ETL++ P GCS + +GI G GR SL SQ+ +
Sbjct: 135 -YGMGFTAGYLATETLHVGGASFPGVAFGCSTENGVGNSSSGIVGLGRSPLSLVSQVGVG 193
Query: 244 KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTP-FVNNPSVAERNAFSVYYYV 302
+FSYCL S D S IL S K TG +P + NP + S YYYV
Sbjct: 194 RFSYCLRS----DADAGDSPILFG----SLAKVTGGKSSPAILENPEMPS----SSYYYV 241
Query: 303 GLRRITVGGQRVRVWHKYLTLDRDGN----GGTIVDSGTTFTFMAPELFEPLADEFVSQM 358
L ITVG + V R GGTIVDSGTT T++ E + + F+SQM
Sbjct: 242 NLTGITVGATDLPVTSTTFGFTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQM 301
Query: 359 VKNRNYTRALGAEALTGLRPCFDVPGEKTGS---FPELKLHFKGGAEVTLPVENYFAVV- 414
T G G CFD GS P L L F GGAE + +Y VV
Sbjct: 302 ATANLTTTVNGTR--FGFDLCFDANAAGGGSGVPVPTLVLRFAGGAEYAVRRRSYVGVVE 359
Query: 415 ----GEGSAVCLTVVTDREASGGPSI-ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
G + CL V+ E SI I+GN + +V YDL F C
Sbjct: 360 VDSQGRAAVECLLVLPASEKL---SISIIGNVMQMDLHVLYDLDGGMFSFAPADCA 412
>gi|326529233|dbj|BAK01010.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 441
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 122/481 (25%), Positives = 198/481 (41%), Gaps = 67/481 (13%)
Query: 8 LCLSFIFFFTLLSIFPSSITS------LTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALH 61
+ +F+F LL S +TS L L+ + + + V+ S R L
Sbjct: 1 MARTFVFLLVLLCFRASLVTSSSTGAGLRMKLTHVDDKAGYTTEERVRRAVAVSRER-LA 59
Query: 62 IKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHY 121
Q + + + ++++ Y + G PPQ ++DTGS+L+W C
Sbjct: 60 YTQQQQQLRASGDVSAPVHLATRQYIAEYL---IGDPPQRAAALIDTGSNLIWTQCGTTC 116
Query: 122 QCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICP 181
K C+ +P + LS SS + C D + N +C
Sbjct: 117 GLKACAKQDLPYY--NLSRSS----------------TFAAVPCADSAKLCAANGVHLCG 158
Query: 182 -----SYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSS------RQPAGIAGFG 230
++ YG+G G +E + GC L+ +G+ G G
Sbjct: 159 LDGSCTFAASYGAGSVFGSLGTEAFTFQSG-AAKLGFGCVSLTRITKGALNGASGLIGLG 217
Query: 231 RGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSV 290
RG+ SL SQ KFSYCL + + SS + S+ +T PFV +P
Sbjct: 218 RGRLSLVSQTGATKFSYCLTPYLRNHG--ASSHLFVGASASLSGGGGAVTSIPFVKSP-- 273
Query: 291 AERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDG----NGGTIVDSGTTFTFMAPEL 346
E +S +YY+ L I+VG ++ + L R +GG I+D+G+ T +A
Sbjct: 274 -EDYPYSTFYYLPLVGISVGETKLPIPSAAFELRRVAAGYWSGGVIIDTGSPVTSLAEAA 332
Query: 347 FEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF---DVPGEKTGSFPELKLHFKGGAEV 403
+ L+DE Q+ NR+ + A TGL C DV +K P L HF GGA++
Sbjct: 333 YSALSDEVARQL--NRSLVQ---PPADTGLDLCVARQDV--DKV--VPVLVFHFGGGADM 383
Query: 404 TLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
+ +Y+ V + +A C+ + GG ++GNFQ Q+ ++ YD+ L F+
Sbjct: 384 AVSAGSYWGPVDKSTA-CMLI-----EEGGYETVIGNFQQQDVHLLYDIGKGELSFQTAD 437
Query: 464 C 464
C
Sbjct: 438 C 438
>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 123/442 (27%), Positives = 196/442 (44%), Gaps = 57/442 (12%)
Query: 33 LSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSIS 92
LS F+ N + Q +N+ + S++R H + + ++++S+ G Y +S
Sbjct: 43 LSPFY-NSEETDLQRINNALRRSISRVHHFD--PIAAASVSPKAAESDVTSNR-GEYLMS 98
Query: 93 LSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKC 152
LS GTPP I I DTGS L+W C C+ C P F PK S + R C +C
Sbjct: 99 LSLGTPPFKIMGIADTGSDLIWTQCK---PCERCYKQVDPLFDPKSSKTYRDFSCDARQC 155
Query: 153 SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFL 212
S + + C + ++ T G+ ++ I L T P P +
Sbjct: 156 SLLDQSTCSGNICQYQYSYGDRSYTM---------GNVASDTITLDSTTGSPVS-FPKTV 205
Query: 213 VGCSVLS----SRQPAGIAGFGRGKTSLPSQLNLD---KFSYCL--LSHKFDDTTRTSSL 263
+GC + S + +GI G G G SL SQ+ KFSYCL LS + ++++
Sbjct: 206 IGCGHENDGTFSDKGSGIVGLGAGPLSLISQMGSSVGGKFSYCLVPLSSRAGNSSKL--- 262
Query: 264 ILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTL 323
N S++ G+ TP +++ ++ S +Y++ L ++VG +R++ L
Sbjct: 263 ---NFGSNAVVSGPGVQSTPLLSSETM------SSFYFLTLEAMSVGNERIKFGDSSLGT 313
Query: 324 DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG-LRPCFDV 382
G G I+DSGTT T + + F L+ +Q+ R AE +G L C+
Sbjct: 314 ---GEGNIIIDSGTTLTIVPDDFFSNLSTAVGNQVEGRR-------AEDPSGFLSVCYSA 363
Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
+ P + HF GA+V L N F V + VCL + + G S I GN
Sbjct: 364 TSDL--KVPAITAHFT-GADVKLKPINTFVQVSD-DVVCLAFAS---TTSGIS-IYGNVA 415
Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
N+ VEY+++ + L FK C
Sbjct: 416 QMNFLVEYNIQGKSLSFKPTDC 437
>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
sylvestris]
Length = 502
Score = 119 bits (297), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 113/392 (28%), Positives = 166/392 (42%), Gaps = 56/392 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y +++ GTP + + I DTGS L W C K C + + P F P S + +
Sbjct: 152 GNYIVNVGLGTPKKDLSLIFDTGSDLTWTQC--QPCVKSCYAQQQPIFDPSASKTYSNIS 209
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNL-P 204
C + CS + + + P +S NC Y + YG S T G +TL L
Sbjct: 210 CTSTACSGLKSAT------GNSPGCSSSNCV-----YGIQYGDSSFTVGFFAKDTLTLTQ 258
Query: 205 NRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTT 258
N + F+ GC + + AG+ G GR S+ Q FSYCL T+
Sbjct: 259 NDVFDGFMFGCGQNNRGLFGKTAGLIGLGRDPLSIVQQTAQKFGKYFSYCL------PTS 312
Query: 259 RTSSLIL----DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
R S+ L NG S G+T+TPF ++ + +Y++ + I+VGG+ +
Sbjct: 313 RGSNGHLTFGNGNGVKTSKAVKNGITFTPFASSQG-------ATFYFIDVLGISVGGKAL 365
Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
+ N GTI+DSGT T + ++ L F M K A AL+
Sbjct: 366 SISPMLFQ-----NAGTIIDSGTVITRLPSTVYGSLKSTFKQFMSKYPT------APALS 414
Query: 375 GLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV--TDREASG 432
L C+D+ + S P++ +F G A V L N + S VCL D + G
Sbjct: 415 LLDTCYDLSNYTSISIPKISFNFNGNANVDLE-PNGILITNGASQVCLAFAGNGDDDTIG 473
Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I GN Q Q V YD+ +LGF + C
Sbjct: 474 ----IFGNIQQQTLEVVYDVAGGQLGFGYKGC 501
>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 119 bits (297), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 118/392 (30%), Positives = 163/392 (41%), Gaps = 57/392 (14%)
Query: 82 SSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSS 141
+S G Y + G P + +LDTGS + W C C C P F P+ SSS
Sbjct: 148 TSQGSGEYFSRVGVGQPAKPFYMVLDTGSDINWLQCQ---PCTDCYQQTDPIFDPRSSSS 204
Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSET 200
L C++ +C + E+ CR + C Y V YG G T G + ET
Sbjct: 205 FASLPCESQQCQAL--ETSGCR---------ASKCL-----YQVSYGDGSFTVGEFVIET 248
Query: 201 LNLPNR-IIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNLDKFSYCLLSH 252
L N +I N VGC G+ G G G SL SQ+ FSYCL+
Sbjct: 249 LTFGNSGMINNVAVGCG----HDNEGLFVGSAGLLGLGGGSLSLTSQMKASSFSYCLV-- 302
Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
D + +SS + N ++ SD VN P + + +YYVGL ++VGGQ
Sbjct: 303 --DRDSSSSSDLEFNSAAPSDS----------VNAP-LLKSGKVDTFYYVGLTGMSVGGQ 349
Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
+ + +D G GG IVDSGT T + + + L D FVS+ Y + A
Sbjct: 350 LLSIPPNLFQMDDSGYGGIIVDSGTAITRLQTQAYNTLRDAFVSRT----PYLKKTNGFA 405
Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG 432
L C+D+ + + P + F GG + LP +NY V C +
Sbjct: 406 L--FDTCYDLSSQSRVTIPTVSFEFAGGKSLQLPPKNYLIPVDSVGTFCFAFAPTTSSLS 463
Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN Q Q V YDL N +GF C
Sbjct: 464 ----IIGNVQQQGTRVHYDLANSVVGFSPHKC 491
>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 354
Score = 119 bits (297), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 120/393 (30%), Positives = 171/393 (43%), Gaps = 65/393 (16%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCK----YCSSSKIPSFIPKLSSSS 142
G Y + + G+P + I+DTGS L W QCK YC P F P S +
Sbjct: 11 GNYYVKVGLGSPARYYSMIVDTGSSLSWL------QCKPCVVYCHVQADPLFDPSASKTY 64
Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETL 201
+ L C + +CS + ++ N+ TS N +C Y YG S + G + L
Sbjct: 65 KSLSCTSSQCSSLVDATL-----NNPLCETSSN---VC-VYTASYGDSSYSMGYLSQDLL 115
Query: 202 NL-PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKF 254
L P++ +P F+ GC S + AGI G GR K S+ Q++ FSYCL
Sbjct: 116 TLAPSQTLPGFVYGCGQDSEGLFGRAAGILGLGRNKLSMLGQVSSKFGYAFSYCL----- 170
Query: 255 DDTTRTSSLILDNGSSH---SDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG 311
TR L G + S K T +T P NPS+ Y++ L ITVGG
Sbjct: 171 --PTRGGGGFLSIGKASLAGSAYKFTPMTTDP--GNPSL---------YFLRLTAITVGG 217
Query: 312 QRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
+ + V + TI+DSGT T + ++ P FV M + Y RA G
Sbjct: 218 RALGVAAAQYRVP------TIIDSGTVITRLPMSVYTPFQQAFVKIM--SSKYARAPG-- 267
Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREAS 431
+ L CF + S PE++L F+GGA++ L N V EG CL + +
Sbjct: 268 -FSILDTCFKGNLKDMQSVPEVRLIFQGGADLNLRPVNVLLQVDEG-LTCLAFAGNNGVA 325
Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN Q Q + V +D+ R+GF C
Sbjct: 326 -----IIGNHQQQTFKVAHDISTARIGFATGGC 353
>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
Length = 493
Score = 119 bits (297), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 113/398 (28%), Positives = 164/398 (41%), Gaps = 56/398 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y ++ GTP +DTGS + W C C+ C P F P+ S+S R +G
Sbjct: 132 GEYMAKIAVGTPAVEALLAMDTGSDITWLQCQ---PCRRCYPQSGPVFDPRHSTSYREMG 188
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS--GLTEGIALSETLNLP 204
P C + + ++ Y V YG T G + ETL
Sbjct: 189 YDAPDCQALGRSG-------------GGDAKRMTCVYAVGYGDDGSTTVGDFIEETLTFA 235
Query: 205 NRI-IPNFLVGCS----VLSSRQPAGIAGFGRGKTSLPSQL-----NLDKFSYCLLSHKF 254
+ +P+ +GC L + AGI G GRG+ S PSQ+ N+ FSYCL
Sbjct: 236 GGVQVPHMSIGCGHDNKGLFAAPAAGILGLGRGQISCPSQIAALGYNVTSFSYCLADFFL 295
Query: 255 DDTTRT--SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
R+ S+L + +G++ + +TP V N ++A + YY + G +
Sbjct: 296 SSPGRSVSSTLTIGDGAAAGSPPPS---FTPTVQNLNMA-----TFYYVRLVGVSVGGVR 347
Query: 313 RVRVWHKYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
V L LD G GG I+DSGT T +A +++ R LG
Sbjct: 348 VPGVTEDDLKLDPYTGRGGVILDSGTAVTRLARR-------AYIAFRDAFRAAAVDLGQV 400
Query: 372 ALTGLRPCFDV---PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV--T 426
++ G FD G + P + +HF GG E+TLP +NY V VC
Sbjct: 401 SIGGPSGFFDTCYTMGGRAMKVPTVSMHFAGGVELTLPPKNYLIPVDSMGTVCFAFAGTG 460
Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
DR S I+GN Q Q + V Y++ R+GF C
Sbjct: 461 DRSVS-----IIGNIQQQGFRVVYNIGGGRVGFAPNSC 493
>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 117/392 (29%), Positives = 164/392 (41%), Gaps = 57/392 (14%)
Query: 82 SSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSS 141
+S G Y + G P + +LDTGS + W C C C P F P+ SSS
Sbjct: 148 TSQGSGEYFSRVGVGQPAKPFYMVLDTGSDINWLQCQ---PCTDCYQQTDPIFDPRSSSS 204
Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSET 200
L C++ +C + E+ CR + C Y V YG G T G ++ET
Sbjct: 205 FASLPCESQQCQAL--ETSGCR---------ASKCL-----YQVSYGDGSFTVGEFVTET 248
Query: 201 LNLPNR-IIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNLDKFSYCLLSH 252
L N +I + VGC G+ G G G SL SQ+ FSYCL+
Sbjct: 249 LTFGNSGMINDVAVGCG----HDNEGLFVGSAGLLGLGGGPLSLTSQMKASSFSYCLV-- 302
Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
D + +SS + N ++ SD VN P + + +YYVGL ++VGGQ
Sbjct: 303 --DRDSSSSSDLEFNSAAPSDS----------VNAP-LLKSGKVDTFYYVGLTGMSVGGQ 349
Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
+ + +D G GG IVDSGT T + + + L D FVS+ Y + A
Sbjct: 350 LLSIPPNLFQMDDSGYGGIIVDSGTAITRLQTQAYNTLRDAFVSRT----PYLKKTNGFA 405
Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG 432
L C+D+ + + P + F GG + LP +NY V C +
Sbjct: 406 L--FDTCYDLSSQSRVTIPTVSFEFAGGKSLQLPPKNYLIPVDSVGTFCFAFAPTTSSLS 463
Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN Q Q V YDL N +GF C
Sbjct: 464 ----IIGNVQQQGTRVHYDLANSVVGFSPHKC 491
>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
Length = 512
Score = 118 bits (296), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 105/371 (28%), Positives = 165/371 (44%), Gaps = 39/371 (10%)
Query: 105 ILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRD 164
I+DT S L W C C+ C + P F P S S + C + C + +
Sbjct: 167 IVDTASELTWVQCA---PCESCHDQQDPLFDPSSSPSYAAVPCNSSSCDALQLAT--GGT 221
Query: 165 CNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSSRQP 223
++ + SY + Y G + G+ + L+L +I F+ GC + P
Sbjct: 222 SGGAAACQGQDQSAAACSYTLSYRDGSYSRGVLAHDRLSLAGEVIDGFVFGCGTSNQGPP 281
Query: 224 ----AGIAGFGRGKTSLPSQLNLDKF----SYCLLSHKFDDTTRTSSLILDNGSSHSDKK 275
+G+ G GR + SL SQ +D+F SYCL ++ + SL++ + SS +
Sbjct: 282 FGGTSGLMGLGRSQLSLVSQ-TMDQFGGVFSYCL---PLKESDSSGSLVIGDDSSVY-RN 336
Query: 276 TTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDS 335
+T + Y V++P +Y+V L ITVGGQ V I+DS
Sbjct: 337 STPIVYASMVSDPLQGP------FYFVNLTGITVGGQEVESSGFSSGGGGGK---AIIDS 387
Query: 336 GTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKL 395
GT T + P ++ + EF+SQ + Y +A G + L CF++ G + P LKL
Sbjct: 388 GTVITSLVPSIYNAVKAEFLSQFAE---YPQAPG---FSILDTCFNMTGLREVQVPSLKL 441
Query: 396 HFKGGAEVTLPVEN--YFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLR 453
F GG EV + YF V + S VCL + + + I+GN+Q +N V +D
Sbjct: 442 VFDGGVEVEVDSGGVLYF-VSSDSSQVCLAMAPLKSEY--ETNIIGNYQQKNLRVIFDTS 498
Query: 454 NQRLGFKQQLC 464
++GF Q+ C
Sbjct: 499 GSQVGFAQETC 509
>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 439
Score = 118 bits (296), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 132/442 (29%), Positives = 184/442 (41%), Gaps = 69/442 (15%)
Query: 39 NPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSH---SYGGYSISLSF 95
+PS+ + L S++R + T T+ I S S G Y ++L
Sbjct: 48 DPSKTQAERLTDAFRRSVSRVGRFR---------PTAMTSDGIQSRIVPSAGEYLMNLYI 98
Query: 96 GTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWI 155
GTPP + I+DTGS L W C C +C +P F PK SS+ R C C +
Sbjct: 99 GTPPVPVIAIVDTGSDLTWTQCR---PCTHCYKQVVPLFDPKNSSTYRDSSCGTSFCLAL 155
Query: 156 HHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRI-----IP 209
+ R C+ E K CT + Y G T G SETL + + P
Sbjct: 156 GKD----RSCSKE-----KKCT-----FRYSYADGSFTGGNLASETLTVDSTAGKPVSFP 201
Query: 210 NFLVGCSVLS----SRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSS 262
F GC S + +GI G G G+ SL SQL FSYCLL D + SS
Sbjct: 202 GFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKSTINGLFSYCLLPVSTDSS--ISS 259
Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
I N + G TP V + + +YY+ L I+VG +R+ + Y
Sbjct: 260 RI--NFGASGRVSGYGTVSTPLV-------QKSPDTFYYLTLEGISVGKKRLP-YKGYSK 309
Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
G IVDSGTT+TF+ E + L ++ V+ +K + G +L C++
Sbjct: 310 KTEVEEGNIIVDSGTTYTFLPQEFYSKL-EKSVANSIKGKRVRDPNGIFSL-----CYNT 363
Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
E P + HFK A V L N F + E VC TV + +LGN
Sbjct: 364 TAEINA--PIITAHFK-DANVELQPLNTFMRMQE-DLVCFTVAPTSDIG-----VLGNLA 414
Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
N+ V +DLR +R+ FK C
Sbjct: 415 QVNFLVGFDLRKKRVSFKAADC 436
>gi|22165126|gb|AAM93742.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|31433307|gb|AAP54836.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125575547|gb|EAZ16831.1| hypothetical protein OsJ_32302 [Oryza sativa Japonica Group]
Length = 405
Score = 118 bits (296), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 113/396 (28%), Positives = 171/396 (43%), Gaps = 63/396 (15%)
Query: 85 SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRL 144
S G Y + + GTPPQ + ++D LVW CT C+ C +P F P SS+ R
Sbjct: 53 SQGLYVANFTIGTPPQPVSAVVDLTGELVWTQCT---PCQPCFEQDLPLFDPTKSSTFRG 109
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLP 204
L C + C I +S+NCT Y +G T G A ++T +
Sbjct: 110 LPCGSHLCESIPE--------------SSRNCTSDVCIYEAPTKAGDTGGKAGTDTFAI- 154
Query: 205 NRIIPNFLVGCSVLSSRQ------PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTT 258
GC V++ ++ P+GI G GR SL +Q+N+ FSYCL
Sbjct: 155 GAAKETLGFGCVVMTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVTAFSYCLAGK------ 208
Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSV-AERNAFSVYYYVGLRRITVGGQRVRVW 317
SS L G++ + TPFV S + N + YY V L I GG ++
Sbjct: 209 --SSGALFLGATAKQLAGGKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKTGGAPLQAA 266
Query: 318 HKYLTLDRDGNGGTI-VDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
+G T+ +D+ + +++A ++ L + T A+G + +
Sbjct: 267 SS--------SGSTVLLDTVSRASYLADGAYKAL----------KKALTAAVGVQPVASP 308
Query: 377 RPCFDV--PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR------ 428
+D+ P G PEL F GGA +T+P NY G G+ VCLT+ +
Sbjct: 309 PKPYDLCFPKAVAGDAPELVFTFDGGAALTVPPANYLLASGNGT-VCLTIGSSASLNLTG 367
Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
E G + ILG+ Q +N +V +DL+ + L FK C
Sbjct: 368 ELEG--ASILGSLQQENVHVLFDLKEETLSFKPADC 401
>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
Length = 525
Score = 118 bits (295), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 116/385 (30%), Positives = 164/385 (42%), Gaps = 50/385 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y +++ GTP + DTGS W C Y K+ F P SS+ +
Sbjct: 184 GNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYEQQEKL--FDPARSSTDANIS 241
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C P CS ++ +K C+ Y V YG G + G +TL L +
Sbjct: 242 CAAPACSDLY----------------TKGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSS 285
Query: 206 -RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
I F GC + + AG+ G GRGKTSLP Q DK+ + +H F + +
Sbjct: 286 YDAIKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQA-YDKYG-GVFAHCFPARS-SG 342
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
+ LD G S +T LT V+N +YYVGL I VGG+ + +
Sbjct: 343 TGYLDFGPGSSPAVSTKLTTPMLVDN--------GLTFYYVGLTGIRVGGKLLSIPPSVF 394
Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
T GTIVDSGT T + P + L F S + R Y + A AL+ L C+D
Sbjct: 395 T-----TAGTIVDSGTVITRLPPAAYSSLRSAFAS-AIAARGYKK---APALSLLDTCYD 445
Query: 382 VPGEKTGSFPELKLHFKGGAEVTLPVEN--YFAVVGEGSAVCLTVVTDREASGGPSIILG 439
G + P + L F+GGA + + Y A V S CL + E I+G
Sbjct: 446 FTGMSQVAIPTVSLLFQGGASLDVDASGIIYAASV---SQACLGFAANEEDD--DVGIVG 500
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
N Q++ + V YD+ + +GF C
Sbjct: 501 NTQLKTFGVVYDIGKKVVGFSPGAC 525
>gi|413944596|gb|AFW77245.1| hypothetical protein ZEAMMB73_545774 [Zea mays]
gi|414876929|tpg|DAA54060.1| TPA: hypothetical protein ZEAMMB73_875469 [Zea mays]
Length = 459
Score = 118 bits (295), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 118/399 (29%), Positives = 164/399 (41%), Gaps = 75/399 (18%)
Query: 85 SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRL 144
S G Y + S GTPPQ + + DTGS L+W C C PS++P SS+
Sbjct: 87 SGGAYDMEFSMGTPPQKLTALADTGSDLIWAKCGGACTTS-CEPQGSPSYLPNASSTFAK 145
Query: 145 LGCQNPKCSWIHHESIQ-CRDCNDEPLATSKNCTQICPSYLVLYGSG-----LTEGIALS 198
L C + CS + +S+ C A C Y YG G T+G
Sbjct: 146 LPCSDRLCSLLRSDSVAWCA-------AAGAEC-----DYRYSYGLGDDDHHYTQGFLAR 193
Query: 199 ETLNLPNRIIPNFLVGCSV---LSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFD 255
ET L +P+ GC+ +G+ G GRG SL SQLN F YCL S
Sbjct: 194 ETFTLGADAVPSVRFGCTTASEGGYGSGSGLVGLGRGPLSLVSQLNASTFMYCLTS---- 249
Query: 256 DTTRTSSLILDNGSS--HSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
D ++ S L+ + +S + ++TGL A + +Y V LR I++G
Sbjct: 250 DASKASPLLFGSLASLTGAQVQSTGLL--------------ASTTFYAVNLRSISIGSAT 295
Query: 314 VRVWHKYLTLDRDGNG---GTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
G G G + DSGTT T++A + F+SQ ++
Sbjct: 296 T-----------PGVGEPEGVVFDSGTTLTYLAEPAYSEAKAAFLSQTSLDQ-------V 337
Query: 371 EALTGLRPCFDVPGE---KTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTD 427
E G CF P + P + LHF GA++ LPV NY V +G VC V
Sbjct: 338 EDTDGFEACFQKPANGRLSNAAVPTMVLHFD-GADMALPVANYVVEVEDG-VVCWIV--- 392
Query: 428 REASGGPSI-ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
PS+ I+GN NY V +D+ L F+ C
Sbjct: 393 ---QRSPSLSIIGNIMQVNYLVLHDVHRSVLSFQPANCD 428
>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 118 bits (295), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 108/391 (27%), Positives = 163/391 (41%), Gaps = 56/391 (14%)
Query: 82 SSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSS 141
+S G Y + + G PP +LDTGS + W C C C P F P S+S
Sbjct: 142 TSQGSGEYFLRVGIGKPPSQAYVVLDTGSDVSWIQCA---PCSECYQQSDPIFDPVSSNS 198
Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSET 200
+ C P+C +S+ +C +N T + Y V YG G T G +ET
Sbjct: 199 YSPIRCDAPQC-----KSLDLSEC--------RNGTCL---YEVSYGDGSYTVGEFATET 242
Query: 201 LNLPNRIIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNLDKFSYCLLSHK 253
+ L + N +GC G+ G G GK S P+Q+N FSYCL++
Sbjct: 243 VTLGTAAVENVAIGCG----HNNEGLFVGAAGLLGLGGGKLSFPAQVNATSFSYCLVNR- 297
Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
D+ S+L ++ + P NP + +YY+GL+ I+VGG+
Sbjct: 298 --DSDAVSTLEFNS------PLPRNVVTAPLRRNPEL------DTFYYLGLKGISVGGEA 343
Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
+ + +D G GG I+DSGT T + E+++ L D FV + A +
Sbjct: 344 LPIPESIFEVDAIGGGGIIIDSGTAVTRLRSEVYDALRDAFV------KGAKGIPKANGV 397
Query: 374 TGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
+ C+D+ ++ P + HF G E+ LP NY V C +
Sbjct: 398 SLFDTCYDLSSRESVQVPTVSFHFPEGRELPLPARNYLIPVDSVGTFCFAFAPTTSSLS- 456
Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN Q Q V +D+ N +GF C
Sbjct: 457 ---IMGNVQQQGTRVGFDIANSLVGFSADSC 484
>gi|356539555|ref|XP_003538263.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 438
Score = 118 bits (295), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 112/387 (28%), Positives = 165/387 (42%), Gaps = 58/387 (14%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + GTPPQ + +DT + W PCT C C+S+ F P+ S++ + + C
Sbjct: 97 YIVRAKIGTPPQTLLLAIDTSNDAAWIPCT---ACDGCTSTL---FAPEKSTTFKNVSCG 150
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
+P+C+ + S C ++ + YGS + +T+ L I
Sbjct: 151 SPECNKVPSPSCGTSAC----------------TFNLTYGSSSIAANVVQDTVTLATDPI 194
Query: 209 PNFLVGCSVLS---SRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHK---FDDTTR 259
P + GC + S P G+ G GRG SL SQ L FSYCL S K F + R
Sbjct: 195 PGYTFGCVAKTTGPSTPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLR 254
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
+ + + YTP + NP R++ YYV L I VG + V +
Sbjct: 255 LGPV----------AQPIRIKYTPLLKNP---RRSSL---YYVNLFAIRVGRKIVDIPPA 298
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQ--MVKNRNYTRALGAEALTGLR 377
L + GT+ DSGT FT + ++ + DEF + M N T +L G
Sbjct: 299 ALAFNAATGAGTVFDSGTVFTRLVAPVYTAVRDEFRRRVAMAAKANLT----VTSLGGFD 354
Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
C+ VP P + F G VTLP +N GS CL + + + +
Sbjct: 355 TCYTVPIVA----PTITFMFSG-MNVTLPQDNILIHSTAGSTSCLAMASAPDNVNSVLNV 409
Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ N Q QN+ V YD+ N RLG ++LC
Sbjct: 410 IANMQQQNHRVLYDVPNSRLGVARELC 436
>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 440
Score = 117 bits (294), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 128/441 (29%), Positives = 202/441 (45%), Gaps = 65/441 (14%)
Query: 39 NPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTP 98
NP++ S Q L + + S++R H + K + +++S+S G Y +++S GTP
Sbjct: 47 NPTETSSQRLRNAIHRSVSRVFHFTDISQKDASDNAPQI--DLTSNS-GEYLMNISLGTP 103
Query: 99 PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE 158
P I I DTGS L+W C C C + P F PK SS+ + + C + +C+ + ++
Sbjct: 104 PFPIMAIADTGSDLLWTQCK---PCDDCYTQVDPLFDPKASSTYKDVSCSSSQCTALENQ 160
Query: 159 SIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPNRI-----IPNFL 212
+ C+ E C SY YG T+G +TL L + + N +
Sbjct: 161 A----SCSTE----DNTC-----SYSTSYGDRSYTKGNIAVDTLTLGSTDTRPVQLKNII 207
Query: 213 VGCSVLSS----RQPAGIAGFGRGKTSLPSQL--NLD-KFSYCLLSHKFDDTTRTSSLIL 265
+GC ++ ++ +GI G G G SL +QL ++D KFSYCL+ ++ RTS +
Sbjct: 208 IGCGHNNAGTFNKKGSGIVGLGGGAVSLITQLGDSIDGKFSYCLVPLTSEN-DRTSKI-- 264
Query: 266 DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
N +++ TG+ TP + + +YY+ L+ I+VG + V+ Y D
Sbjct: 265 -NFGTNAVVSGTGVVSTPLI-------AKSQETFYYLTLKSISVGSKEVQ----YPGSDS 312
Query: 326 -DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPG 384
G G I+DSGTT T + E + L D S + + + TGL C+ G
Sbjct: 313 GSGEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKK------QDPQTGLSLCYSATG 366
Query: 385 EKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI-ILGNFQM 443
+ P + +HF GA+V L N F + E VC G PS I GN
Sbjct: 367 DL--KVPAITMHFD-GADVNLKPSNCFVQISE-DLVCFAF------RGSPSFSIYGNVAQ 416
Query: 444 QNYYVEYDLRNQRLGFKQQLC 464
N+ V YD ++ + FK C
Sbjct: 417 MNFLVGYDTVSKTVSFKPTDC 437
>gi|356537928|ref|XP_003537458.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 445
Score = 117 bits (294), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 121/417 (29%), Positives = 184/417 (44%), Gaps = 61/417 (14%)
Query: 74 TTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS 133
T ++T +S + ++SL+ GTPPQ + +LDTGS L W C
Sbjct: 55 TPSSTRKVSFYHNVTLTVSLTVGTPPQSVTMVLDTGSELSWLHCKKQQNINSV------- 107
Query: 134 FIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTE 193
F P LSSS + C +P C + + C+ S N + SY E
Sbjct: 108 FNPHLSSSYTPIPCMSPICKTRTRDFLIPVSCD------SNNLCHVTVSYADFTS---LE 158
Query: 194 GIALSETLNLPNRIIPNFLVGC--SVLSSR-----QPAGIAGFGRGKTSLPSQLNLDKFS 246
G S+T + P + G S SS + G+ G RG S +Q+ KFS
Sbjct: 159 GNLASDTFAISGSGQPGIIFGSMDSGFSSNANEDSKTTGLMGMNRGSLSFVTQMGFPKFS 218
Query: 247 YCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFV--NNP-SVAERNAFSVYYYVG 303
YC+ + + +S +L G + + K L YTP V N P +R V Y V
Sbjct: 219 YCI-------SGKDASGVLLFGDA-TFKWLGPLKYTPLVKMNTPLPYFDR----VAYTVR 266
Query: 304 LRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQ------ 357
L I VG + ++V + D G G T+VDSGT FTF+ ++ L +EFV+Q
Sbjct: 267 LMGIRVGSKPLQVPKEIFAPDHTGAGQTMVDSGTRFTFLLGSVYTALRNEFVAQTRGVLT 326
Query: 358 MVKNRNYTRALGAEALTGLRPCFDV-PGEKTGSFPELKLHFKGGAEVTLPVENYFAVV-- 414
++++ N+ GA L CF V G + P + + F+ GAE+++ E V
Sbjct: 327 LLEDPNFVFE-GAMDL-----CFRVRRGGVVPAVPAVTMVFE-GAEMSVSGERLLYRVGG 379
Query: 415 ------GEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
G G CLT + + G + ++G+ QN ++E+DL N R+GF C+
Sbjct: 380 DGDVAKGNGDVYCLT-FGNSDLLGIEAYVIGHHHQQNVWMEFDLVNSRVGFADTKCE 435
>gi|125532796|gb|EAY79361.1| hypothetical protein OsI_34489 [Oryza sativa Indica Group]
Length = 405
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 113/396 (28%), Positives = 171/396 (43%), Gaps = 63/396 (15%)
Query: 85 SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRL 144
S G Y + + GTPPQ + ++D LVW CT C+ C +P F P SS+ R
Sbjct: 53 SQGLYVANFTIGTPPQPVSAVVDLTGELVWTQCT---PCQPCFEQDLPLFDPTKSSTFRG 109
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLP 204
L C + C I +S+NCT Y +G T G+A ++T +
Sbjct: 110 LPCGSHLCESIPE--------------SSRNCTSDVCIYEAPTKAGDTGGMAGTDTFAI- 154
Query: 205 NRIIPNFLVGCSVLSSRQ------PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTT 258
GC V++ ++ P+GI G GR SL +Q+N+ FSYCL
Sbjct: 155 GAAKETLGFGCVVMTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVTAFSYCLAGK------ 208
Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAER-NAFSVYYYVGLRRITVGGQRVRVW 317
SS L G++ + TPFV S N + YY V L I GG ++
Sbjct: 209 --SSGALFLGATAKQLAGGKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKAGGAPLQAA 266
Query: 318 HKYLTLDRDGNGGTI-VDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
+G T+ +D+ + +++A ++ L + T A+G + +
Sbjct: 267 SS--------SGSTVLLDTVSRASYLADGAYKAL----------KKALTAAVGVQPVASP 308
Query: 377 RPCFDVPGEK--TGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR------ 428
+D+ K G PEL F GGA +T+P NY G G+ VCLT+ +
Sbjct: 309 PKPYDLCFSKAVAGDAPELVFTFDGGAALTVPPANYLLASGNGT-VCLTIGSSASLNLTG 367
Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
E G + ILG+ Q +N +V +DL+ + L FK C
Sbjct: 368 ELEG--ASILGSLQQENVHVLFDLKEETLSFKPADC 401
>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 489
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 119/433 (27%), Positives = 190/433 (43%), Gaps = 61/433 (14%)
Query: 56 LTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFIL--------- 106
+ RAL + N + ++ T++ + S I L+ G + + +I+
Sbjct: 90 MRRALLLDNIRVQSLQLRIKAMTSSTTEQSVSETQIPLTSGIKLETLNYIVTVELGGKNM 149
Query: 107 ----DTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQC 162
DTGS L W C C+ C + + P + P +SSS + + C + C + +
Sbjct: 150 SLIVDTGSDLTWVQCQ---PCRSCYNQQGPLYDPSVSSSYKTVFCNSSTC-----QDLVA 201
Query: 163 RDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSSR 221
N P + Y+V YG G T G SE++ L + + N + GC +
Sbjct: 202 ATGNSGPCGGFNGVVKTTCEYVVSYGDGSYTRGDLASESIVLGDTKLENLVFGCGRNNKG 261
Query: 222 ---QPAGIAGFGRGKTSLPSQ----LNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSD- 273
+G+ G GR SL SQ N FSYCL S + D + T S G+ S
Sbjct: 262 LFGGASGLMGLGRSSVSLVSQTLKTFN-GVFSYCLPSLE-DGASGTLSF----GNDFSVY 315
Query: 274 KKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIV 333
K +T + YTP V NP + +Y + L ++GG V K L+ R G ++
Sbjct: 316 KNSTSVFYTPLVQNPQLRS------FYILNLTGASIGG----VELKTLSFGR----GILI 361
Query: 334 DSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPEL 393
DSGT T + P +++ + EF+ Q ++ A + L CF++ + S P +
Sbjct: 362 DSGTVITRLPPSIYKAVKTEFLKQ------FSGFPSAPGYSILDTCFNLTSYEDISIPTI 415
Query: 394 KLHFKGGAEVTLPVENYFAVVG-EGSAVCLTVVT-DREASGGPSIILGNFQMQNYYVEYD 451
K+ F+G AE+ + V F V + S VCL + + E G I+GN+Q +N V YD
Sbjct: 416 KMIFEGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVG---IIGNYQQKNQRVIYD 472
Query: 452 LRNQRLGFKQQLC 464
+RLG + C
Sbjct: 473 TTQERLGIAGENC 485
>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 518
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 118/391 (30%), Positives = 166/391 (42%), Gaps = 62/391 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y +++ GTP + DTGS W C Y K+ F P SS+ +
Sbjct: 177 GNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKL--FDPARSSTYANVS 234
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C P C + R C + +C Y V YG G + G +TL L +
Sbjct: 235 CAAPACF-----DLDTRGC------SGGHCL-----YGVQYGDGSYSIGFFAMDTLTLSS 278
Query: 206 -RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLDK----FSYCLLSHKFDDT 257
+ F GC + + AG+ G GRGKTSLP Q DK F++CL +
Sbjct: 279 YDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQ-TYDKYGGVFAHCLPARS---- 333
Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFV--NNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
+ + LD G LT TP + N P+ +YYVG+ I VGGQ +
Sbjct: 334 --SGTGYLDFGPGSPAAAGARLT-TPMLTDNGPT---------FYYVGMTGIRVGGQLLS 381
Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
+ GTIVDSGT T + P + L FVS M R Y + A A++
Sbjct: 382 IPQSVFA-----TAGTIVDSGTVITRLPPPAYSSLRSAFVSAMAA-RGYKK---APAVSL 432
Query: 376 LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVEN--YFAVVGEGSAVCLTVVTDREASGG 433
L C+D G + P + L F+GGA + + Y A V S VCL + + GG
Sbjct: 433 LDTCYDFTGMSQVAIPTVSLLFQGGAILDVDASGIMYAASV---SQVCLGFAANED--GG 487
Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN Q++ + V YD+ + +GF C
Sbjct: 488 DVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 518
>gi|51038078|gb|AAT93881.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 481
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 114/416 (27%), Positives = 167/416 (40%), Gaps = 55/416 (13%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTN-------HYQCKYCSSSKIPSFIPKLSSS 141
Y S G PPQ ++DTGS LVW C+ C +P + LS +
Sbjct: 78 YIASYGIGDPPQPAEAVVDTGSDLVWTQCSTCRLPAAAAAGGGGCFPQNLPYYNFSLSRT 137
Query: 142 SRLLGCQNPKCSW--IHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSE 199
+R + C + + + E+ C + + + S YG+G+ G+ ++
Sbjct: 138 ARAVPCDDDDGALCGVAPETAGCARGG----GSGDDACVVAAS----YGAGVALGVLGTD 189
Query: 200 TLNLPNRIIPNFLVGCSVLSSRQP------AGIAGFGRGKTSLPSQLNLDKFSYCLLSHK 253
P+ GC + P +GI G GRG SL SQLN +FSYCL +
Sbjct: 190 AFTFPSSSSVTLAFGCVSQTRISPGALNGASGIIGLGRGALSLVSQLNATEFSYCLTPY- 248
Query: 254 FDDTTRTSSLILDNGSSHSDKKTTG--------LTYTPFVNNPSVAERNAFSVYYYVGLR 305
F DT S L + +G G +T PF NP + + FS +YY+ L
Sbjct: 249 FRDTVSPSHLFVGDGELAGLSAAAGGGGGGGAPVTTVPFAKNP---KDSPFSTFYYLPLV 305
Query: 306 RITVGGQRVRVWHKYLTLDRDG----NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN 361
+ G V + L GG ++DSG+ FT + L E Q+ +
Sbjct: 306 GLAAGNATVALPAGAFDLREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGS 365
Query: 362 RNYT---RALGAEALTGLRPCFDVPGEKTGSFPELKLHFK----GGAEVTLPVENYFAVV 414
+ LG + D + P L L F GG E+ +P E Y+A V
Sbjct: 366 GSLVPPPAKLGGALELCVEAGDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEKYWARV 425
Query: 415 GEGSAVCLTVVTDREASGGPSI------ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
E S C+ VV+ ASG ++ I+GNF Q+ V YDL N L F+ C
Sbjct: 426 -EASTWCMAVVS--SASGNATLPTNETTIIGNFMQQDMRVLYDLANGLLSFQPANC 478
>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 107/385 (27%), Positives = 159/385 (41%), Gaps = 50/385 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + G+PP+ ++D+GS +VW C C C P F P S+S +
Sbjct: 41 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCK---PCTQCYHQTDPLFDPADSASFMGVS 97
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNLPN 205
C + C + + CN S C Y V YG G T+G ETL L
Sbjct: 98 CSSAVCDQVDNAG-----CN------SGRC-----RYEVSYGDGSSTKGTLALETLTLGR 141
Query: 206 RIIPNFLVGCSVLSSRQ---PAGIAGFGRGKTSLPSQLNLDK---FSYCLLSHKFDDTTR 259
++ N +GC ++ AG+ G G G S QL+ ++ FSYCL+S
Sbjct: 142 TVVQNVAIGCGHMNQGMFVGAAGLLGLGGGSMSFVGQLSRERGNAFSYCLVSR-----VT 196
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
S+ L+ GS + G + P + NP YYY+GL + VG +V +
Sbjct: 197 NSNGFLEFGS---EAMPVGAAWIPLIRNPHSPS------YYYIGLSGLGVGDMKVPISED 247
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
L GNGG ++D+GT T +E D F+ Q N RA G C
Sbjct: 248 IFELTELGNGGVVMDTGTAVTRFPTVAYEAFRDAFIDQ---TGNLPRASGVSIFD---TC 301
Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
+++ G + P + +F GG +TLP N+ V + C ILG
Sbjct: 302 YNLFGFLSVRVPTVSFYFSGGPILTLPANNFLIPVDDAGTFCFAFAPSPSGLS----ILG 357
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
N Q + + D N+ +GF +C
Sbjct: 358 NIQQEGIQISVDGANEFVGFGPNVC 382
>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 119/400 (29%), Positives = 170/400 (42%), Gaps = 46/400 (11%)
Query: 70 TTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS 129
+T + T T+ +S G Y + G P Q F+ DTGS + W C C
Sbjct: 165 STNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQ 224
Query: 130 KIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS 189
P F PK SSS L C + +C + DE + +C Y V YG
Sbjct: 225 IGPIFDPKSSSSYSPLSCDSEQCHLL-----------DEAACDANSCI-----YEVEYGD 268
Query: 190 G-LTEGIALSETLNLPN-RIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDK 244
G T G +ET + + IPN +GC + G+ G G G SL SQL
Sbjct: 269 GSFTVGELATETFSFRHSNSIPNLPIGCGHDNEGLFVGADGLIGLGGGAISLSSQLEATS 328
Query: 245 FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGL 304
FSYCL+ D SS LD ++D+ + LT +P V N + F + YV +
Sbjct: 329 FSYCLV-----DLDSESSSTLD---FNADQPSDSLT-SPLVKN------DRFPTFRYVKV 373
Query: 305 RRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNY 364
++VGG+ + + +D G+GG IVDSGTT T + ++++ L D FV + KN
Sbjct: 374 IGMSVGGKPLPISSSSFEIDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVG-LTKNLP- 431
Query: 365 TRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTV 424
A ++ C+D+ + P + G + LP +N V CL
Sbjct: 432 ----PAPGVSPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLIQVDSAGTFCLAF 487
Query: 425 VTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ S P I+GN Q Q V YDL N +GF C
Sbjct: 488 L----PSTFPLSIIGNVQQQGIRVSYDLANSLVGFSTDKC 523
>gi|242051593|ref|XP_002454942.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
gi|241926917|gb|EES00062.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
Length = 431
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 108/386 (27%), Positives = 165/386 (42%), Gaps = 56/386 (14%)
Query: 88 GYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGC 147
GY++++ GTPPQ+ I DT S L W C + P F P SSS + C
Sbjct: 90 GYTVTIGIGTPPQLHTLIADTASDLTWTQCNLFND---TAKQVEPLFDPAKSSSFAFVTC 146
Query: 148 QNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPN-- 205
+ C+ D P +K C+ Y+ Y S G+ E+ L +
Sbjct: 147 SSKLCT------------EDNP--GTKRCSNKTCRYVYPYVSVEAAGVLAYESFTLSDNN 192
Query: 206 -RIIPNFLVGCSVLSSRQ---PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
I +F GC L+ +GI G S+ SQL + KFSYCL + ++S
Sbjct: 193 QHICMSFGFGCGALTDGNLLGASGILGMSPAILSMVSQLAIPKFSYCLTPYT---DRKSS 249
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
L + KTTG + + + YYYV L +++G +R+ V
Sbjct: 250 PLFFGAWADLGRYKTTGPI------------QKSLTFYYYVPLVGLSLGTRRLDVPAATF 297
Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
L + GGT+VD G T +A F L + + + L + + CF
Sbjct: 298 ALKQ---GGTVVDLGCTVGQLAEPAFTALKEAVLHTL------NLPLTNRTVKDYKVCFA 348
Query: 382 VP-GEKTGSF--PELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
+P G G+ P L L+F GGA++ LP +NYF G +CL +V GG I+
Sbjct: 349 LPSGVAMGAVQTPPLVLYFDGGADMVLPRDNYFQEPTAG-LMCLALV-----PGGGMSII 402
Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
GN Q QN+++ +D+ + + F +C
Sbjct: 403 GNVQQQNFHLLFDVHDSKFLFAPTIC 428
>gi|115434870|ref|NP_001042193.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|55296112|dbj|BAD67831.1| putative CDR1 [Oryza sativa Japonica Group]
gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa Japonica Group]
gi|113531724|dbj|BAF04107.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|125569253|gb|EAZ10768.1| hypothetical protein OsJ_00604 [Oryza sativa Japonica Group]
Length = 454
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 121/396 (30%), Positives = 169/396 (42%), Gaps = 66/396 (16%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y ++++ G+PP+ + I DTGS LVW C +++ F P SS+ + CQ
Sbjct: 101 YLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDPSRSSTYGRVSCQ 160
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNL---- 203
C E++ C+D NC +YL YG G T G+ +ET
Sbjct: 161 TDAC-----EALGRATCDD-----GSNC-----AYLYAYGDGSNTTGVLSTETFTFDDGG 205
Query: 204 ----PNRI-IPNFLVGCSVLSSRQ--PAGIAGFGRGKTSLPSQLNLD-----KFSYCLLS 251
P ++ + GCS ++ G+ G G G SL +QL +FSYCL+
Sbjct: 206 SGRSPRQVRVGGVKFGCSTATAGSFPADGLVGLGGGAVSLVTQLGGATSLGRRFSYCLVP 265
Query: 252 HKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG 311
H + +S+L N + +D G TP V YY V L + VG
Sbjct: 266 HSVN---ASSAL---NFGALADVTEPGAASTPLVAGD-------VDTYYTVVLDSVKVGN 312
Query: 312 QRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
+ T+ + IVDSGTT TF+ P L P+ DE +R T
Sbjct: 313 K---------TVASAASSRIIVDSGTTLTFLDPSLLGPIVDEL------SRRITLPPVQS 357
Query: 372 ALTGLRPCFDVPG---EKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR 428
L+ C++V G E S P+L L F GGA V L EN F V EG+ +CL +V
Sbjct: 358 PDGLLQLCYNVAGREVEAGESIPDLTLEFGGGAAVALKPENAFVAVQEGT-LCLAIVATT 416
Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
E P ILGN QN +V YDL + F C
Sbjct: 417 EQQ--PVSILGNLAQQNIHVGYDLDAGTVTFAGADC 450
>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
Length = 476
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 112/392 (28%), Positives = 166/392 (42%), Gaps = 66/392 (16%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVW---FPCTNHYQCKYCSSSKIPSFIPKLSSSSRLL 145
+ +++ FGTP Q I DTGS + W PC+ H C P F P S++ ++
Sbjct: 135 FVVTVGFGTPAQTYTVIFDTGSDVSWIQCLPCSGH-----CYKQHDPIFDPTKSATYSVV 189
Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALS-ETLNLP 204
C +P+C+ A C+ Y V YG G + LS ETL+L
Sbjct: 190 PCGHPQCA----------------AADGSKCSNGTCLYKVEYGDGSSSAGVLSHETLSLT 233
Query: 205 N-RIIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDT 257
+ R +P F GC ++ G+ G GRG+ SL SQ FSYCL S D+T
Sbjct: 234 STRALPGFAFGCGQTNLGDFGDVDGLIGLGRGQLSLSSQAAASFGGTFSYCLPS---DNT 290
Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
T I + +D + YT V ++ + +Y+V L I +GG + V
Sbjct: 291 THGYLTIGPTTPASNDD----VQYTAMV------QKQDYPSFYFVELVSIDIGGYILPVP 340
Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
T D GT +DSGT T++ PE + L D F M + + A A
Sbjct: 341 PTLFTDD-----GTFLDSGTILTYLPPEAYTALRDRFKFTMTQYKP------APAYDPFD 389
Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVV----GEGSAV-CLTVVTDREASG 432
C+D G+ P + F G+ L ++F ++ A+ CL V S
Sbjct: 390 TCYDFTGQSAIFIPAVSFKFSDGSVFDL---SFFGILIFPDDTAPAIGCLGFVA--RPSA 444
Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
P I+GN Q +N V YD+ +++GF C
Sbjct: 445 MPFTIVGNMQQRNTEVIYDVAAEKIGFASASC 476
>gi|449485448|ref|XP_004157171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 430
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 114/389 (29%), Positives = 167/389 (42%), Gaps = 45/389 (11%)
Query: 91 ISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNP 150
+SL GTPPQ +LDTGS L W C + K ++P +PK ++S +
Sbjct: 68 VSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKIKK-----RLPP-LPKPKTTSFDPSLSSS 121
Query: 151 KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRI-I 208
+ I D L TS + ++C Y Y G L EG + E +
Sbjct: 122 FSLLPCNHPICKPRIPDFTLPTSCDQNRLC-HYSYFYADGTLAEGNLVREKFTFSKSLST 180
Query: 209 PNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNG 268
P ++GC+ S+ GI G RG+ S SQ + KFSYC+ S + T L DN
Sbjct: 181 PPVILGCAQASTEN-RGILGMNRGRLSFISQAKISKFSYCVPSRTGSNPTGLFYL-GDNP 238
Query: 269 SSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGN 328
+S K T LT+ ++P N + Y + ++ I + G+R+ V D G+
Sbjct: 239 NSSKFKYVTMLTFPESQSSP-----NLDPLAYTLPMKAIKIAGKRLNVPPAAFKPDAGGS 293
Query: 329 GGTIVDSGTTFTFMAPELFEPLADE---FVSQMVKNRNYTRALGAEALTGLRPCFD--VP 383
G T++DSG+ T++ E +E + +E V M+K + Y A A+ CFD V
Sbjct: 294 GQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMK-KGYVYADVADM------CFDAGVT 346
Query: 384 GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTV--------VTDREASGGPS 435
E + F G E+ VG G V V + E G S
Sbjct: 347 AEVGRRIGGISFEFDNGVEI---------FVGRGEGVLTEVEKGVKCVGIGRSERLGIGS 397
Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+G QN +VEYDL N+R+GF C
Sbjct: 398 NIIGTVHQQNMWVEYDLANKRVGFGGAEC 426
>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
Length = 420
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 109/389 (28%), Positives = 160/389 (41%), Gaps = 58/389 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + GTP + + + DTGS + W C+ C+ C + P F P LSSS + L
Sbjct: 79 GDYFARIGVGTPARSVYMVADTGSDVSWLQCS---PCRKCYRQQDPIFNPSLSSSFKPLA 135
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C + C ++ + C+ + C Y V YG G T G +ETL+
Sbjct: 136 CASSICG-----KLKIKGCSRK-----NECM-----YQVSYGDGSFTVGDFSTETLSFGE 180
Query: 206 RIIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLN---LDKFSYCLLSHKFD 255
+ + +GC R G+ G GRG S PSQ FSYCL
Sbjct: 181 HAVRSVAMGCG----RNNQGLFHGAAGLLGLGRGPLSFPSQTGTSYASVFSYCLPRR--- 233
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
++ +SL+ G S +K +T + N YYYVGL RI V G V
Sbjct: 234 ESAIAASLVF--GPSAVPEKAR---FTKLLPN------RRLDTYYYVGLARIRVAGSPVN 282
Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
+ + G GG IVDSGT + + + L D F R+ A ++
Sbjct: 283 IPPDAFAMGSRGTGGVIVDSGTAISRLTTPAYTALRDAF-------RSLVTFPSAPGISL 335
Query: 376 LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPS 435
C+D+ KT + P + L F GGA + LP + V + CL + EA
Sbjct: 336 FDTCYDLSSMKTATLPAVVLDFDGGASMPLPADGILVNVDDEGTYCLAFAPEEEAFS--- 392
Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN Q Q + + D + +++G C
Sbjct: 393 -IIGNVQQQTFRISIDNQKEQMGIAPDQC 420
>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 441
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 132/467 (28%), Positives = 194/467 (41%), Gaps = 61/467 (13%)
Query: 14 FFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTT 73
F F LL + +S FS+ H + + + + + LT A H ++
Sbjct: 17 FLFHLLEVGLAS--GGGFSVDLIHRDSPHSPFFDPSKTRTERLTDAFH-RSASRVGRFRQ 73
Query: 74 TTTTTTNISSH---SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSK 130
+ T+ I S S G Y ++LS GTPP + I+DTGS L W C C +C
Sbjct: 74 SAMTSDGIQSRLVPSAGEYIMNLSIGTPPVPVIAIVDTGSDLTWTQCR---PCTHCYKQV 130
Query: 131 IPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG 190
+P F PK SS+ R C C + ND K CT ++ Y G
Sbjct: 131 VPFFDPKNSSTYRDSSCGTSFCLALG---------NDRSCRNGKKCT-----FMYSYADG 176
Query: 191 -LTEGIALSETLNLPNRI-----IPNFLVGCSVLS----SRQPAGIAGFGRGKTSLPSQL 240
T G ETL + + P F GC S +GI G G + S+ SQL
Sbjct: 177 SFTGGNLAVETLTVASTAGKPVSFPGFAFGCVHRSGGIFDEHSSGIVGLGVAELSMISQL 236
Query: 241 NL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFS 297
+FSYCLL F D++ +S + N G TP V + +
Sbjct: 237 KSTINGRFSYCLLP-VFTDSSMSSRI---NFGRSGIVSGAGTVSTPLV------MKGPDT 286
Query: 298 VYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQ 357
YY + L +VG +R+ + + G IVDSGTT+T++ E + L +E V+
Sbjct: 287 YYYLITLEGFSVGKKRLS-YKGFSKKAEVEEGNIIVDSGTTYTYLPLEFYVKL-EESVAH 344
Query: 358 MVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEG 417
+K + G +L C++ ++ + P + HFK A V L N F + E
Sbjct: 345 SIKGKRVRDPNGISSL-----CYNTTVDQIDA-PIITAHFK-DANVELQPWNTFLRMQE- 396
Query: 418 SAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
VC TV+ + ILGN N+ V +DLR +R+ FK C
Sbjct: 397 DLVCFTVLPTSDIG-----ILGNLAQVNFLVGFDLRKKRVSFKAADC 438
>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
Length = 353
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 109/389 (28%), Positives = 160/389 (41%), Gaps = 58/389 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + GTP + + + DTGS + W C+ C+ C + P F P LSSS + L
Sbjct: 12 GDYFARIGVGTPARSVYMVADTGSDVSWLQCS---PCRKCYRQQDPIFNPSLSSSFKPLA 68
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C + C ++ + C+ + C Y V YG G T G +ETL+
Sbjct: 69 CASSICG-----KLKIKGCSRK-----NKCM-----YQVSYGDGSFTVGDFSTETLSFGE 113
Query: 206 RIIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLN---LDKFSYCLLSHKFD 255
+ + +GC R G+ G GRG S PSQ FSYCL
Sbjct: 114 HAVRSVAMGCG----RNNQGLFHGAAGLLGLGRGPLSFPSQTGTSYASVFSYCLPRR--- 166
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
++ +SL+ G S +K +T + N YYYVGL RI V G V
Sbjct: 167 ESAIAASLVF--GPSAVPEKAR---FTKLLPN------RRLDTYYYVGLARIRVAGSPVN 215
Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
+ + G GG IVDSGT + + + L D F R+ A ++
Sbjct: 216 IPPDAFAMGSRGTGGVIVDSGTAISRLTTPAYTALRDAF-------RSLVTFPSAPGISL 268
Query: 376 LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPS 435
C+D+ KT + P + L F GGA + LP + V + CL + EA
Sbjct: 269 FDTCYDLSSMKTATLPAVVLDFDGGASMPLPADGILVNVDDEGTYCLAFAPEEEAFS--- 325
Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN Q Q + + D + +++G C
Sbjct: 326 -IIGNVQQQTFRISIDNQKEQMGIAPDQC 353
>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 120/400 (30%), Positives = 171/400 (42%), Gaps = 46/400 (11%)
Query: 70 TTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS 129
+T + T T+ +S G Y + G P Q F+ DTGS + W C C
Sbjct: 165 STNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQ 224
Query: 130 KIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS 189
P F PK SSS L C + +C + DE + +C Y V YG
Sbjct: 225 IGPIFDPKSSSSYSPLSCDSEQCHLL-----------DEAACDANSCI-----YEVEYGD 268
Query: 190 G-LTEGIALSETLNLPN-RIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDK 244
G T G +ET + + IPN +GC + AG+ G G G SL SQL
Sbjct: 269 GSFTVGELATETFSFRHSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLEATS 328
Query: 245 FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGL 304
FSYCL+ D SS LD ++D+ + LT +P V N + F + YV +
Sbjct: 329 FSYCLV-----DLDSESSSTLD---FNADQPSDSLT-SPLVKN------DRFPTFRYVKV 373
Query: 305 RRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNY 364
++VGG+ + + +D G+GG IVDSGTT T + ++++ L D FV + KN
Sbjct: 374 IGMSVGGKPLPISSSSFEIDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVG-LTKNLP- 431
Query: 365 TRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTV 424
A ++ C+D+ + P + G + LP +N V CL
Sbjct: 432 ----PAPGVSPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAF 487
Query: 425 VTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ S P I+GN Q Q V YDL N +GF C
Sbjct: 488 L----PSTFPLSIIGNVQQQGIRVSYDLANSLVGFSTDKC 523
>gi|223950045|gb|ACN29106.1| unknown [Zea mays]
Length = 392
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 110/387 (28%), Positives = 175/387 (45%), Gaps = 60/387 (15%)
Query: 98 PPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHH 157
P I ++DTGS++ W T K CS SK S +P C +PKC
Sbjct: 42 PKDNISAVVDTGSNIFW---TTE---KECSRSKTRSMLP----------CCSPKCE--QR 83
Query: 158 ESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL---TEGIALSETLNL---PNRIIP-- 209
S CR E A ++ T+ +Y + YG T G+ + L + ++ +P
Sbjct: 84 ASCGCR--RSELKAEAEKETKC--TYAIKYGGNANDSTAGVLYEDKLTIVAVASKAVPGS 139
Query: 210 ----NFLVGCSV---LSSRQPA--GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRT 260
+GCS L + P+ G+ G GR TSLP QLN KFSYCL S++ D
Sbjct: 140 QSFEEVAIGCSTSATLKFKDPSIKGVFGLGRSATSLPRQLNFSKFSYCLSSYQKPDL--P 197
Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
S L+L ++ D T + V ++ + + Y+V L+ I++GG R+
Sbjct: 198 SYLLL---TAAPDMATGAVGGAAAVATTALQPNSDYKTRYFVDLQGISIGGTRLPA---- 250
Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
+ G VD+GT+FT + +F L E + +++K R Y + + C+
Sbjct: 251 --VSTKSGGNMFVDTGTSFTRLEGTVFAKLVTE-LDRIMKERKYVKE--QPGRNNGQICY 305
Query: 381 DVP---GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
P +++ P++ LHF A + LP ++Y S +CL + D+ G +
Sbjct: 306 SPPSTAADESSKLPDMVLHFADSANMVLPWDSYLWKT--TSKLCLAI--DKSNIKGGISV 361
Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
LGNFQMQN ++ D N++L F + C
Sbjct: 362 LGNFQMQNTHMLLDTGNEKLSFVRADC 388
>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 457
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 113/389 (29%), Positives = 173/389 (44%), Gaps = 46/389 (11%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCK-YCSSSKIPSFIPKLSSSSRLL 145
G Y + + GTP + I+DTGS L W C C YC P F P +S + + L
Sbjct: 105 GNYYVKIGVGTPAKYFSMIVDTGSSLSWLQCQ---PCVIYCHVQVDPIFTPSVSKTYKAL 161
Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLP 204
C + +CS + ++ C+ N T C Y YG + + G + L L
Sbjct: 162 SCSSSQCSSLKSSTLNAPGCS--------NATGAC-VYKASYGDTSFSIGYLSQDVLTLT 212
Query: 205 NRIIPN--FLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLN---LDKFSYCLLSHKFDD 256
P+ F+ GC + + AGI G K S+ QL+ + FSYCL S
Sbjct: 213 PSAAPSSGFVYGCGQDNQGLFGRSAGIIGLANDKLSMLGQLSNKYGNAFSYCLPSSFSAQ 272
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
+ S L G+S +TP V NP + Y++GL ITV G+ + V
Sbjct: 273 PNSSVSGFLSIGASSLSSSP--YKFTPLVKNPKIPS------LYFLGLTTITVAGKPLGV 324
Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
+ TI+DSGT T + ++ L FV M+ ++ Y +A G + L
Sbjct: 325 SASSYNVP------TIIDSGTVITRLPVAIYNALKKSFV--MIMSKKYAQAPG---FSIL 373
Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
CF ++ + PE+++ F+GGA + L V N + +G+ CL + AS P
Sbjct: 374 DTCFKGSVKEMSTVPEIRIIFRGGAGLELKVHNSLVEIEKGT-TCLAIA----ASSNPIS 428
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
I+GN+Q Q + V YD+ N ++GF C+
Sbjct: 429 IIGNYQQQTFTVAYDVANSKIGFAPGGCQ 457
>gi|302783200|ref|XP_002973373.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
gi|300159126|gb|EFJ25747.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
Length = 389
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 113/404 (27%), Positives = 179/404 (44%), Gaps = 74/404 (18%)
Query: 91 ISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNP 150
+ LS GTPPQ + F L S W C++ C+++ + F P LS+S L C +P
Sbjct: 1 MDLSLGTPPQPLNFTLAVDSGFSWVACSSSCAIN-CTTASL--FQPGLSTSHTKLPCGSP 57
Query: 151 KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTE-GIALSETLNLPN---- 205
CS S C S +C SY YG+ + G +S+ + +
Sbjct: 58 SCSAFSAVSTSC--------GPSSSC-----SYNTSYGTNFSSAGDLVSDIATMDSVRNR 104
Query: 206 RIIPNFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQLNL----DKFSYCLLSHKFDD 256
++ N +GC +L +G GF +G S QL+ KF YCL S F
Sbjct: 105 KVAANLSLGCGRDSGGLLELLDTSGFVGFDKGNVSFMGQLSALGYRSKFIYCLPSDTF-- 162
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
L++ N + ++ + YTP + NP AE Y++ L I++ + +V
Sbjct: 163 ---RGKLVIGNYKLRNASISSSMAYTPMITNPQAAE------LYFINLSTISIDKNKFQV 213
Query: 317 -WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN-RNYTRALG----- 369
+L+ +G GGT++D+ T ++ L +F +Q+V+ +NYT L
Sbjct: 214 PIQGFLS---NGTGGTVIDTTTFLSY--------LTSDFYTQLVQAIKNYTTNLVEVSSS 262
Query: 370 -AEALTGLRPCFDVPGEKTGSFP---ELKLHFKGGAEVTLPVENYFAVVGEGSA---VCL 422
A+AL G+ C+++ FP L HF GGA V V +F + S +C+
Sbjct: 263 VADAL-GVELCYNISANS--DFPPPATLTYHFLGGAGV--EVSTWFLLDDSDSVNNTICM 317
Query: 423 TVVTDREASGGPSI-ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
+ R S GP++ ++G +Q + VEYDL R GF Q C
Sbjct: 318 AI--GRSESVGPNLNVIGTYQQLDLTVEYDLEQMRYGFGAQGCN 359
>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 113/382 (29%), Positives = 162/382 (42%), Gaps = 48/382 (12%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + G P ++ + DTGS + W C C P F PK SSS L C
Sbjct: 148 YLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCN 207
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN-R 206
+ +C + +CN S C Y V YG G T G +ETL+ N
Sbjct: 208 SQQCKLLDKA-----NCN------SDTCI-----YQVHYGDGSFTTGELATETLSFGNSN 251
Query: 207 IIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSL 263
IPN +GC + AG+ G G G SL SQL FSYCL++ D +SS
Sbjct: 252 SIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNLDSD----SSST 307
Query: 264 ILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTL 323
+ N + SD T+ P V N + F Y YV + I+VGG+ + + +
Sbjct: 308 LEFNSNMPSDSLTS-----PLVKN------DRFHSYRYVKVVGISVGGKTLPISPTRFEI 356
Query: 324 DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG-AEALTGLRPCFDV 382
D G GG IVDSGT + + +++E L + FV T +L A ++ C++
Sbjct: 357 DESGLGGIIVDSGTIISRLPSDVYESLREAFV-------KLTSSLSPAPGISVFDTCYNF 409
Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
G+ P + G + LP NY ++ CL + + + I+G+FQ
Sbjct: 410 SGQSNVEVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLS----IIGSFQ 465
Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
Q V YDL N +GF C
Sbjct: 466 QQGIRVSYDLTNSLVGFSTNKC 487
>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 481
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 108/418 (25%), Positives = 165/418 (39%), Gaps = 50/418 (11%)
Query: 54 SSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLV 113
++L R L ++ + + + + G Y I + G+PP+ ++D+GS +V
Sbjct: 107 ATLIRRLSPRDATSSYSVEEFGAEVVSGMNQGSGEYFIRIGVGSPPREQYVVIDSGSDIV 166
Query: 114 WFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATS 173
W C C C P F P S+S + C + C I + C
Sbjct: 167 WVQCQ---PCTQCYHQTDPVFDPADSASFMGVPCSSSVCERIENAGCHAGGCR------- 216
Query: 174 KNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSSRQ---PAGIAGF 229
Y V+YG G T+G ETL ++ N +GC + AG+ G
Sbjct: 217 ---------YEVMYGDGSYTKGTLALETLTFGRTVVRNVAIGCGHRNRGMFVGAAGLLGL 267
Query: 230 GRGKTSLPSQLNLD---KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVN 286
G G SL QL FSYCL+S D SL G+ G + P +
Sbjct: 268 GGGSMSLVGQLGGQTGGAFSYCLVSRGTDSA---GSLEFGRGA-----MPVGAAWIPLIR 319
Query: 287 NPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPEL 346
NP +YY+ L + VGG +V + L+ GNGG ++D+GT T +
Sbjct: 320 NPRAPS------FYYIRLSGVGVGGMKVPISEDVFQLNEMGNGGVVMDTGTAVTRIPTVA 373
Query: 347 FEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLP 406
+ D F+ Q N RA G C+++ G + P + +F GG +TLP
Sbjct: 374 YVAFRDAFIGQ---TGNLPRASGVSIFD---TCYNLNGFVSVRVPTVSFYFAGGPILTLP 427
Query: 407 VENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
N+ V + C AS I+GN Q + + +D N +GF +C
Sbjct: 428 ARNFLIPVDDVGTFCFAFA----ASPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 481
>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
Length = 503
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 111/387 (28%), Positives = 162/387 (41%), Gaps = 55/387 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + GTP + DTGS W C YC K P F P S++ +
Sbjct: 163 GNYVVPIRLGTPAARFTVVFDTGSDTTWVQCQPCV--AYCYQQKEPLFTPTKSATYANIS 220
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C + CS + R C + +C Y V YG G T G +TL L
Sbjct: 221 CTSSYCS-----DLDTRGC------SGGHCL-----YAVQYGDGSYTVGFYAQDTLTLGY 264
Query: 206 RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLDK----FSYCLLSHKFDDTT 258
+ +F GC + + AG+ G GRGKTS+P Q DK F+YC+ + T
Sbjct: 265 DTVKDFRFGCGEKNRGLFGKAAGLMGLGRGKTSVPVQ-AYDKYSGVFAYCIPA------T 317
Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
+ + LD G LT N P+ +YYVG+ I VGG + +
Sbjct: 318 SSGTGFLDFGPGAPAAANARLTPMLVDNGPT---------FYYVGMTGIKVGGHLLSIPA 368
Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
+ + G +VDSGT T + P +EPL F M + Y A A + L
Sbjct: 369 TVFS-----DAGALVDSGTVITRLPPSAYEPLRSAFAKGM-EGLGYKT---APAFSILDT 419
Query: 379 CFDVPG-EKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
C+D+ G + + + P + L F+GGA + + V + S CL + + + I
Sbjct: 420 CYDLTGYQGSIALPAVSLVFQGGACLDVDASGIL-YVADVSQACLAFAANDDDTD--MTI 476
Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+GN Q + Y V YDL + +GF C
Sbjct: 477 VGNTQQKTYSVLYDLGKKVVGFAPGAC 503
>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 124/443 (27%), Positives = 190/443 (42%), Gaps = 74/443 (16%)
Query: 40 PSQDSYQNLNSLVSSSLTRALHI-KNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTP 98
P+++ YQ+ S+ RA H K+ T T +T GGY ++ S GTP
Sbjct: 45 PTENKYQHFVDAARRSINRANHFFKDSDTSTPESTVIP--------DRGGYLMTYSVGTP 96
Query: 99 PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE 158
P I I DTGS +VW C C+ C + P F P SSS + + C + C
Sbjct: 97 PTKIYGIADTGSDIVWLQCE---PCEQCYNQTTPIFNPSKSSSYKNIPCSSKLC-----H 148
Query: 159 SIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPNR-----IIPNFL 212
S++ C+D+ N Q Y + YG S ++G +TL+L + P +
Sbjct: 149 SVRDTSCSDQ------NSCQ----YKISYGDSSHSQGDLSVDTLSLESTSGSPVSFPKIV 198
Query: 213 VGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTRTSSLIL 265
+GC ++ +GI G G G SL +QL KFSYCL+ ++ +S L
Sbjct: 199 IGCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSIGGKFSYCLVPLLNKESNASSILSF 258
Query: 266 DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
+ + S G+ TP + V+Y++ L+ +VG +RV D
Sbjct: 259 GDAAVVSGD---GVVSTPLIKKD--------PVFYFLTLQAFSVGNKRVEFGGSSEGGDD 307
Query: 326 DGNGGTIVDSGTTFTFMAPELFEPLADEFVS----QMVKNRNYTRALGAEALTGLRPCFD 381
+GN I+DSGTT T + +++ L V V + N +L C+
Sbjct: 308 EGN--IIIDSGTTLTLIPSDVYTNLESAVVDLVKLDRVDDPNQQFSL----------CYS 355
Query: 382 VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNF 441
+ + FP + +HFK GA+V L + F + +G VC + S I GN
Sbjct: 356 LKSNEY-DFPIITVHFK-GADVELHSISTFVPITDG-IVCFAF----QPSPQLGSIFGNL 408
Query: 442 QMQNYYVEYDLRNQRLGFKQQLC 464
QN V YDL+ + + FK C
Sbjct: 409 AQQNLLVGYDLQQKTVSFKPTDC 431
>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 108/393 (27%), Positives = 164/393 (41%), Gaps = 60/393 (15%)
Query: 82 SSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSS 141
+S G Y + + G PP +LDTGS + W C C C P F P S+S
Sbjct: 142 TSQGSGEYFLRVGIGKPPSQAYVVLDTGSDVSWIQCA---PCSECYQQSDPIFDPISSNS 198
Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSET 200
+ C P+C +S+ +C +N T + Y V YG G T G +ET
Sbjct: 199 YSPIRCDEPQC-----KSLDLSEC--------RNGTCL---YEVSYGDGSYTVGEFATET 242
Query: 201 LNLPNRIIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNLDKFSYCLLSHK 253
+ L + + N +GC G+ G G GK S P+Q+N FSYCL++
Sbjct: 243 VTLGSAAVENVAIGCG----HNNEGLFVGAAGLLGLGGGKLSFPAQVNATSFSYCLVNRD 298
Query: 254 FD--DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG 311
D T +S + N ++ P + NP + +YY+GL+ I+VGG
Sbjct: 299 SDAVSTLEFNSPLPRNAAT-----------APLMRNPEL------DTFYYLGLKGISVGG 341
Query: 312 QRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
+ + + +D G GG I+DSGT T + E+++ L D FV + A
Sbjct: 342 EALPIPESSFEVDAIGGGGIIIDSGTAVTRLRSEVYDALRDAFV------KGAKGIPKAN 395
Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREAS 431
++ C+D+ ++ P + F G E+ LP NY V C +
Sbjct: 396 GVSLFDTCYDLSSRESVEIPTVSFRFPEGRELPLPARNYLIPVDSVGTFCFAFAPTTSSL 455
Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN Q Q V +D+ N +GF C
Sbjct: 456 S----IIGNVQQQGTRVGFDIANSLVGFSVDSC 484
>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
Length = 466
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 113/384 (29%), Positives = 165/384 (42%), Gaps = 58/384 (15%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y I++ G+P + ++DTGS + W C C C S P F P SS+ C
Sbjct: 133 YLITVRLGSPGKSQTMLIDTGSDVSWVQCK---PCSQCHSQADPLFDPSSSSTYSPFSCS 189
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRI 207
+ C+ + E C +S C Y V YG G T G S+TL L +
Sbjct: 190 SAACAQLGQEGNGC---------SSSQC-----QYTVTYGDGSSTTGTYSSDTLALGSNA 235
Query: 208 IPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLN---LDKFSYCLLSHKFDDTTRTS 261
+ F GCS + S Q G+ G G G SL SQ FSYCL T +S
Sbjct: 236 VRKFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTFGAAFSYCL------PATSSS 289
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
S L G+ T+G TP + + V +Y V ++ I VGG+++ +
Sbjct: 290 SGFLTLGAG-----TSGFVKTPMLRSSQVP------TFYGVRIQAIRVGGRQLSIPTSVF 338
Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
+ GTI+DSGT T + P + L+ F + M + Y A + L CFD
Sbjct: 339 S------AGTIMDSGTVLTRLPPTAYSALSSAFKAGM---KQYPSAPPSGI---LDTCFD 386
Query: 382 VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI-ILGN 440
G+ + S P + L F GGA V + + + S +CL + + S S+ I+GN
Sbjct: 387 FSGQSSVSIPTVALVFSGGAVVDIASDGIM-LQTSNSILCLAFAANSDDS---SLGIIGN 442
Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
Q + + V YD+ +GFK C
Sbjct: 443 VQQRTFEVLYDVGGGAVGFKAGAC 466
>gi|388508518|gb|AFK42325.1| unknown [Lotus japonicus]
Length = 204
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 79/222 (35%), Positives = 109/222 (49%), Gaps = 26/222 (11%)
Query: 244 KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVG 303
KFSYCL S D ++ S L+L + + K T TP + NPS +YY+
Sbjct: 5 KFSYCLTSM---DDSKASVLLLGSLA----KATKDAISTPLLTNPSQPS------FYYLS 51
Query: 304 LRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRN 363
L I VGG ++ + + DG+GG I+DSGTT T++ +F+ L EF+SQ
Sbjct: 52 LEGIPVGGTQLSIEQSIFDVSDDGSGGVIIDSGTTITYLEKSVFDTLKKEFISQ------ 105
Query: 364 YTRALGAEALTGLRPCFDVPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCL 422
L + TGL CF +P E T P+L HFKGG ++ LP E+Y + CL
Sbjct: 106 SNLQLDKSSSTGLDVCFSLPSETTQVEVPKLVFHFKGG-DLELPAESYMIADSKLGVACL 164
Query: 423 TVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ AS G S I GN Q QN V +DL + + F C
Sbjct: 165 AM----GASNGMS-IFGNVQQQNILVNHDLEKETISFVPTQC 201
>gi|413923876|gb|AFW63808.1| hypothetical protein ZEAMMB73_793799 [Zea mays]
Length = 415
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 110/387 (28%), Positives = 175/387 (45%), Gaps = 60/387 (15%)
Query: 98 PPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHH 157
P I ++DTGS++ W T K CS SK S +P C +PKC
Sbjct: 65 PKDNISAVVDTGSNIFW---TTE---KECSRSKTRSMLP----------CCSPKCE--QR 106
Query: 158 ESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL---TEGIALSETLNL---PNRIIP-- 209
S CR E A ++ T+ +Y + YG T G+ + L + ++ +P
Sbjct: 107 ASCGCR--RSELKAEAEKETKC--TYAIKYGGNANDSTAGVLYEDKLTIVAVASKAVPGS 162
Query: 210 ----NFLVGCSV---LSSRQPA--GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRT 260
+GCS L + P+ G+ G GR TSLP QLN KFSYCL S++ D
Sbjct: 163 QSFEEVAIGCSTSATLKFKDPSIKGVFGLGRSATSLPRQLNFSKFSYCLSSYQKPDL--P 220
Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
S L+L ++ D T + V ++ + + Y+V L+ I++GG R+
Sbjct: 221 SYLLL---TAAPDMATGAVGGAAAVATTALQPNSDYKTRYFVDLQGISIGGTRLPA---- 273
Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
+ G VD+GT+FT + +F L E + +++K R Y + + C+
Sbjct: 274 --VSTKSGGNMFVDTGTSFTRLEGTVFAKLVTE-LDRIMKERKYVKE--QPGRNNGQICY 328
Query: 381 DVP---GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
P +++ P++ LHF A + LP ++Y S +CL + D+ G +
Sbjct: 329 SPPSTAADESSKLPDMVLHFADSANMVLPWDSYLWKT--TSKLCLAI--DKSNIKGGISV 384
Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
LGNFQMQN ++ D N++L F + C
Sbjct: 385 LGNFQMQNTHMLLDTGNEKLSFVRADC 411
>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 124/474 (26%), Positives = 199/474 (41%), Gaps = 72/474 (15%)
Query: 10 LSFIFFFTLLSIFPSSITSLTFSLSRFHTN--------PSQDSYQNLNSLVSSSLTRALH 61
L+ IFF+ I+ S + S+ H + P+ +Q ++V S+ R +
Sbjct: 7 LTLIFFYLCCFIYFSHASKKGLSIEMIHRDFSKSPLYHPTVTKFQRAYNVVHRSINRVNY 66
Query: 62 IKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHY 121
+ +T T + G Y IS S GTPP + +DTGS++VW C
Sbjct: 67 FTKEFSLNKNQPVSTLTPEL-----GEYLISYSVGTPPFKVYGFMDTGSNIVWLQCQ--- 118
Query: 122 QCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICP 181
C C + P F P SSS + + C S C+D ND ++ S N +C
Sbjct: 119 PCNTCFNQTSPIFNPSKSSSYKNIPCT----------SSTCKDTNDTHISCS-NGGDVCE 167
Query: 182 SYLVLYGSGLTEGIALSETLNLPNR-----IIPNFLVGCSVLS----SRQPAGIAGFGRG 232
+ G ++G +++L L + + PN ++GC ++ + Q +G+ G GRG
Sbjct: 168 YSITYGGDAKSQGDLSNDSLTLDSTSGSSVLFPNIVIGCGHINVLQDNSQSSGVVGMGRG 227
Query: 233 KTSLPSQLNL----DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTG--LTYTPFVN 286
SL Q+ KFSYCL+ + D+ +S LI D +G + TP V
Sbjct: 228 PMSLIKQVGSSSVGSKFSYCLIPYN-SDSNSSSKLIFG-----EDVVVSGEIVVSTPMV- 280
Query: 287 NPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPEL 346
+ N YY++ L +VG R+ +Y ++DSGT T M P L
Sbjct: 281 -----KVNGQENYYFLTLEAFSVGNNRI----EYGERSNASTQNILIDSGTPLT-MLPNL 330
Query: 347 FEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLP 406
F +V+Q VK R + L C++ G++ + P++ HF GA+V L
Sbjct: 331 FLSKLVSYVAQEVK---LPRIEPPDHHLSL--CYNTTGKQL-NVPDITAHFN-GADVKLN 383
Query: 407 VENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFK 460
F +G +C ++ I GN N ++YDL + + FK
Sbjct: 384 SNGTFFPFEDG-IMCFGFISSNGLE-----IFGNIAQNNLLIDYDLEKEIISFK 431
>gi|449449334|ref|XP_004142420.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 441
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 108/385 (28%), Positives = 162/385 (42%), Gaps = 57/385 (14%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
+ + GTP Q + LDT + W PC+ C C S+ + F SSS R L CQ
Sbjct: 103 FVVRAKIGTPAQTLLLALDTSNDAAWIPCSG---CIGCPSTTV--FSSDKSSSFRPLPCQ 157
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
+P+C+ + + S C + + YGS + + L L +
Sbjct: 158 SPQCNQVPNPSCSGSACG----------------FNLTYGSSTVAADLVQDNLTLATDSV 201
Query: 209 PNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHK---FDDTTR 259
P++ GC + SS P G+ G GRG SL Q L FSYCL S K F + R
Sbjct: 202 PSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPSFKSVNFSGSLR 261
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
+ + + YTP + NP R++ YYV L I VG + V +
Sbjct: 262 LGPV----------AQPIRIKYTPLLRNP---RRSSL---YYVNLISIRVGRKIVDIPPS 305
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
L + GT++DSGTTFT + + + DEF R R + +L G C
Sbjct: 306 ALAFNSATGAGTVIDSGTTFTRLVAPAYTAVRDEF------RRRVGRNVTVSSLGGFDTC 359
Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
+ VP P + F G VTLP +N+ GS CL + + ++
Sbjct: 360 YTVPIIS----PTITFMF-AGMNVTLPPDNFLIHSTAGSTTCLAMAAAPDNVNSVLNVIA 414
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
+ Q QN+ + +D+ N R+G ++ C
Sbjct: 415 SMQQQNHRILFDIPNSRVGVARESC 439
>gi|297801286|ref|XP_002868527.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314363|gb|EFH44786.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 444
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 112/383 (29%), Positives = 163/383 (42%), Gaps = 35/383 (9%)
Query: 91 ISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNP 150
+SL GTP Q +LDTGS L W C + + K SF P LSSS L C +P
Sbjct: 83 LSLPIGTPSQSQELVLDTGSQLSWIQC-HPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHP 141
Query: 151 KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN-RII 208
C D L TS + ++C Y Y G EG + E N +
Sbjct: 142 LCK---------PRIPDFTLPTSCDSNRLC-HYSYFYADGTFAEGNLVKEKFTFSNSQTT 191
Query: 209 PNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCL--LSHKFDDTTRTSSLILD 266
P ++GC+ S GI G G+ S SQ + KFSYC+ S++ + S + +
Sbjct: 192 PPLILGCAK-ESTDVKGILGMNLGRLSFISQAKISKFSYCIPTRSNRPGLASTGSFYLGE 250
Query: 267 NGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRD 326
N +S K + LT+ P N + Y V L I +G +R+ + D
Sbjct: 251 NPNSRGFKYVSLLTFPQSQRMP-----NLDPLAYTVPLLGIRIGQKRLNIPSSVFRPDAG 305
Query: 327 GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD----- 381
G+G T+VDSG+ FT + ++ + +E V + G+ A CFD
Sbjct: 306 GSGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADM----CFDGNHQM 361
Query: 382 VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNF 441
V G G L F+ G V + VE +V G + + G S I+GN
Sbjct: 362 VIGRLIGD-----LVFEFGRGVEILVEKQRLLVNVGGGIHCVGIGRSSMLGAASNIIGNV 416
Query: 442 QMQNYYVEYDLRNQRLGFKQQLC 464
QN +VE+D+ N+R+GF + C
Sbjct: 417 HQQNLWVEFDVANRRVGFSKAEC 439
>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 665
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 120/412 (29%), Positives = 174/412 (42%), Gaps = 98/412 (23%)
Query: 85 SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRL 144
S G Y+ L GTPPQ I+DTGS + + PC+ CK C + P F P+LSSS +
Sbjct: 76 SNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCST---CKQCGKHQDPKFQPELSSSYKA 132
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETL--- 201
L C NP C +C+DE K C Y Y + LSE L
Sbjct: 133 LKC-NPDC-----------NCDDE----GKLCV-----YERRYAEMSSSSGVLSEDLISF 171
Query: 202 NLPNRIIPNFLV-GCS-----VLSSRQPAGIAGFGRGKTSLPSQLNLDK------FSYCL 249
+++ P V GC L S++ GI G GRGK S+ QL +DK FS C
Sbjct: 172 GNESQLTPQRAVFGCENVETGDLFSQRADGIMGLGRGKLSVVDQL-VDKGVIEDVFSLCY 230
Query: 250 LSHKFDDTTRTSSLILDNGS-------SHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYV 302
+ +++L S SHSD PF S YY +
Sbjct: 231 GGMEVG----GGAMVLGKISPPAGMVFSHSD---------PFR-----------SPYYNI 266
Query: 303 GLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNR 362
L+++ V G+ +++ K +G GT++DSGTT+ + E F + D + ++ +
Sbjct: 267 DLKQMHVAGKSLKLNPKVF----NGKHGTVLDSGTTYAYFPKEAFIAIKDAIIKEIPSLK 322
Query: 363 NYTRALGAEALTGLRP-----CFDVPGEKTGS----FPELKLHFKGGAEVTLPVENY-FA 412
+ G P CF G FPE+ + F G ++ L ENY F
Sbjct: 323 R---------IHGPDPNYDDVCFSGAGRDVAEIHNFFPEIDMEFGNGQKLILSPENYLFR 373
Query: 413 VVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
A CL + DR++ + +LG ++N V YD N +LGF + C
Sbjct: 374 HTKVRGAYCLGIFPDRDS----TTLLGGIVVRNTLVTYDRENDKLGFLKTNC 421
>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
Length = 500
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 122/420 (29%), Positives = 177/420 (42%), Gaps = 61/420 (14%)
Query: 63 KNPQTKTTTTTTTTT-----------TTNISSHSYGGYSISLSFGTPPQIIPFILDTGSH 111
K+ Q + +TTTT + ++ S+ G Y +++ GTP + DTGS
Sbjct: 124 KSIQRRVSTTTTVSRGKPKRNRPSLPASSGSALGTGNYVVTIGLGTPAGRYTVVFDTGSD 183
Query: 112 LVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLA 171
W C Y K+ F P SS+ + C P CS ++
Sbjct: 184 TTWVQCEPCVVVCYKQQEKL--FDPARSSTYANISCAAPACSDLY--------------- 226
Query: 172 TSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN-RIIPNFLVGCSVLSSR---QPAGI 226
K C+ Y V YG G + G +TL L + I F GC + + AG+
Sbjct: 227 -IKGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAIKGFRFGCGERNEGLYGEAAGL 285
Query: 227 AGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVN 286
G GRGKTSLP Q DK+ + +H F + + + LD G + LT V+
Sbjct: 286 LGLGRGKTSLPVQA-YDKYG-GVFAHCFPARS-SGTGYLDFGPGSLPAVSAKLTTPMLVD 342
Query: 287 NPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPEL 346
N +YYVGL I VGG+ + + T GTIVDSGT T + P
Sbjct: 343 NGPT--------FYYVGLTGIRVGGKLLSIPQSVFT-----TSGTIVDSGTVITRLPPAA 389
Query: 347 FEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLP 406
+ L F S M + R Y + A AL+ L C+D G + P + L F+GGA + +
Sbjct: 390 YSSLRSAFASAMAE-RGYKK---APALSLLDTCYDFTGMSEVAIPTVSLLFQGGASLDVH 445
Query: 407 VEN--YFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
Y A V S CL ++E I+GN Q++ + V YD+ + +GF C
Sbjct: 446 ASGIIYAASV---SQACLGFAGNKEDD--DVGIVGNTQLKTFGVVYDIGKKVVGFCPGAC 500
>gi|449527515|ref|XP_004170756.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Cucumis
sativus]
Length = 364
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 108/385 (28%), Positives = 162/385 (42%), Gaps = 57/385 (14%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
+ + GTP Q + LDT + W PC+ C C S+ + F SSS R L CQ
Sbjct: 26 FVVRAKIGTPAQTLLLALDTSNDAAWIPCSG---CIGCPSTTV--FSSDKSSSFRPLPCQ 80
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
+P+C+ + + S C + + YGS + + L L +
Sbjct: 81 SPQCNQVPNPSCSGSACG----------------FNLTYGSSTVAADLVQDNLTLATDSV 124
Query: 209 PNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHK---FDDTTR 259
P++ GC + SS P G+ G GRG SL Q L FSYCL S K F + R
Sbjct: 125 PSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPSFKSVNFSGSLR 184
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
+ + + YTP + NP R++ YYV L I VG + V +
Sbjct: 185 LGPV----------AQPIRIKYTPLLRNP---RRSSL---YYVNLISIRVGRKIVDIPPS 228
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
L + GT++DSGTTFT + + + DEF R R + +L G C
Sbjct: 229 ALAFNSATGAGTVIDSGTTFTRLVAPAYTAVRDEF------RRRVGRNVTVSSLGGFDTC 282
Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
+ VP P + F G VTLP +N+ GS CL + + ++
Sbjct: 283 YTVPIIS----PTITFMF-AGMNVTLPPDNFLIHSTSGSTTCLAMAAAPDNVNSVLNVIA 337
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
+ Q QN+ + +D+ N R+G ++ C
Sbjct: 338 SMQQQNHRILFDIPNSRVGVARESC 362
>gi|383130042|gb|AFG45741.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
Length = 155
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 57/146 (39%), Positives = 86/146 (58%), Gaps = 5/146 (3%)
Query: 279 LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTT 338
L YTPF+ N A + + +YY+ LR +++G +R+ + K + D GNGGTI+DSGTT
Sbjct: 15 LNYTPFLINTK-ASSSGYHTFYYIDLRGVSIGRKRLNLPSKLFSFDSKGNGGTIIDSGTT 73
Query: 339 FTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFK 398
FT E ++ + F SQ+ + RA EA TG+R C++V G P+ HFK
Sbjct: 74 FTIFNEEFYKNITAAFASQI----GFRRASEVEARTGMRLCYNVSGVDHVLLPDFAFHFK 129
Query: 399 GGAEVTLPVENYFAVVGEGSAVCLTV 424
GG+++ LPV NYF+ ++CLT+
Sbjct: 130 GGSDMVLPVANYFSYFVSFDSICLTM 155
>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 526
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 101/370 (27%), Positives = 152/370 (41%), Gaps = 49/370 (13%)
Query: 91 ISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNP 150
+ + G PPQ I D + W C C C F P SSS LL C+
Sbjct: 189 VQIGVGGPPQKFYMIFDLQTDFTWLQCQ---PCIKCYDQPDSIFDPSQSSSYTLLSCETK 245
Query: 151 KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNLPNR-II 208
C+ + + S C+D+ Y + Y G TEG+ ++ET++ + +
Sbjct: 246 HCNLLPNSS-----CSDDGYC----------RYNITYKDGTNTEGVLINETVSFESSGWV 290
Query: 209 PNFLVGCSVLSSRQP----AGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLI 264
+GCS ++ P G G GRG S PS++N SYCL+ K D +S+L
Sbjct: 291 DRVSLGCSN-KNQGPFVGSDGTFGLGRGSLSFPSRINASSMSYCLVESK--DGYSSSTLE 347
Query: 265 LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLD 324
++ +G + NP YYVGL+ I VGG+++ V + T+D
Sbjct: 348 FNS------PPCSGSVKAKLLQNPKAEN------LYYVGLKGIKVGGEKIDVPNSTFTID 395
Query: 325 RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPG 384
GNGG IV S + T + + + + D FV+ K ++ R +A C+++
Sbjct: 396 PYGNGGMIVSSSSLITMLENDTYNVVRDAFVA---KTQHLER---LKAFLQFDTCYNLSS 449
Query: 385 EKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQ 444
T P L+ G LP E+Y V + C S G ILG Q
Sbjct: 450 NNTVELPILEFEVNDGKSWLLPKESYLYAVDKNGTFCFAFA----PSKGSFSILGTLQQY 505
Query: 445 NYYVEYDLRN 454
V +DL N
Sbjct: 506 GTRVTFDLVN 515
>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 417
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 104/380 (27%), Positives = 168/380 (44%), Gaps = 40/380 (10%)
Query: 93 LSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKC 152
++ G Q I+DTGS L W C C+ C + + P F P SSS L C +P C
Sbjct: 68 VTVGIGGQNSTLIVDTGSDLTWVQC---LPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTC 124
Query: 153 SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNF 211
+ + L ++KN T Y + YG G + G E L L I NF
Sbjct: 125 VALQPTA------GSSGLCSNKNSTSC--DYQIDYGDGSYSRGELGFEKLTLGKTEIDNF 176
Query: 212 LVGCSVLSSR---QPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTTRTSSLIL 265
+ GC + +G+ G R + SL SQ L FSYCL + + SL L
Sbjct: 177 IFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPTT---GVGSSGSLTL 233
Query: 266 DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
+ K + ++YT + NP ++ +Y++ L I++GG + V L
Sbjct: 234 GGADFSNFKNISPISYTRMIQNPQMSN------FYFLNLTGISIGGVNLNVPR----LSS 283
Query: 326 DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGE 385
+ +++DSGT T ++P +++ EF Q R + L CF++ G
Sbjct: 284 NEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRT------TPGFSILNTCFNLTGY 337
Query: 386 KTGSFPELKLHFKGGAEVTLPVENYFAVV-GEGSAVCLTVVTDREASGGPSIILGNFQMQ 444
+ + P +K F+G AE+ + VE F V + S +CL + ++I+GN+Q +
Sbjct: 338 EEVNIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYED--QTMIIGNYQQK 395
Query: 445 NYYVEYDLRNQRLGFKQQLC 464
N V Y+ + ++GF + C
Sbjct: 396 NQRVIYNSKESKVGFAGEPC 415
>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
Length = 464
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 109/385 (28%), Positives = 153/385 (39%), Gaps = 55/385 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + G+PP ++D+GS ++W C C+ C + P F P SSS +
Sbjct: 128 GEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCR---PCEQCYAQTDPLFDPAASSSFSGVS 184
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C + C + + C Y V YG G T+G ETL L
Sbjct: 185 CGSAICRTLSGTGCGG-------GGDAGKC-----DYSVTYGDGSYTKGELALETLTLGG 232
Query: 206 RIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDK---FSYCLLSHKFDDTTR 259
+ +GC +S AG+ G G G SL QL FSYCL S
Sbjct: 233 TAVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASRG---AGG 289
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
SL+L +V S +YYVGL I VGG+R+ +
Sbjct: 290 AGSLVLGR-------------------TEAVPRGRRASSFYYVGLTGIGVGGERLPLQDS 330
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
L DG GG ++D+GT T + E + L F M + A++ L C
Sbjct: 331 LFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPR------SPAVSLLDTC 384
Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
+D+ G + P + +F GA +TLP N VG G+ CL +S G S ILG
Sbjct: 385 YDLSGYASVRVPTVSFYFDQGAVLTLPARNLLVEVG-GAVFCLAFA---PSSSGIS-ILG 439
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
N Q + + D N +GF C
Sbjct: 440 NIQQEGIQITVDSANGYVGFGPNTC 464
>gi|383130040|gb|AFG45740.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
Length = 155
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 57/146 (39%), Positives = 86/146 (58%), Gaps = 5/146 (3%)
Query: 279 LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTT 338
L YTPF+ N A + + +YY+ LR +++G +R+ + K + D GNGGTI+DSGTT
Sbjct: 15 LNYTPFLINTK-ASSSGYHTFYYIDLRGVSIGRKRLNLPSKLFSFDTKGNGGTIIDSGTT 73
Query: 339 FTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFK 398
FT E ++ + F SQ+ + RA EA TG+R C++V G P+ HFK
Sbjct: 74 FTIFNEEFYKNITAAFASQI----GFRRASEVEARTGMRLCYNVSGVDHVLLPDFAFHFK 129
Query: 399 GGAEVTLPVENYFAVVGEGSAVCLTV 424
GG+++ LPV NYF+ ++CLT+
Sbjct: 130 GGSDMVLPVANYFSYFVSFDSICLTM 155
>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 458
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 124/388 (31%), Positives = 164/388 (42%), Gaps = 66/388 (17%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y I++S GTP ++DTGS + W C H + SS F P SS+ C
Sbjct: 125 YVITVSIGTPAMTQAVMIDTGSDVSWVHC--HARAGAGSSLF---FDPGKSSTYTPFSCS 179
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNL-PNR 206
+ C+ + C S N T C Y V YG G T G S+TL L
Sbjct: 180 SAACTRLEGRDNGC----------SLNST--C-QYTVRYGDGSNTTGTYGSDTLALNSTE 226
Query: 207 IIPNFLVGCSV-------LSSRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDD 256
+ NF GCS L Q G+ G G G SL SQ FSYCL +
Sbjct: 227 KVENFQFGCSETSDPGEGLDEDQTDGLMGLGGGAPSLVSQTAATYGSAFSYCLPA----- 281
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
TTR+S + S+ T+G FV P R A +Y+V L+ I VGG V +
Sbjct: 282 TTRSSGFLTLGAST----GTSG-----FVTTPMFRSRRA-PTFYFVILQGINVGGDPVAI 331
Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
G+I+DSGT T + P + L+ F + M R Y RA A + L
Sbjct: 332 SPTVFA------AGSIMDSGTIITRLPPRAYSALSAAFRAGM---RRYPRA---RAFSIL 379
Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
CFD G+ S P ++L F GGA V L + GS CL A+GG
Sbjct: 380 DTCFDFTGQDNVSIPAVELVFSGGAVVDLDADGIM----YGS--CLAFA---PATGGIGS 430
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN Q + + V +D+ LGF+ C
Sbjct: 431 IIGNVQQRTFEVLHDVGQSVLGFRPGAC 458
>gi|361067845|gb|AEW08234.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
gi|383130032|gb|AFG45736.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
gi|383130034|gb|AFG45737.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
gi|383130036|gb|AFG45738.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
gi|383130046|gb|AFG45743.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
gi|383130048|gb|AFG45744.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
gi|383130050|gb|AFG45745.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
gi|383130054|gb|AFG45747.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
gi|383130056|gb|AFG45748.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
Length = 155
Score = 115 bits (287), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 57/146 (39%), Positives = 86/146 (58%), Gaps = 5/146 (3%)
Query: 279 LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTT 338
L YTPF+ N A + + +YY+ LR +++G +R+ + K + D GNGGTI+DSGTT
Sbjct: 15 LNYTPFLINTK-ASSSGYHTFYYIDLRGVSIGRKRLNLPSKLFSFDTKGNGGTIIDSGTT 73
Query: 339 FTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFK 398
FT E ++ + F SQ+ + RA EA TG+R C++V G P+ HFK
Sbjct: 74 FTIFNEEFYKNITAAFASQI----GFRRASEVEARTGMRLCYNVSGVDHVLLPDFAFHFK 129
Query: 399 GGAEVTLPVENYFAVVGEGSAVCLTV 424
GG+++ LPV NYF+ ++CLT+
Sbjct: 130 GGSDMVLPVANYFSYFVSFDSICLTM 155
>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 427
Score = 115 bits (287), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 112/395 (28%), Positives = 169/395 (42%), Gaps = 58/395 (14%)
Query: 90 SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIP----SFIPKLSSSSRLL 145
+ISL+ G+PPQ + +LDTGS L W C K+P +F P LSSS
Sbjct: 60 TISLTIGSPPQNVTMVLDTGSELSWLHC-----------KKLPNLNSTFNPLLSSSYTPT 108
Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPN 205
C + C + RD P + N ++C + + EG +ET +L
Sbjct: 109 PCNSSVCM------TRTRDLTI-PASCDPN-NKLCHVIVSYADASSAEGTLAAETFSLAG 160
Query: 206 RIIPNFLVGC--------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDT 257
P L GC + + G+ G RG SL +Q+ L KFSYC+
Sbjct: 161 AAQPGTLFGCMDSAGYTSDINEDAKTTGLMGMNRGSLSLVTQMVLPKFSYCISGED---- 216
Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
L+L +G S + L YTP V + + V Y V L I V + +++
Sbjct: 217 -AFGVLLLGDGPS----APSPLQYTPLVTA-TTSSPYFDRVAYTVQLEGIKVSEKLLQLP 270
Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM------VKNRNYTRALGAE 371
D G G T+VDSGT FTF+ ++ L DEF+ Q +++ N+ GA
Sbjct: 271 KSVFVPDHTGAGQTMVDSGTQFTFLLGPVYNSLKDEFLEQTKGVLTRIEDPNFVFE-GAM 329
Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGS--AVCLTVVTDRE 429
L C+ P + P + L F GAE+ + E V +G C T + +
Sbjct: 330 DL-----CYHAPA-SLAAVPAVTLVFS-GAEMRVSGERLLYRVSKGRDWVYCFT-FGNSD 381
Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
G + ++G+ QN ++E+DL R+GF + C
Sbjct: 382 LLGIEAYVIGHHHQQNVWMEFDLVKSRVGFTETTC 416
>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 113/382 (29%), Positives = 161/382 (42%), Gaps = 48/382 (12%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + G P ++ + DTGS + W C C P F PK SSS L C
Sbjct: 148 YLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCN 207
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN-R 206
+ +C + +CN S C Y V YG G T G +ETL+ N
Sbjct: 208 SQQCKLLDKA-----NCN------SDTCI-----YQVHYGDGSFTTGELATETLSFGNSN 251
Query: 207 IIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSL 263
IPN +GC + AG+ G G G SL SQL FSYCL++ D +SS
Sbjct: 252 SIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNLDSD----SSST 307
Query: 264 ILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTL 323
+ N SD T+ P V N + F Y YV + I+VGG+ + + +
Sbjct: 308 LEFNSYMPSDSLTS-----PLVKN------DRFHSYRYVKVVGISVGGKTLPISPTRFEI 356
Query: 324 DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG-AEALTGLRPCFDV 382
D G GG IVDSGT + + +++E L + FV T +L A ++ C++
Sbjct: 357 DESGLGGIIVDSGTIISRLPSDVYESLREAFV-------KLTSSLSPAPGISVFDTCYNF 409
Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
G+ P + G + LP NY ++ CL + + + I+G+FQ
Sbjct: 410 SGQSNVEVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLS----IIGSFQ 465
Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
Q V YDL N +GF C
Sbjct: 466 QQGIRVSYDLTNSIVGFSTNKC 487
>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 496
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 104/380 (27%), Positives = 168/380 (44%), Gaps = 40/380 (10%)
Query: 93 LSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKC 152
++ G Q I+DTGS L W C C+ C + + P F P SSS L C +P C
Sbjct: 147 VTVGIGGQNSTLIVDTGSDLTWVQC---LPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTC 203
Query: 153 SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNF 211
+ + L ++KN T Y + YG G + G E L L I NF
Sbjct: 204 VALQPTA------GSSGLCSNKNSTSC--DYQIDYGDGSYSRGELGFEKLTLGKTEIDNF 255
Query: 212 LVGCSVLSSR---QPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTTRTSSLIL 265
+ GC + +G+ G R + SL SQ L FSYCL + + SL L
Sbjct: 256 IFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPT---TGVGSSGSLTL 312
Query: 266 DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
+ K + ++YT + NP ++ +Y++ L I++GG + V L
Sbjct: 313 GGADFSNFKNISPISYTRMIQNPQMSN------FYFLNLTGISIGGVNLNVPR----LSS 362
Query: 326 DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGE 385
+ +++DSGT T ++P +++ EF Q R + L CF++ G
Sbjct: 363 NEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRT------TPGFSILNTCFNLTGY 416
Query: 386 KTGSFPELKLHFKGGAEVTLPVENYFAVV-GEGSAVCLTVVTDREASGGPSIILGNFQMQ 444
+ + P +K F+G AE+ + VE F V + S +CL + ++I+GN+Q +
Sbjct: 417 EEVNIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYED--QTMIIGNYQQK 474
Query: 445 NYYVEYDLRNQRLGFKQQLC 464
N V Y+ + ++GF + C
Sbjct: 475 NQRVIYNSKESKVGFAGEPC 494
>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 104/385 (27%), Positives = 160/385 (41%), Gaps = 50/385 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + G+PP+ ++D+GS +VW C QC + S P F P S+S +
Sbjct: 138 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQSD---PVFDPADSASFTGVS 194
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C + C + + C Y V YG G T+G ETL
Sbjct: 195 CSSSVCDRLENAGCHAGRCR----------------YEVSYGDGSYTKGTLALETLTFGR 238
Query: 206 RIIPNFLVGCSVLSSRQ---PAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTR 259
++ + +GC + AG+ G G G S QL FSYCL+S D +
Sbjct: 239 TMVRSVAIGCGHRNRGMFVGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTDSS-- 296
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
SL+ + G + P V NP +YY+GL + VGG RV + +
Sbjct: 297 -GSLVFGR-----EALPAGAAWVPLVRNPRAPS------FYYIGLAGLGVGGIRVPISEE 344
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
L G+GG ++D+GT T + ++ D F++Q N RA G C
Sbjct: 345 VFRLTELGDGGVVMDTGTAVTRLPTLAYQAFRDAFLAQTA---NLPRATGVAIFD---TC 398
Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
+D+ G + P + +F GG +TLP N+ + + C ++ G S ILG
Sbjct: 399 YDLLGFVSVRVPTVSFYFSGGPILTLPARNFLIPMDDAGTFCFAFA---PSTSGLS-ILG 454
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
N Q + + +D N +GF +C
Sbjct: 455 NIQQEGIQISFDGANGYVGFGPNIC 479
>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
Length = 538
Score = 114 bits (286), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 121/392 (30%), Positives = 163/392 (41%), Gaps = 62/392 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + GTP + +LDTGS +VW C C C S P F P LS+S LG
Sbjct: 195 GEYFTRIGVGTPMREQYMVLDTGSDVVWIQCE---PCSKCYSQVDPIFNPSLSASFSTLG 251
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C + CS++ + NC Y V YG G T G +E L
Sbjct: 252 CNSAVCSYLD----------------AYNCHGGGCLYKVSYGDGSYTIGSFATEMLTFGT 295
Query: 206 RIIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNLD---KFSYCLLSHKFD 255
+ N +GC AG+ G G G S PSQL FSYCL+
Sbjct: 296 TSVRNVAIGCG----HDNAGLFVGAAGLLGLGAGLLSFPSQLGTQTGRAFSYCLV----- 346
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV- 314
D SS L+ G + G TP + NPS+ +YYV L I+VGG +
Sbjct: 347 DRFSESSGTLEFGP---ESVPLGSILTPLLTNPSLP------TFYYVPLISISVGGALLD 397
Query: 315 RVWHKYLTLDR-DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL-GAEA 372
V +D G GG IVDSGT T + +++ + D FV+ TR L AE
Sbjct: 398 SVPPDVFRIDETSGRGGFIVDSGTAVTRLQTPVYDAVRDAFVAG-------TRQLPKAEG 450
Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG 432
++ C+D+ G + P + HF GA + LP +NY + C A+
Sbjct: 451 VSIFDTCYDLSGLPLVNVPTVVFHFSNGASLILPAKNYMIPMDFMGTFCFAFA---PATS 507
Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
S I+GN Q Q V +D N +GF + C
Sbjct: 508 DLS-IMGNIQQQGIRVSFDTANSLVGFALRQC 538
>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
Length = 507
Score = 114 bits (286), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 119/406 (29%), Positives = 163/406 (40%), Gaps = 65/406 (16%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y ++ GTP LDT S L W C C+ C P F P+ S+S +
Sbjct: 139 GDYIAKIAVGTPAVEALLALDTASDLTWLQCQ---PCRRCYPQSGPVFDPRHSTSYGEMN 195
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-------LTEGIALSE 199
P C + +K T I Y VLYG G + G + E
Sbjct: 196 YDAPDCQALGRSG----------GGDAKRGTCI---YTVLYGDGDGHGSTSTSVGDLVEE 242
Query: 200 TLNLPNRIIPNFL-VGCS----VLSSRQPAGIAGFGRGKTSLPSQLNL----DKFSYCLL 250
TL + +L +GC L AGI G RG+ S+P Q+ FSYCL+
Sbjct: 243 TLTFAGGVRQAYLSIGCGHDNKGLFGAPAAGILGLSRGQISIPHQIAFLGYNASFSYCLV 302
Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
+ +S+L G+ T P P+V +N +YYV L ++VG
Sbjct: 303 DFISGPGSPSSTLTFGAGAVD--------TSPPASFTPTVLNQN-MPTFYYVRLIGVSVG 353
Query: 311 GQRV-RVWHKYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
G RV V + L LD G+GG I+DSGTT T +A P + + R L
Sbjct: 354 GVRVPGVTERDLQLDPYTGHGGVILDSGTTVTRLA----RP---AYTAFRDAFRAAATGL 406
Query: 369 GAEALTGLRPCFDV---PGEKTG-----SFPELKLHFKGGAEVTLPVENYFAVVGEGSAV 420
G + G FD G + G P + +HF GG E++L +NY V V
Sbjct: 407 GQVSTGGPSGLFDTCYTVGGRAGLRHCVKVPAVSMHFAGGVELSLQPKNYLITVDSRGTV 466
Query: 421 CLTVV--TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
C DR S ++GN Q + V YD+ QR+GF C
Sbjct: 467 CFAFAGTGDRSVS-----VIGNILQQGFRVVYDIGGQRVGFAPNSC 507
>gi|383130044|gb|AFG45742.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
Length = 155
Score = 114 bits (286), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 56/146 (38%), Positives = 86/146 (58%), Gaps = 5/146 (3%)
Query: 279 LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTT 338
L YTPF+ N A + ++ +YY+ LR +++G +R+ + K + D GNGGTI+DSGTT
Sbjct: 15 LNYTPFLINTK-ASSSGYNTFYYIDLRGVSIGRKRLNLPSKLFSFDNKGNGGTIIDSGTT 73
Query: 339 FTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFK 398
FT E ++ + F SQ+ + RA EA TG+R C++ G P+ HFK
Sbjct: 74 FTIFNEEFYKNITAAFASQI----GFRRASEVEARTGMRLCYNASGVDHVLLPDFAFHFK 129
Query: 399 GGAEVTLPVENYFAVVGEGSAVCLTV 424
GG+++ LPV NYF+ ++CLT+
Sbjct: 130 GGSDMVLPVANYFSYFVSFDSICLTM 155
>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 433
Score = 114 bits (285), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 133/482 (27%), Positives = 205/482 (42%), Gaps = 87/482 (18%)
Query: 10 LSFIFFFTLLSIFPSSITSLTFSLSRFHTN--------PSQDSYQNLNSLVSSSLTRALH 61
L+ + F+ L +IF + FS+ H + P++ +Q + + V S+ RA H
Sbjct: 9 LALVLFY-LCNIFYLEAFNGGFSVEMIHRDSSRSPFFSPTETQFQRVANAVHRSINRANH 67
Query: 62 IKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHY 121
+ ++ + + TT IS+ G Y IS S GTP + ILDTGS ++W C
Sbjct: 68 LN----QSFVSPNSPETTVISA--LGEYLISYSVGTPSLQVFGILDTGSDIIWLQCQ--- 118
Query: 122 QCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICP 181
CK C P F S + + L C + C +S+Q C ++ K+C
Sbjct: 119 PCKKCYEQTTPIFDSSKSQTYKTLPCPSNTC-----QSVQGTFC-----SSRKHCL---- 164
Query: 182 SYLVLYGSG-------LTEGIALSETLNLPNRIIPNFLVGC----SVLSSRQPAGIAGFG 230
Y + Y G E + L T P + P ++GC ++ + +GI G G
Sbjct: 165 -YSIHYVDGSQSLGDLSVETLTLGSTNGSPVQ-FPGTVIGCGRYNAIGIEEKNSGIVGLG 222
Query: 231 RGKTSLPSQLNLD---KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNN 287
RG SL +QL+ KFSYCL+ +T +S L N + S + G TP +
Sbjct: 223 RGPMSLITQLSPSTGGKFSYCLVPGL---STASSKLNFGNAAVVSGR---GTVSTPLFS- 275
Query: 288 PSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELF 347
+N V+Y++ L +VG R+ ++ + G G I+DSGTT T + ++
Sbjct: 276 -----KNGL-VFYFLTLEAFSVGRNRI----EFGSPGSGGKGNIIIDSGTTLTALPNGVY 325
Query: 348 EPL----ADEFVSQMVKNRNYTRALGAEALTGLRPCFDV-PGEKTGSFPELKLHFKGGAE 402
L A + Q V++ N L C+ V P + S P + HF GA+
Sbjct: 326 SKLEAAVAKTVILQRVRDPNQVLGL----------CYKVTPDKLDASVPVITAHFS-GAD 374
Query: 403 VTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQ 462
VTL N F V + VC + + GN QN V YDL+ + FK
Sbjct: 375 VTLNAINTFVQVAD-DVVCFAFQPTETGA-----VFGNLAQQNLLVGYDLQMNTVSFKHT 428
Query: 463 LC 464
C
Sbjct: 429 DC 430
>gi|383130052|gb|AFG45746.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
Length = 155
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 57/146 (39%), Positives = 86/146 (58%), Gaps = 5/146 (3%)
Query: 279 LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTT 338
L YTPF+ N A + + +YY+ LR +++G +R+ + K + D GNGGTI+DSGTT
Sbjct: 15 LNYTPFLINTK-ASSSGYHTFYYIDLRGVSIGRKRLNLPSKLFSFDTKGNGGTIIDSGTT 73
Query: 339 FTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFK 398
FT E ++ + F SQ+ + RA EA TG+R C++V G P+ HFK
Sbjct: 74 FTIFNEEFYKNITAAFSSQI----GFRRASEVEARTGMRLCYNVSGVDHVLLPDFAFHFK 129
Query: 399 GGAEVTLPVENYFAVVGEGSAVCLTV 424
GG+++ LPV NYF+ ++CLT+
Sbjct: 130 GGSDMVLPVANYFSYFVSFDSICLTM 155
>gi|449445943|ref|XP_004140731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 430
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 112/389 (28%), Positives = 166/389 (42%), Gaps = 45/389 (11%)
Query: 91 ISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNP 150
+SL GTPPQ +LDTGS L W C + K ++P +PK ++S +
Sbjct: 68 VSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKK-----RLPP-LPKPKTASFDPSLSSS 121
Query: 151 KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRI-I 208
+ I D L TS + ++C Y Y G L EG + E +
Sbjct: 122 FSLLPCNHPICKPRIPDFTLPTSCDQNRLC-HYSYFYADGTLAEGNLVREKFTFSKSLST 180
Query: 209 PNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNG 268
P ++GC+ S+ GI G G+ S SQ + KFSYC+ S + T L DN
Sbjct: 181 PPVILGCAQASTEN-RGILGMNHGRLSFISQAKISKFSYCVPSRTGSNPTGLFYL-GDNP 238
Query: 269 SSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGN 328
+S K T LT+ ++P N + Y + ++ I + G+R+ + D G+
Sbjct: 239 NSSKFKYVTMLTFPESQSSP-----NLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGS 293
Query: 329 GGTIVDSGTTFTFMAPELFEPLADE---FVSQMVKNRNYTRALGAEALTGLRPCFD--VP 383
G T++DSG+ T++ E +E + +E V M+K + Y A A+ CFD V
Sbjct: 294 GQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMK-KGYVYADVADM------CFDAGVT 346
Query: 384 GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTV--------VTDREASGGPS 435
E + F G E+ VG G V V + E G S
Sbjct: 347 AEVGRRIGGISFEFDNGVEI---------FVGRGEGVLTEVEKGVKCVGIGRSERLGIGS 397
Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+G QN +VEYDL N+R+GF C
Sbjct: 398 NIIGTVHQQNMWVEYDLANKRVGFGGAEC 426
>gi|79315693|ref|NP_001030891.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332646353|gb|AEE79874.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 499
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 113/393 (28%), Positives = 158/393 (40%), Gaps = 79/393 (20%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + G+PP+ ILDTGS L W C
Sbjct: 168 GEYFMDVLVGSPPKHFSLILDTGSDLNWIQC----------------------------- 198
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSET--LNLP 204
+ C DC +N Q CP Y S T G ET +NL
Sbjct: 199 -------------LPCYDC------FQQNDNQSCPYYYWYGDSSNTTGDFAVETFTVNLT 239
Query: 205 NR-------IIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLS 251
+ N + GC + AG+ G GRG S SQL FSYCL+
Sbjct: 240 TNGGSSELYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVD 299
Query: 252 HKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG 311
DT +S LI G L +T FV + N +YYV ++ I V G
Sbjct: 300 RN-SDTNVSSKLIF--GEDKDLLSHPNLNFTSFV----AGKENLVDTFYYVQIKSILVAG 352
Query: 312 QRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
+ + + + + DG GGTI+DSGTT ++ A EP A EF+ + + +
Sbjct: 353 EVLNIPEETWNISSDGAGGTIIDSGTTLSYFA----EP-AYEFIKNKIAEKAKGKYPVYR 407
Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREAS 431
L PCF+V G PEL + F GA P EN F + E VCL ++ +++
Sbjct: 408 DFPILDPCFNVSGIHNVQLPELGIAFADGAVWNFPTENSFIWLNE-DLVCLAMLGTPKSA 466
Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN+Q QN+++ YD + RLG+ C
Sbjct: 467 FS---IIGNYQQQNFHILYDTKRSRLGYAPTKC 496
>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
Length = 487
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 109/402 (27%), Positives = 170/402 (42%), Gaps = 63/402 (15%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVW---FPCTNHYQCKYCSSSKIPSFIPKLSSSSRLL 145
Y +++ GTPP+ + DTGS L W PC + C + P F P SS+ +
Sbjct: 122 YVVTIGIGTPPRNFTVLFDTGSDLTWVQCLPCPD----SSCYPQQEPLFDPSKSSTYVDV 177
Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSE---TLN 202
C P+C H +Q C + +C Y V YG +L+E TL+
Sbjct: 178 PCSAPEC---HIGGVQQTRCG------ATSC-----EYSVKYGDESETHGSLAEETFTLS 223
Query: 203 LPNRIIP---NFLVGCSVL-------SSRQPAGIAGFGRGKTSLPSQLNLD------KFS 246
P+ + P + GCS + AG+ G GRG +S+ SQ FS
Sbjct: 224 PPSPLAPAATGVVFGCSHEYISVFNDTGMGVAGLLGLGRGDSSILSQTRRSINSGGGVFS 283
Query: 247 YCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
YCL + T L + G++ ++ + L++TP + S R+A Y V L
Sbjct: 284 YCLPPRG----SSTGYLTIGGGAAAPQQQYSNLSFTPLITTIS-QLRSA----YVVNLAG 334
Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
++V G V + +L G ++DSGT T M + PL DEF M +
Sbjct: 335 VSVNGAAVDIPASAFSL------GAVIDSGTVVTHMPAAAYYPLRDEFRLHM----GSYK 384
Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVV----GEGSAVCL 422
L ++ L C+DV G+ + P + L F GGA + + V+ G G ++ L
Sbjct: 385 MLPEGSMKLLDTCYDVTGQDVVTAPRVALEFGGGARIDVDASGILLVLPAEDGSGQSLTL 444
Query: 423 TVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ + +I+GN Q + Y V +D+ R+GF C
Sbjct: 445 ACLAFLPTNSAGLVIVGNMQQRAYNVVFDVDGGRIGFGPNGC 486
>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 435
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 114/398 (28%), Positives = 173/398 (43%), Gaps = 72/398 (18%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + G+PP ++DTGS L+W C+ C C + P F P SS+ +
Sbjct: 87 GEYLMRFYIGSPPVERLAMVDTGSSLIWLQCS---PCHNCFPQETPLFEPLKSSTYKYAT 143
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPN 205
C + C+ + RDC C Y ++YG + GI +ETL+ +
Sbjct: 144 CDSQPCTLLQPSQ---RDC-----GKLGQCI-----YGIMYGDKSFSVGILGTETLSFGS 190
Query: 206 R------IIPNFLVGCSV------LSSRQPAGIAGFGRGKTSLPSQLNLD---KFSYCLL 250
PN + GC V +S + GIAG G G SL SQL KFSYCLL
Sbjct: 191 TGGAQTVSFPNTIFGCGVDNNFTIYTSNKVMGIAGLGAGPLSLVSQLGAQIGHKFSYCLL 250
Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
+ D+T TS L S + T G+ TP + PS+ YY++ L +T+G
Sbjct: 251 PY---DSTSTSKLKF---GSEAIITTNGVVSTPLIIKPSLP------TYYFLNLEAVTIG 298
Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
+ V T DGN ++DSGT T++ + + FV+ + LG
Sbjct: 299 QKVVS------TGQTDGN--IVIDSGTPLTYLENTFY----NNFVASL------QETLGV 340
Query: 371 EAL----TGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT 426
+ L + L+ CF P + P++ F GA V L +N + + + +CL VV
Sbjct: 341 KLLQDLPSPLKTCF--PNRANLAIPDIAFQFT-GASVALRPKNVLIPLTDSNILCLAVV- 396
Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+SG + G+ ++ VEYDL +++ F C
Sbjct: 397 --PSSGIGISLFGSIAQYDFQVEYDLEGKKVSFAPTDC 432
>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
Length = 379
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 92/337 (27%), Positives = 149/337 (44%), Gaps = 34/337 (10%)
Query: 31 FSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYS 90
L+ S Q L+ ++ S R +++ T + + S G Y
Sbjct: 31 LKLTHVDAGTSYTKLQLLSRAIARSKARVAALQSAAVLPPVVDPITAARVLVTASSGEYL 90
Query: 91 ISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNP 150
+ L+ GTPP I+DTGS L+W C C C+ P F K S++ R L C++
Sbjct: 91 VDLAIGTPPLYYTAIMDTGSDLIWTQCA---PCLLCADQPTPYFDVKKSATYRALPCRSS 147
Query: 151 KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETL-----NLP 204
+C+ + +S +C + Y YG + T G+ +ET N
Sbjct: 148 RCASL----------------SSPSCFKKMCVYQYYYGDTASTAGVLANETFTFGAANST 191
Query: 205 NRIIPNFLVGCSVLSSRQPA---GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
N GC L++ A G+ GFGRG SL SQL +FSYCL S+ +R
Sbjct: 192 KVRATNIAFGCGSLNAGDLANSSGMVGFGRGPLSLVSQLGPSRFSYCLTSYLSATPSRLY 251
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
+ N SS + + + TPFV NP++ Y++ L+ I++G + + +
Sbjct: 252 FGVYANLSSTNTSSGSPVQSTPFVINPALPN------MYFLSLKAISLGTKLLPIDPLVF 305
Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM 358
++ DG GG I+DSGT+ T++ + +E + VS +
Sbjct: 306 AINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVSAI 342
>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 135/476 (28%), Positives = 199/476 (41%), Gaps = 89/476 (18%)
Query: 10 LSFIFFFTLLSIFP-SSITSLTFSLSRFHTN--------PSQDSYQNLNSLVSSSLTRAL 60
L +F+F+L I S + FS+ H + P+Q+ YQ++ + S+ RA
Sbjct: 6 LLILFYFSLCFIISLSHALNNGFSVELIHRDSSKSPLYQPTQNKYQHIVNAARRSINRAN 65
Query: 61 HIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNH 120
H T T T +T I H G Y ++ S GTPP + I DTGS +VW C
Sbjct: 66 HFYK-----TALTNTPQSTVIPDH--GEYLMTYSVGTPPFKLYGIADTGSDIVWLQCE-- 116
Query: 121 YQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQIC 180
CK C + P F P SS+ + + C + C K+ Q
Sbjct: 117 -PCKECYNQTTPKFKPSKSSTYKNIPCSSDLC---------------------KSGQQ-- 152
Query: 181 PSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGC----SVLSSRQPAGIAGFGRGKTSL 236
G+ + + L + P P ++GC +V +GI G G G SL
Sbjct: 153 -------GNLSVDTLTLESSTGHPISF-PKTVIGCGTDNTVSFEGASSGIVGLGGGPASL 204
Query: 237 PSQL--NLD-KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAER 293
+QL ++D KFSYCLL + + T + D D G+ TP V +
Sbjct: 205 ITQLGSSIDAKFSYCLLPNPVESNTTSKLNFGDTAVVSGD----GVVSTPIVKKDPI--- 257
Query: 294 NAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGG----TIVDSGTTFTFMAPELFEP 349
V+YY+ L +VG +R+ + NGG I+DSGTT T + +++
Sbjct: 258 ----VFYYLTLEAFSVGNKRIE-------FEGSSNGGHEGNIIIDSGTTLTVIPTDVYNN 306
Query: 350 LADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVEN 409
L + V ++VK + R L L C+ V + FP + HFKG A+V L +
Sbjct: 307 L-ESAVLELVKLK---RVNDPTRLFNL--CYSVTSDGY-DFPIITTHFKG-ADVKLHPIS 358
Query: 410 YFAVVGEGSAVCLTVVTDREASGGPSI-ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
F V +G VCL T + I GN QN V YDL+ + + FK C
Sbjct: 359 TFVDVADG-IVCLAFATTSAFIPSDVVSIFGNLAQQNLLVGYDLQQKIVSFKPTDC 413
>gi|226500708|ref|NP_001149229.1| aspartic proteinase nepenthesin-2 [Zea mays]
gi|195625632|gb|ACG34646.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 111/398 (27%), Positives = 175/398 (43%), Gaps = 62/398 (15%)
Query: 89 YSISLSFGT--PPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
Y I+ G P I ++DTGS + W T K CS SK S +P
Sbjct: 110 YIITFYLGNQRPEDNISAVVDTGSDIFW---TTE---KECSRSKTRSMLP---------- 153
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL---TEGIALSETLNL 203
C +PKC + C C L C +Y ++YG T G+ + L +
Sbjct: 154 CCSPKC----EQRASC-GCGRSELKAEAEKETKC-TYAIIYGGNANDSTAGVMYEDKLTI 207
Query: 204 ---PNRIIPN------FLVGCSV---LSSRQPA--GIAGFGRGKTSLPSQLNLDKFSYCL 249
++ +P+ +GCS L + P+ G+ G GR TSLP QLN KFSYCL
Sbjct: 208 VAVASKAVPSSQSFKEVAIGCSTSATLKFKDPSIKGVFGLGRSATSLPRQLNFSKFSYCL 267
Query: 250 LSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
S++ D S L+L ++ D T + V ++ + + Y+V L+ I++
Sbjct: 268 SSYQEPDL--PSYLLL---TAAPDMATGAVGGGAAVATTALQPNSDYKTLYFVHLQNISI 322
Query: 310 GGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
GG R + + G VD+G +FT + +F L E + +++K R Y +
Sbjct: 323 GGTR------FPAVSTKSGGNMFVDTGASFTRLEGTVFAKLVTE-LDRIMKERKYVKEQP 375
Query: 370 AEALTGLRPCFDVP---GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT 426
+ C+ P +++ P++ LHF A + LP ++Y S +CL +
Sbjct: 376 GR--NNGQICYSPPSTAADESSKLPDMVLHFADSANMVLPWDSYLWKT--TSKLCLAIYK 431
Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
GG S +LGNFQMQN ++ D N++L F + C
Sbjct: 432 S-NIKGGIS-VLGNFQMQNTHMLLDTGNEKLSFVRADC 467
>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 473
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 109/387 (28%), Positives = 164/387 (42%), Gaps = 51/387 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y +S+ GTP + + I DTGS L W C +YC + K P F+P S++ +
Sbjct: 129 GNYIVSVGLGTPKKYLSLIFDTGSDLTWTQC--QPCARYCYNQKDPVFVPSQSTTYSNIS 186
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPN 205
C +P CS + + N + ++ C Y + YG + G ETL L +
Sbjct: 187 CSSPDCSQLESGT-----GNQPGCSAARACI-----YGIQYGDQSFSVGYFAKETLTLTS 236
Query: 206 -RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTT 258
+I NFL GC + AG+ G G+ K S+ Q FSYCL T
Sbjct: 237 TDVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQVFSYCL------PKT 290
Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
+S+ L + L YTP VA +Y V + + VGG ++ +
Sbjct: 291 SSSTGYL---TFGGGGGGGALKYTPITKAHGVAN------FYGVDIVGMKVGGTQIPISS 341
Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
+ G I+DSGT T + P+ + L F M K Y + A L+ L
Sbjct: 342 SVFS-----TSGAIIDSGTVITRLPPDAYSALKSAFEKGMAK---YPK---APELSILDT 390
Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEG-SAVCLTVVTDREASGGPSII 437
C+D+ T P++ FKGG E+ L + + G S VCL +++ S I
Sbjct: 391 CYDLSKYSTIQIPKVGFVFKGGEELDL--DGIGIMYGASTSQVCLAFAGNQDPS--TVAI 446
Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+GN Q + V YD+ ++GF C
Sbjct: 447 IGNVQQKTLQVVYDVGGGKIGFGYNGC 473
>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 336
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 108/381 (28%), Positives = 154/381 (40%), Gaps = 54/381 (14%)
Query: 93 LSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKC 152
+ G P Q F+LDTGS + W C C P F P+LSSS + C + +C
Sbjct: 1 MRVGQPQQPSFFVLDTGSDVTWLQCLPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQC 60
Query: 153 SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLN-LPNRIIPN 210
+ C Y V YG G T G +ETL + + IPN
Sbjct: 61 QLLDEAGCNVNSC----------------IYKVEYGDGSFTIGELATETLTFVHSNSIPN 104
Query: 211 FLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSL 263
+GC G+ G G G S+ SQL FSYCL+ D S
Sbjct: 105 ISIGCG----HDNEGLFVGADGLIGLGGGAISISSQLKASSFSYCLV-----DIDSPSFS 155
Query: 264 ILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTL 323
LD ++D + L +P V N + F + YV + ++VGG+ + + +
Sbjct: 156 TLD---FNTDPPSDSLI-SPLVKN------DRFPSFRYVKVIGMSVGGKPLPISSSRFEI 205
Query: 324 DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP 383
D G GG IVDSGTT T + +++E L + F+ T A ++ C+D+
Sbjct: 206 DESGLGGIIVDSGTTITQLPSDVYEVLREAFLGLT------TNLPPAPEISPFDTCYDLS 259
Query: 384 GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQM 443
+ P + G + LP +N V CL V ++ P I+GNFQ
Sbjct: 260 SQSNVEVPTIAFILPGENSLQLPAKNCLIQVDSAGTFCLAFV----SATFPLSIIGNFQQ 315
Query: 444 QNYYVEYDLRNQRLGFKQQLC 464
Q V YDL N +GF C
Sbjct: 316 QGIRVSYDLTNSLVGFSTNKC 336
>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 133/487 (27%), Positives = 205/487 (42%), Gaps = 83/487 (17%)
Query: 5 ISALCLSFIFFFTLLSIFP-SSITSLTFSLSRFHTN--------PSQDSYQNLNSLVSSS 55
++ LC + F+L I S S FS+ H + P+++ YQ+ S
Sbjct: 1 MNTLCFLTLSLFSLCFIASFSHALSNGFSVELIHRDSPKSPYYKPTENKYQHFVDAARRS 60
Query: 56 LTRALHI-KNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVW 114
+ RA H K+ T T +T GGY ++ S GTPP I I DTGS +VW
Sbjct: 61 INRANHFFKDSDTSTPESTVIP--------DRGGYLMTYSVGTPPTKIYGIADTGSDIVW 112
Query: 115 FPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSK 174
C C+ C + P F P SSS + + C + C S++ C+D+
Sbjct: 113 LQCE---PCEQCYNQTTPIFNPSKSSSYKNIPCLSKLC-----HSVRDTSCSDQ------ 158
Query: 175 NCTQICPSYLVLYG-SGLTEGIALSETLNLPNR-----IIPNFLVGCSVLSS----RQPA 224
N Q Y + YG S ++G +TL+L + P ++GC ++ +
Sbjct: 159 NSCQ----YKISYGDSSHSQGDLSVDTLSLESTSGSPVSFPKTVIGCGTDNAGTFGGASS 214
Query: 225 GIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTY 281
GI G G G SL +QL KFSYCL+ ++ +S L + + S G+
Sbjct: 215 GIVGLGGGPVSLITQLGSSIGGKFSYCLVPLLNKESNASSILSFGDAAVVSGD---GVVS 271
Query: 282 TPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTF 341
TP + V+Y++ L+ +VG +RV D +GN I+DSGTT T
Sbjct: 272 TPLIKKD--------PVFYFLTLQAFSVGNKRVEFGGSSEGGDDEGN--IIIDSGTTLTL 321
Query: 342 MAPELFEPLADEFVS----QMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHF 397
+ +++ L V V + N +L C+ + + FP + HF
Sbjct: 322 IPSDVYTNLESAVVDLVKLDRVDDPNQQFSL----------CYSLKSNEY-DFPIITAHF 370
Query: 398 KGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRL 457
K GA++ L + F + +G VC + S I GN QN V YDL+ + +
Sbjct: 371 K-GADIELHSISTFVPITDG-IVCFAF----QPSPQLGSIFGNLAQQNLLVGYDLQQKTV 424
Query: 458 GFKQQLC 464
FK C
Sbjct: 425 SFKPTDC 431
>gi|224072901|ref|XP_002303933.1| predicted protein [Populus trichocarpa]
gi|222841365|gb|EEE78912.1| predicted protein [Populus trichocarpa]
Length = 370
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 111/382 (29%), Positives = 154/382 (40%), Gaps = 53/382 (13%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + GTPPQ + LD W PC C CSS+ F S++ + LGC
Sbjct: 35 YIVKAKVGTPPQTLLMALDNSYDAAWIPCKG---CVGCSST---VFNTVKSTTFKTLGCG 88
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
P+C + + P+ CT + YGS +T+ L +
Sbjct: 89 APQCKQVPN-----------PICGGSTCT-----WNTTYGSSTILSNLTRDTIALSMDPV 132
Query: 209 PNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQL-NLDK--FSYCLLSHKFDDTTRTSS 262
P + GC + SS P G+ GFGRG S SQ NL K FSYCL S F + S
Sbjct: 133 PYYAFGCIQKATGSSVPPQGLLGFGRGPLSFLSQTQNLYKSTFSYCLPS--FRTLNFSGS 190
Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
L L KTT P + NP R++ YYV L I VG + V + L
Sbjct: 191 LRLGPVGQPPRIKTT-----PLLKNP---RRSSL---YYVKLNGIRVGRKIVDIPRSALA 239
Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
+ GTI DSGT FT + + + +EF R +L G C+ V
Sbjct: 240 FNPTTGAGTIFDSGTVFTRLVAPAYIAVRNEF-------RKRVGNATVSSLGGFDTCYSV 292
Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
P P + F G VT+P EN G CL + + ++ + Q
Sbjct: 293 PIVP----PTITFMFSG-MNVTMPPENLLIHSTAGVTSCLAMAAAPDNVNSVLNVIASMQ 347
Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
QN+ + +D+ N RLG ++ C
Sbjct: 348 QQNHRILFDVPNSRLGVAREQC 369
>gi|195627138|gb|ACG35399.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 431
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 118/447 (26%), Positives = 174/447 (38%), Gaps = 53/447 (11%)
Query: 33 LSRFHT--NPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYS 90
LS +H PS +++ +L + R L + +K ++ T+ S + Y
Sbjct: 24 LSVYHNVHPPSPSPLESIIALARADDARLLFLS---SKAASSGGVTSAPVASGQTPPSYV 80
Query: 91 ISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNP 150
+ GTP Q + LDT + W C C C + FIP SSS L C +
Sbjct: 81 VRAGLGTPVQQLLLALDTSADATWSHCA---PCDTCPAGS--RFIPASSSSYASLPCASD 135
Query: 151 KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPN 210
C + PL C P + + L S+TL L I
Sbjct: 136 WCPLFEGQPCPANQDASAPL---PACAFSKPFADTSFQASLG-----SDTLRLGKDAIAG 187
Query: 211 FLVGCSVLSSRQPA------GIAGFGRGKTSLPSQLNL---DKFSYCLLSHK---FDDTT 258
+ GC V + P G+ G GRG SL SQ FSYCL S++ F +
Sbjct: 188 YAFGC-VGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSTYNGVFSYCLPSYRSYYFSGSL 246
Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
R + + + YTP + NP R + YYV + ++VG V+V
Sbjct: 247 RLGAA----------GQPRNVRYTPLLTNP---HRPSL---YYVNVTGLSVGRTWVKVPA 290
Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
D GT++DSGT T ++ L +EF Q+ YT +LGA
Sbjct: 291 GSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAAPSGYT-SLGA-----FDT 344
Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
CF+ G P + LH GG ++TLP+EN CL + + ++
Sbjct: 345 CFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVV 404
Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLCK 465
N Q QN V D+ R+GF ++ C
Sbjct: 405 ANLQQQNVRVVVDVAGSRVGFAREPCN 431
>gi|15450651|gb|AAK96597.1| AT3g52500/F22O6_120 [Arabidopsis thaliana]
Length = 110
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 55/106 (51%), Positives = 68/106 (64%), Gaps = 4/106 (3%)
Query: 363 NYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCL 422
NYTR E TGL PCF++ G+ + PEL FKGGA++ LP+ NYF VG VCL
Sbjct: 3 NYTREKDLEKETGLGPCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCL 62
Query: 423 TVVTDR----EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
TVV+D+ GP+IILG+FQ QNY VEYDL N R GF ++ C
Sbjct: 63 TVVSDKTVNPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKC 108
>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
Length = 524
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 111/391 (28%), Positives = 156/391 (39%), Gaps = 49/391 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + +S G+PP ++D+GS ++W C C C P F P S++ +
Sbjct: 169 GEYLVRVSVGSPPTEQYLVVDSGSDVMWVQCK---PCLECYVQADPLFDPATSATFSGVS 225
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C + C + + C D L + Y V Y G T+G ETL L
Sbjct: 226 CGSAICRILPTSA-----CGDGELGGCE--------YEVSYADGSYTKGALALETLTLGG 272
Query: 206 RIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSH------K 253
+ ++GC + AG+ G G G SL QL + FSYCL S
Sbjct: 273 TAVEGVVIGCGHRNRGLFVGAAGLMGLGWGPMSLVGQLGGEVGGAFSYCLASRGGYGSGA 332
Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
DD L+L S+ G + P V NP +YYVGL I VG +R
Sbjct: 333 ADDDA--GWLVL----GRSEAVPEGAVWVPLVRNPRAPS------FYYVGLSGIEVGDER 380
Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
+ + L DG G ++D+GTT T + E + L D FV + RA G +
Sbjct: 381 LPLQAGLFQLTEDGAGDVVMDTGTTVTRLPQEAYAALRDAFVGALAG--AVPRAQGVSSS 438
Query: 374 TGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
L C+D+ G + P + F G A + L N V G CL +S G
Sbjct: 439 V-LDTCYDLSGYASVRVPTVSFCFDGDARLILAARNVLLEVDMG-IYCLAFA---PSSSG 493
Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
S I+GN Q + D N +GF C
Sbjct: 494 LS-IMGNTQQAGIQITVDSANGYIGFGPANC 523
>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
Length = 474
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 119/383 (31%), Positives = 157/383 (40%), Gaps = 55/383 (14%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y +++S GTP +DTGS L W CT C S K P F P SSS + C
Sbjct: 140 YVVTVSLGTPGVAQTLEVDTGSDLSWVQCT-PCAAPACYSQKDPLFDPAQSSSYAAVPCG 198
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNL-PNR 206
P C + + C ++ C Y+V YG G T G+ S+TL L PN
Sbjct: 199 GPVCGGLGIYASSC---------SAAQC-----GYVVSYGDGSKTTGVYSSDTLTLSPND 244
Query: 207 IIPNFLVGCSVLSSRQPA--GIAGFGRGKTSLPSQLN---LDKFSYCLLSHKFDDTTRTS 261
+ F GC S G+ G GR + SL Q FSYCL TR S
Sbjct: 245 AVRGFFFGCGHAQSGFTGNDGLLGLGREEASLVEQTAGTYGGVFSYCL-------PTRPS 297
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
+ S G + T +++P+ A YY V L I+VGGQ++ V
Sbjct: 298 TTGYLTLGGPSGAAPPGFSTTQLLSSPNAA------TYYVVMLTGISVGGQQLSVPSSVF 351
Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
GGT+VD+GT T + P + L F S M + Y A A L C++
Sbjct: 352 A------GGTVVDTGTVITRLPPTAYAALRSAFRSGMA-SYGYPS---APATGILDTCYN 401
Query: 382 VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNF 441
G T + P + L F GGA VTL + S CL S G ILGN
Sbjct: 402 FSGYGTVTLPNVALTFSGGATVTLGADGIL------SFGCLAFAP--SGSDGGMAILGNV 453
Query: 442 QMQNYYVEYDLRNQRLGFKQQLC 464
Q +++ V D +GFK C
Sbjct: 454 QQRSFEVRID--GTSVGFKPSSC 474
>gi|194698750|gb|ACF83459.1| unknown [Zea mays]
gi|194703964|gb|ACF86066.1| unknown [Zea mays]
gi|219886221|gb|ACL53485.1| unknown [Zea mays]
gi|219886359|gb|ACL53554.1| unknown [Zea mays]
gi|223950085|gb|ACN29126.1| unknown [Zea mays]
gi|414865218|tpg|DAA43775.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 431
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 118/447 (26%), Positives = 174/447 (38%), Gaps = 53/447 (11%)
Query: 33 LSRFHT--NPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYS 90
LS +H PS +++ +L + R L + +K ++ T+ S + Y
Sbjct: 24 LSVYHNVHPPSPSPLESIIALARADDARLLFLS---SKAASSGGVTSAPVASGQTPPSYV 80
Query: 91 ISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNP 150
+ GTP Q + LDT + W C C C + FIP SSS L C +
Sbjct: 81 VRAGLGTPVQQLLLALDTSADATWSHCA---PCDTCPAGS--RFIPASSSSYASLPCASD 135
Query: 151 KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPN 210
C + PL C P + + L S+TL L I
Sbjct: 136 WCPLFEGQPCPANQDASAPL---PACAFSKPFADTSFQASLG-----SDTLRLGKDAIAG 187
Query: 211 FLVGCSVLSSRQPA------GIAGFGRGKTSLPSQLNL---DKFSYCLLSHK---FDDTT 258
+ GC V + P G+ G GRG SL SQ FSYCL S++ F +
Sbjct: 188 YAFGC-VGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSYCLPSYRSYYFSGSL 246
Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
R + + + YTP + NP R + YYV + ++VG V+V
Sbjct: 247 RLGAA----------GQPRNVRYTPLLTNP---HRPSL---YYVNVTGLSVGRTWVKVPA 290
Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
D GT++DSGT T ++ L +EF Q+ YT +LGA
Sbjct: 291 GSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAAPSGYT-SLGA-----FDT 344
Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
CF+ G P + LH GG ++TLP+EN CL + + ++
Sbjct: 345 CFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVV 404
Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLCK 465
N Q QN V D+ R+GF ++ C
Sbjct: 405 ANLQQQNVRVVVDVAGSRVGFAREPCN 431
>gi|356500756|ref|XP_003519197.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 451
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 111/386 (28%), Positives = 165/386 (42%), Gaps = 47/386 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSS-SRLL 145
G Y + + G+P Q+ +LDT + W PCT C CSSS + P+ S++ +
Sbjct: 106 GSYVVRVKLGSPNQLFFMVLDTSTDEAWVPCTG---CTGCSSSST-YYSPQASTTYGGAV 161
Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPN 205
C P+C+ Q R P SK CT + Y + ++L L
Sbjct: 162 ACYAPRCA-------QARGALPCPYTGSKACT-----FNQSYAGSTFSATLVQDSLRLGI 209
Query: 206 RIIPNFLVGCS------VLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTR 259
+P++ GC L ++ G+ S S+L FSYCL S F +
Sbjct: 210 DTLPSYAFGCVNSASGWTLPAQGLLGLGRGPLSLPSQSSKLYSGIFSYCLPS--FQSSYF 267
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
+ SL L G + ++ + TP + NP R + YYV L +TVG +V + +
Sbjct: 268 SGSLKL--GPTGQPRR---IRTTPLLQNP---RRPSL---YYVNLTGVTVGRVKVPLPIE 316
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
YL D + GTI+DSGT T ++ + DEF +Q VK ++R G C
Sbjct: 317 YLAFDPNKGSGTILDSGTVITRFVGPVYSAIRDEFRNQ-VKGPFFSRG-------GFDTC 368
Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
F E P +KL F G +VTLP EN G CL + ++
Sbjct: 369 FVKTYENLT--PLIKLRFT-GLDVTLPYENTLIHTAYGGMACLAMAAAPNNVNSVLNVIA 425
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLCK 465
N+Q QN V +D N R+G ++LC
Sbjct: 426 NYQQQNLRVLFDTVNNRVGIARELCN 451
>gi|212722554|ref|NP_001131154.1| uncharacterized protein LOC100192462 precursor [Zea mays]
gi|194690728|gb|ACF79448.1| unknown [Zea mays]
Length = 431
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 118/447 (26%), Positives = 174/447 (38%), Gaps = 53/447 (11%)
Query: 33 LSRFHT--NPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYS 90
LS +H PS +++ +L + R L + +K ++ T+ S + Y
Sbjct: 24 LSVYHNVHPPSPSPLESIIALARADDARLLFLS---SKAASSGGITSAPVASGQTPPSYV 80
Query: 91 ISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNP 150
+ GTP Q + LDT + W C C C + FIP SSS L C +
Sbjct: 81 VRAGLGTPVQQLLLALDTSADATWSHCA---PCDTCPAGS--RFIPASSSSYASLPCASD 135
Query: 151 KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPN 210
C + PL C P + + L S+TL L I
Sbjct: 136 WCPLFEGQPCPANQDASAPL---PACAFSKPFADTSFQASLG-----SDTLRLGKDAIAG 187
Query: 211 FLVGCSVLSSRQPA------GIAGFGRGKTSLPSQLNL---DKFSYCLLSHK---FDDTT 258
+ GC V + P G+ G GRG SL SQ FSYCL S++ F +
Sbjct: 188 YAFGC-VGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSYCLPSYRSYYFSGSL 246
Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
R + + + YTP + NP R + YYV + ++VG V+V
Sbjct: 247 RLGAA----------GQPRNVRYTPLLTNP---HRPSL---YYVNVTGLSVGRTWVKVPA 290
Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
D GT++DSGT T ++ L +EF Q+ YT +LGA
Sbjct: 291 GSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAAPSGYT-SLGA-----FDT 344
Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
CF+ G P + LH GG ++TLP+EN CL + + ++
Sbjct: 345 CFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVV 404
Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLCK 465
N Q QN V D+ R+GF ++ C
Sbjct: 405 ANLQQQNVRVVVDVAGSRVGFAREPCN 431
>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 118/427 (27%), Positives = 180/427 (42%), Gaps = 51/427 (11%)
Query: 59 ALHIKNPQTKTTT----TTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVW 114
AL + +T +T +TT+ TT + H ++SL+ GTP Q I +LDTGS L W
Sbjct: 33 ALRTQKHRTPISTPRLFSTTSKTTDKLLFHHNVTLTVSLTAGTPLQNITMVLDTGSELSW 92
Query: 115 FPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSK 174
C F P S + + C +P C D PL S
Sbjct: 93 LHCKKEPNFNSI-------FNPLASKTYTKIPCSSPTCE---------TRTRDLPLPVSC 136
Query: 175 NCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGC--SVLSSR-----QPAGIA 227
+ ++C + + EG ET + + P + GC S SS + G+
Sbjct: 137 DPAKLCHFIISYADASSVEGNLAFETFRVGSVTGPATVFGCMDSGFSSNSEEDAKTTGLM 196
Query: 228 GFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNN 287
G RG S +Q+ KFSYC+ D + L+L S K L YTP V
Sbjct: 197 GMNRGSLSFVNQMGFRKFSYCI-----SDRDSSGVLLLGEASFSWLKP---LNYTPLVEM 248
Query: 288 PSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELF 347
S V Y V L I V + + + D G G T+VDSGT FTF+ ++
Sbjct: 249 -STPLPYFDRVAYSVQLEGIRVSDKVLSLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVY 307
Query: 348 EPLADEFVSQ---MVKNRNYTRALGAEALTGLRPCFDVPGEKTG--SFPELKLHFKGGAE 402
L EF+ Q +++ N R + A+ C+ + + + P + L F+ GAE
Sbjct: 308 SALKQEFLLQTKGVLRVLNEPRYVFQGAMD---LCYLIEPTRAALPNLPVVNLMFR-GAE 363
Query: 403 VTLPVENYF-----AVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRL 457
+++ + V G+ S C T + ++ G S ++G+ Q QN ++EYDL R+
Sbjct: 364 MSVSGQRLLYRVPGEVRGKDSVWCFT-FGNSDSLGIESFVIGHHQQQNVWMEYDLEKSRI 422
Query: 458 GFKQQLC 464
GF + C
Sbjct: 423 GFAEVRC 429
>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 442
Score = 113 bits (282), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 105/397 (26%), Positives = 169/397 (42%), Gaps = 52/397 (13%)
Query: 90 SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQN 149
+IS++ GTPPQ + ++DTGS L W C + ++ P F P +SSS + C +
Sbjct: 67 TISITVGTPPQNMSMVIDTGSELSWLHCNTNTT----ATIPYPFFNPNISSSYTPISCSS 122
Query: 150 PKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIP 209
P C+ D P+ S + +C + L + +EG S+T + P
Sbjct: 123 PTCT---------TRTRDFPIPASCDSNNLCHATLSYADASSSEGNLASDTFGFGSSFNP 173
Query: 210 NFLVGC-------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSS 262
+ GC + S G+ G G SL SQL + KFSYC+ F S
Sbjct: 174 GIVFGCMNSSYSTNSESDSNTTGLMGMNLGSLSLVSQLKIPKFSYCISGSDF------SG 227
Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPS---VAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
++L S+ S + L YTP V + +R+A Y V L I + + + +
Sbjct: 228 ILLLGESNFSWGGS--LNYTPLVQISTPLPYFDRSA----YTVRLEGIKISDKLLNISGN 281
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT---GL 376
D G G T+ D GT F+++ ++ L DEF++Q RAL +
Sbjct: 282 LFVPDHTGAGQTMFDLGTQFSYLLGPVYNALRDEFLNQ---TNGTLRALDDPNFVFQIAM 338
Query: 377 RPCFDVPGEKTG--SFPELKLHFKG------GAEVTLPVENYFAVVGEGSAVCLTVVTDR 428
C+ VP ++ P + L F+G G ++ V + V G S C T +
Sbjct: 339 DLCYRVPVNQSELPELPSVSLVFEGAEMRVFGDQLLYRVPGF--VWGNDSVYCFT-FGNS 395
Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
+ G + I+G+ Q+ ++E+DL R+G C
Sbjct: 396 DLLGVEAFIIGHHHQQSMWMEFDLVEHRVGLAHARCD 432
>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 631
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 119/412 (28%), Positives = 174/412 (42%), Gaps = 98/412 (23%)
Query: 85 SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRL 144
S G Y+ L GTPPQ I+DTGS + + PC+ CK C + P F P+LS+S +
Sbjct: 72 SNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCST---CKQCGKHQDPKFQPELSTSYQA 128
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETL--- 201
L C NP C +C+DE K C Y Y + LSE L
Sbjct: 129 LKC-NPDC-----------NCDDE----GKLCV-----YERRYAEMSSSSGVLSEDLISF 167
Query: 202 NLPNRIIPNFLV-GCS-----VLSSRQPAGIAGFGRGKTSLPSQLNLDK------FSYCL 249
+++ P V GC L S++ GI G GRGK S+ QL +DK FS C
Sbjct: 168 GNESQLSPQRAVFGCENEETGDLFSQRADGIMGLGRGKLSVVDQL-VDKGVIEDVFSLCY 226
Query: 250 LSHKFDDTTRTSSLILDNGS-------SHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYV 302
+ +++L S SHSD PF S YY +
Sbjct: 227 GGMEVGG----GAMVLGKISPPPGMVFSHSD---------PFR-----------SPYYNI 262
Query: 303 GLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNR 362
L+++ V G+ +++ K +G GT++DSGTT+ + E F + D + ++ +
Sbjct: 263 DLKQMHVAGKSLKLNPKVF----NGKHGTVLDSGTTYAYFPKEAFIAIKDAVIKEIPSLK 318
Query: 363 NYTRALGAEALTGLRP-----CFDVPGEKTGS----FPELKLHFKGGAEVTLPVENY-FA 412
+ G P CF G FPE+ + F G ++ L ENY F
Sbjct: 319 R---------IHGPDPNYDDVCFSGAGRDVAEIHNFFPEIAMEFGNGQKLILSPENYLFR 369
Query: 413 VVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
A CL + DR++ + +LG ++N V YD N +LGF + C
Sbjct: 370 HTKVRGAYCLGIFPDRDS----TTLLGGIVVRNTLVTYDRENDKLGFLKTNC 417
>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
Length = 586
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 119/412 (28%), Positives = 174/412 (42%), Gaps = 98/412 (23%)
Query: 85 SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRL 144
S G Y+ L GTPPQ I+DTGS + + PC+ CK C + P F P+LS+S +
Sbjct: 72 SNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCST---CKQCGKHQDPKFQPELSTSYQA 128
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETL--- 201
L C NP C +C+DE K C Y Y + LSE L
Sbjct: 129 LKC-NPDC-----------NCDDE----GKLCV-----YERRYAEMSSSSGVLSEDLISF 167
Query: 202 NLPNRIIPNFLV-GCS-----VLSSRQPAGIAGFGRGKTSLPSQLNLDK------FSYCL 249
+++ P V GC L S++ GI G GRGK S+ QL +DK FS C
Sbjct: 168 GNESQLSPQRAVFGCENEETGDLFSQRADGIMGLGRGKLSVVDQL-VDKGVIEDVFSLCY 226
Query: 250 LSHKFDDTTRTSSLILDNGS-------SHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYV 302
+ +++L S SHSD PF S YY +
Sbjct: 227 GGMEVG----GGAMVLGKISPPPGMVFSHSD---------PFR-----------SPYYNI 262
Query: 303 GLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNR 362
L+++ V G+ +++ K +G GT++DSGTT+ + E F + D + ++ +
Sbjct: 263 DLKQMHVAGKSLKLNPKVF----NGKHGTVLDSGTTYAYFPKEAFIAIKDAVIKEIPSLK 318
Query: 363 NYTRALGAEALTGLRP-----CFDVPGEKTGS----FPELKLHFKGGAEVTLPVENY-FA 412
+ G P CF G FPE+ + F G ++ L ENY F
Sbjct: 319 R---------IHGPDPNYDDVCFSGAGRDVAEIHNFFPEIAMEFGNGQKLILSPENYLFR 369
Query: 413 VVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
A CL + DR++ + +LG ++N V YD N +LGF + C
Sbjct: 370 HTKVRGAYCLGIFPDRDS----TTLLGGIVVRNTLVTYDRENDKLGFLKTNC 417
>gi|357116170|ref|XP_003559856.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 460
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 102/389 (26%), Positives = 161/389 (41%), Gaps = 40/389 (10%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + S GTPPQ + +DT + W PC + C + PSF P S++ R + C
Sbjct: 94 YLVRASLGTPPQRLLLAVDTSNDAAWVPCAGCHGCP----TTAPSFNPASSATFRPVPCG 149
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR-- 206
P CS + S C LA SKN + + YG + + L +
Sbjct: 150 APPCSQAPNPS-----CTS--LAKSKNSC----GFSLSYGDSSLDATLSQDNLAVTANGG 198
Query: 207 IIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDK------FSYCLLSHKFDDTTRT 260
+I + GC S+ A G + K FSYCL S+ +
Sbjct: 199 VIKGYTFGCLTKSNGSAAPAQGLLGLGRGPLGFVAQTKGIYEGTFSYCLPSYYRSAANFS 258
Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
SL L + +K + TP + +P R + YYV + + +G + V +
Sbjct: 259 GSLTLGRKGQPAPEK---MKTTPLLASP---HRPSL---YYVAMTGVRIGKKSVPIPPSA 309
Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADE----FVSQMVKNRNYTRALGAEALTGL 376
L D GT++DSGT F +A + + DE + + ++ +L G
Sbjct: 310 LAFDAATGAGTVLDSGTMFARLAQPAYAAVRDEVRRRVAGSLRRRGGGGASVSVSSLGGF 369
Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
C++V T ++P + L F GG EV LP EN GS CL + ++
Sbjct: 370 DTCYNV---STVAWPAVTLVFGGGMEVRLPEENVVIRSTYGSTSCLAMAASPADGVNAAL 426
Query: 437 -ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
++G+ Q QN+ V +D+ N R+GF ++ C
Sbjct: 427 NVIGSLQQQNHRVLFDVPNARVGFARERC 455
>gi|222619890|gb|EEE56022.1| hypothetical protein OsJ_04800 [Oryza sativa Japonica Group]
Length = 423
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 100/383 (26%), Positives = 154/383 (40%), Gaps = 70/383 (18%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y GTP Q + +D + W PC+ C C++S PSF P SS+ R + C
Sbjct: 102 YIARAGLGTPAQTLLVAIDPSNDAAWVPCS---ACAGCAASS-PSFSPTQSSTYRTVPCG 157
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
+P+C+ + S P +C + + Y + + + ++L L N ++
Sbjct: 158 SPQCAQVPSPSC--------PAGVGSSC-----GFNLTYAASTFQAVLGQDSLALENNVV 204
Query: 209 PNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNG 268
++ GC + + AG R L + LL + D G
Sbjct: 205 VSYTFGCLRVVNGNSRAAAGAHR----------LRPRAALLL-------------VADQG 241
Query: 269 SSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGN 328
+ + TP + NP R + YYV + I VG + V+V L +
Sbjct: 242 HLGPIGQPKRIKTTPLLYNP---HRPSL---YYVNMIGIRVGSKVVQVPQSALAFNPVTG 295
Query: 329 GGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTG 388
GTI+D+GT FT +A ++ + D F R R A L G C++V T
Sbjct: 296 SGTIIDAGTMFTRLAAPVYAAVRDAF-------RGRVRTPVAPPLGGFDTCYNV----TV 344
Query: 389 SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI-------ILGNF 441
S P + F G VTLP EN G CL + + GPS +L +
Sbjct: 345 SVPTVTFMFAGAVAVTLPEENVMIHSSSGGVACLAM------AAGPSDGVNAALNVLASM 398
Query: 442 QMQNYYVEYDLRNQRLGFKQQLC 464
Q QN V +D+ N R+GF ++LC
Sbjct: 399 QQQNQRVLFDVANGRVGFSRELC 421
>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
Length = 473
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 105/386 (27%), Positives = 159/386 (41%), Gaps = 52/386 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y++++ GTP + I DTGS L W C K C K P P S+S + +
Sbjct: 131 GDYAVTVGLGTPKKEFTLIFDTGSDLTWTQC--EPCAKTCYKQKEPRLDPTKSTSYKNIS 188
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNL-P 204
C + C + E ++C+ Y V YG G + G +ETL L
Sbjct: 189 CSSAFCKLLDTEG-------------GESCSSPTCLYQVQYGDGSYSIGFFATETLTLSS 235
Query: 205 NRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLDK---FSYCLLSHKFDDTT 258
+ + NFL GC +S R AG+ G GR K SLPSQ FSYCL +
Sbjct: 236 SNVFKNFLFGCGQQNSGLFRGAAGLLGLGRTKLSLPSQTAQKYKKLFSYCL------PAS 289
Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
+S L G S + +TP ++E + +Y + + ++VGG ++ +
Sbjct: 290 SSSKGYLSFGGQVSKT----VKFTP------LSEDFKSTPFYGLDITELSVGGNKLSIDA 339
Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
+ GT++DSGT T + + L+ F M T + +
Sbjct: 340 SIFS-----TSGTVIDSGTVITRLPSTAYSALSSAFQKLM------TDYPSTDGYSIFDT 388
Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
C+D +T P++ + FKGG E+ + V V VCL + + + I
Sbjct: 389 CYDFSKNETIKIPKVGVSFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNGDDV--KAAIF 446
Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
GN Q + Y V YD R+GF C
Sbjct: 447 GNTQQKTYQVVYDDAKGRVGFAPSGC 472
>gi|115483168|ref|NP_001065177.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|21717168|gb|AAM76361.1|AC074196_19 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433289|gb|AAP54827.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639786|dbj|BAF27091.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|215686408|dbj|BAG87693.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 109/385 (28%), Positives = 162/385 (42%), Gaps = 54/385 (14%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + + GTPPQ ++D LVW C QC C P F P S++ R C
Sbjct: 51 YVANFTIGTPPQPASAVIDLAGELVWTQCK---QCSRCFEQDTPLFDPTASNTYRAEPCG 107
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
P C I +S +NC+ +Y +G T G ++T +
Sbjct: 108 TPLCESIPSDS--------------RNCSGNVCAYQASTNAGDTGGKVGTDTFAV-GTAK 152
Query: 209 PNFLVGCSVLSSRQ----PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLI 264
+ GC V S P+GI G GR SL +Q + FSYCL H D R S+L
Sbjct: 153 ASLAFGCVVASDIDTMGGPSGIVGLGRTPWSLVTQTGVAAFSYCLAPH---DAGRNSALF 209
Query: 265 LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLD 324
L GSS TPFVN N S YY V L + G + + T+
Sbjct: 210 L--GSSAKLAGGGKAASTPFVN--ISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGSTV- 264
Query: 325 RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL-TGLRP---CF 380
++D+ + +F+ + Q VK + T A+GA + T + P CF
Sbjct: 265 -------LLDTFSPISFLVDGAY---------QAVK-KAVTAAVGAPPMATPVEPFDLCF 307
Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI-ILG 439
G +G+ P+L F+GGA +T+P NY G+ VCL +++ + + +LG
Sbjct: 308 PKSG-ASGAAPDLVFTFRGGAAMTVPATNYLLDYKNGT-VCLAMLSSARLNSTTELSLLG 365
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
+ Q +N + +DL + L F+ C
Sbjct: 366 SLQQENIHFLFDLDKETLSFEPADC 390
>gi|308081797|ref|NP_001182920.1| uncharacterized protein LOC100501208 [Zea mays]
gi|238008190|gb|ACR35130.1| unknown [Zea mays]
gi|413922182|gb|AFW62114.1| hypothetical protein ZEAMMB73_927324 [Zea mays]
Length = 269
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 87/262 (33%), Positives = 127/262 (48%), Gaps = 25/262 (9%)
Query: 210 NFLVGCSVLSSRQPAG---IAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILD 266
N GC L++ AG I G G S+ QL++ KFSYCL F D +TS ++
Sbjct: 23 NLTFGCGKLTNGTIAGASGIMGVSPGPLSVLKQLSITKFSYCLTP--FTDH-KTSPVMFG 79
Query: 267 NGSSHSDKKTTGLTYT-PFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
+ KTTG T P + NP +YYYV + I++G +R+ V L L
Sbjct: 80 AMADLGKYKTTGKVQTIPLLKNP------VEDIYYYVPMVGISIGSKRLDVPEAILALRP 133
Query: 326 DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP-- 383
DG GGT++DS TT ++ F+ L + M K R++ + CF++P
Sbjct: 134 DGTGGTVLDSATTLAYLVEPAFKELKKAVMEGM-KLPAANRSIDDYPV-----CFELPRG 187
Query: 384 -GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
+ P L LHF G AE++LP ++YF G +CL V+ G P++I GN Q
Sbjct: 188 MSMEGVQVPPLVLHFAGDAEMSLPRDSYFQEPSPG-MMCLAVM-QAPFEGAPNVI-GNVQ 244
Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
QN +V YDL N++ + C
Sbjct: 245 QQNMHVLYDLGNRKFSYAPTKC 266
>gi|383130038|gb|AFG45739.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
Length = 154
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 58/146 (39%), Positives = 86/146 (58%), Gaps = 6/146 (4%)
Query: 279 LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTT 338
L YTPF+ N A + + +YY+ LR +++G +R+ + K + D GNGGTI+DSGTT
Sbjct: 15 LNYTPFLINTK-ASSSGYHTFYYIDLRGVSIGRKRLNLPSKLFSFDTKGNGGTIIDSGTT 73
Query: 339 FTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFK 398
FT E ++ + F SQ+ + RA EA TG+R C++V G P+ HFK
Sbjct: 74 FTIFNEEFYKNITAAFASQI----GFRRASEVEARTGMRLCYNVSGVDHVLLPDFAFHFK 129
Query: 399 GGAEVTLPVENYFAVVGEGSAVCLTV 424
GG+++ LPV NYF+ S +CLT+
Sbjct: 130 GGSDMVLPVANYFSYFVSDS-ICLTM 154
>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
Length = 521
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 113/391 (28%), Positives = 165/391 (42%), Gaps = 62/391 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y +++ GTP + DTGS W C Y K+ F P SS+ +
Sbjct: 180 GNYVVTIGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYKQQEKL--FDPARSSTYANVS 237
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C P CS ++ ++ C+ Y V YG G + G +TL L +
Sbjct: 238 CAAPACSDLY----------------TRGCSGGHCLYSVQYGDGSYSIGFFAMDTLTLSS 281
Query: 206 -RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLDK----FSYCLLSHKFDDT 257
+ F GC + + AG+ G GRGKTSLP Q DK F++CL +
Sbjct: 282 YDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQ-TYDKYGGVFAHCLPARS---- 336
Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFV--NNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
+ + LD G S TP + N P+ +YYVG+ I VGGQ +
Sbjct: 337 --SGTGYLDFGPG-SPAAVGARQTTPMLTDNGPT---------FYYVGMTGIRVGGQLLS 384
Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
+ + GTIVDSGT T + P + L F S M R Y + A AL+
Sbjct: 385 IPQSVFS-----TAGTIVDSGTVITRLPPAAYSSLRSAFASAMAA-RGYKK---APALSL 435
Query: 376 LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVEN--YFAVVGEGSAVCLTVVTDREASGG 433
L C+D G + P++ L F+GGA + + Y A + S VCL + +
Sbjct: 436 LDTCYDFTGMSEVAIPKVSLLFQGGAYLDVNASGIMYAASL---SQVCLGFAANEDDD-- 490
Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN Q++ + V YD+ + +GF C
Sbjct: 491 DVGIVGNTQLKTFGVVYDIGKKTVGFSPGAC 521
>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 431
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 107/382 (28%), Positives = 155/382 (40%), Gaps = 53/382 (13%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + GTP Q + +DT + W PC+ C CSS+ F S++ + +GC+
Sbjct: 96 YIVRAKIGTPAQTMLLAMDTSNDAAWIPCSG---CVGCSST---VFNNVKSTTFKTVGCE 149
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
P+C + + C ++ + YGS + + L I
Sbjct: 150 APQCKQVPNSKCGGSAC----------------AFNMTYGSSSIAANLSQDVVTLATDSI 193
Query: 209 PNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTTRTSS 262
P++ GC + SS P G+ G GRG SL SQ L FSYCL S F + S
Sbjct: 194 PSYTFGCLTEATGSSIPPQGLLGLGRGPMSLLSQTQNLYQSTFSYCLPS--FRSLNFSGS 251
Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
L L KTT P + NP R++ YYV L I VG + V + L
Sbjct: 252 LRLGPVGQPKRIKTT-----PLLKNP---RRSSL---YYVNLMAIRVGRRVVDIPPSALA 300
Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
+ GTI DSGT FT + + + D F + V N T +L G C+
Sbjct: 301 FNPTTGAGTIFDSGTVFTRLVAPAYTAVRDAF-RKRVGNATVT------SLGGFDTCYTS 353
Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
P P + F G VTLP +N S CL + + ++ N Q
Sbjct: 354 PIVA----PTITFMFSG-MNVTLPPDNLLIHSTASSITCLAMAAAPDNVNSVLNVIANMQ 408
Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
QN+ + +D+ N RLG ++ C
Sbjct: 409 QQNHRILFDVPNSRLGVAREPC 430
>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
Length = 471
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 111/391 (28%), Positives = 166/391 (42%), Gaps = 56/391 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCK-YCSSSKIPSFIPKLSSSSRLL 145
G Y + L GTPP+ ILDTGS L W C C YC + P + P +S + + L
Sbjct: 123 GNYYVKLGLGTPPKYYAMILDTGSSLSWLQCQ---PCAVYCHAQADPLYDPSVSKTYKKL 179
Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNL- 203
C + +CS + ++ ND T N Y YG + + G + L L
Sbjct: 180 SCASVECSRLKAATL-----NDPLCETDSNACL----YTASYGDTSFSIGYLSQDLLTLT 230
Query: 204 PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDT 257
++ +P F GC + + AGI G R K S+ +QL+ FSYCL
Sbjct: 231 SSQTLPQFTYGCGQDNQGLFGRAAGIIGLARDKLSMLAQLSTKYGHAFSYCL-------P 283
Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFV---NNPSVAERNAFSVYYYVGLRRITVGGQRV 314
T S S T +TP + NPS+ Y++ L ITV G+ +
Sbjct: 284 TANSGSSGGGFLSIGSISPTSYKFTPMLTDSKNPSL---------YFLRLTAITVSGRPL 334
Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
+ + T++DSGT T + ++ L FV M T+ A A +
Sbjct: 335 DLAAAMYRVP------TLIDSGTVITRLPMSMYAALRQAFVKIMS-----TKYAKAPAYS 383
Query: 375 GLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGP 434
L CF + + PE+K+ F+GGA++TL + +G +T + +SG
Sbjct: 384 ILDTCFKGSLKSISAVPEIKMIFQGGADLTLRAPSILIEADKG----ITCLAFAGSSGTN 439
Query: 435 SI-ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I I+GN Q Q Y + YD+ R+GF C
Sbjct: 440 QIAIIGNRQQQTYNIAYDVSTSRIGFAPGSC 470
>gi|218187618|gb|EEC70045.1| hypothetical protein OsI_00635 [Oryza sativa Indica Group]
Length = 570
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 117/392 (29%), Positives = 163/392 (41%), Gaps = 83/392 (21%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y ++++ G+PP+ + I DTGS LVW C +++ F P SS+ + CQ
Sbjct: 101 YLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDPSRSSTYGRVSCQ 160
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNL---- 203
C + + C+D NC +YL YG G T G+ +ET
Sbjct: 161 TDACEALGRAT-----CDD-----GSNC-----AYLYAYGDGSNTTGVLSTETFTFDDGG 205
Query: 204 ----PNRI-IPNFLVGCSVLSSRQ--PAGIAGFGRGKTSLPSQLNLD-----KFSYCLLS 251
P ++ I GCS ++ G+ G G G SL +QL +FSYCL+
Sbjct: 206 AGRSPRQVRIGGVKFGCSTATAGSFPADGLVGLGGGAVSLVTQLGGATSLGRRFSYCLVP 265
Query: 252 HKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG 311
H + +S+L N + +D G TP V N +VA + +
Sbjct: 266 HSVN---ASSAL---NFGALADVTEPGAASTPLVGNKTVASAASSRI------------- 306
Query: 312 QRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
IVDSGTT TF+ P L P+ DE +R T
Sbjct: 307 --------------------IVDSGTTLTFLDPSLLGPIVDEL------SRRITLPPVQS 340
Query: 372 ALTGLRPCFDVPG---EKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR 428
L+ C++V G E S P+L L F GGA V L EN F V EG+ +CL +V
Sbjct: 341 PDGLLQLCYNVAGREVEAGESIPDLTLEFGGGAAVALKPENAFVAVQEGT-LCLAIVATT 399
Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFK 460
E P ILGN QN +V YDL +G K
Sbjct: 400 EQQ--PVSILGNLAQQNIHVGYDLDAGTVGNK 429
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 54/137 (39%), Positives = 66/137 (48%), Gaps = 12/137 (8%)
Query: 332 IVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPG---EKTG 388
IVDSGTT TF+ P L P+ DE +R T L+ C++V G E
Sbjct: 440 IVDSGTTLTFLDPSLLGPIVDEL------SRRITLPPVQSPDGLLQLCYNVAGREVEAGE 493
Query: 389 SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYV 448
S P+L L F GGA V L EN F V EG+ +CL +V E P ILGN QN +V
Sbjct: 494 SIPDLTLEFGGGAAVALKPENAFVAVQEGT-LCLAIVATTEQQ--PVSILGNLAQQNIHV 550
Query: 449 EYDLRNQRLGFKQQLCK 465
YDL + F C
Sbjct: 551 GYDLDAGTVTFAVADCA 567
>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 112/387 (28%), Positives = 159/387 (41%), Gaps = 56/387 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y +++ GTP + DTGS L W C C P F P S+S + +
Sbjct: 138 GAYVVTVGLGTPKKDFTLSFDTGSDLTWTQCEPCL--GGCFPQNQPKFDPTTSTSYKNVS 195
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
C + C I + +DC S C Y + YGSG T G +ETL + +
Sbjct: 196 CSSEFCKLIAEGNYPAQDC------ISNTCL-----YGIQYGSGYTIGFLATETLAIASS 244
Query: 207 -IIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLN---LDKFSYCLLSHKFDDTTR 259
+ NFL GCS S G+ G GR +LPSQ + FSYCL + +
Sbjct: 245 DVFKNFLFGCSEESRGTFNGTTGLLGLGRSPIALPSQTTNKYKNLFSYCLPASP----SS 300
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
T L S + K T P +P + + Y + I+V G+ + +
Sbjct: 301 TGHLSFGVEVSQAAKST------PI--SPKLKQ------LYGLNTVGISVRGRELPING- 345
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
++ R TI+DSGTTFTF+ + L F M NYT G + +PC
Sbjct: 346 --SISR-----TIIDSGTTFTFLPSPTYSALGSAFREMMA---NYTLTNGTSS---FQPC 392
Query: 380 FDVP--GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
+D G T + P + + F+GG EV + V V VCL S I
Sbjct: 393 YDFSNIGNGTLTIPGISIFFEGGVEVEIDVSGIMIPVNGLKEVCLAFA--DTGSDSDFAI 450
Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
GN+Q + Y V YD+ +GF + C
Sbjct: 451 FGNYQQKTYEVIYDVAKGMVGFAPKGC 477
>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 434
Score = 112 bits (279), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 129/484 (26%), Positives = 199/484 (41%), Gaps = 80/484 (16%)
Query: 5 ISALCLSFIFFFTLLSIFPSSITSLTFSLSRFHTN--------PSQDSYQNLNSLVSSSL 56
IS L+ + F+ L +IF + FS+ H + P++ +Q + + V S+
Sbjct: 4 ISPSTLALVLFY-LCNIFYLEAFNGGFSVEMIHRDSSRSPFFRPTETQFQRVANAVHRSV 62
Query: 57 TRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFP 116
RA H T T + G Y IS S G PP + I+DTGS ++W
Sbjct: 63 NRANHFHKAHKAAKATIT---------QNDGEYLISYSVGIPPFQLYGIIDTGSDMIWLQ 113
Query: 117 CTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNC 176
C C+ C + F P S++ ++L + C + E + S +
Sbjct: 114 CK---PCEKCYNQTTRIFDPSKSNTYKILPFSSTTCQSV------------EDTSCSSDN 158
Query: 177 TQICPSYLVLYGSG-LTEGIALSETLNL--PNRIIPNF---LVGC----SVLSSRQPAGI 226
++C Y + YG G ++G ETL L N F ++GC +V + +GI
Sbjct: 159 RKMC-EYTIYYGDGSYSQGDLSVETLTLGSTNGSSVKFRRTVIGCGRNNTVSFEGKSSGI 217
Query: 227 AGFGRGKTSLPSQLNL------DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLT 280
G G G SL +QL KFSYCL S + +S L + + S G
Sbjct: 218 VGLGNGPVSLINQLRRRSSSIGRKFSYCLASM----SNISSKLNFGDAAVVSGD---GTV 270
Query: 281 YTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFT 340
TP V + V+YY+ L +VG R+ GN I+DSGTT T
Sbjct: 271 STPIVTHDP-------KVFYYLTLEAFSVGNNRIEFTSSSFRFGEKGN--IIIDSGTTLT 321
Query: 341 FMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGG 400
+ +++ L + +R + L L C+ ++ + P + HF G
Sbjct: 322 LLPNDIYSKLESAVADLVELDR------VKDPLKQLSLCYRSTFDELNA-PVIMAHFS-G 373
Query: 401 AEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFK 460
A+V L N F V +G CL ++ + GP I GN QN+ V YDL+ + + FK
Sbjct: 374 ADVKLNAVNTFIEVEQG-VTCLAFISSKI---GP--IFGNMAQQNFLVGYDLQKKIVSFK 427
Query: 461 QQLC 464
C
Sbjct: 428 PTDC 431
>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
Length = 443
Score = 112 bits (279), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 119/403 (29%), Positives = 168/403 (41%), Gaps = 68/403 (16%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSS 141
G Y +S+ GTP + + + DTGS L W QC CSS + P F P SS+
Sbjct: 83 GNYVVSVGLGTPARDLTVVFDTGSDLSWV------QCGPCSSGGCYHQQDPLFAPSSSST 136
Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSET 200
+ C P+C C+ P CP Y V+YG T G ++T
Sbjct: 137 FSAVRCGEPECPRARQS------CSSSP------GDDRCP-YEVVYGDKSRTVGHLGNDT 183
Query: 201 LNLP-----------NRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLN---LD 243
L L + +P F+ GC ++ + G+ G GRGK SL SQ +
Sbjct: 184 LTLGTTPSTNASENNSNKLPGFVFGCGENNTGLFGKADGLFGLGRGKVSLSSQAAGKYGE 243
Query: 244 KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVG 303
FSYCL S + S +H+ +TP +N R+ +YYV
Sbjct: 244 GFSYCLPSSSSNAHGYLSLGTPAPAPAHA-------RFTPMLN------RSNTPSFYYVK 290
Query: 304 LRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRN 363
L I V G+ ++V + G IVDSGT T +AP + L F+S M K
Sbjct: 291 LVGIRVAGRAIKVSSRPALWP----AGLIVDSGTVITRLAPRAYSALRTAFLSAMGK-YG 345
Query: 364 YTRALGAEALTGLRPCFD--VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVC 421
Y R A L+ L C+D T S P + L F GGA +++ V A C
Sbjct: 346 YKR---APRLSILDTCYDFTAHANATVSIPAVALVFAGGATISVDFSGVLYVAKVAQA-C 401
Query: 422 LTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
L + +G + ILGN Q + V YD+ Q++GF + C
Sbjct: 402 LAFAPN--GNGRSAGILGNTQQRTVAVVYDVGRQKIGFAAKGC 442
>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 431
Score = 112 bits (279), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 129/471 (27%), Positives = 194/471 (41%), Gaps = 68/471 (14%)
Query: 8 LCLSFIFFFTLL-SIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQ 66
LCL I F L S F I S S F+ ++ +Q + + V S+ RA H
Sbjct: 12 LCLYNICFSEALKSGFSVEIIHRDSSRSPFY-RATETQFQRVTNAVRRSMNRANHFNQIS 70
Query: 67 TKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYC 126
+ + T + G Y +S S GTPP + I+DT S ++W C C+ C
Sbjct: 71 VYSNAVESPVTLLDD-----GDYLMSYSLGTPPFPVYGIVDTASDIIWVQCQ---LCETC 122
Query: 127 SSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVL 186
+ P F P S + + L C + C +S+Q C+ + +IC +
Sbjct: 123 YNDTSPMFDPSYSKTYKNLPCSSTTC-----KSVQGTSCSSDE-------RKICEHTVNY 170
Query: 187 YGSGLTEGIALSETLNL-----PNRIIPNFLVGC--SVLSSRQPAGIAGFGRGKTSLPSQ 239
++G + ET+ L P P ++GC + S GI G G G SL Q
Sbjct: 171 KDGSHSQGDLIVETVTLGSYNDPFVHFPRTVIGCIRNTNVSFDSIGIVGLGGGPVSLVPQ 230
Query: 240 LN---LDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAF 296
L+ KFSYCL + R+S L + + S T V+ R F
Sbjct: 231 LSSSISKKFSYCLAPI----SDRSSKLKFGDAAMVSGDGT-------------VSTRIVF 273
Query: 297 ---SVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADE 353
+YY+ L +VG R+ G G I+DSGTTFT + +++ L +
Sbjct: 274 KDWKKFYYLTLEAFSVGNNRIEFRSSSSR--SSGKGNIIIDSGTTFTVLPDDVYSKL-ES 330
Query: 354 FVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAV 413
V+ +VK RA + L C+ +K P + HF GA+V L N F +
Sbjct: 331 AVADVVK---LERA--EDPLKQFSLCYKSTYDKV-DVPVITAHF-SGADVKLNALNTF-I 382
Query: 414 VGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
V VCL ++ + + I GN QN+ V YDL+ + + FK C
Sbjct: 383 VASHRVVCLAFLSSQSGA-----IFGNLAQQNFLVGYDLQRKIVSFKPTDC 428
>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 112 bits (279), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 119/440 (27%), Positives = 178/440 (40%), Gaps = 63/440 (14%)
Query: 39 NPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTP 98
N Q Q N + S++R H Q T + + I ++ G Y +SLS GTP
Sbjct: 47 NSQQTHLQRWNKAMRRSVSRVHHF---QRTAATVSPKEVESEIIANG-GEYLMSLSLGTP 102
Query: 99 PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE 158
P I I DTGS L+W CT C C P F PK S + R L C +C +
Sbjct: 103 PFEILAIADTGSDLIWTQCT---PCDKCYKQIAPLFDPKSSKTYRDLSCDTRQCQNLGES 159
Query: 159 SIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPNR-----IIPNFL 212
S C+ E Q+C Y YG T G +T+ LP+ P +
Sbjct: 160 S----SCSSE---------QLC-QYSYYYGDRSFTNGNLAVDTVTLPSTNGGPVYFPKTV 205
Query: 213 VGCSVLSS----RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTRTSSLIL 265
+GC ++ ++ +GI G G G SL SQ+ KFSYCL+ + +S L
Sbjct: 206 IGCGRRNNGTFDKKDSGIIGLGGGPMSLISQMGSSVGGKFSYCLVPFSSESAGNSSKLHF 265
Query: 266 DNGSSHSDKKTTGLTYTPFVN-NPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLD 324
+ S +G+ TP ++ NP +YY+ L ++VG +++
Sbjct: 266 GRNAVVSG---SGVQSTPLISKNP--------DTFYYLTLEAMSVGDKKIEFGGSSFGGS 314
Query: 325 RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPG 384
I+DSGT+ T F A V V N T+ RP D+
Sbjct: 315 EG---NIIIDSGTSLTLFPVNFFTEFATA-VENAVINGERTQDASGLLSHCYRPTPDL-- 368
Query: 385 EKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQ 444
P + HF GA+V L N F ++ + +CL + + + I GN
Sbjct: 369 ----KVPVITAHFN-GADVVLQTLNTFILISD-DVLCLAFNSTQSGA-----IFGNVAQM 417
Query: 445 NYYVEYDLRNQRLGFKQQLC 464
N+ + YD++ + + FK C
Sbjct: 418 NFLIGYDIQGKSVSFKPTDC 437
>gi|302143530|emb|CBI22091.3| unnamed protein product [Vitis vinifera]
Length = 360
Score = 111 bits (278), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 69/220 (31%), Positives = 106/220 (48%), Gaps = 16/220 (7%)
Query: 245 FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGL 304
FSYCL+ D +S LI G L +T V + N +YYV +
Sbjct: 154 FSYCLVDRN-SDANVSSKLIF--GEDKDLLSHPELNFTTLV----AGKENPVDTFYYVQI 206
Query: 305 RRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNY 364
+ I VGG+ V + + + DG+GGTI+DSGTT ++ A ++ + + F M K + Y
Sbjct: 207 KSIVVGGEVVNIPEEKWQIATDGSGGTIIDSGTTLSYFAEPAYQVIKEAF---MAKVKGY 263
Query: 365 TRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTV 424
+ L PC++V G + P+ + F GA PVENYF + VCL +
Sbjct: 264 PV---VKDFPVLEPCYNVTGVEQPDLPDFGIVFSDGAVWNFPVENYFIEIEPREVVCLAI 320
Query: 425 VTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ ++ I+GN+Q QN+++ YD + RLGF C
Sbjct: 321 LGTPPSALS---IIGNYQQQNFHILYDTKKSRLGFAPTKC 357
>gi|115452187|ref|NP_001049694.1| Os03g0271900 [Oryza sativa Japonica Group]
gi|29893618|gb|AAP06872.1| hypothetical protein [Oryza sativa Japonica Group]
gi|108707424|gb|ABF95219.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|108707425|gb|ABF95220.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548165|dbj|BAF11608.1| Os03g0271900 [Oryza sativa Japonica Group]
gi|215715205|dbj|BAG94956.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737033|dbj|BAG95962.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215740994|dbj|BAG97489.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 447
Score = 111 bits (278), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 111/410 (27%), Positives = 167/410 (40%), Gaps = 62/410 (15%)
Query: 90 SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQN 149
++ ++ GTPPQ + +LDTGS L W C Y + P+F SSS + C +
Sbjct: 56 TVPVAVGTPPQNVTMVLDTGSELSWLLCNGSY-----APPLTPAFNASGSSSYGAVPCPS 110
Query: 150 PKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIP 209
C W + C+ P N ++ SY + +G+ ++T L P
Sbjct: 111 TACEWRGRDLPVPPFCDTPP----SNACRVSLSYA---DASSADGVLATDTFLLTGGAPP 163
Query: 210 ---NFLVGC---------------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLS 251
GC S G+ G RG S +Q +F+YC+
Sbjct: 164 VAVGAYFGCITSYSSTTATNSNGTGTDVSEAATGLLGMNRGTLSFVTQTGTRRFAYCIAP 223
Query: 252 HKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAF--SVYYYVGLRRITV 309
+ L+ D+G L YTP + +++ + V Y V L I V
Sbjct: 224 GEGPGVL----LLGDDGGVAPP-----LNYTPLIE---ISQPLPYFDRVAYSVQLEGIRV 271
Query: 310 GGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
G + + LT D G G T+VDSGT FTF+ + + L EF SQ R LG
Sbjct: 272 GCALLPIPKSVLTPDHTGAGQTMVDSGTQFTFLLADAYAALKAEFTSQA---RLLLAPLG 328
Query: 370 AEALT---GLRPCFDVPGEK----TGSFPELKLHFKGGAEVTLPVENYFAVV-----GEG 417
CF P + +G PE+ L + GAEV + E +V GEG
Sbjct: 329 EPGFVFQGAFDACFRGPEARVAAASGLLPEVGLVLR-GAEVAVSGEKLLYMVPGERRGEG 387
Query: 418 SAVCLTVVT--DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
A + +T + + +G + ++G+ QN +VEYDL+N R+GF C
Sbjct: 388 GAEAVWCLTFGNSDMAGMSAYVIGHHHQQNVWVEYDLQNGRVGFAPARCD 437
>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
Length = 449
Score = 111 bits (277), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 123/406 (30%), Positives = 166/406 (40%), Gaps = 61/406 (15%)
Query: 85 SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRL 144
S G Y ++LS GTPP I I DTGS L W C C K P F P S++
Sbjct: 76 SGGEYMMNLSIGTPPFPILAIADTGSDLTWL---QSKPCDQCYPQKGPIFDPSNSTTFHK 132
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNL 203
L C C+ + + C D C Y YG T G S+T+ +
Sbjct: 133 LPCTTAPCNALDESARSCTD--------PTTC-----GYTYSYGDHSYTTGYLASDTVTV 179
Query: 204 PNR--IIPNFLVGCSVLS----SRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLL---- 250
N I N GC + Q +GI G G G S SQL KFSYCLL
Sbjct: 180 GNASVQIRNVAFGCGTRNGGNFDEQGSGIVGLGGGNLSFVSQLGDTIGKKFSYCLLPLEN 239
Query: 251 --SHKFDDTTRTSSLILDNGSSHSDKKTTGLTY--TPFVNNPSVAERNAFSVYYYVGLRR 306
S + D+ TS ++ + S T G+ + TP VN S YYY+ +
Sbjct: 240 EISSQPSDSPATSRIVFGDNPVFSSSSTNGVVFATTPLVNKEP-------STYYYLTIEA 292
Query: 307 ITVGGQRV---RVWHKYLTLDRDG-----NGGTIVDSGTTFTFMAPELFEPLADEFVSQM 358
ITVG +++ K + D G I+DSGTT TF+ E + L V ++
Sbjct: 293 ITVGRKKLLYSSSSSKTASYDSGSKSSVEEGNIIIDSGTTLTFLEEEFYGALEAALVEEI 352
Query: 359 VKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGS 418
R +L CF G++ P +K+HF+GGA+V L N F EG
Sbjct: 353 KMERVNDVKNSMFSL-----CFK-SGKEEVELPLMKVHFRGGADVELKPVNTFVRAEEG- 405
Query: 419 AVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
VC T++ + I GN N+ V YDL + + F C
Sbjct: 406 LVCFTMLPTNDVG-----IYGNLAQMNFVVGYDLGKRTVSFLPADC 446
>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 482
Score = 111 bits (277), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 116/438 (26%), Positives = 169/438 (38%), Gaps = 53/438 (12%)
Query: 37 HTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGG---YSISL 93
H D + V++ + R H K + T++ S G Y + +
Sbjct: 88 HRRGFNDRMKRDAIRVATLVRRLSHGAPAAVKDSRYKVANFATDVISGMEAGSGEYFVRI 147
Query: 94 SFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCS 153
G+PP+ ++D+GS +VW C C C P F P SSS + C + C
Sbjct: 148 GVGSPPRNQYMVIDSGSDIVWVQCK---PCSRCYQQSDPVFDPADSSSFAGVSCGSDVCD 204
Query: 154 WIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFL 212
+ + C Y V YG G T+G ETL + +I +
Sbjct: 205 RLENTGCNAGRCR----------------YEVSYGDGSYTKGTLALETLTVGQVMIRDVA 248
Query: 213 VGCSVLSSRQ---PAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTRTSSLILD 266
+GC + AG+ G G G S QL FSYCL+S T T +L
Sbjct: 249 IGCGHTNQGMFIGAAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRG---TGSTGALEFG 305
Query: 267 NGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRD 326
G+ G T+ + NP +YY+GL I VGG RV V + L
Sbjct: 306 RGA-----LPVGATWISLIRNPRAPS------FYYIGLAGIGVGGVRVSVPEETFQLTEY 354
Query: 327 GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEK 386
G G ++D+GT T + D F +Q N RA G C+D+ G +
Sbjct: 355 GTNGVVMDTGTAVTRFPTAAYVAFRDSFTAQ---TSNLPRAPGVSIFD---TCYDLNGFE 408
Query: 387 TGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNY 446
+ P + +F G +TLP N+ V G CL + G SII GN Q +
Sbjct: 409 SVRVPTVSFYFSDGPVLTLPARNFLIPVDGGGTFCLAFA---PSPSGLSII-GNIQQEGI 464
Query: 447 YVEYDLRNQRLGFKQQLC 464
+ +D N +GF +C
Sbjct: 465 QISFDGANGFVGFGPNIC 482
>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 463
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 110/388 (28%), Positives = 173/388 (44%), Gaps = 44/388 (11%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + GTP + I+DTGS L W C YC P F P S + + L
Sbjct: 111 GNYYVKIGLGTPAKYFSMIVDTGSSLSWLQC--QPCVIYCHVQVDPIFTPSTSKTYKALP 168
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPN 205
C + +CS + ++ C+ N T C Y YG + + G + L L
Sbjct: 169 CSSSQCSSLKSSTLNAPGCS--------NATGAC-VYKASYGDTSFSIGYLSQDVLTLTP 219
Query: 206 RIIPN--FLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDT 257
P+ F+ GC + + +GI G K S+ QL+ + FSYCL S
Sbjct: 220 SEAPSSGFVYGCGQDNQGLFGRSSGIIGLANDKISMLGQLSKKYGNAFSYCLPSSFSAPN 279
Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
+ + S L G+S LT +P+ P V + S+ Y++ L ITV G+ + V
Sbjct: 280 SSSLSGFLSIGASS-------LTSSPYKFTPLVKNQKIPSL-YFLDLTTITVAGKPLGVS 331
Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
N TI+DSGT T + ++ L FV ++ ++ Y +A G + L
Sbjct: 332 ASSY------NVPTIIDSGTVITRLPVAVYNALKKSFV--LIMSKKYAQAPG---FSILD 380
Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
CF ++ + PE+++ F+GGA + L N + +G+ CL + AS P I
Sbjct: 381 TCFKGSVKEMSTVPEIQIIFRGGAGLELKAHNSLVEIEKGT-TCLAIA----ASSNPISI 435
Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
+GN+Q Q + V YD+ N ++GF C+
Sbjct: 436 IGNYQQQTFKVAYDVANFKIGFAPGGCQ 463
>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 533
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 113/373 (30%), Positives = 162/373 (43%), Gaps = 50/373 (13%)
Query: 105 ILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRD 164
I+DTGS L W C C + + P F P S + + C +P C+ +D
Sbjct: 197 IVDTGSDLTWVQC-EPCPGSSCYAQRDPLFDPAASPTFAAVPCGSPACA------ASLKD 249
Query: 165 CNDEPLATSK---NCTQICPSYLVLYGSG-LTEGIALSETLNLPNRI-IPNFLVGCSVLS 219
P + ++ N Q C Y + YG G + G+ +TL L + F+ GC LS
Sbjct: 250 ATGAPGSCARSAGNSEQRC-YYALSYGDGSFSRGVLAQDTLGLGTTTKLDGFVFGCG-LS 307
Query: 220 SRQ----PAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHS 272
+R AG+ G GR SL SQ FSYCL + TT T SL L G S S
Sbjct: 308 NRGLFGGTAGLMGLGRTDLSLVSQTAARFGGVFSYCLPA----TTTSTGSLSLGPGPSSS 363
Query: 273 DKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTI 332
+ YT + +P+ +Y++ + V LT G G +
Sbjct: 364 FPN---MAYTRMIADPTQPP------FYFINITGAAV------GGGAALTAPGFGAGNVL 408
Query: 333 VDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPE 392
VDSGT T +AP +++ + EF + Y A G + L C+D+ G + P
Sbjct: 409 VDSGTVITRLAPSVYKAVRAEFARRF----EYPAAPG---FSILDACYDLTGRDEVNVPL 461
Query: 393 LKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYD 451
L L +GGA+VT+ F V +GS VCL + + P I+GN+Q +N V YD
Sbjct: 462 LTLTLEGGAQVTVDAAGMLFVVRKDGSQVCLAMASLPYEDQTP--IIGNYQQRNKRVVYD 519
Query: 452 LRNQRLGFKQQLC 464
RLGF + C
Sbjct: 520 TVGSRLGFADEDC 532
>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 440
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 119/431 (27%), Positives = 187/431 (43%), Gaps = 57/431 (13%)
Query: 49 NSLVSS--SLTRALHIKNPQTKTTTTTTTTTTTNISSHSYG-GYSISLSFGTPPQIIPFI 105
+S++SS SL R +++ +T+ T N+ + G + ++ S G PP
Sbjct: 49 DSILSSYQSLDRN-NVERRRTRRAAFITDEIQANMVADDRGQAFLVNFSVGRPPVPQLVG 107
Query: 106 LDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDC 165
+DTGS L+W C C C P F P SS+ L +P C
Sbjct: 108 IDTGSDLLWVQCR---PCADCFRQSTPIFDPSKSSTYVDLSYDSPICP------------ 152
Query: 166 NDEPLATSKNCTQICPSYLVLYGSGLTEGIALS------ETLNLPNRIIPNFLVGCSVLS 219
+ P + Q Y Y G T L+ ET + + + + GC S
Sbjct: 153 -NSPQKKYNHLNQCI--YNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCG-HS 208
Query: 220 SR-----QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDK 274
+R Q +GI G G S+ S+L +FSYC+ FD + L+L +G
Sbjct: 209 NRGRFDGQQSGILGLSAGDQSIVSRLG-SRFSYCI-GDLFDPHYTHNQLVLGDGV----- 261
Query: 275 KTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVD 334
K G + TPF + F+ +YYV L I+VG R+ + + G GG ++D
Sbjct: 262 KMEG-SSTPF---------HTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMD 311
Query: 335 SGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELK 394
SGTT TF+A + F+PL++E + ++V R + + + + G E FPEL
Sbjct: 312 SGTTATFLAKDGFDPLSNE-IQRLV--RGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELA 368
Query: 395 LHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRN 454
HF GA++ L + F V CL V+ + G ++G Q+Y V YDL
Sbjct: 369 FHFAEGADLVLDANSLF-VQKNQDVFCLAVLESNLKNIGS--VIGIMAQQHYNVAYDLIG 425
Query: 455 QRLGFKQQLCK 465
+R+ F++ C+
Sbjct: 426 KRVYFQRTDCE 436
>gi|21717175|gb|AAM76368.1|AC074196_26 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433292|gb|AAP54830.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 418
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 112/387 (28%), Positives = 159/387 (41%), Gaps = 57/387 (14%)
Query: 92 SLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPK 151
+ + GTPPQ I+D LVW C+ C C +P F+P SS+ R C
Sbjct: 70 NFTIGTPPQPASAIIDVAGELVWTQCS---MCSRCFKQDLPLFVPNASSTFRPEPCGTDA 126
Query: 152 CSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNF 211
C I + C E SK G T GI ++T + +
Sbjct: 127 CKSIPTSNCSSNMCTYEGTINSKL-------------GGHTLGIVATDTFAI-GTATASL 172
Query: 212 LVGCSVLSSRQ----PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDN 267
GC V S P+G+ G GR +SL SQ+N+ KFSYCL H D+ + S L+L
Sbjct: 173 GFGCVVASGIDTMGGPSGLIGLGRAPSSLVSQMNITKFSYCLTPH---DSGKNSRLLL-- 227
Query: 268 GSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDG 327
GSS T TPFV + + S YY + L I G + L G
Sbjct: 228 GSSAKLAGGGNSTTTPFVK---TSPGDDMSQYYPIQLDGIKAG-------DAAIALPPSG 277
Query: 328 NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE-ALTGLRP---CFDVP 383
N +V + +F+ ++ L E T+A+GA T L+P CF
Sbjct: 278 N-TVLVQTLAPMSFLVDSAYQALKKEV----------TKAVGAAPTATPLQPFDLCFPKA 326
Query: 384 GEKTGSFPELKLHF-KGGAEVTLPVENYFAVVGEGSA-VCLTVVT----DREASGGPSII 437
G S P+L F +G A +T+P Y VGE VC+ +++ + A I
Sbjct: 327 GLSNASAPDLVFTFQQGAAALTVPPPKYLIDVGEEKGTVCMAILSTSWLNTTALDENLNI 386
Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
LG+ Q +N + DL + L F+ C
Sbjct: 387 LGSLQQENTHFLLDLEKKTLSFEPADC 413
>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 519
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 111/389 (28%), Positives = 160/389 (41%), Gaps = 58/389 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y +++ GTP + DTGS W C Y K+ F P SS+ +
Sbjct: 178 GNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKL--FDPARSSTYANVS 235
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C P CS ++ C Y V YG G + G +TL L +
Sbjct: 236 CAAPACSDLNIHGCSGGHC----------------LYGVQYGDGSYSIGFFAMDTLTLSS 279
Query: 206 -RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLDK----FSYCLLSHKFDDT 257
+ F GC + + AG+ G GRGKTSLP Q DK F++CL +
Sbjct: 280 YDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQ-TYDKYGGVFAHCLPARS---- 334
Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFV--NNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
T + LD G+ LT TP + N P+ +YYVG+ I VGGQ +
Sbjct: 335 --TGTGYLDFGAGSLAAARARLT-TPMLTENGPT---------FYYVGMTGIRVGGQLLS 382
Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
+ GTIVDSGT T + P + L + + R Y + A A++
Sbjct: 383 IPQSVFA-----TAGTIVDSGTVITRLPPAAYSSL-RYAFAAAMAARGYKK---APAVSL 433
Query: 376 LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPS 435
L C+D G + P + L F+GGA + + S VCL + + GG
Sbjct: 434 LDTCYDFTGMSQVAIPTVSLLFQGGARLDVDASGIM-YAASASQVCLAFAANED--GGDV 490
Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN Q++ + V YD+ + +GF C
Sbjct: 491 GIVGNTQLKTFGVAYDIGKKVVGFYPGAC 519
>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
Length = 350
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 115/389 (29%), Positives = 163/389 (41%), Gaps = 54/389 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + GTP + +LDTGS +VW C C+ C S P F P S S +G
Sbjct: 6 GEYFTRIGIGTPTREQYMVLDTGSDVVWIQCE---PCRECYSQADPIFNPSSSVSFSTVG 62
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C + CS + DC+ C Y V YG G T G +ETL
Sbjct: 63 CDSAVCS-----QLDANDCH------GGGCL-----YEVSYGDGSYTVGSYATETLTFGT 106
Query: 206 RIIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTR 259
I N +GC +V AG+ G G G S P+QL FSYCL+ D
Sbjct: 107 TSIQNVAIGCGHDNVGLFVGAAGLLGLGAGSLSFPAQLGTQTGRAFSYCLV-----DRDS 161
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV-RVWH 318
SS L+ G + G +TP V NP + +YY+ + I+VGG + V
Sbjct: 162 ESSGTLEFG---PESVPIGSIFTPLVANPFLP------TFYYLSMVAISVGGVILDSVPS 212
Query: 319 KYLTLDR-DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
+ +D G GG I+DSGT T + ++ L D F++ ++ R A+ ++
Sbjct: 213 EAFRIDETTGRGGIIIDSGTAVTRLQTSAYDALRDAFIA---GTQHLPR---ADGISIFD 266
Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREASGGPSI 436
C+D+ ++ S P + HF GA LP +N + C D S
Sbjct: 267 TCYDLSALQSVSIPAVGFHFSNGAGFILPAKNCLIPMDSMGTFCFAFAPADSNLS----- 321
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
I+GN Q Q V +D N +GF C+
Sbjct: 322 IMGNIQQQGIRVSFDSANSLVGFAIDQCQ 350
>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 641
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 109/397 (27%), Positives = 169/397 (42%), Gaps = 67/397 (16%)
Query: 85 SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRL 144
S G Y+ L GTPPQ I+DTGS + + PC+ C+ C + P F P+ SS+ +
Sbjct: 84 SNGYYTTRLFIGTPPQEFALIVDTGSTVTYVPCST---CEQCGKHQDPRFQPESSSTYKP 140
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALS---ETL 201
+ C NP C +C+DE K CT + SGL LS E+
Sbjct: 141 MQC-NPSC-----------NCDDE----GKQCTYERRYAEMSSSSGLLAEDVLSFGNESE 184
Query: 202 NLPNRIIPNFLVGCSV-----LSSRQPAGIAGFGRGKTSLPSQLNLDKF---SYCLLSHK 253
P R I GC L S++ GI G GRG S+ QL + + S+ L
Sbjct: 185 LTPQRAI----FGCETVETGELFSQRADGIMGLGRGPLSVVDQLVIKEVVGNSFSLCYGG 240
Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAF-SVYYYVGLRRITVGGQ 312
D +++L N D A + + S YY + L+ + V G+
Sbjct: 241 MD--VVGGAMVLGNIPPPPDM--------------VFAHSDPYRSAYYNIELKELHVAGK 284
Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
R+++ + DG GT++DSGTT+ ++ E F D ++K + + +
Sbjct: 285 RLKLNPRVF----DGKHGTVLDSGTTYAYLPEEAFVAFKDA----IIKEIKFLKQIHGPD 336
Query: 373 LTGLRPCFDVPGEKTGS----FPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVVTD 427
+ CF G FPE+ + F G +++L ENY F A CL + +
Sbjct: 337 PSYNDICFSGAGRDVSQLSKIFPEVNMVFGNGQKLSLSPENYLFRHTKVSGAYCLGIFQN 396
Query: 428 REASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ P+ +LG ++N V YD N ++GF + C
Sbjct: 397 GK---DPTTLLGGIVVRNTLVTYDRDNDKIGFWKTNC 430
>gi|357491945|ref|XP_003616260.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517595|gb|AES99218.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 441
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 119/427 (27%), Positives = 185/427 (43%), Gaps = 31/427 (7%)
Query: 42 QDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYG-GYSISLSFGTPPQ 100
+ + N+N S S L I +TT T +IS + Y ++L GTPPQ
Sbjct: 27 HNKHHNVNDSFSLSFPLTLSI------NSTTKTNPIVPSISPYKYSMALVVTLPIGTPPQ 80
Query: 101 IIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESI 160
+ +LDTGS + W C N + SF P LSSS L C +P C
Sbjct: 81 LQQMVLDTGSQVSWIHCDNKKGPQKKQPPTTSSFDPSLSSSFFALPCNHPLCK------- 133
Query: 161 QCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNL-PNRIIPNFLVGCSVL 218
D L T + ++C Y Y G + EG + E + L P+ P ++GC+
Sbjct: 134 --PQVPDISLPTDCDANRLC-HYSFSYTDGTVVEGNLVRENIALSPSLTTPPIILGCAN- 189
Query: 219 SSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTG 278
S GI G G+ S P+Q + KFSY + + + SL L N + S +
Sbjct: 190 QSDDARGILGMNLGRLSFPNQAKITKFSYFVPVKQ--TQPGSGSLYLGNNPNSSCFRYVK 247
Query: 279 LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTT 338
L F + S N + + + ++ I++GG+++ + D G G TI+DSG+
Sbjct: 248 LLT--FSKSQSQRMPNLDPLAFTLPMQGISIGGKKLNIPPSVFKPDTTGFGQTIIDSGSE 305
Query: 339 FTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSF-PELKLHF 397
F++M + + + +E V ++ G A CFD + G ++ F
Sbjct: 306 FSYMVDKAYNVIRNELVKKVGSKIKKDYIYGGVADI----CFDGDATEIGRLVGDMVFEF 361
Query: 398 KGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRL 457
+ G E+ +P E V +G C + E GG I+GNF QN +VE+DL R+
Sbjct: 362 EKGVEIVIPKERVLIEV-DGGVHCFGI-GRAEGLGGGGNIIGNFYQQNLWVEFDLAKHRV 419
Query: 458 GFKQQLC 464
GF+ C
Sbjct: 420 GFRGANC 426
>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 475
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 114/445 (25%), Positives = 177/445 (39%), Gaps = 72/445 (16%)
Query: 34 SRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISL 93
+RF+ +D+ + ++SL R L P T + + + S G Y + +
Sbjct: 89 TRFNARMQRDTKR------AASLLRRLAAGKP-TYAAEAFGSDVVSGMEQGS-GEYFVRI 140
Query: 94 SFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCS 153
G+PP+ ++D+GS ++W C C C P F P SSS + C + CS
Sbjct: 141 GVGSPPRNQYVVMDSGSDIIWVQCE---PCTQCYHQSDPVFNPADSSSFSGVSCASTVCS 197
Query: 154 WIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFL 212
+ + + C Y V YG G T+G ET+ +I N
Sbjct: 198 HVDNAACHEGRCR----------------YEVSYGDGSYTKGTLALETITFGRTLIRNVA 241
Query: 213 VGCS-------------VLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTR 259
+GC + P G G+T FSYCL+S +
Sbjct: 242 IGCGHHNQGMFVGAAGLLGLGGGPMSFVGQLGGQTG-------GAFSYCLVSRGIE---- 290
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
SS +L+ G + G + P ++NP +YY+GL + VGG RV +
Sbjct: 291 -SSGLLEFGR---EAMPVGAAWVPLIHNPRAQS------FYYIGLSGLGVGGLRVSISED 340
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
L G+GG ++D+GT T + +E D F++Q N RA G C
Sbjct: 341 VFKLSELGDGGVVMDTGTAVTRLPTVAYEAFRDGFIAQTT---NLPRASGVSIFD---TC 394
Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
+D+ G + P + +F GG +TLP N+ V + C +S G SII G
Sbjct: 395 YDLFGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDDVGTFCFAFA---PSSSGLSII-G 450
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
N Q + + D N +GF +C
Sbjct: 451 NIQQEGIQISVDGANGFVGFGPNVC 475
>gi|56784900|dbj|BAD82194.1| aspartic proteinase nepenthesin I-like [Oryza sativa Japonica
Group]
Length = 260
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 86/261 (32%), Positives = 128/261 (49%), Gaps = 28/261 (10%)
Query: 209 PNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLIL 265
P GC++ S +G+ G GRGK SL +QLN++ F Y L S D + S +
Sbjct: 15 PGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQLNVEAFGYRLSS----DLSAPSPISF 70
Query: 266 DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
+ + + TP + NP V + +YYVGL I+VGG+ V++ + DR
Sbjct: 71 GSLADVTGGNGDSFMSTPLLTNPVVQDLP----FYYVGLTGISVGGKLVQIPSGTFSFDR 126
Query: 326 D-GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPG 384
G GG I DSGTT T + + + DE +SQM + A + + CF G
Sbjct: 127 STGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDDDLI-----CF-TGG 180
Query: 385 EKTGSFPELKLHFKGGAEVTLPVENYFAVV----GEGSAVCLTVVTDREASGGPSIILGN 440
T +FP + LHF GGA++ L ENY + GE +A C +VV +A I+GN
Sbjct: 181 SSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNGE-TARCWSVVKSSQA----LTIIGN 235
Query: 441 FQMQNYYVEYDLR-NQRLGFK 460
+++V +DL N R+ F+
Sbjct: 236 IMQMDFHVVFDLSGNARMLFQ 256
>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 437
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 124/442 (28%), Positives = 197/442 (44%), Gaps = 70/442 (15%)
Query: 39 NPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTP 98
NP + S Q L + + S+ R H T T +++S+S G Y +++S GTP
Sbjct: 47 NPMETSSQRLRNAIHRSVNRVFHF------TEKDNTPQPQIDLTSNS-GEYLMNVSIGTP 99
Query: 99 PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE 158
P I I DTGS L+W C C C + P F PK SS+ + + C + +C+ + ++
Sbjct: 100 PFPIMAIADTGSDLLWTQCA---PCDDCYTQVDPLFDPKTSSTYKDVSCSSSQCTALENQ 156
Query: 159 SIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPNRI-----IPNFL 212
A+ C SY + YG + T+G +TL L + + N +
Sbjct: 157 ------------ASCSTNDNTC-SYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNII 203
Query: 213 VGCSVLSS----RQPAGIAGFGRGKTSLPSQL--NLD-KFSYCL--LSHKFDDTTRTSSL 263
+GC ++ ++ +GI G G G SL QL ++D KFSYCL L+ K D T++
Sbjct: 204 IGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKI--- 260
Query: 264 ILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTL 323
N +++ +G+ TP + + + +YY+ L+ I+VG ++++
Sbjct: 261 ---NFGTNAIVSGSGVVSTPLI------AKASQETFYYLTLKSISVGSKQIQYSGSDSES 311
Query: 324 DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP 383
G I+DSGTT T + E + L D S + + + +GL C+
Sbjct: 312 ---SEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKK------QDPQSGLSLCYSAT 362
Query: 384 GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI-ILGNFQ 442
G+ P + +HF GA+V L N F V E VC G PS I GN
Sbjct: 363 GDL--KVPVITMHFD-GADVKLDSSNAFVQVSE-DLVCFAF------RGSPSFSIYGNVA 412
Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
N+ V YD ++ + FK C
Sbjct: 413 QMNFLVGYDTVSKTVSFKPTDC 434
>gi|224090744|ref|XP_002309070.1| predicted protein [Populus trichocarpa]
gi|222855046|gb|EEE92593.1| predicted protein [Populus trichocarpa]
Length = 404
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 108/392 (27%), Positives = 169/392 (43%), Gaps = 47/392 (11%)
Query: 91 ISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNP 150
+SL+ GTPPQ + ++DTGS L W C + S +F P S+S + + C +P
Sbjct: 33 VSLTVGTPPQNVSMVIDTGSELSWLHCNK-------TLSYPTTFDPTRSTSYQTIPCSSP 85
Query: 151 KCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPN 210
C+ D P+ S + +C + L + ++G S+ ++ + I
Sbjct: 86 TCT---------NRTQDFPIPASCDSNNLCHATLSYADASSSDGNLASDVFHIGSSDISG 136
Query: 211 FLVGC--SVLSSR-----QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSL 263
+ GC SV SS + G+ G RG S SQL KFSYC+ F S L
Sbjct: 137 LVFGCMDSVFSSNSDEDSKSTGLMGMNRGSLSFVSQLGFPKFSYCISGTDF------SGL 190
Query: 264 ILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTL 323
+L S+ + + L YTP + S V Y V L I V + + +
Sbjct: 191 LLLGESNLT--WSVPLNYTPLIQI-STPLPYFDRVAYTVQLEGIKVLDKLLPIPKSTFEP 247
Query: 324 DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT---GLRPCF 380
D G G T+VDSGT FTF+ ++ L F++Q + R L + C+
Sbjct: 248 DHTGAGQTMVDSGTQFTFLLGPVYNALRSAFLNQ---TSSVLRVLEDPDFVFQGAMDLCY 304
Query: 381 DVPGEK--TGSFPELKLHFKGGAEVTLPVENYFAVV-----GEGSAVCLTVVTDREASGG 433
VP + P + L F+ GAE+T+ + V G S CL+ + + G
Sbjct: 305 LVPLSQRVLPLLPTVTLVFR-GAEMTVSGDRVLYRVPGELRGNDSVHCLS-FGNSDLLGV 362
Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
+ ++G+ QN ++E+DL R+G Q C
Sbjct: 363 EAYVIGHHHQQNVWMEFDLEKSRIGLAQVRCD 394
>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
Length = 408
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 119/431 (27%), Positives = 187/431 (43%), Gaps = 57/431 (13%)
Query: 49 NSLVSS--SLTRALHIKNPQTKTTTTTTTTTTTNISSHSYG-GYSISLSFGTPPQIIPFI 105
+S++SS SL R +++ +T+ T N+ + G + ++ S G PP
Sbjct: 17 DSILSSYQSLDRN-NVERRRTRRAAFITDEIQANMVADDRGQAFLVNFSVGRPPVPQLVG 75
Query: 106 LDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDC 165
+DTGS L+W C C C P F P SS+ L +P C
Sbjct: 76 IDTGSDLLWVQCR---PCADCFRQSTPIFDPSKSSTYVDLSYDSPICP------------ 120
Query: 166 NDEPLATSKNCTQICPSYLVLYGSGLTEGIALS------ETLNLPNRIIPNFLVGCSVLS 219
+ P + Q Y Y G T L+ ET + + + + GC S
Sbjct: 121 -NSPQKKYNHLNQCI--YNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCG-HS 176
Query: 220 SR-----QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDK 274
+R Q +GI G G S+ S+L +FSYC+ FD + L+L +G
Sbjct: 177 NRGRFDGQQSGILGLSAGDQSIVSRLG-SRFSYCI-GDLFDPHYTHNQLVLGDGV----- 229
Query: 275 KTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVD 334
K G + TPF + F+ +YYV L I+VG R+ + + G GG ++D
Sbjct: 230 KMEG-SSTPF---------HTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMD 279
Query: 335 SGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELK 394
SGTT TF+A + F+PL++E + ++V R + + + + G E FPEL
Sbjct: 280 SGTTATFLAKDGFDPLSNE-IQRLV--RGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELA 336
Query: 395 LHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRN 454
HF GA++ L + F V CL V+ + G ++G Q+Y V YDL
Sbjct: 337 FHFAEGADLVLDANSLF-VQKNQDVFCLAVLESNLKNIGS--VIGIMAQQHYNVAYDLIG 393
Query: 455 QRLGFKQQLCK 465
+R+ F++ C+
Sbjct: 394 KRVYFQRTDCE 404
>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
Length = 438
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 124/442 (28%), Positives = 197/442 (44%), Gaps = 70/442 (15%)
Query: 39 NPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTP 98
NP + S Q L + + S+ R H T T +++S+S G Y +++S GTP
Sbjct: 47 NPMETSSQRLRNAIHRSVNRVFHF------TEKDNTPQPQIDLTSNS-GEYLMNVSIGTP 99
Query: 99 PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE 158
P I I DTGS L+W C C C + P F PK SS+ + + C + +C+ + ++
Sbjct: 100 PFPIMAIADTGSDLLWTQCA---PCDDCYTQVDPLFDPKTSSTYKDVSCSSSQCTALENQ 156
Query: 159 SIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPNRI-----IPNFL 212
A+ C SY + YG + T+G +TL L + + N +
Sbjct: 157 ------------ASCSTNDNTC-SYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNII 203
Query: 213 VGCSVLSS----RQPAGIAGFGRGKTSLPSQL--NLD-KFSYCL--LSHKFDDTTRTSSL 263
+GC ++ ++ +GI G G G SL QL ++D KFSYCL L+ K D T++
Sbjct: 204 IGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKI--- 260
Query: 264 ILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTL 323
N +++ +G+ TP + + + +YY+ L+ I+VG ++++
Sbjct: 261 ---NFGTNAIVSGSGVVSTPLI------AKASQETFYYLTLKSISVGSKQIQYSGSDSES 311
Query: 324 DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP 383
G I+DSGTT T + E + L D S + + + +GL C+
Sbjct: 312 ---SEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKK------QDPQSGLSLCYSAT 362
Query: 384 GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI-ILGNFQ 442
G+ P + +HF GA+V L N F V E VC G PS I GN
Sbjct: 363 GDL--KVPVITMHFD-GADVKLDSSNAFVQVSE-DLVCFAF------RGSPSFSIYGNVA 412
Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
N+ V YD ++ + FK C
Sbjct: 413 QMNFLVGYDTVSKTVSFKPTDC 434
>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
Length = 460
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 113/406 (27%), Positives = 166/406 (40%), Gaps = 65/406 (16%)
Query: 66 QTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKY 125
Q+ T TT T+ N Y I++ G+P + ++D+GS + W C C
Sbjct: 113 QSHVTVPTTLGTSLNTLE-----YLITVRLGSPAKTQTVLIDSGSDVSWVQCK---PCLQ 164
Query: 126 CSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLV 185
C S P F P LSS+ C + C+ + + C ++S C Y+V
Sbjct: 165 CHSQVDPLFDPSLSSTYSPFSCSSAACAQLGQDGNGC--------SSSSQC-----QYIV 211
Query: 186 LYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLN 241
Y G T G S+TL L + I NF GCS + S G+ G G G SL SQ
Sbjct: 212 RYADGSSTTGTYSSDTLALGSNTISNFQFGCSHVESGFNDLTDGLMGLGGGAPSLASQTA 271
Query: 242 ---LDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSV 298
FSYCL T +SS L G+ T+G TP + + V
Sbjct: 272 GTFGTAFSYCL------PPTPSSSGFLTLGAG-----TSGFVKTPMLRSSPVP------T 314
Query: 299 YYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM 358
+Y V L I VGG ++ + + G ++DSGT T + + L+ F + M
Sbjct: 315 FYGVRLEAIRVGGTQLSIPTSVFS------AGMVMDSGTIITRLPRTAYSALSSAFKAGM 368
Query: 359 VKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGS 418
+ R A + + CFD G+ + P + L F GGA V L G+
Sbjct: 369 KQYRP------APPRSIMDTCFDFSGQSSVRLPSVALVFSGGAVVNLDANGIIL----GN 418
Query: 419 AVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ +D + G I+GN Q + + V YD+ +GFK C
Sbjct: 419 CLAFAANSDDSSPG----IVGNVQQRTFEVLYDVGGGAVGFKAGAC 460
>gi|255581508|ref|XP_002531560.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223528821|gb|EEF30826.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 407
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 110/393 (27%), Positives = 169/393 (43%), Gaps = 44/393 (11%)
Query: 90 SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQN 149
++SL+ GTPPQ + ++DTGS L W C ++S +F S S R + C +
Sbjct: 32 TVSLTVGTPPQNVSMVIDTGSELSWLYCNKTTT----TTSYPTTFNQTRSISYRPIPCSS 87
Query: 150 PKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIP 209
C + Q RD + + S + +C + L + +EG S+T ++ IP
Sbjct: 88 STC------TNQTRDFS---IPASCDSNSLCHATLSYADASSSEGNLASDTFHMGASDIP 138
Query: 210 NFLVGC--SVLSSR-----QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSS 262
+ GC SV SS + G+ G RG S SQ+ KFSYC+ F S
Sbjct: 139 GMVFGCMDSVFSSNSDEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISGTDF------SG 192
Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
++L S+ + L YTP V S + Y V L I V + + +
Sbjct: 193 MLLLGESNFT--WAVPLNYTPLVQI-STPLPYFDRIAYTVQLEGIKVSDRLLPIPKSVFE 249
Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT---GLRPC 379
D G G T+VDSGT FTF+ + L EF++Q + R L + C
Sbjct: 250 PDHTGAGQTMVDSGTQFTFLLGPAYTALRSEFLNQTT---GFLRVLEDPDFVFQGAMDLC 306
Query: 380 FDVPGEKT--GSFPELKLHFKGGAEVTLPVENYF-----AVVGEGSAVCLTVVTDREASG 432
+ VP + P + L F GAE+T+ E + G S CL+ + + G
Sbjct: 307 YRVPISQRVLPRLPTVSLVFN-GAEMTVADERVLYRVPGEIRGNDSVHCLS-FGNSDLLG 364
Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
+ ++G+ QN ++E+DL R+G Q C
Sbjct: 365 VEAYVIGHHHQQNVWMEFDLERSRIGLAQVRCD 397
>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
Length = 496
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 117/389 (30%), Positives = 163/389 (41%), Gaps = 54/389 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + GTP + +LDTGS +VW C C+ C S P F P S S +G
Sbjct: 152 GEYFTRIGIGTPTREQYMVLDTGSDVVWIQCE---PCRECYSQADPIFNPSSSVSFSTVG 208
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C + CS + D ND C Y V YG G T G +ETL
Sbjct: 209 CDSAVCSQL--------DAND--------CHGGGCLYEVSYGDGSYTVGSYATETLTFGT 252
Query: 206 RIIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTR 259
I N +GC +V AG+ G G G S P+QL FSYCL+ D
Sbjct: 253 TSIQNVAIGCGHDNVGLFVGAAGLLGLGAGSLSFPAQLGTQTGRAFSYCLV-----DRDS 307
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV-RVWH 318
SS L+ G + G +TP V NP + +YY+ + I+VGG + V
Sbjct: 308 ESSGTLEFG---PESVPIGSIFTPLVANPFLP------TFYYLSMVAISVGGVILDSVPS 358
Query: 319 KYLTLDR-DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
+ +D G GG I+DSGT T + ++ L D F++ ++ RA G ++
Sbjct: 359 EAFRIDETTGRGGIIIDSGTAVTRLQTSAYDALRDAFIA---GTQHLPRADG---ISIFD 412
Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREASGGPSI 436
C+D+ ++ S P + HF GA LP +N + C D S
Sbjct: 413 TCYDLSALQSVSIPAVGFHFSNGAGFILPAKNCLIPMDSMGTFCFAFAPADSNLS----- 467
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
I+GN Q Q V +D N +GF C+
Sbjct: 468 IMGNIQQQGIRVSFDSANSLVGFAIDQCQ 496
>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
Length = 451
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 105/385 (27%), Positives = 148/385 (38%), Gaps = 68/385 (17%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + G+PP ++D+GS ++W C C+ C + P F P SSS +
Sbjct: 128 GEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCR---PCEQCYAQTDPLFDPAASSSFSGVS 184
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C + C + + C Y V YG G T+G ETL L
Sbjct: 185 CGSAICRTLSGTGCGGG-------GDAGKC-----DYSVTYGDGSYTKGELALETLTLGG 232
Query: 206 RIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLDK---FSYCLLSHKFDDTTR 259
+ +GC +S AG+ G G G SL QL FSYCL S
Sbjct: 233 TAVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASRGAGGAGS 292
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
+S +YYVGL I VGG+R+ +
Sbjct: 293 LAS-----------------------------------SFYYVGLTGIGVGGERLPLQDS 317
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
L DG GG ++D+GT T + E + L F M + A++ L C
Sbjct: 318 LFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPR------SPAVSLLDTC 371
Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
+D+ G + P + +F GA +TLP N VG G+ CL +S G S ILG
Sbjct: 372 YDLSGYASVRVPTVSFYFDQGAVLTLPARNLLVEVG-GAVFCLAFA---PSSSGIS-ILG 426
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
N Q + + D N +GF C
Sbjct: 427 NIQQEGIQITVDSANGYVGFGPNTC 451
>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 484
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 117/437 (26%), Positives = 192/437 (43%), Gaps = 67/437 (15%)
Query: 56 LTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFIL--------- 106
+ RAL + N + ++ T++ + S I L+ G + + +I+
Sbjct: 87 MRRALVLDNIRVQSLQLKIKAMTSSTTEQSVSETQIPLTSGIKLESLNYIVTVELGGKNM 146
Query: 107 ----DTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQC 162
DTGS L W C C+ C + + P + P +SSS + + C + C + +
Sbjct: 147 SLIVDTGSDLTWVQCQ---PCRSCYNQQGPLYDPSVSSSYKTVFCNSSTC-----QDLVA 198
Query: 163 RDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSSR 221
N P + + Y+V YG G T G SE++ L + + NF+ GC R
Sbjct: 199 ATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDTKLENFVFGCG----R 254
Query: 222 QPAGI-------AGFGRGKTSLPSQ----LNLDKFSYCLLSHKFDDTTRTSSLILDNGSS 270
G+ G GR SL SQ N FSYCL S +D S ++ S
Sbjct: 255 NNKGLFGGSSGLMGLGRSSVSLVSQTLKTFN-GVFSYCLPS--LEDGASGSLSFGNDSSV 311
Query: 271 HSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGG 330
+++ +T ++YTP V NP + +Y + L ++GG V K + R G
Sbjct: 312 YTN--STSVSYTPLVQNPQLRS------FYILNLTGASIGG----VELKSSSFGR----G 355
Query: 331 TIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSF 390
++DSGT T + P +++ + EF+ Q ++ A + L CF++ + S
Sbjct: 356 ILIDSGTVITRLPPSIYKAVKIEFLKQ------FSGFPTAPGYSILDTCFNLTSYEDISI 409
Query: 391 PELKLHFKGGAEVTLPVENYFAVVG-EGSAVCLTVVT-DREASGGPSIILGNFQMQNYYV 448
P +K+ F+G AE+ + V F V + S VCL + + E G I+GN+Q +N V
Sbjct: 410 PIIKMIFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVG---IIGNYQQKNQRV 466
Query: 449 EYDLRNQRLGFKQQLCK 465
YD +RLG + C+
Sbjct: 467 IYDTTQERLGIVGENCR 483
>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 111/415 (26%), Positives = 173/415 (41%), Gaps = 58/415 (13%)
Query: 59 ALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCT 118
+L+ N + + T +S+ G Y + GTP + ++DTGS L W C+
Sbjct: 107 SLYRANDDAAVDGSLASVPLTPGTSYGVGNYVTRMGLGTPAKPYIMVVDTGSSLTWLQCS 166
Query: 119 NHYQCKY-CSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCT 177
C+ C P F PK SSS + C P+C+ + ++ C+ +
Sbjct: 167 ---PCRVSCHRQSGPVFDPKTSSSYAAVSCSTPQCNDLSTATLNPAACSS---------S 214
Query: 178 QICPSYLVLYG-SGLTEGIALSETLNLPNRIIPNFLVGCSVLSS---RQPAGIAGFGRGK 233
+C Y YG S + G +T++ + +PNF GC + + AG+ G R K
Sbjct: 215 DVC-IYQASYGDSSFSVGYLSKDTVSFGSNSVPNFYYGCGQDNEGLFGRSAGLMGLARNK 273
Query: 234 TSLPSQLNLD---KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSV 290
SL QL FSYCL S S N +S YTP V++
Sbjct: 274 LSLLYQLAPTLGYSFSYCLPSSSSSGYLSIGSY---NPGQYS--------YTPMVSS--- 319
Query: 291 AERNAFSVYYYVGLRRITVGGQRVRV-WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEP 349
Y++ L +TV G+ + V +Y +L TI+DSGT T + +++
Sbjct: 320 ---TLDDSLYFIKLSGMTVAGKPLAVSSSEYSSLP------TIIDSGTVITRLPTTVYDA 370
Query: 350 LADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVEN 409
L+ M + A+A + L CF V + P + + F GGA + L +N
Sbjct: 371 LSKAVAGAMKGTKR------ADAYSILDTCF-VGQASSLRVPAVSMAFSGGAALKLSAQN 423
Query: 410 YFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
V + S CL R A+ I+GN Q Q + V YD+++ R+GF C
Sbjct: 424 LLVDV-DSSTTCLAFAPARSAA-----IIGNTQQQTFSVVYDVKSNRIGFAAGGC 472
>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
Length = 436
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 120/437 (27%), Positives = 191/437 (43%), Gaps = 67/437 (15%)
Query: 56 LTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFIL--------- 106
+ RAL + N + ++ T++ + S I L+ G + + +I+
Sbjct: 39 MRRALVLDNIRVQSLQLKIKAMTSSTTEQSVSETQIPLTSGIKLESLNYIVTVELGGKNM 98
Query: 107 ----DTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQC 162
DTGS L W C C+ C + + P + P +SSS + + C + C + +
Sbjct: 99 SLIVDTGSDLTWVQCQ---PCRSCYNQQGPLYDPSVSSSYKTVFCNSSTC-----QDLVA 150
Query: 163 RDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSSR 221
N P + + Y+V YG G T G SE++ L + + NF+ GC R
Sbjct: 151 ATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDTKLENFVFGCG----R 206
Query: 222 QPAGI-------AGFGRGKTSLPSQ----LNLDKFSYCLLSHKFDDTTRTSSLILDNGSS 270
G+ G GR SL SQ N FSYCL S +D + SL N SS
Sbjct: 207 NNKGLFGGSSGLMGLGRSSVSLVSQTLKTFN-GVFSYCLPS--LEDGA-SGSLSFGNDSS 262
Query: 271 HSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGG 330
T+ ++YTP V NP + +Y + L ++GG V K + R G
Sbjct: 263 VYTNSTS-VSYTPLVQNPQLRS------FYILNLTGASIGG----VELKSSSFGR----G 307
Query: 331 TIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSF 390
++DSGT T + P +++ + EF+ Q ++ A + L CF++ + S
Sbjct: 308 ILIDSGTVITRLPPSIYKAVKIEFLKQ------FSGFPTAPGYSILDTCFNLTSYEDISI 361
Query: 391 PELKLHFKGGAEVTLPVENYFAVVG-EGSAVCLTVVT-DREASGGPSIILGNFQMQNYYV 448
P +K+ F+G AE+ + V F V + S VCL + + E G I+GN+Q +N V
Sbjct: 362 PIIKMIFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVG---IIGNYQQKNQRV 418
Query: 449 EYDLRNQRLGFKQQLCK 465
YD +RLG + C+
Sbjct: 419 IYDTTQERLGIVGENCR 435
>gi|302141829|emb|CBI19032.3| unnamed protein product [Vitis vinifera]
Length = 382
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 80/230 (34%), Positives = 116/230 (50%), Gaps = 26/230 (11%)
Query: 238 SQLNLDKFSYCLLS-HKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAF 296
SQL KFSYCL S H+ +TSSL+ + ++S+ + TP + NP +
Sbjct: 173 SQLGTQKFSYCLTSIHE----NKTSSLLFGS-LAYSNFNPGKIPRTPLIQNPFLPS---- 223
Query: 297 SVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVS 356
YYY+ L+ ITVG + + L +DG+GG I+DSGTT T++ + F+ L + F+S
Sbjct: 224 --YYYLALKGITVGYTLLPIPEFAFQLGKDGSGGMILDSGTTITYLQEDAFDVLKNAFIS 281
Query: 357 QMVKNRNYTRALGAEALTGLRPCFDVPGEKTGS--FPELKLHFKGGAEVTLPVENYFAVV 414
Q + + TGL CF +P + P+L HFK G ++ LPVENY
Sbjct: 282 QT------ELQVANSSTTGLDLCFHLPVKNAAEVKVPKLIFHFK-GLDLALPVENYMVSD 334
Query: 415 GEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
E +CL + +A+G S I GN Q QN V +DL+ L C
Sbjct: 335 PEMGLICLAI----DATGSLS-IFGNIQQQNMLVLHDLKKSTLSLVPTQC 379
Score = 39.7 bits (91), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 28/52 (53%), Gaps = 6/52 (11%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKL 138
G + ++L GTPP P I+DTGS L+W H CK SK IP++
Sbjct: 97 GEFVVNLMIGTPPVPFPAIMDTGSDLIW----THKLCKGVKPSKFS--IPRI 142
>gi|21717162|gb|AAM76355.1|AC074196_13 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433286|gb|AAP54824.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 397
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 104/381 (27%), Positives = 157/381 (41%), Gaps = 52/381 (13%)
Query: 92 SLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPK 151
+ + GTPPQ +D LVW C+ QC +C +P F+P SS+ + C
Sbjct: 57 NFTIGTPPQAASAFIDLTGELVWTQCS---QCIHCFKQDLPVFVPNASSTFKPEPCGTDV 113
Query: 152 CSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNF 211
C I T K + +C V G T GI ++T + +
Sbjct: 114 CKSI---------------PTPKCASDVCAYDGVTGLGGHTVGIVATDTFAIGTAAPASL 158
Query: 212 LVGCSVLSSRQ----PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDN 267
GC V S P+G G GR SL +Q+ L +FSYCL H DT + S L L
Sbjct: 159 GFGCVVASDIDTMGGPSGFIGLGRTPWSLVAQMKLTRFSYCLAPH---DTGKNSRLFL-- 213
Query: 268 GSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDG 327
S K G +TPFV + + S YY + L I G +T+ R
Sbjct: 214 --GASAKLAGGGAWTPFVKT---SPNDGMSQYYPIELEEIKAG-------DATITMPRGR 261
Query: 328 NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKT 387
N + + + + +++ ++ + T +GA CF P
Sbjct: 262 NTVLVQTAVVRVSLLVDSVYQEFKKAVMASVGAAPTAT-PVGAP----FEVCF--PKAGV 314
Query: 388 GSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT----DREASGGPSIILGNFQM 443
P+L F+ GA +T+P NY VG + VCL+V++ + A G + ILG+FQ
Sbjct: 315 SGAPDLVFTFQAGAALTVPPANYLFDVGNDT-VCLSVMSIALLNITALDGLN-ILGSFQQ 372
Query: 444 QNYYVEYDLRNQRLGFKQQLC 464
+N ++ +DL L F+ C
Sbjct: 373 ENVHLLFDLDKDMLSFEPADC 393
>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 484
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 117/437 (26%), Positives = 192/437 (43%), Gaps = 67/437 (15%)
Query: 56 LTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFIL--------- 106
+ RAL + N + ++ T++ + S I L+ G + + +I+
Sbjct: 87 MRRALVLDNIRVQSLQLKIKAMTSSTTEQSVSETQIPLTSGIKLESLNYIVTVELGGKNM 146
Query: 107 ----DTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQC 162
DTGS L W C C+ C + + P + P +SSS + + C + C + +
Sbjct: 147 SLIVDTGSDLTWVQCQ---PCRSCYNQQGPLYDPSVSSSYKTVFCNSSTC-----QDLVA 198
Query: 163 RDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSSR 221
N P + + Y+V YG G T G SE++ L + + NF+ GC R
Sbjct: 199 ATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDTKLENFVFGCG----R 254
Query: 222 QPAGI-------AGFGRGKTSLPSQ----LNLDKFSYCLLSHKFDDTTRTSSLILDNGSS 270
G+ G GR SL SQ N FSYCL S +D S ++ S
Sbjct: 255 NNKGLFGGSSGLMGLGRSSVSLVSQTLKTFN-GVFSYCLPS--LEDGASGSLSFGNDSSV 311
Query: 271 HSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGG 330
+++ +T ++YTP V NP + +Y + L ++GG V K + R G
Sbjct: 312 YTN--STSVSYTPLVQNPQLRS------FYILNLTGASIGG----VELKSSSFGR----G 355
Query: 331 TIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSF 390
++DSGT T + P +++ + EF+ Q ++ A + L CF++ + S
Sbjct: 356 ILIDSGTVITRLPPSIYKAVKIEFLKQ------FSGFPTAPGYSILDTCFNLTSYEDISI 409
Query: 391 PELKLHFKGGAEVTLPVENYFAVVG-EGSAVCLTVVT-DREASGGPSIILGNFQMQNYYV 448
P +K+ F+G AE+ + V F V + S VCL + + E G I+GN+Q +N V
Sbjct: 410 PIIKMIFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVG---IIGNYQQKNQRV 466
Query: 449 EYDLRNQRLGFKQQLCK 465
YD +RLG + C+
Sbjct: 467 IYDSTQERLGIVGENCR 483
>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 474
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 112/387 (28%), Positives = 159/387 (41%), Gaps = 52/387 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y +++ GTP + I DTGS L W C + C K P F P S+S +
Sbjct: 130 GNYIVTVGLGTPKNDLSLIFDTGSDLTWTQC--QPCVRTCYDQKEPIFNPSKSTSYYNVS 187
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPN 205
C + C + + C ++ NC Y + YG + G E L N
Sbjct: 188 CSSAACGSLSSATGNAGSC------SASNCI-----YGIQYGDQSFSVGFLAKEKFTLTN 236
Query: 206 R-IIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTT 258
+ GC + AG+ G GR K S PSQ FSYCL S +
Sbjct: 237 SDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCLPS----SAS 292
Query: 259 RTSSLILDN-GSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
T L + G S S K +TP ++ + +F Y + + ITVGGQ++ +
Sbjct: 293 YTGHLTFGSAGISRSVK------FTPI---STITDGTSF---YGLNIVAITVGGQKLPIP 340
Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
+ G ++DSGT T + P+ + L F ++M K Y G L
Sbjct: 341 STVFSTP-----GALIDSGTVITRLPPKAYAALRSSFKAKMSK---YPTTSGVSI---LD 389
Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
CFD+ G KT + P++ F GGA V L + F V + S VCL + + S + I
Sbjct: 390 TCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYVF-KISQVCLAFAGNSDDSN--AAI 446
Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
GN Q Q V YD R+GF C
Sbjct: 447 FGNVQQQTLEVVYDGAGGRVGFAPNGC 473
>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 492
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 106/398 (26%), Positives = 159/398 (39%), Gaps = 56/398 (14%)
Query: 75 TTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSF 134
+T ++ ++ G Y + G P + +LDTGS + W C C C P F
Sbjct: 143 STPVSSGTAQGSGEYFSRVGVGQPSKPFYMVLDTGSDVNWLQCK---PCSDCYQQSDPIF 199
Query: 135 IPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTE 193
P SSS L C +C + E CR+ C Y V YG G T
Sbjct: 200 DPTASSSYNPLTCDAQQCQDL--EMSACRN---------GKCL-----YQVSYGDGSFTV 243
Query: 194 GIALSETLNLPNRIIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNLDKFS 246
G ++ET++ + +GC G+ G G G SL SQ+ FS
Sbjct: 244 GEYVTETVSFGAGSVNRVAIGCG----HDNEGLFVGSAGLLGLGGGPLSLTSQIKATSFS 299
Query: 247 YCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
YCL+ D+ ++S+L ++ + P + N V + +YYV L
Sbjct: 300 YCLVDR---DSGKSSTLEFNS------PRPGDSVVAPLLKNQKV------NTFYYVELTG 344
Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
++VGG+ V V + +D+ G GG IVDSGT T + + + + D F R +
Sbjct: 345 VSVGGEIVTVPPETFAVDQSGAGGVIVDSGTAITRLRTQAYNSVRDAF------KRKTSN 398
Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT 426
AE + C+D+ ++ P + HF G LP +NY V C
Sbjct: 399 LRPAEGVALFDTCYDLSSLQSVRVPTVSFHFSGDRAWALPAKNYLIPVDGAGTYCFAFAP 458
Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ I+GN Q Q V +DL N +GF C
Sbjct: 459 TTSSMS----IIGNVQQQGTRVSFDLANSLVGFSPNKC 492
>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 509
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 120/403 (29%), Positives = 165/403 (40%), Gaps = 71/403 (17%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSS 141
G Y +S+ GTP + + + DTGS L W QC CSS + P F P SS+
Sbjct: 152 GNYVVSVGLGTPARDLTVVFDTGSDLSWV------QCGPCSSGGCYKQQDPLFAPSDSST 205
Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSET 200
+ C +C C P CP Y V+YG T+G ++T
Sbjct: 206 FSAVRCGARECRARQS-------CGGSP------GDDRCP-YEVVYGDKSRTQGHLGNDT 251
Query: 201 LNLP-----------NRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLN---LD 243
L L + +P F+ GC ++ Q G+ G GRGK SL SQ +
Sbjct: 252 LTLGTMAPANASAENDNKLPGFVFGCGENNTGLFGQADGLFGLGRGKVSLSSQAAGKFGE 311
Query: 244 KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVG 303
FSYCL S S +H+ +TP +N R +YYV
Sbjct: 312 GFSYCLPSSSSSAPGYLSLGTPVPAPAHAQ-------FTPMLN------RTTTPSFYYVK 358
Query: 304 LRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRN 363
L I V G+ +RV + L IVDSGT T +AP + L F+S M K
Sbjct: 359 LVGIRVAGRAIRVSSPRVALP------LIVDSGTVITRLAPRAYRALRAAFLSAMGKY-G 411
Query: 364 YTRALGAEALTGLRPCFD--VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVC 421
Y R A L+ L C+D T S P + L F GGA +++ V A C
Sbjct: 412 YKR---APRLSILDTCYDFTAHANATVSIPAVALVFAGGATISVDFSGVLYVAKVAQA-C 467
Query: 422 LTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
L + + G + ILGN Q + V YD+ Q++GF + C
Sbjct: 468 LAFAPNGD--GRSAGILGNTQQRTLAVVYDVARQKIGFAAKGC 508
>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
Length = 2819
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 116/403 (28%), Positives = 172/403 (42%), Gaps = 50/403 (12%)
Query: 78 TTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPK 137
+ +S H ++SL+ G+PPQ + +LDTGS L W C S + F P
Sbjct: 989 SNKLSFHHNVTLTVSLTVGSPPQQVTMVLDTGSELSWLHCKK-------SPNLTSVFNPL 1041
Query: 138 LSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIAL 197
SSS + C +P C + RD P + + ++C + + + EG
Sbjct: 1042 SSSSYSPIPCSSPICR------TRTRDL---PNPVTCDPKKLCHAIVSYADASSLEGNLA 1092
Query: 198 SETLNLPNRIIPNFLVGC--SVLSSR-----QPAGIAGFGRGKTSLPSQLNLDKFSYCLL 250
S+ + + +P L GC S SS + G+ G RG S +QL L KFSYC+
Sbjct: 1093 SDNFRIGSSALPGTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSYCI- 1151
Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
+ R SS +L G H LTYTP V S V Y V L I VG
Sbjct: 1152 ------SGRDSSGVLLFGDLHLSW-LGNLTYTPLVQ-ISTPLPYFDRVAYTVQLDGIRVG 1203
Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
+ + + D G G T+VDSGT FTF+ ++ L +EF+ Q + LG
Sbjct: 1204 NKILPLPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALRNEFLEQ---TKGVLAPLGD 1260
Query: 371 EALT---GLRPCFDV-PGEKTGSFPELKLHFKGGAEVTLPVENYFAVV-----GEGSAVC 421
+ C+ V G K + P + L F+ GAE+ + E V G C
Sbjct: 1261 PNFVFQGAMDLCYSVAAGGKLPTLPSVSLMFR-GAEMVVGGEVLLYRVPEMMKGNEWVYC 1319
Query: 422 LTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
LT + + G + ++G+ QN ++E+DL + F LC
Sbjct: 1320 LT-FGNSDLLGIEAFVIGHHHQQNVWMEFDL----VAFAADLC 1357
>gi|125543284|gb|EAY89423.1| hypothetical protein OsI_10930 [Oryza sativa Indica Group]
Length = 447
Score = 108 bits (270), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 110/410 (26%), Positives = 166/410 (40%), Gaps = 62/410 (15%)
Query: 90 SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQN 149
++ ++ GTPPQ + +LDTGS L W C Y + P+F SSS + C +
Sbjct: 56 TVPVAVGTPPQNVTMVLDTGSELSWLLCNGSY-----APPLTPAFNASGSSSYGAVPCPS 110
Query: 150 PKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIP 209
C W + C+ P N ++ SY + +G+ ++T L P
Sbjct: 111 TACEWRGRDLPVPPFCDTPP----SNACRVSLSYA---DASSADGVLATDTFLLTGGAPP 163
Query: 210 ---NFLVGC---------------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLS 251
GC S G+ G RG S +Q +F+YC+
Sbjct: 164 VAVGAYFGCITSYSSTTATNSNGTGTDVSEAATGLLGMNRGTLSFVTQTGTRRFAYCIAP 223
Query: 252 HKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAF--SVYYYVGLRRITV 309
+ L+ D+G L YTP + +++ + V Y V L I V
Sbjct: 224 GEGPGVL----LLGDDGGVAPP-----LNYTPLIE---ISQPLPYFDRVAYSVQLEGIRV 271
Query: 310 GGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
G + + LT D G G T+VDSGT FTF+ + + L EF SQ R LG
Sbjct: 272 GCALLPIPKSVLTPDHTGAGQTMVDSGTQFTFLLADAYAALKAEFTSQA---RLLLAPLG 328
Query: 370 AEALT---GLRPCFDVPGEK----TGSFPELKLHFKGGAEVTLPVENYFAVV-----GEG 417
CF P + +G P + L + GAEV + E +V GEG
Sbjct: 329 EPGFVFQGAFDACFRGPEARVAAASGLLPVVGLVLR-GAEVAVSGEKLLYMVPGERRGEG 387
Query: 418 SAVCLTVVT--DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
A + +T + + +G + ++G+ QN +VEYDL+N R+GF C
Sbjct: 388 GAEAVWCLTFGNSDMAGMSAYVIGHHHQQNVWVEYDLQNGRVGFAPARCD 437
>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 108 bits (270), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 105/386 (27%), Positives = 158/386 (40%), Gaps = 52/386 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + G+PP+ ++D+GS +VW C C C P F P S+S +
Sbjct: 41 GEYFVRIGLGSPPRSQYMVIDSGSDIVWVQCK---PCTQCYHQTDPLFDPADSASFMGVS 97
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C + C + + CN S C Y V YG G T+G ETL
Sbjct: 98 CSSAVCDRVENAG-----CN------SGRC-----RYEVSYGDGSYTKGTLALETLTFGR 141
Query: 206 RIIPNFLVGCSVLSSR----QPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTT 258
++ N +GC S+R AG+ G G G S QL+ + FSYCL+S T
Sbjct: 142 TVVRNVAIGCG-HSNRGMFVGAAGLLGLGGGSMSFMGQLSGQTGNAFSYCLVSRG----T 196
Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
T+ + S+ G + P V NP +YY+ L + VG RV V
Sbjct: 197 NTNGFL----EFGSEAMPVGAAWIPLVRNPRAPS------FYYIRLLGLGVGDTRVPVSE 246
Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
L+ G+GG ++D+GT T +E + F+ Q +N RA G
Sbjct: 247 DVFQLNELGSGGVVMDTGTAVTRFPTVAYEAFRNAFIEQ---TQNLPRASGVSIFD---T 300
Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
C+++ G + P + +F GG +T+P N+ V + C IL
Sbjct: 301 CYNLFGFLSVRVPTVSFYFSGGPILTIPANNFLIPVDDAGTFCFAFAPSPSGLS----IL 356
Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
GN Q + + D N+ +GF +C
Sbjct: 357 GNIQQEGIQISVDEANEFVGFGPNIC 382
>gi|388502484|gb|AFK39308.1| unknown [Medicago truncatula]
Length = 425
Score = 108 bits (269), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 111/374 (29%), Positives = 162/374 (43%), Gaps = 52/374 (13%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + GTPPQ + +DT + W PCT C C+S+ F P+ S++ + + C
Sbjct: 93 YIVRAKIGTPPQTLLLAMDTSNDAAWIPCT---ACDGCASTL---FAPEKSTTFKNVSCA 146
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
P+C + + C +S+N + + YGS + +T+ L +
Sbjct: 147 APECKQVPNPG-----CG----VSSRN-------FNLTYGSSSIAANLVQDTITLATDPV 190
Query: 209 PNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTTRTSS 262
P++ GC + +S P G+ G GRG SL SQ L FSYCL S F + S
Sbjct: 191 PSYTFGCVSKTTGTSAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPS--FKSLNFSGS 248
Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
L L G K+ + YTP + NP R++ YYV L I VG + V + L
Sbjct: 249 LRL--GPVAQPKR---IKYTPLLKNP---RRSSL---YYVNLEAIRVGRKVVDIPPAALA 297
Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
+ GTI DSGT FT + ++ + DEF R L +L G C++V
Sbjct: 298 FNPTTGAGTIFDSGTVFTRLVAPVYVAVRDEF------RRRVGPKLTVTSLGGFDTCYNV 351
Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
P P + F G VTLP +N GS CL + + ++ N Q
Sbjct: 352 P----IVVPTITFIFT-GMNVTLPQDNILIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQ 406
Query: 443 MQNYYVEYDLRNQR 456
QN+ V YD+ N R
Sbjct: 407 QQNHRVLYDVPNSR 420
>gi|222631382|gb|EEE63514.1| hypothetical protein OsJ_18330 [Oryza sativa Japonica Group]
Length = 464
Score = 108 bits (269), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 108/400 (27%), Positives = 162/400 (40%), Gaps = 55/400 (13%)
Query: 105 ILDTGSHLVWFPCTN-------HYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSW--I 155
++DTGS LVW C+ C +P + LS ++R + C + + +
Sbjct: 77 VVDTGSDLVWTQCSTCRLPAVAAAGGGGCFPQNLPYYNFSLSRTARAVPCDDDDGALCGV 136
Query: 156 HHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGC 215
E+ C + + + S YG+G+ G+ ++ P+ GC
Sbjct: 137 APETAGCARGG----GSGDDACVVAAS----YGAGVALGVLGTDAFTFPSSSSVTLAFGC 188
Query: 216 SVLSSRQP------AGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGS 269
+ P +GI G GRG SL SQLN +FSYCL + F DT S L + +G
Sbjct: 189 VSQTRISPGALNGASGIIGLGRGALSLVSQLNATEFSYCLTPY-FRDTVSPSHLFVGDGE 247
Query: 270 SHSDKKTTG--------LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
+ G +T PF NP + + FS +YY+ L + G V +
Sbjct: 248 LAGLRAAAGGGGGGGAPVTTVPFAKNP---KDSPFSTFYYLPLVGLAAGNATVALPAGAF 304
Query: 322 TLDRDG----NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYT---RALGAEALT 374
L GG ++DSG+ FT + L E Q+ + + LG
Sbjct: 305 DLREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGSLVPPPAKLGGALEL 364
Query: 375 GLRPCFDVPGEKTGSFPELKLHFK----GGAEVTLPVENYFAVVGEGSAVCLTVVTDREA 430
+ D + P L L F GG E+ +P E Y+A V E S C+ VV+ A
Sbjct: 365 CVEAGDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEKYWARV-EASTWCMAVVS--SA 421
Query: 431 SGGPSI------ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
SG ++ I+GNF Q+ V YDL N L F+ C
Sbjct: 422 SGNATLPTNETTIIGNFMQQDMRVLYDLANGLLSFQPANC 461
>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
Length = 351
Score = 108 bits (269), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 116/393 (29%), Positives = 169/393 (43%), Gaps = 71/393 (18%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCS----SSKIPSFIPKLSSSS 142
G Y I++ FGTP + + DTGS + W QCK C+ + + P F P LSS+
Sbjct: 14 GNYVITVGFGTPTRTQTVVFDTGSDVNWL------QCKPCAVRCYAQQEPLFDPSLSSTY 67
Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETL 201
R + C P C + +++ C+ Y V YG G T G +T
Sbjct: 68 RNVSCTEPACVGL----------------STRGCSSSTCLYGVFYGDGSSTIGFLAMDTF 111
Query: 202 NL-PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKT-SLPSQLNL---DKFSYCLLSHK 253
L P + NF+ GC ++ + AG+ G GR T SL SQ+ + FSYCL S
Sbjct: 112 MLTPAQKFKNFIFGCGQNNTGLFQGTAGLVGLGRSSTYSLNSQVAPSLGNVFSYCLPS-- 169
Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
T +++ L+ G+ + YT + + V Y++ L I+VGG R
Sbjct: 170 ----TSSATGYLNIGNPQNTPG-----YTAMLTDTRVPT------LYFIDLIGISVGGTR 214
Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
+ + + GTI+DSGT T + P + L + M + YT A A+
Sbjct: 215 LSLSSTVFQ-----SVGTIIDSGTVITRLPPTAYSALKTAVRAAMTQ---YTL---APAV 263
Query: 374 TGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV--TDREAS 431
T L C+D + +P + LHF G +V +P F V S VCL TD
Sbjct: 264 TILDTCYDFSRTTSVVYPVIVLHFA-GLDVRIPATGVFFVF-NSSQVCLAFAGNTDSTMI 321
Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
G I+GN Q V YD +R+GF C
Sbjct: 322 G----IIGNVQQLTMEVTYDNELKRIGFSAGAC 350
>gi|125532792|gb|EAY79357.1| hypothetical protein OsI_34486 [Oryza sativa Indica Group]
Length = 396
Score = 108 bits (269), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 113/418 (27%), Positives = 172/418 (41%), Gaps = 50/418 (11%)
Query: 56 LTRALHIKNP-QTKTTTTTTTTTTTNISSH-SYGGYSISLSFGTPPQIIPFILDTGSHLV 113
LTRA ++ + TT + H S Y ++L+ GTPPQ + I+D G LV
Sbjct: 16 LTRAHELRRGLELADDATTARPGGVTVPVHFSQAFYVVNLTIGTPPQPVSAIIDIGGELV 75
Query: 114 WFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATS 173
W C H C+ C +P F SS+ R C C ESI R +
Sbjct: 76 WTQCAQH--CRRCFKQDLPLFDTNASSTFRPEPCGAAVC-----ESIPTR--------SC 120
Query: 174 KNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQ----PAGIAGF 229
Y G T G ++ + + GC+V S +G G
Sbjct: 121 AGDGGGACGYEASTSFGRTVGRIGTDAVAIGTAATARLAFGCAVASEMDTMWGSSGSVGL 180
Query: 230 GRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPS 289
GR SL +Q+N FSYCL DT ++S+L L S+ G TPFV S
Sbjct: 181 GRTNLSLAAQMNATAFSYCLAP---PDTGKSSALFL-GASAKLAGAGKGAGTTPFVKT-S 235
Query: 290 VAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEP 349
+ S Y + L + +R + + + + GN +V + T T + ++
Sbjct: 236 TPPHSGLSRSYLLRL-------EAIRAGNATIAMPQSGN-TIMVSTATPVTALVDSVYRD 287
Query: 350 LADEFVSQMVKNRNYTRALGAEALTGLRPCFDV---PGEKTGSFPELKLHFKGGAEVTLP 406
L + A+GA + +D+ +G P+L L F+GGAE+T+P
Sbjct: 288 L----------RKAVADAVGAAPVPPPVQNYDLCFPKASASGGAPDLVLAFQGGAEMTVP 337
Query: 407 VENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
V +Y G +A C+ ++ A GG S ILG+ Q N ++ +DL + L F+ C
Sbjct: 338 VSSYLFDAGNDTA-CVAIL-GSPALGGVS-ILGSLQQVNIHLLFDLDKETLSFEPADC 392
>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
Length = 408
Score = 108 bits (269), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 118/431 (27%), Positives = 186/431 (43%), Gaps = 57/431 (13%)
Query: 49 NSLVSS--SLTRALHIKNPQTKTTTTTTTTTTTNISSHSYG-GYSISLSFGTPPQIIPFI 105
+S++SS SL R +++ +T+ N+ + G + ++ S G PP
Sbjct: 17 DSILSSYQSLDRN-NVERRRTRRAAFIXDEIQANMVADDRGQAFLVNFSVGRPPVPQLVG 75
Query: 106 LDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDC 165
+DTGS L+W C C C P F P SS+ L +P C
Sbjct: 76 IDTGSDLLWVQCR---PCADCFRQSTPIFDPSKSSTYVDLSYDSPICP------------ 120
Query: 166 NDEPLATSKNCTQICPSYLVLYGSGLTEGIALS------ETLNLPNRIIPNFLVGCSVLS 219
+ P + Q Y Y G T L+ ET + + + + GC S
Sbjct: 121 -NSPQKKYNHLNQCI--YNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCG-HS 176
Query: 220 SR-----QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDK 274
+R Q +GI G G S+ S+L +FSYC+ FD + L+L +G
Sbjct: 177 NRGRFDGQQSGILGLSAGDQSIVSRLG-SRFSYCI-GDLFDPHYTHNQLVLGDGV----- 229
Query: 275 KTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVD 334
K G + TPF + F+ +YYV L I+VG R+ + + G GG ++D
Sbjct: 230 KMEG-SSTPF---------HTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMD 279
Query: 335 SGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELK 394
SGTT TF+A + F+PL++E + ++V R + + + + G E FPEL
Sbjct: 280 SGTTATFLAKDGFDPLSNE-IQRLV--RGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELA 336
Query: 395 LHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRN 454
HF GA++ L + F V CL V+ + G ++G Q+Y V YDL
Sbjct: 337 FHFAEGADLVLDANSLF-VQKNQDVFCLAVLESNLKNIGS--VIGIMAQQHYNVAYDLIG 393
Query: 455 QRLGFKQQLCK 465
+R+ F++ C+
Sbjct: 394 KRVYFQRTDCE 404
>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
Length = 446
Score = 108 bits (269), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 112/387 (28%), Positives = 159/387 (41%), Gaps = 52/387 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y +++ GTP + I DTGS L W C + C K P F P S+S +
Sbjct: 102 GNYIVTVGLGTPKNDLSLIFDTGSDLTWTQC--QPCVRTCYDQKEPIFNPSKSTSYYNVS 159
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPN 205
C + C + + C ++ NC Y + YG + G E L N
Sbjct: 160 CSSAACGSLSSATGNAGSC------SASNCI-----YGIQYGDQSFSVGFLAKEKFTLTN 208
Query: 206 R-IIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTT 258
+ GC + AG+ G GR K S PSQ FSYCL S +
Sbjct: 209 SDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCLPS----SAS 264
Query: 259 RTSSLILDN-GSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
T L + G S S K +TP ++ + +F Y + + ITVGGQ++ +
Sbjct: 265 YTGHLTFGSAGISRSVK------FTPI---STITDGTSF---YGLNIVAITVGGQKLPIP 312
Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
+ G ++DSGT T + P+ + L F ++M K Y G L
Sbjct: 313 STVFSTP-----GALIDSGTVITRLPPKAYAALRSSFKAKMSK---YPTTSGVSI---LD 361
Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
CFD+ G KT + P++ F GGA V L + F V + S VCL + + S + I
Sbjct: 362 TCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYVF-KISQVCLAFAGNSDDSN--AAI 418
Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
GN Q Q V YD R+GF C
Sbjct: 419 FGNVQQQTLEVVYDGAGGRVGFAPNGC 445
>gi|218184943|gb|EEC67370.1| hypothetical protein OsI_34481 [Oryza sativa Indica Group]
Length = 367
Score = 108 bits (269), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 110/383 (28%), Positives = 154/383 (40%), Gaps = 56/383 (14%)
Query: 92 SLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPK 151
+ + GTPPQ +D LVW C+ QC +C +P F+P SS+ + C
Sbjct: 27 NFTIGTPPQAASAFIDLTGELVWTQCS---QCIHCFKQDLPVFVPNASSTFKPEPCGTDV 83
Query: 152 CSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNF 211
C I T K + +C V G T GI ++T + +
Sbjct: 84 CKSI---------------PTPKCASDVCAFDGVTGLGGHTVGIVATDTFAIGTAAPASL 128
Query: 212 LVGCSVLSSRQ----PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDN 267
GC V S P+G G GR SL +Q+ L +FSYCL H DT + S L L
Sbjct: 129 GFGCVVASDIDTMGGPSGFIGLGRTPWSLVAQMKLTRFSYCLAPH---DTGKNSRLFL-- 183
Query: 268 GSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDG 327
S K G +TPFV + + S YY + L I G +T+ R
Sbjct: 184 --GASAKLAGGGAWTPFVKT---SPNDGMSQYYPIELEEIKAG-------DATITMPRGR 231
Query: 328 NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV--PGE 385
N T A L D V Q K A T + F+V P
Sbjct: 232 N--------TVLVQTAVVRVSLLVDS-VYQEFKKAVMASVGAAPTATPVGEPFEVCFPKA 282
Query: 386 KTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT----DREASGGPSIILGNF 441
P+L F+ GA +T+P NY VG + VCL+V++ + A G + ILG+F
Sbjct: 283 GVSGAPDLVFTFQAGAALTVPPANYLFDVGNDT-VCLSVMSIALLNITALDGLN-ILGSF 340
Query: 442 QMQNYYVEYDLRNQRLGFKQQLC 464
Q +N ++ +DL L F+ C
Sbjct: 341 QQENVHLLFDLDKDMLSFEPADC 363
>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
Length = 515
Score = 108 bits (269), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 110/388 (28%), Positives = 161/388 (41%), Gaps = 59/388 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y +++ GTP + DTGS W C Y K+ F P SS+ +
Sbjct: 177 GNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKL--FDPASSSTYANVS 234
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C P CS + D + +C Y V YG G + G +TL L +
Sbjct: 235 CAAPACSDL-----------DVSGCSGGHCL-----YGVQYGDGSYSIGFFAMDTLTLSS 278
Query: 206 -RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTT 258
+ F GC + + AG+ G GRGKTSLP Q F++CL +
Sbjct: 279 YDAVKGFRFGCGERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPARS----- 333
Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFV--NNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
T + LD G+ TT TP + N P+ +YYVG+ I VGG+ + +
Sbjct: 334 -TGTGYLDFGAGSPPATTT----TPMLTGNGPT---------FYYVGMTGIRVGGRLLPI 379
Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
GTIVDSGT T + P + L + + R Y + A A++ L
Sbjct: 380 APSVFAA-----AGTIVDSGTVITRLPPAAYSSL-RSAFAAAMAARGYRK---AAAVSLL 430
Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
C+D G + P + L F+GGA + + V S VCL + + GG
Sbjct: 431 DTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYTV-SASQVCLAFAGNED--GGDVG 487
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN Q++ + V YD+ + +GF C
Sbjct: 488 IVGNTQLKTFGVAYDIGKKVVGFSPGAC 515
>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 435
Score = 108 bits (269), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 124/486 (25%), Positives = 214/486 (44%), Gaps = 76/486 (15%)
Query: 1 MASYISALCLSFIFFFTLLSIFPSSITSLTFSLSRFHT--------NPSQDSYQNLNSLV 52
M S +++ LS F + + ++ L F+ H NP++ Q + + +
Sbjct: 1 MVSLFTSVLLSLCLFSSHILSNVNAKPKLGFTTDLIHRDSPKSPFYNPAETPSQRIRNAI 60
Query: 53 SSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHL 112
S R H + + + T+I+ G Y ++LS GTPP I + DTGS+L
Sbjct: 61 HRSFNRVSHFTD--LSEMDASLNSPQTDITPCG-GEYLMNLSLGTPPSPIMAVADTGSNL 117
Query: 113 VWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLAT 172
+W C C C + P F PK SS+ + + C + +C+ + +++ C+ E
Sbjct: 118 IWTQCK---PCDDCYTQVDPLFDPKASSTYKDVSCSSSQCTALENQA----SCSTE---- 166
Query: 173 SKNCTQICPSYLVLYGSG-------LTEGIALSETLNLPNRIIPNFLVGC----SVLSSR 221
K C SYLV Y G + + L T N P + + N ++GC +V
Sbjct: 167 DKTC-----SYLVSYADGSYTMGKFAVDTLTLGSTDNRPVQ-LKNIIIGCGQNNAVTFRN 220
Query: 222 QPAGIAGFGRGKTSLPSQL--NLD-KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTG 278
+ +G+ G G G SL QL ++D KFSYCL+ D T++ N +++ G
Sbjct: 221 KSSGVVGLGGGAVSLIKQLGDSIDGKFSYCLVPEN-DQTSKI------NFGTNAVVSGPG 273
Query: 279 LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTT 338
TP V V R+ F YY+ L+ I+VG + ++ T D + G ++DSGTT
Sbjct: 274 TVSTPLV----VKSRDTF---YYLTLKSISVGSKNMQ------TPDSNIKGNMVIDSGTT 320
Query: 339 FTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFK 398
T + + + + + S + +++ +G+ C++ + + P + +HF+
Sbjct: 321 LTLLPVKYYIEIENAVASLINADKSKDERIGSSL------CYNATADL--NIPVITMHFE 372
Query: 399 GGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLG 458
G P ++F V + VCL +G I GN +N+ V YD ++ +
Sbjct: 373 GADVKLYPYNSFFKVTED--LVCLAFGMSFYRNG----IYGNVAQKNFLVGYDTASKTMS 426
Query: 459 FKQQLC 464
FK C
Sbjct: 427 FKPTDC 432
>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 478
Score = 107 bits (268), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 124/412 (30%), Positives = 173/412 (41%), Gaps = 72/412 (17%)
Query: 89 YSISLSFGTP-PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGC 147
Y I LS GTP PQ + LDTGS LVW C C C + P+F S ++ + C
Sbjct: 100 YLIHLSIGTPRPQRVALTLDTGSDLVWTQCA----CHVCFAQPFPTFDALASQTTLAVPC 155
Query: 148 QNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNL--- 203
+P C+ + PL+ C YL Y +T G + +T
Sbjct: 156 SDPICTSGKY-----------PLSGCTFNDNTC-FYLYDYADKSITSGRIVEDTFTFRSP 203
Query: 204 ---------PNRIIPNFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCL 249
+PN GC + S + +GIAGF RG SLPSQL + +FS+C
Sbjct: 204 QGNNGSKAHAGVAVPNVRFGCGQYNKGIFKSNE-SGIAGFSRGPMSLPSQLKVARFSHCF 262
Query: 250 LSHKFDDTTRTSSLIL------DNGSSHSDKKTTGLTYTPFVN-NPSVAERNAFSVYYYV 302
+ RTS + L DN +H+ T + TPF N N S+ YY+
Sbjct: 263 TAIA---DARTSPVFLGGAPGPDNLGAHA---TGPVQSTPFANSNGSL---------YYL 307
Query: 303 GLRRITVGGQR--VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM-- 358
L+ ITVG R + G+GGTI+DSGT + ++ L FV+++
Sbjct: 308 TLKGITVGKTRLPLNALAFAGKGTGSGSGGTIIDSGTGIRTLPGPMYRSLRAAFVARVKL 367
Query: 359 -VKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVV--- 414
V N + A R P + P++ LH GA+ LP E+Y +
Sbjct: 368 PVANESAADAESTLCFEAARSASLPPEAPAPALPKVVLHV-AGADWDLPRESYVLDLLED 426
Query: 415 --GEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
G GS +CL + A I+GNFQ QN +V YDL +L F C
Sbjct: 427 EDGSGSGLCLVM---NSAGDSDLTIIGNFQQQNMHVAYDLEKNKLVFVPARC 475
>gi|359806832|ref|NP_001241567.1| uncharacterized protein LOC100819698 precursor [Glycine max]
gi|255638149|gb|ACU19388.1| unknown [Glycine max]
Length = 437
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 96/382 (25%), Positives = 156/382 (40%), Gaps = 49/382 (12%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + GTP Q + +DT + W PCT C CS++ F P S++ + +GC
Sbjct: 98 YIVKAKIGTPAQTLLLAMDTSNDASWVPCT---ACVGCSTTT--PFAPAKSTTFKKVGCG 152
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
+C + + + C ++ YG+ + +T+ L +
Sbjct: 153 ASQCKQVRNPTCDGSAC----------------AFNFTYGTSSVAASLVQDTVTLATDPV 196
Query: 209 PNFLVGC------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSS 262
P + GC S + + G+ + +L FSYCL S K T S
Sbjct: 197 PAYAFGCIQKVTGSSVPPQGLLGLGRGPLSLLAQTQKLYQSTFSYCLPSFK----TLNFS 252
Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
L G K+ + +TP + NP R++ YYV L I VG + V + + L
Sbjct: 253 GSLRLGPVAQPKR---IKFTPLLKNP---RRSSL---YYVNLVAIRVGRRIVDIPPEALA 303
Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
+ + GT+ DSGT FT + + + +EF ++ ++ T +L G C+
Sbjct: 304 FNANTGAGTVFDSGTVFTRLVEPAYNAVRNEFRRRIAVHKKLT----VTSLGGFDTCYTA 359
Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
P P + F G VTLP +N GS CL + + ++ N Q
Sbjct: 360 PIVA----PTITFMF-SGMNVTLPPDNILIHSTAGSVTCLAMAPAPDNVNSVLNVIANMQ 414
Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
QN+ V +D+ N RLG ++LC
Sbjct: 415 QQNHRVLFDVPNSRLGVARELC 436
>gi|21717157|gb|AAM76350.1|AC074196_8 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433294|gb|AAP54832.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 396
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 114/418 (27%), Positives = 172/418 (41%), Gaps = 50/418 (11%)
Query: 56 LTRALHIKNP-QTKTTTTTTTTTTTNISSH-SYGGYSISLSFGTPPQIIPFILDTGSHLV 113
LTRA ++ + TT + H S Y ++L+ GTPPQ + I+D G LV
Sbjct: 16 LTRAHELRRGLELADDATTARPGGVTVPVHFSQAFYVVNLTIGTPPQPVSAIIDIGGELV 75
Query: 114 WFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATS 173
W C H C+ C +P F SS+ R C C ESI R +
Sbjct: 76 WTQCAQH--CRRCFKQDLPLFDTNASSTFRPEPCGAAVC-----ESIPTR--------SC 120
Query: 174 KNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQ----PAGIAGF 229
Y G T G ++ + + GC+V S +G G
Sbjct: 121 AGDGGGACGYEASTSFGRTVGRIGTDAVAIGTAATARLAFGCAVASEMDTMWGSSGSVGL 180
Query: 230 GRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPS 289
GR SL +Q+N FSYCL DT ++S+L L S+ G TPFV S
Sbjct: 181 GRTNLSLAAQMNATAFSYCLAPP---DTGKSSALFL-GASAKLAGAGKGAGTTPFVKT-S 235
Query: 290 VAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEP 349
+ S Y + L + +R + + + + GN T V + T T + ++
Sbjct: 236 TPPNSGLSRSYLLRL-------EAIRAGNATIAMPQSGNTIT-VSTATPVTALVDSVYRD 287
Query: 350 LADEFVSQMVKNRNYTRALGAEALTGLRPCFDV---PGEKTGSFPELKLHFKGGAEVTLP 406
L + A+GA + +D+ +G P+L L F+GGAE+T+P
Sbjct: 288 L----------RKAVADAVGAAPVPPPVQNYDLCFPKASASGGAPDLVLAFQGGAEMTVP 337
Query: 407 VENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
V +Y G +A C+ ++ A GG S ILG+ Q N ++ +DL + L F+ C
Sbjct: 338 VSSYLFDAGNDTA-CVAIL-GSPALGGVS-ILGSLQQVNIHLLFDLDKETLSFEPADC 392
>gi|326499093|dbj|BAK06037.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 471
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 129/464 (27%), Positives = 188/464 (40%), Gaps = 85/464 (18%)
Query: 34 SRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISL 93
S FH +PS + + S RA + + + + ++S + Y +++
Sbjct: 47 SPFH-DPSLTAPARVLEAARRSTVRAAALSRSYVRVDAPSADGFVSELTSTPFE-YLMAV 104
Query: 94 SFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS-------FIPKLSSSSRLLG 146
+ GTPP + I DTGS L+W C+ ++++ F P S++ RL+
Sbjct: 105 NIGTPPTRMVAIADTGSDLIWLNCSYGGDGPGLAAARDADAQPPGVQFDPSKSTTFRLVD 164
Query: 147 CQNPKCSWIHHESI----QCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETL 201
C + CS + S +CR Y YG G T G+ +ET
Sbjct: 165 CDSVACSELPEASCGADSKCR-------------------YSYSYGDGSHTSGVLSTETF 205
Query: 202 NLPNRI----------IPNFLVGCSV--LSSRQPAGIAGFGRGKTSLPSQLNLD-----K 244
+ + N GCS + S G+ G G G SL SQL D +
Sbjct: 206 TFADAPGARGDGTTTRVANVNFGCSTTFVGSSVGDGLVGLGGGDLSLVSQLGADTSLGRR 265
Query: 245 FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGL 304
FSYCL+ + + + SS + N + G TP + PS + YY V L
Sbjct: 266 FSYCLVPY----SVKASSAL--NFGPRAAVTDPGAVTTPLI--PSQVK-----AYYIVEL 312
Query: 305 RRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNY 364
R + VG + T + IVDSGTT TF+ L +PL E ++
Sbjct: 313 RSVKVGNK---------TFEAPDRSPLIVDSGTTLTFLPEALVDPLVKELTGRI----KL 359
Query: 365 TRALGAEALTGLRPCFDVPGEKTGS----FPELKLHFKGGAEVTLPVENYFAVVGEGSAV 420
A E L L CFDV G + G P++ + GGA VTL EN F V EG+ +
Sbjct: 360 PPAQSPERLLPL--CFDVSGVREGQVAAMIPDVTVGLGGGAAVTLKAENTFVEVQEGT-L 416
Query: 421 CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
CL V E P+ I+GN QN +V YDL + F C
Sbjct: 417 CLAVSAMSEQF--PASIIGNIAQQNMHVGYDLDKGTVTFAPAAC 458
>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
Length = 519
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 110/388 (28%), Positives = 161/388 (41%), Gaps = 59/388 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y +++ GTP + DTGS W C Y K+ F P SS+ +
Sbjct: 181 GNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKL--FDPASSSTYANVS 238
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C P CS + D + +C Y V YG G + G +TL L +
Sbjct: 239 CAAPACSDL-----------DVSGCSGGHCL-----YGVQYGDGSYSIGFFAMDTLTLSS 282
Query: 206 -RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTT 258
+ F GC + + AG+ G GRGKTSLP Q F++CL +
Sbjct: 283 YDAVKGFRFGCGERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPARS----- 337
Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFV--NNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
T + LD G+ TT TP + N P+ +YYVG+ I VGG+ + +
Sbjct: 338 -TGTGYLDFGAGSPPATTT----TPMLTGNGPT---------FYYVGMTGIRVGGRLLPI 383
Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
GTIVDSGT T + P + L + + R Y + A A++ L
Sbjct: 384 APSVFAA-----AGTIVDSGTVITRLPPAAYSSL-RSAFAAAMAARGYRK---AAAVSLL 434
Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
C+D G + P + L F+GGA + + V S VCL + + GG
Sbjct: 435 DTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYTV-SASQVCLAFAGNED--GGDVG 491
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN Q++ + V YD+ + +GF C
Sbjct: 492 IVGNTQLKTFGVAYDIGKKVVGFSPGAC 519
>gi|125532788|gb|EAY79353.1| hypothetical protein OsI_34482 [Oryza sativa Indica Group]
Length = 394
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 107/385 (27%), Positives = 161/385 (41%), Gaps = 54/385 (14%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + + GTPPQ ++D LVW C QC C P F P S++ R C
Sbjct: 51 YVANFTIGTPPQPASAVIDLAGELVWTQCK---QCSRCFEQDTPLFDPTASNTYRAEPCG 107
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
P C I +S +NC+ +Y +G T G ++T +
Sbjct: 108 TPLCESIPSDS--------------RNCSGNVCAYQASTNAGDTGGKVGTDTFAV-GTAK 152
Query: 209 PNFLVGCSVLSSRQ----PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLI 264
+ GC V S P+GI G GR SL +Q + FSYCL H D + S+L
Sbjct: 153 ASLAFGCVVASDIDTMGGPSGIVGLGRTPWSLVTQTGVAAFSYCLAPH---DAGKNSALF 209
Query: 265 LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLD 324
L GSS TPFVN N S YY V L + G + + T+
Sbjct: 210 L--GSSAKLAGGGKAASTPFVN--ISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGSTV- 264
Query: 325 RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL-TGLRP---CF 380
++D+ + +F+ + Q VK + T A+GA + T + P CF
Sbjct: 265 -------LLDTFSPISFLVDGAY---------QAVK-KAVTVAVGAPPMATPVEPFDLCF 307
Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI-ILG 439
G +G+ P+L F+GGA +T+ NY G+ VCL +++ + + +LG
Sbjct: 308 PKSG-ASGAAPDLVFTFRGGAAMTVAASNYLLDYKNGT-VCLAMLSSARLNSTTELSLLG 365
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
+ Q +N + +DL + L F+ C
Sbjct: 366 SLQQENIHFLFDLDKETLSFEPADC 390
>gi|125558627|gb|EAZ04163.1| hypothetical protein OsI_26305 [Oryza sativa Indica Group]
Length = 404
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 118/399 (29%), Positives = 153/399 (38%), Gaps = 98/399 (24%)
Query: 84 HSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSR 143
+S G Y+++LS GTPP + DTGS L+W C C C++ P F P SS+
Sbjct: 85 NSAGAYNMNLSIGTPPVTFSVLADTGSSLIWTQCA---PCTECAARPAPPFQPASSSTFS 141
Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL 203
L C + C ++ R CN + C P YG G T G +ETL++
Sbjct: 142 KLPCASSLCQFLTSPY---RTCN------ATGCVYYYP-----YGMGFTAGYLATETLHV 187
Query: 204 PNRIIPNFLVGCSVLS--SRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
P GCS + +GI G GR SL SQ+ + +FSYCL S+
Sbjct: 188 GGASFPGVTFGCSTENGVGNSSSGIVGLGRSPLSLVSQVGVARFSYCLRSNA-------- 239
Query: 262 SLILDNGSS----HSDKKTTG--LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
D G S S K TG + TP + NP + S YYYV L ITVG +
Sbjct: 240 ----DAGDSPILFGSLAKVTGGNVQSTPLLENPEMPS----SSYYYVNLTGITVGATDLP 291
Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
+ LT +GT F G
Sbjct: 292 MAMANLT----------TVNGTRF-----------------------------------G 306
Query: 376 LRPCFD---VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVV-----GEGSAVCLTVVTD 427
CFD G P L L F GGAE + +YF VV G + CL V+
Sbjct: 307 FDLCFDATAAGGGGGVPVPTLVLRFAGGAEYAVRRRSYFGVVEVDSQGRAAVECLLVL-- 364
Query: 428 REASGGPSI-ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
AS SI I+GN + +V YDL F C
Sbjct: 365 -PASEKLSISIIGNVMQMDLHVLYDLDGGMFSFAPADCA 402
>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
Length = 412
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 101/381 (26%), Positives = 151/381 (39%), Gaps = 68/381 (17%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + ++ GTPP + +LDTGS L+W C C+ C P + P S++ + C+
Sbjct: 92 YLVDIAIGTPPLPLTAVLDTGSDLIWTQC--DAPCRRCFPQPAPLYAPARSATYANVSCR 149
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNL-PNR 206
+P C + +C + C +Y YG G T+G+ +ET L +
Sbjct: 150 SPMCQALQSPWSRCSPPD-------TGC-----AYYFSYGDGTSTDGVLATETFTLGSDT 197
Query: 207 IIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSL 263
+ GC ++ S+ +G+ G GRG SL SQL + +
Sbjct: 198 AVRGVAFGCGTENLGSTDNSSGLVGMGRGPLSLVSQLGVTRPR----------------- 240
Query: 264 ILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTL 323
S + P +P L ITVG + + L
Sbjct: 241 -----RSCRARAAARGGGAPTTTSP---------------LEGITVGDTLLPIDPAVFRL 280
Query: 324 DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP 383
G+GG I+DSGTTFT + F LA S++ L + A GL CF
Sbjct: 281 TPMGDGGVIIDSGTTFTALEERAFVALARALASRV------RLPLASGAHLGLSLCFAAA 334
Query: 384 GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQM 443
+ P L LHF GA++ L E+Y CL +V+ R S +LG+ Q
Sbjct: 335 SPEAVEVPRLVLHFD-GADMELRRESYVVEDRSAGVACLGMVSARGMS-----VLGSMQQ 388
Query: 444 QNYYVEYDLRNQRLGFKQQLC 464
QN ++ YDL L F+ C
Sbjct: 389 QNTHILYDLERGILSFEPAKC 409
>gi|115483166|ref|NP_001065176.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|21717159|gb|AAM76352.1|AC074196_10 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433285|gb|AAP54823.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639785|dbj|BAF27090.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|215692411|dbj|BAG87831.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 107/385 (27%), Positives = 161/385 (41%), Gaps = 54/385 (14%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + + GTPPQ ++D LVW C QC C P F P S++ R C
Sbjct: 51 YVANFTIGTPPQPASAVIDLAGELVWTQCK---QCGRCFEQGTPLFDPTASNTYRAEPCG 107
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
P C I + +NC+ +Y +G T G ++T +
Sbjct: 108 TPLCESIPSDV--------------RNCSGNVCAYEASTNAGDTGGKVGTDTFAV-GTAK 152
Query: 209 PNFLVGCSVLSSRQ----PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLI 264
+ GC V S P+GI G GR SL +Q + FSYCL H D + S+L
Sbjct: 153 ASLAFGCVVASDIDTMGGPSGIVGLGRTPWSLVTQTGVAAFSYCLAPH---DAGKNSALF 209
Query: 265 LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLD 324
L GSS TPFVN N S YY V L + G + + T+
Sbjct: 210 L--GSSAKLAGGGKAASTPFVN--ISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGSTV- 264
Query: 325 RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL-TGLRP---CF 380
++D+ + +F+ + Q VK + T A+GA + T + P CF
Sbjct: 265 -------LLDTFSPISFLVDGAY---------QAVK-KAVTVAVGAPPMATPVEPFDLCF 307
Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI-ILG 439
G +G+ P+L F+GGA +T+P NY G+ VCL +++ + + +LG
Sbjct: 308 PKSG-ASGAAPDLVFTFRGGAAMTVPATNYLLDYKNGT-VCLAMLSSARLNSTTELSLLG 365
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
+ Q +N + +DL + L F+ C
Sbjct: 366 SLQQENIHFLFDLDKETLSFEPADC 390
>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 461
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 111/384 (28%), Positives = 165/384 (42%), Gaps = 58/384 (15%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y I++ G+P ++DTGS + W C C C S P F P SS+ C
Sbjct: 128 YLITVGLGSPATSQTMLIDTGSDVSWVQCK---PCSQCHSQADPLFDPSSSSTYSPFSCG 184
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNLPNRI 207
+ C+ + E C ++S C Y+V YG G T G S+TL L +
Sbjct: 185 SAACAQLGQEGNGC--------SSSSQC-----QYIVTYGDGSSTTGTYSSDTLALGSSA 231
Query: 208 IPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQL--NLDK-FSYCLLSHKFDDTTRTS 261
+ +F GCS + S Q G+ G G G SL SQ L + FSYCL T +S
Sbjct: 232 VKSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCL------PPTPSS 285
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
S L + G + FV P + + +Y V L+ I VGG+++ +
Sbjct: 286 SGFL------TLGAAGGSGTSGFVKTP-MLRSSQVPTFYGVRLQAIRVGGRQLSIPASVF 338
Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
+ GT++DSGT T + P + L+ F + M + Y A + L CFD
Sbjct: 339 S------AGTVMDSGTVITRLPPTAYSALSSAFKAGM---KQYPPAQPSGIL---DTCFD 386
Query: 382 VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI-ILGN 440
G+ + S P + L F GGA V+L + CL + + S S+ I+GN
Sbjct: 387 FSGQSSVSIPSVALVFSGGAVVSLDASGIIL------SNCLAFAANSDDS---SLGIIGN 437
Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
Q + + V YD+ +GF+ C
Sbjct: 438 VQQRTFEVLYDVGRGVVGFRAGAC 461
>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 463
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 110/386 (28%), Positives = 158/386 (40%), Gaps = 63/386 (16%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y +S+ G+P + + I DTGS L W C S +F P S+S +
Sbjct: 132 GNYIVSIGLGSPKKDLMLIFDTGSDLTWARC-----------SAAETFDPTKSTSYANVS 180
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C P CS + + C + C Y + YG G + G E L + +
Sbjct: 181 CSTPLCSSVISATGNPSRC------AASTCV-----YGIQYGDGSYSIGFLGKERLTIGS 229
Query: 206 -RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLDK---FSYCLLSHKFDDTT 258
I NF GC + AG+ G GR K S+ SQ FSYCL S
Sbjct: 230 TDIFNNFYFGCGQDVDGLFGKAAGLLGLGRDKLSVVSQTAPKYNQLFSYCLPS------- 282
Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
+S+ L GSS S +TP + P S +Y + L ITVGGQ++ +
Sbjct: 283 SSSTGFLSFGSSQSKSA----KFTPLSSGP--------SSFYNLDLTGITVGGQKLAIPL 330
Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
+ GTI+DSGT T + P + L F M + +G + L+ L
Sbjct: 331 SVFS-----TAGTIIDSGTVVTRLPPAAYSALRSAFRKAMA-----SYPMG-KPLSILDT 379
Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
C+D KT P++ + F GG +V + F G VCL + A + I
Sbjct: 380 CYDFSKYKTIKVPKIVISFSGGVDVDVDQAGIFVANGL-KQVCLAFAGNTGAR--DTAIF 436
Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
GN Q +N+ V YD+ ++GF C
Sbjct: 437 GNTQQRNFEVVYDVSGGKVGFAPASC 462
>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 430
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 119/396 (30%), Positives = 167/396 (42%), Gaps = 70/396 (17%)
Query: 86 YGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLL 145
+G Y + S GTP I DTGS L W CT CK C + P F P SS+ +
Sbjct: 85 HGEYLMRFSLGTPSVERLAIFDTGSDLSWLQCT---PCKTCYPQEAPLFDPTQSSTYVDV 141
Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEG------IALS 198
C++ C+ +C +SK C YL YG+ T G I+ S
Sbjct: 142 PCESQPCTLFPQNQREC--------GSSKQCI-----YLHQYGTDSFTIGRLGYDTISFS 188
Query: 199 ET-LNLPNRIIPNFLVGCSVLS------SRQPAGIAGFGRGKTSLPSQLNLD---KFSYC 248
T + P + GC+ S S + G G G G SL SQL KFSYC
Sbjct: 189 STGMGQGGATFPKSVFGCAFYSNFTFKISTKANGFVGLGPGPLSLASQLGDQIGHKFSYC 248
Query: 249 LLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
++ + TS+ L GS T + TPF+ NPS + YY + L IT
Sbjct: 249 MVPF-----SSTSTGKLKFGSM---APTNEVVSTPFMINPS------YPSYYVLNLEGIT 294
Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
VG ++V L G I+DS T + ++ +F+S + + N A
Sbjct: 295 VGQKKV--------LTGQIGGNIIIDSVPILTHLEQGIYT----DFISSVKEAINVEVA- 341
Query: 369 GAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR 428
+A T C P +FPE HF GA+V L +N F + + + VC+TVV +
Sbjct: 342 -EDAPTPFEYCVRNPTNL--NFPEFVFHFT-GADVVLGPKNMFIAL-DNNLVCMTVVPSK 396
Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
S I GN+ N+ VEYDL +++ F C
Sbjct: 397 GIS-----IFGNWAQVNFQVEYDLGEKKVSFAPTNC 427
>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 133/494 (26%), Positives = 205/494 (41%), Gaps = 95/494 (19%)
Query: 1 MASYISALCLSFIFFFTLLSIFPSSITSLT-----FSLSRFHTNP--------SQDSYQN 47
MA+ IS +FF +L + S T++ F+ S FH + S Y
Sbjct: 1 MAATIS------LFFHLILFLISFSQTTIINGDNGFTTSLFHRDSLLSPLEFSSLSHYDR 54
Query: 48 LNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILD 107
L + SL+R+ + N + T+ ++I S G Y +S+S GTPP I D
Sbjct: 55 LANAFRRSLSRSAALLN---RAATSGAVGLQSSIGPGS-GEYLMSVSIGTPPVDYLGIAD 110
Query: 108 TGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCND 167
TGS L W C C C P F P S+S + C C + +D
Sbjct: 111 TGSDLTWAQC---LPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTCHAV----------DD 157
Query: 168 EPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQ---P 223
C Y YG ++G E + + + + + ++GC SS
Sbjct: 158 GHCGVQGVC-----DYSYTYGDRTYSKGDLGFEKITIGSSSVKS-VIGCGHASSGGFGFA 211
Query: 224 AGIAGFGRGKTSLPSQLNLD-----KFSYC---LLSHKFDDTTRTSSLILDNGSSHSDKK 275
+G+ G G G+ SL SQ++ +FSYC LLSH + ++
Sbjct: 212 SGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFGENAVVSG-------- 263
Query: 276 TTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDS 335
G+ TP ++ +V YYY+ L I++G +R + K G I+DS
Sbjct: 264 -PGVVSTPLISKNTV-------TYYYITLEAISIGNERHMAFAK--------QGNVIIDS 307
Query: 336 GTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD--VPGEKTGSFPEL 393
GTT T + EL++ + + ++VK + G+ L CFD + + P +
Sbjct: 308 GTTLTILPKELYDGVVSSLL-KVVKAKRVKDPHGS-----LDLCFDDGINAAASLGIPVI 361
Query: 394 KLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI---ILGNFQMQNYYVEY 450
HF GGA V L N F V + + CLT+ + AS P+ I+GN N+ + Y
Sbjct: 362 TAHFSGGANVNLLPINTFRKVAD-NVNCLTL---KAAS--PTTEFGIIGNLAQANFLIGY 415
Query: 451 DLRNQRLGFKQQLC 464
DL +RL FK +C
Sbjct: 416 DLEAKRLSFKPTVC 429
>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 415
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 134/482 (27%), Positives = 197/482 (40%), Gaps = 93/482 (19%)
Query: 5 ISALCLSFIFFFTLLSIFPSSITSLT--FSLSRFHTN--------PSQDSYQNLNSLVSS 54
+SA + FFT+ S +L F+L H + P+Q+ Y+ + + V
Sbjct: 1 MSAHSFLTLLFFTIFCFIISLSHALNNGFTLELIHRDSSKSPFYQPTQNKYERIANAVRR 60
Query: 55 SLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVW 114
S+ R H K + T+T +T N G Y +S S GTPP + +DTGS LVW
Sbjct: 61 SINRVNHFY----KYSLTSTPQSTVNSDK---GEYLMSYSIGTPPFKVFGFVDTGSDLVW 113
Query: 115 FPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSK 174
C CK C P F P LSSS + + C + C + S R
Sbjct: 114 LQCE---PCKQCYPQITPIFDPSLSSSYQNIPCLSDTCHSMRTTSCDVR----------- 159
Query: 175 NCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQ----PAGIAGFG 230
YL + E + L T + P ++GC ++ +GI G G
Sbjct: 160 -------GYLSV------ETLTLDSTTGY-SVSFPKTMIGCGYRNTGTFHGPSSGIVGLG 205
Query: 231 RGKTSLPSQLNLD---KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNN 287
G SLPSQL KFSYCL L N +S + + Y
Sbjct: 206 SGPMSLPSQLGTSIGGKFSYCL------------GPWLPNSTSKLNFGDAAIVYGDGAMT 253
Query: 288 PSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTI-VDSGTTFTFMAPEL 346
+ +++A S YY+ L +VG + + GN G I +DSGTTFTF+ ++
Sbjct: 254 TPIVKKDAQSG-YYLTLEAFSVGNKLIEFGGP----TYGGNEGNILIDSGTTFTFLPYDV 308
Query: 347 ---FEPLADEFVS-QMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAE 402
FE E+++ + V++ N T L C++V + P + HFK GA+
Sbjct: 309 YYRFESAVAEYINLEHVEDPNGTFKL----------CYNVAYHGFEA-PLITAHFK-GAD 356
Query: 403 VTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQ 462
+ L + F V +G A CL + + A I GN QN V Y+L + FK
Sbjct: 357 IKLYYISTFIKVSDGIA-CLAFIPSQTA------IFGNVAQQNLLVGYNLVQNTVTFKPV 409
Query: 463 LC 464
C
Sbjct: 410 DC 411
>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
Length = 437
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 114/395 (28%), Positives = 161/395 (40%), Gaps = 65/395 (16%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + GTPP DTGS L+W C+ C C P F P SS+
Sbjct: 88 GEYLMRFYIGTPPVERLATADTGSDLIWVQCS---PCASCFPQSTPLFQPLKSSTFMPTT 144
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS--GLTEGIALSETLNLP 204
C++ C+ + E C S C Y YG +EG+ +ETL
Sbjct: 145 CRSQPCTLLLPEQKGC--------GKSGECI-----YTYKYGDQYSFSEGLLSTETLRFD 191
Query: 205 NR------IIPNFLVGC------SVLSSRQPAGIAGFGRGKTSLPSQLNLD---KFSYCL 249
++ PN GC +V S + GI G G G SL SQ+ KFSYCL
Sbjct: 192 SQGGVQTVAFPNSFFGCGLYNNITVFPSYKLTGIMGLGAGPLSLVSQIGDQIGHKFSYCL 251
Query: 250 LSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
L +T TS L N S + + G+ TP + P + YY++ L +TV
Sbjct: 252 LPL---GSTSTSKLKFGNESIITGE---GVVSTPMIIKPWLP------TYYFLNLEAVTV 299
Query: 310 GGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
+ V T DGN I+DSGT T++ + A + L
Sbjct: 300 AQKTVP------TGSTDGN--VIIDSGTLLTYLGESFYYNFAASL------QESLAVELV 345
Query: 370 AEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDRE 429
+ L+ L CF P FPE+ F GA V+L N F + + + VCL ++
Sbjct: 346 QDVLSPLPFCF--PYRDNFVFPEIAFQFT-GARVSLKPANLFVMTEDRNTVCL-MIAPSS 401
Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
SG I G+F ++ VEYDL +++ F+ C
Sbjct: 402 VSG--ISIFGSFSQIDFQVEYDLEGKKVSFQPTDC 434
>gi|326490597|dbj|BAJ89966.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 450
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 106/405 (26%), Positives = 166/405 (40%), Gaps = 59/405 (14%)
Query: 90 SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQN 149
++S+ GTPPQ + +LDTGS L + C S S F S + + C +
Sbjct: 66 TVSVVVGTPPQNVTMVLDTGSEL------SGLLCNGSSLSPPAPFNASASLTYSAVDCSS 119
Query: 150 PKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIP 209
P C W + R D P +TS C + + +G +++T L + +P
Sbjct: 120 PACVW-RGRDLPVRPFCDAPPSTS------CRVSISYADASSADGHLVADTFILGTQAVP 172
Query: 210 NFLVGC-------------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
L GC + S G+ G RG S +Q +F+YC+ +
Sbjct: 173 A-LFGCITSYSSSTAINSSATDPSEAATGLLGMNRGSLSFVTQTATLRFAYCIAPGQGPG 231
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAF--SVYYYVGLRRITVGGQRV 314
L YTP + +++ + V Y V L I VG +
Sbjct: 232 ILLLGG---------DGGAAPPLNYTPLIE---ISQPLPYFDRVAYSVQLEGIRVGSALL 279
Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
++ LT D G G T+VDSGT FTF+ + + L EF++Q R+ LG
Sbjct: 280 QIPKSVLTPDHTGAGQTMVDSGTQFTFLLADAYAALKAEFLNQA---RSLLAPLGEPGFV 336
Query: 375 ---GLRPCFDVPGEKTGS----FPELKLHFKGGAEVTLPVEN-YFAVVGE------GSAV 420
CF P E+ + PE+ L + GAEV + E ++V GE AV
Sbjct: 337 FQGAFDACFRGPEERVSAASRLLPEVGLVLR-GAEVAVAGEKLLYSVPGERRGEEGAEAV 395
Query: 421 CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
+ + +G + ++G+ Q+ +VEYDL+N R+GF C+
Sbjct: 396 WCLTFGNSDMAGMSAYVIGHHHQQDVWVEYDLQNGRVGFAPARCE 440
>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags:
Precursor
gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 447
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 113/409 (27%), Positives = 166/409 (40%), Gaps = 79/409 (19%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G + +S++ GTPP + I DTGS L W C C+ C P F K SS+ +
Sbjct: 83 GEFFMSITIGTPPIKVFAIADTGSDLTWVQCK---PCQQCYKENGPIFDKKKSSTYKSEP 139
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPN 205
C + C + C + N+ IC Y YG ++G +ET+++ +
Sbjct: 140 CDSRNCQALSSTERGCDESNN-----------IC-KYRYSYGDQSFSKGDVATETVSIDS 187
Query: 206 R-----IIPNFLVGCSVLSSRQPAGIAGFGRGKT----------------SLPSQLN--- 241
P + GC G+ G T SL SQL
Sbjct: 188 ASGSPVSFPGTVFGC------------GYNNGGTFDETGSGIIGLGGGHLSLISQLGSSI 235
Query: 242 LDKFSYCLLSHKFDDTTRTSSLILDNGSSHSD-KKTTGLTYTPFVNNPSVAERNAFSVYY 300
KFSYC LSHK T TS + L S S K +G+ TP V+ + YY
Sbjct: 236 SKKFSYC-LSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPL-------TYY 287
Query: 301 YVGLRRITVGGQRVRVWHKYLTLDRDG-----NGGTIVDSGTTFTFMAPELFEPLADEFV 355
Y+ L I+VG +++ + DG +G I+DSGTT T + F+ +
Sbjct: 288 YLTLEAISVGKKKIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVE 347
Query: 356 SQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVG 415
+ + + G L CF + G PE+ +HF GA+V L N F +
Sbjct: 348 ESVTGAKRVSDPQGL-----LSHCFKSGSAEIG-LPEITVHFT-GADVRLSPINAFVKLS 400
Query: 416 EGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
E VCL++V E + I GNF ++ V YDL + + F+ C
Sbjct: 401 E-DMVCLSMVPTTEVA-----IYGNFAQMDFLVGYDLETRTVSFQHMDC 443
>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
Length = 462
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 114/422 (27%), Positives = 173/422 (40%), Gaps = 61/422 (14%)
Query: 58 RALHIKNPQTKTTTTTTTTTTTNISSHSYG--GYSISLSFGTPPQIIPFILDTGSHLVW- 114
R + I P T T + + S G + +++ FGTP Q + DTGS + W
Sbjct: 87 RGIPISYPPTIPPAEAPAVTIPDSTGTSLGTLEFVVTVGFGTPAQTYTLMFDTGSDVSWI 146
Query: 115 --FPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLAT 172
PC+ H C P F P S++ + C +P+C+ +
Sbjct: 147 QCLPCSGH-----CYKQHDPIFDPTKSATYSAVPCGHPQCAAAGGKC------------- 188
Query: 173 SKNCTQICPSYLVLYGSGL-TEGIALSETLNLPN-RIIPNFLVGC---SVLSSRQPAGIA 227
S N T + Y V YG G T G+ ETL+L + R +P F GC ++ G+
Sbjct: 189 SSNGTCL---YKVQYGDGSSTAGVLSHETLSLTSARALPGFAFGCGETNLGDFGDVDGLI 245
Query: 228 GFGRGKTSLPSQLNLDKFS---YCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPF 284
G GRG+ SL SQ + YCL S+ TS L G++ + G+ YT
Sbjct: 246 GLGRGQLSLSSQAAASFGAAFSYCLPSYN------TSHGYLTIGTTTPASGSDGVRYTAM 299
Query: 285 VNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAP 344
+ ++ + +Y+V L I VGG + V T D GT++DSGT T++ P
Sbjct: 300 I------QKQDYPSFYFVDLVSIVVGGFVLPVPPILFTRD-----GTLLDSGTVLTYLPP 348
Query: 345 ELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVT 404
E + L D F M + + A A C+D G+ P + F G+
Sbjct: 349 EAYTALRDRFKFTMTQYKP------APAYDPFDTCYDFAGQNAIFMPLVSFKFSDGSSFD 402
Query: 405 LPVENYFAVVGEGSAV--CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQ 462
L + + CL V S P I+GN Q +N + YD+ +++GF
Sbjct: 403 LSPFGVLIFPDDTAPATGCLAFVP--RPSTMPFTIVGNTQQRNTEMIYDVAAEKIGFVSG 460
Query: 463 LC 464
C
Sbjct: 461 SC 462
>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
Length = 516
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 110/388 (28%), Positives = 160/388 (41%), Gaps = 59/388 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y +++ GTP + DTGS W C Y K+ F P SS+ +
Sbjct: 178 GNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKL--FDPASSSTYANVS 235
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C P CS + D + +C Y V YG G + G +TL L +
Sbjct: 236 CAAPACSDL-----------DVSGCSGGHCL-----YGVQYGDGSYSIGFFAMDTLTLSS 279
Query: 206 -RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTT 258
+ F GC + + AG+ G GRGKTSLP Q F++CL
Sbjct: 280 YDAVKGFRFGCGERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPPRS----- 334
Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFV--NNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
T + LD G+ TT TP + N P+ +YYVG+ I VGG+ + +
Sbjct: 335 -TGTGYLDFGAGSPPATTT----TPMLTGNGPT---------FYYVGMTGIRVGGRLLPI 380
Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
GTIVDSGT T + P + L + + R Y + A A++ L
Sbjct: 381 APSVFAA-----AGTIVDSGTVITRLPPAAYSSL-RSAFAAAMAARGYRK---AAAVSLL 431
Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
C+D G + P + L F+GGA + + V S VCL + + GG
Sbjct: 432 DTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYTV-SASQVCLAFAGNED--GGDVG 488
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN Q++ + V YD+ + +GF C
Sbjct: 489 IVGNTQLKTFGVAYDIGKKVVGFSPGAC 516
>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
Length = 523
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 109/386 (28%), Positives = 158/386 (40%), Gaps = 61/386 (15%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y +S+ GTP + + + DTGS L W C C C P F P S++ + C
Sbjct: 188 YIVSVGLGTPRRDLLVVFDTGSDLSWVQCK---PCNNCYKQHDPLFDPSQSTTYSAVPCG 244
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNL--PN 205
++C D +S C Y V+YG T+G +TL L +
Sbjct: 245 -------------AQECLDSGTCSSGKC-----RYEVVYGDMSQTDGNLARDTLTLGPSS 286
Query: 206 RIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTR 259
+ F+ GC + + G+ G GR + SL SQ FSYCL S +
Sbjct: 287 DQLQGFVFGCGDDDTGLFGRADGLFGLGRDRVSLASQAAARYGAGFSYCLPS------SW 340
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
+ L GS+ + P ++ R+ +YY+ L I V G+ VRV
Sbjct: 341 RAEGYLSLGSAAA---------PPHAQFTAMVTRSDTPSFYYLDLVGIKVAGRTVRVAPA 391
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
GT++DSGT T + + L F M R Y RA AL+ L C
Sbjct: 392 VFKAP-----GTVIDSGTVITRLPSRAYSALRSSFAGFM---RRYKRA---PALSILDTC 440
Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI-IL 438
+D G P + L F GGA + L V S CL ++ + + S+ IL
Sbjct: 441 YDFTGRTKVQIPSVALLFDGGATLNLGFGGVLYVANR-SQACLAFASNGDDT---SVGIL 496
Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
GN Q + + V YDL NQ++GF + C
Sbjct: 497 GNMQQKTFAVVYDLANQKIGFGAKGC 522
>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
Length = 519
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 111/389 (28%), Positives = 161/389 (41%), Gaps = 58/389 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y +++ GTP + DTGS W C Y K+ F P SS+ +
Sbjct: 178 GNYVVTVGLGTPVSRYTVVFDTGSDTTWVQCQPCVVVCYEQREKL--FDPARSSTYANVS 235
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C P CS ++ C Y V YG G + G +TL L +
Sbjct: 236 CAAPACSDLNIHGCSGGHC----------------LYGVQYGDGSYSIGFFAMDTLTLSS 279
Query: 206 -RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLDK----FSYCLLSHKFDDT 257
+ F GC + + AG+ G GRGKTSLP Q DK F++CL +
Sbjct: 280 YDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQ-TYDKYGGVFAHCLPARS---- 334
Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFV--NNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
T + LD G+ + LT TP + N P+ +YYVG+ I VGGQ +
Sbjct: 335 --TGTGYLDFGAGSLAAASARLT-TPMLTDNGPT---------FYYVGMTGIRVGGQLLS 382
Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
+ GTIVDSGT T + P + L + + R Y + A A++
Sbjct: 383 IPQSVFA-----TAGTIVDSGTVITRLPPAAYSSL-RYAFAAAMAARGYKK---APAVSL 433
Query: 376 LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPS 435
L C+D G + P + L F+GGA + + S VCL + + GG
Sbjct: 434 LDTCYDFTGMSQVAIPTVSLLFQGGARLDVDASGIM-YAASASQVCLAFAANED--GGDV 490
Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN Q++ + V YD+ + +GF C
Sbjct: 491 GIVGNTQLKTFGVAYDIGKKVVGFYPGAC 519
>gi|255581545|ref|XP_002531578.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223528808|gb|EEF30814.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 119/424 (28%), Positives = 181/424 (42%), Gaps = 47/424 (11%)
Query: 58 RALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPC 117
R HI + TT T ++TT + H ++SL+ G+PPQ + +LDTGS L W C
Sbjct: 38 RHSHISTARKYFTTATASSTTNKLLFHHNVSLTVSLTVGSPPQNVTMVLDTGSELSWLHC 97
Query: 118 TNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCT 177
+ ++ +S F P S + + C +P C + RD + S + T
Sbjct: 98 K---KTQFLNS----VFNPLSSKTYSKVPCLSPTC------KTRTRDLT---IPVSCDAT 141
Query: 178 QICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGC--SVLSSR-----QPAGIAGFG 230
++C + + EG ET L + P + GC S SS + G+ G
Sbjct: 142 KLCHVIVSYADATSIEGNLAFETFRLGSLTKPATIFGCMDSGFSSNSEEDSKTTGLIGMN 201
Query: 231 RGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSV 290
RG S +Q+ KFSYC+ FD L+L N S K L+YTP V S
Sbjct: 202 RGSLSFVNQMGYPKFSYCI--SGFDS---AGVLLLGNASFPWLKP---LSYTPLV-QIST 252
Query: 291 AERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPL 350
V Y V L I V + + + D G G T+VDSGT FTF+ ++ L
Sbjct: 253 PLPYFDRVAYTVQLEGIKVKNKVLSLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYTAL 312
Query: 351 ADEFVSQMVKNRNYTRALGAEALT---GLRPCF--DVPGEKTGSFPELKLHFKGGAEVTL 405
+EF+SQ R + L + + C+ D + P + L F+ GAE+++
Sbjct: 313 KNEFLSQ---TRGILKVLNDDNFVFQGAMDLCYLLDSSRPNLQNLPVVSLMFQ-GAEMSV 368
Query: 406 PVENYF-----AVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFK 460
E V G S C T + + G + ++G+ QN ++E+DL R+G
Sbjct: 369 SGERLLYRVPGEVRGRDSVWCFT-FGNSDLLGVEAFVIGHHHQQNVWMEFDLEKSRIGLA 427
Query: 461 QQLC 464
C
Sbjct: 428 DVRC 431
>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
Length = 357
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 111/388 (28%), Positives = 158/388 (40%), Gaps = 50/388 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + G P + LDTGS + W C C C S P + P SSS R +
Sbjct: 10 GEYFARMGIGNPQRSYYLELDTGSDVTWIQCA---PCSSCYSQVDPIYDPSNSSSYRRVY 66
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNL-P 204
C + C + + + Q C SY V+YG S + G E+ L P
Sbjct: 67 CGSALCQALDYSACQGMGC----------------SYRVVYGDSSASSGDLGIESFYLGP 110
Query: 205 NR--IIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDD 256
N + N GC +S R AG+ G G G S SQ+ FSYCL+
Sbjct: 111 NSSTAMRNIAFGCGHSNSGLFRGEAGLLGMGGGTLSFFSQIAASIGPAFSYCLVDRYSQL 170
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
+R+S LI + + +TP + NP + + +YY L I+VGG + +
Sbjct: 171 QSRSSPLIFGRTAIPFAAR-----FTPLLKNPRI------NTFYYAVLTGISVGGTPLPI 219
Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
L +G GG I+DSGT+ T + P + L D + +RN A G L
Sbjct: 220 PPAQFALTGNGTGGAILDSGTSVTRVVPPAYAVLRDAY---RAASRNLPPAPGVYLLD-- 274
Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
CF+ G T P L LHF G ++ LP N V CL S P
Sbjct: 275 -TCFNFQGLPTVQIPSLVLHFDNGVDMVLPGGNILIPVDRSGTFCLAFAP----SSMPIS 329
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
++GN Q Q + + +DL+ + + C
Sbjct: 330 VIGNVQQQTFRIGFDLQRSLIAIAPREC 357
>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
Length = 461
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 112/388 (28%), Positives = 165/388 (42%), Gaps = 66/388 (17%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y I++ G+P ++DTGS + W C C C S P F P SS+ C
Sbjct: 128 YLITVGLGSPATSQTMLIDTGSDVSWVQCK---PCSQCHSQADPLFDPSSSSTYSPFSCG 184
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNLPNRI 207
+ C+ + E C ++S C Y+V YG G T G S+TL L +
Sbjct: 185 SADCAQLGQEGNGC--------SSSSQC-----QYIVTYGDGSSTTGTYSSDTLALGSSA 231
Query: 208 IPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQL--NLDK-FSYCLLSHKFDDTTRTS 261
+ +F GCS + S Q G+ G G G SL SQ L + FSYCL T +S
Sbjct: 232 VRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCL------PPTPSS 285
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
S L + G + FV P + + +Y V L+ I VGG+++ +
Sbjct: 286 SGFL------TLGAAGGSGTSGFVKTP-MLRSSQVPTFYGVRLQAIRVGGRQLSIPASVF 338
Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
+ GT++DSGT T + P + L+ F + M + Y A + L CFD
Sbjct: 339 S------AGTVMDSGTVITRLPPTAYSALSSAFKAGM---KQYPPAQPSGIL---DTCFD 386
Query: 382 VPGEKTGSFPELKLHFKGGAEVTLP-----VENYFAVVGEGSAVCLTVVTDREASGGPSI 436
G+ + S P + L F GGA V+L + N A G +D + G
Sbjct: 387 FSGQSSVSIPSVALVFSGGAVVSLDASGIILSNCLAFAGN---------SDDSSLG---- 433
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN Q + + V YD+ +GF+ C
Sbjct: 434 IIGNVQQRTFEVLYDVGRGVVGFRAGAC 461
>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 439
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 118/454 (25%), Positives = 192/454 (42%), Gaps = 64/454 (14%)
Query: 31 FSLSRFHTNPSQDSY--------QNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNIS 82
FS+ H + S+ + Q ++++V+ S+ RA ++ + + + T I
Sbjct: 27 FSVELIHPDSSRSPFYNIRETQLQRISNVVTHSIKRAHYLNHVFSLSHNDLPKPT---II 83
Query: 83 SHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSS 142
++ Y +S S GTPP + ++DTGS +WF C CK C + P F P SS+
Sbjct: 84 PYAGSYYVMSYSIGTPPFQLYGVVDTGSDGIWFQCK---PCKPCLNQTSPIFNPSKSSTY 140
Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLN 202
+ + C +P C E +C S N + C + ++G +TL
Sbjct: 141 KNIRCSSPICK--RGEKTRC----------SSNRKRKCEYEITYLDRSGSQGDISKDTLT 188
Query: 203 LPNR-----IIPNFLVGC----SVLSSRQPAGIAGFGRGKTSLPSQLNLD---KFSYCLL 250
L + P ++GC S+ + +GI GFGRG S+ SQL KFSYCL
Sbjct: 189 LNSNDGSPISFPKIVIGCGHKNSLTTEGLASGIIGFGRGNFSIVSQLGSSIGGKFSYCLA 248
Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
S F +S L + + S G+ TP + + V Y+ L +VG
Sbjct: 249 S-LFSKANISSKLYFGDMAVVSGH---GVVSTPLIQSFYVGN-------YFTNLEAFSVG 297
Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
+++ L D +GN ++DSG+T T + +++ L +S MVK +
Sbjct: 298 DHIIKLKDSSLIPDNEGNA--VIDSGSTITQLPNDVYSQLETAVIS-MVKLKRV-----K 349
Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA 430
+ L C+ +K P + HF+ GA+V L N F + +C +
Sbjct: 350 DPTQQLSLCYKTTLKKY-EVPIITAHFR-GADVKLNAFNTFIQMNH-EVMCFAF----NS 402
Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
S P ++ GN QN+ V YD + FK C
Sbjct: 403 SAFPWVVYGNIAQQNFLVGYDTLKNIISFKPTNC 436
>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
Length = 444
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 115/415 (27%), Positives = 168/415 (40%), Gaps = 55/415 (13%)
Query: 71 TTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSK 130
+ +++ TT + H + SL+ GTPPQ I +LDTGS L W C K
Sbjct: 49 SNSSSKTTGKLLFHHNVTLTASLTIGTPPQNITMVLDTGSELSWLRC-----------KK 97
Query: 131 IPSFI----PKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVL 186
P+F P S + + C + C +D L + + ++C +
Sbjct: 98 EPNFTSIFNPLASKTYTKIPCSSQTCK---------TRTSDLTLPVTCDPAKLCHFIISY 148
Query: 187 YGSGLTEGIALSETLNLPNRIIPNFLVGC-------SVLSSRQPAGIAGFGRGKTSLPSQ 239
+ EG ET + P + GC + + G+ G RG S +Q
Sbjct: 149 ADASSVEGHLAFETFRFGSLTRPATVFGCMDSGSSSNTEEDAKTTGLMGMNRGSLSFVNQ 208
Query: 240 LNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVY 299
+ KFSYC+ D T +L + +S K L YTP V S V
Sbjct: 209 MGFRKFSYCI--SGLDST----GFLLLGEARYSWLKP--LNYTPLV-QISTPLPYFDRVA 259
Query: 300 YYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM- 358
Y V L I V + + + D G G T+VDSGT FTF+ ++ L EF+ Q
Sbjct: 260 YSVQLEGIKVNNKVLPLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYSALRKEFLLQTA 319
Query: 359 ----VKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYF--- 411
V N GA L L D + P +KL F+ GAE+++ +
Sbjct: 320 GVLRVLNEPQYVFQGAMDLCYL---IDSTSSTLPNLPVVKLMFR-GAEMSVSGQRLLYRV 375
Query: 412 --AVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
V G+ S C T E G S ++G+ Q QN ++EYDL N R+GF + C
Sbjct: 376 PGEVRGKDSVWCFTFGNSDEL-GISSFLIGHHQQQNVWMEYDLENSRIGFAELRC 429
>gi|357166728|ref|XP_003580821.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 111/398 (27%), Positives = 162/398 (40%), Gaps = 47/398 (11%)
Query: 81 ISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSS 140
I SH G Y + + G+PP + DTGS ++W C+ C C + P F P S+
Sbjct: 115 IVSHGSGEYLVRVGIGSPPLEQHLVADTGSDVIWVQCS---PCSDCYAQGDPLFDPANSA 171
Query: 141 SSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSE 199
S + C + C S ++S Y V YG T G+ E
Sbjct: 172 SFSPVPCNSGVCRAAARYS-----------SSSCGGGGGECEYKVSYGDKSYTNGVLALE 220
Query: 200 TLNLPNRI-IPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLDK---FSYCLLSH 252
TL L + +GC + + AG+ G G G SL QL FSYCL +
Sbjct: 221 TLTLDGGTEVQGVAMGCGHENRGLFAEAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLAGY 280
Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
+ + + SL+L D TG + P V NP +YYVG+ + V G+
Sbjct: 281 YSGEGSGSGSLVL----GREDAAPTGAVWVPLVRNPDAPS------FYYVGVNGLGVAGE 330
Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
R+++ L DG GG ++D+GT T + E + L F + A A
Sbjct: 331 RLQLQDGLFDLGDDGGGGVVMDTGTAVTRLPAEAYAALRGAFAGAFEEG-----APRAPG 385
Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKG------GAEVTLPVENYFAVVGEGSAVCLTVVT 426
++ C+D+ G + P + L+F G A +TLP N V +G CL
Sbjct: 386 VSLFDTCYDLSGYASVRVPTVALYFGGGGQGQEAASLTLPARNLLVPVDDGGTYCLAFAA 445
Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ GPS ILGN Q Q + D + +GF C
Sbjct: 446 ---VASGPS-ILGNIQQQGIEITVDSASGYVGFGPATC 479
>gi|357440767|ref|XP_003590661.1| Basic 7S globulin [Medicago truncatula]
gi|355479709|gb|AES60912.1| Basic 7S globulin [Medicago truncatula]
Length = 500
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 113/439 (25%), Positives = 187/439 (42%), Gaps = 81/439 (18%)
Query: 61 HIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNH 120
H+ +P TK + TT Y ++ TP + ++D G +W C NH
Sbjct: 34 HLFSPVTKDSATTLQ-------------YIAQINQRTPLVPLNLVVDLGGKFLWVDCENH 80
Query: 121 YQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQIC 180
Y SS+ R + C + +CS +S C DC P N +
Sbjct: 81 YT----------------SSTYRPVRCPSAQCSLAKSDS--CGDCFSSPKPGCNNTCGLI 122
Query: 181 PSYLVLYGSGLTEGIALSETLNLP---------NRIIPNFLVGCSVLS-----SRQPAGI 226
P + + + T G + L++ N ++ FL C+ S + +G+
Sbjct: 123 PDNTITHSA--TRGDLAEDVLSIQSTSGFNTGQNVVVSRFLFSCAPTSLLRGLAGGASGM 180
Query: 227 AGFGRGKTSLPSQLN-----LDKFSYCLLSHK----FDDTTRTSSLILDNGSSHS---DK 274
AG GR K +LPSQL KF++C S F D S + DN S + D
Sbjct: 181 AGLGRTKIALPSQLASAFIFKRKFAFCFSSSDGVIIFGDGPY--SFLADNPSLPNVVFDS 238
Query: 275 KTTGLTYTPFVNNPSVAERNAF-----SVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNG 329
K+ LTYTP + N V+ +AF SV Y++G++ I + G+ V + L++D G G
Sbjct: 239 KS--LTYTPLLIN-HVSTASAFLQGESSVEYFIGVKTIKIDGKVVSLNSSLLSIDNKGVG 295
Query: 330 GTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTG- 388
GT + + +T + +++ + D FV V RN T + ++PG G
Sbjct: 296 GTKISTVDPYTVLEASIYKAVTDAFVKASVA-RNITTEDSSPPFEFCYSFDNLPGTPLGA 354
Query: 389 SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG----PSIILGNFQMQ 444
S P ++L + ++ N + + +CL V +GG SI++G +Q++
Sbjct: 355 SVPTIELLLQNNVIWSMFGANSMVNIND-EVLCLGFV-----NGGVNLRTSIVIGGYQLE 408
Query: 445 NYYVEYDLRNQRLGFKQQL 463
N +++DL RLGF +
Sbjct: 409 NNLLQFDLAASRLGFSNTI 427
>gi|115489316|ref|NP_001067145.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|77556903|gb|ABA99699.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113649652|dbj|BAF30164.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|125537189|gb|EAY83677.1| hypothetical protein OsI_38901 [Oryza sativa Indica Group]
Length = 446
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 111/403 (27%), Positives = 164/403 (40%), Gaps = 76/403 (18%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y G PPQ ++DTGS LVW C+ + K C+ +P + SS+ + C
Sbjct: 90 YVAEYLIGDPPQRAEALIDTGSDLVWTQCSTCLR-KVCARQALPYYNSSASSTFAPVPCA 148
Query: 149 NPKCSWIHHESIQCRDCNDEPL---ATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPN 205
C+ ND+ + + C+ I YG+G+ G +E +
Sbjct: 149 ARICA-----------ANDDIIHFCDLAAGCSVIAG-----YGAGVVAGTLGTEAFAFQS 192
Query: 206 -------------RIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSH 252
RI+ L G S G+ G GRG+ SL SQ KFSYCL +
Sbjct: 193 GTAELAFGCVTFTRIVQGALHGAS--------GLIGLGRGRLSLVSQTGATKFSYCLTPY 244
Query: 253 KFDDTTRTSSLILDNGSS---HSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
F + T L + +S H D T T FV P S +YY+ L +TV
Sbjct: 245 -FHNNGATGHLFVGASASLGGHGDVMT-----TQFVKGPK------GSPFYYLPLIGLTV 292
Query: 310 GGQRVRVWHKYLTLDRDG----NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYT 365
G R+ + L +GG I+DSG+ FT + + ++ LA E +++
Sbjct: 293 GETRLPIPATVFDLREVAPGLFSGGVIIDSGSPFTSLVHDAYDALASELAARL------N 346
Query: 366 RALGAEALTGLRPCFDVPGEKTGS-FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTV 424
+L A V G P + HF+GGA++ +P E+Y+A V +
Sbjct: 347 GSLVAPPPDADDGALCVARRDVGRVVPAVVFHFRGGADMAVPAESYWAPVDK------AA 400
Query: 425 VTDREASGGP---SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
AS GP ++GN+Q QN V YDL N F+ C
Sbjct: 401 ACMAIASAGPYRRQSVIGNYQQQNMRVLYDLANGDFSFQPADC 443
>gi|414866064|tpg|DAA44621.1| TPA: putative aspartic protease family protein [Zea mays]
Length = 454
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 107/411 (26%), Positives = 163/411 (39%), Gaps = 66/411 (16%)
Query: 90 SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIP---------KLSS 140
++ ++ G PPQ + +LDTGS L W C+ S++PS P SS
Sbjct: 63 TVPVAVGAPPQNVTMVLDTGSELSWL---------RCNGSRVPSTPPPQAPAAFNGSASS 113
Query: 141 SSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSET 200
+ C +P+C W + RD P + C L + +GI ++T
Sbjct: 114 TYAAAHCSSPECQW------RGRDLPVPPFCAGPP-SNSCRVSLSYADASSADGILAADT 166
Query: 201 LNLPNRIIPNFLVGC----------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLL 250
L L GC + S G+ G RG S +Q +F+YC+
Sbjct: 167 FLLGGAPPVRALFGCVTSYSSATATNSSDSEAATGLLGMNRGSLSFVTQTATLRFAYCIA 226
Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAF--SVYYYVGLRRIT 308
L++ G + L YTP + ++ + V Y V L I
Sbjct: 227 PGD------GPGLLVLGGDGAALAPQ--LNYTPLIQ---ISRPLPYFDRVAYSVQLEGIR 275
Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
VG + + L D G G T+VDSGT FTF+ + + PL EF++Q L
Sbjct: 276 VGAALLPIPKSVLAPDHTGAGQTMVDSGTQFTFLLADAYAPLKGEFLNQ---TSALLAPL 332
Query: 369 GAEALT---GLRPCFDVPGEKTGS----FPELKLHFKGGAEVTLPVENYFAVV-----GE 416
G CF + + PE+ L + GAEV + E V GE
Sbjct: 333 GESDFVFQGAFDACFRASEARVAAASQMLPEVGLVLR-GAEVAVGGEKLLYRVPGERRGE 391
Query: 417 GSAVCLTVVT--DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
G A + +T + + +G + ++G+ QN +VEYDL+N R+GF C
Sbjct: 392 GGAEAVWCLTFGNSDMAGMSAYVIGHHHQQNVWVEYDLQNGRVGFAPARCD 442
>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 517
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 110/389 (28%), Positives = 162/389 (41%), Gaps = 58/389 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y +++ GTP + DTGS W C Y K+ F P SS+ +
Sbjct: 176 GNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKL--FDPVRSSTYANVS 233
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C P CS ++ C Y V YG G + G +TL L +
Sbjct: 234 CAAPACSDLNIHGCSGGHC----------------LYGVQYGDGSYSIGFFAMDTLTLSS 277
Query: 206 -RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLDK----FSYCLLSHKFDDT 257
+ F GC + + AG+ G GRGKTSLP Q DK F++CL +
Sbjct: 278 YDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQ-TYDKYGGVFAHCLPARS---- 332
Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFV--NNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
T + LD G+ + LT TP + N P+ +YY+G+ I VGGQ +
Sbjct: 333 --TGTGYLDFGAGSPAAASARLT-TPMLTDNGPT---------FYYIGMTGIRVGGQLLS 380
Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
+ GTIVDSGT T + P + L + + R Y + A A++
Sbjct: 381 IPQSVFA-----TAGTIVDSGTVITRLPPPAYSSL-RYAFAAAMAARGYKK---APAVSL 431
Query: 376 LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPS 435
L C+D G + P + L F+GGA + + S VCL + + GG
Sbjct: 432 LDTCYDFTGMSQVAIPTVSLLFQGGARLDVDASGIM-YAASASQVCLAFAANED--GGDV 488
Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN Q++ + V YD+ + +GF +C
Sbjct: 489 GIVGNTQLKTFGVAYDIGKKVVGFYPGVC 517
>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
Length = 385
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 112/388 (28%), Positives = 165/388 (42%), Gaps = 66/388 (17%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y I++ G+P ++DTGS + W C C C S P F P SS+ C
Sbjct: 52 YLITVGLGSPATSQTMLIDTGSDVSWVQCK---PCSQCHSQADPLFDPSSSSTYSPFSCG 108
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNLPNRI 207
+ C+ + E C ++S C Y+V YG G T G S+TL L +
Sbjct: 109 SADCAQLGQEGNGC--------SSSSQC-----QYIVTYGDGSSTTGTYSSDTLALGSSA 155
Query: 208 IPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQL--NLDK-FSYCLLSHKFDDTTRTS 261
+ +F GCS + S Q G+ G G G SL SQ L + FSYCL T +S
Sbjct: 156 VRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCL------PPTPSS 209
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
S L + G + FV P + + +Y V L+ I VGG+++ +
Sbjct: 210 SGFL------TLGAAGGSGTSGFVKTP-MLRSSQVPTFYGVRLQAIRVGGRQLSIPASVF 262
Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
+ GT++DSGT T + P + L+ F + M + Y A + L CFD
Sbjct: 263 S------AGTVMDSGTVITRLPPTAYSALSSAFKAGM---KQYPPAQPSGIL---DTCFD 310
Query: 382 VPGEKTGSFPELKLHFKGGAEVTLP-----VENYFAVVGEGSAVCLTVVTDREASGGPSI 436
G+ + S P + L F GGA V+L + N A G +D + G
Sbjct: 311 FSGQSSVSIPSVALVFSGGAVVSLDASGIILSNCLAFAGN---------SDDSSLG---- 357
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN Q + + V YD+ +GF+ C
Sbjct: 358 IIGNVQQRTFEVLYDVGRGVVGFRAGAC 385
>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 447
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 128/457 (28%), Positives = 187/457 (40%), Gaps = 86/457 (18%)
Query: 39 NPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTP 98
NP LN+ S++R+ + N ++T + G + +S++ GTP
Sbjct: 42 NPKNTVTDRLNAAFLRSISRSRRLNNILSQTDLQSGLIGAD-------GEFFMSITIGTP 94
Query: 99 PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE 158
P + I DTGS L W C C+ C P F K SS+ + C + C H
Sbjct: 95 PMKVFAIADTGSDLTWVQCK---PCQQCYKENGPIFDKKKSSTYKSEPCDSRNC---HAL 148
Query: 159 SIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPNR-----IIPNFL 212
S R C++ SKN +C Y YG ++G +ET+++ + P +
Sbjct: 149 SSSERGCDE-----SKN---VC-KYRYSYGDQSFSKGDVATETISIDSASGSPVSFPGTV 199
Query: 213 VGCSVLSSRQPAGIAGFGRGKT----------------SLPSQLN---LDKFSYCLLSHK 253
GC G+ G T SL SQL KFSYC LSHK
Sbjct: 200 FGC------------GYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYC-LSHK 246
Query: 254 FDDTTRTSSLILDNGSSHSD-KKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
T TS + L S S K +G+ TP V+ YYY+ L I+VG +
Sbjct: 247 SATTNGTSVINLGTNSIPSSLSKDSGVISTPLVDKEP-------RTYYYLTLEAISVGKK 299
Query: 313 RVRVWHKYLTLDRDG-----NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRA 367
++ + G +G I+DSGTT T + F+ V ++V R
Sbjct: 300 KIPYTGSSYNPNDGGIFSETSGNIIIDSGTTLTLLDSGFFDKFG-AAVEELVTGAK--RV 356
Query: 368 LGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTD 427
+ L L CF + G PE+ +HF GA+V L N F V E VCL++V
Sbjct: 357 SDPQGL--LSHCFKSGSAEIG-LPEITVHFT-GADVRLSPINAFVKVSE-DMVCLSMVPT 411
Query: 428 REASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
E + I GNF ++ V YDL + + F++ C
Sbjct: 412 TEVA-----IYGNFAQMDFLVGYDLETRTVSFQRMDC 443
>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
Length = 474
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 112/390 (28%), Positives = 166/390 (42%), Gaps = 61/390 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIP----SFIPKLSSSS 142
G Y +++ GTP + + DTGS + W QC+ C S P F P S+S
Sbjct: 133 GNYVVTVGLGTPKEDFTLVFDTGSGITW------TQCQPCLGSCYPQKEQKFDPTKSTSY 186
Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETL 201
+ C + C+ + C N L Y ++YG ++G +ETL
Sbjct: 187 NNVSCSSASCNLLPTSERGCSASNSTCL------------YQIIYGDQSYSQGFFATETL 234
Query: 202 NLPNR-IIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLN---LDKFSYCLLSHKF 254
+ + + NFL GC ++ Q AG+ G SLPSQ +FSYCL S
Sbjct: 235 TISSSDVFTNFLFGCGQSNNGLFGQAAGLLGLSSSSVSLPSQTAEKYQKQFSYCLPS--- 291
Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
T +S+ L+ G S +T G +TP AFS +Y + + I+V G ++
Sbjct: 292 ---TPSSTGYLNFGGKVS--QTAG--FTPI--------SPAFSSFYGIDIVGISVAGSQL 336
Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
+ T G I+DSGT T + P ++ L + F +M NY + G E L
Sbjct: 337 PIDPSIFT-----TSGAIIDSGTVITRLPPTAYKALKEAFDEKM---SNYPKTNGDELL- 387
Query: 375 GLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGP 434
C+D T SFP++ + FKGG EV + +V VCL +++ S
Sbjct: 388 --DTCYDFSNYTTVSFPKVSVSFKGGVEVDIDASGILYLVNGVKMVCLAFAANKDDS--E 443
Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I GN Q + Y V YD +GF C
Sbjct: 444 FGIFGNHQQKTYEVVYDGAKGMIGFAAGAC 473
>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
Length = 498
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 109/395 (27%), Positives = 163/395 (41%), Gaps = 68/395 (17%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + GTP + +LDTGS + W C C+ C S P F P S+S +G
Sbjct: 155 GEYFTRIGVGTPTREQYMVLDTGSDVAWIQCE---PCRECYSQADPIFNPSYSASFSTVG 211
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C + CS + DC+ S C Y YG G + G +ETL
Sbjct: 212 CDSAVCS-----QLDAYDCH------SGGCL-----YEASYGDGSYSTGSFATETLTFGT 255
Query: 206 RIIPNFLVGCSVLSSRQPAGI-------AGFGRGKTSLPSQLNLDK---FSYCLLSHKFD 255
+ N +GC + G+ G G G S P+Q+ FSYCL+ + D
Sbjct: 256 TSVANVAIGCG----HKNVGLFIGAAGLLGLGAGALSFPNQIGTQTGHTFSYCLVDRESD 311
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV- 314
SS L G G +TP NP + +YY+ + I+VGG +
Sbjct: 312 -----SSGPLQFGPK---SVPVGSIFTPLEKNPHLP------TFYYLSVTAISVGGALLD 357
Query: 315 RVWHKYLTLDR-DGNGGTIVDSGTTFTFMAPELFEPLADEFVS---QMVKNRNYTRALGA 370
+ + +D G+GG I+DSGT T + ++ + D FV+ Q+ +
Sbjct: 358 SIPPEVFRIDETSGHGGFIIDSGTVVTRLVTSAYDAVRDAFVAGTGQLPRT--------- 408
Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA 430
+A++ C+D+ G + S P + HF GA + LP +NY + C A
Sbjct: 409 DAVSIFDTCYDLSGLQFVSVPTVGFHFSNGASLILPAKNYLIPMDTVGTFCFAF-----A 463
Query: 431 SGGPSI-ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
S+ I+GN Q Q+ V +D N +GF C
Sbjct: 464 PAASSVSIMGNTQQQHIRVSFDSANSLVGFAFDQC 498
>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
Length = 444
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 110/395 (27%), Positives = 165/395 (41%), Gaps = 45/395 (11%)
Query: 90 SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKI---PSFIPKLSSSSRLLG 146
++SL+ GTPPQ + +LDTGS L W C Q + + SF P+ S++ +
Sbjct: 64 TVSLAVGTPPQNVTMVLDTGSELSWLLCATGRQGSAAAGAAAAMGESFRPRASATFAAVP 123
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
C + +CS RD P + ++ C L ++G ++ +
Sbjct: 124 CGSTQCS--------SRDLPAPP--SCDGASRQCHVSLSYADGSASDGALATDVFAVGEA 173
Query: 207 IIPNFLVGC-SVLSSRQPAGIA-----GFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRT 260
GC S P G+A G RG S +Q + +FSYC+ D
Sbjct: 174 PPLRSAFGCMSTAYDSSPDGVATAGLLGMNRGTLSFVTQASTRRFSYCI-----SDRDDA 228
Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
L+L HSD L YTP P++ V Y V L I VGG+ + +
Sbjct: 229 GVLLL----GHSDLPFLPLNYTPLYQ-PTLPLPYFDRVAYSVQLLGIRVGGKALPIPASV 283
Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT---GLR 377
L D G G T+VDSGT FTF+ + + L EF+ Q + RAL + L
Sbjct: 284 LAPDHTGAGQTMVDSGTQFTFLLGDAYSALKAEFLKQ---TKPLLRALDDPSFAFQEALD 340
Query: 378 PCFDVPGEK---TGSFPELKLHFKGGAEVTLPVEN-YFAVVGEGSAV----CLTVVTDRE 429
CF VP + + P + L F GAE+++ + + V GE CLT + +
Sbjct: 341 TCFRVPAGRPPPSARLPPVTLLFN-GAEMSVAGDRLLYKVPGEHRGADGVWCLT-FGNAD 398
Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ ++G+ N +VEYDL R+G C
Sbjct: 399 MVPLTAYVIGHHHQMNLWVEYDLERGRVGLAPVKC 433
>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 111/419 (26%), Positives = 169/419 (40%), Gaps = 67/419 (15%)
Query: 64 NPQTKTTTTTTTTTTTNISSHS-----YGGYSISLSFGTPPQIIPFILDTGSHLVWFPCT 118
+ + K TTT + HS G Y + G+P Q ++DTGS W C+
Sbjct: 83 DSRRKGFEMTTTPAEVEMPMHSGRDDALGEYFAEVKVGSPGQRFWLVVDTGSEFTWLNCS 142
Query: 119 NHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQ 178
++ C+S K + +L S S C P ++ D + +++K
Sbjct: 143 KSFEAVTCASRKCKVDLSELFSLSV---CPKPSDPCLY-------DISYADGSSAKG--- 189
Query: 179 ICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCS------VLSSRQPAGIAGFGRG 232
+G T+ I + T N + N +GC+ V + + GI G G
Sbjct: 190 -------FFG---TDSITVGLT-NGKQGKLNNLTIGCTKSMLNGVNFNEETGGILGLGFA 238
Query: 233 KTSLPSQLNLD---KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPS 289
K S + KFSYCL+ H + R+ S L G H+ K + T +
Sbjct: 239 KDSFIDKAANKYGAKFSYCLVDHL---SHRSVSSNLTIGGHHNAKLLGEIRRTELI---- 291
Query: 290 VAERNAFSVYYYVGLRRITVGGQRVR----VWHKYLTLDRDGNGGTIVDSGTTFTFMAPE 345
F +Y V + I++GGQ ++ VW D + GGT++DSGTT T +
Sbjct: 292 -----LFPPFYGVNVVGISIGGQMLKIPPQVW------DFNAEGGTLIDSGTTLTSLLLP 340
Query: 346 LFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTL 405
+E + + + K + T E L CFD G P L HF GGA
Sbjct: 341 AYEAVFEALTKSLTKVKRVT----GEDFDALEFCFDAEGFDDSVVPRLVFHFAGGARFEP 396
Query: 406 PVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
PV++Y V C+ +V + GG S+I GN QN+ E+DL +GF C
Sbjct: 397 PVKSYIIDVAP-LVKCIGIVP-IDGIGGASVI-GNIMQQNHLWEFDLSTNTVGFAPSTC 452
>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
Length = 481
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 109/391 (27%), Positives = 161/391 (41%), Gaps = 63/391 (16%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y +S+ GTP + + + DTGS L W C C C P F P S++ + C
Sbjct: 138 YIVSVGLGTPKRDLLVVFDTGSDLSWVQCK---PCDGCYQQHDPLFDPSQSTTYSAVPCG 194
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNL---- 203
+C + S C Y V+YG T+G +TL L
Sbjct: 195 AQECRRLDSGSCSSGKCR----------------YEVVYGDMSQTDGNLARDTLTLGPSS 238
Query: 204 ---PNRIIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKF 254
+ + F+ GC + + G+ G GR + SL SQ FSYCL S
Sbjct: 239 SSSSSDQLQEFVFGCGDDDTGLFGKADGLFGLGRDRVSLASQAAAKYGAGFSYCLPSS-- 296
Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
+T L L + + + + T +T + + PS +YY+ L I V G+ V
Sbjct: 297 --STAEGYLSLGSAAPPNARFTAMVTRS---DTPS---------FYYLNLVGIKVAGRTV 342
Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
RV GT++DSGT T + + L F M + +Y RA AL+
Sbjct: 343 RVSPAVFR-----TPGTVIDSGTVITRLPSRAYAALRSSFAGLM-RRYSYKRA---PALS 393
Query: 375 GLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGP 434
L C+D G P + L F GGA + L V + S CL ++ + +
Sbjct: 394 ILDTCYDFTGRNKVQIPSVALLFDGGATLNLGFGEVLYVANK-SQACLAFASNGDDT--- 449
Query: 435 SI-ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
SI ILGN Q + + V YD+ NQ++GF + C
Sbjct: 450 SIAILGNMQQKTFAVVYDVANQKIGFGAKGC 480
>gi|226530102|ref|NP_001152414.1| PCS1 precursor [Zea mays]
gi|195656033|gb|ACG47484.1| PCS1 [Zea mays]
Length = 452
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 107/411 (26%), Positives = 162/411 (39%), Gaps = 66/411 (16%)
Query: 90 SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIP---------KLSS 140
++ ++ G PPQ + +LDTGS L W C+ S++PS P SS
Sbjct: 61 TVPVAVGAPPQNVTMVLDTGSELSWL---------RCNGSRVPSTPPPQAPAAFNGSASS 111
Query: 141 SSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSET 200
+ C +P+C W + RD P C L + +GI ++T
Sbjct: 112 TYAAAHCSSPECQW------RGRDLPVPPFCAGPPSXS-CRVSLSYADASSADGILAADT 164
Query: 201 LNLPNRIIPNFLVGC----------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLL 250
L L GC + S G+ G RG S +Q +F+YC+
Sbjct: 165 FLLGGAPPVXALFGCVTSYSSATATNSSDSEAATGLLGMNRGSLSFVTQTATLRFAYCIA 224
Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAF--SVYYYVGLRRIT 308
L++ G + L YTP + ++ + V Y V L I
Sbjct: 225 PGD------GPGLLVLGGDGAALAPQ--LNYTPLIQ---ISRPLPYFDRVAYSVQLEGIR 273
Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
VG + + L D G G T+VDSGT FTF+ + + PL EF++Q L
Sbjct: 274 VGAALLPIPKSVLAPDHTGAGQTMVDSGTQFTFLLADAYAPLKGEFLNQ---TSALLAPL 330
Query: 369 GAEALT---GLRPCFDVPGEKTGS----FPELKLHFKGGAEVTLPVENYFAVV-----GE 416
G CF + + PE+ L + GAEV + E V GE
Sbjct: 331 GESDFVFQGAFDACFRASEARVAAASXMLPEVGLVLR-GAEVAVGGEKLLYRVPGERRGE 389
Query: 417 GSAVCLTVVT--DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
G A + +T + + +G + ++G+ QN +VEYDL+N R+GF C
Sbjct: 390 GGAEAVWCLTFGNSDMAGMSAYVIGHHHQQNVWVEYDLQNGRVGFAPARCD 440
>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
Length = 482
Score = 105 bits (261), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 115/417 (27%), Positives = 169/417 (40%), Gaps = 67/417 (16%)
Query: 71 TTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS- 129
T T + ++ HS Y +++ GTP + + DTGS L W QCK C+ S
Sbjct: 109 TAATIPASLGLAFHSLE-YVVTIGIGTPARNFTVLFDTGSDLTWV------QCKPCTDSC 161
Query: 130 ---KIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVL 186
+ P F P SS+ + C P+C + + C E Y V
Sbjct: 162 YQQQEPLFDPSKSSTYVDVPCGTPQCKIGGGQDLTCGGTTCE--------------YSVK 207
Query: 187 YGS-GLTEGIALSETLNLPNRIIP--NFLVGCS------VLSSRQP---AGIAGFGRGKT 234
YG +T G E L P + GCS V + + AG+ G GRG +
Sbjct: 208 YGDQSVTRGNLAQEAFTLSPSAPPAAGVVFGCSHEYSSGVKGAEEEMSVAGLLGLGRGDS 267
Query: 235 SLPSQLNL----DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSV 290
S+ SQ D FSYCL +S+ L G++ + L++TP V + S
Sbjct: 268 SILSQTRRGNSGDVFSYCLPPRG------SSAGYLTIGAAAPPQSN--LSFTPLVTDNS- 318
Query: 291 AERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPL 350
S Y V L I+V G + + + GT++DSGT T M + L
Sbjct: 319 ----QLSSVYVVNLVGISVSGAALPIDASAFYI------GTVIDSGTVITHMPAAAYYVL 368
Query: 351 ADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVEN- 409
DEF M YT L + L C+DV G + P + L F GGA + +
Sbjct: 369 RDEFRRHM---GGYTM-LPEGHVESLDTCYDVTGHDVVTAPPVALEFGGGARIDVDASGI 424
Query: 410 --YFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
FAV G ++ L + + +I+GN Q + Y V +D+ +R+GF C
Sbjct: 425 LLVFAVDASGQSLTLACLAFVPTNLPGFVIIGNMQQRAYNVVFDVEGRRIGFGANGC 481
>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
Length = 464
Score = 105 bits (261), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 109/409 (26%), Positives = 156/409 (38%), Gaps = 66/409 (16%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + L GTPP +DT S L+W C C C P F P++SS+ L
Sbjct: 87 GEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQ---PCTGCYHQVDPMFNPRVSSTYAALP 143
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
C + C + +C +DE C G+ TEG + L +
Sbjct: 144 CSSDTCDELDVH--RCGHDDDES----------CQYTYTYSGNATTEGTLAVDKLVIGED 191
Query: 207 IIPNFLVGCSVLSSR-----QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
GCS S+ Q +G+ G GRG SL SQL++ +F+YCL +R
Sbjct: 192 AFRGVAFGCSTSSTGGAPPPQASGVVGLGRGPLSLVSQLSVRRFAYCLPPP----ASRIP 247
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
++ + + + T P +P + YYY+ L + +G + + +
Sbjct: 248 GKLVLGADADAARNATNRIAVPMRRDPR------YPSYYYLNLDGLLIGDRAMSLPPTTT 301
Query: 322 TLDR----------------------DGNG-GTIVDSGTTFTFMAPELFEPLADEFVSQM 358
T D N G I+D +T TF+ L+ DE V+ +
Sbjct: 302 TTATATATAPAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLEASLY----DELVNDL 357
Query: 359 VKNRNYTRALGAEALTGLRPCFDVPGEKTGS---FPELKLHFKGGAEVTLPVENYFAVVG 415
R G+ GL CF +P P + L F G + L FA
Sbjct: 358 EVEIRLPRGTGSS--LGLDLCFILPDGVAFDRVYVPAVALAFD-GRWLRLDKARLFAEDR 414
Query: 416 EGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
E +CL V A G ILGNFQ QN V Y+LR R+ F Q C
Sbjct: 415 ESGMMCLMV---GRAEAGSVSILGNFQQQNMQVLYNLRRGRVTFVQSPC 460
>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 531
Score = 105 bits (261), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 112/388 (28%), Positives = 165/388 (42%), Gaps = 66/388 (17%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y I++ G+P ++DTGS + W C C C S P F P SS+ C
Sbjct: 198 YLITVGLGSPATSQTMLIDTGSDVSWVQCK---PCSQCHSQADPLFDPSSSSTYSPFSCG 254
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNLPNRI 207
+ C+ + E C ++S C Y+V YG G T G S+TL L +
Sbjct: 255 SADCAQLGQEGNGC--------SSSSQC-----QYIVTYGDGSSTTGTYSSDTLALGSSA 301
Query: 208 IPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQL--NLDK-FSYCLLSHKFDDTTRTS 261
+ +F GCS + S Q G+ G G G SL SQ L + FSYCL T +S
Sbjct: 302 VRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCL------PPTPSS 355
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
S L + G + FV P + + +Y V L+ I VGG+++ +
Sbjct: 356 SGFL------TLGAAGGSGTSGFVKTP-MLRSSQVPTFYGVRLQAIRVGGRQLSIPASVF 408
Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
+ GT++DSGT T + P + L+ F + M + Y A + L CFD
Sbjct: 409 S------AGTVMDSGTVITRLPPTAYSALSSAFKAGM---KQYPPAQPSGILD---TCFD 456
Query: 382 VPGEKTGSFPELKLHFKGGAEVTLP-----VENYFAVVGEGSAVCLTVVTDREASGGPSI 436
G+ + S P + L F GGA V+L + N A G +D + G
Sbjct: 457 FSGQSSVSIPSVALVFSGGAVVSLDASGIILSNCLAFAGN---------SDDSSLG---- 503
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN Q + + V YD+ +GF+ C
Sbjct: 504 IIGNVQQRTFEVLYDVGRGVVGFRAGAC 531
>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 105 bits (261), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 125/482 (25%), Positives = 202/482 (41%), Gaps = 75/482 (15%)
Query: 5 ISALCLSFIFFFTLLSIFPSSITSLTFSLSRFHTN----PSQDSYQNLNSLVSSSLTRAL 60
I +L + IF + + ++ F++ H + P + +N V+ +L R++
Sbjct: 4 IFSLVIVIIFLISTAVVSAATGPDYGFTVELIHRDSPKSPMYNPLENHYHRVADTLRRSI 63
Query: 61 HIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVW---FPC 117
+ T T T I ++ G Y + LS GTPP I + DTGS ++W PC
Sbjct: 64 ------SHNTGLVTNTVEAPIYNNR-GEYLMKLSVGTPPFPIIAVADTGSDIIWTQCVPC 116
Query: 118 TNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCT 177
TN YQ +P F P S++ R + C +P CS+ ++ C+ +P +CT
Sbjct: 117 TNCYQ------QDLPMFNPSKSTTYRKVSCSSPVCSFTGEDN----SCSFKP-----DCT 161
Query: 178 QICPSYLVLYG-SGLTEGIALSETLNL---PNRII--PNFLVGCSVLSS----RQPAGIA 227
Y + YG + ++G +TL + R++ P +GC ++ +GI
Sbjct: 162 -----YSISYGDNSHSQGDFAVDTLTMGSTSGRVVAFPRTAIGCGHDNAGSFDANVSGIV 216
Query: 228 GFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPF 284
G G G SL Q+ KFSYCL DD N S+++ +G TP
Sbjct: 217 GLGLGPASLIKQMGSAVGGKFSYCLTPIGNDDGGSNKL----NFGSNANVSGSGAVSTP- 271
Query: 285 VNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAP 344
+ + F +Y + L+ ++VG R ++ G I+DSGTT T +
Sbjct: 272 -----IYISDKFKSFYSLKLKAVSVG--RNNTFYSTANSILGGKANIIIDSGTTLTLLPV 324
Query: 345 ELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVT 404
+L+ F + + N R L CF+ + P + +HF+ GA +
Sbjct: 325 DLYH----NFAKAISNSINLQRTDDPNQF--LEYCFETTTDDY-KVPFIAMHFE-GANLR 376
Query: 405 LPVENYFAVVGEGSAVCLTV--VTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQ 462
L EN V + + +CL D + S I GN N+ V YD+ N L FK
Sbjct: 377 LQRENVLIRVSD-NVICLAFAGAQDNDIS-----IYGNIAQINFLVGYDVTNMSLSFKPM 430
Query: 463 LC 464
C
Sbjct: 431 NC 432
>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
gi|194705620|gb|ACF86894.1| unknown [Zea mays]
gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
Length = 477
Score = 105 bits (261), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 109/386 (28%), Positives = 157/386 (40%), Gaps = 53/386 (13%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y SL GTP + LDTGS W C C C F P SS+ + C
Sbjct: 134 YFTSLRLGTPATDLLVELDTGSDQSWIQCK---PCPDCYEQHEALFDPSKSSTYSDITCS 190
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCT--QICPSYLVLYGSGLTEGIALSETLNL-PN 205
+ R+C + + NC+ + CP + T G +TL L P
Sbjct: 191 S-------------RECQELGSSHKHNCSSDKKCPYEITYADDSYTVGNLARDTLTLSPT 237
Query: 206 RIIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTR 259
+P F+ GC + S + G+ G GRGK SL SQ+ FSYCL S +
Sbjct: 238 DAVPGFVFGCGHNNAGSFGEIDGLLGLGRGKASLSSQVAARYGAGFSYCLPS-----SPS 292
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
+ + +G++ + T +PS +YY+ L ITV G+ ++V
Sbjct: 293 ATGYLSFSGAAAAAPTNAQFTEMVAGQHPS---------FYYLNLTGITVAGRAIKVPPS 343
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
GTI+DSGT F+ + P + L S M + Y RA + T C
Sbjct: 344 VFAT----AAGTIIDSGTAFSCLPPSAYAALRSSVRSAMGR---YKRA---PSSTIFDTC 393
Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTD-REASGGPSIIL 438
+D+ G +T P + L F GA V L S CL + + + S G +L
Sbjct: 394 YDLTGHETVRIPSVALVFADGATVHLHPSGVLYTWSNVSQTCLAFLPNPDDTSLG---VL 450
Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
GN Q + V YD+ NQ++GF C
Sbjct: 451 GNTQQRTLAVIYDVDNQKVGFGANGC 476
>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
Length = 464
Score = 105 bits (261), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 109/409 (26%), Positives = 156/409 (38%), Gaps = 66/409 (16%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + L GTPP +DT S L+W C C C P F P++SS+ L
Sbjct: 87 GEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQ---PCTGCYHQVDPMFNPRVSSTYAALP 143
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
C + C + +C +DE C G+ TEG + L +
Sbjct: 144 CSSDTCDELDVH--RCGHDDDES----------CQYTYTYSGNATTEGTLAVDKLVIGED 191
Query: 207 IIPNFLVGCSVLSSR-----QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
GCS S+ Q +G+ G GRG SL SQL++ +F+YCL +R
Sbjct: 192 AFRGVAFGCSTSSTGGAPPPQASGVVGLGRGPLSLVSQLSVRRFAYCLPPP----ASRIP 247
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
++ + + + T P +P + YYY+ L + +G + + +
Sbjct: 248 GKLVLGADADAARNATNRIAVPMRRDPR------YPSYYYLNLDGLLIGDRTMSLPPTTT 301
Query: 322 TLDR----------------------DGNG-GTIVDSGTTFTFMAPELFEPLADEFVSQM 358
T D N G I+D +T TF+ L+ DE V+ +
Sbjct: 302 TTATATATAPAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLEASLY----DELVNDL 357
Query: 359 VKNRNYTRALGAEALTGLRPCFDVPGEKTGS---FPELKLHFKGGAEVTLPVENYFAVVG 415
R G+ GL CF +P P + L F G + L FA
Sbjct: 358 EVEIRLPRGTGSS--LGLDLCFILPDGVAFDRVYVPAVALAFD-GRWLRLDKARLFAEDR 414
Query: 416 EGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
E +CL V A G ILGNFQ QN V Y+LR R+ F Q C
Sbjct: 415 ESGMMCLMV---GRAEAGSVSILGNFQQQNMQVLYNLRRGRVTFVQSPC 460
>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
Length = 390
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 110/388 (28%), Positives = 157/388 (40%), Gaps = 50/388 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + G+P + LDTGS + W C C C S P + P SSS R +
Sbjct: 43 GEYFARMGIGSPQRSYYLELDTGSDVTWIQCA---PCSSCYSQVDPIYDPSNSSSYRRVY 99
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNL-P 204
C + C + + + Q C SY V+YG S + G E+ L P
Sbjct: 100 CGSALCQALDYSACQGMGC----------------SYRVVYGDSSASSGDLGIESFYLGP 143
Query: 205 NR--IIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDD 256
N + N GC +S R AG+ G G G S SQ+ FSYCL+
Sbjct: 144 NSSTAMRNIAFGCGHSNSGLFRGEAGLLGMGGGTLSFFSQIAASIGPAFSYCLVDRYSQL 203
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
+R+S LI + + +TP + NP + +YY L I+VGG + +
Sbjct: 204 QSRSSPLIFGRTAIPFAAR-----FTPLLKNPRI------DTFYYAILTGISVGGTALPI 252
Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
L +G GG I+DSGT+ T + P + L D + +RN A G L
Sbjct: 253 PPAQFALTGNGTGGAILDSGTSVTRVVPAAYAVLRDAY---RAASRNLPPAPGVYLLD-- 307
Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
CF+ G T P L LHF ++ LP N V CL S P
Sbjct: 308 -TCFNFQGLPTVQIPSLVLHFDNDVDMVLPGGNILIPVDRSGTFCLAFAP----SSMPIS 362
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
++GN Q Q + + +DL+ + + C
Sbjct: 363 VIGNVQQQTFRIGFDLQRSLIAIAPREC 390
>gi|383125861|gb|AFG43521.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
Length = 134
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 55/141 (39%), Positives = 83/141 (58%), Gaps = 7/141 (4%)
Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
+FD+ + S ++L + + L YTPF+ N + + VYYY+GLR +++GG+
Sbjct: 1 RFDEENQKSLMVLGD---KAFPNGIPLNYTPFLTNYRAPPSSQYGVYYYIGLRAVSIGGK 57
Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
R+++ K L D GNGGTI+DSGTTFT E+F+ +A F SQ+ Y RA+ EA
Sbjct: 58 RMKLPSKLLRFDAKGNGGTIIDSGTTFTVFHDEIFKHIAAGFASQI----EYRRAVDVEA 113
Query: 373 LTGLRPCFDVPGEKTGSFPEL 393
LTG+ C++V G + PE
Sbjct: 114 LTGMGLCYNVSGLENIVLPEF 134
>gi|222637611|gb|EEE67743.1| hypothetical protein OsJ_25435 [Oryza sativa Japonica Group]
Length = 396
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 107/385 (27%), Positives = 166/385 (43%), Gaps = 54/385 (14%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + GTP Q + +DT + W PC+ C C +S F P S+S R + C
Sbjct: 54 YVVRARLGTPAQQLLLAVDTSNDAAWIPCSG---CAGCPTSS--PFNPAASASYRPVPCG 108
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
+P+C + S C+ +K+C + + Y + +TL + ++
Sbjct: 109 SPQCVLAPNPS-----CSPN----AKSC-----GFSLSYADSSLQAALSQDTLAVAGDVV 154
Query: 209 PNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHK---FDDTTR 259
+ GC + ++ P G+ G GRG S SQ + FSYCL S K F T R
Sbjct: 155 KAYTFGCLQRATGTAAPPQGLLGLGRGPLSFLSQTKDMYGATFSYCLPSFKSLNFSGTLR 214
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
+ NG K T P + NP R++ YYV + I VG + V +
Sbjct: 215 ----LGRNGQPRRIKTT------PLLANP---HRSSL---YYVNMTGIRVGKKVVSIPAS 258
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
L D GT++DSGT FT + ++ L DE V+ R A +L G C
Sbjct: 259 ALAFDPATGAGTVLDSGTMFTRLVAPVYLALRDE-----VRRRVGAGAAAVSSLGGFDTC 313
Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
++ T ++P + L F G +VTLP EN G+ CL + + ++
Sbjct: 314 YNT----TVAWPPVTLLFDG-MQVTLPEENVVIHTTYGTTSCLAMAAAPDGVNTVLNVIA 368
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
+ Q QN+ V +D+ N R+GF ++ C
Sbjct: 369 SMQQQNHRVLFDVPNGRVGFARESC 393
>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 100/385 (25%), Positives = 153/385 (39%), Gaps = 69/385 (17%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + G+PP+ ++D+GS +VW C QC + S P F P S+S +
Sbjct: 199 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQSD---PVFDPADSASFTGVS 255
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C + C + + C Y V YG G T+G ETL
Sbjct: 256 CSSSVCDRLENAGCHAGRCR----------------YEVSYGDGSYTKGTLALETLTFGR 299
Query: 206 RIIPNFLVGCSVLSSRQ---PAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTR 259
++ + +GC + AG+ G G G S QL FSYCL+S
Sbjct: 300 TMVRSVAIGCGHRNRGMFVGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSA------- 352
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
+ P V NP +YY+GL + VGG RV + +
Sbjct: 353 --------------------AWVPLVRNPRAPS------FYYIGLAGLGVGGIRVPISEE 386
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
L G+GG ++D+GT T + ++ D F++Q N RA G C
Sbjct: 387 VFRLTELGDGGVVMDTGTAVTRLPTLAYQAFRDAFLAQTA---NLPRATGVAIFD---TC 440
Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
+D+ G + P + +F GG +TLP N+ + + C ++ G S ILG
Sbjct: 441 YDLLGFVSVRVPTVSFYFSGGPILTLPARNFLIPMDDAGTFCFAFA---PSTSGLS-ILG 496
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
N Q + + +D N +GF +C
Sbjct: 497 NIQQEGIQISFDGANGYVGFGPNIC 521
>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
Length = 475
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 115/420 (27%), Positives = 173/420 (41%), Gaps = 57/420 (13%)
Query: 59 ALHIKNPQTKTTTTTTTTTTTNI-----SSHSYGGYSISLSFGTPPQIIPFILDTGSHLV 113
++H K + TT + + +T++ S+ G Y +++ GTP + I DTGS L
Sbjct: 98 SIHSKLSKKLTTNHVSQSQSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLT 157
Query: 114 WFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATS 173
W C + C K P F P S+S + C + C + + C ++
Sbjct: 158 WTQC--QPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSC------SA 209
Query: 174 KNCTQICPSYLVLYGS-GLTEGIALSETLNL-PNRIIPNFLVGCSVLSS---RQPAGIAG 228
NC Y + YG + G + L + + GC + AG+ G
Sbjct: 210 SNCI-----YGIQYGDQSFSVGFLAKDKFTLTSSDVFDGVYFGCGENNQGLFTGVAGLLG 264
Query: 229 FGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDN-GSSHSDKKTTGLTYTPF 284
GR K S PSQ FSYCL S + T L + G S S K +TP
Sbjct: 265 LGRDKLSFPSQTATAYNKIFSYCLPS----SASYTGHLTFGSAGISRSVK------FTPI 314
Query: 285 VNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAP 344
++ + +F Y + + ITVGGQ++ + + G ++DSGT T + P
Sbjct: 315 ---STITDGTSF---YGLNIVAITVGGQKLPIPSTVFSTP-----GALIDSGTVITRLPP 363
Query: 345 ELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVT 404
+ + L F ++M K Y G L CFD+ G KT + P++ F GGA V
Sbjct: 364 KAYAALRSSFKAKMSK---YPTTSGVSI---LDTCFDLSGFKTVTIPKVAFSFSGGAVVE 417
Query: 405 LPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
L + F + S VCL + + S + I GN Q Q V YD R+GF C
Sbjct: 418 LGSKGIFYAF-KISQVCLAFAGNSDDSN--AAIFGNVQQQTLEVVYDGAGGRVGFAPNGC 474
>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Glycine max]
Length = 392
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 121/418 (28%), Positives = 178/418 (42%), Gaps = 57/418 (13%)
Query: 63 KNPQTKTTTTTTTTTTTNISSHSYGG---YSISLSFGTPPQIIPFILDTGSHLVWFPCTN 119
KN + T +TT S S G Y + + GTP + + + DTGS L W
Sbjct: 17 KNLGRENTVKDLDSTTLPAESGSLIGSANYVVVVGLGTPKRDLSLVFDTGSDLTW----- 71
Query: 120 HYQCKYCSSS----KIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKN 175
QC+ C+ S + F P SSS + C + C+ + + I+ +C+ +T +
Sbjct: 72 -TQCEPCAGSCYKQQDAIFDPSKSSSYTNITCTSSLCTQLTSDGIK-SECSS---STDAS 126
Query: 176 CTQICPSYLVLYGSGLTE-GIALSETLNL-PNRIIPNFLVGCSVLSS---RQPAGIAGFG 230
C Y YG T G E L + I+ +FL GC + AG+ G G
Sbjct: 127 CI-----YDAKYGDNSTSVGFLSQERLTITATDIVDDFLFGCGQDNEGLFNGSAGLMGLG 181
Query: 231 RGKTSLPSQL--NLDK-FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNN 287
R S+ Q N +K FSYCL + T +S L G+S + + L YTP
Sbjct: 182 RHPISIVQQTSSNYNKIFSYCLPA------TSSSLGHLTFGASAATNAS--LIYTPL--- 230
Query: 288 PSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELF 347
+++ N+F Y + + I+VGG ++ ++ GG+I+DSGT T +AP ++
Sbjct: 231 STISGDNSF---YGLDIVSISVGGTKLPA----VSSSTFSAGGSIIDSGTVITRLAPTVY 283
Query: 348 EPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPV 407
L F M K Y A A L C+D+ G K S P + F GG V L
Sbjct: 284 AALRSAFRRXMEK---YPVANEAGL---LDTCYDLSGYKEISVPRIDFEFSGGVTVELXH 337
Query: 408 ENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
V E VCL + S + GN Q + V YD++ R+GF CK
Sbjct: 338 RGILXVESE-QQVCLAFAAN--GSDNDITVFGNVQQKTLEVVYDVKGGRIGFGAAGCK 392
>gi|383125857|gb|AFG43519.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
gi|383125863|gb|AFG43522.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
gi|383125867|gb|AFG43524.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
gi|383125869|gb|AFG43525.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
gi|383125871|gb|AFG43526.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
gi|383125873|gb|AFG43527.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
gi|383125877|gb|AFG43529.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
Length = 134
Score = 104 bits (260), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 55/141 (39%), Positives = 84/141 (59%), Gaps = 7/141 (4%)
Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
+FD+ + S ++L + + + L YTPF+ N + + VYYY+GLR +++GG+
Sbjct: 1 RFDEENQKSLMVLGDKAFPTG---IPLNYTPFLTNYRAPPSSQYGVYYYIGLRAVSIGGK 57
Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
R+++ K L D GNGGTI+DSGTTFT E+F+ +A F SQ+ Y RA+ EA
Sbjct: 58 RMKLPSKLLRFDTKGNGGTIIDSGTTFTVFHDEIFKHIAAGFASQI----EYRRAVDVEA 113
Query: 373 LTGLRPCFDVPGEKTGSFPEL 393
LTG+ C++V G + PE
Sbjct: 114 LTGMGLCYNVSGLENIVLPEF 134
>gi|115473845|ref|NP_001060521.1| Os07g0658600 [Oryza sativa Japonica Group]
gi|22775625|dbj|BAC15479.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|50510141|dbj|BAD31106.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113612057|dbj|BAF22435.1| Os07g0658600 [Oryza sativa Japonica Group]
Length = 449
Score = 104 bits (260), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 107/385 (27%), Positives = 166/385 (43%), Gaps = 54/385 (14%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + GTP Q + +DT + W PC+ C C +S F P S+S R + C
Sbjct: 107 YVVRARLGTPAQQLLLAVDTSNDAAWIPCSG---CAGCPTSS--PFNPAASASYRPVPCG 161
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
+P+C + S C+ +K+C + + Y + +TL + ++
Sbjct: 162 SPQCVLAPNPS-----CSPN----AKSC-----GFSLSYADSSLQAALSQDTLAVAGDVV 207
Query: 209 PNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHK---FDDTTR 259
+ GC + ++ P G+ G GRG S SQ + FSYCL S K F T R
Sbjct: 208 KAYTFGCLQRATGTAAPPQGLLGLGRGPLSFLSQTKDMYGATFSYCLPSFKSLNFSGTLR 267
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
+ NG K T P + NP R++ YYV + I VG + V +
Sbjct: 268 ----LGRNGQPRRIKTT------PLLANP---HRSSL---YYVNMTGIRVGKKVVSIPAS 311
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
L D GT++DSGT FT + ++ L DE V+ R A +L G C
Sbjct: 312 ALAFDPATGAGTVLDSGTMFTRLVAPVYLALRDE-----VRRRVGAGAAAVSSLGGFDTC 366
Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
++ T ++P + L F G +VTLP EN G+ CL + + ++
Sbjct: 367 YNT----TVAWPPVTLLFDG-MQVTLPEENVVIHTTYGTTSCLAMAAAPDGVNTVLNVIA 421
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
+ Q QN+ V +D+ N R+GF ++ C
Sbjct: 422 SMQQQNHRVLFDVPNGRVGFARESC 446
>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 104 bits (260), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 125/482 (25%), Positives = 202/482 (41%), Gaps = 75/482 (15%)
Query: 5 ISALCLSFIFFFTLLSIFPSSITSLTFSLSRFHTN----PSQDSYQNLNSLVSSSLTRAL 60
I +L + IF + + ++ F++ H + P + +N V+ +L R++
Sbjct: 4 IFSLVIVIIFLISTAVVSAATGPDYGFTVELIHRDSPKSPMYNPLENHYHRVADTLRRSI 63
Query: 61 HIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWF---PC 117
+ T T T I ++ G Y + LS GTPP I + DTGS ++W PC
Sbjct: 64 ------SHNTGLVTNTVEAPIYNNR-GEYLMKLSVGTPPFPIIAVADTGSDIIWTQCEPC 116
Query: 118 TNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCT 177
TN YQ +P F P S++ R + C +P CS+ ++ C+ +P +CT
Sbjct: 117 TNCYQ------QDLPMFNPSKSTTYRKVSCSSPVCSFTGEDN----SCSFKP-----DCT 161
Query: 178 QICPSYLVLYG-SGLTEGIALSETLNL---PNRII--PNFLVGCSVLSS----RQPAGIA 227
Y + YG + ++G +TL + R++ P +GC ++ +GI
Sbjct: 162 -----YSISYGDNSHSQGDFAVDTLTMGSTSGRVVAFPRTAIGCGHDNAGSFDANVSGIV 216
Query: 228 GFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPF 284
G G G SL Q+ KFSYCL DD N S+++ +G TP
Sbjct: 217 GLGLGPASLIKQMGSAVGGKFSYCLTPIGNDDGGSNKL----NFGSNANVSGSGAVSTP- 271
Query: 285 VNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAP 344
+ + F +Y + L+ ++VG R ++ G I+DSGTT T +
Sbjct: 272 -----IYISDKFKSFYSLKLKAVSVG--RNNTFYSTANSILGGKANIIIDSGTTLTLLPV 324
Query: 345 ELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVT 404
+L+ F + + N R L CF+ + P + +HF+ GA +
Sbjct: 325 DLYH----NFAKAISNSINLQRTDDPNQF--LEYCFETTTDDY-KVPFIAMHFE-GANLR 376
Query: 405 LPVENYFAVVGEGSAVCLTV--VTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQ 462
L EN V + + +CL D + S I GN N+ V YD+ N L FK
Sbjct: 377 LQRENVLIRVSD-NVICLAFAGAQDNDIS-----IYGNIAQINFLVGYDVTNMSLSFKPM 430
Query: 463 LC 464
C
Sbjct: 431 NC 432
>gi|359806276|ref|NP_001241217.1| uncharacterized protein LOC100818868 precursor [Glycine max]
gi|255644718|gb|ACU22861.1| unknown [Glycine max]
Length = 450
Score = 104 bits (260), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 109/422 (25%), Positives = 180/422 (42%), Gaps = 81/422 (19%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
YS S+ GTPP + ++D +WF C N Y SS+ + C
Sbjct: 50 YSTSIDMGTPPLTLDLVIDIRERFLWFECGNDYN----------------SSTYYPVRCG 93
Query: 149 NPKCSWIHHESIQCRDCNDEPLAT--SKNCTQICP--SYLVLYGSG-----------LTE 193
KC + C C + PL T + N + P + + SG T
Sbjct: 94 TKKCK--KAKGTACITCTNHPLKTGCTNNTCGVDPFNPFGEFFVSGDVGEDILSSLHSTS 151
Query: 194 GIALSETLNLPNRI----------IPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQL--- 240
G TL++P + + FL G + + G+ G R SLP+QL
Sbjct: 152 GARAPSTLHVPRFVSTCVYPDKFGVEGFLQGLA----KGKKGVLGLARTAISLPTQLAAK 207
Query: 241 -NLD-KFSYCLLS-HKFDDTTRTSSLILDNGSSH--SDKKTTGLTYTPFVNNPS----VA 291
NL+ KF+ CL S K++ + L + G + + L+YTP + NP +
Sbjct: 208 YNLEPKFALCLPSTSKYN---KLGDLFVGGGPYYLPPHDASKFLSYTPILTNPQSTGPIF 264
Query: 292 ERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLA 351
+ + S Y++ ++ I + G+ V V L++DR GNGG + + +T +++PL
Sbjct: 265 DADP-SSEYFIDVKSIKLDGKIVNVNTSLLSIDRQGNGGCKLSTVVPYTKFHTSIYQPLV 323
Query: 352 DEFVSQMVKNRNYTRALGAEALTGLRP---CFD---VPGEKTG-SFPELKLHFKGGAEVT 404
++FV Q + + +T + P CFD + TG + P + L KGG +
Sbjct: 324 NDFVKQAALRK-------IKRVTSVAPFGACFDSRTIGKTVTGPNVPTIDLVLKGGVQWR 376
Query: 405 LPVENYFAVVGEGSAVCLTVVTDREASGGP---SIILGNFQMQNYYVEYDLRNQRLGFKQ 461
+ N V + + +CL V G P SI++G +QM++ +E+DL + +LGF
Sbjct: 377 IYGANSMVKVSK-NVLCLGFVDGGLEPGSPIATSIVIGGYQMEDNLLEFDLVSSKLGFSS 435
Query: 462 QL 463
L
Sbjct: 436 SL 437
>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 490
Score = 104 bits (260), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 110/404 (27%), Positives = 174/404 (43%), Gaps = 53/404 (13%)
Query: 71 TTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC-KYCSSS 129
+ T + + S+ G Y +++ G+P + + FI DTGS L W C C YC
Sbjct: 129 ASKATLPSKSASTLGSGNYVVTVGLGSPKRDLTFIFDTGSDLTWTQCE---PCVGYCYQQ 185
Query: 130 KIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS 189
+ F P S S + C +P C + + + P +S C Y + YG
Sbjct: 186 REHIFDPSTSLSYSNVSCDSPSCEKLESAT------GNSPGCSSSTCL-----YGIRYGD 234
Query: 190 G-LTEGIALSETLNLPN-RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL-- 242
G + G E L+L + + NF GC + AG+ G R SL SQ
Sbjct: 235 GSYSIGFFAREKLSLTSTDVFNNFQFGCGQNNRGLFGGTAGLLGLARNPLSLVSQTAQKY 294
Query: 243 -DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNA-FSVYY 300
FSYCL ++ +S+ L GS D K + +TP +E N+ + +Y
Sbjct: 295 GKVFSYCL------PSSSSSTGYLSFGSGDGDSKA--VKFTP-------SEVNSDYPSFY 339
Query: 301 YVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVK 360
++ + I+VG +++ + + GTI+DSGT + + P ++ + F M
Sbjct: 340 FLDMVGISVGERKLPIPKSVFS-----TAGTIIDSGTVISRLPPTVYSSVQKVFRELM-- 392
Query: 361 NRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAV 420
+Y R G L C+D+ KT P++ L+F GGAE+ L E V+ + S V
Sbjct: 393 -SDYPRVKGVSILD---TCYDLSKYKTVKVPKIILYFSGGAEMDLAPEGIIYVL-KVSQV 447
Query: 421 CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
CL + + I+GN Q + +V YD R+GF C
Sbjct: 448 CLAFAGNSDDD--EVAIIGNVQQKTIHVVYDDAEGRVGFAPSGC 489
>gi|361067987|gb|AEW08305.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
gi|383125859|gb|AFG43520.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
gi|383125865|gb|AFG43523.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
gi|383125875|gb|AFG43528.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
Length = 134
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 55/141 (39%), Positives = 83/141 (58%), Gaps = 7/141 (4%)
Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
+FD+ + S ++L + + L YTPF+ N + + VYYY+GLR +++GG+
Sbjct: 1 RFDEENQKSLMVLGD---KAFPNGIPLNYTPFLTNYRAPPSSQYGVYYYIGLRAVSIGGK 57
Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
R+++ K L D GNGGTI+DSGTTFT E+F+ +A F SQ+ Y RA+ EA
Sbjct: 58 RMKLPSKLLRFDTKGNGGTIIDSGTTFTVFHDEIFKHIAAGFASQI----EYRRAVDVEA 113
Query: 373 LTGLRPCFDVPGEKTGSFPEL 393
LTG+ C++V G + PE
Sbjct: 114 LTGMGLCYNVSGLENIVLPEF 134
>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
Short=AtASPG2; Flags: Precursor
gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
thaliana]
gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 470
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 107/385 (27%), Positives = 155/385 (40%), Gaps = 50/385 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + G+PP+ ++D+GS +VW C CK C P F P S S +
Sbjct: 129 GEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQ---PCKLCYKQSDPVFDPAKSGSYTGVS 185
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C + C I + C Y V+YG G T+G ETL
Sbjct: 186 CGSSVCDRIENSGCHSGGCR----------------YEVMYGDGSYTKGTLALETLTFAK 229
Query: 206 RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTR 259
++ N +GC + AG+ G G G S QL+ F YCL+S D T
Sbjct: 230 TVVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDST-- 287
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
SL+ + G ++ P V NP +YYVGL+ + VGG R+ +
Sbjct: 288 -GSLVFGR-----EALPVGASWVPLVRNPRAPS------FYYVGLKGLGVGGVRIPLPDG 335
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
L G+GG ++D+GT T + + D F SQ N RA G C
Sbjct: 336 VFDLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTA---NLPRASGVSIFD---TC 389
Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
+D+ G + P + +F G +TLP N+ V + C AS I+G
Sbjct: 390 YDLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFA----ASPTGLSIIG 445
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
N Q + V +D N +GF +C
Sbjct: 446 NIQQEGIQVSFDGANGFVGFGPNVC 470
>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 482
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 121/425 (28%), Positives = 181/425 (42%), Gaps = 60/425 (14%)
Query: 52 VSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSH 111
+ S L++ L +N + +TT + + + Y + + GTP + + I DTGS+
Sbjct: 105 IQSRLSKNLGGENRVKELDSTTLPAKSGRLIGSA--DYYVVVGLGTPKRDLSLIFDTGSY 162
Query: 112 LVWFPCTNHYQCKYCSSS----KIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCND 167
L W QC+ C+ S + P F P SSS + C + C+ S C
Sbjct: 163 LTW------TQCEPCAGSCYKQQDPIFDPSKSSSYTNIKCTSSLCTQF--RSAGCSS--- 211
Query: 168 EPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNL-PNRIIPNFLVGCSVLSS---RQ 222
+T +C Y V YG + ++ G E L + I+ +FL GC + R
Sbjct: 212 ---STDASCI-----YDVKYGDNSISRGFLSQERLTITATDIVHDFLFGCGQDNEGLFRG 263
Query: 223 PAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGL 279
AG+ G R S Q + FSYCL S T +S L G+S + L
Sbjct: 264 TAGLMGLSRHPISFVQQTSSIYNKIFSYCLPS------TPSSLGHLTFGASAA--TNANL 315
Query: 280 TYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTF 339
YTPF +++ N+F Y + + I+VGG ++ ++ GG+I+DSGT
Sbjct: 316 KYTPF---STISGENSF---YGLDIVGISVGGTKLPA----VSSSTFSAGGSIIDSGTVI 365
Query: 340 TFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKG 399
T + P + L F M+K Y A G L C+D G K S P + F G
Sbjct: 366 TRLPPTAYAALRSAFRQFMMK---YPVAYGTRLLD---TCYDFSGYKEISVPRIDFEFAG 419
Query: 400 GAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGF 459
G +V LP+ + GE SA L + +G I GN Q + V YD+ R+GF
Sbjct: 420 GVKVELPLVG--ILYGE-SAQQLCLAFAANGNGNDITIFGNVQQKTLEVVYDVEGGRIGF 476
Query: 460 KQQLC 464
C
Sbjct: 477 GAAGC 481
>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
Length = 471
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 107/385 (27%), Positives = 155/385 (40%), Gaps = 50/385 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + G+PP+ ++D+GS +VW C CK C P F P S S +
Sbjct: 130 GEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQ---PCKLCYKQSDPVFDPAKSGSYTGVS 186
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C + C I + C Y V+YG G T+G ETL
Sbjct: 187 CGSSVCDRIENSGCHSGGCR----------------YEVMYGDGSYTKGTLALETLTFAK 230
Query: 206 RIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTR 259
++ N +GC + AG+ G G G S QL+ F YCL+S D T
Sbjct: 231 TVVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDST-- 288
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
SL+ + G ++ P V NP +YYVGL+ + VGG R+ +
Sbjct: 289 -GSLVFGR-----EALPVGASWVPLVRNPRAPS------FYYVGLKGLGVGGVRIPLPDG 336
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
L G+GG ++D+GT T + + D F SQ N RA G C
Sbjct: 337 VFDLTETGDGGVVMDTGTAVTRLPTGAYAAFRDGFKSQTA---NLPRASGVSIFD---TC 390
Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
+D+ G + P + +F G +TLP N+ V + C AS I+G
Sbjct: 391 YDLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFA----ASPTGLSIIG 446
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
N Q + V +D N +GF +C
Sbjct: 447 NIQQEGIQVSFDGANGFVGFGPNVC 471
>gi|414587000|tpg|DAA37571.1| TPA: hypothetical protein ZEAMMB73_036171 [Zea mays]
Length = 459
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 110/406 (27%), Positives = 164/406 (40%), Gaps = 66/406 (16%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + L GTP +DT S LVW C C C P F PKLSSS ++
Sbjct: 90 GEYLVKLGTGTPQHFFSAAIDTASDLVWMQCQ---PCVSCYRQLDPVFNPKLSSSYAVVP 146
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
C + C+ + + +C + +D C G G+T+G + L +
Sbjct: 147 CTSDTCAQL--DGHRCHEDDD----------GACQYTYKYSGHGVTKGTLAIDKLAIGGD 194
Query: 207 IIPNFLVGCSVLSSRQPA----GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS- 261
+ + GCS S PA G+ G GRG SL SQL++ +F YCL +RTS
Sbjct: 195 VFHAVVFGCSDSSVGGPAAQASGLVGLGRGPLSLVSQLSVHRFMYCLPPP----MSRTSG 250
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ--------- 312
L+L G+ + +T T ++ + YYY+ L + VG Q
Sbjct: 251 KLVLGAGADAVRNMSDRVTVT-------MSSSTRYPSYYYLNLDGLAVGDQTPGTTRNAT 303
Query: 313 ----------RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNR 362
+ G IVD +T +F+ L++ LAD+ ++
Sbjct: 304 SPPSGGAGGGGGGGGGGIVGAGGANAYGMIVDVASTISFLETSLYDELADDLEEEI---- 359
Query: 363 NYTRALGAEALTGLRPCFDVP---GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSA 419
RA + L GL CF +P G P + L F G L ++ V +G
Sbjct: 360 RLPRATPSLRL-GLDLCFILPEGVGMDRVYVPTVSLSFDG---RWLELDRDRLFVTDGRM 415
Query: 420 VCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
+CL + S ILGNFQ+QN V ++LR ++ F + C
Sbjct: 416 MCLMIGRTSGVS-----ILGNFQLQNMRVLFNLRRGKITFAKASCD 456
>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 640
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 111/393 (28%), Positives = 171/393 (43%), Gaps = 63/393 (16%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y+ L GTPPQ I+DTGS + + PC+ C++C + P F P LS + + +
Sbjct: 87 GYYTTRLWIGTPPQRFALIVDTGSTVTYVPCST---CEHCGRHQDPKFQPDLSETYQPVK 143
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
C P C+ ++ QC D A + + + +V +G+ LSE P R
Sbjct: 144 C-TPDCN-CDGDTNQC--MYDRQYAEMSSSSGVLGEDVVSFGN-------LSELA--PQR 190
Query: 207 IIPNFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQLN-----LDKFSYCLLSHKFDD 256
+ GC L S++ GI G GRG S+ QL D FS C
Sbjct: 191 AV----FGCENDETGDLYSQRADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGG 246
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
++IL S D V S +R S YY + L+ + V G+++++
Sbjct: 247 ----GAMILGGISPPED----------MVFTHSDPDR---SPYYNINLKEMHVAGKKLQL 289
Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
K DG GT++DSGTT+ ++ F LA F ++K RN + +
Sbjct: 290 NPKVF----DGKHGTVLDSGTTYAYLPETAF--LA--FKRAIMKERNSLKQINGPDPNYK 341
Query: 377 RPCFDVPG----EKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVVTDREAS 431
CF G + SFP + + F+ G +++L ENY F A CL V ++
Sbjct: 342 DICFTGAGIDVSQLAKSFPVVDMVFENGHKLSLSPENYLFRHSKVRGAYCLGVFSNGR-- 399
Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
P+ +LG ++N V YD N ++GF + C
Sbjct: 400 -DPTTLLGGIFVRNTLVMYDRENSKIGFWKTNC 431
>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 489
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 114/396 (28%), Positives = 162/396 (40%), Gaps = 55/396 (13%)
Query: 89 YSISLSFGTP-PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS---FIPKLSSSSRL 144
Y +S+ GTP PQ + DTGS L W C Y CK C F SSS R
Sbjct: 119 YFVSIRIGTPRPQKFILVTDTGSDLTWMNC--EYWCKSCPKPNPHPGRVFRANDSSSFRT 176
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICP-SYLVLYGSGLTEGIALSET--- 200
+ C + C + +C N C Y L G G+ +ET
Sbjct: 177 IPCSSDDCKIELQDYFSLTEC--------PNPNAPCLFDYRYLNGPRAI-GVFANETVTV 227
Query: 201 -LNLPNRI-IPNFLVGCSVLSSRQ---PAGIAGFGRGKTSLPSQLNL---DKFSYCLLSH 252
LN +I + + L+GC+ + P G+ G G K SL +L +KFSYCL+ H
Sbjct: 228 GLNDHKKIRLFDVLIGCTESFNETNGFPDGVMGLGYRKHSLALRLAEIFGNKFSYCLVDH 287
Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAF----SVYYYVGLRRIT 308
SS + K P + P + + +Y V + I+
Sbjct: 288 L---------------SSSNHKNFLSFGDIPEMKLPKMQHTELLLGYINAFYPVNVSGIS 332
Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
VGG + + + G GG IVDSGT+ T +A E ++ + D K++ + +
Sbjct: 333 VGGSMLSISSDIWNVT--GVGGMIVDSGTSLTMLAGEAYDKVVDALKPIFDKHK---KVV 387
Query: 369 GAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR 428
E CF+ G + P L +HF GA PV++Y V EG CL ++
Sbjct: 388 PIELPELNNFCFEDKGFDRAAVPRLLIHFADGAIFKPPVKSYIIDVAEGIK-CLGII--- 443
Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+A S ILGN QN+ EYDL +LGF C
Sbjct: 444 KADFPGSSILGNVMQQNHLWEYDLGRGKLGFGPSSC 479
>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 440
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 124/484 (25%), Positives = 199/484 (41%), Gaps = 79/484 (16%)
Query: 11 SFIFFFTLL------SIFPSSITSLTFSLSRFHT--------NPSQDSYQNLNSLVSSSL 56
+F+F F LL S +S T FS++ H NPS + + + V S
Sbjct: 3 AFVFCFLLLCSHSIASFAEASKTLSGFSINLIHRESPLSPFYNPSLTPSERIKNTVLRSF 62
Query: 57 TRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFP 116
R+ K + + T I Y + GTPP I DTGS L+W
Sbjct: 63 ARS---KRRLRLSQNDDRSPGTITIPDEPITEYLMRFYIGTPPVERFAIADTGSDLIWVQ 119
Query: 117 CTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNC 176
C C+ C P F P+ SS+ + + C + C+ + C + S C
Sbjct: 120 CA---PCEKCVPQNAPLFDPRKSSTFKTVPCDSQPCTLLPPSQRAC-------VGKSGQC 169
Query: 177 TQICPSYLVLYGS-GLTEGIALSETLNLPNR----IIPNFLVGC------SVLSSRQPAG 225
Y +YG L GI E++N ++ P GC +V S++ G
Sbjct: 170 Y-----YQYIYGDHTLVSGILGFESINFGSKNNAIKFPKLTFGCTFSNNDTVDESKRNMG 224
Query: 226 IAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYT 282
+ G G G SL SQL KFSYC F + S+ + G+ K+ G+ T
Sbjct: 225 LVGLGVGPLSLISQLGYQIGRKFSYC-----FPPLSSNSTSKMRFGNDAIVKQIKGVVST 279
Query: 283 PFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFM 342
P + ++ YYY+ L +++G ++V+ T + +G ++DSGT+FT +
Sbjct: 280 PLI------IKSIGPSYYYLNLEGVSIGNKKVK------TSESQTDGNILIDSGTSFTIL 327
Query: 343 APELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAE 402
+ ++FV+ +VK A+ L CF+ G++ FP++ F GA+
Sbjct: 328 KQSFY----NKFVA-LVKEVYGVEAVKIPPLV-YNFCFENKGKRK-RFPDVVFLFT-GAK 379
Query: 403 VTLPVENYFAVVGEGSAVCLTVV--TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFK 460
V + N F + + +C+ + +D + S I GN Y VEYDL+ + F
Sbjct: 380 VRVDASNLFE-AEDNNLLCMVALPTSDEDDS-----IFGNHAQIGYQVEYDLQGGMVSFA 433
Query: 461 QQLC 464
C
Sbjct: 434 PADC 437
>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
Length = 442
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 109/423 (25%), Positives = 159/423 (37%), Gaps = 38/423 (8%)
Query: 57 TRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFP 116
TR L + + + + H ++SL+ GTPPQ + +LDTGS L W
Sbjct: 34 TRPLLFELRARQVPAGALPRPASKLRFHHNVSLTVSLAVGTPPQNVTMVLDTGSELSWLL 93
Query: 117 CTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNC 176
C S + SF P+ S + + C + +C + RD P
Sbjct: 94 CAPGGGGGGGGRSAL-SFRPRASLTFASVPCDSAQC--------RSRDLPSPP--ACDGA 142
Query: 177 TQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLS-SRQPAGIA-----GFG 230
++ C L ++G +E + GC + P G+A G
Sbjct: 143 SKQCRVSLSYADGSSSDGALATEVFTVGQGPPLRAAFGCMATAFDTSPDGVATAGLLGMN 202
Query: 231 RGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSV 290
RG S SQ + +FSYC+ D L+L HSD L YTP P++
Sbjct: 203 RGALSFVSQASTRRFSYCI-----SDRDDAGVLLL----GHSDLPFLPLNYTPLYQ-PAM 252
Query: 291 AERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPL 350
V Y V L I VGG+ + + L D G G T+VDSGT FTF+ + + L
Sbjct: 253 PLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTMVDSGTQFTFLLGDAYSAL 312
Query: 351 ADEFVSQMVKNRNYTRALGAEALT---GLRPCFDVPGEKT--GSFPELKLHFKGGAEVTL 405
EF Q + + AL CF VP + P + L F G
Sbjct: 313 KAEFSRQ---TKPWLPALNDPNFAFQEAFDTCFRVPQGRAPPARLPAVTLLFNGAQMTVA 369
Query: 406 PVENYFAVVGE---GSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQ 462
+ V GE G V + + + ++G+ N +VEYDL R+G
Sbjct: 370 GDRLLYKVPGERRGGDGVWCLTFGNADMVPITAYVIGHHHQMNVWVEYDLERGRVGLAPI 429
Query: 463 LCK 465
C
Sbjct: 430 RCD 432
>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 119/442 (26%), Positives = 188/442 (42%), Gaps = 57/442 (12%)
Query: 33 LSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSIS 92
LS H NPS Y +L S +R+ + T T+ +T + I S G + +S
Sbjct: 39 LSPLH-NPSLSRYDSLIDAFRRSFSRSATL---LTHLTSVSTACIRSPIIPDS-GEFLMS 93
Query: 93 LSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKC 152
+ GTPP + I DTGS L W C C+ C + P F P+ SSS R + C + C
Sbjct: 94 IFIGTPPVNVIAIADTGSDLTWTQC---LPCRECFNQSQPIFNPRRSSSYRKVSCASDTC 150
Query: 153 SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPNRIIPNF 211
S++ C + Q C SY YG T G S+ + + + +P
Sbjct: 151 -----RSLESYHCGPD--------LQSC-SYGYSYGDRSFTYGDLASDQITIGSFKLPKT 196
Query: 212 LVGCSVLSSRQPAGIAGFGRGKTSLP----SQLNL-----DKFSYCLLSHKFDDTTRTSS 262
++GC + G+ G SQ+ +FSYCL + F + T +
Sbjct: 197 VIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPTF-FSNANITGT 255
Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
+ + S ++ + TP V P + +Y++ L I+VG +R + +
Sbjct: 256 ISFGRKAVVSGRQ---VVSTPLV--PRSPD-----TFYFLTLEAISVGKKRFKAANGISA 305
Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
+ GN I+DSGTT T + L+ + +++++K + G L C+
Sbjct: 306 MTNHGN--IIIDSGTTLTLLPRSLYYGVFST-LARVIKAKRVDDPSGILEL-----CYSA 357
Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
+ P + HF GGA+V L N FA V + + CLT + + I GN
Sbjct: 358 GQVDDLNIPIITAHFAGGADVKLLPVNTFAPVAD-NVTCLTFAPATQVA-----IFGNLA 411
Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
N+ V YDL N+RL F+ +LC
Sbjct: 412 QINFEVGYDLGNKRLSFEPKLC 433
>gi|357127507|ref|XP_003565421.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 438
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 111/383 (28%), Positives = 157/383 (40%), Gaps = 65/383 (16%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y ++L TPP + + DTGS LVW C K+P+ SSS L C
Sbjct: 76 YLMALDVSTPPVRMLALADTGSSLVWLKC------------KLPAAHTPASSSYARLPCD 123
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
C + ++ CR + + IC T G + R+
Sbjct: 124 AFACKALG-DAASCR--------ATGSGNNICVYRYAFADGSCTAGPVTVDAFTFSTRL- 173
Query: 209 PNFLVGCSVLS---SRQPAGIAGFGRGKTSLPSQLNLD-----KFSYCLLSHKFDDTTRT 260
+F GC+ + S G+ G G SL SQL+ KFSYCL+ + ++ T
Sbjct: 174 -DF--GCATRTEGLSVPDDGLVGLANGPISLVSQLSAKTPFAHKFSYCLVPYS---SSET 227
Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
S L+ GS + G TP V A RN +Y + L I V G+ V +
Sbjct: 228 VSSSLNFGSHAIVSSSPGAATTPLV-----AGRN--KSFYTIALDSIKVAGKPVPLQTTT 280
Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
L IVDSGT T++ + +PL V+ + R E L + C+
Sbjct: 281 TKL--------IVDSGTMLTYLPKAVLDPL----VAALTAAIKLPRVKSPETLYAV--CY 326
Query: 381 DV----PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
DV P + S P++ L GG EV LP N F V +G+ VCL +V S P
Sbjct: 327 DVRRRAPEDVGKSIPDVTLVLGGGGEVRLPWGNTFVVENKGTTVCLALVE----SHLPEF 382
Query: 437 ILGNFQMQNYYVEYDLRNQRLGF 459
ILGN QN +V +DL + + F
Sbjct: 383 ILGNVAQQNLHVGFDLERRTVSF 405
>gi|356556809|ref|XP_003546713.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like [Glycine max]
Length = 444
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 98/385 (25%), Positives = 149/385 (38%), Gaps = 56/385 (14%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + FGTP Q + +DT + W PCT C CS++ F P S++ + +GC
Sbjct: 106 YIVRAKFGTPAQTLLLAMDTSNDAAWVPCT---ACVGCSTTT--PFAPPKSTTFKKVGCG 160
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
+C + + + C ++ YG+ + +T+ L +
Sbjct: 161 ASQCKQVRNPTCDGSAC----------------AFNFTYGTSSVAASLVQDTVTLATDPV 204
Query: 209 PNFLVGC------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSS 262
P + GC S L + G+ + +L FSYCL S K
Sbjct: 205 PAYTFGCIQKATGSSLPPQGLLGLGRGPLSLLAQTQKLYQSTFSYCLPSFK--------- 255
Query: 263 LILDNGSSHSDKKTTGLTYT---PFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
N S H D P NP R++ YYV L I VG + V + +
Sbjct: 256 --TLNFSGHXDLXPVAQPRDQVYPSFKNP---RRSSL---YYVNLVAIRVGRRIVDIPPE 307
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
L + GT+ DSGT FT L EP ++ + + + L +L G C
Sbjct: 308 ALAFNPXTGAGTVFDSGTVFT----RLVEPAYTAVRNEFRRRVSVHKKLTVTSLGGFDTC 363
Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
+ VP P + F G VTLP +N GS CL + + ++
Sbjct: 364 YTVPIVA----PTITFMF-SGMNVTLPPDNILIHSTAGSVTCLAMAPAPDNVNSVLNVIA 418
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
N Q QN+ V +D+ N RLG ++LC
Sbjct: 419 NMQQQNHRVLFDVPNSRLGVARELC 443
>gi|356518052|ref|XP_003527698.1| PREDICTED: basic 7S globulin 2-like [Glycine max]
Length = 447
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 100/396 (25%), Positives = 167/396 (42%), Gaps = 63/396 (15%)
Query: 92 SLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPK 151
++ GTP ++D G +W C+N +Y SSSK R + C++ K
Sbjct: 59 TIGIGTPQHSTNLVIDLGGENLWHDCSNR---RYNSSSK------------RKIVCKSKK 103
Query: 152 CSWIHHESIQCRDCN----DEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRI 207
C E C +P +CT + L + S T + +T+ L +
Sbjct: 104 CP----EGAACVSTGCIGPYKPGCAISDCTITVSNPLAQFSSSYT---MVEDTIFLSHTY 156
Query: 208 IPNFLVGCSVLSS-----------RQPAGIAGFGRGKTSLPSQLNLD-----KFSYCLLS 251
IP FL GC L R GI GF + +LPSQL L KFS C S
Sbjct: 157 IPGFLAGCVDLDDGLSGNALQGLPRTSKGIIGFSHSELALPSQLVLSNKLIPKFSLCFPS 216
Query: 252 HKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNP----SVAERNAFSVYYYVGLRRI 307
++ ++ + G H ++ L TP V NP +V+ A S+ Y++ ++ I
Sbjct: 217 S--NNLKGFGNIFIGAGGGHPQVESKFLQTTPLVVNPVATGAVSIYGAPSIEYFIDVKAI 274
Query: 308 TVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRA 367
+ G + + L++D+ GNGGT + + T +T + L++P EF+++ + R R
Sbjct: 275 KIDGHVLNLNSSLLSIDKKGNGGTKISTMTPWTELHSSLYKPFVQEFINK-AEGRRMKR- 332
Query: 368 LGAEALTGLRPCFD---VPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLT 423
+ CFD + TG + P + L GGA+ T+ N V+ + CL
Sbjct: 333 --VAPVPPFDACFDTSTIRNSITGLAVPSIDLVLPGGAQWTIYGANSMTVMTSKNVACLA 390
Query: 424 VVTD----REASG---GPSIILGNFQMQNYYVEYDL 452
V +E S+++G Q+++ + D+
Sbjct: 391 FVDGGMKPKEMHSIQLEASVVIGGHQLEDNLLVIDM 426
>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 437
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 112/390 (28%), Positives = 164/390 (42%), Gaps = 56/390 (14%)
Query: 88 GYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGC 147
GY IS GTPP + ++DT + +WF C CK C ++ P F P SS+ + + C
Sbjct: 88 GYIISFLIGTPPFQLYGVMDTANDNIWFQCN---PCKPCFNTTSPMFDPSKSSTYKTIPC 144
Query: 148 QNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL-PNR 206
+PKC + E+ C S + ++C G ++G +TL L N
Sbjct: 145 SSPKCKNV--ENTHC----------SSDDKKVCEYSFTYGGEAYSQGDLSIDTLTLNSNN 192
Query: 207 IIP----NFLVGCSVLSSRQP-----AGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKF 254
P N ++GC ++ P +G G GRG S SQLN KFSYCL+ F
Sbjct: 193 DTPISFKNIVIGCG-HRNKGPLEGYVSGNIGLGRGPLSFISQLNSSIGGKFSYCLVP-LF 250
Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
+ + L + S S G TP A + Y L ++VG +
Sbjct: 251 SNEGISGKLHFGDKSVVSG---VGTVSTPIT---------AGEIGYSTTLNALSVGDHII 298
Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
+ + T D G TI+DSGTT T + ++ L + V+ MVK RA
Sbjct: 299 KFENS--TSKNDNLGNTIIDSGTTLTILPENVYSRL-ESIVTSMVK---LERAKSPNQ-- 350
Query: 375 GLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGP 434
+ C+ K P + HF GA+V L N F + + VC V+ P
Sbjct: 351 QFKLCYKAT-LKNLDVPIITAHFN-GADVHLNSLNTFYPI-DHEVVCFAFVS---VGNFP 404
Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN QN+ V +DL+ + FK C
Sbjct: 405 GTIIGNIAQQNFLVGFDLQKNIISFKPTDC 434
>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 104/398 (26%), Positives = 170/398 (42%), Gaps = 56/398 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
G Y + GTPP +DTGS ++W C + C S +I F P SS+S +
Sbjct: 73 GLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSSM 132
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNL 203
+ C + +C + IQ D AT + C SY YG G T G +S+ ++L
Sbjct: 133 IACSDQRC----NNGIQSSD------ATCSSQNNQC-SYTFQYGDGSGTSGYYVSDMMHL 181
Query: 204 ---------PNRIIPNFLVGCS-------VLSSRQPAGIAGFGRGKTSLPSQLNLDKFSY 247
N P + GCS S R GI GFG+ + S+ SQL+ +
Sbjct: 182 NTIFEGSVTTNSTAP-VVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAP 240
Query: 248 CLLSHKFD-DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
+ SH D++ L+L + + YT V P+ +Y + L+
Sbjct: 241 RVFSHCLKGDSSGGGILVL------GEIVEPNIVYTSLV--PA-------QPHYNLNLQS 285
Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
I V GQ +++ + GTIVDSGTT ++A E ++P + + ++ +
Sbjct: 286 IAVNGQTLQIDSSVFATSN--SRGTIVDSGTTLAYLAEEAYDPFVSAITASIPQSVHTVV 343
Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT 426
+ G + C+ + T FP++ L+F GGA + L ++Y + +
Sbjct: 344 SRGNQ-------CYLITSSVTEVFPQVSLNFAGGASMILRPQDYLIQQNSIGGAAVWCIG 396
Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
++ G ILG+ +++ V YDL QR+G+ C
Sbjct: 397 FQKIQGQGITILGDLVLKDKIVVYDLAGQRIGWANYDC 434
>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
Length = 441
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 109/422 (25%), Positives = 159/422 (37%), Gaps = 38/422 (9%)
Query: 57 TRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFP 116
TR L + + + + H ++SL+ GTPPQ + +LDTGS L W
Sbjct: 33 TRPLLFELRARQVPAGALPRPASKLRFHHNVSLTVSLAVGTPPQNVTMVLDTGSELSWLL 92
Query: 117 CTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNC 176
C S + SF P+ S + + C + +C + RD P
Sbjct: 93 CAPGGGGGGGGRSAL-SFRPRASLTFASVPCGSAQC--------RSRDLPSPP--ACDGA 141
Query: 177 TQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLS-SRQPAGIA-----GFG 230
++ C L ++G +E + GC + P G+A G
Sbjct: 142 SKQCRVSLSYADGSSSDGALATEVFTVGQGPPLRAAFGCMATAFDTSPDGVATAGLLGMN 201
Query: 231 RGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSV 290
RG S SQ + +FSYC+ D L+L HSD L YTP P++
Sbjct: 202 RGALSFVSQASTRRFSYCI-----SDRDDAGVLLL----GHSDLPFLPLNYTPLYQ-PAM 251
Query: 291 AERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPL 350
V Y V L I VGG+ + + L D G G T+VDSGT FTF+ + + L
Sbjct: 252 PLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTMVDSGTQFTFLLGDAYSAL 311
Query: 351 ADEFVSQMVKNRNYTRALGAEALT---GLRPCFDVPGEKT--GSFPELKLHFKGGAEVTL 405
EF Q + + AL CF VP + P + L F G
Sbjct: 312 KAEFSRQ---TKPWLPALNDPNFAFQEAFDTCFRVPQGRAPPARLPAVTLLFNGAQMTVA 368
Query: 406 PVENYFAVVGE---GSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQ 462
+ V GE G V + + + ++G+ N +VEYDL R+G
Sbjct: 369 GDRLLYKVPGERRGGDGVWCLTFGNADMVPITAYVIGHHHQMNVWVEYDLERGRVGLAPI 428
Query: 463 LC 464
C
Sbjct: 429 RC 430
>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 431
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 123/441 (27%), Positives = 191/441 (43%), Gaps = 69/441 (15%)
Query: 39 NPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTP 98
N ++ S Q + + + S L N + + T+ G Y +++S GTP
Sbjct: 42 NSAETSSQRMRNAIRRSARSTLQFSNDDASPNSPQSFITSNR------GEYLMNISIGTP 95
Query: 99 PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE 158
P I I DTGS L+W C C+ C P F PK SS+ R + C
Sbjct: 96 PVPILAIADTGSDLIWTQCN---PCEDCYQQTSPLFDPKESSTYRKVSCS---------- 142
Query: 159 SIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPNR-----IIPNFL 212
S QCR D +T +N SY + YG + T+G +T+ + + + N +
Sbjct: 143 SSQCRALEDASCSTDENTC----SYTITYGDNSYTKGDVAVDTVTMGSSGRRPVSLRNMI 198
Query: 213 VGC--SVLSSRQPA--GIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTRTSSLIL 265
+GC + PA GI G G G TSL SQL KFSYCL+ F T +S I
Sbjct: 199 IGCGHENTGTFDPAGSGIIGLGGGSTSLVSQLRKSINGKFSYCLV--PFTSETGLTSKI- 255
Query: 266 DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
+ T G+ V + S+ +++ + YY++ L I+VG ++++ T+
Sbjct: 256 -------NFGTNGIVSGDGVVSTSMVKKDP-ATYYFLNLEAISVGSKKIQFTS---TIFG 304
Query: 326 DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGE 385
G G ++DSGTT T + + L + V+ +K G +L C+
Sbjct: 305 TGEGNIVIDSGTTLTLLPSNFYYEL-ESVVASTIKAERVQDPDGILSL-----CY----R 354
Query: 386 KTGSF--PELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQM 443
+ SF P++ +HFKGG +V L N F V E + C + + + I GN
Sbjct: 355 DSSSFKVPDITVHFKGG-DVKLGNLNTFVAVSEDVS-CFAFAANEQLT-----IFGNLAQ 407
Query: 444 QNYYVEYDLRNQRLGFKQQLC 464
N+ V YD + + FK+ C
Sbjct: 408 MNFLVGYDTVSGTVSFKKTDC 428
>gi|56542455|gb|AAV92892.1| Avr9/Cf-9 rapidly elicited protein 36, partial [Nicotiana tabacum]
Length = 191
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 60/197 (30%), Positives = 97/197 (49%), Gaps = 13/197 (6%)
Query: 268 GSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDG 327
G K L +T V + N +YYV ++ + VGG+ + + + L +G
Sbjct: 5 GEDKELLKHLNLNFTSLVG----GKENHLETFYYVQIKSVIVGGEVLNIPEETWNLSTEG 60
Query: 328 NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKT 387
GGTI+DSGTT ++ A +E + FV+++ R + L+PC++V G +
Sbjct: 61 VGGTIIDSGTTLSYFAEPAYEIIKQAFVNKV------KRYPILDDFPILKPCYNVSGVEK 114
Query: 388 GSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYY 447
P + F GA T PVENYF + VCL ++ ++ I+GN+Q QN++
Sbjct: 115 LELPSFGIVFGDGAIWTFPVENYFIKLEPEDIVCLAILGTPHSAMS---IIGNYQQQNFH 171
Query: 448 VEYDLRNQRLGFKQQLC 464
+ YD + RLGF + C
Sbjct: 172 ILYDTKRSRLGFAPRRC 188
>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
Length = 491
Score = 102 bits (255), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 112/392 (28%), Positives = 164/392 (41%), Gaps = 65/392 (16%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
+ +++ GTP Q I DTGS L W C +C + P F P SS+ + C
Sbjct: 149 FVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAAVHCG 208
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNL-PNR 206
P+C+ L + N T + YLV YG G T G+ +TL L +R
Sbjct: 209 EPQCAAAGG------------LCSEDNTTCL---YLVHYGDGSSTTGVLSRDTLALTSSR 253
Query: 207 IIPNFLVGCSVLSSRQPAGIAGFGR---------GKTSLPSQLNLD---KFSYCLLSHKF 254
+ F GC + + FGR G+ SLPSQ FSYCL S
Sbjct: 254 ALAGFPFGCGTRN------LGDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCLPS--- 304
Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
+ T+ + + +D T YT + P F +Y+V L I +GG +
Sbjct: 305 --SNSTTGYLTIGATPATD--TGAAQYTAMLRKPQ------FPSFYFVELVSIDIGGYIL 354
Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
V T GGT++DSGT T++ + +E L D F M + YT A + L
Sbjct: 355 PVPPAVFT-----RGGTLLDSGTVLTYLPAQAYELLRDRFRLTMER---YTPAPPNDVLD 406
Query: 375 GLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVG--EGSAVCLTVVTDREASG 432
C+D GE P + F GA L ++F V+ + + CL +A G
Sbjct: 407 A---CYDFAGESEVIVPAVSFRFGDGAVFEL---DFFGVMIFLDENVGCLAFAA-MDAGG 459
Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
P I+GN Q ++ V YD+ +++GF C
Sbjct: 460 LPLSIIGNTQQRSAEVIYDVAAEKIGFVPASC 491
>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
gi|224033441|gb|ACN35796.1| unknown [Zea mays]
gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
Length = 456
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 87/281 (30%), Positives = 121/281 (43%), Gaps = 40/281 (14%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + L+ GTPP+ + LDTGS LVW C C+ C IP P SS+ L C
Sbjct: 86 YLVHLAVGTPPRPVALTLDTGSDLVWTQCA---PCRDCFDQGIPLLDPAASSTYAALPCG 142
Query: 149 NPKCSWIHHESIQCRDC------NDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLN 202
P+C + S R C D+ + K I +G G +L
Sbjct: 143 APRCRALPFTSCGGRSCVYVYHYGDKSVTVGK----IATDRFTFGDNGRRNG---DGSLP 195
Query: 203 LPNRIIPNFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDT 257
R+ GC V S + GIAGFGRG+ SLPSQLN FSYC S FD
Sbjct: 196 ATRRLT----FGCGHFNKGVFQSNE-TGIAGFGRGRWSLPSQLNATSFSYCFTS-MFDSK 249
Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
+ +L + +S + + TP NPS Y++ L+ I+VG R+ V
Sbjct: 250 SSIVTLGGAPAALYSHAHSGEVRTTPLFKNPSQPS------LYFLSLKGISVGKTRLPVP 303
Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM 358
TI+DSG + T + E++E + EF +Q+
Sbjct: 304 ETKFR-------STIIDSGASITTLPEEVYEAVKAEFAAQV 337
>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
Length = 463
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 124/471 (26%), Positives = 185/471 (39%), Gaps = 84/471 (17%)
Query: 24 SSITSLTFSLSRFHTN----PSQDSYQNLNSLVSSSLTRALHIKNP-------------- 65
SS++ T +L+ H PS L+ RA HI+
Sbjct: 47 SSLSGTTVALNHRHGPCSPVPSSKKRPTEEELLKRDQLRAEHIQRKFAMNAAVDGAGDLQ 106
Query: 66 QTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWF---PCTNHYQ 122
Q+K +++ T +++ + Y IS+ GTP +DTGS + W PC N
Sbjct: 107 QSKVSSSVPTKLGSSLDTLEY---VISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPN--- 160
Query: 123 CKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPS 182
C + F P SS+ R + C +C+ + + C AT+ C
Sbjct: 161 -PPCHAQTGALFDPAKSSTYRAVSCAAAECAQLEQQGNGCG-------ATNYEC-----Q 207
Query: 183 YLVLYGSG-LTEGIALSETLNL--PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSL 236
Y V YG G T G +TL L + + F GCS L S Q G+ G G G SL
Sbjct: 208 YGVQYGDGSTTNGTYSRDTLTLSGASDAVKGFQFGCSHLESGFSDQTDGLMGLGGGAQSL 267
Query: 237 PSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAER 293
SQ + FSYCL +GSS G + FV + +
Sbjct: 268 VSQTAAAYGNSFSYCLPP--------------TSGSSGFLTLGGGGGASGFVTTRMLRSK 313
Query: 294 NAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADE 353
+Y L+ I VGG+++ + G++VDSGT T + P + L+
Sbjct: 314 Q-IPTFYGARLQDIAVGGKQLGLSPSVFA------AGSVVDSGTIITRLPPTAYSALSSA 366
Query: 354 FVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAV 413
F + M + R+ A A + L CFD G+ S P + L F GGA + L
Sbjct: 367 FKAGMKQYRS------APARSILDTCFDFAGQTQISIPTVALVFSGGAAIDLDPNGIM-- 418
Query: 414 VGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
G+ + D +G I+GN Q + + V YD+ + LGF+ C
Sbjct: 419 --YGNCLAFAATGDDGTTG----IIGNVQQRTFEVLYDVGSSTLGFRSGAC 463
>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
Length = 659
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 102/391 (26%), Positives = 168/391 (42%), Gaps = 56/391 (14%)
Query: 85 SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRL 144
S G Y+ L GTPPQ I+DTGS + + PC++ C++C + P F P SS+
Sbjct: 84 SNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSD---CEHCGKHQDPRFQPDESSTYHP 140
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLP 204
+ C N C+ H+ + C + A + + + ++ +G +++ +P
Sbjct: 141 VKC-NMDCN-CDHDGVNC--VYERRYAEMSSSSGVLGEDIISFG---------NQSEVVP 187
Query: 205 NRIIPNFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTR 259
R + GC L S++ GI G GRG+ S+ QL D
Sbjct: 188 QRAV----FGCENVETGDLYSQRADGIMGLGRGQLSIVDQL-------------VDKNVI 230
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAF-SVYYYVGLRRITVGGQRVRVWH 318
S L G H L P + + + + S YY + L+ I V G+ +++
Sbjct: 231 NDSFSLCYGGMHVGGGAMVLGGIPPPPDMVFSRSDPYRSPYYNIELKEIHVAGKPLKL-- 288
Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
T DR GT++DSGTT+ ++ E F D + K+ N + G +
Sbjct: 289 SPSTFDR--KHGTVLDSGTTYAYLPEEAFVAFRDAIIK---KSHNLKQIHGPDPNYN-DI 342
Query: 379 CFDVPGEK----TGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVVTDREASGG 433
CF G + +FPE+ + F G +++L ENY F A CL + + G
Sbjct: 343 CFSGAGRDVSQLSKAFPEVDMVFSNGQKLSLTPENYLFQHTKVHGAYCLGIFRN----GD 398
Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ +LG ++N V YD N+++GF + C
Sbjct: 399 STTLLGGIIVRNTLVTYDRENEKIGFWKTNC 429
>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 108/392 (27%), Positives = 153/392 (39%), Gaps = 42/392 (10%)
Query: 90 SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQN 149
++SL+ GTPPQ + +LDTGS L W C S+ SF P+ SS+ + C +
Sbjct: 86 TVSLAVGTPPQNVTMVLDTGSELSWLLCAPAGARNKFSAM---SFRPRASSTFAAVPCAS 142
Query: 150 PKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIP 209
+C + RD P + C L ++G ++ + +
Sbjct: 143 AQC--------RSRDLPSPP--ACDGASSRCSVSLSYADGSSSDGALATDVFAVGSGPPL 192
Query: 210 NFLVGC------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSL 263
GC S AG+ G RG S SQ + +FSYC+ D L
Sbjct: 193 RAAFGCMSSAFDSSPDGVASAGLLGMNRGALSFVSQASTRRFSYCI-----SDRDDAGVL 247
Query: 264 ILDNGSSHSDKKT-TGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
+L HSD T L YTP P++ V Y V L I VGG+ + + L
Sbjct: 248 LL----GHSDLPTFLPLNYTPMYQ-PALPLPYFDRVAYSVQLLGIRVGGKHLPIPASVLA 302
Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT---GLRPC 379
D G G T+VDSGT FTF+ + + L EF Q R AL + C
Sbjct: 303 PDHTGAGQTMVDSGTQFTFLLGDAYSALKAEFTRQA---RPLLPALDDPSFAFQEAFDTC 359
Query: 380 FDVPGEK---TGSFPELKLHFKGGAEVTLPVENYFAVVGE---GSAVCLTVVTDREASGG 433
F VP + T P + L F G + V GE G V + +
Sbjct: 360 FRVPQGRSPPTARLPGVTLLFNGAEMAVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPI 419
Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
+ ++G+ N +VEYDL R+G C
Sbjct: 420 MAYVIGHHHQMNVWVEYDLERGRVGLAPVRCD 451
>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 436
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 113/397 (28%), Positives = 173/397 (43%), Gaps = 66/397 (16%)
Query: 85 SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRL 144
++G Y + GTPP I DT S L+W C+ C+ C P F P SS+
Sbjct: 86 NHGEYLMRFYIGTPPVERLAIADTASDLIWVQCS---PCETCFPQDTPLFEPHKSSTFAN 142
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNL 203
L C + C+ S C PL + +C Y YG G T+G+ +E+++
Sbjct: 143 LSCDSQPCT-----SSNIYYC---PLVGN-----LC-LYTNTYGDGSSTKGVLCTESIHF 188
Query: 204 PNRII--PNFLVGCSVLS------SRQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSH 252
++ + P + GC + S + GI G G G SL SQL KFSYCLL
Sbjct: 189 GSQTVTFPKTIFGCGSNNDFMHQISNKVTGIVGLGAGPLSLVSQLGDQIGHKFSYCLLPF 248
Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
T TS++ L G+ + G+ TP + +P + YY++ L IT+G +
Sbjct: 249 -----TSTSTIKLKFGND-TTITGNGVVSTPLIIDPH------YPSYYFLHLVGITIGQK 296
Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
++V T D NG I+D GT T++ + FV+ + ALG
Sbjct: 297 MLQVR----TTDHT-NGNIIIDLGTVLTYLEVNFYH----NFVTLL------REALGISE 341
Query: 373 LTGLRP-----CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTD 427
P CF P + +FP++ F GA+V L +N F + + +CL V+ D
Sbjct: 342 TKDDIPYPFDFCF--PNQANITFPKIVFQFT-GAKVFLSPKNLFFRFDDLNMICLAVLPD 398
Query: 428 REASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
A G + GN ++ VEYD + +++ F C
Sbjct: 399 FYAKGFS--VFGNLAQVDFQVEYDRKGKKVSFAPADC 433
>gi|242041431|ref|XP_002468110.1| hypothetical protein SORBIDRAFT_01g039750 [Sorghum bicolor]
gi|241921964|gb|EER95108.1| hypothetical protein SORBIDRAFT_01g039750 [Sorghum bicolor]
Length = 467
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 105/426 (24%), Positives = 167/426 (39%), Gaps = 78/426 (18%)
Query: 90 SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQN 149
++ ++ G PPQ + +LDTGS L W C+ S++PS P+ + + G +
Sbjct: 60 TVPVAVGAPPQNVTMVLDTGSELSWL---------LCNGSRVPSTPPQPQAPAAFNGSAS 110
Query: 150 -----------PKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALS 198
P+C W + RD P + C L + +G+ +
Sbjct: 111 STYAAAHCSSSPECQW------RGRDLPVPPFCAGPP-SNSCRVSLSYADASSADGVLAA 163
Query: 199 ETLNLPNRIIPNFLVGC--------------------SVLSSRQPAGIAGFGRGKTSLPS 238
+T L L GC + SS G+ G RG S +
Sbjct: 164 DTFLLGGAPPVRALFGCITSYSSSSTADGNGNGNDASATNSSEAATGLLGMNRGSLSFVT 223
Query: 239 QLNLDKFSYCLLSHKFDDTTRTSSLILD-NGSSHSDKKTTGLTYTPFVNNPSVAERNAF- 296
Q +F+YC+ L+L +G + L YTP + +++ +
Sbjct: 224 QTGTLRFAYCIAPGDGPGL-----LVLGGDGDGAALSAAPQLNYTPLIE---MSQPLPYF 275
Query: 297 -SVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFV 355
V Y V L I VG + + L D G G T+VDSGT FTF+ + + PL EF+
Sbjct: 276 DRVAYSVQLEGIRVGAALLPIPKSVLAPDHTGAGQTMVDSGTQFTFLLADAYAPLKGEFL 335
Query: 356 SQMVKNRNYTRALGAEALT---GLRPCFDVPGEKTGS------FPELKLHFKGGAEVTLP 406
+Q LG CF + + PE+ L + GAEV +
Sbjct: 336 NQ---TSALLAPLGEPDFVFQGAFDACFRASEARVAAATASQLLPEVGLVLR-GAEVAVG 391
Query: 407 VENYFAVV-----GEGSAVCLTVVT--DREASGGPSIILGNFQMQNYYVEYDLRNQRLGF 459
E +V GEG + + +T + + +G + ++G+ QN +VEYDL+N R+GF
Sbjct: 392 GEKLLYMVPGERRGEGGSEAVWCLTFGNSDMAGMSAYVIGHHHQQNVWVEYDLQNSRVGF 451
Query: 460 KQQLCK 465
C
Sbjct: 452 APARCD 457
>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 494
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 107/397 (26%), Positives = 167/397 (42%), Gaps = 55/397 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIP--SFIPKLSSSSRL 144
G Y + GTPP+ +DTGS ++W C++ C S I F SS++RL
Sbjct: 79 GLYFTRVKLGTPPREFNVQIDTGSDVLWVTCSSCSNCPQTSGLGIQLNYFDTTSSSTARL 138
Query: 145 LGCQNPKC-SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLN 202
+ C +P C S I + QC S C SY YG G T G +S+T
Sbjct: 139 VPCSHPICTSQIQTTATQCP-------PQSNQC-----SYAFQYGDGSGTSGYYVSDTFY 186
Query: 203 ----LPNRIIPN----FLVGCSVLSS-------RQPAGIAGFGRGKTSLPSQLNLDKFSY 247
L +I N + GCS S + GI GFG+G+ S+ SQL+ +
Sbjct: 187 FDAVLGESLIANSSAAIVFGCSTYQSGDLTKTDKAVDGIFGFGQGELSVISQLSSHGITP 246
Query: 248 CLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRI 307
+ SH +++ + G+ Y+P V PS +Y + L+ I
Sbjct: 247 RVFSHCLKGEDSGGGILV-----LGEILEPGIVYSPLV--PS-------QPHYNLDLQSI 292
Query: 308 TVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRA 367
V GQ + + N GTI+D+GTT ++ E ++P FVS +
Sbjct: 293 AVSGQLLPI--DPAAFATSSNRGTIIDTGTTLAYLVEEAYDP----FVSAITAA---VSQ 343
Query: 368 LGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTD 427
L + C+ V + FP + +F GGA + L E Y + + L +
Sbjct: 344 LATPTINKGNQCYLVSNSVSEVFPPVSFNFAGGATMLLKPEEYLMYLTNYAGAALWCIGF 403
Query: 428 REASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
++ GG + ILG+ +++ YDL +QR+G+ C
Sbjct: 404 QKIQGGIT-ILGDLVLKDKIFVYDLAHQRIGWANYDC 439
>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 467
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 116/456 (25%), Positives = 177/456 (38%), Gaps = 70/456 (15%)
Query: 45 YQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGG-YSISLSFGTPPQIIP 103
++ L + S R I P+ T++ S GG Y + L GTP
Sbjct: 44 HELLRRAIQRSRDRLASIA-PRLLPTSSRNKVVVAEAPVLSAGGEYLVKLGLGTPQHCFT 102
Query: 104 FILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIH-HESIQC 162
+DT S L+W C C C P F P S+S ++ C + C + H +
Sbjct: 103 AAIDTASDLIWTQCQ---PCVKCYKQLDPVFNPVASTSYAVVPCNSDTCDELDTHRCARD 159
Query: 163 RDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQ 222
D +DE + Q SY G+ T GI + L + + + + GCS S
Sbjct: 160 GDSDDE------DACQYTYSY---GGNATTRGILAVDRLAIGDDVFRGVVFGCSSSSVGG 210
Query: 223 P----AGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTG 278
P +G+ G GRG SL SQL++ +F YCL R L+L ++ + + +
Sbjct: 211 PPPQVSGVVGLGRGALSLVSQLSVRRFMYCLPPPVSRSAGR---LVLGADAAATVRNASE 267
Query: 279 LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNG--------- 329
P S R + YYY+ L I++ G R + ++ G
Sbjct: 268 RVVVPM----STGSR--YPSYYYLNLDGISI-GDRAMSFRSRNRMNATTPGTAAGAPASP 320
Query: 330 -----------------GTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
G I+D +T TF+ L+E + D+ ++ R G+ +
Sbjct: 321 VSGSGDGDGSGTGPDAYGMIIDIASTITFLEESLYEEMVDDLEEEIRLPR------GSGS 374
Query: 373 LTGLRPCFDVPGEKTGS---FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDRE 429
GL CF +P S P + L F+ G + L E F +CL V
Sbjct: 375 DLGLDLCFILPEGVPMSRVYAPPVSLAFE-GVWLRLDKEQMFVEDRASGMMCLMVGKTDG 433
Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
S ILGN+Q QN V Y+LR R+ F + C+
Sbjct: 434 VS-----ILGNYQQQNMQVMYNLRRGRITFIKTACE 464
>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 456
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 111/391 (28%), Positives = 167/391 (42%), Gaps = 54/391 (13%)
Query: 88 GYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGC 147
G+ ++LS G+PP ++DTGS L+W C C C F P S S + LGC
Sbjct: 103 GFLVNLSIGSPPVTQLVVVDTGSSLLWVQCL---PCINCFQQSTSWFDPLKSVSFKTLGC 159
Query: 148 QNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIA-----LSETLN 202
P ++I+ CN A K L G ++GI L ETL+
Sbjct: 160 GFPGYNYIN-----GYKCNRFNQAEYK---------LRYLGGDSSQGILAKESLLFETLD 205
Query: 203 LPNRIIPNFLVGC---SVLSSRQPAGIAGFGRGK---TSLPSQLNLDKFSYCLLSHKFDD 256
N GC ++ ++ A FG G ++ +QL +KFSYC+ ++
Sbjct: 206 EGKIKKSNITFGCGHMNIKTNNDDAYNGVFGLGAYPHITMATQLG-NKFSYCI--GDINN 262
Query: 257 TTRTSS-LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
T + L+L GS ++ S + F +YYV L+ I+VG + ++
Sbjct: 263 PLYTHNHLVLGQGS--------------YIEGDSTPLQIHFG-HYYVTLQSISVGSKTLK 307
Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
+ + DG+GG ++DSG T+T +A FE L DE V M R G
Sbjct: 308 IDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLM--KGLLERIPTQRKFEG 365
Query: 376 LRPCFD-VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGP 434
L CF V FP + HF GGA++ L + F G G CL ++
Sbjct: 366 L--CFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHG-GDRFCLAILPSNSELLNL 422
Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
S+I G QNY V +DL ++ F++ C+
Sbjct: 423 SVI-GILAQQNYNVGFDLEQMKVFFRRIDCQ 452
>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 504
Score = 102 bits (253), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 104/400 (26%), Positives = 165/400 (41%), Gaps = 59/400 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSS 141
G Y + G+PP+ +DTGS ++W C+ C C SS ++ F P SS+
Sbjct: 89 GLYFTRVKLGSPPKEYFVQIDTGSDILWVACS---PCTGCPSSSGLNIQLEFFNPDTSST 145
Query: 142 SRLLGCQNPKC-SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSE 199
S + C + +C + + C+ ++ P Y YG G T G +S+
Sbjct: 146 SSKIPCSDDRCTAALQTSEAVCQTSDNSPCG-----------YTFTYGDGSGTSGYYVSD 194
Query: 200 TLN----LPNRIIPN----FLVGCS-------VLSSRQPAGIAGFGRGKTSLPSQLNLDK 244
T+ + N N + GCS + R GI GFG+ + S+ SQLN
Sbjct: 195 TMYFDTVMGNEQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLG 254
Query: 245 FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGL 304
S + SH + +++ + GL YTP V PS +Y + L
Sbjct: 255 VSPKVFSHCLKGSDNGGGILV-----LGEIVEPGLVYTPLV--PS-------QPHYNLNL 300
Query: 305 RRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNY 364
I V GQ++ + T GTIVDSGTT ++A ++P + + + +
Sbjct: 301 ESIVVNGQKLPIDSSLFTTSN--TQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPS--- 355
Query: 365 TRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTV 424
R+L ++ CF SFP + L+F GG +T+ ENY L
Sbjct: 356 VRSLVSKG----NQCFVTSSSVDSSFPTVSLYFMGGVAMTVKPENYLLQQASIDNNVLWC 411
Query: 425 VTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ + G ILG+ +++ YDL N R+G+ C
Sbjct: 412 IGWQRNQGQQITILGDLVLKDKIFVYDLANMRMGWTDYDC 451
>gi|79507883|ref|NP_196320.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332003717|gb|AED91100.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 455
Score = 102 bits (253), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 99/384 (25%), Positives = 151/384 (39%), Gaps = 52/384 (13%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + GTP Q + +DT S + W PC+ C C S+ +F P S+S + + C
Sbjct: 115 YIVKALIGTPAQPLLLAMDTSSDVAWIPCSG---CVGCPSNT--AFSPAKSTSFKNVSCS 169
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
P+C + + + R C S+ + YGS +T+ L I
Sbjct: 170 APQCKQVPNPTCGARAC----------------SFNLTYGSSSIAANLSQDTIRLAADPI 213
Query: 209 PNFLVGC--------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRT 260
F GC ++ + G+ S + FSYCL S F T +
Sbjct: 214 KAFTFGCVNKVAGGGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPS--FRSLTFS 271
Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
SL L S K YT + NP R++ YYV L I VG + V +
Sbjct: 272 GSLRLGPTSQPQRVK-----YTQLLRNP---RRSSL---YYVNLVAIRVGRKVVDLPPAA 320
Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
+ + GTI DSGT +T +A ++E + +EF ++ +LG F
Sbjct: 321 IAFNPSTGAGTIFDSGTVYTRLAKPVYEAVRNEFRKRVKPTTAVVTSLGG---------F 371
Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
D P + FK G +T+P +N GS CL + E ++ +
Sbjct: 372 DTCYSGQVKVPTITFMFK-GVNMTMPADNLMLHSTAGSTSCLAMAAAPENVNSVVNVIAS 430
Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
Q QN+ V D+ N RLG ++ C
Sbjct: 431 MQQQNHRVLIDVPNGRLGLARERC 454
>gi|125528511|gb|EAY76625.1| hypothetical protein OsI_04577 [Oryza sativa Indica Group]
Length = 492
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 103/380 (27%), Positives = 155/380 (40%), Gaps = 47/380 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y S GTPPQ + LD S LVW C F P S++ +
Sbjct: 98 GMYVFSYGIGTPPQQVSGALDISSDLVWTAC-----------GATAPFNPVRSTTVADVP 146
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL--TEGIALSETLNLP 204
C + C ++ C A S C +Y +YG G T G+ +E
Sbjct: 147 CTDDACQQFAPQT-----CGAGAGAGSSEC-----AYTYMYGGGAANTTGLLGTEAFTFG 196
Query: 205 NRIIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
+ I + GC +V +G+ G GRG SL SQL +D+FSY DD+ T
Sbjct: 197 DTRIDGVVFGCGLQNVGDFSGVSGVIGLGRGNLSLVSQLQVDRFSYHFAP---DDSVDTQ 253
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
S IL D T ++T + + +A YYV L I V G+ + +
Sbjct: 254 SFIL-----FGDDATPQTSHT---LSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTF 305
Query: 322 TL-DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
L ++DG+GG + T + ++PL Q V ++ A+ AL GL C+
Sbjct: 306 DLRNKDGSGGVFLSITDLVTVLEEAAYKPL-----RQAVASKIGLPAVNGSAL-GLDLCY 359
Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
P + L F GGA + L + NYF + CLT++ +S G +LG+
Sbjct: 360 TGESLAKAKVPSMALVFAGGAVMELELGNYFYMDSTTGLACLTIL---PSSAGDGSVLGS 416
Query: 441 FQMQNYYVEYDLRNQRLGFK 460
++ YD+ +L F+
Sbjct: 417 LIQVGTHMMYDINGSKLVFE 436
>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
Length = 504
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 104/400 (26%), Positives = 165/400 (41%), Gaps = 59/400 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSS 141
G Y + G+PP+ +DTGS ++W C+ C C SS ++ F P SS+
Sbjct: 89 GLYFTRVKLGSPPKEYFVQIDTGSDILWVACS---PCTGCPSSSGLNIQLEFFNPDTSST 145
Query: 142 SRLLGCQNPKC-SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSE 199
S + C + +C + + C+ ++ P Y YG G T G +S+
Sbjct: 146 SSKIPCSDDRCTAALQTSEAVCQTSDNSPCG-----------YTFTYGDGSGTSGYYVSD 194
Query: 200 TLN----LPNRIIPN----FLVGCS-------VLSSRQPAGIAGFGRGKTSLPSQLNLDK 244
T+ + N N + GCS + R GI GFG+ + S+ SQLN
Sbjct: 195 TMYFDSVMGNEQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLG 254
Query: 245 FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGL 304
S + SH + +++ + GL YTP V PS +Y + L
Sbjct: 255 VSPKVFSHCLKGSDNGGGILV-----LGEIVEPGLVYTPLV--PS-------QPHYNLNL 300
Query: 305 RRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNY 364
I V GQ++ + T GTIVDSGTT ++A ++P + + + +
Sbjct: 301 ESIVVNGQKLPIDSSLFTTSN--TQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPS--- 355
Query: 365 TRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTV 424
R+L ++ CF SFP + L+F GG +T+ ENY L
Sbjct: 356 VRSLVSKG----NQCFVTSSSVDSSFPTVSLYFMGGVAMTVKPENYLLQQASIDNNVLWC 411
Query: 425 VTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ + G ILG+ +++ YDL N R+G+ C
Sbjct: 412 IGWQRNQGQQITILGDLVLKDKIFVYDLANMRMGWTDYDC 451
>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
Length = 466
Score = 101 bits (252), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 127/464 (27%), Positives = 185/464 (39%), Gaps = 74/464 (15%)
Query: 23 PSSITSLTFSLSRFHTN----PSQDSYQNLNSLVSSSLTRALHIK---------NPQTKT 69
PS+ +T L H PS +L + RA +IK + +
Sbjct: 55 PSTSGGITVPLHHRHGPCSPVPSNKMPASLEERLQRDQLRAAYIKRKFSGAKGGDVEQSD 114
Query: 70 TTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS 129
T TT T++S+ Y I++ G+P +DTGS + W C C C S
Sbjct: 115 AATVPTTLGTSLSTLEY---VITVGIGSPAVTQTMSMDTGSDVSWVQCK---PCSQCHSE 168
Query: 130 KIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS 189
F P SS+ C + C + +S Q C +S C Y+V Y
Sbjct: 169 VDSLFDPSASSTYSPFSCSSAACVQLS-QSQQGNGC------SSSQC-----QYIVSYVD 216
Query: 190 GL-TEGIALSETLNLPNRIIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNL-- 242
G T G S+TL L + I F GCS S Q G+ G G SL SQ
Sbjct: 217 GSSTTGTYSSDTLTLGSNAIKGFQFGCSQSESGGFSDQTDGLMGLGGDAQSLVSQTAGTF 276
Query: 243 -DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYY 301
FSYCL T SS L G++ +G TP + + + YY
Sbjct: 277 GKAFSYCL------PPTPGSSGFLTLGAA----SRSGFVKTPMLRSTQIP------TYYG 320
Query: 302 VGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN 361
V L I VGGQ++ + + G+++DSGT T + P + L+ F + M K
Sbjct: 321 VLLEAIRVGGQQLNIPTSVFS------AGSVMDSGTVITRLPPTAYSALSSAFKAGMKKY 374
Query: 362 RNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVC 421
A+ L CFD G+ + S P + L F GGA V L ++ ++ E C
Sbjct: 375 PP------AQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVNL---DFNGIMLELDNWC 425
Query: 422 LTVVTDREASGGPSI-ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
L + + S S+ +GN Q + + V YD+ +GF+ C
Sbjct: 426 LAFAANSDDS---SLGFIGNVQQRTFEVLYDVGGGAVGFRAGAC 466
>gi|297810815|ref|XP_002873291.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
gi|297319128|gb|EFH49550.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
Length = 439
Score = 101 bits (252), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 99/384 (25%), Positives = 153/384 (39%), Gaps = 52/384 (13%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + + GTP Q + +DT S + W PC+ C C S+ +F P S+S + + C
Sbjct: 99 YIVKVLIGTPAQPLLLAMDTSSDVAWIPCSG---CVGCPSNT--AFSPAKSTSFKNVSCS 153
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
P+C + + + R C S+ + YGS +T+ L I
Sbjct: 154 APQCKQVPNPACGARAC----------------SFNLTYGSSSIAANLSQDTIRLAADPI 197
Query: 209 PNFLVGC--------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRT 260
F GC ++ + G+ S + FSYCL S F T +
Sbjct: 198 KAFTFGCVNKVAGGGTIPPPQGLLGLGRGPLSLMSQAQSVYKSTFSYCLPS--FRSLTFS 255
Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
SL L S K YT + NP R++ YYV L I VG + V +
Sbjct: 256 GSLRLGPTSQPQRVK-----YTQLLRNP---RRSSL---YYVNLVAIRVGRKVVDLPPAA 304
Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
+ + GTI DSGT +T +A ++E + +EF ++ +LG F
Sbjct: 305 IAFNPSTGAGTIFDSGTVYTRLAKPVYEAVRNEFRKRVKPPTAVVTSLGG---------F 355
Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
D P + FKG +T+P +N GS CL + + E ++ +
Sbjct: 356 DTCYSGQVKVPTITFMFKG-VNMTMPADNLMLHSTAGSTSCLAMASAPENVNSVVNVIAS 414
Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
Q QN+ V D+ N RLG ++ C
Sbjct: 415 MQQQNHRVLIDVPNGRLGLARERC 438
>gi|9759559|dbj|BAB11161.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|21553652|gb|AAM62745.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|109134179|gb|ABG25087.1| At5g07030 [Arabidopsis thaliana]
Length = 439
Score = 101 bits (252), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 99/384 (25%), Positives = 151/384 (39%), Gaps = 52/384 (13%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + GTP Q + +DT S + W PC+ C C S+ +F P S+S + + C
Sbjct: 99 YIVKALIGTPAQPLLLAMDTSSDVAWIPCSG---CVGCPSNT--AFSPAKSTSFKNVSCS 153
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
P+C + + + R C S+ + YGS +T+ L I
Sbjct: 154 APQCKQVPNPTCGARAC----------------SFNLTYGSSSIAANLSQDTIRLAADPI 197
Query: 209 PNFLVGC--------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRT 260
F GC ++ + G+ S + FSYCL S F T +
Sbjct: 198 KAFTFGCVNKVAGGGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPS--FRSLTFS 255
Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
SL L S K YT + NP R++ YYV L I VG + V +
Sbjct: 256 GSLRLGPTSQPQRVK-----YTQLLRNP---RRSSL---YYVNLVAIRVGRKVVDLPPAA 304
Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
+ + GTI DSGT +T +A ++E + +EF ++ +LG F
Sbjct: 305 IAFNPSTGAGTIFDSGTVYTRLAKPVYEAVRNEFRKRVKPTTAVVTSLGG---------F 355
Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
D P + FK G +T+P +N GS CL + E ++ +
Sbjct: 356 DTCYSGQVKVPTITFMFK-GVNMTMPADNLMLHSTAGSTSCLAMAAAPENVNSVVNVIAS 414
Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
Q QN+ V D+ N RLG ++ C
Sbjct: 415 MQQQNHRVLIDVPNGRLGLARERC 438
>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
Length = 452
Score = 101 bits (252), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 108/387 (27%), Positives = 159/387 (41%), Gaps = 58/387 (14%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + GTPPQ + +DT + W PC C C +S P F P S+S R + C
Sbjct: 110 YVVRARLGTPPQQLLLAVDTSNDAAWIPCAG---CAGCPTSSAPPFDPAASTSYRSVPCG 166
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
+P C+ P A + C L S L ++ ++L + +
Sbjct: 167 SPLCA-------------QAPNAACPPGGKACGFSLTYADSSLQAALS-QDSLAVAGDAV 212
Query: 209 PNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHK---FDDTTR 259
+ GC + ++ P G+ G GRG S SQ + FSYCL S K F T R
Sbjct: 213 KTYTFGCLQKATGTAAPPQGLLGLGRGPLSFLSQTRDMYQGTFSYCLPSFKSLNFSGTLR 272
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
+ NG K T P + NP S YYV + I VG + V +
Sbjct: 273 ----LGRNGQPPRIKTT------PLLANPH------RSSLYYVNMTGIRVGRKVVPIPPP 316
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA--EALTGLR 377
L D GT++DSGT FT + + + DE R +GA +L G
Sbjct: 317 ALAFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEV----------RRRVGAPVSSLGGFD 366
Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
CF+ ++P + L F G +VTLP EN G+ CL + + +
Sbjct: 367 TCFNT---TAVAWPPVTLLFDG-MQVTLPEENVVIHSTYGTISCLAMAAAPDGVNTVLNV 422
Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ + Q QN+ V +D+ N R+GF ++ C
Sbjct: 423 IASMQQQNHRVLFDVPNGRVGFARERC 449
>gi|18405138|ref|NP_565911.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
gi|13877759|gb|AAK43957.1|AF370142_1 unknown protein [Arabidopsis thaliana]
gi|15293231|gb|AAK93726.1| unknown protein [Arabidopsis thaliana]
gi|20196976|gb|AAB87120.2| expressed protein [Arabidopsis thaliana]
gi|20197046|gb|AAM14894.1| expressed protein [Arabidopsis thaliana]
gi|330254616|gb|AEC09710.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 112/394 (28%), Positives = 172/394 (43%), Gaps = 58/394 (14%)
Query: 90 SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQN 149
+++L+ G PPQ I +LDTGS L W C S + F P SS+ + C +
Sbjct: 66 TVTLAVGDPPQNISMVLDTGSELSWLHCKK-------SPNLGSVFNPVSSSTYSPVPCSS 118
Query: 150 PKCSWIHHESIQCRDCNDEPL-ATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
P C + RD P+ A+ T +C + + EG ET + +
Sbjct: 119 PICR------TRTRDL---PIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVTR 169
Query: 209 PNFLVGC--SVLSSR-----QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
P L GC S LSS + G+ G RG S +QL KFSYC+ + +S
Sbjct: 170 PGTLFGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFSKFSYCI------SGSDSS 223
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
+L +S+S + YTP V S V Y V L I VG + + +
Sbjct: 224 GFLLLGDASYS--WLGPIQYTPLVLQ-STPLPYFDRVAYTVQLEGIRVGSKILSLPKSVF 280
Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQ------MVKNRNY----TRALGAE 371
D G G T+VDSGT FTF+ ++ L +EF++Q +V + ++ T L +
Sbjct: 281 VPDHTGAGQTMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCYK 340
Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVEN-YFAVVGEGS-----AVCLTVV 425
+ RP F P + L F+ GAE+++ + + V G GS C T
Sbjct: 341 VGSTTRPNFS-------GLPMVSLMFR-GAEMSVSGQKLLYRVNGAGSEGKEEVYCFT-F 391
Query: 426 TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGF 459
+ + G + ++G+ QN ++E+DL R+GF
Sbjct: 392 GNSDLLGIEAFVIGHHHQQNVWMEFDLAKSRVGF 425
>gi|356507997|ref|XP_003522749.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 440
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 104/400 (26%), Positives = 166/400 (41%), Gaps = 48/400 (12%)
Query: 72 TTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKI 131
T +T + + + G Y + + GTP Q++ +LDT + + PC+ C CS +
Sbjct: 83 TVSTAPIASGQTFNIGNYVVRVKLGTPGQLLFMVLDTSTDEAFVPCSG---CTGCSDT-- 137
Query: 132 PSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL 191
+F PK S+S L C P+C + + C P + C S+ Y
Sbjct: 138 -TFSPKASTSYGPLDCSVPQCGQVR--GLSC------PATGTGAC-----SFNQSYAGSS 183
Query: 192 TEGIALSETLNLPNRIIPNFLVGC--SVLSSRQPAGIAGFGRGKTSLPSQLNLDK----F 245
+ ++L L +IPN+ GC ++ + PA + F
Sbjct: 184 FSATLVQDSLRLATDVIPNYSFGCVNAITGASVPAQGLLGLGRGPLSLLSQSGSNYSGIF 243
Query: 246 SYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLR 305
SYCL S F + SL L +TT L +P + PS+ YYV
Sbjct: 244 SYCLPS--FKSYYFSGSLKLGPVGQPKSIRTTPLLRSP--HRPSL---------YYVNFT 290
Query: 306 RITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYT 365
I+VG V +YL + + GTI+DSGT T ++ + +EF Q V +T
Sbjct: 291 GISVGRVLVPFPSEYLGFNPNTGSGTIIDSGTVITRFVEPVYNAVREEFRKQ-VGGTTFT 349
Query: 366 RALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV 425
++GA CF E P + LHF+G ++ LP+EN GS CL +
Sbjct: 350 -SIGA-----FDTCFVKTYETLA--PPITLHFEG-LDLKLPLENSLIHSSAGSLACLAMA 400
Query: 426 TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
+ ++ NFQ QN + +D N ++G +++C
Sbjct: 401 AAPDNVNSVLNVIANFQQQNLRILFDTVNNKVGIAREVCN 440
>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
Length = 525
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 103/373 (27%), Positives = 159/373 (42%), Gaps = 37/373 (9%)
Query: 102 IPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKC-SWIHHESI 160
+ I+DTGS L W C C C + + P F P S+S + C C + + +
Sbjct: 177 LTVIVDTGSDLTWVQCK---PCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATG 233
Query: 161 QCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLS 219
C ++ C Y + YG G + G+ ++T+ L + F+ GC LS
Sbjct: 234 VPGSCATVGGGGGGGKSERC-YYSLAYGDGSFSRGVLATDTVALGGASVDGFVFGCG-LS 291
Query: 220 SRQ----PAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHS 272
+R AG+ G GR + SL SQ FSYCL + D + SL G + S
Sbjct: 292 NRGLFGGTAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAATSGDAAGSLSL---GGDTSS 348
Query: 273 DKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTI 332
+ T ++YT + +P A +Y++ + +VGG V +
Sbjct: 349 YRNATPVSYTRMIADP------AQPPFYFMNVTGASVGGAAVAAAGLGAA-------NVL 395
Query: 333 VDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPE 392
+DSGT T +AP ++ + EF Q R A + L C+++ G P
Sbjct: 396 LDSGTVITRLAPSVYRAVRAEFARQF----GAERYPAAPPFSLLDACYNLTGHDEVKVPL 451
Query: 393 LKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYD 451
L L +GGA++T+ F +GS VCL + + P I+GN+Q +N V YD
Sbjct: 452 LTLRLEGGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTP--IIGNYQQKNKRVVYD 509
Query: 452 LRNQRLGFKQQLC 464
RLGF + C
Sbjct: 510 TVGSRLGFADEDC 522
>gi|357439021|ref|XP_003589787.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355478835|gb|AES60038.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 456
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 100/385 (25%), Positives = 150/385 (38%), Gaps = 62/385 (16%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + G+P ++D+GS +VW C C C + P F P S+S +
Sbjct: 127 GEYFVRIGIGSPAIYQYMVIDSGSDIVWIQCE---PCDQCYNQTDPIFNPATSASFIGVA 183
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C + C+ + + + CR C Y V YG G T+G ET+ +
Sbjct: 184 CSSNVCNQLD-DDVACR---------KGRC-----GYQVAYGDGSYTKGTLALETITIGR 228
Query: 206 RIIPNFLVGCSVLSSRQ---PAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTR 259
+I + +GC + AG+ G G G S QL F YCL+S
Sbjct: 229 TVIQDTAIGCGHWNEGMFVGAAGLLGLGGGPMSFVGQLGAQTGGAFGYCLVSRAM----- 283
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
G + P ++NP + +YYV L + VGG RV + +
Sbjct: 284 ----------------PVGAMWVPLIHNP------FYPSFYYVSLSGLAVGGIRVPISEQ 321
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
L G GG ++D+GT T + + D F++Q N RA G C
Sbjct: 322 IFQLTDIGTGGVVMDTGTAITRLPTVAYNAFRDAFIAQTT---NLPRAPGVSIFD---TC 375
Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
+D+ G T P + +F GG +T P N+ + C + G SII G
Sbjct: 376 YDLNGFVTVRVPTVSFYFSGGQILTFPARNFLIPADDVGTFCFAFA---PSPSGLSII-G 431
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
N Q + V D N +GF +C
Sbjct: 432 NIQQEGIQVSIDGTNGFVGFGPNVC 456
>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
Length = 462
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 109/451 (24%), Positives = 176/451 (39%), Gaps = 63/451 (13%)
Query: 34 SRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISL 93
SR NPS S + S+ H KNP ++TTT +G Y S+
Sbjct: 52 SRVKANPSPSSAAQKSLFPYSAHIFQQHTKNPAALRSSTTTL-------GRKFGEYYTSI 104
Query: 94 SFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCS 153
G+P Q I+DTGS L W C CK C+ S + S+S R + C N +
Sbjct: 105 KLGSPGQEAILIVDTGSELTWLQC---LPCKVCAPSVDTIYDAARSASYRPVTCNNSQL- 160
Query: 154 WIHHESIQCRDCNDEPLATSKNCTQICP-SYLVLYGSG-LTEGIALSETLNLPNRI---- 207
C++ T C + + YG G + G ++TL + +
Sbjct: 161 -----------CSNSSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKP 209
Query: 208 --IPNFLVGCSV----LSSRQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTT 258
+ +F GC+ L +GI G GK +LP QL KFS+C +
Sbjct: 210 VTVQDFAFGCAQGDLELVPTGASGILGLNAGKMALPMQLGQRFGWKFSHCF-PDRSSHLN 268
Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
T + N ++ + YT S +R +Y+V L+ +++ H
Sbjct: 269 STGVVFFGNAELPHEQ----VQYTSVALTNSELQRK----FYHVALKGVSINS------H 314
Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYT-RALGAEALTGLR 377
+ + L R I+DSG++F+ P + +K+R + + L ++ L
Sbjct: 315 ELVFLPR--GSVVILDSGSSFS----SFVRPFHSQLREAFLKHRPPSLKHLEGDSFGDLG 368
Query: 378 PCFDVPGEKTG----SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
CF V + + P L L F+ G + +P V +
Sbjct: 369 TCFKVSNDDIDELHRTLPSLSLVFEDGVTIGIPSIGVLLPVARFQNHVKMCFAFEDGGPN 428
Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
P ++GN+Q QN +VEYD++ R+GF + C
Sbjct: 429 PVNVIGNYQQQNLWVEYDIQRSRVGFARASC 459
>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 113/397 (28%), Positives = 174/397 (43%), Gaps = 64/397 (16%)
Query: 90 SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQN 149
+++L+ G+PPQ I +LDTGS L W C S + F P SS+ + C +
Sbjct: 62 TVTLAVGSPPQNISMVLDTGSELSWLHCKK-------SPNLGSVFNPVSSSTYSPVPCSS 114
Query: 150 PKCSWIHHESIQCRDCNDEPL-ATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
P C D P+ A+ T C + + EG +T + +
Sbjct: 115 PICR---------TRTRDLPIPASCDPKTHFCHVAISYADATSIEGNLAHDTFVIGSVTR 165
Query: 209 PNFLVGC--SVLSS-----RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
P L GC S LSS + G+ G RG S +QL KFSYC+ + +S
Sbjct: 166 PGTLFGCMDSGLSSDSEEDAKSTGLMGMNRGSLSFVNQLGFSKFSYCI------SGSDSS 219
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPS---VAERNAFSVYYYVGLRRITVGGQRVRVWH 318
++L +S+S + YTP V + +R V Y V L I VG + + +
Sbjct: 220 GILLLGDASYS--WLGPIQYTPLVLQTTPLPYFDR----VAYTVQLEGIRVGSKILSLPK 273
Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQ------MVKNRNY----TRAL 368
D G G T+VDSGT FTF+ ++ L +EF++Q +V + N+ T L
Sbjct: 274 SVFVPDHTGAGQTMVDSGTQFTFLMGPVYTALKNEFIAQTKSVLRIVDDPNFVFQGTMDL 333
Query: 369 GAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVEN-YFAVVGEGS-----AVCL 422
+ RP F TG P + L F+ GAE+++ + + V G GS C
Sbjct: 334 CYRVGSSTRPNF------TG-LPVISLMFR-GAEMSVSGQKLLYRVNGAGSEGKEEVYCF 385
Query: 423 TVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGF 459
T + + G + ++G+ QN ++E+DL R+GF
Sbjct: 386 T-FGNSDLLGIEAFVIGHHHQQNVWMEFDLAKSRVGF 421
>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
Length = 524
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 103/370 (27%), Positives = 158/370 (42%), Gaps = 37/370 (10%)
Query: 105 ILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKC-SWIHHESIQCR 163
I+DTGS L W C C C + + P F P S+S + C C + + +
Sbjct: 179 IVDTGSDLTWVQCK---PCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPG 235
Query: 164 DCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSSRQ 222
C ++ C Y + YG G + G+ ++T+ L + F+ GC LS+R
Sbjct: 236 SCATVGGGGGGGKSERC-YYSLAYGDGSFSRGVLATDTVALGGASVDGFVFGCG-LSNRG 293
Query: 223 ----PAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKK 275
AG+ G GR + SL SQ FSYCL + D + SL G + S +
Sbjct: 294 LFGGTAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAATSGDAAGSLSL---GGDTSSYRN 350
Query: 276 TTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDS 335
T ++YT + +P A +Y++ + +VGG V ++DS
Sbjct: 351 ATPVSYTRMIADP------AQPPFYFMNVTGASVGGAAVAAAGLGAA-------NVLLDS 397
Query: 336 GTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKL 395
GT T +AP ++ + EF Q R A + L C+++ G P L L
Sbjct: 398 GTVITRLAPSVYRAVRAEFARQF----GAERYPAAPPFSLLDACYNLTGHDEVKVPLLTL 453
Query: 396 HFKGGAEVTLPVENY-FAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRN 454
+GGA++T+ F +GS VCL + + P I+GN+Q +N V YD
Sbjct: 454 RLEGGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTP--IIGNYQQKNKRVVYDTVG 511
Query: 455 QRLGFKQQLC 464
RLGF + C
Sbjct: 512 SRLGFADEDC 521
>gi|383143511|gb|AFG53183.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
Length = 135
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 56/153 (36%), Positives = 82/153 (53%), Gaps = 18/153 (11%)
Query: 247 YCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
YCL D +S +++ N + D LTYTP + NP + +YY+GL
Sbjct: 1 YCL-----DYVNNSSKIVVGNKAVPGD---ISLTYTPLIINP------IYPFFYYLGLEA 46
Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
+++G +R+ + T D GNGGTI+DSGT+FT ++ +A EF SQ+ Y R
Sbjct: 47 VSIGRKRMNLPFNSATFDSKGNGGTIIDSGTSFTIFPEAMYSQIAGEFASQI----GYKR 102
Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKG 399
GAE+ TGL C++V G + FP+ HFKG
Sbjct: 103 VPGAESTTGLGLCYNVSGVENTQFPQFAFHFKG 135
>gi|26451756|dbj|BAC42973.1| unknown protein [Arabidopsis thaliana]
Length = 442
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 111/394 (28%), Positives = 170/394 (43%), Gaps = 58/394 (14%)
Query: 90 SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQN 149
+++L+ G PPQ I +LDTGS L W C S + F P SS+ + C +
Sbjct: 66 TVTLAVGDPPQNISMVLDTGSELSWLHCKK-------SPNLGSVFNPVSSSTYSPVPCSS 118
Query: 150 PKCSWIHHESIQCRDCNDEPL-ATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
P C D P+ A+ T +C + + EG ET + +
Sbjct: 119 PICR---------TRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVTR 169
Query: 209 PNFLVGC--SVLSSR-----QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
P L GC S LSS + G+ G RG S +QL KFSYC+ + +S
Sbjct: 170 PGTLFGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFSKFSYCI------SGSDSS 223
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
+L +S+S + YTP V S V Y V L I VG + + +
Sbjct: 224 VFLLLGDASYS--WLGPIQYTPLVLQ-STPLPYFDRVAYTVQLEGIRVGSKILSLPKSVF 280
Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQ------MVKNRNY----TRALGAE 371
D G G T+VDSGT FTF+ ++ L +EF++Q +V + ++ T L +
Sbjct: 281 VPDHTGAGQTMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCYK 340
Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVEN-YFAVVGEGS-----AVCLTVV 425
+ RP F P + L F+ GAE+++ + + V G GS C T
Sbjct: 341 VGSTTRPNFS-------GLPMVSLMFR-GAEMSVSGQKLLYRVNGAGSEGKEEVYCFT-F 391
Query: 426 TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGF 459
+ + G + ++G+ QN ++E+DL R+GF
Sbjct: 392 GNSDLLGIEAFVIGHHHQQNVWMEFDLAKSRVGF 425
>gi|356539352|ref|XP_003538162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 110/402 (27%), Positives = 174/402 (43%), Gaps = 63/402 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIP--SFIPKLSSSSRL 144
G Y + GTPP+ + +DTGS ++W C + C S +I F P SS+S L
Sbjct: 75 GLYYTKVKLGTPPRELYVQIDTGSDVLWVSCGSCNGCPQTSGLQIQLNYFDPGSSSTSSL 134
Query: 145 LGCQNPKC-SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLN 202
+ C + +C S + C N++ CT Y YG G T G +S+ ++
Sbjct: 135 ISCLDRRCRSGVQTSDASCSGRNNQ-------CT-----YTFQYGDGSGTSGYYVSDLMH 182
Query: 203 --------LPNRIIPNFLVGCSVL-------SSRQPAGIAGFGRGKTSLPSQLNLDKFSY 247
L + + GCS+L S R GI GFG+ S+ SQL+ +
Sbjct: 183 FASIFEGTLTTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIAP 242
Query: 248 CLLSHKFD-DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
+ SH D + L+L + + Y+P V PS +Y + L+
Sbjct: 243 RVFSHCLKGDNSGGGVLVL------GEIVEPNIVYSPLV--PS-------QPHYNLNLQS 287
Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
I+V GQ VR+ N GTIVDSGTT ++A E + P + + ++
Sbjct: 288 ISVNGQIVRIAPSVFATSN--NRGTIVDSGTTLAYLAEEAYNPFVIAIAAVIPQSVRSVL 345
Query: 367 ALGAEALTGLRPCFDV-PGEKTGSFPELKLHFKGGAEVTLPVENYFA---VVGEGSAVCL 422
+ G + C+ + FP++ L+F GGA + L ++Y +GEGS C+
Sbjct: 346 SRGNQ-------CYLITTSSNVDIFPQVSLNFAGGASLVLRPQDYLMQQNFIGEGSVWCI 398
Query: 423 TVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
++ SG ILG+ +++ YDL QR+G+ C
Sbjct: 399 GF---QKISGQSITILGDLVLKDKIFVYDLAGQRIGWANYDC 437
>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
Length = 462
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 115/393 (29%), Positives = 154/393 (39%), Gaps = 77/393 (19%)
Query: 83 SHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSS 142
+ G Y S+ GTPP +LDTGS +VW C QC Y S ++ F P+ S S
Sbjct: 136 AQGSGEYFASVGVGTPPTPALLVLDTGSDVVWLQCAPCRQC-YAQSGRV--FDPRRSRSY 192
Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETL 201
+ C P C + C Y V YG G +T G +ETL
Sbjct: 193 AAVRCGAPPCRGLDAGGGG------GCDRRRGTCL-----YQVAYGDGSVTAGDLATETL 241
Query: 202 NLPNRI-IPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKF 254
+P VGC + AG+ G GRG+ SLP+Q +FSYC
Sbjct: 242 WFARGARVPRVAVGCGHDNEGLFVAAAGLLGLGRGRLSLPTQTARRYGRRFSYCFQGSDL 301
Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
D T I+ H VGG RV
Sbjct: 302 DHRT-----IIRTVHQH-------------------------------------VGGARV 319
Query: 315 R-VWHKYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
R V + L LD G GG I+DSGT+ T +A ++ + + F + R L
Sbjct: 320 RGVGERSLRLDPSTGRGGVILDSGTSVTRLARPVYVAVREAFRAAAGGLR-----LAPGG 374
Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREAS 431
+ C+D+ G + P + +H GGAEV LP ENY V CL + TD
Sbjct: 375 FSLFDTCYDLRGRRVVKVPTVSVHLAGGAEVALPPENYLIPVDTRGTFCLALAGTD---- 430
Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
GG SI+ GN Q Q + V +D QR+ + C
Sbjct: 431 GGVSIV-GNIQQQGFRVVFDGDRQRVALVPKSC 462
>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 117/478 (24%), Positives = 199/478 (41%), Gaps = 68/478 (14%)
Query: 7 ALCLSFIFFFTLLSIFPSSITS--LTFSLSRFHTN----PSQDSYQNLNSLVSSSLTRAL 60
A S +F + S+F S++T+ F++ H + P +S + + ++L R+
Sbjct: 2 APVFSLLFLISTASVF-SAVTARDYGFTVELIHRDSPKSPMYNSSETHFDRIVNALRRSS 60
Query: 61 HIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNH 120
H + T + T ++ G Y + +S GTPP I + DTGS ++W C
Sbjct: 61 H------RNTVVLESDTAEAPIFNNGGEYLVEISVGTPPFSIVAVADTGSDVIWTQCK-- 112
Query: 121 YQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQIC 180
C C P F P S++ + + C +P CS+ S C+D+ C
Sbjct: 113 -PCSNCYQQNAPMFDPSKSTTYKNVACSSPVCSY----SGDGSSCSDD-----SECL--- 159
Query: 181 PSYLVLYGSG-------LTEGIALSETLNLPNRIIPNFLVGCSVLSS----RQPAGIAGF 229
Y + YG + + + T P P ++GC ++ +GI G
Sbjct: 160 --YSIAYGDDSHSQGNLAVDTVTMQSTSGRP-VAFPRTVIGCGHDNAGTFNANVSGIVGL 216
Query: 230 GRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVN 286
GRG SL +QL KFSYCL+ T ++ L N S+++ +G TP
Sbjct: 217 GRGPASLVTQLGPATGGKFSYCLIPIGTGSTNDSTKL---NFGSNANVSGSGTVSTP--- 270
Query: 287 NPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPEL 346
+ + +Y + L ++VG + L + N I+DSGTT T++ L
Sbjct: 271 ---IYSSAQYKTFYSLKLEAVSVGDTKFNFPEGASKLGGESN--IIIDSGTTLTYLPSAL 325
Query: 347 FEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLP 406
+ F S + ++ + A L CF + P + +HF+ GA+V L
Sbjct: 326 L----NSFGSAISQSMSLPHAQDPSEF--LDYCFATTTDDY-EMPPVTMHFE-GADVPLQ 377
Query: 407 VENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
EN F + + + +CL + + + I GN N+ V YD++N + F+ C
Sbjct: 378 RENLFVRLSDDT-ICLAFGSFPDDN---IFIYGNIAQSNFLVGYDIKNLAVSFQPAHC 431
>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 493
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 103/398 (25%), Positives = 169/398 (42%), Gaps = 56/398 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
G Y + GTPP +DTGS ++W C + C S +I F P SS+S +
Sbjct: 76 GLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCNGCPQTSGLQIQLNFFDPGSSSTSSM 135
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNL 203
+ C + +C + Q D AT + C SY YG G T G +S+ ++L
Sbjct: 136 IACSDQRC----NNGKQSSD------ATCSSQNNQC-SYTFQYGDGSGTSGYYVSDMMHL 184
Query: 204 ---------PNRIIPNFLVGCS-------VLSSRQPAGIAGFGRGKTSLPSQLNLDKFSY 247
N P + GCS S R GI GFG+ + S+ SQL+ +
Sbjct: 185 NTIFEGSMTTNSTAP-VVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAP 243
Query: 248 CLLSHKFD-DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
+ SH D++ L+L + + YT V P+ +Y + L+
Sbjct: 244 RIFSHCLKGDSSGGGILVL------GEIVEPNIVYTSLV--PA-------QPHYNLNLQS 288
Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
I+V GQ +++ + GTIVDSGTT ++A E ++P + + ++
Sbjct: 289 ISVNGQTLQIDSSVFATSN--SRGTIVDSGTTLAYLAEEAYDPFVSAITAAIPQSVRTVV 346
Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT 426
+ G + C+ + T FP++ L+F GGA + L ++Y + +
Sbjct: 347 SRGNQ-------CYLITSSVTDVFPQVSLNFAGGASMILRPQDYLIQQNSIGGAAVWCIG 399
Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
++ G ILG+ +++ V YDL QR+G+ C
Sbjct: 400 FQKIQGQGITILGDLVLKDKIVVYDLAGQRIGWANYDC 437
>gi|356515690|ref|XP_003526531.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 439
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 108/432 (25%), Positives = 179/432 (41%), Gaps = 51/432 (11%)
Query: 40 PSQDSYQN-LNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTP 98
P D++ N + ++ S R ++ ++ T +T + + + G Y + + GTP
Sbjct: 51 PKADTWDNRIINMASKDPVRVKYLSTLVSQKTVSTAPIASGQ--AFNIGNYVVRVKLGTP 108
Query: 99 PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE 158
Q++ +LDT + + PC+ C CS + +F PK S+S L C P+C +
Sbjct: 109 GQLLFMVLDTSTDEAFVPCSG---CTGCSDT---TFSPKASTSYGPLDCSVPQCGQVR-- 160
Query: 159 SIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGC--S 216
+ C P + C S+ Y + + L L +IP + GC +
Sbjct: 161 GLSC------PATGTGAC-----SFNQSYAGSSFSATLVQDALRLATDVIPYYSFGCVNA 209
Query: 217 VLSSRQPAGIAGFGRGKTSLPSQLNLDK----FSYCLLSHKFDDTTRTSSLILDNGSSHS 272
+ + PA + FSYCL S F + SL L
Sbjct: 210 ITGASVPAQGLLGLGRGPLSLLSQSGSNYSGIFSYCLPS--FKSYYFSGSLKLGPVGQPK 267
Query: 273 DKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTI 332
+TT L +P + PS+ YYV I+VG V +YL + + GTI
Sbjct: 268 SIRTTPLLRSP--HRPSL---------YYVNFTGISVGRVLVPFPSEYLGFNPNTGSGTI 316
Query: 333 VDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPE 392
+DSGT T ++ + +EF Q V +T ++GA CF E P
Sbjct: 317 IDSGTVITRFVEPVYNAVREEFRKQ-VGGTTFT-SIGA-----FDTCFVKTYETLA--PP 367
Query: 393 LKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDL 452
+ LHF+G ++ LP+EN GS CL + + ++ NFQ QN + +D+
Sbjct: 368 ITLHFEG-LDLKLPLENSLIHSSAGSLACLAMAAAPDNVNSVLNVIANFQQQNLRILFDI 426
Query: 453 RNQRLGFKQQLC 464
N ++G +++C
Sbjct: 427 VNNKVGIAREVC 438
>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
Length = 530
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 103/398 (25%), Positives = 164/398 (41%), Gaps = 59/398 (14%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSSSR 143
Y + G+PP+ +DTGS ++W C+ C C SS ++ F P SS+S
Sbjct: 117 YFTRVKLGSPPKEYFVQIDTGSDILWVACS---PCTGCPSSSGLNIQLEFFNPDTSSTSS 173
Query: 144 LLGCQNPKC-SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETL 201
+ C + +C + + C+ ++ P Y YG G T G +S+T+
Sbjct: 174 KIPCSDDRCTAALQTSEAVCQTSDNSPCG-----------YTFTYGDGSGTSGYYVSDTM 222
Query: 202 N----LPNRIIPN----FLVGCS-------VLSSRQPAGIAGFGRGKTSLPSQLNLDKFS 246
+ N N + GCS + R GI GFG+ + S+ SQLN S
Sbjct: 223 YFDTVMGNEQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVS 282
Query: 247 YCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
+ SH + +++ + GL YTP V PS +Y + L
Sbjct: 283 PKVFSHCLKGSDNGGGILV-----LGEIVEPGLVYTPLV--PS-------QPHYNLNLES 328
Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
I V GQ++ + T GTIVDSGTT ++A ++P + + + + R
Sbjct: 329 IVVNGQKLPIDSSLFTTSN--TQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPS---VR 383
Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT 426
+L ++ CF SFP + L+F GG +T+ ENY L +
Sbjct: 384 SLVSKG----NQCFVTSSSVDSSFPTVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIG 439
Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ G ILG+ +++ YDL N R+G+ C
Sbjct: 440 WQRNQGQQITILGDLVLKDKIFVYDLANMRMGWTDYDC 477
>gi|383143501|gb|AFG53178.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
gi|383143503|gb|AFG53179.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
gi|383143507|gb|AFG53181.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
gi|383143509|gb|AFG53182.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
gi|383143517|gb|AFG53186.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
gi|383143519|gb|AFG53187.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
Length = 135
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 56/153 (36%), Positives = 82/153 (53%), Gaps = 18/153 (11%)
Query: 247 YCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
YCL D +S +++ N + D LTYTP + NP + +YY+GL
Sbjct: 1 YCL-----DYVNNSSKIVVGNKAVPGD---ISLTYTPLIINP------IYPFFYYLGLEA 46
Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
+++G +R+ + T D GNGGTI+DSGT+FT ++ +A EF SQ+ Y R
Sbjct: 47 VSIGRKRLNLPFNSATFDSKGNGGTIIDSGTSFTIFPEAMYSQIAGEFASQI----GYKR 102
Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKG 399
GAE+ TGL C++V G + FP+ HFKG
Sbjct: 103 VPGAESTTGLGLCYNVSGVENTQFPQFAFHFKG 135
>gi|388505490|gb|AFK40811.1| unknown [Medicago truncatula]
Length = 193
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 71/210 (33%), Positives = 97/210 (46%), Gaps = 22/210 (10%)
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
D T+ S L+L S + T TP + NP +YY+ L I+VG ++
Sbjct: 2 DDTKQSVLLL---GSLPNVNATKQVTTPLITNPLQPS------FYYISLEVISVGDTKLS 52
Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
+ + DG+GG I+DSGTT T++ F+ L EF SQ + TG
Sbjct: 53 IEQSTFEVSDDGSGGVIIDSGTTITYIEENAFDSLKKEFTSQT------KLPVDKSGSTG 106
Query: 376 LRPCFDVPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGP 434
L CF +P KT P+L HFKGG ++ LP ENY CL + AS G
Sbjct: 107 LDVCFSLPSGKTEVEIPKLVFHFKGG-DLELPGENYMIADSSLGVACLAM----GASNGM 161
Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
S I GN Q QN V +DL+ + + F C
Sbjct: 162 S-IFGNIQQQNILVNHDLQKETITFIPTQC 190
>gi|255545932|ref|XP_002514026.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547112|gb|EEF48609.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 437
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 110/394 (27%), Positives = 157/394 (39%), Gaps = 66/394 (16%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + GTP Q + +LDT + W PC+ C CSS+ + S L
Sbjct: 95 GNYVVRVKLGTPGQFMFMVLDTSNDAAWVPCSG---CTGCSSTTFSTNTSSTYGS---LD 148
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSE-TLNLPN 205
C +C+ + S P S +C + YG + L E +L L N
Sbjct: 149 CSMAQCTQVRGFSC--------PATGSSSCV-----FNQSYGGDSSFSATLVEDSLRLVN 195
Query: 206 RIIPNFLVGC--SVLSSRQP------------AGIAGFGRGKTSLPSQLNLDKFSYCLLS 251
+IPNF GC S+ P + IA G + L FSYCL S
Sbjct: 196 DVIPNFAFGCINSISGGSVPPQGLLGLGRGPLSLIAQSGSLYSGL--------FSYCLPS 247
Query: 252 HKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG 311
F + SL L G + K + YTP + NP R + YYV L ++VG
Sbjct: 248 --FKSYYFSGSLKL--GPAGQPKS---IRYTPLLRNP---HRPSL---YYVNLTGVSVGR 294
Query: 312 QRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
V + + L + + GTI+DSGT T ++ + DEF Q+ A
Sbjct: 295 TLVPIAPELLAFNPNTGAGTIIDSGTVITRFVQPIYTAIRDEFRKQV--------AGPFS 346
Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREAS 431
+L CF E P + LHF G + LP+EN GS CL +
Sbjct: 347 SLGAFDTCFAATNEAVA--PAVTLHFTG-LNLVLPMENSLIHSSAGSLACLAMAAAPNNV 403
Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
++ N Q QN + +D+ N RLG ++LC
Sbjct: 404 NSVLNVIANLQQQNLRLLFDVPNSRLGIARELCN 437
>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 395
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 99/388 (25%), Positives = 158/388 (40%), Gaps = 74/388 (19%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + L GTPP I +LDTGS +W C C +C + P F P SS+ + + C
Sbjct: 65 YLMKLQIGTPPFEIEAVLDTGSEHIWTQC---LPCVHCYNQTAPIFDPSKSSTFKEIRC- 120
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR-- 206
D +D CP LV G T+G ++ET+ + +
Sbjct: 121 ---------------DTHDHS----------CPYELVYGGKSYTKGTLVTETVTIHSTSG 155
Query: 207 ---IIPNFLVGCSVLSSR-QP--AGIAGFGRGKTSLPSQLNLDK---FSYCLLSHKFDDT 257
++P ++GC +S +P AG+ G RG SL +Q+ + SYC
Sbjct: 156 QPFVMPETIIGCGRNNSGFKPGFAGVVGLDRGPKSLITQMGGEYPGLMSYCFAG------ 209
Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR-V 316
G+S + + V + +V + A +YY+ L ++VG R+ V
Sbjct: 210 ---------KGTSKINFGANAIVAGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETV 260
Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
+ L G ++DSG+T T+ PE + L + V Q+V + R ++ L
Sbjct: 261 GTPFHAL----KGNIVIDSGSTLTYF-PESYCNLVRKAVEQVVTAVRFPR---SDILCYY 312
Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
D+ FP + +HF GGA++ L N + G CL ++ +
Sbjct: 313 SKTIDI-------FPVITMHFSGGADLVLDKYNMYVASNTGGVFCLAIICNSPIE---EA 362
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I GN N+ V YD + + FK C
Sbjct: 363 IFGNRAQNNFLVGYDSSSLLVSFKPTNC 390
>gi|225465839|ref|XP_002264668.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 2 [Vitis
vinifera]
Length = 451
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 111/389 (28%), Positives = 158/389 (40%), Gaps = 52/389 (13%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + GTP Q + +DT S + W PC C CSS+ F S++ + LGCQ
Sbjct: 101 YIVRAKIGTPAQTMLMAMDTSSDVAWIPCNG---CLGCSSTL---FNSPASTTYKSLGCQ 154
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQ-------ICPSYLVLYGSGLTEGIALSETL 201
+C + H PL TS + +C L GS L ++ +T+
Sbjct: 155 AAQCKQVLHLL--------SPLLTSPSVVPKPTCGGGVCSFNLTYGGSSLAANLS-QDTI 205
Query: 202 NLPNRIIPNFLVGC------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFD 255
L +P + GC L ++ G+ S L FSYCL S F
Sbjct: 206 TLATDAVPGYSFGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPS--FK 263
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
+ SL L G K+ + YTP + NP R + Y+V L + VG + V
Sbjct: 264 SLNFSGSLRL--GPVGQPKR---IKYTPLLKNP---RRPSL---YFVNLMAVRVGRRVVD 312
Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
V T + GTI DSGT FT + + + D F +NR R L +L G
Sbjct: 313 VPPGSFTFNPSTGAGTIFDSGTVFTRLVTPAYIAVRDAF-----RNR-VGRNLTVTSLGG 366
Query: 376 LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPS 435
C+ VP P + F G VTLP +N GS CL + +
Sbjct: 367 FDTCYTVPIAA----PTITFMFTG-MNVTLPPDNLLIHSTAGSTTCLAMAAAPDNVNSVL 421
Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
++ N Q QN+ + YD+ N RLG ++LC
Sbjct: 422 NVIANLQQQNHRLLYDVPNSRLGVARELC 450
>gi|242063796|ref|XP_002453187.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
gi|241933018|gb|EES06163.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
Length = 493
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 113/397 (28%), Positives = 164/397 (41%), Gaps = 64/397 (16%)
Query: 89 YSISLSFGTPP-QIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGC 147
Y I++ G+PP + ++DTGS + W C +Q C P F P LSS+ C
Sbjct: 140 YVITVRLGSPPGKSQTMLIDTGSDISWVRCKPCWQ--QCRPQVDPLFDPSLSSTYSPFSC 197
Query: 148 QNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL--TEGIALSETLNLPN 205
+ C+ + E N ++S C Y+ +YG G T G S+TL L +
Sbjct: 198 SSAACAQLFQEG------NANGCSSSGQC-----QYIAMYGDGSVGTTGTYSSDTLALGS 246
Query: 206 R----IIPNFLVGCSVLSSRQPAGIAGFGRGKT-------SLPSQ----LNLDKFSYCLL 250
++ F GCS GI G G SL SQ FSYCL
Sbjct: 247 NSNTVVVSKFRFGCS----HAETGITGLTAGLMGLGGGAQSLVSQTAGTFGTTAFSYCL- 301
Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
T +SS L G++ + + G TP + + V +Y V L I VG
Sbjct: 302 -----PPTPSSSGFLTLGAAGT--SSAGFVKTPMLRSSQVP------AFYGVRLEAIRVG 348
Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
G+++ + + G I+DSGT T + P + L+ F + M + Y A +
Sbjct: 349 GRQLSIPTTVF------SAGMIMDSGTVVTRLPPTAYSSLSSAFKAGM---KQYPPAPSS 399
Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHF--KGGAEVTLPVENYFAVVGEGSAVCLT-VVTD 427
L CFD+ G+ + S P + L F GGA V L + S CL V T
Sbjct: 400 AGGGFLDTCFDMSGQSSVSMPTVALVFSGAGGAVVNLDASGILLQMETSSIFCLAFVATS 459
Query: 428 REASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ S G I+GN Q + + V YD+ +GFK C
Sbjct: 460 DDGSTG---IIGNVQQRTFQVLYDVAGGAVGFKAGAC 493
>gi|255566006|ref|XP_002523991.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536718|gb|EEF38359.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 455
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 103/387 (26%), Positives = 164/387 (42%), Gaps = 39/387 (10%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + L GTPP I +DTGS+++W PC N CK C + F P SS+ +
Sbjct: 96 GNYLMKLLIGTPPTEIHAAIDTGSNVIWIPCIN---CKDCFNQSSSIFNPLASSTYQDAP 152
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
C + +C S C+ N + + CP+ G + + L+ + P
Sbjct: 153 CDSYQC---ETTSSSCQSDNVCLYSCDEKHQLNCPN-----GRIAVDTMTLTSSDGRPFP 204
Query: 207 I-IPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTTRTSS 262
+ +F+ G S+ + G+ G GRG SL S+ L+ KFSYCL + ++ +
Sbjct: 205 LPYSDFVCGNSIYKTFAGVGVIGLGRGALSLTSKLYHLSDGKFSYCLADYYSKQPSKI-N 263
Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
L + S D + T + S YYV L I+VG +R +++
Sbjct: 264 FGLQSFISDDDLEVVSTTLG----------HHRHSGNYYVTLEGISVGEKRQDLYYVDDP 313
Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN-----RNYTRALGAEALTGLR 377
G ++DSGT FT + + ++ L + +N N + L
Sbjct: 314 F-APPVGNMLIDSGTMFTLLPKDFYDYLWSTVSYAIPENPQNHPHNSRFPFSMDNTLKLS 372
Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
PCF E FP++ +HF A+V L +N F V E VC + G S +
Sbjct: 373 PCFWYYPEL--KFPKITIHFT-DADVELSDDNSFIRVAE-DVVCFAFAATQP---GQSTV 425
Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
G++Q N+ + YDL+ + FK+ C
Sbjct: 426 YGSWQQMNFILGYDLKRGTVSFKRTDC 452
>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 387
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 113/396 (28%), Positives = 168/396 (42%), Gaps = 70/396 (17%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS----KIPSFIPKLSSSS 142
G Y + ++ GTP + LDTGS + W QC+ C S F P+ SSS
Sbjct: 43 GNYLVKMALGTPKLSLSLALDTGSDITW------TQCEPCVGSCYRQAQTKFDPRKSSSY 96
Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETL 201
+ + C + C I +S R C S C Y V YG G + G +E L
Sbjct: 97 KNVSCSSSSCRIIT-DSGGARGC------VSSTCI-----YKVQYGDGSYSVGFFATEKL 144
Query: 202 NL-PNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLP------------SQLNLDKFSYC 248
+ P+ +I NFL GC +Q AG FGR L S+ + F+YC
Sbjct: 145 TISPSDVISNFLFGCG----QQNAGR--FGRIAGLLGLGRGKLSLALQTSEKYNNLFTYC 198
Query: 249 LLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
L S ++ T L L G K T L+ F N P +Y + ++ ++
Sbjct: 199 LPSFS---SSSTGHLTL-GGQVPKSVKFTPLS-PAFKNTP----------FYGIDIKGLS 243
Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
VGG + + + N G I+DSGT T + P ++ L+ +F Q++K+ T
Sbjct: 244 VGGHVLPIDASVFS-----NAGAIIDSGTVITRLQPTVYSALSSKF-QQLMKDYPKT--- 294
Query: 369 GAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR 428
+ + L C+D G ++ S P + FKGG EV + V+ VCL +
Sbjct: 295 --DGFSILDTCYDFSGNESISVPRISFFFKGGVEVDIKFFGILTVINAWDKVCLAFAPND 352
Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ G ++ GN Q Q Y V +DL R+GF C
Sbjct: 353 DD--GDFVVFGNSQQQTYDVVHDLAKGRIGFAPSGC 386
>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
Length = 626
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 107/394 (27%), Positives = 162/394 (41%), Gaps = 61/394 (15%)
Query: 85 SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRL 144
S G Y+ L GTPPQ I+DTGS + + PC++ C+ C + P F P LSS+ R
Sbjct: 73 SNGYYTTRLFIGTPPQEFALIVDTGSTVTYVPCSS---CEQCGKHQDPRFQPDLSSTYRP 129
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALS---ETL 201
+ C NP C +C+DE K CT + SG+ +S E+
Sbjct: 130 VKC-NPSC-----------NCDDE----GKQCTYERRYAEMSSSSGVIAEDVVSFGNESE 173
Query: 202 NLPNRIIPNFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
P R + GC L S++ GI G GRG+ S+ QL D
Sbjct: 174 LKPQRAV----FGCENVETGDLYSQRADGIMGLGRGRLSVVDQL-------------VDK 216
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAF-SVYYYVGLRRITVGGQRVR 315
S L G L N + N + S YY + L+ + V G+ ++
Sbjct: 217 GVIGDSFSLCYGGMDVGGGAMVLGQISPPPNMVFSHSNPYRSPYYNIELKELHVAGKPLK 276
Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
+ K D GT++DSGTT+ + F L D + ++ R+ + G +
Sbjct: 277 LKPKVF----DEKHGTVLDSGTTYAYFPEAAFHALKDAIMKEI---RHLKQIPGPDP-NY 328
Query: 376 LRPCFDVPGEKTGS----FPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVVTDREA 430
CF G + FPE+ + F G +++L ENY F A CL + +
Sbjct: 329 HDICFSGAGREVSHLSKVFPEVNMVFGSGQKLSLSPENYLFRHTKVSGAYCLGIFQN--- 385
Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ +LG ++N V YD N ++GF + C
Sbjct: 386 GNDLTTLLGGIVVRNTLVTYDRENDKIGFWKTNC 419
>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 389
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 99/388 (25%), Positives = 157/388 (40%), Gaps = 74/388 (19%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + L GTPP I +LDTGS +W C C +C + P F P SS+ + + C
Sbjct: 59 YLMKLQIGTPPFEIEAVLDTGSEHIWTQC---LPCVHCYNQTAPIFDPSKSSTFKEIRCD 115
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR-- 206
H S CP LV G T+G ++ET+ + +
Sbjct: 116 T------HDHS--------------------CPYELVYGGKSYTKGTLVTETVTIHSTSG 149
Query: 207 ---IIPNFLVGCSVLSSR-QP--AGIAGFGRGKTSLPSQLNLDK---FSYCLLSHKFDDT 257
++P ++GC +S +P AG+ G RG SL +Q+ + SYC
Sbjct: 150 QPFVMPETIIGCGRNNSGFKPGFAGVVGLDRGPKSLITQMGGEYPGLMSYCFAG------ 203
Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR-V 316
G+S + + V + +V + A +YY+ L ++VG R+ V
Sbjct: 204 ---------KGTSKINFGANAIVAGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETV 254
Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
+ L G ++DSG+T T+ PE + L + V Q+V + R ++ L
Sbjct: 255 GTPFHAL----KGNIVIDSGSTLTYF-PESYCNLVRKAVEQVVTAVRFPR---SDILCYY 306
Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
D+ FP + +HF GGA++ L N + G CL ++ +
Sbjct: 307 SKTIDI-------FPVITMHFSGGADLVLDKYNMYVASNTGGVFCLAIICNSPIE---EA 356
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I GN N+ V YD + + FK C
Sbjct: 357 IFGNRAQNNFLVGYDSSSLLVSFKPTNC 384
>gi|302757345|ref|XP_002962096.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
gi|300170755|gb|EFJ37356.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
Length = 506
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 111/418 (26%), Positives = 171/418 (40%), Gaps = 81/418 (19%)
Query: 80 NISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSK--------- 130
N SS S Y + G P Q + I+DTGS ++WF C C+ CSS K
Sbjct: 79 NGSSTSDATYYAQIGVGHPVQFLNAIVDTGSDILWFKCK---LCQGCSSKKNVIVCSSII 135
Query: 131 ----IPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVL 186
I + P+LS ++ C +P CS E CR N+ C +
Sbjct: 136 MQGPITLYDPELSITASPATCSDPLCS----EGGSCRGNNNS-----------CAYDISY 180
Query: 187 YGSGLTEGIALSETLNLPNRIIPN--FLVGCSV-LSSRQPA-GIAGFGRGKTSLPSQLNL 242
+ + GI + ++L ++ N +GC+ +S P GI GFGR K S+P+QL
Sbjct: 181 EDTSSSTGIYFRDVVHLGHKASLNTTMFLGCATSISGLWPVDGIMGFGRSKVSVPNQLAA 240
Query: 243 DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYV 302
SY + H +++ + + + YTP + N + Y V
Sbjct: 241 QAGSYNIFYHCLSGEKEGGGILVLG----KNDEFPEMVYTPMLAN---------DIVYNV 287
Query: 303 GLRRITVGGQRVRVWHKYLTLDRD-GNGGTIVDSGT---TFTFMAPELFEPLADEFVSQM 358
L ++V + + + + GNGGTI+DSGT TF A LF +F + +
Sbjct: 288 KLVSLSVNSKALPIEASEFEYNATVGNGGTIIDSGTSSATFPSKALALFVKAVSKFTTAI 347
Query: 359 VKNRNYTRALGAEALTGLRPCFDVPGEKTG---SFPELKLHFKGGAEVTLPVENYF-AVV 414
A + PCF ++ FP + L F GGA + L NY AVV
Sbjct: 348 PT---------APLESSGSPCFISISDRNSVEVDFPNVTLKFDGGATMELTAHNYLEAVV 398
Query: 415 GEGSA----------VCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQ 462
+ VC++ S G S ILG+ +++ V YD+ R+G+ +Q
Sbjct: 399 SRKLSESTHFQGVRLVCIS------WSVGNSTILGDAILKDKVVVYDMEKSRIGWVKQ 450
>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 126/480 (26%), Positives = 193/480 (40%), Gaps = 85/480 (17%)
Query: 13 IFFFTLLSIFPSSITSLT-----FSLSRFHTNP--------SQDSYQNLNSLVSSSLTRA 59
IFF +L + S T++ F+ S FH + S Y L + SL+R+
Sbjct: 7 IFFHLILLLISFSQTTIINGDNGFTTSLFHRDSLLSPLEFSSLSHYDRLTNAFRRSLSRS 66
Query: 60 LHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTN 119
+ N + T ++ S G Y +S+S GTPP + DTGS L+W C
Sbjct: 67 ATLLN---RAATNGALDLQAPLTPGS-GEYLMSVSIGTPPVDYIGMADTGSDLMWAQC-- 120
Query: 120 HYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQI 179
C C P F P S+S + C + C I + D
Sbjct: 121 -LPCLKCYKQSRPIFDPLKSTSFSHVPCNSQNCKAIDDSHCGAQGVCD------------ 167
Query: 180 CPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCS---VLSSRQPAGIAGFGRGKTS 235
Y YG T+G E + + + + + ++GC +G+ G G G+ S
Sbjct: 168 ---YSYTYGDQTYTKGDLGFEKITIGSSSVKS-VIGCGHESGGGFGFASGVIGLGGGQLS 223
Query: 236 LPSQLNLD-----KFSYC---LLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNN 287
L SQ++ +FSYC LLSH + ++ G+ TP ++
Sbjct: 224 LVSQMSQTSGISRRFSYCLPTLLSHANGKINFGQNAVVSG---------PGVVSTPLISK 274
Query: 288 PSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELF 347
V YYYV L I++G +R ++ + GN I+DSGTT +F+ EL+
Sbjct: 275 NPV-------TYYYVTLEAISIGNER------HMASAKQGN--VIIDSGTTLSFLPKELY 319
Query: 348 EPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD--VPGEKTGSFPELKLHFKGGAEVT- 404
D VS ++K R L CFD + + P + F GGA V
Sbjct: 320 ----DGVVSSLLKVVKAKRVKDPGNFWDL--CFDDGINVATSSGIPIITAQFSGGANVNL 373
Query: 405 LPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
LPV + V + + LT + + G I+GN + N+ + YDL +RL FK +C
Sbjct: 374 LPVNTFQKVANNVNCLTLTPASPTDEFG----IIGNLALANFLIGYDLEAKRLSFKPTVC 429
>gi|326521034|dbj|BAJ92880.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 448
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 102/382 (26%), Positives = 160/382 (41%), Gaps = 50/382 (13%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + GTPPQ + +DT + W PC+ C C ++ F P S S R + C
Sbjct: 108 YVVRARLGTPPQQLLLAVDTSNDAAWIPCSG---CAGCPTTT--PFNPAASKSYRAVPCG 162
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
+P CS R N +K+C + + Y E ++L + N ++
Sbjct: 163 SPACS---------RAPNPSCSLNTKSC-----GFSLTYADSSLEAALSQDSLAVANDVV 208
Query: 209 PNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTTRTSS 262
++ GC + ++ P G+ G GRG S SQ + FSYCL S F + +
Sbjct: 209 KSYTFGCLQKATGTATPPQGLLGLGRGPLSFLSQTKDMYEGTFSYCLPS--FKSLNFSGT 266
Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
L L KTT P + NP R++ YYV + I VG + V + L
Sbjct: 267 LRLGRKGQPLRIKTT-----PLLVNP---HRSSL---YYVSMTGIRVGKKVVPIPPAALA 315
Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
D GT++DSGT FT + + + DE R R +L G C++
Sbjct: 316 FDPATGAGTVLDSGTMFTRLVAPAYVAVRDEV-------RRRIRGAPLSSLGGFDTCYNT 368
Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
T +P + F G +VTLP +N G+ CL + + ++ + Q
Sbjct: 369 ----TVKWPPVTFMFTG-MQVTLPADNLVIHSTYGTTSCLAMAAAPDGVNTVLNVIASMQ 423
Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
QN+ + +D+ N R+GF ++ C
Sbjct: 424 QQNHRILFDVPNGRVGFAREQC 445
>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
Length = 505
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 110/389 (28%), Positives = 161/389 (41%), Gaps = 57/389 (14%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVW---FPCTNHYQCKYCSSSKIPSFIPKLSSSSRLL 145
+ +++ FG+P Q +DTGS + W PC+ H C P F P S++ +
Sbjct: 161 FVVTVGFGSPAQNYTLSIDTGSDVSWIQCLPCSGH-----CYKQHDPVFDPTKSATYSAV 215
Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNLP 204
C +P+C+ + + S C Y V YG G T G+ ETL+L
Sbjct: 216 PCGHPQCAAAGGK-----------CSNSGTCL-----YKVTYGDGSSTAGVLSHETLSLS 259
Query: 205 N-RIIPNFLVGCSVLSSRQ---PAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDT 257
+ R +P F GC + + G+ G GRG SLPSQ FSYCL S+ DT
Sbjct: 260 STRDLPGFAFGCGQTNLGEFGGVDGLVGLGRGALSLPSQAAATFGATFSYCLPSY---DT 316
Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
T L + + + + + YT + ++ + Y+V + I +GG + V
Sbjct: 317 TH-GYLTMGSTTPAASNDDDDVQYTAMI------QKEDYPSLYFVEVVSIDIGGYILPVP 369
Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
T D GT+ DSGT T++ PE + L D F M + + A A
Sbjct: 370 PTVFTRD-----GTLFDSGTILTYLPPEAYASLRDRFKFTMTQYKP------APAYDPFD 418
Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTL-PVENYFAVVGEGSAV-CLTVVTDREASGGPS 435
C+D G P + F GA L PV A CL V S P
Sbjct: 419 TCYDFTGHNAIFMPAVAFKFSDGAVFDLSPVAILIYPDDTAPATGCLAFVP--RPSTMPF 476
Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN Q + V YD+ +++GF Q C
Sbjct: 477 NIIGNTQQRGTEVIYDVAAEKIGFGQFTC 505
>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 434
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 130/464 (28%), Positives = 189/464 (40%), Gaps = 77/464 (16%)
Query: 23 PSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRA-LHIKN-PQTKTTTTTTTTTTTN 80
P TS F+L R HT +++ + S + LH K+ P TT +
Sbjct: 23 PKQCTSYRFTL-RLHT-------KSIKTKESPKIKPGYLHSKSTPAPSRLDNLWTTEIAD 74
Query: 81 ISSH-----SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFI 135
I SH + + ++S G PP ++DTGS L W C CK C IP F
Sbjct: 75 IVSHVTPIPNPAAFLANISIGDPPVPQLLLIDTGSDLTWIQC---LPCK-CYPQTIPFFH 130
Query: 136 PKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGI 195
P SS+ R N C H Q DE T C +L T GI
Sbjct: 131 PSRSSTYR-----NASCESAPHAMPQI--FRDEK-------TGNCRYHLRYRDFSNTRGI 176
Query: 196 ALSETLNLPNR---II--PNFLVGCSVLSS--RQPAGIAGFGRGKTSLPSQLNLDKFSYC 248
E L +I PN + GC +S Q +G+ G G G S+ ++ KFSYC
Sbjct: 177 LAKEKLTFQTSDEGLISKPNIVFGCGQDNSGFTQYSGVLGLGPGTFSIVTRNFGSKFSYC 236
Query: 249 LLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
S D T + LIL NG+ T F YY+ L+ I+
Sbjct: 237 FGS-LIDPTYPHNFLILGNGARIEGDPT---------------PLQIFQDRYYLDLQAIS 280
Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEF------VSQMVKN- 361
+G + + + R GGT++D+G + T +A E +E L++E V + VK+
Sbjct: 281 LGEKLLDIEPGIFQRYR-SKGGTVIDTGCSPTILAREAYETLSEEIDFLLGEVLRRVKDW 339
Query: 362 RNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVC 421
YT L D+ G FP + HF GGAE+ L VE+ F G + C
Sbjct: 340 EQYTNHCYEGNLK-----LDLYG-----FPVVTFHFAGGAELALDVESLFVSSESGDSFC 389
Query: 422 LTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
L + + + ++G QNY V Y+LR ++ F++ C+
Sbjct: 390 LAMTMN---TFDDMSVIGAMAQQNYNVGYNLRTMKVYFQRTDCE 430
>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 459
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 113/401 (28%), Positives = 168/401 (41%), Gaps = 72/401 (17%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIP--SFIPKLSSSSRL 144
G Y + GTPPQ +DTGS + W C CK S+ +P F P+ S+S
Sbjct: 46 GLYYTRIYLGTPPQQFYVHVDTGSDVAWVNCVPCTNCKRASNVALPISIFDPEKSTSKTS 105
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNC---TQICPSYLVLYGSG-LTEGIALSET 200
+ C + +C LA++ C + CP Y LYG G T G +++
Sbjct: 106 ISCTDEECY----------------LASNSKCSFNSMSCP-YSTLYGDGSSTAGYLINDV 148
Query: 201 LNLPNRIIPN-----------FLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCL 249
L+ N F G + + G+ GFG+ + SLPSQL+ S +
Sbjct: 149 LSFNQVPSGNSTATSGTARLTFGCGSNQTGTWLTDGLVGFGQAEVSLPSQLSKQNVSVNI 208
Query: 250 LSHKFD-DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
+H D + +L++ + + GL YTP V S +Y V L I
Sbjct: 209 FAHCLQGDNKGSGTLVIGH------IREPGLVYTPIVPKQS---------HYNVELLNIG 253
Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
V G V + D +GG I+DSGTT T+ L +P D+F K R+ R+
Sbjct: 254 VSGTNVTTPTAF---DLSNSGGVIMDSGTTLTY----LVQPAYDQF---QAKVRDCMRS- 302
Query: 369 GAEALTGLRP-CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYF--AVVGEG-SAVCLTV 424
G+ P F G FP + L+F GGA + L +Y ++ G SA C +
Sbjct: 303 ------GVLPVAFQFFCTIEGYFPNVTLYFAGGAAMLLSPSSYLYKEMLTTGLSAYCFSW 356
Query: 425 VTDREASGGPS-IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ G S I G+ +++ V YD N R+G+K C
Sbjct: 357 LESTSVYGYLSYTIFGDNVLKDQLVVYDNVNNRIGWKNFDC 397
>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
gi|223948083|gb|ACN28125.1| unknown [Zea mays]
gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
Length = 466
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 108/385 (28%), Positives = 160/385 (41%), Gaps = 57/385 (14%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y I++S GTP +DTGS + W C + CSS K F P S++ C
Sbjct: 130 YVITVSLGTPAVTQVMSIDTGSDVSWVQCA-PCAAQSCSSQKDKLFDPAKSATYSAFSCS 188
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLY-GSGLTEGIALSETLNL-PNR 206
+ +C+ + E C + + + Y+V Y T G S+TL L +
Sbjct: 189 SAQCAQLGGEGNGCLNSHCQ--------------YIVKYVDHSNTTGTYGSDTLGLTTSD 234
Query: 207 IIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRT 260
+ NF GCS ++ Q G+ G G SL SQ FSYCL ++ +
Sbjct: 235 AVKNFQFGCSHRANGFVGQLDGLMGLGGDTESLVSQTAATYGKAFSYCL-----PPSSSS 289
Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
+ L G++ ++ + TP V R +Y V L+ ITV G ++ V
Sbjct: 290 AGGFLTLGAAAGGTSSSRYSRTPLV-------RFNVPTFYGVFLQAITVAGTKLNVPASV 342
Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG-LRPC 379
+ G ++VDSGT T + P ++ L F +M +A + A G L C
Sbjct: 343 FS------GASVVDSGTVITQLPPTAYQALRTAFKKEM-------KAYPSAAPVGILDTC 389
Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
FD G KT P + L F GA + L V F A CL A G + ILG
Sbjct: 390 FDFSGIKTVRVPVVTLTFSRGAVMDLDVSGIFY------AGCLAFTA--TAQDGDTGILG 441
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
N Q + + + +D+ LGF+ C
Sbjct: 442 NVQQRTFEMLFDVGGSTLGFRPGAC 466
>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 463
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 124/471 (26%), Positives = 185/471 (39%), Gaps = 84/471 (17%)
Query: 24 SSITSLTFSLSRFHTN----PSQDSYQNLNSLVSSSLTRALHIKNP-------------- 65
SS++ T +L+ H PS L+ RA HI+
Sbjct: 47 SSLSGTTVALNHRHGPCSPVPSSKKRPTEEELLKRDQLRAEHIQRKFAMNAAVDGAGDLQ 106
Query: 66 QTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWF---PCTNHYQ 122
Q+K +++ T +++ + Y IS+ GTP +DTGS + W PC N
Sbjct: 107 QSKVSSSVPTKLGSSLDTLEY---VISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPN--- 160
Query: 123 CKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPS 182
C + F P SS+ R + C +C+ + + C AT+ C
Sbjct: 161 -PPCYAQTGALFDPAKSSTYRAVSCAAAECAQLEQQGNGCG-------ATNYEC-----Q 207
Query: 183 YLVLYGSG-LTEGIALSETLNL--PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSL 236
Y V YG G T G +TL L + + F GCS + S Q G+ G G G SL
Sbjct: 208 YGVQYGDGSTTNGTYSRDTLTLSGASDAVKGFQFGCSHVESGFSDQTDGLMGLGGGAQSL 267
Query: 237 PSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAER 293
SQ + FSYCL +GSS G + FV + R
Sbjct: 268 VSQTAAAYGNSFSYCLPPT--------------SGSSGFLTLGGGGGVSGFVTTRMLRSR 313
Query: 294 NAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADE 353
+Y L+ I VGG+++ + G++VDSGT T + P + L+
Sbjct: 314 Q-IPTFYGARLQDIAVGGKQLGLSPSVFA------AGSVVDSGTIITRLPPTAYSALSSA 366
Query: 354 FVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAV 413
F + M + R+ A A + L CFD G+ S P + L F GGA + L
Sbjct: 367 FKAGMKQYRS------APARSILDTCFDFAGQTQISIPTVALVFSGGAAIDLDPNGIM-- 418
Query: 414 VGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
G+ + D +G I+GN Q + + V YD+ + LGF+ C
Sbjct: 419 --YGNCLAFAATGDDGTTG----IIGNVQQRTFEVLYDVGSSTLGFRSGAC 463
>gi|359483137|ref|XP_002272278.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 402
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 88/298 (29%), Positives = 140/298 (46%), Gaps = 42/298 (14%)
Query: 179 ICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSSRQPAGIAGF---GRGKT 234
IC +Y + YG G T G E L ++ +F+ GC + G++G GR
Sbjct: 132 IC-NYAINYGDGSFTRGELGHEKLKFGTILVKDFIFGCGRNNKGLFGGVSGLMGLGRSDL 190
Query: 235 SLPSQ---LNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVA 291
SL SQ + FSYCL S + + SLIL G+S + ++ ++Y + NP +
Sbjct: 191 SLISQTSGIFGGVFSYCLPS---TERKGSGSLIL-GGNSSVYRNSSPISYAKMIENPQLY 246
Query: 292 ERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLA 351
+Y++ L I++GG ++ G +VDSGT T + P +++ L
Sbjct: 247 N------FYFINLTGISIGGVALQAPSV-------GPSRILVDSGTVITRLPPTIYKALK 293
Query: 352 DEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYF 411
EF+ Q +T A A + L CF++ + P +K+HF+G AE+T+ V F
Sbjct: 294 AEFLKQ------FTGFPPAPAFSILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVF 347
Query: 412 AVV-GEGSAVCLTVVT----DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
V + S VCL + + D A ILGN+Q +N V YD + ++GF + C
Sbjct: 348 YFVKSDASQVCLALASLEYQDEVA------ILGNYQQKNLRVIYDTKETKVGFALETC 399
>gi|297802338|ref|XP_002869053.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314889|gb|EFH45312.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 522
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 126/485 (25%), Positives = 200/485 (41%), Gaps = 88/485 (18%)
Query: 9 CLSFIFFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLV-SSSLTRALHIKNPQT 67
C IF F + F + + S RF P + S++ N+LV L R + + ++
Sbjct: 24 CNGRIFTFEMHHRFSDEVKQWSDSTGRFVKFPPKGSFEYFNALVLRDWLIRGRRLSDSES 83
Query: 68 KTTTT-TTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCK-- 124
+++ T + +T+ ISS + Y+ ++ GTP LDTGS L W PC + +C
Sbjct: 84 ESSLTFSDGNSTSRISSLGFLHYT-TVKLGTPGMRFMVALDTGSDLFWVPC-DCGKCAPT 141
Query: 125 ----YCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQIC 180
Y S ++ + PK+S++++ + C N C+ + QC L T C
Sbjct: 142 EGATYASEFELSIYNPKISTTNKKVTCNNSLCA----QRNQC-------LGTFSTC---- 186
Query: 181 PSYLVLYGSGL--TEGIALSETLNL------PNRIIPNFLVGC------SVLSSRQPAGI 226
Y+V Y S T GI + + ++L P R+ GC S L P G+
Sbjct: 187 -PYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERVEAYVTFGCGQVQSGSFLDIAAPNGL 245
Query: 227 AGFGRGKTSLPSQLN-----LDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTY 281
G G K S+PS L D FS C D R S D GSS ++
Sbjct: 246 FGLGMEKISVPSVLAREGLVADSFSMCF---GHDGVGRIS--FGDKGSSDQEE------- 293
Query: 282 TPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTF 341
TPF NPS Y + + R+ VG + D + D+GT+FT+
Sbjct: 294 TPFNLNPSHPN-------YNITVTRVRVGTTLI-----------DDEFTALFDTGTSFTY 335
Query: 342 MAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSF-PELKLHFKGG 400
+ ++ +++ F SQ R+ ++ C+D+ + S P L L KG
Sbjct: 336 LVDPMYTTVSESFHSQAQDKRH-----SPDSRIPFEYCYDMSNDANASLIPSLSLTMKGN 390
Query: 401 AEVTLPVENYFAVVGEGSAV-CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGF 459
+ T+ + + EG V CL +V E + I+G M Y V +D L +
Sbjct: 391 SHFTIN-DPIIVISTEGELVYCLAIVKSSELN-----IIGQNYMTGYRVVFDREKLVLAW 444
Query: 460 KQQLC 464
K+ C
Sbjct: 445 KKFDC 449
>gi|42567433|ref|NP_195313.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|190576481|gb|ACE79041.1| At4g35880 [Arabidopsis thaliana]
gi|222423134|dbj|BAH19546.1| AT4G35880 [Arabidopsis thaliana]
gi|332661184|gb|AEE86584.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 524
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 126/487 (25%), Positives = 200/487 (41%), Gaps = 90/487 (18%)
Query: 9 CLSFIFFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLV-SSSLTRALHIKNPQT 67
C IF F + F + + S RF P + S++ N+LV L R + ++
Sbjct: 24 CNGRIFTFEMHHRFSDEVKQWSDSTGRFAKFPPKGSFEYFNALVLRDWLIRGRRLSESES 83
Query: 68 KTTTTTTTT---TTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCK 124
++ ++ T + +T+ ISS + Y+ ++ GTP LDTGS L W PC + +C
Sbjct: 84 ESESSLTFSDGNSTSRISSLGFLHYT-TVKLGTPGMRFMVALDTGSDLFWVPC-DCGKCA 141
Query: 125 ------YCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQ 178
Y S ++ + PK+S++++ + C N C+ + QC L T C
Sbjct: 142 PTEGATYASEFELSIYNPKVSTTNKKVTCNNSLCA----QRNQC-------LGTFSTC-- 188
Query: 179 ICPSYLVLYGSGL--TEGIALSETLNL------PNRIIPNFLVGC------SVLSSRQPA 224
Y+V Y S T GI + + ++L P R+ GC S L P
Sbjct: 189 ---PYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERVEAYVTFGCGQVQSGSFLDIAAPN 245
Query: 225 GIAGFGRGKTSLPSQLN-----LDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGL 279
G+ G G K S+PS L D FS C D R S D GSS ++
Sbjct: 246 GLFGLGMEKISVPSVLAREGLVADSFSMCF---GHDGVGRIS--FGDKGSSDQEE----- 295
Query: 280 TYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTF 339
TPF NPS Y + + R+ VG + D + D+GT+F
Sbjct: 296 --TPFNLNPSHPN-------YNITVTRVRVGTTLI-----------DDEFTALFDTGTSF 335
Query: 340 TFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSF-PELKLHFK 398
T++ ++ +++ F SQ R+ ++ C+D+ + S P L L K
Sbjct: 336 TYLVDPMYTTVSESFHSQAQDKRH-----SPDSRIPFEYCYDMSNDANASLIPSLSLTMK 390
Query: 399 GGAEVTLPVENYFAVVGEGSAV-CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRL 457
G + T+ + + EG V CL +V E + I+G M Y V +D L
Sbjct: 391 GNSHFTIN-DPIIVISTEGELVYCLAIVKSSELN-----IIGQNYMTGYRVVFDREKLVL 444
Query: 458 GFKQQLC 464
+K+ C
Sbjct: 445 AWKKFDC 451
>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 503
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 108/389 (27%), Positives = 159/389 (40%), Gaps = 61/389 (15%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS----KIPSFIPKLSSSSRL 144
Y + + GTPP + DTGS W QC+ C S K F P SS+
Sbjct: 163 YVVPIGLGTPPSRFTVVFDTGSDTTWV------QCRPCVVSCYKQKDRLFDPAKSSTYAN 216
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNL 203
+ C +P C+ + CN + +C Y + YG G T G +TL +
Sbjct: 217 VSCADPACA-----DLDASGCN------AGHCL-----YGIQYGDGSYTVGFFAKDTLAV 260
Query: 204 PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDT 257
I F GC + Q AG+ G GRG TS+ Q FSYCL +
Sbjct: 261 AQDAIKGFKFGCGEKNRGLFGQTAGLLGLGRGPTSITVQAYEKYGGSFSYCLPA------ 314
Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
+ ++ L+ G + TP + + +YYVGL I VGG+++
Sbjct: 315 SSAATGYLEFGPLSPSSSGSNAKTTPMLTDKG-------PTFYYVGLTGIRVGGKQLGAI 367
Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
+ + N GT+VDSGT T + P+ + + Y + A A + L
Sbjct: 368 PESVF----SNSGTLVDSGTVITRL-PDTAYAALSSAFAAAMAASGYKK---AAAYSILD 419
Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT--DREASGGPS 435
C+D G S P + L F+GGA + L + + S VCL + D E+ G
Sbjct: 420 TCYDFTGLSQVSLPTVSLVFQGGACLDLDASGIVYAISQ-SQVCLGFASNGDDESVG--- 475
Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN Q + Y V YD+ + +GF C
Sbjct: 476 -IVGNTQQRTYGVLYDVSKKVVGFAPGAC 503
>gi|190896584|gb|ACE96805.1| aspartyl protease [Populus tremula]
gi|190896586|gb|ACE96806.1| aspartyl protease [Populus tremula]
gi|190896588|gb|ACE96807.1| aspartyl protease [Populus tremula]
gi|190896590|gb|ACE96808.1| aspartyl protease [Populus tremula]
gi|190896592|gb|ACE96809.1| aspartyl protease [Populus tremula]
gi|190896594|gb|ACE96810.1| aspartyl protease [Populus tremula]
gi|190896596|gb|ACE96811.1| aspartyl protease [Populus tremula]
gi|190896598|gb|ACE96812.1| aspartyl protease [Populus tremula]
gi|190896600|gb|ACE96813.1| aspartyl protease [Populus tremula]
gi|190896602|gb|ACE96814.1| aspartyl protease [Populus tremula]
gi|190896604|gb|ACE96815.1| aspartyl protease [Populus tremula]
gi|190896606|gb|ACE96816.1| aspartyl protease [Populus tremula]
gi|190896610|gb|ACE96818.1| aspartyl protease [Populus tremula]
gi|190896612|gb|ACE96819.1| aspartyl protease [Populus tremula]
gi|190896614|gb|ACE96820.1| aspartyl protease [Populus tremula]
gi|190896616|gb|ACE96821.1| aspartyl protease [Populus tremula]
gi|190896618|gb|ACE96822.1| aspartyl protease [Populus tremula]
gi|190896620|gb|ACE96823.1| aspartyl protease [Populus tremula]
gi|190896622|gb|ACE96824.1| aspartyl protease [Populus tremula]
gi|190896624|gb|ACE96825.1| aspartyl protease [Populus tremula]
gi|190896626|gb|ACE96826.1| aspartyl protease [Populus tremula]
gi|190896628|gb|ACE96827.1| aspartyl protease [Populus tremula]
gi|190896630|gb|ACE96828.1| aspartyl protease [Populus tremula]
gi|190896632|gb|ACE96829.1| aspartyl protease [Populus tremula]
gi|190896634|gb|ACE96830.1| aspartyl protease [Populus tremula]
gi|190896636|gb|ACE96831.1| aspartyl protease [Populus tremula]
gi|190896638|gb|ACE96832.1| aspartyl protease [Populus tremula]
gi|190896640|gb|ACE96833.1| aspartyl protease [Populus tremula]
gi|190896642|gb|ACE96834.1| aspartyl protease [Populus tremula]
gi|190896644|gb|ACE96835.1| aspartyl protease [Populus tremula]
gi|190896646|gb|ACE96836.1| aspartyl protease [Populus tremula]
gi|190896648|gb|ACE96837.1| aspartyl protease [Populus tremula]
gi|190896650|gb|ACE96838.1| aspartyl protease [Populus tremula]
gi|190896652|gb|ACE96839.1| aspartyl protease [Populus tremula]
gi|190896654|gb|ACE96840.1| aspartyl protease [Populus tremula]
gi|190896656|gb|ACE96841.1| aspartyl protease [Populus tremula]
gi|190896658|gb|ACE96842.1| aspartyl protease [Populus tremula]
Length = 339
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 99/327 (30%), Positives = 140/327 (42%), Gaps = 48/327 (14%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + + GTP Q + +LDT + W PC+ C CSS+ +F+P S++ L C
Sbjct: 45 YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSG---CTGCSST---TFLPNASTTLGSLDCS 98
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
+CS + S AT + SY G + + + L N +I
Sbjct: 99 EAQCSQVRGFSCP---------ATGSSACLFNQSY---GGDSSLAATLVQDAITLANDVI 146
Query: 209 PNFLVGC-SVLS--SRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSS 262
P F GC + +S S P G+ G GRG SL SQ FSYCL S F + S
Sbjct: 147 PGFTFGCINAVSGGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPS--FKSYYFSGS 204
Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
L L +TT L P + PS+ YYV L ++VG +V + + L
Sbjct: 205 LKLGPVGQPKSIRTTPLLRNP--HRPSL---------YYVNLTGVSVGRIKVPIPSEQLV 253
Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
D + GTI+DSGT T ++ + DEF Q+ +LGA CF
Sbjct: 254 FDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQV---NGPISSLGA-----FDTCFAA 305
Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVEN 409
E P + LHF+ G + LP+EN
Sbjct: 306 TNEAEA--PAVTLHFE-GLNLVLPMEN 329
>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 486
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 110/392 (28%), Positives = 162/392 (41%), Gaps = 65/392 (16%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
+ +++ GTP Q I DTGS L W C +C + P F P SS+ + C
Sbjct: 144 FVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAAVHCG 203
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNL-PNR 206
P+C+ L + N T + YLV YG G T G+ +TL L +R
Sbjct: 204 EPQCAAAGD------------LCSEDNTTCL---YLVRYGDGSSTTGVLSRDTLALTSSR 248
Query: 207 IIPNFLVGCSVLSSRQPAGIAGFGR---------GKTSLPSQLNLD---KFSYCLLSHKF 254
+ F GC + + FGR G+ SLPSQ FSYCL S
Sbjct: 249 ALTGFPFGCGTRN------LGDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCLPS--- 299
Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
+ T+ + + +D T YT + P F +Y+V L I +GG +
Sbjct: 300 --SNSTTGYLTIGATPATD--TGAAQYTAMLRKPQ------FPSFYFVELVSIDIGGYVL 349
Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
V T GGT++DSGT T++ + + L D F M + YT A + L
Sbjct: 350 PVPPAVFT-----RGGTLLDSGTVLTYLPAQAYALLRDRFRLTMER---YTPAPPNDVLD 401
Query: 375 GLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVG--EGSAVCLTVVTDREASG 432
C+D GE P + F GA L ++F V+ + + CL + G
Sbjct: 402 A---CYDFAGESEVVVPAVSFRFGDGAVFEL---DFFGVMIFLDENVGCLAFAA-MDTGG 454
Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
P I+GN Q ++ V YD+ +++GF C
Sbjct: 455 LPLSIIGNTQQRSAEVIYDVAAEKIGFVPASC 486
>gi|118482048|gb|ABK92955.1| unknown [Populus trichocarpa]
Length = 425
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 108/382 (28%), Positives = 152/382 (39%), Gaps = 53/382 (13%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + + GTP Q LDT + W PC C CSS+ S S++ + LGC
Sbjct: 90 YIVKANVGTPAQTFLMALDTSNDAAWIPCNG---CVGCSSTVFNSVT---STTFKTLGCD 143
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
P+C + + P CT + YG +T+ L I+
Sbjct: 144 APQCKQVPN-----------PTCGGSTCT-----WNTTYGGSTILSNLTRDTIALSTDIV 187
Query: 209 PNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTTRTSS 262
P + GC + SS P G+ G GRG S SQ L FSYCL S F + +
Sbjct: 188 PGYTFGCIQKTTGSSVPPQGLLGLGRGPLSFLSQTQDLYKSTFSYCLPS--FRTLNFSGT 245
Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
L L KTT P + NP R++ YYV L I VG + V + L
Sbjct: 246 LRLGPAGQPLRIKTT-----PLLKNP---RRSSL---YYVNLIGIRVGRKIVDIPASALA 294
Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
+ GTI DSGT FT + ++ + DEF R +L G C+
Sbjct: 295 FNPTTGAGTIFDSGTVFTRLVAPVYTAVRDEF-------RKRVGNAIVSSLGGFDTCYTG 347
Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
P P + F G VTLP +N GS CL + + ++ N Q
Sbjct: 348 PIVA----PTMTFMFSG-MNVTLPTDNLLIRSTAGSTSCLAMAAAPDNVNSVLNVIANMQ 402
Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
QN+ + +D+ N R+G ++ C
Sbjct: 403 QQNHRILFDVPNSRIGVAREPC 424
>gi|21717169|gb|AAM76362.1|AC074196_20 putative nucleoid DNA binding protein, 3'-partial [Oryza sativa
Japonica Group]
Length = 377
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 99/350 (28%), Positives = 148/350 (42%), Gaps = 55/350 (15%)
Query: 85 SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRL 144
S G Y + + GTPPQ + ++D LVW CT C+ C +P F P SS+ R
Sbjct: 53 SQGLYVANFTIGTPPQPVSAVVDLTGELVWTQCT---PCQPCFEQDLPLFDPTKSSTFRG 109
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLP 204
L C + C I +S+NCT Y +G T G A ++T +
Sbjct: 110 LPCGSHLCESIPE--------------SSRNCTSDVCIYEAPTKAGDTGGKAGTDTFAI- 154
Query: 205 NRIIPNFLVGCSVLSSRQ------PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTT 258
GC V++ ++ P+GI G GR SL +Q+N+ FSYCL
Sbjct: 155 GAAKETLGFGCVVMTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVTAFSYCLAGK------ 208
Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAER-NAFSVYYYVGLRRITVGGQRVRVW 317
SS L G++ + TPFV S N + YY V L I GG ++
Sbjct: 209 --SSGALFLGATAKQLAGGKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKTGGAPLQAA 266
Query: 318 HKYLTLDRDGNGGTI-VDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
+G T+ +D+ + +++A ++ L + T A+G + +
Sbjct: 267 SS--------SGSTVLLDTVSRASYLADGAYKAL----------KKALTAAVGVQPVASP 308
Query: 377 RPCFDV--PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTV 424
+D+ P G PEL F GGA +T+P NY G G+ VCLT+
Sbjct: 309 PKPYDLCFPKAVAGDAPELVFTFDGGAALTVPPANYLLASGNGT-VCLTI 357
>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 507
Score = 99.4 bits (246), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 110/407 (27%), Positives = 173/407 (42%), Gaps = 69/407 (16%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
G Y + G+PP+ +DTGS ++W C++ C S +IP F P S+++ L
Sbjct: 82 GLYFTRVQLGSPPKDFYVQIDTGSDVLWVSCSSCNGCPVTSGLQIPLTFFDPGSSTTAAL 141
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNL 203
+ C + +C+ IQ D L +S+ T C Y YG G T G +++ ++L
Sbjct: 142 VSCSDQRCT----AGIQSSD----SLCSSR--TNQC-GYTFQYGDGSGTSGYYVADLMHL 190
Query: 204 PNRIIPNFLVG-------------CSVL-------SSRQPAGIAGFGRGKTSLPSQLNLD 243
++ + + CS L S R GI GFG+ + S+ SQL
Sbjct: 191 DTLLLSSGELSQICQTYDSSVSFMCSTLQTGDLTKSDRAVDGIFGFGQQEMSVISQLASQ 250
Query: 244 K-----FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSV 298
FS+CL K DD+ L+L + + YTP V PS N +
Sbjct: 251 GITPRVFSHCL---KGDDSGG-GVLVL------GEIVEPNIVYTPLV--PSQPHYNLY-- 296
Query: 299 YYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM 358
L+ I+V GQ + + N GTIVDSGTT ++A ++P S +
Sbjct: 297 -----LQSISVAGQTLAIDPS--VFGASSNQGTIVDSGTTLAYLAEGAYDPFVSAITSVV 349
Query: 359 VKN-RNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEG 417
N R Y L+ C+ V FP++ L+F GGA + L ++Y
Sbjct: 350 SLNARTY--------LSKGNQCYLVTSSVNDVFPQVSLNFAGGASLILNPQDYLLQQNSV 401
Query: 418 SAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ V ++ G ILG+ +++ YD+ NQR+G+ C
Sbjct: 402 GGAAVWCVGFQKTPGQQITILGDLVLKDKIFVYDIANQRVGWTNYDC 448
>gi|384252236|gb|EIE25712.1| acid protease [Coccomyxa subellipsoidea C-169]
Length = 599
Score = 99.4 bits (246), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 120/413 (29%), Positives = 176/413 (42%), Gaps = 79/413 (19%)
Query: 86 YGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC-KYCS-SSKIPSFIPKLSSSSR 143
YG + +L GTP + I+DTGS + + PC + C + C K +F P SSSS
Sbjct: 59 YGYFYATLHLGTPARQFAVIVDTGSTITYVPCAS---CGRNCGPHHKDAAFDPASSSSSA 115
Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL 203
++GC + KC C P S+ + C + G+ +S+ L L
Sbjct: 116 VIGCDSDKCI-----------CGRPPCGCSEK--RECTYQRTYAEQSSSAGLLVSDQLQL 162
Query: 204 PNRIIPNFLVGCSV-----LSSRQPAGIAGFGRGKTSLPSQLN-----LDKFSYCLLSHK 253
+ + + GC + +++ GI G G + SL +QL D F+ C S +
Sbjct: 163 RDGAV-EVVFGCETKETGEIYNQEADGILGLGNSEVSLVNQLAGSGVIDDVFALCFGSVE 221
Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
D +L+L G + + L YT +++ A YY V L + VGGQ+
Sbjct: 222 GD-----GALML--GDVDAAEYDVALQYTALLSSL------AHPHYYSVQLEALWVGGQQ 268
Query: 314 VRVWHKYLTLDRDGNG-GTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
+ V +R G GT++DSGTTFT++ E F+ L E VS Y G +
Sbjct: 269 LPV-----KPERYEEGYGTVLDSGTTFTYLPSEAFQ-LFKEAVSA------YALEHGLNS 316
Query: 373 LTGLRP-----------CFDV---PGEKTGS-----FPELKLHFKGGAEV-TLPVENYFA 412
+ G P CF G S FP +L F G + T P+ F
Sbjct: 317 VKGPDPKEKSFAQFHDICFGGAPHAGHADQSKLEKVFPVFELQFADGVRLRTGPLNYLFM 376
Query: 413 VVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
GE A CL V D ASG +LG +N V+YD RN+R+GF C+
Sbjct: 377 HTGEMGAYCLGVF-DNGASG---TLLGGISFRNILVQYDRRNRRVGFGAASCQ 425
>gi|242117573|dbj|BAH80056.1| hypothetical protein [Oryza sativa Indica Group]
Length = 469
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 104/377 (27%), Positives = 158/377 (41%), Gaps = 30/377 (7%)
Query: 91 ISLSFGTP-PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQN 149
I+++ GTP Q + ++D S+ VW C C +F P S++ L C +
Sbjct: 90 INITVGTPVAQTVSGLVDITSYFVWAQCAPCAAAAGCLPPPATAFRPNGSATFSPLPCSS 149
Query: 150 PKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG--SGLTEGIALSETLNLPNRI 207
C + E+ C A + C SY + YG + T G ++T
Sbjct: 150 DMCLPVLRET-----CGRAGAAANATAGARCDSYSLTYGGSAANTSGYLATDTFTFGATA 204
Query: 208 IPNFLVGCSVLSSRQPAG---IAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLI 264
+P + GCS S AG + G GRG SL SQL KFSY LL+ + D S+I
Sbjct: 205 VPGVVFGCSDASYGDFAGASGVIGIGRGNLSLISQLQFGKFSYQLLAPEATDDGSADSVI 264
Query: 265 LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV-RVWHKYLTL 323
G K G + TP +++ + +YYV L + V G R+ + L
Sbjct: 265 -RFGDDAVPKTKRGRS-TPLLSS------TLYPDFYYVNLTGVRVDGNRLDAIPAGTFDL 316
Query: 324 DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP 383
+G GG I+ S T T++ E A + V V +R A+ A L C++
Sbjct: 317 RANGTGGVILSSTTPVTYL-----EQAAYDVVRAAVASRIGLPAVNGSAALELDLCYNAS 371
Query: 384 GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQM 443
P+L L F GGA++ L NYF + + CLT++ + S +LG
Sbjct: 372 SMAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGGS-----VLGTLLQ 426
Query: 444 QNYYVEYDLRNQRLGFK 460
+ YD+ RL F+
Sbjct: 427 TGTNMIYDVDAGRLTFE 443
>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
Length = 499
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 108/411 (26%), Positives = 174/411 (42%), Gaps = 74/411 (18%)
Query: 85 SYGGYSISLSF-----GTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPK 137
+Y Y + L F G+PP+ +DTGS ++W C + C S IP F P
Sbjct: 74 TYDPYRVGLYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPG 133
Query: 138 LSSSSRLLGCQNPKCSWIHHESIQCRD--CNDEPLATSKNCTQICPSYLVLYGSGL-TEG 194
SS++ L+ C + +CS +Q D C+ + C Y YG G T G
Sbjct: 134 SSSTASLISCSDQRCSL----GVQSSDAGCSSQ----GNQCI-----YTFQYGDGSGTSG 180
Query: 195 IALSETLNLPNRII--------PNFLVGCSV-------LSSRQPAGIAGFGRGKTSLPSQ 239
+S+ LN + I+ + + GCS+ S R GI GFG+ S+ SQ
Sbjct: 181 YYVSDLLNF-DAIVGSSVTNSSASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQ 239
Query: 240 LNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVY 299
++ + + SH +++ D + Y+P V PS +
Sbjct: 240 MSSQGITPKVFSHCLKGDGGGGGILVLGEIVEED-----IVYSPLV--PS-------QPH 285
Query: 300 YYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLAD---EFVS 356
Y + L+ I+V G+ + + + N GTIVDSGTT ++A E ++P E VS
Sbjct: 286 YNLNLQSISVNGKSLAIDPEVFATST--NRGTIVDSGTTLAYLAEEAYDPFVSAITEAVS 343
Query: 357 QMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFA---V 413
Q V+ L+ C+ + G FP + L+F GG + L E+Y
Sbjct: 344 QSVR----------PLLSKGTQCYLITSSVKGIFPTVSLNFAGGVSMNLKPEDYLLQQNS 393
Query: 414 VGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+G+ + C+ ++ G ILG+ +++ YDL QR+G+ C
Sbjct: 394 IGDAAVWCIGF---QKIQGQGITILGDLVLKDKIFVYDLAGQRIGWANYDC 441
>gi|125536523|gb|EAY83011.1| hypothetical protein OsI_38231 [Oryza sativa Indica Group]
Length = 469
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 104/377 (27%), Positives = 158/377 (41%), Gaps = 30/377 (7%)
Query: 91 ISLSFGTP-PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQN 149
I+++ GTP Q + ++D S+ VW C C +F P S++ L C +
Sbjct: 90 INITVGTPVAQTVSGLVDITSYFVWAQCAPCAAAAGCLPPPATAFRPNGSATFSPLPCSS 149
Query: 150 PKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG--SGLTEGIALSETLNLPNRI 207
C + E+ C A + C SY + YG + T G ++T
Sbjct: 150 DMCLPVLRET-----CGRAGAAANATAGARCDSYSLTYGGSAANTSGYLATDTFTFGATA 204
Query: 208 IPNFLVGCSVLSSRQPAG---IAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLI 264
+P + GCS S AG + G GRG SL SQL KFSY LL+ + D S+I
Sbjct: 205 VPGVVFGCSDASYGDFAGASGVIGIGRGNLSLISQLQFGKFSYQLLAPEATDDGSADSVI 264
Query: 265 LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV-RVWHKYLTL 323
G K G + TP +++ + +YYV L + V G R+ + L
Sbjct: 265 -RFGDDAVPKTKRGQS-TPLLSS------TLYPDFYYVNLTGVRVDGNRLDAIPAGTFDL 316
Query: 324 DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP 383
+G GG I+ S T T++ E A + V V +R A+ A L C++
Sbjct: 317 RANGTGGVILSSTTPVTYL-----EQAAYDVVRAAVASRIGLPAVNGSAALELDLCYNAS 371
Query: 384 GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQM 443
P+L L F GGA++ L NYF + + CLT++ + S +LG
Sbjct: 372 SMAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGGS-----VLGTLLQ 426
Query: 444 QNYYVEYDLRNQRLGFK 460
+ YD+ RL F+
Sbjct: 427 TGTNMIYDVDAGRLTFE 443
>gi|242073262|ref|XP_002446567.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
gi|241937750|gb|EES10895.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
Length = 453
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 105/402 (26%), Positives = 162/402 (40%), Gaps = 60/402 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + L GTP +DT S LVW C C C P F P+LSSS ++
Sbjct: 86 GEYLVKLGIGTPQHYFSAAIDTASDLVWLQCQ---PCVSCYRQLDPIFNPRLSSSYAVVP 142
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
C + CS + + +C + +D Q C G+ +T G + L +
Sbjct: 143 CSSDTCSQL--DGHRCDEDDD----------QACRYNYKYSGNAVTNGTLAIDKLAVGGN 190
Query: 207 IIPNFLVGCSVLSSRQP----AGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRT-S 261
+ ++GCS S P +G+ G RG SL SQL++ +F YCL +RT
Sbjct: 191 VFHAVVLGCSDSSVGGPPPQASGLVGLARGPLSLLSQLSVRRFMYCLPPP----MSRTPG 246
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK-- 319
L+L G+ + T +++ + YYY+ + VG Q +
Sbjct: 247 KLVLGAGAGADAVRNVSDRVT-----VTMSSSTRYPSYYYLNFDGLAVGDQTPGTIRRPT 301
Query: 320 ------------YLTLDRDGNG-GTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
N G IVD +T +F+ L++ LAD+ ++ R
Sbjct: 302 SPPATGGGVGGGGGDGGSGANAYGMIVDVASTISFLEASLYDELADDLEEEI----RLPR 357
Query: 367 ALGAEALTGLRPCFDVP---GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLT 423
A + L GL CF +P G P + + F G L +E + +G +CL
Sbjct: 358 ATPSTRL-GLDLCFILPEGVGIDRVYVPTVSMSFDG---RWLELERDRLFLEDGRMMCLM 413
Query: 424 VVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
+ S ILGN+Q QN +V Y+LR ++ F + C
Sbjct: 414 IGRTSGVS-----ILGNYQQQNMHVLYNLRRGKITFAKASCD 450
>gi|414871328|tpg|DAA49885.1| TPA: hypothetical protein ZEAMMB73_545054 [Zea mays]
Length = 565
Score = 99.0 bits (245), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 83/255 (32%), Positives = 118/255 (46%), Gaps = 29/255 (11%)
Query: 215 CSVLSSRQPA-GIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTTRTSSLILDNGSS 270
C V P+ G+ GF RG S PSQ + FSYCL S+K + + T L G +
Sbjct: 333 CVVTGGSVPSQGLVGFNRGPLSFPSQNKNVYGSVFSYCLPSYKSSNFSGTLRL----GPA 388
Query: 271 HSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGG 330
K+ + TP ++NP R + YYV + I VGG+ V V L D G
Sbjct: 389 GQPKR---IKTTPLLSNP---HRPSL---YYVNMVGIRVGGRPVAVPASALAFDPASGHG 439
Query: 331 TIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSF 390
TIVD+GT FT ++ ++ + D F R+ RA A L G C++V T S
Sbjct: 440 TIVDAGTMFTRLSAPVYAAVCDVF-------RSRVRAPVAGPLGGFDTCYNV----TISV 488
Query: 391 PELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI-ILGNFQMQNYYVE 449
P + F G VTLP EN CL + S + ++ + Q QN+ V
Sbjct: 489 PTVTFLFDGRVSVTLPEENVVIRSSLDGIACLAMAAGPSDSVDAVLNVMASMQQQNHRVL 548
Query: 450 YDLRNQRLGFKQQLC 464
+D+ N R+GF ++LC
Sbjct: 549 FDVANGRVGFSRELC 563
>gi|413937238|gb|AFW71789.1| hypothetical protein ZEAMMB73_638381 [Zea mays]
Length = 598
Score = 99.0 bits (245), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 89/286 (31%), Positives = 131/286 (45%), Gaps = 49/286 (17%)
Query: 193 EGIALSETLNLPNRIIPNFLVGCSVL---SSRQPAGIAGFGRGKTSLPSQLNLDK----F 245
+ +AL + ++ ++ + GC + S P G+ GFG G S PSQ N D F
Sbjct: 346 DALALHDDVD----VVAAYTFGCLRVVTGGSVPPQGLVGFGCGPLSFPSQ-NKDVYGFVF 400
Query: 246 SYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLR 305
SYCL S+K + + T L G + K+ + TP ++NP R + YYV +
Sbjct: 401 SYCLPSYKSSNFSSTLRL----GPAGQPKR---IKMTPLLSNP---HRPSL---YYVNMV 447
Query: 306 RITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYT 365
I VGG+ + V L D GTIVD+GT FT ++ ++ + D F R+
Sbjct: 448 GIHVGGRPMLVPASALAFDPASGRGTIVDAGTMFTRLSAPVYAAVRDVF-------RSRV 500
Query: 366 RALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV 425
RA L G C++V T S P + F G VTLP EN CL +
Sbjct: 501 RAPVTGPLGGFDTCYNV----TISVPTVTFSFDGRVSVTLPEENVVIRSSSDGIACLAM- 555
Query: 426 TDREASGGPSI-------ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ GPS +L + Q QN+ V +D+ N R+GF ++LC
Sbjct: 556 -----AAGPSDGVDAVLNVLASMQQQNHRVLFDVANGRVGFSRELC 596
>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
Length = 332
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 104/374 (27%), Positives = 157/374 (41%), Gaps = 56/374 (14%)
Query: 104 FILDTGSHLVWFPCTNHYQCK-YCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQC 162
ILDTGS L W C C YC + P + P +S + + L C + +CS + ++
Sbjct: 1 MILDTGSSLSWLQCQ---PCAVYCHAQADPLYDPSVSKTYKKLSCASVECSRLKAATL-- 55
Query: 163 RDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNL-PNRIIPNFLVGCSVLSS 220
ND T N Y YG + + G + L L ++ +P F GC +
Sbjct: 56 ---NDPLCETDSNACL----YTASYGDTSFSIGYLSQDLLTLTSSQTLPQFTYGCGQDNQ 108
Query: 221 R---QPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTRTSSLILDNGSSHSDK 274
+ AGI G R K S+ +QL+ FSYCL T S S
Sbjct: 109 GLFGRAAGIIGLARDKLSMLAQLSTKYGHAFSYCL-------PTANSGSSGGGFLSIGSI 161
Query: 275 KTTGLTYTPFV---NNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGT 331
T +TP + NPS+ Y++ L ITV G+ + + + T
Sbjct: 162 SPTSYKFTPMLTDSKNPSL---------YFLRLTAITVSGRPLDLAAAMYRVP------T 206
Query: 332 IVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFP 391
++DSGT T + ++ L FV M T+ A A + L CF + + P
Sbjct: 207 LIDSGTVITRLPMSMYAALRQAFVKIMS-----TKYAKAPAYSILDTCFKGSLKSISAVP 261
Query: 392 ELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI-ILGNFQMQNYYVEY 450
E+K+ F+GGA++TL + +G +T + +SG I I+GN Q Q Y + Y
Sbjct: 262 EIKMIFQGGADLTLRAPSILIEADKG----ITCLAFAGSSGTNQIAIIGNRQQQTYNIAY 317
Query: 451 DLRNQRLGFKQQLC 464
D+ R+GF C
Sbjct: 318 DVSTSRIGFAPGSC 331
>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 108/411 (26%), Positives = 174/411 (42%), Gaps = 74/411 (18%)
Query: 85 SYGGYSISLSF-----GTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPK 137
+Y Y + L F G+PP+ +DTGS ++W C + C S IP F P
Sbjct: 59 TYDPYRVGLYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPG 118
Query: 138 LSSSSRLLGCQNPKCSWIHHESIQCRD--CNDEPLATSKNCTQICPSYLVLYGSGL-TEG 194
SS++ L+ C + +CS +Q D C+ + C Y YG G T G
Sbjct: 119 SSSTASLISCSDQRCSL----GVQSSDAGCSSQ----GNQCI-----YTFQYGDGSGTSG 165
Query: 195 IALSETLNLPNRII--------PNFLVGCSV-------LSSRQPAGIAGFGRGKTSLPSQ 239
+S+ LN + I+ + + GCS+ S R GI GFG+ S+ SQ
Sbjct: 166 YYVSDLLNF-DAIVGSSVTNSSASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQ 224
Query: 240 LNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVY 299
++ + + SH +++ D + Y+P V PS +
Sbjct: 225 MSSQGITPKVFSHCLKGDGGGGGILVLGEIVEED-----IVYSPLV--PS-------QPH 270
Query: 300 YYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLAD---EFVS 356
Y + L+ I+V G+ + + + N GTIVDSGTT ++A E ++P E VS
Sbjct: 271 YNLNLQSISVNGKSLAIDPEVFATST--NRGTIVDSGTTLAYLAEEAYDPFVSAITEAVS 328
Query: 357 QMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFA---V 413
Q V+ L+ C+ + G FP + L+F GG + L E+Y
Sbjct: 329 QSVR----------PLLSKGTQCYLITSSVKGIFPTVSLNFAGGVSMNLKPEDYLLQQNS 378
Query: 414 VGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+G+ + C+ ++ G ILG+ +++ YDL QR+G+ C
Sbjct: 379 IGDAAVWCIGF---QKIQGQGITILGDLVLKDKIFVYDLAGQRIGWANYDC 426
>gi|190896608|gb|ACE96817.1| aspartyl protease [Populus tremula]
Length = 339
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 99/327 (30%), Positives = 140/327 (42%), Gaps = 48/327 (14%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + + GTP Q + +LDT + W PC+ C CSS+ +F+P S++ L C
Sbjct: 45 YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSG---CTGCSST---TFLPNASTTLGSLDCS 98
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
+CS + S AT + SY G + + + L N +I
Sbjct: 99 EAQCSQVRGFSCP---------ATGSSACLFNQSY---GGDSSLAATLVQDAITLANDVI 146
Query: 209 PNFLVGC-SVLS--SRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSS 262
P F GC + +S S P G+ G GRG SL SQ FSYCL S F + S
Sbjct: 147 PGFTFGCINAVSGGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPS--FKSYYFSGS 204
Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
L L +TT L P + PS+ YYV L ++VG +V + + L
Sbjct: 205 LKLGPVGQPKSIRTTPLLRNP--HRPSL---------YYVNLTGVSVGRIKVPIPSEQLV 253
Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
D + GTI+DSGT T ++ + DEF Q+ +LGA CF
Sbjct: 254 FDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQV---NGPISSLGA-----FDTCFAE 305
Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVEN 409
E P + LHF+ G + LP+EN
Sbjct: 306 TNEAEA--PAVTLHFE-GLNLVLPMEN 329
>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
Length = 388
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 114/405 (28%), Positives = 174/405 (42%), Gaps = 65/405 (16%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIP--SFIPKLSSSSRLLG 146
Y + G P + +DTGS ++W C C S+ IP + P+ SS++ L+
Sbjct: 2 YFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSLVS 61
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLT-EGIALSETLNLPN 205
C +P C + C+ + NC Y+ YG G T EG + + + N
Sbjct: 62 CSDPLC--VRGRRFAEAQCSQ----ATNNC-----EYIFSYGDGSTSEGYYVRDAMQY-N 109
Query: 206 RIIPN--------FLVGCSV-----LSSRQPA--GIAGFGRGKTSLPSQLNLDKFSYCLL 250
I N L GCS+ LS+ Q A GI GFG+ + S+P+QL + +
Sbjct: 110 VISSNGLANTTSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVF 169
Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
SH + R +++ G + G+TYTP V + SV+Y V LR I+V
Sbjct: 170 SHCLEGEKRGGGILVIGGIAEP-----GMTYTPLVPD---------SVHYNVVLRGISVN 215
Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
R+ + + + D G I+DSGTT + + + FV + R T A
Sbjct: 216 SNRLPIDAEDFSSTNDT--GVIMDSGTTLAYFPSGAY----NVFVQAI---REATSATPV 266
Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYF-----AVVGEGSAVCLTVV 425
CF V G + FP + L+F+GGA + L +NY A G C+
Sbjct: 267 RVQGMDTQCFLVSGRLSDLFPNVTLNFEGGA-MELQPDNYLMWGGTAPTGTTDVWCIGWQ 325
Query: 426 TDREASGGPS-----IILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
+ +S GP ILG+ +++ V YDL N R+G+ CK
Sbjct: 326 SS-SSSAGPKDGSQLTILGDIVLKDKLVVYDLDNSRIGWMSYNCK 369
>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 439
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 133/484 (27%), Positives = 202/484 (41%), Gaps = 68/484 (14%)
Query: 1 MASYISALCL-SFIFFFTLLSI------FPSSITSLTFSLSRFHTNPSQDSYQNLNSLVS 53
MA+ +S L + + IF TL+ I F + + S F+ NP + Q + S V
Sbjct: 1 MAASVSLLAIVTLIFSGTLVPIDAAKDGFTVELINRDSPKSPFY-NPRETPTQRIVSAVR 59
Query: 54 SSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLV 113
S++R H +P + T T + IS+ G Y + S GTP I I DTGS L+
Sbjct: 60 RSMSRVHHF-SPTKNSDIFTDTAQSEMISNQ--GEYLMKFSLGTPAFDILAIADTGSDLI 116
Query: 114 WFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATS 173
W C C C P F PK SS+ R + C +C + + C+ E +
Sbjct: 117 WTQCK---PCDQCYEQDAPLFDPKSSSTYRDISCSTKQCDLLKEGA----SCSGE---GN 166
Query: 174 KNCTQICPSYLVLYGS-GLTEGIALSETLNLPNR-----IIPNFLVGCSVLS----SRQP 223
K C Y YG T G ++T+ L + ++P ++GC + + +
Sbjct: 167 KTC-----HYSYSYGDRSFTSGNVAADTITLGSTSGRPVLLPKAIIGCGHNNGGSFTEKG 221
Query: 224 AGIAGFGRGKTSLPSQLN--LD-KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLT 280
+GI G G G SL SQL +D KFSYCL+ + T +S L N S+ G+
Sbjct: 222 SGIVGLGGGPISLISQLGSTIDGKFSYCLVPLS-SNATNSSKL---NFGSNGIVSGGGVQ 277
Query: 281 YTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFT 340
TP ++ +Y++ L ++VG +R++ G I+DSGTT T
Sbjct: 278 STPLISKDP-------DTFYFLTLEAVSVGSERIKFPGSSFGTSE---GNIIIDSGTTLT 327
Query: 341 FMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGG 400
+ F L+ V V G +L C+ + + FP + HF G
Sbjct: 328 LFPEDFFSELSSA-VQDAVAGTPVEDPSGILSL-----CYSIDADL--KFPSITAHFD-G 378
Query: 401 AEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFK 460
A+V L N F V + + +C + I GN N+ V YDL + + FK
Sbjct: 379 ADVKLNPLNTFVQVSD-TVLCFAFNPINSGA-----IFGNLAQMNFLVGYDLEGKTVSFK 432
Query: 461 QQLC 464
C
Sbjct: 433 PTDC 436
>gi|224057272|ref|XP_002299201.1| predicted protein [Populus trichocarpa]
gi|118483775|gb|ABK93780.1| unknown [Populus trichocarpa]
gi|222846459|gb|EEE84006.1| predicted protein [Populus trichocarpa]
Length = 425
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 108/382 (28%), Positives = 152/382 (39%), Gaps = 53/382 (13%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + + GTP Q LDT + W PC C CSS+ S S++ + LGC
Sbjct: 90 YIVKANVGTPAQTFLMALDTSNDAAWIPCNG---CVGCSSTVFNSVT---STTFKTLGCD 143
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
P+C + + P CT + YG +T+ L I+
Sbjct: 144 APQCKQVPN-----------PTCGGSTCT-----WNTTYGGSTILSNLTRDTIALSTDIV 187
Query: 209 PNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTTRTSS 262
P + GC + SS P G+ G GRG S SQ L FSYCL S F + +
Sbjct: 188 PGYTFGCIQKTTGSSVPPQGLLGLGRGPLSFLSQTQDLYKSTFSYCLPS--FRTLNFSGT 245
Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
L L KTT P + NP R++ YYV L I VG + V + L
Sbjct: 246 LRLGPAGQPLRIKTT-----PLLKNP---RRSSL---YYVNLIGIRVGRKIVDIPASALA 294
Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
+ GTI DSGT FT + ++ + DEF R +L G C+
Sbjct: 295 FNPTTGAGTIFDSGTVFTRLVAPVYTAVRDEF-------RKRVGNAIVSSLGGFDTCYTG 347
Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
P P + F G VTLP +N GS CL + + ++ N Q
Sbjct: 348 PIVA----PTMTFMFSG-MNVTLPPDNLLIRSTAGSTSCLAMAAAPDNVNSVLNVIANMQ 402
Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
QN+ + +D+ N R+G ++ C
Sbjct: 403 QQNHRILFDVPNSRIGVAREPC 424
>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 482
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 107/403 (26%), Positives = 166/403 (41%), Gaps = 62/403 (15%)
Query: 83 SHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSS 140
+ S G Y + G+PP+ +DTGS ++W C +C + IP + K SS
Sbjct: 72 ADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSS 131
Query: 141 SSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLT-EGIALSE 199
+S+ +GC++ CS+I +Q C K C SY V+YG G T +G + +
Sbjct: 132 TSKNVGCEDDFCSFI----MQSETC-----GAKKPC-----SYHVVYGDGSTSDGDFIKD 177
Query: 200 TLNLPN-----RIIP---NFLVGCSVLSSRQPA-------GIAGFGRGKTSLPSQLNLDK 244
+ L R P + GC S Q GI GFG+ TS+ SQL
Sbjct: 178 NITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGG 237
Query: 245 FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGL 304
+ + SH D+ I G S T TP V N V+Y V L
Sbjct: 238 STKRIFSHCLDNMNGGG--IFAVGEVESPVVKT----TPIVPN---------QVHYNVIL 282
Query: 305 RRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVS-QMVKNRN 363
+ + V G + + + +G+GGTI+DSGTT ++ L+ L ++ + Q VK
Sbjct: 283 KGMDVDGDPIDLPPSLAS--TNGDGGTIIDSGTTLAYLPQNLYNSLIEKITAKQQVKLHM 340
Query: 364 YTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLT 423
CF +FP + LHF+ ++++ +Y + E C
Sbjct: 341 VQETFA---------CFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYLFSLRE-DMYCFG 390
Query: 424 VVTD--REASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ G I+LG+ + N V YDL N+ +G+ C
Sbjct: 391 WQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEVIGWADHNC 433
>gi|449527083|ref|XP_004170542.1| PREDICTED: LOW QUALITY PROTEIN: basic 7S globulin-like [Cucumis
sativus]
Length = 432
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 114/423 (26%), Positives = 176/423 (41%), Gaps = 80/423 (18%)
Query: 81 ISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSS 140
++ H G Y + TP + +D G +W C Y +SS
Sbjct: 34 VTKHPSGQYITQIRQRTPLVPVKLTVDLGGQFMWVDCDRGY----------------VSS 77
Query: 141 SSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQIC---PSYLVLYGSGLTEGIAL 197
S + + C++ +CS +S C DC P N T C P ++ S T G
Sbjct: 78 SYKPVRCRSAQCSL--SKSTSCGDCFSPPXPGCNNNT--CGHFPGNTIIQLS--TSGEVT 131
Query: 198 SETLNL-------PNRI--IPNFLVGCS---VLS--SRQPAGIAGFGRGKTSLPSQLNLD 243
S+ L++ P R IPNFL C +L + +G+AGFGR SLPSQ +
Sbjct: 132 SDVLSVSSTNGFNPTRAVSIPNFLFVCGPTFLLEGLAGGVSGMAGFGRTGISLPSQFSAA 191
Query: 244 -----KFSYCLLSHKFDDTTRTSSLILD-NGSSH---SDKKTTGLTYTPFVNNP----SV 290
KF+ CL +TR+ +I NG H + T LTYTP NP V
Sbjct: 192 FSFNRKFAVCL-----SGSTRSPGVIFSGNGPYHFLQNVDVTKSLTYTPLFINPVSTAGV 246
Query: 291 AERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPL 350
+ S Y++G++ I + V + L +D +GNGGT + + +T + ++ L
Sbjct: 247 STSGEKSSEYFIGVKSIVFNSKTVPINTTLLKIDSNGNGGTKISTVHPYTVLESSIYNAL 306
Query: 351 ADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVEN- 409
++ RN R A+ C+ K+ SF +L G + L ++N
Sbjct: 307 VKTITREL---RNIPR---VAAVAPFGVCY-----KSKSFGSTRLG-PGMPSIDLILQNK 354
Query: 410 --YFAVVGEGSAV-------CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFK 460
+ + G S V CL V D +I++G +QM++ +E+DL RLGF
Sbjct: 355 KVIWRIFGANSMVQVNEEVLCLGFV-DGGVEARTAIVIGAYQMEDNLLEFDLATSRLGFS 413
Query: 461 QQL 463
L
Sbjct: 414 STL 416
>gi|383143497|gb|AFG53176.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
gi|383143499|gb|AFG53177.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
gi|383143505|gb|AFG53180.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
gi|383143513|gb|AFG53184.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
gi|383143515|gb|AFG53185.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
Length = 135
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 55/153 (35%), Positives = 81/153 (52%), Gaps = 18/153 (11%)
Query: 247 YCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
YCL D +S +++ N + D LTYTP + NP + +YY+GL
Sbjct: 1 YCL-----DYVNNSSKIVVGNKAVPGD---ISLTYTPLIINP------IYPFFYYLGLEA 46
Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
+++G +R+ + T D GNGGTI+DSGT+FT ++ +A EF SQ+ Y R
Sbjct: 47 VSIGRKRLNLPFNSATFDSKGNGGTIIDSGTSFTIFPEAMYSQIAGEFASQI----GYKR 102
Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKG 399
GAE+ T L C++V G + FP+ HFKG
Sbjct: 103 VPGAESTTALGLCYNVSGVENIQFPQFAFHFKG 135
>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 110/391 (28%), Positives = 166/391 (42%), Gaps = 61/391 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y +++ GTP + I DTGS + W C K C K P P S+S + +
Sbjct: 117 GDYVVTVGLGTPKKEFTLIFDTGSDITWTQC--EPCVKTCYKQKEPRLNPSTSTSYKNIS 174
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPS----YLVLYGSG-LTEGIALSETL 201
C + C + A+ K +Q C S Y V YG G + G +ETL
Sbjct: 175 CSSALCKLV---------------ASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETL 219
Query: 202 NL-PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLN---LDKFSYCLLSHKF 254
L + + NFL GC ++ AG+ G GR K +LPSQ FSYCL +
Sbjct: 220 TLSSSNVFKNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCLPAS-- 277
Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
++ L L S S K T P A+ ++ + +Y + + ++VGG++
Sbjct: 278 --SSSKGYLSLGGQVSKSVKFT-----------PLSADFDS-TPFYGLDITGLSVGGRK- 322
Query: 315 RVWHKYLTLDRDG-NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
L++D + GT++DSGT T ++P + L+ F + M +Y G
Sbjct: 323 ------LSIDESAFSAGTVIDSGTVITRLSPTAYSELSSAFQNLMT---DYPSTSGYSI- 372
Query: 374 TGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
C+D T P++ + FKGG E+ + V V VCL + + S
Sbjct: 373 --FDTCYDFSKYDTVRIPKVGVTFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDS-- 428
Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ I GN Q + Y V YD R+GF C
Sbjct: 429 DTSIFGNVQQRTYQVVYDGAKGRVGFAPGGC 459
>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
Length = 472
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 110/391 (28%), Positives = 166/391 (42%), Gaps = 61/391 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y +++ GTP + I DTGS + W C K C K P P S+S + +
Sbjct: 129 GDYVVTVGLGTPKKEFTLIFDTGSDITWTQC--EPCVKTCYKQKEPRLNPSTSTSYKNIS 186
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPS----YLVLYGSG-LTEGIALSETL 201
C + C + A+ K +Q C S Y V YG G + G +ETL
Sbjct: 187 CSSALCKLV---------------ASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETL 231
Query: 202 NL-PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLN---LDKFSYCLLSHKF 254
L + + NFL GC ++ AG+ G GR K +LPSQ FSYCL +
Sbjct: 232 TLSSSNVFKNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCLPAS-- 289
Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
++ L L S S K T P A+ ++ + +Y + + ++VGG++
Sbjct: 290 --SSSKGYLSLGGQVSKSVKFT-----------PLSADFDS-TPFYGLDITGLSVGGRK- 334
Query: 315 RVWHKYLTLDRDG-NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
L++D + GT++DSGT T ++P + L+ F + M +Y G
Sbjct: 335 ------LSIDESAFSAGTVIDSGTVITRLSPTAYSELSSAFQNLMT---DYPSTSGYSI- 384
Query: 374 TGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
C+D T P++ + FKGG E+ + V V VCL + + S
Sbjct: 385 --FDTCYDFSKYDTVRIPKVGVTFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDS-- 440
Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ I GN Q + Y V YD R+GF C
Sbjct: 441 DTSIFGNVQQRTYQVVYDGAKGRVGFAPGGC 471
>gi|413937239|gb|AFW71790.1| hypothetical protein ZEAMMB73_638381 [Zea mays]
Length = 537
Score = 98.2 bits (243), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 89/286 (31%), Positives = 131/286 (45%), Gaps = 49/286 (17%)
Query: 193 EGIALSETLNLPNRIIPNFLVGCSVL---SSRQPAGIAGFGRGKTSLPSQLNLDK----F 245
+ +AL + ++ ++ + GC + S P G+ GFG G S PSQ N D F
Sbjct: 285 DALALHDDVD----VVAAYTFGCLRVVTGGSVPPQGLVGFGCGPLSFPSQ-NKDVYGFVF 339
Query: 246 SYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLR 305
SYCL S+K + + T L G + K+ + TP ++NP R + YYV +
Sbjct: 340 SYCLPSYKSSNFSSTLRL----GPAGQPKR---IKMTPLLSNP---HRPSL---YYVNMV 386
Query: 306 RITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYT 365
I VGG+ + V L D GTIVD+GT FT ++ ++ + D F R+
Sbjct: 387 GIHVGGRPMLVPASALAFDPASGRGTIVDAGTMFTRLSAPVYAAVRDVF-------RSRV 439
Query: 366 RALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV 425
RA L G C++V T S P + F G VTLP EN CL +
Sbjct: 440 RAPVTGPLGGFDTCYNV----TISVPTVTFSFDGRVSVTLPEENVVIRSSSDGIACLAM- 494
Query: 426 TDREASGGPSI-------ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ GPS +L + Q QN+ V +D+ N R+GF ++LC
Sbjct: 495 -----AAGPSDGVDAVLNVLASMQQQNHRVLFDVANGRVGFSRELC 535
>gi|20160862|dbj|BAB89801.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 488
Score = 98.2 bits (243), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 102/380 (26%), Positives = 155/380 (40%), Gaps = 51/380 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y S GTPPQ + LD S LVW C F P S++ +
Sbjct: 98 GMYVFSYGIGTPPQQVSGALDISSDLVWTAC-----------GATAPFNPVRSTTVADVP 146
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL--TEGIALSETLNLP 204
C + C ++ C A + C +Y +YG G T G+ +E
Sbjct: 147 CTDDACQQFAPQT-----CG----AGASEC-----AYTYMYGGGAANTTGLLGTEAFTFG 192
Query: 205 NRIIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
+ I + GC +V +G+ G GRG SL SQL +D+FSY DD+ T
Sbjct: 193 DTRIDGVVFGCGLKNVGDFSGVSGVIGLGRGNLSLVSQLQVDRFSYHFAP---DDSVDTQ 249
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
S IL D T ++T + + +A YYV L I V G+ + +
Sbjct: 250 SFIL-----FGDDATPQTSHT---LSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTF 301
Query: 322 TL-DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
L ++DG+GG + T + ++PL Q V ++ A+ AL GL C+
Sbjct: 302 DLRNKDGSGGVFLSITDLVTVLEEAAYKPL-----RQAVASKIGLPAVNGSAL-GLDLCY 355
Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
P + L F GGA + L + NYF + CLT++ +S G +LG+
Sbjct: 356 TGESLAKAKVPSMALVFAGGAVMELELGNYFYMDSTTGLACLTIL---PSSAGDGSVLGS 412
Query: 441 FQMQNYYVEYDLRNQRLGFK 460
++ YD+ +L F+
Sbjct: 413 LIQVGTHMMYDINGSKLVFE 432
>gi|449432733|ref|XP_004134153.1| PREDICTED: basic 7S globulin-like [Cucumis sativus]
Length = 432
Score = 98.2 bits (243), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 114/423 (26%), Positives = 176/423 (41%), Gaps = 80/423 (18%)
Query: 81 ISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSS 140
++ H G Y + TP + +D G +W C Y +SS
Sbjct: 34 VTKHPSGQYITQIRQRTPLVPVKLTVDLGGQFMWVDCDRGY----------------VSS 77
Query: 141 SSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQIC---PSYLVLYGSGLTEGIAL 197
S + + C++ +CS +S C DC P N T C P ++ S T G
Sbjct: 78 SYKPVRCRSAQCSL--SKSTSCGDCFSPPRPGCNNNT--CGHFPGNTIIQLS--TSGEVT 131
Query: 198 SETLNL-------PNRI--IPNFLVGCS---VLS--SRQPAGIAGFGRGKTSLPSQLNL- 242
S+ L++ P R IPNFL C +L + +G+AGFGR SLPSQ +
Sbjct: 132 SDVLSVSSTNGFNPTRAVSIPNFLFVCGPTFLLEGLAGGVSGMAGFGRTGISLPSQFSAA 191
Query: 243 ----DKFSYCLLSHKFDDTTRTSSLILD-NGSSH---SDKKTTGLTYTPFVNNP----SV 290
KF+ CL +TR+ +I NG H + T LTYTP NP V
Sbjct: 192 FSFNRKFAVCL-----SGSTRSPGVIFSGNGPYHFLQNVDVTKSLTYTPLFINPVSTAGV 246
Query: 291 AERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPL 350
+ S Y++G++ I + V + L +D +GNGGT + + +T + ++ L
Sbjct: 247 STSGEKSSEYFIGVKSIVFNSKTVPINTTLLKIDSNGNGGTKISTVHPYTVLESSIYNAL 306
Query: 351 ADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVEN- 409
++ RN R A+ C+ K+ SF +L G + L ++N
Sbjct: 307 VKTITREL---RNIPR---VAAVAPFGVCY-----KSKSFGSTRLG-PGMPSIDLILQNK 354
Query: 410 --YFAVVGEGSAV-------CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFK 460
+ + G S V CL V D +I++G +QM++ +E+DL RLGF
Sbjct: 355 KVIWRIFGANSMVQVNEEVLCLGFV-DGGVEARTAIVIGAYQMEDNLLEFDLATSRLGFS 413
Query: 461 QQL 463
L
Sbjct: 414 STL 416
>gi|356528623|ref|XP_003532899.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 507
Score = 98.2 bits (243), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 80/268 (29%), Positives = 116/268 (43%), Gaps = 38/268 (14%)
Query: 210 NFLVGCS------VLSSRQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTRT 260
N +GC+ V + GI G G K S + + KFSYCL+ H + R
Sbjct: 264 NLTIGCTKSMENGVNFNEDTGGILGLGFAKDSFIDKAAYEYGAKFSYCLVDHL---SHRN 320
Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR----V 316
S L G H+ K + T + F +Y V + I++GGQ ++ V
Sbjct: 321 VSSYLTIGGHHNAKLLGEIKRTELI---------LFPPFYGVNVVGISIGGQMLKIPPQV 371
Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
W D + GGT++DSGTT T + +EP+ + + + K + T E L
Sbjct: 372 W------DFNSQGGTLIDSGTTLTALLVPAYEPVFEALIKSLTKVKRVT----GEDFGAL 421
Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
CFD G P L HF GGA PV++Y V C+ +V + GG S+
Sbjct: 422 DFCFDAEGFDDSVVPRLVFHFAGGARFEPPVKSYIIDVAP-LVKCIGIVP-IDGIGGASV 479
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I GN QN+ E+DL +GF +C
Sbjct: 480 I-GNIMQQNHLWEFDLSTNTIGFAPSIC 506
>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 98.2 bits (243), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 108/424 (25%), Positives = 174/424 (41%), Gaps = 69/424 (16%)
Query: 62 IKNPQTKTTTTTTTTTTTNISSHSY------GGYSISLSFGTPPQIIPFILDTGSHLVWF 115
I NP + T+ +N Y G Y+ L GTPPQ I+DTGS + +
Sbjct: 50 ISNPHRRLRQFPTSDNLSNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYV 109
Query: 116 PCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKN 175
PC+ C+ C + P F P+ SS+ + + C N C + +QC +
Sbjct: 110 PCST---CEQCGRHQDPKFDPESSSTYKPIKC-NIDC-ICDSDGVQC--------VYERQ 156
Query: 176 CTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLS-----SRQPAGIAGFG 230
++ S VL ++ G +++ +P R + GC + S++ GI G G
Sbjct: 157 YAEMSTSSGVLGEDVISFG---NQSELIPQRAV----FGCENMETGDLFSQRADGIMGLG 209
Query: 231 RGKTSLPSQLNL-----DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFV 285
G SL QL D FS C +++L S SD TY+ V
Sbjct: 210 TGDLSLVDQLVEKGAINDSFSLCYGGMDIG----GGAMVLGGISPPSDMI---FTYSDPV 262
Query: 286 NNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPE 345
+P YY V L+ I V G+++ + DG G ++DSGTT+ ++ E
Sbjct: 263 RSP----------YYNVDLKEIHVAGKKLPLSSGIF----DGRYGAVLDSGTTYAYLPAE 308
Query: 346 LFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPG----EKTGSFPELKLHFKGGA 401
F D + ++ + + + CF G E + FP + + F+ G
Sbjct: 309 AFSAFKDAIMDEI----HSLKKIDGPDPNFKDICFSGAGSDAAELSNKFPTVDMVFENGQ 364
Query: 402 EVTLPVENYFAVVGE-GSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFK 460
+++L ENYF + A CL + E + +LG ++N V YD N ++GF
Sbjct: 365 KLSLTPENYFFRHSKVHGAYCLGIF---ENGNDQTTLLGGIVVRNTLVMYDRANSKIGFW 421
Query: 461 QQLC 464
+ C
Sbjct: 422 KTNC 425
>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
Length = 395
Score = 98.2 bits (243), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 118/413 (28%), Positives = 174/413 (42%), Gaps = 75/413 (18%)
Query: 85 SYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIP--SFIPKLSSSS 142
S G Y + G P + +DTGS ++W C C S+ IP + P+ SS++
Sbjct: 25 SGGLYFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTT 84
Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLT-EGIALSETL 201
L+ C +P C + C+ T+ NC Y+ YG G T EG + + +
Sbjct: 85 SLVSCSDPLC--VRGRRFAEAQCSQ----TTNNC-----EYIFSYGDGSTSEGYYVRDAM 133
Query: 202 NLPNRIIPN--------FLVGCSV-----LSSRQPA--GIAGFGRGKTSLPSQLNLDK-- 244
N I N L GCS+ LS+ Q A GI GFG+ + S+P+QL +
Sbjct: 134 QY-NVISSNGLANTTSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNI 192
Query: 245 ---FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYY 301
FS+CL K I + G+TYTP V + SV+Y
Sbjct: 193 PRVFSHCLEGEKRGGGILVIGGIAE----------PGMTYTPLVPD---------SVHYN 233
Query: 302 VGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN 361
V LR I+V R+ + + + D G I+DSGTT + + + FV +
Sbjct: 234 VVLRGISVNSNRLPIDAEDFSSTNDT--GVIMDSGTTLAYFPSGAY----NVFVQAI--- 284
Query: 362 RNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYF-----AVVGE 416
R T A CF V G + FP + L+F+GGA + L +NY A G
Sbjct: 285 REATSATPVRVQGMDTQCFLVSGRLSDLFPNVTLNFEGGA-MELQPDNYLMWGGTAPTGT 343
Query: 417 GSAVCLTVVTDREASGGPS-----IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
C+ + +S GP ILG+ +++ V YDL N R+G+ C
Sbjct: 344 TDVWCIGWQS-SSSSAGPKDGSQLTILGDIVLKDKLVVYDLDNSRIGWMSYNC 395
>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 488
Score = 98.2 bits (243), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 112/387 (28%), Positives = 163/387 (42%), Gaps = 49/387 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + GTP + + I DTGS L W C + Y I F P S+S +
Sbjct: 143 GNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDAI--FDPSKSTSYSNIT 200
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNL-P 204
C + C+ + + C+ A++K C Y + YG S + G E L++
Sbjct: 201 CTSTLCTQLSTATGNEPGCS----ASTKACI-----YGIQYGDSSFSVGYFSRERLSVTA 251
Query: 205 NRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKFDDTT 258
I+ NFL GC + AG+ G GR S Q + FSYCL T
Sbjct: 252 TDIVDNFLFGCGQNNQGLFGGSAGLIGLGRHPISFVQQTAAVYRKIFSYCL------PAT 305
Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
+S+ L G++ T+ + YTPF S R S +Y + + I+VGG ++ V
Sbjct: 306 SSSTGRLSFGTT----TTSYVKYTPF----STISRG--SSFYGLDITGISVGGAKLPVSS 355
Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
+ GG I+DSGT T + P + L F M K + A L+ L
Sbjct: 356 STFS-----TGGAIIDSGTVITRLPPTAYTALRSAFRQGMSKYPS------AGELSILDT 404
Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
C+D+ G + S P++ F GG V LP + V VCL + + S I
Sbjct: 405 CYDLSGYEVFSIPKIDFSFAGGVTVQLPPQGIL-YVASAKQVCLAFAANGDDS--DVTIY 461
Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLCK 465
GN Q + V YD+ R+GF CK
Sbjct: 462 GNVQQKTIEVVYDVGGGRIGFGAGGCK 488
>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 108/424 (25%), Positives = 174/424 (41%), Gaps = 69/424 (16%)
Query: 62 IKNPQTKTTTTTTTTTTTNISSHSY------GGYSISLSFGTPPQIIPFILDTGSHLVWF 115
I NP + T+ +N Y G Y+ L GTPPQ I+DTGS + +
Sbjct: 50 ISNPHRRLRQFPTSDNLSNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYV 109
Query: 116 PCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKN 175
PC+ C+ C + P F P+ SS+ + + C N C + +QC +
Sbjct: 110 PCST---CEQCGRHQDPKFDPESSSTYKPIKC-NIDC-ICDSDGVQC--------VYERQ 156
Query: 176 CTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLS-----SRQPAGIAGFG 230
++ S VL ++ G +++ +P R + GC + S++ GI G G
Sbjct: 157 YAEMSTSSGVLGEDVISFG---NQSELIPQRAV----FGCENMETGDLFSQRADGIMGLG 209
Query: 231 RGKTSLPSQLNL-----DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFV 285
G SL QL D FS C +++L S SD TY+ V
Sbjct: 210 TGDLSLVDQLVEKGAINDSFSLCYGGMDIG----GGAMVLGGISPPSDMI---FTYSDPV 262
Query: 286 NNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPE 345
+P YY V L+ I V G+++ + DG G ++DSGTT+ ++ E
Sbjct: 263 RSP----------YYNVDLKEIHVAGKKLPLSSGIF----DGRYGAVLDSGTTYAYLPAE 308
Query: 346 LFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPG----EKTGSFPELKLHFKGGA 401
F D + ++ + + + CF G E + FP + + F+ G
Sbjct: 309 AFSAFKDAIMDEI----HSLKKIDGPDPNFKDICFSGAGSDAAELSNKFPTVDMVFENGQ 364
Query: 402 EVTLPVENYFAVVGE-GSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFK 460
+++L ENYF + A CL + E + +LG ++N V YD N ++GF
Sbjct: 365 KLSLTPENYFFRHSKVHGAYCLGIF---ENGNDQTTLLGGIVVRNTLVMYDRANSKIGFW 421
Query: 461 QQLC 464
+ C
Sbjct: 422 KTNC 425
>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
Length = 478
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 107/403 (26%), Positives = 166/403 (41%), Gaps = 62/403 (15%)
Query: 83 SHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSS 140
+ S G Y + G+PP+ +DTGS ++W C +C + IP + K SS
Sbjct: 68 ADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSS 127
Query: 141 SSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLT-EGIALSE 199
+S+ +GC++ CS+I +Q C K C SY V+YG G T +G + +
Sbjct: 128 TSKNVGCEDDFCSFI----MQSETC-----GAKKPC-----SYHVVYGDGSTSDGDFIKD 173
Query: 200 TLNLPN-----RIIP---NFLVGCSVLSSRQPA-------GIAGFGRGKTSLPSQLNLDK 244
+ L R P + GC S Q GI GFG+ TS+ SQL
Sbjct: 174 NITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGG 233
Query: 245 FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGL 304
+ + SH D+ I G S T TP V N V+Y V L
Sbjct: 234 STKRIFSHCLDNMNGGG--IFAVGEVESPVVKT----TPIVPN---------QVHYNVIL 278
Query: 305 RRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVS-QMVKNRN 363
+ + V G + + + +G+GGTI+DSGTT ++ L+ L ++ + Q VK
Sbjct: 279 KGMDVDGDPIDLPPSLAS--TNGDGGTIIDSGTTLAYLPQNLYNSLIEKITAKQQVKLHM 336
Query: 364 YTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLT 423
CF +FP + LHF+ ++++ +Y + E C
Sbjct: 337 VQETFA---------CFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYLFSLRE-DMYCFG 386
Query: 424 VVTD--REASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ G I+LG+ + N V YDL N+ +G+ C
Sbjct: 387 WQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEVIGWADHNC 429
>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 640
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 110/401 (27%), Positives = 165/401 (41%), Gaps = 79/401 (19%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y+ L GTPPQ I+DTGS + + PC+ CK+C S + P F P+ S +
Sbjct: 91 GYYTTRLWIGTPPQRFALIVDTGSTVTYVPCST---CKHCGSHQDPKFRPEASET----- 142
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLN---- 202
Q KC+W QC +C+D+ K CT Y Y T L E +
Sbjct: 143 YQPVKCTW------QC-NCDDD----RKQCT-----YERRYAEMSTSSGVLGEDVVSFGN 186
Query: 203 ----LPNRIIPNFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQLNL-----DKFSYC 248
P R I GC + +++ GI G GRG S+ QL D FS C
Sbjct: 187 QSELSPQRAI----FGCENDETGDIYNQRADGIMGLGRGDLSIMDQLVEKKVISDAFSLC 242
Query: 249 LLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
I S T++ V +P YY + L+ I
Sbjct: 243 YGGMGVGGGAMVLGGI-------SPPADMVFTHSDPVRSP----------YYNIDLKEIH 285
Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
V G+R+ + K DG GT++DSGTT+ ++ F LA F ++K + + +
Sbjct: 286 VAGKRLHLNPKVF----DGKHGTVLDSGTTYAYLPESAF--LA--FKHAIMKETHSLKRI 337
Query: 369 GAEALTGLRPCFDVP----GEKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLT 423
CF + + SFP +++ F G +++L ENY F A CL
Sbjct: 338 SGPDPHYNDICFSGAEINVSQLSKSFPVVEMVFGNGHKLSLSPENYLFRHSKVRGAYCLG 397
Query: 424 VVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
V ++ P+ +LG ++N V YD + ++GF + C
Sbjct: 398 VFSN---GNDPTTLLGGIVVRNTLVMYDREHSKIGFWKTNC 435
>gi|212274314|ref|NP_001130524.1| uncharacterized protein LOC100191623 [Zea mays]
gi|194689376|gb|ACF78772.1| unknown [Zea mays]
gi|224031455|gb|ACN34803.1| unknown [Zea mays]
gi|238011528|gb|ACR36799.1| unknown [Zea mays]
gi|238015454|gb|ACR38762.1| unknown [Zea mays]
Length = 304
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 79/249 (31%), Positives = 111/249 (44%), Gaps = 28/249 (11%)
Query: 224 AGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTG-LTYT 282
+GI GFGR SL SQL++ +FSYCL S+ + R S+L+ + S TG + T
Sbjct: 76 SGIVGFGRNPLSLVSQLSIRRFSYCLTSYA---SRRQSTLLFGSLSDGVYGDATGRVQTT 132
Query: 283 PFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFM 342
P + +P +YYV +TVG +R+R+ L DG+GG IVDSGT T +
Sbjct: 133 PLLQSPQNP------TFYYVHFTGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALTLL 186
Query: 343 APELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGS-------FPELKL 395
+ + F Q+ A G G+ CF VP S P + L
Sbjct: 187 PAAVLAEVVRAFRQQL----RLPFANGGNPEDGV--CFLVPAAWRRSSSTSQMPVPRMVL 240
Query: 396 HFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQ 455
HF+ GA++ LP NY +CL + SG +GN Q+ V YDL +
Sbjct: 241 HFQ-GADLDLPRRNYVLDDHRRGRLCLLLAD----SGDDGSTIGNLVQQDMRVLYDLEAE 295
Query: 456 RLGFKQQLC 464
L C
Sbjct: 296 TLSIAPARC 304
>gi|125572774|gb|EAZ14289.1| hypothetical protein OsJ_04213 [Oryza sativa Japonica Group]
Length = 492
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 95/351 (27%), Positives = 143/351 (40%), Gaps = 52/351 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y +S S GTPPQ++ +LD S VW C+ C C + P +S+
Sbjct: 95 GMYVLSFSVGTPPQVVTGVLDITSDFVWMQCS---ACATCGADA-----PAATSAP---- 142
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL---TEGIALSETLNL 203
P +++ +D T+ C Y +YG G T G+ +
Sbjct: 143 ---PFYAFLSF--------HDTRAPTTPPC-----GYSYVYGGGAANTTAGLLAVDAFAF 186
Query: 204 PNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSL 263
+ GC+V + G+ G GRG+ S SQL + +FSY L DD S
Sbjct: 187 ATVRADGVIFGCAVATEGDIGGVIGLGRGELSPVSQLQIGRFSYYLAP---DDAVDVGSF 243
Query: 264 I--LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
I LD+ + + V+ P VA R + S+ YYV L I V G+ + +
Sbjct: 244 ILFLDDAKPRTSRA---------VSTPLVASRASRSL-YYVELAGIRVDGEDLAIPRGTF 293
Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
L DG+GG ++ TF+ + A + V Q + ++ RA L GL C+
Sbjct: 294 DLQADGSGGVVLSITIPVTFL-----DAGAYKVVRQAMASKIELRAADGSEL-GLDLCYT 347
Query: 382 VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG 432
T P + L F GGA + L + NYF + CLT++ G
Sbjct: 348 SESLATAKVPSMALVFAGGAVMELEMGNYFYMDSTTGLECLTILPSPAGDG 398
>gi|222624645|gb|EEE58777.1| hypothetical protein OsJ_10300 [Oryza sativa Japonica Group]
Length = 431
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 105/392 (26%), Positives = 156/392 (39%), Gaps = 42/392 (10%)
Query: 90 SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQN 149
++ ++ GTPPQ + +LDTGS L W C Y S L C
Sbjct: 56 TVPVAVGTPPQNVTMVLDTGSELSWLLCNGSYAPPLTRRSTRRWRGRDLPVPPF---CDT 112
Query: 150 PKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIP 209
P S CR A+S + ++L+ G+ A + +
Sbjct: 113 PP-------SNACRVSLSYADASSADGVLATDTFLLTGGAPPVAVGAYFGCITSYSSTTA 165
Query: 210 NFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGS 269
G S G+ G RG S +Q +F+YC+ + L+ D+G
Sbjct: 166 TNSNGTGTDVSEAATGLLGMNRGTLSFVTQTGTRRFAYCIAPGEGPGVL----LLGDDGG 221
Query: 270 SHSDKKTTGLTYTPFVNNPSVAERNAF--SVYYYVGLRRITVGGQRVRVWHKYLTLDRDG 327
L YTP + +++ + V Y V L I VG + + LT D G
Sbjct: 222 VAPP-----LNYTPLIE---ISQPLPYFDRVAYSVQLEGIRVGCALLPIPKSVLTPDHTG 273
Query: 328 NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT---GLRPCFDVPG 384
G T+VDSGT FTF+ + + L EF SQ R LG CF P
Sbjct: 274 AGQTMVDSGTQFTFLLADAYAALKAEFTSQA---RLLLAPLGEPGFVFQGAFDACFRGPE 330
Query: 385 EK----TGSFPELKLHFKGGAEVTLPVENYFAVV-----GEGSAVCLTVVT--DREASGG 433
+ +G PE+ L + GAEV + E +V GEG A + +T + + +G
Sbjct: 331 ARVAAASGLLPEVGLVLR-GAEVAVSGEKLLYMVPGERRGEGGAEAVWCLTFGNSDMAGM 389
Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
+ ++G+ QN +VEYDL+N R+GF C
Sbjct: 390 SAYVIGHHHQQNVWVEYDLQNGRVGFAPARCD 421
>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 490
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 113/387 (29%), Positives = 160/387 (41%), Gaps = 48/387 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + GTP + + I DTGS L W C + Y I F P S+S +
Sbjct: 144 GNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDVI--FDPSKSTSYSNIT 201
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNL-P 204
C + C+ + + C+ A++K C Y + YG S + G E L +
Sbjct: 202 CTSALCTQLSTATGNDPGCS----ASTKACI-----YGIQYGDSSFSVGYFSRERLTVTA 252
Query: 205 NRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLDK---FSYCLLSHKFDDTT 258
++ NFL GC + AG+ G GR S Q FSYCL S T
Sbjct: 253 TDVVDNFLFGCGQNNQGLFGGSAGLIGLGRHPISFVQQTAAKYRKIFSYCLPS------T 306
Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
+S+ L G + + + L YTPF S R S +Y + + I VGG ++ V
Sbjct: 307 SSSTGHLSFGPAATGRY---LKYTPF----STISRG--SSFYGLDITAIAVGGVKLPVSS 357
Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
+ GG I+DSGT T + P + L F M K + A L+ L
Sbjct: 358 STFS-----TGGAIIDSGTVITRLPPTAYGALRSAFRQGMSKYPS------AGELSILDT 406
Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
C+D+ G K S P ++ F GG V LP + V VCL + + S I
Sbjct: 407 CYDLSGYKVFSIPTIEFSFAGGVTVKLPPQGIL-FVASTKQVCLAFAANGDDS--DVTIY 463
Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLCK 465
GN Q + V YD+ R+GF CK
Sbjct: 464 GNVQQRTIEVVYDVGGGRIGFGAGGCK 490
>gi|388508700|gb|AFK42416.1| unknown [Lotus japonicus]
Length = 440
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 103/402 (25%), Positives = 178/402 (44%), Gaps = 76/402 (18%)
Query: 97 TPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSR-----LLGCQNPK 151
TP + LD G +W C N +Y SS+ F P SS+ L GC K
Sbjct: 59 TPLVPVKLTLDLGGGYLWVNCENR---QYVSST----FKPARCGSSQCSLFGLTGCSGDK 111
Query: 152 CSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNF 211
C P T + + + T+G ++ +++PN + F
Sbjct: 112 I------------CGRSPSNTVTGVSSYGDIHSDVVSVNSTDGTTPTKVVSVPNFL---F 156
Query: 212 LVGCSVLS---SRQPAGIAGFGRGKTSLPSQLNLD-----KFSYCLLSHKFDDTTRTSSL 263
+ G V+ ++ G+AG GR + SLPSQ + KF+ CL ++ D +
Sbjct: 157 ICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHRKFAICLTANSGADGV----M 212
Query: 264 ILDNGSSHSDKKTTG-LTYTPFVNNPSVAERNAF----SVYYYVGLRRITVGGQRVRVWH 318
+G + ++ + LTYTP + NP +AF SV Y++G++ + V + V +
Sbjct: 213 FFGDGPYNLNQDVSKVLTYTPLITNPVSTAPSAFLGEPSVEYFIGVKSVKVSEKNVPLNT 272
Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
L+++++G GGT + + +T M +++ +AD FV ++LGA ++ + P
Sbjct: 273 TLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFV----------KSLGAPTVSPVAP 322
Query: 379 ---CF---DVPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV---TDR 428
CF D+ + G P + L + G E P+ ++V +CL V ++
Sbjct: 323 FGTCFATKDISFSRIGPGVPAIDLVLQNGVE--WPIIGANSMVQFDDVICLGFVDAGSNP 380
Query: 429 EAS------GGP----SIILGNFQMQNYYVEYDLRNQRLGFK 460
+AS GG SI +G Q++N +++DL RLGF+
Sbjct: 381 KASQVGFVNGGSHPVTSITIGAHQLENNLLKFDLAASRLGFR 422
>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
Length = 462
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 107/450 (23%), Positives = 174/450 (38%), Gaps = 63/450 (14%)
Query: 35 RFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLS 94
R NPS S + S+ H KNP ++TTT +G Y S+
Sbjct: 53 RVKANPSPSSAAQKSLFPYSAHIFQQHTKNPAALRSSTTTL-------GRKFGEYYTSIK 105
Query: 95 FGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSW 154
G+P Q I+DTGS L W C CK C+ S + S S + + C N +
Sbjct: 106 LGSPGQEAILIVDTGSELTWLKC---LPCKVCAPSVDTIYDAARSVSYKPVTCNNSQL-- 160
Query: 155 IHHESIQCRDCNDEPLATSKNCTQICP-SYLVLYGSG-LTEGIALSETLNLPNRI----- 207
C++ T C + + YG G + G ++TL + +
Sbjct: 161 ----------CSNSSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPV 210
Query: 208 -IPNFLVGCSV----LSSRQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTR 259
+ +F GC+ L +GI G GK +LP QL KFS+C +
Sbjct: 211 TVQDFAFGCAQGDLELVPTGASGILGLNAGKMALPMQLGQRFGWKFSHCF-PDRSSHLNS 269
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
T + N ++ + YT S +R +Y+V L+ +++ H+
Sbjct: 270 TGVVFFGNAELPHEQ----VQYTSVALTNSELQRK----FYHVALKGVSINS------HE 315
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYT-RALGAEALTGLRP 378
+ L R I+DSG++F+ P + +K+R + + L ++ L
Sbjct: 316 LVLLPR--GSVVILDSGSSFS----SFVRPFHSQLREAFLKHRPPSLKHLEGDSFGDLGT 369
Query: 379 CFDVPGEKTG----SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGP 434
CF V + + P L L F+ G + +P V + P
Sbjct: 370 CFKVSNDDIDELHRTLPSLSLVFEDGVTIGIPSIGVLLPVARYQNHVKMCFAFEDGGPNP 429
Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
++GN+Q QN +VEYD++ R+GF + C
Sbjct: 430 VNVIGNYQQQNLWVEYDIQRSRVGFARASC 459
>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 110/391 (28%), Positives = 166/391 (42%), Gaps = 61/391 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y +++ GTP + I DTGS + W C K C K P P S+S + +
Sbjct: 69 GDYVVTVGLGTPKKEFTLIFDTGSDITWTQC--EPCVKTCYKQKEPRLNPSTSTSYKNIS 126
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPS----YLVLYGSG-LTEGIALSETL 201
C + C + A+ K +Q C S Y V YG G + G +ETL
Sbjct: 127 CSSALCKLV---------------ASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETL 171
Query: 202 NL-PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLN---LDKFSYCLLSHKF 254
L + + NFL GC ++ AG+ G GR K +LPSQ FSYCL +
Sbjct: 172 TLSSSNVFKNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCLPAS-- 229
Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
++ L L S S K T P A+ ++ + +Y + + ++VGG++
Sbjct: 230 --SSSKGYLSLGGQVSKSVKFT-----------PLSADFDS-TPFYGLDITGLSVGGRQ- 274
Query: 315 RVWHKYLTLDRDG-NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
L++D + GT++DSGT T ++P + L+ F + M +Y G
Sbjct: 275 ------LSIDESAFSAGTVIDSGTVITRLSPTAYSELSSAFQNLMT---DYPSTSGYSI- 324
Query: 374 TGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
C+D T P++ + FKGG E+ + V V VCL + + S
Sbjct: 325 --FDTCYDFSKYDTVRIPKVGVTFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDS-- 380
Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ I GN Q + Y V YD R+GF C
Sbjct: 381 DTSIFGNVQQRTYQVVYDGAKGRVGFAPGGC 411
>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 446
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 105/417 (25%), Positives = 171/417 (41%), Gaps = 91/417 (21%)
Query: 81 ISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSS 140
+ S Y + ++ S G PP ++DTGS L W C + C CS +P F P SS
Sbjct: 85 VPSPRYVVFLMNFSIGEPPIPQLAVMDTGSSLTWVMC---HPCSSCSQQSVPIFDPSKSS 141
Query: 141 SSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSET 200
+ + ++ C +CN + + CP + GSG ++GI E
Sbjct: 142 T---------------YSNLSCSECNKCDVVNGE-----CPYSVEYVGSGSSQGIYAREQ 181
Query: 201 LNLPN-----RIIPNFLVGC----SVLSSRQP----AGIAGFGRGKTSLPSQLNLDKFSY 247
L L +P+ + GC S+ S+ P G+ G G G+ SL KFSY
Sbjct: 182 LTLETIDESIIKVPSLIFGCGRKFSISSNGYPYQGINGVFGLGSGRFSLLPSFG-KKFSY 240
Query: 248 CLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRI 307
C+ + + + + + L+L + ++ TT N + YYV L I
Sbjct: 241 CIGNLR-NTNYKFNRLVLGDKANMQGDSTT---------------LNVINGLYYVNLEAI 284
Query: 308 TVGGQRVRV----WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADE---------F 354
++GG+++ + + + +T D N G I+DSG T++ FE L+ E
Sbjct: 285 SIGGRKLDIDPTLFERSIT---DNNSGVIIDSGADHTWLTKYGFEVLSFEVENLLEGVLV 341
Query: 355 VSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVV 414
++Q K+ YT V + FP + HF GA + L V + F
Sbjct: 342 LAQQDKHNPYTLCYSG-----------VVSQDLSGFPLVTFHFAEGAVLDLDVTSMFIQT 390
Query: 415 GEGSAVCLTVV------TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
E + C+ ++ D E+ +G QNY V YDL R+ F++ C+
Sbjct: 391 TE-NEFCMAMLPGNYFGDDYESFSS----IGMLAQQNYNVGYDLNRMRVYFQRIDCE 442
>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 480
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 103/403 (25%), Positives = 168/403 (41%), Gaps = 62/403 (15%)
Query: 83 SHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSS 140
+ S G Y + G+PP+ +DTGS ++W C +C + IP + K SS
Sbjct: 71 ADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKASS 130
Query: 141 SSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLT-EGIALSE 199
+S+ +GC++ CS+I +Q C K C SY V+YG G T +G + +
Sbjct: 131 TSKNVGCEDAFCSFI----MQSETC-----GAKKPC-----SYHVVYGDGSTSDGDFVKD 176
Query: 200 TLNLPN-----RIIP---NFLVGCSVLSSRQPA-------GIAGFGRGKTSLPSQLNLDK 244
+ L R P + GC S Q GI GFG+ TS+ SQL
Sbjct: 177 NITLDQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTESAVDGIMGFGQSNTSVISQLAAGG 236
Query: 245 FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGL 304
+ SH D+ + + ++ + TP V N V+Y V L
Sbjct: 237 SVKRIFSHCLDNMNGGGIFAI------GEVESPVVKTTPLVPN---------QVHYNVIL 281
Query: 305 RRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVS-QMVKNRN 363
+ + V G+ + + + +G+GGTI+DSGTT ++ L+ L ++ + Q VK
Sbjct: 282 KGMDVDGEPIDLPPSLAS--TNGDGGTIIDSGTTLAYLPQNLYNSLIEKITAKQQVKLHM 339
Query: 364 YTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLT 423
CF +FP + LHF+ ++++ +Y + E C
Sbjct: 340 VQETFA---------CFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYLFSLRE-DMYCFG 389
Query: 424 VVTDREAS--GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ + G I+LG+ + N V YDL N+ +G+ C
Sbjct: 390 WQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEVIGWADHNC 432
>gi|358347314|ref|XP_003637703.1| Basic 7S globulin [Medicago truncatula]
gi|355503638|gb|AES84841.1| Basic 7S globulin [Medicago truncatula]
Length = 454
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 105/417 (25%), Positives = 163/417 (39%), Gaps = 67/417 (16%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
YS S+ GTP + ++D +WF C + Y S++ + C
Sbjct: 50 YSTSIKLGTPAVPLDLVIDIRERFLWFECDDSYN----------------STTYNPIQCG 93
Query: 149 NPKCSWIHHESIQCRDCNDEPLAT--SKNCTQICPSYLVLYGSGLTEGIALSETLNLP-- 204
KC C DC + P T + N + P +G G + L+ P
Sbjct: 94 TKKCK--QARGTGCIDCTNHPFKTGCTNNTCGVEP--FNPFGGFFVSGDVGEDILSFPRV 149
Query: 205 --------NRIIPNFLVGCSVLS-----------SRQPAGIAGFGRGKTSLPSQL----N 241
N +P F+ C S+ G+ G R SLP+Q+
Sbjct: 150 TSDGRRVTNVRVPRFISSCVYPDKFGVQGFLEGLSKGKKGVLGLARTLISLPTQIATRFK 209
Query: 242 LD-KFSYCLLSHKFDDTTRTSSLILDNG----SSHSDKKTTGLTYTPFVNNPSVAE---R 293
LD KF+ CL S + SL + G S+ D + L YTP + N
Sbjct: 210 LDRKFTLCLPSTSQKNGLGPGSLFVGGGPYNLGSNKDDASKFLKYTPLITNRRSTGPIFD 269
Query: 294 NAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADE 353
N S Y++ ++ I V V L++++ G GGT + + T + ++ PL +
Sbjct: 270 NFPSTEYFIKVKSIKVDNNVVNFNTTLLSINKLGEGGTKLSTVIPHTTLHTSIYNPLLNA 329
Query: 354 FVSQMVKNRNYTRALGAEALTGLRPCFD---VPGEKTG-SFPELKLHFKGGAEVTLPVEN 409
FV + + R R +A+ CFD + G + P + L KGG E + N
Sbjct: 330 FVKK-AEIRKIKR---VKAVAPFGACFDSRTISKSVNGPNVPTIDLVLKGGVEWRIFGAN 385
Query: 410 YFAVVGEGSAVCLTVVTDREASGGP---SIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
V E + +CL V GP SII+G Q+++ VE+DL + +LGF L
Sbjct: 386 SMVKVNE-NVLCLGFVDAGSEEVGPSATSIIIGGHQLEDNLVEFDLVSSKLGFSSSL 441
>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
Length = 423
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 104/401 (25%), Positives = 166/401 (41%), Gaps = 59/401 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSS 141
G Y + G P + +DTGS ++W C+ C C +S ++ SF P SS+
Sbjct: 3 GLYFTRVKLGNPAKEFFVQIDTGSDILWVTCS---PCTGCPTSSGLNIQLESFNPDSSST 59
Query: 142 SRLLGCQNPKCS--WIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALS 198
+ + C + +C+ + E+I C+ N + S C Y YG G T G +S
Sbjct: 60 ASRITCSDDRCTAGFQTGEAI-CQTSNSQ----SSPC-----GYTFTYGDGSGTSGYYVS 109
Query: 199 ETL----NLPNRIIPN----FLVGCS-------VLSSRQPAGIAGFGRGKTSLPSQLNLD 243
+T+ + N N + GCS + R GI GFG+ + S+ SQLN
Sbjct: 110 DTMFFETVMGNEQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSL 169
Query: 244 KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVG 303
S + SH + +++ + GL YTP V PS +Y +
Sbjct: 170 GVSPKVFSHCLKGSDNGGGILV-----LGEIVEPGLVYTPLV--PS-------QPHYNLN 215
Query: 304 LRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRN 363
L I V GQ++ + T GTIVDSGTT ++A ++P + + +
Sbjct: 216 LESIAVNGQKLPIDSSLFTTSN--TQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVR 273
Query: 364 YTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLT 423
+ G++ CF SFP + L+F GG +++ ENY L
Sbjct: 274 SLVSKGSQ-------CFITSSSVDSSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLW 326
Query: 424 VVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ + G ILG+ +++ YDL N R+G+ C
Sbjct: 327 CIGWQRNQGQEITILGDLVLKDKIFVYDLANMRMGWADYDC 367
>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 407
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 105/391 (26%), Positives = 157/391 (40%), Gaps = 61/391 (15%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + LS GTPP I DTGS LVWF C C C + P F P+ SSS + C
Sbjct: 60 YLMELSIGTPPIKIYAEADTGSDLVWFQCI---PCTKCYKQQNPMFDPRSSSSYTNITCG 116
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR-- 206
C+ + D L ++ T C + +T+G+ ETL L +
Sbjct: 117 TESCNKL-----------DSSLCSTDQKT--CNYTYSYADNSITQGVLAQETLTLTSTTG 163
Query: 207 ---IIPNFLVGC----SVLSSRQPAGIAGFGRGKTSLPSQLNL------DKFSYCLLSHK 253
+ GC S + R+ G+ G GRG SL SQ+ + FS CL+
Sbjct: 164 EPVAFQGIIFGCGHNNSGFNDRE-MGLIGLGRGPLSLISQIGSSLGAGGNMFSQCLVPFN 222
Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
D + TS + G S+ G TP ++ Y+ L I+V
Sbjct: 223 TDPSI-TSQMNFGKG---SEVLGNGTVSTPLISKDGTG--------YFATLLGISVEDIN 270
Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
+ + +L G ++DSGTT T++ E + L ++ V+N+ AL +
Sbjct: 271 LP-FSNGSSLGTITKGNILIDSGTTITYLPEEFYHRLIEQ-----VRNK---VALEPFRI 321
Query: 374 TGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
G C+ P G P L +HF+GG + P + + V + C V E
Sbjct: 322 DGYELCYQTPTNLNG--PTLTIHFEGGDVLLTPAQMFIPV--QDDNFCFAVFDTNEE--- 374
Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ GN+ NY + +DL Q + FK C
Sbjct: 375 -YVTYGNYAQSNYLIGFDLERQVVSFKATDC 404
>gi|218184944|gb|EEC67371.1| hypothetical protein OsI_34484 [Oryza sativa Indica Group]
Length = 396
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 100/387 (25%), Positives = 159/387 (41%), Gaps = 50/387 (12%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + + GTPPQ I+D LVW C+ C+ C +P F+P SS+ + C
Sbjct: 45 YVANFTIGTPPQPASAIVDVAGELVWTQCS---ACRRCFKQDLPVFVPNASSTFKPEPCG 101
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
C ESI R C+ + + TQ+ G T G A ++T + +
Sbjct: 102 TAVC-----ESIPTRSCSGDVCSYKGPPTQL---------RGNTSGFAATDTFAIGTATV 147
Query: 209 PNFLVGCSVLSSRQ----PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLI 264
GC V S P+G G GR SL +Q+ L +FSYCL +T ++S L
Sbjct: 148 -RLAFGCVVASDIDTMDGPSGFIGLGRTPWSLVAQMKLTRFSYCLSPR---NTGKSSRLF 203
Query: 265 LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLD 324
L GSS + + PF+ + + S YY + L I G +
Sbjct: 204 L--GSSAKLAGSESTSTAPFIKT---SPDDDGSNYYLLSLDAIRAGNTTIATAQ------ 252
Query: 325 RDGNGGTIV-DSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF-DV 382
+GG +V + + F+ + ++ + V++ V L CF
Sbjct: 253 ---SGGILVMHTVSPFSLLVDSAYKAF-KKAVTEAVGGAAAPPMATPPQPFDL--CFKKA 306
Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVG-EGSAVCLTVVT----DREASGGPSII 437
G + P+L F+G A +T+P Y VG E C +++ +R G S +
Sbjct: 307 AGFSRATAPDLVFTFQGAAALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVS-V 365
Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
LG+ Q ++ + YDL+ + L F+ C
Sbjct: 366 LGSLQQEDVHFLYDLKKETLSFEPADC 392
>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 476
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 109/386 (28%), Positives = 161/386 (41%), Gaps = 52/386 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + G+PP+ ++D+GS +VW C C C P F P S++ +
Sbjct: 135 GEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQ---PCSECYQQSDPVFDPAGSATYAGIS 191
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
C + C + + CND C Y V YG G T G ETL
Sbjct: 192 CDSSVCDRLDNAG-----CND------GRC-----RYEVSYGDGSYTRGTLALETLTFGR 235
Query: 206 RIIPNFLVGCSVLSSRQ---PAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTR 259
+I N +GC ++ AG+ G G G S QL FSYCL+S T
Sbjct: 236 VLIRNIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRG---TES 292
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
T +L G+ G + P + NP +YYVGL + VGG RV + +
Sbjct: 293 TGTLEFGRGA-----MPVGAAWVPLIRNPRAPS------FYYVGLSGLGVGGIRVPIPEQ 341
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
L G GG ++D+GT T + +E D F+ Q N R ++ ++ C
Sbjct: 342 IFELTDLGYGGVVMDTGTAVTRLPAPAYEAFRDTFIGQTA---NLPR---SDRVSIFDTC 395
Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYF-AVVGEGSAVCLTVVTDREASGGPSIIL 438
+++ G + P + +F GG +TLP N+ V GEG+ C AS I+
Sbjct: 396 YNLNGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDGEGT-FCFAFA----ASASGLSII 450
Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
GN Q + + D N +GF +C
Sbjct: 451 GNIQQEGIQISIDGSNGFVGFGPTIC 476
>gi|159464048|ref|XP_001690254.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158284242|gb|EDP09992.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 485
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 106/400 (26%), Positives = 167/400 (41%), Gaps = 65/400 (16%)
Query: 82 SSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSS 141
+ HSY + +L GTP + I+DTGS + + PC + C +C F P S++
Sbjct: 8 TRHSY--FYTTLKLGTPERTFSVIIDTGSTITYIPCKD---CSHCGKHTAEWFDPDKSTT 62
Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETL 201
++ L C +P C+ + C CN++ S+ + S EG + +T
Sbjct: 63 AKKLACGDPLCNC---GTPSCT-CNNDRCYYSRTYAERSSS----------EGWMIEDTF 108
Query: 202 NLPNRIIPNFLV-GCSVLSS----RQPA-GIAGFGRGKTSLPSQLNL-----DKFSYCLL 250
P+ P LV GC + RQ A GI G G + SQL D FS C
Sbjct: 109 GFPDSDSPVRLVFGCENGETGEIYRQMADGIMGMGNNHNAFQSQLVQRKVIEDVFSLC-F 167
Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
+ D + L G++ YTP + + + YY V + ITV
Sbjct: 168 GYPKDGILLLGDVTLPEGAN--------TVYTPLLTHLHLH-------YYNVKMDGITVN 212
Query: 311 GQRVRVWHKYLTLDR---DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRA 367
GQ L D D GT++DSGTTFT++ + F+ +A + V V+ +
Sbjct: 213 GQT-------LAFDASVFDRGYGTVLDSGTTFTYLPTDAFKAMA-KAVGDYVEKKGLQST 264
Query: 368 LGAEALTG---LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTV 424
GA+ + D + FP + F GGA++TLP Y + + + CL +
Sbjct: 265 PGADPQYNDICWKGAPDQFKDLDKYFPPAEFVFGGGAKLTLPPLRYL-FLSKPAEYCLGI 323
Query: 425 VTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ G ++G +++ V YD RN ++GF C
Sbjct: 324 FDN----GNSGALVGGVSVRDVVVTYDRRNSKVGFTTMAC 359
>gi|147771308|emb|CAN69536.1| hypothetical protein VITISV_043237 [Vitis vinifera]
Length = 372
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 105/382 (27%), Positives = 155/382 (40%), Gaps = 52/382 (13%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + GTP Q + +DT S + W PC C CSS+ F S++ + LGCQ
Sbjct: 36 YIVRAKIGTPAQTMLMAMDTSSDVAWIPCNG---CLGCSSTL---FNSPASTTYKSLGCQ 89
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
+C + + C +C L GS L ++ +T+ L +
Sbjct: 90 AAQCKQVPKPT-----CGGG----------VCSFNLTYGGSSLAANLS-QDTITLATDAV 133
Query: 209 PNFLVGC------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSS 262
P + GC L ++ G+ S L FSYCL S F + S
Sbjct: 134 PGYSFGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPS--FKSLNFSGS 191
Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
L L G K+ + YTP + NP R + Y+V L + VG + V V T
Sbjct: 192 LRL--GPVGQPKR---IKYTPLLKNP---RRPSL---YFVNLMAVRVGRRVVDVPPGSFT 240
Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
+ GTI DSGT FT + + + D F +++ +N L +L G C+ V
Sbjct: 241 FNPSTGAGTIFDSGTVFTRLVTPAYIAVRDAFRNRVGRN------LTVTSLGGFDTCYTV 294
Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
P P + F G VTLP +N GS CL + + ++ N Q
Sbjct: 295 PIAA----PTITFMFTG-MNVTLPPDNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIANLQ 349
Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
QN+ + YD+ N RLG ++LC
Sbjct: 350 QQNHRLLYDVPNSRLGVARELC 371
>gi|449457263|ref|XP_004146368.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 469
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 108/394 (27%), Positives = 169/394 (42%), Gaps = 47/394 (11%)
Query: 88 GYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGC 147
G+ ++LS G+PP ++DTGS L+W C C C F P S S + LGC
Sbjct: 103 GFLVNLSIGSPPVTQLVVVDTGSSLLWVQCL---PCINCFQQSTSWFDPLKSVSFKTLGC 159
Query: 148 QNPKCSWIHHESIQCRDCNDEPLAT---SKNCTQICPSYLVLYGSGLTEGI-----ALSE 199
P ++I+ +C N + +Q + L L EG A+S
Sbjct: 160 GFPGYNYIN--GYKCNRFNQAEYKLRYLGGDSSQGILAKESLLFETLDEGRVFQYNAIST 217
Query: 200 TLNLPNRIIPNFLVGC---SVLSSRQPAGIAGFGRGK---TSLPSQLNLDKFSYCLLSHK 253
++ + N GC ++ ++ A FG G ++ +QL +KFSYC+
Sbjct: 218 QISKIKK--SNITFGCGHMNIKTNNDDAYNGVFGLGAYPHITMATQLG-NKFSYCI--GD 272
Query: 254 FDDTTRTSS-LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
++ T + L+L GS ++ S + F +YYV L+ I+VG +
Sbjct: 273 INNPLYTHNHLVLGQGS--------------YIEGDSTPLQIHFG-HYYVTLQSISVGSK 317
Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
+++ + DG+GG ++DSG T+T +A FE L DE V M R
Sbjct: 318 TLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLM--KGLLERIPTQRK 375
Query: 373 LTGLRPCFD-VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREAS 431
GL CF V FP + HF GGA++ L + F G G CL ++
Sbjct: 376 FEGL--CFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHG-GDRFCLAILPSNSEL 432
Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
S+I G QNY V +DL ++ F++ C+
Sbjct: 433 LNLSVI-GILAQQNYNVGFDLEQMKVFFRRIDCQ 465
>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 486
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 104/405 (25%), Positives = 168/405 (41%), Gaps = 64/405 (15%)
Query: 84 HSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIP-SFIPKL-SSS 141
+S G Y + GTPP+ +DTGS ++W C C S I +F + SS+
Sbjct: 73 NSVGLYYTKVKMGTPPKEFNVQIDTGSDILWVNCNTCSNCPQSSQLGIELNFFDTVGSST 132
Query: 142 SRLLGCQNPKC-SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-------LTE 193
+ L+ C +P C S + + +C S Q SY YG G +++
Sbjct: 133 AALIPCSDPICTSRVQGAAAEC----------SPRVNQC--SYTFQYGDGSGTSGYYVSD 180
Query: 194 GIALSETLNLPNRI--IPNFLVGCSVLSS-------RQPAGIAGFGRGKTSLPSQLNL-- 242
+ S + P + + GCS+ S + GI GFG G S+ SQL+
Sbjct: 181 AMYFSLIMGQPPAVNSSATIVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSRG 240
Query: 243 ---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVY 299
FS+CL IL+ + Y+P V PS +
Sbjct: 241 ITPKVFSHCLKGDGDGGGVLVLGEILE----------PSIVYSPLV--PS-------QPH 281
Query: 300 YYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMV 359
Y + L+ I V GQ + + ++ + GGTIVD GTT ++ E ++PL + +
Sbjct: 282 YNLNLQSIAVNGQLLPINPAVFSISNN-RGGTIVDCGTTLAYLIQEAYDPLVTAINTAVS 340
Query: 360 KNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSA 419
++ T + G + C+ V FP + L+F+GGA + L E Y G
Sbjct: 341 QSARQTNSKGNQ-------CYLVSTSIGDIFPSVSLNFEGGASMVLKPEQYLMHNGYLDG 393
Query: 420 VCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ + ++ G S ILG+ +++ V YD+ QR+G+ C
Sbjct: 394 AEMWCIGFQKFQEGAS-ILGDLVLKDKIVVYDIAQQRIGWANYDC 437
>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 639
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 118/483 (24%), Positives = 202/483 (41%), Gaps = 71/483 (14%)
Query: 1 MASYISALCLSFIFFFTLLSIFPSSITSLTFSLSR------FHTNPSQDSYQNLNSLVSS 54
M SY + L S + F L I SS+ S F + +P+ S++ V
Sbjct: 1 MNSYSATLLCSLLGFNLLAVILSSSVDSRDFDYQQRSVILPLFISPTNSSHRR----VLD 56
Query: 55 SLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVW 114
R H++N ++ ++ ++ Y Y+ L G+PPQ I+DTGS + +
Sbjct: 57 RDHRLRHLQNLVKPHSSNARMRLHDDLLTNGY--YTTRLWIGSPPQEFALIVDTGSTVTY 114
Query: 115 FPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSK 174
PC+N C C + + P F P+LSS+ + + C N C+ +QC +
Sbjct: 115 VPCSN---CVQCGNHQDPRFQPELSSTYQPVKC-NADCN-CDENGVQC--------TYER 161
Query: 175 NCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSS-----RQPAGIAGF 229
++ S VL ++ G E+ +P R + GC + S ++ GI G
Sbjct: 162 RYAEMSTSSGVLAEDVMSFG---KESELVPQRAV----FGCETMESGDLYTQRADGIMGL 214
Query: 230 GRGKTSLPSQL---NLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVN 286
GRG S+ QL + S+ L D +++L SS G+ ++ +
Sbjct: 215 GRGTLSVMDQLVGKGVVSNSFSLCYGGMD--VGGGAMVLGGISS-----PPGMVFSH--S 265
Query: 287 NPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPEL 346
+PS S YY + L+ I V G+ +++ + DG G I+DSGTT+ + +
Sbjct: 266 DPSR------SPYYNIELKEIHVAGKPLKLNPRTF----DGKYGAILDSGTTYAYFPEKA 315
Query: 347 FEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPG----EKTGSFPELKLHFKGGAE 402
+ D ++K ++ + + CF G E FPE+ + F G +
Sbjct: 316 YYAFKD----AIMKKISFLKQISGPDPNFKDICFSGAGRDVTELPKVFPEVDMVFANGQK 371
Query: 403 VTLPVENY-FAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQ 461
++L ENY F A CL + + + +LG ++N V Y+ N +GF +
Sbjct: 372 ISLSPENYLFRHTKVSGAYCLGIFKN---GNDQTTLLGGIIVRNTLVTYNRENSTIGFWK 428
Query: 462 QLC 464
C
Sbjct: 429 TNC 431
>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
Length = 435
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 104/391 (26%), Positives = 155/391 (39%), Gaps = 44/391 (11%)
Query: 90 SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQN 149
++SL+ GTPPQ + +LDTGS L W C + S F P+ S++ + C +
Sbjct: 62 TVSLAVGTPPQNVTMVLDTGSELSWLLCATGRAAAAAADS----FRPRASATFAAVPCGS 117
Query: 150 PKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIP 209
+CS RD P + ++ C L ++G ++ + +
Sbjct: 118 ARCS--------SRDLPAPP--SCDAASRRCRVSLSYADGSASDGALATDVFAVGDAPPL 167
Query: 210 NFLVGC-SVLSSRQP-----AGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSL 263
GC S P AG+ G RG S +Q + +FSYC+ D L
Sbjct: 168 RSAFGCMSAAYDSSPDAVATAGLLGMNRGALSFVTQASTRRFSYCI-----SDRDDAGVL 222
Query: 264 ILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTL 323
+L HSD L YTP P+ V Y V L I VGG+ + + L
Sbjct: 223 LL----GHSDLPFLPLNYTPLYQ-PTPPLPYFDRVAYSVQLLGIRVGGKPLPIPPSVLAP 277
Query: 324 DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT---GLRPCF 380
D G G T+VDSGT FTF+ + + + EF+ Q + AL + CF
Sbjct: 278 DHTGAGQTMVDSGTQFTFLLGDAYSAVKAEFLKQ---TKPLLPALEDPSFAFQEAFDTCF 334
Query: 381 DVPGEK---TGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAV----CLTVVTDREASGG 433
VP + + P + L F G + V GE CLT + +
Sbjct: 335 RVPKGRPPPSARLPPVTLLFNGAQMSVAGDRLLYKVPGERRGADGVWCLT-FGNADMVPL 393
Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ ++G+ N +VEYDL R+G C
Sbjct: 394 TAYVIGHHHQMNLWVEYDLERGRVGLAPVKC 424
>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 609
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 118/483 (24%), Positives = 202/483 (41%), Gaps = 71/483 (14%)
Query: 1 MASYISALCLSFIFFFTLLSIFPSSITSLTFSLSR------FHTNPSQDSYQNLNSLVSS 54
M SY + L S + F L I SS+ S F + +P+ S++ V
Sbjct: 1 MNSYSATLLCSLLGFNLLAVILSSSVDSRDFDYQQRSVILPLFISPTNSSHRR----VLD 56
Query: 55 SLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVW 114
R H++N ++ ++ ++ Y Y+ L G+PPQ I+DTGS + +
Sbjct: 57 RDHRLRHLQNLVKPHSSNARMRLHDDLLTNGY--YTTRLWIGSPPQEFALIVDTGSTVTY 114
Query: 115 FPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSK 174
PC+N C C + + P F P+LSS+ + + C N C+ +QC +
Sbjct: 115 VPCSN---CVQCGNHQDPRFQPELSSTYQPVKC-NADCN-CDENGVQC--------TYER 161
Query: 175 NCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSS-----RQPAGIAGF 229
++ S VL ++ G E+ +P R + GC + S ++ GI G
Sbjct: 162 RYAEMSTSSGVLAEDVMSFG---KESELVPQRAV----FGCETMESGDLYTQRADGIMGL 214
Query: 230 GRGKTSLPSQL---NLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVN 286
GRG S+ QL + S+ L D +++L SS G+ ++ +
Sbjct: 215 GRGTLSVMDQLVGKGVVSNSFSLCYGGMD--VGGGAMVLGGISS-----PPGMVFSH--S 265
Query: 287 NPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPEL 346
+PS S YY + L+ I V G+ +++ + DG G I+DSGTT+ + +
Sbjct: 266 DPSR------SPYYNIELKEIHVAGKPLKLNPRTF----DGKYGAILDSGTTYAYFPEKA 315
Query: 347 FEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPG----EKTGSFPELKLHFKGGAE 402
+ D ++K ++ + + CF G E FPE+ + F G +
Sbjct: 316 YYAFKD----AIMKKISFLKQISGPDPNFKDICFSGAGRDVTELPKVFPEVDMVFANGQK 371
Query: 403 VTLPVENY-FAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQ 461
++L ENY F A CL + + + +LG ++N V Y+ N +GF +
Sbjct: 372 ISLSPENYLFRHTKVSGAYCLGIFKN---GNDQTTLLGGIIVRNTLVTYNRENSTIGFWK 428
Query: 462 QLC 464
C
Sbjct: 429 TNC 431
>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
Length = 463
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 105/386 (27%), Positives = 160/386 (41%), Gaps = 65/386 (16%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y +++ GTP + +P I DTGS L+W C CK C K+P F P S+S + L C
Sbjct: 132 YIVNVGIGTPKKEMPLIFDTGSGLIWTQCK---PCKAC-YPKVPVFDPTKSASFKGLPCS 187
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLY-GSGLTEGIALSETLNLPNRI 207
+ C I +S CT YL Y + + G +ET++ +
Sbjct: 188 SKLCQSIRQG------------CSSPKCT-----YLTAYVDNSSSTGTLATETISFSHLK 230
Query: 208 --IPNFLVGCSVLSSRQ---PAGIAGFGRGKTSLPSQLN--LDK-FSYCLLSHKFDDTTR 259
N L+GCS S + +GI G R SL SQ DK FSYC+ S T
Sbjct: 231 YDFKNILIGCSDQVSGESLGESGIMGLNRSPISLASQTANIYDKLFSYCIPS-----TPG 285
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
++ + G +D + ++P + A S Y + + I+VGG+++ +
Sbjct: 286 STGHLTFGGKVPND-----VRFSP-------VSKTAPSSDYDIKMTGISVGGRKLLIDAS 333
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
+ + +DSG T + P+ + L F M + Y L + L C
Sbjct: 334 AFKI------ASTIDSGAVLTRLPPKAYSALRSVFREMM---KGYP-LLDQDDF--LDTC 381
Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT-DREASGGPSIIL 438
+D T + P + + F+GG E+ + V V CL D E S I
Sbjct: 382 YDFSNYSTVAIPSISVFFEGGVEMDIDVSGIMWQVPGSKVYCLAFAELDDEVS-----IF 436
Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
GNFQ + Y V +D +R+GF C
Sbjct: 437 GNFQQKTYTVVFDGAKERIGFAPGGC 462
>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 109/390 (27%), Positives = 164/390 (42%), Gaps = 73/390 (18%)
Query: 95 FGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSW 154
GTPPQ I+DTGS + + PC + C C + + P F P LS + + C NP C+
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCNS---CDQCGNHQDPKFQPDLSDTYHPVKC-NPDCT- 56
Query: 155 IHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVG 214
E+ QC + A + + I LV +G+ +SE P R + G
Sbjct: 57 CDTENDQC--TYERQYAEMSSSSGILGEDLVSFGN-------MSEL--KPQRAV----FG 101
Query: 215 CS-----VLSSRQPAGIAGFGRGKTSLPSQLNL-----DKFSYCLLSHKFDDTTRTSSLI 264
C L S+ GI G GRG S+ QL D FS C + +++
Sbjct: 102 CENAETGDLFSQHADGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGME----VGGGAMV 157
Query: 265 LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLD 324
L S SD V + S +R S YY + LR + V G+++ + +
Sbjct: 158 LGQISPPSD----------MVFSHSDPDR---SPYYNIELRGLHVAGKKLDINPQVF--- 201
Query: 325 RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP-----C 379
DG GTI+DSGTT+ ++ F P S++ G + + G P C
Sbjct: 202 -DGKHGTILDSGTTYAYLPEAAFLPFIQAITSEL---------HGLKQIRGPDPNYNDVC 251
Query: 380 FDVPG----EKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVVTDREASGGP 434
F G E +FP + + F G + +L ENY F A CL V + + P
Sbjct: 252 FSGAGSEIPELYKTFPSVDMVFDNGEKYSLSPENYLFKHSKVHGAYCLGVFQNGK---DP 308
Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ +LG ++N V YD + ++GF + C
Sbjct: 309 TTLLGGIVVRNTLVTYDREHSKVGFWKTNC 338
>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 509
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 104/401 (25%), Positives = 166/401 (41%), Gaps = 59/401 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSS 141
G Y + G P + +DTGS ++W C+ C C +S ++ SF P SS+
Sbjct: 89 GLYFTRVKLGNPAKEFFVQIDTGSDILWVTCS---PCTGCPTSSGLNIQLESFNPDSSST 145
Query: 142 SRLLGCQNPKCS--WIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALS 198
+ + C + +C+ + E+I C+ N + S C Y YG G T G +S
Sbjct: 146 ASRITCSDDRCTAGFQTGEAI-CQTSNSQ----SSPC-----GYTFTYGDGSGTSGYYVS 195
Query: 199 ETL----NLPNRIIPN----FLVGCS-------VLSSRQPAGIAGFGRGKTSLPSQLNLD 243
+T+ + N N + GCS + R GI GFG+ + S+ SQLN
Sbjct: 196 DTMFFETVMGNEQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSL 255
Query: 244 KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVG 303
S + SH + +++ + GL YTP V PS +Y +
Sbjct: 256 GVSPKVFSHCLKGSDNGGGILV-----LGEIVEPGLVYTPLV--PS-------QPHYNLN 301
Query: 304 LRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRN 363
L I V GQ++ + T GTIVDSGTT ++A ++P + + +
Sbjct: 302 LESIAVNGQKLPIDSSLFTTSN--TQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVR 359
Query: 364 YTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLT 423
+ G++ CF SFP + L+F GG +++ ENY L
Sbjct: 360 SLVSKGSQ-------CFITSSSVDSSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLW 412
Query: 424 VVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ + G ILG+ +++ YDL N R+G+ C
Sbjct: 413 CIGWQRNQGQEITILGDLVLKDKIFVYDLANMRMGWADYDC 453
>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
Length = 468
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 108/402 (26%), Positives = 171/402 (42%), Gaps = 64/402 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
G Y L GTPP+ +DTGS ++W C + C S IP F P S ++ L
Sbjct: 50 GLYYTRLQLGTPPRDFYVQIDTGSDVLWVSCGSCNGCPVNSGLHIPLNFFDPGSSPTASL 109
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLN- 202
+ C + +CS S + + +++N +C Y YG G T G +S+ L+
Sbjct: 110 ISCSDQRCSLGLQSS--------DSVCSAQN--NLC-GYNFQYGDGSGTSGYYVSDLLHF 158
Query: 203 ---LPNRIIPN----FLVGCSVL-------SSRQPAGIAGFGRGKTSLPSQLNLD----- 243
L ++ N + GCS L S R GI GFG+ S+ SQL
Sbjct: 159 DTVLGGSVMNNSSAPIVFGCSALQTGDLTKSDRAVDGIFGFGQQDMSVVSQLASQGISPR 218
Query: 244 KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVG 303
FS+CL K DD+ L+L + + YTP V PS +Y +
Sbjct: 219 AFSHCL---KGDDSG-GGILVL------GEIVEPNIVYTPLV--PS-------QPHYNLN 259
Query: 304 LRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN-R 362
++ I+V GQ + + + GTI+DSGTT ++A ++P S + + R
Sbjct: 260 MQSISVNGQTLAIDPS--VFGTSSSQGTIIDSGTTLAYLAEAAYDPFISAITSIVSPSVR 317
Query: 363 NYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCL 422
Y L+ C+ + FP++ L+F GGA + L ++Y L
Sbjct: 318 PY--------LSKGNHCYLISSSINDIFPQVSLNFAGGASMILIPQDYLIQQSSIGGAAL 369
Query: 423 TVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ ++ G ILG+ +++ YD+ NQR+G+ C
Sbjct: 370 WCIGFQKIQGQGITILGDLVLKDKIFVYDIANQRIGWANYDC 411
>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 507
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 104/401 (25%), Positives = 166/401 (41%), Gaps = 59/401 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSS 141
G Y + G P + +DTGS ++W C+ C C +S ++ SF P SS+
Sbjct: 87 GLYFTRVKLGNPAKEFFVQIDTGSDILWVTCS---PCTGCPTSSGLNIQLESFNPDSSST 143
Query: 142 SRLLGCQNPKCS--WIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALS 198
+ + C + +C+ + E+I C+ N + S C Y YG G T G +S
Sbjct: 144 ASRITCSDDRCTAGFQTGEAI-CQTSNSQ----SSPC-----GYTFTYGDGSGTSGYYVS 193
Query: 199 ETL----NLPNRIIPN----FLVGCS-------VLSSRQPAGIAGFGRGKTSLPSQLNLD 243
+T+ + N N + GCS + R GI GFG+ + S+ SQLN
Sbjct: 194 DTMFFETVMGNEQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSL 253
Query: 244 KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVG 303
S + SH + +++ + GL YTP V PS +Y +
Sbjct: 254 GVSPKVFSHCLKGSDNGGGILV-----LGEIVEPGLVYTPLV--PS-------QPHYNLN 299
Query: 304 LRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRN 363
L I V GQ++ + T GTIVDSGTT ++A ++P + + +
Sbjct: 300 LESIAVNGQKLPIDSSLFTTSN--TQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVR 357
Query: 364 YTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLT 423
+ G++ CF SFP + L+F GG +++ ENY L
Sbjct: 358 SLVSKGSQ-------CFITSSSVDSSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLW 410
Query: 424 VVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ + G ILG+ +++ YDL N R+G+ C
Sbjct: 411 CIGWQRNQGQEITILGDLVLKDKIFVYDLANMRMGWADYDC 451
>gi|222632750|gb|EEE64882.1| hypothetical protein OsJ_19741 [Oryza sativa Japonica Group]
Length = 456
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 113/391 (28%), Positives = 155/391 (39%), Gaps = 67/391 (17%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + GTP +LDTGS +VW P +P + + S
Sbjct: 120 GEYFAQVGVGTPATTALMVLDTGSDVVWAPV-----------RALPPLLRAVRQGSSTGA 168
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPN 205
P W I CR + +N Y V YG G +T G SETL
Sbjct: 169 APAPTPRWNCVAPI-CRRLDSAGCDRRRNSCL----YQVAYGDGSVTAGDFASETLTFAR 223
Query: 206 RI-IPNFLVGCSVLSSRQPAGIAG-----FGRGKTSLPSQLNLD---KFSYCLLSHKFDD 256
+ +GC + IA GRG+ S PSQ+ FSYCL+
Sbjct: 224 GARVQRVAIGCG--HDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLV------ 275
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR- 315
D SS + + TP + +YYV L +VGG RV+
Sbjct: 276 ---------DRTSSRRARPSRRWGGTP-----------RMATFYYVHLLGFSVGGARVKG 315
Query: 316 VWHKYLTLD-RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
V L L+ G GG I+DSGT+ T +A ++E + D F + V R + +
Sbjct: 316 VSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLR-----VSPGGFS 370
Query: 375 GLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREASGG 433
C+++ G + P + +H GGA V LP ENY V C + TD GG
Sbjct: 371 LFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTD----GG 426
Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
SII GN Q Q + V +D QR+GF + C
Sbjct: 427 VSII-GNIQQQGFRVVFDGDAQRVGFVPKSC 456
>gi|356542694|ref|XP_003539801.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 107/402 (26%), Positives = 172/402 (42%), Gaps = 63/402 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIP--SFIPKLSSSSRL 144
G Y + GTPP+ +DTGS ++W C + C S +I F P+ SS+S L
Sbjct: 75 GLYYTKVKLGTPPREFYVQIDTGSDVLWVSCGSCNGCPQTSGLQIQLNYFDPRSSSTSSL 134
Query: 145 LGCQNPKC-SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLN 202
+ C + +C S + C N++ CT Y YG G T G +S+ ++
Sbjct: 135 ISCSDRRCRSGVQTSDASCSSQNNQ-------CT-----YTFQYGDGSGTSGYYVSDLMH 182
Query: 203 --------LPNRIIPNFLVGCSVL-------SSRQPAGIAGFGRGKTSLPSQLNLDKFSY 247
L + + GCS+L S R GI GFG+ S+ SQL+L +
Sbjct: 183 FAGIFEGTLTTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSLQGIAP 242
Query: 248 CLLSHKFD-DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
+ SH D + L+L + + Y+P V + +Y + L+
Sbjct: 243 RVFSHCLKGDNSGGGVLVL------GEIVEPNIVYSPLVQSQP---------HYNLNLQS 287
Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
I+V GQ V + N GTIVDSGTT ++A E + P + + + ++
Sbjct: 288 ISVNGQIVPIAPAVFATSN--NRGTIVDSGTTLAYLAEEAYNPFVNAITALVPQSVRSVL 345
Query: 367 ALGAEALTGLRPCFDV-PGEKTGSFPELKLHFKGGAEVTLPVENYFA---VVGEGSAVCL 422
+ G + C+ + FP++ L+F GGA + L ++Y +GEGS C+
Sbjct: 346 SRGNQ-------CYLITTSSNVDIFPQVSLNFAGGASLVLRPQDYLMQQNYIGEGSVWCI 398
Query: 423 TVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ G ILG+ +++ YDL QR+G+ C
Sbjct: 399 GF---QRIPGQSITILGDLVLKDKIFVYDLAGQRIGWANYDC 437
>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 645
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 113/402 (28%), Positives = 164/402 (40%), Gaps = 81/402 (20%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y+ L GTPPQ I+DTGS + + PC+ C++C S + P F P+ S +
Sbjct: 91 GYYTARLWIGTPPQRFALIVDTGSTVTYVPCST---CRHCGSHQDPKFRPEDSET----- 142
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSE------- 199
Q KC+W QC ND K CT Y Y T AL E
Sbjct: 143 YQPVKCTW------QCNCDNDR-----KQCT-----YERRYAEMSTSSGALGEDVVSFGN 186
Query: 200 -TLNLPNRIIPNFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQLNL-----DKFSYC 248
T P R I GC + +++ GI G GRG S+ QL D FS C
Sbjct: 187 QTELSPQRAI----FGCENDETGDIYNQRADGIMGLGRGDLSIMDQLVEKKVISDSFSLC 242
Query: 249 LLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
I S T + V +P YY + L+ I
Sbjct: 243 YGGMGVGGGAMVLGGI-------SPPADMVFTRSDPVRSP----------YYNIDLKEIH 285
Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
V G+R+ + K DG GT++DSGTT+ ++ F LA F ++K + + +
Sbjct: 286 VAGKRLHLNPKVF----DGKHGTVLDSGTTYAYLPESAF--LA--FKHAIMKETHSLKRI 337
Query: 369 GAEALTGLRPCF-----DVPGEKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCL 422
CF DV + + SFP +++ F G +++L ENY F A CL
Sbjct: 338 SGPDPRYNDICFSGAEIDV-SQISKSFPVVEMVFGNGHKLSLSPENYLFRHSKVRGAYCL 396
Query: 423 TVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
V ++ P+ +LG ++N V YD + ++GF + C
Sbjct: 397 GVFSN---GNDPTTLLGGIVVRNTLVMYDREHTKIGFWKTNC 435
>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
gi|194696366|gb|ACF82267.1| unknown [Zea mays]
gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 411
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 108/388 (27%), Positives = 156/388 (40%), Gaps = 67/388 (17%)
Query: 89 YSISLSFGTP--PQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSS 141
Y + +SFGTP PQ++ ++DTGS + W QCK CSS K P + P SS+
Sbjct: 79 YVVRVSFGTPAVPQVV--VIDTGSDVSWL------QCKPCSSGQCFPQKDPLYDPSHSST 130
Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSET 200
+ C + C + D + K C + + Y G T G +
Sbjct: 131 YSAVPCASDVCKKL------AADAYGSGCTSGKQC-----GFAISYADGTSTVGAYSQDK 179
Query: 201 LNL-PNRIIPNFLVGCSVLSSRQPA---GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
L L P I+ NF GC G+ G GR + SL ++ FSYCL S
Sbjct: 180 LTLAPGAIVQNFYFGCGHGKHAVRGLFDGVLGLGRLRESLGARYG-GVFSYCLPSV---- 234
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
+++ L L G K +G +TP P + FS V L I VGG+++ +
Sbjct: 235 SSKPGFLALGAG-----KNPSGFVFTPMGTVPG---QPTFST---VTLAGINVGGKKLDL 283
Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
+ GG IVDSGT T + + L F M R L
Sbjct: 284 RPSAFS------GGMIVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNG-------DL 330
Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
C+++ G K P++ L F GGA + L V N V G CL G +
Sbjct: 331 DTCYNLTGYKNVVVPKIALTFTGGATINLDVPNGILVNG-----CLAFA--ESGPDGSAG 383
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+LGN + + V +D + GF+ + C
Sbjct: 384 VLGNVNQRAFEVLFDTSTSKFGFRAKAC 411
>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 439
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 122/486 (25%), Positives = 194/486 (39%), Gaps = 72/486 (14%)
Query: 1 MASYISALCLSFIFFFTLLSIFPSSITSLTFSLSRFHTN--------PSQDSYQNLNSLV 52
MA L+ + + L +I FS+ H + P++ +Q + + V
Sbjct: 1 MAMITRYCSLALVLLWCLYNISFLKANDGGFSVEMIHRDSSRSPLYRPTETPFQRVANAV 60
Query: 53 SSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHL 112
S+ R H K K +T + +T ++S G Y + S G+PP + I+DTGS +
Sbjct: 61 RRSINRGNHFK----KAFVSTDSAESTVVASQ--GEYLMRYSVGSPPFQVLGIVDTGSDI 114
Query: 113 VWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLAT 172
+W C C+ C P F P S + + L C + C ES++ C+ +
Sbjct: 115 LWLQCE---PCEDCYKQTTPIFDPSKSKTYKTLPCSSNTC-----ESLRNTACSSD---- 162
Query: 173 SKNCTQICPSYLVLYGSGL-TEGIALSETLNLPNRI-----IPNFLVGCSVLSSRQPAGI 226
+C Y + YG G ++G ETL L + P ++GC +
Sbjct: 163 -----NVC-EYSIDYGDGSHSDGDLSVETLTLGSTDGSSVHFPKTVIGCGHNNGGTFQEE 216
Query: 227 AGFGRGKTSLPSQLNLD-------KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGL 279
G P L KFSYCL + F ++ +S L + + S + T
Sbjct: 217 GSGIVGLGGGPVSLISQLSSSIGGKFSYCL-APIFSESNSSSKLNFGDAAVVSGRGTVST 275
Query: 280 TYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTF 339
P V+Y++ L +VG R+ + G+G I+DSGTT
Sbjct: 276 PLDPLNGQ----------VFYFLTLEAFSVGDNRIEFSGSSSSGSGSGDGNIIIDSGTTL 325
Query: 340 TFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKG 399
T + E + L + VS ++K RA L L C+ ++ P + HFK
Sbjct: 326 TLLPQEDYLNL-ESAVSDVIK---LERARDPSKLLSL--CYKTTSDEL-DLPVITAHFK- 377
Query: 400 GAEVTL-PVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLG 458
GA+V L P+ + V E VC ++ + + I GN QN V YDL + +
Sbjct: 378 GADVELNPISTFVPV--EKGVVCFAFISSKIGA-----IFGNLAQQNLLVGYDLVKKTVS 430
Query: 459 FKQQLC 464
FK C
Sbjct: 431 FKPTDC 436
>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
Length = 458
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 106/387 (27%), Positives = 160/387 (41%), Gaps = 57/387 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKY-CSSSKIPSFIPKLSSSSRLL 145
G Y + GTP + ++DTGS L W C+ C C P F P+ SSS +
Sbjct: 119 GNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCS---PCLVSCHRQSGPVFNPRSSSSYASV 175
Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLP 204
C P+C + ++ C+ TS C Y YG S + G +T++
Sbjct: 176 SCSAPQCDALTTATLNPSTCS-----TSNVCI-----YQASYGDSSFSVGYLSKDTVSFG 225
Query: 205 NRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTT 258
+ +PNF GC + Q AG+ G R K SL QL FSYCL +
Sbjct: 226 STSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSGY 285
Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV-W 317
+ N +S YTP +A+ + Y++ + ITV G+ + V
Sbjct: 286 LSIGSY--NPGQYS--------YTP------MAKSSLDDSLYFIKMTGITVAGKPLSVSA 329
Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
Y +L TI+DSGT T + +++ L+ M + R A A + L
Sbjct: 330 SAYSSLP------TIIDSGTVITRLPTDVYSALSKAVAGAM---KGTPR---ASAFSILD 377
Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
CF + P++ + F GGA + L N V + + CL R A+ I
Sbjct: 378 TCFQGQASRL-RVPQVSMAFAGGAALKLKATNLLVDV-DSATTCLAFAPARSAA-----I 430
Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+GN Q Q + V YD++N ++GF C
Sbjct: 431 IGNTQQQTFSVVYDVKNSKIGFAAGGC 457
>gi|388493426|gb|AFK34779.1| unknown [Medicago truncatula]
Length = 454
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 105/417 (25%), Positives = 163/417 (39%), Gaps = 67/417 (16%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
YS S+ GTP + ++D +WF C + Y S++ + C
Sbjct: 50 YSTSIKLGTPAVPLDLVIDIRERFLWFECDDSYN----------------STTYNPIQCG 93
Query: 149 NPKCSWIHHESIQCRDCNDEPLAT--SKNCTQICPSYLVLYGSGLTEGIALSETLNLP-- 204
KC C DC + P T + N + P +G G + L+ P
Sbjct: 94 TKKCK--QARGTGCIDCTNHPSKTGCTNNTCGVEP--FNPFGGFFVSGDVGEDILSFPRV 149
Query: 205 --------NRIIPNFLVGCSVLS-----------SRQPAGIAGFGRGKTSLPSQL----N 241
N +P F+ C S+ G+ G R SLP+Q+
Sbjct: 150 TSDGRRVTNVRVPRFISSCVYPDKFGVQGFLEGLSKGKKGVLGLARTLISLPTQIATRFK 209
Query: 242 LD-KFSYCLLSHKFDDTTRTSSLILDNG----SSHSDKKTTGLTYTPFVNNPSVAE---R 293
LD KF+ CL S + SL + G S+ D + L YTP + N
Sbjct: 210 LDRKFTLCLPSTSQKNGLGPGSLFVGGGPYNLGSNKDDASKFLKYTPLITNRRSTGPIFD 269
Query: 294 NAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADE 353
N S Y++ ++ I V V L++++ G GGT + + T + ++ PL +
Sbjct: 270 NFPSTEYFIKVKSIKVDNNVVNFNTTLLSINKLGEGGTKLSTVIPHTTLHTSIYNPLLNA 329
Query: 354 FVSQMVKNRNYTRALGAEALTGLRPCFD---VPGEKTG-SFPELKLHFKGGAEVTLPVEN 409
FV + + R R +A+ CFD + G + P + L KGG E + N
Sbjct: 330 FVKK-AEIRKIKR---VKAVAPFGACFDSRTISKSVNGPNVPTIDLVLKGGVEWRIFGAN 385
Query: 410 YFAVVGEGSAVCLTVVTDREASGGP---SIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
V E + +CL V GP SII+G Q+++ VE+DL + +LGF L
Sbjct: 386 SMVKVNE-NVLCLGFVDAGSEEVGPSATSIIIGGHQLEDNLVEFDLVSSKLGFSSSL 441
>gi|357127503|ref|XP_003565419.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 486
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 116/410 (28%), Positives = 156/410 (38%), Gaps = 78/410 (19%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y +++ GTPP + I DTGS LVW C ++ F+P SS+ +GC
Sbjct: 110 YLMAIEVGTPPVRVLAIADTGSDLVWVKCKGKDNDNNSTAPPSVYFVPSASSTYGRVGCD 169
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICP-SYLVLYGSGLTEGIALS-ETLNLPNR 206
C L+++ +C+ YL YG G LS ET
Sbjct: 170 TKAC---------------RALSSAASCSPDGSCEYLYSYGDGSRASGQLSTETFTFSTI 214
Query: 207 I----------------------IPNFLVGCSVLSSR--QPAGIAGFGRGKTSLPSQLNL 242
I GCS ++ + G+ G G G SL SQL
Sbjct: 215 ADSSKTNSHGNNNNNSSSHGQVEIAKLDFGCSTTTTGTFRADGLVGLGGGPVSLASQLGA 274
Query: 243 D-----KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFS 297
KFSYCL + +T +S+L N S + G TP +
Sbjct: 275 TTSLGRKFSYCLA--PYANTNASSAL---NFGSRAVVSEPGAASTPLITG-------EVE 322
Query: 298 VYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQ 357
YY + L I V G + + IVDSGTT T++ L PL V
Sbjct: 323 TYYTIALDSINVAGTKRPTTAAQAHI--------IVDSGTTLTYLDSALLTPL----VKD 370
Query: 358 MVKNRNYTRALGAEALTGLRPCFD---VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVV 414
+ + RA E + L C+D V GE P++ L GG EVTL +N F VV
Sbjct: 371 LTRRIKLPRAESPEKILDL--CYDISGVRGEDALGIPDVTLVLGGGGEVTLKPDNTFVVV 428
Query: 415 GEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
EG +CL +V E ILGN QN +V YDL + F C
Sbjct: 429 QEG-VLCLALVATSERQS--VSILGNIAQQNLHVGYDLEKGTVTFAAADC 475
>gi|225465837|ref|XP_002264626.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 1 [Vitis
vinifera]
Length = 437
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 105/382 (27%), Positives = 155/382 (40%), Gaps = 52/382 (13%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + GTP Q + +DT S + W PC C CSS+ F S++ + LGCQ
Sbjct: 101 YIVRAKIGTPAQTMLMAMDTSSDVAWIPCNG---CLGCSSTL---FNSPASTTYKSLGCQ 154
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
+C + + C +C L GS L ++ +T+ L +
Sbjct: 155 AAQCKQVPKPT-----CGGG----------VCSFNLTYGGSSLAANLS-QDTITLATDAV 198
Query: 209 PNFLVGC------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSS 262
P + GC L ++ G+ S L FSYCL S F + S
Sbjct: 199 PGYSFGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPS--FKSLNFSGS 256
Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
L L G K+ + YTP + NP R + Y+V L + VG + V V T
Sbjct: 257 LRL--GPVGQPKR---IKYTPLLKNP---RRPSL---YFVNLMAVRVGRRVVDVPPGSFT 305
Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
+ GTI DSGT FT + + + D F +++ +N L +L G C+ V
Sbjct: 306 FNPSTGAGTIFDSGTVFTRLVTPAYIAVRDAFRNRVGRN------LTVTSLGGFDTCYTV 359
Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
P P + F G VTLP +N GS CL + + ++ N Q
Sbjct: 360 PIAA----PTITFMFTG-MNVTLPPDNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIANLQ 414
Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
QN+ + YD+ N RLG ++LC
Sbjct: 415 QQNHRLLYDVPNSRLGVARELC 436
>gi|343172996|gb|AEL99201.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 109/390 (27%), Positives = 164/390 (42%), Gaps = 73/390 (18%)
Query: 95 FGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSW 154
GTPPQ I+DTGS + + PC + C C + + P F P LS + + C NP C+
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCNS---CDQCGNHQDPKFQPDLSDTYHPVKC-NPDCT- 56
Query: 155 IHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVG 214
E+ QC + A + + I LV +G+ +SE P R + G
Sbjct: 57 CDTENDQC--TYERQYAEMSSSSGILGEDLVSFGN-------MSEL--KPQRAV----FG 101
Query: 215 CS-----VLSSRQPAGIAGFGRGKTSLPSQLNL-----DKFSYCLLSHKFDDTTRTSSLI 264
C L S+ GI G GRG S+ QL D FS C + +++
Sbjct: 102 CENAETGDLFSQHADGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGME----VGGGAMV 157
Query: 265 LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLD 324
L S SD V + S +R S YY + LR + V G+++ + +
Sbjct: 158 LGQISPPSD----------MVFSHSDPDR---SPYYNIELRGLHVAGKKLDINPQVF--- 201
Query: 325 RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP-----C 379
DG GTI+DSGTT+ ++ F P S++ G + + G P C
Sbjct: 202 -DGKHGTILDSGTTYAYLPEAAFLPFIQAITSEL---------HGLKQIRGPDPNYNDVC 251
Query: 380 FDVPG----EKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVVTDREASGGP 434
F G E +FP + + F G + +L ENY F A CL V + + P
Sbjct: 252 FSGAGSEIPELYKTFPSVDMVFDNGEKYSLSPENYLFKHSKVHGAYCLGVFQNGKD---P 308
Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ +LG ++N V YD + ++GF + C
Sbjct: 309 TTLLGGIVVRNTLVTYDREHSKVGFWKTNC 338
>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 115/389 (29%), Positives = 152/389 (39%), Gaps = 68/389 (17%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS-----FIPKLSSSSR 143
Y +++S GTP +DTGS + W QCK CS+ S F P SS+
Sbjct: 143 YVVTVSLGTPGVSQTVEVDTGSDVSWV------QCKPCSAPACNSQRDQLFDPAKSSTYS 196
Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLN 202
+ C CS + C + C Y+V YG G T G+ S+TL
Sbjct: 197 AVPCGADACSELRIYEAGC---------SGSQC-----GYVVSYGDGSNTTGVYGSDTLA 242
Query: 203 L-PNRIIPNFLVGCSVLSSRQPAGIAGF---GRGKTSLPSQLNL---DKFSYCLLSHKFD 255
L P + FL GC + AGI G GR SL SQ FSYCL S +
Sbjct: 243 LAPGNTVGTFLFGCGHAQAGMFAGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPSKQ-- 300
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
+ L L SS S TTGL A +Y V L I+VGGQ+V
Sbjct: 301 --SAAGYLTLGGPSSASGFATTGLL-----------TAWAAPTFYMVMLTGISVGGQQVA 347
Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
V GGT+VD+GT T + P + L F + Y A A
Sbjct: 348 VPASAFA------GGTVVDTGTVITRLPPTAYAALRSAFRGAIAPC-GYPS---APANGI 397
Query: 376 LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPS 435
L C+D + P + L F GGA + L S+ CL + G +
Sbjct: 398 LDTCYDFSRYGVVTLPTVALTFSGGATLALEAPGIL------SSGCLAFAPN--GGDGDA 449
Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
ILGN Q +++ V +D +GF C
Sbjct: 450 AILGNVQQRSFAVRFD--GSTVGFMPGAC 476
>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
Length = 464
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 104/385 (27%), Positives = 159/385 (41%), Gaps = 58/385 (15%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y I+++ GTP +DTGS + W C + CSS K F P +S++ C
Sbjct: 129 YVITVTIGTPAVTQVMSIDTGSDVSWVQCAP-CAAQSCSSQKDKLFDPAMSATYSAFSCG 187
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNL-PNR 206
+ +C+ + E C + Y+V YG G T G S+TL+L +
Sbjct: 188 SAQCAQLGDEGNGCLKSQCQ--------------YIVKYGDGSNTAGTYGSDTLSLTSSD 233
Query: 207 IIPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRT 260
+ +F GCS ++ + G+ G G SL SQ FSYCL ++
Sbjct: 234 AVKSFQFGCSHRAAGFVGELDGLMGLGGDTESLVSQTAATYGKAFSYCLPPPS---SSGG 290
Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
L L S + ++TP V R + +Y V L+ ITV G + V
Sbjct: 291 GFLTLGAAGGASSSR---YSHTPMV-------RFSVPTFYGVFLQGITVAGTMLNVPASV 340
Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG-LRPC 379
+G ++VDSGT T + P ++ L F +M +A + A G L C
Sbjct: 341 F------SGASVVDSGTVITQLPPTAYQALRTAFKKEM-------KAYPSAAPVGSLDTC 387
Query: 380 FDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILG 439
FD G T + P + L F GA + L + A CL A G + ILG
Sbjct: 388 FDFSGFNTITVPTVTLTFSRGAAMDLDISGILY------AGCLAFTA--TAHDGDTGILG 439
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
N Q + + + +D+ + +GF+ C
Sbjct: 440 NVQQRTFEMLFDVGGRTIGFRSGAC 464
>gi|222635172|gb|EEE65304.1| hypothetical protein OsJ_20543 [Oryza sativa Japonica Group]
Length = 274
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 80/246 (32%), Positives = 107/246 (43%), Gaps = 72/246 (29%)
Query: 225 GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSS------HSDKKTTG 278
GIAGFGRG+ SLPSQLN+ FSYC S FD T++SS++ ++ H T
Sbjct: 88 GIAGFGRGRWSLPSQLNVTSFSYCFTS-MFD--TKSSSVVTLGAAAAELLHTHHAAHTGD 144
Query: 279 LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTT 338
+ T + NPS Y+V LR I+VGG RV V L TI+DSG +
Sbjct: 145 VRTTRLIKNPSQPS------LYFVPLRGISVGGARVAVPESRL------RSSTIIDSGAS 192
Query: 339 FTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFK 398
T + +++E + EFVSQ
Sbjct: 193 ITTLPEDVYEAVKAEFVSQ----------------------------------------- 211
Query: 399 GGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLG 458
LP NY V + +A L VV D A+ G +++GN+Q QN +V YDL N L
Sbjct: 212 ------LPRGNY--VFEDYAARVLCVVLD--AAAGEQVVIGNYQQQNTHVVYDLENDVLS 261
Query: 459 FKQQLC 464
F C
Sbjct: 262 FAPARC 267
>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 445
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 108/388 (27%), Positives = 156/388 (40%), Gaps = 67/388 (17%)
Query: 89 YSISLSFGTP--PQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSS 141
Y + +SFGTP PQ++ ++DTGS + W QCK CSS K P + P SS+
Sbjct: 113 YVVRVSFGTPAVPQVV--VIDTGSDVSWL------QCKPCSSGQCFPQKDPLYDPSHSST 164
Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSET 200
+ C + C + D + K C + + Y G T G +
Sbjct: 165 YSAVPCASDVCKKL------AADAYGSGCTSGKQC-----GFAISYADGTSTVGAYSQDK 213
Query: 201 LNL-PNRIIPNFLVGCSVLSSRQPA---GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
L L P I+ NF GC G+ G GR + SL ++ FSYCL S
Sbjct: 214 LTLAPGAIVQNFYFGCGHGKHAVRGLFDGVLGLGRLRESLGARYG-GVFSYCLPSV---- 268
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
+++ L L G K +G +TP P + FS V L I VGG+++ +
Sbjct: 269 SSKPGFLALGAG-----KNPSGFVFTPMGTVPG---QPTFST---VTLAGINVGGKKLDL 317
Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
+ GG IVDSGT T + + L F M R L
Sbjct: 318 RPSAFS------GGMIVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNG-------DL 364
Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
C+++ G K P++ L F GGA + L V N V G CL G +
Sbjct: 365 DTCYNLTGYKNVVVPKIALTFTGGATINLDVPNGILVNG-----CLAFA--ESGPDGSAG 417
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+LGN + + V +D + GF+ + C
Sbjct: 418 VLGNVNQRAFEVLFDTSTSKFGFRAKAC 445
>gi|15222357|ref|NP_174430.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322538|gb|AAG51267.1|AC027135_8 chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|67633408|gb|AAY78629.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332193236|gb|AEE31357.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 445
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 128/491 (26%), Positives = 189/491 (38%), Gaps = 87/491 (17%)
Query: 4 YISALCLSFIFFFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIK 63
Y S L +SF F SS ++ H + N + VS L A
Sbjct: 8 YCSLLAISFFFASN------SSANRENLTVELIHRDSPHSPLYNPHHTVSDRLNAAFLRS 61
Query: 64 NPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC 123
+++ TT T + IS+ G Y +S+S GTPP + I DTGS L W C C
Sbjct: 62 ISRSRRFTTKTDLQSGLISNG--GEYFMSISIGTPPSKVFAIADTGSDLTWVQCK---PC 116
Query: 124 KYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSY 183
+ C P F K SS+ + C + C + C + D IC Y
Sbjct: 117 QQCYKQNSPLFDKKKSSTYKTESCDSKTCQALSEHEEGCDESKD-----------IC-KY 164
Query: 184 LVLYG-SGLTEGIALSETL-----NLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKT--- 234
YG + T+G +ET+ + + P + GC G+ G T
Sbjct: 165 RYSYGDNSFTKGDVATETISIDSSSGSSVSFPGTVFGC------------GYNNGGTFEE 212
Query: 235 -------------SLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGS--SHSDKKT 276
SL SQL KFSYC LSH T TS + L S S+ K +
Sbjct: 213 TGSGIIGLGGGPLSLVSQLGSSIGKKFSYC-LSHTAATTNGTSVINLGTNSIPSNPSKDS 271
Query: 277 TGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGN---GGTIV 333
LT TP + YY++ L +TVG ++ L+ + G I+
Sbjct: 272 ATLT-TPLIQKDP-------ETYYFLTLEAVTVGKTKLPYTGGGYGLNGKSSKRTGNIII 323
Query: 334 DSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPEL 393
DSGTT T + ++ + + + G L CF G+K P +
Sbjct: 324 DSGTTLTLLDSGFYDDFGTAVEESVTGAKRVSDPQGL-----LTHCFK-SGDKEIGLPAI 377
Query: 394 KLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLR 453
+HF A+V L N F + E + VCL+++ E + I GN ++ V YDL
Sbjct: 378 TMHFT-NADVKLSPINAFVKLNEDT-VCLSMIPTTEVA-----IYGNMVQMDFLVGYDLE 430
Query: 454 NQRLGFKQQLC 464
+ + F++ C
Sbjct: 431 TKTVSFQRMDC 441
>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 458
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 107/386 (27%), Positives = 163/386 (42%), Gaps = 56/386 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKY-CSSSKIPSFIPKLSSSSRLL 145
G Y + GTP ++DTGS L W C+ C C P F PK SS+ +
Sbjct: 120 GNYVTRMGLGTPATQYVMVVDTGSSLTWLQCS---PCLVSCHRQSGPVFNPKSSSTYASV 176
Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLP 204
GC +CS + ++ C+ +S C Y YG S + G +T++
Sbjct: 177 GCSAQQCSDLPSATLNPSACS-----SSNVCI-----YQASYGDSSFSVGYLSKDTVSFG 226
Query: 205 NRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTT 258
+ +PNF GC + + AG+ G R K SL QL F+YCL ++
Sbjct: 227 STSLPNFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFTYCL---PSSSSS 283
Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
SL N +S YTP V++ S+ + Y++ L +TV G + V
Sbjct: 284 GYLSLGSYNPGQYS--------YTPMVSS-SLDDS-----LYFIKLSGMTVAGNPLSVSS 329
Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
+ TI+DSGT T + ++ L+ + M + +R A A + L
Sbjct: 330 SAYSSLP-----TIIDSGTVITRLPTSVYSALSKAVAAAM---KGTSR---ASAYSILDT 378
Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
CF + S P + + F GGA + L +N V + S CL R A+ I+
Sbjct: 379 CFKGQASRV-SAPAVTMSFAGGAALKLSAQNLLVDV-DDSTTCLAFAPARSAA-----II 431
Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
GN Q Q + V YD+++ R+GF C
Sbjct: 432 GNTQQQTFSVVYDVKSSRIGFAAGGC 457
>gi|225217039|gb|ACN85323.1| aspartic proteinase nepenthesin-1 precursor [Oryza brachyantha]
Length = 287
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 91/300 (30%), Positives = 134/300 (44%), Gaps = 31/300 (10%)
Query: 172 TSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNR-IIPNFLVGCSVLSSR---QPAGI 226
T++ C+ Y V YG G T G +TL L + I F GC + + AG+
Sbjct: 12 TTRGCSGGHCLYGVQYGDGSYTIGFFAMDTLTLSSHDAIKGFRFGCGERNEGLFGEAAGL 71
Query: 227 AGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVN 286
G GRGKTSLP Q DK+ + +H F + + L+ G S + L+ TP +
Sbjct: 72 LGLGRGKTSLPVQ-TYDKYG-GVFAHCFPARSSGTGY-LEFGPGSSPAVSAKLSTTPMLI 128
Query: 287 NPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPEL 346
+ +YYVG+ I VGG+ + + GTIVDSGT T + P
Sbjct: 129 DTG-------PTFYYVGMTGIRVGGKLLPIPQSVFA-----AAGTIVDSGTVITRLPPAA 176
Query: 347 FEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLP 406
+ L F + M R Y R A AL+ L C+D+ G + P + L F+GG + +
Sbjct: 177 YSSLRSAFAASMAA-RGYKR---APALSLLDTCYDLTGASEVAIPTVSLLFQGGVSLDVD 232
Query: 407 VEN--YFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
Y A V S CL + A I+GN Q++ + V YD+ ++ +GF C
Sbjct: 233 ASGIIYAASV---SQACLGFAGNEAAD--DVAIVGNTQLKTFGVVYDIASKVVGFCPGAC 287
>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 472
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 110/392 (28%), Positives = 162/392 (41%), Gaps = 60/392 (15%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSSSR 143
Y ++L GTP ++DTGS L W QCK C++S K P F P SS+
Sbjct: 125 YVVTLGIGTPAVQQTVLIDTGSDLSWV------QCKPCNASDCYPQKDPLFDPSKSSTFA 178
Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLN 202
+ C + C + + C + C Y + YG+G +TEG+ +ETL
Sbjct: 179 TIPCASDACKQLPVDGYD-NGCTNNTSGMPPQC-----GYAIEYGNGAITEGVYSTETLA 232
Query: 203 L-PNRIIPNFLVGCSVLSSRQP----AGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHKF 254
L + ++ +F GC P G+ G G SL SQ + FSYCL
Sbjct: 233 LGSSAVVKSFRFGCGS-DQHGPYDKFDGLLGLGGAPESLVSQTASVYGGAFSYCL----- 286
Query: 255 DDTTRTSSLILDNGSSHS-DKKTTGLTYTPF-VNNPSVAERNAFSVYYYVGLRRITVGGQ 312
+ + L G+ +S + +G +TP +P +A +Y V L I+VGG+
Sbjct: 287 -PPLNSGAGFLTLGAPNSTNNSNSGFVFTPMHAFSPKIA------TFYVVTLTGISVGGK 339
Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
+ + G IVDSGT T + ++ L F S M + L A
Sbjct: 340 ALDIPPAVFAK------GNIVDSGTVITGIPTTAYKALRTAFRSAMAE-----YPLLPPA 388
Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG 432
+ L C++ G T + P++ L F GGA V L V + V CL + S
Sbjct: 389 DSALDTCYNFTGHGTVTVPKVALTFVGGATVDLDVPSGVLV-----EDCLAFADAGDGSF 443
Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
G I+GN + V YD LGF+ C
Sbjct: 444 G---IIGNVNTRTIEVLYDSGKGHLGFRAGAC 472
>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 418
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 129/486 (26%), Positives = 199/486 (40%), Gaps = 91/486 (18%)
Query: 1 MASYISALCLSFIFFFTLLSIFPSSITSLT-----FSLSRFHTNPSQDSYQNLNSLVSSS 55
MA+ IS +FF +L + S T++ F+ S FH +DS L+ L SS
Sbjct: 1 MAATIS------LFFHLILFLISFSQTTIINGDNGFTTSLFH----RDSL--LSPLEFSS 48
Query: 56 LTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWF 115
L+ + N ++ + + +S + G S + GTPP I DTGS L W
Sbjct: 49 LSHYDRLANAFRRSLSRSAALLNRAATSGAVGLQSSII--GTPPVDYLGIADTGSDLTWA 106
Query: 116 PCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIH--HESIQCRDCNDEPLATS 173
C C C P F P S+S + C C + H +Q
Sbjct: 107 QC---LPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTCHAVDDGHCGVQ------------ 151
Query: 174 KNCTQICPSYLVLYGS-GLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQ---PAGIAGF 229
+C Y YG ++G E + + + + + ++GC SS +G+ G
Sbjct: 152 ----GVC-DYSYTYGDRTYSKGDLGFEKITIGSSSVKS-VIGCGHASSGGFGFASGVIGL 205
Query: 230 GRGKTSLPSQLNLD-----KFSYCL---LSHKFDDTTRTSSLILDNGSSHSDKKTTGLTY 281
G G+ SL SQ++ +FSYCL LSH + ++ G+
Sbjct: 206 GGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFGQNAVVSG---------PGVVS 256
Query: 282 TPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTF 341
TP ++ +V YYY+ L I++G +R + K G I+DSGTT +F
Sbjct: 257 TPLISKNTV-------TYYYITLEAISIGNERHMAFAK--------QGNVIIDSGTTLSF 301
Query: 342 MAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD--VPGEKTGSFPELKLHFKG 399
+ EL+ D VS ++K R L CFD + + P + F G
Sbjct: 302 LPKELY----DGVVSSLLKVVKAKRVKDPGNFWDL--CFDDGINVATSSGIPIITAQFSG 355
Query: 400 GAEVTL-PVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLG 458
GA V L PV + V + + LT + + G I+GN + N+ + YDL +RL
Sbjct: 356 GANVNLLPVNTFQKVANNVNCLTLTPASPTDEFG----IIGNLALANFLIGYDLEAKRLS 411
Query: 459 FKQQLC 464
FK +C
Sbjct: 412 FKPTVC 417
>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 452
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 108/394 (27%), Positives = 176/394 (44%), Gaps = 57/394 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWF---PCTNHYQCKYCSSSKIPSFIPKLSSSSR 143
G Y + + G+P + I+DTGS W PCT YC + P F P S + +
Sbjct: 101 GNYYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCT-----IYCHIQEDPVFNPSASKTYK 155
Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLN 202
+ C + +CS + ++ C+ + S C Y YG S + G + L
Sbjct: 156 TVPCSSSQCSSLKSATLNEPTCSKQ----SNACV-----YKASYGDSSFSLGYLSQDVLT 206
Query: 203 L-PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLN---LDKFSYCLLSHKFD 255
L P++ + +F+ GC + + GI G + S+ SQL+ + FSYCL + F
Sbjct: 207 LTPSQTLSSFVYGCGQDNQGLFGRTDGIIGLANNELSMLSQLSGKYGNAFSYCLPT-SFS 265
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
L G+S S ++ +TP + NP+ Y++ L ITV G+ +
Sbjct: 266 TPNSPKEGFLSIGTS-SLTPSSSYKFTPLLKNPNNPS------LYFIDLESITVAGRPLG 318
Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
V + TI+DSGT T + ++ L + +V+ + ++ Y +A G ++
Sbjct: 319 VAASSYKVP------TIIDSGTVITRLPTPVYTTLKNAYVT--ILSKKYQQAPG---ISL 367
Query: 376 LRPCFDVPGEKTG---SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG 432
L CF G G P++++ FKGGA++ L N + E CL + +G
Sbjct: 368 LDTCFK--GSLAGISEVAPDIRIIFKGGADLQLKGHNSLVEL-ETGITCLAM------AG 418
Query: 433 GPSI-ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
SI I+GN+Q Q V YD+ N R+GF C+
Sbjct: 419 SSSIAIIGNYQQQTVKVAYDVGNSRVGFAPGGCQ 452
>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
Length = 452
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 108/394 (27%), Positives = 176/394 (44%), Gaps = 57/394 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWF---PCTNHYQCKYCSSSKIPSFIPKLSSSSR 143
G Y + + G+P + I+DTGS W PCT YC + P F P S + +
Sbjct: 101 GNYYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCT-----IYCHIQEDPVFNPSASKTYK 155
Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLN 202
+ C + +CS + ++ C+ + S C Y YG S + G + L
Sbjct: 156 TVPCSSSQCSSLKSATLNEPTCSKQ----SNACV-----YKASYGDSSFSLGYLSQDVLT 206
Query: 203 L-PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLN---LDKFSYCLLSHKFD 255
L P++ + +F+ GC + + GI G + S+ SQL+ + FSYCL + F
Sbjct: 207 LTPSQTLSSFVYGCGQDNQGLFGRTDGIIGLANNELSMLSQLSGKYGNAFSYCLPT-SFS 265
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
L G+S S ++ +TP + NP+ Y++ L ITV G+ +
Sbjct: 266 TPNSPKEGFLSIGTS-SLTPSSSYKFTPLLKNPNNPS------LYFIDLESITVAGRPLG 318
Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
V + TI+DSGT T + ++ L + +V+ + ++ Y +A G ++
Sbjct: 319 VAASSYKVP------TIIDSGTVITRLPTPVYTTLKNAYVT--ILSKKYQQAPG---ISL 367
Query: 376 LRPCFDVPGEKTG---SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG 432
L CF G G P++++ FKGGA++ L N + E CL + +G
Sbjct: 368 LDTCFK--GSLAGISEVAPDIRIIFKGGADLQLKGHNSLVEL-ETGITCLAM------AG 418
Query: 433 GPSI-ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
SI I+GN+Q Q V YD+ N R+GF C+
Sbjct: 419 SSSIAIIGNYQQQTVKVAYDVGNSRVGFAPGGCQ 452
>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 114/389 (29%), Positives = 152/389 (39%), Gaps = 68/389 (17%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS-----FIPKLSSSSR 143
Y +++S GTP +DTGS + W QCK CS+ S F P SS+
Sbjct: 143 YVVTVSLGTPGVSQTVEVDTGSDVSWV------QCKPCSAPACNSQRDQLFDPAKSSTYS 196
Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLN 202
+ C CS + C + C Y+V YG G T G+ S+TL
Sbjct: 197 AVPCGADACSELRIYEAGC---------SGSQC-----GYVVSYGDGSNTTGVYGSDTLA 242
Query: 203 L-PNRIIPNFLVGCSVLSSRQPAGIAGF---GRGKTSLPSQLNL---DKFSYCLLSHKFD 255
L P + FL GC + AGI G GR SL SQ FSYCL S +
Sbjct: 243 LAPGNTVGTFLFGCGHAQAGMFAGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPSKQ-- 300
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
+ L L +S S TTGL A +Y V L I+VGGQ+V
Sbjct: 301 --SAAGYLTLGGPTSASGFATTGLL-----------TAWAAPTFYMVMLTGISVGGQQVA 347
Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
V GGT+VD+GT T + P + L F + Y A A
Sbjct: 348 VPASAFA------GGTVVDTGTVITRLPPTAYAALRSAF-RGAIAPYGYPS---APANGI 397
Query: 376 LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPS 435
L C+D + P + L F GGA + L S+ CL + G +
Sbjct: 398 LDTCYDFSRYGVVTLPTVALTFSGGATLALEAPGIL------SSGCLAFAPN--GGDGDA 449
Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
ILGN Q +++ V +D +GF C
Sbjct: 450 AILGNVQQRSFAVRFD--GSTVGFMPGAC 476
>gi|255553149|ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223543249|gb|EEF44781.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 449
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 133/504 (26%), Positives = 196/504 (38%), Gaps = 98/504 (19%)
Query: 1 MASYISALCLSFIFFFTLLSIFP-SSITSLTFSLSRFHT--------NPSQDSYQNLNSL 51
MA+ S FI F +++S F + FS + H NP + L +
Sbjct: 1 MAAVSSIYVSLFIAFISMVSAFSLVEARNAGFSANLIHRDSSVSPLYNPRDTYFDRLRNS 60
Query: 52 VSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSH 111
S++RA K + + ++I G Y + +S G P I I DTGS
Sbjct: 61 FHRSISRANRFK----PNSISARALVQSDIVPGG-GEYLMRISIGNPQVEILAIADTGSD 115
Query: 112 LVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLA 171
L+W C C+ C P F P+ SSS R + C N C+ + E+ C
Sbjct: 116 LIWVQCQ---PCEMCYKQNSPIFDPRRSSSYRNVLCGNEFCNKLDGEARSC--------- 163
Query: 172 TSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQPAGIA---- 227
++ + C Y YG ++ + + I F +G + +S A IA
Sbjct: 164 DARGFVKTC-GYTYSYG---------DQSFSDGHLAIERFGIGST--NSNTSAAIAYFQE 211
Query: 228 -GFGRG--------------------KTSLPSQLNLD---KFSYCLLSHKFDDTTRTSSL 263
FG G SL SQL KFSYCL+ + + TS +
Sbjct: 212 VAFGCGTKNGGTFDELGSGIIGLGGGSMSLVSQLGPKLSGKFSYCLVPTS-EQSNYTSKI 270
Query: 264 ILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV---RVWHKY 320
N D +G Y V+ P + ++ YYY+ L I+V +R+ +W+
Sbjct: 271 NFGN-----DINISGSNYN-VVSTPLLPKKP--ETYYYLTLEAISVENKRLPYTNLWNGE 322
Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
+ G I+DSGTT TF+ E F L D V + VK + G CF
Sbjct: 323 VE-----KGNIIIDSGTTLTFLDSEFFNNL-DSAVEEAVKGERVSDPHGL-----FNICF 371
Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
EK P + HF GA+V L N FA V E +C T++ + + I GN
Sbjct: 372 --KDEKAIELPIITAHFT-GADVELQPVNTFAKV-EEDLLCFTMIPSNDIA-----IFGN 422
Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
N+ V YDL + + F C
Sbjct: 423 LAQMNFLVGYDLEKKAVSFLPTDC 446
>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 106/397 (26%), Positives = 168/397 (42%), Gaps = 71/397 (17%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y+ L GTPPQ I+D+GS + + PC + C+ C + + P F P LSS+ +
Sbjct: 86 GYYTTRLHIGTPPQEFALIVDSGSTVTYVPCAS---CEQCGNHQDPRFQPDLSSTYSPVK 142
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
C N C+ + QC + A + + + +V +G+ E+ P R
Sbjct: 143 C-NVDCT-CDSDKNQC--TYERQYAEMSSSSGVLGEDIVSFGT---------ESELKPQR 189
Query: 207 IIPNFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQLNL-----DKFSYCLLSHKFDD 256
+ GC L S+ GI G GRG+ S+ QL D FS C
Sbjct: 190 AV----FGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIG- 244
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAF-SVYYYVGLRRITVGGQRVR 315
+++L + G+ YT NA S YY + L+ + V G+ +R
Sbjct: 245 ---GGAMVLG-----AMPAPPGMIYT---------HSNAVRSPYYNIELKEMHVAGKALR 287
Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM-------VKNRNYTRAL 368
V + DG GT++DSGTT+ ++ + F D SQ+ + NY
Sbjct: 288 VDPRIF----DGKHGTVLDSGTTYAYLPEQAFVAFKDAVSSQVHPLKKIRGPDSNYKDIC 343
Query: 369 GAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVVTD 427
A A + +V FP++ + F G +++L ENY F A CL V +
Sbjct: 344 FAGAGRNVSQLSEV-------FPKVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQN 396
Query: 428 REASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ P+ +LG ++N V YD N+++GF + C
Sbjct: 397 GKD---PTTLLGGIVVRNTLVTYDRHNEKIGFWKTNC 430
>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 106/399 (26%), Positives = 167/399 (41%), Gaps = 75/399 (18%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y+ L GTPPQ I+D+GS + + PC + C+ C + + P F P LSS+ +
Sbjct: 86 GYYTTRLHIGTPPQEFALIVDSGSTVTYVPCAS---CEQCGNHQDPRFQPDLSSTYSPVK 142
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
C N C+ + QC + A + + + +V +G+ E+ P R
Sbjct: 143 C-NVDCT-CDSDKNQC--TYERQYAEMSSSSGVLGEDIVSFGT---------ESELKPQR 189
Query: 207 IIPNFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQLNL-----DKFSYCLLSHKFDD 256
+ GC L S+ GI G GRG+ S+ QL D FS C
Sbjct: 190 AV----FGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIG- 244
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAF-SVYYYVGLRRITVGGQRVR 315
+++L + G+ YT NA S YY + L+ + V G+ +R
Sbjct: 245 ---GGAMVLG-----AMPAPPGMIYT---------HSNAVRSPYYNIELKEMHVAGKALR 287
Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
V + DG GT++DSGTT+ ++ + F D SQ+ + + G
Sbjct: 288 VDPRIF----DGKHGTVLDSGTTYAYLPEQAFVAFKDAVSSQVHPLKK---------IRG 334
Query: 376 LRP-----CFDVPGEKTGS----FPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVV 425
P CF G FP++ + F G +++L ENY F A CL V
Sbjct: 335 PDPNYKDICFAGAGRNVSQLSEVFPKVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVF 394
Query: 426 TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ + P+ +LG ++N V YD N+++GF + C
Sbjct: 395 QNGKD---PTTLLGGIVVRNTLVTYDRHNEKIGFWKTNC 430
>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
vinifera]
Length = 561
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 110/398 (27%), Positives = 160/398 (40%), Gaps = 58/398 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSS--SKIPSFIPKLSSSSRL 144
G Y + GTP + +DTGS ++W C +C S + + K S++S
Sbjct: 153 GLYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDA 212
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLP 204
+GC + CS D PL K Q Y VLYG G + +
Sbjct: 213 VGCDDNFCSLY-----------DGPLPGCKPGLQCL--YSVLYGDGSSTTGYFVQDFVQY 259
Query: 205 NRIIPNF---------LVGCSVL-------SSRQPAGIAGFGRGKTSLPSQLNLDKFSYC 248
NRI NF + GC SS GI GFG+ +S+ SQL
Sbjct: 260 NRISGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKK 319
Query: 249 LLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
+ SH D+ +D G + G P VN + + A +Y V ++ I
Sbjct: 320 VFSHCLDN--------VDGGGIFA----IGEVVEPKVNITPLVQNQA---HYNVVMKEIE 364
Query: 309 VGGQRVRV-WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRA 367
VGG + V + + DR G TI+DSGTT + E++ PL ++ +SQ R +T
Sbjct: 365 VGGDPLDVPSDAFESGDRKG---TIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTV- 420
Query: 368 LGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTL-PVENYFAVVGEGSAVCLTVVT 426
+A T CFD G FP + LHF +T+ P E F V +
Sbjct: 421 --EQAFT----CFDYTGNVDDGFPTVTLHFDKSISLTVYPHEYLFQVKEFEWCIGWQNSG 474
Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ G +LG+ + N V YDL Q +G+ + C
Sbjct: 475 AQTKDGKDLTLLGDLVLSNKLVVYDLEKQGIGWVEYNC 512
>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 488
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 107/402 (26%), Positives = 164/402 (40%), Gaps = 65/402 (16%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC--KYCSSSKIPSFIPKLSSSSRL 144
G Y + G PP+ +DTGS ++W C N +C K K+ + P+ S+S+
Sbjct: 80 GLYFAKIGLGNPPKDYYVQVDTGSDILWVNCANCDKCPTKSDLGVKLTLYDPQSSTSATR 139
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICP-SYLVLYGSG-LTEGIALSETLN 202
+ C + C+ ++ +Q CT+ P Y V+YG G T G + + L
Sbjct: 140 IYCDDDFCAATYNGVLQ-------------GCTKDLPCQYSVVYGDGSSTAGFFVKDNLQ 186
Query: 203 LPNRIIPNF---------LVGCSV-------LSSRQPAGIAGFGRGKTSLPSQLNLDKFS 246
+R+ N + GC SS GI GFG+ +S+ SQL
Sbjct: 187 F-DRVTGNLQTSSANGSVIFGCGAKQSGELGTSSEALDGILGFGQANSSMISQLAAAGKV 245
Query: 247 YCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFV-NNPSVAERNAFSVYYYVGLR 305
+ +H D+ I G S K T TP V N P +Y V ++
Sbjct: 246 KRVFAHCLDNVKGGG--IFAIGEVVSPKVNT----TPMVPNQP----------HYNVVMK 289
Query: 306 RITVGGQRVRV-WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNY 364
I VGG + + + T DR GTI+DSGTT ++ ++E + + VS+ + +
Sbjct: 290 EIEVGGNVLELPTDIFDTGDRR---GTIIDSGTTLAYLPEVVYESMMTKIVSEQPGLKLH 346
Query: 365 TRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTV 424
T E T CF G FP +K HF G +T+ +Y + E C
Sbjct: 347 TV---EEQFT----CFQYTGNVNEGFPVVKFHFNGSLSLTVNPHDYLFQIHE-EVWCFGW 398
Query: 425 VTD--REASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ G +LG+ + N V YDL NQ +G+ C
Sbjct: 399 QNSGMQSKDGRDMTLLGDLVLSNKLVLYDLENQAIGWTDYNC 440
>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
Length = 480
Score = 95.1 bits (235), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 110/398 (27%), Positives = 160/398 (40%), Gaps = 58/398 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC--KYCSSSKIPSFIPKLSSSSRL 144
G Y + GTP + +DTGS ++W C +C K + + K S++S
Sbjct: 72 GLYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDA 131
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLP 204
+GC + CS D PL K Q Y VLYG G + +
Sbjct: 132 VGCDDNFCSLY-----------DGPLPGCKPGLQCL--YSVLYGDGSSTTGYFVQDFVQY 178
Query: 205 NRIIPNF---------LVGCSVL-------SSRQPAGIAGFGRGKTSLPSQLNLDKFSYC 248
NRI NF + GC SS GI GFG+ +S+ SQL
Sbjct: 179 NRISGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKK 238
Query: 249 LLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
+ SH D+ +D G + G P VN + + A +Y V ++ I
Sbjct: 239 VFSHCLDN--------VDGGGIFA----IGEVVEPKVNITPLVQNQA---HYNVVMKEIE 283
Query: 309 VGGQRVRV-WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRA 367
VGG + V + + DR G TI+DSGTT + E++ PL ++ +SQ R +T
Sbjct: 284 VGGDPLDVPSDAFESGDRKG---TIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTV- 339
Query: 368 LGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTL-PVENYFAVVGEGSAVCLTVVT 426
+A T CFD G FP + LHF +T+ P E F V +
Sbjct: 340 --EQAFT----CFDYTGNVDDGFPTVTLHFDKSISLTVYPHEYLFQVKEFEWCIGWQNSG 393
Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ G +LG+ + N V YDL Q +G+ + C
Sbjct: 394 AQTKDGKDLTLLGDLVLSNKLVVYDLEKQGIGWVEYNC 431
>gi|84453222|dbj|BAE71208.1| hypothetical protein [Trifolium pratense]
gi|84453226|dbj|BAE71210.1| hypothetical protein [Trifolium pratense]
Length = 437
Score = 95.1 bits (235), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 111/433 (25%), Positives = 177/433 (40%), Gaps = 52/433 (12%)
Query: 40 PSQDSYQN-LNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTP 98
P DS+ N + ++ S R ++ + T T+ + + + G Y + + GTP
Sbjct: 50 PKADSWDNRVINMASKDPARMSYLSTLVAQKTATSAPIASGQ--TFNIGNYVVRVKIGTP 107
Query: 99 PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE 158
Q++ +LDT + + P + C CS++ +F P +S+S L C P+C +
Sbjct: 108 GQLLFMVLDTSTDEAFVPSSG---CIGCSAT---TFYPNVSTSFVPLDCSVPQCGQVR-- 159
Query: 159 SIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGC--S 216
+ C P S C S+ Y + ++L L +IP++ G +
Sbjct: 160 GLSC------PATGSGAC-----SFNQSYAGSTFSATLVQDSLRLATDVIPSYSFGSINA 208
Query: 217 VLSSRQPA----GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHS 272
+ S PA G+ S + FSYCL S F + SL L
Sbjct: 209 ISGSSVPAQGLLGLGRGPLSLLSQSGAIYSGVFSYCLPS--FKSYYFSGSLKLGPVGQPK 266
Query: 273 DKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTI 332
+TT L + P + PS+ YYV L I+VG V + + L + GTI
Sbjct: 267 SIRTTPLLHNP--HRPSL---------YYVNLTAISVGRVYVPLPSELLAFNPSTGAGTI 315
Query: 333 VDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPE 392
+DSGT T ++ + DEF Q+ + +LGA CF E P
Sbjct: 316 IDSGTVITRFVEPIYNAVRDEFRKQVTGPFS---SLGA-----FDTCFVKNYETLA--PA 365
Query: 393 LKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDL 452
+ LHF ++ LP+EN GS CL + ++ NFQ QN V +D
Sbjct: 366 ITLHFT-DLDLKLPLENSLIHSSSGSLACLAMAAAPSNVNSVLNVIANFQQQNLRVLFDT 424
Query: 453 RNQRLGFKQQLCK 465
N ++G ++LC
Sbjct: 425 VNNKVGIARELCN 437
>gi|21717173|gb|AAM76366.1|AC074196_24 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433291|gb|AAP54829.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 413
Score = 94.7 bits (234), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 97/387 (25%), Positives = 159/387 (41%), Gaps = 50/387 (12%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + + GTPPQ I+D LVW C+ C+ C +P F+P SS+ + C
Sbjct: 62 YVANFTIGTPPQPASAIVDVAGELVWTQCS---ACRRCFKQDLPVFVPNASSTFKPEPCG 118
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
C ESI R C+ + + TQ+ G T G A ++T + +
Sbjct: 119 TAVC-----ESIPTRSCSGDVCSYKGPPTQL---------RGNTSGFAATDTFAIGTATV 164
Query: 209 PNFLVGCSVLSSRQ----PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLI 264
GC V S P+G G GR SL +Q+ L +FSYCL +T ++S L
Sbjct: 165 -RLAFGCVVASDIDTMDGPSGFIGLGRTPWSLVAQMKLTRFSYCLSPR---NTGKSSRLF 220
Query: 265 LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLD 324
L + + + ++T + PF+ + + YY + L I G +
Sbjct: 221 LGSSAKLAGGEST--STAPFIKTSPDDDSHH---YYLLSLDAIRAGNTTIATAQ------ 269
Query: 325 RDGNGGTIV-DSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF-DV 382
+GG +V + + F+ + + + V++ V L CF
Sbjct: 270 ---SGGILVMHTVSPFSLLVDSAYRAF-KKAVTEAVGGAAAPPMATPPQPFDL--CFKKA 323
Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVG-EGSAVCLTVVT----DREASGGPSII 437
G + P+L F+G A +T+P Y VG E C +++ +R G S +
Sbjct: 324 AGFSRATAPDLVFTFQGAAALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVS-V 382
Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
LG+ Q ++ + YDL+ + L F+ C
Sbjct: 383 LGSLQQEDVHFLYDLKKETLSFEPADC 409
>gi|449433371|ref|XP_004134471.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449495479|ref|XP_004159853.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 424
Score = 94.7 bits (234), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 109/393 (27%), Positives = 171/393 (43%), Gaps = 59/393 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTN-HYQCKYCSSSKIPSFIPKLSSSSRLL 145
G Y +S + G P + LDT + L+W C+N + QC+ F+ S + +
Sbjct: 73 GEYLMSFNIGNPSSQVMGFLDTSNGLIWVQCSNCNSQCEPEKRGLTTKFLSSKSFTYEME 132
Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLP 204
C + C+ + + CN ++ K C Y ++YG T GI S++
Sbjct: 133 PCGSNFCNSL----TGFQTCN----SSDKWC-----KYRLVYGDNKATSGILSSDSFGFD 179
Query: 205 NR----IIPNFL-VGCSVL----SSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFD 255
+ FL GCS + G G + SL SQL + KFSYCL+ F+
Sbjct: 180 TSDGMLVDVGFLNFGCSEAPLTGDEQSYTGNVGLNQTPLSLISQLGIKKFSYCLV--PFN 237
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
+ TS + GS T TP + S A YYV + I++G
Sbjct: 238 NLGSTSKMYF--GS----LPVTSGGQTPLLYPNSDA--------YYVKVLGISIGNDEPH 283
Query: 316 ---VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
V+ Y D G I+D+G T++ + + F+ L +F++ +K+ + E
Sbjct: 284 FDGVFDVYEVRD-----GWIIDTGITYSSLETDAFDSLLAKFLT--LKDFPQRKDDPKER 336
Query: 373 LTGLRPCFDVP-GEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREAS 431
CF++ SFP++ +HF G A++ L VE+ F + + CL ++ S
Sbjct: 337 F---ELCFELQNANDLESFPDVTVHFDG-ADLILNVESTFVKIEDDGIFCLALL----RS 388
Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
G P ILGNFQ+QNY+V YDL Q + F C
Sbjct: 389 GSPVSILGNFQLQNYHVGYDLEAQVISFAPVDC 421
>gi|125532793|gb|EAY79358.1| hypothetical protein OsI_34487 [Oryza sativa Indica Group]
Length = 419
Score = 94.7 bits (234), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 108/396 (27%), Positives = 158/396 (39%), Gaps = 62/396 (15%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSSSR 143
Y + + GTPPQ + I+D LVW QC C SS ++P F P S++ R
Sbjct: 62 YVANFTIGTPPQAVSGIVDLSGELVW------TQCAACRSSGCFKQELPVFDPSASNTYR 115
Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL 203
C +P C +SI R+C+ + C PS G T GIA ++ + +
Sbjct: 116 AEQCGSPLC-----KSIPTRNCSGD-----GECGYEAPSMF-----GDTFGIASTDAIAI 160
Query: 204 PNRIIPNFLVGCSVLSSRQ-------PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
N GC V S P+G G GR SL Q N+ FSYCL H
Sbjct: 161 GNAE-GRLAFGCVVASDGSIDGAMDGPSGFVGLGRTPWSLVGQSNVTAFSYCLAPHG--- 216
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
+ S+L L + + + + S + YY V L I G V
Sbjct: 217 PGKKSALFLGASAKLAGAGKSNPPTPLLGQHASNTSDDGSDPYYTVQLEGIKAGDVAVAA 276
Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLA--DEFVSQMVKNRNYTRALGAEALT 374
GG I T + E F PL+ + Q ++ + T ALG+ ++
Sbjct: 277 ASS--------GGGAI-------TILQLETFRPLSYLPDAAYQALE-KVVTAALGSPSMA 320
Query: 375 GLRPCFDV--PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSA-VCLTVVTD---R 428
FD+ P+L F+GGA +T P Y G G+ VCL++++
Sbjct: 321 NPPEPFDLCFQNAAVSGVPDLVFTFQGGATLTAPPSKYLLGDGNGNGTVCLSILSSTRLD 380
Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
A G S ILG+ +N + +DL + L F+ C
Sbjct: 381 SADDGVS-ILGSLLQENVHFLFDLEKETLSFEPADC 415
>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 499
Score = 94.7 bits (234), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 103/402 (25%), Positives = 165/402 (41%), Gaps = 64/402 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
G Y + G+P + +DTGS ++W C C + S I F SS++ L
Sbjct: 81 GLYFTKVKLGSPAKDFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAAL 140
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNL 203
+ C +P CS+ + C+ + + C SY YG G T G +S+T+
Sbjct: 141 VSCADPICSYAVQTATS--GCSSQ----ANQC-----SYTFQYGDGSGTTGYYVSDTMYF 189
Query: 204 PNRIIPNFLV---------GCSVLSS-------RQPAGIAGFGRGKTSLPSQLNL----- 242
++ +V GCS S + GI GFG G S+ SQL+
Sbjct: 190 DTVLLGQSMVANSSSTIVFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTP 249
Query: 243 DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYV 302
FS+CL + IL+ + Y+P V PS+ +Y +
Sbjct: 250 KVFSHCLKGGENGGGVLVLGEILE----------PSIVYSPLV--PSLP-------HYNL 290
Query: 303 GLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNR 362
L+ I V GQ + + N GTIVDSGTT ++ E + P D + + +
Sbjct: 291 NLQSIAVNGQLLPIDSNVFATTN--NQGTIVDSGTTLAYLVQEAYNPFVDAITAAVSQFS 348
Query: 363 NYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCL 422
+ G + C+ V FP++ L+F GGA + L E+Y G + +
Sbjct: 349 KPIISKGNQ-------CYLVSNSVGDIFPQVSLNFMGGASMVLNPEHYLMHYGFLDSAAM 401
Query: 423 TVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ ++ G + ILG+ +++ YDL NQR+G+ C
Sbjct: 402 WCIGFQKVERGFT-ILGDLVLKDKIFVYDLANQRIGWADYNC 442
>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
Length = 445
Score = 94.7 bits (234), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 138/490 (28%), Positives = 207/490 (42%), Gaps = 75/490 (15%)
Query: 1 MASY-ISALCLSFIFFFTLLSI--FPSSITSLTFSLSRFHT--------NPSQDSYQNLN 49
MA++ I+ L L F+ F L+S +S+ + +F+ S H NP + L
Sbjct: 1 MAAFSITHLSL-FVIFVALISKTSLTASMNNGSFTASLIHRDSPISPLYNPKNTYFDRLQ 59
Query: 50 SLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTG 109
S S++RA N T + + T +I G Y + +S GTPP + I DTG
Sbjct: 60 SSFHRSISRA----NRFTPNSVSAAKTLEYDIIPGG-GEYFMRISIGTPPIEVLVIADTG 114
Query: 110 SHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEP 169
S L+W C C+ C K P F PK SS+ R + C+ C+ ++ + C
Sbjct: 115 SDLIWVQCQ---PCQECYKQKSPIFNPKQSSTYRRVLCETRYCNALNSDMRAC------- 164
Query: 170 LATSKNCTQICPSYLVLYGS-GLTEGIALSETLNL--PNRIIPNFLVGCSVLS----SRQ 222
++ + C Y YG T G +E + N I GC +
Sbjct: 165 --SAHGFFKAC-GYSYSYGDHSFTMGYLATERFIIGSTNNSIQELAFGCGNSNGGNFDEV 221
Query: 223 PAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGL 279
+GI G G G SL SQL +KFSYCL+ ++ + S S T
Sbjct: 222 GSGIVGLGGGSLSLISQLGTKIDNKFSYCLVPILEKSNFSLGKIVFGDNSFISGSDT--- 278
Query: 280 TYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGN---GGTIVDSG 336
+V+ P V++ +YY+ L I+VG +R+ Y DGN G I+DSG
Sbjct: 279 ----YVSTPLVSKEP--ETFYYLTLEAISVGNERL----AYENSRNDGNVEKGNIIIDSG 328
Query: 337 TTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP-GEKTG-SFPELK 394
TT TF+ +L+ L E V + +A+ E ++ F + +K G P +
Sbjct: 329 TTLTFLDSKLYNKL--ELVLE--------KAVEGERVSDPNGIFSICFRDKIGIELPIIT 378
Query: 395 LHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRN 454
+HF A+V L N FA E +C T++ S G + I GN N+ V YDL
Sbjct: 379 VHFT-DADVELKPINTFA-KAEEDLLCFTMI----PSNGIA-IFGNLAQMNFLVGYDLDK 431
Query: 455 QRLGFKQQLC 464
+ F C
Sbjct: 432 NCVSFMPTDC 441
>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
Length = 479
Score = 94.7 bits (234), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 103/386 (26%), Positives = 152/386 (39%), Gaps = 51/386 (13%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y IS+ G+P ++DTGS + W C C + F P SS+ C
Sbjct: 135 YVISVGLGSPAMTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFNCS 194
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNLP-NR 206
C+ + +S + C+ +K+ Q Y+V YG G T G S+ L L +
Sbjct: 195 AAACAQL-GDSGEANGCD------AKSRCQ----YIVKYGDGSNTTGTYSSDVLTLSGSD 243
Query: 207 IIPNFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTT 258
++ F GCS + G+ G G SL SQ FSYCL +
Sbjct: 244 VVRGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSLVSQTAARYGKSFSYCLPATP----A 299
Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
+ L L +S + TP + + V YY+ L I VGG+++ +
Sbjct: 300 SSGFLTLGAPASGGGGGASRFATTPMLRSKKV------PTYYFAALEDIAVGGKKLGLSP 353
Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
G++VDSGT T + P + L+ F + M + Y R AE L L
Sbjct: 354 SVFAA------GSLVDSGTVITRLPPAAYAALSSAFRAGMTR---YAR---AEPLGILDT 401
Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
CF+ G S P + L F GGA V L + G + D +A G +
Sbjct: 402 CFNFTGLDKVSIPTVALVFAGGAVVDLDAHGIVS----GGCLAFAPTRDDKAFG----TI 453
Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
GN Q + + V YD+ GF+ C
Sbjct: 454 GNVQQRTFEVLYDVGGGVFGFRAGAC 479
>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
Length = 633
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 104/419 (24%), Positives = 176/419 (42%), Gaps = 65/419 (15%)
Query: 61 HIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNH 120
H++ ++ +T T ++ YG Y+ + GTPPQ I+DTGS L + PC+
Sbjct: 66 HLQRSESHSTATARMPLYDDLIP--YGYYTTRIWIGTPPQTFALIVDTGSTLTYVPCST- 122
Query: 121 YQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQIC 180
C+ C + P+F P SS+ + L C +C+ E + C D A + + +
Sbjct: 123 --CEQCGKHQDPNFQPDWSSTYQPLKCSM-ECT-CDSEMMHC--VYDRQYAEMSSSSGVL 176
Query: 181 PSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLS-----SRQPAGIAGFGRGKTS 235
+V +G ++ P R + GC + S++ GI G GRG S
Sbjct: 177 GEDIVSFG---------KQSELKPQRTV----FGCENVETGDIYSQRADGIMGLGRGDLS 223
Query: 236 LPSQLNL-----DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSV 290
+ QL + FS C + ++ G S G+ +T ++P
Sbjct: 224 IVDQLVEKGVIGNSFSLC-----YGGMDVGGGAMVLGGIS----PPAGMVFTH--SDP-- 270
Query: 291 AERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPL 350
A S YY + L+ I + G+++ + + DG GTI+DSGTT+ ++ EP
Sbjct: 271 ----ARSAYYNIDLKEIHIAGKQLPI----NPMVFDGKYGTILDSGTTYAYLP----EPA 318
Query: 351 ADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPG----EKTGSFPELKLHFKGGAEVTLP 406
F ++K N + + CF G + + +FP + L F G ++L
Sbjct: 319 FKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVGSDVSQLSKTFPAVDLVFSNGNRLSLS 378
Query: 407 VENY-FAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
ENY F A CL + + + +LG ++N V YD + ++GF + C
Sbjct: 379 PENYLFQHSKAHGAYCLGIFQNENDQ---TTLLGGIIVRNTLVMYDREHLKIGFWKTNC 434
>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 634
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 107/398 (26%), Positives = 166/398 (41%), Gaps = 73/398 (18%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y+ L GTPPQ+ I+DTGS + + PC+ C+ C + P F P+ SS+ + +
Sbjct: 82 GYYTTRLWIGTPPQMFALIVDTGSTVTYVPCST---CEQCGRHQDPKFQPESSSTYQPVK 138
Query: 147 CQNPKCSWIHHESIQCRDCNDEPL--ATSKNCTQICPSYLVLYGSGLTEGIALSETLNLP 204
C +I C +C+ + + + ++ S VL G L SE P
Sbjct: 139 C-----------TIDC-NCDSDRMQCVYERQYAEMSTSSGVL-GEDLISFGNQSEL--AP 183
Query: 205 NRIIPNFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQL---NLDKFSYCLLSHKFDD 256
R + GC L S+ GI G GRG S+ QL N+ S+ L D
Sbjct: 184 QRAV----FGCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVISDSFSLCYGGMD- 238
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
+++L S SD Y+ V +P YY + L+ I V G+R+ +
Sbjct: 239 -VGGGAMVLGGISPPSD---MAFAYSDPVRSP----------YYNIDLKEIHVAGKRLPL 284
Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
DG GT++DSGTT+ ++ F D V ++ + ++G
Sbjct: 285 NANVF----DGKHGTVLDSGTTYAYLPEAAFLAFKDAIVKELQS---------LKKISGP 331
Query: 377 RP-----CFDVPG----EKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVVT 426
P CF G + + SFP + + F+ G + TL ENY F A CL V
Sbjct: 332 DPNYNDICFSGAGIDVSQLSKSFPVVDMVFENGQKYTLSPENYMFRHSKVRGAYCLGVFQ 391
Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ + +LG ++N V YD ++GF + C
Sbjct: 392 N---GNDQTTLLGGIIVRNTLVVYDREQTKIGFWKTNC 426
>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 634
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 104/419 (24%), Positives = 176/419 (42%), Gaps = 65/419 (15%)
Query: 61 HIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNH 120
H++ ++ +T T ++ YG Y+ + GTPPQ I+DTGS L + PC+
Sbjct: 66 HLQRSESHSTATARMPLYDDLIP--YGYYTTRIWIGTPPQTFALIVDTGSTLTYVPCST- 122
Query: 121 YQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQIC 180
C+ C + P+F P SS+ + L C +C+ E + C D A + + +
Sbjct: 123 --CEQCGKHQDPNFQPDWSSTYQPLKCSM-ECT-CDSEMMHC--VYDRQYAEMSSSSGVL 176
Query: 181 PSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLS-----SRQPAGIAGFGRGKTS 235
+V +G ++ P R + GC + S++ GI G GRG S
Sbjct: 177 GEDIVSFG---------KQSELKPQRTV----FGCENVETGDIYSQRADGIMGLGRGDLS 223
Query: 236 LPSQLNL-----DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSV 290
+ QL + FS C + ++ G S G+ +T ++P
Sbjct: 224 IVDQLVEKGVIGNSFSLC-----YGGMDVGGGAMVLGGIS----PPAGMVFTH--SDP-- 270
Query: 291 AERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPL 350
A S YY + L+ I + G+++ + + DG GTI+DSGTT+ ++ EP
Sbjct: 271 ----ARSAYYNIDLKEIHIAGKQLPI----NPMVFDGKYGTILDSGTTYAYLP----EPA 318
Query: 351 ADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPG----EKTGSFPELKLHFKGGAEVTLP 406
F ++K N + + CF G + + +FP + L F G ++L
Sbjct: 319 FKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVGSDVSQLSKTFPAVDLVFSNGNRLSLS 378
Query: 407 VENY-FAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
ENY F A CL + + + +LG ++N V YD + ++GF + C
Sbjct: 379 PENYLFQHSKAHGAYCLGIFQNENDQ---TTLLGGIIVRNTLVMYDREHLKIGFWKTNC 434
>gi|195626958|gb|ACG35309.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 106/387 (27%), Positives = 158/387 (40%), Gaps = 62/387 (16%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + S GTPPQ + +DT + W PC C C +S F P S+S R + C
Sbjct: 112 YVVRASLGTPPQQLLLAVDTSNDASWIPCAG---CAGCPTSSAAPFDPAASASYRTVPCG 168
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
+P C+ P A + C L S L ++ ++L + +
Sbjct: 169 SPLCA-------------QAPNAACPPGGKACGFSLTYADSSLQAALS-QDSLAVAGNAV 214
Query: 209 PNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHK---FDDTTR 259
+ GC + ++ P G+ G GRG S SQ + FSYCL S K F T R
Sbjct: 215 KAYTFGCLQRATGTAAPPQGLLGLGRGPLSFLSQTKDMYEATFSYCLPSFKSLNFSGTLR 274
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
+ NG K T P + NP S YYV + + VG + V +
Sbjct: 275 ----LGRNGQPQRIKTT------PLLANPH------RSSLYYVNMTGVRVGRKVVPI--- 315
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA--EALTGLR 377
D GT++DSGT FT + + + DE R +GA +L G
Sbjct: 316 -PAFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEV----------RRRVGAPVSSLGGFD 364
Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
CF+ ++P + L F G +VTLP EN G+ CL + + +
Sbjct: 365 TCFNT---TAVAWPPMTLLFDG-MQVTLPEENVVIHSTYGTISCLAMAAAPDGVNTVLNV 420
Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ + Q QN+ V +D+ N R+GF ++ C
Sbjct: 421 IASMQQQNHRVLFDVPNGRVGFARERC 447
>gi|383161173|gb|AFG63169.1| Pinus taeda anonymous locus 0_11073_01 genomic sequence
Length = 133
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 64/145 (44%), Positives = 85/145 (58%), Gaps = 20/145 (13%)
Query: 176 CTQICPSYLVLYGSGLTEGIALSETLNLP-----NRIIPNFLVGCSVLSSRQPAGIAGFG 230
C++ICP + + YG+G G LS+TL LP R I NF GCSVLSS Q AGIAGFG
Sbjct: 1 CSKICPHFSLTYGTGNATGRLLSDTLTLPLEDGGRREIKNFAFGCSVLSS-QVAGIAGFG 59
Query: 231 RGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNN 287
G S+PSQL DKF+YCL D + +S ++L N + D LTYTP + N
Sbjct: 60 NGGLSMPSQLAPLIGDKFAYCL-----DYRSNSSKIVLGNKAVPRDLP---LTYTPLLFN 111
Query: 288 PSVAERNAFSVYYYVGLRRITVGGQ 312
P + FS Y+Y+ L +++GG+
Sbjct: 112 P--VNPSVFS-YFYLALEAVSIGGK 133
>gi|285741|dbj|BAA03413.1| EDGP precursor [Daucus carota]
Length = 433
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 97/392 (24%), Positives = 171/392 (43%), Gaps = 61/392 (15%)
Query: 104 FILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCR 163
++D G +W C +Y +SS+ R + C+ +CS SI C
Sbjct: 57 LVVDLGGRFLWVDCDQNY----------------VSSTYRPVRCRTSQCSL--SGSIACG 98
Query: 164 DCNDEPLATSKNCT-QICPSYLVL---YGSGLTEGIALSETLN--LPNRII--PNFLVGC 215
DC + P N T + P V+ G + E + E+ + R++ P F+ C
Sbjct: 99 DCFNGPRPGCNNNTCGVFPENPVINTATGGEVAEDVVSVESTDGSSSGRVVTVPRFIFSC 158
Query: 216 SVLSSRQ-----PAGIAGFGRGKTSLPSQLN-----LDKFSYCLLSHKFDDTTRTSSLIL 265
+ S Q G+AG GR + +LPSQ KF+ CL +T ++S+I+
Sbjct: 159 APTSLLQNLASGVVGMAGLGRTRIALPSQFASAFSFKRKFAMCL-----SGSTSSNSVII 213
Query: 266 DNGSSH--------SDKKTTGLTYTPFVNNP----SVAERNAFSVYYYVGLRRITVGGQR 313
+ SDK LTYTP + NP + + + SV Y++G++ I + +
Sbjct: 214 FGNDPYTFLPNIIVSDKT---LTYTPLLTNPVSTSATSTQGEPSVEYFIGVKSIKINSKI 270
Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
V + L++ G GGT + + +T + +++ + + F+ + RN TR
Sbjct: 271 VALNTSLLSISSAGLGGTKISTINPYTVLETSIYKAVTEAFIKESAA-RNITRVASVAPF 329
Query: 374 TGLRPCFDVPGEKTG-SFPELKLHFKGGAEV-TLPVENYFAVVGEGSAVCLTVVTDREAS 431
++ + G S P + L + + V T+ N + + + VCL VV D ++
Sbjct: 330 GACFSTDNILSTRLGPSVPSIDLVLQSESVVWTITGSNSMVYIND-NVVCLGVV-DGGSN 387
Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
SI++G Q+++ V++DL R+GF L
Sbjct: 388 LRTSIVIGGHQLEDNLVQFDLATSRVGFSGTL 419
>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 478
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 110/384 (28%), Positives = 150/384 (39%), Gaps = 53/384 (13%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y ++ S GTP +DTGS L W C C S K P F P SSS + C
Sbjct: 140 YVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFDPAQSSSYAAVPCG 199
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNL-PNR 206
P C+ + + + C+ Y+V YG G T G+ S+TL L +
Sbjct: 200 GPVCAGLG-------------IYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASS 246
Query: 207 IIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLN---LDKFSYCLLSHKFDDTTRT 260
+ F GC S G+ G GR + SL Q FSYCL T +
Sbjct: 247 AVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCL------PTKPS 300
Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
++ L G G + T + +P+ YY V L I+VGGQ++ V
Sbjct: 301 TAGYLTLGLGGPSGAAPGFSTTQLLPSPNA------PTYYVVMLTGISVGGQQLSVPASA 354
Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
GGT+VD+GT T + P + L F S M T A + L C+
Sbjct: 355 FA------GGTVVDTGTVITRLPPTAYAALRSAFRSGMASYGYPT----APSNGILDTCY 404
Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGN 440
+ G T + P + L F GA V L + S CL S G ILGN
Sbjct: 405 NFAGYGTVTLPNVALTFGSGATVMLGADGIL------SFGCLAFAP--SGSDGGMAILGN 456
Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
Q +++ V D +GFK C
Sbjct: 457 VQQRSFEVRID--GTSVGFKPSSC 478
>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 444
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 121/442 (27%), Positives = 176/442 (39%), Gaps = 66/442 (14%)
Query: 40 PSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPP 99
P++ +Q + + + S+ RA H P +T T +T S G Y +S S GTPP
Sbjct: 49 PTETQFQRVANALRRSINRANHFNKPNLVASTNTAESTVI----ASQGEYLMSYSVGTPP 104
Query: 100 QIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHES 159
I I+DTGS ++W C C+ C + P F P S + + L C + C + +
Sbjct: 105 FQILGIVDTGSDIIWLQCQ---PCEDCYNQTTPIFDPSQSKTYKTLPCSSNICQSVQSAA 161
Query: 160 IQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNL-----PNRIIPNFLV 213
C NDE C Y + YG + ++G ETL L + P ++
Sbjct: 162 -SCSSNNDE-------C-----EYTITYGDNSHSQGDLSVETLTLGSTDGSSVQFPKTVI 208
Query: 214 GCSVLSSRQPAGIAGFGRGKTSLPSQLNLD-------KFSYCLLSHKFDDTTRTSSLILD 266
GC + G P L KFSYC L+ F + +S L
Sbjct: 209 GCGHNNKGTFQREGSGIVGLGGGPVSLISQLSSSIGGKFSYC-LAPLFSQSNSSSKLNFG 267
Query: 267 NGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRD 326
+ + S + G TP V +N +Y++ L +VG R+
Sbjct: 268 DEAVVSGR---GTVSTPIV------PKNGLG-FYFLTLEAFSVGDNRIEFGSSSFES-SG 316
Query: 327 GNGGTIVDSGTTFTFMAPE----LFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
G G I+DSGTT T + + L +AD + V++ + LR C+
Sbjct: 317 GEGNIIIDSGTTLTILPEDDYLNLESAVADAIELERVEDPS----------KFLRLCYRT 366
Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
+ P + HFK GA+V L + F V EG VC R + GP I GN
Sbjct: 367 TSSDELNVPVITAHFK-GADVELNPISTFIEVDEG-VVCFAF---RSSKIGP--IFGNLA 419
Query: 443 MQNYYVEYDLRNQRLGFKQQLC 464
QN V YDL Q + FK C
Sbjct: 420 QQNLLVGYDLVKQTVSFKPTDC 441
>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 443
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 110/404 (27%), Positives = 168/404 (41%), Gaps = 57/404 (14%)
Query: 76 TTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFI 135
T T +S H Y Y + LS GTPP +DTGS L+W C C C P F
Sbjct: 47 TAQTPVSVHHYD-YLMELSIGTPPVKTYAQVDTGSDLIWLQCI---PCTNCYKQLNPMFD 102
Query: 136 PKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLY-GSGLTEG 194
P+ SS+ + + CS ++ S C+ + NC +Y Y +TEG
Sbjct: 103 PQSSSTYSNIAYGSESCSKLYSTS-----CSPD----QNNC-----NYTYSYEDDSITEG 148
Query: 195 IALSETLNLPNR-----IIPNFLVGC-----SVLSSRQPAGIAGFGRGKTSLPSQLNL-- 242
+ ETL L + + + GC V + ++ GI G GRG SL SQ+
Sbjct: 149 VLAQETLTLTSTTGKPVALKGVIFGCGHNNNGVFNDKE-MGIIGLGRGPLSLVSQIGSSF 207
Query: 243 --DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYY 300
FS CL+ + + TS + G S+ G+ TP V+ +N +Y
Sbjct: 208 GGKMFSQCLVPFHTNPSI-TSPMSFGKG---SEVLGNGVVSTPLVS------KNTHQAFY 257
Query: 301 YVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVK 360
+V L I+V + ++ +L+ G ++DSGT T + + + L +E V+
Sbjct: 258 FVTLLGISVEDINLP-FNDGSSLEPITKGNMVIDSGTPTTLLPEDFYHRLVEE-----VR 311
Query: 361 NRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAV 420
N+ + + G + C+ P G+ L HF+ GA+V L F V +G
Sbjct: 312 NKVALDPIPIDPTLGYQLCYRTPTNLKGT--TLTAHFE-GADVLLTPTQIFIPVQDG-IF 367
Query: 421 CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
C + G I GN NY + +DL Q + FK C
Sbjct: 368 CFAFTSTFSNEYG---IYGNHAQSNYLIGFDLEKQLVSFKATDC 408
>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
Length = 988
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 110/409 (26%), Positives = 171/409 (41%), Gaps = 71/409 (17%)
Query: 77 TTTNISSHSY--------GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSS 128
T+ N+ +H++ G + + ++FGTPPQ ILDTGS + W C C +C
Sbjct: 107 TSGNLKNHAHNNNLFDEDGNFLVDVAFGTPPQKFKLILDTGSSITWTQCK---ACVHCLK 163
Query: 129 SKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG 188
F L+SS+ G P + + +Y + YG
Sbjct: 164 DSHRHF-DSLASSTYSFGSCIP--------------------------STVGNTYNMTYG 196
Query: 189 SGLTE-GIALSETLNL-PNRIIPNFLVGCSVLSSRQ----PAGIAGFGRGKTSLPSQL-- 240
T G +T+ L P+ + F GC + G+ G G+G+ S SQ
Sbjct: 197 DKSTSVGNYGCDTMTLEPSDVFQKFQFGCGRNNEGDFGSGADGMLGLGQGQLSTVSQTAS 256
Query: 241 NLDK-FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVY 299
K FSYCL ++ + S L + +S S + L +T VN P + S Y
Sbjct: 257 KFKKVFSYCLP----EENSIGSLLFGEKATSQS----SSLKFTSLVNGPGTSGLEE-SGY 307
Query: 300 YYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMV 359
Y+V L I+VG +R+ + + GTI+DSGT T + + + +
Sbjct: 308 YFVKLLDISVGNKRLNIPSSVF-----ASPGTIIDSGTVITRLPQRAY---SALKAAFKK 359
Query: 360 KNRNYTRALGAEALTG-LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVV--GE 416
Y + G L C+++ G K PE LHF GA+V L N VV +
Sbjct: 360 AMAKYPLSNGRRKENDMLDTCYNLSGRKDVLLPEXVLHFGDGADVRL---NGKRVVWGND 416
Query: 417 GSAVCLTVVTDREASGGPSI-ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
S +CL + +++ P + I+GN Q + V YD+R +R+GF C
Sbjct: 417 ASRLCLAFAGNSKSTMNPELTIIGNRQQVSLTVLYDIRGRRIGFGGNGC 465
>gi|297846526|ref|XP_002891144.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297336986|gb|EFH67403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 445
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 109/407 (26%), Positives = 159/407 (39%), Gaps = 77/407 (18%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y +S+S GTPP I DTGS L W C C+ C P F K SS+ +
Sbjct: 83 GEYFMSISIGTPPSKFLAIADTGSDLTWVQCK---PCQQCYKQNTPLFDKKKSSTYKTES 139
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPN 205
C + C+ + C + S+N + Y YG T+G +ET+++ +
Sbjct: 140 CDSITCNALSEHEEGCDE--------SRNACK----YRYSYGDESFTKGEVATETISIDS 187
Query: 206 R-----IIPNFLVGCSVLSSRQPAGIAGFGRGKT----------------SLPSQLNL-- 242
P GC G+ G T SL SQL
Sbjct: 188 SSGSPVSFPGTAFGC------------GYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSI 235
Query: 243 -DKFSYCLLSHKFDDTTRTSSLILDNGSSHSD-KKTTGLTYTPFVNNPSVAERNAFSVYY 300
KFSYC LSH T TS + L S S K + + TP + YY
Sbjct: 236 GKKFSYC-LSHTSATTNGTSVINLGTNSMTSKPSKDSAILTTPLIQKDP-------ETYY 287
Query: 301 YVGLRRITVGGQRV-RVWHKYLTLDRDGN--GGTIVDSGTTFTFMAPELFEPLADEFVSQ 357
++ L ITVG ++ +L+R G I+DSGTT T + ++
Sbjct: 288 FLTLEAITVGKTKLPYTGGGGYSLNRKSKKTGNIIIDSGTTLTLLDSGFYDDFGAVVEES 347
Query: 358 MVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEG 417
+ + + G L CF G+K P + +HF GA+V L N F + E
Sbjct: 348 VTGAKRVSDPQGI-----LTHCFK-SGDKEIGLPTITMHFT-GADVKLSPINSFVKLSE- 399
Query: 418 SAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
VCL+++ E + I GN ++ V YDL + + F++ C
Sbjct: 400 DIVCLSMIPTTEVA-----IYGNMVQMDFLVGYDLETKTVSFQRMDC 441
>gi|297806153|ref|XP_002870960.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
gi|297316797|gb|EFH47219.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
Length = 453
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 98/387 (25%), Positives = 160/387 (41%), Gaps = 44/387 (11%)
Query: 98 PPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHH 157
PPQ I ++DTGS L W C + + + + +F P SSS + C +P C
Sbjct: 82 PPQNISMVIDTGSELSWLRCN-----RSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTR 136
Query: 158 ESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRI-IPNFLVGC- 215
+ + C+ + ++C + L + +EG +E + N N + GC
Sbjct: 137 DFLIPASCDSD---------KLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNLIFGCM 187
Query: 216 SVLSSRQP------AGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGS 269
+S P G+ G RG S SQ+ KFSYC+ T +L S
Sbjct: 188 GSVSGSDPEEDTKTTGLLGMNRGSLSFISQMGFPKFSYCI-----SGTDDFPGFLLLGDS 242
Query: 270 SHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNG 329
+ + T L YTP + S V Y V L I V G+ + + L D G G
Sbjct: 243 NFT--WLTPLNYTPLIRI-STPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLLPDHTGAG 299
Query: 330 GTIVDSGTTFTFMAPELFEPLADEFVSQ------MVKNRNYTRALGAEALTGLRPCFDVP 383
T+VDSGT FTF+ ++ L +F++Q + ++ + + + P F +
Sbjct: 300 QTMVDSGTQFTFLLGPVYTALRSDFLNQTNGILTVYEDPEFVFQGTMDLCYRISP-FRIR 358
Query: 384 GEKTGSFPELKLHFKGGAEVTLPVENYFAVV-----GEGSAVCLTVVTDREASGGPSIIL 438
P + L F+G AE+ + + V G S C T + + G + ++
Sbjct: 359 TGILHRLPTVSLVFEG-AEIAVSGQPLLYRVPHLTAGNDSVYCFTF-GNSDLMGMEAYVI 416
Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLCK 465
G+ QN ++E+DL+ R+G C
Sbjct: 417 GHHHQQNMWIEFDLQRSRIGLAPVQCD 443
>gi|384482418|pdb|3VLB|A Chain A, Crystal Structure Of Xeg-Edgp
gi|384482420|pdb|3VLB|C Chain C, Crystal Structure Of Xeg-Edgp
Length = 413
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 97/392 (24%), Positives = 171/392 (43%), Gaps = 61/392 (15%)
Query: 104 FILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCR 163
++D G +W C +Y +SS+ R + C+ +CS SI C
Sbjct: 37 LVVDLGGRFLWVDCDQNY----------------VSSTYRPVRCRTSQCSL--SGSIACG 78
Query: 164 DCNDEPLATSKNCT-QICPSYLVL---YGSGLTEGIALSETLN--LPNRII--PNFLVGC 215
DC + P N T + P V+ G + E + E+ + R++ P F+ C
Sbjct: 79 DCFNGPRPGCNNNTCGVFPENPVINTATGGEVAEDVVSVESTDGSSSGRVVTVPRFIFSC 138
Query: 216 SVLSSRQ-----PAGIAGFGRGKTSLPSQLN-----LDKFSYCLLSHKFDDTTRTSSLIL 265
+ S Q G+AG GR + +LPSQ KF+ CL +T ++S+I+
Sbjct: 139 APTSLLQNLASGVVGMAGLGRTRIALPSQFASAFSFKRKFAMCL-----SGSTSSNSVII 193
Query: 266 DNGSSH--------SDKKTTGLTYTPFVNNP----SVAERNAFSVYYYVGLRRITVGGQR 313
+ SDK LTYTP + NP + + + SV Y++G++ I + +
Sbjct: 194 FGNDPYTFLPNIIVSDKT---LTYTPLLTNPVSTSATSTQGEPSVEYFIGVKSIKINSKI 250
Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
V + L++ G GGT + + +T + +++ + + F+ + RN TR
Sbjct: 251 VALNTSLLSISSAGLGGTKISTINPYTVLETSIYKAVTEAFIKESAA-RNITRVASVAPF 309
Query: 374 TGLRPCFDVPGEKTG-SFPELKLHFKGGAEV-TLPVENYFAVVGEGSAVCLTVVTDREAS 431
++ + G S P + L + + V T+ N + + + VCL VV D ++
Sbjct: 310 GACFSTDNILSTRLGPSVPSIDLVLQSESVVWTITGSNSMVYIND-NVVCLGVV-DGGSN 367
Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
SI++G Q+++ V++DL R+GF L
Sbjct: 368 LRTSIVIGGHQLEDNLVQFDLATSRVGFSGTL 399
>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 481
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 117/425 (27%), Positives = 177/425 (41%), Gaps = 57/425 (13%)
Query: 52 VSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSH 111
+ S L++ L +N + +TT + ++ + Y + + GTP + + + DTGS
Sbjct: 101 IQSRLSKNLGRENSVKELDSTTLPAKSGSLIGSA--NYFVVVGLGTPKRDLSLVFDTGSD 158
Query: 112 LVWFPCTNHYQCKYCSSS----KIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCND 167
L W QC+ C+ S + F P SSS + C + C+ + I+ R C+
Sbjct: 159 LTW------TQCEPCAGSCYKQQDAIFDPSKSSSYINITCTSSLCTQLTSAGIKSR-CSS 211
Query: 168 EPLATSKNCTQICPSYLVLYGSGLTE-GIALSETLNL-PNRIIPNFLVGCSVLSS---RQ 222
A C Y + YG T G E L + I+ +FL GC +
Sbjct: 212 STTA----CI-----YGIQYGDKSTSVGFLSQERLTITATDIVDDFLFGCGQDNEGLFSG 262
Query: 223 PAGIAGFGRGKTSLPSQLN--LDK-FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGL 279
AG+ G GR S Q + +K FSYCL S T +S L G+S + L
Sbjct: 263 SAGLIGLGRHPISFVQQTSSIYNKIFSYCLPS------TSSSLGHLTFGASAA--TNANL 314
Query: 280 TYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTF 339
YTP +++ N F Y + + I+VGG ++ ++ GG+I+DSGT
Sbjct: 315 KYTPL---STISGDNTF---YGLDIVGISVGGTKLPA----VSSSTFSAGGSIIDSGTVI 364
Query: 340 TFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKG 399
T +AP + L F M K Y A + L C+D G K S P++ F G
Sbjct: 365 TRLAPTAYAALRSAFRQGMEK---YPVA-NEDGL--FDTCYDFSGYKEISVPKIDFEFAG 418
Query: 400 GAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGF 459
G V LP+ + VCL + + I GN Q + V YD+ R+GF
Sbjct: 419 GVTVELPLVGIL-IGRSAQQVCLAFAAN--GNDNDITIFGNVQQKTLEVVYDVEGGRIGF 475
Query: 460 KQQLC 464
C
Sbjct: 476 GAAGC 480
>gi|384482417|pdb|3VLA|A Chain A, Crystal Structure Of Edgp
Length = 413
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 97/392 (24%), Positives = 171/392 (43%), Gaps = 61/392 (15%)
Query: 104 FILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCR 163
++D G +W C +Y +SS+ R + C+ +CS SI C
Sbjct: 37 LVVDLGGRFLWVDCDQNY----------------VSSTYRPVRCRTSQCSL--SGSIACG 78
Query: 164 DCNDEPLATSKNCT-QICPSYLVL---YGSGLTEGIALSETLN--LPNRII--PNFLVGC 215
DC + P N T + P V+ G + E + E+ + R++ P F+ C
Sbjct: 79 DCFNGPRPGCNNNTCGVFPENPVINTATGGEVAEDVVSVESTDGSSSGRVVTVPRFIFSC 138
Query: 216 SVLSSRQ-----PAGIAGFGRGKTSLPSQLN-----LDKFSYCLLSHKFDDTTRTSSLIL 265
+ S Q G+AG GR + +LPSQ KF+ CL +T ++S+I+
Sbjct: 139 APTSLLQNLASGVVGMAGLGRTRIALPSQFASAFSFKRKFAMCL-----SGSTSSNSVII 193
Query: 266 DNGSSH--------SDKKTTGLTYTPFVNNP----SVAERNAFSVYYYVGLRRITVGGQR 313
+ SDK LTYTP + NP + + + SV Y++G++ I + +
Sbjct: 194 FGNDPYTFLPNIIVSDKT---LTYTPLLTNPVSTSATSTQGEPSVEYFIGVKSIKINSKI 250
Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
V + L++ G GGT + + +T + +++ + + F+ + RN TR
Sbjct: 251 VALNTSLLSISSAGLGGTKISTINPYTVLETSIYKAVTEAFIKESAA-RNITRVASVAPF 309
Query: 374 TGLRPCFDVPGEKTG-SFPELKLHFKGGAEV-TLPVENYFAVVGEGSAVCLTVVTDREAS 431
++ + G S P + L + + V T+ N + + + VCL VV D ++
Sbjct: 310 GACFSTDNILSTRLGPSVPSIDLVLQSESVVWTITGSNSMVYIND-NVVCLGVV-DGGSN 367
Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
SI++G Q+++ V++DL R+GF L
Sbjct: 368 LRTSIVIGGHQLEDNLVQFDLATSRVGFSGTL 399
>gi|356546376|ref|XP_003541602.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 450
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 115/427 (26%), Positives = 171/427 (40%), Gaps = 53/427 (12%)
Query: 52 VSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSH 111
V++++ R+++ N K + +T T + S G Y +S S GTPP I ++DTGS
Sbjct: 60 VANAMRRSINRANHFNKKSFVASTNTAESTVKASQGEYLMSYSVGTPPFEILGVVDTGSG 119
Query: 112 LVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLA 171
+ W C +C+ C P F P S + + L C + C + I C+ +
Sbjct: 120 ITWMQCQ---RCEDCYEQTTPIFDPSKSKTYKTLPCSSNMCQSV----ISTPSCSSD--- 169
Query: 172 TSKNCTQICPSYLVLYGSGL-TEGIALSETLNL-----PNRIIPNFLVGCS-------VL 218
+I Y + YG G ++G ETL L + PN ++GC
Sbjct: 170 ------KIGCKYTIKYGDGSHSQGDLSVETLTLGSTNGSSVQFPNTVIGCGHNNKGTFQG 223
Query: 219 SSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTG 278
G+ G S S KFSYC L+ F + +S L + + S G
Sbjct: 224 EGSGVVGLGGGPVSLISQLSSSIGGKFSYC-LAPMFSQSNSSSKLNFGDAAVVSG---LG 279
Query: 279 LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR-VWHKYLTLDRDGNGGTIVDSGT 337
TP V+ + V+YY+ L +VG +R+ V + +G G I+DSGT
Sbjct: 280 AVSTPLVS------KTGSEVFYYLTLEAFSVGDKRIEFVGGSSSSGSSNGEGNIIIDSGT 333
Query: 338 TFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHF 397
T T + E + L + NR ++ L C+ P + HF
Sbjct: 334 TLTLLPQEDYSNLESAVADAIQANRV------SDPSNFLSLCYQTTPSGQLDVPVITAHF 387
Query: 398 KGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRL 457
K GA+V L + F V EG VC + S I GN N V YDL Q +
Sbjct: 388 K-GADVELNPISTFVQVAEG-VVCFAFHSSEVVS-----IFGNLAQLNLLVGYDLMEQTV 440
Query: 458 GFKQQLC 464
FK C
Sbjct: 441 SFKPTDC 447
>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
Length = 469
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 111/393 (28%), Positives = 165/393 (41%), Gaps = 62/393 (15%)
Query: 89 YSISLSFGTP--PQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSS 141
Y ++L FGTP PQ++ ++DTGS L W QC+ C+SS K P F P SS+
Sbjct: 122 YVVTLGFGTPAVPQVL--LIDTGSDLSWV------QCQPCNSSTCYPQKDPVFDPSASST 173
Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSET 200
+ C + C + +S C + S + +C Y + YG+G T G+ +ET
Sbjct: 174 YAPVPCGSEACRDLDPDSYA-NGCTN-----SSSGASLC-QYGIQYGNGDTTVGVYSTET 226
Query: 201 LNLPNR---IIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLN---LDKFSYCLLS 251
L L ++ NF GC ++ G+ G G SL SQ FSYCL +
Sbjct: 227 LTLSPEAATVVNNFSFGCGLVQKGVFDLFDGLLGLGGAPESLVSQTTGTYGGAFSYCLPA 286
Query: 252 HKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG 311
T+ + + T G +TP V E + +Y V L I+VGG
Sbjct: 287 GN-----STAGFLALGAPATGGNNTAGFQFTPL----QVVE----TTFYLVKLTGISVGG 333
Query: 312 QRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
+++ + GG I+DSGT T + + L F S M + L
Sbjct: 334 KQLDIEPTVFA------GGMIIDSGTIVTGLPETAYSALRTAFRSAM----SAYPLLPPN 383
Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREAS 431
L C+D G + P + L F+GG + L V + + G CL V AS
Sbjct: 384 DDEDLDTCYDFTGNTNVTVPTVALTFEGGVTIDLDVPSGVLLDG-----CLAFVAG--AS 436
Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
G + I+GN + + V YD +GF+ C
Sbjct: 437 DGDTGIIGNVNQRTFEVLYDSARGHVGFRAGAC 469
>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
Length = 333
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 104/380 (27%), Positives = 161/380 (42%), Gaps = 56/380 (14%)
Query: 93 LSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKY-CSSSKIPSFIPKLSSSSRLLGCQNPK 151
+ GTP ++DTGS L W C+ C C P F PK SS+ +GC +
Sbjct: 1 MGLGTPATQYVMVVDTGSSLTWLQCS---PCLVSCHRQSGPVFNPKSSSTYASVGCSAQQ 57
Query: 152 CSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPNRIIPN 210
CS + ++ C+ + +C Y YG S + G +T++ + +PN
Sbjct: 58 CSDLPSATLNPSACSS---------SNVCI-YQASYGDSSFSVGYLSKDTVSFGSTSLPN 107
Query: 211 FLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTRTSSLI 264
F GC + + AG+ G R K SL QL F+YCL ++ SL
Sbjct: 108 FYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFTYCL---PSSSSSGYLSLG 164
Query: 265 LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLD 324
N +S YTP V++ S+ + Y++ L +TV G + V +
Sbjct: 165 SYNPGQYS--------YTPMVSS-SLDDS-----LYFIKLSGMTVAGNPLSVSSSAYSSL 210
Query: 325 RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPG 384
TI+DSGT T + ++ L+ + M + +RA A + L CF
Sbjct: 211 P-----TIIDSGTVITRLPTSVYSALSKAVAAAM---KGTSRA---SAYSILDTCFKGQA 259
Query: 385 EKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQ 444
+ S P + + F GGA + L +N V + S CL R A+ I+GN Q Q
Sbjct: 260 SRV-SAPAVTMSFAGGAALKLSAQNLLVDVDD-STTCLAFAPARSAA-----IIGNTQQQ 312
Query: 445 NYYVEYDLRNQRLGFKQQLC 464
+ V YD+++ R+GF C
Sbjct: 313 TFSVVYDVKSSRIGFAAGGC 332
>gi|212275143|ref|NP_001130306.1| uncharacterized protein LOC100191400 precursor [Zea mays]
gi|194688798|gb|ACF78483.1| unknown [Zea mays]
gi|194703430|gb|ACF85799.1| unknown [Zea mays]
gi|194707192|gb|ACF87680.1| unknown [Zea mays]
gi|223944599|gb|ACN26383.1| unknown [Zea mays]
gi|223948667|gb|ACN28417.1| unknown [Zea mays]
gi|414887962|tpg|DAA63976.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 450
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 107/387 (27%), Positives = 158/387 (40%), Gaps = 62/387 (16%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + S GTPPQ + +DT + W PC C C +S F P S+S R + C
Sbjct: 112 YVVRASLGTPPQQLLLAVDTSNDASWIPCAG---CAGCPTSSAAPFDPASSASYRTVPCG 168
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
+P C+ P A + C L S L ++ ++L + +
Sbjct: 169 SPLCA-------------QAPNAACPPGGKACGFSLTYADSSLQAALS-QDSLAVAGNAV 214
Query: 209 PNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSHK---FDDTTR 259
+ GC + ++ P G+ G GRG S SQ + FSYCL S K F T R
Sbjct: 215 KAYTFGCLQRATGTAAPPQGLLGLGRGPLSFLSQTKDMYEATFSYCLPSFKSLNFSGTLR 274
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
+ NG K T P + NP S YYV + I VG + V +
Sbjct: 275 ----LGRNGQPQRIKTT------PLLANPH------RSSLYYVNMTGIRVGRKVVPI--- 315
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA--EALTGLR 377
D GT++DSGT FT + + + DE R +GA +L G
Sbjct: 316 -PAFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEV----------RRRVGAPVSSLGGFD 364
Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
CF+ ++P + L F G +VTLP EN G+ CL + + +
Sbjct: 365 TCFNT---TAVAWPPVTLLFDG-MQVTLPEENVVIHSTYGTISCLAMAAAPDGVNTVLNV 420
Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ + Q QN+ V +D+ N R+GF ++ C
Sbjct: 421 IASMQQQNHRVLFDVPNGRVGFARERC 447
>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 498
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 104/409 (25%), Positives = 178/409 (43%), Gaps = 59/409 (14%)
Query: 78 TTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIP-SFIP 136
+++ S+ YG Y+ + GTPP+ +DTGS ++W C C S I +F
Sbjct: 73 SSDPSTLGYGLYTTKVKMGTPPREFTVQIDTGSDILWINCNTCSNCPKSSGLGIELNFFD 132
Query: 137 KL-SSSSRLLGCQNPKC-SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEG 194
+ SS++ L+ C +P C S I + QC S Q ++ GSG T G
Sbjct: 133 TVGSSTAALVPCSDPMCASAIQGAAAQC----------SPQVNQCSYTFQYEDGSG-TSG 181
Query: 195 IALSETL--------NLPNRIIPN--FLVGCSVLSS-------RQPAGIAGFGRGKTSLP 237
+ +S+ + + P + + + GCS S + GI GFG G+ S+
Sbjct: 182 VYVSDAMYFDMILGQSTPANVASSATIVFGCSTYQSGDLTKTDKAVDGILGFGPGELSVV 241
Query: 238 SQLNLDKFSYCLLSHKFD-DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAF 296
SQL+ + + SH D L+L + + Y+P V PS
Sbjct: 242 SQLSSRGITPKVFSHCLKGDGNGGGILVL------GEILEPSIVYSPLV--PS------- 286
Query: 297 SVYYYVGLRRITVGGQRVRVWHK-YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFV 355
+Y + L+ I V GQ + + + T D+ GTI+DSGTT +++ E ++PL +
Sbjct: 287 QPHYNLNLQSIAVNGQVLSINPAVFATSDKR---GTIIDSGTTLSYLVQEAYDPLVNAVD 343
Query: 356 SQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVG 415
+ + + + G++ C+ V SFP + +F+GGA + L Y G
Sbjct: 344 TAVSQFATSFISKGSQ-------CYLVLTSIDDSFPTVSFNFEGGASMDLKPSQYLLNRG 396
Query: 416 EGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ + ++ G + ILG+ +++ V YDL Q++G+ C
Sbjct: 397 FQDGAKMWCIGFQKVQEGVT-ILGDLVLKDKIVVYDLARQQIGWTNYDC 444
>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
Length = 451
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 99/396 (25%), Positives = 162/396 (40%), Gaps = 54/396 (13%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRLLG 146
Y L G+PP+ +DTGS ++W C++ C S IP F P S ++ L+
Sbjct: 90 YYTRLQLGSPPRDFYVQIDTGSDVLWVSCSSCNGCPVSSGLHIPLNFFDPGSSPTASLIS 149
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLN--- 202
C + +CS + + + A + C Y YG G T G +S+ L+
Sbjct: 150 CSDQRCS------LGLQSSDSVCAAQNNQC-----GYTFQYGDGSGTSGYYVSDLLHFDT 198
Query: 203 -LPNRIIPN----FLVGCSVLSS-------RQPAGIAGFGRGKTSLPSQLNLDKFSYCLL 250
L ++ N + GCS L + R GI GFG+ S+ SQL + +
Sbjct: 199 ILGGSVMKNSSAPIVFGCSTLQTGDLTKPDRAVDGIFGFGQQDMSVISQLASQGITPRVF 258
Query: 251 SHKFD-DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
SH D + L+L + + YTP V PS +Y + L+ I V
Sbjct: 259 SHCLKGDDSGGGILVL------GEIVEPNIVYTPLV--PS-------QPHYNLNLQSIYV 303
Query: 310 GGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
GQ + + N GTI+DSGTT ++ ++P S + + + + G
Sbjct: 304 NGQTLAIDPSVFATSS--NQGTIIDSGTTLAYLTEAAYDPFISAITSTVSPSVSPYLSKG 361
Query: 370 AEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDRE 429
+ C+ FP++ L+F GG + L ++Y + L V ++
Sbjct: 362 NQ-------CYLTSSSINDVFPQVSLNFAGGTSMILIPQDYLIQQSSINGAALWCVGFQK 414
Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
G ILG+ +++ YD+ QR+G+ CK
Sbjct: 415 IQGQEITILGDLVLKDKIFVYDIAGQRIGWANYDCK 450
>gi|223974335|gb|ACN31355.1| unknown [Zea mays]
Length = 91
Score = 93.2 bits (230), Expect = 2e-16, Method: Composition-based stats.
Identities = 46/85 (54%), Positives = 56/85 (65%), Gaps = 9/85 (10%)
Query: 389 SFPELKLHFKGGAEVTLPVENYFAVVGEGS--AVCLTVVTDREASGG-------PSIILG 439
+ PEL F+GGA + LPVENYF V G G+ A+CL VVTD G P+IILG
Sbjct: 2 ALPELSFRFEGGAVMQLPVENYFVVAGRGAVEAICLAVVTDFSGGSGAGNEGSGPAIILG 61
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
+FQ QNY VEYDL +RLGF++Q C
Sbjct: 62 SFQQQNYLVEYDLEKERLGFRRQSC 86
>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
Length = 542
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 125/440 (28%), Positives = 174/440 (39%), Gaps = 91/440 (20%)
Query: 39 NPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSH---SYGGYSISLSF 95
+PS+ + L S++R + T T+ I S S G Y ++L
Sbjct: 48 DPSKTQAERLTDAFRRSVSRVGRFR---------PTAMTSDGIQSRIVPSAGEYLMNLYI 98
Query: 96 GTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWI 155
GTPP + I+DTGS L W C C +C +P F PK SS+ R C C +
Sbjct: 99 GTPPVPVIAIVDTGSDLTWTQCR---PCTHCYKQVVPLFDPKNSSTYRDSSCGTSFCLAL 155
Query: 156 HHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLPNRI-----IP 209
+ R C+ E K CT + Y G T G SETL + + P
Sbjct: 156 GKD----RSCSKE-----KKCT-----FRYSYADGSFTGGNLASETLTVDSTAGKPVSFP 201
Query: 210 NFLVGCSVLS----SRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSS 262
F GC S + +GI G G G+ SL SQL FSYCLL D + SS
Sbjct: 202 GFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKSTINGLFSYCLLPVSTDSS--ISS 259
Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
I N + G TP R+ G + K
Sbjct: 260 RI--NFGASGRVSGYGTVSTPL---------------------RLPYKG-----YSKKTE 291
Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
++ G IVDSGTT+TF+ E + L ++ V+ +K + G +L C++
Sbjct: 292 VEE---GNIIVDSGTTYTFLPQEFYSKL-EKSVANSIKGKRVRDPNGIFSL-----CYNT 342
Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
E P + HFK A V L N F + E VC TV + +LGN
Sbjct: 343 TAEINA--PIITAHFK-DANVELQPLNTFMRMQE-DLVCFTVAPTSDIG-----VLGNLA 393
Query: 443 MQNYYVEYDLRNQRLGFKQQ 462
N+ V +DLR +R GF ++
Sbjct: 394 QVNFLVGFDLRKKR-GFSKK 412
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 44/137 (32%), Positives = 66/137 (48%), Gaps = 14/137 (10%)
Query: 328 NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKT 387
G IVDSGTT+T++ E + L +E V+ +K + G +L C++ ++
Sbjct: 417 EGNIIVDSGTTYTYLPLEFYVKL-EESVAHSIKGKRVRDPNGISSL-----CYNTTVDQI 470
Query: 388 GSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYY 447
+ P + HFK A V L N F + E VC TV+ + ILGN N+
Sbjct: 471 DA-PIITAHFKD-ANVELQPWNTFLRMQE-DLVCFTVLPTSDIG-----ILGNLAQVNFL 522
Query: 448 VEYDLRNQRLGFKQQLC 464
V +DLR +R+ FK C
Sbjct: 523 VGFDLRKKRVSFKAADC 539
>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
Length = 445
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 109/390 (27%), Positives = 161/390 (41%), Gaps = 70/390 (17%)
Query: 89 YSISLSFGTP--PQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSS 141
Y ++SFGTP PQ++ ++DTGS L W QCK CSS K P F P SS+
Sbjct: 112 YVATVSFGTPAVPQVV--VIDTGSDLTWL------QCKPCSSGQCSPQKDPLFDPSHSST 163
Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSET 200
+ C + +C + ++ N +P + + Y G T G+ +
Sbjct: 164 YSAVPCASGECKKLAADAYGSGCSNGQPCG-----------FAISYVDGTSTVGVYGKDK 212
Query: 201 LNL-PNRIIPNFLVGCSVLSSRQPAGIAGFGRGKT---SLPSQ-LNLDKFSYCLLSHKFD 255
L L P I+ +F GC S P G SL +Q FSYCL +
Sbjct: 213 LTLAPGAIVKDFYFGCGHSKSSLPGLFDGLLGLGRLSESLGAQYGGGGGFSYCLPAVN-- 270
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
++ L G + +G +TP P + FS V L ITVGG+++
Sbjct: 271 --SKPGFLAFGAG-----RNPSGFVFTPMGRVPG---QPTFST---VTLAGITVGGKKLD 317
Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
+ + GG IVDSGT T + ++ L F M + Y G
Sbjct: 318 LRPSAFS------GGMIVDSGTVVTVLQSTVYRALRAAFREAM---KAYRLVHG-----D 363
Query: 376 LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREASGGP 434
L C+D+ G K P++ L F GGA + L V N V G CL T ++ + G
Sbjct: 364 LDTCYDLTGYKNVVVPKIALTFSGGATINLDVPNGILVNG-----CLAFAETGKDGTAG- 417
Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+LGN + + V +D + GF+ + C
Sbjct: 418 --VLGNVNQRTFEVLFDTSASKFGFRAKAC 445
>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 374
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 109/392 (27%), Positives = 164/392 (41%), Gaps = 58/392 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + +S GTPP I I DTGS L W C C C + P F P+ S+S R +
Sbjct: 23 GHYLMEVSIGTPPFKIYGIADTGSDLTWTSCV---PCNKCYKQRNPIFDPQKSTSYRNIS 79
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNL-- 203
C +S C + + K+C +Y Y S +T+G+ ET+ L
Sbjct: 80 C----------DSKLCHKLDTGVCSPQKHC-----NYTYAYASAAITQGVLAQETITLSS 124
Query: 204 -PNRIIP--NFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNL----DKFSYCLLSH 252
+P + GC ++ + GI G G G S SQ+ +FS CL+
Sbjct: 125 TKGESVPLKGIVFGCGHNNTGGFNDREMGIIGLGGGPVSFISQIGSSFGGKRFSQCLVPF 184
Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
D + +S + L GS S K V+ P VA+++ Y+V L I+VG
Sbjct: 185 H-TDVSVSSKMSLGKGSEVSGKGV--------VSTPLVAKQD--KTPYFVTLLGISVGNT 233
Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
+ GN +DSGT T + +L+ D V+Q V++ + + +
Sbjct: 234 YLHFNGSSSQSVEKGN--VFLDSGTPPTILPTQLY----DRLVAQ-VRSEVAMKPVTNDL 286
Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG 432
G + C+ G P L HF+GG LP + + V + CL T+ + G
Sbjct: 287 DLGPQLCYRTKNNLRG--PVLTAHFEGGDVKLLPTQTF--VSPKDGVFCLG-FTNTSSDG 341
Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
G + GNF NY + +DL Q + FK C
Sbjct: 342 G---VYGNFAQSNYLIGFDLDRQVVSFKPMDC 370
>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
Length = 632
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 102/396 (25%), Positives = 167/396 (42%), Gaps = 69/396 (17%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y+ L GTPPQ I+D+GS + + PC + C+ C + + P F P LSSS +
Sbjct: 87 GYYTTRLYIGTPPQEFALIVDSGSTVTYVPCAS---CEQCGNHQDPRFQPDLSSSYSPVK 143
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
C N C+ + QC + A + + + +V +G E+ P R
Sbjct: 144 C-NVDCT-CDSDKKQC--TYERQYAEMSSSSGVLGEDIVSFGR---------ESELKPQR 190
Query: 207 IIPNFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQLNL-----DKFSYCLLSHKFDD 256
+ GC L S+ GI G GRG+ S+ QL D FS C
Sbjct: 191 AV----FGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIG- 245
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
+++L + SD +++ + +P YY + L+ I V G+ +RV
Sbjct: 246 ---GGAMVLGGVPAPSDMV---FSHSDPLRSP----------YYNIELKEIHVAGKALRV 289
Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVK-------NRNYTRALG 369
+ + GT++DSGTT+ ++ + F D S++ + NY
Sbjct: 290 DSRVF----NSKHGTVLDSGTTYAYLPEQAFVAFKDAVTSKVHSLKKIRGPDPNYKDICF 345
Query: 370 AEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVVTDR 428
A A + +V FP++ + F G +++L ENY F A CL V +
Sbjct: 346 AGAGRNVSKLHEV-------FPDVDMVFGNGQKLSLTPENYLFRHSKVDGAYCLGVFQNG 398
Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ P+ +LG ++N V YD N+++GF + C
Sbjct: 399 K---DPTTLLGGIIVRNTLVTYDRHNEKIGFWKTNC 431
>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
Length = 470
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 108/390 (27%), Positives = 158/390 (40%), Gaps = 62/390 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKY-CSSSKIPSFIPKLSSSSRLL 145
G Y L GTP ++DTGS L W C+ C C P + P+ SS+ +
Sbjct: 132 GNYVTELGLGTPATSYAMVVDTGSSLTWLQCS---PCVVSCHRQVGPLYDPRASSTYATV 188
Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLP 204
C +C + ++ C+ +C Y YG S + G +T++
Sbjct: 189 PCSASQCDELQAATLNPSACSVR---------NVC-IYQASYGDSSFSVGYLSRDTVSFG 238
Query: 205 NRIIPNFLVGCSV----LSSRQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDT 257
+ PNF GC L R AG+ G R K SL QL FSYCL
Sbjct: 239 SGSYPNFYYGCGQDNEGLFGRS-AGLIGLARNKLSLLYQLAPSLGYSFSYCL-------P 290
Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
T S+ L G S +YTP +A + + Y+V L ++VGG + V
Sbjct: 291 TPASTGYLSIGPYTSGH----YSYTP------MASSSLDASLYFVTLSGMSVGGSPLAVS 340
Query: 318 -HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
+Y +L TI+DSGT T + ++ L+ + MV ++ A A + L
Sbjct: 341 PAEYSSLP------TIIDSGTVITRLPTAVYTALSKAVAAAMVGVQS------APAFSIL 388
Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREASGGPS 435
CF + P + + F GGA + L +N V + S CL TD +
Sbjct: 389 DTCFQGQASQL-RVPAVAMAFAGGATLKLATQNVLIDV-DDSTTCLAFAPTDS------T 440
Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
I+GN Q Q + V YD+ R+GF C
Sbjct: 441 TIIGNTQQQTFSVVYDVAQSRIGFAAGGCS 470
>gi|168063189|ref|XP_001783556.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664943|gb|EDQ51645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 414
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 102/404 (25%), Positives = 162/404 (40%), Gaps = 64/404 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y +++ G P ++ +DTGS L W C C+ C+ + PK +R++
Sbjct: 29 GLYYMAMRIGNPAKLYYLDMDTGSDLTWLQC--DAPCRSCAVGPHGLYDPK---RARVVD 83
Query: 147 CQNPKCSWIHHE-----SIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSET 200
C+ P C+ + S R C+ Y V Y G T GI + +T
Sbjct: 84 CRRPTCAQVQRGGQFTCSGDVRQCD----------------YEVDYVDGSSTMGILVEDT 127
Query: 201 LNL----PNRIIPNFLVGCSVLS----SRQPA---GIAGFGRGKTSLPSQLNLDKFSYCL 249
+ L R ++GC ++ PA G+ G K SLPSQL + +
Sbjct: 128 ITLVLTNGTRFQTRAVIGCGYDQQGTLAKAPAVTDGVIGLSSSKISLPSQLAAKGIANNV 187
Query: 250 LSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
+ H + + + G+T+TP + P V Y LR I
Sbjct: 188 IGHCLAGGSNGGGYLF---FGDTLVPALGMTWTPMIGRPLVEG-------YQARLRSIKY 237
Query: 310 GGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN---RNYTR 366
GG+ + L D GG + DSGT+FT++ P + + V Q ++ R T
Sbjct: 238 GGEVLE-----LEGTTDDVGGAMFDSGTSFTYLVPNAYTAVLSAVVRQAQRSGLERIKTD 292
Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKG------GAEVTLPVENYFAVVGEGSAV 420
G P F+ + + F + L F G G + L E Y V +G+ V
Sbjct: 293 TTLPFCWRGPSP-FESVADVSAYFKTVTLDFGGSTWWSSGKLLELSPEGYLIVSTQGN-V 350
Query: 421 CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
CL V+ AS + ILG+ M+ Y V YD +++G+ ++ C
Sbjct: 351 CLGVLDASVASLEVTNILGDISMRGYLVVYDNMREQIGWVRRNC 394
>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 413
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 110/394 (27%), Positives = 159/394 (40%), Gaps = 61/394 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + L GTPP I +DTGS L+W C C C + P F P SS+ +
Sbjct: 62 GQYLMELYIGTPPIKISGTVDTGSDLIWVQCV---PCLGCYNQINPMFDPLKSSTYTNIS 118
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPN 205
C +P C + +C+ E K C Y Y S LT+G+ ET+ L +
Sbjct: 119 CDSPLCYKPY-----IGECSPE-----KRC-----DYTYGYADSSLTKGVLAQETVTLTS 163
Query: 206 RI-----IPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNL----DKFSYCLLSH 252
+ L GC ++ G+ G G G TSL SQ+ KFS CL+
Sbjct: 164 NTGKPISLQGILFGCGHNNTGNFNDHEMGLIGLGGGPTSLVSQIGPLFGGKKFSQCLVPF 223
Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
D T +S + GS + G+ TP V +R YYV L I+
Sbjct: 224 -LTDITISSQMSFGKGSEVLGE---GVVTTPLV------QREQDMTSYYVTLLGIS---- 269
Query: 313 RVRVWHKYLTLDRD-GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
V YL ++ G +VDSGT + +L++ + E VKN+ + +
Sbjct: 270 ---VEDTYLPMNSTIEKGNMLVDSGTPPNILPQQLYDRVYVE-----VKNKVPLEPITDD 321
Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAV-CLTVVTDREA 430
G + C+ G P L HF+G + P++ + E V CL + +
Sbjct: 322 PSLGPQLCYRTQTNLKG--PTLTYHFEGANLLLTPIQTFIPPTPETKGVFCLAITNCANS 379
Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
G I GNF NY + +DL Q + FK C
Sbjct: 380 DPG---IYGNFAQTNYLIGFDLDRQIVSFKPTDC 410
>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 447
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 121/475 (25%), Positives = 190/475 (40%), Gaps = 98/475 (20%)
Query: 22 FPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNI 81
F + + S LS F+ NPS+ + L S++RA H + T + + + N
Sbjct: 35 FSTDLISRDSPLSPFY-NPSETQFDRLQKAFHRSISRANHFRANGVSTNSIQSPVISNN- 92
Query: 82 SSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSS 141
G Y +++S GTPP + I DTGS L+W C C C P F P S +
Sbjct: 93 -----GEYLMNISLGTPPVSMHGIADTGSDLLWRQCK---PCDSCYEQIEPIFDPAKSKT 144
Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSET 200
++L C+ CS + + C+D+ C Y YG G T G +T
Sbjct: 145 YQILSCEGKSCSNLGGQG----GCSDD-----NTCI-----YSYSYGDGSHTSGDLAVDT 190
Query: 201 LNLPNRI-----IPNFLVGCSVLSSRQPAGIAGFGRGKT----------------SLPSQ 239
L + + +P + GC G G T S+ SQ
Sbjct: 191 LTIGSTTGRPVSVPKVVFGC------------GHNNGGTFELHGSGLVGLGGGPLSMISQ 238
Query: 240 LNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAF 296
L +FSYCL+ +D + +S + S G TP +A R
Sbjct: 239 LRPLIGGRFSYCLVPLG-NDPSVSSKMHF---GSRGIVSGAGAVSTP------LASRQP- 287
Query: 297 SVYYYVGLRRITVGGQRV--RVWHKYLTLDRDGN-GGTIVDSGTTFTFMAPELFEPLADE 353
+YY+ L ++VG +++ + + K + D + G I+DSGTT T + + + L
Sbjct: 288 DTFYYLTLESMSVGSKKLAYKGFSKVGSPLADADEGNIIIDSGTTLTLLPQDFYGTLESN 347
Query: 354 FVSQM----VKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVEN 409
VS + V++ N +L L+GLR P + HF GA++ L N
Sbjct: 348 VVSAIGGKPVRDPNNVFSLCYSNLSGLR------------IPTITAHFV-GADLELKPLN 394
Query: 410 YFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
F V E C ++ + + I GN N+ V YDL+++ + FK C
Sbjct: 395 TFVQVQE-DLFCFAMIPVSDLA-----IFGNLAQMNFLVGYDLKSRTVSFKPTDC 443
>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 450
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 102/388 (26%), Positives = 159/388 (40%), Gaps = 54/388 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKY-CSSSKIPSFIPKLSSSSRLL 145
G Y L GTP ++D+GS L W C C C P + P+ SS+ +
Sbjct: 106 GNYITRLGLGTPTTTYVMVVDSGSSLTWLQCA---PCAVSCHPQAGPLYDPRASSTYAAV 162
Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLP 204
C P+C+ + ++ C+ + +C Y YG G + G +T++L
Sbjct: 163 PCSAPQCAELQAATLNPSSCSG---------SGVC-QYQASYGDGSFSFGYLSKDTVSLS 212
Query: 205 NR-IIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDT 257
+ P F GC +V + AG+ G R K SL SQL + F+YCL +
Sbjct: 213 SSGSFPGFYYGCGQDNVGLFGRAAGLIGLARNKLSLLSQLAPSVGNSFAYCL-----PTS 267
Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV- 316
S+ L GS+ +K +YT V++ A Y+V L ++V G + V
Sbjct: 268 AAASAGYLSFGSNSDNKNPGKYSYTSMVSSSLDAS------LYFVSLAGMSVAGSPLAVP 321
Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
+Y +L TI+DSGT T + ++ L+ + + ++ L
Sbjct: 322 SSEYGSLP------TIIDSGTVITRLPTPVYTALSKAVGAALAAPSAPAYSI-------L 368
Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
+ CF K P + + F GGA + L N V E + TD A
Sbjct: 369 QTCFKGQVAKL-PVPAVNMAFAGGATLRLTPGNVLVDVNETTTCLAFAPTDSTA------ 421
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN Q Q + V YD++ R+GF C
Sbjct: 422 IIGNTQQQTFSVVYDVKGSRIGFAAGGC 449
>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
Length = 488
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 110/394 (27%), Positives = 154/394 (39%), Gaps = 63/394 (15%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y SL GTP + LDTGS W C C C + P F P SS+ + C
Sbjct: 139 YVASLRLGTPATELVVELDTGSDQSWVQCK---PCADCYEQRDPVFDPTASSTYSAVPCG 195
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLY-GSGLTEGIALSETLNLPNR- 206
+C E + +KNC Y V Y T G +TL L
Sbjct: 196 AREC----QELASSSSSRNCSSDNNKNC-----PYEVSYDDDSHTVGDLARDTLTLSPSP 246
Query: 207 ------IIPNFLVGCSVLSSRQPAGIAG-------FGRGKTSLPSQLNLD---KFSYCLL 250
+P F+ GC AG G G GK SLPSQ+ FSYCL
Sbjct: 247 SPSPADTVPGFVFGC----GHSNAGTFGEVDGLLGLGLGKASLPSQVAARYGAAFSYCLP 302
Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
S + L ++ ++ + T + V ++ S YY+ L I V
Sbjct: 303 SSP----SAAGYLSFGGAAARANAQFTEM----------VTGQDPTS--YYLNLTGIVVA 346
Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
G+ ++V GTI+DSGT F+ + P + L F S M + R Y RA +
Sbjct: 347 GRAIKVPASAFAT----AAGTIIDSGTAFSRLPPSAYAALRSSFRSAMGRYR-YKRAPSS 401
Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA 430
C+D G +T P ++L F GA V L + + CL V + +
Sbjct: 402 PIFD---TCYDFTGHETVRIPAVELVFADGATVHLHPSGVLYTWNDVAQTCLAFVPNHDL 458
Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
ILGN Q + V YD+ +QR+GF ++ C
Sbjct: 459 G-----ILGNTQQRTLAVIYDVGSQRIGFGRKGC 487
>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 106/398 (26%), Positives = 166/398 (41%), Gaps = 59/398 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
G Y + GTPP+ +DTGS ++W CT+ C S +I F P +SSS+ L
Sbjct: 82 GLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASL 141
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCT--QICPSYLVLYGSGL-TEGIALSETL 201
+ C + +C + + Q T C+ +C SY YG G T G +S+ +
Sbjct: 142 VSCSDRRC----YSNFQ----------TESGCSPNNLC-SYSFKYGDGSGTSGFYISDFM 186
Query: 202 NLPNRIIPN--------FLVGCSVLSS-------RQPAGIAGFGRGKTSLPSQLNLDKFS 246
+ I F+ GCS L + R GI G G+G S+ SQL + +
Sbjct: 187 SFDTVITSTLAINSSAPFVFGCSNLQTGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLA 246
Query: 247 YCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
+ SH +++ D YTP V PS +Y V L+
Sbjct: 247 PRVFSHCLKGDKSGGGIMVLGQIKRPDT-----VYTPLV--PS-------QPHYNVNLQS 292
Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
I V GQ + + T+ GTI+D+GTT ++ E + P + + + Y R
Sbjct: 293 IAVNGQILPIDPSVFTIAT--GDGTIIDTGTTLAYLPDEAYSPFIQAIANAVSQ---YGR 347
Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT 426
+ E+ CF++ FPE+ L F GGA + L Y + S + +
Sbjct: 348 PITYESYQ----CFEITAGDVDVFPEVSLSFAGGASMVLRPHAYLQIF-SSSGSSIWCIG 402
Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ S ILG+ +++ V YDL QR+G+ + C
Sbjct: 403 FQRMSHRRITILGDLVLKDKVVVYDLVRQRIGWAEYDC 440
>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 500
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 103/402 (25%), Positives = 162/402 (40%), Gaps = 64/402 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
G Y + G+P + +DTGS ++W C C + S I F SS++ L
Sbjct: 81 GLYFTKVKLGSPAKEFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAAL 140
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLN- 202
+ C +P CS+ + E + + C SY YG G T G +S+T+
Sbjct: 141 VSCGDPICSY------AVQTATSECSSQANQC-----SYTFQYGDGSGTTGYYVSDTMYF 189
Query: 203 ----LPNRIIPN----FLVGCSVLSS-------RQPAGIAGFGRGKTSLPSQLNL----- 242
L ++ N + GCS S + GI GFG G S+ SQL+
Sbjct: 190 DTVLLGQSVVANSSSTIIFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTP 249
Query: 243 DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYV 302
FS+CL + IL+ + Y+P V PS +Y +
Sbjct: 250 KVFSHCLKGGENGGGVLVLGEILE----------PSIVYSPLV--PS-------QPHYNL 290
Query: 303 GLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNR 362
L+ I V GQ + + N GTIVDSGTT ++ E + P + + +
Sbjct: 291 NLQSIAVNGQLLPIDSNVFATTN--NQGTIVDSGTTLAYLVQEAYNPFVKAITAAVSQFS 348
Query: 363 NYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCL 422
+ G + C+ V FP++ L+F GGA + L E+Y G +
Sbjct: 349 KPIISKGNQ-------CYLVSNSVGDIFPQVSLNFMGGASMVLNPEHYLMHYGFLDGAAM 401
Query: 423 TVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ ++ G + ILG+ +++ YDL NQR+G+ C
Sbjct: 402 WCIGFQKVEQGFT-ILGDLVLKDKIFVYDLANQRIGWADYDC 442
>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
vinifera]
Length = 560
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 107/397 (26%), Positives = 157/397 (39%), Gaps = 57/397 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSS--SKIPSFIPKLSSSSRL 144
G Y + GTP + +DTGS ++W C +C S + + K S++S
Sbjct: 153 GLYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDA 212
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLP 204
+GC + CS D PL K Q Y VLYG G + +
Sbjct: 213 VGCDDNFCSLY-----------DGPLPGCKPGLQCL--YSVLYGDGSSTTGYFVQDFVQY 259
Query: 205 NRIIPNF---------LVGCSVL-------SSRQPAGIAGFGRGKTSLPSQLNLDKFSYC 248
NRI NF + GC SS GI GFG+ +S+ SQL
Sbjct: 260 NRISGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKK 319
Query: 249 LLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
+ SH D+ +D G + G P VN + + A +Y V ++ I
Sbjct: 320 VFSHCLDN--------VDGGGIFA----IGEVVEPKVNITPLVQNQA---HYNVVMKEIE 364
Query: 309 VGGQRVRV-WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRA 367
VGG + V + + DR G TI+DSGTT + E++ PL ++ +SQ R +T
Sbjct: 365 VGGDPLDVPSDAFESGDRKG---TIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTV- 420
Query: 368 LGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTD 427
+A T CFD G FP + LHF +T+ Y +
Sbjct: 421 --EQAFT----CFDYTGNVDDGFPTVTLHFDKSISLTVYPHEYLFQHEFEWCIGWQNSGA 474
Query: 428 REASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ G +LG+ + N V YDL Q +G+ + C
Sbjct: 475 QTKDGKDLTLLGDLVLSNKLVVYDLEKQGIGWVEYNC 511
>gi|255552237|ref|XP_002517163.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223543798|gb|EEF45326.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 469
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 93/394 (23%), Positives = 171/394 (43%), Gaps = 54/394 (13%)
Query: 97 TPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIH 156
TP + I+D G+ +W C Y +SSS + C + C +
Sbjct: 88 TPLVPVKLIVDLGARFMWVDCEEGY----------------VSSSYTPVSCDSLLCKLAN 131
Query: 157 HESIQCR-DCNDEPLATSKN--CTQICPSYLVLYGSG--LTEGIALSETLN--LPNRII- 208
S+ C +CN P N C + ++ G+ + + + ++ N P+RI+
Sbjct: 132 --SLACATECNSTPKPGCHNNTCAHSPENPVIRLGTSGQIGQDVVSLQSFNGKTPDRIVS 189
Query: 209 -PNFLVGCS---VLSSRQPA--GIAGFGRGKTSLPSQLNL-----DKFSYCLLSHKFDDT 257
PNF C +L + G+AG G SLP+Q + KF+ CL ++
Sbjct: 190 VPNFPFVCGPTFLLENLADGVTGLAGLGNSNISLPAQFSSAFGFPKKFAVCL-----SNS 244
Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAF----SVYYYVGLRRITVGGQR 313
T+++ LI +S+ LTYTP ++NP ++ SV Y++G++ I +GG+
Sbjct: 245 TKSNGLIFFGDGPYSNLPND-LTYTPLIHNPVSTAGGSYLGEASVEYFIGVKSIRIGGKD 303
Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
V+ L++D +G GGT + + +T + +++ + FV +M ++ + + +
Sbjct: 304 VKFNKTLLSIDSEGKGGTKISTVDPYTVLHTSIYKAVVKAFVKEM--DKKFIPQV-QPPI 360
Query: 374 TGLRPCFDVPGEKTGSF----PELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDRE 429
CF + F P + L +G VT + ++V S V D
Sbjct: 361 APFGACFQSIVIDSNEFGPVLPFIDLVLEGQGSVTWRIWGANSMVKISSLVMCLGFVDGG 420
Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
SI++G Q+++ +++DL + +LGF L
Sbjct: 421 IEPRTSIVIGGRQIEDNLLQFDLASSKLGFSSSL 454
>gi|383161172|gb|AFG63168.1| Pinus taeda anonymous locus 0_11073_01 genomic sequence
gi|383161174|gb|AFG63170.1| Pinus taeda anonymous locus 0_11073_01 genomic sequence
gi|383161175|gb|AFG63171.1| Pinus taeda anonymous locus 0_11073_01 genomic sequence
Length = 133
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 62/145 (42%), Positives = 85/145 (58%), Gaps = 20/145 (13%)
Query: 176 CTQICPSYLVLYGSGLTEGIALSETLNLP-----NRIIPNFLVGCSVLSSRQPAGIAGFG 230
C++ICP + + YG+G G LS+TL LP R I NF GC+V+SS Q AGIAGFG
Sbjct: 1 CSKICPHFSLTYGTGNATGRLLSDTLTLPLEDGGRREIKNFATGCAVVSS-QVAGIAGFG 59
Query: 231 RGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNN 287
G S+PSQL DKF+YCL D + +S ++L N + D LTYTP + N
Sbjct: 60 NGGLSMPSQLAPLIGDKFAYCL-----DYRSNSSKIVLGNKAVPRDLP---LTYTPLLFN 111
Query: 288 PSVAERNAFSVYYYVGLRRITVGGQ 312
P + FS Y+Y+ L +++GG+
Sbjct: 112 P--VNPSVFS-YFYLALETVSIGGK 133
>gi|388509650|gb|AFK42891.1| unknown [Lotus japonicus]
Length = 347
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 82/302 (27%), Positives = 148/302 (49%), Gaps = 52/302 (17%)
Query: 192 TEGIALSETLNLPNRIIPNFLVGCSVLS---SRQPAGIAGFGRGKTSLPSQLNLD----- 243
T+G ++ +++PN + F+ G V+ ++ G+AG GR + SLPSQ +
Sbjct: 47 TDGTTPTKVVSVPNFL---FICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHR 103
Query: 244 KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTG-LTYTPFVNNPSVAERNAF----SV 298
KF+ CL ++ D + +G + ++ + LTYTP + NP +AF SV
Sbjct: 104 KFAICLTANSGADGV----MFFGDGPYNLNQDVSKVLTYTPLITNPVSTAPSAFLGEPSV 159
Query: 299 YYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM 358
Y++G++ I V + V + L+++++G GGT + + +T M +++ +AD FV
Sbjct: 160 EYFIGVKSIKVSEKNVPLNTTLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFV--- 216
Query: 359 VKNRNYTRALGAEALTGLRP---CF---DVPGEKTG-SFPELKLHFKGGAEVTLPVENYF 411
++LGA ++ + P CF D+ + G P + L + G E P+
Sbjct: 217 -------KSLGAPTVSPVAPFGTCFATKDISFSRIGPGVPAIDLVLQNGVE--WPIIGAN 267
Query: 412 AVVGEGSAVCLTVV---TDREAS------GGP----SIILGNFQMQNYYVEYDLRNQRLG 458
++V +CL V ++ +AS GG SI +G Q++N +++DL RLG
Sbjct: 268 SMVQFDDVICLGFVDAGSNPKASQVGFVNGGSHPVTSITIGAHQLENNLLKFDLAASRLG 327
Query: 459 FK 460
F+
Sbjct: 328 FR 329
>gi|147776733|emb|CAN74676.1| hypothetical protein VITISV_038368 [Vitis vinifera]
Length = 389
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 85/290 (29%), Positives = 135/290 (46%), Gaps = 42/290 (14%)
Query: 179 ICPSYLVLYGSG-LTEGIALSETLNLPNRIIPNFLVGCSVLSSR---QPAGIAGFGRGKT 234
IC +Y + YG G T G E L ++ +F+ GC + +G+ G GR
Sbjct: 75 IC-NYAINYGDGSFTRGELGHEKLKFGTILVKDFIFGCGRNNKGLFGGVSGLMGLGRSDL 133
Query: 235 SLPSQ---LNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVA 291
SL SQ + FSYCL S + + SLIL G+S + ++ ++Y + NP +
Sbjct: 134 SLISQTSGIFGGVFSYCLPS---TERKGSGSLIL-GGNSSVYRNSSPISYAKMIENPQLY 189
Query: 292 ERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLA 351
+Y++ L I++GG ++ G +VDSGT T + P +++ L
Sbjct: 190 N------FYFINLTGISIGGVALQA-------PSVGPSRILVDSGTVITRLPPTIYKALK 236
Query: 352 DEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYF 411
EF+ Q +T A A + L CF++ + P +K+HF+G AE+T+ V F
Sbjct: 237 AEFLKQ------FTGFPPAPAFSILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVF 290
Query: 412 AVV-GEGSAVCLTVVT----DREASGGPSIILGNFQMQNYYVEYDLRNQR 456
V + S VCL + + D A ILGN+Q +N V YD + +
Sbjct: 291 YFVKSDASQVCLALASLEYQDEVA------ILGNYQQKNLRVIYDTKETK 334
>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
Length = 470
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 114/391 (29%), Positives = 156/391 (39%), Gaps = 72/391 (18%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSS-----SKIPSFIPKLSSSSR 143
Y ++ S GTP +DTGS L W QCK C++ K P F P SSS
Sbjct: 137 YVVTASLGTPGMAQTLEVDTGSDLSWV------QCKPCAAPSCYRQKDPLFDPAQSSSYA 190
Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLN 202
+ C C+ + + C ++ C Y+V YG G T G+ S+TL
Sbjct: 191 AVPCGRSACAGLGIYASAC---------SAAQC-----GYVVSYGDGSNTTGVYSSDTLT 236
Query: 203 L-PNRIIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLN---LDKFSYCLLSHKF 254
L N + FL GC S G+ GFGR + SL Q FSYCL +
Sbjct: 237 LAANATVQGFLFGCGHAQSGGLFTGIDGLLGFGREQPSLVQQTAGAYGGVFSYCLPTK-- 294
Query: 255 DDTTRTSSLILDNGSSHSDK-KTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
++ T L L S + TT L +P N P+ YY V L I+VGGQ
Sbjct: 295 --SSTTGYLTLGGPSGVAPGFSTTQLLPSP--NAPT---------YYVVMLTGISVGGQP 341
Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
+ V GT+VD+GT T + P + L F S M + A +
Sbjct: 342 LSVPASAFA------AGTVVDTGTVITRLPPAAYAALRSAFRSGMASYPS------APPI 389
Query: 374 TGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG 433
L C+ G T + + L F GA +TL + S CL + S G
Sbjct: 390 GILDTCYSFAGYGTVNLTSVALTFSSGATMTLGADGIM------SFGCLAFAS--SGSDG 441
Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
ILGN Q +++ V D +GF+ C
Sbjct: 442 SMAILGNVQQRSFEVRID--GSSVGFRPSSC 470
>gi|15241713|ref|NP_195839.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75181297|sp|Q9LZL3.1|PCS1L_ARATH RecName: Full=Aspartic proteinase PCS1; AltName: Full=Aspartic
protease 38; Short=AtASP38; AltName: Full=Protein EMBRYO
DEFECTIVE 24; AltName: Full=Protein PROMOTION OF CELL
SURVIVAL 1; Flags: Precursor
gi|7340693|emb|CAB82992.1| putative protein [Arabidopsis thaliana]
gi|50897174|gb|AAT85726.1| At5g02190 [Arabidopsis thaliana]
gi|53828617|gb|AAU94418.1| At5g02190 [Arabidopsis thaliana]
gi|110742159|dbj|BAE99007.1| hypothetical protein [Arabidopsis thaliana]
gi|332003059|gb|AED90442.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 453
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 97/386 (25%), Positives = 156/386 (40%), Gaps = 42/386 (10%)
Query: 98 PPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHH 157
PPQ I ++DTGS L W C + + + + +F P SSS + C +P C
Sbjct: 82 PPQNISMVIDTGSELSWLRCN-----RSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTR 136
Query: 158 ESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRI-IPNFLVGC- 215
+ + C+ + ++C + L + +EG +E + N N + GC
Sbjct: 137 DFLIPASCDSD---------KLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNLIFGCM 187
Query: 216 SVLSSRQP------AGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGS 269
+S P G+ G RG S SQ+ KFSYC+ T +L S
Sbjct: 188 GSVSGSDPEEDTKTTGLLGMNRGSLSFISQMGFPKFSYCI-----SGTDDFPGFLLLGDS 242
Query: 270 SHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNG 329
+ + T L YTP + S V Y V L I V G+ + + L D G G
Sbjct: 243 NFT--WLTPLNYTPLIRI-STPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAG 299
Query: 330 GTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGS 389
T+VDSGT FTF+ ++ L F+++ + C+ + + S
Sbjct: 300 QTMVDSGTQFTFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIRS 359
Query: 390 -----FPELKLHFKGGAEVTLPVENYF-----AVVGEGSAVCLTVVTDREASGGPSIILG 439
P + L F+G AE+ + + VG S C T + + G + ++G
Sbjct: 360 GILHRLPTVSLVFEG-AEIAVSGQPLLYRVPHLTVGNDSVYCFTF-GNSDLMGMEAYVIG 417
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLCK 465
+ QN ++E+DL+ R+G C
Sbjct: 418 HHHQQNMWIEFDLQRSRIGLAPVECD 443
>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 444
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 119/446 (26%), Positives = 177/446 (39%), Gaps = 73/446 (16%)
Query: 39 NPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTP 98
NPS+ YQ L S+ R H + + + + G Y +++S GTP
Sbjct: 50 NPSETKYQRLQKAFRRSILRGNHFRAMRASPNDIQSDVISGG------GAYLMNISLGTP 103
Query: 99 PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE 158
P + I DTGS L+W C C C P F PK S + + L C N C + +
Sbjct: 104 PVPMLGIADTGSDLIWRQC---LPCPNCYEQVEPLFDPKESETYKTLDCDNEFCQDLGQQ 160
Query: 159 SIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPN-----RIIPNFL 212
C+D+ CT Y YG T G S+TL + + P
Sbjct: 161 G----SCDDD-----NTCT-----YSYSYGDRSYTRGDLSSDTLTIGSTEGDPASFPGIA 206
Query: 213 VGCSVLS----SRQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTRTSSLIL 265
GC + + + G+ G G G SL QL+ + +FSYCL+ D T SS I
Sbjct: 207 FGCGHDNGGTFNEKDGGLIGLGGGPLSLVMQLSSEVGGQFSYCLVPLSSDST--VSSKI- 263
Query: 266 DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
N +G TP + + +YY+ L ++VG + V K + ++
Sbjct: 264 -NFGKSGVVSGSGTVSTPLI-------KGTPDTFYYLTLEGLSVGSETVAF--KGFSENK 313
Query: 326 DG-----NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
G I+DSGTT T + + + + T A+G + T F
Sbjct: 314 SSPAAVEEGNIIIDSGTTLTLLPQDFYTDVESAL----------TNAIGGQTTTDPNGIF 363
Query: 381 DVPGEKTGSF--PELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
+ + P + HF GA+V LP N F V E VC +++ + I
Sbjct: 364 SLCYSSVNNLEIPTITAHFT-GADVQLPPLNTFVQVQE-DLVCFSMIPSSNLA-----IF 416
Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
GN N+ V YDL+N ++ FKQ C
Sbjct: 417 GNLAQINFLVGYDLKNNKVSFKQTDC 442
>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
Length = 443
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 123/466 (26%), Positives = 181/466 (38%), Gaps = 91/466 (19%)
Query: 31 FSLSRFHT--------NPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNIS 82
FSL+ H NP+ + L + S S++R K T + N
Sbjct: 34 FSLNLIHRDSPLSPLYNPNHTDFDRLRNAFSRSISRVNVFK------TKAVDINSFQNDL 87
Query: 83 SHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSS 142
+ G Y + +S GTP + I DTGS L W C C C K P F P SSS
Sbjct: 88 VPNGGEYFMKMSIGTPLVEVIVIADTGSDLTWVQC---LPCDPCYRQKSPLFDPSRSSSY 144
Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSY---LVLYGSGLTEGIALSE 199
R + C + C+ + D +++ N + SY G+ TE +
Sbjct: 145 RHMLCGSRFCNAL--------DVSEQACTMDTNICEYHYSYGDKSYTNGNLATEKFTIGS 196
Query: 200 TLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKT----------------SLPSQLN-- 241
T + P + P + GC G G G T SL SQL+
Sbjct: 197 TSSRPVHLSP-IVFGC------------GTGNGGTFDELGSGIVGLGGGALSLVSQLSSI 243
Query: 242 -LDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYY 300
KFSYCL+ + + TS + S S + V+ P V+++ YY
Sbjct: 244 IKGKFSYCLVPLS-EQSNVTSKIKFGTDSVISGPQV--------VSTPLVSKQP--DTYY 292
Query: 301 YVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVK 360
YV L I+VG +R+ + L + + G I+DSGTT TF+ E F L
Sbjct: 293 YVTLEAISVGNKRLPYTNGLLNGNVE-KGNVIIDSGTTLTFLDSEFFTEL---------- 341
Query: 361 NRNYTRALGAEALTGLRPCFDVPGEKTG--SFPELKLHFKGGAEVTLPVENYFAVVGEGS 418
R + AE ++ R F V G P + +HF A+V L N F V +
Sbjct: 342 ERVLEETVKAERVSDPRGLFSVCFRSAGDIDLPVIAVHFN-DADVKLQPLNTF-VKADED 399
Query: 419 AVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+C T+++ + I GN ++ V YDL + + FK C
Sbjct: 400 LLCFTMISSNQIG-----IFGNLAQMDFLVGYDLEKRTVSFKPTDC 440
>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
Length = 564
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 109/405 (26%), Positives = 163/405 (40%), Gaps = 87/405 (21%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y+ L GTPPQ I+DTGS + + PC++ C+ C + P F P LSS+ + +
Sbjct: 11 GYYTTRLWIGTPPQRFALIVDTGSSVTYVPCSS---CEQCGRHQDPKFQPDLSSTYQSVK 67
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETL----N 202
C +I C +C+DE + C Y Y T L E + N
Sbjct: 68 C-----------NIDC-NCDDE----KQQCV-----YERQYAEMSTSSGVLGEDIISFGN 106
Query: 203 LPNRIIPNFLVGCSVLS-----SRQPAGIAGFGRGKTSLPSQLNL-----DKFSYCLLSH 252
L + GC + S+ GI G GRG S+ L D FS C
Sbjct: 107 LSALAPQRAVFGCENMETGDLYSQHADGIMGMGRGDLSIVDHLVDKGVINDSFSLCYGGM 166
Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
+++L S S+ V + S R S YY + L+ I V G
Sbjct: 167 ----GIGGGAMVLGGISPPSN----------MVFSQSDPVR---SPYYNIDLKEIHVAG- 208
Query: 313 RVRVWHKYLTLDR---DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
K L L+ DG GTI+DSGTT+ ++ F D + ++
Sbjct: 209 ------KPLPLNPTVFDGKHGTILDSGTTYAYLPEAAFVSFKDAIMKEL---------HS 253
Query: 370 AEALTGLRP-----CFDVPG----EKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSA 419
+ + G P CF G + + SFP +++ F G ++ L ENY F A
Sbjct: 254 LKPIRGPDPNYNDICFSGAGSDISQLSSSFPAVEMVFGNGQKLLLSPENYLFRHSKVHGA 313
Query: 420 VCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
CL + + + P+ +LG ++N V YD N ++GF + C
Sbjct: 314 YCLGIFQNGK---DPTTLLGGIVVRNTLVLYDRENSKIGFWKTNC 355
>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 424
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 115/416 (27%), Positives = 179/416 (43%), Gaps = 55/416 (13%)
Query: 60 LHIKN-PQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCT 118
LH K+ P ++ T + T I + + + ++S G PP ++DTGS L W C
Sbjct: 50 LHSKSTPASRLDNLWTVSHVTPIPNPA--AFLANISIGNPPVPQLLLIDTGSDLTWIHC- 106
Query: 119 NHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQ 178
CK C IP F P SS+ R N C H Q DE T
Sbjct: 107 --LPCK-CYPQTIPFFHPSRSSTYR-----NASCVSAPHAMPQI--FRDEK-------TG 149
Query: 179 ICPSYLVLYGSGLTEGIALSETLNLP---NRII--PNFLVGCSVLSS--RQPAGIAGFGR 231
C +L T GI E L + +I N + GC +S + +G+ G G
Sbjct: 150 NCQYHLRYRDFSNTRGILAEEKLTFETSDDGLISKQNIVFGCGQDNSGFTKYSGVLGLGP 209
Query: 232 GKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVA 291
G S+ ++ KFSYC S + T + LIL NG+ K G +P+
Sbjct: 210 GTFSIVTRNFGSKFSYCFGSLT-NPTYPHNILILGNGA-----KIEG--------DPTPL 255
Query: 292 ERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLA 351
+ F YY+ L+ I+ G + + + R GGT++D+G + T +A E +E L+
Sbjct: 256 Q--IFQDRYYLDLQAISFGEKLLDIEPGTFQRYR-SQGGTVIDTGCSPTILAREAYETLS 312
Query: 352 DEFVSQMVKNRNYTRALGAEALTGLRPCFD--VPGEKTGSFPELKLHFKGGAEVTLPVEN 409
+E + R + T PC++ + + G FP + HF GGAE+ L VE+
Sbjct: 313 EEI--DFLLGEVLRRVKDWDQYT--TPCYEGNLKLDLYG-FPVVTFHFAGGAELALDVES 367
Query: 410 YFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
F G + CL + + + ++G QNY V Y+LR ++ F++ C+
Sbjct: 368 LFVSSESGDSFCLAMTMN---TFDDMSVIGAMAQQNYNVGYNLRTMKVYFQRTDCE 420
>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
Length = 458
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 122/424 (28%), Positives = 176/424 (41%), Gaps = 69/424 (16%)
Query: 58 RALHIK-------NPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGS 110
RA +IK + + T TT T++S+ Y I++ G+P +DTGS
Sbjct: 87 RAAYIKRKFSGAGDIEQSDAATVPTTLGTSLSTLEY---VITVGIGSPAVTQTMSMDTGS 143
Query: 111 HLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPL 170
+ W C C C S F P SS+ C + C+ + +S + C
Sbjct: 144 DVSWVQCK---PCSQCHSEVDSLFDPSSSSTYSPFSCSSAPCAQL-SQSQEGNGC----- 194
Query: 171 ATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPNRIIPNFLVGCSVLSS----RQPAG 225
S C Y+V YG S T G S+TL L + + +F GCS S Q G
Sbjct: 195 -MSSQC-----QYIVNYGDSSSTTGTYSSDTLTLGSSAMTDFQFGCSQSESGGFNDQTDG 248
Query: 226 IAGFGRGKTSLPSQLN---LDKFSYCLLSHKFDDTTRTSS-LILDNGSSHSDKKTTGLTY 281
+ G G G SL SQ FSYCL T+ +S L L GSS G
Sbjct: 249 LMGLGGGAQSLASQTAGTFGTAFSYCL-----PPTSGSSGFLTLGTGSS-------GFVK 296
Query: 282 TPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTF 341
TP + + + YY V L I VG Q++ + + G+++DSGT T
Sbjct: 297 TPMLRSTQIP------TYYVVLLESIKVGSQQLNLPTSVF------SAGSLMDSGTIITR 344
Query: 342 MAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGA 401
+ P + L+ F + M + Y A + L CFD G+ + S P + L F GGA
Sbjct: 345 LPPTAYSALSSAFKAGM---QQYPPATPSGI---LDTCFDFSGQSSISIPTVTLVFSGGA 398
Query: 402 EVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI-ILGNFQMQNYYVEYDLRNQRLGFK 460
V L + + S CL + + S S+ I+GN Q + + V YD+ +GFK
Sbjct: 399 AVDLAFDGIMLEI-SSSIRCLAFTPNGDDS---SLGIIGNVQQRTFEVLYDVGGGAVGFK 454
Query: 461 QQLC 464
C
Sbjct: 455 AGAC 458
>gi|357440775|ref|XP_003590665.1| Xyloglucan-specific endoglucanase inhibitor protein [Medicago
truncatula]
gi|355479713|gb|AES60916.1| Xyloglucan-specific endoglucanase inhibitor protein [Medicago
truncatula]
Length = 435
Score = 91.7 bits (226), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 103/406 (25%), Positives = 174/406 (42%), Gaps = 64/406 (15%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y ++ TP + I+D G +W C N Y +SS+ R C+
Sbjct: 47 YKAQINQRTPLVPLNVIVDLGGQFLWVDCENKY----------------ISSTYRPARCR 90
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCT-QICPSYLVLYGSGLTEGIALSETLNLP--- 204
+ +CS + + C DC P N T + P + + + T G + L++
Sbjct: 91 SAQCSLANSDG--CGDCFSSPKPGCNNNTCGVTPDNSITHTA--TSGELAEDVLSIQSSN 146
Query: 205 ------NRIIPNFLVGCS---VLS--SRQPAGIAGFGRGKTSLPSQLN-----LDKFSYC 248
N ++ FL C+ +L + +G+AG GR K +LPSQL KF+ C
Sbjct: 147 GFNPGQNVVVSRFLFSCAPTFLLKGLATGASGMAGLGRTKIALPSQLASAFSFARKFAIC 206
Query: 249 LLSHK----FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFS-----VY 299
L S K F D L N SD LTYTP + NP V+ +AFS
Sbjct: 207 LSSSKGVVLFGDGPYG---FLPNVVFDSDS----LTYTPLLINP-VSTASAFSQGQPSAE 258
Query: 300 YYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMV 359
Y++G++ I + + V + L++D +G GGT + + +T + +++ + D FV
Sbjct: 259 YFIGVKTIKIDEKVVSLNTSLLSIDNNGVGGTKISTVDPYTVLEASIYKAVTDAFVKASA 318
Query: 360 KNRNYTRALGAEALTGLRPCF-DVPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEG 417
RN R ++ C+ ++ G + G + P ++L + V V
Sbjct: 319 A-RNIKR---VGSVAPFEFCYTNLTGTRLGAAVPTIELFLQNENVVWRIFGANSMVSIND 374
Query: 418 SAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
+CL V + + SI++G +Q++N +++DL +LGF L
Sbjct: 375 EVLCLGFVNGGKNT-RTSIVIGGYQLENNLLQFDLAASKLGFSSLL 419
>gi|357465299|ref|XP_003602931.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355491979|gb|AES73182.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 438
Score = 91.7 bits (226), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 104/385 (27%), Positives = 158/385 (41%), Gaps = 50/385 (12%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + GTP Q++ +LDT + + P + C CS++ +F P S+S L
Sbjct: 96 GNYIVRVKIGTPGQLLFMVLDTSTDEAFIPSSG---CIGCSAT---TFSPNASTSYVPLE 149
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
C P+CS + + C P S C S+ Y + ++L L
Sbjct: 150 CSVPQCSQVR--GLSC------PATGSGAC-----SFNKSYAGSTYSATLVQDSLRLATD 196
Query: 207 IIPNFLVGC------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRT 260
+IP++ G S + ++ G+ S L FSYCL S F +
Sbjct: 197 VIPSYSFGSINAISGSSIPAQGLLGLGRGPLSLLSQTGSLYSGVFSYCLPS--FKSYYFS 254
Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
SL L +TT L P PS+ Y+V L ITVG V +
Sbjct: 255 GSLKLGPVGQPKSIRTTPLLRNP--RRPSL---------YFVNLTGITVGKVNVPFPKEL 303
Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
L D + GTI+DSGT T ++ + DEF Q+ + +LGA CF
Sbjct: 304 LAFDVNTGSGTIIDSGTVITRFVEPVYNAVRDEFRKQVTGPFS---SLGA-----FDTCF 355
Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREASGGPSIILG 439
E P + LHF ++ LP+EN GS CL + T + + ++
Sbjct: 356 VKNYETLA--PAITLHFT-DLDLKLPLENSLIHSSSGSLACLAMASTPKNVNYTVLNVIA 412
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
N+Q QN V +D N ++G ++LC
Sbjct: 413 NYQQQNLRVLFDTVNNKVGIARELC 437
>gi|357440781|ref|XP_003590668.1| Basic 7S globulin [Medicago truncatula]
gi|355479716|gb|AES60919.1| Basic 7S globulin [Medicago truncatula]
Length = 434
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 106/429 (24%), Positives = 182/429 (42%), Gaps = 70/429 (16%)
Query: 65 PQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCK 124
P+ T TTN Y ++ TP + I+D G +W C N Y
Sbjct: 30 PKALVLPVTKDVATTN-------QYKAQINQRTPLVPLNIIVDLGGLFLWVDCENQY--- 79
Query: 125 YCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCT-QICPSY 183
+SS+ R C++ +CS + C C P N T + P
Sbjct: 80 -------------ISSTYRPARCRSAQCSLAKFD--DCGVCFSSPKPGCNNNTCSVAPGN 124
Query: 184 LVLYGS---GLTEGIALSETLNL----PNRIIPNFLVGCS---VLS--SRQPAGIAGFGR 231
V + L E I ++ N N ++ FL C+ +L + +G+AG GR
Sbjct: 125 SVTQSAMSGELAEDILSIQSSNGFNPGQNVMVSRFLFSCARTFLLEGLASGASGMAGLGR 184
Query: 232 GKTSLPSQLN-----LDKFSYCLLSHK----FDDTTR--TSSLILDNGSSHSDKKTTGLT 280
K +LPSQL KF+ CL S K F D +++ D+ S LT
Sbjct: 185 NKLALPSQLASAFSFAKKFAICLSSSKGVVLFGDGPYGFLPNVVFDSKS---------LT 235
Query: 281 YTPFVNNP---SVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR-DGNGGTIVDSG 336
YTP + NP + ++ S Y++G++ I + G+ V + L++D +G GGT + +
Sbjct: 236 YTPLLINPFSTAAFAKSEPSAEYFIGVKTIKIDGKVVSLDTSLLSIDSSNGAGGTKISTV 295
Query: 337 TTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF-DVPGEKTGS-FPELK 394
+T + +++ + D FV RN R +++ C+ +V G + G+ P ++
Sbjct: 296 DPYTVLEASIYKAVTDAFVKASAA-RNIKRV---DSVAPFEFCYTNVTGTRLGADVPTIE 351
Query: 395 LHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRN 454
L+ + + N + + +CL V E + SI++G +Q++N +++DL
Sbjct: 352 LYLQNNVIWRIFGANSMVNIND-EVLCLGFVIGGENTWA-SIVIGGYQLENNLLQFDLAA 409
Query: 455 QRLGFKQQL 463
+LGF L
Sbjct: 410 SKLGFSSLL 418
>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 494
Score = 91.3 bits (225), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 117/403 (29%), Positives = 168/403 (41%), Gaps = 53/403 (13%)
Query: 71 TTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC-KYCSSS 129
T TT + S G Y +++ GTP + I DTGS L W C C K C +
Sbjct: 135 TAATTLPAKDGSIIGSGNYFVTVGLGTPKKDFSLIFDTGSDLTWTQCE---PCVKSCYNQ 191
Query: 130 KIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG- 188
K F P S+S + C + C + + +C S C Y + YG
Sbjct: 192 KEAIFNPSQSTSYANISCGSTLCDSLASATGNIFNC------ASSTCV-----YGIQYGD 240
Query: 189 SGLTEGIALSETLNL-PNRIIPNFLVGC---SVLSSRQPAGIAGFGRGKTSLPSQL--NL 242
S + G E L+L + +F GC + AG+ G GR K SL SQ
Sbjct: 241 SSFSIGFFGKEKLSLTATDVFNDFYFGCGQNNKGLFGGAAGLLGLGRDKLSLVSQTAQRY 300
Query: 243 DK-FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYY 301
+K FSYCL S ++ T L +S S ++TP +A + S +Y
Sbjct: 301 NKIFSYCLPSS----SSSTGFLTFGGSTSKS------ASFTP------LATISGGSSFYG 344
Query: 302 VGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN 361
+ L I+VGG+++ + + GTI+DSGT T + P + L+ F M
Sbjct: 345 LDLTGISVGGRKLAISPSVFS-----TAGTIIDSGTVITRLPPAAYSALSSTFRKLM--- 396
Query: 362 RNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVC 421
++ A AL+ L CFD T S P++ L F GG V + F V + + VC
Sbjct: 397 ---SQYPAAPALSILDTCFDFSNHDTISVPKIGLFFSGGVVVDIDKTGIF-YVNDLTQVC 452
Query: 422 LTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
L + +AS I GN Q + V YD R+GF C
Sbjct: 453 LAFAGNSDAS--DVAIFGNVQQKTLEVVYDGAAGRVGFAPAGC 493
>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 492
Score = 91.3 bits (225), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 109/400 (27%), Positives = 166/400 (41%), Gaps = 63/400 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
G Y + GTPP+ +DTGS ++W CT+ C S +I F P +SSS+ L
Sbjct: 82 GLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASL 141
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCT--QICPSYLVLYGSGL-TEGIALSETL 201
+ C + +C + + Q T C+ +C SY YG G T G +S+ +
Sbjct: 142 VSCSDRRC----YSNFQ----------TESGCSPNNLC-SYSFKYGDGSGTSGYYISDFM 186
Query: 202 NLPNRIIPN--------FLVGCSVLSS-------RQPAGIAGFGRGKTSLPSQLNLDKFS 246
+ I F+ GCS L S R GI G G+G S+ SQL + +
Sbjct: 187 SFDTVITSTLAINSSAPFVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLA 246
Query: 247 YCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
+ SH +++ D YTP V PS +Y V L+
Sbjct: 247 PRVFSHCLKGDKSGGGIMVLGQIKRPDT-----VYTPLV--PS-------QPHYNVNLQS 292
Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN--RNY 364
I V GQ + + T+ GTI+D+GTT ++ E + P Q V N Y
Sbjct: 293 IAVNGQILPIDPSVFTIAT--GDGTIIDTGTTLAYLPDEAYSPFI-----QAVANAVSQY 345
Query: 365 TRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTV 424
R + E+ CF++ FP++ L F GGA + L Y + S +
Sbjct: 346 GRPITYESYQ----CFEITAGDVDVFPQVSLSFAGGASMVLGPRAYLQIF-SSSGSSIWC 400
Query: 425 VTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ + S ILG+ +++ V YDL QR+G+ + C
Sbjct: 401 IGFQRMSHRRITILGDLVLKDKVVVYDLVRQRIGWAEYDC 440
>gi|388516731|gb|AFK46427.1| unknown [Medicago truncatula]
Length = 435
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 103/406 (25%), Positives = 175/406 (43%), Gaps = 64/406 (15%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y ++ TP + I+D G +W C N Y +SS+ R C+
Sbjct: 47 YKAQINQRTPLVPLNVIVDLGGQFLWVDCENKY----------------ISSTYRPARCR 90
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCT-QICPSYLVLYGSGLTEGIALSETLNLP--- 204
+ +CS + + C DC P N T + P + + + T G + L++
Sbjct: 91 SAQCSLANSDG--CGDCFSSPKPGCNNNTCGVTPDNSITHTA--TSGELAEDVLSIQSSN 146
Query: 205 ------NRIIPNFLVGCS---VLS--SRQPAGIAGFGRGKTSLPSQLN-----LDKFSYC 248
N ++ FL C+ +L + +G+AG GR K +LPSQL KF+ C
Sbjct: 147 GFNPGQNVVVSRFLFSCAPTFLLKGLATGASGMAGLGRTKIALPSQLASAFSFARKFAIC 206
Query: 249 LLSHK----FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFS-----VY 299
L S K F D L N SD LTYTP + NP V+ +AFS
Sbjct: 207 LSSSKGVVLFGDGPYG---FLPNVVFDSDS----LTYTPLLINP-VSTASAFSQGQPSAE 258
Query: 300 YYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMV 359
Y++G++ I + + V + L++D +G GGT + + +T + +++ + D FV +
Sbjct: 259 YFIGVKTIKIDEKVVSLNTSLLSIDNNGVGGTKISTVDPYTVLEASIYKAVTDAFV-KAP 317
Query: 360 KNRNYTRALGAEALTGLRPCF-DVPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEG 417
RN R ++ C+ ++ G + G + P ++L + V V
Sbjct: 318 AARNIKR---VGSVAPFEFCYTNLTGTRLGAAVPTIELFLQNENVVWRIFGANSMVSIND 374
Query: 418 SAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
+CL V + + SI++G +Q++N +++DL +LGF L
Sbjct: 375 EVLCLGFVNGGKNT-RTSIVIGGYQLENNLLQFDLAASKLGFSSLL 419
>gi|225427556|ref|XP_002266575.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 445
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 112/455 (24%), Positives = 174/455 (38%), Gaps = 91/455 (20%)
Query: 39 NPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTP 98
NPS+ YQ L S+ R H + + +N+ S G Y +++S GTP
Sbjct: 50 NPSETKYQRLQKAFRRSILRGNHFR-----AIRASPNDIQSNVISGG-GSYLMNISLGTP 103
Query: 99 PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE 158
P + I DTGS L+W C C C P F PK S + + LGC N C + +
Sbjct: 104 PVSMLGIADTGSDLIWRQC---LPCDDCYKQVEPLFDPKKSKTYKTLGCNNDFCQDLGQQ 160
Query: 159 SIQCRDCNDEPLATSK--------------------NCTQICPSYL--VLYGSGLTEGIA 196
C D+ TS T+ P+ + +G G + G
Sbjct: 161 G----SCGDDNTCTSSYSYGDQSYTRRDLSSETFTIGSTEGDPASFPGLAFGCGHSNGGT 216
Query: 197 LSETLNLPNRIIPNFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
+E + + L LSS+ + G +FSYCL+ D
Sbjct: 217 FNEKDSGLIGLGGGPLSLVMQLSSK----VGG---------------QFSYCLVPLSSDS 257
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
T SS I N + +G TP + + +YY+ L +++G ++V
Sbjct: 258 T--ASSKI--NFGKSAVVSGSGTVSTPLI-------KGTPDTFYYLTLEGMSLGSEKVAF 306
Query: 317 WHKYLTLDRDGNGGT-----IVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
K + ++ I+DSGTT T + + + + T+ +G +
Sbjct: 307 --KGFSKNKSSPAAAEESNIIIDSGTTLTLLPRDFYTDMESAL----------TKVIGGQ 354
Query: 372 ALTGLRPCFDV--PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDRE 429
T R F + G K P + HF GA+V LP N F V + VC +++
Sbjct: 355 TTTDPRGTFSLCYSGVKKLEIPTITAHFI-GADVQLPPLNTF-VQAQEDLVCFSMIPSSN 412
Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ I GN N+ V YDL+N ++ FK C
Sbjct: 413 LA-----IFGNLSQMNFLVGYDLKNNKVSFKPTDC 442
>gi|307103543|gb|EFN51802.1| hypothetical protein CHLNCDRAFT_59135 [Chlorella variabilis]
Length = 746
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 111/407 (27%), Positives = 167/407 (41%), Gaps = 75/407 (18%)
Query: 86 YGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIP-----SFIPKLSS 140
YG + +L GTP + I+DTGS + + PC++ C S P +F P+ SS
Sbjct: 75 YGYFYATLYLGTPAKKFAVIVDTGSTMTYVPCSS------CGSGCGPNHQDAAFDPEASS 128
Query: 141 SSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSET 200
++ + C +PKCS S +C C+ + +++ + S +L L + +AL +
Sbjct: 129 TASRISCTSPKCSC---GSPRC-GCSTQQCTYTRSYAEQSSSSGIL----LEDVLALHD- 179
Query: 201 LNLPNRIIPNFLVGCSVLSS----RQPA-GIAGFGRGKTSLPSQLNL-----DKFSYCLL 250
LP I + GC + RQ A G+ G G S+ +QL D FS C
Sbjct: 180 -GLPGAPI---IFGCETRETGEIFRQRADGLFGLGNSDASVVNQLVKAGVIDDVFSLC-- 233
Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
F +L+L + + L YTP + + YY V + + V
Sbjct: 234 ---FGMVEGDGALLLGDAEV---PGSISLQYTPLLTS------TTHPFYYNVKMLSLAVE 281
Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
GQ + V D GT++DSGTTFT+M +F+ A Y + G
Sbjct: 282 GQLLPVSQSLF----DQGYGTVLDSGTTFTYMPSPVFKAFAGAV-------EKYALSHGL 330
Query: 371 EALTGLRPCFD------VPGEK-----TGSFPELKLHFKGGAEVTLPVENY-FAVVGEGS 418
+ + G P FD P + FP +++ F G + L NY F
Sbjct: 331 KRVPGPDPQFDDICFGQAPSHDDLEALSSVFPSMEVQFDQGTSLVLGPLNYLFVHTFNSG 390
Query: 419 AVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
CL V + A +LG +N V YD NQR+GF LCK
Sbjct: 391 KYCLGVFDNGRAG----TLLGGITFRNVLVRYDRANQRVGFGPALCK 433
>gi|326490656|dbj|BAJ89995.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 110/386 (28%), Positives = 157/386 (40%), Gaps = 66/386 (17%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y I++ G+P ++DTGS + W C C C S F P SS+ C
Sbjct: 127 YLITVGMGSPAVAQTMLIDTGSDVSWVQCK---PCSQCHSQADSLFDPSSSSTYSAFSCT 183
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLT-EGIALSETLNLPNRI 207
+ C+ + C Y V YG G T G S+TL L +
Sbjct: 184 SAACAQLRQRGCSSSQCQ----------------YTVKYGDGSTGSGTYSSDTLALGSST 227
Query: 208 IPNFLVGCSV-----LSSRQPAGIAGFGRGKTSLPSQLN---LDKFSYCLLSHKFDDTTR 259
+ NF GCS L Q AG+ G G G SL +Q FSYCL T
Sbjct: 228 VENFQFGCSQSESGNLLQDQTAGLMGLGGGAESLATQTAGTFGKAFSYCL------PPTP 281
Query: 260 TSSLILDNGSSHSDKKTTG-LTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
SS L G+S T+G + TP + + V YY V L+ I VGG+++ +
Sbjct: 282 GSSGFLTLGAS-----TSGFVVKTPMLRSTQVPS------YYGVLLQAIRVGGRQLNIPA 330
Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
+ G+I+DSGT T + + L+ F + M + Y A+ +
Sbjct: 331 SAF------SAGSIMDSGTIITRLPRTAYSALSSAFKAGM---KQYPP---AQPMGIFDT 378
Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
CFD G+ + S P + L F GGA V L + GS + +D + G I+
Sbjct: 379 CFDFSGQSSVSIPTVALVFSGGAVVDLASDGIIL----GSCLAFAANSDDTSLG----II 430
Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
GN Q + + V YD+ +GFK C
Sbjct: 431 GNVQQRTFEVLYDVGGGAVGFKAGAC 456
>gi|242067691|ref|XP_002449122.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
gi|241934965|gb|EES08110.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
Length = 407
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 100/404 (24%), Positives = 156/404 (38%), Gaps = 68/404 (16%)
Query: 84 HSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPC-TNHYQCKYCSSSKIPSFIPKLSSSS 142
H G + ++++ G P + +DTGS+L W C CK C+ P + PK
Sbjct: 35 HPTGHFYVTMNIGEPAKPYFLDIDTGSNLTWIKCHATPGPCKTCNKVPHPLYRPK----- 89
Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTE-GIALSETL 201
+L+ C +P C +H + +DC +EP C Y + Y G T G+ L +
Sbjct: 90 KLVPCADPLCDALHKDLGTTKDCREEP----DQC-----HYQINYADGTTSLGVLLLDKF 140
Query: 202 NLPNRIIPNFLVGCSVLSSRQPA----------GIAGFGRGKTSLPSQLNLDKFSYCLLS 251
+LP N GC + P GI G GRG L SQL
Sbjct: 141 SLPTGSARNIAFGCGYDQMQGPKKKAPEKVPVDGILGLGRGSVDLVSQL----------- 189
Query: 252 HKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG 311
+ S + N H + G Y F+ +V + +Y Y I+
Sbjct: 190 -------KHSGAVSKNVIGHC-LSSKGGGYL-FIGEENVPSSHLHIIYIYC----ISREP 236
Query: 312 QRVRVWHKYLTLDRDGNG----GTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRA 367
L L R+ G I DSG+T+T++ L L + ++K+ +
Sbjct: 237 NHYSPGQATLHLGRNPIGTKPFKAIFDSGSTYTYLPENLHAQLVSALKASLIKS---SLK 293
Query: 368 LGAEALTGLRPCFDVPG--EKTGSFPE-----LKLHFKGGAEVTLPVENYFAVVGEGSAV 420
L ++ T L C+ P + P+ + L F G +T+P ENY + G G+A
Sbjct: 294 LVSDTDTRLHLCWKGPKPFKTVHDLPKEFKSLVTLKFDHGVTMTIPPENYLIITGHGNA- 352
Query: 421 CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
C ++ E G ++G MQ V +D RL + C
Sbjct: 353 CFGIL---ELPGYDLFVIGGISMQEQLVIHDNEKGRLAWMPSPC 393
>gi|218197468|gb|EEC79895.1| hypothetical protein OsI_21423 [Oryza sativa Indica Group]
Length = 471
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 109/374 (29%), Positives = 147/374 (39%), Gaps = 61/374 (16%)
Query: 103 PFILDTGSHLVWFPCTNHYQCKY--CSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESI 160
P +DT L W C C C + F P+ S +S + C + C +
Sbjct: 147 PMSIDTSIDLPWIQCA---PCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGA 203
Query: 161 QCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNL-PNRIIPNFLVGCSVL 218
C S N Q Y V YG G T G + + L L P+ ++ NF GCS
Sbjct: 204 GC----------SNNQCQ----YFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGCSHA 249
Query: 219 S----SRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSH 271
S +G G G+ SL SQ + FSYC+ +SS L G
Sbjct: 250 VRGNFSASTSGTMSLGGGRQSLLSQTAATFGNAFSYCV-------PDPSSSGFLSLGGPA 302
Query: 272 SDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGT 331
TP V NPS+ Y V LR I VGG+R+ V GG
Sbjct: 303 DGGGAGRFARTPLVRNPSI-----IPTLYLVRLRGIEVGGRRLNVPPVVFA------GGA 351
Query: 332 IVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFP 391
++DS T + P + L F S M Y R G A GL C+D + + P
Sbjct: 352 VMDSSVIITQLPPTAYRALRLAFRSAMAA---YPRVAGGRA--GLDTCYDFVRFTSVTVP 406
Query: 392 ELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREASGGPSIILGNFQMQNYYVEY 450
+ L F GGA V L + V+ EG CL V T + + G +GN Q Q + V Y
Sbjct: 407 AVSLVFDGGAVVRL---DAMGVMVEG---CLAFVPTPGDFALG---FIGNVQQQTHEVLY 457
Query: 451 DLRNQRLGFKQQLC 464
D+ +GF++ C
Sbjct: 458 DVGGGSVGFRRGAC 471
>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 683
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 100/392 (25%), Positives = 169/392 (43%), Gaps = 61/392 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y+ L GTPPQ+ I+DTGS + + PC+ C+ C + P F P LSS+ + +
Sbjct: 79 GYYTTRLWIGTPPQMFALIVDTGSTVTYVPCST---CEQCGRHQDPKFQPDLSSTYQPVK 135
Query: 147 CQ-NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPN 205
C + C ++ +QC + A + + +V +G+ ++ P
Sbjct: 136 CTLDCNCD---NDRMQC--VYERQYAEMSTSSGVLGEDVVSFGN---------QSELAPQ 181
Query: 206 RIIPNFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQL---NLDKFSYCLLSHKFDDT 257
R + GC L S+ GI G GRG S+ QL N+ S+ L D
Sbjct: 182 RAV----FGCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMD-- 235
Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
+++L S SD F + V S YY + L+ I V G+R+ +
Sbjct: 236 VGGGAMVLGGISPPSDMV--------FAQSDPVR-----SPYYNIDLKEIHVAGKRLPLN 282
Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
DG G+++DSGTT+ ++ E F + V ++ +++++ G +
Sbjct: 283 PSVF----DGKHGSVLDSGTTYAYLPEEAFLAFKEAIVKEL---QSFSQISGPDPNYN-D 334
Query: 378 PCFDVPG----EKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVVTDREASG 432
CF G + + +FP + + F G + +L ENY F A CL + + +
Sbjct: 335 LCFSGAGIDVSQLSKTFPVVDMIFGNGHKYSLSPENYMFRHSKVRGAYCLGIFQNGK--- 391
Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
P+ +LG ++N V YD ++GF + C
Sbjct: 392 DPTTLLGGIVVRNTLVLYDREQTKIGFWKTNC 423
>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 108/402 (26%), Positives = 168/402 (41%), Gaps = 66/402 (16%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
G Y + G+PP+ +DTGS ++W C + C S I F SS++
Sbjct: 64 GLYFTKVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGQ 123
Query: 145 LGCQNPKC-SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLN 202
+ C +P C S + + QC D+ C SY YG G T G +S+TL
Sbjct: 124 VRCSDPICTSAVQTTATQCSSQTDQ-------C-----SYTFQYGDGSGTSGYYVSDTLY 171
Query: 203 ----LPNRIIPN----FLVGCSVLSS-------RQPAGIAGFGRGKTSLPSQLNLDKFSY 247
L +I N + GCS S + GI GFG+G+ S+ SQL+ +
Sbjct: 172 FDAILGQSLIDNSSALIVFGCSAYQSGDLTKTDKAVDGIFGFGQGELSVISQLSTRGITP 231
Query: 248 CLLSHKFD-DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
+ SH D + L+L + G+ Y+P V PS +Y + L
Sbjct: 232 RVFSHCLKGDGSGGGILVL------GEILEPGIVYSPLV--PS-------QPHYNLNLLS 276
Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
I V GQ + + + GTIVDSGTT ++ E ++P FVS +
Sbjct: 277 IAVNGQLLPI--DPAAFATSNSQGTIVDSGTTLAYLVAEAYDP----FVSAV-------N 323
Query: 367 ALGAEALTGL----RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCL 422
A+ + ++T + C+ V + FP +F GGA + L E+Y G +
Sbjct: 324 AIVSPSVTPITSKGNQCYLVSTSVSQMFPLASFNFAGGASMVLKPEDYLIPFGSSGGSAM 383
Query: 423 TVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ ++ G ILG+ +++ YDL QR+G+ C
Sbjct: 384 WCIGFQKVQG--VTILGDLVLKDKIFVYDLVRQRIGWANYDC 423
>gi|297605079|ref|NP_001056639.2| Os06g0121800 [Oryza sativa Japonica Group]
gi|255676668|dbj|BAF18553.2| Os06g0121800 [Oryza sativa Japonica Group]
Length = 487
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 109/374 (29%), Positives = 147/374 (39%), Gaps = 61/374 (16%)
Query: 103 PFILDTGSHLVWFPCTNHYQCKY--CSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESI 160
P +DT L W C C C + F P+ S +S + C + C +
Sbjct: 163 PMSIDTSIDLPWIQCA---PCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGA 219
Query: 161 QCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNL-PNRIIPNFLVGCSVL 218
C S N Q Y V YG G T G + + L L P+ ++ NF GCS
Sbjct: 220 GC----------SNNQCQ----YFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGCSHA 265
Query: 219 S----SRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSH 271
S +G G G+ SL SQ + FSYC+ +SS L G
Sbjct: 266 VRGNFSASTSGTMSLGGGRQSLLSQTAATFGNAFSYCV-------PDPSSSGFLSLGGPA 318
Query: 272 SDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGT 331
TP V NPS+ Y V LR I VGG+R+ V GG
Sbjct: 319 DGGGAGRFARTPLVRNPSI-----IPTLYLVRLRGIEVGGRRLNVPPVVFA------GGA 367
Query: 332 IVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFP 391
++DS T + P + L F S M Y R G A GL C+D + + P
Sbjct: 368 VMDSSVIITQLPPTAYRALRLAFRSAMAA---YPRVAGGRA--GLDTCYDFVRFTSVTVP 422
Query: 392 ELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREASGGPSIILGNFQMQNYYVEY 450
+ L F GGA V L + V+ EG CL V T + + G +GN Q Q + V Y
Sbjct: 423 AVSLVFDGGAVVRL---DAMGVMVEG---CLAFVPTPGDFALG---FIGNVQQQTHEVLY 473
Query: 451 DLRNQRLGFKQQLC 464
D+ +GF++ C
Sbjct: 474 DVGGGSVGFRRGAC 487
>gi|21717160|gb|AAM76353.1|AC074196_11 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433304|gb|AAP54833.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125575544|gb|EAZ16828.1| hypothetical protein OsJ_32300 [Oryza sativa Japonica Group]
Length = 419
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 107/396 (27%), Positives = 157/396 (39%), Gaps = 62/396 (15%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSSSR 143
Y + + GTPPQ + I+D LVW QC C SS ++P F P S++ R
Sbjct: 62 YVANFTIGTPPQAVSGIVDLSGELVW------TQCAACRSSGCFKQELPVFDPSASNTYR 115
Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL 203
C +P C +SI R+C+ + C PS G T GIA ++ + +
Sbjct: 116 AEQCGSPLC-----KSIPTRNCSGD-----GECGYEAPSMF-----GDTFGIASTDAIAI 160
Query: 204 PNRIIPNFLVGCSVLSSRQ-------PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDD 256
N GC V S P+G G GR SL Q N+ FSYCL H
Sbjct: 161 GNAE-GRLAFGCVVASDGSIDGAMDGPSGFVGLGRTPWSLVGQSNVTAFSYCLALHG--- 216
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
+ S+L L + + + + S + YY V L I G V
Sbjct: 217 PGKKSALFLGASAKLAGAGKSNPPTPLLGQHASNTSDDGSDPYYTVQLEGIKAGDVAVAA 276
Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLA--DEFVSQMVKNRNYTRALGAEALT 374
GG I T + E F PL+ + Q ++ + T ALG+ ++
Sbjct: 277 ASS--------GGGAI-------TVLQLETFRPLSYLPDAAYQALE-KVVTAALGSPSMA 320
Query: 375 GLRPCFDV--PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSA-VCLTVVTD---R 428
FD+ P+L F+GGA +T Y G G+ VCL++++
Sbjct: 321 NPPEPFDLCFQNAAVSGVPDLVFTFQGGATLTAQPSKYLLGDGNGNGTVCLSILSSTRLD 380
Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
A G S ILG+ +N + +DL + L F+ C
Sbjct: 381 SADDGVS-ILGSLLQENVHFLFDLEKETLSFEPADC 415
>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 420
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 108/392 (27%), Positives = 165/392 (42%), Gaps = 59/392 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + LS GTPP I I DTGS L W C C C + P F P+ S++ R +
Sbjct: 70 GHYLMELSIGTPPFKIYGIADTGSDLTWTSCV---PCNNCYKQRNPMFDPQKSTTYRNIS 126
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNL-- 203
C +S C + + K C +Y Y S +T G+ ET+ L
Sbjct: 127 C----------DSKLCHKLDTGVCSPQKRC-----NYTYAYASAAITRGVLAQETITLSS 171
Query: 204 -PNRIIP--NFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNL----DKFSYCLLSH 252
+ +P + GC ++ GI G G G SL SQ+ +FS CL+
Sbjct: 172 TKGKSVPLKGIVFGCGHNNTGGFNDHEMGIIGLGGGPVSLISQMGSSFGGKRFSQCLVPF 231
Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
D + +S + GS S K V+ P VA+++ Y+V L I+V
Sbjct: 232 H-TDVSVSSKMSFGKGSKVSGKGV--------VSTPLVAKQD--KTPYFVTLLGISVENT 280
Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
+ +++ G +DSGT T + +L+ D+ V+Q V++ + + +
Sbjct: 281 YLHFNGSSQNVEK---GNMFLDSGTPPTILPTQLY----DQVVAQ-VRSEVAMKPVTDDP 332
Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG 432
G + C+ G P L HF+ GA+V L F +G CL T+ + G
Sbjct: 333 DLGPQLCYRTKNNLRG--PVLTAHFE-GADVKLSPTQTFISPKDG-VFCLG-FTNTSSDG 387
Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
G + GNF NY + +DL Q + FK + C
Sbjct: 388 G---VYGNFAQSNYLIGFDLDRQVVSFKPKDC 416
>gi|224090425|ref|XP_002308984.1| predicted protein [Populus trichocarpa]
gi|222854960|gb|EEE92507.1| predicted protein [Populus trichocarpa]
Length = 416
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 100/405 (24%), Positives = 158/405 (39%), Gaps = 81/405 (20%)
Query: 97 TPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIH 156
TP + LD G +W C Y +SSS + C +CS
Sbjct: 41 TPLVPVEVTLDLGGQYLWVDCQQGY----------------VSSSKKNPSCNTAQCSLAV 84
Query: 157 HESIQC----RDCNDEP------LATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
+ C + C P TS TQ S GS P R
Sbjct: 85 YRLKTCTVDKKFCVLSPDNTATRTGTSDYLTQDVVSIQSTDGSN-------------PGR 131
Query: 207 II--PNFLVGCS---VLS--SRQPAGIAGFGRGKTSLPSQLNL-----DKFSYCLLSHKF 254
++ PNFL C+ +L ++ G+AG GR K SLPSQ + KF+ CL S
Sbjct: 132 VVSVPNFLFSCAPTFILQGLAKGVKGMAGLGRTKISLPSQFSAAFSFPKKFAICLTS--- 188
Query: 255 DDTTRTSSLILDNGS----SHSDKKTTGLTYTPFVNNPSVAERNAF----SVYYYVGLRR 306
+ +I +G H+D + L YTP + NP F S Y++G++
Sbjct: 189 --SNAKGVVIFGDGPYVLLPHADDLSQSLIYTPLILNPVSTASGYFEGEPSTDYFIGVKS 246
Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
I + V + L+++R+G GGT + + +T M ++ + D FV ++ K N R
Sbjct: 247 IKINENVVPLNASLLSINREGYGGTKISTVNAYTVMETTIYNAVTDSFVRELAK-ANVPR 305
Query: 367 ALGAEALTGLRPCFDVPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAV----- 420
++ + G + P++ L + Y+ + G S V
Sbjct: 306 VASVAPFGACFNSKNIGSTRVGPAVPQIDLVLQSK-------NVYWRIFGANSMVQVKDD 358
Query: 421 --CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
CL V D + SI++G Q+++ +++DL RLGF L
Sbjct: 359 VLCLGFV-DGGVNPRTSIVIGGHQLEDNLLQFDLAASRLGFSSSL 402
>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 629
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 103/398 (25%), Positives = 169/398 (42%), Gaps = 73/398 (18%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y+ L GTPPQ I+D+GS + + PC + C+ C + + P F P LSS+ +
Sbjct: 83 GYYTTRLYIGTPPQEFALIVDSGSTVTYVPCAS---CEQCGNHQDPRFQPDLSSTYSPVK 139
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
C + C+ +S QC + A + + + +V +G+ E+ P R
Sbjct: 140 C-SADCTCDSDKS-QC--TYERQYAEMSSSSGVLGEDIVSFGT---------ESELKPQR 186
Query: 207 IIPNFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQLNL-----DKFSYCLLSHKFDD 256
+ GC L S+ GI G GRG+ S+ QL D FS C
Sbjct: 187 AV----FGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIG- 241
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
+++L + D + + V +P YY + L+ I V G+ +R+
Sbjct: 242 ---GGAMVLGAMPAPPDMV---FSRSDPVRSP----------YYNIELKEIHVAGKALRL 285
Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
+ D GT++DSGTT+ ++ + F D S++ R L + + G
Sbjct: 286 DPRIF----DSKHGTVLDSGTTYAYLPEQAFVAFKDAVTSKV-------RPL--KKIRGP 332
Query: 377 RP-----CFDVPGEKTG----SFPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVVT 426
P CF G +FP++ + F G +++L ENY F A CL V
Sbjct: 333 DPNYKDICFAGAGRNVSQLSQAFPDVDMVFGDGQKLSLSPENYLFRHSKVEGAYCLGVFQ 392
Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ + P+ +LG ++N V YD N+++GF + C
Sbjct: 393 NGK---DPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNC 427
>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 106/389 (27%), Positives = 156/389 (40%), Gaps = 59/389 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKY-CSSSKIPSFIPKLSSSSRLL 145
G Y L GTP ++DTGS L W C+ C C P F P+ SS+ +
Sbjct: 132 GNYVTQLGLGTPSTSYAMVVDTGSSLTWLQCS---PCVVSCHRQVGPLFDPRASSTYASV 188
Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLP 204
C +C + +Q N + S C Y YG S + G ++T++
Sbjct: 189 RCSASQC-----DELQAATLNPSACSASNVCI-----YQASYGDSSFSVGSLSTDTVSFG 238
Query: 205 NRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTT 258
+ P+F GC + + AG+ G R K SL QL FSYCL T
Sbjct: 239 STRYPSFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFSYCL-------PT 291
Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW- 317
S+ L G ++ +YTP +A + + Y++ L ++VGG + V
Sbjct: 292 AASTGYLSIGPYNTGHY---YSYTP------MASSSLDASLYFITLSGMSVGGSPLAVSP 342
Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
+Y +L TI+DSGT T + + L+ M + A A + L
Sbjct: 343 SEYSSLP------TIIDSGTVITRLPTAVHTALSKAVAQAMAGAQR------APAFSILD 390
Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREASGGPSI 436
CF+ + P + + F GGA + L N V + S CL TD A
Sbjct: 391 TCFEGQASQL-RVPTVAMAFAGGASMKLTTRNVLIDV-DDSTTCLAFAPTDSTA------ 442
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
I+GN Q Q + V YD+ R+GF C
Sbjct: 443 IIGNTQQQTFSVIYDVAQSRIGFSAGGCS 471
>gi|23954367|emb|CAD27730.1| xylanase inhibitor [Triticum aestivum]
gi|56201268|dbj|BAD72880.1| xylanase inhibitor TAXI-I [Triticum aestivum]
Length = 402
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 98/388 (25%), Positives = 158/388 (40%), Gaps = 72/388 (18%)
Query: 104 FILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLL-------GCQNPKCSWIH 156
+LD LVW C + P+ IP SS + LL GC P C
Sbjct: 47 LVLDVAGPLVW---------STCDGGQPPAEIP-CSSPTCLLANAYPAPGCPAPSCGSDK 96
Query: 157 HESIQCRDCNDEPL-ATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGC 215
H+ + C P S C S+ + + T+G +N+ L C
Sbjct: 97 HD----KPCTAYPYNPVSGACAAGSLSH-TRFVANTTDGSKPVSKVNV------GVLAAC 145
Query: 216 S---VLSS--RQPAGIAGFGRGKTSLPSQLN-----LDKFSYCLLSHKFDDTTRTSSLIL 265
+ +L+S R G+AG +LP+Q+ ++F CL T I
Sbjct: 146 APSKLLASLPRGSTGVAGLANSGLALPAQVASAQKVANRFLLCL------PTGGPGVAIF 199
Query: 266 DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
G + T + YTP V S +Y+ R I VG RV V L
Sbjct: 200 GGGPVPWPQFTQSMPYTPLVTK-------GGSPAHYISARSIVVGDTRVPVPEGALA--- 249
Query: 326 DGNGGTIVDSGTTFTFMAPELFEPLADEF----VSQMVKNRNYTRALGAEALTGLRPCFD 381
GG ++ + + + P+++ PL D F +Q RA+ A A G+ C+D
Sbjct: 250 --TGGVMLSTRLPYVLLRPDVYRPLMDAFTKALAAQHANGAPVARAVEAVAPFGV--CYD 305
Query: 382 VP--GEKTGSF--PELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG---- 433
G G + P ++L GG++ T+ +N V +G+A C+ V + + G
Sbjct: 306 TKTLGNNLGGYAVPNVQLGLDGGSDWTMTGKNSMVDVKQGTA-CVAFVEMKGVAAGDGRA 364
Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQ 461
P++ILG QM+++ +++D+ +RLGF +
Sbjct: 365 PAVILGGAQMEDFVLDFDMEKKRLGFSR 392
>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
Length = 631
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 111/437 (25%), Positives = 176/437 (40%), Gaps = 72/437 (16%)
Query: 44 SYQNLNSLVSSSLTRAL-HIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQII 102
+Y N L +SS R L NP + T G Y+ L GTP Q
Sbjct: 53 AYPNATRLPASSARRGLGDGHNPNARMRLHDDLLTN--------GYYTTRLYIGTPSQEF 104
Query: 103 PFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQC 162
I+D+GS + + PC C+ C + + P F P LSS+ + C N C+ +E QC
Sbjct: 105 ALIVDSGSTVTYVPCAT---CEQCGNHQDPRFQPDLSSTYSPVKC-NVDCT-CDNERSQC 159
Query: 163 RDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCS-----V 217
+ ++ S VL ++ G E+ P R + GC
Sbjct: 160 --------TYERQYAEMSSSSGVLGEDIMSFG---KESELKPQRAV----FGCENTETGD 204
Query: 218 LSSRQPAGIAGFGRGKTSLPSQLNL-----DKFSYCLLSHKFDDTTRTSSLILDNGSSHS 272
L S+ GI G GRG+ S+ QL D FS C T ++L +
Sbjct: 205 LFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGT----MVLGGMPAPP 260
Query: 273 DKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTI 332
D +++ V +P YY + L+ I V G+ +R+ K + GT+
Sbjct: 261 DMV---FSHSNPVRSP----------YYNIELKEIHVAGKALRLDPKIF----NSKHGTV 303
Query: 333 VDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGS--- 389
+DSGTT+ ++ + F D +++ N + + CF G
Sbjct: 304 LDSGTTYAYLPEQAFVAFKDAVTNKV----NSLKKIRGPDPNYKDICFAGAGRNVSQLSE 359
Query: 390 -FPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYY 447
FP++ + F G +++L ENY F A CL V + + P+ +LG ++N
Sbjct: 360 VFPDVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGK---DPTTLLGGIVVRNTL 416
Query: 448 VEYDLRNQRLGFKQQLC 464
V YD N+++GF + C
Sbjct: 417 VTYDRHNEKIGFWKTNC 433
>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
gi|194704078|gb|ACF86123.1| unknown [Zea mays]
gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 471
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 106/389 (27%), Positives = 156/389 (40%), Gaps = 59/389 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKY-CSSSKIPSFIPKLSSSSRLL 145
G Y L GTP ++DTGS L W C+ C C P F P+ SS+ +
Sbjct: 132 GNYVTQLGLGTPSTSYAMVVDTGSSLTWLQCS---PCVVSCHRQVGPLFDPRASSTYTSV 188
Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLP 204
C +C + +Q N + S C Y YG S + G ++T++
Sbjct: 189 RCSASQC-----DELQAATLNPSACSASNVCI-----YQASYGDSSFSVGYLSTDTVSFG 238
Query: 205 NRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTT 258
+ P+F GC + + AG+ G R K SL QL FSYCL T
Sbjct: 239 STSYPSFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFSYCL-------PT 291
Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW- 317
S+ L G ++ +YTP +A + + Y++ L ++VGG + V
Sbjct: 292 AASTGYLSIGPYNTGHY---YSYTP------MASSSLDASLYFITLSGMSVGGSPLAVSP 342
Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
+Y +L TI+DSGT T + + L+ M + A A + L
Sbjct: 343 SEYSSLP------TIIDSGTVITRLPTAVHTALSKAVAQAMAGAQR------APAFSILD 390
Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREASGGPSI 436
CF+ + P + + F GGA + L N V + S CL TD A
Sbjct: 391 TCFEGQASQL-RVPTVVMAFAGGASMKLTTRNVLIDV-DDSTTCLAFAPTDSTA------ 442
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
I+GN Q Q + V YD+ R+GF C
Sbjct: 443 IIGNTQQQTFSVIYDVAQSRIGFSAGGCS 471
>gi|218186446|gb|EEC68873.1| hypothetical protein OsI_37494 [Oryza sativa Indica Group]
Length = 353
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 103/389 (26%), Positives = 167/389 (42%), Gaps = 54/389 (13%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTN-HYQCKYCSSSKIPSFIPKLSSSSRLLGC 147
Y + +S GTPP +DTGS L W C N +C ++ F P SS+ +GC
Sbjct: 6 YFMGISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSKVGC 65
Query: 148 QNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPS--YLVLYGSG-LTEGIALSETLNLP 204
C+ +H + LA C + + Y + YGSG + G + L L
Sbjct: 66 STEACNGMHMD-----------LAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLA 114
Query: 205 -NRIIPNFLVGCSV--LSSRQPAGIAGFGRGKTSLPSQL----NLDKFSYCLLSHKFDDT 257
NR I NF+ GC L + AGI GFG S +Q+ + FSYC D
Sbjct: 115 SNRSIDNFIFGCGEDNLYNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPR----DH 170
Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
SL + + + T L Y + + P+ Y + + V G R+ +
Sbjct: 171 ENEGSLTIGPYARDINLMWTKLIY--YDHKPA----------YAIQQLDMMVNGIRLEI- 217
Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
Y+ + + TIVDSGT T++ +F+ L D+ +++ ++ + YTR R
Sbjct: 218 DPYIYISK----MTIVDSGTADTYILSPVFDAL-DKAMTKEMQAKGYTRGWDER-----R 267
Query: 378 PCF--DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPS 435
CF + FP +++ + + LPVEN F + +C T + D G
Sbjct: 268 ICFISNSGSANWNDFPTVEMKLI-RSTLKLPVENAF-YESSNNVICSTFLPDDAGVRGVQ 325
Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+LGN ++++ + +D++ GFK + C
Sbjct: 326 -MLGNRAVRSFKLVFDIQAMNFGFKARAC 353
>gi|77553049|gb|ABA95845.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 372
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 103/389 (26%), Positives = 167/389 (42%), Gaps = 54/389 (13%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTN-HYQCKYCSSSKIPSFIPKLSSSSRLLGC 147
Y + +S GTPP +DTGS L W C N +C ++ F P SS+ +GC
Sbjct: 25 YFMGISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSKVGC 84
Query: 148 QNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPS--YLVLYGSG-LTEGIALSETLNLP 204
C+ +H + LA C + + Y + YGSG + G + L L
Sbjct: 85 STEACNGMHMD-----------LAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLA 133
Query: 205 -NRIIPNFLVGCSV--LSSRQPAGIAGFGRGKTSLPSQL----NLDKFSYCLLSHKFDDT 257
NR I NF+ GC L + AGI GFG S +Q+ + FSYC D
Sbjct: 134 SNRSIDNFIFGCGEDNLYNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPR----DH 189
Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
SL + + + T L Y + + P+ Y + + V G R+ +
Sbjct: 190 ENEGSLTIGPYARDINLMWTKLIY--YDHKPA----------YAIQQLDMMVNGIRLEI- 236
Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
Y+ + + TIVDSGT T++ +F+ L D+ +++ ++ + YTR R
Sbjct: 237 DPYIYISK----MTIVDSGTADTYILSPVFDAL-DKAMTKEMQAKGYTRGWDER-----R 286
Query: 378 PCF--DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPS 435
CF + FP +++ + + LPVEN F + +C T + D G
Sbjct: 287 ICFISNSGSANWNDFPTVEMKLI-RSTLKLPVENAF-YESSNNVICSTFLPDDAGVRGVQ 344
Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+LGN ++++ + +D++ GFK + C
Sbjct: 345 -MLGNRAVRSFKLVFDIQAMNFGFKARAC 372
>gi|55669876|pdb|1T6E|X Chain X, Crystal Structure Of The Triticum Aestivum Xylanase
Inhibitor I
gi|55669877|pdb|1T6G|A Chain A, Crystal Structure Of The Triticum Aestivum Xylanase
Inhibitor-i In Complex With Aspergillus Niger Xylanase-i
gi|55669878|pdb|1T6G|B Chain B, Crystal Structure Of The Triticum Aestivum Xylanase
Inhibitor-i In Complex With Aspergillus Niger Xylanase-i
Length = 381
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 98/388 (25%), Positives = 158/388 (40%), Gaps = 72/388 (18%)
Query: 104 FILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLL-------GCQNPKCSWIH 156
+LD LVW C + P+ IP SS + LL GC P C
Sbjct: 26 LVLDVAGPLVW---------STCDGGQPPAEIP-CSSPTCLLANAYPAPGCPAPSCGSDK 75
Query: 157 HESIQCRDCNDEPL-ATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGC 215
H+ + C P S C S+ + + T+G +N+ L C
Sbjct: 76 HD----KPCTAYPYNPVSGACAAGSLSH-TRFVANTTDGSKPVSKVNV------GVLAAC 124
Query: 216 S---VLSS--RQPAGIAGFGRGKTSLPSQLN-----LDKFSYCLLSHKFDDTTRTSSLIL 265
+ +L+S R G+AG +LP+Q+ ++F CL T I
Sbjct: 125 APSKLLASLPRGSTGVAGLANSGLALPAQVASAQKVANRFLLCL------PTGGPGVAIF 178
Query: 266 DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
G + T + YTP V S +Y+ R I VG RV V L
Sbjct: 179 GGGPVPWPQFTQSMPYTPLVTK-------GGSPAHYISARSIVVGDTRVPVPEGALA--- 228
Query: 326 DGNGGTIVDSGTTFTFMAPELFEPLADEF----VSQMVKNRNYTRALGAEALTGLRPCFD 381
GG ++ + + + P+++ PL D F +Q RA+ A A G+ C+D
Sbjct: 229 --TGGVMLSTRLPYVLLRPDVYRPLMDAFTKALAAQHANGAPVARAVEAVAPFGV--CYD 284
Query: 382 VP--GEKTGSF--PELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGG---- 433
G G + P ++L GG++ T+ +N V +G+A C+ V + + G
Sbjct: 285 TKTLGNNLGGYAVPNVQLGLDGGSDWTMTGKNSMVDVKQGTA-CVAFVEMKGVAAGDGRA 343
Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQ 461
P++ILG QM+++ +++D+ +RLGF +
Sbjct: 344 PAVILGGAQMEDFVLDFDMEKKRLGFSR 371
>gi|242041951|ref|XP_002468370.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
gi|241922224|gb|EER95368.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
Length = 408
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 104/432 (24%), Positives = 162/432 (37%), Gaps = 61/432 (14%)
Query: 40 PSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPP 99
PS +++ +L + R L + + ++ T+ S + Y + GTP
Sbjct: 32 PSPSPLESIIALARADDARLLFLSSKAASSSGGVTSAPVA--SGQTPPSYVVRAGLGTPV 89
Query: 100 QIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHES 159
Q + LDT + W C C C + FIP SSS L C + C +
Sbjct: 90 QQLLLALDTSADATWSHCA---PCDTCPAGS--RFIPASSSSYASLPCASDWCPLFRRPA 144
Query: 160 IQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVG---CS 216
+ EP G A L +R + ++ C
Sbjct: 145 VP-----GEP--------------------GRVGAAADVRLLQAASRTPRSGVLAATRCG 179
Query: 217 VLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHK---FDDTTRTSSLILDNGSSHSD 273
+ PA +G + S+ N FSYCL S++ F + R G++
Sbjct: 180 WARTPSPATRSGPMSLLSQTGSRYN-GVFSYCLPSYRSYYFSGSLRL-------GAAGQP 231
Query: 274 KKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIV 333
+ + YTP + NP R + YYV + ++VG V+ D GT++
Sbjct: 232 RN---VRYTPLLTNP---HRPSL---YYVNVTGLSVGRALVKAPAGSFAFDPSTGAGTVI 282
Query: 334 DSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPEL 393
DSGT T ++ L DEF Q+ YT +LGA CF+ G P +
Sbjct: 283 DSGTVITRWTAPVYAALRDEFRRQVAAPSGYT-SLGA-----FDTCFNTDEVAAGGAPPV 336
Query: 394 KLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLR 453
LH GG ++TLP+EN CL + + ++ N Q QN V D+
Sbjct: 337 TLHMGGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNSVVNVVANLQQQNVRVVVDVA 396
Query: 454 NQRLGFKQQLCK 465
R+GF ++ C
Sbjct: 397 GSRVGFAREPCN 408
>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 500
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 101/397 (25%), Positives = 167/397 (42%), Gaps = 54/397 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
G Y + G PP+ +DTGS ++W C + C S +IP F P S+++ L
Sbjct: 81 GLYYTRVQLGNPPKDFYVQIDTGSDVLWVSCNSCNGCPATSGLQIPLNFFDPGSSTTASL 140
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNL 203
+ C + C+ +Q D S C +Y+ YG G T G + + ++L
Sbjct: 141 VSCSDQICAL----GVQSSD--SACFGQSNQC-----AYVFQYGDGSGTSGYYVMDMIHL 189
Query: 204 PNRI--------IPNFLVGCSV-------LSSRQPAGIAGFGRGKTSLPSQLNLDKFSYC 248
I + + GCS S R GI GFG+ S+ SQL+ +
Sbjct: 190 DVVIDSSVTSNSSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQDLSVISQLSSRGIAPK 249
Query: 249 LLSHKFD-DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRI 307
+ SH D + L+L + + YTP V PS +Y + L+ I
Sbjct: 250 VFSHCLKGDDSGGGILVL------GEIVEPNVVYTPLV--PS-------QPHYNLNLQSI 294
Query: 308 TVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRA 367
+V GQ + + + GTI+DSGTT ++A E + V+ +V +++
Sbjct: 295 SVNGQVLPISPAVFATSS--SQGTIIDSGTTLAYLAEEAYNAFVVA-VTNIV-----SQS 346
Query: 368 LGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTD 427
+ L G R C+ + FP++ L+F GGA + L ++Y + +
Sbjct: 347 TQSVVLKGNR-CYVTSSSVSDIFPQVSLNFAGGASLVLGAQDYLIQQNSVGGTTVWCIGF 405
Query: 428 REASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
++ G ILG+ +++ YDL NQR+G+ C
Sbjct: 406 QKIPGQGITILGDLVLKDKIFIYDLANQRIGWTNYDC 442
>gi|356535355|ref|XP_003536212.1| PREDICTED: basic 7S globulin-like [Glycine max]
Length = 444
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 103/410 (25%), Positives = 179/410 (43%), Gaps = 89/410 (21%)
Query: 97 TPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIH 156
TP + +D G W C Y +SS+S+ C + +CS
Sbjct: 60 TPLVPVKLTVDLGGGYFWVNCEKGY----------------VSSTSKPARCGSAQCSLFG 103
Query: 157 HESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETL-NLPNRII--PNFLV 213
CN E S++ + + + +G + +A++ T N P R++ P FL
Sbjct: 104 -----LYGCNVEDKICSRSLSNTV-TGVSTFGEIHADVVAINATDGNNPVRVVSVPKFLF 157
Query: 214 --GCSVLSSRQPAGI---AGFGRGKTSLPSQLN-----LDKFSYCLLSHKFDDTTRTSSL 263
G +V+ + +G+ AG GR K SLPSQ + L KF+ CL S +T T+ +
Sbjct: 158 ICGANVVQNGLASGVTGMAGLGRTKVSLPSQFSSAFSFLRKFAICLSS-----STMTNGV 212
Query: 264 IL------DNGSSHSDKKTTGLTYTPFVNNPSVAERNAF----SVYYYVGLRRITVGGQR 313
+ + G +SD LT+TP + NP + F SV Y++G++ I V +
Sbjct: 213 MFFGDGPYNFGYLNSDLSKV-LTFTPLITNPVSTAPSYFQGEPSVEYFIGVKSIRVSDKN 271
Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
V + L++DR+G GGT + + +T + +++ +++ FV +A+GA +
Sbjct: 272 VPLNTTLLSIDRNGIGGTKISTVNPYTVLETTIYKAVSEAFV----------KAVGAPTV 321
Query: 374 TGLRP---CF---DVPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT 426
+ P CF D+ + G + P++ L + EV + ++V +CL V
Sbjct: 322 APVAPFGTCFATKDIQSTRMGPAVPDINLVLQN--EVVWSIIGANSMVYTNDVICLGFV- 378
Query: 427 DREASGGP----------------SIILGNFQMQNYYVEYDLRNQRLGFK 460
+A P SI +G Q++N +++DL RLGF+
Sbjct: 379 --DAGSDPSTAQVGFVVGYSQPITSITIGAHQLENNMLQFDLATSRLGFR 426
>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 449
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 102/387 (26%), Positives = 162/387 (41%), Gaps = 59/387 (15%)
Query: 92 SLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPK 151
++S G PP ++DTGS ++W CT C C + F P +SS+ L C+ P
Sbjct: 104 NISIGQPPIPQLVVMDTGSDILWVMCT---PCTNCDNHLGLLFDPSMSSTFSPL-CKTP- 158
Query: 152 CSWIHHESIQCRDCNDEPLAT--SKNCTQICPSYLVLYGSGL-TEGIALSETLNLPNRII 208
C + C C+ P + N T SG+ + ET + I
Sbjct: 159 CDFK-----GCSRCDPIPFTVTYADNST----------ASGMFGRDTVVFETTDEGTSRI 203
Query: 209 PNFLVGC--SVLSSRQPA--GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLI 264
P+ L GC ++ P GI G G SL +++ KFSYC+ D LI
Sbjct: 204 PDVLFGCGHNIGQDTDPGHNGILGLNNGPDSLATKIG-QKFSYCI-GDLADPYYNYHQLI 261
Query: 265 LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLD 324
L G+ T PF + +YYV + I+VG +R+ + + +
Sbjct: 262 LGEGADLEGYST------PF---------EVHNGFYYVTMEGISVGEKRLDIAPETFEMK 306
Query: 325 RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYT----RALGAEALTGLRPCF 380
++ GG I+D+G+T TF+ + L+ E RN R E ++ +
Sbjct: 307 KNRTGGVIIDTGSTITFLVDSVHRLLSKEV-------RNLLGWSFRQTTIEKSPWMQCFY 359
Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTV--VTDREASGGPSIIL 438
FP + HF GA++ L ++F + + + C+TV V+ PS+I
Sbjct: 360 GSISRDLVGFPVVTFHFADGADLALDSGSFFNQLND-NVFCMTVGPVSSLNLKSKPSLI- 417
Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLCK 465
G Q+Y V YDL NQ + F++ C+
Sbjct: 418 GLLAQQSYSVGYDLVNQFVYFQRIDCE 444
>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
gi|223973065|gb|ACN30720.1| unknown [Zea mays]
Length = 631
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 102/400 (25%), Positives = 165/400 (41%), Gaps = 77/400 (19%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y+ L GTPPQ I+D+GS + + PC++ C+ C + + P F P LSSS +
Sbjct: 86 GYYTTRLYIGTPPQEFALIVDSGSTVTYVPCSS---CEQCGNHQDPRFQPDLSSSYSPVK 142
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETL---NL 203
C N C+ C+ + K CT Y Y + L E +
Sbjct: 143 C-NVDCT-----------CDSD----KKQCT-----YERQYAEMSSSSGVLGEDIVSFGR 181
Query: 204 PNRIIPNFLV-GCS-----VLSSRQPAGIAGFGRGKTSLPSQLNL-----DKFSYCLLSH 252
+ + P + GC L S+ GI G GRG+ S+ QL D FS C
Sbjct: 182 ESELKPQHAIFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGM 241
Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTP---FVNNPSVAERNAFSVYYYVGLRRITV 309
+++L G+ P F N+ + S YY + L+ I V
Sbjct: 242 DIG----GGAMVL-----------GGMLAPPDMIFSNSDPLR-----SPYYNIELKEIHV 281
Query: 310 GGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
G+ +RV + + GT++DSGTT+ ++ + F + S++ + + +
Sbjct: 282 AGKALRVESRIF----NSKHGTVLDSGTTYAYLPEQAFVAFKEAVTSKV----HSLKKIR 333
Query: 370 AEALTGLRPCFDVPGEKTGS----FPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTV 424
+ CF G FP++ + F G +++L ENY F A CL V
Sbjct: 334 GPDPSYKDICFAGAGRNVSKLHEVFPDVDMVFGNGQKLSLTPENYLFRHSKVDGAYCLGV 393
Query: 425 VTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ + P+ +LG ++N V YD N+++GF + C
Sbjct: 394 FQNGK---DPTTLLGGIIVRNTLVTYDRHNEKIGFWKTNC 430
>gi|383134454|gb|AFG48206.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
gi|383134458|gb|AFG48208.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
gi|383134460|gb|AFG48209.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
gi|383134462|gb|AFG48210.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
gi|383134464|gb|AFG48211.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
gi|383134466|gb|AFG48212.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
gi|383134468|gb|AFG48213.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
gi|383134470|gb|AFG48214.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
gi|383134474|gb|AFG48216.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
gi|383134486|gb|AFG48222.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
Length = 136
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 47/113 (41%), Positives = 70/113 (61%), Gaps = 8/113 (7%)
Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
IT+GGQR+++ T D++GNGG IVDSGTTFT + L+ + ++ S + Y+R
Sbjct: 1 ITIGGQRLKLPSSLTTFDKEGNGGLIVDSGTTFTMLPESLYRRVLNKLKSAI----RYSR 56
Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPEL---KLHFKGGAEVTLPVENYFAVVGE 416
++ EA GL C+++P GSFP L LHFK A +TLP ENY +++ +
Sbjct: 57 SVKYEAALGLDLCYELP-SAGGSFPVLPTFSLHFKDNATITLPAENYMSMMSD 108
>gi|361066669|gb|AEW07646.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
Length = 136
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 46/113 (40%), Positives = 68/113 (60%), Gaps = 8/113 (7%)
Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
IT+GGQR+++ T D++GNGG IVDSGTTFT + L+ E + ++ Y+R
Sbjct: 1 ITIGGQRLKLPSSLTTFDKEGNGGLIVDSGTTFTMLPESLYR----EVLKKLKSAIRYSR 56
Query: 367 ALGAEALTGLRPCFDVPGEKTGS---FPELKLHFKGGAEVTLPVENYFAVVGE 416
++ EA GL C+++P E GS FP LHFK A + LP ENY +++ +
Sbjct: 57 SVRYEAALGLDLCYELPSE-VGSFPVFPTFSLHFKDNATIRLPAENYMSMMSD 108
>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 506
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 105/407 (25%), Positives = 161/407 (39%), Gaps = 73/407 (17%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSS 141
G Y + G P + +DTGS ++W C+ C C +S ++ F P SS+
Sbjct: 87 GLYFTRVKLGNPAKEYFVQIDTGSDILWVACS---PCTGCPTSSGLNIQLEFFNPDSSST 143
Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSET 200
S + C + +C+ +D P S C Y YG G T G +S+T
Sbjct: 144 SSRIPCSDDRCTAALQTGEAVCQSSDSP---SSPC-----GYTFTYGDGSGTSGFYVSDT 195
Query: 201 LN----LPNRIIPN----FLVGCS-------VLSSRQPAGIAGFGRGKTSLPSQL----- 240
+ + N N + GCS + + R GI GFG+ + S+ SQL
Sbjct: 196 MYFDTVMGNEQTANSSASVVFGCSNSQSGDLMKTDRAVDGIFGFGQHQLSVVSQLYSLGV 255
Query: 241 NLDKFSYCLLSHKFDDTTRTSSLILDNGSS---HSDKKTTGLTYTPFVNNPSVAERNAFS 297
+ FS+CL DNG + GL +TP V PS
Sbjct: 256 SPKTFSHCLKGS-------------DNGGGILVLGEIVEPGLVFTPLV--PS-------Q 293
Query: 298 VYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQ 357
+Y + L I V GQ++ + GTIVDSGTT ++ ++P + +
Sbjct: 294 PHYNLNLESIAVSGQKLPIDSSLFATSN--TQGTIVDSGTTLVYLVDGAYDPFINAIAAA 351
Query: 358 MVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEG 417
+ + + G + CF SFP L+FKGG +T+ ENY G
Sbjct: 352 VSPSVRSVVSKGIQ-------CFVTTSSVDSSFPTATLYFKGGVSMTVKPENYLLQQGSV 404
Query: 418 SAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
L + + + G ILG+ +++ YDL N R+G+ C
Sbjct: 405 DNNVLWCIGWQRSQG--ITILGDLVLKDKIFVYDLANMRMGWADYDC 449
>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
Length = 468
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 108/395 (27%), Positives = 156/395 (39%), Gaps = 69/395 (17%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSSSR 143
Y +++ GTP ++DTGS L W QC+ C+S+ K P F P SS+
Sbjct: 124 YVVTVGLGTPSVSQVLLIDTGSDLSWV------QCQPCNSTTCYPQKDPLFDPSKSSTYA 177
Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLA---TSKNCTQICPSYLVLYGSG-LTEGIALSE 199
+ C + CRD D+ S + C + + YG G T G+ +E
Sbjct: 178 PIPCN----------TDACRDLTDDGYGGGCASGDGAAQC-GFAITYGDGSQTRGVYSNE 226
Query: 200 TLNL-PNRIIPNFLVGCS---VLSSRQPAGIAGFGRGKTSLPSQ---LNLDKFSYCLLSH 252
TL L P + +F GC ++ + G+ G G SL Q + FSYCL +
Sbjct: 227 TLALAPGVAVKDFRFGCGHDQDGANDKYDGLLGLGGAPESLVVQTASVYGGAFSYCLPA- 285
Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
++ +L S T+G +TP + E F Y V + ITVGG+
Sbjct: 286 -LNNQVGFLALGGGGAPSGGVVNTSGFVFTPMIR-----EEETF---YVVNMTGITVGGE 336
Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
+ V + GG I+DSGT T + + L F M Y E
Sbjct: 337 PIDVPPSAFS------GGMIIDSGTVVTELQHTAYNALQAAFRKAMAA---YPLVRNGE- 386
Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG 432
L C+D G + P++ L F GGA + L V N + CL
Sbjct: 387 ---LDTCYDFSGYSNVTLPKVALTFSGGATIDLDVPNGILL-----DDCLAF-----QES 433
Query: 433 GPSI---ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
GP ILGN + V YD R+GF+ +C
Sbjct: 434 GPDDQPGILGNVNQRTLEVLYDAGRGRVGFRAAVC 468
>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 477
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 109/395 (27%), Positives = 150/395 (37%), Gaps = 73/395 (18%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWF---PCTNHYQCKYCSSSKIPSFIPKLSSSSRLL 145
+ + + FGTP Q ILDTGS L W PC+ H C P F P SSS +
Sbjct: 137 FVVVVGFGTPAQTAAIILDTGSDLSWIQCKPCSGH-----CYRQHDPDFDPAKSSSYAAV 191
Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNL- 203
C P C+ A C Y V YG G T G+ +TL
Sbjct: 192 PCGTPVCA-----------------AAGGMCNGTTCLYGVQYGDGSSTTGVLSRDTLTFN 234
Query: 204 PNRIIPNFLVGCSVLSSRQPAGIAGFGR---------GKTSLPSQLNLD---KFSYCLLS 251
+ F GC I FG GK SLPSQ FSYCL S
Sbjct: 235 SSSKFTGFTFGCGE------KNIGDFGEVDGLLGLGRGKLSLPSQAAPSFGGVFSYCLPS 288
Query: 252 HKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG 311
+ T+ L+ G++ T + YT + P + +Y++ L I +GG
Sbjct: 289 YN------TTPGYLNIGATKP-TSTVPVQYTAMIKKPQ------YPSFYFIELVSINIGG 335
Query: 312 QRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
+ V T GT++DSGT T++ P + L D F M N+ A
Sbjct: 336 YILPVPPSVFT-----KTGTLLDSGTILTYLPPPAYTSLRDRFKFTMQGNKP------AP 384
Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAV--CLTVVTDRE 429
L C+D G+ P + +F GA L + + CL V+
Sbjct: 385 PYEPLDTCYDFTGQGAIVIPAVSFNFSDGAVFDLDFYGIMIFPDDAKPLIGCLAFVSRPA 444
Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
A P I+GN Q + V YD+ +Q++GF C
Sbjct: 445 AM--PFSIVGNTQQRAAEVIYDVPSQKIGFIPISC 477
>gi|125579874|gb|EAZ21020.1| hypothetical protein OsJ_36669 [Oryza sativa Japonica Group]
Length = 382
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 81/265 (30%), Positives = 118/265 (44%), Gaps = 33/265 (12%)
Query: 212 LVGCSVLSSRQ----PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDN 267
LVG S R P+G+ G GRG+ SL SQ KFSYCL + F + T L +
Sbjct: 136 LVGLRAPSRRARSMAPSGLMGLGRGRLSLVSQTGATKFSYCLTPY-FHNNGATGHLFV-- 192
Query: 268 GSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDG 327
G+S S + T FV P S +YY+ L +TVG R+ + L
Sbjct: 193 GASASLGGHGDVMTTQFVKGPK------GSPFYYLPLIGLTVGETRLPIPATVFDLREVA 246
Query: 328 ----NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP 383
+GG I+DSG+ FT + + ++ LA E +++ +L A V
Sbjct: 247 PGLFSGGVIIDSGSPFTSLVHDAYDALASELAARL------NGSLVAPPPDADDGALCVA 300
Query: 384 GEKTGS-FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGP---SIILG 439
G P + HF+GGA++ +P E+Y+A V + +A GP ++G
Sbjct: 301 RRDVGRVVPAVVFHFRGGADMAVPAESYWAPVDKAAACMAIASA------GPYRRQSVIG 354
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
N+Q QN V YDL N F+ C
Sbjct: 355 NYQQQNMRVLYDLANGDFSFQPADC 379
>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 446
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 111/440 (25%), Positives = 192/440 (43%), Gaps = 57/440 (12%)
Query: 37 HTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFG 96
H P++ + + + S R +I+ + T + S + ++LS G
Sbjct: 49 HYKPNETAKDRMELDIEHSAARLAYIQARIEGSLVYNNDYTASVSPSLTGRTILVNLSIG 108
Query: 97 TP--PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSW 154
P PQ++ ++DTGS ++W C C C + F P +SS+ L C+ P C +
Sbjct: 109 QPSIPQLV--VMDTGSDILWIMCN---PCTNCDNHLGLLFDPSMSSTFSPL-CKTP-CGF 161
Query: 155 IHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEG--IALSETLNLPNRIIPNFL 212
+ C +P+ + SY+ + T G I + ET + I + +
Sbjct: 162 --------KGCKCDPIPFTI-------SYVDNSSASGTFGRDILVFETTDEGTSQISDVI 206
Query: 213 VGC--SVLSSRQPA--GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNG 268
+GC ++ + P GI G G SL +Q+ KFSYC+ + D + L L G
Sbjct: 207 IGCGHNIGFNSDPGYNGILGLNNGPNSLATQIG-RKFSYCI-GNLADPYYNYNQLRLGEG 264
Query: 269 SSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGN 328
+ T PF + +YYV + I+VG +R+ + + + R+G
Sbjct: 265 ADLEGYST------PF---------EVYHGFYYVTMEGISVGEKRLDIALETFEMKRNGT 309
Query: 329 GGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC-FDVPGEKT 387
GG I+DSGTT T++ + L +E V ++K ++ + + A L C + +
Sbjct: 310 GGVILDSGTTITYLVDSAHKLLYNE-VRNLLK-WSFRQVIFENAPWKL--CYYGIISRDL 365
Query: 388 GSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTV--VTDREASGGPSIILGNFQMQN 445
FP + HF GA++ L ++F+ C+TV + + PS+I G Q+
Sbjct: 366 VGFPVVTFHFVDGADLALDTGSFFS--QRDDIFCMTVSPASILNTTISPSVI-GLLAQQS 422
Query: 446 YYVEYDLRNQRLGFKQQLCK 465
Y V YDL NQ + F++ C+
Sbjct: 423 YNVGYDLVNQFVYFQRIDCE 442
>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
Length = 441
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 109/398 (27%), Positives = 155/398 (38%), Gaps = 69/398 (17%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSSSR 143
Y ++L GTP ++DTGS L W QCK C + K P F P SSS
Sbjct: 91 YVVTLGIGTPAVQQTVLIDTGSDLSWV------QCKPCGAGECYAQKDPLFDPSSSSSYA 144
Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLN 202
+ C + C + + C S +C Y + YG+ T G+ +ETL
Sbjct: 145 SVPCDSDACRKLAAGAYG-HGCT----GVSGGAAALC-EYGIEYGNRATTTGVYSTETLT 198
Query: 203 L-PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFD 255
L P ++ +F GC + G+ G G SL SQ + FSYCL
Sbjct: 199 LKPGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFSYCL------ 252
Query: 256 DTTRTSSLILDNGS---SHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
T + L G+ S S +GL++TP PSV +Y V L I+VGG
Sbjct: 253 PPTSGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSV------PTFYIVTLTGISVGGA 306
Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
+ + + G ++DSGT T + + L F S M + R + G
Sbjct: 307 PLAIPPSAFS------SGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGV- 359
Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLP------VENYFAVVGEGSAVCLTVVT 426
L C+D G + P + L F GGA + L V+ A G G T
Sbjct: 360 ---LDTCYDFTGHANVTVPTISLTFSGGATIDLAAPAGVLVDGCLAFAGAG--------T 408
Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
D I+GN + + V YD +GF+ C
Sbjct: 409 DNAIG-----IIGNVNQRTFEVLYDSGKGTVGFRAGAC 441
>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
Length = 478
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 106/401 (26%), Positives = 164/401 (40%), Gaps = 61/401 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKI--PSFIPKLSSSSRL 144
G Y + G+P + +DTGS ++W C +C S I + PK S +S
Sbjct: 67 GLYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEF 126
Query: 145 LGCQNPKCSWIHHESI-QCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLN 202
+ C++ CS + I C+ N CP Y + YG G T G + + L
Sbjct: 127 VSCEHNFCSSTYEGRILGCKAENP------------CP-YSISYGDGSATTGYYVQDYLT 173
Query: 203 LPNRIIPN---------FLVGCSVL------SSRQPA--GIAGFGRGKTSLPSQLNLDKF 245
NR+ N + GC SS + A GI GFG+ +S+ SQL
Sbjct: 174 F-NRVNGNPHTATQNSSIIFGCGAAQSGTFASSSEEALDGIIGFGQANSSVLSQLAASGK 232
Query: 246 SYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLR 305
+ SH D T I G K T TP V P++A +Y V L+
Sbjct: 233 VKKIFSHCLD--TNVGGGIFSIGEVVEPKVKT----TPLV--PNMA-------HYNVILK 277
Query: 306 RITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYT 365
I V G +++ T D + GT++DSGTT ++ +++ L + +++ + + Y
Sbjct: 278 NIEVDGDILQLPSD--TFDSENGKGTVIDSGTTLAYLPRIVYDQLMSKVLAKQPRLKVY- 334
Query: 366 RALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCL--T 423
L E + CF G FP +KLHF+ +T+ +Y S C+
Sbjct: 335 --LVEEQYS----CFQYTGNVDSGFPIVKLHFEDSLSLTVYPHDYLFNYKGDSYWCIGWQ 388
Query: 424 VVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+G +LG+F + N V YDL N +G+ C
Sbjct: 389 KSASETKNGKDMTLLGDFVLSNKLVVYDLENMTIGWTDYNC 429
>gi|296087864|emb|CBI35120.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 88.6 bits (218), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 100/365 (27%), Positives = 148/365 (40%), Gaps = 52/365 (14%)
Query: 106 LDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDC 165
+DT S + W PC C CSS+ F S++ + LGCQ +C + + C
Sbjct: 1 MDTSSDVAWIPCNG---CLGCSSTL---FNSPASTTYKSLGCQAAQCKQVPKPT-----C 49
Query: 166 NDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGC------SVLS 219
+C L GS L ++ +T+ L +P + GC L
Sbjct: 50 GGG----------VCSFNLTYGGSSLAANLS-QDTITLATDAVPGYSFGCIQKATGGSLP 98
Query: 220 SRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGL 279
++ G+ S L FSYCL S F + SL L G K+ +
Sbjct: 99 AQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPS--FKSLNFSGSLRL--GPVGQPKR---I 151
Query: 280 TYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTF 339
YTP + NP R + Y+V L + VG + V V T + GTI DSGT F
Sbjct: 152 KYTPLLKNP---RRPSL---YFVNLMAVRVGRRVVDVPPGSFTFNPSTGAGTIFDSGTVF 205
Query: 340 TFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKG 399
T + + + D F +++ +N L +L G C+ VP P + F G
Sbjct: 206 TRLVTPAYIAVRDAFRNRVGRN------LTVTSLGGFDTCYTVPIAA----PTITFMFTG 255
Query: 400 GAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGF 459
VTLP +N GS CL + + ++ N Q QN+ + YD+ N RLG
Sbjct: 256 -MNVTLPPDNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDVPNSRLGV 314
Query: 460 KQQLC 464
++LC
Sbjct: 315 ARELC 319
>gi|296086208|emb|CBI31649.3| unnamed protein product [Vitis vinifera]
Length = 761
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 80/263 (30%), Positives = 118/263 (44%), Gaps = 30/263 (11%)
Query: 215 CSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDK 274
C + + G+ G RG S +Q+ L KFSYC+ +S ++L SS S
Sbjct: 431 CRTRTHSKTTGLIGMNRGSLSFVTQMGLQKFSYCISGQD------SSGILLFGESSFSWL 484
Query: 275 KTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVD 334
K L YTP V S V Y V L I V +++ D G G T+VD
Sbjct: 485 K--ALKYTPLVQI-STPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGAGQTMVD 541
Query: 335 SGTTFTFMAPELFEPLADEFVSQ------MVKNRNYTRALGAEALTGLRPCFDVPGEKT- 387
SGT FTF+ ++ L +EFV Q ++++ N+ GA L C+ VP +
Sbjct: 542 SGTQFTFLLGPVYTALKNEFVRQTKASLKVLEDPNFVFQ-GAMDL-----CYRVPLTRRT 595
Query: 388 -GSFPELKLHFKGGAEVTLPVENYF-----AVVGEGSAVCLTVVTDREASGGPSIILGNF 441
P + L F+ GAE+++ E + G S C T + E G S I+G+
Sbjct: 596 LPPLPTVTLMFR-GAEMSVSAERLMYRVPGVIRGSDSVYCFT-FGNSELLGVESYIIGHH 653
Query: 442 QMQNYYVEYDLRNQRLGFKQQLC 464
QN ++E+DL R+GF + C
Sbjct: 654 HQQNVWMEFDLAKSRVGFAEVRC 676
Score = 42.0 bits (97), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 38/79 (48%), Gaps = 15/79 (18%)
Query: 78 TTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS---- 133
++ +S H ++SL+ G+PPQ + +LDTGS L W C K P+
Sbjct: 364 SSKLSFHHNVSLTVSLTVGSPPQTVTMVLDTGSELSWLHC-----------KKAPNLHSV 412
Query: 134 FIPKLSSSSRLLGCQNPKC 152
F P SSS + C +P C
Sbjct: 413 FDPLRSSSYSPIPCTSPTC 431
>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 493
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 98/398 (24%), Positives = 160/398 (40%), Gaps = 56/398 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
G Y L GTPP+ +DTGS ++W C + C S +I F P S ++
Sbjct: 79 GLYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASP 138
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLN- 202
+ C + +CSW S D + N +C +Y YG G T G +S+ L
Sbjct: 139 ISCSDQRCSWGIQSS-------DSGCSVQNN---LC-AYTFQYGDGSGTSGFYVSDVLQF 187
Query: 203 --------LPNRIIPNFLVGCS-------VLSSRQPAGIAGFGRGKTSLPSQLNLDKFSY 247
+PN P + GCS V S R GI GFG+ S+ SQL +
Sbjct: 188 DMIVGSSLVPNSTAP-VVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAP 246
Query: 248 CLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRI 307
+ SH +++ + + +TP V PS +Y V L I
Sbjct: 247 RVFSHCLKGENGGGGILV-----LGEIVEPNMVFTPLV--PS-------QPHYNVNLLSI 292
Query: 308 TVGGQRVRVWHKYLTLDRDGNG-GTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
+V GQ + + + NG GTI+D+GTT +++ + P + + + ++
Sbjct: 293 SVNGQALPINPSVFSTS---NGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVV 349
Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT 426
+ G + C+ + FP + L+F GGA + L ++Y + +
Sbjct: 350 SKGNQ-------CYVITTSVGDIFPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIG 402
Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ ILG+ +++ YDL QR+G+ C
Sbjct: 403 FQRIQNQGITILGDLVLKDKIFVYDLVGQRIGWANYDC 440
>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
Length = 475
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 112/392 (28%), Positives = 155/392 (39%), Gaps = 74/392 (18%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCS-----SSKIPSFIPKLSSSSR 143
Y +++S GTP +DTGS + W QCK C S + P F P SSS
Sbjct: 142 YVVTVSLGTPAVAQTLEVDTGSDVSWV------QCKPCPSPPCYSQRDPLFDPTRSSSYS 195
Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLN 202
+ C CS + S C + C Y+V YG G T G+ S+TL
Sbjct: 196 AVPCAAASCSQLALYSNGC---------SGGQC-----GYVVSYGDGSTTTGVYSSDTLT 241
Query: 203 LP-NRIIPNFLVGCSVLSSRQPAGIA---GFGRGKTSLPSQLNL---DKFSYCLLSHKFD 255
L + + FL GC AG+ G GR SL SQ + FSYCL
Sbjct: 242 LTGSNALKGFLFGCGHAQQGLFAGVDGLLGLGRQGQSLVSQASSTYGGVFSYCL------ 295
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFV---NNPSVAERNAFSVYYYVGLRRITVGGQ 312
T+ S + G S T G + TP + N+P+ YY V L I+VGGQ
Sbjct: 296 PPTQNSVGYISLGGPSS---TAGFSTTPLLTASNDPT---------YYIVMLAGISVGGQ 343
Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
+ + G +VD+GT T + P + L F + M Y A A
Sbjct: 344 PLSIDASVFA------SGAVVDTGTVVTRLPPTAYSALRSAFRAAMAP-YGYPS---APA 393
Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG 432
L C+D T + P + + F GGA + L ++ G D +AS
Sbjct: 394 TGILDTCYDFTRYGTVTLPTISIAFGGGAAMDLGTSG---ILTSGCLAFAPTGGDSQAS- 449
Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
ILGN Q +++ V +D +GF C
Sbjct: 450 ----ILGNVQQRSFEVRFD--GSTVGFMPASC 475
>gi|222616654|gb|EEE52786.1| hypothetical protein OsJ_35257 [Oryza sativa Japonica Group]
Length = 346
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 102/387 (26%), Positives = 166/387 (42%), Gaps = 54/387 (13%)
Query: 91 ISLSFGTPPQIIPFILDTGSHLVWFPCTN-HYQCKYCSSSKIPSFIPKLSSSSRLLGCQN 149
+ +S GTPP +DTGS L W C N +C ++ F P SS+ +GC
Sbjct: 1 MGISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSKVGCST 60
Query: 150 PKCSWIHHESIQCRDCNDEPLATSKNCTQICPS--YLVLYGSG-LTEGIALSETLNLP-N 205
C+ +H + LA C + + Y + YGSG + G + L L N
Sbjct: 61 EACNGMHMD-----------LAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLASN 109
Query: 206 RIIPNFLVGCSV--LSSRQPAGIAGFGRGKTSLPSQL----NLDKFSYCLLSHKFDDTTR 259
R I NF+ GC L + AGI GFG S +Q+ + FSYC D
Sbjct: 110 RSIDNFIFGCGEDNLYNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPR----DHEN 165
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
SL + + + T L Y + + P+ Y + + V G R+ +
Sbjct: 166 EGSLTIGPYARDINLMWTKLIY--YDHKPA----------YAIQQLDMMVNGIRLEI-DP 212
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPC 379
Y+ + + TIVDSGT T++ +F+ L D+ +++ ++ + YTR R C
Sbjct: 213 YIYISK----MTIVDSGTADTYILSPVFDAL-DKAMTKEMQAKGYTRGWDER-----RIC 262
Query: 380 F--DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
F + FP +++ + + LPVEN F + +C T + D G +
Sbjct: 263 FISNSGSANWNDFPTVEMKLI-RSTLKLPVENAF-YESSNNVICSTFLPDDAGVRGVQ-M 319
Query: 438 LGNFQMQNYYVEYDLRNQRLGFKQQLC 464
LGN ++++ + +D++ GFK + C
Sbjct: 320 LGNRAVRSFKLVFDIQAMNFGFKARAC 346
>gi|340810987|gb|AEK75420.1| S5 [Oryza rufipogon]
gi|340810989|gb|AEK75421.1| S5 [Oryza rufipogon]
gi|340810991|gb|AEK75422.1| S5 [Oryza rufipogon]
gi|340811001|gb|AEK75427.1| S5 [Oryza rufipogon]
gi|340811019|gb|AEK75436.1| S5 [Oryza rufipogon]
gi|340811104|gb|AEK75478.1| S5 [Oryza rufipogon]
gi|340811124|gb|AEK75488.1| S5 [Oryza rufipogon]
Length = 472
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 112/435 (25%), Positives = 183/435 (42%), Gaps = 79/435 (18%)
Query: 62 IKNPQTKTTTTTTTTTTTNISSHSYGGYS--ISLSFGTPPQIIPFILDTGSHLVWF---P 116
+ N Q + T++++T I S + +++S G PP + +DTGS L W P
Sbjct: 85 LNNLQEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQP 144
Query: 117 CTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE-SIQCRDCNDEPLATSKN 175
C H C S+ P F P S +SR + C + KC + ++ +Q +C + +
Sbjct: 145 CAVH--CHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANC----MEKENS 198
Query: 176 CTQICPSYLVLYGSGL--TEGIALSETLNLPNRIIPNFLVGCS--VLSSRQPAGIAGFGR 231
CT Y V YG+G + G +++TL + + + + + GCS V S AGI GFG
Sbjct: 199 CT-----YSVTYGNGWAYSVGKMVTDTLRIGDSFM-DLMFGCSMDVKYSEFEAGIFGFGS 252
Query: 232 GK-------TSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPF 284
P L+ FSYCL + D T+ +IL D+ YTP
Sbjct: 253 SSFSFFEQLAGYPDILSYKAFSYCLPT----DETKPGYMIL----GRYDRAAMDGGYTPL 304
Query: 285 ---VNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTF 341
+N P+ Y + + + GQR+ + IVDSG T
Sbjct: 305 FRSINRPT----------YSLTMEMLIANGQRLVT----------SSSEMIVDSGAQRTS 344
Query: 342 MAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTG------------S 389
+ P F L D+ ++Q + + Y R A + + C+ + +G +
Sbjct: 345 LWPSTFA-LLDKTITQAMSSIGYHRTSRARQESYI--CYLSEHDYSGWNGTITPFSNWSA 401
Query: 390 FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVE 449
P L++ F GGA + LP N F +C+T + S ILGN +++
Sbjct: 402 LPLLEIGFAGGAALALPPRNVF-YNDPHRGLCMTFAQNPALR---SQILGNRVTRSFGTT 457
Query: 450 YDLRNQRLGFKQQLC 464
+D++ ++ GFK C
Sbjct: 458 FDIQGKQFGFKYAAC 472
>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
Length = 440
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 126/487 (25%), Positives = 195/487 (40%), Gaps = 85/487 (17%)
Query: 11 SFIFFF----TLLSIFP---SSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIK 63
+ FFF +LL+ P S T +F++ H + + N SS+TR+ I+
Sbjct: 3 ALAFFFAASCSLLATLPFTEPSKTPSSFTIDLIHHDSPPSPFYN------SSMTRSQLIR 56
Query: 64 NPQTKTTTTTT--------------TTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTG 109
N ++ + ++ I + G Y + + GTP I DTG
Sbjct: 57 NAAMRSISRANQLSLSLSHSLNQLKESSPEPIIIPNNGNYLMRIYIGTPSVERLAIADTG 116
Query: 110 SHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEP 169
S L W C+ K C + P + P SS+ LL C + C+ + + C D D
Sbjct: 117 SDLTWVQCSPCDNTK-CFAQNTPLYDPLNSSTFTLLPCDSQPCTQLPYSQYVCSDYGDCI 175
Query: 170 LATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGC------SVLSSRQP 223
A + YG ++ I L + L GC + S +
Sbjct: 176 YAYTYGDNSYS------YGGLSSDSIRL---MLLQLHYNSKICFGCGFQNKFTADKSGKT 226
Query: 224 AGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLT 280
GI G G G SL SQL + KFSYCLL + ++ L G + + + G+
Sbjct: 227 TGIVGLGAGPLSLVSQLGDEIGHKFSYCLLPFSSNSNSK-----LKFGEA-AIVQGNGVV 280
Query: 281 YTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFT 340
TP + P + +YY+ L ITVG + V+ T DGN I+DSG+T T
Sbjct: 281 STPLIIKPDLP-------FYYLNLEGITVGAKTVK------TGQTDGN--IIIDSGSTLT 325
Query: 341 FMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD---VPGEKTGSFPELKLHF 397
++ E +EFVS +VK + E + FD E + P++ HF
Sbjct: 326 YLE----ESFYNEFVS-LVK-----ETVAVEEDQYIPYPFDFCFTYKEGMSTPPDVVFHF 375
Query: 398 KGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRL 457
GG V P+ V+ E + +C TVV I GN +++V YD++ ++
Sbjct: 376 TGGDVVLKPMNT--LVLIEDNLICSTVVPSHFDGIA---IFGNLGQIDFHVGYDIQGGKV 430
Query: 458 GFKQQLC 464
F C
Sbjct: 431 SFAPTDC 437
>gi|196212952|gb|ACG76112.1| S5 [Oryza sativa Indica Group]
gi|338809989|gb|AEJ08560.1| S5 [Oryza barthii]
gi|340810883|gb|AEK75368.1| S5 [Oryza sativa]
gi|340810885|gb|AEK75369.1| S5 [Oryza sativa]
gi|340810889|gb|AEK75371.1| S5 [Oryza sativa]
gi|340810895|gb|AEK75374.1| S5 [Oryza sativa]
gi|340810897|gb|AEK75375.1| S5 [Oryza sativa]
gi|340810905|gb|AEK75379.1| S5 [Oryza sativa]
gi|340810909|gb|AEK75381.1| S5 [Oryza sativa]
gi|340810911|gb|AEK75382.1| S5 [Oryza sativa]
gi|340810913|gb|AEK75383.1| S5 [Oryza sativa]
gi|340810923|gb|AEK75388.1| S5 [Oryza sativa]
gi|340810925|gb|AEK75389.1| S5 [Oryza sativa]
gi|340810929|gb|AEK75391.1| S5 [Oryza sativa]
gi|340810935|gb|AEK75394.1| S5 [Oryza sativa]
gi|340810937|gb|AEK75395.1| S5 [Oryza sativa]
gi|340810939|gb|AEK75396.1| S5 [Oryza sativa]
gi|340810941|gb|AEK75397.1| S5 [Oryza sativa]
gi|340810943|gb|AEK75398.1| S5 [Oryza sativa]
gi|340810951|gb|AEK75402.1| S5 [Oryza sativa]
gi|340810953|gb|AEK75403.1| S5 [Oryza sativa]
gi|340810963|gb|AEK75408.1| S5 [Oryza sativa]
gi|340810965|gb|AEK75409.1| S5 [Oryza sativa]
gi|340810973|gb|AEK75413.1| S5 [Oryza nivara]
gi|340811003|gb|AEK75428.1| S5 [Oryza rufipogon]
gi|340811005|gb|AEK75429.1| S5 [Oryza rufipogon]
gi|340811009|gb|AEK75431.1| S5 [Oryza rufipogon]
gi|340811023|gb|AEK75438.1| S5 [Oryza rufipogon]
gi|340811025|gb|AEK75439.1| S5 [Oryza nivara]
gi|340811031|gb|AEK75442.1| S5 [Oryza rufipogon]
gi|340811033|gb|AEK75443.1| S5 [Oryza rufipogon]
gi|340811035|gb|AEK75444.1| S5 [Oryza nivara]
gi|340811039|gb|AEK75446.1| S5 [Oryza rufipogon]
gi|340811049|gb|AEK75451.1| S5 [Oryza nivara]
gi|340811053|gb|AEK75453.1| S5 [Oryza rufipogon]
gi|340811055|gb|AEK75454.1| S5 [Oryza nivara]
gi|340811057|gb|AEK75455.1| S5 [Oryza rufipogon]
gi|340811059|gb|AEK75456.1| S5 [Oryza rufipogon]
gi|340811061|gb|AEK75457.1| S5 [Oryza rufipogon]
gi|340811065|gb|AEK75459.1| S5 [Oryza nivara]
gi|340811067|gb|AEK75460.1| S5 [Oryza nivara]
gi|340811069|gb|AEK75461.1| S5 [Oryza nivara]
gi|340811071|gb|AEK75462.1| S5 [Oryza rufipogon]
gi|340811081|gb|AEK75467.1| S5 [Oryza nivara]
gi|340811083|gb|AEK75468.1| S5 [Oryza nivara]
gi|340811087|gb|AEK75470.1| S5 [Oryza nivara]
gi|340811092|gb|AEK75472.1| S5 [Oryza nivara]
gi|340811102|gb|AEK75477.1| S5 [Oryza rufipogon]
gi|340811106|gb|AEK75479.1| S5 [Oryza rufipogon]
gi|340811108|gb|AEK75480.1| S5 [Oryza rufipogon]
gi|340811110|gb|AEK75481.1| S5 [Oryza rufipogon]
gi|340811112|gb|AEK75482.1| S5 [Oryza rufipogon]
gi|340811118|gb|AEK75485.1| S5 [Oryza nivara]
gi|340811120|gb|AEK75486.1| S5 [Oryza rufipogon]
Length = 472
Score = 88.2 bits (217), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 112/435 (25%), Positives = 183/435 (42%), Gaps = 79/435 (18%)
Query: 62 IKNPQTKTTTTTTTTTTTNISSHSYGGYS--ISLSFGTPPQIIPFILDTGSHLVWF---P 116
+ N Q + T++++T I S + +++S G PP + +DTGS L W P
Sbjct: 85 LNNLQEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQP 144
Query: 117 CTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE-SIQCRDCNDEPLATSKN 175
C H C S+ P F P S +SR + C + KC + ++ +Q +C + +
Sbjct: 145 CAVH--CHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANC----MEKEDS 198
Query: 176 CTQICPSYLVLYGSGL--TEGIALSETLNLPNRIIPNFLVGCS--VLSSRQPAGIAGFGR 231
CT Y V YG+G + G +++TL + + + + + GCS V S AGI GFG
Sbjct: 199 CT-----YSVTYGNGWAYSVGKMVTDTLRIGDSFM-DLMFGCSMDVKYSEFEAGIFGFGS 252
Query: 232 GK-------TSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPF 284
P L+ FSYCL + D T+ +IL D+ YTP
Sbjct: 253 SSFSFFEQLAGYPDILSYKAFSYCLPT----DETKPGYMIL----GRYDRAAMDGGYTPL 304
Query: 285 ---VNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTF 341
+N P+ Y + + + GQR+ + IVDSG T
Sbjct: 305 FRSINRPT----------YSLTMEMLIANGQRLVT----------SSSEMIVDSGAQRTS 344
Query: 342 MAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTG------------S 389
+ P F L D+ ++Q + + Y R A + + C+ + +G +
Sbjct: 345 LWPSTFA-LLDKTITQAMSSIGYHRTSRARQESYI--CYLSEHDYSGWNGTITPFSNWSA 401
Query: 390 FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVE 449
P L++ F GGA + LP N F +C+T + S ILGN +++
Sbjct: 402 LPLLEIGFAGGAALALPPRNVF-YNDPHRGLCMTFAQNPALR---SQILGNRVTRSFGTT 457
Query: 450 YDLRNQRLGFKQQLC 464
+D++ ++ GFK C
Sbjct: 458 FDIQGKQFGFKYAAC 472
>gi|356576537|ref|XP_003556387.1| PREDICTED: basic 7S globulin-like [Glycine max]
Length = 438
Score = 88.2 bits (217), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 98/404 (24%), Positives = 174/404 (43%), Gaps = 77/404 (19%)
Query: 97 TPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSW-- 154
TP + +D G +W C Y +SS+SR C + +CS
Sbjct: 54 TPLVAVKLTVDLGGGYLWVNCEKGY----------------VSSTSRPARCGSAQCSLFG 97
Query: 155 IHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVG 214
++ S + + C P T + + + T+G ++ +++P + F+ G
Sbjct: 98 LYGCSTEDKICGRSPSNTVTGVSTYGDIHADVVAVNSTDGNNPTKVVSVPKFL---FICG 154
Query: 215 CSVLSSRQPAGI---AGFGRGKTSLPSQLNLD-----KFSYCLLSHKFDDTTRTSSLIL- 265
+V+ +G+ AG GR K SLPSQ KF+ CL S +T T+ ++
Sbjct: 155 SNVVQKGLASGVTGMAGLGRTKVSLPSQFASAFSFHRKFAICLSS-----STMTNGVMFF 209
Query: 266 -----DNGSSHSDKKTTGLTYTPFVNNPSVAERNAF----SVYYYVGLRRITVGGQRVRV 316
+ G +SD LT+TP ++NP + F SV Y++G++ I V + V +
Sbjct: 210 GDGPYNFGYLNSDLSKV-LTFTPLISNPVSTAPSYFQGEPSVEYFIGVKSIKVSDKNVAL 268
Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
L++DR+G GGT + + +T M +++ +++ FV + +GA + +
Sbjct: 269 NTTLLSIDRNGIGGTKISTVNPYTVMETTIYKAVSEVFVKE----------VGAPTVAPV 318
Query: 377 RP---CF---DVPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDRE 429
P CF D+ + G + P + L + T+ N V + +CL V
Sbjct: 319 APFGTCFATKDIGSTRMGPAVPGIDLVLQNDVVWTIIGANSMVYVND--VICLGFVDAGS 376
Query: 430 A---------SGGP----SIILGNFQMQNYYVEYDLRNQRLGFK 460
+ +GG SI +G Q++N +++DL RLGF+
Sbjct: 377 SPSVAQVGFVAGGSHPRTSITIGAHQLENNLLQFDLATSRLGFR 420
>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
Length = 539
Score = 88.2 bits (217), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 97/397 (24%), Positives = 162/397 (40%), Gaps = 54/397 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
G Y L GTPP+ +DTGS ++W C + C S +I F P S ++
Sbjct: 79 GLYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASP 138
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLN- 202
+ C + +CSW S D + N +C +Y YG G T G +S+ L
Sbjct: 139 ISCSDQRCSWGIQSS-------DSGCSVQNN---LC-AYTFQYGDGSGTSGFYVSDVLQF 187
Query: 203 ---LPNRIIPN----FLVGCS-------VLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYC 248
+ + ++PN + GCS V S R GI GFG+ S+ SQL +
Sbjct: 188 DMIVGSSLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPR 247
Query: 249 LLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
+ SH +++ + + +TP V PS +Y V L I+
Sbjct: 248 VFSHCLKGENGGGGILV-----LGEIVEPNMVFTPLV--PS-------QPHYNVNLLSIS 293
Query: 309 VGGQRVRVWHKYLTLDRDGNG-GTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRA 367
V GQ + + + NG GTI+D+GTT +++ + P + + + ++ +
Sbjct: 294 VNGQALPINPSVFSTS---NGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVS 350
Query: 368 LGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTD 427
G + C+ + FP + L+F GGA + L ++Y + +
Sbjct: 351 KGNQ-------CYVITTSVGDIFPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGF 403
Query: 428 REASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ ILG+ +++ YDL QR+G+ C
Sbjct: 404 QRIQNQGITILGDLVLKDKIFVYDLVGQRIGWANYDC 440
>gi|225431324|ref|XP_002269880.1| PREDICTED: aspartic proteinase-like protein 1 [Vitis vinifera]
gi|297739017|emb|CBI28369.3| unnamed protein product [Vitis vinifera]
Length = 518
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 114/414 (27%), Positives = 173/414 (41%), Gaps = 82/414 (19%)
Query: 77 TTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCK------YCSSSK 130
+T ISS + Y+ ++S GTP + LDTGS L W PC + +C Y S +
Sbjct: 92 STFRISSLGFLHYT-TVSLGTPGKKFLVALDTGSDLFWVPC-DCSRCAPTEGTTYASDFE 149
Query: 131 IPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG 190
+ + PK SS+SR + C N C+ H + +C L T NC Y+V Y S
Sbjct: 150 LSIYNPKGSSTSRKVTCDNSLCA---HRN-RC-------LGTFSNC-----PYMVSYVSA 193
Query: 191 L--TEGIALSETLNL---PNR---IIPNFLVGC------SVLSSRQPAGIAGFGRGKTSL 236
T GI + + L+L NR + GC S L P G+ G G K S+
Sbjct: 194 ETSTSGILVEDVLHLTTEDNRQEFVEAYVTFGCGQVQTGSFLDIAAPNGLFGLGLEKISV 253
Query: 237 PSQLNLDKFSYCLLSHKF--DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERN 294
PS L+ + F+ S F D R S DK + TPF N
Sbjct: 254 PSILSKEGFTADSFSMCFGPDGIGRI---------SFGDKGSPDQEETPF-------NLN 297
Query: 295 AFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEF 354
A Y + + ++ VG + D + + DSGT+FT++ ++ + F
Sbjct: 298 ALHPTYNITVTQVRVGTTLI-----------DLDFTALFDSGTSFTYLVDPIYTNVLKSF 346
Query: 355 VSQMVKNRNYTRALGAEALTGLRPCFDV-PGEKTGSFPELKLHFKGGAEVTLPVENYFAV 413
SQ +R ++ C+D+ PGE T P + L KGG++ PV + +
Sbjct: 347 HSQAQDSRR-----PPDSRIPFEFCYDMSPGENTSLIPSMSLTMKGGSQ--FPVYDPIII 399
Query: 414 VGEGSAV--CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
+ S + C+ VV E + I+G M Y + +D LG+K+ C
Sbjct: 400 ISSQSELIYCMAVVRSAELN-----IIGQNFMTGYRIIFDREKLVLGWKEFECD 448
>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
Length = 354
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 92/345 (26%), Positives = 148/345 (42%), Gaps = 56/345 (16%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
G Y + GTPP +DTGS ++W C + C S +I F P SS+S +
Sbjct: 23 GLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSSM 82
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNL 203
+ C + +C + IQ D AT + C SY YG G T G +S+ ++L
Sbjct: 83 IACSDQRC----NNGIQSSD------ATCSSQNNQC-SYTFQYGDGSGTSGYYVSDMMHL 131
Query: 204 ---------PNRIIPNFLVGCS-------VLSSRQPAGIAGFGRGKTSLPSQLNLDKFSY 247
N P + GCS S R GI GFG+ + S+ SQL+ +
Sbjct: 132 NTIFEGSVTTNSTAP-VVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAP 190
Query: 248 CLLSHKFD-DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
+ SH D++ L+L + + YT V P+ +Y + L+
Sbjct: 191 RVFSHCLKGDSSGGGILVL------GEIVEPNIVYTSLV--PA-------QPHYNLNLQS 235
Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
I V GQ +++ + GTIVDSGTT ++A E ++P + + ++ +
Sbjct: 236 IAVNGQTLQIDSSVFATSN--SRGTIVDSGTTLAYLAEEAYDPFVSAITASIPQSVHTAV 293
Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYF 411
+ G + C+ + T FP++ L+F GGA + L ++Y
Sbjct: 294 SRGNQ-------CYLITSSVTEVFPQVSLNFAGGASMILRPQDYL 331
>gi|449432735|ref|XP_004134154.1| PREDICTED: basic 7S globulin-like [Cucumis sativus]
gi|449527085|ref|XP_004170543.1| PREDICTED: basic 7S globulin-like [Cucumis sativus]
Length = 435
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 106/403 (26%), Positives = 165/403 (40%), Gaps = 71/403 (17%)
Query: 97 TPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIH 156
TP + +D G +W C Y +SSS + C++ +CS +
Sbjct: 52 TPLVPVKLTVDLGGQFMWVDCDRGY----------------VSSSYKPARCRSAQCS-LA 94
Query: 157 HESIQCRDCNDEPLATSKNCT-QICPSYLVLYGSGLTEGIALSETLNLPNRI-------I 208
+S C C P N T + P ++ S E + +++ N I
Sbjct: 95 SKSSACGQCFSPPRPGCNNNTCSLFPGNTIIRLSTSGEVASDVVSVSSTNGFNPTRAVSI 154
Query: 209 PNFLVGCS---VLSSRQPA--GIAGFGRGKTSLPSQLNLD-----KFSYCLLSHKFDDTT 258
PNFL C +L P G+AGFGR SLPSQ KF+ CL +T
Sbjct: 155 PNFLFVCGSTFLLEGLAPGVTGMAGFGRNGISLPSQFAAAFSFNRKFAVCL-----SGST 209
Query: 259 RTSSLILD-NGSSH---SDKKTTGLTYTPFVNNP----SVAERNAFSVYYYVGLRRITVG 310
+ +I NG H + T TYTP NP V+ S Y++G+ I V
Sbjct: 210 SSPGVIFSGNGPYHFLPNIDLTNSFTYTPLFINPVSTAGVSSAGEKSTEYFIGVTSIVVN 269
Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
+ V + L +D +GNGGT + + FT + +++ L F +++ K +GA
Sbjct: 270 SKPVPLNTTLLKIDSNGNGGTKISTVNPFTVLESSIYKALVKAFTTEVSK----VPRVGA 325
Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVEN---YFAVVGEGSAV------- 420
A C+ + SFP +L G + L ++N +++ G S V
Sbjct: 326 VA--PFEVCYS-----SKSFPSTRLG-AGVPTIDLVLQNKKVIWSMFGANSMVQVNDEVL 377
Query: 421 CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
CL V D +I++G Q+++ +E+DL RLGF L
Sbjct: 378 CLGFV-DGGVDVRTAIVIGAHQIEDKLLEFDLATSRLGFTPTL 419
>gi|255552253|ref|XP_002517171.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223543806|gb|EEF45334.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 437
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 105/406 (25%), Positives = 171/406 (42%), Gaps = 75/406 (18%)
Query: 97 TPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIH 156
TP + +D G L+W C Y +SSS R L C + CS +
Sbjct: 53 TPLVPVKLTVDLGGSLMWINCEEGY----------------VSSSYRPLSCDSALCSLSN 96
Query: 157 HESIQCRDCNDEPLATSKN--CTQICPSYLVLYGSG--LTEGIALSETLNLPN--RII-- 208
+S ++C P N C Q + +V G+G L + + ++ + N RI+
Sbjct: 97 SQSCN-KECYSSPKPGCYNNTCGQSSNNRVVYIGTGGDLGQDVVALQSFDGKNLGRIVSV 155
Query: 209 PNFLVGCSVLS-----SRQPAGIAGFGRGKTSLP----SQLNLDK-FSYCLLSHKFDDTT 258
PNF C + + G+AG GR SLP S + K FS CL S +T
Sbjct: 156 PNFPFVCGITWLLDDLADGVTGMAGLGRSNISLPAYFSSAIGFSKTFSICLSS-----ST 210
Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNP-------SVAERNAFSVYYYVGLRRITVGG 311
+++ +I+ G S + L Y + NP S+ E +A YY+G++ I V G
Sbjct: 211 KSNGVIV-FGDGPSSIVSNDLIYIRLILNPVGTPGYSSLGESSA---DYYIGVKSIRVDG 266
Query: 312 QRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYT-----R 366
+ V+ L++D+DGNGGT++ + +T + +++ L F+ ++V +
Sbjct: 267 KEVKFDKTLLSIDKDGNGGTMLSTVNPYTVLHTSIYKALLKAFIKKLVFRFSLVVPSVPV 326
Query: 367 ALGAEALTGLRPCFDVPGEKTGSFP--ELKLHFKGGAEVTLPVENYFAVVGEGSAV---- 420
GA + F E P L+L + G V Y+ ++G S V
Sbjct: 327 PFGACVFSN---GFRTTEEFLSYVPIINLELESEQGNSV------YWRILGANSMVAVNS 377
Query: 421 ---CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
CL + P II+G Q+++ + +DL + RLGF L
Sbjct: 378 YTMCLAFIDGGSQPRTP-IIIGGHQLEDNLLHFDLASSRLGFSSSL 422
>gi|361066667|gb|AEW07645.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
gi|383134456|gb|AFG48207.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
gi|383134472|gb|AFG48215.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
gi|383134476|gb|AFG48217.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
gi|383134478|gb|AFG48218.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
gi|383134480|gb|AFG48219.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
gi|383134482|gb|AFG48220.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
gi|383134484|gb|AFG48221.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
Length = 136
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 46/113 (40%), Positives = 69/113 (61%), Gaps = 8/113 (7%)
Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
IT+GGQR+++ T D++GNGG IVDSGTTFT + L+ + ++ S + Y+R
Sbjct: 1 ITIGGQRLKLPSSLTTFDKEGNGGLIVDSGTTFTMLPESLYRQVLNKLKSAI----RYSR 56
Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPEL---KLHFKGGAEVTLPVENYFAVVGE 416
++ EA GL C+++P GSFP L LHFK +TLP ENY +++ +
Sbjct: 57 SVKYEAALGLDLCYELP-SAGGSFPVLPTFSLHFKDNVTITLPAENYMSMMSD 108
>gi|357492303|ref|XP_003616440.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517775|gb|AES99398.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 521
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 106/403 (26%), Positives = 157/403 (38%), Gaps = 100/403 (24%)
Query: 90 SISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS----FIPKLSSSSRLL 145
++SL+ G+PPQ + +LDTGS L W C K+P+ F P +SSS
Sbjct: 37 TVSLTVGSPPQRVTMVLDTGSELSWLHC-----------KKLPNLNFIFNPLVSSSYTPT 85
Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPN 205
C +P C+ Q RD + P++ N ++C G G+
Sbjct: 86 PCTSPICT------TQTRDLIN-PVSCDAN--KLCHIITFFVGGPAQRGMVF-------- 128
Query: 206 RIIPNFLVGC------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTR 259
GC S + G+ G G S +Q+ L KFSYC+ +
Sbjct: 129 --------GCMDTGTSSGDEDSKTTGLMGMDLGSLSFSNQMRLPKFSYCI-----SNKDS 175
Query: 260 TSSLILDN-------GSSHSD---KKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
T L+L+N G H KKTT L Y F N + +++AF
Sbjct: 176 TGVLVLENIANPPRLGPLHYTPLVKKTTPLPY--FNRNCCLFQKSAF------------- 220
Query: 310 GGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
D G G T+VDS T FTF+ ++ L +EF ++ +N LG
Sbjct: 221 ------------LPDHTGAGQTMVDSATQFTFLRQPVYTALKNEFA---IQTKNILTPLG 265
Query: 370 AEALT---GLRPCFDVP-GEKTGSFPELKLHFKGGAEVTLPVENYF----AVVGEGSAVC 421
+ CF VP G P + L F GAE+ + E V S +
Sbjct: 266 DPKFVFQGVMDLCFRVPIGSTLPVLPVVTLMFD-GAELRVTGERLLYKVSNVAKSNSWIY 324
Query: 422 LTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ + G + I+G+ +N ++EYDL N R+GF C
Sbjct: 325 CFTFGNSDLLGIEAFIIGHHHQRNVWMEYDLANSRIGFSDTNC 367
>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
Length = 464
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 112/392 (28%), Positives = 155/392 (39%), Gaps = 74/392 (18%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCS-----SSKIPSFIPKLSSSSR 143
Y +++S GTP +DTGS + W QCK C S + P F P SSS
Sbjct: 131 YVVTVSLGTPAVAQTLEVDTGSDVSWV------QCKPCPSPPCYSQRDPLFDPTRSSSYS 184
Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLN 202
+ C CS + S C + C Y+V YG G T G+ S+TL
Sbjct: 185 AVPCAAASCSQLALYSNGC---------SGGQC-----GYVVSYGDGSTTTGVYSSDTLT 230
Query: 203 LP-NRIIPNFLVGCSVLSSRQPAGIA---GFGRGKTSLPSQLNL---DKFSYCLLSHKFD 255
L + + FL GC AG+ G GR SL SQ + FSYCL
Sbjct: 231 LTGSNALKGFLFGCGHAQQGLFAGVDGLLGLGRQGQSLVSQASSTYGGVFSYCL------ 284
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFV---NNPSVAERNAFSVYYYVGLRRITVGGQ 312
T+ S + G S T G + TP + N+P+ YY V L I+VGGQ
Sbjct: 285 PPTQNSVGYISLGGPSS---TAGFSTTPLLTASNDPT---------YYIVMLAGISVGGQ 332
Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
+ + G +VD+GT T + P + L F + M Y A A
Sbjct: 333 PLSIDASVFA------SGAVVDTGTVVTRLPPTAYSALRSAFRAAMAP-YGYPS---APA 382
Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG 432
L C+D T + P + + F GGA + L ++ G D +AS
Sbjct: 383 TGILDTCYDFTRYGTVTLPTISIAFGGGAAMDLGTSG---ILTSGCLAFAPTGGDSQAS- 438
Query: 433 GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
ILGN Q +++ V +D +GF C
Sbjct: 439 ----ILGNVQQRSFEVRFD--GSTVGFMPASC 464
>gi|24796804|gb|AAN64480.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 161
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 54/136 (39%), Positives = 77/136 (56%), Gaps = 4/136 (2%)
Query: 185 VLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSS-RQPAGIAGFGRGKTSLPSQLNLD 243
V+Y SG T + +S+TL P R I NF+VGCS++S +Q +G+ GF G S+PSQL L
Sbjct: 10 VVYSSGSTTRLLISDTLRTPGRTIRNFVVGCSLMSVYQQSSGLTGFSCGVPSVPSQLGLT 69
Query: 244 KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVG 303
KF Y LL+ +FDD S ++ G+ D + Y P + S R SVYYY+
Sbjct: 70 KFFYFLLARRFDDNATASDELILGGAGGKDDNVR-MQYIPLARSAST--RPLCSVYYYLA 126
Query: 304 LRRITVGGQRVRVWHK 319
L ITV + V++ +
Sbjct: 127 LIAITVRRKSVQLPKR 142
>gi|357155293|ref|XP_003577072.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Brachypodium distachyon]
Length = 429
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 105/396 (26%), Positives = 162/396 (40%), Gaps = 57/396 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTN-HYQCKYCSSSKIPSFIPKLSSSSRLL 145
G + + +S GTPP +DTGS L W C C + F P S++ L+
Sbjct: 73 GKFFMDISLGTPPVANLVTVDTGSTLSWVVCQRCQISCHTTAPEAGSVFDPDKSTTYELV 132
Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG----LTEGIALSETL 201
GC + C+ + + C +E T C Y + YGSG + G ++ L
Sbjct: 133 GCSSRDCADVQRSLVAPFGCIEE--------TDTC-LYSLRYGSGPSGQYSAGRLGTDKL 183
Query: 202 NLP--NRIIPNFLVGCSVLSSRQ--PAGIAGFGRGKTSLPSQL----NLDKFSYCLLSHK 253
L + II F+ GCS S + +G+ GFG S +Q+ N FSYC
Sbjct: 184 TLASSSSIIDGFIFGCSGDDSFKGYESGVIGFGGANFSFFNQVARQTNYRAFSYCFPGD- 242
Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
T+ L G+ D+ L YT + P +R+ +S+ + V G R
Sbjct: 243 -----HTAEGFLSIGAYPKDE----LVYTNLI--PHFGDRSVYSLQQI----DMMVDGNR 287
Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
++V T +VDSGT TF+ +F+ + S M + +G E
Sbjct: 288 LQVDQSEYTKRM-----MVVDSGTVDTFLLGPVFDAFSKAMASAMQAKGFLSDTVGTET- 341
Query: 374 TGLRPCFDVPGE---KTGSFPELKLHFKGGAEVTLPVENYF-AVVGEGSAVCLTVVTDRE 429
CF G +G P +++ F G + LP EN F ++ +CL D
Sbjct: 342 -----CFRPNGGDSVDSGDLPTVEMRFI-GTTLKLPPENVFHDLLPSHDKICLAFKPD-- 393
Query: 430 ASGGPSI-ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+G ++ ILGN ++ V YDL+ GF+ C
Sbjct: 394 VAGVRNVQILGNKATXSFRVVYDLQAMYFGFQAGAC 429
>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 112/445 (25%), Positives = 184/445 (41%), Gaps = 68/445 (15%)
Query: 39 NPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTP 98
NP+ + +V +S TR ++ Q K + S + ++ S G P
Sbjct: 50 NPNASVAERAERIVKTSATRIAYLY-AQIKGDIHMNDFELNLLPSTYEPLFLVNFSMGQP 108
Query: 99 --PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIH 156
PQ+ I+DTGS+++W C CK C+ P P SS+ L C N C +
Sbjct: 109 ATPQLA--IMDTGSNILWVRCA---PCKRCTQQNGPLLDPSKSSTYASLPCTNTMCHY-- 161
Query: 157 HESIQCRDCNDEPLATSKNCTQICP-SYLVLYGSGLTE-GIALSETLNLPN-----RIIP 209
A S C ++ Y + Y +GL+ G+ +E L + +P
Sbjct: 162 --------------APSAYCNRLNQCGYNLSYATGLSSAGVLATEQLIFHSSDEGVNAVP 207
Query: 210 NFLVGCS----VLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLIL 265
+ + GCS R+ G+ G G+G TS +++ KFSYCL + D + L+
Sbjct: 208 SVVFGCSHENGDYKDRRFTGVFGLGKGITSFVTRMG-SKFSYCL-GNIADPHYGYNQLVF 265
Query: 266 DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
++ T VN +YYV L I+VG +R+ + ++ +
Sbjct: 266 GEKANFEGYSTP----LKVVNG-----------HYYVTLEGISVGEKRLDIDSTAFSM-K 309
Query: 326 DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGE 385
++DSGT T++A F L +E V Q++ G+ A D+ G
Sbjct: 310 GNEKSALIDSGTALTWLAESAFRALDNE-VRQLLDGVLMPFWRGSFACYKGTVSQDLIG- 367
Query: 386 KTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASG-GPSI----ILGN 440
FP + HF GGA++ L E+ F +C+ V R+AS G ++G
Sbjct: 368 ----FPVVTFHFSGGADLDLDTESMF-YQATPDILCIAV---RQASAYGNDFKSFSVIGL 419
Query: 441 FQMQNYYVEYDLRNQRLGFKQQLCK 465
Q Y + YDL + +L F++ C+
Sbjct: 420 MAQQYYNMAYDLNSNKLFFQRIDCQ 444
>gi|356502091|ref|XP_003519855.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 519
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 116/416 (27%), Positives = 168/416 (40%), Gaps = 88/416 (21%)
Query: 77 TTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFI- 135
+T ISS + Y+ ++ GTP LDTGS L W PC C C++S +F
Sbjct: 89 STFRISSLGFLHYT-TVQIGTPGVKFMVALDTGSDLFWVPC----DCTRCAASDSTAFAS 143
Query: 136 --------PKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLY 187
P SS+S+ + C N C+ H S QC L T NC Y+V Y
Sbjct: 144 DFDLNVYNPNGSSTSKKVTCNNSLCT---HRS-QC-------LGTFSNC-----PYMVSY 187
Query: 188 GSGL--TEGIALSETLNLPNR------IIPNFLVGC------SVLSSRQPAGIAGFGRGK 233
S T GI + + L+L + N + GC S L P G+ G G K
Sbjct: 188 VSAETSTSGILVEDVLHLTQEDNHHDLVEANVIFGCGQIQSGSFLDVAAPNGLFGLGMEK 247
Query: 234 TSLPSQLNLDKFSYCLLSHKF--DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVA 291
S+PS L+ + F+ S F D R S D GS D+ TPF NPS
Sbjct: 248 ISVPSMLSREGFTADSFSMCFGRDGIGRIS--FGDKGSFDQDE-------TPFNLNPSHP 298
Query: 292 ERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLA 351
Y + + ++ VG + V L DSGT+FT++ + L
Sbjct: 299 T-------YNITVTQVRVGTTVIDVEFTAL-----------FDSGTSFTYLVDPTYTRLT 340
Query: 352 DEFVSQMVKNRNYTRALGAEALTGLRPCFDV-PGEKTGSFPELKLHFKGGAEVTLPVENY 410
+ F SQ+ R+ + ++ C+D+ P T P + L GG+ V +
Sbjct: 341 ESFHSQVQDRRHRS-----DSRIPFEYCYDMSPDANTSLIPSVSLTMGGGSH--FAVYDP 393
Query: 411 FAVVGEGSAV--CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
++ S + CL VV E + I+G M Y V +D LG+K+ C
Sbjct: 394 IIIISTQSELVYCLAVVKSAELN-----IIGQNFMTGYRVVFDREKLVLGWKKFDC 444
>gi|168065778|ref|XP_001784824.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162663621|gb|EDQ50376.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 392
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 69/232 (29%), Positives = 110/232 (47%), Gaps = 29/232 (12%)
Query: 241 NLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYY 300
+LD F++CL+ + TT TS+L+ S GL YTP + S + +Y
Sbjct: 180 DLDVFAFCLVPYT-AATTLTSALVF---GSRDATNALGLVYTPLLQGTSPS-------FY 228
Query: 301 YVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM-- 358
+VG+ ++V G + G + DSGT T+ APE+++PL +
Sbjct: 229 WVGMVGVSVAGVDAGIPTALFA----STDGVLFDSGTPLTYFAPEIYDPLHQSIAGAIPY 284
Query: 359 VKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHF----KGGAEV--TLPVENYFA 412
+ A+ A+ L R CFD+ G ++ P + HF GA V L +EN +
Sbjct: 285 PVAPDPVDAVVAKPLN--RLCFDLAGVQSPVLPTMAYHFTDADAAGATVDFDLGLENIY- 341
Query: 413 VVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ + CL +V R SG PSI+ GN Q N+Y+E+D+ R+G+ + C
Sbjct: 342 MNDMNTVWCLAIV--RGESGNPSIV-GNIQQANHYIEHDVALNRIGWTSKDC 390
Score = 45.4 bits (106), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 31/77 (40%), Positives = 40/77 (51%), Gaps = 9/77 (11%)
Query: 78 TTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVW---FPCTNHYQCKYCSSSKIPSF 134
TT I++ S+ Y I L FGTP Q ++DTGS LVW PC N Y ++ P F
Sbjct: 98 TTPITAESFE-YVIPLFFGTPLQPFTGMVDTGSDLVWIQCLPCINCY-----TTHPHPEF 151
Query: 135 IPKLSSSSRLLGCQNPK 151
P SSS + C +P
Sbjct: 152 DPTTSSSEAYVPCTDPA 168
>gi|291002742|gb|ADD71503.1| xyloglucanase inhibitor 1 [Humulus lupulus]
Length = 443
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 106/408 (25%), Positives = 165/408 (40%), Gaps = 78/408 (19%)
Query: 97 TPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIH 156
TPP + +LD G +W C Y+ SS+ R + C +P+C +
Sbjct: 56 TPPVQLKVVLDVGGEFLWIDCEKGYK----------------SSTKRPVPCGSPQC--VL 97
Query: 157 HESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR----IIPNFL 212
S C ++ P V L E I ++ N N +PN L
Sbjct: 98 SGSGACTTSDNPSDVGVCGVMPNNPFSSVGTSGDLFEDILYIQSTNGFNPGKQVSVPNLL 157
Query: 213 VGC---SVLSSRQPA--GIAGFGRGKTSLPSQLNLD-----KFSYCLLSHKFDDTTRTSS 262
C S+L G+AGFGR K +LPS + KF CL S
Sbjct: 158 FSCAPNSLLEGLASGIIGMAGFGRNKVALPSLFSSAFSFPRKFGVCLSSSNGVIFFGKEP 217
Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNP----SVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
+L G SD T LTYTP + NP S E N S Y++G++ I V G+ +R+
Sbjct: 218 YVLLPGIDVSDP--TSLTYTPLIQNPRSLVSSFEGNP-SAEYFIGVKSIKVDGKPLRLNT 274
Query: 319 KYLTLDRDG-NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG-----AEA 372
LT D +G +GGT + + FT + +++ + FV +ALG +A
Sbjct: 275 TLLTFDNEGGHGGTKISTVDPFTTLETSIYKAVVGAFV----------KALGPKVPRVKA 324
Query: 373 LTGLRPCFD---VPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR 428
+ CF+ + + G + P++ L + ++ N VG+ +CL V
Sbjct: 325 VAPFGACFNAKYIGNTRVGPAVPQIDLVLRNDKLWSIFGANSMVSVGD-DVLCLGFV--- 380
Query: 429 EASGGP-------------SIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
GGP ++++G Q++N ++ +DL RLGF L
Sbjct: 381 --DGGPLNFVDWGVKFTPTAVVIGGHQIENNFLLFDLGASRLGFSSSL 426
>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
Length = 459
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 116/431 (26%), Positives = 171/431 (39%), Gaps = 73/431 (16%)
Query: 55 SLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGG-----YSISLSFGTPPQIIPFILDTG 109
SL+ L ++K + + + +I +H G Y +++ GTP ++DTG
Sbjct: 81 SLSERLRRSRARSKYIMSRASKSNVSIPTHLGGSVDSLEYVVTVGLGTPAVSQVLLIDTG 140
Query: 110 SHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRD 164
S L W QC C+S+ K P F P SS+ + C C + + D
Sbjct: 141 SDLSWV------QCAPCNSTTCYPQKDPLFDPSRSSTYAPIPCNTDACRDLTRDGYG-SD 193
Query: 165 CNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNL-PNRIIPNFLVGCSVLS--- 219
C TS + Y + YG G T G+ +ETL + P + +F GC
Sbjct: 194 C------TSGSGGGAQCGYAITYGDGSQTTGVYSNETLTMAPGVTVKDFHFGCGHDQDGP 247
Query: 220 SRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKT 276
+ + G+ G G SL Q + FSYCL + + L G+ +D
Sbjct: 248 NDKYDGLLGLGGAPESLVVQTSSVYGGAFSYCLPA------ANDQAGFLALGAPVNDA-- 299
Query: 277 TGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSG 336
+G +TP V E+ F Y V + ITVGG+ + V + GG I+DSG
Sbjct: 300 SGFVFTPMVR-----EQQTF---YVVNMTGITVGGEPIDVPPSAFS------GGMIIDSG 345
Query: 337 TTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLH 396
T T + + L F M Y E L C++ G + P + L
Sbjct: 346 TVVTELQHTAYAALQAAFRKAMAA---YPLLPNGE----LDTCYNFTGHSNVTVPRVALT 398
Query: 397 FKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPS---IILGNFQMQNYYVEYDLR 453
F GGA V L V + + CL +EA GP ILGN + V YD+
Sbjct: 399 FSGGATVDLDVPDGILLDN-----CLAF---QEA--GPDNQPGILGNVNQRTLEVLYDVG 448
Query: 454 NQRLGFKQQLC 464
+ R+GF C
Sbjct: 449 HGRVGFGADAC 459
>gi|125596976|gb|EAZ36756.1| hypothetical protein OsJ_21092 [Oryza sativa Japonica Group]
Length = 435
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 101/396 (25%), Positives = 151/396 (38%), Gaps = 68/396 (17%)
Query: 89 YSISLSFGTPPQIIPFILDTG---SHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLL 145
Y + +G P Q P DT S L PC C P+F P SSS +
Sbjct: 88 YRVLAGYGAPAQRFPVAFDTNFGVSVLRCKPCVGGAPCD-------PAFEPSRSSSFAAI 140
Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLP- 204
C +P+C+ ++C + CP + + G + +TL LP
Sbjct: 141 PCGSPECA------VECTGAS-------------CPFTIQFGNVTVANGTLVRDTLTLPP 181
Query: 205 NRIIPNFLVGCSVLSSRQ-----PAGIAGFGRGKTSLPSQL-------NLDKFSYCLLSH 252
+ F GC + + G+ R SL S++ + FSYCL S
Sbjct: 182 SATFAGFTFGCIEVGADADTFDGAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLPSS 241
Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
+ +S L G+S + + Y P +NP+ Y+V L I+VGG+
Sbjct: 242 ----SATSSRGFLSIGASRPEYSGGDIKYAPMSSNPNHPNS------YFVDLVGISVGGE 291
Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
+ V GT++++ T FTF+AP + L D F M A
Sbjct: 292 DLPVPPAVFAAH-----GTLLEAATEFTFLAPAAYAALRDAFRKDMAPYP------AAPP 340
Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVEN--YFAVVGE--GSAVCLTVVTDR 428
L C+++ G + + P + L F GG E+ L V YFA S CL
Sbjct: 341 FRVLDTCYNLTGLASLAVPAVALRFAGGTELELDVRQMMYFADPSSVFSSVACLAFAAAP 400
Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ S+I G ++ V YDLR R+GF C
Sbjct: 401 LPAFPVSVI-GTLAQRSTEVVYDLRGGRVGFIPGRC 435
>gi|125555051|gb|EAZ00657.1| hypothetical protein OsI_22678 [Oryza sativa Indica Group]
Length = 435
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 101/396 (25%), Positives = 152/396 (38%), Gaps = 68/396 (17%)
Query: 89 YSISLSFGTPPQIIPFILDTG---SHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLL 145
Y + +G P Q P DT S L PC C P+F P SSS +
Sbjct: 88 YRVLAGYGAPAQRFPVAFDTNFGVSVLRCKPCVGGAPCD-------PAFEPSRSSSFAAI 140
Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLP- 204
C +P+C+ ++C + CP + + G + +TL LP
Sbjct: 141 PCGSPECA------VECTGAS-------------CPFTIQFGNVTVANGTLVRDTLTLPP 181
Query: 205 NRIIPNFLVGCSVLSSRQ-----PAGIAGFGRGKTSLPSQL-------NLDKFSYCLLSH 252
+ F GC + + G+ R SL S++ + FSYCL S
Sbjct: 182 SATFAGFTFGCIEVGADADTFDGAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLPSS 241
Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
+ +S L G+S + + Y P +NP+ Y+V L I+VGG+
Sbjct: 242 ----SATSSRGFLSIGASRPEYSGGDIKYAPMSSNPNHPNS------YFVELVGISVGGE 291
Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
+ V GT++++ T FTF+AP + L D F R+ A
Sbjct: 292 DLPVPPAVFAAH-----GTLLEAATEFTFLAPAAYAALRDAF------RRDMAPYPAAPP 340
Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVEN--YFAVVGE--GSAVCLTVVTDR 428
L C+++ G + + P + L F GG E+ L V YFA S CL
Sbjct: 341 FRVLDTCYNLTGLASLAVPTVALRFAGGTELELDVRQMMYFADPSSVFSSVACLAFAAAP 400
Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ S+I G ++ V YDLR R+GF C
Sbjct: 401 LPAFPVSVI-GTLAQRSTEVVYDLRGGRVGFIPGRC 435
>gi|225432542|ref|XP_002277699.1| PREDICTED: basic 7S globulin-like [Vitis vinifera]
Length = 435
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 98/398 (24%), Positives = 163/398 (40%), Gaps = 72/398 (18%)
Query: 102 IPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQ 161
IP LD G +W C Y +SSS R + C + +CS ++
Sbjct: 58 IPLTLDLGGQFLWVDCDQGY----------------VSSSYRPVRCGSAQCSLTRSKA-- 99
Query: 162 CRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL-------PNRIIP--NFL 212
C +C P+ T + + G+ T G + +++ P R++ L
Sbjct: 100 CGECFSGPVKGCNYSTCVLSPDNTVTGTA-TSGEVGEDAVSIQSTDGSNPGRVVSVRRLL 158
Query: 213 VGCSV------LSSRQPAGIAGFGRGKTSLPSQLNL-----DKFSYCLLSHKFDDTTRTS 261
C L+SR G+AG GR + +LPSQ + KFS CL S T T
Sbjct: 159 FTCGSTFLLEGLASRV-KGMAGLGRSRVALPSQFSSAFSFNRKFSICLSS----STKSTG 213
Query: 262 SLILDNGSSHSDKKTTG---LTYTPFVNNPSVAERNAF-----SVYYYVGLRRITVGGQR 313
+ +G K LTYTP + NP V+ +A+ SV Y++G++ I + G+
Sbjct: 214 VVFFGDGPYVLLPKVDASQSLTYTPLITNP-VSTASAYFQGEASVEYFIGVKSIKINGKA 272
Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
V + L++D G GGT + + +T + +++ + F+ ++ TR
Sbjct: 273 VPLNATLLSIDSQGYGGTKISTVHPYTVLETSIYKAVTQAFLKEL---STITRVASVSPF 329
Query: 374 TGLRPCFDVPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAV-------CLTVV 425
D+ + G + P + L + + Y+ V G S V CL V
Sbjct: 330 GACFSSKDIGSTRVGPAVPPIDLVLQRQSV-------YWRVFGANSMVQVSDNVLCLGFV 382
Query: 426 TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
D + SI++G Q+++ +++DL RLGF L
Sbjct: 383 -DGGVNPRTSIVIGGRQLEDNLLQFDLATSRLGFSSSL 419
>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 521
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 109/395 (27%), Positives = 154/395 (38%), Gaps = 63/395 (15%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSSSR 143
Y ++L GTP ++DTGS L W QCK C + K P F P SSS
Sbjct: 171 YVVTLGIGTPAVQQTVLIDTGSDLSWV------QCKPCGAGECYAQKDPLFDPSSSSSYA 224
Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLN 202
+ C + C + + C S +C Y + YG+ T G+ +ETL
Sbjct: 225 SVPCDSDACRKLAAGAYG-HGCT----GVSGGAAALC-EYGIEYGNRATTTGVYSTETLT 278
Query: 203 L-PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFD 255
L P ++ +F GC + G+ G G SL SQ + FSYCL
Sbjct: 279 LKPGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFSYCL------ 332
Query: 256 DTTRTSSLILDNGS---SHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
T + L G+ S S +GL++TP PSV +Y V L I+VGG
Sbjct: 333 PPTSGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSV------PTFYIVTLTGISVGGA 386
Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
+ + + G ++DSGT T + + L F S M + R + G
Sbjct: 387 PLAIPPSAFS------SGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGV- 439
Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV---TDRE 429
L C+D G + P + L F GGA + L V G CL TD
Sbjct: 440 ---LDTCYDFTGHANVTVPTISLTFSGGATIDLAAPAGVLVDG-----CLAFAGAGTDNA 491
Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN + + V YD +GF+ C
Sbjct: 492 IG-----IIGNVNQRTFEVLYDSGKGTVGFRAGAC 521
>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 96/398 (24%), Positives = 160/398 (40%), Gaps = 56/398 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
G Y + G+PP+ +DTGS ++W C + C S +I F P S ++
Sbjct: 79 GLYYTKIRLGSPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTATP 138
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLN- 202
+ C + +CSW S D + N +C +Y YG G T G +S+ L
Sbjct: 139 VSCSDQRCSWGIQSS-------DSGCSVQNN---LC-AYTFQYGDGSGTSGFYVSDVLQF 187
Query: 203 --------LPNRIIPNFLVGCS-------VLSSRQPAGIAGFGRGKTSLPSQLNLDKFSY 247
+PN P + GCS V S R GI GFG+ S+ SQL +
Sbjct: 188 DMIVGSSLVPNSTAP-VVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGLAP 246
Query: 248 CLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRI 307
+ SH +++ + + +TP V PS +Y V L I
Sbjct: 247 RVFSHCLKGENGGGGILV-----LGEIVEPNMVFTPLV--PS-------QPHYNVNLLSI 292
Query: 308 TVGGQRVRVWHKYLTLDRDGNG-GTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
+V GQ + + + NG GTI+D+GTT +++ + P + + + ++
Sbjct: 293 SVNGQALPINPSVFSTS---NGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVV 349
Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT 426
+ G + C+ + FP + L+F GGA + L ++Y + +
Sbjct: 350 SKGNQ-------CYVIATSVADIFPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIG 402
Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ ILG+ +++ YDL QR+G+ C
Sbjct: 403 FQRIQNQGITILGDLVLKDKIFVYDLVGQRIGWANYDC 440
>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 103/388 (26%), Positives = 154/388 (39%), Gaps = 72/388 (18%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y I++ G+P ++DTGS + W C S+ + F P S++ C
Sbjct: 129 YVITVGIGSPAVTQTMMIDTGSDVSWVRCN--------STDGLTLFDPSKSTTYAPFSCS 180
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNL-PNR 206
+ C+ + + C ++ C Y V YG G T G S+TL L +
Sbjct: 181 SAACAQLGNNGDGC---------SNSGC-----QYRVQYGDGSNTTGTYSSDTLALSASD 226
Query: 207 IIPNFLVGCS----VLSSRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTR 259
+ +F GCS + G+ G G SL SQ FSYCL T R
Sbjct: 227 TVTDFHFGCSHHEEDFDGEKIDGLMGLGGDAQSLVSQTAATYGKSFSYCL-----PPTNR 281
Query: 260 TSSLI---LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
TS + NG+S G TP + P Y V L+ I+VGG + +
Sbjct: 282 TSGFLTFGAPNGTSG------GFVTTPMLRWPKAP------TLYGVLLQDISVGGTPLGI 329
Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
L+ G+++DSGT T++ + L+ F S M + R+ A L L
Sbjct: 330 QPSVLS------NGSVMDSGTVITWLPRRAYSALSSAFRSSMTRLRHQR----AAPLGIL 379
Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
C+D G S P + L GGA V L +G+ + + A+ G SI
Sbjct: 380 DTCYDFTGLVNVSIPAVSLVLDGGAVVDL----------DGNGIMIQDCLAFAATSGDSI 429
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I GN Q + + V +D+ GF+ C
Sbjct: 430 I-GNVQQRTFEVLHDVGQGVFGFRSGAC 456
>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
Length = 454
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 96/400 (24%), Positives = 159/400 (39%), Gaps = 57/400 (14%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
G Y + GTPP+ +DTGS ++W C C S + F P+ SS++
Sbjct: 39 GLYYTRIELGTPPRPFYVQIDTGSDILWVNCKPCNACPLTSGLGVALNFFDPRGSSTASP 98
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLN- 202
L C + KC + ++ T + C Y YG G T G +S+ +
Sbjct: 99 LSCIDSKC-------VSSNQISESVCTTDRYC-----GYSFEYGDGSGTLGYYVSDEFDY 146
Query: 203 -------LPNRIIPNFLVGCSVLSS-------RQPAGIAGFGRGKTSLPSQLNLDKFSYC 248
+ N GCS S R GI GFG+ S+ SQLN +
Sbjct: 147 NQYVNQYVTNNASAKITFGCSYNQSGDLTKPDRAVDGIFGFGQNDLSVVSQLNSQGLAPK 206
Query: 249 LLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
+ SH + +++ + G+ YTP V PS +Y + L+ I
Sbjct: 207 IFSHCLEGADPGGGILV-----LGEITEPGMVYTPIV--PS-------QPHYNLNLQGIA 252
Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
V GQ++ + + GTI+D GTT ++A E +EP + ++ + ++
Sbjct: 253 VNGQQLSIDPQVFATTN--TRGTIIDCGTTLAYLAEEAYEPFVNTIIAAVSQSTQPFMLK 310
Query: 369 GAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTV---V 425
G PCF FP + L+F+G P + + S+ +
Sbjct: 311 G-------NPCFLTVHSIDEIFPSVTLYFEGAPMDLKPKDYLIQQLSPDSSPVWCIGWQK 363
Query: 426 TDREASGGPSI-ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ ++A+ + ILG+ +++ YDL NQR+G+ C
Sbjct: 364 SGQQATDSSKMTILGDLVLKDKVFVYDLENQRIGWTSFDC 403
>gi|54290724|dbj|BAD62394.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 523
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 101/396 (25%), Positives = 151/396 (38%), Gaps = 68/396 (17%)
Query: 89 YSISLSFGTPPQIIPFILDTG---SHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLL 145
Y + +G P Q P DT S L PC C P+F P SSS +
Sbjct: 176 YRVLAGYGAPAQRFPVAFDTNFGVSVLRCKPCVGGAPCD-------PAFEPSRSSSFAAI 228
Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLP- 204
C +P+C+ ++C + CP + + G + +TL LP
Sbjct: 229 PCGSPECA------VECTGAS-------------CPFTIQFGNVTVANGTLVRDTLTLPP 269
Query: 205 NRIIPNFLVGCSVLSSRQ-----PAGIAGFGRGKTSLPSQL-------NLDKFSYCLLSH 252
+ F GC + + G+ R SL S++ + FSYCL S
Sbjct: 270 SATFAGFTFGCIEVGADADTFDGAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLPSS 329
Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
+ +S L G+S + + Y P +NP+ Y+V L I+VGG+
Sbjct: 330 ----SATSSRGFLSIGASRPEYSGGDIKYAPMSSNPNHPNS------YFVDLVGISVGGE 379
Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
+ V GT++++ T FTF+AP + L D F M A
Sbjct: 380 DLPVPPAVFAAH-----GTLLEAATEFTFLAPAAYAALRDAFRKDMAPYP------AAPP 428
Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVEN--YFAVVGE--GSAVCLTVVTDR 428
L C+++ G + + P + L F GG E+ L V YFA S CL
Sbjct: 429 FRVLDTCYNLTGLASLAVPAVALRFAGGTELELDVRQMMYFADPSSVFSSVACLAFAAAP 488
Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ S+I G ++ V YDLR R+GF C
Sbjct: 489 LPAFPVSVI-GTLAQRSTEVVYDLRGGRVGFIPGRC 523
>gi|51091919|dbj|BAD35188.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|125596474|gb|EAZ36254.1| hypothetical protein OsJ_20576 [Oryza sativa Japonica Group]
gi|196212950|gb|ACG76111.1| S5 [Oryza sativa Japonica Group]
gi|340810891|gb|AEK75372.1| S5 [Oryza sativa]
gi|340810893|gb|AEK75373.1| S5 [Oryza sativa]
gi|340810899|gb|AEK75376.1| S5 [Oryza sativa]
gi|340810901|gb|AEK75377.1| S5 [Oryza sativa]
gi|340810933|gb|AEK75393.1| S5 [Oryza sativa]
gi|340810947|gb|AEK75400.1| S5 [Oryza sativa]
gi|340810949|gb|AEK75401.1| S5 [Oryza sativa]
gi|340810967|gb|AEK75410.1| S5 [Oryza sativa]
gi|340810969|gb|AEK75411.1| S5 [Oryza sativa]
gi|340810999|gb|AEK75426.1| S5 [Oryza rufipogon]
gi|340811017|gb|AEK75435.1| S5 [Oryza rufipogon]
gi|340811029|gb|AEK75441.1| S5 [Oryza nivara]
gi|340811051|gb|AEK75452.1| S5 [Oryza nivara]
gi|340811075|gb|AEK75464.1| S5 [Oryza nivara]
gi|340811077|gb|AEK75465.1| S5 [Oryza rufipogon]
gi|340811085|gb|AEK75469.1| S5 [Oryza nivara]
gi|340811096|gb|AEK75474.1| S5 [Oryza rufipogon]
gi|340811100|gb|AEK75476.1| S5 [Oryza rufipogon]
gi|340811114|gb|AEK75483.1| S5 [Oryza nivara]
Length = 472
Score = 86.7 bits (213), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 111/435 (25%), Positives = 183/435 (42%), Gaps = 79/435 (18%)
Query: 62 IKNPQTKTTTTTTTTTTTNISSHSYGGYS--ISLSFGTPPQIIPFILDTGSHLVWF---P 116
+ N Q + T++++T I S + +++S G PP + +DTGS L W P
Sbjct: 85 LNNLQEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQP 144
Query: 117 CTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE-SIQCRDCNDEPLATSKN 175
C H C S+ P F P S +SR + C + KC + ++ +Q +C + +
Sbjct: 145 CAVH--CHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANC----MEKEDS 198
Query: 176 CTQICPSYLVLYGSGL--TEGIALSETLNLPNRIIPNFLVGCS--VLSSRQPAGIAGFGR 231
CT Y V YG+G + G +++TL + + + + + GCS V S AGI GFG
Sbjct: 199 CT-----YSVTYGNGWAYSVGKMVTDTLRIGDSFM-DLMFGCSMDVKYSEFEAGIFGFGS 252
Query: 232 GK-------TSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPF 284
P L+ SYCL + D T+ +IL D+ YTP
Sbjct: 253 SSFSFFEQLAGYPDILSYKALSYCLPT----DETKPGYMIL----GRYDRAAMDGGYTPL 304
Query: 285 ---VNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTF 341
+N P+ Y + + + GQR+ + IVDSG T
Sbjct: 305 FRSINRPT----------YSLTMEMLIANGQRLVT----------SSSEMIVDSGAQRTS 344
Query: 342 MAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTG------------S 389
+ P F L D+ ++Q + + Y R A + + C+ + +G +
Sbjct: 345 LWPSTFA-LLDKTITQAMSSIGYHRTSRARQESYI--CYLSEHDYSGWNGTITPFSNWSA 401
Query: 390 FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVE 449
P L++ F GGA + LP N F +C+T + S ILGN +++
Sbjct: 402 LPLLEIGFAGGAALALPPRNVF-YNDPHRGLCMTFAQNPALR---SQILGNRVTRSFGTT 457
Query: 450 YDLRNQRLGFKQQLC 464
+D++ ++ GFK +C
Sbjct: 458 FDIQGKQFGFKYAVC 472
>gi|340810993|gb|AEK75423.1| S5 [Oryza rufipogon]
gi|340811015|gb|AEK75434.1| S5 [Oryza nivara]
gi|340811021|gb|AEK75437.1| S5 [Oryza nivara]
Length = 474
Score = 86.7 bits (213), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 111/435 (25%), Positives = 183/435 (42%), Gaps = 79/435 (18%)
Query: 62 IKNPQTKTTTTTTTTTTTNISSHSYGGYS--ISLSFGTPPQIIPFILDTGSHLVWF---P 116
+ N Q + T++++T I S + +++S G PP + +DTGS L W P
Sbjct: 87 LNNLQEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQP 146
Query: 117 CTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE-SIQCRDCNDEPLATSKN 175
C H C S+ P F P S +SR + C + KC + ++ +Q +C + +
Sbjct: 147 CAVH--CHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANC----MEKEDS 200
Query: 176 CTQICPSYLVLYGSGL--TEGIALSETLNLPNRIIPNFLVGCS--VLSSRQPAGIAGFGR 231
CT Y V YG+G + G +++TL + + + + + GCS V S AGI GFG
Sbjct: 201 CT-----YSVTYGNGWAYSVGKMVTDTLRIGDSFM-DLMFGCSMDVKYSEFEAGIFGFGS 254
Query: 232 GK-------TSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPF 284
P L+ SYCL + D T+ +IL D+ YTP
Sbjct: 255 SSFSFFEQLAGYPDILSYKALSYCLPT----DETKPGYMIL----GRYDRAAMDGGYTPL 306
Query: 285 ---VNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTF 341
+N P+ Y + + + GQR+ + IVDSG T
Sbjct: 307 FRSINRPT----------YSLTMEMLIANGQRLVT----------SSSEMIVDSGAQRTS 346
Query: 342 MAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTG------------S 389
+ P F L D+ ++Q + + Y R A + + C+ + +G +
Sbjct: 347 LWPSTFA-LLDKTITQAMSSIGYHRTSRARQESYI--CYLSEHDYSGWNGTITPFSNWSA 403
Query: 390 FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVE 449
P L++ F GGA + LP N F +C+T + S ILGN +++
Sbjct: 404 LPLLEIGFAGGAALALPPRNVF-YNDPHRGLCMTFAQNPALR---SQILGNRVTRSFGTT 459
Query: 450 YDLRNQRLGFKQQLC 464
+D++ ++ GFK +C
Sbjct: 460 FDIQGKQFGFKYAVC 474
>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 86.7 bits (213), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 103/407 (25%), Positives = 166/407 (40%), Gaps = 72/407 (17%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWF---PCTNHYQCKYCSSSKIPSFIPKLSSSSR 143
G Y + GTPP +DTGS + W PCT+ S K+ ++ P SS+
Sbjct: 35 GLYYTKIYLGTPPVGYYVQVDTGSDVTWLNCAPCTSCVTETQLPSIKLTTYDPSRSSTDG 94
Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETL- 201
L C++ C N+ ++ C +Y YG G T+G + + +
Sbjct: 95 ALSCRDSNCG-------AALGSNEVSCTSAGYC-----AYSTTYGDGSSTQGYFIQDVMT 142
Query: 202 ------NLPNRIIPNFLVGCS-------VLSSRQPAGIAGFGRGKTSLPSQLNL-----D 243
N + GC ++SSR G+ GFG+ S+PSQL +
Sbjct: 143 FQEIHNNTQVNGTASVYFGCGTTQSGNLLMSSRALDGLIGFGQAAVSIPSQLASMGKVGN 202
Query: 244 KFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVG 303
+F++CL D ++++ + S + ++YTP V+ RN +Y VG
Sbjct: 203 RFAHCLQG----DNQGGGTIVIGSVSEPN------ISYTPIVS------RN----HYAVG 242
Query: 304 LRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRN 363
++ I V G+ V + T GG I+DSGTT + L +P +FV N
Sbjct: 243 MQNIAVNGRNVTTPASFDTTSTSA-GGVIMDSGTTLAY----LVDPAYTQFV-------N 290
Query: 364 YTRALGAEALTGLRPCFDVPG-EKTGSFPELKLHFKGGAEVTLPVENYF---AVVGEGSA 419
+ + C + FP +KL F GA + L NY + +A
Sbjct: 291 AVSTFESSMFSSHSQCLQLAWCSLQADFPTVKLFFDAGAVMNLTPRNYLYSQPLQNGQAA 350
Query: 420 VCLTVVTDREASGGPSI-ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
C+ +G S ILG+ ++++ V YD N+ +G+K CK
Sbjct: 351 YCMGWQKSTTKAGYLSYSILGDIVLKDHLVVYDNDNRVVGWKSFDCK 397
>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
Length = 469
Score = 86.7 bits (213), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 103/402 (25%), Positives = 165/402 (41%), Gaps = 64/402 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
G Y + G+PP +DTGS ++W C++ C + S I F S ++
Sbjct: 98 GLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGS 157
Query: 145 LGCQNPKCSWIHH-ESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLN 202
+ C +P CS + + QC + N C Y YG G T G +++T
Sbjct: 158 VTCSDPICSSVFQTTAAQCSENN--------QC-----GYSFRYGDGSGTSGYYMTDTFY 204
Query: 203 ----LPNRIIPN----FLVGCSVL-------SSRQPAGIAGFGRGKTSLPSQLNLDKFSY 247
L ++ N + GCS S + GI GFG+GK S+ SQL+ +
Sbjct: 205 FDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITP 264
Query: 248 CLLSHKFD-DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
+ SH D + +L + G+ Y+P V PS +Y + L
Sbjct: 265 PVFSHCLKGDGSGGGVFVL------GEILVPGMVYSPLV--PS-------QPHYNLNLLS 309
Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
I V GQ + + + GTIVD+GTT T++ E + D F++ + N
Sbjct: 310 IGVNGQMLPL--DAAVFEASNTRGTIVDTGTTLTYLVKEAY----DLFLNAI---SNSVS 360
Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENY---FAVVGEGSAVCLT 423
L ++ C+ V + FP + L+F GGA + L ++Y + + S C+
Sbjct: 361 QLVTPIISNGEQCYLVSTSISDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIG 420
Query: 424 VVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
E ILG+ +++ YDL QR+G+ CK
Sbjct: 421 FQKAPEE----QTILGDLVLKDKVFVYDLARQRIGWASYDCK 458
>gi|147801500|emb|CAN61502.1| hypothetical protein VITISV_011733 [Vitis vinifera]
Length = 415
Score = 86.7 bits (213), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 98/396 (24%), Positives = 163/396 (41%), Gaps = 60/396 (15%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y S++ TP + ++D G +W C +Y SSS P + GC
Sbjct: 44 YVTSINQRTPLVPLQLVVDLGGQFLWVDCEQNY----VSSSYRPGAVQP--------GCN 91
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
N CS + ++ R + + LA Q GS +++S+
Sbjct: 92 NNTCSVLPDNTV-TRTASSDELAEDAVSVQSTD------GSNPGRSVSVSK--------- 135
Query: 209 PNFLVGCSVLS-----SRQPAGIAGFGRGKTSLPSQLNLD-----KFSYCLLSHKFDDTT 258
FL C+ S + G+AG GR + +LPSQ KF+ CL S TT
Sbjct: 136 --FLFSCAPTSLLEGLASGAKGMAGLGRTRIALPSQFASAFSFHRKFAICLSS----STT 189
Query: 259 RTSSLILDNGSSH---SDKKTTGLTYTPFVNNP----SVAERNAFSVYYYVGLRRITVGG 311
++L +GS + + L YTP + NP S + S Y++G++ I +
Sbjct: 190 ADGVILLGDGSYGLLPNVDASQLLIYTPLILNPVSTASAHSQGEPSAEYFIGVKSIQINE 249
Query: 312 QRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
+ V + L+++ G GGT + + +T M ++ F+S + N TR
Sbjct: 250 KAVPLNTSLLSINSKGVGGTKISTVNPYTVMETSIYSAFTKAFISA-AASMNITR---VA 305
Query: 372 ALTGLRPCF---DVPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTD 427
A+ CF +V + G + P + L + + V V G +CL V D
Sbjct: 306 AVAPFSVCFSSKNVYSTRGGAAVPTIGLVLQNNSVVWRIFGANSMVFVNGDVLCLGFV-D 364
Query: 428 REASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
A+ SI++G +Q+++ +++DL RLGF L
Sbjct: 365 GGANPRTSIVIGGYQLEDNLLQFDLAASRLGFSSSL 400
>gi|356500210|ref|XP_003518926.1| PREDICTED: basic 7S globulin-like [Glycine max]
Length = 435
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 99/392 (25%), Positives = 167/392 (42%), Gaps = 64/392 (16%)
Query: 104 FILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCR 163
+LD G +W C N+Y +SS+ R C + +CS +S C
Sbjct: 60 LVLDIGGQFLWVDCDNNY----------------VSSTYRPARCGSAQCSLARSDS--CG 101
Query: 164 DCNDEPLATSKNCT-QICPSYLV---LYGSGLTEGIALSETLN----LPNRIIPNFLVGC 215
+C P N T + P V L + + ++ N + N + FL C
Sbjct: 102 NCFSAPKPGCNNNTCGVTPDNTVTGTATSGELAQDVVSLQSTNGFNPIQNATVSRFLFSC 161
Query: 216 SVLSSRQ-----PAGIAGFGRGKTSLPSQLN-----LDKFSYCLLSHK----FDDTTRTS 261
+ Q +G+AG GR + +LPSQL KF+ CL S F D
Sbjct: 162 APTFLLQGLATGVSGMAGLGRTRIALPSQLASAFSFRRKFAVCLSSSNGVAFFGDGPY-- 219
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFS-----VYYYVGLRRITVGGQRVRV 316
++L N + LT+TP + NP V+ +AFS Y++G++ I + + V +
Sbjct: 220 -VLLPNVDASQL-----LTFTPLLINP-VSTASAFSQGEPSAEYFIGVKSIKIDEKTVPL 272
Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
L+++ G GGT + S +T + +F+ + + FV + RN TR ++
Sbjct: 273 NTTLLSINSKGVGGTKISSVNPYTVLEDSIFKAVTEAFV-KASSARNITR---VASVAPF 328
Query: 377 RPCF---DVPGEKTG-SFPELKLHFKGGAEV-TLPVENYFAVVGEGSAVCLTVVTDREAS 431
CF +V + G + P ++L + V + N V + +CL V E +
Sbjct: 329 EVCFSRENVLATRLGAAVPTIELVLQNQKTVWRIFGANSMVSVSDDKVLCLGFVNGGE-N 387
Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
SI++G +Q+++ +++DL RLGF L
Sbjct: 388 PRTSIVIGGYQLEDNLLQFDLATSRLGFSSLL 419
>gi|255563741|ref|XP_002522872.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223537956|gb|EEF39570.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 448
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 105/402 (26%), Positives = 158/402 (39%), Gaps = 73/402 (18%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWF---PCTNHYQCKYCSSSKIPSFIPKLSSSSRLL 145
Y + + G+P + + DTGS L W PCT ++ P F S + R L
Sbjct: 91 YLVKVIIGSPGVPLYLVPDTGSGLFWTQCEPCTRRFR------QLPPIFNSTASRTYRDL 144
Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLN-L 203
CQ+ C+ + QCRD C Y + Y G T G+A + L
Sbjct: 145 PCQHQFCTN-NQNVFQCRD---------DKCV-----YRIAYAGGSATAGVAAQDILQSA 189
Query: 204 PNRIIPNFLVGCSVLSSRQPAGIAGF------------GRGKTSLPSQLN---LDKFSYC 248
N IP F GCS R + F SL Q+N ++FSYC
Sbjct: 190 ENDRIP-FYFGCS----RDNQNFSTFESSGKGGGIIGLNMSPVSLLQQMNHITKNRFSYC 244
Query: 249 LLSHKFDDTTRTSSLI-LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRI 307
L + +SL+ N S +K TPFV+ + Y++ L +
Sbjct: 245 LNLFDLSSPSHATSLLRFGNDIRKSRRKYLS---TPFVSPRGMPN-------YFLNLIDV 294
Query: 308 TVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRA 367
+V G R+++ L DG GGTI+DSGT T+++ + P+ F +NY
Sbjct: 295 SVAGNRMQIPPGTFALKPDGTGGTIIDSGTAVTYISQTAYFPVITAF-------KNYFDQ 347
Query: 368 LGAE----ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLT 423
G + L+G C+ G ++P + HF+G P Y V G A C+
Sbjct: 348 HGFQRVNIQLSGYI-CYKQQGHTFHNYPSMAFHFQGADFFVEPEYVYLTVQDRG-AFCVA 405
Query: 424 VVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
+ + S I+G N YD N++L F + C+
Sbjct: 406 L---QPISPQQRTIIGALNQANTQFIYDAANRQLLFTPENCQ 444
>gi|222613193|gb|EEE51325.1| hypothetical protein OsJ_32293 [Oryza sativa Japonica Group]
Length = 371
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 99/377 (26%), Positives = 154/377 (40%), Gaps = 60/377 (15%)
Query: 104 FILDTGSHLV---WFP-CTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHES 159
+ D G +V W P N QC +C +P F+P SS+ + C C
Sbjct: 35 LLADGGGAVVPFHWSPELYNCSQCIHCFKQDLPVFVPNASSTFKPEPCGTDVC------- 87
Query: 160 IQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL----PNRIIPNFLVGC 215
+ + T K + +C V G T GI ++T + P R + G
Sbjct: 88 --------KSIPTPKCASDVCAYDGVTGLGGHTVGIVATDTFAIGTAAPARPPAS---GA 136
Query: 216 SVLSSRQP----AGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSH 271
S ++ P +G G GR SL +Q+ L +FSYCL H DT + S L L
Sbjct: 137 SWRATSTPWAGPSGFIGLGRTPWSLVAQMKLTRFSYCLAPH---DTGKNSRLFL----GA 189
Query: 272 SDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGT 331
S K G +TPFV + + S YY + L I G +T+ R N
Sbjct: 190 SAKLAGGGAWTPFVKT---SPNDGMSQYYPIELEEIKAG-------DATITMPRGRNTVL 239
Query: 332 IVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFP 391
+ + + + +++ ++ + T +GA CF P P
Sbjct: 240 VQTAVVRVSLLVDSVYQEFKKAVMASVGAAPTAT-PVGAP----FEVCF--PKAGVSGAP 292
Query: 392 ELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT----DREASGGPSIILGNFQMQNYY 447
+L F+ GA +T+P NY VG + VCL+V++ + A G + ILG+FQ +N +
Sbjct: 293 DLVFTFQAGAALTVPPANYLFDVGNDT-VCLSVMSIALLNITALDGLN-ILGSFQQENVH 350
Query: 448 VEYDLRNQRLGFKQQLC 464
+ +DL L F+ C
Sbjct: 351 LLFDLDKDMLSFEPADC 367
>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 663
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 101/400 (25%), Positives = 167/400 (41%), Gaps = 77/400 (19%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y+ L GTPPQ+ I+DTGS + + PC+ C+ C + P F P+ SS+ + +
Sbjct: 110 GYYTTRLWIGTPPQMFALIVDTGSTVTYVPCST---CEQCGRHQDPKFQPESSSTYQPVK 166
Query: 147 CQNPKCSWIHHESIQCRDCNDEPL--ATSKNCTQICPSYLVLYGSGLTEGIALSETLNLP 204
C +I C +C+ + + + ++ S VL ++ G +++ P
Sbjct: 167 C-----------TIDC-NCDGDRMQCVYERQYAEMSTSSGVLGEDVISFG---NQSELAP 211
Query: 205 NRIIPNFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQLN-----LDKFSYCLLSHKF 254
R + GC L S+ GI G GRG S+ QL D FS C
Sbjct: 212 QRAV----FGCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDV 267
Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
+++L S SD +T+ ++P S YY + L+ + V G+R+
Sbjct: 268 GG----GAMVLGGISPPSD-----MTFA--YSDPDR------SPYYNIDLKEMHVAGKRL 310
Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
+ DG GT++DSGTT+ ++ F D V ++ + ++
Sbjct: 311 PLNANVF----DGKHGTVLDSGTTYAYLPEAAFLAFKDAIVKELQSLKQ---------IS 357
Query: 375 GLRP-----CFDVPG----EKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTV 424
G P CF G + + SFP + + F G + +L ENY F A CL +
Sbjct: 358 GPDPNYNDICFSGAGNDVSQLSKSFPVVDMVFGNGHKYSLSPENYMFRHSKVRGAYCLGI 417
Query: 425 VTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ + +LG ++N V YD ++GF + C
Sbjct: 418 F---QNGNDQTTLLGGIIVRNTLVMYDREQTKIGFWKTNC 454
>gi|357443039|ref|XP_003591797.1| Xyloglucan-specific endoglucanase inhibitor protein [Medicago
truncatula]
gi|355480845|gb|AES62048.1| Xyloglucan-specific endoglucanase inhibitor protein [Medicago
truncatula]
Length = 436
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 104/434 (23%), Positives = 176/434 (40%), Gaps = 88/434 (20%)
Query: 65 PQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCK 124
P T+ T+ +T TT I TP I +D G W C Y
Sbjct: 35 PITRDTSASTPQYTTQIKQR------------TPLVPINLTIDLGGGYFWVNCDKSY--- 79
Query: 125 YCSSSKIPSFIPKLSSSSRLLGCQNPKCSWI-HHESIQCRDCNDEP--LATSKNCTQICP 181
+SS+ + + C + +CS H + C P + T + +
Sbjct: 80 -------------VSSTLKPILCSSSQCSLFGSHGCSDKKICGRSPYNIVTGVSTSGDIQ 126
Query: 182 SYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLS---SRQPAGIAGFGRGKTSLPS 238
S +V S T G +++PN + F+ G +V+ ++ G+AG GR K SLPS
Sbjct: 127 SDIVSVQS--TNGNYSGRFVSVPNFL---FICGSNVVQNGLAKGVKGMAGLGRTKVSLPS 181
Query: 239 QLN-----LDKFSYCLLSHKFDDTTRTSSLILDNGSSHSD-KKTTGLTYTPFVNNPSVAE 292
Q + +KF+ CL T+ L +G + ++ L YTP + NP
Sbjct: 182 QFSSAFSFKNKFAICL-------GTQNGVLFFGDGPYLFNFDESKNLIYTPLITNPVSTS 234
Query: 293 RNAF----SVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFE 348
++F SV Y++G++ I V + V++ L++D++G GGT + + +T M +++
Sbjct: 235 PSSFLGEKSVEYFIGVKSIRVSSKNVKLNTTLLSIDQNGFGGTKISTVNPYTIMETSIYK 294
Query: 349 PLADEFVSQMVKNRNYTRALGAEALTGLRP---CF---DVPGEKTG-SFPELKLHFKGGA 401
+AD FV +AL + + P CF + + G P + L +
Sbjct: 295 AVADAFV----------KALNVSTVEPVAPFGTCFASQSISSSRMGPDVPSIDLVLQNEN 344
Query: 402 EV-TLPVENYFAVVGEGSAVCLTVVTDRE----------ASGGP----SIILGNFQMQNY 446
V + N + + +CL V GG SI +G Q++N
Sbjct: 345 VVWNIIGANAMVRINDKDVICLGFVDAGSDFAKTSQVGFVVGGSKPMTSITIGAHQLENN 404
Query: 447 YVEYDLRNQRLGFK 460
+++DL RLGF+
Sbjct: 405 LLQFDLATSRLGFR 418
>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
Length = 489
Score = 85.9 bits (211), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 107/404 (26%), Positives = 159/404 (39%), Gaps = 70/404 (17%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC--KYCSSSKIPSFIPKLSSSSRL 144
G Y + GTPP+ +DTGS ++W C + +C K + + PK SSS
Sbjct: 82 GLYFTEIKLGTPPKRYYVQVDTGSDILWVNCISCEKCPRKSGLGLDLTFYDPKASSSGST 141
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICP-SYLVLYGSG-LTEGIALSETLN 202
+ C C+ + + CT P Y V+YG G T G +++ L
Sbjct: 142 VSCDQGFCAATYGGKL-------------PGCTANVPCEYSVMYGDGSSTTGFFVTDALQ 188
Query: 203 L----------PNRIIPNFLVGC----SVLSSRQP-AGIAGFGRGKTSLPSQLNLDK--- 244
P F G + SS Q GI GFG+ TS+ SQL
Sbjct: 189 FDQVTGDGQTQPGNATVTFGCGAQQGGDLGSSNQALDGILGFGQANTSMLSQLAAAGKVK 248
Query: 245 --FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYV 302
F++CL DT + + KTT P VA+ +Y V
Sbjct: 249 KIFAHCL------DTIKGGGIFAIGNVVQPKVKTT----------PLVADMP----HYNV 288
Query: 303 GLRRITVGGQRVRV-WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN 361
L+ I VGG +++ H + T +R GTI+DSGTT T++ PEL F M
Sbjct: 289 NLKSIDVGGTTLQLPAHVFETGERK---GTIIDSGTTLTYL-PELV------FKEVMAAI 338
Query: 362 RNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTL-PVENYFAVVGEGSAV 420
N + + + CF PG FP + HF+ + + P E +F + V
Sbjct: 339 FNKHQDIVFHNVQDFM-CFQYPGSVDDGFPTITFHFEDDLALHVYPHEYFFPNGNDMYCV 397
Query: 421 CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ G +++G+ + N V YDL NQ +G+ C
Sbjct: 398 GFQNGALQSKDGKDIVLMGDLVLSNKLVIYDLENQVIGWTDYNC 441
>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 412
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 107/446 (23%), Positives = 175/446 (39%), Gaps = 99/446 (22%)
Query: 39 NPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTP 98
NP + Q ++S+++ S+ R ++ + + + +SS GY +S S GTP
Sbjct: 43 NPKETQIQRISSILNYSINRVRYLNH---VFSFSPNKIQDVPLSSFMGAGYVMSYSIGTP 99
Query: 99 PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCS----- 153
P + ++DTG+ +WF C CK C + P F P SS+ + + C +P C
Sbjct: 100 PFQLYSLIDTGNDNIWFQCK---PCKPCLNQTSPMFHPSKSSTYKTIPCTSPICKNADGH 156
Query: 154 WIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLV 213
++ +++ N P++ N ++
Sbjct: 157 YLGVDTLTLNSNNGTPIS------------------------------------FKNIVI 180
Query: 214 GCSVLSSRQP-----AGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTTRTSSLIL 265
GC ++ P +G G RG S SQLN KFSYCL+ F +S L
Sbjct: 181 GCG-HRNQGPLEGYVSGNIGLARGPLSFISQLNSSIGGKFSYCLVP-LFSKENVSSKLHF 238
Query: 266 DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
+ S+ S G TP + E N Y+V L +VG +++ +
Sbjct: 239 GDKSTVSG---LGTVSTP------IKEENG----YFVSLEAFSVGDHIIKLE------NS 279
Query: 326 DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVK-------NRNYTRALGAEALTGLRP 378
D G +I+DSGTT T + +++ L + V MVK ++ + + T L
Sbjct: 280 DNRGNSIIDSGTTMTILPKDVYSRL-ESVVLDMVKLKRVKDPSQQFNLCYQTTSTTLLTK 338
Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
+ HF G+EV L N F + + +C V+ S I
Sbjct: 339 VLIITA-----------HFS-GSEVHLNALNTFYPITD-EVICFAFVSGGNFSS--LAIF 383
Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
GN QN+ V +DL + + FK C
Sbjct: 384 GNVVQQNFLVGFDLNKKTISFKPTDC 409
>gi|293333354|ref|NP_001169607.1| uncharacterized protein LOC100383488 [Zea mays]
gi|224030351|gb|ACN34251.1| unknown [Zea mays]
Length = 342
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 97/370 (26%), Positives = 149/370 (40%), Gaps = 63/370 (17%)
Query: 123 CKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPS 182
C C P F PKLSSS ++ C + C+ + + +C + +D C
Sbjct: 6 CVSCYRQLDPVFNPKLSSSYAVVPCTSDTCAQL--DGHRCHEDDD----------GACQY 53
Query: 183 YLVLYGSGLTEGIALSETLNLPNRIIPNFLVGCSVLSSRQPA----GIAGFGRGKTSLPS 238
G G+T+G + L + + + GCS S PA G+ G GRG SL S
Sbjct: 54 TYKYSGHGVTKGTLAIDKLAIGGDVFHAVVFGCSDSSVGGPAAQASGLVGLGRGPLSLVS 113
Query: 239 QLNLDKFSYCLLSHKFDDTTRTS-SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFS 297
QL++ +F YCL +RTS L+L G+ + +T T ++ +
Sbjct: 114 QLSVHRFMYCLPPP----MSRTSGKLVLGAGADAVRNMSDRVTVT-------MSSSTRYP 162
Query: 298 VYYYVGLRRITVGGQ-------------------RVRVWHKYLTLDRDGNGGTIVDSGTT 338
YYY+ L + VG Q + G IVD +T
Sbjct: 163 SYYYLNLDGLAVGDQTPGTTRNATSPPSGGAGGGGGGGGGGIVGAGGANAYGMIVDVAST 222
Query: 339 FTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP---GEKTGSFPELKL 395
+F+ L++ LAD+ ++ RA + L GL CF +P G P + L
Sbjct: 223 ISFLETSLYDELADDLEEEI----RLPRATPSLRL-GLDLCFILPEGVGMDRVYVPTVSL 277
Query: 396 HFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQ 455
F G L ++ V +G +CL + S ILGNFQ+QN V ++LR
Sbjct: 278 SFDG---RWLELDRDRLFVTDGRMMCLMIGRTSGVS-----ILGNFQLQNMRVLFNLRRG 329
Query: 456 RLGFKQQLCK 465
++ F + C
Sbjct: 330 KITFAKASCD 339
>gi|255571588|ref|XP_002526740.1| aspartic-type endopeptidase, putative [Ricinus communis]
gi|223533929|gb|EEF35654.1| aspartic-type endopeptidase, putative [Ricinus communis]
Length = 471
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 91/395 (23%), Positives = 149/395 (37%), Gaps = 54/395 (13%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + + G+PP I DTGS++VW C + C C KIP F P SS+ + C
Sbjct: 108 YVMKFNIGSPPVETYAIPDTGSNIVWIQCGSPI-CTNCYKQKIPLFNPTKSSTYAIRLCG 166
Query: 149 NPKCS---WIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPN 205
+ +C W E + C K+ Q+C ++ +EG ++ + P
Sbjct: 167 HRECKQALWGLGEYLGC-----------KSSVQVCRYHISYEDHSFSEGTISTDIITFPE 215
Query: 206 RIIP------NFLVGCSVLSSRQPA---------GIAGFGRGKTSLPSQLNLDKFSYCLL 250
I GC +S P G+ G G SL QL L +FSYC+
Sbjct: 216 HIAEFGNYSLRMFFGCGYNNSETPGQDPNSFTAPGVVGLGNEMASLVGQLTLGQFSYCIS 275
Query: 251 SHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
+ T + +S S T N Y + + I V
Sbjct: 276 TPDVQKPNGTIEIRFGLAASISGHST-------------ALANNLEGWYIFQNVDGIYVD 322
Query: 311 GQRVRVWHKYL-TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
+V+ + +++ G GG I+DSGTT+T EL+ D + ++ +
Sbjct: 323 DTKVKGYPEWVFQFAEGGIGGLIMDSGTTYT----ELYFSALDALIGELKEQIELAPDTQ 378
Query: 370 AEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGS-AVCLTVVTDR 428
+ + C++ P ++L F E P A + G+ CL +
Sbjct: 379 DHSNSNYSLCYNAANFLLTYVPAIELKFTDNKEAYFPFTLRNAWIDNGNDQYCLAMF--- 435
Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
+ G SII G +Q ++ + YDL+ + F +
Sbjct: 436 -GTSGISII-GIYQHRDIKIGYDLKYNLVSFTEMF 468
>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 507
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 101/401 (25%), Positives = 165/401 (41%), Gaps = 64/401 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
G Y + G+PP +DTGS ++W C++ C + S I F S ++
Sbjct: 98 GLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSFTAGS 157
Query: 145 LGCQNPKCSWIHH-ESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLN 202
+ C +P CS + + QC + N C Y YG G T G +++T
Sbjct: 158 VTCSDPICSSVFQTTAAQCSENN--------QC-----GYSFRYGDGSGTSGYYMTDTFY 204
Query: 203 ----LPNRIIPN----FLVGCSVL-------SSRQPAGIAGFGRGKTSLPSQLNLDKFSY 247
L ++ N + GCS S + GI GFG+GK S+ SQL+ +
Sbjct: 205 FDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITP 264
Query: 248 CLLSHKFD-DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
+ SH D + +L + G+ Y+P + PS +Y + L
Sbjct: 265 PVFSHCLKGDGSGGGVFVL------GEILVPGMVYSPLL--PS-------QPHYNLNLLS 309
Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEF---VSQMVKNRN 363
I V GQ + + + GTIVD+GTT T++ E ++P + VSQ+V
Sbjct: 310 IGVNGQILPI--DAAVFEASNTRGTIVDTGTTLTYLVKEAYDPFLNAISNSVSQLV---- 363
Query: 364 YTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLT 423
++ C+ V + FP + L+F GGA + L ++Y G +
Sbjct: 364 ------TLIISNGEQCYLVSTSISDMFPPVSLNFAGGASMMLRPQDYLFHYGFYDGASMW 417
Query: 424 VVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ ++A ILG+ +++ YDL QR+G+ C
Sbjct: 418 CIGFQKAP-EEQTILGDLVLKDKVFVYDLARQRIGWANYDC 457
>gi|125575542|gb|EAZ16826.1| hypothetical protein OsJ_32298 [Oryza sativa Japonica Group]
Length = 396
Score = 85.5 bits (210), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 100/391 (25%), Positives = 160/391 (40%), Gaps = 64/391 (16%)
Query: 93 LSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKC 152
+ GTPPQ I+D LVW C+ +C C +P FIP SS+ R C C
Sbjct: 47 FTIGTPPQPASAIIDVAGELVWTQCS---RCSRCFKQDLPLFIPNASSTFRPEPCGTDAC 103
Query: 153 SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFL 212
+S +C+ + + T ++ T I T GI +ET + +
Sbjct: 104 -----KSTPTSNCSGD-VCTYESTTNI------RLDRHTTLGIVGTETFAI-GTATASLA 150
Query: 213 VGCSVLSSRQP----AGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNG 268
GC V S +G G GR SL +Q+ L KFSYCL T ++S L L +
Sbjct: 151 FGCVVASDIDTMDGTSGFIGLGRTPRSLVAQMKLTKFSYCLSPRG---TGKSSRLFLGSS 207
Query: 269 SSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGN 328
+ + ++T + PF+ + + YY + L I G + +
Sbjct: 208 AKLAGGEST--STAPFIKTSPDDDSHH---YYLLSLDAIRAGNTTIATAQ---------S 253
Query: 329 GGTIV-DSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA----LTGLRPCFDVP 383
GG +V + + F+ + + + T A+G A T +P FD+
Sbjct: 254 GGILVMHTVSPFSLLVDSAYRAF----------KKAVTEAVGGAAEQPMATPPQP-FDLC 302
Query: 384 GEKTGSF-----PELKLHFKGGAEVTLPVENYFAVVG-EGSAVCLTVVT----DREASGG 433
+K F P+L F+G A +T+P Y VG E C +++ +R G
Sbjct: 303 FKKAAGFSRATAPDLVFTFQGAAALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEG 362
Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
S +LG+ Q ++ + YDL+ + L F+ C
Sbjct: 363 VS-VLGSLQQEDVHFLYDLKKETLSFEPADC 392
>gi|356559244|ref|XP_003547910.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 515
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 115/416 (27%), Positives = 167/416 (40%), Gaps = 88/416 (21%)
Query: 77 TTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFI- 135
+T ISS + Y+ ++ GTP LDTGS L W PC C C+++ +F
Sbjct: 85 STFRISSLGFLHYT-TVQIGTPGVKFMVALDTGSDLFWVPC----DCTRCAATDSSAFAS 139
Query: 136 --------PKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLY 187
P SS+S+ + C N C H S QC L T NC Y+V Y
Sbjct: 140 DFDLNVYNPNGSSTSKKVTCNNSLCM---HRS-QC-------LGTLSNC-----PYMVSY 183
Query: 188 GSGL--TEGIALSETLNLPNR------IIPNFLVGC------SVLSSRQPAGIAGFGRGK 233
S T GI + + L+L + N + GC S L P G+ G G K
Sbjct: 184 VSAETSTSGILVEDVLHLTQEDNHHDLVEANVIFGCGQIQSGSFLDVAAPNGLFGLGMEK 243
Query: 234 TSLPSQLNLDKFSYCLLSHKF--DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVA 291
S+PS L+ + F+ S F D R S D GS D+ TPF NPS
Sbjct: 244 ISVPSMLSREGFTADSFSMCFGRDGIGRIS--FGDKGSFDQDE-------TPFNLNPSHP 294
Query: 292 ERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLA 351
Y + + ++ VG + V L DSGT+FT++ + L
Sbjct: 295 T-------YNITVTQVRVGTTLIDVEFTAL-----------FDSGTSFTYLVDPTYTRLT 336
Query: 352 DEFVSQMVKNRNYTRALGAEALTGLRPCFDV-PGEKTGSFPELKLHFKGGAEVTLPVENY 410
+ F SQ+ R+ +++ C+D+ P T P + L GG+ V +
Sbjct: 337 ESFHSQVQDRRHR-----SDSRIPFEYCYDMSPDANTSLIPSVSLTMGGGSH--FAVYDP 389
Query: 411 FAVVGEGSAV--CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
++ S + CL VV E + I+G M Y V +D LG+K+ C
Sbjct: 390 IIIISTQSELVYCLAVVKTAELN-----IIGQNFMTGYRVVFDREKLVLGWKKFDC 440
>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 632
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 107/399 (26%), Positives = 169/399 (42%), Gaps = 74/399 (18%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y+ L GTPPQ+ I+D+GS + + PC++ C+ C + P F P++SS+ + +
Sbjct: 91 GYYTTRLWIGTPPQMFALIVDSGSTVTYVPCSD---CEQCGKHQDPKFQPEMSSTYQPVK 147
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
C N C+ + QC + A + + L+ +G +E+ P R
Sbjct: 148 C-NMDCN-CDDDREQC--VYEREYAEHSSSKGVLGEDLISFG---------NESQLTPQR 194
Query: 207 IIPNFLVGCSV-----LSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKF-----DD 256
+ GC L S++ GI G G+G SL QL +DK L+S+ F
Sbjct: 195 AV----FGCETVETGDLYSQRADGIIGLGQGDLSLVDQL-VDK---GLISNSFGLCYGGM 246
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
S+IL SD V S +R S YY + L I V G+++ +
Sbjct: 247 DVGGGSMILGGFDYPSD----------MVFTDSDPDR---SPYYNIDLTGIRVAGKQLSL 293
Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
+ DG G ++DSGTT+ ++ P+ +E V + V + + G
Sbjct: 294 HSRVF----DGEHGAVLDSGTTYAYL-PDAAFAAFEEAVMREVST--------LKQIDGP 340
Query: 377 RP-----CFDVPG-----EKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVV 425
P CF V E + FP +++ FK G L ENY F A CL V
Sbjct: 341 DPNFKDTCFQVAASNYVSELSKIFPSVEMVFKSGQSWLLSPENYMFRHSKVHGAYCLGVF 400
Query: 426 TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ + + +LG ++N V YD N ++GF + C
Sbjct: 401 PNGKDH---TTLLGGIVVRNTLVVYDRENSKVGFWRTNC 436
>gi|340810931|gb|AEK75392.1| S5 [Oryza sativa]
gi|340810983|gb|AEK75418.1| S5 [Oryza nivara]
gi|340810985|gb|AEK75419.1| S5 [Oryza nivara]
gi|340810997|gb|AEK75425.1| S5 [Oryza nivara]
gi|340811011|gb|AEK75432.1| S5 [Oryza nivara]
gi|340811013|gb|AEK75433.1| S5 [Oryza nivara]
gi|340811041|gb|AEK75447.1| S5 [Oryza nivara]
gi|340811043|gb|AEK75448.1| S5 [Oryza nivara]
Length = 474
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 111/435 (25%), Positives = 182/435 (41%), Gaps = 79/435 (18%)
Query: 62 IKNPQTKTTTTTTTTTTTNISSHSYGGYS--ISLSFGTPPQIIPFILDTGSHLVWF---P 116
+ N Q + T++++T I S + +++S G PP + +DTGS L W P
Sbjct: 87 LNNLQEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQP 146
Query: 117 CTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE-SIQCRDCNDEPLATSKN 175
C H C S+ P F P S +SR + C + KC + ++ +Q +C + +
Sbjct: 147 CAVH--CHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANC----MEKEDS 200
Query: 176 CTQICPSYLVLYGSGL--TEGIALSETLNLPNRIIPNFLVGCS--VLSSRQPAGIAGFGR 231
CT Y V YG+G + G +++TL + + + + + GCS V S AGI GFG
Sbjct: 201 CT-----YSVTYGNGWAYSVGKMVTDTLRIGDSFM-DLMFGCSMDVKYSEFEAGIFGFGS 254
Query: 232 GK-------TSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPF 284
P L+ FSYCL + D T+ +IL D+ YTP
Sbjct: 255 SSFSFFEQLAGYPDILSYKAFSYCLPT----DETKPGYMIL----GRYDRAAMDGGYTPL 306
Query: 285 ---VNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTF 341
+N P+ Y + + + GQR+ + IVDSG T
Sbjct: 307 FRSINRPT----------YSLTMEMLIANGQRLVT----------SSSEMIVDSGAQRTS 346
Query: 342 MAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTG------------S 389
+ P F L D+ ++Q + + Y R A + + C+ + +G +
Sbjct: 347 LWPSTFA-LLDKTITQAMSSIGYHRTSRARQESYI--CYLSEHDYSGWNGTITPFSNWSA 403
Query: 390 FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVE 449
P L++ F GGA + L N F +C+T + S ILGN +++
Sbjct: 404 LPPLEIGFAGGAALALSPRNVF-YNDPHRGLCMTFAQNPALR---SQILGNRVTRSFGTT 459
Query: 450 YDLRNQRLGFKQQLC 464
+D++ ++ GFK C
Sbjct: 460 FDIQGKQFGFKYAAC 474
>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 494
Score = 85.1 bits (209), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 101/375 (26%), Positives = 151/375 (40%), Gaps = 67/375 (17%)
Query: 105 ILDTGSHLVWFPCTNHYQCKY--CSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE-SIQ 161
++DT S + W C C C K P + P SS+ + C +P C +
Sbjct: 172 VVDTSSDIPWVQC---LPCPIPQCHLQKDPLYDPAKSSTFAPIPCGSPACKELGSSYGNG 228
Query: 162 CRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNL-PNRIIPNFLVGCSVLS 219
C DE C Y+V YG G T G +++TL + P ++ +F GCS
Sbjct: 229 CSPTTDE-----------C-KYIVNYGDGKATTGTYVTDTLTMSPTIVVKDFRFGCSHAV 276
Query: 220 ----SRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHS 272
S Q AGI G G+ SL Q + FSYC+ ++ + G +
Sbjct: 277 RGSFSNQNAGILALGGGRGSLLEQTADAYGNAFSYCI------PKPSSAGFLSLGGPVEA 330
Query: 273 DKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTI 332
K +YTP + N +Y V L I V G+++ V G +
Sbjct: 331 SLK---FSYTPLIKNKHA------PTFYIVHLEAIIVAGKQLAVPPTAFAT------GAV 375
Query: 333 VDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG--AEALTGLRPCFDVPGEKTGSF 390
+DSG T + P+++ L F S M A G A + L C+D
Sbjct: 376 MDSGAVVTQLPPQVYAALRAAFRSAMA-------AYGPLAAPVRNLDTCYDFTRFPDVKV 428
Query: 391 PELKLHFKGGAEVTLPVENYFAVVGEGSAVCLT-VVTDREASGGPSIILGNFQMQNYYVE 449
P++ L F GGA + L +++ +G CL T E S G +GN Q Q Y V
Sbjct: 429 PKVSLVFAGGATLDL---EPASIILDG---CLAFAATPGEESVG---FIGNVQQQTYEVL 479
Query: 450 YDLRNQRLGFKQQLC 464
YD+ ++GF++ C
Sbjct: 480 YDVGGGKVGFRRGAC 494
>gi|196212948|gb|ACG76110.1| S5 [Oryza sativa Japonica Group]
gi|340810887|gb|AEK75370.1| S5 [Oryza sativa]
gi|340810903|gb|AEK75378.1| S5 [Oryza sativa]
gi|340810921|gb|AEK75387.1| S5 [Oryza sativa]
gi|340810955|gb|AEK75404.1| S5 [Oryza sativa]
gi|340811079|gb|AEK75466.1| S5 [Oryza nivara]
gi|340811090|gb|AEK75471.1| S5 [Oryza rufipogon]
gi|340811116|gb|AEK75484.1| S5 [Oryza nivara]
Length = 357
Score = 85.1 bits (209), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 106/404 (26%), Positives = 171/404 (42%), Gaps = 77/404 (19%)
Query: 91 ISLSFGTPPQIIPFILDTGSHLVWF---PCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGC 147
+++S G PP + +DTGS L W PC H C S+ P F P S +SR + C
Sbjct: 1 MAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVH--CHTQSAKAGPIFDPGRSYTSRRVRC 58
Query: 148 QNPKCSWIHHE-SIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL--TEGIALSETLNLP 204
+ KC + ++ +Q +C ++ +CT Y V YG+G + G +++TL +
Sbjct: 59 SSVKCGELRYDLRLQQANCMEK----EDSCT-----YSVTYGNGWAYSVGKMVTDTLRIG 109
Query: 205 NRIIPNFLVGCS--VLSSRQPAGIAGFGRGK-------TSLPSQLNLDKFSYCLLSHKFD 255
+ + + + GCS V S AGI GFG P L+ FSYCL +
Sbjct: 110 DSFM-DLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCLPT---- 164
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPF---VNNPSVAERNAFSVYYYVGLRRITVGGQ 312
D T+ +IL D+ YTP +N P+ Y + + + GQ
Sbjct: 165 DETKPGYMIL----GRYDRAAMDGGYTPLFRSINRPT----------YSLTMEMLIANGQ 210
Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
R+ + IVDSG T + P F L D+ ++Q + + Y R A
Sbjct: 211 RLVT----------SSSEMIVDSGAQRTSLWPSTFA-LLDKTITQAMSSIGYHRTSRARQ 259
Query: 373 LTGLRPCFDVPGEKTG------------SFPELKLHFKGGAEVTLPVENYFAVVGEGSAV 420
+ + C+ + +G + P L++ F GGA + LP N F +
Sbjct: 260 ESYI--CYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVF-YNDPHRGL 316
Query: 421 CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
C+T + S ILGN +++ +D++ ++ GFK C
Sbjct: 317 CMTFAQNPALR---SQILGNRVTRSFGTTFDIQGKQFGFKYAAC 357
>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 507
Score = 85.1 bits (209), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 102/401 (25%), Positives = 164/401 (40%), Gaps = 64/401 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
G Y + G+PP +DTGS ++W C++ C + S I F S ++
Sbjct: 98 GLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGS 157
Query: 145 LGCQNPKCSWIHH-ESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLN 202
+ C +P CS + + QC + N C Y YG G T G +++T
Sbjct: 158 VTCSDPICSSVFQTTAAQCSENN--------QC-----GYSFRYGDGSGTSGYYMTDTFY 204
Query: 203 ----LPNRIIPN----FLVGCSVL-------SSRQPAGIAGFGRGKTSLPSQLNLDKFSY 247
L ++ N + GCS S + GI GFG+GK S+ SQL+ +
Sbjct: 205 FDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITP 264
Query: 248 CLLSHKFD-DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
+ SH D + +L + G+ Y+P V PS +Y + L
Sbjct: 265 PVFSHCLKGDGSGGGVFVL------GEILVPGMVYSPLV--PS-------QPHYNLNLLS 309
Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
I V GQ + + + GTIVD+GTT T++ E + D F++ + N
Sbjct: 310 IGVNGQMLPL--DAAVFEASNTRGTIVDTGTTLTYLVKEAY----DLFLNAI---SNSVS 360
Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENY---FAVVGEGSAVCLT 423
L ++ C+ V + FP + L+F GGA + L ++Y + + S C+
Sbjct: 361 QLVTPIISNGEQCYLVSTSISDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIG 420
Query: 424 VVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
E ILG+ +++ YDL QR+G+ C
Sbjct: 421 FQKAPEE----QTILGDLVLKDKVFVYDLARQRIGWASYDC 457
>gi|302783204|ref|XP_002973375.1| hypothetical protein SELMODRAFT_413680 [Selaginella moellendorffii]
gi|300159128|gb|EFJ25749.1| hypothetical protein SELMODRAFT_413680 [Selaginella moellendorffii]
Length = 407
Score = 85.1 bits (209), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 72/263 (27%), Positives = 112/263 (42%), Gaps = 29/263 (11%)
Query: 213 VGCSVLSSR-----QPAGIAGFGRGKTSLPSQL-NLD---KFSYCLLSHKFDDTTRTSSL 263
+GC S+R +G+ GF + S QL +D KF YC S F + +
Sbjct: 127 LGCGRQSTRLLGILSTSGLVGFAKTNKSFIGQLAEMDYTGKFIYCAPSDTF-----SGKI 181
Query: 264 ILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTL 323
+ N + + L+YTP + NP + YY+GLR I++ + L
Sbjct: 182 VFGN---YKISSNSSLSYTPMIVNP------ISTALYYIGLRSISINDMLTFLVQGILA- 231
Query: 324 DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP 383
DG GGTI+DS F++ P+ + PL + + + N + AL G C++V
Sbjct: 232 --DGTGGTIIDSTFAFSYFTPDSYTPLV-QAIQNLNSNLTKVSSNKTAALLGNDICYNVS 288
Query: 384 GEKTGSFPE-LKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
P+ L HF+ G +V E + VCL V D + G ++G +Q
Sbjct: 289 VNGDTPPPQTLTYHFENGTQVEFRTWFLLDDDAENATVCL-AVGDSQKVGFSLNVIGTYQ 347
Query: 443 MQNYYVEYDLRNQRLGFKQQLCK 465
+ VE+DL Q +GF C
Sbjct: 348 QLDVAVEFDLEKQEIGFGTAGCN 370
>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 469
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 104/394 (26%), Positives = 158/394 (40%), Gaps = 58/394 (14%)
Query: 82 SSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKY-CSSSKIPSFIPKLSS 140
+S + G Y L GTP ++DTGS L W C+ C C P F P+ S
Sbjct: 124 ASVAVGNYVTRLGLGTPATSYVMVVDTGSSLTWLQCS---PCSVSCHRQAGPVFDPRASG 180
Query: 141 SSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSE 199
+ + C + +C + ++ C + + +C Y YG S + G +
Sbjct: 181 TYAAVQCSSSECGELQAATLNPSAC---------SVSNVC-IYQASYGDSSYSVGYLSKD 230
Query: 200 TLNLPNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHK 253
T++ + P F GC + + AG+ G + K SL QL FSYCL
Sbjct: 231 TVSFGSGSFPGFYYGCGQDNEGLFGRSAGLIGLAKNKLSLLYQLAPSLGYAFSYCL---- 286
Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
T+ ++ L GS + + +YTP +A + + Y+V L I+V G
Sbjct: 287 --PTSSAAAGYLSIGSYNPGQ----YSYTP------MASSSLDASLYFVTLSGISVAGAP 334
Query: 314 VRV-WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
+ V +Y +L TI+DSGT T + P ++ L+ + M
Sbjct: 335 LAVPPSEYRSLP------TIIDSGTVITRLPPNVYTALSRAVAAAMASAAPRAPTYSI-- 386
Query: 373 LTGLRPCFDVPGEKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREAS 431
L CF G G P + + F GGA + L N V + S CL A
Sbjct: 387 ---LDTCFR--GSAAGLRVPRVDMAFAGGATLALSPGNVLIDV-DDSTTCLAF-----AP 435
Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
G + I+GN Q Q + V YD+ R+GF C
Sbjct: 436 TGGTAIIGNTQQQTFSVVYDVAQSRIGFAAGGCS 469
>gi|449451076|ref|XP_004143288.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 488
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 96/401 (23%), Positives = 176/401 (43%), Gaps = 65/401 (16%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSS 141
G Y + G P + +DTGS ++W C+ C C S ++ F SSS
Sbjct: 82 GLYFTKVKLGNPAREFNVQIDTGSDILWVTCS---PCDGCPDSSGLGIELNLFDTTKSSS 138
Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETL 201
+R+L C +P C+ + + QC L + +C+ S+ SG T G +++++
Sbjct: 139 ARVLPCTDPICAAVSTTTDQC-------LTQTDHCSY---SFHYRDRSG-TSGFYVTDSM 187
Query: 202 N----LPNRIIPN----FLVGCSVL-------SSRQPAGIAGFGRGKTSLPSQLNLDKFS 246
+ L I N + GCS+ +++ GI GFG+G+ S+ SQL+ +
Sbjct: 188 HFDILLGESTIANSSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGIT 247
Query: 247 YCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
+ SH +++ + + Y+P + PS +Y + L+
Sbjct: 248 PKVFSHCLKGGENGGGILV-----LGEILEPSIVYSPLI--PS-------QPHYTLKLQS 293
Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
I + GQ ++ G TI+DSGTT ++ E+++ + S + ++ T
Sbjct: 294 IALSGQ---LFPNPTMFPISNAGETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTI 350
Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYF---AVVGEGSAVCLT 423
+ G++ CF V FP L+ +F+G A + + E Y ++V E + C+
Sbjct: 351 SRGSQ-------CFRVSMSVADIFPVLRFNFEGIASMVVTPEEYLQFDSIVREPALWCIG 403
Query: 424 VVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
++A G + ILG+ +++ + YDL QR+G+ C
Sbjct: 404 F---QKAEDGLN-ILGDLVLKDKIIVYDLARQRIGWANYDC 440
>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
gi|224030089|gb|ACN34120.1| unknown [Zea mays]
gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
Length = 491
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 103/404 (25%), Positives = 158/404 (39%), Gaps = 70/404 (17%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS--KIPSFIPKLSSSSRL 144
G Y + GTPP+ +DTGS ++W C QC + S + + PK SS+ +
Sbjct: 84 GLYYTEIKLGTPPKHYYVQVDTGSDILWVNCITCEQCPHKSGLGLDLTLYDPKASSTGSM 143
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICP-SYLVLYGSG-------LTEGIA 196
+ C C+ + C P Y V YG G +T+ +
Sbjct: 144 VMCDQAFCAATFGGKL-------------PKCGANVPCEYSVTYGDGSSTIGSFVTDALQ 190
Query: 197 LSETLNLPNRIIPN--FLVGCSV-----LSSRQPA--GIAGFGRGKTSLPSQLNLDK--- 244
+ N + GC L S A GI GFG TS+ SQL
Sbjct: 191 FDQVTRDGQTQPANASVIFGCGAQQGGDLGSSNQALDGILGFGEANTSMLSQLTTAGKVK 250
Query: 245 --FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYV 302
F++CL DT + + KTT P VA++ +Y V
Sbjct: 251 KIFAHCL------DTIKGGGIFSIGDVVQPKVKTT----------PLVADKP----HYNV 290
Query: 303 GLRRITVGGQRVRV-WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN 361
L+ I VGG +++ H + ++ G TI+DSGTT T++ PEL F M+
Sbjct: 291 NLKTIDVGGTTLQLPAHIFEPGEKKG---TIIDSGTTLTYL-PELV------FKEVMLAV 340
Query: 362 RNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTL-PVENYFAVVGEGSAV 420
N + + + G CF PG FP + HF+ + + P E +FA + V
Sbjct: 341 FNKHQDITFHDVQGFL-CFQYPGSVDDGFPTITFHFEDDLALHVYPHEYFFANGNDVYCV 399
Query: 421 CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ G +++G+ + N V YDL N+ +G+ C
Sbjct: 400 GFQNGASQSKDGKDIVLMGDLVLSNKLVIYDLENRVIGWTDYNC 443
>gi|340810907|gb|AEK75380.1| S5 [Oryza sativa]
Length = 472
Score = 84.7 bits (208), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 111/435 (25%), Positives = 182/435 (41%), Gaps = 79/435 (18%)
Query: 62 IKNPQTKTTTTTTTTTTTNISSHSYGGYS--ISLSFGTPPQIIPFILDTGSHLVWF---P 116
+ N Q + T++++T I S + +++S G PP + +DTGS L W P
Sbjct: 85 LNNLQEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQP 144
Query: 117 CTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE-SIQCRDCNDEPLATSKN 175
C H C S+ P F P S +SR + C + KC + ++ +Q +C + +
Sbjct: 145 CAVH--CHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANC----MEKEDS 198
Query: 176 CTQICPSYLVLYGSGL--TEGIALSETLNLPNRIIPNFLVGCS--VLSSRQPAGIAGFGR 231
CT Y V YG+G + G +++TL + + + + + GCS V S AGI GFG
Sbjct: 199 CT-----YSVTYGNGWAYSVGKMVTDTLRIGDSFM-DLMFGCSMDVKYSEFEAGIFGFGS 252
Query: 232 GK-------TSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPF 284
P L+ FSYCL + D T+ +IL D+ YTP
Sbjct: 253 SSFSFFEQLAGYPDILSYKAFSYCLPT----DETKPGYMIL----GRYDRAAMDGGYTPL 304
Query: 285 ---VNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTF 341
+N P+ Y + + + GQR+ + IVDSG T
Sbjct: 305 FRSINRPT----------YSLTMEMLIANGQRLVT----------SSSEMIVDSGAQRTS 344
Query: 342 MAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTG------------S 389
+ P F L D+ ++Q + + Y R A + + C+ + +G +
Sbjct: 345 LWPSTF-ALLDKTITQAMSSIGYHRTSRARQESYI--CYLSEHDYSGWNGTITPFSNWSA 401
Query: 390 FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVE 449
P L++ F GGA + L N F +C+T + S ILGN +++
Sbjct: 402 LPLLEIGFAGGAALALSPRNVF-YNDPHRGLCMTFAQNPALR---SQILGNRVTRSFGTT 457
Query: 450 YDLRNQRLGFKQQLC 464
+D++ ++ GFK C
Sbjct: 458 FDIQGKQFGFKYAAC 472
>gi|125554529|gb|EAZ00135.1| hypothetical protein OsI_22138 [Oryza sativa Indica Group]
Length = 472
Score = 84.7 bits (208), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 111/435 (25%), Positives = 182/435 (41%), Gaps = 79/435 (18%)
Query: 62 IKNPQTKTTTTTTTTTTTNISSHSYGGYS--ISLSFGTPPQIIPFILDTGSHLVWF---P 116
+ N Q + T++++T I S + +++S G PP + +DTGS L W P
Sbjct: 85 LNNLQEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQP 144
Query: 117 CTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHE-SIQCRDCNDEPLATSKN 175
C H C S+ P F P S +SR + C + KC + ++ +Q +C + +
Sbjct: 145 CAVH--CHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANC----MEKEDS 198
Query: 176 CTQICPSYLVLYGSGL--TEGIALSETLNLPNRIIPNFLVGCS--VLSSRQPAGIAGFGR 231
CT Y V YG+G + G +++TL + + + + + GCS V S AGI GFG
Sbjct: 199 CT-----YSVTYGNGWAYSVGKMVTDTLRIGDSFM-DLMFGCSMDVKYSEFEAGIFGFGS 252
Query: 232 GK-------TSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPF 284
P L+ FSYCL + D T+ +IL D+ YT
Sbjct: 253 SSFSFFEQLAGYPDILSYKAFSYCLPT----DETKPGYMIL----GRYDRAAMDGGYTSL 304
Query: 285 ---VNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTF 341
+N P+ Y + + + GQR+ + IVDSG T
Sbjct: 305 FRSINRPT----------YSLTMEMLIANGQRLVT----------SSSEMIVDSGAQRTS 344
Query: 342 MAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTG------------S 389
+ P F L D+ ++Q + + Y R A + + C+ + +G +
Sbjct: 345 LWPSTF-ALLDKTITQAMSSIGYHRTSRARQESYI--CYLSEHDYSGWNGTITPFSNWSA 401
Query: 390 FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVE 449
P L++ F GGA + LP N F +C+T + S ILGN +++
Sbjct: 402 LPLLEIGFAGGAALALPPRNVF-YNDPHRGLCMTFAQNPALR---SQILGNRVTRSFGTT 457
Query: 450 YDLRNQRLGFKQQLC 464
+D++ ++ GFK C
Sbjct: 458 FDIQGKQFGFKYAAC 472
>gi|297740344|emb|CBI30526.3| unnamed protein product [Vitis vinifera]
Length = 379
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 79/278 (28%), Positives = 122/278 (43%), Gaps = 34/278 (12%)
Query: 204 PNRIIPNFLVGCSVLS----SRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTR 259
PNR V CS L+ + G+ G RG S SQ++ KFSYC+ F
Sbjct: 108 PNRSSSYSPVPCSSLTCTDQDSKNTGLMGMNRGSLSFVSQMDFPKFSYCISDSDF----- 162
Query: 260 TSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHK 319
S ++L ++ S L YTP + S V Y V L I V + + +
Sbjct: 163 -SGVLLLGDANFS--WLMPLNYTPLIQI-STPLPYFDRVAYTVQLEGIKVSSKLLPLPKS 218
Query: 320 YLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQ------MVKNRNYTRALGAEAL 373
D G G T+VDSGT FTF+ ++ L +EF++Q ++++ NY G +
Sbjct: 219 VFVPDHTGAGQTMVDSGTQFTFLLGPVYSALRNEFLNQTSQILRVLEDPNYVFQGGMDL- 277
Query: 374 TGLRPCFDVPGEKTG--SFPELKLHFKGGAEVTLPVENYF-----AVVGEGSAVCLTVVT 426
C+ VP +T P + L F+ GAE+ + + V G S C T
Sbjct: 278 -----CYRVPLSQTSLPWLPTVSLMFR-GAEMKVSGDRLLYRVPGEVRGSDSVYCFT-FG 330
Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ + + ++G+ QN ++E+DL R+GF Q C
Sbjct: 331 NSDLLAVEAYVIGHHHQQNVWMEFDLEKSRIGFAQVQC 368
Score = 39.3 bits (90), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 35/72 (48%), Gaps = 11/72 (15%)
Query: 84 HSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPC--TNHYQCKYCSSSKIPSFIPKLSSS 141
H ++SL+ GTPPQ + +LDTGS L W C T +Q +F P SSS
Sbjct: 63 HHNVSLTVSLTVGTPPQNVSMVLDTGSELSWLRCNKTQTFQT---------TFDPNRSSS 113
Query: 142 SRLLGCQNPKCS 153
+ C + C+
Sbjct: 114 YSPVPCSSLTCT 125
>gi|358346726|ref|XP_003637416.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
gi|355503351|gb|AES84554.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
Length = 165
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 50/166 (30%), Positives = 74/166 (44%), Gaps = 10/166 (6%)
Query: 299 YYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM 358
YYYVGL I+VGG+ + + +D GNGG IVDSGT T + +++ + D FV
Sbjct: 10 YYYVGLVGISVGGELLAIPETSFEVDSAGNGGIIVDSGTAVTRLQSDVYNVVRDAFV--- 66
Query: 359 VKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGS 418
+ L ++ C+D+ + + P + HF G + LP +NY V
Sbjct: 67 ---KGTKDLLATNEVSLFDTCYDLSSKTSVEVPTVAFHFGEGKVLVLPAKNYLVPVDSVG 123
Query: 419 AVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
C + I+GN Q Q V +DL N +GF C
Sbjct: 124 TFCFAFAPTMSSLS----IIGNIQQQGTRVSFDLANSLVGFSPNRC 165
>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 440
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 120/471 (25%), Positives = 193/471 (40%), Gaps = 74/471 (15%)
Query: 15 FFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTT 74
F ++ I + + + FS+ H + + N + + L R + + + + +
Sbjct: 19 FVSVAHISAAEVKNGRFSIDLIHRDSPKSPLYNPSETPAERLDR--FFRRFMSFSEASIS 76
Query: 75 TTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSF 134
T S + G Y + +S GTPP + I DTGS L+W C C C K P F
Sbjct: 77 PNTPEPPVSSNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQC---LPCLSCYKQKNPMF 133
Query: 135 IPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQ---ICPSYLVLYGSG- 190
P S+S + + C ES QCR L + +C+Q +C + YG G
Sbjct: 134 DPSKSTSFKEVSC----------ESQQCR------LLDTVSCSQPQKLC-DFSYGYGDGS 176
Query: 191 LTEGIALSETLNL------PNRIIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQL 240
L +G+ +ETL L P I+ N + GC +S G+ G G SL SQ+
Sbjct: 177 LAQGVIATETLTLNSNSGQPTSIL-NIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQI 235
Query: 241 -----NLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFV--NNPSVAER 293
+ KFS CL+ + D + TS +I ++ + + TP V ++P+
Sbjct: 236 MSTLGSGRKFSQCLVPFRTDPSI-TSKIIF---GPEAEVSGSDVVSTPLVTKDDPT---- 287
Query: 294 NAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADE 353
YY+V L I+V G ++ + + GN +D+GT T L +
Sbjct: 288 -----YYFVTLDGISV-GDKLFPFSSSSPMATKGN--VFIDAGTPPTL--------LPRD 331
Query: 354 FVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAV 413
F +++V+ + L+P P L HF GA+V L N F
Sbjct: 332 FYNRLVQGVKEAIPMEPVQDPDLQPQLCYRSATLIDGPILTAHFD-GADVQLKPLNTFIS 390
Query: 414 VGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
EG C + + G + I GNF N+ + +DL +++ FK C
Sbjct: 391 PKEG-VYCFAM----QPIDGDTGIFGNFVQMNFLIGFDLDGKKVSFKAVDC 436
>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 492
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 107/397 (26%), Positives = 160/397 (40%), Gaps = 55/397 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
G Y + GTPP +DTGS ++W C + C S I F SSSS L
Sbjct: 77 GLYFTKVKLGTPPMEFTVQIDTGSDILWVNCNSCNGCPRSSGLGIQLNFFDASSSSSSSL 136
Query: 145 LGCQNPKC-SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLN 202
+ C +P C S + QC L S C SY YG G T G +SE++
Sbjct: 137 VSCSDPICNSAFQTTATQC-------LTQSNQC-----SYTFQYGDGSGTSGYYVSESMY 184
Query: 203 ----LPNRIIPN----FLVGCSVL-------SSRQPAGIAGFGRGKTSLPSQLNLDKFSY 247
+ +I N + GCS S GI GFG G S+ SQL+ +
Sbjct: 185 FDMVMGQSMIANSSASVVFGCSTYQSGDLTKSDHAIDGIFGFGPGDLSVISQLSARGITP 244
Query: 248 CLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRI 307
+ SH +++ + G+ Y+P V PS N + L+ I
Sbjct: 245 KVFSHCLKGEGNGGGILV-----LGEVLEPGIVYSPLV--PSQPHYNLY-------LQSI 290
Query: 308 TVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRA 367
+V GQ + + N GTI+DSGTT ++ E + P + + ++ T +
Sbjct: 291 SVNGQTLPIDPSVFATSI--NRGTIIDSGTTLAYLVEEAYTPFVSAITAAVSQSVTPTIS 348
Query: 368 LGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTD 427
G + C+ V FP + L+F G A + L E Y +G L +
Sbjct: 349 KGNQ-------CYLVSTSVGEIFPLVSLNFAGSASMVLKPEEYLMHLGFYDGAALWCIGF 401
Query: 428 REASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
++ G + ILG+ M++ YDL QR+G+ C
Sbjct: 402 QKVQEGVT-ILGDLVMKDKIFVYDLARQRIGWASYDC 437
>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 512
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 101/399 (25%), Positives = 163/399 (40%), Gaps = 64/399 (16%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRLLG 146
Y + G+PP +DTGS ++W C++ C + S I F S ++ +
Sbjct: 105 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVT 164
Query: 147 CQNPKCSWIHH-ESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLN-- 202
C +P CS + + QC + N C Y YG G T G +++T
Sbjct: 165 CSDPICSSVFQTTAAQCSENN--------QC-----GYSFRYGDGSGTSGYYMTDTFYFD 211
Query: 203 --LPNRIIPN----FLVGCSVL-------SSRQPAGIAGFGRGKTSLPSQLNLDKFSYCL 249
L ++ N + GCS S + GI GFG+GK S+ SQL+ + +
Sbjct: 212 AILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPV 271
Query: 250 LSHKFD-DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
SH D + +L + G+ Y+P V PS +Y + L I
Sbjct: 272 FSHCLKGDGSGGGVFVL------GEILVPGMVYSPLV--PS-------QPHYNLNLLSIG 316
Query: 309 VGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRAL 368
V GQ + + + GTIVD+GTT T++ E + D F++ + N L
Sbjct: 317 VNGQMLPL--DAAVFEASNTRGTIVDTGTTLTYLVKEAY----DLFLNAI---SNSVSQL 367
Query: 369 GAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENY---FAVVGEGSAVCLTVV 425
++ C+ V + FP + L+F GGA + L ++Y + + S C+
Sbjct: 368 VTPIISNGEQCYLVSTSISDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQ 427
Query: 426 TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
E ILG+ +++ YDL QR+G+ C
Sbjct: 428 KAPEE----QTILGDLVLKDKVFVYDLARQRIGWASYDC 462
>gi|77555282|gb|ABA98078.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 409
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 79/273 (28%), Positives = 117/273 (42%), Gaps = 22/273 (8%)
Query: 192 TEGIALSETLNLPNRIIPNFLVGCSVLSSRQPAG---IAGFGRGKTSLPSQLNLDKFSYC 248
T G ++T +P + GCS S AG + G GRG SL SQL KFSY
Sbjct: 129 TSGYLATDTFTFGATAVPGVVFGCSDASYGDFAGASGVIGIGRGNLSLISQLQFGKFSYQ 188
Query: 249 LLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRIT 308
LL+ + D S+I G K G + TP +++ + +YYV L +
Sbjct: 189 LLAPEATDDGSADSVI-RFGDDAVPKTKRGRS-TPLLSS------TLYPDFYYVNLTGVR 240
Query: 309 VGGQRV-RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRA 367
V G R+ + L +G GG I+ S T T++ E A + V V +R A
Sbjct: 241 VDGNRLDAIPAGTFDLRANGTGGVILSSTTPVTYL-----EQAAYDVVRAAVASRIGLPA 295
Query: 368 LGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTD 427
+ A L C++ P+L L F GGA++ L NYF + + CLT++
Sbjct: 296 VNGSAALELDLCYNASSMAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPS 355
Query: 428 REASGGPSIILGNFQMQNYYVEYDLRNQRLGFK 460
+ S +LG + YD+ RL F+
Sbjct: 356 QGGS-----VLGTLLQTGTNMIYDVDAGRLTFE 383
>gi|125575541|gb|EAZ16825.1| hypothetical protein OsJ_32297 [Oryza sativa Japonica Group]
Length = 416
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 106/385 (27%), Positives = 150/385 (38%), Gaps = 73/385 (18%)
Query: 94 SFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCS 153
+ GTPPQ I+D PC+ P SS+ R C C
Sbjct: 72 TIGTPPQPASAIIDVAGPA---PCS----------------FPNASSTFRPEPCGTDACK 112
Query: 154 WIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLV 213
I + C E SK G T GI ++T + +
Sbjct: 113 SIPTSNCSSNMCTYEGTINSKL-------------GGHTLGIVATDTFAI-GTATASLGF 158
Query: 214 GCSVLSSRQ----PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGS 269
GC V S P+G+ G GR +SL SQ+N+ KFSYCL H D+ + S L+L GS
Sbjct: 159 GCVVASGIDTMGGPSGLIGLGRAPSSLVSQMNITKFSYCLTPH---DSGKNSRLLL--GS 213
Query: 270 SHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNG 329
S T TPFV + + S YY + L I G + L GN
Sbjct: 214 SAKLAGGGNSTTTPFVK---TSPGDDMSQYYPIQLDGIKAG-------DAAIALPPSGN- 262
Query: 330 GTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE-ALTGLRP---CFDVPGE 385
+V + +F+ ++ L E T+A+GA T L+P CF G
Sbjct: 263 TVLVQTLAPMSFLVDSAYQALKKEV----------TKAVGAAPTATPLQPFDLCFPKAGL 312
Query: 386 KTGSFPELKLHF-KGGAEVTLPVENYFAVVGEGSA-VCLTVVT----DREASGGPSIILG 439
S P+L F +G A +T+P Y VGE VC+ +++ + A ILG
Sbjct: 313 SNASAPDLVFTFQQGAAALTVPPPKYLIDVGEEKGTVCMAILSTSWLNTTALDENLNILG 372
Query: 440 NFQMQNYYVEYDLRNQRLGFKQQLC 464
+ Q +N + DL + L F+ C
Sbjct: 373 SLQQENTHFLLDLEKKTLSFEPADC 397
>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 447
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 103/391 (26%), Positives = 163/391 (41%), Gaps = 68/391 (17%)
Query: 92 SLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPK 151
++S G PP ++DTGS ++W CT C C + F P SS+ L C+ P
Sbjct: 104 NISIGQPPIPQLVVMDTGSDILWVMCT---PCTNCDNDLGLLFDPSKSSTFSPL-CKTP- 158
Query: 152 CSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLT------EGIALSETLNLPN 205
C + C +P+ + V Y T + ET +
Sbjct: 159 CDF--------EGCRCDPIP-----------FTVTYADNSTASGTFGRDTVVFETTDEGT 199
Query: 206 RIIPNFLVGC--SVLSSRQPA--GIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTS 261
I + L GC ++ P GI G G SL ++L KFSYC+ + D
Sbjct: 200 SRISDVLFGCGHNIGHDTDPGHNGILGLNNGPDSLVTKLG-QKFSYCI-GNLADPYYNYH 257
Query: 262 SLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYL 321
LIL G+ T PF ++ +YYV + I+VG +R+ + +
Sbjct: 258 QLILGEGADLEGYST------PF---------EVYNGFYYVTMEGISVGEKRLDIAPETF 302
Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP--- 378
+ + GG I+D+G+T TF+ + + L+ E RN +A P
Sbjct: 303 EMKENRAGGVIIDTGSTITFLVDSVHKLLSKEV-------RNLLGWSFRQATIEKSPWMQ 355
Query: 379 CF--DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTV--VTDREASGGP 434
CF + + G FP + HF GA++ L ++F + + + C+TV V+ P
Sbjct: 356 CFYGSISRDLVG-FPVVTFHFSDGADLALDSGSFFNQLND-NVFCMTVGPVSSLNIKSKP 413
Query: 435 SIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
S+I G Q+Y V YDL NQ + F++ C+
Sbjct: 414 SLI-GLLAQQSYNVGYDLVNQFVYFQRIDCE 443
>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 100/394 (25%), Positives = 157/394 (39%), Gaps = 62/394 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G + + + GTPP I ++DTGS L+W C C C P F P SS+ +
Sbjct: 66 GQHLMEIYIGTPPIKITGLVDTGSDLIWIQCA---PCLGCYKQIKPMFDPLKSSTYNNIS 122
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLPN 205
C +P C + C+ E K C +Y YG + LT+G+ +T +
Sbjct: 123 CDSPLC-----HKLDTGVCSPE-----KRC-----NYTYGYGDNSLTKGVLAQDTATFTS 167
Query: 206 RI-----IPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNL----DKFSYCLLSH 252
+ FL GC ++ G+ G G G TSL SQ+ KFS CL+
Sbjct: 168 NTGKPVSLSRFLFGCGHNNTGGFNDHEMGLIGLGGGPTSLISQIGPLFGGKKFSQCLVPF 227
Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
D +S + GS G+ TP V E++ Y+V L I+
Sbjct: 228 -LTDIKISSRMSFGKGSQVLGN---GVVTTPLVPR----EKDT---SYFVTLLGIS---- 272
Query: 313 RVRVWHKYLTLDRD-GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
V Y ++ G +VDSGT + +L++ + E V+N+ + + +
Sbjct: 273 ---VEDTYFPMNSTIGKANMLVDSGTPPILLPQQLYDKVFAE-----VRNKVALKPITDD 324
Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAV-CLTVVTDREA 430
G + C+ G P L HF G + P++ + + + CL + +
Sbjct: 325 PSLGTQLCYRTQTNLKG--PTLTFHFVGANVLLTPIQTFIPPTPQTKGIFCLAIYNRTNS 382
Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
G + GNF NY + +DL Q + FK C
Sbjct: 383 DPG---VYGNFAQSNYLIGFDLDRQVVSFKPTDC 413
>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 427
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 112/455 (24%), Positives = 180/455 (39%), Gaps = 65/455 (14%)
Query: 23 PSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNIS 82
P+ + FS H N + N+ + L + + T T+ N
Sbjct: 22 PTEAYNKGFSFKLIHKNSPNSPFYKSNNFHKNKLRSFYQVPKKSFVQKSPYTRVTSNN-- 79
Query: 83 SHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSS 142
G Y + L+ G+PP I ++DTGS LVW CT C C K P F P S +
Sbjct: 80 ----GDYLMKLTLGSPPVDIYGLVDTGSDLVWAQCT---PCGGCYRQKSPMFEPLRSKTY 132
Query: 143 RLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLN 202
+ C++ +CS+ + C+ + ++C S +T+G+ E +
Sbjct: 133 SPIPCESEQCSFFGYS------CSPQ---------KMCAYSYSYADSSVTKGVLAREAIT 177
Query: 203 LPNR-----IIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQLNL----DKFSYCL 249
+ ++ + + GC +S GI G G G SL SQ+ +FS CL
Sbjct: 178 FSSTDGDPVVVGDIIFGCGHSNSGTFNENDMGIIGMGGGPLSLVSQIGTLYGSKRFSQCL 237
Query: 250 LSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
+ F TS I N SD G+ TP +E S Y V L I+V
Sbjct: 238 V--PFHTDAHTSGTI--NFGEESDVSGEGVVTTPL-----ASEEGQTS--YLVTLEGISV 286
Query: 310 GGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
G VR ++ TL + G ++DSGT T++ E +E L +E +K ++ +
Sbjct: 287 GDTFVR-FNSSETLSK---GNIMIDSGTPATYIPQEFYERLVEE-----LKVQSSLLPIE 337
Query: 370 AEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDRE 429
+ G + C+ G P L HF+G LP++ + + + C + +
Sbjct: 338 DDPDLGTQLCYRSETNLEG--PILTAHFEGADVQLLPIQTF--IPPKDGVFCFAMAGSTD 393
Query: 430 ASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I GNF N + +DL + + FK C
Sbjct: 394 G----DYIFGNFAQSNILMGFDLDRKTISFKPTDC 424
>gi|359476204|ref|XP_002262813.2| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 460
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 112/412 (27%), Positives = 169/412 (41%), Gaps = 85/412 (20%)
Query: 77 TTTNISSHSY--------GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSS 128
T+ N+ +H++ G + + ++FGTP I ILDTGS + W QCK C
Sbjct: 108 TSGNLKNHAHNNNLFDEDGNFLVDVAFGTPXTEIXLILDTGSSITW------TQCKAC-- 159
Query: 129 SKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPS-----Y 183
+ L S+R D +++ + PS Y
Sbjct: 160 ------VNCLQDSNRYF---------------------DSSASSTYSFGSCIPSTVENNY 192
Query: 184 LVLYGSGLTE-GIALSETLNL-PNRIIPNFLVGCSVLSSRQPA----GIAGFGRGKTSLP 237
+ YG T G +T+ L P+ + F GC + G+ G G+G+ S
Sbjct: 193 NMTYGDDSTSVGNYGCDTMTLEPSDVFQKFQFGCGRNNKGDFGSGVDGMLGLGQGQLSTV 252
Query: 238 SQL--NLDK-FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERN 294
SQ +K FSYCL ++ + S L + +S S + L +T VN P +
Sbjct: 253 SQTASKFNKVFSYCLP----EEDSIGSLLFGEKATSQS----SSLKFTSLVNGPGTLQE- 303
Query: 295 AFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEF 354
S YY+V L I+VG +R+ + + GTI+DS T T + + L F
Sbjct: 304 --SGYYFVNLSDISVGNERLNIPSSVFA-----SPGTIIDSRTVITRLPQRAYSALKAAF 356
Query: 355 VSQMVKN--RNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFA 412
M K N R G L C+++ G K PE+ LHF GGA+V L N
Sbjct: 357 KKAMAKYPLSNGRRKKGDI----LDTCYNLSGRKDVLLPEIVLHFGGGADVRLNGTN-IV 411
Query: 413 VVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ S +CL E + I+GN Q + V YD++ +R+GF C
Sbjct: 412 WGSDASRLCLAFAGTSELT-----IIGNRQQLSLTVLYDIQGRRIGFGGNGC 458
>gi|224140237|ref|XP_002323490.1| predicted protein [Populus trichocarpa]
gi|222868120|gb|EEF05251.1| predicted protein [Populus trichocarpa]
Length = 478
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 108/403 (26%), Positives = 168/403 (41%), Gaps = 67/403 (16%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
G Y + G+PP+ +DTGS ++W C + C S I F SS++ L
Sbjct: 64 GLYFTKVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGL 123
Query: 145 LGCQNPKC-SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLN- 202
+ C +P C S + QC S Q ++ GSG T G +S+TL
Sbjct: 124 VHCSDPICTSAVQTTVTQC----------SPQTNQCSYTFQYEDGSG-TSGYYVSDTLYF 172
Query: 203 ---LPNRIIPN----FLVGCSVLSS-------RQPAGIAGFGRGKTSLPSQLNLDK---- 244
L ++ N + GCS S + GI GFG+G+ S+ SQL+
Sbjct: 173 DAILGESLVVNSSALIVFGCSTFQSGDLTMTDKAVDGIFGFGQGELSVISQLSTHGITPR 232
Query: 245 -FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVG 303
FS+CL IL+ G+ Y+P V PS +Y +
Sbjct: 233 VFSHCLKGEGIGGGILVLGEILE----------PGMVYSPLV--PS-------QPHYNLN 273
Query: 304 LRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRN 363
L+ I V G+ + + + GTIVDSGTT ++ E ++P FVS + N
Sbjct: 274 LQSIAVNGKLLPIDPSVFA--TSNSQGTIVDSGTTLAYLVAEAYDP----FVSAV--NVI 325
Query: 364 YTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVG--EGSAVC 421
+ ++ G C+ V + FP +F GGA + L E+Y G +G +V
Sbjct: 326 VSPSVTPIISKG-NQCYLVSTSVSQMFPLASFNFAGGASMVLKPEDYLIPFGPSQGGSV- 383
Query: 422 LTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ + ++ G ILG+ +++ YDL QR+G+ C
Sbjct: 384 MWCIGFQKVQG--VTILGDLVLKDKIFVYDLVRQRIGWANYDC 424
>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
Length = 641
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 112/444 (25%), Positives = 175/444 (39%), Gaps = 76/444 (17%)
Query: 44 SYQNLNSLVSSSLTRAL-HIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQII 102
+Y N L +SS R L NP + T G Y+ L GTP Q
Sbjct: 53 AYPNATRLPASSARRGLGDGHNPNARMRLHDDLLTN--------GYYTTRLYIGTPSQEF 104
Query: 103 PFILDTGSHLVWFPCTNHYQCKYCSS-------SKIPSFIPKLSSSSRLLGCQNPKCSWI 155
I+D+GS + + PC QC S + P F P LSS+ + C N C+
Sbjct: 105 ALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLSSTYSPVKC-NVDCT-C 162
Query: 156 HHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGC 215
+E QC + ++ S VL ++ G E+ P R + GC
Sbjct: 163 DNERSQC--------TYERQYAEMSSSSGVLGEDIMSFG---KESELKPQRAV----FGC 207
Query: 216 S-----VLSSRQPAGIAGFGRGKTSLPSQLNL-----DKFSYCLLSHKFDDTTRTSSLIL 265
L S+ GI G GRG+ S+ QL D FS C T ++L
Sbjct: 208 ENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGT----MVL 263
Query: 266 DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
+ D +++ V +P YY + L+ I V G+ +R+ K
Sbjct: 264 GGMPAPPDMV---FSHSNPVRSP----------YYNIELKEIHVAGKALRLDPKIF---- 306
Query: 326 DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGE 385
+ GT++DSGTT+ ++ + F D +++ N + + CF G
Sbjct: 307 NSKHGTVLDSGTTYAYLPEQAFVAFKDAVTNKV----NSLKKIRGPDPNYKDICFAGAGR 362
Query: 386 KTGS----FPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVVTDREASGGPSIILGN 440
FP++ + F G +++L ENY F A CL V + + P+ +LG
Sbjct: 363 NVSQLSEVFPDVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGK---DPTTLLGG 419
Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
++N V YD N+++GF + C
Sbjct: 420 IVVRNTLVTYDRHNEKIGFWKTNC 443
>gi|357127505|ref|XP_003565420.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 466
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 111/410 (27%), Positives = 159/410 (38%), Gaps = 72/410 (17%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQ----------------CKYCSSSKIP 132
Y +++ GTPP + DTGS LVW C +
Sbjct: 82 YLAAVNVGTPPVRFLAVADTGSDLVWLKCNTTQNNNGIVSSDSGNNSNSSPPPPPPEAVV 141
Query: 133 SFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNC---TQICPSYLVLYGS 189
F P SSS +GC P C LAT+ +C + C
Sbjct: 142 YFNPFDSSSYSRVGCDGPSC---------------LALATNASCNGDSHACDFRYSYRDG 186
Query: 190 GLTEGIALSETLNLPNRI------IPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQL 240
G+ ++T I + GC+ ++ Q G+ G G G SL SQL
Sbjct: 187 ASATGLLAADTFTFGGNINNDTTSTASIDFGCATGTAGREFQADGMVGLGAGPLSLASQL 246
Query: 241 NLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYY 300
KFS+CL ++ DD +S IL+ G + + G TP + + S A + YY
Sbjct: 247 GR-KFSFCLTAYDIDD----ASSILNFG-ARAVVSDPGAATTPLIASSSNA-----AAYY 295
Query: 301 YVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMA-PELFEPLADEFVSQMV 359
+ + + V GQ V + IVD+GT TF+ L PL E +++++
Sbjct: 296 AISIDSLKVAGQPVPGTTSVSKV--------IVDTGTVLTFLDRAALLAPLT-ESLARVM 346
Query: 360 KNRNYTRALGAEALTGLRPCFDVPGEK--TGSFPE--LKLHFKGGAEVTLPVENYFAVVG 415
RA + L C+DV K G P+ L L GG EV L E F +V
Sbjct: 347 DGAGLPRAPPPDET--LELCYDVSRVKDVDGVIPDVTLVLGGGGGGEVRLTGEGTFVLVK 404
Query: 416 EGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
EG +CL VVT P +LGN +Q+ +V DL + F C
Sbjct: 405 EG-VLCLAVVT-TSPELQPLSVLGNVALQDLHVGIDLDARTATFATANCD 452
>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 633
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 107/399 (26%), Positives = 169/399 (42%), Gaps = 74/399 (18%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y+ L GTPPQ+ I+D+GS + + PC++ C+ C + P F P+LSS+ + +
Sbjct: 92 GYYTTRLWIGTPPQMFALIVDSGSTVTYVPCSD---CEQCGKHQDPKFQPELSSTYQPVK 148
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
C N C+ + QC + A + + L+ +G +E+ P R
Sbjct: 149 C-NMDCN-CDDDKEQC--VYEREYAEHSSSKGVLGEDLISFG---------NESQLTPQR 195
Query: 207 IIPNFLVGCSV-----LSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKF-----DD 256
+ GC L S++ GI G G+G SL QL +DK L+S+ F
Sbjct: 196 AV----FGCETVETGDLYSQRADGIIGLGQGDLSLVDQL-VDK---GLISNSFGLCYGGM 247
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
S+IL SD + S +R S YY + L I V G+++ +
Sbjct: 248 DVGGGSMILGGFDYPSD----------MIFTDSDPDR---SPYYNIDLTGIRVAGKKLSL 294
Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
+ DG G ++DSGTT+ ++ P+ +E V + V + + G
Sbjct: 295 NSRVF----DGEHGAVLDSGTTYAYL-PDAAFAAFEEAVMREVS--------PLKQIDGP 341
Query: 377 RP-----CFDVPG-----EKTGSFPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVV 425
P CF V E + FP +++ FK G L ENY F A CL V
Sbjct: 342 DPNFKDTCFLVAASNDVSELSKIFPSVEMIFKSGQSWLLSPENYMFRHSKVHGAYCLGVF 401
Query: 426 TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ + + +LG ++N V YD N ++GF + C
Sbjct: 402 PNGKDH---TTLLGGIVVRNTLVVYDRENSKVGFWRTNC 437
>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
Length = 388
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 102/420 (24%), Positives = 157/420 (37%), Gaps = 58/420 (13%)
Query: 61 HIKNPQTKTTTTTTTTTTTNISSHSYGG-YSISLSFGTPPQIIPFILDTGSHLVWFPCTN 119
H + K ++ + ++ G Y + GTPP+ +DTGS L+W C
Sbjct: 7 HDRGRMVKLKSSAVSLPVEGVADPYIAGLYFTQVQLGTPPRTYNLQVDTGSDLLWVNCHP 66
Query: 120 HYQCKYCSSSKIPSFIP---KLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNC 176
C S KIP +P K S+SS + C +P C+ I I CND+ C
Sbjct: 67 CIGCPAFSDLKIP-IVPYDVKASASSSKVPCSDPSCTLITQ--ISESGCNDQ-----NQC 118
Query: 177 TQICPSYLVLYGSGL-TEGIALSETLNLPNRIIPNFLVGCSV-------LSSRQPAGIAG 228
Y YG G T G + + L+ + GC S R GI G
Sbjct: 119 -----GYSFQYGDGSGTLGYLVEDVLHYMVNATATVIFGCGFKQSGDLSTSERALDGIIG 173
Query: 229 FGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNP 288
FG S SQL + + +H D R +++ D + YTP V
Sbjct: 174 FGASDLSFNSQLAKQGKTPNVFAHCLDGGERGGGILVLGNVIEPD-----IQYTPLV--- 225
Query: 289 SVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFE 348
+ +Y V L+ I+V + + K + D GTI DSGTT ++ E ++
Sbjct: 226 ------PYMYHYNVVLQSISVNNANLTIDPKLFS--NDVMQGTIFDSGTTLAYLPDEAYQ 277
Query: 349 PLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVE 408
+ VS +V + + L FP + L+F+G + P E
Sbjct: 278 AFT-QAVSLVVAPFLLCDTRLSRFIYKL-------------FPNVVLYFEGASMTLTPAE 323
Query: 409 NYFAVVGEGSAVCLTVVTDREASGGPSI---ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
+A + S + I G+ ++N V YDL R+G++ CK
Sbjct: 324 YLIRQASAANAPIWCMGWQSMGSAESELQYTIFGDLVLKNKLVVYDLERGRIGWRPFDCK 383
>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 109/461 (23%), Positives = 191/461 (41%), Gaps = 78/461 (16%)
Query: 34 SRFHTNPSQDSYQNL---NSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSY---- 86
S FH + S NL N + S + HIK + T +I +H
Sbjct: 20 SVFHLSASPTLVLNLVHSNQIYSLQSPQVSHIKEASVERLEYLKAKATGDIIAHLSPNVP 79
Query: 87 ---GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSR 143
+ +++S G+PP +DT S L+W C C C + +P F P S + R
Sbjct: 80 IIPQAFLVNISIGSPPVTQLLHMDTASDLLWLQCR---PCINCYAQSLPIFDPSRSYTHR 136
Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL 203
C+ + S S++ A +++C Y + Y G L++ + +
Sbjct: 137 NESCRTSQYSM---PSLRFN-------AKTRSC-----EYSMRYMDGTGSKGILAKEMLM 181
Query: 204 PNRI--------IPNFLVGCSVLSSRQP---AGIAGFGRGKTSLPSQLNLDKFSYCLLSH 252
N I + + + GC + +P GI G G G+ SL + KFSYC S
Sbjct: 182 FNTIYDESSSAALHDVVFGCGHDNYGEPLVGTGILGLGYGEFSLVHRFGT-KFSYCFGS- 239
Query: 253 KFDDTTRTSS-LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGG 311
DD + + L+L + ++ TT L ++ +YYV + I+V G
Sbjct: 240 -LDDPSYPHNVLVLGDDGANILGDTTPL--------------EIYNGFYYVTIEAISVDG 284
Query: 312 QRVRV--WHKYLTLDRD---GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
+ + W +R+ G GGTI+D+G + T + E ++PL ++ + + R
Sbjct: 285 IILPIDPW----VFNRNHQTGLGGTIIDTGNSLTSLVEEAYKPLKNK-IEDYFEGRFTAA 339
Query: 367 ALGAEALTGLRPCFDVPGEK---TGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLT 423
+ + + + C++ E+ FP + HF GAE++L V++ F + + CL
Sbjct: 340 DVNQDDMFKVE-CYNGNLERDLVESGFPIVTFHFSDGAELSLDVKSVFMKLSP-NVFCLA 397
Query: 424 VVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
V S +G Q+Y + YDL +++ F++ C
Sbjct: 398 VTPGNMNS------IGATAQQSYNIGYDLEAKKISFERIDC 432
>gi|168014188|ref|XP_001759635.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689174|gb|EDQ75547.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 485
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 101/401 (25%), Positives = 158/401 (39%), Gaps = 73/401 (18%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y+ + GTP Q I+DTGS + + PC++ C + + P F P SSS + +
Sbjct: 97 GYYTSRVFIGTPAQEFALIVDTGSTVTYVPCSSCTHCGHHQACFDPRFKPDNSSSYQTVS 156
Query: 147 CQNPKC--SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLP 204
C +P C QC+ + A + + L+ +G+G
Sbjct: 157 CNSPDCITKMCDARVHQCK--YERVYAEMSSSKGVLGKDLLGFGNG-------------- 200
Query: 205 NRIIPN-FLVGCSVLSS-----RQPAGIAGFGRGKTSLPSQL-----NLDKFSYCLLSHK 253
+R+ P+ L GC + + GI G GRG S+ QL D FS C
Sbjct: 201 SRLQPHPLLFGCETAETGDLYLQHADGIMGLGRGPLSIVDQLVGTGAMEDSFSLCYGG-- 258
Query: 254 FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
+D G G P + ++ N S YY + L I V G
Sbjct: 259 -----------MDEGGG---SMVLGAIPPPPAMVFAKSDPNR-SNYYNLELSEIQVQGVS 303
Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
+ V + +G GT++DSGTT+ ++ + F+ D Q+ +A+
Sbjct: 304 LNVPSEVF----NGRLGTVLDSGTTYAYLPDKAFDAFKDAITQQLGS---------LQAV 350
Query: 374 TGLRP-----CFDVPGEKTGS----FPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLT 423
G P CF G + + FP + F G +V L ENY F A CL
Sbjct: 351 PGPDPSYPDVCFAGAGSDSKALGKHFPPVDFVFSGNQKVFLAPENYLFKHTKVPGAYCLG 410
Query: 424 VVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+++A + +LG ++N V YD N ++GF + C
Sbjct: 411 FFKNQDA----TTLLGGIVVRNTLVTYDRANHQIGFFKTNC 447
>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
Length = 642
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 112/444 (25%), Positives = 175/444 (39%), Gaps = 76/444 (17%)
Query: 44 SYQNLNSLVSSSLTRAL-HIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQII 102
+Y N L +SS R L NP + T G Y+ L GTP Q
Sbjct: 54 AYPNATRLPASSARRGLGDGHNPNARMRLHDDLLTN--------GYYTTRLYIGTPSQEF 105
Query: 103 PFILDTGSHLVWFPCTNHYQCKYCSS-------SKIPSFIPKLSSSSRLLGCQNPKCSWI 155
I+D+GS + + PC QC S + P F P LSS+ + C N C+
Sbjct: 106 ALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLSSTYSPVKC-NVDCT-C 163
Query: 156 HHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGC 215
+E QC + ++ S VL ++ G E+ P R + GC
Sbjct: 164 DNERSQC--------TYERQYAEMSSSSGVLGEDIMSFG---KESELKPQRAV----FGC 208
Query: 216 S-----VLSSRQPAGIAGFGRGKTSLPSQLNL-----DKFSYCLLSHKFDDTTRTSSLIL 265
L S+ GI G GRG+ S+ QL D FS C T ++L
Sbjct: 209 ENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGT----MVL 264
Query: 266 DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
+ D +++ V +P YY + L+ I V G+ +R+ K
Sbjct: 265 GGMPAPPDMV---FSHSNPVRSP----------YYNIELKEIHVAGKALRLDPKIF---- 307
Query: 326 DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGE 385
+ GT++DSGTT+ ++ + F D +++ N + + CF G
Sbjct: 308 NSKHGTVLDSGTTYAYLPEQAFVAFKDAVTNKV----NSLKKIRGPDPNYKDICFAGAGR 363
Query: 386 KTGS----FPELKLHFKGGAEVTLPVENY-FAVVGEGSAVCLTVVTDREASGGPSIILGN 440
FP++ + F G +++L ENY F A CL V + + P+ +LG
Sbjct: 364 NVSQLSEVFPDVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGK---DPTTLLGG 420
Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
++N V YD N+++GF + C
Sbjct: 421 IVVRNTLVTYDRHNEKIGFWKTNC 444
>gi|340810961|gb|AEK75407.1| S5 [Oryza sativa]
gi|340811037|gb|AEK75445.1| S5 [Oryza rufipogon]
Length = 357
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 105/404 (25%), Positives = 171/404 (42%), Gaps = 77/404 (19%)
Query: 91 ISLSFGTPPQIIPFILDTGSHLVWF---PCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGC 147
+++S G PP + +DTGS L W PC H C S+ P F P S +SR + C
Sbjct: 1 MAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVH--CHTQSAKAGPIFDPGRSYTSRRVRC 58
Query: 148 QNPKCSWIHHE-SIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL--TEGIALSETLNLP 204
+ KC + ++ +Q +C ++ +CT Y V YG+G + G +++TL +
Sbjct: 59 SSVKCGELRYDLRLQQANCMEK----EDSCT-----YSVTYGNGWAYSVGKMVTDTLRIG 109
Query: 205 NRIIPNFLVGCS--VLSSRQPAGIAGFGRGK-------TSLPSQLNLDKFSYCLLSHKFD 255
+ + + + GCS V S AGI GFG P L+ SYCL +
Sbjct: 110 DSFM-DLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKALSYCLPT---- 164
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPF---VNNPSVAERNAFSVYYYVGLRRITVGGQ 312
D T+ +IL D+ YTP +N P+ Y + + + GQ
Sbjct: 165 DETKPGYMIL----GRYDRAAMDGGYTPLFRSINRPT----------YSLTMEMLIANGQ 210
Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
R+ + IVDSG T + P F L D+ ++Q + + Y R A
Sbjct: 211 RLVT----------SSSEMIVDSGAQRTSLWPSTFA-LLDKTITQAMSSIGYHRTSRARQ 259
Query: 373 LTGLRPCFDVPGEKTG------------SFPELKLHFKGGAEVTLPVENYFAVVGEGSAV 420
+ + C+ + +G + P L++ F GGA + LP N F +
Sbjct: 260 ESYI--CYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVF-YNDPHRGL 316
Query: 421 CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
C+T + S ILGN +++ +D++ ++ GFK +C
Sbjct: 317 CMTFAQNPALR---SQILGNRVTRSFGTTFDIQGKQFGFKYAVC 357
>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
Length = 440
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 119/470 (25%), Positives = 192/470 (40%), Gaps = 72/470 (15%)
Query: 15 FFTLLSIFPSSITSLTFSLSRFHTNPSQDSYQNLNSLVSSSLTRALHIKNPQTKTTTTTT 74
F ++ I + + + FS+ H + + N + + L R + + + + +
Sbjct: 19 FVSVAHISAAEVKNGRFSIDLIHRDSPKSPLYNPSETPAERLDR--FFRRFMSFSEASIS 76
Query: 75 TTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSF 134
T S + G Y + +S GTPP + I DTGS L+W C C C K P F
Sbjct: 77 PNTPEPPVSSNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQC---LPCLSCYKQKNPMF 133
Query: 135 IPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQ---ICPSYLVLYGSG- 190
P S+S + + C ES QCR L + +C+Q +C + YG G
Sbjct: 134 DPSKSTSFKEVSC----------ESQQCR------LLDTVSCSQPQKLC-DFSYGYGDGS 176
Query: 191 LTEGIALSETLNLPNR-----IIPNFLVGCSVLSS----RQPAGIAGFGRGKTSLPSQL- 240
L +G+ +ETL L + I N + GC +S G+ G G SL SQ+
Sbjct: 177 LAQGVIATETLTLNSNSGQPXSIXNIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIM 236
Query: 241 ----NLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFV--NNPSVAERN 294
+ KFS CL+ + D + TS +I ++ + + TP V ++P+
Sbjct: 237 STLGSGRKFSQCLVPFRTDPSI-TSKIIF---GPEAEVSGSXVVSTPLVTKDDPT----- 287
Query: 295 AFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEF 354
YY+V L I+V G ++ + + GN +D+GT T L +F
Sbjct: 288 ----YYFVTLDGISV-GDKLFPFSSSSPMATKGN--VFIDAGTPPTL--------LPRDF 332
Query: 355 VSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVV 414
+++V+ + L+P P L HF GA+V L N F
Sbjct: 333 YNRLVQGVKEAIPMEPVQDPDLQPQLCYRSATLIDGPILTAHFD-GADVQLKPLNTFISP 391
Query: 415 GEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
EG C + + G + I GNF N+ + +DL +++ FK C
Sbjct: 392 KEG-VYCFAM----QPIDGDTGIFGNFVQMNFLIGFDLDGKKVSFKAVDC 436
>gi|340810959|gb|AEK75406.1| S5 [Oryza sativa]
gi|340810971|gb|AEK75412.1| S5 [Oryza rufipogon]
Length = 357
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 106/404 (26%), Positives = 170/404 (42%), Gaps = 77/404 (19%)
Query: 91 ISLSFGTPPQIIPFILDTGSHLVWF---PCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGC 147
+++S G PP + +DTGS L W PC H C S+ P F P S +SR + C
Sbjct: 1 MAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVH--CHTQSAKAGPIFDPGRSYTSRRVRC 58
Query: 148 QNPKCSWIHHE-SIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL--TEGIALSETLNLP 204
+ KC ++ +Q +C ++ +CT Y V YG+G + G +++TL +
Sbjct: 59 SSVKCGEPRYDLRLQQANCMEK----EDSCT-----YSVTYGNGWAYSVGKMVTDTLRIG 109
Query: 205 NRIIPNFLVGCS--VLSSRQPAGIAGFGRGK-------TSLPSQLNLDKFSYCLLSHKFD 255
+ + + + GCS V S AGI GFG P L+ FSYCL +
Sbjct: 110 DSFM-DLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCLPT---- 164
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPF---VNNPSVAERNAFSVYYYVGLRRITVGGQ 312
D T+ +IL D+ YTP +N P+ Y + + + GQ
Sbjct: 165 DETKPGYMIL----GRYDRAAMDGGYTPLFRSINRPT----------YSLTMEMLIANGQ 210
Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
R+ + IVDSG T + P F L D+ ++Q + + Y R A
Sbjct: 211 RLVT----------SSSEMIVDSGAQRTSLWPSTFA-LLDKTITQAMSSIGYHRTSRARQ 259
Query: 373 LTGLRPCFDVPGEKTG------------SFPELKLHFKGGAEVTLPVENYFAVVGEGSAV 420
+ + C+ + +G + P L++ F GGA + LP N F +
Sbjct: 260 ESYI--CYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVF-YNDPHRGL 316
Query: 421 CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
C+T + S ILGN +++ +D++ ++ GFK C
Sbjct: 317 CMTFAQNPALR---SQILGNRVTRSFGTTFDIQGKQFGFKYAAC 357
>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
Length = 430
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 102/398 (25%), Positives = 169/398 (42%), Gaps = 67/398 (16%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKI--PSFIPKLSSSSRLLG 146
+ ++ S G PP I+DTGS L+W C + CK+CSS+ + P F P LSS+
Sbjct: 68 FFVNFSVGQPPVPQFTIMDTGSSLLWIQC---HPCKHCSSNHMIHPVFNPALSSTFVECS 124
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNL-- 203
C + C + A + +C+ Y +Y SG ++G+ E L
Sbjct: 125 CDDRFCRY----------------APNGHCSSNKCVYEQVYISGTGSKGVLAKERLTFTT 168
Query: 204 PN---RIIPNFLVGCSVLSSRQPA----GIAGFGRGKTSLPSQLNLDKFSYCL--LSHKF 254
PN + GC + Q GI G G TSL QL KFSYC+ L++K
Sbjct: 169 PNGNTVVTQPIAFGCGHENGEQLESEFTGILGLGAKPTSLAVQLG-SKFSYCIGDLANK- 226
Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
+ L+L + + +P+ E + YY+ L I+VG +++
Sbjct: 227 --NYGYNQLVLGEDAD-------------ILGDPTPIEFETENGIYYMNLEGISVGDKQL 271
Query: 315 RVWHKYLTLDRDGN-GGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
+ + + R G+ G I+D+GT +T++A + L +E S + + R + L
Sbjct: 272 NI--EPVVFKRRGSRTGVILDTGTLYTWLADIAYRELYNEIKS--ILDPKLERFWFRDFL 327
Query: 374 TGLRPCFD-VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEG----SAVCLTVVTDR 428
C+ E+ FP + HF GGAE+ + + F + E + C++V
Sbjct: 328 -----CYHGRVNEELIGFPVVTFHFAGGAELAMEATSMFYPMTESDTYHNVFCMSVRPTT 382
Query: 429 EASGGPS--IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
E G +G Q Y + YDL+ + + ++ C
Sbjct: 383 EHGGEYKDFTAIGLMAQQYYNIAYDLKERNIYLQRIDC 420
>gi|302783112|ref|XP_002973329.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
gi|300159082|gb|EFJ25703.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
Length = 437
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 103/397 (25%), Positives = 169/397 (42%), Gaps = 61/397 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + G P Q + I+DTGS ++W C+ C+ C S + IP LS + L
Sbjct: 81 GLYYTEIGLGNPVQKLKVIVDTGSDILWVKCS---PCRSCLSKQ--DIIPPLSIYN--LS 133
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIA-----LSETL 201
+ + + C E S++ + +Y + Y T A + L
Sbjct: 134 ASSTSSVSSCSDPL----CTGEQAVCSRSGSNSACAYGISYQDKSTSIGAYVKDDMHYVL 189
Query: 202 NLPNRIIPNFLVGCSV-LSSRQPA-GIAGFGRGKTSLPSQL----NLDK-FSYCLLSHKF 254
N + GC++ ++ PA GI GFG+ ++P+Q+ N+ + FS+CL K
Sbjct: 190 QGGNATTSHIFFGCAINITGSWPADGIMGFGQISKTVPNQIATQRNMSRVFSHCLGGEKH 249
Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
IL+ G + TT + +TP +N + +Y V L I+V + +
Sbjct: 250 GGG------ILEFG---EEPNTTEMVFTPLLN---------VTTHYNVDLLSISVNSKVL 291
Query: 315 RVWHKYLTL--DRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
+ K + + G I+DSGT+F +A + L E +N T A
Sbjct: 292 PIDSKEFSYVSNSTNETGVIIDSGTSFALLATKANRILFSEI-------KNLTTAKLGPK 344
Query: 373 LTGLRPCFDVPGEKT--GSFPELKLHFKGGAEVTLPVENYFAVV---GEGSAVCLTVVTD 427
L GL+ CF + T SFP + L F GG+ + L +NY +V + + C
Sbjct: 345 LEGLQ-CFYLKSGLTVETSFPNVTLTFSGGSTMKLKPDNYLVMVELKKKRNGYCYAW--- 400
Query: 428 REASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+S I G +++ V YD+ N+R+G+K Q C
Sbjct: 401 --SSADGLTIFGEIVLKDKLVFYDVENRRIGWKGQNC 435
>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 458
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 106/398 (26%), Positives = 166/398 (41%), Gaps = 67/398 (16%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKI--PSFIPKLSSSSRLLG 146
+ ++ S G PP I+DTGS L+W C CK+CSS + P F P LSS+
Sbjct: 96 FLVNFSVGQPPVPQLTIMDTGSSLLWIQCQ---PCKHCSSDHMIHPVFNPALSSTFVECS 152
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNL-- 203
C + CR + +S C Y +Y SG ++G+ E L
Sbjct: 153 CDDRF----------CRYAPNGHCGSSNKCV-----YEQVYISGTGSKGVLAKERLTFTT 197
Query: 204 PN---RIIPNFLVGCSVLSSRQPA----GIAGFGRGKTSLPSQLNLDKFSYCL--LSHKF 254
PN + GC + Q GI G G TSL QL KFSYC+ L++K
Sbjct: 198 PNGNTVVTQPIAFGCGYENGEQLESHFTGILGLGAKPTSLAVQLG-SKFSYCIGDLANK- 255
Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
+ L+L + + +P+ E + YY+ L I+VG ++
Sbjct: 256 --NYGYNQLVLGEDAD-------------ILGDPTPIEFETENSIYYMNLEGISVGDTQL 300
Query: 315 RVWHKYLTLDRDG-NGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL 373
+ + + R G G I+DSGT +T++A + L +E S + + R + L
Sbjct: 301 NI--EPVVFKRRGPRTGVILDSGTLYTWLADIAYRELYNEIKS--ILDPKLERFWFRDFL 356
Query: 374 TGLRPCFD--VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEG---SAVCLTVVTDR 428
C+ V E G FP + HF GGAE+ + + F + E + C++V +
Sbjct: 357 -----CYHGRVSEELIG-FPVVTFHFAGGAELAMEATSMFYPLSEPNTFNVFCMSVKPTK 410
Query: 429 EASG--GPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
E G +G Q Y + YDL+ + + ++ C
Sbjct: 411 EHGGEYKEFTAIGLMAQQYYNIGYDLKEKNIYLQRIDC 448
>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 100/411 (24%), Positives = 158/411 (38%), Gaps = 83/411 (20%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSS 141
G Y + GTPP+ +DTGS ++W C QCK C + + + K SSS
Sbjct: 83 GLYYAKIGIGTPPKNYYLQVDTGSDIMWVNCI---QCKECPTRSNLGMDLTLYDIKESSS 139
Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETL 201
+ + C C I + L T CP YL +YG G + + +
Sbjct: 140 GKFVPCDQEFCKEI-----------NGGLLTGCTANISCP-YLEIYGDGSSTAGYFVKDI 187
Query: 202 NLPNRIIPNF---------LVGC------SVLSSRQPA--GIAGFGRGKTSLPSQLN--- 241
L +++ + + GC + SS + A GI GFG+ +S+ SQL
Sbjct: 188 VLYDQVSGDLKTDSANGSIVFGCGARQSGDLSSSNEEALGGILGFGKANSSMISQLASSG 247
Query: 242 --LDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVN-NPSVAERNAFSV 298
F++CL NG + G P VN P + ++ +SV
Sbjct: 248 KVKKMFAHCL-----------------NGVNGGGIFAIGHVVQPKVNMTPLLPDQPHYSV 290
Query: 299 YYYVGLRRITVGGQRVRVWHKYLTLDRD-----GNGGTIVDSGTTFTFMAPELFEPLADE 353
+T V+V H +L+L D GTI+DSGTT ++ ++EPL +
Sbjct: 291 -------NMTA----VQVGHAFLSLSTDTSTQGDRKGTIIDSGTTLAYLPEGIYEPLVYK 339
Query: 354 FVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAV 413
+SQ L L CF FP + +F+ G + + +Y
Sbjct: 340 IISQHPD-------LKVRTLHDEYTCFQYSESVDDGFPAVTFYFENGLSLKVYPHDYLFP 392
Query: 414 VGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
G+ + + +LG+ + N V YDL NQ +G+ + C
Sbjct: 393 SGDFWCIGWQNSGTQSRDSKNMTLLGDLVLSNKLVFYDLENQVIGWTEYNC 443
>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
Length = 434
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 102/420 (24%), Positives = 157/420 (37%), Gaps = 58/420 (13%)
Query: 61 HIKNPQTKTTTTTTTTTTTNISSHSYGG-YSISLSFGTPPQIIPFILDTGSHLVWFPCTN 119
H + K ++ + ++ G Y + GTPP+ +DTGS L+W C
Sbjct: 7 HDRGRMVKLKSSAVSLPVEGVADPYIAGLYFTQVQLGTPPRTYNLQVDTGSDLLWVNCHP 66
Query: 120 HYQCKYCSSSKIPSFIP---KLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNC 176
C S KIP +P K S+SS + C +P C+ I I CND+ C
Sbjct: 67 CIGCPAFSDLKIP-IVPYDVKASASSSKVPCSDPSCTLITQ--ISESGCNDQ-----NQC 118
Query: 177 TQICPSYLVLYGSGL-TEGIALSETLNLPNRIIPNFLVGCSV-------LSSRQPAGIAG 228
Y YG G T G + + L+ + GC S R GI G
Sbjct: 119 -----GYSFQYGDGSGTLGYLVEDVLHYMVNATATVIFGCGFKQSGDLSTSERALDGIIG 173
Query: 229 FGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNP 288
FG S SQL + + +H D R +++ D + YTP V
Sbjct: 174 FGASDLSFNSQLAKQGKTPNVFAHCLDGGERGGGILVLGNVIEPD-----IQYTPLV--- 225
Query: 289 SVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFE 348
+ +Y V L+ I+V + + K + D GTI DSGTT ++ E ++
Sbjct: 226 ------PYMSHYNVVLQSISVNNANLTIDPKLFS--NDVMQGTIFDSGTTLAYLPDEAYQ 277
Query: 349 PLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVE 408
+ VS +V + + L FP + L+F+G + P E
Sbjct: 278 AFT-QAVSLVVAPFLLCDTRLSRFIYKL-------------FPNVVLYFEGASMTLTPAE 323
Query: 409 NYFAVVGEGSAVCLTVVTDREASGGPSI---ILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
+A + S + I G+ ++N V YDL R+G++ CK
Sbjct: 324 YLIRQASAANAPIWCMGWQSMGSAESELQYTIFGDLVLKNKLVVYDLERGRIGWRPFDCK 383
>gi|356537015|ref|XP_003537027.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 476
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 99/380 (26%), Positives = 159/380 (41%), Gaps = 58/380 (15%)
Query: 106 LDTGSHLVWFPCTNHYQCKYCSSSKIP-SFIPKL-SSSSRLLGCQNPKC-SWIHHESIQC 162
+DTGS ++W C C S I +F + SS++ L+ C + C S + + +C
Sbjct: 85 IDTGSDILWVNCNTCSNCPQSSQLGIELNFFDTVGSSTAALIPCSDLICTSGVQGAAAEC 144
Query: 163 RDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNLPNRII---------PNFL 212
S Q SY YG G T G +S+ + N I+ +
Sbjct: 145 ----------SPRVNQC--SYTFQYGDGSGTSGYYVSDAMYF-NLIMGQPPAVNSTATIV 191
Query: 213 VGCSVLSS-------RQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFD-DTTRTSSLI 264
GCS+ S + GI GFG G S+ SQL+ + + SH D L+
Sbjct: 192 FGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSQGITPKVFSHCLKGDGNGGGILV 251
Query: 265 LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLD 324
L + + Y+P V PS +Y + L+ I V GQ + + ++
Sbjct: 252 L------GEILEPSIVYSPLV--PS-------QPHYNLNLQSIAVNGQPLPINPAVFSIS 296
Query: 325 RDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPG 384
+ GGTIVD GTT ++ E ++PL + + ++ T + G + C+ V
Sbjct: 297 NN-RGGTIVDCGTTLAYLIQEAYDPLVTAINTAVSQSARQTNSKGNQ-------CYLVST 348
Query: 385 EKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQ 444
FP + L+F+GGA + L E Y G + V ++ G S ILG+ ++
Sbjct: 349 SIGDIFPLVSLNFEGGASMVLKPEQYLMHNGYLDGAEMWCVGFQKLQEGAS-ILGDLVLK 407
Query: 445 NYYVEYDLRNQRLGFKQQLC 464
+ V YD+ QR+G+ C
Sbjct: 408 DKIVVYDIAQQRIGWANYDC 427
>gi|116666775|pdb|2B42|A Chain A, Crystal Structure Of The Triticum Xylanse Inhibitor-I In
Complex With Bacillus Subtilis Xylanase
Length = 381
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 97/396 (24%), Positives = 156/396 (39%), Gaps = 88/396 (22%)
Query: 104 FILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLL-------GCQNPKCSWIH 156
+LD LVW C + P+ IP SS + LL GC P C
Sbjct: 26 LVLDVAGPLVW---------STCKGGQPPAEIP-CSSPTCLLANAYPAPGCPAPSCGSDK 75
Query: 157 HESIQCRDCNDEPL-ATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFLVGC 215
H+ + C P S C S+ + + T+G +N+ L C
Sbjct: 76 HD----KPCTAYPYNPVSGACAAGSLSH-TRFVANTTDGSKPVSKVNV------GVLAAC 124
Query: 216 S---VLSS--RQPAGIAGFGRGKTSLPSQLN-----LDKFSYCLLSHKFDDTTRTSSLIL 265
+ +L+S R G+AG +LP+Q+ ++F CL T I
Sbjct: 125 APSKLLASLPRGSTGVAGLANSGLALPAQVASAQKVANRFLLCL------PTGGPGVAIF 178
Query: 266 DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
G + T + YTP V S +Y+ R I VG RV V L
Sbjct: 179 GGGPVPWPQFTQSMPYTPLVTK-------GGSPAHYISARSIVVGDTRVPVPEGALA--- 228
Query: 326 DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP------- 378
GG ++ + + + P+++ PL D F T+AL A+ G
Sbjct: 229 --TGGVMLSTRLPYVLLRPDVYRPLMDAF----------TKALAAQHANGAPVARAVVAV 276
Query: 379 -----CFDVP--GEKTGSF--PELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDRE 429
C+D G G + P ++L GG++ T+ +N V +G+A C+ V +
Sbjct: 277 APFGVCYDTKTLGNNLGGYAVPNVQLGLDGGSDWTMTGKNSMVDVKQGTA-CVAFVEMKG 335
Query: 430 ASGG----PSIILGNFQMQNYYVEYDLRNQRLGFKQ 461
+ G P++ILG QM+++ +++D+ +RLGF +
Sbjct: 336 VAAGDGRAPAVILGGAQMEDFVLDFDMEKKRLGFSR 371
>gi|449482385|ref|XP_004156266.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 491
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 95/401 (23%), Positives = 174/401 (43%), Gaps = 62/401 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSS 141
G Y + G P + +DTGS ++W C+ C C S ++ F SSS
Sbjct: 82 GLYFTKVKLGNPAREFNVQIDTGSDILWVTCS---PCDGCPDSSGLGIELNLFDTTKSSS 138
Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETL 201
+R+L C +P C+ + + QC L + +C+ S+ SG T G +++++
Sbjct: 139 ARVLPCTDPICAAVSTTTDQC-------LTQTDHCSY---SFHYRDRSG-TSGFYVTDSM 187
Query: 202 N----LPNRIIPN----FLVGCSVL-------SSRQPAGIAGFGRGKTSLPSQLNLDKFS 246
+ L I N + GCS+ +++ GI GFG+G+ S+ SQL+ +
Sbjct: 188 HFDILLGESTIANSSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGIT 247
Query: 247 YCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
+ SH +++ + + Y+P + PS +Y + L+
Sbjct: 248 PKVFSHCLKGGENGGGILV-----LGEILEPSIVYSPLI--PS-------QPHYTLKLQS 293
Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
I + GQ ++ G TI+DSGTT ++ E+++ + S + ++ T
Sbjct: 294 IALSGQ---LFPNPTMFPISNAGETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTI 350
Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYF---AVVGEGSAVCLT 423
+ G++ CF V FP L+ +F+G A + + E Y ++V L
Sbjct: 351 SRGSQ-------CFRVSMSVADIFPVLRFNFEGIASMVVTPEEYLQFDSIVSCYKFASLW 403
Query: 424 VVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ ++A G + ILG+ +++ + YDL QR+G+ C
Sbjct: 404 CIGFQKAEDGLN-ILGDLVLKDKIIVYDLAQQRIGWANYDC 443
>gi|224082314|ref|XP_002306645.1| predicted protein [Populus trichocarpa]
gi|222856094|gb|EEE93641.1| predicted protein [Populus trichocarpa]
Length = 410
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 98/394 (24%), Positives = 164/394 (41%), Gaps = 67/394 (17%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G YS+ L+ G PP+ F +DTGS L W C CK C+ + + PK + L+
Sbjct: 52 GYYSVILNIGNPPKAFDFDIDTGSDLTWVQC--DAPCKGCTKPRDKLYKPK----NNLVP 105
Query: 147 CQNPKCSWIH-HESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSET--LNL 203
C N C + E+ C +D+ C + G + G+ LS++ L L
Sbjct: 106 CSNSLCQAVSTGENYHCDAPDDQ-----------CDYEIEYADLGSSIGVLLSDSFPLRL 154
Query: 204 PNRII--PNFLVGCSV----LSSRQP---AGIAGFGRGKTSLPSQLNLDKFSYCLLSHKF 254
N + P GC L P AGI G GRGK S+ SQL + ++ H F
Sbjct: 155 SNGTLLQPKMAFGCGYDQKHLGPHPPPDTAGILGLGRGKVSILSQLRTLGITQNVVGHCF 214
Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
R L + S + +T+TP + R++ Y G + GG+
Sbjct: 215 -SRARGGFLFFGDHLFPSSR----ITWTPML-------RSSSDTLYSSGPAELLFGGKPT 262
Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT 374
+ K L L I DSG+++T+ ++++ + + +V+ + L
Sbjct: 263 GI--KGLQL--------IFDSGSSYTYFNAQVYQSILN-----LVRKDLAGKPLKDAPEK 307
Query: 375 GLRPCFDVPGEKTGSFPELKLHFK---------GGAEVTLPVENYFAVVGEGSAVCLTVV 425
L C+ + S ++K +FK ++ L E+Y + +G+ VCL ++
Sbjct: 308 ELAVCWKT-AKPIKSILDIKSYFKPLTISFMNAKNVQLQLAPEDYLIITKDGN-VCLGIL 365
Query: 426 TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGF 459
E G ++G+ MQ+ V YD Q++G+
Sbjct: 366 NGSEQQLGNFNVIGDIFMQDRVVIYDNEKQQIGW 399
>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 711
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 90/374 (24%), Positives = 139/374 (37%), Gaps = 91/374 (24%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + L GTPP + +LDTGS L+W C C +C K P F P SS+ + C
Sbjct: 65 YLMKLQIGTPPFEVEAVLDTGSELIWTQC---LPCLHCYDQKAPIFDPSKSSTFKETRCN 121
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR-- 206
P S CP LV T+G +ET+ + +
Sbjct: 122 TPDHS--------------------------CPYKLVYDDKSYTQGTLATETVTIHSTSG 155
Query: 207 ---IIPNFLVGCSVLSSR-----QPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTT 258
++P ++GCS +S +GI G RG SL SQ+
Sbjct: 156 VPFVMPETIIGCSRNNSGSGFRPSSSGIVGLSRGSLSLISQM------------------ 197
Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR-VW 317
G+ D V + ++ + A YY+ L ++VG R+ V
Sbjct: 198 --------GGAYPGDG----------VVSTTMFAKTAKRGQYYLNLDAVSVGDTRIETVG 239
Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
+ L NG ++DSGT T+ P + L + V ++V + L
Sbjct: 240 TPFHAL----NGNIVIDSGTPLTYF-PVSYCNLVRKAVERVVTADRVVDPSRNDMLCYYS 294
Query: 378 PCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSII 437
++ FP + +HF GGA++ L N + + G CL ++ + I
Sbjct: 295 NTIEI-------FPVITVHFSGGADLVLDKYNMYMELNRGGVFCLAIICNNPTQ---VAI 344
Query: 438 LGNFQMQNYYVEYD 451
GN N+ V YD
Sbjct: 345 FGNRAQNNFLVGYD 358
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 94/383 (24%), Positives = 150/383 (39%), Gaps = 63/383 (16%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + L GTPP I ++DTGS + W C C +C P F P SS+ + C
Sbjct: 380 YLMKLQVGTPPFEIEAVIDTGSEITWTQC---LPCVHCYKQNAPIFDPSKSSTFKEKRCH 436
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
+ C + E K T+ G+ T+ + + T P ++
Sbjct: 437 DHSCPY-------------EVDYFDKTYTK---------GTLATDTVTIHSTSGEP-FVM 473
Query: 209 PNFLVGCSVLSSR-QPA--GIAGFGRGKTSLPSQLNLDK---FSYCLLSHKFDDTTRTSS 262
++GC +S +P+ G G G SL +Q+ + SYC
Sbjct: 474 AETIIGCGRNNSWFRPSFEGFVGLNWGPLSLITQMGGEYPGLMSYCFAG----------- 522
Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR-VWHKYL 321
NG+S + T + V + ++ A +YY+ L ++VG R+ + +
Sbjct: 523 ----NGTSKINFGTNAIVGGGGVVSTTMFVTTARPGFYYLNLDAVSVGDTRIETLGTPFH 578
Query: 322 TLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFD 381
L+ G ++DSGTT T+ PE + L + V +V G + L C+
Sbjct: 579 ALE----GNIVIDSGTTLTYF-PESYCNLVRQAVEHVVPAVPAADPTGNDLL-----CYY 628
Query: 382 VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNF 441
T FP + +HF GGA++ L N F G CL ++ + I GN
Sbjct: 629 --SNTTEIFPVITMHFSGGADLVLDKYNMFMESYSGGLFCLAIICNNPTQ---EAIFGNR 683
Query: 442 QMQNYYVEYDLRNQRLGFKQQLC 464
N+ V YD + + FK C
Sbjct: 684 AQNNFLVGYDSSSLLVSFKPTNC 706
>gi|115463625|ref|NP_001055412.1| Os05g0384300 [Oryza sativa Japonica Group]
gi|50511407|gb|AAT77330.1| unknown protein [Oryza sativa Japonica Group]
gi|113578963|dbj|BAF17326.1| Os05g0384300 [Oryza sativa Japonica Group]
gi|222631434|gb|EEE63566.1| hypothetical protein OsJ_18383 [Oryza sativa Japonica Group]
Length = 477
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 101/380 (26%), Positives = 152/380 (40%), Gaps = 43/380 (11%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSF--IPKLSSSSRL 144
G Y I++ GTPPQ + D S VW PC C S K + +P+ L
Sbjct: 85 GTYLITVGVGTPPQYVYGAFDISSQFVWVPCEECVSPYSCPSDKTGVYKTLPR-----EL 139
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLP 204
C +C I + DC + C C Y G+ + L + L
Sbjct: 140 YSCGEQRCRTIVGQP----DCG---APYNGPCKYTC-RYGGAGGTETEGHLGL-QPFTLG 190
Query: 205 NRIIP-NFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSL 263
+ +P N + GC L G+ G RG+ SL SQL L +FSY + ++DDT ++
Sbjct: 191 DNTMPVNMIFGCG-LEPETNFGVIGLNRGRLSLISQLQLGRFSY-YFAPEYDDTAAGNAS 248
Query: 264 ILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTL 323
+ G ++ +T+ YT F + E A+S Y VGL + VG L +
Sbjct: 249 FILFG-EYAVPQTSNPRYTQFWSY----ENGAYSYLYLVGLSGMRVGSNN-------LNM 296
Query: 324 DRDGNGG-----TIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
G+GG + + TF+ ++ L E VS + + AL GL
Sbjct: 297 LGAGSGGRDPLVAYLSTSVPITFLEKNAYDLLRRELVSTVGSDTVDGSAL------GLDL 350
Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
C+ FP + L F GA + L NY CLT++ A GG S++
Sbjct: 351 CYTSQYLAKAKFPAMALVFWDGAVMELQPRNYLYQDTATGLECLTILPTAVA-GGLSLLG 409
Query: 439 GNFQMQNYYVEYDLRNQRLG 458
Q + + YD++ Q G
Sbjct: 410 SLIQTGTHMMYYDIQIQGRG 429
>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
Length = 465
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 109/398 (27%), Positives = 155/398 (38%), Gaps = 72/398 (18%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSSSR 143
Y ++L GTP ++DTGS L W QCK C + K P F P SSS
Sbjct: 118 YVVTLGIGTPAVQQIVLIDTGSDLSWV------QCKPCGAGECYAQKDPLFDPSSSSSYA 171
Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLN 202
+ C + C + + C + +C Y + YG+ T G+ +ETL
Sbjct: 172 SVPCDSDACRKLAAGAYG-HGC-------TSGAAALC-EYGIEYGNRATTTGVYSTETLT 222
Query: 203 L-PNRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFD 255
L P ++ +F GC + G+ G G SL SQ + FSYCL
Sbjct: 223 LKPGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFSYCL------ 276
Query: 256 DTTRTSSLILDNGSSHSDKKTT---GLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
T + L G+ +S +T G +TP PSV +Y V L I+VGG
Sbjct: 277 PPTSGGAGFLALGAPNSSSSSTAAAGFLFTPMRRIPSV------PTFYVVTLTGISVGGA 330
Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
+ V + G ++DSGT T + + L F S M + R + GA
Sbjct: 331 PLAVPPSAFS------SGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGAV- 383
Query: 373 LTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLP------VENYFAVVGEGSAVCLTVVT 426
L C+D G + P + L F GGA + L V+ A G G T
Sbjct: 384 ---LDTCYDFTGHTNVTVPTIALTFSGGATIDLATPAGVLVDGCLAFAGAG--------T 432
Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
D I+GN + + V YD +GF+ C
Sbjct: 433 DDTIG-----IIGNVNQRTFEVLYDSGKGTVGFRAGAC 465
>gi|125552158|gb|EAY97867.1| hypothetical protein OsI_19787 [Oryza sativa Indica Group]
Length = 477
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 101/380 (26%), Positives = 152/380 (40%), Gaps = 43/380 (11%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSF--IPKLSSSSRL 144
G Y I++ GTPPQ + D S VW PC C S K + +P+ L
Sbjct: 85 GTYLITVGVGTPPQYVYGAFDISSQFVWVPCEECVSPYSCPSDKTGVYKTLPR-----EL 139
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLP 204
C +C I + DC + C C Y G+ + L + L
Sbjct: 140 YSCGEQRCRTIVGQP----DCG---APYNGPCKYTC-RYGGAGGTETEGHLGL-QPFTLG 190
Query: 205 NRIIP-NFLVGCSVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSL 263
+ +P N + GC L G+ G RG+ SL SQL L +FSY + ++DDT ++
Sbjct: 191 DNTMPVNMIFGCG-LEPETNFGVIGLNRGRLSLISQLQLGRFSY-YFAPEYDDTAAGNAS 248
Query: 264 ILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTL 323
+ G ++ +T+ YT F + E A+S Y VGL + VG L +
Sbjct: 249 FILFG-EYAVPQTSNPRYTQFWSY----ENGAYSYLYLVGLSGMRVGSNN-------LNM 296
Query: 324 DRDGNGG-----TIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
G+GG + + TF+ ++ L E VS + + AL GL
Sbjct: 297 LGAGSGGRDPLVAYLSTSVPVTFLEKNAYDLLRRELVSTVGSDTVDGSAL------GLDL 350
Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
C+ FP + L F GA + L NY CLT++ A GG S++
Sbjct: 351 CYTSQYLAKAKFPAMALVFWDGAVMELQPRNYLYQDTATGLECLTILPTAVA-GGLSLLG 409
Query: 439 GNFQMQNYYVEYDLRNQRLG 458
Q + + YD++ Q G
Sbjct: 410 SLIQTGTHMMYYDIQIQGRG 429
>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
Length = 452
Score = 82.4 bits (202), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 101/375 (26%), Positives = 147/375 (39%), Gaps = 55/375 (14%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y IS+ G+P ++DTGS + W C C + F P SS+ C
Sbjct: 108 YVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFNCS 167
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNLP-NR 206
C+ + +S + C+ +K+ Q Y+V YG G T G S+ L L +
Sbjct: 168 AAACAQL-GDSGEANGCD------AKSRCQ----YIVKYGDGSNTTGTYSSDVLTLSGSD 216
Query: 207 IIPNFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTT 258
++ F GCS + G+ G G S SQ F YCL + T
Sbjct: 217 VVRGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSPVSQTAARYGKSFFYCLPA------T 270
Query: 259 RTSS--LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
SS L L +S + TP + + V YY+ L I VGG+++ +
Sbjct: 271 PASSGFLTLGAPASGGGGGASRFATTPMLRSKKV------PTYYFAALEDIAVGGKKLGL 324
Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
G++VDSGT T + P + L+ F + M + Y R AE L L
Sbjct: 325 SPSVFA------AGSLVDSGTVITRLPPAAYAALSSAFRAGMTR---YAR---AEPLGIL 372
Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
CF+ G S P + L F GGA V L + G + D +A G
Sbjct: 373 DTCFNFTGLDKVSIPTVALVFAGGAVVDLDAHGIVS----GGCLAFAPTRDDKAFG---- 424
Query: 437 ILGNFQMQNYYVEYD 451
+GN Q + + V YD
Sbjct: 425 TIGNVQQRTFEVLYD 439
>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
Length = 469
Score = 82.4 bits (202), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 104/398 (26%), Positives = 157/398 (39%), Gaps = 65/398 (16%)
Query: 82 SSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIP 136
SS+ Y ++ GTP ILDTGS L W QCK C+SS ++P F P
Sbjct: 122 SSYDSQEYVATVGLGTPAVPQTLILDTGSSLTWV------QCKPCNSSQCYPQRLPLFDP 175
Query: 137 KLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLT-EGI 195
SSS + C + +C + I C + C +Y + YGSG T G
Sbjct: 176 NTSSSYSPVPCDSQECRAL-AAGIDGDGCTSD---GDWGC-----AYEIHYGSGATPAGE 226
Query: 196 ALSETLNL-PNRIIPNFLVGCSVLSSR----QPAGIAGFGRGKTSLPSQLNLDK----FS 246
++ L L P I+ F GC R G+ G GR SL Q + + FS
Sbjct: 227 YSTDALTLGPGAIVKRFHFGCGHHQQRGKFDMADGVLGLGRLPQSLAWQASARRGGGVFS 286
Query: 247 YCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
+CL T S+ L G+ H T+ +TP + + +Y +
Sbjct: 287 HCL------PPTGVSTGFLALGAPH---DTSAFVFTPLLT------MDDQPWFYQLMPTA 331
Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
I+V GQ + + G I DSGT + + + L F S M +
Sbjct: 332 ISVAGQLLDIPPAVF------REGVITDSGTVLSALQETAYTALRTAFRSAMAEYPL--- 382
Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVT 426
A + L CF+ G + P + L F+GGA V L + + G CL +
Sbjct: 383 ---APPVGHLDTCFNFTGYDNVTVPTVSLTFRGGATVHLDASSGVLMDG-----CLAFWS 434
Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ G ++G+ + V YD+ +++GF+ C
Sbjct: 435 SGDEYTG---LIGSVSQRTIEVLYDMPGRKVGFRTGAC 469
>gi|357143657|ref|XP_003573000.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Brachypodium distachyon]
Length = 464
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 106/391 (27%), Positives = 150/391 (38%), Gaps = 72/391 (18%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y I++S G+P +DTGS + W C + + P SS+ C
Sbjct: 131 YVITVSIGSPAVAXTMFIDTGSDVSWLRCKSRL------------YDPGTSSTYAPFSCS 178
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLNLPNR- 206
P C+ + C ++ C Y V YG G T G S+TL L
Sbjct: 179 APACAQLGRRGTGC--------SSGSTCV-----YSVKYGDGSNTTGTYGSDTLTLAGTS 225
Query: 207 --IIPNFLVGCSVL----SSRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDT 257
+I F GCS + G+ G G S SQ FSYCL
Sbjct: 226 EPLISGFQFGCSAVEHGFEEDNTDGLMGLGGDAQSFVSQTAATYGSAFSYCL------PP 279
Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
T SS L G+ S F P + + A + +Y + LR I+VGG+ + +
Sbjct: 280 TWNSSGFLTLGAPSSSTSAA------FSTTPMLRSKQA-ATFYGLLLRGISVGGKTLEIP 332
Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
+ G+IVDSGT T + P + L+ F M + + A L L
Sbjct: 333 SSVFS------AGSIVDSGTVITRLPPTAYGALSAAFRDGMARYQYQPAA--PRGL--LD 382
Query: 378 PCFDVPGEKTG---SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLT-VVTDREASGG 433
CFD G G + P + L GGA V L + +V +G CL TD + G
Sbjct: 383 TCFDFTGHGEGNNFTVPSVALVLDGGAVVDL---HPNGIVQDG---CLAFAATDDDGRTG 436
Query: 434 PSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN Q + + V YD+ GF+ C
Sbjct: 437 ---IIGNVQQRTFEVLYDVGQSVFGFRPGAC 464
>gi|449432731|ref|XP_004134152.1| PREDICTED: basic 7S globulin-like [Cucumis sativus]
gi|449527081|ref|XP_004170541.1| PREDICTED: basic 7S globulin-like [Cucumis sativus]
Length = 429
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 104/410 (25%), Positives = 166/410 (40%), Gaps = 59/410 (14%)
Query: 81 ISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSS 140
++ H Y I + TP + +D G L+W C + +SS
Sbjct: 36 VTKHPSLQYIIQIHQRTPLVPVNLTVDLGGWLMWVDCDRGF----------------VSS 79
Query: 141 SSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKN--CTQICPSYLVLYGSG--LTEGIA 196
S + C++ +CS +SI C C P N C+ + ++ SG +T +
Sbjct: 80 SYKPARCRSAQCSL--AKSISCGKCYLPPHPGCNNYTCSLSARNTIIQLSSGGEVTSDLV 137
Query: 197 LSETLNLPNRI----IPNFLVGCS---VLSSRQPA--GIAGFGRGKTSLPSQLNLD---- 243
+ N N +PNFL CS +L G+AGFGR + SLPSQ
Sbjct: 138 SVSSTNGFNSTRALSVPNFLFICSSTFLLEGLAGGVTGMAGFGRTRISLPSQFAAAFSFS 197
Query: 244 -KFSYCLLSHKFDDTTRTSSLILDN-GSSH---SDKKTTGLTYTPFVNNPSVAERNAFSV 298
KF+ CL +T +I G H + T LTYTP + NP V S
Sbjct: 198 RKFTMCL-----SGSTGFPGVIFSGYGPYHFLPNIDLTNSLTYTPLLINP-VGFAGEKSS 251
Query: 299 YYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQM 358
Y++G++ I + V + L +D +GNGGT + + +T + ++ L F S++
Sbjct: 252 EYFIGVKSIEFNSKTVPLNTTLLKIDSNGNGGTKISTVNPYTVLETSIYRALVKTFTSEL 311
Query: 359 VKNRNYTRALGAEALTGLRPCFDVPG----EKTGSFPELKLHFKGGAEV-TLPVENYFAV 413
N R A+ C+ E S P + L + + + N V
Sbjct: 312 ---GNIPR---VAAVAPFEVCYSSKSFGSTELGPSVPSIDLILQNKKVIWRMFGANSMVV 365
Query: 414 VGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
V E +CL V + ++++G Q+++ +E+DL RLGF L
Sbjct: 366 VTE-EVLCLGFV-EGGVEAETAMVIGGHQIEDNLLEFDLATSRLGFSSTL 413
>gi|388516465|gb|AFK46294.1| unknown [Medicago truncatula]
Length = 434
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 101/377 (26%), Positives = 153/377 (40%), Gaps = 50/377 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y + + GTP Q++ +LDT + + P + C CS++ +F P S+S L
Sbjct: 96 GNYIVRVKIGTPGQLLFMVLDTSTDEAFIPSSG---CIGCSAT---TFSPNASTSYVPLE 149
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
C P+CS + + C P S C S+ Y + ++L L
Sbjct: 150 CSVPQCSQVR--GLSC------PATGSGAC-----SFNKSYAGSTYSATLVQDSLRLATD 196
Query: 207 IIPNFLVGC------SVLSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRT 260
+IP++ G S + ++ G+ S L FSYCL S F +
Sbjct: 197 VIPSYSFGSINAISGSSIPAQGLLGLGRGPLSLLSQTGSLYSGVFSYCLPS--FKSYYFS 254
Query: 261 SSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKY 320
SL L +TT P + NP R + Y+V L ITVG V +
Sbjct: 255 GSLKLGPVGQPKSIRTT-----PLLRNP---RRPSL---YFVNLTGITVGKVNVPFPKEL 303
Query: 321 LTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF 380
L D + GTI+DSGT T ++ + DEF Q+ + +LGA CF
Sbjct: 304 LAFDVNTGSGTIIDSGTVITRFVEPVYNAVRDEFRKQVTGPFS---SLGA-----FDTCF 355
Query: 381 DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV-TDREASGGPSIILG 439
E P + LHF ++ LP+EN GS CL + T + + ++
Sbjct: 356 VKNYETLA--PAITLHFT-DLDLKLPLENSLIHSSSGSLACLAMASTPKNVNYTVLNVIA 412
Query: 440 NFQMQNYYVEYDLRNQR 456
N+Q QN V +D N +
Sbjct: 413 NYQQQNLRVLFDTVNNK 429
>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 475
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 106/405 (26%), Positives = 166/405 (40%), Gaps = 70/405 (17%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIP--SFIPKLSSSSRL 144
G Y L G+PP+ +DTGS ++W C +C S I + PK S +S L
Sbjct: 68 GLYFTKLGLGSPPKDYYVQVDTGSDILWVNCVKCSRCPRKSDLGIDLTLYDPKGSETSEL 127
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQI-CPSYLVLYGSG-LTEGIALSETL- 201
+ C CS + D P+ K ++I CP Y + YG G T G + + L
Sbjct: 128 ISCDQEFCSATY----------DGPIPGCK--SEIPCP-YSITYGDGSATTGYYVQDYLT 174
Query: 202 ----NLPNRIIP---NFLVGCSVL------SSRQPA--GIAGFGRGKTSLPSQLNLDK-- 244
N R P + + GC + SS + A GI GFG+ +S+ SQL
Sbjct: 175 YNHVNDNLRTAPQNSSIIFGCGAVQSGTLSSSSEEALDGIIGFGQSNSSVLSQLAASGKV 234
Query: 245 ---FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYY 301
FS+CL D R + G P V+ + R A +Y
Sbjct: 235 KKIFSHCL------DNIRGGGIF-----------AIGEVVEPKVSTTPLVPRMA---HYN 274
Query: 302 VGLRRITVGGQRVRVWHKYLTLDRDGNG-GTIVDSGTTFTFMAPELFEPLADEFVSQMVK 360
V L+ I V +++ GNG GTI+DSGTT ++ +++ L + +++ +
Sbjct: 275 VVLKSIEVDTDILQLPSDIFD---SGNGKGTIIDSGTTLAYLPAIVYDELIPKVMARQPR 331
Query: 361 NRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGS-A 419
+ Y L + + CF G FP +KLHF+ +T+ +Y +G
Sbjct: 332 LKLY---LVEQQFS----CFQYTGNVDRGFPVVKLHFEDSLSLTVYPHDYLFQFKDGIWC 384
Query: 420 VCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ + +G +LG+ + N V YDL N +G+ C
Sbjct: 385 IGWQKSVAQTKNGKDMTLLGDLVLSNKLVIYDLENMAIGWTDYNC 429
>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
Length = 466
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 104/424 (24%), Positives = 163/424 (38%), Gaps = 74/424 (17%)
Query: 66 QTKTTTTTTTTTTTNISSHSYGG---YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQ 122
+ +++ + +SS +Y G Y + L GTP Q + DTGS L W C
Sbjct: 90 RVAAEVASSSAVSLPMSSGAYSGTGQYFVKLRVGTPVQEFTLVADTGSDLTWVKCAG--- 146
Query: 123 CKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICP- 181
+S F PK S S + C + C D P + + P
Sbjct: 147 ----ASPPGRVFRPKTSRSWAPIPCSSDTCKL------------DVPFTLANCSSPASPC 190
Query: 182 --SYLVLYGSGLTEGIALSE--TLNLPNRIIP---NFLVGCSV----LSSRQPAGIAGFG 230
Y GS GI +E T+ LP + + ++GCS S R G+ G
Sbjct: 191 TYDYRYKEGSAGARGIVGTESATIALPGGKVAQLKDVVLGCSSSHDGQSFRSADGVLSLG 250
Query: 231 RGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNN 287
K S +Q FSYCL+ H R ++ L G + T T T +
Sbjct: 251 NAKISFATQAAARFGGSFSYCLVDHL---APRNATGYLAFGPGQVPR--TPATQTKLFLD 305
Query: 288 PSVAERNAFSVYYYVGLRRITVGGQRV----RVWHKYLTLDRDGNGGTIVDSGTTFTFMA 343
P + +Y V + I V G+ + VW +GG I+DSG T T +A
Sbjct: 306 PEMP-------FYGVKVDAIHVAGKALDIPAEVWDAK-------SGGVILDSGNTLTVLA 351
Query: 344 PELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGS---FPELKLHFKGG 400
P V+ + K+ + + + C++ + G+ P+L + F G
Sbjct: 352 ----APAYKAVVAALSKHLDGVPKV---SFPPFEHCYNWTARRPGAPEIIPKLAVQFAGS 404
Query: 401 AEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFK 460
A + P ++Y V G C+ V +E ++GN Q + E+DL+N ++ FK
Sbjct: 405 ARLEPPAKSYVIDVKPG-VKCIGV---QEGEWPGLSVIGNIMQQEHLWEFDLKNMQVRFK 460
Query: 461 QQLC 464
Q C
Sbjct: 461 QSNC 464
>gi|148910602|gb|ABR18371.1| unknown [Picea sitchensis]
Length = 446
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 94/403 (23%), Positives = 161/403 (39%), Gaps = 60/403 (14%)
Query: 86 YGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLL 145
YG Y +++ G P + +D+GS L W C C C+ P + KL S L+
Sbjct: 76 YGLYYVTMLVGNPSKPYFLDVDSGSELTWIQCDA--PCISCAKGPHPLY--KLKKGS-LV 130
Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLN--L 203
++P C+ + S + K +Q C + G +EG + +++ L
Sbjct: 131 PSKDPLCAAVQAGSGHYHN--------HKEASQRCDYDVAYADHGYSEGFLVRDSVRALL 182
Query: 204 PNRII--PNFLVGCSV-------LSSRQPAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKF 254
N+ + N + GC +S + GI G G G SLPSQ ++ H
Sbjct: 183 TNKTVLTANSVFGCGYNQRESLPVSDARTDGILGLGSGMASLPSQWAKQGLIKNVIGHCI 242
Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
R + S T+ +T+ P + PS+ +YYVG ++ G +
Sbjct: 243 FGAGRDGGYMFFGDDLVS---TSAMTWVPMLGRPSIK-------HYYVGAAQMNFGNK-- 290
Query: 315 RVWHKYLTLDRDGN----GGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGA 370
LD+DG+ GG I DSG+T+T+ + + F+S + +N + +
Sbjct: 291 -------PLDKDGDGKKLGGIIFDSGSTYTYFTNQAY----GAFLSVVKENLSGKQLEQD 339
Query: 371 EALTGLRPC------FDVPGEKTGSFPELKLHFKGGAEVTLPV--ENYFAVVGEGSAVCL 422
+ + L C F E F L L F+ + + E Y V +G+ VCL
Sbjct: 340 SSDSFLSLCWRRKEGFRSVAEAAAYFKPLTLKFRSTKTKQMEIFPEGYLVVNKKGN-VCL 398
Query: 423 TVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
++ + +LG+ Q V YD ++G+ + C+
Sbjct: 399 GILNGTAIGIVDTNVLGDISFQGQLVVYDNEKNQIGWARSDCQ 441
>gi|340810977|gb|AEK75415.1| S5 [Oryza rufipogon]
Length = 357
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 105/404 (25%), Positives = 170/404 (42%), Gaps = 77/404 (19%)
Query: 91 ISLSFGTPPQIIPFILDTGSHLVWF---PCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGC 147
+++S G PP + +DTGS L W PC H C S+ P F P S +SR + C
Sbjct: 1 MAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVH--CHTQSAKAGPIFDPGRSYTSRRVRC 58
Query: 148 QNPKCSWIHHE-SIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL--TEGIALSETLNLP 204
+ KC ++ +Q +C ++ +CT Y V YG+G + G +++TL +
Sbjct: 59 SSVKCGEPRYDLRLQQANCMEK----EDSCT-----YSVTYGNGWAYSVGKMVTDTLRIG 109
Query: 205 NRIIPNFLVGCS--VLSSRQPAGIAGFGRGK-------TSLPSQLNLDKFSYCLLSHKFD 255
+ + + + GCS V S AGI GFG P L+ SYCL +
Sbjct: 110 DSFM-DLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKALSYCLPT---- 164
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPF---VNNPSVAERNAFSVYYYVGLRRITVGGQ 312
D T+ +IL D+ YTP +N P+ Y + + + GQ
Sbjct: 165 DETKPGYMIL----GRYDRAAMDGGYTPLFRSINRPT----------YSLTMEMLIANGQ 210
Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
R+ + IVDSG T + P F L D+ ++Q + + Y R A
Sbjct: 211 RLVT----------SSSEMIVDSGAQRTSLWPSTFA-LLDKTITQAMSSIGYHRTSRARQ 259
Query: 373 LTGLRPCFDVPGEKTG------------SFPELKLHFKGGAEVTLPVENYFAVVGEGSAV 420
+ + C+ + +G + P L++ F GGA + LP N F +
Sbjct: 260 ESYI--CYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVF-YNDPHRGL 316
Query: 421 CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
C+T + S ILGN +++ +D++ ++ GFK +C
Sbjct: 317 CMTFAQNPALR---SQILGNRVTRSFGTTFDIQGKQFGFKYAVC 357
>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 488
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 103/418 (24%), Positives = 164/418 (39%), Gaps = 97/418 (23%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSS 141
G Y + GTPP+ +DTGS ++W C QCK C + + + K SSS
Sbjct: 81 GLYYAKIGIGTPPKNYYLQVDTGSDIMWVNCI---QCKECPTRSSLGMDLTLYDIKESSS 137
Query: 142 SRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETL 201
+L+ C C I + L T CP YL +YG G + + +
Sbjct: 138 GKLVPCDQEFCKEI-----------NGGLLTGCTANISCP-YLEIYGDGSSTAGYFVKDI 185
Query: 202 NLPNRIIPNF---------LVGC------SVLSSRQPA--GIAGFGRGKTSLPSQLN--- 241
L +++ + + GC + SS + A GI GFG+ +S+ SQL
Sbjct: 186 VLYDQVSGDLKTDSANGSIVFGCGARQSGDLSSSNEEALDGILGFGKANSSMISQLASSG 245
Query: 242 --LDKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVN-NPSVAERNAFSV 298
F++CL NG + G P VN P + ++ +SV
Sbjct: 246 KVKKMFAHCL-----------------NGVNGGGIFAIGHVVQPKVNMTPLLPDQPHYSV 288
Query: 299 YYYVGLRRITVGGQRVRVWHKYLTLDRDGNG-----GTIVDSGTTFTFMAPELFEPLADE 353
+T V+V H +L+L D + GTI+DSGTT ++ ++EPL +
Sbjct: 289 -------NMTA----VQVGHTFLSLSTDTSAQGDRKGTIIDSGTTLAYLPEGIYEPLVYK 337
Query: 354 FVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTL-------P 406
+SQ L + L CF FP + F+ G + + P
Sbjct: 338 MISQHPD-------LKVQTLHDEYTCFQYSESVDDGFPAVTFFFENGLSLKVYPHDYLFP 390
Query: 407 VENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
N++ + + S R++ +LG+ + N V YDL NQ +G+ + C
Sbjct: 391 SVNFWCIGWQNSG-----TQSRDSKN--MTLLGDLVLSNKLVFYDLENQAIGWAEYNC 441
>gi|343161843|dbj|BAK57511.1| extracellular dermal glycoprotein [Nicotiana benthamiana]
Length = 440
Score = 82.0 bits (201), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 100/409 (24%), Positives = 171/409 (41%), Gaps = 83/409 (20%)
Query: 97 TPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIH 156
TP + LD G +W C Y +SSS + C++ +CS
Sbjct: 57 TPLVPVSLTLDLGGQFLWVDCDQGY----------------VSSSYKPARCRSAQCSLAR 100
Query: 157 HESIQCRDCNDEPLATSKNCT-QICPSYLVLYGSGLTEGIALSETLNL-------PNRII 208
C C P N T + P V + T G S+T+ + P R +
Sbjct: 101 AGG--CGQCFSPPKPGCNNDTCGLIPDNTVTQTA--TSGELASDTVQVQSSNGKNPGRNV 156
Query: 209 PN----FLVGCSVLSSRQPAGI---AGFGRGKTSLPSQLNLD-----KFSYCLLSHKFDD 256
+ F+ G + L R +G+ AG GR + SLPSQ + + KF+ CL S
Sbjct: 157 VDKDFLFVCGSTFLLKRLASGVKGMAGLGRTRISLPSQFSAEFSFPRKFAVCLSS----- 211
Query: 257 TTRTSSLILDNGSSHS-----DKKTTGLTYTPFVNNPSVAERNAFSV-----YYYVGLRR 306
+T++ ++L +S + +YTP NP V+ +AFS Y++G++
Sbjct: 212 STKSKGVVLFGDGPYSFLPNREFANDDFSYTPLFINP-VSTASAFSSGEPSSEYFIGVKS 270
Query: 307 ITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTR 366
I + + V + L++D G GGT + + +T + ++ + + FV ++V N TR
Sbjct: 271 IKINQKVVSINTTLLSIDNQGVGGTKISTVNPYTILETSIYNAVTNFFVKELV---NITR 327
Query: 367 ALGAEALTGLRPCFD---VPGEKTG-SFPELKLHFKGGAEVTLPVENYF-AVVGEGSAV- 420
++ CFD + + G + P + L + EN F + G S V
Sbjct: 328 ---VASVAPFGACFDSRNIVSTRVGPTVPPIDLVLQN--------ENVFWTIFGANSMVQ 376
Query: 421 ------CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
CL V D + SI++G + +++ +++DL + RLGF +
Sbjct: 377 VSENVLCLGFV-DGGVNPRTSIVIGGYTIEDNLLQFDLASSRLGFTSSI 424
>gi|225438361|ref|XP_002273988.1| PREDICTED: aspartic proteinase Asp1 [Vitis vinifera]
gi|296082608|emb|CBI21613.3| unnamed protein product [Vitis vinifera]
Length = 426
Score = 82.0 bits (201), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 97/398 (24%), Positives = 167/398 (41%), Gaps = 67/398 (16%)
Query: 87 GGYSISLSFGTPPQIIPFILD--TGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRL 144
G Y +SLS G PP+ P+ LD TGS L W C C C+ + P + P ++ L
Sbjct: 65 GYYYVSLSIGQPPK--PYFLDPDTGSDLSWLQCDA--PCVRCTKAPHPLYRP----NNNL 116
Query: 145 LGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTE-GIALSET--L 201
+ C++P C+ +H +C + C Y V Y G + G+ + + L
Sbjct: 117 VICKDPMCASLHPPGYKCEH--------PEQC-----DYEVEYADGGSSLGVLVKDVFPL 163
Query: 202 NLPN--RIIPNFLVGCSVL----SSRQP-AGIAGFGRGKTSLPSQLNLDKFSYCLLSHKF 254
N N R+ P +GC S P G+ G G+GK+S+ SQL+ ++ H
Sbjct: 164 NFTNGLRLAPRLALGCGYDQIPGQSYHPLDGVLGLGKGKSSIVSQLHSQGVIRNVVGHCV 223
Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
++R + + + + +TP + + +Y G + +GG+
Sbjct: 224 --SSRGGGFLFFGDDLYDSSR---VVWTPMLRDQ--------HTHYSSGYAELILGGKTT 270
Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEAL- 373
+ +T DSG+++T++ ++ L V + + + AL + L
Sbjct: 271 VFKNLLVTF----------DSGSSYTYLNSLAYQALV-HLVRKELSEKPVREALDDQTLP 319
Query: 374 ---TGLRPCFDVPGEKTGSFPELKLHFKGGA----EVTLPVENYFAVVGEGSAVCLTVVT 426
G RP V K F L L F GG + +P+E+Y + +G+ VCL ++
Sbjct: 320 LCWRGKRPFKSVRDVKK-FFKPLALSFPGGGRTKTQYDIPLESYLIISLKGN-VCLGILN 377
Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
EA ++G+ MQ+ V YD ++G+ C
Sbjct: 378 GTEAGLQDFNLIGDISMQDKMVVYDNEKNQIGWAPTNC 415
>gi|168060150|ref|XP_001782061.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666472|gb|EDQ53125.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 423
Score = 82.0 bits (201), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 100/407 (24%), Positives = 167/407 (41%), Gaps = 70/407 (17%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y ++L G+PP++ +DTGS L W C C+ C+ + PK ++++
Sbjct: 38 GLYYMALLLGSPPKLYFLDMDTGSDLTWAQC--DAPCRNCAIGPHGLYNPK---KAKVVD 92
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNL-- 203
C P C+ I +CN + K C Y V Y G T G+ + +TL +
Sbjct: 93 CHLPVCAQIQQGG--SYECNSD----VKQC-----DYEVEYADGSSTMGVLVEDTLTVRL 141
Query: 204 --PNRIIPNFLVGCSVLS----SRQPA---GIAGFGRGKTSLPSQLNLDKFSYCLLSHKF 254
I ++GC ++ PA G+ G K +LP+QL +L H
Sbjct: 142 TNGTLIQTKAIIGCGYDQQGTLAKSPASTDGVIGLSSSKVALPAQLAEKGIIKNVLGHCL 201
Query: 255 DDTTRTSSLILDNGSSHSDK--KTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
D + + D+ + G+T+TP + P + + Y L+ I GG
Sbjct: 202 ADGSNGGGYLF-----FGDELVPSWGMTWTPMMGKPEM-------LGYQARLQSIRYGGD 249
Query: 313 RVRVWHKYLTLDRD---GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
+ L D D + DSGT+FT++ P+ + + +S + K R
Sbjct: 250 SL-----VLNNDEDLTRSTSSVMFDSGTSFTYLVPQAYASV----LSAVTKQSGLLR--- 297
Query: 370 AEALTGLRPCFDVPG------EKTGSFPELKLHFKG------GAEVTLPVENYFAVVGEG 417
++ T L C+ P + F L L F G + + L + Y V +G
Sbjct: 298 VKSDTTLPYCWRGPSPFQSITDVHQYFKTLTLDFGGRNWFATDSTLDLSPQGYLIVSTQG 357
Query: 418 SAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ VCL ++ AS + I+G+ M+ Y V YD R+G+ ++ C
Sbjct: 358 N-VCLGILDASGASLEVTNIIGDVSMRGYLVVYDNVRDRIGWIRRNC 403
>gi|21717154|gb|AAM76347.1|AC074196_5 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433293|gb|AAP54831.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125532791|gb|EAY79356.1| hypothetical protein OsI_34485 [Oryza sativa Indica Group]
Length = 397
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 98/384 (25%), Positives = 156/384 (40%), Gaps = 49/384 (12%)
Query: 93 LSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKC 152
+ GTPPQ I+D LVW C+ +C C +P FIP SS+ R C C
Sbjct: 47 FTIGTPPQPASAIIDVAGELVWTQCS---RCSRCFKQDLPLFIPNASSTFRPEPCGTDAC 103
Query: 153 SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNFL 212
+S +C+ + + T ++ T I T GI +ET + +
Sbjct: 104 -----KSTPTSNCSGD-VCTYESTTNI------RLDRHTTLGIVGTETFAI-GTATASLA 150
Query: 213 VGCSVLSSRQP----AGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILDNG 268
GC V S +G G GR SL +Q+ L KFSYCL T ++S L L +
Sbjct: 151 FGCVVASDIDTMDGTSGFIGLGRTPRSLVAQMKLTKFSYCLSPRG---TGKSSRLFLGSS 207
Query: 269 SSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGN 328
+ + ++T + PF+ + + YY + L I G + +
Sbjct: 208 AKLAGGEST--STAPFIKTSPDDDSHH---YYLLSLDAIRAGNTTIATAQ---------S 253
Query: 329 GGTIV-DSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCF-DVPGEK 386
GG +V + + F+ + + + V++ V L CF G
Sbjct: 254 GGILVMHTVSPFSLLVDSAYRAF-KKAVTEAVGGAAAPPMATPPQPFDL--CFKKAAGFS 310
Query: 387 TGSFPELKLHFK-GGAEVTLPVENYFAVVG-EGSAVCLTVVT----DREASGGPSIILGN 440
+ P+L F+ GGA +T+P Y VG E C +++ +R G S +LG+
Sbjct: 311 RATAPDLVFTFQGGGAALTVPPAKYLIDVGEEKDTACAAILSMARLNRTGLEGVS-VLGS 369
Query: 441 FQMQNYYVEYDLRNQRLGFKQQLC 464
Q +N + YDL+ + L F+ C
Sbjct: 370 LQQENVHFLYDLKKETLSFEPADC 393
>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 394
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 90/345 (26%), Positives = 147/345 (42%), Gaps = 71/345 (20%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y+ + GTPPQ I+DTGS + + PC+ C+ C + P F P+LSS+ + +
Sbjct: 88 GYYTTRIWIGTPPQTFALIVDTGSTVTYVPCST---CEQCGRHQDPKFEPELSSTYQPVS 144
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
C N C+ +E QC + A + + + ++ +G +++ +P R
Sbjct: 145 C-NIDCT-CDNERKQC--VYERQYAEMSSSSGVLGEDIISFG---------NQSELVPQR 191
Query: 207 IIPNFLVGCS-----VLSSRQPAGIAGFGRGKTSLPSQLNL-----DKFSYCLLSHKFDD 256
I GC L S++ GI G GRG S+ QL D FS C
Sbjct: 192 AI----FGCENQETGDLYSQRADGIMGLGRGDLSIVDQLVEKGVISDSFSLCYGGMDIGG 247
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAF-SVYYYVGLRRITVGGQRVR 315
++IL S S G+ + AE + S YY + L+ I V G+++
Sbjct: 248 ----GAMILGGISPPS-----GMVF---------AESDPVRSQYYNIDLKAIHVAGKQLH 289
Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVK---------NRNYTR 366
+ DG GT++DSGTT+ ++ F D + ++ N N
Sbjct: 290 LDPSIF----DGKHGTVLDSGTTYAYLPEAAFTAFKDAMMKELTSLKQIHGPDPNYNDIC 345
Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYF 411
GAE+ DV + + +FP +++ F G +++L ENY
Sbjct: 346 FSGAES--------DV-SQLSNTFPAVEMVFSNGQKLSLSPENYL 381
>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
Length = 325
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 100/375 (26%), Positives = 151/375 (40%), Gaps = 65/375 (17%)
Query: 104 FILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCR 163
++DTGS + W C C C + F P S++ + L C + C + S C
Sbjct: 3 LLIDTGSDITWIQCD---PCPQCYKQQDSLFQPAGSATYKPLPCNSTMCQQLQSFSHSC- 58
Query: 164 DCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPNRI-----IPNFLVGCSV 217
L +S N Y+V YG T G ETL L + +PNF GC
Sbjct: 59 ------LNSSCN-------YMVSYGDKSTTRGDFALETLTLRSDDTILVSVPNFAFGCGH 105
Query: 218 LSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILDNGSSH 271
+ AG+ G G+ P+Q ++ FSYCL S ++ S IL G +
Sbjct: 106 ANKGLFNGAAGLMGLGKSSIGFPAQTSVAFGKVFSYCLPSV----SSTIPSGILHFGEAA 161
Query: 272 SDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGT 331
+ +TP V++ S + Y+V + I VG + + + +
Sbjct: 162 --MLDYDVRFTPLVDSSSGPSQ------YFVSMTGINVGDELLPI-----------SATV 202
Query: 332 IVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSFP 391
+VDSGT + +E L D F +Q++ A ++ CF V + P
Sbjct: 203 MVDSGTVISRFEQSAYERLRDAF-TQILPGLQT-----AVSVAPFDTCFRVSTVDDINIP 256
Query: 392 ELKLHFKGGAEVTL-PVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEY 450
+ LHF+ AE+ L PV + V + +C +S G S+ LGNFQ QN Y
Sbjct: 257 LITLHFRDDAELRLSPVHILYPV--DDGVMCFAFA---PSSSGRSV-LGNFQQQNLRFVY 310
Query: 451 DLRNQRLGFKQQLCK 465
D+ RLG C
Sbjct: 311 DIPKSRLGISAFECN 325
>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 427
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 111/457 (24%), Positives = 180/457 (39%), Gaps = 70/457 (15%)
Query: 29 LTFSLSRFHTNPSQDSYQNL---NSLVSSSLTRALHIKNPQTKTTTTTTTTTTTNISSHS 85
L FS+S H + S NL + S HIK + TT +I +H
Sbjct: 15 LCFSISVVHLSASPTLVLNLVHSYHIYSRKPPHVYHIKEASVERLEYLKAKTTGDIIAHL 74
Query: 86 Y-------GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKL 138
+ +++S G+PP +DT S L+W C C C + +P F P
Sbjct: 75 SPNVPIIPQAFLVNISIGSPPITQLLHMDTASDLLWIQCL---PCINCYAQSLPIFDPSR 131
Query: 139 SSSSRLLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALS 198
S + R C+ + S P T+ C + ++GI
Sbjct: 132 SYTHRNETCRTSQYSM--------------PSLKFNANTRSCEYSMRYVDDTGSKGILAR 177
Query: 199 ETLNLPNRI--------IPNFLVGCSVLSSRQP---AGIAGFGRGKTSLPSQLNLDKFSY 247
E L L N I + + + GC + +P GI G G G+ SL + KFSY
Sbjct: 178 EML-LFNTIYDESSSAALHDVVFGCGHDNYGEPLVGTGILGLGYGEFSLVHRFG-KKFSY 235
Query: 248 CLLSHKFDDTTRTSS-LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRR 306
C S DD + + L+L + ++ TT L N F YYV +
Sbjct: 236 CFGS--LDDPSYPHNVLVLGDDGANILGDTTPLEI-----------HNGF---YYVTIEA 279
Query: 307 ITVGGQRVRVWHKYLTLDRD-GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYT 365
I+V G + + + + G GGTI+D+G + T + E ++PL + + + + R
Sbjct: 280 ISVDGIILPIDPRVFNRNHQTGLGGTIIDTGNSLTSLVEEAYKPLKNR-IEDIFEGRFTA 338
Query: 366 RALGAEALTGLRPCFDVPGEK---TGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCL 422
+ + + + C++ E+ FP + HF GAE++L V++ F + + CL
Sbjct: 339 ADVSQDDMIKME-CYNGNFERDLVESGFPIVTFHFSEGAELSLDVKSLFMKLSP-NVFCL 396
Query: 423 TVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGF 459
V S +G Q+Y + YDL + F
Sbjct: 397 AVTPGNLNS------IGATAQQSYNIGYDLEAMEVSF 427
>gi|168030587|ref|XP_001767804.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680886|gb|EDQ67318.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 107/414 (25%), Positives = 164/414 (39%), Gaps = 91/414 (21%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS--------KIPSFIPKL 138
G Y+ + GTPP I+DTGS + + PC++ C + +S + P F P+
Sbjct: 38 GYYTSRVFIGTPPNEFALIVDTGSTVTYVPCSSCTHCGHHQASFSTHRLFCRDPRFKPEN 97
Query: 139 SSSSRLLGCQNPKC--SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLT-EGI 195
SSS + +GC++ C S QC+ Y +Y T +G+
Sbjct: 98 SSSYQKIGCRSSDCITGLCDSNSHQCK-------------------YERMYAEMSTSKGV 138
Query: 196 ALSETLNL--PNRIIPNFL-VGCSVLSS-----RQPAGIAGFGRGKTSLPSQLN-----L 242
+ L+ +R+ L GC S + GI G GRG S+ QL
Sbjct: 139 LGKDLLDFGPASRLQSQLLSFGCETAESGDLYLQVADGIMGLGRGPLSIVDQLVGNGAIE 198
Query: 243 DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERN-AFSVYYY 301
D FS C +D G + L P + A+ + S YY
Sbjct: 199 DSFSLCYGG-------------MDEGGG-----SMVLGAIPAPSGMVFAKSDPRRSNYYN 240
Query: 302 VGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN 361
+ L I V G +++ +G GTI+DSGTT+ ++ FE D V+Q
Sbjct: 241 LELTEIQVQGASLKLDSNVF----NGKFGTILDSGTTYAYLPDRAFEAFTDAVVAQ---- 292
Query: 362 RNYTRALGA-EALTGLRP-----CFDVPGEKTGS----FPELKLHFKGGAEVTLPVENY- 410
LG+ +A+ G P C+ G T FP + F +V+L ENY
Sbjct: 293 ------LGSLQAVDGPDPNYPDICYAGAGTDTKELGKHFPLVDFVFAENQKVSLAPENYL 346
Query: 411 FAVVGEGSAVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
F A CL +++A + +LG ++N V YD N ++GF + C
Sbjct: 347 FKHTKVPGAYCLGFFKNQDA----TTLLGGIIVRNMLVTYDRYNHQIGFLKTNC 396
>gi|413923981|gb|AFW63913.1| hypothetical protein ZEAMMB73_837345 [Zea mays]
Length = 414
Score = 81.6 bits (200), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 58/185 (31%), Positives = 84/185 (45%), Gaps = 35/185 (18%)
Query: 245 FSYCLLSHKFDDTT----RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYY 300
FSY L+ H D + R L+L +H + L YT F S A+ +Y
Sbjct: 6 FSYRLVEHDSDAVSKVVFREDDLVL----AHPE-----LKYTAFTPTSSPAD-----TFY 51
Query: 301 YVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVK 360
YV L+ + VGG+ +++ + +DG+GGTI+DSGTT ++ EP+
Sbjct: 52 YVKLKGVLVGGELLKISSDTWDVGKDGSGGTIIDSGTTLSYFV----EPV---------- 97
Query: 361 NRNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAV 420
Y L G PC++V G + PEL L F GA P ENYF + +
Sbjct: 98 ---YQAVPSDPGLLGAEPCYNVSGMERPEVPELSLLFPDGAVWDFPAENYFVRLDPDDIM 154
Query: 421 CLTVV 425
CL V+
Sbjct: 155 CLAVL 159
>gi|255586860|ref|XP_002534040.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223525947|gb|EEF28344.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 518
Score = 81.6 bits (200), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 124/501 (24%), Positives = 194/501 (38%), Gaps = 106/501 (21%)
Query: 10 LSFIFFFTLLSI--FPSSITSLTFSLSRFH----------------TNPSQDSYQNLNSL 51
L F+ F LLS+ FP + F+ H PS+ S++ L
Sbjct: 5 LKFLVFSLLLSVWVFPQNCKGRIFTFKMHHRFSDMLKDLSDSTTSRNFPSKGSFEYYAEL 64
Query: 52 V-SSSLTRALHIKNPQTKTTTTTTTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGS 110
+ R + N + + +T ISS + Y+ ++ GTP LDTGS
Sbjct: 65 AHRDQMLRGRKLYNVEAPLAFSDGNSTF-RISSLGFLHYT-TVELGTPGMKFMVALDTGS 122
Query: 111 HLVWFPCTNHYQC------KYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIHHESIQCRD 164
L W PC + +C Y S ++ + PK SS+S+ + C N C+ H + +C
Sbjct: 123 DLFWVPC-DCSKCAPTQGVAYASDFELSIYDPKQSSTSKKVTCNNNLCA---HRN-RC-- 175
Query: 165 CNDEPLATSKNCTQICPSYLVLYGSGL--TEGIALSETLNLPNR------IIPNFLVGC- 215
L T +C Y+V Y S T GI + + L+L + I GC
Sbjct: 176 -----LGTFSSC-----PYMVSYVSAQTSTSGILVEDVLHLTSEDSNQESIKAYVTFGCG 225
Query: 216 -----SVLSSRQPAGIAGFGRGKTSLPS-----QLNLDKFSYCLLSHKFDDTTRTSSLIL 265
S L++ P G+ G G + S+PS L D FS C D R
Sbjct: 226 QVQSGSFLNTAAPNGLFGLGMDQISVPSILSREGLTADSFSMCF---GHDGVGRI----- 277
Query: 266 DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDR 325
S DK + TPF +NPS N + + ++ VG V
Sbjct: 278 ----SFGDKGSPDQEETPFNSNPSHPSYN-------ISVTQVRVGTTLV----------- 315
Query: 326 DGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV-PG 384
D + + DSGT+FT++ ++ +++ F +Q R + C+D+ PG
Sbjct: 316 DVDFTALFDSGTSFTYLINPIYAMVSENFHAQAQDKRR-----PPDPRIPFEYCYDMSPG 370
Query: 385 EKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAV-CLTVVTDREASGGPSIILGNFQM 443
+ P + L KG T+ + + + V CL +V E + I+G M
Sbjct: 371 ANSSLIPSMSLTMKGRGHFTV-FDPIIVITTQNELVYCLAIVKSTELN-----IIGQNFM 424
Query: 444 QNYYVEYDLRNQRLGFKQQLC 464
Y V +D LG+K+ C
Sbjct: 425 TGYRVVFDREKLVLGWKETDC 445
>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 461
Score = 81.3 bits (199), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 104/395 (26%), Positives = 160/395 (40%), Gaps = 59/395 (14%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + GTP + ++DTGS L W C Y+ + + ++ F S S + +GC
Sbjct: 106 YFTEIRVGTPAKKFRVVVDTGSELTWVNC--RYRARGKDNRRV--FRADESKSFKTVGCL 161
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLP--- 204
C + N L T + C SY Y G +G+ ET+ +
Sbjct: 162 TQTC--------KVDLMNLFSLTTCPTPSTPC-SYDYRYADGSAAQGVFAKETITVGLTN 212
Query: 205 NRI--IPNFLVGCSVLSSRQ----PAGIAGFGRGK---TSLPSQLNLDKFSYCLLSHKFD 255
R+ +P L+GCS + Q G+ G TS + L KFSYCL+ H
Sbjct: 213 GRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSFTSTATSLYGAKFSYCLVDH-LS 271
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV- 314
+ ++ LI GSS S K T TP + +Y + + I++G +
Sbjct: 272 NKNVSNYLIF--GSSRSTKTAFRRT-TPL-------DLTRIPPFYAINVIGISLGYDMLD 321
Query: 315 ---RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
+VW D GGTI+DSGT+ T +A ++ + +V+ + + E
Sbjct: 322 IPSQVW------DATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVE----LKRVKPE 371
Query: 372 ALTGLRPCFD-VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA 430
+ + CF G P+L H KGGA ++Y G CL V +
Sbjct: 372 GVP-IEYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVDAAPG-VKCLGFV----S 425
Query: 431 SGGPSI-ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+G P+ ++GN QNY E+DL L F C
Sbjct: 426 AGTPATNVIGNIMQQNYLWEFDLMASTLSFAPSAC 460
>gi|194707292|gb|ACF87730.1| unknown [Zea mays]
Length = 216
Score = 81.3 bits (199), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 64/224 (28%), Positives = 97/224 (43%), Gaps = 25/224 (11%)
Query: 245 FSYCLLSHK---FDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYY 301
FSYCL S++ F + R G++ + + YTP + NP R + YY
Sbjct: 15 FSYCLPSYRSYYFSGSLRL-------GAAGQPRN---VRYTPLLTNP---HRPSL---YY 58
Query: 302 VGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN 361
V + ++VG V+V D GT++DSGT T ++ L +EF Q+
Sbjct: 59 VNVTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAAP 118
Query: 362 RNYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVC 421
YT +LGA CF+ G P + LH GG ++TLP+EN C
Sbjct: 119 SGYT-SLGA-----FDTCFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLAC 172
Query: 422 LTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
L + + ++ N Q QN V D+ R+GF ++ C
Sbjct: 173 LAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPCN 216
>gi|125561847|gb|EAZ07295.1| hypothetical protein OsI_29543 [Oryza sativa Indica Group]
Length = 205
Score = 81.3 bits (199), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 60/188 (31%), Positives = 96/188 (51%), Gaps = 20/188 (10%)
Query: 226 IAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILD--NGSSHSDKKTTGLTY-- 281
+ G GRG SL SQL +FSYCL S + +R + + NG++ S ++GL
Sbjct: 1 MVGLGRGLLSLVSQLGPSRFSYCLTSFLSPEPSRLNFGVFATLNGTNAS---SSGLPVQS 57
Query: 282 TPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTF 341
TP V N A Y++ L+ I++G +R+ + ++ DG GG +DSGT+ T+
Sbjct: 58 TPLVVN------AALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGVFIDSGTSLTW 111
Query: 342 MAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSF--PELKLHFKG 399
+ ++++ + E VS + R A E GL CF P T + P+++LHF G
Sbjct: 112 LQQDVYDAVRRELVSVL---RPLPPANDTE--IGLETCFPWPPPPTVTMTVPDMELHFDG 166
Query: 400 GAEVTLPV 407
GA + P+
Sbjct: 167 GANMLHPI 174
>gi|356548993|ref|XP_003542883.1| PREDICTED: basic 7S globulin-like [Glycine max]
Length = 473
Score = 81.3 bits (199), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 94/400 (23%), Positives = 156/400 (39%), Gaps = 70/400 (17%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y S+ GTP ++D +W+ C HY SSS R + C
Sbjct: 87 YYTSVGIGTPRHNFDLVIDLSGENLWYDCDTHYN----------------SSSYRPIACG 130
Query: 149 NPKCSWIHHESIQCRDCND--EPLATSKNCTQICPSYLV--LYGSGLTEGIALSETLNLP 204
+ +C I C CN +P T+ C + L +Y GL E + + +
Sbjct: 131 SKQC-----PEIGCVGCNGPFKPGCTNNTCPANVINQLAKFIYSGGLGE-----DFIFIR 180
Query: 205 NRIIPNFLVGC-------SVLSSRQP--------AGIAGFGRGKTSLPSQLNL-----DK 244
+ L C S P GI G + + +LP QL K
Sbjct: 181 QNKVSGLLSSCIDTDAFPSFSDDELPLFGLPNNTKGIIGLSKSQLALPIQLASANKVPSK 240
Query: 245 FSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPF-VNNPS---VAERNAFSVYY 300
FS CL S T +L++ G H + L TP VNN S ++ S Y
Sbjct: 241 FSLCLPSLNNQGFT---NLLVRAGEEHPQGISKFLKTTPLIVNNVSTGAISVEGVPSKEY 297
Query: 301 YVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVK 360
++ ++ + + G V + L +D GNGGT + + + FT EL + F+ +K
Sbjct: 298 FIDVKAVQIDGNVVNLKPSLLAIDNKGNGGTKLSTMSPFT----ELQTTVYKTFIRDFIK 353
Query: 361 NRNYTRALGAEALTGLRPCFDVPGEKTGS----FPELKLHFKGGAEVTLPVENYFAVVGE 416
+ R ++ C+D + S P + L +GG + T+ N V+ +
Sbjct: 354 KASDRRLKRVASVAPFEACYDSTSIRNSSTGLVVPTIDLVLRGGVQWTIYGANSM-VMAK 412
Query: 417 GSAVCLTVVTD----REASGGPSIILGNFQMQNYYVEYDL 452
+ CL +V R + SI++G +Q+++ +E+D+
Sbjct: 413 KNVACLAIVDGGTEPRMSFVKASIVIGGYQLEDNLLEFDV 452
>gi|359482287|ref|XP_002263129.2| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
gi|297740017|emb|CBI30199.3| unnamed protein product [Vitis vinifera]
Length = 502
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 102/402 (25%), Positives = 162/402 (40%), Gaps = 64/402 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPS--FIPKLSSSSRL 144
G Y + G+PP+ +DTGS ++W C + C S I F P SS++ L
Sbjct: 84 GLYFTKVKLGSPPREFNVQIDTGSDILWVTCNSCNDCPRTSGLGIELSFFDPSSSSTTSL 143
Query: 145 LGCQNPKC-SWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL-TEGIALSETLN 202
+ C +P C S + + +C S C SY YG G T G +S+ L
Sbjct: 144 VSCSHPICTSLVQTTAAECS-------PQSNQC-----SYSFHYGDGSGTTGYYVSDMLY 191
Query: 203 ----LPNRIIPN----FLVGCSVLSS-------RQPAGIAGFGRGKTSLPSQLNL----- 242
L + +I N + GCS S + GI GFG+ S+ SQL+
Sbjct: 192 FDTVLGDSLIANSSASIVFGCSTYQSGDLTKVDKAIDGIFGFGQQDLSVVSQLSSLGITP 251
Query: 243 DKFSYCLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYV 302
FS+CL IL+ + Y+P V + S +Y +
Sbjct: 252 KVFSHCLKGEGDGGGKLVLGEILE----------PNIIYSPLVPSQS---------HYNL 292
Query: 303 GLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNR 362
L+ I+V GQ + + N GTIVDSGTT T++ ++P + + +
Sbjct: 293 NLQSISVNGQLLPI--DPAVFATSNNQGTIVDSGTTLTYLVETAYDPFVSAITATVSSST 350
Query: 363 NYTRALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCL 422
+ G + C+ V FP + L+F GGA + L Y +G +
Sbjct: 351 TPVLSKGNQ-------CYLVSTSVDEIFPPVSLNFAGGASMVLKPGEYLMHLGFSDGAAM 403
Query: 423 TVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ ++ + ILG+ +++ YDL +QR+G+ C
Sbjct: 404 WCIGFQKVAEPGITILGDLVLKDKIFVYDLAHQRIGWANYDC 445
>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
Length = 439
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 104/395 (26%), Positives = 159/395 (40%), Gaps = 59/395 (14%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + GTP + ++DTGS L W C Y+ + + ++ F S S + +GC
Sbjct: 84 YFTEIRVGTPAKKFRVVVDTGSELTWVNC--RYRARGKDNRRV--FRADESKSFKTVGCL 139
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNLP--- 204
C + N L T + C SY Y G +G+ ET+ +
Sbjct: 140 TQTC--------KVDLMNLFSLTTCPTPSTPC-SYDYRYADGSAAQGVFAKETITVGLTN 190
Query: 205 NRI--IPNFLVGCSVLSSRQ----PAGIAGFGRGK---TSLPSQLNLDKFSYCLLSHKFD 255
R+ +P L+GCS + Q G+ G TS + L KFSYCL+ H
Sbjct: 191 GRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSFTSTATSLYGAKFSYCLVDHL-- 248
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV- 314
+ + S L GSS S K T TP + +Y + + I++G +
Sbjct: 249 -SNKNVSNYLIFGSSRSTKTAFRRT-TPL-------DLTRIPPFYAINVIGISLGYDMLD 299
Query: 315 ---RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE 371
+VW D GGTI+DSGT+ T +A ++ + +V+ + + E
Sbjct: 300 IPSQVW------DATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVE----LKRVKPE 349
Query: 372 ALTGLRPCFD-VPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREA 430
+ + CF G P+L H KGGA ++Y G CL V +
Sbjct: 350 GVP-IEYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVDAAPG-VKCLGFV----S 403
Query: 431 SGGPSI-ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+G P+ ++GN QNY E+DL L F C
Sbjct: 404 AGTPATNVIGNIMQQNYLWEFDLMASTLSFAPSAC 438
>gi|242089103|ref|XP_002440384.1| hypothetical protein SORBIDRAFT_09g030880 [Sorghum bicolor]
gi|241945669|gb|EES18814.1| hypothetical protein SORBIDRAFT_09g030880 [Sorghum bicolor]
Length = 555
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 106/453 (23%), Positives = 170/453 (37%), Gaps = 89/453 (19%)
Query: 63 KNPQTKTTTTT-TTTTTTNISSHSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCT--- 118
K P+ +TT+T + +++ G Y +S+ FGTP +LDT + L W C
Sbjct: 113 KVPKLMSTTSTFELPMRSALNTAHVGMYLVSVRFGTPALPYNLVLDTANDLTWINCRLRR 172
Query: 119 ---NHYQCKYCSSSKIPS-----------------FIPKLSSSSRLLGCQNPKCSWIHHE 158
HY + + + + P SSS R + C +C+ + +
Sbjct: 173 RKGKHYGRQSSKTMSVGGDDDVVAALAKKEARKNWYRPAKSSSWRRIRCSEQQCAHLPYN 232
Query: 159 SIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNL---PNRI--IPNFLV 213
+ Q S + + C Y +T GI +E + R+ +P ++
Sbjct: 233 TCQ-----------SPSKLESCSYYQKTQDGTVTIGIYGNEKATVTVSDGRMAKLPGLVL 281
Query: 214 GCSVLSSRQPA----GIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDTTRTSSLILD 266
GCSVL + G+ G G S L +FS+CLLS +++R +S L
Sbjct: 282 GCSVLEAGASVDAHDGVLSLGNGHMSFAIHAVLRFGGRFSFCLLSA---NSSRDASSYLT 338
Query: 267 NGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLR-----RIT---VGGQRVRVWH 318
G + P V P E + Y V ++ R+T VGG+R+ +
Sbjct: 339 FGPN------------PAVMGPGTMETE---ILYNVDVKAAYGPRVTAVLVGGERLDIPD 383
Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
+D+ G I+D+ T+ T + PE +EPL + L E+ G
Sbjct: 384 DVWNIDKGLGSGVILDTSTSVTSLVPEAYEPLVAALDRHLAH-------LPRESFAGFEY 436
Query: 379 CF-------DVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREAS 431
C+ V + P++ + GGA + P + G V
Sbjct: 437 CYRWTFTGDGVDPAHNVTIPKVTVEMTGGARLE-PEAKSVVMPEVGHGVACLAFRKLPWG 495
Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
GGP II GN MQ Y E D F++ C
Sbjct: 496 GGPCII-GNVLMQEYIWEIDHSKATFRFRKDKC 527
>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
Length = 472
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 105/389 (26%), Positives = 162/389 (41%), Gaps = 56/389 (14%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSS-----KIPSFIPKLSSSSR 143
Y ++L GTP ++DTGS L W QCK C+SS K P + P SS+
Sbjct: 127 YVVTLGIGTPAVQQTVLIDTGSDLSWV------QCKPCNSSSCYPQKDPLYDPTASSTYA 180
Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLN 202
+ C + C + ++ C T+ + T +C Y + YG+ T G+ +ETL
Sbjct: 181 PVPCDSKACKDLVPDAYD-HGC------TNSSGTSLC-QYGIEYGNRDTTVGVYSTETLT 232
Query: 203 L-PNRIIPNFLVGCSVL---SSRQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFD 255
L P + +F GC ++ + G+ G G SL SQ FSYCL
Sbjct: 233 LSPQVSVKDFGFGCGLVQQGTFDLFDGLLGLGGAPESLVSQTAETYGGAFSYCL------ 286
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVR 315
+++ L G+ ++ T G +TP + P A +Y V L ++VGG+ +
Sbjct: 287 PPGNSTTGFLALGAPTNNNDTAGFLFTPLHSLPEQA------TFYLVNLTGVSVGGKPLD 340
Query: 316 VWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTG 375
+ L+ GG I+DSGT T + + L F + M + L
Sbjct: 341 IPPTVLS------GGMIIDSGTIITGLPDTAYSALRTAFRTAM----SAYPLLPPNNDDV 390
Query: 376 LRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPS 435
L C++ G + P + L F GGA + L V + + CL AS G
Sbjct: 391 LDTCYNFTGIANVTVPTVALTFDGGATIDLDVPSGVLIQD-----CLAFA--GGASDGDV 443
Query: 436 IILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN + + V YD +GF+ C
Sbjct: 444 GIIGNVNQRTFEVLYDSGRGHVGFRPGAC 472
>gi|42407406|dbj|BAD09564.1| nucleoid DNA-binding protein-like [Oryza sativa Japonica Group]
Length = 205
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 60/188 (31%), Positives = 96/188 (51%), Gaps = 20/188 (10%)
Query: 226 IAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILD--NGSSHSDKKTTGLTY-- 281
+ G GRG SL SQL +FSYCL S + +R + + NG++ S ++GL
Sbjct: 1 MVGLGRGLLSLVSQLGPSRFSYCLTSFLSPEPSRLNFGVFATLNGTNAS---SSGLPVQS 57
Query: 282 TPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTF 341
TP V N A Y++ L+ I++G +R+ + ++ DG GG +DSGT+ T+
Sbjct: 58 TPLVVN------AALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGVFIDSGTSLTW 111
Query: 342 MAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVPGEKTGSF--PELKLHFKG 399
+ ++++ + E VS + R A E GL CF P T + P+++LHF G
Sbjct: 112 LQQDVYDAVRRELVSVL---RPLPPANDTE--IGLETCFPWPPPPTVTMTVPDMELHFDG 166
Query: 400 GAEVTLPV 407
GA + P+
Sbjct: 167 GANMLHPI 174
>gi|148907857|gb|ABR17052.1| unknown [Picea sitchensis]
Length = 422
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 95/398 (23%), Positives = 157/398 (39%), Gaps = 58/398 (14%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y++ + TP +I +LD +W C N Y SS+ P LGC
Sbjct: 44 YTVEIRQRTPLRIQRLVLDIEEDYMWVRCDNK---SYISSTYSP------------LGCS 88
Query: 149 NPKCSWIHHESIQCRDC--NDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNR 206
C + C C + P + C + + GS E + L LP+
Sbjct: 89 AQLCKSYQYSG--CGTCYGSRGPGCNNNTCV------VAVQGSRSVE--LAQDVLVLPSS 138
Query: 207 I---------IPNFLVGCSVLSSR---QPAGIAGFGRGKTSLPSQLNL-----DKFSYCL 249
P C + S+R G+AG +LPSQL+ KF+ CL
Sbjct: 139 DGSNPGPLARFPQLAFACDLSSNRVISGTVGVAGMTSSTLALPSQLSAAEGFSRKFAMCL 198
Query: 250 LSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITV 309
S L ++ + TP + N + ++ +Y+G++RI V
Sbjct: 199 PSGNAPGALFFGDEPLVFLPPPGRDLSSQIIRTPLIKN------SVYTDVFYLGVQRIEV 252
Query: 310 GGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALG 369
GG V + + L D+DG GGT + + +T +A ++ L F S + K N TR
Sbjct: 253 GGVNVAIDAEKLRFDKDGRGGTKLSTVVRYTQLASPIYNSLEGVFTS-VAKKMNITR--- 308
Query: 370 AEALTGLRPCFDVPG---EKTG-SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVV 425
+++ CFD G + G + P + + +G + T + ++V + V
Sbjct: 309 VASVSPFGACFDSSGVGSTRVGPAVPTIDIVLQGNSTTTWRIFGANSMVRVNNKVLCLGF 368
Query: 426 TDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
D + SI++G +QMQ+ +++DL LGF L
Sbjct: 369 VDGGDNLQQSIVIGTYQMQDNLLQFDLATSTLGFSSSL 406
>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
thaliana]
gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 464
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 106/394 (26%), Positives = 163/394 (41%), Gaps = 75/394 (19%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQC-KYCSSSKIPSFIPKLSSSSRLL 145
G Y +++ GTP + + DTGS L W C C C S K P F P SS+ + +
Sbjct: 130 GNYIVTIGIGTPKHDLSLVFDTGSDLTWTQCE---PCLGSCYSQKEPKFNPSSSSTYQNV 186
Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLP 204
C +P C D ++ NC Y ++YG T+G E L
Sbjct: 187 SCSSPM-------------CEDAESCSASNCV-----YSIVYGDKSFTQGFLAKEKFTLT 228
Query: 205 NR-IIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDDT 257
N ++ + GC + AG+ G G GK SLP+Q + FSYCL S
Sbjct: 229 NSDVLEDVYFGCGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSF----- 283
Query: 258 TRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVW 317
T S+ L GS+ + + +TP + PS AF+ Y + + I+VG + + +
Sbjct: 284 TSNSTGHLTFGSAGISES---VKFTPISSFPS-----AFN--YGIDIIGISVGDKELAIT 333
Query: 318 HKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLR 377
+ + G I+DSGT FT + +++ L F +M ++ T G
Sbjct: 334 PNSFSTE-----GAIIDSGTVFTRLPTKVYAELRSVFKEKMSSYKS-TSGYGL-----FD 382
Query: 378 PCFDVPGEKTGSFPELKLHFKG-------GAEVTLPVENYFAVVGEGSAVCLTVVTDREA 430
C+D G T ++P + F G G+ ++LP++ S VCL + +
Sbjct: 383 TCYDFTGLDTVTYPTIAFSFAGSTVVELDGSGISLPIKI--------SQVCLAFAGNDDL 434
Query: 431 SGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I GN Q V YD+ R+GF C
Sbjct: 435 PA----IFGNVQQTTLDVVYDVAGGRVGFAPNGC 464
>gi|219886219|gb|ACL53484.1| unknown [Zea mays]
gi|219888509|gb|ACL54629.1| unknown [Zea mays]
gi|414588374|tpg|DAA38945.1| TPA: nucellin-like aspartic protease [Zea mays]
Length = 415
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 95/396 (23%), Positives = 158/396 (39%), Gaps = 61/396 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y ++++ G P + +DTGS L W C C+ C+ P + P +++RL+
Sbjct: 51 GHYYVTMNIGNPAKPYFLDVDTGSDLTWLQC--DAPCRSCNKVPHPLYRP---TANRLVP 105
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLY-GSGLTEGIALSETLNLPN 205
C N C+ +H C + K C Y + Y S ++G+ ++++ +LP
Sbjct: 106 CANALCTALHSGQGSNNKC-----PSPKQC-----DYQIKYTDSASSQGVLINDSFSLPM 155
Query: 206 R---IIPNFLVGCSVLS------SRQPA--GIAGFGRGKTSLPSQLNLDKFSYCLLSHKF 254
R I P GC + Q A G+ G GRG SL SQL + ++ H
Sbjct: 156 RSSNIRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCL 215
Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
T L + S + +T+ P S + S Y R + V V
Sbjct: 216 -STNGGGFLFFGDDVVPSSR----VTWVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEV 270
Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN----RNYTRALGA 370
+ DSG+T+T+ + ++ + + K+ + T L
Sbjct: 271 -----------------VFDSGSTYTYFTAQPYQAVVSALKGGLSKSLKQVSDPTLPLCW 313
Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGG--AEVTLPVENYFAVVGEGSAVCLTVVTDR 428
+ + FDV E F + L F A + +P ENY V G+ VCL ++ D
Sbjct: 314 KGQKAFKSVFDVKNE----FKSMFLSFASAKNAAMEIPPENYLIVTKNGN-VCLGIL-DG 367
Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
A+ ++G+ MQ+ V YD +LG+ + C
Sbjct: 368 TAAKLSFNVIGDITMQDQMVIYDNEKSQLGWARGAC 403
>gi|326496543|dbj|BAJ94733.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326511583|dbj|BAJ91936.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 430
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 101/405 (24%), Positives = 166/405 (40%), Gaps = 76/405 (18%)
Query: 97 TPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPKCSWIH 156
TP + +LD G +W C Y +SSS + C + C
Sbjct: 51 TPQVPVTAVLDLGGASLWVDCDAGY----------------VSSSYAGVPCASKLCRL-- 92
Query: 157 HESIQCR-DCNDEPLATSKNC-TQIC---PSYLVLYGSGLTEGIALSETLNLPNRIIPN- 210
+S+ C C +P S C C P V S T G +++ L++P P
Sbjct: 93 AKSVACATSCVGKP---SPGCLNDTCSGFPENTVTRVS--TGGNLITDVLSVPTTFRPAP 147
Query: 211 ----------FLVGCSVLSSRQPAGIAGFG---RGKTSLPSQLNLD-----KFSYCLLSH 252
F G + L+ AG G R + +LP+QL KF+ CL S
Sbjct: 148 GPLATAPAFLFTCGATFLTDGLAAGATGMASLSRARFALPTQLAATFRFSRKFALCLTS- 206
Query: 253 KFDDTTRTSSLILDNGSSHSDKK----TTGLTYTPF-VNNPS---VAERNAFSVYYYVGL 304
T + +++ + ++ + + LTYTP VNN S V+ + S Y++G+
Sbjct: 207 -----TSAAGVVVFGDAPYAFQPGVDLSKSLTYTPLLVNNVSTAGVSGQKDKSNEYFIGV 261
Query: 305 RRITVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNY 364
I V G+ V + L +D+ G GGT + + +T + + + + D F ++
Sbjct: 262 TAIKVNGRAVPLNASLLAIDKQGGGGTKLSTVAPYTVLETSIHKAVTDAFAAETAMIPRV 321
Query: 365 TRALGAEALTGLRPCFDVPGEKTGS------FPELKLHFKGGAEVTLPVENYFAVVGEGS 418
A+ + C+D G K GS P ++L + A + V +G
Sbjct: 322 ------RAVAPFKLCYD--GSKVGSTRVGPAVPTVELVLQNEAASWVVFGANSMVAAKGG 373
Query: 419 AVCLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQL 463
A+CL VV D A+ S+++G M++ +E+DL+ RLGF L
Sbjct: 374 ALCLGVV-DGGAAPRTSVVIGGHTMEDNLLEFDLQRARLGFSSSL 417
>gi|357160697|ref|XP_003578847.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 421
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 102/398 (25%), Positives = 161/398 (40%), Gaps = 57/398 (14%)
Query: 84 HSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSR 143
+ +G Y +++S G PP+ +DTGS L W C C C+ P + P + ++
Sbjct: 53 YPHGLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDA--PCVSCNKVPHPLYRP---TKNK 107
Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSET--L 201
++ C + CS +H C D P Q C + G + G+ L+++ +
Sbjct: 108 IVPCVDQLCSSLHGGLSGKHKC-DSP-------KQQCDYEIKYADQGSSLGVLLTDSFAV 159
Query: 202 NLPNRII--PNFLVGCS----VLSSRQPA---GIAGFGRGKTSLPSQLNLDKFSYCLLSH 252
L N I P+ GC V SS + A G+ G G G SL SQL + ++ H
Sbjct: 160 RLANSSIVRPSLAFGCGYDQQVGSSTEVAPTDGVLGLGSGSISLLSQLKQHGITKNVVGH 219
Query: 253 KFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQ 312
DN +S T+ P V R+AF YY G + GG+
Sbjct: 220 CLSIRGGGFLFFGDNLVPYSRA-----TWVPMV-------RSAFKNYYSPGTASLYFGGR 267
Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNR----NYTRAL 368
+ V + L DSG++FT+ + ++ L S + K + + L
Sbjct: 268 SLGVRPMEVVL----------DSGSSFTYFGAQPYQALVTALKSDLSKTLKEVFDPSLPL 317
Query: 369 GAEALTGLRPCFDVPGEKTGSFPELKLHFKGG--AEVTLPVENYFAVVGEGSAVCLTVVT 426
+ + DV E F L L F G A + +P ENY V G+A CL ++
Sbjct: 318 CWKGKKPFKSVLDVKKE----FKSLVLSFSNGKKALMEIPPENYLIVTKFGNA-CLGILN 372
Query: 427 DREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
E I+G+ MQ+ V YD ++G+ + C
Sbjct: 373 GSEIGLKDLNIVGDITMQDQMVIYDNERGQIGWIRAPC 410
>gi|195645150|gb|ACG42043.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 415
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 95/396 (23%), Positives = 158/396 (39%), Gaps = 61/396 (15%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y ++++ G P + +DTGS L W C C+ C+ P + P +++RL+
Sbjct: 51 GHYYVTMNIGNPAKPYFLDVDTGSDLTWLQC--DAPCRSCNKVPHPLYRP---TANRLVP 105
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLY-GSGLTEGIALSETLNLPN 205
C N C+ +H C + K C Y + Y S ++G+ ++++ +LP
Sbjct: 106 CANALCTALHSGQGSNNKC-----PSPKQC-----DYQIKYTDSASSQGVLINDSFSLPM 155
Query: 206 R---IIPNFLVGCSVLS------SRQPA--GIAGFGRGKTSLPSQLNLDKFSYCLLSHKF 254
R I P GC + Q A G+ G GRG SL SQL + ++ H
Sbjct: 156 RSSNIRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCL 215
Query: 255 DDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRV 314
T L + S + +T+ P S + S Y R + V V
Sbjct: 216 -STNGGGFLFFGDDVVPSSR----VTWVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEV 270
Query: 315 RVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN----RNYTRALGA 370
+ DSG+T+T+ + ++ + + K+ + T L
Sbjct: 271 -----------------VFDSGSTYTYFTAQPYQAVVSALKGGLSKSLKQVSDPTLPLCW 313
Query: 371 EALTGLRPCFDVPGEKTGSFPELKLHFKGG--AEVTLPVENYFAVVGEGSAVCLTVVTDR 428
+ + FDV E F + L F A + +P ENY V G+ VCL ++ D
Sbjct: 314 KGQKAFKSVFDVKNE----FKSMFLSFSSAKNAAMEIPPENYLIVTKNGN-VCLGIL-DG 367
Query: 429 EASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
A+ ++G+ MQ+ V YD +LG+ + C
Sbjct: 368 TAAKLSFNVIGDITMQDQMVIYDNEKSQLGWARGAC 403
>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
Length = 469
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 104/388 (26%), Positives = 150/388 (38%), Gaps = 61/388 (15%)
Query: 89 YSISLSFGTP--PQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
Y ++L FGTP PQ++ ++DTGS + W CT K C K P F P SS+ +
Sbjct: 131 YVVTLGFGTPSVPQVL--LMDTGSDVSWVQCTPCNSTK-CYPQKDPLFDPSKSSTYAPIA 187
Query: 147 CQNPKCSWI-HHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSG-LTEGIALSETLNL- 203
C C + H C + TQ Y V Y G + G+ +ETL L
Sbjct: 188 CNTDACRKLGDHYHNGC----------TSGGTQC--GYSVEYADGSHSRGVYSNETLTLA 235
Query: 204 PNRIIPNFLVGCSVLSSRQPA----GIAGFGRGKTSLPSQLNL---DKFSYCLLSHKFDD 256
P + +F GC R P+ G+ G G SL Q + FSYCL +
Sbjct: 236 PGITVEDFHFGCG-RDQRGPSDKYDGLLGLGGAPVSLVVQTSSVYGGAFSYCLPALN--- 291
Query: 257 TTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRV 316
+ L+L GS S K+ +TP + P ++ +Y V + I+VGG+ + +
Sbjct: 292 -SEAGFLVL--GSPPSGNKSA-FVFTPMRHLP------GYATFYMVTMTGISVGGKPLHI 341
Query: 317 WHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGL 376
GG I+DSGT T + + L R +A
Sbjct: 342 PQSAF------RGGMIIDSGTVDTELPETAYNALEAAL-------RKALKAYPLVPSDDF 388
Query: 377 RPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSI 436
C++ G + P + F GGA + L V N V CL G
Sbjct: 389 DTCYNFTGYSNITVPRVAFTFSGGATIDLDVPNGILV-----NDCLAFQESGPDDG--LG 441
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
I+GN + V YD +GF+ C
Sbjct: 442 IIGNVNQRTLEVLYDAGRGNVGFRAGAC 469
>gi|21717171|gb|AAM76364.1|AC074196_22 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433290|gb|AAP54828.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|125532789|gb|EAY79354.1| hypothetical protein OsI_34483 [Oryza sativa Indica Group]
Length = 382
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 94/388 (24%), Positives = 162/388 (41%), Gaps = 53/388 (13%)
Query: 92 SLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQNPK 151
S + GTPPQ +D G LVW + C + ++P F P SS+ R C
Sbjct: 27 SFTIGTPPQPASAFIDVGGLLVW-TQCSQCSSSSCFNQELPPFDPTKSSTYRPEPCGTAL 85
Query: 152 CSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRIIPNF 211
C + R+C+ + A + TQ+ T G ++ + + +
Sbjct: 86 CEFF---PASIRNCSGDVCAYEAS-TQLFEH---------TSGKIGTDAVAIGTATAASV 132
Query: 212 LVGCSVLSSRQ-----PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLILD 266
GC + S + P+G G R SL +Q+N+ FS+CL H + S L L
Sbjct: 133 AFGCVMASDIKLMDGGPSGFVGLARTPLSLVAQMNVTAFSHCLAPHD-GGGGKNSRLFLG 191
Query: 267 NGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLDRD 326
+ + + TPFV + + + S+YY + L I G + + +T+ +
Sbjct: 192 AAAKLAGGGKSAAMTTPFVKS---SPDDIKSLYYLINLEGIKAGDEAI------ITVPQS 242
Query: 327 GNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALT---GLRPCFDVP 383
G ++ + + +F+ +++ L + T A+G T + FD+
Sbjct: 243 GR-TVLLQTFSPVSFLVDGVYQDL----------KKAVTAAVGGPTATPPEQFQSIFDLC 291
Query: 384 GEKTG--SFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDR-----EASGGPSI 436
++ G P++ L F+G A +T+P NY VG+ + VC+ + + E +G
Sbjct: 292 FKRGGVSGAPDVVLTFQGAAALTVPPTNYLLDVGDDT-VCVAIASSARLNSTEVAG--MS 348
Query: 437 ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
ILG Q QN + YDL + L F+ C
Sbjct: 349 ILGGLQQQNVHFLYDLEKETLSFEAADC 376
>gi|302783208|ref|XP_002973377.1| hypothetical protein SELMODRAFT_413681 [Selaginella moellendorffii]
gi|300159130|gb|EFJ25751.1| hypothetical protein SELMODRAFT_413681 [Selaginella moellendorffii]
Length = 472
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 100/394 (25%), Positives = 151/394 (38%), Gaps = 75/394 (19%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
++++L+ GTPP F + S W C+ C S+ P F S+S + C
Sbjct: 88 FAMNLNLGTPPVQHNFTMALNSEFFWAACSPCVDCNV--STNDPLFSSASSTSYTRIPCT 145
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
+P CS C + ++ SY Y S E + + P +
Sbjct: 146 SPFCS--TSPGFSTNACGSSAVGSTTCLYNF--SYSTDYSSA-GEMASDVVAMKTPRKTR 200
Query: 209 PN----FLVGC-----SVLSSRQPAGIAGFGRGKTSLPSQL-NLD---KFSYCLLSHKFD 255
N +GC ++L +G+ GF + S QL +D KF YC+ S F
Sbjct: 201 GNKSLRMSLGCGRESTTLLGILNTSGLVGFAKTDKSFIGQLAEMDYTSKFIYCVPSDTF- 259
Query: 256 DTTRTSSLILDNG--SSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQR 313
+ ++L N SSHS L+YTP + N + YY+GLR I++
Sbjct: 260 ----SGKIVLGNYKISSHSS-----LSYTPMIVNSTA--------LYYIGLRSISITDTL 302
Query: 314 VRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAE-- 371
L DG GGTI+DS F++ P+ + PL + N N T+ E
Sbjct: 303 TFPVQGILA---DGTGGTIIDSTFAFSYFTPDSYTPLVQAIQNL---NSNLTKVSSNETA 356
Query: 372 ALTGLRPCFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREAS 431
AL G C++V + E + VCL V D E
Sbjct: 357 ALLGNDICYNVSVNDDDA--------------------------ENATVCL-AVGDSEKV 389
Query: 432 GGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLCK 465
G ++G +Q + VE+DL Q +GF C
Sbjct: 390 GFSLNVIGTYQQLDVAVEFDLEKQEIGFGTAGCN 423
>gi|340810981|gb|AEK75417.1| S5 [Oryza rufipogon]
Length = 357
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 104/404 (25%), Positives = 170/404 (42%), Gaps = 77/404 (19%)
Query: 91 ISLSFGTPPQIIPFILDTGSHLVWF---PCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGC 147
+++S G PP + +DTGS L W PC H C S+ P F P S +SR + C
Sbjct: 1 MAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVH--CHTQSAKAGPIFDPGRSYTSRRVRC 58
Query: 148 QNPKCSWIHHE-SIQCRDCNDEPLATSKNCTQICPSYLVLYGSGL--TEGIALSETLNLP 204
+ KC + ++ +Q +C ++ +CT Y V YG+G + G +++TL +
Sbjct: 59 SSVKCGELRYDLRLQQANCMEK----EDSCT-----YSVTYGNGWAYSVGKMVTDTLRIG 109
Query: 205 NRIIPNFLVGCS--VLSSRQPAGIAGFGRGK-------TSLPSQLNLDKFSYCLLSHKFD 255
+ + + + GCS V S AGI GFG P L+ SYCL +
Sbjct: 110 DSFM-DLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKALSYCLPT---- 164
Query: 256 DTTRTSSLILDNGSSHSDKKTTGLTYTPF---VNNPSVAERNAFSVYYYVGLRRITVGGQ 312
D T+ +IL D+ YTP +N P+ Y + + + GQ
Sbjct: 165 DETKPGYMIL----GRYDRAAMDGGYTPLFRSINRPT----------YSLTMEMLIANGQ 210
Query: 313 RVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEA 372
R+ + IVDSG T + P F L D+ ++Q + + Y R A
Sbjct: 211 RLVT----------SSSEMIVDSGAQRTSLWPSTFA-LLDKTITQAMSSIGYHRTSRARQ 259
Query: 373 LTGLRPCFDVPGEKTG------------SFPELKLHFKGGAEVTLPVENYFAVVGEGSAV 420
+ + C+ + +G + P L++ F GGA + L N F +
Sbjct: 260 ESYI--CYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALSPRNVF-YNDPHRGL 316
Query: 421 CLTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
C+T + S ILGN +++ +D++ ++ GFK +C
Sbjct: 317 CMTFAQNPALR---SQILGNRVTRSFGTTFDIQGKQFGFKYAVC 357
>gi|218186522|gb|EEC68949.1| hypothetical protein OsI_37668 [Oryza sativa Indica Group]
Length = 421
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 101/400 (25%), Positives = 166/400 (41%), Gaps = 61/400 (15%)
Query: 84 HSYGGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSR 143
+ +G Y +++S G PP+ +DTGS L W C C CS P + P + ++
Sbjct: 53 YPHGLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDA--PCVSCSKVPHPLYRP---TKNK 107
Query: 144 LLGCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSET--L 201
L+ C + C+ +H C D P Q C + G + G+ ++++ L
Sbjct: 108 LVPCVDQMCAALHGGLTGRHKC-DSP-------KQQCDYEIKYADQGSSLGVLVTDSFAL 159
Query: 202 NLPNRII--PNFLVGCS----VLSSRQPA---GIAGFGRGKTSLPSQLNLDKFSYCLLSH 252
L N I P GC V SS + + G+ G G G SL SQL + ++ H
Sbjct: 160 RLANSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGH 219
Query: 253 KFDDTTRTSSLIL--DNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVG 310
+TR + D+ +S T+ P + S RN YY G + G
Sbjct: 220 CL--STRGGGFLFFGDDIVPYSRA-----TWAPMARSTS---RN----YYSPGSANLYFG 265
Query: 311 GQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKN----RNYTR 366
G+ + V + + DSG++FT+ + + ++ L D + KN +++
Sbjct: 266 GRPLGVRPMEV----------VFDSGSSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSL 315
Query: 367 ALGAEALTGLRPCFDVPGEKTGSFPELKLHFKGG--AEVTLPVENYFAVVGEGSAVCLTV 424
L + + DV E F + L F G A + +P ENY V G+A CL +
Sbjct: 316 PLCWKGKKPFKSVLDVKKE----FKTVVLSFSNGKKALMEIPPENYLIVTKYGNA-CLGI 370
Query: 425 VTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ E I+G+ MQ+ V YD ++G+ + C
Sbjct: 371 LNGSEVGLKDLNIVGDITMQDQMVIYDNERGQIGWIRAPC 410
>gi|356509399|ref|XP_003523437.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 421
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 94/403 (23%), Positives = 159/403 (39%), Gaps = 74/403 (18%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLG 146
G Y++SL+ G PP++ +DTGS L W C CK C+ + + P L+
Sbjct: 62 GYYTVSLAIGNPPKVYDLDIDTGSDLTWVQCD--APCKGCTLPRNRLYKPH----GDLVK 115
Query: 147 CQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGS-GLTEGIALSETLNLPN 205
C +P C+ I N ++ C Y V Y G + G+ L + N+P
Sbjct: 116 CVDPLCAAIQSAP------NHHCAGPNEQC-----DYEVEYADQGSSLGVLLRD--NIPL 162
Query: 206 RII------PNFLVGCSVLSSRQ-------PAGIAGFGRGKTSLPSQLN-----LDKFSY 247
+ P GC + AG+ G G G+TS+ SQL+ + +
Sbjct: 163 KFTNGSLARPMLAFGCGYDQTHHGQNPPPSTAGVLGLGNGRTSILSQLHSLGLIRNVVGH 222
Query: 248 CLLSHKFDDTTRTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRI 307
CL LI +G+ +TP + + S + +
Sbjct: 223 CLSGRGGGFLFFGDQLI----------PPSGVVWTPLLQSSSAQHYKTGPADLFFDRKTT 272
Query: 308 TVGGQRVRVWHKYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRA 367
+V G + I DSG+++T+ + + L + ++ ++ + +RA
Sbjct: 273 SVKGLEL-----------------IFDSGSSYTYFNSQAHKALVN-LIANDLRGKPLSRA 314
Query: 368 LGAEAL----TGLRPCFDVPGEKTGSFPELKLHF--KGGAEVTLPVENYFAVVGEGSAVC 421
G +L G +P F + T +F L L F + + LP E Y V G+ VC
Sbjct: 315 TGDPSLPICWKGPKP-FKSLHDVTSNFKPLLLSFTKSKNSPLQLPPEAYLIVTKHGN-VC 372
Query: 422 LTVVTDREASGGPSIILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
L ++ E G + I+G+ +Q+ V YD Q++G+ C
Sbjct: 373 LGILDGTEIGLGNTNIIGDISLQDKLVIYDNEKQQIGWASANC 415
>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 396
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 90/383 (23%), Positives = 144/383 (37%), Gaps = 61/383 (15%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y + L GTPP I I+DTGS + W C C +C P F P SS+ + C
Sbjct: 65 YLMKLQVGTPPFEIQAIIDTGSEITWTQC---LPCVHCYEQNAPIFDPSKSSTFKEKRCD 121
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
C + + D + G+ TE I L T P ++
Sbjct: 122 GHSCPY----EVDYFD------------------HTYTMGTLATETITLHSTSGEP-FVM 158
Query: 209 PNFLVGCSVLSSR-QPA--GIAGFGRGKTSLPSQLNLDK---FSYCLLSHKFDDTTRTSS 262
P ++GC +S +P+ G+ G G +SL +Q+ + SYC
Sbjct: 159 PETIIGCGHNNSWFKPSFSGMVGLNWGPSSLITQMGGEYPGLMSYCFSGQ---------- 208
Query: 263 LILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLT 322
G+S + + V + ++ A +YY+ L ++VG R+ T
Sbjct: 209 -----GTSKINFGANAIVAGDGVVSTTMFMTTAKPGFYYLNLDAVSVGNTRIETMG---T 260
Query: 323 LDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDV 382
G ++DSGTT T+ P + L + V +V G + L C++
Sbjct: 261 TFHALEGNIVIDSGTTLTYF-PVSYCNLVRQAVEHVVTAVRAADPTGNDML-----CYN- 313
Query: 383 PGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIILGNFQ 442
+ FP + +HF GG ++ L N + G CL ++ + I GN
Sbjct: 314 -SDTIDIFPVITMHFSGGVDLVLDKYNMYMESNNGGVFCLAIICNSPTQ---EAIFGNRA 369
Query: 443 MQNYYVEYDLRNQRLGFKQQLCK 465
N+ V YD + + F C
Sbjct: 370 QNNFLVGYDSSSLLVSFSPTNCS 392
>gi|21717166|gb|AAM76359.1|AC074196_17 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433306|gb|AAP54835.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|125575546|gb|EAZ16830.1| hypothetical protein OsJ_32301 [Oryza sativa Japonica Group]
Length = 373
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 97/391 (24%), Positives = 156/391 (39%), Gaps = 64/391 (16%)
Query: 89 YSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKYCSSSKIPSFIPKLSSSSRLLGCQ 148
Y +L+ GTPPQ I+ VW C+ C+ C +P F SS+ R C
Sbjct: 28 YMANLTIGTPPQPASAIIHLAGEFVWTQCS---PCRRCFKQDLPLFNRSASSTYRPEPCG 84
Query: 149 NPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYGSGLTEGIALSETLNLPNRII 208
C ES+ C+ + +C SY V G T GI ++T +
Sbjct: 85 TALC-----ESVPASTCSGD---------GVC-SYEVETMFGDTSGIGGTDTFAI-GTAT 128
Query: 209 PNFLVGCSVLSSRQ----PAGIAGFGRGKTSLPSQLNLDKFSYCLLSHKFDDTTRTSSLI 264
+ GC++ S+ + +G+ G GR SL Q+N FSYCL H + S+L+
Sbjct: 129 ASLAFGCAMDSNIKQLLGASGVVGLGRTPWSLVGQMNATAFSYCLAPHG--AAGKKSALL 186
Query: 265 LDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWHKYLTLD 324
L + + K+ TP VN + S Y + L I G + +
Sbjct: 187 LGASAKLAGGKSAA--TTPLVNT------SDDSSDYMIHLEGIKFGD---------VIIA 229
Query: 325 RDGNGGTI-VDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRPCFDVP 383
NG + VD+ +F+ F+ + + T A+GA + FD+
Sbjct: 230 PPPNGSVVLVDTIFGVSFLVDAAFQAI----------KKAVTVAVGAAPMATPTKPFDLC 279
Query: 384 GEKTGS---------FPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGP 434
K + P++ L F+G A +T+P Y G G+ VCL +++ +
Sbjct: 280 FPKAAAAAGANSSLPLPDVVLTFQGAAALTVPPSKYMYDAGNGT-VCLAMMSSAMLNLTT 338
Query: 435 SI-ILGNFQMQNYYVEYDLRNQRLGFKQQLC 464
+ ILG +N + +DL + L F+ C
Sbjct: 339 ELSILGRLHQENIHFLFDLDKETLSFEPADC 369
>gi|195638734|gb|ACG38835.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 465
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 105/386 (27%), Positives = 156/386 (40%), Gaps = 54/386 (13%)
Query: 87 GGYSISLSFGTPPQIIPFILDTGSHLVWFPCTNHYQCKY-CSSSKIPSFIPKLSSSSRLL 145
G Y + GTP + ++DTGS L W C+ C C P F PK SSS +
Sbjct: 125 GNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCS---PCVVSCHRQSGPVFNPKASSSYASV 181
Query: 146 GCQNPKCSWIHHESIQCRDCNDEPLATSKNCTQICPSYLVLYG-SGLTEGIALSETLNLP 204
C +CS + ++ C+ TS C Y YG S + G +T++
Sbjct: 182 SCSAQQCSDLTTATLNPASCS-----TSNVCI-----YQASYGDSSFSVGYLSKDTVSFG 231
Query: 205 NRIIPNFLVGCSVLSS---RQPAGIAGFGRGKTSLPSQLNLD---KFSYCLLSHKFDDTT 258
+ +PNF GC + Q AG+ G R K SL QL FSYCL + +
Sbjct: 232 STSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSSG 291
Query: 259 RTSSLILDNGSSHSDKKTTGLTYTPFVNNPSVAERNAFSVYYYVGLRRITVGGQRVRVWH 318
S + G +YTP +A + Y++ + I V G+ + V
Sbjct: 292 YLSIGSYNPGQ---------YSYTP------MASSSLDDSLYFIKMTGIKVAGKPLSVSS 336
Query: 319 KYLTLDRDGNGGTIVDSGTTFTFMAPELFEPLADEFVSQMVKNRNYTRALGAEALTGLRP 378
+ TI+DSGT T + ++ L+ M + R A A + L
Sbjct: 337 SAYSSLP-----TIIDSGTVITRLPTGVYSALSKAVAGAM---KGTPR---ASAFSILDT 385
Query: 379 CFDVPGEKTGSFPELKLHFKGGAEVTLPVENYFAVVGEGSAVCLTVVTDREASGGPSIIL 438
CF + PE+ + F GGA + L N V + + CL R A+ I+
Sbjct: 386 CFQGQAARL-RVPEVTMAFAGGAALKLAARNLLVDV-DSATTCLAFAPARSAA-----II 438
Query: 439 GNFQMQNYYVEYDLRNQRLGFKQQLC 464
GN Q Q + V YD++N ++GF C
Sbjct: 439 GNTQQQTFSVVYDVKNSKIGFAAAGC 464
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.134 0.407
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,464,278,262
Number of Sequences: 23463169
Number of extensions: 323153872
Number of successful extensions: 1304202
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 757
Number of HSP's successfully gapped in prelim test: 2084
Number of HSP's that attempted gapping in prelim test: 1292431
Number of HSP's gapped (non-prelim): 7151
length of query: 465
length of database: 8,064,228,071
effective HSP length: 146
effective length of query: 319
effective length of database: 8,933,572,693
effective search space: 2849809689067
effective search space used: 2849809689067
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 79 (35.0 bits)