BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 031892
(151 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224098489|ref|XP_002311193.1| predicted protein [Populus trichocarpa]
gi|222851013|gb|EEE88560.1| predicted protein [Populus trichocarpa]
Length = 150
Score = 150 bits (380), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 95/158 (60%), Positives = 113/158 (71%), Gaps = 15/158 (9%)
Query: 1 MENKRQESASSSSSPINFDHLFGPKDSSSSATT--GGFGSIFSPPTAV-QGRGSSHSGSI 57
MEN RQ ASS FDHLFGPKDSSSS++ G FGSIF PP+ V GR S +G+
Sbjct: 1 MENNRQVGASS------FDHLFGPKDSSSSSSASSGIFGSIFPPPSKVPAGRDSGTTGNH 54
Query: 58 IGS---LRKQGESSKG-GESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKA 113
+G+ + + K GES+ + K +SS+YQ+E+ EPCYFSSSIYYGGQENYSPRTK
Sbjct: 55 VGNETYVNPDNATRKAKGESSGISGKGQSSVYQNETPEPCYFSSSIYYGGQENYSPRTK- 113
Query: 114 TTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
+ES HV KKD +DDPNGN+ NSASRGNWWQGSLYY
Sbjct: 114 NSES-QHVFKKDYGKDDPNGNDPNSASRGNWWQGSLYY 150
>gi|255566540|ref|XP_002524255.1| conserved hypothetical protein [Ricinus communis]
gi|223536532|gb|EEF38179.1| conserved hypothetical protein [Ricinus communis]
Length = 149
Score = 143 bits (360), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 96/159 (60%), Positives = 109/159 (68%), Gaps = 18/159 (11%)
Query: 1 MENKRQESASSSSSPINFDHLFGPKDSSSSATTGG--FGSIFSPPTAVQGR------GSS 52
MENKRQ ASS DHLFGPKDSSSS+++ FGSIFSPP+ GR G+
Sbjct: 1 MENKRQTGASS------LDHLFGPKDSSSSSSSSSSVFGSIFSPPSMGMGRDSAGIMGNQ 54
Query: 53 HSGSIIGSLRKQGESSKGGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTK 112
S G +S+KG S I KD SS YQ+E+AEPCYFSSSIYYGGQENYSPRT+
Sbjct: 55 AGNSKYGIPDNVTQSNKGSNSGIM-GKDSSSFYQNETAEPCYFSSSIYYGGQENYSPRTR 113
Query: 113 ATTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
S HV+KKD +DDPNGN+SNSASRGNWWQGSLYY
Sbjct: 114 ---NSESHVLKKDDKKDDPNGNSSNSASRGNWWQGSLYY 149
>gi|296086145|emb|CBI31586.3| unnamed protein product [Vitis vinifera]
Length = 160
Score = 142 bits (358), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 96/166 (57%), Positives = 112/166 (67%), Gaps = 21/166 (12%)
Query: 1 MENKRQESASSSSSPINFDHLFGPKDSSSSATTGGFGS--IFSPPTAVQGRGSSHSGSII 58
MEN +Q +SSS I D LFGPKDSSSS++ IF PP++V GR SSH+G+I
Sbjct: 1 MENTKQVGSSSS---ITSD-LFGPKDSSSSSSLSSGLFGSIFQPPSSVPGRNSSHTGNI- 55
Query: 59 GSLRKQGESSK------GGESNIFN------SKDKSSIYQSESAEPCYFSSSIYYGGQEN 106
GS RKQG ++ G NI SKD+ SIY +E+ EPCYFSSSIYYGGQE
Sbjct: 56 GSFRKQGSGNQVGNGKHGTPDNITQICSGESSKDRGSIYHNETVEPCYFSSSIYYGGQEV 115
Query: 107 YSPRTKATTESHHHV-VKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
YSP + TESH V +KKDG EDD NG+NSNSASRGNWWQGSLYY
Sbjct: 116 YSPNAR-NTESHPIVTMKKDGGEDDLNGSNSNSASRGNWWQGSLYY 160
>gi|224112647|ref|XP_002316250.1| predicted protein [Populus trichocarpa]
gi|222865290|gb|EEF02421.1| predicted protein [Populus trichocarpa]
Length = 165
Score = 127 bits (319), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 83/156 (53%), Positives = 100/156 (64%), Gaps = 18/156 (11%)
Query: 1 MENKRQESASSSSSPINFDHLFGPKDSSSSATT--GGFGSIFSPPTAV-QGRGS----SH 53
MENKRQ ASS FDHLFGPKDSSSS+ G F SIF P+ V GR S +H
Sbjct: 1 MENKRQVVASS------FDHLFGPKDSSSSSPASSGIFESIFPHPSKVPAGRDSGVMENH 54
Query: 54 SGSIIGSLRKQGESSKGGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKA 113
G + R GE + K KS ++Q+E+ EPCY SSSIYYGGQENYSPRTK
Sbjct: 55 GGRGKYANRDNVAQKAKGEGS---GKGKSVVFQNETPEPCYLSSSIYYGGQENYSPRTKN 111
Query: 114 TTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQGSL 149
+ + HV KKD ++++PNGN+ NSASRGNWWQG +
Sbjct: 112 S--EYQHVFKKDDEQENPNGNDPNSASRGNWWQGIM 145
>gi|255558924|ref|XP_002520485.1| conserved hypothetical protein [Ricinus communis]
gi|223540327|gb|EEF41898.1| conserved hypothetical protein [Ricinus communis]
Length = 160
Score = 126 bits (316), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 71/144 (49%), Positives = 92/144 (63%), Gaps = 14/144 (9%)
Query: 21 LFGPKDSSSSATTGGFGSIFSPPTAVQGRGSSHSGSIIGSLRKQGESSKG---------- 70
LFG K+S S++TG F SIF PP+ V GR SS S +IGS ++Q ++G
Sbjct: 18 LFGTKESPPSSSTGIFASIFPPPSKVLGRKSSGS-EVIGSWQRQSSENQGWNTKRGAPAM 76
Query: 71 -GESNIFN--SKDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHHHVVKKDGD 127
E+ +N +KD++S E EPC+ SSS+YYGGQ+NYS + + + KKDG
Sbjct: 77 SSEAASYNMPNKDRNSTLMEERGEPCHLSSSLYYGGQDNYSQPPSSRNSGSYPIFKKDGV 136
Query: 128 EDDPNGNNSNSASRGNWWQGSLYY 151
+DDPNGNNSN ASRGNWWQGSLYY
Sbjct: 137 DDDPNGNNSNGASRGNWWQGSLYY 160
>gi|225434778|ref|XP_002281905.1| PREDICTED: uncharacterized protein LOC100263768 [Vitis vinifera]
gi|297745986|emb|CBI16042.3| unnamed protein product [Vitis vinifera]
Length = 154
Score = 123 bits (308), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 77/165 (46%), Positives = 96/165 (58%), Gaps = 25/165 (15%)
Query: 1 MENKRQESASSSSSPINFDHLFGPKDSSSSATTGGFGSIFSPPTAVQGRGSSHSGSIIGS 60
ME +RQ +SSS + LFG K+S S ++G F SIF PP+ GR ++S
Sbjct: 1 MEGRRQVGSSSSFT----SELFGTKESPPSNSSGIFASIFPPPSTAVGRNPANS-----E 51
Query: 61 LR----------KQGESSKG----GESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQEN 106
+R KQG G G S+I +KDK+ I+Q E EPC+ SSS+YYGGQ+
Sbjct: 52 MRGLSGNQTWDTKQGTPDMGKSGVGASSIIPNKDKNQIFQEERVEPCHLSSSLYYGGQDI 111
Query: 107 YSPRTKATTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
YS + T + V KK DEDDPNGNNS ASRGNWWQGSLYY
Sbjct: 112 YSNSSSTQTSGSYPVFKK--DEDDPNGNNSQGASRGNWWQGSLYY 154
>gi|224106371|ref|XP_002314144.1| predicted protein [Populus trichocarpa]
gi|222850552|gb|EEE88099.1| predicted protein [Populus trichocarpa]
Length = 162
Score = 119 bits (297), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 66/143 (46%), Positives = 85/143 (59%), Gaps = 13/143 (9%)
Query: 21 LFGPKDSSSSATTGGFGSIFSPPTAVQGRGSSHSGSIIGSLRKQGESSKG---------- 70
LFG +S+ ++ G F S+F PP+ V GR SS S + GS +KQ ++
Sbjct: 21 LFGTTESAPVSSAGIFASMFPPPSTVVGRKSSGS-EVTGSWQKQSYGNQAWYPKQGSPAN 79
Query: 71 --GESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHHHVVKKDGDE 128
S KD++S+ Q E EPC+ SSS+YYGGQENYS + + KKDG E
Sbjct: 80 SQAASYSMPDKDRNSVIQEERVEPCHLSSSLYYGGQENYSQSPSTQMAGSYPIFKKDGGE 139
Query: 129 DDPNGNNSNSASRGNWWQGSLYY 151
DDPNG+N +SASRGNWWQGSLYY
Sbjct: 140 DDPNGSNPHSASRGNWWQGSLYY 162
>gi|356569336|ref|XP_003552858.1| PREDICTED: uncharacterized protein LOC100797772 [Glycine max]
Length = 161
Score = 114 bits (284), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 85/157 (54%), Positives = 110/157 (70%), Gaps = 13/157 (8%)
Query: 1 MENKRQESASSSSSP--INFDHLFGPKDSSSSATTGG--FGSIFSPPTAVQGRGSSHSGS 56
MEN +++ SSS NFDHLFGPKD S+++++ FGSIF PP+ V GR S+ +
Sbjct: 1 MENNKKKVGGGSSSSSFTNFDHLFGPKDPSTTSSSSTSIFGSIFPPPSTVGGRDSTKQDT 60
Query: 57 ------IIGSLRKQGESSKGGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPR 110
G+ +GESS G SN N+ ++ +Q+E+ EP Y+SSSI+YGGQENYSPR
Sbjct: 61 GCKNYGAPGNYSYKGESSASGVSNN-NNTTSTNYHQNETVEPSYYSSSIHYGGQENYSPR 119
Query: 111 TKATTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQG 147
T+ TTES HH+ KKD D+DDPNGN+SNSASRGNWWQG
Sbjct: 120 TR-TTES-HHIFKKDKDDDDPNGNDSNSASRGNWWQG 154
>gi|356537962|ref|XP_003537475.1| PREDICTED: uncharacterized protein LOC100795477 [Glycine max]
Length = 203
Score = 103 bits (256), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 85/160 (53%), Positives = 107/160 (66%), Gaps = 14/160 (8%)
Query: 1 MENKRQESAS---SSSSPINFDHLFGPKDSSSSATTGG--FGSIFSPP-TAVQGRGSSHS 54
MEN +++ SSSS NFDHLFGPKD S+++++ FGSIF PP + V GR S+
Sbjct: 1 MENNKKKVVGGGPSSSSFTNFDHLFGPKDPSTTSSSSTSIFGSIFPPPPSTVGGRDSTKQ 60
Query: 55 GSIIGSLRKQGESSKGGESN----IFNSKDKSSI---YQSESAEPCYFSSSIYYGGQENY 107
+ G S GES+ + NSK+ +S +Q+E+ EP Y+SSSIYYGGQENY
Sbjct: 61 DMGCKNYGAPGNYSYKGESSASGVVSNSKNTTSTNYHHQNETVEPSYYSSSIYYGGQENY 120
Query: 108 SPRTKATTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQG 147
SPRT+ TTESHH K D+DDPNGN+SNSASRGNWWQG
Sbjct: 121 SPRTR-TTESHHSFKKDKKDDDDPNGNDSNSASRGNWWQG 159
>gi|357505741|ref|XP_003623159.1| hypothetical protein MTR_7g065150 [Medicago truncatula]
gi|355498174|gb|AES79377.1| hypothetical protein MTR_7g065150 [Medicago truncatula]
gi|388492720|gb|AFK34426.1| unknown [Medicago truncatula]
Length = 156
Score = 102 bits (253), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 71/162 (43%), Positives = 90/162 (55%), Gaps = 17/162 (10%)
Query: 1 MENKRQESASSSSSPINFDHLFGPKDSSSSATTGGFGSIFSPPTAVQGRGSSHSGSIIGS 60
ME ++Q +SSS + LFG D S+ +G F SIFSP + V GR S S ++ G
Sbjct: 1 MEGRKQFGSSSSETSSFTSELFGSNDLHKSSASGIFDSIFSPSSKVFGRESLRS-AMNGK 59
Query: 61 LRKQGESSK-----------GGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSP 109
+G +SK GE+ +KD S IYQ + +PC+ SSSIYYGGQ+ YS
Sbjct: 60 TATEGSNSKIDTQEYISKGNDGETQNKTNKDMSYIYQEQRVQPCHLSSSIYYGGQDVYSR 119
Query: 110 RTKATTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
A + + KKDG EDD S SASRGNWWQGSLYY
Sbjct: 120 PQSARDSGLNTMYKKDGGEDD-----SGSASRGNWWQGSLYY 156
>gi|357492467|ref|XP_003616522.1| hypothetical protein MTR_5g081360 [Medicago truncatula]
gi|355517857|gb|AES99480.1| hypothetical protein MTR_5g081360 [Medicago truncatula]
Length = 145
Score = 102 bits (253), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 77/147 (52%), Positives = 99/147 (67%), Gaps = 7/147 (4%)
Query: 3 NKRQESASSSSSPINFDHLFGPKDSSSSATTGGFGSIFSPPT--AVQGRGSSHSGSIIGS 60
NK+Q +SSSSS + FD LFGPKD SS++++ FGSIF PP +V+GRGS +
Sbjct: 4 NKKQAWSSSSSSTVKFDQLFGPKDPSSASSSSLFGSIFPPPPTPSVEGRGSRTQEVGSKN 63
Query: 61 LRKQGESSKGGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHHH 120
L G + G I +K+ + YQ+E+ EP Y+SSSIYYGGQENYSPR + T HH
Sbjct: 64 LGATGTTPSDG---ISYNKNTCTNYQNETMEPSYYSSSIYYGGQENYSPRNRTTEP--HH 118
Query: 121 VVKKDGDEDDPNGNNSNSASRGNWWQG 147
V KKD + D NGNNS+SASRG+WW+G
Sbjct: 119 VFKKDKNHGDHNGNNSSSASRGDWWEG 145
>gi|297816874|ref|XP_002876320.1| hypothetical protein ARALYDRAFT_485995 [Arabidopsis lyrata subsp.
lyrata]
gi|297322158|gb|EFH52579.1| hypothetical protein ARALYDRAFT_485995 [Arabidopsis lyrata subsp.
lyrata]
Length = 139
Score = 99.0 bits (245), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 71/152 (46%), Positives = 93/152 (61%), Gaps = 17/152 (11%)
Query: 2 ENKRQESASSSSSPINFDHLFGPK--DSSSSATTGGFGSIFSPPTAVQGRGSSHSGSIIG 59
+NK+++ S+SS FDH+FGP+ SSSS+ TG F SIF PP+A Q S G
Sbjct: 3 KNKKKKIVSASS----FDHIFGPRVSSSSSSSATGLFKSIFPPPSADQLGRQVDFASQGG 58
Query: 60 SLRKQGESSKGGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHH 119
++ Q + +G SN K+K S Y E+ PC+ SSS+YYGGQE YS T T ++ +
Sbjct: 59 HVKYQSPNERGERSN---KKEKKSYYNEETEPPCHLSSSLYYGGQEKYSSTTTITHDTTY 115
Query: 120 HVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
KKDG+E D S SASRGNWW+GSLYY
Sbjct: 116 ---KKDGEEGD-----SKSASRGNWWEGSLYY 139
>gi|449460363|ref|XP_004147915.1| PREDICTED: uncharacterized protein LOC101215701 [Cucumis sativus]
Length = 158
Score = 96.7 bits (239), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 72/164 (43%), Positives = 94/164 (57%), Gaps = 19/164 (11%)
Query: 1 MENKRQESASSSSSPINFDHLFGPKDSSSSATTGGFGSIFSPPTAVQGRGS--SHSGSI- 57
ME K+ SSSS + D LFG ++S S+TTG FGSIF+P + V GR S SH+
Sbjct: 1 MEGKKHVGLGSSSSSLTTD-LFGSNETSYSSTTGIFGSIFAPSSKVLGRESLLSHTKERE 59
Query: 58 IGSLRKQGESSKGGESNIFN----------SKDKSSIYQSESAEPCYFSSSIYYGGQENY 107
S+ + + G + + N +KD SSIYQ + A+PC+ SSSIYYGGQ+ Y
Sbjct: 60 RNSVNEPWNPNAGAQDDNANHTQKESQETKNKDMSSIYQDQRAQPCHLSSSIYYGGQDVY 119
Query: 108 SPRTKATTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
+ + + KK+G EDD S SASRGNWWQGSLYY
Sbjct: 120 THPQNSYNSGANSAYKKEGGEDD-----SGSASRGNWWQGSLYY 158
>gi|79315178|ref|NP_001030865.1| uncharacterized protein [Arabidopsis thaliana]
gi|98961793|gb|ABF59226.1| unknown protein [Arabidopsis thaliana]
gi|332645894|gb|AEE79415.1| uncharacterized protein [Arabidopsis thaliana]
Length = 146
Score = 90.9 bits (224), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 65/136 (47%), Positives = 81/136 (59%), Gaps = 14/136 (10%)
Query: 18 FDHLFGPK--DSSSSATTGGFGSIFSPPTAVQGRGSSHSGSIIGSLRKQGESSKGGESNI 75
FDH+FGP+ SSSS+ TG F SIF PP+A Q S G ++ Q ++KG SN
Sbjct: 23 FDHIFGPRVSSSSSSSATGLFKSIFPPPSADQLGRQVDFASQGGHVKYQSPNAKGERSN- 81
Query: 76 FNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHHHVVKKDGDEDDPNGNN 135
K+K S Y E+ PC+ SSS+YYGGQE +TT + H KKDG+E D
Sbjct: 82 --KKEKKSYYNEETEPPCHLSSSLYYGGQET----YSSTTTTTHDTYKKDGEEGD----- 130
Query: 136 SNSASRGNWWQGSLYY 151
S ASRGNWW+GSLYY
Sbjct: 131 SKRASRGNWWEGSLYY 146
>gi|388494636|gb|AFK35384.1| unknown [Lotus japonicus]
Length = 153
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 68/162 (41%), Positives = 86/162 (53%), Gaps = 20/162 (12%)
Query: 1 MENKRQESASSSSSPINFDHLFGPKDSSSSATTGGFGSIFSPPT-AVQGRGSSHSGSIIG 59
M+ K+Q+ SSS + LFG K+S S ++G FGSIFSPP+ V GR S S +
Sbjct: 1 MDEKKQKGTSSSFTA----ELFGSKESHPSPSSGIFGSIFSPPSPKVLGRESLRSELSVK 56
Query: 60 SLRK--------QGESSKG--GESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSP 109
+ + Q + SKG GE+ +KD S IYQ + PC SSSIYYGGQ+ Y
Sbjct: 57 TANETWSSKIGIQDDVSKGNYGEAQHTENKDMSFIYQHQRIPPCPLSSSIYYGGQDTYYH 116
Query: 110 RTKATTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
+ + K D DD S ASRGNWWQGSLYY
Sbjct: 117 SQSTQNAGSNTLHKNDVGNDD-----SGMASRGNWWQGSLYY 153
>gi|449455074|ref|XP_004145278.1| PREDICTED: uncharacterized protein LOC101214982 [Cucumis sativus]
gi|449473730|ref|XP_004153966.1| PREDICTED: uncharacterized protein LOC101213402 [Cucumis sativus]
gi|449510713|ref|XP_004163741.1| PREDICTED: uncharacterized protein LOC101228934 [Cucumis sativus]
Length = 139
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 59/135 (43%), Positives = 72/135 (53%), Gaps = 16/135 (11%)
Query: 21 LFGPKDSSSSATTGGFGSIFSPPTAVQGRGSSHSGSIIGSLRKQGESSKGGESNIFNSKD 80
LF K++ S+ +G F +IF P GR SS S + KQ S+ + NS
Sbjct: 17 LFDVKEAPLSSASGAFATIFPSPQKGAGRNSSSSVDWL----KQTNGSQPHHTRQGNSGG 72
Query: 81 KSSIYQSESAEPCYFSSSIYYGGQENYSPRTKA----TTESHHHVVKKDGDEDDPNGNNS 136
S EPC+ SSS+YYGGQ+ YS T A H + K G +DDPNGNNS
Sbjct: 73 --------SLEPCHLSSSLYYGGQDGYSQATSAGPSPLPPPPHTMKKSGGQQDDPNGNNS 124
Query: 137 NSASRGNWWQGSLYY 151
ASRGNWWQGSLYY
Sbjct: 125 QPASRGNWWQGSLYY 139
>gi|357481999|ref|XP_003611285.1| hypothetical protein MTR_5g012280 [Medicago truncatula]
gi|355512620|gb|AES94243.1| hypothetical protein MTR_5g012280 [Medicago truncatula]
Length = 144
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 68/153 (44%), Positives = 92/153 (60%), Gaps = 11/153 (7%)
Query: 1 MENKRQESASSSSS--PINFDHLFGPKDSSSSATTGGFGSIFSPPTAVQGRGSSHSGSII 58
M+N + ++ SSS+S +NF+ LF KDS++ A+ F P +V+G+GS +
Sbjct: 1 MDNNKNQTCSSSNSSSTVNFNQLFSSKDSTT-ASFSSFFDSLFPQPSVEGKGSRTEE--V 57
Query: 59 GSLRKQGESSKGGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESH 118
GS K E+ S+I N S+ YQ+E+ EP YFSSSI+YGG E+Y +T E
Sbjct: 58 GS--KSLEAPGTPNSDISNKNTNSTDYQNETVEPTYFSSSIHYGGCEDYFSE-GSTIEPR 114
Query: 119 HHVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
H +GD NGNNSNSASRG+WWQGSLYY
Sbjct: 115 HVKYTNNGD---LNGNNSNSASRGDWWQGSLYY 144
>gi|42571137|ref|NP_973642.1| uncharacterized protein [Arabidopsis thaliana]
gi|28207122|gb|AAO37201.1| hypothetical protein [Arabidopsis thaliana]
gi|330254647|gb|AEC09741.1| uncharacterized protein [Arabidopsis thaliana]
Length = 142
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 82/143 (57%), Gaps = 27/143 (18%)
Query: 18 FDHLFGPK--DSSSSATTGGFGSIFSPPTAV-QGRGSSHSGSI------IGSLRKQGESS 68
DH+FGP+ S SS+TTG F SIF PP+AV QG +S +G+ + ++GE S
Sbjct: 18 LDHIFGPRVSHSYSSSTTGLFKSIFPPPSAVTQGNLTSRNGAAKYQPTNFETPNERGERS 77
Query: 69 KGGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHHHVVKKDGDE 128
K +K++ S E+ PC SSSIYYGGQ+NYS T ++ KKDG+E
Sbjct: 78 K--------NKERKSYQSEETQPPCNLSSSIYYGGQDNYSSST-----TNPDAYKKDGEE 124
Query: 129 DDPNGNNSNSASRGNWWQGSLYY 151
D S SASRGNWW+GS Y
Sbjct: 125 GD-----SESASRGNWWEGSFNY 142
>gi|255586976|ref|XP_002534085.1| conserved hypothetical protein [Ricinus communis]
gi|223525878|gb|EEF28301.1| conserved hypothetical protein [Ricinus communis]
Length = 146
Score = 89.4 bits (220), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 75/157 (47%), Positives = 94/157 (59%), Gaps = 17/157 (10%)
Query: 1 MENKRQESASSSSSPINFDHLFGPKDSSSSATTGGFGSIFSPPTAVQGRGSSHSGSIIGS 60
ME K+Q+ SSSSS + LFG KDSSSS++ +PP V GRGS S ++G
Sbjct: 1 MEGKKQQLGSSSSSSFTSE-LFGSKDSSSSSSGIFGSIF-APPPKVLGRGSLRS-EMVG- 56
Query: 61 LRKQ----GESSKGGESNIFN--SKDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKAT 114
+KQ E+SK ES N S+D S YQ + +PC+ SSSIYYGGQ+ Y + +
Sbjct: 57 -KKQDSANAETSKSNESEGQNIPSRDTGSFYQDQKVQPCHLSSSIYYGGQDIYH-HPQNS 114
Query: 115 TESHHHVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
S H + KK+G EDD S SASRGNWWQGSLYY
Sbjct: 115 QSSSHSMFKKEGTEDD-----SGSASRGNWWQGSLYY 146
>gi|15241665|ref|NP_195822.1| uncharacterized protein [Arabidopsis thaliana]
gi|7340676|emb|CAB82975.1| putative protein [Arabidopsis thaliana]
gi|16648748|gb|AAL25566.1| AT5g02020/T7H20_70 [Arabidopsis thaliana]
gi|20334906|gb|AAM16209.1| AT5g02020/T7H20_70 [Arabidopsis thaliana]
gi|332003037|gb|AED90420.1| uncharacterized protein [Arabidopsis thaliana]
Length = 149
Score = 87.8 bits (216), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 58/138 (42%), Positives = 79/138 (57%), Gaps = 16/138 (11%)
Query: 20 HLFGPKDSSSSATTGG-FGSIFSPPTAVQGRGSSHSGSIIGSLRKQGESSKGGESNIFNS 78
LFG +++ SS ++ G GSIF PP+ V GR S ++ G + S GG N+ +
Sbjct: 22 ELFGSRENPSSPSSSGILGSIFPPPSKVLGRESVRQETVTGGCWNEKTSKTGG--NVDRN 79
Query: 79 KDK-----SSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHHHVVKKDGDEDDPNG 133
+++ S Q + +PC+ SSSIYYGG + Y +T+ S + KKDG EDD
Sbjct: 80 REQQENHGSGYQQDQRVQPCHLSSSIYYGGPDVYFQPQNSTSNSTN---KKDGGEDD--- 133
Query: 134 NNSNSASRGNWWQGSLYY 151
S SASRGNWWQGSLYY
Sbjct: 134 --SGSASRGNWWQGSLYY 149
>gi|209967469|gb|ACJ02359.1| hypothetical protein [Populus tremula x Populus alba]
Length = 157
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 64/146 (43%), Positives = 79/146 (54%), Gaps = 28/146 (19%)
Query: 21 LFGPKDSSSSATTGGFGSIFSPPT-AVQGRGS------------------SHSGSIIGSL 61
LFG K+ SSS+T G FGSIF+P + V GR S + SG+ L
Sbjct: 18 LFGSKEYSSSST-GIFGSIFAPASPKVLGRESLRFEVDEKKQDSADDAWNTKSGTPTSDL 76
Query: 62 RKQGESSKGGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHHHV 121
+ GES +KD SSIYQ + +PC+ SSSIYYGGQ+ Y A T S + +
Sbjct: 77 TSKMNE---GESQSIPNKDMSSIYQEQRVQPCHLSSSIYYGGQDIYHHPQTAHTSSKNPM 133
Query: 122 VKKDGDEDDPNGNNSNSASRGNWWQG 147
KKDG EDD + SASRGNWWQG
Sbjct: 134 FKKDGPEDD-----TGSASRGNWWQG 154
>gi|388492094|gb|AFK34113.1| unknown [Medicago truncatula]
Length = 172
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 64/157 (40%), Positives = 85/157 (54%), Gaps = 18/157 (11%)
Query: 1 MENKRQESASSSSSPINFDHLFGPKDSSSSATTGGFGSIFSPPTAVQGRGSSHSGSIIGS 60
ME ++Q +S S + LFG D S+ +G F SIFSP + V GR S S ++ G
Sbjct: 1 MEGRKQFGSSFSETSSFTSELFGSNDLHKSSASGIFDSIFSPSSKVFGRESLRS-AMNGK 59
Query: 61 LRKQGESSK-----------GGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYS- 108
+G +SK GE+ +KD S IYQ + +PC+ SSSIYYGGQ+ YS
Sbjct: 60 TATEGSNSKIDTQEYISKGNDGETQNKTNKDMSYIYQEQRVQPCHLSSSIYYGGQDVYSR 119
Query: 109 PRTKATTESHHHVVKKDGDEDDPNGNNSNSASRGNWW 145
P++ + + KKDG EDD S SASRGNWW
Sbjct: 120 PQSARDSGLNTMQYKKDGGEDD-----SGSASRGNWW 151
>gi|449464468|ref|XP_004149951.1| PREDICTED: cucumisin-like [Cucumis sativus]
Length = 866
Score = 87.0 bits (214), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 47/83 (56%), Positives = 61/83 (73%), Gaps = 6/83 (7%)
Query: 65 GESSKGGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHHHVVKK 124
G+ GG S KD+SS+YQ+E+ EP YFSSSI+YGGQENYSPRT A ++SH + KK
Sbjct: 764 GKKGTGGASG----KDESSMYQNETMEPSYFSSSIFYGGQENYSPRTNA-SQSHPN-FKK 817
Query: 125 DGDEDDPNGNNSNSASRGNWWQG 147
+ ++D N +NSN ASRGNWW+
Sbjct: 818 EVKDNDANESNSNPASRGNWWKA 840
>gi|449516501|ref|XP_004165285.1| PREDICTED: cucumisin-like, partial [Cucumis sativus]
Length = 795
Score = 86.7 bits (213), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 47/82 (57%), Positives = 61/82 (74%), Gaps = 6/82 (7%)
Query: 65 GESSKGGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHHHVVKK 124
G+ GG S KD+SS+YQ+E+ EP YFSSSI+YGGQENYSPRT A ++SH + KK
Sbjct: 720 GKKGTGGASG----KDESSMYQNETMEPSYFSSSIFYGGQENYSPRTNA-SQSHPN-FKK 773
Query: 125 DGDEDDPNGNNSNSASRGNWWQ 146
+ ++D N +NSN ASRGNWW+
Sbjct: 774 EVKDNDANESNSNPASRGNWWK 795
>gi|225440524|ref|XP_002275314.1| PREDICTED: uncharacterized protein LOC100252790 isoform 1 [Vitis
vinifera]
gi|359481721|ref|XP_003632663.1| PREDICTED: uncharacterized protein LOC100252790 isoform 2 [Vitis
vinifera]
gi|297740289|emb|CBI30471.3| unnamed protein product [Vitis vinifera]
Length = 159
Score = 85.9 bits (211), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 61/147 (41%), Positives = 82/147 (55%), Gaps = 22/147 (14%)
Query: 21 LFGPKDSS--SSATTGGFGSIFSPPTAVQGRGSSH---------SGSII-----GSLRKQ 64
LFG K+SS S ++TG F SIFS + V GR S SG+ + G+
Sbjct: 19 LFGSKESSYPSPSSTGIFASIFSTSSKVLGRESLRPDLTKKKQDSGNEVWNAKPGTTENA 78
Query: 65 GESSKGGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHHHVVKK 124
+ S+G ES +++D S YQ + +PC+ SSSIYYGGQ+ Y + + ++KK
Sbjct: 79 LQQSEG-ESQSISNRDTGSFYQEQRVQPCHLSSSIYYGGQDIYFHPQNSQSSGMPSMLKK 137
Query: 125 DGDEDDPNGNNSNSASRGNWWQGSLYY 151
D EDD + SASRGNWWQGSLYY
Sbjct: 138 DSGEDD-----TGSASRGNWWQGSLYY 159
>gi|297823891|ref|XP_002879828.1| hypothetical protein ARALYDRAFT_321676 [Arabidopsis lyrata subsp.
lyrata]
gi|297325667|gb|EFH56087.1| hypothetical protein ARALYDRAFT_321676 [Arabidopsis lyrata subsp.
lyrata]
Length = 142
Score = 85.1 bits (209), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 68/160 (42%), Positives = 90/160 (56%), Gaps = 27/160 (16%)
Query: 1 MENKRQESASSSSSPINFDHLFGPK--DSSSSATTGGFGSIFSPPTAV-------QGRGS 51
M+NK+ S SSSSS + DH+FGP+ SSSS+TTG F SIF P+ V + +
Sbjct: 1 MDNKKSVSGSSSSSSSSLDHIFGPRVSSSSSSSTTGLFQSIFPLPSTVMQANSASRNEAA 60
Query: 52 SHSGSIIGSLRKQGESSKGGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPRT 111
+ + G ++GE SK +K++ S E+ PC SSSIYYGGQ+NYS T
Sbjct: 61 KYQTTNFGIPNERGERSK--------NKERKSYQSEETQPPCNLSSSIYYGGQDNYSSST 112
Query: 112 KATTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
++ KKDG+E D S SASRGNWW+GS Y
Sbjct: 113 -----TNPDAYKKDGEEGD-----SESASRGNWWEGSFNY 142
>gi|209967471|gb|ACJ02360.1| hypothetical protein [Populus euphratica]
Length = 157
Score = 81.6 bits (200), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 62/146 (42%), Positives = 79/146 (54%), Gaps = 28/146 (19%)
Query: 21 LFGPKDSSSSATTGGFGSIFSPPTA-VQGRGSSHSGSIIGSLRKQGES-----SKGG--- 71
LFG K++S S + G FGSIF+P + V GR S + +KQ + +K G
Sbjct: 18 LFGSKENSPS-SMGIFGSIFAPASPKVLGR---ESLRFEVAEKKQDSADDAWNTKSGIPA 73
Query: 72 ----------ESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHHHV 121
ES +KD SSIYQ + +PC+ SSSIYYGGQ+ Y A T S + +
Sbjct: 74 SDLTSKMNEDESQSVPNKDMSSIYQEQRVQPCHLSSSIYYGGQDIYHHPQTAHTSSMNPM 133
Query: 122 VKKDGDEDDPNGNNSNSASRGNWWQG 147
KKDG EDD + SASRGNWWQG
Sbjct: 134 FKKDGPEDD-----TGSASRGNWWQG 154
>gi|224087557|ref|XP_002308189.1| predicted protein [Populus trichocarpa]
gi|222854165|gb|EEE91712.1| predicted protein [Populus trichocarpa]
Length = 174
Score = 80.1 bits (196), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 61/160 (38%), Positives = 77/160 (48%), Gaps = 39/160 (24%)
Query: 21 LFGPKDSSSSATTGGFGSIF---SPPTAVQGRGSSHSGSII------GSLRKQGESSKG- 70
LFG K++S S + G FGSIF SP + R + ++ SLR + K
Sbjct: 18 LFGSKENSPS-SMGIFGSIFAPASPKVLIIARRAKFIVLVLIQVLGRESLRFEVAEKKQD 76
Query: 71 -----------------------GESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENY 107
GES +KD SSIYQ + +PC+ SSSIYYGGQ+ Y
Sbjct: 77 SADDAWNTKSGTPASDLTSKMNEGESQSVPNKDMSSIYQEQRVQPCHLSSSIYYGGQDIY 136
Query: 108 SPRTKATTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQG 147
A T S + + KKDG EDD + SASRGNWWQG
Sbjct: 137 HHPQTAHTSSINPMFKKDGPEDD-----TGSASRGNWWQG 171
>gi|297793471|ref|XP_002864620.1| hypothetical protein ARALYDRAFT_496048 [Arabidopsis lyrata subsp.
lyrata]
gi|297310455|gb|EFH40879.1| hypothetical protein ARALYDRAFT_496048 [Arabidopsis lyrata subsp.
lyrata]
Length = 135
Score = 80.1 bits (196), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 57/135 (42%), Positives = 74/135 (54%), Gaps = 24/135 (17%)
Query: 21 LFGPKDSSSSATTGG-FGSIFSPPTAVQGRGSSHSGSIIGSLRKQGESSKGGESNIFNSK 79
LFG KD S +++ G F +IF P+ GR S+S K G ++ ES N+
Sbjct: 21 LFGSKDPSPPSSSSGIFSTIFPHPSKGSGRDGSNS--------KHGSQAQRRES--LNA- 69
Query: 80 DKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHHHVVK---KDGDEDDPNGNNS 136
Q + EPC+ SSS+YYGGQ+ YS ++TT + VK + EDD NG NS
Sbjct: 70 ------QEDRVEPCHLSSSLYYGGQDVYS---RSTTNQTYPAVKNERRRSGEDDANGQNS 120
Query: 137 NSASRGNWWQGSLYY 151
SRGNWWQGSLYY
Sbjct: 121 QDVSRGNWWQGSLYY 135
>gi|357481929|ref|XP_003611250.1| hypothetical protein MTR_5g011930 [Medicago truncatula]
gi|355512585|gb|AES94208.1| hypothetical protein MTR_5g011930 [Medicago truncatula]
Length = 113
Score = 76.6 bits (187), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 56/153 (36%), Positives = 75/153 (49%), Gaps = 42/153 (27%)
Query: 1 MENKRQESASSSSSPINFDHLFGPKDSSSSATTGGFGSIFSPPTAVQGRGSSHSGSIIGS 60
MEN +++++S SS+ +NFD PKDS I P +V+ GS
Sbjct: 1 MENNKKQTSSPSSTTVNFDQPLSPKDS-----------ICPSPPSVEDIGSR-------- 41
Query: 61 LRKQGESSKG--GESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESH 118
+++ G+ S G GE+ +F SSSI YGG+E+ +P T
Sbjct: 42 IQEVGKKSLGTPGEATLF-------------------SSSILYGGREDDTPI--GNTAEP 80
Query: 119 HHVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
V KKD D D N NN NSASRG+WW GSLYY
Sbjct: 81 CDVFKKDTDNADLNCNNPNSASRGDWWLGSLYY 113
>gi|356494879|ref|XP_003516310.1| PREDICTED: uncharacterized protein LOC100775733 [Glycine max]
Length = 153
Score = 75.9 bits (185), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 69/163 (42%), Positives = 93/163 (57%), Gaps = 22/163 (13%)
Query: 1 MENKRQESASSSSSPINFDHLFGPKDSSSSATTGGFGSIFSPPTA-VQGRGSSHSGSIIG 59
ME +Q+ ASSS + LFG K+S S+++G FGSIFSPP++ V GR S S +
Sbjct: 1 MEGNKQKGASSSFT----SELFGSKESRPSSSSGIFGSIFSPPSSKVLGRESLRS-EMSS 55
Query: 60 SLRKQGESSKGGESNIFN-----------SKDKSSIYQSESAEPCYFSSSIYYGGQENYS 108
+ + SS+ G + F+ +KD SSIYQ + +PC+ SSSIYYGGQ+ YS
Sbjct: 56 KIANETWSSRIGIQDNFSKGYGSEAPNAVNKDMSSIYQDQRVQPCHLSSSIYYGGQDIYS 115
Query: 109 PRTKATTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
E + + + DG EDD S ASRG+WWQG LYY
Sbjct: 116 NPQSTQNEGFNSMHRNDGGEDD-----SEFASRGDWWQGGLYY 153
>gi|15237814|ref|NP_200716.1| uncharacterized protein [Arabidopsis thaliana]
gi|10177635|dbj|BAB10783.1| unnamed protein product [Arabidopsis thaliana]
gi|17380960|gb|AAL36292.1| unknown protein [Arabidopsis thaliana]
gi|20465393|gb|AAM20121.1| unknown protein [Arabidopsis thaliana]
gi|110742611|dbj|BAE99218.1| hypothetical protein [Arabidopsis thaliana]
gi|332009755|gb|AED97138.1| uncharacterized protein [Arabidopsis thaliana]
Length = 135
Score = 75.1 bits (183), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 54/135 (40%), Positives = 72/135 (53%), Gaps = 24/135 (17%)
Query: 21 LFGPKDSSSSATTGG-FGSIFSPPTAVQGRGSSHSGSIIGSLRKQGESSKGGESNIFNSK 79
LFG KD S +++ G F ++F P+ R S+S K G ++ ES N+
Sbjct: 21 LFGSKDPSPPSSSSGIFSTMFPHPSKGSARDGSNS--------KHGSQAQRRES--LNA- 69
Query: 80 DKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHHHVVKKD---GDEDDPNGNNS 136
Q + EPC+ SSS+YYGGQ+ Y+ ++TT + VK D EDD NG N
Sbjct: 70 ------QEDRVEPCHLSSSLYYGGQDVYA---RSTTNQTYPPVKNDRRRSGEDDANGQNP 120
Query: 137 NSASRGNWWQGSLYY 151
SRGNWWQGSLYY
Sbjct: 121 QDVSRGNWWQGSLYY 135
>gi|388493012|gb|AFK34572.1| unknown [Medicago truncatula]
Length = 161
Score = 73.9 bits (180), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 71/161 (44%), Positives = 95/161 (59%), Gaps = 10/161 (6%)
Query: 1 MENKRQE-----SASSSSSPINFDHLFGPKDSSSSATTGGFGSIFS--PPTAVQGRGSSH 53
MENK + + SSSS+P NFDHLFGPKD S+++++ PP+ R S+
Sbjct: 1 MENKNNKKKVGGAGSSSSTPTNFDHLFGPKDPSTTSSSSNSIFSSIFPPPSTGGWRDSTR 60
Query: 54 SGSIIGSLRKQGE-SSKGGESNIFNSKDKSSIYQSESAEPCYFSSSIYY-GGQENY-SPR 110
+ G S+KGG S N+ ++ YQ+E+ EP + SS Y GGQENY SP
Sbjct: 61 HDMGSNTYGVPGNYSNKGGNSANKNTGSDTTYYQNETVEPSNYYSSSIYYGGQENYNSPT 120
Query: 111 TKATTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
T TT++ H K D+DD NG++S SASRGNWW+GSLYY
Sbjct: 121 TCKTTQTPHFTKKNKDDDDDSNGSDSGSASRGNWWKGSLYY 161
>gi|449439960|ref|XP_004137753.1| PREDICTED: uncharacterized protein LOC101220657 [Cucumis sativus]
gi|449511090|ref|XP_004163859.1| PREDICTED: uncharacterized LOC101220657 [Cucumis sativus]
Length = 145
Score = 70.9 bits (172), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 54/136 (39%), Positives = 72/136 (52%), Gaps = 13/136 (9%)
Query: 21 LFGPKDSSSSATTGGFGSIFSPPTAVQGRGSSHSGS--IIGSLRKQGESSKGGESN--IF 76
LFG +D+SS ++ FG +FS G+ S G + K +SK E
Sbjct: 18 LFGIRDTSSLSSNHIFGPVFSSSFKPSGQHSEVGGKSPALSYFPKTEGNSKYKECKNGST 77
Query: 77 NSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHHHVVKKD-GDEDDPNGNN 135
+S++ S YQ + PC+ SSSIYYGGQ+ Y T+ + +K+D G EDD G
Sbjct: 78 SSREMGSFYQEQRTNPCHLSSSIYYGGQDVY---TQNPATGFNSPLKRDHGGEDDSGG-- 132
Query: 136 SNSASRGNWWQGSLYY 151
ASRGNWWQGSLYY
Sbjct: 133 ---ASRGNWWQGSLYY 145
>gi|351722410|ref|NP_001237499.1| uncharacterized protein LOC100306610 [Glycine max]
gi|255629059|gb|ACU14874.1| unknown [Glycine max]
Length = 145
Score = 70.5 bits (171), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 58/159 (36%), Positives = 79/159 (49%), Gaps = 22/159 (13%)
Query: 1 MENKRQESASSSSSPINFDHLFGPKDSSSSATTGGFGSIFSPPTAVQGRGSSHSGSIIGS 60
ME +++ +SS +S L G SA +G F S+F PP+ V GR S S +
Sbjct: 1 MEGRKKMGSSSFTS-----ELLGSNQFQKSAASGIFESMFPPPSKVLGRESLRSEVCEKT 55
Query: 61 LRKQGES-----SKG--GESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYS-PRTK 112
++ S SKG GE+ KD SSIYQ + +PC SSSI+YGGQ+ S P++
Sbjct: 56 ANERWSSKIDYISKGSDGETQSTTHKDMSSIYQEQRLQPCQLSSSIHYGGQDICSCPKST 115
Query: 113 ATTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
+ + + KKDG EDD N WQG YY
Sbjct: 116 QDSGYNSLLYKKDGVEDDLG---------SNLWQGGPYY 145
>gi|356566589|ref|XP_003551513.1| PREDICTED: uncharacterized protein LOC100791771 [Glycine max]
Length = 175
Score = 69.7 bits (169), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 54/143 (37%), Positives = 75/143 (52%), Gaps = 14/143 (9%)
Query: 1 MENKRQESASSSSSPINFDHLFGPKDSSSSATTGGFGSIFSPPTAVQGRGSSHSGSIIGS 60
ME +++ +SSS + LFG S+ +G F S+F PP V GRGS S + G
Sbjct: 1 MEGRKKMGSSSSFT----SELFGSNQFQKSSASGIFESMFPPPPKVLGRGSLCS-EVSGK 55
Query: 61 LRKQGESSKG--------GESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYS-PRT 111
+G + K GE+ KD SSIYQ + +PC SSSI+YGGQ+ YS P++
Sbjct: 56 SSSEGWNYKFDYISMGNHGETQNTAQKDMSSIYQEQRLQPCQLSSSIHYGGQDIYSCPKS 115
Query: 112 KATTESHHHVVKKDGDEDDPNGN 134
+ + + KKDG EDD N
Sbjct: 116 TQDSGYNSLLYKKDGVEDDLGSN 138
>gi|297810301|ref|XP_002873034.1| hypothetical protein ARALYDRAFT_908064 [Arabidopsis lyrata subsp.
lyrata]
gi|297318871|gb|EFH49293.1| hypothetical protein ARALYDRAFT_908064 [Arabidopsis lyrata subsp.
lyrata]
Length = 154
Score = 69.3 bits (168), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 48/116 (41%), Positives = 63/116 (54%), Gaps = 15/116 (12%)
Query: 42 PPTAVQGRGSSHSGSIIGSLR----KQGESSKGG-ESNIFNSKDKSSIYQSESAEPCYFS 96
PP+ V GR S ++ G+ K E G + N ++ S YQ + +PC+ S
Sbjct: 48 PPSQVLGRESMRQETVAGAWNDKTSKTVEDFAGDVDRNREQQENHGSGYQDQRVQPCHLS 107
Query: 97 SSIYYGGQENY-SPRTKATTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
SSIYYGG + Y P+ + ++ KKDG EDD S SASRGNWWQGSLYY
Sbjct: 108 SSIYYGGPDVYFQPQNSSINSTN----KKDGGEDD-----SGSASRGNWWQGSLYY 154
>gi|42570391|ref|NP_850323.2| uncharacterized protein [Arabidopsis thaliana]
gi|28207124|gb|AAO37202.1| hypothetical protein [Arabidopsis thaliana]
gi|330254646|gb|AEC09740.1| uncharacterized protein [Arabidopsis thaliana]
Length = 151
Score = 65.9 bits (159), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 44/100 (44%), Positives = 59/100 (59%), Gaps = 17/100 (17%)
Query: 18 FDHLFGPK--DSSSSATTGGFGSIFSPPTAV-QGRGSSHSGSI------IGSLRKQGESS 68
DH+FGP+ S SS+TTG F SIF PP+AV QG +S +G+ + ++GE S
Sbjct: 18 LDHIFGPRVSHSYSSSTTGLFKSIFPPPSAVTQGNLTSRNGAAKYQPTNFETPNERGERS 77
Query: 69 KGGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYS 108
K +K++ S E+ PC SSSIYYGGQ+NYS
Sbjct: 78 K--------NKERKSYQSEETQPPCNLSSSIYYGGQDNYS 109
>gi|297815854|ref|XP_002875810.1| hypothetical protein ARALYDRAFT_485059 [Arabidopsis lyrata subsp.
lyrata]
gi|297321648|gb|EFH52069.1| hypothetical protein ARALYDRAFT_485059 [Arabidopsis lyrata subsp.
lyrata]
Length = 165
Score = 65.1 bits (157), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 58/154 (37%), Positives = 76/154 (49%), Gaps = 40/154 (25%)
Query: 12 SSSPINFD-HLFGPKDSSSSATTGGFGSIFSPPTAVQGRGSSHSGSIIGSLRKQGESSKG 70
SSSP ++ +FGPK+ SSSA F SIF PP+ +S +G
Sbjct: 9 SSSPSSYTAKIFGPKEPSSSA---NFNSIFPPPS---------------------KSIQG 44
Query: 71 GESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYS-----------PRTKATTESH- 118
NI +SK S + ESA C SSS+YYGGQ+ YS TK+ T +
Sbjct: 45 TSRNILSSKYGSLDQRKESAT-CNLSSSLYYGGQDVYSQSTHNHTHTTVKETKSLTLQNV 103
Query: 119 -HHVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
H + KD + D+ N NS+ ASRGNWWQ + Y
Sbjct: 104 VSHQINKDQNRDN-NDANSSDASRGNWWQDDVCY 136
>gi|357505743|ref|XP_003623160.1| hypothetical protein MTR_7g065150 [Medicago truncatula]
gi|355498175|gb|AES79378.1| hypothetical protein MTR_7g065150 [Medicago truncatula]
Length = 148
Score = 65.1 bits (157), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 48/119 (40%), Positives = 65/119 (54%), Gaps = 12/119 (10%)
Query: 1 MENKRQESASSSSSPINFDHLFGPKDSSSSATTGGFGSIFSPPTAVQGRGSSHSGSIIGS 60
ME ++Q +SSS + LFG D S+ +G F SIFSP + V GR S S ++ G
Sbjct: 1 MEGRKQFGSSSSETSSFTSELFGSNDLHKSSASGIFDSIFSPSSKVFGRESLRS-AMNGK 59
Query: 61 LRKQGESSK-----------GGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYS 108
+G +SK GE+ +KD S IYQ + +PC+ SSSIYYGGQ+ YS
Sbjct: 60 TATEGSNSKIDTQEYISKGNDGETQNKTNKDMSYIYQEQRVQPCHLSSSIYYGGQDVYS 118
>gi|357461049|ref|XP_003600806.1| hypothetical protein MTR_3g069570 [Medicago truncatula]
gi|355489854|gb|AES71057.1| hypothetical protein MTR_3g069570 [Medicago truncatula]
Length = 217
Score = 63.2 bits (152), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 66/156 (42%), Positives = 90/156 (57%), Gaps = 10/156 (6%)
Query: 1 MENKRQE-----SASSSSSPINFDHLFGPKDSSSSATTGGFGSIFS--PPTAVQGRGSSH 53
MENK + + SSSS+P NFDHLFGPKD S+++++ PP+ R S+
Sbjct: 1 MENKNNKKKVGGAGSSSSTPTNFDHLFGPKDPSTTSSSSNSIFSSIFPPPSTGGWRDSTR 60
Query: 54 SGSIIGSLRKQGE-SSKGGESNIFNSKDKSSIYQSESAEPCYFSSSIYY-GGQENY-SPR 110
+ G S+KGG S N+ ++ YQ+E+ EP + SS Y GGQENY SP
Sbjct: 61 HDMGSNTYGAPGNYSNKGGNSANKNTGSDTTYYQNETVEPSNYYSSSIYYGGQENYNSPT 120
Query: 111 TKATTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQ 146
T TT++ H K D+DD NG++S SASRGNWW+
Sbjct: 121 TCKTTQTPHFTKKNKDDDDDSNGSDSGSASRGNWWK 156
>gi|194692056|gb|ACF80112.1| unknown [Zea mays]
gi|195646162|gb|ACG42549.1| hypothetical protein [Zea mays]
gi|268083689|gb|ACY95285.1| unknown [Zea mays]
gi|413949056|gb|AFW81705.1| hypothetical protein ZEAMMB73_746072 [Zea mays]
Length = 136
Score = 62.4 bits (150), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 47/124 (37%), Positives = 69/124 (55%), Gaps = 16/124 (12%)
Query: 34 GGFGSIFSPPTAVQGRGSSHSGSIIGSLRKQ---GESSKGGESNIFNS-KDKSSIYQSES 89
G F ++FS PTA + H+ + L KQ G++ + G+S+ + KD + +E
Sbjct: 23 GYFSTVFSAPTAGSASDAKHA-DLYTMLNKQSSRGQNGRDGKSHSRPTYKDGKHAHPNEP 81
Query: 90 AEPCYFSSSIYYGGQENYSP--RTKATTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQG 147
+E YF SS++YGG+E YS R + E H D D+P+G SA+RG+WWQG
Sbjct: 82 SESPYFGSSVHYGGREFYSSVLRKQPANEPH-----TDYKGDNPDG----SATRGDWWQG 132
Query: 148 SLYY 151
SLYY
Sbjct: 133 SLYY 136
>gi|212722470|ref|NP_001131619.1| uncharacterized protein LOC100192973 [Zea mays]
gi|195646590|gb|ACG42763.1| hypothetical protein [Zea mays]
Length = 136
Score = 62.4 bits (150), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 47/124 (37%), Positives = 69/124 (55%), Gaps = 16/124 (12%)
Query: 34 GGFGSIFSPPTAVQGRGSSHSGSIIGSLRKQ---GESSKGGESNIFNS-KDKSSIYQSES 89
G F ++FS PTA + H+ + L KQ G++ + G+S+ + KD + +E
Sbjct: 23 GYFSTVFSAPTAGSASDAKHA-DLYTMLNKQSSRGQNGRDGKSHSRPTYKDGKHAHPNEP 81
Query: 90 AEPCYFSSSIYYGGQENYSP--RTKATTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQG 147
+E YF SS++YGG+E YS R + E H D D+P+G SA+RG+WWQG
Sbjct: 82 SESPYFGSSVHYGGREFYSSVLRKQPANEPH-----TDYKGDNPHG----SATRGDWWQG 132
Query: 148 SLYY 151
SLYY
Sbjct: 133 SLYY 136
>gi|357119628|ref|XP_003561537.1| PREDICTED: uncharacterized protein LOC100840587 [Brachypodium
distachyon]
Length = 150
Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 51/162 (31%), Positives = 86/162 (53%), Gaps = 23/162 (14%)
Query: 1 MENKRQESASSSSSPIN----FDHLFGPKDSSSSATTGGFG---SIFSPPTAVQGRGSSH 53
ME K++ +A+ + S N D LFGPK +S++++ ++F P+ G+ +S
Sbjct: 1 MEGKKKPAAAGAGSKTNGSSIVDDLFGPKPDASNSSSSSASYFSTVFPTPSEAAGKDAS- 59
Query: 54 SGSIIGSLRKQGESSKGGESNIFNSKDKSSIYQSESAEPCYF-SSSIYYGGQENYSPRTK 112
++ + + G +S+ + + Y S S E YF SSS++YGG++ Y+P
Sbjct: 60 ---------RRAQYASGNKSSSSWTGQQQQQYGSGSPESPYFGSSSVHYGGRDLYTPGNV 110
Query: 113 ATTESHHH---VVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
+S H ++K DD G+ S +A+RG+WWQGSLYY
Sbjct: 111 NGDQSKTHHGGPIRKHNKVDD--GDTSGAATRGDWWQGSLYY 150
>gi|22326622|ref|NP_680147.1| uncharacterized protein [Arabidopsis thaliana]
gi|52354445|gb|AAU44543.1| hypothetical protein AT5G05965 [Arabidopsis thaliana]
gi|332003563|gb|AED90946.1| uncharacterized protein [Arabidopsis thaliana]
Length = 131
Score = 61.2 bits (147), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 64/153 (41%), Positives = 88/153 (57%), Gaps = 24/153 (15%)
Query: 1 MENKRQESASSSSSPINFDHLFGPKDS-SSSATTGGFGSIFSPPTAVQGRGSSHSGSIIG 59
M+ K+ ++ASSS S +FD+LFGPK S S+S+ + SIF PP A + +GS + + G
Sbjct: 1 MDIKKTKAASSSHS-SSFDNLFGPKGSASASSCSTILDSIFPPPVA-RKKGSHTAPEVQG 58
Query: 60 SLRKQGESSKGGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYS-PRTKATTESH 118
NI + D+ S + E E Y+SSSIYYGGQ++YS PRT ++ S
Sbjct: 59 --------------NISQATDERSSH--EKRESSYYSSSIYYGGQQHYSPPRTDGSSTSP 102
Query: 119 HHVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
H K+ D D + S SRGNWW+GSLYY
Sbjct: 103 SHQPKETNDRTD----ITTSTSRGNWWKGSLYY 131
>gi|242090741|ref|XP_002441203.1| hypothetical protein SORBIDRAFT_09g022260 [Sorghum bicolor]
gi|241946488|gb|EES19633.1| hypothetical protein SORBIDRAFT_09g022260 [Sorghum bicolor]
Length = 138
Score = 58.2 bits (139), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 34/73 (46%), Positives = 45/73 (61%), Gaps = 7/73 (9%)
Query: 79 KDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHHHVVKKDGDEDDPNGNNSNS 138
KD Y +ES+E YF SS++YGG+E YS + + H K ED+P+G S
Sbjct: 73 KDGKQSYPNESSESPYFGSSVHYGGREFYSSTLQKQPANEPHTNYK---EDNPDG----S 125
Query: 139 ASRGNWWQGSLYY 151
A+RG+WWQGSLYY
Sbjct: 126 ATRGDWWQGSLYY 138
>gi|326498025|dbj|BAJ94875.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 137
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 40/152 (26%), Positives = 81/152 (53%), Gaps = 16/152 (10%)
Query: 1 MENKRQESASSSSSPINFDHLFGPKDSSSSATTGG-FGSIFSPPTAVQGRGSSHSGSIIG 59
M+ K++ +A++ SS + D LFGPK+ ++S ++ G F ++F P+A G+
Sbjct: 1 MDGKKKPAAATGSSIV--DDLFGPKNGAASPSSAGYFSTVFPTPSAATGK---------- 48
Query: 60 SLRKQGESSKGGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHH 119
++ ++ G +S+ + + + P + SSS++YGG++ Y+ A ++ +
Sbjct: 49 DASRRAAAAAGNKSSSAGHQQQGKQHGGSPESPYFGSSSVHYGGRDFYA---GAESQRQY 105
Query: 120 HVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
+ ++++A+RG+WWQGSLYY
Sbjct: 106 PPAAAAPAPKHKDDGDTSAATRGDWWQGSLYY 137
>gi|357133435|ref|XP_003568330.1| PREDICTED: uncharacterized protein LOC100823157 [Brachypodium
distachyon]
Length = 146
Score = 55.1 bits (131), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 43/131 (32%), Positives = 65/131 (49%), Gaps = 22/131 (16%)
Query: 34 GGFGSIFSPPTAVQGRGSSHS--GSIIGSLRKQGESSKGGESNIFNS---------KDKS 82
G F S+FS A R + + +I+ +G +S G F KD
Sbjct: 25 GYFSSVFSASPAANPRDARETDLNTILNRQNPKGPTSGGIADGKFQGSPTKGRVTYKDGQ 84
Query: 83 SIYQSESAEPCYFSSSIYYGGQENY--SPRTKATTESHHHVVKKDGDEDDPNGNNSNSAS 140
Y +E +E YF SS++YGG++ Y SP +A+ S + +D+P+G + A+
Sbjct: 85 QFYPNEPSESPYFGSSVHYGGRDFYDSSPHKQASESSRSY------KDDNPDG---SLAT 135
Query: 141 RGNWWQGSLYY 151
RG+WWQGSLYY
Sbjct: 136 RGDWWQGSLYY 146
>gi|30679498|ref|NP_850752.1| uncharacterized protein [Arabidopsis thaliana]
gi|222422999|dbj|BAH19483.1| AT5G02020 [Arabidopsis thaliana]
gi|332003038|gb|AED90421.1| uncharacterized protein [Arabidopsis thaliana]
Length = 137
Score = 54.3 bits (129), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 34/94 (36%), Positives = 52/94 (55%), Gaps = 8/94 (8%)
Query: 20 HLFGPKDSSSSATTGG-FGSIFSPPTAVQGRGSSHSGSIIGSLRKQGESSKGGESNIFNS 78
LFG +++ SS ++ G GSIF PP+ V GR S ++ G + S GG N+ +
Sbjct: 22 ELFGSRENPSSPSSSGILGSIFPPPSKVLGRESVRQETVTGGCWNEKTSKTGG--NVDRN 79
Query: 79 KDK-----SSIYQSESAEPCYFSSSIYYGGQENY 107
+++ S Q + +PC+ SSSIYYGG + Y
Sbjct: 80 REQQENHGSGYQQDQRVQPCHLSSSIYYGGPDVY 113
>gi|195657743|gb|ACG48339.1| hypothetical protein [Zea mays]
Length = 137
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/73 (43%), Positives = 43/73 (58%), Gaps = 7/73 (9%)
Query: 79 KDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHHHVVKKDGDEDDPNGNNSNS 138
KD Y ++S+E YF SS++YGG+E YS + + K ED P+G S
Sbjct: 72 KDGKHAYPNDSSESPYFGSSVHYGGREFYSNTLQKQPANEPQTKYK---EDKPDG----S 124
Query: 139 ASRGNWWQGSLYY 151
A+RG+WWQGSLYY
Sbjct: 125 ATRGDWWQGSLYY 137
>gi|147862480|emb|CAN84007.1| hypothetical protein VITISV_033736 [Vitis vinifera]
Length = 158
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/103 (39%), Positives = 57/103 (55%), Gaps = 17/103 (16%)
Query: 21 LFGPKDSS--SSATTGGFGSIFSPPTAVQGRGSSH---------SGSII-----GSLRKQ 64
LFG K+SS S ++TG F SIFS + V GR S SG+ + G+
Sbjct: 19 LFGSKESSYPSPSSTGIFASIFSTSSKVLGRESLRPDLTKKKQDSGNEVWNAKPGTTENA 78
Query: 65 GESSKGGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENY 107
+ S+G ES +++D S YQ + +PC+ SSSIYYGGQ+ Y
Sbjct: 79 LQQSEG-ESQSISNRDTGSFYQEQRVQPCHLSSSIYYGGQDIY 120
>gi|326524530|dbj|BAK00648.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 142
Score = 53.9 bits (128), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/141 (33%), Positives = 67/141 (47%), Gaps = 30/141 (21%)
Query: 28 SSSATTGGFGSIFSPPTAVQGRGSSH---------------SGSIIGSLRKQGESSKGGE 72
S++A G F S+FS A R + SG + QG +KG
Sbjct: 15 SAAAGNGYFSSVFSASPAANPRDARQMDLYTMLNKQNPKGQSGGGTADGKSQGSPAKG-- 72
Query: 73 SNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENY--SPRTKATTESHHHVVKKDGDEDD 130
KD Y +ES+E YF SS++YGG++ Y SP +A S ++ ED+
Sbjct: 73 --RVAYKDGKQFYPNESSESPYFGSSVHYGGRDFYDSSPHKQANESSRNY------KEDN 124
Query: 131 PNGNNSNSASRGNWWQGSLYY 151
+G + A+RG+WWQGSLYY
Sbjct: 125 TDG---SLATRGDWWQGSLYY 142
>gi|413949057|gb|AFW81706.1| hypothetical protein ZEAMMB73_746072 [Zea mays]
Length = 89
Score = 52.8 bits (125), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 33/75 (44%), Positives = 44/75 (58%), Gaps = 11/75 (14%)
Query: 79 KDKSSIYQSESAEPCYFSSSIYYGGQENYSP--RTKATTESHHHVVKKDGDEDDPNGNNS 136
KD + +E +E YF SS++YGG+E YS R + E H D D+P+G
Sbjct: 24 KDGKHAHPNEPSESPYFGSSVHYGGREFYSSVLRKQPANEPH-----TDYKGDNPDG--- 75
Query: 137 NSASRGNWWQGSLYY 151
SA+RG+WWQGSLYY
Sbjct: 76 -SATRGDWWQGSLYY 89
>gi|388511371|gb|AFK43747.1| unknown [Lotus japonicus]
Length = 115
Score = 51.2 bits (121), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/102 (38%), Positives = 51/102 (50%), Gaps = 23/102 (22%)
Query: 21 LFGPKDSSSSATTGGFGSIFSPPTAVQGRGSSHSGSIIGSLRKQGESSKGG--------- 71
LFGP +S S+ F S+FSP V GR S S + G + +G S+K G
Sbjct: 18 LFGPSESHPSS---AFESMFSPSPKVTGRVSLRS-EVSGKIGGEGSSTKFGTPPDSSHIN 73
Query: 72 ----------ESNIFNSKDKSSIYQSESAEPCYFSSSIYYGG 103
E+ +KD SSIYQ + +PC+ SSSIYYGG
Sbjct: 74 ADFISKGSDSETQSKANKDMSSIYQEQRVQPCHLSSSIYYGG 115
>gi|212720725|ref|NP_001131984.1| hypothetical protein [Zea mays]
gi|194693102|gb|ACF80635.1| unknown [Zea mays]
gi|413945571|gb|AFW78220.1| hypothetical protein ZEAMMB73_376464 [Zea mays]
Length = 137
Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 31/73 (42%), Positives = 42/73 (57%), Gaps = 7/73 (9%)
Query: 79 KDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHHHVVKKDGDEDDPNGNNSNS 138
KD Y ++S+E YF SS++YGG+E YS + + K ED +G S
Sbjct: 72 KDGKHAYPNDSSESPYFGSSVHYGGREFYSSTLQKQPANEPQTKYK---EDKSDG----S 124
Query: 139 ASRGNWWQGSLYY 151
A+RG+WWQGSLYY
Sbjct: 125 ATRGDWWQGSLYY 137
>gi|357470595|ref|XP_003605582.1| hypothetical protein MTR_4g034100 [Medicago truncatula]
gi|355506637|gb|AES87779.1| hypothetical protein MTR_4g034100 [Medicago truncatula]
Length = 213
Score = 49.3 bits (116), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 30/68 (44%), Positives = 37/68 (54%), Gaps = 8/68 (11%)
Query: 79 KDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHHHVVKKDGDEDDPNGNNSNS 138
K SSI+Q + PC SSSIYYGGQ+ P A S + K D + DD S
Sbjct: 145 KVTSSIFQDQIIGPCNLSSSIYYGGQDILYPAQNARLTS---LKKYDWENDD-----SGV 196
Query: 139 ASRGNWWQ 146
ASRG+WW+
Sbjct: 197 ASRGDWWK 204
>gi|357470589|ref|XP_003605579.1| hypothetical protein MTR_4g034060 [Medicago truncatula]
gi|355506634|gb|AES87776.1| hypothetical protein MTR_4g034060 [Medicago truncatula]
Length = 159
Score = 48.9 bits (115), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 35/68 (51%), Gaps = 5/68 (7%)
Query: 79 KDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHHHVVKKDGDEDDPNGNNSNS 138
K SSI+Q + PC SSSIYYGGQ+ P + K D + N+S
Sbjct: 86 KIASSIFQDQITGPCNLSSSIYYGGQDILYPAQSTKIARLTLLTKYDWE-----NNDSEV 140
Query: 139 ASRGNWWQ 146
ASRG+WW+
Sbjct: 141 ASRGDWWK 148
>gi|297810695|ref|XP_002873231.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297319068|gb|EFH49490.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 134
Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 56/139 (40%), Positives = 78/139 (56%), Gaps = 24/139 (17%)
Query: 18 FDHLFGPKDSSSSATTGG----FGSIFSPPTAVQGRGSSHSGSIIGSLRKQGESSKGGES 73
FD+LFGPK S+S+++ SIF PP A + +GS + + GS+
Sbjct: 15 FDNLFGPKGSASASSASSCSTILDSIFPPPVA-RKKGSHTAPEVQGSI------------ 61
Query: 74 NIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYS-PRTKATTESHHHVVKKDGDEDDPN 132
+ +KD+ S + E E Y+SSSIYYGGQ++YS PRT ++ S H K E D
Sbjct: 62 TLATAKDERSSH--EKRESSYYSSSIYYGGQQHYSPPRTDGSSTSPSHQPK----ETDDR 115
Query: 133 GNNSNSASRGNWWQGSLYY 151
+ + S SRGNWW+GSLYY
Sbjct: 116 TDTTTSTSRGNWWKGSLYY 134
>gi|116793306|gb|ABK26698.1| unknown [Picea sitchensis]
Length = 116
Score = 46.6 bits (109), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 55/116 (47%), Gaps = 18/116 (15%)
Query: 52 SHSGSIIGSLRKQGESSKGGESNIFNSKD----KSSIYQS---------ESAEPCYFSSS 98
S S ++GS +KQ S + ES + K+ K + Q + E C++ SS
Sbjct: 3 SKSCEVLGSTQKQEHSEEHQESTLNGVKEINMEKEPLQQLLNHVKGKSFANPENCFWGSS 62
Query: 99 IYYGG-QENY--SPRTKATTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
+YYGG E+Y S KA + VK+ + DP N RGNWW GSLYY
Sbjct: 63 LYYGGPDEDYATSQVIKANQDEIKTEVKQLEEAMDPC--NLEYTVRGNWWNGSLYY 116
>gi|115464241|ref|NP_001055720.1| Os05g0454500 [Oryza sativa Japonica Group]
gi|53749346|gb|AAU90205.1| unknown protein [Oryza sativa Japonica Group]
gi|113579271|dbj|BAF17634.1| Os05g0454500 [Oryza sativa Japonica Group]
gi|215768458|dbj|BAH00687.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218196903|gb|EEC79330.1| hypothetical protein OsI_20183 [Oryza sativa Indica Group]
gi|222631813|gb|EEE63945.1| hypothetical protein OsJ_18770 [Oryza sativa Japonica Group]
Length = 144
Score = 46.6 bits (109), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 29/73 (39%), Positives = 42/73 (57%), Gaps = 5/73 (6%)
Query: 79 KDKSSIYQSESAEPCYFSSSIYYGGQENYSPRTKATTESHHHVVKKDGDEDDPNGNNSNS 138
KD Y +ES+E YF SS++YG +E Y ++ K+ E +P+G +
Sbjct: 77 KDGKQSYPNESSESPYFGSSVHYGAREFYGNTPPKQGDASPGNQKE--QEQNPDG---SL 131
Query: 139 ASRGNWWQGSLYY 151
A+RG+WWQGSLYY
Sbjct: 132 ATRGDWWQGSLYY 144
>gi|15232675|ref|NP_190272.1| uncharacterized protein [Arabidopsis thaliana]
gi|5541671|emb|CAB51177.1| hypothetical protein [Arabidopsis thaliana]
gi|332644693|gb|AEE78214.1| uncharacterized protein [Arabidopsis thaliana]
Length = 153
Score = 44.7 bits (104), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 35/88 (39%), Positives = 41/88 (46%), Gaps = 29/88 (32%)
Query: 21 LFGPKDSSSSATTGGFGSIFSPPTAVQGRGSSHSGSIIGSLRKQGESSKGGESNIFNSKD 80
LFGPK+ SSSA F SIF P SKG NI +SK
Sbjct: 19 LFGPKEPSSSA---NFNSIFPP-------------------------SKGTARNILSSKH 50
Query: 81 KSSIYQSESAEPCYFSSSIYYGGQENYS 108
S+ Q + A C SSS+YYGGQ+ YS
Sbjct: 51 -GSLEQRKEAGTCNLSSSLYYGGQDVYS 77
>gi|116790009|gb|ABK25469.1| unknown [Picea sitchensis]
Length = 161
Score = 44.3 bits (103), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 49/96 (51%), Gaps = 11/96 (11%)
Query: 62 RKQGESSKGGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGG-QENYS--PRTKATTESH 118
R +G K G+ ++ K ++S E C +SS+YYGG E+Y T+ +
Sbjct: 71 RVKGGKLKKGQLHLTEYVKK---HRSADPEHCSLASSLYYGGPDEHYDSLQTTQVDQDEQ 127
Query: 119 HHV---VKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
+ VK+ + DP N A+RGNWW+GSLYY
Sbjct: 128 QKIKMEVKQLQEAMDPF--NLEYATRGNWWKGSLYY 161
>gi|116793491|gb|ABK26767.1| unknown [Picea sitchensis]
Length = 161
Score = 44.3 bits (103), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 49/96 (51%), Gaps = 11/96 (11%)
Query: 62 RKQGESSKGGESNIFNSKDKSSIYQSESAEPCYFSSSIYYGG-QENYS--PRTKATTESH 118
R +G K G+ ++ K ++S E C +SS+YYGG E+Y T+ +
Sbjct: 71 RVKGGKLKKGQLHLTEYVKK---HRSADPEHCSLASSLYYGGPDEHYDSLQTTQVDQDEQ 127
Query: 119 HHV---VKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
+ VK+ + DP N A+RGNWW+GSLYY
Sbjct: 128 QKIKMEVKQLQEAMDPF--NLEYATRGNWWKGSLYY 161
>gi|223950247|gb|ACN29207.1| unknown [Zea mays]
Length = 155
Score = 40.0 bits (92), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 39/125 (31%), Positives = 54/125 (43%), Gaps = 17/125 (13%)
Query: 36 FGSIFSPPTAVQGRGSSHSGSII---GSLRKQGESSKGGESNIFNSKDKSSIYQSESAEP 92
S+F PP SII S Q ++ G+ SK ++ + P
Sbjct: 39 LSSLFYPPAPAANGKDQELYSIIYKGQSGSAQAGATGNGKPQWAPSKSHTAYAKDGKHSP 98
Query: 93 CY----FSSSIYYGGQENY--SPRTKATTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQ 146
Y F SS++YGG++ + S TK TES GD+ DP + S G+WWQ
Sbjct: 99 PYDTSCFGSSVHYGGRDYFYGSSTTKKATES---STDYKGDKKDPVAD-----SHGDWWQ 150
Query: 147 GSLYY 151
GS YY
Sbjct: 151 GSFYY 155
>gi|242054979|ref|XP_002456635.1| hypothetical protein SORBIDRAFT_03g039870 [Sorghum bicolor]
gi|241928610|gb|EES01755.1| hypothetical protein SORBIDRAFT_03g039870 [Sorghum bicolor]
Length = 155
Score = 39.7 bits (91), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 25/59 (42%), Positives = 33/59 (55%), Gaps = 10/59 (16%)
Query: 95 FSSSIYYGGQENY--SPRTKATTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
F SS++YGG+E + S TK +ES GD+ DP + S G+WWQGS YY
Sbjct: 105 FGSSVHYGGREYFYGSSTTKKASES---STDYKGDKKDPAAD-----SHGDWWQGSFYY 155
>gi|398803908|ref|ZP_10562914.1| citrate synthase I, hexameric type [Polaromonas sp. CF318]
gi|398095334|gb|EJL85673.1| citrate synthase I, hexameric type [Polaromonas sp. CF318]
Length = 436
Score = 38.9 bits (89), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 30/102 (29%), Positives = 48/102 (47%), Gaps = 13/102 (12%)
Query: 30 SATTGGFGSIFSPPTAVQGRGSSHSGSIIGSLRKQGESSKGGESNIFNSKDKSSIYQSES 89
+A G ++ P G + + +++G ++KQG K GE I KDKSS
Sbjct: 255 AAIAAGVACLWGP---AHGGANEAALNMLGDIQKQGGVEKIGEF-IKQVKDKSS-----G 305
Query: 90 AEPCYFSSSIYYGGQENYSPRTKATTESHHHVVKKDGDEDDP 131
+ F +Y +NY PR K E+ V+++ G E+DP
Sbjct: 306 VKLMGFGHRVY----KNYDPRAKLMQETCKEVLQEMGLENDP 343
>gi|326501634|dbj|BAK02606.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 195
Score = 38.1 bits (87), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 28/73 (38%), Positives = 36/73 (49%), Gaps = 13/73 (17%)
Query: 81 KSSIYQSESAEPCYFSSSIYYGGQENY--SPRTKATTESHHHVVKKDGDEDDPNGNNSNS 138
K + Y S C F SS++YGG++ Y S +A S + V KKD D
Sbjct: 134 KDNKYDSVGTSSC-FGSSVHYGGRDYYGSSAPKQAAEYSDYKVDKKDPVAD--------- 183
Query: 139 ASRGNWWQGSLYY 151
S G+WWQGS YY
Sbjct: 184 -SHGDWWQGSFYY 195
>gi|326489436|dbj|BAK01699.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 195
Score = 38.1 bits (87), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 28/73 (38%), Positives = 36/73 (49%), Gaps = 13/73 (17%)
Query: 81 KSSIYQSESAEPCYFSSSIYYGGQENY--SPRTKATTESHHHVVKKDGDEDDPNGNNSNS 138
K + Y S C F SS++YGG++ Y S +A S + V KKD D
Sbjct: 134 KDNKYDSVGTSSC-FGSSVHYGGRDYYGSSAPKQAAEYSDYKVDKKDPVAD--------- 183
Query: 139 ASRGNWWQGSLYY 151
S G+WWQGS YY
Sbjct: 184 -SHGDWWQGSFYY 195
>gi|222619540|gb|EEE55672.1| hypothetical protein OsJ_04080 [Oryza sativa Japonica Group]
Length = 156
Score = 37.4 bits (85), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 42/81 (51%), Gaps = 10/81 (12%)
Query: 72 ESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPR-TKATTESHHHVVKKDGDEDD 130
+S +KD Q +S + F SS+ YGG++ Y K +TES+ + D+ D
Sbjct: 85 KSRTVYTKDNKQSNQYDSVDTSCFGSSVNYGGRDYYGISGHKQSTESNDYKA----DKKD 140
Query: 131 PNGNNSNSASRGNWWQGSLYY 151
P+ + S G+WWQGS YY
Sbjct: 141 PSTD-----SHGDWWQGSFYY 156
>gi|297597964|ref|NP_001044806.2| Os01g0848900 [Oryza sativa Japonica Group]
gi|56784045|dbj|BAD82673.1| unknown protein [Oryza sativa Japonica Group]
gi|255673876|dbj|BAF06720.2| Os01g0848900 [Oryza sativa Japonica Group]
Length = 156
Score = 37.0 bits (84), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 42/81 (51%), Gaps = 10/81 (12%)
Query: 72 ESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPR-TKATTESHHHVVKKDGDEDD 130
+S +KD Q +S + F SS+ YGG++ Y K +TES+ + D+ D
Sbjct: 85 KSRTVYTKDNKQSNQYDSVDTSCFGSSVNYGGRDYYGISGHKQSTESNDYKA----DKKD 140
Query: 131 PNGNNSNSASRGNWWQGSLYY 151
P+ + S G+WWQGS YY
Sbjct: 141 PSTD-----SHGDWWQGSFYY 156
>gi|125528383|gb|EAY76497.1| hypothetical protein OsI_04437 [Oryza sativa Indica Group]
Length = 156
Score = 37.0 bits (84), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 42/81 (51%), Gaps = 10/81 (12%)
Query: 72 ESNIFNSKDKSSIYQSESAEPCYFSSSIYYGGQENYSPR-TKATTESHHHVVKKDGDEDD 130
+S +KD Q +S + F SS+ YGG++ Y K +TES+ + D+ D
Sbjct: 85 KSRTVYTKDNKQSNQYDSVDTSCFGSSVNYGGRDYYGISGHKQSTESNDYKA----DKKD 140
Query: 131 PNGNNSNSASRGNWWQGSLYY 151
P+ + S G+WWQGS YY
Sbjct: 141 PSTD-----SHGDWWQGSFYY 156
>gi|357125868|ref|XP_003564611.1| PREDICTED: uncharacterized protein LOC100824578 [Brachypodium
distachyon]
Length = 152
Score = 36.2 bits (82), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 23/60 (38%), Positives = 31/60 (51%), Gaps = 13/60 (21%)
Query: 95 FSSSIYYGGQENY---SPRTKATTESHHHVVKKDGDEDDPNGNNSNSASRGNWWQGSLYY 151
F SS++YGG++ Y S +AT + + V KD D S G+WWQGS YY
Sbjct: 103 FGSSVHYGGRDYYGSCSAPKQATEYNDYKVDNKDPTTD----------SHGDWWQGSFYY 152
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.303 0.122 0.352
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,538,584,021
Number of Sequences: 23463169
Number of extensions: 104909039
Number of successful extensions: 265381
Number of sequences better than 100.0: 626
Number of HSP's better than 100.0 without gapping: 198
Number of HSP's successfully gapped in prelim test: 428
Number of HSP's that attempted gapping in prelim test: 248814
Number of HSP's gapped (non-prelim): 13341
length of query: 151
length of database: 8,064,228,071
effective HSP length: 114
effective length of query: 37
effective length of database: 9,684,394,101
effective search space: 358322581737
effective search space used: 358322581737
T: 11
A: 40
X1: 16 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (21.8 bits)
S2: 71 (32.0 bits)