BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 019804
(335 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|297735250|emb|CBI17612.3| unnamed protein product [Vitis vinifera]
Length = 346
Score = 383 bits (984), Expect = e-104, Method: Compositional matrix adjust.
Identities = 181/225 (80%), Positives = 204/225 (90%), Gaps = 1/225 (0%)
Query: 61 PFTLRVTNDSTTST-ELSIQNQKSEADKIVDGMDFGELCNEFECISSPLVESTARQLVRD 119
PF L+VTND T E+SI++++SEADKIVDGMDFGELCNEFECISSPLVESTARQL RD
Sbjct: 89 PFPLKVTNDPNRRTAEVSIESERSEADKIVDGMDFGELCNEFECISSPLVESTARQLARD 148
Query: 120 ILELREGNRALGTFAVSVKYKDPTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLSTSV 179
ILELREGNRALGTFAVSVKYKDP RSFTGREKYKRRLW T ALD+PSVTVQEMVMLSTSV
Sbjct: 149 ILELREGNRALGTFAVSVKYKDPVRSFTGREKYKRRLWVTDALDDPSVTVQEMVMLSTSV 208
Query: 180 LSIKWTLRGKPKSIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFFW 239
LSIKWT++GKPKS++A+IGGDLI++V S+FTLNQISGQVIEHEE WDLSASS +A+A+FW
Sbjct: 209 LSIKWTIKGKPKSLLASIGGDLIIRVNSQFTLNQISGQVIEHEEFWDLSASSAIAQAYFW 268
Query: 240 ASRRLYATTEAGKDSFDLINNLKSKISTEQENLGIYPDPSGDPTK 284
ASRRL+A TEAGKD D + N S++STE+ENL IYPDPSGDPTK
Sbjct: 269 ASRRLFAATEAGKDFADSVKNWGSRLSTEKENLEIYPDPSGDPTK 313
>gi|225430926|ref|XP_002277168.1| PREDICTED: uncharacterized protein LOC100263239 [Vitis vinifera]
Length = 301
Score = 383 bits (984), Expect = e-104, Method: Compositional matrix adjust.
Identities = 181/225 (80%), Positives = 204/225 (90%), Gaps = 1/225 (0%)
Query: 61 PFTLRVTNDSTTST-ELSIQNQKSEADKIVDGMDFGELCNEFECISSPLVESTARQLVRD 119
PF L+VTND T E+SI++++SEADKIVDGMDFGELCNEFECISSPLVESTARQL RD
Sbjct: 44 PFPLKVTNDPNRRTAEVSIESERSEADKIVDGMDFGELCNEFECISSPLVESTARQLARD 103
Query: 120 ILELREGNRALGTFAVSVKYKDPTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLSTSV 179
ILELREGNRALGTFAVSVKYKDP RSFTGREKYKRRLW T ALD+PSVTVQEMVMLSTSV
Sbjct: 104 ILELREGNRALGTFAVSVKYKDPVRSFTGREKYKRRLWVTDALDDPSVTVQEMVMLSTSV 163
Query: 180 LSIKWTLRGKPKSIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFFW 239
LSIKWT++GKPKS++A+IGGDLI++V S+FTLNQISGQVIEHEE WDLSASS +A+A+FW
Sbjct: 164 LSIKWTIKGKPKSLLASIGGDLIIRVNSQFTLNQISGQVIEHEEFWDLSASSAIAQAYFW 223
Query: 240 ASRRLYATTEAGKDSFDLINNLKSKISTEQENLGIYPDPSGDPTK 284
ASRRL+A TEAGKD D + N S++STE+ENL IYPDPSGDPTK
Sbjct: 224 ASRRLFAATEAGKDFADSVKNWGSRLSTEKENLEIYPDPSGDPTK 268
>gi|255547550|ref|XP_002514832.1| conserved hypothetical protein [Ricinus communis]
gi|223545883|gb|EEF47386.1| conserved hypothetical protein [Ricinus communis]
Length = 312
Score = 380 bits (975), Expect = e-103, Method: Compositional matrix adjust.
Identities = 191/268 (71%), Positives = 215/268 (80%), Gaps = 16/268 (5%)
Query: 19 PTNGGFLYRSHPNSKKCIFTFNPFANSSSSSHHHHHHHHHHSPFTLRVTNDSTTSTELS- 77
P GG RS P+ KCIFT F S+SS + PF LRVTNDS TELS
Sbjct: 26 PATGGLHRRSFPS--KCIFT--SFRPSTSSPRY---------PFLLRVTNDSN-RTELSS 71
Query: 78 -IQNQKSEADKIVDGMDFGELCNEFECISSPLVESTARQLVRDILELREGNRALGTFAVS 136
+ +SEADKIVDGMDFGELCNEFECISSPLVE+TARQL RDILELREGNRALGTFAVS
Sbjct: 72 DVAVGRSEADKIVDGMDFGELCNEFECISSPLVEATARQLARDILELREGNRALGTFAVS 131
Query: 137 VKYKDPTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLSTSVLSIKWTLRGKPKSIIAN 196
V+YKDP RSFTGREKYKR LW T ALDNPSV VQEMVMLSTSVLSIKWT++GKPKS +A+
Sbjct: 132 VRYKDPVRSFTGREKYKRPLWVTGALDNPSVAVQEMVMLSTSVLSIKWTIKGKPKSFLAS 191
Query: 197 IGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFFWASRRLYATTEAGKDSFD 256
GG+LI+KV S+FTLNQISGQVIEHEE+WDLSASSP+A+AFFW SRRL+AT E GKD D
Sbjct: 192 AGGELIIKVNSQFTLNQISGQVIEHEEVWDLSASSPIAQAFFWTSRRLFATLETGKDLSD 251
Query: 257 LINNLKSKISTEQENLGIYPDPSGDPTK 284
+ NL +++ E+ENL I+PDPSGDPTK
Sbjct: 252 FVKNLTTRLPAERENLEIFPDPSGDPTK 279
>gi|147819671|emb|CAN76393.1| hypothetical protein VITISV_020863 [Vitis vinifera]
Length = 298
Score = 375 bits (963), Expect = e-101, Method: Compositional matrix adjust.
Identities = 178/219 (81%), Positives = 199/219 (90%), Gaps = 1/219 (0%)
Query: 67 TNDSTTST-ELSIQNQKSEADKIVDGMDFGELCNEFECISSPLVESTARQLVRDILELRE 125
TNDS T E+SI++++SEADKIVDGMDFGELCNEFECISSPLVESTARQL RDILELRE
Sbjct: 47 TNDSNRRTAEVSIESERSEADKIVDGMDFGELCNEFECISSPLVESTARQLARDILELRE 106
Query: 126 GNRALGTFAVSVKYKDPTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLSTSVLSIKWT 185
GNRALGTFAVSVKYKDP RSFTGREKYKRRLW T ALD+PSVTVQEMVMLSTSVLSIKWT
Sbjct: 107 GNRALGTFAVSVKYKDPVRSFTGREKYKRRLWVTDALDDPSVTVQEMVMLSTSVLSIKWT 166
Query: 186 LRGKPKSIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFFWASRRLY 245
++GKPKS++A+IGGDLI++V S+FTLNQISGQVIEHEE WDLSASS +A+A+FWASRRL+
Sbjct: 167 IKGKPKSLLASIGGDLIIRVNSQFTLNQISGQVIEHEEFWDLSASSAIAQAYFWASRRLF 226
Query: 246 ATTEAGKDSFDLINNLKSKISTEQENLGIYPDPSGDPTK 284
A TEAGKD D + N S++STE ENL IYPDPSGDPTK
Sbjct: 227 AATEAGKDFADSVKNWGSRLSTEXENLEIYPDPSGDPTK 265
>gi|224128790|ref|XP_002328967.1| predicted protein [Populus trichocarpa]
gi|222839201|gb|EEE77552.1| predicted protein [Populus trichocarpa]
Length = 306
Score = 372 bits (955), Expect = e-100, Method: Compositional matrix adjust.
Identities = 191/266 (71%), Positives = 214/266 (80%), Gaps = 16/266 (6%)
Query: 22 GGFLYRSHPNSKKCIFTFNPFANSSSSSHHHHHHHHHHSPFTLRVTNDSTTSTELSIQ-- 79
G F + +S + IFT PF +SS HH PF LRVTNDS+ TELS
Sbjct: 21 GSFHRQCCRSSNRFIFTCRPF----TSSLHH--------PFILRVTNDSS-RTELSPAEP 67
Query: 80 -NQKSEADKIVDGMDFGELCNEFECISSPLVESTARQLVRDILELREGNRALGTFAVSVK 138
N+ SEAD+IV GMDFGELCNEFECISSPLVESTARQLVRDIL+LREGNRALGTFAVSV+
Sbjct: 68 ANESSEADRIVGGMDFGELCNEFECISSPLVESTARQLVRDILQLREGNRALGTFAVSVR 127
Query: 139 YKDPTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLSTSVLSIKWTLRGKPKSIIANIG 198
YKDP RSFTGREKYKR LWAT ALDNPSVTVQEM+MLSTS+LSIKWT+ GKPKS IA +G
Sbjct: 128 YKDPVRSFTGREKYKRPLWATGALDNPSVTVQEMIMLSTSILSIKWTITGKPKSFIAGVG 187
Query: 199 GDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFFWASRRLYATTEAGKDSFDLI 258
GDLIVKV+SKFTLNQISGQVIEHEE WDLSASS V +AFFW SRRL+AT E GKD DL+
Sbjct: 188 GDLIVKVHSKFTLNQISGQVIEHEEFWDLSASSVVDQAFFWTSRRLFATIENGKDLSDLV 247
Query: 259 NNLKSKISTEQENLGIYPDPSGDPTK 284
+L ++ S++ EN+ IYPDPSGDPTK
Sbjct: 248 KSLTNRPSSKNENMEIYPDPSGDPTK 273
>gi|356568505|ref|XP_003552451.1| PREDICTED: uncharacterized protein LOC100791782 [Glycine max]
Length = 307
Score = 353 bits (907), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 187/271 (69%), Positives = 209/271 (77%), Gaps = 10/271 (3%)
Query: 42 FANSSSSSHHHHHHHHHHSPFTLRVTND---STTSTELSIQNQKSEADKIVDGMDFGELC 98
F +SS+ H HH+HS F LRV ND S S++ +I+ SEADK+VDGMDFGELC
Sbjct: 33 FHSSSTCMPSFLHDHHYHS-FLLRVANDADRSEVSSDTAIKTSYSEADKMVDGMDFGELC 91
Query: 99 NEFECISSPLVESTARQLVRDILELREGNRALGTFAVSVKYKDPTRSFTGREKYKRRLWA 158
NEFECISSPLVESTARQL RDILELREGNRALG FAVSV YKDP RSFTGREKYKRRLWA
Sbjct: 92 NEFECISSPLVESTARQLARDILELREGNRALGIFAVSVAYKDPIRSFTGREKYKRRLWA 151
Query: 159 TTALDNPSVTVQEMVMLSTSVLSIKWTLRGKPKSIIANIGGDLIVKVYSKFTLNQISGQV 218
T ALDNPSVTVQEMVMLSTSVLSIKWT+RGKPKS+ +GGDLI++V SKFTLNQISGQV
Sbjct: 152 TGALDNPSVTVQEMVMLSTSVLSIKWTIRGKPKSV---LGGDLIIRVTSKFTLNQISGQV 208
Query: 219 IEHEELWDLSASSPVARAFFWASRRLYATTEAGKDSFDLINNLKSKISTEQENLGIYPDP 278
IEHEE WDLSASS A+AFFW SR L+A E+ KD D NL SK+ST++ENL +YPDP
Sbjct: 209 IEHEEFWDLSASSASAQAFFWTSRALFAAVESVKDLGDSAKNLSSKMSTKKENLEMYPDP 268
Query: 279 SGDPTKVSSDSRCF---CFCFCLTSNVIYIA 306
SGDPTK F + L VIY+
Sbjct: 269 SGDPTKFFQRDDSFQQDAYQIALLLAVIYLV 299
>gi|255638033|gb|ACU19331.1| unknown [Glycine max]
Length = 307
Score = 352 bits (904), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 186/271 (68%), Positives = 208/271 (76%), Gaps = 10/271 (3%)
Query: 42 FANSSSSSHHHHHHHHHHSPFTLRVTND---STTSTELSIQNQKSEADKIVDGMDFGELC 98
F +SS+ H HH+HS F LRV ND S S++ +I+ SEADK+VDGMDFGELC
Sbjct: 33 FHSSSTCMPSFLHDHHYHS-FLLRVANDADRSEVSSDTAIKTSYSEADKMVDGMDFGELC 91
Query: 99 NEFECISSPLVESTARQLVRDILELREGNRALGTFAVSVKYKDPTRSFTGREKYKRRLWA 158
NEFECISSPLVESTARQL RDILELREGNRALG FAVSV YKDP RSFTGREKYKRRLWA
Sbjct: 92 NEFECISSPLVESTARQLARDILELREGNRALGIFAVSVAYKDPIRSFTGREKYKRRLWA 151
Query: 159 TTALDNPSVTVQEMVMLSTSVLSIKWTLRGKPKSIIANIGGDLIVKVYSKFTLNQISGQV 218
T ALDNPSVTVQEMVMLSTSVLSIKWT+RGKPKS+ +GGDLI++V SKFT NQISGQV
Sbjct: 152 TGALDNPSVTVQEMVMLSTSVLSIKWTIRGKPKSV---LGGDLIIRVTSKFTFNQISGQV 208
Query: 219 IEHEELWDLSASSPVARAFFWASRRLYATTEAGKDSFDLINNLKSKISTEQENLGIYPDP 278
IEHEE WDLSASS A+AFFW SR L+A E+ KD D NL SK+ST++ENL +YPDP
Sbjct: 209 IEHEEFWDLSASSASAQAFFWTSRALFAAVESVKDLGDSAKNLSSKMSTKKENLEMYPDP 268
Query: 279 SGDPTKVSSDSRCF---CFCFCLTSNVIYIA 306
SGDPTK F + L VIY+
Sbjct: 269 SGDPTKFFQRDDSFQQDAYQIALLLAVIYLV 299
>gi|449442645|ref|XP_004139091.1| PREDICTED: uncharacterized protein LOC101206988 [Cucumis sativus]
gi|449476192|ref|XP_004154667.1| PREDICTED: uncharacterized protein LOC101230689 [Cucumis sativus]
Length = 305
Score = 350 bits (899), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 177/310 (57%), Positives = 223/310 (71%), Gaps = 21/310 (6%)
Query: 1 MSACACASVASALATAPSPTNGGFLYRSHPNSKKCIFTFNPFANSSSSSHHHHHHHHHHS 60
MS C V+S PSP ++R K P A + S H
Sbjct: 1 MSTC----VSSPSLAVPSPAAASTIFRR----KILRGATTPLAVTPPPSQSRHA------ 46
Query: 61 PFTLRVT-NDSTTSTELSIQN--QKSEADKIVDGMDFGELCNEFECISSPLVESTARQLV 117
F LR NDS ++ +L+ ++ ++SEADKIVDGMDFGELCNEFECISSPLVESTARQL
Sbjct: 47 -FPLRAAANDSNSTEQLTAESAVERSEADKIVDGMDFGELCNEFECISSPLVESTARQLA 105
Query: 118 RDILELREGNRALGTFAVSVKYKDPTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLST 177
RDIL+LR+G+R+LG FAV VKYKDP R FTGREKYKR+LWAT+ALDNP+ +VQEMVM+ST
Sbjct: 106 RDILQLRQGDRSLGNFAVFVKYKDPIRKFTGREKYKRQLWATSALDNPTTSVQEMVMMST 165
Query: 178 SVLSIKWTLRGKPKSIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAF 237
S+L IKWT++GKPKS++A IGGDLI+KV S+FTLNQISGQVIEHEE WD+S+SS +++AF
Sbjct: 166 SILKIKWTIKGKPKSLVAAIGGDLIIKVDSQFTLNQISGQVIEHEESWDVSSSSAISQAF 225
Query: 238 FWASRRLYATTEAGKDSFDLINNLKSKISTEQENLGIYPDPSGDPTKVSSDSRCF---CF 294
FWASR L+A+ EAGKD D +++L ++STE++NL ++PDPSGDPTK F +
Sbjct: 226 FWASRYLFASAEAGKDLGDSVSSLTGRVSTEKQNLEMFPDPSGDPTKFFQGEDNFQKDAY 285
Query: 295 CFCLTSNVIY 304
F L +IY
Sbjct: 286 QFALLLAIIY 295
>gi|363807240|ref|NP_001242357.1| uncharacterized protein LOC100780920 precursor [Glycine max]
gi|255639967|gb|ACU20276.1| unknown [Glycine max]
Length = 307
Score = 346 bits (887), Expect = 9e-93, Method: Compositional matrix adjust.
Identities = 183/270 (67%), Positives = 206/270 (76%), Gaps = 9/270 (3%)
Query: 42 FANSSSSSHHHHHHHHHHSPFTLRVTND---STTSTELSIQNQKSEADKIVDGMDFGELC 98
F +SSS+ H HH+ F LRV ND S S++ + + SEADK+VDGMDFGELC
Sbjct: 32 FFHSSSTCMPSSLHDHHYHSFLLRVANDADRSEMSSDTATEASYSEADKMVDGMDFGELC 91
Query: 99 NEFECISSPLVESTARQLVRDILELREGNRALGTFAVSVKYKDPTRSFTGREKYKRRLWA 158
NEFECISSPLVESTARQL RDILELRE NRALG FAVSV YKDP RSFTGREKYKRRLWA
Sbjct: 92 NEFECISSPLVESTARQLARDILELREENRALGIFAVSVSYKDPIRSFTGREKYKRRLWA 151
Query: 159 TTALDNPSVTVQEMVMLSTSVLSIKWTLRGKPKSIIANIGGDLIVKVYSKFTLNQISGQV 218
T+ALDNPSVTVQEMVMLSTSVLSI+WT+RGKPKS+ +GGDLI++V SKFTLNQ SGQV
Sbjct: 152 TSALDNPSVTVQEMVMLSTSVLSIRWTIRGKPKSV---LGGDLILRVTSKFTLNQTSGQV 208
Query: 219 IEHEELWDLSASSPVARAFFWASRRLYATTEAGKDSFDLINNLKSKISTEQENLGIYPDP 278
IEHEE WDLSASS A+AFFW SR L+A E+ KD D NL SKIST++ENL +YPDP
Sbjct: 209 IEHEEFWDLSASSASAQAFFWTSRALFAAVESVKDLGDNAKNLSSKISTKKENLEMYPDP 268
Query: 279 SGDPTKVSSDSRCF---CFCFCLTSNVIYI 305
SGDPTK F + L VIY+
Sbjct: 269 SGDPTKFFQRDDSFQQDVYQIALLLAVIYL 298
>gi|388521943|gb|AFK49033.1| unknown [Lotus japonicus]
Length = 309
Score = 344 bits (882), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 169/233 (72%), Positives = 194/233 (83%), Gaps = 4/233 (1%)
Query: 55 HHHHHSPFTLRVTNDSTTS---TELSIQNQKSEADKIVDGMDFGELCNEFECISSPLVES 111
H HHHS F LRV ND+ + ++ + + S+ DKIVDG+DFGELCN+FECISSPLVES
Sbjct: 45 HQHHHS-FLLRVANDADRTELPSDAATETSYSDVDKIVDGLDFGELCNDFECISSPLVES 103
Query: 112 TARQLVRDILELREGNRALGTFAVSVKYKDPTRSFTGREKYKRRLWATTALDNPSVTVQE 171
TARQLVRDILELREGNRALGTFA SV+YKDP RSFTGR+KYKR LWAT+AL+NPSVTVQE
Sbjct: 104 TARQLVRDILELREGNRALGTFAFSVRYKDPIRSFTGRDKYKRSLWATSALENPSVTVQE 163
Query: 172 MVMLSTSVLSIKWTLRGKPKSIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASS 231
MVMLSTSVLSIKWT++GKPKS IA +GGD+I+++ SKFTLNQISGQVIEHEE WDLSASS
Sbjct: 164 MVMLSTSVLSIKWTMKGKPKSFIARVGGDVILRITSKFTLNQISGQVIEHEEFWDLSASS 223
Query: 232 PVARAFFWASRRLYATTEAGKDSFDLINNLKSKISTEQENLGIYPDPSGDPTK 284
A AFFW SR L+AT E+ KD D N+ S +E+ENL IYPDPSGDPTK
Sbjct: 224 ASAHAFFWTSRALFATVESVKDLGDSAKNMSSNFESEKENLEIYPDPSGDPTK 276
>gi|357507799|ref|XP_003624188.1| hypothetical protein MTR_7g080220 [Medicago truncatula]
gi|355499203|gb|AES80406.1| hypothetical protein MTR_7g080220 [Medicago truncatula]
Length = 308
Score = 320 bits (819), Expect = 9e-85, Method: Compositional matrix adjust.
Identities = 160/226 (70%), Positives = 184/226 (81%), Gaps = 5/226 (2%)
Query: 64 LRVTNDSTT---STELSIQNQKSEA-DKIVDGMDFGELCNEFECISSPLVESTARQLVRD 119
LRV ND+ S++ + +N S+A DK+VD MDFGELCNEFEC SSP VESTARQL RD
Sbjct: 50 LRVANDADRTELSSDTTTENSYSDAADKMVDSMDFGELCNEFECNSSPSVESTARQLARD 109
Query: 120 ILELREGNRALGTFAVSVKYKDPTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLSTSV 179
I ELR GNRALGT+AVSV YKDP RSF+GREKYKR LWA TAL+NPSVTVQEM MLSTSV
Sbjct: 110 IFELRSGNRALGTYAVSVTYKDPIRSFSGREKYKRPLWAITALENPSVTVQEMTMLSTSV 169
Query: 180 LSIKWTLRGKPKSIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFFW 239
LSIKWT+RGKPKSI+A +GG LI++V SKFTLNQISGQVI+HEELWDLS+S A+AFFW
Sbjct: 170 LSIKWTIRGKPKSILAGVGGGLILRVTSKFTLNQISGQVIKHEELWDLSSSPASAQAFFW 229
Query: 240 ASRRLYATTEAGKDSFDLINNLKSKIST-EQENLGIYPDPSGDPTK 284
+SR L+AT E+ KD D NL + ST ++EN IYPDPSGDPTK
Sbjct: 230 SSRVLFATVESVKDLADSAKNLSASFSTKKEENSDIYPDPSGDPTK 275
>gi|22330431|ref|NP_176702.2| uncharacterized protein [Arabidopsis thaliana]
gi|20466568|gb|AAM20601.1| unknown protein [Arabidopsis thaliana]
gi|23198128|gb|AAN15591.1| unknown protein [Arabidopsis thaliana]
gi|332196225|gb|AEE34346.1| uncharacterized protein [Arabidopsis thaliana]
Length = 286
Score = 317 bits (811), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 154/228 (67%), Positives = 185/228 (81%), Gaps = 7/228 (3%)
Query: 57 HHHSPFTLRVTNDSTTSTELSIQNQKSEADKIVDGMDFGELCNEFECISSPLVESTARQL 116
HH P +V NDST TE+SI KSE DK+VD +DFGELCN+FEC SSP VESTARQL
Sbjct: 33 HHRLP---QVMNDST-RTEVSID--KSEVDKLVDKIDFGELCNDFECTSSPQVESTARQL 86
Query: 117 VRDILELREGNRALGTFAVSVKYKDPTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLS 176
VRDILE+REGNRA +AVSVKYKDP RSFTGREKYKR +W T+ L+NP+VTVQEMVMLS
Sbjct: 87 VRDILEIREGNRAFACYAVSVKYKDPVRSFTGREKYKRPMWITSGLENPTVTVQEMVMLS 146
Query: 177 TSVLSIKWTLRGKPKSIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARA 236
TSVL IKWT++GKPKSI+A + GDLIVKV S+FTLNQISGQV EHEE WDLS+SSP+A+A
Sbjct: 147 TSVLRIKWTVKGKPKSILAAVSGDLIVKVKSEFTLNQISGQVFEHEESWDLSSSSPIAQA 206
Query: 237 FFWASRRLYATTEAGKDSFDLINNLKSKISTEQENLGIYPDPSGDPTK 284
+FW SRRL+A +E+ KD D+ +L + ++T +E+ IY DP+ DP K
Sbjct: 207 YFWTSRRLFAASESAKDVADVTKDLTANLTTRKEDTDIYRDPT-DPNK 253
>gi|297837895|ref|XP_002886829.1| hypothetical protein ARALYDRAFT_475533 [Arabidopsis lyrata subsp.
lyrata]
gi|297332670|gb|EFH63088.1| hypothetical protein ARALYDRAFT_475533 [Arabidopsis lyrata subsp.
lyrata]
Length = 286
Score = 316 bits (810), Expect = 9e-84, Method: Compositional matrix adjust.
Identities = 155/228 (67%), Positives = 185/228 (81%), Gaps = 7/228 (3%)
Query: 57 HHHSPFTLRVTNDSTTSTELSIQNQKSEADKIVDGMDFGELCNEFECISSPLVESTARQL 116
HH P +V NDST TE+SI KSE DK+VD +DFGELCN+FEC SSP VESTARQL
Sbjct: 33 HHRLP---QVMNDST-RTEVSID--KSEVDKLVDRIDFGELCNDFECTSSPQVESTARQL 86
Query: 117 VRDILELREGNRALGTFAVSVKYKDPTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLS 176
VRDILELREGNRA +AVSVKYKDP RSFTGREKYKR +W T+ L+NP+VTVQEMVMLS
Sbjct: 87 VRDILELREGNRAFACYAVSVKYKDPVRSFTGREKYKRPMWITSGLENPTVTVQEMVMLS 146
Query: 177 TSVLSIKWTLRGKPKSIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARA 236
TSVL IKWT++GKPKSI+A + GDLI+KV S+FTLNQISGQVIEHEE WDLS+SSP+A+A
Sbjct: 147 TSVLRIKWTVKGKPKSILAAVSGDLIIKVKSEFTLNQISGQVIEHEESWDLSSSSPIAQA 206
Query: 237 FFWASRRLYATTEAGKDSFDLINNLKSKISTEQENLGIYPDPSGDPTK 284
+FW SRRL+A +E+ KD D+ +L + ++T E+ IY DP+ DP K
Sbjct: 207 YFWTSRRLFAASESAKDLSDVTKDLTANLTTRTEDTDIYRDPT-DPNK 253
>gi|4646202|gb|AAD26875.1|AC007230_9 ESTs gb|H76289 and gb|H76537 come from this gene [Arabidopsis
thaliana]
Length = 218
Score = 315 bits (807), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 152/219 (69%), Positives = 182/219 (83%), Gaps = 4/219 (1%)
Query: 68 NDSTTSTELSIQNQKSEADKIVDGMDFGELCNEFECISSPLVESTARQLVRDILELREGN 127
NDST TE+SI KSE DK+VD +DFGELCN+FEC SSP VESTARQLVRDILE+REGN
Sbjct: 2 NDST-RTEVSID--KSEVDKLVDKIDFGELCNDFECTSSPQVESTARQLVRDILEIREGN 58
Query: 128 RALGTFAVSVKYKDPTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLSTSVLSIKWTLR 187
RA +AVSVKYKDP RSFTGREKYKR +W T+ L+NP+VTVQEMVMLSTSVL IKWT++
Sbjct: 59 RAFACYAVSVKYKDPVRSFTGREKYKRPMWITSGLENPTVTVQEMVMLSTSVLRIKWTVK 118
Query: 188 GKPKSIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFFWASRRLYAT 247
GKPKSI+A + GDLIVKV S+FTLNQISGQV EHEE WDLS+SSP+A+A+FW SRRL+A
Sbjct: 119 GKPKSILAAVSGDLIVKVKSEFTLNQISGQVFEHEESWDLSSSSPIAQAYFWTSRRLFAA 178
Query: 248 TEAGKDSFDLINNLKSKISTEQENLGIYPDPSGDPTKVS 286
+E+ KD D+ +L + ++T +E+ IY DP+ DP KVS
Sbjct: 179 SESAKDVADVTKDLTANLTTRKEDTDIYRDPT-DPNKVS 216
>gi|357507801|ref|XP_003624189.1| hypothetical protein MTR_7g080220 [Medicago truncatula]
gi|355499204|gb|AES80407.1| hypothetical protein MTR_7g080220 [Medicago truncatula]
Length = 231
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 149/198 (75%), Positives = 167/198 (84%), Gaps = 1/198 (0%)
Query: 88 IVDGMDFGELCNEFECISSPLVESTARQLVRDILELREGNRALGTFAVSVKYKDPTRSFT 147
+VD MDFGELCNEFEC SSP VESTARQL RDI ELR GNRALGT+AVSV YKDP RSF+
Sbjct: 1 MVDSMDFGELCNEFECNSSPSVESTARQLARDIFELRSGNRALGTYAVSVTYKDPIRSFS 60
Query: 148 GREKYKRRLWATTALDNPSVTVQEMVMLSTSVLSIKWTLRGKPKSIIANIGGDLIVKVYS 207
GREKYKR LWA TAL+NPSVTVQEM MLSTSVLSIKWT+RGKPKSI+A +GG LI++V S
Sbjct: 61 GREKYKRPLWAITALENPSVTVQEMTMLSTSVLSIKWTIRGKPKSILAGVGGGLILRVTS 120
Query: 208 KFTLNQISGQVIEHEELWDLSASSPVARAFFWASRRLYATTEAGKDSFDLINNLKSKIST 267
KFTLNQISGQVI+HEELWDLS+S A+AFFW+SR L+AT E+ KD D NL + ST
Sbjct: 121 KFTLNQISGQVIKHEELWDLSSSPASAQAFFWSSRVLFATVESVKDLADSAKNLSASFST 180
Query: 268 -EQENLGIYPDPSGDPTK 284
++EN IYPDPSGDPTK
Sbjct: 181 KKEENSDIYPDPSGDPTK 198
>gi|357126446|ref|XP_003564898.1| PREDICTED: uncharacterized protein LOC100833780 [Brachypodium
distachyon]
Length = 302
Score = 302 bits (774), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 140/207 (67%), Positives = 173/207 (83%), Gaps = 2/207 (0%)
Query: 80 NQKSEADKIVDGMDFGELCNEFECISSPLVESTARQLVRDILELREGNRALGTFAVSVKY 139
+++SEAD+++DGMDFGELC++FECISSP VESTARQ+VRDILE+REGNRAL +AV+VKY
Sbjct: 63 DERSEADRLIDGMDFGELCDDFECISSPYVESTARQIVRDILEIREGNRALSCYAVAVKY 122
Query: 140 KDPTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLSTSVLSIKWTLRGKP-KSIIANIG 198
KDP RSF GREKYKR LW T AL+NP+VTVQEM M STS L+IKWTLRGKP I + I
Sbjct: 123 KDPLRSFVGREKYKRPLWITEALENPTVTVQEMSMQSTSTLTIKWTLRGKPSNQIFSAIS 182
Query: 199 GDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFFWASRRLYATTEAGKDSFDLI 258
G+LI++V S+F LNQISGQV+EH E WDLS SSP A+A+FW SRR+Y+T EAGKD+ +
Sbjct: 183 GELIIRVDSQFVLNQISGQVLEHVESWDLSGSSPPAQAYFWLSRRVYSTVEAGKDTVEAA 242
Query: 259 NNLKSKIS-TEQENLGIYPDPSGDPTK 284
+L S++S + +NL +YPDPSGDPTK
Sbjct: 243 KDLASRLSENKDQNLEVYPDPSGDPTK 269
>gi|115441773|ref|NP_001045166.1| Os01g0912700 [Oryza sativa Japonica Group]
gi|56784400|dbj|BAD82439.1| unknown protein [Oryza sativa Japonica Group]
gi|113534697|dbj|BAF07080.1| Os01g0912700 [Oryza sativa Japonica Group]
gi|215687175|dbj|BAG90945.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218189587|gb|EEC72014.1| hypothetical protein OsI_04885 [Oryza sativa Indica Group]
gi|222619738|gb|EEE55870.1| hypothetical protein OsJ_04510 [Oryza sativa Japonica Group]
Length = 312
Score = 294 bits (752), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 139/206 (67%), Positives = 168/206 (81%), Gaps = 2/206 (0%)
Query: 81 QKSEADKIVDGMDFGELCNEFECISSPLVESTARQLVRDILELREGNRALGTFAVSVKYK 140
++S+ DK+VDGMDFGELCN+FECISSP VE+TARQL RDIL+LR+ NRA +AVSVKYK
Sbjct: 73 ERSQTDKLVDGMDFGELCNDFECISSPYVEATARQLARDILDLRDDNRAFTCYAVSVKYK 132
Query: 141 DPTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLSTSVLSIKWTLRGKPKS-IIANIGG 199
DP R+F GREKYKR LW T AL+NP+VTVQEM M STS L+IKWT RGKPK+ I A IGG
Sbjct: 133 DPVRTFVGREKYKRPLWITKALENPTVTVQEMSMQSTSNLTIKWTFRGKPKNPIFATIGG 192
Query: 200 DLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFFWASRRLYATTEAGKDSFDLIN 259
DLIV V S+F LNQISGQV+E + WDLSASSP A+A+FW SRR ++T EAGKD+ +
Sbjct: 193 DLIVSVTSQFVLNQISGQVLEQVDSWDLSASSPPAQAYFWLSRRAFSTVEAGKDTIEAAK 252
Query: 260 NLKSKISTEQ-ENLGIYPDPSGDPTK 284
S++S+++ ENL +YPDPSGDPTK
Sbjct: 253 GTASRLSSKKDENLEVYPDPSGDPTK 278
>gi|326515272|dbj|BAK03549.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 301
Score = 293 bits (750), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 138/222 (62%), Positives = 176/222 (79%), Gaps = 7/222 (3%)
Query: 64 LRVTNDSTTSTELSIQNQKSEADKIVDGMDFGELCNEFECISSPLVESTARQLVRDILEL 123
L V +DS +Q +SE D++VDG+DFGELCN+FECISSP VESTARQ+ RDILE+
Sbjct: 53 LHVVDDSK-----EVQTARSETDRLVDGLDFGELCNDFECISSPYVESTARQIARDILEI 107
Query: 124 REGNRALGTFAVSVKYKDPTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLSTSVLSIK 183
R+ NRAL +AV+VKYKDP RSF GREKYKR LW T AL+ P+VTVQEM M STS+L+IK
Sbjct: 108 RQDNRALSCYAVAVKYKDPLRSFVGREKYKRPLWITEALEKPTVTVQEMSMQSTSMLTIK 167
Query: 184 WTLRGKPK-SIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFFWASR 242
WTLRGKPK + A GG+L+++V S+F LNQISGQV+EH E WDLS+SSP+A+A+FW SR
Sbjct: 168 WTLRGKPKNAFFAAAGGELVLRVDSQFVLNQISGQVLEHAESWDLSSSSPLAQAYFWFSR 227
Query: 243 RLYATTEAGKDSFDLINNLKSKISTEQENLGIYPDPSGDPTK 284
R+Y+T EAGKD+ + + S+++ + +NL +YPDP GDPTK
Sbjct: 228 RVYSTVEAGKDTIEAAKGVASRLNKD-DNLEVYPDPLGDPTK 268
>gi|300681438|emb|CBH32530.1| conserved hypothetical protein, expressed [Triticum aestivum]
Length = 296
Score = 286 bits (732), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 133/204 (65%), Positives = 167/204 (81%), Gaps = 2/204 (0%)
Query: 82 KSEADKIVDGMDFGELCNEFECISSPLVESTARQLVRDILELREGNRALGTFAVSVKYKD 141
+SE D++VDG++FGELCN+FECISSP VESTARQ+ RDILE+R+ NRAL +AV+VKYKD
Sbjct: 61 RSETDRLVDGLNFGELCNDFECISSPYVESTARQIARDILEIRQDNRALSCYAVAVKYKD 120
Query: 142 PTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLSTSVLSIKWTLRGKPK-SIIANIGGD 200
P RSF GREKYKR LW T AL+ P+VTVQEM M STS L+IKWTLRGKPK + A GG+
Sbjct: 121 PLRSFVGREKYKRPLWITEALEKPTVTVQEMSMQSTSTLTIKWTLRGKPKNAFFAAAGGE 180
Query: 201 LIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFFWASRRLYATTEAGKDSFDLINN 260
L+V+V S+F LNQISGQV+EH E WDLSASSP+A+ +FW SRR+Y+T EAGKD+ +
Sbjct: 181 LVVRVDSQFVLNQISGQVLEHFESWDLSASSPLAQTYFWFSRRVYSTVEAGKDTIEAAKG 240
Query: 261 LKSKISTEQENLGIYPDPSGDPTK 284
+ S+++ + +NL +YPDP GDPTK
Sbjct: 241 VVSRLNKD-DNLEVYPDPLGDPTK 263
>gi|219363629|ref|NP_001136449.1| uncharacterized protein LOC100216557 [Zea mays]
gi|194695746|gb|ACF81957.1| unknown [Zea mays]
gi|413951566|gb|AFW84215.1| hypothetical protein ZEAMMB73_334715 [Zea mays]
Length = 302
Score = 281 bits (720), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 141/241 (58%), Positives = 174/241 (72%), Gaps = 10/241 (4%)
Query: 47 SSSHHHHHHHHHHSPFTLRVTNDSTTSTELSIQNQKSEADKIVDGMDFGELCNEFECISS 106
S +H +P L V +DS E ++SE DK+VDGMDFGELC++FECISS
Sbjct: 35 SGAHSPPRRRLRLAP--LHVVDDSKNPPE-----ERSETDKMVDGMDFGELCSDFECISS 87
Query: 107 PLVESTARQLVRDILELREGNRALGTFAVSVKYKDPTRSFTGREKYKRRLWATTALDNPS 166
P VESTARQ+ RDILELRE NRA +AVSVKYKDP R+F GREKY+R LW T AL+NP
Sbjct: 88 PYVESTARQVARDILELREDNRAFNCYAVSVKYKDPLRTFVGREKYRRPLWITKALENPV 147
Query: 167 VTVQEMVMLSTSVLSIKWTLRGKPKS-IIANIGGDLIVKVYSKFTLNQISGQVIEHEELW 225
VTVQEM M STS L+I+W LRGKPK+ + A++GGD+IV+V S+F LNQISGQV+E E W
Sbjct: 148 VTVQEMSMQSTSNLTIRWALRGKPKNPLFASVGGDVIVRVESRFVLNQISGQVLEQAESW 207
Query: 226 DLSASSPVARAFFWASRRLYATTEAGKDSFDLINNLKSKISTEQ--ENLGIYPDPSGDPT 283
DLSASS A+A+FW SRR+Y+ E+GKD+ + + S T + +NL YPDPS DPT
Sbjct: 208 DLSASSLPAQAYFWLSRRVYSAVESGKDAIEAAKSAASSGLTTKGDQNLEAYPDPSADPT 267
Query: 284 K 284
K
Sbjct: 268 K 268
>gi|242055359|ref|XP_002456825.1| hypothetical protein SORBIDRAFT_03g043540 [Sorghum bicolor]
gi|241928800|gb|EES01945.1| hypothetical protein SORBIDRAFT_03g043540 [Sorghum bicolor]
Length = 302
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 134/220 (60%), Positives = 165/220 (75%), Gaps = 5/220 (2%)
Query: 66 VTNDSTTSTELSIQNQKSEADKIVDGMDFGELCNEFECISSPLVESTARQLVRDILELRE 125
V +DS+T + ++S DK+VDGMDFGELCN+FECISSP VESTARQ+ RDILELR+
Sbjct: 53 VVDDSSTD----LPEERSRTDKMVDGMDFGELCNDFECISSPYVESTARQIARDILELRD 108
Query: 126 GNRALGTFAVSVKYKDPTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLSTSVLSIKWT 185
NRA +AVSVKYKDP R+F GREKYKR LW T AL+ P VTVQEM M STS L+IKW
Sbjct: 109 DNRAFNCYAVSVKYKDPLRTFVGREKYKRPLWITKALEKPVVTVQEMSMQSTSNLTIKWA 168
Query: 186 LRGKPKS-IIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFFWASRRL 244
RGKPK+ A +GGD+IV+V S+F LNQISGQV+E + WDLSASS A+A+FW SRR+
Sbjct: 169 FRGKPKNPFFATMGGDVIVRVVSQFVLNQISGQVLEQVDSWDLSASSFPAQAYFWLSRRV 228
Query: 245 YATTEAGKDSFDLINNLKSKISTEQENLGIYPDPSGDPTK 284
Y+T E+GKD+ + + S ++ +N YPDPS DPTK
Sbjct: 229 YSTVESGKDTIEAAKSTTSGLTKGDQNFEAYPDPSSDPTK 268
>gi|302790600|ref|XP_002977067.1| hypothetical protein SELMODRAFT_106566 [Selaginella moellendorffii]
gi|300155043|gb|EFJ21676.1| hypothetical protein SELMODRAFT_106566 [Selaginella moellendorffii]
Length = 260
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 128/219 (58%), Positives = 163/219 (74%), Gaps = 8/219 (3%)
Query: 70 STTSTELSIQNQKSEADKIVDGMDFGELCNEFECISSPLVESTARQLVRDILELREGNRA 129
STT + ++ ++S ADK+VDGM FG+LC+EF+CISSP VEST RQL RDILE+R+GNRA
Sbjct: 13 STTKSPPPVELERSPADKLVDGMSFGQLCDEFQCISSPSVESTCRQLARDILEIRDGNRA 72
Query: 130 LGTFAVSVKYKDPTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLSTSVLSIKWTLRGK 189
LGT A VKYKDP RSF GR KY R W TAL+NP+V V++M M+STSVL I+WT++G+
Sbjct: 73 LGTLAYFVKYKDPLRSFQGRLKYNRPSWTRTALENPTVAVRQMEMVSTSVLRIQWTIQGR 132
Query: 190 PKSIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFFWASRRLYATTE 249
PK A+ GD++V V S FTLNQISGQV+EH E WDLS +SP+A+A+FW SR Y+T E
Sbjct: 133 PKLAAASFVGDVLVNVTSTFTLNQISGQVLEHREEWDLSGNSPLAQAYFWLSRIAYSTVE 192
Query: 250 AGKDSFDLINNLKSKISTEQENLG----IYPDPSGDPTK 284
AGKD+ +L+ + I +NLG IY DP+ DP K
Sbjct: 193 AGKDTSELVQGVSKVID---KNLGDDSSIYVDPT-DPKK 227
>gi|168049027|ref|XP_001776966.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162671667|gb|EDQ58215.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 237
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 117/205 (57%), Positives = 156/205 (76%), Gaps = 1/205 (0%)
Query: 81 QKSEADKIVDGMDFGELCNEFECISSPLVESTARQLVRDILELREGNRALGTFAVSVKYK 140
+K+E D++++G+ FG+LC+EFECISSP VE TARQLV+DI++LREG R+L +F V+VKYK
Sbjct: 1 EKTEVDRLMEGLSFGQLCDEFECISSPAVEKTARQLVKDIMDLREGRRSLSSFGVNVKYK 60
Query: 141 DPTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLSTSVLSIKWTLRGKPKSIIAN-IGG 199
DP RSF GR+KY+ W +TAL+ PSV V+EM MLSTSVL+IKWTL G PK A+ +GG
Sbjct: 61 DPLRSFKGRDKYRSANWISTALEKPSVAVREMKMLSTSVLNIKWTLTGTPKLPPASALGG 120
Query: 200 DLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFFWASRRLYATTEAGKDSFDLIN 259
+++ V S FT+NQISGQV+ HE+ WDLSAS P A+A+FW +R ++ E GK+ D +
Sbjct: 121 KVVLAVNSTFTMNQISGQVVLHEDQWDLSASDPAAQAYFWTTRLAFSAVEGGKNLADGVQ 180
Query: 260 NLKSKISTEQENLGIYPDPSGDPTK 284
+ ++ QEN IYPDPSGDP K
Sbjct: 181 GMAKQLDKGQENNSIYPDPSGDPRK 205
>gi|302763149|ref|XP_002964996.1| hypothetical protein SELMODRAFT_83381 [Selaginella moellendorffii]
gi|300167229|gb|EFJ33834.1| hypothetical protein SELMODRAFT_83381 [Selaginella moellendorffii]
Length = 260
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 126/219 (57%), Positives = 161/219 (73%), Gaps = 8/219 (3%)
Query: 70 STTSTELSIQNQKSEADKIVDGMDFGELCNEFECISSPLVESTARQLVRDILELREGNRA 129
STT + ++ ++S ADK+VDGM FG+LC+EF+CISSP VE+T RQL RDILE+R+GNRA
Sbjct: 13 STTKSPPPVELERSPADKLVDGMSFGQLCDEFQCISSPSVEATCRQLARDILEIRDGNRA 72
Query: 130 LGTFAVSVKYKDPTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLSTSVLSIKWTLRGK 189
LGT A VKYKDP RSF GR KY R W TAL+NP+V V++M M+STSVL I+ T++G+
Sbjct: 73 LGTLAYFVKYKDPLRSFQGRLKYNRPSWTRTALENPTVAVRQMEMVSTSVLRIQCTIQGR 132
Query: 190 PKSIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFFWASRRLYATTE 249
PK A+ GD+ V V S FTLNQISGQV+EH E WDLS +SP+A+A+FW SR Y+T E
Sbjct: 133 PKLAAASFVGDVRVNVTSTFTLNQISGQVLEHREEWDLSGNSPLAQAYFWFSRIAYSTVE 192
Query: 250 AGKDSFDLINNLKSKISTEQENLG----IYPDPSGDPTK 284
AGKD+ +L+ + I +NLG IY DP+ DP K
Sbjct: 193 AGKDTSELVQGVSKVID---KNLGDDSNIYVDPT-DPKK 227
>gi|168031127|ref|XP_001768073.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680711|gb|EDQ67145.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 271
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 117/223 (52%), Positives = 157/223 (70%), Gaps = 3/223 (1%)
Query: 65 RVTNDSTTSTELSIQNQKSEADKIVDGMDFGELCNEFECISSPLVESTARQLVRDILELR 124
R D T++ + +K+E D+++DG+ FG+LC+EFECISSP VE TARQLV+DI++LR
Sbjct: 16 RGPEDGATTSVVDKTAEKTEVDRLMDGLSFGQLCDEFECISSPAVERTARQLVKDIIDLR 75
Query: 125 EGNRALGTFAVSVKYKDPTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLSTSVLSIKW 184
EG R+L F V+V+YKDP RSF GR+KYK W TAL+ P+V V+EM M+STSVL IKW
Sbjct: 76 EGERSLSNFGVNVEYKDPLRSFKGRDKYKGANWIKTALEKPTVAVREMKMISTSVLKIKW 135
Query: 185 TLRGKPKSIIAN-IGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFFWASRR 243
T+ G PK A+ +GG +++ V S FT+NQISGQV HE+ WDLSAS P A+A+FW +R
Sbjct: 136 TVTGTPKLPPASALGGRVVLAVSSTFTMNQISGQVTLHEDEWDLSASDPAAQAYFWTARL 195
Query: 244 LYATTEAGKDSFDLINNLKSKISTEQEN--LGIYPDPSGDPTK 284
++ E GK+ + + ++ QEN IYPDPSGDP K
Sbjct: 196 AFSAVEGGKNFASGVQGIAKQLDKGQENNSNSIYPDPSGDPRK 238
>gi|255086449|ref|XP_002509191.1| predicted protein [Micromonas sp. RCC299]
gi|226524469|gb|ACO70449.1| predicted protein [Micromonas sp. RCC299]
Length = 214
Score = 107 bits (267), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 68/192 (35%), Positives = 99/192 (51%), Gaps = 10/192 (5%)
Query: 96 ELCNEFECISSPLVESTARQLVRDILELREGNRALGTFAVSVKYKDPTRSFTGREKYKRR 155
++C++F C SSP VES+ RQ+ DI +RE R+L FAV VKY D R F GRE +
Sbjct: 1 QICDDFVCKSSPAVESSLRQIATDICAIREDKRSLNPFAVDVKYDDGERKFEGREGFANH 60
Query: 156 LWATTALDNPSVTVQEMVMLSTSVLSIKWTLRGKPKSIIANIGGDLIVKVYSKFTLNQIS 215
+ +D+ V EM M +I W L+G+ N GG + V+V + LN I+
Sbjct: 61 TYIKDNVDDARAAVTEMRMGDLDAATIVWRLQGR------NSGGAIDVEVTTTLRLNLIT 114
Query: 216 GQVIEHEELWDLSASSPVARAFFWASRRLYATTEAGKD-SFDLINNLKSKI--STEQENL 272
G+ E E WD + S A A +SR+ A + D + L +++ K S + E
Sbjct: 115 GRATEVTERWDPAGSDAGAAAVLASSRKATALPKNIADAAARLGKSIEEKFGSSDDGEMK 174
Query: 273 GIYPDPSGDPTK 284
+Y DP+ DP K
Sbjct: 175 DVYVDPN-DPMK 185
>gi|303284547|ref|XP_003061564.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226456894|gb|EEH54194.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 313
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 64/199 (32%), Positives = 98/199 (49%), Gaps = 17/199 (8%)
Query: 96 ELCNEFECISSPLVESTARQLVRDILELREGNRALGTFAVSVKYKDPTRSFTGREKYKRR 155
++C+EF C SSP VE+ RQ+ D+ LRE R+L FA ++Y D R F GRE +K
Sbjct: 86 QICDEFVCKSSPAVEAWLRQIATDVCALRENARSLTPFANDIEYDDGARVFKGREGFKNH 145
Query: 156 LWATTALDNPSVTVQEMVMLSTSVLSIKWTLRGKPKSIIANIGGDLIVKVYSKFTLNQIS 215
+ + NP+ V+ M M + I+W LRG+ GG L K+ S T+N I+
Sbjct: 146 AFIAENIGNPAAAVESMRMNAVDEAVIEWRLRGQ------TPGGALDAKITSTLTMNLIT 199
Query: 216 GQVIEHEELWDLS--ASSPVARAFFWASRRLYATTE--------AGKDSFDLINNLKSKI 265
G+ E WD++ S P A AF ++R+ A + K+ D + ++ +
Sbjct: 200 GRATRWTEAWDVAGGGSDPGAAAFVASTRKASALPKNAADAATAVAKELDDALGSVAGSL 259
Query: 266 STEQENLGIYPDPSGDPTK 284
E + DP+ DP K
Sbjct: 260 GLGGEEREVQVDPN-DPMK 277
>gi|194695636|gb|ACF81902.1| unknown [Zea mays]
Length = 90
Score = 89.7 bits (221), Expect = 2e-15, Method: Composition-based stats.
Identities = 40/63 (63%), Positives = 50/63 (79%)
Query: 81 QKSEADKIVDGMDFGELCNEFECISSPLVESTARQLVRDILELREGNRALGTFAVSVKYK 140
++S+ DK+VDGMDF ELC++FECISSP VES RQ+ RDI ELRE NRA +AV VKY+
Sbjct: 28 ERSQTDKMVDGMDFEELCSDFECISSPYVESIMRQVARDIFELREDNRAFSCYAVPVKYE 87
Query: 141 DPT 143
P+
Sbjct: 88 APS 90
>gi|194694442|gb|ACF81305.1| unknown [Zea mays]
Length = 68
Score = 89.4 bits (220), Expect = 2e-15, Method: Composition-based stats.
Identities = 40/63 (63%), Positives = 50/63 (79%)
Query: 81 QKSEADKIVDGMDFGELCNEFECISSPLVESTARQLVRDILELREGNRALGTFAVSVKYK 140
++S+ DK+VDGMDF ELC++FECISSP VES RQ+ RDI ELRE NRA +AV VKY+
Sbjct: 6 ERSQTDKMVDGMDFEELCSDFECISSPYVESIMRQVARDIFELREDNRAFSCYAVPVKYE 65
Query: 141 DPT 143
P+
Sbjct: 66 APS 68
>gi|145352042|ref|XP_001420368.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580602|gb|ABO98661.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 255
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 73/232 (31%), Positives = 112/232 (48%), Gaps = 26/232 (11%)
Query: 89 VDGMDFGELCNEFECISSPLVESTARQLVRDILELREGNRALGTFAVSVKYKDPTRSFTG 148
VDG+ E C++F C SSP VE+T + L++DI R NR+ ++ V Y D F G
Sbjct: 26 VDGL--REPCDDFVCKSSPAVENTLKSLIKDINAARGTNRSAAPYSPDVTYDDGALRFAG 83
Query: 149 REKYKRRL-WATTALDNPSVTVQEMVML-STSVLSIKWTLRGKPKSIIANIGGDLIVKVY 206
EKYKR + L + V E+ M S V ++KW L N G + V V
Sbjct: 84 SEKYKRYCSYIENNLRQVTTRVTEISMEGSLDVATVKWELNA------VNDIGRVGVDVT 137
Query: 207 SKFTLNQISGQVIEHEELWDLSASSPVAR--AFFWASRRLYA--------TTEAGKDSFD 256
+ + +N I+G+V+EH E W ++ + A+ A ++R+ +A EA K +
Sbjct: 138 ATYKMNLITGRVLEHREQWVVNPARTDAQAGALLESTRKAHALPLNAMEVADEAKKKFRE 197
Query: 257 LINNLKSKISTEQENLGIYPDPSGDPTKV---SSDSRCFCFCFCLTSNVIYI 305
+ + S+ +QEN IY DP+ DP K S+ L + VIY+
Sbjct: 198 ISKSFASE--DDQENSNIYVDPN-DPMKFFQQDDTSKTDLLQLSLVAAVIYL 246
>gi|195604514|gb|ACG24087.1| hypothetical protein [Zea mays]
Length = 89
Score = 85.9 bits (211), Expect = 2e-14, Method: Composition-based stats.
Identities = 41/70 (58%), Positives = 52/70 (74%), Gaps = 1/70 (1%)
Query: 70 STTSTELSIQN-QKSEADKIVDGMDFGELCNEFECISSPLVESTARQLVRDILELREGNR 128
S TE+ + + ++S+ DK+VDGMDF ELC++FECISSP VES RQ+ RDI ELRE NR
Sbjct: 13 SRAITEILVSSRERSQTDKMVDGMDFEELCSDFECISSPYVESIMRQVARDIFELREDNR 72
Query: 129 ALGTFAVSVK 138
A +AV VK
Sbjct: 73 AFSCYAVPVK 82
>gi|307109060|gb|EFN57299.1| hypothetical protein CHLNCDRAFT_143902 [Chlorella variabilis]
Length = 249
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 67/214 (31%), Positives = 107/214 (50%), Gaps = 10/214 (4%)
Query: 98 CNEFECISSPLVESTARQLVRDILELREGNRALGTFAVSVKYKDPTRSFTGREKYKRRLW 157
C+EF C SSP VE++ RQL +D+ R R +A +V+Y+D R F G + Y+R +
Sbjct: 28 CDEFVCTSSPSVEASVRQLAKDLQ--RANGRWAPIYASTVEYRDAYRRFKGPDGYQRLDF 85
Query: 158 ATTALDNPSVTVQEMVMLSTSVLSIKWTLRGKPKSIIANIGGDLIVKVYSKFTLNQISGQ 217
+T + +V V M ML S I+W L GK + ++ G ++ +N ++G+
Sbjct: 86 VSTNVQQAAVAVTGMRMLDNSSAEIRWRLTGKLGVLPIDVAG------TTEIEMNLLTGR 139
Query: 218 VIEHEELWDLSASSPVARAFFWASRRLYATTEAGKDSFDLINN-LKSKISTEQENLGIYP 276
+ H E WDL+A SP A A + ASR L+A +A D+ L + S + E+ P
Sbjct: 140 IERHREKWDLAACSPPAAAAWNASRALWAAKQASGDAQRAAGRALDTLTSVDDEDQYSQP 199
Query: 277 DPSGDPTKVSSDSRCFCFCFCLTSNVIYIAFASV 310
+P+ DP + F L V+ + + V
Sbjct: 200 NPN-DPGRFFQQGDTFKQDAALFVGVVMLFWGLV 232
>gi|308809583|ref|XP_003082101.1| unnamed protein product [Ostreococcus tauri]
gi|116060568|emb|CAL55904.1| unnamed protein product [Ostreococcus tauri]
Length = 295
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 81/283 (28%), Positives = 126/283 (44%), Gaps = 29/283 (10%)
Query: 43 ANSSSSSHHHHHHHHHHSPFTLRVTNDSTTSTE--------LSIQNQKSEADKIVDGMDF 94
A++ SS H P L + ++S++ +Q E VDG F
Sbjct: 13 ASTRRSSEEGARVDRHRQPRRLGIHHESSSIASPRRDGRRITRVQATDKEFRASVDG--F 70
Query: 95 GELCNEFECISSPLVESTARQLVRDILELREGNRALGTFAVSVKYKDPTRSFTGREKYKR 154
E C+EF C SSP VE+T + +V+DI R R+ +A V+Y D F G EKY +
Sbjct: 71 REPCDEFVCKSSPAVENTLKSVVKDINAARGTTRSSKPYAPDVEYDDGVLKFKGSEKYAK 130
Query: 155 RL-WATTALDNPSVTVQEMVMLST-SVLSIKWTLRGKPKSIIANIGGDLIVKVYSKFTLN 212
+ L + V E+ M T V +++W L G N G + V + + + +N
Sbjct: 131 YCSYVENNLRQVTTRVTEISMKDTLDVATVRWELNG------TNDIGRVGVDIEATYKMN 184
Query: 213 QISGQVIEHEELWDLSASSPVAR--AFFWASRR-----LYATTEAGKDSFDLINNLKSKI 265
I+G+V+EH E W ++ S A+ A ++R+ L A A + S L KS
Sbjct: 185 LITGRVLEHRERWTVNPSRTEAQAGALLESTRKAHALPLNAMELADRASQGLKELTKSFS 244
Query: 266 STEQENLGIYPDPSGDPTKV---SSDSRCFCFCFCLTSNVIYI 305
S ++ IY +P+ DP K S+ L + VIY+
Sbjct: 245 SGDEGEKNIYVNPN-DPMKFFQQDDTSKTDLLQLSLVAAVIYL 286
>gi|159469448|ref|XP_001692875.1| predicted protein [Chlamydomonas reinhardtii]
gi|158277677|gb|EDP03444.1| predicted protein [Chlamydomonas reinhardtii]
Length = 280
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 63/208 (30%), Positives = 101/208 (48%), Gaps = 23/208 (11%)
Query: 83 SEADKIVDGMDFGELCNEFECISSPLVESTARQLVRDILELREGNRALGT-FAVSVKYKD 141
S A K ++ +D + C++F C SSP VE T R L R EL+ G A T + V Y D
Sbjct: 52 SIATKTIEDLDT-DYCDDFVCTSSPAVEQTVRSLAR---ELQRGRYAAATLYQPGVTYSD 107
Query: 142 PTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLSTSVLSIKWTLRGKPKSIIANIGGDL 201
R+FTG E YKR+ W ++ ++ + ML I W + G ++GG
Sbjct: 108 GFRTFTGVEGYKRQRWVIDNVEKFKTSILRLRMLDKGTSEISWQVDG-------SLGGQP 160
Query: 202 IVKVYSKFTL-NQISGQVIEHEELWDLSASSPVARAFFWASRRLYATTEAGKDSFDLINN 260
+ +++ N ++G++ H E W+LS SP A A A+R ++ +A +D+ + +
Sbjct: 161 VSLLFTTTCEHNLVTGRITNHRESWNLSRCSPPAAAMATANRYAWSAKQAVEDAKEGLTK 220
Query: 261 LKSKI----STEQENLGIYPDPSGDPTK 284
K+ + NL PS DPT+
Sbjct: 221 AAEKLGGGGGADMNNL-----PS-DPTR 242
>gi|195623420|gb|ACG33540.1| hypothetical protein [Zea mays]
Length = 56
Score = 83.2 bits (204), Expect = 1e-13, Method: Composition-based stats.
Identities = 37/56 (66%), Positives = 44/56 (78%)
Query: 88 IVDGMDFGELCNEFECISSPLVESTARQLVRDILELREGNRALGTFAVSVKYKDPT 143
+VDGMDF ELC++FECISSP VES RQ+ RDI ELRE NRA +AV VKY+ P+
Sbjct: 1 MVDGMDFEELCSDFECISSPYVESIMRQVARDIFELREDNRAFSCYAVPVKYEAPS 56
>gi|302850742|ref|XP_002956897.1| hypothetical protein VOLCADRAFT_119553 [Volvox carteri f.
nagariensis]
gi|300257778|gb|EFJ42022.1| hypothetical protein VOLCADRAFT_119553 [Volvox carteri f.
nagariensis]
Length = 802
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 52/169 (30%), Positives = 79/169 (46%), Gaps = 10/169 (5%)
Query: 98 CNEFECISSPLVESTARQLVRDILELREGNRALGTFAVSVKYKDPTRSFTGREKYKRRLW 157
C++F C SSP VE T R L R++ R +L + V Y D R+FTG E + R+ W
Sbjct: 589 CDDFLCTSSPAVEQTVRSLARELKRGRYTTTSL--YQPGVTYSDGFRTFTGPEGFTRQRW 646
Query: 158 ATTALDNPSVTVQEMVMLSTSVLSIKWTLRGKPKSIIANIGGDLI-VKVYSKFTLNQISG 216
+ PS+T+ +M ML I W L G+ +GG L+ V + N ++G
Sbjct: 647 VADNVQEPSLTIVKMRMLDKGTSEISWRLVGR-------LGGILLDVSFNTTCEHNLLTG 699
Query: 217 QVIEHEELWDLSASSPVARAFFWASRRLYATTEAGKDSFDLINNLKSKI 265
++ E WDLS P A +R ++ +A D+ D K+
Sbjct: 700 RITSLRESWDLSRCPPPAALLATFNRYSWSAKQAVADARDGAGKALQKL 748
>gi|412987779|emb|CCO19175.1| predicted protein [Bathycoccus prasinos]
Length = 299
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 42/152 (27%), Positives = 69/152 (45%), Gaps = 32/152 (21%)
Query: 98 CNEFECI-SSPLVESTARQLVRDILELREGNRALGTFAVSVKYKDPTRSFTGREKYK--- 153
C+ FEC +SP VES+ RQL RD+ + ++ NR++ +A V++ D R F+G EK+K
Sbjct: 57 CDAFECKGTSPAVESSLRQLARDLRDGKDSNRSIFPYAKDVRFSDGARRFSGSEKFKNCN 116
Query: 154 ---RRLWATTAL---DNPSVTVQE--------------MVMLSTSVLSIKWTLRGKPKSI 193
++ + DN + V+ SV I W + K
Sbjct: 117 CELNSIFGGEKISSSDNSKTNNNDNIRNRKFTNWVDSIEVLPGGSVAKILWRIESKD--- 173
Query: 194 IANIGGDLIVKVYSKFTLNQISGQVIEHEELW 225
G VKV + +N I+G+V+ ++ W
Sbjct: 174 -----GKAKVKVETTLEMNLITGKVLSQDDQW 200
>gi|302840933|ref|XP_002952012.1| hypothetical protein VOLCADRAFT_92622 [Volvox carteri f.
nagariensis]
gi|300262598|gb|EFJ46803.1| hypothetical protein VOLCADRAFT_92622 [Volvox carteri f.
nagariensis]
Length = 200
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 60/119 (50%), Gaps = 17/119 (14%)
Query: 118 RDILELR---EGNRALGTFAVSVKYKDPTRSFTGREKYKRRLWATTALDNPSVTVQEMVM 174
RD +E + GN FA +KDPT G E Y R A AL +PS + +++
Sbjct: 62 RDFVERQYYVTGNLTPQLFAPDCVFKDPTVEVVGVEPYVR---ALQALFDPSTSRADLIS 118
Query: 175 L---STSVLSIKWTLRGKPKSIIANIGGDLIVKVYSKFTLNQIS--GQVIEHEELWDLS 228
+ + S + ++W L G K + G L +K Y+ T+ +S G+VI HEE WD+S
Sbjct: 119 IRATAPSTVVLRWRLEGSLK-----LAG-LKIKPYTGTTVYTLSDDGKVIRHEETWDIS 171
>gi|357111214|ref|XP_003557409.1| PREDICTED: uncharacterized protein LOC100837510 [Brachypodium
distachyon]
Length = 231
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 37/125 (29%), Positives = 60/125 (48%), Gaps = 14/125 (11%)
Query: 126 GNRALGTFAVSVKYKDPTRSFTGREKYKRRL-WATTALDNPSV---TVQEMVMLSTSVLS 181
GN LG +A Y+DPT F GR KY + L LD+PS+ ++++ + T +
Sbjct: 111 GNFTLGIYAEDCLYEDPTIKFRGRSKYAQNLDLLVPFLDSPSLELESIEKGLRAETKSIM 170
Query: 182 IKWTLRGK---PKSIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFF 238
WTLR P + I G+ + ++ +V+ H E WD+SA +A+ F
Sbjct: 171 ATWTLRTYLRLPWRPLIAIRGNTTYDLDEEY-------KVVRHAESWDVSALEAIAQLFV 223
Query: 239 WASRR 243
A ++
Sbjct: 224 SAPKQ 228
>gi|428779817|ref|YP_007171603.1| hypothetical protein Dacsa_1567 [Dactylococcopsis salina PCC 8305]
gi|428694096|gb|AFZ50246.1| hypothetical protein Dacsa_1567 [Dactylococcopsis salina PCC 8305]
Length = 146
Score = 44.7 bits (104), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 34/126 (26%), Positives = 57/126 (45%), Gaps = 15/126 (11%)
Query: 109 VESTARQLVRDILELREGNRALGTFAVSVKYKDPTRSFTGREKYKRRLW-----ATTALD 163
VES QL D+ L + + + ++ + +KDP F + Y+ W A
Sbjct: 12 VESAIEQLSHDLPNLFQKDISYKIYSSDIFFKDPVNLFQSKFNYRIIFWTLRFHARLFFT 71
Query: 164 NPSVTVQEMVMLSTSVLSIKWTLRGKPK-SIIANI--GGDLIVKVYSKFTLNQISGQVIE 220
V ++ + T+++ + WT+RGK + ANI GD S +TLN G + +
Sbjct: 72 EIYFDVHDIQQIETNIIKVWWTVRGKLRVPWEANIFFNGD------STYTLNS-EGLISD 124
Query: 221 HEELWD 226
H + WD
Sbjct: 125 HRDSWD 130
>gi|428775767|ref|YP_007167554.1| hypothetical protein PCC7418_1134 [Halothece sp. PCC 7418]
gi|428690046|gb|AFZ43340.1| Protein of unknown function DUF2358 [Halothece sp. PCC 7418]
Length = 170
Score = 43.5 bits (101), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 36/137 (26%), Positives = 61/137 (44%), Gaps = 16/137 (11%)
Query: 109 VESTARQLVRDILELREGNRALGTFAVSVKYKDPTRSFTGREKYKRRLW-----ATTALD 163
VES QL +D+ L + + + ++ ++ +KDP F G+ Y+ W A
Sbjct: 35 VESAIAQLWQDLPTLFKKDISYQIYSSNIFFKDPVNLFQGKLNYRIIFWTLRFHARLFFT 94
Query: 164 NPSVTVQEMVMLSTSVLSIKWTLRGKPK-SIIANI--GGDLIVKVYSKFTLNQISGQVIE 220
V ++ +V+ + WT+RGK + ANI GD S + LN+ G +
Sbjct: 95 EIYFDVHDIQQTENNVIKVWWTVRGKLRVPWQANIFFNGD------STYKLNE-DGLIYH 147
Query: 221 HEELWDLSASSPVARAF 237
H + WD A + + F
Sbjct: 148 HRDNWD-RAPKTILKQF 163
>gi|356526417|ref|XP_003531814.1| PREDICTED: uncharacterized protein LOC100797159 [Glycine max]
Length = 268
Score = 42.4 bits (98), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 27/91 (29%), Positives = 48/91 (52%), Gaps = 9/91 (9%)
Query: 137 VKYKDPTRSFTGREKYKRRLWA-----TTALDNPSVTVQEMVMLSTSVLSIKWTLRGKPK 191
+ +KDP +FTG EKYK WA ++ V + S +V+ I+W LRG P+
Sbjct: 111 ITFKDPLNTFTGIEKYKLIFWALRFHGKILFREIALDVYRIWQPSENVILIRWNLRGVPR 170
Query: 192 SIIANIGGDLIVKVYSKFTLNQISGQVIEHE 222
+ G+ + S++ L++ +G++ EH+
Sbjct: 171 -VPWEAKGEF--QGTSRYKLDR-NGKIYEHK 197
>gi|242043308|ref|XP_002459525.1| hypothetical protein SORBIDRAFT_02g006110 [Sorghum bicolor]
gi|241922902|gb|EER96046.1| hypothetical protein SORBIDRAFT_02g006110 [Sorghum bicolor]
Length = 245
Score = 42.0 bits (97), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 59/126 (46%), Gaps = 14/126 (11%)
Query: 126 GNRALGTFAVSVKYKDPTRSFTGREKYKRRLWATTAL-DNPSV---TVQEMVMLSTSVLS 181
G+ G + ++DPT F G +Y + L+ D+PS+ ++++ + + T +
Sbjct: 124 GDFTPGIYTADCLFEDPTIKFRGLSRYSQNLYLLVPFFDSPSLELESIEKGLRVETKFVK 183
Query: 182 IKWTLRGK---PKSIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFF 238
W LR P + I G+ + + LN+ +VI H E WD+SA +A+ F
Sbjct: 184 ATWKLRTYLRLPWRPLIAIRGN------TTYDLNE-DFKVIRHSESWDVSAVEAIAQIFV 236
Query: 239 WASRRL 244
A +L
Sbjct: 237 SAPEQL 242
>gi|351720803|ref|NP_001235141.1| uncharacterized protein LOC100306700 [Glycine max]
gi|255629317|gb|ACU15003.1| unknown [Glycine max]
Length = 255
Score = 41.2 bits (95), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 59/124 (47%), Gaps = 14/124 (11%)
Query: 126 GNRALGTFAVSVKYKDPTRSFTGREKYKRRL-WATTALDNPSVTVQEM---VMLSTSVLS 181
GN + +A + ++DPT F GRE Y R L D+ S+ +Q++ V T+ +
Sbjct: 112 GNFSSSIYAENCIFEDPTIKFRGRELYARNLKLLVPFFDSASIILQKIDKDVDSDTNFVL 171
Query: 182 IKWTLRGK---PKSIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFF 238
W LR P + +I G + + LN+ +++ H E W++SA V + F
Sbjct: 172 ASWKLRTNLKLPWRPLISIDGSTV------YELNE-DYKIVRHVESWNISAVEAVLQIFS 224
Query: 239 WASR 242
+ S+
Sbjct: 225 FKSK 228
>gi|302836648|ref|XP_002949884.1| hypothetical protein VOLCADRAFT_109744 [Volvox carteri f.
nagariensis]
gi|300264793|gb|EFJ48987.1| hypothetical protein VOLCADRAFT_109744 [Volvox carteri f.
nagariensis]
Length = 237
Score = 39.7 bits (91), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 29/108 (26%), Positives = 52/108 (48%), Gaps = 15/108 (13%)
Query: 139 YKDPTRSFTGREKYKRRLWATTA-LDNPSVTVQEMVMLS-----TSVLSIKWTLR---GK 189
+ DPT +F G + +KR L LD P+V ++ + L V+ +W LR
Sbjct: 128 FADPTVAFRGVDLWKRNLALLVPFLDQPAVQLKRVQRLGRDEQGAEVVRAEWRLRTFLRL 187
Query: 190 PKSIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAF 237
P + +I G +++TLN+ S +++ H E W +S + + + F
Sbjct: 188 PWRPLIDIDG------ATEYTLNEESNRIVRHVESWGVSGTQALLQMF 229
>gi|222636639|gb|EEE66771.1| hypothetical protein OsJ_23496 [Oryza sativa Japonica Group]
Length = 249
Score = 39.7 bits (91), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 31/123 (25%), Positives = 56/123 (45%), Gaps = 10/123 (8%)
Query: 126 GNRALGTFAVSVKYKDPTRSFTGREKYKRRL-WATTALDNPSV---TVQEMVMLSTSVLS 181
GN + ++DPT F GR +Y + L D+PS+ +++ + + T +
Sbjct: 128 GNFTPDIYTEDCLFEDPTIKFRGRSRYSQNLDLLVPFFDSPSLELENIEKGLRVETKFIM 187
Query: 182 IKWTLRGKPKSIIANIGGDLIVKVYSKFTLN-QISGQVIEHEELWDLSASSPVARAFFWA 240
WTLR + + DL+V + +L + +V H E WD+SA + + F A
Sbjct: 188 ATWTLRSRYSQNL-----DLLVPFFDSPSLELENIEKVTRHAESWDVSALEAIGQIFVPA 242
Query: 241 SRR 243
++
Sbjct: 243 PKQ 245
>gi|298204898|emb|CBI34205.3| unnamed protein product [Vitis vinifera]
Length = 216
Score = 39.3 bits (90), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 30/113 (26%), Positives = 53/113 (46%), Gaps = 14/113 (12%)
Query: 139 YKDPTRSFTGREKYKRRL-WATTALDNPSVTVQEMVMLS---TSVLSIKWTLRGK---PK 191
++DPT F G++ Y R L D+PS+ +Q++ S T + W LR P
Sbjct: 109 FEDPTIKFRGKDLYSRNLKLLVPFFDHPSIALQKIEKGSNSETKFVLASWKLRTYLKLPW 168
Query: 192 SIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFFWASRRL 244
+ +I G + + +F +++ H E W++SA V + F + +RL
Sbjct: 169 RPLISIAGSTVYDLNDEF-------KIVRHAESWNISALQAVGQIFTPSLQRL 214
>gi|159481098|ref|XP_001698619.1| SOUL3-like protein [Chlamydomonas reinhardtii]
gi|158282359|gb|EDP08112.1| SOUL3-like protein [Chlamydomonas reinhardtii]
Length = 232
Score = 39.3 bits (90), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 29/108 (26%), Positives = 52/108 (48%), Gaps = 15/108 (13%)
Query: 139 YKDPTRSFTGREKYKRRLWATTA-LDNPSVTVQEMVMLS-----TSVLSIKW---TLRGK 189
+ DPT +F G + +KR L T L+ P+V + M L V+ +W T+
Sbjct: 123 FADPTVAFRGTDLWKRNLALLTPFLEAPNVQLYGMRQLGRDEDGAEVVRAEWRLTTILKL 182
Query: 190 PKSIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAF 237
P + ++ G +++TLN+ S +++ H E W +S + V + F
Sbjct: 183 PWRPLIDLDG------ATEYTLNEESNRIVRHVEFWSISGTEAVLQMF 224
>gi|17228448|ref|NP_484996.1| hypothetical protein alr0953 [Nostoc sp. PCC 7120]
gi|17130299|dbj|BAB72910.1| alr0953 [Nostoc sp. PCC 7120]
Length = 139
Score = 39.3 bits (90), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 29/134 (21%), Positives = 57/134 (42%), Gaps = 10/134 (7%)
Query: 109 VESTARQLVRDILELREGNRALGTFAVSVKYKDPTRSFTGREKYKRRLWAT---TALDNP 165
+E + L +D+ L E + + + + ++DP F G+ Y+ W L P
Sbjct: 7 IEQVIKTLQQDLPTLFEQDISYDIYTKDIYFQDPVNKFKGKFNYRIIFWTLRFHARLFFP 66
Query: 166 SV--TVQEMVMLSTSVLSIKWTLRGKPKSIIANIGGDLIVKVYSKFTLNQISGQVIEHEE 223
+ + E++ L + KWT+RG ++ ++ YS + L Q + + +H +
Sbjct: 67 EIYFDLHEVLQLDKDTILAKWTVRG---TLRVPWRSQMLFNGYSTYKLRQ-NNLIYQHID 122
Query: 224 LWDLSASSPVARAF 237
WD + R F
Sbjct: 123 TWD-RKPGEILRQF 135
>gi|225451265|ref|XP_002272524.1| PREDICTED: uncharacterized protein LOC100264176 [Vitis vinifera]
Length = 242
Score = 38.9 bits (89), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 30/113 (26%), Positives = 53/113 (46%), Gaps = 14/113 (12%)
Query: 139 YKDPTRSFTGREKYKRRL-WATTALDNPSVTVQEMVMLS---TSVLSIKWTLRGK---PK 191
++DPT F G++ Y R L D+PS+ +Q++ S T + W LR P
Sbjct: 135 FEDPTIKFRGKDLYSRNLKLLVPFFDHPSIALQKIEKGSNSETKFVLASWKLRTYLKLPW 194
Query: 192 SIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSASSPVARAFFWASRRL 244
+ +I G + + +F +++ H E W++SA V + F + +RL
Sbjct: 195 RPLISIAGSTVYDLNDEF-------KIVRHAESWNISALQAVGQIFTPSLQRL 240
>gi|116792207|gb|ABK26275.1| unknown [Picea sitchensis]
Length = 222
Score = 38.5 bits (88), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 48/100 (48%), Gaps = 19/100 (19%)
Query: 138 KYKDPTRSFTGREKYKRRLWATTALDNPSVTVQEMV---MLSTSVLSIKWTLRGKPK--- 191
+++DPT TG EKY + A L NP+ + QE++ ++ + KW L G K
Sbjct: 119 RFRDPTTDLTGLEKY---ISAVKFLFNPNTSKQELLSIAVVDPHTIEAKWRLEGYLKFPW 175
Query: 192 --SIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSA 229
I+ G +++ L+ G ++ HEE WD+S
Sbjct: 176 NPHILPYEGS-------TRYVLDD-RGLIMSHEETWDISV 207
>gi|395644905|ref|ZP_10432765.1| polymorphic outer membrane protein [Methanofollis liminatans DSM
4140]
gi|395441645|gb|EJG06402.1| polymorphic outer membrane protein [Methanofollis liminatans DSM
4140]
Length = 1009
Score = 38.5 bits (88), Expect = 4.5, Method: Composition-based stats.
Identities = 21/76 (27%), Positives = 37/76 (48%)
Query: 119 DILELREGNRALGTFAVSVKYKDPTRSFTGREKYKRRLWATTALDNPSVTVQEMVMLSTS 178
DI LR N ++G V +KD TR T + L T ++ P+V V +MV ++
Sbjct: 441 DISNLRYSNLSVGNATVRASWKDQTRDITVSFRNYPYLSVKTRVEPPTVAVNDMVNVTVQ 500
Query: 179 VLSIKWTLRGKPKSII 194
++ + L +P ++
Sbjct: 501 LIGDGYALHSRPIDVV 516
>gi|75906754|ref|YP_321050.1| hypothetical protein Ava_0531 [Anabaena variabilis ATCC 29413]
gi|75700479|gb|ABA20155.1| conserved hypothetical protein [Anabaena variabilis ATCC 29413]
Length = 139
Score = 38.5 bits (88), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 29/135 (21%), Positives = 54/135 (40%), Gaps = 10/135 (7%)
Query: 109 VESTARQLVRDILELREGNRALGTFAVSVKYKDPTRSFTGREKYKRRLW-----ATTALD 163
+E + L +D+ L E + + + + ++DP F G+ Y+ W A
Sbjct: 7 IEQVIKTLQQDLPTLFEQDISYDIYTPDIYFQDPVNKFKGKFNYRIIFWTLRFHAWLFFT 66
Query: 164 NPSVTVQEMVMLSTSVLSIKWTLRGKPKSIIANIGGDLIVKVYSKFTLNQISGQVIEHEE 223
+ E+ L + KWT+RG+ + + YS + L Q + + EH +
Sbjct: 67 EIYFDLHEVGQLDKDTILAKWTVRGRLR---VPWRSQIFFNGYSTYKLRQ-NNLIYEHID 122
Query: 224 LWDLSASSPVARAFF 238
WD + R F+
Sbjct: 123 TWD-RKPGEILRQFW 136
>gi|307108290|gb|EFN56530.1| hypothetical protein CHLNCDRAFT_144158 [Chlorella variabilis]
Length = 172
Score = 38.1 bits (87), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 47/101 (46%), Gaps = 16/101 (15%)
Query: 139 YKDPTRSFTGREKYKRRLWATTA-LDNPSVTVQEMVML------STSVLSIKWTLR---G 188
+ DPT SF GR+ YKR L L P++ ++ + L ++ L +W L
Sbjct: 57 FADPTVSFRGRDLYKRNLALLVPFLWEPAIQLRSLRRLPAPAPGGSAQLFAEWRLSCWVR 116
Query: 189 KPKSIIANIGGDLIVKVYSKFTLNQISGQVIEHEELWDLSA 229
P + +I G + +TLN Q++ H E WD+SA
Sbjct: 117 LPWAPYVDINGT------TTYTLNADGNQIVRHVEQWDVSA 151
>gi|431796645|ref|YP_007223549.1| acetate--CoA ligase [Echinicola vietnamensis DSM 17526]
gi|430787410|gb|AGA77539.1| acetate--CoA ligase [Echinicola vietnamensis DSM 17526]
Length = 632
Score = 37.7 bits (86), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 50/107 (46%), Gaps = 11/107 (10%)
Query: 111 STARQLVRDIL-ELREGNRALGTFAVSVKYKDPTRSFTGREKYKRRLWATTALDNPSVTV 169
T + V+D++ E E + + T V + K + GR+ +W A+D T
Sbjct: 178 GTKKIAVKDLVDEALEKTKTIETVIVYQRTKQEVKMVEGRD-----IWWHDAIDGQPDTN 232
Query: 170 QEMVMLSTSVLSIKWTL--RGKPKSIIANIGGDLIVKVYSKFTLNQI 214
+ VM S +L I +T GKPK ++ GG + VYSK+T +
Sbjct: 233 EAEVMDSEDMLFILYTSGSTGKPKGVVHTTGGYM---VYSKYTFENV 276
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.130 0.387
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,996,878,416
Number of Sequences: 23463169
Number of extensions: 194916738
Number of successful extensions: 1155199
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 2249
Number of HSP's successfully gapped in prelim test: 1068
Number of HSP's that attempted gapping in prelim test: 1108459
Number of HSP's gapped (non-prelim): 25139
length of query: 335
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 192
effective length of database: 9,003,962,200
effective search space: 1728760742400
effective search space used: 1728760742400
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 77 (34.3 bits)