BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 029169
(198 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|449457015|ref|XP_004146244.1| PREDICTED: uncharacterized protein LOC101221005 [Cucumis sativus]
gi|449495512|ref|XP_004159863.1| PREDICTED: uncharacterized LOC101221005 [Cucumis sativus]
Length = 282
Score = 208 bits (530), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 122/198 (61%), Positives = 143/198 (72%), Gaps = 12/198 (6%)
Query: 1 MLQTHHLLSLNFP-FTVS--HHPQKLNFLQKPTISLSAFPR------RRPLIEPYCLAQA 51
M T +LLS NFP FT+S + KL FL PT +LS+ R PL C
Sbjct: 1 MWHTQNLLSSNFPLFTLSPPTYNHKL-FLSPPT-TLSSLHRPITFHSVSPLTNHRCFCLP 58
Query: 52 QEPTTNITAPTTSEEGPVELPQSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKR 111
Q T A + GPVELP +IFATTD+PSSLQVATSVLLTGAISVFLFR+LRRRAKR
Sbjct: 59 QF-TDLADATFLDDNGPVELPPTIFATTDDPSSLQVATSVLLTGAISVFLFRSLRRRAKR 117
Query: 112 AKELKFRSSGAKKSLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFT 171
KELKFRS G KKSLK+EA+D+LKA+ + I++K PSP+QA LG + AGVIA+ILYKFT
Sbjct: 118 VKELKFRSGGVKKSLKEEAMDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFT 177
Query: 172 TTIEAALNRQTISDNFSV 189
TTIEA+LNRQT+SDNFSV
Sbjct: 178 TTIEASLNRQTVSDNFSV 195
>gi|225461564|ref|XP_002282834.1| PREDICTED: uncharacterized protein LOC100267434 [Vitis vinifera]
gi|302142945|emb|CBI20240.3| unnamed protein product [Vitis vinifera]
Length = 277
Score = 205 bits (521), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 107/142 (75%), Positives = 122/142 (85%), Gaps = 7/142 (4%)
Query: 51 AQEPTTNITAPTTSEEGPVELP---QSIFATTDEPSSLQVATSVLLTGAISVFLFRALRR 107
AQ P T TAP EEGP+ELP SIFAT D+P+ LQVATSVLLTGAISVFLFR++RR
Sbjct: 53 AQVPDTTTTAP---EEGPIELPPSSSSIFATNDDPTPLQVATSVLLTGAISVFLFRSIRR 109
Query: 108 RAKRAKELKFRSSGAKKSLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIIL 167
R KRAKEL+FRSSG KK+LK+EALD+LKA+GS S+ A PPSPVQALLGG+TAGVIA+IL
Sbjct: 110 RVKRAKELRFRSSGVKKTLKEEALDSLKAMGSGSVKA-APPSPVQALLGGITAGVIALIL 168
Query: 168 YKFTTTIEAALNRQTISDNFSV 189
YKFT TIEA+LNRQT+SDNFSV
Sbjct: 169 YKFTITIEASLNRQTVSDNFSV 190
>gi|118488906|gb|ABK96262.1| unknown [Populus trichocarpa x Populus deltoides]
Length = 238
Score = 204 bits (520), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 119/198 (60%), Positives = 140/198 (70%), Gaps = 13/198 (6%)
Query: 2 LQTHHLLSLN---FPFT---VSHHPQKLNFLQKPTI----SLSAFPRRRPLIEPYCLAQA 51
LQ H+ L+ FP + +HH L FL +P I +LSA PR RP P LAQ
Sbjct: 4 LQCHNFHPLSPQIFPLSHHHTNHHLYPLKFLYRPPIFTPFTLSALPRSRP-DPPLWLAQL 62
Query: 52 QEPTTNITAPTTSEEGPVELPQSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKR 111
+ SIFAT+D+PSS+QVATSVLLTGAISVFLFR+LRRRAKR
Sbjct: 63 P--EQAPEPEEEGPIEILRSSPSIFATSDDPSSIQVATSVLLTGAISVFLFRSLRRRAKR 120
Query: 112 AKELKFRSSGAKKSLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFT 171
+KELKFRSSGAKK+LK+EALD+LK GS+ ID K PPSPVQA LG ++AGVIA+ILYKFT
Sbjct: 121 SKELKFRSSGAKKTLKEEALDSLKTFGSAPIDVKKPPSPVQAFLGAISAGVIALILYKFT 180
Query: 172 TTIEAALNRQTISDNFSV 189
TTIEA+LNRQT+SDNFSV
Sbjct: 181 TTIEASLNRQTVSDNFSV 198
>gi|224116710|ref|XP_002317372.1| predicted protein [Populus trichocarpa]
gi|222860437|gb|EEE97984.1| predicted protein [Populus trichocarpa]
Length = 211
Score = 194 bits (492), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 94/116 (81%), Positives = 109/116 (93%)
Query: 74 SIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKKSLKDEALDN 133
SIFAT+D+PSS+QVATSVLLTGAISVFLFR+LRRRAKR+KELKFRSSGAKK+LK+EALD+
Sbjct: 16 SIFATSDDPSSIQVATSVLLTGAISVFLFRSLRRRAKRSKELKFRSSGAKKTLKEEALDS 75
Query: 134 LKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSV 189
LK GS+ ID K PPSPVQA LG ++AGVIA+ILYKF+TTIEA+LNRQT+SDNFSV
Sbjct: 76 LKTFGSAPIDVKKPPSPVQAFLGAISAGVIALILYKFSTTIEASLNRQTVSDNFSV 131
>gi|356544064|ref|XP_003540475.1| PREDICTED: uncharacterized protein LOC100799393 [Glycine max]
Length = 277
Score = 190 bits (482), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 118/200 (59%), Positives = 141/200 (70%), Gaps = 32/200 (16%)
Query: 5 HHLLSLNFPFTVSHHPQK--LNFL----QKPTI----SLSAFPRRRPLIEPYCLAQAQEP 54
HHLL FP T++HH K L+ L KPTI S SA P P+ A
Sbjct: 8 HHLL---FPLTLAHHKVKPPLSILPPLHHKPTILHSVSASAIP-------PWAAQLADAA 57
Query: 55 TTNITAPTTSEEGPVELP----QSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAK 110
T EGPVELP SIFAT+D+PS +QVA+SVLLTGA+SVFLFR+LRRRA+
Sbjct: 58 ADIDT------EGPVELPFSSTPSIFATSDDPSPIQVASSVLLTGAVSVFLFRSLRRRAQ 111
Query: 111 RAKELKFRSSGAKKSLKDEALDNLKALGSSSIDAK-GPPSPVQALLGGLTAGVIAIILYK 169
R K+ +FRSSG +KS+K+EALD+LKA+GS+S+ AK SPVQALLGG++AGVIA+ILYK
Sbjct: 112 RVKQSQFRSSG-EKSIKEEALDSLKAMGSASVKAKEDSASPVQALLGGISAGVIALILYK 170
Query: 170 FTTTIEAALNRQTISDNFSV 189
F TTIEAALNRQTISDNFSV
Sbjct: 171 FATTIEAALNRQTISDNFSV 190
>gi|255564541|ref|XP_002523266.1| conserved hypothetical protein [Ricinus communis]
gi|223537479|gb|EEF39105.1| conserved hypothetical protein [Ricinus communis]
Length = 285
Score = 181 bits (460), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 119/200 (59%), Positives = 143/200 (71%), Gaps = 13/200 (6%)
Query: 1 MLQTHHLLSLN--FPFTVSHHPQK-----LNFLQKPTISLSAFPRRRPLIEPYCLAQAQE 53
ML HH + FP HH + KPT SLS F P +P ++
Sbjct: 1 MLHCHHYIPSPQFFPRNHHHHTSTPYLSLTSSFSKPT-SLSTF-SALPPSKPDIWLASEP 58
Query: 54 PTTNITAPTTSEEGPVELP----QSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRA 109
+P+ E+GP+ELP SIFATTD+PS +QVATSVLLTG+I VFLFRALRRRA
Sbjct: 59 TPVTPPSPSPEEDGPIELPFPSSPSIFATTDDPSPIQVATSVLLTGSIGVFLFRALRRRA 118
Query: 110 KRAKELKFRSSGAKKSLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYK 169
KRAKELKFRSSGAKK++K+EALD+LKA+GS+SIDA+ PPSP+Q LGG++A VIA+ILYK
Sbjct: 119 KRAKELKFRSSGAKKTMKEEALDSLKAMGSASIDAQKPPSPLQTFLGGVSAAVIALILYK 178
Query: 170 FTTTIEAALNRQTISDNFSV 189
FTTTIEAALNRQT+SDNFSV
Sbjct: 179 FTTTIEAALNRQTVSDNFSV 198
>gi|297834372|ref|XP_002885068.1| hypothetical protein ARALYDRAFT_478942 [Arabidopsis lyrata subsp.
lyrata]
gi|297330908|gb|EFH61327.1| hypothetical protein ARALYDRAFT_478942 [Arabidopsis lyrata subsp.
lyrata]
Length = 261
Score = 177 bits (449), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 106/200 (53%), Positives = 135/200 (67%), Gaps = 22/200 (11%)
Query: 1 MLQTHHLLSLNFPFTVSHHPQKLNFLQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNITA 60
+LQ+H FT+ H + + P SLS+F R RP + ++ +E
Sbjct: 3 VLQSHQC-----RFTLPHRLRPTRLI-SPIHSLSSFTRIRP--DRIRVSVVKEIAD---V 51
Query: 61 PTTSEEGPVELPQS----------IFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAK 110
E+GP+ELP S IFAT+D+P+ LQ+ATSVLLTGAI+VFL R++RRRAK
Sbjct: 52 AEVEEDGPIELPPSSTSPFSSTNSIFATSDDPTPLQLATSVLLTGAITVFLIRSVRRRAK 111
Query: 111 RAKELKFRSSGAKKSLKDEALDNLKALGSSSID-AKGPPSPVQALLGGLTAGVIAIILYK 169
RAKEL+FRS+GAKKSLK+EA+DNLKALGS+ I+ PS QA LG + AGVIA+ILYK
Sbjct: 112 RAKELRFRSTGAKKSLKEEAMDNLKALGSTPIEGGNSTPSAAQAFLGAIAAGVIALILYK 171
Query: 170 FTTTIEAALNRQTISDNFSV 189
FT T+E+ LNRQTISDNFSV
Sbjct: 172 FTVTVESGLNRQTISDNFSV 191
>gi|21617902|gb|AAM66952.1| unknown [Arabidopsis thaliana]
Length = 266
Score = 177 bits (448), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 108/200 (54%), Positives = 136/200 (68%), Gaps = 22/200 (11%)
Query: 1 MLQTHHLLSLNFPFTVSHHPQKLNFLQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNITA 60
+LQ+H L + P+ + P +L P SLS+F R RP I L+ +E
Sbjct: 3 VLQSHQCL-FSLPYRL--RPTRL---ISPIHSLSSFTRIRPGI--IRLSAVKEIAD---V 51
Query: 61 PTTSEEGPVELPQS----------IFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAK 110
E+GP+ELP S IFAT+D+P+ LQ+ATSVLLTGAI+VFL R++RRRAK
Sbjct: 52 AEVEEDGPIELPTSSTSPFSSTNSIFATSDDPTPLQLATSVLLTGAITVFLIRSVRRRAK 111
Query: 111 RAKELKFRSSGAKKSLKDEALDNLKALGSSSID-AKGPPSPVQALLGGLTAGVIAIILYK 169
RAKEL+FRS+GAKKSLK+EA+DNLKAL S+ I+ PS QA LG + AGVIA+ILYK
Sbjct: 112 RAKELQFRSTGAKKSLKEEAMDNLKALSSTPIEGGNSTPSAAQAFLGAIAAGVIALILYK 171
Query: 170 FTTTIEAALNRQTISDNFSV 189
FT T+E+ LNRQTISDNFSV
Sbjct: 172 FTVTVESGLNRQTISDNFSV 191
>gi|18400636|ref|NP_566500.1| uncharacterized protein [Arabidopsis thaliana]
gi|87116616|gb|ABD19672.1| At3g15110 [Arabidopsis thaliana]
gi|332642098|gb|AEE75619.1| uncharacterized protein [Arabidopsis thaliana]
Length = 266
Score = 176 bits (446), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 108/200 (54%), Positives = 135/200 (67%), Gaps = 22/200 (11%)
Query: 1 MLQTHHLLSLNFPFTVSHHPQKLNFLQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNITA 60
+LQ+H L + P+ + P +L P SLS+F R RP I L+ +E
Sbjct: 3 VLQSHQCL-FSLPYRL--RPTRL---ISPIHSLSSFTRIRPGI--IRLSAVKEIAD---V 51
Query: 61 PTTSEEGPVELPQS----------IFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAK 110
E+GP+ELP S IFAT+D+P+ LQ+ATSVLLTGAI+VFL R++RRRAK
Sbjct: 52 AEVEEDGPIELPTSSTSPFSSTNSIFATSDDPTPLQLATSVLLTGAITVFLIRSVRRRAK 111
Query: 111 RAKELKFRSSGAKKSLKDEALDNLKALGSSSID-AKGPPSPVQALLGGLTAGVIAIILYK 169
RAKEL FRS+GAKKSLK+EA+DNLKAL S+ I+ PS QA LG + AGVIA+ILYK
Sbjct: 112 RAKELTFRSTGAKKSLKEEAMDNLKALSSTPIEGGNSTPSAAQAFLGAIAAGVIALILYK 171
Query: 170 FTTTIEAALNRQTISDNFSV 189
FT T+E+ LNRQTISDNFSV
Sbjct: 172 FTVTVESGLNRQTISDNFSV 191
>gi|388496044|gb|AFK36088.1| unknown [Medicago truncatula]
Length = 230
Score = 158 bits (400), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 79/128 (61%), Positives = 106/128 (82%), Gaps = 6/128 (4%)
Query: 66 EGPVELPQ----SIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSG 121
+GP+E+P S+ ++TD+PS +Q+A S+LLTGAISV LFR+ RRRAKR K+ +FRSSG
Sbjct: 57 DGPIEIPYDSTPSLLSSTDDPSFIQIAASLLLTGAISVLLFRSFRRRAKRLKQTQFRSSG 116
Query: 122 AKKSLKDEALDNLKALGSSSID-AKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNR 180
+KS+K+EAL+ LKA+G++SI+ KGPPSPVQ LG ++AGVI++ILYKF T IEA L+R
Sbjct: 117 -EKSVKEEALETLKAMGTASIETTKGPPSPVQTFLGAISAGVISLILYKFATIIEAGLSR 175
Query: 181 QTISDNFS 188
QTISD+FS
Sbjct: 176 QTISDDFS 183
>gi|217072908|gb|ACJ84814.1| unknown [Medicago truncatula]
Length = 271
Score = 156 bits (395), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 79/128 (61%), Positives = 105/128 (82%), Gaps = 6/128 (4%)
Query: 66 EGPVELPQ----SIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSG 121
+GP+E+P S+ ++TD+PS +Q+A S+LLTGAISV LFR+ RRRAKR K+ +FRSSG
Sbjct: 57 DGPIEIPYDSTPSLLSSTDDPSFIQIAASLLLTGAISVLLFRSFRRRAKRLKQTQFRSSG 116
Query: 122 AKKSLKDEALDNLKALGSSSID-AKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNR 180
+KS+K+EAL+ LKA G++SI+ KGPPSPVQ LG ++AGVI++ILYKF T IEA L+R
Sbjct: 117 -EKSVKEEALETLKATGTASIETTKGPPSPVQTFLGAISAGVISLILYKFATIIEAGLSR 175
Query: 181 QTISDNFS 188
QTISD+FS
Sbjct: 176 QTISDDFS 183
>gi|356514641|ref|XP_003526013.1| PREDICTED: uncharacterized protein LOC100783357 [Glycine max]
Length = 276
Score = 147 bits (372), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 99/184 (53%), Positives = 121/184 (65%), Gaps = 19/184 (10%)
Query: 10 LNFPFTVSHHPQKL---NFLQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNITAPTTSEE 66
L T +HH L +FL +P I+ + I P A+ TT +
Sbjct: 18 LRLSLTTTHHNSSLPSLHFLHRPIITTTTTTTTVSAIPPTAAWLAKL--------TTQDF 69
Query: 67 GPVELPQS-IFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKKS 125
ELP S +F TD+PS++QVA SVLLTGAI+VF FR+++RRAKRAKELKFRSSG KS
Sbjct: 70 AHFELPSSHLF--TDDPSTIQVACSVLLTGAITVFFFRSVQRRAKRAKELKFRSSGVNKS 127
Query: 126 LKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISD 185
L DNLKA+ SSSI AK PPSP QALLG + AGVIA++LY+FTT+IEA L RQT+SD
Sbjct: 128 L-----DNLKAISSSSIKAKSPPSPDQALLGAIIAGVIAVVLYRFTTSIEATLGRQTLSD 182
Query: 186 NFSV 189
NFSV
Sbjct: 183 NFSV 186
>gi|115445727|ref|NP_001046643.1| Os02g0307800 [Oryza sativa Japonica Group]
gi|46390708|dbj|BAD16208.1| unknown protein [Oryza sativa Japonica Group]
gi|113536174|dbj|BAF08557.1| Os02g0307800 [Oryza sativa Japonica Group]
gi|125581805|gb|EAZ22736.1| hypothetical protein OsJ_06408 [Oryza sativa Japonica Group]
gi|215692736|dbj|BAG88156.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215694440|dbj|BAG89457.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 273
Score = 144 bits (364), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 94/167 (56%), Positives = 115/167 (68%), Gaps = 9/167 (5%)
Query: 29 PTISLSAFPRRRPLI---EPYCLAQAQEPTTNITAPTTSEEGPVELPQ-SIFATTDEPSS 84
P S SA RR + EP A EP +GPVEL ++F+ + P+
Sbjct: 26 PAASASALRLRRSVAVRAEPELSTSAAEPPPGDDG---EGDGPVELRTPTLFSIDENPTP 82
Query: 85 LQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKK--SLKDEALDNLKALGSSSI 142
LQ ATSVLLTGAISVFLFR++RRR +RAKEL+ RS G +K +L EAL+ L+ + +S I
Sbjct: 83 LQTATSVLLTGAISVFLFRSIRRRVRRAKELRVRSGGVEKPNNLSKEALEGLRLVSASPI 142
Query: 143 DAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSV 189
+ PPSPVQALLGG+ AGVIA+ILYKFTTTIEAALNRQTISD+FSV
Sbjct: 143 EVDKPPSPVQALLGGIAAGVIALILYKFTTTIEAALNRQTISDSFSV 189
>gi|357144153|ref|XP_003573191.1| PREDICTED: uncharacterized protein LOC100830225 [Brachypodium
distachyon]
Length = 284
Score = 141 bits (356), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 96/163 (58%), Positives = 117/163 (71%), Gaps = 8/163 (4%)
Query: 35 AFPRRRPLI-----EPYCLAQAQEPTTNITAPTTSEEGPVEL-PQSIFATTDEPSSLQVA 88
A P RRP EP A + T EEGPVEL ++F+ D P+SLQVA
Sbjct: 28 ARPGRRPTALRAEPEPQSPPPASAAASPDTTDDGGEEGPVELRAPTLFSVDDNPTSLQVA 87
Query: 89 TSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKK--SLKDEALDNLKALGSSSIDAKG 146
TSVLLTGAISVFLFR+LRRRA+RAKEL+ RS G +K +L +EAL+ L+ + +S ++ +
Sbjct: 88 TSVLLTGAISVFLFRSLRRRARRAKELRVRSGGLEKPKNLSEEALEALRMVSTSPVETEK 147
Query: 147 PPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSV 189
PPSPVQALLGG+ AGVIA+ LYKF TTIEA+LNRQTISDNFSV
Sbjct: 148 PPSPVQALLGGIAAGVIALFLYKFATTIEASLNRQTISDNFSV 190
>gi|242065424|ref|XP_002454001.1| hypothetical protein SORBIDRAFT_04g022880 [Sorghum bicolor]
gi|241933832|gb|EES06977.1| hypothetical protein SORBIDRAFT_04g022880 [Sorghum bicolor]
Length = 293
Score = 138 bits (348), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 90/149 (60%), Positives = 113/149 (75%), Gaps = 5/149 (3%)
Query: 45 PYCLAQAQEPTTNITAPTTSEEGPVEL-PQSIFATTDEPSSLQVATSVLLTGAISVFLFR 103
P A+ + P A +GPVEL ++F+T D P+ LQ ATS+LLTGAISVFLFR
Sbjct: 55 PVSAAEPEPPDAGAVA-EAEGQGPVELRAPTLFSTDDNPTPLQTATSLLLTGAISVFLFR 113
Query: 104 ALRRRAKRAKELKFRSSGAKK---SLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTA 160
+LRRRA+RAKEL+ RSSG KK +L +EAL+ L+ + +S I+ + PPSPVQ LLGG+ A
Sbjct: 114 SLRRRARRAKELRVRSSGVKKKPNNLTEEALEGLRLVSASPIETEKPPSPVQTLLGGIAA 173
Query: 161 GVIAIILYKFTTTIEAALNRQTISDNFSV 189
GVIA+ILYKF+TTIEAALNRQTISD+FSV
Sbjct: 174 GVIAVILYKFSTTIEAALNRQTISDSFSV 202
>gi|388511081|gb|AFK43606.1| unknown [Lotus japonicus]
Length = 251
Score = 137 bits (344), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 88/191 (46%), Positives = 109/191 (57%), Gaps = 46/191 (24%)
Query: 9 SLNFPFTVSHHPQK---------LNFLQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNIT 59
+L P T++HH L+FL +PTI P I P + E TT
Sbjct: 15 NLPVPLTLTHHHSSFSHPPPSHLLHFLHRPTI----IPHTISAIPPTLSSWLTELTTG-- 68
Query: 60 APTTSEEGPVELPQSIFATTDEPSS-LQVATSVLLTGAISVFLFRALRRRAKRAKELKFR 118
TD+PSS +QV +++L+TGAI++F FR L+RRAKRAKELKFR
Sbjct: 69 -------------------TDDPSSTIQVTSTILVTGAITLFFFRTLQRRAKRAKELKFR 109
Query: 119 SSGAKKSLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAAL 178
SSG KSL S+SI AK PSP QALLG + AGVIA+ILY+FTT++EA L
Sbjct: 110 SSGVNKSL-----------ASTSIKAKKRPSPDQALLGAIIAGVIAVILYRFTTSVEATL 158
Query: 179 NRQTISDNFSV 189
RQTISDNFSV
Sbjct: 159 YRQTISDNFSV 169
>gi|326511271|dbj|BAJ87649.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326513094|dbj|BAK06787.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326529293|dbj|BAK01040.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 281
Score = 134 bits (337), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 82/129 (63%), Positives = 105/129 (81%), Gaps = 3/129 (2%)
Query: 64 SEEGPVEL-PQSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGA 122
S +GPVE+ ++FA D P+ LQVATSV+LTGAISVFLFR+LRRRA+RAKEL+ RS G
Sbjct: 63 SGDGPVEIRAPTLFAVDDNPTPLQVATSVMLTGAISVFLFRSLRRRARRAKELRVRSGGV 122
Query: 123 KK--SLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNR 180
+K +L +EAL+ L+ + +S ++ PPSPVQALLGG+ AGVIA+ LYKF +T+EA+LNR
Sbjct: 123 EKPRNLSEEALEALRMVSTSPVETNKPPSPVQALLGGIAAGVIALFLYKFASTVEASLNR 182
Query: 181 QTISDNFSV 189
QTISDNFSV
Sbjct: 183 QTISDNFSV 191
>gi|357497633|ref|XP_003619105.1| hypothetical protein MTR_6g042030 [Medicago truncatula]
gi|355494120|gb|AES75323.1| hypothetical protein MTR_6g042030 [Medicago truncatula]
Length = 254
Score = 133 bits (334), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 72/116 (62%), Positives = 89/116 (76%), Gaps = 5/116 (4%)
Query: 75 IFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKKSLKDEALDNL 134
IF DE SS QV SV+L+ AI+VFLF ++RR KRAK+LK+RSSG KKS +L++
Sbjct: 69 IFDAIDESSSNQVTISVVLSVAIAVFLFPTIQRRIKRAKQLKYRSSGVKKS----SLNST 124
Query: 135 KAL-GSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSV 189
K L GS+ +K PPSP +ALLG + AGVIA+ILY+FTTTIEA+L RQTISDNFSV
Sbjct: 125 KGLMGSTRKKSKNPPSPDKALLGAVMAGVIAVILYRFTTTIEASLYRQTISDNFSV 180
>gi|226494037|ref|NP_001143499.1| uncharacterized protein LOC100276178 [Zea mays]
gi|195621548|gb|ACG32604.1| hypothetical protein [Zea mays]
gi|224036011|gb|ACN37081.1| unknown [Zea mays]
gi|413936044|gb|AFW70595.1| hypothetical protein ZEAMMB73_826587 [Zea mays]
Length = 285
Score = 133 bits (334), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 93/167 (55%), Positives = 123/167 (73%), Gaps = 7/167 (4%)
Query: 26 LQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNITAPTTSEEGPVEL-PQSIFATTDEPSS 84
+++PT + R PL P A+++ P + A +GPVEL ++F+T D P+
Sbjct: 32 VRRPTAIRAEAEVRPPL--PVSAAESEPP--DAGAVEAEGQGPVELRAPTLFSTDDNPTP 87
Query: 85 LQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKK--SLKDEALDNLKALGSSSI 142
LQ ATS+LLTGAISVFLFRALRRRA+RAKEL+ RSSG KK +L +EAL+ L+ + +S I
Sbjct: 88 LQTATSLLLTGAISVFLFRALRRRARRAKELRVRSSGLKKPNNLTEEALERLRLMSASPI 147
Query: 143 DAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSV 189
+ + SP+QALLGG+ AGVIA+ILYKF+TT+EAALNRQTISD+FSV
Sbjct: 148 ETEKATSPIQALLGGIAAGVIALILYKFSTTVEAALNRQTISDSFSV 194
>gi|218190566|gb|EEC72993.1| hypothetical protein OsI_06901 [Oryza sativa Indica Group]
Length = 325
Score = 129 bits (324), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 84/157 (53%), Positives = 104/157 (66%), Gaps = 33/157 (21%)
Query: 66 EGPVELPQ-SIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKEL--------- 115
+GPVEL ++F+ + P+ LQ ATSVLLTGAISVFLFR++RRRA+RAKEL
Sbjct: 85 DGPVELRTPTLFSIDENPTPLQTATSVLLTGAISVFLFRSIRRRARRAKELVIPSLLIHL 144
Query: 116 ---------------------KFRSSGAKK--SLKDEALDNLKALGSSSIDAKGPPSPVQ 152
+ RS G +K +L EAL+ L+ + +S I+ PPSPVQ
Sbjct: 145 RCAFSTGEHTWSTNPISFRDTEGRSGGVEKPNNLSKEALEGLRLVSASPIEVDKPPSPVQ 204
Query: 153 ALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSV 189
ALLGG+ AGVIA+ILYKFTTTIEAALNRQTISD+FSV
Sbjct: 205 ALLGGIAAGVIALILYKFTTTIEAALNRQTISDSFSV 241
>gi|168035549|ref|XP_001770272.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162678489|gb|EDQ64947.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 277
Score = 99.0 bits (245), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 55/127 (43%), Positives = 83/127 (65%), Gaps = 6/127 (4%)
Query: 66 EGPVELPQSIFA--TTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAK 123
+GP+ELP + E S LQVA SV+LTG I+V L R+LRRR+K+AKE +FRS+G
Sbjct: 88 DGPIELPPELLDPLAIPEASPLQVAASVVLTGLITVLLIRSLRRRSKKAKETRFRSTG-- 145
Query: 124 KSLKDEALDNLKALGSSSIDAKG-PPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQT 182
+K++A + AL + + + + PPS +Q G + AG IA++LYKFT T+E + +
Sbjct: 146 -EIKEDARKSAMALLNKAPEVETPPPSALQTFSGAVVAGFIALVLYKFTVTVEGSFTGKA 204
Query: 183 ISDNFSV 189
+S N+S+
Sbjct: 205 VSMNYSI 211
>gi|11994500|dbj|BAB02565.1| unnamed protein product [Arabidopsis thaliana]
Length = 135
Score = 78.6 bits (192), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 39/60 (65%), Positives = 46/60 (76%), Gaps = 1/60 (1%)
Query: 131 LDNLKALGSSSIDA-KGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSV 189
+DNLKAL S+ I+ PS QA LG + AGVIA+ILYKFT T+E+ LNRQTISDNFSV
Sbjct: 1 MDNLKALSSTPIEGGNSTPSAAQAFLGAIAAGVIALILYKFTVTVESGLNRQTISDNFSV 60
>gi|302785097|ref|XP_002974320.1| hypothetical protein SELMODRAFT_442467 [Selaginella moellendorffii]
gi|300157918|gb|EFJ24542.1| hypothetical protein SELMODRAFT_442467 [Selaginella moellendorffii]
Length = 1433
Score = 75.1 bits (183), Expect = 1e-11, Method: Composition-based stats.
Identities = 56/139 (40%), Positives = 81/139 (58%), Gaps = 13/139 (9%)
Query: 51 AQEPTTNITAPTTSEEGPVELPQSIFATTDEPSS-LQVATSVLLTGAISVFLFRALRRRA 109
AQ I ++S +GP+ELP + ++P+S LQ+A SV+LTGAISV L R+LRRR
Sbjct: 1244 AQVAEQAIADASSSLDGPIELPGDL----EQPTSPLQIAGSVVLTGAISVLLLRSLRRRY 1299
Query: 110 KRAKELKFRSSGAKKSLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYK 169
K AKE KK LK++A N+ A ++ + K PPS Q L G L AG IA +LY+
Sbjct: 1300 KEAKE--------KKPLKEQAKQNVLASLANQVVEKPPPSFEQTLYGALLAGAIAFVLYQ 1351
Query: 170 FTTTIEAALNRQTISDNFS 188
TT + + + + ++
Sbjct: 1352 CTTAVTGSFASKPLPTEYA 1370
>gi|159491108|ref|XP_001703515.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280439|gb|EDP06197.1| predicted protein [Chlamydomonas reinhardtii]
Length = 206
Score = 44.3 bits (103), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 28/93 (30%), Positives = 50/93 (53%), Gaps = 7/93 (7%)
Query: 97 ISVFLFRALRRRAKRAKELKFRSSGAKK-SLKDEALDNLKALGSSSIDAKGPPSPVQALL 155
+ +FLFR R+RA +A+ K A S+ DE + L A +G +P+ A L
Sbjct: 89 VGIFLFRLFRKRAAKARSEKIAGQLATGPSIFDELREKLGA------PKRGKATPMNAFL 142
Query: 156 GGLTAGVIAIILYKFTTTIEAALNRQTISDNFS 188
G + A +A L+ F+T ++A++ Q++ D ++
Sbjct: 143 GAVQALGLAAGLWFFSTKVQASIEAQSLPDGYT 175
>gi|145346872|ref|XP_001417906.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578134|gb|ABO96199.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 181
Score = 40.4 bits (93), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 29/111 (26%), Positives = 58/111 (52%), Gaps = 8/111 (7%)
Query: 79 TDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKKSLKDEALDNLKALG 138
T+E + LQ VL TG + RA+++R+ +A+E FR + + +D +A
Sbjct: 9 TEEATPLQNVFGVLFTGFSFWYFARAVQKRSGKARE--FRVANRLPEEERAKIDEERAKK 66
Query: 139 SSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSV 189
+ + A +Q+ GGLT I+++LY F + + A+ + + + ++ +V
Sbjct: 67 TKELTA------MQSFTGGLTGIGISVVLYAFASKVSASFDGKALPESETV 111
>gi|384247605|gb|EIE21091.1| hypothetical protein COCSUDRAFT_33788 [Coccomyxa subellipsoidea
C-169]
Length = 232
Score = 40.0 bits (92), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 25/94 (26%), Positives = 46/94 (48%), Gaps = 16/94 (17%)
Query: 97 ISVFLFRALRRRAKRAKELKFRSS--GAKKSLKDEALDNLKALGSSSIDAKGPPSPVQAL 154
++VF R L +RAK A E + S+ G + D ++ + +P+Q+L
Sbjct: 84 LAVFAVRLLTKRAKFATEERLASAAKGENERPPDPSIKRV--------------TPLQSL 129
Query: 155 LGGLTAGVIAIILYKFTTTIEAALNRQTISDNFS 188
+ AGV A +LY F++ ++ RQ + D ++
Sbjct: 130 WAAVQAGVFAYLLYLFSSNVDGYFERQPLPDQYT 163
>gi|307106044|gb|EFN54291.1| expressed protein [Chlorella variabilis]
Length = 223
Score = 39.7 bits (91), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 30/114 (26%), Positives = 51/114 (44%), Gaps = 10/114 (8%)
Query: 79 TDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRS-------SGAKKSLKDEAL 131
T +L+ V G + F R +RRA++A + S G S +E
Sbjct: 8 TGGSDTLKTVAGVAYIGVVIFFFLRLFQRRAQKATSERIASVADVAAYGGGADSDDEEDE 67
Query: 132 DNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISD 185
+ KA A +P+Q+ +G AGVI +L++ +T ++A +RQ + D
Sbjct: 68 EEQKA---KQQQAATEVTPLQSFIGTAQAGVICYLLFQASTAVDAYFDRQNLPD 118
>gi|302848926|ref|XP_002955994.1| hypothetical protein VOLCADRAFT_121472 [Volvox carteri f.
nagariensis]
gi|300258720|gb|EFJ42954.1| hypothetical protein VOLCADRAFT_121472 [Volvox carteri f.
nagariensis]
Length = 325
Score = 39.7 bits (91), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 26/93 (27%), Positives = 45/93 (48%), Gaps = 7/93 (7%)
Query: 97 ISVFLFRALRRRAKRAKELKFRSSG-AKKSLKDEALDNLKALGSSSIDAKGPPSPVQALL 155
+ +FLFR R+RA +AK + + + DE + A S +PV A +
Sbjct: 111 VGIFLFRLFRKRAAKAKTERIAGQAPSGPTFFDELRQKVGAPKQSKA------TPVNAFI 164
Query: 156 GGLTAGVIAIILYKFTTTIEAALNRQTISDNFS 188
G A V+A L+ FTT ++ ++ Q + D ++
Sbjct: 165 GCGQALVLAYGLWFFTTKVQGSIEGQALPDGYT 197
>gi|394988291|ref|ZP_10381129.1| hypothetical protein SCD_00693 [Sulfuricella denitrificans skB26]
gi|393792749|dbj|GAB70768.1| hypothetical protein SCD_00693 [Sulfuricella denitrificans skB26]
Length = 196
Score = 38.5 bits (88), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 31/117 (26%), Positives = 56/117 (47%), Gaps = 7/117 (5%)
Query: 64 SEEGPVELPQSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAK 123
++E +E+ + IFA D L++A+ TG +V+ RAL + EL +G +
Sbjct: 11 NQEPILEVLKEIFA--DRQRVLEIASG---TGQHAVYFGRALPHLTWQTSELAQNHAGIQ 65
Query: 124 KSLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNR 180
L D L N+ L +ID P P++++ A + I+ ++ T+ A + R
Sbjct: 66 AWLDDSLLPNV--LPPVAIDVSAPKWPIESVDAVFNANTVHIVAWQEVETMFAGIAR 120
>gi|255071307|ref|XP_002507735.1| predicted protein [Micromonas sp. RCC299]
gi|226523010|gb|ACO68993.1| predicted protein [Micromonas sp. RCC299]
Length = 292
Score = 37.0 bits (84), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 28/99 (28%), Positives = 54/99 (54%), Gaps = 8/99 (8%)
Query: 81 EPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKKSLKDEALDNLKALGSS 140
E + LQ A + T + + R +R+R AK+ + +S +K+ + L+ LK +
Sbjct: 121 EATPLQNAFGLTFTIFSAWYFLRVVRKRGNAAKKFRVANSLSKEERERRDLEELKRV--K 178
Query: 141 SIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALN 179
++A+ Q+ +GG+TA I+I+LY+F I+A+ +
Sbjct: 179 KLNAQ------QSFVGGITAIGISIVLYEFAQNIQASFD 211
>gi|342318945|gb|EGU10900.1| Proteophosphoglycan ppg4 [Rhodotorula glutinis ATCC 204091]
Length = 1616
Score = 37.0 bits (84), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 49/98 (50%), Gaps = 18/98 (18%)
Query: 17 SHHPQKLNFLQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNITAPTTSEEGPVELPQSIF 76
+ HP K+ + P +S S F RRRP P+C IT+PT S S+
Sbjct: 224 TRHPPKV---RPPRLSPSLFARRRPSFIPHCRC--------ITSPTPSHSHRTTPCNSLL 272
Query: 77 ATTDEPSSLQVATSVLLTGAISVFL-FRALRRRAKRAK 113
P L++A VL+ GA++V L + LRRR +RA+
Sbjct: 273 -----PMLLRLAV-VLVYGAVAVSLPLQPLRRRVRRAE 304
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.131 0.365
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,720,084,199
Number of Sequences: 23463169
Number of extensions: 99431418
Number of successful extensions: 349558
Number of sequences better than 100.0: 87
Number of HSP's better than 100.0 without gapping: 26
Number of HSP's successfully gapped in prelim test: 61
Number of HSP's that attempted gapping in prelim test: 349491
Number of HSP's gapped (non-prelim): 93
length of query: 198
length of database: 8,064,228,071
effective HSP length: 135
effective length of query: 63
effective length of database: 9,191,667,552
effective search space: 579075055776
effective search space used: 579075055776
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 73 (32.7 bits)