BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 036090
(177 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255537843|ref|XP_002509988.1| conserved hypothetical protein [Ricinus communis]
gi|223549887|gb|EEF51375.1| conserved hypothetical protein [Ricinus communis]
Length = 177
Score = 150 bits (379), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 102/190 (53%), Positives = 123/190 (64%), Gaps = 26/190 (13%)
Query: 1 MAAALLSLPAATRHETPVSQRS-STSSGYLIFHRSLSVASIKCPYNNRPTVLLCSAATSV 59
MA L+S+P T ++T SQR +T+ L R S+ SI + NR L S+++S
Sbjct: 1 MAITLVSVP--TINQTLSSQRPYNTNLNTLTLTRYSSIKSIT--FINR---LCSSSSSST 53
Query: 60 SVVED-----------VSSPLPPDAAATEGADKVGFRGCKVCGKEELEKGCNGEGTIQGG 108
SVVE+ V SPLPP AD+V GCK CG++E+EKGCNGEG IQGG
Sbjct: 54 SVVEEDGPASTPDDLPVQSPLPP-----VNADEVPVGGCKACGRQEIEKGCNGEGRIQGG 108
Query: 109 IATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTMDEVAFGRGEKRAPVSSDSENIDPSK 168
IATV GFGWWPIKAYRPCPGF+ASGGRYRRQGQ+MDEV GRG PV S +E ++ SK
Sbjct: 109 IATVSGFGWWPIKAYRPCPGFVASGGRYRRQGQSMDEVVSGRGNPGTPVGSSTE-VESSK 167
Query: 169 SRGG-RKFKR 177
R RKFKR
Sbjct: 168 ERQDPRKFKR 177
>gi|225426288|ref|XP_002267043.1| PREDICTED: uncharacterized protein LOC100264984 [Vitis vinifera]
gi|297742356|emb|CBI34505.3| unnamed protein product [Vitis vinifera]
Length = 167
Score = 146 bits (369), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 69/95 (72%), Positives = 78/95 (82%)
Query: 68 PLPPDAAATEGADKVGFRGCKVCGKEELEKGCNGEGTIQGGIATVPGFGWWPIKAYRPCP 127
PLP DAAA + GCK CG+EE+EKGCNGEG IQGGI TVPGFGWWPIKAYRPCP
Sbjct: 60 PLPSDAAAPLQSQDPPLGGCKACGREEMEKGCNGEGRIQGGIGTVPGFGWWPIKAYRPCP 119
Query: 128 GFLASGGRYRRQGQTMDEVAFGRGEKRAPVSSDSE 162
GF+ASGGRYRR+GQ+MDEVAFGRG K A ++SD +
Sbjct: 120 GFVASGGRYRRRGQSMDEVAFGRGGKGASMASDED 154
>gi|358248722|ref|NP_001239929.1| uncharacterized protein LOC100788535 [Glycine max]
gi|255640951|gb|ACU20755.1| unknown [Glycine max]
Length = 150
Score = 144 bits (363), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 69/115 (60%), Positives = 83/115 (72%), Gaps = 11/115 (9%)
Query: 49 TVLLCSAATSVSVVEDVSSPLPPDAAATEGADKVGFRGCKVCGKEELEKGCNGEGTIQGG 108
TV LCS + SV+E+ SP P + GCK CG+EE+E+GCNGEG IQGG
Sbjct: 35 TVSLCSPSPPTSVIEEGPSPSPELTS-----------GCKACGREEVEQGCNGEGRIQGG 83
Query: 109 IATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTMDEVAFGRGEKRAPVSSDSEN 163
IATVPGFGWWPIKAYRPCP F+ASGGRYRRQGQ+MDEV G G+ + P+++DS
Sbjct: 84 IATVPGFGWWPIKAYRPCPAFVASGGRYRRQGQSMDEVVSGSGKTKTPIATDSNT 138
>gi|356513275|ref|XP_003525339.1| PREDICTED: uncharacterized protein LOC100810596 [Glycine max]
Length = 166
Score = 144 bits (363), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 67/116 (57%), Positives = 83/116 (71%), Gaps = 1/116 (0%)
Query: 50 VLLCSAATSVSVVEDVSSPLPPDAAATEGADKVGFRGCKVCGKEELEKGCNGEGTIQGGI 109
V LCS + S++E+ SP P + + + GCK CG+EE+E+GCNGEG IQGGI
Sbjct: 38 VPLCSPSPPTSIIEEGPSPSPDELPVVVDSSEFT-SGCKACGREEIEQGCNGEGRIQGGI 96
Query: 110 ATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTMDEVAFGRGEKRAPVSSDSENID 165
ATVPG GWWPIKAYRPCP F+ASGGRYRRQGQ+MDEV G G+ + P ++DS D
Sbjct: 97 ATVPGLGWWPIKAYRPCPAFMASGGRYRRQGQSMDEVVSGSGKSKTPTTTDSNTRD 152
>gi|56783848|dbj|BAD81260.1| unknown protein [Oryza sativa Japonica Group]
gi|56783951|dbj|BAD81388.1| unknown protein [Oryza sativa Japonica Group]
Length = 229
Score = 141 bits (355), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 67/101 (66%), Positives = 77/101 (76%), Gaps = 13/101 (12%)
Query: 80 DKVGFRGCKVCGKEELEKGCNGEGTIQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQ 139
D FRGC+ CG+EE+E+GCNGEG IQGGIATVPGFGWWPIKAYRPCPGF+ASGGRYRRQ
Sbjct: 90 DDPNFRGCRGCGREEVERGCNGEGRIQGGIATVPGFGWWPIKAYRPCPGFVASGGRYRRQ 149
Query: 140 GQTMDEVAFGRGEKRA----PVSS---------DSENIDPS 167
GQ+MD+VA GRG+KR PV D +N +PS
Sbjct: 150 GQSMDDVASGRGKKRELRGDPVQEISSLEMLDKDVQNFNPS 190
>gi|297596400|ref|NP_001042519.2| Os01g0235100 [Oryza sativa Japonica Group]
gi|255673036|dbj|BAF04433.2| Os01g0235100 [Oryza sativa Japonica Group]
Length = 179
Score = 140 bits (353), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 85/165 (51%), Positives = 99/165 (60%), Gaps = 12/165 (7%)
Query: 1 MAAAL-LSLPA----ATRHETPVSQRSSTSSGYLIF-HRSLSVASIKCPYNNRPTVLLCS 54
MAA+L L LPA A H PV + SS L H + SI C ++
Sbjct: 1 MAASLWLQLPAPPASAAVH-NPVFLTGNLSSSLLCKKHPQAARGSILCSSSSSSNSSASV 59
Query: 55 AATSVSVVEDVSSPLPPDAAATEGA-----DKVGFRGCKVCGKEELEKGCNGEGTIQGGI 109
V SS A +E D FRGC+ CG+EE+E+GCNGEG IQGGI
Sbjct: 60 VTKEQEAVAAASSSQEEGVAVSETEFVSYRDDPNFRGCRGCGREEVERGCNGEGRIQGGI 119
Query: 110 ATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTMDEVAFGRGEKR 154
ATVPGFGWWPIKAYRPCPGF+ASGGRYRRQGQ+MD+VA GRG+KR
Sbjct: 120 ATVPGFGWWPIKAYRPCPGFVASGGRYRRQGQSMDDVASGRGKKR 164
>gi|195615298|gb|ACG29479.1| hypothetical protein [Zea mays]
Length = 152
Score = 139 bits (351), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 68/110 (61%), Positives = 81/110 (73%), Gaps = 2/110 (1%)
Query: 47 RPTVLLCSA--ATSVSVVEDVSSPLPPDAAATEGADKVGFRGCKVCGKEELEKGCNGEGT 104
R LLC+A A+S VV P+ D FRGCK CG++EL++GCNG+G
Sbjct: 35 RGNRLLCTASGASSSVVVTKEQEEGSPEPEIFSYKDDPNFRGCKGCGRDELQRGCNGQGR 94
Query: 105 IQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTMDEVAFGRGEKR 154
IQGGIA VPGFGWWPIKAYRPCPGF+ASGGRYRRQGQ+MD+VA GRG+K+
Sbjct: 95 IQGGIAAVPGFGWWPIKAYRPCPGFVASGGRYRRQGQSMDDVASGRGKKK 144
>gi|218187828|gb|EEC70255.1| hypothetical protein OsI_01055 [Oryza sativa Indica Group]
Length = 180
Score = 139 bits (349), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 61/75 (81%), Positives = 69/75 (92%)
Query: 80 DKVGFRGCKVCGKEELEKGCNGEGTIQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQ 139
D FRGC+ CG+EE+E+GCNGEG IQGGIATVPGFGWWPIKAYRPCPGF+ASGGRYRRQ
Sbjct: 91 DDPNFRGCRGCGREEVERGCNGEGRIQGGIATVPGFGWWPIKAYRPCPGFVASGGRYRRQ 150
Query: 140 GQTMDEVAFGRGEKR 154
GQ+MD+VA GRG+KR
Sbjct: 151 GQSMDDVASGRGKKR 165
>gi|224072220|ref|XP_002303659.1| predicted protein [Populus trichocarpa]
gi|222841091|gb|EEE78638.1| predicted protein [Populus trichocarpa]
Length = 87
Score = 139 bits (349), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 64/78 (82%), Positives = 69/78 (88%)
Query: 86 GCKVCGKEELEKGCNGEGTIQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTMDE 145
GCK CG+EE+EKGCNGEG IQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQ+MDE
Sbjct: 7 GCKGCGREEIEKGCNGEGRIQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQSMDE 66
Query: 146 VAFGRGEKRAPVSSDSEN 163
VAFGRG ++ PV E
Sbjct: 67 VAFGRGGEKTPVVISDEG 84
>gi|224058178|ref|XP_002299460.1| predicted protein [Populus trichocarpa]
gi|222846718|gb|EEE84265.1| predicted protein [Populus trichocarpa]
Length = 88
Score = 139 bits (349), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 64/80 (80%), Positives = 69/80 (86%)
Query: 84 FRGCKVCGKEELEKGCNGEGTIQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTM 143
+ GCK CG+EE+EKGCNGEG IQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQ+M
Sbjct: 6 YSGCKGCGREEIEKGCNGEGRIQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQSM 65
Query: 144 DEVAFGRGEKRAPVSSDSEN 163
DEVAFGR +K PV E
Sbjct: 66 DEVAFGREQKETPVGISDEG 85
>gi|388499094|gb|AFK37613.1| unknown [Lotus japonicus]
Length = 103
Score = 137 bits (344), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 67/89 (75%), Positives = 75/89 (84%), Gaps = 3/89 (3%)
Query: 86 GCKVCGKEELEKGCNGEGTIQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTMDE 145
GCK CG+EE+E+GCNGEG IQGGIATVPGFGWWPIKAYRPCP FLASGGRYRRQGQ+MDE
Sbjct: 17 GCKACGREEIERGCNGEGRIQGGIATVPGFGWWPIKAYRPCPEFLASGGRYRRQGQSMDE 76
Query: 146 VAFGRGEKRAPVSSDSENIDPSKSRGGRK 174
VAFGRG + V+ DS+ PSK + G K
Sbjct: 77 VAFGRGPT-SSVAGDSKK--PSKKKEGSK 102
>gi|357128931|ref|XP_003566123.1| PREDICTED: uncharacterized protein LOC100836439 [Brachypodium
distachyon]
Length = 167
Score = 136 bits (343), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 68/117 (58%), Positives = 84/117 (71%), Gaps = 4/117 (3%)
Query: 51 LLCSAAT-SVSVVEDVSSP---LPPDAAATEGADKVGFRGCKVCGKEELEKGCNGEGTIQ 106
L+CS+++ SV ++ SP LP AA D FRGCK CG+EE E+GCNGEG I
Sbjct: 49 LICSSSSPSVVTKQEGDSPAKALPASAAPVSYKDDPNFRGCKGCGREETERGCNGEGRIM 108
Query: 107 GGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTMDEVAFGRGEKRAPVSSDSEN 163
GGIA VP FGWWPIKAYRPCPGF+ASGGRYRR GQ+MD+V G+G K AP + +++
Sbjct: 109 GGIAAVPLFGWWPIKAYRPCPGFVASGGRYRRYGQSMDDVIAGKGRKLAPSNKKNKS 165
>gi|449462523|ref|XP_004148990.1| PREDICTED: uncharacterized protein LOC101208661 isoform 1 [Cucumis
sativus]
gi|449506193|ref|XP_004162679.1| PREDICTED: uncharacterized LOC101208661 isoform 1 [Cucumis sativus]
Length = 165
Score = 135 bits (340), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 67/115 (58%), Positives = 81/115 (70%), Gaps = 6/115 (5%)
Query: 61 VVEDVSSPLPPDAAATEGA---DKVGFRGCKVCGKEELEKGCNGEGTIQGGIATVPGFGW 117
VV++ + PD + + D FRGCK CGKEE+E+GCNGEG +QGGIATVPGFGW
Sbjct: 48 VVDEELNSSSPDQLSLQSESDPDDSSFRGCKACGKEEIERGCNGEGRMQGGIATVPGFGW 107
Query: 118 WPIKAYRPCPGFLASGGRYRRQGQTMDEVAFGRGEKRAPVS---SDSENIDPSKS 169
WPIKAYRPCPGF+ASGGRY+R+GQ+MDEV G G A S DS +P+ S
Sbjct: 108 WPIKAYRPCPGFVASGGRYKRRGQSMDEVTSGGGRTDASASVGRKDSSRYNPAIS 162
>gi|449462525|ref|XP_004148991.1| PREDICTED: uncharacterized protein LOC101208661 isoform 2 [Cucumis
sativus]
gi|449506197|ref|XP_004162680.1| PREDICTED: uncharacterized LOC101208661 isoform 2 [Cucumis sativus]
Length = 158
Score = 134 bits (337), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 63/101 (62%), Positives = 75/101 (74%), Gaps = 3/101 (2%)
Query: 61 VVEDVSSPLPPDAAATEGA---DKVGFRGCKVCGKEELEKGCNGEGTIQGGIATVPGFGW 117
VV++ + PD + + D FRGCK CGKEE+E+GCNGEG +QGGIATVPGFGW
Sbjct: 48 VVDEELNSSSPDQLSLQSESDPDDSSFRGCKACGKEEIERGCNGEGRMQGGIATVPGFGW 107
Query: 118 WPIKAYRPCPGFLASGGRYRRQGQTMDEVAFGRGEKRAPVS 158
WPIKAYRPCPGF+ASGGRY+R+GQ+MDEV G G A S
Sbjct: 108 WPIKAYRPCPGFVASGGRYKRRGQSMDEVTSGGGRTDASAS 148
>gi|195610174|gb|ACG26917.1| hypothetical protein [Zea mays]
Length = 157
Score = 133 bits (335), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 58/75 (77%), Positives = 68/75 (90%)
Query: 80 DKVGFRGCKVCGKEELEKGCNGEGTIQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQ 139
D FRGCK CG++EL++GCNG+G IQGGIA VPGFGWWPIKAYRPCPGF+ASGGRYRRQ
Sbjct: 75 DDPNFRGCKGCGRDELQRGCNGQGRIQGGIAAVPGFGWWPIKAYRPCPGFVASGGRYRRQ 134
Query: 140 GQTMDEVAFGRGEKR 154
GQ+MD+VA GRG+K+
Sbjct: 135 GQSMDDVASGRGKKK 149
>gi|222618058|gb|EEE54190.1| hypothetical protein OsJ_01022 [Oryza sativa Japonica Group]
Length = 179
Score = 133 bits (334), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 82/165 (49%), Positives = 97/165 (58%), Gaps = 12/165 (7%)
Query: 1 MAAAL-LSLPA----ATRHETPVSQRSSTSSGYLIF-HRSLSVASIKCPYNNRPTVLLCS 54
MAA+L L LPA A H PV + SS L H + SI C ++
Sbjct: 1 MAASLWLQLPAPPASAAVH-NPVFLTGNLSSSLLCKKHPQAARGSILCSSSSSSNSSASV 59
Query: 55 AATSVSVVEDVSSPLPPDAAATEGA-----DKVGFRGCKVCGKEELEKGCNGEGTIQGGI 109
V SS A +E D FRGCK G+EE+E+GCNG+G IQGGI
Sbjct: 60 VTKEQEAVAAASSSQEEGVAVSETEFVSYRDDPNFRGCKGSGREEVERGCNGKGRIQGGI 119
Query: 110 ATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTMDEVAFGRGEKR 154
ATVPGFGWWPIKAYRPCPGF+ASGGRYRRQGQ+MD +A G+G+KR
Sbjct: 120 ATVPGFGWWPIKAYRPCPGFVASGGRYRRQGQSMDNLASGKGKKR 164
>gi|297801356|ref|XP_002868562.1| hypothetical protein ARALYDRAFT_330331 [Arabidopsis lyrata subsp.
lyrata]
gi|297314398|gb|EFH44821.1| hypothetical protein ARALYDRAFT_330331 [Arabidopsis lyrata subsp.
lyrata]
Length = 169
Score = 131 bits (330), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 64/118 (54%), Positives = 78/118 (66%), Gaps = 13/118 (11%)
Query: 44 YNNRPTVLLCSAATSVSVVEDVSSPLPPDAAATEGADKVGFRGCKVCGKEELEKGCNGEG 103
+++ P + L S+ SV D L F GC+ CGKEE E GCNG+G
Sbjct: 40 HHHHPLISLSSSPASVVETGDDEEDLI-------------FSGCRGCGKEEKESGCNGDG 86
Query: 104 TIQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTMDEVAFGRGEKRAPVSSDS 161
IQGGIATVPGFGWWPIKAYRPCP F+ +GGRYRR GQ+MDEVAFGRG+ ++ S D+
Sbjct: 87 RIQGGIATVPGFGWWPIKAYRPCPAFVEAGGRYRRIGQSMDEVAFGRGDSKSSTSIDT 144
>gi|18422170|ref|NP_568601.1| uncharacterized protein [Arabidopsis thaliana]
gi|20268798|gb|AAM14099.1| unknown protein [Arabidopsis thaliana]
gi|21281247|gb|AAM45102.1| unknown protein [Arabidopsis thaliana]
gi|21618110|gb|AAM67160.1| unknown [Arabidopsis thaliana]
gi|332007379|gb|AED94762.1| uncharacterized protein [Arabidopsis thaliana]
Length = 164
Score = 128 bits (321), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 55/71 (77%), Positives = 62/71 (87%)
Query: 84 FRGCKVCGKEELEKGCNGEGTIQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTM 143
F GC+ CGKEE E GCNG+G IQGGIATVPGFGWWPIKAYRPCP F+ +GGRYRR GQ+M
Sbjct: 68 FSGCRACGKEEKESGCNGDGRIQGGIATVPGFGWWPIKAYRPCPAFVEAGGRYRRIGQSM 127
Query: 144 DEVAFGRGEKR 154
DEVAFGRG+ +
Sbjct: 128 DEVAFGRGDSK 138
>gi|9757952|dbj|BAB08440.1| unnamed protein product [Arabidopsis thaliana]
Length = 155
Score = 128 bits (321), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 55/71 (77%), Positives = 62/71 (87%)
Query: 84 FRGCKVCGKEELEKGCNGEGTIQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTM 143
F GC+ CGKEE E GCNG+G IQGGIATVPGFGWWPIKAYRPCP F+ +GGRYRR GQ+M
Sbjct: 68 FSGCRACGKEEKESGCNGDGRIQGGIATVPGFGWWPIKAYRPCPAFVEAGGRYRRIGQSM 127
Query: 144 DEVAFGRGEKR 154
DEVAFGRG+ +
Sbjct: 128 DEVAFGRGDSK 138
>gi|116786747|gb|ABK24222.1| unknown [Picea sitchensis]
gi|224285784|gb|ACN40607.1| unknown [Picea sitchensis]
Length = 161
Score = 128 bits (321), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 57/96 (59%), Positives = 71/96 (73%), Gaps = 6/96 (6%)
Query: 65 VSSPLPP------DAAATEGADKVGFRGCKVCGKEELEKGCNGEGTIQGGIATVPGFGWW 118
+ +P+PP D A E GCK CG+ E+E+GCNGEG IQGGI T+PGF WW
Sbjct: 64 IPTPIPPTRVISEDGTAVEEEQGKVLVGCKACGRPEIERGCNGEGRIQGGIGTIPGFQWW 123
Query: 119 PIKAYRPCPGFLASGGRYRRQGQTMDEVAFGRGEKR 154
PIK +RPCPGF+ASGGRYRR+GQ+MDEVAFGR +++
Sbjct: 124 PIKVFRPCPGFVASGGRYRRRGQSMDEVAFGRDKEQ 159
>gi|116781571|gb|ABK22159.1| unknown [Picea sitchensis]
Length = 161
Score = 127 bits (319), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 57/92 (61%), Positives = 68/92 (73%), Gaps = 6/92 (6%)
Query: 65 VSSPLPP------DAAATEGADKVGFRGCKVCGKEELEKGCNGEGTIQGGIATVPGFGWW 118
+ +P+PP D A E GCK CG+ E+E+GCNGEG IQGGI T+PGF WW
Sbjct: 64 IPTPIPPTRVISEDGTAVEEEQGKVLVGCKACGRPEIERGCNGEGRIQGGIGTIPGFQWW 123
Query: 119 PIKAYRPCPGFLASGGRYRRQGQTMDEVAFGR 150
PIK +RPCPGF+ASGGRYRR+GQ+MDEVAFGR
Sbjct: 124 PIKVFRPCPGFVASGGRYRRRGQSMDEVAFGR 155
>gi|147820800|emb|CAN65190.1| hypothetical protein VITISV_032101 [Vitis vinifera]
Length = 320
Score = 120 bits (300), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 53/61 (86%), Positives = 57/61 (93%)
Query: 95 LEKGCNGEGTIQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTMDEVAFGRGEKR 154
+EKGCNGEG IQGGI TVPGFGWWPIKAYRPCPGF+ASGGRYRR+GQ+MDEVAFGRG K
Sbjct: 1 MEKGCNGEGRIQGGIGTVPGFGWWPIKAYRPCPGFVASGGRYRRRGQSMDEVAFGRGGKG 60
Query: 155 A 155
A
Sbjct: 61 A 61
>gi|168052317|ref|XP_001778597.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162670051|gb|EDQ56627.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 912
Score = 114 bits (285), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 53/96 (55%), Positives = 70/96 (72%), Gaps = 5/96 (5%)
Query: 59 VSVVEDVSSPLPPDAAATEGADKVGFRGCKVCGKEELEKGCNGEGTIQGGIATVPGFGWW 118
V+ VED S DA++++ C+ CGKE+ GCNGEG I GG+++VPGFGW+
Sbjct: 78 VTSVEDAS-----DASSSQPTRIFSPTACQACGKEDKVGGCNGEGRILGGLSSVPGFGWF 132
Query: 119 PIKAYRPCPGFLASGGRYRRQGQTMDEVAFGRGEKR 154
PIKAYRPCP F+ +GGRYRRQGQ+++EVAFGR K+
Sbjct: 133 PIKAYRPCPAFVEAGGRYRRQGQSLEEVAFGRMGKQ 168
>gi|302768933|ref|XP_002967886.1| hypothetical protein SELMODRAFT_69222 [Selaginella moellendorffii]
gi|300164624|gb|EFJ31233.1| hypothetical protein SELMODRAFT_69222 [Selaginella moellendorffii]
Length = 95
Score = 109 bits (273), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 47/65 (72%), Positives = 54/65 (83%)
Query: 86 GCKVCGKEELEKGCNGEGTIQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTMDE 145
GCK CG+ E+ GCNG G IQGGIA VPGFGWWPIKA+RPCP F+ SGGRY+RQGQ +DE
Sbjct: 10 GCKTCGRSEVMWGCNGLGRIQGGIAAVPGFGWWPIKAFRPCPAFVESGGRYKRQGQRLDE 69
Query: 146 VAFGR 150
A+GR
Sbjct: 70 FAYGR 74
>gi|367060918|gb|AEX11232.1| hypothetical protein 0_12630_01 [Pinus lambertiana]
Length = 134
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 36/59 (61%), Positives = 42/59 (71%)
Query: 72 DAAATEGADKVGFRGCKVCGKEELEKGCNGEGTIQGGIATVPGFGWWPIKAYRPCPGFL 130
D A E GCK CG+ E+E+GCNGEG IQGGI T+PGF WWPIK +RPCPGF+
Sbjct: 74 DGMAVEEEQGKVLGGCKACGRPEIERGCNGEGRIQGGIGTIPGFSWWPIKVFRPCPGFI 132
>gi|367060914|gb|AEX11230.1| hypothetical protein 0_12630_01 [Pinus taeda]
gi|367060916|gb|AEX11231.1| hypothetical protein 0_12630_01 [Pinus taeda]
Length = 134
Score = 86.7 bits (213), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 35/59 (59%), Positives = 42/59 (71%)
Query: 72 DAAATEGADKVGFRGCKVCGKEELEKGCNGEGTIQGGIATVPGFGWWPIKAYRPCPGFL 130
D A E GCK CG+ E+++GCNGEG IQGGI T+PGF WWPIK +RPCPGF+
Sbjct: 74 DGMAVEEEQGKVLGGCKACGRPEIKRGCNGEGRIQGGIGTIPGFSWWPIKVFRPCPGFI 132
>gi|308800612|ref|XP_003075087.1| CccP Conserved Cys-containing protein (IC) [Ostreococcus tauri]
gi|116061641|emb|CAL52359.1| CccP Conserved Cys-containing protein (IC) [Ostreococcus tauri]
Length = 112
Score = 82.8 bits (203), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 54/88 (61%), Gaps = 1/88 (1%)
Query: 66 SSPLPPDAAATEGADKVGFRGCKVCGKEELEKGCNGEGTIQGGIATVPGFGWWPIKAYRP 125
SS + P A+A++ A K C CG + C+G G IQGG+ +P F WWPIKAYRP
Sbjct: 25 SSAVVPRASASDDATKC-TGSCDSCGLARDKMQCDGSGRIQGGMGAIPLFSWWPIKAYRP 83
Query: 126 CPGFLASGGRYRRQGQTMDEVAFGRGEK 153
CPG + +GG Y R+GQ ++ + +G K
Sbjct: 84 CPGLIENGGVYTRKGQDVNSILWGESGK 111
>gi|307111329|gb|EFN59563.1| hypothetical protein CHLNCDRAFT_49318 [Chlorella variabilis]
Length = 130
Score = 82.8 bits (203), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 39/70 (55%), Positives = 46/70 (65%), Gaps = 2/70 (2%)
Query: 87 CKVCGKEELEK--GCNGEGTIQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTMD 144
C CG +K GC+G G I GGI VPGF WWPIKAYRPCP +G Y+R+GQ D
Sbjct: 61 CPTCGVALDKKAFGCDGTGRIAGGIGAVPGFKWWPIKAYRPCPRLAEAGMAYQRKGQITD 120
Query: 145 EVAFGRGEKR 154
EV FGRG ++
Sbjct: 121 EVLFGRGRQQ 130
>gi|412990045|emb|CCO20687.1| predicted protein [Bathycoccus prasinos]
Length = 117
Score = 81.3 bits (199), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 34/63 (53%), Positives = 43/63 (68%)
Query: 87 CKVCGKEELEKGCNGEGTIQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTMDEV 146
C CG + C+G G IQGG+A +P F WWPIKAYRPCPG + SGG Y R+GQ +D +
Sbjct: 50 CDSCGVAREKMQCDGTGRIQGGLAAIPLFSWWPIKAYRPCPGLIESGGTYTRKGQDVDSI 109
Query: 147 AFG 149
+G
Sbjct: 110 LWG 112
>gi|255081949|ref|XP_002508193.1| predicted protein [Micromonas sp. RCC299]
gi|226523469|gb|ACO69451.1| predicted protein [Micromonas sp. RCC299]
Length = 122
Score = 81.3 bits (199), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 37/74 (50%), Positives = 47/74 (63%), Gaps = 1/74 (1%)
Query: 77 EGADKVGFRG-CKVCGKEELEKGCNGEGTIQGGIATVPGFGWWPIKAYRPCPGFLASGGR 135
E D+ G G C CG + + C+G G I GG+ +P F WWPIKAYRPCPG + SGG
Sbjct: 45 ESKDQAGCTGSCDSCGLAKEKMQCDGSGRIIGGMGAIPLFSWWPIKAYRPCPGLIESGGV 104
Query: 136 YRRQGQTMDEVAFG 149
Y R+GQ +D + FG
Sbjct: 105 YVRKGQDVDGILFG 118
>gi|303277541|ref|XP_003058064.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226460721|gb|EEH58015.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 112
Score = 81.3 bits (199), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 37/76 (48%), Positives = 46/76 (60%)
Query: 74 AATEGADKVGFRGCKVCGKEELEKGCNGEGTIQGGIATVPGFGWWPIKAYRPCPGFLASG 133
AA DK C CG + + C+G G I GG+ +P F WWPIKAYRPCPG + SG
Sbjct: 33 AAGAAEDKACTGSCDSCGLAKEKMQCDGSGRIIGGMGAIPIFSWWPIKAYRPCPGLIESG 92
Query: 134 GRYRRQGQTMDEVAFG 149
G Y R+GQ +D + FG
Sbjct: 93 GVYVRKGQDVDGILFG 108
>gi|145345376|ref|XP_001417189.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577416|gb|ABO95482.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 122
Score = 77.8 bits (190), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 32/67 (47%), Positives = 44/67 (65%)
Query: 87 CKVCGKEELEKGCNGEGTIQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTMDEV 146
C CG + + C+G G IQGG+ +P F WWPIKAYRPCPG + +GG Y R+GQ ++ +
Sbjct: 55 CDSCGLAKEKMQCDGSGRIQGGMGAIPLFAWWPIKAYRPCPGLIDNGGVYTRKGQDVNAI 114
Query: 147 AFGRGEK 153
+G K
Sbjct: 115 LWGESGK 121
>gi|57335312|emb|CAH04624.1| hypothetical protein precursor [Guillardia theta]
Length = 225
Score = 77.4 bits (189), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 31/54 (57%), Positives = 39/54 (72%)
Query: 99 CNGEGTIQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTMDEVAFGRGE 152
C G G IQGG+AT+PG WWPIKA+RPCP +G Y R+GQ +DEV FG+ +
Sbjct: 141 CGGTGRIQGGLATLPGLAWWPIKAFRPCPTCEKAGRSYNRRGQVLDEVLFGKND 194
>gi|219120546|ref|XP_002181009.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407725|gb|EEC47661.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 119
Score = 73.6 bits (179), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 33/52 (63%), Positives = 37/52 (71%), Gaps = 2/52 (3%)
Query: 99 CNGEGTIQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTMDEVAFGR 150
C+G G I GGIA V WWPIKAYRPCP F+ GG Y R GQ +DE+AFGR
Sbjct: 64 CDGSGRILGGIAVV--LPWWPIKAYRPCPNFINRGGIYERSGQGLDEIAFGR 113
>gi|298708826|emb|CBJ30785.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 164
Score = 69.3 bits (168), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 49/130 (37%), Positives = 68/130 (52%), Gaps = 23/130 (17%)
Query: 24 TSSGYLIFHRSLSVASIKCPYNNRPTVLLCSAATSVSVVEDVSSPLPPDAAATE--GADK 81
+++G+L+ RS P +R T C AT++ E+ DAA T G+
Sbjct: 18 SATGFLVVPRSP-------PNFSRQT---CRTATALHAQEE-------DAAVTAPPGSSD 60
Query: 82 VGFRGCKVCGKEELEKGCNGEGTIQGGI-ATVPGFGWWPIKAYRPCPGFLASGGRYRRQG 140
G C E ++GC+G G I GG+ A VP W PIKAYRPCP F + +Y RQG
Sbjct: 61 DACPGYPRCSGEYRDRGCDGTGRISGGVGALVP---WLPIKAYRPCPSFTEAKYKYTRQG 117
Query: 141 QTMDEVAFGR 150
Q+++EV F R
Sbjct: 118 QSLNEVVFAR 127
>gi|449017078|dbj|BAM80480.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 119
Score = 68.2 bits (165), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 34/56 (60%), Positives = 39/56 (69%), Gaps = 1/56 (1%)
Query: 97 KGCNGEGTIQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTM-DEVAFGRG 151
K C G G +QGG+A V GF WWPIKAYRPCP A RYRR+GQ++ D V GRG
Sbjct: 55 KLCGGSGRLQGGLAAVRGFHWWPIKAYRPCPACEAGRLRYRRRGQSLQDFVGNGRG 110
>gi|159487625|ref|XP_001701823.1| predicted protein [Chlamydomonas reinhardtii]
gi|158281042|gb|EDP06798.1| predicted protein [Chlamydomonas reinhardtii]
Length = 110
Score = 62.8 bits (151), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 28/67 (41%), Positives = 38/67 (56%)
Query: 105 IQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTMDEVAFGRGEKRAPVSSDSENI 164
+ GG+ +PGFGWWPIKAY+PCP +G Y R+GQ ++V FG A + E I
Sbjct: 40 VIGGMGALPGFGWWPIKAYKPCPALNEAGLEYTRKGQMTNDVLFGGVSLGAQGAQSLEEI 99
Query: 165 DPSKSRG 171
+G
Sbjct: 100 KKQADKG 106
>gi|134099934|ref|YP_001105595.1| hypothetical protein SACE_3395 [Saccharopolyspora erythraea NRRL
2338]
gi|291008671|ref|ZP_06566644.1| hypothetical protein SeryN2_29463 [Saccharopolyspora erythraea NRRL
2338]
gi|133912557|emb|CAM02670.1| hypothetical protein SACE_3395 [Saccharopolyspora erythraea NRRL
2338]
Length = 117
Score = 36.2 bits (82), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 23/82 (28%), Positives = 37/82 (45%), Gaps = 14/82 (17%)
Query: 64 DVSSPLPPDAAATEGADKVGFRGCKVCGKEELEKGCNGEGTIQGGIATVPGFGWWPIKAY 123
D ++P P +A E AD + ++EL G ++ G+ P GW P+
Sbjct: 6 DATTPSPDEAPTQEPADPANLQSAPDLDEDEL-----GVDPLESGVE--PAEGWSPVAEG 58
Query: 124 RPCPGFLASGGRYRRQGQTMDE 145
RP P R +R+G ++DE
Sbjct: 59 RPTP-------REQREGDSIDE 73
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.315 0.132 0.403
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,920,950,819
Number of Sequences: 23463169
Number of extensions: 126307565
Number of successful extensions: 266405
Number of sequences better than 100.0: 46
Number of HSP's better than 100.0 without gapping: 38
Number of HSP's successfully gapped in prelim test: 8
Number of HSP's that attempted gapping in prelim test: 266359
Number of HSP's gapped (non-prelim): 46
length of query: 177
length of database: 8,064,228,071
effective HSP length: 133
effective length of query: 44
effective length of database: 9,238,593,890
effective search space: 406498131160
effective search space used: 406498131160
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 71 (32.0 bits)