BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 036090
         (177 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255537843|ref|XP_002509988.1| conserved hypothetical protein [Ricinus communis]
 gi|223549887|gb|EEF51375.1| conserved hypothetical protein [Ricinus communis]
          Length = 177

 Score =  150 bits (379), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 102/190 (53%), Positives = 123/190 (64%), Gaps = 26/190 (13%)

Query: 1   MAAALLSLPAATRHETPVSQRS-STSSGYLIFHRSLSVASIKCPYNNRPTVLLCSAATSV 59
           MA  L+S+P  T ++T  SQR  +T+   L   R  S+ SI   + NR   L  S+++S 
Sbjct: 1   MAITLVSVP--TINQTLSSQRPYNTNLNTLTLTRYSSIKSIT--FINR---LCSSSSSST 53

Query: 60  SVVED-----------VSSPLPPDAAATEGADKVGFRGCKVCGKEELEKGCNGEGTIQGG 108
           SVVE+           V SPLPP       AD+V   GCK CG++E+EKGCNGEG IQGG
Sbjct: 54  SVVEEDGPASTPDDLPVQSPLPP-----VNADEVPVGGCKACGRQEIEKGCNGEGRIQGG 108

Query: 109 IATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTMDEVAFGRGEKRAPVSSDSENIDPSK 168
           IATV GFGWWPIKAYRPCPGF+ASGGRYRRQGQ+MDEV  GRG    PV S +E ++ SK
Sbjct: 109 IATVSGFGWWPIKAYRPCPGFVASGGRYRRQGQSMDEVVSGRGNPGTPVGSSTE-VESSK 167

Query: 169 SRGG-RKFKR 177
            R   RKFKR
Sbjct: 168 ERQDPRKFKR 177


>gi|225426288|ref|XP_002267043.1| PREDICTED: uncharacterized protein LOC100264984 [Vitis vinifera]
 gi|297742356|emb|CBI34505.3| unnamed protein product [Vitis vinifera]
          Length = 167

 Score =  146 bits (369), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 69/95 (72%), Positives = 78/95 (82%)

Query: 68  PLPPDAAATEGADKVGFRGCKVCGKEELEKGCNGEGTIQGGIATVPGFGWWPIKAYRPCP 127
           PLP DAAA   +      GCK CG+EE+EKGCNGEG IQGGI TVPGFGWWPIKAYRPCP
Sbjct: 60  PLPSDAAAPLQSQDPPLGGCKACGREEMEKGCNGEGRIQGGIGTVPGFGWWPIKAYRPCP 119

Query: 128 GFLASGGRYRRQGQTMDEVAFGRGEKRAPVSSDSE 162
           GF+ASGGRYRR+GQ+MDEVAFGRG K A ++SD +
Sbjct: 120 GFVASGGRYRRRGQSMDEVAFGRGGKGASMASDED 154


>gi|358248722|ref|NP_001239929.1| uncharacterized protein LOC100788535 [Glycine max]
 gi|255640951|gb|ACU20755.1| unknown [Glycine max]
          Length = 150

 Score =  144 bits (363), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 69/115 (60%), Positives = 83/115 (72%), Gaps = 11/115 (9%)

Query: 49  TVLLCSAATSVSVVEDVSSPLPPDAAATEGADKVGFRGCKVCGKEELEKGCNGEGTIQGG 108
           TV LCS +   SV+E+  SP P   +           GCK CG+EE+E+GCNGEG IQGG
Sbjct: 35  TVSLCSPSPPTSVIEEGPSPSPELTS-----------GCKACGREEVEQGCNGEGRIQGG 83

Query: 109 IATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTMDEVAFGRGEKRAPVSSDSEN 163
           IATVPGFGWWPIKAYRPCP F+ASGGRYRRQGQ+MDEV  G G+ + P+++DS  
Sbjct: 84  IATVPGFGWWPIKAYRPCPAFVASGGRYRRQGQSMDEVVSGSGKTKTPIATDSNT 138


>gi|356513275|ref|XP_003525339.1| PREDICTED: uncharacterized protein LOC100810596 [Glycine max]
          Length = 166

 Score =  144 bits (363), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 67/116 (57%), Positives = 83/116 (71%), Gaps = 1/116 (0%)

Query: 50  VLLCSAATSVSVVEDVSSPLPPDAAATEGADKVGFRGCKVCGKEELEKGCNGEGTIQGGI 109
           V LCS +   S++E+  SP P +      + +    GCK CG+EE+E+GCNGEG IQGGI
Sbjct: 38  VPLCSPSPPTSIIEEGPSPSPDELPVVVDSSEFT-SGCKACGREEIEQGCNGEGRIQGGI 96

Query: 110 ATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTMDEVAFGRGEKRAPVSSDSENID 165
           ATVPG GWWPIKAYRPCP F+ASGGRYRRQGQ+MDEV  G G+ + P ++DS   D
Sbjct: 97  ATVPGLGWWPIKAYRPCPAFMASGGRYRRQGQSMDEVVSGSGKSKTPTTTDSNTRD 152


>gi|56783848|dbj|BAD81260.1| unknown protein [Oryza sativa Japonica Group]
 gi|56783951|dbj|BAD81388.1| unknown protein [Oryza sativa Japonica Group]
          Length = 229

 Score =  141 bits (355), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 67/101 (66%), Positives = 77/101 (76%), Gaps = 13/101 (12%)

Query: 80  DKVGFRGCKVCGKEELEKGCNGEGTIQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQ 139
           D   FRGC+ CG+EE+E+GCNGEG IQGGIATVPGFGWWPIKAYRPCPGF+ASGGRYRRQ
Sbjct: 90  DDPNFRGCRGCGREEVERGCNGEGRIQGGIATVPGFGWWPIKAYRPCPGFVASGGRYRRQ 149

Query: 140 GQTMDEVAFGRGEKRA----PVSS---------DSENIDPS 167
           GQ+MD+VA GRG+KR     PV           D +N +PS
Sbjct: 150 GQSMDDVASGRGKKRELRGDPVQEISSLEMLDKDVQNFNPS 190


>gi|297596400|ref|NP_001042519.2| Os01g0235100 [Oryza sativa Japonica Group]
 gi|255673036|dbj|BAF04433.2| Os01g0235100 [Oryza sativa Japonica Group]
          Length = 179

 Score =  140 bits (353), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 85/165 (51%), Positives = 99/165 (60%), Gaps = 12/165 (7%)

Query: 1   MAAAL-LSLPA----ATRHETPVSQRSSTSSGYLIF-HRSLSVASIKCPYNNRPTVLLCS 54
           MAA+L L LPA    A  H  PV    + SS  L   H   +  SI C  ++        
Sbjct: 1   MAASLWLQLPAPPASAAVH-NPVFLTGNLSSSLLCKKHPQAARGSILCSSSSSSNSSASV 59

Query: 55  AATSVSVVEDVSSPLPPDAAATEGA-----DKVGFRGCKVCGKEELEKGCNGEGTIQGGI 109
                  V   SS      A +E       D   FRGC+ CG+EE+E+GCNGEG IQGGI
Sbjct: 60  VTKEQEAVAAASSSQEEGVAVSETEFVSYRDDPNFRGCRGCGREEVERGCNGEGRIQGGI 119

Query: 110 ATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTMDEVAFGRGEKR 154
           ATVPGFGWWPIKAYRPCPGF+ASGGRYRRQGQ+MD+VA GRG+KR
Sbjct: 120 ATVPGFGWWPIKAYRPCPGFVASGGRYRRQGQSMDDVASGRGKKR 164


>gi|195615298|gb|ACG29479.1| hypothetical protein [Zea mays]
          Length = 152

 Score =  139 bits (351), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 68/110 (61%), Positives = 81/110 (73%), Gaps = 2/110 (1%)

Query: 47  RPTVLLCSA--ATSVSVVEDVSSPLPPDAAATEGADKVGFRGCKVCGKEELEKGCNGEGT 104
           R   LLC+A  A+S  VV        P+       D   FRGCK CG++EL++GCNG+G 
Sbjct: 35  RGNRLLCTASGASSSVVVTKEQEEGSPEPEIFSYKDDPNFRGCKGCGRDELQRGCNGQGR 94

Query: 105 IQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTMDEVAFGRGEKR 154
           IQGGIA VPGFGWWPIKAYRPCPGF+ASGGRYRRQGQ+MD+VA GRG+K+
Sbjct: 95  IQGGIAAVPGFGWWPIKAYRPCPGFVASGGRYRRQGQSMDDVASGRGKKK 144


>gi|218187828|gb|EEC70255.1| hypothetical protein OsI_01055 [Oryza sativa Indica Group]
          Length = 180

 Score =  139 bits (349), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 61/75 (81%), Positives = 69/75 (92%)

Query: 80  DKVGFRGCKVCGKEELEKGCNGEGTIQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQ 139
           D   FRGC+ CG+EE+E+GCNGEG IQGGIATVPGFGWWPIKAYRPCPGF+ASGGRYRRQ
Sbjct: 91  DDPNFRGCRGCGREEVERGCNGEGRIQGGIATVPGFGWWPIKAYRPCPGFVASGGRYRRQ 150

Query: 140 GQTMDEVAFGRGEKR 154
           GQ+MD+VA GRG+KR
Sbjct: 151 GQSMDDVASGRGKKR 165


>gi|224072220|ref|XP_002303659.1| predicted protein [Populus trichocarpa]
 gi|222841091|gb|EEE78638.1| predicted protein [Populus trichocarpa]
          Length = 87

 Score =  139 bits (349), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 64/78 (82%), Positives = 69/78 (88%)

Query: 86  GCKVCGKEELEKGCNGEGTIQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTMDE 145
           GCK CG+EE+EKGCNGEG IQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQ+MDE
Sbjct: 7   GCKGCGREEIEKGCNGEGRIQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQSMDE 66

Query: 146 VAFGRGEKRAPVSSDSEN 163
           VAFGRG ++ PV    E 
Sbjct: 67  VAFGRGGEKTPVVISDEG 84


>gi|224058178|ref|XP_002299460.1| predicted protein [Populus trichocarpa]
 gi|222846718|gb|EEE84265.1| predicted protein [Populus trichocarpa]
          Length = 88

 Score =  139 bits (349), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 64/80 (80%), Positives = 69/80 (86%)

Query: 84  FRGCKVCGKEELEKGCNGEGTIQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTM 143
           + GCK CG+EE+EKGCNGEG IQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQ+M
Sbjct: 6   YSGCKGCGREEIEKGCNGEGRIQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQSM 65

Query: 144 DEVAFGRGEKRAPVSSDSEN 163
           DEVAFGR +K  PV    E 
Sbjct: 66  DEVAFGREQKETPVGISDEG 85


>gi|388499094|gb|AFK37613.1| unknown [Lotus japonicus]
          Length = 103

 Score =  137 bits (344), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 67/89 (75%), Positives = 75/89 (84%), Gaps = 3/89 (3%)

Query: 86  GCKVCGKEELEKGCNGEGTIQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTMDE 145
           GCK CG+EE+E+GCNGEG IQGGIATVPGFGWWPIKAYRPCP FLASGGRYRRQGQ+MDE
Sbjct: 17  GCKACGREEIERGCNGEGRIQGGIATVPGFGWWPIKAYRPCPEFLASGGRYRRQGQSMDE 76

Query: 146 VAFGRGEKRAPVSSDSENIDPSKSRGGRK 174
           VAFGRG   + V+ DS+   PSK + G K
Sbjct: 77  VAFGRGPT-SSVAGDSKK--PSKKKEGSK 102


>gi|357128931|ref|XP_003566123.1| PREDICTED: uncharacterized protein LOC100836439 [Brachypodium
           distachyon]
          Length = 167

 Score =  136 bits (343), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 68/117 (58%), Positives = 84/117 (71%), Gaps = 4/117 (3%)

Query: 51  LLCSAAT-SVSVVEDVSSP---LPPDAAATEGADKVGFRGCKVCGKEELEKGCNGEGTIQ 106
           L+CS+++ SV   ++  SP   LP  AA     D   FRGCK CG+EE E+GCNGEG I 
Sbjct: 49  LICSSSSPSVVTKQEGDSPAKALPASAAPVSYKDDPNFRGCKGCGREETERGCNGEGRIM 108

Query: 107 GGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTMDEVAFGRGEKRAPVSSDSEN 163
           GGIA VP FGWWPIKAYRPCPGF+ASGGRYRR GQ+MD+V  G+G K AP +  +++
Sbjct: 109 GGIAAVPLFGWWPIKAYRPCPGFVASGGRYRRYGQSMDDVIAGKGRKLAPSNKKNKS 165


>gi|449462523|ref|XP_004148990.1| PREDICTED: uncharacterized protein LOC101208661 isoform 1 [Cucumis
           sativus]
 gi|449506193|ref|XP_004162679.1| PREDICTED: uncharacterized LOC101208661 isoform 1 [Cucumis sativus]
          Length = 165

 Score =  135 bits (340), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 67/115 (58%), Positives = 81/115 (70%), Gaps = 6/115 (5%)

Query: 61  VVEDVSSPLPPDAAATEGA---DKVGFRGCKVCGKEELEKGCNGEGTIQGGIATVPGFGW 117
           VV++  +   PD  + +     D   FRGCK CGKEE+E+GCNGEG +QGGIATVPGFGW
Sbjct: 48  VVDEELNSSSPDQLSLQSESDPDDSSFRGCKACGKEEIERGCNGEGRMQGGIATVPGFGW 107

Query: 118 WPIKAYRPCPGFLASGGRYRRQGQTMDEVAFGRGEKRAPVS---SDSENIDPSKS 169
           WPIKAYRPCPGF+ASGGRY+R+GQ+MDEV  G G   A  S    DS   +P+ S
Sbjct: 108 WPIKAYRPCPGFVASGGRYKRRGQSMDEVTSGGGRTDASASVGRKDSSRYNPAIS 162


>gi|449462525|ref|XP_004148991.1| PREDICTED: uncharacterized protein LOC101208661 isoform 2 [Cucumis
           sativus]
 gi|449506197|ref|XP_004162680.1| PREDICTED: uncharacterized LOC101208661 isoform 2 [Cucumis sativus]
          Length = 158

 Score =  134 bits (337), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 63/101 (62%), Positives = 75/101 (74%), Gaps = 3/101 (2%)

Query: 61  VVEDVSSPLPPDAAATEGA---DKVGFRGCKVCGKEELEKGCNGEGTIQGGIATVPGFGW 117
           VV++  +   PD  + +     D   FRGCK CGKEE+E+GCNGEG +QGGIATVPGFGW
Sbjct: 48  VVDEELNSSSPDQLSLQSESDPDDSSFRGCKACGKEEIERGCNGEGRMQGGIATVPGFGW 107

Query: 118 WPIKAYRPCPGFLASGGRYRRQGQTMDEVAFGRGEKRAPVS 158
           WPIKAYRPCPGF+ASGGRY+R+GQ+MDEV  G G   A  S
Sbjct: 108 WPIKAYRPCPGFVASGGRYKRRGQSMDEVTSGGGRTDASAS 148


>gi|195610174|gb|ACG26917.1| hypothetical protein [Zea mays]
          Length = 157

 Score =  133 bits (335), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 58/75 (77%), Positives = 68/75 (90%)

Query: 80  DKVGFRGCKVCGKEELEKGCNGEGTIQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQ 139
           D   FRGCK CG++EL++GCNG+G IQGGIA VPGFGWWPIKAYRPCPGF+ASGGRYRRQ
Sbjct: 75  DDPNFRGCKGCGRDELQRGCNGQGRIQGGIAAVPGFGWWPIKAYRPCPGFVASGGRYRRQ 134

Query: 140 GQTMDEVAFGRGEKR 154
           GQ+MD+VA GRG+K+
Sbjct: 135 GQSMDDVASGRGKKK 149


>gi|222618058|gb|EEE54190.1| hypothetical protein OsJ_01022 [Oryza sativa Japonica Group]
          Length = 179

 Score =  133 bits (334), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 82/165 (49%), Positives = 97/165 (58%), Gaps = 12/165 (7%)

Query: 1   MAAAL-LSLPA----ATRHETPVSQRSSTSSGYLIF-HRSLSVASIKCPYNNRPTVLLCS 54
           MAA+L L LPA    A  H  PV    + SS  L   H   +  SI C  ++        
Sbjct: 1   MAASLWLQLPAPPASAAVH-NPVFLTGNLSSSLLCKKHPQAARGSILCSSSSSSNSSASV 59

Query: 55  AATSVSVVEDVSSPLPPDAAATEGA-----DKVGFRGCKVCGKEELEKGCNGEGTIQGGI 109
                  V   SS      A +E       D   FRGCK  G+EE+E+GCNG+G IQGGI
Sbjct: 60  VTKEQEAVAAASSSQEEGVAVSETEFVSYRDDPNFRGCKGSGREEVERGCNGKGRIQGGI 119

Query: 110 ATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTMDEVAFGRGEKR 154
           ATVPGFGWWPIKAYRPCPGF+ASGGRYRRQGQ+MD +A G+G+KR
Sbjct: 120 ATVPGFGWWPIKAYRPCPGFVASGGRYRRQGQSMDNLASGKGKKR 164


>gi|297801356|ref|XP_002868562.1| hypothetical protein ARALYDRAFT_330331 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297314398|gb|EFH44821.1| hypothetical protein ARALYDRAFT_330331 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 169

 Score =  131 bits (330), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 64/118 (54%), Positives = 78/118 (66%), Gaps = 13/118 (11%)

Query: 44  YNNRPTVLLCSAATSVSVVEDVSSPLPPDAAATEGADKVGFRGCKVCGKEELEKGCNGEG 103
           +++ P + L S+  SV    D    L              F GC+ CGKEE E GCNG+G
Sbjct: 40  HHHHPLISLSSSPASVVETGDDEEDLI-------------FSGCRGCGKEEKESGCNGDG 86

Query: 104 TIQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTMDEVAFGRGEKRAPVSSDS 161
            IQGGIATVPGFGWWPIKAYRPCP F+ +GGRYRR GQ+MDEVAFGRG+ ++  S D+
Sbjct: 87  RIQGGIATVPGFGWWPIKAYRPCPAFVEAGGRYRRIGQSMDEVAFGRGDSKSSTSIDT 144


>gi|18422170|ref|NP_568601.1| uncharacterized protein [Arabidopsis thaliana]
 gi|20268798|gb|AAM14099.1| unknown protein [Arabidopsis thaliana]
 gi|21281247|gb|AAM45102.1| unknown protein [Arabidopsis thaliana]
 gi|21618110|gb|AAM67160.1| unknown [Arabidopsis thaliana]
 gi|332007379|gb|AED94762.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 164

 Score =  128 bits (321), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 55/71 (77%), Positives = 62/71 (87%)

Query: 84  FRGCKVCGKEELEKGCNGEGTIQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTM 143
           F GC+ CGKEE E GCNG+G IQGGIATVPGFGWWPIKAYRPCP F+ +GGRYRR GQ+M
Sbjct: 68  FSGCRACGKEEKESGCNGDGRIQGGIATVPGFGWWPIKAYRPCPAFVEAGGRYRRIGQSM 127

Query: 144 DEVAFGRGEKR 154
           DEVAFGRG+ +
Sbjct: 128 DEVAFGRGDSK 138


>gi|9757952|dbj|BAB08440.1| unnamed protein product [Arabidopsis thaliana]
          Length = 155

 Score =  128 bits (321), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 55/71 (77%), Positives = 62/71 (87%)

Query: 84  FRGCKVCGKEELEKGCNGEGTIQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTM 143
           F GC+ CGKEE E GCNG+G IQGGIATVPGFGWWPIKAYRPCP F+ +GGRYRR GQ+M
Sbjct: 68  FSGCRACGKEEKESGCNGDGRIQGGIATVPGFGWWPIKAYRPCPAFVEAGGRYRRIGQSM 127

Query: 144 DEVAFGRGEKR 154
           DEVAFGRG+ +
Sbjct: 128 DEVAFGRGDSK 138


>gi|116786747|gb|ABK24222.1| unknown [Picea sitchensis]
 gi|224285784|gb|ACN40607.1| unknown [Picea sitchensis]
          Length = 161

 Score =  128 bits (321), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 57/96 (59%), Positives = 71/96 (73%), Gaps = 6/96 (6%)

Query: 65  VSSPLPP------DAAATEGADKVGFRGCKVCGKEELEKGCNGEGTIQGGIATVPGFGWW 118
           + +P+PP      D  A E        GCK CG+ E+E+GCNGEG IQGGI T+PGF WW
Sbjct: 64  IPTPIPPTRVISEDGTAVEEEQGKVLVGCKACGRPEIERGCNGEGRIQGGIGTIPGFQWW 123

Query: 119 PIKAYRPCPGFLASGGRYRRQGQTMDEVAFGRGEKR 154
           PIK +RPCPGF+ASGGRYRR+GQ+MDEVAFGR +++
Sbjct: 124 PIKVFRPCPGFVASGGRYRRRGQSMDEVAFGRDKEQ 159


>gi|116781571|gb|ABK22159.1| unknown [Picea sitchensis]
          Length = 161

 Score =  127 bits (319), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 57/92 (61%), Positives = 68/92 (73%), Gaps = 6/92 (6%)

Query: 65  VSSPLPP------DAAATEGADKVGFRGCKVCGKEELEKGCNGEGTIQGGIATVPGFGWW 118
           + +P+PP      D  A E        GCK CG+ E+E+GCNGEG IQGGI T+PGF WW
Sbjct: 64  IPTPIPPTRVISEDGTAVEEEQGKVLVGCKACGRPEIERGCNGEGRIQGGIGTIPGFQWW 123

Query: 119 PIKAYRPCPGFLASGGRYRRQGQTMDEVAFGR 150
           PIK +RPCPGF+ASGGRYRR+GQ+MDEVAFGR
Sbjct: 124 PIKVFRPCPGFVASGGRYRRRGQSMDEVAFGR 155


>gi|147820800|emb|CAN65190.1| hypothetical protein VITISV_032101 [Vitis vinifera]
          Length = 320

 Score =  120 bits (300), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 53/61 (86%), Positives = 57/61 (93%)

Query: 95  LEKGCNGEGTIQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTMDEVAFGRGEKR 154
           +EKGCNGEG IQGGI TVPGFGWWPIKAYRPCPGF+ASGGRYRR+GQ+MDEVAFGRG K 
Sbjct: 1   MEKGCNGEGRIQGGIGTVPGFGWWPIKAYRPCPGFVASGGRYRRRGQSMDEVAFGRGGKG 60

Query: 155 A 155
           A
Sbjct: 61  A 61


>gi|168052317|ref|XP_001778597.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162670051|gb|EDQ56627.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 912

 Score =  114 bits (285), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 53/96 (55%), Positives = 70/96 (72%), Gaps = 5/96 (5%)

Query: 59  VSVVEDVSSPLPPDAAATEGADKVGFRGCKVCGKEELEKGCNGEGTIQGGIATVPGFGWW 118
           V+ VED S     DA++++         C+ CGKE+   GCNGEG I GG+++VPGFGW+
Sbjct: 78  VTSVEDAS-----DASSSQPTRIFSPTACQACGKEDKVGGCNGEGRILGGLSSVPGFGWF 132

Query: 119 PIKAYRPCPGFLASGGRYRRQGQTMDEVAFGRGEKR 154
           PIKAYRPCP F+ +GGRYRRQGQ+++EVAFGR  K+
Sbjct: 133 PIKAYRPCPAFVEAGGRYRRQGQSLEEVAFGRMGKQ 168


>gi|302768933|ref|XP_002967886.1| hypothetical protein SELMODRAFT_69222 [Selaginella moellendorffii]
 gi|300164624|gb|EFJ31233.1| hypothetical protein SELMODRAFT_69222 [Selaginella moellendorffii]
          Length = 95

 Score =  109 bits (273), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 47/65 (72%), Positives = 54/65 (83%)

Query: 86  GCKVCGKEELEKGCNGEGTIQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTMDE 145
           GCK CG+ E+  GCNG G IQGGIA VPGFGWWPIKA+RPCP F+ SGGRY+RQGQ +DE
Sbjct: 10  GCKTCGRSEVMWGCNGLGRIQGGIAAVPGFGWWPIKAFRPCPAFVESGGRYKRQGQRLDE 69

Query: 146 VAFGR 150
            A+GR
Sbjct: 70  FAYGR 74


>gi|367060918|gb|AEX11232.1| hypothetical protein 0_12630_01 [Pinus lambertiana]
          Length = 134

 Score = 88.2 bits (217), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 36/59 (61%), Positives = 42/59 (71%)

Query: 72  DAAATEGADKVGFRGCKVCGKEELEKGCNGEGTIQGGIATVPGFGWWPIKAYRPCPGFL 130
           D  A E        GCK CG+ E+E+GCNGEG IQGGI T+PGF WWPIK +RPCPGF+
Sbjct: 74  DGMAVEEEQGKVLGGCKACGRPEIERGCNGEGRIQGGIGTIPGFSWWPIKVFRPCPGFI 132


>gi|367060914|gb|AEX11230.1| hypothetical protein 0_12630_01 [Pinus taeda]
 gi|367060916|gb|AEX11231.1| hypothetical protein 0_12630_01 [Pinus taeda]
          Length = 134

 Score = 86.7 bits (213), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 35/59 (59%), Positives = 42/59 (71%)

Query: 72  DAAATEGADKVGFRGCKVCGKEELEKGCNGEGTIQGGIATVPGFGWWPIKAYRPCPGFL 130
           D  A E        GCK CG+ E+++GCNGEG IQGGI T+PGF WWPIK +RPCPGF+
Sbjct: 74  DGMAVEEEQGKVLGGCKACGRPEIKRGCNGEGRIQGGIGTIPGFSWWPIKVFRPCPGFI 132


>gi|308800612|ref|XP_003075087.1| CccP Conserved Cys-containing protein (IC) [Ostreococcus tauri]
 gi|116061641|emb|CAL52359.1| CccP Conserved Cys-containing protein (IC) [Ostreococcus tauri]
          Length = 112

 Score = 82.8 bits (203), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 54/88 (61%), Gaps = 1/88 (1%)

Query: 66  SSPLPPDAAATEGADKVGFRGCKVCGKEELEKGCNGEGTIQGGIATVPGFGWWPIKAYRP 125
           SS + P A+A++ A K     C  CG    +  C+G G IQGG+  +P F WWPIKAYRP
Sbjct: 25  SSAVVPRASASDDATKC-TGSCDSCGLARDKMQCDGSGRIQGGMGAIPLFSWWPIKAYRP 83

Query: 126 CPGFLASGGRYRRQGQTMDEVAFGRGEK 153
           CPG + +GG Y R+GQ ++ + +G   K
Sbjct: 84  CPGLIENGGVYTRKGQDVNSILWGESGK 111


>gi|307111329|gb|EFN59563.1| hypothetical protein CHLNCDRAFT_49318 [Chlorella variabilis]
          Length = 130

 Score = 82.8 bits (203), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 39/70 (55%), Positives = 46/70 (65%), Gaps = 2/70 (2%)

Query: 87  CKVCGKEELEK--GCNGEGTIQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTMD 144
           C  CG    +K  GC+G G I GGI  VPGF WWPIKAYRPCP    +G  Y+R+GQ  D
Sbjct: 61  CPTCGVALDKKAFGCDGTGRIAGGIGAVPGFKWWPIKAYRPCPRLAEAGMAYQRKGQITD 120

Query: 145 EVAFGRGEKR 154
           EV FGRG ++
Sbjct: 121 EVLFGRGRQQ 130


>gi|412990045|emb|CCO20687.1| predicted protein [Bathycoccus prasinos]
          Length = 117

 Score = 81.3 bits (199), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 34/63 (53%), Positives = 43/63 (68%)

Query: 87  CKVCGKEELEKGCNGEGTIQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTMDEV 146
           C  CG    +  C+G G IQGG+A +P F WWPIKAYRPCPG + SGG Y R+GQ +D +
Sbjct: 50  CDSCGVAREKMQCDGTGRIQGGLAAIPLFSWWPIKAYRPCPGLIESGGTYTRKGQDVDSI 109

Query: 147 AFG 149
            +G
Sbjct: 110 LWG 112


>gi|255081949|ref|XP_002508193.1| predicted protein [Micromonas sp. RCC299]
 gi|226523469|gb|ACO69451.1| predicted protein [Micromonas sp. RCC299]
          Length = 122

 Score = 81.3 bits (199), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 37/74 (50%), Positives = 47/74 (63%), Gaps = 1/74 (1%)

Query: 77  EGADKVGFRG-CKVCGKEELEKGCNGEGTIQGGIATVPGFGWWPIKAYRPCPGFLASGGR 135
           E  D+ G  G C  CG  + +  C+G G I GG+  +P F WWPIKAYRPCPG + SGG 
Sbjct: 45  ESKDQAGCTGSCDSCGLAKEKMQCDGSGRIIGGMGAIPLFSWWPIKAYRPCPGLIESGGV 104

Query: 136 YRRQGQTMDEVAFG 149
           Y R+GQ +D + FG
Sbjct: 105 YVRKGQDVDGILFG 118


>gi|303277541|ref|XP_003058064.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226460721|gb|EEH58015.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 112

 Score = 81.3 bits (199), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 37/76 (48%), Positives = 46/76 (60%)

Query: 74  AATEGADKVGFRGCKVCGKEELEKGCNGEGTIQGGIATVPGFGWWPIKAYRPCPGFLASG 133
           AA    DK     C  CG  + +  C+G G I GG+  +P F WWPIKAYRPCPG + SG
Sbjct: 33  AAGAAEDKACTGSCDSCGLAKEKMQCDGSGRIIGGMGAIPIFSWWPIKAYRPCPGLIESG 92

Query: 134 GRYRRQGQTMDEVAFG 149
           G Y R+GQ +D + FG
Sbjct: 93  GVYVRKGQDVDGILFG 108


>gi|145345376|ref|XP_001417189.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144577416|gb|ABO95482.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 122

 Score = 77.8 bits (190), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 32/67 (47%), Positives = 44/67 (65%)

Query: 87  CKVCGKEELEKGCNGEGTIQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTMDEV 146
           C  CG  + +  C+G G IQGG+  +P F WWPIKAYRPCPG + +GG Y R+GQ ++ +
Sbjct: 55  CDSCGLAKEKMQCDGSGRIQGGMGAIPLFAWWPIKAYRPCPGLIDNGGVYTRKGQDVNAI 114

Query: 147 AFGRGEK 153
            +G   K
Sbjct: 115 LWGESGK 121


>gi|57335312|emb|CAH04624.1| hypothetical protein precursor [Guillardia theta]
          Length = 225

 Score = 77.4 bits (189), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 31/54 (57%), Positives = 39/54 (72%)

Query: 99  CNGEGTIQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTMDEVAFGRGE 152
           C G G IQGG+AT+PG  WWPIKA+RPCP    +G  Y R+GQ +DEV FG+ +
Sbjct: 141 CGGTGRIQGGLATLPGLAWWPIKAFRPCPTCEKAGRSYNRRGQVLDEVLFGKND 194


>gi|219120546|ref|XP_002181009.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217407725|gb|EEC47661.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 119

 Score = 73.6 bits (179), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 33/52 (63%), Positives = 37/52 (71%), Gaps = 2/52 (3%)

Query: 99  CNGEGTIQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTMDEVAFGR 150
           C+G G I GGIA V    WWPIKAYRPCP F+  GG Y R GQ +DE+AFGR
Sbjct: 64  CDGSGRILGGIAVV--LPWWPIKAYRPCPNFINRGGIYERSGQGLDEIAFGR 113


>gi|298708826|emb|CBJ30785.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 164

 Score = 69.3 bits (168), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 49/130 (37%), Positives = 68/130 (52%), Gaps = 23/130 (17%)

Query: 24  TSSGYLIFHRSLSVASIKCPYNNRPTVLLCSAATSVSVVEDVSSPLPPDAAATE--GADK 81
           +++G+L+  RS        P  +R T   C  AT++   E+       DAA T   G+  
Sbjct: 18  SATGFLVVPRSP-------PNFSRQT---CRTATALHAQEE-------DAAVTAPPGSSD 60

Query: 82  VGFRGCKVCGKEELEKGCNGEGTIQGGI-ATVPGFGWWPIKAYRPCPGFLASGGRYRRQG 140
               G   C  E  ++GC+G G I GG+ A VP   W PIKAYRPCP F  +  +Y RQG
Sbjct: 61  DACPGYPRCSGEYRDRGCDGTGRISGGVGALVP---WLPIKAYRPCPSFTEAKYKYTRQG 117

Query: 141 QTMDEVAFGR 150
           Q+++EV F R
Sbjct: 118 QSLNEVVFAR 127


>gi|449017078|dbj|BAM80480.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 119

 Score = 68.2 bits (165), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 34/56 (60%), Positives = 39/56 (69%), Gaps = 1/56 (1%)

Query: 97  KGCNGEGTIQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTM-DEVAFGRG 151
           K C G G +QGG+A V GF WWPIKAYRPCP   A   RYRR+GQ++ D V  GRG
Sbjct: 55  KLCGGSGRLQGGLAAVRGFHWWPIKAYRPCPACEAGRLRYRRRGQSLQDFVGNGRG 110


>gi|159487625|ref|XP_001701823.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158281042|gb|EDP06798.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 110

 Score = 62.8 bits (151), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 28/67 (41%), Positives = 38/67 (56%)

Query: 105 IQGGIATVPGFGWWPIKAYRPCPGFLASGGRYRRQGQTMDEVAFGRGEKRAPVSSDSENI 164
           + GG+  +PGFGWWPIKAY+PCP    +G  Y R+GQ  ++V FG     A  +   E I
Sbjct: 40  VIGGMGALPGFGWWPIKAYKPCPALNEAGLEYTRKGQMTNDVLFGGVSLGAQGAQSLEEI 99

Query: 165 DPSKSRG 171
                +G
Sbjct: 100 KKQADKG 106


>gi|134099934|ref|YP_001105595.1| hypothetical protein SACE_3395 [Saccharopolyspora erythraea NRRL
           2338]
 gi|291008671|ref|ZP_06566644.1| hypothetical protein SeryN2_29463 [Saccharopolyspora erythraea NRRL
           2338]
 gi|133912557|emb|CAM02670.1| hypothetical protein SACE_3395 [Saccharopolyspora erythraea NRRL
           2338]
          Length = 117

 Score = 36.2 bits (82), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 23/82 (28%), Positives = 37/82 (45%), Gaps = 14/82 (17%)

Query: 64  DVSSPLPPDAAATEGADKVGFRGCKVCGKEELEKGCNGEGTIQGGIATVPGFGWWPIKAY 123
           D ++P P +A   E AD    +      ++EL     G   ++ G+   P  GW P+   
Sbjct: 6   DATTPSPDEAPTQEPADPANLQSAPDLDEDEL-----GVDPLESGVE--PAEGWSPVAEG 58

Query: 124 RPCPGFLASGGRYRRQGQTMDE 145
           RP P       R +R+G ++DE
Sbjct: 59  RPTP-------REQREGDSIDE 73


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.315    0.132    0.403 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,920,950,819
Number of Sequences: 23463169
Number of extensions: 126307565
Number of successful extensions: 266405
Number of sequences better than 100.0: 46
Number of HSP's better than 100.0 without gapping: 38
Number of HSP's successfully gapped in prelim test: 8
Number of HSP's that attempted gapping in prelim test: 266359
Number of HSP's gapped (non-prelim): 46
length of query: 177
length of database: 8,064,228,071
effective HSP length: 133
effective length of query: 44
effective length of database: 9,238,593,890
effective search space: 406498131160
effective search space used: 406498131160
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 71 (32.0 bits)