BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 029114
         (199 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|449457015|ref|XP_004146244.1| PREDICTED: uncharacterized protein LOC101221005 [Cucumis sativus]
 gi|449495512|ref|XP_004159863.1| PREDICTED: uncharacterized LOC101221005 [Cucumis sativus]
          Length = 282

 Score =  221 bits (563), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 128/205 (62%), Positives = 150/205 (73%), Gaps = 12/205 (5%)

Query: 1   MLQTHHLLSLNFP-FTVS--HHPQKLNFLQKPTISLSAFPR------RRPLIEPYCLAQA 51
           M  T +LLS NFP FT+S   +  KL FL  PT +LS+  R        PL    C    
Sbjct: 1   MWHTQNLLSSNFPLFTLSPPTYNHKL-FLSPPT-TLSSLHRPITFHSVSPLTNHRCFCLP 58

Query: 52  QEPTTNITAPTTSEEGPVELPQSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKR 111
           Q  T    A    + GPVELP +IFATTD+PSSLQVATSVLLTGAISVFLFR+LRRRAKR
Sbjct: 59  QF-TDLADATFLDDNGPVELPPTIFATTDDPSSLQVATSVLLTGAISVFLFRSLRRRAKR 117

Query: 112 AKELKFRSSGAKKSLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFT 171
            KELKFRS G KKSLK+EA+D+LKA+ +  I++K  PSP+QA LG + AGVIA+ILYKFT
Sbjct: 118 VKELKFRSGGVKKSLKEEAMDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFT 177

Query: 172 TTIEAALNRQTISDNFSVRQITITI 196
           TTIEA+LNRQT+SDNFSVRQ+TITI
Sbjct: 178 TTIEASLNRQTVSDNFSVRQLTITI 202


>gi|118488906|gb|ABK96262.1| unknown [Populus trichocarpa x Populus deltoides]
          Length = 238

 Score =  217 bits (552), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 125/205 (60%), Positives = 149/205 (72%), Gaps = 13/205 (6%)

Query: 2   LQTHHLLSLN---FPFT---VSHHPQKLNFLQKPTI----SLSAFPRRRPLIEPYCLAQA 51
           LQ H+   L+   FP +    +HH   L FL +P I    +LSA PR RP  +P  L  A
Sbjct: 4   LQCHNFHPLSPQIFPLSHHHTNHHLYPLKFLYRPPIFTPFTLSALPRSRP--DP-PLWLA 60

Query: 52  QEPTTNITAPTTSEEGPVELPQSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKR 111
           Q P              +    SIFAT+D+PSS+QVATSVLLTGAISVFLFR+LRRRAKR
Sbjct: 61  QLPEQAPEPEEEGPIEILRSSPSIFATSDDPSSIQVATSVLLTGAISVFLFRSLRRRAKR 120

Query: 112 AKELKFRSSGAKKSLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFT 171
           +KELKFRSSGAKK+LK+EALD+LK  GS+ ID K PPSPVQA LG ++AGVIA+ILYKFT
Sbjct: 121 SKELKFRSSGAKKTLKEEALDSLKTFGSAPIDVKKPPSPVQAFLGAISAGVIALILYKFT 180

Query: 172 TTIEAALNRQTISDNFSVRQITITI 196
           TTIEA+LNRQT+SDNFSVRQIT+T+
Sbjct: 181 TTIEASLNRQTVSDNFSVRQITVTV 205


>gi|225461564|ref|XP_002282834.1| PREDICTED: uncharacterized protein LOC100267434 [Vitis vinifera]
 gi|302142945|emb|CBI20240.3| unnamed protein product [Vitis vinifera]
          Length = 277

 Score =  209 bits (533), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 109/144 (75%), Positives = 124/144 (86%), Gaps = 7/144 (4%)

Query: 51  AQEPTTNITAPTTSEEGPVELP---QSIFATTDEPSSLQVATSVLLTGAISVFLFRALRR 107
           AQ P T  TAP   EEGP+ELP    SIFAT D+P+ LQVATSVLLTGAISVFLFR++RR
Sbjct: 53  AQVPDTTTTAP---EEGPIELPPSSSSIFATNDDPTPLQVATSVLLTGAISVFLFRSIRR 109

Query: 108 RAKRAKELKFRSSGAKKSLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIIL 167
           R KRAKEL+FRSSG KK+LK+EALD+LKA+GS S+ A  PPSPVQALLGG+TAGVIA+IL
Sbjct: 110 RVKRAKELRFRSSGVKKTLKEEALDSLKAMGSGSVKA-APPSPVQALLGGITAGVIALIL 168

Query: 168 YKFTTTIEAALNRQTISDNFSVRQ 191
           YKFT TIEA+LNRQT+SDNFSVRQ
Sbjct: 169 YKFTITIEASLNRQTVSDNFSVRQ 192


>gi|224116710|ref|XP_002317372.1| predicted protein [Populus trichocarpa]
 gi|222860437|gb|EEE97984.1| predicted protein [Populus trichocarpa]
          Length = 211

 Score =  206 bits (525), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 100/123 (81%), Positives = 116/123 (94%)

Query: 74  SIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKKSLKDEALDN 133
           SIFAT+D+PSS+QVATSVLLTGAISVFLFR+LRRRAKR+KELKFRSSGAKK+LK+EALD+
Sbjct: 16  SIFATSDDPSSIQVATSVLLTGAISVFLFRSLRRRAKRSKELKFRSSGAKKTLKEEALDS 75

Query: 134 LKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQIT 193
           LK  GS+ ID K PPSPVQA LG ++AGVIA+ILYKF+TTIEA+LNRQT+SDNFSVRQIT
Sbjct: 76  LKTFGSAPIDVKKPPSPVQAFLGAISAGVIALILYKFSTTIEASLNRQTVSDNFSVRQIT 135

Query: 194 ITI 196
           IT+
Sbjct: 136 ITV 138


>gi|356544064|ref|XP_003540475.1| PREDICTED: uncharacterized protein LOC100799393 [Glycine max]
          Length = 277

 Score =  203 bits (516), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 125/207 (60%), Positives = 148/207 (71%), Gaps = 32/207 (15%)

Query: 5   HHLLSLNFPFTVSHHPQK--LNFL----QKPTI----SLSAFPRRRPLIEPYCLAQAQEP 54
           HHLL   FP T++HH  K  L+ L     KPTI    S SA P       P+    A   
Sbjct: 8   HHLL---FPLTLAHHKVKPPLSILPPLHHKPTILHSVSASAIP-------PWAAQLADAA 57

Query: 55  TTNITAPTTSEEGPVELP----QSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAK 110
               T      EGPVELP     SIFAT+D+PS +QVA+SVLLTGA+SVFLFR+LRRRA+
Sbjct: 58  ADIDT------EGPVELPFSSTPSIFATSDDPSPIQVASSVLLTGAVSVFLFRSLRRRAQ 111

Query: 111 RAKELKFRSSGAKKSLKDEALDNLKALGSSSIDAK-GPPSPVQALLGGLTAGVIAIILYK 169
           R K+ +FRSSG +KS+K+EALD+LKA+GS+S+ AK    SPVQALLGG++AGVIA+ILYK
Sbjct: 112 RVKQSQFRSSG-EKSIKEEALDSLKAMGSASVKAKEDSASPVQALLGGISAGVIALILYK 170

Query: 170 FTTTIEAALNRQTISDNFSVRQITITI 196
           F TTIEAALNRQTISDNFSVRQITITI
Sbjct: 171 FATTIEAALNRQTISDNFSVRQITITI 197


>gi|255564541|ref|XP_002523266.1| conserved hypothetical protein [Ricinus communis]
 gi|223537479|gb|EEF39105.1| conserved hypothetical protein [Ricinus communis]
          Length = 285

 Score =  194 bits (493), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 125/207 (60%), Positives = 150/207 (72%), Gaps = 13/207 (6%)

Query: 1   MLQTHHLLS--LNFPFTVSHHPQK-----LNFLQKPTISLSAFPRRRPLIEPYCLAQAQE 53
           ML  HH +     FP    HH         +   KPT SLS F    P  +P     ++ 
Sbjct: 1   MLHCHHYIPSPQFFPRNHHHHTSTPYLSLTSSFSKPT-SLSTF-SALPPSKPDIWLASEP 58

Query: 54  PTTNITAPTTSEEGPVELP----QSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRA 109
                 +P+  E+GP+ELP     SIFATTD+PS +QVATSVLLTG+I VFLFRALRRRA
Sbjct: 59  TPVTPPSPSPEEDGPIELPFPSSPSIFATTDDPSPIQVATSVLLTGSIGVFLFRALRRRA 118

Query: 110 KRAKELKFRSSGAKKSLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYK 169
           KRAKELKFRSSGAKK++K+EALD+LKA+GS+SIDA+ PPSP+Q  LGG++A VIA+ILYK
Sbjct: 119 KRAKELKFRSSGAKKTMKEEALDSLKAMGSASIDAQKPPSPLQTFLGGVSAAVIALILYK 178

Query: 170 FTTTIEAALNRQTISDNFSVRQITITI 196
           FTTTIEAALNRQT+SDNFSVRQITIT+
Sbjct: 179 FTTTIEAALNRQTVSDNFSVRQITITV 205


>gi|21617902|gb|AAM66952.1| unknown [Arabidopsis thaliana]
          Length = 266

 Score =  189 bits (479), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 113/207 (54%), Positives = 143/207 (69%), Gaps = 22/207 (10%)

Query: 1   MLQTHHLLSLNFPFTVSHHPQKLNFLQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNITA 60
           +LQ+H  L  + P+ +   P +L     P  SLS+F R RP I    L+  +E       
Sbjct: 3   VLQSHQCL-FSLPYRL--RPTRL---ISPIHSLSSFTRIRPGI--IRLSAVKEIAD---V 51

Query: 61  PTTSEEGPVELPQS----------IFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAK 110
               E+GP+ELP S          IFAT+D+P+ LQ+ATSVLLTGAI+VFL R++RRRAK
Sbjct: 52  AEVEEDGPIELPTSSTSPFSSTNSIFATSDDPTPLQLATSVLLTGAITVFLIRSVRRRAK 111

Query: 111 RAKELKFRSSGAKKSLKDEALDNLKALGSSSID-AKGPPSPVQALLGGLTAGVIAIILYK 169
           RAKEL+FRS+GAKKSLK+EA+DNLKAL S+ I+     PS  QA LG + AGVIA+ILYK
Sbjct: 112 RAKELQFRSTGAKKSLKEEAMDNLKALSSTPIEGGNSTPSAAQAFLGAIAAGVIALILYK 171

Query: 170 FTTTIEAALNRQTISDNFSVRQITITI 196
           FT T+E+ LNRQTISDNFSVRQIT+T+
Sbjct: 172 FTVTVESGLNRQTISDNFSVRQITVTV 198


>gi|297834372|ref|XP_002885068.1| hypothetical protein ARALYDRAFT_478942 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297330908|gb|EFH61327.1| hypothetical protein ARALYDRAFT_478942 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 261

 Score =  189 bits (479), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 111/207 (53%), Positives = 142/207 (68%), Gaps = 22/207 (10%)

Query: 1   MLQTHHLLSLNFPFTVSHHPQKLNFLQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNITA 60
           +LQ+H        FT+ H  +    +  P  SLS+F R RP  +   ++  +E       
Sbjct: 3   VLQSHQC-----RFTLPHRLRPTRLIS-PIHSLSSFTRIRP--DRIRVSVVKEIAD---V 51

Query: 61  PTTSEEGPVELPQS----------IFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAK 110
               E+GP+ELP S          IFAT+D+P+ LQ+ATSVLLTGAI+VFL R++RRRAK
Sbjct: 52  AEVEEDGPIELPPSSTSPFSSTNSIFATSDDPTPLQLATSVLLTGAITVFLIRSVRRRAK 111

Query: 111 RAKELKFRSSGAKKSLKDEALDNLKALGSSSID-AKGPPSPVQALLGGLTAGVIAIILYK 169
           RAKEL+FRS+GAKKSLK+EA+DNLKALGS+ I+     PS  QA LG + AGVIA+ILYK
Sbjct: 112 RAKELRFRSTGAKKSLKEEAMDNLKALGSTPIEGGNSTPSAAQAFLGAIAAGVIALILYK 171

Query: 170 FTTTIEAALNRQTISDNFSVRQITITI 196
           FT T+E+ LNRQTISDNFSVRQIT+T+
Sbjct: 172 FTVTVESGLNRQTISDNFSVRQITVTV 198


>gi|18400636|ref|NP_566500.1| uncharacterized protein [Arabidopsis thaliana]
 gi|87116616|gb|ABD19672.1| At3g15110 [Arabidopsis thaliana]
 gi|332642098|gb|AEE75619.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 266

 Score =  188 bits (477), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 113/207 (54%), Positives = 142/207 (68%), Gaps = 22/207 (10%)

Query: 1   MLQTHHLLSLNFPFTVSHHPQKLNFLQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNITA 60
           +LQ+H  L  + P+ +   P +L     P  SLS+F R RP I    L+  +E       
Sbjct: 3   VLQSHQCL-FSLPYRL--RPTRL---ISPIHSLSSFTRIRPGI--IRLSAVKEIAD---V 51

Query: 61  PTTSEEGPVELPQS----------IFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAK 110
               E+GP+ELP S          IFAT+D+P+ LQ+ATSVLLTGAI+VFL R++RRRAK
Sbjct: 52  AEVEEDGPIELPTSSTSPFSSTNSIFATSDDPTPLQLATSVLLTGAITVFLIRSVRRRAK 111

Query: 111 RAKELKFRSSGAKKSLKDEALDNLKALGSSSID-AKGPPSPVQALLGGLTAGVIAIILYK 169
           RAKEL FRS+GAKKSLK+EA+DNLKAL S+ I+     PS  QA LG + AGVIA+ILYK
Sbjct: 112 RAKELTFRSTGAKKSLKEEAMDNLKALSSTPIEGGNSTPSAAQAFLGAIAAGVIALILYK 171

Query: 170 FTTTIEAALNRQTISDNFSVRQITITI 196
           FT T+E+ LNRQTISDNFSVRQIT+T+
Sbjct: 172 FTVTVESGLNRQTISDNFSVRQITVTV 198


>gi|388496044|gb|AFK36088.1| unknown [Medicago truncatula]
          Length = 230

 Score =  170 bits (431), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 85/136 (62%), Positives = 113/136 (83%), Gaps = 6/136 (4%)

Query: 66  EGPVELPQ----SIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSG 121
           +GP+E+P     S+ ++TD+PS +Q+A S+LLTGAISV LFR+ RRRAKR K+ +FRSSG
Sbjct: 57  DGPIEIPYDSTPSLLSSTDDPSFIQIAASLLLTGAISVLLFRSFRRRAKRLKQTQFRSSG 116

Query: 122 AKKSLKDEALDNLKALGSSSIDA-KGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNR 180
            +KS+K+EAL+ LKA+G++SI+  KGPPSPVQ  LG ++AGVI++ILYKF T IEA L+R
Sbjct: 117 -EKSVKEEALETLKAMGTASIETTKGPPSPVQTFLGAISAGVISLILYKFATIIEAGLSR 175

Query: 181 QTISDNFSVRQITITI 196
           QTISD+FS RQITIT+
Sbjct: 176 QTISDDFSARQITITV 191


>gi|217072908|gb|ACJ84814.1| unknown [Medicago truncatula]
          Length = 271

 Score =  169 bits (427), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 85/136 (62%), Positives = 112/136 (82%), Gaps = 6/136 (4%)

Query: 66  EGPVELPQ----SIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSG 121
           +GP+E+P     S+ ++TD+PS +Q+A S+LLTGAISV LFR+ RRRAKR K+ +FRSSG
Sbjct: 57  DGPIEIPYDSTPSLLSSTDDPSFIQIAASLLLTGAISVLLFRSFRRRAKRLKQTQFRSSG 116

Query: 122 AKKSLKDEALDNLKALGSSSIDA-KGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNR 180
            +KS+K+EAL+ LKA G++SI+  KGPPSPVQ  LG ++AGVI++ILYKF T IEA L+R
Sbjct: 117 -EKSVKEEALETLKATGTASIETTKGPPSPVQTFLGAISAGVISLILYKFATIIEAGLSR 175

Query: 181 QTISDNFSVRQITITI 196
           QTISD+FS RQITIT+
Sbjct: 176 QTISDDFSARQITITV 191


>gi|357144153|ref|XP_003573191.1| PREDICTED: uncharacterized protein LOC100830225 [Brachypodium
           distachyon]
          Length = 284

 Score =  153 bits (387), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 102/170 (60%), Positives = 124/170 (72%), Gaps = 8/170 (4%)

Query: 35  AFPRRRPLI-----EPYCLAQAQEPTTNITAPTTSEEGPVEL-PQSIFATTDEPSSLQVA 88
           A P RRP       EP     A    +  T     EEGPVEL   ++F+  D P+SLQVA
Sbjct: 28  ARPGRRPTALRAEPEPQSPPPASAAASPDTTDDGGEEGPVELRAPTLFSVDDNPTSLQVA 87

Query: 89  TSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKK--SLKDEALDNLKALGSSSIDAKG 146
           TSVLLTGAISVFLFR+LRRRA+RAKEL+ RS G +K  +L +EAL+ L+ + +S ++ + 
Sbjct: 88  TSVLLTGAISVFLFRSLRRRARRAKELRVRSGGLEKPKNLSEEALEALRMVSTSPVETEK 147

Query: 147 PPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQITITI 196
           PPSPVQALLGG+ AGVIA+ LYKF TTIEA+LNRQTISDNFSVRQIT+TI
Sbjct: 148 PPSPVQALLGGIAAGVIALFLYKFATTIEASLNRQTISDNFSVRQITVTI 197


>gi|356514641|ref|XP_003526013.1| PREDICTED: uncharacterized protein LOC100783357 [Glycine max]
          Length = 276

 Score =  152 bits (384), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 101/186 (54%), Positives = 123/186 (66%), Gaps = 19/186 (10%)

Query: 10  LNFPFTVSHHPQKL---NFLQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNITAPTTSEE 66
           L    T +HH   L   +FL +P I+ +        I P     A+         TT + 
Sbjct: 18  LRLSLTTTHHNSSLPSLHFLHRPIITTTTTTTTVSAIPPTAAWLAKL--------TTQDF 69

Query: 67  GPVELPQS-IFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKKS 125
              ELP S +F  TD+PS++QVA SVLLTGAI+VF FR+++RRAKRAKELKFRSSG  KS
Sbjct: 70  AHFELPSSHLF--TDDPSTIQVACSVLLTGAITVFFFRSVQRRAKRAKELKFRSSGVNKS 127

Query: 126 LKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISD 185
           L     DNLKA+ SSSI AK PPSP QALLG + AGVIA++LY+FTT+IEA L RQT+SD
Sbjct: 128 L-----DNLKAISSSSIKAKSPPSPDQALLGAIIAGVIAVVLYRFTTSIEATLGRQTLSD 182

Query: 186 NFSVRQ 191
           NFSVRQ
Sbjct: 183 NFSVRQ 188


>gi|115445727|ref|NP_001046643.1| Os02g0307800 [Oryza sativa Japonica Group]
 gi|46390708|dbj|BAD16208.1| unknown protein [Oryza sativa Japonica Group]
 gi|113536174|dbj|BAF08557.1| Os02g0307800 [Oryza sativa Japonica Group]
 gi|125581805|gb|EAZ22736.1| hypothetical protein OsJ_06408 [Oryza sativa Japonica Group]
 gi|215692736|dbj|BAG88156.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215694440|dbj|BAG89457.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 273

 Score =  149 bits (375), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 96/169 (56%), Positives = 117/169 (69%), Gaps = 9/169 (5%)

Query: 29  PTISLSAFPRRRPLI---EPYCLAQAQEPTTNITAPTTSEEGPVELPQ-SIFATTDEPSS 84
           P  S SA   RR +    EP     A EP           +GPVEL   ++F+  + P+ 
Sbjct: 26  PAASASALRLRRSVAVRAEPELSTSAAEPPPGDDG---EGDGPVELRTPTLFSIDENPTP 82

Query: 85  LQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKK--SLKDEALDNLKALGSSSI 142
           LQ ATSVLLTGAISVFLFR++RRR +RAKEL+ RS G +K  +L  EAL+ L+ + +S I
Sbjct: 83  LQTATSVLLTGAISVFLFRSIRRRVRRAKELRVRSGGVEKPNNLSKEALEGLRLVSASPI 142

Query: 143 DAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQ 191
           +   PPSPVQALLGG+ AGVIA+ILYKFTTTIEAALNRQTISD+FSVRQ
Sbjct: 143 EVDKPPSPVQALLGGIAAGVIALILYKFTTTIEAALNRQTISDSFSVRQ 191


>gi|388511081|gb|AFK43606.1| unknown [Lotus japonicus]
          Length = 251

 Score =  149 bits (375), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 94/198 (47%), Positives = 116/198 (58%), Gaps = 46/198 (23%)

Query: 9   SLNFPFTVSHHPQK---------LNFLQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNIT 59
           +L  P T++HH            L+FL +PTI     P     I P   +   E TT   
Sbjct: 15  NLPVPLTLTHHHSSFSHPPPSHLLHFLHRPTI----IPHTISAIPPTLSSWLTELTTG-- 68

Query: 60  APTTSEEGPVELPQSIFATTDEPSS-LQVATSVLLTGAISVFLFRALRRRAKRAKELKFR 118
                              TD+PSS +QV +++L+TGAI++F FR L+RRAKRAKELKFR
Sbjct: 69  -------------------TDDPSSTIQVTSTILVTGAITLFFFRTLQRRAKRAKELKFR 109

Query: 119 SSGAKKSLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAAL 178
           SSG  KSL            S+SI AK  PSP QALLG + AGVIA+ILY+FTT++EA L
Sbjct: 110 SSGVNKSL-----------ASTSIKAKKRPSPDQALLGAIIAGVIAVILYRFTTSVEATL 158

Query: 179 NRQTISDNFSVRQITITI 196
            RQTISDNFSVRQ+TITI
Sbjct: 159 YRQTISDNFSVRQVTITI 176


>gi|326511271|dbj|BAJ87649.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326513094|dbj|BAK06787.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326529293|dbj|BAK01040.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 281

 Score =  146 bits (369), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 88/136 (64%), Positives = 112/136 (82%), Gaps = 3/136 (2%)

Query: 64  SEEGPVEL-PQSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGA 122
           S +GPVE+   ++FA  D P+ LQVATSV+LTGAISVFLFR+LRRRA+RAKEL+ RS G 
Sbjct: 63  SGDGPVEIRAPTLFAVDDNPTPLQVATSVMLTGAISVFLFRSLRRRARRAKELRVRSGGV 122

Query: 123 KK--SLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNR 180
           +K  +L +EAL+ L+ + +S ++   PPSPVQALLGG+ AGVIA+ LYKF +T+EA+LNR
Sbjct: 123 EKPRNLSEEALEALRMVSTSPVETNKPPSPVQALLGGIAAGVIALFLYKFASTVEASLNR 182

Query: 181 QTISDNFSVRQITITI 196
           QTISDNFSVRQIT+TI
Sbjct: 183 QTISDNFSVRQITVTI 198


>gi|357497633|ref|XP_003619105.1| hypothetical protein MTR_6g042030 [Medicago truncatula]
 gi|355494120|gb|AES75323.1| hypothetical protein MTR_6g042030 [Medicago truncatula]
          Length = 254

 Score =  145 bits (367), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 79/123 (64%), Positives = 96/123 (78%), Gaps = 5/123 (4%)

Query: 75  IFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKKSLKDEALDNL 134
           IF   DE SS QV  SV+L+ AI+VFLF  ++RR KRAK+LK+RSSG KKS    +L++ 
Sbjct: 69  IFDAIDESSSNQVTISVVLSVAIAVFLFPTIQRRIKRAKQLKYRSSGVKKS----SLNST 124

Query: 135 KAL-GSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQIT 193
           K L GS+   +K PPSP +ALLG + AGVIA+ILY+FTTTIEA+L RQTISDNFSVRQIT
Sbjct: 125 KGLMGSTRKKSKNPPSPDKALLGAVMAGVIAVILYRFTTTIEASLYRQTISDNFSVRQIT 184

Query: 194 ITI 196
           ITI
Sbjct: 185 ITI 187


>gi|242065424|ref|XP_002454001.1| hypothetical protein SORBIDRAFT_04g022880 [Sorghum bicolor]
 gi|241933832|gb|EES06977.1| hypothetical protein SORBIDRAFT_04g022880 [Sorghum bicolor]
          Length = 293

 Score =  142 bits (359), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 88/130 (67%), Positives = 109/130 (83%), Gaps = 4/130 (3%)

Query: 66  EGPVEL-PQSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKK 124
           +GPVEL   ++F+T D P+ LQ ATS+LLTGAISVFLFR+LRRRA+RAKEL+ RSSG KK
Sbjct: 75  QGPVELRAPTLFSTDDNPTPLQTATSLLLTGAISVFLFRSLRRRARRAKELRVRSSGVKK 134

Query: 125 ---SLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQ 181
              +L +EAL+ L+ + +S I+ + PPSPVQ LLGG+ AGVIA+ILYKF+TTIEAALNRQ
Sbjct: 135 KPNNLTEEALEGLRLVSASPIETEKPPSPVQTLLGGIAAGVIAVILYKFSTTIEAALNRQ 194

Query: 182 TISDNFSVRQ 191
           TISD+FSVRQ
Sbjct: 195 TISDSFSVRQ 204


>gi|226494037|ref|NP_001143499.1| uncharacterized protein LOC100276178 [Zea mays]
 gi|195621548|gb|ACG32604.1| hypothetical protein [Zea mays]
 gi|224036011|gb|ACN37081.1| unknown [Zea mays]
 gi|413936044|gb|AFW70595.1| hypothetical protein ZEAMMB73_826587 [Zea mays]
          Length = 285

 Score =  137 bits (344), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 95/169 (56%), Positives = 125/169 (73%), Gaps = 7/169 (4%)

Query: 26  LQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNITAPTTSEEGPVEL-PQSIFATTDEPSS 84
           +++PT   +    R PL  P   A+++ P  +  A     +GPVEL   ++F+T D P+ 
Sbjct: 32  VRRPTAIRAEAEVRPPL--PVSAAESEPP--DAGAVEAEGQGPVELRAPTLFSTDDNPTP 87

Query: 85  LQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKK--SLKDEALDNLKALGSSSI 142
           LQ ATS+LLTGAISVFLFRALRRRA+RAKEL+ RSSG KK  +L +EAL+ L+ + +S I
Sbjct: 88  LQTATSLLLTGAISVFLFRALRRRARRAKELRVRSSGLKKPNNLTEEALERLRLMSASPI 147

Query: 143 DAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQ 191
           + +   SP+QALLGG+ AGVIA+ILYKF+TT+EAALNRQTISD+FSVRQ
Sbjct: 148 ETEKATSPIQALLGGIAAGVIALILYKFSTTVEAALNRQTISDSFSVRQ 196


>gi|218190566|gb|EEC72993.1| hypothetical protein OsI_06901 [Oryza sativa Indica Group]
          Length = 325

 Score =  133 bits (335), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 86/159 (54%), Positives = 106/159 (66%), Gaps = 33/159 (20%)

Query: 66  EGPVELPQ-SIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKEL--------- 115
           +GPVEL   ++F+  + P+ LQ ATSVLLTGAISVFLFR++RRRA+RAKEL         
Sbjct: 85  DGPVELRTPTLFSIDENPTPLQTATSVLLTGAISVFLFRSIRRRARRAKELVIPSLLIHL 144

Query: 116 ---------------------KFRSSGAKK--SLKDEALDNLKALGSSSIDAKGPPSPVQ 152
                                + RS G +K  +L  EAL+ L+ + +S I+   PPSPVQ
Sbjct: 145 RCAFSTGEHTWSTNPISFRDTEGRSGGVEKPNNLSKEALEGLRLVSASPIEVDKPPSPVQ 204

Query: 153 ALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQ 191
           ALLGG+ AGVIA+ILYKFTTTIEAALNRQTISD+FSVRQ
Sbjct: 205 ALLGGIAAGVIALILYKFTTTIEAALNRQTISDSFSVRQ 243


>gi|168035549|ref|XP_001770272.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162678489|gb|EDQ64947.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 277

 Score =  107 bits (268), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 59/134 (44%), Positives = 89/134 (66%), Gaps = 6/134 (4%)

Query: 66  EGPVELPQSIFA--TTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAK 123
           +GP+ELP  +       E S LQVA SV+LTG I+V L R+LRRR+K+AKE +FRS+G  
Sbjct: 88  DGPIELPPELLDPLAIPEASPLQVAASVVLTGLITVLLIRSLRRRSKKAKETRFRSTG-- 145

Query: 124 KSLKDEALDNLKALGSSSIDAKGPP-SPVQALLGGLTAGVIAIILYKFTTTIEAALNRQT 182
             +K++A  +  AL + + + + PP S +Q   G + AG IA++LYKFT T+E +   + 
Sbjct: 146 -EIKEDARKSAMALLNKAPEVETPPPSALQTFSGAVVAGFIALVLYKFTVTVEGSFTGKA 204

Query: 183 ISDNFSVRQITITI 196
           +S N+S+R +TIT+
Sbjct: 205 VSMNYSIRNLTITV 218


>gi|11994500|dbj|BAB02565.1| unnamed protein product [Arabidopsis thaliana]
          Length = 135

 Score = 91.3 bits (225), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 44/67 (65%), Positives = 53/67 (79%), Gaps = 1/67 (1%)

Query: 131 LDNLKALGSSSIDAKGP-PSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSV 189
           +DNLKAL S+ I+     PS  QA LG + AGVIA+ILYKFT T+E+ LNRQTISDNFSV
Sbjct: 1   MDNLKALSSTPIEGGNSTPSAAQAFLGAIAAGVIALILYKFTVTVESGLNRQTISDNFSV 60

Query: 190 RQITITI 196
           RQIT+T+
Sbjct: 61  RQITVTV 67


>gi|302785097|ref|XP_002974320.1| hypothetical protein SELMODRAFT_442467 [Selaginella moellendorffii]
 gi|300157918|gb|EFJ24542.1| hypothetical protein SELMODRAFT_442467 [Selaginella moellendorffii]
          Length = 1433

 Score = 83.2 bits (204), Expect = 6e-14,   Method: Composition-based stats.
 Identities = 59/147 (40%), Positives = 86/147 (58%), Gaps = 13/147 (8%)

Query: 51   AQEPTTNITAPTTSEEGPVELPQSIFATTDEPSS-LQVATSVLLTGAISVFLFRALRRRA 109
            AQ     I   ++S +GP+ELP  +    ++P+S LQ+A SV+LTGAISV L R+LRRR 
Sbjct: 1244 AQVAEQAIADASSSLDGPIELPGDL----EQPTSPLQIAGSVVLTGAISVLLLRSLRRRY 1299

Query: 110  KRAKELKFRSSGAKKSLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYK 169
            K AKE        KK LK++A  N+ A  ++ +  K PPS  Q L G L AG IA +LY+
Sbjct: 1300 KEAKE--------KKPLKEQAKQNVLASLANQVVEKPPPSFEQTLYGALLAGAIAFVLYQ 1351

Query: 170  FTTTIEAALNRQTISDNFSVRQITITI 196
             TT +  +   + +   ++ R I +T+
Sbjct: 1352 CTTAVTGSFASKPLPTEYATRNIAVTV 1378


>gi|159491108|ref|XP_001703515.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158280439|gb|EDP06197.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 206

 Score = 53.1 bits (126), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 32/101 (31%), Positives = 55/101 (54%), Gaps = 7/101 (6%)

Query: 97  ISVFLFRALRRRAKRAKELKFRSSGAKK-SLKDEALDNLKALGSSSIDAKGPPSPVQALL 155
           + +FLFR  R+RA +A+  K     A   S+ DE  + L A        +G  +P+ A L
Sbjct: 89  VGIFLFRLFRKRAAKARSEKIAGQLATGPSIFDELREKLGA------PKRGKATPMNAFL 142

Query: 156 GGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQITITI 196
           G + A  +A  L+ F+T ++A++  Q++ D ++ R I IT+
Sbjct: 143 GAVQALGLAAGLWFFSTKVQASIEAQSLPDGYTARNIAITV 183


>gi|145346872|ref|XP_001417906.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144578134|gb|ABO96199.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 181

 Score = 52.0 bits (123), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 34/118 (28%), Positives = 65/118 (55%), Gaps = 8/118 (6%)

Query: 79  TDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKKSLKDEALDNLKALG 138
           T+E + LQ    VL TG    +  RA+++R+ +A+E  FR +      +   +D  +A  
Sbjct: 9   TEEATPLQNVFGVLFTGFSFWYFARAVQKRSGKARE--FRVANRLPEEERAKIDEERAKK 66

Query: 139 SSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQITITI 196
           +  + A      +Q+  GGLT   I+++LY F + + A+ + + + ++ +VRQI+IT+
Sbjct: 67  TKELTA------MQSFTGGLTGIGISVVLYAFASKVSASFDGKALPESETVRQISITV 118


>gi|384247605|gb|EIE21091.1| hypothetical protein COCSUDRAFT_33788 [Coccomyxa subellipsoidea
           C-169]
          Length = 232

 Score = 50.1 bits (118), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 39/140 (27%), Positives = 65/140 (46%), Gaps = 22/140 (15%)

Query: 60  APTTSEEGPVELPQSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRS 119
            P  +E+G   L     A  D    +  A  V+L   ++VF  R L +RAK A E +  S
Sbjct: 53  GPAIAEDG---LKYEPGAGADVVKQVAGAAYVVL---LAVFAVRLLTKRAKFATEERLAS 106

Query: 120 S--GAKKSLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAA 177
           +  G  +   D ++  +              +P+Q+L   + AGV A +LY F++ ++  
Sbjct: 107 AAKGENERPPDPSIKRV--------------TPLQSLWAAVQAGVFAYLLYLFSSNVDGY 152

Query: 178 LNRQTISDNFSVRQITITIS 197
             RQ + D ++ R IT+ IS
Sbjct: 153 FERQPLPDQYTARNITVAIS 172


>gi|302848926|ref|XP_002955994.1| hypothetical protein VOLCADRAFT_121472 [Volvox carteri f.
           nagariensis]
 gi|300258720|gb|EFJ42954.1| hypothetical protein VOLCADRAFT_121472 [Volvox carteri f.
           nagariensis]
          Length = 325

 Score = 48.1 bits (113), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 50/101 (49%), Gaps = 7/101 (6%)

Query: 97  ISVFLFRALRRRAKRAKELKFRSSG-AKKSLKDEALDNLKALGSSSIDAKGPPSPVQALL 155
           + +FLFR  R+RA +AK  +      +  +  DE    + A   S        +PV A +
Sbjct: 111 VGIFLFRLFRKRAAKAKTERIAGQAPSGPTFFDELRQKVGAPKQSKA------TPVNAFI 164

Query: 156 GGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQITITI 196
           G   A V+A  L+ FTT ++ ++  Q + D ++ R I IT+
Sbjct: 165 GCGQALVLAYGLWFFTTKVQGSIEGQALPDGYTARNIAITV 205


>gi|303283914|ref|XP_003061248.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226457599|gb|EEH54898.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 332

 Score = 45.4 bits (106), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 33/124 (26%), Positives = 56/124 (45%), Gaps = 13/124 (10%)

Query: 80  DEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSS-------GAKKSLKDEALD 132
           DE + LQ A     T   + +  R +++R  RAKE +  ++       G     + E  D
Sbjct: 145 DEATPLQNAFGFAFTIFCAWYFVRVVKKRGNRAKEFRVANALPVRVRAGRPSKEERERRD 204

Query: 133 NLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQI 192
             +   S  + AK       A +GG+T   I+  LY+F   I  +   + + +++  RQI
Sbjct: 205 AEQLAKSKKLSAK------DAFVGGVTGLAISFALYQFAVVIGESFEGRLMPESYQARQI 258

Query: 193 TITI 196
           TIT+
Sbjct: 259 TITV 262


>gi|255071307|ref|XP_002507735.1| predicted protein [Micromonas sp. RCC299]
 gi|226523010|gb|ACO68993.1| predicted protein [Micromonas sp. RCC299]
          Length = 292

 Score = 44.3 bits (103), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 32/116 (27%), Positives = 65/116 (56%), Gaps = 8/116 (6%)

Query: 81  EPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKKSLKDEALDNLKALGSS 140
           E + LQ A  +  T   + +  R +R+R   AK+ +  +S +K+  +   L+ LK +   
Sbjct: 121 EATPLQNAFGLTFTIFSAWYFLRVVRKRGNAAKKFRVANSLSKEERERRDLEELKRV--K 178

Query: 141 SIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQITITI 196
            ++A+      Q+ +GG+TA  I+I+LY+F   I+A+ + + +  ++ +RQI+ T+
Sbjct: 179 KLNAQ------QSFVGGITAIGISIVLYEFAQNIQASFDSRPVPVSYQIRQISNTV 228


>gi|412988915|emb|CCO15506.1| predicted protein [Bathycoccus prasinos]
          Length = 203

 Score = 42.7 bits (99), Expect = 0.078,   Method: Compositional matrix adjust.
 Identities = 30/118 (25%), Positives = 57/118 (48%), Gaps = 8/118 (6%)

Query: 79  TDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAKKSLKDEALDNLKALG 138
           T+E + LQ     L T     + +R +++R KRA+   FR +      + +  +  + L 
Sbjct: 73  TEEATPLQNLFGFLFTAFCGWYFWRVVKKRGKRAQ--TFRVANQLSPEELKEREEERKLK 130

Query: 139 SSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISDNFSVRQITITI 196
              ++A       QA  GG+T   I+  LY F + I    +  ++ ++++ RQI+IT+
Sbjct: 131 DKKLNAS------QAFTGGITGIAISAGLYIFASKISTGFDNSSLPESYTARQISITV 182


>gi|307106044|gb|EFN54291.1| expressed protein [Chlorella variabilis]
          Length = 223

 Score = 39.7 bits (91), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 30/114 (26%), Positives = 51/114 (44%), Gaps = 10/114 (8%)

Query: 79  TDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRS-------SGAKKSLKDEAL 131
           T    +L+    V   G +  F  R  +RRA++A   +  S        G   S  +E  
Sbjct: 8   TGGSDTLKTVAGVAYIGVVIFFFLRLFQRRAQKATSERIASVADVAAYGGGADSDDEEDE 67

Query: 132 DNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTISD 185
           +  KA       A    +P+Q+ +G   AGVI  +L++ +T ++A  +RQ + D
Sbjct: 68  EEQKA---KQQQAATEVTPLQSFIGTAQAGVICYLLFQASTAVDAYFDRQNLPD 118


>gi|291567877|dbj|BAI90149.1| hypothetical protein [Arthrospira platensis NIES-39]
          Length = 133

 Score = 38.1 bits (87), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 21/49 (42%), Positives = 27/49 (55%), Gaps = 1/49 (2%)

Query: 149 SPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITI 196
           +P Q L+GGL AG  AI +Y  T  I      Q I SDN +  Q++I I
Sbjct: 42  NPFQVLMGGLLAGGFAIAMYSLTAAIAQTFANQNIYSDNQTALQLSIAI 90


>gi|409994073|ref|ZP_11277194.1| hypothetical protein APPUASWS_23188 [Arthrospira platensis str.
           Paraca]
 gi|409935065|gb|EKN76608.1| hypothetical protein APPUASWS_23188 [Arthrospira platensis str.
           Paraca]
          Length = 122

 Score = 38.1 bits (87), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 21/49 (42%), Positives = 27/49 (55%), Gaps = 1/49 (2%)

Query: 149 SPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITI 196
           +P Q L+GGL AG  AI +Y  T  I      Q I SDN +  Q++I I
Sbjct: 31  NPFQVLMGGLLAGGFAIAMYSLTAAIAQTFANQNIYSDNQTALQLSIAI 79


>gi|394988291|ref|ZP_10381129.1| hypothetical protein SCD_00693 [Sulfuricella denitrificans skB26]
 gi|393792749|dbj|GAB70768.1| hypothetical protein SCD_00693 [Sulfuricella denitrificans skB26]
          Length = 196

 Score = 38.1 bits (87), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 31/117 (26%), Positives = 56/117 (47%), Gaps = 7/117 (5%)

Query: 64  SEEGPVELPQSIFATTDEPSSLQVATSVLLTGAISVFLFRALRRRAKRAKELKFRSSGAK 123
           ++E  +E+ + IFA  D    L++A+    TG  +V+  RAL     +  EL    +G +
Sbjct: 11  NQEPILEVLKEIFA--DRQRVLEIASG---TGQHAVYFGRALPHLTWQTSELAQNHAGIQ 65

Query: 124 KSLKDEALDNLKALGSSSIDAKGPPSPVQALLGGLTAGVIAIILYKFTTTIEAALNR 180
             L D  L N+  L   +ID   P  P++++     A  + I+ ++   T+ A + R
Sbjct: 66  AWLDDSLLPNV--LPPVAIDVSAPKWPIESVDAVFNANTVHIVAWQEVETMFAGIAR 120


>gi|209526258|ref|ZP_03274788.1| hypothetical protein AmaxDRAFT_3612 [Arthrospira maxima CS-328]
 gi|376007925|ref|ZP_09785107.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|423062823|ref|ZP_17051613.1| hypothetical protein SPLC1_S051400 [Arthrospira platensis C1]
 gi|209493355|gb|EDZ93680.1| hypothetical protein AmaxDRAFT_3612 [Arthrospira maxima CS-328]
 gi|375323718|emb|CCE20860.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|406715779|gb|EKD10932.1| hypothetical protein SPLC1_S051400 [Arthrospira platensis C1]
          Length = 122

 Score = 37.4 bits (85), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 21/49 (42%), Positives = 27/49 (55%), Gaps = 1/49 (2%)

Query: 149 SPVQALLGGLTAGVIAIILYKFTTTIEAALNRQTI-SDNFSVRQITITI 196
           +P Q L+GGL +G  AI +Y  T  I      Q I SDN +  QI+I I
Sbjct: 31  NPFQVLMGGLFSGGFAIAMYSLTAAIAHTFANQNIYSDNRTALQISIAI 79


>gi|342318945|gb|EGU10900.1| Proteophosphoglycan ppg4 [Rhodotorula glutinis ATCC 204091]
          Length = 1616

 Score = 36.2 bits (82), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 49/98 (50%), Gaps = 18/98 (18%)

Query: 17  SHHPQKLNFLQKPTISLSAFPRRRPLIEPYCLAQAQEPTTNITAPTTSEEGPVELPQSIF 76
           + HP K+   + P +S S F RRRP   P+C          IT+PT S         S+ 
Sbjct: 224 TRHPPKV---RPPRLSPSLFARRRPSFIPHCRC--------ITSPTPSHSHRTTPCNSLL 272

Query: 77  ATTDEPSSLQVATSVLLTGAISVFL-FRALRRRAKRAK 113
                P  L++A  VL+ GA++V L  + LRRR +RA+
Sbjct: 273 -----PMLLRLAV-VLVYGAVAVSLPLQPLRRRVRRAE 304


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.317    0.131    0.363 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,694,788,118
Number of Sequences: 23463169
Number of extensions: 99127339
Number of successful extensions: 350213
Number of sequences better than 100.0: 94
Number of HSP's better than 100.0 without gapping: 32
Number of HSP's successfully gapped in prelim test: 62
Number of HSP's that attempted gapping in prelim test: 350134
Number of HSP's gapped (non-prelim): 108
length of query: 199
length of database: 8,064,228,071
effective HSP length: 135
effective length of query: 64
effective length of database: 9,191,667,552
effective search space: 588266723328
effective search space used: 588266723328
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 73 (32.7 bits)